Skip to content

Azure-Samples/aihlsignited-medindexer

🤖 MedIndexer: Turning Unstructured Clinical Data into Value

This project is part of the HLS Ignited Program, which focuses on hands-on accelerators with the goal of democratizing how AI is transforming the healthcare industry. In this project, we accelerate healthcare providers and payer organizations in building AI-driven clinical knowledge bases using Azure AI Search. Visit the program page to explore other accelerators.

MedIndexer is an indexing framework designed for the automated creation of structured knowledge bases from unstructured clinical sources. It enables the transformation of images (X-rays), clinical text, and other unstructured data into a schema-driven, searchable format, allowing your applications to leverage state-of-the-art retrieval methodologies, including vector search and re-ranking, powered by Azure AI Search. By applying a well-defined schema and vectorizing the data into high-dimensional representations, MedIndexer empowers AI applications to retrieve more precise and context-aware information.

🔍 Turning Your Unstructured Data into Value

"About 80% of medical data remains unstructured and untapped after it is created (e.g., text, image, signal, etc.)"
Healthcare Informatics Research, Chungnam National University

In the era of AI, the rise of AI copilots and assistants has led to a shift in how we access knowledge. But retrieving clinical data that lives in disparate formats is no trivial task. Building retrieval systems takes effort—and how you structure your knowledge store matters. It’s a cyclic, iterative, and constantly evolving process. That’s why we believe in leveraging enterprise-ready retrieval platforms like Azure AI Search—designed to power intelligent search experiences across structured and unstructured data. It serves as the foundation for building advanced retrieval systems in healthcare.


Solution Diagram

However, implementing Azure AI Search alone is not enough. Mastering its capabilities and applying well-defined patterns can significantly enhance your ability to address repetitive tasks and complex retrieval scenarios. This project aims to accelerate your ability to transform raw clinical data into high-fidelity, high-value knowledge structures that can power your next-generation healthcare applications.

🚀 How to Get Started

If you're new to Azure AI Search, start with the step-by-step labs to build a solid foundation in the technology. Experienced AI engineers can jump straight to the use case sections, which showcase how to create Coded Policy Knowledge Stores and X-ray Knowledge Stores for real-world applications.

  • 📝 Creating Coded Policy Knowledge Stores: 🧾 Notebook - Creating Coded Policies Knowledge Stores Transform payer policies into machine-readable formats. This use case includes:

    • Preprocessing and cleaning PDF documents
    • Building custom OCR skills
    • Leveraging out-of-the-box Indexer capabilities and embedding skills
    • Enabling real-time AI-assisted querying for ICDs, payer names, drug names, and policy logic

    Why it matters:
    Streamlines prior authorization and coding workflows for providers and payors, reducing manual effort and increasing transparency.

  • 🩻 Creating X-ray Knowledge Stores: 🧾 Notebook - Creating X-rays Knowledge Stores Turn imaging reports and metadata into a searchable knowledge base. This includes:

    • Leveraging push APIs with custom event-driven indexing pipeline triggered on new X-ray uploads
    • Generating embeddings using Microsoft Healthcare foundation models
    • Providing an AI-powered front-end for X-ray similarity search

    Why it matters:
    Supports clinical decision-making by retrieving similar past cases, aiding diagnosis and treatment planning with contextual relevance.

📚 More Resources


Important

This software is provided for demonstration purposes only. It is not intended to be relied upon for any production workload. The creators of this software make no representations or warranties of any kind, express or implied, about the completeness, accuracy, reliability, suitability, or availability of the software or related content. Any reliance placed on such information is strictly at your own risk.

About

Indexing framework designed for the automated creation of structured knowledge bases in Azure AI Search

Resources

License

Code of conduct

Contributing

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •