🦉 Data Versioning and ML Experiments
-
Updated
Mar 25, 2025 - Python
🦉 Data Versioning and ML Experiments
Refine high-quality datasets and visual AI models
No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
Neo4j graph construction from unstructured data using LLMs
🔮 Instill Core is a full-stack AI infrastructure tool for data, model and pipeline orchestration, designed to streamline every aspect of building versatile AI-first applications
Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.
Interact, analyze and structure massive text, image, embedding, audio and video datasets
A curated list of resources for Document Understanding (DU) topic
A multi-modal vector database that supports upserts and vector queries using unified SQL (MySQL-Compatible) on structured and unstructured data, while meeting the requirements of high concurrency and ultra-low latency.
Interactively explore unstructured datasets from your dataframe.
LOTUS: A semantic query engine for fast and easy LLM-powered data processing
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
Visual Data Transformation and Data Preparation. Low-Code Python-based ETL.
Curate better data for LLMs
Enterprise-grade and API-first LLM workspace for unstructured documents, including data extraction, redaction, rights management, prompt playground, and more!
NucliaDB, The AI Search database for RAG
Embedding Studio is a framework which allows you transform your Vector Database into a feature-rich Search Engine.
python implementation of jordansissel's grok regular expression library
Open-source unstructured data (PDFs, Images, Audiofiles) processing platform built for knowledge workers
Add a description, image, and links to the unstructured-data topic page so that developers can more easily learn about it.
To associate your repository with the unstructured-data topic, visit your repo's landing page and select "manage topics."