Open source data anonymization and synthetic data orchestration for developers. Create high fidelity synthetic data and sync it across your environments.
-
Updated
Jun 30, 2024 - Go
Open source data anonymization and synthetic data orchestration for developers. Create high fidelity synthetic data and sync it across your environments.
Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing"
Generate ideal question-answers for testing RAG
PostgreSQL database anonymization tool
UniGen: A Unified Framework for Dataset Generation via Large Language Model
Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers of AI agents and LLMs.
Synthetic data generation for tabular data
Benchmarking synthetic data generation methods.
⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.
Conditional GAN for generating synthetic tabular data.
Repositorio con el código de los experimentos de mi TFM titulado "Transformación de Datos Tabulares a Imágenes Sintéticas: Optimización y Evaluación de la Librería TINTOlib en Python"
Mimesis is a robust data generator for Python that can produce a wide range of fake data in multiple languages.
Metrics to evaluate quality and efficacy of synthetic datasets.
Synthetic data generators for tabular and time-series data
Synthetic data generators for structured and unstructured text, featuring differentially private learning.
A python client used to interact with the Private AI's API
Synthetic Patient Population Simulator
a curated list of data for reasoning ai
A library to model multivariate data using copulas.
A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.
Add a description, image, and links to the synthetic-data topic page so that developers can more easily learn about it.
To associate your repository with the synthetic-data topic, visit your repo's landing page and select "manage topics."