Auto-convert multimodal data into ML-ready datasets

1 leandrenash 2 7/22/2025, 2:45:51 PM github.com

Comments (2)

leandrenash · 7h ago
Unimodaly Ingest is the world’s first truly unified data-ingestion CLI for machine learning. It automatically detects text, image, audio and tabular files, then validates, samples and augments them into a single, schema-validated dataset ready for training. With built-in metadata extraction and support for JSON/JSONL/CSV exports, you’ll cut your dataset-prep time from hours to minutes. Open-source, cross-platform and extensible, it’s ideal for data engineers, researchers and AI startups everywhere.
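For readers wondering what the detect-then-export flow described above might look like, here is a minimal Python sketch. It is not taken from the Unimodaly Ingest source and does not reflect its actual CLI or API: the function names `detect_modality`, `build_records` and `to_jsonl`, the record fields, and the list of tabular suffixes are all assumptions made for illustration. It guesses a modality from the file name only and emits one JSON object per line (JSONL); validation, sampling, augmentation and metadata extraction are left out.

```python
import json
import mimetypes
from pathlib import Path

# Hypothetical suffix list, chosen for illustration only.
TABULAR_SUFFIXES = {".csv", ".tsv", ".parquet"}


def detect_modality(path):
    """Guess a coarse modality (text, image, audio, tabular) from the file name."""
    suffix = Path(path).suffix.lower()
    # Check tabular suffixes first: .csv would otherwise be reported as text.
    if suffix in TABULAR_SUFFIXES:
        return "tabular"
    mime, _ = mimetypes.guess_type(str(path))
    if mime is None:
        return "unknown"
    major = mime.split("/", 1)[0]
    return major if major in {"text", "image", "audio"} else "unknown"


def build_records(paths):
    """Build one flat record per input path, using the same keys for every record."""
    return [{"path": str(p), "modality": detect_modality(p)} for p in paths]


def to_jsonl(records):
    """Serialize records as JSONL: one JSON object per line."""
    return "\n".join(json.dumps(r, sort_keys=True) for r in records)


if __name__ == "__main__":
    files = ["notes.txt", "cat.png", "clip.wav", "table.csv"]
    print(to_jsonl(build_records(files)))
```

A real tool would presumably inspect file contents rather than trust extensions, since a misnamed file would be misclassified by this sketch.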