argilla-io/argilla
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Low-code framework for building custom LLMs, neural networks, and other AI models
Synthetic data curation for post-training and structured data extraction
🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools
1 capture since 2026-05-25