Sign in
← Back to search

argilla-io/distilabel

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Stars
3,229
Forks
242
Commits
846
Language
Python
Awesome lists
1

Similar repositories

argilla-io/argilla

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

4985 stars
Python 1 awesome list

cleanlab/cleanlab

Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

11480 stars
Python 3 awesome lists

HumanSignal/label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

27479 stars
TypeScript 1 awesome list

ludwig-ai/ludwig

Low-code framework for building custom LLMs, neural networks, and other AI models

11705 stars
Python 2 awesome lists

bespokelabsai/curator

Synthetic data curation for post-training and structured data extraction

1678 stars
Python 1 awesome list

huggingface/datasets

🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

21537 stars
Python 1 awesome list

Tracked growth

1 capture since 2026-05-25

Latest capture 2026-05-25 20:51

Stars history

Total stars

Commits history

Default branch commits

Metadata

  • Created: 2023-10-16
  • First commit: —
  • Last pushed: 2026-05-18
  • Website: https://distilabel.argilla.io
  • Archived: no
  • Stack detected: —
  • License: Apache-2.0

AI development signals

No AI development config files detected.