datajuicer/data-juicer
Data processing for and with foundation models!
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models.
Synthetic data curation for post-training and structured data extraction
Deeplake is an AI data runtime for agents. It provides serverless Postgres with a multimodal data lake, enabling scalable retrieval and training.
AdalFlow: The library to build & auto-optimize LLM applications.
DeepAnalyze is the first agentic LLM for autonomous data science. Your AI data analyst: it automatically analyzes large volumes of data and generates professional analysis reports with one click!
data-to-paper: Backward-traceable AI-driven scientific research