apache/hudi
Upserts, Deletes And Incremental Processing on Big Data.
Apache Spark - A unified analytics engine for large-scale data processing
Upserts, Deletes And Incremental Processing on Big Data.
Simple and Distributed Machine Learning
PySpark + Scikit-learn = Sparkit-learn
Apache Maven core
Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster
Apache Flink
5 captures since 2026-05-22
AI agent config detected
Key config paths
AGENTS.md
CLAUDE.md