twitter/scalding
A Scala API for Cascading
Sparrow scheduling platform (U.C. Berkeley).
A Scala API for Cascading
Streaming MapReduce with Scalding and Storm
PySpark + Scikit-learn = Sparkit-learn
Apache Spark - A unified analytics engine for large-scale data processing
Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.
Secor is a service implementing Kafka log persistence
2 captures since 2026-05-24