twitter/scalding
A Scala API for Cascading
Schedoscope is a scheduling framework for painfree agile development, testing, (re)loading, and monitoring of your datahub, lake, or whatever you choose to call your Hadoop data warehouse these days.
A Scala API for Cascading
KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale
Apache Druid: a high performance real-time analytics database.
Collect, aggregate, and visualize a data ecosystem's metadata
A free open source IT asset/license management system
Apache Spark - A unified analytics engine for large-scale data processing
2 captures since 2026-05-24