microsoft/SynapseML
Simple and Distributed Machine Learning
Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster
Simple and Distributed Machine Learning
Apache Spark - A unified analytics engine for large-scale data processing
TiDB is built for agentic workloads that grow unpredictably, with ACID guarantees and native support for transactions, analytics, and vector search. No data silos. No noisy neighbors. No infrastructure ceiling.
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Internet Archive's Sparkling Data Processing Library
ArcadeDB Multi-Model Database, one DBMS that supports SQL, Cypher, Gremlin, HTTP/JSON, MongoDB and Redis. ArcadeDB is a conceptual fork of OrientDB, the first Multi-Model DBMS. ArcadeDB supports Vector Embeddings.
10 captures since 2026-05-24
build.gradle
· java · 28 dependencies
settings.gradle
· java · 0 dependencies
cluster/build.gradle
· java · 17 dependencies
compatibilityTests/build.gradle
· java · 1 dependencies
connector/build.gradle
· java · 0 dependencies
core/build.gradle
· java · 11 dependencies
docs/requirements.txt
· python · 1 dependencies
dtests/build.gradle
· java · 5 dependencies
dunit/build.gradle
· java · 2 dependencies
encoders/build.gradle
· java · 3 dependencies