Sign in

Awesome List

Awesome Bigdata

A curated list of awesome big data frameworks, ressources and other awesomeness.

oxnr/awesome-bigdata #awesome#awesome-list#bigdata#data#data-analytics#data-science#data-stream#data-visualization#data-warehouse#database#distributed-database#series-database#stream-processing#streaming-data#visualize-data 404 Not Found | https://api.github.com/repos/deeplearning4j/rl4j | message=Not Found | rate_limit_remaining=3323 | rate_limit_reset=1780400982
List stars
14,417
README repos
211
Indexed repos
183
List commits
592
Forks
2,585
Open issues
3

Tracked list growth

GitHub stars and default-branch commits for oxnr/awesome-bigdata.

Latest scan 2026-06-02 10:49

Likes history

GitHub stars

Commits history

Default branch commits

Indexed repositories

183 repos currently saved from this list.

No filters applied
Latest repo push 2026-06-02

Age filters use known first-commit dates and exclude repositories that have not synced that data yet.

Reset
Highlighted

Open highlighted repo slot

Put your repository first

Promote a GitHub repo at the top of Awesome repository list views for 7 days.

tidwall/buntdb

BuntDB is an embeddable, in-memory key/value database for Go with custom indexing and geospatial support

Go #database#geospatial#golang#in-memory pushed 2026-05-19 174 commits first commit 2016-07-19 3 list mentions
apache/linkis

Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.

Java Spring BootViteVue Mavennpm #application-manager#context-service#engine#hive pushed 2026-06-01 4,310 commits first commit 2019-07-23 1 list mention
apache/gobblin

A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.

Java #apache#data#ingestion#management pushed 2026-06-01 6,552 commits first commit 2014-02-04 1 list mention
gchq/Gaffer

A large-scale entity and relation database supporting aggregation of properties

Java #accumulo#aggregation#big-data#graph pushed 2025-06-06 7,332 commits first commit 2015-12-14 1 list mention archived
bruin-data/bruin

Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.

Go CobraFlaskgRPC Go CargoGo modules #analytics#bigquery#data-analysis#data-ingestion pushed 2026-06-02 7,289 commits first commit 2023-07-25 1 list mention AI dev signals