github Active

Repository profile

apache/hudi

Upserts, Deletes And Incremental Processing on Big Data.

Java · Apache-2.0 · master · Stack scanned · README.md

Stars: 6,186
Forks: 2,495
Watchers: 1,127
Issues: 3,041
Commits: 7,605
Awesome lists: 1

Activity and growth

Tracked growth, recent movement, and commit velocity from stored repository snapshots.

Latest capture 2026-07-15 03:04

Star growth, last 7 days: 0 (0.0%)
Commit velocity, last 7 days: 0 (0.0%)
Stars since baseline: +21
Snapshot coverage: 5

Tracked growth

5 captures since 2026-05-25

Stars from baseline +21

Charts (all tracked data): Stars history (total stars); Commits history (default branch commits).

Detected stack

Frameworks, package managers, ecosystems, and dependency manifests found during catalog scans.

Scanned 2026-07-15 03:04

Stack signals: 2
Package managers: 2
Manifest files: 80
Dependencies: 2,595

Frameworks and tools

Jupyter notebook · high confidence
Spring Boot web framework · high confidence

Package managers: Maven, pip
Ecosystems: java, python

Dependency files

80 manifests

pom.xml java ecosystem, 241 dependencies
hudi-aws/pom.xml java ecosystem, 54 dependencies
hudi-azure/pom.xml java ecosystem, 36 dependencies
hudi-cli/pom.xml java ecosystem, 50 dependencies
hudi-client/pom.xml java ecosystem, 0 dependencies
hudi-common/pom.xml java ecosystem, 70 dependencies
hudi-examples/pom.xml java ecosystem, 0 dependencies
hudi-flink-datasource/pom.xml java ecosystem, 0 dependencies
72 more files

Classification

Searchable topics, generated tags, and stack labels that explain where this repository fits.

Topics: 9
Tags: 0
Stacks: 2

Topics

#apacheflink #apachehudi #apachespark #bigdata #data-integration #datalake #hudi #incremental-processing #stream-processing

Generated tags

No generated tags yet.

Stack labels

Jupyter · Spring Boot

AI development signals

Agent instructions and tool configuration paths found in the repository tree.

0 paths

No AI development config files detected.

Similar repositories

Nearest indexed repositories by embedding similarity.

apache/spark

Apache Spark - A unified analytics engine for large-scale data processing

43,614 stars

Scala 4 awesome lists

apache/flink

Apache Flink

26,176 stars

Java 1 awesome list

TIBCOSoftware/snappydata

Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster

1,032 stars

Scala 1 awesome list

h2oai/h2o-3

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

7,498 stars

Jupyter Notebook 3 awesome lists

apache/druid

Apache Druid: a high performance real-time analytics database.

14,030 stars

Java 3 awesome lists

apache/gobblin

A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.

2,270 stars

Java 1 awesome list

Metadata

Language: Java
License: Apache-2.0
Default branch: master
Created: 2016-12-14
First commit: 2016-12-16
Last pushed: 2026-07-14
GitHub updated: 2026-07-14
Last synced: 2026-07-15 03:04
Stack detected: 2026-07-15 03:04
Archived: no

Links and files

GitHub Website

https://hudi.apache.org/

README

Appears in

Awesome Opensource Ai
