sonalgoyal/hiho

Hadoop data integration with various databases, FTP servers, and Salesforce. Incremental update, dedup, append, and merge of your data on Hadoop.

  • Stars: 92
  • Forks: 31
  • Commits: 20
  • Language: Java
  • Awesome lists: 1

Similar repositories

h2oai/h2o-3

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

7484 stars
Jupyter Notebook 3 awesome lists

apache/hudi

Upserts, Deletes And Incremental Processing on Big Data.

6170 stars
Java 1 awesome list

nathanmarz/elephantdb

Distributed database specialized in exporting key/value data from Hadoop

559 stars
Java 1 awesome list

ottogroup/schedoscope

Schedoscope is a scheduling framework for painfree agile development, testing, (re)loading, and monitoring of your datahub, lake, or whatever you choose to call your Hadoop data warehouse these days.

97 stars
Scala 1 awesome list

Tracked growth

2 captures since 2026-05-24

Latest capture 2026-06-01 03:04

Charts: stars history (total stars) and commits history (default branch commits).

Metadata

  • Created: 2010-11-15
  • First commit: 2010-11-15
  • Last pushed: 2013-04-11
  • Website: www.nubetech.co/products
  • Archived: no
  • Stack detected: —
  • License: Apache-2.0

AI development signals

No AI development config files detected.

Appears in