Sign in
← Back to search

internetarchive/Sparkling

Internet Archive's Sparkling Data Processing Library

Stars
16
Forks
2
Commits
47
Language
Scala
Awesome lists
1

Similar repositories

helgeho/ArchiveSpark

An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.

161 stars
Scala 1 awesome list

archivesunleashed/aut

The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.

157 stars
Scala 1 awesome list

svenkreiss/pysparkling

A pure Python implementation of Apache Spark's RDD and DStream interfaces.

270 stars
Python 1 awesome list

TIBCOSoftware/snappydata

Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster

1034 stars
Scala 1 awesome list

internetarchive/arch

Web application for distributed compute analysis of Archive-It web archive collections.

20 stars
Scala 1 awesome list

Tracked growth

2 captures since 2026-05-23

Latest capture 2026-05-31 03:01

Stars history

Total stars

Commits history

Default branch commits

Metadata

  • Created: 2022-04-28
  • First commit: 2022-04-28
  • Last pushed: 2026-05-04
  • Archived: no
  • Stack detected: —
  • License: MIT

AI development signals

No AI development config files detected.