Open highlighted repo slot
Put your repository first
Promote a GitHub repo at the top of Awesome repository list views for 7 days.
GitHub projects from awesome lists
Search names, descriptions, topics, tags, and stacks, then tune results by ecosystem, freshness, health, and cross-list signal.
An Open Source Machine Learning Framework for Everyone
Apache Superset is a Data Visualization and Data Exploration Platform
Apache ECharts is a powerful, interactive charting and data visualization library for the browser
scikit-learn: machine learning in Python
Deep Learning for humans
The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
Apache Spark - A unified analytics engine for large-scale data processing
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
A library for efficient similarity search and clustering of dense vectors.
TiDB is built for agentic workloads that grow unpredictably, with ACID guarantees and native support for transactions, analytics, and vector search. No data silos. No noisy neighbors. No infrastructure ceiling.
LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.
CockroachDB — the cloud native, distributed SQL database designed for high availability, effortless scale, and control over data placement.
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
High-performance, scalable time-series database designed for Industrial IoT (IIoT) scenarios
Data Apps & Dashboards for Python. No JavaScript Required.
matplotlib: plotting with Python
High-performance graph database for real-time use cases
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
Open-source JavaScript charting library behind Plotly and Dash
VictoriaMetrics: fast, cost-effective monitoring solution and time series database
Distributed transactional key-value database, originally created to complement TiDB
An orchestration platform for the development, production, and observation of data assets.
Apache Pulsar - distributed pub-sub messaging system
An open-source graph database
An embedded key/value database for Go.
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
Highly available Prometheus setup with long term storage capabilities. A CNCF Incubating project.
Apache Druid: a high performance real-time analytics database.