Open highlighted repo slot
Put your repository first
Promote a GitHub repo at the top of Awesome repository list views for 7 days.
Awesome-list intelligence for GitHub
Discover projects curated by awesome-list maintainers, then narrow them by stars, age, freshness, archive status, language, topics, generated tags, detected stacks, package managers, and source list.
The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.
A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.
LlamaIndex is the leading document agent and OCR platform
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
Memory platform for AI Agents in 6 lines of code
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.
Memory layer for AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer. Give your agents instant retrieval and long-term memory.
💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows
LangChain4j is an idiomatic, open-source Java library for building LLM-powered applications on the JVM. It offers a unified API over popular LLM providers and vector stores, and makes implementing tool calling (including MCP support), agents and RAG easy. It integrates seamlessly with enterprise Java frameworks like Quarkus and Spring Boot.
Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.
🌌 A complete search engine and RAG pipeline in your browser, server or edge network with support for full-text, vector, and hybrid search in less than 2kb.
A lightweight, lightning-fast, in-process vector database
Data Agent Ready Warehouse: one for analytics, search, AI, and a Python sandbox, rebuilt from scratch. Unified architecture on your S3.
Deeplake is an AI Data Runtime for Agents. It provides serverless Postgres with a multimodal datalake, enabling scalable retrieval and training.
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
Postgres with GPUs for ML/AI apps.
A query and indexing engine for Redis, providing secondary indexing, full-text search, vector similarity search and aggregations.
Open-source framework for building AI-powered apps in JavaScript, Go, and Python, built and used in production by Google
HelixDB is an open-source graph-vector database built from scratch in Rust.
The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.
Local persistent memory store for LLM applications including Claude Desktop, GitHub Copilot, Codex, Antigravity, etc.
pingcap/autoflow is a Graph RAG-based conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tidb.ai
The PHP Agentic Framework to build production-ready AI-driven applications. Connect components (LLMs, vector DBs, memory) to agents that can interact with your data.
Open-source persistent memory for AI agent pipelines (LangGraph, CrewAI, AutoGen) and Claude. REST API + knowledge graph + autonomous consolidation.
Practical course about Large Language Models.
Scalable, fast, and disk-friendly vector search in Postgres, the successor of pgvecto.rs.
Ground truth layer for humans and AI agents working together. Version control for knowledge.
Humans and AI agents, building knowledge bases together. Self-hosted document annotation, version control, semantic search, and MCP.