Open highlighted repo slot
Put your repository first
Promote a GitHub repo at the top of Awesome repository list views for 7 days.
Awesome List
Collection of AI-related utilities. Welcome to submit pull requests /收藏AI相关的实用工具,欢迎提交pull requests
GitHub stars and default-branch commits for ikaijua/Awesome-AITools.
109 repos currently saved from this list.
Open highlighted repo slot
Promote a GitHub repo at the top of Awesome repository list views for 7 days.
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
An autonomous agent for deep financial research
Google Workspace CLI — one command-line tool for Drive, Gmail, Calendar, Sheets, Docs, Chat, Admin, and more. Dynamically built from Google Discovery Service. Includes AI agent skills.
Build and run agents you can see, understand and trust.
基于 ChatGPT API 的划词翻译浏览器插件和跨平台桌面端应用 - Browser extension and cross-platform desktop application for translation based on ChatGPT API.
From the team behind Gatsby, Mastra is a framework for building AI-powered applications and agents with a modern TypeScript stack.
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Crawl a site to generate knowledge files to create your own custom GPT from a URL
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Mobile and Web client for Codex and Claude Code, with realtime voice, encryption and fully featured
Skills Catalog for Codex
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.
Use Codex from Claude Code to review code or delegate tasks.
No description.
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
YC (S26) | Give AI the ability to live your experience. Records everything you do, say, hear 24/7, local, private, secure
Get started with building Fullstack Agents using Gemini 2.5 and LangGraph
沉浸式双语网页翻译扩展 , 支持输入框翻译, 鼠标悬停翻译, PDF, Epub, 字幕文件, TXT 文件翻译 - Immersive Dual Web Page Translation Extension
Toolkit for linearizing PDFs for LLM datasets/training
Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full desktops (macOS, Linux, Windows).
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
The official Lark/Feishu CLI tool, maintained by the larksuite team — built for humans and AI Agents. Covers core business domains including Messenger, Docs, Base, Sheets, Calendar, Mail, Tasks, Meetings, and more, with 200+ commands and 20+ AI Agent Skills.
Official implementation of AnimateDiff.
Foundational Models for State-of-the-Art Speech and Text Translation
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
Official inference library for Mistral models
The open source codebase powering HuggingChat
OpenAI ChatGPT, GPT-5, GPT-Image-1, Whisper API clients for Go
No description.
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model
Open source real-time translation app for Android that runs locally
🤗 ml-intern: an open-source ML engineer that reads papers, trains models, and ships ML models
🚀💪Maximize your efficiency and productivity. The ultimate hub to manage, customize, and share prompts. (English/中文/Español/العربية). 让生产力加倍的 AI 快捷指令。更高效地管理提示词,在分享社区中发现适用于不同场景的灵感。
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Lean 4 programming language and theorem prover
🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser sandbox that lets you automate the web without worrying about infrastructure.
lightweight, standalone C++ inference engine for Google's Gemma models.
🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓
Your browser is the API. CLI + MCP server for AI agents to control Chrome with your login state.
Never stop coding. Free AI gateway: one endpoint, 160+ providers (50+ free), connect Claude Code, Codex, Cursor, Cline & Copilot to FREE Claude/GPT/Gemini. RTK+Caveman stacked compression saves 15-95% tokens, smart auto-fallback, MCP/A2A, multimodal APIs, Desktop/PWA.
Spec-driven development for large codebases
This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks
The most accurate document search and store for building AI apps
Stop configuring your AI stack. Start using it. One command brings a complete pre-wired LLM stack with hundreds of services to explore.
OpenAPI specification for the OpenAI API
The AI Agent Workforce Platform — where teams scale beyond headcount. Give every team member an AI agent squad.
企业微信开放平台命令行工具 — 让人类和 AI Agent 都能在终端中操作企业微信