Infrastructure AI Projects

Daily ranking page for Infrastructure open-source AI repositories.

Infrastructure tracks 1352 repositories with 5465025 total GitHub stars.

diegosouzapw/OmniRoute - Never stop coding. Free AI gateway: one endpoint, 231+ providers (50+ free), connect Claude Code, Codex, Cursor, Cline & Copilot to FREE Claude/GPT/Gemi... (10506 stars, TypeScript, Infrastructure)
ogulcancelik/herdr - agent multiplexer that lives in your terminal. (10391 stars, Rust, Infrastructure)
Alishahryar1/free-claude-code - Use claude code and codex for free in the terminal, VSCode extension, and discord like OpenClaw (voice supported) (38378 stars, Python, Infrastructure)
lemony-ai/cascadeflow - Cascading runtime for AI agents. Optimize cost, latency, quality, and policy decisions inside the agent loop. (3083 stars, Python, Infrastructure)
decolua/9router - Unlimited FREE AI coding. Connect Claude Code, Codex, Cursor, Cline, Copilot, Antigravity to FREE Claude/GPT/Gemini via 40+ providers. Auto-fallback, RT... (19607 stars, JavaScript, Infrastructure)
rocketride-org/rocketride-server - High-performance AI pipeline engine with a C++ core and 50+ Python-extensible nodes. Build, debug, and scale LLM workflows with 13+ model providers, 8+... (4873 stars, Python, Infrastructure)
Nasiko-Labs/nasiko - Developer Control Plane for your AI Agents (3627 stars, Python, Infrastructure)
h4ckf0r0day/obscura - The headless browser for AI agents and web scraping (17443 stars, Rust, Infrastructure)
Wei-Shaw/sub2api - Sub2API is an open-source relay platform that unifies Claude, OpenAI, Gemini, and Antigravity subscriptions into a single endpoint. It supports account... (30095 stars, Go, Infrastructure)
tashfeenahmed/freellmapi - OpenAI-compatible proxy that stacks the free tiers of 16 LLM providers (~1.7B tokens/month) behind one /v1 endpoint — plus any custom OpenAI-compatible... (14914 stars, TypeScript, Infrastructure)
HKUDS/CLI-Anything - "CLI-Anything: Making ALL Software Agent-Native" -- CLI-Hub: https://clianything.cc/ (44644 stars, Python, Infrastructure)
antirez/ds4 - DeepSeek 4 Flash and PRO local inference engine for Metal, CUDA and ROCm (17392 stars, C, Infrastructure)
ggml-org/llama.cpp - LLM inference in C/C++ (119114 stars, C++, Infrastructure)
vllm-project/vllm - A high-throughput and memory-efficient inference and serving engine for LLMs (85255 stars, Python, Infrastructure)
QuantumNous/new-api - A unified AI model hub for aggregation & distribution. It supports cross-converting various LLMs into OpenAI-compatible, Claude-compatible, or Gemini-co... (40994 stars, Go, Infrastructure)
BerriAI/litellm - Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Be... (52508 stars, Python, Infrastructure)
router-for-me/CLIProxyAPI - Wrap Antigravity, ChatGPT Codex, Claude Code, Grok Build as an OpenAI/Gemini/Claude/Codex compatible API service, allowing you to enjoy the free Gemini... (39055 stars, Go, Infrastructure)
ollama/ollama - Get up and running with Kimi-K2.6, GLM-5.1, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models. (175354 stars, Go, Infrastructure)
seakee/CPA-Manager-Plus - Self-hosted AI gateway monitoring — track requests, cost, failures, quota, and account health for CPA / CLIProxyAPI and OpenAI-compatible gateways. (1181 stars, TypeScript, Infrastructure)
paperclipai/paperclip - The open-source app everyone uses to manage agents at work (72611 stars, TypeScript, Infrastructure)
langfuse/langfuse - 🪢 Open source AI engineering platform: LLM evals, observability, metrics, prompt management, playground, datasets. Integrates with OpenTelemetry, LangCh... (30359 stars, TypeScript, Infrastructure)
steipete/CodexBar - Show usage stats for OpenAI Codex and Claude Code, without having to login. (15734 stars, Swift, Infrastructure)
alvinunreal/oh-my-opencode-slim - Lean, fine tuned Opencode multi agent suite · Mix any models · Auto delegate tasks (6447 stars, TypeScript, Infrastructure)
adrida/tracer - TRACER: replace 90%+ of your LLM classification calls with a traditional ML model. Formal parity guarantees. Self-improving. (846 stars, Jupyter Notebook, Infrastructure)
AlexsJones/llmfit - Hundreds of models & providers. One command to find what runs on your hardware. (29017 stars, Rust, Infrastructure)
basketikun/chatgpt2api - ChatGPT官网接口纯协议的逆向实现，支持注册机维持号池额度，支持GPT-Image-2模型、文本模型，兼容OpenAI接口协议，在线批量生图/编辑图，号池管理，支持可编辑PPT/PSD文件逆向，支持导入CPA、sub2api号池、支持接入Cherry Studio、New Api 等软件 (4758 stars, Python, Infrastructure)
alibaba/zvec - A lightweight, lightning-fast, in-process vector database (12740 stars, C++, Infrastructure)
lyogavin/airllm - AirLLM 70B inference with single 4GB GPU (22102 stars, Jupyter Notebook, Infrastructure)
jundot/omlx - LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar (17430 stars, Python, Infrastructure)
sgl-project/sglang - SGLang is a high-performance serving framework for large language models and multimodal models. (29935 stars, Python, Infrastructure)
kunchenguid/axi - Design principles for agent ergonomics. Higher accuracy with lower token cost than both MCP and regular CLI. (1279 stars, TypeScript, Infrastructure)
TencentCloud/CubeSandbox - Instant, Concurrent, Secure & Lightweight Sandbox for AI Agents. (7022 stars, Rust, Infrastructure)
comet-ml/opik - Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production... (20254 stars, Python, Infrastructure)
ccusage/ccusage - npx ccusage (16807 stars, Rust, Infrastructure)
ascending-llc/jarvis-registry - Connect any AI copilot or autonomous agent to your enterprise tools — through a single, secure MCP/Agent gateway with built-in identity, access control,... (1808 stars, Python, Infrastructure)
exo-explore/exo - Run frontier AI locally. (45864 stars, Python, Infrastructure)
workweave/router - Model router for agentic systems. Routes every prompt to the right model in <50ms. Cut costs 40-70% with just an endpoint change. (713 stars, Go, Infrastructure)
maximhq/bifrost - Fastest enterprise AI gateway (50x faster than LiteLLM) with adaptive load balancer, cluster mode, guardrails, 1000+ models support & <100 µs overhead a... (6244 stars, Go, Infrastructure)
andimarafioti/faster-qwen3-tts - Real-time text-to-speech with Qwen3-TTS (1178 stars, Python, Infrastructure)
ggml-org/whisper.cpp - Port of OpenAI's Whisper model in C/C++ (51247 stars, C++, Infrastructure)
p-e-w/heretic - Fully automatic censorship removal for language models (25758 stars, Python, Infrastructure)
vllm-project/semantic-router - System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge (4754 stars, Go, Infrastructure)
songquanpeng/one-api - LLM API 管理 & 分发系统，支持 OpenAI、Azure、Anthropic Claude、Google Gemini、DeepSeek、字节豆包、ChatGLM、文心一言、讯飞星火、通义千问、360 智脑、腾讯混元等主流模型，统一 API 适配，可用于 key 管理与二次分发。单可执行文件，... (35473 stars, JavaScript, Infrastructure)
LMCache/LMCache - LMCache: Supercharge Your LLM with the Fastest KV Cache Layer (10033 stars, Python, Infrastructure)
junhoyeo/tokscale - 🛰️ A CLI tool for tracking token usage from OpenCode, Claude Code, 🦞OpenClaw, Pi, Codex, Gemini, Cursor, AmpCode, Factory Droid, Kimi, and more! • 🏅Glob... (4105 stars, Rust, Infrastructure)
NVIDIA/NemoClaw - Run agents like Hermes and OpenClaw more securely inside NVIDIA OpenShell with managed inference (21575 stars, TypeScript, Infrastructure)
SYSTRAN/faster-whisper - Faster Whisper transcription with CTranslate2 (23999 stars, Python, Infrastructure)
graykode/abtop - Like htop, but for AI coding agents. Monitor Claude Code & Codex CLI sessions, tokens, context window, rate limits, and ports in real-time. (3267 stars, Rust, Infrastructure)
ZhangJinHaHaHa/AgentLens - Agentlens is a trusted agent trading platform. Here, you can quickly find the Agent that meets your needs, and you can also publish your own Agent to tu... (883 stars, TypeScript, Infrastructure)
mudler/LocalAI - LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required. (47291 stars, Go, Infrastructure)
opensandbox-group/OpenSandbox - Secure, Fast, and Extensible Sandbox runtime for AI agents. (11798 stars, Python, Infrastructure)
watercrawl/WaterCrawl - Transform Web Content into LLM-Ready Data (1939 stars, TypeScript, Infrastructure)
trycua/cua - Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full desktops (macO... (19332 stars, HTML, Infrastructure)
kvcache-ai/Mooncake - Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI. (5737 stars, C++, Infrastructure)
llm-d/llm-d - Achieve state of the art inference performance with modern accelerators on Kubernetes (3676 stars, Shell, Infrastructure)
puppyone-ai/puppyone - Context drive for your AI agents (782 stars, TypeScript, Infrastructure)
vllm-project/speculators - A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM (574 stars, Python, Infrastructure)
openlake-project/openlake - OpenLake is a high performance storage engine for efficient LLM inference and GPU Training (1709 stars, Rust, Infrastructure)
lotus-data/lotus - Optimized LLM and Agentic Data Processing (1632 stars, Python, Infrastructure)
vosen/ZLUDA - CUDA on non-NVIDIA GPUs (14535 stars, Rust, Infrastructure)

Open the interactive AI Rank dashboard