AI Rank

Infrastructure AI Projects

Daily ranking of open-source AI repositories in the Infrastructure category.

The Infrastructure category tracks 1,352 repositories with 5,465,025 total GitHub stars.

  1. diegosouzapw/OmniRoute - Never stop coding. Free AI gateway: one endpoint, 231+ providers (50+ free), connect Claude Code, Codex, Cursor, Cline & Copilot to FREE Claude/GPT/Gemi... (10506 stars, TypeScript, Infrastructure)
  2. ogulcancelik/herdr - agent multiplexer that lives in your terminal. (10391 stars, Rust, Infrastructure)
  3. Alishahryar1/free-claude-code - Use claude code and codex for free in the terminal, VSCode extension, and discord like OpenClaw (voice supported) (38378 stars, Python, Infrastructure)
  4. lemony-ai/cascadeflow - Cascading runtime for AI agents. Optimize cost, latency, quality, and policy decisions inside the agent loop. (3083 stars, Python, Infrastructure)
  5. decolua/9router - Unlimited FREE AI coding. Connect Claude Code, Codex, Cursor, Cline, Copilot, Antigravity to FREE Claude/GPT/Gemini via 40+ providers. Auto-fallback, RT... (19607 stars, JavaScript, Infrastructure)
  6. rocketride-org/rocketride-server - High-performance AI pipeline engine with a C++ core and 50+ Python-extensible nodes. Build, debug, and scale LLM workflows with 13+ model providers, 8+... (4873 stars, Python, Infrastructure)
  7. Nasiko-Labs/nasiko - Developer Control Plane for your AI Agents (3627 stars, Python, Infrastructure)
  8. h4ckf0r0day/obscura - The headless browser for AI agents and web scraping (17443 stars, Rust, Infrastructure)
  9. Wei-Shaw/sub2api - Sub2API is an open-source relay platform that unifies Claude, OpenAI, Gemini, and Antigravity subscriptions into a single endpoint. It supports account... (30095 stars, Go, Infrastructure)
  10. tashfeenahmed/freellmapi - OpenAI-compatible proxy that stacks the free tiers of 16 LLM providers (~1.7B tokens/month) behind one /v1 endpoint — plus any custom OpenAI-compatible... (14914 stars, TypeScript, Infrastructure)
  11. HKUDS/CLI-Anything - "CLI-Anything: Making ALL Software Agent-Native" -- CLI-Hub: https://clianything.cc/ (44644 stars, Python, Infrastructure)
  12. antirez/ds4 - DeepSeek 4 Flash and PRO local inference engine for Metal, CUDA and ROCm (17392 stars, C, Infrastructure)
  13. ggml-org/llama.cpp - LLM inference in C/C++ (119114 stars, C++, Infrastructure)
  14. vllm-project/vllm - A high-throughput and memory-efficient inference and serving engine for LLMs (85255 stars, Python, Infrastructure)
  15. QuantumNous/new-api - A unified AI model hub for aggregation & distribution. It supports cross-converting various LLMs into OpenAI-compatible, Claude-compatible, or Gemini-co... (40994 stars, Go, Infrastructure)
  16. BerriAI/litellm - Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Be... (52508 stars, Python, Infrastructure)
  17. router-for-me/CLIProxyAPI - Wrap Antigravity, ChatGPT Codex, Claude Code, Grok Build as an OpenAI/Gemini/Claude/Codex compatible API service, allowing you to enjoy the free Gemini... (39055 stars, Go, Infrastructure)
  18. ollama/ollama - Get up and running with Kimi-K2.6, GLM-5.1, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models. (175354 stars, Go, Infrastructure)
  19. seakee/CPA-Manager-Plus - Self-hosted AI gateway monitoring — track requests, cost, failures, quota, and account health for CPA / CLIProxyAPI and OpenAI-compatible gateways. (1181 stars, TypeScript, Infrastructure)
  20. paperclipai/paperclip - The open-source app everyone uses to manage agents at work (72611 stars, TypeScript, Infrastructure)
  21. langfuse/langfuse - 🪢 Open source AI engineering platform: LLM evals, observability, metrics, prompt management, playground, datasets. Integrates with OpenTelemetry, LangCh... (30359 stars, TypeScript, Infrastructure)
  22. steipete/CodexBar - Show usage stats for OpenAI Codex and Claude Code, without having to login. (15734 stars, Swift, Infrastructure)
  23. alvinunreal/oh-my-opencode-slim - Lean, fine tuned Opencode multi agent suite · Mix any models · Auto delegate tasks (6447 stars, TypeScript, Infrastructure)
  24. adrida/tracer - TRACER: replace 90%+ of your LLM classification calls with a traditional ML model. Formal parity guarantees. Self-improving. (846 stars, Jupyter Notebook, Infrastructure)
  25. AlexsJones/llmfit - Hundreds of models & providers. One command to find what runs on your hardware. (29017 stars, Rust, Infrastructure)
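  26. basketikun/chatgpt2api - Pure-protocol reverse-engineered implementation of the official ChatGPT web interface. Supports a registration bot to maintain account-pool quota, the GPT-Image-2 model and text models, compatibility with the OpenAI API protocol, online batch image generation and editing, account-pool management, reverse engineering of editable PPT/PSD files, importing CPA and sub2api account pools, and integration with software such as Cherry Studio and New Api (4758 stars, Python, Infrastructure)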
  27. alibaba/zvec - A lightweight, lightning-fast, in-process vector database (12740 stars, C++, Infrastructure)
  28. lyogavin/airllm - AirLLM 70B inference with single 4GB GPU (22102 stars, Jupyter Notebook, Infrastructure)
  29. jundot/omlx - LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar (17430 stars, Python, Infrastructure)
  30. sgl-project/sglang - SGLang is a high-performance serving framework for large language models and multimodal models. (29935 stars, Python, Infrastructure)
  31. kunchenguid/axi - Design principles for agent ergonomics. Higher accuracy with lower token cost than both MCP and regular CLI. (1279 stars, TypeScript, Infrastructure)
  32. TencentCloud/CubeSandbox - Instant, Concurrent, Secure & Lightweight Sandbox for AI Agents. (7022 stars, Rust, Infrastructure)
  33. comet-ml/opik - Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production... (20254 stars, Python, Infrastructure)
  34. ccusage/ccusage - npx ccusage (16807 stars, Rust, Infrastructure)
  35. ascending-llc/jarvis-registry - Connect any AI copilot or autonomous agent to your enterprise tools — through a single, secure MCP/Agent gateway with built-in identity, access control,... (1808 stars, Python, Infrastructure)
  36. exo-explore/exo - Run frontier AI locally. (45864 stars, Python, Infrastructure)
  37. workweave/router - Model router for agentic systems. Routes every prompt to the right model in <50ms. Cut costs 40-70% with just an endpoint change. (713 stars, Go, Infrastructure)
  38. maximhq/bifrost - Fastest enterprise AI gateway (50x faster than LiteLLM) with adaptive load balancer, cluster mode, guardrails, 1000+ models support & <100 µs overhead a... (6244 stars, Go, Infrastructure)
  39. andimarafioti/faster-qwen3-tts - Real-time text-to-speech with Qwen3-TTS (1178 stars, Python, Infrastructure)
  40. ggml-org/whisper.cpp - Port of OpenAI's Whisper model in C/C++ (51247 stars, C++, Infrastructure)
  41. p-e-w/heretic - Fully automatic censorship removal for language models (25758 stars, Python, Infrastructure)
  42. vllm-project/semantic-router - System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge (4754 stars, Go, Infrastructure)
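  43. songquanpeng/one-api - LLM API management & distribution system supporting mainstream models including OpenAI, Azure, Anthropic Claude, Google Gemini, DeepSeek, ByteDance Doubao, ChatGLM, ERNIE Bot (Wenxin Yiyan), iFlytek Spark, Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan, with unified API adaptation; usable for key management and redistribution. Single executable file,... (35473 stars, JavaScript, Infrastructure)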
  44. LMCache/LMCache - LMCache: Supercharge Your LLM with the Fastest KV Cache Layer (10033 stars, Python, Infrastructure)
  45. junhoyeo/tokscale - 🛰️ A CLI tool for tracking token usage from OpenCode, Claude Code, 🦞OpenClaw, Pi, Codex, Gemini, Cursor, AmpCode, Factory Droid, Kimi, and more! • 🏅Glob... (4105 stars, Rust, Infrastructure)
  46. NVIDIA/NemoClaw - Run agents like Hermes and OpenClaw more securely inside NVIDIA OpenShell with managed inference (21575 stars, TypeScript, Infrastructure)
  47. SYSTRAN/faster-whisper - Faster Whisper transcription with CTranslate2 (23999 stars, Python, Infrastructure)
  48. graykode/abtop - Like htop, but for AI coding agents. Monitor Claude Code & Codex CLI sessions, tokens, context window, rate limits, and ports in real-time. (3267 stars, Rust, Infrastructure)
  49. ZhangJinHaHaHa/AgentLens - Agentlens is a trusted agent trading platform. Here, you can quickly find the Agent that meets your needs, and you can also publish your own Agent to tu... (883 stars, TypeScript, Infrastructure)
  50. mudler/LocalAI - LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required. (47291 stars, Go, Infrastructure)
  51. opensandbox-group/OpenSandbox - Secure, Fast, and Extensible Sandbox runtime for AI agents. (11798 stars, Python, Infrastructure)
  52. watercrawl/WaterCrawl - Transform Web Content into LLM-Ready Data (1939 stars, TypeScript, Infrastructure)
  53. trycua/cua - Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full desktops (macO... (19332 stars, HTML, Infrastructure)
  54. kvcache-ai/Mooncake - Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI. (5737 stars, C++, Infrastructure)
  55. llm-d/llm-d - Achieve state of the art inference performance with modern accelerators on Kubernetes (3676 stars, Shell, Infrastructure)
  56. puppyone-ai/puppyone - Context drive for your AI agents (782 stars, TypeScript, Infrastructure)
  57. vllm-project/speculators - A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM (574 stars, Python, Infrastructure)
  58. openlake-project/openlake - OpenLake is a high performance storage engine for efficient LLM inference and GPU Training (1709 stars, Rust, Infrastructure)
  59. lotus-data/lotus - Optimized LLM and Agentic Data Processing (1632 stars, Python, Infrastructure)
  60. vosen/ZLUDA - CUDA on non-NVIDIA GPUs (14535 stars, Rust, Infrastructure)
