Infrastructure AI Projects
Daily ranking of open-source AI repositories in the Infrastructure category.
The Infrastructure category tracks 1260 repositories with a combined 4947247 GitHub stars.
- decolua/9router - Unlimited FREE AI coding. Connect Claude Code, Codex, Cursor, Cline, Copilot, Antigravity to FREE Claude/GPT/Gemini via 40+ providers. Auto-fallback, RT... (10131 stars, JavaScript, Infrastructure)
- paperclipai/paperclip - The open-source app everyone uses to manage agents at work (65289 stars, TypeScript, Infrastructure)
- rocketride-org/rocketride-server - High-performance AI pipeline engine with a C++ core and 50+ Python-extensible nodes. Build, debug, and scale LLM workflows with 13+ model providers, 8+... (2840 stars, C++, Infrastructure)
- antirez/ds4 - DeepSeek 4 Flash local inference engine for Metal and CUDA (8553 stars, C, Infrastructure)
- trycua/cua - Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full desktops (macO... (16686 stars, HTML, Infrastructure)
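- Wei-Shaw/sub2api - Sub2API-CRS2: an all-in-one open-source relay service that unifies access to Claude, OpenAI, Gemini, and Antigravity subscriptions, supports shared ("carpool") use to split costs more efficiently, and works seamlessly with native tools. (20799 stars, Go, Infrastructure)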
- lemony-ai/cascadeflow - Cascading runtime for AI agents. Optimize cost, latency, quality, and policy decisions inside the agent loop. (1647 stars, Python, Infrastructure)
- Alishahryar1/free-claude-code - Use claude-code for free in the terminal, the VSCode extension, or Discord, like OpenClaw (voice supported) (24507 stars, Python, Infrastructure)
- h4ckf0r0day/obscura - The headless browser for AI agents and web scraping (12237 stars, Rust, Infrastructure)
- QuantumNous/new-api - A unified AI model hub for aggregation & distribution. It supports cross-converting various LLMs into OpenAI-compatible, Claude-compatible, or Gemini-co... (33274 stars, Go, Infrastructure)
- ggml-org/llama.cpp - LLM inference in C/C++ (110041 stars, C++, Infrastructure)
- jundot/omlx - LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar (14039 stars, Python, Infrastructure)
- BerriAI/litellm - Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Be... (46947 stars, Python, Infrastructure)
- router-for-me/CLIProxyAPI - Wrap Gemini CLI, Antigravity, ChatGPT Codex, Claude Code as an OpenAI/Gemini/Claude/Codex compatible API service, allowing you to enjoy the free Gemini... (32620 stars, Go, Infrastructure)
- vllm-project/vllm - A high-throughput and memory-efficient inference and serving engine for LLMs (79990 stars, Python, Infrastructure)
- lyogavin/airllm - AirLLM 70B inference with single 4GB GPU (17882 stars, Jupyter Notebook, Infrastructure)
- noonghunna/club-3090 - Community recipes for serving LLMs on RTX 3090. Multi-engine (vLLM, llama.cpp, SGLang) and model-agnostic. Currently shipping Qwen3.6-27B configs for 1×... (872 stars, Python, Infrastructure)
- jwadow/kiro-gateway - 👻 Proxy API gateway for Kiro IDE & CLI (Amazon Q Developer / AWS CodeWhisperer). Use free Claude models with any client. (1539 stars, Python, Infrastructure)
- InsForge/InsForge - The all-in-one, open-source backend platform for agentic coding. InsForge gives your coding agent database, auth, storage, compute, hosting, and AI gate... (9732 stars, TypeScript, Infrastructure)
- strukto-ai/mirage - A Unified Virtual Filesystem For AI Agents (2187 stars, TypeScript, Infrastructure)
- Soju06/codex-lb - Codex/ChatGPT multiple account load balancer & proxy with usage tracking, dashboard, and OpenCode-compatible endpoints (1373 stars, Python, Infrastructure)
- TencentCloud/CubeSandbox - Instant, Concurrent, Secure & Lightweight Sandbox for AI Agents. (5591 stars, Rust, Infrastructure)
- diegosouzapw/OmniRoute - Never stop coding. Free AI gateway: one endpoint, 160+ providers, RTK+Caveman stacked compression up to ~95% eligible context savings, smart auto-fallba... (4535 stars, TypeScript, Infrastructure)
- ollama/ollama - Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models. (171385 stars, Go, Infrastructure)
- exo-explore/exo - Run frontier AI locally. (44642 stars, Python, Infrastructure)
- cactus-compute/cactus - Low-latency AI engine for mobile devices & wearables (4883 stars, C, Infrastructure)
- CJackHwang/ds2api - DeepSeek-Compatible Middleware Interface: A technical exploration project in Go, focusing on high-concurrency protocol adaptation. It serves as a refere... (4388 stars, Go, Infrastructure)
- langfuse/langfuse - 🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langc... (27186 stars, TypeScript, Infrastructure)
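- songquanpeng/one-api - LLM API management & distribution system supporting mainstream models including OpenAI, Azure, Anthropic Claude, Google Gemini, DeepSeek, ByteDance Doubao, ChatGLM, Baidu ERNIE Bot (Wenxin Yiyan), iFlytek Spark, Alibaba Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan. Unified API adaptation, usable for key management and redistribution. Single executable,... (33702 stars, JavaScript, Infrastructure)
- basketikun/chatgpt2api - A pure-protocol reverse-engineered implementation of the official ChatGPT web interface. Supports an automated registration tool to maintain account-pool quota, the GPT-Image-2 model and text models, OpenAI API protocol compatibility, online batch image generation/editing, account-pool management, importing CPA and sub2api account pools, and integration with software such as Cherry Studio and New Api (2467 stars, TypeScript, Infrastructure)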
- HKUDS/CLI-Anything - "CLI-Anything: Making ALL Software Agent-Native" -- CLI-Hub: https://clianything.cc/ (34423 stars, Python, Infrastructure)
- NVIDIA/OpenShell - OpenShell is the safe, private runtime for autonomous AI agents. (5916 stars, Rust, Infrastructure)
- algorithmicsuperintelligence/optillm - Optimizing inference proxy for LLMs (3758 stars, Python, Infrastructure)
- sgl-project/sglang - SGLang is a high-performance serving framework for large language models and multimodal models. (27797 stars, Python, Infrastructure)
- steipete/CodexBar - Show usage stats for OpenAI Codex and Claude Code, without having to log in. (12149 stars, Swift, Infrastructure)
- raullenchai/Rapid-MLX - The fastest local AI engine for Apple Silicon. 4.2x faster than Ollama, 0.08s cached TTFT, 100% tool calling. 17 tool parsers, prompt cache, reasoning s... (2296 stars, Python, Infrastructure)
- mostlygeek/llama-swap - Reliable model swapping for any local OpenAI/Anthropic compatible server - llama.cpp, vllm, etc (4032 stars, Go, Infrastructure)
- haydenbleasel/files-sdk - A unified storage SDK for object and blob backends. One small, honest API. Web-standards I/O. (661 stars, TypeScript, Infrastructure)
- alvinunreal/oh-my-opencode-slim - Slimmed, cleaned, and fine-tuned oh-my-opencode fork that consumes far fewer tokens (4318 stars, TypeScript, Infrastructure)
- graykode/abtop - Like htop, but for AI coding agents. Monitor Claude Code & Codex CLI sessions, tokens, context window, rate limits, and ports in real-time. (2191 stars, Rust, Infrastructure)
- AlexsJones/llmfit - Hundreds of models & providers. One command to find what runs on your hardware. (25946 stars, Rust, Infrastructure)
- Agent-Field/agentfield - Build, run, and scale AI agents like APIs and microservices: observable, auditable, and identity-aware from day one. (1735 stars, Go, Infrastructure)
- ryoppippi/ccusage - A CLI tool for analyzing Claude Code/Codex CLI usage from local JSONL files. (14164 stars, TypeScript, Infrastructure)
- Yuan-lab-LLM/ClawManager - A Kubernetes-native control plane for AI agent instance management, with governed AI access, runtime orchestration, and reusable resources across multip... (736 stars, TypeScript, Infrastructure)
- ggml-org/whisper.cpp - Port of OpenAI's Whisper model in C/C++ (49667 stars, C++, Infrastructure)
- AgentsMesh/AgentsMesh - The AI Agent Workforce Platform — where teams scale beyond headcount. Give every team member an AI agent squad. (2105 stars, Go, Infrastructure)
- ogulcancelik/herdr - Agent multiplexer that lives in your terminal. (741 stars, Rust, Infrastructure)
- Luce-Org/lucebox-hub - Lucebox optimization hub: hand-tuned LLM inference, built for specific consumer hardware. (2048 stars, C++, Infrastructure)
- justlovemaki/AIClient2API - Simulates Gemini CLI, Antigravity, Codex, Grok, and Kiro client requests, compatible with the OpenAI API. It supports thousands of Gemini model requests... (7810 stars, JavaScript, Infrastructure)
- wavetermdev/waveterm - An open-source, AI-integrated, cross-platform terminal for seamless workflows (20391 stars, Go, Infrastructure)
- MetapriseAI/OrgKernel - Open-source trust layer for AI agents — cryptographic agent identity (Ed25519), instance-scoped execution tokens, SHA-256 hash-chained audit logging, an... (800 stars, Python, Infrastructure)
- qdrant/qdrant - Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://clo... (31310 stars, Rust, Infrastructure)
- mlflow/mlflow - The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize pro... (25929 stars, Python, Infrastructure)
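- Wei-Shaw/claude-relay-service - CRS: a self-hosted Claude Code mirror and all-in-one open-source relay service that unifies access to Claude, OpenAI, Gemini, and Droid subscriptions, supports shared ("carpool") use to split costs more efficiently, and works seamlessly with native tools. (11712 stars, JavaScript, Infrastructure)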
- maximhq/bifrost - Fastest enterprise AI gateway (50x faster than LiteLLM) with adaptive load balancer, cluster mode, guardrails, 1000+ models support & <100 µs overhead a... (4909 stars, Go, Infrastructure)
- looplj/axonhub - ⚡️ Open-source AI Gateway — Use any SDK to call 100+ LLMs. Built-in failover, load balancing, cost control & end-to-end tracing. (3753 stars, Go, Infrastructure)
- dwgx/WindsurfAPI - Windsurf-to-OpenAI compatible API proxy (2346 stars, JavaScript, Infrastructure)
- mudler/LocalAI - LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required. (46254 stars, Go, Infrastructure)
- NVIDIA/NemoClaw - Run OpenClaw more securely inside NVIDIA OpenShell with managed inference (20397 stars, TypeScript, Infrastructure)
- ascending-llc/jarvis-registry - Connect any AI copilot or autonomous agent to your enterprise tools — through a single, secure MCP/Agent gateway with built-in identity, access control,... (811 stars, Python, Infrastructure)