Model Development AI Projects

Daily ranking page for Model Development open-source AI repositories.

Model Development tracks 4203 repositories with 9434621 total GitHub stars.

pytorch/pytorch - Tensors and Dynamic neural networks in Python with strong GPU acceleration (101347 stars, Python, Model Development)
deepseek-ai/DeepSpec - DeepSpec: a full-stack codebase for training and evaluating speculative decoding algorithms (5960 stars, Python, Model Development)
tensorflow/tensorflow - An Open Source Machine Learning Framework for Everyone (196013 stars, C++, Model Development)
huggingface/transformers - 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both infe... (162199 stars, Python, Model Development)
NVlabs/ProtoMotions - ProtoMotions is a GPU-accelerated simulation and learning framework for training physically simulated digital humans and humanoid robots. (1913 stars, Python, Model Development)
OpenDCAI/DataFlow - Easy Data Preparation with latest LLMs-based Operators and Pipelines. (5844 stars, Python, Model Development)
jingyaogong/minimind - 🧠「大模型」2小时完全从0训练64M的小参数LLM！Train a 64M-parameter LLM from scratch in just 2h! (52515 stars, Python, Model Development)
areal-project/AReaL - The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible. (5461 stars, Python, Model Development)
karpathy/nanoGPT - The simplest, fastest repository for training/finetuning medium-sized GPTs. (60468 stars, Python, Model Development)
harbor-framework/harbor - Framework for evaluating and improving agents (2904 stars, Python, Model Development)
THUDM/slime - slime is an LLM post-training framework for RL Scaling. (7260 stars, Python, Model Development)
unslothai/unsloth - Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally. (67776 stars, Python, Model Development)
huggingface/lerobot - 🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning (25472 stars, Python, Model Development)
karpathy/nanochat - The best ChatGPT that $100 can buy. (55724 stars, Python, Model Development)
ScrapeGraphAI/Scrapegraph-ai - Python scraper based on AI (27942 stars, Python, Model Development)
verl-project/verl - verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework (22278 stars, Python, Model Development)
karpathy/llm.c - LLM training in simple, raw C/CUDA (30434 stars, Cuda, Model Development)
NVIDIA/Megatron-LM - Ongoing research training transformer models at scale (16939 stars, Python, Model Development)
ostris/ai-toolkit - The ultimate training toolkit for finetuning diffusion models (11180 stars, Python, Model Development)
TianhangZhuzth/Fundamental-Ava - Build digital human beings — autonomous, collaborative, and socially intelligent agents. FNzgGxU31RWiDgLr3GvxxSa42nRntvZNSG6aBMQ1pump (760 stars, Python, Model Development)
scrapy/scrapy - Scrapy, a fast high-level web crawling & scraping framework for Python. (62876 stars, Python, Model Development)
hiyouga/LlamaFactory - Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) (72920 stars, Python, Model Development)
AI4Finance-Foundation/FinRL - FinRL®: Financial Reinforcement Learning. 🔥 (15596 stars, Jupyter Notebook, Model Development)
opencv/opencv - Open Source Computer Vision Library (89516 stars, C++, Model Development)
scikit-learn/scikit-learn - scikit-learn: machine learning in Python (66545 stars, Python, Model Development)
OpenDCAI/DataFlex - Data-centric LLM training with dynamic sample selection, domain mixture optimization, and example reweighting inside the LLaMA-Factory training loop. (1287 stars, Python, Model Development)
thinking-machines-lab/tinker-cookbook - Post-training with Tinker (3559 stars, Python, Model Development)
ray-project/ray - Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads. (43103 stars, Python, Model Development)
ml-explore/mlx - MLX: An array framework for Apple silicon (27413 stars, C++, Model Development)
CVHub520/X-AnyLabeling - Effortless data labeling with AI support from Segment Anything and other awesome models. (9635 stars, Python, Model Development)
isaac-sim/IsaacLab - Unified framework for robot learning built on NVIDIA Isaac Sim (7599 stars, Python, Model Development)
modelscope/ms-swift - Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, I... (14699 stars, Python, Model Development)
Physical-Intelligence/openpi - (12617 stars, Python, Model Development)
RLinf/RLinf - RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI (3987 stars, Python, Model Development)
state-spaces/mamba - Mamba SSM architecture (18537 stars, Python, Model Development)
PrimeIntellect-ai/prime-rl - Agentic RL Training at Scale (1583 stars, Python, Model Development)
pandas-dev/pandas - Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical fu... (49119 stars, Python, Model Development)
isaac-sim/IsaacSim - NVIDIA Isaac Sim™ is an open-source application on NVIDIA Omniverse for developing, simulating, and testing AI-driven robots in realistic virtual enviro... (3611 stars, Python, Model Development)
deepspeedai/DeepSpeed - DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. (42642 stars, Python, Model Development)
tinygrad/tinygrad - You like pytorch? You like micrograd? You love tinygrad! ❤️ (33218 stars, Python, Model Development)
openai/tiktoken - tiktoken is a fast BPE tokeniser for use with OpenAI's models. (18659 stars, Python, Model Development)
NVIDIA-NeMo/Speech - A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Re... (17701 stars, Python, Model Development)
thuml/Time-Series-Library - A Library for Advanced Deep Time Series Models for General Time Series Analysis. (12536 stars, Python, Model Development)
Robbyant/lingbot-world - Advancing Open-source World Models (4015 stars, Python, Model Development)
mujocolab/mjlab - Isaac Lab API, powered by MuJoCo-Warp, for RL and robotics research (2636 stars, Python, Model Development)
karpathy/minGPT - A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training (24655 stars, Python, Model Development)
huggingface/trl - Train transformer language models with reinforcement learning. (18753 stars, Python, Model Development)
cvat-ai/cvat - Computer Vision Annotation Tool (CVAT) is a leading platform for building high-quality visual datasets for vision AI. It offers open-source, cloud, and... (16213 stars, Python, Model Development)
optuna/optuna - A hyperparameter optimization framework (14449 stars, Python, Model Development)
rerun-io/rerun - Visualize, query, and stream to train on multimodal robotics data. (11052 stars, Rust, Model Development)
alirezamika/autoscraper - A Smart, Automatic, Fast and Lightweight Web Scraper for Python (7487 stars, Python, Model Development)
starVLA/starVLA - StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing (3040 stars, Python, Model Development)
KEV0143/Comparative-analysis-of-hourly-load-forecasting-using-PatchTST-TFT-NHiTS-and-CatBoost - A comprehensive time-series benchmark evaluating state-of-the-art deep learning architectures (PatchTST, TFT, N-HiTS) against traditional gradient boost... (1314 stars, Python, Model Development)
cardmagic/classifier - A general classifier module to allow Bayesian and LSI classifications. (731 stars, Ruby, Model Development)
fastai/fastai - The fastai deep learning library (28060 stars, Jupyter Notebook, Model Development)
PaddlePaddle/Paddle - PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice （『飞桨』核心框架，深度学习&机器学习高性能单机、分布式训练和跨平台部署） (24009 stars, C++, Model Development)
lightgbm-org/LightGBM - A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, cl... (18520 stars, C++, Model Development)
google-deepmind/mujoco - Multi-Joint dynamics with Contact. A general purpose physics simulator. (14072 stars, C++, Model Development)
shibing624/MedicalGPT - MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型，实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO、GRPO。 (5571 stars, Python, Model Development)
Vector-Wangel/XLeRobot - XLeRobot: Practical Dual-Arm Mobile Home Robot for $660 (5284 stars, Python, Model Development)

Open the interactive AI Rank dashboard