Model Development AI Projects
Daily ranking page for Model Development open-source AI repositories.
Model Development tracks 4203 repositories with 9434621 total GitHub stars.
- pytorch/pytorch - Tensors and Dynamic neural networks in Python with strong GPU acceleration (101347 stars, Python, Model Development)
- deepseek-ai/DeepSpec - DeepSpec: a full-stack codebase for training and evaluating speculative decoding algorithms (5960 stars, Python, Model Development)
- tensorflow/tensorflow - An Open Source Machine Learning Framework for Everyone (196013 stars, C++, Model Development)
- huggingface/transformers - 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both infe... (162199 stars, Python, Model Development)
- NVlabs/ProtoMotions - ProtoMotions is a GPU-accelerated simulation and learning framework for training physically simulated digital humans and humanoid robots. (1913 stars, Python, Model Development)
- OpenDCAI/DataFlow - Easy Data Preparation with latest LLMs-based Operators and Pipelines. (5844 stars, Python, Model Development)
- jingyaogong/minimind - 🧠 Train a 64M-parameter LLM from scratch in just 2h! (52515 stars, Python, Model Development)
- areal-project/AReaL - The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible. (5461 stars, Python, Model Development)
- karpathy/nanoGPT - The simplest, fastest repository for training/finetuning medium-sized GPTs. (60468 stars, Python, Model Development)
- harbor-framework/harbor - Framework for evaluating and improving agents (2904 stars, Python, Model Development)
- THUDM/slime - slime is an LLM post-training framework for RL Scaling. (7260 stars, Python, Model Development)
- unslothai/unsloth - Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally. (67776 stars, Python, Model Development)
- huggingface/lerobot - 🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning (25472 stars, Python, Model Development)
- karpathy/nanochat - The best ChatGPT that $100 can buy. (55724 stars, Python, Model Development)
- ScrapeGraphAI/Scrapegraph-ai - Python scraper based on AI (27942 stars, Python, Model Development)
- verl-project/verl - verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework (22278 stars, Python, Model Development)
- karpathy/llm.c - LLM training in simple, raw C/CUDA (30434 stars, Cuda, Model Development)
- NVIDIA/Megatron-LM - Ongoing research training transformer models at scale (16939 stars, Python, Model Development)
- ostris/ai-toolkit - The ultimate training toolkit for finetuning diffusion models (11180 stars, Python, Model Development)
- TianhangZhuzth/Fundamental-Ava - Build digital human beings: autonomous, collaborative, and socially intelligent agents. (760 stars, Python, Model Development)
- scrapy/scrapy - Scrapy, a fast high-level web crawling & scraping framework for Python. (62876 stars, Python, Model Development)
- hiyouga/LlamaFactory - Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) (72920 stars, Python, Model Development)
- AI4Finance-Foundation/FinRL - FinRL®: Financial Reinforcement Learning. 🔥 (15596 stars, Jupyter Notebook, Model Development)
- opencv/opencv - Open Source Computer Vision Library (89516 stars, C++, Model Development)
- scikit-learn/scikit-learn - scikit-learn: machine learning in Python (66545 stars, Python, Model Development)
- OpenDCAI/DataFlex - Data-centric LLM training with dynamic sample selection, domain mixture optimization, and example reweighting inside the LLaMA-Factory training loop. (1287 stars, Python, Model Development)
- thinking-machines-lab/tinker-cookbook - Post-training with Tinker (3559 stars, Python, Model Development)
- ray-project/ray - Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads. (43103 stars, Python, Model Development)
- ml-explore/mlx - MLX: An array framework for Apple silicon (27413 stars, C++, Model Development)
- CVHub520/X-AnyLabeling - Effortless data labeling with AI support from Segment Anything and other awesome models. (9635 stars, Python, Model Development)
- isaac-sim/IsaacLab - Unified framework for robot learning built on NVIDIA Isaac Sim (7599 stars, Python, Model Development)
- modelscope/ms-swift - Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, I... (14699 stars, Python, Model Development)
- Physical-Intelligence/openpi - (12617 stars, Python, Model Development)
- RLinf/RLinf - RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI (3987 stars, Python, Model Development)
- state-spaces/mamba - Mamba SSM architecture (18537 stars, Python, Model Development)
- PrimeIntellect-ai/prime-rl - Agentic RL Training at Scale (1583 stars, Python, Model Development)
- pandas-dev/pandas - Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical fu... (49119 stars, Python, Model Development)
- isaac-sim/IsaacSim - NVIDIA Isaac Sim™ is an open-source application on NVIDIA Omniverse for developing, simulating, and testing AI-driven robots in realistic virtual enviro... (3611 stars, Python, Model Development)
- deepspeedai/DeepSpeed - DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. (42642 stars, Python, Model Development)
- tinygrad/tinygrad - You like pytorch? You like micrograd? You love tinygrad! ❤️ (33218 stars, Python, Model Development)
- openai/tiktoken - tiktoken is a fast BPE tokeniser for use with OpenAI's models. (18659 stars, Python, Model Development)
- NVIDIA-NeMo/Speech - A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Re... (17701 stars, Python, Model Development)
- thuml/Time-Series-Library - A Library for Advanced Deep Time Series Models for General Time Series Analysis. (12536 stars, Python, Model Development)
- Robbyant/lingbot-world - Advancing Open-source World Models (4015 stars, Python, Model Development)
- mujocolab/mjlab - Isaac Lab API, powered by MuJoCo-Warp, for RL and robotics research (2636 stars, Python, Model Development)
- karpathy/minGPT - A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training (24655 stars, Python, Model Development)
- huggingface/trl - Train transformer language models with reinforcement learning. (18753 stars, Python, Model Development)
- cvat-ai/cvat - Computer Vision Annotation Tool (CVAT) is a leading platform for building high-quality visual datasets for vision AI. It offers open-source, cloud, and... (16213 stars, Python, Model Development)
- optuna/optuna - A hyperparameter optimization framework (14449 stars, Python, Model Development)
- rerun-io/rerun - Visualize, query, and stream to train on multimodal robotics data. (11052 stars, Rust, Model Development)
- alirezamika/autoscraper - A Smart, Automatic, Fast and Lightweight Web Scraper for Python (7487 stars, Python, Model Development)
- starVLA/starVLA - StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing (3040 stars, Python, Model Development)
- KEV0143/Comparative-analysis-of-hourly-load-forecasting-using-PatchTST-TFT-NHiTS-and-CatBoost - A comprehensive time-series benchmark evaluating state-of-the-art deep learning architectures (PatchTST, TFT, N-HiTS) against traditional gradient boost... (1314 stars, Python, Model Development)
- cardmagic/classifier - A general classifier module to allow Bayesian and LSI classifications. (731 stars, Ruby, Model Development)
- fastai/fastai - The fastai deep learning library (28060 stars, Jupyter Notebook, Model Development)
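- PaddlePaddle/Paddle - PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (PaddlePaddle core framework: high-performance single-machine and distributed training and cross-platform deployment for deep learning and machine learning) (24009 stars, C++, Model Development)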
- lightgbm-org/LightGBM - A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, cl... (18520 stars, C++, Model Development)
- google-deepmind/mujoco - Multi-Joint dynamics with Contact. A general purpose physics simulator. (14072 stars, C++, Model Development)
- shibing624/MedicalGPT - MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large language models, implementing incremental pretraining (PT), supervised fine-tuning (SFT), RLHF, DPO, ORPO, and GRPO. (5571 stars, Python, Model Development)
- Vector-Wangel/XLeRobot - XLeRobot: Practical Dual-Arm Mobile Home Robot for $660 (5284 stars, Python, Model Development)