[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"project-9587":3},{"id":4,"name":5,"fullName":6,"owner":7,"repo":5,"description":8,"homepage":9,"htmlUrl":10,"language":10,"languages":10,"totalLinesOfCode":10,"stars":11,"forks":12,"watchers":13,"openIssues":14,"contributorsCount":15,"subscribersCount":15,"size":15,"stars1d":16,"stars7d":17,"stars30d":18,"stars90d":15,"forks30d":15,"starsTrendScore":19,"compositeScore":20,"rankGlobal":10,"rankLanguage":10,"license":21,"archived":22,"fork":22,"defaultBranch":23,"hasWiki":24,"hasPages":24,"topics":25,"createdAt":10,"pushedAt":10,"updatedAt":45,"readmeContent":46,"aiSummary":47,"trendingCount":15,"starSnapshotCount":15,"syncStatus":48,"lastSyncTime":49,"discoverSource":50},9587,"awesome-production-machine-learning","EthicalML\u002Fawesome-production-machine-learning","EthicalML","A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning","https:\u002F\u002Fethicalml.github.io\u002Fawesome-production-machine-learning",null,20614,2580,416,6,0,4,19,101,17,45,"MIT License",false,"master",true,[26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44],"awesome","awesome-list","data-mining","deep-learning","explainability","interpretability","large-scale-machine-learning","large-scale-ml","machine-learning","machine-learning-operations","ml-operations","ml-ops","mlops","privacy-preserving","privacy-preserving-machine-learning","privacy-preserving-ml","production-machine-learning","production-ml","responsible-ai","2026-06-12 02:02:09","[![Awesome](https:\u002F\u002Fawesome.re\u002Fbadge.svg)](https:\u002F\u002Fawesome.re)\n[![X](https:\u002F\u002Fimg.shields.io\u002Fbadge\u002FX-%23000000?logo=X&logoColor=white)](https:\u002F\u002Ftwitter.com\u002FEthicalML)\n\n# Awesome Production Machine Learning\n\nThis repository contains a curated list of awesome open source libraries that will help you deploy, monitor, version, scale, and secure your production machine learning 🚀\n\nYou can keep up to date by watching this github repo to get a summary of the new production ML libraries added every month [via releases](https:\u002F\u002Fgithub.com\u002FEthicalML\u002Fawesome-production-machine-learning\u002Freleases) 🤩\n\nAdditionally, we provide a [search toolkit](https:\u002F\u002Fhuggingface.co\u002Fspaces\u002Fzhiminy\u002FAwesome-Production-Machine-Learning-Search) that helps you quickly navigate through the toolchain.\n\n## Quick links to sections on this page\n\n| | | |\n|-|-|-|\n| [🔧 AutoML](#automl) | [🧮 Computation & Communication Optimisation](#computation-and-communication-optimisation) | [🏷️ Data Annotation & Synthesis](#data-annotation-and-synthesis) |\n| [🧵 Data Pipeline](#data-pipeline) | [📓 Data Science Notebook](#data-science-notebook) | [💾 Data Storage Optimisation](#data-storage-optimisation) |\n| [💸 Data Stream Processing](#data-stream-processing) | [💪 Deployment & Serving](#deployment-and-serving) | [📈 Evaluation & Monitoring](#evaluation-and-monitoring) |\n| [🔍 Explainability & Fairness](#explainability-and-fairness) | [🎁 Feature Store](#feature-store) | [🔴 Industry-strength Anomaly Detection](#industry-strength-anomaly-detection) |\n| [👁️ Industry-strength Computer Vision](#industry-strength-computer-vision) | [🔥 Industry-strength Information Retrieval](#industry-strength-information-retrieval) | [🔠 Industry-strength Natural Language Processing](#industry-strength-nlp) |\n| [🙌 Industry-strength Recommender System](#industry-strength-recommender-system) | [🍕 Industry-strength Reinforcement Learning](#industry-strength-reinforcement-learning) | [🤖 Industry-strength Robotics](#industry-strength-robotics) |\n| [📊 Industry-strength Visualisation](#industry-strength-visualisation) | [📅 Metadata Management](#metadata-management) | [📜 Model, Data & Experiment Management](#model-data-and-experiment-management) |\n| [🔩 Model Storage Optimisation](#model-storage-optimisation) | [🏁 Model Training & Orchestration](#model-training-and-orchestration) | [🔏 Privacy & Safety](#privacy-and-safety) |\n\n## Contributing to the list\n\nPlease review our [CONTRIBUTING.md](https:\u002F\u002Fgithub.com\u002FEthicalML\u002Fawesome-production-machine-learning\u002Fblob\u002Fmaster\u002FCONTRIBUTING.md) requirements when submitting a PR to help us keep the list clean and up-to-date - thank you to the community for supporting its steady growth 🚀\n\n\u003Cpicture>\n  \u003Csource\n    media=\"(prefers-color-scheme: grey)\"\n    srcset=\"\n      https:\u002F\u002Fapi.star-history.com\u002Fsvg?repos=EthicalML\u002Fawesome-production-machine-learning&type=Date&theme=dark\n    \"\n  \u002F>\n  \u003Csource\n    media=\"(prefers-color-scheme: light)\"\n    srcset=\"\n      https:\u002F\u002Fapi.star-history.com\u002Fsvg?repos=EthicalML\u002Fawesome-production-machine-learning&type=Date\n    \"\n  \u002F>\n  \u003Cimg\n    alt=\"Star History Chart\"\n    src=\"https:\u002F\u002Fapi.star-history.com\u002Fsvg?repos=EthicalML\u002Fawesome-production-machine-learning&type=Date\"\n  \u002F>\n\u003C\u002Fpicture>\n\n## 10 Min Video Overview\n\n\u003Ctable>\n  \u003Ctr>\n    \u003Ctd width=\"30%\">\n        This \u003Ca href=\"https:\u002F\u002Fwww.youtube.com\u002Fwatch?v=Ynb6X0KZKxY\">10 minute video\u003C\u002Fa> provides an overview of the motivations for machine learning operations as well as a high level overview on some of the tools in this repo. This \u003Ca href=\"https:\u002F\u002Fwww.youtube.com\u002Fwatch?v=NycftytgPnk\">newer video\u003C\u002Fa> covers the an updated 2024 version of the state of MLOps.\n    \u003C\u002Ftd>\n    \u003Ctd width=\"70%\">\n        \u003Ca href=\"https:\u002F\u002Fwww.youtube.com\u002Fwatch?v=Ynb6X0KZKxY\">\u003Cimg src=\"images\u002Fvideo.png\">\u003C\u002Fa>\n    \u003C\u002Ftd>\n  \u003C\u002Ftr>\n\u003C\u002Ftable>\n\n## Want to receive recurrent updates on this repo and other advancements?\n\n\u003Ctable>\n  \u003Ctr>\n    \u003Ctd width=\"30%\">\n         You can join the \u003Ca href=\"https:\u002F\u002Fethical.institute\u002Fmle.html\">Machine Learning Engineer\u003C\u002Fa> newsletter. Join over 70,000 ML professionals and enthusiasts who receive weekly curated articles & tutorials on production Machine Learning.\n    \u003C\u002Ftd>\n    \u003Ctd width=\"70%\">\n        \u003Ca href=\"https:\u002F\u002Fethical.institute\u002Fmle.html\">\u003Cimg src=\"images\u002Fmleng.png\">\u003C\u002Fa>\n    \u003C\u002Ftd>\n  \u003C\u002Ftr>\n  \u003Ctr>\n    \u003Ctd width=\"30%\">\n         Also check out the \u003Ca href=\"https:\u002F\u002Fgithub.com\u002FEthicalML\u002Fawesome-production-genai\u002F\">Awesome Production GenAI\u003C\u002Fa> List, where we aim to map a curated list of awesome open source libraries to deploy, monitor, version and scale your generative artificial intelligence applications and systems.\n    \u003C\u002Ftd>\n    \u003Ctd width=\"70%\">\n        \u003Ca href=\"https:\u002F\u002Fgithub.com\u002FEthicalML\u002Fawesome-production-genai\u002F\">\u003Cimg src=\"images\u002Flist.jpg\">\u003C\u002Fa>\n    \u003C\u002Ftd>\n  \u003C\u002Ftr>\n\u003C\u002Ftable>\n\n# Main Content\n\n## AutoML\n* [AIDE](https:\u002F\u002Fgithub.com\u002FWecoAI\u002Faideml) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FWecoAI\u002Faideml.svg?cacheSeconds=86400) - AIDE is an open-source ML engineering agent that uses a tree search algorithm to autonomously explore, implement, and evaluate solution strategies for machine learning tasks.\n* [AutoGluon](https:\u002F\u002Fgithub.com\u002Fautogluon\u002Fautogluon) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fautogluon\u002Fautogluon.svg?cacheSeconds=86400) - Automated feature, model, and hyperparameter selection for tabular, image, and text data on top of popular machine learning libraries (Scikit-Learn, LightGBM, CatBoost, PyTorch, MXNet).\n* [Autokeras](https:\u002F\u002Fgithub.com\u002Fkeras-team\u002Fautokeras) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fkeras-team\u002Fautokeras.svg?cacheSeconds=86400) - AutoML library for Keras based on [\"Auto-Keras: Efficient Neural Architecture Search with Network Morphism\"](https:\u002F\u002Farxiv.org\u002Fabs\u002F1806.10282).\n* [auto-sklearn](https:\u002F\u002Fgithub.com\u002Fautoml\u002Fauto-sklearn) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fautoml\u002Fauto-sklearn.svg?cacheSeconds=86400) - Framework to automate algorithm and hyperparameter tuning for sklearn.\n* [Ax](https:\u002F\u002Fgithub.com\u002Ffacebook\u002FAx) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Ffacebook\u002FAx.svg?cacheSeconds=86400) - Ax is an accessible, general-purpose platform for understanding, managing, deploying, and automating adaptive experiments.\n* [BoTorch](https:\u002F\u002Fgithub.com\u002Fmeta-pytorch\u002Fbotorch) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fmeta-pytorch\u002Fbotorch.svg?cacheSeconds=86400) - BoTorch is a library for Bayesian Optimization built on PyTorch.\n* [EvalML](https:\u002F\u002Fgithub.com\u002Falteryx\u002Fevalml) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Falteryx\u002Fevalml.svg?cacheSeconds=86400) - EvalML is an AutoML library which builds, optimizes, and evaluates machine learning pipelines using domain-specific objective functions.\n* [Feature Engine](https:\u002F\u002Fgithub.com\u002Ffeature-engine\u002Ffeature_engine) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Ffeature-engine\u002Ffeature_engine.svg?cacheSeconds=86400) - Feature-engine is a Python library that contains several transformers to engineer features for use in machine learning models.\n* [Featuretools](https:\u002F\u002Fgithub.com\u002Falteryx\u002Ffeaturetools) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Falteryx\u002Ffeaturetools.svg?cacheSeconds=86400) - An open source framework for automated feature engineering.\n* [FLAML](https:\u002F\u002Fgithub.com\u002Fmicrosoft\u002FFLAML) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fmicrosoft\u002FFLAML.svg?cacheSeconds=86400) - FLAML is a fast library for automated machine learning & tuning.\n* [HEBO](https:\u002F\u002Fgithub.com\u002Fhuawei-noah\u002FHEBO) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fhuawei-noah\u002FHEBO.svg?cacheSeconds=86400) - Set of open-source hyperparameter optimization frameworks, including the winning submission to the [NeurIPS 2020 Black-Box Optimisation Challenge](https:\u002F\u002Fbbochallenge.com\u002Fleaderboard) tested on hyperparameter tuning tasks. \n* [Katib](https:\u002F\u002Fgithub.com\u002Fkubeflow\u002Fkatib) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fkubeflow\u002Fkatib.svg?cacheSeconds=86400) - A Kubernetes-based system for Hyperparameter Tuning and Neural Architecture Search.\n* [keras-tuner](https:\u002F\u002Fgithub.com\u002Fkeras-team\u002Fkeras-tuner) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fkeras-team\u002Fkeras-tuner.svg?cacheSeconds=86400) - Keras Tuner is an easy-to-use, distributable hyperparameter optimisation framework that solves the pain points of performing a hyperparameter search. Keras Tuner makes it easy to define a search space and leverage included algorithms to find the best hyperparameter values.\n* [Optuna](https:\u002F\u002Fgithub.com\u002Foptuna\u002Foptuna) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Foptuna\u002Foptuna.svg?cacheSeconds=86400) - Optuna is an automatic hyperparameter optimisation software framework, particularly designed for machine learning.\n* [OSS Vizier](https:\u002F\u002Fgithub.com\u002Fgoogle\u002Fvizier) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fgoogle\u002Fvizier.svg?cacheSeconds=86400) - OSS Vizier is a Python-based service for black-box optimisation and research, one of the first hyperparameter tuning services designed to work at scale.\n* [Perpetual](https:\u002F\u002Fgithub.com\u002Fperpetual-ml\u002Fperpetual) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fperpetual-ml\u002Fperpetual.svg?cacheSeconds=86400) - A gradient boosting machine that doesn't need hyperparameter optimization, with a simple budget parameter to control model complexity.\n* [TPOT](https:\u002F\u002Fgithub.com\u002Fepistasislab\u002Ftpot) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fepistasislab\u002Ftpot.svg?cacheSeconds=86400) - Automation of sklearn pipeline creation (including feature selection, pre-processor, etc.).\n* [tsfresh](https:\u002F\u002Fgithub.com\u002Fblue-yonder\u002Ftsfresh) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fblue-yonder\u002Ftsfresh.svg?cacheSeconds=86400) - Automatic extraction of relevant features from time series.\n\n## Computation and Communication Optimisation\n\n* [Accelerate](https:\u002F\u002Fgithub.com\u002Fhuggingface\u002Faccelerate) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fhuggingface\u002Faccelerate.svg?cacheSeconds=86400) - Accelerate abstracts exactly and only the boilerplate code related to multi-GPU\u002FTPU\u002Fmixed-precision and leaves the rest of your code unchanged.\n* [Adapters](https:\u002F\u002Fgithub.com\u002Fadapter-hub\u002Fadapters) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fadapter-hub\u002Fadapters.svg?cacheSeconds=86400) - Adapters is a unified library for parameter-efficient and modular transfer learning.\n* [BitBLAS](https:\u002F\u002Fgithub.com\u002Fmicrosoft\u002FBitBLAS) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fmicrosoft\u002FBitBLAS.svg?cacheSeconds=86400) - BitBLAS is a library to support mixed-precision BLAS operations on GPUs\n* [Cache-DiT](https:\u002F\u002Fgithub.com\u002Fvipshop\u002Fcache-dit) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fvipshop\u002Fcache-dit.svg?cacheSeconds=86400) - Cache-DiT is built on top of Diffusers and supports nearly all DiTs, providing hybrid cache acceleration (DBCache, TaylorSeer, SCM, etc.) and comprehensive parallelism optimizations including Context Parallelism, Tensor Parallelism, and hybrid 2D\u002F3D parallelism, with compatibility for compilation, CPU offloading, and quantization.\n* [Colossal-AI](https:\u002F\u002Fgithub.com\u002Fhpcaitech\u002FColossalAI) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fhpcaitech\u002FColossalAI.svg?cacheSeconds=86400) - A unified deep learning system for big model era, which helps users to efficiently and quickly deploy large AI model training and inference.\n* [Composer](https:\u002F\u002Fgithub.com\u002Fmosaicml\u002Fcomposer) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fmosaicml\u002Fcomposer.svg?cacheSeconds=86400) - Composer is a PyTorch library that enables you to train neural networks faster, at lower cost, and to higher accuracy.\n* [CuDF](https:\u002F\u002Fgithub.com\u002Frapidsai\u002Fcudf) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Frapidsai\u002Fcudf.svg?cacheSeconds=86400) - Built based on the Apache Arrow columnar memory format, cuDF is a GPU DataFrame library for loading, joining, aggregating, filtering, and otherwise manipulating data.\n* [CuML](https:\u002F\u002Fgithub.com\u002Frapidsai\u002Fcuml) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Frapidsai\u002Fcuml.svg?cacheSeconds=86400) - cuML is a suite of libraries that implement machine learning algorithms and mathematical primitives functions that share compatible APIs with other RAPIDS projects.\n* [CuPy](https:\u002F\u002Fgithub.com\u002Fcupy\u002Fcupy) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fcupy\u002Fcupy.svg?cacheSeconds=86400) - An implementation of NumPy-compatible multi-dimensional array on CUDA. CuPy consists of the core multi-dimensional array class, cupy.ndarray, and many functions on it.\n* [DEAP](https:\u002F\u002Fgithub.com\u002FDEAP\u002Fdeap) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FDEAP\u002Fdeap.svg?cacheSeconds=86400) - A novel evolutionary computation framework for rapid prototyping and testing of ideas. It seeks to make algorithms explicit and data structures transparent. It works in perfect harmony with parallelisation mechanisms such as multiprocessing and SCOOP.\n* [DeepEP](https:\u002F\u002Fgithub.com\u002Fdeepseek-ai\u002FDeepEP) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fdeepseek-ai\u002FDeepEP.svg?cacheSeconds=86400) - DeepEP is a communication library tailored for Mixture-of-Experts (MoE) and expert parallelism (EP). It provides high-throughput and low-latency all-to-all GPU kernels, which are also known as MoE dispatch and combine. The library also supports low-precision operations, including FP8.\n* [DGL](https:\u002F\u002Fgithub.com\u002Fdmlc\u002Fdgl) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fdmlc\u002Fdgl.svg?cacheSeconds=86400) - DGL is an easy-to-use, high performance and scalable Python package for deep learning on graphs.\n* [DLRover](https:\u002F\u002Fgithub.com\u002Fintelligent-machine-learning\u002Fdlrover) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fintelligent-machine-learning\u002Fdlrover.svg?cacheSeconds=86400) - DLRover makes the distributed training of large AI models easy, stable, fast and green.\n* [Dask](https:\u002F\u002Fgithub.com\u002Fdask\u002Fdask) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fdask\u002Fdask.svg?cacheSeconds=86400) - Distributed parallel processing framework for Pandas and NumPy computations.\n* [DeepSpeed](https:\u002F\u002Fgithub.com\u002Fdeepspeedai\u002FDeepSpeed) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fdeepspeedai\u002FDeepSpeed.svg?cacheSeconds=86400) - DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.\n* [FlagGems](https:\u002F\u002Fgithub.com\u002FFlagOpen\u002FFlagGems) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FFlagOpen\u002FFlagGems.svg?cacheSeconds=86400) - FlagGems is a high-performance general operator library implemented in OpenAI Triton. It builds on a collection of backend neutral kernels that aims to accelerate LLM training and inference across diverse hardware platforms.\n* [Flashlight](https:\u002F\u002Fgithub.com\u002Fflashlight\u002Fflashlight) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fflashlight\u002Fflashlight.svg?cacheSeconds=86400) - A fast, flexible machine learning library written entirely in C++ from the Facebook AI Research and the creators of Torch, TensorFlow, Eigen and Deep Speech.\n* [Flax](https:\u002F\u002Fgithub.com\u002Fgoogle\u002Fflax) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fgoogle\u002Fflax.svg?cacheSeconds=86400) - A neural network library and ecosystem for JAX designed for flexibility.\n* [GPUStack](https:\u002F\u002Fgithub.com\u002Fgpustack\u002Fgpustack) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fgpustack\u002Fgpustack.svg?cacheSeconds=86400) - GPUStack is an open-source GPU cluster manager for running AI models.\n* [Hivemind](https:\u002F\u002Fgithub.com\u002Flearning-at-home\u002Fhivemind) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Flearning-at-home\u002Fhivemind.svg?cacheSeconds=86400) - Decentralized deep learning in PyTorch.\n* [Horovod](https:\u002F\u002Fgithub.com\u002Fhorovod\u002Fhorovod) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fhorovod\u002Fhorovod.svg?cacheSeconds=86400) - Uber's distributed training framework for TensorFlow, Keras, and PyTorch.\n* [Jax](https:\u002F\u002Fgithub.com\u002Fjax-ml\u002Fjax) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fjax-ml\u002Fjax.svg?cacheSeconds=86400) - Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU\u002FTPU, and more.\n* [Kompute](https:\u002F\u002Fgithub.com\u002Flava-nc\u002Flava) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Flava-nc\u002Flava.svg?cacheSeconds=86400) - Blazing fast, lightweight and mobile phone-enabled Vulkan compute framework optimized for advanced GPU data processing usecases.\n* [Lava](https:\u002F\u002Fgithub.com\u002FKomputeProject\u002Fkompute) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FKomputeProject\u002Fkompute.svg?cacheSeconds=86400) - Lava is an open source framework to develop applications for neuromorphic hardware architectures.\n* [Liger Kernel](https:\u002F\u002Fgithub.com\u002Flinkedin\u002FLiger-Kernel) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Flinkedin\u002FLiger-Kernel.svg?cacheSeconds=86400) - Liger Kernel is a collection of Triton kernels designed specifically for LLM training.\n* [LightGBM](https:\u002F\u002Fgithub.com\u002Fmicrosoft\u002FLightGBM) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fmicrosoft\u002FLightGBM.svg?cacheSeconds=86400) - LightGBM is a gradient boosting framework that uses tree based learning algorithms.\n* [MLX](https:\u002F\u002Fgithub.com\u002Fml-explore\u002Fmlx) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fml-explore\u002Fmlx.svg?cacheSeconds=86400) - MLX is an array framework for machine learning on Apple silicon.\n* [Modin](https:\u002F\u002Fgithub.com\u002Fmodin-project\u002Fmodin) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fmodin-project\u002Fmodin.svg?cacheSeconds=86400) - Speed up your Pandas workflows by changing a single line of code.\n* [NVIDIA TensorRT](https:\u002F\u002Fgithub.com\u002FNVIDIA\u002FTensorRT) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FNVIDIA\u002FTensorRT.svg?cacheSeconds=86400) - TensorRT is a C++ library for high-performance inference on NVIDIA GPUs and deep learning accelerators.\n* [Nevergrad](https:\u002F\u002Fgithub.com\u002Ffacebookresearch\u002Fnevergrad) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Ffacebookresearch\u002Fnevergrad.svg?cacheSeconds=86400) - Nevergrad is a gradient-free optimisation platform.\n* [Norse](https:\u002F\u002Fgithub.com\u002Fnorse\u002Fnorse) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fnorse\u002Fnorse.svg?cacheSeconds=86400) - Norse aims to exploit the advantages of bio-inspired neural components, which are sparse and event-driven - a fundamental difference from artificial neural networks.\n* [Numba](https:\u002F\u002Fgithub.com\u002Fnumba\u002Fnumba) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fnumba\u002Fnumba.svg?cacheSeconds=86400)  - A compiler for Python array and numerical functions.\n* [Optimum](https:\u002F\u002Fgithub.com\u002Fhuggingface\u002Foptimum) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fhuggingface\u002Foptimum.svg?cacheSeconds=86400) - Optimum is an extension of Transformers and Diffusers, providing a set of optimization tools enabling maximum efficiency to train and run models on targeted hardware while keeping things easy to use.\n* [PEFT](https:\u002F\u002Fgithub.com\u002Fhuggingface\u002Fpeft) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fhuggingface\u002Fpeft.svg?cacheSeconds=86400) - Parameter-Efficient Fine-Tuning (PEFT) methods enable efficient adaptation of pre-trained language models (PLMs) to various downstream applications without fine-tuning all the model's parameters.\n* [PaddlePaddle](https:\u002F\u002Fgithub.com\u002FPaddlePaddle\u002FPaddle) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FPaddlePaddle\u002FPaddle.svg?cacheSeconds=86400) - PaddlePaddle is a framework to perform large-scale deep network training, using data sources distributed across hundreds of nodes. \n* [PyG](https:\u002F\u002Fgithub.com\u002Fpyg-team\u002Fpytorch_geometric) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fpyg-team\u002Fpytorch_geometric.svg?cacheSeconds=86400) - PyG (PyTorch Geometric) is a library built upon PyTorch to easily write and train Graph Neural Networks (GNNs) for a wide range of applications related to structured data.\n* [PyTorch Lightning](https:\u002F\u002Fgithub.com\u002FLightning-AI\u002Fpytorch-lightning) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FLightning-AI\u002Fpytorch-lightning.svg?cacheSeconds=86400) - PyTorch Lightning pretrains, finetunes and deploys AI models on multiple GPUs, TPUs with zero code changes.\n* [PyTorch](https:\u002F\u002Fgithub.com\u002Fpytorch\u002Fpytorch) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fpytorch\u002Fpytorch.svg?cacheSeconds=86400) - PyTorch is a library to develop and train neural network based deep learning models.\n* [Ray](https:\u002F\u002Fgithub.com\u002Fray-project\u002Fray) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fray-project\u002Fray.svg?cacheSeconds=86400) - Ray is a flexible, high-performance distributed execution framework for machine learning.\n* [SetFit](https:\u002F\u002Fgithub.com\u002Fhuggingface\u002Fsetfit) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fhuggingface\u002Fsetfit.svg?cacheSeconds=86400) - SetFit is an efficient and prompt-free framework for few-shot fine-tuning of Sentence Transformers.\n* [Sonnet](https:\u002F\u002Fgithub.com\u002Fgoogle-deepmind\u002Fsonnet) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fgoogle-deepmind\u002Fsonnet.svg?cacheSeconds=86400) - Sonnet is a library built on top of TensorFlow 2 designed to provide simple, composable abstractions for machine learning research.\n* [Streaming](https:\u002F\u002Fgithub.com\u002Fmosaicml\u002Fstreaming) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fmosaicml\u002Fstreaming.svg?cacheSeconds=86400) - A Data Streaming Library for Efficient Neural Network Training.\n* [TensorFlow](https:\u002F\u002Fgithub.com\u002Ftensorflow\u002Ftensorflow) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Ftensorflow\u002Ftensorflow.svg?cacheSeconds=86400) - TensorFlow is a leading library designed for developing and deploying state-of-the-art  machine learning applications.\n* [ThunderKittens](https:\u002F\u002Fgithub.com\u002FHazyResearch\u002FThunderKittens) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FHazyResearch\u002FThunderKittens.svg?cacheSeconds=86400) ThunderKittens is a framework to make it easy to write fast deep learning kernels in CUDA.\n* [TorchOpt](https:\u002F\u002Fgithub.com\u002Fmetaopt\u002Ftorchopt) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fmetaopt\u002Ftorchopt.svg?cacheSeconds=86400) - TorchOpt is an efficient library for differentiable optimization built upon PyTorch.\n* [Triton](https:\u002F\u002Fgithub.com\u002Ftriton-lang\u002Ftriton) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Ftriton-lang\u002Ftriton.svg?cacheSeconds=86400) - Triton is a language and compiler for writing highly efficient custom Deep-Learning primitives. The aim of Triton is to provide an open-source environment to write fast code at higher productivity than CUDA, but also with higher flexibility than other existing DSLs.\n* [Vaex](https:\u002F\u002Fgithub.com\u002Fvaexio\u002Fvaex) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fvaexio\u002Fvaex.svg?cacheSeconds=86400) Vaex is a high performance Python library for lazy Out-of-Core DataFrames (similar to Pandas), to visualize and explore big tabular datasets. Vaex uses memory mapping, zero memory copy policy and lazy computations for best performance (no memory wasted).\n* [Vowpal Wabbit](https:\u002F\u002Fgithub.com\u002FVowpalWabbit\u002Fvowpal_wabbit) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FVowpalWabbit\u002Fvowpal_wabbit.svg?cacheSeconds=86400) Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.\n* [XGBoost](https:\u002F\u002Fgithub.com\u002Fdmlc\u002Fxgboost) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fdmlc\u002Fxgboost.svg?cacheSeconds=86400) - XGBoost is an optimized distributed gradient boosting library designed to be highly efficient, flexible and portable.\n* [YDF](https:\u002F\u002Fgithub.com\u002Fgoogle\u002Fyggdrasil-decision-forests) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fgoogle\u002Fyggdrasil-decision-forests.svg?cacheSeconds=86400) - YDF (Yggdrasil Decision Forests) is a library to train, evaluate, interpret, and serve Random Forest, Gradient Boosted Decision Trees, CART and Isolation forest models.\n* [bitsandbytes](https:\u002F\u002Fgithub.com\u002Fbitsandbytes-foundation\u002Fbitsandbytes) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fbitsandbytes-foundation\u002Fbitsandbytes.svg?cacheSeconds=86400) - Bitsandbytes library is a lightweight Python wrapper around CUDA custom functions, in particular 8-bit optimizers, matrix multiplication (LLM.int8()), and 8 & 4-bit quantization functions.\n* [einops](https:\u002F\u002Fgithub.com\u002Farogozhnikov\u002Feinops) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Farogozhnikov\u002Feinops.svg?cacheSeconds=86400) - Flexible and powerful tensor operations for readable and reliable code.\n* [scikit-learn](https:\u002F\u002Fgithub.com\u002Fscikit-learn\u002Fscikit-learn) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fscikit-learn\u002Fscikit-learn.svg?cacheSeconds=86400) - Scikit-learn is a powerful machine learning library that provides a wide variety of modules for data access, data preparation and statistical model building. \n* [snnTorch](https:\u002F\u002Fgithub.com\u002Fjeshraghian\u002Fsnntorch) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fjeshraghian\u002Fsnntorch.svg?cacheSeconds=86400) - snnTorch is a deep and online learning library with spiking neural networks.\n* [torchdistill](https:\u002F\u002Fgithub.com\u002Fyoshitomo-matsubara\u002Ftorchdistill) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fyoshitomo-matsubara\u002Ftorchdistill.svg?cacheSeconds=86400) - torchdistill offers various state-of-the-art knowledge distillation methods and enables you to design (new) experiments simply by editing a declarative yaml config file instead of Python code.\n* [torchkeras](https:\u002F\u002Fgithub.com\u002Flyhue1991\u002Ftorchkeras?tab=readme-ov-file) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Flyhue1991\u002Ftorchkeras?tab=readme-ov-file.svg?cacheSeconds=86400) The torchkeras library is a simple tool for training neural network in pytorch jusk in a keras style.\n* [veScale](https:\u002F\u002Fgithub.com\u002Fvolcengine\u002FveScale) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fvolcengine\u002FveScale.svg?cacheSeconds=86400) - veScale is a PyTorch native LLM training framework.\n* [yellowbrick](https:\u002F\u002Fgithub.com\u002FDistrictDataLabs\u002Fyellowbrick) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FDistrictDataLabs\u002Fyellowbrick.svg?cacheSeconds=86400) - yellowbrick is a matplotlib-based model evaluation plots for scikit-learn and other machine learning libraries.\n\n## Data Annotation and Synthesis\n* [Argilla](https:\u002F\u002Fgithub.com\u002Fargilla-io\u002Fargilla) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fargilla-io\u002Fargilla.svg?cacheSeconds=86400) - Argilla helps domain experts and data teams to build better NLP datasets in less time.\n* [cleanlab](https:\u002F\u002Fgithub.com\u002Fcleanlab\u002Fcleanlab) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fcleanlab\u002Fcleanlab.svg?cacheSeconds=86400) - Python library for data-centric AI. Can automatically: find mislabeled data, detect outliers, estimate consensus + annotator-quality for multi-annotator datasets, suggest which data is best to (re)label next.\n* [COCO Annotator](https:\u002F\u002Fgithub.com\u002Fjsbroks\u002Fcoco-annotator) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fjsbroks\u002Fcoco-annotator.svg?cacheSeconds=86400) - Web-based image segmentation tool for object detection, localization and keypoints\n* [CVAT](https:\u002F\u002Fgithub.com\u002Fcvat-ai\u002Fcvat) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fcvat-ai\u002Fcvat.svg?cacheSeconds=86400) - CVAT (Computer Vision Annotation Tool) is OpenCV's web-based annotation tool for both videos and images for computer algorithms.\n* [Doccano](https:\u002F\u002Fgithub.com\u002Fdoccano\u002Fdoccano) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fdoccano\u002Fdoccano.svg?cacheSeconds=86400) - Open source text annotation tools for humans, providing functionality for sentiment analysis, named entity recognition, and machine translation.\n* [Gretel Synthetics](https:\u002F\u002Fgithub.com\u002Fgretelai\u002Fgretel-synthetics) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fgretelai\u002Fgretel-synthetics.svg?cacheSeconds=86400) - Gretel Synthetics is a synthetic data generators for structured and unstructured text, featuring differentially private learning.\n* [Label Studio](https:\u002F\u002Fgithub.com\u002FHumanSignal\u002Flabel-studio) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FHumanSignal\u002Flabel-studio.svg?cacheSeconds=86400) - Multi-domain data labeling and annotation tool with standardized output format.\n* [LightlyStudio](https:\u002F\u002Fgithub.com\u002Flightly-ai\u002Flightly-studio) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Flightly-ai\u002Flightly-studio.svg?cacheSeconds=86400) - An open source tool to curate, annotate, and manage vision datasets (images and videos). Supports embedding-based auto-selection, annotation, and auto-labeling for bounding boxes and segmentation.\n* [NeMo Curator](https:\u002F\u002Fgithub.com\u002FNVIDIA\u002FNeMo-Curator) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FNVIDIA\u002FNeMo-Curator.svg?cacheSeconds=86400) - NeMo Curator is a GPU-accelerated framework for efficient large language model data curation.\n* [refinery](https:\u002F\u002Fgithub.com\u002Fcode-kern-ai\u002Frefinery) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fcode-kern-ai\u002Frefinery.svg?cacheSeconds=86400) - The data scientist's open-source choice to scale, assess and maintain natural language data.\n* [SDV](https:\u002F\u002Fgithub.com\u002Fsdv-dev\u002FSDV) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fsdv-dev\u002FSDV.svg?cacheSeconds=86400) - Synthetic Data Vault (SDV) is a Synthetic Data Generation ecosystem of libraries that allows users to easily learn single-table, multi-table and timeseries datasets to later on generate new Synthetic Data that has the same format and statistical properties as the original dataset.\n* [Semantic Segmentation Editor](https:\u002F\u002Fgithub.com\u002FHitachi-Automotive-And-Industry-Lab\u002Fsemantic-segmentation-editor) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FHitachi-Automotive-And-Industry-Lab\u002Fsemantic-segmentation-editor.svg?cacheSeconds=86400) - Hitachi's Open source tool for labelling camera and LIDAR data.\n* [synthcity](https:\u002F\u002Fgithub.com\u002Fvanderschaarlab\u002Fsynthcity) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fvanderschaarlab\u002Fsynthcity.svg?cacheSeconds=86400) - synthcity is a library for generating and evaluating synthetic tabular data.\n* [TabGAN](https:\u002F\u002Fgithub.com\u002FDiyago\u002FTabular-data-generation) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FDiyago\u002FTabular-data-generation.svg?cacheSeconds=86400) - Synthetic tabular data generation using GANs (CTGAN), Diffusion Models, and LLMs with adversarial filtering, privacy metrics, and sklearn integration.\n* [ViPE](https:\u002F\u002Fgithub.com\u002Fnv-tlabs\u002Fvipe) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fnv-tlabs\u002Fvipe.svg?cacheSeconds=86400) - ViPE is a spatial AI tool for annotating camera poses and dense depth maps from raw videos.\n* [YData Synthetic](https:\u002F\u002Fgithub.com\u002Fydataai\u002Fydata-synthetic) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fydataai\u002Fydata-synthetic.svg?cacheSeconds=86400) - YData Synthetic is a package to generate synthetic tabular and time-series data leveraging the state of the art generative models.\n\n## Data Pipeline\n* [Apache Airflow](https:\u002F\u002Fgithub.com\u002Fapache\u002Fairflow) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fapache\u002Fairflow.svg?cacheSeconds=86400) - Data Pipeline framework built in Python, including scheduler, DAG definition and a UI for visualisation.\n* [Apache Nifi](https:\u002F\u002Fgithub.com\u002Fapache\u002Fnifi) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fapache\u002Fnifi.svg?cacheSeconds=86400) - Apache NiFi was made for dataflow. It supports highly configurable directed graphs of data routing, transformation, and system mediation logic.\n* [Apache Oozie](https:\u002F\u002Fgithub.com\u002Fapache\u002Foozie) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fapache\u002Foozie.svg?cacheSeconds=86400) - Workflow scheduler for Hadoop jobs.\n* [Argo Workflows](https:\u002F\u002Fgithub.com\u002Fargoproj\u002Fargo-workflows) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fargoproj\u002Fargo-workflows.svg?cacheSeconds=86400) - Argo Workflows is an open source container-native workflow engine for orchestrating parallel jobs on Kubernetes. Argo Workflows is implemented as a Kubernetes CRD (Custom Resource Definition).\n* [Couler](https:\u002F\u002Fgithub.com\u002Fcouler-proj\u002Fcouler) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fcouler-proj\u002Fcouler.svg?cacheSeconds=86400) - Unified interface for constructing and managing machine learning workflows on different workflow engines, such as Argo Workflows, Tekton Pipelines, and Apache Airflow.\n* [DataTrove](https:\u002F\u002Fgithub.com\u002Fhuggingface\u002Fdatatrove) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fhuggingface\u002Fdatatrove.svg?cacheSeconds=86400) - DataTrove is a library to process, filter and deduplicate text data at a very large scale.\n* [Dagster](https:\u002F\u002Fgithub.com\u002Fdagster-io\u002Fdagster) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fdagster-io\u002Fdagster.svg?cacheSeconds=86400) - A data orchestrator for machine learning, analytics, and ETL.\n* [DBT](https:\u002F\u002Fgithub.com\u002Fdbt-labs\u002Fdbt-core) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fdbt-labs\u002Fdbt-core.svg?cacheSeconds=86400) - ETL tool for running transformations inside data warehouses.\n* [Flyte](https:\u002F\u002Fgithub.com\u002Fflyteorg\u002Fflyte) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fflyteorg\u002Fflyte.svg?cacheSeconds=86400) - Lyft's Cloud Native Machine Learning and Data Processing Platform - [(Demo)](https:\u002F\u002Fyoutu.be\u002FKdUJGSP1h9U?t=1451).\n* [Genie](https:\u002F\u002Fgithub.com\u002FNetflix\u002Fgenie) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FNetflix\u002Fgenie.svg?cacheSeconds=86400) - Job orchestration engine to interface and trigger the execution of jobs from Hadoop-based systems.\n* [Hamilton](https:\u002F\u002Fgithub.com\u002Fdagworks-inc\u002Fhamilton) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fdagworks-inc\u002Fhamilton.svg?cacheSeconds=86400) - Hamilton is a micro-orchestration framework for defining dataflows. Runs anywhere python runs (e.g. jupyter, fastAPI, spark, ray, dask). Brings software engineering best practices without you knowing it. Use it to define feature engineering transforms, end-to-end model pipelines, and LLM workflows. It complements macro-orchestration systems (e.g. kedro, luigi, airflow, dbt, etc.) as it replaces the code within those macro tasks. Comes with a self-hostable UI that captures lineage & provenance, execution telemetry & data summaries, and builds a self-populating catalog; usable in development as well as production.\n* [Instill VDP](https:\u002F\u002Fgithub.com\u002Finstill-ai\u002Finstill-core) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Finstill-ai\u002Finstill-core.svg?cacheSeconds=86400) - Instill VDP (Versatile Data Pipeline) aims to streamline the data processing pipelines from inception to completion.\n* [Instructor](https:\u002F\u002Fgithub.com\u002Finstructor-ai\u002Finstructor) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Finstructor-ai\u002Finstructor.svg?cacheSeconds=86400) - Instructor makes it easy to get structured data like JSON from LLMs like GPT-3.5, GPT-4, GPT-4-Vision, and open-source models.\n* [Kedro](https:\u002F\u002Fgithub.com\u002Fkedro-org\u002Fkedro) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fkedro-org\u002Fkedro.svg?cacheSeconds=86400) - Kedro is a workflow development tool that helps you build data pipelines that are robust, scalable, deployable, reproducible and versioned.\n* [Luigi](https:\u002F\u002Fgithub.com\u002Fspotify\u002Fluigi) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fspotify\u002Fluigi.svg?cacheSeconds=86400) - Luigi is a Python module that helps you build complex pipelines of batch jobs, handling dependency resolution, workflow management, visualisation, etc..\n* [Metaflow](https:\u002F\u002Fgithub.com\u002FNetflix\u002Fmetaflow) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FNetflix\u002Fmetaflow.svg?cacheSeconds=86400) - A framework for data scientists to easily build and manage real-life data science projects.\n* [Pachyderm](https:\u002F\u002Fgithub.com\u002Fpachyderm\u002Fpachyderm) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fpachyderm\u002Fpachyderm.svg?cacheSeconds=86400) - Open source distributed processing framework build on Kubernetes focused mainly on dynamic building of production machine learning pipelines - [(Video)](https:\u002F\u002Fwww.youtube.com\u002Fwatch?v=LamKVhe2RSM).\n* [Ploomber](https:\u002F\u002Fgithub.com\u002Fploomber\u002Fploomber) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fploomber\u002Fploomber.svg?cacheSeconds=86400) - The fastest way to build data pipelines. Develop iteratively, deploy anywhere.\n* [Pixeltable](https:\u002F\u002Fgithub.com\u002Fpixeltable\u002Fpixeltable) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fpixeltable\u002Fpixeltable.svg?cacheSeconds=86400) – Open-source Python library providing declarative, incremental data infrastructure for building and managing multimodal AI workloads.\n* [Prefect Core](https:\u002F\u002Fgithub.com\u002FPrefectHQ\u002Fprefect) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FPrefectHQ\u002Fprefect.svg?cacheSeconds=86400) - Workflow management system that makes it easy to take your data pipelines and add semantics like retries, logging, dynamic mapping, caching, failure notifications, and more.\n* [SeqIO](https:\u002F\u002Fgithub.com\u002Fgoogle\u002Fseqio) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fgoogle\u002Fseqio.svg?cacheSeconds=86400) - SeqIO is a library for processing sequential data to be fed into downstream sequence models.\n* [Snakemake](https:\u002F\u002Fgithub.com\u002Fsnakemake\u002Fsnakemake) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fsnakemake\u002Fsnakemake.svg?cacheSeconds=86400) - Workflow management system for reproducible and scalable data analyses.\n* [Towhee](https:\u002F\u002Fgithub.com\u002Ftowhee-io\u002Ftowhee) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Ftowhee-io\u002Ftowhee.svg?cacheSeconds=86400) - General-purpose machine learning pipeline for generating embedding vectors using one or many ML models.\n* [unstructured](https:\u002F\u002Fgithub.com\u002FUnstructured-IO\u002Funstructured) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FUnstructured-IO\u002Funstructured.svg?cacheSeconds=86400) - unstructured streamlines and optimizes the data processing workflow for LLMs, ingesting and pre-processing images and text documents, such as PDFs, HTML, Word docs, and many more. \n* [ZenML](https:\u002F\u002Fgithub.com\u002Fzenml-io\u002Fzenml) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fzenml-io\u002Fzenml.svg?cacheSeconds=86400) - ZenML is an extensible, open-source MLOps framework to create reproducible ML pipelines with a focus on automated metadata tracking, caching, and many integrations to other tools.\n\n## Data Science Notebook\n* [Apache Zeppelin](https:\u002F\u002Fgithub.com\u002Fapache\u002Fzeppelin) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fapache\u002Fzeppelin.svg?cacheSeconds=86400) - Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.\n* [Deepnote](https:\u002F\u002Fgithub.com\u002Fdeepnote\u002Fdeepnote) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fdeepnote\u002Fdeepnote.svg?cacheSeconds=86400) - Deepnote is a drop-in replacement for Jupyter with an AI-first design, sleek UI, new blocks, and native data integrations. Use Python, R, and SQL locally in your favorite IDE, then scale to Deepnote cloud for real-time collaboration, Deepnote agent, and deployable data apps.\n* [Jupyter Notebooks](https:\u002F\u002Fgithub.com\u002Fjupyter\u002Fnotebook) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fjupyter\u002Fnotebook.svg?cacheSeconds=86400) - Web interface python sandbox environments for reproducible development\n* [Marimo](https:\u002F\u002Fgithub.com\u002Fmarimo-team\u002Fmarimo) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fmarimo-team\u002Fmarimo.svg?cacheSeconds=86400) - Reactive Python notebook — run reproducible experiments, execute as a script, deploy as an app, and version with git.\n* [Papermill](https:\u002F\u002Fgithub.com\u002Fnteract\u002Fpapermill) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fnteract\u002Fpapermill.svg?cacheSeconds=86400) - Papermill is a library for parameterizing notebooks and executing them like Python scripts.\n* [Polynote](https:\u002F\u002Fgithub.com\u002Fpolynote\u002Fpolynote) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fpolynote\u002Fpolynote.svg?cacheSeconds=86400) - Polynote is an experimental polyglot notebook environment. Currently, it supports Scala and Python (with or without Spark), SQL, and Vega.\n* [RMarkdown](https:\u002F\u002Fgithub.com\u002Frstudio\u002Frmarkdown) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Frstudio\u002Frmarkdown.svg?cacheSeconds=86400) - The rmarkdown package is a next generation implementation of R Markdown based on Pandoc.\n* [Stencila](https:\u002F\u002Fgithub.com\u002Fstencila\u002Fstencila) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fstencila\u002Fstencila.svg?cacheSeconds=86400) - Stencila is a platform for creating, collaborating on, and sharing data driven content. Content that is transparent and reproducible.\n* [Voilà](https:\u002F\u002Fgithub.com\u002Fvoila-dashboards\u002Fvoila) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fvoila-dashboards\u002Fvoila.svg?cacheSeconds=86400) - Voilà turns Jupyter notebooks into standalone web applications that can e.g. be used as dashboards.\n* [.NET Interactive](https:\u002F\u002Fgithub.com\u002Fdotnet\u002Finteractive) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fdotnet\u002Finteractive.svg?cacheSeconds=86400) - .NET Interactive takes the power of .NET and embeds it into your interactive experiences.\n\n## Data Storage Optimisation\n* [AIStore](https:\u002F\u002Fgithub.com\u002FNVIDIA\u002Faistore) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FNVIDIA\u002Faistore.svg?cacheSeconds=86400) - AIStore is a lightweight object storage system with the capability to linearly scale out with each added storage node and a special focus on petascale deep learning.\n* [Alluxio](https:\u002F\u002Fgithub.com\u002FAlluxio\u002Falluxio) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FAlluxio\u002Falluxio.svg?cacheSeconds=86400) - A virtual distributed storage system that bridges the gab between computation frameworks and storage systems.\n* [Apache Arrow](https:\u002F\u002Fgithub.com\u002Fapache\u002Farrow) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fapache\u002Farrow.svg?cacheSeconds=86400) - In-memory columnar representation of data compatible with Pandas, Hadoop-based systems, etc..\n* [Apache Druid](https:\u002F\u002Fgithub.com\u002Fapache\u002Fdruid) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fapache\u002Fdruid.svg?cacheSeconds=86400) - A high performance real-time analytics database. Check this [article](https:\u002F\u002Ftowardsdatascience.com\u002Fintroduction-to-druid-4bf285b92b5a) for introduction.\n* [Apache Hudi](https:\u002F\u002Fgithub.com\u002Fapache\u002Fhudi) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fapache\u002Fhudi.svg?cacheSeconds=86400) - Hudi is a transactional data lake platform that brings core warehouse and database functionality directly to a data lake. Hudi is great for streaming workloads, and also allows creation of efficient incremental batch pipelines. Supports popular query engines including Spark, Flink, Presto, Trino, Hive, etc. More info [here](https:\u002F\u002Fhudi.apache.org\u002F).\n* [Apache Iceberg](https:\u002F\u002Fgithub.com\u002Fapache\u002Ficeberg) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fapache\u002Ficeberg.svg?cacheSeconds=86400) - Iceberg is an ACID-compliant, high-performance format built for huge analytic tables (containing tens of petabytes of data), and it brings the reliability and simplicity of SQL tables to big data, while making it possible for engines like Spark, Trino, Flink, Presto, Hive and Impala to safely work with the same tables, at the same time. More info [here](https:\u002F\u002Ficeberg.apache.org\u002F).\n* [Apache Ignite](https:\u002F\u002Fgithub.com\u002Fapache\u002Fignite) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fapache\u002Fignite.svg?cacheSeconds=86400) - A memory-centric distributed database, caching, and processing platform for transactional, analytical, and streaming workloads delivering in-memory speeds at petabyte scale - [Demo](https:\u002F\u002Fwww.youtube.com\u002Fwatch?v=Xt4PWQ__YPw).\n* [Apache Parquet](https:\u002F\u002Fgithub.com\u002Fapache\u002Fparquet-java) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fapache\u002Fparquet-java.svg?cacheSeconds=86400) - On-disk columnar representation of data compatible with Pandas, Hadoop-based systems, etc..\n* [Apache Pinot](https:\u002F\u002Fgithub.com\u002Fapache\u002Fpinot) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fapache\u002Fpinot.svg?cacheSeconds=86400) - A realtime distributed OLAP datastore. Comparison of the open source OLAP systems for big data: ClickHouse, Druid, and Pinot is found [here](https:\u002F\u002Fmedium.com\u002F@leventov\u002Fcomparison-of-the-open-source-olap-systems-for-big-data-clickhouse-druid-and-pinot-8e042a5ed1c7).\n* [Casibase](https:\u002F\u002Fgithub.com\u002Fcasibase\u002Fcasibase) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fcasibase\u002Fcasibase.svg?cacheSeconds=86400) - Casibase is a LangChain-like RAG (Retrieval-Augmented Generation) knowledge database with web UI and Enterprise SSO.\n* [Chroma](https:\u002F\u002Fgithub.com\u002Fchroma-core\u002Fchroma) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fchroma-core\u002Fchroma.svg?cacheSeconds=86400) - Chroma is an open-source embedding database.\n* [ClickHouse](https:\u002F\u002Fgithub.com\u002FClickHouse\u002FClickHouse) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FClickHouse\u002FClickHouse.svg?cacheSeconds=86400) - ClickHouse is an open source column oriented database management system.\n* [Delta Lake](https:\u002F\u002Fgithub.com\u002Fdelta-io\u002Fdelta) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fdelta-io\u002Fdelta.svg?cacheSeconds=86400) - Delta Lake is a storage layer that brings scalable, ACID transactions to Apache Spark and other big-data engines.\n* [EdgeDB](https:\u002F\u002Fgithub.com\u002Fgeldata\u002Fgel) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fgeldata\u002Fgel.svg?cacheSeconds=86400) - Gel supercharges Postgres with a modern data model, graph queries, Auth & AI solutions, and much more.\n* [GPTCache](https:\u002F\u002Fgithub.com\u002Fzilliztech\u002FGPTCache) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fzilliztech\u002FGPTCache.svg?cacheSeconds=86400) - GPTCache is a library for creating semantic cache for large language model queries.\n* [InfluxDB](https:\u002F\u002Fgithub.com\u002Finfluxdata\u002Finfluxdb) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Finfluxdata\u002Finfluxdb.svg?cacheSeconds=86400) Scalable datastore for metrics, events, and real-time analytics.\n* [Milvus](https:\u002F\u002Fgithub.com\u002Fmilvus-io\u002Fmilvus) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fmilvus-io\u002Fmilvus.svg?cacheSeconds=86400) Milvus is a cloud-native, open-source vector database built to manage embedding vectors generated by machine learning models and neural networks.\n* [Marqo](https:\u002F\u002Fgithub.com\u002Fmarqo-ai\u002Fmarqo) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fmarqo-ai\u002Fmarqo.svg?cacheSeconds=86400) Marqo is an end-to-end vector search engine.\n* [pgvector](https:\u002F\u002Fgithub.com\u002Fpgvector\u002Fpgvector) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fpgvector\u002Fpgvector.svg?cacheSeconds=86400) pgvector helps with vector similarity search for Postgres.\n* [PostgresML](https:\u002F\u002Fgithub.com\u002Fpostgresml\u002Fpostgresml) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fpostgresml\u002Fpostgresml.svg?cacheSeconds=86400) PostgresML is a machine learning extension for PostgreSQL that enables you to perform training and inference on text and tabular data using SQL queries.\n* [Redis](https:\u002F\u002Fgithub.com\u002Fredis\u002Fredis) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fredis\u002Fredis.svg?cacheSeconds=86400) Redis is an open-source, in-memory data store that supports vector similarity search, making it suitable for AI\u002FML applications such as semantic search and recommendation systems.\n* [Safetensors](https:\u002F\u002Fgithub.com\u002Fhuggingface\u002Fsafetensors) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fhuggingface\u002Fsafetensors.svg?cacheSeconds=86400) Simple, safe way to store and distribute tensors.\n* [TimescaleDB](https:\u002F\u002Fgithub.com\u002Ftimescale\u002Ftimescaledb) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Ftimescale\u002Ftimescaledb.svg?cacheSeconds=86400) An open-source time-series SQL database optimized for fast ingest and complex queries packaged as a PostgreSQL extension - [(Video)](https:\u002F\u002Fwww.youtube.com\u002Fwatch?v=zbjub8BQPyE).\n* [Weaviate](https:\u002F\u002Fgithub.com\u002Fweaviate\u002Fweaviate) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fweaviate\u002Fweaviate.svg?cacheSeconds=86400) - A low-latency vector search engine (GraphQL, RESTful) with out-of-the-box support for different media types. Modules include Semantic Search, Q&A, Classification, Customizable Models (PyTorch\u002FTensorFlow\u002FKeras), and more.\n* [Zarr](https:\u002F\u002Fgithub.com\u002Fzarr-developers\u002Fzarr-python) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fzarr-developers\u002Fzarr-python.svg?cacheSeconds=86400) - Python implementation of chunked, compressed, N-dimensional arrays designed for use in parallel computing.\n\n## Data Stream Processing\n* [Apache Beam](https:\u002F\u002Fgithub.com\u002Fapache\u002Fbeam) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fapache\u002Fbeam.svg?cacheSeconds=86400) Apache Beam is a unified programming model for Batch and Streaming.\n* [Apache Flink](https:\u002F\u002Fgithub.com\u002Fapache\u002Fflink) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fapache\u002Fflink.svg?cacheSeconds=86400) - Open source stream processing framework with powerful stream and batch processing capabilities.\n* [Apache Kafka](https:\u002F\u002Fgithub.com\u002Fapache\u002Fkafka) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fapache\u002Fkafka.svg?cacheSeconds=86400) - Kafka client library for building applications and microservices where the input and output are stored in kafka clusters.\n* [Apache Samza](https:\u002F\u002Fgithub.com\u002Fapache\u002Fsamza) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fapache\u002Fsamza.svg?cacheSeconds=86400) - Distributed stream processing framework. It uses Apache Kafka for messaging, and Apache Hadoop YARN to provide fault tolerance, processor isolation, security, and resource management.\n* [Apache Spark](https:\u002F\u002Fgithub.com\u002Fapache\u002Fspark) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fapache\u002Fspark.svg?cacheSeconds=86400) - Micro-batch processing for streams using the apache spark framework as a backend supporting stateful exactly-once semantics.\n* [Bytewax](https:\u002F\u002Fgithub.com\u002Fbytewax\u002Fbytewax) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fbytewax\u002Fbytewax.svg?cacheSeconds=86400) - Flexible Python-centric stateful stream processing framework built on top of Rust engine.\n* [FastStream](https:\u002F\u002Fgithub.com\u002Fairtai\u002Ffaststream) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fairtai\u002Ffaststream.svg?cacheSeconds=86400) - A modern broker-agnostic streaming Python framework supporting Apache Kafka, RabbitMQ and NATS protocols, inspired by FastAPI and easily integratable with other web frameworks.\n* [MOA](https:\u002F\u002Fgithub.com\u002FWaikato\u002Fmoa) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FWaikato\u002Fmoa.svg?cacheSeconds=86400) - MOA (Massive Online Analysis) is an open source framework for Big Data stream mining.\n* [MosaicML Streaming](https:\u002F\u002Fgithub.com\u002Fmosaicml\u002Fstreaming) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fmosaicml\u002Fstreaming.svg?cacheSeconds=86400) - Fast, deterministic streaming of large datasets from cloud storage for distributed model training.\n* [RisingWave](https:\u002F\u002Fgithub.com\u002Frisingwavelabs\u002Frisingwave) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Frisingwavelabs\u002Frisingwave.svg?cacheSeconds=86400) - A distributed SQL streaming database that unifies stream processing and low-latency serving, ideal for building and serving features for online machine learning.\n* [TensorStore](https:\u002F\u002Fgithub.com\u002Fgoogle\u002Ftensorstore) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fgoogle\u002Ftensorstore.svg?cacheSeconds=86400) - Library for reading and writing large multi-dimensional arrays.\n\n\n## Deployment and Serving\n* [Agenta](https:\u002F\u002Fgithub.com\u002FAgenta-AI\u002Fagenta) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FAgenta-AI\u002Fagenta.svg?cacheSeconds=86400) - Agenta provides end-to-end tools for the entire LLMOps workflow: building (LLM playground, evaluation), deploying (prompt and configuration management), and  (LLM observability and tracing).\n* [AirLLM](https:\u002F\u002Fgithub.com\u002Flyogavin\u002Fairllm) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Flyogavin\u002Fairllm.svg?cacheSeconds=86400) - AirLLM optimizes inference memory usage, allowing 70B large language models to run inference on a single 4GB GPU card without quantization, distillation and pruning.\n* [AITemplate](https:\u002F\u002Fgithub.com\u002Ffacebookincubator\u002FAITemplate) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Ffacebookincubator\u002FAITemplate.svg?cacheSeconds=86400) - AITemplate (AIT) is a Python framework that transforms deep neural networks into CUDA (NVIDIA GPU) \u002F HIP (AMD GPU) C++ code for lightning-fast inference serving.\n* [BentoML](https:\u002F\u002Fgithub.com\u002Fbentoml\u002FBentoML) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fbentoml\u002FBentoML.svg?cacheSeconds=86400) - BentoML is an open source framework for high performance ML model serving.\n* [BISHENG](https:\u002F\u002Fgithub.com\u002Fdataelement\u002Fbisheng) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fdataelement\u002Fbisheng.svg?cacheSeconds=86400) - BISHENG is an open LLM application devops platform, focusing on enterprise scenarios.\n* [DeepDetect](https:\u002F\u002Fgithub.com\u002Fjolibrain\u002Fdeepdetect) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fjolibrain\u002Fdeepdetect.svg?cacheSeconds=86400) - Machine Learning production server for TensorFlow, XGBoost and Cafe models written in C++ and maintained by Jolibrain.\n* [Dynamo](https:\u002F\u002Fgithub.com\u002Fai-dynamo\u002Fdynamo) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fai-dynamo\u002Fdynamo.svg?cacheSeconds=86400) - NVIDIA Dynamo is a high-throughput, low-latency inference framework designed for serving generative AI and reasoning models in multi-node distributed environments.\n* [exo](https:\u002F\u002Fgithub.com\u002Fexo-explore\u002Fexo) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fexo-explore\u002Fexo.svg?cacheSeconds=86400) - exo helps you run your AI cluster at home with everyday devices.\n* [Genkit](https:\u002F\u002Fgithub.com\u002Ffirebase\u002Fgenkit) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Ffirebase\u002Fgenkit.svg?cacheSeconds=86400) - Genkit is an open source framework for building AI-powered apps with familiar code-centric patterns. Genkit makes it easy to develop, integrate, and test AI features with observability and evaluations.\n* [Inference](https:\u002F\u002Fgithub.com\u002Froboflow\u002Finference) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Froboflow\u002Finference.svg?cacheSeconds=86400) - A fast, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models. With Inference, you can deploy models such as YOLOv5, YOLOv8, CLIP, SAM, and CogVLM on your own hardware using Docker.\n* [Infinity](https:\u002F\u002Fgithub.com\u002Fmichaelfeil\u002Finfinity) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fmichaelfeil\u002Finfinity.svg?cacheSeconds=86400) - Infinity is a high-throughput, low-latency REST API for serving text-embeddings, reranking models and clip. \n* [IPEX-LLM](https:\u002F\u002Fgithub.com\u002Fintel\u002Fipex-llm) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fintel\u002Fipex-llm.svg?cacheSeconds=86400) - IPEX-LLM is a PyTorch library for running LLM on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max) with very low latency.\n* [LiteLLM](https:\u002F\u002Fgithub.com\u002FBerriAI\u002Flitellm) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FBerriAI\u002Flitellm.svg?cacheSeconds=86400) - LiteLLM is a Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq.\n* [LiteRT](https:\u002F\u002Fgithub.com\u002Fgoogle-ai-edge\u002Flitert) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fgoogle-ai-edge\u002Flitert.svg?cacheSeconds=86400) - LiteRT (formerly TensorFlow Lite) is Google's high-performance runtime for on-device AI inference, enabling deployment of machine learning models on mobile, embedded, and edge devices.\n* [LiteRT-LM](https:\u002F\u002Fgithub.com\u002Fgoogle-ai-edge\u002FLiteRT-LM) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fgoogle-ai-edge\u002FLiteRT-LM.svg?cacheSeconds=86400) - LiteRT-LM is Google's production-ready, high-performance inference framework for deploying Large Language Models on edge devices, with cross-platform support for Android, iOS, Web, Desktop, and IoT.\n* [LitServe](https:\u002F\u002Fgithub.com\u002FLightning-AI\u002FLitServe) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FLightning-AI\u002FLitServe.svg?cacheSeconds=86400) - LitServe is a flexible serving engine for AI models built on FastAPI. It supports custom inference engines for models, agents, multi-modal systems, RAG, and complex ML pipelines.\n* [Jina-serve](https:\u002F\u002Fgithub.com\u002Fjina-ai\u002Fserve) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fjina-ai\u002Fserve.svg?cacheSeconds=86400) - Jina-serve is a framework for building and deploying AI services that communicate via gRPC, HTTP and WebSockets.\n* [Kiln](https:\u002F\u002Fgithub.com\u002Fkiln-ai\u002Fkiln) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fkiln-ai\u002Fkiln.svg?cacheSeconds=86400) - Kiln is an OSS tool for fine-tuning LLM models, synthetic data generation, and collaborating on datasets.\n* [KServe](https:\u002F\u002Fgithub.com\u002Fkserve\u002Fkserve) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fkserve\u002Fkserve.svg?cacheSeconds=86400) - KServe provides a Kubernetes Custom Resource Definition for serving predictive and generative ML.\n* [KTransformers](https:\u002F\u002Fgithub.com\u002Fkvcache-ai\u002Fktransformers) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fkvcache-ai\u002Fktransformers.svg?cacheSeconds=86400) - KTransformers is a flexible framework for experiencing cutting-edge LLM inference optimizations.\n* [Langtrace](https:\u002F\u002Fgithub.com\u002FScale3-Labs\u002Flangtrace) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FScale3-Labs\u002Flangtrace.svg?cacheSeconds=86400) - Langtrace is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations and metrics for popular LLMs, LLM frameworks, vectorDBs and more.\n* [Lepton AI](https:\u002F\u002Fgithub.com\u002Fleptonai\u002Fleptonai) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fleptonai\u002Fleptonai.svg?cacheSeconds=86400) - LeptonAI Python library allows you to build an AI service from Python code with ease.\n* [LightLLM](https:\u002F\u002Fgithub.com\u002FModelTC\u002Flightllm) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FModelTC\u002Flightllm.svg?cacheSeconds=86400) - LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.\n* [llama.cpp](https:\u002F\u002Fgithub.com\u002Fggml-org\u002Fllama.cpp) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fggml-org\u002Fllama.cpp.svg?cacheSeconds=86400) - llama.cpp is an open source software library that performs inference on various large language models such as Llama.\n* [llmfit](https:\u002F\u002Fgithub.com\u002FAlexsJones\u002Fllmfit) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FAlexsJones\u002Fllmfit.svg?cacheSeconds=86400) - A terminal tool that right-sizes LLM models to your system's RAM, CPU, and GPU. Detects your hardware, scores each model across quality, speed, fit, and context dimensions, and tells you which ones will actually run well on your machine.\n* [LMCache](https:\u002F\u002Fgithub.com\u002Flmcache\u002Flmcache) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Flmcache\u002Flmcache.svg?cacheSeconds=86400) - LMCache is a high-performance KV cache layer that accelerates LLM inference.\n* [LMDeploy](https:\u002F\u002Fgithub.com\u002FInternLM\u002Flmdeploy) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FInternLM\u002Flmdeploy.svg?cacheSeconds=86400) - LMDeploy is a toolkit for compressing, deploying, and serving LLM.\n* [LM Studio](https:\u002F\u002Fgithub.com\u002Flmstudio-ai\u002Flms) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Flmstudio-ai\u002Flms.svg?cacheSeconds=86400) - LM Studio is a tool for deploying LLM models locally on the computer, even on a relatively modest machine, provided it meets the minimum requirements.\n* [LocalAI](https:\u002F\u002Fgithub.com\u002Fmudler\u002FLocalAI) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fmudler\u002FLocalAI.svg?cacheSeconds=86400) - LocalAI is a drop-in replacement REST API that's compatible with OpenAI API specifications for local inferencing.\n* [MindsDB](https:\u002F\u002Fgithub.com\u002Fmindsdb\u002Fmindsdb) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fmindsdb\u002Fmindsdb.svg?cacheSeconds=86400) - MindsDB is the platform to create, serve, and fine-tune models in real-time from your database, vector store, and application data.\n* [mini-sglang](https:\u002F\u002Fgithub.com\u002Fsgl-project\u002Fmini-sglang) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fsgl-project\u002Fmini-sglang.svg?cacheSeconds=86400) - mini-sglang is a lightweight and efficient serving framework for large language models.\n* [MLRun](https:\u002F\u002Fgithub.com\u002Fmlrun\u002Fmlrun)![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fmlrun\u002Fmlrun.svg?cacheSeconds=86400)- MLRun is an open MLOps framework for quickly building and managing continuous ML and generative AI applications across their lifecycle.\n* [MLServer](https:\u002F\u002Fgithub.com\u002FSeldonIO\u002Fmlserver) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FSeldonIO\u002Fmlserver.svg?cacheSeconds=86400) - An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more.\n* [Model Runner](https:\u002F\u002Fgithub.com\u002Fdocker\u002Fmodel-runner) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fdocker\u002Fmodel-runner.svg?cacheSeconds=86400) - Docker Model Runner makes it easy to manage, run, and serve AI models using Docker, supporting LLMs and other AI models pulled directly from Docker Hub or any OCI-compliant registry.\n* [Mosec](https:\u002F\u002Fgithub.com\u002Fmosecorg\u002Fmosec) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fmosecorg\u002Fmosec.svg?cacheSeconds=86400) - A rust-powered and multi-stage pipelined model server which offers dynamic batching and more. Super easy to implement and deploy as micro-services.\n* [nano-vllm](https:\u002F\u002Fgithub.com\u002FGeeeekExplorer\u002Fnano-vllm) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FGeeeekExplorer\u002Fnano-vllm.svg?cacheSeconds=86400) - nano-vllm is a lightweight vLLM implementation built from scratch, providing fast offline inference with optimization techniques such as prefix caching, tensor parallelism, and CUDA graph.\n* [nndeploy](https:\u002F\u002Fgithub.com\u002Fnndeploy\u002Fnndeploy) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fnndeploy\u002Fnndeploy.svg?cacheSeconds=86400) - An Easy-to-Use and High-Performance AI deployment framework.\n* [Nuclio](https:\u002F\u002Fgithub.com\u002Fnuclio\u002Fnuclio) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fnuclio\u002Fnuclio.svg?cacheSeconds=86400) - A high-performance \"serverless\" framework focused on data, I\u002FO, and compute-intensive workloads. It is well integrated with popular data science tools, such as Jupyter and Kubeflow; supports a variety of data and streaming sources; and supports execution over CPUs and GPUs.\n* [OpenLLM](https:\u002F\u002Fgithub.com\u002Fbentoml\u002FOpenLLM) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fbentoml\u002FOpenLLM.svg?cacheSeconds=86400) - OpenLLM allows developers to run any open-source LLMs (Llama 3.1, Qwen2, Phi3 and more) or custom models as OpenAI-compatible APIs with a single command.\n* [OpenVINO](https:\u002F\u002Fgithub.com\u002Fopenvinotoolkit\u002Fopenvino) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fopenvinotoolkit\u002Fopenvino.svg?cacheSeconds=86400) - OpenVINO is an open-source toolkit for optimizing and deploying AI inference.\n* [Open WebUI](https:\u002F\u002Fgithub.com\u002Fopen-webui\u002Fopen-webui) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fopen-webui\u002Fopen-webui.svg?cacheSeconds=86400) - Open WebUI is an extensible, feature-rich, and user-friendly self-hosted AI platform designed to operate entirely offline. It supports various LLM runners like Ollama and OpenAI-compatible APIs, with built-in inference engine for RAG, making it a powerful AI deployment solution.\n* [OptiLLM](https:\u002F\u002Fgithub.com\u002Falgorithmicsuperintelligence\u002Foptillm) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Falgorithmicsuperintelligence\u002Foptillm.svg?cacheSeconds=86400) - OptiLLM is an OpenAI API-compatible optimizing inference proxy that implements 20+ state-of-the-art techniques to dramatically improve LLM accuracy and performance on reasoning tasks - without requiring any model training or fine-tuning.\n* [PowerInfer](https:\u002F\u002Fgithub.com\u002FSJTU-IPADS\u002FPowerInfer) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FSJTU-IPADS\u002FPowerInfer.svg?cacheSeconds=86400) - PowerInfer is a CPU\u002FGPU LLM inference engine leveraging activation locality for your device.\n* [Prompt2Model](https:\u002F\u002Fgithub.com\u002Fneulab\u002Fprompt2model) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fneulab\u002Fprompt2model.svg?cacheSeconds=86400) - Prompt2Model is a system that takes a natural language task description (like the prompts used for LLMs such as ChatGPT) to train a small special-purpose model that is conducive for deployment.\n* [RamaLama](https:\u002F\u002Fgithub.com\u002Fcontainers\u002Framalama) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fcontainers\u002Framalama.svg?cacheSeconds=86400) - RamaLama is an open-source tool that simplifies the local use and serving of AI models for inference through OCI containers, eliminating the need to configure the host system.\n* [RunAnywhere](https:\u002F\u002Fgithub.com\u002FRunanywhereAI\u002Frunanywhere-sdks) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FRunanywhereAI\u002Frunanywhere-sdks.svg?cacheSeconds=86400) - RunAnywhere is a production-ready SDK for running AI models (LLMs, speech-to-text, text-to-speech) on-device for iOS, Android, React Native, and Flutter - enabling private, offline, and fast mobile AI applications.\n* [Seldon Core](https:\u002F\u002Fgithub.com\u002FSeldonIO\u002Fseldon-core) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FSeldonIO\u002Fseldon-core.svg?cacheSeconds=86400) - Open source platform for deploying and  machine learning models in Kubernetes - [(Video)](https:\u002F\u002Fwww.youtube.com\u002Fwatch?v=pDlapGtecbY).\n* [SGLang](https:\u002F\u002Fgithub.com\u002Fsgl-project\u002Fsglang) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fsgl-project\u002Fsglang.svg?cacheSeconds=86400) - SGLang is a fast serving framework for large language models and vision language models.\n* [SkyPilot](https:\u002F\u002Fgithub.com\u002Fskypilot-org\u002Fskypilot) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fskypilot-org\u002Fskypilot.svg?cacheSeconds=86400) - SkyPilot is a framework for running LLMs, AI, and batch jobs on any cloud, offering maximum cost savings, highest GPU availability, and managed execution.\n* [Tensorflow Serving](https:\u002F\u002Fgithub.com\u002Ftensorflow\u002Fserving) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Ftensorflow\u002Fserving.svg?cacheSeconds=86400) - High-performant framework to serve Tensorflow models via grpc protocol able to handle 100k requests per second per core.\n* [text-generation-inference](https:\u002F\u002Fgithub.com\u002Fhuggingface\u002Ftext-generation-inference) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fhuggingface\u002Ftext-generation-inference.svg?cacheSeconds=86400) - Large Language Model Text Generation Inference.\n* [TorchServe](https:\u002F\u002Fgithub.com\u002Fpytorch\u002Fserve) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fpytorch\u002Fserve.svg?cacheSeconds=86400) - TorchServe is a flexible and easy to use tool for serving PyTorch models.\n* [torchtune](https:\u002F\u002Fgithub.com\u002Fmeta-pytorch\u002Ftorchtune) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fmeta-pytorch\u002Ftorchtune.svg?cacheSeconds=86400) - torchtune is a PyTorch library for easily authoring, post-training, and experimenting with LLMs.\n* [Transformer Lab](https:\u002F\u002Fgithub.com\u002Ftransformerlab\u002Ftransformerlab-app) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Ftransformerlab\u002Ftransformerlab-app.svg?cacheSeconds=86400) - Transformer Lab is an open-source LLM workspace for finetuning, evaluating, exporting, and testing models locally across inference engines and platforms.\n* [Triton Inference Server](https:\u002F\u002Fgithub.com\u002Ftriton-inference-server\u002Fserver) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Ftriton-inference-server\u002Fserver.svg?cacheSeconds=86400) - Triton is a high performance open source serving software to deploy AI models from any framework on GPU & CPU while maximizing utilization.\n* [Vercel AI](https:\u002F\u002Fgithub.com\u002Fvercel\u002Fai) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fvercel\u002Fai.svg?cacheSeconds=86400) - Vercel AI is a TypeScript toolkit designed to help you build AI-powered applications using popular frameworks like Next.js, React, Svelte, Vue and runtimes like Node.js.\n* [Vespa](https:\u002F\u002Fgithub.com\u002Fvespa-engine\u002Fvespa) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fvespa-engine\u002Fvespa.svg?cacheSeconds=86400) - Search, make inferences in and organize vectors, tensors, text and structured data, at serving time and any scale.\n* [vLLM](https:\u002F\u002Fgithub.com\u002Fvllm-project\u002Fvllm) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fvllm-project\u002Fvllm.svg?cacheSeconds=86400) - vLLM is a high-throughput and memory-efficient inference and serving engine for LLMs.\n\n\n## Evaluation and Monitoring\n* [AlpacaEval](https:\u002F\u002Fgithub.com\u002Ftatsu-lab\u002Falpaca_eval) ![](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Ftatsu-lab\u002Fal","EthicalML\u002Fawesome-production-machine-learning 是一个精选的开源库列表，旨在帮助开发者部署、监控、版本管理和扩展机器学习模型。该项目涵盖了从自动机器学习（AutoML）、数据管道处理到模型训练与编排等多个方面，特别强调了在生产环境中实现高效、安全和负责任的人工智能解决方案的技术特点。它适合那些希望提升机器学习项目工业化水平的企业或个人使用，无论是初学者还是经验丰富的工程师都能从中找到有价值的资源。通过定期更新内容并提供搜索工具包，使得用户能够轻松获取最新的生产级机器学习库信息。",2,"2026-06-11 03:23:33","top_topic"]