[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"project-71828":3},{"id":4,"name":5,"fullName":6,"owner":7,"repo":5,"description":8,"homepage":9,"htmlUrl":10,"language":11,"languages":10,"totalLinesOfCode":10,"stars":12,"forks":13,"watchers":14,"openIssues":15,"contributorsCount":16,"subscribersCount":16,"size":16,"stars1d":17,"stars7d":18,"stars30d":19,"stars90d":16,"forks30d":16,"starsTrendScore":20,"compositeScore":21,"rankGlobal":10,"rankLanguage":10,"license":22,"archived":23,"fork":23,"defaultBranch":24,"hasWiki":25,"hasPages":23,"topics":26,"createdAt":10,"pushedAt":10,"updatedAt":31,"readmeContent":32,"aiSummary":33,"trendingCount":16,"starSnapshotCount":16,"syncStatus":34,"lastSyncTime":35,"discoverSource":36},71828,"Awesome-LLMOps","tensorchord\u002FAwesome-LLMOps","tensorchord","An awesome & curated list of best LLMOps tools for developers","",null,"Shell",5832,830,82,22,0,7,19,56,21,93.86,"Creative Commons Zero v1.0 Universal",false,"main",true,[27,28,29,30],"ai-development-tools","awesome-list","llmops","mlops","2026-06-12 04:01:01","# Awesome LLMOps\n\n\u003Ca href=\"https:\u002F\u002Fdiscord.gg\u002FKqswhpVgdU\">\u003Cimg alt=\"discord invitation link\" src=\"https:\u002F\u002Fimg.shields.io\u002Fdiscord\u002F974584200327991326?style=flat&logo=discord&cacheSeconds=60\">\u003C\u002Fa>\n\u003Ca href=\"https:\u002F\u002Fawesome.re\">\u003Cimg src=\"https:\u002F\u002Fawesome.re\u002Fbadge-flat2.svg\">\u003C\u002Fa>\n\nAn awesome & curated list of the best LLMOps tools for developers.\n\n> [!NOTE]\n> Contributions are most welcome, please adhere to the [contribution guidelines](contributing.md).\n\n## Table of Contents\n\n- [Awesome LLMOps](#awesome-llmops)\n  - [Table of Contents](#table-of-contents)\n  - [Model](#model)\n    - [Large Language Model](#large-language-model)\n    - [CV Foundation Model](#cv-foundation-model)\n    - [Audio Foundation Model](#audio-foundation-model)\n  - [Serving](#serving)\n    - [Large Model Serving](#large-model-serving)\n    - [Frameworks\u002FServers for Serving](#frameworksservers-for-serving)\n  - [Security](#security)\n    - [Frameworks for LLM security](#frameworks-for-llm-security)\n    - [Observability](#observability)\n  - [LLMOps](#llmops)\n  - [Search](#search)\n    - [Vector search](#vector-search)\n  - [Code AI](#code-ai)\n  - [Training](#training)\n    - [IDEs and Workspaces](#ides-and-workspaces)\n    - [Foundation Model Fine Tuning](#foundation-model-fine-tuning)\n    - [Frameworks for Training](#frameworks-for-training)\n    - [Experiment Tracking](#experiment-tracking)\n    - [Visualization](#visualization)\n    - [Model Editing](#model-editing)\n  - [Data](#data)\n    - [Data Management](#data-management)\n    - [Data Storage](#data-storage)\n    - [Data Tracking](#data-tracking)\n    - [Feature Engineering](#feature-engineering)\n    - [Data\u002FFeature enrichment](#datafeature-enrichment)\n  - [Large Scale Deployment](#large-scale-deployment)\n    - [ML Platforms](#ml-platforms)\n    - [Workflow](#workflow)\n    - [Scheduling](#scheduling)\n    - [Model Management](#model-management)\n  - [Performance](#performance)\n    - [ML Compiler](#ml-compiler)\n    - [Profiling](#profiling)\n  - [AutoML](#automl)\n  - [Optimizations](#optimizations)\n  - [Federated ML](#federated-ml)\n  - [Awesome Lists](#awesome-lists)\n\n\u003C!-- Created by https:\u002F\u002Fgithub.com\u002Fekalinin\u002Fgithub-markdown-toc -->\n\n## Model\n\n### Large Language Model\n\n| Project                                                                 | Details                                                                                                                                                                                    | Repository                                                                                                |\n| ----------------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ | --------------------------------------------------------------------------------------------------------- |\n| [Alpaca](https:\u002F\u002Fgithub.com\u002Ftatsu-lab\u002Fstanford_alpaca)                  | Code and documentation to train Stanford's Alpaca models, and generate the data.                                                                                                           | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Ftatsu-lab\u002Fstanford_alpaca.svg?style=flat-square)      |\n| [BELLE](https:\u002F\u002Fgithub.com\u002FLianjiaTech\u002FBELLE)                           | A 7B Large Language Model fine-tune by 34B Chinese Character Corpus, based on LLaMA and Alpaca.                                                                                            | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FLianjiaTech\u002FBELLE.svg?style=flat-square)              |\n| [Bloom](https:\u002F\u002Fgithub.com\u002Fbigscience-workshop\u002Fmodel_card)              | BigScience Large Open-science Open-access Multilingual Language Model                                                                                                                      | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fbigscience-workshop\u002Fmodel_card.svg?style=flat-square) |\n| [dolly](https:\u002F\u002Fgithub.com\u002Fdatabrickslabs\u002Fdolly)                        | Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform                                                                                              | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fdatabrickslabs\u002Fdolly.svg?style=flat-square)           |\n| [Falcon 40B](https:\u002F\u002Fhuggingface.co\u002Ftiiuae\u002Ffalcon-40b-instruct)         | Falcon-40B-Instruct is a 40B parameters causal decoder-only model built by TII based on Falcon-40B and finetuned on a mixture of Baize. It is made available under the Apache 2.0 license. |                                                                                                           |\n| [FastChat (Vicuna)](https:\u002F\u002Fgithub.com\u002Flm-sys\u002FFastChat)                 | An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.                                                                     | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Flm-sys\u002FFastChat.svg?style=flat-square)                |\n| [Gemma](https:\u002F\u002Fwww.kaggle.com\u002Fmodels\u002Fgoogle\u002Fgemma)                     | Gemma is a family of lightweight, open models built from the research and technology that Google used to create the Gemini models.                                                         |                                                                                                           |\n| [GLM-6B (ChatGLM)](https:\u002F\u002Fgithub.com\u002FTHUDM\u002FChatGLM-6B)                 | An Open Bilingual Pre-Trained Model, quantization of ChatGLM-130B, can run on consumer-level GPUs.                                                                                         | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FTHUDM\u002FChatGLM-6B.svg?style=flat-square)               |\n| [ChatGLM2-6B](https:\u002F\u002Fgithub.com\u002FTHUDM\u002FChatGLM2-6B)                     | ChatGLM2-6B is the second-generation version of the open-source bilingual (Chinese-English) chat model [ChatGLM-6B](https:\u002F\u002Fgithub.com\u002FTHUDM\u002FChatGLM-6B).                                  | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FTHUDM\u002FChatGLM2-6B.svg?style=flat-square)              |\n| [GLM-130B (ChatGLM)](https:\u002F\u002Fgithub.com\u002FTHUDM\u002FGLM-130B)                 | An Open Bilingual Pre-Trained Model (ICLR 2023)                                                                                                                                            | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FTHUDM\u002FGLM-130B.svg?style=flat-square)                 |\n| [GPT-NeoX](https:\u002F\u002Fgithub.com\u002FEleutherAI\u002Fgpt-neox)                      | An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.                                                                                   | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FEleutherAI\u002Fgpt-neox.svg?style=flat-square)            |\n| [Luotuo](https:\u002F\u002Fgithub.com\u002FLC1332\u002FLuotuo-Chinese-LLM)                  | A Chinese LLM, Based on LLaMA and fine tune by Stanford Alpaca, Alpaca LoRA, Japanese-Alpaca-LoRA.                                                                                         | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FLC1332\u002FLuotuo-Chinese-LLM.svg?style=flat-square)      |\n| [Mixtral-8x7B-v0.1](https:\u002F\u002Fhuggingface.co\u002Fmistralai\u002FMixtral-8x7B-v0.1) | The Mixtral-8x7B Large Language Model (LLM) is a pretrained generative Sparse Mixture of Experts.                                                                                          |                                                                                                           |\n| [StableLM](https:\u002F\u002Fgithub.com\u002FStability-AI\u002FStableLM)                    | StableLM: Stability AI Language Models                                                                                                                                                     | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FStability-AI\u002FStableLM.svg?style=flat-square)          |\n\n**[⬆ back to ToC](#table-of-contents)**\n\n### CV Foundation Model\n\n| Project                                                                        | Details                                                                                                                                          | Repository                                                                                                   |\n| ------------------------------------------------------------------------------ | ------------------------------------------------------------------------------------------------------------------------------------------------ | ------------------------------------------------------------------------------------------------------------ |\n| [disco-diffusion](https:\u002F\u002Fgithub.com\u002Falembics\u002Fdisco-diffusion)                 | A frankensteinian amalgamation of notebooks, models and techniques for the generation of AI Art and Animations.                                  | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Falembics\u002Fdisco-diffusion.svg?style=flat-square)          |\n| [midjourney](https:\u002F\u002Fwww.midjourney.com\u002Fhome\u002F)                                 | Midjourney is an independent research lab exploring new mediums of thought and expanding the imaginative powers of the human species.            |                                                                                                              |\n| [segment-anything (SAM)](https:\u002F\u002Fgithub.com\u002Ffacebookresearch\u002Fsegment-anything) | produces high quality object masks from input prompts such as points or boxes, and it can be used to generate masks for all objects in an image. | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Ffacebookresearch\u002Fsegment-anything.svg?style=flat-square) |\n| [stable-diffusion](https:\u002F\u002Fgithub.com\u002FCompVis\u002Fstable-diffusion)                | A latent text-to-image diffusion model                                                                                                           | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FCompVis\u002Fstable-diffusion.svg?style=flat-square)          |\n\n**[⬆ back to ToC](#table-of-contents)**\n\n### Audio Foundation Model\n\n| Project                                      | Details                                                                                                                                                                                                       | Repository                                                                                |\n| -------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | ----------------------------------------------------------------------------------------- |\n| [bark](https:\u002F\u002Fgithub.com\u002Fsuno-ai\u002Fbark)      | Bark is a transformer-based text-to-audio model created by Suno. Bark can generate highly realistic, multilingual speech as well as other audio - including music, background noise and simple sound effects. | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fsuno-ai\u002Fbark.svg?style=flat-square)   |\n| [whisper](https:\u002F\u002Fgithub.com\u002Fopenai\u002Fwhisper) | Robust Speech Recognition via Large-Scale Weak Supervision                                                                                                                                                    | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fopenai\u002Fwhisper.svg?style=flat-square) |\n\n## Serving\n\n### Large Model Serving\n\n| Project                                                                               | Details                                                                                                         | Repository                                                                                                       |\n| ------------------------------------------------------------------------------------- | --------------------------------------------------------------------------------------------------------------- | ---------------------------------------------------------------------------------------------------------------- |\n| [Alpaca-LoRA-Serve](https:\u002F\u002Fgithub.com\u002Fdeep-diver\u002FAlpaca-LoRA-Serve)                  | Alpaca-LoRA as Chatbot service                                                                                  | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fdeep-diver\u002FAlpaca-LoRA-Serve.svg?style=flat-square)          |\n| [CTranslate2](https:\u002F\u002Fgithub.com\u002FOpenNMT\u002FCTranslate2)                                 | fast inference engine for Transformer models in C++                                                             | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FOpenNMT\u002FCTranslate2.svg?style=flat-square)                   |\n| [Clip-as-a-service](https:\u002F\u002Fgithub.com\u002Fjina-ai\u002Fclip-as-service)                       | serving the OpenAI CLIP model                                                                                   | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fjina-ai\u002Fclip-as-service.svg?style=flat-square)               |\n| [DeepSpeed-MII](https:\u002F\u002Fgithub.com\u002Fmicrosoft\u002FDeepSpeed-MII)                           | MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.                             | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fmicrosoft\u002FDeepSpeed-MII.svg?style=flat-square)               |\n| [Faster Whisper](https:\u002F\u002Fgithub.com\u002Fguillaumekln\u002Ffaster-whisper)                      | fast inference engine for whisper in C++ using CTranslate2.                                                     | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fguillaumekln\u002Ffaster-whisper.svg?style=flat-square)           |\n| [FlexGen](https:\u002F\u002Fgithub.com\u002FFMInference\u002FFlexGen)                                     | Running large language models on a single GPU for throughput-oriented scenarios. *(Archived)*                   | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FFMInference\u002FFlexGen.svg?style=flat-square)                   |\n| [Flowise](https:\u002F\u002Fgithub.com\u002FFlowiseAI\u002FFlowise)                                       | Drag & drop UI to build your customized LLM flow using LangchainJS.                                             | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FFlowiseAI\u002FFlowise.svg?style=flat-square)                     |\n| [llama.cpp](https:\u002F\u002Fgithub.com\u002Fggerganov\u002Fllama.cpp)                                   | Port of Facebook's LLaMA model in C\u002FC++                                                                         | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fggerganov\u002Fllama.cpp.svg?style=flat-square)                   |\n| [Shimmy](https:\u002F\u002Fgithub.com\u002FMichael-A-Kuykendall\u002Fshimmy)                               | Python-free Rust inference server with OpenAI API compatibility and hot model swapping                        | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FMichael-A-Kuykendall\u002Fshimmy.svg?style=flat-square)        |\n| [Infinity](https:\u002F\u002Fgithub.com\u002Fmichaelfeil\u002Finfinity)                                   | Rest API server for serving text-embeddings                                                                     | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fmichaelfeil\u002Finfinity.svg?style=flat-square)                  |\n| [Modelz-LLM](https:\u002F\u002Fgithub.com\u002Ftensorchord\u002Fmodelz-llm)                               | OpenAI compatible API for LLMs and embeddings (LLaMA, Vicuna, ChatGLM and many others)                          | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Ftensorchord\u002Fmodelz-llm.svg?style=flat-square)                |\n| [Ollama](https:\u002F\u002Fgithub.com\u002Fjmorganca\u002Follama)                                         | Serve Llama 2 and other large language models locally from command line or through a browser interface.         | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fjmorganca\u002Follama.svg?style=flat-square)                      |\n| [Rapid-MLX](https:\u002F\u002Fgithub.com\u002Fraullenchai\u002FRapid-MLX)                                 | OpenAI-compatible LLM inference server for Apple Silicon using MLX. 2-4x faster than Ollama with tool calling and prompt caching. | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fraullenchai\u002FRapid-MLX.svg?style=flat-square)                  |\n| [TensorRT-LLM](https:\u002F\u002Fgithub.com\u002FNVIDIA\u002FTensorRT-LLM)                                | Inference engine for TensorRT on Nvidia GPUs                                                                    | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FNVIDIA\u002FTensorRT-LLM.svg?style=flat-square)                   |\n| [text-generation-inference](https:\u002F\u002Fgithub.com\u002Fhuggingface\u002Ftext-generation-inference) | Large Language Model Text Generation Inference                                                                  | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fhuggingface\u002Ftext-generation-inference.svg?style=flat-square) |\n| [text-embeddings-inference](https:\u002F\u002Fgithub.com\u002Fhuggingface\u002Ftext-embeddings-inference) | Inference for text-embedding models                                                                             | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fhuggingface\u002Ftext-embeddings-inference.svg?style=flat-square) |\n| [tokenizers](https:\u002F\u002Fgithub.com\u002Fhuggingface\u002Ftokenizers)                               | 💥 Fast State-of-the-Art Tokenizers optimized for Research and Production                                       | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fhuggingface\u002Ftokenizers.svg?style=flat-square)                |\n| [vllm](https:\u002F\u002Fgithub.com\u002Fvllm-project\u002Fvllm)                                          | A high-throughput and memory-efficient inference and serving engine for LLMs.                                   | ![GitHub stars](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fvllm-project\u002Fvllm.svg?style=flat-square)                     |\n| [whisper.cpp](https:\u002F\u002Fgithub.com\u002Fggerganov\u002Fwhisper.cpp)                               | Port of OpenAI's Whisper model in C\u002FC++                                                                         | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fggerganov\u002Fwhisper.cpp.svg?style=flat-square)                 |\n| [x-stable-diffusion](https:\u002F\u002Fgithub.com\u002Fstochasticai\u002Fx-stable-diffusion)              | Real-time inference for Stable Diffusion - 0.88s latency. Covers AITemplate, nvFuser, TensorRT, FlashAttention. *(Archived)* | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fstochasticai\u002Fx-stable-diffusion.svg?style=flat-square)       |\n\n**[⬆ back to ToC](#table-of-contents)**\n\n### Frameworks\u002FServers for Serving\n\n| Project                                                                    | Details                                                                                                                                                                                                                                                                                                                                            | Repository                                                                                                |\n| -------------------------------------------------------------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | --------------------------------------------------------------------------------------------------------- |\n| [BentoML](https:\u002F\u002Fgithub.com\u002Fbentoml\u002FBentoML)                              | The Unified Model Serving Framework                                                                                                                                                                                                                                                                                                                | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fbentoml\u002FBentoML.svg?style=flat-square)                |\n| [Jina](https:\u002F\u002Fgithub.com\u002Fjina-ai\u002Fjina)                                    | Build multimodal AI services via cloud native technologies · Model Serving · Generative AI · Neural Search · Cloud Native                                                                                                                                                                                                                          | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fjina-ai\u002Fjina.svg?style=flat-square)                   |\n| [Mosec](https:\u002F\u002Fgithub.com\u002Fmosecorg\u002Fmosec)                                 | A machine learning model serving framework with dynamic batching and pipelined stages, provides an easy-to-use Python interface.                                                                                                                                                                                                                   | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fmosecorg\u002Fmosec?style=flat-square)                     |\n| [TFServing](https:\u002F\u002Fgithub.com\u002Ftensorflow\u002Fserving)                         | A flexible, high-performance serving system for machine learning models.                                                                                                                                                                                                                                                                           | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Ftensorflow\u002Fserving.svg?style=flat-square)             |\n| [Torchserve](https:\u002F\u002Fgithub.com\u002Fpytorch\u002Fserve)                             | Serve, optimize and scale PyTorch models in production *(Archived)*                                                                                                                                                                                                                                                                                | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fpytorch\u002Fserve.svg?style=flat-square)                  |\n| [Triton Server (TRTIS)](https:\u002F\u002Fgithub.com\u002Ftriton-inference-server\u002Fserver) | The Triton Inference Server provides an optimized cloud and edge inferencing solution.                                                                                                                                                                                                                                                             | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Ftriton-inference-server\u002Fserver.svg?style=flat-square) |\n| [langchain-serve](https:\u002F\u002Fgithub.com\u002Fjina-ai\u002Flangchain-serve)              | Serverless LLM apps on Production with Jina AI Cloud *(Archived)*                                                                                                                                                                                                                                                                                  | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fjina-ai\u002Flangchain-serve.svg?style=flat-square)        |\n| [lanarky](https:\u002F\u002Fgithub.com\u002Fajndkr\u002Flanarky)                               | FastAPI framework to build production-grade LLM applications                                                                                                                                                                                                                                                                                       | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fajndkr\u002Flanarky.svg?style=flat-square)                 |\n| [ray-llm](https:\u002F\u002Fgithub.com\u002Fray-project\u002Fray-llm)                          | LLMs on Ray - RayLLM *(Archived)*                                                                                                                                                                                                                                                                                                                  | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fray-project\u002Fray-llm.svg?style=flat-square)            |\n| [Xinference](https:\u002F\u002Fgithub.com\u002Fxorbitsai\u002Finference)                       | Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop. | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fxorbitsai\u002Finference.svg?style=flat-square)            |\n| [KubeAI](https:\u002F\u002Fgithub.com\u002Fsubstratusai\u002Fkubeai)                       | Deploy and scale machine learning models on Kubernetes. Built for LLMs, embeddings, and speech-to-text. | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fsubstratusai\u002Fkubeai.svg?style=flat-square)             |\n| [Kaito](https:\u002F\u002Fgithub.com\u002Fkaito-project\u002Fkaito)                            | A Kubernetes operator that simplifies serving and tuning large AI models (e.g. Falcon or phi-3) using container images and GPU auto-provisioning. Includes an OpenAI-compatible server for inference and preset configurations for popular runtimes such as vLLM and transformers.                                                                 | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fkaito-project\u002Fkaito.svg?style=flat-square)            |\n| [Open Responses](https:\u002F\u002Fdocs.julep.ai\u002Fopen-responses) | Serverless open-source platform for building long-running LLM agents with tool use. | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fjulep-ai\u002Fjulep.svg?style=flat-square) |\n\n\n**[⬆ back to ToC](#table-of-contents)**\n\n## Security\n\n### Frameworks for LLM security\n\n| Project                                                 | Details                                                                                                                             | Repository                                                                                    |\n| ------------------------------------------------------- | ----------------------------------------------------------------------------------------------------------------------------------- | --------------------------------------------------------------------------------------------- |\n| [Plexiglass](https:\u002F\u002Fgithub.com\u002Fkortex-labs\u002Fplexiglass) | A Python Machine Learning Pentesting Toolbox for Adversarial Attacks. Works with LLMs, DNNs, and other machine learning algorithms. | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fkortex-labs\u002Fplexiglass?style=flat-square) |\n\n**[⬆ back to ToC](#table-of-contents)**\n\n### Observability\n\n| Project                                                                        | Details                                                                                                                                                                                                                        | Repository                                                                                                       |\n| ------------------------------------------------------------------------------ | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ | ---------------------------------------------------------------------------------------------------------------- |\n| [Azure OpenAI Logger](https:\u002F\u002Fgithub.com\u002Faavetis\u002Fazure-openai-logger)          | \"Batteries included\" logging solution for your Azure OpenAI instance.                                                                                                                                                          | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Faavetis\u002Fazure-openai-logger?style=flat-square)               |\n| [ClevAgent](https:\u002F\u002Fclevagent.io)                                              | Runtime monitoring for AI agents — heartbeat watchdog, loop detection, cost tracking, auto-restart. Python SDK or HTTP API.                                                                                                    |                                                                                                                  |\n| [Deepchecks](https:\u002F\u002Fgithub.com\u002Fdeepchecks\u002Fdeepchecks)                         | Tests for Continuous Validation of ML Models & Data. Deepchecks is a Python package for comprehensively validating your machine learning models and data with minimal effort.                                                  | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fdeepchecks\u002Fdeepchecks.svg?style=flat-square)                 |\n| [Evidently](https:\u002F\u002Fgithub.com\u002Fevidentlyai\u002Fevidently)                          | An open-source framework to evaluate, test and monitor ML and LLM-powered systems.                                                                                                                                             | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fevidentlyai\u002Fevidently.svg?style=flat-square)                 |\n| [Fiddler AI](https:\u002F\u002Fgithub.com\u002Ffiddler-labs\u002Ffiddler-auditor)                  | Evaluate, monitor, analyze, and improve machine learning and generative models from pre-production to production. Ship more ML and LLMs into production, and monitor ML and LLM metrics like hallucination, PII, and toxicity. | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Ffiddler-labs\u002Ffiddler-auditor.svg?style=flat-square)          |\n| [Giskard](https:\u002F\u002Fgithub.com\u002FGiskard-AI\u002Fgiskard)                               | Testing framework dedicated to ML models, from tabular to LLMs. Detect risks of biases, performance issues and errors in 4 lines of code.                                                                                      | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FGiskard-AI\u002Fgiskard.svg?style=flat-square) \n| [QWED](https:\u002F\u002Fgithub.com\u002FQWED-AI\u002Fqwed-verification) | Deterministic verification protocol for LLM outputs using 8 formal verification engines (SymPy, Z3, AST, SQLGlot). Prevents hallucinations through mathematical proofs rather than statistical methods. | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FQWED-AI\u002Fqwed-verification.svg?style=flat-square) |\n| [Great Expectations](https:\u002F\u002Fgithub.com\u002Fgreat-expectations\u002Fgreat_expectations) | Always know what to expect from your data.                                                                                                                                                                                     | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fgreat-expectations\u002Fgreat_expectations.svg?style=flat-square) |\n| [Helicone](https:\u002F\u002Fgithub.com\u002FHelicone\u002Fhelicone)                              | Open source LLM observability platform. One line of code to monitor, evaluate, and experiment with features like prompt management, agent tracing, and evaluations.                                                            | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FHelicone\u002Fhelicone.svg?style=flat-square)                     |\n| [Traceloop OpenLLMetry](https:\u002F\u002Fgithub.com\u002Ftraceloop\u002Fopenllmetry)                              | OpenTelemetry-based observability and monitoring for LLM and agents workflows.                                                           | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Ftraceloop\u002Fopenllmetry.svg?style=flat-square)    \n| [Langfuse 🪢](https:\u002F\u002Flangfuse.com) | Open-source LLM observability platform that helps teams collaboratively debug, analyze, and iterate on their LLM applications. | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Flangfuse\u002Flangfuse.svg?style=flat-square)              |\n| [whylogs](https:\u002F\u002Fgithub.com\u002Fwhylabs\u002Fwhylogs)                                  | The open standard for data logging                                                                                                                                                                                             | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fwhylabs\u002Fwhylogs.svg?style=flat-square)                       |\n| [Maxim AI](https:\u002F\u002Fgetmaxim.ai) | Platform for AI Agent Simulation, Evaluation & Observability |\n| [RagTune](https:\u002F\u002Fgithub.com\u002Fmetawake\u002Fragtune) | CLI tool for debugging and benchmarking RAG retrieval. EXPLAIN ANALYZE for your retrieval layer. | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fmetawake\u002Fragtune.svg?style=flat-square) |\n| [traceAI](https:\u002F\u002Fgithub.com\u002Ffuture-agi\u002FtraceAI)                                | Open-source AI tracing framework built on OpenTelemetry for deep observability across agentic and LLM workflows.                                                                   | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Ffuture-agi\u002FtraceAI?style=flat-square)                         |\n| [Future AGI](https:\u002F\u002Fgithub.com\u002Ffuture-agi\u002Ffutureagi-sdk)                    | Production-grade SDK for observability, automated evaluations and prompt management with sub-100ms guardrails for LLM\u002Fagent workflows.                                             | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Ffuture-agi\u002Ffutureagi-sdk?style=flat-square)                   |\n| [semantic-coverage](https:\u002F\u002Fgithub.com\u002Faashirpersonal\u002Fsemantic-coverage) | Visualizes RAG knowledge gaps and \"blind spots\" using 2D UMAP clustering and density detection.                                             | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Ffuture-agi\u002Ffutureagi-sdk?style=flat-square)                   |\n\n**[⬆ back to ToC](#table-of-contents)**\n\n## LLMOps\n\n| Project                                                            | Details                                                                                                                                                                                                                                                                                                                                                                                                 | Repository                                                                                                |\n| ------------------------------------------------------------------ | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | --------------------------------------------------------------------------------------------------------- |\n| [agenta](https:\u002F\u002Fgithub.com\u002FAgenta-AI\u002Fagenta)                      | The LLMOps platform to build robust LLM apps. Easily experiment and evaluate different prompts, models, and workflows to build robust apps.                                                                                                                                                                                                                                                             | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FAgenta-AI\u002Fagenta.svg?style=flat-square)               |\n| [AgentMark](https:\u002F\u002Fgithub.com\u002Fpuzzlet-ai\u002Fagentmark)                      | Type-Safe Markdown-based Agents                                                                                                                                                                                                                                                             | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FPuzzlet-ai\u002Fagentmark.svg?style=flat-square)               |\n| [AI studio](https:\u002F\u002Fgithub.com\u002Fmissingstudio\u002Fai)                   | A Reliable Open Source AI studio to build core infrastructure stack for your LLM Applications. It allows you to gain visibility, make your application reliable, and prepare it for production with features such as caching, rate limiting, exponential retry, model fallback, and more.                                                                                                               | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fmissingstudio\u002Fai.svg?style=flat-square)               |\n| [Arize-Phoenix](https:\u002F\u002Fgithub.com\u002FArize-ai\u002Fphoenix)               | ML observability for LLMs, vision, language, and tabular models.                                                                                                                                                                                                                                                                                                                                        | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FArize-ai\u002Fphoenix.svg?style=flat-square)               |\n| [BudgetML](https:\u002F\u002Fgithub.com\u002Febhy\u002Fbudgetml)                       | Deploy a ML inference service on a budget in less than 10 lines of code.                                                                                                                                                                                                                                                                                                                                | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Febhy\u002Fbudgetml.svg?style=flat-square)                  |\n| [Cheshire Cat AI](https:\u002F\u002Fgithub.com\u002Fcheshire-cat-ai\u002Fcore)         | Web framework to create vertical AI agents. FastAPI based, plugin system inspired to WordPress, admin panel, vector DB included                                                                                                                                                                                                                                                                         | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fcheshire-cat-ai\u002Fcore.svg?style=flat-square)                  |\n| [Dataoorts](https:\u002F\u002Fdataoorts.com\u002Fai)                              | Enjoy unlimited API calls with Serverless AI Workers\u002FLLMs for just $25 per month. No rate or concurrency limits.                                                                                                                                                                                                                                                                                        |                                                                                                           |\n| [deeplake](https:\u002F\u002Fgithub.com\u002Factiveloopai\u002Fdeeplake)               | Stream large multimodal datasets to achieve near 100% GPU utilization. Query, visualize, & version control data. Access data w\u002Fo the need to recompute the embeddings for the model finetuning.                                                                                                                                                                                                         | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Factiveloopai\u002FHub.svg?style=flat-square)               |\n| [Dify](https:\u002F\u002Fgithub.com\u002Flanggenius\u002Fdify)                         | Open-source framework aims to enable developers (and even non-developers) to quickly build useful applications based on large language models, ensuring they are visual, operable, and improvable.                                                                                                                                                                                                      | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Flanggenius\u002Fdify.svg?style=flat-square)                |\n| [Dstack](https:\u002F\u002Fgithub.com\u002Fdstackai\u002Fdstack)                       | Cost-effective LLM development in any cloud (AWS, GCP, Azure, Lambda, etc).                                                                                                                                                                                                                                                                                                                             | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fdstackai\u002Fdstack.svg?style=flat-square)                |\n| [Embedchain](https:\u002F\u002Fgithub.com\u002Fembedchain\u002Fembedchain)             | Framework to create ChatGPT like bots over your dataset.                                                                                                                                                                                                                                                                                                                                                | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fembedchain\u002Fembedchain.svg?style=flat-square)          |\n| [Epsilla](https:\u002F\u002Fepsilla.com)                                     | An all-in-one platform to create vertical AI agents powered by your private data and knowledge.                                                                                                                                                                                                      |               |\n| [Evidently](https:\u002F\u002Fgithub.com\u002Fevidentlyai\u002Fevidently)              | An open-source framework to evaluate, test and monitor ML and LLM-powered systems.                                                                                                                                                                                                                                                                                                                      | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fevidentlyai\u002Fevidently.svg?style=flat-square)          |\n| [Fiddler AI](https:\u002F\u002Fwww.fiddler.ai\u002Fllmops)                        | Evaluate, monitor, analyze, and improve MLOps and LLMOps from pre-production to production.                                                                                                                                                                                                                                                                                                             |                                                                                                           |\n| [Glide](https:\u002F\u002Fgithub.com\u002FEinStack\u002Fglide)                         | Cloud-Native LLM Routing Engine. Improve LLM app resilience and speed.                                                                                                                                                                                                                                                                                                                                  | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Feinstack\u002Fglide.svg?style=flat-square)                 |\n| [gotoHuman](https:\u002F\u002Fwww.gotohuman.com)                             | Bring a **human into the loop** in your LLM-based and agentic workflows. Prompt users to approve actions, select next steps, or review and validate generated results.                                                                                                                                                                                                                                  |\n| [GPTCache](https:\u002F\u002Fgithub.com\u002Fzilliztech\u002FGPTCache)                 | Creating semantic cache to store responses from LLM queries.                                                                                                                                                                                                                                                                                                                                            | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fzilliztech\u002FGPTCache.svg?style=flat-square)            |\n| [GPUStack](https:\u002F\u002Fgithub.com\u002Fgpustack\u002Fgpustack)                   | An open-source GPU cluster manager for running and managing LLMs                                                                                                                                                                                                                                                                                                                                        | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fgpustack\u002Fgpustack.svg?style=flat-square)              |\n| [Haystack](https:\u002F\u002Fgithub.com\u002Fdeepset-ai\u002Fhaystack)                 | Quickly compose applications with LLM Agents, semantic search, question-answering and more.                                                                                                                                                                                                                                                                                                             | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fdeepset-ai\u002Fhaystack.svg?style=flat-square)            |\n| [Helicone](https:\u002F\u002Fgithub.com\u002FHelicone\u002Fhelicone)                   | Open-source LLM observability platform for logging, monitoring, and debugging AI applications. Simple 1-line integration to get started.                                                                                                                                                                                                                                                                | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fhelicone\u002Fhelicone.svg?style=flat-square)              |\n| [Humanloop](https:\u002F\u002Fhumanloop.com)                                 | The LLM evals platform for enterprises, providing tools to develop, evaluate, and observe AI systems. |                                                                                                |\n| [Hypersigil](https:\u002F\u002Fgithub.com\u002Fhypersigilhq\u002Fhypersigil)           | Open-source prompt lifecycle management and gateway with a Web UI.                         | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fhypersigilhq\u002Fhypersigil.svg?style=flat-square)        | \n| [Izlo](https:\u002F\u002Fgetizlo.com\u002F)                                       | Prompt management tools for teams. Store, improve, test, and deploy your prompts in one unified workspace.                                                                                                                                                                                                                                                                                              |                                                                                                           |\n| [Keywords AI](https:\u002F\u002Fkeywordsai.co\u002F)                              | A unified DevOps platform for AI software. Keywords AI makes it easy for developers to build LLM applications.                                                                                                                                                                                                                                                                                          |                                                                                                           |\n| [MLflow](https:\u002F\u002Fgithub.com\u002Fmlflow\u002Fmlflow\u002Ftree\u002Fmaster)             | An open-source framework for the end-to-end machine learning lifecycle, helping developers track experiments, evaluate models\u002Fprompts, deploy models, and add observability with tracing. | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fmlflow\u002Fmlflow.svg?style=flat-square)  |\n| [Laminar](https:\u002F\u002Fgithub.com\u002Flmnr-ai\u002Flmnr)                         | Open-source all-in-one platform for engineering AI products. Traces, Evals, Datasets, Labels.                                                                                                                                                                                                                                                                                                           | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Flmnr-ai\u002Flmnr.svg?style=flat-square)                   |\n| [langchain](https:\u002F\u002Fgithub.com\u002Fhwchase17\u002Flangchain)                | Building applications with LLMs through composability                                                                                                                                                                                                                                                                                                                                                   | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fhwchase17\u002Flangchain.svg?style=flat-square)            |\n| [LangFlow](https:\u002F\u002Fgithub.com\u002Flogspace-ai\u002Flangflow)                | An effortless way to experiment and prototype LangChain flows with drag-and-drop components and a chat interface.                                                                                                                                                                                                                                                                                       | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Flogspace-ai\u002Flangflow.svg?style=flat-square)           |\n| [Langfuse](https:\u002F\u002Fgithub.com\u002Flangfuse\u002Flangfuse)                   | Open Source LLM Engineering Platform: Traces, evals, prompt management and metrics to debug and improve your LLM application.                                                                                                                                                                                                                                                                           | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Flangfuse\u002Flangfuse.svg?style=flat-square)              |\n| [LangKit](https:\u002F\u002Fgithub.com\u002Fwhylabs\u002Flangkit)                      | Out-of-the-box LLM telemetry collection library that extracts features and profiles prompts, responses and metadata about how your LLM is performing over time to find problems at scale.                                                                                                                                                                                                               | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fwhylabs\u002Flangkit.svg?style=flat-square)                |\n| [LangWatch](https:\u002F\u002Fgithub.com\u002Flangwatch\u002Flangwatch)                | LLM Ops platform with Analytics, Monitoring, Evaluations and an LLM Optimization Studio powered by DSPy | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Flangwatch\u002Flangwatch.svg?style=flat-square) |\n| [LiteLLM 🚅](https:\u002F\u002Fgithub.com\u002FBerriAI\u002Flitellm\u002F)                  | A simple & light 100 line package to **standardize LLM API calls** across OpenAI, Azure, Cohere, Anthropic, Replicate API Endpoints                                                                                                                                                                                                                                                                     | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FBerriAI\u002Flitellm.svg?style=flat-square)                |\n| [Literal AI](https:\u002F\u002Fliteralai.com\u002F)                               | Multi-modal LLM observability and evaluation platform. Create prompt templates, deploy prompts versions, debug LLM runs, create datasets, run evaluations, monitor LLM metrics and collect human feedback.                                                                                                                                                                                              |                                                                                                           |\n| [LlamaIndex](https:\u002F\u002Fgithub.com\u002Fjerryjliu\u002Fllama_index)             | Provides a central interface to connect your LLMs with external data.                                                                                                                                                                                                                                                                                                                                   | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fjerryjliu\u002Fllama_index.svg?style=flat-square)          |\n| [LLMApp](https:\u002F\u002Fgithub.com\u002Fpathwaycom\u002Fllm-app)                    | LLM App is a Python library that helps you build real-time LLM-enabled data pipelines with few lines of code.                                                                                                                                                                                                                                                                                           | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fpathwaycom\u002Fllm-app.svg?style=flat-square)             |\n| [LLMFlows](https:\u002F\u002Fgithub.com\u002Fstoyan-stoyanov\u002Fllmflows)            | LLMFlows is a framework for building simple, explicit, and transparent LLM applications such as chatbots, question-answering systems, and agents.                                                                                                                                                                                                                                                       | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fstoyan-stoyanov\u002Fllmflows.svg?style=flat-square)       |\n| [LRM](https:\u002F\u002Fgithub.com\u002Fnickprotop\u002FLocalizationManager)           | CLI\u002FTUI tool for managing localization files (.resx, JSON, Android, iOS) with LLM-powered translation via Ollama, validation, and code scanning for unused\u002Fmissing keys.                                                                                                                                                                                                                                | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fnickprotop\u002FLocalizationManager.svg?style=flat-square) |\n| [Lunary](https:\u002F\u002Fgithub.com\u002Flunary-ai\u002Flunary)                      | Observability and prompt management for LLM chabots and agents. Debug agents with powerful tracing and logging. Usage analytics and dive deep into the history of your requests. Developer friendly modules with plug-and-play integration into LangChain.                                                                                                                                             | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Flunary-ai\u002Flunary.svg?style=flat-square)            |\n| [magentic](https:\u002F\u002Fgithub.com\u002Fjackmpcollins\u002Fmagentic)              | Seamlessly integrate LLMs as Python functions. Use type annotations to specify structured output. Mix LLM queries and function calling with regular Python code to create complex LLM-powered functionality.                                                                                                                                                                                            | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fjackmpcollins\u002Fmagentic.svg?style=flat-square)         |\n| [Manag.ai](https:\u002F\u002Fwww.manag.ai)                                   | Your all-in-one prompt management and observability platform. Craft, track, and perfect your LLM prompts with ease.                                                                                                                                                                                                                                                                                     |                                                                                                           |\n| [Mirascope](https:\u002F\u002Fgithub.com\u002FMirascope\u002Fmirascope)                | Intuitive convenience tooling for lightning-fast, efficient development and ensuring quality in LLM-based applications                                                                                                                                                                                                                                                                                  | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FMirascope\u002Fmirascope.svg?style=flat-square)            |\n| [Neurolink](https:\u002F\u002Fgithub.com\u002Fjuspay\u002Fneurolink)                   | Multi-provider AI agent framework that unifies 12+ LLM providers (OpenAI, Google, Anthropic, AWS, Azure, Groq, etc.) with workflow orchestration. Production-grade platform for building LLM applications with streaming, tool calling, caching, and enterprise features. Battle-tested at 15M+ requests\u002Fmonth.                                                        | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fjuspay\u002Fneurolink.svg?style=flat-square)               |\n| [OpenLIT](https:\u002F\u002Fgithub.com\u002Fopenlit\u002Fopenlit)                      | OpenLIT is an OpenTelemetry-native GenAI and LLM Application Observability tool and provides OpenTelmetry Auto-instrumentation for monitoring LLMs, VectorDBs and Frameworks. It provides valuable insights into token & cost usage, user interaction, and performance related metrics.                                                                                                                 | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fdokulabs\u002Fdoku.svg?style=flat-square)                  |\n| [Opik](https:\u002F\u002Fgithub.com\u002Fcomet-ml\u002Fopik)                           | Confidently evaluate, test, and ship LLM applications with a suite of observability tools to calibrate language model outputs across your dev and production lifecycle.                                                                                                                                                                                                                                 | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fcomet-ml\u002Fopik.svg?style=flat-square)                  |\n| [Parea AI](https:\u002F\u002Fwww.parea.ai\u002F)                                  | Platform and SDK for AI Engineers providing tools for LLM evaluation, observability, and a version-controlled enhanced prompt playground.                                                                                                                                                                                                                                                               | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fparea-ai\u002Fparea-sdk-py?style=flat-square)              |\n| [Pezzo 🕹️](https:\u002F\u002Fgithub.com\u002Fpezzolabs\u002Fpezzo)                     | Pezzo is the open-source LLMOps platform built for developers and teams. In just two lines of code, you can seamlessly troubleshoot your AI operations, collaborate and manage your prompts in one place, and instantly deploy changes to any environment.                                                                                                                                              | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fpezzolabs\u002Fpezzo.svg?style=flat-square)                |\n| [PraisonAI](https:\u002F\u002Fgithub.com\u002FMervinPraison\u002FPraisonAI)            | Production-ready Multi-AI Agents framework with self-reflection. Fastest agent instantiation (3.77μs), 100+ LLM support via LiteLLM, MCP integration, agentic workflows (route\u002Fparallel\u002Floop\u002Frepeat), built-in memory, Python & JS SDKs.                                                                                                                                                               | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FMervinPraison\u002FPraisonAI.svg?style=flat-square)        |\n| [PromptDX](https:\u002F\u002Fgithub.com\u002Fpuzzlet-ai\u002Fpromptdx)                 | A declarative, extensible, and composable approach for developing LLM prompts using Markdown and JSX. | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fpuzzlet-ai\u002Fpromptdx.svg?style=flat-square) |\n| [PromptHub](https:\u002F\u002Fwww.prompthub.us)                              | Full stack prompt management tool designed to be usable by technical and non-technical team members. Test, version, collaborate, deploy, and monitor, all from one place.                                                                                                                                                                                                                               |                                                                                                           |\n| [promptfoo](https:\u002F\u002Fgithub.com\u002Ftyppo\u002Fpromptfoo)                    | Open-source tool for testing & evaluating prompt quality. Create test cases, automatically check output quality and catch regressions, and reduce evaluation cost.                                                                                                                                                                                                                                      | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Ftyppo\u002Fpromptfoo.svg?style=flat-square)                |\n| [PromptFoundry](https:\u002F\u002Fwww.promptfoundry.ai)                      | The simple prompt engineering and evaluation tool designed for developers building AI applications.                                                                                                                                                                                                                                                                                                     | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Fprompt-foundry\u002Fpython-sdk.svg?style=flat-square)      |\n| [PromptLayer 🍰](https:\u002F\u002Fwww.promptlayer.com)                      | Prompt Engineering platform. Collaborate, test, evaluate, and monitor your LLM applications                                                                                                                                                                                                                                                                                                             | ![Github Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002FMagnivOrg\u002Fprompt-layer-library.svg?style=flat-square) |\n| [PromptMage](https:\u002F\u002Fgithub.com\u002Ftsterbak\u002Fpromptmage)               | Open-source tool to simplify the process of creating and managing LLM workflows and prompts as a self-hosted solution.                                                                                                                                                                                                                                                                                  | ![GitHub Badge](https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fstars\u002Ftsterbak\u002Fpromptmage.svg?style=flat-square)            |\n| [PromptSite](https:\u002F\u002Fgithub.com\u002Fdkuang1980\u002Fpromptsite)               | A lightweight Python library for prompt lifecycle management that helps you version control, track, experiment and debug with your LLM prompts with ease. Minimal setup, no servers, databases, or API keys required - works directly with your local filesystem, ideal for data scientists and engineers to easily integrate into existing LLM workflows     |                   |\n| [Prompteams](https:\u002F\u002Fwww.prompteams.com)                           | Prompt management system. Version, test, collaborate, and retrieve prompts through real-time APIs. Have GitHub style with repos, branches, and commits (and commit history).                                                                                                                                                                                                                            |                                                                                                           |\n| [prompttools](https:\u002F\u002Fgithub.com\u002Fhegelai\u002Fprompttools)              | Open-source tools for testing and experimenting with prompts. The core idea is to enable developers to evaluate prompts using familiar interfaces like code and notebooks. In just a few lines of codes, you can test your prompts and parameters across different models (whether you are using OpenAI, Anthropic, or LLaMA models). You can even evaluate the r","Awesome-LLMOps 是一个精心整理的最佳LLMOps工具列表，旨在帮助开发者选择合适的工具。该项目涵盖了从模型构建、训练到部署的全生命周期管理，包括但不限于大型语言模型、CV基础模型、音频基础模型的服务化、安全框架、向量搜索等关键领域。其内容组织清晰，便于快速查找特定需求下的解决方案。适合于正在寻找高效开发和运维大规模机器学习模型方法的团队或个人使用，尤其是那些希望在AI项目中实现更优性能与安全性的开发者。",2,"2026-06-11 03:38:51","high_star"]