giskard-oss

Giskard-AI

🐢 Open-Source Evaluation & Testing library for LLM Agents

AI Summary

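Giskard is an open-source evaluation and testing framework for LLM agents (agentic systems). It supports functional validation, security red teaming, and dedicated RAG system evaluation in dynamic multi-turn interaction scenarios. Its core features are a modular design, lightweight dependencies, an async-first architecture, and built-in AI vulnerability scanning with fairness and robustness checks. It is suited to LLM application development, pre-release quality assurance for AI products, RAG system tuning, and responsible AI practice.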

Python

Apache License 2.0

agent-evaluation ai-red-team ai-security ai-testing fairness-ai llm llm-eval llm-evaluation llm-security llmops ml-testing ml-validation mlops rag-evaluation red-team-tools responsible-ai trustworthy-ai


Stars: 5.5k

Forks: 481


Star growth

Today: 0

Last 7 days: 0

Last 30 days: +28

Overall score: 66.85

Default branch: main
