C
🔄 Updated Mar 2026
🖥️ Self-hostable
Overview
Chinese Llm Benchmark is an AI agent in the AI Agents category. ReLE评测:中文AI大模型能力评测(持续更新):目前已囊括335个大模型,覆盖chatgpt、gpt-5.2、o4-mini、谷歌gemini-3-pro、Claude-4.5、文心ERNIE-X1.1、ERNIE-5.0-Thinking、qwen3-max、百川、讯飞星火、商汤senseChat等商用模型, 以及kimi-k2、ernie4.5、minimax-M2、deepseek-v3.2、qwen3-2507、llama4、智谱GLM-4.6、gemma3、mistral等开源大模型。不仅提供排行榜,也提供规模超200万的大模型缺陷库!方便广大社区研究分析、改进大模型。
Problem It Solves
This tool addresses challenges in the ai agents domain.
Target Audience: Developers and teams working with ai agents automation.
Inputs
- • User configuration
- • API credentials (if required)
- • Task parameters
Outputs
- • Automated task results
- • Status reports
- • Generated content or actions
Example Workflow
- 1 User configures the agent with required parameters
- 2 Agent receives input data or trigger
- 3 Agent processes the request using its core logic
- 4 Agent interacts with external services if needed
- 5 Results are returned to the user
Sample System Prompt
You are Chinese Llm Benchmark, an AI assistant. Help the user accomplish their task efficiently.
Tools & Technologies
LLM APIs Python
Alternatives
- • AutoGPT
- • LangChain Agents
- • CrewAI
FAQs
- Is this agent open-source?
- Yes
- Can this agent be self-hosted?
- Yes
- What skill level is required?
- Intermediate
Rate This Agent
Your rating:
Reviews
Loading reviews...
Write a Review
Ready to try this agent?
Chinese Llm Benchmark