AISBench Benchmark Tool
🚀 Get Started
Tool Installation & Uninstallation
Quick Start
🧭 Basic Tutorials
Supported Evaluation Scenarios
Introduction to Evaluation Scenarios
Service-Oriented Accuracy Evaluation
Pure Model Accuracy Evaluation
Guide to Service-Oriented Performance Evaluation
Explanation of Evaluation Results
Detailed Parameter Description
🔬 Advanced Tutorials
Running AISBench with a Custom Configuration File
Service-Oriented Steady-State Performance Testing
Request Sending Rate (RPS) Distribution Control and Visualization Guide
Guide to Multi-Turn Dialogue Evaluation
Guide to Using Random Synthetic Datasets
Guide to Using Custom Datasets
💪 Best Practices
Evaluating the Mathematical Capabilities of DeepSeek-R1-Distill-Qwen-14B Based on NVIDIA A100 Accelerator Card: 100% Paper Reproduction
Evaluating DeepSeek-R1’s Mathematical Capabilities Based on Ascend 800I-A2: 100% Paper Reproduction
❓ FAQs
AISBench FAQ (Frequently Asked Questions)
error codes description
🏷️ Others
🔜 Coming Soon
🤝 Acknowledgments
AISBench Benchmark Tool
Supported Evaluation Scenarios
View page source
Supported Evaluation Scenarios
Introduction to Evaluation Scenarios
Accuracy Evaluation
Performance Evaluation
Service-Oriented Accuracy Evaluation
Preconditions for Service-Oriented Accuracy Evaluation
Main Functional Scenarios
Other Functional Scenarios
Pure Model Accuracy Evaluation
Test Preparation
Main Functions
Other Functions
Guide to Service-Oriented Performance Evaluation
Introduction
Quick Start for Service-Oriented Performance Evaluation
Preconditions for Service-Oriented Performance Evaluation
Main Functional Scenarios
Other Functional Scenarios
Specifications for Service-Oriented Performance Testing