User Configuration Parameters
AISBench Benchmark supports customizing the inference mode and evaluation process through two methods: Command Line Interface (CLI) Parameters and Configuration Constant File.
Command Line Parameters
The basic calling format for command line parameters [OPTIONS] is as follows:
ais_bench [OPTIONS]
Parameter Description
Based on the execution scenario, command line parameters are divided into three categories:
Common Parameters
Accuracy Evaluation Parameters (effective only when
--modeis set toall,infer,eval, orviz)Performance Evaluation Parameters (effective only when
--modeis set toperforperf_viz)
Accuracy Evaluation Parameters take effect only when the --mode parameter is specified as "all", "infer", "eval", "viz". Performance Evaluation Parameters take effect only when the --mode parameter is specified as "perf", "perf_viz". Common Parameters are not restricted by the task execution mode and can be specified in all modes.
Common Parameters
Applicable to all modes and can be used in combination with accuracy or performance parameters.
Parameter |
Description |
Example |
|---|---|---|
|
Specifies the name of the model inference backend task (corresponding to a pre-implemented default model configuration file under the path |
|
|
Specifies the name of the dataset task (corresponding to a pre-implemented default dataset configuration file under the path |
|
|
Specifies the name of the result summary task (corresponding to a pre-implemented default configuration file under the path |
|
|
Running mode, optional values: |
|
|
Specifies the timestamp in an existing working directory to continue execution and overwrite original results. Combined with the value of the |
|
|
Specifies the evaluation working directory for saving output results. The default path is |
|
|
The folder path where the configuration files for |
|
|
Enables Debug mode. This parameter is enabled if configured, and disabled if not configured; it is disabled by default. In Debug mode, all logs are printed directly to the terminal. |
|
|
Enables Dry Run mode (only prints logs to the screen without actually running the task). This parameter is enabled if configured, and disabled if not configured; it is disabled by default. |
|
|
Reserved parameter, not supported temporarily. |
|
|
Enables merged inference for datasets of the same type (running multiple datasets for the same task together). |
|
Accuracy Evaluation Parameters
Effective only when the mode is all, infer, eval, or viz.
Parameter |
Description |
Example |
|---|---|---|
|
Number of parallel tasks, range |
|
|
Switch to enable dumping details of the evaluation process. Enabled if this parameter is configured, disabled if not; disabled by default. |
|
|
Switch to enable dumping evaluation speed data. Enabled if this parameter is configured, disabled if not; disabled by default. |
|
|
Disables Continuous Batch inference (effective only for service-oriented API-type models). Disabled if this parameter is configured, enabled if not; enabled by default. When CB is enabled, multiple processes run concurrently, with a maximum concurrency limit of 500 per process. |
|
Performance Evaluation Parameters
Effective only when the mode is perf or perf_viz.
Parameter |
Description |
Example |
|---|---|---|
|
Specifies the number of data samples for dataset evaluation. A positive integer must be entered. If the value exceeds the total number of dataset samples or no value is specified, the entire dataset is used for evaluation. |
|
|
Switch to enable performance pressure testing mode. Effective only when |
|
Configuration Constant File Parameters
Some global constants are not restricted by task type, and it is recommended to keep their default values. If customization is required, edit the constant file: global_consts.py for configuration.
The currently supported parameter configurations are as follows:
Parameter Name |
Description |
Value Range / Requirements |
|---|---|---|
|
Number of processes used for sending requests. The default value is 0, which means automatic allocation based on the maximum number of concurrent requests configured by the user. |
[0, number of CPU cores] |
|
Specifies the directory path of custom Python packages. The Benchmark tool will load user-defined packages from this directory. |
Must be a local path accessible to the user, pointing to the folder containing custom packages |
|
Duration of pressure testing, effective only when |
|
|
Concurrent thread creation rate. Represents the number of new concurrent threads added per second until the maximum concurrency limit is reached. Effective only when |
|
|
Maximum cache size for a single chunk returned by the streaming inference model backend. The default value is 65535 bytes (64KB). |
|
|
Timeout period for the client to wait for a response after sending a request. The default value is None, meaning infinite waiting (always waiting for the model to return results). |
|