For evaluation, we also use YAML files to manage the evaluation datasets.
For CLEVR and CLEVR-ref, we provide the constructed evaluation metadata here. For other datasets, please refer to their original websites for the data.
With the datasets prepared, please set the correct paths in the config files.
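As a rough illustration, a dataset entry in the YAML config might look like the sketch below; the keys and paths are hypothetical, so adapt them to the schema actually used in this repo's config files.

```yaml
# Hypothetical example entry; adjust keys and paths to the repo's actual config schema.
clevr:
  annotation: /path/to/CLEVR/eval_metadata.json   # constructed evaluation metadata
  image_dir: /path/to/CLEVR/images/val            # images from the original dataset
```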
We provide the evaluation scripts in test_all_benchmark.sh; you can modify the model path and run the entire script, or run only the part you need.
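A minimal sketch of how this step might be run, assuming the model path is passed as an argument; check test_all_benchmark.sh for how it actually expects the model path (argument vs. in-file variable).

```bash
# Hypothetical usage; verify the argument convention inside test_all_benchmark.sh.
MODEL_PATH=/path/to/your/checkpoint
bash test_all_benchmark.sh "$MODEL_PATH"

# To evaluate only some benchmarks, copy the relevant commands out of the
# script and run them individually with the same model path.
```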
Similar to the evaluation, all the metrics can be computed offline with run_metric.sh. For GQA and AMBER, the script only converts the output into the appropriate format; you then need to compute the metrics yourself, following the instructions in LLaVA_for_GQA and AMBER.
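A hedged sketch of the offline metric step, assuming run_metric.sh takes the directory of evaluation outputs as input; see the script itself for the arguments it actually expects.

```bash
# Hypothetical invocation; check run_metric.sh for its real arguments.
RESULT_DIR=/path/to/eval/outputs
bash run_metric.sh "$RESULT_DIR"

# For GQA and AMBER, this produces converted prediction files; compute the final
# scores with the official tools per the LLaVA_for_GQA and AMBER instructions.
```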