Update README.md

BradyFU · Jun 18, 2024 · commit f95c420
1 parent e9f6a2d
Showing 1 changed file with 5 additions and 1 deletion.
diff --git a/README.md b/README.md
@@ -126,7 +126,7 @@ The best answer is:
 
 📍 **Evaluation**: 
 
-To extract the answer and calculate the scores, we add the model response to a JSON file. Here we provide an [example template](./evaluation/output_test_template.json). Once you have prepared the model responses in this format, please refer to the evaluation script [eval_your_results.py](https://github.com/thanku-all/parse_answer/blob/main/eval_your_results.py), and you will get the accuracy scores across video_durations, video domains, video subcategories, and task types. 
+To extract the answer and calculate the scores, we add the model response to a JSON file. Here we provide an example template [output_test_template.json](./evaluation/output_test_template.json). Once you have prepared the model responses in this format, please refer to the evaluation script [eval_your_results.py](https://github.com/thanku-all/parse_answer/blob/main/eval_your_results.py), and you will get the accuracy scores across video_durations, video domains, video subcategories, and task types. 
 The evaluation does not introduce any third-party models, such as ChatGPT.
 
 ```bash
@@ -139,6 +139,10 @@ python eval_your_results.py \
 ```
 Please ensure that the `results_file` follows the specified JSON format stated above, and `video_duration_type` is specified as either `short`, `medium`, or `long`. If you wish to assess results across various duration types, you can specify multiple types separated by commas or organize them in a list, for example: `short,medium,long` or `["short","medium","long"]`.
 
+📍 **Leaderboard**: 
+
+If you want to add your model to our [leaderboard](https://video-mme.github.io/home_page.html#leaderboard), please send your model responses to **bradyfu24@gmail.com** in the format of [output_test_template.json](./evaluation/output_test_template.json).
+
 
 ## 📈 Experimental Results
 - **Evaluation results of different MLLMs.**
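
Note on the `video_duration_type` formats mentioned in the evaluation text of this diff: the README accepts either a comma-separated string (`short,medium,long`) or a list (`["short","medium","long"]`). The sketch below shows one way both forms could be normalized before calling the script. It is only an illustration: `parse_duration_types` and `VALID_DURATIONS` are hypothetical names, and nothing here is taken from `eval_your_results.py`, which may handle the argument differently.

```python
import json

# Assumption: the three duration names listed in the README are the only valid values.
VALID_DURATIONS = ("short", "medium", "long")


def parse_duration_types(value):
    """Normalize a duration argument into a list of duration names.

    Accepts a Python list/tuple, a JSON-style list string such as
    '["short","medium","long"]', or a comma-separated string such as
    'short,medium,long'. Hypothetical helper, not part of the repository.
    """
    if isinstance(value, (list, tuple)):
        items = list(value)
    else:
        text = value.strip()
        if text.startswith("["):
            # JSON-style list passed as a single string
            items = json.loads(text)
        else:
            # Plain comma-separated form
            items = text.split(",")

    # Trim whitespace and drop empty entries (e.g. from a trailing comma)
    cleaned = [str(item).strip() for item in items if str(item).strip()]
    if not cleaned:
        raise ValueError("no video_duration_type given")

    unknown = [item for item in cleaned if item not in VALID_DURATIONS]
    if unknown:
        raise ValueError(f"unknown video_duration_type: {unknown}")
    return cleaned
```

Validating the value up front means a typo such as `mediun` fails immediately with a clear message rather than partway through scoring.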