Squashed commit of the following:

commit 050b2c3 Merge: 74facb4 ef30651 Author: Li Bo <drluodian@gmail.com> Date: Tue Jun 18 13:13:38 2024 +0800 Merge pull request #114 from zjysteven/add-tinyllava add tinyllava commit ef30651 Author: Jingyang Zhang <jingyang.zhang@duke.edu> Date: Mon Jun 17 17:57:02 2024 -0400 fix typo commit 9bab677 Merge: dbfb238 74facb4 Author: Jingyang Zhang <jingyang.zhang@duke.edu> Date: Sun Jun 16 10:56:05 2024 -0400 Merge branch 'EvolvingLMMs-Lab:main' into add-tinyllava commit 74facb4 Merge: 8ba192f d5df72d Author: Li Bo <drluodian@gmail.com> Date: Sun Jun 16 17:59:19 2024 +0800 Merge pull request #118 from teowu/main Fix the potential risk by PR #117 commit d5df72d Merge: 5bf59ed 8ba192f Author: Teo (Timothy) Wu Haoning <38696372+teowu@users.noreply.github.com> Date: Sun Jun 16 15:32:13 2024 +0800 Merge branch 'EvolvingLMMs-Lab:main' into main commit 5bf59ed Author: teowu <realtimothyhwu@gmail.com> Date: Sun Jun 16 07:27:28 2024 +0000 fix #117, allow auto download with tar format videos commit 98b3955 Merge: a056f11 be9dada Author: teowu <realtimothyhwu@gmail.com> Date: Sun Jun 16 07:25:07 2024 +0000 Merge branch 'main' of https://github.com/teowu/lmms-eval into main commit a056f11 Author: teowu <realtimothyhwu@gmail.com> Date: Sun Jun 16 07:23:54 2024 +0000 fix #117, allow auto download with tar format videos commit 8ba192f Merge: 7cc2890 be9dada Author: Li Bo <drluodian@gmail.com> Date: Sat Jun 15 17:30:59 2024 +0800 Merge pull request #117 from teowu/main LongVideoBench for LMMs-Eval commit be9dada Merge: 62ea8ce 7cc2890 Author: Teo (Timothy) Wu Haoning <38696372+teowu@users.noreply.github.com> Date: Sat Jun 15 16:39:20 2024 +0800 Merge pull request #1 from EvolvingLMMs-Lab/main Merge pull request #113 from teowu/main commit 62ea8ce Author: teowu <realtimothyhwu@gmail.com> Date: Sat Jun 15 08:30:11 2024 +0000 LongVideoBench support: image LMMs (idefics2, phi3) and video LMMs (LLaVA-Next-Video-34B) commit 7cc2890 Merge: 4bc7224 ea14cd4 Author: Li Bo <drluodian@gmail.com> Date: Sat Jun 15 14:10:22 2024 +0800 Merge pull request #113 from teowu/main Q-Bench, Q-Bench2, A-Bench commit dbfb238 Author: Jingyang <jingyang.zhang@duke.edu> Date: Fri Jun 14 16:20:42 2024 -0400 add tinyllava commit ea14cd4 Author: teowu <realtimothyhwu@gmail.com> Date: Fri Jun 14 15:01:52 2024 +0000 Add qbench, qbench2, abench; fix phi3v as its current implementation does not support multi-image commit 4bc7224 Merge: 2797987 bf14cb8 Author: Li Bo <drluodian@gmail.com> Date: Fri Jun 14 02:14:43 2024 +0800 Merge pull request #111 from XinrunDu/main add II-Bench commit bf14cb8 Author: XinrunDu <duxinrun2000@gmail.com> Date: Thu Jun 13 09:37:02 2024 +0000 fix dataset_path commit 6248113 Author: XinrunDu <duxinrun2000@gmail.com> Date: Thu Jun 13 09:32:06 2024 +0000 add II-Bench commit 2797987 Merge: 63d82f1 66d4bb2 Author: Li Bo <drluodian@gmail.com> Date: Thu Jun 13 11:14:47 2024 +0800 Merge pull request #109 from EvolvingLMMs-Lab/pufanyi/update_version [Small Update] Update the version of LMMs-Eval commit 66d4bb2 Author: Fanyi Pu <FPU001@e.ntu.edu.sg> Date: Thu Jun 13 11:13:00 2024 +0800 update version commit 63d82f1 Author: Li Bo <drluodian@gmail.com> Date: Thu Jun 13 11:04:32 2024 +0800 Update README.md commit 44a3379 Merge: 5ed0035 0ce46d0 Author: Li Bo <drluodian@gmail.com> Date: Thu Jun 13 04:00:12 2024 +0800 Merge pull request #105 from tianyu-z/main Include VCR commit 0ce46d0 Author: Suyuchen <suyuchen.wang@umontreal.ca> Date: Wed Jun 12 15:56:34 2024 -0400 update README.md commit 46a88d8 Merge: 47b13b9 5ed0035 Author: Suyuchen <suyuchen.wang@umontreal.ca> Date: Wed Jun 12 15:50:26 2024 -0400 merged readme.md commit 47b13b9 Author: Suyuchen <suyuchen.wang@umontreal.ca> Date: Wed Jun 12 15:30:52 2024 -0400 update aggregation function for vcr_wiki commit 5ed0035 Author: Li Bo <drluodian@gmail.com> Date: Thu Jun 13 03:21:42 2024 +0800 Update README.md commit ed88068 Author: Li Bo <drluodian@gmail.com> Date: Thu Jun 13 03:13:59 2024 +0800 Update README.md commit fea3806 Merge: d99a24a 05dc8e8 Author: Li Bo <drluodian@gmail.com> Date: Thu Jun 13 03:11:49 2024 +0800 Merge pull request #108 from EvolvingLMMs-Lab/internal_main_dev [Upgrade to v0.2] Embracing Video Evaluations with LMMs-Eval commit 05dc8e8 Author: Bo Li <drluodian@gmail.com> Date: Wed Jun 12 15:56:04 2024 +0000 chore: Update lmms-eval to support video evaluations for LLaVA models commit cbeee20 Author: Bo Li <drluodian@gmail.com> Date: Wed Jun 12 15:50:30 2024 +0000 chore: Update lmms-eval to support video evaluations for LLaVA models commit f00d549 Author: Bo Li <drluodian@gmail.com> Date: Wed Jun 12 15:46:33 2024 +0000 Update image alignment in README.md commit 3415633 Author: Bo Li <drluodian@gmail.com> Date: Wed Jun 12 15:43:16 2024 +0000 Update llava conv_template in lmms_eval/models/llava.py commit 50575a9 Author: Bo Li <drluodian@gmail.com> Date: Wed Jun 12 15:39:03 2024 +0000 chore: Update lmms-eval to support video evaluations for LLaVA models commit c9b2252 Author: Bo Li <drluodian@gmail.com> Date: Wed Jun 12 15:33:48 2024 +0000 Bump version to 0.2.0.dev0 commit 465bd42 Merge: e43bd84 d99a24a Author: Bo Li <drluodian@gmail.com> Date: Wed Jun 12 15:04:25 2024 +0000 Merge branch 'main' of https://github.com/EvolvingLMMs-Lab/lmms-eval into internal_main_dev commit e43bd84 Author: Bo Li <drluodian@gmail.com> Date: Wed Jun 12 14:54:06 2024 +0000 chore: Remove unnecessary files and code related to live_bench and sft_eval tasks commit d99a24a Merge: 374590b a66003b Author: Li Bo <drluodian@gmail.com> Date: Wed Jun 12 19:45:57 2024 +0800 Merge pull request #107 from AtsuMiyai/new_task/upd_update update gpt-3.5-turbo version commit a66003b Author: AtsuMiyai <miyai.atsuyuki.practice@gmail.com> Date: Wed Jun 12 17:05:17 2024 +0900 update gpt-3.5-turbo version commit ee91f27 Author: AtsuMiyai <miyai.atsuyuki.practice@gmail.com> Date: Wed Jun 12 16:50:53 2024 +0900 update gpt-3.5-turbo version commit 326b969 Author: tianyu-z <zhangtianyupro@gmail.com> Date: Mon Jun 10 20:07:40 2024 -0400 include std and confidence interval commit cd050d4 Author: Suyuchen <suyuchen.wang@umontreal.ca> Date: Mon Jun 10 18:49:47 2024 -0400 update vcr_wiki tasks in README.md commit 205721e Author: Suyuchen <suyuchen.wang@umontreal.ca> Date: Mon Jun 10 18:43:15 2024 -0400 update vcr_wiki tasks commit db8e718 Author: tianyu-z <zhangtianyupro@gmail.com> Date: Mon Jun 10 16:13:58 2024 -0400 include the try-except logic for spacy commit 427dabb Author: Suyuchen <suyuchen.wang@umontreal.ca> Date: Mon Jun 10 15:51:05 2024 -0400 add crossed_text to vcr_wiki output commit 043b483 Author: tianyu-z <zhangtianyupro@gmail.com> Date: Mon Jun 10 15:47:00 2024 -0400 switch logic commit e1f04db Author: tianyu-z <zhangtianyupro@gmail.com> Date: Mon Jun 10 02:38:21 2024 -0400 modify the form of VCR commit 96e8d98 Author: tianyu-z <zhangtianyupro@gmail.com> Date: Mon Jun 10 00:10:30 2024 -0400 init include vcr commit 374590b Merge: 504685e cb3b9ce Author: Kaichen Zhang - NTU <kaichenzhang358@outlook.com> Date: Fri Jun 7 20:25:48 2024 +0800 Merge pull request #101 from Gumpest/main Update conbench in README commit 504685e Author: Li Bo <drluodian@gmail.com> Date: Thu Jun 6 15:42:15 2024 +0800 Update README.md commit cb3b9ce Merge: c9793b3 67b64ea Author: Yuan Zhang <56063339+Gumpest@users.noreply.github.com> Date: Thu Jun 6 11:22:24 2024 +0800 Merge branch 'EvolvingLMMs-Lab:main' into main commit c9793b3 Author: Yuan Zhang <gump_well_done@163.com> Date: Thu Jun 6 11:21:05 2024 +0800 update README commit 67b64ea Merge: 8ee7848 5fd6845 Author: Li Bo <drluodian@gmail.com> Date: Wed Jun 5 23:12:58 2024 +0800 Merge pull request #100 from Gumpest/main add Conbench commit 5fd6845 Author: Yuan Zhang <gump_well_done@163.com> Date: Wed Jun 5 21:52:31 2024 +0800 add conbench commit 8ee7848 Merge: 747e197 6fefaf7 Author: Li Bo <drluodian@gmail.com> Date: Tue Jun 4 17:09:33 2024 +0800 Merge pull request #95 from AtsuMiyai/new_task/upd add MM-UPD commit 747e197 Merge: 4854a34 0584307 Author: Li Bo <drluodian@gmail.com> Date: Tue Jun 4 17:09:04 2024 +0800 Merge pull request #97 from CaraJ7/update Add MathVerse in README.md commit 6fefaf7 Author: AtsuMiyai <miyai.atsuyuki.practice@gmail.com> Date: Tue Jun 4 17:36:39 2024 +0900 update utils.py for leaderboard submission commit 5f4fe36 Author: AtsuMiyai <miyai.atsuyuki.practice@gmail.com> Date: Sun Jun 2 23:28:27 2024 +0900 slightly change query_prompt for the reproduction commit 0584307 Author: CaraJ7 <1350074492@qq.com> Date: Sun Jun 2 17:05:28 2024 +0800 Add MathVerse in README.md commit 0581ab3 Author: AtsuMiyai <miyai.atsuyuki.practice@gmail.com> Date: Fri May 31 16:09:45 2024 +0900 merge model_specific_prompt_kwargs and dataset_name into each task yaml commit 4854a34 Author: Pu Fanyi <FPU001@e.ntu.edu.sg> Date: Sat May 4 19:23:39 2024 +0800 Group MMMU images into one image (#83) * update * update font * Add matplotlib.font_manager import in utils.py * Refactor font handling in add_order_label function in utils.py * group mmmu --------- Co-authored-by: Li Bo <drluodian@gmail.com> commit d224794 Author: AtsuMiyai <miyai.atsuyuki.practice@gmail.com> Date: Wed May 29 15:15:59 2024 +0900 add upd commit 453e793 Author: AtsuMiyai <miyai.atsuyuki.practice@gmail.com> Date: Wed May 29 15:03:30 2024 +0900 add upd commit 909edd6 Author: AtsuMiyai <miyai.atsuyuki.practice@gmail.com> Date: Wed May 29 12:52:21 2024 +0900 add upd commit 7c1ac97 Author: AtsuMiyai <miyai.atsuyuki.practice@gmail.com> Date: Wed May 29 12:50:32 2024 +0900 add upd commit 811301c Author: AtsuMiyai <miyai.atsuyuki.practice@gmail.com> Date: Wed May 29 12:46:58 2024 +0900 add upd commit 71401ba Author: AtsuMiyai <miyai.atsuyuki.practice@gmail.com> Date: Wed May 29 12:41:21 2024 +0900 add upd commit 24dc435 Author: Bo Li <drluodian@gmail.com> Date: Mon May 27 10:17:32 2024 +0000 fix compatibility issue of older version llava commit 616edf4 Author: Bo Li <drluodian@gmail.com> Date: Mon May 27 09:32:26 2024 +0000 [Fix] import issues of multilingual llava and olympiadbench commit 4c5a99e Merge: 45c05b2 b05c3e2 Author: Li Bo <drluodian@gmail.com> Date: Mon May 27 14:19:53 2024 +0800 Merge pull request #87 from vfragoso/vifragos/phi3v Adding microsoft/Phi-3-vision-128k-instruct model. commit b05c3e2 Author: Victor Fragoso <victor.fragoso@microsoft.com> Date: Fri May 24 16:36:37 2024 +0000 Adding documentation of Phi3v class. commit c200897 Author: Victor Fragoso <victor.fragoso@microsoft.com> Date: Fri May 24 16:25:02 2024 +0000 Adding prompt arguments for Phi3v on MathVista-TestMini commit 7f9fb6b Author: Victor Fragoso <victor.fragoso@microsoft.com> Date: Fri May 24 13:24:16 2024 +0000 Adding Phi3v model. commit 45c05b2 Author: kcz358 <kaichenzhang358@outlook.com> Date: Thu May 23 03:47:36 2024 +0000 Set printing info for llava_hf to debug level commit 53f013e Author: kcz358 <kaichenzhang358@outlook.com> Date: Thu May 23 03:41:39 2024 +0000 Fix pope random name in pope full commit 22520a9 Author: kcz358 <kaichenzhang358@outlook.com> Date: Thu May 23 03:41:14 2024 +0000 Add separated pope tasks by category commit d1eefb1 Author: kcz358 <kaichenzhang358@outlook.com> Date: Thu May 9 08:36:02 2024 +0000 Update gitignore commit b2b4dbd Author: kcz358 <kaichenzhang358@outlook.com> Date: Mon May 20 07:45:11 2024 +0000 Comment out Spice in caption task so that don't need to download stanford nlp model commit 662f05c Author: kcz358 <kaichenzhang358@outlook.com> Date: Mon May 20 03:13:13 2024 +0000 Comment out parse result in xcomposer commit 0932932 Author: kcz358 <kaichenzhang358@outlook.com> Date: Thu May 16 03:55:39 2024 +0000 Fix instructblip qformer size mismatch and multi-images problem commit 557a6a3 Author: kcz358 <kaichenzhang358@outlook.com> Date: Thu May 16 03:11:41 2024 +0000 Remove redundant code in fuyu commit 6aeb550 Author: kcz358 <kaichenzhang358@outlook.com> Date: Thu May 16 01:45:24 2024 +0000 Fix idefics2 llava in the wild bugs commit aea80e6 Author: kcz358 <kaichenzhang358@outlook.com> Date: Wed May 15 11:07:35 2024 +0000 Better task list_with_num commit 3c12a08 Author: Li Bo <drluodian@gmail.com> Date: Sat May 18 02:35:52 2024 +0800 Update LICENSE commit 82317a6 Author: Li Bo <drluodian@gmail.com> Date: Sat May 18 02:29:09 2024 +0800 Update LICENSE commit a8bba1c Author: Li Bo <drluodian@gmail.com> Date: Sat May 18 02:28:03 2024 +0800 Create LICENSE commit caa5893 Merge: c094448 423b006 Author: Li Bo <drluodian@gmail.com> Date: Mon May 13 11:45:26 2024 +0800 Merge pull request #73 from EvolvingLMMs-Lab/kc/qwen_vl_api [Feat] Add qwen vl api commit c094448 Author: kcz358 <kaichenzhang358@outlook.com> Date: Sat May 11 06:11:19 2024 +0000 Fix llava_hf image tokens number issue commit 64f07e4 Author: kcz358 <kaichenzhang358@outlook.com> Date: Thu May 9 02:04:10 2024 +0000 Fix endless warning for llava_hf generation commit 8aaa828 Author: Bo Li <drluodian@gmail.com> Date: Thu May 2 06:13:56 2024 +0000 Add model_name parameter to Llava constructor commit 7847dc4 Author: kcz358 <kaichenzhang358@outlook.com> Date: Tue May 7 03:15:59 2024 +0000 Parse result for llava_hf 1.6 commit 3e56b4f Author: kcz358 <kaichenzhang358@outlook.com> Date: Tue May 7 03:09:56 2024 +0000 Fix llava_hf generation for 1.6 commit fa3ff92 Author: kcz358 <kaichenzhang358@outlook.com> Date: Mon May 6 08:32:57 2024 +0000 Fix llava conv template for llama3 commit 423b006 Author: kcz358 <kaichenzhang358@outlook.com> Date: Sun May 5 07:54:52 2024 +0000 Add qwen vl api commit b7fd7a9 Merge: 986139a c5a130b Author: Li Bo <drluodian@gmail.com> Date: Sun May 5 13:19:48 2024 +0800 Merge pull request #59 from EvolvingLMMs-Lab/add_idefics2 add idefics2 commit 986139a Merge: b46239c 8d3526c Author: Li Bo <drluodian@gmail.com> Date: Fri May 3 01:18:18 2024 +0800 Merge pull request #36 from cocoshe/main [Fix] repr llava doc commit b46239c Merge: bc69a74 373265f Author: Li Bo <drluodian@gmail.com> Date: Fri May 3 01:17:34 2024 +0800 Merge pull request #56 from gagan3012/main Multilingual LLava bench commit bc69a74 Merge: eef3aeb 626e8a9 Author: Li Bo <drluodian@gmail.com> Date: Fri May 3 01:12:14 2024 +0800 Merge pull request #70 from hunterheiden/hsh/new_task/WebSRC Bugfix: WebSRC should be token-level F1 NOT character-level commit 626e8a9 Author: Hunter Heidenreich <hunter.heidenreich@rootsautomation.com> Date: Thu May 2 09:31:03 2024 -0400 Bugfix: WebSRC should be token-level F1 NOT character-level commit eef3aeb Merge: c4e9dd9 9bca441 Author: Li Bo <drluodian@gmail.com> Date: Thu May 2 14:38:17 2024 +0800 Merge pull request #69 from hunterheiden/hsh/new_task/WebSRC [New Task] WebSRC (multimodal Q&A on web screenshots) commit 9bca441 Author: Hunter Heidenreich <hunter.heidenreich@rootsautomation.com> Date: Wed May 1 11:07:29 2024 -0400 Add code to enable compilation of submission for WebSRC test split commit 7687495 Author: Hunter Heidenreich <hunter.heidenreich@rootsautomation.com> Date: Wed May 1 10:47:32 2024 -0400 Draft and validate websrc eval on dev split commit 4eebd3e Author: Hunter Heidenreich <hunter.heidenreich@rootsautomation.com> Date: Wed May 1 10:46:54 2024 -0400 Update main README with new task names commit 35fe80b Author: Hunter Heidenreich <hunter.heidenreich@rootsautomation.com> Date: Wed May 1 10:46:20 2024 -0400 Draft README for WebSRC commit 955bd06 Author: Hunter Heidenreich <hunter.heidenreich@rootsautomation.com> Date: Tue Apr 30 10:16:21 2024 -0400 Init webSRC commit c4e9dd9 Merge: d8a3a99 319afcc Author: Li Bo <drluodian@gmail.com> Date: Fri Apr 26 14:37:22 2024 +0800 Merge pull request #63 from hunterheiden/hsh/new_task/screenspot New Task: ScreenSpot - Grounding (REC) and instruction generation (REG) on screens commit 319afcc Author: Hunter Heidenreich <hunter.heidenreich@rootsautomation.com> Date: Thu Apr 25 11:44:34 2024 -0400 slight update commit 2f3811c Author: Hunter Heidenreich <hunter.heidenreich@rootsautomation.com> Date: Thu Apr 25 11:41:04 2024 -0400 Add README file specific to ScreenSpot commit 28962cb Author: Hunter Heidenreich <hunter.heidenreich@rootsautomation.com> Date: Wed Apr 24 11:52:33 2024 -0400 Update README to reflect new tasks commit e457cfb Author: Hunter Heidenreich <hunter.heidenreich@rootsautomation.com> Date: Tue Apr 23 18:33:16 2024 -0400 Create ScreenSpot on clean branch commit d8a3a99 Merge: 3dcd015 ed17129 Author: Li Bo <drluodian@gmail.com> Date: Tue Apr 23 10:34:03 2024 +0800 Merge pull request #61 from tupini07/patch-1 Fix typo in Qwen-VL that was causing "reference before assignment" commit ed17129 Author: Andrea Tupini <tupini07@gmail.com> Date: Mon Apr 22 14:56:41 2024 -0600 refactor query construction for clarity commit cd87420 Author: Andrea Tupini <tupini07@gmail.com> Date: Mon Apr 22 14:54:29 2024 -0600 convert contexts to list if necessary and remove unnecessary construction of `questions` commit 8557367 Author: Andrea Tupini <tupini07@gmail.com> Date: Mon Apr 22 14:47:33 2024 -0600 Fix typo in qwen_vl that was causing "reference before assignment" commit 3dcd015 Merge: 95df9fe 743673a Author: Li Bo <drluodian@gmail.com> Date: Sat Apr 20 22:03:16 2024 +0800 Merge pull request #60 from CaraJ7/main Add MathVerse commit 743673a Merge: c1a5472 95df9fe Author: CaraJ7 <1350074492@qq.com> Date: Sat Apr 20 21:49:02 2024 +0800 Merge branch 'main' of https://github.com/EvolvingLMMs-Lab/lmms-eval commit c1a5472 Author: CaraJ7 <1350074492@qq.com> Date: Sat Apr 20 21:45:34 2024 +0800 Add MathVerse commit 373265f Author: Gagan Bhatia <49101362+gagan3012@users.noreply.github.com> Date: Fri Apr 12 17:21:39 2024 -0700 Add files via upload commit d853051 Author: Gagan Bhatia <49101362+gagan3012@users.noreply.github.com> Date: Fri Apr 12 17:19:49 2024 -0700 Create README.md commit 8d3526c Author: cocoshe <1228759711@qq.com> Date: Thu Mar 28 13:38:36 2024 +0800 fix doc
EvolvingLMMs-Lab · Jun 18, 2024 · 59cce73 · 59cce73
1 parent 791e087
commit 59cce73
Show file tree

Hide file tree

Showing 187 changed files with 5,339 additions and 2,986 deletions.
diff --git a/LICENSE b/LICENSE
@@ -0,0 +1,56 @@
+# For the main pipeline structure-related code, we maintain the original license provided with lm-evaluation-harness, which is the MIT License.
+
+MIT License
+
+Copyright (c) 2024 LMMs-Lab
+
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.
+
+# For the multimodal models and datasets that we have added (defined as code in the lmms_eval/tasks and lmms_eval/models folders), we apply the Apache License.
+
+Apache 2.0 License
+
+Copyright (c) 2024 LMMs-Lab
+
+Licensed under the Apache License, Version 2.0 (the "License");
+you may not use this file except in compliance with the License.
+You may obtain a copy of the License at
+
+    http://www.apache.org/licenses/LICENSE-2.0
+
+Unless required by applicable law or agreed to in writing, software
+distributed under the License is distributed on an "AS IS" BASIS,
+WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+See the License for the specific language governing permissions and
+limitations under the License.
+
+When modifying the code, please include the following information about the original lmms-eval source:
+# Adopted from lmms-eval from https://github.com/EvolvingLMMs-Lab/lmms-eval. Below is the original copyright:
+#
+#    Licensed under the Apache License, Version 2.0 (the "License");
+#    you may not use this file except in compliance with the License.
+#    You may obtain a copy of the License at
+#
+#        http://www.apache.org/licenses/LICENSE-2.0
+#
+#    Unless required by applicable law or agreed to in writing, software
+#    distributed under the License is distributed on an "AS IS" BASIS,
+#    WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+#    See the License for the specific language governing permissions and
+#    limitations under the License.
diff --git a/README.md b/README.md
diff --git a/docs/current_tasks.md b/docs/current_tasks.md
@@ -0,0 +1,122 @@
+# Current Tasks
+
+> () indicates the task name in the lmms_eval. The task name is also used to specify the dataset in the configuration file.
+> The following is manually updated documentation. You could use `lmms_eval task --list` to list all supported tasks and their task names. 
+
+- AI2D (ai2d)
+- ChartQA (chartqa)
+- CMMMU (cmmmu)
+  - CMMMU Validation (cmmmu_val)
+  - CMMMU Test (cmmmu_test)
+- COCO Caption (coco_cap)
+  - COCO 2014 Caption (coco2014_cap)
+    - COCO 2014 Caption Validation (coco2014_cap_val)
+    - COCO 2014 Caption Test (coco2014_cap_test)
+  - COCO 2017 Caption (coco2017_cap)
+    - COCO 2017 Caption MiniVal (coco2017_cap_val)
+    - COCO 2017 Caption MiniTest (coco2017_cap_test)
+- [ConBench](https://github.com/foundation-multimodal-models/ConBench) (conbench)
+- DOCVQA (docvqa)
+  - DOCVQA Validation (docvqa_val)
+  - DOCVQA Test (docvqa_test)
+- Ferret (ferret)
+- Flickr30K (flickr30k)
+  - Ferret Test (ferret_test)
+- GQA (gqa)
+- HallusionBenchmark (hallusion_bench_image)
+- Infographic VQA (info_vqa)
+  - Infographic VQA Validation (info_vqa_val)
+  - Infographic VQA Test (info_vqa_test)
+- LLaVA-Bench (llava_in_the_wild)
+- LLaVA-Bench-COCO (llava_bench_coco)
+- MathVerse (mathverse)
+  - MathVerse Text Dominant (mathverse_testmini_text_dominant)
+  - MathVerse Text Only (mathverse_testmini_text_only)
+  - MathVerse Text Lite (mathverse_testmini_text_lite)
+  - MathVerse Vision Dominant (mathverse_testmini_vision_dominant)
+  - MathVerse Vision Intensive (mathverse_testmini_vision_intensive)
+  - MathVerse Vision Only (mathverse_testmini_vision_only)
+- MathVista (mathvista)
+  - MathVista Validation (mathvista_testmini)
+  - MathVista Test (mathvista_test)
+- MMBench (mmbench)
+  - MMBench English (mmbench_en)
+    - MMBench English Dev (mmbench_en_dev)
+    - MMBench English Test (mmbench_en_test)
+  - MMBench Chinese (mmbench_cn)
+    - MMBench Chinese Dev (mmbench_cn_dev)
+    - MMBench Chinese Test (mmbench_cn_test)
+- MME (mme)
+- MMMU (mmmu)
+  - MMMU Validation (mmmu_val)
+  - MMMU Test (mmmu_test)
+- MMUPD (mmupd)
+  - MMUPD Base (mmupd_base)
+    - MMAAD Base (mmaad_base)
+    - MMIASD Base (mmiasd_base)
+    - MMIVQD Base (mmivqd_base)
+  - MMUPD Option (mmupd_option)
+    - MMAAD Option (mmaad_option)
+    - MMIASD Option (mmiasd_option)
+    - MMIVQD Option (mmivqd_option)
+  - MMUPD Instruction (mmupd_instruction)
+    - MMAAD Instruction (mmaad_instruction)
+    - MMIASD Instruction (mmiasd_instruction)
+    - MMIVQD Instruction (mmivqd_instruction)
+- MMVet (mmvet)
+- Multi-DocVQA (multidocvqa)
+  - Multi-DocVQA Validation (multidocvqa_val)
+  - Multi-DocVQA Test (multidocvqa_test)
+- NoCaps (nocaps)
+  - NoCaps Validation (nocaps_val)
+  - NoCaps Test (nocaps_test)
+- OKVQA (ok_vqa)
+  - OKVQA Validation 2014 (ok_vqa_val2014)
+- POPE (pope)
+- RefCOCO (refcoco)
+    - refcoco_seg_test
+    - refcoco_seg_val
+    - refcoco_seg_testA
+    - refcoco_seg_testB
+    - refcoco_bbox_test
+    - refcoco_bbox_val
+    - refcoco_bbox_testA
+    - refcoco_bbox_testB
+- RefCOCO+ (refcoco+)
+    - refcoco+_seg
+        - refcoco+_seg_val
+        - refcoco+_seg_testA
+        - refcoco+_seg_testB
+    - refcoco+_bbox
+        - refcoco+_bbox_val
+        - refcoco+_bbox_testA
+        - refcoco+_bbox_testB
+- RefCOCOg (refcocog)
+    - refcocog_seg_test
+    - refcocog_seg_val
+    - refcocog_bbox_test
+    - refcocog_bbox_val
+- ScienceQA (scienceqa_full)
+  - ScienceQA Full (scienceqa)
+  - ScienceQA IMG (scienceqa_img)
+- ScreenSpot (screenspot)
+  - ScreenSpot REC / Grounding (screenspot_rec)
+  - ScreenSpot REG / Instruction Generation (screenspot_reg)
+- SeedBench (seedbench)
+- SeedBench 2 (seedbench_2)
+- ST-VQA (stvqa)
+- TextCaps (textcaps)
+  - TextCaps Validation (textcaps_val)
+  - TextCaps Test (textcaps_test)
+- TextVQA (textvqa)
+  - TextVQA Validation (textvqa_val)
+  - TextVQA Test (textvqa_test)
+- VizWizVQA (vizwiz_vqa)
+  - VizWizVQA Validation (vizwiz_vqa_val)
+  - VizWizVQA Test (vizwiz_vqa_test)
+- VQAv2 (vqav2)
+  - VQAv2 Validation (vqav2_val)
+  - VQAv2 Test (vqav2_test)
+- WebSRC (websrc)
+  - WebSRC Validation (websrc_val)
+  - WebSRC Test (websrc_test)
diff --git a/lmms_eval/api/task.py b/lmms_eval/api/task.py
@@ -778,6 +778,7 @@ def _download_from_youtube(path):
                     force_unzip = dataset_kwargs.get("force_unzip", False)
                     cache_path = snapshot_download(repo_id=self.DATASET_PATH, repo_type="dataset", force_download=force_download, etag_timeout=60)
                     zip_files = glob(os.path.join(cache_path, "**/*.zip"), recursive=True)
+                    tar_files = glob(os.path.join(cache_path, "**/*.tar*"), recursive=True)
 
                     def unzip_video_data(zip_file):
                         import zipfile
@@ -786,10 +787,51 @@ def unzip_video_data(zip_file):
                             zip_ref.extractall(cache_dir)
                             eval_logger.info(f"Extracted all files from {zip_file} to {cache_dir}")
 
+                    def untar_video_data(tar_file):
+                        import tarfile
+
+                        with tarfile.open(tar_file, "r") as tar_ref:
+                            tar_ref.extractall(cache_dir)
+                            eval_logger.info(f"Extracted all files from {tar_file} to {cache_dir}")
+
+                    def concat_tar_parts(tar_parts, output_tar):
+                        with open(output_tar, "wb") as out_tar:
+                            from tqdm import tqdm
+
+                            for part in tqdm(sorted(tar_parts)):
+                                with open(part, "rb") as part_file:
+                                    out_tar.write(part_file.read())
+                        eval_logger.info(f"Concatenated parts {tar_parts} into {output_tar}")
+
+                    # Unzip zip files if needed
                     if force_unzip or (not os.path.exists(cache_dir) and len(zip_files) > 0):
                         for zip_file in zip_files:
                             unzip_video_data(zip_file)
 
+                    # Concatenate and extract tar files if needed
+                    if force_unzip or (not os.path.exists(cache_dir) and len(tar_files) > 0):
+                        tar_parts_dict = {}
+
+                        # Group tar parts together
+                        for tar_file in tar_files:
+                            base_name = tar_file.split(".tar")[0]
+                            if base_name not in tar_parts_dict:
+                                tar_parts_dict[base_name] = []
+                            tar_parts_dict[base_name].append(tar_file)
+
+                        # Concatenate and untar split parts
+                        for base_name, parts in tar_parts_dict.items():
+                            eval_logger.info(f"Extracting following tar files: {parts}")
+                            output_tar = base_name + ".tar"
+                            if not os.path.exists(output_tar):
+                                eval_logger.info(f"Start concatenating tar files")
+
+                                concat_tar_parts(parts, output_tar)
+                                eval_logger.info(f"Finish concatenating tar files")
+
+                            if not os.path.exists(os.path.join(cache_dir, os.path.basename(base_name))):
+                                untar_video_data(output_tar)
+
                 accelerator.wait_for_everyone()
                 dataset_kwargs.pop("cache_dir")
                 dataset_kwargs.pop("video")

diff --git a/lmms_eval/models/__init__.py b/lmms_eval/models/__init__.py
@@ -26,7 +26,9 @@
     "reka": "Reka",
     "llava_onevision": "Llava_OneVision",
     "from_log": "FromLog",
-    "mplugOwlVideo": "mplug_Owl",
+    "mplug_owl_video": "mplug_Owl",
+    "phi3v": "Phi3v",
+    "tinyllava": "TinyLlava",
 }
 
 for model_name, model_class in AVAILABLE_MODELS.items():