Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Task] add MIA-Bench #419

Merged
merged 1 commit into from
Nov 23, 2024
Merged

[Task] add MIA-Bench #419

merged 1 commit into from
Nov 23, 2024

Conversation

Luodian
Copy link
Contributor

@Luodian Luodian commented Nov 23, 2024

MIA-Bench is from Apple and used to evaluate multimodal models instruction following abilities.

Homepage: https://github.com/[apple/ml-mia-bench](https://github.com/apple/ml-mia-bench)
HF Dataset: https://huggingface.co/datasets/lmms-lab/MIA-Bench

✨ feat(mia_bench): introduce new MIA-Bench task configuration and utilities

  • add mia_bench.yaml for task configuration with dataset and evaluation settings
  • implement utility functions for document processing and result generation
  • integrate OpenAI and Azure API support for evaluation
  • provide aggregation functions to calculate overall scores

If you meet the lint warnings, you can use following scripts to reformat code.

pip install pre-commit
pre-commit install
pre-commit run --all-files

Thank you for your contributions!

…lities

- add mia_bench.yaml for task configuration with dataset and evaluation settings
- implement utility functions for document processing and result generation
- integrate OpenAI and Azure API support for evaluation
- provide aggregation functions to calculate overall scores
@Luodian Luodian requested review from pufanyi and kcz358 November 23, 2024 10:52
@Luodian
Copy link
Contributor Author

Luodian commented Nov 23, 2024

image

@Luodian Luodian merged commit 4650689 into main Nov 23, 2024
2 checks passed
@pufanyi pufanyi deleted the dev/add_mia_bench branch November 23, 2024 11:21
ZhaoCinyu pushed a commit to ZhaoCinyu/lmms-eval that referenced this pull request Dec 9, 2024
…lities (EvolvingLMMs-Lab#419)

- add mia_bench.yaml for task configuration with dataset and evaluation settings
- implement utility functions for document processing and result generation
- integrate OpenAI and Azure API support for evaluation
- provide aggregation functions to calculate overall scores
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants