-
Notifications
You must be signed in to change notification settings - Fork 3
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge branch 'main' of https://github.com/KennethEnevoldsen/scandinav…
…ian-embedding-benchmark into add-summarization
- Loading branch information
Showing
61 changed files
with
1,071 additions
and
314 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file was deleted.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,41 @@ | ||
<!-- This file is auto-generated --> | ||
|
||
# Command Line Interface | ||
|
||
Documentation for the command line interface of SEB. | ||
|
||
## CLI | ||
|
||
### `run` | ||
|
||
Runs the Benchmark on a specified model. | ||
|
||
**Examples:** | ||
|
||
**Examples:** | ||
|
||
To run a model on all languages and tasks | ||
|
||
```{bash} | ||
seb run sentence-transformers/all-MiniLM-L6-v2 -o results.json | ||
``` | ||
|
||
if you only want to limit it to a subset of languages or tasks you can use the `--languages` and `--tasks` flags. | ||
```{bash} | ||
# Running a model on a subset of languages | ||
seb run sentence-transformers/all-MiniLM-L6-v2 -o results.json -l nb nn | ||
# Running a model on a subset of tasks | ||
seb run sentence-transformers/all-MiniLM-L6-v2 -o results.json -t DKHate ScaLA | ||
``` | ||
|
||
| Argument | Type | Description | Default | | ||
| --------------------- | ------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------- | -------- | | ||
| `model_name` | `str` | The model name or path. If the model is not registrered in SEB it will be loaded using SentenceTransformers. | | | ||
| `--output-path`, `-o` | `Path` | The path to save the output to. Can be a directory. | | | ||
| `--languages`, `-l` | `Optional[list[str], NoneType]` | What languages subsection to run the benchmark on. If left blank it will run it on all languages. | `None` | | ||
| `--tasks`, `-t` | `Optional[list[str], NoneType]` | What tasks should model be run on. Default to all tasks within the specified languages. | `None` | | ||
| `--ignore-cache` | `bool` | Ignores caches models. Note that SEB ships with an existing cache. You can set the cache_dir using the environmental variable SEB_CACHE_DIR | `False` | | ||
| `--ignore-errors` | `bool` | Should errors be ignored when running a model on a benchmark task. | `False` | | ||
| `--code`, `-c` | `Path` | Code to run before executing benchmark. Useful for adding custom model to registries. | `None` | | ||
| `--logging-level` | `str` | Logging level for the benchmark. | `'INFO'` | |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,15 @@ | ||
# This just created a rough draft of the CLI documentation. It is not | ||
# intended to be used for anything other than a starting point. | ||
# at least we would need this issue fixed first: | ||
# https://github.com/explosion/radicli/issues/30 | ||
|
||
from pathlib import Path | ||
|
||
from seb.cli import cli | ||
|
||
title = "Command Line Interface" | ||
description = "Documentation for the command line interface of SEB." | ||
|
||
if __name__ == "__main__": | ||
with Path("docs/cli.md").open("w", encoding="utf8") as f: | ||
f.write(cli.document(title=title, description=description)) |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,20 @@ | ||
# Introduction | ||
- importance of embeddings models (search, RAG) | ||
- few benchmarks for scandinavian languages | ||
|
||
|
||
## Contributions: | ||
- Creates benchmark for scandinavian languages | ||
- integrates with MTEB | ||
- with broad coverage of both domains and use-cases | ||
- Allow for custom encoding methods dependent on task (as opposed to mteb) | ||
- Added a series of new datasets (?) | ||
- easily extendable | ||
|
||
## (Design principles) | ||
- flexible (easy to add new models) | ||
- easy to run on even small laptops | ||
- minimal dependencies besides MTEB | ||
- It should be transparent how models are run as often the exact prompt used can notably influence performance. --> this models are implemented as a part of the bencmark. | ||
|
||
# Results |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,4 +1,4 @@ | ||
from seb.cli import main | ||
|
||
if __name__ == "__main__": | ||
main() | ||
from seb.cli import setup_cli | ||
|
||
setup_cli() |
Oops, something went wrong.