Add hyperlinks and paths validation. (#132)
Signed-off-by: ZePan110 <ze.pan@intel.com>
Showing 6 changed files with 176 additions and 5 deletions.
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,123 @@ | ||
# Copyright (C) 2024 Intel Corporation | ||
# SPDX-License-Identifier: Apache-2.0 | ||
|
||
name: Check Paths and Hyperlinks | ||
|
||
on: | ||
pull_request: | ||
branches: [main] | ||
types: [opened, reopened, ready_for_review, synchronize] | ||
|
||
jobs: | ||
check-the-validity-of-hyperlinks-in-README: | ||
runs-on: ubuntu-latest | ||
steps: | ||
- name: Clean Up Working Directory | ||
run: sudo rm -rf ${{github.workspace}}/* | ||
|
||
- name: Checkout Repo GenAIEval | ||
uses: actions/checkout@v4 | ||
|
||
- name: Check the Validity of Hyperlinks | ||
run: | | ||
cd ${{github.workspace}} | ||
fail="FALSE" | ||
url_lines=$(grep -Eo '\]\(http[s]?://[^)]+\)' --include='*.md' -r .|grep -Ev 'GenAIEval/blob/main') | ||
if [ -n "$url_lines" ]; then | ||
for url_line in $url_lines; do | ||
url=$(echo "$url_line"|cut -d '(' -f2 | cut -d ')' -f1|sed 's/\.git$//') | ||
path=$(echo "$url_line"|cut -d':' -f1 | cut -d'/' -f2-) | ||
response=$(curl -L -s -o /dev/null -w "%{http_code}" "$url") | ||
if [ "$response" -ne 200 ]; then | ||
echo "**********Validation failed, try again**********" | ||
response_retry=$(curl -L -s -o /dev/null -w "%{http_code}" "$url") | ||
if [ "$response_retry" -eq 200 ]; then | ||
echo "*****Retry succeeded*****" | ||
else | ||
echo "Invalid link from ${{github.workspace}}/$path: $url" | ||
fail="TRUE" | ||
fi | ||
fi | ||
done | ||
fi | ||
if [[ "$fail" == "TRUE" ]]; then | ||
exit 1 | ||
else | ||
echo "All hyperlinks are valid." | ||
fi | ||
shell: bash | ||
|
||
check-the-validity-of-relative-path: | ||
runs-on: ubuntu-latest | ||
steps: | ||
- name: Clean Up Working Directory | ||
run: sudo rm -rf ${{github.workspace}}/* | ||
|
||
- name: Checkout Repo GenAIEval | ||
uses: actions/checkout@v4 | ||
|
||
- name: Checking Relative Path Validity | ||
run: | | ||
cd ${{github.workspace}} | ||
fail="FALSE" | ||
repo_name=${{ github.event.pull_request.head.repo.full_name }} | ||
if [ "$(echo "$repo_name"|cut -d'/' -f1)" != "opea-project" ]; then | ||
owner=$(echo "${{ github.event.pull_request.head.repo.full_name }}" |cut -d'/' -f1) | ||
branch="https://github.com/$owner/GenAIEval/tree/${{ github.event.pull_request.head.ref }}" | ||
else | ||
branch="https://github.com/opea-project/GenAIEval/blob/${{ github.event.pull_request.head.ref }}" | ||
fi | ||
link_head="https://github.com/opea-project/GenAIEval/blob/main" | ||
png_lines=$(grep -Eo '\]\([^)]+\)' --include='*.md' -r .|grep -Ev 'http') | ||
if [ -n "$png_lines" ]; then | ||
for png_line in $png_lines; do | ||
refer_path=$(echo "$png_line"|cut -d':' -f1 | cut -d'/' -f2-) | ||
png_path=$(echo "$png_line"|cut -d '(' -f2 | cut -d ')' -f1) | ||
if [[ "${png_path:0:1}" == "/" ]]; then | ||
check_path=${{github.workspace}}$png_path | ||
elif [[ "${png_path:0:1}" == "#" ]]; then | ||
check_path=${{github.workspace}}/$refer_path$png_path | ||
else | ||
check_path=${{github.workspace}}/$(dirname "$refer_path")/$png_path | ||
fi | ||
real_path=$(realpath "$check_path") | ||
if [ $? -ne 0 ]; then | ||
echo "Path $png_path in file ${{github.workspace}}/$refer_path does not exist" | ||
fail="TRUE" | ||
else | ||
url=$link_head$(echo "$real_path" | sed 's|.*/GenAIEval||') | ||
response=$(curl -I -L -s -o /dev/null -w "%{http_code}" "$url") | ||
if [ "$response" -ne 200 ]; then | ||
echo "**********Validation failed, try again**********" | ||
response_retry=$(curl -L -s -o /dev/null -w "%{http_code}" "$url") | ||
if [ "$response_retry" -eq 200 ]; then | ||
echo "*****Retry succeeded*****" | ||
else | ||
echo "Retry failed. Check branch ${{ github.event.pull_request.head.ref }}" | ||
url_dev=$branch$(echo "$real_path" | sed 's|.*/GenAIEval||') | ||
response=$(curl -I -L -s -o /dev/null -w "%{http_code}" "$url_dev") | ||
if [ "$response" -ne 200 ]; then | ||
echo "**********Validation failed, try again**********" | ||
response_retry=$(curl -L -s -o /dev/null -w "%{http_code}" "$url_dev") | ||
if [ "$response_retry" -eq 200 ]; then | ||
echo "*****Retry succeeded*****" | ||
else | ||
echo "Invalid path from ${{github.workspace}}/$refer_path: $png_path" | ||
fail="TRUE" | ||
fi | ||
else | ||
echo "Path found on branch ${{ github.event.pull_request.head.ref }}." | ||
fi | ||
fi | ||
fi | ||
fi | ||
done | ||
fi | ||
if [[ "$fail" == "TRUE" ]]; then | ||
exit 1 | ||
else | ||
echo "All relative paths are valid." | ||
fi | ||
shell: bash |
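
The parsing, path-resolution, and retry steps in the workflow above can be sketched as standalone shell functions, which makes each step easier to try locally. This is a minimal sketch, not part of the commit: the helper names `match_file`, `match_target`, `resolve_check_path`, and `ok_with_retry` are hypothetical, and the example paths are made up for illustration.

```shell
#!/bin/sh
# Sketch only: the helper names below are hypothetical, not part of the workflow.

# A grep match looks like "./docs/a.md:](https://example.com/x.git)".
# The text before the first ':' is the referring file; drop the leading "./".
match_file() {
    printf '%s\n' "$1" | cut -d':' -f1 | cut -d'/' -f2-
}

# The text between the first '(' and the next ')' is the link target.
# A trailing ".git" is stripped, as the hyperlink job does.
match_target() {
    printf '%s\n' "$1" | cut -d'(' -f2 | cut -d')' -f1 | sed 's/\.git$//'
}

# Map a link target to the filesystem path to check, mirroring the three
# cases in the relative-path job: repo-rooted ("/..."), same-file anchor
# ("#..."), and paths relative to the referring file's directory.
resolve_check_path() {
    workspace=$1 refer_path=$2 target=$3
    case "$target" in
        /*)  printf '%s\n' "$workspace$target" ;;
        \#*) printf '%s\n' "$workspace/$refer_path$target" ;;
        *)   printf '%s\n' "$workspace/$(dirname "$refer_path")/$target" ;;
    esac
}

# Run a command that prints an HTTP status code, at most twice.
# Succeeds if either attempt prints 200.
ok_with_retry() {
    [ "$("$@")" = "200" ] || [ "$("$@")" = "200" ]
}

# Demo with a made-up match line.
line='./docs/guide.md:](../assets/flow.png)'
resolve_check_path /workspace "$(match_file "$line")" "$(match_target "$line")"
# prints /workspace/docs/../assets/flow.png
```

In the workflow the resolved path is then passed through `realpath`, and the status command would be something like `ok_with_retry curl -L -s -o /dev/null -w '%{http_code}' "$url"`. Taking the command as arguments keeps the retry logic testable without network access.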
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,48 @@ | ||
# AudioQnA Accuracy Evaluation | ||
|
||
## Dataset | ||
|
||
|
||
We evaluate ASR accuracy on the test set of the LibriSpeech [dataset](https://huggingface.co/datasets/andreagasparini/librispeech_test_only), which contains 2620 audio recordings with their reference transcripts. | ||
|
||
## Metrics | ||
|
||
We evaluate the WER (Word Error Rate) metric of the ASR microservice. | ||
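
For reference, WER is conventionally computed from the word-level edit distance between the ASR transcript and the reference text. The formula below is the standard definition and is given here as background; it is not taken from the evaluation scripts in this commit, which may normalize text differently before scoring.

$$
\mathrm{WER} = \frac{S + D + I}{N}
$$

Here $S$, $D$, and $I$ are the numbers of substituted, deleted, and inserted words in the transcript, and $N$ is the number of words in the reference. For example, a transcript with one substitution and one deletion against a 10-word reference gives $\mathrm{WER} = (1 + 1 + 0) / 10 = 0.2$, or 20%.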
|
||
## Evaluation | ||
|
||
### Launch ASR microservice | ||
|
||
Launch the ASR microservice with the following commands. For more details, refer to the [doc](https://github.com/opea-project/GenAIComps/tree/main/comps/asr). | ||
|
||
```bash | ||
git clone https://github.com/opea-project/GenAIComps | ||
cd GenAIComps | ||
docker build -t opea/whisper:latest --build-arg https_proxy=$https_proxy --build-arg http_proxy=$http_proxy -f comps/asr/whisper/Dockerfile . | ||
# To evaluate a different model, change the value passed to --model_name_or_path | ||
docker run -p 7066:7066 --ipc=host -e http_proxy=$http_proxy -e https_proxy=$https_proxy opea/whisper:latest --model_name_or_path "openai/whisper-tiny" | ||
``` | ||
|
||
### Evaluate | ||
|
||
Install dependencies: | ||
|
||
```bash | ||
pip install -r requirements.txt | ||
``` | ||
|
||
Evaluate the ASR accuracy: | ||
```bash | ||
# validate the offline model | ||
# python offline_evaluate.py | ||
# validate the online asr microservice accuracy | ||
python online_evaluate.py | ||
``` | ||
|
||
### Performance Result | ||
The tested results are listed below for reference: | ||
| Model | WER | | ||
| --- | --- | | ||
| whisper-large-v2 | 2.87 | | ||
| whisper-large | 2.7 | | ||
| whisper-medium | 3.45 | |