Skip to content

Commit

Permalink
fixed download datasets for tests
Browse files Browse the repository at this point in the history
  • Loading branch information
fonhorst committed Aug 2, 2023
1 parent 76ab69a commit 16cdab4
Show file tree
Hide file tree
Showing 9 changed files with 32 additions and 4,028 deletions.
3 changes: 3 additions & 0 deletions .github/workflows/CI.yml
Original file line number Diff line number Diff line change
Expand Up @@ -36,6 +36,9 @@ jobs:
pip3 install tox==3.24.4
pip3 install tox-gh-actions==2.8.1
- name: download datasets
run: ./bin/download-datasets.sh

- name: test with tox
run: |
tox
5 changes: 2 additions & 3 deletions bin/download-datasets.sh
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@

set -ex

dataset_dir="/opt/spark_data/"
dataset_dir="examples/data"

mkdir -p "${dataset_dir}"

Expand All @@ -17,5 +17,4 @@ wget https://www.openml.org/data/get_csv/52422/ipums_la_97-small.arff -O ${datas

head -n 25001 ${dataset_dir}/Buzzinsocialmedia_Twitter.csv > ${dataset_dir}/Buzzinsocialmedia_Twitter_25k.csv

cp examples/data/sampled_app_train.csv ${dataset_dir}
unzip examples/data/small_used_cars_data.zip -d ${dataset_dir}
unzip ${dataset_dir}/small_used_cars_data.zip -d ${dataset_dir}
2 changes: 2 additions & 0 deletions examples/data/.gitignore
Original file line number Diff line number Diff line change
@@ -1 +1,3 @@
small_used_cars_data.csv
*.csv
!sampled_app_train.csv
3,129 changes: 0 additions & 3,129 deletions examples/data/avito1k_train.csv

This file was deleted.

201 changes: 0 additions & 201 deletions examples/data/tiny_used_cars_data.csv

This file was deleted.

Loading

0 comments on commit 16cdab4

Please sign in to comment.