IMDB crawler for movieQA genre walkthru

experiments (visualization) are done under jupyter lab = v4.4.0

# for data analysis
python >= 3.x
numpy 
pandas

# for crawler

analysis.ipynb

crawling over movieQA genre is done and summarized as imdb_crawled_whole.json file.

crawler_main.ipynb

crawler is implemented with bs4 and python requests lib.

Thx to sigran0 for many helps

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
.ipynb_checkpoints		.ipynb_checkpoints
__pycache__		__pycache__
.gitignore		.gitignore
README.md		README.md
analysis.html		analysis.html
analysis.ipynb		analysis.ipynb
clip_exist_list.txt		clip_exist_list.txt
clip_length.py		clip_length.py
crawler_main.ipynb		crawler_main.ipynb
datacube_class.py		datacube_class.py
dl_labdesk.yaml		dl_labdesk.yaml
id_ds_splits.json		id_ds_splits.json
imdb_crawled_whole.json		imdb_crawled_whole.json
num_data.txt		num_data.txt
parser.py		parser.py
plot_list.txt		plot_list.txt
probe_tr_v_test_split.ipynb		probe_tr_v_test_split.ipynb
qa.json		qa.json
script_list.txt		script_list.txt
split_list.txt		split_list.txt
temp.py		temp.py
test_clip_length.ipynb		test_clip_length.ipynb
test_parser.ipynb		test_parser.ipynb

Provide feedback