chapter06 #22
Mandeepumra
started this conversation in
General
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
chapter record:
chapter06 directory
a narrative for the chapter
README.md file
environment and origins:
macOS
python version 3.8.0
docanalysis version 0.2.0
py4ami version 0.0.45
What we started with and our environment.
software used:
py4ami
docanalysis
How we set up the software:
pip install docanalysis
docanalysis --help
creation of raw HTML:
created pdf to html:
python -m py4ami.ami_pdf --inpath fulltext.pdf --outdir / --maxpage 110
tools used, comments on result.
py4ami
docanalysis
creation of sections
extraction of dictionaries
manual
abbreviation
method:
mkdir wiki_hackathon
wiki_hackathon
mkdir Chapter06
cd Chapter06
mkdir sections
cd sections
mkdir 0_main_body
docanalysis --project_name wiki_hackathon --output dict_search_5.csv --make_json dict_search_5.json --make_ami_dict entities --extract_abb ip_3_6_energy_abb
where,
--project name – the name of the project (here, wiki_hackathon)
--output - a csv for dictionary search (not of our use, but required to be created)
--make_json - just enter this. Not of current use, but required.
--make_ami_dict – uses the entities created in the above command
--extract_abb - the abbreviation dictionary that is the output.
keyword/phrase: keyword are extracted by the help of gensim method
introduction.md
table of centents.md
FAQs.md
annotation of HTML using dictionaries:
Beta Was this translation helpful? Give feedback.
All reactions