Extractive Text Summarizer is a pdf summarizer, in this user provide the pdf that needed to be summarized and also the desired length of output summary which is basically how many sentences does the user want from original pdf as a summary of pdf , then this app will provide the summarized text on next page which user can copy.It's an extractive approach in which algorithm tries to extract sentnces from original text which are highly correlated to each other and also give a good summary of text. It uses Text Rank Algorithm where score between each sentence is basically a cosine similarity in this way i am not only extracting random sentences but interconnected sentences. Used kivy to make GUI on windows.
- Either fork or downlod app files.
- Install all dependencies given in requirements.txt file in cmd using pip install -r requirements.txt
- First download "glove.6B.100d.txt" from kaggle or any other site where 100d is 100 dimension embedding.
- Then run "glo.py" file, "model.bin" will be created.
- Now run "userinterface.py"
- Screen will be displayed
- Choose file from folder window which need to be summarized(after selecting file it will take tim, wait until selected file path is displayed on screen).
- Type length of how many sentences you want in summarized text and then press enter(make sure to press enter after writing length otherwise it won't take it)
- Now press on summarize button and wait , if your file is big it will take , don't press twice on summarize button otherwise will cause error.
- After some time you will be directed to next page where your summarized text will be displayed.
- Let you select pdf and length of summarized text.
- Will display error message if you didn't select path or enter the length.
- Once summarized text displayed you can get back to original screen by pressing "Back" button.
- Show user some loading screen when he presses "Summarize" button , bcoz algorithm will run in background and user won't know he/she has to wait for process to complete and he/she may press button again causing app to crash.
- Allow user to download summarized text in pdf format, for know you can only see text and you have copy and paste summarized text yourself.
- Maybe use flask, which can be hosted online can be a good idea.
- Nltk
- Gensim
- Sklearn
- Networkx
- Kivy
- Kivymd
(https://github.com/kashif-flask/Text-summarizer/blob/master/legal_doc.pdf)