Taipei.py_20130425

Demo at Taipei.py on 2013/04/05.

Slides : http://www.slideshare.net/rueshyna/text-mining-20087054

Video : https://www.youtube.com/watch?v=svGf5Vxyx60&feature=c4-feed-u

Notification

The programs take longer to run on large data sets.

Requirement

Data

I used the title column of the train-sample file from Stack Overflow as the example data in this talk.

Demo 1 - Term Frequency Analysis

It counts terms and plots a chart.

> python freq.py
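The counting step can be sketched with the standard library alone. This is a minimal illustration, not the code in freq.py: the function name `term_frequencies`, the regex tokenizer, and the sample titles are all assumptions made for the example, and the chart is left out.

```python
import re
from collections import Counter


def term_frequencies(titles):
    """Count how often each lowercased term appears across all titles."""
    counts = Counter()
    for title in titles:
        # Hypothetical tokenizer: runs of letters, digits, '#' and '+'.
        counts.update(re.findall(r"[a-z0-9#+]+", title.lower()))
    return counts


# Made-up titles standing in for the Stack Overflow title column.
titles = [
    "How to parse JSON in Python",
    "Parse XML in Python",
    "How to sort a list in Java",
]

freq = term_frequencies(titles)
for term, count in freq.most_common(3):
    print(term, count)
```

The resulting `Counter` can be passed to any plotting library to draw the chart.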

Demo 2 - Tagged Sentence

Download the tagger model first.

>>> import nltk  
>>> nltk.download('maxent_treebank_pos_tagger')

This model uses Penn Treebank II tags.

> python pos.py

Demo 3 - Term Frequency with POS tag

The Penn Treebank II tags were too detailed here, so I simplified them. Please refer to the NLTK API documentation for the simplified tags.

> python freq_pos.py
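To illustrate what simplifying tags means, the sketch below collapses Penn Treebank tags by prefix and then counts (word, tag) pairs. The mapping in `simplify_tag` is a hypothetical one chosen for the example; it is not NLTK's simplified tag set and probably differs from what freq_pos.py does. The tagged words are hand-written rather than produced by a tagger.

```python
from collections import Counter


def simplify_tag(tag):
    """Collapse a Penn Treebank tag into a coarse class (hypothetical mapping)."""
    if tag.startswith("NN"):  # NN, NNS, NNP, NNPS
        return "N"
    if tag.startswith("VB"):  # VB, VBD, VBG, VBN, VBP, VBZ
        return "V"
    if tag.startswith("JJ"):  # JJ, JJR, JJS
        return "ADJ"
    if tag.startswith("RB"):  # RB, RBR, RBS
        return "ADV"
    return "OTHER"


# Hand-written (word, tag) pairs standing in for tagger output.
tagged = [
    ("parse", "VB"), ("files", "NNS"), ("quickly", "RB"),
    ("parse", "VB"), ("file", "NN"), ("large", "JJ"),
]

# Frequency of each word together with its simplified tag.
pos_freq = Counter((word, simplify_tag(tag)) for word, tag in tagged)
# Frequency of each simplified tag on its own.
tag_freq = Counter(simplify_tag(tag) for _, tag in tagged)

print(pos_freq.most_common(3))
print(tag_freq)
```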

Demo 4 - Collocation

The collocation window size is set to 5, which means each word is paired with the next 5 words.

I forgot to lowercase the text in this program, so please be careful: terms that differ only in case are treated as different terms.

> python collocation.py
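The windowed pairing can be sketched without NLTK. The example below follows the description above and pairs each word with the next `window` words; NLTK's own collocation finders may define the window differently, and `window_pairs` is a hypothetical helper, not a function from collocation.py. The text is lowercased first to avoid the case problem mentioned above.

```python
from collections import Counter


def window_pairs(tokens, window=5):
    """Count ordered pairs (left, right) where right is one of the next `window` tokens."""
    counts = Counter()
    for i, left in enumerate(tokens):
        for right in tokens[i + 1 : i + 1 + window]:
            counts[(left, right)] += 1
    return counts


# Made-up sentence; lowercasing makes "How" and "how" the same term.
tokens = "How to read a file in Python and how to write a file".lower().split()

pairs = window_pairs(tokens, window=5)
for pair, count in pairs.most_common(3):
    print(pair, count)
```

Raw pair counts are only a starting point. Collocation tools usually rank pairs with an association measure rather than by frequency alone.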

Demo 5 - Make a Sentence

It uses a language model to generate a sentence.

> python lm.py
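As a rough idea of how a language model can generate a sentence, here is a minimal bigram sketch using only the standard library. It is an assumption for illustration: lm.py may use a different model order, smoothing, or NLTK's own classes. The names `train_bigrams` and `make_sentence`, the boundary markers, and the corpus are all made up.

```python
import random
from collections import defaultdict

START, END = "<s>", "</s>"  # hypothetical sentence boundary markers


def train_bigrams(sentences):
    """Map each token to the list of tokens observed right after it."""
    model = defaultdict(list)
    for sentence in sentences:
        tokens = [START] + sentence.lower().split() + [END]
        for prev, nxt in zip(tokens, tokens[1:]):
            model[prev].append(nxt)
    return model


def make_sentence(model, rng, max_len=20):
    """Walk the bigram table from START, sampling successors until END or max_len."""
    words, current = [], START
    while len(words) < max_len:
        # Repeated successors in the list make frequent bigrams more likely.
        current = rng.choice(model[current])
        if current == END:
            break
        words.append(current)
    return " ".join(words)


# Made-up corpus standing in for the Stack Overflow titles.
corpus = [
    "How to parse JSON in Python",
    "How to sort a list in Java",
    "Parse XML in Python",
]

model = train_bigrams(corpus)
print(make_sentence(model, random.Random(0)))
```

Every adjacent word pair in the output was seen in the corpus, so the result reads as locally plausible even when the whole sentence is new.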
