Some questions about yaha and wordmaker #4

Open
sshybhy opened this issue Aug 2, 2016 · 0 comments

sshybhy commented Aug 2, 2016

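I'm not a programmer, and I need to detect new words using matrix67's algorithm, which is based on mutual information and left/right neighbor entropy. The author's programs are among the few that implement this algorithm. I've run into a few problems and would appreciate some help.
1. Importing yaha often fails. I'm on Python 2.7, 64-bit Windows 8.1. After import yaha, I call make_new_word(file_from, file_save) to read one of my corpora. If I run my own .py script directly, it fails with an error saying Cutter cannot be imported. So far I've found that I have to type import yaha manually in IPython and press Enter before the rest of yaha can be used.
2. My corpus is over 5 MB, and make_new_word() hangs indefinitely on it. With corpora under 1 MB I get results fairly quickly, and they are quite good. When it hangs, memory is almost completely used up. If I don't split the corpus and only add more RAM, will yaha still be usable? In other words, does doubling the memory allow a corpus twice as large?
3. I don't know how to invoke wordmaker, the C++ program. I have installed VC++ 2010 Express, but I don't know whether the program can be called from Python. Are there any good approaches you could recommend?
These are beginner questions, so thank you for bearing with me.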
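
For context, the scoring idea referred to above (mutual information between the parts of a candidate, plus left/right neighbor entropy) can be sketched roughly as follows. This is a minimal illustration under my own assumptions, not the code used by yaha or wordmaker: the names `entropy`, `score_candidates` and `max_len` are hypothetical, the cohesion score is taken as the smallest ratio p(w) / (p(a) * p(b)) over all two-part splits, and no thresholds or filtering are applied.

```python
import math
from collections import Counter, defaultdict


def entropy(counter):
    """Shannon entropy (natural log) of a Counter of neighbor characters."""
    total = float(sum(counter.values()))
    if total == 0:
        return 0.0
    return -sum((n / total) * math.log(n / total) for n in counter.values())


def score_candidates(text, max_len=4):
    """Return {candidate: (count, cohesion, freedom)} for substrings of length 2..max_len.

    Hypothetical helper for illustration only; it keeps every substring in memory.
    """
    counts = Counter()
    left = defaultdict(Counter)   # characters seen immediately before each substring
    right = defaultdict(Counter)  # characters seen immediately after each substring
    n = len(text)
    total_chars = float(n)

    for i in range(n):
        for k in range(1, max_len + 1):
            if i + k > n:
                break
            w = text[i:i + k]
            counts[w] += 1
            if i > 0:
                left[w][text[i - 1]] += 1
            if i + k < n:
                right[w][text[i + k]] += 1

    scores = {}
    for w, c in counts.items():
        if len(w) < 2:
            continue
        p_w = c / total_chars
        # Cohesion: the weakest split decides how tightly the parts stick together.
        cohesion = min(
            p_w / ((counts[w[:j]] / total_chars) * (counts[w[j:]] / total_chars))
            for j in range(1, len(w))
        )
        # Freedom: the smaller of the left and right neighbor entropies.
        freedom = min(entropy(left[w]), entropy(right[w]))
        scores[w] = (c, cohesion, freedom)
    return scores
```

Candidates with both high cohesion and high freedom would be the ones kept as likely words. Note that a table like this grows with the number of distinct substrings rather than only with the file size, which may be related to the memory use described in question 2, although I have not checked how yaha stores its counts.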
