Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

在readme中增加了一个新的词库链接 #77

Closed
wants to merge 1 commit into from

Conversation

xiaowl
Copy link

@xiaowl xiaowl commented Mar 28, 2016

我在自己的配置过程中根据之前自己的一个NLP的语料库整理了一份词条比较多的字典,包涵大概330万左右的词条,文件大小80M。希望可以免去大多数人到处找字典的苦恼。

@tumashu
Copy link
Owner

tumashu commented Mar 28, 2016

牛,不过baidu网盘靠谱不? 时间长了,这个链接会不会失效? 如果没有法律方面的麻烦,你或许可以把它放到github中,或者再做一个GitHub镜像

另外,你这个词库很不错,建议起一个好一点的名字,把它做成一个project,类似 chinese-pyim-bigdict ,all.pyim 这个名称太随便了。。。不便于以后维护, 记得添加README和词库的版权协议。

@tumashu
Copy link
Owner

tumashu commented Mar 28, 2016

另外,README.md 是自动从 chinese-pyim.el 转换得到的,不能手动更改的。。。。你改完后,我添加说明吧。。。。

@xiaowl
Copy link
Author

xiaowl commented Mar 28, 2016

其它都没问题,唯一不确定的是版权协议。我是根据之前在微博上收藏的别人分享的一个NLP语料库基础上做了pinyin转换后得到的字典:

http://www.nlpcn.org/resource/25

这个资源本身也没有版权声明。

有关单独host一个project唯一的考虑是单个字典太大了,不太适合用github来host。。 你有什么建议?

@tumashu
Copy link
Owner

tumashu commented Mar 28, 2016

不太适合用github来host

我觉得可以,因为词库文件的内容更新的频率很小。

唯一不确定的是版权协议。我是根据之前在微博上收藏的别人分享的一个NLP语料库基础上做了pinyin转换后得到的字典: 

能找到原来作者吗? 如果可以的话,就问问,实在找不到,就在README中说明原始出处,但我个人不太建议这样做。。。

@xiaowl
Copy link
Author

xiaowl commented Mar 28, 2016

先试试联系原来的作者吧。

@tumashu tumashu closed this May 30, 2016
@tumashu tumashu reopened this May 30, 2016
@tumashu
Copy link
Owner

tumashu commented Jun 2, 2016

@xiaowl 你做的这个词库现在啥状态?

@tumashu
Copy link
Owner

tumashu commented Jun 2, 2016

https://github.com/lshb 原来作者的github,我准备发邮件问一下相关情况

@tumashu
Copy link
Owner

tumashu commented Jun 14, 2016

我开启了一个新项目: chinese-pyim-greatdict, 专门用来处理你发的这个文件,另外我已经邀请你为合作开发者了。。。。这个 pull request 我就关闭了。

@tumashu tumashu closed this Jun 14, 2016
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants