Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

中英文混合切词 #26

Open
linhx13 opened this issue Sep 26, 2017 · 2 comments
Open

中英文混合切词 #26

linhx13 opened this issue Sep 26, 2017 · 2 comments

Comments

@linhx13
Copy link

linhx13 commented Sep 26, 2017

hi,请问现在支持中英文混合切词的么?我这里测试是没有正确切开的。
In [4]: for t, f in seg.cut('this is a test sentence. 这个是计算广告的数据啊'):
...: print('%s %s' % (t, f))
...:
this x
v
i g
s g
g
a g
g
test np
v
sentence x
. w
j
这个 r
是 v
计算 v
广告 n
的 u
数据 n
啊 u

@sangszhou
Copy link

同问,感觉这是个 Bug

@qiaosiyi
Copy link

人家不是说只支持中文的嘛,再说英文切词,应该不用这个方法,直接按照空格划分就行。你要是标词性那肯定不行,这个得有对应的模型。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants