效果不理想，是要更新词库吗？ #14

ahumoon7421 · 2017-12-27T06:32:52Z

Loading model cost 1.286 seconds.
Prefix dict has been built succesfully.
2017-12-27 14:20:24.445937: I C:\tf_jenkins\home\workspace\rel-win\M\windows\PY
35\tensorflow\core\platform\cpu_feature_guard.cc:137] Your CPU supports instruct
ions that this TensorFlow binary was not compiled to use: AVX AVX2

hello
WARN：词汇不在服务区
你好
WARN：词汇不在服务区
呵呵
我
哈哈
就
早
WARN：词汇不在服务区

HCIS2020 · 2018-01-10T07:51:58Z

question 和 answer就各有1000个样本，所以效果比较有限

这个版本采用的是TF的seq2seq函数，目前应该有one-hot的的问题吧，支持Word2Vector的版本什么时候更新

cfso2475 · 2018-04-05T10:58:12Z

感觉是这个参数的问题。
min_freq = 10

默认的值为10导致好多词没有进词表，也就是训练的序列本身和question以及answer的文本差异比较大。
按照现有的1000条文本，词频都不高，临时改成1可能好一些。

alige32 · 2018-09-14T09:57:20Z

一个是楼上说的min_freq的问题，2、3效果会比较好，1太多低频词反而有副作用。另外size可根据过滤后词总数适当调高，基于这1000条样本的话10、12效果都是不错的。

Z1hgq · 2019-03-10T13:48:51Z

1000条3的效果比较好

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

效果不理想，是要更新词库吗？ #14

效果不理想，是要更新词库吗？ #14

ahumoon7421 commented Dec 27, 2017

HCIS2020 commented Jan 10, 2018

cfso2475 commented Apr 5, 2018 •

edited

Loading

alige32 commented Sep 14, 2018

Z1hgq commented Mar 10, 2019

效果不理想，是要更新词库吗？ #14

效果不理想，是要更新词库吗？ #14

Comments

ahumoon7421 commented Dec 27, 2017

HCIS2020 commented Jan 10, 2018

cfso2475 commented Apr 5, 2018 • edited Loading

alige32 commented Sep 14, 2018

Z1hgq commented Mar 10, 2019

cfso2475 commented Apr 5, 2018 •

edited

Loading