We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
请问同学,关于QMSum数据集,你们是怎么划分的呢?
我按照QMSum官方给定的脚本进行数据处理后,与你们论文中给出的数据差别很大。
对于Academic,原论文标注有369条,我处理下有371条,您论文中是312条; 对于Committee,原论文标注有453条,我处理下有453条,您论文中是417条; 对于Product,原论文标注有986条,我处理下有986条,您论文中是847条;
还有就是平均长度,我使用的是gold spans 作为输入,统计如下: Academic: avg.dialog.len = 3194.06; avg.summary.len=60.01; avg.quire.len=8.61 Committee: avg.dialog.len = 2145.79; avg.summary.len=88.37; avg.quire.len=14.97 Product: avg.dialog.len = 1811.55; avg.summary.len=77.98; avg.quire.len=13.21
我的统计是按照单词间的空格进行划分的。
您论文中是: Domains || Size || Dialog.len || Summ.len || QR.len Academic || 312 || 1,155.78 || 46.48 || 8.56 Committee || 417 || 757.68 || 76.00 || 14.54 Product || 847 || 971.65 || 63.96 || 13.36 All || 1,576 || 951.49 || 63.68 || 12.73
The text was updated successfully, but these errors were encountered:
No branches or pull requests
请问同学,关于QMSum数据集,你们是怎么划分的呢?
我按照QMSum官方给定的脚本进行数据处理后,与你们论文中给出的数据差别很大。
对于Academic,原论文标注有369条,我处理下有371条,您论文中是312条;
对于Committee,原论文标注有453条,我处理下有453条,您论文中是417条;
对于Product,原论文标注有986条,我处理下有986条,您论文中是847条;
还有就是平均长度,我使用的是gold spans 作为输入,统计如下:
Academic:
avg.dialog.len = 3194.06; avg.summary.len=60.01; avg.quire.len=8.61
Committee:
avg.dialog.len = 2145.79; avg.summary.len=88.37; avg.quire.len=14.97
Product:
avg.dialog.len = 1811.55; avg.summary.len=77.98; avg.quire.len=13.21
我的统计是按照单词间的空格进行划分的。
您论文中是:
Domains || Size || Dialog.len || Summ.len || QR.len
Academic || 312 || 1,155.78 || 46.48 || 8.56
Committee || 417 || 757.68 || 76.00 || 14.54
Product || 847 || 971.65 || 63.96 || 13.36
All || 1,576 || 951.49 || 63.68 || 12.73
The text was updated successfully, but these errors were encountered: