Skip to content

Commit 663a2c1

Browse files
committed
news
1 parent 658ad28 commit 663a2c1

File tree

1 file changed

+11
-4
lines changed

1 file changed

+11
-4
lines changed

index.html

Lines changed: 11 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@
22
<!DOCTYPE html>
33
<html>
44
<head>
5-
<title>Homepage for Language Net</title>
5+
<title>Homepage for LanguageNet</title>
66
<link rel="stylesheet" type="text/css" href="project.css">
77

88
<script>
@@ -21,16 +21,23 @@
2121

2222
<br>
2323
<center>
24-
<h1 style="color:dodgerblue">Language-Net: The Large Scale Paraphrase Dataset</h1>
24+
<h1 style="color:dodgerblue">LanguageNet: Large Scale Multilingual Paraphrase Corpus</h1>
2525
</center>
2626
<br>
2727

28+
<h3 style="color: brown">What's New</h3>
29+
30+
<ul>
31+
<li>We are building large scale multilingual paraphrase datasets right now. As planned, we will have 10-language corpus and each language has ~50k pairs!
32+
</li>
33+
</ul>
34+
2835
<h3 style="color: brown">The Corpus</h3>
2936

3037
<ul>
31-
<li>The Language-Net is a collection of sentence level paraphrases from Twitter by linking tweets through shared
38+
<li>The LanguageNet (English) is a collection of sentence level paraphrases from Twitter by linking tweets through shared
3239
URLs. This corpus is the largest up to date with 51,524 human annotated sentence pairs: 42200 for training and 9324 for testing. It can grow 30,000
33-
new sentential paraphrases per month with 70% precision. Now we have 1-year data available: 2,869,657 candidate pairs! <br><br>
40+
new sentential paraphrases per month with ~70% precision. Now we have 1-year data available: 2,869,657 candidate pairs! <br><br>
3441
The following paper introduces the corpus in detail:<br>
3542
<a class="publink" href="http://www.aclweb.org/anthology/D/D17/D17-1126.pdf">A Continuously Growing Dataset of Sentential Paraphrases</a>
3643
<br/><b><a href="https://lanwuwei.github.io/">Wuwei Lan</a></b>, Siyu Qiu, Hua He and Wei Xu. <cite>EMNLP 2017</cite>.

0 commit comments

Comments
 (0)