-
Notifications
You must be signed in to change notification settings - Fork 1
/
cfp2014.txt
194 lines (157 loc) · 8.31 KB
/
cfp2014.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
First Joint Workshop on Statistical Parsing of Morphologically Rich Languages
and Syntactic Analysis of Non-Canonical Languages (SPMRL-SANCL 2014)
ENDORSED BY SIGPARSE
Co-located with Coling 2014, August 23/24 in Dublin, Ireland
SPMRL-SANCL 2014 will feature a shared task on semi-supervised parsing
morphologically rich languages.
Outline
Statistical parsing of morphologically-rich languages has repeatedly
been shown to exhibit non-trivial challenges including, among others,
sparse lexica in the face of rich inflectional systems, parsing
deficiency in the face of free word order and treebank annotation
idiosyncrasies in the face of morphosyntactic interactions.
Similar problems arise for parsing non-canonical languages. Besides
technical issues such as lexical sparseness and ad-hoc structures, we
also face theoretical problems including constructions that do not, or
very seldomly occur in standard language, such as verbless sentences or
complex hashtags.
The first joint SPMRL-SANCL workshop addresses both the challenge of
parsing MRLs and NCLs. It provides a forum for research addressing the
often overlapping issues of both fields with the goal of identifying
cross-cutting issues in the annotation and parsing methodology for such
languages.
Areas of interest
The areas of interest of the SPMRL-SANCL workshop include, but are not
limited to, the following list of topics:
- applying cutting-edge parsing techniques to new languages and
domains
- strengths and weaknesses of current parsing techniques when applied
to morphologically-rich and/or non-canonical language
- insights and techniques that are targeted at improving parsing
quality for morphologically-rich and/or non-canonical language
- using insights from parsing and associated processing problems to
motivate decisions in the creation of new syntactically annotated
corpora
- annotation and parsing of data from domains and genres that are not
yet covered for many languages
- discussing the role of parsing in higher-level NLP applications
involving MRLs and NCLs, e.g. syntax-enhanced MT and semantic analysis.
Shared Task
The workshop will also host the second shared task on parsing morphologically
rich language (see http://spmrl.org/spmrl2014-sharedtask.html). The first
shared task was held in conjunction with SPMRL 2013. It helped show that
carefully engineered approaches can help to push the envelope on languages such
as Hungarian, Basque, Hebrew and Polish, where the shared task results for
constituency parsing are the best current known for those languages. The task
embodied a focus on realistic scenarios (no gold tokenization, no gold
part-of-speech or morphology), as well as meaningful evaluation measures,
including a cross-framework evaluation that permits comparisons between
constituent and dependency parsing models.
The second installment of the Shared Task will feature a similar range of
languages. Moreover, it will also consider a semi-supervised scenario where
larger quantities of in-domain text are available. These unlabeled data are
aimed to be used for self-training, co-training, lexical acquisition,
generating word clusters, word embeddings and so on. A separate call for the
Shared Task is forthcoming.
Special Track on Syntactic Analysis of Non-Canonical Language
In addition to regular paper submissions, we solicit poster submissions
addressing the syntactic analysis of frequent phenomena of non-canonical
language, which are difficult to annotate and parse using conventional
annotation schemes. Cases in point are the representation of verbless
utterances in a dependency scheme, the pros and cons of different
representations of disfluencies for statistical parsing, or the analysis of
complex hashtags which incorporate and merge different syntactic arguments into
one token. The posters should focus, in more detail, on one more of these
issues.
More details on the submission categories for the poster session can
be found below and at: http://spmrl.org/sancl-posters2014.html
Important Dates
--------------------- -----------------------
Submission deadline May 2, 2014
Author Notification June 6, 2014
Camera ready copy June 27, 2014
Workshop August (23/24?), 2014
--------------------- -----------------------
How to Submit
We solicit the following submission categories:
- long papers (up to 11 pages excluding references)
- short papers (up to 6 pages excluding references)
- abstracts (500 words excluding examples/references, for
SANCL poster topics)
- shared task paper submissions (deadline/format will be disclosed later)
Long papers are most appropriate for presenting substantial and
completed research addressing a topic relevant to either SANCL or SPMRL.
Short papers are suited for presenting work in progress, position papers
or short, focused contributions relevant to either SANCL or SPMRL
(including the poster session topics described above and, in more
detail, here).
Both long and short papers should present original, unpublished
research. They will be peer reviewed and will be presented as either an
oral talk or as a poster at the workshop. Long/short papers will be
included in the proceedings. Abstract submissions are most appropriate
for presenting an idea for an analysis for one or more of the poster
topics. In contrast to long/short paper submissions, abstract
submissions do not need to back up their ideas with experimental
results. Abstract submission will receive a yes/no review and will not
be included in the proceedings.
Submissions will be accepted until May, 2 , 2014, (11:59 p.m. PST) in
PDF format via the START system and must be formatted using the Coling
2014 formatting instructions: http://www.coling-2014.org/call-for-papers.php
Organizers
Workshop
- Yoav Goldberg (Bar Ilan University, Israel)
- Yuval Marton (Microsoft Inc., US)
- Ines Rehbein (Potsdam University, Germany)
- Yannick Versley (Heidelberg University, Germany)
- Özlem Çetinoğlu (University of Stuttgart, Germany)
- Joel Tetreault (Yahoo! Labs, US)
SANCL Special Track
- Ines Rehbein (Potsdam University, Germany)
- Özlem Çetinoğlu (University of Stuttgart, Germany)
- Djamé Seddah (Université Paris Sorbonne & INRIA's Alpage Project,
France)
- Joel Tetreault (Yahoo! Labs, US)
Shared task
- Sandra Kübler (Indiana University, US)
- Djamé Seddah (Université Paris Sorbonne & INRIA's Alpage Project,
France)
- Reut Tsarfaty (Weizmann Institute of Science, Israel)
Program committee
- Bernd Bohnet (University of Birmingham, UK)
- Marie Candito (University of Paris 7, France)
- Aoife Cahill (Educational Testing Service Inc., US)
- Jinho D. Choi (University of Massachusetts Amherst, US)
- Grzegorz Chrupala (Tilburg University, Netherlands)
- Gülşen Cebiroğlu Eryiğit (Istanbul Technical University, Turkey)
- Markus Dickinson (Indiana University, US)
- Stefanie Dipper (Ruhr-Universität Bochum, Germany)
- Jacob Eisenstein (Georgia Institute of Technology, US)
- Richard Farkas (University of Szeged, Hungary)
- Jennifer Foster (Dublin City University, Ireland)
- Josef van Genabith (DFKI, Germany)
- Koldo Gojenola (University of the Basque Country, Spain)
- Spence Green (Stanford University, US)
- Samar Husain (Potsdam University, Germany)
- Sandra Kübler (Indiana University, US)
- Joseph Le Roux (Université Paris-Nord, France)
- John Lee (City University of Hong Kong, China)
- Wolfgang Maier (University of Düsseldorf, Germany)
- Takuya Matsuzaki (University of Tokyo, Japan)
- David McClosky (IBM Research, US)
- Detmar Meurers (University of Tübingen, Germany)
- Joakim Nivre (Uppsala University, Sweden)
- Kemal Oflazer (Carnegie Mellon University, Qatar)
- Adam Przepiorkowski (ICS PAS, Poland)
- Owen Rambow (Columbia University, US)
- Kenji Sagae (University of Southern California, US)
- Benoit Sagot (Inria, France)
- Djamé Seddah (Univ. Paris Sorbonne, France)
- Wolfgang Seeker (IMS Stuttgart, Germany)
- Anders Soogard (University of Copenhagen, Denmark)
- Reut Tsarfaty (Uppsala University, Sweden)
- Lamia Tounsi (Dublin City University, Ireland)
- Daniel Zeman (Charles University, Czechia)
Contact
For general questions about the workshop, please email
spmrl.sancl@gmail.com. For specific questions about the shared task,
please email the shared task organizers spmrl.sharedtask@gmail.com