<html>
<head>
<meta content='text/html;charset=UTF-8' http-equiv='Content-Type'/>
<title>NLP — Syllabus</title>
<style type='text/css'>
@import 'css/default.css';
@import 'css/syntax.css';
</style>
<link rel="shortcut icon" href="favicon.ico" />
<meta content='Natural Language Processing Class' name='subject'/>
<!--<link href='images/favicon.png' rel='shortcut icon'>-->
<!-- MathJax Section -->
<script type="text/javascript" src="http://cdn.mathjax.org/mathjax/latest/MathJax.js?config=TeX-AMS-MML_HTMLorMML"></script>
<script>
MathJax.Hub.Config({
tex2jax: {
skipTags: ['script', 'noscript', 'style', 'textarea', 'pre']
}
});
MathJax.Hub.Queue(function() {
var all = MathJax.Hub.getAllJax(), i;
for(i=0; i < all.length; i += 1) {
all[i].SourceElement().parentNode.className += ' has-jax';
}
});
</script>
</head>
<body>
<div id='wrap'>
<div id='header'>
<img height="100" alt='NLP Class' src='images/utexas.png'/>
<div class='tagline'>Natural Language Processing: Fall 2013</div>
</div>
<div id='pages'>
<ol class='toc'>
<li>NLP Class
<ol class="toc">
<li><a href='index.html'>Home</a></li>
<li><a href='syllabus.html'>Syllabus</a></li>
<li><a href='schedule.html'>Schedule</a></li>
<li><a href='notes'>Notes</a></li>
<li><a href='assignments.html'>Assignment Requirements</a></li>
<li><a href='links.html'>Links</a></li>
</ol>
</li>
<li>Useful Information
<ol class="toc">
<li><a href='scala'>Scala</a></li>
</ol>
</li>
<li>Assignments
<ol class="toc">
<li><a href='assignments/a0programming.html'>#0 - Programming</a></li>
<li><a href='assignments/a1prob.html'>#1 - Probability</a></li>
<li><a href='assignments/a2classification.html'>#2 - Classification</a></li>
<li><a href='assignments/a3ngrams.html'>#3 - N-Grams</a></li>
<li><a href='assignments/a4hmm.html'>#4 - HMMs</a></li>
<li><a href='assignments/a5sentiment.html'>#5 - Sentiment</a></li>
<li><a href='assignments/a6parsing.html'>#6 - Parsing</a></li>
</ol>
</li>
<li>External Links
<ol class="toc">
<li><a href='http://www.utcompling.com'>UTCL Main site</a></li>
<li><a href='https://courses.utexas.edu/webapps/portal/frameset.jsp?tab_tab_group_id=_11_1&url=%2Fwebapps%2Fblackboard%2Fexecute%2Flauncher%3Ftype%3DCourse%26id%3D_159651_1%26url%3D'>Blackboard</a></li>
</ol>
</li>
</ol>
</div>
<div id='content'>
<h1>Syllabus</h1>
<ol class="toc"><li><a href="#course_information">Course Information</a></li><li><a href="#instructor_contact_information">Instructor Contact Information</a></li><li><a href="#course_mailing_list">Course Mailing List</a></li><li><a href="#prerequisites">Prerequisites</a></li><li><a href="#syllabus">Syllabus</a></li><li><a href="#text">Text</a></li><li><a href="#exams_and_assignments">Exams and Assignments</a></li><li><a href="#philosophy_and_goal">Philosophy and Goal</a></li><li><a href="#content_overview">Content Overview</a></li><li><a href="#course_requirements">Course Requirements</a></li><li><a href="#deadline_policy">Deadline Policy</a></li><li><a href="#academic_dishonesty_policy">Academic Dishonesty Policy</a></li><li><a href="#notice_about_students_with_disabilities">Notice about students with disabilities</a></li><li><a href="#notice_about_missed_work_due_to_religious_holy_days">Notice about missed work due to religious holy days</a></li></ol>
<h2 id='course_information'>Course Information</h2>
<p><strong>Course Name:</strong> Natural Language Processing <br /><strong>Course Number:</strong> CS 378 / LIN 353N<br /><strong>Semester:</strong> Fall 2013</p>
<p><strong>Time:</strong> Tues/Thurs 2:00-3:30<br /><strong>Room:</strong> ENS 145   <span style='color: red'>UPDATED</span></p>
<h2 id='instructor_contact_information'>Instructor Contact Information</h2>
<p><a href='http://www.cs.utexas.edu/~dhg/'>Dan Garrette</a><br /><strong>office hours:</strong> Tues/Wed 9:30-10:30, or by appointment<br /><strong>office hours location:</strong> GDC 1.302<br /><strong>office:</strong> GDC 3.504A<br /><strong>email:</strong> dhg@cs.utexas.edu</p>
<p><strong>TA:</strong> <a href='http://www.cs.utexas.edu/~lewfish/'>Lewis Fishgold</a><br /><strong>office hours:</strong> Mon/Thurs 4:30-5:30<br /><strong>office hours location:</strong> GDC 1.302<br /><strong>email:</strong> lewfish@cs.utexas.edu</p>
<h2 id='course_mailing_list'>Course Mailing List</h2>
<p>Please register for the course mailing list:</p>
<p><a href='https://piazza.com/utexas/fall2013/cs388'>https://piazza.com/utexas/fall2013/cs388</a></p>
<h2 id='prerequisites'>Prerequisites</h2>
<p>Proficiency in at least one programming language. Students should have taken LIN 350 (Words in a Haystack: Methods and Tools for Working with Corpora, Introduction to Computational Linguistics), or CS 310 and CS 315, or obtain consent from the instructor.</p>
<h2 id='syllabus'>Syllabus</h2>
<p>This website serves as the syllabus for this course.</p>
<p>A <em>tentative</em> schedule for the entire semester is posted on the <a href='schedule.html'>schedule</a> page. The planned schedule, readings, and assignments will be refined and updated as the course progresses.</p>
<h2 id='text'>Text</h2>
<p>The official course textbook:</p>
<ul>
<li>Daniel Jurafsky and James H. Martin. <a href='http://www.cs.colorado.edu/~martin/slp2.html'>Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, Second Edition</a>, Upper Saddle River, NJ: Prentice-Hall, 2008.</li>
</ul>
<p><strong>NOTE:</strong> The first edition of Jurafsky and Martin is <em>not appropriate</em> – the second edition is a major update, and you’ll need to have that one, not the first.</p>
<p>Selected readings from this text will be suggested, and are indicated in the schedule with “JM”. Additional readings will be made available for download.</p>
<p>Manning and Schuetze will be useful as an extra resource:</p>
<ul>
<li>Christopher D. Manning and Hinrich Schuetze. <a href='http://nlp.stanford.edu/fsnlp/'>Foundations of Statistical Natural Language Processing</a>. Cambridge, MA: MIT Press.</li>
</ul>
<h2 id='exams_and_assignments'>Exams and Assignments</h2>
<p>There will be one midterm exam and one final exam. The midterm will consist of the material covered in the first half of the class, and the final will consist of material primarily from the second half.</p>
<p>Assignments will be posted on this website. They may be visible ahead of when they are assigned, but they are all subject to change up to the date that they are officially assigned.</p>
<p>Programming assignments must be completed in Scala.</p>
<h2 id='philosophy_and_goal'>Philosophy and Goal</h2>
<p>The foremost goal of this course is to expose the student to advanced techniques and applications of natural language processing (NLP), especially those involving statistical approaches. The course will address both theoretical and applied topics.</p>
<p>Some specific goals of the course are to enable students to:</p>
<ul>
<li>understand core algorithms and data structures used in NLP</li>
<li>utilize corpora and annotations added to them</li>
<li>build statistical NLP components, such as n-gram language models, text classifiers and part-of-speech taggers, that learn from such corpora</li>
<li>evaluate the merits of different machine learning methods for given NLP tasks</li>
<li>appreciate the relationship between linguistic representations and computational applications</li>
</ul>
<p>This course presents an opportunity for students to gain experience with the models and algorithms of computational linguistics that underlie practical applications, while gaining an appreciation for the theoretical questions of the field. It will thus help prepare the student both for jobs in industry and for doing original research in computational linguistics.</p>
<h2 id='content_overview'>Content Overview</h2>
<p>Natural Language Processing (NLP) is concerned with automatically processing human language. Applications include machine translation, search, automatic summarization, and dialog systems. NLP has proved to be hard, in part because of the structural complexity of human language and in part because of the massive amount of world knowledge that humans use in language understanding.</p>
<p>The field of computational linguistics has experienced significant growth in the last ten years. In addition to the hard work of researchers in the field in general, some of the most important factors behind this include the use of statistical techniques, the availability of large (sometimes annotated) corpora (including the web itself), and the availability of relatively cheap and powerful computers. Together, these factors have played a major part in making computational linguistics very relevant in applied settings.</p>
<p>This course provides a broad introduction to NLP with a particular emphasis on core algorithms, data structures, and machine learning for NLP. Techniques we will study include:</p>
<ul>
<li>using corpora</li>
<li>n-gram language models</li>
<li>hidden Markov models</li>
<li>distributional models</li>
<li>topic models</li>
<li>probabilistic classifiers</li>
<li>experimental methodology in NLP</li>
</ul>
<p>Applications discussed in the course will include:</p>
<ul>
<li>sentiment analysis</li>
<li>part-of-speech tagging</li>
<li>spelling correction</li>
<li>word sense disambiguation</li>
<li>machine translation</li>
</ul>
<p>With respect to content, the goal of this course is to give the student an appreciation for the broad research topics currently being pursued in the field of computational linguistics. By the end of the course, the student should be able to:</p>
<ul>
<li>identify and discuss the characteristics of different NLP techniques</li>
<li>identify and discuss the characteristics of machine learning techniques used in NLP</li>
<li>implement a naive Bayes classifier</li>
<li>implement a hidden Markov model for part-of-speech tagging</li>
<li>understand what constitutes a probabilistic language model and understand the difference in assumptions between different types of such models (e.g. bag-of-words, n-gram, HMM, topic model)</li>
<li>create features for probabilistic classifiers to model novel NLP tasks</li>
</ul>
<h2 id='course_requirements'>Course Requirements</h2>
<ul>
<li>Assignments (70%): A series of assignments will be given out during the semester. Most of these assignments will have a programming component—these must be completed using the Scala programming language.</li>
<li>Midterm Exam (15%): There will be a midterm exam over the material covered during the first half of the semester.</li>
<li>Final Exam (15%): There will be a final exam covering all course material.</li>
</ul>
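<p>As a rough illustration of how the weights above combine, the following Python sketch computes a weighted course grade. The names <code>WEIGHTS</code> and <code>course_grade</code> are hypothetical, and the sketch assumes each component has already been reduced to a single score from 0 to 100; how individual assignments are aggregated into the assignment score is not specified here.</p>

```python
# Weights taken from the Course Requirements list (percent of the course grade).
WEIGHTS = {"assignments": 70, "midterm": 15, "final": 15}


def course_grade(scores):
    """Combine per-component scores (each 0-100) into a weighted course grade.

    Assumption: `scores` has one aggregate score per component; the syllabus
    does not say how individual assignments are averaged.
    """
    return sum(WEIGHTS[name] * scores[name] for name in WEIGHTS) / 100


# Hypothetical example scores, for illustration only.
print(course_grade({"assignments": 90, "midterm": 80, "final": 70}))
```

<p>With these made-up scores, the assignment component dominates the result, which reflects its 70% weight.</p>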
<h2 id='deadline_policy'>Deadline Policy</h2>
<p>Homework must be turned in by the date and time specified in order to receive full credit.</p>
<p>Written assignments will always be due <strong>by the time the lecture starts</strong> on the due date. This means that you should absolutely not be late to class. Any assignment turned in even a minute after the start of the lecture will be considered a day late and will be penalized accordingly.</p>
<p>Programming assignments that are submitted electronically will always be due <strong>two hours before the start of class</strong> on the due date. If you submit your assignment even 1 hour and 59 minutes before the start of class, it will be considered a day late. This is to discourage you from working on the assignment right up until the start of class, and then walking in late to the lecture.</p>
<p>Credit will be deducted for lateness: 10% for every 24-hour period past the deadline. For example, an assignment due at 2pm on Tuesday will have 10% deducted if it is turned in late but before 2pm on Wednesday. It will have 20% deducted if it is turned in by 2pm Thursday, etc.</p>
<p>Late submissions will not be accepted if they are more than 5 days past the deadline. No points will be received in this case.</p>
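<p>The lateness rules above can be sketched in Python as follows. This is only an illustration, not an official grading tool: the function name <code>late_penalty_percent</code> is hypothetical, and the sketch assumes that a submission exactly on a 24-hour boundary still counts toward the earlier period, and that a submission exactly 5 days late is still accepted. For programming assignments, the deadline passed in would be two hours before the start of class.</p>

```python
import math
from datetime import datetime, timedelta


def late_penalty_percent(deadline, submitted):
    """Return the percentage of credit deducted for a submission.

    Sketch of the stated policy: 10% per started 24-hour period past the
    deadline, and no credit once more than 5 days have passed.
    """
    late_by = submitted - deadline
    if late_by > timedelta(days=5):
        return 100  # too late to be accepted: no points
    if late_by > timedelta(0):
        # Every started 24-hour period costs 10%, so round the lateness up.
        periods = math.ceil(late_by / timedelta(days=1))
        return 10 * periods
    return 0  # on time


# Hypothetical deadline: 2pm on a Tuesday.
due = datetime(2013, 9, 3, 14, 0)
print(late_penalty_percent(due, due + timedelta(minutes=1)))        # late, before 2pm Wednesday
print(late_penalty_percent(due, due + timedelta(days=1, hours=3)))  # late, before 2pm Thursday
```

<p>Rounding up is what makes a submission that is one minute late count as a full day late, as described above.</p>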
<p>Programming takes time. You are highly encouraged to start working on assignments as soon as they are announced to make sure you have enough time to complete them.</p>
<p>Penalty-free extensions will be considered on a case-by-case basis and only if the student asks for the extension before the deadline. In most cases they will not be granted. The greater the advance notice of a need for an extension, the greater the likelihood of leniency.</p>
<h2 id='academic_dishonesty_policy'>Academic Dishonesty Policy</h2>
<p>You are encouraged to discuss assignments with classmates, but all written work must be your own. If in doubt, ask the instructor.</p>
<p>Students who violate University rules on academic dishonesty are subject to disciplinary penalties, including the possibility of failure in the course and/or dismissal from the University. Since such dishonesty harms the individual, all students, and the integrity of the University, policies on academic dishonesty will be strictly enforced. For further information please visit the Student Judicial Services Web site: <a href='http://deanofstudents.utexas.edu/sjs'>http://deanofstudents.utexas.edu/sjs</a>.</p>
<h2 id='notice_about_students_with_disabilities'>Notice about students with disabilities</h2>
<p>The University of Texas at Austin provides appropriate accommodations for qualified students with disabilities. To determine if you qualify, please contact the Dean of Students at 512-471-6529 or UT Services for Students with Disabilities. If they certify your needs, we will work with you to make appropriate arrangements.</p>
<p>UT SSD Website: <a href='http://www.utexas.edu/diversity/ddce/ssd'>http://www.utexas.edu/diversity/ddce/ssd</a></p>
<h2 id='notice_about_missed_work_due_to_religious_holy_days'>Notice about missed work due to religious holy days</h2>
<p>A student who misses an examination, work assignment, or other project due to the observance of a religious holy day will be given an opportunity to complete the work missed within a reasonable time after the absence, provided that he or she has properly notified the instructor. It is the policy of the University of Texas at Austin that the student must notify the instructor at least fourteen days prior to the classes scheduled on dates he or she will be absent to observe a religious holy day. For religious holy days that fall within the first two weeks of the semester, the notice should be given on the first day of the semester. The student will not be penalized for these excused absences, but the instructor may appropriately respond if the student fails to complete satisfactorily the missed assignment or examination within a reasonable time after the excused absence.</p>
</div>
<div id='footer'></div>
</div>
</body>
</html>