1
00:00:05,850 --> 00:00:07,713
欢迎来到 Hugging Face 课程。
Welcome to the Hugging Face Course.
2
00:00:08,550 --> 00:00:10,320
本课程旨在带您了解
This course has been designed to teach you
3
00:00:10,320 --> 00:00:12,750
关于 Hugging Face 生态系统的一切
all about the Hugging Face ecosystem,
4
00:00:12,750 --> 00:00:14,700
包括如何使用数据集和模型中心
how to use the dataset and model hub
5
00:00:14,700 --> 00:00:16,803
以及我们所有的开源库。
as well as all our open-source libraries.
6
00:00:18,300 --> 00:00:19,950
这是本课程的目录。
Here is the Table of Contents.
7
00:00:19,950 --> 00:00:22,770
它共分为三个部分
As you can see, it's divided into three sections
8
00:00:22,770 --> 00:00:25,110
由浅入深地带您学习。
which become progressively more advanced.
9
00:00:25,110 --> 00:00:28,500
到目前为止,前两部分已经发布。
At this stage, the first two sections have been released.
10
00:00:28,500 --> 00:00:30,120
在课程的一开始,我们会教您基础知识
So first, we'll teach you the basics
11
00:00:30,120 --> 00:00:32,250
包括如何使用 Transformer 模型,
of how to use a Transformer model,
12
00:00:32,250 --> 00:00:34,230
以及如何在您自己的数据集上进行微调
fine-tune it on your own dataset
13
00:00:34,230 --> 00:00:36,960
并与社区分享结果。
and share the result with the community.
14
00:00:36,960 --> 00:00:39,420
然后,我们将带您深入了解我们的开源库
So second, we'll dive deeper into our libraries
15
00:00:39,420 --> 00:00:42,360
并教您如何处理任何 NLP 任务。
and teach you how to tackle any NLP task.
16
00:00:42,360 --> 00:00:44,430
我们正在积极研究最后一部分
We're actively working on the last one
17
00:00:44,430 --> 00:00:47,280
并希望在 2022 年春季完成并发布。
and hope to have it ready for you for the spring of 2022.
18
00:00:48,510 --> 00:00:50,880
第一章不需要技术知识
The first chapter requires no technical knowledge
19
00:00:50,880 --> 00:00:52,320
只会为您介绍一些基础知识
and is a good introduction to learn
20
00:00:52,320 --> 00:00:54,180
例如 Transformers 模型可以做什么
what Transformer models can do
21
00:00:54,180 --> 00:00:56,883
以及它如何帮助到您以及应用到您公司的业务中。
and how they could be of use to you or your company.
22
00:00:58,050 --> 00:01:01,110
第一部分之后的章节需要具备 Python 的相关知识
The next chapters require a good knowledge of Python
23
00:01:01,110 --> 00:01:02,130
以及机器学习和深度学习的
and some basic knowledge of
24
00:01:02,130 --> 00:01:04,350
一些基础知识。
Machine Learning and Deep Learning.
25
00:01:04,350 --> 00:01:07,110
如果您不知道什么是训练集和验证集
If you don't know what a training and validation set are
26
00:01:07,110 --> 00:01:09,360
或者梯度下降法意味着什么,
or what gradient descent means,
27
00:01:09,360 --> 00:01:11,340
您应该看看一些
you should look at an introductory course
28
00:01:11,340 --> 00:01:14,863
诸如 deeplearning.ai 或 fast.ai 发布的入门课程
such as the ones published by deeplearning.ai or fast.ai.
29
00:01:16,200 --> 00:01:17,910
如果您有一些关于某个
It's also best if you have some basics
30
00:01:17,910 --> 00:01:21,150
深度学习框架、PyTorch 或 TensorFlow 中的基础知识那就更好了。
in one Deep Learning Framework, PyTorch or TensorFlow.
31
00:01:21,150 --> 00:01:23,520
本课程介绍的各部分资料
Each part of the material introduced in this course
32
00:01:23,520 --> 00:01:25,590
在 PyTorch 和 TensorFlow 中都有一个相对应的版本,
has a version in both those frameworks,
33
00:01:25,590 --> 00:01:26,730
这样您就可以选择一个
so you will be able to pick the one
34
00:01:26,730 --> 00:01:28,230
您最熟悉的版本。
you are most comfortable with.
35
00:01:29,550 --> 00:01:31,740
这是开发这门课程的团队。
This is the team that developed this course.
36
00:01:31,740 --> 00:01:33,120
接下来每位讲师会先
I'll now let each of the speakers
37
00:01:33,120 --> 00:01:34,570
简单介绍一下自己。
introduce themselves briefly.
38
00:01:37,230 --> 00:01:38,880
- Hi,我叫马修,
- Hi, my name is Matthew,
39
00:01:38,880 --> 00:01:41,610
我是 Hugging Face 的机器学习工程师。
and I'm a Machine Learning Engineer at Hugging Face.
40
00:01:41,610 --> 00:01:43,200
我在开源团队工作
I work on the open-source team
41
00:01:43,200 --> 00:01:45,180
我负责维护
and I'm responsible for maintaining particularly
42
00:01:45,180 --> 00:01:47,280
团队内的 TensorFlow 代码。
the TensorFlow code there.
43
00:01:47,280 --> 00:01:50,130
在此之前,我是 Parse.ly 的机器学习工程师,
Previously, I was a Machine Learning Engineer at Parse.ly,
44
00:01:50,130 --> 00:01:52,620
最近该公司被 Automattic 收购,
which was recently acquired by Automattic,
45
00:01:52,620 --> 00:01:54,210
我是一名博士后研究员
and I was a postdoctoral researcher
46
00:01:54,210 --> 00:01:57,000
之前在爱尔兰都柏林三一学院
before that at Trinity College, Dublin in Ireland
47
00:01:57,000 --> 00:02:00,093
致力于计算遗传学和视网膜疾病的研究。
working on computational genetics and retinal disease.
48
00:02:02,400 --> 00:02:03,870
- Hi,我是 Lysandre
- Hi, I'm Lysandre.
49
00:02:03,870 --> 00:02:05,640
我是 Hugging Face 的机器学习工程师
I'm a Machine Learning Engineer at Hugging Face
50
00:02:05,640 --> 00:02:08,700
我是开源团队的一员。
and I'm specifically part of the open-source team.
51
00:02:08,700 --> 00:02:10,890
我已经在 Hugging Face 团队和我的团队成员一起
I've been at Hugging Face for a few years now
52
00:02:10,890 --> 00:02:12,300
工作了好几年,
and alongside my team members,
53
00:02:12,300 --> 00:02:13,890
我一直致力于研究大多数您将在
I've been working on most of the tools
54
00:02:13,890 --> 00:02:15,790
本课程中看到的工具。
that you'll get to see in this course.
55
00:02:18,270 --> 00:02:20,130
- Hi,我是 Sylvain
- Hi, I'm Sylvain.
56
00:02:20,130 --> 00:02:22,140
我是 Hugging Face 的研究工程师
I'm a Research Engineer at Hugging Face
57
00:02:22,140 --> 00:02:25,830
也是 Transformers 代码库的主要维护者之一。
and one of the main maintainers of the Transformers Library.
58
00:02:25,830 --> 00:02:28,110
之前,我在 fast.ai 工作
Previously, I worked at fast.ai
59
00:02:28,110 --> 00:02:30,420
我帮助开发了 fast.ai 库
where I helped develop the fast.ai Library
60
00:02:30,420 --> 00:02:32,220
以及在线图书。
as well as the online book.
61
00:02:32,220 --> 00:02:35,340
在那之前,我是一名在法国的
Before that, I was a math and computer science teacher
62
00:02:35,340 --> 00:02:36,173
数学和计算机科学老师。
in France.
63
00:02:38,550 --> 00:02:41,340
- Hi,我叫 Sasha,是 Hugging Face 的一名研究员,
- Hi, my name is Sasha and I'm a Researcher at Hugging Face,
64
00:02:41,340 --> 00:02:42,420
致力于道德,
working on the ethical,
65
00:02:42,420 --> 00:02:46,230
机器学习模型的环境和社会影响相关的研究。
environmental and social impacts of machine learning models.
66
00:02:46,230 --> 00:02:49,020
之前,我是 Mila 蒙特利尔大学的
Previously, I was a postdoctoral researcher at Mila,
67
00:02:49,020 --> 00:02:50,400
博士后研究员
University of Montreal,
68
00:02:50,400 --> 00:02:53,040
我还为联合国全球脉搏计划担任过
and I also worked as an Applied AI Researcher
69
00:02:53,040 --> 00:02:55,140
应用人工智能研究员。
for the United Nations Global Pulse.
70
00:02:55,140 --> 00:02:57,300
参与过 CodeCarbon 和
I've been involved in projects such as CodeCarbon
71
00:02:57,300 --> 00:02:59,790
机器学习影响计算器等项目
and the Machine Learning Impacts Calculator
72
00:02:59,790 --> 00:03:02,390
致力于衡量机器学习的碳足迹的研究。
to measure the carbon footprint of machine learning.
73
00:03:05,160 --> 00:03:07,650
- Hi,我是 Merve,我是 Hugging Face 团队的
- Hi, I'm Merve and I'm a Developer Advocate
74
00:03:07,650 --> 00:03:09,390
开发技术推广工程师
at Hugging Face.
75
00:03:09,390 --> 00:03:12,480
在此之前,我是一名机器学习工程师
Previously, I was working as a Machine Learning Engineer
76
00:03:12,480 --> 00:03:15,360
负责构建 NLP 工具和聊天机器人。
building NLP tools and chatbots.
77
00:03:15,360 --> 00:03:17,670
目前,我正在努力改进模型中心
Currently, I'm working to improve the hub
78
00:03:17,670 --> 00:03:19,563
并使机器学习民主化。
and democratize machine learning.
79
00:03:22,140 --> 00:03:23,670
- 大家好。
- Hello everyone.
80
00:03:23,670 --> 00:03:27,210
我叫 Lucile,是 Hugging Face 团队的
My name is Lucile and I'm a Machine Learning Engineer
81
00:03:27,210 --> 00:03:28,353
一名机器学习工程师
at Hugging Face.
82
00:03:29,580 --> 00:03:32,550
用两句话告诉您我是谁,
To tell you in two sentences who I am,
83
00:03:32,550 --> 00:03:35,590
我致力于开源工具的开发和支持
I work on the development and support of open-source tools
84
00:03:36,600 --> 00:03:39,595
我也参与了在自然语言处理领域的
and I also participate in several research projects
85
00:03:39,595 --> 00:03:41,795
几个研究项目。
in the field of Natural Language Processing.
86
00:03:44,610 --> 00:03:45,540
- 大家好。
- Good day there.
87
00:03:45,540 --> 00:03:47,550
我是 Lewis,我是 Hugging Face 开源团队中的
I'm Lewis and I'm a Machine Learning Engineer
88
00:03:47,550 --> 00:03:50,130
一名机器学习工程师。
in the open-source team at Hugging Face.
89
00:03:50,130 --> 00:03:53,490
我热衷于为 NLP 社区开发工具
I'm passionate about developing tools for the NLP community
90
00:03:53,490 --> 00:03:55,050
您可以在很多 Hugging Face
and you'll see me at many of Hugging Face's
91
00:03:55,050 --> 00:03:56,910
对外的活动中见到我
outreach activities.
92
00:03:56,910 --> 00:03:58,470
在加入 Hugging Face 之前,
Before joining Hugging Face,
93
00:03:58,470 --> 00:03:59,790
我花了几年时间
I spent several years developing
94
00:03:59,790 --> 00:04:01,860
为初创公司和 NLP 领域的企业
machine learning applications for startups
95
00:04:01,860 --> 00:04:04,230
开发机器学习应用程序,
and enterprises in the domains of NLP,
96
00:04:04,230 --> 00:04:07,260
以及拓扑数据分析和时间序列。
topological data analysis and time series.
97
00:04:07,260 --> 00:04:10,110
在此之前,我是一名理论物理学家,
In a former life, I was a theoretical physicist,
98
00:04:10,110 --> 00:04:11,760
负责在大型强子对撞机等
where I researched particle collisions
99
00:04:11,760 --> 00:04:13,560
研究粒子碰撞。
at the Large Hadron Collider and so on.
100
00:04:15,900 --> 00:04:18,450
- Hey,我是 Leandro,我是 Hugging Face 开源团队中
- Hey, I'm Leandro and I'm a Machine Learning Engineer
101
00:04:18,450 --> 00:04:21,030
的一名机器学习工程师
in the open-source team at Hugging Face.
102
00:04:21,030 --> 00:04:23,460
在加入 Hugging Face 之前,我是一名在瑞士的数据科学家
Before joining Hugging Face, I worked as a Data Scientist
103
00:04:23,460 --> 00:04:26,733
并在大学教授数据科学。
in Switzerland and have taught Data Science at University.