-
Notifications
You must be signed in to change notification settings - Fork 0
/
index.html
1157 lines (1067 loc) · 65.3 KB
/
index.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
<!DOCTYPE html>
<html lang="en" class="no-js">
<head>
<meta charset="UTF-8" />
<meta http-equiv="X-UA-Compatible" content="IE=edge">
<title>Hung Ngo homepage!</title>
<meta name="viewport" content="width=device-width, initial-scale=1, maximum-scale=1" />
<meta name="description" content="Welcome to my homepage!" />
<meta name="keywords" content="vcard, resposnive, retina, resume, jquery, css3, bootstrap, Sunshine, portfolio" />
<meta name="author" content="lmtheme" />
<link rel="shortcut icon" href="favicon.png">
<link rel="stylesheet" href="css/main.css" type="text/css">
<link rel="stylesheet" href="css/bootstrap.min.css">
<link rel="stylesheet" href="css/normalize.css">
<link rel="stylesheet" href="css/transition-animations.css">
<link rel="stylesheet" href="css/owl.carousel.css">
<link rel="stylesheet" href="css/magnific-popup.css">
<link rel="stylesheet" href="css/animate.css">
<link rel="stylesheet" href="css/owl.carousel.css" type="text/css">
<link rel="stylesheet" href="css/owl.theme.default.css" type="text/css">
<!-- This styles needs for demo -->
<link rel="stylesheet" href="css/lmpixels-demo-panel.css" type="text/css">
<link rel="stylesheet" href="css/custom.css" type="text/css">
<!-- /This styles needs for demo -->
<script src="js/jquery-2.1.3.min.js"></script>
<script src="js/modernizr.custom.js"></script>
<!-- Javascript for owl carousel-->
<script type="text/javascript" src="https://ajax.googleapis.com/ajax/libs/jquery/3.4.1/jquery.min.js"></script>
<script type="text/javascript" src="js/bootstrap.js"></script>
<script type="text/javascript" src="js/owl.carousel.js"></script>
<script type="text/javascript" src="js/jquery-2.1.3.min.js"></script>
<script type="text/javascript" src="js/script.js"></script>
</head>
<body>
<!-- Loading animation -->
<div class="preloader">
<div class="preloader-animation">
<div class="dot1"></div>
<div class="dot2"></div>
</div>
</div>
<!-- /Loading animation -->
<div id="page" class="page">
<!-- Header -->
<header id="site_header" class="header">
<div class="my-photo">
<img src="images/flower.jpg" alt="image">
<div class="mask"></div>
</div>
<div class="site-title-block">
<a href="https://scholar.google.com/citations?user=arBhGkkAAAAJ&hl=en" target="_blank"><h1 class="site-title">HUNG Q. NGO</h1></a>
<p class="site-description">Ph.D. in Computer Science</p>
</div>
<a class="menu-toggle mobile-visible">
<i class="fa fa-bars"></i>
</a>
</header>
<!-- /Header -->
<!-- Main Content -->
<div id="main" class="site-main">
<!-- Page changer wrapper -->
<div class="pt-wrapper">
<!-- Navigation & Social buttons -->
<div class="site-nav mobile-menu-hide">
<!-- Main menu -->
<ul id="nav" class="site-main-menu">
<!-- About Me Subpage link -->
<li>
<a class="pt-trigger" href="#about_me" data-animation="58" data-goto="1">About me</a><!-- href value = data-id without # of .pt-page -->
</li>
<li>
<a class="pt-trigger" href="#resume" data-animation="59" data-goto="2">Resume</a>
</li>
<li>
<a class="pt-trigger" href="#teaching" data-animation="60" data-goto="3">Teaching</a>
</li>
<li>
<a class="pt-trigger" href="#research" data-animation="58" data-goto="4">Research</a>
</li>
<li>
<a class="pt-trigger" href="#publication" data-animation="60" data-goto="5">Pulications</a>
</li>
<li>
<a class="pt-trigger" href="#evbcorpus" data-animation="61" data-goto="6">EVBCorpus</a>
</li>
<!--<li><a class="pt-trigger" href="#hobbies" data-animation="61" data-goto="6">Hobbies</a></li>-->
<li>
<a class="pt-trigger" href="#contact" data-animation="58" data-goto="8">Contact</a>
</li>
</ul>
<!-- /Main menu -->
<!-- Social buttons -->
<ul class="social-links">
<li><a class="tip social-button" href="https://www.linkedin.com/in/quochung-ngo/" title="Linkedin"><i class="fa fa-linkedin-square" style="font-size:40px"></i></a></li>
<li><a class="tip social-button" href="https://scholar.google.com/citations?user=arBhGkkAAAAJ&hl=en&oi=ao" title="Scholar"><i class="fa fa-scholar-square" style="font-size:40px"></i></a></li>
<li><a class="tip social-button" href="https://www.facebook.com/hungngovnie/" title="Facebook"><i class="fa fa-facebook" style="font-size:40px"></i></a></li>
</ul>
<!-- /Social buttons -->
</div>
<!-- Navigation & Social buttons -->
<!-- Subpages -->
<div class="subpages">
<!-- About Me Subpage -->
<section class="pt-page pt-page-1" data-id="about_me">
<div class="row">
<div class="col-sm-12 col-md-12 mobile-visible subpage-block">
<div class="my-photo-small">
<img src="images/intro/photo_small.jpg" alt="image">
</div>
</div>
<div class="col-sm-12 col-md-12 subpage-block">
<div class="general-info">
<h3>A data scientist in Computer Science</h3>
<p>Hung Ngo is currently a lecturer of School of Management, Technological University Dublin (TUDublin, Ireland). His research interests are Knowledge Management, Data Mining, Data Analytics, and Natural Language Processing. He has involved many research topics, including machine translation, building ontology, text mining, linked data and internet of things.</p>
<p>He receives his PhD degree in Computer Science at University College Dublin (UCD, Ireland). He received the Msc and Bsc in Computer Science at University of Science, Vietnam National University - HoChiMinh City.</p>
<p>He has spent a short research stay belong to the research exchange program at the University of Vienna, Austria. Co-operations have been established to carry out the EVBCorpus project and EVBAlign project with Prof. Dinh Dien and Prof. Werner Winiwarter.</p>
<p>He got an internship programm in INSIGHT Center, Galway, Ireland in 2014. He has worked on the GraphOfThings project which is the knowledge graph of connected things. GraphOfThings is creating meaningful links among millions of physical and virtual things to create a dynamic knowledge graph that plays the role as insight for a real-time search engine for events happening around us. This project won the 2nd Award at Semantic Web Challenge 2014 at ISWC, Trentino, Italia.</p>
<p>He also got an internship in National Institute of Informatics, Japan supervised by Prof. Nigel Collier. Researches were involved in the BioCaster project and building geographical ontology. Integrating geo-ontology into the Global Health Monitor system, building the webpage for publishing project result;</p>
</div>
</div>
<div class="col-sm-6 col-md-6 subpage-block block-title">
<h3>Research Fields</h3>
<h4>Knowledge Management</h4>
<ul>
<li>Ontology Engineering</li>
<li>Knowledge Graph</li>
<li>Linked Data</li>
</ul>
<h4>Natural Language Processing</h4>
<ul>
<li>Named Entity Recognition</li>
<li>Text Classification</li>
<li>Bilingual Corpora</li>
<li>Knowledge Graph</li>
<li>Text Generation</li>
<li>Machine Translation</li>
</ul>
<h4>Data Mining</h4>
<ul>
<li>Classification</li>
<li>Prediction</li>
<li>Association</li>
<li>Clustering</li>
</ul>
</div>
<div class="col-sm-6 col-md-6 subpage-block">
<h3 style="color:white">.</h3>
<h4>Agriculture Sector</h4>
<ul>
<li>Agriculture Ontology</li>
<li>Farming Data Analytics</li>
</ul>
<h4>Bioinformatic Sector</h4>
<ul>
<li>Bio-Ontology</li>
<li>Medical Named Entity Recognition</li>
<li>Global Disease Monitoring</li>
</ul>
<h4>IoT Sector</h4>
<ul>
<li>Linked Data</li>
<li>Semantic Sensor Network</li>
<li>Social Sensor Network</li>
</ul>
</div>
</div>
<!-- Services block -->
<div class="block-title">
<h3>Brief Introduction</h3>
</div>
<div id="slider">
<div class="owl-carousel owl-theme">
<div class="subpage-block service-block service-info">
<a href="https://www.ucd.ie/consus/">
<img src="images/consus.jpg" alt="PhD researcher">
</a>
<h4>PhD researcher</h4>
<p>Working at CONSUS Lab</p>
<p>Jan. 2018 ~ Current</p>
</div>
<div class="subpage-block service-block service-info">
<a href="https://www.ucd.ie/cs/">
<img src="images/ucd.png" alt="TA/Demonstrator">
</a>
<h4>TA/Demonstrator</h4>
<p>Working at CS School</p>
<p>Jan. 2018 ~ Current</p>
</div>
<div class="subpage-block service-block service-info">
<a href="http://www.insight.ac.jp/">
<img src="images/insight.png" alt="Internship">
</a>
<h4>Internship</h4>
<p>Worked at IoT-LMS lab</p>
<p>Jun. 2014 ~ Dec. 2014</p>
</div>
<div class="subpage-block service-block service-info">
<a href="https://cslearn.cs.univie.ac.at/">
<img src="images/uniwien.jpg" alt="Internship">
</a>
<h4>Internship</h4>
<p>Worked at CSLEARN lab</p>
<p>June. 2011 ~ July. 2011</p>
</div>
<div class="subpage-block service-block service-info">
<a href="http://www.nii.ac.jp/">
<img src="images/nii.jpg" alt="Internship">
</a>
<h4>Internship</h4>
<p>Worked at Nigel Collier lab</p>
<p>Oct. 2006 ~ Mar. 2007</p>
</div>
<div class="subpage-block service-block service-info">
<a href="https://www.uit.edu.vn/">
<img src="images/vnu-uit.png" alt="Lecturer">
</a>
<h4>Lecturer</h4>
<p>IT Lecturer at UIT-VNUHCM</p>
<p>Jun. 2006 ~ Present</p>
</div>
<div class="subpage-block service-block service-info">
<a href="https://citd.vn/">
<img src="images/citd.jpg" alt="IT Lecturer">
</a>
<h4>Lecturer</h4>
<p>IT Lecturer at CITD-VNUHCM</p>
<p>Oct. 2002 ~ Jun. 2006</p>
</div>
</div>
</div>
<!-- End of Services block -->
<!-- Clients block -->
<!--
<div class="block-title">
<h3>Worked/Visited Countries</h3>
</div>
<div id="slider2">
<div class="owl-carousel owl-theme">
<div class="service-block">
<a class="service-title" href="https://en.wikipedia.org/wiki/Vietnam" target="_blank"><img src="images/vietnam.png" alt="image" >Vietnam</a>
</div>
<div class="service-block">
<a class="service-title" href="https://en.wikipedia.org/wiki/Japan" target="_blank"><img src="images/japan.png" alt="image">Japan</a>
</div>
<div class="service-block">
<a class="service-title" href="https://en.wikipedia.org/wiki/Austria" target="_blank"><img src="images/austria.png" alt="image">Austria</a>
</div>
<div class="service-block">
<a class="service-title" href="https://en.wikipedia.org/wiki/Ireland" target="_blank"><img src="images/ireland.jpg" alt="image">Ireland</a>
</div>
<div class="service-block">
<a class="service-title" href="https://en.wikipedia.org/wiki/United_Kingdom" target="_blank"><img src="images/united_kingdom.png" alt="image">United Kingdom</a>
</div>
</div>
</div>
-->
<!-- End of Clients block -->
</section>
<!-- End of About Me Subpage -->
<!-- Resume Subpage -->
<section class="pt-page pt-page-2" data-id="resume">
<div class="section-title-block">
<h2 class="section-title">Resume</h2>
<h5 class="section-description">19 Working Years</h5>
</div>
<div class="row">
<!-- Education section-->
<div class="col-sm-6 col-md-4 subpage-block">
<div class="block-title">
<h3>Education</h3>
</div>
<div class="timeline">
<!-- Single event -->
<div class="timeline-event te-primary">
<h5 class="event-date">Jan 2018 ~ Present</h5>
<h4 class="event-name">Ph.D. in Computer Science</h4>
<span class="event-description">University College Dublin, Ireland</span>
<p>School: Computer Science</p>
<p>Dissertation: Ontology-based Knowledge Map Model for Knowledge Handling</p>
</div>
<!-- Single event -->
<div class="timeline-event te-primary">
<h5 class="event-date">Sep 2004 ~ December 2008</h5>
<h4 class="event-name">Master in Computer Science</h4>
<span class="event-description">VNUHCM-University of Science</span>
<p>Faculty: Information Technology</p>
<p>Thesis: Automatic Searching English–Vietnamese Documents From The Internet</p>
</div>
<!-- Single event -->
<div class="timeline-event te-primary">
<h5 class="event-date">Sep 1998 ~ Sep 2002</h5>
<h4 class="event-name">Bachelor in Computer Science</h4>
<span class="event-description">VNUHCM-University of Science</span>
<p>Faculty: Information Technology</p>
<p>Thesis: Word Alignment in English-Vietnamese Bilingual Text</p>
</div>
</div>
</div>
<!--End of Education-->
<!-- Experience section-->
<div class="col-sm-6 col-md-4 subpage-block">
<div class="block-title">
<h3>Experience</h3>
</div>
<div class="timeline">
<!-- Single event -->
<div class="timeline-event te-primary">
<h5 class="event-date">Jan 2018 - Current</h5>
<h4 class="event-name">PhD Researcher</h4>
<span class="event-description">University College Dublin (Ireland)</span>
</div>
<!-- Single event -->
<div class="timeline-event te-primary">
<h5 class="event-date">Jun 2014 - Dec 2014</h5>
<h4 class="event-name">Internship</h4>
<span class="event-description">INSIGHT Center, NUIG (Ireland)</span>
</div>
<!-- Single event -->
<div class="timeline-event te-primary">
<h5 class="event-date">Oct 2006 - Mar 2007</h5>
<h4 class="event-name">Internship</h4>
<span class="event-description">National Institute of Informatics (Japan)</span>
</div>
<!-- Single event -->
<div class="timeline-event te-primary">
<h5 class="event-date">Oct 2006 - Mar 2007</h5>
<h4 class="event-name">Internship</h4>
<span class="event-description">National Institute of Informatics (Japan)</span>
</div>
<!-- Single event -->
<div class="timeline-event te-primary">
<h5 class="event-date">Jun 2006 - Current</h5>
<h4 class="event-name">IT Lecturer</h4>
<span class="event-description">VNUHCM-University of Information Technology</span>
</div>
<!-- Single event -->
<div class="timeline-event te-primary">
<h5 class="event-date">Oct 2002 - Jun 2006</h5>
<h4 class="event-name">IT Lecturer</h4>
<span class="event-description">VNUHCM-Center of Information Technology Development</span>
</div>
</div>
</div>
<!-- End of Experience section-->
<!-- Skills section-->
<div class="col-sm-6 col-md-4 subpage-block">
<div class="block-title">
<h3>Personal Skills</h3>
</div>
<div class="skills-info">
<h4>Negotiation</h4>
<div class="skill-container">
<div class="skill-percentage skill-5"></div>
</div>
<h4>Presentation</h4>
<div class="skill-container">
<div class="skill-percentage skill-4"></div>
</div>
<h4>Collaboration</h4>
<div class="skill-container">
<div class="skill-percentage skill-4"></div>
</div>
<h4>Adaptability</h4>
<div class="skill-container">
<div class="skill-percentage skill-4"></div>
</div>
</div>
<div class="block-title">
<h3>Professional Skills</h3>
</div>
<div class="skills-info">
<h4>Research</h4>
<div class="skill-container">
<div class="skill-percentage skill-4"></div>
</div>
<h4>Python</h4>
<div class="skill-container">
<div class="skill-percentage skill-4"></div>
</div>
<h4>C/C++/C#</h4>
<div class="skill-container">
<div class="skill-percentage skill-4"></div>
</div>
<h4>Java</h4>
<div class="skill-container">
<div class="skill-percentage skill-5"></div>
</div>
<h4>NLTK, Sci-kitlearn</h4>
<div class="skill-container">
<div class="skill-percentage skill-4"></div>
</div>
<h4>Tensorflow/Keras</h4>
<div class="skill-container">
<div class="skill-percentage skill-5"></div>
</div>
</div>
</div>
<!-- End of Skills section-->
</div>
<divc class="row">
<!-- Honors and Awards section-->
<div class="col-sm-6 col-md-6 block-title">
<div class="block-title">
<h3>Honors and Awards</h3>
</div>
<ol>
<li><h4>SFI Scholarship for PhD (2018-2021)</h4></li>
<p>University College Dublin (UCD)</p>
<p>Dublin, Ireland</p>
<li><h4>2nd Award of the Semantic Web Challenge (2014)</h4></li>
<p>13th International Semantic Web Conference</p>
<p>Trentino, Italia</p>
<li><h4>Research Internship (2014)</h4></li>
<p>INSIGHT Center, NUIG</p>
<p>Galway, Ireland</p>
<li><h4>ASEA-UNINET scholarship (2011)</h4></li>
<p>Short research stay at the Research Group Data Analytics and Computing</p>
<p>University of Vienna, Austria</p>
<li><h4>NII International Internship (2006)</h4></li>
<p>National Institute of Informatics</p>
<p>Tokyo, Japan</p>
</ol>
</div>
<!-- End of Honors and Awards section-->
<!-- Thesis Advisor section-->
<div class="col-sm-6 col-md-6 block-title">
<div class="block-title">
<h3>Industrial Units</h3>
</div>
<ol>
<li>
<h4>FI Technology (Jan. 2017 - Dec. 2027)</h4>
<ul>
<li>NLP/AI Expert</li>
<li>Building WorldBiz, which is a news express application based on NLP techniques and existing news resources to deliver oriental news to specific users.</li>
</ul>
</li>
<li>
<h4>Robot3T Group (Feb. 2012 – Dec. 2013)</h4>
<ul>
<li>Project Leader</li>
<li>Building software to control industrial robots and automatic machines, such as book auto-scanner machines. These applications and machines are sold to industrial partners.</li>
</ul>
</li>
<li>
<h4>C&D Semiconductor (Jun. 2007 – Jun. 2009)</h4>
<ul>
<li>Project Leader</li>
<li>Building software to control wafer sorters, microscope loaders, bright light, and linear track machines in the semiconductor industry. The software and machines are widely sold out and used in industrial FAB in the US and over the world.</li>
</ul>
</li>
</ol>
</div>
<!-- End of Thesis Advisor section-->
</divc>
<div class="row">
<div class="col-sm-12 col-md-12">
<div class="download-cv-block">
<a class="button" target="_blank" href="files/HungNgo_resume.pdf">Download CV</a>
</div>
</div>
</div>
</section>
<!-- End Resume Subpage -->
<!-- Resume Subpage -->
<section class="pt-page pt-page-2" data-id="teaching">
<div class="section-title-block">
<h2 class="section-title">Teaching</h2>
<h5 class="section-description">19 Working Years</h5>
</div>
<divc class="row">
<!-- Education section-->
<div class="col-sm-6 col-md-6 block-title">
<div class="block-title">
<h3>Teaching Modules</h3>
</div>
<div class="timeline">
<!-- Single event -->
<div class="timeline-event te-primary">
<h5 class="event-date">Jan 2018 ~ Present</h5>
<h4 class="event-name">Teaching Assistant / Demonstrator</h4>
<span class="event-description">University College Dublin, Ireland</span>
<p>School of Computer Science</p>
<p>- Computer Programming I<br/>
- Python Programming<br/>
- Data and Database Forensics<br/>
- OSINT - Collection & Analysis<br/>
- Data Mining<br/>
- Cloud Computing</p>
</div>
<!-- Single event -->
<div class="timeline-event te-primary">
<h5 class="event-date">Jun 2006 ~ Jan 2018</h5>
<h4 class="event-name">IT Lecture</h4>
<span class="event-description">VNUHCM-University of Information Technology</span>
<p>Faculty of Computer Science</p>
<p>- C/C++ Programming<br/>
- Data Structure and Algorithm<br/>
- Object-Oriented Programming<br/>
- Natural Language Processing<br/>
- Machine Translation<br/>
- Corpus Linguistics<br/>
- Semantic Web</p>
</div>
<!-- Single event -->
<div class="timeline-event te-primary">
<h5 class="event-date">Oct 2002 ~ Jun 2006</h5>
<h4 class="event-name">IT Lecture</h4>
<span class="event-description">VNUHCM-Center of Information Technology Development</span>
<p>Department of Computer Science</p>
<p>- C/C++ Programming<br/>
- Data Structure and Algorithm<br/>
- Object-Oriented Programming</p>
</div>
</div>
</div>
<!--End of Education-->
<!-- Thesis Advisor section-->
<div class="col-sm-6 col-md-6 block-title">
<div class="block-title">
<h3>Bachelor Thesis Advisor</h3>
</div>
<ol>
<li>
<h4>Quoc Thai Nguyen, Thoai Linh Nguyen (2020)</h4>
<ul>
<li>Thesis title: Sentiment Analysis of Vietnamese Reviews.</li>
<li>Faculty of Computer Science, University of Information Technology, VNUHCM, Vietnam</li>
<li>01 paper published in the IEEE NAFOSTED conference</li>
</ul>
</li>
<li>
<h4>Vu C.D. Hoang, Nguyen L. Nguyen (2006)</h4>
<ul>
<li>Thesis title: Vietnamese Text Classification</li>
<li>Faculty of Information Technology, University of Science, VNUHCM, Vietnam</li>
<li>01 paper published in the IEEE RIVF conference</li>
</ul>
</li>
<li>
<h4>Quoc Tri Tran, Xuan Thao Pham (2006)</h4>
<ul>
<li>Thesis title: Named entity recognition in Vietnamese documents.</li>
<li>Faculty of Information Technology, University of Science, VNUHCM, Vietnam</li>
<li>01 paper published in the Progress in Informatics journal</li>
</ul>
</li>
</ol>
</div>
<!-- End of Thesis Advisor section-->
</divc>
</section>
<!-- End Resume Subpage -->
<!-- Research Subpage -->
<section class="pt-page pt-page-3" data-id="research">
<div class="section-title-block">
<h2 class="section-title">Research</h2>
<h5 class="section-description">Project Illustration</h5>
</div>
<div class="row">
<div class="col-sm-6 col-md-6 subpage-block" style="text-align: justify">
<h3>CONSUS - (2018-present)</h3>
<span>CONSUS is a collaborative research partnership between University College Dublin (UCD) and Origin Enterprises PLC that has been supported through the Science Foundation Ireland (SFI) Strategic Partnership Programme. The €17.6 million five-year project will investigate digital, precision agriculture and crop science through a strong multi and inter-disciplinary approach that combines the leading expertise of UCD in data science and agricultural science with Origin’s integrated crop management research, systems, capabilities and extensive on-farm knowledge exchange networks.</span><br>
<span><b>Tasks:</b> Developing Ontology-based Knowledge Map model to handle Mining Knowledge for Digital Agriculture. This task requires several sub-tasks, such as building ontology, named entity recognition, entity linking, building RDF storage based on linked data techniques, and building knowledge browser.</span>
</div>
<div class="col-sm-6 col-md-6 subpage-block" style="text-align: justify">
<h3>GraphOfThings - (2014-2016)</h3>
<span><img src="images/GoT_worldmap_small.png" style="float: right; width:250px; padding-left: 15px; padding-bottom: 15px;"> GraphOfThings is the knowledge graph of connected things. GraphOfThings (GoT) is creating meaningful links among millions of physical and virtual things to create a dynamic knowledge graph that plays the role as insight for a real-time search engine for events happening around us.</span><br>
<span><b>Tasks:</b> Integrating social media channel networks as virtual things into IoT streaming sources to create knowledge graph, GoT; and implementing visualization for GoT system as a Live Knowledge Graph;</span><br>
<span>Website is available at <a href="http://graphofthings.org/">http://graphofthings.org/</a>.</span>
</div>
</div>
<div class="row">
<div class="col-sm-6 col-md-6 subpage-block" style="text-align: justify">
<h3>EVBCorpus - (2011-2013)</h3>
<span>EVBCorpus - A Multi-Layer English-Vietnamese Bilingual Corpus for Studying Tasks in Comparative Linguistics and Machine Translation. The EVBCopus contains over 20,000,000 words (20 million) from 15 bilingual books, 100 parallel English-Vietnamese / Vietnamese-English texts, 250 parallel law and ordinance texts, 5,000 news articles, and 2,000 film subtitles. The composition, annotation, encoding and availability of the corpus are meant to facilitate developments of language technology and studies in bilingual terminology extraction, primarily for the English-Vietnamese-English language pair.</span><br>
<span>The building EVBCorpus process includes four main steps: (1) collect data and align bitext at the paragraph level; (2) align bitext at the sentence level, (3) linguistic analysis and tagging; (4) annotate and correct corpus with toolkits. As result, the EVBCopus was aligned at the sentence level; and a part of this corpus containing 1,000 news articles was aligned semi-automatically at the word level.</span><br/>
For more information, please look at <a href="#evbcorpus">here</a>.
</div>
<div class="col-sm-6 col-md-6 subpage-block" style="text-align: justify">
<h3>BioCaster - (2006-2008)</h3>
<span><img src="images/BioCaster.png" style="float: right; width:250px; padding-left: 15px; padding-bottom: 15px;"> BioCaster is an ontology-based text mining system for detecting and tracking the distribution of infectious disease outbreaks from linguistic signals on the Web. The system continuously analyzes documents reported from over 1700 RSS feeds, classifies them for topical relevance and plots them onto a Google map using geocoded information. The system consists of four main stages: topic classification, named entity recognition (NER), disease/location detection and event recognition.</span><br>
<span><b>Main tasks:</b> Building geographical ontology, integrating geo-ontology into the Global Health Monitor system, and building the webpage for publishing research result.</span><br>
<span>The BioCaster map and ontology are freely available via a web portal at http://www.biocaster.org (live update until 2012).</span>
</div>
</div>
</section>
<!-- End of Research Subpage -->
<!-- Publication Subpage -->
<section class="pt-page pt-page-2" data-id="publication">
<div class="section-title-block">
<h2 class="section-title">Publications</h2>
<h5 class="section-description">Progestious papers</h5>
</div>
<div class="row">
<div class="col-sm-12 col-md-12 subpage-block">
<ol>
<li>Quoc Hung Ngo, Tahar Kechadi, and Nhien-An Le-Khac, (2022). <a href="https://www.sciencedirect.com/science/article/pii/S0168169922004446">Knowledge representation in digital agriculture: A step towards standardised model.</a> Computers and Electronics in Agriculture, vol. 199, August 2022, 107127, 2022, Elsevier. DOI: https://doi.org/10.1016/j.compag.2022.107127.</li>
<li>Quoc Hung Ngo, Tahar Kechadi, and Nhien-An Le-Khac, (2021). <a href="https://ieeexplore.ieee.org/document/9615052">Domain Specific Entity Recognition with Semantic-based Deep Learning Approach.</a> IEEE Access, vol. 9, pp. 152892-152902, 2021. DOI: 10.1109/ACCESS.2021.3128178.</li>
<li>Quoc Hung Ngo, Tahar Kechadi, and Nhien-An Le-Khac (2020). <a href="https://link.springer.com/content/pdf/10.1007%2F978-3-030-63924-2_14.pdf">OAK: Ontology-based Knowledge Map Model for Digital Agriculture.</a> In: Dang T.K., Küng J., Takizawa M., Chung T.M. (eds) Future Data and Security Engineering. FDSE 2020. Lecture Notes in Computer Science, vol 12466. Springer, Cham. pp. 245-259. DOI: 10.1007/978-3-030-63924-2_14.</li>
<li>Quoc Thai Nguyen, Thoai Linh Nguyen, Ngoc Hoang Luong, and Quoc Hung Ngo, (2020). <a href="https://ieeexplore.ieee.org/document/9335899">Fine-Tuning BERT for Sentiment Analysis of Vietnamese Reviews</a>. 2020 7th NAFOSTED Conference on Information and Computer Science (NICS 2020), IEEE, pp.302-307, DOI:10.1109/NICS51282.2020.9335899.</li>
<li>Quoc Hung Ngo, Nhien-An Le-Khac, and Tahar Kechadi (2019). <a href="https://link.springer.com/chapter/10.1007/978-3-030-34885-4_40">Predicting Soil pH by Using Nearest Fields</a>. In AI-2019 Thirty-ninth SGAI International Conference on Artificial Intelligence, Springer, LNAI 11927, pp. 480-486. DOI: 10.1007/978-3-030-34885-4_40.</li>
<li>Quoc Hung Ngo, Nhien-An Le-Khac, and Tahar Kechadi, (2018). <a href="https://link.springer.com/chapter/10.1007/978-3-030-03014-8_15">Ontology Based Approach for Precision Agriculture</a>. In International Conference on Multi-disciplinary Trends in Artificial Intelligence, pp. 175-186. Springer, LNCS, volume 11248, 2018. DOI: 10.1007/978-3-030-03014-8_15.</li>
<li>Song Nguyen Duc Cong, Quoc Hung Ngo, Rachsuda Jiamthapthaksin, (2016). <a href="http://ieeexplore.ieee.org/abstract/document/7852619/?reload=true">State-of-the-Art Vietnamese Word Segmentation</a>, In Proceedings of the 2017 International Conference on Science in Information Technology (ICSITech), pp. 119-124, IEEE Computer Society, 2016.</li>
<li>Danh Le-Phuoc, Hoan Nguyen Mau Quoc, Quoc Hung Ngo, Tuan Tran Nhat, Manfred Hauswirth, (2016). <a href="http://www.sciencedirect.com/science/article/pii/S1570826816000196">The Graph of Things: A step towards the Live Knowledge Graph of connected things</a>, Web Semantics: Science, Services and Agents on the World Wide Web 37, Volumes 37–38, March 2016, pp. 25–35.</li>
<li>Thuy Ngan Nguyen Luu, Quoc Hung Ngo, Quoc Minh Nghiem, (2016). Machine Translation, VNU-HCM Publishing House, 2016 (Vietnamese).</li>
<li>Danh Le Phuoc, Hoan Nguyen Mau Quoc, Quoc Hung Ngo, Tuan Tran Nhat, Manfred Hauswirth, (2014). <a href="http://citeseerx.ist.psu.edu/viewdoc/download;jsessionid=7328BECE186459AE6A317542749C8125?doi=10.1.1.651.1539&rep=rep1&type=pdf">Enabling Live Exploration on The Graph of Things</a>, Semantic Web Challenge 2014, Trento, Italia, 19-23 October 2014.</li>
<li>Quoc Hung Ngo, Dinh Dien, Werner Winiwarter, (2014). <a href="http://www.aclweb.org/anthology/W/W14/W14-5512.pdf">Building English-Vietnamese Named Entity Corpus with Aligned Bilingual News Articles</a>, In Proceedings of the 5th Workshop on South and Southeast Asian Natural Languages Processing (5th WSSANLP within the COLING2014). Association for Computational Linguistics, pp. 85-93, 2014.</li>
<li>Quoc Hung Ngo, Dinh Dien, Werner Winiwarter, (2013). <a href="http://ieeeexplore.com/xpl/articleDetails.jsp?arnumber=6720528">A Hybrid Method for Word Segmentation with English-Vietnamese Bilingual Text</a>, In Proceedings of the 2013 International Conference on Control, Automation and Information Sciences (ICCAIS2013), pp. 48-52, IEEE Computer Society, 2013.</li>
<li>Quoc Hung Ngo, Werner Winiwarter, Bartholomaus Wloka, (2013). <a href="http://aclweb.org/anthology//W/W13/W13-4301.pdf">EVBCorpus - A Multi-Layer English-Vietnamese Bilingual Corpus for Studying Tasks in Comparative Linguistics</a>, In Proceedings of the 11th Workshop on Asian Language Resources (ALR11), IJCNLP2013 Workshop, pp. 1-9. AFNLP, 2013.</li>
<li>Quoc Hung Ngo, Dinh Dien, Werner Winiwarter, (2012). <a href="https://aclanthology.org/W12-5019.pdf">Automatic Searching for English-Vietnamese Documents on the Internet</a>, In Proceedings of the 3rd Workshop on South and Southeast Asian Natural Languages Processing (3rd WSSANLP within the COLING2012), pp. 211-220. Association for Computational Linguistics, 2012.</li>
<li>Quoc Hung Ngo, Werner Winiwarter, (2012). <a href="https://ieeexplore.ieee.org/abstract/document/6473720/">Building an English-Vietnamese Bilingual Corpus for Machine Translation</a>, In Proceedings of the International Conference on Asian Language Processing 2012 (IALP 2012), IEEE Computer Society, pp. 157-160.</li>
<li>Quoc Hung Ngo, Son Doan, and Werner Winiwarter (2012). <a href="https://scholar.google.com/scholar?cluster=15441925723707852391">Using Wikipedia for Extracting Hierarchy and Building Geo-Ontology</a>, In the International Journal of Web Information Systems, Vol. 8, Issue 4, pp. 401-412.</li>
<li>Quoc Hung Ngo, Werner Winiwarter (2012). <a href="http://www.lrec-conf.org/proceedings/lrec2012/workshops/16.BUCC2012%20Proceedings.pdf#page=75">A Visualizing Annotation Tool for Semi-Automatically Building a Bilingual Corpus</a>, In Proceedings of the 5th Workshop on Building and Using Comparable Corpora, LREC2012 Workshop, pages 67-74. Association for Computational Linguistics, 2012.</li>
<li>Quoc Hung Ngo, Son Doan, and Werner Winiwarter (2011). <a href="https://dl.acm.org/doi/abs/10.1145/2095536.2095600">Building a Geographical Ontology by Using Wikipedia</a>. In Proceedings of the 13th International Conference on Information Integration and Web-based Applications & Services, pages 345-348. ACM, 2011.</li>
<li>Nigel Collier, Son Doan, Ai Kawazeo, Reiko Matsuda Goodwin, Mike Conway, Yoshio Tateno, Quoc Hung Ngo, Dinh Dien, Asanee Kawtrakul, Koichi Takeuchi, Mika Shigematsu, Kiyosu Taniguchi (2008), <a href="https://pubmed.ncbi.nlm.nih.gov/18922806/">BioCaster: detecting public health rumors with a Web-based text mining system</a>, Bioinformatics, Oxford University Press, DOI: 10.1093/bioinformatics/btn534.</li>
<li>Son Doan, Quoc Hung Ngo, Ai Kawazoe and Nigel Collier (2008), <a href="#">The Use of the BioCaster Ontology for Mapping Infectious Diseases and Locations in the BioCaster Surveillance System</a>, BioLINK 2008, Toronto, Canada, July 2008.</li>
<li>Son Doan, Quoc Hung Ngo, Nigel Collier (2008), <a href="files/HungNQ_2008_npre20082110-1.PDF">Building and Using Geographical Ontology in the BioCaster Biosurveillance System</a>, Workshop on Bio-Ontologies 2008: Knowledge in Biology, July 2008.</li>
<li>Son Doan, Quoc Hung Ngo, Ai Kawazoe and Nigel Collier (2008), <a href="https://www.aclweb.org/anthology/I08-2140.pdf">Global Health Monitor - A Web-based System for Detecting and Mapping Infectious Diseases</a>, In Proceedings of the International Joint Conference on Natural Language Processing (IJCNLP), Companion Volume, pp. 951-956.</li>
<li>Nigel Collier, Ai Kawazoe, Son Doan, Mika Shigematsu, Kiyosu Taniguchi, Lihua Jin, John McCrae, Hutchatai Chanlekha, Dinh Dien, Quoc Hung, Van Chi Nam, Koichi Takeuchi, DEng, Asanee Kawtrakul (2007), <a href="files/HungNQ_2007_Detecting_Web_Rumours_with_a_Multilingual_Ontology-Supported_Text_Classification_System.pdf">Detecting Web Rumours with a Multilingual Ontology-Supported Text Classification System</a>, In Proceedings of the 2007 International Society for Disease, Indianapolis, Indiana, United States, October 11th-12th.</li>
<li>Nigel Collier, Ai Kawazoe, Mika Shigematsu, Kiyosu Taniguchi, Lihua Jin, John McCrae, Dinh Dien, Quoc Hung, Koichi Takeuchi, Asanee Kawtrakul (2007), <a href="files/HungNQ_2008_Options_VI.pdf">Ontology-driven influenza surveillance from Web rumours</a>, In Proceedings of the 2007 Options for the Control of Influenza VI (Options), pp. 225-226, Toronto, Ontario, Canada, June 17th-23rd.</li>
<li>C.D. Vu Hoang, L. Nguyen Nguyen, Q. Hung Ngo, 2007, <a href="https://ieeexplore.ieee.org/abstract/document/4223084/">A Comparative Study on Vietnamese Text Classification Methods</a>. In Proceedings of the 5th International Conference on Research, Innovation& Vision for the Future, March 2007, Hanoi, Vietnam.</li>
<li>Q. Tri Tran, T. X. Thao Pham, Q. Hung Ngo, Dien Dinh and Nigel Collier (2007), <a href="https://www.nii.ac.jp/pi/n4/4_5.pdf">Named entity recognition in Vietnamese documents</a>, Progress in Informatics, No.4, March 2007, pp. 5-13.</li>
<li>Dinh Dien, Pham Phu Hoi, Ngo Quoc Hung. 2003. <a href="https://www.academia.edu/download/55487136/HungNQ_2003_3077663.pdf">Some Lexical Issues in Building Electronic Vietnamese Dictionary</a>, PAPILLON-2003 Workshop on Multilingual Lexical Databases, Hokkaido University, Japan.</li>
<li>D. Dien, H. Kiem, T. Ngan, X. Quang, Q. Hung, P. Hoi, V. Toan. 2002. Word alignment in English - Vietnamese bilingual corpus, In Proceedings of EALPIIT'02 (the 2nd East-Asian Language Processing and Internet Information Technology), Hanoi, Vietnam, Jan 2002, pp. 3-11</li>
</ol>
</div>
For more information, please look at <a href="https://scholar.google.com/citations?hl=en&user=arBhGkkAAAAJ">Scholar</a>.
</div>
</section>
<!-- End Publication Subpage -->
<!-- Funny Subpage -->
<section class="pt-page pt-page-3" data-id="evbcorpus">
<div class="section-title-block">
<h2 class="section-title">EVBCorpus</h2>
<h5 class="section-description">English-Vietnamese Bilingual Corpus</h5>
</div>
<div class="row">
<div class="col-sm-12 col-md-12 subpage-block">
<div class="general-info">
<p><b>EVBCorpus - A Multi-Layer English-Vietnamese Bilingual Corpus</b> for Studying Tasks in Comparative Linguistics and Machine Translation. The EVBCopus contains over 20,000,000 words (20 million) from 15 bilingual books, 100 parallel English-Vietnamese / Vietnamese-English texts, 250 parallel law and ordinance texts, 5,000 news articles, and 2,000 film subtitles. The composition, annotation, encoding and availability of the corpus are meant to facilitate developments of language technology and studies in bilingual terminology extraction, primarily for the English-Vietnamese-English language pair.</p>
<p>The building EVBCorpus process includes four main steps: (1) collect data and align bitext at the paragraph level; (2) align bitext at the sentence level, (3) linguistic analysis and tagging; (4) annotate and correct corpus with toolkits. As result, the EVBCopus was aligned at the sentence level; and a part of this corpus containing 1,000 news articles was aligned semi-automatically at the word level.</p>
<div>
<img src="images/new-icon.png" title="" width="30" /> Release EVBNews v.1.0 with 1,000 parallel documents, download at:
<a href="files/EVBCorpus_EVBNews_v1.0.rar">EVBCorpus_EVBNews_v1.0.rar</a>
</div>
<div>
<img src="images/new-icon.png" title="" width="30" /> Release EVBNews v.2.0 with 1,000 word aligned parallel documents, download at:
<a href="files/EVBCorpus_EVBNews_v2.0.rar">EVBCorpus_EVBNews_v2.0.rar</a>
</div>
<div>
<img src="images/new-icon.png" title="" width="30" /> <font color="#ff0000">Download</font>
<a href="files/EVBCorpus_EVBNews_v2.0.rar">EVBNews_v2.0</a>,
<a href="files/EVBCorpus_EVBNews_v1.0.rar">EVBNews_v1.0</a>,
<a href="files/EVBCorpus_EnVnNEguide_v1.1.pdf">EnVnNEguide_v1.1</a>,
<a href="files/GetWebContent.rar">GetWebContent</a>,
<a href="files/BitextCtrl.v.1.0_[sourcecode].rar">BitextCtrl.v.1.0 [sourcecode]</a>,
<a href="files/N0001.evbcorpusv3.html">N0001.evbcorpusv3.html</a>
</div>
<br/>
<div>If you are interested in the corpus, please email to <b>hungnq(at)uit.edu.vn</b> to have more details.</div>
<br>
<div><b>Details of Upgrade EVBCorpus v.2.0 (2018)</b><br>
<table style="font-size:13.3333px">
<tbody>
<tr>
<td><b>Source</b></td>
<td style="text-align:center"><b> Document </b></td>
<td style="text-align:center"><b> Paragraph </b></td>
<td style="text-align:center"><b> Sentence </b></td>
<td style="text-align:center"><b> Word </b></td>
</tr>
<tr>
<td colspan="5">------------------------------------------------------------------------------------------</td>
</tr>
<tr>
<td>En-Vn Books</td>
<td style="text-align:right">15</td>
<td style="text-align:right">14,195</td>
<td style="text-align:right">61,167</td>
<td style="text-align:right">1,335,180</td>
</tr>
<tr>
<td>En-Vn Fictions </td>
<td style="text-align:right">100</td>
<td style="text-align:right">192,898</td>
<td style="text-align:right">489,787</td>
<td style="text-align:right">6,129,161</td>
</tr>
<tr>
<td>En-Vn Laws</td>
<td style="text-align:right"> 250 </td>
<td style="text-align:right"> 86,848 </td>
<td style="text-align:right"> 98,064 </td>
<td style="text-align:right"> 1,981,932 </td>
</tr>
<tr>
<td><b>En-Vn ETests</b></td>
<td style="text-align:right"><b> 500 </b></td>
<td style="text-align:right"><b> 20,288 </b></td>
<td style="text-align:right"><b> 21,575 </b></td>
<td style="text-align:right"><b> 411,093 </b></td>
</tr>
<tr>
<td><b>En-Vn News</b></td>
<td style="text-align:right"><b> 5,000 </b></td>
<td style="text-align:right"><b> 94,933 </b></td>
<td style="text-align:right"><b> 173,903 </b></td>
<td style="text-align:right"><b> 2,965,590 </b></td>
</tr>
<tr>
<td><b>En-Vn Subtitles</b></td>
<td style="text-align:right"><b> 2,000 </b></td>
<td style="text-align:right"><b> 1,302,839 </b></td>
<td style="text-align:right"><b> 1,447,581 </b></td>
<td style="text-align:right"><b> 8,150,080 </b></td>
</tr>
<tr>
<td colspan="5">-------------------------------------------------------------------------------------------</td>
</tr>
<tr>
<td><b>Total</b></td>
<td style="text-align:right"><b>7,865</b></td>
<td style="text-align:right"><b>1,712,001</b></td>
<td style="text-align:right"><b>2,292,077</b></td>
<td style="text-align:right"><b>20,973,036</b></td>
</tr>
</tbody>
</table>
</div>
<br>
<div><b>Details of data sources of EVBCorpus v.1.0 (2012): </b><br/>
<table>
<tbody>
<tr>
<td style="text-align:left"><b> Source </b></td>
<td style="text-align:center"><b> Document </b></td>
<td style="text-align:center"><b> Paragraph </b></td>
<td style="text-align:center"><b> Sentence </b></td>
<td style="text-align:center"><b> Word </b></td>
</tr>
<tr>
<td colspan="5" style="text-align:left">-----------------------------------------------------------------------------------------</td>
</tr>
<tr>
<td style="text-align:left">En-Vn Books </td>
<td style="text-align:right"> 15</td>
<td style="text-align:right"> 13,980 </td>
<td style="text-align:right"> 80,323 </td>
<td style="text-align:right"> 1,375,492</td>
</tr>
<tr>
<td style="text-align:left">En-Vn Fictions</td>
<td style="text-align:right"> 100</td>
<td style="text-align:right"> 192,723</td>
<td style="text-align:right"> 491,703</td>
<td style="text-align:right"> 6,307,613</td>
</tr>
<tr>
<td style="text-align:left">En-Vn Laws</td>
<td style="text-align:right"> 250</td>
<td style="text-align:right"> 86,803</td>
<td style="text-align:right"> 98,102</td>
<td style="text-align:right"> 1,912,055</td>
</tr>
<tr>
<td style="text-align:left">En-Vn News</td>
<td style="text-align:right">1,000</td>
<td style="text-align:right">24,523</td>
<td style="text-align:right">45,531</td>
<td style="text-align:right">740,534</td>
</tr>
<tr>
<td colspan="5" style="text-align:left">-----------------------------------------------------------------------------------------</td>
</tr>
<tr>
<td style="text-align:left"><b>Total </b></td>
<td style="text-align:right"><b> 1,365 </b></td>
<td style="text-align:right"><b> 318,029</b></td>
<td style="text-align:right"><b> 715,659 </b></td>
<td style="text-align:right"><b>10,431,592</b></td>
</tr>
</tbody>
</table>
</div>
<br>
<div><b>English-Vietnamese Word Alignment Corpus (EVWACorpus)</b><br/>
The EVWACorpus contains 1,000 news articles with 45,531 sentence pairs and 740,534 words which are aligned manually at the word level between English and Vietnamese sentence.<br/>
<table>
<tbody>
<tr>
<td><br /></td>
<td style="text-align:center"><b> English </b></td>
<td style="text-align:center"><b> Vietnamese </b></td>
</tr>
<tr>
<td colspan="3" style="text-align:left">-------------------------------------------------------------</td>
</tr>
<tr>
<td style="text-align:left">Files </td>
<td style="text-align:right">1,000 </td>
<td style="text-align:right">1,000 </td>
</tr>
<tr>
<td style="text-align:left">Sentences </td>
<td style="text-align:right"> 45,531</td>
<td style="text-align:right">45,531</td>
</tr>
<tr>
<td style="text-align:left">Words </td>
<td style="text-align:right">740,534 </td>
<td style="text-align:right">832,441</td>
</tr>
<tr>
<td style="text-align:left">Sure Alignments</td>
<td style="text-align:right"> 447,906</td>
<td style="text-align:right"> 447,906</td>
</tr>
<tr>
<td style="text-align:left">Possible Alignments </td>
<td style="text-align:right">560,215 </td>
<td style="text-align:right"> 560,215</td>
</tr>
<tr>
<td style="text-align:left">Words in Alignments</td>
<td style="text-align:right"> 654,060</td>
<td style="text-align:right"> 768,031 </td>
</tr>
<tr>
<td colspan="3" style="text-align:left">-------------------------------------------------------------</td>
</tr>
</tbody>
</table>
</div>
<br>
<div><b>English-Vietnamese Chunker Corpus (EVChkCorpus)</b><br/>
The EVChkCorpus contains 1,000 news articles with 45,531 sentence pairs. It is tagged 5 raw chunker tags in both English and Vietnamese text. Details of the EVChkCorpus: <br/>
<table>
<tbody>
<tr>
<td style="text-align:right">#</td>
<td style="text-align:right"><br /></td>
<td style="text-align:right"><b> English </b></td>
<td style="text-align:right"><b> Vietnamese </b></td>
</tr>
<tr>
<td colspan="4" style="text-align:left">-----------------------------------------------------------------------</td>
</tr>
<tr>
<td style="text-align:right"><b>NP </b></td>
<td style="text-align:right"> Noun Phrase </td>
<td style="text-align:right"> 212,500 </td>
<td style="text-align:right"> 209,824</td>
</tr>
<tr>
<td style="text-align:right"><b> VP </b></td>
<td style="text-align:right"> Verb Phrase </td>
<td style="text-align:right"> 90,784 </td>
<td style="text-align:right"> 123,600</td>
</tr>
<tr>
<td style="text-align:right"><b>PP </b></td>
<td style="text-align:right"> Preposition Phrase </td>
<td style="text-align:right"> 79,853 </td>
<td style="text-align:right"> 70,457</td>
</tr>
<tr>
<td style="text-align:right"><b>ADVP </b></td>
<td style="text-align:right"> Adjective Phrase </td>
<td style="text-align:right"> 18,318 </td>
<td style="text-align:right"><br />
</td>
</tr>
<tr>
<td style="text-align:right"><b>ADJP </b></td>
<td style="text-align:right"> Adverb Phrase </td>
<td style="text-align:right"> 8,367 </td>
<td style="text-align:right"> 15,104</td>
</tr>
<tr>
<td colspan="4" style="text-align:left">-----------------------------------------------------------------------</td>
</tr>
</tbody>
</table>
</div>
<br>
<div><b>English-Vietnamese Named Entities Corpus (EVNECorpus)</b><br/>
The EVNECorpus contains 1,000 news articles with 45,531 sentence pairs. It is tagged named entities in both English and Vietnamese text. Details of the EVNECorpus: <br/>
<table>
<tbody>
<tr>
<td style="text-align:right"></td>
<td style="text-align:right"></td>
<td style="text-align:right"><b> English </b></td>
<td style="text-align:right"><b> Vietnamese </b></td>
</tr>
<tr>
<td colspan="4" style="text-align:left">-------------------------------------------------------------</td>
</tr>
<tr>
<td style="text-align:right"><b>LOC </b></td>
<td style="text-align:right"> Location</td>
<td style="text-align:right"> 10,115 </td>
<td style="text-align:right"> 10,006</td>
</tr>
<tr>
<td style="text-align:right"><b>PER </b></td>
<td style="text-align:right"> Person </td>
<td style="text-align:right"> 6,869 </td>
<td style="text-align:right"> 6,741</td>
</tr>
<tr>
<td style="text-align:right"><b>ORG </b></td>
<td style="text-align:right"> Oganization </td>
<td style="text-align:right"> 7,837</td>
<td style="text-align:right"> 7,549 </td>
</tr>
<tr>
<td style="text-align:right"><b>PCT </b></td>
<td style="text-align:right"> Percentage </td>
<td style="text-align:right"> 1,107 </td>
<td style="text-align:right"> 921</td>
</tr>
<tr>