Skip to content

We explore the application of large language models for topic classification within a low-resource German web environment, leveraging a dataset comprising millions of scraped webpages aimed at evaluating policy impacts.

Notifications You must be signed in to change notification settings

julianschelb/Topic-Classification

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Topic Classification of German Web Data

We explore the application of large language models for topic classification within a low-resource German web environment, leveraging a dataset comprising millions of scraped webpages aimed at evaluating policy impacts.

Poster: alt text

Citation:

@inproceedings{schelb-etal-2024-assessing,
    title = "Assessing In-context Learning and Fine-tuning for Topic Classification of {G}erman Web Data",
    author = "Schelb, Julian  and
      Spitz, Andreas  and
      Ulloa, Roberto",
    booktitle = "Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop)",
    month = aug,
    year = "2024",
    address = "Bangkok, Thailand",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2024.acl-srw.22",
    pages = "238--252",
}

About

We explore the application of large language models for topic classification within a low-resource German web environment, leveraging a dataset comprising millions of scraped webpages aimed at evaluating policy impacts.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published