TorchText for Thai datasets #440

Closed
p16i opened this issue Jun 25, 2020 · 1 comment
Labels
enhancement enhance functionalities

Comments

@p16i
Contributor

p16i commented Jun 25, 2020

I recently found TorchText, a module that provides various utility functions for NLP. One thing I like is that it offers a very convenient way to download datasets. Please see: https://pytorch.org/text/datasets.html.

As we already have several datasets and rely on PyTorch, it might be useful to implement TorchText support for those datasets. We could start with the following:

  • Wisesight
  • TrueVoice
  • BEST-2010
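To make the idea concrete, here is a minimal sketch of a TorchText-style loader: download a file once, cache it, and yield `(label, text)` pairs. It uses only the standard library. The function name `tsv_dataset`, the `.data` cache directory, and the two-column tab-separated layout are assumptions for illustration, not the actual format of Wisesight, TrueVoice, or BEST-2010.

```python
import csv
import urllib.request
from pathlib import Path
from typing import Iterator, Tuple


def tsv_dataset(url: str, root: str = ".data",
                filename: str = "data.tsv") -> Iterator[Tuple[str, str]]:
    """Download `url` into `root` once, then yield (label, text) pairs.

    Hypothetical helper: assumes a UTF-8 file with two tab-separated
    columns (label, text). Real datasets will need their own parsing.
    """
    cache = Path(root) / filename
    if not cache.exists():
        # First call: fetch the file and keep it for later runs.
        cache.parent.mkdir(parents=True, exist_ok=True)
        urllib.request.urlretrieve(url, str(cache))
    with cache.open(encoding="utf-8", newline="") as f:
        # QUOTE_NONE keeps quotation marks inside Thai text untouched.
        for row in csv.reader(f, delimiter="\t", quoting=csv.QUOTE_NONE):
            if len(row) == 2:  # skip blank or malformed lines
                yield row[0], row[1]
```

A caller would iterate over it like any generator, for example `for label, text in tsv_dataset(some_url): ...`, where `some_url` points at wherever the dataset file is hosted.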
@bact added the enhancement (enhance functionalities) label on Jan 7, 2021
@wannaphong
Member

I think we can now load Thai datasets from Hugging Face, so I think we should close this issue.
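For reference, a rough sketch of what loading through the Hugging Face `datasets` library could look like. The wrapper name `load_thai_dataset` and the `HUB_IDS` mapping are hypothetical, and the Hub identifier `wisesight_sentiment` is an assumption that should be checked on the Hub, since dataset names and namespaces can change.

```python
from typing import Any, Callable, Optional

# Assumed Hub identifier; verify it on the Hugging Face Hub before use.
HUB_IDS = {"wisesight": "wisesight_sentiment"}


def load_thai_dataset(name: str, split: str = "train",
                      loader: Optional[Callable[..., Any]] = None) -> Any:
    """Load one split of a known Thai dataset (hypothetical wrapper).

    `loader` defaults to `datasets.load_dataset`; it is injectable so
    the wrapper can be exercised without network access.
    """
    if name not in HUB_IDS:
        raise KeyError(f"unknown dataset: {name!r}")
    if loader is None:
        # Imported lazily so the module works without `datasets` installed.
        from datasets import load_dataset
        loader = load_dataset
    return loader(HUB_IDS[name], split=split)
```

With `datasets` installed and network access, `load_thai_dataset("wisesight")` would request the training split from the Hub.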

3 participants