Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Dataloader2 training loop example with torch text #670

Closed
wants to merge 1 commit into from

Commits on Jul 21, 2022

  1. Add Dataloader2 training loop example with torch text (pytorch#670)

    Summary:
    Pull Request resolved: pytorch#670
    
    * Add the uscase and example of DataLoader2 with open source datasets/datapipes.
    TorchText provides several standard NLP datasets; here we use its SST2 OSS dataset
     in the DataLoader2 train loop.
    
    * We will have more examples to showcase the advantages:
    (1) The usage of the DLv2 with popular open source dataset.
    (2) Integrate datasets/datapipes with different reading service.
    (3) Datapipe manipulation for example batch, collate, map.
    (4) Dist usage and examples with features such as sharding_filter for the sharding feature.
    (5) Eventually add those examples to the pytorch tutorials.
    
    Reviewed By: ejguan
    
    Differential Revision: D37938017
    
    fbshipit-source-id: e00c6f7af63b5a6d33ed138c563f4b20881b5ad6
    dahsh authored and facebook-github-bot committed Jul 21, 2022
    Configuration menu
    Copy the full SHA
    68e8e89 View commit details
    Browse the repository at this point in the history