Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update v0.7.0 release branch with master #886

Merged
merged 21 commits into from
Jul 16, 2020
Merged

Update v0.7.0 release branch with master #886

merged 21 commits into from
Jul 16, 2020

Conversation

zhangguanheng66
Copy link
Contributor

Let's test those recent changes in master branch first. We probably can cut the branch by July 20, 2020.
cc @cpuhrsch @Nayef211

seemethere and others added 19 commits June 24, 2020 11:53
Signed-off-by: Eli Uriegas <eliuriegas@fb.com>
* Adding new factory methods for vectors

* Added factory methods

* Updated test assets and implementations

* Debugging tests

* Finished tests for glove and fasttext

* Resolving PR comments

* Added checksum validation for fasttext and glove

* Fixed all tests

* Updated tests

* Benchmarking various vectors implementation

* Added script to compute fast text checksums

* Removed Json files

* updated to stovec without hash

* Testing optimizations

* Benchmarking various experiments

* Benchmarking

* Debugging dim error

* Updated loading vector from file

* Fixed dup token bug

* Removing unwanted changes

* Removing unwanted changes

* Delimiting by user specified char

* Fixed dupe check

* Fixing dupe line bug
* Adding dunder methods to cpp Vectors

* Fix length

* Fixed pytest errors

* Fixed stylecheck
* Updating serialization methods

* Profiling serialization

* Added 2d tensor support with dict

* Updating becnhmark and vectors

* Updated vectors for stovec implementation

* Cleaning up PR

* Cleaning up PR

* Updated tests and fixed lint

* Added new function for vectors from tokens

* Linting

* linting

* Using dict.insert_or_assign

* Removed get vectors by tokens

* Fixing style checks
* Adding new vocab class

* Wrote the cpp Vocab class

* Completed vocab class

* Fixing style check

* resolving PR comments

* Using c::10 dict

* Added tests for new functions

* Fixing bug with specials tuple

* Annotated function return types

* Using camel case for function names

* Resolving PR comments

* Fixing unit tests

* Updated method names

* Updated vocab with unk_token in specials and updated tests

* Updated docstring to show example for Vocab class
* Adding new vocab class

* Wrote the cpp Vocab class

* Completed vocab class

* Fixing style check

* resolving PR comments

* Using c::10 dict

* Added tests for new functions

* Added benchmarking code for vovab

* Fixing bug with specials tuple

* Added benchmarking code for vocab class

* Annotated function return types

* Using camel case for function names

* Resolving PR comments

* Fixing unit tests

* Updated method names

* Updated vocab with unk_token in specials and updated tests

* Updated docstring to show example for Vocab class
* minor doc update

* checkpoint

Co-authored-by: Guanheng Zhang <zhangguanheng@devfair0197.h2.fair>
* Updating serialization methods

* Profiling serialization

* Added 2d tensor support with dict

* Updating becnhmark and vectors

* Updated vectors for stovec implementation

* Cleaning up PR

* Cleaning up PR

* Updated tests and fixed lint

* Added new function for vectors from tokens

* Linting

* linting

* Using dict.insert_or_assign

* Updated function name
* Adding new vocab class

* Wrote the cpp Vocab class

* Completed vocab class

* Fixing style check

* resolving PR comments

* Using c::10 dict

* Added tests for new functions

* Fixing bug with specials tuple

* Added factory method for vocab class

* Annotated function return types

* Using camel case for function names

* Resolving PR comments

* Fixing unit tests

* Updated method names

* Updated vocab with unk_token in specials and updated tests

* Updated docstring to show example for Vocab class

* Updated documentation for factory method

* Resolving PR comments

* Disabling tests for Glove due to URL breakage

* Reneabling glove tests

* Updating test asset name
@codecov
Copy link

codecov bot commented Jul 16, 2020

Codecov Report

Merging #886 into release/0.7 will increase coverage by 0.10%.
The diff coverage is 89.72%.

Impacted file tree graph

@@               Coverage Diff               @@
##           release/0.7     #886      +/-   ##
===============================================
+ Coverage        76.91%   77.02%   +0.10%     
===============================================
  Files               42       44       +2     
  Lines             2946     3055     +109     
===============================================
+ Hits              2266     2353      +87     
- Misses             680      702      +22     
Impacted Files Coverage Δ
torchtext/nn/modules/__init__.py 100.00% <ø> (ø)
torchtext/experimental/vectors.py 74.40% <74.46%> (-13.57%) ⬇️
torchtext/data/example.py 85.71% <83.33%> (-0.21%) ⬇️
torchtext/nn/modules/multiheadattention.py 92.40% <85.71%> (ø)
torchtext/__init__.py 88.00% <100.00%> (ø)
torchtext/data/batch.py 71.42% <100.00%> (+1.05%) ⬆️
torchtext/data/field.py 92.83% <100.00%> (+0.14%) ⬆️
torchtext/data/iterator.py 62.66% <100.00%> (+0.50%) ⬆️
torchtext/experimental/datasets/translation.py 76.81% <100.00%> (ø)
torchtext/experimental/vocab.py 100.00% <100.00%> (ø)
... and 4 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 847460a...3f433ba. Read the comment docs.

@seemethere
Copy link
Member

Are unittests expected to fail here?

@zhangguanheng66
Copy link
Contributor Author

Are unittests expected to fail here?

Glove link was broken due to the upstream dependency. @Nayef211 should we disable the tests again?

@seemethere
Copy link
Member

Yeah I'm not sure I'd be comfortable tagging a new version with failing tests.

@zhangguanheng66
Copy link
Contributor Author

Yeah I'm not sure I'd be comfortable tagging a new version with failing tests.

OK, let me send out a PR to disable those test.

@zhangguanheng66
Copy link
Contributor Author

#888

@seemethere seemethere merged commit fb6fbb7 into release/0.7 Jul 16, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants