-
Notifications
You must be signed in to change notification settings - Fork 26
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Added New York city's taxi and limousine commission trip yellow taxi support #151
Added New York city's taxi and limousine commission trip yellow taxi support #151
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks!
lib/datasets/tlc-trip-yellow-taxi.rb
Outdated
@metadata.url = "https://d37ci6vzurychx.cloudfront.net/trip-data" | ||
@metadata.licenses = [ | ||
{ | ||
name: "Public", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
name: "Public", | |
spdx_id: "CC0-1.0", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
fix: 5816dee Fixed metadata information
Although I was thinking I should use array style to express it like @metadata.licenses = ["CC0-1.0"]
, I couldn't find the license CC0-1.0
in NYC Open Data site so I used hash style.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I rechecked this. The dataset site doesn't mention CC0-1.0 or public domain as you said. Sorry. We should not it.
I found https://opendata.cityofnewyork.us/overview/#termsofuse instead.
How about the following?
@metadata.licenses = [
{
name: "NYC Open Data Terms of Use",
url: "https://opendata.cityofnewyork.us/overview/#termsofuse",
}
]
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
fix: ca35199 Thank you so much. It sounds perfect for me.🔆
Thank you for reviewing and giving me some advice . I dealt with all of them you pointed so far🙏 Would you help me to figure out why CI was failed on Ubuntu OS and what I should do in next step?
ref: https://github.com/red-data-tools/red-datasets/runs/8173499574?check_suite_focus=true |
test/tlc/test-yellow-taxi-trip.rb
Outdated
@@ -0,0 +1,59 @@ | |||
module TLC |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We don't need this module because we don't share our test and YellowTaxiTripTest
will not be conflicted.
If we want to add TLC
information to distinct from other test cases, class TLCYellowTaxiTripTest
to reduce indent.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We don't need this module because we don't share our test and YellowTaxiTripTest will not be conflicted.
Thank you. I understood it. I will rename it to TLCYellowTaxiTripTest
🙏
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
fix: c75357d
We need to prepare the Apache Arrow's APT repository for Ubuntu like https://github.com/red-data-tools/red-datasets-arrow/blob/master/.github/workflows/test.yml#L56-L64 Because Apache Arrow package isn't included in the official Ubuntu repository yet. |
We don't have to share our test and name of this test case won't be conflicted.
Apache Arrow package isn't included in the official Ubuntu repository yet. ref: red-data-tools#151 (comment)
Thank you for reviewing this PR. I dealt with all of them your comments so far🙏 |
Thanks! TODO: We should consider red-parquet dependency before we release a new version. |
… yellow taxi trip records (red-data-tools#151)" We decided not to use Apache Parquet data. Instead of it, we can use it by Red Datasets Parquet. The reason is why we don't want to add red-parquet dependency to red-datasets for easy to install. red-parquet is an extension library. So it's difficult to install rather than a pure Ruby library.
… yellow taxi trip records" (#154) ## What I did - Reverted #151 ### Reasons We decided not to use Apache Parquet data in Red Datasets because we don't want to add red-parquet dependency to red-datasets for easy to install. In details, red-parquet is an extension library. So it's difficult to install rather than a pure Ruby library. Instead of it, we created [Red Datasets Parquet](https://github.com/red-data-tools/red-datasets-parquet) to use Apache Parquet data.
Related Issue
What I did
Details
What I Didn't
What I checked
Notes for reviewers
Class Name
TLCTripYellowTaxi
orTLCTrip::YellowTaxi
or Should I addNYC
as a prefix