Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Updating refereces from parquet-mr -> parquet-java #60

Merged

Conversation

vinooganesh
Copy link
Collaborator

@vinooganesh vinooganesh commented May 21, 2024

@alamb @wgtmac I put a very basic PR together to update some of the references on the website from parquet-mr to parquet-java. I only chose to do some because I think we have a few questions to figure out first:

  1. Are we going to change the published artifact name of parquet-mr to parquet-java or do we just want to keep publishing under mr?
  2. Do we want to actually "rewrite history" and update the past references (contributions, etc..) in the docs to refer to parquet-java instead? I'm not a fan of rewriting history but figured I'd start a conversation just in case people want to.
  3. Should we also do a sweep and update the contribution guidelines / release template? Per this thread https://lists.apache.org/thread/5oohcx3m16kqs8dmtl3vm1cgd8z0q10b.
  4. Should we introduce a new section of the blog called parquet-java (I had been hacking using the blog for releases) to add a note (assuming we change the name of the artifact) that things have changed?

@@ -18,14 +18,14 @@ The parquet-format repository hosts the official specification of the Apache Par
As a repository focused on specification, the parquet-format repository does not contain source code.


### parquet-mr
### parquet-java
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

maybe note here that this was previously referred to as parquet-mr due to the name of the repository (which has also been moved)?

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

i agree adding a note like this would be clearer

The parquet-java repository(previously named `parquet-mr`) is part of the Apache Parquet project and specifically focuses on providing Java tools for

@alamb
Copy link
Collaborator

alamb commented May 21, 2024

@alamb @wgtmac I put a very basic PR together to update some of the references on the website from parquet-mr to parquet-java. I only chose to do some because I think we have a few questions to figure out first:

  1. Are we going to change the published artifact name of parquet-mr to parquet-java or do we just want to keep publishing under mr?

I personally suggest not making this change unless there is a compelling usecase. It seems like it doesn't hurt to leave the artifacts as parquet-mr and would only cause downstream pain to update them now for very little gain

  1. Do we want to actually "rewrite history" and update the past references (contributions, etc..) in the docs to refer to parquet-java instead? I'm not a fan of rewriting history but figured I'd start a conversation just in case people want to.

I recommend against doing this, again on the justification of "what benefit would we get from it"?

  1. Should we also do a sweep and update the contribution guidelines to include the new ways to contribute?

I am not sure what this is asking

  1. Should we introduce a new section of the blog called parquet-java (I had been hacking using the blog for releases) to add a note (assuming we change the name of the artifact) that things have changed?

Maybe we could create a blog post announcing some of the recent changes / activity (e.g. discussion son V3 format, clarifications on repos, new website, etc).

Copy link
Collaborator

@alamb alamb left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me -- thank you @vinooganesh 🙏

@@ -63,7 +63,7 @@ You can now preview the site locally on http://localhost:1313/

To create documentation for a new release of `parquet-format` create a new <releaseNumber>.md file under `content/en/blog/parquet-format`. Please see existing files in that directory as an example.

To create documentation for a new release of `parquet-mr` create a new <releaseNumber>.md file under `content/en/blog/parquet-mr`. Please see existing files in that directory as an example.
To create documentation for a new release of `parquet-java` create a new <releaseNumber>.md file under `content/en/blog/parquet-java`. Please see existing files in that directory as an example.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I would personally suggest removing the discussion bout release of parquet-mr/ parquet-java to that repo. It seems confusing to have instructions on how to do a release from another repo in parquet-site

Copy link
Collaborator Author

@vinooganesh vinooganesh May 21, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ah I see the confusion - these notes have to do with updating the website announce the new release: https://parquet.apache.org/blog/. So the flow would be

  1. Make a release of parquet-java in that repo
  2. Put up a blog post entry on the website containing the release information

Happy to remove this if folks feel strongly - but was thinking it may be good to have some instructions on how to actually make the post.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't feel strongly

@@ -24,8 +24,8 @@
{{% /blocks/feature %}}


{{% blocks/feature icon="fab fa-github" title="Contributions welcome!" url="https://github.com/apache/parquet-mr" %}}
We do a [Pull Request](https://github.com/apache/parquet-mr/pulls) contributions workflow on **GitHub**. New users are always welcome!
{{% blocks/feature icon="fab fa-github" title="Contributions welcome!" url="https://github.com/apache/parquet-java" %}}
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As part of another PR perhaps we should revisit this link (perhaps it should link to parquet-format?) as again linking to the java implementation from the homepage might be more confusing than helpful

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

That's a great point. This was actually that was torn about the most when I first built the new site. I figured more people would want to contribute to mr than format (there's actual code in the former), so I went with mr everywhere. More than happy to revisit this as it was mostly just a guess on my part.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think it is something to revisit as a follow on PR

@@ -18,14 +18,14 @@ The parquet-format repository hosts the official specification of the Apache Par
As a repository focused on specification, the parquet-format repository does not contain source code.


### parquet-mr
### parquet-java
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

i agree adding a note like this would be clearer

The parquet-java repository(previously named `parquet-mr`) is part of the Apache Parquet project and specifically focuses on providing Java tools for

@vinooganesh
Copy link
Collaborator Author

Thanks @alamb!

Should we also do a sweep and update the contribution guidelines to include the new ways to contribute?
I am not sure what this is asking

Sorry -- this is a typo on my side. I meant to include a new contribution template (edited the message above). It was a response to this thread: https://lists.apache.org/thread/5oohcx3m16kqs8dmtl3vm1cgd8z0q10b.

It's probably worth having separate release announcement templates for parquet-format and parquet-java.

Copy link
Collaborator

@alamb alamb left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks again @vinooganesh

@emkornfield
Copy link
Contributor

LGTM, thank you @vinooganesh

@emkornfield emkornfield merged commit da1d9a8 into apache:production May 22, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants