Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix the first contributor filter #891

Closed
1 task done
orhun opened this issue Sep 27, 2024 · 12 comments · Fixed by #925
Closed
1 task done

Fix the first contributor filter #891

orhun opened this issue Sep 27, 2024 · 12 comments · Fixed by #925
Labels
bug Something isn't working help wanted Extra attention is needed integration Related to remote integration

Comments

@orhun
Copy link
Owner

orhun commented Sep 27, 2024

Is there an existing issue for this?

  • I have searched the existing issues

Description of the bug

In the following scenario:

  1. User foo contributes for the first time.
  2. New release is created and changelog is generated.
  3. User foo contributes after the release.
  4. New release is created and changelog is generated BUT user foo is removed from the first contributors section from the previous release.

Might be related to #825

Steps To Reproduce

See above.

Expected behavior

This happens due to not checking the dates of the new commits while updating the release metadata.

// mark contributors as first-time
self.$remote.contributors = contributors
.into_iter()
.map(|mut v| {
v.is_first_time = !commits
.iter()
.map(|v| v.username())
.any(|login| login == v.username);
v
})
.collect();

Here, we check all the commits, but you see, in the new release user foo already contributed, so his contribution is not marked as first time anymore.

To fix it, we should simply:

diff --git a/git-cliff-core/src/remote/mod.rs b/git-cliff-core/src/remote/mod.rs
index 77d14e7..32e61a9 100644
--- a/git-cliff-core/src/remote/mod.rs
+++ b/git-cliff-core/src/remote/mod.rs
@@ -83,6 +83,8 @@ pub trait RemoteCommit: DynClone {
 	fn id(&self) -> String;
 	/// Commit author.
 	fn username(&self) -> Option<String>;
+	/// Timestamp.
+	fn timestamp(&self) -> Option<i64>;
 }
 
 dyn_clone::clone_trait_object!(RemoteCommit);
@@ -340,6 +342,9 @@ macro_rules! update_release_metadata {
 					.map(|mut v| {
 						v.is_first_time = !commits
 							.iter()
+							.filter(|commit| {
+								commit.timestamp() < Some(self.timestamp)
+							})
 							.map(|v| v.username())
 							.any(|login| login == v.username);
 						v

Apply the patch above and you will see that we are missing the implementation for GitHub, GitLab, etc. We need to check each remote and add the appropriate field for serializing the timestamp from their API response.

Screenshots / Logs

nA

Software information

nA

Additional context

No response

@orhun orhun added bug Something isn't working good first issue Good for newcomers labels Sep 27, 2024
@jaiakash
Copy link

Hi @orhun can i work on this?

@orhun
Copy link
Owner Author

orhun commented Oct 1, 2024

please do! @jaiakash

@orhun
Copy link
Owner Author

orhun commented Oct 6, 2024

any progress/ETA about this @jaiakash? I think this is affecting more users than I was expecting #906

@orhun orhun added the integration Related to remote integration label Oct 6, 2024
@mcwarman
Copy link
Contributor

mcwarman commented Oct 7, 2024

I tested the suggested fix on mcwarman-playground/git-cliff-readme-example.

And something isn't quite correct in the logic, it repots orhun as new contributor again:

## What's Changed
* feat(config): support multiple file formats by @orhun <orhun@archlinux.org>
* feat(cache): use cache while fetching pages by @orhun <orhun@archlinux.org>
* feat(ci): add pipeline config by @mcwarman <7236500+mcwarman@users.noreply.github.com> in #1

### New Contributors
* @mcwarman <7236500+mcwarman@users.noreply.github.com> made their first contribution in [#1](https://bitbucket.org/mcwarman-playground/git-cliff-readme-example/pull-requests/1)
* @orhun <orhun@archlinux.org> made their first contribution

## What's Changed in v1.0.1
* refactor(parser): expose string functions by @orhun <orhun@archlinux.org>
* chore(release): add release script by @orhun <orhun@archlinux.org>

## What's Changed in v1.0.0
* Initial commit by @orhun <orhun@archlinux.org>
* docs(project): add README.md by @orhun <orhun@archlinux.org>
* feat(parser): add ability to parse arrays by @orhun <orhun@archlinux.org>
* fix(args): rename help argument due to conflict by @orhun <orhun@archlinux.org>
* docs(example)!: add tested usage example by @orhun <orhun@archlinux.org>

### New Contributors
* @orhun <orhun@archlinux.org> made their first contribution

<!-- generated by -cliff -->
diff --git a/git-cliff-core/src/remote/bitbucket.rs b/git-cliff-core/src/remote/bitbucket.rs
index a3e3603..26ef85a 100644
--- a/git-cliff-core/src/remote/bitbucket.rs
+++ b/git-cliff-core/src/remote/bitbucket.rs
@@ -1,5 +1,6 @@
 use crate::config::Remote;
 use crate::error::*;
+use chrono::DateTime;
 use reqwest_middleware::ClientWithMiddleware;
 use serde::{
        Deserialize,
@@ -33,6 +34,8 @@ pub(crate) const BITBUCKET_MAX_PAGE_PRS: usize = 50;
 pub struct BitbucketCommit {
        /// SHA.
        pub hash:   String,
+       /// SHA.
+       pub date:   String,
        /// Author of the commit.
        pub author: Option<BitbucketCommitAuthor>,
 }
@@ -45,6 +48,15 @@ impl RemoteCommit for BitbucketCommit {
        fn username(&self) -> Option<String> {
                self.author.clone().and_then(|v| v.login)
        }
+
+       fn timestamp(&self) -> Option<i64> {
+               debug!("{} {}", self.hash.clone(), self.date.clone());
+               Some(
+                       DateTime::parse_from_rfc3339(&self.date)
+                               .unwrap()
+                               .timestamp(),
+               )
+       }
 }
diff --git a/git-cliff-core/src/remote/mod.rs b/git-cliff-core/src/remote/mod.rs
index a79137b..d4a9681 100644
--- a/git-cliff-core/src/remote/mod.rs
+++ b/git-cliff-core/src/remote/mod.rs
@@ -82,6 +82,8 @@ pub trait RemoteCommit: DynClone {
        fn id(&self) -> String;
        /// Commit author.
        fn username(&self) -> Option<String>;
+       /// Timestamp.
+       fn timestamp(&self) -> Option<i64>;
 }

 dyn_clone::clone_trait_object!(RemoteCommit);
@@ -342,6 +344,9 @@ macro_rules! update_release_metadata {
                                        .map(|mut v| {
                                                v.is_first_time = !commits
                                                        .iter()
+                                                       .filter(|commit| {
+                                                               commit.timestamp() < Some(self.timestamp)
+                                                       })
                                                        .map(|v| v.username())
                                                        .any(|login| login == v.username);
                                                v

@orhun
Copy link
Owner Author

orhun commented Oct 8, 2024

I guess we still need to check the past releases... the logic here boggles my mind a bit.

  • We are filtering commits by timestamp to exclude those that occurred before the current release's timestamp.
  • However, this doesn't fully account for the fact that is_first_time is being evaluated globally, not per release.

🤔

@mcwarman
Copy link
Contributor

mcwarman commented Oct 8, 2024

I wasn't sure if it was some how not filtering the username correctly, and mcwarman first time contribution was marking both contributors as true.

The only reason i thought it might be this is because v1.0.1 doesn't have a new contributors section in again.

I now understand this a little bit more after some debugging.

@mcwarman
Copy link
Contributor

mcwarman commented Oct 8, 2024

Not sure if something like this works:

.filter(|commit| {
	//unreleased
	self.timestamp == 0 ||
		commit.timestamp() < Some(self.timestamp)
})

@orhun
Copy link
Owner Author

orhun commented Oct 8, 2024

Yes, I think something like that might work. Are you getting the correct results with it?

@mcwarman
Copy link
Contributor

mcwarman commented Oct 9, 2024

For the Bitbucket it was, but possibly more by the luck

The approach isn't flawless, because I realised self.timestamp is the tag timestamp, and that differs to the commit related to that tag timestamp so the filter doesn't do the 100% the correct thing but gets you close.

@orhun
Copy link
Owner Author

orhun commented Oct 14, 2024

Hmm I see.. Either way, can you create a PR with your changes? I think anything that gets us to a more accurate result is welcome :)

@orhun orhun added help wanted Extra attention is needed and removed good first issue Good for newcomers labels Oct 14, 2024
@orhun
Copy link
Owner Author

orhun commented Oct 14, 2024

Also I just updated the issue labels. This turned out to be more difficult than I expected. I'm still not sure about the logic above.

@mcwarman
Copy link
Contributor

mcwarman commented Oct 19, 2024

I think this works:

  • Store the remote commit timestamp of the current release (while iterating)
  • Use that timestamp to perform the filter
diff --git a/git-cliff-core/src/remote/mod.rs b/git-cliff-core/src/remote/mod.rs
index a79137b..d83fe28 100644
--- a/git-cliff-core/src/remote/mod.rs
+++ b/git-cliff-core/src/remote/mod.rs
@@ -82,6 +82,8 @@ pub trait RemoteCommit: DynClone {
 	fn id(&self) -> String;
 	/// Commit author.
 	fn username(&self) -> Option<String>;
+	/// Timestamp.
+	fn timestamp(&self) -> Option<i64>;
 }
 
 dyn_clone::clone_trait_object!(RemoteCommit);
@@ -299,6 +301,7 @@ macro_rules! update_release_metadata {
 				pull_requests: Vec<Box<dyn RemotePullRequest>>,
 			) -> Result<()> {
 				let mut contributors: Vec<RemoteContributor> = Vec::new();
+				let mut release_commit_timestamp: Option<i64> = None;
 				// retain the commits that are not a part of this release for later
 				// on checking the first contributors.
 				commits.retain(|v| {
@@ -331,6 +334,11 @@ macro_rules! update_release_metadata {
 							});
 						}
 						commit.remote = Some(commit.$remote.clone());
+						// if remote commit is the release commit store timestamp for
+						// use in calculation of first time
+						if Some(v.id().clone()) == self.commit_id {
+							release_commit_timestamp = v.timestamp().clone();
+						}
 						false
 					} else {
 						true
@@ -342,6 +350,12 @@ macro_rules! update_release_metadata {
 					.map(|mut v| {
 						v.is_first_time = !commits
 							.iter()
+							.filter(|commit| {
+								// if current release is unreleased no need to filter
+								// commits or filter commits that are from
+								// newer releases
+								self.timestamp == 0 ||
+									commit.timestamp() < release_commit_timestamp
+							})
 							.map(|v| v.username())
 							.any(|login| login == v.username);
 						v

Although to make it work bitbucket I had to override the endpoint because of #906

## What's Changed
* feat(config): support multiple file formats by @orhun <orhun@archlinux.org>
* feat(cache): use cache while fetching pages by @orhun <orhun@archlinux.org>
* feat(ci): add pipeline config by @mcwarman <7236500+mcwarman@users.noreply.github.com> in #1

### New Contributors
* @mcwarman <7236500+mcwarman@users.noreply.github.com> made their first contribution in [#1](https://bitbucket.org/mcwarman-playground/git-cliff-readme-example/pull-requests/1)

## What's Changed in v1.0.1
* refactor(parser): expose string functions by @orhun <orhun@archlinux.org>
* chore(release): add release script by @orhun <orhun@archlinux.org>

## What's Changed in v1.0.0
* Initial commit by @orhun <orhun@archlinux.org>
* docs(project): add README.md by @orhun <orhun@archlinux.org>
* feat(parser): add ability to parse arrays by @orhun <orhun@archlinux.org>
* fix(args): rename help argument due to conflict by @orhun <orhun@archlinux.org>
* docs(example)!: add tested usage example by @orhun <orhun@archlinux.org>

### New Contributors
* @orhun <orhun@archlinux.org> made their first contribution

<!-- generated by -cliff -->

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working help wanted Extra attention is needed integration Related to remote integration
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants