Skip to content

This issue was moved to a discussion.

You can continue the conversation there. Go to discussion →

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Idea] Do stars or forks influence the statistics for Activity or Influence? #530

Closed
TommyLemon opened this issue Nov 19, 2022 · 3 comments

Comments

@TommyLemon
Copy link

TommyLemon commented Nov 19, 2022

Description

It doesn't seems to be that because I noticed that some famous and popular repos have small numbers.
image
image
image
image

And some of the numbers are even smaller than that of some new repo(eg: hypertrons-crx, answer).
image
image

And fastjson2 has much higher hypertrons-crx's Influence than that of fastjson, this doesn't make sense too.
image
image

Well, I think the stars and forks should also be factors for the statistics of hypertrons-crx's Influence due to they are also valuable. Yet they are much easier to get than PRs/issues. And you know many Chinese users haven't get used to use GitHub Issue, while they often chat(report bugs, give sugguestions, etc) in Wechat/QQ groups, and this results in smaller numbers of hypertrons-crx's Influence for many famous and popular Chinese repos when they are compared to American repos that have similar actual influence.
And also, an account that relates to a famous/big company/organization should influence more than other accounts.
Eg: If a Google employee starred a repo, the star he/she gave is more valuable than an account that relates to Baidu or even not relates to any company/organization.

https://ossinsight.io/analyze/Tencent/APIJSON

FA2F41AC89A553D6B88F5E1D1D83E35C

@tyn1998
Copy link
Member

tyn1998 commented Nov 20, 2022

It doesn't seems to be that because I noticed that some famous and popular repos have small numbers.

Yes, forks and stars at the moment have no influence on the caculation of Activity and Influence.

Well, I think the stars and forks should also be factors for the statistics of hypertrons-crx's Influence due to they are also valuable. Yet they are much easier to get than PRs/issues. And you know many Chinese users haven't get used to use GitHub Issue, while they often chat(report bugs, give sugguestions, etc) in Wechat/QQ groups, and this results in smaller numbers of hypertrons-crx's Influence for many famous and popular Chinese repos when they are compared to American repos that have similar actual influence.

Good point of view! Hi @frank-zsy, how do you think about this?

And also, an account that relates to a famous/big company/organization should influence more than other accounts.
Eg: If a Google employee starred a repo, the star he/she gave is more valuable than an account that relates to Baidu or even not relates to any company/organization.

That is a good advice!

Hi @TommyLemon, if you are interested in definitions of the metrics, you can visit OpenDigger. You can get more information there :-)

@frank-zsy
Copy link
Contributor

@TommyLemon Thanks for the discussion.

Metrics of open source repos are used for certain purpose. For activity and influence, I constrain the data to collaboration events, so stars and forks are not included since they are not really collaboration events.

Specially for influence, this blog tells how to calculate the result by collaboration network which is used to measure the developers eco-system influence of a certain repo for a certain time period only with GitHub data.

Actually influence is not a good expression since it means eigenvector centrality in graph theory so it can be implemented in different ways with different data. In my own research I also use open source package dependency and developers' social relationships to construct the network. So we may change influence into OpenRank in the future to avoid misunderstanding.

And also, an account that relates to a famous/big company/organization should influence more than other accounts.
Eg: If a Google employee starred a repo, the star he/she gave is more valuable than an account that relates to Baidu or even not relates to any company/organization.

It is true, and for OpenRank, actually the result is calculated by relationship data and a PageRank like algorithm, so the importance of each node(repo or developer) is different due to the position in the network, but still it is only in collaboration level.

@frank-zsy
Copy link
Contributor

@TommyLemon And I am not really sure about how OSSInsight finds out the affiliation data of each developer, but there are really some legal risks to use the privacy data of developers like company affiliation or location.

@hypertrons hypertrons locked and limited conversation to collaborators Feb 12, 2023
@tyn1998 tyn1998 converted this issue into discussion #579 Feb 12, 2023

This issue was moved to a discussion.

You can continue the conversation there. Go to discussion →

Projects
None yet
Development

No branches or pull requests

3 participants