True Git Code Churn

A Python script to compute "true" code churn of a Git repository. Useful for software teams to openly help manage technical debt.

Code churn has several definitions, the one that to me provides the most value as a metric is:

"Code churn is when an engineer rewrites their own code in a short period of time."

Solutions that I've found online looked at changes to files irrespective whether these are new changes or edits to existing lines of code (LOC) within existing files. Hence this solution that segments line-of-code edits (churn) with new code changes (contribution).

How it works

This lightweight script looks at commits per author for a given date range on the current branch. For each commit it bookkeeps the files that were changed along with the LOC for each file. LOC are kept in a sparse structure and changes per LOC are taken into account as the program loops. When a change to the same LOC is detected it updates this separately to bookkeep the true code churn. Result is a print with aggregated contribution and churn per author for a given period in time.

Note: This includes the --no-merges flag as it assumes that merge commits with or without merge conflicts are not indicative of churn.

Usage

Positional (required) arguments:

dir include Git repository directory (specified as an absolute path)

Optional arguments:

--after after a certain date, in YYYY[-MM[-DD]] format
--before before a certain date, in YYYY[-MM[-DD]] format
--author author string (not a committer), leave blank to scope all authors
-exdir exclude Git repository subdirectory
--show-file-data show results per line per file result
--aggregate_file_data show results per file aggregated
--csv the resulting output is printed to the terminal formatted as CSV
-h, --h, --help show this help message and exit

Usage example with a specific author

python ./gitcodechurn.py /Users/myname/myrepo --after 2018-11-29 --before 2019-03-01 --author "an author"

Usage example without specifying an author

python ./gitcodechurn.py /Users/myname/myrepo --after 2018-11-29 --before "2019-03-01  -exdir excluded-directory

Usage example without specifying anything other than the folder of the repo

python ./gitcodechurn.py /Users/myname/myrepo

Outputs can be used as part of a pipeline (not included) that generates bar charts for reports.

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
gitcodechurn		gitcodechurn
.DS_Store		.DS_Store
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
MORE.md		MORE.md
README.md		README.md
gitcodechurn.py		gitcodechurn.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

True Git Code Churn

How it works

Usage

Usage example with a specific author

Usage example without specifying an author

Usage example without specifying anything other than the folder of the repo

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

syngenta/truegitcodechurn

Folders and files

Latest commit

History

Repository files navigation

True Git Code Churn

How it works

Usage

Usage example with a specific author

Usage example without specifying an author

Usage example without specifying anything other than the folder of the repo

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages