BGIT

Simple git clone in GO. BGIT can mean anything, depending on your mood.

Bastard-Git: if you think about it as just another useless implementation of GIT.
Boludos-Git: in honor of my beloved team from my company.
Bad-Git: if you just think it is bad, really bad.
Better-Git: if you're in a good mood and somehow you think this project helped you understand how git works.

Authors

Joao Lindgren
Add your clickable username here. It should point to your GitHub account.

Summary how it works

BGIT is a CLI tool pet project that mimics a small set of features from GIT. Despite covering only a small part of GIT's functionalities, the basis of how the version controller works was maintained. The idea is to simulate as much as possible what we have in GIT. Head, Index, Objects, etc.

As GIT does, BGIT stores the entire content/at each time under the ./bgit folder. When you commit file(s) BGIT will create a tree of objects of 3 types. A commit, one or more trees, and one or more blobs. The ref/head is updated to point to the recent commit. As it happens with GIT, each commit saves the entire state of the repo, not the diff, and differently from GIT we do not have special treatments for big files, so take care with them. BGIT should be able to reconstruct the working tree based on any commit at any time. In theory, for instance, cloning the repo is just copying the ./bgit folder and reconstructing the working tree from the index.

Pieces

Objects

Objects are saved under the /objects folder with their content compressed. Each object has a hash, and the first two parts of the hash are used to create a folder and the rest is the name of the saved file. store.go is responsible for saving/retrieving data from there. The command bgit cat-file <hash> can be used to retrieve the object content decompressed.

A commit has a parent commit(unless it is the first), a root tree, an author and a time. The commit hash is the message + author + time
The tree represents directories. Each tree (dir) can contain other trees or blobs. The tree has a description of each tree/blob along with file mode and hashs.
The blob represents a file at a specific point in time. The whole content is saved compressed and the hash is based on the content of the file.

HEAD

File pointing to the current branch

Refs/Heads

Each file represents one branch, and the branch is just a pointer to a commit (the content of the file is a commit hash)

Index

The file containing the state of the repo at the time. In real GIT the index is updated when the user adds a file to stage. BGIT only does that when the user commits. We can get the diff of files to add/modify/delete by comparing the index with the working tree.

Ignore files

If you want to ignore files, create a file called .bgitignore and add one relative path (folder or file) per line. Wildcards are not allowed atm.

How to run

run make build
set user

export BGIT_USER="email@gmail.com"

  make build

you can just copy the executable to a specific folder and use it, or you can set the export BGIT_TEST_REPO="srctest" and create inside the repo with the same name.

  mkdir $BGIT_TEST_REPO
  ./bgit init

run status to check and add a new file to commit

  ./bgit status
  /* add a file to folder defined in `BGIT_TEST_REPO` */
  ./bgit commit "your message"

What BGIT does not have (yet)

Concept of stage. In the real GIT, when the user does git add <filename>, it generates the blob object and adds an entry to the index, and this file starts to be tracked by GIT. You can even switch branches and this file will still be tracked. Here for simplicity, I did not implement the concept of stage. Any file added to the repo will be committed and if you switch branches without commting them, they will BE LOST.
.pack files. GIT has special treatments for big files, I did not dig deeper into it, but the idea is to optimize since regular files are always copied no matter how small were the changes. There is no concept of saving the diffs.
Parameters, a LOT of them. Each GIT command has MANY parameters to control every aspect of the command behavior. Here, for simplicity, we almost do not have parameters in the commands.
Config file. GIT uses global and local configs, which are basically files saved either on the local repo or in the user's home folder. BGIT for simplicity uses only the email which is set using the env variable BGIT_USER
Merge and Rebase
Remotes
Stash
Generate Patches
Add more things

To Be Implemented

reflog and bgit log command
reset
destroy branch
clone?

References and related links

https://medium.com/data-management-for-researchers/git-under-the-hood-part-1-object-storage-in-git-57c9adfb5e5f https://stackoverflow.com/questions/22968856/what-is-the-file-format-of-a-git-commit-object-data-structure https://www.youtube.com/watch?v=RxHJdapz2p0 (video)

https://stackoverflow.com/questions/15765366/how-does-git-track-file-changes-internally https://stackoverflow.com/questions/4084921/what-does-the-git-index-contain-exactly

https://www.freecodecamp.org/news/git-internals-objects-branches-create-repo/

https://benhoyt.com/writings/go-1brc/ https://github.com/git/git/blob/v2.21.1/commit.c https://www.freecodecamp.org/news/boost-programming-skills-read-git-code/ https://git-scm.com/book/en/v2/Git-Internals-Git-Objects https://www.freecodecamp.org/news/git-internals-objects-branches-create-repo/

About HEAD and all it means https://jvns.ca/blog/2024/03/08/how-head-works-in-git/

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
.github		.github
cmd		cmd
internal		internal
.gitignore		.gitignore
.goreleaser.yaml		.goreleaser.yaml
Makefile		Makefile
README.md		README.md
go.mod		go.mod
go.sum		go.sum
main.go		main.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BGIT

Authors

Summary how it works

Pieces

Objects

HEAD

Refs/Heads

Index

Ignore files

How to run

What BGIT does not have (yet)

To Be Implemented

References and related links

About

Releases 4

Packages

Languages

jplindgren/bastard-git

Folders and files

Latest commit

History

Repository files navigation

BGIT

Authors

Summary how it works

Pieces

Objects

HEAD

Refs/Heads

Index

Ignore files

How to run

What BGIT does not have (yet)

To Be Implemented

References and related links

About

Resources

Stars

Watchers

Forks

Releases 4

Packages 0

Languages

Packages