These guidelines are for all developers, whether occasional or regular. They have been sourced from various 'best practice' documents but primarily here.
This document assumes some working knowledge of git. We recommend Scott Chacon's excellent Pro Git a a valuable piece of background reading. The book is released under the Creative Commons license and can be downloaded in electronic form for free. At very least, we recommend that Chapter 2, Chapter 3 and Chapter 5 of Pro Git are read before proceeding.
JacORB uses http://github.com/JacORB/JacORB as its canonical repository, and this repository contains the stable code on master (currently JacORB 3.x).
Typically, only Project Admins would be able to push to this repo while all else may clone or fork this repo.
The project assumes one of 3 roles an individual may assume when interacting with the JacORB codebase. The three roles here are:
- Project Admin
- Frequent Contributor
- Occasional Contributor
This role defines a user who browses through the source code of the project and occasionally submits patches. Patches may be submitted in one of two ways:
- Attach a patch file to the Bugzila issue (JacORB Bug tracking system may be found here).
- Creating a pull request on GitHub
The approach a contributor chooses to use depends entirely his/her personal preference, but usually is tied to how the contributor accesses JacORB's source code. If the contributor directly clones the upstream repository, they should submit patch files. If the contributor instead uses their personal GitHub account to fork the upstream repository, then they are should issue a pull request.
💡 A GitHub pull request is the preferred method to submit a patch!
In this workflow, the contributor directly clones the upstream repository, makes changes to his local clone, adequately tests and commits his work with proper comments (more on commit comments later), and generates a patch. The patch should then be attached to the Bugzilla issue.
More information on generating patches suitable for attaching to a Bugzilla can be found in Chapter 5, Section 2 of Pro Git, under the section titled Public Large Project.
In this workflow, the contributor forks the JacORB upstream repository on GitHub, clones their fork, and makes changes to this private fork. When changes have been tested and are ready to be contributed back to the project, a pull request is issued via GitHub so that one of the Project Administrators can pull in the change.
Before you push your work onto your fork of the repository, it is often a good idea to review your commits. Consolidating them (squashing) or breaking them up as necessary and cleaning up commit messages should all be done while still working off your local clone. Also, prior to issuing a pull request, you should make sure you rebase your branch against the upstream branch you expect it to be merged into. Also, only submit pull requests for your branch - not for your master!
The section on Public Small Project in Chapter 5, Section 2 of Pro Git has more information on this style of workflow.
- Make sure your master is synced up with upstream. See this section for how to do this
- Create new branch for your topic and switch to it. For the example issue, BZ-1234:
git checkout -b BZ-12345 master
- Do your work. Test. Repeat
- Commit your work on your topic branch
- Push your topic branch to GitHub. For example:
git push origin BZ-12345
- Issue a pull request using the GitHub pull request system
- Once your pull request has been applied upstream, delete the topic branch both locally and on your fork. For example:
git branch -d BZ-12345 && git push origin :BZ-12345
- Sync with upstream again so that your changes now appear in your master branch
If your topic branch has been open for a while and you are afraid changes upstream may clash with your changes, it makes sense to rebase your topic branch before you issue a pull request. To do this:
- Sync your master branch with upstream
git checkout master
git pull upstream master
- Switch to your topic branch. For example:
git checkout BZ-12345
- Rebase your topic branch against master:
git rebase master
- During the rebase process you might need to fix conflicts;
- when you're done test your code again.
- Push your rebased topic branch to your repo on GitHub (you will likely need to force this with the -f option).
git push -f origin BZ-12345
- Continue your work on your topic branch.
Sometimes a feature/task is rather complex to implement and requires competence from multiple areas of the projects. In such occasions it is not uncommon for developers to coordinate feature implementation using personal forks of JacORB prior to finally issuing request to integrate into JacORB main repository on GitHub.
For example, developer A using his personal JacORB fork creates a topic branch T and completes as much work as he/she can before requesting for assistance from developer B. Developer A pushes topic T to his personal JacORB fork where developer B picks it up and brings it down to his local repo. Developer B then in turn completes necessary work, commits his/her changes on branch T, and finally pushes back T to his own personal fork. After issuing request for pull to developer A, developer B waits for notification that developer A integrated his changes. This exchange can be repeated as much as it is necessary and can involve multiple developers.
This example assumes that developer A and B have added each others JacORB forked repositories with the git add remote command. For example, developer B would add developer A's personal JacORB fork repository with the command
git remote add devA https://github.com/developerA/JacORB.git
- Developer A starts implementing feature BZ-244 and works on a local topic branch
bz244
Developer A pushesbz244
to personal JacORB fork. For example:
git push origin bz244
- Developer B fetches branch
bz244
to local repository. For example:
git fetch devA bz244:my_bz244
- Developer B works on local branch
my_bz244
- Developer B commits changes, pushes
my_bz244
to own fork.
git push origin my_bz244
- Developer B sends pull request to developer A to integrate changes from
my_bz244
tobz244
A frequent contributor will only ever submit patches via a pull requests. The pull request will be submitted via GitHub.
Frequent contributors should always fork the upstream project on GitHub and work off a clone of this fork. This is very similar to Creating a pull request on GitHub workflow used by a #Occasional Contributor.
Project Admins have a very limited role. Only Project Admins are allowed to push to upstream, and Project Admins never write any code directly on the upstream repository. All Project Admins do is pull in and merge changes from contributors (even if the "contributor" happens to be themselves) into upstream, perform code reviews and either commit or reject such changes.
This approach ensures JacORB maintains quality on the main code source tree, and allows for important code reviews to take place again ensuring quality. Further, it ensures clean and easily traceable code history and makes sure that more than one person knows about the changes being performed.
Patches submitted via Bugzilla are audited and promoted to the upstream repository as detailed above. A Project Admin would typically create a working branch to which the patch is applied and tested. The patch can be further modified, cleaned up, and commit messages made clearer if necessary. The branch should then be merged to the master or one of the maintenance branches before being pushed.
More information on applying patches can be found in Chapter 5, Section 3 of Pro Git, under Applying Patches From Email.
Project Admins are also responsible for responding to pull requests. The process is similar to applying a patch directly, except that when pulling in changes from a forked repository, more than a single commit may be pulled in. Again, this should be done on a newly created working branch, code reviewed, tested and cleaned up as necessary.
If commits need to be altered - e.g., rebasing to squash or split commits, or to alter commit messages - it is often better to contact the Contributor and ask the Contributor to do so and re-issue the pull request, since doing so on the upstream repo could cause update issues for contributors later on. If commits were altered or three-way merge was performed during a merge instead of fast-forward, it's also a good idea to check the log to make sure that the resulting repository history looks OK:
$ git log --pretty=oneline --graph --abbrev-commit # History messed up due to a bad merge
* 3005020 Merge branch 'BZ-786' of git://github.com/rnc/JacORB
|\
| * e757265 BZ-786 Make dependency to foobar optional <-- Same with cb4e5d6 - unnecessary
* | cb4e5d6 BZ-786 Make dependency to foobar optional <-- Cherry-picked commit by other admin
|/
* ...
$ git reset cb4e5d6 # revert the bad merge
It is therefore strongly recommended that you use the handle_pull_request script that ensures a clean merge. If you still wish to do this manually, please consider reading through the script first to get an idea of what needs to happen.
More information on pulling changes from remote, forked repos can be found in Chapter 5, Section 3 of Pro Git, under Checking Out Remote Branches .
- If you have warnings about "Merge made by recursive" you have to fix it rebasing.
- If you have warnings about "non-fast-forward" you have to rebase.
- If you see "non-fast-forward updates were rejected" you shall never use "force" on upstream! It means that another patch was merged before you and you have to update your master again, and rebase again.
- "force" is allowed only in special maintenance circumstances. If you find you're needing it to handle a pull request, then you're doing it wrong, and the mistake might be a dangerous one! It's like the good rule of never commit when you're drunk (coding is allowed).
🔴 Never use force on git push Using -f while pushing on a shared repository such as upstream you could effectively erase other committed patches. No-one shall ever use this option unless unanimously approved on the public mailing list: the most dangerous aspect of it is that nobody gets any notification if this happens, and we might think issues are solved but you silently removed the fix and it's history from the repository.
Releases can only be cut by Project Admins, and must be done off a recently updated (git fetch and git pull origin) clone of the upstream repo.
JacORB currently has a primary release branch of master and a maintenance branch used by Red Hat JBoss. In future further release branches may be created. (e.g. for maintenance purposes). Work should never be committed directly to a release branch; topic branches should always be used for work, and these topic branches should be merged in.
Some of the biggest features of git are speed and efficiency of branching, and accuracy of merging. As a result, best practices involve making frequent use of branches. Creating several topic branches a day, even, should not be considered excessive, and working on several topic branches simultaneously again should be commonplace.
Chapter 3, Section 4 of Pro Git has a detailed discussion of topic branches. For JacORB, it makes sense to create a topic branch and name it after the Bugzilla it corresponds to. (if it doesn't correspond to a Bugzilla, a simple but descriptive name should be used).
Most topic branches will only affect the a single release branch, so a topic branch should be created based off master. It may be that fixes apply to multiple branches. In this case, the following workflow should apply:
- Create topic branch off maintenance branch e.g. 2.3.x. For example:
git checkout -b <topic>_2.3.x 2.3.x
- Create topic branch off master. For example:
git checkout -b <topic>_master master
- Do your work on
<topic>_master
, test and commit your fixes - Switch to
<topic>_2.3.x
. For example:git checkout <topic>_2.3.x
- Cherry-pick your commit on
<topic>_master
onto<topic>_2.3.x
. For example:git cherry-pick <commit_id>
- Test
<topic>_2.3.x
for correctness, modify as necessary - Issue pull requests for both topic branches
It is extremely important that comments for each commit are clear and follow certain conventions. This allows for proper parsing of logs by git tools. Read this article on how to format comments for git and adhere to them. Further to the recommendations in the article, the short summary of the commit message should be in the following format:
BZ-XXX Subject line of the Bugzilla in question
This can optionally be followed by a detailed explanation of the commit. Why it was done, how much of it was completed, etc. You may wish to express this as a list, for example:
* Add a unit test
* Add more unit tests
* Fix regressions
* Solve major NP-Complete problems
Make sure however to split separate concerns - especially if they are unrelated - in separate commits.
Sometimes work on your topic branch may include several commits. For example, committing a test. Then committing another test. Then perhaps committing a fix. And perhaps fixing your own fix in the next commit... Before issuing a pull request for this topic branch, consider cleaning up these commits. Interactive rebasing helps you squash several commits into a single commit, which is often more coherent to deal with for others merging in your work. Chapter 6, Section 4 of Pro Git has details on how to squash commits and generally, clean up a series of commits before sharing this work with others. Note that you can also easily reorder them, just change the order of lines during the interactive rebase process.
Also, it is important to make sure you don't accidentally commit files for which no real changes have happened, but rather, whitespace has been modified. This often happens with some IDEs. git diff --check
should be run before you issue such a pull request, which will check for such "noise" commits and warn you accordingly. Such files should be reverted and not be committed to the branch.
Adhering to JacORB's code style guidelines will help minimise "noise" commits. Project Admins are going to ask contributors to reformat their code if necessary.
If you have a clone of the upstream, you may want to update it from time to time. Running:
$ git fetch origin
$ git fetch origin --tags
will often do the trick. You could then pull the specific branches you would need to update:
$ git checkout master
$ git pull origin master
$ git checkout 2.3.x
$ git pull origin 2.3.x
You should rebase your topic branches at this point so that they are up-to-date and when pulled by upstream, upstream can fast-forward the release branches:
$ git checkout <topic>_master
$ git rebase master
and/or
$ git checkout topic_2.3.x
$ git rebase 2.3.x
If you have a fork of upstream, you should probably define upstream as one of your remotes:
$ git remote add upstream git://github.com/JacORB/JacORB.git
You should now be able to fetch and pull changes from upstream into your local repository, though you should make sure you have no uncommitted changes. (You do use topic branches, right?)
$ git fetch upstream
$ git fetch upstream --tags
$ git checkout master
$ git pull upstream master
$ git push origin master
$ git checkout 2.3.x
$ git pull upstream 2.3.x
$ git push origin 2.3.x
A script can do this for you - have a look at sync_with_upstream.
Again, you should rebase your topic branches at this point so that they are up-to-date and when pulled by upstream, upstream can fast-forward the release branches:
$ git checkout topic_master
$ git rebase master
and/or
$ git checkout topic_2.3.x
$ git rebase 2.3.x
The sync_with_upstream script can do this for you if your topic branch naming conventions match the script.
There are many ways git usage may be enhanced:
- Autocompletion on the command line via Bash or ZSH
- Custom shell prompts to display GIT repository information (again, available in Bash & ZSH)
- Coloring GIT Terminal output
- Visual tools such as gitk, qgit, Eclipse integration, etc.
- etc.