Build Systems
Codebases for scientific projects are usually on the smaller end of the spectrum. They also usually have small development teams. Because they have small teams they have a low bus factor and that cause codebases to fall away quickly when core members move on.
So to solve this problem you could write down a bunch of shell commands used to install or run your code. Or you could just use GNU Make
Intro to Make
make
is a utility that allows you to declare rules to create the
files(targets) that you need.
Here's the rough format:
target ... : prerequisites ...
command
...
...
Here's a few example rules:
# A rule - this rule will only run if file0.txt doesn't exist.
file0.txt:
echo "foo" > file0.txt
# Even comments in these 'recipe' sections get passed to the shell.
# Try `make file0.txt` or simply `make` - first rule is the default.
# This rule will only run if file0.txt is newer than file1.txt.
file1.txt: file0.txt
cat file0.txt > file1.txt
# use the same quoting rules as in the shell.
@cat file0.txt >> file1.txt
# @ stops the command from being echoed to stdout.
-@echo 'hello'
# - means that make will keep going in the case of an error.
# Try `make file1.txt` on the commandline.
# A rule can have multiple targets and multiple prerequisites
file2.txt file3.txt: file0.txt file1.txt
touch file2.txt
touch file3.txt
# A phony target. Any target that isn't a file.
# It will never be up to date so make will always try to run it.
clean:
rm file*.txt
Let's throw this in a Makefile and play around with it.
warning
Makefiles use tabs and not spaces
- Try running
make
- Try running
make file1.txt
- Try running
make clean
- Try running
make file2.txt
How does it work?
make
works by reading a Makefilewhich describes:- Files to be created
- Their dependencies
- Instructions to create the specified files
Makefiles describe a DAG of the dependencies
- Directed Acyclic Graph
- circular dependencies are dealt with by dropping the most recently discovered dependency and forging ahead (more later)
make
then works its way (using post-order traversal) up the dependency graph building files until the goal file is up-to-date- Only creates outputs whose dependencies are newer than the goal file itself
Let's commit the changes!
Creating a branch
First, let's learn to make a branch. Here's a visual representation of branches
A branch represents an independent line of development. Branches serve as an abstraction for the edit/stage/commit process. You can think of them as a way to request a brand new working directory, staging area, and project history. New commits are recorded in the history for the current branch, which results in a fork in the history of the project. 3
There are a few ways to create a branch using VS Code. Let's use Ctrl+Shift+P
and search for Git: Create Branch from
. Then use the name make
and then select head
or master
.
Ctrl+Shift+P
- Search for
Git: Create Branch from
. - Use
make
for the name of the branch. - Create the branch from
master
.
.gitignore
But we have a bunch of build artifacts in our directory. These shouldn't be added to the git repo, because they aren't code(A source of truth), they're something the code creates(A side effect).
You might notice that git picked up all of the txt files. A special file called
.gitignore
will tell git to ignore them and avoid tracking their history. Create it and throw the following the file
file*.txt
Commit
Now let's commit it. Same thing as before, refer to The git first commit section if you need a refresher!
Further reading
- Learn X in Y minutes where X=make↩
- UTD HPC Course Lecture 08↩
- Atlassian Git Branch Tutorial↩