Slash and burn: build complexity begone! #284

rmunn · 2022-08-18T10:46:08Z

This PR radically simplifies the build by:

Getting rid of the split between FW 8 and FW 9 builds now that we no longer need to build FW 8 versions
Getting rid of mono5-sil entirely
Getting rid of NuGet.targets in favor of a simple dotnet restore
Getting rid of MsBuild targets in LfMerge.proj in favor of a simple dotnet build
Getting rid of Test target in LfMerge.proj in favor of a simple dotnet test
Getting rid of build/LfMerge.proj entirely and using LfMerge.sln as the entry point to the project
Getting rid of the entire build directory tree as it's no longer needed!

The result is a build that runs much faster, won't fail when TeamCity goes down, and uses standard dotnet tools to build. With this change, we'll be able to run local builds on developer machines, without needing to run them inside a Docker image. That should allow running unit tests in VS Code using standard VS Code test-runner extensions.

This change is

We now build for FW9 only, which mean we no longer need the entire fieldworks8-* branch system. The calculate-branches and pbuild scripts can become way simpler.

This lets us get rid of the NuGet.targets file and its complexity.

A lot of failures, but now `dotnet test` is able to find and run the tests. We'll work on the test failures later.

Now that mono-sil5 and its complexity are gone, we can have pbuild.sh create the base images directly since that doesn't take very long. This will allow us to iterate more quickly on base image changes.

We no longer need to use LfMerge.proj at all! We can now rely entirely on the standard `dotnet build` and `dotnet test` build process.

We can move most things into a single build-and-test.sh script.

We only need a few environment variables (such as adjusting the PATH to point to the version of Mercurial that we need to use), and soon we'll be able to get rid of those as well.

Instead of using a non-standard `/storage/nuget` folder that won't be on most people's computers, use the NuGet package cache that they alreday have, located in `~/.nuget/packages`. Because if they're using VS Code or any other IDE that knows about .NET and NuGet, that folder is probably already populated with the appropriate packages, which makes even the first build go faster.

PrivateAssets="All" is a simpler way to pull in the build targets we need from that package.

Set TEST_SPEC to a partial (or full) name of a test class or test method and `dotnet test` will use a "name contains X" filter to locate tests.

To use this, choose the "Run Test Task" command in VS Code. The dotnet test will run, then pause waiting for a debugger to attach. Go to the debug menu and choose the "Attach" configuration, then enter the PID printed on the test console. (There will be two, one for LfMerge.Tests and one for LfMerge.Core.Tests; most of the tests are in LfMerge.Core so that's the one you'll need most of the time). The debugger will attach to the test process, which is still in a paused state. Set any breakpoints you want to hit in the unit tests, then unpause the debugger and it will start running the unit tests.

LfMergeSettings was caching the value of environment variables, but some strings like BaseDir were being accessed in the constructor, before derived classes like LfMergeSettingsDouble had a chance to change them. This meant that many tests, which relied on testlangproj or other projects being copied into the test's temp dir, were failing because LfMergeSettings was using the default /var/lib/languageforge/... base directory instead. By removing the caching, we ensure that the LfMergeSettingsDouble class is able to change the BaseDir correctly.

ProcessingState tests need their expected results updated to include the new Error property, and the MercurialServer code needs to update its "shutdown the server" logic to expect "kill" to be a shell builtin rather than a separate binary. That fixes a total of 9 failing tests.

By removing the `git checkout` and `git clean` steps, we ensure that we can run the build process against uncomitted code in the repository. This will make testing changes much simpler. A leter commit will probably remove the copying of the Git repo into a temporary working directory, and simply run the build in the user's repo via a bind mount. That will mean that build outputs (such as TestResult directories) will end up in your repo, which will be useful in any situation where you want to debug the unit tests.

Now that LfMergeSettings uses environment variables, we no longer need to copy lfmerge.conf into the final Docker image. This resolves a long-standing TODO.

Now that LfMerge logs to the console rather than syslog, we don't need rsyslog or its config files in the final Docker image.

One step was still using version 2.4.0 even though the rest of the workflow had moved to version 3.0.2.

This was needed when we were building FW8 and FW9 builds from different branches, but it's no longer useful. The only thing it was still being used for is in figuring out which branch to tag in the release workflow, but since we only run that step when the branch is "live" anyway, we can just hardcode the branch name. Yet more build complexity, begone!

This fulfills a long-standing TODO comment.

This allows the build to create the installation tarball directly in the bind-mounted repo, eliminating a copy step.

rmunn · 2022-08-19T07:30:25Z

With all these changes, the build has essentially boiled down to three (or four) steps:

dotnet build
(Optional): dotnet test
docker/scripts/create-installation-tarball.sh
docker build -t ghcr.io/sillsdev/lfmerge -f Dockerfile.finalresult .

The only bit of complexity left is that we do need to be able to build different lfmerge images for different model versions in the future, so when model version 7000073 comes out, we'll need to run steps 1-3 in a for loop:

# This is an array; see https://www.gnu.org/software/bash/manual/html_node/Arrays.html
DBMODEL_VERSIONS=(7000072 7000073)

# Run the build once for each DbVersion
for DbVersion in ${DBMODEL_VERSIONS[@]}; do
    dotnet build /v:m /property:Configuration=Release /property:DatabaseVersion=${DbVersion} LfMerge.sln
    docker/scripts/create-installation-tarball.sh
done

# Build final Docker image
docker build -t ghcr.io/sillsdev/lfmerge -f Dockerfile.finalresult .

rmunn · 2022-08-19T07:37:07Z

You could almost skip running the build in a Docker container at all, but that causes issues with shared libraries. For example, when I ran the build with dotnet build on an Ubuntu 22.04 machine, then ran the lfmerge image (which uses Microsoft's dotnet/sdk:6.0 image, which is based on Debian 11), lfmergeqm failed to run and I got the following error messages:

/usr/lib/lfmerge/7000072/LfMergeQueueManager: /lib/x86_64-linux-gnu/libc.so.6: version `GLIBC_2.34' not found (required by /usr/lib/lfmerge/7000072/LfMergeQueueManager)
/usr/lib/lfmerge/7000072/LfMergeQueueManager: /lib/x86_64-linux-gnu/libc.so.6: version `GLIBC_2.33' not found (required by /usr/lib/lfmerge/7000072/LfMergeQueueManager)

So building in the same environment that the image will run in is still a good idea, and we can't quite radically simplify the build down to a simple six-line shell script as shown in the comment above. BUT... the dotnet build and dotnet test steps do work well enough that you can debug unit tests without needing a Docker container involved. Which is a big win.

This will avoid the possibility of creating a "mixed" lfmerge installation containing files from a previous successful build, alongside a build that only partially succeeded and overwrote some but not all files from before. This also remvoes just a little bit of unnecessary chattiness from the build output.

We no longer need the lines about LfMerge.proj or the ones about build configurations for DbVersions 68, 69, or 70.

We're never going to use MonoDevelop in the future, as VS Code is so much better. So the whole MonoDevelopProperties section in the .sln file is a waste of space and processing time. Let's dump the whole thing.

megahirt

What a great PR! A long time in coming. Thanks also to @josephmyers for laying the foundation, making this slash and burn possible.

.vscode/tasks.json

Co-authored-by: Christopher Hirt <chris@hirtfamily.net>

rmunn added 25 commits August 18, 2022 10:24

No more FW8 branches

d0523f6

We now build for FW9 only, which mean we no longer need the entire fieldworks8-* branch system. The calculate-branches and pbuild scripts can become way simpler.

Slash and burn: complexity begone!

75f0a4b

Rely entirely on dotnet restore for packages

6928646

This lets us get rid of the NuGet.targets file and its complexity.

Get unit tests running

f8dd9c3

A lot of failures, but now `dotnet test` is able to find and run the tests. We'll work on the test failures later.

Slash and burn: mono5-sil et al, begone!

07e3713

Have pbuild.sh create base images directly

06a1a68

Now that mono-sil5 and its complexity are gone, we can have pbuild.sh create the base images directly since that doesn't take very long. This will allow us to iterate more quickly on base image changes.

Slash and burn: LfMerge.proj, begone! Hurrah!

dd21a62

We no longer need to use LfMerge.proj at all! We can now rely entirely on the standard `dotnet build` and `dotnet test` build process.

No longer need to keep NuGet packages in build dir

0f8707a

Slash and burn: build complexity begone!

8dc75e1

We can move most things into a single build-and-test.sh script.

Slash and burn: most of environ script, begone!

1ea67e7

We only need a few environment variables (such as adjusting the PATH to point to the version of Mercurial that we need to use), and soon we'll be able to get rid of those as well.

Slightly simpler handling of SIL.Chorus.Mercurial

5152959

PrivateAssets="All" is a simpler way to pull in the build targets we need from that package.

Honor TEST_SPEC env var in unit testing

57c41bf

Set TEST_SPEC to a partial (or full) name of a test class or test method and `dotnet test` will use a "name contains X" filter to locate tests.

Remove lfmerge.conf (replaced by env vars)

ba6c819

Now that LfMergeSettings uses environment variables, we no longer need to copy lfmerge.conf into the final Docker image. This resolves a long-standing TODO.

Remove rsyslog from final Dockerfile

b17ddf0

Now that LfMerge logs to the console rather than syslog, we don't need rsyslog or its config files in the final Docker image.

Use same checkout action in entire GHA workflow

20e839d

One step was still using version 2.4.0 even though the rest of the workflow had moved to version 3.0.2.

Remove unnecessary step from release workflow

0cc5424

This fulfills a long-standing TODO comment.

Run build directly in repo (no more copying)

0f1ba67

This allows the build to create the installation tarball directly in the bind-mounted repo, eliminating a copy step.

Add sanity check to installation script

c8312f5

Fix workflow syntax error

4ca0a46

rmunn added 3 commits August 19, 2022 14:49

Update README with new build instructions

5e8f346

Remove unused lines from LfMerge.sln

8bc4e8c

We no longer need the lines about LfMerge.proj or the ones about build configurations for DbVersions 68, 69, or 70.

Delete MonoDevelopProperties from LfMerge.sln

d6bbedc

We're never going to use MonoDevelop in the future, as VS Code is so much better. So the whole MonoDevelopProperties section in the .sln file is a waste of space and processing time. Let's dump the whole thing.

megahirt approved these changes Aug 19, 2022

View reviewed changes

.vscode/tasks.json Outdated Show resolved Hide resolved

Remove unneeded test filter in .vscode/tasks.json

ece4e67

Co-authored-by: Christopher Hirt <chris@hirtfamily.net>

rmunn merged commit a47416d into master Aug 19, 2022

rmunn deleted the chore/slash-and-burn-complexity-begone branch August 19, 2022 08:48

rmunn self-assigned this Aug 19, 2022

This was referenced Aug 19, 2022

Get unit tests running again in Docker containers #240

Closed

WIP: Modernize build #96

Closed

Update dependencies #124

Open

Resolve MSB3026 warnings in LFMerge build #285

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Slash and burn: build complexity begone! #284

Slash and burn: build complexity begone! #284

rmunn commented Aug 18, 2022 •

edited by megahirt

Loading

rmunn commented Aug 19, 2022

rmunn commented Aug 19, 2022

megahirt left a comment

Slash and burn: build complexity begone! #284

Slash and burn: build complexity begone! #284

Conversation

rmunn commented Aug 18, 2022 • edited by megahirt Loading

rmunn commented Aug 19, 2022

rmunn commented Aug 19, 2022

megahirt left a comment

Choose a reason for hiding this comment

rmunn commented Aug 18, 2022 •

edited by megahirt

Loading