Right now it's possible for the metrics generated by one test to be picked up by another test. Each test should somehow "clear" the collected metrics, so that their actions do not pollute the result of (e.g.) `generate_latest`