feat: add a dense memory backend implementation #1342

tianrui-wei · 2023-04-25T17:29:20Z

Currently, the mem_t only supports a sparse memory implementation. This
commit adds in dense memory support. It behaves as follows:

it will try to take advantage of HugeTLB and Transparent Huge Pages
in modern Linux kernels
it will allocate once/free once for every memory region, instead of
on demand
paves the way for a simple memory backed block device implementation

Signed-off-by: Tianrui Wei tianrui@tianruiwei.com

scottj97

This does far too much in one commit:

Adding command-line option :d
Creating class hierarchy in mem_t
Adding dense to mem_cfg_t
Implementing dense_mem_t

That's at least 4 separate commits.

The bugfix just pushed should be squashed back in so there is no bug in any commit.

But more importantly, I think this needs more justification. What is the benefit here? Does this improve performance when simulating programs that actually use a large block of memory? By how much?

spike_main/spike.cc

scottj97 · 2023-04-25T21:41:23Z

riscv/devices.h

+  sparse_mem_t(reg_t size);
+  ~sparse_mem_t();
+
+  char* contents(reg_t addr);


Use virtual and override keywords on overridden virtual methods

tianrui-wei · 2023-04-25T21:52:11Z

Hi Scott,

Thanks for reviewing my PR. I'll address your comments in subsequent commits. I'll split this single commit as you suggested.

The benefit is that the current lazy allocation scheme is inefficient compared to allocating the entire backing memory at once. For example, in ucb-bar/chipyard#1438, we unify the backing memory in both spike cosim and the actual RTL simulation to detect divergence. At the start of the cosimulation, the entire RTL memory is copied into spike, which in turn calls calloc many times.

The other reason is so that this could serve as the backing memory for a block device for using with drivers like https://github.com/u-boot/u-boot/blob/master/drivers/mmc/piton_mmc.c. If you'd like I could run some benchmark comparison on memcpy benchmarks.

Thanks,
Tianrui

aswaterman · 2023-04-25T22:10:51Z

I suspect we can improve the current scheme to be more efficient, e.g. by using memalign and allocating larger chunks (say, 2 MiB to match the superpage size). This would require some experimentation, but it seems preferable to having two different schemes.

I suspect the block device will end up being a separate device_t anyway, so I question whether this PR really serves as a building block towards that.

tianrui-wei · 2023-04-25T23:34:23Z

I would argue that multiple backends are beneficial, and is also practical in other projects as qemu: https://github.com/qemu/qemu/blob/master/backends/hostmem.c. For one, the dense allocation will crash if allocated physical memory is larger than host memory, but the sparse backend could effectively test memory address > 39 bits for example. The dense allocation will also make sharing memory easier and more flexible instead of performing a lookup.

jerryz123 · 2023-04-26T00:06:40Z

@tianrui-wei for our use case, we can just provide our own implementation of mem_t that uses a dense memory in our code that links with spike, and pass it to the sim_t constructor.

tianrui-wei · 2023-04-26T01:22:26Z

Could we perhaps only cherry pick e71df64 that implements an abstract class for memory interface, and expose a flexible interface in libriscv?

scottj97

Commit e71df64 fails to compile:

/local_home/sjohnson/spike-regress/riscv-isa-sim/spike_main/spike.cc: In function ‘std::vector<std::pair<long unsigned int, mem_t*> > make_mems(const std::vector<mem_cfg_t>&)’:
/local_home/sjohnson/spike-regress/riscv-isa-sim/spike_main/spike.cc:273:75: error: invalid new-expression of abstract class type ‘mem_t’
             mems.push_back(std::make_pair(cfg.get_base(), new mem_t(cfg.get_size())));
                                                                                   ^

riscv/devices.h

tianrui-wei · 2023-04-26T19:43:33Z

Hi Scott,

Thank you for reviewing and shepherding the PR. I've updated the commit to address your reviews and only cherry-picked the first commit.

Thanks,
Tianrui

Signed-off-by: Tianrui Wei <tianrui@tianruiwei.com>

scottj97

I'll leave it up to @aswaterman but this seems like a pointless change now.

scottj97 · 2023-04-27T18:31:01Z

riscv/devices.h

+  virtual void dump(std::ostream& o) override;
+
+ private:
+  bool load_store(reg_t addr, size_t len, uint8_t* bytes, bool store) override;


Needs virtual

jerryz123 · 2023-07-25T18:23:57Z

Resolved by #1408

tianrui-wei force-pushed the tianrui-dense-mem-impl branch 2 times, most recently from 5ed5f1b to 32acdef Compare April 25, 2023 21:46

scottj97 suggested changes Apr 25, 2023

View reviewed changes

tianrui-wei force-pushed the tianrui-dense-mem-impl branch from 32acdef to ca08a1f Compare April 25, 2023 22:00

tianrui-wei requested a review from scottj97 April 25, 2023 22:01

tianrui-wei force-pushed the tianrui-dense-mem-impl branch from e2300a3 to 286f8b5 Compare April 25, 2023 23:33

scottj97 suggested changes Apr 26, 2023

View reviewed changes

scottj97 reviewed Apr 26, 2023

View reviewed changes

riscv/devices.h Outdated Show resolved Hide resolved

tianrui-wei force-pushed the tianrui-dense-mem-impl branch 2 times, most recently from 3d6c41b to 69e70ad Compare April 26, 2023 19:42

jerryz123 force-pushed the tianrui-dense-mem-impl branch from 69e70ad to 30a0169 Compare April 26, 2023 21:59

feat: creating class hierarchy in mem_t

b72cf05

Signed-off-by: Tianrui Wei <tianrui@tianruiwei.com>

tianrui-wei force-pushed the tianrui-dense-mem-impl branch from 30a0169 to b72cf05 Compare April 26, 2023 22:23

tianrui-wei requested a review from scottj97 April 26, 2023 22:25

scottj97 suggested changes Apr 27, 2023

View reviewed changes

michalt mentioned this pull request Jul 10, 2023

Allow overriding mem_t #1408

Closed

jerryz123 closed this Jul 25, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add a dense memory backend implementation #1342

feat: add a dense memory backend implementation #1342

tianrui-wei commented Apr 25, 2023

scottj97 left a comment

scottj97 Apr 25, 2023

tianrui-wei commented Apr 25, 2023

aswaterman commented Apr 25, 2023

tianrui-wei commented Apr 25, 2023

jerryz123 commented Apr 26, 2023

tianrui-wei commented Apr 26, 2023

scottj97 left a comment •

edited

Loading

tianrui-wei commented Apr 26, 2023

scottj97 left a comment

scottj97 Apr 27, 2023

jerryz123 commented Jul 25, 2023

feat: add a dense memory backend implementation #1342

feat: add a dense memory backend implementation #1342

Conversation

tianrui-wei commented Apr 25, 2023

scottj97 left a comment

Choose a reason for hiding this comment

scottj97 Apr 25, 2023

Choose a reason for hiding this comment

tianrui-wei commented Apr 25, 2023

aswaterman commented Apr 25, 2023

tianrui-wei commented Apr 25, 2023

jerryz123 commented Apr 26, 2023

tianrui-wei commented Apr 26, 2023

scottj97 left a comment • edited Loading

Choose a reason for hiding this comment

tianrui-wei commented Apr 26, 2023

scottj97 left a comment

Choose a reason for hiding this comment

scottj97 Apr 27, 2023

Choose a reason for hiding this comment

jerryz123 commented Jul 25, 2023

scottj97 left a comment •

edited

Loading