Too many `memcpy`s

Cachegrind profiles indicate that the Rust compiler often spends 3-6% of its executed instructions within `memcpy` (specifically `__memcpy_avx_unaligned_erms` on my Linux box), which is pretty incredible.

I have modified DHAT to track `memcpy`/`memmove` calls and have discovered that a lot are caused by obligation types, such as `PendingPredicateObligations` and `PendingObligations`, which are quite large (160 bytes and 136 bytes respectively on my Linux64 machine).

For example, for the `keccak` benchmark, 33% of the copied bytes occur in the `swap` call in the `compress` function:
https://github.com/rust-lang/rust/blob/a6624ed9806fe4caa10de5b94e590f71a4a1eab9/src/librustc_data_structures/obligation_forest/mod.rs#L607-L620

For `serde`, 11% of the copied bytes occur constructing this vector of obligations:
https://github.com/rust-lang/rust/blob/a6624ed9806fe4caa10de5b94e590f71a4a1eab9/src/librustc/ty/wf.rs#L150-L157
and 5% occur appending to this vector of obligations:
https://github.com/rust-lang/rust/blob/ac21131f7859836cd3fcb39231c0162fd892d960/src/librustc/traits/project.rs#L570-L574

It also looks like some functions such as `FulfillmentContext::register_predicate_obligation()` might be passed a `PredicateObligation` by value (using a `memcpy`) rather than by reference, though I'm not sure about that.

I have some ideas to shrink these types a little, and improve how they're used, but these changes will be tinkering around the edges. It's possible that more fundamental changes to how the obligation system works could elicit bigger wins.



	// Now move all popped nodes to the end. Try to keep the order.
	//
	// LOOP INVARIANT:
	// self.nodes[0..i - dead_nodes] are the first remaining nodes
	// self.nodes[i - dead_nodes..i] are all dead
	// self.nodes[i..] are unchanged
	for i in 0..self.nodes.len() {
	match self.nodes[i].state.get() {
	NodeState::Pending \| NodeState::Waiting => {
	if dead_nodes > 0 {
	self.nodes.swap(i, i - dead_nodes);
	node_rewrites[i] -= dead_nodes;
	}
	}

	self.out.iter()
	.inspect(\|pred\| assert!(!pred.has_escaping_bound_vars()))
	.flat_map(\|pred\| {
	let mut selcx = traits::SelectionContext::new(infcx);
	let pred = traits::normalize(&mut selcx, param_env, cause.clone(), pred);
	once(pred.value).chain(pred.obligations)
	})
	.collect()

	obligations.push(get_paranoid_cache_value_obligation(infcx,
	param_env,
	projection_ty,
	cause,
	depth));

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Too many `memcpy`s #64301

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Too many memcpys #64301

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

Too many `memcpy`s #64301