Fall back to CpuId if failed to get cache size from OS #24989

ghost · 2019-06-06T01:06:01Z

It's possible for GetLogicalProcessorCacheSizeFromOS() to fail;
this happens on alpine linux where it compiles to just return 0;.

As a fallback, we can get the cache size from CpuId. Previously that
was specific to x86; this PR preserves the behavior that we never call
GetLogicalProcessorCacheSizeFromOS on x86.

CpuId only works on x86 and amd64; on other systems we may still return
0 from here. Then GC defaults to a cache size of only 0.25MB.

Fix #16071

It's possible for GetLogicalProcessorCacheSizeFromOS() to fail; this happens on alpine linux where it compiles to just `return 0;`. As a fallback, we can get the cache size from CpuId. Previously that was specific to x86; this PR preserves the behavior that we never call GetLogicalProcessorCacheSizeFromOS on x86. CpuId only works on x86 and amd64; on other systems we may still return 0 from here. Then GC defaults to a cache size of only 0.25MB. Note: Removed the code in an `#ifdef _WIN64` that was nested inside of `#if defined (_TARGET_X86_)`. Presuming that is dead code.

janvorli · 2019-06-06T09:01:52Z

src/vm/util.cpp

+}
+#endif // _TARGET_X86_
+
+// fix this if/when AMD does multicore or SMT


This comment is very obsolete. I was just about to ask about AMD support. I've found the following CPUID specification from AMD that contains all the necessary info here: https://www.amd.com/system/files/TechDocs/25481.pdf

Edit: Please ignore this comment, I've missed the code above that already handles the AMD.

janvorli · 2019-06-06T09:17:09Z

src/vm/util.cpp


-    if (maxSize)
+    PAL_TRY(Param *, pParam, &param)


Since the attached PAL_EXCEPT_FILTER is using DefaultCatchFilter, the parameter type has to be derived from DefaultCatchFilterParam. The DefaultCatchFilter casts the parameter * to DefaultCatchFilterParam* and reads the pv field out of it.
The DefaultCatchFilter is used to swallow hardware exceptions that can stem from the CPUID.

janvorli

LGTM, thank you!

Maoni0

LGTM!

This typo was in #24989 so would be a new regression in 3.0. In an x86 build, it causes us to not get the cache size correct, leading us to use a smaller default cache size and do more GCs. Tested with GCPerfSim and this PR reduces TotalNumberGCs by 33% using an x86 build.

This typo was in dotnet#24989 so would be a new regression in 3.0. In an x86 build, it causes us to not get the cache size correct, leading us to use a smaller default cache size and do more GCs. Tested with GCPerfSim and this PR reduces TotalNumberGCs by 33% using an x86 build.

This typo was in #24989 so would be a new regression in 3.0. In an x86 build, it causes us to not get the cache size correct, leading us to use a smaller default cache size and do more GCs. Tested with GCPerfSim and this PR reduces TotalNumberGCs by 33% using an x86 build.

…r#24989) * Fall back to CpuId if failed to get cache size from OS It's possible for GetLogicalProcessorCacheSizeFromOS() to fail; this happens on alpine linux where it compiles to just `return 0;`. As a fallback, we can get the cache size from CpuId. Previously that was specific to x86; this PR preserves the behavior that we never call GetLogicalProcessorCacheSizeFromOS on x86. CpuId only works on x86 and amd64; on other systems we may still return 0 from here. Then GC defaults to a cache size of only 0.25MB. Note: Removed the code in an `#ifdef _WIN64` that was nested inside of `#if defined (_TARGET_X86_)`. Presuming that is dead code. * Fix exception handler Commit migrated from dotnet/coreclr@6d29903

This typo was in dotnet/coreclr#24989 so would be a new regression in 3.0. In an x86 build, it causes us to not get the cache size correct, leading us to use a smaller default cache size and do more GCs. Tested with GCPerfSim and this PR reduces TotalNumberGCs by 33% using an x86 build. Commit migrated from dotnet/coreclr@ba39a15

janvorli reviewed Jun 6, 2019

View reviewed changes

Fix exception handler

9c09803

janvorli approved these changes Jun 6, 2019

View reviewed changes

Merge remote-tracking branch 'upstream/master' into alpine_cache_size

aee00c3

ghost requested a review from Maoni0 June 11, 2019 22:34

Maoni0 approved these changes Jun 11, 2019

View reviewed changes

ghost merged commit 6d29903 into dotnet:master Jun 11, 2019

ghost deleted the alpine_cache_size branch June 11, 2019 23:20

jkotas mentioned this pull request Jul 19, 2019

Cleanup processor cache size computation #25781

Merged

ghost mentioned this pull request Jul 19, 2019

Fix typo: _TARGET_X86 -> _TARGET_X86_ #25788

Merged

This pull request was closed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fall back to CpuId if failed to get cache size from OS #24989

Fall back to CpuId if failed to get cache size from OS #24989

Uh oh!

ghost commented Jun 6, 2019 •

edited by ghost

Loading

Uh oh!

janvorli Jun 6, 2019 •

edited

Loading

Uh oh!

janvorli Jun 6, 2019

Uh oh!

janvorli left a comment

Uh oh!

Maoni0 left a comment

Uh oh!

Uh oh!

Fall back to CpuId if failed to get cache size from OS #24989

Fall back to CpuId if failed to get cache size from OS #24989

Uh oh!

Conversation

ghost commented Jun 6, 2019 • edited by ghost Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

janvorli Jun 6, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

janvorli Jun 6, 2019

Choose a reason for hiding this comment

Uh oh!

janvorli left a comment

Choose a reason for hiding this comment

Uh oh!

Maoni0 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ghost commented Jun 6, 2019 •

edited by ghost

Loading

janvorli Jun 6, 2019 •

edited

Loading