-
Notifications
You must be signed in to change notification settings - Fork 4.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SQLite] Fix continuous probing for sqlite #25590
Conversation
cc @sharwell @CyrusNajmabadi 👀 @jasonmalinowski is there a way to get nupkgs for this PR so I can test this locally? |
@@ -64,6 +64,13 @@ protected override string GetDatabaseFilePath(string workingFolderPath) | |||
sqlStorage.Initialize(solution); | |||
|
|||
} | |||
catch (Exception ex) when (ex is DllNotFoundException || ex is EntryPointNotFoundException) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Need to catch TypeInitializationException and check InnerException for the data and throw that
10c2346
to
1b7537a
Compare
// sqlStorage was not instantiated yet, due to an exception in the static constructor. | ||
dbOwnershipLock.Dispose(); | ||
DisableStorage(); | ||
throw ex.InnerException; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
❓ Why not just throw;
here?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The exception thrown from here propagates into ShouldDeleteDatabase
, via here:
http://source.roslyn.io/#Microsoft.CodeAnalysis.Workspaces.Desktop/Workspace/Storage/AbstractPersistentStorageService.cs,184
Maybe throw;
is the better option here, and the check in ShouldDeleteDatabase
should be done for TypeInitializationException and its inner exception.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We're masking an exception and changing its stacktrace, so yes, I'll fix this.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
💭 It seems like the less risky change in the absence of a need to drop the outer exception. I could go either way, let me know which one you will go with.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Fixed.
@@ -98,10 +114,23 @@ private static void EnsureDirectory(string databaseFilePath) | |||
Directory.CreateDirectory(directory); | |||
} | |||
|
|||
static bool IsNativeLibraryResolutionException(Exception ex) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
💡 Missing access modifier (will send PR to add code style rule so this is more apparent)
ex = typeInitializationException.InnerException; | ||
} | ||
|
||
return exception is DllNotFoundException || exception is EntryPointNotFoundException; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
❗️ exception
ex
47064db
to
9badc11
Compare
{ | ||
sqlStorage?.Dispose(); | ||
if (IsNativeLibraryResolutionException(exception)) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
rather than doing this way, can we add explicit check for those dependencies? basically, rather than doing this implicitly through static constructor (http://source.roslyn.io/#Microsoft.CodeAnalysis.Workspaces.Desktop/Workspace/SQLite/SQLitePersistentStorage.cs,96)
make an explicit static API for it such as bool SQLitePersistentStorage.InitLibraries(). which will run the LoadLibrary once and cache the result. and next time it is called, it just return true or false.
and in TryOpenDatabase, before TryGetDatabaseOwnership, it calls SQLitePersistentStorage.InitiLibrary and if that return false, return storage = null and return false like what dbOwnershipLock does.
I think that is more explicit check and follow pattern already in this method.
...
if you are worry about same thing repeatedly get called, then, I think that check should move up to here.
http://source.roslyn.io/#Microsoft.CodeAnalysis.Workspaces.Desktop/Workspace/Storage/PersistenceStorageServiceFactory.cs,24
since the condition where that LoadLibrary will fail is process wide, not per workspace or solution. the factory itself should check such condition and return
http://source.roslyn.io/#Microsoft.CodeAnalysis.Workspaces/Workspace/Host/PersistentStorage/NoOpPersistentStorageService.cs,0ec32e8cdd068f49
since in the process, SQLite will be never supported.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah, that's a better solution. This PR ended up being bigger than I initially imagined 😛
I'll open a PR writing it way.
31a50b5
to
8fb8ada
Compare
This came up so testing doesn't work
|
3deceba
to
17141e1
Compare
Hmmm, so I think I'm hitting a rather peculiar bug right now. It seems to say that it can't probe for e_sqlite.dll in either x86/x64 dir relative to the workspaces.dll, but that should definitely be copied over. Any help here? I don't really understand how this could've passed before, except for a hidden exception in FatalError/DatabaseLogger |
Ah, I think I figured it out. |
@dotnet/roslyn-infrastructure is our jenkins currently having some issues? |
@@ -66,7 +104,17 @@ protected override string GetDatabaseFilePath(string workingFolderPath) | |||
} | |||
catch (Exception) | |||
{ | |||
sqlStorage?.Dispose(); | |||
if (sqlStorage != null) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
if exception was thrown, shouldn't we always dispose the owner lock?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
the lock is to make sure we own sqlite file, so if we dont have it (since it is either null or disposed here), then I think we should let it go?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ah. just realized that the dispose of the storage will dispose the lock. sorry about that.
@@ -100,7 +148,7 @@ private static void EnsureDirectory(string databaseFilePath) | |||
|
|||
protected override bool ShouldDeleteDatabase(Exception exception) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit: convert to => ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
you are good. just realized that dispose of sqlite storage will dispose the lock.
@Pilchie for ask mode approval for 15.7 preview 4 |
Approved - pending passing unit test runs... |
Rebasing again hoping they pass... |
@Therzok Looks like the 64-bit tests have a legitimate failure |
|
||
lock (_gate) | ||
{ | ||
if (_initialized) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
please don't do double checked locking. i still don't trust that .net has a consistent story around reordered writes and whatnot to make this safe. It's trivial, and no perf problem, to just take the lock and do the check.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Oh, this part was just for testing. This is not needed after all.
[DllImport("kernel32.dll")] | ||
private static extern IntPtr LoadLibrary(string dllToLoad); | ||
private static bool _initialized; | ||
private static object _gate = new object(); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
it might make a lot of sense to extract all this code related to laoding sqlite and put it in its own class. The class would expose almost nothing except a bit saying if you can use sqlite or not. You would never have to care or look for exceptions relating to asking the question in teh first place if you can use it.
if (_initialized) | ||
{ | ||
return true; | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
i feel like the momeent after checking, we should set _initialized to true, so we never execute this block more than once.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Instead of just returning 'true', i feel a better API would be to have it actually have a different bool stating if we actually initialized properly or not. (Alternatively, use a bool?
where 'null' means 'have not computed', and non-null is the actual value.)
catch (Exception e) when (e is DllNotFoundException || e is EntryPointNotFoundException) | ||
{ | ||
StorageDatabaseLogger.LogException(e); | ||
return false; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
i really dislike that we haven't marked _iniitialized here. so this can just be hit over and over again.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This was just to test whether unit tests passed with this. I used to have a Lazy<bool>
in an earlier version, but that broke unit tests in the same way they are broken now.
@@ -66,7 +120,17 @@ protected override string GetDatabaseFilePath(string workingFolderPath) | |||
} | |||
catch (Exception) | |||
{ | |||
sqlStorage?.Dispose(); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
i feel like, at the top of this method, we would see in the first place if we had been able to load sqlite. And, if not, we wouldn't do any work. no need for locks, exception handling, and wahtnot in that case. Basically, we shouldn't even be trying this complex logic if we can't even get at sqlite in the first place.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This isn't even called if we haven't been able to load sqlite.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@Therzok the tests probably are failing because some unrelated tests ran before the Sqlite tests and set the static field to false and Sqlite tests are bailing out due to the static field already being set as false.
I bet before, except Sqlite tests, other conditions made Sqlite service to bail out before creating SqliteStorage (so no static ctor get called) and let the tests to actually run.
...
I think either you take out Sqlite persistent service out from default test MEF so that only SQLite test uses real Sqlite persistent service, or addin test checking code somewhere.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
or you don't save the flag as you did :) that will work as well.
@Therzok alright, test is still failing. so I looked through tests. looks like Sqlite tests bypass Factory and create Sqlite service directly. so there is no one who initialize sqlite. I think that is why your test is failing. put the initialize in here you probably can add it as static ctor of the test. and I think you can put back the flag to make it not run twice if you want to. looks like test MEF doesn't include Sqlite service so static flag shouldn't be a problem. |
Omg, thanks! I missed that! Will fix! |
|
||
internal static bool TryInitializeLibraries() => _initialized.Value; | ||
|
||
private static Lazy<bool> _initialized = new Lazy<bool>(() => TryInitializeLibrariesLazy()); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
readonly and s_initialized
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Fixed!
The static constructor of SQLitePersistentStorage would throw an exception, which would then get logged by the exception logger. This changes the following logic: * The database used to be deleted if e_sqlite was not found. * The sqlite persistent storage is not used if e_sqlite is not found. * The database ownership lock is released if any exceptions happen while trying to instantiate the storage. Fixes dotnet#24042 Extra story: There is a architectures of e_sqlite not existing for some platforms, i.e. ARM64. Therefore, in MonoDevelop, it would throw a TypeInitializationException on trying to initialize the SQLitePersistentStorage. In this case, what would happen is: * The persistent storage service would create the lock file. * It would throw in the static constructor, without having passed ownership * Thus, the lock file is created by the service, not held by anyone specifically * The exception wasn't accounted for, so the storage file would be deleted * We now have a lock file present and no storage created * Roslyn will keep trying to create the persistent storage on every subsequent try, ending up in a performance issue caused by continuous IO on trying to get ownership on the lock file, but it can't take it because nobody released the old lock Therefore, shortcircuiting and disabling persistent storage in case the lib is not found is better.
Thanks! |
The static constructor of SQLitePersistentStorage would throw an
exception, which would then get logged by the exception logger.
This changes the following logic:
Fixes #24042
Extra story:
There is a architectures of e_sqlite not existing for some platforms,
i.e. ARM64. Therefore, in MonoDevelop, it would throw a
TypeInitializationException on trying to initialize the SQLitePersistentStorage.
In this case, what would happen is:
ownership
specifically
deleted
subsequent try, ending up in a performance issue caused by continuous
IO on trying to get ownership on the lock file, but it can't take it
because nobody released the old lock
Therefore, shortcircuiting and disabling persistent storage in case the
lib is not found is better.
Ask Mode template not completed
Customer scenario
What does the customer do to get into this situation, and why do we think this
is common enough to address for this release. (Granted, sometimes this will be
obvious "Open project, VS crashes" but in general, I need to understand how
common a scenario is)
Bugs this fixes
(either VSO or GitHub links)
Workarounds, if any
Also, why we think they are insufficient for RC vs. RC2, RC3, or RTW
Risk
This is generally a measure our how central the affected code is to adjacent
scenarios and thus how likely your fix is to destabilize a broader area of code
Performance impact
(with a brief justification for that assessment (e.g. "Low perf impact because no extra allocations/no complexity changes" vs. "Low")
Is this a regression from a previous update?
Root cause analysis
How did we miss it? What tests are we adding to guard against it in the future?
How was the bug found?
(E.g. customer reported it vs. ad hoc testing)
Test documentation updated?
If this is a new non-compiler feature or a significant improvement to an existing feature, update https://github.com/dotnet/roslyn/wiki/Manual-Testing once you know which release it is targeting.