Add ExtendedData to FSharpDiagnostic #15840

DedSec256 · 2023-08-22T17:14:22Z

This PR implements fsharp/fslang-suggestions#1094 and proposes the following:

Add property ExendedData: IFSharpDiagnosticExtendedData option to FSharpDiagnostic,
where IFSharpDiagnosticExtendedData implementations are created from PhasedDiagnostic exceptions and can lazily provide the data needed for quick fixes and other IDE-related things
Add several IFSharpDiagnosticExtendedData implementations as a proof-of-concept (and which are already needed to stop parsing error messages for some quick fixes :) )

To do:

Add missing docs for the new api
Fix CI

src/Compiler/FSComp.txt

src/Compiler/Symbols/FSharpDiagnostic.fs

vzarytovskii · 2023-08-22T22:11:59Z

Probably needs some docs + examples of usage.

vzarytovskii · 2023-08-22T22:16:49Z

Binary compatibility bothers me a bit here. Will need to think about implications, if we'll need to be making
any changes.

@baronfel @TheAngryByrd thoughts?

src/Compiler/Symbols/FSharpDiagnostic.fs

TheAngryByrd · 2023-08-23T13:39:10Z

Binary compatibility bothers me a bit here. Will need to think about implications, if we'll need to be making any changes.

@baronfel @TheAngryByrd thoughts?

From a FSAC point of view, I think we've grown accustomed to dealing with breaking changes, so not too worried there.

The bigger concern would be from some analyzer to be point of view, where if analyzers were relying on some form of structured data, I think FCS would need to publish it for ever, or at least until some understandable breaking change point (and not in a hotfix release). Would have to probably create new types if we need new ways to portray the information. Feels akin to an event sourcing versioning problem.

kerams · 2023-08-23T13:59:13Z

True enough, but I don't see a way around it unless you want to do IDictionary<string, obj> containing the barest primitives.

baronfel · 2023-08-23T14:36:01Z

@TheAngryByrd yes, event sourcing was exactly the problem I had in mind. That's why my initial suggestion was something fairly untyped as the message envelope (IDictionary<string, obj> or IDictionary<string, string>) with 'schemas' of particular string-based keys being probe-able from the consumer's side to prevent typecasting/API-change-based errors.

That's just a looser form of API compatibility problem, though.

T-Gro · 2023-08-23T17:23:39Z

Binary compatibility bothers me a bit here. Will need to think about implications, if we'll need to be making any changes.
@baronfel @TheAngryByrd thoughts?

From a FSAC point of view, I think we've grown accustomed to dealing with breaking changes, so not too worried there.

The bigger concern would be from some analyzer to be point of view, where if analyzers were relying on some form of structured data, I think FCS would need to publish it for ever, or at least until some understandable breaking change point (and not in a hotfix release). Would have to probably create new types if we need new ways to portray the information. Feels akin to an event sourcing versioning problem.

That is the reason why I suggested using a composable list of lower-level extended data items , instead of an option.
That way, an analyzer could have a logic for searching for a particular item, e.g. "ActualTypeExtendedData".

And it would not fail if the list is extended in the future, neither compile-time nor run-time.

It is similar to an approach with a dictionairy of primitives, but instead of .NET primitives, it would use compiler-domain-primitives, and instead of string-based keys, the naming would come from the name of a type.

From the point of view of this PR, it's not much more to change - turn option into a list, and remodel some of the examples.

DedSec256 · 2023-08-24T17:23:04Z

@vzarytovskii,

Probably needs some docs + examples of usage.

The documentation is not yet written, while the api discussions are going on. But of course it will be added.

Small prototype JetBrains/resharper-fsharp@fd9dd3c.
One of the problems that can be solved is related to parsing diagnostic messages for additional information necessary to create strongly typed warnings/errors and quick fixes for them — these hacks stop working when changing localization.

Also, we have a bunch of PRs with new quick fixes:

https://github.com/JetBrains/resharper-fsharp/blob/f04f7326b3888b778238d0a1e0e6283a39f6d532/ReSharper.FSharp/src/FSharp.Psi.Daemon/src/Stages/FcsErrorsStageProcessBase.fs#L319-L325 (from JetBrains/resharper-fsharp#545)

https://github.com/JetBrains/resharper-fsharp/blob/884e2a68eda184534cd349cc893175978df8bcb8/ReSharper.FSharp/src/FSharp.Psi.Daemon/src/Stages/FcsErrorsStageProcessBase.fs#L319-L325 (from JetBrains/resharper-fsharp#549)

https://github.com/JetBrains/resharper-fsharp/blob/432d2009fdda494222184ab2865f77195c7c5678/ReSharper.FSharp/src/FSharp.Psi.Daemon/src/Stages/FcsErrorsStageProcessBase.fs#L319-L329 (from JetBrains/resharper-fsharp#546)

Where instead of error.Message.StartsWith/EndsWith can be used type check for ValueNotContainedDiagnosticExtendedData and data to compare from it:

fsharp/src/Compiler/Symbols/FSharpDiagnostic.fs

Lines 103 to 107 in 917fa2d

    
           type ValueNotContainedDiagnosticExtendedData 
        
               internal (symbolEnv: SymbolEnv, signatureValue: Val, implValue: Val) = 
        
               interface IFSharpDiagnosticExtendedData 
        
               member x.SignatureValue = FSharpMemberOrFunctionOrValue(symbolEnv, mkLocalValRef signatureValue) 
        
               member x.ImplementationValue = FSharpMemberOrFunctionOrValue(symbolEnv, mkLocalValRef implValue)

As you can see from these examples, a single extended data type (such as TypeMismatchDiagnosticExtendedData) can be used for several different diagnostics, even if they have a different number, but the same (or similar) semantics.

At the same time, different extended data types can be used to restore the diagnostic context in the case of heterogeneous diagnostics under the same number (as, for example, in MissingErrorNumber diagnostics group)

@baronfel,

That's why my initial suggestion was something fairly untyped as the message envelope (IDictionary<string, obj> or IDictionary<string, string>) with 'schemas' of particular string-based keys being probe-able from the consumer's side to prevent typecasting/API-change-based errors.

Currently, FCS provides a strongly typed API for all public subsystems. Quite unexpectedly and confusing to receive in this case a set of raw strings/objects with which additional manipulations must be carried out. At the same time, lazy-computed properties with the necessary data can be located inside a separate type, which can play a role in large sets of additional data. For example, extended data might provide a set of missing elements to implement an interface that might not be needed at all, because no one would call a quick fix for it, but it would still have to be computed and put into a dictionary. With properties, we also can change data calculation to lazy without losing backward compatibility.

@T-Gro,

That is the reason why I suggested using a composable list of lower-level extended data items , instead of an option.
That way, an analyzer could have a logic for searching for a particular item, e.g. "ActualTypeExtendedData".

Together with diagnostic numbers and strongly typed extended data, we can not only get additional data, but also the ability to restore the context of the diagnostic itself.

For example, in this code for heterogeneous diagnostics with a single number, from the ExtendedData type we can restore the context without a message

If we want to make a list of ExtendedData, we need to think about how the semantics of diagnostic can be restored from the list? Does the order of the data in this list matter? For example, it is easy to imagine how the developer of some analyzer simply accesses the first element in the list, always consider it appropriate. And it's easy to accidentally break the order of extended data types in a list.

In the current PR, we have the following options, which seem to be no worse than lists and dictionaries

Use existing diagnostic extended data if it makes sense
Add properties to existing diagnostic if necessary
Create a new extended data type for new diagnostics if others do not fit

ExtendedData can also be composed with each other. Types can be composed in a variety of ways, unlike lists and dictionaries – there are no contracts in collections, and the type has an explicit contract.

baronfel · 2023-08-24T17:40:39Z

Thanks for the detailed responses @DedSec256. Your point about lazy calculation of properties allowing for minimizing the cost of generating good diagnostic data is a great one - it's conceptually similar the reason why detecting a codefix is relevant and applying the codefix is logically separate in models like the LSP.

I think I'm beginning to be convinced. We already have some amount of API compat issues, but this kind of API compat is entirely in-process and manageable by the editors - it will only appear as each editor updates its FCS dependency. It's not going to start happening by e.g. a .NET SDK update alone. And the mitigations would hopefully be reasonable - only areas that dealt directly with each diagnostic would be impacted. And putting the relevant codefix data directly on the extended data would reduce the amount of duplicated code in each editor (to do things like determine the name of a type to suggest, or similar).

T-Gro · 2023-08-25T06:58:25Z

Together, diagnostic numbers and strongly typed extended data, we can not only get additional data, but also the ability to restore the context of the diagnostic itself.

Strongly typed data tailored for each diagnostic specifically is of course a superior data-transfer model from the consumer point of view, but it only holds in a closed-world assumption.
That is, the consumer is tightly coupled to the type definitions exposed and possibly updated with every release.

Which I don't believe can hold in an external analyzer scenario, which is on the roadmap.

That being said, could you please mark all the added types as [<Experimental>] ?

nojaf · 2023-08-25T08:25:31Z

I want to throw in my plus one for the current proposal.

but this kind of API compat is entirely in-process and manageable by the editors

I'm pretty convinced that the amount of consumers of this exact API will be minimal. Getting a sense of what FCS consumers would actually use this and how they feel about nice typed data should definitely weigh in on the decision here.

As for the compatibility issue, this could be detected early on if public preview FCS releases are consumable for all parties and dropped frequently. I only want to state that this potential problem might not be as painful as you currently think it is.

DedSec256 · 2023-08-25T14:43:17Z

Strongly typed data tailored for each diagnostic specifically is of course a superior data-transfer model from the consumer point of view, but it only holds in a closed-world assumption.
That is, the consumer is tightly coupled to the type definitions exposed and possibly updated with every release.
Which I don't believe can hold in an external analyzer scenario, which is on the roadmap.

Existing F# diagnostics almost never change, this means that the ExtendedData Type corresponding to the diagnostic is unlikely to ever change, but, of course, its content may change. Here we can use standard API versioning approaches: leave the old properties for compatibility, mark them with [<Deprecated>], add new properties, which may contain even a separate subtype, and so on.

A loosely typed solution will just silently break external users.

DedSec256 · 2023-08-31T17:10:21Z

I think it's ready for review and further discussion

nojaf · 2023-09-05T14:26:31Z

I'm genuinely enthusiastic about the potential trajectory here. I've been actively experimenting with incorporating additional data into #15256, and you can check out my work at https://github.com/DedSec256/fsharp/pull/2/files.

The notion of having both the error code and supplementary information seamlessly integrated into the IDE holds significant promise for uncovering optimal refactoring opportunities.

In the example I've provided, I can readily guide users towards considering the inclusion of a type annotation precisely where the ambiguity arises within the record definition.

vzarytovskii · 2023-09-05T16:40:15Z

We'll need to go over it next Monday on our review session and discuss implications for the public surface and both future Analyzers SDK and LSP, and how will we go about deprecating stuff if needed.

vzarytovskii

I like the idea, but a bit on the fence because of how ad-hoc this solution is (not necessary a bad thing), and how much of the new public surface we're exposing.

I don't mind merging it and see how does it look with real compiler diagnostics exposed to the user, but would like to hear others' opinions about it.

psfinaki · 2023-09-11T17:37:10Z

Hey, I will go thorough this tomorrow to get the idea of the possible implications on the editor side - stay tuned.

src/Compiler/FSComp.txt

src/Compiler/Symbols/FSharpDiagnostic.fsi

psfinaki

Sorry didn't want to block this. Left some notes - might be misunderstanding something.

by mistake

DedSec256 · 2023-09-12T16:15:47Z

@psfinaki,

Hey this might indeed break a code fix, the RenameParamToMatchSignature one, here. This should be addressed - which might be just changing the diag code in the fix/tests or something more elaborate.
Should be easy since we have tests for this now. But I can help with this if needed.
That said, this should be breaking now then...

As we can see, this test is green after the changes, since the quickfix itself is tied to the diagnostic number, which I did not change.

And even more, in the future this code can be replaced by simply getting the names of the arguments from ArgumentsInSigAndImplMismatchExtendedData

fsharp/vsintegration/src/FSharp.Editor/CodeFixes/RenameParamToMatchSignature.fs

Lines 23 to 29 in d95e8f3

    
           let getSuggestion (d: Diagnostic) = 
        
               let parts = Regex.Match(d.GetMessage(), ".+'(.+)'.+'(.+)'.+") 
        
               if parts.Success then 
        
                   ValueSome parts.Groups.[1].Value 
        
               else 
        
                   ValueNone

But yeah more importantly - why the removal is needed?

For this error, instead of the general diagnostic exception type with a number and message, I created the separate strict type ArgumentsInSigAndImplMismatch, in order to then create the ArgumentsInSigAndImplMismatchExtendedData for it, by analogy with other more typed errors.

fsharp/src/Compiler/Symbols/FSharpDiagnostic.fs

Lines 188 to 189 in 9cf7dcb

    
           | ArgumentsInSigAndImplMismatch(sigArg, implArg) -> 
        
               Some(ArgumentsInSigAndImplMismatchExtendedData(sigArg, implArg))

Not strongly against but not a fan of marker interfaces either. What could be alternative designs here?

Base class? :)
Using the marker interface, the API consumer can find a list of all its implementations and thereby understand which ExtendedData exist for diagnostics. You can also later specify some common properties for all ExtendedData by adding them to this interface/base class.

src/Compiler/Symbols/FSharpDiagnostic.fs

.../FSharp.Compiler.Service.Tests/FSharp.Compiler.Service.SurfaceArea.netstandard20.release.bsl

psfinaki

@DedSec256 thanks for the explanations, it took me some time to understand what's going on here.

I don't have any general objections, things "feel" a bit heavy but I don't have a better design idea and the potential benefits for the tooling outweigh the big API surface for me.

We have this as an experimental so for me it would be important to try building something on top of it to see this in action. Maybe we can then adjust the API on-the-go a bit.

Thanks for this!

tests/FSharp.Compiler.ComponentTests/ErrorMessages/ExtendedDiagnosticDataTests.fs

src/Compiler/Symbols/FSharpDiagnostic.fs

DedSec256 · 2023-09-13T15:03:14Z

Can you, please, mark conversations that are no longer relevant as resolved? :)
So that I can better understand what issues about this PR are relevant and require my changes.

DedSec256 · 2023-09-13T19:52:28Z

Thanks to everyone who took part in this PR (:

wip

821e496

DedSec256 requested a review from a team as a code owner August 22, 2023 17:14

Merge branch 'main' into ber.a/diagnosticData

917fa2d

vzarytovskii reviewed Aug 22, 2023

View reviewed changes

src/Compiler/FSComp.txt Show resolved Hide resolved

vzarytovskii reviewed Aug 22, 2023

View reviewed changes

src/Compiler/FSComp.txt Show resolved Hide resolved

vzarytovskii reviewed Aug 22, 2023

View reviewed changes

src/Compiler/Symbols/FSharpDiagnostic.fs Show resolved Hide resolved

vzarytovskii reviewed Aug 22, 2023

View reviewed changes

src/Compiler/Symbols/FSharpDiagnostic.fs Show resolved Hide resolved

vzarytovskii reviewed Aug 22, 2023

View reviewed changes

src/Compiler/Symbols/FSharpDiagnostic.fs Show resolved Hide resolved

T-Gro reviewed Aug 23, 2023

View reviewed changes

src/Compiler/Symbols/FSharpDiagnostic.fs Show resolved Hide resolved

DedSec256 added 5 commits August 25, 2023 21:18

wip

769fd71

Merge branch 'main' into ber.a/diagnosticData

0176bec

wip

6969538

docs

4b61787

fix

7a7e9bc

T-Gro approved these changes Sep 1, 2023

View reviewed changes

DedSec256 and others added 2 commits September 5, 2023 03:24

Merge branch 'main' into ber.a/diagnosticData

825d557

Merge branch 'main' into ber.a/diagnosticData

2be0669

vzarytovskii added the Needs-Triage label Sep 5, 2023

vzarytovskii approved these changes Sep 11, 2023

View reviewed changes

Merge branch 'main' into ber.a/diagnosticData

9cf7dcb

psfinaki previously requested changes Sep 12, 2023

View reviewed changes

src/Compiler/FSComp.txt Show resolved Hide resolved

src/Compiler/Symbols/FSharpDiagnostic.fsi Outdated Show resolved Hide resolved

psfinaki reviewed Sep 12, 2023

View reviewed changes

vzarytovskii reviewed Sep 12, 2023

View reviewed changes

src/Compiler/Symbols/FSharpDiagnostic.fs Show resolved Hide resolved

vzarytovskii reviewed Sep 12, 2023

View reviewed changes

src/Compiler/Symbols/FSharpDiagnostic.fs Outdated Show resolved Hide resolved

vzarytovskii reviewed Sep 12, 2023

View reviewed changes

.../FSharp.Compiler.Service.Tests/FSharp.Compiler.Service.SurfaceArea.netstandard20.release.bsl Outdated Show resolved Hide resolved

DedSec256 added 2 commits September 12, 2023 21:08

move extended data to separate module

91df846

fix build

8daea20

psfinaki approved these changes Sep 13, 2023

View reviewed changes

tests/FSharp.Compiler.ComponentTests/ErrorMessages/ExtendedDiagnosticDataTests.fs Show resolved Hide resolved

src/Compiler/Symbols/FSharpDiagnostic.fs Show resolved Hide resolved

Merge branch 'main' into ber.a/diagnosticData

2e9c086

T-Gro enabled auto-merge (squash) September 13, 2023 07:33

DedSec256 added 2 commits September 13, 2023 16:41

Merge branch 'main' into ber.a/diagnosticData

e21df1e

Merge branch 'main' into ber.a/diagnosticData

e32eacf

T-Gro merged commit 4dc1f3a into dotnet:main Sep 13, 2023

allisonchou mentioned this pull request Sep 19, 2023

[Automated] PRs inserted in VS build main-34119.64 #16003

Closed

DedSec256 mentioned this pull request Oct 16, 2023

FcsErrorsStageProcessBase: use FSharpDiagnosticExtendedData JetBrains/resharper-fsharp#568

Merged

T-Gro mentioned this pull request Nov 2, 2023

Optional rich info for FsharpDiagnostics out of FCS to enable easier development of CodeFixes #14288

Closed

edgarfgp mentioned this pull request Jan 22, 2024

Extend FSharpDiagnostic with a property bag for structured relevant data fsharp/fslang-suggestions#1094

Closed

5 tasks

majocha mentioned this pull request Feb 21, 2024

Fix #16708 / Some errors are not reported if any of previous files contain errors #16719

Merged

3 tasks

nojaf mentioned this pull request Mar 5, 2024

Type abbreviation mismatch extended data #16811

Merged

3 tasks

edgarfgp mentioned this pull request Aug 28, 2024

Change format + fix for --richerrors #17614

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add ExtendedData to FSharpDiagnostic #15840

Add ExtendedData to FSharpDiagnostic #15840

DedSec256 commented Aug 22, 2023 •

edited

Loading

vzarytovskii commented Aug 22, 2023

vzarytovskii commented Aug 22, 2023 •

edited

Loading

TheAngryByrd commented Aug 23, 2023

kerams commented Aug 23, 2023

baronfel commented Aug 23, 2023

T-Gro commented Aug 23, 2023 •

edited

Loading

DedSec256 commented Aug 24, 2023 •

edited

Loading

baronfel commented Aug 24, 2023

T-Gro commented Aug 25, 2023

nojaf commented Aug 25, 2023

DedSec256 commented Aug 25, 2023 •

edited

Loading

DedSec256 commented Aug 31, 2023

nojaf commented Sep 5, 2023

vzarytovskii commented Sep 5, 2023 •

edited

Loading

vzarytovskii left a comment

psfinaki commented Sep 11, 2023 •

edited

Loading

psfinaki left a comment

DedSec256 commented Sep 12, 2023

psfinaki left a comment

DedSec256 commented Sep 13, 2023

DedSec256 commented Sep 13, 2023

Add ExtendedData to FSharpDiagnostic #15840

Add ExtendedData to FSharpDiagnostic #15840

Conversation

DedSec256 commented Aug 22, 2023 • edited Loading

vzarytovskii commented Aug 22, 2023

vzarytovskii commented Aug 22, 2023 • edited Loading

TheAngryByrd commented Aug 23, 2023

kerams commented Aug 23, 2023

baronfel commented Aug 23, 2023

T-Gro commented Aug 23, 2023 • edited Loading

DedSec256 commented Aug 24, 2023 • edited Loading

baronfel commented Aug 24, 2023

T-Gro commented Aug 25, 2023

nojaf commented Aug 25, 2023

DedSec256 commented Aug 25, 2023 • edited Loading

DedSec256 commented Aug 31, 2023

nojaf commented Sep 5, 2023

vzarytovskii commented Sep 5, 2023 • edited Loading

vzarytovskii left a comment

Choose a reason for hiding this comment

psfinaki commented Sep 11, 2023 • edited Loading

psfinaki left a comment

Choose a reason for hiding this comment

DedSec256 commented Sep 12, 2023

psfinaki left a comment

Choose a reason for hiding this comment

DedSec256 commented Sep 13, 2023

DedSec256 commented Sep 13, 2023

DedSec256 commented Aug 22, 2023 •

edited

Loading

vzarytovskii commented Aug 22, 2023 •

edited

Loading

T-Gro commented Aug 23, 2023 •

edited

Loading

DedSec256 commented Aug 24, 2023 •

edited

Loading

DedSec256 commented Aug 25, 2023 •

edited

Loading

vzarytovskii commented Sep 5, 2023 •

edited

Loading

psfinaki commented Sep 11, 2023 •

edited

Loading