Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Concat iqueryable too many times results on StackOverflowException when ToList is called #21580

Closed
xaviergxf opened this issue Jul 10, 2020 · 7 comments

Comments

@xaviergxf
Copy link

xaviergxf commented Jul 10, 2020

This code produces an StackOverflowException when ToList is called on IQueryable. It seems that the expression tree is too long, with too many inner expressions and the generation of sql is done recursively.

Steps to reproduce

var query = dbContext.Users.Take(0).AsQueryable();
foreach(var user in users)
{
    query = query.Concat(dbContext.Users.Where(p=>p.Username == user.Username && p.Email == user.Email));
}
var result = await query.ToList();

When calling ToList() it throws an StackOverflowException.

Further technical details

EF Core version:
Database provider: Microsoft.EntityFrameworkCore.SqlServer
Target framework: .NET Core 3.1
Operating system: Windows 10 1909
IDE: Visual Studio 2019 16.6.3

@ajcvickers
Copy link
Contributor

/cc @roji

@roji
Copy link
Member

roji commented Jul 10, 2020

@xaviergxf how many users are being iterated over here? Dynamically generating queries with query nodes being proportional to data in this way is inherently not a good idea.

The specific query above looks like it could be easily rewritten to use OR conditions instead of Concat - that should probably make the StackOverflowException disappear - but is still probably going to be a bad idea from a perf perspective.

@xaviergxf
Copy link
Author

@roji there's thousands of users... I i thought about rewriting it with or, but i would not get a good perf and i would need to use LinqKit for this. Is there anything planed with EF Core to enable a scenario like this one?

@roji
Copy link
Member

roji commented Jul 13, 2020

I i thought about rewriting it with or, but i would not get a good perf and i would need to use LinqKit for this

I highly doubt that Concat would give you better performance than conditions with Or - be sure to measure that carefully (by writing a test raw SQL query and measuring with BenchmarkDotNet), you may be surprised. You also don't require LinqKit to produce dynamic queries like this - you may construct them yourself via System.Linq.Expressions.

However, stepping back, for thousands of users, I'd strongly suggest reviewing your general design beyond this specific query. Regardless of whether you use Concat or conditions with Or, databases and SQL are generally not geared towards massive dynamic queries. Each query would have a unique SQL, and therefore no plans would be cached, etc.

If you can change your query to only check one column (e.g. compare Username only, instead of Username and Email as above), you should be able to use Contains in your query, which EF Core will translate to an SQL IN expression. This also will not provide good perf for thousands of dynamic values, but we at least have some plans to improve that in the future (i.e. #13617). However, a larger redesign may obviate dynamic queries entirely, providing optimal perf. I don't have any context on what you're doing so I can't give further advice...

@ajcvickers
Copy link
Contributor

@smitpatel Were you referring to #11799 or #14661?

@smitpatel
Copy link
Contributor

Both.

@ajcvickers ajcvickers reopened this Oct 16, 2022
@ajcvickers ajcvickers closed this as not planned Won't fix, can't repro, duplicate, stale Oct 16, 2022
@Bouke
Copy link

Bouke commented Jan 9, 2025

If you need to query for a lot of separate conditions, AggregateBalanced from LinqKit can also be helpful. Instead of producing a deep expression like (((A OR B) OR C) OR D) it'll produce something like ((A OR B) OR (C OR D)). That way you'll workaround the StackOverflowException, but probably run into some other limitation. For me it was the 65k parameter limit of Postgres. Also beware that dynamically constructing queries like this will pollute the query cache.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

5 participants