[Timebox] fix: performance degradation of flexible back-end with many workspaces #2722

magrinj · 2023-11-27T10:21:09Z

Scope & Context

The flexible backend was officially released on November 24th. As the production database accumulates more workspaces, we've observed performance degradation that grows with the number of workspaces. The issue appears to be tied to pg_graphql and the way we currently use it.

Current behavior

Currently, we assign a dedicated data source to each workspace. This approach defeats pg_graphql's caching mechanism, so the GraphQL schema is regenerated frequently. Each regeneration iterates over all database schemas, and because we create a separate database schema for each workspace, the cost of every regeneration grows with the number of workspaces.

Expected behavior

The preferred solution is to use a single data source for all workspaces and to create a new data source only when querying foreign objects (a feature that is not yet implemented). We still need to investigate how pg_graphql works internally, which may lead to a better way of generating and versioning the GraphQL schema based on the database schema.
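To make the difference between the two approaches concrete, here is a minimal sketch. It is not the project's actual code: `Connection`, `ConnectionRegistry` and the strategy names are hypothetical stand-ins, and a real implementation would wrap an actual database connection rather than a counter.

```typescript
// Hypothetical stand-in for a real database connection or data source.
interface Connection {
  id: number;
}

type Strategy = "per-workspace" | "shared";

// Hands out connections and counts how many were created, so the two
// strategies can be compared side by side.
class ConnectionRegistry {
  private readonly strategy: Strategy;
  private readonly connections = new Map<string, Connection>();
  private created = 0;

  constructor(strategy: Strategy) {
    this.strategy = strategy;
  }

  // Returns the connection for a workspace, creating one only if no
  // connection exists yet for the lookup key.
  getConnection(workspaceId: string): Connection {
    // With the shared strategy every workspace maps to the same key.
    const key = this.strategy === "shared" ? "shared" : workspaceId;
    let connection = this.connections.get(key);
    if (connection === undefined) {
      this.created += 1;
      connection = { id: this.created };
      this.connections.set(key, connection);
    }
    return connection;
  }

  get createdCount(): number {
    return this.created;
  }
}

// Usage: three workspaces under each strategy.
const perWorkspaceRegistry = new ConnectionRegistry("per-workspace");
const sharedRegistry = new ConnectionRegistry("shared");
for (const workspaceId of ["ws-1", "ws-2", "ws-3"]) {
  perWorkspaceRegistry.getConnection(workspaceId);
  sharedRegistry.getConnection(workspaceId);
}
console.log(perWorkspaceRegistry.createdCount); // 3
console.log(sharedRegistry.createdCount); // 1
```

The sketch only models how many data sources exist. How queries would be scoped to a given workspace's database schema on a shared connection is left open here, as it is in the issue.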

Technical inputs

Data Source Optimization: Explore consolidating the per-workspace data sources into a single shared data source. The goal is to reduce the load on pg_graphql by cutting down the number of schema generations.
Schema Caching Strategy: Investigate alternative caching strategies for pg_graphql, such as a caching mechanism that does not require the schema to be regenerated for each workspace.
Performance Profiling: Profile pg_graphql with multiple database schemas to pinpoint the specific bottlenecks. Tools such as EXPLAIN ANALYZE in PostgreSQL may be useful for this analysis.
Code Review and Refactoring: Review the current pg_graphql integration code for inefficient practices or other issues that could be contributing to the performance degradation.
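
As one possible direction for the caching and versioning ideas above, the following sketch caches a generated schema per workspace and regenerates it only when a version number changes. It is an application-side illustration, not a description of pg_graphql's internal cache: `VersionedSchemaCache`, the `generate` callback and the version counter are all hypothetical.

```typescript
// Caches one generated schema per workspace, keyed by a version number
// that the caller bumps whenever the workspace's database schema changes.
class VersionedSchemaCache<T> {
  private readonly entries = new Map<string, { version: number; schema: T }>();
  private readonly generate: (workspaceId: string) => T;
  private generations = 0;

  // `generate` is a hypothetical callback that builds the schema for a workspace.
  constructor(generate: (workspaceId: string) => T) {
    this.generate = generate;
  }

  // Returns the cached schema when the version matches; otherwise
  // regenerates it and replaces the cached entry.
  get(workspaceId: string, version: number): T {
    const cached = this.entries.get(workspaceId);
    if (cached !== undefined && cached.version === version) {
      return cached.schema;
    }
    this.generations += 1;
    const schema = this.generate(workspaceId);
    this.entries.set(workspaceId, { version, schema });
    return schema;
  }

  // Number of times a schema was actually generated.
  get generationCount(): number {
    return this.generations;
  }
}

// Usage: repeated reads at the same version hit the cache.
const schemaCache = new VersionedSchemaCache<string>(
  (workspaceId) => `schema for ${workspaceId}`,
);
schemaCache.get("ws-1", 1); // generated
schemaCache.get("ws-1", 1); // served from cache
schemaCache.get("ws-1", 2); // version changed, regenerated
console.log(schemaCache.generationCount); // 2
```

Keying on an explicit version keeps regeneration tied to actual schema changes rather than to which workspace happens to be queried, which is the behavior the caching item is after.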

charlesBochet · 2023-11-27T13:35:39Z

In scope for this ticket: #2190

charlesBochet · 2023-11-27T13:46:33Z

Scope of this ticket:

Understand the issue and have an idea of how to solve it.
Decide whether to keep one data source per workspace.

github-project-automation bot added this to 🎯 Roadmap & Sprints Nov 27, 2023

github-project-automation bot moved this to 🆕 New in 🎯 Roadmap & Sprints Nov 27, 2023

magrinj moved this from 🆕 New to 🔖 Planned in 🎯 Roadmap & Sprints Nov 27, 2023

magrinj self-assigned this Nov 27, 2023

magrinj added the scope: backend Issues that are affecting the backend side only label Nov 27, 2023

charlesBochet added the type: chore label Dec 1, 2023

magrinj moved this from 🔖 Planned to 🏗 In progress in 🎯 Roadmap & Sprints Dec 20, 2023

magrinj linked a pull request Jan 2, 2024 that will close this issue

fix: pg_graphql performance #3204

Merged

magrinj moved this from 🏗 In progress to ✅ Done in 🎯 Roadmap & Sprints Jan 2, 2024

FelixMalfait closed this as completed Jan 7, 2024
