Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Optimizing the comparison feature using Clickhouse #11376

Open
alisman opened this issue Feb 5, 2025 · 0 comments
Open

Optimizing the comparison feature using Clickhouse #11376

alisman opened this issue Feb 5, 2025 · 0 comments
Assignees

Comments

@alisman
Copy link
Contributor

alisman commented Feb 5, 2025

When we create a comparison session from a chart, we first need to group the sampleids by the data. Each chart type has it's own mechanism for creating those groups. In case of clinical data charts, it has to fetch all the clinical data for that type in order to make the groups. This is time consuming. All these various mechanisms need to be studied and optimized. (It occurs in the study view, NOT the newly opened comparison page).

Image

In the new opened comparison page, we await the creation of the session in the opening page. When it resolves, we can then load the session in the new page. The main bottlenecks here are the sample endpoint and then the enrichments endpoint.

Image

We have a prototype that uses Clickhouse to improve the enrichments endpoint by 10x.

Note that caching solves the clinical data and samples endpoint latency, at least for single study queries. The enrichments endpoint is the highest priority.

@alisman alisman changed the title Optimizing the comparison feature Optimizing the comparison feature using Clickhouse Feb 5, 2025
@alisman alisman self-assigned this Feb 5, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant