RESTful sample status #3139

wasade · 2021-08-27T16:26:51Z

This PR adds in the ability to query a study for detail on a specific sample or set of samples. The returned details include EBI accession information, and what preparations the sample was observed on. The output is represented in an flat fashion suitable to be fed directly into a DataFrame or DataTable.

cc @dhakim87 @antgonza

antgonza · 2021-08-27T17:09:37Z

Just FYI, moved the base branch to dev and I'm going to restart the builds.

antgonza

Thank you @wasade, some minor comments.

qiita_pet/handlers/rest/study_samples.py

antgonza · 2021-08-27T17:42:16Z

qiita_pet/handlers/rest/study_samples.py

+
+    # cache sample detail for lookup
+    study_samples = set(study.sample_template.keys())
+    sample_accessions = study.sample_template.ebi_sample_accessions


Note that ebi_sample_accessions will return all available samples with Nones where there is no accession; in other words, len(sample_accessions) == len(study_samples)

I dont think that impacts the subsequent use

qiita_pet/handlers/rest/study_samples.py

antgonza · 2021-08-27T17:54:48Z

qiita_pet/handlers/rest/study_samples.py

+    study_samples = set(study.sample_template.keys())
+    sample_accessions = study.sample_template.ebi_sample_accessions
+
+    # cache preparation information that we'll need
+
+    # map of {sample_id: [indices, of, light, prep, info, ...]}
+    sample_prep_mapping = defaultdict(list)
+    pt_light = []
+    for idx, pt in enumerate(study.prep_templates()):
+        pt_light.append((pt.id, pt.ebi_experiment_accessions,
+                         pt.status, pt.data_type()))
+
+        for ptsample in pt.keys():
+            sample_prep_mapping[ptsample].append(idx)
+


My concern with this block is that it will always load all samples, even when len(samples) == 1.

Another way to do this could be to first select which preps have the samples you are looking for and then build the details, something like this:

samples_set = set(samples) # not sure if this is required as its own var. prep_templates = [pt for pt in study.prep_templates() if set(pt) & samples_set] ... for idx, pt in enumerate(prep_templates): ...

Isn't the cost of this the same as it's still necessary to iterate over all preps?

Yes time wise but my concern is the memory (should have said this in my previous message) to store all prep info data in pt_light, in specific due to pt.ebi_experiment_accessions, the other values are pretty small; you can imagine that this can grow a lot for studies like the AGP. However, this is something internal and if you think this is not that large or important we can improve in a future iteration, if it actually becomes a problem.

Okay sounds good, last commit should reduce what's cached

antgonza

@wasade, thank you; looks great! Let's wait for tests ...

coveralls · 2021-08-27T18:26:33Z

Coverage increased (+0.02%) to 91.171% when pulling 67b4fd6 on wasade:rest-sample-status into ab438cc on qiita-spots:dev.

wasade added 3 commits August 27, 2021 08:20

TST: tests for the sample status end points

000a5d1

API: add sample status endpoints

510518e

Defensive assertion on detail maker

b45a0c4

antgonza changed the base branch from master to dev August 27, 2021 17:09

antgonza reviewed Aug 27, 2021

View reviewed changes

wasade added 2 commits August 27, 2021 11:04

Address @antgonza's comments

119d4ed

Limit memory use when caching prep info

67b4fd6

antgonza approved these changes Aug 27, 2021

View reviewed changes

antgonza merged commit 7d4ad95 into qiita-spots:dev Aug 30, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

RESTful sample status #3139

RESTful sample status #3139

Uh oh!

wasade commented Aug 27, 2021

Uh oh!

antgonza commented Aug 27, 2021

Uh oh!

antgonza left a comment

Uh oh!

Uh oh!

antgonza Aug 27, 2021

Uh oh!

wasade Aug 27, 2021

Uh oh!

Uh oh!

antgonza Aug 27, 2021

Uh oh!

wasade Aug 27, 2021

Uh oh!

antgonza Aug 27, 2021

Uh oh!

wasade Aug 27, 2021

Uh oh!

antgonza left a comment

Uh oh!

coveralls commented Aug 27, 2021 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

RESTful sample status #3139

RESTful sample status #3139

Uh oh!

Conversation

wasade commented Aug 27, 2021

Uh oh!

antgonza commented Aug 27, 2021

Uh oh!

antgonza left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

antgonza Aug 27, 2021

Choose a reason for hiding this comment

Uh oh!

wasade Aug 27, 2021

Choose a reason for hiding this comment

Uh oh!

Uh oh!

antgonza Aug 27, 2021

Choose a reason for hiding this comment

Uh oh!

wasade Aug 27, 2021

Choose a reason for hiding this comment

Uh oh!

antgonza Aug 27, 2021

Choose a reason for hiding this comment

Uh oh!

wasade Aug 27, 2021

Choose a reason for hiding this comment

Uh oh!

antgonza left a comment

Choose a reason for hiding this comment

Uh oh!

coveralls commented Aug 27, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

coveralls commented Aug 27, 2021 •

edited

Loading