Load figure caption from the CORD-19 data and add links to PMC. #16

adamjhn · 2020-09-21T15:14:17Z

Currently the links only work with PMC new url format, some papers are still using the old url format.

…ntly the links only work with PMC new url format, some papers are still using the old url format.

ramcdougal

I really like the to_datetime part of the example as that touches on the new functionality and is simple to understand. I wonder if the rest of the load_csv changes introduce too much complexity to be useful as an example, and that what you're really doing is the start of a research project that uses the pipeline (possibly as a sub-module?) but should be in its own repo?

Also, note the comment about fields vs list_fields; we ought to be able to have one list and just do the right thing based on data type.

ramcdougal · 2020-09-21T16:19:11Z

Project/pipeline_views.py

+            field: _nicestr(paper[field])
+            if field in paper["field_order"]
+            else paper[field]
+            for d in ["field_order", "list_field_order"]


Why do field_order and list_field_order need to be separate? I could imagine one might want things with or without lists in any order, not all the non-lists before all the lists.

i.e. can't we figure this out based on data type?

ramcdougal · 2020-09-21T16:46:41Z

examples/load_csv.py

+                    label = caption.split(":")[0]
+                    fignum = int(re.findall("\d+", label)[0])
+                except:
+                    fignum = int(k.lstrip("TABREF")) + 1


I don't think lstrip is what you're looking for here; consider:

>>> 'TARANTULA'.lstrip('TABREF') 'NTULA'

ramcdougal · 2020-09-21T16:48:39Z

Project/pipeline_views.py

@@ -110,12 +110,18 @@ def _nicestr(item):
        if any("," in thing for thing in item):
            joiner = "; "
        return joiner.join(thing for thing in item)
+    elif isinstance(item, datetime.datetime):


This is a great idea.

…ies.

Load figure caption from the CORD-19 data and add links to PMC. Curre…

089e95a

…ntly the links only work with PMC new url format, some papers are still using the old url format.

ramcdougal requested changes Sep 21, 2020

View reviewed changes

Removed list_field_order and made field_order a list of dictionar…

67d9059

…ies.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Load figure caption from the CORD-19 data and add links to PMC. #16

Load figure caption from the CORD-19 data and add links to PMC. #16

adamjhn commented Sep 21, 2020

ramcdougal left a comment

ramcdougal Sep 21, 2020

ramcdougal Sep 21, 2020

ramcdougal Sep 21, 2020

ramcdougal Sep 21, 2020

Load figure caption from the CORD-19 data and add links to PMC. #16

Are you sure you want to change the base?

Load figure caption from the CORD-19 data and add links to PMC. #16

Conversation

adamjhn commented Sep 21, 2020

ramcdougal left a comment

Choose a reason for hiding this comment

ramcdougal Sep 21, 2020

Choose a reason for hiding this comment

ramcdougal Sep 21, 2020

Choose a reason for hiding this comment

ramcdougal Sep 21, 2020

Choose a reason for hiding this comment

ramcdougal Sep 21, 2020

Choose a reason for hiding this comment