-
-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
"unknown" publications #1570
Comments
AWG: 3/7/2019: Deprecate Priority-critical. Individuals will tackle and cleanup on their own in time for next month's Issues Meeting. |
|
Just some notes for the cleanup: (1) add DOI at the same time, (2) may need to update author in full citation manually (?). |
That would be amazing. With DOI we can talk to the world, without we can't really successfully talk to ourselves (eg, duplicate publications are still being created).
There are some publications that contain "unknown" in the citation. There is no link between the citation and Agents, so those should be updated. |
I can try to fix the herp review/copeia obvious herp ones. Note that Herp Review, Southwestern Naturalist and older Zootaxa (and possibly older Herp journals) don't have DOIs associated with articles. So don't get rid of those without DOIs! please |
I have fixed all the herps, maybe Chris Conroy or someone wants to fix the mammal ones? They all seem to be MVZ pubs!! |
@atrox10 DOI isn't mandatory but it is REALLY useful - I won't delete anything. Here are some duplicates - they're getting hard to find!
|
Ok thanks, I finished the herps,, only 1 had a DOI. Chris is working on the
mammal unknowns. Can we just delete the dupes?. As long as the don’t have
associated citations? Or does someone need to check them?
On Thu, Mar 7, 2019 at 4:03 PM dustymc ***@***.***> wrote:
@atrox10 <https://github.com/atrox10> DOI isn't mandatory but it is
REALLY useful - I won't delete anything.
Here are some duplicates - they're getting hard to find!
@campmlc <https://github.com/campmlc>
select full_citation from publication where full_citation not like '%Field Notes%' and regexp_replace(full_citation,'[^A-Za-z]','X') in (
select regexp_replace(full_citation,'[^A-Za-z]','X') from publication having count(*)>1 group by regexp_replace(full_citation,'[^A-Za-z]','X')
) order by full_citation;
Fernando Torres-Perez, R. Eduardo Palma, Brian Hjelle, Marcela Ferres, Joseph A. Cook. 2009. Andes virus infections in t
he rodent reservoir and in humans vary across contrasting landscapes in Chile. Infection, Genetics and Evolution 10:820-
825.
Fernando Torres-Perez, R. Eduardo Palma, Brian Hjelle, Marcela Ferres, Joseph A. Cook. 2010. Andes virus infections in t
he rodent reservoir and in humans vary across contrasting landscapes in Chile. Infection, Genetics and Evolution 10:820-
825.
Terry L. Yates. 1984. The role of voucher specimens in mammal collections: characterization and funding responsibilities
. Acta Zoologica 170(2):81-82.
Terry L. Yates. 1985. The role of voucher specimens in mammal collections: characterization and funding responsibilities
. Acta Zoologica 170(2):81-82.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#1570 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AESS8dFSyh5s6pkMvR1eAbZbC5uYu17Hks5vUai9gaJpZM4UoSSR>
.
--
Sent from Gmail Mobile
|
Yes you can delete the dups if they're not used - they're slightly different so it may not be completely arbitrary. Here's some better SQL
Summary:
Maybe we should delete the low-data stuff that nobody's using??? |
I fixed these two. I'll work on MSB related mammal pubs.
…On Thu, Mar 7, 2019 at 5:03 PM dustymc ***@***.***> wrote:
@atrox10 <https://github.com/atrox10> DOI isn't mandatory but it is
REALLY useful - I won't delete anything.
Here are some duplicates - they're getting hard to find!
@campmlc <https://github.com/campmlc>
select full_citation from publication where full_citation not like '%Field Notes%' and regexp_replace(full_citation,'[^A-Za-z]','X') in (
select regexp_replace(full_citation,'[^A-Za-z]','X') from publication having count(*)>1 group by regexp_replace(full_citation,'[^A-Za-z]','X')
) order by full_citation;
Fernando Torres-Perez, R. Eduardo Palma, Brian Hjelle, Marcela Ferres, Joseph A. Cook. 2009. Andes virus infections in t
he rodent reservoir and in humans vary across contrasting landscapes in Chile. Infection, Genetics and Evolution 10:820-
825.
Fernando Torres-Perez, R. Eduardo Palma, Brian Hjelle, Marcela Ferres, Joseph A. Cook. 2010. Andes virus infections in t
he rodent reservoir and in humans vary across contrasting landscapes in Chile. Infection, Genetics and Evolution 10:820-
825.
Terry L. Yates. 1984. The role of voucher specimens in mammal collections: characterization and funding responsibilities
. Acta Zoologica 170(2):81-82.
Terry L. Yates. 1985. The role of voucher specimens in mammal collections: characterization and funding responsibilities
. Acta Zoologica 170(2):81-82.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#1570 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AOH0hBI_e3sGYuexpMMSCzVXzVHRCF1Hks5vUai9gaJpZM4UoSSR>
.
|
All MVZ unknown author pubs fixed, so no unknown as author for MVZ now! (yay Carol and Chris!) |
see what I can magic for no-authors |
Elapsed: 00:00:07.28 R PUBLICATION_ID FULL_CITATIONGUID_PREFIXunknown_title 10005108 unknown_title 10005392 unknown_title 10005066 unknown_title 10005106 unknown_title 10004891 unknown_title 10005188 unknown_title 10005061 unknown_title 10005295 unknown_title 10005111 unknown_title 10005245 unknown_title 10005026 unknown_title 10005136 unknown_title 10005382 unknown_title 10005163 unknown_title 10005174 unknown_title 10005112 unknown_title 10004997 unknown_title 10005222 unknown_title 10005433 unknown_title 10005178 unknown_title 10005437 unknown_title 10004628 unknown_title 10005137 unknown_title 10005152 unknown_title 10004930 unknown_title 10005169 unknown_title 10004838 unknown_title 10005359 |
UTEP Authors added. Will get a student to work on this next week - I have someone in mind.... |
I added the journal for this:
unknown_title 10005382
David Smith. 1989. The sawfly genus Arge (Hymenoptera; Argidae) in the
Western Hemisphere. unknown 115(2):83-205.
…On Thu, Mar 14, 2019 at 11:56 AM Teresa Mayfield-Meyer < ***@***.***> wrote:
UTEP Authors added. Will get a student to work on this next week - I have
someone in mind....
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
<#1570 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AIraMwS02wf51uP-p-5Acpf3Ev9doJFmks5vWql7gaJpZM4UoSSR>
.
--
+++++++++++++++++++++++++++++++++++
Derek S. Sikes, Curator of Insects
Professor of Entomology
University of Alaska Museum
1962 Yukon Drive
Fairbanks, AK 99775-6960
dssikes@alaska.edu
phone: 907-474-6278
FAX: 907-474-5469
University of Alaska Museum - search 400,276 digitized arthropod records
http://arctos.database.museum/uam_ento_all
<http://www.uaf.edu/museum/collections/ento/>
+++++++++++++++++++++++++++++++++++
Interested in Alaskan Entomology? Join the Alaska Entomological
Society and / or sign up for the email listserv "Alaska Entomological
Network" at
http://www.akentsoc.org/contact_us <http://www.akentsoc.org/contact.php>
|
suggest two triggers
The forms already "require" one author - that's about the best we can do with that. |
Current data:
|
trigger created to disallow agent_id=0 in publication_agent |
Publication trigger now contains
|
I now find 447 publications without authors or with 'unknown' in the title - CSV attached. temp_funky_publications.csv.zip This was to happen by April 2019 - can we try something other than #1570 (comment) now? Summary:
Postgres SQL:
|
I fixed the MSB:Mamm ones. To make this process easier, I suggest making
the required "author" role be a yellow field, like all the other required
fields in the Edit Publications form. Otherwise, there is a cryptic error
message.
Do we have a standard process for dealing with pubs that cannot be linked
to dois, except for adding a remark?
…On Wed, Apr 28, 2021 at 10:07 AM dustymc ***@***.***> wrote:
* [EXTERNAL]*
I now find 447 publications without authors or with 'unknown' in the title
- CSV attached.
temp_funky_publications.csv.zip
<https://github.com/ArctosDB/arctos/files/6393222/temp_funky_publications.csv.zip>
This was to happen by April 2019 - can we try something other than #1570
(comment)
<#1570 (comment)>
now?
Summary:
r | guid_prefix | count
---------------+-------------+-------
no_authors | DMNS:Mamm | 1
no_authors | MSB:Para | 1
no_authors | KNWR:Ento | 1
no_authors | UAM:Bird | 7
no_authors | KWP:Ento | 2
no_authors | UAM:Mamm | 7
no_authors | UWBM:Herp | 1
no_authors | | 127
no_authors | UAM:Ento | 38
no_authors | UAM:Herb | 1
unknown_title | | 24
no_authors | UTEP:Herp | 4
no_authors | MVZ:Mamm | 7
no_authors | UCM:Mamm | 1
no_authors | DMNS:Bird | 2
no_authors | MSB:Mamm | 6
no_authors | UTEP:Herb | 1
no_authors | UAM:EH | 3
no_authors | MLZ:Bird | 47
no_authors | UAMObs:Ento | 165
no_authors | UAM:Inv | 1
Postgres SQL:
select
'unknown_title' r,
publication.publication_id,
full_citation,
guid_prefix
from
publication
left outer join citation on publication.publication_id=citation.publication_id
left outer join cataloged_item on citation.collection_object_id=cataloged_item.collection_object_id
left outer join collection on cataloged_item.collection_id=collection.collection_id
where
lower(full_citation) like '%unknown%' group by publication.publication_id,full_citation, guid_prefix
union
select 'agent_zero' r,
publication.publication_id,
full_citation,
guid_prefix
from
publication
inner join publication_agent on publication.publication_id=publication_agent.publication_id
left outer join citation on publication.publication_id=citation.publication_id
left outer join cataloged_item on citation.collection_object_id=cataloged_item.collection_object_id
left outer join collection on cataloged_item.collection_id=collection.collection_id
where
agent_id=0
group by publication.publication_id,full_citation, guid_prefix
union
select
'no_authors' r,
publication.publication_id,
full_citation,
guid_prefix
from publication
left outer join citation on publication.publication_id=citation.publication_id
left outer join cataloged_item on citation.collection_object_id=cataloged_item.collection_object_id
left outer join collection on cataloged_item.collection_id=collection.collection_id
where
publication.publication_id not in (select publication_id from publication_agent)
;
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#1570 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ADQ7JBCW33LA3PXPB2YSOJTTLAXFFANCNFSM4FFBESIQ>
.
|
Excellent, here's new data. temp_funky_publications(1).csv.zip And the summary
I don't think so, other than https://handbook.arctosdb.org/documentation/publications.html#doi
I'll update. |
MSB Para fixed.
…On Wed, Apr 28, 2021 at 10:40 AM dustymc ***@***.***> wrote:
* [EXTERNAL]*
fixed
Excellent, here's new data.
temp_funky_publications(1).csv.zip
<https://github.com/ArctosDB/arctos/files/6393407/temp_funky_publications.1.csv.zip>
And the summary
r | guid_prefix | count
---------------+-------------+-------
no_authors | KNWR:Ento | 1
no_authors | UAMObs:Ento | 165
no_authors | UAM:Herb | 1
no_authors | UAM:EH | 3
unknown_title | | 24
no_authors | UCM:Mamm | 1
no_authors | MLZ:Bird | 47
no_authors | UAM:Ento | 38
no_authors | UWBM:Herp | 1
no_authors | MVZ:Mamm | 3
no_authors | MSB:Para | 1
no_authors | | 127
no_authors | UTEP:Herb | 1
no_authors | UAM:Mamm | 7
no_authors | DMNS:Mamm | 1
no_authors | UAM:Bird | 7
no_authors | UAM:Inv | 1
no_authors | UTEP:Herp | 4
no_authors | KWP:Ento | 2
no_authors | DMNS:Bird | 2
standard process f
I don't think so, other than
https://handbook.arctosdb.org/documentation/publications.html#doi
yellow
I'll update.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#1570 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ADQ7JBEGDWBOLJW2NMGRAGTTLA3A7ANCNFSM4FFBESIQ>
.
|
MVZ mammals fixed, but found a duplicate. Both of these, Miguel Camacho Sanchez. 2017. Evolution in Sundaland: . One had an author, but no citations, the other citation but no author. both have authors and cited specimens now. What's the best way to delete one of them? |
If there are few citations you can just manually delete from one and add to the other. If there are many let me know and I'll figure it out. Once there are no dependencies you should be able to delete the publication. |
There was only one each. I'll delete a citation and try to delete the pub and see how that goes. |
Fixed UTEP and MSB Para - all the rest are NOT associated with a GUID Prefix that I can see. |
Is there anything else we need to do here? What I find today are about 390 no-author pubs and 24 with some form of "unknown" in the title. |
Fixed some of the unknown titles. |
The "unknown" look like mostly MSB and easy (??) fixes. I'm not sure what to do with the no-authors - call it good and close? They'll probably eventually get authors since the UI is requiring that to save.
|
I've been working on those, but it was wearing me out. Some of the older ones I cannot find. Once I get down to things I can't resolve with Google I'll post here.
I was thinking of trying to get a list of the authors from the full citation and adding them in bulk, but many of them include multiple authors and almost all of them format the author names as "Last, F.M." which is drag when the multiple authors are also separated by commas. Given that - I think I agree with you.... |
Here is what's left:
|
OK to close? |
Suggest we clean up or delete these publications which contain "unknown" or use agent unknown.
The text was updated successfully, but these errors were encountered: