forked from ontoportal/ncbo_cron
-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Spam deletion script broken in production #60
Comments
Looks like our current Github Authorization token is invalid |
I added some additional handling to the script to fail with a corresponding error is anything other than a successful fetch of the SPAM user list occurs. |
mdorf
added a commit
that referenced
this issue
Apr 11, 2023
mdorf
added a commit
that referenced
this issue
Apr 11, 2023
syphax-bouazzouni
referenced
this issue
in ontoportal-lirmm/ncbo_cron
Dec 27, 2023
…its, and the Most visited pages in the month (#17) * remove forgot variables * fix for #61 - create contact instance if it doesn't exist - changed --from-api to --from-apikey - minor linting * Restore branch specifier to develop * Optimization - remove repeated query * Gemfile.lock update * Gemfile.lock update * Gemfile.lock update * Gemfile had references to develop branch * implemented #64 - ability to generate labels independently of RDF processing (and vise versa) * Gemfile.lock update * fixed a bug in #64 * Relocate docker-compose file and update default configs * Add GH workflow for publishing docker images * use ruby native method for listing files instead of a git function Resolves warning messages when we exclude .git directory from docker image * remove comment * capitalize argument in order to be consistent with other scripts * add arm/64 platform * additional error handling for SPAM deletion script, #60 * additional error handling for SPAM deletion script, #60 * implemented #67 - improved corrupt data and error handling * Gemfile.lock update * exclude test/data/dictionary.txt from git commits * update version of solr-ut * Gemfile.lock update * Restore branch specifier to master * fixed configuration for the analytics module * Gemfile.lock update * implemented #69 - scheduled annotator dictionary file generation should be a configurable option instead of the default * Gemfile.lock update * gem update * create new rake taks for updating purls for all ontologies moved from ontologies_api/fix_purls.rb * initial implementation of #70 - Google Analytics v4 Update Compatibility Issue * added the /data folder to ignore * update gems * Gemfile.lock update * Gemfile.lock update * Gemfile.lock update * use patched version of agraph v7.3.1 * unpin faraday gem * A chnage to reference Analytics Redis from LinkedData block * Gemfile.lock update * Gemfile.lock update * Gemfile.lock update * Gemfile.lock update * use assert_operator instead of assert minitest style guide adherence. encountered an intermittent unit test failure so assert_operator will provide better failure feedback than assert * use local solr to pass the tests * fixed ncbo_ontology_archive_old_submissions error output * Gemfile.lock update * Gemfile.lock update * Gemfile update * Gemfile update * fixes to the analytics script and a new script to generate UA analytics for documentation * Gemfile.lock update * Gemfile.lock update * implemented the first pass at bmir-radx/radx-project#37 * implemented the first pass at bmir-radx/radx-project#37 * set bundler version to be comptatible with ruby 2.7 + AG v8 * refactor ontologies analytics job to handle the new google analytics migration * add user analytics fetching the monthly user visits count * add page visits analytics fetching last month most visited pages * extract google analytics UA import code to a script to make current code clean of it * add option to force submission archiving even if already archived --------- Co-authored-by: Alex Skrenchuk <alexskr@stanford.edu> Co-authored-by: mdorf <mdorf@stanford.edu> Co-authored-by: Jennifer Vendetti <vendetti@stanford.edu>
syphax-bouazzouni
referenced
this issue
in ontoportal-lirmm/ncbo_cron
Dec 28, 2023
…its, and the Most visited pages in the month (#17) * remove forgot variables * fix for #61 - create contact instance if it doesn't exist - changed --from-api to --from-apikey - minor linting * Restore branch specifier to develop * Optimization - remove repeated query * Gemfile.lock update * Gemfile.lock update * Gemfile.lock update * Gemfile had references to develop branch * implemented #64 - ability to generate labels independently of RDF processing (and vise versa) * Gemfile.lock update * fixed a bug in #64 * Relocate docker-compose file and update default configs * Add GH workflow for publishing docker images * use ruby native method for listing files instead of a git function Resolves warning messages when we exclude .git directory from docker image * remove comment * capitalize argument in order to be consistent with other scripts * add arm/64 platform * additional error handling for SPAM deletion script, #60 * additional error handling for SPAM deletion script, #60 * implemented #67 - improved corrupt data and error handling * Gemfile.lock update * exclude test/data/dictionary.txt from git commits * update version of solr-ut * Gemfile.lock update * Restore branch specifier to master * fixed configuration for the analytics module * Gemfile.lock update * implemented #69 - scheduled annotator dictionary file generation should be a configurable option instead of the default * Gemfile.lock update * gem update * create new rake taks for updating purls for all ontologies moved from ontologies_api/fix_purls.rb * initial implementation of #70 - Google Analytics v4 Update Compatibility Issue * added the /data folder to ignore * update gems * Gemfile.lock update * Gemfile.lock update * Gemfile.lock update * use patched version of agraph v7.3.1 * unpin faraday gem * A chnage to reference Analytics Redis from LinkedData block * Gemfile.lock update * Gemfile.lock update * Gemfile.lock update * Gemfile.lock update * use assert_operator instead of assert minitest style guide adherence. encountered an intermittent unit test failure so assert_operator will provide better failure feedback than assert * use local solr to pass the tests * fixed ncbo_ontology_archive_old_submissions error output * Gemfile.lock update * Gemfile.lock update * Gemfile update * Gemfile update * fixes to the analytics script and a new script to generate UA analytics for documentation * Gemfile.lock update * Gemfile.lock update * implemented the first pass at bmir-radx/radx-project#37 * implemented the first pass at bmir-radx/radx-project#37 * set bundler version to be comptatible with ruby 2.7 + AG v8 * refactor ontologies analytics job to handle the new google analytics migration * add user analytics fetching the monthly user visits count * add page visits analytics fetching last month most visited pages * extract google analytics UA import code to a script to make current code clean of it * add option to force submission archiving even if already archived --------- Co-authored-by: Alex Skrenchuk <alexskr@stanford.edu> Co-authored-by: mdorf <mdorf@stanford.edu> Co-authored-by: Jennifer Vendetti <vendetti@stanford.edu>
syphax-bouazzouni
referenced
this issue
in ontoportal-lirmm/ncbo_cron
Dec 28, 2023
…its, and the Most visited pages in the month (#17) * remove forgot variables * fix for #61 - create contact instance if it doesn't exist - changed --from-api to --from-apikey - minor linting * Restore branch specifier to develop * Optimization - remove repeated query * Gemfile.lock update * Gemfile.lock update * Gemfile.lock update * Gemfile had references to develop branch * implemented #64 - ability to generate labels independently of RDF processing (and vise versa) * Gemfile.lock update * fixed a bug in #64 * Relocate docker-compose file and update default configs * Add GH workflow for publishing docker images * use ruby native method for listing files instead of a git function Resolves warning messages when we exclude .git directory from docker image * remove comment * capitalize argument in order to be consistent with other scripts * add arm/64 platform * additional error handling for SPAM deletion script, #60 * additional error handling for SPAM deletion script, #60 * implemented #67 - improved corrupt data and error handling * Gemfile.lock update * exclude test/data/dictionary.txt from git commits * update version of solr-ut * Gemfile.lock update * Restore branch specifier to master * fixed configuration for the analytics module * Gemfile.lock update * implemented #69 - scheduled annotator dictionary file generation should be a configurable option instead of the default * Gemfile.lock update * gem update * create new rake taks for updating purls for all ontologies moved from ontologies_api/fix_purls.rb * initial implementation of #70 - Google Analytics v4 Update Compatibility Issue * added the /data folder to ignore * update gems * Gemfile.lock update * Gemfile.lock update * Gemfile.lock update * use patched version of agraph v7.3.1 * unpin faraday gem * A chnage to reference Analytics Redis from LinkedData block * Gemfile.lock update * Gemfile.lock update * Gemfile.lock update * Gemfile.lock update * use assert_operator instead of assert minitest style guide adherence. encountered an intermittent unit test failure so assert_operator will provide better failure feedback than assert * use local solr to pass the tests * fixed ncbo_ontology_archive_old_submissions error output * Gemfile.lock update * Gemfile.lock update * Gemfile update * Gemfile update * fixes to the analytics script and a new script to generate UA analytics for documentation * Gemfile.lock update * Gemfile.lock update * implemented the first pass at bmir-radx/radx-project#37 * implemented the first pass at bmir-radx/radx-project#37 * set bundler version to be comptatible with ruby 2.7 + AG v8 * refactor ontologies analytics job to handle the new google analytics migration * add user analytics fetching the monthly user visits count * add page visits analytics fetching last month most visited pages * extract google analytics UA import code to a script to make current code clean of it * add option to force submission archiving even if already archived --------- Co-authored-by: Alex Skrenchuk <alexskr@stanford.edu> Co-authored-by: mdorf <mdorf@stanford.edu> Co-authored-by: Jennifer Vendetti <vendetti@stanford.edu>
syphax-bouazzouni
referenced
this issue
in ontoportal/ncbo_cron
Jan 16, 2024
…onward (#2) * add a script to eradicate (delete data+ files) submissions of an ontology * Auto stash before merge of "development" and "master" * omit logs link file * update the eradicator to support the eradication of not archived submissions if wanted * fix the delete submission files to not let behind empty directories * not remove the submission directory beaucse it's already done by the submission.delete * Update Gemfile.lock * Reset branch specifier to develop * extract do_ontology_pull function * some simple code refactor in the ontology_pull * simple code refactor of test_ontology_pull * add a script to do a ontology pull on an ontology on demand * set the name of the new script in $0 * extract new_file_exists? method from do_ontology_pull * save the submission in the RemoteFileException * some automatic code refactor/lint * use the new do_ontology_pull in the old do_remote_ontology_pull * fixed an API call mentioned by @syphax-bouazzouni in ncbo/bioportal-project#254 * fixed an API call mentioned by @syphax-bouazzouni in ncbo/bioportal-project#254 * Gemfile.lock update * bump up version of actions/checkout from v2->v3 * Gemfile.lock update * Merge branch 'develop' * remove forgot variables * GH Actions unit test workflow refactor - add ruby versioning via docker-compose.yml file - bump up ruby v2.6 -> v2.7 - add AllegroGraph backend - add code coverage * Remove extra space * fix for #61 - create contact instance if it doesn't exist - changed --from-api to --from-apikey - minor linting * Restore branch specifier to develop * Optimization - remove repeated query * Gemfile.lock update * Gemfile.lock update * Gemfile.lock update * Gemfile had references to develop branch * implemented #64 - ability to generate labels independently of RDF processing (and vise versa) * Gemfile.lock update * fixed a bug in #64 * Relocate docker-compose file and update default configs * Add GH workflow for publishing docker images * use ruby native method for listing files instead of a git function Resolves warning messages when we exclude .git directory from docker image * remove comment * capitalize argument in order to be consistent with other scripts * add arm/64 platform * additional error handling for SPAM deletion script, #60 * additional error handling for SPAM deletion script, #60 * implemented #67 - improved corrupt data and error handling * Gemfile.lock update * exclude test/data/dictionary.txt from git commits * update version of solr-ut * Gemfile.lock update * Restore branch specifier to master * fixed configuration for the analytics module * Gemfile.lock update * implemented #69 - scheduled annotator dictionary file generation should be a configurable option instead of the default * Gemfile.lock update * gem update * create new rake taks for updating purls for all ontologies moved from ontologies_api/fix_purls.rb * initial implementation of #70 - Google Analytics v4 Update Compatibility Issue * added the /data folder to ignore * update gems * Gemfile.lock update * Gemfile.lock update * Gemfile.lock update * use patched version of agraph v7.3.1 * unpin faraday gem * A chnage to reference Analytics Redis from LinkedData block * Gemfile.lock update * Gemfile.lock update * Gemfile.lock update * Gemfile.lock update * use assert_operator instead of assert minitest style guide adherence. encountered an intermittent unit test failure so assert_operator will provide better failure feedback than assert * fixed ncbo_ontology_archive_old_submissions error output * Gemfile.lock update * Gemfile.lock update * Gemfile update * Gemfile update * fixes to the analytics script and a new script to generate UA analytics for documentation * Gemfile.lock update * Gemfile.lock update * implemented the first pass at bmir-radx/radx-project#37 * implemented the first pass at bmir-radx/radx-project#37 * set bundler version to be comptatible with ruby 2.7 + AG v8 * Gemfile.lock update * Gemfile.lock update --------- Co-authored-by: Jennifer Vendetti <vendetti@stanford.edu> Co-authored-by: mdorf <mdorf@stanford.edu> Co-authored-by: Alex Skrenchuk <alexskr@stanford.edu>
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
The spam deletion script is executing nightly as scheduled, but doesn't appear to be deleting spam anymore.
I added a new account to the list of spam users in this commit, then ran the spam deletion script manually. The script output shows that the user ("buyadderallonline") and the ontology they uploaded (acronym ADDERALL) weren't deleted as expected:
A BioPortal user reported the spam ontology, so I manually deleted it (and also the user account). However, there appear to be a couple of newer spam entries on the Projects page that could be used for testing, e.g. /projects/PAGOMU.
The scheduler-spam-deletion.log file shows no errors.
The text was updated successfully, but these errors were encountered: