Consider having Ingest run on separate server / Modularize #2331

Closed
scolapasta opened this issue Jul 9, 2015 · 11 comments
Labels
Component: Code Infrastructure formerly "Feature: Code Infrastructure"

Comments

@scolapasta
Contributor

No description provided.

@scolapasta scolapasta added the Component: Code Infrastructure formerly "Feature: Code Infrastructure" label Jul 9, 2015
@bencomp
Contributor

bencomp commented Jul 14, 2015

@scolapasta what? Why?

@landreev
Contributor

@bencomp: We always wanted to make the application more modular, so that individual pieces/components could run independently and, if necessary, on different servers, to deal with load issues and such.
This ticket is for making ingest such an independent service, so that it could run on another host, similarly to how Solr and Postgres are integrated into the db.

This doesn't mean it will have to run on a separate server.

Ingest jobs can potentially be expensive in terms of both memory and CPU cycles. So if ingest makes the rest of the app run slowly, this could be a simple solution: just move it onto another server and let it chew up all the CPU time and memory there.

@landreev
Contributor

@bencomp : (I meant to say "... are integrated into the application")

@mercecrosas mercecrosas modified the milestone: In Review Nov 30, 2015
@pdurbin
Member

pdurbin commented Jan 25, 2016

@landreev check out the comment by @wbuchanan about how he has built https://github.com/wbuchanan/data-transfer processing Stata files in Java: #2883 (comment)

@scolapasta scolapasta removed this from the Not Assigned to a Release milestone Jan 28, 2016
@landreev landreev removed their assignment Jan 29, 2016
@pdurbin pdurbin removed the zTriaged label Jun 29, 2017
@pdurbin
Member

pdurbin commented Dec 12, 2017

I just went to a meeting with @jonc1438 and he mentioned that @akio-sone is working on something related to this. He's factoring out the ingest code from Dataverse to allow it to run as a standalone app, from what I understand.

@pdurbin
Member

pdurbin commented May 24, 2018

As I work on #2301 I feel like it would be nice if our Stata parser were a standalone app or library that I could use from Dataverse.

@djbrooke
Contributor

djbrooke commented Oct 2, 2019

Closing this, as making the code modular will be part of the work for TRSAs. We can re-evaluate after that. As far as setting it up to run on a separate server, there may be some options for ECS/Fargate for those installations running on AWS.

@djbrooke djbrooke closed this as completed Oct 2, 2019
@pdurbin
Member

pdurbin commented Oct 2, 2019

@djbrooke cool but heads up that @akio-sone has apparently already started working on this. The other day @donsizemore said "note that he carved out the ingest functionality so TRSA wants to talk to its corresponding fork of Dataverse: https://github.com/OdumInstitute/dataverse/tree/trsa-api " at http://irclog.iq.harvard.edu/dataverse/2019-09-06#i_104872

The plot thickens. @akio-sone let's use Quarkus for the ingest microservice! 😄 I have some notes at http://wiki.greptilian.com/java/quarkus/ 😄

@djbrooke
Contributor

djbrooke commented Oct 2, 2019

Yes, that's what I was trying to express in my comment as I closed.

@qqmyers
Member

qqmyers commented Nov 13, 2020

Also a connection with UCLA's interest in geodata processing, i.e. https://groups.google.com/g/dataverse-community/c/XKC3wwssniM/m/7lkhgnQfBAAJ .

@djbrooke djbrooke changed the title Consider having Ingest run on separate server Consider having Ingest run on separate server / Modularize Apr 14, 2021
@djbrooke djbrooke closed this as completed May 5, 2021
@pdurbin
Member

pdurbin commented Feb 3, 2022

See also:

7 participants