FMRC logfile errors: fmrc:SABGOM_ROMS_Forecast_Model_Run_Collection #3

Open
jbzambon opened this issue Sep 1, 2016 · 9 comments

@jbzambon
Contributor

jbzambon commented Sep 1, 2016

FMRC is throwing errors that reference something called "fmrc:SABGOM_ROMS_Forecast_Model_Run_Collection" instead of pulling the variables listed in the catalog (sabgom_ROMS_nf_catalog.xml), and it reports errors when it can't find some variables (e.g. AKt):

[2016-08-31T19:33:16.585+0000] ERROR ucar.nc2.ft.fmrc.FmrcDataset: buildDataset2D: cant find ugrid variable AKt in collection fmrc:SABGOM_ROMS_Forecast_Model_Run_Collection
result dataset fmrc:SABGOM_ROMS_Forecast_Model_Run_Collection
ntimes
ndtfast
dt
dtfast
dstart
....

@rsignell-usgs
Collaborator

rsignell-usgs commented Sep 5, 2016

@jbzambon, you are running thredds on oceanus "the old way", not using docker, right?
One potential problem I see is a mixture of files owned by root and tomcat under:
/usr/local/tds/apache-tomcat-8.0.32
which is usually caused by someone running tomcat as root by mistake.

When I logged in just now and checked, tomcat was running under user tomcat as it should be, but perhaps it's having trouble reading or writing certain files. Can you please shut down, change all files under tomcat to be owned by user tomcat, and restart?
The commands should be:

sudo -u tomcat /usr/local/tds/apache-tomcat-8.0.32/bin/shutdown.sh
sudo chown -R tomcat:tomcat /usr/local/tds/apache-tomcat-8.0.32
sudo -u tomcat /usr/local/tds/apache-tomcat-8.0.32/bin/startup.sh
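
Before running the recursive chown, it can help to see which files are actually owned by someone other than tomcat. The following is a minimal sketch, not part of the original thread: `list_not_owned_by` is a hypothetical helper name, and it relies only on the standard `find` test `-user` (negated with `!`).

```shell
# Hypothetical helper (name chosen for illustration): print every path
# under a directory that is NOT owned by the given user.
# $1 = directory to scan, $2 = expected owner (user name or numeric uid)
list_not_owned_by() {
    find "$1" ! -user "$2" -print
}

# Example usage with the path from this thread (may need sudo to read
# every subdirectory):
#   list_not_owned_by /usr/local/tds/apache-tomcat-8.0.32 tomcat
```

An empty result after the chown would suggest the ownership mix has been cleaned up.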

@jbzambon
Contributor Author

@rsignell-usgs Thanks! I did this and restarted it.

On a related topic, we've been having trouble with our tomcat server leaking memory through the Java process. Currently, top shows >10 GB of memory usage. We have a similar issue with omgsrv1, and I usually just restart when the usage gets too obscene. Any hints on solving this other than with a restart? Thanks!

3113 tomcat 20 0 10.134g 1.603g 20132 S 0.0 5.1 0:58.53 java
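
When reading a line like the one above, it may be worth separating virtual from resident memory. Assuming the default procps `top` column order (PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND), the 10.134g figure is the virtual size (VIRT) and 1.603g is the resident size (RES). A JVM commonly reserves far more address space than it keeps resident, so a large VIRT alone does not necessarily indicate a leak. The sketch below uses a hypothetical helper name, `top_mem_fields`, to label those columns.

```shell
# Hypothetical helper (name chosen for illustration): read one process
# line from top on stdin and label its memory columns.
# Assumes the default procps column order:
#   PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
top_mem_fields() {
    awk '{ printf "VIRT=%s RES=%s %%MEM=%s\n", $5, $6, $10 }'
}

# Example usage with the line quoted in this thread:
#   echo '3113 tomcat 20 0 10.134g 1.603g 20132 S 0.0 5.1 0:58.53 java' | top_mem_fields
```

Tracking RES over time, rather than VIRT, is usually the more telling signal for growth in actual memory use.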

@kwilcox
Member

kwilcox commented Mar 30, 2017

THREDDS does leak memory and there is no way around it. We run over 10 high-volume THREDDS servers and restart them every 2 days.
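
A periodic restart of this kind could be scripted roughly as follows. This is only a sketch under stated assumptions, not the setup used on the servers mentioned here: `restart_tomcat` is a hypothetical helper, the 10-second default pause is an arbitrary guess at how long the JVM needs to exit, and the only Tomcat pieces it relies on are the `bin/shutdown.sh` and `bin/startup.sh` scripts already used earlier in this thread.

```shell
# Hypothetical helper (name chosen for illustration): stop Tomcat, pause,
# then start it again.
# $1 = Tomcat install directory, $2 = seconds to wait (assumed default: 10)
restart_tomcat() {
    catalina_home="$1"
    wait_seconds="${2:-10}"
    # If shutdown reports an error (for example Tomcat was already down),
    # continue anyway so the server still gets started.
    sh "$catalina_home/bin/shutdown.sh" || echo "shutdown reported an error; continuing" >&2
    sleep "$wait_seconds"
    sh "$catalina_home/bin/startup.sh"
}

# Example usage, run as the tomcat user rather than root:
#   restart_tomcat /usr/local/tds/apache-tomcat-8.0.32
```

Such a function would typically be called from a scheduler such as cron, under the same user that normally runs Tomcat, so that file ownership stays consistent.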

@rsignell-usgs
Collaborator

@lesserwhirls, do you guys restart http://thredds.ucar.edu/thredds/catalog.html regularly (e.g. every night or every few days)?

@rsignell-usgs
Collaborator

BTW, @cwardgar bet me $5 you don't. 😺

@lesserwhirls

@rsignell-usgs, we do not do regular restarts of our TDS, so I guess you need to pay up 💵? @kwilcox we do not see memory leaks on our systems, but we do not have any FMRCs, so that might be why. Do you have any more info on the leaks?

@kwilcox
Member

kwilcox commented Mar 30, 2017

Do you host any NetCDF aggregations on http://thredds.ucar.edu/thredds/catalog.html? From what I gather that has always been the issue. In the ocean world most of our aggregations are NetCDF3/4, not GRIB.

@lesserwhirls

We do not host any netCDF aggregations. I admit it's shameful that our support for GRIB is better than our support for netCDF3/4 👎

We have been talking quite a bit about creating a grid feature collection, much like the GRIB feature collections we have now (grid feature collections would replace FMRC under the hood, but would be more generic than that). This would help in the area of aggregating netCDF files, but I suspect there would still be a need for NcML aggregations. We are getting the new GOES-16 data in real time, and those are netCDF-4 files. I'll set up a test aggregation collection on our test server and keep an eye on the memory usage to see what is going on. Are you using just simple time aggregations?

@rsignell-usgs
Collaborator

rsignell-usgs commented Mar 31, 2017

Fascinating. @lesserwhirls, I can't think of anything else that would be more useful for the IOOS community than to have NetCDF aggregations (both joinExisting and FMRC-type) working efficiently and without leaking memory.

And I'll see @cwardgar in a few short weeks to pay up. 😺
