-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
writer: Events get discarded on raw datastore IO errors #29
Comments
This means we lost several days of raw data.. |
The permissions of files incorrectly being owned by root have been fixed? I guess this only affects 'old' data being uploaded now for days during which root was active and was the one to initially create the datafile. So only several days of raw data for specific stations are lost, and perhaps a partial day for all stations for the day on which the writer was restarted as running under |
Permissions have been fixed. However, as these things go, some failures and maintenance added up and we lost about a week of data for all active stations:
|
Yes, that sums it up. On Friday (after moving and restart) I found out we didn't have NFS access anymore because all stations gave errors. I fixed that and all stations were correctly pushing data to frome. I started the writer as user www and everything seemed to work. Unfortunately I didn't check the writer log.. I don't really remember anymore why we decided to check the log yesterday but luckily we did. |
Pity! But after a restart, always check the logs, ;-)!
|
Today we discovered the writer had been erroneously running as
root
lately, thus creating raw datastore hdf5 fileschown root.root
. A few days ago frome was physically moved to a new location and the server was restarted. The writer was restarted as userwww
(as specified in the docs).The writer running as user
www
could not write to the raw data store. All data was dropped:The code that generates this error:
https://github.com/HiSPARC/datastore/blob/master/writer/store_events.py#L127L148
When
store_events.store_event_list
is unsuccesful, we still remove the incoming pickled data from thepartial
folder!Solution: Only remove the pickle if
process_data
is succesful: https://github.com/HiSPARC/datastore/blob/master/writer/writer_app.py#L73The text was updated successfully, but these errors were encountered: