Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Permission denied when generating parquet data for lab runs #108

Open
steinnymir opened this issue Jan 20, 2022 · 5 comments
Open

Permission denied when generating parquet data for lab runs #108

steinnymir opened this issue Jan 20, 2022 · 5 comments
Assignees
Labels
bug Something isn't working

Comments

@steinnymir
Copy link
Member

Beamtime directories generated for lab data seem to have restricted permissions to few users.
This is absoultely correct, but limits the work that other users (me and Zain for example) can do.

Any suggestions how to solve this?

  • Add permissions for all interested users
  • Add a dummy test beamtime where we can dump test files?
@steinnymir steinnymir added the bug Something isn't working label Jan 20, 2022
@steinnymir
Copy link
Member Author

@zainsohail04 suggested using local directories for test purpouses.
The parquet dir was hardcoded but I added an optional argument to read_data to overwrite this.

now the parquet folder can be arbitrarily selected, and data readout works as intended

@zain-sohail
Copy link
Member

Thanks. I think I will restructure the code because this functionality is also in express dataframe creator class and can be generalized for both.

@kutnyakhov
Copy link

We are currently trying to write parquet in the "raw" folder, but they should go into "processed" folder
/asap3/fs-flash-o/gpfs/hextof/2022/data/11013726/processed
For this folder, everyone should have write permission.

@kutnyakhov
Copy link

One of our last gray days at FLASH, where we were using fs-flash-o/hextof for data storage all parquet files were written into folder
/asap3/fs-flash-o/gpfs/hextof/2021/data/11013350/processed/parquet/per_file

So it was not even trying to write into "raw" folder - could it be that this was changed and now code trying to write into current ("raw") folder?

@zain-sohail
Copy link
Member

When using readData, there is a optional argument for parquet_path. The default is it adds a new parquet dir to the the file path.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants