Loading multiple excel sheets #110
roll commented Sep 21, 2019
- connects BCODMO/frictionless-usecases#23 (Excel parser handle regular expression sheet name)
Pull Request Test Coverage Report for Build 363
💛 - Coveralls |
@roll I have no special comment about the implementation. However, I think it's not the right location. Imagine the scenario of a big Excel file with many sheets stored somewhere online. If the user uses this feature, the file will be downloaded n+1 times (where n is the number of sheets), and each sheet will be read and have its schema inferred, all to get one sheet only. A tabulator-based solution could look like this - we create a wdyt? |
@akariv The Anyway, @cschloer, I would say that, for now, I see that the only option is to use this code in your custom BCO-DMO processor ( |
Hmm okay. I've been resisting overwriting the dataflows |
Just throwing this out here: I was able to use the xlrd open_workbook function with
|
@cschloer BTW, is it possible to run this code on the BCO-DMO side? And then just use the standard |
I'm closing it for now as |
Hey, sorry, I didn't understand that xlrd_open_workbook() only works on local files. You are totally correct! This is not too much of a limitation for us, as most of our files come from local paths, as you said. I've already added this to my own load processor and am able to keep using the dataflows load, so I am good to go 👍 |
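For readers following along, here is a minimal sketch of the sheet-selection step in the local-file approach described above. It is not the code from this PR or from the BCO-DMO processor. The helper `select_sheets` and the sheet names are hypothetical; with xlrd, the names could presumably be read once from a local file via `xlrd.open_workbook(path, on_demand=True).sheet_names()`.

```python
import re


def select_sheets(sheet_names, pattern):
    """Return the sheet names that fully match the regular expression."""
    regex = re.compile(pattern)
    return [name for name in sheet_names if regex.fullmatch(name)]


# Hypothetical sheet names, standing in for what a single workbook open
# would report; no Excel file is read in this sketch.
names = ["stations", "samples_2018", "samples_2019", "notes"]
matched = select_sheets(names, r"samples_\d{4}")
```

Each matched name could then be passed to the standard load step one sheet at a time, so the workbook only needs to be opened once to discover the names.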
@roll We are moving our infrastructure to use remote files (S3 URLs) instead of local file paths, so the solution I originally made does not work (or rather, it loads the file n+1 times). Do you think we could revisit the solution suggested by @akariv of handling multiple Excel sheets within tabulator? |
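One way to picture the n+1 problem with remote files is a fetch-once cache: download the workbook a single time to a local temporary path and reuse that path for every sheet. The sketch below is only an illustration under assumptions, not the tabulator-level solution discussed in this thread. `fetch_once`, `fake_fetch`, and the URL are hypothetical, and the fetcher is injected so no real network or S3 access takes place.

```python
import os
import tempfile


def fetch_once(url, fetch, cache):
    """Return a local path for url, calling fetch(url) at most once per url."""
    if url not in cache:
        data = fetch(url)  # bytes of the remote workbook
        suffix = os.path.splitext(url)[1]  # keep the extension, e.g. ".xlsx"
        fd, path = tempfile.mkstemp(suffix=suffix)
        with os.fdopen(fd, "wb") as handle:
            handle.write(data)
        cache[url] = path
    return cache[url]


calls = []


def fake_fetch(url):
    """Stand-in for a real download; records each call."""
    calls.append(url)
    return b"not a real workbook"


cache = {}
url = "s3://example-bucket/data.xlsx"  # hypothetical location
# Simulate one lookup of the sheet names plus three per-sheet loads.
paths = [fetch_once(url, fake_fetch, cache) for _ in range(4)]
```

Here the fetcher is called once even though the path is requested four times; the temporary file would need to be removed by the caller when processing finishes.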