Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Configure Bulkrax #268

Open
Tracked by #264
jillpe opened this issue Sep 16, 2024 · 3 comments
Open
Tracked by #264

Configure Bulkrax #268

jillpe opened this issue Sep 16, 2024 · 3 comments
Assignees
Labels

Comments

@jillpe
Copy link

jillpe commented Sep 16, 2024

No description provided.

@bkiahstroud
Copy link
Contributor

bkiahstroud commented Sep 27, 2024

Attempting to import the sample CSV that the client provided caused the importer to fail with this error:

CSV::InvalidEncodingError - Invalid byte sequence in UTF-8 in line 3.

After some digging, I'm pretty sure the file is encoded in WINDOWS-1252 (primarily due to seeing <92> in vim instead of '). Running the following seems to have resolved the importer error:

iconv -f WINDOWS-1252 -t UTF-8 input.csv > output.csv

[UPDATE] -- We'll need to override this Bulkrax::CsvEntry option from utf-8 to WINDOWS-1252:UTF-8. This will read the file as WINDOWS-1252 encoded and then transcode it to UTF-8 (i.e. the default internal encoding for Ruby)

[UPDATE 2] -- The client was able to encode their CSVs as UTF-8, so this is no longer necessary

@bkiahstroud bkiahstroud moved this from In Development to Code Review in Western Carolina University - HykuUp Nov 20, 2024
@bkiahstroud bkiahstroud closed this as completed by moving to Code Review in Western Carolina University - HykuUp Nov 20, 2024
@bkiahstroud bkiahstroud reopened this Nov 20, 2024
@bkiahstroud
Copy link
Contributor

While configuring Bulkrax's field mappings using the CSVs provided by the client, it became apparent that WCU's field mappings conflict with other existing HykuUP tenants. Because of this, @orangewolf and I decided that the best short-term1 solution was to add a new Hyku Account setting that would allow for per-tenant field mappings. That Hyku PR can be found here:

Footnotes

  1. A better long-term solution would be if Bulkrax allowed you to edit a given importer's field mappings. That feature would live entirely within Bulkrax and thus wouldn't have to "bleed" into Hyku. However, that's too much of a lift for the scope of this ticket

@bkiahstroud
Copy link
Contributor

The Hyku feature that allows for per-tenant field mappings has been merged into Hyku main. The next steps for this ticket are to:

  1. Update the submodule in HykuUP knapsack to latest Hyku main
  2. Deploy to prod
  3. Configure the WCU and UNCA tenant field mappings

The Mobius field mappings should also be updated and ensured that they still work

@bkiahstroud bkiahstroud moved this from Code Review to In Development in Western Carolina University - HykuUp Nov 21, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
Development

No branches or pull requests

4 participants