-
Notifications
You must be signed in to change notification settings - Fork 4.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
🐛 Source Salesforce: increase CSV field_size_limit #10012
Conversation
Signed-off-by: Sergey Chvalyuk <grubberr@gmail.com>
Signed-off-by: Sergey Chvalyuk <grubberr@gmail.com>
Codecov Report
@@ Coverage Diff @@
## master #10012 +/- ##
=========================================
Coverage ? 83.61%
=========================================
Files ? 7
Lines ? 476
Branches ? 0
=========================================
Hits ? 398
Misses ? 78
Partials ? 0 Continue to review full report at Codecov.
|
/test connector=connectors/source-salesforce
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please add more comments about this csv limitation into the code.
CSV_FIELD_SIZE_LIMIT = 1024 * 1024 | ||
csv.field_size_limit(CSV_FIELD_SIZE_LIMIT) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we set maximum available value here?
And I propose to print a log message about this limitation because this maximum value can be different for every OS type.
data = [ | ||
{"Id": "1", "Name": '"first_name" "last_name"'}, | ||
{"Id": "2", "Name": "'" + 'first_name"\n' + "'" + 'last_name\n"'}, | ||
{"Id": "3", "Name": "first_name last_name"}, | ||
{"Id": "3", "Name": "first_name last_name" + (CSV_FIELD_SIZE_LIMIT - 100) * "e"}, | ||
{"Id": "4", "Name": "first_name last_name"}, | ||
] | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please add a test where we can check that the function csv.field_size_limit
raises this limit really
Signed-off-by: Sergey Chvalyuk <grubberr@gmail.com>
/test connector=connectors/source-salesforce
|
…-csv_field_size_limit
/test connector=connectors/source-salesforce
|
/test connector=connectors/source-salesforce
|
/test connector=connectors/source-salesforce
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
/publish connector=connectors/source-salesforce
|
Signed-off-by: Sergey Chvalyuk <grubberr@gmail.com>
…:airbytehq/airbyte into grubberr/oncall-115-csv_field_size_limit
Signed-off-by: Sergey Chvalyuk grubberr@gmail.com
What
Some users catch CSV exception airbytehq/oncall#115
Increasing LIMIT for CSV field from default
it has to eliminated this Exception.
We need to keep an eye on this solution because it can potentially consume too much RAM.
How
Describe the solution
Recommended reading order
x.java
y.python
🚨 User Impact 🚨
Are there any breaking changes? What is the end result perceived by the user? If yes, please merge this PR with the 🚨🚨 emoji so changelog authors can further highlight this if needed.
Pre-merge Checklist
Expand the relevant checklist and delete the others.
Updating a connector
Community member or Airbyter
airbyte_secret
./gradlew :airbyte-integrations:connectors:<name>:integrationTest
.README.md
bootstrap.md
. See description and examplesdocs/integrations/<source or destination>/<name>.md
including changelog. See changelog exampleAirbyter
If this is a community PR, the Airbyte engineer reviewing this PR is responsible for the below items.
/test connector=connectors/<name>
command is passing./publish
command described here