ByteInputStream does not implement mark/reset #2940

Closed
prb112 opened this issue Nov 4, 2021 · 1 comment
Assignees: prb112
Labels: bug (Something isn't working), bulk-data, P1 (Priority 1 - Must Have)

Comments

@prb112
Contributor

prb112 commented Nov 4, 2021

Describe the bug
The fast Bulk Export implementation uses a custom InputOutputByteStream, whose inner ByteInputStream does not override mark/reset. When the COS client tries to retry a failed upload part it cannot reset the request input stream, and the export fails with a ResetException:

[INFO    ] multiPartUpload: Upload part Error - The request to the service failed with a retryable reason, but resetting the request input stream has failed. See exception.getExtraInfo or debug-level logging for the original failure that caused this retry.;  If the request involves an input stream, the maximum stream buffer size can be configured via request.getRequestClientOptions().setReadLimit(int)
[ERROR   ] bulkexportfastjob[102]
The request to the service failed with a retryable reason, but resetting the request input stream has failed. See exception.getExtraInfo or debug-level logging for the original failure that caused this retry.;  If the request involves an input stream, the maximum stream buffer size can be configured via request.getRequestClientOptions().setReadLimit(int)
[INFO    ] bulkexportfastjob[102] processed 5696 resources in 39.88 seconds (rate=142.8 resources/second)
[INFO    ] FFDC1015I: An FFDC Incident has been created: "com.ibm.cloud.objectstorage.ResetException: The request to the service failed with a retryable reason, but resetting the request input stream has failed. See exception.getExtraInfo or debug-level logging for the original failure that caused this retry.;  If the request involves an input stream, the maximum stream buffer size can be configured via request.getRequestClientOptions().setReadLimit(int) com.ibm.jbatch.container.controller.impl.ChunkStepControllerImpl 342" at ffdc_21.11.04_17.17.28.0.log
[INFO    ] FFDC1015I: An FFDC Incident has been created: "com.ibm.jbatch.container.exception.BatchContainerRuntimeException: com.ibm.cloud.objectstorage.ResetException: The request to the service failed with a retryable reason, but resetting the request input stream has failed. See exception.getExtraInfo or debug-level logging for the original failure that caused this retry.;  If the request involves an input stream, the maximum stream buffer size can be configured via request.getRequestClientOptions().setReadLimit(int) com.ibm.jbatch.container.controller.impl.ChunkStepControllerImpl 994" at ffdc_21.11.04_17.17.28.1.log
[INFO    ] FFDC1015I: An FFDC Incident has been created: "com.ibm.jbatch.container.exception.BatchContainerRuntimeException: com.ibm.cloud.objectstorage.ResetException: The request to the service failed with a retryable reason, but resetting the request input stream has failed. See exception.getExtraInfo or debug-level logging for the original failure that caused this retry.;  If the request involves an input stream, the maximum stream buffer size can be configured via request.getRequestClientOptions().setReadLimit(int) com.ibm.jbatch.container.controller.impl.ChunkStepControllerImpl 982" at ffdc_21.11.04_17.17.28.2.log
[INFO    ] FFDC1015I: An FFDC Incident has been created: "com.ibm.jbatch.container.exception.BatchContainerRuntimeException: com.ibm.cloud.objectstorage.ResetException: The request to the service failed with a retryable reason, but resetting the request input stream has failed. See exception.getExtraInfo or debug-level logging for the original failure that caused this retry.;  If the request involves an input stream, the maximum stream buffer size can be configured via request.getRequestClientOptions().setReadLimit(int) com.ibm.jbatch.container.controller.impl.ChunkStepControllerImpl 681" at ffdc_21.11.04_17.17.28.3.log
[ERROR   ] StepChunkListener: job[bulkexportfastjob/82/863] --- com.ibm.cloud.objectstorage.ResetException: The request to the service failed with a retryable reason, but resetting the request input stream has failed. See exception.getExtraInfo or debug-level logging for the original failure that caused this retry.;  If the request involves an input stream, the maximum stream buffer size can be configured via request.getRequestClientOptions().setReadLimit(int)
com.ibm.cloud.objectstorage.ResetException: The request to the service failed with a retryable reason, but resetting the request input stream has failed. See exception.getExtraInfo or debug-level logging for the original failure that caused this retry.;  If the request involves an input stream, the maximum stream buffer size can be configured via request.getRequestClientOptions().setReadLimit(int)
[INFO    ] Transaction failed - afterCompletion(status = 4)
@NotThreadSafe
public class InputOutputByteStream {
    ...
    private class ByteInputStream extends InputStream {
        // no mark()/reset() overrides; InputStream's default markSupported() returns false
        ...
    }
}

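For reference, a minimal sketch of how mark/reset could be added to a byte-buffer-backed input stream like this one. This is illustrative only, not the project's actual code; the buffer and length fields are assumptions standing in for whatever backing array the outer class holds.

// Illustrative sketch only: an in-memory InputStream that supports mark/reset.
private class ByteInputStream extends InputStream {
    private int posn = 0;   // current read position in the backing buffer
    private int mark = 0;   // position saved by the last mark()

    @Override
    public int read() {
        return posn < length ? (buffer[posn++] & 0xFF) : -1;
    }

    @Override
    public boolean markSupported() {
        return true;        // advertise mark/reset so the COS SDK can rewind on retry
    }

    @Override
    public synchronized void mark(int readLimit) {
        mark = posn;        // readLimit can be ignored for a fully buffered stream
    }

    @Override
    public synchronized void reset() {
        posn = mark;        // rewind so a retried upload re-reads the same bytes
    }
}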
Environment
IBM FHIR Server version: 4.10.0-SNAPSHOT

To Reproduce
Steps to reproduce the behavior:

  1. Set up an S3 export using a system-level export with the Patient resource type:
curl --location --request GET 'https://localhost:9443/fhir-server/api/v4/$export?_outputFormat=application/fhir+ndjson&_type=Patient' \
--header 'X-FHIR-TENANT-ID: default' \
--header 'Content-Type: application/fhir+json' \
--header 'X-FHIR-BULKDATA-PROVIDER: default' \
--header 'X-FHIR-BULKDATA-PROVIDER-OUTCOME: default' \
--header 'Accept: application/fhir+json' \
--header 'Authorization: Basic BBBBBBBB'

Expected behavior
Export should succeed

Additional context
Suspect the transfer of a part to the bucket failed and the COS client attempted to retry, which requires resetting the request input stream. The retry/reset behavior may only be exercised by the latest ibm-cos SDK update.
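As a possible mitigation, the SDK error message itself points at request.getRequestClientOptions().setReadLimit(int). A hedged sketch of raising the read limit when building an upload-part request is below; the class names assume the IBM COS Java SDK mirrors the AWS SDK's UploadPartRequest, and the variables (cosClient, bucketName, objectKey, uploadId, partNumber, partStream, partSizeBytes) are placeholders, not names from this project.

// Sketch of the workaround hinted at in the error message (not the project's actual fix).
UploadPartRequest req = new UploadPartRequest()
    .withBucketName(bucketName)            // placeholder values for illustration
    .withKey(objectKey)
    .withUploadId(uploadId)
    .withPartNumber(partNumber)
    .withInputStream(partStream)
    .withPartSize(partSizeBytes);
// Let the SDK mark/buffer up to the whole part plus one byte so reset() can succeed on retry.
req.getRequestClientOptions().setReadLimit((int) partSizeBytes + 1);
cosClient.uploadPart(req);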

@prb112 prb112 added bug Something isn't working P1 Priority 1 - Must Have labels Nov 4, 2021
@prb112 prb112 added this to the Sprint 2021-15 milestone Nov 4, 2021
@prb112 prb112 self-assigned this Nov 4, 2021
prb112 added a commit that referenced this issue Nov 4, 2021
- Add mark/reset to the InputStream
- Add Test for mark/reset

Signed-off-by: Paul Bastide <pbastide@us.ibm.com>
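The second bullet (a test for mark/reset) could look roughly like the sketch below. The InputOutputByteStream constructor and the inputStream()/outputStream() accessors are assumptions about the class's API, not the actual test that was committed.

// Hypothetical JUnit sketch: verify the stream can be re-read after mark()/reset().
@Test
public void testMarkAndReset() throws Exception {
    InputOutputByteStream iobs = new InputOutputByteStream(16);      // assumed constructor
    iobs.outputStream().write("hello".getBytes(StandardCharsets.UTF_8));

    InputStream in = iobs.inputStream();                              // assumed accessor
    assertTrue(in.markSupported());

    in.mark(Integer.MAX_VALUE);
    byte[] first = in.readAllBytes();
    in.reset();
    byte[] second = in.readAllBytes();
    assertArrayEquals(first, second);                                 // same bytes both times
}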
punktilious added a commit that referenced this issue Nov 5, 2021
ByteInputStream does not implement mark/reset #2940
@d0roppe
Collaborator

d0roppe commented Nov 5, 2021

Verified that this fixed the issue with BulkData Export of a large dataset.

@d0roppe d0roppe closed this as completed Nov 5, 2021