add support for streaming uploads #8

Open · wants to merge 1 commit into master
Conversation

fabian-paul

Hi all,

I happen to use streaming uploads with requests and curl a lot (each on its own, but not in combination), so I felt that adding support for the data access protocols that requests supports would be a useful contribution to this cool project.
The PR isn't ready yet; I hope to finish it soon. There are still a few sections that I have marked with TODO comments. I'm also not completely sure how the Content-Length header and libcurl's INFILESIZE_LARGE option interact. It would be nice to have your opinion on this PR.
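
For reference, the kind of streaming bodies requests accepts looks roughly like this (URL and filename are just placeholders):

import requests

def chunks():
    yield b"first chunk"
    yield b"second chunk"

# A generator body is sent with chunked transfer encoding,
# since its total size is not known up front.
requests.post("https://example.org/upload", data=chunks())

# A file object is streamed from disk; requests sets Content-Length
# when it can determine the size.
with open("payload.bin", "rb") as f:
    requests.post("https://example.org/upload", data=f)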

Fabian

@dcoles self-requested a review January 6, 2023 07:44

dcoles (Owner) commented Jan 6, 2023

Hi @fabian-paul,

Sorry! I only just noticed this PR (end-of-year was pretty crazy).

Supporting streaming uploads sounds like a great idea.
I'll take a look over the PR tomorrow morning.

David

elif isinstance(self.prepared.body, (io.RawIOBase, io.BufferedIOBase)):
    self.curl.setopt(pycurl.READFUNCTION, self.prepared.body.read)
    self.curl.setopt(pycurl.TRANSFER_ENCODING, 1)
elif hasattr(self.prepared.body, "__iter__"):  # TODO: call iter instead of checking (e.g. to support delegates)
dcoles (Owner)

It might be better to use isinstance(obj, Collection).

collections.abc.Collection guarantees that a type implements __contains__, __iter__ and __len__. You can also use collections.abc.Iterable to specifically test for __iter__.
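
For instance (is_iterable_body is just an illustrative helper, not something in the PR):

import collections.abc
import io

def is_iterable_body(body):
    # File-like objects are handled via READFUNCTION, and str/bytes are
    # plain in-memory payloads, so only treat the rest as a chunk iterable.
    if isinstance(body, (io.RawIOBase, io.BufferedIOBase, str, bytes)):
        return False
    return isinstance(body, collections.abc.Iterable)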

else:
    self.curl.setopt(pycurl.TRANSFER_ENCODING, 0)
    self.curl.setopt(pycurl.INFILESIZE_LARGE, n_bytes)
reader = ChunkIterableReader(iter(self.prepared.body))
dcoles (Owner)

I wonder if we can use the two argument form of iter here:

iter(self.prepared.body, "") # for string
iter(self.prepared.body, b"") # for bytes

Comment on lines +93 to +97
def close(self):  # TODO
    try:
        self._iterator.close()
    except AttributeError:
        pass
dcoles (Owner)

I don't think you should be closing the iterator from within the library. Typically I would expect a user to be using a with block or calling close manually.
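
i.e. something along the lines of (URL is a placeholder; the same pattern applies with pycurl_requests in place of requests):

import requests

def chunks():
    yield b"part one"
    yield b"part two"

gen = chunks()
try:
    requests.post("https://example.org/upload", data=gen)
finally:
    gen.close()  # the caller decides when the iterator is closed, not the library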


#: Is this _really_ PyCurl-Requests?
#: Should be used when testing for PyCurl-Requests extensions.
IS_PYCURL_REQUESTS = requests.__name__ == 'pycurl_requests'


test_data = bytes(random.getrandbits(8) for _ in range(123456))
dcoles (Owner)

I'd recommend using os.urandom here.
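
Something like:

import os

test_data = os.urandom(123456)  # 123,456 cryptographically strong random bytes from the OS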

if not allow_chunked:
    self.response('This endpoint has chunked transfer deactivated.', status=(400, "Bad Request"))
    return
body = b""
dcoles (Owner)

Repeatedly concatenating immutable bytes objects can be expensive, since every concatenation copies the accumulated data. bytearray is a better choice if you need something mutable.
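
Roughly (received_chunks stands in for however the handler reads the incoming request data):

received_chunks = [b"first", b"second", b"third"]  # placeholder for the incoming data

body = bytearray()
for chunk in received_chunks:
    body += chunk   # extends the buffer in place instead of copying the whole prefix
body = bytes(body)  # convert once at the end if an immutable value is needed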
