Add support for nvCOMP batch API #249

Alexey-Kamenev · 2023-06-27T21:59:03Z

See #248 for more details.

rapids-bot · 2023-06-27T21:59:08Z

Pull requests from external contributors require approval from a rapidsai organization member with write permissions or greater before CI can begin.

jakirkham · 2023-06-27T22:46:36Z

cc @thomcom @madsbk (for awareness)

jakirkham · 2023-06-28T00:35:10Z

/ok to test

madsbk

Nice work @Alexey-Kamenev !
Have some minor suggestion for my first review pass.

python/kvikio/nvcomp_codec.py

madsbk · 2023-06-28T07:39:43Z

python/kvikio/nvcomp_codec.py

+        """
+        return self.encode_batch([buf])[0]
+
+    def encode_batch(self, bufs):


Suggested change

def encode_batch(self, bufs):

def encode_batch(self, bufs : List[Any]) -> List[Any]:

Done. I did not add type hints since numcodecs Codec does not use them, so I decided to do the same (but I still prefer to use type hints).

numcodecs Codec does not use them

Could you please raise an upstream issue?

madsbk · 2023-06-28T07:56:59Z

python/kvikio/nvcomp_codec.py

+            max_chunk_size,
+            num_chunks,
+            temp_buf,
+            comp_chunks,


I don't think comp_chunks is used afterwards, I guess it should be part of the returned result?

That's correct - comp_chunks is used only as a container that stores pointers to actual chunks. nvCOMP requires this container to be on GPU as well i.e. it's a pointer to pointers and it has to be in GPU memory, same as actual chunk pointers. Once compress returns, this container is not needed/used anymore.

Ahh got it, could you add some comments describing the nature of comp_chunks and comp_chunks_header in more detail?

Done - also added similar comments to decode_batch.

python/kvikio/nvcomp_codec.py

* Addressed review feedback. * Added sample Jupyter notebook.

Alexey-Kamenev · 2023-06-28T17:50:10Z

It looks like there are 3 pipeline failures for this PR but I don't think they are related to the PR itself, since the errors look like this:

Error: OpenIDConnect provider's HTTPS certificate doesn't match configured thumbprint

jakirkham · 2023-06-29T01:00:42Z

/ok to test

madsbk

Only have some minor comments.

I think an important follow-up PR would be to support GPU memory output of encode_batch and decode_batch: #251

madsbk · 2023-06-29T07:11:27Z

python/kvikio/nvcomp_codec.py

+            max_chunk_size,
+            num_chunks,
+            temp_buf,
+            comp_chunks,


Ahh got it, could you add some comments describing the nature of comp_chunks and comp_chunks_header in more detail?

python/tests/test_nvcomp_codec.py

madsbk

Looks good, thanks @Alexey-Kamenev !

jakirkham · 2023-06-30T08:25:31Z

/ok to test

madsbk · 2023-07-03T19:12:57Z

/merge

Add support for nvCOMP batch API

c89c4a9

Alexey-Kamenev requested review from a team as code owners June 27, 2023 21:59

jakirkham added improvement Improves an existing functionality non-breaking Introduces a non-breaking change c++ Affects the C++ API of KvikIO python Affects the Python API of KvikIO labels Jun 27, 2023

madsbk requested changes Jun 28, 2023

View reviewed changes

Address review feedback

114ef42

* Addressed review feedback. * Added sample Jupyter notebook.

Add missing docstrings to the codec.

eb257d4

jakirkham requested a review from madsbk June 29, 2023 01:00

madsbk reviewed Jun 29, 2023

View reviewed changes

Address review feedback

ff14cc0

madsbk approved these changes Jun 30, 2023

View reviewed changes

rapids-bot bot merged commit 9e004ce into rapidsai:branch-23.08 Jul 3, 2023

akshaysubr mentioned this pull request Jul 11, 2023

Allow batched/concurrent (de)compression support zarr-developers/zarr-python#1398

Closed

akshaysubr mentioned this pull request Jul 25, 2023

Add entrypoints for Numcodecs compressors #66

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for nvCOMP batch API #249

Add support for nvCOMP batch API #249

Alexey-Kamenev commented Jun 27, 2023

rapids-bot bot commented Jun 27, 2023

jakirkham commented Jun 27, 2023

jakirkham commented Jun 28, 2023

madsbk left a comment

madsbk Jun 28, 2023

Alexey-Kamenev Jun 28, 2023

jakirkham Jun 29, 2023

Alexey-Kamenev Jun 29, 2023

madsbk Jun 28, 2023

Alexey-Kamenev Jun 28, 2023

madsbk Jun 29, 2023

Alexey-Kamenev Jun 29, 2023

Alexey-Kamenev commented Jun 28, 2023

jakirkham commented Jun 29, 2023

madsbk left a comment

madsbk Jun 29, 2023

madsbk left a comment

jakirkham commented Jun 30, 2023

madsbk commented Jul 3, 2023

	def encode_batch(self, bufs):
	def encode_batch(self, bufs : List[Any]) -> List[Any]:

Add support for nvCOMP batch API #249

Add support for nvCOMP batch API #249

Conversation

Alexey-Kamenev commented Jun 27, 2023

rapids-bot bot commented Jun 27, 2023

jakirkham commented Jun 27, 2023

jakirkham commented Jun 28, 2023

madsbk left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Alexey-Kamenev commented Jun 28, 2023

jakirkham commented Jun 29, 2023

madsbk left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

madsbk left a comment

Choose a reason for hiding this comment

jakirkham commented Jun 30, 2023

madsbk commented Jul 3, 2023