specialize collections holding bytes to turn into `PyBytes` rather than `PyList` #4417

Icxolu · 2024-08-04T21:50:15Z

Followup to #4060
XRef #4182

This implements specialization for collections holding bytes (in the new IntoPyObject trait) by a provided, sealed iter_into_pyobject method. The method is intended to be overridden by the element type (for now u8 only) and turns an iterator of elements into an appropriate Python type in type-erased form. The default implementation creates a PyList while the u8 override returns PyBytes.

This method is then used by the generic IntoPyObject::into_pyobject implementation of collection types. I updated the implementations of Vec<T>, &[T], [T; N] and SmallVec<A> accordingly. (I hope I did not forget something.)
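A minimal sketch of the dispatch described above, using toy stand-ins rather than PyO3's real types. `Obj`, `IntoObj`, `iter_into_obj` and the `private::Token` module here are hypothetical names chosen for illustration; only the shape (a provided trait method that the element type overrides, called from the collection's impl) is taken from the description.

```rust
// Toy stand-in for a type-erased Python object; this is NOT a PyO3 type.
#[derive(Debug, PartialEq)]
pub enum Obj {
    Int(i64),
    Bytes(Vec<u8>),
    List(Vec<Obj>),
}

mod private {
    // Code outside the enclosing module cannot name `Token`, so it cannot
    // call the hook below directly. This only illustrates the sealing idea.
    pub struct Token;
}

pub trait IntoObj: Sized {
    fn into_obj(self) -> Obj;

    // Provided hook: turn a whole sequence of `Self` into one object.
    // The default builds a list; an element type may override it.
    fn iter_into_obj<I>(iter: I, _: private::Token) -> Obj
    where
        I: IntoIterator<Item = Self>,
    {
        Obj::List(iter.into_iter().map(Self::into_obj).collect())
    }
}

impl IntoObj for i64 {
    fn into_obj(self) -> Obj {
        Obj::Int(self)
    }
}

impl IntoObj for u8 {
    fn into_obj(self) -> Obj {
        Obj::Int(i64::from(self))
    }

    // The override: a sequence of `u8` becomes a single bytes object.
    fn iter_into_obj<I>(iter: I, _: private::Token) -> Obj
    where
        I: IntoIterator<Item = Self>,
    {
        Obj::Bytes(iter.into_iter().collect())
    }
}

// The collection impl is fully generic; the element type decides the result.
impl<T: IntoObj> IntoObj for Vec<T> {
    fn into_obj(self) -> Obj {
        T::iter_into_obj(self, private::Token)
    }
}
```

In this sketch a `Vec<u8>` converts to `Obj::Bytes`, a `Vec<i64>` to `Obj::List`, and a `Vec<Vec<u8>>` to a list of bytes objects, because the outer `Vec` dispatches on its element type `Vec<u8>`, which keeps the default hook.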

From a quick test everything seems to work out. If this seems sensible to work with, I will add a few tests and the newsfragment.

Big thanks to @diliop for the previous work in #4182!


davidhewitt

Thanks, I'm very excited to see this and fix the longstanding footgun! We should probably mention this in the migration guide, as this will be a breaking change for return values of types like Vec<u8>.

davidhewitt · 2024-08-06T17:58:15Z

src/conversions/std/num.rs

+        PyBytes::new_with(py, len, |buf| {
+            let mut counter = 0;
+            for (slot, byte) in buf.iter_mut().zip(&mut iter) {
+                *slot = byte;
+                counter += 1;
+            }
+
+            assert!(iter.next().is_none(), "Attempted to create PyBytes but `iter` was larger than reported by its `ExactSizeIterator` implementation.");
+            assert_eq!(len, counter, "Attempted to create PyBytes but `iter` was smaller than reported by its `ExactSizeIterator` implementation.");


So this is interesting, I wonder, does this have a performance impact over just using PyBytes::new? As far as I can see, the problem is that we can't trivially get a slice, hence this approach.

I think the way to use PyBytes::new would be to make a trait like SliceableIterator: IntoIterator<Item = Self> which can either return a slice of all the elements in one go, or just call .into_iter() to do normal iteration which transfers ownership. Then we use this trait instead of I: IntoIterator as the input.
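To make the two strategies being compared concrete, here is a rough standalone sketch. A plain `Vec<u8>` stands in for the buffer that `PyBytes::new_with` hands to its closure, and both function names are hypothetical. The first function mirrors the checks in the hunk above; the second is the single bulk copy that becomes possible once a slice is available.

```rust
// Hypothetical helper: fill a pre-sized buffer from an iterator, checking
// that the iterator yields exactly as many items as `len()` reported.
pub fn fill_from_iter<I>(iter: I) -> Vec<u8>
where
    I: IntoIterator<Item = u8>,
    I::IntoIter: ExactSizeIterator,
{
    let mut iter = iter.into_iter();
    let len = iter.len();
    let mut buf = vec![0u8; len];
    let mut counter = 0;
    // `zip` stops at the shorter side, so both mismatches are caught below.
    for (slot, byte) in buf.iter_mut().zip(&mut iter) {
        *slot = byte;
        counter += 1;
    }
    assert!(
        iter.next().is_none(),
        "iterator yielded more items than `len()` reported"
    );
    assert_eq!(
        len, counter,
        "iterator yielded fewer items than `len()` reported"
    );
    buf
}

// Hypothetical helper: with a slice in hand, the copy is one bulk operation
// and needs no length bookkeeping.
pub fn copy_from_slice(bytes: &[u8]) -> Vec<u8> {
    bytes.to_vec()
}
```

Both produce the same bytes; the open question in this thread is only how much the per-element loop costs relative to the bulk copy.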

Good question, I haven't tested whether this has any performance impact. I initially tried to do this with slices rather than iterators, but ran into problems with the generics (there were always T and &T involved which caused all sorts of problems)

I'm not sure if I get the idea of SliceableIterator. How would we specialize between slices and other IntoIterators?

Hmm yeah playing around with this locally I see exactly what you mean by the interaction between T and &T. I feel like there should be a solution so I'm going to continue to mull this over for a day or two.

So far I got to something like this:

pub trait SliceableIntoPyObjectIterator<'py>: Sized {
    type SliceItem;
    type IntoPyObjectItem: IntoPyObject<'py>;
    type IntoIter: ExactSizeIterator<Item = Self::IntoPyObjectItem>;

    fn into_py_object_iterator(self) -> Self::IntoIter;
    fn as_slice(&self) -> &[Self::SliceItem];
}

where the bytes case then becomes

fn iter_into_pyobject<I>(
    iter: I,
    py: Python<'py>,
    _: crate::conversion::private::Token,
) -> Result<Bound<'py, PyAny>, PyErr>
where
    I: crate::conversion::SliceableIntoPyObjectIterator<'py, SliceItem = Self>,
{
    Ok(PyBytes::new(py, iter.as_slice()).into_any())
}

... and the default implementation calls the .into_py_object_iterator() method instead to consume the elements.

But as you found, I'm having trouble with T and &T. 😂

Ok, so I gave this a really deep investigation and found that yes, new_with is a bit slower than PyBytes::new, but it's not so much slower that it's a deal breaker.

I opened #4423 and #4424 which are two attempts to try to optimise the conversion a little bit. Both branches include two commits; the first is a set of benchmarks which I was using to study performance.

Overall, I think I'm more comfortable with the approach taken in #4424 which doesn't attempt to use unsafe to be able to then use PyBytes::new, and instead is essentially a small tidy up on top of this branch. If users report performance issues we can always revisit the unsafe option later. But this is already so much faster than the current Vec<u8> conversion which creates a list of integers that I doubt users will really notice the overhead of us using PyBytes::new_with here.

Would be interested to hear what you think of those options.

Thanks for the thorough investigation! I'm also way more comfortable with the iterator approach in #4424. Unsafely casting all these types seems like a footgun to me at best, even if this would be fully contained within PyO3.

For the general case I'm unsure about whether it matters for soundness to have an instantiation of impl or whether it needs to be valid for all T in general. Having a safety guarantee about the instantiation of a generic type sounds a bit sketchy to me. If we're on the cautious side and say it has to be valid for all T I think this is at least invalid for zero sized types, since these would only be valid for zero sized reads on their dangling pointer and not any following address.

And also for this specific case I think this is unsound for something like a Vec<&u8> which would use <&u8>::iter_into_pyobject, which itself would use Vec<&u8>::as_bytes_slice and thus cast &u8 to u8 ⚡
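The layout concern can be illustrated with safe code alone. The helper below is hypothetical and checks only a necessary condition for viewing a `&[T]` as `&[u8]` (one byte per element, byte alignment); passing it does not make such a cast sound, it just shows why `&u8` and zero-sized types fail immediately.

```rust
// Hypothetical check, for illustration only. A type whose size or alignment
// differs from `u8` can never have its slice reinterpreted as `&[u8]` with
// the same element count. This is a necessary condition, not a sufficient one.
pub fn has_byte_layout<T>() -> bool {
    std::mem::size_of::<T>() == 1 && std::mem::align_of::<T>() == 1
}
```

Here `has_byte_layout::<u8>()` holds, while `has_byte_layout::<&u8>()` does not, since a `&u8` is pointer-sized, and neither does `has_byte_layout::<()>()`, since a zero-sized type occupies no bytes at all.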

> But this is already so much faster than the current Vec<u8> conversion which creates a list of integers that I doubt users will really notice the overhead of us using PyBytes::new_with here.

I agree here. This was also a reason why I initially did not put too much thought into it, since it looked like I would have at worst a similar performance characteristic to the list case we have currently.

How do we want to proceed here @davidhewitt? Should we merge this as is and then followup in #4424 or should we just cleanup #4424 and merge that directly?

Sorry for the delay, I just had a potentially better idea in #4442 which uses PyBytes::new and allows &[u8] target type to be PyBytes. If you like it, maybe let's merge that one directly?

No worries, I just wasn't sure whether we were still thinking here or had reached consensus 🙃

src/conversion.rs

Icxolu · 2024-08-06T18:26:43Z

You're right, this has an immediate effect, because of the macro specialization. I totally forgot that 😅 , I will add a migration entry 👍

davidhewitt · 2024-08-06T18:49:25Z

I mean I guess we can probably add another layer to the autoderef specialization to prefer IntoPy only for these types, and emit a deprecation warning unless they wrap in a PyBytes::new or similar 😂

Icxolu · 2024-08-06T19:00:49Z

I kind of hope that it won't be necessary 🙃 , since it's also fairly easy to work around, if that conversion is not desired.

Icxolu · 2024-08-15T20:17:37Z

Superseded by #4442

Icxolu added 4 commits August 6, 2024 19:31

specialize collections holding bytes to turn into PyBytes rather th…

142e79c

…an `PyList`

specialize Cow<'_, T>

605abeb

add tests

5720ee7

add newsfragment

16120ec

Icxolu force-pushed the pybytes-specialization branch from bec3c73 to 16120ec Compare August 6, 2024 17:31

davidhewitt reviewed Aug 6, 2024

hide iter_into_pyobject

b918618

Icxolu force-pushed the pybytes-specialization branch from b8978fc to e1fe5ec Compare August 6, 2024 19:01

add migration entry

35c005b

Icxolu force-pushed the pybytes-specialization branch from e1fe5ec to 35c005b Compare August 6, 2024 19:12

This was referenced Aug 7, 2024

bytes specialization via unsafe slice cast #4423

Closed

bytes specialization, optimised from iterator implementation #4424

Closed

remove redundant generic

ef25633

davidhewitt mentioned this pull request Aug 15, 2024

Pybytes specialization slices #4442

Merged

Icxolu closed this Aug 15, 2024

Icxolu deleted the pybytes-specialization branch August 15, 2024 20:21
