Add SequenceProtocol and MappingProtocol descriptions to the guide #1546

ravenexp · 2021-04-05T11:52:31Z

Supposedly resolved by PyO3#1107. Fix a typo in the subsection header.

Updates the class customization guide.

References:
[1]: https://docs.python.org/3/reference/datamodel.html#emulating-container-types
[2]: https://docs.python.org/3/c-api/sequence.html
[3]: https://docs.python.org/3/c-api/typeobj.html#c.PySequenceMethods

References:
[1]: https://docs.python.org/3/reference/datamodel.html#emulating-container-types
[2]: https://docs.python.org/3/c-api/mapping.html
[3]: https://docs.python.org/3/c-api/typeobj.html#c.PyMappingMethods

davidhewitt

Thank you for giving some much needed love to our documentation!

I have a couple of suggestions and questions; comments below.

Also, a more general question I have wondered about: the PyResult bit of all the return values is optional - e.g. for a proto which returns PyResult<T>, just T is also an acceptable return value. Do you think it's better if:

* we include PyResult<T> on all these return values, with a note in an # Error Handling section saying PyResult is optional
* we just have T on all the return values, with a note in an # Error Handling section saying all return types can be wrapped in PyResult

I think that just having T on the return value is easier to read, but it's maybe less obvious for new users to then figure out how to return errors unless they read the whole document and find the Error Handling section!

davidhewitt · 2021-04-06T21:15:52Z

guide/src/class/protocols.md

+    *Note:* Negative integer indexes are handled as follows: if `__len__()` is defined,
+    it is called and the sequence length is used to compute a positive index,
+    which is passed to `__getitem__()`.
+    If `__len__()` is not defined, the index is passed as is to the function.


This differs from https://docs.python.org/3/reference/datamodel.html#object.__getitem__, which says that negative integers are passed directly to __getitem__, which can then interpret them as it likes?

I presume PyO3 plugs the __getitem__ method implementation into the PySequenceMethods.sq_item slot. The C function in that slot has a slightly different interface from the Python-native __getitem__() method, as described in the C-API docs linked above. PyObject_GetItem() and PySequence_GetItem() handle the negative indexes themselves when __len__() is also implemented.

My implementations of __getitem__() always start with assert!(idx >= 0); because I also implement __len__(), and they indeed work with the negative indexes.
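The index adjustment described above can be modelled in plain Rust. This is only a minimal sketch of the logic that `PySequence_GetItem()` applies before calling the `sq_item` slot; it is not PyO3 or CPython code, and the `SequenceSlots` trait and the `Digits` and `Echo` types are hypothetical names invented for illustration.

```rust
/// Hypothetical stand-in for the `sq_length` and `sq_item` slots of a type.
pub trait SequenceSlots {
    type Item;
    /// Models `sq_length`; `None` means that `__len__()` is not defined.
    fn sq_length(&self) -> Option<isize>;
    /// Models `sq_item`; receives the index after any adjustment.
    fn sq_item(&self, index: isize) -> Result<Self::Item, String>;
}

/// Models the adjustment of negative indexes before the slot is called.
pub fn sequence_get_item<S: SequenceSlots>(seq: &S, mut index: isize) -> Result<S::Item, String> {
    if index < 0 {
        // Only sequences that report a length get their index shifted.
        if let Some(len) = seq.sq_length() {
            index += len;
        }
    }
    seq.sq_item(index)
}

/// Example sequence that defines a length.
pub struct Digits(pub Vec<u8>);

impl SequenceSlots for Digits {
    type Item = u8;
    fn sq_length(&self) -> Option<isize> {
        Some(self.0.len() as isize)
    }
    fn sq_item(&self, index: isize) -> Result<u8, String> {
        // The index can still be negative here, so it has to be checked.
        if index < 0 {
            return Err(format!("index {} out of range", index));
        }
        self.0
            .get(index as usize)
            .copied()
            .ok_or_else(|| format!("index {} out of range", index))
    }
}

/// Example sequence without a length: it echoes the index it receives.
pub struct Echo;

impl SequenceSlots for Echo {
    type Item = isize;
    fn sq_length(&self) -> Option<isize> {
        None
    }
    fn sq_item(&self, index: isize) -> Result<isize, String> {
        Ok(index)
    }
}
```

In this model, `sequence_get_item(&Digits(vec![1, 2, 3]), -1)` reaches `sq_item` with index 2, while `Echo` receives -2 unchanged. An index such as -10 on a three-element sequence is still negative after the adjustment, so the slot function has to reject it itself.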

Ah yes, this is a really good point. Perhaps in the documentation here we should link all these methods to their relevant slot information?

All the relationships between these methods and their slots are defined in https://github.com/PyO3/pyo3/blob/main/pyo3-macros-backend/src/defs.rs

Perhaps in the documentation here we should link all these methods to their relevant slot information?

I like this idea, but the mapping between methods and slots does not look so straightforward. There are methods that do not have a C function slot, like __bytes__(), __format__() and __reversed__(). And then there are methods that share the same slot, like __setitem__() and __delitem__(). I think we could add a symbolic tag like PySequenceMethods.sq_item to each trait method.

Another interesting issue here is the implicit slot method probing order: for example, the evaluation of the Python expression obj[i] probes PyMappingMethods.mp_subscript before PySequenceMethods.sq_item. This is relevant when someone implements both of these traits for some reason.
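The probing order mentioned above can be sketched as a small Rust model. It is a simplified illustration of the dispatch, not the CPython implementation: the `Key` enum, the `Slots` struct and the two slot functions are hypothetical, and error reporting is reduced to a single message.

```rust
/// Hypothetical subscript key: either an integer or some other object.
#[derive(Debug, Clone, PartialEq)]
pub enum Key {
    Int(isize),
    Text(String),
}

/// Hypothetical model of the two slots involved in `obj[key]`.
pub struct Slots {
    pub mp_subscript: Option<fn(&Key) -> String>,
    pub sq_item: Option<fn(isize) -> String>,
}

/// Models the slot lookup order for a subscript expression.
pub fn object_get_item(slots: &Slots, key: &Key) -> Result<String, String> {
    // The mapping slot is probed first and receives the key unchanged.
    if let Some(subscript) = slots.mp_subscript {
        return Ok(subscript(key));
    }
    // The sequence slot is only a fallback, and only for integer keys.
    if let (Some(item), Key::Int(index)) = (slots.sq_item, key) {
        return Ok(item(*index));
    }
    Err("object is not subscriptable with this key".to_string())
}

/// Stand-in for a mapping `__getitem__` implementation.
pub fn from_mapping(key: &Key) -> String {
    format!("mapping slot got {:?}", key)
}

/// Stand-in for a sequence `__getitem__` implementation.
pub fn from_sequence(index: isize) -> String {
    format!("sequence slot got {}", index)
}
```

When both slots are filled, `object_get_item` never reaches `from_sequence`, which is the situation a class implementing both traits would be in.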

davidhewitt · 2021-04-06T21:20:34Z

guide/src/class/protocols.md

+  * `fn __reversed__(&self) -> PyResult<impl ToPyObject>`
+
+    Called (if present) by the `reversed()` built-in to implement reverse iteration.
+    It should return a new iterator object that iterates over all the objects in
+    the container in reverse order.


Hmm we should probably move __reversed__ to the PyIterProtocol trait? (I can do it in a separate PR.)

Well, __reversed__() is a standalone method, not a PyMappingMethods slot, so there is no particular reason it should be a part of PyMappingProtocol.
However, implementing __reversed__() for sequences is not very useful, because the default implementation seems sufficient for most cases.

I personally do not like the idea of adding anything to PyIterProtocol though. It's already semantically overloaded as it encompasses both IntoIterator and Iterator Rust traits. But then the user is expected to implement a different subset of methods for each of the intended uses (as an iterator or as an iterable).

Currently, we also have to write mandatory boilerplate code like

fn __iter__(slf: PyRef<Self>) -> PyRef<Self> { slf }

for every custom iterator class.

Most Rust programmers would expect a blanket implementation like

impl<I> IntoIterator for I where I: Iterator,

from the stdlib to exist instead.

I'm sorry about the long rant, but PyIterProtocol was the biggest road bump for me when working with PyO3.
I really wish it was two distinct traits, for example PyIterProtocol and PyIntoIterProtocol, so that anyone who knows how Rust iterators work could easily implement Python iterators without scratching their head about "why do I need to return PyRef<Self> instead of Self here?"
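For comparison, the standard library blanket implementation referred to above means that a type implementing only `Iterator` can already be used wherever an `IntoIterator` is expected. The sketch below uses only the standard library; `Countdown` and `collect_all` are hypothetical names chosen for illustration.

```rust
/// Hypothetical iterator type that implements only `Iterator`.
pub struct Countdown(pub u32);

impl Iterator for Countdown {
    type Item = u32;

    fn next(&mut self) -> Option<u32> {
        if self.0 == 0 {
            None
        } else {
            self.0 -= 1;
            Some(self.0 + 1)
        }
    }
}

/// Accepts any iterable; `Countdown` qualifies through the blanket
/// `impl<I: Iterator> IntoIterator for I`, whose `into_iter` returns `self`.
pub fn collect_all<I: IntoIterator<Item = u32>>(iterable: I) -> Vec<u32> {
    let mut out = Vec::new();
    for item in iterable {
        out.push(item);
    }
    out
}
```

No explicit `IntoIterator` implementation is written for `Countdown`, which is the Rust counterpart of the `__iter__` method that simply returns `slf`.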

Thanks, I completely agree that the current situation is confusing in many ways. I wonder if alternatively here I should remove __reversed__ from #[pyproto] completely. In my opinion #[pymethods] are very easy to learn, and implementing __reversed__ in #[pymethods] will already work.

I like this idea very much. There are other non-slotted methods that are included in the protocol traits, like __bytes__(). And then there are C function slots that have no corresponding PyObjectProtocol method like tp_call aka __call__().

I'm all for implementing true non-slotted methods as #[pymethods]. This way we can avoid surprises like #1465 in the future.

ravenexp · 2021-04-07T06:40:52Z

Also, a more general question I have wondered about: the PyResult bit of all the return values is optional - e.g. for a proto which returns PyResult<T>, just T is also an acceptable return value. Do you think it's better if:
* we include `PyResult<T>` on all these return values, with a note in an `# Error Handling` section saying _PyResult is optional_

* we just have `T` on all the return values, with a note in an `# Error Handling` section saying _all return types can be wrapped in PyResult_
I think that just having T on the return value is easier to read, but it's maybe less obvious for new users to then figure out how to return errors unless they read the whole document and find the Error Handling section!

Hmm, either way is fine with me. I mostly followed the convention PyObjectProtocol and PyNumberProtocol docs already used.
It is also not very clear whether the PyResult<()> return type can be omitted entirely.
New Rust programmers would probably expect the trait method signatures to match exactly, and the fact that one can implement either fn foo(&self) or fn foo(&self) -> PyResult<()> is not obvious at all.
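One way a wrapper could accept both a bare `T` and a result type is a pair of conversion impls, sketched below in plain Rust. This only illustrates the general idea and makes no claim about how `#[pyproto]` works internally: the `IntoCallbackResult` trait and the `call_slot` function are hypothetical, and `String` stands in for an error type such as `PyErr`.

```rust
/// Hypothetical conversion trait; `String` stands in for the error type.
pub trait IntoCallbackResult<T> {
    fn into_callback_result(self) -> Result<T, String>;
}

/// A bare value is treated as success.
impl<T> IntoCallbackResult<T> for T {
    fn into_callback_result(self) -> Result<T, String> {
        Ok(self)
    }
}

/// A `Result` is passed through unchanged.
impl<T> IntoCallbackResult<T> for Result<T, String> {
    fn into_callback_result(self) -> Result<T, String> {
        self
    }
}

/// Hypothetical wrapper that generated glue code could place around a user method.
pub fn call_slot<T, R, F>(method: F) -> Result<T, String>
where
    F: FnOnce() -> R,
    R: IntoCallbackResult<T>,
{
    IntoCallbackResult::<T>::into_callback_result(method())
}

/// User method written without a result wrapper.
pub fn len_plain() -> usize {
    3
}

/// The same method written with an explicit result type.
pub fn len_checked() -> Result<usize, String> {
    Ok(3)
}

/// A method that reports an error.
pub fn len_failing() -> Result<usize, String> {
    Err("length unavailable".to_string())
}
```

Both `len_plain` and `len_checked` can be passed to `call_slot` when the caller asks for a `Result<usize, String>`. The caller has to fix the target type, because a `Result` value could otherwise also be read as a bare value, which hints at why the accepted signatures are not obvious from the outside.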

davidhewitt · 2021-04-07T22:09:22Z

New Rust programmers would probably expect the trait method signatures to match exactly

This is very true. I think in that case it's better to leave the documentation with PyResult<_> on all the methods.

the fact that one can implement either fn foo(&self) or fn foo(&self) -> PyResult<()> is not obvious at all.

I agree. There's a lot of magic going on in these trait implementations, which I really wish we could simplify.

davidhewitt

Based on the discussion here, I'm happy that this is good to be merged to the guide as-is. Thanks very much for this; the #[pyproto] documentation is desperately in need of love!

ravenexp added 4 commits April 5, 2021 08:35

Remove issue PyO3#844 mention from the guide

0c02146

Supposedly resolved by PyO3#1107. Fix a typo in the subsection header.

Insert missing impl keywords

4b675cc

Updates the class customization guide.

Add PySequenceProtocol description to the guide

b1aae93

References:
[1]: https://docs.python.org/3/reference/datamodel.html#emulating-container-types
[2]: https://docs.python.org/3/c-api/sequence.html
[3]: https://docs.python.org/3/c-api/typeobj.html#c.PySequenceMethods

Add PyMappingProtocol description to the guide

88849bd

References:
[1]: https://docs.python.org/3/reference/datamodel.html#emulating-container-types
[2]: https://docs.python.org/3/c-api/mapping.html
[3]: https://docs.python.org/3/c-api/typeobj.html#c.PyMappingMethods

davidhewitt reviewed Apr 6, 2021

Apply suggestions from code review

e8a277e

- Style Co-authored-by: David Hewitt <1939362+davidhewitt@users.noreply.github.com>

davidhewitt approved these changes Apr 7, 2021

davidhewitt merged commit f5188bb into PyO3:main Apr 7, 2021

davidhewitt mentioned this pull request Apr 12, 2021

pyproto: deprecate py_methods #1560

Merged
