-
Notifications
You must be signed in to change notification settings - Fork 64
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Data API] Array Object Implementation #261
[Data API] Array Object Implementation #261
Conversation
…ithmetic operators
45d1b5c
to
ae6be05
Compare
|
||
|
||
class Array: | ||
def __init__( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Todo: move to asarray() function
follow constructor like done in numpy/etc
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There is a custom array object, numpy.array_api.Array, which is returned by
all functions in this module. All functions in the array API namespace
implicitly assume that they will only receive this object as input. The only
way to create instances of this object is to use one of the array creation
functions. It does not have a public constructor on the object itself. The
object is a small wrapper class around numpy.ndarray. The main purpose of it
is to restrict the namespace of the array object to only those dtypes and
only those methods that are required by the spec, as well as to limit/change
certain behavior that differs in the spec. In particular:
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This PR ought to describe a new structure with fewer changes to ArrayFire API. First of many, tbh :)
Changing init now will affect a major part of this PR and follow-up #262. Can we postpone these changes (that are crucial I agree) to the third one, so as to avoid huge code merge issues? Proposed changes in #262 will simplify the implementation of asarray
method, thus I wanted to keep creation functions (https://data-apis.org/array-api/latest/API_specification/creation_functions.html) till the finished backend.
# TODO | ||
return NotImplemented | ||
|
||
def __len__(self) -> int: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
remove, not part of the spec. Would be good to verify all of these and remove extra functions or make them private
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Let's move/group all non-standard functions into separate region, at bottom? delineate with comments
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It is now grouped by another principle - magic methods, attributes, and aux functions. Mixing it all under another group will lead us to a messy search across the source code. I'm proposing using built-in tags like #NOTE to highlight such methods with comments if it's necessary.
return out | ||
|
||
@property | ||
def size(self) -> int: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not following spec, should be Optional[Int]
Copy docstrings from https://data-apis.org/array-api/latest/API_specification/generated/array_api.array.size.html#array_api.array.size
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
And still, we won't have a case where size equals None
, but making the return Optional
will lead us to major logical code changes if we want to keep typing work.
Mypy doesn't implement it in their source code as well https://github.com/numpy/numpy/blob/6f3e1f458e04d13bdd56cff5669f9fd96a25fb66/numpy/array_api/_array_object.py#L1088
My point is to keep int
as an expected return and not try to imagine cases that likely never happen in the future with None
return.
Or, let's switch to TDD and add valid test case for None
return and then fix typing in the source code 🤷♂️
@@ -0,0 +1,5 @@ | |||
# Build requirements | |||
wheel~=0.38.4 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
loosen version requirements
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It's already loosened :)
We stick to the 0.38.X version because it's not the latest stable release and we want to be sure that other minor version bumps won't affect our dev setup. ~=
means we are OK with bug fixes in fixes versions, thus X variable changes won't affect us at all.
FYI https://peps.python.org/pep-0440/#compatible-release
…ec. Dump minimal numpy version required.
Purpose
Data API documentation - https://data-apis.org/array-api/latest/purpose_and_scope.html
Change List
Array Object -
class Array
src
->x
. Motivation:src
is commonly used to name the source code folders and not as a parameter in the class. Other options likearr
(is used in the class),a
(not intuitive), orobject
(likely meant by ANY object, but it is not true in this case) seem like not a good fit, so I chosex
is as it is an intuitive parameter to any function. Also, it occurs in the specification, e.g., here.dims
->shape
. Motivation:dims
(orndim
) is more commonly used asint
, andshape
astuple
, so the change is understandable and should not cause any confusion.is_device
->pointer_source
. Motivation: add more intuition, already discussed with @syurkevi, but most likely is subject to change in further PRs with Device implementation.dtype
,offset
, andstrides
are left unchanged due to their pretty clear purpose.BaseClass
because it has only one purpose to set a defaultarr
value asctypes.c_void_p
, so it was moved to theArray
class.__repr__
to makeArray
more descriptive during the prototyping. Will be merged withdisplay()
method as numpy was selected as a state-of-the-art exampleelement
method tosize
as it is more common in data APIs.to_ctype
method toto_ctype_array
as it actually returns the C Arraydims
as has the same functionality as theshape
method and the last one is required by the specificationnumdims
tondim
as it is shorter and no more mess with other dim-like synonyms included. And yes. it is mentioned in the specificationtype
method as it has no applied sense. The value can now be retrieved by accessing the specificDtype
value, e.g.,Array().dtype.c_api_value
(more about data types changes below)Data Types -
class Dtype
dataclass
(also, subject to change) because its main purpose was to be a "bridge" between python types and C types, so the batch of different dictionaries was reduced to classes with 4 parameters and a couple of helper methodstypecode
- previouslyto_dtype()
c_type
- previouslyto_c_type()
typename
- left unchangedc_api_value
- previously was an_Enum
that was a match to C++ library dtypeEnum
. (P.S. naming was discussed with @syurkevi and I really liked that idea)Other (subject to change in follow-up PRs)
dim4()
which is now converted to classCShape
. The main purpose was to convert Python shapes to C Shapes, so the logic remains the same, but the current solution helps to avoid extra code duplication and bugs during the type conversions.c_dim_t
from the arch investigationall()
andany()
methods), and ArrayFire custom backend solutionTesting
import *
to simplify debugging process and avoid recursive re-importing as it meets in the original libraryDev Flow
flake8
andautopep8
code style checkersisort
imports checkerflake8-quotes
for quotes usage unification to avoid the mess with docstrings and commentsmypy
for typing as dev requirements to the new dev flow samplepytest
Notes
This is the first PR among the series of Data API implementations. A lot of changes are subject to change in follow-ups and a bunch of bags may occur till the whole
array_api
implementation. Any suggestions, bug reports, and feature requests are highly welcome (but may be implemented sooner or later due to the marked plan of development).