-
-
Notifications
You must be signed in to change notification settings - Fork 18.5k
DISC: Directory for code "upstream" from _libs #25162
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Making the codebase DAG-like is not necessarily the same as separating out pieces of the codebase from one another. That being said if DAG and increased modularity happen to go hand-in-hand, by all means. Also, is |
True. In the case of
No, but a subset of
That leaves:
Some of these ( |
Motivated in part by #24737, which is discussing vendoring
tzlocal
for use intslibs
.A lot of effort has gone into making
_libs
, and in particulartslibs
, self-contained within the pandas code base. This makes these parts of the code base much easier to reason about. It also makes it easier to test, profile, and measure coverage on pieces of the code base in isolation.Furthermore, outside of
_libs
the dependency structure is not quite DAG, but bears a resemblance if you squint and tilt your head a little:compat
,errors
, andutil._*
are almost independent ofcore
.core.dtypes
is almost-independent of the rest of core, etc.AFAICT there are two pieces of code that prevent us from further simplifying the dependency structure:
config.get_option
andpprint_thing
.The proposal is to put these two functions in a directory that is explicitly "upstream" of everything else, allowing us to simplify the dependency structure.
[Side-note: It might also make sense to make a
config
shared within the ecosystem to de-duplicate e.g.pd.set_option('display.width', ...)
andnp.set_printoption(...)
]Thoughts?
The text was updated successfully, but these errors were encountered: