TYP: make the type annotations of read_csv & read_table discoverable #34976

topper-123 · 2020-06-24T18:42:37Z

closes ERR: read_csv exposes an internal function when bad argument is specified #25648
tests added / passed
passes black pandas
passes git diff upstream/master -u -- "*.py" | flake8 --diff
whatsnew entry

On master, the type of read_csv and read_table is 'Any'. This PR makes the functions' signatures discoverable by mypy. It also fixes the error message raised when a wrong kwarg is passed.
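To illustrate the error-message part, here is a minimal sketch rather than the actual pandas code. The names `_make_reader`, `parser_f`, `read_csv_generated`, `read_csv_explicit` and `error_message` are hypothetical stand-ins. It assumes the old setup built the public function from an inner closure and renamed it via `__name__`, which does not change the name Python reports in a `TypeError`. An unannotated factory like this is also the kind of thing a type checker would be expected to treat as `Any`.

```python
def _make_reader(name, default_sep):
    """Hypothetical factory standing in for a function-generator setup."""

    def parser_f(path, sep=default_sep):
        # Stand-in body: just echo the arguments back.
        return {"path": path, "sep": sep}

    # Renaming the attribute does not rename the underlying code object,
    # so error messages still mention the inner name.
    parser_f.__name__ = name
    return parser_f


# Generated variant: the public name is only attached after the fact.
read_csv_generated = _make_reader("read_csv", ",")


def read_csv_explicit(path, sep=","):
    """Explicitly defined variant with a visible signature."""
    return {"path": path, "sep": sep}


def error_message(func):
    """Call func with an unknown keyword and return the TypeError text."""
    try:
        func("data.csv", bad_kwarg=1)
    except TypeError as exc:
        return str(exc)
    return None


print(error_message(read_csv_generated))  # mentions the inner parser_f
print(error_message(read_csv_explicit))   # mentions read_csv_explicit
```

With an explicit `def`, both the signature and the name in the error message belong to the public function, which is the behaviour this PR is after.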

WillAyd · 2020-06-24T20:57:10Z

I am on board with this. This probably also closes #25648

WillAyd · 2020-06-24T20:58:23Z

So maybe worth pulling the test case from #33023

WillAyd · 2020-06-24T20:58:42Z

@gfyoung

gfyoung · 2020-06-24T21:24:01Z

I'm a bit torn on this TBH

I see the benefits to this change (mypy + #25648, #33023), but on the other hand, it does mean we will need to maintain two copies of essentially the same signature. This function generator setup was meant to alleviate that.

If we're mostly okay with maintaining two copies of the signature, then go for it. @pandas-dev/pandas-core

topper-123 · 2020-06-24T23:25:58Z

@WillAyd , good idea taking in that test from #33023. I'll do that in the next commit.

@gfyoung , that was the reason I added the test test_read_table_same_signature_as_read_csv, to ensure the signatures don't drift apart :-)
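A signature-drift check of this kind could look roughly like the sketch below. It is not the test added in the PR: the stub functions and the `signature_drift` helper are hypothetical, and it assumes the only intended difference between the two readers is the default of `sep`.

```python
import inspect


def read_csv_stub(filepath, sep=",", header="infer"):
    """Hypothetical stand-in for read_csv."""


def read_table_stub(filepath, sep="\t", header="infer"):
    """Hypothetical stand-in for read_table."""


def read_table_drifted(filepath, sep="\t", header=None):
    """Stand-in whose 'header' default has drifted away."""


def signature_drift(func_a, func_b, allowed=("sep",)):
    """Return the parameter names on which the two signatures disagree.

    Parameters listed in 'allowed' may differ without being reported.
    """
    params_a = inspect.signature(func_a).parameters
    params_b = inspect.signature(func_b).parameters
    if list(params_a) != list(params_b):
        # Different names, or the same names in a different order.
        return sorted(set(params_a) ^ set(params_b)) or ["<parameter order>"]
    # inspect.Parameter equality covers name, kind, default and annotation.
    return [
        name
        for name in params_a
        if name not in allowed and params_a[name] != params_b[name]
    ]


assert signature_drift(read_csv_stub, read_table_stub) == []
assert signature_drift(read_csv_stub, read_table_drifted) == ["header"]
```

A test built on this idea fails as soon as someone edits one signature and forgets the other, which addresses the concern about maintaining two copies.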

gfyoung · 2020-06-25T01:34:51Z

@topper-123 : Ah, that's a fair point. Okay...it still feels a little hacky, but the test does make me feel better about this.

simonjayhawkins

Thanks @topper-123 lgtm.

simonjayhawkins · 2020-06-25T09:49:00Z

pandas/io/parsers.py

+        engine = "c"
+        engine_specified = False
+
+    kwds.update(


At some point, it would be beneficial for consistency checking not to use the dictionary here, as mypy doesn't check this.
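The concern can be sketched as follows. This is an illustration only: `ParserOptions`, `collect_untyped` and `collect_typed` are hypothetical names, not pandas internals, and `TypedDict` is just one possible way to give the keys a checked shape (it requires Python 3.8 or later).

```python
from typing import Any, Dict, TypedDict


class ParserOptions(TypedDict, total=False):
    """Hypothetical typed description of the collected options."""

    engine: str
    engine_specified: bool


def collect_untyped(kwds: Dict[str, Any]) -> Dict[str, Any]:
    # Deliberate typo in the key: a plain Dict[str, Any] accepts any key,
    # so neither the runtime nor a type checker has grounds to complain.
    kwds.update(engine="c", engine_specifed=False)
    return kwds


def collect_typed() -> ParserOptions:
    # With a TypedDict, a type checker such as mypy is expected to reject
    # a misspelled key here; at runtime this is still an ordinary dict.
    return ParserOptions(engine="c", engine_specified=False)


untyped = collect_untyped({})
assert "engine_specifed" in untyped       # the typo slipped through
assert "engine_specified" not in untyped  # the intended key is missing
assert collect_typed() == {"engine": "c", "engine_specified": False}
```

Passing the values as explicit, annotated keyword arguments to a helper would give a similar static guarantee without introducing a new type.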

jreback

Any comments, @TomAugspurger or @jorisvandenbossche?

TomAugspurger · 2020-06-25T14:43:32Z

No comments.

WillAyd · 2020-06-25T17:38:16Z

Thanks @topper-123

TYP: make the type annotations of read_csv & read_table discoverable

ef4b0d2

topper-123 force-pushed the read_csv_table_type branch from babb386 to ef4b0d2 June 24, 2020 18:43

fixes formatting

2ee24c0

WillAyd added the IO CSV read_csv, to_csv label Jun 24, 2020

gfyoung added the Refactor Internal refactoring of code label Jun 24, 2020

gfyoung approved these changes Jun 25, 2020

gfyoung requested review from gfyoung and removed request for gfyoung June 25, 2020 01:35

add test for correct error message

1febe42

simonjayhawkins approved these changes Jun 25, 2020

jreback added this to the 1.1 milestone Jun 25, 2020

jreback approved these changes Jun 25, 2020

WillAyd merged commit a7d96fa into pandas-dev:master Jun 25, 2020

topper-123 deleted the read_csv_table_type branch June 25, 2020 17:43

fangchenli pushed a commit to fangchenli/pandas that referenced this pull request Jun 27, 2020

TYP: make the type annotations of read_csv & read_table discoverable (p…

4bab429

…andas-dev#34976)

arw2019 mentioned this pull request Jul 27, 2020

API: read_csv, to_csv line_terminator keyword inconsistency #35399

Closed

5 tasks

mroeschke mentioned this pull request Aug 8, 2020

ENH: Make top-level Pandas functions serializable #35611

Closed

asishm mentioned this pull request Aug 28, 2020

BUG: read_table raises ValueError when delim_whitespace is set to True #35958

Closed

3 tasks
