Skip to content

hdfs: should not require hostname #3226

@efiop

Description

@efiop

using hdfs://path/to/something is pretty normal when you are trying to access your local hdfs, but dvc thinks that path is a hostname and fails to connect. We could think of a way to separate those or maybe just document it somewhere (and or provide a nicer hint). First idea would be to make users use localhost or 127.0.0.1 as a hostname in those cases, but that won't actually work. What would work is using default, as that is the default value in pyarrow.connect https://arrow.apache.org/docs/python/generated/pyarrow.hdfs.connect.html and I was able to verify that hdfs://default/path/to/something works great.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugDid we break something?p2-mediumMedium priority, should be done, but less importantresearch

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions