You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Yeah I've thought about this, should be fairly straightforward to do in Python. One thing that would need to be decided is whether to try to make it similar to the Hadoop CLI, or make it more based on the standard Linux CLI. Also, the main thing missing from this library currently is any kind of glob handling that a lot of the CLI supports.
One thing that would need to be decided is whether to try to make it similar to the Hadoop CLI, or make it more based on the standard Linux CLI.
It is probably better to keep these two(hadoop CLI and hdfs-native) as close to each other as possible unless the cost of keeping them close is too high. I see this CLI more as a replacement of Hadoop CLI in containers, especially in jupyter environment. That being said, I am not opposed to the idea of alternatives either :)
It would be great to support shell commands from https://hadoop.apache.org/docs/stable/hadoop-project-dist/hadoop-common/FileSystemShell.html
I believe this implementation should be straightforward for python.
Found an example where this is done for golang https://github.com/colinmarc/hdfs
The text was updated successfully, but these errors were encountered: