Data Engineering Project with Hadoop HDFS and Kafka
-
Updated
Nov 4, 2023 - Python
Data Engineering Project with Hadoop HDFS and Kafka
基于Hadoop的分布式云存储系统 🌴
A flexible database focused on performance and scalability
Simple File Operation Client for Multiple File Systems such as Hadoop HDFS, AWS S3, SFTP and etc.
News Sentiment Analysis using ETL pipeline
Thrift based Client library for Hadoop Distributed FileSystem (HDFS) <http://hadoop.apache.org/hdfs>
STM data enrichment, Extract, Transform, Load (e.g., ETL)
Dockerfile setup with HDFS client + Kerberos + AWS S3 features.
Java hadoop client that provides convenients api for file management and interaction with hadoop file system
Your go-to-cheatsheet to learn apache-Hadoop.
WIP: hdfs/libhdfs drop-in replacements without Java
Client application for HDFS clone ("SUFS")
hdfs-directory-utils ties provides Scala API for Zookeeper Watcher events
This service is a component inside the petroleum production information system that I conceived and proposed.
Big Data project. Web client for HDFS. Working in the terminal. Has ability to manipulate local and Hadoop storage
Add a description, image, and links to the hdfs-client topic page so that developers can more easily learn about it.
To associate your repository with the hdfs-client topic, visit your repo's landing page and select "manage topics."