Running big data components on kubernetes
- kubernetes cluster v1.18+
- kubectl with cluster admin role on linux
-
Planning Nodes for Service Deployment, eg:
namespace: bigdatanode services node1 nn1 rm1 dn nm node2 nn2 rm2 dn nm node3 zk dn nm kafka node4 zk dn nm kafka node5 zk dn nm kafka -
Modify the configuration file config.sh to suit your plan
./installl.sh
service | ports |
---|---|
zookeeper | 2181 |
journalnode | 8485 8480 |
namenode | 8020 9870 |
resourcemanager | 8088 |
histroyserver | 10020 19888 |
hive | 9083 10000 10002 |
spark | 12222 10000 |
trino | 8082 |
clickhouse | 8123 |
kafka | 9092 |
- namenode failover not implemented