Skip to content
This repository has been archived by the owner on Feb 8, 2024. It is now read-only.

CORTX-33876: Log device size should be power of 2 with limits #2151

Merged
merged 1 commit into from
Aug 22, 2022

Conversation

Shreya-18
Copy link
Contributor

The validation for log device size was missing which resulted
in failure of cluster bootstrap. Default size
was taken from lsblk if not provided via confstore.

Solution:
Following motr constraints are considered:

  • the log device size upper and lower bounds is set to 4GB and
    128MB respectively.
  • The size of the log device is rounded off to the nearest
    power of 2.

Signed-off-by: Shreya Karmakar shreya.karmakar@seagate.com

cfgen/cfgen Show resolved Hide resolved
cfgen/cfgen Outdated Show resolved Hide resolved
@Shreya-18 Shreya-18 force-pushed the log_dev branch 2 times, most recently from 6cc485b to 948b2db Compare August 19, 2022 11:18
@d-nayak
Copy link
Contributor

d-nayak commented Aug 19, 2022

Tested PR on 1N bootstrap with loop devices.
A loop device of 10GB given as log device ends up with 4GB in the motr sysconfig file:

[root@centos01 sysconfig]# cat m0d-0x7200000000000001:0x2
MOTR_M0D_EP='inet:tcp:192.168.122.69@21003'
MOTR_HA_EP='inet:tcp:192.168.122.69@22001'
MOTR_PROCESS_FID='0x7200000000000001:0x2'
MOTR_NODE_UUID='4e74b656-1fb3-11ed-8fbb-525400cb70c9'
MOTR_LOG_PATH='/dev/loop9'
MOTR_LOG_SIZE='4294967296'

@vaibhavparatwar
Copy link
Contributor

retest this please

@pavankrishnat
Copy link
Contributor

@Shreya-18 Please check why 3N and 5N deployment checks are failing.

@pavankrishnat
Copy link
Contributor

retest this please

@Shreya-18
Copy link
Contributor Author

3N deployment is failing due to server pod timeout issue. Will test it manually and update it.

@vaibhavparatwar
Copy link
Contributor

3N deployment is failing due to server pod timeout issue. Will test it manually and update it.

@vaibhavparatwar
Copy link
Contributor

3N deployment is failing due to server pod timeout issue. Will test it manually and update it.

Ok.. i think it is because of CORTX-33786 for which we have PR #2152

The validation for log device size was missing which resulted
in failure of cluster bootstrap. Default size
was taken from lsblk if not provided via confstore.

Solution:
Following motr constraints are considered:
- the log device size upper and lower bounds is set to 4GB and
128MB respectively.
- The size of the log device is rounded off to the nearest
power of 2.

Signed-off-by: Shreya Karmakar <shreya.karmakar@seagate.com>
@Shreya-18
Copy link
Contributor Author

Deployed successfully on 5Node setup !!

CORTX Cluster Setup SUCCESS
Please check [cluster setup logs](https://eos-jenkins.colo.seagate.com/job/Cortx-Automation/job/RGW/job/setup-cortx-rgw-cluster/11383//console) for more info

Cluster Status:
==================== Image Details ====================

ghcr.io/seagate/consul:1.11.4
cortx-docker.colo.seagate.com/seagate/cortx-control:2.0.0-7608-custom-ci
cortx-docker.colo.seagate.com/seagate/cortx-data:2.0.0-7608-custom-ci
cortx-docker.colo.seagate.com/seagate/cortx-control:2.0.0-7608-custom-ci
ghcr.io/seagate/kafka:3.0.0-debian-10-r97
cortx-docker.colo.seagate.com/seagate/cortx-rgw:2.0.0-7608-custom-ci
ghcr.io/seagate/zookeeper:3.8.0-debian-10-r9

==================== POD Status ====================

NAME                             READY   STATUS    RESTARTS       AGE     IP               NODE                                    NOMINATED NODE   READINESS GATES
cortx-consul-client-4plwh        1/1     Running   0              3m22s   172.16.244.201   ssc-vm-rhev4-2451.colo.seagate.com                 
cortx-consul-client-dq8g5        1/1     Running   0              3m22s   172.16.115.5     ssc-vm-g4-rhev4-1589.colo.seagate.com              
cortx-consul-client-ksx9d        1/1     Running   0              3m22s   172.16.52.200    ssc-vm-rhev4-2635.colo.seagate.com                 
cortx-consul-client-rcbcn        1/1     Running   0              3m22s   172.16.18.200    ssc-vm-rhev4-2450.colo.seagate.com                 
cortx-consul-client-tq8tm        1/1     Running   0              3m22s   172.16.164.251   ssc-vm-g4-rhev4-1588.colo.seagate.com              
cortx-consul-server-0            1/1     Running   0              3m22s   172.16.18.202    ssc-vm-rhev4-2450.colo.seagate.com                 
cortx-consul-server-1            1/1     Running   0              3m22s   172.16.164.255   ssc-vm-g4-rhev4-1588.colo.seagate.com              
cortx-consul-server-2            1/1     Running   0              3m22s   172.16.115.15    ssc-vm-g4-rhev4-1589.colo.seagate.com              
cortx-control-76d5fd4c56-ppgrg   1/1     Running   0              3m22s   172.16.52.194    ssc-vm-rhev4-2635.colo.seagate.com                 
cortx-data-g0-0                  3/3     Running   0              3m20s   172.16.164.195   ssc-vm-g4-rhev4-1588.colo.seagate.com              
cortx-data-g0-1                  3/3     Running   0              3m19s   172.16.52.206    ssc-vm-rhev4-2635.colo.seagate.com                 
cortx-data-g0-2                  3/3     Running   0              3m19s   172.16.18.215    ssc-vm-rhev4-2450.colo.seagate.com                 
cortx-data-g0-3                  3/3     Running   0              3m18s   172.16.244.210   ssc-vm-rhev4-2451.colo.seagate.com                 
cortx-data-g0-4                  3/3     Running   0              3m17s   172.16.115.23    ssc-vm-g4-rhev4-1589.colo.seagate.com              
cortx-data-g1-0                  3/3     Running   0              3m22s   172.16.164.199   ssc-vm-g4-rhev4-1588.colo.seagate.com              
cortx-data-g1-1                  3/3     Running   0              3m21s   172.16.52.201    ssc-vm-rhev4-2635.colo.seagate.com                 
cortx-data-g1-2                  3/3     Running   0              3m20s   172.16.18.211    ssc-vm-rhev4-2450.colo.seagate.com                 
cortx-data-g1-3                  3/3     Running   0              3m19s   172.16.244.209   ssc-vm-rhev4-2451.colo.seagate.com                 
cortx-data-g1-4                  3/3     Running   0              3m18s   172.16.115.19    ssc-vm-g4-rhev4-1589.colo.seagate.com              
cortx-ha-57f7f95f9f-fws46        3/3     Running   2 (101s ago)   3m22s   172.16.52.196    ssc-vm-rhev4-2635.colo.seagate.com                 
cortx-kafka-0                    1/1     Running   0              3m22s   172.16.244.206   ssc-vm-rhev4-2451.colo.seagate.com                 
cortx-kafka-1                    1/1     Running   0              3m22s   172.16.164.198   ssc-vm-g4-rhev4-1588.colo.seagate.com              
cortx-kafka-2                    1/1     Running   0              3m22s   172.16.115.12    ssc-vm-g4-rhev4-1589.colo.seagate.com              
cortx-server-0                   2/2     Running   0              3m22s   172.16.115.10    ssc-vm-g4-rhev4-1589.colo.seagate.com              
cortx-server-1                   2/2     Running   0              3m22s   172.16.244.200   ssc-vm-rhev4-2451.colo.seagate.com                 
cortx-server-2                   2/2     Running   0              3m21s   172.16.18.208    ssc-vm-rhev4-2450.colo.seagate.com                 
cortx-server-3                   2/2     Running   0              3m21s   172.16.52.204    ssc-vm-rhev4-2635.colo.seagate.com                 
cortx-server-4                   2/2     Running   0              3m21s   172.16.164.200   ssc-vm-g4-rhev4-1588.colo.seagate.com              
cortx-zookeeper-0                1/1     Running   0              3m22s   172.16.164.254   ssc-vm-g4-rhev4-1588.colo.seagate.com              
cortx-zookeeper-1                1/1     Running   0              3m22s   172.16.18.206    ssc-vm-rhev4-2450.colo.seagate.com                 
cortx-zookeeper-2                1/1     Running   0              3m22s   172.16.244.205   ssc-vm-rhev4-2451.colo.seagate.com                 

--------------- Sleeping for 1min before checking hctl status.... ---------------


--------------- hctl status ---------------

Mon Aug 22 02:12:16 MDT 2022
Bytecount:
    critical : 0
    damaged : 0
    degraded : 0
    healthy : 0
Data pool:
    # fid name
    0x6f00000000000001:0x0 'storage-set-1__sns'
Profile:
    # fid name: pool(s)
    0x7000000000000001:0x0 'Profile_the_pool': 'storage-set-1__sns' 'storage-set-1__dix' None
Services:
    cortx-data-g0-0.cortx-data-headless.cortx.svc.cluster.local 
    [started]  hax                 0x7200000000000001:0x0          inet:tcp:cortx-data-g0-0.cortx-data-headless.cortx.svc.cluster.local@22001
    [started]  ioservice           0x7200000000000001:0x1          inet:tcp:cortx-data-g0-0.cortx-data-headless.cortx.svc.cluster.local@21001
    [started]  confd               0x7200000000000001:0x2          inet:tcp:cortx-data-g0-0.cortx-data-headless.cortx.svc.cluster.local@21002
    cortx-data-g0-1.cortx-data-headless.cortx.svc.cluster.local 
    [started]  hax                 0x7200000000000001:0x3          inet:tcp:cortx-data-g0-1.cortx-data-headless.cortx.svc.cluster.local@22001
    [started]  ioservice           0x7200000000000001:0x4          inet:tcp:cortx-data-g0-1.cortx-data-headless.cortx.svc.cluster.local@21001
    [started]  confd               0x7200000000000001:0x5          inet:tcp:cortx-data-g0-1.cortx-data-headless.cortx.svc.cluster.local@21002
    cortx-data-g0-2.cortx-data-headless.cortx.svc.cluster.local 
    [started]  hax                 0x7200000000000001:0x6          inet:tcp:cortx-data-g0-2.cortx-data-headless.cortx.svc.cluster.local@22001
    [started]  ioservice           0x7200000000000001:0x7          inet:tcp:cortx-data-g0-2.cortx-data-headless.cortx.svc.cluster.local@21001
    [started]  confd               0x7200000000000001:0x8          inet:tcp:cortx-data-g0-2.cortx-data-headless.cortx.svc.cluster.local@21002
    cortx-data-g0-3.cortx-data-headless.cortx.svc.cluster.local 
    [started]  hax                 0x7200000000000001:0x9          inet:tcp:cortx-data-g0-3.cortx-data-headless.cortx.svc.cluster.local@22001
    [started]  ioservice           0x7200000000000001:0xa          inet:tcp:cortx-data-g0-3.cortx-data-headless.cortx.svc.cluster.local@21001
    [started]  confd               0x7200000000000001:0xb          inet:tcp:cortx-data-g0-3.cortx-data-headless.cortx.svc.cluster.local@21002
    cortx-data-g0-4.cortx-data-headless.cortx.svc.cluster.local 
    [started]  hax                 0x7200000000000001:0xc          inet:tcp:cortx-data-g0-4.cortx-data-headless.cortx.svc.cluster.local@22001
    [started]  ioservice           0x7200000000000001:0xd          inet:tcp:cortx-data-g0-4.cortx-data-headless.cortx.svc.cluster.local@21001
    [started]  confd               0x7200000000000001:0xe          inet:tcp:cortx-data-g0-4.cortx-data-headless.cortx.svc.cluster.local@21002
    cortx-data-g1-0.cortx-data-headless.cortx.svc.cluster.local 
    [started]  hax                 0x7200000000000001:0xf          inet:tcp:cortx-data-g1-0.cortx-data-headless.cortx.svc.cluster.local@22001
    [started]  ioservice           0x7200000000000001:0x10         inet:tcp:cortx-data-g1-0.cortx-data-headless.cortx.svc.cluster.local@21001
    [started]  confd               0x7200000000000001:0x11         inet:tcp:cortx-data-g1-0.cortx-data-headless.cortx.svc.cluster.local@21002
    cortx-data-g1-1.cortx-data-headless.cortx.svc.cluster.local  (RC)
    [started]  hax                 0x7200000000000001:0x12         inet:tcp:cortx-data-g1-1.cortx-data-headless.cortx.svc.cluster.local@22001
    [started]  ioservice           0x7200000000000001:0x13         inet:tcp:cortx-data-g1-1.cortx-data-headless.cortx.svc.cluster.local@21001
    [started]  confd               0x7200000000000001:0x14         inet:tcp:cortx-data-g1-1.cortx-data-headless.cortx.svc.cluster.local@21002
    cortx-data-g1-2.cortx-data-headless.cortx.svc.cluster.local 
    [started]  hax                 0x7200000000000001:0x15         inet:tcp:cortx-data-g1-2.cortx-data-headless.cortx.svc.cluster.local@22001
    [started]  ioservice           0x7200000000000001:0x16         inet:tcp:cortx-data-g1-2.cortx-data-headless.cortx.svc.cluster.local@21001
    [started]  confd               0x7200000000000001:0x17         inet:tcp:cortx-data-g1-2.cortx-data-headless.cortx.svc.cluster.local@21002
    cortx-data-g1-3.cortx-data-headless.cortx.svc.cluster.local 
    [started]  hax                 0x7200000000000001:0x18         inet:tcp:cortx-data-g1-3.cortx-data-headless.cortx.svc.cluster.local@22001
    [started]  ioservice           0x7200000000000001:0x19         inet:tcp:cortx-data-g1-3.cortx-data-headless.cortx.svc.cluster.local@21001
    [started]  confd               0x7200000000000001:0x1a         inet:tcp:cortx-data-g1-3.cortx-data-headless.cortx.svc.cluster.local@21002
    cortx-data-g1-4.cortx-data-headless.cortx.svc.cluster.local 
    [started]  hax                 0x7200000000000001:0x1b         inet:tcp:cortx-data-g1-4.cortx-data-headless.cortx.svc.cluster.local@22001
    [started]  ioservice           0x7200000000000001:0x1c         inet:tcp:cortx-data-g1-4.cortx-data-headless.cortx.svc.cluster.local@21001
    [started]  confd               0x7200000000000001:0x1d         inet:tcp:cortx-data-g1-4.cortx-data-headless.cortx.svc.cluster.local@21002
    cortx-server-0.cortx-server-headless.cortx.svc.cluster.local 
    [started]  hax                 0x7200000000000001:0x1e         inet:tcp:cortx-server-0.cortx-server-headless.cortx.svc.cluster.local@22001
    [started]  rgw_s3              0x7200000000000001:0x1f         inet:tcp:cortx-server-0.cortx-server-headless.cortx.svc.cluster.local@22501
    cortx-server-1.cortx-server-headless.cortx.svc.cluster.local 
    [started]  hax                 0x7200000000000001:0x20         inet:tcp:cortx-server-1.cortx-server-headless.cortx.svc.cluster.local@22001
    [started]  rgw_s3              0x7200000000000001:0x21         inet:tcp:cortx-server-1.cortx-server-headless.cortx.svc.cluster.local@22501
    cortx-server-2.cortx-server-headless.cortx.svc.cluster.local 
    [started]  hax                 0x7200000000000001:0x22         inet:tcp:cortx-server-2.cortx-server-headless.cortx.svc.cluster.local@22001
    [started]  rgw_s3              0x7200000000000001:0x23         inet:tcp:cortx-server-2.cortx-server-headless.cortx.svc.cluster.local@22501
    cortx-server-3.cortx-server-headless.cortx.svc.cluster.local 
    [started]  hax                 0x7200000000000001:0x24         inet:tcp:cortx-server-3.cortx-server-headless.cortx.svc.cluster.local@22001
    [started]  rgw_s3              0x7200000000000001:0x25         inet:tcp:cortx-server-3.cortx-server-headless.cortx.svc.cluster.local@22501
    cortx-server-4.cortx-server-headless.cortx.svc.cluster.local 
    [started]  hax                 0x7200000000000001:0x26         inet:tcp:cortx-server-4.cortx-server-headless.cortx.svc.cluster.local@22001
    [started]  rgw_s3              0x7200000000000001:0x27         inet:tcp:cortx-server-4.cortx-server-headless.cortx.svc.cluster.local@22501
   

@d-nayak d-nayak merged commit 5d94737 into Seagate:main Aug 22, 2022
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants