-
Notifications
You must be signed in to change notification settings - Fork 549
[Feedback v0.14.0] how to use Team wise storage plugin via nfs ? #4001
Comments
Which version of PAI do you use. If you use 0.14 release, please refer to doc: https://github.com/microsoft/pai/tree/pai-0.14.y/contrib/storage_plugin This doc has been updated in master branch |
How about 0.16? I have tried for few days without any progress... |
@shiyemin you can try according to this doc: https://github.com/microsoft/pai/tree/master/contrib/storage_plugin, and check secret in pai-storage namespace to make sure it take effect |
Do you assume the groups are managered by winbind or AAD? I can not find a way to create group. If the storage is configured for 'default' or 'admingroup' groups, then the rest-server will fail. |
After create group and add user to the group manually, the NFS works now. |
|
my openpai version = 0.14.0 but when i use root to submit a job the data show nothing . |
I wrote a small program to manipulate group and grouplist of user. Then "python storagectl.py groupsc set" will work. |
@Binyang2014 my pai version is 0.14.0 and i read https://github.com/microsoft/pai/tree/pai-0.14.y/contrib/storage_plugin doc . but it still show nothing as above. how could i do? is there somthing i miss? via nfs i should create user first ? |
@shiyemin thx u , but in v0.14.0 storagectl.py couldn't use groupsc as param |
In 0.14, you will have to use "default" as config name. |
@Binyang2014 the config already show in k8s config |
@wangdian Can you take a look? |
@apri30th Do you know which group the |
@Binyang2014 thank you for your reply , i can see the data now. the default group name is 'default' and the root user belongs to default group |
Hi, @Binyang2014 |
Hi @nan0755 The team-wise plugin should run after PAI service start |
@Binyang2014
It seems my NFS server has some version incompatible problem with PAI's mount command. Thanks! |
@zeyu-hello Can you show me your job config? And full logs include stdout and stderr? |
1. Team-wise job config(note that the python storagectl.py server set nfsserver nfs 10.10.30.90 /volume
python storagectl.py config set confignfs default -s nfsserver -m /data nfsserver data_folder -d 2. Stdout logpython-crypto_2.6.1-6ubuntu0.16.04.3_amd64.deb ...
Unpacking python-crypto (2.6.1-6ubuntu0.16.04.3) ...
Selecting previously unselected package python-ldb.
Preparing to unpack .../python-ldb_2%3a1.1.24-1ubuntu3.1_amd64.deb ...
Unpacking python-ldb (2:1.1.24-1ubuntu3.1) ...
Selecting previously unselected package python-tdb.
Preparing to unpack .../python-tdb_1.3.8-2_amd64.deb ...
Unpacking python-tdb (1.3.8-2) ...
Selecting previously unselected package python-talloc.
Preparing to unpack .../python-talloc_2.1.5-2_amd64.deb ...
Unpacking python-talloc (2.1.5-2) ...
Selecting previously unselected package samba-libs:amd64.
Preparing to unpack .../samba-libs_2%3a4.3.11+dfsg-0ubuntu0.16.04.24_amd64.deb ...
Unpacking samba-libs:amd64 (2:4.3.11+dfsg-0ubuntu0.16.04.24) ...
Selecting previously unselected package python-samba.
Preparing to unpack .../python-samba_2%3a4.3.11+dfsg-0ubuntu0.16.04.24_amd64.deb ...
Unpacking python-samba (2:4.3.11+dfsg-0ubuntu0.16.04.24) ...
Selecting previously unselected package samba-common-bin.
Preparing to unpack .../samba-common-bin_2%3a4.3.11+dfsg-0ubuntu0.16.04.24_amd64.deb ...
Unpacking samba-common-bin (2:4.3.11+dfsg-0ubuntu0.16.04.24) ...
Selecting previously unselected package sshpass.
Preparing to unpack .../sshpass_1.05-1_amd64.deb ...
Unpacking sshpass (1.05-1) ...
Processing triggers for libc-bin (2.23-0ubuntu11) ...
Processing triggers for systemd (229-4ubuntu21.22) ...
Setting up libpopt0:amd64 (1.16-10) ...
Setting up libnfsidmap2:amd64 (0.25-5) ...
Setting up libwbclient0:amd64 (2:4.3.11+dfsg-0ubuntu0.16.04.24) ...
Setting up samba-common (2:4.3.11+dfsg-0ubuntu0.16.04.24) ...
debconf: unable to initialize frontend: Dialog
debconf: (No usable dialog-like program is installed, so the dialog based frontend cannot be used. at /usr/share/perl5/Debconf/FrontEnd/Dialog.pm line 76.)
debconf: falling back to frontend: Readline
Creating config file /etc/samba/smb.conf with new version
Setting up libtalloc2:amd64 (2.1.5-2) ...
Setting up cifs-utils (2:6.4-1ubuntu1.1) ...
Setting up keyutils (1.5.9-8ubuntu1) ...
Setting up libevent-2.0-5:amd64 (2.0.21-stable-2ubuntu0.16.04.1) ...
Setting up libtdb1:amd64 (1.3.8-2) ...
Setting up libtevent0:amd64 (0.9.28-0ubuntu0.16.04.1) ...
Setting up libldb1:amd64 (2:1.1.24-1ubuntu3.1) ...
Setting up libtirpc1:amd64 (0.2.5-1ubuntu0.1) ...
Setting up rpcbind (0.2.3-0.2) ...
invoke-rc.d: could not determine current runlevel
invoke-rc.d: policy-rc.d denied execution of start.
Setting up nfs-common (1:1.2.8-9ubuntu12.2) ...
debconf: unable to initialize frontend: Dialog
debconf: (No usable dialog-like program is installed, so the dialog based frontend cannot be used. at /usr/share/perl5/Debconf/FrontEnd/Dialog.pm line 76.)
debconf: falling back to frontend: Readline
Creating config file /etc/idmapd.conf with new version
debconf: unable to initialize frontend: Dialog
debconf: (No usable dialog-like program is installed, so the dialog based frontend cannot be used. at /usr/share/perl5/Debconf/FrontEnd/Dialog.pm line 76.)
debconf: falling back to frontend: Readline
Creating config file /etc/default/nfs-common with new version
Adding system user `statd' (UID 107) ...
Adding new user `statd' (UID 107) with group `nogroup' ...
Not creating home directory `/var/lib/nfs'.
invoke-rc.d: unknown initscript, /etc/init.d/gssd not found.
invoke-rc.d: could not determine current runlevel
invoke-rc.d: unknown initscript, /etc/init.d/idmapd not found.
invoke-rc.d: could not determine current runlevel
Setting up python-crypto (2.6.1-6ubuntu0.16.04.3) ...
Setting up python-ldb (2:1.1.24-1ubuntu3.1) ...
Setting up python-tdb (1.3.8-2) ...
Setting up python-talloc (2.1.5-2) ...
Setting up samba-libs:amd64 (2:4.3.11+dfsg-0ubuntu0.16.04.24) ...
Setting up python-samba (2:4.3.11+dfsg-0ubuntu0.16.04.24) ...
Setting up samba-common-bin (2:4.3.11+dfsg-0ubuntu0.16.04.24) ...
Setting up sshpass (1.05-1) ...
Processing triggers for libc-bin (2.23-0ubuntu11) ...
Processing triggers for systemd (229-4ubuntu21.22) ... 3. Stderr logdebconf: delaying package configuration, since apt-utils is not installed
mount.nfs4: Protocol not supported |
Hi, @Binyang2014 , thanks for your reply. 1. docker image in job config
2. Stdout log
3. Stderr log
4. PAI auto-generated command is incompatible with nfs server versionI noticed the auto-generated command using
However, my nfs server version is:
So, if I cannot change my nfs server version due to some reason, can PAI support this nfs version? |
I finally change the nfs server to support nfs4 to solve this probelm. Thanks for your help anyway! @Binyang2014 |
i read the Team wise storage doc and do this as fallow
python storagectl.py server set nfsserver nfs 172.18.67.7 /data/nfs_data
python storagectl.py config set confignfs paigroup -s nfsserver -m /data nfsserver nfs_data -m /user nfsserver 'users/${PAI_USER_NAME}' -d
python storagectl.py groupsc add paigroup confignfs(this command fail and the log is storagectl.py: error: invalid choice: 'groupsc' (choose from 'server', 'config', 'user'))
how can i use nfs plugin ? any suggestions?
The text was updated successfully, but these errors were encountered: