-
Notifications
You must be signed in to change notification settings - Fork 23
/
FAQ-USER
91 lines (61 loc) · 2.96 KB
/
FAQ-USER
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
FAQ - USER
==========
Release policy
--------------
Since the version 2.2, release numbers are divided into 3 parts:
- The first represents the design and the implementation used.
- The second represents a set of OAR functionalities.
- The third is incremented after bug fixes.
How can I submit a moldable job?
--------------------------------
You just have to use several "-l" oarsub_ option (one for each moldable
description). By default the OAR scheduler will launch the moldable job which
will end first.
So you can see some free resources but the scheduler can decide to start your
job later because they will have more free resources and the job walltime will
be smaller.
How can I submit a job with a non uniform description?
------------------------------------------------------
Example:
::
oarsub -I -l '{switch="sw1" or switch="sw5"}/switch=1+/node=1'
This example asks OAR to reserve all resources from the switch sw1 or the
switch sw2 **and** a node on another switch.
You can see the "+" syntax as a sub-reservation directive.
Can I perform a fix scheduled reservation and then launch several jobs in it?
-----------------------------------------------------------------------------
Yes. You have to use the OAR scheduler "timesharing" feature.
To use it, the reservation and your further jobs must be of the type
timesharing (only for you).
Example:
1. Make your reservation:
::
oarsub -r "2006-09-12 8:00:00" -l /switch=1 -t 'timesharing=user,*'
This command asks all resources from one switch at the given date for the
default walltime. It also specifies that this job can be shared with
himself and without a constraint on the job name.
2. Once your reservation has begun then you can launch:
::
oarsub -I -l /node=2,walltime=0:50:00 -p 'switch="'scheduled_switch_name'\
-t 'timesharing=user,*'
So this job will be scheduled on nodes assigned from the previous reservation.
The "timesharing" oarsub_ command possibilities are enumerated in Timesharing_.
How can a checkpointable job be resubmitted automatically?
----------------------------------------------------------
You have to specify that your job is *idempotent* and exit from your script
with the exit code 99. So, after a successful checkpoint, if the job is
resubmitted then all will go right and there will have no problem (like file
creation, deletion, ...).
Example:
::
oarsub --checkpoint 600 --signal 2 -t idempotent /path/to/prog
So this job will send a signal *SIGINT* (see *man kill* to know signal
numbers) 10 minutes before the walltime ends. Then if everything goes
well and the exit code is 99 it will be resubmitted.
How to submit a non disturbing job for other users?
---------------------------------------------------
You can use the *besteffort* job type. Thus your job will be launched only
if there is a hole and will be deleted if another job wants its resources.
Example:
::
oarsub -t besteffort /path/to/prog