TORQUE provides control over batch jobs and distributed computing resources. It is an advanced open-source product based on the original PBS project and incorporates the best of both community and professional development. It incorporates significant advances in the areas of scalability, reliability, and functionality and is currently in use at tens of thousands of leading government, academic, and commercial sites throughout the world. TORQUE may be freely used, modified, and distributed under the constraints of the included license.
We have 16 nodes:
Nodes | CPUs | Memory | Notes |
---|---|---|---|
polyp1–polyp15 | 16 AMD Opteron™ Processor 6128 | 32 GB | — |
polyp30 | 24 Intel(R) Xeon(R) CPU E5-2620 v3 @ 2.40GHz | 128 GB | 2x K80 (4 GPUs) |
Jobs can be submitted either with a submission file or directly from the command line. We first show both methods and then discuss the available options.
We will create a file test.pbs:
    #PBS -N JobName
    #PBS -e /home/mat614/TEST.err
    #PBS -o /home/mat614/TEST.out
    #PBS -l nodes=1:ppn=4
    #PBS -q batch
    cd /home/mat614/
    ./test_code
    sleep 60
The first few lines contain the settings for the job. They are followed by the commands that the job runs. The job can be submitted by running qsub test.pbs
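As a minimal sketch of the workflow above, the following snippet writes the submission file and submits it only when qsub is available on the PATH. The paths and the program test_code are taken from the example. The `#!/bin/bash` line and the follow-up qstat call are additions for illustration and assume a standard TORQUE client installation.

```shell
#!/bin/bash
# Write the submission file from the example above.
# The quoted 'EOF' keeps the shell from expanding anything inside the heredoc.
cat > test.pbs <<'EOF'
#!/bin/bash
#PBS -N JobName
#PBS -e /home/mat614/TEST.err
#PBS -o /home/mat614/TEST.out
#PBS -l nodes=1:ppn=4
#PBS -q batch
cd /home/mat614/
./test_code
sleep 60
EOF

# Submit only where the TORQUE client tools are installed.
if command -v qsub >/dev/null 2>&1; then
    job_id=$(qsub test.pbs)      # qsub prints the job identifier on success
    echo "Submitted job: ${job_id}"
    qstat "${job_id}"            # show the state of the job just submitted
else
    echo "qsub not found; test.pbs was written but not submitted"
fi
```

On a machine without TORQUE the snippet only creates test.pbs, which makes it easy to inspect the file before submitting it on the cluster.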
You do not need to write a submission script. However, by default only a bash script can be submitted. Let's create a file myscript.sh which contains the following:
    cd /home/mat614/
    ./test_code
To submit it without a submission script, pass the job settings directly to qsub:

    qsub -N JobName -q batch -l nodes=1:ppn=2 myscript.sh
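The command-line variant can be sketched as follows. The snippet creates myscript.sh, collects the job settings in a bash array, and falls back to printing the command when qsub is not installed. The array name qsub_args and the dry-run branch are illustrative additions. The resource keyword is ppn (processors per node).

```shell
#!/bin/bash
# Create the plain bash script that the job will run.
cat > myscript.sh <<'EOF'
#!/bin/bash
cd /home/mat614/
./test_code
EOF

# Job settings passed on the command line instead of as #PBS lines.
qsub_args=(-N JobName -q batch -l nodes=1:ppn=2)

if command -v qsub >/dev/null 2>&1; then
    qsub "${qsub_args[@]}" myscript.sh
else
    # Dry run: show the command that would be executed on the cluster.
    echo "qsub ${qsub_args[*]} myscript.sh"
fi
```

Keeping the options in an array avoids quoting problems if a value, such as a job name, ever contains spaces.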
Here we run the same code, but the job parameters are set with options introduced by the - character (e.g. -N JobName). The main options are:
* -q <queue>: set the queue. Often you will use the standard queue, so there is no need to set this.
* -V: pass all environment variables to the job.
* -v var[=value]: pass the environment variable 'var' to the job.
* -b y: allow the command to be a binary file instead of a script.
* -w e: verify the options and abort if there is an error.
* -N <jobname>: name of the job. You will see this name when you use qstat to check the status of your jobs.
* -l resource_list: specify resources.
* -l h_rt=<hh:mm:ss>: specify the maximum run time (hours, minutes and seconds).
* -l s_rt=<hh:mm:ss>: specify the soft run time limit (hours, minutes and seconds). Remember to set both s_rt and h_rt.
* -cwd: run in the current working directory.
* -wd <dir>: set the working directory for this job to <dir>.
* -o <output_logfile>: name of the output log file.
* -e <error_logfile>: name of the error log file.
* -m ea: send an email when the job ends or aborts.
* -P <projectName>: set the job's project.
* -M <emailaddress>: email address to send the email to.
* -t <start>-<end>:<incr>: submit a job array with start index <start> and stop index <end>, in increments of <incr>.

See THIS for more details.

Note that several entries in this list (-cwd, -wd, -b y, -w e, h_rt, s_rt) use Grid Engine syntax. In TORQUE, the run time limit is requested with -l walltime=<hh:mm:ss> and the working directory is set with -d <dir>. Check man qsub on the cluster before relying on a particular option.
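
A job array can be sketched as below, assuming that TORQUE exports the index of each task in the environment variable PBS_ARRAYID. The file name array.pbs, the job name ArrayJob, and the input_N.dat naming scheme are hypothetical and only serve the illustration. The last line runs a single task locally to check the script logic without a scheduler.

```shell
#!/bin/bash
# Hypothetical array job script: each task selects its own input file.
cat > array.pbs <<'EOF'
#!/bin/bash
#PBS -N ArrayJob
#PBS -q batch
#PBS -l nodes=1:ppn=1
#PBS -t 1-4
# PBS_ARRAYID is assumed to hold this task's index; default to 1 when run by hand.
task_id=${PBS_ARRAYID:-1}
echo "Task ${task_id} would process input_${task_id}.dat"
EOF

# Simulate task 3 locally; on the cluster the file is submitted with: qsub array.pbs
PBS_ARRAYID=3 bash array.pbs
```

Run locally, the script prints "Task 3 would process input_3.dat". On the cluster, one such task would be started for each index in the requested range.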