User Tools

Site Tools


Sidebar


Wiki

Info / Resources

Guides

Software

Sample Pages

Quick Navigation

tutorial:torque

This is an old revision of the document!


Torque

TORQUE provides control over batch jobs and distributed computing resources. It is an advanced open-source product based on the original PBS project and incorporates the best of both community and professional development. It incorporates significant advances in the areas of scalability, reliability, and functionality and is currently in use at tens of thousands of leading government, academic, and commercial sites throughout the world. TORQUE may be freely used, modified, and distributed under the constraints of the included license.

Hardware

We have 16 nodes

Nodes CPUs Memory Notes
polyp1–polyp15 16 AMD Opteron™ Processor 6128 32 GB
polyp30 24 Intel(R) Xeon(R) CPU E5-2620 v3 @ 2.40GHz 128 GB 2x K80 (4GPUs)

Submitting Jobs

Jobs can be submitted either using a submission file or directly from command line. First we explain how it is done and then we will discuss the options.

Using submission script

We will create a file test.pbs

test.pbs
#PBS -N JobName
#PBS -e /home/mat614/TEST.err
#PBS -o /home/mat614/TEST.out
#PBS -l nodes=1:ppn=4 
#PBS -q batch
 
cd /home/mat614/
./test_code
sleep 60

First few lines contains settings for the job. This is followed by commands for running a particular job. The job can be submitted by running qsub test.pbs

Directly submitting job

You do not need to write submission script. However, you can submit only bash script by default. Let's create a file myscript.sh which contains following

myscript.sh
cd /home/mat614/
./test_code

If you do not want to write the submission script you can do it just by calling

qsub -N JobName -q batch -l nodes=1:pnn=2  myscript.sh

Now, we will run the code but we are setting the job parameters using - character (e.g. -N JobName)

Important Options

  • -q <queue> set the queue. Often you will use the standard queue, so no need to set this up.
  • -V will pass all environment variables to the job
  • -v var[=value] will specifically pass environment variable 'var' to the job
  • -b y allow command to be a binary file instead of a script.
  • -w e verify options and abort if there is an error
  • -N <jobname> name of the job. This you will see when you use qstat, to check status of your jobs.
  • -l resource_list specify resources
  • -l h_rt=<hh:mm:ss> specify the maximum run time (hours, minutes and seconds)
  • -l s_rt=hh:mm:ss specify the soft run time limit (hours, minutes and seconds) - Remember to set both s_rt and h_rt.
  • -cwd run in current working directory
  • -wd <dir> Set working directory for this job as <dir>
  • -o <output_logfile> name of the output log file
  • -e <error_logfile> name of the error log file
  • -m ea Will send email when job ends or aborts
  • -P <projectName> set the job's project
  • -M <emailaddress> Email address to send email to
  • -t <start>-<end>:<incr> submit a job array with start index , stop index in increments using

See THIS for more details

tutorial/torque.1475846041.txt.gz · Last modified: 2016/10/07 09:14 by sertalpbilal