User Tools

Site Tools


Sidebar


Wiki

Info / Resources

Guides

Software

Sample Pages

Quick Navigation

condor

This is an old revision of the document!


CONDOR

What is CONDOR

CONDOR is a job manager to schedule computational jobs.

Check the following link for an introduction to CONDOR.

Using CONDOR

Submitting Jobs

To submit a job via CONDOR, you need to create a .sub file. This sub file must include program that you will execute (matlab, cplex, etc…) along with the arguments (such as your file to be executed). It's an automated way to run programs.

FIXME Provide your files here to show different usage of programs!

Matlab

Here is an example .sub file which submits the matlab file 'test.m' to condor for running:

myexp.sub
# Specify the executable software, i.e. mathematica, mosek, etc
Executable = /usr/local/matlab/latest/bin/matlab
Universe   = vanilla
getenv     = true
# Specify argument file
arguments  = -nosplash -nodesktop -logfile test.log -r test
#request_cpus = 16
#request_memory = 2
# name output file 
output     = ./out.txt
# name error file
error      = ./error.txt
#name log file
log      = ./log.txt
transfer_executable = false
# Submit to queue
Queue

After making sure all the files you specified exists in the correct directory, use

condor_submit myexp.sub

to submit the file to condor.

Checking Jobs

To check the job progress, use command

condor_q -global   #this checks all the jobs on condor
condor_q -run      #this checks all running jobs
condor_q userid    #this checks all jobs under specific user name

Removing Jobs

First find the ID of the job you will terminate

condor_q userid

Then call

condor_rm ID

Example: I call condor_q sec312 to list all jobs belong to my username. This gives a list similar to this

-- Submitter: polyp1.ie.lehigh.edu : <128.180.35.200:50671> : polyp1.ie.lehigh.edu
 ID      OWNER            SUBMITTED     RUN_TIME ST PRI SIZE CMD
42989.0   sec312         10/25 19:56   0+00:00:29 R  0   0.0  symphony -F air04.
42989.1   sec312         10/25 19:56   0+00:00:29 R  0   0.0  symphony -F air05.
42989.5   sec312         10/25 19:56   0+00:00:28 R  0   0.0  symphony -F dsbmip

Now let say I want to terminate 42989.5. I call condor_rm 42989.5. CONDOR confirms by saying
Job 42989.5 marked for removal

Frequently Used CONDOR Commands

A summary of frequently used commands in CONDOR:

Command Action Basic Usage Example
condor_submit submit a job condor_submit [submit file] $ condor_submit job.condor |
| condor_q | show status of jobs | condor_q [cluster] | $ condor_q 1170
condor_rm remove jobs from the queue condor_rm [cluster] $ condor_rm 1170

Source

To submit MPI jobs to our condor pool you can check Dr. Takac's http://polyps.ie.lehigh.edu/mpi

condor.1417563242.txt.gz · Last modified: 1998/12/03 12:11 (external edit)