Batch Basics

Batch Processing Terminology


These terms are specific to HTCondor, our batch processing scheduler:


  • Node - A processor (or set of processors) capable of running a job

  • Pool - A collection of nodes

  • Job - A single process with an executable, arguments, input, output, and error files

  • Cluster - A collection of jobs that share common executables and/or input files

  • Queue - The list of jobs that have been submitted to run on the pool

  • Scheduler - A process responsible for determining which jobs in the queue are run next
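
To make the Job and Cluster terms concrete, the sketch below builds the text of an HTCondor submit description for one cluster of jobs that share a single executable. It is only an illustration under stated assumptions: the function name render_submit_description and the file names (analyze.sh, input_N.txt, out_N.txt, err_N.txt, cluster.log) are hypothetical and do not come from this page. The keywords executable, arguments, input, output, error, log, and queue, and the $(Process) macro, are standard HTCondor submit description syntax.

```python
def render_submit_description(executable: str, n_jobs: int) -> str:
    """Return submit-description text for one cluster of n_jobs jobs.

    All file names below are hypothetical placeholders.
    """
    if n_jobs < 1:
        raise ValueError("a cluster needs at least one job")
    lines = [
        # The executable is shared by every job in the cluster.
        f"executable = {executable}",
        # $(Process) expands to 0, 1, 2, ... so each job gets its own files.
        "arguments  = $(Process)",
        "input      = input_$(Process).txt",
        "output     = out_$(Process).txt",
        "error      = err_$(Process).txt",
        "log        = cluster.log",
        # "queue N" places N jobs in the queue as a single cluster.
        f"queue {n_jobs}",
    ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(render_submit_description("analyze.sh", 3), end="")
```

Each of the three jobs in this example has its own arguments, input, output, and error files, while the shared executable is what makes them one cluster.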

Batch Processing Utilities 

  • condor_status - shows the status of all of the nodes in the pool

  • condor_q - shows the status of all jobs in the queue

  • condor_submit - submits a cluster of jobs to the queue

  • condor_submit_util - RCE helper application that automates the submission process (use it in place of condor_submit in most cases)

  • condor_userprio - shows usage statistics and priorities for users who are actively using pool resources
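
The read-only utilities in the list above can also be called from a script. The following is a minimal sketch, assuming the HTCondor command-line tools are on the PATH of the machine where it runs; the helper name run_if_available is hypothetical. Each command is run with no options, and a command that is not installed is skipped rather than treated as an error.

```python
import shutil
import subprocess
from typing import Optional


def run_if_available(command: str) -> Optional[str]:
    """Run a command with no arguments and return its standard output.

    Returns None when the command is not found on PATH.
    """
    if shutil.which(command) is None:
        return None
    result = subprocess.run(
        [command], capture_output=True, text=True, check=False
    )
    return result.stdout


if __name__ == "__main__":
    # Only the utilities that report status are called here; nothing is submitted.
    for name in ("condor_status", "condor_q", "condor_userprio"):
        output = run_if_available(name)
        if output is None:
            print(f"{name}: not found on this machine")
        else:
            print(f"--- {name} ---")
            print(output)
```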

What do I need to get started?
To get started setting up and submitting a job, follow the Batch Workflow link in the left menu.