...

  1. The requested CPUs are not available for the requested walltime because the cluster is busy (example1).
  2. The requested RAM (memory) is not available for the requested walltime.
  3. The requested RAM (memory) per host does not exist on any host in the cluster. The output from bjobs does not state this reason directly.
  4. The requested GPUs are not available for the requested walltime.

    GPU example:

    > bjobs -p3 -l ..
    Tue Sep 8 16:01:28: Submitted from host , CWD <$HOME>, Requested Resources <select[gpu_model0=='geforcegtx1000']>, Requested GPU;
    PENDING REASONS:
    Candidate host pending reasons (0 of 123 hosts).
    Non-candidate host pending reasons (123 of 123 hosts):
      Job's resource requirements not satisfied: lp35, lx10, lx11, lx12, lx13, lx14, boson, lt01, lt02, lt03, lt04, lt05, lt06, lt07, lt08 …..
      Not specified in job submission: ld01, ld02, ld03, ld04, ld05, ld07, lv01, l i01, lila-sched01, lila-sched02;
      Load information unavailable: ld06, lg05, lp08, lp09, ls10, ls18, lp21, lp26
      Closed by LSF administrator: lw01, lw02, ls05, lu04, lu05, lx09;
    RUNLIMIT 10.0 min


    This job won't run because the gpu_model0 value given to bsub is incorrect, so no host on the Lilac cluster offers the requested resource (candidate hosts: 0). The correct name is GeForceGTX1080:

    > lshosts -gpu
    HOST_NAME  gpu_id  gpu_model       gpu_driver  gpu_factor  numa_id
    ls01       0       GeForceGTX1080  440.33.01   6.1         0
               1       GeForceGTX1080  440.33.01   6.1         0
               2       GeForceGTX1080  440.33.01   6.1         1
               3       GeForceGTX1080  440.33.01   6.1         1


  5. The requested GPU type doesn't exist on the cluster. The correct names of the available GPU models can be found with lshosts -gpu.
  6. Nodes are in a system-level reservation used for a rolling upgrade or for scheduling cluster-level downtime.
  7. Nodes are reserved under an SLA.
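
For reason 5, the GPU model name can be checked before submitting. The following is a minimal sketch, not an official tool: the helper names list_gpu_models and gpu_model_exists are hypothetical, and the parsing assumes that lshosts -gpu prints the six columns shown in the GPU example, with the host name omitted on continuation rows and no spaces inside model names. The sample rows are copied from that example so the sketch runs without access to the cluster.

```shell
#!/bin/sh
# Hypothetical helper: print each distinct GPU model name found in
# `lshosts -gpu` output read from stdin.
list_gpu_models() {
    awk 'NR > 1 {
        # Assumption: the first row for a host has 6 fields (host name first);
        # continuation rows omit the host name and have 5 fields.
        model = (NF >= 6) ? $3 : $2
        if (!(model in seen)) { seen[model] = 1; print model }
    }'
}

# Hypothetical helper: succeed only if the given name matches a listed
# model exactly (the comparison is case-sensitive).
gpu_model_exists() {
    list_gpu_models | grep -qxF -- "$1"
}

# Sample rows copied from the lshosts -gpu output in the GPU example.
sample='HOST_NAME gpu_id gpu_model gpu_driver gpu_factor numa_id
ls01 0 GeForceGTX1080 440.33.01 6.1 0
1 GeForceGTX1080 440.33.01 6.1 0
2 GeForceGTX1080 440.33.01 6.1 1
3 GeForceGTX1080 440.33.01 6.1 1'

# Check the misspelled name from the pending job and the correct name.
for name in geforcegtx1000 GeForceGTX1080; do
    if printf '%s\n' "$sample" | gpu_model_exists "$name"; then
        echo "$name: available"
    else
        echo "$name: not found"
    fi
done
```

On the cluster, the sample data would be replaced by the live command, for example lshosts -gpu | gpu_model_exists GeForceGTX1080, and a name that passes the check can then be used in the select[gpu_model0=='...'] resource requirement.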

...