Help:Cluster

From MMAE

Jump to: navigation, search

This is general documentation for all clusters at UCF. Some may not apply to your specific cluster.

You can view the status of all known clusters in CECS at http://www2.mmae.ucf.edu/ganglia/

See also the general cluster notes.

Contents

[edit] Cluster status and news blogs

Please subscribe to or regularly check the blogs below relevant to your cluster use:

[edit] external documentation

The cluster software is composed of various components, each of which has their own documentation.

See also Steve's cluster research page.

[edit] tutorial

This is an outline of a future tutorial.

[edit] general info

  • Monitor cluster status with ganglia
  • use ssh to enter the cluster
  • use ssh or rsh to get between nodes if necessary
  • use MPI to program and start your job if appropriate
  • use Xming or similar to access the system from windows if you need graphics

[edit] specific info

(These should probably be expanded--ask if you need details.)

  • Use the batch queue systems appropriate for your cluster to schedule your jobs, such as Lava or Sun grid engine
  • Watch out for hidden performance penalties if hyperthreading is enabled.
  • your job can be optimized in various ways.
  • please use ganglia to make sure your job is not left running accidentally and does not interfere with others' jobs.
  • Be aware of disk quotas, move files to an appropriate storage location (or delete) when you are done with them.

[edit] Cluster programming guides

Personal tools