A Cluster System for IRIMO

We designed and implemented a Cluster System for Iranian Meteorological Organization (IRIMO). Based on the needs of IRIMO, it consists of 48 computing nodes, 2 special nodes, and 4 spare nodes. Special nodes are a “Master & Configure node” and a “Storage & Login node”. The main application of the system is MM5 program which fully supports both MPI and OpenMP; hence, it can effectively deploy the computing resources of the system.

Compute nodes are the computational heart of the cluster system. The duty of Master and Configure node is to monitor and manage the whole activities of cluster and the related components. In addition to above, the administrator of the system can install and configure dedicated software on compute nodes through this node. The Storage and Login Node with a storage space of about 1TB is assigned dual duties. First, not only can the system’s users benefit from this space as storage space, but also the compute nodes can use it as a shared storage. RAID 5 was selected to improve the reliability of the system while keeping the cost of the design low. Moreover, this node plays the role of login node for the system. In other words, it is a gateway from outside to the cluster. Users must login to this node to compile or run their jobs.

A variety of useful software is installed on the system to ease effective operations. Since we have employed standard OS, compiler, and MPI library for our system, all supporting weather applications including ALADIN, VGA, ARPS, and MM5 can be installed easily. Moreover, we have implemented a user interface which is a graphical web-based interface to monitor and manage the cluster system. The core engine of this GUI is developed by php, and enjoys two modes: Administrator and User.