Fire Phoenix cluster operating system kernel and its evaluation
Jianfeng Zhan
IEEE International Conference on Cluster Computing (Cluster 2005)
Boston, Massachusetts, USA, September 27 - 30, 2005
Abstract
The Fire Phoenix cluster operating system kernel (Phoenix kernel) is a minimal set of core cluster functions with scalability and fault-tolerance support. In this paper, we define the components of a cluster operating system kernel and introduce its internal mechanisms for scalability and fault-tolerance support. On top of the Phoenix kernel, user environments can be easily constructed, adapted, and extended according to users’ needs. We also evaluate the Phoenix kernel from four perspectives: fault tolerance, scalability, performance impact on scientific computing, and ease of constructing user environments. Our design has been proven in practice on the Dawning 4000A super server, which is the largest cluster system for scientific computing in China.