Analytical modelling and simulation of small scale, typical and highly available Beowulf clusters with breakdowns and repairs


EVER E., Gemikonakli O., Chakka R.

Simulation Modelling Practice and Theory, vol.17, no.2, pp.327-347, 2009 (SCI-Expanded)

  • Publication Type: Article / Full Article
  • Volume: 17 Issue: 2
  • Publication Date: 2009
  • DOI: 10.1016/j.simpat.2008.08.016
  • Journal Name: Simulation Modelling Practice and Theory
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus
  • Pages: pp.327-347
  • Keywords: Clustering, High performance computing, Markov processes, Performability, Queuing theory
  • Affiliated with Middle East Technical University Northern Cyprus Campus: Yes

Abstract

Beowulf clusters are very popular because of the high computational power they can provide at reasonably low costs. However, the most pressing issues of today's cluster solutions are the need for high availability and performance. Cluster systems are clearly prone to failures. Even if cover is provided with some probability c, there would be reconfiguration and/or rebooting delays to resume the operation following a failure. In this paper, the performability modelling of both typical and highly available Beowulf multiprocessor systems is presented. The models developed provide a large degree of flexibility to evaluate the performability of typical and highly available Beowulf cluster systems. © 2008 Elsevier B.V. All rights reserved.
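
The role of the coverage probability c mentioned in the abstract can be illustrated with a small toy example. The sketch below is not the model from the paper: it assumes a single node described by a three-state continuous-time Markov chain, in which a failure (rate xi) is covered with probability c and leads to a short reconfiguration delay (completion rate delta), or is uncovered with probability 1 - c and leads to a longer reboot (completion rate beta). The function name, the state layout, and all rate values are hypothetical and chosen only for illustration.

```python
def steady_state(xi, c, delta, beta):
    """Steady-state probabilities (up, reconfiguring, rebooting) of a toy
    three-state Markov chain. All parameters are illustrative assumptions:
      xi    - failure rate of the node
      c     - coverage probability (failure handled by reconfiguration)
      delta - rate at which reconfiguration completes
      beta  - rate at which a reboot completes
    """
    if not 0.0 <= c <= 1.0:
        raise ValueError("coverage probability c must lie in [0, 1]")
    if min(xi, delta, beta) <= 0:
        raise ValueError("rates must be positive")

    # Balance equations, with the 'up' probability as the reference:
    #   p_reconfig * delta = p_up * c * xi
    #   p_reboot   * beta  = p_up * (1 - c) * xi
    reconfig_ratio = c * xi / delta
    reboot_ratio = (1.0 - c) * xi / beta

    # Normalise so that the three probabilities sum to one.
    p_up = 1.0 / (1.0 + reconfig_ratio + reboot_ratio)
    return p_up, p_up * reconfig_ratio, p_up * reboot_ratio


if __name__ == "__main__":
    # Hypothetical rates (per hour): one failure per 1000 hours,
    # reconfiguration in about 6 minutes, reboot in about 30 minutes.
    for c in (0.5, 0.9, 0.99):
        up, reconfig, reboot = steady_state(xi=0.001, c=c, delta=10.0, beta=2.0)
        print(f"c={c}: up={up:.6f} reconfig={reconfig:.6f} reboot={reboot:.6f}")
```

Under these assumed rates, raising c shifts probability mass from the slower reboot state to the faster reconfiguration state, so the fraction of time the node is up increases, but it never reaches one because every failure still incurs some delay. A performability study of a full cluster would additionally have to track the number of operative nodes and the job queue, which this sketch leaves out.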