Tang, Hongyan; Li, Ying; Jia, Tong; Yuan, Xiaoyong; Wu, … - In: International Journal of Distributed Systems and … 9 (2018) 1, pp. 16-38
To better understand task failures in cloud computing systems, the authors analyze the failure frequency of tasks in the Google cluster dataset and identify frequently failing tasks that suffer from long-term failures and repeated rescheduling, which are called killer tasks as they can be a big...