Zhiling Lan
Titel
Zitiert von
Zitiert von
Jahr
System log pre-processing to improve failure prediction
Z Zheng, Z Lan, BH Park, A Geist
2009 IEEE/IFIP International Conference on Dependable Systems & Networks …, 2009
1262009
Toward automated anomaly identification in large-scale systems
Z Lan, Z Zheng, Y Li
IEEE Transactions on Parallel and Distributed Systems 21 (2), 174-187, 2009
1242009
Exploit failure prediction for adaptive fault-tolerance in cluster computing
Y Li, Z Lan
Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGRID …, 2006
1012006
A survey of load balancing in grid computing
Y Li, Z Lan
International Conference on Computational and Information Science, 280-285, 2004
902004
Lightweight silent data corruption detection based on runtime data analysis for HPC applications
E Berrocal, L Bautista-Gomez, S Di, Z Lan, F Cappello
Proceedings of the 24th International Symposium on High-Performance Parallel …, 2015
872015
Co-analysis of RAS log and job log on Blue Gene/P
Z Zheng, L Yu, W Tang, Z Lan, R Gupta, N Desai, S Coghlan, D Buettner
2011 IEEE International Parallel & Distributed Processing Symposium, 840-851, 2011
852011
Integrating dynamic pricing of electricity into energy aware scheduling for HPC systems
X Yang, Z Zhou, S Wallace, Z Lan, W Tang, S Coghlan, ME Papka
SC'13: Proceedings of the International Conference on High Performance …, 2013
842013
Dynamic load balancing of SAMR applications on distributed systems
Z Lan, VE Taylor, G Bryan
SC'01: Proceedings of the 2001 ACM/IEEE Conference on Supercomputing, 24-24, 2001
842001
Practical online failure prediction for blue gene/p: Period-based vs event-driven
L Yu, Z Zheng, Z Lan, S Coghlan
2011 IEEE/IFIP 41st International Conference on Dependable Systems and …, 2011
762011
Fault-aware, utility-based job scheduling on blue, gene/p systems
W Tang, Z Lan, N Desai, D Buettner
2009 IEEE International Conference on Cluster Computing and Workshops, 1-10, 2009
762009
Analyzing and adjusting user runtime estimates to improve job scheduling on the Blue Gene/P
W Tang, N Desai, D Buettner, Z Lan
2010 IEEE International Symposium on Parallel & Distributed Processing …, 2010
752010
Dynamic meta-learning for failure prediction in large-scale systems: A case study
J Gu, Z Zheng, Z Lan, J White, E Hocks, BH Park
2008 37th International Conference on Parallel Processing, 157-164, 2008
742008
A meta-learning failure predictor for blue gene/l systems
P Gujrati, Y Li, Z Lan, R Thakur, J White
2007 International Conference on Parallel Processing (ICPP 2007), 40-40, 2007
742007
Watch out for the bully! job interference study on dragonfly network
X Yang, J Jenkins, M Mubarak, RB Ross, Z Lan
SC'16: Proceedings of the International Conference for High Performance …, 2016
712016
A practical failure prediction with location and lead time for blue gene/p
Z Zheng, Z Lan, R Gupta, S Coghlan, P Beckman
2010 International Conference on Dependable Systems and Networks Workshops …, 2010
692010
Reliability-aware scalability models for high performance computing
Z Zheng, Z Lan
2009 IEEE International Conference on Cluster Computing and Workshops, 1-9, 2009
692009
Dynamic load balancing for structured adaptive mesh refinement applications
Z Lan, VE Taylor, G Bryan
International Conference on Parallel Processing, 2001., 571-579, 2001
692001
Reducing energy costs for IBM Blue Gene/P via power-aware job scheduling
Z Zhou, Z Lan, W Tang, N Desai
Workshop on Job Scheduling Strategies for Parallel Processing, 96-115, 2013
662013
Adaptive fault management of parallel applications for high-performance computing
Z Lan, Y Li
IEEE Transactions on Computers 57 (12), 1647-1660, 2008
612008
A study of dynamic meta-learning for failure prediction in large-scale systems
Z Lan, J Gu, Z Zheng, R Thakur, S Coghlan
Journal of parallel and distributed computing 70 (6), 630-643, 2010
592010
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–20