|
Biography |
Research |
Publications |
Presentations |
Teaching |
Links
I have moved to Colorado School of Mines. Please visit my current home page at http://www.mines.edu/~zchen.
Biography
Zizhong Chen is a Sr. Research Associate in the Innovative Computing Laboratory of the Department of
Computer Science at the University of Tennessee, Knoxville.
He received his Ph.D. from the University of Tennessee, Knoxville in 2006, under the direction of
Jack Dongarra.
Research Interests
- High Performance Computing
- Parallel and Distributed Processing
- Cluster and Grid Computing
- Numerical Analysis and Scientific Computing
- Computational Science and Engineering
Publications
Journal Articles
-
"Recovery Patterns for Iterative Methods in a Parallel Unstable Environment."
George Bosilca, Zizhong Chen, Jack Dongarra, and Julien Langou.
SIAM Journal on Scientific Computing. Accepted, May, 2006.
-
"Self Adapting Numerical Software (SANS) Effort."
Jack Dongarra, George Bosilca, Zizhong Chen, Victor Eijkhout, Graham Fagg,
Erika Fuentes, Julien Langou, Piotr Luszczek, Jelena Pjesivac-Grbovic,
Keith Seymour, Haihang You, and Satish S. Vadiyar.
IBM Journal of Research and Development. Volume 50, Number 2/3, Page 223-238, 2006.
-
"Process Fault-Tolerance: Semantics, Design and Applications for High Performance Computing."
Graham E. Fagg, Edgar Gabriel, Zizhong Chen, Thara Angskun, George Bosilca, Jelena
Pjesivac-Grbovic, and Jack Dongarra.
International Journal of High Performance Computing Applications, Volume 19, Number 4, Page 465-477, Winter, 2005.
-
"Condition Numbers of Gaussian Random Matrices."
Zizhong Chen and Jack J. Dongarra.
SIAM Journal on Matrix Analysis and Applications, Volume 27, Number 3, Page 603-620, 2005.
-
"Self Adapting Software for Numerical Linear Algebra and LAPACK for Clusters."
Zizhong Chen, Jack Dongarra, Piotr Luszczek, and Kenneth Roche.
Parallel Computing, Volume 29, Number 11-12, Page 1723-1743, November-December, 2003.
Conference and Workshop Papers
-
"Algorithm-Based Checkpoint-Free Fault Tolerance for Parallel Matrix Computations on Volatile Resources."
Zizhong Chen and Jack Dongarra.
Proceedings of the 20th IEEE International Parallel & Distributed Processing Symposium (IPDPS 2006),
Rhodes Island, Greece, April 25-29, 2006.
-
"Fault Tolerant High Performance Computing by a Coding Approach."
Zizhong Chen, Graham E. Fagg, Edgar Gabriel, Julien Langou,
Thara Angskun, George Bosilca, and Jack J. Dongarra.
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of
Parallel Programming (PPoPP'05), Chicago, Illinois, USA, June 15-17, 2005.
-
"Numerically Stable Real Number Codes Based on Random Matrices."
Zizhong Chen and Jack J. Dongarra.
Proceedings of the 5th International Conference on Computational Science (ICCS2005),
Atlanta, Georgia, USA, May 22-25, 2005.
-
"Extending the MPI Specification for Process Fault Tolerance on High Performance Computing Systems."
Graham E. Fagg, Edgar Gabriel, George Bosilca, Thara Angskun, Zizhong Chen,
Jelena Pjesivac-Grbovic, Kevin London and Jack J. Dongarra.
Proceedings of the 19th International Supercomputer Conference (ISC2004), Heidelberg, German, June 21-24, 2004.
-
"LAPACK for Clusters Project: An Example of Self Adapting Numerical Software."
Zizhong Chen, Jack Dongarra, Piotr Luszczek, and Kenneth Roche.
Proceedings of the 37th Hawaii International Conference on System Sciences (HICSS-37),
Kauai, Hawaii, USA, January 5-8, 2004.
-
"Fault Tolerant Communication Library and Applications for High Performance Computing."
Graham E. Fagg, Edgar Gabriel, Zizhong Chen, Thara Angskun, George Bosilca, Antonin Bukovsky,
and Jack J. Dongarra.
Proceedings of the 4th Los Alamos Computer Science Institute Symposium (LACSI'03), Santa Fe, NM, USA, October 27-29, 2003.
-
"Self Adaptive Software for Numerical Linear Algebra Library Routines on Clusters."
Zizhong Chen, Jack Dongarra, Piotr Luszczek, and Kenneth Roche.
Proceedings of the 3rd International Conference on Computational Science (ICCS2003),
Melbourne, Australia, June 7-9, 2003.
Technical Reports
-
"Building Fault Survivable MPI Programs with FT-MPI Using Diskless Checkpointing."
Zizhong Chen, Graham E. Fagg, Edgar Gabriel, Julien Langou,
Thara Angskun, George Bosilca, and Jack J. Dongarra.
University of Tennessee Computer Science Department Technical Report. Technical Report UT-CS-04-540, 2004.
Presentations
-
"Algorithm-Based Checkpoint-Free Fault Tolerance for Parallel Matrix Computations on Volatile Resources."
The 12th SIAM Conference on Parallel Processing for Scientific Computing, San Francisco, California, USA, February 22-24, 2006.
-
"Scalable Techniques for Fault Tolerant High Performance Computing."
Ph.D. Dissertation Defense, Knoxville, Tennessee, USA, February 20, 2006.
-
"Scalable Fault Tolerance for Large Parallel Systems."
The 5th ICL Retreat, Townsend, Tennessee, USA, August 22-23, 2005.
-
"Fault Tolerant High Performance Computing by Coding Approaches."
The 10th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP'05),
Chicago, Illinois, USA, June 15-17, 2005.
Invited Talk, Oak Ridge National Laboratory, Oak Ridge, Tennessee, USA, July 12, 2005.
-
"Random Matrices and Their Applications in Fault Tolerant Parallel Computing."
The 12th International Linear Algebra Society Conference, Regina, Saskatchewan, Canada, June 26-29, 2005.
-
"Real Number Codes Based on Random Matrices."
The 5th International Conference on Computational Science (ICCS2005), Atlanta, Georgia, USA, May 22-25, 2005.
-
"Recovery Patterns for Iterative Methods in a Parallel Unstable Environment."
The 7th IMACS International Symposium on Iterative Methods in Scientific Computing, Toronto, Ontario, Canada, May 5-8, 2005.
Teaching
Teaching Assistant, University of Tennessee, Knoxville, USA
Instructor, Beijing No.13 Middle School, Beijing, P. R. China
- Elementary Algebra, Spring 1997
Links
|