Selected List of Antoine Petitet's Publications
Journal Articles
-
Automated Empirical Optimization of Software and the ATLAS Project,
with R. Clint Whaley and Jack J. Dongarra, in Parallel Computing, Volume 27,
Numbers 1-2, pp 3-25, 2001, ISSN 0167-8191. (also available as University
of Tennessee LAPACK
Working Note No. 147, UT-CS-00-448, 2000)
-
Programming Crashworthiness Simulation for Parallel Platforms,
with G. Lonsdale, F. Zimmermann, J. Clinckemaillie, S. Meliciani and
S. Vlachoutsis in Mathematical and Computer Modeling, Vol. 31, pp 61-76,
2000
-
Algorithmic Redistribution Methods for Block-Cyclic Decompositions,
with J. Dongarra in IEEE Transactions on Parallel and Distributed Systems
Vol. 10, No. 12, pp 1201-1216, 1999
-
Scheduling Block-Cyclic Array Redistribution, with F. Desprez,
J. Dongarra, C. Randriamaro and Y. Robert in IEEE Transactions on Parallel
and Distributed Systems Vol. 9, No. 2, pp 192-205, 1998 (also
LAPACK Working Note
No. 120)
-
Practical Experience in the Dangers of Heterogeneous Computing,
with S. Blackford, A. Cleary, J. Demmel, I. Dhillon, J. Dongarra,
S. Hammarling, H. Ren, K. Stanley and R. C. Whaley in ACM Trans.
Math. Soft. Vol. 23, No. 2, 1997 (also
LAPACK Working
Note No. 112)
-
Efficient Solution of the Rank-Deficient Linear Least Squares Problem,
with G. Quintana-Orti and E. Quintana-Orti in SIAM Journal on Scientific and
Statistical Computing, Vol. 20, No. 3, pp 1155-1163, 1999
(also LAPACK Working
Note No. 113)
-
The Spectral Decomposition of Nonsymmetric Matrices on Massively Parallel
Machines, with Z. Bai, J. Demmel, J. Dongarra, H. Robinson, K. Stanley in
SIAM Journal on Scientific Computing, Vol. 18, No. 5, pp 1446-1461, 1997
(also LAPACK Working
Note No. 91)
-
The Design and Implementation of the Reduction Routines in ScaLAPACK,
with J. Choi, J. Dongarra, S. Ostrouchov, D. Walker and R. C. Whaley in
High Performance Computing: Technology, Methods and Applications, Advances
in Parallel Computing Series, 1995
-
The Design and Implementation of the ScaLAPACK LU, QR, and Cholesky
Factorization Routines, with J. Choi and J. Dongarra and S. Ostrouchov
D. Walker and R. C. Whaley, in Scientific Programming, Vol. 5, 1996
(also LAPACK Working
Note No. 80)
-
ScaLAPACK: A Portable Linear Algebra Library for Distributed Memory
Computers - Design Issues and Performance, with J. Choi, J. Demmel,
I. Dhillon, J. Dongarra, S. Ostrouchov, K. Stanley, D. Walker and
R. C. Whaley in Computer Physics Communications, Vol. 97, 1996
(also LAPACK
Working Note No. 95)
-
The Design and Implementation of the Reduction Routines in ScaLAPACK,
with J. Choi, J. Dongarra, S. Ostrouchov, D. Walker and R. C. Whaley,
in High Performance Computing: Technology, Methods and Applications,
Dongarra, J. J. and Grandinetti, L. and Joubert, G. R. and Kowalik, J.
editors, Advances in Parallel Computing series, Vol. 10, Elsevier,
Amsterdam, The Netherlands, 1995
-
A Proposal for a Set of Parallel Basic Linear Algebra Subprograms,
with J. Choi, J. Dongarra, S. Ostrouchov, D. Walker and R. C. Whaley in
Applied Parallel Computing, 1995
(also LAPACK
Working Note No. 100)
-
A Parallel Block Implementation of Level 3 BLAS Kernels for MIMD
Vector Processors, with M. Dayde and I. Duff, in ACM Trans. Math.
Soft. Vol. 20, No. 2, 1994 (also CERFACS Technical Report TR/PA/92/74)
Books, Book Chapters
-
Parallel and Distributed Scientific Computing: A Numerical Linear Algebra
Problem Solving Environment Designer's Perspective, with H. Casanova,
J. Dongarra, Y. Robert and R. C. Whaley in Handbook on Parallel and Distributed
Processing, International Handbook on Information Systems Vol. 3, J. Blazewicz,
K. Ecker, B. Plateau and D. Trystram Editors, Springer Verlag, 2000
-
ScaLAPACK Users' Guide, with L. Blackford, J. Choi, A. Cleary,
E. D'Azevedo, J. Demmel, I. Dhillon, J. Dongarra, S. Hammarling, G. Henry,
K. Stanley, D. Walker and R. C. Whaley. SIAM, Philadelphia, 1997.
Conference Proceedings and Technical Reports
-
Numerical Libraries and the Grid: The GrADS Experiments with
ScaLAPACK, with S. Blackford, J. Dongarra, B. Ellis, G. Fagg,
K. Roche and S. Vadhiyar submited to SC2001. (also available as
University of Tennesse
Technical Report UT-CS-01-460).
-
Data Allocation Strategies for Dense Linear Algebra Kernels on
Heterogeneous Two-Dimensional Grid, with V. Boudet, F. Rastello
and Y. Robert, Proceedings of the International Conference on Parallel
and Distributed Computing (PDCS'99), IASTED Press, pages 561-569,
Boston, MA, 1999
-
More on Scheduling Block-Cyclic Array
Redistribution, with F. Desprez, S. Domas, J. Dongarra,
C. Randriamaro and Y. Robert, in Lecture Notes in Computer Science,
Vol. 1511, Springer-Verlag, Proceedings of 4th Workshop on Languages,
Compilers, and Run-time Systems for Scalable Computers (LCR98),
Pittsburgh, PA, 1998
-
Scheduling Block-Cyclic Array Redistribution,
with F. Desprez, J. Dongarra, C. Randriamaro and Y. Robert, in Parallel
Computing: Fundamentals, Applications and New Directions, E. D'Hollander,
G. Joubert, F. Peters and U. Trottenberg Editors, North Holland, 1998 (ParCo97)
-
Algorithmic Redistribution Methods for Block Cyclic Decompositions,
Ph.D Thesis, University of Tennessee, Knoxville, 1996
(also LAPACK Working
Note No. 128 and
No. 133)
-
Case Studies on the Development of ScaLAPACK and the NAG Numerical PVM
Library, with J. Dongarra and S. Hammarling in Quality of Numerical
Software: Assessment and Enhancement (1997), (Proceedings of the IFIP
TC2/WG 2.5 Working Conference, Oxford, UK, 8-12 July, 1996)
-
ScaLAPACK: A Portable Linear Algebra Library for Distributed Memory
Computers - Design Issues and Performance, with L. Blackford, J. Choi,
A. Cleary, J. Demmel, I. Dhillon, J. Dongarra, S. Hammarling, G. Henry,
K. Stanley, D. Walker and R. C. Whaley, Proceedings of Supercomputing '96,
ACM SIGARCH and IEEE Computer Society publishers, ISBN 0-89791-854-1, 1996
-
Data Parallel GMRES on the CM-5, with V. Laubie and S. Petiton,
in Proceedings of the 1993 SIAM Annual Meeting, Philadelphia, Pennsylvania,
July 12-16, 1993