Publications and White Papers

An Open Source Performance Tools Software Suite for Scientific Computing [PDF]
Mohan, T., Mucci, P.
Concurrency and Computation: Practice and Experience, Wiley and Sons, 2009.

METRIC: Memory Tracing via Dynamic Binary Rewriting to Identify Cache Inefficiencies [PDF]
J. Marathe, F. Mueller, T. Mohan, S.A. McKee, B.R. de Supinski, A. Yoo
ACM Transactions on Programming Languages and Systems Vol. 29, No. 2, Article 12, April 2007.

An Open Source Performance Tools Software Suite for Scientific Computing [PDF]
Mohan, T., Mucci, P.
International Supercomputing Conference 2007, Dresden, Germany, June, 2007.

Analysis and Optimization of Yee_Bench using Hardware Performance Counters [PDF]
Andersson, U., Mucci, P.
ParCo 2005: Parallel Computing 2005, Malaga, Spain, September, 2005.

PerfMiner: Cluster-Wide Collection, Storage and Presentation of Application Level Hardware Performance Data [PDF]
Mucci, P., Ahlin, D., Danielsson, J., Ekman, P., Malinowski, L.
Euro-Par 2005: European Conference on Parallel Computers, Monte de Caparica, Portugal, August/September 2005.

Design Considerations for Shared Memory MPI Implementations on Linux NUMA Systems: An MPICH/MPICH2 Case Study [PDF]
Ekman, P., Mucci, P.
Advanced Micro Devices, July, 2005.

Memory Bandwidth and the Performance of Scientific Applications: A Study of the AMD Opteron Processor [PDF]
Mucci, P.
Advanced Micro Devices Technical Whitepaper, June 2004.

Accurate Cache and TLB Characterization Using Hardware Counters [PDF]
Dongarra, J., Moore, S., Mucci, P., Seymour, K., You, H.
International Conference on Computational Science 2004, Krakow, Poland, June 2004.

Optimizing Cluster Applications with PAPI [PDF]
Mucci, P., London, K.
ClusterWorld Magazine, May 2004.

Automating the Large-scale Collection and Analysis of Performance Data on Linux Clusters [PDF]
Mucci, P., Dongarra, J., Moore, S., Song, F., Wolf, F.
Proceedings of the 5th LCI International Conference on Linux Clusters: The HPC Revolution, Austin, Texas, May 2004.

Identifying and Exploiting Spatial Regularity in Data Memory References [PDF]
Mohan, T., de Supinski, B., McKee, S., Mueller, F., Yoo, A., Schulz, M.
Proceedings of the 2003 ACM/IEEE conference on Supercomputing, Phoenix, Arizona, Nov 2003.

Production Quality Open Source Performance Tools: A White Paper Prepared by the Performance Evaluation Research Center and Collaborators [PDF]
Jack Dongarra, Shirley Moore, Philip Mucci, Daniel Terpstra, Allen Malony, Sameer Shende, Jeffrey Hollingsworth, Barton Miller, Daniel Reed, Celso Mendes, Allan Snavely
Prepared in Response to the Request for Information for Open Source Software Development Acceleration
National Nuclear Security Administration, Advanced Simulation and Computing Initiative and the ASCI Pathforward Program, 2003.

Performance Instrumentation and Measurement for Terascale Systems [PDF]
Dongarra, J., Malony, A., Moore, S., Mucci, P., Shende, S.
International Conference on Computational Science 2003, Melbourne, Australia, June 2003.

Experiences and Lessons Learned with a Portable Interface to Hardware Performance Counters [PDF]
Dongarra, J., London, K., Moore, S., Mucci, P., Terpstra, D., You, H., Zhou, M.
IPDPS 2003, Nice, France, April 2003, and
Lecture Notes in Computer Science, Springer-Verlag, Heidelberg, Volume 2723, pp. 53-62, January, 2003.

Partial Data Traces: Efficient Generation and Representation [PDF]
F. Mueller, T. Mohan, B.R. de Supinski, S.A. McKee, A. Yoo
Proc. PACT 2001 Workshop on Binary Translation, Barcelona, ES, September 2001.

End-user Tools for Application Performance Analysis Using Hardware Counters [PDF]
London, K., Dongarra, J., Moore, S., Mucci, P., Seymour, K., Spencer, T.
International Conference on Parallel and Distributed Computing Systems, Dallas, TX, August 8-10, 2001.

Using PAPI for Hardware Performance Monitoring on Linux Systems [PDF]
Dongarra, J., London, K., Moore, S., Mucci, P., Terpstra, D.
Conference on Linux Clusters: The HPC Revolution, Urbana, Illinois, June 25-27, 2001.

The PAPI Cross-Platform Interface to Hardware Performance Counters [PDF]
London, K., Moore, S., Mucci, P., Seymour, K., Luczak, R.
Department of Defense Users' Group Conference Proceedings, Biloxi, Mississippi, June 18-21, 2001.

A Scalable Cross-Platform Infrastructure for Application Performance Tuning Using Hardware Counters [PDF]
Browne, S., Dongarra, J., Garner, N., London, K., Mucci, P.
Proceedings of SuperComputing 2000 (SC'00), Dallas, TX, November 2000.

A Portable Programming Interface for Performance Evaluation on Modern Processors [PDF]
Browne, S., Dongarra, J., Garner, N., Ho, G., Mucci, P.
The International Journal of High Performance Computing Applications, Volume 14, number 3, pp. 189-204, Fall 2000.

A Portable Programming Interface for Performance Evaluation on Modern Processors [PDF]
Browne, S., Dongarra, J., Garner, N., London, K., Mucci, P.
UT Computer Science Technical Report #444, July 2000.

PAPI: A Portable Interface to Hardware Performance Counters [PDF]
Browne, S., Deane, C., Ho, G., Mucci, P.
Proceedings of Department of Defense HPCMP Users Group Conference, June 1999.

Efficient Transport Independent Active Messaging Implementation for PVM [PDF]
Mucci, P.
UT Computer Science Technical Report #399, August 1998.

Low Level Architectural Characterization Benchmarks for Parallel Computers [PDF]
Mucci, P., London, K.
UT Computer Science Technical Report #394, July 1998.

Architectural Characterization of DoD MSRC HPC Platforms [PDF]
Mucci, P., London, K.
D.o.D. HPC Users' Group Conference, July 1998.

The BLASBench Report [PDF]
Mucci, P., London, K.
CEWES/ERDC MSRC/PET Technical Report 98-27, July 1998.

The MPBench Report [PDF]
Mucci, P., London, K.
CEWES/ERDC MSRC/PET Technical Report 98-26, July 1998.

The CacheBench Report [PDF]
Mucci, P., London, K.
CEWES/ERDC MSRC/PET Technical Report 98-25, July 1998.

Possibilities for Active Messaging in PVM [PDF]
Dongarra, J., Mucci, P.
UT Computer Science Technical Report #277, February 1995.

A Test Suite for PVM [PDF]
Do, M., Dongarra, J., Jeannot, E., Mucci, P.
UT Computer Science Technical Report #276, February 1995.