Page 1

IBM Labs in Haifa © 2003 IBM Corporation

Compiler and Architecture Seminar, November 11th, 2003

Subsetting SPEC When Measuring Results: Research vs. Industry

[email protected]

A D’oh… Torus

Page 2

Production History

IEEE Micro, July/August 2003 (Vol. 23, No. 4)

ISCA 2003 Panel, June 9th, 2003, San Diego, CA : Subsetting SPEC when measuring results: valid or manipulative?

MisSPECulation: Partial and Misleading Use of SPEC CPU2000 in Computer Architecture Conferences

The Use and Abuse of SPEC: An ISCA Panel

Revenge of the SPEC

Page 3

Research: Computer Architecture Conferences

[Figure: bar chart of # of papers per conference, for seven conferences in the order ISCA, Micro, HPCA, ISCA, Micro, HPCA, ISCA (HPCA-8 and HPCA-9 are labeled); years and bar values lost in extraction]

• Number of papers published: 209
• Papers that used a version of SPEC: 140 (66%)
• Earliest conference deadline: December 2000
• SPEC CPU2000 announced: December 1999

Page 4

Industry: Quotes from www.spec.org

• “SPEC CPU2000 is the next-generation industry-standardized CPU-intensive benchmark suite.”

• “These benchmarks measure the performance of the processor, memory and compiler on the tested system.”

SPEC CPU2000

[Figure: bar chart of # of papers by research topic (Datapath, Memory, Multi-Thread, Media, Simulation, Compiler, Misc.); bar values lost in extraction]

Page 5

Research: Partial Use of CPU2000

[Figure: percent of papers vs. # of benchmarks used per paper, with series Proc. CINT, Proc. CFP, Mem. CINT, Mem. CFP, Comp. CINT, and Comp. CFP; bin labels and values lost in extraction]

[Figure: # of papers by category: Full Use, No Reason, Reason Given; values lost in extraction]

Authors omit benchmarks, suites, and explanations!

Page 6

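Research: The Tower of Babel

The Chosen Few

[Figure: # of paper appearances per CINT2000 benchmark, in chart order: gzip, vpr, parser, gcc, mcf, vortex, twolf, bzip2, perlbmk, crafty, gap, eon; values lost in extraction]

[Figure: # of paper appearances per CFP2000 benchmark, in chart order: art, equake, ammp, swim, mesa, applu, mgrid, lucas, wupwise, apsi, galgel, sixtrack, facerec, fma3d; values lost in extraction]

Legend (source language): C, F77, F90, C++

Average of 8 (CINT) and 6 (CFP) Benchmarks per Paper!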

Page 7

Research vs. Industry: Top 10 Systems (CINT)

Top Industry SPECint ratios (benchmark counts illegible; bar values lost in extraction):

• Intel Pentium 4
• AMD Athlon
• AMD Opteron
• Dell Xeon
• HP Itanium 2
• HP Xeon
• Fujitsu Pentium 4
• IBM Opteron
• Einux Opteron
• IBM POWER4+

Top Research SPECint ratios (benchmark counts illegible; bar values lost in extraction):

• Intel Pentium 4
• AMD Athlon
• HP Itanium 2
• SGI Itanium 2
• AMD Opteron
• Dell Xeon
• HP Xeon
• Dell Pentium 4
• IBM Opteron
• Bull Itanium 2
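The two rankings above differ because a SPEC composite score is a geometric mean of per-benchmark SPECratios, so dropping benchmarks can reorder systems. The sketch below illustrates that effect only. The system names, the four-benchmark "suite", and every ratio value are hypothetical and are not taken from the slides or from SPEC results.

```python
from math import prod

# Hypothetical per-benchmark SPECratios for two made-up systems.
# The benchmark names are real CINT2000 names; the numbers are invented.
ratios = {
    "system_a": {"gzip": 900, "gcc": 1100, "mcf": 600, "eon": 1500},
    "system_b": {"gzip": 1000, "gcc": 1000, "mcf": 1000, "eon": 800},
}


def geometric_mean(values):
    """Geometric mean, the aggregate SPEC uses for its composite ratios."""
    values = list(values)
    return prod(values) ** (1.0 / len(values))


def composite(system, benchmarks):
    """Composite score of one system over the chosen benchmarks."""
    return geometric_mean(ratios[system][b] for b in benchmarks)


def ranking(benchmarks):
    """Systems ordered from highest to lowest composite score."""
    return sorted(ratios, key=lambda s: composite(s, benchmarks), reverse=True)


full_suite = ["gzip", "gcc", "mcf", "eon"]  # stand-in for the full suite
subset = ["gzip", "gcc", "mcf"]             # stand-in for a research subset

print("full suite:", ranking(full_suite))
print("subset:    ", ranking(subset))
```

With these invented numbers, system_a leads on the full list but falls behind system_b once eon is omitted, which is the kind of reordering the Industry and Research lists show.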

Page 8

Research vs. Industry: CFP2000

• 496 CINT2000 and 488 CFP2000 results published by SPEC
• Linear Regression model of SPECratio by the CPU clock rate
• Multiple Linear Regression model of SPECratio by speed and Memory-Hierarchy

[Figure: percentages for BP, I$, and D$ on CINT and CFP, with series PowerPC and Alpha; values lost in extraction]

[Figure: Linear Regression by MHz vs. Multiple LR by MHz and MH, for CINT and CFP; values lost in extraction]

• 105 papers used CINT2000, 63 used CFP2000 (60%)
• CFP2000 considered more regular and predictable
• CFP2000 has higher Dcache miss rate
• Only 35% of Memory-Hierarchy papers use a majority of CFP2000
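The two regression models named on this page can be sketched as ordinary least squares fits, compared by their R² values. This is a minimal illustration, not the analysis behind the slide: the data points are invented, and using L2 cache size as the single memory-hierarchy predictor is an assumption made here for brevity.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gauss-Jordan elimination."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        # Partial pivoting: move the largest remaining entry onto the diagonal.
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [x - factor * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def fit(rows, y):
    """Least squares with an intercept; returns (coefficients, r_squared)."""
    x = [[1.0] + list(r) for r in rows]
    k = len(x[0])
    # Normal equations: (X^T X) beta = X^T y.
    xtx = [[sum(xi[i] * xi[j] for xi in x) for j in range(k)] for i in range(k)]
    xty = [sum(xi[i] * yi for xi, yi in zip(x, y)) for i in range(k)]
    beta = solve(xtx, xty)
    pred = [sum(b * v for b, v in zip(beta, xi)) for xi in x]
    mean_y = sum(y) / len(y)
    ss_res = sum((yi - pi) ** 2 for yi, pi in zip(y, pred))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return beta, 1.0 - ss_res / ss_tot


# Hypothetical systems: (clock in MHz, L2 cache in MB, SPECratio).
systems = [
    (1000, 0.5, 425), (2000, 0.5, 725), (1500, 1.0, 600),
    (3000, 0.5, 1025), (1300, 3.0, 640), (2400, 1.0, 870),
]
spec_ratio = [s[2] for s in systems]

# Model 1: SPECratio explained by clock rate alone.
beta_mhz, r2_mhz = fit([(s[0],) for s in systems], spec_ratio)
# Model 2: SPECratio explained by clock rate and the memory-hierarchy proxy.
beta_multi, r2_multi = fit([(s[0], s[1]) for s in systems], spec_ratio)

print(f"R^2 by MHz only:     {r2_mhz:.3f}")
print(f"R^2 by MHz and L2:   {r2_multi:.3f}")
```

Adding a predictor never lowers R² for nested least squares models, so the interesting quantity is how much the memory-hierarchy term adds for CINT compared with CFP.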

Page 9

Research vs. Industry: Top 10 Systems (CFP)

Top Industry SPECfp ratios (benchmark counts illegible; bar values lost in extraction):

• HP Itanium 2
• SGI Itanium 2
• Bull Itanium 2
• Dell Itanium 2
• ION Itanium 2
• IBM POWER4+
• Intel Pentium 4
• HP Alpha [model number illegible]
• AMD Athlon
• HP Alpha [model number illegible]C

Top Research SPECfp ratios (benchmark counts illegible; bar values lost in extraction):

• HP Itanium 2
• SGI Itanium 2
• Bull Itanium 2
• ION Itanium 2
• Dell Itanium 2
• Sun UltraSparc III
• Fujitsu SPARC64 V
• HP Alpha [model number illegible]
• IBM POWER4+
• NEC Itanium 2

Page 10

Research vs. Industry: Processor Architecture

Use Architectures of the Future, not the Past!

SimpleScalar – A Two-Edged Sword

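[Figure: percent share of each processor architecture in Research CINT, SPEC CINT, Research CFP, and SPEC CFP; legend: MIPS, Alpha, Media/SIMD, HP-PA, IA-64, PowerPC, SPARC, AMD, x86; values lost in extraction]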

Page 11

Research

Industry

SPEC: Support research with simulation-sized input sets

Vendors: Get simulation tools out to the field