ANSYS FLUENT Performance
Intel Xeon 5675 Comparison
ANSYS FLUENT 12.1 to ANSYS FLUENT 13.0
Performance Testing Performed
at
QLogic NETtrack Developer Center
Agenda
• Partners in the Testing
• Test Configuration
• Performance Profile
• Difference in Performance Rating
• User and MPI Utilization Differences
• MPI Function Calls
• Message Rate
• NETtrack Developer Center
• ANSYS Customer Test Drive Program
Month DD, YYYYQLogic Confidential2
Partners in the Testing
• The information contained within this document was generated in QLogic’s
NDC laboratory
• Participating parties: ANSYS, Dell, Intel and QLogic
• Cluster Resources: NETTrack Developer Center (NDC) Dell C6100 – Sapphire II
• For more info regarding participants, please refer to the following sites:
www.qlogic.com, www.Dell.com, www.ansys.com, www.Intel.com
Month DD, YYYY3 QLogic Confidential
NDC – Test Configuration
QLogic Confidential4
Server: Dell PowerEdge C6100 (Sapphire-II)
CPU: 2 - Intel Xeon X5675 @ 3.07 GHz (Hex Core)
Mem: 48GB (12x4G) DDR3 @ 1333MHz
OS: RHEL 5.5 / Kernel 2.6.18-194.el5
App: ANSYS FLUENT 12.1.11 and 13.0
MPI: Platform MPI
HCA: QLE7340 - QDR InfiniBand
Stack: OFED 1.5.2
Switch: 12300 - 36-port QDR InfiniBand
F/W: 6.0.2.1.11
Settings: 4K MTU 1 Virtual Lane
Performance Profiles
Month DD, YYYYQLogic Confidential5
Performance Profile
ANSYS FLUENT 12.1 vs. ANSYS FLUENT 13.0
• 14% average performance improvement with ANSYS FLUENT 13.0 across the seven benchmark tests
• Greatest performance improvement shown with Sedan (k-eps Turbulence model with Pressure-Based Coupled Implicit solver). Significant gains due to new hybrid parallel implementation in ANSYS FLUENT 13.0
• Least performance gains shown with Truck_14M and Truck_111M benchmark tests, both tests use DES Turbulence model with Segregated Implicit solver
6 QLogic Confidential
0
2000
4000
6000
8000
10000
12000
14000
Aircraft_2M Sedan_4m Eddy_417k Truck_14M Truck_Poly_14M Truck_111m
ANSYS FLUENT Comparison16-nodes/192-cores
FLUENT r12.1
FLUENT r13.0
11%53%
7%
1% 13%
-2%
Profile: User and MPI Overhead
16-Nodes/192-Cores
• ANSYS FLUENT 13 achieves a 14%
better average rating across the
seven tests. In producing these better
results:
• Utilizes an average of 2% less time in
computing
• Increased MPI communications by an
average of 4%
QLogic Confidential7
Benchmark Name Usage v12.1.11 v13.0 % Change
aircraft_2m Rating 9818.2 10922.9 11%
User Time 39.33% 35.82% -9%
MPI time 60.67% 64.18% 6%
sedan_4m Rating 7912.1 12134.8 53%
User Time 49.62% 49.31% -1%
MPI time 50.38% 50.69% 1%
eddy_417k Rating 12272.7 13110.8 7%
User Time 30.51% 27.81% -9%
MPI time 69.49% 72.19% 4%
truck_14m Rating 1497.1 1511 1%
User Time 48.00% 38.51% -20%
MPI time 52.00% 61.49% 18%
truck_poly_14m Rating 1346.8 1525.7 13%
User Time 47.76% 40.41% -15%
MPI time 52.24% 59.59% 14%
truck_111m Rating 180.6 176.9 -2%
User Time 30.51% 44.99% 47%
MPI time 69.49% 55.01% -21%
Average Change Rating 31%
User Time -2%
MPI time 4%
Fluent Version
Profile: MPI Functions
Eddy_417
• The following are the most used calls– MPI_Iprobe (stat gathering), MPI_Isend,
MPI_Irecv, MPI_ALLreduce
• The following calls produce the most Overhead: MPI_Recv, MPI_Allreduce,
MPI_Barrier, MPI_Init, MPI_Waitall, MPI_Iprobe, MPI_Isend.
Month DD, YYYYQLogic Confidential8
ANSYS FLUENT 12.1.11 ANSYS FLUENT 13.0
Profile: MPI Functions
Truck_111
• The following are the most used calls– MPI_Iprobe (stat gathering), MPI_Isend,
MPI_Irecv, MPI_ALLreduce
• The following calls produce the most Overhead (73%): MPI_Recv, MPI_Allreduce,
MPI_Waitall.
Month DD, YYYYQLogic Confidential9
ANSYS FLUENT 12.1.11 ANSYS FLUENT 13.0
Profile: Message Rate
Eddy_417K & Truck_111M
Eddy_417K Model
• ANSYS FLUENT 13.0 sends 8% less messages at 192-cores
• Average message size at 192-cores is 15% smaller with 13.0
• 97% of the messages are less than 1K in size
Truck_111M Model
• ANSYS FLUENT 13.0 sends 57% less messages at 192-cores
• Average message size at 192-cores is 66% larger with13.0
• 98% of the messages are less than 65K in size
Month DD, YYYYQLogic Confidential10
Take a
Test
Dri
ve
QLogic NETtrack Developer Center
Test Drive Program
• Test Drive your models/simulations at the QLogic NETtrack Developer
Center
• Range of Configurations
• Nodes, Cores, Memory
• Interconnects
• Storage
• Determine the Optimal Performance Setup for your Business
Get Started – Contact us at [email protected]
Month DD, YYYY13 QLogic Confidential