22
Performance Modelling and Dynamic Scheduling on Heterogeneous-ISA multi-core Architectures Nirmal Kumar Boran, Dinesh Kumar Yadav, Rishabh Iyer CADSL Lab, IIT Bombay VDAT2019 1

Performance Modelling and Dynamic Scheduling on

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Performance Modelling and Dynamic Scheduling on

Performance Modelling and Dynamic Scheduling on Heterogeneous-ISA

multi-core Architectures

Nirmal Kumar Boran, Dinesh Kumar Yadav, Rishabh IyerCADSL Lab, IIT Bombay

VDAT2019 1

Page 2: Performance Modelling and Dynamic Scheduling on

Introduction

• Unicore : Single threaded programs

Uniprocessor e.g. Pentium 4

• Multicore : For multithreading

Homogeneous-CMP: e.g. intel i7, HyperX Processors

Heterogeneous-CMP: To optimize both serial and parallel code e.g. big.LITTLE

Heterogeneous-ISA CMP: ISA affinity is exploited

VDAT2019 2

Page 3: Performance Modelling and Dynamic Scheduling on

ISA Diversity

• Code Density

• Dynamic Instruction Count

• Register Pressure

• Floating point and SIMD support

[1] Venkat, Ashish, and Dean M. Tullsen. ”Harnessing ISA diversity: Design of a heterogeneous-ISA chip multiprocessor.” 2014 ACM/IEEE 41st International Symposium on Computer Architecture (ISCA). IEEE, 2014.

VDAT2019 3

Page 4: Performance Modelling and Dynamic Scheduling on

Motivation

Execution time for different phases of ‘astar’ benchmark

VDAT2019 4

Page 5: Performance Modelling and Dynamic Scheduling on

Motivation

VDAT2019 5

Page 6: Performance Modelling and Dynamic Scheduling on

Migration Decision: Equivalence Point

• Function calls are potential Equivalence states[1][4]

x86 HLL ARM

VDAT2019 6

Page 7: Performance Modelling and Dynamic Scheduling on

Problem Statement• "Building a performance model and scheduler for

heterogeneous ISA asymmetric multi-core architectures“

Benchmark

Scheduler

Core 2Core 1

Architectural Parameters

Architectural Parameters

VDAT2019 7

Page 8: Performance Modelling and Dynamic Scheduling on

Model1A

1

A2

A13

A3

B1

B2

B13

B3

Model2

Model3

Model13

ExecutionTimeB

Model14

ISAA

ISAB

Previous Work – Training Phase

[4]: Boran, N.K., Meghwal, R.P., Sharma, K., Kumar, B., Singh, V.: Performance modelling of heterogeneous ISA multicore architectures. In: East-West Design & TestSymposium (EWDTS), 2016 IEEE. pp. 1{4. IEEE (2016) VDAT2019 8

Page 9: Performance Modelling and Dynamic Scheduling on

A1

A2

A13

A3

PB1

PB2

PB13

PB3

PredictedExecutionTime

B

Model14Model1

Model2

Model3

Model13

ISAA

ISAB

Previous Work – Prediction Phase

[4]: Boran, N.K., Meghwal, R.P., Sharma, K., Kumar, B., Singh, V.: Performance modelling of heterogeneous ISA multicore architectures. In: East-West Design & TestSymposium (EWDTS), 2016 IEEE. pp. 1{4. IEEE (2016) VDAT2019 9

Page 10: Performance Modelling and Dynamic Scheduling on

A1

A2

A13

A3 ExecutionTime

B

Model1

ISAA

ISAB

Proposed Work – Training Phase

VDAT2019 10

Page 11: Performance Modelling and Dynamic Scheduling on

A1

A2

A13

A3

PredictedExecutionTime

B

Model1

ISAA

ISAB

Proposed Work – Prediction Phase

VDAT2019 11

Page 12: Performance Modelling and Dynamic Scheduling on

Linear Regression Equation

VDAT2019 12

Page 13: Performance Modelling and Dynamic Scheduling on

Configuration

Design Parameter ARM Alpha x86

Architectural Registers 32 GPR 64 GPR 16 GPR

Cache line sizes (bytes) 64 64 64

LSQ size (bytes) 32 32 32

Fetch Width 4 4 4

Instruction Queue entries 64 64 64

ROB entries 192 192 192

DCache, ICache size 32KB 32KB 32KB

L2 Cache size 256KB 256KB 256KBVDAT2019 13

Page 14: Performance Modelling and Dynamic Scheduling on

Results: Root mean squared error

VDAT2019 14

Page 15: Performance Modelling and Dynamic Scheduling on

Results: Standard Deviation

VDAT2019 15

Page 16: Performance Modelling and Dynamic Scheduling on

Scheduling Prediction

• If PredictedTime < (CurrentTime + migrationOverhead)

Migrate to another ISA

ISA Prediction Accuracy

ARM 92.3 %

Alpha 92.7 %

X86 83.4 %

VDAT2019 16

Page 17: Performance Modelling and Dynamic Scheduling on

Results: Speed Up 1

VDAT201917

Page 18: Performance Modelling and Dynamic Scheduling on

Results: Speed Up 2

VDAT201918

Page 19: Performance Modelling and Dynamic Scheduling on

Results: Energy Comparison

VDAT201919

Page 20: Performance Modelling and Dynamic Scheduling on

References

• [1]: Venkat, Ashish, and Dean M. Tullsen. ”Harnessing ISA diversity: Design of a heterogeneous-ISA chip multiprocessor.” 2014 ACM/IEEE 41st International Symposium on Computer Architecture (ISCA). IEEE, 2014.

• [2]: Von Bank, David G., Charles M. Shub, and Robert W. Sebesta. "A unified model of pointwise equivalence of procedural computations." ACM Transactions on Programming Languages and Systems (TOPLAS) 16.6 (1994)

• [3]: M. Pricopi, T. S. Muthukaruppan, V. Venkataramani, T. Mitra, and S. Vishin, “Power-performance modeling on asymmetric multi-cores,” in Proceedings of the 2013 International Conference on Compilers, Architectures and Synthesis for Embedded Systems, ser. CASES ’13.

• [4]: Boran, N.K., Meghwal, R.P., Sharma, K., Kumar, B., Singh, V.: Performance modelling of heterogeneous ISA multicore architectures. In: East-West Design & TestSymposium (EWDTS), 2016 IEEE. pp. 1{4. IEEE (2016)

• [5]: N. L. Binkert, R. G. Dreslinski, L. R. Hsu, K. T. Lim, A. G. Saidi, and S. K. Reinhardt, The m5 simulator: Modeling networked systems, IEEE Micro, vol. 26, no. 4, pp. 5260, July 2006.

• [6]: A. Lukefahr, S. Padmanabha, R. Das, F. M. Sleiman, R. Dreslinski, T. F. Wenisch, and S. Mahlke, Composite cores: Pushing heterogeneity into a core, in Proceedings of the 2012 45th Annual IEEE/ACM International Symposium on Microarchitecture, ser. MICRO-45. Washington, DC, USA: IEEE Computer Society, 2012, pp. 317328.

• [7]: Kumar, R., Farkas, K.I., Jouppi, N.P., Ranganathan, P., Tullsen, D.M.: SingleISA heterogeneous multi-core architectures: The potential for processor power reduction. In: Microarchitecture, 2003. MICRO-36. Proceedings. 36th Annual IEEE/ACM International Symposium on. pp. 81{92. IEEE (2003)

VDAT2019 20

Page 21: Performance Modelling and Dynamic Scheduling on

Thank you

VDAT2019 21

Page 22: Performance Modelling and Dynamic Scheduling on

Migration decision

• Execution of a program on a processor- sequence of states

• State change - execution of a single instruction

• There exists certain well-defined states of computation where one to one correspondence is possible with another instance of computation[2]

VDAT2019 22