24
Live migration of Virtual Machines Nour Stefan, SCPD

Live migration of Virtual Machines Nour Stefan, SCPD

Embed Size (px)

Citation preview

Page 1: Live migration of Virtual Machines Nour Stefan, SCPD

Live migration of Virtual Machines

Nour Stefan, SCPD

Page 2: Live migration of Virtual Machines Nour Stefan, SCPD

• Introduction• Related work• Design• Writable Working Sets• Implementation Issues• Evaluation• Future work• Conclusions

Page 3: Live migration of Virtual Machines Nour Stefan, SCPD

Introduction• OS virtualization

– Data centers– Cluster computing

• Live OS migration– Avoid problem of “residual dependencies”– In-memory state can be transferred in a consistent and efficient way

• Kernel-internal state• Application-level state

– Separation of concerns between Users and Operator of a data center or cluster

– Separation of hardware and software considerations, and consolidating clustered hardware into a single coherent management domain

• High-performance migration support for Xen

Page 4: Live migration of Virtual Machines Nour Stefan, SCPD

Related work

• Collective project– For slow connections and longer time spans– Stop the OS execution while transfer

• Zap• NomadBIOS– Pre-copy migration – Not adapting to the writable working set

Page 5: Live migration of Virtual Machines Nour Stefan, SCPD

Design• Migrating memory

• Balancing Downtime and Total migration time

– Push phase– Stop-and-copy phase– Pull phase

• Local resources – Connections to local devices(disks , network interfaces)– Single switched LAN– Generate an unsolicited ARP reply from migrated host,

advertising that the IP has moved to a new location– Network-Attached Storage

Page 6: Live migration of Virtual Machines Nour Stefan, SCPD
Page 7: Live migration of Virtual Machines Nour Stefan, SCPD

Writable Working Sets

Page 8: Live migration of Virtual Machines Nour Stefan, SCPD

Measuring Writable Working Sets

Page 9: Live migration of Virtual Machines Nour Stefan, SCPD

Measuring Writable Working Sets

Page 10: Live migration of Virtual Machines Nour Stefan, SCPD

Measuring Writable Working Sets

Page 11: Live migration of Virtual Machines Nour Stefan, SCPD
Page 12: Live migration of Virtual Machines Nour Stefan, SCPD
Page 13: Live migration of Virtual Machines Nour Stefan, SCPD
Page 14: Live migration of Virtual Machines Nour Stefan, SCPD

Implementation Issues

• Managed migration– Performed largely outside the migratee– Migration daemons running in the management VM of

the source and destination (new VM on destination)– Rounds of copying (dirtied during the previous round)– Dirty bitmap copied from Xen at start of each round– Shadow page tables (read-only page-tables entries =>

page fault trapped by Xen)

Page 15: Live migration of Virtual Machines Nour Stefan, SCPD

Implementation Issues

• Self migration– Implemented within the migratee OS– Migration stub on destination machine– Consistent OS checkpointing• Two-stage stop-and-copy phase

– Disables all OS activity except for migration => final scan of dirty bitmap => shadow buffer

• Transfer shadow buffer

Page 16: Live migration of Virtual Machines Nour Stefan, SCPD

Implementation Issues

• Dynamic Rate-Limiting• Rapid Page Dirtying

Page 17: Live migration of Virtual Machines Nour Stefan, SCPD

Evaluation

• Test setup– Dual Intel Xeon 2GHz CPU and 2GB memory– TG3 broadband

Page 18: Live migration of Virtual Machines Nour Stefan, SCPD

Evaluation

Page 19: Live migration of Virtual Machines Nour Stefan, SCPD

Evaluation

Page 20: Live migration of Virtual Machines Nour Stefan, SCPD

Evaluation

Page 21: Live migration of Virtual Machines Nour Stefan, SCPD

Evaluation

Page 22: Live migration of Virtual Machines Nour Stefan, SCPD

Future work

• Cluster management• Wide Area Network redirection• Migrating Block Devices

Page 23: Live migration of Virtual Machines Nour Stefan, SCPD

Conclusions

• By integrating live OS migration into the Xen virtual machine monitor we enable rapid movement of interactive workloads within clusters and data centers. Our dynamic network-bandwidth adaptation allows migration to proceed with minimal impact on running services, while reducing total downtime to below discernable thresholds.

• Our comprehensive evaluation shows that realistic server workloads such as SPECweb99 can be migrated with just 210ms downtime, while a Quake3 game server is migrated with an imperceptible 60ms outage.

Page 24: Live migration of Virtual Machines Nour Stefan, SCPD

?