4.7 Article

Live Virtual Machine Migration via Asynchronous Replication and State Synchronization

Journal

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS
Volume 22, Issue 12, Pages 1986-1999

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TPDS.2011.86

Keywords

Copy-on-write checkpointing; deterministic replay; live migration; trace; virtual machine

Funding

  1. National 973 Basic Research Program of China [2007CB310900]
  2. Program for New Century Excellent Talents in University [NCET-08-0218]
  3. China National Natural Science Foundation (NSFC) [60703050, 60973133]
  4. FOK YING TUNG Education Foundation [122007]
  5. Wuhan Chenguang Research Plan Foundation [200850731350]
  6. MoE-Intel Information Technology Special Research Foundation [MOE-INTEL-10-05]
  7. Division Of Computer and Network Systems
  8. Direct For Computer & Info Scie & Enginr [0914330] Funding Source: National Science Foundation
  9. Division of Computing and Communication Foundations
  10. Direct For Computer & Info Scie & Enginr [1016966] Funding Source: National Science Foundation

Ask authors/readers for more resources

Live migration of virtual machines (VM) across physical hosts provides a significant new benefit for administrators of data centers and clusters. Previous memory-to-memory approaches demonstrate the effectiveness of live VM migration in local area networks (LAN), but they would cause a long period of downtime in a wide area network (WAN) environment. This paper describes the design and implementation of a novel approach, namely, CR/TR-Motion, which adopts checkpointing/recovery and trace/replay technologies to provide fast, transparent VM migration for both LAN and WAN environments. With execution trace logged on the source host, a synchronization algorithm is performed to orchestrate the running source and target VMs until they reach a consistent state. CR/TR-Motion can greatly reduce the migration downtime and network bandwidth consumption. Experimental results show that the approach can drastically reduce migration overheads compared with memory-to-memory approach in a LAN: up to 72.4 percent on application observed downtime, up to 31.5 percent on total migration time, and up to 95.9 percent on the data to synchronize the VM state. The application performance overhead due to migration is kept within 8.54 percent on average. The results also show that for a variety of workloads migrated across WANs, the migration downtime is less than 300 milliseconds.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available