DOI QR코드

DOI QR Code

An Efficient VM-Level Scaling Scheme in an IaaS Cloud Computing System: A Queueing Theory Approach

  • Lee, Doo Ho (Department of Industrial & Management Engineering Kangwon National University)
  • Received : 2017.03.09
  • Accepted : 2017.04.25
  • Published : 2017.06.28

Abstract

Cloud computing is becoming an effective and efficient way of computing resources and computing service integration. Through centralized management of resources and services, cloud computing delivers hosted services over the internet, such that access to shared hardware, software, applications, information, and all resources is elastically provided to the consumer on-demand. The main enabling technology for cloud computing is virtualization. Virtualization software creates a temporarily simulated or extended version of computing and network resources. The objectives of virtualization are as follows: first, to fully utilize the shared resources by applying partitioning and time-sharing; second, to centralize resource management; third, to enhance cloud data center agility and provide the required scalability and elasticity for on-demand capabilities; fourth, to improve testing and running software diagnostics on different operating platforms; and fifth, to improve the portability of applications and workload migration capabilities. One of the key features of cloud computing is elasticity. It enables users to create and remove virtual computing resources dynamically according to the changing demand, but it is not easy to make a decision regarding the right amount of resources. Indeed, proper provisioning of the resources to applications is an important issue in IaaS cloud computing. Most web applications encounter large and fluctuating task requests. In predictable situations, the resources can be provisioned in advance through capacity planning techniques. But in case of unplanned and spike requests, it would be desirable to automatically scale the resources, called auto-scaling, which adjusts the resources allocated to applications based on its need at any given time. This would free the user from the burden of deciding how many resources are necessary each time. In this work, we propose an analytical and efficient VM-level scaling scheme by modeling each VM in a data center as an M/M/1 processor sharing queue. Our proposed VM-level scaling scheme is validated via a numerical experiment.

Keywords

References

  1. M. K. Kim and J. Y. Choi, "An efficient two-phase heuristic policy for acceptance control in IaaS cloud service," Journal of the Society of Korea Industrial and Systems Engineering, vol. 38, no. 2, 2015, pp. 91-100. https://doi.org/10.11627/jkise.2015.38.2.91
  2. T. W. Um, H. Lee, R. Woo, and J. K. Choi, "Dynamic resource allocation and scheduling for cloud-based virtual content delivery networks," ETRI Journal, vol. 36, no. 2, 2014, pp. 197-205. https://doi.org/10.4218/etrij.14.2113.0085
  3. Z. Zhuang and C. Guo, "Building cloud-ready video transcoding system for content delivery networks (CDNs)," Proc. IEEE GCC, 2012, pp. 2048-2053.
  4. J. He, Y. Wen, J. Huang, and D. Wu, "On the cost-QoE tradeoff for cloud-based video streaming under Amazon EC2's pricing models," IEEE Transactions on Circuits and Systems for Video Technology, vol. 24, no. 4, 2013, pp. 669-680. https://doi.org/10.1109/TCSVT.2013.2283430
  5. S. P. Ponnusamy and E. Karthikeyan, "Cache optimization on hot-point proxy caching using weighted-Rank cache replacement policy," ETRI Journal, vol. 35, no. 4, 2013, pp. 687-696. https://doi.org/10.4218/etrij.13.0112.0606
  6. R. S. Huckman, G. P. Pisano, and L. Kind, Amazon web service, Harvard Business School Case (609-048), 2008.
  7. http://www.rightscale.com.
  8. http://software.dell.com/products/cloud-manager.
  9. H. Masuyama and T. Takine, "Sojourn time distribution in a MAP/M/1 processor-sharing queue," Operations Research Letters, vol. 31, no. 5, 2003, pp. 406-412. https://doi.org/10.1016/S0167-6377(03)00028-2
  10. H. C. Tijms, Stochastic models: an algorithmic approach, John Wiley & Sons, Inc, 1994.
  11. F. Guillemin and J. Boyer, "Analysis of the M/M/1 queue with processor sharing via spectral theory," Queueing Systems, vol. 39, no. 4, 2001, pp. 377-397. https://doi.org/10.1023/A:1013913827667
  12. P. Barham, B. Dragovic, K. Fraser, S. Hand, T. Harris, A. Ho, R. Neugebauer, I. Pratt, and A. Warfield, "Xen and the art of virtualization," Proc. ACM SOSP, 2003, pp. 164-177.
  13. B. Sotomayor, R. S. Montero, I. M. Llorente, and I. Foster, "Virtual infrastructure management in private and hybrid clouds," IEEE Internet Computing, vol. 34, no. 5, 2009, pp. 14-22.
  14. O. Litvinski and A. Gherbi, "Openstack scheduler evaluation using design of experiment," Proc. IEEE ISORC, 2013, pp. 1-7.
  15. J. Huang, C. Li, and J. Yu, "Resource prediction based on double exponential smoothing in cloud computing," Proc. IEEE CECNet, 2012, pp. 2056-2060.