An Efficient VM-Level Scaling Scheme in an IaaS Cloud Computing System: A Queueing Theory Approach

Lee, Doo Ho;

doi:10.5392/IJoC.2017.13.2.029

International Journal of Contents

Volume 13 Issue 2
/
Pages.29-34
/
2017
/
1738-6764(pISSN)
/
2093-7504(eISSN)

The Korea Contents Association (한국콘텐츠학회)

DOI QR Code

An Efficient VM-Level Scaling Scheme in an IaaS Cloud Computing System: A Queueing Theory Approach

Lee, Doo Ho (Department of Industrial & Management Engineering Kangwon National University)

Received : 2017.03.09
Accepted : 2017.04.25
Published : 2017.06.28

https://doi.org/10.5392/IJoC.2017.13.2.029 Citation PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

Cloud computing is becoming an effective and efficient way of computing resources and computing service integration. Through centralized management of resources and services, cloud computing delivers hosted services over the internet, such that access to shared hardware, software, applications, information, and all resources is elastically provided to the consumer on-demand. The main enabling technology for cloud computing is virtualization. Virtualization software creates a temporarily simulated or extended version of computing and network resources. The objectives of virtualization are as follows: first, to fully utilize the shared resources by applying partitioning and time-sharing; second, to centralize resource management; third, to enhance cloud data center agility and provide the required scalability and elasticity for on-demand capabilities; fourth, to improve testing and running software diagnostics on different operating platforms; and fifth, to improve the portability of applications and workload migration capabilities. One of the key features of cloud computing is elasticity. It enables users to create and remove virtual computing resources dynamically according to the changing demand, but it is not easy to make a decision regarding the right amount of resources. Indeed, proper provisioning of the resources to applications is an important issue in IaaS cloud computing. Most web applications encounter large and fluctuating task requests. In predictable situations, the resources can be provisioned in advance through capacity planning techniques. But in case of unplanned and spike requests, it would be desirable to automatically scale the resources, called auto-scaling, which adjusts the resources allocated to applications based on its need at any given time. This would free the user from the burden of deciding how many resources are necessary each time. In this work, we propose an analytical and efficient VM-level scaling scheme by modeling each VM in a data center as an M/M/1 processor sharing queue. Our proposed VM-level scaling scheme is validated via a numerical experiment.

Keywords

References

M. K. Kim and J. Y. Choi, "An efficient two-phase heuristic policy for acceptance control in IaaS cloud service," Journal of the Society of Korea Industrial and Systems Engineering, vol. 38, no. 2, 2015, pp. 91-100. https://doi.org/10.11627/jkise.2015.38.2.91
T. W. Um, H. Lee, R. Woo, and J. K. Choi, "Dynamic resource allocation and scheduling for cloud-based virtual content delivery networks," ETRI Journal, vol. 36, no. 2, 2014, pp. 197-205. https://doi.org/10.4218/etrij.14.2113.0085
Z. Zhuang and C. Guo, "Building cloud-ready video transcoding system for content delivery networks (CDNs)," Proc. IEEE GCC, 2012, pp. 2048-2053.
J. He, Y. Wen, J. Huang, and D. Wu, "On the cost-QoE tradeoff for cloud-based video streaming under Amazon EC2's pricing models," IEEE Transactions on Circuits and Systems for Video Technology, vol. 24, no. 4, 2013, pp. 669-680. https://doi.org/10.1109/TCSVT.2013.2283430
S. P. Ponnusamy and E. Karthikeyan, "Cache optimization on hot-point proxy caching using weighted-Rank cache replacement policy," ETRI Journal, vol. 35, no. 4, 2013, pp. 687-696. https://doi.org/10.4218/etrij.13.0112.0606
R. S. Huckman, G. P. Pisano, and L. Kind, Amazon web service, Harvard Business School Case (609-048), 2008.
http://www.rightscale.com.
http://software.dell.com/products/cloud-manager.
H. Masuyama and T. Takine, "Sojourn time distribution in a MAP/M/1 processor-sharing queue," Operations Research Letters, vol. 31, no. 5, 2003, pp. 406-412. https://doi.org/10.1016/S0167-6377(03)00028-2
H. C. Tijms, Stochastic models: an algorithmic approach, John Wiley & Sons, Inc, 1994.
F. Guillemin and J. Boyer, "Analysis of the M/M/1 queue with processor sharing via spectral theory," Queueing Systems, vol. 39, no. 4, 2001, pp. 377-397. https://doi.org/10.1023/A:1013913827667
P. Barham, B. Dragovic, K. Fraser, S. Hand, T. Harris, A. Ho, R. Neugebauer, I. Pratt, and A. Warfield, "Xen and the art of virtualization," Proc. ACM SOSP, 2003, pp. 164-177.
B. Sotomayor, R. S. Montero, I. M. Llorente, and I. Foster, "Virtual infrastructure management in private and hybrid clouds," IEEE Internet Computing, vol. 34, no. 5, 2009, pp. 14-22.
O. Litvinski and A. Gherbi, "Openstack scheduler evaluation using design of experiment," Proc. IEEE ISORC, 2013, pp. 1-7.
J. Huang, C. Li, and J. Yu, "Resource prediction based on double exponential smoothing in cloud computing," Proc. IEEE CECNet, 2012, pp. 2056-2060.

International Journal of Contents

An Efficient VM-Level Scaling Scheme in an IaaS Cloud Computing System: A Queueing Theory Approach

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)