Evaluation of Clustering Techniques to Estimate the Effective Bandwidth of a Markovian Fluid from Traffic Traces

Authors

  • Carina Fernández Department of Mathematics, Universidad Nacional del Sur, Bahía Blanca (zip code 8000), Buenos Aires, Argentina https://orcid.org/0000-0002-8460-0278
  • José Bavio Department of Mathematics, Universidad Nacional del Sur, Bahía Blanca (zip code 8000), Buenos Aires, Argentina https://orcid.org/0000-0001-8195-2406
  • Beatriz Marrón Department of Mathematics, Universidad Nacional del Sur, Bahía Blanca (zip code 8000), Buenos Aires, Argentina https://orcid.org/0000-0002-5447-8515

Keywords:

Markovian fluid, Effective bandwidth, Supervised and unsupervised estimation methods, Clustering

Abstract

Integrated services digital networks, designed to transport data in real time, are modeled by a multiplexer system, where several data fluids share a single output. At the time of admission of a new connection, in order to maintain the quality of service (QoS), it is important to know the amount of available resources required by the connections sharing the channel. It is therefore important to have models for the sources and techniques that provide accurate estimates of the resources required for each of them. It is assumed that the network sources are modeled using the Generalized Markovian Fluid Model (GMFM) because of its versatility in describing traffic fluctuations. This is a Markovian fluid where the transfer rate is a random variable whose range and probability distribution are determined by the state of the modulating chain. To measure resource allocation, the concept of Effective Bandwidth (EB) is used, since it allows expressing this magnitude as a function of the model parameters, which will be estimated from traffic traces. As the size of the data to describe the behavior of a source is clearly enormous, they are studied using clustering techniques. In this work, different methods, supervised and unsupervised, are presented to estimate the parameters involved in the calculation of the EB. Finally, the performance of the estimators is analyzed by calculating partition comparison indices, corresponding to the dispatch intervals, which are based on the confusion matrix and the notion of mutual information entropy between partitions, on simulated data.

Downloads

Download data is not yet available.

Author Biographies

Carina Fernández, Department of Mathematics, Universidad Nacional del Sur, Bahía Blanca (zip code 8000), Buenos Aires, Argentina

I had my undergraduate degree in mathematics in 2011. I am a PhD student in mathematics at the Universidad Nacional del Sur. I am currently developing my teaching work in the area of Probability and Statistics at the Universidad Nacional del Sur. My research has focused on the study of the modeling of data networks using stochastic processes, the subject of my doctoral thesis. I have also published papers related to statistics didactics. I have done consulting for private companies. I am a member of two accredited research projects from 2019 to date.

José Bavio, Department of Mathematics, Universidad Nacional del Sur, Bahía Blanca (zip code 8000), Buenos Aires, Argentina

José Bavio, holds a degree in Mathematics, and the Ph.D. in Mathematics from the Universidad Nacional del Sur, Bahía Blanca, Argentina. He is currently developing his teaching work at the Math Department of Universidad Nacional del Sur and he is also a part time Professor at Universidad Salesiana, in Bahía Blanca. His research has been concerned with stochastic processes simulation and estimation of its parameters. He had also work in colaboration in different fields making statistic data analisys within python framework.

Beatriz Marrón, Department of Mathematics, Universidad Nacional del Sur, Bahía Blanca (zip code 8000), Buenos Aires, Argentina

Beatriz. S. Marrón holds a degree in Mathematics and the Ph.D. in Mathematics from the Universidad Nacional del Sur, Bahía Blanca, Argentina. She is a full time professor at the Department of Mathematics at the Universidad Nacional del Sur, Argentina. She is currently developing her teaching and research work in the area of Probability and Statistics and is director of accredited research projects. She is a delegate, representing the Universidad Nacional del Sur, in the Disciplinary Nucleus "Applied Mathematics" of the AUGM (Association of Group Montevideo Universities).

References

F. P. Kelly, S. Zachary, and I. B. Ziedins, eds., Notes on Effective Bandwidth, pp. 141–168. Oxford University Press, 1996.

C. A. Courcoubetis and A. Dimakis, “The price of queueing,” ArXiv, vol. abs/2007.08318, 2020.

M. Zehri, A. Haastrup, D. Rincon, J. R. Piney, S. Sallent, and A. Bazzi, “A qos-aware dynamic bandwidth allocation algorithm for passive optical networks with non-zero laser tuning time,” Photonics, 03 2021.

J. Zhao, J. Du, Y. Yue, and J. Liu, “Special issue on advanced technique and future perspective for next generation optical fiber communications,” Photonics, vol. 9, p. 280, 04 2022.

A. Parra León, E. M. Piedrahita, and O. Salcedo, “Aplicaciones del modelo on/of al tráfico agregado en las redes de comunicaciones,” Tecnura, vol. 15, p. 129–147, jun. 2011.

A. Seal, A. Karlekar, O. Krejcar, and E. Herrera-Viedma, “Performance and convergence analysis of modified c-means using jeffreys-divergence for clustering,” International Journal of Interactive Multimedia and Artificial Intelligence, vol. In Press, p. 1, 12 2021.

J. Bavio, C. Fernández, and B. Marrón, “Comparison of effective bandwidth estimation methods for data networks,” Global Journal of Computer Science and Technology: ENetwork, Web & Security, vol. 22, no. 2, pp. 12–20, 2022.

M. Meila and D. Heckerman, “An experimental comparison of model-based clustering methods,” Machine Learning, vol. 42, pp. 9–29, 2001.

N. X. Vinh, J. Epps, and J. Bailey, “Information theoretic measures for clusterings comparison: Variants, properties, normalization and correction for chance,” J. Mach. Learn. Res., vol. 11, p. 2837–2854, dec 2010.

J. Bavio and B. Marrón, “Properties of the estimators for the effective bandwidth in a generalized markov fluid model,” Open Journal of Statistics, vol. 8, no. 1, pp. 69–84, 2018.

C. M. Bishop, Pattern Recognition and Machine Learning. Springer, 2006.

M. Rosenblatt, “Remarks on some nonparametric estimates of a density function,” The Annals of Mathematical Statistics, vol. 27, no. 3, pp. 832 – 837, 1956.

R. O. Duda and P. E. Hart, Pattern classification and scene analysis. Wiley, New York, 1973.

A. P. Dempster, N. M. Laird, and D. B. Rubin, “Maximum likelihood from incomplete data via the em algorithm,” Journal of the Royal Statistical Society: Series B (Methodological), vol. 39, no. 1, pp. 1—22, 1977.

E. H. Koroishi, F. A. L. Molina, A. W. Faria, and V. Steffen Junior, “Clustering by passing messages between data points,” Journal of Aerospace Technology and Management, vol. 315, pp. 972–6, 2007.

P. N. Tan, M. Steinbach, and V. Kumar, Introduction to Data Mining. Pearson Addison Wesley, 2005.

L. Kaufman and P. Rousseeuw, Finding Groups in Data: An Introduction To Cluster Analysis. Wiley, New York, 1990.

F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, and E. Duchesnay, “Scikit-learn: Machine learning in Python,” Journal of Machine Learning Research, vol. 12, pp. 2825–2830, 2011.

C. E. Shannon, “A mathematical theory of communication,” The Bell System Technical Journal, vol. 27, no. 3, pp. 379–423, 1948.

Published

2023-04-18

How to Cite

Fernández, C., Bavio, J., & Marrón, B. (2023). Evaluation of Clustering Techniques to Estimate the Effective Bandwidth of a Markovian Fluid from Traffic Traces. IEEE Latin America Transactions, 21(5), 636–642. Retrieved from https://latamt.ieeer9.org/index.php/transactions/article/view/7678