Univariate Time Series missing data Imputation using Pix2Pix GAN
Keywords:
Time Series, Imputation, cGAN, Rede Pix2PixAbstract
The use of data is essential for the supply of business, scientific and other processes. Often the consumption of these data is hampered when there are sample losses. Aiming to recover values representative of these losses, there are several approaches for filling them. In this paper, we propose a new method for imputation of missing data that transforms time series into an image and thus performs imputation using the conditional generative adversarial network (cGAN) pix2pix GAN. The results of ASMAPE and MAE show that the network outperforms all methods in 50% of the datasets. It was also revealed that the proposed network can learn time series features and retain some advantages over traditional methods, such as imputing the data in its entirety and exploiting spatial and temporal features for imputation.
Downloads
References
E. A. Codling, M. J. Plank, and S. Benhamou, “Random walk models in
biology,” Journal of the Royal society interface, vol. 5, no. 25, pp. 813–
, 2008
R. S. Tsay, “Testing and modeling threshold autoregressive processes,”
Journal of the American statistical association, vol. 84, no. 405, pp. 231–
, 1989.
H. Aboussaid, “The effect of ethnic fractionalization on economic
development. a multilevel analysis,” 2021.
W. A. Fuller, Introduction to statistical time series. John Wiley & Sons,
K. Lakshminarayan, S. A. Harp, R. P. Goldman, T. Samad, et al.,
“Imputation of missing data using machine learning techniques.,” in
KDD, vol. 96, 1996.
S. Rani and A. Solanki, “Data imputation in wireless sensor network
using deep learning techniques,” in Data analytics and management,
pp. 579–594, Springer, 2021.
S. N. Hussain, A. A. Aziz, M. Hossen, N. A. A. Aziz, G. R. Murthy, F. B. Mustakim, et al., “A novel framework based on cnn-lstm neural network for prediction of missing values in electricity consumption time-series datasets.,” Journal of Information Processing Systems, vol. 18, no. 1, 2022.
T. Huamin, D. Qiuqun, and X. Shanzhu, “Reconstruction of time series
with missing value using 2d representation-based denoising autoen-
coder,” Journal of Systems Engineering and Electronics, vol. 31, no. 6,
pp. 1087–1096, 2020.
Z. Guo, Y. Wan, and H. Ye, “A data imputation method for multivariate
time series based on generative adversarial network,” Neurocomputing,
vol. 360, pp. 185–197, 2019.
S. Kawakura and R. Shibasaki, “Deep learning-based self-driving car:
Jetbot with nvidia ai board to deliver items at agricultural workplace
with object-finding and avoidance functions,” European Journal of
Agriculture and Food Sciences, vol. 2, no. 3, 2020.
Y. Kortli, M. Jridi, A. Al Falou, and M. Atri, “Face recognition systems:
A survey,” Sensors, vol. 20, no. 2, p. 342, 2020.
Z. Wang and T. Oates, “Imaging time-series to improve classification
and imputation,” in Twenty-Fourth International Joint Conference on
Artificial Intelligence, 2015.
N. Hatami, Y. Gavet, and J. Debayle, “Classification of time-series
images using deep convolutional neural networks,” in Tenth international
conference on machine vision (ICMV 2017), vol. 10696, p. 106960Y,
International Society for Optics and Photonics, 2018.
Y. Zhuang, R. Ke, and Y. Wang, “An innovative method for traffic
data imputation based on convolutional neural network,” IET Intelligent
Transport Systems, vol. 13, 11 2018.
Y. Luo, X. Cai, Y. Zhang, J. Xu, et al., “Multivariate time series
imputation with generative adversarial networks,” Advances in neural
information processing systems, vol. 31, 2018.
E. Brophy, Z. Wang, and T. E. Ward, “Quick and easy time se-
ries generation with established image-based gans,” arXiv preprint
arXiv:1902.05624, 2019.
E. Brophy, Z. Wang, Q. She, and T. Ward, “Generative adversarial
networks in time series: A survey and taxonomy,” arXiv preprint
arXiv:2107.11098, 2021.
S. Oehmcke, T.-H. K. Chen, A. V. Prishchepov, and F. Gieseke,
“Creating cloud-free satellite imagery from image time series with deep
learning,” in Proceedings of the 9th ACM SIGSPATIAL International
Workshop on Analytics for Big Geospatial Data, pp. 1–10, 2020.
R. C. Pereira, M. S. Santos, P. P. Rodrigues, and P. H. Abreu, “Reviewing
autoencoders for missing data imputation: Technical trends, applications
and outcomes,” Journal of Artificial Intelligence Research, vol. 69,
pp. 1255–1285, 2020.
Y. Xia, L. Zhang, N. Ravikumar, R. Attar, S. K. Piechnik, S. Neubauer,
S. E. Petersen, and A. F. Frangi, “Recovering from missing data
in population imaging–cardiac mr image imputation via conditional
generative adversarial nets,” Medical Image Analysis, vol. 67, p. 101812,
A. T. Elergone, “ElectricityLoadDiagrams20112014 Data Set,” 2020.
K. Dunn, “OpenMV.net Datasets.” https://openmv.net/, 2018.
H. Fanaee-T and J. Gama, “Event labeling combining ensemble detectors
and background knowledge,” Progress in Artificial Intelligence, pp. 1–
, 2013.
D. H. Stolfi, E. Alba, and X. Yao, “Predicting car park occupancy rates
in smart cities,” in International Conference on Smart Cities, pp. 107–
, Springer, 2017.
KANKANA, “Daily minimum temperatures in me.”
J. Brownlee,“Machine learning datasets.”
urlhttps://raw.githubusercontent.com/jbrownlee/Datasets/master/monthly-
sunspots.csv, dez 2020. accessed: 20-12-2021.
S. Moritz, M. Friese, A. Fischbach, C. Schlitt, and T. Bartz-Beielstein,
“GECCO Industrial Challenge 2015 Dataset: A heating system dataset
for the ’Recovering missing information in heating system operating
data’ competition at the Genetic and Evolutionary Computation Confer-
ence 2015, Madrid, Spain,” May 2015.
P. Isola, J.-Y. Zhu, T. Zhou, and A. A. Efros, “Image-to-image translation
with conditional adversarial networks,” in Proceedings of the IEEE
conference on computer vision and pattern recognition, pp. 1125–1134,
M. Mirza and S. Osindero, “Conditional generative adversarial nets,”
arXiv preprint arXiv:1411.1784, 2014.
A. Flores, H. Tito, and C. Silva, “Local average of nearest neighbors:
Univariate time series imputation,” International Journal of Advanced
Computer Science and Applications, vol. 10, no. 8, pp. 45–50, 2019.
C. J. Willmott and K. Matsuura, “Advantages of the mean absolute error
(mae) over the root mean square error (rmse) in assessing average model
performance,” Climate research, vol. 30, no. 1, pp. 79–82, 2005.
V. Kreinovich, H. T. Nguyen, and R. Ouncharoen, “How to estimate
forecasting quality: A system-motivated derivation of symmetric mean
absolute percentage error (smape) and other similar characteristics,”
M. Daraghmeh, A. Agarwal, R. Manzano, and M. Zaman, “Time series
forecasting using facebook prophet for cloud resource management,”
in 2021 IEEE International Conference on Communications Workshops
(ICC Workshops), pp. 1–6, IEEE, 2021.
S. Zhang, L. Gong, Q. Zeng, W. Li, F. Xiao, and J. Lei, “Imputation of
gps coordinate time series using missforest,” Remote Sensing, vol. 13,
no. 12, p. 2312, 2021.