Geometry and Color Transformation Data Augmentation for YOLOV8 in Beverage Waste Detection

Sabar Muhamad Itikap; Muhammad Syahid Abdurrahman; Eddy Bambang Soewono; Trisna Gelar

doi:10.17509/seict.v4i2.64400

Geometry and Color Transformation Data Augmentation for YOLOV8 in Beverage Waste Detection

Sabar Muhamad Itikap, Muhammad Syahid Abdurrahman, Eddy Bambang Soewono, Trisna Gelar

Abstract

In the bottle sorting process in real world, there are some beverage packaging waste that is deformed. Deformed objects can result in detection errors by an object detection system. Detection errors can also occur in attributes that share similar feature maps. Detection errors can be caused by models that are unable to generalize to the data. Several methods have been devised to prevent such issues, with data augmentation being one of them. To increase the variety of data, data enhancement techniques will be utilized. This research employs a data augmentation technique that concentrates on geometry transformations such as scaling and rotation, as well as color transformations such as hue, saturation, and brightness. Additionally, a combination of geometry and color transformations was conducted, resulting in a total of 39 experimental scenarios. This study demonstrates that data augmentation can affect the model's performance in terms of accuracy and the number of detection results. The combined method of scaling and rotation, which is applied to the original data, reveals the optimal experimental scenario with an accuracy of 88.4%.

Keywords

Data Augmentation; Detection; Data Variation; Scaling

Full Text:

PDF

References

Brownlee, J. (2019). Better Deep Learning Train Faster, Reduce Overfitting, and Make Better Predictions.

Chen, L., & Wang, C. (2019). Application of Deep Convolutional Neural Network in Computer Vision. International Journal of Engineering Intelligent Systems, 27(4), 185–192.

Chin, J. (2021, September 30). Hue, Saturation, Value: How to Use HSV Color Model in Photography – 2023

Color Jitter. (t.t.). Hasty.Ai Documentation. Diambil 24 November 2023, dari https://wiki.cloudfactory.com/docs/mp-wiki/augmentations/color-jitter

Deepan, P., & Sudha, L. R. (2020). Object classification of remote sensing image using deep convolutional neural network. The

Cognitive Approach in Cloud Computing and Internet of Things Technologies for Surveillance Tracking Systems, 107–120. https://doi.org/10.1016/b978-0-12-816385-6.00008-8

FromjintoA. (2021, November 23). [hyperparameters] batch/batch size/epoch/iteration 배치, 에포크. JINSTORY. https://geniewishescometrue.tistory.com/entry/ML-DL-WIKI-BatchBatch-sizeEpochIteration

Kim, P. (2017). Matlab deep learning: With machine learning, neural networks and artificial intelligence. Apress.

Lei, C., Hu, B., Wang, D., Zhang, S., & Chen, Z. (2019). A Preliminary Study on Data Augmentation of Deep Learning for Image Classification. Proceedings of the 11th Asia-Pacific Symposium on Internetware, 1–6.

Muttaqin, F. A., & Mukaharil Bachtiar, A. (2020). Implementasi Teks Mining Pada Aplikasi Pengawasan Penggunaan Internet Anak “Dodo Kids Browser. Jurnal Ilmiah Komputer dan Informatika (KOMPUTA)

Nyuytiyimbiy, K. (2020, December 30). Parameters, Hyperparameters, Machine Learning | Towards Data Science. https://towardsdatascience.com/parameters-and-hyperparameters-aa609601a9acS.

Patel, R., & Patel, S. (2020). A Comprehensive Study of Applying Convolutional Neural Network for Computer Vision. International Journal of Advanced Science and Technology, 6, 2161–2174.

Redmon, J., Divvala, S., Girshick, R., & Farhadi, A. (2016). You only look once: Unified, real-time object detection. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). https://doi.org/10.1109/cvpr.2016.91

Seita, D. (t.t.). 1000x Faster Data Augmentation. The Berkeley Artificial Intelligence Research Blog. Diambil 24 November 2023, dari http://bair.berkeley.edu/blog/2019/06/07/data_aug/

Shorten, C., & Khoshgoftaar, T. M. (2019). A survey on Image Data Augmentation for Deep Learning. Journal of Big Data, 6(1), 60. https://doi.org/10.1186/s40537-019-0197-0

Solawetz, J. (2023, January 25). What is Yolov8? the ultimate guide. Roboflow Blog. https://blog.roboflow.com/whats-new-in-yolov8

Soviany, P., & Ionescu, R. T. (2018). Optimizing the trade-off between single-stage and two-stage deep object detectors using image difficulty prediction. 2018 20th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC).

Thanapol, P. et al. (2020) ‘Reducing overfitting and improving generalization in training convolutional neural network (CNN) under limited sample sizes in image recognition’, 2020 - 5th International Conference on Information Technology (InCIT) [Preprint].

Ting, K. M. (2010). Precision and Recall. Dalam C. Sammut & G. I. Webb (Ed.), Encyclopedia of Machine Learning (hlm. 781–781). Springer US.

Ultralytics. (n.d.). Home. Ultralytics YOLOv8 Docs. https://docs.ultralytics.com/

Understanding the HSV Color Model. (t.t.). Lifewire. Diambil 24 November 2023, dari https://www.lifewire.com/what-is-hsv-in-design-1078068

Yamashita, R., Nishio, M., Do, R. K., & Togashi, K. (2018). Convolutional Neural Networks: An overview and application in Radiology. Insights into Imaging, 9(4), 611–629.

Yang, S., Xiao, W., Zhang, M., Guo, S., Zhao, J., & Shen, F. (2022). Image Data Augmentation for Deep Learning: A Survey (arXiv:2204.08610; Versi 1). arXiv.

Ying, X. (2019). An overview of overfitting and its solutions. Journal of Physics: Conference Series, 1168, 022022.

Zeugmann, T., Poupart, P., Kennedy, J., Jin, X., Han, J., Saitta, L., Sebag, M., Peters, J., Bagnell, J. A., Daelemans, W., Webb, G. I.,

Ting, K. M., Ting, K. M., Webb, G. I., Shirabad, J. S., Fürnkranz, J., Hüllermeier, E., Matwin, S., Sakakibara, Y., … Fürnkranz, J. (2011). Precision and Recall. Encyclopedia of Machine Learning, 781–781.

Zhang, H., Zhang, L., & Jiang, Y. (2019). Overfitting and underfitting analysis for deep learning based end-to-end Communication Systems. 2019 11th International Conference on Wireless Communications and Signal Processing (WCSP).

Zhao, Z. Q., Zheng, P., Xu, S. T., & Wu, X. (2019). Object Detection with Deep Learning: A Review. IEEE Transactions on Neural Networks and Learning Systems, 30(11), 3212–3232. https://doi.org/10.1109/TNNLS.2018.2876865

DOI: https://doi.org/10.17509/seict.v4i2.64400

Refbacks

There are currently no refbacks.

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Journal of Software Engineering, Information and Communicaton Technology (SEICT),
(e-ISSN:2774-1699 | p-ISSN:2744-1656) published by Program Studi Rekayasa Perangkat Lunak, Kampus UPI di Cibiru.

Indexed by.

Username
Password
Remember me