Vehicle CO2 Emission Predictive Analytics Using HistGradientBoosting Regression Algorithms

Authors

  • Bagus Al Qohar Universitas Negeri Semarang https://orcid.org/0009-0004-9677-455X
  • Ahmad Ubai Dullah Universitas Negeri Semarang
  • Putri Utami Universitas Negeri Semarang
  • Jumanto Unjung Universitas Negeri Semarang

DOI:

https://doi.org/10.69616/mit.v2i1.198
\

Keywords:

Environmental sustainability, HistGradientBoosting, Machine learning, Predictive analytics, Vehicle CO? emissions

Abstract

Vehicle CO2 emissions are a significant contributor to climate change, so research on this subject is needed. Strong prediction models and data analysis techniques are required to obtain accurate results. This research aims to analyze and predict vehicle CO2 emissions using machine learning algorithms. Given its efficiency in handling large datasets, the HistGradientBoosting Regression algorithm was selected for predicting vehicle CO2 emissions. The process commenced with meticulous data preparation, which involved cleaning and feature engineering. Key factors such as engine size, fuel economy, and vehicle weight were analyzed to gain insights into their impact on emissions. The study utilized a dataset comprising vehicle specifications and emissions, training and testing the HistGradientBoosting. The model's performance was evaluated using metrics like Mean Absolute Error (MAE),  Mean Squared Error (MSE), Root Mean Squared Error (RMSE), and the Coefficient of Determination (R²). The findings indicate that this approach effectively identifies significant factors influencing emissions while achieving impressive prediction accuracy. This research offers valuable insights for policymakers and manufacturers aiming to develop low-emission vehicles and promote sustainable transportation initiatives. The paper highlights the capability of machine learning to address environmental challenges.

References

M. Filonchyk, M. P. Peterson, L. Zhang, V. Hurynovich, and Y. He, “Greenhouse gases emissions and global climate change: Examining the influence of CO2, CH4, and N2O,” Sci. Total Environ., vol. 935, p. 173359, Jul. 2024, https://doi.org/10.1016/j.scitotenv.2024.173359.

C. Yin, J. Wu, X. Sun, Z. Meng, and C. Lee, “Road transportation emission prediction and policy formulation: Machine learning model analysis,” Transp. Res. Part D Transp. Environ., vol. 135, p. 104390, Oct. 2024, https://doi.org/10.1016/j.trd.2024.104390.

B. Zhu, S. Hu, X. (Michael) Chen, C. Roncoli, and D.-H. Lee, “Uncovering driving factors and spatiotemporal patterns of urban passenger car CO2 emissions: A case study in Hangzhou, China,” Appl. Energy, vol. 375, p. 124094, Dec. 2024, https://doi.org/10.1016/j.apenergy.2024.124094.

W. Eko Cahyono, Parikesit, B. Joy, W. Setyawati, and R. Mahdi, “Projection of CO2 emissions in Indonesia,” Mater. Today Proc., vol. 63, pp. S438–S444, 2022, https://doi.org/10.1016/j.matpr.2022.04.091.

T. Thurner, K. Fursov, and A. Nefedova, “Early adopters of new transportation technologies: Attitudes of Russia’s population towards car sharing, the electric car and autonomous driving,” Transp. Res. Part A Policy Pract., vol. 155, pp. 403–417, Jan. 2022, https://doi.org/10.1016/j.tra.2021.11.006.

S. Cesar de Lima Nogueira, S. H. Och, L. M. Moura, E. Domingues, L. dos S. Coelho, and V. C. Mariani, “Prediction of the NOx and CO2 emissions from an experimental dual fuel engine using optimized random forest combined with feature engineering,” Energy, vol. 280, p. 128066, Oct. 2023, https://doi.org/10.1016/j.energy.2023.128066.

H. Fu, D. Yang, S. Wang, L. Wang, and D. Wang, “A novel online energy management strategy for fuel cell vehicles based on improved random forest regression in multi road modes,” Energy Convers. Manag., vol. 305, p. 118261, Apr. 2024, https://doi.org/10.1016/j.enconman.2024.118261.

G. Zhou, L. Mao, T. Bao, and F. Zhuang, “Machine learning-driven CO2 emission forecasting for light-duty vehicles in China,” Transp. Res. Part D Transp. Environ., vol. 137, p. 104502, Dec. 2024, https://doi.org/10.1016/j.trd.2024.104502.

T. A. Munshi, L. N. Jahan, M. F. Howladar, and M. Hashan, “Prediction of gross calorific value from coal analysis using decision tree-based bagging and boosting techniques,” Heliyon, vol. 10, no. 1, p. e23395, Jan. 2024, https://doi.org/10.1016/j.heliyon.2023.e23395.

B. Sahan, “Vehicle CO2 Emissions Dataset.” Accessed: Nov. 25, 2024. [Online]. Available: https://www.kaggle.com/datasets/brsahan/vehicle-co2-emissions-dataset/

N. B. Shaik, K. Jongkittinarukorn, and K. Bingi, “XGBoost based enhanced predictive model for handling missing input parameters: A case study on gas turbine,” Case Stud. Chem. Environ. Eng., vol. 10, p. 100775, Dec. 2024, https://doi.org/10.1016/j.cscee.2024.100775.

H. Mesghali, B. Akhlaghi, N. Gozalpour, J. Mohammadpour, F. Salehi, and R. Abbassi, “Predicting maximum pitting corrosion depth in buried transmission pipelines: Insights from tree-based machine learning and identification of influential factors,” Process Saf. Environ. Prot., vol. 187, pp. 1269–1285, Jul. 2024, https://doi.org/10.1016/j.psep.2024.05.014.

R. N. Nashikkar, Y. N. Padhye, and R. Ingle, “Enhancing Malicious Domain Detection Using Advanced Machine Learning Techniques,” in 2023 IEEE Pune Section International Conference (PuneCon), IEEE, Dec. 2023, pp. 1–6. https://doi.org/10.1109/PuneCon58714.2023.10450038.

E. A. Elhadjamor and H. Harbaoui, “A Comparison Analysis of Heart Disease Prediction Using Supervised Machine Learning Techniques,” in 2024 IEEE Symposium on Computers and Communications (ISCC), IEEE, Jun. 2024, pp. 1–6. https://doi.org/10.1109/ISCC61673.2024.10733656.

N. S. K. M. K. Tirumanadham, T. S, and S. M, “Evaluating Boosting Algorithms for Academic Performance Prediction in E-Learning Environments,” in 2024 International Conference on Intelligent and Innovative Technologies in Computing, Electrical and Electronics (IITCEE), IEEE, Jan. 2024, pp. 1–8. https://doi.org/10.1109/IITCEE59897.2024.10467968.

D. A. A. Pertiwi, K. Ahmad, S. N. Salahudin, A. M. Annegrat, and M. A. Muslim, “Using Genetic Algorithm Feature Selection to Optimize XGBoost Performance in Australian Credit,” J. Soft Comput. Explor., vol. 5, no. 1, pp. 92–98, Apr. 2024, https://doi.org/10.52465/joscex.v5i1.302.

R. Rofik, R. Aulia, K. Musaadah, S. S. F. Ardyani, and A. A. Hakim, “Optimization of Credit Scoring Model Using Stacking Ensemble Learning and Oversampling Techniques,” J. Inf. Syst. Explor. Res., vol. 2, no. 1, Dec. 2023, https://doi.org/10.52465/joiser.v2i1.203.

A. S. Mahajan and A. Shrivastav, “Short Term Load Forecasting based on Regression models,” in 2023 International Conference for Advancement in Technology (ICONAT), IEEE, Jan. 2023, pp. 1–8. https://doi.org/10.1109/ICONAT57137.2023.10080359.

J. Qi, J. Du, S. M. Siniscalchi, X. Ma, and C.-H. Lee, “On Mean Absolute Error for Deep Neural Network Based Vector-to-Vector Regression,” IEEE Signal Process. Lett., vol. 27, pp. 1485–1489, 2020, https://doi.org/10.1109/LSP.2020.3016837.

K. Rajesh and M. S. Saravanan, “Prediction of Customer Spending Score for the Shopping Mall using Gaussian Mixture Model comparing with Linear Spline Regression Algorithm to reduce Root Mean Square Error,” in 2022 6th International Conference on Intelligent Computing and Control Systems (ICICCS), IEEE, May 2022, pp. 335–341. https://doi.org/10.1109/ICICCS53718.2022.9788162.

D. S. K. Karunasingha, “Root mean square error or mean absolute error? Use their ratio as well,” Inf. Sci. (Ny)., vol. 585, pp. 609–629, Mar. 2022, https://doi.org/10.1016/j.ins.2021.11.036.

J. Rewilak, “The (non) determinants of Olympic success,” J. Sports Econom., vol. 22, no. 5, pp. 546–570, Jun. 2021, https://doi.org/10.1177/1527002521992833.

G. Mani, J. K. Viswanadhapalli, and A. A. Stonie, “Prediction and Forecasting of Air Quality Index in Chennai using Regression and ARIMA time series models,” J. Eng. Res., vol. 9, Sep. 2021, https://doi.org/10.36909/jer.10253.

Published

2024-02-05

How to Cite

Al Qohar, B., Dullah, A. U., Utami, P., & Unjung, J. (2024). Vehicle CO2 Emission Predictive Analytics Using HistGradientBoosting Regression Algorithms. Jurnal Media Informasi Teknologi, 2(1), 1-12. https://doi.org/10.69616/mit.v2i1.198