Statistical modeling of photovoltaic performance via PCA and regression: a case study

  • Abbes KADYRI CHOUAIB DOUKKALI UNIVERSITY
  • Mehdi FARHANE
  • Khalid KANDOUSSI
  • Otmane SOUHAR

Abstract

The performance of photovoltaic (PV) systems is highly influenced by a complex set of climatic and industrial parameters. This study aims to identify the most significant factors affecting PV power output by combining a predictive modeling approach using multiple linear regression with a multivariate statistical analysis based on Principal Component Analysis (PCA). The analysis is based on real-world weekly data that includes a wide range of \textbf{climatic factors} (solar irradiance, ambient and cell temperature, relative humidity, wind speed) and \textbf{industrial or technological parameters} (bandgap energy, diode ideality factor, thermal coefficient, angle of incidence ). The linear regression model quantifies the individual impact of each variable on PV power, while PCA reveals the latent structure of the dataset by identifying the most informative linear combinations of variables. Results consistently highlight solar irradiance, cell temperature, and technological factors such as $E_g$ and $A$ as the most critical determinants of PV output. This combined approach provides a robust and interpretable method for reducing model complexity, with practical implications for the design, monitoring, and optimization of photovoltaic systems in real-world conditions.

Downloads

Download data is not yet available.

References

Abbes Kadyri, Khalid Kandoussi, Otmane Souhar, An approach on mathematical modeling of PV module with sensitivity analysis: a case study. Journal of Computational Electronics, (2022), 1365-1372.

Abbes Kadyri, El Mahdi Assaid, Khalid Kandoussi, and Otmane Souhar, Numerical Model for Two Dimensional Temperature Distribution in Photovoltaic Panels with Experimental Validation, Preprint, 2025.

Hanna, S. R., D. G. Strimaitis, and J. C. Chang. ”Hazard response modeling uncertainty (a quantitative method). Volume 1. User’s Guide for Software for Evaluating Hazardous Gas Dispersion Models.” Sigma Research Corporation, Westford (1991).

Ibtissam Lamaamar, Amine Tilioua, Zaineb Benzaid, et al., Modelling Investigation of the Heat Transfer in a Polycrystalline Photovoltaic Module, E3S Web of Conferences, Vol. 336, EDP Sciences, 2022, p. 00038.

I. T. Jolliffe, Principal Component Analysis, 2nd ed., Springer, 2002.

S. Kavitha, S. Varuna, and R. Ramya, A Comparative Analysis on Linear Regression and Support Vector Regression, in 2016 Online International Conference on Green Engineering and Technologies (IC-GET), IEEE, 2016, pp. 1–5.

Mehdi Farhane, Omar Alehyane, and Otmane Souhar, Three-dimensional analytical solution of the advection-diffusion equation for air pollution dispersion, The ANZIAM Journal, 2022, vol. 64, no. 1, pp. 40–53.

Souhaila Chahboun and Mohamed Maaroufi, Principal Component Analysis and Artificial Intelligence Approaches, Advances in Principal Component Analysis, (2022), p. 67.Statistical Analysis of Photovoltaic Performance via PCA and Regression: A Case Study 11

M. Tranmer and M. Elliot, Multiple Linear Regression, The Cathie Marsh Centre for Census and Survey Research (CCSR), 2008, vol. 5,

M. W. Wang, W. Y. Wang, C. M. Chen, and C. C. Tseng, Mechanical and Cellular Evaluations of ACP-Enriched Biodegradable Micromolded PLA/PCL Bone Screws, Journal of Manufacturing and Materials Processing, 2025, vol. 9, no. 5, p. 154.

L. Zhang, Y. Da, W. Zhang, F. Li, J. Xu, L. Jing, et al., Evaluating the Output Performance of the Semiconductor Bridge Through Principal Component Analysis, Nanomaterials, vol. 15, no. 9, 2025, p. 672.

Published
2025-07-13
Section
Research Articles