TY - GEN
T1 - Exploring New Horizons on the Interpretation of PCA Using LLM for Data Analysis
AU - Saquicela, Victor
AU - Palacio-Baus, Kenneth
AU - Bravo, Mercy Orellana
AU - Espinoza-Mejía, Mauricio
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.
PY - 2026
Y1 - 2026
N2 - This paper proposes an innovative approach to address the challenge of semantic interpretation of principal components generated by the widely used Principal Component Analysis (PCA) on complex datasets. We propose a novel method that incorporates large language models (LLM) into the interpretation process. This approach aims to bridge the gap between the statistical complexity of PCA and the practical applicability of latent variables. Our initial results demonstrate the effectiveness and promising precision that this approach can achieve in translating complex mathematical relationships into contextually rich semantic interpretation. This work represents a significant step towards improving interpretability in data science, and a valuable resource to facilitate informed decision making and understanding in diverse research and data analysis contexts.
AB - This paper proposes an innovative approach to address the challenge of semantic interpretation of principal components generated by the widely used Principal Component Analysis (PCA) on complex datasets. We propose a novel method that incorporates large language models (LLM) into the interpretation process. This approach aims to bridge the gap between the statistical complexity of PCA and the practical applicability of latent variables. Our initial results demonstrate the effectiveness and promising precision that this approach can achieve in translating complex mathematical relationships into contextually rich semantic interpretation. This work represents a significant step towards improving interpretability in data science, and a valuable resource to facilitate informed decision making and understanding in diverse research and data analysis contexts.
KW - Interpretation
KW - LLM
KW - PCA
UR - https://www.scopus.com/pages/publications/105016223884
UR - https://www.mendeley.com/catalogue/84a688f9-4dad-3396-beec-1876c27d0de3/
U2 - 10.1007/978-3-031-98287-3_22
DO - 10.1007/978-3-031-98287-3_22
M3 - Contribución a la conferencia
AN - SCOPUS:105016223884
SN - 9783031982866
T3 - Communications in Computer and Information Science
SP - 306
EP - 321
BT - Smart Technologies, Systems and Applications - 4th International Conference, SmartTech-IC 2024, Revised Selected Papers
A2 - Narváez, Fabián R.
A2 - Villa, Micaela N.
A2 - Díaz, Gloria M.
PB - Springer Science and Business Media Deutschland GmbH
T2 - 4th International Conference on Smart Technologies, Systems and Applications, SmartTech-IC 2024
Y2 - 2 December 2024 through 4 December 2024
ER -