Abstract
Natural Language Processing (NLP) is used to identify key information, generate predictive models, and explain global events or trends. NLP also supports the process of creating knowledge. It is therefore important to apply refinement techniques in major stages such as preprocessing, where data is frequently produced and processed with poor results. This paper analyzes and measures the impact of combinations of preprocessing techniques and libraries on short texts written in Spanish. The techniques were applied to tweets for sentiment analysis, considering as evaluation parameters the processing time and the characteristics of the techniques in each library. The experiments give readers insight for choosing an appropriate combination of techniques during preprocessing. The results show improvements of 5% to 9% in classification performance.
| Original language | English |
|---|---|
| Title of host publication | Advances in Information and Communication - Proceedings of the 2020 Future of Information and Communication Conference FICC |
| Editors | Kohei Arai, Supriya Kapoor, Rahul Bhatia |
| Publisher | Springer |
| Pages | 111-124 |
| Number of pages | 14 |
| ISBN (Print) | 9783030394417 |
| State | Published - 2020 |
| Event | Future of Information and Communication Conference, FICC 2020 - San Francisco, United States. Duration: 5 Mar 2020 → 6 Mar 2020 |
Publication series
| Name | Advances in Intelligent Systems and Computing |
|---|---|
| Volume | 1130 AISC |
| ISSN (Print) | 2194-5357 |
| ISSN (Electronic) | 2194-5365 |
Conference
| Conference | Future of Information and Communication Conference, FICC 2020 |
|---|---|
| Country/Territory | United States |
| City | San Francisco |
| Period | 5/03/20 → 6/03/20 |
Keywords
- Natural Language Processing
- Preprocessing
- Sentiment analysis
- Text mining
Fingerprint
Dive into the research topics of 'A Comparative Evaluation of Preprocessing Techniques for Short Texts in Spanish'. Together they form a unique fingerprint.
Projects
- 1 Finished
FOG Computing applied to monitoring devices used in assisted life environments (Environment Living); Study case: platform for the elderly.
Cedillo Orellana, I. P. (Director), Campos Argudo, K. P. (Researcher), Granda Juca, M. F. (Researcher), Ortiz Segarra, J. I. (Researcher), Parra Gonzalez, L. O. (Researcher), Acosta-Urigüen, M. I. (Research Associate), Erazo Garzon, L. X. (Research Associate), Orellana Cordero, M. P. (Research Associate), Bermeo Arpi, A. E. (Assimilated Technical Staff), Arias Ochoa, J. H. (Research Assistant) & Arteaga Garcia, E. J. (Research Assistant)
3/09/18 → 28/02/21
Project: Research