Abstract
The outlier detection in the field of data mining and Knowledge Discovering from Data (KDD) is capturing special interest due to its benefits. It can be applied in the financial area; because the obtained data patterns can help finding possible frauds and user errors. Therefore, it is essential to assess the truthfulness of the information. In this context, data auditory process uses techniques of data mining that play a significant role in the detection of unusual behavior. Here, a method for detecting values that can be considered as outliers in a nominal database is proposed. The basic idea in this method is to implement: a Global k-Nearest Neighbors algorithm, a clustering algorithm named k-means, and a statistical method of chi-square. The application of algorithms has been developed with a database of candidate people for the granting of a loan. Each test was made on a dataset of 1180 registers in which outliers have been introduced deliberately. The experimental results show that the method is able to detect all introduced values, which were previously labeled to be differentiated. Consequently, there were found a total of 48 tuples with outliers of 11 nominal columns.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2019 International Conference on Information Systems and Computer Science, INCISCOS 2019 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 51-56 |
| Number of pages | 6 |
| ISBN (Electronic) | 9781728155814 |
| DOIs | |
| State | Published - Nov 2019 |
| Event | 4th International Conference on Information Systems and Computer Science, INCISCOS 2019 - Quito, Pichincha, Ecuador Duration: 20 Nov 2019 → 22 Nov 2019 |
Publication series
| Name | Proceedings - 2019 International Conference on Information Systems and Computer Science, INCISCOS 2019 |
|---|
Conference
| Conference | 4th International Conference on Information Systems and Computer Science, INCISCOS 2019 |
|---|---|
| Country/Territory | Ecuador |
| City | Quito, Pichincha |
| Period | 20/11/19 → 22/11/19 |
Keywords
- -chi-square
- -data-mining
- -financial-fraud
- -KNN
- outlier
Fingerprint
Dive into the research topics of 'Outlier Detection with Data Mining Techniques and Statistical Methods'. Together they form a unique fingerprint.Projects
- 1 Finished
-
FOG Computing applied to monitoring devices used in assisted life environments (Environment Living); Study case: platform for the elderly.
Cedillo Orellana, I. P. (Director), Campos Argudo, K. P. (Researcher), Granda Juca, M. F. (Researcher), Ortiz Segarra, J. I. (Researcher), Parra Gonzalez, L. O. (Researcher), Acosta-Urigüen, M. I. (Research Associate), Erazo Garzon, L. X. (Research Associate), Orellana Cordero, M. P. (Research Associate), Bermeo Arpi, A. E. (Assimilated Technical Staff), Arias Ochoa, J. H. (Research Assistant) & Arteaga Garcia, E. J. (Research Assistant)
3/09/18 → 28/02/21
Project: Research
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver