TY - GEN
T1 - A text mining methodology to discover syllabi similarities among higher education institutions
AU - Orellana, Gerardo
AU - Orellana, Marcos
AU - Saquicela, Victor
AU - Baculima, Fernando
AU - Piedra, Nelson
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/12/5
Y1 - 2018/12/5
N2 - Students' mobility and credit validation has been a concern for several years among higher education institutions in Ecuador, this process involves a huge amount of manual work due to the absence of an automatic system to measure the similarity between different course contents. In order to tackle this problem, we propose an approach to semantically compare the syllabi contents through text similarity methods. Such methods have been widely used in different domains, in this work we take the higher education institutions syllabi to the Text mining world and develop a method to compare their semantic contents. We propose an approach that uses pre-processing techniques, Latent Semantic Analysis for dimensionality reduction, text enrichment through the Wikipedia API and Google Engine, Support Vector Machine as classifier, and cosine similarity as similarity metric. Our results show that our method successfully measures similarity among higher education institutions syllabi and can be generalized to most Ecuadorian institutions.
AB - Students' mobility and credit validation has been a concern for several years among higher education institutions in Ecuador, this process involves a huge amount of manual work due to the absence of an automatic system to measure the similarity between different course contents. In order to tackle this problem, we propose an approach to semantically compare the syllabi contents through text similarity methods. Such methods have been widely used in different domains, in this work we take the higher education institutions syllabi to the Text mining world and develop a method to compare their semantic contents. We propose an approach that uses pre-processing techniques, Latent Semantic Analysis for dimensionality reduction, text enrichment through the Wikipedia API and Google Engine, Support Vector Machine as classifier, and cosine similarity as similarity metric. Our results show that our method successfully measures similarity among higher education institutions syllabi and can be generalized to most Ecuadorian institutions.
KW - Education
KW - Students mobility
KW - Syllabus similarity
KW - Text mining
KW - Text similarity
UR - https://www.scopus.com/pages/publications/85063225864
U2 - 10.1109/INCISCOS.2018.00045
DO - 10.1109/INCISCOS.2018.00045
M3 - Contribución a la conferencia
AN - SCOPUS:85063225864
T3 - Proceedings - 3rd International Conference on Information Systems and Computer Science, INCISCOS 2018
SP - 261
EP - 268
BT - Proceedings - 3rd International Conference on Information Systems and Computer Science, INCISCOS 2018
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 3rd International Conference on Information Systems and Computer Science, INCISCOS 2018
Y2 - 14 November 2018 through 16 November 2018
ER -