TIIS (Çѱ¹ÀÎÅͳÝÁ¤º¸ÇÐȸ)
Current Result Document :
ÇѱÛÁ¦¸ñ(Korean Title) |
Malaysian Name-based Ethnicity Classification using LSTM |
¿µ¹®Á¦¸ñ(English Title) |
Malaysian Name-based Ethnicity Classification using LSTM |
ÀúÀÚ(Author) |
Xibin Jia
Zijia Lu
Qing Mi
Zhefeng An
Xiaoyong Li
Min Hong
Youngbum Hur
|
¿ø¹®¼ö·Ïó(Citation) |
VOL 16 NO. 12 PP. 3855 ~ 3867 (2022. 12) |
Çѱ۳»¿ë (Korean Abstract) |
|
¿µ¹®³»¿ë (English Abstract) |
Name separation (splitting full names into surnames and given names) is not a tedious task in a multiethnic country because the procedure for splitting surnames and given names is ethnicity-specific. Malaysia has multiple main ethnic groups; therefore, separating Malaysian full names into surnames and given names proves a challenge. In this study, we develop a two-phase framework for Malaysian name separation using deep learning. In the initial phase, we predict the ethnicity of full names. We propose a recurrent neural network with long short-term memory network–based model with character embeddings for prediction. Based on the predicted ethnicity, we use a rule-based algorithm for splitting full names into surnames and given names in the second phase. We evaluate the performance of the proposed model against various machine learning models and demonstrate that it outperforms them by an average of 9%. Moreover, transfer learning and fine-tuning of the proposed model with an additional dataset results in an improvement of up to 7% on average. |
Å°¿öµå(Keyword) |
graph deep clustering
heterogeneous information networks
representation learning
student behavior modeling
Deep Learning
Recurrent Neural Network
LSTM
Machine Learning
Ethnicity Classification
Malaysian Name Separation
Deep Learning-based Name Separation
|
ÆÄÀÏ÷ºÎ |
PDF ´Ù¿î·Îµå
|