Thiptanawat Phongwattana. Development of named entity recognition algorithm using long short-term memory and conditional random field as a hybrid technique. (). King Mongkut's University of Technology North Bangkok. Central Library. : , 2020.
Development of named entity recognition algorithm using long short-term memory and conditional random field as a hybrid technique
Abstract:
Currently, the data on the web has been dramatically
increased and the popular definition of a kind of the data
is "big data", however, the essence of data analysis is
regarding deep data that the word is used to define useful
information, practical use as well as it can be utilized. We
found that over 90 percent of the data on the internet is a
kind of text data. Hence, the key point of our research is
challenging with information extraction from a ton of data
from the internet that we call "text mining" in the
terminology. To achieve this goal, we propose deep
learning architecture. In general, a deep learning
technique can be utilized in part-of-speech tagging to
sentences, we found that long short-term memory
(LSTM) recurrent neural nets (RNN) with a conventional
named entity recognition (NER) method in text mining as
a hybrid algorithm can optimize in classification and
tagging of part-of-speech. With the state-of-the-art
method, the limitation of LSTM can only make use of the
neighboring tagging decisions. This research proposes a
method to support LSTM making a decision of tagging
words. The proposed method that we used is conditional
random field (CRF) to outperform in NER tagging to
sentences by making a decision of tags using other word
embedding. The evaluation of the contextual word
representation is also provided, which both accuracy and
F1 score are over 90 percent at the end of 15 epochs.
King Mongkut's University of Technology North Bangkok. Central Library