A new statistics-based approach to improve Word2Vec's sentiment classification success

dc.authorid0000-0002-4216-0542en_US
dc.contributor.authorBilgin, Metin
dc.date.accessioned2023-01-20T07:55:55Z
dc.date.available2023-01-20T07:55:55Z
dc.date.issued2021en_US
dc.departmentBaşka Kurumen_US
dc.description.abstractSentiment classification is the process of predicting the emotion that the text wants to give by analyzing the written texts. Studies on estimating the emotion of a sentence or a document rather than the meaning of a word have increased in recent years. In this study, statistical approaches that can be alternative to the current use of the Word2Vec method in sentiment classification are presented. Currently, when a sentiment classification is desired to be made with Word2Vec, the arithmetic average of the vectors created for all words in the relevant document is taken. In this study, the performances of the statistical methods presented as an alternative to the arithmetic mean for 5 different machine learning methods on 2 different data sets were compared. In addition, the results obtained by performing the same studies in Doc2Vec and BoW were compared with Word2Vec. Among the proposed approaches, Median has achieved better results than both the mean and the other two proposed methods. As a reason for this, it can be said that the media shows the central distribution better. Although the Word2Vec-CBOW approach obtained similar values to SG, it was observed that it produced more stable results. Word2Vec has achieved better results than both Doc2Vec and BoW. Among the proposed statistical approaches, it can be said that Median has a positive effect on the success of the system when used with Word2Vec and can be an alternative to the mean approach used in the literature.en_US
dc.identifier.citationBilgin, M., (2021). A new statistics-based approach to improve Word2Vec's sentiment classification success. Selcuk University Journal of Engineering Sciences, 20 (03), 63-72.en_US
dc.identifier.endpage72en_US
dc.identifier.issn2757-8828en_US
dc.identifier.issue3en_US
dc.identifier.startpage63en_US
dc.identifier.urihttps://hdl.handle.net/20.500.12395/44963
dc.identifier.volume20en_US
dc.language.isoenen_US
dc.publisherSelçuk Üniversitesien_US
dc.relation.ispartofSelcuk University Journal of Engineering Sciencesen_US
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Başka Kurum Yazarıen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.selcuk20240510_oaigen_US
dc.subjectNatural Language Processingen_US
dc.subjectMachine Learningen_US
dc.subjectWord Vectorsen_US
dc.subjectSentiment Classificationen_US
dc.subjectDeep Learningen_US
dc.titleA new statistics-based approach to improve Word2Vec's sentiment classification successen_US
dc.typeArticleen_US

Dosyalar

Orijinal paket
Listeleniyor 1 - 1 / 1
Yükleniyor...
Küçük Resim
İsim:
558-2684-1-PB.pdf
Boyut:
496.68 KB
Biçim:
Adobe Portable Document Format
Açıklama:
Makale Dosyası
Lisans paketi
Listeleniyor 1 - 1 / 1
Küçük Resim Yok
İsim:
license.txt
Boyut:
1.44 KB
Biçim:
Item-specific license agreed upon to submission
Açıklama: