Improving the sentiment classification of stock tweets

Li, Sheng (2014). Improving the sentiment classification of stock tweets. University of Birmingham. Ph.D.

[img]
Preview
Li14PhD.pdf
PDF - Accepted Version

Download (2MB)

Abstract

This research focuses on improving stock tweet sentiment classification accuracy with the addition of the linguistic features of stock tweets. Stock prediction based on social media data has been popular in recent years, but none of the previous studies have provided a comprehensive understanding of the linguistic features of stock tweets. Hence, applying a simple statistical model to classifying the sentiment of stock tweets has reached a bottleneck. Thus, after analysing the linguistic features of stock tweets, this research used these features to train four machine learning classifiers. Each of them showed an improvement, and the best one achieved a 9.7% improvement compared to the baseline model. The main contributions of this research are fivefold: (a) it provides an in-­depth linguistic analysis of stock tweets; (b) it gives a clear and comprehensive definition of stock tweets; (c) it provides a simple but effective way to automatically identify stock tweets; (d) it provides a simple but effective method of generating a localised sentiment keyword list; and (e) it demonstrates a significant improvement of stock tweet sentiment classification accuracy.

Type of Work: Thesis (Doctorates > Ph.D.)
Award Type: Doctorates > Ph.D.
Supervisor(s):
Supervisor(s)EmailORCID
Groom, NicholasUNSPECIFIEDUNSPECIFIED
Millar, NeilUNSPECIFIEDUNSPECIFIED
Licence:
College/Faculty: Colleges (2008 onwards) > College of Arts & Law
School or Department: School of English, Drama and American & Canadian Studies
Funders: None/not applicable
Subjects: H Social Sciences > HG Finance
H Social Sciences > HM Sociology
P Language and Literature > P Philology. Linguistics
URI: http://etheses.bham.ac.uk/id/eprint/5521

Actions

Request a Correction Request a Correction
View Item View Item

Downloads

Downloads per month over past year