An automated learner-based reading ability estimation strategy using concept indexing with integrated Part-of-Speech n-gram features

Razon, Abigail R. (2017). An automated learner-based reading ability estimation strategy using concept indexing with integrated Part-of-Speech n-gram features. University of Birmingham. Ph.D.

[img]
Preview
Razon17PhD.pdf
PDF - Accepted Version

Download (5MB)

Abstract

This study is about the development of a retrainable reading ability estimation system based on concepts from the Text Readability Indexing (TRI) domain. This system aims to promote self-directed language learning and to serve as an educational reinforcement tool for English language learners. Student essays were used to calibrate the system which provided realistic approximations of their actual reading levels.

In this thesis, we compared the performance of two vector semantics-based algorithms, namely, Latent Semantic Indexing (LSI) and Concept Indexing (CI) for content analysis. Since these algorithms rely on the bag-of-words approach and inherently lack grammatical analysis, we augmented them using Part-of-Speech (POS) n-gram features to approximate the syntactic complexity of text documents.

Results show that directly combining the content-and grammar-based feature sets yielded lower classification accuracies than utilising each feature set alone. Using a sparsification strategy, we were able to optimise the combination process and, with the integration of POS bi-grams, we achieved our overall highest mean exact agreement accuracies (MEAA) of 0.924 and 0.952 for LSI and CI, respectively.

We have also conducted error analyses on our results where we examined overestimation and underestimation error types to uncover the probable causes for the systems' misclassifications.

Type of Work: Thesis (Doctorates > Ph.D.)
Award Type: Doctorates > Ph.D.
Supervisor(s):
Supervisor(s)EmailORCID
Barnden, John A.UNSPECIFIEDUNSPECIFIED
Licence:
College/Faculty: Colleges (2008 onwards) > College of Engineering & Physical Sciences
School or Department: School of Computer Science
Funders: None/not applicable
Subjects: L Education > L Education (General)
L Education > LG Individual institutions (Asia. Africa)
P Language and Literature > PE English
Q Science > QA Mathematics > QA75 Electronic computers. Computer science
URI: http://etheses.bham.ac.uk/id/eprint/7260

Actions

Request a Correction Request a Correction
View Item View Item

Downloads

Downloads per month over past year