About this paper

Appears in: EDULEARN18 Proceedings
Pages: 8417-8426
Publication year: 2018
ISBN: 978-84-09-02709-5
ISSN: 2340-1117
doi: 10.21125/edulearn.2018.1959

Conference name: 10th International Conference on Education and New Learning Technologies
Dates: 2-4 July, 2018
Location: Palma, Spain

PREDICTING THE CONFUSION LEVEL OF TEXT EXCERPTS WITH SYNTACTIC, LEXICAL AND N-GRAM FEATURES

T. Silva Pedro1, J. Luís Silva2, R. Pereira1

1ISCTE-IUL, ISTAR-IUL (PORTUGAL)
2ISCTE-IUL, ISTAR-IUL, Madeira-ITI (PORTUGAL)
Distance learning, offline presentations (pre-recorded rather than delivered live) and similar activities whose main goal is to convey information are becoming increasingly relevant with digital media such as Virtual Reality (VR) and Massive Open Online Courses (MOOCs). While MOOCs are well established in the learning environment, VR is also being used to promote learning in virtual rooms, both in academia and in industry. These methods often rely on written scripts that take the learner through the content, which makes the scripts critical components of these tools. Given this role, it is important to ensure the efficiency of these scripts.
Confusion is a non-basic emotion associated with learning. The learning process often leads to a cognitive disequilibrium caused either by the content itself or by the syntactic and lexical features of the way it is conveyed. We propose a supervised model that predicts how likely an input text excerpt is to confuse the learner. To this end, we performed syntactic and lexical analyses of 300 text excerpts and collected five confusion-level ratings (on a scale from 0 to 6) per excerpt from 51 annotators, using the mean of each excerpt's ratings as its label. The excerpts that compose the dataset were taken from randomly selected presentation transcripts across various fields of knowledge. The model was trained on this data, and the results are reported in the body of the paper.
This model supports the design of clearer scripts for offline presentations and similar formats, and we expect it to improve the efficiency of these speeches. Although the model is applied to this specific case, we hope to pave the way for generalizing the approach to other contexts where clarity of text is critical, such as MOOC scripts or academic abstracts.
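The labelling scheme described in the abstract (several 0 to 6 ratings per excerpt, averaged into one label) and the kind of lexical and n-gram features named in the title can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the abstract does not list the actual features, tokenizer or learning algorithm, so the function names, the three lexical features and the example excerpt below are all assumptions made for illustration.

```python
from collections import Counter
from statistics import mean

PUNCTUATION = ".,;:!?()\"'"


def mean_label(ratings):
    """Average one excerpt's ratings (each on the 0-6 scale) into a single label."""
    if not ratings:
        raise ValueError("need at least one rating")
    if any(not 0 <= r <= 6 for r in ratings):
        raise ValueError("ratings must lie in [0, 6]")
    return mean(ratings)


def tokenize(text):
    """Naive whitespace tokenizer (an assumption; the paper may use another)."""
    stripped = (word.strip(PUNCTUATION).lower() for word in text.split())
    return [token for token in stripped if token]


def lexical_features(tokens):
    """A few illustrative lexical features; the paper's real feature set is unknown."""
    n = len(tokens)
    return {
        "token_count": n,
        "mean_word_length": sum(len(t) for t in tokens) / n if n else 0.0,
        "type_token_ratio": len(set(tokens)) / n if n else 0.0,
    }


def ngram_counts(tokens, n=2):
    """Count contiguous n-grams (bigrams by default) in a token list."""
    return Counter(zip(*(tokens[i:] for i in range(n))))


# Hypothetical excerpt and ratings, made up for this example.
excerpt = "The cat sat. The cat ran!"
ratings = [2, 3, 3, 4, 3]

tokens = tokenize(excerpt)
example = {
    "label": mean_label(ratings),
    "lexical": lexical_features(tokens),
    "bigrams": ngram_counts(tokens, 2),
}
```

A feature dictionary of this shape, paired with the mean label, is the usual input to a supervised regressor; syntactic features such as parse depth would need a parser and are left out of the sketch.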
@InProceedings{SILVAPEDRO2018PRE,
author = {Silva Pedro, T. and Lu{\'{i}}s Silva, J. and Pereira, R.},
title = {PREDICTING THE CONFUSION LEVEL OF TEXT EXCERPTS WITH SYNTACTIC, LEXICAL AND N-GRAM FEATURES},
series = {10th International Conference on Education and New Learning Technologies},
booktitle = {EDULEARN18 Proceedings},
isbn = {978-84-09-02709-5},
issn = {2340-1117},
doi = {10.21125/edulearn.2018.1959},
url = {http://dx.doi.org/10.21125/edulearn.2018.1959},
publisher = {IATED},
location = {Palma, Spain},
month = {2-4 July, 2018},
year = {2018},
pages = {8417-8426}}
TY - CONF
AU - T. Silva Pedro
AU - J. Luís Silva
AU - R. Pereira
TI - PREDICTING THE CONFUSION LEVEL OF TEXT EXCERPTS WITH SYNTACTIC, LEXICAL AND N-GRAM FEATURES
SN - 978-84-09-02709-5/2340-1117
DO - 10.21125/edulearn.2018.1959
PY - 2018
Y1 - 2-4 July, 2018
CI - Palma, Spain
JO - 10th International Conference on Education and New Learning Technologies
JA - EDULEARN18 Proceedings
SP - 8417
EP - 8426
ER -
T. Silva Pedro, J. Luís Silva, R. Pereira (2018) PREDICTING THE CONFUSION LEVEL OF TEXT EXCERPTS WITH SYNTACTIC, LEXICAL AND N-GRAM FEATURES, EDULEARN18 Proceedings, pp. 8417-8426.