About this paper

Appears in:
Pages: 2785-2793
Publication year: 2014
ISBN: 978-84-617-2484-0
ISSN: 2340-1095

Conference name: 7th International Conference of Education, Research and Innovation
Dates: 17-19 November, 2014
Location: Seville, Spain

TRAINING IN MULTIVARIATE DATA ANALYSIS IN THE MASTER OF HUMAN EVOLUTION

A. Herrejón1, M. Martín1, O. Prado1, Y. Quintino1, M.A. Vidal1, J.M. Carretero1, M.C. Ortiz2, L.A. Sarabia3

1University of Burgos, Laboratory of Human Evolution (SPAIN)
2University of Burgos, Faculty of Sciences (SPAIN)
3University of Burgos, Department. Mathematics and Computation (SPAIN)
The curricula defined by the University of Burgos in the Master of Human Evolution establish generic skills as criticism, analysis, synthesis, problem-solving, independent reading and project work and other specific skills as handling and processing data [1]. Visualization and data analysis are central in modern anthropology for analysis of thousands of data. For several years, our postgraduate students in the Human Evolution Master are learning the skills, mentioned above, dealing with methodology of statistical techniques in the subject “Advanced methods for data analysis” .

The statistical procedures developed in the course are:
i) Univariate analysis
ii) Distribution fitting to data
iii) Outlier detection;
iv) Hypothesis test;
v) Principal Component Analysis;
vi) Multivariate regression.

The tools, summarized above, are used to evaluate the sexual dimorphism by means of craniometric data downloaded from reference [1]. All students made a practice that consists in building a rule of decision that uses 21 craniometric variables from five populations (BERG, PERU, EGYPT, BUSHMAN and ZULU) to solve the problem of sexual dimorphism. Each student makes a rule of decision for every population which implies to work with two- thousands of data for each case.

By means of a regression on principal components the distribution of probability that corresponds to the null hypothesis H0 (the individual is a female) versus the alternative Ha (the individual is a male). After both probabilities of error type I (to affirm that the individual is a male when is actually a female) and type II (to affirm that the individual is a female when is actually a male) are evaluated. Finally the operative curve of this test, that describes graphically the goodness of the decision rule, is built.

The practice demands to combine the six statistical tools mentioned to answer a question posed in the common terms of anthropology. Thus the students understand that a complex question: “to decide what the sex of an individual in a concrete population is” is solved with multivariate indirect information (21 craniometric variables) taking into account the uncertainty that accompanies these determinations. The practice concludes with a public oral exhibition followed by a debate with all students and the teachers.

Every step will be argued according to the mutual relations between variables that are different in every population therefore every statistical procedure is not possible to be used in an automatic way. In addition, the ratio individual/variables (for male or female) is low in every population with the added problem of colinearity in the craniometric variables. So a projection on the principal components as previous step for to reduce the dimension of raw data without significant loss of information, is demanded.

References:
[1] http://www.ubu.es/titulaciones/es/master_evolucion/informacion-academica/objetivos-competencias/competencias
[2] W.W. Howells' Craniometric Data set. Website: http://konig.la.utk.edu/howells.htm.
@InProceedings{HERREJON2014TRA,
author = {Herrej{\'{o}}n, A. and Mart{\'{i}}n, M. and Prado, O. and Quintino, Y. and Vidal, M.A. and Carretero, J.M. and Ortiz, M.C. and Sarabia, L.A.},
title = {TRAINING IN MULTIVARIATE DATA ANALYSIS IN THE MASTER OF HUMAN EVOLUTION},
series = {7th International Conference of Education, Research and Innovation},
booktitle = {ICERI2014 Proceedings},
isbn = {978-84-617-2484-0},
issn = {2340-1095},
publisher = {IATED},
location = {Seville, Spain},
month = {17-19 November, 2014},
year = {2014},
pages = {2785-2793}}
TY - CONF
AU - A. Herrejón AU - M. Martín AU - O. Prado AU - Y. Quintino AU - M.A. Vidal AU - J.M. Carretero AU - M.C. Ortiz AU - L.A. Sarabia
TI - TRAINING IN MULTIVARIATE DATA ANALYSIS IN THE MASTER OF HUMAN EVOLUTION
SN - 978-84-617-2484-0/2340-1095
PY - 2014
Y1 - 17-19 November, 2014
CI - Seville, Spain
JO - 7th International Conference of Education, Research and Innovation
JA - ICERI2014 Proceedings
SP - 2785
EP - 2793
ER -
A. Herrejón, M. Martín, O. Prado, Y. Quintino, M.A. Vidal, J.M. Carretero, M.C. Ortiz, L.A. Sarabia (2014) TRAINING IN MULTIVARIATE DATA ANALYSIS IN THE MASTER OF HUMAN EVOLUTION, ICERI2014 Proceedings, pp. 2785-2793.
User:
Pass: