Objective: Stress at work is a significant occupational health concern. Recent studies have used various sensing modalities to model stress behaviour based on non-obtrusive data obtained from smartphones. However, when the data for a subject is scarce it becomes a challenge to obtain a good model. Methods: We propose an approach based on a combination of techniques: semi-supervised learning, ensemble methods and transfer learning to build a model of a subject with scarce data. Our approach is based on the comparison of decision trees to select the closest subject for knowledge transfer. Results: We present a real-life, unconstrained study carried out with 30 employees within two organisations. The results show that using information (instances or model) from similar subjects can improve the accuracy of the subjects with scarce data. However, using transfer learning from dissimilar subjects can have a detrimental effect on the accuracy. Our proposed ensemble approach increased the accuracy by ≈10% to 71.58% compared to not using any transfer learning technique. Conclusions: In contrast to high precision but highly obtrusive sensors, using smartphone sensors for measuring daily behaviours allowed us to quantify behaviour changes, relevant to occupational stress. Furthermore, we have shown that use of transfer learning to select data from close models is a useful approach to improve accuracy in presence of scarce data.
Additional Metadata
Keywords Stress modelling, Transfer learning, Semi-supervised learning, Ensemble methods
THEME Data (theme 8)
Persistent URL dx.doi.org/10.1016/j.jbi.2016.08.023
Journal Journal of Biomedical Informatics
Project Demand response for grid-friendly quasi-autarkic energy cooperatives
Grant This work was funded by the The Netherlands Organisation for Scientific Research (NWO); grant id nwo/651.001.003 - Demand response for grid-friendly quasi-autarkic energy cooperatives
Citation
Maxhuni, A, Hernandez-Leal, P, Sucar, L.E, Osmani, V, Morales, E.F, & Mayora, O. (2016). Stress modelling and prediction in presence of scarce data. Journal of Biomedical Informatics, 63, 344–356. doi:10.1016/j.jbi.2016.08.023