Utilizing a transparency-driven environment toward trusted automatic genre classification: A case study in journalism history

Bilgin, Aysenur; Hollink, Laura; van Ossenbruggen, Jacco; Tjong Kim Sang, Erik; Smeenk, Kim; Harbers, Frank; Broersma, Marcel

doi:10.1109/eScience.2018.00137

A. Bilgin (Aysenur), L. Hollink (Laura), J.R. van Ossenbruggen (Jacco), E. Tjong Kim Sang (Erik), K. Smeenk (Kim), F. Harbers (Frank) and M. Broersma (Marcel)

2018-10-29

Utilizing a transparency-driven environment toward trusted automatic genre classification: A case study in journalism history

Presented at the IEEE International Conference on e-Science (October 2018), Amsterdam, the Netherlands

With the growing abundance of unlabeled data in real-world tasks, researchers have to rely on the predictions given by black-boxed computational models. However, it is an often neglected fact that these models may be scoring high on accuracy for the wrong reasons. In this paper, we present a practical impact analysis of enabling model transparency by various presentation forms. For this purpose, we developed an environment that empowers non-computer scientists to become practicing data scientists in their own research field. We demonstrate the gradually increasing understanding of journalism historians through a real-world use case study on automatic genre classification of newspaper articles. This study is a first step towards trusted usage of machine learning pipelines in a responsible way.

Additional Metadata
Keywords	Genre classification, Journalism history, Machine learning, Transparency
Persistent URL	doi.org/10.1109/eScience.2018.00137
Project	News Genres: Advancing Media History by Transparent Automatic Genre Classification
Conference	IEEE International Conference on e-Science
Organisation	Human-Centered Data Analytics
Citation APA Style AAA Style APA Style Cell Style Chicago Style Harvard Style IEEE Style MLA Style Nature Style Vancouver Style American-Institute-of-Physics Style Council-of-Science-Editors Style BibTex Format Endnote Format RIS Format CSL Format DOIs only Format	Bilgin, A., Hollink, L., van Ossenbruggen, J., Tjong Kim Sang, E., Smeenk, K., Harbers, F., & Broersma, M. (2018). Utilizing a transparency-driven environment toward trusted automatic genre classification: A case study in journalism history. In IEEE 14th International Conference on eScience, e-Science 2018 (pp. 486–496). doi:10.1109/eScience.2018.00137

View at Publisher

Full Text ( Author Manuscript , 853kb )

See Also
presentation Towards Transparent Linguistic Analysis of Dutch Newspaper Article Genres using Machine Learning A. Bilgin (Aysenur), E. Tjong Kim Sang (Erik), K. Smeenk (Kim), T. Klaver (Tom), L. Hollink (Laura), J.R. van Ossenbruggen (Jacco), F. Harbers (Frank) and M. Broersma (Marcel)
techReport Utilizing a transparency-driven environment toward trusted automatic genre classification: A case study in journalism history A. Bilgin (Aysenur), L. Hollink (Laura), J.R. van Ossenbruggen (Jacco), E. Tjong Kim Sang (Erik), K. Smeenk (Kim), F. Harbers (Frank) and M. Broersma (Marcel)

Utilizing a transparency-driven environment toward trusted automatic genre classification: A case study in journalism history

Publication

Publication

presentation
Towards Transparent Linguistic Analysis of Dutch Newspaper Article Genres using Machine Learning

techReport
Utilizing a transparency-driven environment toward trusted automatic genre classification: A case study in journalism history

Address

CWI researchers

Questions or comments?

Utilizing a transparency-driven environment toward trusted automatic genre classification: A case study in journalism history

Publication

Publication

presentation Towards Transparent Linguistic Analysis of Dutch Newspaper Article Genres using Machine Learning

techReport Utilizing a transparency-driven environment toward trusted automatic genre classification: A case study in journalism history

Workflow

Workflow

Add Content

presentation
Towards Transparent Linguistic Analysis of Dutch Newspaper Article Genres using Machine Learning

techReport
Utilizing a transparency-driven environment toward trusted automatic genre classification: A case study in journalism history