TV newscasts report about the latest event-related facts occurring in the world. Relying exclusively on them is, however, insufficient to fully grasp the context of the story being reported. In this paper, we propose an approach that retrieves and analyzes related documents from the Web to automatically generate semantic annotations that provide viewers and experts comprehensive information about the news. We detect named entities in the retrieved documents that further disclose relevant concepts that were not explicitly mentioned in the original newscast. A ranking algorithm based on entity frequency, popularity peak analysis, and domain experts’ rules sorts those annotations to generate what we call Semantic Snapshot of a Newscast (NSS). We benchmark this method against a gold standard generated by domain experts and assessed via a user survey over five BBC newscasts. Results of the experiments show the robustness of our approach holding an Average Normalized Discounted Cumulative Gain of 66.6%.

doi.org/10.1007/978-3-319-19890-3_26
International Conference on Web Engineering
Human-Centered Data Analytics

Redondo Garcia, J. L., Rizzo, G. (Giuseppe), Pérez Romero, L., Hildebrand, M., & Troncy, R. (2015). Generating semantic snapshots of newscasts using entity expansion. doi:10.1007/978-3-319-19890-3_26