Carlos Martin
Real-time topic detection with bursty n-grams: RGU's submission to the 2014 SNOW challenge.
Martin, Carlos; Goker, Ayse
Authors
Ayse Goker
Abstract
Twitter is becoming an ever more popular platform for discovering and sharing information about current events, both personal and global. The scale and diversity of messages makes the discovery and analysis of breaking news very challenging. Nonetheless, journalists and other news consumers are increasingly relying on tools to help them make sense of Twitter. Here, we describe a fully-automated system capable of detecting trends related to breaking news in real-time. It identifies words or phrases that `burst' with sudden increased frequencies, and groups these into topics. It identifies a diverse set of recent tweets that are related to these topics, and uses these to create a suitable human-readable headline. In addition, images coming from the diverse tweets are also added to the topic. Our system was evaluated using 24 hours of tweets as part of the Social News On the Web (SNOW) 2014 data challenge.
Citation
MARTIN, C. and GOKER, A. 2014. Real-time topic detection with bursty n-grams: RGU's submission to the 2014 SNOW challenge. In Proceedings of the 2014 Social news on the web data challenge (SNOW-DC 2014), 8th April 2014, Seoul, Korea. CEUR workshop proceedings, 1150. Aachen: CEUR-WS [online], pages 9-16. Available from http://ceur-ws.org/Vol-1150/martin.pdf
Presentation Conference Type | Conference Paper (published) |
---|---|
Conference Name | 2014 Social news on the web data challenge (SNOW-DC 2014) |
Start Date | Apr 8, 2014 |
End Date | Apr 8, 2014 |
Acceptance Date | Apr 8, 2014 |
Online Publication Date | Apr 8, 2014 |
Publication Date | Apr 28, 2014 |
Deposit Date | Sep 2, 2016 |
Publicly Available Date | Sep 2, 2016 |
Print ISSN | 1613-0073 |
Publisher | CEUR-WS |
Peer Reviewed | Peer Reviewed |
Pages | 9-16 |
Series Title | CEUR workshop proceedings |
Series Number | 1150 |
Series ISSN | 1613-0073 |
Keywords | Twitter; Current events; News; Social networking; Journalists; Breaking news; Bursts |
Public URL | http://hdl.handle.net/10059/1605 |
Publisher URL | http://ceur-ws.org/Vol-1150/martin.pdf |
Contract Date | Sep 2, 2016 |
Files
MARTIN 2014 Real-time topic detection with bursty (v3)
(385 Kb)
PDF
Publisher Licence URL
https://creativecommons.org/licenses/by-nc-nd/4.0/
Downloadable Citations
About OpenAIR@RGU
Administrator e-mail: publications@rgu.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search