Carlos Martin
Real-time topic detection with bursty n-grams: RGU's submission to the 2014 SNOW challenge.
Martin, Carlos; Goker, Ayse
Authors
Ayse Goker
Abstract
Twitter is becoming an ever more popular platform for discovering and sharing information about current events, both personal and global. The scale and diversity of messages makes the discovery and analysis of breaking news very challenging. Nonetheless, journalists and other news consumers are increasingly relying on tools to help them make sense of Twitter. Here, we describe a fully-automated system capable of detecting trends related to breaking news in real-time. It identifies words or phrases that `burst' with sudden increased frequencies, and groups these into topics. It identifies a diverse set of recent tweets that are related to these topics, and uses these to create a suitable human-readable headline. In addition, images coming from the diverse tweets are also added to the topic. Our system was evaluated using 24 hours of tweets as part of the Social News On the Web (SNOW) 2014 data challenge.
Citation
MARTIN, C. and GOKER, A. 2014. Real-time topic detection with bursty n-grams: RGU's submission to the 2014 SNOW challenge. In Proceedings of the 2014 Social news on the web data challenge (SNOW-DC 2014), 8th April 2014, Seoul, Korea. New Haven: CEUR workshop proceedings [online], pages 9-16. Available from http://ceur-ws.org/Vol-1150/martin.pdf
Conference Name | 2014 Social news on the web data challenge (SNOW-DC 2014) |
---|---|
Conference Location | Seoul, Korea |
Start Date | Apr 8, 2014 |
End Date | Apr 8, 2014 |
Acceptance Date | Apr 8, 2014 |
Online Publication Date | Apr 8, 2014 |
Publication Date | Apr 28, 2014 |
Deposit Date | Sep 2, 2016 |
Publicly Available Date | Sep 2, 2016 |
Print ISSN | 1613-0073 |
Publisher | CEUR Workshop Proceedings |
Pages | 9-16 |
Series Title | CEUR workshop proceedings |
Series ISSN | 1613-0073 |
Keywords | Twitter; Current events; News; Social networking; Journalists; Breaking news; Bursts |
Public URL | http://hdl.handle.net/10059/1605 |
Publisher URL | http://ceur-ws.org/Vol-1150/martin.pdf |
Files
MARTIN 2014 Real-time topic detection with bursty
(850 Kb)
PDF
Publisher Licence URL
https://creativecommons.org/licenses/by-nc-nd/4.0/
You might also like
Analysing creative image search information needs.
(2015)
Conference Proceeding
Twitter response to televised political debates in Election 2015.
(2015)
Book Chapter
Early fusion and query modification in their dual late fusion forms.
(2015)
Journal Article