TY - GEN
T1 - Story Link Detection in Turkish corpus
AU - Köse, Güven
AU - Tonta, Yaşar
AU - Ahmadlouei, Hamid
AU - Polatkan, Aydin Can
PY - 2013
Y1 - 2013
N2 - Story Link Detection (SLD) is a sub-task of Topic Detection and Tracking (TDT). SLD aims to determine whether two randomly selected stories discuss the same topic. This sub-task has drawn special attention within the TDT research community because many TDT tasks are thought to be solvable automatically once SLD performs as expected. In this study, performance tests were carried out on the BilCol-2005 Turkish news corpus, composed of approximately 209,000 news items, using the vector space model (VSM) and relevance model (RM) methods with varying index term counts. The best results were as follows: the VSM method performed best with 30 terms (F-measure=0.2970), while the RM method performed best with 4 terms (F-measure=0.1910). Furthermore, combining the two methods using the AND and OR functions increased the precision ratio by 7.9% and the recall ratio by 1.2%, respectively, indicating that the retrieval performance of SLD algorithms can be improved to some extent by employing both VSM and RM.
AB - Story Link Detection (SLD) is a sub-task of Topic Detection and Tracking (TDT). SLD aims to determine whether two randomly selected stories discuss the same topic. This sub-task has drawn special attention within the TDT research community because many TDT tasks are thought to be solvable automatically once SLD performs as expected. In this study, performance tests were carried out on the BilCol-2005 Turkish news corpus, composed of approximately 209,000 news items, using the vector space model (VSM) and relevance model (RM) methods with varying index term counts. The best results were as follows: the VSM method performed best with 30 terms (F-measure=0.2970), while the RM method performed best with 4 terms (F-measure=0.1910). Furthermore, combining the two methods using the AND and OR functions increased the precision ratio by 7.9% and the recall ratio by 1.2%, respectively, indicating that the retrieval performance of SLD algorithms can be improved to some extent by employing both VSM and RM.
KW - Relevance model
KW - Story Link Detection
KW - Topic detection and tracking
KW - Vector space model
UR - https://www.scopus.com/pages/publications/84893241689
U2 - 10.1109/WI-IAT.2013.23
DO - 10.1109/WI-IAT.2013.23
M3 - Conference contribution
AN - SCOPUS:84893241689
SN - 9781479929023
T3 - Proceedings - 2013 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2013
SP - 154
EP - 158
BT - Proceedings - 2013 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2013
T2 - 2013 12th IEEE/WIC/ACM International Conference on Web Intelligence, WI 2013
Y2 - 17 November 2013 through 20 November 2013
ER -