Author: Keller, Mikaela; Freifeld, Clark C; Brownstein, John S
Title: Automated vocabulary discovery for geo-parsing online epidemic intelligence Cord-id: f4q1r3vu Document date: 2009_11_24
ID: f4q1r3vu
Snippet: BACKGROUND: Automated surveillance of the Internet provides a timely and sensitive method for alerting on global emerging infectious disease threats. HealthMap is part of a new generation of online systems designed to monitor and visualize, on a real-time basis, disease outbreak alerts as reported by online news media and public health sources. HealthMap is of specific interest for national and international public health organizations and international travelers. A particular task that makes su
Document: BACKGROUND: Automated surveillance of the Internet provides a timely and sensitive method for alerting on global emerging infectious disease threats. HealthMap is part of a new generation of online systems designed to monitor and visualize, on a real-time basis, disease outbreak alerts as reported by online news media and public health sources. HealthMap is of specific interest for national and international public health organizations and international travelers. A particular task that makes such a surveillance useful is the automated discovery of the geographic references contained in the retrieved outbreak alerts. This task is sometimes referred to as "geo-parsing". A typical approach to geo-parsing would demand an expensive training corpus of alerts manually tagged by a human. RESULTS: Given that human readers perform this kind of task by using both their lexical and contextual knowledge, we developed an approach which relies on a relatively small expert-built gazetteer, thus limiting the need of human input, but focuses on learning the context in which geographic references appear. We show in a set of experiments, that this approach exhibits a substantial capacity to discover geographic locations outside of its initial lexicon. CONCLUSION: The results of this analysis provide a framework for future automated global surveillance efforts that reduce manual input and improve timeliness of reporting.
Search related documents:
Co phrase search for related documents- actual outbreak and acute respiratory syndrome: 1, 2
- actual outbreak and log likelihood: 1
- acute respiratory syndrome and additional information: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25
- acute respiratory syndrome and location consider: 1
- acute respiratory syndrome and location name: 1
- acute respiratory syndrome and location pattern: 1
- acute respiratory syndrome and location reference: 1
Co phrase search for related documents, hyperlinks ordered by date