Islamic State Battle Plan: Press Release Natural Language Processing   [open pdf - 1MB]

From the thesis abstract: "The purpose of this study is to develop methods to accelerate and enhance the analysis of Islamic State Movement text documents. We analyze a unique database collected by Dr. Craig Whiteside, which is comprised of nearly 3,000 open-source translated press releases from 2003-2014. Using Natural Language Processing tools, the text data is aggregated into a corpus and processed based on document term structure and frequency. In order to reduce analyst workload, we validate Whiteside's manual analysis and construct cross-validated generalized linear models to automatically classify documents into one of seven types. A cascade classification model outperforms all other models with a mean cross-validated misclassification rate of 5.71 percent. Islamic State Movement operational summaries are classified as type 'Celebrate.' We develop a layered algorithm based on regular expressions and location searches to extract critical information from each attack event and display the details on a map using a web-based interactive R Shiny application. With the ability to automatically classify Islamic State Movement text documents and visually interact with the data contained within those classified as type 'Celebrate,' analysts and decision makers are able to process and understand large amounts of text data more quickly and effectively."

Public Domain
Retrieved From:
Naval Postgraduate School, Dudley Knox Library: http://www.nps.edu/Library/index.aspx
Media Type:
Help with citations