Visualising text co-occurrence networks

HIRSCH, Laurie and ANDREWS, Simon (2016). Visualising text co-occurrence networks. CEUR Workshop Proceedings, 1637, 19-27.

hirsch_visualisingText.pdf - Accepted Version

Download (599kB) | Preview
[img] PDF (Acceptance email)
Hirsch 12832.pdf - Other
Restricted to Repository staff only

Download (97kB) | Contact the author
Official URL:


We present a tool for automatically generating a visual summary of unstructured text data retrieved from documents, web sites or social media feeds. Unlike tools such as word clouds, we are able to visualise structures and topic relationships occurring in a document. These relationships are determined by a unique approach to co-occurrence analysis. The algorithm applies a decaying function to the distance between word pairs found in the original text such that words regularly occurring close to each other score highly, but even words occurring some distance apart will make a small contribution to the overall co-occurrence score. This is in contrast to other algorithms which simply count adjacent words or use a sliding window of fixed size. We show, with examples, how the network generated can be presented in tree or graph format. The tree format allows for the user to interact with the visualisation and expand or contract the data to a preferred level of detail. The tool is available as a web application and can be viewed using any modern web browser

Item Type: Article
Uncontrolled Keywords: text visualisation
Research Institute, Centre or Group - Does NOT include content added after October 2018: Cultural Communication and Computing Research Institute > Communication and Computing Research Centre
Departments - Does NOT include content added after October 2018: Faculty of Science, Technology and Arts > Department of Computing
Page Range: 19-27
Depositing User: Laurence Hirsch
Date Deposited: 29 Jul 2016 11:51
Last Modified: 18 Mar 2021 06:54

Actions (login required)

View Item View Item


Downloads per month over past year

View more statistics