HIRSCH, Laurence, SAEEDI, M and HIRSCH, R (2007). Evolving Lucene search queries for text classification. In: Genetic and Evolutionary Computation Conference, London, 7-11 July.
We describe a method for generating accurate, compact, human-understandable text classifiers. Text datasets are indexed using Apache Lucene, and genetic programs are used to construct Lucene search queries. A genetic program acquires fitness by producing queries that act as effective binary classifiers for a particular category when evaluated against a set of training documents. We describe a set of functions and terminals and provide results from classification tasks.
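As a rough illustration of the idea of scoring a search query as a binary classifier, the sketch below evaluates a Boolean query against a small labelled training set and returns a fitness value. It does not use Lucene and is not the authors' implementation: the tuple-based query representation, the names `matches` and `f1_fitness`, the toy documents, and the choice of F1 as the fitness measure are all assumptions made for this example (the abstract does not state which measure is used).

```python
from typing import List, Set

# Hypothetical query representation (an assumption for this sketch):
#   ("TERM", word), ("NOT", q), ("AND", q1, q2), ("OR", q1, q2)
Query = tuple


def matches(query: Query, doc_terms: Set[str]) -> bool:
    """Return True if the Boolean query matches a document's term set."""
    op = query[0]
    if op == "TERM":
        return query[1] in doc_terms
    if op == "NOT":
        return not matches(query[1], doc_terms)
    if op == "AND":
        return matches(query[1], doc_terms) and matches(query[2], doc_terms)
    if op == "OR":
        return matches(query[1], doc_terms) or matches(query[2], doc_terms)
    raise ValueError(f"unknown operator: {op!r}")


def f1_fitness(query: Query, docs: List[Set[str]], labels: List[bool]) -> float:
    """Score a query as a binary classifier for one category.

    A document is predicted positive when the query matches it.
    F1 is assumed here as the fitness measure.
    """
    tp = fp = fn = 0
    for doc_terms, is_positive in zip(docs, labels):
        hit = matches(query, doc_terms)
        if hit and is_positive:
            tp += 1
        elif hit:
            fp += 1
        elif is_positive:
            fn += 1
    if tp == 0:
        return 0.0  # no true positives: precision and recall are both zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy training set (invented for illustration): True marks the target category.
docs = [
    {"wheat", "grain", "export"},
    {"wheat", "price"},
    {"oil", "price"},
    {"grain", "harvest"},
]
labels = [True, True, False, False]

candidate = ("AND", ("TERM", "wheat"), ("NOT", ("TERM", "oil")))
print(f1_fitness(candidate, docs, labels))
```

In an evolutionary setting, a score of this kind would be computed for every candidate query in the population, and higher-scoring queries would be favoured when selecting parents for the next generation.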
|Item Type:||Conference or Workshop Item (Paper)|
|Research Institute, Centre or Group:||Cultural Communication and Computing Research Institute > Communication and Computing Research Centre|
|Depositing User:||Laurence Hirsch|
|Date Deposited:||25 Feb 2013 16:29|
|Last Modified:||24 Aug 2015 08:11|