Evolving Lucene search queries for text classification

HIRSCH, Laurence, SAEEDI, M and HIRSCH, R (2007). Evolving Lucene search queries for text classification. In: Genetic and Evolutionary Computation Conference, London, 7-11 July.

[img]
Preview
PDF
pap206t2-hirsch.pdf

Download (240kB) | Preview
Official URL: http://dl.acm.org/citation.cfm?doid=1276958.127727...

Abstract

We describe a method for generating accurate, compact, human understandable text classifiers. Text datasets are indexed using Apache Lucene and Genetic Programs are used to construct Lucene search queries. Genetic programs acquire fitness by producing queries that are effective binary classifiers for a particular category when evaluated against a set of training documents. We describe a set of functions and terminals and provide results from classification tasks.

Item Type: Conference or Workshop Item (Paper)
Research Institute, Centre or Group: Cultural Communication and Computing Research Institute > Communication and Computing Research Centre
Depositing User: Laurence Hirsch
Date Deposited: 25 Feb 2013 16:29
Last Modified: 09 Nov 2016 22:44
URI: http://shura.shu.ac.uk/id/eprint/6624

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year

View more statistics