AKT EPrint Archive

Learning Information Extraction Rules: An Inductive Logic Programming approach

Aitken, J.S. (2002) Learning Information Extraction Rules: An Inductive Logic Programming approach. In van Harmelen, Prof Frank, Eds. Proceedings 15th European Conference on Artificial Intelligence, pages pp. 355-359, Lyon, France.

Full text available as:

PDF - Requires Adobe Acrobat Reader or other PDF viewer.

The objective of this work is to learn information extraction rules by applying Inductive Logic Programming (ILP) techniques to natural language data. The approach is ontology-based, which means that the extraction rules conclude with specific ontology relations that characterise the meaning of sentences in the text. An existing ILP system, FOIL, is used to learn attribute-value relations. This enables instances of these relations to be identified in the text. In specific, we explore the linguistic preprocessing of the data, the use of background knowledge in the learning process, and the practical considerations of applying a supervised learning approach to rule induction, i.e. in terms of the human effort in creating the data set, and in the inherent biases in the use of small data sets.

Keywords:information extraction ontology
Subjects:AKT Challenges > Knowledge acquisition
ID Code:355
Deposited By:Aitken, Dr S
Deposited On:08 July 2004
Alternative Locations:http://www.aiai.ed.ac.uk/~stuart/Papers/ecai02-paper.pdf

Contact the site administrator at: hg@ecs.soton.ac.uk