Mining Web Sites Using Unsupervised Adaptive Information Extraction

Ciravegna, Dr. Fabio and Dingli, Mr. Alexeie and Guthrie, Mr. David and Wilks, Prof. Yorick (2003) Mining Web Sites Using Unsupervised Adaptive Information Extraction. In Proceedings 10th Conference of the European Chapter of the Association for Computational Linguistics, Budapest, Hungary.

Full text available as:

PDF - Requires Adobe Acrobat Reader or other PDF viewer.

Adaptive Information Extraction systems (IES) are currently used by some Semantic Web (SW) annotation tools as support to annotation (Handschuh et al., 2002; Vargas-Vera et al., 2002). They are generally based on fully supervised methodologies requiring fairly intense domain-specific annotation. Unfortunately, selecting representative examples may be difficult and annotations can be incorrect and require time. In this paper we present a methodology that drastically reduce (or even remove) the amount of manual annotation required when annotating consistent sets of pages. A very limited number of user-defined examples are used to bootstrap learning. Simple, high precision (and possibly high recall) IE patterns are induced using such examples, these patterns will then discover more examples which will in turn discover more patterns, etc.

Keywords:	Knowledge acquisition, adaptive information extraction, semantic web,
Subjects:	AKT Challenges > Knowledge acquisition
ID Code:	171
Deposited By:	Brewster, Christopher
Deposited On:	28 March 2003

Contact the site administrator at: hg@ecs.soton.ac.uk


	AKT EPrint Archive
	AKT EPrints Home \|\| AKT Home \|\| About \|\| Browse Subjects \|\| Browse by Year