<?xml version="1.0" encoding="UTF-8"?>
<XML><RECORDS>
<RECORD>
	<REFERENCE_TYPE>3</REFERENCE_TYPE>
	<AUTHORS>
		<AUTHOR>Giorgio Fumera</AUTHOR>
		<AUTHOR>Ignazio Pillai</AUTHOR>
		<AUTHOR>Fabio Roli</AUTHOR>
	</AUTHORS>
	<YEAR>2004</YEAR>
	<TITLE>A Two-Stage Classifier with Reject Option for Text Categorisation</TITLE>
	<SECONDARY_TITLE>5th Int. Workshop on Statistical Techniques in Pattern Recognition (SPR 2004)</SECONDARY_TITLE>
	<PLACE_PUBLISHED>Lisbon, Portugal</PLACE_PUBLISHED>
	<PUBLISHER>Springer</PUBLISHER>
	<VOLUME>3138</VOLUME>
	<PAGES>771-779</PAGES>
	<DATE>18/08/2004</DATE>
	<KEYWORDS>
		<KEYWORD>document categorisation</KEYWORD>
		<KEYWORD>text categorisation</KEYWORD>
		<KEYWORD>classification reliability</KEYWORD>
		<KEYWORD>reject option</KEYWORD>
		<KEYWORD>rej00</KEYWORD>
		<KEYWORD>doc01</KEYWORD>
		<KEYWORD>doc00</KEYWORD>
	</KEYWORDS>
	<ABSTRACT>Abstract. In this paper, we investigate the usefulness of the reject option in text categorisation systems. The reject option is introduced by allowing a text classi&iuml;&not;er to withhold the decision of assigning or not a document to any subset of categories, for which the decision is considered not su&iuml;&not;ciently reliable. To automatically handle rejections, a two-stage classi&iuml;&not;er architecture is used, in which documents rejected at the &iuml;&not;rst stage are automatically classi&iuml;&not;ed at the second stage, so that no rejections eventually remain. The performance improvement achievable by using the reject option is assessed on a real text categorisation task, using the well known Reuters data set.
</ABSTRACT>
</RECORD>
</RECORDS></XML>
