E-Book Overview
Pattern Recognition Algorithms for Data Mining covers data mining from a pattern recognition perspective. The book uses real-life data sets from domains such as geographic information systems, remote sensing imagery, and population census to demonstrate new methodologies. Classical approaches are covered along with granular computation that integrates fuzzy sets, artificial neural networks, and genetic algorithms for efficient knowledge discovery. The authors then compare the granular computing and rough-fuzzy approaches with the classical methods and show why the former are more efficient.
E-Book Content
Pattern Recognition Algorithms for Data Mining
Scalability, Knowledge Discovery and Soft Granular Computing

Sankar K. Pal and Pabitra Mitra
Machine Intelligence Unit
Indian Statistical Institute
Calcutta, India

CHAPMAN & HALL/CRC
A CRC Press Company
Boca Raton London New York Washington, D.C.

Cover art provided by Laura Bright (http://laurabright.com).
http://www.ciaadvertising.org/SA/sping_03/391K/lbright/paper/site/report/introduction.html

© 2004 by Taylor & Francis Group, LLC

Library of Congress Cataloging-in-Publication Data
Pal, Sankar K.
Pattern recognition algorithms for data mining : scalability, knowledge discovery, and soft granular computing / Sankar K. Pal and Pabitra Mitra.
p. cm.
Includes bibliographical references and index.
ISBN 1-58488-457-6 (alk. paper)
1. Data mining. 2. Pattern recognition systems. 3. Computer algorithms. 4. Granular computing / Sankar K. Pal and Pabitra Mitra.
QA76.9.D343P38 2004
006.3'12--dc22
2004043539

This book contains information obtained from authentic and highly regarded sources. Reprinted material is quoted with permission, and sources are indicated. A wide variety of references are listed. Reasonable efforts have been made to publish reliable data and information, but the author and the publisher cannot assume responsibility for the validity of all materials or for the consequences of their use.

Neither this book nor any part may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopying, microfilming, and recording, or by any information storage or retrieval system, without prior permission in writing from the publisher.

The consent of CRC Press LLC does not extend to copying for general distribution, for promotion, for creating new works, or for resale. Specific permission must be obtained in writing from CRC Press LLC for such copying.

Direct all inquiries to CRC Press LLC, 2000 N.W. Corporate Blvd., Boca Raton, Florida 33431.

Trademark Notice: Product or corporate names may be trademarks or registered trademarks, and are used only for identification and explanation, without intent to infringe.

Visit the CRC Press Web site at www.crcpress.com

© 2004 by CRC Press LLC
No claim to original U.S. Government works
International Standard Book Number 1-58488-457-6
Library of Congress Card Number 2004043539
Printed in the United States of America 1 2 3 4 5 6 7 8 9 0
Printed on acid-free paper

To our parents

Contents
Foreword
Preface
List of Tables
List of Figures
1 Introduction
1.1 Introduction
1.2 Pattern Recognition in Brief
1.2.1 Data acquisition
1.2.2 Feature selection/extraction
1.2.3 Classification
1.3 Knowledge Discovery in Databases (KDD)
1.4 Data Mining