Meta-Algorithmics_ Patterns for Robust, Low-Cost, High-Quality Systems [Simske 2013-07-29].pdf

(5725 KB) Pobierz
META-ALGORITHMICS
META-ALGORITHMICS
PATTERNS FOR ROBUST, LOW-COST,
HIGH-QUALITY SYSTEMS
Steven J. Simske
HP Labs, Colorado, USA
C
2013 John Wiley & Sons, Ltd
Registered office
John Wiley & Sons Ltd, The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ, United Kingdom
For details of our global editorial offices, for customer services and for information about how to apply for
permission to reuse the copyright material in this book please see our website at www.wiley.com.
The right of the author to be identified as the author of this work has been asserted in accordance with the Copyright,
Designs and Patents Act 1988.
All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any
form or by any means, electronic, mechanical, photocopying, recording or otherwise, except as permitted by the UK
Copyright, Designs and Patents Act 1988, without the prior permission of the publisher.
Wiley also publishes its books in a variety of electronic formats. Some content that appears in print may not be
available in electronic books.
Designations used by companies to distinguish their products are often claimed as trademarks. All brand names and
product names used in this book are trade names, service marks, trademarks or registered trademarks of their
respective owners. The publisher is not associated with any product or vendor mentioned in this book.
Limit of Liability/Disclaimer of Warranty: While the publisher and author have used their best efforts in preparing
this book, they make no representations or warranties with respect to the accuracy or completeness of the contents of
this book and specifically disclaim any implied warranties of merchantability or fitness for a particular purpose. It is
sold on the understanding that the publisher is not engaged in rendering professional services and neither the
publisher nor the author shall be liable for damages arising herefrom. If professional advice or other expert
assistance is required, the services of a competent professional should be sought.
Library of Congress Cataloging-in-Publication Data
Simske, Steven J.
Meta-algorithmics : patterns for robust, low-cost, high-quality systems / Dr. Steven J. Simske,
Hewlett-Packard Labs.
pages cm
ISBN 978-1-118-34336-4 (hardback)
1. Computer algorithms. 2. Parallel algorithms. 3. Heuristic programming. 4. Computer systems–Costs.
5. Computer systems–Quality control. I. Title.
QA76.9.A43S543 2013
005.1–dc23
2013004488
A catalogue record for this book is available from the British Library.
ISBN: 9781118343364
Typeset in 10/12pt Times by Aptara Inc., New Delhi, India
Contents
Acknowledgments
1
1.1
1.2
1.3
1.4
1.5
1.6
Introduction and Overview
Introduction
Why Is This Book Important?
Organization of the Book
Informatics
Ensemble Learning
Machine Learning/Intelligence
1.6.1
Regression and Entropy
1.6.2
SVMs and Kernels
1.6.3
Probability
1.6.4
Unsupervised Learning
1.6.5
Dimensionality Reduction
1.6.6
Optimization and Search
Artificial Intelligence
1.7.1
Neural Networks
1.7.2
Genetic Algorithms
1.7.3
Markov Models
Data Mining/Knowledge Discovery
Classification
Recognition
System-Based Analysis
Summary
References
Parallel Forms of Parallelism
Introduction
Parallelism by Task
2.2.1
Definition
2.2.2
Application to Algorithms and Architectures
2.2.3
Application to Scheduling
Parallelism by Component
2.3.1
Definition and Extension to Parallel-Conditional Processing
xi
1
1
2
3
4
6
7
8
9
15
17
18
20
22
22
25
28
31
32
38
39
39
40
42
42
43
43
46
51
52
52
1.7
1.8
1.9
1.10
1.11
1.12
2
2.1
2.2
2.3
vi
Contents
2.4
2.5
2.3.2
Application to Data Mining, Search, and Other Algorithms
2.3.3
Application to Software Development
Parallelism by Meta-algorithm
2.4.1
Meta-algorithmics and Algorithms
2.4.2
Meta-algorithmics and Systems
2.4.3
Meta-algorithmics and Parallel Processing
2.4.4
Meta-algorithmics and Data Collection
2.4.5
Meta-algorithmics and Software Development
Summary
References
Domain Areas: Where Are These Relevant?
Introduction
Overview of the Domains
Primary Domains
3.3.1
Document Understanding
3.3.2
Image Understanding
3.3.3
Biometrics
3.3.4
Security Printing
Secondary Domains
3.4.1
Image Segmentation
3.4.2
Speech Recognition
3.4.3
Medical Signal Processing
3.4.4
Medical Imaging
3.4.5
Natural Language Processing
3.4.6
Surveillance
3.4.7
Optical Character Recognition
3.4.8
Security Analytics
Summary
References
Applications of Parallelism by Task
Introduction
Primary Domains
4.2.1
Document Understanding
4.2.2
Image Understanding
4.2.3
Biometrics
4.2.4
Security Printing
Summary
References
Application of Parallelism by Component
Introduction
Primary Domains
5.2.1
Document Understanding
5.2.2
Image Understanding
55
59
64
66
67
68
69
70
71
72
73
73
74
75
75
77
78
79
86
86
90
90
92
95
97
98
101
101
102
104
104
105
112
118
126
131
135
136
137
137
138
138
152
3
3.1
3.2
3.3
3.4
3.5
4
4.1
4.2
4.3
5
5.1
5.2
Zgłoś jeśli naruszono regulamin