Overview

Authors:

Robson L. F. Cordeiro ⁰,
Christos Faloutsos ¹,
Caetano Traina Júnior ²

Robson L. F. Cordeiro
1. Computer Science Department - ICMC, University of São Paulo, São Carlos, Brazil
View author publications

You can also search for this author in PubMed Google Scholar
Christos Faloutsos
1. School of Computer Science, Carnegie Mellon University, Pittsburgh, USA
View author publications

You can also search for this author in PubMed Google Scholar
Caetano Traina Júnior
1. Computer Science Department - ICMC, University of São Paulo, São Carlos, Brazil
View author publications

You can also search for this author in PubMed Google Scholar

Contains a survey on clustering algorithms for moderate-to-high dimensionality data
Includes examples of applications in breast cancer diagnosis, region detection in satellite images, assistance to climate change forecast, recommender systems for the Web, and social networks
Includes supplementary material: sn.pub/extras

Part of the book series: SpringerBriefs in Computer Science (BRIEFSCOMPUTER)

13k Accesses
3 Citations
1 Altmetric

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 16.99 ~~USD 39.99~~

Discount applied Price excludes VAT (USA)

Softcover Book USD 16.99 ~~USD 54.99~~

Discount applied Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

Table of contents (7 chapters)

Front Matter

Pages i-xi

Download chapter PDF
Introduction
- Robson L. F. Cordeiro, Christos Faloutsos, Caetano Traina Júnior
Pages 1-6
Related Work and Concepts
- Robson L. F. Cordeiro, Christos Faloutsos, Caetano Traina Júnior
Pages 7-20
Clustering Methods for Moderate-to-High Dimensionality Data
- Robson L. F. Cordeiro, Christos Faloutsos, Caetano Traina Júnior
Pages 21-32
Halite
- Robson L. F. Cordeiro, Christos Faloutsos, Caetano Traina Júnior
Pages 33-67
BoW
- Robson L. F. Cordeiro, Christos Faloutsos, Caetano Traina Júnior
Pages 69-92
QMAS
- Robson L. F. Cordeiro, Christos Faloutsos, Caetano Traina Júnior
Pages 93-109
Conclusion
- Robson L. F. Cordeiro, Christos Faloutsos, Caetano Traina Júnior
Pages 111-116

Keywords

About this book

The amount and the complexity of the data gathered by current enterprises are increasing at an exponential rate. Consequently, the analysis of Big Data is nowadays a central challenge in Computer Science, especially for complex data. For example, given a satellite image database containing tens of Terabytes, how can we find regions aiming at identifying native rainforests, deforestation or reforestation? Can it be made automatically? Based on the work discussed in this book, the answers to both questions are a sound “yes”, and the results can be obtained in just minutes. In fact, results that used to require days or weeks of hard work from human specialists can now be obtained in minutes with high precision. Data Mining in Large Sets of Complex Data discusses new algorithms that take steps forward from traditional data mining (especially for clustering) by considering large, complex datasets. Usually, other works focus in one aspect, either data size or complexity. This work considers both: it enables mining complex data from high impact applications, such as breast cancer diagnosis, region classification in satellite images, assistance to climate change forecast, recommendation systems for the Web and social networks; the data are large in the Terabyte-scale, not in Giga as usual; and very accurate results are found in just minutes. Thus, it provides a crucial and well timed contribution for allowing the creation of real time applications that deal with Big Data of high complexity in which mining on the fly can make an immeasurable difference, such as supporting cancer diagnosis or detecting deforestation.

Reviews

From the reviews:

“This book is a must-read for all data mining professionals, as it explains new and superior techniques for clustering large datasets of high-dimensional data. It would also be interesting for professionals who work with large volumes of complex data and want real-time information for better decision making.” (Alexis Leon, Computing Reviews, July, 2013)

Authors and Affiliations

Computer Science Department - ICMC, University of São Paulo, São Carlos, Brazil

Robson L. F. Cordeiro, Caetano Traina Júnior
School of Computer Science, Carnegie Mellon University, Pittsburgh, USA

Christos Faloutsos

Bibliographic Information

Book Title: Data Mining in Large Sets of Complex Data
Authors: Robson L. F. Cordeiro, Christos Faloutsos, Caetano Traina Júnior
Series Title: SpringerBriefs in Computer Science
DOI: https://doi.org/10.1007/978-1-4471-4890-6
Publisher: Springer London
eBook Packages: Computer Science, Computer Science (R0)
Softcover ISBN: 978-1-4471-4889-0Published: 11 January 2013
eBook ISBN: 978-1-4471-4890-6Published: 11 January 2013
Series ISSN: 2191-5768
Series E-ISSN: 2191-5776
Edition Number: 1
Number of Pages: XI, 116
Number of Illustrations: 12 b/w illustrations, 25 illustrations in colour
Topics: Data Mining and Knowledge Discovery, Database Management

Publish with us

Policies and ethics

Data Mining in Large Sets of Complex Data

Overview

Access this book

Other ways to access

Table of contents (7 chapters)

Front Matter

Introduction

Related Work and Concepts

Clustering Methods for Moderate-to-High Dimensionality Data

Halite

BoW

QMAS

Conclusion

Keywords

About this book

Reviews

Authors and Affiliations

Computer Science Department - ICMC, University of São Paulo, São Carlos, Brazil

School of Computer Science, Carnegie Mellon University, Pittsburgh, USA

Bibliographic Information

Publish with us

Search

Navigation