Nonlinear Dimensionality Reduction
This book describes existing and advanced methods to reduce the dimensionality of numerical databases. For each method, the description starts from intuitive ideas, develops the necessary mathematical details, and ends by outlining the algorithmic implementation. Methods are compared with each other with the help of different illustrative examples. The purpose of the book is to summarize clear facts and ideas about well-known methods as well as recent developments in the topic of nonlinear dimensionality reduction. With this goal in mind, methods are all described from a unifying point of view, in order to highlight their respective strengths and shortcomings.
New Developments in Classification and Data Analysis ; Proceedings of the Meeting of the Classification and Data Analysis Group (CLADAG) of the Italian Statistical Society, University of Bologna, September 22-24, 2003
This volume contains revised versions of selected papers presented during the biannual meeting of the Classification and Data Analysis Group of SocietA Italiana di Statistica, which was held in Bologna, September 22-24, 2003. The scientific program of the conference included 80 contributed papers. the conference provided a large number of scientists and experts from home and abroad with an attractive forum for discussions and mutual exchange of knowledge. The talks in the different sessions focused on methodological developments in supervised and unsupervised classification and in data analysis, also p- viding relevant contributions in the context of applications. This suggested the presentation of the 43 selected papers in three parts as follows: CLASSIFICATION AND CLUSTERING Non parametric classification Clustering and dissimilarities MULTIVARIATE STATISTICS AND DATA ANALYSIS APPLIED MULTIVARIATE STATISTICS Environmental data Microarray data Behavioural and text data Financial data
Multivariate Statistics : Exercises and Solutions
The first part is devoted to graphical techniques. The second part deals with multivariate random variables and presents the derivation of estimators and tests for various practical situations. The last part introduces a wide variety of exercises in applied multivariate data analysis. The book demonstrates the application of simple calculus and basic multivariate methods in real life situations. It contains altogether 234 solved exercises which can assist a university teacher in setting up a modern multivariate analysis course. All computer-based exercises are available in the R or XploRe languages.
Modern Multivariate Statistical Techniques : Regression, Classification, and Manifold Learning
Remarkable advances in computation and data storage and the ready availability of huge data sets have been the keys to the growth of the new disciplines of data mining and machine learning, while the enormous success of the Human Genome Project has opened up the field of bioinformatics. These exciting developments, which led to the introduction of many innovative statistical tools for high-dimensional data analysis, are described here in detail. The author takes a broad perspective; for the first time in a book on multivariate analysis, nonlinear methods are discussed in detail as well as linear methods.
Modern Multidimensional Scalin : Theory and Applications
The book provides a comprehensive treatment of multidimensional scaling (MDS), a family of statistical techniques for analyzing the structure of (dis)similarity data. This book may be used as an introduction to MDS for students in psychology, sociology, and marketing. The prerequisite is an elementary background in statistics. The book is also well suited for a variety of advanced courses on MDS topics. All the mathematics required for more advanced topics is developed systematically. This second edition adds some 140 pages of new material. Many chapters are revised or have sections reflecting new insights and developments in MDS. There are two new chapters, one on asymmetric models and the other on unfolding. There are also numerous exercises that help the reader to practice what he or she has learned, and to delve deeper into the models and its intricacies.
Handbook of Data Visualization
This new volume in the series Springer Handbooks of Computational Statistics gives an overview of modern data visualization methods, both in theory and practice. There are definitive chapters on modern graphical tools such as mosaic plots, parallel coordinate plots and linked views. There are chapters dedicated to graphical methodology for particular areas of statistics, for example Bayesian analysis, genomic data and cluster analysis, as well as chapters on software for graphics.
Fundamentals of pattern recognition and machine learning
Fundamentals of Pattern Recognition and Machine Learning is designed for a one or two-semester introductory course in Pattern Recognition or Machine Learning at the graduate or advanced undergraduate level. The book combines theory and practice and is suitable to the classroom and self-study.
Data science in theory and practice : Techniques for big data analytics and complex data sets
Delivers a comprehensive treatment of the mathematical and statistical models useful for analyzing data sets arising in various disciplines, like banking, finance, health care, bioinformatics, security, education, and social services. Written in five parts, the book examines some of the most commonly used and fundamental mathematical and statistical concepts that form the basis of data science. The authors go on to analyze various data transformation techniques useful for extracting information from raw data, long memory behavior, and predictive modeling. Readers will also learn from topics like: Analyses of foundational theoretical subjects, including the history of data science, matrix algebra and random vectors, and multivariate analysis A comprehensive examination of time series forecasting, including the different components of time series and transformations to achieve stationarity Introductions to both the R and Python programming languages, including basic data types and sample manipulations for both languages An exploration of algorithms, including how to write one and how to perform an asymptotic analysis A comprehensive discussion of several techniques for analyzing and predicting complex data sets
Data Science and Classification
This volume provides new methodological developments in data analysis and classification. A wide range of topics is covered that includes the measurement of similarity and dissimilarity, methods for classification and clustering, network and graph analyses, analysis of symbolic data, and web mining. Apart from structural and theoretical results the book shows how to apply the proposed to a variety of problems, for example in medicine, microarray analysis, social network structures, and music. The combination of new methodological advances with the wide range of real applications collected in this volume is of special value for researchers when choosing the appropriate among newly developed analytical tools for their research problems in classification and data analysis.
Data Analysis, Classification and the Forward Search ; Proceedings of the Meeting of the Classification and Data Analysis Group (CLADAG) of the Italian Statistical Society, University of Parma, June 6-8, 2005
This book presents new developments in data analysis, classification and multivariate statistics, and in their algorithmic implementation. The volume offers contributions to the theory of clustering and discrimination, multidimensional data analysis, data mining, and robust statistics with a special emphasis on the novel Forward Search approach. Many papers provide significant insight in a wide range of fields of application. Customer satisfaction and service evaluation are two examples of such emerging fields.
Data Analysis and Decision Support
Presents advances in data analysis and decision support and gives an actual overview on the interface between mathematics, operations research, statistics, computer science, and management science. Areas that receive considerable attention in the book are discrimination and clustering, multidimensional scaling, data mining and more.
Applied Multivariate Statistical Analysis
This book presents the tools and concepts of multivariate data analysis in a way that is understandable for non-mathematicians and practitioners who face statistical data analysis.
Analysis, Synthesis, and Perception of Musical Sounds : The Sound of Music
Analysis, Synthesis, and Perception of Musical Sounds contains a detailed treatment of basic methods for analysis and synthesis of musical sounds, including the phase vocoder method, the McAulay-Quatieri frequency-tracking method, the constant-Q transform, and methods for pitch tracking with several examples shown. Various aspects of musical sound spectra such as spectral envelope, spectral centroid, spectral flux, and spectral irregularity are defined and discussed. One chapter is devoted to the control and synthesis of spectral envelopes. Two advanced methods of analysis/synthesis are given: "Sines Plus Transients Plus Noise" and "Spectrotemporal Reassignment" are covered. Methods for timbre morphing are given. The last two chapters discuss the perception of musical sounds based on discrimination and multidimensional scaling timbre models.
An R and S-Plus® Companion to Multivariate Analysis
Most data sets collected by researchers are multivariate, and in the majority of cases the variables need to be examined simultaneously to get the most informative results. This requires the use of one or other of the many methods of multivariate analysis, and the use of a suitable software package such as S-PLUS or R. In this book the core multivariate methodology is covered along with some basic theory for each method described. The necessary R and S-PLUS code is given for each analysis in the book, with any differences between the two highlighted.
Advances in Data Mining ; Vol. 4065 : Applications in Medicine, Web Mining, Marketing, Image and Signal Mining, 6th Industrial Conference on Data Mining, ICDM 2006, Leipzig, Germany, July 14-15, 2006, Proceedings
The Industrial Conference on Data Mining ICDM-Leipzig was the sixth event in a series of annual events which started in 2000. We are pleased to note that the topic data mining with special emphasis on real-world applications has been adopted by so many researchers all over the world into their research work. We received 156 papers from 19 different countries. The main topics are data mining in medicine and marketing, web mining, mining of images and signals, theoretical aspects of data mining, and aspects of data mining that bundle a series of different data mining applications such as intrusion detection, knowledge management, manufacturing process control, time-series mining and criminal investigations. The Program Committee worked hard in order to select the best papers.














