Nonparametric Functional Data Analysis : Theory and Practice
Modern apparatuses allow us to collect samples of functional data, mainly curves but also images. On the other hand, nonparametric statistics produces useful tools for standard data exploration. This book links these two fields of modern statistics by explaining how functional data can be studied through parameter-free statistical ideas. This book starts from theoretical foundations including functional nonparametric modeling, description of the mathematical framework, construction of the statistical methods, and statements of their asymptotic behaviors. It proceeds to computational issues including R and S-PLUS routines. Several functional datasets in chemometrics, econometrics, and pattern recognition are used to emphasize the wide scope of nonparametric functional data analysis in applied sciences. The companion Web site includes R and S-PLUS routines, command lines for reproducing examples presented in the book, and the functional datasets. Rather than set application against theory, this book is really an interface of these two features of statistics. A special effort has been made in writing this book to accommodate several levels of reading.
Integrating Data Science and Earth Science : Challenges and Solutions
This book presents the results of three years collaboration between earth scientists and data scientist, in developing and applying data science methods for scientific discovery. The book will be highly beneficial for other researchers at senior and graduate level, interested in applying visual data exploration, computational approaches and scientifc workflows.
Data science on the Google cloud platform : Implementing end-to-end real-time data pipelines : From ingest to machine learning
Learn how easy it is to apply sophisticated statistical and machine learning methods to real-world problems when you build using Google Cloud Platform (GCP). This hands-on guide shows data engineers and data scientists how to implement an end-to-end data pipeline with cloud native tools on GCP. You'll work through a sample business decision by employing a variety of data science approaches. Follow along by building a data pipeline in your own project on GCP, and discover how to solve data science problems in a transformative and more collaborative way. Employ best practices in building highly scalable data and ML pipelines on Google Cloud Automate and schedule data ingest using Cloud Run Create and populate a dashboard in Data Studio Build a real-time analytics pipeline using Pub/Sub, Dataflow, and BigQuery Conduct interactive data exploration with BigQuery Create a Bayesian model with Spark on Cloud Dataproc Forecast time series and do anomaly detection with BigQuery ML Aggregate within time windows with Dataflow Train explainable machine learning models with Vertex AI Operationalize ML with Vertex AI Pipelines
Linked Open Data -- Creating Knowledge Out of Interlinked Data : Results of the LOD2 Project
Linked Open Data (LOD) is a pragmatic approach for realizing the Semantic Web vision of making the Web a global, distributed, semantics-based information system. This book presents an overview on the results of the research project “LOD2 -- Creating Knowledge out of Interlinked Data”. LOD2 is a large-scale integrating project co-funded by the European Commission within the FP7 Information and Communication Technologies Work Program. Commencing in September 2010, this 4-year project comprised leading Linked Open Data research groups, companies, and service providers from across 11 European countries and South Korea.
Analysing Ecological Data
This book provides a practical introduction to analysing ecological data using real data sets collected as part of postgraduate ecological studies or research projects. The first part of the book gives a largely non-mathematical introduction to data exploration, univariate methods (including GAM and mixed modelling techniques), multivariate analysis, time series analysis (e.g. common trends) and spatial statistics. The second part provides 17 case studies, mainly written together with biologists who attended courses given by the first authors. The case studies include topics ranging from terrestrial ecology to marine biology.




