Data Streams : Models and Algorithms
It primarily discusses issues related to the mining aspects of data streams rather than the database management aspect of streams. This volume covers mining aspects of data streams in a comprehensive style. Each contributed chapter, from a variety of well known researchers in the data mining field, contains a survey on the topic, the key ideas in the field from that particular topic, and future research directions.
Data security : Technical and organizational protection measures against data loss and computer Crime
Offers an easy-to understand introduction to technical and organizational data security. It provides an insight into the technical knowledge that is mandatory for data protection officers. Data security is an inseparable part of data protection, which is becoming more and more important in our society. It can only be implemented effectively if there is an understanding of technical interrelationships and threats.
Data Science-Based Full-Lifespan Management of Lithium-Ion Battery : Manufacturing, Operation and Reutilization
This book comprehensively consolidates studies in the rapidly emerging field of battery management. The primary focus is to overview the new and emerging data science technologies for full-lifespan management of Li-ion batteries, which are categorized into three groups, namely (i) battery manufacturing management, (ii) battery operation management, and (iii) battery reutilization management. The key challenges, future trends as well as promising data-science technologies to further improve this research field are discussed.
Data science, AI, and machine learning in drug development
The confluence of big data, AI, and machine learning has led to a paradigm shift in how innovative medicines are developed and healthcare delivered. To fully capitalize on these technological advances, it is essential to systematically harness data from diverse sources and leverage digital technologies and advanced analytics to enable data-driven decisions. Data science stands at a unique moment of opportunity to lead such a transformative change. Intended to be a single source of information, Data Science, AI, and Machine Learning in Drug Research and Development covers a wide range of topics on the changing landscape of drug R&D, emerging applications of big data, AI and machine learning in drug development, and the build of robust data science organizations to drive biopharmaceutical digital transformations
Data science on the Google cloud platform : Implementing end-to-end real-time data pipelines : From ingest to machine learning
Learn how easy it is to apply sophisticated statistical and machine learning methods to real-world problems when you build using Google Cloud Platform (GCP). This hands-on guide shows data engineers and data scientists how to implement an end-to-end data pipeline with cloud native tools on GCP. You'll work through a sample business decision by employing a variety of data science approaches. Follow along by building a data pipeline in your own project on GCP, and discover how to solve data science problems in a transformative and more collaborative way. Employ best practices in building highly scalable data and ML pipelines on Google Cloud Automate and schedule data ingest using Cloud Run Create and populate a dashboard in Data Studio Build a real-time analytics pipeline using Pub/Sub, Dataflow, and BigQuery Conduct interactive data exploration with BigQuery Create a Bayesian model with Spark on Cloud Dataproc Forecast time series and do anomaly detection with BigQuery ML Aggregate within time windows with Dataflow Train explainable machine learning models with Vertex AI Operationalize ML with Vertex AI Pipelines
Data science in theory and practice : Techniques for big data analytics and complex data sets
Delivers a comprehensive treatment of the mathematical and statistical models useful for analyzing data sets arising in various disciplines, like banking, finance, health care, bioinformatics, security, education, and social services. Written in five parts, the book examines some of the most commonly used and fundamental mathematical and statistical concepts that form the basis of data science. The authors go on to analyze various data transformation techniques useful for extracting information from raw data, long memory behavior, and predictive modeling. Readers will also learn from topics like: Analyses of foundational theoretical subjects, including the history of data science, matrix algebra and random vectors, and multivariate analysis A comprehensive examination of time series forecasting, including the different components of time series and transformations to achieve stationarity Introductions to both the R and Python programming languages, including basic data types and sample manipulations for both languages An exploration of algorithms, including how to write one and how to perform an asymptotic analysis A comprehensive discussion of several techniques for analyzing and predicting complex data sets
Data science for economics and finance : Methodologies and applications
The book starts with an introduction on the use of data science technologies in economics and finance and is followed by thirteen chapters showing success stories of the application of specific data science methodologies, touching on particular topics related to novel big data sources and technologies for economic analysis (e.g. social media and news); big data models leveraging on supervised/unsupervised (deep) machine learning; natural language processing to build economic and financial indicators; and forecasting and nowcasting of economic variables through time series analysis.
Data Science for Civil Engineering : A Beginner's Guide
Explains use of data science-based techniques for modeling and providing optimal solutions to complex problems in civil engineering. It discusses civil engineering problems like air, water and land pollution, climate crisis, transportation infrastructures, traffic and travel modes, mobility services, and so forth. Divided into two sections, the first one deals with the basics of data science and essential mathematics while the second section covers pertinent applications in structural and environmental engineering, construction management, and transportation.
Data science and data analytics : Opportunities and challenges
Gives the concept of data science, tools, and algorithms that exist for many useful applications / Provides many challenges and opportunities in data science and data analytics that help researchers to identify research gaps or problems / Identifies many areas and uses of data science in the smart era / Applies data science to agriculture, healthcare, graph mining, education, security, etc.
Data Science and Classification
This volume provides new methodological developments in data analysis and classification. A wide range of topics is covered that includes the measurement of similarity and dissimilarity, methods for classification and clustering, network and graph analyses, analysis of symbolic data, and web mining. Apart from structural and theoretical results the book shows how to apply the proposed to a variety of problems, for example in medicine, microarray analysis, social network structures, and music. The combination of new methodological advances with the wide range of real applications collected in this volume is of special value for researchers when choosing the appropriate among newly developed analytical tools for their research problems in classification and data analysis.
Data science and analytics ; 5th International conference on recent developments in science, engineering and technology, REDSET 2019, Gurugram, India, November 15–16, 2019, Revised Selected Papers, Part II
This two-volume set (CCIS 1229 and CCIS 1230) constitutes the refereed proceedings of the 5th International Conference on Recent Developments in Science, Engineering and Technology, REDSET 2019, held in Gurugram, India, in November 2019. The 74 revised full papers presented were carefully reviewed and selected from total 353 submissions. The papers are organized in topical sections on data centric programming; next generation computing; social and web analytics; security in data science analytics; big data analytics
Data science and analytics ; 5th International conference on recent developments in science, engineering and technology, REDSET 2019, Gurugram, India, November 15–16, 2019, Revised Selected Papers, Part I
This two-volume set (CCIS 1229 and CCIS 1230) constitutes the refereed proceedings of the 5th International Conference on Recent Developments in Science, Engineering and Technology, REDSET 2019, held in Gurugram, India, in November 2019. The 74 revised full papers presented were carefully reviewed and selected from total 353 submissions. The papers are organized in topical sections on data centric programming; next generation computing; social and web analytics; security in data science analytics; big data analytics.
Data science ; 6th International Conference of Pioneering Computer Scientists, Engineers and Educators, ICPCSEE 2020, Taiyuan, China, September 18-21, 2020, Proceedings, Part II
This two volume set (CCIS 1257 and 1258) constitutes the refereed proceedings of the 6th International Conference of Pioneering Computer Scientists, Engineers and Educators, ICPCSEE 2020 held in Taiyuan, China, in September 2020. The 98 papers presented in these two volumes were carefully reviewed and selected from 392 submissions. The papers are organized in topical sections: database, machine learning, network, graphic images, system, natural language processing, security, algorithm, application, and education.
Data Science ; 6th International Conference of Pioneering Computer Scientists, Engineers and Educators, ICPCSEE 2020, Taiyuan, China, September 18-21, 2020, Proceedings, Part I
This two volume set (CCIS 1257 and 1258) constitutes the refereed proceedings of the 6th International Conference of Pioneering Computer Scientists, Engineers and Educators, ICPCSEE 2020 held in Taiyuan, China, in September 2020. The 98 papers presented in these two volumes were carefully reviewed and selected from 392 submissions. The papers are organized in topical sections: database, machine learning, network, graphic images, system, natural language processing, security, algorithm, application, and education.
Data rights in transition
Maps the development of data rights that formed and reformed in response to the socio-technical transformations of the postwar twentieth century. The authors situate these rights, with their early pragmatic emphasis on fair information processing, as different from and less symbolically powerful than utopian human rights of older centuries
Data Quality and Record Linkage Techniques
This book helps practitioners gain a deeper understanding, at an applied level, of the issues involved in improving data quality through editing, imputation, and record linkage. The first part of the book deals with methods and models. Here, we focus on the Fellegi-Holt edit-imputation model, the Little-Rubin multiple-imputation scheme, and the Fellegi-Sunter record linkage model. Brief examples are included to show how these techniques work. In the second part of the book, the authors present real-world case studies in which one or more of these techniques are used. They cover a wide variety of application areas. These include mortgage guarantee insurance, medical, biomedical, highway safety, and social insurance as well as the construction of list frames and administrative lists.
Data Quality : Concepts, Methodologies and Techniques
Batini and Scannapieco present a comprehensive and systematic introduction to the wide set of issues related to data quality. They start with a detailed description of different data quality dimensions, like accuracy, completeness, and consistency, and their importance in different types of data, like federated data, web data, or time-dependent data, and in different data categories classified according to frequency of change, like stable, long-term, and frequently changing data. The book's extensive description of techniques and methodologies from core data quality research as well as from related fields like data mining, probability theory, statistical data analysis, and machine learning gives an excellent overview of the current state of the art.
Data Processing in Precise Time and Frequency Applications
The book describes the data processing at various levels: design of the time and frequency references, characterization of the time and frequency references, applications involving precise time and/or frequency references. The metrological properties stability, accuracy and reproducibility are defined and the processes leading to their characterization are shown.
Data Privacy and Trust in Cloud Computing : Building trust in the cloud through assurance and accountability
This book brings together perspectives from multiple disciplines including psychology, law, IS, and computer science on data privacy and trust in the cloud. Cloud technology has fueled rapid, dramatic technological change, enabling a level of connectivity that has never been seen before in human history.
Data parallel C++programming accelerated systems using C++ and SYCL
Full of practical advice, detailed explanations, and code examples to illustrate key topics. SYCL enables access to parallel resources in modern accelerated heterogeneous systems. Now, a single C++ application can use any combination of devices–including GPUs, CPUs, FPGAs, and ASICs–that are suitable to the problems at hand. This book teaches data-parallel programming using C++ with SYCL and walks through everything needed to program accelerated systems. The book begins by introducing data parallelism and foundational topics for effective use of SYCL. Later chapters cover advanced topics, including error handling, hardware-specific programming, communication and synchronization, and memory model considerations.



















