Articles by "Data Analysis"

Showing posts with label Data Analysis. Show all posts

Bradley Efron, Trevor Hastie ... 495 pages - Publisher: Cambridge Univ. Press; (July, 2016) ... Language: English - ISBN-10: 1107149894 - ISBN-13: 978-1107149892.

The twenty-first century has seen a breathtaking expansion of statistical methodology, both in scope and in influence. 'Big data', 'data science', and 'machine learning' have become familiar terms in the news, as statistical methods are brought to bear upon the enormous data sets of modern science and commerce. How did we get here? And where are we going? This book takes us on an exhilarating journey through the revolution in data analysis following the introduction of electronic computation in the 1950s. Beginning with classical inferential theories - Bayesian, frequentist, Fisherian - individual chapters take up a series of influential topics: survival analysis, logistic regression, empirical Bayes, the jackknife and bootstrap, random forests, neural networks, Markov chain Monte Carlo, inference after model selection, and dozens more. The distinctly modern approach integrates methodology and algorithms with statistical inference. The book ends with speculation on the future direction of statistics and data science.

Valentina Emilia Balas, Sanjiban Sekhar Roy, Dharmendra Sharma, Pijush Samui ... 389 pages - Publisher: Springer; (March, 2019) ... Language: English - ISBN-10: 3030114783 - ISBN-13: 978-303011478.

This book presents a broad range of deep-learning applications related to vision, natural language processing, gene expression, arbitrary object recognition, driverless cars, semantic image segmentation, deep visual residual abstraction, brain–computer interfaces, big data processing, hierarchical deep learning networks as game-playing artefacts using regret matching, and building GPU-accelerated deep learning frameworks. Deep learning, an advanced level of machine learning technique that combines class of learning algorithms with the use of many layers of nonlinear units, has gained considerable attention in recent times. Unlike other books on the market, this volume addresses the challenges of deep learning implementation, computation time, and the complexity of reasoning and modeling different type of data. As such, it is a valuable and comprehensive resource for engineers, researchers, graduate students and Ph.D. scholars.

Ralf T. Kreutzer, Marie Sirrenberg ... 313 pages - Publisher: Springer; (September, 2019) ... Language: English - ISBN-10: 3030252701 - ISBN-13: 978-3030252700.

Artificial Intelligence (AI) will change the lives of people and businesses more fundamentally than many people can even imagine today. This book illustrates the importance of AI in an era of digitalization. It introduces the foundations of AI and explains its benefits and challenges for companies and entire industries. In this regard, AI is approached not just as yet another technology, but as a fundamental innovation, which will spread into all areas of the economy and life, and will disrupt business processes and business models in the years to come. In turn, the book assesses the potential that AI holds, and clarifies the framework that is necessary for pursuing a responsible approach to AI. In a series of best-practice cases, the book subsequently highlights a broad range of sectors and industries, from production to services; from customer service to marketing and sales; and in industries like retail, health care, energy, transportation and many more. In closing, a dedicated chapter outlines a roadmap for a specific corporate AI journey.

Kieth A. Carlson, Jennifer R. Winquist ... 656 pages - Publisher: SAGE Publications, Inc; 2nd edition (February, 2017) ... Language: English - ISBN-10: 148337873X - ISBN-13: 978-1483378732.

An Introduction to Statistics: An Active Learning Approach, Second Edition by Kieth A. Carlson and Jennifer R. Winquist takes a unique, active approach to teaching and learning introductory statistics that allows students to discover and correct their misunderstandings as chapters progress rather than at their conclusion. Empirically-developed, self-correcting activities reinforce and expand on fundamental concepts, targeting and holding students’ attention. Based on contemporary memory research, this learner-centered approach leads to better long-term retention through active engagement while generating explanations. Along with carefully placed reading questions, this edition includes learning objectives, realistic research scenarios, practice problems, self-test questions, problem sets, and practice tests to help students become more confident in their ability to perform statistics.

Narayan C. Giri ... 537 pages - Publisher: CRC Press; 2nd edition (January, 2019) ... Language: English - AmazonSIN: B07M7XVF7J.

Beginning with the historical background of probability theory, this thoroughly revised text examines all important aspects of mathematical probability - including random variables, probability distributions, characteristic and generating functions, stochatic convergence, and limit theorems - and provides an introduction to various types of statistical problems, covering the broad range of statistical inference.; Requiring a prerequisite in calculus for complete understanding of the topics discussed, the Second Edition contains new material on: univariate distributions; multivariate distributions; large-sample methods; decision theory; and applications of ANOVA.; A primary text for a year-long undergraduate course in statistics (but easily adapted for a one-semester course in probability only), Introduction to Probability and Statistics is for undergraduate students in a wide range of disciplines-statistics, probability, mathematics, social science, economics, engineering, agriculture, biometry, and education.

Larry Wasserman ... 442 pages - Publisher: Springer; (September, 2004) ... Language: English - ISBN-10: 0387402721 - ISBN-13: 978-0387402727.

Taken literally, the title "All of Statistics" is an exaggeration. But in spirit, the title is apt, as the book does cover a much broader range of topics than a typical introductory book on mathematical statistics. This book is for people who want to learn probability and statistics quickly. It is suitable for graduate or advanced undergraduate students in computer science, mathematics, statistics, and related disciplines. The book includes modern topics like non-parametric curve estimation, bootstrapping, and classification, topics that are usually relegated to follow-up courses. The reader is presumed to know calculus and a little linear algebra. No previous knowledge of probability and statistics is required. Statistics, data mining, and machine learning are all concerned with collecting and analysing data.

Chris Chatfield, A. Collins ... 248 pages - Publisher: Chapman and Hall/CRC; (May, 1981) ... Language: English - ISBN-10: 9780412160400 - ISBN-13: 978-0412160400.

This book provides an introduction to the analysis of multivariate data.It describes multivariate probability distributions, the preliminary analysisof a large -scale set of data, princ iple component and factor analysis,traditional normal theory material, as well as multidimensional scaling andcluster analysis.Introduction to Multivariate Analysis provides a reasonable blend oftheory and practice. Enough theory is given to introduce the concepts andto make the topics mathematically interesting. In addition the authors discussthe use (and misuse) of the techniques in pra ctice and present appropriatereal-life examples from a variety of areas includ ing agricultural research,soc iology and crim inology. The book should be suitable both for researchworkers and as a text for students taking a course on multivariate analysis.

Sadanori Konishi ... 338 pages - Publisher: Chapman and Hall/CRC; (June, 2014) ... Language: English - ISBN-10: 1466567287 - ISBN-13: 978-1466567283.

Introduction to Multivariate Analysis: Linear and Nonlinear Modeling shows how multivariate analysis is widely used for extracting useful information and patterns from multivariate data and for understanding the structure of random phenomena. Along with the basic concepts of various procedures in traditional multivariate analysis, the book covers nonlinear techniques for clarifying phenomena behind observed multivariate data. It primarily focuses on regression modeling, classification and discrimination, dimension reduction, and clustering. The text thoroughly explains the concepts and derivations of the AIC, BIC, and related criteria and includes a wide range of practical examples of model selection and evaluation criteria. To estimate and evaluate models with a large number of predictor variables, the author presents regularization methods, including the L1 norm regularization that gives simultaneous model estimation and variable selection. For advanced undergraduate and graduate students in statistical science, this text provides a systematic description of both traditional and newer techniques in multivariate analysis and machine learning. It also introduces linear and nonlinear statistical modeling for researchers and practitioners in industrial and systems engineering, information science, life science, and other areas.

Wolfgang Karl Härdle, Léopold Simar ... 558 pages - Publisher: Springer; 5th edition(November, 2019) ... Language: English - ISBN-10: 3030260054 - ISBN-13: 978-3030260057.

This textbook presents the tools and concepts used in multivariate data analysis in a style accessible for non-mathematicians and practitioners. All chapters include practical exercises that highlight applications in different multivariate data analysis fields, and all the examples involve high to ultra-high dimensions and represent a number of major fields in big data analysis.

For this new edition, the book has been updated and extensively revised and now includes an extended chapter on cluster analysis. All solutions to the exercises are supplemented by R and MATLAB or SAS computer code and can be downloaded from the Quantlet platform. Practical exercises from this book and their solutions can also be found in the accompanying Springer book by W.K. Härdle and Z. Hlávka: Multivariate Statistics - Exercises and Solutions.

Roman Vershynin ... 296 pages - Publisher: Cambridge Univ.Press; (September, 2018) ... Language: English - ISBN-10: 1108415199 - ISBN-13: 978-1108415194.

High-dimensional probability offers insight into the behavior of random vectors, random matrices, random subspaces, and objects used to quantify uncertainty in high dimensions. Drawing on ideas from probability, analysis, and geometry, it lends itself to applications in mathematics, statistics, theoretical computer science, signal processing, optimization, and more. It is the first to integrate theory, key tools, and modern applications of high-dimensional probability. Concentration inequalities form the core, and it covers both classical results such as Hoeffding's and Chernoff's inequalities and modern developments such as the matrix Bernstein's inequality. It then introduces the powerful methods based on stochastic processes, including such tools as Slepian's, Sudakov's, and Dudley's inequalities, as well as generic chaining and bounds based on VC dimension. A broad range of illustrations is embedded throughout, including classical and modern results for covariance estimation, clustering, networks, semidefinite programming, coding, dimension reduction, matrix completion, machine learning, compressed sensing, and sparse regression.

Debbie L. Hahs-Vaughn ... 662 pages - Publisher: Routledge; (November, 2016) ... Language: English - ISBN-10: 0415842360 - ISBN-13: 978-0415842365.

More comprehensive than other texts, this new book covers the classic and cutting edge multivariate techniques used in today’s research. Ideal for courses on multivariate statistics/analysis/design, advanced statistics or quantitative techniques taught in psychology, education, sociology, and business, the book also appeals to researchers with no training in multivariate methods. Through clear writing and engaging pedagogy and examples using real data, Hahs-Vaughn walks students through the most used methods to learn why and how to apply each technique. A conceptual approach with a higher than usual text-to-formula ratio helps reader’s master key concepts so they can implement and interpret results generated by today’s sophisticated software. Annotated screenshots from SPSS and other packages are integrated throughout. Designed for course flexibility, after the first 4 chapters, instructors can use chapters in any sequence or combination to fit the needs of their students. Each chapter includes a ‘mathematical snapshot’ that highlights the technical components of each procedure, so only the most crucial equations are included.

Highlights include: -Outlines, key concepts, and vignettes related to key concepts preview what’s to come in each chapter. -Examples using real data from education, psychology, and other social sciences illustrate key concepts. -Extensive coverage of assumptions including tables, the effects of their violation, and how to test for each technique. -Conceptual, computational, and interpretative problems mirror the real-world problems students encounter in their studies and careers. -A focus on data screening and power analysis with attention on the special needs of each particular method. -Instructions for using SPSS via screenshots and annotated output along with HLM, Mplus, LISREL, and G*Power where appropriate, to demonstrate how to interpret results. -Templates for writing research questions and APA-style write-ups of results which serve as models. -Propensity score analysis chapter that demonstrates the use of this increasingly popular technique. -A review of matrix algebra for those who want an introduction (prerequisites include an introduction to factorial ANOVA, ANCOVA, and simple linear regression, but knowledge of matrix algebra is not assumed).

Ulrich Kohler, Frauke Kreuter ... 497 pages - Publisher: Stata Press; 3rd edition (August, 2012) ... Language: English - ISBN-10: 1597181102 - ISBN-13: 978-1597181105.

Data Analysis Using Stata, Third Edition is a comprehensive introduction to both statistical methods and Stata. Beginners will learn the logic of data analysis and interpretation and easily become self-sufficient data analysts. Readers already familiar with Stata will find it an enjoyable resource for picking up new tips and tricks. The book is written as a self-study tutorial and organized around examples. It interactively introduces statistical techniques such as data exploration, description, and regression techniques for continuous and binary dependent variables. Step by step, readers move through the entire process of data analysis and in doing so learn the principles of Stata, data manipulation, graphical representation, and programs to automate repetitive tasks. This third edition includes advanced topics, such as factor-variables notation, average marginal effects, standard errors in complex survey, and multiple imputation in a way, that beginners of both data analysis and Stata can understand. Using data from a longitudinal study of private households, the authors provide examples from the social sciences that are relatable to researchers from all disciplines. The examples emphasize good statistical practice and reproducible research. Readers are encouraged to download the companion package of datasets to replicate the examples as they work through the book. Each chapter ends with exercises to consolidate acquired skills.

Max Bramer ... 544 pages - Publisher: Springer; 3rd edition (November, 2016) ... Language: English - AmazonSIN: B01N3LZ1KI.

This book explains and explores the principal techniques of Data Mining, the automatic extraction of implicit and potentially useful information from data, which is increasingly used in commercial, scientific and other application areas. It focuses on classification, association rule mining and clustering. Each topic is clearly explained, with a focus on algorithms not mathematical formalism, and is illustrated by detailed worked examples. The book is written for readers without a strong background in mathematics or statistics and any formulae used are explained in detail. It can be used as a textbook to support courses at undergraduate or postgraduate levels in a wide range of subjects including Computer Science, Business Studies, Marketing, Artificial Intelligence, Bioinformatics and Forensic Science. As an aid to self study, this book aims to help general readers develop the necessary understanding of what is inside the 'black box' so they can use commercial data mining packages discriminatingly, as well as enabling advanced readers or academic researchers to understand or contribute to future technical advances in the field. Each chapter has practical exercises to enable readers to check their progress. A full glossary of technical terms used is included. This expanded third edition includes detailed descriptions of algorithms for classifying streaming data, both stationary data, where the underlying model is fixed, and data that is time-dependent, where the underlying model changes from time to time - a phenomenon known as concept drift.

Alan Graham ... 320 pages - Publisher: Teach Yourself; (April, 2017) ... Language: English - AmazonSIN: B01LZ6WZXS.

Do you need to gain confidence with handling numbers and formulae? Do you want a clear, step-by-step guide to the key concepts and principles of statistics? Nearly all aspects of our lives can be subject to statistical analysis. Statistics: An Introduction shows you how to interpret, analyze and present figures. Assuming minimal knowledge of maths and using examples from a wide variety of everyday contexts, this book makes often complex concepts and techniques easy to get to grips with. This new edition has been fully updated. Whether you want to understand the statistics that you are bombarded with every day or are a student or professional coming to statistics from a wide range of disciplines, Statistics: An Introduction covers it all.

Nilanjan Dey ... 266 pages - Publisher: Springer; (November, 2019) ... Language: English - AmazonSIN: B0818MWNQJ.

The book discusses advantages of the firefly algorithm over other well-known metaheuristic algorithms in various engineering studies. The book provides a brief outline of various application-oriented problem solving methods, like economic emission load dispatch problem, designing a fully digital controlled reconfigurable switched beam nonconcentric ring array antenna, image segmentation, span minimization in permutation flow shop scheduling, multi-objective load dispatch problems, image compression, etc., using FA and its variants. It also covers the use of the firefly algorithm to select features, as research has shown that the firefly algorithm generates precise and optimal results in terms of time and optimality. In addition, the book also explores the potential of the firefly algorithm to provide a solution to traveling salesman problem, graph coloring problem, etc.

Seth Weidman ... 253 pages - Publisher: O'Reilly Media; (September, 2019) ... Language: English - AmazonSIN: B07XL53Y4C.

With the resurgence of neural networks in the 2010s, deep learning has become essential for machine learning practitioners and even many software engineers. This book provides a comprehensive introduction for data scientists and software engineers with machine learning experience. You’ll start with deep learning basics and move quickly to the details of important advanced architectures, implementing everything from scratch along the way. Author Seth Weidman shows you how neural networks work using a first principles approach. You’ll learn how to apply multilayer neural networks, convolutional neural networks, and recurrent neural networks from the ground up. With a thorough understanding of how neural networks work mathematically, computationally, and conceptually, you’ll be set up for success on all future deep learning projects.

This book provides: Extremely clear and thorough mental models—accompanied by working code examples and mathematical explanations—for understanding neural networks + Methods for implementing multilayer neural networks from scratch, using an easy-to-understand object-oriented framework + Working implementations and clear-cut explanations of convolutional and recurrent neural networks + Implementation of these neural network concepts using the popular PyTorch framework.

Richard S. Sutton, Andrew G. Barto ... 532 pages - Publisher: A Bradford Book; 2nd edition (October, 2018) ... Language: English - AmazonSIN: B07JN1QFW5.

The significantly expanded and updated new edition of a widely used text on reinforcement learning, one of the most active research areas in artificial intelligence: Reinforcement learning, one of the most active research areas in artificial intelligence, is a computational approach to learning whereby an agent tries to maximize the total amount of reward it receives while interacting with a complex, uncertain environment. In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the field's key ideas and algorithms. This second edition has been significantly expanded and updated, presenting new topics and updating coverage of other topics.

Like the first edition, this second edition focuses on core online learning algorithms, with the more mathematical material set off in shaded boxes. Part I covers as much of reinforcement learning as possible without going beyond the tabular case for which exact solutions can be found. Many algorithms presented in this part are new to the second edition, including UCB, Expected Sarsa, and Double Learning. Part II extends these ideas to function approximation, with new sections on such topics as artificial neural networks and the Fourier basis, and offers expanded treatment of off-policy learning and policy-gradient methods. Part III has new chapters on reinforcement learning's relationships to psychology and neuroscience, as well as an updated case-studies chapter including AlphaGo and AlphaGo Zero, Atari game playing, and IBM Watson's wagering strategy. The final chapter discusses the future societal impacts of reinforcement learning.

Eugene Charniak ... 192 pages - Publisher: The MIT Press; (January, 2019) ... Language: English - AmazonSIN: B07PGRZXN8.

This concise, project-driven guide to deep learning takes readers through a series of program-writing tasks that introduce them to the use of deep learning in such areas of artificial intelligence as computer vision, natural-language processing, and reinforcement learning. The author, a longtime artificial intelligence researcher specializing in natural-language processing, covers feed-forward neural nets, convolutional neural nets, word embeddings, recurrent neural nets, sequence-to-sequence learning, deep reinforcement learning, unsupervised models, and other fundamental concepts and techniques. Students and practitioners learn the basics of deep learning by working through programs in Tensorflow, an open-source machine learning framework. “I find I learn computer science material best by sitting down and writing programs,” the author writes, and the book reflects this approach.

Each chapter includes a programming project, exercises, and references for further reading. An early chapter is devoted to Tensorflow and its interface with Python, the widely used programming language. Familiarity with linear algebra, multivariate calculus, and probability and statistics is required, as is a rudimentary knowledge of programming in Python. The book can be used in both undergraduate and graduate courses; practitioners will find it an essential reference.

Jesús Rogel-Salazar ... 420 pages - Publisher: Chapman and Hall/CRC; (May, 2020) ... Language: English - AmazonSIN: B0883XB13B.

Advanced Data Science and Analytics with Python enables data scientists to continue developing their skills and apply them in business as well as academic settings. The subjects discussed in this book are complementary and a follow-up to the topics discussed in Data Science and Analytics with Python. The aim is to cover important advanced areas in data science using tools developed in Python such as SciKit-learn, Pandas, Numpy, Beautiful Soup, NLTK, NetworkX and others. The model development is supported by the use of frameworks such as Keras, TensorFlow and Core ML, as well as Swift for the development of iOS and MacOS applications.

Features: Targets readers with a background in programming, who are interested in the tools used in data analytics and data science + Uses Python throughout + Presents tools, alongside solved examples, with steps that the reader can easily reproduce and adapt to their needs + Focuses on the practical use of the tools rather than on lengthy explanations + Provides the reader with the opportunity to use the book whenever needed rather than following a sequential path.

The book can be read independently from the previous volume and each of the chapters in this volume is sufficiently independent from the others, providing flexibility for the reader. Each of the topics addressed in the book tackles the data science workflow from a practical perspective, concentrating on the process and results obtained. The implementation and deployment of trained models are central to the book. Time series analysis, natural language processing, topic modelling, social network analysis, neural networks and deep learning are comprehensively covered. The book discusses the need to develop data products and addresses the subject of bringing models to their intended audiences – in this case, literally to the users’ fingertips in the form of an iPhone app.

Kumar Molugaram, G. Shanker Rao ... 538 pages - Publisher: Butterworth-Heinemann; (March, 2017) ... Language: English - AmazonSIN: B06XFRF985.

Statistical Techniques for Transportation Engineering is written with a systematic approach in mind and covers a full range of data analysis topics, from the introductory level (basic probability, measures of dispersion, random variable, discrete and continuous distributions) through more generally used techniques (common statistical distributions, hypothesis testing), to advanced analysis and statistical modeling techniques (regression, AnoVa, and time series). The book also provides worked out examples and solved problems for a wide variety of transportation engineering challenges.

Demonstrates how to effectively interpret, summarize, and report transportation data using appropriate statistical descriptors + Teaches how to identify and apply appropriate analysis methods for transportation data + Explains how to evaluate transportation proposals and schemes with statistical rigor.

Contact Form

Name

Email *

Message *

Powered by Blogger.