Articles by "Data Analysis"

Wes McKinney ... 544 pages - Publisher: O'Reilly Media; 2nd edition (October, 2017) ... Language: English - ISBN-10: 1491957662 - ISBN-13: 978-1491957660 ... 

Get complete instructions for manipulating, processing, cleaning, and crunching datasets in Python. Updated for Python 3.6, the second edition of this hands-on guide is packed with practical case studies that show you how to solve a broad set of data analysis problems effectively. You’ll learn the latest versions of pandas, NumPy, IPython, and Jupyter in the process. Written by Wes McKinney, the creator of the Python pandas project, this book is a practical, modern introduction to data science tools in Python. It’s ideal for analysts new to Python and for Python programmers new to data science and scientific computing. Data files and related material are available on GitHub. Use the IPython shell and Jupyter notebook for exploratory computing * Learn basic and advanced features in NumPy (Numerical Python) * Get started with data analysis tools in the pandas library * Use flexible tools to load, clean, transform, merge, and reshape data * Create informative visualizations with matplotlib * Apply the pandas groupby facility to slice, dice, and summarize datasets * Analyze and manipulate regular and irregular time series data * Learn how to solve real-world data analysis problems with thorough, detailed examples.

Alan Agresti ... 400 pages - Publisher: Wiley-Interscience; 2nd edition (March, 2007) ... Language: English - ISBN-10: 0471226181 - ISBN-13: 978-0471226185 ...

The use of statistical methods for categorical data is ever increasing in today's world. An Introduction to Categorical Data Analysis, Second Edition provides an applied introduction to the most important methods for analyzing categorical data. This new edition summarizes methods that have long played a prominent role in data analysis, such as chi-squared tests, and also places special emphasis on logistic regression and other modeling techniques for univariate and correlated multivariate categorical responses. This Second Edition features: Two new chapters on the methods for clustered data, with an emphasis on generalized estimating equations (GEE) and random effects models * A unified perspective based on generalized linear models * An emphasis on logistic regression modeling * An appendix that demonstrates the use of SAS® for all methods * An entertaining historical perspective on the development of the methods * Specialized methods for ordinal data, small samples, multicategory data, and matched pairs * More than 100 analyses of real data sets and nearly 300 exercises. Written in an applied, nontechnical style, the book illustrates methods using a wide variety of real data, including medical clinical trials, drug use by teenagers, basketball shooting, horseshoe crab mating, environmental opinions, correlates of happiness, and much more. An Introduction to Categorical Data Analysis, Second Edition is an invaluable tool for social, behavioral, and biomedical scientists, as well as researchers in public health, marketing, education, biological and agricultural sciences, and industrial quality control.

Hector Guerrero ... 338 pages - Publisher: Springer; (March, 2010) ... Language: English - ISBN-10: 3642108342 - ISBN-13: 978-3642108341 ...

Why does the world need Excel Data Analysis, Modeling, and Simulation? When spreadsheets first became widely available in the early 1980s, they spawned a revolution in teaching. What previously could only be done with arcane software and large-scale computing was now available to the common man, on a desktop. Also, before spreadsheets, most substantial analytical work was done outside the classroom where the tools were; spreadsheets and personal computers moved the work into the classroom. Not only did this change how the analysis curriculum was taught, but it also empowered students to venture out on their own to explore new ways to use the tools. I can’t tell you how many phone calls, office visits, and/or emails I have received in my teaching career from ecstatic students crowing about what they have just done with a spreadsheet model. I have been teaching courses related to spreadsheet-based analysis and modeling for about 25 years and I have watched and participated in the spreadsheet revolution.

R. Lyman Ott, Michael T. Longnecker ... 1297 pages - Publisher: Brooks Cole; 6th edition (December, 2008) ... Language: English - ISBN-10: 0495017582 - ISBN-13: 978-0495017585 ...

Ott and Longnecker's AN INTRODUCTION TO STATISTICAL METHODS AND DATA ANALYSIS, Sixth Edition, provides a broad overview of statistical methods for advanced undergraduate and graduate students from a variety of disciplines who have little or no prior course work in statistics. The authors teach students to solve problems encountered in research projects, to make decisions based on data in general settings both within and beyond the university setting, and to become critical readers of statistical analyses in research papers and in news reports. The first eleven chapters present material typically covered in an introductory statistics course, as well as case studies and examples that are often encountered in undergraduate capstone courses. The remaining chapters cover regression modeling and design of experiments.

Gareth James, Daniela Witten, Trevor Hastie, Robert Tibshirani ... 426 pages - Publisher: Springer; (September, 2017) ... Language: English - ISBN-10: 1461471370 - ISBN-13: 978-1461471370.

An Introduction to Statistical Learning provides an accessible overview of the field of statistical learning, an essential toolset for making sense of the vast and complex data sets that have emerged in fields ranging from biology to finance to marketing to astrophysics in the past twenty years. This book presents some of the most important modeling and prediction techniques, along with relevant applications. Topics include linear regression, classification, resampling methods, shrinkage approaches, tree-based methods, support vector machines, clustering, and more. Color graphics and real-world examples are used to illustrate the methods presented. Since the goal of this textbook is to facilitate the use of these statistical learning techniques by practitioners in science, industry, and other fields, each chapter contains a tutorial on implementing the analyses and methods presented in R, an extremely popular open source statistical software platform. Two of the authors co-wrote The Elements of Statistical Learning (Hastie, Tibshirani and Friedman, 2nd edition 2009), a popular reference book for statistics and machine learning researchers. An Introduction to Statistical Learning covers many of the same topics, but at a level accessible to a much broader audience. This book is targeted at statisticians and non-statisticians alike who wish to use cutting-edge statistical learning techniques to analyze their data. The text assumes only a previous course in linear regression and no knowledge of matrix algebra.

Brian Steele, John Chandler, Swarna Reddy ... 430 pages - Publisher: Springer; 1st edition (January, 2017) ... Language: English - ISBN-10: 3319457950 - ISBN-13: 978-3319457956 ...

This textbook on practical data analytics unites fundamental principles, algorithms, and data. Algorithms are the keystone of data analytics and the focal point of this textbook. Clear and intuitive explanations of the mathematical and statistical foundations make the algorithms transparent. But practical data analytics requires more than just the foundations. Problems and data are enormously variable and only the most elementary of algorithms can be used without modification. Programming fluency and experience with real and challenging data are indispensable and so the reader is immersed in Python and R and real data analysis. By the end of the book, the reader will have gained the ability to adapt algorithms to new problems and carry out innovative analyses. This book has three parts: (a) Data Reduction: Begins with the concepts of data reduction, data maps, and information extraction. The second chapter introduces associative statistics, the mathematical foundation of scalable algorithms and distributed computing. Practical aspects of distributed computing are the subject of the Hadoop and MapReduce chapter. (b) Extracting Information from Data: Linear regression and data visualization are the principal topics of Part II. The authors dedicate a chapter to the critical domain of Healthcare Analytics for an extended example of practical data analytics. The algorithms and analytics will be of much interest to practitioners interested in utilizing the large and unwieldy data sets of the Centers for Disease Control and Prevention's Behavioral Risk Factor Surveillance System. (c) Predictive Analytics: Two foundational and widely used algorithms, k-nearest neighbors and naive Bayes, are developed in detail. A chapter is dedicated to forecasting. The last chapter focuses on streaming data and uses publicly accessible data streams originating from the Twitter API and the NASDAQ stock market in the tutorials.
This book is intended for a one- or two-semester course in data analytics for upper-division undergraduate and graduate students in mathematics, statistics, and computer science. The prerequisites are kept low, and students with one or two courses in probability or statistics, an exposure to vectors and matrices, and a programming course will have no difficulty. The core material of every chapter is accessible to all with these prerequisites. The chapters often expand at the close with innovations of interest to practitioners of data science. Each chapter includes exercises of varying levels of difficulty. The text is eminently suitable for self-study and an exceptional resource for practitioners.

Mohssen Mohammed, Muhammad Badruddin Khan, Eihab Bashier Mohammed Bashier ... 226 pages - Publisher: CRC Press; 1st edition (July, 2016) ... Language: English - ISBN-10: 1498705383 - ISBN-13: 978-1498705387 ...

Machine learning, one of the top emerging sciences, has an extremely broad range of applications. However, many books on the subject provide only a theoretical approach, making it difficult for a newcomer to grasp the subject material. This book provides a more practical approach by explaining the concepts of machine learning algorithms and describing the areas of application for each algorithm, using simple practical examples to demonstrate each algorithm and showing how different issues related to these algorithms are applied.

IBM SPSS Statistics v25 [Size: 722 MB] ... IBM SPSS Statistics Subscription delivers the power of SPSS Statistics predictive analytics capabilities with a flexible subscription payment option, including an easier purchasing, management and licensing experience. With the introduction of SPSS Statistics Subscription, organizations, groups and individuals can take advantage of predictive analytics capabilities to deliver the maximum amount of value to the user. Propel research & analysis with a fast and powerful solution: A comprehensive set of statistical tools - Work inside a single, integrated interface to run descriptive statistics, regression, advanced statistics and many more. Create publication ready charts, tables, and decision trees in one tool. - Integration with Open Source - Enhance the SPSS Syntax with R and Python through specialized extensions. Leverage the 130+ extensions available on our Extension Hub, or build your own and share with your peers to create a customized solution. - Easy statistical analysis - Use a simple drag and drop interface to access a wide range of capabilities and work across multiple data sources. Plus, flexible deployment options make purchasing and managing your software easy. Subscription Base: Data Preparation - IBM SPSS Data Preparation is a fully integrated module of IBM SPSS Statistics. - Bootstrapping - Bootstrapping is a useful technique for testing model stability. Custom Tables and Advanced Statistics Add-On: Advanced Statistics - Improve the accuracy of your analysis and the dependability of your conclusions with statistics designed for complex relationships. - Regression - IBM SPSS Regression enables you to predict categorical outcomes and apply various nonlinear regression procedures. - Custom Tables: IBM SPSS Custom Tables enables you to summarize SPSS Statistics data, and display your analyses as presentation-quality, production-ready tables.

Bradley P. Carlin, Thomas A. Louis ... 552 pages - Publisher: Chapman and Hall/CRC; 3rd edition (June, 2008) ... Language: English - ISBN-10: 1584886978 - ISBN-13: 978-1584886976 ...

Broadening its scope to nonstatisticians, Bayesian Methods for Data Analysis, Third Edition provides an accessible introduction to the foundations and applications of Bayesian analysis. Along with a complete reorganization of the material, this edition concentrates more on hierarchical Bayesian modeling as implemented via Markov chain Monte Carlo (MCMC) methods and related data analytic techniques. New to the Third Edition: New data examples, corresponding R and WinBUGS code, and homework problems - Explicit descriptions and illustrations of hierarchical modeling―now commonplace in Bayesian data analysis - A new chapter on Bayesian design that emphasizes Bayesian clinical trials - A completely revised and expanded section on ranking and histogram estimation - A new case study on infectious disease modeling and the 1918 flu epidemic - A solutions manual for qualifying instructors that contains solutions, computer code, and associated output for every homework problem―available both electronically and in print. Ideal for Anyone Performing Statistical Analyses: Focusing on applications from biostatistics, epidemiology, and medicine, this text builds on the popularity of its predecessors by making it suitable for even more practitioners and students.

Rao V. Dukkipati ... 680 pages - Publisher: Anshan Publishers; (October, 2010) ... Language: English - ISBN-10: 1904798675 - ISBN-13: 978-1848290433

This book presents an introduction to MATLAB and its applications in engineering problem solving. It is designed as an introductory course in MATLAB for engineers. The classical methods of electrical circuits, control systems, numerical methods, optimization, direct numerical integration methods, engineering mechanics, and mechanical vibrations are covered using MATLAB software. The numerous worked examples and unsolved exercise problems are intended to give the reader an awareness of MATLAB's general applicability to these areas. Contents: MATLAB Basics - Electrical Circuits - Control Systems - Numerical Methods - Optimization - Direct Numerical Integration Methods - Engineering Mechanics - Mechanical Vibration.

Steve Brooks, Andrew Gelman ... 619 pages - Publisher: Chapman and Hall/CRC; (May, 2011) ... Language: English - ISBN-10: 1420079417 - ISBN-13: 978-1420079418.

Since their popularization in the 1990s, Markov chain Monte Carlo (MCMC) methods have revolutionized statistical computing and have had an especially profound impact on the practice of Bayesian statistics. Furthermore, MCMC methods have enabled the development and use of intricate models in an astonishing array of disciplines as diverse as fisheries science and economics. The wide-ranging practical importance of MCMC has sparked an expansive and deep investigation into fundamental Markov chain theory. 

The Handbook of Markov Chain Monte Carlo provides a reference for the broad audience of developers and users of MCMC methodology interested in keeping up with cutting-edge theory and applications. The first half of the book covers MCMC foundations, methodology, and algorithms. The second half considers the use of MCMC in a variety of practical applications including in educational research, astrophysics, brain imaging, ecology, and sociology. The in-depth introductory section of the book allows graduate students and practicing scientists new to MCMC to become thoroughly acquainted with the basic theory, algorithms, and applications. The book supplies detailed examples and case studies of realistic scientific problems presenting the diversity of methods used by the wide-ranging MCMC community. Those familiar with MCMC methods will find this book a useful refresher of current theory and recent developments.

Ronald Christensen, Wesley Johnson, Adam Branscum, T. E. Hanson ... 516 pages - Publisher: CRC Press; 1st edition (July, 2010) - Language: English - ISBN-10: 1439803544 - ISBN-13: 978-1439803547 ...

Emphasizing the use of WinBUGS and R to analyze real data, Bayesian Ideas and Data Analysis: An Introduction for Scientists and Statisticians presents statistical tools to address scientific questions. It highlights foundational issues in statistics, the importance of making accurate predictions, and the need for scientists and statisticians to collaborate in analyzing data. The WinBUGS code provided offers a convenient platform to model and analyze a wide range of data. The first five chapters of the book contain core material that spans basic Bayesian ideas, calculations, and inference, including modeling one and two sample data from traditional sampling models. The text then covers Monte Carlo methods, such as Markov chain Monte Carlo (MCMC) simulation. After discussing linear structures in regression, it presents binomial regression, normal regression, analysis of variance, and Poisson regression, before extending these methods to handle correlated data. The authors also examine survival analysis and binary diagnostic testing. A complementary chapter on diagnostic testing for continuous outcomes is available on the book’s website. The last chapter on nonparametric inference explores density estimation and flexible regression modeling of mean functions. The appropriate statistical analysis of data involves a collaborative effort between scientists and statisticians. Exemplifying this approach, Bayesian Ideas and Data Analysis focuses on the necessary tools and concepts for modeling and analyzing scientific data.

Daniel Graupe ... 500 pages - Publisher: World Scientific Publishing; 3rd edition (November 19, 2013) ... Language: English - ISBN-10: 9814522732 - ISBN-13: 978-9814522731 ...

Artificial neural networks are most suitable for solving problems that are complex, ill-defined, highly nonlinear, of many and different variables, and/or stochastic. Such problems are abundant in medicine, in finance, in security and beyond. This volume covers the basic theory and architecture of the major artificial neural networks. Uniquely, it presents 18 complete case studies of applications of neural networks in various fields, ranging from cell-shape classification to micro-trading in finance and to constellation recognition, all with their respective source code. These case studies demonstrate to readers in detail how such applications are designed and executed and how their specific results are obtained. The book is written for a one-semester graduate or senior-level undergraduate course on artificial neural networks.

Karl Hinderer, Ulrich Rieder, Michael Stieglitz ... 530 pages - Publisher: Springer; 1st edition (January, 2017) ... Language: English - ISBN-10: 3319488139 - ISBN-13: 978-3319488134 ...

This book explores discrete-time dynamic optimization and provides a detailed introduction to both deterministic and stochastic models. Covering problems with finite and infinite horizon, as well as Markov renewal programs, Bayesian control models and partially observable processes, the book focuses on the precise modelling of applications in a variety of areas, including operations research, computer science, mathematics, statistics, engineering, economics and finance. Dynamic Optimization is a carefully presented textbook which starts with discrete-time deterministic dynamic optimization problems, providing readers with the tools for sequential decision-making, before proceeding to the more complicated stochastic models. The authors present complete and simple proofs and illustrate the main results with numerous examples and exercises (without solutions). With relevant material covered in four appendices, this book is completely self-contained.

Joseph P. Bigus ... 220 pages - Publisher: McGraw-Hill; (May, 1996) ... Language: English - ISBN-10: 0070057796 - ISBN-13: 978-0070057791 ...

Readers will find concrete implementation strategies, reinforced with real-world business examples and a minimum of formulas, and case studies drawn from a broad range of industries. The book illustrates the popular data mining functions of classification, clustering, modeling, and time-series forecasting--through examples developed using the IBM Neural Network Utility.

Paul Attewell, David Monaghan ... 264 pages - Publisher: Univ. of California Press; (May, 2015) ... Language: English - ISBN-10: 0520280989 - ISBN-13: 978-0520280984 ...

We live in a world of big data: the amount of information collected on human behavior each day is staggering, and exponentially greater than at any time in the past. Additionally, powerful algorithms are capable of churning through seas of data to uncover patterns. Providing a simple and accessible introduction to data mining, Paul Attewell and David B. Monaghan discuss how data mining substantially differs from conventional statistical modeling familiar to most social scientists. The authors also empower social scientists to tap into these new resources and incorporate data mining methodologies in their analytical toolkits. Data Mining for the Social Sciences demystifies the process by describing the diverse set of techniques available, discussing the strengths and weaknesses of various approaches, and giving practical demonstrations of how to carry out analyses using tools in various statistical software packages.

Claus Thorn Ekstrom, Helle Sørensen ... 526 pages - Publisher: Chapman and Hall/CRC; 2nd edition (November, 2014) ... Language: English - ISBN-10: 1482238934 - ISBN-13: 978-1482238938 ...

A Hands-On Approach to Teaching Introductory Statistics: Expanded with over 100 more pages, Introduction to Statistical Data Analysis for the Life Sciences, Second Edition presents the right balance of data examples, statistical theory, and computing to teach introductory statistics to students in the life sciences. This popular textbook covers the mathematics underlying classical statistical analysis, the modeling aspects of statistical analysis and the biological interpretation of results, and the application of statistical software in analyzing real-world problems and datasets. New to the Second Edition: A new chapter on non-linear regression models + A new chapter that contains examples of complete data analyses, illustrating how a full-fledged statistical analysis is undertaken + Additional exercises in most chapters + A summary of statistical formulas related to the specific designs used to teach the statistical concepts. This text provides a computational toolbox that enables students to analyze real datasets and gain the confidence and skills to undertake more sophisticated analyses. Although accessible with any statistical software, the text encourages a reliance on R. For those new to R, an introduction to the software is available in an appendix. The book also includes end-of-chapter exercises as well as an entire chapter of case exercises that help students apply their knowledge to larger datasets and learn more about approaches specific to the life sciences.

Paul Murrell ... 418 pages - Publisher: Chapman and Hall/CRC; 1st edition (February, 2009) ... Language: English - ISBN-10: 1420065173 - ISBN-13: 978-1420065176 ...

Providing key information on how to work with research data, Introduction to Data Technologies presents ideas and techniques for performing critical, behind-the-scenes tasks that take up so much time and effort yet typically receive little attention in formal education. With a focus on computational tools, the book shows readers how to improve their awareness of what tasks can be achieved and describes the correct approach to perform these tasks. Practical examples demonstrate the most important points: The author first discusses how to write computer code using HTML as a concrete example. He then covers a variety of data storage topics, including different file formats, XML, and the structure and design issues of relational databases. After illustrating how to extract data from a relational database using SQL, the book presents tools and techniques for searching, sorting, tabulating, and manipulating data. It also introduces some very basic programming concepts as well as the R language for statistical computing. Each of these topics has supporting chapters that offer reference material on HTML, CSS, XML, DTD, SQL, R, and regular expressions. One-stop shop of introductory computing information: Written by a member of the R Development Core Team, this resource shows readers how to apply data technologies to tasks within a research setting. Collecting material otherwise scattered across many books and the web, it explores how to publish information via the web, how to access information stored in different formats, and how to write small programs to automate simple, repetitive tasks.

Nazmul Siddique, Hojjat Adeli ... 532 pages - Publisher: Wiley; 1st edition (May, 2013) ... Language: English - ISBN-10: 1118337840 - ISBN-13: 978-1118337844 ...

Computational Intelligence: Synergies of Fuzzy Logic, Neural Networks and Evolutionary Computing presents an introduction to some of the cutting-edge technological paradigms under the umbrella of computational intelligence. Computational intelligence schemes are investigated with the development of a suitable framework for fuzzy logic, neural networks and evolutionary computing, neuro-fuzzy systems, evolutionary-fuzzy systems and evolutionary neural systems. Applications to linear and non-linear systems are discussed with examples. Key features: Covers all the aspects of fuzzy, neural and evolutionary approaches with worked-out examples, MATLAB® exercises and applications in each chapter - Presents the synergies of technologies of computational intelligence such as evolutionary-fuzzy, neural-fuzzy, and evolutionary-neural systems - Considers real-world problems in the domain of systems modelling, control and optimization - Contains a foreword written by Lotfi Zadeh. Computational Intelligence: Synergies of Fuzzy Logic, Neural Networks and Evolutionary Computing is an ideal text for final-year undergraduate, postgraduate and research students in electrical, control, computer, industrial and manufacturing engineering.
