Articles by "Data Analysis"

David L. Olson ... 173 pages - Publisher: Business Expert Press; (June, 2016) ... Language: English - ISBN-10: 1631575481 - ISBN-13: 978-1631575488 ...

Data mining has become the fastest growing topic of interest in business programs in the past decade. This book is intended to describe the benefits of data mining in business, the process and typical business applications, and the workings of basic data mining models, and to demonstrate each with widely available free software. The book focuses on demonstrating common business data mining applications. It provides exposure to the data mining process, including problem identification, data management, and available modeling tools. The book takes the approach of demonstrating typical business data sets with open source software. KNIME is a very easy-to-use tool, and is used as the primary means of demonstration. R is much more powerful and is a commercially viable data mining tool. We also demonstrate WEKA, which is highly useful academic software, although it is difficult to manipulate test sets and new cases, making it problematic for commercial use.

Nilanjan Dey ... 335 pages - Publisher: IGI Global; 1st edition (November, 2017) ... Language: English - ISBN-10: 1522541519 - ISBN-13: 978-1522541516 ...

Metaheuristic algorithms are present in various applications for different domains. Recently, researchers have conducted studies on the effectiveness of these algorithms in providing optimal solutions to complicated problems. Advancements in Applied Metaheuristic Computing is a crucial reference source for the latest empirical research on methods and approaches that include metaheuristics for further system improvements, and it offers outcomes of employing optimization algorithms. Featuring coverage on a broad range of topics such as manufacturing, genetic programming, and medical imaging, this publication is ideal for researchers, academicians, advanced-level students, and technology developers seeking current research on the use of optimization algorithms in several applications.

Rob J. Hyndman, George Athanasopoulos ... 382 pages - Publisher: OTexts; 2nd edition (May, 2018) ... Language: English - ISBN-10: 0987507117 - ISBN-13: 978-0987507112 ...

Forecasting is required in many situations. Deciding whether to build another power generation plant in the next five years requires forecasts of future demand. Scheduling staff in a call centre next week requires forecasts of call volumes. Stocking an inventory requires forecasts of stock requirements. Telecommunication routing requires traffic forecasts a few minutes ahead. Whatever the circumstances or time horizons involved, forecasting is an important aid in effective and efficient planning. This textbook provides a comprehensive introduction to forecasting methods and presents enough information about each method for readers to use them sensibly. Examples use R with many data sets taken from the authors' own consulting experience. In this second edition, all chapters have been updated to cover the latest research and forecasting methods. Three new chapters have been added on dynamic regression forecasting, hierarchical forecasting and practical forecasting issues.

Xiaofeng Wang, Yu Ryan Yue, Julian J. Faraway ... 324 pages - Publisher: Chapman and Hall/CRC; Language: English - ISBN-10: 1498727255 - ISBN-13: 978-1498727259 ...

INLA stands for Integrated Nested Laplace Approximations, which is a new method for fitting a broad class of Bayesian regression models. No samples of the posterior marginal distributions need to be drawn using INLA, so it is a computationally convenient alternative to Markov chain Monte Carlo (MCMC), the standard tool for Bayesian inference. Bayesian Regression Modeling with INLA covers a wide range of modern regression models and focuses on the INLA technique for building Bayesian models using real-world data and assessing their validity. A key theme throughout the book is that it makes sense to demonstrate the interplay of theory and practice with reproducible studies. Complete R commands are provided for each example, and a supporting website holds all of the data described in the book. An R package including the data and additional functions in the book is available to download. The book is aimed at readers who have a basic knowledge of statistical theory and Bayesian methodology. It gets readers up to date on the latest in Bayesian inference using INLA and prepares them for sophisticated, real-world work.

National Research University Higher School of Economics [Size: 1.26 GB] ... The goal of this course is to give learners a basic understanding of modern neural networks and their applications in computer vision and natural language understanding. The course starts with a recap of linear models and a discussion of the stochastic optimization methods that are crucial for training deep neural networks. Learners will study all the popular building blocks of neural networks, including fully connected, convolutional, and recurrent layers. Learners will use these building blocks to define complex modern architectures in the TensorFlow and Keras frameworks. In the course project, learners will implement a deep neural network for image captioning, the task of producing a text description for an input image. The prerequisites for this course are: 1) Basic knowledge of Python. 2) Basic linear algebra and probability. Please note that this is an advanced course and we assume basic knowledge of machine learning. You should understand: 1) Linear regression: mean squared error, analytical solution. 2) Logistic regression: model, cross-entropy loss, class probability estimation. 3) Gradient descent for linear models. Derivatives of MSE and cross-entropy loss functions. 4) The problem of overfitting. 5) Regularization for linear models. Who is this class for: Developers, analysts and researchers who are faced with tasks involving complex structure understanding such as image, sound and text analysis.

Brian Hahn, Daniel Valentine ... 424 pages - Publisher: Academic Press; 5th edition (January, 2013) ... Language: English - ISBN-10: 0123943981 - ISBN-13: 978-0123943989 ...

The fifth edition of Essential MATLAB for Engineers and Scientists provides a concise, balanced overview of MATLAB's functionality that facilitates independent learning, with coverage of both the fundamentals and applications. The essentials of MATLAB are illustrated throughout, featuring complete coverage of the software's windows and menus. Program design and algorithm development are presented clearly and intuitively, along with many examples from a wide range of familiar scientific and engineering areas. This is an ideal book for a first course on MATLAB or for an engineering problem-solving course using MATLAB, as well as a self-learning tutorial for professionals and students expected to learn and apply MATLAB. Updated with the features of MATLAB R2012b + Expanded discussion of writing functions and scripts + Revised and expanded Part II: Applications + Expanded section on GUIs + More exercises and examples throughout + Companion website for students providing M-files used within the book and selected solutions to end-of-chapter problems.

The topics in this course come from an analysis of real requirements in data scientist job listings from the biggest tech employers. We'll cover the machine learning, AI, and data mining techniques real employers are looking for, including: Deep Learning / Neural Networks (MLPs, CNNs, RNNs) with TensorFlow and Keras + Sentiment analysis + Image recognition and classification + Regression analysis + K-Means Clustering + Principal Component Analysis + Train/Test and cross validation + Bayesian Methods + Decision Trees and Random Forests + Multivariate Regression + Multi-Level Models + Support Vector Machines + Reinforcement Learning + Collaborative Filtering + K-Nearest Neighbor + Bias/Variance Tradeoff + Ensemble Learning + Term Frequency / Inverse Document Frequency + Experimental Design and A/B Tests.

Machine Learning, Classification and Algorithms using MATLAB: Learn to Implement Classification Algorithms in One of the Most Powerful Tools Used by Scientists and Engineers.

This course is designed to cover one of the most interesting areas of machine learning: classification. I will take you step-by-step through this course and will first cover the basics of MATLAB. Following that, we will look into the details of how to use different machine learning algorithms in MATLAB. Specifically, we will be looking at the MATLAB toolbox called Statistics and Machine Learning Toolbox. We will implement some of the most commonly used classification algorithms, such as K-Nearest Neighbor, Naive Bayes, Discriminant Analysis, Decision Trees, Support Vector Machines, Error Correcting Output Codes and Ensembles. Following that, we will look at how to cross-validate these models and how to evaluate their performance. Intuition into the classification algorithms is also included so that a person with no mathematical background can still comprehend the essential ideas. The course outline is as follows. Segment 1: Grabbing and Importing Dataset + Segment 2: K-Nearest Neighbor + Segment 3: Naive Bayes + Segment 4: Decision Trees + Segment 5: Discriminant Analysis + Segment 6: Support Vector Machines + Segment 7: Error Correcting Output Codes + Segment 8: Classification with Ensembles + Segment 9: Validation Methods + Segment 10: Evaluating Performance.

Conrad Carlberg ... 426 pages - Publisher: Que Publishing; (April, 2011) ... Language: English - ASIN: B0054RF7YI by Amazon

Top Excel guru Conrad Carlberg shows how to use Excel 2010 to perform the core statistical tasks every business professional, student, and researcher should master. Using real-world examples, Carlberg helps you choose the right technique for each problem and get the most out of Excel’s statistical features, including its new consistency functions. Along the way, you discover the most effective ways to use correlation and regression and analysis of variance and covariance. You see how to use Excel to test statistical hypotheses using the normal, binomial, t and F distributions. Becoming an expert with Excel statistics has never been easier! You’ll find crystal-clear instructions, insider insights, and complete step-by-step projects—all complemented by an extensive set of web-based resources. • Master Excel’s most useful descriptive and inferential statistical tools. • Tell the truth with statistics, and recognize when others don’t. • Accurately summarize sets of values. • View how values cluster and disperse. • Infer a population’s characteristics from a sample’s frequency distribution. • Explore correlation and regression to learn how variables move in tandem. • Understand Excel’s new consistency functions. • Test differences between two means using z tests, t tests, and Excel’s Data Analysis Add-in. • Use ANOVA and ANCOVA to test differences between more than two means. • Explore statistical power by manipulating mean differences, standard errors, directionality, and alpha.

Douglas C. Montgomery, Elizabeth A. Peck, G. Geoffrey Vining ... 672 pages - Publisher: Wiley; 5th edition (April, 2012) ... Language: English - ISBN-10: 0470542810 - ISBN-13: 978-0470542811 ...

A comprehensive and up-to-date introduction to the fundamentals of regression analysis: Introduction to Linear Regression Analysis, Fifth Edition continues to present both the conventional and less common uses of linear regression in today’s cutting-edge scientific research. The authors blend both theory and application to equip readers with an understanding of the basic principles needed to apply regression model-building techniques in various fields of study, including engineering, management, and the health sciences. Following a general introduction to regression modeling, including typical applications, a host of technical tools are outlined such as basic inference procedures, introductory aspects of model adequacy checking, and polynomial regression models and their variations. The book then discusses how transformations and weighted least squares can be used to resolve problems of model inadequacy and also how to deal with influential observations. The Fifth Edition features numerous newly added topics, including: * A chapter on regression analysis of time series data that presents the Durbin-Watson test and other techniques for detecting autocorrelation as well as parameter estimation in time series regression models. * Regression models with random effects in addition to a discussion on subsampling and the importance of the mixed model. * Tests on individual regression coefficients and subsets of coefficients. * Examples of current uses of simple linear regression models and the use of multiple regression models for understanding patient satisfaction data. In addition to Minitab, SAS, and S-PLUS, the authors have incorporated JMP and the freely available R software to illustrate the discussed techniques and procedures in this new edition. Numerous exercises have been added throughout, allowing readers to test their understanding of the material.
Introduction to Linear Regression Analysis, Fifth Edition is an excellent book for statistics and engineering courses on regression at the upper-undergraduate and graduate levels. The book also serves as a valuable, robust resource for professionals in the fields of engineering, life and biological sciences, and the social sciences.

Luca Massaron ... 312 pages - Publisher: Packt Publishing; (February, 2016) ... Language: English - ISBN-10: 1785286315 - ISBN-13: 978-1785286315 ...

Regression is the process of learning relationships between inputs and continuous outputs from example data, which enables predictions for novel inputs. There are many kinds of regression algorithms, and the aim of this book is to explain which is the right one to use for each set of problems and how to prepare real-world data for it. With this book you will learn to define a simple regression problem and evaluate its performance. The book will help you understand how to properly parse a dataset, clean it, and create an output matrix optimally built for regression. You will begin with a simple regression algorithm to solve some data science problems and then progress to more complex algorithms. The book will enable you to use regression models to predict outcomes and make critical business decisions. Through the book, you will gain the knowledge to use Python for building fast, better linear models and to apply the results in Python or in any computer language you prefer.
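
As a rough illustration of the first sentence of the blurb above (learning a relationship from example data, then predicting for a new input), here is a minimal sketch of simple least-squares regression in plain Python. It is not taken from the book; the function names `fit_line` and `predict` and the toy data are assumptions made up for this example.

```python
def fit_line(xs, ys):
    """Fit y = intercept + slope * x by ordinary least squares (one input)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of squared deviations of x, and of cross-deviations of x and y.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


def predict(model, x):
    """Predict the continuous output for a new input x."""
    intercept, slope = model
    return intercept + slope * x


# Toy example data (hypothetical): generated from y = 1 + 2x with no noise.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0, 5.0, 7.0, 9.0]

model = fit_line(xs, ys)
print(model)                # fitted (intercept, slope)
print(predict(model, 5.0))  # prediction for an input not seen during fitting
```

With real, noisy data the fitted line will not pass through every point, and a library such as scikit-learn or statsmodels would normally be used in place of hand-written formulas.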

Andrew Gelman, Jennifer Hill ... 648 pages - Publisher: Cambridge Univ. Press; (December, 2006) ... Language: English - ISBN-10: 052168689X - ISBN-13: 978-0521686891 ...

Data Analysis Using Regression and Multilevel/Hierarchical Models is a comprehensive manual for the applied researcher who wants to perform data analysis using linear and nonlinear regression and multilevel models. The book introduces a wide variety of models, whilst at the same time instructing the reader in how to fit these models using available software packages. The book illustrates the concepts by working through scores of real data examples that have arisen from the authors' own applied research, with programming codes provided for each one. Topics covered include causal inference, including regression, poststratification, matching, regression discontinuity, and instrumental variables, as well as multilevel logistic regression and missing-data imputation. Practical tips regarding building, fitting, and understanding are provided throughout.

Stata Software for Statistics and Data Science v15 [Size: 295.5 MB] ... Stata is a general-purpose statistical software package created in 1985 by StataCorp. Most of its users work in research, especially in the fields of economics, sociology, political science, biomedicine and epidemiology. Stata's capabilities include data management, statistical analysis, graphics, simulations, regression analysis (linear and multiple), and custom programming. The name Stata is a portmanteau of the words statistics and data. The correct English pronunciation of Stata "must remain a mystery"; any of "Stay-ta", "Sta-ta" or "Stah-ta" is considered acceptable.

Features: Linear models: regression • censored outcomes • endogenous regressors • bootstrap, jackknife, and robust and cluster–robust variance • instrumental variables • three-stage least squares • constraints • quantile regression • GLS • more. Panel/longitudinal data: random and fixed effects with robust standard errors • linear mixed models • random-effects probit • GEE • random- and fixed-effects Poisson • dynamic panel-data models • instrumental variables • panel unit-root tests • more. Multilevel mixed-effects models: continuous, binary, count, and survival outcomes • two-, three-, and higher-level models • generalized linear models • nonlinear models • random intercepts • random slopes • crossed random effects • BLUPs of effects and fitted values • hierarchical models • residual error structures • DDF adjustments • support for survey data • more. Binary, count, and limited outcomes: logistic, probit, tobit • Poisson and negative binomial • conditional, multinomial, nested, ordered, rank-ordered, and stereotype logistic • multinomial probit • zero-inflated and left-truncated count models • selection models • marginal effects • more. Extended regression models (ERMs): combine endogenous covariates, sample selection, and nonrandom treatment in models for continuous, interval-censored, binary, and ordinal outcomes • more.

Johan A. K. Suykens, Marco Signoretto ... 525 pages - Publisher: Chapman and Hall/CRC; (October, 2014) ... Language: English - ISBN-10: 1482241390 - ISBN-13: 978-1482241396 ...

Regularization, Optimization, Kernels, and Support Vector Machines offers a snapshot of the current state of the art of large-scale machine learning, providing a single multidisciplinary source for the latest research and advances in regularization, sparsity, compressed sensing, convex and large-scale optimization, kernel methods, and support vector machines. Consisting of 21 chapters authored by leading researchers in machine learning, this comprehensive reference: Covers the relationship between support vector machines (SVMs) and the Lasso * Discusses multi-layer SVMs * Explores nonparametric feature selection, basis pursuit methods, and robust compressive sensing * Describes graph-based regularization methods for single- and multi-task learning * Considers regularized methods for dictionary learning and portfolio selection * Addresses non-negative matrix factorization * Examines low-rank matrix and tensor-based models * Presents advanced kernel methods for batch and online machine learning, system identification, domain adaptation, and image processing * Tackles large-scale algorithms including conditional gradient methods, (non-convex) proximal techniques, and stochastic gradient descent. Regularization, Optimization, Kernels, and Support Vector Machines is ideal for researchers in machine learning, pattern recognition, data mining, signal processing, statistical learning, and related areas.

Jian Qing Shi, Taeryon Choi ... 216 pages - Publisher: CRC; (July, 2011) ... Language: English - ISBN-10: 1439837732 - ISBN-13: 978-1439837733 ...

Gaussian Process Regression Analysis for Functional Data presents nonparametric statistical methods for functional regression analysis, specifically the methods based on a Gaussian process prior in a functional space. The authors focus on problems involving functional response variables and mixed covariates of functional and scalar variables. Covering the basics of Gaussian process regression, the first several chapters discuss functional data analysis, theoretical aspects based on the asymptotic properties of Gaussian process regression models, and new methodological developments for high dimensional data and variable selection. The remainder of the text explores advanced topics of functional regression analysis, including novel nonparametric statistical methods for curve prediction, curve clustering, functional ANOVA, and functional regression analysis of batch data, repeated curves, and non-Gaussian data. Many flexible models based on Gaussian processes provide efficient ways of model learning, interpreting model structure, and carrying out inference, particularly when dealing with large dimensional functional data. This book shows how to use these Gaussian process regression models in the analysis of functional data. Some MATLAB® and C codes are available on the first author’s website.

Geoff Der, Brian S. Everitt ... 246 pages - Publisher: SAS Institute; (December, 2015) ... Language: English - ISBN-10: 1629598437 - ISBN-13: 978-1629598437

Students and instructors of statistics courses using SAS University Edition will welcome this book. Learning fundamental statistics is essential to solving problems with SAS. Essential Statistics Using SAS University Edition demonstrates how to use SAS University Edition to apply a variety of statistical methodologies, from the simple to the not-so-simple, to a range of data sets. Learn how to apply the appropriate statistical method to answer a particular question about a data set, and correctly interpret the numerical results that you obtain. SAS University Edition users who are new to SAS or who need a refresher course will benefit from the statistics overview and topics, such as multiple linear regression, logistic regression, and Poisson regression.

Sabine Landau, Brian S. Everitt ... 366 pages - Publisher: Chapman and Hall/CRC; 1st edition (November, 2003) ... Language: English - ISBN-10: 1584883693 - ISBN-13: 978-1584883692 ...

A Handbook of Statistical Analyses Using SPSS clearly describes how to conduct a range of univariate and multivariate statistical analyses using the latest version of the Statistical Package for the Social Sciences, SPSS 11. Each chapter addresses a different type of analytical procedure applied to one or more data sets, primarily from the social and behavioral sciences areas. Each chapter also contains exercises relating to the data sets introduced, providing readers with a means to develop both their SPSS and statistical skills. Model answers to the exercises are also provided. Readers can download all of the data sets from a companion Web site furnished by the authors.

Samprit Chatterjee, Ali S. Hadi ... 424 pages - Publisher: Wiley; 5th edition (September, 2012) ... Language: English - ISBN-10: 0470905840 - ISBN-13: 978-0470905845 ...

Regression analysis is a conceptually simple method for investigating relationships among variables. Carrying out a successful application of regression analysis, however, requires a balance of theoretical results, empirical rules, and subjective judgment. Regression Analysis by Example, Fifth Edition has been expanded and thoroughly updated to reflect recent advances in the field. The emphasis continues to be on exploratory data analysis rather than statistical theory. The book offers in-depth treatment of regression diagnostics, transformation, multicollinearity, logistic regression, and robust regression. The book now includes a new chapter on the detection and correction of multicollinearity, while also showcasing the use of the discussed methods on newly added data sets from the fields of engineering, medicine, and business. The Fifth Edition also explores additional topics, including: Surrogate ridge regression * Fitting nonlinear models * Errors in variables * ANOVA for designed experiments. Methods of regression analysis are clearly demonstrated, and examples containing the types of irregularities commonly encountered in the real world are provided. Each example isolates one or two techniques and features detailed discussions, the required assumptions, and the evaluated success of each technique. Additionally, methods described throughout the book can be carried out with most of the currently available statistical software packages, such as the software package R.

Mohammed M. Alani, Hissam Tawfik, Mohammed Saeed, Obinna Anya ... 214 pages - Publisher: Springer; 1st edition (July, 2018) ... Language: English - ASIN: B07FP1J69L by Amazon ...

This timely text/reference reviews the state of the art of big data analytics, with a particular focus on practical applications. An authoritative selection of leading international researchers present detailed analyses of existing trends for storing and analyzing big data, together with valuable insights into the challenges inherent in current approaches and systems. This is further supported by real-world examples drawn from a broad range of application areas, including healthcare, education, and disaster management. The text also covers, typically from an application-oriented perspective, advances in data science in such areas as big data collection, searching, analysis, and knowledge discovery.

Miguel F. Acevedo ... 557 pages - Publisher: CRC Press; 1st edition (December, 2012) ... Language: English - ASIN: B00AR3E3C6 by Amazon ...

Providing a solid foundation for twenty-first-century scientists and engineers, Data Analysis and Statistics for Geography, Environmental Science, and Engineering guides readers in learning quantitative methodology, including how to implement data analysis methods using open-source software. Given the importance of interdisciplinary work in sustainability, the book brings together principles of statistics and probability, multivariate analysis, and spatial analysis methods applicable across a variety of science and engineering disciplines. Each chapter starts with conceptual and theoretical material to give a firm foundation in how the methods work. Examples and exercises illustrate the applications and demonstrate how to go from concepts to problem solving. Hands-on computer sessions allow students to grasp the practical implications and learn by doing. Throughout, the computer examples and exercises use seeg and Rcmdr Plugin.seeg, open-source R packages developed by the author, which help students acquire the skills to implement and conduct analysis and to analyze the results. This self-contained book offers a unified presentation of data analysis methods for more effective problem solving. With clear, easy-to-follow explanations, the book helps students to develop a solid understanding of basic statistical analysis and prepares them for learning the more advanced and specialized methods they will need in their work.
