Monday, 13 July 2015

Sampling Design and Analysis_Full

Sampling Design and Analysis_Full



Surveys and samples sometimes seem to surround you. Many give valuable information;

some, unfortunately, are so poorly conceived and implemented that it would

be better for science and society if they were simply not done. This book gives you

guidance on how to tell when a sample is valid or not, and how to design and analyze

many different forms of sample surveys.

The book concentrates on the statistical aspects of taking and analyzing a sample.

How to design and pretest a questionnaire, construct a sampling frame, and train field

investigators are all important issues, but are not treated comprehensively in this hook.

I have written the book to be accessible to a wide audience, and to allow flexibility

in choosing topics to be read. To read most of Chapters 1 through 6, you need to be

familiar with basic ideas of expectation, sampling distributions, confidence intervals,

and linear regression-material covered in most introductory statistics classes. These

chapters cover the basic sampling designs of simple random sampling, stratification,

and cluster sampling with equal and unequal probabilities of selection. The optional

sections on the statistical theory for these designs are marked with asterisks-these

sections require you to be familiar with calculus or mathematical statistics. Appendix

B gives a review of probability concepts used in the theory of probability sampling.

Chapters 7 through 12 discuss issues not found in many other sampling textbooks:

how to analyze complex surveys such as those administered by the United States

Bureau of the Census or by Statistics Canada, different approaches to analyzing

sample surveys, what to do if there is nonresponse, and how to perform chi-squared

tests and regression analyses using data from complex surveys. The National Crime

Victimization Survey is discussed in detail as an example of a complex survey. Since

many of the formulas used to find standard errors in simpler sampling designs are

difficult to implement in complex samples, computer-intensive methods are discussed

for estimating the variances.


Introductory Statistics with R

Introductory Statistics with R



R is a statistical computer program made available through the Internet under the General Public License (GPL). That is, it is supplied with a license that allows you to use it freely, distribute it, or even sell it, as long as the receiver has the same rights and the source code is freely available. It exists for Microsoft Windows XP or later, for a variety of Unix and Linux platforms, and for Apple Macintosh OS X. R provides an environment in which you can perform statistical analysis and produce graphics. It is actually a complete programming language, although that is only marginally described in this book. Here we content ourselves with learning the elementary concepts and seeing a number of cookbook examples. R is designed in such a way that it is always possible to do further computations on the results of a statistical procedure. Furthermore, the design for graphical presentation of data allows both no-nonsense methods, for example plot(x,y), and the possibility of fine-grained control of the output’s appearance. The fact that R is based on a formal computer language gives it tremendous flexibility. Other systems present simpler interfaces in terms of menus and forms, but often the apparent userfriendliness turns into a hindrance in the longer run. Although elementary statistics is often presented as a collection of fixed procedures, analysis of moderately complex data requires ad hoc statistical model building, which makes the added flexibility of R highly desirable. R owes its name to typical Internet humour. You may be familiar with the programming language C (whose name is a story in itself). Inspired by this, Becker and Chambers chose in the early 1980s to call their newly developed statistical programming language S. This language was further developed into the commercial product S-PLUS, which by the end of the decade was in widespread use among statisticians of all kinds. Ross Ihaka and Robert Gentleman from the University of Auckland, New Zealand, chose to write a reduced version of S for teaching purposes, and what was more natural than choosing the immediately preceding letter? Ross’ and Robert’s initials may also have played a role. In 1995, Martin Maechler persuaded Ross and Robert to release the source code for R under the GPL. This coincided with the upsurge in Open Source software spurred by the Linux system. R soon turned out to fill a gap for people like me who intended to use Linux for statistical computing but had no statistical package available at the time. A mailing list was set up for the communication of bug reports and discussions of the development of R. - See more at: http://www.aazzhosting.com/books/introductory-statistics-with-r/#sthash.XsVJlsIe.dpuf

Applied Probability

Applied Probability

Preface to Pfei er Applied Probability1 The course This is a “ rst course” in the sense that it presumes no previous course in probability. The units are modules taken from the unpublished text: Paul E. Pfei er, ELEMENTS OF APPLIED PROBABILITY, USING MATLAB. The units are numbered as they appear in the text, although of course they may be used in any desired order. For those who wish to use the order of the text, an outline is provided, with indication of which modules contain the material. The mathematical prerequisites are ordinary calculus and the elements of matrix algebra. 

A few standard series and integrals are used, and double integrals are evaluated as iterated integrals. The reader who can evaluate simple integrals can learn quickly from the examples how to deal with the iterated integrals used in the theory of expectation and conditional expectation. Appendix B (Section 17.2) provides a convenient compendium of mathematical facts used frequently in this work. And the symbolic toolbox, implementing MAPLE, may be used to evaluate integrals, if desired. In addition to an introduction to the essential features of basic probability in terms of a precise mathematical model, the work describes and employs user de ned MATLAB procedures and functions (which we refer to as m-programs, or simply programs) to solve many important problems in basic probability. This should make the work useful as a stand alone exposition as well as a supplement to any of several current textbooks. - See more at: http://www.aazzhosting.com/books/applied-probability/#sthash.O0lbmGMu.dpuf

Probability for Dummies

Probability for Dummies









































This book is organized into five major parts that explore the main topic areas
in probability. I also include a part that offers a couple quick top-ten references
for you to use. Each part contains chapters that break down each major
objective into understandable pieces.
Part I: The Certainty of Uncertainty:
Probability Basics
This part gives you the fundamentals of probability, along with strageties for
setting up and solving the most common probability problems in the introductory
course. It starts by introducing probability as a topic that has an impact
on all of us every day and underscores the point that probability often goes
against our intuition. You discover the basic definitions, terms, notation, and
rules for probability, and you get answers to those all-important (and often
frustrating) questions that perplex students of probability, such as, “What’s
the real difference between independent and mutually exclusive events?”
You also see different methods for organizing the information given to you,
including Venn diagrams, tree diagrams, and tables. Finally, you discover
good strategies for solving more complex probability problems involving the
Law of Total Probability and Bayes’ Theorem.
Part II: Counting on Probability
and Betting to Win
In this part, you get down to the nitty gritty of probability, solving problems that
involve two-way tables, permutations and combinations, and games of chance.
The bottom line in this part? Probability and intuition don’t always mix!
Part III: From A to Binomial:
Basic Probability Models
In this part, you build an important foundation for creating, using, and evaluating
probability models. You discover all the ins and outs of a probability
distribution; the basic concepts and rules for defining probability distributions;
and how to find probabilities, means, and variances. You work with the
binomial and normal distributions, and you find out how probability ties in to
the major results from statistics: the Central Limit Theorem, hypothesis testing,
and overall decision making in the real world.
Part IV: Taking It Up a Notch:
Advanced Probability Models
In this part, you work with more intermediate probability models that count
and try to predict the number of arrivals, successes, or the number of trials
needed to achieve a certain goal. The probability distributions I focus on are
the Poisson, negative binomial, geometric, and hypergeometric. You find out
how many customers you expect to come into a bank (Poisson distribution);
the number of poker hands you need to draw before you get four of a kind
(geometric distribution); the number of frames you need to bowl before getting
your third strike (the negative binomial distribution); and the probability
of getting a hand in poker (hypergeometric distribution).
Part V: For the Hotshots: Continuous
Probability Models
In this part, you look at some of the models you find in probability and statistics
courses that have calculus as a prerequisite — mainly the uniform
(continuous case) distribution, exponential distribution, and other user defined
probability density functions. You see how to find probabilities and
the expected value, variance, and standard deviation of continuous probability
models. And you apply the models to situations such as the time between arrivals of customers at the bank, time to complete a task, or the length of a
phone call. Note: Calculus is useful but not required for this part. I introduce
the methods that use calculus, but I also provide formulas and other methods
of solution that don’t use calculus for the uniform and exponential.
- See more at: http://www.aazzhosting.com/books/probability-for-dummies/#sthash.lvIQBEoF.dpuf