# A tibble: 2 × 3
arms_reduction num_responses pct
<fct> <int> <dbl>
1 against 452 0.440
2 favor 576 0.560
IMS, Ch. 16
Smith College
Apr 1, 2026
A simple random sample of 1,028 US adults in March 2013 found that 56% support nuclear arms reduction.
# A tibble: 2 × 3
arms_reduction num_responses pct
<fct> <int> <dbl>
1 against 452 0.440
2 favor 576 0.560
Response: arms_reduction (factor)
# A tibble: 1 × 1
stat
<dbl>
1 0.560
p_hat is a \(1 \times 1\) data.framep_hat$stat is the actual (scalar) valueBig Question
How do we construct the null distribution?
Pros:
Cons:
Let \(X \sim Bernoulli(p)\). Then,
\[ \mathbb{E}[X] = p, Var(X) = p(1-p) \]
Let \(Y = X_1 + \cdots + X_n\). Then \(Y \sim Binom(n, p)\) and,
\[ \mathbb{E}[Y] = np, Var(Y) = np(1-p) \]
\(Z = Y/n\) is a r.v. giving the mean of \(n\) draws from \(X\)!
Then,
\[ \mathbb{E}[Z] = p \]
And,
\[ Var(Z) = \frac{1}{n^2} \cdot np(1-p) = \frac{p(1-p)}{n} \]
And, \(sd(Z) = \sqrt{\frac{p(1-p)}{n}}\) is the standard error!
se <- sqrt(p_0 * (1 - p_0) / n)
dbinom_p <- function (x, size, prob, log = FALSE) {
size * dbinom(round(x * size), size, prob, log)
}
null_dist_2 <- null_dist +
geom_function(
fun = dbinom_p, color = "red",
args = list(size = n, prob = p_0)
) +
geom_errorbarh(
aes(xmax = p_0 + se, xmin = p_0 - se, y = 10),
color = "red"
)Does a majority support nuclear arms reduction?
Compare the SE and p-value from the previous method. Are they meaningfully different?
Pros:
Cons:
By CLT, for \(n\) large enough, null distribution is approximately normal
If \(np_0(1-p_0) > 10\), then approximation is reasonably good
Use \(SE_{\hat{p}} = \sqrt{\frac{p_0(1-p_0)}{n}}\) (from before)
inferDoes a majority support nuclear arms reduction?
Compare the SE and p-value from the previous method. Are they meaningfully different?
Pros:
Cons:
Solution is approximate, and sometimes approximation is not great
SE formula comes out of nowhere
harder to connect to big ideas?
