Two sample bootstrap r code

Performs one and two sample bootstrap t-tests on vectors of data.

Usage

boot.t.test(x, . ) ## Default S3 method: boot.t.test(x, y = NULL, alternative = c("two.sided", "less", "greater"), mu = 0, paired = FALSE, var.equal = FALSE, conf.level = 0.95, R = 9999, symmetric = FALSE, . ) ## S3 method for class 'formula' boot.t.test(formula, data, subset, na.action, . )

Arguments

a (non-empty) numeric vector of data values.

an optional (non-empty) numeric vector of data values.

a character string specifying the alternative hypothesis, must be one of "two.sided" (default), "greater" or "less" . You can specify just the initial letter.

a number indicating the true value of the mean (or difference in means if you are performing a two sample test).

a logical indicating whether you want a paired t-test.

a logical variable indicating whether to treat the two variances as being equal. If TRUE then the pooled variance is used to estimate the variance otherwise the Welch (or Satterthwaite) approximation to the degrees of freedom is used.

confidence level of the interval.

number of bootstrap replicates.

a logical variable indicating whether to assume symmetry in the two-sided test. If TRUE then the symmetric bootstrap p value otherwise the equal-tail boostrap p value is computed.

a formula of the form lhs ~ rhs where lhs is a numeric variable giving the data values and rhs a factor with two levels giving the corresponding groups.

an optional matrix or data frame (or similar: see model.frame ) containing the variables in the formula formula . By default the variables are taken from environment(formula) .

an optional vector specifying a subset of observations to be used.

a function which indicates what should happen when the data contain NA s. Defaults to getOption("na.action") .

further arguments to be passed to or from methods.

Details

The implemented test corresponds to the proposal of Chapter 16 of Efron and Tibshirani (1993).

The function returns bootstrapped p values and confidence intervals as well as the results ot the t-test without bootstrap.

The formula interface is only applicable for the 2-sample tests.

alternative = "greater" is the alternative that x has a larger mean than y .

If paired is TRUE then both x and y must be specified and they must be the same length. Missing values are silently removed (in pairs if paired is TRUE ). If var.equal is TRUE then the pooled estimate of the variance is used. By default, if var.equal is FALSE then the variance is estimated separately for both groups and the Welch modification to the degrees of freedom is used.

If the input data are effectively constant (compared to the larger of the two means) an error is generated.

Value

A list with class "boot.htest" (derived from class htest ) containing the following components:

the value of the t-statistic.

the degrees of freedom for the t-statistic.

the p-value for the test.

the bootstrapped p-value for the test.

a confidence interval for the mean appropriate to the specified alternative hypothesis.

a bootstrap percentile confidence interval for the mean appropriate to the specified alternative hypothesis.

the estimated mean or difference in means depending on whether it was a one-sample test or a two-sample test.

the specified hypothesized value of the mean or mean difference depending on whether it was a one-sample test or a two-sample test.

the standard error of the mean (difference), used as denominator in the t-statistic formula.

bootstrapped standard error.

a character string describing the alternative hypothesis.

a character string indicating what type of t-test was performed.

a character string giving the name(s) of the data.

Note

Code and documentation are for large parts identical to function t.test .

References

B. Efron, R.J. Tibshirani. An Introduction to the Bootstrap. Chapman and Hall/CRC 1993.

Examples

require(graphics) t.test(1:10, y = c(7:20)) # P = .00001855 boot.t.test(1:10, y = c(7:20)) t.test(1:10, y = c(7:20, 200)) # P = .1245 -- NOT significant anymore boot.t.test(1:10, y = c(7:20, 200)) ## Classical example: Student's sleep data plot(extra ~ group, data = sleep) ## Traditional interface with(sleep, t.test(extra[group == 1], extra[group == 2])) with(sleep, boot.t.test(extra[group == 1], extra[group == 2])) ## Formula interface t.test(extra ~ group, data = sleep) boot.t.test(extra ~ group, data = sleep)