t_test.Rmd

---
title: "<center>**Parametric and non-parametric t-tests in R**</center>"
author: "<center>Wyclife Agumba Oluoch (wyclifeoluoch@gmail.com) </center>"
date: "<center>`r Sys.time()`</center>"
bibliography: 
  - bib/packages.bib
nocite: '@*'
output: 
  html_document:
    toc: true
    toc_depth: 2
    toc_float: true
---

```{r setup, include=FALSE}
knitr::opts_chunk$set(echo = TRUE)
```

```{r libs, echo=FALSE, warning=FALSE, message=FALSE, include=FALSE}
packages <- c("base",
              'knitr',
              'rmarkdown',
              'utils',
              'stats',
              'nortest',
              'car')
installed_packages <- packages %in% rownames(installed.packages())
if(any(installed_packages == FALSE)){
  install.packages(packages[!installed_packages])
}
lapply(packages, library, character.only = TRUE) |> invisible()
```

```{r write_bib, echo=FALSE, warning=FALSE, message=FALSE, include=FALSE}
knitr::write_bib(c(
  .packages(), packages
), 'bib/packages.bib')
```

# Creating the data_set

We create random data with two factor levels and continuous variable.

```{r dataset, echo = TRUE}
data_set <- data.frame(Gender = c(rep('Male', 100), rep('Female', 100)),
                   abundance = sample(x = 7:40, size = 200, replace = TRUE))
```

# Checking for normality

We use a number of normality checks on our created data including qqplot, qqline, shapiro, and anderson darling.

## Using qqnorm and qqline

```{r normality_check_qq}
qqnorm(data_set$abundance)
qqline(data_set$abundance)
```

## Using Shapiro test

```{r normality_check_shapiro}
shapiro.test(data_set$abundance)
```

p<a hence we reject the null hypothesis and conclude that the data is no-parametric

## Using Anderson-Darling test

```{r normality_check_Anderson_Darling}
ad.test(data_set$abundance)
```

p>a hence we fail to reject the null hypothesis and conclude that the data is parametric

# Test for equal variance using levene's test

```{r}
leveneTest(data_set$abundance ~ factor(data_set$Gender))
```

p>a hence we fail to reject the null hypothesis and conclude that the data 
have equal variance

# Parametric t-test with equal variance

```{r t-test_equal_variance}
t.test(abundance ~ Gender,
       data = data_set,
       alternative = 'two.sided',
       paired = F,
       var.equal = T)
```

# Non-parametric t-test with equal variance

```{r non-para_equal_variance}
wilcox.test(abundance ~ Gender,
       data = data_set,
       alternative = 'two.sided',
       paired = F,
       var.equal = T)
```

This is normally used when the data is non parametric, i.e. p<a in case of  shapiro.test or ad.test results.

# References