Files
ArtStudies/M1/General Linear Models/TP1/TP1.Rmd
2024-11-13 18:30:38 +01:00

77 lines
1.6 KiB
Plaintext

```{r}
setwd('/Users/arthurdanjou/Workspace/studies/M1/General Linear Models/TP1')
```
```{r}
library(rmarkdown)
health <- read.table("./health.txt", header = TRUE, sep = " ", dec = ".")
paged_table(health)
```
```{r}
Health <- health[2:5]
library(dplyr)
library(corrplot)
correlation_matrix <- cor(Health)
corrplot(correlation_matrix, order = 'hclust', addrect = 3)
```
```{r}
model <- lm(y ~ ., data = Health)
coefficients(model)
summary(model)
```
```{r}
library(ggfortify)
library(car)
autoplot(model, 1:3)
```
The points are not well distributed around 0 -> [P1] is not verified
The points are not well distributed around 1 -> [P2] is not verified
The QQPlot is aligned with the line y = x, so it is globally gaussian -> [P4] is verified
```{r}
set.seed(0)
durbinWatsonTest(model)
```
The p-value is 0.58 > 0.05 -> We do not reject H0 so the residuals are not auto-correlated -> [P3] is verified
```{r}
library(GGally)
ggpairs(Health, progress = F)
```
We observe that the variable age is correlated with the variable y. There is a quadratic relation between both variables.
```{r}
Health2 <- Health
Health2$age_sq <- Health2$age^2
Health2 <- Health2[1:24,]
model2 <- lm(y ~ ., data = Health2)
summary(model2)
coefficients(model2)
```
```{r}
library(ggfortify)
library(car)
autoplot(model2, 1:4)
```
The points are well distributed around 0 -> [P1] is verified
The points are not well distributed around 1 -> [P2] is verified
The QQPlot is aligned with the line y = x, so it is gaussian -> [P4] is verified
```{r}
set.seed(0)
durbinWatsonTest(model2)
```
The p-value is 0.294 > 0.05 -> We do not reject H0 so the residuals are not auto-correlated -> [P3] is verified