You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: data-cleaning_empty.qmd
+17-50Lines changed: 17 additions & 50 deletions
Original file line number
Diff line number
Diff line change
@@ -50,8 +50,6 @@ Let's focus on the non-numeric values as a starting point:
50
50
51
51
```{r}
52
52
snowsurvey_csv %>%
53
-
count(Snow_cover) %>%
54
-
filter(is.na(as.numeric(Snow_cover)))
55
53
56
54
```
57
55
@@ -70,29 +68,22 @@ snowsurvey_csv %>%
70
68
Interestingly, when there is a "dot" for snow cover, it is also the case for all the other covers. Let's replace those with NA since there is no supplemental information in the provided metadata about the use of dots
We want to focus on the presence and absence of species and not the count. Let's create a new column for presence where anything else than 0 is considered present
We can remove some columns: "Num_observers", "All_obs_reported", "Observer_hours" are here to help to compute the effort of observation but since we just want presence and absence, we do not need it. We can also remove all the zeros values to reduce the size of our data set:
0 commit comments