Baptiste Nusaibah validation #11

BaptisteArchambaud · 2024-09-23T14:32:13Z

Add of functions for validation of function ae_table_grade

into baptiste-validation

DanChaltiel

Très bon début c'est super !

Il faut qu'on discute de l'output.
A terme, il faut que ça ait la structure d'un test de package, donc avec la syntaxe de testthat que je vous avais présentée.
https://github.com/Oncostat/grstat/blob/main/tests/testthat/test-ae-tables.R
Cette syntaxe n'a pas de nuance, soit le test passe, soit il échoue.

Proposition :

utiliser expect_xxx() pour tester les différences de N et pct (diff majeure)
utiliser message() pour signaler les différences de style (diff mineure), genre les informations manquantes dans une tables mais pas dans l'autre

Par contre c'est difficile de lire le code sans voir l'application vu que les data sont private.
Ce serait compliqué de faire ce qu'il y a dans #9 pour pouvoir tout faire sur GitHub ?

DanChaltiel · 2024-09-23T14:37:14Z

R/validation fonctions AE_table_grade/compair_grade.R

ce seront des fonctions de testing, elles ne doivent pas aller dans R/
Utilise usethis::use_test_helper()
Tu peux aussi aller voir la doc de testthat: https://cran.r-project.org/web/packages/testthat/vignettes/special-files.html

R/validation fonctions AE_table_grade/compair_grade.R

DanChaltiel · 2024-09-23T14:37:49Z

R/validation fonctions AE_table_grade/compair_grade.R

+
+  if (ncol(tabR)!=ncol(tabSAS)){stop("Different number of arm")}
+  if (all(dim(tabR)==dim(tabSAS))){
+    print("Check: same dimension of tables")


programmation défensive: on ne print pas si tout va bien, on warn s'il y a un problème

DanChaltiel · 2024-09-23T14:38:05Z

R/validation fonctions AE_table_grade/compair_grade.R

+  if (all(dim(tabR)==dim(tabSAS))){
+    print("Check: same dimension of tables")
+    df=tabR%>%arrange(grade)%>%full_join(tabSAS,by="grade",suffix = c(".r",".sas"))
+    indice=which(df[,paste0(tabR%>%select(-grade)%>%colnames(),paste=".r")]!=df[,paste0(tabSAS%>%select(-grade)%>%colnames(),paste=".sas")],


je n'aime vraiment pas les indices, je trouve que c'est à risque d'erreur
Cf mon commentaire sur Teams

DanChaltiel · 2024-09-23T14:38:30Z

R/validation fonctions AE_table_grade/group_grades_zeroNA.R

+    mutate(grade = replace_na(grade, 0)) %>% 
+    group_by(grade) %>% 
+    mutate(across(starts_with("N"), ~sum(., na.rm = T))) %>% 
+    distinct(grade, .keep_all = T) %>% 


On ne peut pas remplacer mutate+distinct par summarise ?

je crois que c'est parce que je n'arrivais pas à keep toutes les variables dans la base avec summarize(.by=) quand il y avait plusieurs bras

DanChaltiel · 2024-09-23T14:49:39Z

R/validation fonctions AE_table_grade/separate_n_pct.R

+  data <- colnames(data) %>% 
+    imap(
+      ~data %>% select(all_of(.x)) %>% 
+        separate(.x, into = c(paste0("N", .y), paste0("pct", .y)), sep = "\\(")
+    ) %>% 
+    bind_cols()


https://tidyr.tidyverse.org/reference/separate_wider_delim.html
On doit pouvoir s'en sortir sans boucle avec separate_wider_regex(), pas évident mais l'exemple aide beaucoup.

DanChaltiel · 2024-09-23T14:50:01Z

R/validation fonctions AE_table_grade/group_grades_zeroNA.R

+  ngroups <- (ncol(data) - 1) / 2
+  for(i in 1:ngroups){
+    npatients <- sum(data[, paste0("N", i)])
+    data[data$grade == 0, paste0("pct", i)] <- round(data[data$grade == 0, paste0("N", i)] * 100 / npatients, round)
+  }


Je vais avoir besoin de lancer le code pour trouver comment appliquer purrr

DanChaltiel · 2024-09-23T14:50:20Z

R/validation fonctions AE_table_grade/group_grades_zeroNA.R

+    mutate(grade = replace_na(grade, 0)) %>% 
+    group_by(grade) %>% 


mutate(.by=grade), plus concis et ne nécessite pas de ungroup()

DanChaltiel · 2024-09-23T14:53:39Z

R/validation fonctions AE_table_grade/compair_grade.R

+  if (nrow(tabR)!=nrow(tabSAS)){stop("Different number of grade levels")
+  }


soit sur une ligne, sans les {},
soit sur 3 lignes
jamais sur 2

DanChaltiel · 2024-11-12T14:55:41Z

R/utils_outputs_ae_table_grade.R

+  data <- colnames(data) %>% 
+    imap(
+      ~data %>% select(all_of(.x)) %>% 
+        separate(.x, into = c(paste0("N", .y), paste0("pct", .y)), sep = "\\(")
+    ) %>% 
+    bind_cols()
+
+  #extraction of figures into numeric columns
+  data <- data %>% 
+    mutate(
+      across(everything(), ~as.numeric(str_extract(.x, "\\d+\\.?\\d*")))
+    )


Belle utilisation de imap() :-)
Il y a une nouvelle fonction detidyr qui ferait le taff aussi (à adapter à plusieurs bras, c'est juste pour l'exemple):

data %>% separate_wider_regex(cols=-c(.id, label, variable, grade), patterns=c(N="\\d+", " \\(", pct="\\d+", "%\\)"))

DanChaltiel

Super Nusaibah, merci beaucoup !
Par contre il y a des erreurs dans les outputs donc ce n'est malheureusement pas terminé 😕.
Désolé de t'embêter encore !

Pour simplifier le process, je t'ai créé un dossier \rsas\_test_nusaibah\ avec tous les inputs/outputs/tests standardisés. Tu peux lancer le rproj et sourcer (ctrl shift s) le fichier test.nusaibah.R directement.
Le dossier fonctions_validationR contient tes fonctions de validations sans modification de ma part (je crois, en tout cas pas dans compare_xxx, mais remplace par tes propres fichiers pour être sûr).
Tu peux modifier ce que tu veux comme bon te semble, je n'y touche plus.

La fonction add_errors() ajoute une erreur standard à la 2ème ligne des tableaux R. Ca devrait correspondre au grade 1 dans AEGRADE, mais l'output donne le grade 0 pour les pourcentages.
Dans AESOC, il y a un problème de jointure quand on renseigne les termes and plus des SOC, je t'ai mis la ligne en code-review.

Je reste évidemment dispo si besoin qu'on en discute :-) !

DanChaltiel · 2025-04-07T08:33:52Z

R/compare_soc.R

+
+  if (nrow(indice)!=0){
+
+    tab =rbind.fill(as.data.frame(tab),indice%>%


La fonction rbind.fill() vient du package plyr qui n'est pas importé dans grstat et ne devrait vraiment plus être utilisé aujourd'hui (package non maintenu). Je te conseillerais même de désinstaller plyr pour éviter de l'utiliser par erreur. Je crois qu'on peut remplacer par bind_rows().

DanChaltiel · 2025-04-07T08:59:14Z

R/compare_soc.R

+    df=tabR%>%arrange(soc)%>%
+      pivot_longer(-c("soc"),names_to = "grade",values_to = "count")%>%
+      mutate(table="R")%>%
+      full_join(tabSAS%>%   #instead of full


Ce full join émet un warning :

Avis dans full_join(., tabSAS %>% pivot_longer(-c("soc"), names_to = "grade", : Detected an unexpected many-to-many relationship between `x` and `y`. ℹ Row 1 of `x` matches multiple rows in `y`. ℹ Row 1 of `y` matches multiple rows in `x`. ℹ If a many-to-many relationship is expected, set `relationship = "many-to-many"` to silence this warning.

Il est émit quand term est renseigné, et je crois qu'il fautdrait écrire pivot_longer(-any_of(c("soc", "term"), ...), ou quelque chose du genre (et dans by aussi).
J'ai l'impression que le problème d'output vient de là.

github-actions bot and others added 4 commits September 23, 2024 14:05

Update dev version (Github Actions)

63521d9

validation functions added for ae_table_grade

15f314a

Merge branch 'baptiste-validation' of https://github.com/Oncostat/grstat

b3c2fb8

into baptiste-validation

Update dev version (Github Actions)

2af87f8

DanChaltiel added this to the v0.2 milestone Sep 23, 2024

DanChaltiel requested changes Sep 23, 2024

View reviewed changes

NusaibahIbr changed the title ~~Baptiste nusaibah validation~~ Baptiste Nusaibah validation Oct 24, 2024

BaptisteArchambaud added 4 commits October 29, 2024 12:10

updating structure of AE_table_grade validation files

6a4b9e7

Merge branch 'main' into baptiste-nusaibah-validation

c39b6fd

Update outputs_ae_table_grade.R

7d743c5

ae_table_soc validation - formatting SAS and R outputs

e5d6c1a

DanChaltiel reviewed Nov 12, 2024

View reviewed changes

NusaibahIbr added 9 commits November 19, 2024 14:20

compare_grade output table

99e08ff

sortie des 2 tables concordantes

dcb1467

create compare_soc (copy compare_grade)

e5447f8

modification according AE output with SOC & PT

d8c1d68

correction

cc6e7fa

Nettoyer le script et compatibilité avec les tables avec bras de trt

60b5318

improve and finish compare_soc function

e272d83

work in progress

3d6074c

Merge branch 'main' into baptiste-nusaibah-validation

09a1057

DanChaltiel mentioned this pull request Jan 7, 2025

Validation AE: SAS vs R sur simulations #46

Open

NusaibahIbr added 3 commits January 16, 2025 17:28

end to major modifications

632809c

formatting with flextable package

eca9df5

change to make the comparison more exhaustive

4115b8a

DanChaltiel reviewed Apr 7, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Baptiste Nusaibah validation #11

Baptiste Nusaibah validation #11

Uh oh!

BaptisteArchambaud commented Sep 23, 2024

Uh oh!

DanChaltiel left a comment

Uh oh!

DanChaltiel Sep 23, 2024

Uh oh!

Uh oh!

DanChaltiel Sep 23, 2024

Uh oh!

DanChaltiel Sep 23, 2024

Uh oh!

DanChaltiel Sep 23, 2024

Uh oh!

BaptisteArchambaud Sep 23, 2024

Uh oh!

DanChaltiel Sep 23, 2024

Uh oh!

DanChaltiel Sep 23, 2024

Uh oh!

DanChaltiel Sep 23, 2024

Uh oh!

DanChaltiel Sep 23, 2024

Uh oh!

DanChaltiel Nov 12, 2024 •

edited

Loading

Uh oh!

DanChaltiel left a comment

Uh oh!

DanChaltiel Apr 7, 2025

Uh oh!

DanChaltiel Apr 7, 2025

Uh oh!

Uh oh!

		if (nrow(tabR)!=nrow(tabSAS)){stop("Different number of grade levels")
		}


		if (nrow(indice)!=0){

		tab =rbind.fill(as.data.frame(tab),indice%>%

Baptiste Nusaibah validation #11

Are you sure you want to change the base?

Baptiste Nusaibah validation #11

Uh oh!

Conversation

BaptisteArchambaud commented Sep 23, 2024

Uh oh!

DanChaltiel left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

DanChaltiel Nov 12, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

DanChaltiel left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

DanChaltiel Nov 12, 2024 •

edited

Loading