Feature/DIMS_QCinfo_in_mail by mraves2 · Pull Request #93 · UMCUGenetics/CustomModules

mraves2 · 2026-01-16T14:33:34Z

This feature adds extra QC information from the DIMS pipeline to the final email, so that the user can assess the quality of the run at a glance.
Several steps of the pipeline, notably AverageTechReplicates and GenerateQCOutput, generate extra txt files, which are included as content in DIMS.nf.


BasMonkey

I've made some remarks about performance and code duplication. Please see the comments left on the code.

BasMonkey · 2026-01-19T09:29:27Z

DIMS/GenerateQCOutput.R

 is_codes <- rownames(is_list)

-# check if there is data present for all the samples that the pipeline started with,
-# if not write sample name to a log file.


Seems like this comment has been lost in this change. I think it's quite useful.

BasMonkey · 2026-01-19T11:58:42Z

DIMS/GenerateQCOutput.R

+  # pos
+  for (line_index in seq_len(nrow(is_pos_selection_subset))) {
+    is_selected <- is_pos_selection_subset$HMDB_name[line_index]
+    thresh_selected <- all_is_thresholds$plasma$pos[which(all_is_thresholds$names$pos == is_selected)]


The which() call performs a linear search for every row. A more efficient approach is to use match().

mraves2 · 2026-01-22

In this case, the list of values is really small, so I'm not too worried about performance. which() is used often in different scripts of the pipeline, so the v3.5 refactor will investigate which instances of which() can be replaced by match(). The only fundamental difference between which() and match() is that the former returns all instances, whereas the latter returns only the first. For each occurrence of which() in the code, we'll have to decide whether it can be replaced by match().
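To make the which() versus match() trade-off concrete, here is a minimal sketch in R. The vectors are made up for illustration and are not taken from all_is_thresholds. Besides the first-versus-all behaviour mentioned above, the two also differ when nothing matches: which() returns an empty vector, while match() returns NA, so a replacement may need an explicit NA check.

```r
# Hypothetical lookup table; names and values are illustrative only.
threshold_names <- c("IS_A", "IS_B", "IS_C", "IS_B")
threshold_values <- c(100, 200, 300, 250)

# which() returns every position where the comparison is TRUE.
which_hits <- which(threshold_names == "IS_B")

# match() returns only the first matching position.
match_hit <- match("IS_B", threshold_names)

# No match: which() gives integer(0), match() gives NA.
which_none <- which(threshold_names == "IS_X")
match_none <- match("IS_X", threshold_names)

# match() is vectorised, so a whole column can be looked up in one call;
# names without a match yield NA in the result.
selected <- c("IS_C", "IS_A", "IS_X")
looked_up <- threshold_values[match(selected, threshold_names)]
```

If the names in the lookup table are unique, which() and match() point at the same element for every name that is present, which is the case where a swap is safe.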

BasMonkey · 2026-01-19T12:02:10Z

DIMS/GenerateQCOutput.R

+    is_selected <- is_pos_selection_subset$HMDB_name[line_index]
+    thresh_selected <- all_is_thresholds$plasma$pos[which(all_is_thresholds$names$pos == is_selected)]
+    if (is_pos_selection_subset$Intensity[line_index] < thresh_selected) {
+      is_below_threshold <- rbind(is_below_threshold, is_pos_selection_subset[line_index, ])


Avoid rbind() in a loop: it repeatedly reallocates and copies the data frame, which is inefficient and may use a huge amount of RAM for larger datasets. Consider collecting indices or rows first and binding once at the end.

mraves2 · 2026-01-22

Duly noted. This is a very small data frame, so I'll leave it as is, but in the refactor for v3.5 where all scripts are evaluated, I will take this point into consideration.
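For the v3.5 refactor, the suggestion above could look roughly like the following sketch. It is not the code in this PR: the data frame and threshold vectors are hypothetical stand-ins for is_pos_selection_subset and all_is_thresholds, and treating a name without a threshold as "not below" is an assumption made here for illustration.

```r
# Hypothetical stand-ins for is_pos_selection_subset and the threshold lookup.
is_subset <- data.frame(
  HMDB_name = c("IS_A", "IS_B", "IS_C"),
  Intensity = c(50, 500, 120),
  stringsAsFactors = FALSE
)
threshold_names <- c("IS_A", "IS_B", "IS_C")
threshold_values <- c(100, 200, 300)

# Look up the threshold for every row in one vectorised call.
thresholds <- threshold_values[match(is_subset$HMDB_name, threshold_names)]

# Build a logical index once; a row whose name has no threshold (NA)
# is treated as not below threshold (an assumption of this sketch).
below <- !is.na(thresholds) & is_subset$Intensity < thresholds

# Subset once instead of growing a data frame with rbind() inside a loop.
is_below_threshold <- is_subset[below, , drop = FALSE]
```

With these example values, the rows for IS_A and IS_C end up in is_below_threshold. The data frame is allocated once, regardless of how many rows are selected.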


mraves2 added 5 commits October 3, 2025 16:39

added extra output on QC of SST sample to DIMS/GenerateQCOutput

000b178

Merge remote-tracking branch 'origin/develop' into feature/DIMS_QCinf…

6000929

…o_in_mail

added output for QC on internal standards to include in mail

17b6d50

modified DIMS AverageTechReplicates for QC info in mail

af62a0e

modified DIMS GenerateQCOutput for QC info in mail

7c349ce

BasMonkey requested changes Jan 19, 2026


mraves2 added 6 commits January 22, 2026 12:14

moved generation of list of internal standards below threshold to fun…

76baae9

…ction

added unit test for function find_is_below_threshold

22140fb

removed erroneous space

4501fd9

corrected typo

5f32d2f

added library data.table

534002e

replaced library data.table with reshape2

7c0d056

mraves2 wants to merge 11 commits into develop from feature/DIMS_QCinfo_in_mail
