Addition of threading in GroupReadsByUmi and some other performance optimizations. #950

tfenne · 2023-11-27T22:03:36Z

No description provided.

codecov · 2023-11-27T22:21:18Z

Codecov Report

Attention: 1 lines in your changes are missing coverage. Please review.

Comparison is base (ff1ca67) 95.61% compared to head (226d08b) 95.62%.

❗ Current head 226d08b differs from pull request most recent head f233be2. Consider uploading reports for the commit f233be2 to get more accurate results

Files	Patch %	Lines
...cala/com/fulcrumgenomics/umi/GroupReadsByUmi.scala	98.11%	1 Missing ⚠️

Additional details and impacted files

@@           Coverage Diff           @@
##             main     #950   +/-   ##
=======================================
  Coverage   95.61%   95.62%           
=======================================
  Files         126      126           
  Lines        7321     7354   +33     
  Branches      504      502    -2     
=======================================
+ Hits         7000     7032   +32     
- Misses        321      322    +1

Flag	Coverage Δ
unittests	`95.62% <98.11%> (+<0.01%)`	⬆️

nh13

Lots of really nice, thoughtful, and concise improvements here. I suggested a few ideas that you can likely skip in this round of improvements but that could be interesting to think about.

nh13 · 2023-12-02T15:59:36Z

src/main/scala/com/fulcrumgenomics/umi/GroupReadsByUmi.scala

+              .getOrElse(-1)
+
+            if (searchFromIdx >= 0) {
+              val hits = taskSupport match {


Since taskSupport is defined outside this loop, would you consider making a partial function to use here (or use currying), so you don't have to match each time?
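To make the suggestion concrete, here is a rough sketch of hoisting the match out of the loop. It is not the code in this PR: `HoistedMatchSketch`, `makeFilter`, `UmiFilter`, and `withinDistance` are hypothetical names, and an `Option[ExecutionContext]` with plain `Future`s stands in for `taskSupport` so that the example only needs the standard library.

```scala
import scala.concurrent.{Await, ExecutionContext, Future}
import scala.concurrent.duration.Duration

object HoistedMatchSketch {
  /** Hypothetical stand-in for the UMI comparison: mismatch count between equal-length strings. */
  def withinDistance(a: String, b: String, maxMismatches: Int): Boolean =
    a.length == b.length && a.zip(b).count { case (x, y) => x != y } <= maxMismatches

  /** A filter over candidate UMIs; the serial-vs-parallel choice is fixed when it is built. */
  type UmiFilter = (IndexedSeq[String], String => Boolean) => IndexedSeq[String]

  /** Matches on the optional pool exactly once and returns the function to call in the loop.
    * Assumes `chunks` is positive. */
  def makeFilter(pool: Option[ExecutionContext], chunks: Int = 4): UmiFilter = pool match {
    case None => (umis, keep) => umis.filter(keep)
    case Some(ec) => (umis, keep) => {
      implicit val context: ExecutionContext = ec
      val chunkSize = math.max(1, umis.length / chunks)
      // Filter each chunk in its own Future; Future.sequence preserves chunk order.
      val parts = umis.grouped(chunkSize).map(chunk => Future(chunk.filter(keep))).toIndexedSeq
      Await.result(Future.sequence(parts), Duration.Inf).flatten
    }
  }

  def main(args: Array[String]): Unit = {
    val umis   = Vector("AAAA", "AAAT", "GGGG", "AATT")
    val filter = makeFilter(Some(ExecutionContext.global)) // matched once, outside the loop
    for (root <- Seq("AAAA", "GGGG")) {
      val hits = filter(umis, other => withinDistance(root, other, 1))
      println(s"$root -> ${hits.mkString(",")}")
    }
  }
}
```

With this shape, the loop body calls `filter(...)` directly and never needs to know whether a pool was configured.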

nh13 · 2023-12-02T16:01:09Z

src/main/scala/com/fulcrumgenomics/umi/GroupReadsByUmi.scala

@@ -513,13 +577,14 @@ class GroupReadsByUmi
 @arg(flag='x', doc= """
                         |DEPRECATED: this option will be removed in future versions and inter-contig reads will be
                         |automatically processed.""")
-  @deprecated val allowInterContig: Boolean = true
+  @deprecated val allowInterContig: Boolean = true,
+@arg(flag='@', doc="Number of threads to use when comparing UMIs. Only recommended for amplicon or similar data.") val threads: Int = 1,


Suggested change

@arg(flag='@', doc="Number of threads to use when comparing UMIs. Only recommended for amplicon or similar data.") val threads: Int = 1,

@arg(flag='@', doc="Number of threads to use when comparing UMIs. Only recommended for amplicon or similar data.") val threads: Int = 1,

Addition of threading in GroupReadsByUmi and some other performance optimizations.

4c0c101

tfenne self-assigned this Nov 27, 2023

Fix failing tests.

4d43e87

One more minor tweak.

b8357e2

tfenne requested a review from nh13 December 1, 2023 21:36

tfenne assigned nh13 Dec 1, 2023

nh13 approved these changes Dec 2, 2023

tfenne added 2 commits December 8, 2023 09:35

A couple more small optimizations and extended the tests.

226d08b

Fixing some whitespace

f233be2

tfenne merged commit 9bb97ed into main Dec 8, 2023
4 checks passed

tfenne deleted the tf_speedup_group_reads branch December 8, 2023 16:56
