Addition of threading in GroupReadsByUmi and some other performance optimizations. #950

Merged
tfenne merged 5 commits into main from tf_speedup_group_reads on Dec 8, 2023

@tfenne tfenne commented Nov 27, 2023

No description provided.

@tfenne tfenne self-assigned this Nov 27, 2023

codecov bot commented Nov 27, 2023

Codecov Report

Attention: 1 lines in your changes are missing coverage. Please review.

Comparison is base (ff1ca67) 95.61% compared to head (226d08b) 95.62%.

❗ Current head 226d08b differs from pull request most recent head f233be2. Consider uploading reports for the commit f233be2 to get more accurate results

| Files | Patch % | Lines |
|---|---|---|
| ...cala/com/fulcrumgenomics/umi/GroupReadsByUmi.scala | 98.11% | 1 Missing ⚠️ |

Additional details and impacted files
@@           Coverage Diff           @@
##             main     #950   +/-   ##
=======================================
  Coverage   95.61%   95.62%           
=======================================
  Files         126      126           
  Lines        7321     7354   +33     
  Branches      504      502    -2     
=======================================
+ Hits         7000     7032   +32     
- Misses        321      322    +1     

| Flag | Coverage Δ |
|---|---|
| unittests | 95.62% <98.11%> (+<0.01%) ⬆️ |

@tfenne tfenne requested a review from nh13 December 1, 2023 21:36

@nh13 nh13 left a comment

Lots of really nice, thoughtful, and concise improvements here. I suggested a few ideas that you can likely skip in this round of improvements, but they could be interesting to think about.

.getOrElse(-1)

if (searchFromIdx >= 0) {
  val hits = taskSupport match {

Since taskSupport is defined outside this loop, would you consider making a partial function to use here (or use currying), so you don't have to match each time?
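
To make the suggestion concrete, here is a minimal sketch of hoisting the match out of the loop. It is not the code from this PR: `Search`, `mismatches`, `sequentialSearch`, `parallelSearch`, and `searcherFor` are hypothetical names, and since the real type of `taskSupport` is not visible in this hunk, an `Option[Int]` thread count stands in for it. The parallel branch uses plain `Future`s from the standard library rather than whatever `taskSupport` wraps.

```scala
import scala.concurrent.{Await, Future}
import scala.concurrent.ExecutionContext.Implicits.global
import scala.concurrent.duration.Duration

object HoistedMatchSketch {
  /** A search takes candidate UMIs plus a query and returns the indices of the matching candidates. */
  type Search = (IndexedSeq[String], String) => Seq[Int]

  /** Counts mismatching positions between two equal-length strings. */
  def mismatches(a: String, b: String): Int = a.zip(b).count { case (x, y) => x != y }

  /** Hypothetical single-threaded search. */
  def sequentialSearch(maxMismatches: Int): Search = (umis, query) =>
    umis.indices.filter(i => mismatches(umis(i), query) <= maxMismatches)

  /** Hypothetical multi-threaded search: splits the indices into chunks and scans each chunk in a Future. */
  def parallelSearch(maxMismatches: Int, threads: Int): Search = (umis, query) => {
    val chunkSize = math.max(1, math.ceil(umis.length.toDouble / threads).toInt)
    val futures = umis.indices.grouped(chunkSize).toList.map { chunk =>
      Future(chunk.filter(i => mismatches(umis(i), query) <= maxMismatches))
    }
    // Future.sequence keeps chunk order, so the indices come back sorted.
    Await.result(Future.sequence(futures), Duration.Inf).flatten
  }

  /** The match happens once here, instead of once per loop iteration. */
  def searcherFor(threads: Option[Int], maxMismatches: Int): Search = threads match {
    case Some(n) if n > 1 => parallelSearch(maxMismatches, n)
    case _                => sequentialSearch(maxMismatches)
  }
}
```

The loop would then build the function once, for example `val search = HoistedMatchSketch.searcherFor(Some(4), maxMismatches = 1)`, and call `search(umis, query)` on each iteration without re-matching.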

@@ -513,13 +577,14 @@ class GroupReadsByUmi
   @arg(flag='x', doc= """
     |DEPRECATED: this option will be removed in future versions and inter-contig reads will be
     |automatically processed.""")
-  @deprecated val allowInterContig: Boolean = true
+  @deprecated val allowInterContig: Boolean = true,
+  @arg(flag='@', doc="Number of threads to use when comparing UMIs. Only recommended for amplicon or similar data.") val threads: Int = 1,

Suggested change
@arg(flag='@', doc="Number of threads to use when comparing UMIs. Only recommended for amplicon or similar data.") val threads: Int = 1,
@arg(flag='@', doc="Number of threads to use when comparing UMIs. Only recommended for amplicon or similar data.") val threads: Int = 1,

@tfenne tfenne merged commit 9bb97ed into main Dec 8, 2023
4 checks passed
@tfenne tfenne deleted the tf_speedup_group_reads branch December 8, 2023 16:56