Skip to content

Commit

Permalink
Merge branch 'master' of github.com:glencoesoftware/bioformats2raw in…
Browse files Browse the repository at this point in the history
…to rgb
  • Loading branch information
melissalinkert committed Dec 13, 2023
2 parents 8a201f8 + 3def4ea commit 489cb66
Show file tree
Hide file tree
Showing 8 changed files with 424 additions and 128 deletions.
82 changes: 77 additions & 5 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -96,16 +96,17 @@ Alternatively, the `--resolutions` options can be passed to specify the exact nu
bioformats2raw /path/to/file.svs /path/to/zarr-pyramid --resolutions 6


Maximum tile dimensions can be configured with the `--tile_width` and `--tile_height` options. Defaults can be viewed with
Maximum tile dimensions can be configured with the `--tile-width` and `--tile-height` options. Defaults can be viewed with
`bioformats2raw --help`. Be mindful of the downstream workflow when selecting a tile size other than the default.
A smaller than default tile size is rarely recommended.

If the input file has multiple series, a subset of the series can be converted by specifying a comma-separated list of indexes:

bioformats2raw /path/to/file.scn /path/to/zarr-pyramid --series 0,2,3,4

By default, four additional readers (MiraxReader, PyramidTiffReader, BioTekReader, and ND2PlateReader) are added to the beginning of Bio-Formats' list of reader classes.
By default, several additional readers are added to the beginning of Bio-Formats' list of reader classes.
These readers are considered to be experimental and as a result only a limited range of input data is supported.
See the [Additional readers](#additional-readers) section below for more information.

Any of these readers can be excluded with the `--extra-readers` option:

Expand Down Expand Up @@ -275,9 +276,9 @@ Performance
This package is __highly__ sensitive to underlying hardware as well as
the following configuration options:

* `--max_workers`
* `--tile_width`
* `--tile_height`
* `--max-workers`
* `--tile-width`
* `--tile-height`

On systems with significant I/O bandwidth, particularly SATA or
NVMe based storage, you may find sharply diminishing returns with high
Expand Down Expand Up @@ -308,6 +309,77 @@ This is particularly helpful if you do not have write permissions in the input d
As of version 0.5.0, `.*.bfmemo` files are deleted at the end of conversion by default. We do not recommend keeping these files for normal
conversions, but if they are needed for troubleshooting then the `--keep-memo-files` option can be used.

Additional readers
==================

Readers are listed here in the order in which they appear on the reader list.

PyramidTiffReader
-----------------

Supports TIFF files that contain a pyramid, with one pyramid resolution per IFD. While this is different from standard pyramid OME-TIFF files,
any OME-XML stored in the first IFD's `ImageDescription` tag will be used to set the number of channels, timepoints, and Z sections.
If no OME-XML is present, each IFD is assumed to represent one channel at a particular resolution; multiple IFDs at the same resolution
therefore indicates multiple channels.

MiraxReader
-----------

Supports 3D HISTECH .mrxs data. Only the full-resolution image is read; bioformats2raw will generate a pyramid from
the full-resolution image, but will not read the original pyramid for this format. Datasets in this format include
a .mrxs file (which is a JPEG thumbnail), along with a similarly-named directory containing a `Slidedat.ini`, `Index.dat`,
and many `Data*.dat` files. The .mrxs file alone does not contain anything apart from the thumbnail;
it is very important to include the entire corresponding directory when transferring these datasets.

The `mirax.use_metadata_dimensions` reader option can be used change how XY dimensions are calculated.
By default, this option is `true`, but setting it to `false` may be helpful if the image size appears incorrect.

BioTekReader
------------

Supports BioTek Cytation 5 plates. Plates in this format consist of .tif files that follow a specific naming scheme;
unlike most other plate formats, there are no metadata files that describe the whole plate. All files for a plate must be in
the same folder as the selected bioformats2raw input file. File names must match one of a limited set of regular expressions:

* `([A-Z]{1,2})(\\d{1,2})_(-?\\d+)_(\\d+)_(\\d+)_([A-Za-z0-9 ,\\[\\]]+)_(\\d+).tif[f]?`
- This corresponds to: `<well row letter><well column index>_<ignored index>_<ignored index>_<field index>_<channel name>_<ignored index>`
- Examples:
- `A1_01_1_1_Phase Contrast_001.tif` (well A1, field 1, `Phase Contrast` channel)
- `P24_01_1_9_DAPI_002.tif` (well P24, field 9, `DAPI` channel)
- `A1_-2_1_1_Tsf[Stitched[Channel1 300,400]]_001.tif` (well A1, field 1, `Tsf[Stitched[Channel1 300,400]]` channel)
* `([A-Z]{1,2})(\\d{1,2})_(-?\\d+)(Z(\\d+))?_([A-Za-z0-9 ,\\[\\]]+)_(\\d+)_(\\d+)_(\\d+)?.tif[f]?`
- This corresponds to: `<well row letter><well column index>_<field index><optional 'Z' and index>_<channel name>_<ignored index>_<t index>_<optional ignored index>`
- Examples:
- `A1_1Z0_DAPI_1_001_.tif` (well A1, field 1, Z slice 0, `DAPI` channel, timepoint 1)
- `A1_1Z4_DAPI_1_003_.tif` (well A1, field 1, Z slice 4, `DAPI` channel, timepoint 3)
- `B2_1_Bright Field_1_001_02.tif` (well B2, field 1, `Bright Field` channel, timepoint 1)
* `([A-Z]{1,2})(\\d{1,2})_(-?\\d+)_.+\\[(.+)_([A-Za-z0-9 ,\\[\\]]+)\\]_(\\d+)_(\\d+)_([0-9-]+)?.tif[f]?`
- This corresponds to: `<well row letter><well column index>_<field index>_<optional ignored data>[<ignored data>_<channel name>]_<ignored index>_<t index>_<optional ignored index>`
- Example:
- `H10_1_Stitched[AandB_Phase Contrast]_1_001_-1.tif` (well H10, field 1, `Phase Contrast` channel, timepoint 1)

If the input file does not match the given regular expression, then the basic TIFF reader will be used to convert the
single input file without looking for other .tif files. It is especially important to check the conversion output when
working with BioTek plates for the first time or after any acquisition system updates, as there will not be an error in
the logs if the file name does not match any of the above regular expressions.

ND2PlateReader
--------------

Supports grouping multiple .nd2 files into a single HCS plate. To our knowledge, .nd2 files contain no HCS metadata or
awareness that multiple files are part of the same acquisition. This reader relies entirely upon the file name structure
to group .nd2 files in the same directory into a plate. Each file is assumed to represent one well, which may contain
multiple fields. All files for a plate must be in the same folder as the selected bioformats2raw input file.

File names must match the regular expression `(.*_?)Well([A-Z])(\\d{2})_Channel(.*)_Seq(\\d{4}).nd2`, e.g.
`Plate000_WellB02_ChannelDAPI,CY5,CY3_Seq0000.nd2`. In this case, `Plate000` is the plate name; only files with the
same plate name will be grouped together.

If the input file does not match the given regular expression, then the base `ND2Reader` will be used to convert the
single input file without looking for other .nd2 files. It is especially important to check the conversion output when
working with ND2 plates for the first time or after any acquisition system updates, as there will not be an error in
the logs if the file name does not match `ND2PlateReader`'s expectations.

License
=======

Expand Down
31 changes: 15 additions & 16 deletions build.gradle
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@ plugins {
}

group = 'com.glencoesoftware'
version = '0.7.1-SNAPSHOT'
version = '0.9.0-SNAPSHOT'

mainClassName = 'com.glencoesoftware.bioformats2raw.Converter'
sourceCompatibility = 1.8
Expand All @@ -18,21 +18,12 @@ repositories {
maven {
url 'https://artifacts.glencoesoftware.com/artifactory/ome.releases'
}
maven {
url 'https://repo.glencoesoftware.com/repository/n5-zarr-snapshots/'
}
maven {
url 'https://repo.glencoesoftware.com/repository/jzarr-snapshots'
}
maven {
url 'https://artifacts.glencoesoftware.com/artifactory/ome.external'
}
maven {
url 'https://artifacts.glencoesoftware.com/artifactory/scijava-thirdparty'
}
maven {
url 'https://artifacts.glencoesoftware.com/artifactory/jzarr-releases'
}
maven {
name 'Unidata'
url 'https://artifacts.glencoesoftware.com/artifactory/unidata-releases'
Expand All @@ -46,19 +37,19 @@ configurations.all {

dependencies {
implementation 'net.java.dev.jna:jna:5.10.0'
implementation 'ome:formats-gpl:7.0.0'
implementation 'info.picocli:picocli:4.6.1'
implementation 'ome:formats-gpl:7.1.0'
implementation 'info.picocli:picocli:4.7.5'
implementation 'com.univocity:univocity-parsers:2.8.4'
implementation 'com.bc.zarr:jzarr:0.3.5'
implementation 'dev.zarr:jzarr:0.4.2'
// implementation 'org.carlspring.cloud.aws:s3fs-nio:1.0-SNAPSHOT'
// implementation 'io.nextflow:nxf-s3fs:1.1.0'
implementation 'org.lasersonlab:s3fs:2.2.3'
implementation 'javax.xml.bind:jaxb-api:2.3.0'
implementation group: 'ch.qos.logback', name: 'logback-classic', version: '1.3.4'
implementation group: 'ch.qos.logback', name: 'logback-core', version: '1.3.4'
implementation group: 'ch.qos.logback', name: 'logback-classic', version: '1.3.14'
implementation group: 'ch.qos.logback', name: 'logback-core', version: '1.3.14'


implementation 'org.openpnp:opencv:3.4.2-1'
implementation 'org.openpnp:opencv:4.7.0-0'
implementation 'me.tongfei:progressbar:0.9.0'

// https://github.com/junit-team/junit5-samples/blob/master/junit5-migration-gradle/build.gradle
Expand Down Expand Up @@ -124,6 +115,14 @@ publishing {
publications {
bioformats2raw(MavenPublication) {
from components.java
pom {
licenses {
license {
name = 'GNU General Public License version 2'
url = 'http://www.gnu.org/licenses/gpl-2.0.txt'
}
}
}
}
}

Expand Down
Loading

0 comments on commit 489cb66

Please sign in to comment.