libBIDS.sh

A Bash library for parsing and processing BIDS datasets into CSV-like structures, enabling flexible data filtering, extraction, and iteration within shell scripts.

Pattern matching is permissive with respect to BIDS spec, it may match some files which do not meet validation requirements.

Features

Converts BIDS datasets into a flat CSV format
Extracts key BIDS entities from filenames
Provides filtering, column selection, and row operations
Allows iteration over rows with associative arrays
Handles JSON sidecar files and metadata
Designed for shell scripting in pipelines and automation

Requirements

Bash version: ≥ 4.3 (Due to associative arrays, readarray, and declare -n usage)

macOS users: Apple's default Bash (3.2) is too old. You must upgrade to ≥ 4.3.

See: https://apple.stackexchange.com/questions/193411/update-bash-to-version-4-0-on-osx

Usage

Sourcing the Library

Include the library in your script:

source libBIDS.sh

Command-Line Execution

Run directly to dump dataset as CSV:

./libBIDS.sh /path/to/bids

Core Functions

`libBIDSsh_parse_bids_to_csv`

Parses a directory tree, identifies BIDS files, extracts BIDS entities, and outputs CSV.

csv_data=$(libBIDSsh_parse_bids_to_csv "/path/to/bids")

Output columns:

derivatives: Pipeline name if in derivatives folder
data_type: BIDS data type (anat, func, dwi, etc.)
BIDS entities: subject, session, sample, task, acquisition, etc.
suffix: File suffix (bold, T1w, dwi, etc.)
extension: File extension
path: Full file path

Filtering and Subsetting

`libBIDSsh_csv_filter`

Filters CSV data by columns, values, regex, and missing data.

libBIDSsh_csv_filter "${csv_data}" [OPTIONS]

Options:

-c, --columns <col1,col2,...>: Select columns by name or index
-r, --row-filter <col:pattern>: Keep rows matching value/regex (AND logic for multiple filters)
-d, --drop-na <col1,col2,...>: Drop rows where listed columns are "NA"

Examples:

# Keep only subject and task columns
libBIDSsh_csv_filter "$csv_data" -c "sub,task"

# Filter for resting-state tasks
libBIDSsh_csv_filter "$csv_data" -r "task:rest"

# Multiple filters: rest task AND drop missing sessions
libBIDSsh_csv_filter "$csv_data" -r "task:rest" -d "ses"

# Complex filtering with regex
libBIDSsh_csv_filter "$csv_data" -r "task:(rest|motor)" -r "run:[1-3]"

`libBIDSsh_drop_na_columns`

Removes columns that contain only NA values across all rows.

cleaned_csv=$(libBIDSsh_drop_na_columns "$csv_data")

Example:

# Remove empty columns from dataset
csv_data=$(libBIDSsh_parse_bids_to_csv "/path/to/bids")
cleaned_csv=$(libBIDSsh_drop_na_columns "$csv_data")

JSON Processing

`libBIDSsh_extension_json_rows_to_column_json_path`

Processes CSV data to add a json_path column that links data files to their JSON sidecars.

updated_csv=$(libBIDSsh_extension_json_rows_to_column_json_path "$csv_data")

Behavior:

Matches JSON files to corresponding data files based on BIDS entities
Drops JSON rows that have matching data files
Keeps unmatched JSON files with their path in json_path
Adds NA for data files without JSON sidecars

Example:

csv_data=$(libBIDSsh_parse_bids_to_csv "/path/to/bids")
csv_with_json=$(libBIDSsh_extension_json_rows_to_column_json_path "$csv_data")

`libBIDSsh_json_to_associative_array`

Parses a JSON file into a bash associative array with type information.

declare -A json_data
libBIDSsh_json_to_associative_array "file.json" json_data

Value format:

type:value for primitives (e.g., string:hello, number:42)
array:item1,item2,item3 for arrays
object:{json_string} for nested objects

Example:

declare -A sidecar
libBIDSsh_json_to_associative_array "sub-01_task-rest_bold.json" sidecar
echo "TR: ${sidecar[RepetitionTime]}"  # Output: number:2.0

Column Extraction

`libBIDSsh_csv_column_to_array`

Extracts a column as a Bash array with deduplication and NA filtering.

libBIDSsh_csv_column_to_array "$csv_data" "column" array_var [unique] [exclude_NA]

Arguments:

csv_data: CSV-formatted string
column: Column name or index
array_var: Name of array variable to populate
unique: "true" (default) to return only unique values
exclude_NA: "true" (default) to exclude NA values

Example:

declare -a subjects
libBIDSsh_csv_column_to_array "$csv_data" "sub" subjects true true
echo "Unique subjects: ${subjects[@]}"

declare -a all_runs
libBIDSsh_csv_column_to_array "$csv_data" "run" all_runs false false
echo "All runs (including duplicates and NA): ${all_runs[@]}"

Row Iteration

`libBIDS_csv_iterator`

Iterates CSV rows, exposes fields in an associative array with optional sorting.

while libBIDS_csv_iterator "$csv_data" row_var [sort_col1] [sort_col2] [-r]; do
  # Process row
done

Arguments:

csv_data: CSV data string
row_var: Name of associative array to populate with each row
sort_columns: Optional column names to sort by
-r: Optional reverse sort flag

Example:

declare -A row
while libBIDS_csv_iterator "$csv_data" row "sub" "ses" "run"; do
  echo "Processing: ${row[sub]} ${row[ses]} ${row[run]}: ${row[path]}"
done

## Internal Functions

### `_libBIDSsh_parse_filename`

Internal function that parses BIDS filenames into component entities.

```bash
declare -A file_info
_libBIDSsh_parse_filename "sub-01_task-rest_bold.nii.gz" file_info

Populated fields:

Individual BIDS entities (sub, ses, task, etc.)
suffix: File suffix
extension: File extension
data_type: Inferred data type
derivatives: Pipeline name if applicable
path: Full path
_key_order: Order of keys for iteration

Complete Examples

Basic Dataset Processing

#!/usr/bin/env bash
source libBIDS.sh

bids_path="/path/to/bids"
csv_data=$(libBIDSsh_parse_bids_to_csv "$bids_path")

# Extract unique subjects
declare -a subjects
libBIDSsh_csv_column_to_array "$csv_data" "sub" subjects true true
echo "Found subjects: ${subjects[*]}"

# Clean up empty columns
csv_data=$(libBIDSsh_drop_na_columns "$csv_data")

# Add JSON sidecar information
csv_data=$(libBIDSsh_extension_json_rows_to_column_json_path "$csv_data")

Functional Data Processing

#!/usr/bin/env bash
source libBIDS.sh

bids_path="/path/to/bids"
csv_data=$(libBIDSsh_parse_bids_to_csv "$bids_path")

# Filter for functional BOLD data with JSON sidecars
func_csv=$(libBIDSsh_csv_filter "$csv_data" \
  -r "data_type:func" \
  -r "suffix:bold")

# Add JSON paths
func_csv=$(libBIDSsh_extension_json_rows_to_column_json_path "$func_csv")

# Process each file with its JSON metadata
declare -A row
while libBIDS_csv_iterator "$func_csv" row "sub" "ses" "run"; do
  echo "Processing: ${row[path]}"

  if [[ "${row[json_path]}" != "NA" ]]; then
    declare -A json_data
    libBIDSsh_json_to_associative_array "${row[json_path]}" json_data
    echo "  TR: ${json_data[RepetitionTime]}"
    echo "  Task: ${json_data[TaskName]}"
  fi
done

Notes

All functions handle CSV data as strings, not files
NA values are used for missing BIDS entities
Pattern matching is permissive and may match non-BIDS-compliant files
JSON processing requires jq to be installed
Sort operations use version sort for natural ordering of numbers

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
LICENSE		LICENSE
README.md		README.md
generate_entity_patterns.sh		generate_entity_patterns.sh
libBIDS.sh		libBIDS.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

libBIDS.sh

Features

Requirements

Usage

Sourcing the Library

Command-Line Execution

Core Functions

`libBIDSsh_parse_bids_to_csv`

Filtering and Subsetting

`libBIDSsh_csv_filter`

`libBIDSsh_drop_na_columns`

JSON Processing

`libBIDSsh_extension_json_rows_to_column_json_path`

`libBIDSsh_json_to_associative_array`

Column Extraction

`libBIDSsh_csv_column_to_array`

Row Iteration

`libBIDS_csv_iterator`

Complete Examples

Basic Dataset Processing

Functional Data Processing

Notes

About

Uh oh!

Releases 2

Packages

Languages

License

CoBrALab/libBIDS.sh

Folders and files

Latest commit

History

Repository files navigation

libBIDS.sh

Features

Requirements

Usage

Sourcing the Library

Command-Line Execution

Core Functions

libBIDSsh_parse_bids_to_csv

Filtering and Subsetting

libBIDSsh_csv_filter

libBIDSsh_drop_na_columns

JSON Processing

libBIDSsh_extension_json_rows_to_column_json_path

libBIDSsh_json_to_associative_array

Column Extraction

libBIDSsh_csv_column_to_array

Row Iteration

libBIDS_csv_iterator

Complete Examples

Basic Dataset Processing

Functional Data Processing

Notes

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Languages

`libBIDSsh_parse_bids_to_csv`

`libBIDSsh_csv_filter`

`libBIDSsh_drop_na_columns`

`libBIDSsh_extension_json_rows_to_column_json_path`

`libBIDSsh_json_to_associative_array`

`libBIDSsh_csv_column_to_array`

`libBIDS_csv_iterator`

Packages