SCRATCH-CNV Subworkflows

Introduction

This repository contains subworkflows for performing copy number variation (CNV) analysis on single-cell RNA-seq (scRNA-seq) data. The CNV subworkflows support multiple tools including inferCNV, SCEVAN, and CopyKAT to assess tumor heterogeneity and identify chromosomal aberrations.

Disclaimer: Subworkflows are high-level wrappers around chained Nextflow modules. They should be used as part of a pipeline and can be extended or reused across different scRNA-seq workflows.

Prerequisites

Ensure the following tools are installed:

Nextflow (v21.04.0 or higher)
Java (v8 or higher)
Singularity or Docker for container execution
Git
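
The list above can be checked before the first run with a small script. This is a minimal sketch, not part of the repository: the helper names `have` and `check_prereqs` are hypothetical, and it only confirms that each tool is on the PATH. It does not verify the minimum versions.

```shell
#!/bin/sh
# Report whether a command is available on PATH.
have() {
  command -v "$1" >/dev/null 2>&1
}

# Check the tools listed above; print one line per requirement and
# return the number of missing requirements.
check_prereqs() {
  missing=0
  for tool in nextflow java git; do
    if have "$tool"; then
      echo "ok       $tool"
    else
      echo "missing  $tool"
      missing=$((missing + 1))
    fi
  done
  # Only one container runtime is needed.
  if have singularity || have docker; then
    echo "ok       container runtime (singularity or docker)"
  else
    echo "missing  container runtime (singularity or docker)"
    missing=$((missing + 1))
  fi
  return "$missing"
}

check_prereqs || echo "Install the missing tools before running the pipeline."
```

Versions can then be confirmed by hand with `nextflow -version` and `java -version`.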

Installation

Clone the repository:

git clone https://github.com/WangLab-ComputationalBiology/SCRATCH-CNV.git
cd SCRATCH-CNV

Subworkflows

`main.nf`

This is the main entry script that orchestrates CNV analysis using inferCNV, SCEVAN, and CopyKAT subworkflows.

1. inferCNV

Infers CNVs by comparing gene expression intensities in observation (tumor) cells against those in reference (normal) cells.

Usage

nextflow run main.nf -profile singularity --input_seurat_object <path/to/seurat_object.RDS> --input_reference_table <path/to/reference_table.csv>

Parameters

--input_seurat_object: Seurat object (.RDS) containing a UMAP reduction and a counts layer
--input_reference_table: CSV with barcode and reference label columns
--project_name: Output project name (optional)
--skip_infercnv: Skip running inferCNV (default: false)
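
Before passing a table to --input_reference_table, it can help to count how many cells carry each label. The sketch below is illustrative only and rests on assumptions the README does not confirm: that the CSV has a header row, that the barcode is in column 1 and the label in column 2, and that the labels look like the hypothetical `reference` and `observation` values in the demo. The function name `summarize_reference_table` is also hypothetical.

```shell
#!/bin/sh
# Count cells per label in a two-column CSV (barcode,label).
# ASSUMPTIONS: the first row is a header, the barcode is column 1 and the
# reference label is column 2; adjust if your table is laid out differently.
summarize_reference_table() {
  awk -F',' '
    NR == 1 { next }                      # skip the assumed header row
    NF < 2  { bad++; next }               # rows lacking a label column
    { count[$2]++ }
    END {
      for (label in count) printf "%s\t%d\n", label, count[label]
      if (bad > 0) printf "rows_missing_label\t%d\n", bad
    }
  ' "$1" | sort
}

# Demo with a small hypothetical table written to a temporary file.
demo=$(mktemp)
cat > "$demo" <<'EOF'
barcode,label
AAAC-1,reference
AAAG-1,reference
AAAT-1,observation
EOF
summarize_reference_table "$demo"
rm -f "$demo"
```

A label with zero or very few cells is worth investigating before launching a long run.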

2. SCEVAN

Performs CNV detection using Bayesian inference across multiple tumor samples.

Parameters (passed via `ext.args`)

project_name: Name for output and figures
input_model: Organism (e.g., human)
n_threads: Number of threads
n_memory: Memory in GB
workdir: Working directory for outputs
auto_save: Save intermediate objects (true/false)

3. CopyKAT

Optional module for an alternative CNV inference strategy.

Example

nextflow run main.nf -profile singularity \
  --input_seurat_object project_cluster_object.RDS \
  --input_reference_table assets/OV_reference_table.csv \
  --project_name OV_CNV \
  -resume

Annotated Object Requirement

To ensure successful CNV analysis, your input Seurat object must include one of the following annotation columns in meta.data and contain the minimum required cell types:

Annotated Metadata Requirements

Annotation Column	Required Cell Types	Role
`azimuth_labels`	B cell, T cell, Fibroblast, Epithelial	Reference + Observation
`sctype`	B_Plasma_Cells, T_Cells, Fibroblast, Epithelial	Reference + Observation
`cell_label`	B cell, T cell, Fibroblast, Epithelial	Reference + Observation

Note: The presence of these cell types is critical to define both reference (normal) and observation (tumor) populations.

Configuration

Default parameters and paths can be set in nextflow.config. Use institutional profiles for HPC environments.

Output

./<project_name>/data/infercnv: CNV matrices and plots from inferCNV
./<project_name>/data/scevan: CNV profiles and oncoheatmaps from SCEVAN
./<project_name>/report: Consolidated report
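
After a run, the three locations listed above can be checked in one pass. This is a minimal sketch; the function name `check_outputs` is hypothetical, and `OV_CNV` is simply the project name from the example command earlier in this README.

```shell
#!/bin/sh
# List which of the documented output directories exist under a project
# directory. Returns non-zero if any of them is absent.
check_outputs() {
  project_dir=$1
  status=0
  for sub in data/infercnv data/scevan report; do
    if [ -d "$project_dir/$sub" ]; then
      echo "found    $project_dir/$sub"
    else
      echo "absent   $project_dir/$sub"
      status=1
    fi
  done
  return "$status"
}

# Example call using the project name from the example run.
check_outputs ./OV_CNV || echo "Some output directories are absent; check whether a tool was skipped or failed."
```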

Notes on Troubleshooting

For inferCNV HMM mode on large matrices, increase cutoff (e.g., 0.25) or use HMM_type = "i3" to reduce model complexity.
For failed tasks, inspect .nextflow.log in the launch directory and .command.out in the task's work directory.
Define every required parameter in the ext.args block for modules such as SCEVAN; a missing parameter can crash the pipeline.
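
Locating the work directory of a failed task by hand can be tedious. The sketch below relies on the `.exitcode` file that Nextflow writes into each task directory alongside `.command.out`; the function name `failed_task_dirs` is hypothetical, and the default `work` root assumes the run did not override the work directory. Tasks that were killed before an exit status was recorded will not be listed.

```shell
#!/bin/sh
# Print the work directories of tasks whose recorded exit status is non-zero.
# Nextflow writes each task's exit status to a .exitcode file next to
# .command.out and .command.err.
failed_task_dirs() {
  work_root=${1:-work}
  [ -d "$work_root" ] || return 0
  find "$work_root" -name .exitcode -type f | while IFS= read -r f; do
    code=$(cat "$f")
    if [ "$code" != "0" ]; then
      dirname "$f"
    fi
  done
}

# Show the last lines of output for each failed task under ./work.
failed_task_dirs work | while IFS= read -r d; do
  echo "== $d"
  tail -n 20 "$d/.command.out" 2>/dev/null || true
done
```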

Contributing

Open issues or submit PRs for bugs, enhancements, or suggestions.

License

This project is licensed under the GNU General Public License v3.0.

Contact

For help and questions:
