This repository contains the scripts and data used in the nf-core/crisprseq publication.
-
benchmarking_results/: Contains tables of resources of different runs with nf-core/crisprseq (crisprseq_full) and CRISPR-Analytics (crispra_full). Contains tables of analysis results run with nf-core/crisprseq (crisprseq_spikes). The tables are obtained from the MultiQC process, which summarises the results. Samples from two projects were analysed, Connelly JP et.al. (2019) (crisp.py) and Sanvicente-García M et.al. (2023) (crispr-A). -
crisprseq_vs_crispra.R: R script used to calculate the average run time and CPU hours of both workflows. -
spikein_benchmarking.R: R script used to plot a comparison of indels detected in spike-in samples with nf-core/crisprseq and compared with the original publications which analysed the data with crisp.py and CRISPR-A. -
public_data/: Contains the projects from the European Nucleotide Archive (ENA) repository filtered by the keywordcrispr. -
analysis_public_data.R: R script used to classify public available projects into different categories, calculate the percentages and plot them.