Contaminated ASE

Data Production Methods

This dataset comprises synthetically contaminated allele-specific expression (ASE) data from 40 lymphoblastoid cell line (LCL) samples from the GEUVADIS Consortium. Contamination was simulated by spiking in a set proportion of reads from the contaminating sample NA19159; the proportion is identified by the filename, <contamination_in_percent>.csv. ASE was produced at the variant level following best-practice protocols.

Data Structure

The xz archive contains one <contamination_in_percent>.csv file per contamination level; each file holds the ASE data for all 40 samples at that contamination percentage. Rows correspond to gene IDs (identified by Ensembl ID) and column names are the sample IDs.
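
The layout above could be read with something like the following minimal sketch. It rests on several assumptions not stated in this README: that the archive is an xz-compressed tarball, that the first column of each CSV holds the gene ID, and that the first row holds the sample IDs. The function name `load_ase_tables` is hypothetical, and values are kept as strings because the exact value format is not documented here.

```python
import csv
import io
import os
import tarfile


def load_ase_tables(archive_path):
    """Return {contamination_label: {gene_id: {sample_id: value}}}.

    Assumes an xz-compressed tar archive of CSV files whose first column
    is the gene ID and whose header row lists the sample IDs.
    """
    tables = {}
    with tarfile.open(archive_path, mode="r:xz") as archive:
        for member in archive.getmembers():
            if not (member.isfile() and member.name.endswith(".csv")):
                continue
            # The filename stem is the contamination percentage label.
            label = os.path.splitext(os.path.basename(member.name))[0]
            raw = archive.extractfile(member).read().decode("utf-8")
            reader = csv.reader(io.StringIO(raw))
            header = next(reader)
            sample_ids = header[1:]  # first header cell labels the gene column
            table = {}
            for row in reader:
                if not row:
                    continue  # skip blank lines
                table[row[0]] = dict(zip(sample_ids, row[1:]))
            tables[label] = table
    return tables
```

Calling `load_ase_tables` on the archive path would then give one table per contamination level, keyed by the filename stem, so that the value for a given gene and sample can be looked up as `tables[label][gene_id][sample_id]`.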