CNA-seq / Correct for GC content

Description

Takes the counts per bin for a CNA-seq data set and corrects them for GC content.

Parameters

Details

Correcting for GC content is necessary because it affects enzyme chemistry and therefore also the depth of sequencing coverage. The correction is performed by rounding the GC contents of each bin to integer values, calculating the median read count for each GC content, and fitting a loess curve to the data.

Output

The input data set with raw counts replaced by GC corrected ones.

References

Manuscript under preparation.