Download PennCNV

PennCNV is a software tool originally developed for CNV detection from Illumina whole-genome SNP genotyping arrays. It has since been extended to handle candidate gene SNP arrays, recent high-density arrays with non-polymorphic markers (so-called CN markers), and Affymetrix genome-wide arrays.

PennCNV-Affy is a collection of data pre-processing protocols and helper scripts that convert Affymetrix CEL files into a data format suitable for CNV detection by PennCNV.

1 PennCNV main package
2 PennCNV example data sets
3 PennCNV-Affy protocols
4 Genome Browser tracks for commonly used SNP arrays
5 Archives of older versions of PennCNV

 

PennCNV main package

The latest version (2009Aug27) of the PennCNV main package is linked below. The package includes both source code and pre-compiled executables for several commonly used system architectures, including 32-bit Windows. (Because several unzipping programs under Windows have problems handling *.tar.gz files, Windows users should download the zip file instead. The contents of the two files are identical.)
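If the local unzipping software mishandles *.tar.gz files, either archive can also be unpacked with Python's standard library. The following is only a sketch: the function name unpack_package and the file names in the usage comment are hypothetical placeholders, not the actual download names.

```python
import shutil
from pathlib import Path


def unpack_package(archive, dest):
    """Unpack a .tar.gz or .zip archive into dest; return the extracted file paths."""
    dest = Path(dest)
    dest.mkdir(parents=True, exist_ok=True)
    # shutil chooses the tar or zip extractor from the file extension.
    shutil.unpack_archive(str(archive), str(dest))
    # Report paths relative to dest, so the two archive types can be compared.
    return sorted(
        p.relative_to(dest).as_posix() for p in dest.rglob("*") if p.is_file()
    )


# Hypothetical usage; substitute the archive that was actually downloaded:
#   files = unpack_package("penncnv.tar.gz", "penncnv")
```

Because the function returns a sorted list of relative paths, unpacking the tar.gz and zip downloads into two separate directories and comparing the two lists is a quick way to check that both were extracted completely.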

Additionally, the hc12v1.hg18.pfb and hc12v1.hg18.gcmodel files are provided specifically for the HumanCytoSNP12 V1 array, which has 300K markers. The ho1v1.hg18.pfb and ho1v1.hg18.gcmodel files are provided specifically for the HumanOmni1 QuadV1 array.

 

PennCNV example data sets

This example data set contains genotyping data for a father-mother-offspring trio, genotyped on the Illumina HumanHap550 array. The file is in BeadStudio project format and can be opened by the Illumina BeadStudio software. The file is used in the tutorial on this web site.

This file is a text file exported from the Illumina BeadStudio software using the above project file. It lets users without access to BeadStudio test PennCNV.

This “serial dilution” data set contains genotyping data for a sample genotyped five times, each time at a ~2-fold dilution. For more details, please see the Diskin et al. paper. The file is in BeadStudio project format and can be opened by the Illumina BeadStudio software.

 

PennCNV-Affy protocols

This package contains PennCNV-Affy protocols and helper scripts for handling Affymetrix Mapping 500K, genome-wide 5.0 and genome-wide 6.0 data as raw CEL files. For instructions on how to use the PennCNV-Affy package, please refer to the PennCNV-Affy page of this web site. The package below was updated on 2009Aug27 with very minor bug fixes only; it should produce results identical to those of the previous version.

 

Genome Browser tracks for commonly used SNP arrays

Clicking any of the links below loads the corresponding track in the UCSC Genome Browser in hg18 coordinates (the 2006 human genome assembly). Each track may take several minutes to load. Users can visualize the marker coverage of each array and compare arrays with one another. To hide the marker names, choose the “dense” option from the drop-down menu on the browser page; each marker is then displayed as a red vertical bar.

All the genome coordinates are also annotated in the PFB files supplied with the PennCNV package. The Illumina data are based on the annotation in Illumina's sample files. The Affymetrix data are based on the na26 (July 2008) annotation from the Affymetrix website.
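For users who prefer to inspect marker coordinates directly rather than in the browser, the PFB file can be read with a short script. The sketch below assumes a tab-delimited PFB file with a header line and the columns Name, Chr, Position and PFB; that layout is an assumption here and should be checked against the files in the package. The function name read_pfb_coordinates and the two markers are made up for illustration.

```python
import csv
import io


def read_pfb_coordinates(handle):
    """Return {marker_name: (chromosome, position)} from a PFB-style file handle."""
    coords = {}
    # Assumed layout: tab-delimited, header line "Name  Chr  Position  PFB".
    reader = csv.DictReader(handle, delimiter="\t")
    for row in reader:
        coords[row["Name"]] = (row["Chr"], int(row["Position"]))
    return coords


# Made-up two-marker example, not real array annotation.
example = io.StringIO(
    "Name\tChr\tPosition\tPFB\n"
    "marker_a\t1\t1000\t0.25\n"
    "marker_b\tX\t2500\t0.5\n"
)
coords = read_pfb_coordinates(example)
print(coords["marker_b"])
```

To use it on a real file, pass an open file handle, for example open("hc12v1.hg18.pfb", newline=""), in place of the in-memory example.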

Please report annotation mistakes or bugs, or request additional arrays.

 

Archives of older versions of PennCNV

The older versions of PennCNV are listed below, each with a one-sentence description of changes. Using only the latest version is highly recommended. A more detailed change log is recorded in the program code itself.

  • Latest version (2009Aug27): minor bug fixes; added --lastchr argument to detect_cnv.pl to handle non-human arrays. Added infer_snp_allele.pl program to infer CNV-based SNP genotypes, or to validate de novo CNVs and assign P-values. Reorganized kext/ directory structure to accommodate different Perl versions. Fixed the missing "-" before "minsnp" problem in the BeadStudio/GenomeStudio plug-in. Enhanced functionality of convert_cnv.pl, which now handles XML files exported from BeadStudio/GenomeStudio. Added -reciprocal argument to the compare_cnv.pl program to fine-tune the -minoverlap argument. Updated scan_region.pl program for better functionality and more accurate control of overlapping criteria. See the "PennCNV main package" section above for links to the download files.
  • 2008Nov19 version: added functionality to the filter_cnv.pl and compare_cnv.pl programs.
  • 2008Jun26 version: GC-model signal pre-processing to handle low-quality samples; family-based CNV calls by joint-calling algorithm; other enhancements
  • 2008Mar11 version: better compatibility with BeadStudio, beta-version of CNV visualization
  • 2008Mar03 version: beta-version for using PennCNV with BeadStudio plug-in, beta-version for signal adjustment for wavy samples
  • 2008Feb15 version: minor changes with more QC summary
  • 2007Dec14 version: added support for 64-bit systems and Cygwin; prints sample quality measures; produces confidence scores via the --conf argument (experimental feature)
  • 2007Nov13 version: cumulative minor bug fixes and function enhancements
  • 2007Oct31 version: fix bugs for chrX processing; fix bugs in kcolumn.pl for splitting huge files
  • 2007Sep28 version: re-implement calling algorithm to process each chromosome separately; change output format
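
The 2009Aug27 entry above mentions a -reciprocal argument that fine-tunes -minoverlap in compare_cnv.pl. The general idea of a reciprocal overlap criterion can be sketched as follows. This illustrates the common definition, in which the shared segment must cover at least the given fraction of both regions; it is an assumption about the concept, not a transcription of what compare_cnv.pl does, and the function names are hypothetical.

```python
def overlap_length(start1, end1, start2, end2):
    """Number of shared base pairs between two closed intervals (0 if disjoint)."""
    return max(0, min(end1, end2) - max(start1, start2) + 1)


def passes_overlap(a, b, min_overlap, reciprocal=False):
    """Check whether region a overlaps region b by at least min_overlap (a fraction).

    Without reciprocal, only the fraction of a covered by b is tested.
    With reciprocal, the fraction of b covered by a must pass as well.
    """
    shared = overlap_length(a[0], a[1], b[0], b[1])
    frac_a = shared / (a[1] - a[0] + 1)
    if not reciprocal:
        return frac_a >= min_overlap
    frac_b = shared / (b[1] - b[0] + 1)
    return frac_a >= min_overlap and frac_b >= min_overlap


# A small region lying fully inside a much larger one.
small = (1000, 1999)   # 1,000 bp
large = (1, 10000)     # 10,000 bp
print(passes_overlap(small, large, 0.5))
print(passes_overlap(small, large, 0.5, reciprocal=True))
```

In this example the small region is entirely covered by the large one, so the one-way test passes, while the reciprocal test fails because only 10% of the large region is shared.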