|Publication Type||Journal Article|
|Year of Publication||2005|
|Authors||LaFramboise, T, Weir, BA, Zhao, X, Beroukhim, R, Li, C, Harrington, D, Sellers, WR, Meyerson, M|
|Journal||PLoS Computational Biology|
|Pages||e65 - e65|
|Keywords||Alleles, Cancer, Cell Line, Chromosomes, DNA, Gene Amplification, Gene Dosage, Gene Expression Regulation, Genetic, Haplotypes, Human, Humans, Models, Neoplasm, Neoplasms, Neoplastic, Oligonucleotide Array Sequence Analysis, Polymorphism, Single Nucleotide, Tumor|
Amplification, deletion, and loss of heterozygosity of genomic DNA are hallmarks of cancer. In recent years a variety of studies have emerged measuring total chromosomal copy number at increasingly high resolution. Similarly, loss-of-heterozygosity events have been finely mapped using high-throughput genotyping technologies. We have developed a probe-level allele-specific quantitation procedure that extracts both copy number and allelotype information from single nucleotide polymorphism (SNP) array data to arrive at allele-specific copy number across the genome. Our approach applies an expectation-maximization algorithm to a model derived from a novel classification of SNP array probes. This method is the first to our knowledge that is able to (a) determine the generalized genotype of aberrant samples at each SNP site (e.g., CCCCT at an amplified site), and (b) infer the copy number of each parental chromosome across the genome. With this method, we are able to determine not just where amplifications and deletions occur, but also the haplotype of the region being amplified or deleted. The merit of our model and general approach is demonstrated by very precise genotyping of normal samples, and our allele-specific copy number inferences are validated using PCR experiments. Applying our method to a collection of lung cancer samples, we are able to conclude that amplification is essentially monoallelic, as would be expected under the mechanisms currently believed responsible for gene amplification. This suggests that a specific parental chromosome may be targeted for amplification, whether because of germ line or somatic variation. An R software package containing the methods described in this paper is freely available at http://genome.dfci.harvard.edu/~tlaframb/PLASQ.