jhu-usc.edu_LAML_HumanMethylation450

Created By Kyle Ellrott kellrott
This data archive contains the Cancer Genome Atlas (TCGA) analysis of DNA methylation profiling using the IIllumina Infinium HumanMethylation450 platform. The Infinium platform analyzes up to 482,421 CpG dinucleotides and 3091 CpH trinucleotides, spanning gene-associated elements as well as intergenic regions. DNA samples were received, bisulfite converted and cytosine methylation was evaluated using IIllumina Infinium HumanMethylation450 microarrays. This archive contains Infinium DNA methylation data for LAML samples. The files contained in each data level package are as follows: AUX: Auxilary directory with .csv mappings, .sdf files, and processing logs. LEVEL 1: Level 1 data contain raw IDAT files (two per sample) as produced by the iScan system and as mapped by the SDRF and also in the disease mapping file. These files can be directly processed by the methylumi or minfi R packages; a comma-separated value (CSV) file in the AUX archive is provided to ease this. LEVEL 2: Level 2 data contain background-corrected methylated (M) and unmethylated (U) summary intensities as extracted by the methylumi package. Non-detection probabilities (P-values) were computed as the minimum of the two values (one per allele) for the empirical cumulative density function of the negative control probes in the appropriate color channel. Background correction is performed via normal-exponential deconvolution (currently NOT stratified by probe sequence). Multiple-batch archives have the intensities in each of the two channels multiplicatively scaled to match a reference sample (sample with R/G ratio closest to 1.0.) LEVEL 3: Derived summary measures (beta values: M/(M+U) for each interrogated locus) with annotations for gene symbol, chromosome (UCSC hg19, Feb 2009), and CpG/CpH coordinate (UCSC hg19, Feb 2009). Probes having a SNP within 10bp of the interrogated CpG site or having 15bp from the interrogated CpG site overlap with a REPEAT element (as defined by RepeatMasker and Tandem Repeat Finder Masks based on UCSC hg19, Feb 2009) are masked as NA across all samples, and probes with a non-detection probability (P-value) greater than 0.05 in a given sample are masked as NA on that chip. Probes that are mapped to multiple sites on hg19 are annotated as NA for chromosome and 0 for CpG/CpH coordinate

freeze: tcga_pancancer_v4
acronym: LAML
disease: cancer
species: Homo sapiens
fileType: genomicMatrix
platform: HumanMethylation450
dataCenter: jhu-usc.edu
lastUpdate: 2012-10-01
tissueType: blood
dataSubType: DNAMethylation
lastUpdated: 2012-10-01