Hi all,
I found a sample discrepancy in the MSBB RNAseq dataset between the individual study and the consortium-level reprocessing. The count matrix file for the individual study (syn7391749) contains 938 samples in total, whereas the one in the rnaSeqReprocessing folder (syn10507727) contains 1026 samples, and the former is not a strict subset of the latter. My understanding is that the rnaSeqReprocessing project applied a uniform RNAseq processing pipeline to data from the three studies (MSBB, ROSMAP, Mayo). In that case, one would expect the same number of samples before and after reprocessing. What could explain this discrepancy? Any help is much appreciated!
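In case it helps anyone reproduce the comparison, here is a minimal sketch of the kind of check I mean. It assumes both count matrices are tab-delimited with one column per sample and gene identifiers in the first column. The helper names and the toy sample IDs are hypothetical, so adjust them to the actual layout of the downloaded files.

```python
import csv


def read_sample_ids(path, delimiter="\t", skip_cols=1):
    """Return the sample IDs found in the header row of a count matrix.

    Assumption: samples are columns and the first `skip_cols` columns hold
    gene identifiers. If the header has no label for the gene column
    (as with some R-exported tables), pass skip_cols=0.
    """
    with open(path, newline="") as fh:
        header = next(csv.reader(fh, delimiter=delimiter))
    return set(header[skip_cols:])


def compare_samples(ids_a, ids_b):
    """Summarise how two collections of sample IDs overlap."""
    a, b = set(ids_a), set(ids_b)
    return {
        "only_in_a": sorted(a - b),   # present in A, missing from B
        "only_in_b": sorted(b - a),   # present in B, missing from A
        "shared": sorted(a & b),      # present in both
        "a_is_subset_of_b": a <= b,   # True only if every A sample is in B
    }


if __name__ == "__main__":
    # Toy IDs for illustration only, not real MSBB sample names.
    individual = ["S1", "S2", "S3"]
    reprocessed = ["S2", "S3", "S4", "S5"]
    summary = compare_samples(individual, reprocessed)
    for key, value in summary.items():
        print(key, value)
```

With the real files, `read_sample_ids` would be called once per downloaded count matrix and the two resulting sets passed to `compare_samples`. The `only_in_a` list is what shows that the individual-study matrix is not contained in the reprocessed one.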
Best,
Jiali