Hi, We are trying to process the single nucleus component of the DiverseCohort single cell multiomics dataset (syn53643818) and we're looking at the matching metadata file (syn53649030). The description of the dataset mentions the following: "Importantly, the pooling of samples was designed such that no pair of individuals was represented in more than one library – this design allowed us to demultiplex and assign nuclei to individuals even without their reference genotype information." However, when looking at the filenames and the "libraryBatch" column in the metadata file, we observe that for many batches (e.g. VR001) there are two samples of the same donor from different brain regions in a batch. For instance, VR001 contains both the temporal gyrus and prefrontal cortex sample for the same individual. Similarly, VR003 contains three samples from a single donor. For these batches, we are therefore wondering how to assign the cells originating from the same individual to their brain regions of origin (e.g. assign neurons in batch VR001 to the prefrontal cortex or temporal gyrus regions)? We are assuming that a standard demultiplexing strategy (e.g. Freemuxlet or Demuxlet) using donor genotypes will not work here? Perhaps cells from different brain regions were given an additional barcode? Another option that we can think of is that the metadata is not correct. Please let us know how to best approach this data and/or whether the metadata is correct. With kind regards, A. Kooijmans

Created by Anoek Kooijmans akooijmans
Thanks @jaclynbeck!
Hello! After investigating this we found that the `libraryBatch` column had been erroneously sorted, so the batches no longer matched up to the correct individuals. This has been fixed and there is now a [new version of the metadata file](https://www.synapse.org/Synapse:syn53649030) (version 5). I've confirmed that all individuals in each batch are now unique. Hopefully this will clear up any issues with assigning cells to each individual. Let us know here if you find any other issues with the batch information. Jaclyn Beck

DiverseCohorts: different samples of same individual in one snRNA-seq sequencing batch page is loading…