Hi all, I just started using the ROSMAP WGS data. the joint VCF files contain 1196 subjects, and looking at the metadata, they all seem to be either from 'MAP' or 'ROS' cohorts. However, according to the cohorts' description here https://adknowledgeportal.synapse.org/Explore/Studies/DetailsPage/StudyDetails?Study=syn22264775, the number of samples should be the following: ROSMAP: 1200 samples MSBB: 349 samples, Mayo: 349 samples, which sum is 1898. This is the same number included in the VCF files' name: NIA_JG_1898_samples_GRM_WGS_b37_JointAnalysisXXXX. So what I would like to know is: 1) do the 1196 come from the three datasets? 2) where are the missing ~700? Thank you very much

Created by Marianna Sanna marianna
Hi Jared, thank you for you reply. I will then use those files. Best wishes, Marianna
Hi Marianna, The VCF file of the three cohorts are split by cohort and chromosome. ROSMAP: https://www.synapse.org/#!Synapse:syn11707419 Mayo: https://www.synapse.org/#!Synapse:syn11707308 MSBB: https://www.synapse.org/#!Synapse:syn11707204 The VCF file for everything is very large (you saw 25 GB for all of ROSMAP only), so that's why the splits occur. Let me know if you have further questions. Regards, Jared
Marianna, Thanks Marianna. I will look into the VCF file shortly. Regards, Jared
Hi Jared Hendrickson, thank you for getting back to me. One of the files is _syn11714389_ . Thanks for looking into this. Best wishes, Marianna
Hi Marianna Sanna, Can you please direct me to the exact VCF file you are looking at, preferably by Synapse ID? From there, I can do some data exploration and contact other members of my team if needed. Regards, Jared

.sg-noscript { font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Helvetica, Arial, sans-serif; max-width: 860px; margin: 40px auto; padding: 0 24px; color: #141414; line-height: 1.6; } .sg-noscript h1 { font-size: 1.8rem; margin-bottom: 0.25rem; } .sg-noscript h2 { font-size: 1.2rem; margin-top: 2rem; margin-bottom: 0.5rem; border-bottom: 1px solid #e0e0e0; padding-bottom: 0.25rem; } .sg-noscript ul { padding-left: 1.5rem; } .sg-noscript li { margin-bottom: 0.4rem; } .sg-noscript a { color: #1a6fa8; } .sg-noscript address { font-style: normal; } .sg-noscript .note { margin-top: 2rem; color: #666; font-size: 0.85rem; }

Synapse — A Collaborative Platform for Open Biomedical Science

Synapse is a collaborative data-sharing and analysis platform built and operated by Sage Bionetworks, a 501(c)(3) nonprofit biomedical research organization based in Seattle, Washington.

About Sage Bionetworks

Sage Bionetworks is a nonprofit research organization whose mission is to drive a new age of discovery through truly open science and radical collaboration.

Our vision is to create a world where silos within and across science and technology no longer exist, forging a path to optimal human health.

We are a trusted leader in data sharing and reuse, enabling a rapid acceleration in biomedical discoveries and the transformation of medicine. Better Science Together is the principle that guides our work with researchers, clinicians, patient communities, and funders worldwide.

What Synapse Does

Synapse is the platform Sage Bionetworks uses to make biomedical research data findable, accessible, interoperable, and reusable (FAIR). Researchers, clinicians, and data scientists use Synapse to:

Share large biomedical datasets across institutions, with appropriate access controls, data-use agreements, and governance.
Run reproducible analyses on shared data with documented provenance.
Coordinate consortium science across disease areas including Alzheimer's disease, neurofibromatosis, ALS, rare cancers, and others.
Power public-facing knowledge portals such as the AD Knowledge Portal, the NF Data Portal, and the ALS Knowledge Portal.

Nonprofit Identity

Sage Bionetworks
A 501(c)(3) nonprofit research organization
EIN: 26-4489946
Seattle, Washington, USA
sagebionetworks.org
Trust Center — Terms of Service, Privacy Policy, financial statements, and governance documents

Learn More

This static content is provided for search engines and users with JavaScript disabled. For the full Synapse experience, please enable JavaScript in your browser.

Drop files to upload

WGS data sample size page is loading…