I am trying to analyze the proteomics data published on synapse.org. In the MayoRnaSeq data neurodegenerative diseases like AD, PSP, pathological aging, etc, there is one cohort with 5 batches and 3 conditions (AD, PSP, and control) in all the 5 batches, there are terms egis and mgis. I am writing because I am unable to understand what those terms mean and why in my PCA, (unable to upload the plot here) these form different clusters as outliers would do. Is it advisable to remove these values from the dataset or do they have important information? Please see the below as an illustration to explain my question. Such terms are present in all batches (b1,b2,b3,b4,b5) mayo_b5_egis_02 mayo_b5_egis_24 mayo_b5_egis_45 mayo_b5_mgis_01 mayo_b5_mgis_23 mayo_b5_mgis_44 Also, these data are not annotated to any conditions (AD, PSP or control ) The data has the following synapseID: syn7431988 and the file I am trying to analyze is Mayo_Proteomics_TC_proteinoutput.txt I tried to look at the Mayo_Proteomics_TC_searchparameters.xml But couldn't understand the 'egis' and 'mgis' terms that appeared in all the 5 batches of the data. I will be grateful if I could understand the above-mentioned parameters and can get answers to my questions.

Created by sumode
Thank you!
Hi @sumode The 'gis' files are global internal standards, which is why they grouping separately in your PCA. You can map the sampleIDs to the individual covariates (syn3817650) through syn9782771. We are working on providing a simpler set of metadata for this study.

.sg-noscript { font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Helvetica, Arial, sans-serif; max-width: 860px; margin: 40px auto; padding: 0 24px; color: #141414; line-height: 1.6; } .sg-noscript h1 { font-size: 1.8rem; margin-bottom: 0.25rem; } .sg-noscript h2 { font-size: 1.2rem; margin-top: 2rem; margin-bottom: 0.5rem; border-bottom: 1px solid #e0e0e0; padding-bottom: 0.25rem; } .sg-noscript ul { padding-left: 1.5rem; } .sg-noscript li { margin-bottom: 0.4rem; } .sg-noscript a { color: #1a6fa8; } .sg-noscript address { font-style: normal; } .sg-noscript .note { margin-top: 2rem; color: #666; font-size: 0.85rem; }

Synapse — A Collaborative Platform for Open Biomedical Science

Synapse is a collaborative data-sharing and analysis platform built and operated by Sage Bionetworks, a 501(c)(3) nonprofit biomedical research organization based in Seattle, Washington.

About Sage Bionetworks

Sage Bionetworks is a nonprofit research organization whose mission is to drive a new age of discovery through truly open science and radical collaboration.

Our vision is to create a world where silos within and across science and technology no longer exist, forging a path to optimal human health.

We are a trusted leader in data sharing and reuse, enabling a rapid acceleration in biomedical discoveries and the transformation of medicine. Better Science Together is the principle that guides our work with researchers, clinicians, patient communities, and funders worldwide.

What Synapse Does

Synapse is the platform Sage Bionetworks uses to make biomedical research data findable, accessible, interoperable, and reusable (FAIR). Researchers, clinicians, and data scientists use Synapse to:

Share large biomedical datasets across institutions, with appropriate access controls, data-use agreements, and governance.
Run reproducible analyses on shared data with documented provenance.
Coordinate consortium science across disease areas including Alzheimer's disease, neurofibromatosis, ALS, rare cancers, and others.
Power public-facing knowledge portals such as the AD Knowledge Portal, the NF Data Portal, and the ALS Knowledge Portal.

Nonprofit Identity

Sage Bionetworks
A 501(c)(3) nonprofit research organization
EIN: 26-4489946
Seattle, Washington, USA
sagebionetworks.org
Trust Center — Terms of Service, Privacy Policy, financial statements, and governance documents

Learn More

This static content is provided for search engines and users with JavaScript disabled. For the full Synapse experience, please enable JavaScript in your browser.

Drop files to upload

Issue with Proteomics data page is loading…