Hello, I am interested in analysing this data and was wondering if it is possible to get access to the fastq files for the samples? Currently only the bam files are available. Many thanks Devika

Created by Devika Agarwal dpag0891
This is true, and it escaped being in the documentation; my apologies. The RNA prep used neither poly(A) selection nor Ribo-Zero depletion. According to our vendor, RNA levels were very low in the microglial samples, so the procedure was to use the NuGEN Ovation RNA-Seq System V2. The resulting cDNA was then sheared using our Covaris, and the libraries were prepared using the KAPA LTP Library Preparation Kit.
Hi Corey, Since I first posted the query two months ago, I believe I have found an answer to my initial question about why the mapping rates were about 45% when the FQs I regenerated were quantified against transcripts with Salmon. I have since run Picard CollectRnaSeqMetrics and found a high percentage of non-coding reads in the BAM files. This makes me think that the RNA-seq on these samples was not done with a poly(A) selection method, so a low mapping rate with Salmon, when quantifying against a reference transcriptome (rather than the genome), makes sense. The only reason I asked for the FQ files was the low mapping rate; I was encouraged to request them to make sure there wasn't a problem with the regeneration. Sorry, I should have posted an update to my query. If I am correct, then I believe there is no need to take the time to upload the FQ files. To convert the BAM files to FQ, I first used samtools to sort them by query name and then used the bedtools bamtofastq command. This did give me a lot of warnings about certain reads not being paired correctly or having a missing mate, which is why I was worried. The commands used were based on the bedtools documentation (http://bedtools.readthedocs.io/en/latest/content/tools/bamtofastq.html). Thank you for taking the time to respond to my query. Devika
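For readers following along, the conversion described above (name-sort with samtools, then `bedtools bamtofastq`) might look roughly like the sketch below. The exact options used are not given in the thread, and the file names and the `build_conversion_commands` helper are placeholders invented for illustration, so treat this as an assumption rather than the commands that were actually run.

```python
import subprocess


def build_conversion_commands(bam, prefix):
    """Return the two commands for a paired-end BAM -> FASTQ conversion.

    File names are hypothetical placeholders, not the ones used in the thread.
    """
    sorted_bam = f"{prefix}.qsort.bam"
    # bedtools bamtofastq expects mates on adjacent lines, so sort by read name first.
    sort_cmd = ["samtools", "sort", "-n", "-o", sorted_bam, bam]
    # -fq receives the first mate of each pair, -fq2 the second mate.
    fastq_cmd = [
        "bedtools", "bamtofastq",
        "-i", sorted_bam,
        "-fq", f"{prefix}_R1.fq",
        "-fq2", f"{prefix}_R2.fq",
    ]
    return [sort_cmd, fastq_cmd]


def convert(bam, prefix):
    """Run both steps in order, stopping on the first failure."""
    for cmd in build_conversion_commands(bam, prefix):
        subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Only print the commands here; call convert() to actually execute them.
    for cmd in build_conversion_commands("sample.bam", "sample"):
        print(" ".join(cmd))
```

With this layout, a read whose mate is not on the adjacent line after name-sorting (for example because the mate was filtered out of the BAM) would be expected to trigger the kind of "missing mate" warning mentioned above.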
@dpag0891 Devika, Hello, I'm one of the individuals who run the RNASeq pipeline, so Dave reached out to me and encouraged me to reply. I'm looking at our FQ files for this project, and we have separated these files by batch for each sample, so I'm seeing 192 batches for only 32 samples. Before I take the time to stitch these FQ files back together and upload ~431GB of data, do you mind sharing the command you used to convert the BAMs to FQ, along with the version of bedtools? Unfortunately we use a custom script to convert FQ to BAMs, so I don't have any experience with bedtools. Thanks, Corey
Hi @david_c_airey - I have created this folder, syn11036177, within your project and given you access. It will stay private until the data is uploaded. We have a new method for uploading data, provenance and annotations in bulk, based on creating a manifest. See here: http://docs.synapse.org/articles/uploading_in_bulk.html. Since these are fastq files, don't worry about provenance, but it would be great if we could capture the annotations on upload. I will send you a manifest template that has the keys for the annotations we would like to see (plus a list of values). Thanks!
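As a rough illustration of the manifest approach mentioned above, the sketch below writes a tab-separated file with one row per FASTQ. It assumes the manifest uses `path` and `parent` columns followed by one column per annotation; the annotation keys (`fileFormat`, `specimenID`), the file paths, and the `write_manifest` helper are hypothetical, since the real keys come from the template that is to be sent.

```python
import csv


def write_manifest(files, parent_id, out_path):
    """Write a tab-separated upload manifest with one row per file.

    `files` maps a local path to a dict of annotations. The annotation keys
    are hypothetical; the real ones come from the manifest template.
    """
    # Collect every annotation key so all rows share the same columns.
    annotation_keys = sorted({key for ann in files.values() for key in ann})
    with open(out_path, "w", newline="") as handle:
        writer = csv.writer(handle, delimiter="\t")
        writer.writerow(["path", "parent"] + annotation_keys)
        for path, ann in files.items():
            # Missing annotations are left as empty cells.
            values = [ann.get(key, "") for key in annotation_keys]
            writer.writerow([path, parent_id] + values)
    return annotation_keys


if __name__ == "__main__":
    example = {
        "/data/sample01_R1.fastq.gz": {"fileFormat": "fastq", "specimenID": "sample01"},
        "/data/sample01_R2.fastq.gz": {"fileFormat": "fastq", "specimenID": "sample01"},
    }
    write_manifest(example, "syn11036177", "manifest.tsv")
```

The linked documentation describes how such a file is then submitted for upload.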
Hi Devika and Ben, Yes, we can upload these files. Where do we put them, Ben? Is there a particular Synapse ID we should use? It's been too long since I uploaded to Synapse. -Dave
Thanks Ben
Hi Devika, I'm adding @david_c_airey on this thread as he may be able to answer your question. Ben
Hi Ben, I have managed to regenerate the fastq files using bedtools. However, my mapping rate to the transcriptome with Salmon was not ideal, and I was worried that it might be due to a lot of reads not being recognised as paired during the conversion. Hence I wanted to double-check my work and thought to ask for the fastq files to make sure I haven't gone wrong somewhere along the way. Devika
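One way to check the pairing concern raised above, without access to the original files, is to compare read names position by position in the two regenerated FASTQ files. The sketch below is an assumption-laden illustration: it assumes simple 4-line FASTQ records and read names that may end in `/1` or `/2`, and the helper names are made up for this example.

```python
import gzip
import itertools


def open_fastq(path):
    """Open a plain or gzip-compressed FASTQ file in text mode."""
    return gzip.open(path, "rt") if path.endswith(".gz") else open(path, "r")


def read_names(path):
    """Yield read names without the leading '@' or a trailing /1 or /2."""
    with open_fastq(path) as handle:
        # Assumes 4-line FASTQ records, so the header is every fourth line.
        for header in itertools.islice(handle, 0, None, 4):
            if not header.strip():
                continue
            name = header[1:].split()[0]
            if name.endswith(("/1", "/2")):
                name = name[:-2]
            yield name


def pairing_summary(r1_path, r2_path):
    """Count positions where the two mate files agree or disagree on read name."""
    matched = mismatched = 0
    pairs = itertools.zip_longest(read_names(r1_path), read_names(r2_path))
    for name1, name2 in pairs:
        # A file that runs out early yields None, which counts as a mismatch.
        if name1 == name2:
            matched += 1
        else:
            mismatched += 1
    return {"matched": matched, "mismatched": mismatched}
```

If every position matches, the mate files are at least in sync, which would point away from the conversion as the cause of a low mapping rate.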
Hi Devika, I'm not sure it is, but you should be able to regenerate fastqs from the bams using the picard tool: http://broadinstitute.github.io/picard/command-line-overview.html#BamToBfq Ben

