Core Flow Concepts

Filesets

What is a fileset?

A fileset is a way to group files that belong together.

What do we use filesets for?

We use filesets:

  • To link the two paired fastq files in a paired-end sample. In this case the fileset has the sample attribute set to the sample in question.
  • To link the two paired multiplexed fastq files in a paired-end multiplexed sequencing run.
  • To link the raw files of a genome together. In this case the organism field will be set to the organism in question, and often the URL will be set.

You can also use filesets to make arbitrary groupings of data.

What’s the difference between a genome and a fileset?

A genome is the informal name for a fileset that has its organism field set to some organism. It is a specific kind of fileset.
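
As a minimal sketch of this relationship, the fileset can be pictured as a small record with optional attributes. The class, its field names, and the file names below are hypothetical and only illustrate the idea; they are not Flow's actual schema or API.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Fileset:
    """Hypothetical model of a fileset; not Flow's real data model."""
    files: List[str] = field(default_factory=list)  # names of the grouped files
    sample: Optional[str] = None    # set when pairing a sample's fastq files
    organism: Optional[str] = None  # set when the fileset holds a genome
    url: Optional[str] = None       # often set for genomes

    @property
    def is_genome(self) -> bool:
        # A genome is simply a fileset whose organism field is set.
        return self.organism is not None


# Example values are made up for illustration.
paired = Fileset(files=["reads_R1.fastq", "reads_R2.fastq"], sample="sample-1")
genome = Fileset(files=["genome.fa", "annotation.gtf"], organism="Homo sapiens")
```

In this sketch `paired.is_genome` is false and `genome.is_genome` is true: the only thing that distinguishes the two is whether the organism is set.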

How do I create a fileset?

Currently the way to create a fileset is:

  • Upload the first file of a fileset as normal.
  • Upload the second file of the fileset, and select the first file as the data to pair with.
  • Once uploaded, this creates the fileset.

How do I add more than two files to the fileset?

From the edit fileset page, you can add a third, fourth, or further data file, or remove files. You don’t need to do anything special when you upload this data.

I just want to make a genome

Create a fileset in the way described above, and on the edit fileset page set the organism. Flow will now show it in the list of genomes.

The fileset I want to make is for the first or second use case above

If you are uploading paired-end samples, or paired-end multiplexed data, use the Demultiplexed or Multiplexed tab respectively on the upload page, and it will handle the pairing for you.
