SQL Schema
lohrd edited this page Jul 21, 2020 · 14 revisions
The following image shows the format of the summary reports that are generated upon each serratus run:
The SQL Schema for accessing each summary report is made up of four tables: Runs, FamilySections, AccessionSections, and FastaSections.
'Runs' corresponds to the first line of the summary file, which holds the SRA, reference genome, and date. This table has a one-to-many relationship with the three following tables, all linked by the SRA and the auto-generated RunId.
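The one-to-many link described above can be sketched with an in-memory SQLite database. This is only an illustration, not the project's actual DDL: the table names and the `RunId` and `Sra` columns come from this page, while the column types, the `Genome` and `RunDate` column names, and the placeholder values are assumptions.

```python
import sqlite3

# Illustrative sketch only. Table names, RunId, and Sra come from the page;
# column types, Genome, RunDate, and all inserted values are assumptions.
conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")

conn.execute("""
    CREATE TABLE Runs (
        RunId   INTEGER PRIMARY KEY AUTOINCREMENT,  -- auto-generated key
        Sra     TEXT NOT NULL,
        Genome  TEXT,                               -- hypothetical column name
        RunDate TEXT                                -- hypothetical column name
    )
""")

# Each child table carries RunId and Sra so its rows link back to one run.
for child in ("FamilySections", "AccessionSections", "FastaSections"):
    conn.execute(f"""
        CREATE TABLE {child} (
            RunId INTEGER NOT NULL REFERENCES Runs(RunId),
            Sra   TEXT NOT NULL
        )
    """)

# One run (placeholder values) with two family-section lines attached to it.
cur = conn.execute(
    "INSERT INTO Runs (Sra, Genome, RunDate) VALUES (?, ?, ?)",
    ("SRR0000000", "example_genome", "2020-07-21"),
)
run_id = cur.lastrowid
conn.executemany(
    "INSERT INTO FamilySections (RunId, Sra) VALUES (?, ?)",
    [(run_id, "SRR0000000"), (run_id, "SRR0000000")],
)

# Join from the single Runs row to its many FamilySections rows.
family_rows = conn.execute(
    """
    SELECT COUNT(*)
    FROM Runs r
    JOIN FamilySections f ON f.RunId = r.RunId
    WHERE r.Sra = ?
    """,
    ("SRR0000000",),
).fetchone()[0]
```

The same join pattern would apply to AccessionSections and FastaSections, since all three child tables share the `RunId` and `Sra` link.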
'FamilySections' corresponds to the next section of the summary report, where the data for the pan-genome is present. The columns present on this table are as follows:
- FamilySectionId:
- FamilySectionLineId:
- RunId:
- Sra:
- Family:
- Score:
- PctId:
- Aln:
- Glb:
- PanLen:
- Cvg:
- Top:
- TopAln:
- TopName:
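As a rough sketch, the column list above could be declared as follows in SQLite. The column names are taken from the list; the page does not state any types, so the few types shown are guesses, and the remaining columns are deliberately left untyped (which SQLite permits).

```python
import sqlite3

# Illustrative sketch only: column names come from the list above; the types
# on the id and text columns are assumptions, and the rest are left untyped
# because the page does not state them.
conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE FamilySections (
        FamilySectionId     INTEGER PRIMARY KEY AUTOINCREMENT,
        FamilySectionLineId INTEGER,
        RunId               INTEGER,  -- links back to Runs.RunId
        Sra                 TEXT,     -- links back to Runs.Sra
        Family              TEXT,
        Score,
        PctId,
        Aln,
        Glb,
        PanLen,
        Cvg,
        Top,
        TopAln,
        TopName             TEXT
    )
""")

# PRAGMA table_info returns one row per column; index 1 is the column name.
columns = [row[1] for row in conn.execute("PRAGMA table_info(FamilySections)")]
```

Listing the columns through `PRAGMA table_info` is a quick way to check that a deployed table matches the names documented here.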