Skip to content
lohrd edited this page Jul 21, 2020 · 14 revisions

SQL Schema for Summary Reports

The following image shows the format of the summary reports that are generated upon each serratus run:

Summary report example

The SQL Schema for accessing each summary report is made up of four tables: Runs, FamilySections, AccessionSections, and FastaSections.

Runs

'Runs' corresponds to the first line of the summary file, where the data for the SRA, reference genome, and date is present. This table has a one to many relationship the the three following tables, all linked by the SRA and auto-generated RunId.

FamilySections

'FamilySections' corresponds to the next section of the summary report, where the data for the pan-genome is present. The columns present on this table are as follows:

  • FamilySectionId:
  • FamilySectionLineId:
  • RunId:
  • Sra:
  • Family:
  • Score:
  • PctId:
  • Aln:
  • Glb:
  • PanLen:
  • Cvg:
  • Top:
  • TopAln:
  • TopName:
Clone this wiki locally