This repository has been archived by the owner on Jul 9, 2019. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 1
Loading Genomics DB
Jagannath Premkumar edited this page Sep 26, 2016
·
1 revision
This section assumes that you have an existing csv file which has associated data in MetaDB, as well as vid_map and a callset_map. If you do not have this and would like to produce one, you can run the vcf2tile.py
or maf2tile.py
as appropriate without -l
.
To load the data:
-
Create a GenomicsDB loader config as specified here in the GemoicsDB wiki.
-
Assuming GenomicsDB/bin is in your path run:
vcf2tiledb /path/to/loader/config
or if you have mpi partitions:
mpirun -n <num_partitions> [-hostfile <hostfile>] ./bin/vcf2tiledb /path/to/loader/config
Note: mpi options can be specified in load_to_tile.cfg in option1.
-
Sanity Check: provides more information for validating that the variant data was correctly loaded into the TileDB array.
- Variant Store
- Python API
- Utils
- MAF to TileDB Import
- VCF to TileDB Import
- Additional Info