forked from kbaseattic/gene_families
-
Notifications
You must be signed in to change notification settings - Fork 1
/
readme.txt
35 lines (28 loc) · 1.52 KB
/
readme.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
The Gene Families service finds domains (from public protein domain
libraries such as COGs and Pfam) in proteins in a user-supplied
genome.
SETUP:
It requires binaries from HMMER and RPS-BLAST, which are retrieved
using the "download_3rd_party_bins.sh" script, and stored in the
JAR file. This is done automatically as part of the "make" process.
It requires several public PSSM and HMM libraries, which are retrieved
using the "prepare_3rd_party_dbs.sh" script. The script also formats
the libraries for use by HMMER or RPS-BLAST. This is done
automatically as part of the "make" process.
The libraries have to be parsed and stored in DomainModelSet objects
in a public KBase workspace called "KBasePublicGeneDomains." This
needs to be done once every time a new version of the libraries is
desired. The code to do this is in
us.kbase.kbasegenefamilies.prepare.DomainModelLibPreparation
and needs to be run by a developer with write access to that workspace.
(A token needs to be stored in auth.properties to grant this access.)
Once the libraries are downloaded using the script above, the
developer needs to run "make prepare-library-objects"
COMPILING:
To compile this yourself, you need certain branches of other modules:
1) the dev-prototypes branch of typecomp
2) the dev branch of java_type_generator
Do "make compile-java-typespec" to regenerate the java files from the
spec, and "make compile-typespec" to regenerate client libraries.
However, to make the deployment team's job easier, all these generated
files are currently checked in.