-
Notifications
You must be signed in to change notification settings - Fork 10
/
Copy pathadamian_06_prediction_794241.pdf.txt
1747 lines (1503 loc) · 66.9 KB
/
adamian_06_prediction_794241.pdf.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"><html xmlns="http://www.w3.org/1999/xhtml">
<head>
<title>1472-6807-6-13.fm</title>
<meta name="Author" content="BMCProductionmachine"/>
<meta name="Creator" content="FrameMaker 7.0"/>
<meta name="Producer" content="Acrobat Distiller 7.0 (Windows)"/>
<meta name="CreationDate" content=""/>
</head>
<body>
<pre>
BMC Structural Biology
BioMed Central
Open Access
Methodology article
Prediction of transmembrane helix orientation in polytopic
membrane proteins
Larisa Adamian* and Jie Liang
Address: Department of Bioengineering, University of Illinois at Chicago, M/C 563, 835 S. Wolcott St, Chicago, IL 60612-7340, USA
Email: Larisa Adamian* - larisa@uic.edu; Jie Liang - jliang@uic.edu
* Corresponding author
Published: 22 June 2006
BMC Structural Biology 2006, 6:13
doi:10.1186/1472-6807-6-13
Received: 08 March 2006
Accepted: 22 June 2006
This article is available from: http://www.biomedcentral.com/1472-6807/6/13
© 2006 Adamian and Liang; licensee BioMed Central Ltd.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0),
which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Abstract
Background: Membrane proteins compose up to 30% of coding sequences within genomes.
However, their structure determination is lagging behind compared with soluble proteins due to
the experimental difficulties. Therefore, it is important to develop reliable computational methods
to predict structures of membrane proteins.
Results: We present a method for prediction of the TM helix orientation, which is an essential
step in ab initio modeling of membrane proteins. Our method is based on a canonical model of the
heptad repeat originally developed for coiled coils. We identify the helical surface patches that
interface with lipid molecules at an accuracy of about 88% from the sequence information alone,
using an empirical scoring function LIPS (LIPid-facing Surface), which combines lipophilicity and
conservation of residues in the helix. We test and discuss results of prediction of helix-lipid
interfaces on 162 transmembrane helices from 18 polytopic membrane proteins and present
predicted orientations of TM helices in TRPV1 channel. We also apply our method to two
structures of homologous cytochrome b6f complexes and find discrepancy in the assignment of TM
helices from subunits PetG, PetN and PetL. The results of LIPS calculations and analysis of packing
and H-bonding interactions support the helix assignment found in the cytochrome b6f structure
from green alga but not the assignment of TM helices in the cyanobacterium b6f structure.
Conclusion: LIPS calculations can be used for the prediction of helix orientation in ab initio
modeling of polytopic membrane proteins. We also show with the example of two cytochrome b6f
structures that our method can identify questionable helix assignments in membrane proteins. The
LIPS server is available online at http://gila.bioengr.uic.edu/lab/larisa/lips.html.
Background
A significant increase in the number of structures of alpha
helical membrane proteins in recent years revealed a
remarkable complexity of interacting transmembrane
(TM) helices. A great variation in length, shape, and tilt
angles relative to the membrane plane is found in helical
membrane proteins. For example, a structure of protein
transporter (1RH5) contains a helix that is only about one
half the length of the TM region, the structure of ClC chloride channel (1KPL) contains discontinuous helices,
while all aquaporin structures (1FX8, 1J4N, 1RC2) contain two half helices important for function. Some TM
helices are tilted and packed within the helical bundle so
that they are only partially exposed to the membrane.
Membrane proteins with ten or more TM helices may have
helices that are completely buried within the helical bunPage 1 of 17
(page number not for citation purposes)
BMC Structural Biology 2006, 6:13
dle [1]. This great diversity only reflects the conformational space of integral alpha-helical membrane proteins
sampled by the existing 30+ unique structures. If the
number of structures of integral membrane proteins
grows exponentially as predicted [2], even greater complexity in structural elements can be expected for transmembrane domains. This poses a great challenge for
prediction and modeling of polytopic membrane proteins
[3].
Recently, the structures of homo-oligomeric transmembrane (TM) proteins were successfully modeled using the
techniques of simulated annealing, molecular dynamics
[4-7], Monte-Carlo simulations [8], and an empirical scoring function designed to specifically distinguish tightly
packed TM oligomers [9]. Ab initio modeling of structures
of polytopic membrane proteins is more complicated
[10]. Recently, the Rosetta structure prediction method,
which uses a new membrane-specific version of the
Rosetta low-resolution energy function, was successfully
implemented for the prediction of structures of polytopic
membrane proteins [11]. Other methods such as MembStruk [12] and PREDICT [13] start from ideal helices that
are later subjected to "coarse" and "fine" optimization
steps by energy minimization and molecular dynamics
simulations. A required step in these methods is the prediction of TM helix orientation. Typically, a hydrophobic
moment for every TM helix is determined [14] under the
assumption that the hydrophobicity moment should
point in the direction towards the lipid bilayer. However,
Stevens and Arkin [15] showed that hydrophobicity
moment alone is a poor indicator of the lipid-accessible
surface in membrane proteins.
A strategy to improve the accuracy of prediction of helix
orientation is to take advantage of available evolutionary
information. It is well known that solvent-exposed residues in both soluble [16] and membrane [17-19] proteins
are less conserved than buried residues. Komiya et al [20]
proposed a method for characterizing the exposure of αhelices to the membrane that was based on the periodicity
of conserved residues. Taylor et al [10] developed an automatic method that can proceed from a scan of the protein
sequences to a predicted three-dimensional structure. In
this method, finding non-conserved hydrophobic positions in multiple sequence alignments identifies the lipidexposed surfaces of TM helices. The same idea was used to
aid in the building of an alpha-carbon template for the
TM helices of rhodopsin [21]. Briggs et al [22] suggested
the use of widely available evolutionary information to
find variable residues within homologous TM helices that
are not important for the native structure. This information then can be used as constraints for global searching
molecular dynamics simulations. Recently, an automated
method for the analysis and prediction of buried and
http://www.biomedcentral.com/1472-6807/6/13
exposed residues of TM proteins with an impressive prediction accuracy of 80% was developed by Beuming and
Weinstein [23]. This method is based on a new amino
acid surface propensity (SP) scale derived from membrane
protein structures and evolutionary conservation of buried and exposed residues. The probability of finding a residue in the protein interior is calculated for every residue
in the TM helix. A cut-off value of this probability, which
depends on the number of sequences in the multiple
sequence alignment (MSA), is used to predict whether the
residue is at an interior or exterior position.
In this study, we propose and extensively test a new
approach for the prediction of helix-lipid interfaces of TM
helices from sequence information alone based on a
canonical model of the alpha helix. This method features
a collective assessment of conservation and physicochemical properties of the residues forming surface
patches along the TM helix. Each surface patch is centered
on one of the positions of the coiled coil heptad repeat
and can be in contact with lipids or other helices. We score
every patch with a scoring function, LIPS (LIPid-facing
Surface), that combines a lipophilicity scale (TMLIP2)
[24] and conservation entropy scores of residues [25]. We
demonstrate using a set of 162 TM helices with helix-lipid
interfaces that our method achieves an accuracy of 88%.
We also describe how our method can be used to resolve
inconsistency in experimental structures of cytochrome
b6f complex orthologs.
Results
Heptad motif in TM helix-helix interactions
The atomic interactions between two helices can be represented graphically as a two-dimensional atomic contact
map as shown in Figure 1, which records the number
count of the interhelical atomic contacts within the
antiparallel left-handed helical pair formed by helices tm
I and tm II of subunit SdhC of succinate dehydrogenase
from E. coli (1NEK) [26]. We use the heptad motif as a
structural template, where the structurally equivalent
positions occur every two turns of the α-helix. The interacting residues on helices tm I and tm II of subunit SdhC
can be mapped to the heptad repeat positions a, d, e and
g by assigning the optimal starting position for the first
interacting residue, i.e., when the first interacting residue
of tm I (Thr23) is assigned to position e of the heptad
repeat, the rest of the interacting residues can be assigned
to positions d, g, a and e of the heptad repeat. Similarly,
the correct starting point for mapping the interacting residue of tm II to the heptad repeat is position e for the first
interacting residue Met74.
We performed a similar mapping on 850 interacting helical interfaces from 425 TM helical pairs, where we
obtained the best alignments of the interacting residues
Page 2 of 17
(page number not for citation purposes)
BMC Structural Biology 2006, 6:13
http://www.biomedcentral.com/1472-6807/6/13
35
H e f g a b c d e f g a b c d e f g a b c d e f g a
N 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47
H N R T A I A S I L H R V S G V I T F V A V G I L L W L
3
74 M
P-LH
P-RH
25
76 G
4
77 I
2 3
78 L
79 T
4
80 A
82 A
3
9 5
11
3
83 Y
8
84 H
20
15
2
2
81 L
3
8 4
85 V
10
5
0
86 V
87 V
a
b
c
d
e
f
g
5
88 G
Heptad repeat position
89 I
90 R
AP-RH
1
75 W
%
e
f
g
a
b
c
d
e
f
g
a
b
c
d
e
f
g
AP-LH
30
1
Figure helicalof helix-helix by helices tm I and tm II from
SdhC subunit pair formed interactions within the lefthanded map of E. coli succinate dehydrogenase (PDB:
Contact1
1NEK)
Contact map of helix-helix interactions within the
left-handed helical pair formed by helices tm I and
tm II from SdhC subunit of E. coli succinate dehydrogenase (PDB: 1NEK). The three top rows and three left
columns contain information on coiled coil heptad positions
abcdefg (marked as H), residue numbers (N) and residue
types (R) for tm I and tm II, respectively. Residues involved
into interhelical interactions between these two TM helices
are in bold. The number of atomic contacts as determined by
INTERFACE calculations with probe radius of 0.5 Å between
two interacting residues is listed at the intersections of the
respective row and column. For example, Ser33 from tm I
has 4 atomic contacts with Ala80, 3 atomic contacts with
Tyr83, and 8 atomic contacts with His84 from tm II, while
neighboring residue Val32 has no contacts with any residue
from tm I.
from every helical interface to the heptad repeat motif.
Figure 2 summarizes the alignments of interacting residues in helices to the positions of the heptad repeat in 294
parallel (192 right-handed, 102 left-handed) and 546
antiparallel (222 right-handed, 324 left-handed) helical
pairs. We find that the majority of interacting residues can
be accurately mapped to heptad repeats.
Left-handed helical pairs have a higher fraction of interacting residues that can be mapped to canonical positions
a, d, e and g of the heptad repeat (94.7% and 92.4% for
parallel and anti-parallel helices, respectively). For righthanded parallel and antiparallel helical pairs, about
86.9% and 86.5% of interacting residues can be mapped
to heptad repeats, respectively. In the right-handed helical
coiled coils, structurally equivalent positions occur every
three helical turns with the undecad repeat a-k [27,28].
Peters et al [] found that the packing of residues in the
right-handed coiled coils follows the general principle of
knobs-into-holes packing of the left-handed coiled coils,
with residues in the a and h positions of the undecad
repeat structurally corresponding to residues in the a and
Figure 2
teins at heptad amino positions abcdefg
interactions of repeat acid residues involved into interhelical
Distribution in transmembrane regions of membrane proDistribution of amino acid residues involved into
interhelical interactions in transmembrane regions
of membrane proteins at heptad repeat positions
abcdefg. Types of helical pairs are: AP-LH – antiparallel lefthanded; AP-RH – antiparallel right-handed; P-LH – parallel
left-handed; P-RH – parallel right-handed. The combined frequencies of interacting residues from helix-helix interfaces at
a, d, e and g positions in the heptad repeat are: left-handed
parallel helical pairs, 94.7%; left-handed antiparallel helical
pairs, 92.4%; right-handed parallel helical pairs, 86.9%; and
right-handed antiparallel helical pairs, 86.5%.
d position of the heptad repeat, respectively. This allows
us to use a heptad repeat as a simplified model for the
interacting residues in the right-handed helical pairs as
well.
Helical faces
Since ~90% of all interacting residues in TM helices can be
aligned to the heptad repeat positions, it is reasonable to
assume that the residues forming lipid-accessible helical
faces should follow similar patterns, as the degrees of freedom in the lipid-facing residues are constrained by the
residues involved in helix-helix interactions, the majority
of which are adequately characterized by the heptad
motifs.
In the canonical model of the alpha helix [29], every seventh residue along the N- to C-terminus direction occupies a position that is roughly two turns away and
underneath the previous one. The positions of every 7th
residue, "the anchoring residue", would form a slightly
twisted surface along the helix, formed by 2 to 5 residues
depending on the length of the helix. To make the number
of residues in a TM helical face consistent, we add every
third and fourth residues to the interfaces defined by the
anchoring residues. These additional residues occupy
positions one turn away from the anchoring residue. Figure 3A shows in spacefill the Cα atoms that form such a
Page 3 of 17
(page number not for citation purposes)
BMC Structural Biology 2006, 6:13
http://www.biomedcentral.com/1472-6807/6/13
mostly lipid-exposed helical faces, which should contain
the least conserved but predominantly lipophilic residues,
as well as the mostly buried helical faces, which should
contain the most conserved and less lipophilic residues.
Figure 3
Model of canonical helical faces
Model of canonical helical faces. (A) Cα atoms that form
a helical face are shown in space filling. Residues 3 and 4, 10
and 11 and 17 and 18 complement the anchoring residues 0,
7, 14 and 21, respectively. (B) Helical wheels illustrating the
seven canonical helical faces centered at each of the seven
positions of the heptad repeat. The central helical wheel contains numbers that label the helical faces 0 through 6. These
numbers correspond to heptad positions a through f around
which the helical surfaces are centered. The corresponding
helical face is depicted as an outer wheel located next to that
label. The outer helical wheels depict residues forming each
of the seven helical faces, e.g., face 0 is formed by residues at
heptad positions a-d-e, face 1: b-e-f, face 2: c-f-g, face 3: d-g-a,
face 4: e-a-b, face 5: f-b-c, and face 6: g-c-d.
helical face. Here, the helical face is centered around the
anchoring residues "0", "7", "14", and "21" and complemented with residues "3" and "4", "10" and "11", and
"17" and "18", respectively. Similarly, taking each of the
seven positions of the heptad repeat in turn as the anchor
position, we can build seven different helical faces. In Figure 3B, seven different helical wheels illustrate these seven
faces anchored at each of the seven positions of the heptad
repeat. The nodes of the central helical wheel contain
numbers labeling the helical faces 0 through 6, which correspond to the heptad positions a through f as anchoring
positions. The corresponding helical face is depicted by a
helical wheel, located directly along the direction of the
vector from the origin to the anchoring residue. The outer
helical wheels depict residues forming each of the seven
helical faces, e.g., face 0 is formed by residues at heptad
positions a-d-e, face 1: b-e-f, face 2: c-f-g, face 3: d-g-a, face
4: e-a-b, face 5: f-b-c and face 6: g-c-d.
Prediction of TM helix orientation
The orientation of a TM helix relative to a lipid bilayer can
be predicted by finding either the mostly lipid-exposed or
the mostly buried helical face. We explore both
approaches by comparing prediction results for the
Table 1 summarizes the results of prediction of helix orientation for 162 helices from 18 membrane proteins for
which a sufficient number of homologous sequences is
available to compute positional entropy (see Methods
section). The average lipophilicity of a helical face was calculated using TMLIP2 scale [24], which was recomputed
every time to exclude the protein that was tested. TMLIP2
contains separate scales for the headgroup and hydrophobic regions of phospholipid bilayer to account for the different physico-chemical properties of the membrane.
Table 1 lists the number of correctly predicted lipidexposed and buried faces, based on the percentage of the
lipid-accessible surface area of the residues forming a helical face. We compare prediction results when only the
average helical face lipophilicity L f was used, when only
the average positional entropy E f of a helical face was
used, and when the product of both lipophilicity and
positional entropy (Sf = E f L f ) was used.
All lipid-exposed helical faces were determined correctly
in 8 out of 18 proteins (Table 1). These proteins are relatively small and contain between 5 to 10 TM helices
together with the strong ligand binding sites within the
TM bundle, e.g., bacteriorhodopsin (1C3W) and rhodopsin (1L9H) contain covalently bound retinal, while
succinate dehydrogenase (1NEK) and nitrate reductase A
(1Q16) bind heme molecules. These binding sites provide
strong evolutionary and physico-chemical constraints for
the buried residues, resulting in strong discrimination
between buried and exposed helical faces.
Results in Table 1 show that the prediction of the lipidexposed faces is more reliable than the prediction of the
buried faces. Indeed, we find that the face with the smallest lipophilicity index or the smallest entropy is not
always the most buried: it often represents a tightly
packed helix-helix interface and should not be used for
prediction of helix orientation alone. There are 48 TM helices where prediction of buried faces has failed. We compare the positions of the predicted buried faces for these
helices with the positions of the correct helical faces on
the helical wheel (see Additional File 1). We find that in
most cases the correct faces have significant overlap with
the predicted faces. The anchor residues are shifted only
by one residue on the helical wheel in 28 helices, and by
two residues in 18 helices. For example, predicted buried
Page 4 of 17
(page number not for citation purposes)
BMC Structural Biology 2006, 6:13
http://www.biomedcentral.com/1472-6807/6/13
Table 1: Results of prediction of buried and lipid-exposed faces of TM helices. The columns list the number of helices with correctly
predicted lipid-exposed face or lipid-buried face when average lipophilicity ( L f ), average entropy ( E f ) or LIPS function (S f) was
used.
PDB
1C3W
1EUL
1FX8
1IWG
1J4N
1KPL
1KQF
1L9H
1M3X
1NEK
1OCR
1OKC
1PV6
1PW4
1Q16
1Q90
1RH5
1ZCD
Total
Helix count1
S f Exp2
S f Brd3
7
7
3
6
4
8
3
7
6
4
13
3
4
7
3
7
3
6
7
9
3
7
4
8
5
7
10
5
21
5
9
10
5
11
7
10
6
7
3
6
4
9
3
7
8
3
11
3
6
11
5
11
3
8
102
63%
143
88%
114
70%
L f Exp2
L f Brd3
E f Exp2
E f Brd3
7
9
4
10
4
10
5
7
11
5
24
6
12
12
5
12
9
10
2
4
1
4
4
4
4
5
7
5
14
2
6
8
3
6
6
5
2
4
0
4
2
6
2
5
6
2
11
1
3
9
5
4
2
3
7
8
3
8
3
7
4
7
9
5
18
4
9
8
5
10
7
9
162
100%
90
56%
71
44%
131
81%
1 Number
count of lipid-exposed TM helices.
lipid-exposed helical face is predicted correctly if the highest scoring face has the largest lipid accessible area.
3 The buried helical face is predicted correctly if the lowest scoring face has the smallest lipid accessible area.
2 The
face on TM4 (D) of bacteriorhodopsin is 6, while the correct face is 3. Helical wheel diagram (Figure 3B) shows
that these two faces overlap by two residues and are neighbors to each other.
The best result (with average of 88% of correct predictions) was obtained for prediction of lipid-exposed helical
faces with a scoring function that combines average
lipophilicity and entropy. Using either lipophilicity or
entropy scorings correctly predicts 55% and 81% of lipidexposed helical faces, respectively. This result demonstrates that evolutionary conservation as measured by
positional entropy provides stronger discrimination than
lipophilicity. Prediction of the mostly buried helical face
was less successful: the combined Sf scoring function produced 70% of correct predictions, while lipophilicity and
entropy scorings produced only 44% and 63% of correct
predictions, respectively.
TM helices may have very complex packing patterns
within a helical bundle. For example, the tm II helices of
SdhC and SdhD hydrophobic membrane anchor subunits
of succinate dehydrogenase (SQR) cross the TM bundle
with a large tilt angle and have lipid-exposed surfaces on
the two opposite sides of the transmembrane domain.
Our method works well for this complicated example. Figure 4 shows residues on the predicted lipid-exposed faces
for all TM helices in SQR. There are two disconnected
faces for both tm II helices from SdhC and SdhD subunits.
The predicted lipid-exposed faces in each case contain two
segments that are found to be lipid-accessible on opposite
sides of the helical bundle.
Prediction of lipid-exposed faces for structure-based and
predicted TM helices
To assess how the definition of the TM helix boundaries
affects the results of prediction of helix orientation, we
compare the results of predicted helix orientations for
structure-based and predicted TM helices in Leu transporter LeuTAa from Aquifex aeolicus (PDB ID: 2A65), a bacdependent
terial
homologue
of
Na+/Cl- –
neurotransmitter transporter. All 12 TM helices are taken
as predicted by the hidden Markov model topology predictor TMHMM [30]. With exception of helix 7, the
sequences of computationally predicted and structurebased TM helices have significant overlap throughout the
whole length of the helix (Table 2). There are two buried
helices, which have most of their surfaces hidden within a
helical bundle. These are not suitable for LIPS calculations, but can be predicted using RANTS scoring method
Page 5 of 17
(page number not for citation purposes)
BMC Structural Biology 2006, 6:13
http://www.biomedcentral.com/1472-6807/6/13
Table 2: Comparison of prediction of helix orientation for structure-based and computationally predicted helices in Leu transporter
LeuTAa from Aquifex aeolicus (PDB ID: 2A65).
TM Helix
Source
First Residue Number
1
Struct
TMHMM
Struct
TMHMM
Struct
TMHMM
Struct
TMHMM
Struct
TMHMM
Struct
TMHMM
Struct
TMHMM
Struct
TMHMM
Struct
TMHMM
Struct
TMHMM
Struct
TMHMM
Struct
TMHMM
15
7
41
39
92
89
166
165
193
194
241
243
279
293
342
335
379
378
398
405
450
447
482
484
2
3
4
5
6
7
8
9
10
11
12
TM helix sequence
LIPS prediction
ILAMAGNAVGLGNFLRFPVQ
HWATRLGLILAMAGNAVGLGNF
GAFMIPYIIAFLLVGIPLMW
GGGAFMIPYIIAFLLVGIPLMWI
ILGVFGLWIPLVVAIYYVYIESWTLGFAIK
FAKILGVFGLWIPLVVAIYYVYI
LFAYIVFLITMFINVSILI
SLFAYIVFLITMFINVSILIRGI
RFAKIAMPTLFILAVFLVIR
FAKIAMPTLFILAVFLVIRVFLL
PGVWIAAVGQIFFTLSLGFGAI
VWIAAVGQIFFTLSLGFGAIITY
GLTAATLNEKAEVILGGSI
LGGSISIPAAVAFFGVANAVAIA
FLWFFLLFFAGLTSSIAIMQPMI
AGGTFLGFLWFFLLFFAGLTSSI
VLWTAAIVFFSAHLVMFL
AVLWTAAIVFFSAHLVMF
KSLDEMDFWAGTIGVVFFGLTELI
FWAGTIGVVFFGLTELIIFFWIF
YVMRYITPAFLAVLLVVWAREYI
IYYYVMRYITPAFLAVLLVVWAR
TVWITRFYIIGLFLFLTF
WITRFYIIGLFLFLTFLVFL
Buried
Correct
Correct
Correct
Correct
Correct
2nd best
Wrong
Wrong
Buried
Correct
*
Correct
Correct
Wrong
Correct
Correct
Correct
Correct
Correct
2nd best
2nd best
* Predicted and structure-based TM helices have little overlap.
of lipid accessibility [1]. The results of LIPS predictions of
the lipid-exposed faces for the remaining 10 TM helices
with TMHMM and structure-based sequences are summarized on Table 2. Correct predictions were made for both
structure-based and computationally predicted TM helices
for 5 helices with the N-terminal helix boundary difference ranging from 1 to 7 residues. For helix TM4, LIPS correctly predicted the lipid-exposed face for structure-based
helix and gave the second best choice (with the majority
of the predicted residues being lipid-accessible) for
TMHMM helix. For helix TM9, LIPS correctly predicted the
lipid-exposed face for TMHMM helix and failed for the
structure-based helix. LIPS failed for both sequences of
helix TM5, which differ only by 1 residue at the N-terminus and by 4 residues at the C-terminus. The second best
choice was predicted for both structure-based and computed versions of helix TM12. LIPS correctly predicted
lipid-exposed surface for structure-based TM7. We were
unable to assess the correctness of the LIPS prediction for
computationally predicted TM7 helix because the predicted sequence mainly represented the interhelical loop
region according to 2A65 structure. The results on Table 2
demonstrate that predictions with identical outcome were
obtained for 7 out of 9 helices for which the comparison
between structure-based and computationally predicted
sequences was plausible indicating a significant tolerance
of the LIPS method to the definition of helical boundaries.
Prediction of orientation of TM helices in transient
receptor potential vanillin subtype 1 (TRPV1) channel
Members of the protein family of transient receptor
potential (TRP) ion channels mediate a wide range of sensory responses, including thermosensation and taste.
Vanillin receptor TRPV1, which possesses weak voltage
sensitivity, is activated by warm temperatures and is a
molecular sensor for detecting multiple pain-producing
stimuli. TRPV1 is a key element for inflammatory nociception and an attractive drug target.
Although there are no high-resolution structures of TRP
channels available at the present time, the overall structural organization of TRP channels should be similar to
that of voltage-gated potassium (Kv) channels [31], which
are composed of tetramers of subunits containing two
pore-forming helices (S5 and S6) connected by a membrane-re-entrant loop P and the voltage-sensing transmembrane segments S1 to S4. There are several structures
of voltage-dependent channels available, including the Xray structure of a rat brain voltage-dependent Shaker family K+ channel (PDB:2A79) [32]. TRPV1 has no significant
sequence similarity with these sequences, indicating that
it would be difficult to produce a reliable sequence align-
Page 6 of 17
(page number not for citation purposes)
BMC Structural Biology 2006, 6:13
(PDB:1NEK):lipid accessible helices dehydrogenase
Transmembrane regionof the helical bundle (center) and side
Figure the top view of succinate
views of4
Transmembrane region of succinate dehydrogenase
(PDB:1NEK): top view of the helical bundle (center)
and side views of the lipid accessible helices. Predicted
residues on the lipid accessible faces are shown in CPK
color, the rest of the molecule is in orange. Labels for helices
from the SdhC subunit are in red, and labels for helices from
the SdhD subunit are in blue. Predicted lipid-facing residues
from the tm II helices of SdhC and SdhD are shown on two
separate side views, because these helices traverse the helical
bundle and appear on two different sides of the TM domain.
ment to build a homology model. Here, we use LIPS scoring to predict orientation of TM helices in TRPV1 channel.
First, we predict the locations of the six TM helices of
human TRPV1 channel (accession number CAB95729)
using a hidden Markov model topology predictor
TMHMM [30]. We then build a multiple sequence alignment profile for every TM helix and rank helices by their
lipid accessibility using RANTS server [1]. Our ranking by
lipid accessibility shows that TM helices S6 and S5 are the
least lipid accessible, while helix S3 is the most lipidaccessible (Figure 5). This agrees well with the hypothesis
that the transmembrane part of TRPV1 channel should be
similar to that of voltage-gated K+ channels.
Finally, we use multiple sequence alignment profiles for
all TRPV1 TM helices to calculate LIPS scores for helical
faces on every helix. The results of LIPS calculations are
presented on Table 3, which gives information about helical faces with the largest and the smallest LIPS scores on
helices S1–S6, as well as the residues found on the respec-
http://www.biomedcentral.com/1472-6807/6/13
Figure 5
ces S1–S6 by RANTS server [1]
Results of prediction of lipid-accessibility of TRPV1 TM heliResults of prediction of lipid-accessibility of TRPV1
TM helices S1–S6 by RANTS server [1]. Ranking by solvent accessibility shows that TM helices S6 and S5 are the
least lipid accessible, while helix S3 is the most lipid-accessible.
tive helical face. Figure 6 further illustrates the results of
LIPS calculations by showing the TM region of the tetrameric rat brain K+ channel structure (2A79, Figure 6A),
which is used here as a structural template for TRPV1
channel, and the structure of the TM region of a single
monomer with cartoons of helical wheels with predicted
lipid-exposed and buried faces highlighted by orange and
green lines, respectively (Figure 6B). Figure 6B also shows
that the exposed and buried faces are predicted on the
opposite surfaces for all TM helices with the exception of
helix S4, where the buried and exposed faces overlap,
which may indicate that the prediction for this helix is
incorrect.
Assessment of different structural assignment in
homologous membrane proteins: two cytochrome b6f
structures (1Q90 vs. 1VF5)
The multisubunit cytochrome b6f complex mediates electron transfer between the photosystem II and the photosystem I reaction centers, in which H2O is the electron
donor [33]. Each cytochrome b6f monomer contains eight
subunits and seven natural prosthetic groups. Almost
identical structures of cytochrome b6f complex were
obtained from the thermophilic cyanobacterium Mastigocladus laminosus (PDB: 1VF5) [34] and the green alga
Chlamydomonas reinhardtii (PDB: 1Q90) [35]. Six out of
eight polypeptide chains with TM helices are highly
homologous in both structures, with sequence identity
between the respective orthologous chains of 56.6% for
cytochrome f, 83.7% for cytochrome b6, 78.1% for subu-
Page 7 of 17
(page number not for citation purposes)
BMC Structural Biology 2006, 6:13
http://www.biomedcentral.com/1472-6807/6/13
Table 3: Summary of prediction of lipid-facing and buried residues in transmembrane region of TRPV1 channel
TM Helix
S1
S2
S3
S4
S5
S6
Buried
Exposed
Buried
Exposed
Buried
Exposed
Buried
Exposed
Buried
Exposed
Buried
Exposed
Helical Face
LIPS score
2
0
6
4
1
3
1
0
4
2
5
0
4.65
9.35
3.55
8.50
5.11
9.53
2.96
5.61