-
Notifications
You must be signed in to change notification settings - Fork 0
/
homologs.msa
290 lines (290 loc) · 17.9 KB
/
homologs.msa
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
>BacillusSubtilis:NP_387998.1 NP_387998.1 ribosomal protein L4 [Bacillus subtilis subsp. subtilis str. 168]
------------------------------------------------------------
------------------------------------------------------------
---------------------------------------------------------MPK
VALY-NQNGSTAGDIELNASVFGIEPNE-SVVFDAILMQRASLRQGTHKVKNRSEVRGGG
RKPWRQKGTGRARQGSIRSPQWRGGGVVFGPTPRS-YSYKLPKKVRRLAIKSVLSSKVID
NNIIVLEDLT--LDTAKTK------------------EMAAILK----------------
------------------------GLSVE---KKALIVTA-DA-NEAVALSARNIP----
GVTVVEANGINVLDVVNHEKLLITKAAVEKVEEVLA------------------------
------------------------------------
>CaenorhabditisElegans:NP_505181.1 NP_505181.1 Mitochondrial Ribosomal Protein, Large [Caenorhabditis elegans]
------------------------------------------------------------
----------------------------------MLSRQLRFLISSRAFSSAVQ------
------------DVSSTSS----------GESIDTRRELWR------KPENPFIKTPQAW
VSNLDTIEDEKLGLVDLHPDIFRTSPRI-DILHRNLTWQSVYRNVQMTKMLTKAEMPGGG
RKPWPQKKTGRAHVGSIRSPQFIRGGFANGVRGPRTWFYMLPDAVRIQGLCVALTLKHTQ
DDLHIVDKIQNLQN-GDPK-------------------YWIDLC----------------
--------------------------EARNYGYSVLFVDDCDEISGGLAEAQQALP----
WLNVMPVYGLNCFSLMKYDTIVLSRSALERVEERLLTQMHRAGPMNKKYRYMD---YKDK
ILQEAEAEEDP----LMPPVV---------------
>ChlamydomonasReinhardtii:XP_001697380.1 XP_001697380.1 uncharacterized protein CHLRE_11g479500v5 [Chlamydomonas reinhardtii]
------------------------------------------------------------
------------------------------------MQ-----TMRVAF--RPAATS---
RSTVVTRAS--A-------------------------VAA-----------------PAS
IPYK-AADGSSKGTQQLALKV-AEDSAK-GLVHRYLVMVQQNARQGTASTLTRSEVRGGG
KKPYAQKGTGNARRGSSVSPLFPGGGVTFGPKPKD-WSISMNKKERRLALATALQSATA-
-DMIVVESLAGKLQDTKTK------------------SMVALLE----------------
------------------------KLGANAMERKVLLITK-EE-RPDVTLAGRNIA----
KLTMNTASAISVFDVLNADHIIIEDEALAHVQSFYGAAAPASA-----------------
------------------------------------
>ChlamydomonasReinhardtii:XP_042918467.1 XP_042918467.1 uncharacterized protein CHLRE_12g520400v5 [Chlamydomonas reinhardtii]
---------------------------------MLAGALRGCASESAFAWRQVVSAAAAA
GSCSGSAGRLMVSSPCGVAQTSSPLQRWLFQGLRSSST---G-AASIS----GG-LS-EA
GSLPPLVLRRVDDEA-LSPHP--------PTTDVGPLTVRYPFPIEYYK--------DRE
AVIY-SLDERPLGLAPLPGAAFNVPVRI-DILHRVVRYWRAKWQQGTHKAKSRAEVSGGG
KKPWNQKKTGRARQGSIRSPLWKGGGVSHAPRPRS-HAHALPRSTRLLGMRCALSAKINE
GRFFVVDDLINLRAAPLQDADDAAAAAGVAQPAPLASGYLSALKPASGPGSDSSNKNPAR
WSRHGLSPADRPIREYGELKRRLGALTEGSFGSSWLLVDSGEA-GRDGGLRLRKLLKCSV
VMEVVSPEELTVYHVLKYHRLVVTRDALQRISEALTRPHRVTKPVKHAWWARRRQAIDAA
VQELTQAEAQA-------------------------
>ClostridiumPerfringens:WP_003454270.1 WP_003454270.1 MULTISPECIES: 50S ribosomal protein L4 [Clostridium]
------------------------------------------------------------
------------------------------------------------------------
---------------------------------------------------------MPK
VGLF-NKEGQQVGDIQLNEQVFGVEVNK-YALHQVVVAQLANKRQGTQSAKTRSEVRGGG
IKPWRQKGTGRARQGSIRAPQWIKGGVVFAPKPRD-YRMSIPKSMRKVAMTSALTSKVA-
-DMVVLEDLT--FEAPKTK------------------EAVKMLN----------------
------------------------AFEA----KKTLIITA-EV-NENVYKSARNIE----
GVTVMPVNNINVYDLLNCKTLMITKEAVNKIEEVYA------------------------
------------------------------------
>CyanobiumGracile:WP_015109401.1 WP_015109401.1 50S ribosomal protein L4 [Cyanobium gracile]
------------------------------------------------------------
------------------------------------------------------------
---------------------------------------------------------MTD
CVIR-DWQGKESGKAPLDLKVAKETSAN-GLLHRAVVRQLAHARQGTASTLTRAEVAGGG
RKPYKQKGTGRARQGSIRTPLRPGGGVIFGPKPRS-YAVSMNRKERRLALRTALMSRVA-
-DITVVKGFAEGLDTPKTK------------------EITAALA----------------
------------------------RFGIDAGAKV-LLILD-GA-SDAVSRSVRNLE----
KVKLIAADQLNVFDLLHANKLVLSEEALAKIQEVYGDV----------------------
------------------------------------
>DrosophilaMelanogaster:NP_524939.1 NP_524939.1 mitochondrial ribosomal protein L4 [Drosophila melanogaster]
------------------------------------------------------------
----------------------------MLNNILKTSRQVLYPV-ARTFSRSGNHG----
---------NVVTEAAATV----------GAPPATRSPLILPQDYTDCLPVSRNTARQAW
IENTDAVAERKVGLIELHPDVFAAQPRV-DIIQENVEWQSKYRYVSMAHTKTRAEVRGGG
RKPWPQKGGGRARHGSLRSPMLKGGGVVHGPRSPTTHFYMLPFYKRVLGLTSTLSVKLAQ
DDLHIIDNVD-IPT-GDAE-------------------FLKDLI----------------
--------------------------AERNWGPSVLIVDEDHMFPANICQASDDLG----
YVNLMPTFGLNVYSMLKHDTLVLTVAAVKHLEQRLLYQLNRNDAASKGGKFKL---DQV-
------------------------------------
>EnterobacterCloacae:WP_000424395.1 WP_000424395.1 MULTISPECIES: 50S ribosomal protein L4 [Bacteria]
------------------------------------------------------------
------------------------------------------------------------
------------------------------------------------------------
MELV--L-KDAQSALTVSETTFGRDFNE-ALVHQVVVAYAAGARQGTRAQKTRAEVTGSG
KKPWRQKGTGRARSGSIKSPIWRSGGVTFAARPQD-HSQKVNKKMYRGALKSILSELVRQ
DRLIVVEKFS--VEAPKTK------------------LLAQKLK----------------
------------------------DMAL----EDVLIITG-EL-DENLFLAARNLH----
KVDVRDATGIDPVSLIAFDKVVMTADAVKQVEEMLA------------------------
------------------------------------
>EscherichiaColiBL21:NP_417778.1 NP_417778.1 50S ribosomal subunit protein L4 [Escherichia coli str. K-12 substr. MG1655]
------------------------------------------------------------
------------------------------------------------------------
------------------------------------------------------------
MELV--L-KDAQSALTVSETTFGRDFNE-ALVHQVVVAYAAGARQGTRAQKTRAEVTGSG
KKPWRQKGTGRARSGSIKSPIWRSGGVTFAARPQD-HSQKVNKKMYRGALKSILSELVRQ
DRLIVVEKFS--VEAPKTK------------------LLAQKLK----------------
------------------------DMAL----EDVLIITG-EL-DENLFLAARNLH----
KVDVRDATGIDPVSLIAFDKVVMTADAVKQVEEMLA------------------------
------------------------------------
>FusariumOxysporum:XP_031036278.1 XP_031036278.1 mitochondrial 54S ribosomal protein YmL6 [Fusarium oxysporum NRRL 32931]
----------------------------------MAGKGIGCLAEAM-------------
-------GALRVSAK------PA----TLNKAFTRSMATEVSPK----------------
------P----TAEN-KSPNT-----------PQGILESWKPITTV-----------PVT
VHAFPSL--EPTSLERWDVNHLYLPLRR-DLLHLAVVYEGDNTRQGTASSKTRYDVHGSH
RKMRPQKGTGRARMGTKQSPVNRGGGKTFGPHPRD-FGTSLTRKVYDKAWRTALSYRYRK
GDLIVCEDGMDLVLPTDYEL-----VAGKYLKDGLKEAYLKR------------------
-----------------YMTGVLGNLGLGRASGRTLFVTG-NR-REALFGAMEQLPW---
EGRALDLEDVDVKDLLETGKVVLERSVLKEMIKKHQSDLVS-RVVMQGL-VKGGPKLGTP
VIRA--------------------------------
>GloeobacterViolaceus:WP_011140090.1 WP_011140090.1 50S ribosomal protein L4 [Gloeobacter violaceus]
------------------------------------------------------------
------------------------------------------------------------
---------------------------------------------------------MAT
CSIK-DWQGNATGEVDLDLPVASAATAS-HVVYLAFKRQMVNSRQGTASTLTRGEVRGGG
RKPWKQKGTGRARAGSIRSPLWRKGGVIFGPKPRD-FEIKMNRKERRLALRTALQSRVE-
-DLIVVDEFEGQLAAPKTR------------------ELVQAFE----------------
------------------------RWGVDMASQSILLILR-ER-QTNTYLSARNLP----
NVKVITAGNLNVRDLLATDWIVVTGPALELIKETYGAVA---------------------
------------------------------------
>HelianthusAnnuus:XP_021973724.1 XP_021973724.1 50S ribosomal protein L4, chloroplastic [Helianthus annuus]
------------------------------------------------------------
------------------------------------MATFIRPPSSLSFLSSQTPSSL--
FTKPPKPHT-LK-------------------------PISN---LTRCQ--------LST
LPIL-SFDGTKVGETSINLKSASPDTAR-AVVHRGITTDLNNKRRGTASTLTRAEVRGGG
KKPFPQKKLGRARRGSQRTPLRPGGGVVFGPKPRD-WSVKINKKEKRLAISTALASAAV-
-NGIVVEEFGGKFEKPKTK------------------EFIEALR----------------
------------------------RWGIDPKEKSMFFMTEDEV-EDNVVLSSRNIG----
TLRMLTPRTLNLFDILNADKLVFTKGGLEYLNEAYGADDGEDEEDFEEETEEGTEAEE-I
VVPPSDS-----------------------------
>HelianthusAnnuus:XP_021975671.1 XP_021975671.1 50S ribosomal protein L4 [Helianthus annuus]
MALRCSRKLLPTVVSGYKPHCNNNLDVARRSFHILSNGLHDHENADT-------------
----------QSSMECSILRKVG----FSLMGTRGLCTSMLSPES------SEG------
-SFPSDLLSRKQ-------------------------IITPERAIGQLQ--------DLV
IPVT-NFHNEDKGMMVLAGDVFDVPIRK-DIIHRVVRWQLAKRQQGTHSTKTISEVSGTG
RKPWRQKGTGRARHGTLRGPQFRHGAVMHGPKPRS-HAFKLNKKVRRLGLKIALSARAAE
GKLLVFDDLE--LLTHKTK------------------NIVSYVK----------------
------------------------QMEET---KKLLLVDGGPI-DEKLKLATQNLH----
YVNVLPSIGLNVYSILLHDTLVMSRDAVNKIVERMHTPINR-------------------
------------------------------------
>HelianthusAnnuus:XP_035830377.1 XP_035830377.1 uncharacterized protein LOC110865164 [Helianthus annuus]
------------------------------------------------------------
----------------------------MFQLCEMNTR-ETVPKHSLF------------
-RFGVQLKRAFVGDNATRCHQSNNRKVAANSEQAKRRARWN------------------N
CPMF--YCSNGSHKIVLACGVLKVPIRK-DIIHRVVRWQLDKGQQVLYISLIYNLILEYQ
------------------------------------------------------------
------------------------------------------------------------
------------------------------------------------------------
------------------------------------------------------------
------------------------------------
>HydraVulgaris:XP_047133073.1 XP_047133073.1 50S ribosomal protein L4-like [Hydra vulgaris]
----MSRMMIPRFFSQFS----------------------RCLEIK--------------
------------TSLCRNYHHVR----P------VIKRNLIKPEVDLSLHDPHG------
-KYSPQVR-----------------------------EFIKARDFENAG--------GRK
VDVISLTTGENKGVIELNNFVFGANPRI-DILQRNVVWYRACIRAGTACTKTRGEVRGGG
RKPWQQKGLGKARQGSIRAPHWRKGGVSGGPKPKD-YSYELPFKVRRMGLRTALSCKFAQ
GDLTVVEDYN-NLTETN---------------------FSDAVT----------------
------------------------SLSL----QSSLFVDG-FE-NDYLDSLVSGFE----
KIDFKPALLLHVYGMLIRSKLVLSLQAVRILEEKLCEDNRIVTDPRYELYHQNMLLDKSN
LFKEFDPKKKELRGRLIPELPKKSIRHKLPMSRQDQ
>KlebsiellaPneumoniae:YP_005229167.1 YP_005229167.1 50S ribosomal protein L4 [Klebsiella pneumoniae subsp. pneumoniae HS11286]
------------------------------------------------------------
------------------------------------------------------------
------------------------------------------------------------
MELV--L-KDAQSALTVSETTFGRDFNE-ALVHQVVVAYAAGARQGTRAQKTRAEITGSG
KKPWRQKGTGRARSGSIKSPIWRSGGVTFAARPQD-HSQKVNKKMYRGALKSILSELVRQ
DRLIVVEKFS--VEAPKTK------------------LLAQKLK----------------
------------------------DMAL----EDVLIITG-EL-DENLFLAARNLH----
KVDVRDANGIDPVSLIAFDKVVMTADAVKQVEEMLA------------------------
------------------------------------
>LactobacillusRhamnosus:WP_005686703.1 WP_005686703.1 50S ribosomal protein L4 [Lacticaseibacillus rhamnosus]
------------------------------------------------------------
------------------------------------------------------------
---------------------------------------------------------MAN
VTLY-KQDGSENGTVELNDAIWAVEPNE-NVVFDAVVMQRASLRQGTHAVKNRSAVSGGG
RKPWRQKGTGRARQGSIRSPQWRGGGIVFGPTPRS-YAYKLPKKVRRLAIKSVLSQKVLD
GDLVVVDGLS--FDAPKTK------------------AFLNVLD----------------
------------------------GLKVN---DKALVVLE-DG-NDVAAKAARNLP----
NVKVVPAEGINVLDAVNYKKLILTQSALQKIEEVLA------------------------
------------------------------------
>ListeriaMonocytogenes:NP_466154.1 NP_466154.1 50S ribosomal protein L4 [Listeria monocytogenes EGD-e]
------------------------------------------------------------
------------------------------------------------------------
---------------------------------------------------------MPK
LSLL-KQDGTNAGEITLNDTVFGIEPNE-KVVVDVILSQRASLRQGTHKVKNRSEVRGGG
RKPWRQKGTGRARQGSIRSPQWRGGGVVFGPTPRS-YAYKLPKKVRRLAIKSILSSKVNE
EKLVVLEGLT--FDAPKTK------------------EFAAFLK----------------
------------------------NISVD---TKALIVVA-GE-SENVELSARNLQ----
GITVIPAESISVLEVAKHDKLIITKAAVEKVEEVLA------------------------
------------------------------------
>MicromonasPusilla:XP_003061099.1 XP_003061099.1 predicted protein [Micromonas pusilla CCMP1545]
------------------------------------------------------------
------------------------------------MS-----SVSLSLSARSAVAG---
AKVPVRRAR--A-------------------------AAAKATGPVDVL--------AAA
VEKV-SFDGATKSTADLTLKTARADVAK-GLVHKYVVMVRQNARRGTASTLTKSEVRGGG
RKPFNQKGTGNARAGSIRSPLKPGGGVSFGPKPKD-WSIKMNKKERRLAMATAIQSAAG-
-SMIVVDDLGANVSVAKTK------------------TMANALK----------------
------------------------AWGVEEGE-KAYVITK-DA-SDAVKLSTRNMA----
KVVQSDISHLNVYDVLNADKVVVEESALKYINDFYGAEGGAWA-----------------
------------------------------------
>NitrosomonasEutropha:WP_011633674.1 WP_011633674.1 50S ribosomal protein L4 [Nitrosomonas eutropha]
------------------------------------------------------------
------------------------------------------------------------
---------------------------------------------------------MVK
IPCR-SE-NGQVVNIEVSDSVFDRVYNE-ALVHQIVTSYLANARSGTRAQKGRSEVAGST
RKQWRQKGTGRARVGAASNPLWRGGGKIFPNKPTENFTKKVNRKMYRAGMCTIFSQLLRN
SKLVAISEFR--VETTKTK------------------FFLQKLK----------------
------------------------NYQL----ENVMIITD-EV-DENLYLASRNVP----
NIKVVEIDLIDPVSLLSYDNVVITREAVNKIESVLQ------------------------
------------------------------------
>NitrospiraDefluvii:WP_213041942.1 WP_213041942.1 50S ribosomal protein L4 [Nitrospira defluvii]
------------------------------------------------------------
------------------------------------------------------------
---------------------------------------------------------MPI
VDVV-DTKKKKIGTVDLPNEVFGCKPHG-SLVHEAVVMQRACGRQGTASTLRRGEVSGSG
KKPWKQKHTGRARAGSLRSPVWRHGGTVFGPKPRS-YAVGMPKKKYRAAIQSALSAKVSE
GGVIVVAELV--IAEAKTK------------------LLAAALA----------------
------------------------QLEIG---GHALLVVG-DQ-NSHVVQAGKNLS----
NVTVLRPEDLNVYDVLRCRSLVIPQGELDRVKEVWS------------------------
------------------------------------
>PseudomonasAeruginosa:NP_252952.1 NP_252952.1 50S ribosomal protein L4 [Pseudomonas aeruginosa PAO1]
------------------------------------------------------------
------------------------------------------------------------
------------------------------------------------------------
MQLN--V--NGAQAIEVSERTFGGEFNE-TLVHQAVVAYMAGGRQGSKAQKTRSEVSGGG
KKPWRQKGTGRARAGTIRSPIWRGGGTTFAAKPRS-HEQKLNKKMYRAALRSILAELVRL
DRLVVVADFA--VDAPKTK------------------GLVAKLD----------------
------------------------TLGL----KDVLIVTD-GV-DENLYLAARNLA----
HVDVRDVQGSDPVSLIAYDKVLVTVSAVKKFEELLG------------------------
------------------------------------
>RhodobacterCapsulatus:WP_013066045.1 WP_013066045.1 50S ribosomal protein L4 [Rhodobacter capsulatus]
------------------------------------------------------------
------------------------------------------------------------
----------------------------------------------------------MK
LDVI-KLDGGTAGSIELDEALFGLEPRA-DILHRVVRWQRAKAQAGTHSVLGKSDVSYST
KKIYRQKGTGGARHGSKKAPIFRHGGVYKGPTPRS-HAHDLTKKFRALGLRHALSAKAKS
GSLVVIEAAD--MAEAKTA------------------LLAKAAK----------------
------------------------E-LGW---KKVLVIDGASV-NENFALAARNLD----
GIDVLPTMGANVYDILKRDTLVITKAGVEALEARLK------------------------
------------------------------------
>SaccharomycesCerevisiae:NP_013687.1 NP_013687.1 mitochondrial 54S ribosomal protein YmL6 [Saccharomyces cerevisiae S288C]
------------------------------------------------------------
----------------------------------MTIKRNLVKT----------------
--LQS---IRYQATTATAHAE------------STLNPLPNAAIPPKYA--------LVT
VRSFPSL--EPLTFVPVPTSTVAAPLRR-DILWRAVVYENDNRRVGASNPPGRSENGFSR
RKLMPQKGSGRARVGDANSPTRHNGGRALARTAPNDYTTELPSKVYSMAFNNALSHQYKS
GKLFVIGGEKVDLISPTPELD------------------LNRLDLVNT-----NTVEGKE
IFEGEV----------I-FRKFLEE--FQLKGKRLLFITD-KT-REGLIK---SSDPYKQ
KVDVIQKELVEVNDILRAQAVFIELEALEYLAMAHQKEILH--SVSN-------------
------------------------------------
>StaphylococcusAureus:YP_500978.1 YP_500978.1 50S ribosomal protein L4 [Staphylococcus aureus subsp. aureus NCTC 8325]
------------------------------------------------------------
------------------------------------------------------------
---------------------------------------------------------MAN
YDVL-KLDGTKSGSIELSDAVFGIEPNN-SVLFEAINLQRASLRQGTHAVKNRSAVSGGG
RKPWKQKGTGRARQGTIRAPQWRGGGIVFGPTPRS-YAYKMPKKMRRLALRSALSFKAQE
NGLTVVDAFN--FEAPKTK------------------EFKNVLS----------------
------------------------TLEQP---KKVLVVTE-NE-DVNVELSARNIP----
GVQVTTAQGLNVLDITNADSLVITEAAAKKVEEVLG------------------------
------------------------------------
>StreptomycesCoelicolor:NP_628862.1 NP_628862.1 50S ribosomal protein L4 [Streptomyces coelicolor A3(2)]
------------------------------------------------------------
------------------------------------------------------------
---------------------------------------------------------MST
VDIL-SPAGEKTGSVELPAEIFGVEKISIPLIHQVVVAQNAAARQGTHKTKRRGEVRGGG
KKPYRQKGTGRARQGSTRAPQFAGGGVVHGPQPRD-YSQRTPKKMKAAALRHALTDRARH
NRIHVVTGVI-EGENPSTK------------------AARTLFG----------------
------------------------KISER---KNLLLVVD-RA-DEAAWLSARNLP----
QIHILEPGQLNTYDVLVSDDVVFTQAAFESFVSGPNKAVDTEGSEA--------------
------------------------------------
>TheobromaCacao:XP_007026182.2 XP_007026182.2 PREDICTED: 50S ribosomal protein L4 [Theobroma cacao]
MALSISRRILRSFGSLSALARWDSLSIPSHSFQ--ASDLNACISGDN-------------
----------LPHAECFSFSKGG----LSFLACRKFATTILTPDS------AES------
-AFPSDLLSAKT-------------------------VLTPDRTIGLYQ--------DLV
IPVT-NFHNEDKGLMVLAGDVFDVPIRK-DIIHRVVRWQLAKRQQGTHSTKTISEVSGTG
RKPWRQKGTGRARHGTLRGPQFRGGATMHGPKPRS-HAIKLNKKVRRLGLKIALSASAAE
GKLLVFEDLE--VPTHKTK------------------NIVNYVN----------------
------------------------QMEKT---KKLLLVDGGPI-NEKLKLATQNLH----
YVNVLPSIGLNVYSILLHDTLVMSRDAVNRIVERMHTPINR-------------------
------------------------------------
>TheobromaCacao:XP_017984011.1 XP_017984011.1 PREDICTED: 50S ribosomal protein L4, chloroplastic [Theobroma cacao]
------------------------------------------------------------
------------------------------------MATSTPTPTSLSFFSSSLFLSSSS
TKLPCLSLS-FK-------------------------TSSNPNLCIASQ--------LST
LSIL-SFTGEKIGETYLDLKSAPPETAR-AVVHRAIITDQQNKRRGTASTLTRSEVRGGG
KKPYPQKKTGRARRGSMRSPLRPGGGVIFGPKPRD-WSIKINKKEKRLAISTALSSAAQ-
-NTIVVEEFGDKFEKPKTK------------------DFMEALK----------------
------------------------RWGLDPKQKSMFLMME--V-PENVNLSSRNIG----
TLRMLTPRTLNLFDILNCDNLVLTPDAVDYLNGRYGEDYEGDTEDDDEEEEEGGGGGGEE
ANENADAER---------------------------
>YarrowiaLipolytica:XP_504189.1 XP_504189.1 YALI0E20493p [Yarrowia lipolytica CLIB122]
------------------------------------------------------------
------------------------------------MIRRLIPG----------------
--LTSGLRRGFASEASGAAKQ------------ATVDTLPGAAKLPEFV--------LTS
VRQFPSL--EPTRLQPVTSQMLGAQVHK-DLLWRAVVYEADRQRVGASNPPGREQMGYST
RKLHKQKGMGKARVGDAGSPTRTQGGFALDRNAPNKMATGLPKQVYASAIRAALTSQYQE
GRLFVVDGPCELP------------------------------ESVKD-----NTTFGQQ
WLQGEL----------G-FGKKE----------MTVFLVD-TE-RPILDS---VLGKDNL
KADIVPKEFIEVRDILKARNVVVEYDVLKWLAAKYPTRDLF--GSFKPWTN---------
------------------------------------