; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC01g1132 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC01g1132
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionUPF0261 protein
Genome locationMC01:16682896..16688383
RNA-Seq ExpressionMC01g1132
SyntenyMC01g1132
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR009215 - TIM-barrel domain, IGPS-like
IPR013785 - Aldolase-type TIM barrel
IPR015813 - Pyruvate/Phosphoenolpyruvate kinase-like domain superfamily
IPR044122 - UPF0261 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6594812.1 ToMV susceptible protein tm-1(GCR26), partial [Cucurbita argyrosperma subsp. sororia]0.083.27Show/hide
Query:  MATRGDSKTLRVFCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGRMHGIESLDDFVFVSREDVLSCYDRTGNDLPDDRGKALSIMSK
        MATRG+SKTLRVFCIATAD+KLEELRF+SD VRSNLN F+R S +KVEVTVVDVST + + IESLDDFVFVSR+ VLSCYD T N LPDDRGKA+SIMSK
Subjt:  MATRGDSKTLRVFCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGRMHGIESLDDFVFVSREDVLSCYDRTGNDLPDDRGKALSIMSK

Query:  ALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEK
        ALE FLSK + DGVIAGAIGLGGSGGTSLISSALKSL IGIPKLIVST+ASGQTESYI TSDLIL PSIVDVCGINSVSRVV SNA AAFAGMV+GRLEK
Subjt:  ALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEK

Query:  FKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVL
          DS  FNEKPTVGLTMFGVTTPCVN+VKERL KEGYESLVFHATG+GGKAMESLVREGFIQGVLDITTTEVADYL+GGVMAC SSRFDAIIEK+IPLVL
Subjt:  FKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVL

Query:  SVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSND
        SVGALDMVNFGS D IPSNFHGRNIYEHNKQV+LMRTTVDENKKIA FIADK+NNS  KVRVCLPQ G+SALDAPGKP YDP+AT+TL+DELQ+ IQSN+
Subjt:  SVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSND

Query:  DRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESNLSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEGVPI
        DR+V VYPYHIND EF+EVLVNSFLEITSK T D C PK+V  E SQDL++ S SESNLS   +I YSPSDFP+ RP TLRRTR ILENLKAQ+V+GVPI
Subjt:  DRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESNLSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEGVPI

Query:  IGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPLNLVIYPVGLESLNAEHYISRLLRDSPNLLSDGVCTFEYA
        IGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLP                                        
Subjt:  IGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPLNLVIYPVGLESLNAEHYISRLLRDSPNLLSDGVCTFEYA

Query:  DVVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADIIVA
         VV TVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIE AHKMGLLTTPYAFNQDEA EMAKAGADIIVA
Subjt:  DVVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADIIVA

Query:  HMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFILKRTKGVHGFYGASSMERLPVEQAITSTVQEYKSISMR
        HMGLTTSGSIGAKTAL++EESVLRVQAIADAAHRINPNVLVLCHGGPISGP+EAAFILKRT GVHGFYGASSMERLPVEQAITST+QEYKSISMR
Subjt:  HMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFILKRTKGVHGFYGASSMERLPVEQAITSTVQEYKSISMR

KAG7026778.1 hypothetical protein SDJN02_10785 [Cucurbita argyrosperma subsp. argyrosperma]0.083.27Show/hide
Query:  MATRGDSKTLRVFCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGRMHGIESLDDFVFVSREDVLSCYDRTGNDLPDDRGKALSIMSK
        MATRG+SKTLRVFCIATAD+KLEELRF+SD VRSNLN F+R S +KVEVTVVDVST + + IESLDDFVFVSR+ VLSCYD T N LPDDRGKA+SIMSK
Subjt:  MATRGDSKTLRVFCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGRMHGIESLDDFVFVSREDVLSCYDRTGNDLPDDRGKALSIMSK

Query:  ALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEK
        ALE FLSK + DGVIAGAIGLGGSGGTSLISSALKSL IGIPKLIVST+ASGQTESYI TSDLIL PSIVDVCGINSVSRVV SNA AAFAGMV+GRLEK
Subjt:  ALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEK

Query:  FKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVL
          DS  FNEKPTVGLTMFGVTTPCVN+VKERL KEGYESLVFHATG+GGKAMESLVREGFIQGVLDITTTEVADYL+GGVMAC SSRFDAIIEK+IPLVL
Subjt:  FKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVL

Query:  SVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSND
        SVGALDMVNFGS D IPSNFHGRNIYEHNKQV+LMRTTVDENKKIA FIADK+NNS  KVRVCLPQ G+SALDAPGKP YDP+AT+TL+DELQ+ IQSN+
Subjt:  SVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSND

Query:  DRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESNLSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEGVPI
        DR+V VYPYHIND EF+EVLVNSFLEITSK T D C PK+V  E SQDL++ S SESNLS   +I YSPSDFP+ RP TLRRTR ILENLKAQ+V+GVPI
Subjt:  DRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESNLSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEGVPI

Query:  IGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPLNLVIYPVGLESLNAEHYISRLLRDSPNLLSDGVCTFEYA
        IGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLP                                        
Subjt:  IGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPLNLVIYPVGLESLNAEHYISRLLRDSPNLLSDGVCTFEYA

Query:  DVVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADIIVA
         VV TVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIE AHKMGLLTTPYAFNQDEA EMAKAGADIIVA
Subjt:  DVVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADIIVA

Query:  HMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFILKRTKGVHGFYGASSMERLPVEQAITSTVQEYKSISMR
        HMGLTTSGSIGAKTAL++EESVLRVQAIADAAHRINPNVLVLCHGGPISGP+EAAFILKRT GVHGFYGASSMERLPVEQAITST+QEYKSISMR
Subjt:  HMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFILKRTKGVHGFYGASSMERLPVEQAITSTVQEYKSISMR

XP_022132686.1 uncharacterized protein LOC111005487 isoform X1 [Momordica charantia]0.094.84Show/hide
Query:  MATRGDSKTLRVFCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGRMHGIESLDDFVFVSREDVLSCYDRTGNDLPDDRGKALSIMSK
        MATRGDSKTLRVFCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGRMHGIESLDDFVFVSREDVLSCYDRTGNDLPDDRGKALSIMSK
Subjt:  MATRGDSKTLRVFCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGRMHGIESLDDFVFVSREDVLSCYDRTGNDLPDDRGKALSIMSK

Query:  ALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEK
        ALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEK
Subjt:  ALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEK

Query:  FKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVL
        FKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVL
Subjt:  FKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVL

Query:  SVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSND
        SVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSND
Subjt:  SVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSND

Query:  DRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESNLSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEGVPI
        DRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESNLSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEGVPI
Subjt:  DRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESNLSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEGVPI

Query:  IGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPLNLVIYPVGLESLNAEHYISRLLRDSPNLLSDGVCTFEYA
        IGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLP                                        
Subjt:  IGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPLNLVIYPVGLESLNAEHYISRLLRDSPNLLSDGVCTFEYA

Query:  DVVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADIIVA
         VVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADIIVA
Subjt:  DVVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADIIVA

Query:  HMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFILKRTKGVHGFYGASSMERLPVEQAITSTVQEYKSISMR
        HMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFILKRTKGVHGFYGASSMERLPVEQAITSTVQEYKSISMR
Subjt:  HMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFILKRTKGVHGFYGASSMERLPVEQAITSTVQEYKSISMR

XP_022132687.1 uncharacterized protein LOC111005487 isoform X2 [Momordica charantia]0.094.44Show/hide
Query:  MHGIESLDDFVFVSREDVLSCYDRTGNDLPDDRGKALSIMSKALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYI
        MHGIESLDDFVFVSREDVLSCYDRTGNDLPDDRGKALSIMSKALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYI
Subjt:  MHGIESLDDFVFVSREDVLSCYDRTGNDLPDDRGKALSIMSKALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYI

Query:  ETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEKFKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVRE
        ETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEKFKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVRE
Subjt:  ETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEKFKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVRE

Query:  GFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVLSVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLV
        GFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVLSVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLV
Subjt:  GFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVLSVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLV

Query:  KVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSNDDRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESN
        KVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSNDDRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESN
Subjt:  KVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSNDDRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESN

Query:  LSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEGVPIIGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEV
        LSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEGVPIIGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEV
Subjt:  LSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEGVPIIGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEV

Query:  LPLNLVIYPVGLESLNAEHYISRLLRDSPNLLSDGVCTFEYADVVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGM
        LP                                         VVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGM
Subjt:  LPLNLVIYPVGLESLNAEHYISRLLRDSPNLLSDGVCTFEYADVVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGM

Query:  GYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADIIVAHMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFIL
        GYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADIIVAHMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFIL
Subjt:  GYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADIIVAHMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFIL

Query:  KRTKGVHGFYGASSMERLPVEQAITSTVQEYKSISMR
        KRTKGVHGFYGASSMERLPVEQAITSTVQEYKSISMR
Subjt:  KRTKGVHGFYGASSMERLPVEQAITSTVQEYKSISMR

XP_038882760.1 toMV susceptible protein tm-1(GCR26) isoform X1 [Benincasa hispida]0.084.03Show/hide
Query:  MATRGDSKTLRVFCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGRMHGIESLDDFVFVSREDVLSCYDRTGNDLPDDRGKALSIMSK
        MATRG++KT RVFCIATAD+KLEELRFISD+VR+NLN FSR+SSFKVEVTVVDVST R +GIESLDDFVFVSREDVLSC D TGN LPDDRGKA+SIMSK
Subjt:  MATRGDSKTLRVFCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGRMHGIESLDDFVFVSREDVLSCYDRTGNDLPDDRGKALSIMSK

Query:  ALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEK
        ALE +LSKA+ DG+IAG IGLGGSGGTSLISSAL+SL IGIPKLIVSTVASGQTESYI TSDLIL PSIVDVCGINSVSRV+LSNAGAAFAGMVVGRLEK
Subjt:  ALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEK

Query:  FKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVL
         KDS + NEKPTVGLTMFGVTTPCVN+VKERL KEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYL+GGVMAC SSRFDAIIEKRIPLVL
Subjt:  FKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVL

Query:  SVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSND
        S+GALDMVNFGS+D I SN+H RNIYEHNKQVSLMRTT +E++KIAHFIADKINNS  KVRVCLPQNGVSALDAPGKPFYDPEAT+TL+DELQR IQ N+
Subjt:  SVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSND

Query:  DRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESNLSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEGVPI
        DRQVKVYPYHIND EFAEVLVNSFLEITSK+T D C PK+V  E SQDL ++SIS SNLS + +I YSPSDFP+A+P TL+RTRMILENLKAQI++GVPI
Subjt:  DRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESNLSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEGVPI

Query:  IGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPLNLVIYPVGLESLNAEHYISRLLRDSPNLLSDGVCTFEYA
        IGAGAGTGISAKFEE GGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLP                                        
Subjt:  IGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPLNLVIYPVGLESLNAEHYISRLLRDSPNLLSDGVCTFEYA

Query:  DVVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADIIVA
         VV TVPVLAGVCASDPFRRMD+FLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFN+DEALEMAKAGADIIVA
Subjt:  DVVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADIIVA

Query:  HMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFILKRTKGVHGFYGASSMERLPVEQAITSTVQEYKSISMR
        HMGLTTSGSIGAKTAL+MEES + VQAIADAAHRINPNVLVLCHGGPISGP+EAAFILKRTKGVHGFYGASSMERLPVEQAIT+T+QEYKSISMR
Subjt:  HMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFILKRTKGVHGFYGASSMERLPVEQAITSTVQEYKSISMR

TrEMBL top hitse value%identityAlignment
A0A5A7SKR1 UPF0261 protein0.082.24Show/hide
Query:  MATRGDSKTLRVFCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGRMHGIESLDDFVFVSREDVLSCYDRTGNDLPDDRGKALSIMSK
        MAT   +KT RVFCIATAD+KL+ELRFISD+VR NLNSFS +SSFKVEVT+VDVST    GI+SLDDF FVSRE+VLSCY+ TGNDLPDDRGKA+SIMSK
Subjt:  MATRGDSKTLRVFCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGRMHGIESLDDFVFVSREDVLSCYDRTGNDLPDDRGKALSIMSK

Query:  ALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEK
        ALE +LSKA+ DG+IAG IGLGGSGGTSLISSALKSL IGIPKLIVSTVASGQTESYI TSD+IL PSIVDVCGINSVSRV+LSNAG+AFAGMVVGRLE 
Subjt:  ALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEK

Query:  FKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVL
         KDS + N KPTVGLTMFGVTTPCVN+VKERL KEGYE+LVFHATGVGGKAMESLV+EGFIQGVLDITTTEVADYL+GGVMAC S+RFDAIIEKRIPLVL
Subjt:  FKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVL

Query:  SVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSND
        SVGA+DMVNFGS+D IPSNFHGRNIYEHNKQVSLMRTTV+EN+KIAHFIADKINNS  KVRVCLP+NGVSALDAPGK FYDPEAT+TL++ELQ+ IQ N+
Subjt:  SVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSND

Query:  DRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESNLSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEGVPI
        DRQVKVYPYHIND EFAE LVNSFLEIT K+T D C PK+V  E SQDLQ++ IS+ NLS +G+I YS SDFP+ARP TL+RTR IL NLKAQI +GVPI
Subjt:  DRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESNLSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEGVPI

Query:  IGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPLNLVIYPVGLESLNAEHYISRLLRDSPNLLSDGVCTFEYA
        IGAGAGTGISAKFEE GGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLP                                        
Subjt:  IGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPLNLVIYPVGLESLNAEHYISRLLRDSPNLLSDGVCTFEYA

Query:  DVVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADIIVA
         VV TVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFN+DEAL+MA AGADIIVA
Subjt:  DVVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADIIVA

Query:  HMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFILKRTKGVHGFYGASSMERLPVEQAITSTVQEYKSISM
        HMGLTTSGSIGAKTAL+MEESV+RVQAIADAAHRINPNVLVLCHGGPISGP+EAAFILKRTKGVHGFYGASSMERLPVEQAITSTVQEYKSIS+
Subjt:  HMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFILKRTKGVHGFYGASSMERLPVEQAITSTVQEYKSISM

A0A6J1BSZ4 uncharacterized protein LOC111005487 isoform X10.094.84Show/hide
Query:  MATRGDSKTLRVFCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGRMHGIESLDDFVFVSREDVLSCYDRTGNDLPDDRGKALSIMSK
        MATRGDSKTLRVFCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGRMHGIESLDDFVFVSREDVLSCYDRTGNDLPDDRGKALSIMSK
Subjt:  MATRGDSKTLRVFCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGRMHGIESLDDFVFVSREDVLSCYDRTGNDLPDDRGKALSIMSK

Query:  ALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEK
        ALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEK
Subjt:  ALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEK

Query:  FKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVL
        FKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVL
Subjt:  FKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVL

Query:  SVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSND
        SVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSND
Subjt:  SVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSND

Query:  DRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESNLSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEGVPI
        DRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESNLSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEGVPI
Subjt:  DRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESNLSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEGVPI

Query:  IGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPLNLVIYPVGLESLNAEHYISRLLRDSPNLLSDGVCTFEYA
        IGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLP                                        
Subjt:  IGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPLNLVIYPVGLESLNAEHYISRLLRDSPNLLSDGVCTFEYA

Query:  DVVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADIIVA
         VVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADIIVA
Subjt:  DVVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADIIVA

Query:  HMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFILKRTKGVHGFYGASSMERLPVEQAITSTVQEYKSISMR
        HMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFILKRTKGVHGFYGASSMERLPVEQAITSTVQEYKSISMR
Subjt:  HMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFILKRTKGVHGFYGASSMERLPVEQAITSTVQEYKSISMR

A0A6J1BUJ0 uncharacterized protein LOC111005487 isoform X20.094.44Show/hide
Query:  MHGIESLDDFVFVSREDVLSCYDRTGNDLPDDRGKALSIMSKALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYI
        MHGIESLDDFVFVSREDVLSCYDRTGNDLPDDRGKALSIMSKALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYI
Subjt:  MHGIESLDDFVFVSREDVLSCYDRTGNDLPDDRGKALSIMSKALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYI

Query:  ETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEKFKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVRE
        ETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEKFKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVRE
Subjt:  ETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEKFKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVRE

Query:  GFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVLSVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLV
        GFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVLSVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLV
Subjt:  GFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVLSVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLV

Query:  KVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSNDDRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESN
        KVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSNDDRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESN
Subjt:  KVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSNDDRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESN

Query:  LSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEGVPIIGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEV
        LSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEGVPIIGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEV
Subjt:  LSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEGVPIIGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEV

Query:  LPLNLVIYPVGLESLNAEHYISRLLRDSPNLLSDGVCTFEYADVVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGM
        LP                                         VVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGM
Subjt:  LPLNLVIYPVGLESLNAEHYISRLLRDSPNLLSDGVCTFEYADVVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGM

Query:  GYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADIIVAHMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFIL
        GYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADIIVAHMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFIL
Subjt:  GYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADIIVAHMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFIL

Query:  KRTKGVHGFYGASSMERLPVEQAITSTVQEYKSISMR
        KRTKGVHGFYGASSMERLPVEQAITSTVQEYKSISMR
Subjt:  KRTKGVHGFYGASSMERLPVEQAITSTVQEYKSISMR

A0A6J1HG11 uncharacterized protein LOC1114631630.083.14Show/hide
Query:  MATRGDSKTLRVFCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGRMHGIESLDDFVFVSREDVLSCYDRTGNDLPDDRGKALSIMSK
        MATRG+SKTLRVFCIATAD+KLEELRF+SD VRSNLN F+R S +KVEVTVVDVST + + IESLDDFVFVSR+ VLSCYD T N LPDDRGKA+SIMSK
Subjt:  MATRGDSKTLRVFCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGRMHGIESLDDFVFVSREDVLSCYDRTGNDLPDDRGKALSIMSK

Query:  ALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEK
        ALE FLSK + DGVIAGAIGLGGSGGTSLISSALKSL IGIPKLIVST+ASGQTESYI TSDLIL PSIVDVCGINSVSRVV SNA AAFAGMV+GRLEK
Subjt:  ALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEK

Query:  FKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVL
          DS  FNEKPTVGLTMFGVTTPCVN+VKERL KEGYESLVFHATG+GGKAMESLVREGFIQGVLDITTTEVADYL+GGVMAC SSRFDAIIEK+IPLVL
Subjt:  FKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVL

Query:  SVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSND
        SVGALDMVNFGS D IPSNFHGRNIYEHNKQV+LMRTTVDENKKIA FIADK+NNS  KVRVCLPQ G+SALDAPGKP YDP+AT+TL+DELQ+ IQSN+
Subjt:  SVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSND

Query:  DRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESNLSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEGVPI
        DR+V VYPYHIND EF+EVLVNSFLEITSK T D C PK+V  E SQDL++ S SESNLS   +I YSPSD P+ RP TLRRTR ILENLKAQ+V+GVPI
Subjt:  DRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESNLSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEGVPI

Query:  IGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPLNLVIYPVGLESLNAEHYISRLLRDSPNLLSDGVCTFEYA
        IGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLP                                        
Subjt:  IGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPLNLVIYPVGLESLNAEHYISRLLRDSPNLLSDGVCTFEYA

Query:  DVVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADIIVA
         VV TVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIE AHKMGLLTTPYAFNQDEA EMAKAGADIIVA
Subjt:  DVVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADIIVA

Query:  HMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFILKRTKGVHGFYGASSMERLPVEQAITSTVQEYKSISMR
        HMGLTTSGSIGAKTAL++EESVLRVQAIADAAHRINPNVLVLCHGGPISGP+EAAFILKRT GVHGFYGASSMERLPVEQAITST+QEYKSISMR
Subjt:  HMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFILKRTKGVHGFYGASSMERLPVEQAITSTVQEYKSISMR

A0A6J1KM41 uncharacterized protein LOC1114969600.082.77Show/hide
Query:  MATRGDSKTLRVFCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGRMHGIESLDDFVFVSREDVLSCYDRTGNDLPDDRGKALSIMSK
        MATRG+SKTLRVFCIATAD+KLEELRF+SD VRSNLN F+R S  KVEVTVVDVST + + IESLDDFVFVSREDVLSCYD T N LPDDRGKA+SIMSK
Subjt:  MATRGDSKTLRVFCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGRMHGIESLDDFVFVSREDVLSCYDRTGNDLPDDRGKALSIMSK

Query:  ALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEK
        ALE FLSKA+ DGVIAGAIGLGGSGGTSLISSALKSL IGIPKLIVSTVA+GQTESYI TSDLIL PSIVDVCGINSVSRVV SNA AAFAGMV+GRLEK
Subjt:  ALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEK

Query:  FKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVL
          DS  FNEKPTVGLTMFGVTTPCVN+VKERL K GYE LVFHATG+GGKAMESLVREGFIQGVLDITTTEVADYL+GGVMAC SSRFDAIIEKRIPLVL
Subjt:  FKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVL

Query:  SVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSND
        SVGALDMVNFGS D IPSNFHGRNIYEHNKQV+LMRTT+DENKKIA FIADK+NNS  KVRVCLP+ G+SALDAPGKPFYDP+AT+TL+DELQ+ IQSN+
Subjt:  SVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSND

Query:  DRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESNLSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEGVPI
         R+V VYPYHIND EFA  LVNSFLEITSK T   C PK+V  E SQDL++NS SESNLS    I YSPSDFP+ RP T RRTR ILENLKAQ+++GVPI
Subjt:  DRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESNLSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEGVPI

Query:  IGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPLNLVIYPVGLESLNAEHYISRLLRDSPNLLSDGVCTFEYA
        IGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLP                                        
Subjt:  IGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPLNLVIYPVGLESLNAEHYISRLLRDSPNLLSDGVCTFEYA

Query:  DVVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADIIVA
         VV TVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIE AHKMGLLTTPYAFNQDEA EMAKAGADIIVA
Subjt:  DVVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADIIVA

Query:  HMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFILKRTKGVHGFYGASSMERLPVEQAITSTVQEYKSISMR
        HMGLTTSGSIGAKTAL++EESVLRVQ IADAAHRINPNVLVLCHGGPISGP+EAAFILKRT GVHGFYGASSMERLPVEQAITST+QEYKSISMR
Subjt:  HMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFILKRTKGVHGFYGASSMERLPVEQAITSTVQEYKSISMR

SwissProt top hitse value%identityAlignment
A4FLF6 UPF0261 protein SACE_56961.7e-8743.71Show/hide
Query:  FCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGRMHGIESLDDFVFVSREDVLSCY-DRTGNDLPDDRGKALSIMSKALEYFLSKARG
        + + T D+K  EL +++  V +            V V  VDVST       +      V   +V   + D        DRG A++ M+ ALE FL+    
Subjt:  FCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGRMHGIESLDDFVFVSREDVLSCY-DRTGNDLPDDRGKALSIMSKALEYFLSKARG

Query:  DGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEKFKDSRNFNEKP
           + G I LGGSGGT+L + A+++LP+G+PK++VSTVASG   SY++ +D+ + PS+ DV G+N +SR VL NA  A AG + G      D  +  +KP
Subjt:  DGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEKFKDSRNFNEKP

Query:  TVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVLSVGALDMVNFG
         V LTMFGVTTPCV  V  RL +  Y+ LVFHATG GG+AME LV +G I  VLD+TTTEV D + GGVM+ G  R DAI    +P V S GALDMVNFG
Subjt:  TVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVLSVGALDMVNFG

Query:  SRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSNDDRQVKVYPYHI
        + + +P  +  RN+Y HN QV+LMRTT DE ++I  FIA K+N     VR  LP+ GVS LDAPG+PF+DP+A   L + L+  ++ + DR++   P++I
Subjt:  SRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSNDDRQVKVYPYHI

Query:  NDLEFAEVLVNSFLEITSKET
        ND  FA+ ++ +F E+    T
Subjt:  NDLEFAEVLVNSFLEITSKET

A7M6E7 ToMV resistance protein Tm-1(GCR237)9.7e-29368.42Show/hide
Query:  MATRGDSKTLRVFCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGRMHGIESLDDFVFVSREDVLSCY---DRTGNDLPDDRGKALSI
        MAT   S + RVFCI TAD+K +ELRF+S+ VRS+LNSFS  SSFKV VTVVDVST       S  DF FV  +DVLSC+   + T     D RG A++I
Subjt:  MATRGDSKTLRVFCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGRMHGIESLDDFVFVSREDVLSCY---DRTGNDLPDDRGKALSI

Query:  MSKALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGR
        MSKALE FLS A  +  +AG IGLGGSGGTSL+SSA +SLPIGIPK+I+STVASGQTESYI TSDL+L PS+VD+CGIN+VS+VVLSNAGAAFAGMV+GR
Subjt:  MSKALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGR

Query:  LEKFKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIP
        LE  K+    N K TVG+TMFGVTTPCVN+VKERL KEGYE+LVFHATGVGG+AME LVR GFIQGVLDITTTEVADY+VGGVMAC SSRFDAI+EK+IP
Subjt:  LEKFKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIP

Query:  LVLSVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQ
        LVLSVGALDMVNFG +  IP  F  R I+EHN+QVSLMRTTV ENKK A FIA+K+N +   V VCLP+ GVSALDAPGK FYDPEATS L  ELQ  ++
Subjt:  LVLSVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQ

Query:  SNDDRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQ-ENSISESNLSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVE
        +N+  QVKV PYHIND EFA  LV+SFLEI+ K     C P     +  QD+Q +N++ E   S +G      +DFP+A+P TL++  +IL+ LK QI +
Subjt:  SNDDRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQ-ENSISESNLSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVE

Query:  GVPIIGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPLNLVIYPVGLESLNAEHYISRLLRDSPNLLSDGVCT
        G PIIGAGAGTGISAKFEEAGGVDLIV+YNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLP                                    
Subjt:  GVPIIGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPLNLVIYPVGLESLNAEHYISRLLRDSPNLLSDGVCT

Query:  FEYADVVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGAD
             VV  V VLAGVCA+DPFRRMD FLKQ+ES+GF GVQNFPTVGLFDGNFRQNLEETGMGYGLEV+MI  AH+MGLLTTPYAF  DEA+ MA+AGAD
Subjt:  FEYADVVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGAD

Query:  IIVAHMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFILKRTKGVHGFYGASSMERLPVEQAITSTVQEYKSISM
        IIVAHMGLTTSGSIGAKTA+++EESV  VQAIADA HRI P+ +VLCHGGPIS P EAA++LKRT GVHGFYGASSMERLPVEQAIT+TVQ+YKSISM
Subjt:  IIVAHMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFILKRTKGVHGFYGASSMERLPVEQAITSTVQEYKSISM

A7M6E8 ToMV susceptible protein tm-1(GCR26)1.6e-29568.76Show/hide
Query:  MATRGDSKTLRVFCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGRMHGIESLDDFVFVSREDVLSCYDR---TGNDLPDDRGKALSI
        MAT   S + RVFCI TAD+K +ELRF+S+ VRS+LNSFS  SSFKV VTVVDVST R     S  DF FV  +DVLSCY R   T    PD RG+A++I
Subjt:  MATRGDSKTLRVFCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGRMHGIESLDDFVFVSREDVLSCYDR---TGNDLPDDRGKALSI

Query:  MSKALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGR
        M+KALE FLSKA G+  +AG IGLGGSGGTSL+SSA +SLPIGIPK+I+STVASGQTESYI TSDL+L PS+VD+CGIN+VS+VVLSNAGAAFAGMV+GR
Subjt:  MSKALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGR

Query:  LEKFKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIP
        LE  K+    N K TVG+TMFGVTTPCVN+VKERL KEGYE+LVFHATGVGG+AME LVR GFIQGVLDITTTEVADY+VGGVMAC SSRFDAI+EK+IP
Subjt:  LEKFKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIP

Query:  LVLSVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQ
        LVLSVGALDMVNFG +  IP  F  R I++HN+QVSLM TTV ENKK A FIA+K+N +   V VCLP+ GVSALDAPGK FYDPEATS L  ELQ  ++
Subjt:  LVLSVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQ

Query:  SNDDRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESNLSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEG
        +N+  QVKVYPYHIND+EFA  LV+SFLE++ K     C     S  I      N++ E   S +G      +DFP+A+P TL++  +IL+ LK QI +G
Subjt:  SNDDRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESNLSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEG

Query:  VPIIGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPLNLVIYPVGLESLNAEHYISRLLRDSPNLLSDGVCTF
         PIIGAGAGTGISAKFEEAGGVDLIV+YNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLP                                     
Subjt:  VPIIGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPLNLVIYPVGLESLNAEHYISRLLRDSPNLLSDGVCTF

Query:  EYADVVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADI
            VV  V VLAGVCA+DPFRRMD FLKQ+ES+GF GVQNFPTVGLFDGNFRQNLEETGMGYGLEV+MI  AH+MGLLTTPYAF  DEA+ MA+AGADI
Subjt:  EYADVVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADI

Query:  IVAHMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFILKRTKGVHGFYGASSMERLPVEQAITSTVQEYKSISM
        IVAHMGLTTSGSIGAKTA+++EESV  VQAIADA HRINP+ +VLCHGGPIS P EAA++LKRT GVHGFYGASSMERLPVEQAIT+TVQ+YKSISM
Subjt:  IVAHMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFILKRTKGVHGFYGASSMERLPVEQAITSTVQEYKSISM

P55606 UPF0261 protein y4oU3.4e-8844.82Show/hide
Query:  VFCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGRMHGIESLDDFVFVSREDVLSCYDRTGNDLPDDRGKALSIMSKALEYFLSKARG
        V+ + T D+K  ELR++ D +R              +  +VDVS    H  ++ D  V  S             DL  DRGKA++ M++AL  F+ ++R 
Subjt:  VFCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGRMHGIESLDDFVFVSREDVLSCYDRTGNDLPDDRGKALSIMSKALEYFLSKARG

Query:  DGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEKFKDSRNFNEKP
        D  + G IG GG+GGT+LI+ AL++LPIG PK++VSTVASG    Y+  +D+ ++ S+ DV G+N +SRVVL+NA  + AGMV+ ++   KD     E+P
Subjt:  DGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEKFKDSRNFNEKP

Query:  TVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVLSVGALDMVNFG
         +GLTMFGVTTPCV +V   L  + ++ LVFHATG GG++ E L     + G +D++TTEV DYLVGGV  C + RF A    R+P V S GALDMVNFG
Subjt:  TVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVLSVGALDMVNFG

Query:  SRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSNDDRQVKVYPYHI
        + D +PS F  R ++ HN QV+LMRTT +E  +I  +IA+++N     VR  +P+ GVSA+DAPG+PF+DPEA S L   L+R ++    RQ+   P HI
Subjt:  SRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSNDDRQVKVYPYHI

Query:  NDLEFAEVLVNSFLE
        ND +FAE+LV +F E
Subjt:  NDLEFAEVLVNSFLE

Q981G1 UPF0261 protein mll93883.4e-8844.26Show/hide
Query:  VFCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGRMHGIESLDDFVFVSREDVLSCYDR--TGNDLPDDRGKALSIMSKAL-EYFLSK
        V+ + T D+K  ELR++ D +R              +V +VDVS    H  E+ D  V V   +V  C+       +L  DRGKA++ MS+AL EY  S+
Subjt:  VFCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGRMHGIESLDDFVFVSREDVLSCYDR--TGNDLPDDRGKALSIMSKAL-EYFLSK

Query:  ARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEKFKDSRNFN
           DG+    IG GGSGGT+LI+ A+++LPIG+PK++VSTVASG    Y+  +D+ ++ S+ DV G+N +SRVVL+NA  + AGMV+ ++    D     
Subjt:  ARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEKFKDSRNFN

Query:  EKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVLSVGALDMV
        E+P +GLTMFGVTTPCV +V  R  +  ++ LVFHATG GG++ E L   G + G +D++TTEV DYLVGGV  C   RF A    ++P V S GALDMV
Subjt:  EKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVLSVGALDMV

Query:  NFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSNDDRQVKVYP
        NFG+ + +PS F  R ++ HN QV+LMRT  +E  +I  +I +++N     VR  +P+ GVSA+DAPG+PF+DPEA + L   L+R ++  D RQ+   P
Subjt:  NFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSNDDRQVKVYP

Query:  YHINDLEFAEVLVNSFLE
         HIND +FA++LV +  E
Subjt:  YHINDLEFAEVLVNSFLE

Arabidopsis top hitse value%identityAlignment
AT5G66420.1 CONTAINS InterPro DOMAIN/s: Uncharacterised conserved protein UCP033271 (InterPro:IPR008322), TIM-barrel signal transduction protein, predicted (InterPro:IPR009215); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink).2.7e-27470.96Show/hide
Query:  MSKALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGR
        MSK LE FL +A  D  +AGAIGLGGSGGTSLISSA +SLPIGIPK+IVSTVASGQTE Y+ TSDL+LIPS+VDVCGINSVSRVV SNAGA+FAGMVVGR
Subjt:  MSKALEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGR

Query:  LEKFKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIP
        LE F+ S + N K TVG+TMFGVTTPCVN+V+E L +EGYE+LVFHATGVGG+AMESLV+EGFIQGV+DITTTEVAD+LVGGVMAC SSRFD  IEK IP
Subjt:  LEKFKDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIP

Query:  LVLSVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQ
        LVLSVGALDMVNFG +D IPS+F  R I+ HN+QVSL+RTT +ENKK A FIADK+N S  KVRVC+P+ G+SALDAPGKPF DPEAT  L++ELQ  IQ
Subjt:  LVLSVGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQ

Query:  SNDDRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESNLSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEG
        +NDDRQV +Y +HIND EFAE LV SFLEI  K    +   +  S + S    ++    S L     I YSP +FP+A+P TL RT+ IL  L+ QI +G
Subjt:  SNDDRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESNLSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEG

Query:  VPIIGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPLNLVIYPVGLESLNAEHYISRLLRDSPNLLSDGVCTF
        +PIIG GAGTGISAKFEEAGG+DLIV+YNSGRFRMAGRGSLAGLLPFADANA+VLEMANEVLP                                     
Subjt:  VPIIGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPLNLVIYPVGLESLNAEHYISRLLRDSPNLLSDGVCTF

Query:  EYADVVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADI
            VV  VPVLAGVCA+DPFRRMDYFLKQ+ESIGF GVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMI  AHKMGLLTTPYAFN  E  EMAKAGADI
Subjt:  EYADVVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADI

Query:  IVAHMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFILKRTKG-VHGFYGASSMERLPVEQAITSTVQEYKSISMR
        IVAHMGLTTSG+IGAKTA+++EESV+RVQAIADAA R NP+++VLCHGGPISGP EA F+LKRT+G VHGFYGASSMERLPVEQAIT+TVQ+YKSIS++
Subjt:  IVAHMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFILKRTKG-VHGFYGASSMERLPVEQAITSTVQEYKSISMR

AT5G66420.2 LOCATED IN: cellular_component unknown; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Uncharacterised conserved protein UCP033271 (InterPro:IPR008322), TIM-barrel signal transduction protein, predicted (InterPro:IPR009215); Has 1232 Blast hits to 1230 proteins in 311 species: Archae - 18; Bacteria - 959; Metazoa - 1; Fungi - 84; Plants - 42; Viruses - 0; Other Eukaryotes - 128 (source: NCBI BLink).1.4e-30268.68Show/hide
Query:  KTLRVFCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGR-MHGIESLDDFVFVSREDVLSCYDRTGND-----LPDDRGKALSIMSKA
        +T RVFC+ TAD+KL+ELRF++ +VRSN+ +FS++SS KVEV +VDVS G     I+++ DF FV+RE+VLSCY  +  +     LPDDRG+A+ +MSK 
Subjt:  KTLRVFCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGR-MHGIESLDDFVFVSREDVLSCYDRTGND-----LPDDRGKALSIMSKA

Query:  LEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEKF
        LE FL +A  D  +AGAIGLGGSGGTSLISSA +SLPIGIPK+IVSTVASGQTE Y+ TSDL+LIPS+VDVCGINSVSRVV SNAGA+FAGMVVGRLE F
Subjt:  LEYFLSKARGDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEKF

Query:  KDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVLS
        + S + N K TVG+TMFGVTTPCVN+V+E L +EGYE+LVFHATGVGG+AMESLV+EGFIQGV+DITTTEVAD+LVGGVMAC SSRFD  IEK IPLVLS
Subjt:  KDSRNFNEKPTVGLTMFGVTTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVLS

Query:  VGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSNDD
        VGALDMVNFG +D IPS+F  R I+ HN+QVSL+RTT +ENKK A FIADK+N S  KVRVC+P+ G+SALDAPGKPF DPEAT  L++ELQ  IQ+NDD
Subjt:  VGALDMVNFGSRDAIPSNFHGRNIYEHNKQVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSNDD

Query:  RQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESNLSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEGVPII
        RQV +Y +HIND EFAE LV SFLEI  K    +   +  S + S    ++    S L     I YSP +FP+A+P TL RT+ IL  L+ QI +G+PII
Subjt:  RQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKIVSVEISQDLQENSISESNLSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEGVPII

Query:  GAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPLNLVIYPVGLESLNAEHYISRLLRDSPNLLSDGVCTFEYAD
        G GAGTGISAKFEEAGG+DLIV+YNSGRFRMAGRGSLAGLLPFADANA+VLEMANEVLP                                         
Subjt:  GAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPLNLVIYPVGLESLNAEHYISRLLRDSPNLLSDGVCTFEYAD

Query:  VVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADIIVAH
        VV  VPVLAGVCA+DPFRRMDYFLKQ+ESIGF GVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMI  AHKMGLLTTPYAFN  E  EMAKAGADIIVAH
Subjt:  VVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADIIVAH

Query:  MGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFILKRTKG-VHGFYGASSMERLPVEQAITSTVQEYKSISMR
        MGLTTSG+IGAKTA+++EESV+RVQAIADAA R NP+++VLCHGGPISGP EA F+LKRT+G VHGFYGASSMERLPVEQAIT+TVQ+YKSIS++
Subjt:  MGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFILKRTKG-VHGFYGASSMERLPVEQAITSTVQEYKSISMR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGACGCGAGGTGATAGTAAGACTCTCCGAGTTTTCTGTATCGCAACGGCTGATTCAAAGCTTGAGGAGCTCCGATTCATTTCCGATACAGTTCGATCCAACCTCAA
TAGCTTCTCTAGAAGTTCTTCTTTTAAGGTCGAAGTGACGGTTGTTGATGTTTCTACCGGCCGTATGCATGGGATCGAGAGTCTGGATGATTTCGTCTTCGTGTCGAGGG
AGGATGTTCTCTCTTGCTATGATCGCACTGGAAATGATCTTCCCGATGACCGCGGAAAAGCTCTCTCTATAATGAGTAAAGCACTTGAATATTTCCTTAGTAAAGCCCGG
GGGGATGGGGTAATTGCTGGAGCTATTGGACTTGGAGGCAGTGGAGGGACATCTCTCATATCCTCTGCGTTAAAATCTCTTCCAATTGGAATCCCTAAGCTTATCGTCTC
AACCGTTGCTAGTGGTCAGACAGAATCTTATATCGAGACATCCGATCTGATACTCATCCCGTCTATAGTGGATGTGTGCGGGATTAATAGTGTCAGTAGGGTCGTCTTAT
CAAATGCTGGTGCTGCATTTGCTGGTATGGTGGTCGGAAGACTCGAGAAGTTTAAAGATTCTCGTAATTTCAATGAAAAACCAACAGTGGGTTTGACAATGTTTGGAGTT
ACAACTCCTTGTGTAAATTCTGTCAAAGAAAGATTGGCTAAAGAAGGCTACGAGAGCCTTGTTTTTCATGCTACCGGGGTTGGGGGCAAGGCAATGGAATCTCTGGTCAG
AGAGGGATTTATCCAGGGAGTCTTGGACATCACAACAACAGAGGTTGCAGACTATTTAGTGGGAGGTGTCATGGCTTGTGGCAGTTCCCGCTTTGATGCCATTATAGAGA
AGAGAATCCCACTAGTCCTAAGTGTAGGAGCACTGGATATGGTGAACTTTGGATCCAGAGATGCAATACCTTCTAATTTTCACGGAAGGAATATATATGAACATAACAAG
CAGGTTTCGCTGATGCGAACTACAGTGGATGAGAACAAAAAAATCGCTCATTTTATAGCTGATAAGATAAACAATTCATTGGTGAAGGTTCGTGTGTGCCTGCCGCAGAA
TGGTGTATCTGCTCTGGATGCACCAGGGAAGCCATTTTATGATCCTGAAGCTACTTCTACTCTTTTAGATGAACTACAGAGGCACATTCAGTCAAATGATGATAGGCAGG
TGAAGGTGTATCCTTATCATATTAACGATCTTGAGTTCGCCGAGGTATTGGTCAACTCATTCTTGGAAATTACTTCAAAAGAGACGAAGGACTTGTGTGACCCAAAAATT
GTTTCAGTTGAAATTAGTCAAGACCTTCAAGAGAACTCCATTTCTGAGTCAAATTTATCTGTTCATGGAAGCATTATCTATAGTCCTAGTGATTTCCCAGATGCAAGACC
AGTAACTTTGAGAAGAACACGTATGATATTGGAGAACCTGAAAGCTCAAATAGTTGAAGGAGTGCCCATAATAGGGGCTGGTGCGGGGACTGGCATATCTGCCAAGTTTG
AAGAAGCTGGTGGCGTTGATCTGATAGTGGTGTACAATTCGGGACGCTTTCGCATGGCTGGGAGGGGTTCTTTAGCAGGCTTACTACCTTTTGCCGATGCTAATGCCATA
GTGCTCGAGATGGCCAATGAAGTTCTGCCATTAAACCTTGTGATATATCCAGTTGGACTGGAGAGCTTGAATGCGGAGCATTATATCAGTAGATTACTTAGGGACTCTCC
CAATCTCCTGAGTGATGGAGTTTGTACATTCGAGTATGCTGATGTGGTGAACACAGTTCCCGTGCTTGCTGGAGTATGTGCTTCTGATCCATTTCGTCGAATGGATTACT
TCCTAAAGCAGGTGGAGTCAATTGGATTTTCTGGAGTGCAGAACTTTCCTACTGTTGGATTATTTGATGGTAACTTCAGACAGAACCTTGAAGAAACAGGAATGGGATAT
GGATTGGAGGTCAAGATGATTGAAAGAGCACACAAAATGGGTCTCTTGACAACCCCATATGCTTTTAACCAAGATGAAGCCTTGGAAATGGCAAAAGCTGGTGCAGACAT
TATAGTTGCCCACATGGGGCTCACTACCTCTGGATCTATTGGAGCCAAAACTGCCCTTACAATGGAGGAAAGTGTACTCCGTGTACAGGCAATAGCAGATGCTGCTCACA
GAATCAATCCTAATGTCTTAGTGCTCTGTCATGGAGGTCCTATATCGGGTCCTAGTGAAGCTGCATTCATTCTGAAGAGAACCAAGGGAGTTCATGGATTTTATGGTGCA
TCAAGCATGGAGAGGCTACCAGTTGAACAAGCAATAACTAGCACAGTCCAAGAGTACAAATCAATTTCAATGAGATAA
mRNA sequenceShow/hide mRNA sequence
AGAATACTTGTCATCTTTCACGAGGTCGTAGGTCGAGATTTCTAATACCGCTTCAAGTTGTTTGAGGAGTGAGATTCTAATCCAACGATTCACAGGAATCTTCTTCCACC
TTTTTCATTCACAAAGAAAATACATATTTACATATGGACATGCCCTTTTGCGGAAGAAATAAGAATCCGATTTCCAGATTACCTACTTTTTGCAGTCGGAAAGTTGGGAT
CTCTGAGAAGAAAAAAGATCAAATTAAAAAGGAAAAAAATGGCGACGCGAGGTGATAGTAAGACTCTCCGAGTTTTCTGTATCGCAACGGCTGATTCAAAGCTTGAGGAG
CTCCGATTCATTTCCGATACAGTTCGATCCAACCTCAATAGCTTCTCTAGAAGTTCTTCTTTTAAGGTCGAAGTGACGGTTGTTGATGTTTCTACCGGCCGTATGCATGG
GATCGAGAGTCTGGATGATTTCGTCTTCGTGTCGAGGGAGGATGTTCTCTCTTGCTATGATCGCACTGGAAATGATCTTCCCGATGACCGCGGAAAAGCTCTCTCTATAA
TGAGTAAAGCACTTGAATATTTCCTTAGTAAAGCCCGGGGGGATGGGGTAATTGCTGGAGCTATTGGACTTGGAGGCAGTGGAGGGACATCTCTCATATCCTCTGCGTTA
AAATCTCTTCCAATTGGAATCCCTAAGCTTATCGTCTCAACCGTTGCTAGTGGTCAGACAGAATCTTATATCGAGACATCCGATCTGATACTCATCCCGTCTATAGTGGA
TGTGTGCGGGATTAATAGTGTCAGTAGGGTCGTCTTATCAAATGCTGGTGCTGCATTTGCTGGTATGGTGGTCGGAAGACTCGAGAAGTTTAAAGATTCTCGTAATTTCA
ATGAAAAACCAACAGTGGGTTTGACAATGTTTGGAGTTACAACTCCTTGTGTAAATTCTGTCAAAGAAAGATTGGCTAAAGAAGGCTACGAGAGCCTTGTTTTTCATGCT
ACCGGGGTTGGGGGCAAGGCAATGGAATCTCTGGTCAGAGAGGGATTTATCCAGGGAGTCTTGGACATCACAACAACAGAGGTTGCAGACTATTTAGTGGGAGGTGTCAT
GGCTTGTGGCAGTTCCCGCTTTGATGCCATTATAGAGAAGAGAATCCCACTAGTCCTAAGTGTAGGAGCACTGGATATGGTGAACTTTGGATCCAGAGATGCAATACCTT
CTAATTTTCACGGAAGGAATATATATGAACATAACAAGCAGGTTTCGCTGATGCGAACTACAGTGGATGAGAACAAAAAAATCGCTCATTTTATAGCTGATAAGATAAAC
AATTCATTGGTGAAGGTTCGTGTGTGCCTGCCGCAGAATGGTGTATCTGCTCTGGATGCACCAGGGAAGCCATTTTATGATCCTGAAGCTACTTCTACTCTTTTAGATGA
ACTACAGAGGCACATTCAGTCAAATGATGATAGGCAGGTGAAGGTGTATCCTTATCATATTAACGATCTTGAGTTCGCCGAGGTATTGGTCAACTCATTCTTGGAAATTA
CTTCAAAAGAGACGAAGGACTTGTGTGACCCAAAAATTGTTTCAGTTGAAATTAGTCAAGACCTTCAAGAGAACTCCATTTCTGAGTCAAATTTATCTGTTCATGGAAGC
ATTATCTATAGTCCTAGTGATTTCCCAGATGCAAGACCAGTAACTTTGAGAAGAACACGTATGATATTGGAGAACCTGAAAGCTCAAATAGTTGAAGGAGTGCCCATAAT
AGGGGCTGGTGCGGGGACTGGCATATCTGCCAAGTTTGAAGAAGCTGGTGGCGTTGATCTGATAGTGGTGTACAATTCGGGACGCTTTCGCATGGCTGGGAGGGGTTCTT
TAGCAGGCTTACTACCTTTTGCCGATGCTAATGCCATAGTGCTCGAGATGGCCAATGAAGTTCTGCCATTAAACCTTGTGATATATCCAGTTGGACTGGAGAGCTTGAAT
GCGGAGCATTATATCAGTAGATTACTTAGGGACTCTCCCAATCTCCTGAGTGATGGAGTTTGTACATTCGAGTATGCTGATGTGGTGAACACAGTTCCCGTGCTTGCTGG
AGTATGTGCTTCTGATCCATTTCGTCGAATGGATTACTTCCTAAAGCAGGTGGAGTCAATTGGATTTTCTGGAGTGCAGAACTTTCCTACTGTTGGATTATTTGATGGTA
ACTTCAGACAGAACCTTGAAGAAACAGGAATGGGATATGGATTGGAGGTCAAGATGATTGAAAGAGCACACAAAATGGGTCTCTTGACAACCCCATATGCTTTTAACCAA
GATGAAGCCTTGGAAATGGCAAAAGCTGGTGCAGACATTATAGTTGCCCACATGGGGCTCACTACCTCTGGATCTATTGGAGCCAAAACTGCCCTTACAATGGAGGAAAG
TGTACTCCGTGTACAGGCAATAGCAGATGCTGCTCACAGAATCAATCCTAATGTCTTAGTGCTCTGTCATGGAGGTCCTATATCGGGTCCTAGTGAAGCTGCATTCATTC
TGAAGAGAACCAAGGGAGTTCATGGATTTTATGGTGCATCAAGCATGGAGAGGCTACCAGTTGAACAAGCAATAACTAGCACAGTCCAAGAGTACAAATCAATTTCAATG
AGATAAAATTTGAAGTGCGGTCCTTCACGAGCTTGATACTTGTCATGTATATTATAACTAGTACTAACTTCTCACACACATGTGATGGGTGAAAATTGTGGCCAATCGTA
TCTTTTAGCTTCGATATGTTTCAGTTTTTTTTTTTTGGTTTGTTGCATTATAAGTATCAAATTTGGACCTTAACCAAGACGGAGCAAAGATTTTATTGTCCTTGACGTTA
TGCAGATGCTTAGCCTCCTCTTAATCTTCTACTCAAGCGACAACGACACTAGTGTAGAT
Protein sequenceShow/hide protein sequence
MATRGDSKTLRVFCIATADSKLEELRFISDTVRSNLNSFSRSSSFKVEVTVVDVSTGRMHGIESLDDFVFVSREDVLSCYDRTGNDLPDDRGKALSIMSKALEYFLSKAR
GDGVIAGAIGLGGSGGTSLISSALKSLPIGIPKLIVSTVASGQTESYIETSDLILIPSIVDVCGINSVSRVVLSNAGAAFAGMVVGRLEKFKDSRNFNEKPTVGLTMFGV
TTPCVNSVKERLAKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLVGGVMACGSSRFDAIIEKRIPLVLSVGALDMVNFGSRDAIPSNFHGRNIYEHNK
QVSLMRTTVDENKKIAHFIADKINNSLVKVRVCLPQNGVSALDAPGKPFYDPEATSTLLDELQRHIQSNDDRQVKVYPYHINDLEFAEVLVNSFLEITSKETKDLCDPKI
VSVEISQDLQENSISESNLSVHGSIIYSPSDFPDARPVTLRRTRMILENLKAQIVEGVPIIGAGAGTGISAKFEEAGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAI
VLEMANEVLPLNLVIYPVGLESLNAEHYISRLLRDSPNLLSDGVCTFEYADVVNTVPVLAGVCASDPFRRMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGY
GLEVKMIERAHKMGLLTTPYAFNQDEALEMAKAGADIIVAHMGLTTSGSIGAKTALTMEESVLRVQAIADAAHRINPNVLVLCHGGPISGPSEAAFILKRTKGVHGFYGA
SSMERLPVEQAITSTVQEYKSISMR