; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0004618 (gene) of Chayote v1 genome

Gene IDSed0004618
OrganismSechium edule (Chayote v1)
Descriptionprotein HLB1-like isoform X2
Genome locationLG04:8533836..8548576
RNA-Seq ExpressionSed0004618
SyntenySed0004618
Gene Ontology termsGO:0006887 - exocytosis (biological process)
GO:0048768 - root hair cell tip growth (biological process)
GO:0005769 - early endosome (cellular component)
GO:0005802 - trans-Golgi network (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004146133.1 protein HLB1 isoform X1 [Cucumis sativus]1.3e-26687.48Show/hide
Query:  MSPTPEEPNNLQNGIETESHISSESERADERRSHP-ETLADTIPNAGLRPEQESESV-NEEPDSEPENRGQQPSESVRLQVVMDVADP-----KETSTPS
        MSPTPEEPNNLQNGIE + HISSES++  E RS P E   D+IP++ L+ E+ESESV N  PDSEPE+  +Q SES+ L VV  V DP     KETSTPS
Subjt:  MSPTPEEPNNLQNGIETESHISSESERADERRSHP-ETLADTIPNAGLRPEQESESV-NEEPDSEPENRGQQPSESVRLQVVMDVADP-----KETSTPS

Query:  NG-TDNSQPALRKDEGSRTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAKRYAS
        NG T+N QPALRKDEGSRTFTMRELLNGLKGEDGSDS+N SEG+ PE NS YSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAA+RYAS
Subjt:  NG-TDNSQPALRKDEGSRTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAKRYAS

Query:  AIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
        AIERN QDYDALYNWALVLQESADNVSPDST+PSKDALLEEACKKYDEATH CPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
Subjt:  AIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN

Query:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDASPNDLYSQSAIYIASAHALKPNYSV
        SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFR AIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+GNVKD SPN+LYSQSAIYIA+AHALKPNYSV
Subjt:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDASPNDLYSQSAIYIASAHALKPNYSV

Query:  YSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHEVLQKLKIGGEQVQSSPSVLGRSGSTLNGGDRTVKVEIPDIVSVSACADLTLPP
        YSSALRLVRSMLPLPYLKVGYLTAPP+GRPLAPHSDWKRSQFFLNH+VLQKL IGGEQ+Q+SPS+LGRSGSTLN GDRT+KVEIPDIVSVSACADLTLPP
Subjt:  YSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHEVLQKLKIGGEQVQSSPSVLGRSGSTLNGGDRTVKVEIPDIVSVSACADLTLPP

Query:  GAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG
        GAGLCIDTIHGP+FLVADSWD LD WLDAIRLVYTIYARGKNEVLAGI+TG
Subjt:  GAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG

XP_008448563.1 PREDICTED: uncharacterized protein LOC103490705 isoform X1 [Cucumis melo]2.8e-26486.75Show/hide
Query:  MSPTPEEPNNLQNGIETESHISSESERADERRSH-PETLADTIPNAGLRPEQESESV-NEEPDSEPENRGQQPSESVRLQVVMDVADP-----KETSTPS
        MSPTPEEPNNLQNGIE + HISSES++  E RS   E  AD+IP++ L+ E+ESESV N   DSEPE+  +Q SES+ L VV  V DP     KETSTP 
Subjt:  MSPTPEEPNNLQNGIETESHISSESERADERRSH-PETLADTIPNAGLRPEQESESV-NEEPDSEPENRGQQPSESVRLQVVMDVADP-----KETSTPS

Query:  NG-TDNSQPALRKDEGSRTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAKRYAS
        NG T+N QPALRKDEGSRTFTMRELLNGLKGEDGSD +N SEG+ PE NS +SLNQDSPHQPYSEQSRAAMELINS+TGVDEEGRSRQRILTFAA+RYAS
Subjt:  NG-TDNSQPALRKDEGSRTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAKRYAS

Query:  AIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
        AIERN QDYDALYNWALVLQESADNVSPDST+PSKDALLEEACKKYDEATH CPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
Subjt:  AIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN

Query:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDASPNDLYSQSAIYIASAHALKPNYSV
        SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFR AIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN+KD SPN+LYSQSAIYIA+AHALKPNYSV
Subjt:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDASPNDLYSQSAIYIASAHALKPNYSV

Query:  YSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHEVLQKLKIGGEQVQSSPSVLGRSGSTLNGGDRTVKVEIPDIVSVSACADLTLPP
        YSSALRLVRSMLPLPYLKVGYLTAPP+GRPLAPHSDWKRSQFFLNH+VLQKL IGGEQ+Q+SPS LGRSGSTLN GDRT+KVEIPDIVSVSACADLTLPP
Subjt:  YSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHEVLQKLKIGGEQVQSSPSVLGRSGSTLNGGDRTVKVEIPDIVSVSACADLTLPP

Query:  GAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG
        GAGLCIDTIHGP+FLVADSWDALD WLDAIRLVYTIYARGKNEVLAGI+TG
Subjt:  GAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG

XP_022965252.1 protein HLB1-like isoform X2 [Cucurbita maxima]3.2e-26084.96Show/hide
Query:  MSPTPEEPNNLQNGIETESHISSESERADERRSHPETLADTIPNAGLRPEQESESVNEEPDSEPENRGQQP----SESVRLQVVMDVAD-----PKETST
        MSP PEEPNNLQNGIE E HIS ES +  E +S PE+ AD IP A L+ E+ESESVN   DSEP++    P    SES+ LQVV DV D     PK TS 
Subjt:  MSPTPEEPNNLQNGIETESHISSESERADERRSHPETLADTIPNAGLRPEQESESVNEEPDSEPENRGQQP----SESVRLQVVMDVAD-----PKETST

Query:  PSNGTDNSQPALRKDEGSRTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAKRYA
         SNG +NSQPALRKDEGSRTFTMRELLNGLK EDG+DS+N SEG+ PE NS YSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAA+RYA
Subjt:  PSNGTDNSQPALRKDEGSRTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAKRYA

Query:  SAIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNW
        SAIERN QDYDALYNWALVLQESADNVSPDST+PSKDALLEEACKKYDEAT  CPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQAT+NYEKAVQLNW
Subjt:  SAIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNW

Query:  NSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDASPNDLYSQSAIYIASAHALKPNYS
        NSPQALNNWGLALQELSAIVPAREK TIVKTAISKFR AIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG VKD SPN+LYSQSAIYIA+AHALKP+YS
Subjt:  NSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDASPNDLYSQSAIYIASAHALKPNYS

Query:  VYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHEVLQKLKIGGEQVQSSPSVLGRSGSTLNGGDRTVKVEIPDIVSVSACADLTLP
        VYSSALRLVRSMLPLPYLKVGYLTAPP+GRP APH DWKRSQFFLNH+VLQKL IGGEQ+Q+SP++LGRSGSTLN GDRT+KVEIPDIVSVSACADLTLP
Subjt:  VYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHEVLQKLKIGGEQVQSSPSVLGRSGSTLNGGDRTVKVEIPDIVSVSACADLTLP

Query:  PGAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG
        PGAGLCIDTIHG +FLVADSWDALD WLDAIRLVYTIYARGKNEVLAGI+ G
Subjt:  PGAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG

XP_023552571.1 protein HLB1-like [Cucurbita pepo subsp. pepo]1.0e-25881.38Show/hide
Query:  MSPTPEEPNNLQNGIETESHISSESERADERRSHPETLADTIPNAGLRPEQESESVN--------------------------------EEPDSEPENRG
        MSPTPEEPNNLQNGIE ESHIS ES +  E +S PE+ AD +P A L+ E++SESVN                                 EP SE ++  
Subjt:  MSPTPEEPNNLQNGIETESHISSESERADERRSHPETLADTIPNAGLRPEQESESVN--------------------------------EEPDSEPENRG

Query:  QQPSESVRLQVVMDVAD-----PKETSTPSNGTDNSQPALRKDEGSRTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAM
        +Q SES+ LQVV DV D     PK TS  SNGT+NSQPALRKDEGSRTFTMRELLNGLK EDG+DS+N SEG+ PE NS YSLNQDSPHQPYSEQSRAAM
Subjt:  QQPSESVRLQVVMDVAD-----PKETSTPSNGTDNSQPALRKDEGSRTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAM

Query:  ELINSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRA
        ELINSVTGVDEEGRSRQRILTFAA+RYASAIERN QDYDALYNWALVLQESADNVSPDST+PSKDALLEEACKKYDEAT  CPTLHDAFYNWAIAISDRA
Subjt:  ELINSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRA

Query:  KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN
        KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFR AIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG 
Subjt:  KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN

Query:  VKDASPNDLYSQSAIYIASAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHEVLQKLKIGGEQVQSSPSVLGRSGS
        VKD SPN+LYSQSAIYIA+AHALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPP+GRP APH DWKRSQFFLNH+VLQKL IGGEQ Q+SP++LGRSGS
Subjt:  VKDASPNDLYSQSAIYIASAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHEVLQKLKIGGEQVQSSPSVLGRSGS

Query:  TLNGGDRTVKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG
        TLN GDRT+KVEIPDIVSVSACADLTLPPGAGLCIDTIHG +FLVADSWDALD WLDAIRLVYTIYARGKNEVLAGI+ G
Subjt:  TLNGGDRTVKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG

XP_038876586.1 protein HLB1 [Benincasa hispida]1.1e-26386.73Show/hide
Query:  MSPTPEEPNNLQNGIETESHISSESERADERRSHPETLADTIPNAGLRPEQESESVNE-EPDSEPENRGQQPSESVRLQVVMDVADP-----KETSTPSN
        MSPTPEEPNNLQNGIE + HIS ES++  E RS PE  AD I ++ L  E+ESESVN    DSEP +R +Q  ES+ LQV  DVADP     KETS PSN
Subjt:  MSPTPEEPNNLQNGIETESHISSESERADERRSHPETLADTIPNAGLRPEQESESVNE-EPDSEPENRGQQPSESVRLQVVMDVADP-----KETSTPSN

Query:  G-TDNSQPALRKDEGSRTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAKRYASA
        G T+NS+PALRKDEGSRTFTMRELLNGLKGEDG+DS+N SEG+ PE N  YSLNQDSPHQPYSEQSRAAMELI+SVTGVDEEGRSRQRILTFAA+RYASA
Subjt:  G-TDNSQPALRKDEGSRTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAKRYASA

Query:  IERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNS
        IERN QDYDALYNWALVLQESADNVSPDST+PSKDALLEEACKKYDEAT  CPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNS
Subjt:  IERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNS

Query:  PQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDASPNDLYSQSAIYIASAHALKPNYSVY
        PQALNNWGLALQELSAIVPAREKQTIVKTAISKFR AIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKD SPN+LYSQSAIYIA+AHALKPNYSVY
Subjt:  PQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDASPNDLYSQSAIYIASAHALKPNYSVY

Query:  SSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHEVLQKLKIGGEQVQSSPSVLGRSGSTLNGGDRTVKVEIPDIVSVSACADLTLPPG
        SSALRLVRSMLPLPYLKVGYLTAPP+GRPLAPH DWKRSQFFLNH+VLQKL IGGEQ+Q+SPS+LGRSGSTLN GD T+KVEIPDIVSVSACADLTLPPG
Subjt:  SSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHEVLQKLKIGGEQVQSSPSVLGRSGSTLNGGDRTVKVEIPDIVSVSACADLTLPPG

Query:  AGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG
        AGLCIDTIHGPVFLVADSWDALD WLDAIRLVYTIYARGKNEVLAGI+TG
Subjt:  AGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG

TrEMBL top hitse value%identityAlignment
A0A0A0L688 Uncharacterized protein6.5e-26787.48Show/hide
Query:  MSPTPEEPNNLQNGIETESHISSESERADERRSHP-ETLADTIPNAGLRPEQESESV-NEEPDSEPENRGQQPSESVRLQVVMDVADP-----KETSTPS
        MSPTPEEPNNLQNGIE + HISSES++  E RS P E   D+IP++ L+ E+ESESV N  PDSEPE+  +Q SES+ L VV  V DP     KETSTPS
Subjt:  MSPTPEEPNNLQNGIETESHISSESERADERRSHP-ETLADTIPNAGLRPEQESESV-NEEPDSEPENRGQQPSESVRLQVVMDVADP-----KETSTPS

Query:  NG-TDNSQPALRKDEGSRTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAKRYAS
        NG T+N QPALRKDEGSRTFTMRELLNGLKGEDGSDS+N SEG+ PE NS YSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAA+RYAS
Subjt:  NG-TDNSQPALRKDEGSRTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAKRYAS

Query:  AIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
        AIERN QDYDALYNWALVLQESADNVSPDST+PSKDALLEEACKKYDEATH CPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
Subjt:  AIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN

Query:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDASPNDLYSQSAIYIASAHALKPNYSV
        SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFR AIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+GNVKD SPN+LYSQSAIYIA+AHALKPNYSV
Subjt:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDASPNDLYSQSAIYIASAHALKPNYSV

Query:  YSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHEVLQKLKIGGEQVQSSPSVLGRSGSTLNGGDRTVKVEIPDIVSVSACADLTLPP
        YSSALRLVRSMLPLPYLKVGYLTAPP+GRPLAPHSDWKRSQFFLNH+VLQKL IGGEQ+Q+SPS+LGRSGSTLN GDRT+KVEIPDIVSVSACADLTLPP
Subjt:  YSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHEVLQKLKIGGEQVQSSPSVLGRSGSTLNGGDRTVKVEIPDIVSVSACADLTLPP

Query:  GAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG
        GAGLCIDTIHGP+FLVADSWD LD WLDAIRLVYTIYARGKNEVLAGI+TG
Subjt:  GAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG

A0A1S3BJC9 uncharacterized protein LOC103490705 isoform X11.4e-26486.75Show/hide
Query:  MSPTPEEPNNLQNGIETESHISSESERADERRSH-PETLADTIPNAGLRPEQESESV-NEEPDSEPENRGQQPSESVRLQVVMDVADP-----KETSTPS
        MSPTPEEPNNLQNGIE + HISSES++  E RS   E  AD+IP++ L+ E+ESESV N   DSEPE+  +Q SES+ L VV  V DP     KETSTP 
Subjt:  MSPTPEEPNNLQNGIETESHISSESERADERRSH-PETLADTIPNAGLRPEQESESV-NEEPDSEPENRGQQPSESVRLQVVMDVADP-----KETSTPS

Query:  NG-TDNSQPALRKDEGSRTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAKRYAS
        NG T+N QPALRKDEGSRTFTMRELLNGLKGEDGSD +N SEG+ PE NS +SLNQDSPHQPYSEQSRAAMELINS+TGVDEEGRSRQRILTFAA+RYAS
Subjt:  NG-TDNSQPALRKDEGSRTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAKRYAS

Query:  AIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
        AIERN QDYDALYNWALVLQESADNVSPDST+PSKDALLEEACKKYDEATH CPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
Subjt:  AIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN

Query:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDASPNDLYSQSAIYIASAHALKPNYSV
        SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFR AIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN+KD SPN+LYSQSAIYIA+AHALKPNYSV
Subjt:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDASPNDLYSQSAIYIASAHALKPNYSV

Query:  YSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHEVLQKLKIGGEQVQSSPSVLGRSGSTLNGGDRTVKVEIPDIVSVSACADLTLPP
        YSSALRLVRSMLPLPYLKVGYLTAPP+GRPLAPHSDWKRSQFFLNH+VLQKL IGGEQ+Q+SPS LGRSGSTLN GDRT+KVEIPDIVSVSACADLTLPP
Subjt:  YSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHEVLQKLKIGGEQVQSSPSVLGRSGSTLNGGDRTVKVEIPDIVSVSACADLTLPP

Query:  GAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG
        GAGLCIDTIHGP+FLVADSWDALD WLDAIRLVYTIYARGKNEVLAGI+TG
Subjt:  GAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG

A0A6J1EA05 protein HLB1-like1.2e-25781.03Show/hide
Query:  MSPTPEEPNNLQNGIETESHISSESERADERRSHPETLADTIPNAGLRPEQESESVNEEPDSEPENR--------------------------------G
        MSPTPEEPNNLQNGIE E HIS ES +  E +S PE+ AD +P A L+ E+E ESVN   D EP++                                  
Subjt:  MSPTPEEPNNLQNGIETESHISSESERADERRSHPETLADTIPNAGLRPEQESESVNEEPDSEPENR--------------------------------G

Query:  QQPSESVRLQVVMDVAD-----PKETSTPSNGTDNSQPALRKDEGSRTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAM
        +Q SES++LQV  DVAD     PK TS  SNGT+NSQPALRKDEGSRTFTMRELLNGLK EDG+DS+N SEG+ PE NS YSLNQDSPHQPYSEQSRAAM
Subjt:  QQPSESVRLQVVMDVAD-----PKETSTPSNGTDNSQPALRKDEGSRTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAM

Query:  ELINSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRA
        ELINSVTGVDEEGRSRQRILTFAA+RYASAIERN QDYDALYNWALVLQESADNVSPDST+PSKDALLEEACKKYDEAT  CPTLHDAFYNWAIAISDRA
Subjt:  ELINSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRA

Query:  KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN
        KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFR AIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG 
Subjt:  KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN

Query:  VKDASPNDLYSQSAIYIASAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHEVLQKLKIGGEQVQSSPSVLGRSGS
        VKD SPN+LYSQSAIYIA+AHALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPP+GRP APH DWKRSQFFLNH+VLQKL IGGEQ Q+SP++LGRSGS
Subjt:  VKDASPNDLYSQSAIYIASAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHEVLQKLKIGGEQVQSSPSVLGRSGS

Query:  TLNGGDRTVKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG
        TLN GDRT+KVEIPDIVSVSACADLTLPPGAGLCIDTIHG +FLVADSWDALD WLDAIRLVYTIYARGKNEVLAGI+ G
Subjt:  TLNGGDRTVKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG

A0A6J1HJU5 protein HLB1-like isoform X21.5e-26084.96Show/hide
Query:  MSPTPEEPNNLQNGIETESHISSESERADERRSHPETLADTIPNAGLRPEQESESVNEEPDSEPENRGQQP----SESVRLQVVMDVAD-----PKETST
        MSP PEEPNNLQNGIE E HIS ES +  E +S PE+ AD IP A L+ E+ESESVN   DSEP++    P    SES+ LQVV DV D     PK TS 
Subjt:  MSPTPEEPNNLQNGIETESHISSESERADERRSHPETLADTIPNAGLRPEQESESVNEEPDSEPENRGQQP----SESVRLQVVMDVAD-----PKETST

Query:  PSNGTDNSQPALRKDEGSRTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAKRYA
         SNG +NSQPALRKDEGSRTFTMRELLNGLK EDG+DS+N SEG+ PE NS YSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAA+RYA
Subjt:  PSNGTDNSQPALRKDEGSRTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAKRYA

Query:  SAIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNW
        SAIERN QDYDALYNWALVLQESADNVSPDST+PSKDALLEEACKKYDEAT  CPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQAT+NYEKAVQLNW
Subjt:  SAIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNW

Query:  NSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDASPNDLYSQSAIYIASAHALKPNYS
        NSPQALNNWGLALQELSAIVPAREK TIVKTAISKFR AIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG VKD SPN+LYSQSAIYIA+AHALKP+YS
Subjt:  NSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDASPNDLYSQSAIYIASAHALKPNYS

Query:  VYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHEVLQKLKIGGEQVQSSPSVLGRSGSTLNGGDRTVKVEIPDIVSVSACADLTLP
        VYSSALRLVRSMLPLPYLKVGYLTAPP+GRP APH DWKRSQFFLNH+VLQKL IGGEQ+Q+SP++LGRSGSTLN GDRT+KVEIPDIVSVSACADLTLP
Subjt:  VYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHEVLQKLKIGGEQVQSSPSVLGRSGSTLNGGDRTVKVEIPDIVSVSACADLTLP

Query:  PGAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG
        PGAGLCIDTIHG +FLVADSWDALD WLDAIRLVYTIYARGKNEVLAGI+ G
Subjt:  PGAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG

A0A6J1KVY8 protein HLB1-like isoform X12.1e-25784.48Show/hide
Query:  MSPTPEEPNNLQNGIETESHISSESERADERRSHPETLADTIPNAGLRPEQESES------VNEEPDSEPENRGQQPSESVRLQVVMDVADP-----KET
        MS TPEEPNNLQNGI TE  ISSESE+ DE RS PE +AD IP A  + E+ESES         E +SE  +R +Q SES+ LQVV +V+DP     K T
Subjt:  MSPTPEEPNNLQNGIETESHISSESERADERRSHPETLADTIPNAGLRPEQESES------VNEEPDSEPENRGQQPSESVRLQVVMDVADP-----KET

Query:  STPSNGTDNSQPALRKDEGSRTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAKR
        S PSNG +NSQP LRKDEGSRTFTMRELLNGLKGEDG+DSVN SEG+ P+    YSLNQDSP QPYSEQSRAAMELI+SVTGVDEEGRSRQRILTFAAKR
Subjt:  STPSNGTDNSQPALRKDEGSRTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAKR

Query:  YASAIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQL
        YASAIERNAQDYDALYNWALVLQESADNVSPDST+PSKDALLEEACKKYDEAT  CPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQL
Subjt:  YASAIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQL

Query:  NWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDASPNDLYSQSAIYIASAHALKPN
        NWNSPQALNNWGLALQELSAIVPAREKQTIV+TAISKFR AIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN KD SPN+LYSQSAIYIA+AHALKP+
Subjt:  NWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDASPNDLYSQSAIYIASAHALKPN

Query:  YSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHEVLQKLKIGGEQVQSSPSVLGRSGSTLNGGDRTVKVEIPDIVSVSACADLT
        YSVYSSALRLVRSMLPLPYLKVGYLTAPP+G+PLAPHSDWKRSQ+FLNH+VLQKLKIGGEQ+Q+SP+ LGRSGSTLN GD  +KVEIPDIVSVSACADLT
Subjt:  YSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHEVLQKLKIGGEQVQSSPSVLGRSGSTLNGGDRTVKVEIPDIVSVSACADLT

Query:  LPPGAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG
        LPPGAGLCIDTIHGPVFLVADSWDALD WLDA+RLVYTIYARGKN+VLAGI TG
Subjt:  LPPGAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG

SwissProt top hitse value%identityAlignment
Q9FHY8 Protein HLB11.6e-17460.9Show/hide
Query:  MSPTPEEPNNLQNGI-----ETESHISSESERADERR---SHPETLADTIPN------AGLRPEQESESVNEE------PDSEPEN-RGQQPSESVRLQV
        M+ T EEP  LQNG      ETE +   E +   E +     PE  AD  P          +PE+    V  E       D++PE  + +   E V+  V
Subjt:  MSPTPEEPNNLQNGI-----ETESHISSESERADERR---SHPETLADTIPN------AGLRPEQESESVNEE------PDSEPEN-RGQQPSESVRLQV

Query:  V--------MDVADPKETSTPSNGTDNSQPAL-----RKDEGSRTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELI
                 +D++       P   T+  Q +      + D+G++TFTMRELL+ LK E         EGD    +SA   +++S  QP   ++  AM+LI
Subjt:  V--------MDVADPKETSTPSNGTDNSQPAL-----RKDEGSRTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELI

Query:  NSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRAKMR
        N +   DEEGRSRQR+L FAA++YASAIERN  D+DALYNWAL+LQESADNVSPDS +PSKD LLEEACKKYDEAT  CPTL+DA+YNWAIAISDRAK+R
Subjt:  NSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRAKMR

Query:  GRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKD
        GRTKEAEELW+QA  NYEKAVQLNWNS QALNNWGL LQELS IVPAREK+ +V+TAISKFR AI+LQFDFHRAIYNLGTVLYGLAEDTLRTGG+GN KD
Subjt:  GRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKD

Query:  ASPNDLYSQSAIYIASAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHE-VLQKLKIGGEQVQSSPSVLGRSGSTL
          P +LYSQSAIYIA+AH+LKP+YSVYSSALRLVRSMLPLP+LKVGYLTAPP+G  LAPHSDWKR++F LNHE +LQ LK    ++  + S    + ST 
Subjt:  ASPNDLYSQSAIYIASAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHE-VLQKLKIGGEQVQSSPSVLGRSGSTL

Query:  NGGDRTVKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG
        N   +TVKV I +IVSV+ CADLTLPPGAGLCIDTIHGPVFLVADSW++LD WLDAIRLVYTIYARGK++VLAGI+TG
Subjt:  NGGDRTVKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG

Arabidopsis top hitse value%identityAlignment
AT5G41950.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.2e-17560.9Show/hide
Query:  MSPTPEEPNNLQNGI-----ETESHISSESERADERR---SHPETLADTIPN------AGLRPEQESESVNEE------PDSEPEN-RGQQPSESVRLQV
        M+ T EEP  LQNG      ETE +   E +   E +     PE  AD  P          +PE+    V  E       D++PE  + +   E V+  V
Subjt:  MSPTPEEPNNLQNGI-----ETESHISSESERADERR---SHPETLADTIPN------AGLRPEQESESVNEE------PDSEPEN-RGQQPSESVRLQV

Query:  V--------MDVADPKETSTPSNGTDNSQPAL-----RKDEGSRTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELI
                 +D++       P   T+  Q +      + D+G++TFTMRELL+ LK E         EGD    +SA   +++S  QP   ++  AM+LI
Subjt:  V--------MDVADPKETSTPSNGTDNSQPAL-----RKDEGSRTFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELI

Query:  NSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRAKMR
        N +   DEEGRSRQR+L FAA++YASAIERN  D+DALYNWAL+LQESADNVSPDS +PSKD LLEEACKKYDEAT  CPTL+DA+YNWAIAISDRAK+R
Subjt:  NSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVSPDSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRAKMR

Query:  GRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKD
        GRTKEAEELW+QA  NYEKAVQLNWNS QALNNWGL LQELS IVPAREK+ +V+TAISKFR AI+LQFDFHRAIYNLGTVLYGLAEDTLRTGG+GN KD
Subjt:  GRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKD

Query:  ASPNDLYSQSAIYIASAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHE-VLQKLKIGGEQVQSSPSVLGRSGSTL
          P +LYSQSAIYIA+AH+LKP+YSVYSSALRLVRSMLPLP+LKVGYLTAPP+G  LAPHSDWKR++F LNHE +LQ LK    ++  + S    + ST 
Subjt:  ASPNDLYSQSAIYIASAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHE-VLQKLKIGGEQVQSSPSVLGRSGSTL

Query:  NGGDRTVKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG
        N   +TVKV I +IVSV+ CADLTLPPGAGLCIDTIHGPVFLVADSW++LD WLDAIRLVYTIYARGK++VLAGI+TG
Subjt:  NGGDRTVKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCCTACTCCCGAGGAACCCAACAATTTGCAGAACGGAATCGAAACCGAATCGCACATTTCTTCGGAATCAGAGCGAGCTGACGAACGCAGATCGCACCCAGAAAC
CCTAGCAGATACAATCCCCAATGCCGGATTACGGCCAGAACAAGAATCGGAATCAGTCAACGAAGAACCAGATTCCGAGCCGGAGAATCGAGGGCAGCAGCCGTCGGAGT
CAGTCCGGTTACAGGTTGTGATGGATGTTGCAGATCCGAAGGAAACCTCGACCCCGTCCAACGGCACCGATAACTCGCAACCTGCGCTGCGTAAAGACGAAGGAAGCCGC
ACGTTTACGATGAGAGAGTTGCTGAACGGATTGAAGGGTGAAGATGGCAGCGACAGCGTTAATGGATCTGAAGGCGACGTGCCCGAGCCCAACTCCGCTTACAGTCTTAA
TCAAGATAGCCCACATCAGCCTTATTCTGAACAGAGCAGAGCTGCCATGGAGTTGATCAACAGTGTTACAGGGGTTGATGAAGAAGGCCGTTCTCGCCAACGGATTCTCA
CATTCGCTGCTAAGAGATATGCTAGTGCAATTGAGAGAAATGCCCAAGACTATGATGCTCTATATAATTGGGCTTTGGTTCTGCAGGAGAGTGCAGACAATGTTAGTCCA
GATTCTACTACACCTTCTAAAGATGCATTGCTTGAGGAGGCTTGTAAAAAGTACGATGAGGCTACCCATTTTTGCCCAACACTTCATGATGCTTTTTACAATTGGGCTAT
TGCAATCTCTGATCGAGCCAAAATGCGTGGTCGTACAAAGGAGGCTGAAGAACTGTGGAAGCAGGCTACCAAAAATTATGAAAAAGCTGTCCAACTCAACTGGAATAGTC
CCCAGGCGCTTAATAATTGGGGGCTTGCTCTTCAGGAACTCAGTGCGATTGTGCCAGCACGAGAAAAACAGACAATTGTAAAAACAGCTATCAGCAAGTTTCGTGGCGCT
ATACAGTTGCAATTTGATTTTCATCGAGCCATCTACAACCTTGGTACTGTTCTGTATGGACTAGCTGAGGACACATTACGGACTGGTGGAACTGGCAACGTCAAGGATGC
TTCCCCCAATGACTTGTACAGCCAATCAGCAATTTATATTGCATCAGCTCATGCTCTAAAACCAAATTACTCTGTTTACAGCAGTGCCTTGCGCTTGGTTCGTTCAATGC
TGCCGTTACCGTATTTAAAAGTTGGATACCTGACAGCACCTCCTCTGGGGAGACCACTTGCTCCTCACAGTGATTGGAAACGTTCACAATTTTTTCTAAATCATGAAGTA
TTGCAAAAGCTTAAAATAGGAGGGGAACAAGTACAATCATCTCCTAGTGTTTTAGGAAGATCTGGAAGTACCTTGAATGGCGGCGATAGGACAGTCAAAGTAGAAATTCC
AGACATTGTCTCTGTATCGGCATGCGCAGATCTAACCTTACCTCCTGGTGCTGGACTCTGCATTGACACAATCCATGGACCAGTTTTCTTGGTTGCTGACTCATGGGATG
CGCTCGATGCATGGCTCGATGCAATTAGACTAGTTTACACTATCTACGCTCGAGGCAAGAACGAAGTTTTGGCTGGCATCTTAACCGGTTGA
mRNA sequenceShow/hide mRNA sequence
GTAGAAGTTATTTTGAATAATTACTCGAAGAGTAATTAGCCGAACAAATTCAACAAAGAAACACCTCATTGGCCAAACACTCCCTAAGCAACTTGTGTGTCACTCATCCA
TGGCGGAGCAATTACAGGAGGGGGATGGAAATGGCTAATTGAGCATTTTCTCTCTCGTCTTTCAGTTTCGATCTTGGCGGATCTTCGATCTGCTCGTTTTTCGGAACATT
GCTTCAATCCCGTTCTCTCCCTTTCCGCCACTTCACCATGTCTCCTACTCCCGAGGAACCCAACAATTTGCAGAACGGAATCGAAACCGAATCGCACATTTCTTCGGAAT
CAGAGCGAGCTGACGAACGCAGATCGCACCCAGAAACCCTAGCAGATACAATCCCCAATGCCGGATTACGGCCAGAACAAGAATCGGAATCAGTCAACGAAGAACCAGAT
TCCGAGCCGGAGAATCGAGGGCAGCAGCCGTCGGAGTCAGTCCGGTTACAGGTTGTGATGGATGTTGCAGATCCGAAGGAAACCTCGACCCCGTCCAACGGCACCGATAA
CTCGCAACCTGCGCTGCGTAAAGACGAAGGAAGCCGCACGTTTACGATGAGAGAGTTGCTGAACGGATTGAAGGGTGAAGATGGCAGCGACAGCGTTAATGGATCTGAAG
GCGACGTGCCCGAGCCCAACTCCGCTTACAGTCTTAATCAAGATAGCCCACATCAGCCTTATTCTGAACAGAGCAGAGCTGCCATGGAGTTGATCAACAGTGTTACAGGG
GTTGATGAAGAAGGCCGTTCTCGCCAACGGATTCTCACATTCGCTGCTAAGAGATATGCTAGTGCAATTGAGAGAAATGCCCAAGACTATGATGCTCTATATAATTGGGC
TTTGGTTCTGCAGGAGAGTGCAGACAATGTTAGTCCAGATTCTACTACACCTTCTAAAGATGCATTGCTTGAGGAGGCTTGTAAAAAGTACGATGAGGCTACCCATTTTT
GCCCAACACTTCATGATGCTTTTTACAATTGGGCTATTGCAATCTCTGATCGAGCCAAAATGCGTGGTCGTACAAAGGAGGCTGAAGAACTGTGGAAGCAGGCTACCAAA
AATTATGAAAAAGCTGTCCAACTCAACTGGAATAGTCCCCAGGCGCTTAATAATTGGGGGCTTGCTCTTCAGGAACTCAGTGCGATTGTGCCAGCACGAGAAAAACAGAC
AATTGTAAAAACAGCTATCAGCAAGTTTCGTGGCGCTATACAGTTGCAATTTGATTTTCATCGAGCCATCTACAACCTTGGTACTGTTCTGTATGGACTAGCTGAGGACA
CATTACGGACTGGTGGAACTGGCAACGTCAAGGATGCTTCCCCCAATGACTTGTACAGCCAATCAGCAATTTATATTGCATCAGCTCATGCTCTAAAACCAAATTACTCT
GTTTACAGCAGTGCCTTGCGCTTGGTTCGTTCAATGCTGCCGTTACCGTATTTAAAAGTTGGATACCTGACAGCACCTCCTCTGGGGAGACCACTTGCTCCTCACAGTGA
TTGGAAACGTTCACAATTTTTTCTAAATCATGAAGTATTGCAAAAGCTTAAAATAGGAGGGGAACAAGTACAATCATCTCCTAGTGTTTTAGGAAGATCTGGAAGTACCT
TGAATGGCGGCGATAGGACAGTCAAAGTAGAAATTCCAGACATTGTCTCTGTATCGGCATGCGCAGATCTAACCTTACCTCCTGGTGCTGGACTCTGCATTGACACAATC
CATGGACCAGTTTTCTTGGTTGCTGACTCATGGGATGCGCTCGATGCATGGCTCGATGCAATTAGACTAGTTTACACTATCTACGCTCGAGGCAAGAACGAAGTTTTGGC
TGGCATCTTAACCGGTTGATTGTTACCAAGTATGCGAATGTATTATTGATATTACCTTGATGTTTATATGATGCTTACTCACAGTCGATTGAGTATTCATTTCTCTAAAT
TGAAACTCCAAATTTTGGGGTGCTTAATACATGTTTTTCTAGTCAGGTTCCTTCTCCATGTGTGTAATAATCTATTATGTACATGTCAGCTTGTTAACAGCACGCACTAG
TGGTGCCGTTTTGGACTTTTGAAAAGAGCTTCGGATTAATTTCATTTGAAAATTCAA
Protein sequenceShow/hide protein sequence
MSPTPEEPNNLQNGIETESHISSESERADERRSHPETLADTIPNAGLRPEQESESVNEEPDSEPENRGQQPSESVRLQVVMDVADPKETSTPSNGTDNSQPALRKDEGSR
TFTMRELLNGLKGEDGSDSVNGSEGDVPEPNSAYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAKRYASAIERNAQDYDALYNWALVLQESADNVSP
DSTTPSKDALLEEACKKYDEATHFCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRGA
IQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDASPNDLYSQSAIYIASAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHEV
LQKLKIGGEQVQSSPSVLGRSGSTLNGGDRTVKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDAWLDAIRLVYTIYARGKNEVLAGILTG