CuGenDBv2

HG10002935 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10002935
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: protein HLB1-like isoform X2
Genome location: Chr11:15578767..15587417
RNA-Seq Expression: HG10002935
Synteny: HG10002935
Gene Ontology terms:
  GO:0006887 - exocytosis (biological process)
  GO:0048768 - root hair cell tip growth (biological process)
  GO:0005769 - early endosome (cellular component)
  GO:0005802 - trans-Golgi network (cellular component)
  GO:0005515 - protein binding (molecular function)
InterPro domains: IPR011990 - Tetratricopeptide-like helical domain superfamily
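
The genome location above uses a `seqid:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this), the span of the locus can be computed as follows. The names `Locus` and `parse_location` are hypothetical helpers introduced only for this illustration.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both end points count toward the span.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a string such as 'Chr11:15578767..15587417'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        # Normalise so that start <= end regardless of how it was written.
        start, end = end, start
    return Locus(seqid, start, end)


locus = parse_location("Chr11:15578767..15587417")
print(locus.seqid, locus.length)  # prints "Chr11 8651"
```

Under that assumption the gene spans 8651 bp on Chr11, which is much longer than the 1650-nt CDS listed under Sequences and would be consistent with an intron-containing gene model.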


Homology
GenBank top hits (e value, % identity, alignment)
XP_004146133.1 protein HLB1 isoform X1 [Cucumis sativus] (e value: 1.9e-292; identity: 94.55%)
Query:  MSPTPEEPNNLQNGIETQPHISPESDQTSEHRSEP-EPTADAIPSSELQQERESESVNNGVADSEPESRRKQLSESIHLQVVTDVTDPRFEEHKETSIPS
        MSPTPEEPNNLQNGIE QPHIS ESDQ +E RS P EPT D+IPSSELQ+ERESESV+NGV DSEPES RKQLSESIHL VVT VTDP  EEHKETS PS
Subjt:  MSPTPEEPNNLQNGIETQPHISPESDQTSEHRSEP-EPTADAIPSSELQQERESESVNNGVADSEPESRRKQLSESIHLQVVTDVTDPRFEEHKETSIPS

Query:  NGNTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS
        NGNTEN QPALRKDEGSRTFTMRELLNGLKGEDG+DSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS
Subjt:  NGNTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS

Query:  AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATQNYEKAVQLNWN
        AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQAT+NYEKAVQLNWN
Subjt:  AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATQNYEKAVQLNWN

Query:  SPQALNNWGLALQELSAIVPAREKQTIVKTAINKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYSV
        SPQALNNWGLALQELSAIVPAREKQTIVKTAI+KFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+GNVKDVSPNELYSQSAIYIAAAHALKPNYSV
Subjt:  SPQALNNWGLALQELSAIVPAREKQTIVKTAINKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYSV

Query:  YSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGNRAIKVEIPDIVSVSACADLTLPPG
        YSSALRLVRSMLPLPYLKVGYLTAPP+GRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNG+R IKVEIPDIVSVSACADLTLPPG
Subjt:  YSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGNRAIKVEIPDIVSVSACADLTLPPG

Query:  AGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG
        AGLCIDTIHGP+FLVADSWD LDGWLDAIRLVYTIYARGKNEVLAGII G
Subjt:  AGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG
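
The alignment blocks in this record follow the usual BLAST pairwise layout: in the middle line, a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. A minimal sketch of how a percent identity can be derived from such a block is shown below. It assumes identity is defined as identical columns divided by total alignment columns (gaps included); whether CuGenDB computes its reported value in exactly this way is an assumption, and `percent_identity` is a hypothetical helper name.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity of one alignment block.

    `query` is the aligned query row (gaps as '-'); `midline` is the row
    between Query and Subjt. Trailing spaces are often stripped from the
    midline in copied text, so it is padded back to the query length.
    """
    columns = len(query)
    if columns == 0:
        raise ValueError("empty alignment block")
    padded = midline.ljust(columns)[:columns]
    # Only letters count as identities; '+' and ' ' do not.
    identical = sum(ch.isalpha() for ch in padded)
    return round(100.0 * identical / columns, 2)


# Toy block (not taken from the record): 6 identical columns out of 8.
print(percent_identity("MSPTPEEP", "MSP PE+P"))  # prints 75.0
```

For a multi-block alignment, the identical-column counts and column counts would be summed over all blocks before dividing, rather than averaging the per-block percentages.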

XP_008448563.1 PREDICTED: uncharacterized protein LOC103490705 isoform X1 [Cucumis melo] (e value: 1.4e-292; identity: 94.55%)
Query:  MSPTPEEPNNLQNGIETQPHISPESDQTSEHRSE-PEPTADAIPSSELQQERESESVNNGVADSEPESRRKQLSESIHLQVVTDVTDPRFEEHKETSIPS
        MSPTPEEPNNLQNGIE QPHIS ESDQ SE RSE  EPTAD+IPSSELQQERESESV+NGVADSEPES RKQLSESIHL VVT VTDP  EEHKETS P 
Subjt:  MSPTPEEPNNLQNGIETQPHISPESDQTSEHRSE-PEPTADAIPSSELQQERESESVNNGVADSEPESRRKQLSESIHLQVVTDVTDPRFEEHKETSIPS

Query:  NGNTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS
        NGNTEN QPALRKDEGSRTFTMRELLNGLKGEDG+D LNESEGERPEGNSG+SLNQDSPHQPYSEQSRAAMELINS+TGVDEEGRSRQRILTFAARRYAS
Subjt:  NGNTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS

Query:  AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATQNYEKAVQLNWN
        AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQAT+NYEKAVQLNWN
Subjt:  AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATQNYEKAVQLNWN

Query:  SPQALNNWGLALQELSAIVPAREKQTIVKTAINKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYSV
        SPQALNNWGLALQELSAIVPAREKQTIVKTAI+KFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN+KDVSPNELYSQSAIYIAAAHALKPNYSV
Subjt:  SPQALNNWGLALQELSAIVPAREKQTIVKTAINKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYSV

Query:  YSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGNRAIKVEIPDIVSVSACADLTLPPG
        YSSALRLVRSMLPLPYLKVGYLTAPP+GRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPS LGRSGSTLNG+R IKVEIPDIVSVSACADLTLPPG
Subjt:  YSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGNRAIKVEIPDIVSVSACADLTLPPG

Query:  AGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG
        AGLCIDTIHGP+FLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGII G
Subjt:  AGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG

XP_022965251.1 protein HLB1-like isoform X1 [Cucurbita maxima] (e value: 6.3e-280; identity: 87.76%)
Query:  MSPTPEEPNNLQNGIETQPHISPESDQTSEHRSEPEPTADAIPSSELQQERESESVN---------------------------NGVADSEPESR----R
        MSP PEEPNNLQNGIE +PHIS ES+Q  E +SEPE TAD +P++ELQQERESESVN                           NGVADSEP+S     R
Subjt:  MSPTPEEPNNLQNGIETQPHISPESDQTSEHRSEPEPTADAIPSSELQQERESESVN---------------------------NGVADSEPESR----R

Query:  KQLSESIHLQVVTDVTDPRFEEHKETSIPSNGNTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAA
        KQLSESI LQVVTDVTDPRFEE K TSI SNG  ENSQPALRKDEGSRTFTMRELLNGLK EDGNDSLNESEGE+PE NSGYSLNQDSPHQPYSEQSRAA
Subjt:  KQLSESIHLQVVTDVTDPRFEEHKETSIPSNGNTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAA

Query:  MELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDR
        MELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDR
Subjt:  MELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDR

Query:  AKMRGRTKEAEELWKQATQNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAINKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG
        AKMRGRTKEAEELWKQAT+NYEKAVQLNWNSPQALNNWGLALQELSAIVPAREK TIVKTAI+KFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG
Subjt:  AKMRGRTKEAEELWKQATQNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAINKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG

Query:  NVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSG
         VKDVSPNELYSQSAIYIAAAHALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPP+GRP APH DWKRSQFFLNHDVLQKLNIGGEQIQTSP++LGRSG
Subjt:  NVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSG

Query:  STLNGNRAIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG
        STLNG+R +KVEIPDIVSVSACADLTLPPGAGLCIDTIHG +FLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG
Subjt:  STLNGNRAIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG

XP_022965252.1 protein HLB1-like isoform X2 [Cucurbita maxima] (e value: 1.8e-282; identity: 92.04%)
Query:  MSPTPEEPNNLQNGIETQPHISPESDQTSEHRSEPEPTADAIPSSELQQERESESVNNGVADSEPESR----RKQLSESIHLQVVTDVTDPRFEEHKETS
        MSP PEEPNNLQNGIE +PHIS ES+Q  E +SEPE TAD IP++ELQQERESESV NGVADSEP+S     RKQLSESI LQVVTDVTDPRFEE K TS
Subjt:  MSPTPEEPNNLQNGIETQPHISPESDQTSEHRSEPEPTADAIPSSELQQERESESVNNGVADSEPESR----RKQLSESIHLQVVTDVTDPRFEEHKETS

Query:  IPSNGNTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARR
        I SNG  ENSQPALRKDEGSRTFTMRELLNGLK EDGNDSLNESEGE+PE NSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARR
Subjt:  IPSNGNTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARR

Query:  YASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATQNYEKAVQL
        YASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQAT+NYEKAVQL
Subjt:  YASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATQNYEKAVQL

Query:  NWNSPQALNNWGLALQELSAIVPAREKQTIVKTAINKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPN
        NWNSPQALNNWGLALQELSAIVPAREK TIVKTAI+KFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG VKDVSPNELYSQSAIYIAAAHALKP+
Subjt:  NWNSPQALNNWGLALQELSAIVPAREKQTIVKTAINKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPN

Query:  YSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGNRAIKVEIPDIVSVSACADLTL
        YSVYSSALRLVRSMLPLPYLKVGYLTAPP+GRP APH DWKRSQFFLNHDVLQKLNIGGEQIQTSP++LGRSGSTLNG+R +KVEIPDIVSVSACADLTL
Subjt:  YSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGNRAIKVEIPDIVSVSACADLTL

Query:  PPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG
        PPGAGLCIDTIHG +FLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG
Subjt:  PPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG

XP_038876586.1 protein HLB1 [Benincasa hispida] (e value: 7.9e-299; identity: 96.54%)
Query:  MSPTPEEPNNLQNGIETQPHISPESDQTSEHRSEPEPTADAIPSSELQQERESESVNNGVADSEPESRRKQLSESIHLQVVTDVTDPRFEEHKETSIPSN
        MSPTPEEPNNLQNGIE QPHISPESDQTSE RSEPEPTADAI SSEL QERESESVNNGVADSEP SRRKQL ESIHLQV TDV DPRFEEHKETSIPSN
Subjt:  MSPTPEEPNNLQNGIETQPHISPESDQTSEHRSEPEPTADAIPSSELQQERESESVNNGVADSEPESRRKQLSESIHLQVVTDVTDPRFEEHKETSIPSN

Query:  GNTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASA
        GNTENS+PALRKDEGSRTFTMRELLNGLKGEDGNDSLNESEGERPEGN GYSLNQDSPHQPYSEQSRAAMELI+SVTGVDEEGRSRQRILTFAARRYASA
Subjt:  GNTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASA

Query:  IERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATQNYEKAVQLNWNS
        IERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQAT+NYEKAVQLNWNS
Subjt:  IERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATQNYEKAVQLNWNS

Query:  PQALNNWGLALQELSAIVPAREKQTIVKTAINKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYSVY
        PQALNNWGLALQELSAIVPAREKQTIVKTAI+KFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYSVY
Subjt:  PQALNNWGLALQELSAIVPAREKQTIVKTAINKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYSVY

Query:  SSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGNRAIKVEIPDIVSVSACADLTLPPGA
        SSALRLVRSMLPLPYLKVGYLTAPP+GRPLAPH DWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNG+  IKVEIPDIVSVSACADLTLPPGA
Subjt:  SSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGNRAIKVEIPDIVSVSACADLTLPPGA

Query:  GLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG
        GLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGII G
Subjt:  GLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L688 Uncharacterized protein (e value: 9.1e-293; identity: 94.55%)
Query:  MSPTPEEPNNLQNGIETQPHISPESDQTSEHRSEP-EPTADAIPSSELQQERESESVNNGVADSEPESRRKQLSESIHLQVVTDVTDPRFEEHKETSIPS
        MSPTPEEPNNLQNGIE QPHIS ESDQ +E RS P EPT D+IPSSELQ+ERESESV+NGV DSEPES RKQLSESIHL VVT VTDP  EEHKETS PS
Subjt:  MSPTPEEPNNLQNGIETQPHISPESDQTSEHRSEP-EPTADAIPSSELQQERESESVNNGVADSEPESRRKQLSESIHLQVVTDVTDPRFEEHKETSIPS

Query:  NGNTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS
        NGNTEN QPALRKDEGSRTFTMRELLNGLKGEDG+DSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS
Subjt:  NGNTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS

Query:  AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATQNYEKAVQLNWN
        AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQAT+NYEKAVQLNWN
Subjt:  AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATQNYEKAVQLNWN

Query:  SPQALNNWGLALQELSAIVPAREKQTIVKTAINKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYSV
        SPQALNNWGLALQELSAIVPAREKQTIVKTAI+KFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+GNVKDVSPNELYSQSAIYIAAAHALKPNYSV
Subjt:  SPQALNNWGLALQELSAIVPAREKQTIVKTAINKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYSV

Query:  YSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGNRAIKVEIPDIVSVSACADLTLPPG
        YSSALRLVRSMLPLPYLKVGYLTAPP+GRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNG+R IKVEIPDIVSVSACADLTLPPG
Subjt:  YSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGNRAIKVEIPDIVSVSACADLTLPPG

Query:  AGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG
        AGLCIDTIHGP+FLVADSWD LDGWLDAIRLVYTIYARGKNEVLAGII G
Subjt:  AGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG

A0A1S3BJC9 uncharacterized protein LOC103490705 isoform X1 (e value: 7.0e-293; identity: 94.55%)
Query:  MSPTPEEPNNLQNGIETQPHISPESDQTSEHRSE-PEPTADAIPSSELQQERESESVNNGVADSEPESRRKQLSESIHLQVVTDVTDPRFEEHKETSIPS
        MSPTPEEPNNLQNGIE QPHIS ESDQ SE RSE  EPTAD+IPSSELQQERESESV+NGVADSEPES RKQLSESIHL VVT VTDP  EEHKETS P 
Subjt:  MSPTPEEPNNLQNGIETQPHISPESDQTSEHRSE-PEPTADAIPSSELQQERESESVNNGVADSEPESRRKQLSESIHLQVVTDVTDPRFEEHKETSIPS

Query:  NGNTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS
        NGNTEN QPALRKDEGSRTFTMRELLNGLKGEDG+D LNESEGERPEGNSG+SLNQDSPHQPYSEQSRAAMELINS+TGVDEEGRSRQRILTFAARRYAS
Subjt:  NGNTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS

Query:  AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATQNYEKAVQLNWN
        AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQAT+NYEKAVQLNWN
Subjt:  AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATQNYEKAVQLNWN

Query:  SPQALNNWGLALQELSAIVPAREKQTIVKTAINKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYSV
        SPQALNNWGLALQELSAIVPAREKQTIVKTAI+KFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN+KDVSPNELYSQSAIYIAAAHALKPNYSV
Subjt:  SPQALNNWGLALQELSAIVPAREKQTIVKTAINKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYSV

Query:  YSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGNRAIKVEIPDIVSVSACADLTLPPG
        YSSALRLVRSMLPLPYLKVGYLTAPP+GRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPS LGRSGSTLNG+R IKVEIPDIVSVSACADLTLPPG
Subjt:  YSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGNRAIKVEIPDIVSVSACADLTLPPG

Query:  AGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG
        AGLCIDTIHGP+FLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGII G
Subjt:  AGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG

A0A6J1EA05 protein HLB1-like (e value: 9.8e-279; identity: 87.41%)
Query:  MSPTPEEPNNLQNGIETQPHISPESDQTSEHRSEPEPTADAIPSSELQQERESESVN---------------------------NGVADSEPESR----R
        MSPTPEEPNNLQNGIE +PHIS ES+Q  E +SEPE TAD +P++ELQQERE ESVN                           NGVADSE +S     R
Subjt:  MSPTPEEPNNLQNGIETQPHISPESDQTSEHRSEPEPTADAIPSSELQQERESESVN---------------------------NGVADSEPESR----R

Query:  KQLSESIHLQVVTDVTDPRFEEHKETSIPSNGNTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAA
        KQLSESI LQV TDV DPRFEE K TSI SNG TENSQPALRKDEGSRTFTMRELLNGLK EDGNDSLNESEGE+PE NSGYSLNQDSPHQPYSEQSRAA
Subjt:  KQLSESIHLQVVTDVTDPRFEEHKETSIPSNGNTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAA

Query:  MELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDR
        MELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDR
Subjt:  MELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDR

Query:  AKMRGRTKEAEELWKQATQNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAINKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG
        AKMRGRTKEAEELWKQAT+NYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAI+KFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG
Subjt:  AKMRGRTKEAEELWKQATQNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAINKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG

Query:  NVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSG
         VKDVSPNELYSQSAIYIAAAHALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPP+GRP APH DWKRSQFFLNHDVLQKLNIGGEQ QTSP++LGRSG
Subjt:  NVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSG

Query:  STLNGNRAIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG
        STLNG+R +KVEIPDIVSVSACADLTLPPGAGLCIDTIHG +FLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG
Subjt:  STLNGNRAIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG

A0A6J1HJU5 protein HLB1-like isoform X2 (e value: 8.5e-283; identity: 92.04%)
Query:  MSPTPEEPNNLQNGIETQPHISPESDQTSEHRSEPEPTADAIPSSELQQERESESVNNGVADSEPESR----RKQLSESIHLQVVTDVTDPRFEEHKETS
        MSP PEEPNNLQNGIE +PHIS ES+Q  E +SEPE TAD IP++ELQQERESESV NGVADSEP+S     RKQLSESI LQVVTDVTDPRFEE K TS
Subjt:  MSPTPEEPNNLQNGIETQPHISPESDQTSEHRSEPEPTADAIPSSELQQERESESVNNGVADSEPESR----RKQLSESIHLQVVTDVTDPRFEEHKETS

Query:  IPSNGNTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARR
        I SNG  ENSQPALRKDEGSRTFTMRELLNGLK EDGNDSLNESEGE+PE NSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARR
Subjt:  IPSNGNTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARR

Query:  YASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATQNYEKAVQL
        YASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQAT+NYEKAVQL
Subjt:  YASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATQNYEKAVQL

Query:  NWNSPQALNNWGLALQELSAIVPAREKQTIVKTAINKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPN
        NWNSPQALNNWGLALQELSAIVPAREK TIVKTAI+KFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG VKDVSPNELYSQSAIYIAAAHALKP+
Subjt:  NWNSPQALNNWGLALQELSAIVPAREKQTIVKTAINKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPN

Query:  YSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGNRAIKVEIPDIVSVSACADLTL
        YSVYSSALRLVRSMLPLPYLKVGYLTAPP+GRP APH DWKRSQFFLNHDVLQKLNIGGEQIQTSP++LGRSGSTLNG+R +KVEIPDIVSVSACADLTL
Subjt:  YSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGNRAIKVEIPDIVSVSACADLTL

Query:  PPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG
        PPGAGLCIDTIHG +FLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG
Subjt:  PPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG

A0A6J1HL68 protein HLB1-like isoform X1 (e value: 3.0e-280; identity: 87.76%)
Query:  MSPTPEEPNNLQNGIETQPHISPESDQTSEHRSEPEPTADAIPSSELQQERESESVN---------------------------NGVADSEPESR----R
        MSP PEEPNNLQNGIE +PHIS ES+Q  E +SEPE TAD +P++ELQQERESESVN                           NGVADSEP+S     R
Subjt:  MSPTPEEPNNLQNGIETQPHISPESDQTSEHRSEPEPTADAIPSSELQQERESESVN---------------------------NGVADSEPESR----R

Query:  KQLSESIHLQVVTDVTDPRFEEHKETSIPSNGNTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAA
        KQLSESI LQVVTDVTDPRFEE K TSI SNG  ENSQPALRKDEGSRTFTMRELLNGLK EDGNDSLNESEGE+PE NSGYSLNQDSPHQPYSEQSRAA
Subjt:  KQLSESIHLQVVTDVTDPRFEEHKETSIPSNGNTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAA

Query:  MELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDR
        MELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDR
Subjt:  MELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDR

Query:  AKMRGRTKEAEELWKQATQNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAINKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG
        AKMRGRTKEAEELWKQAT+NYEKAVQLNWNSPQALNNWGLALQELSAIVPAREK TIVKTAI+KFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG
Subjt:  AKMRGRTKEAEELWKQATQNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAINKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG

Query:  NVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSG
         VKDVSPNELYSQSAIYIAAAHALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPP+GRP APH DWKRSQFFLNHDVLQKLNIGGEQIQTSP++LGRSG
Subjt:  NVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSG

Query:  STLNGNRAIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG
        STLNG+R +KVEIPDIVSVSACADLTLPPGAGLCIDTIHG +FLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG
Subjt:  STLNGNRAIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG

SwissProt top hits (e value, % identity, alignment)
Q9FHY8 Protein HLB1 (e value: 1.3e-179; identity: 62.05%)
Query:  MSPTPEEPNNLQNGI-----ETQPHISPESDQTSEHR---SEPEPTADAIPS-----------SELQQERESESVNNGVADSEPESRRKQLSESIHLQVV
        M+ T EEP  LQNG      ET+ +  PE    +E +     PE  AD  P             E+Q E + E V   V D++PE  + ++       VV
Subjt:  MSPTPEEPNNLQNGI-----ETQPHISPESDQTSEHR---SEPEPTADAIPS-----------SELQQERESESVNNGVADSEPESRRKQLSESIHLQVV

Query:  T----DVTDPRFEEHKETSIP---SNGNTENSQPALRK-DEGSRTFTMRELLNGLKGEDGNDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELI
        T    D+TD          IP   +    E++   L+K D+G++TFTMRELL+ LK E         EG+    +S    +++S  QP   ++  AM+LI
Subjt:  T----DVTDPRFEEHKETSIP---SNGNTENSQPALRK-DEGSRTFTMRELLNGLKGEDGNDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELI

Query:  NSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMR
        N +   DEEGRSRQR+L FAAR+YASAIERN  D+DALYNWAL+LQESADNVSPDS SPSKD LLEEACKKYDEATRLCPTL+DA+YNWAIAISDRAK+R
Subjt:  NSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMR

Query:  GRTKEAEELWKQATQNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAINKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKD
        GRTKEAEELW+QA  NYEKAVQLNWNS QALNNWGL LQELS IVPAREK+ +V+TAI+KFRAAI+LQFDFHRAIYNLGTVLYGLAEDTLRTGG+GN KD
Subjt:  GRTKEAEELWKQATQNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAINKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKD

Query:  VSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHD-VLQKLNIGGEQIQTSPSILGRSGSTL
        + P ELYSQSAIYIAAAH+LKP+YSVYSSALRLVRSMLPLP+LKVGYLTAPP+G  LAPHSDWKR++F LNH+ +LQ L     ++  + S    + ST 
Subjt:  VSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHD-VLQKLNIGGEQIQTSPSILGRSGSTL

Query:  NGNRAIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG
           + +KV I +IVSV+ CADLTLPPGAGLCIDTIHGPVFLVADSW++LDGWLDAIRLVYTIYARGK++VLAGII G
Subjt:  NGNRAIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG

Arabidopsis top hits (e value, % identity, alignment)
AT5G41950.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 9.3e-181; identity: 62.05%)
Query:  MSPTPEEPNNLQNGI-----ETQPHISPESDQTSEHR---SEPEPTADAIPS-----------SELQQERESESVNNGVADSEPESRRKQLSESIHLQVV
        M+ T EEP  LQNG      ET+ +  PE    +E +     PE  AD  P             E+Q E + E V   V D++PE  + ++       VV
Subjt:  MSPTPEEPNNLQNGI-----ETQPHISPESDQTSEHR---SEPEPTADAIPS-----------SELQQERESESVNNGVADSEPESRRKQLSESIHLQVV

Query:  T----DVTDPRFEEHKETSIP---SNGNTENSQPALRK-DEGSRTFTMRELLNGLKGEDGNDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELI
        T    D+TD          IP   +    E++   L+K D+G++TFTMRELL+ LK E         EG+    +S    +++S  QP   ++  AM+LI
Subjt:  T----DVTDPRFEEHKETSIP---SNGNTENSQPALRK-DEGSRTFTMRELLNGLKGEDGNDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELI

Query:  NSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMR
        N +   DEEGRSRQR+L FAAR+YASAIERN  D+DALYNWAL+LQESADNVSPDS SPSKD LLEEACKKYDEATRLCPTL+DA+YNWAIAISDRAK+R
Subjt:  NSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMR

Query:  GRTKEAEELWKQATQNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAINKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKD
        GRTKEAEELW+QA  NYEKAVQLNWNS QALNNWGL LQELS IVPAREK+ +V+TAI+KFRAAI+LQFDFHRAIYNLGTVLYGLAEDTLRTGG+GN KD
Subjt:  GRTKEAEELWKQATQNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAINKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKD

Query:  VSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHD-VLQKLNIGGEQIQTSPSILGRSGSTL
        + P ELYSQSAIYIAAAH+LKP+YSVYSSALRLVRSMLPLP+LKVGYLTAPP+G  LAPHSDWKR++F LNH+ +LQ L     ++  + S    + ST 
Subjt:  VSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQFFLNHD-VLQKLNIGGEQIQTSPSILGRSGSTL

Query:  NGNRAIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG
           + +KV I +IVSV+ CADLTLPPGAGLCIDTIHGPVFLVADSW++LDGWLDAIRLVYTIYARGK++VLAGII G
Subjt:  NGNRAIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG


Sequences
CDS sequence
ATGTCGCCTACTCCCGAGGAACCTAATAATTTGCAGAACGGAATCGAAACCCAACCACACATTTCGCCAGAATCAGATCAGACTAGTGAGCACAGATCAGAGCCAGAACC
CACAGCAGATGCAATTCCCAGTTCTGAATTACAACAAGAACGCGAATCGGAATCAGTTAATAATGGAGTAGCAGATTCGGAGCCGGAGTCTCGAAGGAAACAGTTATCGG
AGTCCATCCATTTACAGGTAGTGACGGATGTTACAGATCCGAGGTTTGAAGAGCACAAAGAAACCTCCATCCCATCCAACGGCAACACCGAGAACTCGCAACCTGCGTTG
CGTAAAGACGAAGGAAGCCGAACATTTACAATGAGAGAGTTGCTGAATGGATTGAAAGGTGAAGATGGTAACGACAGCCTTAACGAATCTGAAGGCGAGAGGCCCGAGGG
GAACTCCGGTTACAGTCTTAATCAAGATAGCCCACATCAGCCTTATTCTGAACAGAGCAGAGCTGCCATGGAGTTAATCAACAGTGTTACAGGTGTCGATGAAGAGGGTC
GTTCTCGCCAAAGGATTCTCACATTTGCTGCTAGGAGGTATGCTAGTGCAATTGAGAGAAATGGTCAAGACTATGACGCTCTATACAATTGGGCTTTGGTCCTCCAGGAG
AGTGCAGATAATGTTAGTCCAGATTCCACTTCACCTTCTAAAGATGCATTGCTTGAGGAGGCTTGTAAAAAGTATGATGAAGCTACCCGTCTTTGCCCAACACTTCATGA
TGCTTTTTATAATTGGGCTATTGCAATCTCTGATCGGGCCAAAATGCGTGGTCGTACAAAGGAGGCCGAAGAACTGTGGAAGCAGGCTACCCAAAATTATGAAAAAGCTG
TCCAACTCAACTGGAATAGTCCCCAGGCACTAAATAATTGGGGGCTTGCTCTCCAGGAACTCAGTGCAATTGTGCCAGCACGAGAAAAGCAGACAATTGTAAAAACAGCT
ATCAATAAGTTCCGTGCTGCTATACAGTTGCAATTTGATTTTCATCGAGCAATCTACAACCTTGGTACTGTTCTGTATGGATTAGCTGAGGACACATTAAGGACTGGTGG
AACGGGAAATGTTAAGGATGTTTCCCCTAATGAGTTATACAGCCAATCTGCAATTTATATTGCAGCTGCTCATGCTCTAAAACCAAATTACTCTGTTTACAGCAGTGCCT
TGCGATTGGTCCGTTCGATGCTGCCGTTACCCTATCTAAAAGTTGGATACTTGACTGCACCTCCTCTGGGGAGACCACTGGCTCCTCACAGTGATTGGAAACGTTCACAA
TTTTTTCTAAATCATGATGTATTGCAAAAGCTTAACATAGGAGGGGAACAAATACAAACATCCCCTAGTATTTTAGGAAGATCTGGAAGTACCTTGAATGGCAACAGGGC
AATCAAAGTAGAAATCCCTGATATTGTCTCTGTATCAGCATGTGCAGATTTAACTTTACCACCTGGTGCTGGACTCTGCATTGACACAATCCATGGACCAGTTTTCTTGG
TTGCTGACTCATGGGACGCACTCGATGGATGGCTCGATGCAATTAGATTAGTTTACACGATCTACGCTCGAGGCAAAAACGAGGTTTTGGCTGGCATCATAGCAGGTTGA
mRNA sequence
ATGTCGCCTACTCCCGAGGAACCTAATAATTTGCAGAACGGAATCGAAACCCAACCACACATTTCGCCAGAATCAGATCAGACTAGTGAGCACAGATCAGAGCCAGAACC
CACAGCAGATGCAATTCCCAGTTCTGAATTACAACAAGAACGCGAATCGGAATCAGTTAATAATGGAGTAGCAGATTCGGAGCCGGAGTCTCGAAGGAAACAGTTATCGG
AGTCCATCCATTTACAGGTAGTGACGGATGTTACAGATCCGAGGTTTGAAGAGCACAAAGAAACCTCCATCCCATCCAACGGCAACACCGAGAACTCGCAACCTGCGTTG
CGTAAAGACGAAGGAAGCCGAACATTTACAATGAGAGAGTTGCTGAATGGATTGAAAGGTGAAGATGGTAACGACAGCCTTAACGAATCTGAAGGCGAGAGGCCCGAGGG
GAACTCCGGTTACAGTCTTAATCAAGATAGCCCACATCAGCCTTATTCTGAACAGAGCAGAGCTGCCATGGAGTTAATCAACAGTGTTACAGGTGTCGATGAAGAGGGTC
GTTCTCGCCAAAGGATTCTCACATTTGCTGCTAGGAGGTATGCTAGTGCAATTGAGAGAAATGGTCAAGACTATGACGCTCTATACAATTGGGCTTTGGTCCTCCAGGAG
AGTGCAGATAATGTTAGTCCAGATTCCACTTCACCTTCTAAAGATGCATTGCTTGAGGAGGCTTGTAAAAAGTATGATGAAGCTACCCGTCTTTGCCCAACACTTCATGA
TGCTTTTTATAATTGGGCTATTGCAATCTCTGATCGGGCCAAAATGCGTGGTCGTACAAAGGAGGCCGAAGAACTGTGGAAGCAGGCTACCCAAAATTATGAAAAAGCTG
TCCAACTCAACTGGAATAGTCCCCAGGCACTAAATAATTGGGGGCTTGCTCTCCAGGAACTCAGTGCAATTGTGCCAGCACGAGAAAAGCAGACAATTGTAAAAACAGCT
ATCAATAAGTTCCGTGCTGCTATACAGTTGCAATTTGATTTTCATCGAGCAATCTACAACCTTGGTACTGTTCTGTATGGATTAGCTGAGGACACATTAAGGACTGGTGG
AACGGGAAATGTTAAGGATGTTTCCCCTAATGAGTTATACAGCCAATCTGCAATTTATATTGCAGCTGCTCATGCTCTAAAACCAAATTACTCTGTTTACAGCAGTGCCT
TGCGATTGGTCCGTTCGATGCTGCCGTTACCCTATCTAAAAGTTGGATACTTGACTGCACCTCCTCTGGGGAGACCACTGGCTCCTCACAGTGATTGGAAACGTTCACAA
TTTTTTCTAAATCATGATGTATTGCAAAAGCTTAACATAGGAGGGGAACAAATACAAACATCCCCTAGTATTTTAGGAAGATCTGGAAGTACCTTGAATGGCAACAGGGC
AATCAAAGTAGAAATCCCTGATATTGTCTCTGTATCAGCATGTGCAGATTTAACTTTACCACCTGGTGCTGGACTCTGCATTGACACAATCCATGGACCAGTTTTCTTGG
TTGCTGACTCATGGGACGCACTCGATGGATGGCTCGATGCAATTAGATTAGTTTACACGATCTACGCTCGAGGCAAAAACGAGGTTTTGGCTGGCATCATAGCAGGTTGA
Protein sequence
MSPTPEEPNNLQNGIETQPHISPESDQTSEHRSEPEPTADAIPSSELQQERESESVNNGVADSEPESRRKQLSESIHLQVVTDVTDPRFEEHKETSIPSNGNTENSQPAL
RKDEGSRTFTMRELLNGLKGEDGNDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQE
SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATQNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTA
INKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPLGRPLAPHSDWKRSQ
FFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGNRAIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIIAG
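
The CDS and protein sequences above can be cross-checked against each other by translating the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame at its first base; `translate` is a hypothetical helper written for this illustration, and only the first and last few codons of the CDS are used as inputs.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks, normalise case
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGTCGCCTACTCCCGAGGAACCT"))  # prints "MSPTPEEP"
# Last five codons of the CDS, including the TGA stop.
print(translate("ATCATAGCAGGTTGA"))           # prints "IIAG"
```

Both fragments match the start (`MSPTPEEP`) and the end (`IIAG`) of the listed protein sequence. Passing the full CDS text (line breaks included) to `translate` and comparing the result with the full protein sequence would extend the same check to the whole record.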