CuGenDBv2

Chy6G127460 (gene) of Cucumber (hystrix) v1 genome

Gene ID: Chy6G127460
Organism: Cucumis hystrix (Cucumber (hystrix) v1)
Description: protein HLB1-like isoform X2
Genome location: chrH06:26347246..26354795
RNA-Seq Expression: Chy6G127460
Synteny: Chy6G127460
Gene Ontology terms:
GO:0006887 - exocytosis (biological process)
GO:0048768 - root hair cell tip growth (biological process)
GO:0005769 - early endosome (cellular component)
GO:0005802 - trans-Golgi network (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domains: IPR011990 - Tetratricopeptide-like helical domain superfamily
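
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, it can be split into its parts and the gene span computed as follows. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chrH06:26347246..26354795")
# Assumption: coordinates are 1-based and inclusive (not stated on the page).
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 7550 bp on chrH06. This is longer than the 1,098-nucleotide CDS listed further down and would be consistent with introns, although the page does not show the gene structure.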


Homology
GenBank top hits (e value, % identity, alignment)
XP_004146133.1 protein HLB1 isoform X1 [Cucumis sativus] (e value: 0.0; % identity: 99.09)
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPS
        MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQ+ERESESVSNGV DSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPS
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPS

Query:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS
        NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS
Subjt:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS

Query:  AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
        AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
Subjt:  AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN

Query:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSV
        SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSV
Subjt:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSV

Query:  YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVQIPDIVSVSACADLTLPPG
        YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKV+IPDIVSVSACADLTLPPG
Subjt:  YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVQIPDIVSVSACADLTLPPG

Query:  AGLCIDTIHGPIFLVADSWDALDGWLDAVRLVYTIYARGKNEVLAGIITG
        AGLCIDTIHGPIFLVADSWD LDGWLDA+RLVYTIYARGKNEVLAGIITG
Subjt:  AGLCIDTIHGPIFLVADSWDALDGWLDAVRLVYTIYARGKNEVLAGIITG

XP_008448563.1 PREDICTED: uncharacterized protein LOC103490705 isoform X1 [Cucumis melo] (e value: 0.0; % identity: 97.64)
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPS
        MSPTPEEPNNLQNGIEIQPHISSESDQI+EPRS  EEPT DSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTP 
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPS

Query:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS
        NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSD LNESEGERPEGNSG+SLNQDSPHQPYSEQSRAAMELINS+TGVDEEGRSRQRILTFAARRYAS
Subjt:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS

Query:  AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
        AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
Subjt:  AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN

Query:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSV
        SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+GN+KDVSPNELYSQSAIYIAAAHALKPNYSV
Subjt:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSV

Query:  YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVQIPDIVSVSACADLTLPPG
        YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPS LGRSGSTLNGDRTIKV+IPDIVSVSACADLTLPPG
Subjt:  YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVQIPDIVSVSACADLTLPPG

Query:  AGLCIDTIHGPIFLVADSWDALDGWLDAVRLVYTIYARGKNEVLAGIITG
        AGLCIDTIHGPIFLVADSWDALDGWLDA+RLVYTIYARGKNEVLAGIITG
Subjt:  AGLCIDTIHGPIFLVADSWDALDGWLDAVRLVYTIYARGKNEVLAGIITG

XP_022965252.1 protein HLB1-like isoform X2 [Cucurbita maxima] (e value: 0.0; % identity: 90.79)
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQQERESESVSNGVADSEPES----PRKQLSESIHLHVVTGVTDPSVEEHKET
        MSP PEEPNNLQNGIEI+PHIS ES+QI E +S PE  T D IP++ELQQERESESV NGVADSEP+S    PRKQLSESI L VVT VTDP  EE K T
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQQERESESVSNGVADSEPES----PRKQLSESIHLHVVTGVTDPSVEEHKET

Query:  STPSNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAR
        S  SNG  EN QPALRKDEGSRTFTMRELLNGLK EDG+DSLNESEGE+PE NSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAR
Subjt:  STPSNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAR

Query:  RYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQ
        RYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQAT+NYEKAVQ
Subjt:  RYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQ

Query:  LNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKP
        LNWNSPQALNNWGLALQELSAIVPAREK TIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+G VKDVSPNELYSQSAIYIAAAHALKP
Subjt:  LNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKP

Query:  NYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVQIPDIVSVSACADLT
        +YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDVLQKLNIGGEQIQTSP++LGRSGSTLNGDRT+KV+IPDIVSVSACADLT
Subjt:  NYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVQIPDIVSVSACADLT

Query:  LPPGAGLCIDTIHGPIFLVADSWDALDGWLDAVRLVYTIYARGKNEVLAGIITG
        LPPGAGLCIDTIHG IFLVADSWDALDGWLDA+RLVYTIYARGKNEVLAGII G
Subjt:  LPPGAGLCIDTIHGPIFLVADSWDALDGWLDAVRLVYTIYARGKNEVLAGIITG

XP_023552571.1 protein HLB1-like [Cucurbita pepo subsp. pepo] (e value: 0.0; % identity: 86.4)
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQQERESESVS---------------------------NGVADSEPES----P
        MSPTPEEPNNLQNGIEI+ HIS ES+QI E +S PE  T D +P++ELQQER+SESV+                           NG ADSEP+S    P
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQQERESESVS---------------------------NGVADSEPES----P

Query:  RKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRA
        RKQLSESI L VVT VTDP  EE K TS  SNG TEN QPALRKDEGSRTFTMRELLNGLK EDG+DSLNESEGE+PE NSGYSLNQDSPHQPYSEQSRA
Subjt:  RKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRA

Query:  AMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISD
        AMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISD
Subjt:  AMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISD

Query:  RAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGS
        RAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+
Subjt:  RAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGS

Query:  GNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRS
        G VKDVSPNELYSQSAIYIAAAHALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDVLQKLNIGGEQ QTSP++LGRS
Subjt:  GNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRS

Query:  GSTLNGDRTIKVQIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAVRLVYTIYARGKNEVLAGIITG
        GSTLNGDRT+KV+IPDIVSVSACADLTLPPGAGLCIDTIHG IFLVADSWDALDGWLDA+RLVYTIYARGKNEVLAGII G
Subjt:  GSTLNGDRTIKVQIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAVRLVYTIYARGKNEVLAGIITG

XP_038876586.1 protein HLB1 [Benincasa hispida] (e value: 0.0; % identity: 94.18)
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPS
        MSPTPEEPNNLQNGIEIQPHIS ESDQ +EPRS PE PT D+I SSEL QERESESV+NGVADSEP S RKQL ESIHL V T V DP  EEHKETS PS
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPS

Query:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS
        NGNTEN +PALRKDEGSRTFTMRELLNGLKGEDG+DSLNESEGERPEGN GYSLNQDSPHQPYSEQSRAAMELI+SVTGVDEEGRSRQRILTFAARRYAS
Subjt:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS

Query:  AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
        AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
Subjt:  AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN

Query:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSV
        SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+GNVKDVSPNELYSQSAIYIAAAHALKPNYSV
Subjt:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSV

Query:  YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVQIPDIVSVSACADLTLPPG
        YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPH DWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGD TIKV+IPDIVSVSACADLTLPPG
Subjt:  YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVQIPDIVSVSACADLTLPPG

Query:  AGLCIDTIHGPIFLVADSWDALDGWLDAVRLVYTIYARGKNEVLAGIITG
        AGLCIDTIHGP+FLVADSWDALDGWLDA+RLVYTIYARGKNEVLAGIITG
Subjt:  AGLCIDTIHGPIFLVADSWDALDGWLDAVRLVYTIYARGKNEVLAGIITG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L688 Uncharacterized protein (e value: 0.0e+00; % identity: 99.09)
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPS
        MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQ+ERESESVSNGV DSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPS
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPS

Query:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS
        NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS
Subjt:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS

Query:  AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
        AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
Subjt:  AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN

Query:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSV
        SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSV
Subjt:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSV

Query:  YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVQIPDIVSVSACADLTLPPG
        YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKV+IPDIVSVSACADLTLPPG
Subjt:  YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVQIPDIVSVSACADLTLPPG

Query:  AGLCIDTIHGPIFLVADSWDALDGWLDAVRLVYTIYARGKNEVLAGIITG
        AGLCIDTIHGPIFLVADSWD LDGWLDA+RLVYTIYARGKNEVLAGIITG
Subjt:  AGLCIDTIHGPIFLVADSWDALDGWLDAVRLVYTIYARGKNEVLAGIITG

A0A1S3BJC9 uncharacterized protein LOC103490705 isoform X1 (e value: 2.5e-306; % identity: 97.64)
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPS
        MSPTPEEPNNLQNGIEIQPHISSESDQI+EPRS  EEPT DSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTP 
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPS

Query:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS
        NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSD LNESEGERPEGNSG+SLNQDSPHQPYSEQSRAAMELINS+TGVDEEGRSRQRILTFAARRYAS
Subjt:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS

Query:  AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
        AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
Subjt:  AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN

Query:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSV
        SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+GN+KDVSPNELYSQSAIYIAAAHALKPNYSV
Subjt:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSV

Query:  YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVQIPDIVSVSACADLTLPPG
        YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPS LGRSGSTLNGDRTIKV+IPDIVSVSACADLTLPPG
Subjt:  YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVQIPDIVSVSACADLTLPPG

Query:  AGLCIDTIHGPIFLVADSWDALDGWLDAVRLVYTIYARGKNEVLAGIITG
        AGLCIDTIHGPIFLVADSWDALDGWLDA+RLVYTIYARGKNEVLAGIITG
Subjt:  AGLCIDTIHGPIFLVADSWDALDGWLDAVRLVYTIYARGKNEVLAGIITG

A0A6J1EA05 protein HLB1-like (e value: 3.3e-274; % identity: 86.23)
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQQERESESVS---------------------------NGVAD----SEPESP
        MSPTPEEPNNLQNGIEI+PHIS ES+QI E +S PE  T D +P++ELQQERE ESV+                           NGVAD    SE +SP
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQQERESESVS---------------------------NGVAD----SEPESP

Query:  RKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRA
        RKQLSESI L V T V DP  EE K TS  SNG TEN QPALRKDEGSRTFTMRELLNGLK EDG+DSLNESEGE+PE NSGYSLNQDSPHQPYSEQSRA
Subjt:  RKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRA

Query:  AMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISD
        AMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISD
Subjt:  AMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISD

Query:  RAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGS
        RAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+
Subjt:  RAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGS

Query:  GNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRS
        G VKDVSPNELYSQSAIYIAAAHALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDVLQKLNIGGEQ QTSP++LGRS
Subjt:  GNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRS

Query:  GSTLNGDRTIKVQIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAVRLVYTIYARGKNEVLAGIITG
        GSTLNGDRT+KV+IPDIVSVSACADLTLPPGAGLCIDTIHG IFLVADSWDALDGWLDA+RLVYTIYARGKNEVLAGII G
Subjt:  GSTLNGDRTIKVQIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAVRLVYTIYARGKNEVLAGIITG

A0A6J1HJU5 protein HLB1-like isoform X2 (e value: 2.2e-278; % identity: 90.79)
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQQERESESVSNGVADSEP----ESPRKQLSESIHLHVVTGVTDPSVEEHKET
        MSP PEEPNNLQNGIEI+PHIS ES+QI E +S PE  T D IP++ELQQERESESV NGVADSEP    +SPRKQLSESI L VVT VTDP  EE K T
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQQERESESVSNGVADSEP----ESPRKQLSESIHLHVVTGVTDPSVEEHKET

Query:  STPSNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAR
        S  SNG  EN QPALRKDEGSRTFTMRELLNGLK EDG+DSLNESEGE+PE NSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAR
Subjt:  STPSNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAR

Query:  RYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQ
        RYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQAT+NYEKAVQ
Subjt:  RYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQ

Query:  LNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKP
        LNWNSPQALNNWGLALQELSAIVPAREK TIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+G VKDVSPNELYSQSAIYIAAAHALKP
Subjt:  LNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKP

Query:  NYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVQIPDIVSVSACADLT
        +YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDVLQKLNIGGEQIQTSP++LGRSGSTLNGDRT+KV+IPDIVSVSACADLT
Subjt:  NYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVQIPDIVSVSACADLT

Query:  LPPGAGLCIDTIHGPIFLVADSWDALDGWLDAVRLVYTIYARGKNEVLAGIITG
        LPPGAGLCIDTIHG IFLVADSWDALDGWLDA+RLVYTIYARGKNEVLAGII G
Subjt:  LPPGAGLCIDTIHGPIFLVADSWDALDGWLDAVRLVYTIYARGKNEVLAGIITG

A0A6J1HL68 protein HLB1-like isoform X1 (e value: 2.3e-275; % identity: 86.4)
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQQERESESVS---------------------------NGVADSEP----ESP
        MSP PEEPNNLQNGIEI+PHIS ES+QI E +S PE  T D +P++ELQQERESESV+                           NGVADSEP    +SP
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQQERESESVS---------------------------NGVADSEP----ESP

Query:  RKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRA
        RKQLSESI L VVT VTDP  EE K TS  SNG  EN QPALRKDEGSRTFTMRELLNGLK EDG+DSLNESEGE+PE NSGYSLNQDSPHQPYSEQSRA
Subjt:  RKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRA

Query:  AMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISD
        AMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISD
Subjt:  AMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISD

Query:  RAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGS
        RAKMRGRTKEAEELWKQAT+NYEKAVQLNWNSPQALNNWGLALQELSAIVPAREK TIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+
Subjt:  RAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGS

Query:  GNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRS
        G VKDVSPNELYSQSAIYIAAAHALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDVLQKLNIGGEQIQTSP++LGRS
Subjt:  GNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRS

Query:  GSTLNGDRTIKVQIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAVRLVYTIYARGKNEVLAGIITG
        GSTLNGDRT+KV+IPDIVSVSACADLTLPPGAGLCIDTIHG IFLVADSWDALDGWLDA+RLVYTIYARGKNEVLAGII G
Subjt:  GSTLNGDRTIKVQIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAVRLVYTIYARGKNEVLAGIITG

SwissProt top hits (e value, % identity, alignment)
Q9FHY8 Protein HLB1 (e value: 1.0e-179; % identity: 61.59)
Query:  MSPTPEEPNNLQNG-----------------IEIQPHISSESDQITEPRSGPEE--PTVDSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHV
        M+ T EEP  LQNG                 ++ +P ++ E  +I E    PEE    V      E+Q E + E V   V D++PE  + ++       V
Subjt:  MSPTPEEPNNLQNG-----------------IEIQPHISSESDQITEPRSGPEE--PTVDSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHV

Query:  VT----GVTDPSVEEHKETSTP---SNGNTENLQPALRK-DEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMEL
        VT     +TD  +        P   +    E+    L+K D+G++TFTMRELL+ LK E         EG+    +S    +++S  QP   ++  AM+L
Subjt:  VT----GVTDPSVEEHKETSTP---SNGNTENLQPALRK-DEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMEL

Query:  INSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKM
        IN +   DEEGRSRQR+L FAAR+YASAIERN  D+DALYNWAL+LQESADNVSPDS SPSKD LLEEACKKYDEAT LCPTL+DA+YNWAIAISDRAK+
Subjt:  INSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKM

Query:  RGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVK
        RGRTKEAEELW+QA  NYEKAVQLNWNS QALNNWGL LQELS IVPAREK+ +V+TAISKFRAAI+LQFDFHRAIYNLGTVLYGLAEDTLRTGGSGN K
Subjt:  RGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVK

Query:  DVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHD-VLQKLNIGGEQIQTSPSILGRSGST
        D+ P ELYSQSAIYIAAAH+LKP+YSVYSSALRLVRSMLPLP+LKVGYLTAPPVG  LAPHSDWKR++F LNH+ +LQ L     ++  + S    + ST
Subjt:  DVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHD-VLQKLNIGGEQIQTSPSILGRSGST

Query:  LNGDRTIKVQIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAVRLVYTIYARGKNEVLAGIITG
            +T+KV I +IVSV+ CADLTLPPGAGLCIDTIHGP+FLVADSW++LDGWLDA+RLVYTIYARGK++VLAGIITG
Subjt:  LNGDRTIKVQIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAVRLVYTIYARGKNEVLAGIITG

Arabidopsis top hits (e value, % identity, alignment)
AT5G41950.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 7.1e-181; % identity: 61.59)
Query:  MSPTPEEPNNLQNG-----------------IEIQPHISSESDQITEPRSGPEE--PTVDSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHV
        M+ T EEP  LQNG                 ++ +P ++ E  +I E    PEE    V      E+Q E + E V   V D++PE  + ++       V
Subjt:  MSPTPEEPNNLQNG-----------------IEIQPHISSESDQITEPRSGPEE--PTVDSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHV

Query:  VT----GVTDPSVEEHKETSTP---SNGNTENLQPALRK-DEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMEL
        VT     +TD  +        P   +    E+    L+K D+G++TFTMRELL+ LK E         EG+    +S    +++S  QP   ++  AM+L
Subjt:  VT----GVTDPSVEEHKETSTP---SNGNTENLQPALRK-DEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMEL

Query:  INSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKM
        IN +   DEEGRSRQR+L FAAR+YASAIERN  D+DALYNWAL+LQESADNVSPDS SPSKD LLEEACKKYDEAT LCPTL+DA+YNWAIAISDRAK+
Subjt:  INSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKM

Query:  RGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVK
        RGRTKEAEELW+QA  NYEKAVQLNWNS QALNNWGL LQELS IVPAREK+ +V+TAISKFRAAI+LQFDFHRAIYNLGTVLYGLAEDTLRTGGSGN K
Subjt:  RGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVK

Query:  DVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHD-VLQKLNIGGEQIQTSPSILGRSGST
        D+ P ELYSQSAIYIAAAH+LKP+YSVYSSALRLVRSMLPLP+LKVGYLTAPPVG  LAPHSDWKR++F LNH+ +LQ L     ++  + S    + ST
Subjt:  DVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHD-VLQKLNIGGEQIQTSPSILGRSGST

Query:  LNGDRTIKVQIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAVRLVYTIYARGKNEVLAGIITG
            +T+KV I +IVSV+ CADLTLPPGAGLCIDTIHGP+FLVADSW++LDGWLDA+RLVYTIYARGK++VLAGIITG
Subjt:  LNGDRTIKVQIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAVRLVYTIYARGKNEVLAGIITG
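
In the alignments above, the unlabeled middle line of each block marks identical residues with a letter, conservative substitutions with `+`, and other mismatches with a space. The following is a minimal sketch of how one block could be tallied from that middle line. The helper name `block_stats` is hypothetical, and the reading of the midline symbols follows the usual BLAST convention, which this page is assumed to use.

```python
def block_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment block.

    Assumes BLAST-style midline symbols: a letter marks an identical
    residue, '+' a conservative substitution, a space any other mismatch.
    """
    # Trailing spaces in a midline are easily lost in copied text, so pad
    # it back to the width of the query line before counting.
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Final block of the first GenBank hit (XP_004146133.1), copied from the page.
query = "AGLCIDTIHGPIFLVADSWDALDGWLDAVRLVYTIYARGKNEVLAGIITG"
midline = "AGLCIDTIHGPIFLVADSWD LDGWLDA+RLVYTIYARGKNEVLAGIITG"
identities, positives, columns = block_stats(query, midline)
print(identities, positives, columns)
```

For this block the tally is 48 identities and 49 positives over 50 columns. A whole-hit percentage would need the counts summed over every block of that hit, and it may still differ slightly from the listed % identity depending on how the database treats gap columns.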


Sequences
CDS sequence
ATGTCGCCTACTCCCGAGGAACCTAATAATTTGCAGAACGGAATCGAAATCCAACCACACATTTCATCAGAATCAGATCAAATTACTGAACCCAGATCAGGGCCAGAAGA
ACCTACAGTAGATTCAATTCCCAGTTCTGAATTACAACAAGAACGTGAATCGGAATCAGTTAGTAATGGAGTAGCAGATTCGGAGCCGGAGTCTCCAAGGAAACAGTTAT
CGGAGTCAATTCATTTACATGTAGTGACGGGTGTTACAGATCCGAGTGTTGAAGAGCATAAAGAAACTTCCACCCCATCCAACGGCAACACGGAGAACTTGCAACCTGCG
TTGCGTAAAGACGAAGGAAGCCGAACGTTTACAATGAGAGAGTTGTTGAATGGATTGAAAGGTGAAGATGGTAGCGACAGCCTTAATGAATCTGAAGGCGAGAGGCCCGA
GGGGAACTCCGGTTACAGCCTTAATCAAGATAGCCCACATCAGCCTTATTCTGAACAGAGTAGAGCTGCCATGGAGTTGATCAACAGCGTTACAGGTGTCGATGAAGAGG
GTCGTTCTCGCCAAAGGATTCTCACATTTGCTGCTAGGAGGTATGCTAGTGCAATTGAGAGAAATGGTCAAGACTATGATGCTTTGTACAATTGGGCTTTGGTCCTCCAG
GAGAGTGCAGACAATGTTAGTCCAGATTCCACCTCACCTTCTAAAGATGCGTTGCTTGAGGAGGCTTGTAAAAAGTATGATGAGGCCACCCATCTGTGCCCAACACTTCA
TGATGCTTTTTATAATTGGGCTATTGCAATCTCTGATCGGGCCAAAATGCGTGGTCGTACGAAGGAGGCCGAAGAACTGTGGAAGCAGGCTACTAAAAATTATGAAAAAG
CTGTCCAACTCAACTGGAATAGTCCCCAGGCGCTAAATAATTGGGGGCTTGCCCTACAGGAACTCAGTGCGATTGTGCCGGCACGAGAAAAGCAGACAATTGTGAAAACA
GCTATCAGTAAGTTCCGTGCTGCTATACAGTTGCAATTTGATTTTCATCGAGCAATCTACAACCTTGGCACTGTTCTGTATGGATTAGCTGAGGACACATTAAGAACTGG
TGGATCAGGAAATGTTAAGGATGTTTCCCCCAATGAGTTATACAGCCAATCTGCTATTTATATTGCAGCTGCTCATGCTCTAAAACCAAACTACTCTGTGTATAGCAGCG
CCTTACGATTGGTCCGCTCCATGCTGCCATTACCCTATCTAAAAGTTGGATACCTGACTGCACCTCCTGTGGGGAGACCTCTGGCTCCTCACAGTGATTGGAAACGTTCA
CAATTTTTTCTAAATCATGATGTATTGCAAAAGCTTAACATAGGAGGGGAACAAATACAAACATCCCCTAGTATTTTAGGAAGATCTGGAAGTACCTTGAATGGCGACAG
GACAATCAAAGTACAAATTCCTGATATCGTCTCTGTATCCGCATGTGCCGATCTTACTTTACCACCTGGTGCTGGACTCTGCATTGACACAATCCATGGACCCATTTTCT
TGGTTGCCGACTCATGGGACGCACTCGATGGATGGCTTGATGCAGTTAGATTAGTTTACACGATCTACGCTCGAGGCAAGAATGAGGTTTTGGCTGGCATCATAACAGGT
TGA
mRNA sequence
ATGTCGCCTACTCCCGAGGAACCTAATAATTTGCAGAACGGAATCGAAATCCAACCACACATTTCATCAGAATCAGATCAAATTACTGAACCCAGATCAGGGCCAGAAGA
ACCTACAGTAGATTCAATTCCCAGTTCTGAATTACAACAAGAACGTGAATCGGAATCAGTTAGTAATGGAGTAGCAGATTCGGAGCCGGAGTCTCCAAGGAAACAGTTAT
CGGAGTCAATTCATTTACATGTAGTGACGGGTGTTACAGATCCGAGTGTTGAAGAGCATAAAGAAACTTCCACCCCATCCAACGGCAACACGGAGAACTTGCAACCTGCG
TTGCGTAAAGACGAAGGAAGCCGAACGTTTACAATGAGAGAGTTGTTGAATGGATTGAAAGGTGAAGATGGTAGCGACAGCCTTAATGAATCTGAAGGCGAGAGGCCCGA
GGGGAACTCCGGTTACAGCCTTAATCAAGATAGCCCACATCAGCCTTATTCTGAACAGAGTAGAGCTGCCATGGAGTTGATCAACAGCGTTACAGGTGTCGATGAAGAGG
GTCGTTCTCGCCAAAGGATTCTCACATTTGCTGCTAGGAGGTATGCTAGTGCAATTGAGAGAAATGGTCAAGACTATGATGCTTTGTACAATTGGGCTTTGGTCCTCCAG
GAGAGTGCAGACAATGTTAGTCCAGATTCCACCTCACCTTCTAAAGATGCGTTGCTTGAGGAGGCTTGTAAAAAGTATGATGAGGCCACCCATCTGTGCCCAACACTTCA
TGATGCTTTTTATAATTGGGCTATTGCAATCTCTGATCGGGCCAAAATGCGTGGTCGTACGAAGGAGGCCGAAGAACTGTGGAAGCAGGCTACTAAAAATTATGAAAAAG
CTGTCCAACTCAACTGGAATAGTCCCCAGGCGCTAAATAATTGGGGGCTTGCCCTACAGGAACTCAGTGCGATTGTGCCGGCACGAGAAAAGCAGACAATTGTGAAAACA
GCTATCAGTAAGTTCCGTGCTGCTATACAGTTGCAATTTGATTTTCATCGAGCAATCTACAACCTTGGCACTGTTCTGTATGGATTAGCTGAGGACACATTAAGAACTGG
TGGATCAGGAAATGTTAAGGATGTTTCCCCCAATGAGTTATACAGCCAATCTGCTATTTATATTGCAGCTGCTCATGCTCTAAAACCAAACTACTCTGTGTATAGCAGCG
CCTTACGATTGGTCCGCTCCATGCTGCCATTACCCTATCTAAAAGTTGGATACCTGACTGCACCTCCTGTGGGGAGACCTCTGGCTCCTCACAGTGATTGGAAACGTTCA
CAATTTTTTCTAAATCATGATGTATTGCAAAAGCTTAACATAGGAGGGGAACAAATACAAACATCCCCTAGTATTTTAGGAAGATCTGGAAGTACCTTGAATGGCGACAG
GACAATCAAAGTACAAATTCCTGATATCGTCTCTGTATCCGCATGTGCCGATCTTACTTTACCACCTGGTGCTGGACTCTGCATTGACACAATCCATGGACCCATTTTCT
TGGTTGCCGACTCATGGGACGCACTCGATGGATGGCTTGATGCAGTTAGATTAGTTTACACGATCTACGCTCGAGGCAAGAATGAGGTTTTGGCTGGCATCATAACAGGT
TGA
Protein sequence
MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPA
LRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQ
ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKT
AISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRS
QFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVQIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAVRLVYTIYARGKNEVLAGIITG
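
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), which the page does not state explicitly. It translates only the first 54 nucleotides of the CDS to keep the example short, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 54 nucleotides of the CDS sequence listed above.
cds_start = "ATGTCGCCTACTCCCGAGGAACCTAATAATTTGCAGAACGGAATCGAAATCCAA"
print(translate(cds_start))
```

This prints `MSPTPEEPNNLQNGIEIQ`, which matches the first 18 residues of the protein sequence. Joining all CDS lines into one string and passing it to `translate` would extend the same check to the full protein, with the final `TGA` codon acting as the stop.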