; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G749 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G749
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Descriptionprotein HLB1-like isoform X2
Genome locationctg1:582575..590553
RNA-Seq ExpressionCucsat.G749
SyntenyCucsat.G749
Gene Ontology termsGO:0006887 - exocytosis (biological process)
GO:0048768 - root hair cell tip growth (biological process)
GO:0005769 - early endosome (cellular component)
GO:0005802 - trans-Golgi network (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004146133.1 protein HLB1 isoform X1 [Cucumis sativus]0.0100Show/hide
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNGVPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPS
        MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNGVPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPS
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNGVPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPS

Query:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS
        NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS
Subjt:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS

Query:  AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
        AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
Subjt:  AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN

Query:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSV
        SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSV
Subjt:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSV

Query:  YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVEIPDIVSVSACADLTLPPG
        YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVEIPDIVSVSACADLTLPPG
Subjt:  YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVEIPDIVSVSACADLTLPPG

Query:  AGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG
        AGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG
Subjt:  AGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG

XP_008448563.1 PREDICTED: uncharacterized protein LOC103490705 isoform X1 [Cucumis melo]0.097.45Show/hide
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNGVPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPS
        MSPTPEEPNNLQNGIEIQPHISSESDQI+EPRS  EEPT DSIPSSELQ+ERESESVSNGV DSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTP 
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNGVPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPS

Query:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS
        NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSD LNESEGERPEGNSG+SLNQDSPHQPYSEQSRAAMELINS+TGVDEEGRSRQRILTFAARRYAS
Subjt:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS

Query:  AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
        AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
Subjt:  AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN

Query:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSV
        SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+GN+KDVSPNELYSQSAIYIAAAHALKPNYSV
Subjt:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSV

Query:  YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVEIPDIVSVSACADLTLPPG
        YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPS LGRSGSTLNGDRTIKVEIPDIVSVSACADLTLPPG
Subjt:  YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVEIPDIVSVSACADLTLPPG

Query:  AGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG
        AGLCIDTIHGPIFLVADSWD LDGWLDAIRLVYTIYARGKNEVLAGIITG
Subjt:  AGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG

XP_022965252.1 protein HLB1-like isoform X2 [Cucurbita maxima]0.090.61Show/hide
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNGVPDSEPES----PRKQLSESIHLHVVTGVTDPSVEEHKET
        MSP PEEPNNLQNGIEI+PHIS ES+QI E +S PE  T D IP++ELQ+ERESESV NGV DSEP+S    PRKQLSESI L VVT VTDP  EE K T
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNGVPDSEPES----PRKQLSESIHLHVVTGVTDPSVEEHKET

Query:  STPSNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAR
        S  SNG  EN QPALRKDEGSRTFTMRELLNGLK EDG+DSLNESEGE+PE NSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAR
Subjt:  STPSNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAR

Query:  RYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQ
        RYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQAT+NYEKAVQ
Subjt:  RYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQ

Query:  LNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKP
        LNWNSPQALNNWGLALQELSAIVPAREK TIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+G VKDVSPNELYSQSAIYIAAAHALKP
Subjt:  LNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKP

Query:  NYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVEIPDIVSVSACADLT
        +YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDVLQKLNIGGEQIQTSP++LGRSGSTLNGDRT+KVEIPDIVSVSACADLT
Subjt:  NYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVEIPDIVSVSACADLT

Query:  LPPGAGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG
        LPPGAGLCIDTIHG IFLVADSWD LDGWLDAIRLVYTIYARGKNEVLAGII G
Subjt:  LPPGAGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG

XP_023552571.1 protein HLB1-like [Cucurbita pepo subsp. pepo]0.086.23Show/hide
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVS---------------------------NGVPDSEPES----P
        MSPTPEEPNNLQNGIEI+ HIS ES+QI E +S PE  T D +P++ELQ+ER+SESV+                           NG  DSEP+S    P
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVS---------------------------NGVPDSEPES----P

Query:  RKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRA
        RKQLSESI L VVT VTDP  EE K TS  SNG TEN QPALRKDEGSRTFTMRELLNGLK EDG+DSLNESEGE+PE NSGYSLNQDSPHQPYSEQSRA
Subjt:  RKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRA

Query:  AMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISD
        AMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISD
Subjt:  AMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISD

Query:  RAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGS
        RAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+
Subjt:  RAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGS

Query:  GNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRS
        G VKDVSPNELYSQSAIYIAAAHALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDVLQKLNIGGEQ QTSP++LGRS
Subjt:  GNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRS

Query:  GSTLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG
        GSTLNGDRT+KVEIPDIVSVSACADLTLPPGAGLCIDTIHG IFLVADSWD LDGWLDAIRLVYTIYARGKNEVLAGII G
Subjt:  GSTLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG

XP_038876586.1 protein HLB1 [Benincasa hispida]0.094Show/hide
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNGVPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPS
        MSPTPEEPNNLQNGIEIQPHIS ESDQ +EPRS PE PT D+I SSEL +ERESESV+NGV DSEP S RKQL ESIHL V T V DP  EEHKETS PS
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNGVPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPS

Query:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS
        NGNTEN +PALRKDEGSRTFTMRELLNGLKGEDG+DSLNESEGERPEGN GYSLNQDSPHQPYSEQSRAAMELI+SVTGVDEEGRSRQRILTFAARRYAS
Subjt:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS

Query:  AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
        AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
Subjt:  AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN

Query:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSV
        SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+GNVKDVSPNELYSQSAIYIAAAHALKPNYSV
Subjt:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSV

Query:  YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVEIPDIVSVSACADLTLPPG
        YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPH DWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGD TIKVEIPDIVSVSACADLTLPPG
Subjt:  YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVEIPDIVSVSACADLTLPPG

Query:  AGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG
        AGLCIDTIHGP+FLVADSWD LDGWLDAIRLVYTIYARGKNEVLAGIITG
Subjt:  AGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG

TrEMBL top hitse value%identityAlignment
A0A0A0L688 Uncharacterized protein0.0100Show/hide
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNGVPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPS
        MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNGVPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPS
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNGVPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPS

Query:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS
        NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS
Subjt:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS

Query:  AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
        AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
Subjt:  AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN

Query:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSV
        SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSV
Subjt:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSV

Query:  YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVEIPDIVSVSACADLTLPPG
        YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVEIPDIVSVSACADLTLPPG
Subjt:  YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVEIPDIVSVSACADLTLPPG

Query:  AGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG
        AGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG
Subjt:  AGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG

A0A1S3BJC9 uncharacterized protein LOC103490705 isoform X10.097.45Show/hide
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNGVPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPS
        MSPTPEEPNNLQNGIEIQPHISSESDQI+EPRS  EEPT DSIPSSELQ+ERESESVSNGV DSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTP 
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNGVPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPS

Query:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS
        NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSD LNESEGERPEGNSG+SLNQDSPHQPYSEQSRAAMELINS+TGVDEEGRSRQRILTFAARRYAS
Subjt:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYAS

Query:  AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
        AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN
Subjt:  AIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWN

Query:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSV
        SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+GN+KDVSPNELYSQSAIYIAAAHALKPNYSV
Subjt:  SPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSV

Query:  YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVEIPDIVSVSACADLTLPPG
        YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPS LGRSGSTLNGDRTIKVEIPDIVSVSACADLTLPPG
Subjt:  YSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVEIPDIVSVSACADLTLPPG

Query:  AGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG
        AGLCIDTIHGPIFLVADSWD LDGWLDAIRLVYTIYARGKNEVLAGIITG
Subjt:  AGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG

A0A6J1EA05 protein HLB1-like0.085.91Show/hide
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNGVPDSEPES--------------------------------
        MSPTPEEPNNLQNGIEI+PHIS ES+QI E +S PE  T D +P++ELQ+ERE ESV NGV D EP+S                                
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNGVPDSEPES--------------------------------

Query:  PRKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSR
        PRKQLSESI L V T V DP  EE K TS  SNG TEN QPALRKDEGSRTFTMRELLNGLK EDG+DSLNESEGE+PE NSGYSLNQDSPHQPYSEQSR
Subjt:  PRKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSR

Query:  AAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAIS
        AAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAIS
Subjt:  AAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAIS

Query:  DRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG
        DRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG
Subjt:  DRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG

Query:  SGNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGR
        +G VKDVSPNELYSQSAIYIAAAHALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDVLQKLNIGGEQ QTSP++LGR
Subjt:  SGNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGR

Query:  SGSTLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG
        SGSTLNGDRT+KVEIPDIVSVSACADLTLPPGAGLCIDTIHG IFLVADSWD LDGWLDAIRLVYTIYARGKNEVLAGII G
Subjt:  SGSTLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG

A0A6J1HJU5 protein HLB1-like isoform X20.090.61Show/hide
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNGVPDSEPES----PRKQLSESIHLHVVTGVTDPSVEEHKET
        MSP PEEPNNLQNGIEI+PHIS ES+QI E +S PE  T D IP++ELQ+ERESESV NGV DSEP+S    PRKQLSESI L VVT VTDP  EE K T
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNGVPDSEPES----PRKQLSESIHLHVVTGVTDPSVEEHKET

Query:  STPSNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAR
        S  SNG  EN QPALRKDEGSRTFTMRELLNGLK EDG+DSLNESEGE+PE NSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAR
Subjt:  STPSNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAAR

Query:  RYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQ
        RYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQAT+NYEKAVQ
Subjt:  RYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQ

Query:  LNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKP
        LNWNSPQALNNWGLALQELSAIVPAREK TIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+G VKDVSPNELYSQSAIYIAAAHALKP
Subjt:  LNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKP

Query:  NYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVEIPDIVSVSACADLT
        +YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDVLQKLNIGGEQIQTSP++LGRSGSTLNGDRT+KVEIPDIVSVSACADLT
Subjt:  NYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVEIPDIVSVSACADLT

Query:  LPPGAGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG
        LPPGAGLCIDTIHG IFLVADSWD LDGWLDAIRLVYTIYARGKNEVLAGII G
Subjt:  LPPGAGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG

A0A6J1HL68 protein HLB1-like isoform X10.086.23Show/hide
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVS---------------------------NGVPDSEPES----P
        MSP PEEPNNLQNGIEI+PHIS ES+QI E +S PE  T D +P++ELQ+ERESESV+                           NGV DSEP+S    P
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVS---------------------------NGVPDSEPES----P

Query:  RKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRA
        RKQLSESI L VVT VTDP  EE K TS  SNG  EN QPALRKDEGSRTFTMRELLNGLK EDG+DSLNESEGE+PE NSGYSLNQDSPHQPYSEQSRA
Subjt:  RKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRA

Query:  AMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISD
        AMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISD
Subjt:  AMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISD

Query:  RAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGS
        RAKMRGRTKEAEELWKQAT+NYEKAVQLNWNSPQALNNWGLALQELSAIVPAREK TIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+
Subjt:  RAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGS

Query:  GNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRS
        G VKDVSPNELYSQSAIYIAAAHALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDVLQKLNIGGEQIQTSP++LGRS
Subjt:  GNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSILGRS

Query:  GSTLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG
        GSTLNGDRT+KVEIPDIVSVSACADLTLPPGAGLCIDTIHG IFLVADSWD LDGWLDAIRLVYTIYARGKNEVLAGII G
Subjt:  GSTLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG

SwissProt top hitse value%identityAlignment
Q9FHY8 Protein HLB11.3e-17961.41Show/hide
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNGVPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTP-
        M+ T EEP  LQNG        +E + I EP+   E      IP  E++ +   E V + V D++PE  + ++       V T VTD   EE +    P 
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNGVPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTP-

Query:  ------------------SNGNTENL------------QPALRK-DEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSR
                          S G +E +               L+K D+G++TFTMRELL+ LK E         EG+    +S    +++S  QP   ++ 
Subjt:  ------------------SNGNTENL------------QPALRK-DEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSR

Query:  AAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAIS
         AM+LIN +   DEEGRSRQR+L FAAR+YASAIERN  D+DALYNWAL+LQESADNVSPDS SPSKD LLEEACKKYDEAT LCPTL+DA+YNWAIAIS
Subjt:  AAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAIS

Query:  DRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG
        DRAK+RGRTKEAEELW+QA  NYEKAVQLNWNS QALNNWGL LQELS IVPAREK+ +V+TAISKFRAAI+LQFDFHRAIYNLGTVLYGLAEDTLRTGG
Subjt:  DRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG

Query:  SGNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHD-VLQKLNIGGEQIQTSPSILG
        SGN KD+ P ELYSQSAIYIAAAH+LKP+YSVYSSALRLVRSMLPLP+LKVGYLTAPPVG  LAPHSDWKR++F LNH+ +LQ L     ++  + S   
Subjt:  SGNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHD-VLQKLNIGGEQIQTSPSILG

Query:  RSGSTLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG
         + ST    +T+KV I +IVSV+ CADLTLPPGAGLCIDTIHGP+FLVADSW++LDGWLDAIRLVYTIYARGK++VLAGIITG
Subjt:  RSGSTLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG

Arabidopsis top hitse value%identityAlignment
AT5G41950.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.3e-18161.41Show/hide
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNGVPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTP-
        M+ T EEP  LQNG        +E + I EP+   E      IP  E++ +   E V + V D++PE  + ++       V T VTD   EE +    P 
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNGVPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTP-

Query:  ------------------SNGNTENL------------QPALRK-DEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSR
                          S G +E +               L+K D+G++TFTMRELL+ LK E         EG+    +S    +++S  QP   ++ 
Subjt:  ------------------SNGNTENL------------QPALRK-DEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSR

Query:  AAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAIS
         AM+LIN +   DEEGRSRQR+L FAAR+YASAIERN  D+DALYNWAL+LQESADNVSPDS SPSKD LLEEACKKYDEAT LCPTL+DA+YNWAIAIS
Subjt:  AAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAIS

Query:  DRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG
        DRAK+RGRTKEAEELW+QA  NYEKAVQLNWNS QALNNWGL LQELS IVPAREK+ +V+TAISKFRAAI+LQFDFHRAIYNLGTVLYGLAEDTLRTGG
Subjt:  DRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG

Query:  SGNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHD-VLQKLNIGGEQIQTSPSILG
        SGN KD+ P ELYSQSAIYIAAAH+LKP+YSVYSSALRLVRSMLPLP+LKVGYLTAPPVG  LAPHSDWKR++F LNH+ +LQ L     ++  + S   
Subjt:  SGNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHD-VLQKLNIGGEQIQTSPSILG

Query:  RSGSTLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG
         + ST    +T+KV I +IVSV+ CADLTLPPGAGLCIDTIHGP+FLVADSW++LDGWLDAIRLVYTIYARGK++VLAGIITG
Subjt:  RSGSTLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGCCTACTCCCGAGGAACCTAATAATTTGCAGAACGGAATCGAAATCCAACCACACATTTCATCAGAATCAGATCAAATTACTGAACCCAGATCAGGGCCAGAAGA
ACCTACAGTAGATTCAATTCCCAGTTCTGAATTACAACGAGAACGTGAATCGGAATCAGTTAGTAATGGAGTACCAGATTCGGAGCCGGAGTCTCCAAGGAAACAGTTAT
CGGAGTCAATTCATTTACATGTAGTGACGGGTGTTACAGATCCGAGTGTTGAAGAGCATAAAGAAACTTCCACCCCATCCAACGGCAACACGGAGAACTTGCAACCTGCG
TTGCGTAAAGACGAAGGAAGCCGAACGTTTACAATGAGAGAGTTGTTGAATGGATTGAAAGGTGAAGATGGTAGCGACAGCCTTAATGAATCTGAAGGCGAGAGGCCCGA
GGGGAACTCCGGTTACAGCCTTAATCAAGATAGCCCACATCAGCCTTATTCTGAACAGAGTAGAGCTGCCATGGAGTTGATCAACAGTGTTACAGGTGTCGATGAAGAGG
GTCGTTCTCGCCAAAGGATTCTCACATTTGCTGCTAGGAGGTATGCTAGTGCAATTGAGAGAAATGGTCAAGACTATGATGCTTTGTACAATTGGGCTTTGGTCCTCCAG
GAGAGTGCAGACAATGTTAGTCCAGATTCCACCTCACCTTCTAAAGATGCGTTGCTTGAGGAGGCTTGTAAAAAGTATGATGAGGCCACCCATCTGTGCCCAACACTTCA
TGATGCTTTTTATAATTGGGCTATTGCAATCTCTGATCGGGCCAAAATGCGTGGTCGTACGAAGGAGGCCGAAGAACTGTGGAAGCAGGCTACCAAAAATTATGAAAAAG
CTGTCCAACTCAACTGGAATAGTCCCCAGGCGTTAAATAATTGGGGGCTTGCCCTACAGGAACTCAGTGCGATTGTGCCGGCACGAGAAAAGCAGACAATTGTAAAAACA
GCTATCAGTAAGTTCCGTGCTGCTATACAGTTGCAATTTGATTTTCATCGAGCAATCTACAACCTTGGCACTGTTCTGTATGGATTAGCTGAGGACACATTAAGAACTGG
TGGATCAGGAAATGTTAAGGATGTTTCCCCCAATGAGTTATACAGCCAATCTGCTATTTATATTGCAGCTGCTCATGCTCTAAAACCAAACTACTCTGTTTATAGCAGCG
CCTTACGGTTGGTCCGCTCCATGCTGCCCTTACCCTATCTAAAAGTTGGATACCTGACTGCACCTCCTGTGGGGAGGCCACTTGCTCCTCACAGTGATTGGAAACGTTCA
CAATTTTTTCTAAATCATGATGTATTGCAAAAGCTTAACATAGGAGGGGAACAAATACAAACATCCCCTAGTATTTTAGGAAGATCTGGAAGTACCTTGAATGGCGACAG
GACAATCAAAGTAGAAATTCCCGATATCGTCTCTGTATCCGCATGTGCCGATCTTACTTTACCACCTGGTGCTGGACTCTGCATTGACACAATCCATGGACCCATTTTCT
TGGTTGCTGACTCATGGGACACACTCGATGGATGGCTTGATGCTATTAGATTAGTTTACACGATCTACGCTCGAGGCAAGAACGAGGTTTTGGCTGGCATCATAACAGGT
TGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGCCTACTCCCGAGGAACCTAATAATTTGCAGAACGGAATCGAAATCCAACCACACATTTCATCAGAATCAGATCAAATTACTGAACCCAGATCAGGGCCAGAAGA
ACCTACAGTAGATTCAATTCCCAGTTCTGAATTACAACGAGAACGTGAATCGGAATCAGTTAGTAATGGAGTACCAGATTCGGAGCCGGAGTCTCCAAGGAAACAGTTAT
CGGAGTCAATTCATTTACATGTAGTGACGGGTGTTACAGATCCGAGTGTTGAAGAGCATAAAGAAACTTCCACCCCATCCAACGGCAACACGGAGAACTTGCAACCTGCG
TTGCGTAAAGACGAAGGAAGCCGAACGTTTACAATGAGAGAGTTGTTGAATGGATTGAAAGGTGAAGATGGTAGCGACAGCCTTAATGAATCTGAAGGCGAGAGGCCCGA
GGGGAACTCCGGTTACAGCCTTAATCAAGATAGCCCACATCAGCCTTATTCTGAACAGAGTAGAGCTGCCATGGAGTTGATCAACAGTGTTACAGGTGTCGATGAAGAGG
GTCGTTCTCGCCAAAGGATTCTCACATTTGCTGCTAGGAGGTATGCTAGTGCAATTGAGAGAAATGGTCAAGACTATGATGCTTTGTACAATTGGGCTTTGGTCCTCCAG
GAGAGTGCAGACAATGTTAGTCCAGATTCCACCTCACCTTCTAAAGATGCGTTGCTTGAGGAGGCTTGTAAAAAGTATGATGAGGCCACCCATCTGTGCCCAACACTTCA
TGATGCTTTTTATAATTGGGCTATTGCAATCTCTGATCGGGCCAAAATGCGTGGTCGTACGAAGGAGGCCGAAGAACTGTGGAAGCAGGCTACCAAAAATTATGAAAAAG
CTGTCCAACTCAACTGGAATAGTCCCCAGGCGTTAAATAATTGGGGGCTTGCCCTACAGGAACTCAGTGCGATTGTGCCGGCACGAGAAAAGCAGACAATTGTAAAAACA
GCTATCAGTAAGTTCCGTGCTGCTATACAGTTGCAATTTGATTTTCATCGAGCAATCTACAACCTTGGCACTGTTCTGTATGGATTAGCTGAGGACACATTAAGAACTGG
TGGATCAGGAAATGTTAAGGATGTTTCCCCCAATGAGTTATACAGCCAATCTGCTATTTATATTGCAGCTGCTCATGCTCTAAAACCAAACTACTCTGTTTATAGCAGCG
CCTTACGGTTGGTCCGCTCCATGCTGCCCTTACCCTATCTAAAAGTTGGATACCTGACTGCACCTCCTGTGGGGAGGCCACTTGCTCCTCACAGTGATTGGAAACGTTCA
CAATTTTTTCTAAATCATGATGTATTGCAAAAGCTTAACATAGGAGGGGAACAAATACAAACATCCCCTAGTATTTTAGGAAGATCTGGAAGTACCTTGAATGGCGACAG
GACAATCAAAGTAGAAATTCCCGATATCGTCTCTGTATCCGCATGTGCCGATCTTACTTTACCACCTGGTGCTGGACTCTGCATTGACACAATCCATGGACCCATTTTCT
TGGTTGCTGACTCATGGGACACACTCGATGGATGGCTTGATGCTATTAGATTAGTTTACACGATCTACGCTCGAGGCAAGAACGAGGTTTTGGCTGGCATCATAACAGGT
TGA
Protein sequenceShow/hide protein sequence
MSPTPEEPNNLQNGIEIQPHISSESDQITEPRSGPEEPTVDSIPSSELQRERESESVSNGVPDSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPSNGNTENLQPA
LRKDEGSRTFTMRELLNGLKGEDGSDSLNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNGQDYDALYNWALVLQ
ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKT
AISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGSGNVKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRS
QFFLNHDVLQKLNIGGEQIQTSPSILGRSGSTLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDTLDGWLDAIRLVYTIYARGKNEVLAGIITG