CuGenDBv2

Pay0003307 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0003307
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: protein HLB1-like isoform X2
Genome location: chr06:35475933..35483550
RNA-Seq Expression: Pay0003307
Synteny: Pay0003307
Gene Ontology terms:
GO:0006887 - exocytosis (biological process)
GO:0048768 - root hair cell tip growth (biological process)
GO:0005769 - early endosome (cellular component)
GO:0005802 - trans-Golgi network (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domains: IPR011990 - Tetratricopeptide-like helical domain superfamily
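
The genome location in this record is written as chromosome, start and end (chr06:35475933..35483550). As a minimal sketch, the string could be parsed as below. The names `Region` and `parse_location` are hypothetical helpers, not part of any CuGenDB tool, and the length calculation assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval (hypothetical helper type for this sketch)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive at both ends.
        return self.end - self.start + 1


# Matches strings of the form "<chrom>:<start>..<end>".
_LOCATION = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Region:
    """Parse a 'chrom:start..end' location string into a Region."""
    match = _LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Region(match["chrom"], int(match["start"]), int(match["end"]))


region = parse_location("chr06:35475933..35483550")
print(region.chrom, region.start, region.end, region.length)
```

Under the inclusive-coordinate assumption the gene spans 7618 bp on chr06, which is longer than the 1635 nt coding sequence listed further down this page.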


Homology
GenBank top hits (e value, % identity, alignment)
XP_004146133.1 protein HLB1 isoform X1 [Cucumis sativus] | e value: 2.4e-292 | % identity: 94.75
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPF
        MSPTPEEPNNLQNGIEIQPHISSESDQI+EPRS  EEPT DSIPSSELQ+ERESESVSNGV DSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTP 
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPF

Query:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRAAMELINSITGVDEEGRSRQRILTFAARRYAR
        NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSD LNESEGERPEGNSG+SLNQDSPHQPYSEQSRAAMELINS+TGVDEEGRSRQRILTFAARRYA 
Subjt:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRAAMELINSITGVDEEGRSRQRILTFAARRYAR

Query:  ACEIEKNGQTHDCL--------ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLN
        A  IE+NGQ +D L        ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLN
Subjt:  ACEIEKNGQTHDCL--------ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLN

Query:  WNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNIKDVSPNELYSQSAIYIAAAHALKPNY
        WNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+GN+KDVSPNELYSQSAIYIAAAHALKPNY
Subjt:  WNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNIKDVSPNELYSQSAIYIAAAHALKPNY

Query:  SVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSTLGRSGSTLNGDRTIKVEIPDIVSVSACADLTLP
        SVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPS LGRSGSTLNGDRTIKVEIPDIVSVSACADLTLP
Subjt:  SVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSTLGRSGSTLNGDRTIKVEIPDIVSVSACADLTLP

Query:  PGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG
        PGAGLCIDTIHGPIFLVADSWD LDGWLDAIRLVYTIYARGKNEVLAGIITG
Subjt:  PGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG

XP_008448563.1 PREDICTED: uncharacterized protein LOC103490705 isoform X1 [Cucumis melo] | e value: 5.4e-300 | % identity: 97.28
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPF
        MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPF
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPF

Query:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRAAMELINSITGVDEEGRSRQRILTFAARRYAR
        NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRAAMELINSITGVDEEGRSRQRILTFAARRYA 
Subjt:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRAAMELINSITGVDEEGRSRQRILTFAARRYAR

Query:  ACEIEKNGQTHDCL--------ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLN
        A  IE+NGQ +D L        ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLN
Subjt:  ACEIEKNGQTHDCL--------ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLN

Query:  WNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNIKDVSPNELYSQSAIYIAAAHALKPNY
        WNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNIKDVSPNELYSQSAIYIAAAHALKPNY
Subjt:  WNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNIKDVSPNELYSQSAIYIAAAHALKPNY

Query:  SVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSTLGRSGSTLNGDRTIKVEIPDIVSVSACADLTLP
        SVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSTLGRSGSTLNGDRTIKVEIPDIVSVSACADLTLP
Subjt:  SVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSTLGRSGSTLNGDRTIKVEIPDIVSVSACADLTLP

Query:  PGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG
        PGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG
Subjt:  PGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG

XP_022965252.1 protein HLB1-like isoform X2 [Cucurbita maxima] | e value: 2.1e-264 | % identity: 87.95
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPSSELQQERESESVSNGVADSEP----ESPRKQLSESIHLHVVTGVTDPSVEEHKET
        MSP PEEPNNLQNGIEI+PHIS ES+QI E +SE  E TAD IP++ELQQERESESV NGVADSEP    +SPRKQLSESI L VVT VTDP  EE K T
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPSSELQQERESESVSNGVADSEP----ESPRKQLSESIHLHVVTGVTDPSVEEHKET

Query:  STPFNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRAAMELINSITGVDEEGRSRQRILTFAAR
        S   NG  EN QPALRKDEGSRTFTMRELLNGLK EDG+D LNESEGE+PE NSG+SLNQDSPHQPYSEQSRAAMELINS+TGVDEEGRSRQRILTFAAR
Subjt:  STPFNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRAAMELINSITGVDEEGRSRQRILTFAAR

Query:  RYARACEIEKNGQTHDCL--------ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKA
        RYA A  IE+NGQ +D L        ESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQAT+NYEKA
Subjt:  RYARACEIEKNGQTHDCL--------ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKA

Query:  VQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNIKDVSPNELYSQSAIYIAAAHAL
        VQLNWNSPQALNNWGLALQELSAIVPAREK TIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG +KDVSPNELYSQSAIYIAAAHAL
Subjt:  VQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNIKDVSPNELYSQSAIYIAAAHAL

Query:  KPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSTLGRSGSTLNGDRTIKVEIPDIVSVSACAD
        KP+YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDVLQKLNIGGEQIQTSP+ LGRSGSTLNGDRT+KVEIPDIVSVSACAD
Subjt:  KPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSTLGRSGSTLNGDRTIKVEIPDIVSVSACAD

Query:  LTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG
        LTLPPGAGLCIDTIHG IFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGII G
Subjt:  LTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG

XP_023552571.1 protein HLB1-like [Cucurbita pepo subsp. pepo] | e value: 2.9e-261 | % identity: 83.7
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPSSELQQERESESVS---------------------------NGVADSEP----ESP
        MSPTPEEPNNLQNGIEI+ HIS ES+QI E +SE  E TAD +P++ELQQER+SESV+                           NG ADSEP    +SP
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPSSELQQERESESVS---------------------------NGVADSEP----ESP

Query:  RKQLSESIHLHVVTGVTDPSVEEHKETSTPFNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRA
        RKQLSESI L VVT VTDP  EE K TS   NG TEN QPALRKDEGSRTFTMRELLNGLK EDG+D LNESEGE+PE NSG+SLNQDSPHQPYSEQSRA
Subjt:  RKQLSESIHLHVVTGVTDPSVEEHKETSTPFNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRA

Query:  AMELINSITGVDEEGRSRQRILTFAARRYARACEIEKNGQTHDCL--------ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAI
        AMELINS+TGVDEEGRSRQRILTFAARRYA A  IE+NGQ +D L        ESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAI
Subjt:  AMELINSITGVDEEGRSRQRILTFAARRYARACEIEKNGQTHDCL--------ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAI

Query:  SDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG
        SDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG
Subjt:  SDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG

Query:  GTGNIKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSTLG
        GTG +KDVSPNELYSQSAIYIAAAHALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDVLQKLNIGGEQ QTSP+ LG
Subjt:  GTGNIKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSTLG

Query:  RSGSTLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG
        RSGSTLNGDRT+KVEIPDIVSVSACADLTLPPGAGLCIDTIHG IFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGII G
Subjt:  RSGSTLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG

XP_038876586.1 protein HLB1 [Benincasa hispida] | e value: 4.5e-278 | % identity: 91.3
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPF
        MSPTPEEPNNLQNGIEIQPHIS ESDQ SEPRSE  EPTAD+I SSEL QERESESV+NGVADSEP S RKQL ESIHL V T V DP  EEHKETS P 
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPF

Query:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRAAMELINSITGVDEEGRSRQRILTFAARRYAR
        NGNTEN +PALRKDEGSRTFTMRELLNGLKGEDG+D LNESEGERPEGN G+SLNQDSPHQPYSEQSRAAMELI+S+TGVDEEGRSRQRILTFAARRYA 
Subjt:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRAAMELINSITGVDEEGRSRQRILTFAARRYAR

Query:  ACEIEKNGQTHDCL--------ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLN
        A  IE+NGQ +D L        ESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLN
Subjt:  ACEIEKNGQTHDCL--------ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLN

Query:  WNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNIKDVSPNELYSQSAIYIAAAHALKPNY
        WNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN+KDVSPNELYSQSAIYIAAAHALKPNY
Subjt:  WNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNIKDVSPNELYSQSAIYIAAAHALKPNY

Query:  SVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSTLGRSGSTLNGDRTIKVEIPDIVSVSACADLTLP
        SVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPH DWKRSQFFLNHDVLQKLNIGGEQIQTSPS LGRSGSTLNGD TIKVEIPDIVSVSACADLTLP
Subjt:  SVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSTLGRSGSTLNGDRTIKVEIPDIVSVSACADLTLP

Query:  PGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG
        PGAGLCIDTIHGP+FLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG
Subjt:  PGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L688 Uncharacterized protein | e value: 1.2e-292 | % identity: 94.75
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPF
        MSPTPEEPNNLQNGIEIQPHISSESDQI+EPRS  EEPT DSIPSSELQ+ERESESVSNGV DSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTP 
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPF

Query:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRAAMELINSITGVDEEGRSRQRILTFAARRYAR
        NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSD LNESEGERPEGNSG+SLNQDSPHQPYSEQSRAAMELINS+TGVDEEGRSRQRILTFAARRYA 
Subjt:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRAAMELINSITGVDEEGRSRQRILTFAARRYAR

Query:  ACEIEKNGQTHDCL--------ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLN
        A  IE+NGQ +D L        ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLN
Subjt:  ACEIEKNGQTHDCL--------ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLN

Query:  WNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNIKDVSPNELYSQSAIYIAAAHALKPNY
        WNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGG+GN+KDVSPNELYSQSAIYIAAAHALKPNY
Subjt:  WNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNIKDVSPNELYSQSAIYIAAAHALKPNY

Query:  SVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSTLGRSGSTLNGDRTIKVEIPDIVSVSACADLTLP
        SVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPS LGRSGSTLNGDRTIKVEIPDIVSVSACADLTLP
Subjt:  SVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSTLGRSGSTLNGDRTIKVEIPDIVSVSACADLTLP

Query:  PGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG
        PGAGLCIDTIHGPIFLVADSWD LDGWLDAIRLVYTIYARGKNEVLAGIITG
Subjt:  PGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG

A0A1S3BJC9 uncharacterized protein LOC103490705 isoform X1 | e value: 2.6e-300 | % identity: 97.28
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPF
        MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPF
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPF

Query:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRAAMELINSITGVDEEGRSRQRILTFAARRYAR
        NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRAAMELINSITGVDEEGRSRQRILTFAARRYA 
Subjt:  NGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRAAMELINSITGVDEEGRSRQRILTFAARRYAR

Query:  ACEIEKNGQTHDCL--------ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLN
        A  IE+NGQ +D L        ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLN
Subjt:  ACEIEKNGQTHDCL--------ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLN

Query:  WNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNIKDVSPNELYSQSAIYIAAAHALKPNY
        WNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNIKDVSPNELYSQSAIYIAAAHALKPNY
Subjt:  WNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNIKDVSPNELYSQSAIYIAAAHALKPNY

Query:  SVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSTLGRSGSTLNGDRTIKVEIPDIVSVSACADLTLP
        SVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSTLGRSGSTLNGDRTIKVEIPDIVSVSACADLTLP
Subjt:  SVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSTLGRSGSTLNGDRTIKVEIPDIVSVSACADLTLP

Query:  PGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG
        PGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG
Subjt:  PGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG

A0A6J1EA05 protein HLB1-like | e value: 2.0e-260 | % identity: 83.53
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPSSELQQERESESVS---------------------------NGVAD----SEPESP
        MSPTPEEPNNLQNGIEI+PHIS ES+QI E +SE  E TAD +P++ELQQERE ESV+                           NGVAD    SE +SP
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPSSELQQERESESVS---------------------------NGVAD----SEPESP

Query:  RKQLSESIHLHVVTGVTDPSVEEHKETSTPFNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRA
        RKQLSESI L V T V DP  EE K TS   NG TEN QPALRKDEGSRTFTMRELLNGLK EDG+D LNESEGE+PE NSG+SLNQDSPHQPYSEQSRA
Subjt:  RKQLSESIHLHVVTGVTDPSVEEHKETSTPFNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRA

Query:  AMELINSITGVDEEGRSRQRILTFAARRYARACEIEKNGQTHDCL--------ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAI
        AMELINS+TGVDEEGRSRQRILTFAARRYA A  IE+NGQ +D L        ESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAI
Subjt:  AMELINSITGVDEEGRSRQRILTFAARRYARACEIEKNGQTHDCL--------ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAI

Query:  SDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG
        SDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG
Subjt:  SDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG

Query:  GTGNIKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSTLG
        GTG +KDVSPNELYSQSAIYIAAAHALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDVLQKLNIGGEQ QTSP+ LG
Subjt:  GTGNIKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSTLG

Query:  RSGSTLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG
        RSGSTLNGDRT+KVEIPDIVSVSACADLTLPPGAGLCIDTIHG IFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGII G
Subjt:  RSGSTLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG

A0A6J1HJU5 protein HLB1-like isoform X2 | e value: 1.0e-264 | % identity: 87.95
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPSSELQQERESESVSNGVADSEP----ESPRKQLSESIHLHVVTGVTDPSVEEHKET
        MSP PEEPNNLQNGIEI+PHIS ES+QI E +SE  E TAD IP++ELQQERESESV NGVADSEP    +SPRKQLSESI L VVT VTDP  EE K T
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPSSELQQERESESVSNGVADSEP----ESPRKQLSESIHLHVVTGVTDPSVEEHKET

Query:  STPFNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRAAMELINSITGVDEEGRSRQRILTFAAR
        S   NG  EN QPALRKDEGSRTFTMRELLNGLK EDG+D LNESEGE+PE NSG+SLNQDSPHQPYSEQSRAAMELINS+TGVDEEGRSRQRILTFAAR
Subjt:  STPFNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRAAMELINSITGVDEEGRSRQRILTFAAR

Query:  RYARACEIEKNGQTHDCL--------ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKA
        RYA A  IE+NGQ +D L        ESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQAT+NYEKA
Subjt:  RYARACEIEKNGQTHDCL--------ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKA

Query:  VQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNIKDVSPNELYSQSAIYIAAAHAL
        VQLNWNSPQALNNWGLALQELSAIVPAREK TIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTG +KDVSPNELYSQSAIYIAAAHAL
Subjt:  VQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNIKDVSPNELYSQSAIYIAAAHAL

Query:  KPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSTLGRSGSTLNGDRTIKVEIPDIVSVSACAD
        KP+YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDVLQKLNIGGEQIQTSP+ LGRSGSTLNGDRT+KVEIPDIVSVSACAD
Subjt:  KPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSTLGRSGSTLNGDRTIKVEIPDIVSVSACAD

Query:  LTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG
        LTLPPGAGLCIDTIHG IFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGII G
Subjt:  LTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG

A0A6J1HL68 protein HLB1-like isoform X1 | e value: 1.4e-261 | % identity: 83.7
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPSSELQQERESESVS---------------------------NGVADSEP----ESP
        MSP PEEPNNLQNGIEI+PHIS ES+QI E +SE  E TAD +P++ELQQERESESV+                           NGVADSEP    +SP
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPSSELQQERESESVS---------------------------NGVADSEP----ESP

Query:  RKQLSESIHLHVVTGVTDPSVEEHKETSTPFNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRA
        RKQLSESI L VVT VTDP  EE K TS   NG  EN QPALRKDEGSRTFTMRELLNGLK EDG+D LNESEGE+PE NSG+SLNQDSPHQPYSEQSRA
Subjt:  RKQLSESIHLHVVTGVTDPSVEEHKETSTPFNGNTENLQPALRKDEGSRTFTMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRA

Query:  AMELINSITGVDEEGRSRQRILTFAARRYARACEIEKNGQTHDCL--------ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAI
        AMELINS+TGVDEEGRSRQRILTFAARRYA A  IE+NGQ +D L        ESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAI
Subjt:  AMELINSITGVDEEGRSRQRILTFAARRYARACEIEKNGQTHDCL--------ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAI

Query:  SDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG
        SDRAKMRGRTKEAEELWKQAT+NYEKAVQLNWNSPQALNNWGLALQELSAIVPAREK TIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG
Subjt:  SDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG

Query:  GTGNIKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSTLG
        GTG +KDVSPNELYSQSAIYIAAAHALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDVLQKLNIGGEQIQTSP+ LG
Subjt:  GTGNIKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSTLG

Query:  RSGSTLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG
        RSGSTLNGDRT+KVEIPDIVSVSACADLTLPPGAGLCIDTIHG IFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGII G
Subjt:  RSGSTLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG

SwissProt top hits (e value, % identity, alignment)
Q9FHY8 Protein HLB1 | e value: 1.6e-169 | % identity: 60
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPS-------------------SELQQERESESVSNGVADSEPESPRKQLSESIHLHV
        M+ T EEP  LQNG        +E + I EP+ + E      IP                     E+Q E + E V   V D++PE  + ++       V
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPS-------------------SELQQERESESVSNGVADSEPESPRKQLSESIHLHV

Query:  VT----GVTDPSVEEHKETSTPFNG---NTENLQPALRK-DEGSRTFTMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRAAMEL
        VT     +TD  +        P        E+    L+K D+G++TFTMRELL+ LK E+G DG   S        S    +++S  QP   ++  AM+L
Subjt:  VT----GVTDPSVEEHKETSTPFNG---NTENLQPALRK-DEGSRTFTMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRAAMEL

Query:  INSITGVDEEGRSRQRILTFAARRYARACEIEKNGQTHDCL--------ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRA
        IN I   DEEGRSRQR+L FAAR+YA A  IE+N   HD L        ESADNVSPDS SPSKD LLEEACKKYDEAT LCPTL+DA+YNWAIAISDRA
Subjt:  INSITGVDEEGRSRQRILTFAARRYARACEIEKNGQTHDCL--------ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRA

Query:  KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN
        K+RGRTKEAEELW+QA  NYEKAVQLNWNS QALNNWGL LQELS IVPAREK+ +V+TAISKFRAAI+LQFDFHRAIYNLGTVLYGLAEDTLRTGG+GN
Subjt:  KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN

Query:  IKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHD-VLQKLNIGGEQIQTSPSTLGRSG
         KD+ P ELYSQSAIYIAAAH+LKP+YSVYSSALRLVRSMLPLP+LKVGYLTAPPVG  LAPHSDWKR++F LNH+ +LQ L     ++  + S    + 
Subjt:  IKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHD-VLQKLNIGGEQIQTSPSTLGRSG

Query:  STLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG
        ST    +T+KV I +IVSV+ CADLTLPPGAGLCIDTIHGP+FLVADSW++LDGWLDAIRLVYTIYARGK++VLAGIITG
Subjt:  STLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG

Arabidopsis top hits (e value, % identity, alignment)
AT5G41950.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 1.1e-170 | % identity: 60
Query:  MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPS-------------------SELQQERESESVSNGVADSEPESPRKQLSESIHLHV
        M+ T EEP  LQNG        +E + I EP+ + E      IP                     E+Q E + E V   V D++PE  + ++       V
Subjt:  MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPS-------------------SELQQERESESVSNGVADSEPESPRKQLSESIHLHV

Query:  VT----GVTDPSVEEHKETSTPFNG---NTENLQPALRK-DEGSRTFTMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRAAMEL
        VT     +TD  +        P        E+    L+K D+G++TFTMRELL+ LK E+G DG   S        S    +++S  QP   ++  AM+L
Subjt:  VT----GVTDPSVEEHKETSTPFNG---NTENLQPALRK-DEGSRTFTMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRAAMEL

Query:  INSITGVDEEGRSRQRILTFAARRYARACEIEKNGQTHDCL--------ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRA
        IN I   DEEGRSRQR+L FAAR+YA A  IE+N   HD L        ESADNVSPDS SPSKD LLEEACKKYDEAT LCPTL+DA+YNWAIAISDRA
Subjt:  INSITGVDEEGRSRQRILTFAARRYARACEIEKNGQTHDCL--------ESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRA

Query:  KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN
        K+RGRTKEAEELW+QA  NYEKAVQLNWNS QALNNWGL LQELS IVPAREK+ +V+TAISKFRAAI+LQFDFHRAIYNLGTVLYGLAEDTLRTGG+GN
Subjt:  KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGN

Query:  IKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHD-VLQKLNIGGEQIQTSPSTLGRSG
         KD+ P ELYSQSAIYIAAAH+LKP+YSVYSSALRLVRSMLPLP+LKVGYLTAPPVG  LAPHSDWKR++F LNH+ +LQ L     ++  + S    + 
Subjt:  IKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHD-VLQKLNIGGEQIQTSPSTLGRSG

Query:  STLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG
        ST    +T+KV I +IVSV+ CADLTLPPGAGLCIDTIHGP+FLVADSW++LDGWLDAIRLVYTIYARGK++VLAGIITG
Subjt:  STLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG
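
In the BLAST-style alignments above, the unlabelled middle row marks identical residues with the residue letter, conservative substitutions with `+`, and mismatches or gaps with a blank. As a rough sketch, identities within one alignment block can be counted from that middle row. The function name `midline_identity` is hypothetical. The figure it returns covers only the block it is given, so it will not in general equal the whole-alignment % identity reported for each hit.

```python
def midline_identity(query: str, midline: str) -> tuple[int, int]:
    """Return (identical positions, aligned length) for one alignment block.

    Assumes `midline` is the BLAST match row for `query`: a letter marks an
    identical residue, '+' a conservative substitution, and a space a
    mismatch or gap.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(1 for ch in midline if ch.isalpha())
    return identical, len(query)


# Final block of the first GenBank hit (XP_004146133.1), copied from above.
query = "PGAGLCIDTIHGPIFLVADSWDALDGWLDAIRLVYTIYARGKNEVLAGIITG"
midline = "PGAGLCIDTIHGPIFLVADSWD LDGWLDAIRLVYTIYARGKNEVLAGIITG"

identical, total = midline_identity(query, midline)
print(f"{identical}/{total} identical ({100 * identical / total:.2f}%)")
```

Summing the two counts over every block of a hit, rather than looking at a single block, would give a figure comparable to the per-hit % identity column.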


Sequences
CDS sequence
ATGTCGCCTACTCCCGAGGAACCTAATAATTTGCAGAACGGAATCGAAATCCAACCACACATTTCATCAGAATCAGATCAAATTAGTGAACCCAGATCAGAGCTT
GAAGAACCTACAGCAGATTCAATTCCCAGTTCGGAATTACAACAAGAACGTGAATCGGAATCAGTTAGTAATGGAGTAGCAGATTCGGAGCCGGAGTCCCCAAGG
AAACAGTTATCGGAGTCAATTCATTTACATGTAGTGACGGGTGTTACAGATCCGAGTGTTGAAGAGCATAAAGAAACTTCCACCCCATTCAACGGCAACACGGAG
AACTTGCAACCTGCGTTGCGTAAAGACGAAGGAAGCAGAACGTTTACAATGAGAGAGTTGTTGAATGGATTGAAAGGTGAAGATGGTAGCGACGGCCTTAATGAA
TCTGAAGGCGAGAGGCCCGAGGGGAACTCCGGTCACAGCCTTAATCAAGATAGCCCACATCAGCCTTATTCTGAACAGAGTAGAGCTGCCATGGAGTTGATCAAC
AGTATTACAGGTGTCGATGAAGAGGGCCGTTCTCGTCAAAGGATTCTCACATTTGCTGCTAGGAGGTATGCTAGGGCATGCGAGATTGAGAAGAATGGTCAAACT
CATGATTGCTTAGAGAGTGCAGACAATGTTAGTCCAGATTCCACCTCGCCCTCTAAAGATGCGTTGCTTGAGGAGGCTTGTAAAAAGTACGATGAGGCCACCCAT
CTGTGCCCAACACTTCATGATGCTTTTTATAATTGGGCTATTGCAATCTCTGATCGGGCCAAAATGCGTGGTCGTACGAAGGAGGCCGAAGAACTGTGGAAGCAG
GCTACCAAAAATTATGAAAAAGCTGTCCAACTCAACTGGAATAGCCCCCAGGCGCTAAATAATTGGGGGCTTGCCCTACAGGAACTCAGTGCGATTGTGCCAGCA
CGAGAAAAGCAGACAATTGTAAAAACAGCGATCAGTAAGTTCCGTGCTGCTATACAGTTGCAATTTGATTTTCATCGAGCAATCTACAACCTTGGCACTGTTCTG
TATGGATTAGCTGAGGACACATTAAGGACTGGTGGAACAGGAAATATTAAAGATGTTTCCCCCAATGAGTTATACAGTCAATCTGCTATTTATATTGCAGCTGCT
CATGCTCTAAAACCAAACTACTCTGTGTATAGCAGCGCCTTACGGTTGGTCCGTTCCATGCTGCCGTTACCCTATCTAAAAGTCGGATACCTGACTGCACCTCCT
GTGGGGAGACCACTTGCTCCTCACAGTGATTGGAAACGTTCACAATTTTTTCTAAACCATGATGTATTGCAAAAGCTTAACATAGGAGGGGAACAAATACAAACA
TCCCCTAGTACTTTAGGAAGATCTGGAAGTACCTTGAATGGCGACAGGACAATCAAAGTAGAAATTCCTGATATTGTCTCTGTATCCGCATGTGCCGATCTTACT
TTACCACCTGGTGCTGGACTCTGCATTGACACAATCCATGGACCCATTTTCTTGGTTGCTGACTCATGGGACGCACTCGATGGATGGCTCGATGCAATTAGATTA
GTTTACACGATCTACGCTCGAGGCAAGAACGAGGTTTTGGCTGGCATCATAACAGGTTGA
mRNA sequence
ATGTCGCCTACTCCCGAGGAACCTAATAATTTGCAGAACGGAATCGAAATCCAACCACACATTTCATCAGAATCAGATCAAATTAGTGAACCCAGATCAGAGCTT
GAAGAACCTACAGCAGATTCAATTCCCAGTTCGGAATTACAACAAGAACGTGAATCGGAATCAGTTAGTAATGGAGTAGCAGATTCGGAGCCGGAGTCCCCAAGG
AAACAGTTATCGGAGTCAATTCATTTACATGTAGTGACGGGTGTTACAGATCCGAGTGTTGAAGAGCATAAAGAAACTTCCACCCCATTCAACGGCAACACGGAG
AACTTGCAACCTGCGTTGCGTAAAGACGAAGGAAGCAGAACGTTTACAATGAGAGAGTTGTTGAATGGATTGAAAGGTGAAGATGGTAGCGACGGCCTTAATGAA
TCTGAAGGCGAGAGGCCCGAGGGGAACTCCGGTCACAGCCTTAATCAAGATAGCCCACATCAGCCTTATTCTGAACAGAGTAGAGCTGCCATGGAGTTGATCAAC
AGTATTACAGGTGTCGATGAAGAGGGCCGTTCTCGTCAAAGGATTCTCACATTTGCTGCTAGGAGGTATGCTAGGGCATGCGAGATTGAGAAGAATGGTCAAACT
CATGATTGCTTAGAGAGTGCAGACAATGTTAGTCCAGATTCCACCTCGCCCTCTAAAGATGCGTTGCTTGAGGAGGCTTGTAAAAAGTACGATGAGGCCACCCAT
CTGTGCCCAACACTTCATGATGCTTTTTATAATTGGGCTATTGCAATCTCTGATCGGGCCAAAATGCGTGGTCGTACGAAGGAGGCCGAAGAACTGTGGAAGCAG
GCTACCAAAAATTATGAAAAAGCTGTCCAACTCAACTGGAATAGCCCCCAGGCGCTAAATAATTGGGGGCTTGCCCTACAGGAACTCAGTGCGATTGTGCCAGCA
CGAGAAAAGCAGACAATTGTAAAAACAGCGATCAGTAAGTTCCGTGCTGCTATACAGTTGCAATTTGATTTTCATCGAGCAATCTACAACCTTGGCACTGTTCTG
TATGGATTAGCTGAGGACACATTAAGGACTGGTGGAACAGGAAATATTAAAGATGTTTCCCCCAATGAGTTATACAGTCAATCTGCTATTTATATTGCAGCTGCT
CATGCTCTAAAACCAAACTACTCTGTGTATAGCAGCGCCTTACGGTTGGTCCGTTCCATGCTGCCGTTACCCTATCTAAAAGTCGGATACCTGACTGCACCTCCT
GTGGGGAGACCACTTGCTCCTCACAGTGATTGGAAACGTTCACAATTTTTTCTAAACCATGATGTATTGCAAAAGCTTAACATAGGAGGGGAACAAATACAAACA
TCCCCTAGTACTTTAGGAAGATCTGGAAGTACCTTGAATGGCGACAGGACAATCAAAGTAGAAATTCCTGATATTGTCTCTGTATCCGCATGTGCCGATCTTACT
TTACCACCTGGTGCTGGACTCTGCATTGACACAATCCATGGACCCATTTTCTTGGTTGCTGACTCATGGGACGCACTCGATGGATGGCTCGATGCAATTAGATTA
GTTTACACGATCTACGCTCGAGGCAAGAACGAGGTTTTGGCTGGCATCATAACAGGTTGA
Protein sequence
MSPTPEEPNNLQNGIEIQPHISSESDQISEPRSELEEPTADSIPSSELQQERESESVSNGVADSEPESPRKQLSESIHLHVVTGVTDPSVEEHKETSTPFNGNTE
NLQPALRKDEGSRTFTMRELLNGLKGEDGSDGLNESEGERPEGNSGHSLNQDSPHQPYSEQSRAAMELINSITGVDEEGRSRQRILTFAARRYARACEIEKNGQT
HDCLESADNVSPDSTSPSKDALLEEACKKYDEATHLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPA
REKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGTGNIKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPP
VGRPLAPHSDWKRSQFFLNHDVLQKLNIGGEQIQTSPSTLGRSGSTLNGDRTIKVEIPDIVSVSACADLTLPPGAGLCIDTIHGPIFLVADSWDALDGWLDAIRL
VYTIYARGKNEVLAGIITG
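
The protein sequence above corresponds to the CDS read codon by codon. The sketch below translates a CDS fragment with the standard genetic code (NCBI translation table 1). That choice of table is an assumption, since the record does not name one, and `translate` is a hypothetical helper rather than a CuGenDB function. The two fragments used are the first and last lines of the CDS as listed on this page.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the CDS listed in this record.
FIRST_LINE = (
    "ATGTCGCCTACTCCCGAGGAACCTAATAATTTGCAGAACGGAATCGAAATCCAACCACACATT"
    "TCATCAGAATCAGATCAAATTAGTGAACCCAGATCAGAGCTT"
)
LAST_LINE = "GTTTACACGATCTACGCTCGAGGCAAGAACGAGGTTTTGGCTGGCATCATAACAGGTTGA"

print(translate(FIRST_LINE))
print(translate(LAST_LINE))
```

Each full CDS line holds 105 nt (35 codons), so the line breaks fall on codon boundaries and single lines can be translated on their own. The last line ends in the stop codon TGA, which is why the protein is one residue shorter than the codon count.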