; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0017212 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0017212
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionprotein HLB1-like isoform X2
Genome locationchr5:1035731..1045513
RNA-Seq ExpressionLag0017212
SyntenyLag0017212
Gene Ontology termsGO:0006887 - exocytosis (biological process)
GO:0048768 - root hair cell tip growth (biological process)
GO:0005769 - early endosome (cellular component)
GO:0005802 - trans-Golgi network (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004146133.1 protein HLB1 isoform X1 [Cucumis sativus]8.7e-27490.2Show/hide
Query:  MSPTPEEPNNLQNGIETQPHISSESNPTDEEPRSESSQLIADAIPKPELQQERDSESL-NEEPDSEPESRRKQFAESIQLQVVTDVSDPGFEEPKETSIP
        MSPTPEEPNNLQNGIE QPHISSES+    EPRS   +   D+IP  ELQ+ER+SES+ N  PDSEPES RKQ +ESI L VVT V+DP  EE KETS P
Subjt:  MSPTPEEPNNLQNGIETQPHISSESNPTDEEPRSESSQLIADAIPKPELQQERDSESL-NEEPDSEPESRRKQFAESIQLQVVTDVSDPGFEEPKETSIP

Query:  SNG-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYA
        SNG TEN QPALRKDEGSRTFTMRELLNGLKGEDG+DS+NESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYA
Subjt:  SNG-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYA

Query:  SAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNW
        SAIERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNW
Subjt:  SAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNW

Query:  NSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GNIKDVSPNELYSQSAIYIAAAHALKPNYS
        NSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG  GN+KDVSPNELYSQSAIYIAAAHALKPNYS
Subjt:  NSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GNIKDVSPNELYSQSAIYIAAAHALKPNYS

Query:  VYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQRLILGGEQLQSSPNFLGRSGST-NGDRTIKVEIPNIVSVSACADLTLPP
        VYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQ+L +GGEQ+Q+SP+ LGRSGST NGDRTIKVEIP+IVSVSACADLTLPP
Subjt:  VYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQRLILGGEQLQSSPNFLGRSGST-NGDRTIKVEIPNIVSVSACADLTLPP

Query:  GAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIITG
        GAGLCIDTIHGP+FLVADSWD LDGWLDAIRLVYTIYARGKN+VLAGIITG
Subjt:  GAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIITG

XP_008448563.1 PREDICTED: uncharacterized protein LOC103490705 isoform X1 [Cucumis melo]4.3e-27390.2Show/hide
Query:  MSPTPEEPNNLQNGIETQPHISSESNPTDEEPRSESSQLIADAIPKPELQQERDSESL-NEEPDSEPESRRKQFAESIQLQVVTDVSDPGFEEPKETSIP
        MSPTPEEPNNLQNGIE QPHISSES+    EPRSE  +  AD+IP  ELQQER+SES+ N   DSEPES RKQ +ESI L VVT V+DP  EE KETS P
Subjt:  MSPTPEEPNNLQNGIETQPHISSESNPTDEEPRSESSQLIADAIPKPELQQERDSESL-NEEPDSEPESRRKQFAESIQLQVVTDVSDPGFEEPKETSIP

Query:  SNG-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYA
         NG TEN QPALRKDEGSRTFTMRELLNGLKGEDG+D +NESEGERPEGNSG+SLNQDSPHQPYSEQSRAAMELINS+TGVDEEGRSRQRILTFAARRYA
Subjt:  SNG-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYA

Query:  SAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNW
        SAIERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNW
Subjt:  SAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNW

Query:  NSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GNIKDVSPNELYSQSAIYIAAAHALKPNYS
        NSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG  GNIKDVSPNELYSQSAIYIAAAHALKPNYS
Subjt:  NSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GNIKDVSPNELYSQSAIYIAAAHALKPNYS

Query:  VYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQRLILGGEQLQSSPNFLGRSGST-NGDRTIKVEIPNIVSVSACADLTLPP
        VYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQ+L +GGEQ+Q+SP+ LGRSGST NGDRTIKVEIP+IVSVSACADLTLPP
Subjt:  VYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQRLILGGEQLQSSPNFLGRSGST-NGDRTIKVEIPNIVSVSACADLTLPP

Query:  GAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIITG
        GAGLCIDTIHGP+FLVADSWDALDGWLDAIRLVYTIYARGKN+VLAGIITG
Subjt:  GAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIITG

XP_022965252.1 protein HLB1-like isoform X2 [Cucurbita maxima]2.5e-26888.07Show/hide
Query:  MSPTPEEPNNLQNGIETQPHISSESNPTDEEPRSESSQLIADAIPKPELQQERDSESLNEEPDSEPESR----RKQFAESIQLQVVTDVSDPGFEEPKET
        MSP PEEPNNLQNGIE +PHIS ESN   E      S   AD IP  ELQQER+SES+N   DSEP+S     RKQ +ESI+LQVVTDV+DP FEEPK T
Subjt:  MSPTPEEPNNLQNGIETQPHISSESNPTDEEPRSESSQLIADAIPKPELQQERDSESLNEEPDSEPESR----RKQFAESIQLQVVTDVSDPGFEEPKET

Query:  SIPSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARR
        SI SNG ENSQPALRKDEGSRTFTMRELLNGLK EDGNDS+NESEGE+PE NSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARR
Subjt:  SIPSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARR

Query:  YASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQL
        YASAIERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQAT+NYEKAVQL
Subjt:  YASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQL

Query:  NWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GNIKDVSPNELYSQSAIYIAAAHALKPN
        NWNSPQALNNWGLALQELSAIVPAREK TIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG  G +KDVSPNELYSQSAIYIAAAHALKP+
Subjt:  NWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GNIKDVSPNELYSQSAIYIAAAHALKPN

Query:  YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQRLILGGEQLQSSPNFLGRSGST-NGDRTIKVEIPNIVSVSACADLTL
        YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDVLQ+L +GGEQ+Q+SP  LGRSGST NGDRT+KVEIP+IVSVSACADLTL
Subjt:  YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQRLILGGEQLQSSPNFLGRSGST-NGDRTIKVEIPNIVSVSACADLTL

Query:  PPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIITG
        PPGAGLCIDTIHG +FLVADSWDALDGWLDAIRLVYTIYARGKN+VLAGII G
Subjt:  PPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIITG

XP_023552571.1 protein HLB1-like [Cucurbita pepo subsp. pepo]2.3e-26685.32Show/hide
Query:  MSPTPEEPNNLQNGIETQPHISSESN----------------PTDE---EPRSESSQLIAD-------AIPKPELQQERDSESLNEEPDSEPESR----R
        MSPTPEEPNNLQNGIE + HIS ESN                PT E   E +SES   +A         IP  ELQQER+SES N   DSEP+S     R
Subjt:  MSPTPEEPNNLQNGIETQPHISSESN----------------PTDE---EPRSESSQLIAD-------AIPKPELQQERDSESLNEEPDSEPESR----R

Query:  KQFAESIQLQVVTDVSDPGFEEPKETSIPSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAM
        KQ +ESI+LQVVTDV+DP FEEPK TSI SNGTENSQPALRKDEGSRTFTMRELLNGLK EDGNDS+NESEGE+PE NSGYSLNQDSPHQPYSEQSRAAM
Subjt:  KQFAESIQLQVVTDVSDPGFEEPKETSIPSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAM

Query:  ELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRA
        ELINSVTGVDEEGRSRQRILTFAARRYASAIERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRA
Subjt:  ELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRA

Query:  KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GN
        KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG  G 
Subjt:  KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GN

Query:  IKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQRLILGGEQLQSSPNFLGRSGS
        +KDVSPNELYSQSAIYIAAAHALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDVLQ+L +GGEQ Q+SP  LGRSGS
Subjt:  IKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQRLILGGEQLQSSPNFLGRSGS

Query:  T-NGDRTIKVEIPNIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIITG
        T NGDRT+KVEIP+IVSVSACADLTLPPGAGLCIDTIHG +FLVADSWDALDGWLDAIRLVYTIYARGKN+VLAGII G
Subjt:  T-NGDRTIKVEIPNIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIITG

XP_038876586.1 protein HLB1 [Benincasa hispida]2.5e-27390.93Show/hide
Query:  MSPTPEEPNNLQNGIETQPHISSESNPTDEEPRSESSQLIADAIPKPELQQERDSESLNE-EPDSEPESRRKQFAESIQLQVVTDVSDPGFEEPKETSIP
        MSPTPEEPNNLQNGIE QPHIS ES+ T  EPRSE  +  ADAI   EL QER+SES+N    DSEP SRRKQ  ESI LQV TDV+DP FEE KETSIP
Subjt:  MSPTPEEPNNLQNGIETQPHISSESNPTDEEPRSESSQLIADAIPKPELQQERDSESLNE-EPDSEPESRRKQFAESIQLQVVTDVSDPGFEEPKETSIP

Query:  SNG-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYA
        SNG TENS+PALRKDEGSRTFTMRELLNGLKGEDGNDS+NESEGERPEGN GYSLNQDSPHQPYSEQSRAAMELI+SVTGVDEEGRSRQRILTFAARRYA
Subjt:  SNG-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYA

Query:  SAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNW
        SAIERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNW
Subjt:  SAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNW

Query:  NSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GNIKDVSPNELYSQSAIYIAAAHALKPNYS
        NSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG  GN+KDVSPNELYSQSAIYIAAAHALKPNYS
Subjt:  NSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GNIKDVSPNELYSQSAIYIAAAHALKPNYS

Query:  VYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQRLILGGEQLQSSPNFLGRSGST-NGDRTIKVEIPNIVSVSACADLTLPP
        VYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPH DWKRSQFFLNHDVLQ+L +GGEQ+Q+SP+ LGRSGST NGD TIKVEIP+IVSVSACADLTLPP
Subjt:  VYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQRLILGGEQLQSSPNFLGRSGST-NGDRTIKVEIPNIVSVSACADLTLPP

Query:  GAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIITG
        GAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKN+VLAGIITG
Subjt:  GAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIITG

TrEMBL top hitse value%identityAlignment
A0A0A0L688 Uncharacterized protein4.2e-27490.2Show/hide
Query:  MSPTPEEPNNLQNGIETQPHISSESNPTDEEPRSESSQLIADAIPKPELQQERDSESL-NEEPDSEPESRRKQFAESIQLQVVTDVSDPGFEEPKETSIP
        MSPTPEEPNNLQNGIE QPHISSES+    EPRS   +   D+IP  ELQ+ER+SES+ N  PDSEPES RKQ +ESI L VVT V+DP  EE KETS P
Subjt:  MSPTPEEPNNLQNGIETQPHISSESNPTDEEPRSESSQLIADAIPKPELQQERDSESL-NEEPDSEPESRRKQFAESIQLQVVTDVSDPGFEEPKETSIP

Query:  SNG-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYA
        SNG TEN QPALRKDEGSRTFTMRELLNGLKGEDG+DS+NESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYA
Subjt:  SNG-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYA

Query:  SAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNW
        SAIERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNW
Subjt:  SAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNW

Query:  NSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GNIKDVSPNELYSQSAIYIAAAHALKPNYS
        NSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG  GN+KDVSPNELYSQSAIYIAAAHALKPNYS
Subjt:  NSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GNIKDVSPNELYSQSAIYIAAAHALKPNYS

Query:  VYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQRLILGGEQLQSSPNFLGRSGST-NGDRTIKVEIPNIVSVSACADLTLPP
        VYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQ+L +GGEQ+Q+SP+ LGRSGST NGDRTIKVEIP+IVSVSACADLTLPP
Subjt:  VYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQRLILGGEQLQSSPNFLGRSGST-NGDRTIKVEIPNIVSVSACADLTLPP

Query:  GAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIITG
        GAGLCIDTIHGP+FLVADSWD LDGWLDAIRLVYTIYARGKN+VLAGIITG
Subjt:  GAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIITG

A0A1S3BJC9 uncharacterized protein LOC103490705 isoform X12.1e-27390.2Show/hide
Query:  MSPTPEEPNNLQNGIETQPHISSESNPTDEEPRSESSQLIADAIPKPELQQERDSESL-NEEPDSEPESRRKQFAESIQLQVVTDVSDPGFEEPKETSIP
        MSPTPEEPNNLQNGIE QPHISSES+    EPRSE  +  AD+IP  ELQQER+SES+ N   DSEPES RKQ +ESI L VVT V+DP  EE KETS P
Subjt:  MSPTPEEPNNLQNGIETQPHISSESNPTDEEPRSESSQLIADAIPKPELQQERDSESL-NEEPDSEPESRRKQFAESIQLQVVTDVSDPGFEEPKETSIP

Query:  SNG-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYA
         NG TEN QPALRKDEGSRTFTMRELLNGLKGEDG+D +NESEGERPEGNSG+SLNQDSPHQPYSEQSRAAMELINS+TGVDEEGRSRQRILTFAARRYA
Subjt:  SNG-TENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYA

Query:  SAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNW
        SAIERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEAT LCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNW
Subjt:  SAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNW

Query:  NSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GNIKDVSPNELYSQSAIYIAAAHALKPNYS
        NSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG  GNIKDVSPNELYSQSAIYIAAAHALKPNYS
Subjt:  NSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GNIKDVSPNELYSQSAIYIAAAHALKPNYS

Query:  VYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQRLILGGEQLQSSPNFLGRSGST-NGDRTIKVEIPNIVSVSACADLTLPP
        VYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQ+L +GGEQ+Q+SP+ LGRSGST NGDRTIKVEIP+IVSVSACADLTLPP
Subjt:  VYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQRLILGGEQLQSSPNFLGRSGST-NGDRTIKVEIPNIVSVSACADLTLPP

Query:  GAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIITG
        GAGLCIDTIHGP+FLVADSWDALDGWLDAIRLVYTIYARGKN+VLAGIITG
Subjt:  GAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIITG

A0A6J1EA05 protein HLB1-like5.5e-26685.15Show/hide
Query:  MSPTPEEPNNLQNGIETQPHISSESNPTDE---EPRS----------------ESSQLIAD-------AIPKPELQQERDSESLNEEPDSEPESR----R
        MSPTPEEPNNLQNGIE +PHIS ESN   E   EP S                ES   + D        IP  ELQQER+SES+N   DSE +S     R
Subjt:  MSPTPEEPNNLQNGIETQPHISSESNPTDE---EPRS----------------ESSQLIAD-------AIPKPELQQERDSESLNEEPDSEPESR----R

Query:  KQFAESIQLQVVTDVSDPGFEEPKETSIPSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAM
        KQ +ESIQLQV TDV+DP FEEPK TSI SNGTENSQPALRKDEGSRTFTMRELLNGLK EDGNDS+NESEGE+PE NSGYSLNQDSPHQPYSEQSRAAM
Subjt:  KQFAESIQLQVVTDVSDPGFEEPKETSIPSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAM

Query:  ELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRA
        ELINSVTGVDEEGRSRQRILTFAARRYASAIERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRA
Subjt:  ELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRA

Query:  KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GN
        KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG  G 
Subjt:  KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GN

Query:  IKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQRLILGGEQLQSSPNFLGRSGS
        +KDVSPNELYSQSAIYIAAAHALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDVLQ+L +GGEQ Q+SP  LGRSGS
Subjt:  IKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQRLILGGEQLQSSPNFLGRSGS

Query:  T-NGDRTIKVEIPNIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIITG
        T NGDRT+KVEIP+IVSVSACADLTLPPGAGLCIDTIHG +FLVADSWDALDGWLDAIRLVYTIYARGKN+VLAGII G
Subjt:  T-NGDRTIKVEIPNIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIITG

A0A6J1HJU5 protein HLB1-like isoform X21.2e-26888.07Show/hide
Query:  MSPTPEEPNNLQNGIETQPHISSESNPTDEEPRSESSQLIADAIPKPELQQERDSESLNEEPDSEPESR----RKQFAESIQLQVVTDVSDPGFEEPKET
        MSP PEEPNNLQNGIE +PHIS ESN   E      S   AD IP  ELQQER+SES+N   DSEP+S     RKQ +ESI+LQVVTDV+DP FEEPK T
Subjt:  MSPTPEEPNNLQNGIETQPHISSESNPTDEEPRSESSQLIADAIPKPELQQERDSESLNEEPDSEPESR----RKQFAESIQLQVVTDVSDPGFEEPKET

Query:  SIPSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARR
        SI SNG ENSQPALRKDEGSRTFTMRELLNGLK EDGNDS+NESEGE+PE NSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARR
Subjt:  SIPSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARR

Query:  YASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQL
        YASAIERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQAT+NYEKAVQL
Subjt:  YASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQL

Query:  NWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GNIKDVSPNELYSQSAIYIAAAHALKPN
        NWNSPQALNNWGLALQELSAIVPAREK TIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG  G +KDVSPNELYSQSAIYIAAAHALKP+
Subjt:  NWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GNIKDVSPNELYSQSAIYIAAAHALKPN

Query:  YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQRLILGGEQLQSSPNFLGRSGST-NGDRTIKVEIPNIVSVSACADLTL
        YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDVLQ+L +GGEQ+Q+SP  LGRSGST NGDRT+KVEIP+IVSVSACADLTL
Subjt:  YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQRLILGGEQLQSSPNFLGRSGST-NGDRTIKVEIPNIVSVSACADLTL

Query:  PPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIITG
        PPGAGLCIDTIHG +FLVADSWDALDGWLDAIRLVYTIYARGKN+VLAGII G
Subjt:  PPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIITG

A0A6J1HL68 protein HLB1-like isoform X12.5e-26684.97Show/hide
Query:  MSPTPEEPNNLQNGIETQPHISSESN----------------PTDE---EPRSESSQLIAD-------AIPKPELQQERDSESLNEEPDSEPESR----R
        MSP PEEPNNLQNGIE +PHIS ESN                PT E   E  SES   +AD        IP  ELQQER+SES+N   DSEP+S     R
Subjt:  MSPTPEEPNNLQNGIETQPHISSESN----------------PTDE---EPRSESSQLIAD-------AIPKPELQQERDSESLNEEPDSEPESR----R

Query:  KQFAESIQLQVVTDVSDPGFEEPKETSIPSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAM
        KQ +ESI+LQVVTDV+DP FEEPK TSI SNG ENSQPALRKDEGSRTFTMRELLNGLK EDGNDS+NESEGE+PE NSGYSLNQDSPHQPYSEQSRAAM
Subjt:  KQFAESIQLQVVTDVSDPGFEEPKETSIPSNGTENSQPALRKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAM

Query:  ELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRA
        ELINSVTGVDEEGRSRQRILTFAARRYASAIERN QDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRA
Subjt:  ELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRA

Query:  KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GN
        KMRGRTKEAEELWKQAT+NYEKAVQLNWNSPQALNNWGLALQELSAIVPAREK TIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG  G 
Subjt:  KMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--GN

Query:  IKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQRLILGGEQLQSSPNFLGRSGS
        +KDVSPNELYSQSAIYIAAAHALKP+YSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRP APH DWKRSQFFLNHDVLQ+L +GGEQ+Q+SP  LGRSGS
Subjt:  IKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQRLILGGEQLQSSPNFLGRSGS

Query:  T-NGDRTIKVEIPNIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIITG
        T NGDRT+KVEIP+IVSVSACADLTLPPGAGLCIDTIHG +FLVADSWDALDGWLDAIRLVYTIYARGKN+VLAGII G
Subjt:  T-NGDRTIKVEIPNIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIITG

SwissProt top hitse value%identityAlignment
Q9FHY8 Protein HLB14.9e-17961.68Show/hide
Query:  MSPTPEEPNNLQNG-----------------IETQPHISS-----ESNPTDEEPRSESSQLIADAIPKPELQQERDSESLNEEPDSEPESRRKQFAESIQ
        M+ T EEP  LQNG                 ++T+P ++      E++ T EE +SE    + DA P+    + +  E      D++PE  + +      
Subjt:  MSPTPEEPNNLQNG-----------------IETQPHISS-----ESNPTDEEPRSESSQLIADAIPKPELQQERDSESLNEEPDSEPESRRKQFAESIQ

Query:  LQVVT----DVSDPGFEEPKETSIPSNGTENSQPAL-----RKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAA
          VVT    D++D          IP   TE  Q +      + D+G++TFTMRELL+ LK E+G+ + +         +S    +++S  QP   ++  A
Subjt:  LQVVT----DVSDPGFEEPKETSIPSNGTENSQPAL-----RKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAA

Query:  MELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDR
        M+LIN +   DEEGRSRQR+L FAAR+YASAIERN  D+DALYNWAL+LQESADNVSPDS SPSKD LLEEACKKYDEATRLCPTL+DA+YNWAIAISDR
Subjt:  MELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDR

Query:  AKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--G
        AK+RGRTKEAEELW+QA  NYEKAVQLNWNS QALNNWGL LQELS IVPAREK+ +V+TAISKFRAAI+LQFDFHRAIYNLGTVLYGLAEDTLRTG  G
Subjt:  AKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--G

Query:  NIKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQRLILGGEQLQSSPNFLGRSG
        N KD+ P ELYSQSAIYIAAAH+LKP+YSVYSSALRLVRSMLPLP+LKVGYLTAPPVG  LAPHSDWKR++F LNH+ L + +L  E  +   N  G++ 
Subjt:  NIKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQRLILGGEQLQSSPNFLGRSG

Query:  --STNGDR-TIKVEIPNIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIITG
          STN +R T+KV I  IVSV+ CADLTLPPGAGLCIDTIHGPVFLVADSW++LDGWLDAIRLVYTIYARGK+DVLAGIITG
Subjt:  --STNGDR-TIKVEIPNIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIITG

Arabidopsis top hitse value%identityAlignment
AT5G41950.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.5e-18061.68Show/hide
Query:  MSPTPEEPNNLQNG-----------------IETQPHISS-----ESNPTDEEPRSESSQLIADAIPKPELQQERDSESLNEEPDSEPESRRKQFAESIQ
        M+ T EEP  LQNG                 ++T+P ++      E++ T EE +SE    + DA P+    + +  E      D++PE  + +      
Subjt:  MSPTPEEPNNLQNG-----------------IETQPHISS-----ESNPTDEEPRSESSQLIADAIPKPELQQERDSESLNEEPDSEPESRRKQFAESIQ

Query:  LQVVT----DVSDPGFEEPKETSIPSNGTENSQPAL-----RKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAA
          VVT    D++D          IP   TE  Q +      + D+G++TFTMRELL+ LK E+G+ + +         +S    +++S  QP   ++  A
Subjt:  LQVVT----DVSDPGFEEPKETSIPSNGTENSQPAL-----RKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAA

Query:  MELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDR
        M+LIN +   DEEGRSRQR+L FAAR+YASAIERN  D+DALYNWAL+LQESADNVSPDS SPSKD LLEEACKKYDEATRLCPTL+DA+YNWAIAISDR
Subjt:  MELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQESADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDR

Query:  AKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--G
        AK+RGRTKEAEELW+QA  NYEKAVQLNWNS QALNNWGL LQELS IVPAREK+ +V+TAISKFRAAI+LQFDFHRAIYNLGTVLYGLAEDTLRTG  G
Subjt:  AKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTAISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTG--G

Query:  NIKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQRLILGGEQLQSSPNFLGRSG
        N KD+ P ELYSQSAIYIAAAH+LKP+YSVYSSALRLVRSMLPLP+LKVGYLTAPPVG  LAPHSDWKR++F LNH+ L + +L  E  +   N  G++ 
Subjt:  NIKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFFLNHDVLQRLILGGEQLQSSPNFLGRSG

Query:  --STNGDR-TIKVEIPNIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIITG
          STN +R T+KV I  IVSV+ CADLTLPPGAGLCIDTIHGPVFLVADSW++LDGWLDAIRLVYTIYARGK+DVLAGIITG
Subjt:  --STNGDR-TIKVEIPNIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIITG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGCCTACTCCCGAGGAACCTAATAATCTGCAGAACGGAATCGAAACCCAACCACACATTTCTTCAGAATCAAACCCAACTGATGAAGAACCCAGATCAGAGTCGTC
ACAACTCATAGCAGATGCAATTCCCAAACCTGAATTGCAACAGGAACGCGATTCAGAATCACTCAATGAAGAGCCAGATTCGGAGCCGGAGTCTCGAAGGAAACAGTTCG
CCGAGTCAATCCAATTACAGGTAGTGACGGATGTTTCAGATCCGGGATTTGAAGAGCCCAAAGAAACCTCGATCCCATCCAACGGCACTGAGAACTCGCAACCTGCGCTG
CGTAAGGACGAAGGAAGCCGGACGTTTACAATGAGGGAGTTGTTGAATGGATTGAAAGGTGAAGATGGTAACGACAGCGTTAACGAATCTGAAGGCGAGAGGCCCGAGGG
GAACTCCGGTTACAGTCTTAATCAAGATAGCCCACATCAGCCTTATTCTGAACAGAGCAGAGCTGCCATGGAGTTGATCAACAGTGTTACAGGTGTTGATGAAGAGGGTC
GTTCTCGCCAACGGATTCTCACATTTGCTGCTAGGAGGTATGCTAGTGCAATTGAGAGAAATGCTCAAGACTATGATGCTCTATACAATTGGGCTTTGGTGCTCCAGGAG
AGTGCAGACAATGTTAGTCCAGATTCCACTTCACCTTCTAAAGACGCATTGCTTGAGGAGGCTTGTAAAAAGTATGATGAGGCTACCCGTCTTTGTCCAACACTTCACGA
TGCCTTCTACAATTGGGCTATTGCAATCTCTGATCGGGCCAAAATGCGTGGTCGTACAAAGGAGGCCGAAGAACTGTGGAAGCAGGCTACCAAAAATTACGAAAAGGCTG
TCCAGCTCAACTGGAATAGTCCCCAGGCGCTAAATAATTGGGGACTTGCTCTACAGGAACTCAGTGCGATTGTGCCAGCACGAGAAAAGCAAACAATTGTAAAAACAGCT
ATCAGTAAGTTTCGTGCAGCTATACAGTTGCAATTTGATTTTCATCGAGCAATCTACAACCTTGGTACTGTTTTGTATGGATTAGCTGAGGACACATTACGGACTGGTGG
AAATATTAAGGATGTTTCCCCTAATGAGTTGTACAGCCAATCTGCAATTTATATTGCAGCTGCTCATGCTCTAAAACCAAATTACTCTGTTTACAGCAGTGCCTTGCGGT
TGGTTCGTTCAATGCTGCCGTTACCCTATCTAAAAGTTGGATACCTGACTGCACCTCCTGTGGGGAGACCACTTGCTCCTCACAGTGATTGGAAACGTTCACAATTTTTC
CTGAATCATGATGTATTGCAACGGCTTATCCTAGGGGGTGAACAATTACAATCATCCCCTAATTTTTTAGGAAGATCTGGAAGTACCAATGGCGACAGGACAATCAAAGT
TGAAATTCCAAATATTGTCTCTGTATCAGCATGTGCAGATCTAACTTTACCACCTGGTGCTGGACTCTGCATTGACACAATCCATGGACCAGTTTTCTTGGTTGCTGACT
CATGGGACGCGCTCGATGGATGGCTCGACGCAATAAGATTAGTTTACACAATCTATGCTCGAGGCAAGAACGACGTTTTAGCTGGCATCATAACGGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGCCTACTCCCGAGGAACCTAATAATCTGCAGAACGGAATCGAAACCCAACCACACATTTCTTCAGAATCAAACCCAACTGATGAAGAACCCAGATCAGAGTCGTC
ACAACTCATAGCAGATGCAATTCCCAAACCTGAATTGCAACAGGAACGCGATTCAGAATCACTCAATGAAGAGCCAGATTCGGAGCCGGAGTCTCGAAGGAAACAGTTCG
CCGAGTCAATCCAATTACAGGTAGTGACGGATGTTTCAGATCCGGGATTTGAAGAGCCCAAAGAAACCTCGATCCCATCCAACGGCACTGAGAACTCGCAACCTGCGCTG
CGTAAGGACGAAGGAAGCCGGACGTTTACAATGAGGGAGTTGTTGAATGGATTGAAAGGTGAAGATGGTAACGACAGCGTTAACGAATCTGAAGGCGAGAGGCCCGAGGG
GAACTCCGGTTACAGTCTTAATCAAGATAGCCCACATCAGCCTTATTCTGAACAGAGCAGAGCTGCCATGGAGTTGATCAACAGTGTTACAGGTGTTGATGAAGAGGGTC
GTTCTCGCCAACGGATTCTCACATTTGCTGCTAGGAGGTATGCTAGTGCAATTGAGAGAAATGCTCAAGACTATGATGCTCTATACAATTGGGCTTTGGTGCTCCAGGAG
AGTGCAGACAATGTTAGTCCAGATTCCACTTCACCTTCTAAAGACGCATTGCTTGAGGAGGCTTGTAAAAAGTATGATGAGGCTACCCGTCTTTGTCCAACACTTCACGA
TGCCTTCTACAATTGGGCTATTGCAATCTCTGATCGGGCCAAAATGCGTGGTCGTACAAAGGAGGCCGAAGAACTGTGGAAGCAGGCTACCAAAAATTACGAAAAGGCTG
TCCAGCTCAACTGGAATAGTCCCCAGGCGCTAAATAATTGGGGACTTGCTCTACAGGAACTCAGTGCGATTGTGCCAGCACGAGAAAAGCAAACAATTGTAAAAACAGCT
ATCAGTAAGTTTCGTGCAGCTATACAGTTGCAATTTGATTTTCATCGAGCAATCTACAACCTTGGTACTGTTTTGTATGGATTAGCTGAGGACACATTACGGACTGGTGG
AAATATTAAGGATGTTTCCCCTAATGAGTTGTACAGCCAATCTGCAATTTATATTGCAGCTGCTCATGCTCTAAAACCAAATTACTCTGTTTACAGCAGTGCCTTGCGGT
TGGTTCGTTCAATGCTGCCGTTACCCTATCTAAAAGTTGGATACCTGACTGCACCTCCTGTGGGGAGACCACTTGCTCCTCACAGTGATTGGAAACGTTCACAATTTTTC
CTGAATCATGATGTATTGCAACGGCTTATCCTAGGGGGTGAACAATTACAATCATCCCCTAATTTTTTAGGAAGATCTGGAAGTACCAATGGCGACAGGACAATCAAAGT
TGAAATTCCAAATATTGTCTCTGTATCAGCATGTGCAGATCTAACTTTACCACCTGGTGCTGGACTCTGCATTGACACAATCCATGGACCAGTTTTCTTGGTTGCTGACT
CATGGGACGCGCTCGATGGATGGCTCGACGCAATAAGATTAGTTTACACAATCTATGCTCGAGGCAAGAACGACGTTTTAGCTGGCATCATAACGGGCTGA
Protein sequenceShow/hide protein sequence
MSPTPEEPNNLQNGIETQPHISSESNPTDEEPRSESSQLIADAIPKPELQQERDSESLNEEPDSEPESRRKQFAESIQLQVVTDVSDPGFEEPKETSIPSNGTENSQPAL
RKDEGSRTFTMRELLNGLKGEDGNDSVNESEGERPEGNSGYSLNQDSPHQPYSEQSRAAMELINSVTGVDEEGRSRQRILTFAARRYASAIERNAQDYDALYNWALVLQE
SADNVSPDSTSPSKDALLEEACKKYDEATRLCPTLHDAFYNWAIAISDRAKMRGRTKEAEELWKQATKNYEKAVQLNWNSPQALNNWGLALQELSAIVPAREKQTIVKTA
ISKFRAAIQLQFDFHRAIYNLGTVLYGLAEDTLRTGGNIKDVSPNELYSQSAIYIAAAHALKPNYSVYSSALRLVRSMLPLPYLKVGYLTAPPVGRPLAPHSDWKRSQFF
LNHDVLQRLILGGEQLQSSPNFLGRSGSTNGDRTIKVEIPNIVSVSACADLTLPPGAGLCIDTIHGPVFLVADSWDALDGWLDAIRLVYTIYARGKNDVLAGIITG