; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025940 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025940
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionLate embryogenesis abundant protein-related / LEA protein-related protein
Genome locationchr10:24719578..24721393
RNA-Seq ExpressionLag0025940
SyntenyLag0025940
Gene Ontology termsGO:0001505 - regulation of neurotransmitter levels (biological process)
GO:0007186 - G protein-coupled receptor signaling pathway (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004969 - histamine receptor activity (molecular function)
GO:0016301 - kinase activity (molecular function)
InterPro domainsIPR003980 - Histamine H3 receptor
IPR009646 - Root cap


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585982.1 hypothetical protein SDJN03_18715, partial [Cucurbita argyrosperma subsp. sororia]7.2e-22184.7Show/hide
Query:  MGKITVVLFLFLFFLSAAVEAAPKPKKVKCKDKKKYPLCYKSEFYCPANCPRTCVVDCSSCQPVCT----PPPSPPPPPPKPRKLKS-PPPPYIYSSPPP
        M KI  +LFLF  FLSAAVEA PKPKKVKCKD K +P CYKSE YCPA+C RTCVVDCSSC+PVCT    PPPSPPPPPPKPRKLKS PPPPY+YSSPPP
Subjt:  MGKITVVLFLFLFFLSAAVEAAPKPKKVKCKDKKKYPLCYKSEFYCPANCPRTCVVDCSSCQPVCT----PPPSPPPPPPKPRKLKS-PPPPYIYSSPPP

Query:  PPPRIYSSPPPPP-YIYSSPPPPPPYIYSSP---------PPPPSTTEPSPPLPPTPTPPTSTPPSLSPPPSSEASGQKRVKCKNRGFPHCYGMELTCPS
        PPP IYSSPPPPP ++YSS PPPPPYIYSSP         PPPP+TTEP PP+PPTPTPPT    SLSPPPSSEASGQK+V+CKNR FPHCYGMELTCP+
Subjt:  PPPRIYSSPPPPP-YIYSSPPPPPPYIYSSP---------PPPPSTTEPSPPLPPTPTPPTSTPPSLSPPPSSEASGQKRVKCKNRGFPHCYGMELTCPS

Query:  DCPGHCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSHRLFIGARKTGTW
        DCPG CEVDCVTCS VCNCNRPGAVCQDPRFIGGDGITFYFHGKKD+DFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSH+LFIGARKT TW
Subjt:  DCPGHCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSHRLFIGARKTGTW

Query:  DDAIDRLSLSFNDETILLPDHEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAMAVPITEKESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNGV
        DDA DRLSL FN++TI+L + EGATWSNST+YEGITITRTRNTNAVEI VPGNFKIKA+ VPITEKESRIHKYG+TQEDCFAHLDLSFKFYALSG+VNGV
Subjt:  DDAIDRLSLSFNDETILLPDHEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAMAVPITEKESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNGV

Query:  LGQTYASNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNGQLEGKDTSLEAAAYANMSCGSDMGGEGVVCKR
        LGQTY SNYVSRAKMGVAMPVLGGDKEFASSG F TDCAVARFNGQLEGKD+SLE  AY NMSCGSDM GEGVVCKR
Subjt:  LGQTYASNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNGQLEGKDTSLEAAAYANMSCGSDMGGEGVVCKR

KAG7020779.1 hypothetical protein SDJN02_17467, partial [Cucurbita argyrosperma subsp. argyrosperma]3.6e-22084.49Show/hide
Query:  MGKITVVLFLFLFFLSAAVEAAPKPKKVKCKDKKKYPLCYKSEFYCPANCPRTCVVDCSSCQPVCT----PPPSPPPPPPKPRKLKS-PPPPYIYSSPPP
        M KI  +LFLF  FLSAAVEA PKPKKVKCKD K +P CYKSE YCPA+C RTCVVDCSSC+PVCT    PPPSPPPPPPKPRKLKS PPPPY+YSSPPP
Subjt:  MGKITVVLFLFLFFLSAAVEAAPKPKKVKCKDKKKYPLCYKSEFYCPANCPRTCVVDCSSCQPVCT----PPPSPPPPPPKPRKLKS-PPPPYIYSSPPP

Query:  PPPRIYSSPPPPP-YIYSSPPPPPPYIYSSP---------PPPPSTTEPSPPLPPTPTPPTSTPPSLSPPPSSEASGQKRVKCKNRGFPHCYGMELTCPS
        PPP IYSSPPPPP ++Y S PPPPPYIYSSP         PPPP+TTEP PP+PPTPTPPT    SLSPPPSSEASGQK+V+CKNR FPHCYGMELTCP+
Subjt:  PPPRIYSSPPPPP-YIYSSPPPPPPYIYSSP---------PPPPSTTEPSPPLPPTPTPPTSTPPSLSPPPSSEASGQKRVKCKNRGFPHCYGMELTCPS

Query:  DCPGHCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSHRLFIGARKTGTW
        DCPG CEVDCVTCS VCNCNRPGAVCQDPRFIGGDGITFYFHGKKD+DFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSH+LFIGARKT TW
Subjt:  DCPGHCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSHRLFIGARKTGTW

Query:  DDAIDRLSLSFNDETILLPDHEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAMAVPITEKESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNGV
        DDA DRLSL FN++TI+L + EGATWSNST+YEGITITRTRNTNAVEI VPGNFKIKA+ VPITEKESRIHKYG+TQEDCFAHLDLSFKFYALSG+VNGV
Subjt:  DDAIDRLSLSFNDETILLPDHEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAMAVPITEKESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNGV

Query:  LGQTYASNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNGQLEGKDTSLEAAAYANMSCGSDMGGEGVVCKR
        LGQTY SNYVSRAKMGVAMPVLGGDKEFASSG F TDCAVARFNGQLEGKD+SLE  AY NMSCGSDM GEGVVCKR
Subjt:  LGQTYASNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNGQLEGKDTSLEAAAYANMSCGSDMGGEGVVCKR

XP_022937516.1 uncharacterized protein LOC111443907 [Cucurbita moschata]7.2e-22184.7Show/hide
Query:  MGKITVVLFLFLFFLSAAVEAAPKPKKVKCKDKKKYPLCYKSEFYCPANCPRTCVVDCSSCQPVCT----PPPSPPPPPPKPRKLKS-PPPPYIYSSPPP
        M KI  +LFLF  FLSAAVEA PKPKKVKCKD K +P CYKSE YCPA+C RTCVVDCSSC+PVCT    PPPSPPPPPPKPRKLKS PPPPY+YSSPPP
Subjt:  MGKITVVLFLFLFFLSAAVEAAPKPKKVKCKDKKKYPLCYKSEFYCPANCPRTCVVDCSSCQPVCT----PPPSPPPPPPKPRKLKS-PPPPYIYSSPPP

Query:  PPPRIYSSPPPPP-YIYSSPPPPPPYIYSSP---------PPPPSTTEPSPPLPPTPTPPTSTPPSLSPPPSSEASGQKRVKCKNRGFPHCYGMELTCPS
        PPP IYSSPPPPP ++YSS PPPPPYIYSSP         PPPP+TTEP PP+PPTPTPPT    SLSPPPSSEASGQK+V+CKNR FPHCYGMELTCP+
Subjt:  PPPRIYSSPPPPP-YIYSSPPPPPPYIYSSP---------PPPPSTTEPSPPLPPTPTPPTSTPPSLSPPPSSEASGQKRVKCKNRGFPHCYGMELTCPS

Query:  DCPGHCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSHRLFIGARKTGTW
        DCPG CEVDCVTCS VCNCNRPGAVCQDPRFIGGDGITFYFHGKKD+DFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSH+LFIGARKT TW
Subjt:  DCPGHCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSHRLFIGARKTGTW

Query:  DDAIDRLSLSFNDETILLPDHEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAMAVPITEKESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNGV
        DDA DRLSL FN++TI+L + EGATWSNST+YEGITITRTRNTNAVEI VPGNFKIKA+ VPITEKESRIHKYG+TQEDCFAHLDLSFKFYALSG+VNGV
Subjt:  DDAIDRLSLSFNDETILLPDHEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAMAVPITEKESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNGV

Query:  LGQTYASNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNGQLEGKDTSLEAAAYANMSCGSDMGGEGVVCKR
        LGQTY SNYVSRAKMGVAMPVLGGDKEFASSG F TDCAVARFNGQLEGKD+SLE  AY NMSCGSDM GEGVVCKR
Subjt:  LGQTYASNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNGQLEGKDTSLEAAAYANMSCGSDMGGEGVVCKR

XP_022937854.1 uncharacterized protein LOC111444116 [Cucurbita moschata]7.2e-22184.7Show/hide
Query:  MGKITVVLFLFLFFLSAAVEAAPKPKKVKCKDKKKYPLCYKSEFYCPANCPRTCVVDCSSCQPVCT----PPPSPPPPPPKPRKLKS-PPPPYIYSSPPP
        M KI  +LFLF  FLSAAVEA PKPKKVKCKD K +P CYKSE YCPA+C RTCVVDCSSC+PVCT    PPPSPPPPPPKPRKLKS PPPPY+YSSPPP
Subjt:  MGKITVVLFLFLFFLSAAVEAAPKPKKVKCKDKKKYPLCYKSEFYCPANCPRTCVVDCSSCQPVCT----PPPSPPPPPPKPRKLKS-PPPPYIYSSPPP

Query:  PPPRIYSSPPPPP-YIYSSPPPPPPYIYSSP---------PPPPSTTEPSPPLPPTPTPPTSTPPSLSPPPSSEASGQKRVKCKNRGFPHCYGMELTCPS
        PPP IYSSPPPPP ++Y S PPPPPYIYSSP         PPPP+TTEP PP+PPTPTPPT    SLSPPPSSEASGQK+V+CKNR FPHCYGMELTCP+
Subjt:  PPPRIYSSPPPPP-YIYSSPPPPPPYIYSSP---------PPPPSTTEPSPPLPPTPTPPTSTPPSLSPPPSSEASGQKRVKCKNRGFPHCYGMELTCPS

Query:  DCPGHCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSHRLFIGARKTGTW
        DCPG CEVDCVTCS VCNCNRPGAVCQDPRFIGGDGITFYFHGKKD+DFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSH+LFIGARKT TW
Subjt:  DCPGHCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSHRLFIGARKTGTW

Query:  DDAIDRLSLSFNDETILLPDHEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAMAVPITEKESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNGV
        DDA DRLSLSFN++TI+L + EGATWSNST+YEGITITRTRNTNAVEI VPGNFKIKA+ VPITEKESRIHKYG+TQEDCFAHLDLSFKFYALSG+VNGV
Subjt:  DDAIDRLSLSFNDETILLPDHEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAMAVPITEKESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNGV

Query:  LGQTYASNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNGQLEGKDTSLEAAAYANMSCGSDMGGEGVVCKR
        LGQTY SNYVSRAKMGVAMPVLGGDKEFASSG F TDCAVARFNGQLEGKD+SLE  AY NMSCGSDM GEGVVCKR
Subjt:  LGQTYASNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNGQLEGKDTSLEAAAYANMSCGSDMGGEGVVCKR

XP_022969544.1 uncharacterized protein LOC111468530 [Cucurbita maxima]9.1e-22484.73Show/hide
Query:  MGKITVVLFLFLFFLSAAVEAAPKPKKVKCKDKKKYPLCYKSEFYCPANCPRTCVVDCSSCQPVCT----PPPSPPPPPPKPRKLKS------------P
        M KI  +LFLF  FLSAAVEA PKPKKVKCKD KK+P CYKSE YCPA+C RTCVVDCSSC+PVCT    PPPSPPPPPPKPRKLKS            P
Subjt:  MGKITVVLFLFLFFLSAAVEAAPKPKKVKCKDKKKYPLCYKSEFYCPANCPRTCVVDCSSCQPVCT----PPPSPPPPPPKPRKLKS------------P

Query:  PPPYIYSSPPPPPPRIYSSPPPPPYIYSSPPPPPPYIYSSPPPPPSTTEPSPPLPPTPTPPTSTPPSLSPPPSSEASGQKRVKCKNRGFPHCYGMELTCP
        PPPYIYSSPPPPP  +YSSPPPPPYIYSS PPPPPY+YSSPPPPP+TTEP PP+PPTP PPTS     SPPPSSEASGQK+V+CKNR FPHCYGMELTCP
Subjt:  PPPYIYSSPPPPPPRIYSSPPPPPYIYSSPPPPPPYIYSSPPPPPSTTEPSPPLPPTPTPPTSTPPSLSPPPSSEASGQKRVKCKNRGFPHCYGMELTCP

Query:  SDCPGHCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSHRLFIGARKTGT
        +DCPG CEVDCVTCS VCNCNRPGAVCQDPRFIGGDGITFYFHGKKD+DFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSH+LFIGARKT T
Subjt:  SDCPGHCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSHRLFIGARKTGT

Query:  WDDAIDRLSLSFNDETILLPDHEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAMAVPITEKESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNG
        WDDA DRLSLSFN++TI+L + EGATWSNST+YEGITITRTRNTNAVEI VPGNFKIKA+ VPITEKESRIHKYG+TQEDCFAHLDLSFKFYALSGEVNG
Subjt:  WDDAIDRLSLSFNDETILLPDHEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAMAVPITEKESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNG

Query:  VLGQTYASNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNGQLEGKDTSLEAAAYANMSCGSDMGGEGVVCKR
        VLGQTY SNYVSRAKMGVAMPVLGGDKEFASSG F TDCAVARFNGQLEGKD+SLE  AY NMSCGSDM GEGVVCKR
Subjt:  VLGQTYASNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNGQLEGKDTSLEAAAYANMSCGSDMGGEGVVCKR

TrEMBL top hitse value%identityAlignment
A0A0A0LSM1 Uncharacterized protein2.1e-21883.33Show/hide
Query:  MGKITVVLFLFLFFLSAAVEAAPKPKKVKCKDKKKYPLCYKSEFYCPANCPRTCVVDCSSCQPVCT----PPPSPPPPPPKPRKLKSPPPPYIYSSPPPP
        M +I + LF    FLSA VE APK KKVKCKD KK+P CYKSE YCPA+C RTCVVDCSSCQPVCT    PPPSPPPPPPKPRKLKSPPPPYIYSSPPPP
Subjt:  MGKITVVLFLFLFFLSAAVEAAPKPKKVKCKDKKKYPLCYKSEFYCPANCPRTCVVDCSSCQPVCT----PPPSPPPPPPKPRKLKSPPPPYIYSSPPPP

Query:  PPRIYSS-PPPPPYIYSSPPPPPPYIYSSPPPPPSTT-EPSPPLPPTPTPPTSTPPSLSPPPSSEASGQKRVKCKNRGFPHCYGMELTCPSDCPGHCEVD
        PPRIYSS PPPPPYIYSS PPPPP+IYSSPPPPP TT EPSPPLPP PTPP+S+PP LSPPPSSEASGQK+V+CKNRG+PHCYGMEL+CPSDCP  CEVD
Subjt:  PPRIYSS-PPPPPYIYSSPPPPPPYIYSSPPPPPSTT-EPSPPLPPTPTPPTSTPPSLSPPPSSEASGQKRVKCKNRGFPHCYGMELTCPSDCPGHCEVD

Query:  CVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSHRLFIGARKTGTWDDAIDRLSL
        CVTCSPVCNCNRPGAVCQDP+FIGGDGITFYFHGK+D+DFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSH+LFI ARKT TWDDA DRL +
Subjt:  CVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSHRLFIGARKTGTWDDAIDRLSL

Query:  SFNDETILLPDHEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAMAVPITEKESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNGVLGQTYASNY
        S +DETI+LP+ EGATWSNSTSYEGI ITR+R TNAVEIEVPGNFKIKA+ VPITEKES IHKYG+TQEDCFAHLDLSFKFYALSG VNGVLGQTY  NY
Subjt:  SFNDETILLPDHEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAMAVPITEKESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNGVLGQTYASNY

Query:  VSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNGQLEGKDTSLEAAAYANMSCGSDMGGEGVVCKR
        VSRAKMGVAMPVLGGDKEFASS +F TDC V RF  +++ K++ +EAAAYANMSCGSDM G+GVVCKR
Subjt:  VSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNGQLEGKDTSLEAAAYANMSCGSDMGGEGVVCKR

A0A1S3CF51 uncharacterized protein LOC1035002224.6e-21381.37Show/hide
Query:  MGKITVVLFLFLFFLSAAVEAAPKPKKVKCKDKKKYPLCYKSEFYCPANCPRTCVVDCSSCQPVCT----PPPSPPPPPPKPRKLKSPPPPYIYSSPPPP
        M +I + LF F  FLSA VE  PK KKVKCKD KK+P CYKS+ YCP +C RTCVVDCSSCQPVCT    PPPSPPPPPPKPRKL+SPPPPYIYSSPPPP
Subjt:  MGKITVVLFLFLFFLSAAVEAAPKPKKVKCKDKKKYPLCYKSEFYCPANCPRTCVVDCSSCQPVCT----PPPSPPPPPPKPRKLKSPPPPYIYSSPPPP

Query:  PPRIYSSPPPPPYIYSSPPPPPPYIYSS-PPPPPSTTEPSPPLPPTPTPPTSTPPSLSPPPSSEASGQKRVKCKNRGFPHCYGMELTCPSDCPGHCEVDC
        PPR          +YSSPPPPPPYIYSS PPPPP+T EPSPPLPPTPTPP S+PP LSPPPSSEASGQK+V+CKNRG+PHCYGMEL+CPSDCP  CEVDC
Subjt:  PPRIYSSPPPPPYIYSSPPPPPPYIYSS-PPPPPSTTEPSPPLPPTPTPPTSTPPSLSPPPSSEASGQKRVKCKNRGFPHCYGMELTCPSDCPGHCEVDC

Query:  VTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSHRLFIGARKTGTWDDAIDRLSLS
        VTCSPVCNCNRPGAVCQDP+FIGGDGITFYFHGKKD+DFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILF SH+LFI ARKT TWDDA DRL +S
Subjt:  VTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSHRLFIGARKTGTWDDAIDRLSLS

Query:  FNDETILLPDHEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAMAVPITEKESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNGVLGQTYASNYV
         +DETILLP+ EGATWSNSTSYEGI I+R+R TNAVEIEVPGNFKIKA+ VPITEKES IHKYG+TQEDCFAHLDLSFKFYALSG V+GVLGQTY +NYV
Subjt:  FNDETILLPDHEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAMAVPITEKESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNGVLGQTYASNYV

Query:  SRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNGQLEGKDTSLEAAAYANMSCGSDMGGEGVVCKR
        SRAKMGVAMPVLGGDKEFASS +F TDC VARF+ +L+GK++S+EAAAYANMSCG+DM G+GVVCKR
Subjt:  SRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNGQLEGKDTSLEAAAYANMSCGSDMGGEGVVCKR

A0A6J1FAJ8 uncharacterized protein LOC1114439073.5e-22184.7Show/hide
Query:  MGKITVVLFLFLFFLSAAVEAAPKPKKVKCKDKKKYPLCYKSEFYCPANCPRTCVVDCSSCQPVCT----PPPSPPPPPPKPRKLKS-PPPPYIYSSPPP
        M KI  +LFLF  FLSAAVEA PKPKKVKCKD K +P CYKSE YCPA+C RTCVVDCSSC+PVCT    PPPSPPPPPPKPRKLKS PPPPY+YSSPPP
Subjt:  MGKITVVLFLFLFFLSAAVEAAPKPKKVKCKDKKKYPLCYKSEFYCPANCPRTCVVDCSSCQPVCT----PPPSPPPPPPKPRKLKS-PPPPYIYSSPPP

Query:  PPPRIYSSPPPPP-YIYSSPPPPPPYIYSSP---------PPPPSTTEPSPPLPPTPTPPTSTPPSLSPPPSSEASGQKRVKCKNRGFPHCYGMELTCPS
        PPP IYSSPPPPP ++YSS PPPPPYIYSSP         PPPP+TTEP PP+PPTPTPPT    SLSPPPSSEASGQK+V+CKNR FPHCYGMELTCP+
Subjt:  PPPRIYSSPPPPP-YIYSSPPPPPPYIYSSP---------PPPPSTTEPSPPLPPTPTPPTSTPPSLSPPPSSEASGQKRVKCKNRGFPHCYGMELTCPS

Query:  DCPGHCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSHRLFIGARKTGTW
        DCPG CEVDCVTCS VCNCNRPGAVCQDPRFIGGDGITFYFHGKKD+DFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSH+LFIGARKT TW
Subjt:  DCPGHCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSHRLFIGARKTGTW

Query:  DDAIDRLSLSFNDETILLPDHEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAMAVPITEKESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNGV
        DDA DRLSL FN++TI+L + EGATWSNST+YEGITITRTRNTNAVEI VPGNFKIKA+ VPITEKESRIHKYG+TQEDCFAHLDLSFKFYALSG+VNGV
Subjt:  DDAIDRLSLSFNDETILLPDHEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAMAVPITEKESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNGV

Query:  LGQTYASNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNGQLEGKDTSLEAAAYANMSCGSDMGGEGVVCKR
        LGQTY SNYVSRAKMGVAMPVLGGDKEFASSG F TDCAVARFNGQLEGKD+SLE  AY NMSCGSDM GEGVVCKR
Subjt:  LGQTYASNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNGQLEGKDTSLEAAAYANMSCGSDMGGEGVVCKR

A0A6J1FCE2 uncharacterized protein LOC1114441163.5e-22184.7Show/hide
Query:  MGKITVVLFLFLFFLSAAVEAAPKPKKVKCKDKKKYPLCYKSEFYCPANCPRTCVVDCSSCQPVCT----PPPSPPPPPPKPRKLKS-PPPPYIYSSPPP
        M KI  +LFLF  FLSAAVEA PKPKKVKCKD K +P CYKSE YCPA+C RTCVVDCSSC+PVCT    PPPSPPPPPPKPRKLKS PPPPY+YSSPPP
Subjt:  MGKITVVLFLFLFFLSAAVEAAPKPKKVKCKDKKKYPLCYKSEFYCPANCPRTCVVDCSSCQPVCT----PPPSPPPPPPKPRKLKS-PPPPYIYSSPPP

Query:  PPPRIYSSPPPPP-YIYSSPPPPPPYIYSSP---------PPPPSTTEPSPPLPPTPTPPTSTPPSLSPPPSSEASGQKRVKCKNRGFPHCYGMELTCPS
        PPP IYSSPPPPP ++Y S PPPPPYIYSSP         PPPP+TTEP PP+PPTPTPPT    SLSPPPSSEASGQK+V+CKNR FPHCYGMELTCP+
Subjt:  PPPRIYSSPPPPP-YIYSSPPPPPPYIYSSP---------PPPPSTTEPSPPLPPTPTPPTSTPPSLSPPPSSEASGQKRVKCKNRGFPHCYGMELTCPS

Query:  DCPGHCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSHRLFIGARKTGTW
        DCPG CEVDCVTCS VCNCNRPGAVCQDPRFIGGDGITFYFHGKKD+DFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSH+LFIGARKT TW
Subjt:  DCPGHCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSHRLFIGARKTGTW

Query:  DDAIDRLSLSFNDETILLPDHEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAMAVPITEKESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNGV
        DDA DRLSLSFN++TI+L + EGATWSNST+YEGITITRTRNTNAVEI VPGNFKIKA+ VPITEKESRIHKYG+TQEDCFAHLDLSFKFYALSG+VNGV
Subjt:  DDAIDRLSLSFNDETILLPDHEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAMAVPITEKESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNGV

Query:  LGQTYASNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNGQLEGKDTSLEAAAYANMSCGSDMGGEGVVCKR
        LGQTY SNYVSRAKMGVAMPVLGGDKEFASSG F TDCAVARFNGQLEGKD+SLE  AY NMSCGSDM GEGVVCKR
Subjt:  LGQTYASNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNGQLEGKDTSLEAAAYANMSCGSDMGGEGVVCKR

A0A6J1I078 uncharacterized protein LOC1114685304.4e-22484.73Show/hide
Query:  MGKITVVLFLFLFFLSAAVEAAPKPKKVKCKDKKKYPLCYKSEFYCPANCPRTCVVDCSSCQPVCT----PPPSPPPPPPKPRKLKS------------P
        M KI  +LFLF  FLSAAVEA PKPKKVKCKD KK+P CYKSE YCPA+C RTCVVDCSSC+PVCT    PPPSPPPPPPKPRKLKS            P
Subjt:  MGKITVVLFLFLFFLSAAVEAAPKPKKVKCKDKKKYPLCYKSEFYCPANCPRTCVVDCSSCQPVCT----PPPSPPPPPPKPRKLKS------------P

Query:  PPPYIYSSPPPPPPRIYSSPPPPPYIYSSPPPPPPYIYSSPPPPPSTTEPSPPLPPTPTPPTSTPPSLSPPPSSEASGQKRVKCKNRGFPHCYGMELTCP
        PPPYIYSSPPPPP  +YSSPPPPPYIYSS PPPPPY+YSSPPPPP+TTEP PP+PPTP PPTS     SPPPSSEASGQK+V+CKNR FPHCYGMELTCP
Subjt:  PPPYIYSSPPPPPPRIYSSPPPPPYIYSSPPPPPPYIYSSPPPPPSTTEPSPPLPPTPTPPTSTPPSLSPPPSSEASGQKRVKCKNRGFPHCYGMELTCP

Query:  SDCPGHCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSHRLFIGARKTGT
        +DCPG CEVDCVTCS VCNCNRPGAVCQDPRFIGGDGITFYFHGKKD+DFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSH+LFIGARKT T
Subjt:  SDCPGHCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSHRLFIGARKTGT

Query:  WDDAIDRLSLSFNDETILLPDHEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAMAVPITEKESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNG
        WDDA DRLSLSFN++TI+L + EGATWSNST+YEGITITRTRNTNAVEI VPGNFKIKA+ VPITEKESRIHKYG+TQEDCFAHLDLSFKFYALSGEVNG
Subjt:  WDDAIDRLSLSFNDETILLPDHEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAMAVPITEKESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNG

Query:  VLGQTYASNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNGQLEGKDTSLEAAAYANMSCGSDMGGEGVVCKR
        VLGQTY SNYVSRAKMGVAMPVLGGDKEFASSG F TDCAVARFNGQLEGKD+SLE  AY NMSCGSDM GEGVVCKR
Subjt:  VLGQTYASNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNGQLEGKDTSLEAAAYANMSCGSDMGGEGVVCKR

SwissProt top hitse value%identityAlignment
O65375 Leucine-rich repeat extensin-like protein 11.7e-1060.78Show/hide
Query:  PPPSPPPPPPKPRKLKSPPPPYIYSSPPPPPPRIYSSPPPPPYIYSSPPPPPPYIYSSPPPP--PSTTEPSPPLPPTPTPPTSTPPSL--------SPPP
        PPP P P PP P    SPPPPY+YSS PPPPP +YSSPPPPPY+YSS  PPPPY+YSSPPPP   S+  P PP PP P P +S PP +        SPPP
Subjt:  PPPSPPPPPPKPRKLKSPPPPYIYSSPPPPPPRIYSSPPPPPYIYSSPPPPPPYIYSSPPPP--PSTTEPSPPLPPTPTPPTSTPPSL--------SPPP

Query:  SS
         S
Subjt:  SS

Q9T0K5 Leucine-rich repeat extensin-like protein 33.0e-0455.77Show/hide
Query:  PVCTPPPSPPPPPPKPRKLKSPPPPYIYSSPPPP----PPRIYSSPPPPPYIYS------SPPPPPPYIYSSPPPPPSTTEPSPPLPPTPTPPTSTPPSL
        PV +PPP PPPPPP P  + SPPPP +YSSPPPP    P  +Y + PPPP  +S      SPPPP PY YSSPPPP S+  P  P PP  +PP    P L
Subjt:  PVCTPPPSPPPPPPKPRKLKSPPPPYIYSSPPPP----PPRIYSSPPPPPYIYS------SPPPPPPYIYSSPPPPPSTTEPSPPLPPTPTPPTSTPPSL

Query:  SPPP
        SPPP
Subjt:  SPPP

Arabidopsis top hitse value%identityAlignment
AT3G19430.1 late embryogenesis abundant protein-related / LEA protein-related2.4e-11346.76Show/hide
Query:  APKPKKVKCKDKKKYPLCYKSEFYCPANCPRTCVVDCSSCQPVC------------------------------------------TPPPSPPPPP----
        A  P    CK  KKY  CY  E  CP  CP +C V+C+SC+P+C                                          TPP SPPPP     
Subjt:  APKPKKVKCKDKKKYPLCYKSEFYCPANCPRTCVVDCSSCQPVC------------------------------------------TPPPSPPPPP----

Query:  -PKPRKLKSPPPPY-----------IYSSPPPPPPRIYS-----SPPPP----------PYIYSSPPPPPPYIYSSPPPPPSTTEPSP----PLPP----
         P P    SPPPP            +   PP P P + S     SPPPP          P + + P P PP   S PPP P+ + PSP    P PP    
Subjt:  -PKPRKLKSPPPPY-----------IYSSPPPPPPRIYS-----SPPPP----------PYIYSSPPPPPPYIYSSPPPPPSTTEPSP----PLPP----

Query:  ------TPTPPT-------------STPPSLS---------PPPS--SEASGQKRVKCKNRGFPHCYGMELTCPSDCPGHCEVDCVTCSPVCNCNRPGAV
              TPTPPT              TPPS+          PPPS   EA+G KRV+CK +  P CYG+E TCP+DCP  C+VDCVTC PVCNC++PG+V
Subjt:  ------TPTPPT-------------STPPSLS---------PPPS--SEASGQKRVKCKNRGFPHCYGMELTCPSDCPGHCEVDCVTCSPVCNCNRPGAV

Query:  CQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSHRLFIGARKTGTWDDAIDRLSLSFNDETILLPDHEGAT
        CQDPRFIGGDG+TFYFHGKKD +FC+++D NLHINAHFIG+R   M RDFTWVQS+ ILF +HRL++GA KT TWDD++DR+++SF+   I LP  +GA 
Subjt:  CQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSHRLFIGARKTGTWDDAIDRLSLSFNDETILLPDHEGAT

Query:  WSNSTS-YEGITITRTR-NTNAVEIEVPGNFKIKAMAVPITEKESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNGVLGQTYASNYVSRAKMGVAMPVLG
        W++S   Y  +++ R   +TN +E+EV G  KI A  VPIT ++SRIH Y V ++DC AHLDL FKF  LS  V+GVLGQTY SNYVSR K+GV MPV+G
Subjt:  WSNSTS-YEGITITRTR-NTNAVEIEVPGNFKIKAMAVPITEKESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNGVLGQTYASNYVSRAKMGVAMPVLG

Query:  GDKEFASSGLFTTDCAVARF--NGQLEGKDTSLEAAAYANMSCGSDMGGEGVVCKR
        GD+EF ++GLF  DC+ ARF  NG      + LE      MSC S +GG+GVVCKR
Subjt:  GDKEFASSGLFTTDCAVARF--NGQLEGKDTSLEAAAYANMSCGSDMGGEGVVCKR

AT4G27400.1 Late embryogenesis abundant (LEA) protein-related8.9e-6037.38Show/hide
Query:  CKNRGFPHCYGMELTCPSDCPGH---------CEVDCV--TCSPVC-----NCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGR
        C     P C    + CP +CP           C VDC    C  VC     NC   G++C DPRFIGGDGI FYFHGK ++ F IV+D +  INA F G 
Subjt:  CKNRGFPHCYGMELTCPSDCPGH---------CEVDCV--TCSPVC-----NCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGR

Query:  RNVDMKRDFTWVQSLGILFDSHRLFIGARKTGTWDDAIDRLSLSFNDETILLPDHEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAMAVPITEK
        R     RDFTW+Q+LG LF+SH+  +   K  TWD  +D L  + + + +++P    +TW +S   + I I R    N+V + +    +I    VP+T++
Subjt:  RNVDMKRDFTWVQSLGILFDSHRLFIGARKTGTWDDAIDRLSLSFNDETILLPDHEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAMAVPITEK

Query:  ESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNGVLGQTYASNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNGQLEGKDTSLE-AAAYANMSCG
        + RIH Y +  +DCFAH ++ FKF  LS +V+G+LG+TY  ++ + AK GV MPV+GG+  F +S L +  C    F+        S++  + YA + C 
Subjt:  ESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNGVLGQTYASNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNGQLEGKDTSLE-AAAYANMSCG

Query:  -SDMGGEGVVCKR
             G G+VC++
Subjt:  -SDMGGEGVVCKR

AT5G54370.1 Late embryogenesis abundant (LEA) protein-related2.0e-6740.13Show/hide
Query:  VKCKNRGFPHCYGMELTCPSDCPGH---------CEVDC--VTCSPVC-----NCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFI
        V C N  +  CY   + CP +CP           C  DC   TC   C     NCNRPG+ C DPRFIGGDGI FYFHGK +++F +V+DS+L IN  FI
Subjt:  VKCKNRGFPHCYGMELTCPSDCPGH---------CEVDC--VTCSPVC-----NCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHINAHFI

Query:  GRRNVDMKRDFTWVQSLGILFDSHRLFIGARKTGTWDDAIDRLSLSFNDETILLPDHEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAMAVPIT
        G R     RDFTW+Q+LG LF+S++  + A KT +WD+ ID L  S++ + + +P+   +TW +    + I I R    N+V + +    +I    VP+T
Subjt:  GRRNVDMKRDFTWVQSLGILFDSHRLFIGARKTGTWDDAIDRLSLSFNDETILLPDHEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAMAVPIT

Query:  EKESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNGVLGQTYASNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNGQLEGKDTSLEAAAYANMSC
        +++ RIH Y V  +DCFAHL++ F+F+ LS +V+G+LG+TY  ++ + AK GVAMPV+GG+  F +S L + DC    F+      D+      YA + C
Subjt:  EKESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNGVLGQTYASNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNGQLEGKDTSLEAAAYANMSC

Query:  -GSDMGGEGVVCKR
              G G+VC++
Subjt:  -GSDMGGEGVVCKR

AT5G60520.1 Late embryogenesis abundant (LEA) protein-related3.1e-6844.25Show/hide
Query:  SGQKRVKCKNRGFPHCYGMELTCPSDCPGH----------CEVDCVT-CSPVC-----NCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHI
        SGQ+RV+C  RG   C    LTCP +CP            C +DC + C   C     NCN  G++C DPRF+GGDG+ FYFHG KD +F IV+D NL I
Subjt:  SGQKRVKCKNRGFPHCYGMELTCPSDCPGH----------CEVDCVT-CSPVC-----NCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFCIVTDSNLHI

Query:  NAHFIGRRNVDMKRDFTWVQSLGILFDSHRLFIGARKTGTWDDAIDRLSLSFNDETILLPDHEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAM
        NAHFIG R     RDFTWVQ+  ++FDSH L I A+K  +WDD++D L + +N E + +P    A W        + + RT   N V + V G  +I   
Subjt:  NAHFIGRRNVDMKRDFTWVQSLGILFDSHRLFIGARKTGTWDDAIDRLSLSFNDETILLPDHEGATWSNSTSYEGITITRTRNTNAVEIEVPGNFKIKAM

Query:  AVPITEKESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNGVLGQTYASNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNGQ
          PI ++E R+HKY + ++D FAHL+  FKF+ LS  V GVLG+TY   YVS  K GV MP++GG+ ++ +  LF+  C V RF G+
Subjt:  AVPITEKESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNGVLGQTYASNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNGQ

AT5G60530.1 late embryogenesis abundant protein-related / LEA protein-related1.4e-6543.14Show/hide
Query:  LSPPPSSEASGQKRVKCKNRGFPHCYGMELTCPSDCPGH----------CEVDCVT-CSPVC-----NCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFC
        LSP P    +GQ++  C+ RG   CY   L CP +CP            C +DC   C   C     NCN  G++C DPRF+GGDG+ FYFHG K  +F 
Subjt:  LSPPPSSEASGQKRVKCKNRGFPHCYGMELTCPSDCPGH----------CEVDCVT-CSPVC-----NCNRPGAVCQDPRFIGGDGITFYFHGKKDQDFC

Query:  IVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSHRLFIGARKTGTWDDAIDRLSLSFNDETILLPDHEGATWSN-STSYEGITITRTRNTNAVEIE
        IV+D+NL INAHFIG R V   RDFTWVQ+L ++F++H+L I A +   WD+  D  ++ ++ E I LP+ E + W   S   + I I RT   N+V + 
Subjt:  IVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSHRLFIGARKTGTWDDAIDRLSLSFNDETILLPDHEGATWSN-STSYEGITITRTRNTNAVEIE

Query:  VPGNFKIKAMAVPITEKESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNGVLGQTYASNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNGQLE
        V    ++     PI ++E+R+H Y + Q+D FAHL+  FKF  LS  V GVLG+TY  +YVS AK GV MPVLGG+ ++ +  LF+  C + RF  Q E
Subjt:  VPGNFKIKAMAVPITEKESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNGVLGQTYASNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNGQLE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAAAATCACGGTGGTTCTCTTCTTGTTCCTTTTCTTCCTCTCAGCCGCTGTTGAGGCAGCTCCAAAGCCCAAGAAAGTTAAATGCAAAGACAAGAAGAAGTATCC
CCTATGTTACAAATCCGAGTTCTATTGCCCTGCTAATTGCCCTCGAACTTGTGTTGTTGATTGTTCATCTTGCCAACCTGTTTGTACTCCGCCACCCTCGCCTCCTCCAC
CACCACCCAAACCACGTAAGCTCAAATCTCCACCACCGCCCTACATTTACTCTTCGCCCCCACCGCCACCCCCACGCATTTACTCTTCCCCACCACCACCTCCCTACATT
TATTCTTCTCCTCCCCCACCACCTCCATACATTTACTCTTCTCCACCACCTCCACCATCTACGACGGAGCCTTCACCTCCACTCCCTCCGACTCCAACTCCTCCGACATC
TACACCACCTTCTCTTTCTCCACCACCGTCGTCCGAGGCGTCGGGGCAGAAGAGAGTTAAGTGCAAGAATAGGGGCTTTCCACATTGTTATGGAATGGAGCTAACTTGTC
CAAGTGATTGTCCTGGCCACTGTGAGGTCGACTGTGTTACTTGCAGCCCCGTTTGCAATTGCAACCGTCCAGGCGCAGTATGCCAAGACCCAAGATTCATCGGAGGAGAT
GGAATCACCTTCTACTTCCATGGCAAAAAAGACCAAGATTTCTGCATCGTCACTGACTCCAACCTCCACATCAATGCCCACTTCATCGGCCGACGAAACGTCGACATGAA
GAGAGACTTCACTTGGGTCCAATCTCTTGGCATCCTCTTCGACTCTCACAGACTCTTCATCGGCGCCCGAAAAACCGGGACATGGGACGATGCCATCGACCGCCTCTCTC
TCTCCTTCAACGACGAAACCATCCTCCTCCCAGACCACGAGGGCGCCACCTGGAGTAATTCGACCTCATACGAGGGGATCACCATAACCAGAACTCGCAACACCAACGCC
GTCGAGATCGAAGTGCCCGGGAACTTCAAGATCAAGGCCATGGCGGTCCCGATAACGGAGAAGGAATCGAGGATCCACAAGTATGGGGTTACACAAGAGGATTGCTTTGC
CCATTTGGACTTGAGCTTCAAGTTCTATGCATTGAGTGGCGAAGTGAATGGGGTTTTGGGGCAGACTTATGCCAGCAACTACGTGAGCAGGGCAAAGATGGGAGTGGCAA
TGCCTGTTTTGGGTGGCGATAAGGAGTTTGCTTCTTCAGGTCTTTTTACTACGGATTGTGCAGTGGCACGTTTCAACGGGCAGTTGGAAGGAAAAGACACTTCTTTGGAG
GCTGCAGCCTATGCCAATATGAGCTGCGGCAGTGACATGGGAGGTGAAGGAGTTGTTTGCAAACGATAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAAAATCACGGTGGTTCTCTTCTTGTTCCTTTTCTTCCTCTCAGCCGCTGTTGAGGCAGCTCCAAAGCCCAAGAAAGTTAAATGCAAAGACAAGAAGAAGTATCC
CCTATGTTACAAATCCGAGTTCTATTGCCCTGCTAATTGCCCTCGAACTTGTGTTGTTGATTGTTCATCTTGCCAACCTGTTTGTACTCCGCCACCCTCGCCTCCTCCAC
CACCACCCAAACCACGTAAGCTCAAATCTCCACCACCGCCCTACATTTACTCTTCGCCCCCACCGCCACCCCCACGCATTTACTCTTCCCCACCACCACCTCCCTACATT
TATTCTTCTCCTCCCCCACCACCTCCATACATTTACTCTTCTCCACCACCTCCACCATCTACGACGGAGCCTTCACCTCCACTCCCTCCGACTCCAACTCCTCCGACATC
TACACCACCTTCTCTTTCTCCACCACCGTCGTCCGAGGCGTCGGGGCAGAAGAGAGTTAAGTGCAAGAATAGGGGCTTTCCACATTGTTATGGAATGGAGCTAACTTGTC
CAAGTGATTGTCCTGGCCACTGTGAGGTCGACTGTGTTACTTGCAGCCCCGTTTGCAATTGCAACCGTCCAGGCGCAGTATGCCAAGACCCAAGATTCATCGGAGGAGAT
GGAATCACCTTCTACTTCCATGGCAAAAAAGACCAAGATTTCTGCATCGTCACTGACTCCAACCTCCACATCAATGCCCACTTCATCGGCCGACGAAACGTCGACATGAA
GAGAGACTTCACTTGGGTCCAATCTCTTGGCATCCTCTTCGACTCTCACAGACTCTTCATCGGCGCCCGAAAAACCGGGACATGGGACGATGCCATCGACCGCCTCTCTC
TCTCCTTCAACGACGAAACCATCCTCCTCCCAGACCACGAGGGCGCCACCTGGAGTAATTCGACCTCATACGAGGGGATCACCATAACCAGAACTCGCAACACCAACGCC
GTCGAGATCGAAGTGCCCGGGAACTTCAAGATCAAGGCCATGGCGGTCCCGATAACGGAGAAGGAATCGAGGATCCACAAGTATGGGGTTACACAAGAGGATTGCTTTGC
CCATTTGGACTTGAGCTTCAAGTTCTATGCATTGAGTGGCGAAGTGAATGGGGTTTTGGGGCAGACTTATGCCAGCAACTACGTGAGCAGGGCAAAGATGGGAGTGGCAA
TGCCTGTTTTGGGTGGCGATAAGGAGTTTGCTTCTTCAGGTCTTTTTACTACGGATTGTGCAGTGGCACGTTTCAACGGGCAGTTGGAAGGAAAAGACACTTCTTTGGAG
GCTGCAGCCTATGCCAATATGAGCTGCGGCAGTGACATGGGAGGTGAAGGAGTTGTTTGCAAACGATAA
Protein sequenceShow/hide protein sequence
MGKITVVLFLFLFFLSAAVEAAPKPKKVKCKDKKKYPLCYKSEFYCPANCPRTCVVDCSSCQPVCTPPPSPPPPPPKPRKLKSPPPPYIYSSPPPPPPRIYSSPPPPPYI
YSSPPPPPPYIYSSPPPPPSTTEPSPPLPPTPTPPTSTPPSLSPPPSSEASGQKRVKCKNRGFPHCYGMELTCPSDCPGHCEVDCVTCSPVCNCNRPGAVCQDPRFIGGD
GITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSHRLFIGARKTGTWDDAIDRLSLSFNDETILLPDHEGATWSNSTSYEGITITRTRNTNA
VEIEVPGNFKIKAMAVPITEKESRIHKYGVTQEDCFAHLDLSFKFYALSGEVNGVLGQTYASNYVSRAKMGVAMPVLGGDKEFASSGLFTTDCAVARFNGQLEGKDTSLE
AAAYANMSCGSDMGGEGVVCKR