; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi03G001070 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi03G001070
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionLate embryogenesis abundant protein-related / LEA protein-related protein
Genome locationchr03:1477349..1489928
RNA-Seq ExpressionLsi03G001070
SyntenyLsi03G001070
Gene Ontology termsGO:0000160 - phosphorelay signal transduction system (biological process)
GO:0001505 - regulation of neurotransmitter levels (biological process)
GO:0007186 - G protein-coupled receptor signaling pathway (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004969 - histamine receptor activity (molecular function)
InterPro domainsIPR009646 - Root cap
IPR036641 - HPT domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139573.2 uncharacterized protein LOC101207232 [Cucumis sativus]1.8e-22486.05Show/hide
Query:  MARIAIFLFFFFLFLSAMVEGAPKAKKVKCKDKKYPQCYKSEHYCPSDCLRTCVVDCASCQPVCTPPPPPPPSPPPPPPKPRKLKSPPPPYIYSS-----
        MARIAIFLFF FLFLSA+VEGAPKAKKVKCKDKK+PQCYKSEHYCP+DCLRTCVVDC+SCQPVCTPPPPPPPSPPPPPPKPRKLKSPPPPYIYSS     
Subjt:  MARIAIFLFFFFLFLSAMVEGAPKAKKVKCKDKKYPQCYKSEHYCPSDCLRTCVVDCASCQPVCTPPPPPPPSPPPPPPKPRKLKSPPPPYIYSS-----

Query:  --------PPPPHIYSSPPPPPYVYSSPPPP---TAEPSPSTP---------TPPTSPPPSSEASGQKKARCKNRGFPHCYGMELSCPSDCPSQCEVDCV
                PPPP+IYSSPPPPP++YSSPPPP   T EPSP  P          PP SPPPSSEASGQKK RCKNRG+PHCYGMELSCPSDCPSQCEVDCV
Subjt:  --------PPPPHIYSSPPPPPYVYSSPPPP---TAEPSPSTP---------TPPTSPPPSSEASGQKKARCKNRGFPHCYGMELSCPSDCPSQCEVDCV

Query:  TCSPVCNCNRPGAVCQDPKFIGGDGITFYFHGKKDKDFCIVTDSNLHINAHFIGRRNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAIDRLYLSL
        TCSPVCNCNRPGAVCQDPKFIGGDGITFYFHGK+DKDFCIVTDSNLHINAHFIGRRNV+MKRDFTWVQSLGILFDSH+LFI ARKT+TWDDA DRLY+SL
Subjt:  TCSPVCNCNRPGAVCQDPKFIGGDGITFYFHGKKDKDFCIVTDSNLHINAHFIGRRNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAIDRLYLSL

Query:  DNERILLPNQEGATWSNSTSYEGITITRSRNTNAVEIEVPGNFKIKAVVVPITEKESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYGNKYVS
        D+E I+LPNQEGATWSNSTSYEGI ITRSR TNAVEIEVPGNFKIKAVVVPITEKES IHKYGITQEDCFAHLDLSFKFYALSG+VNGVLGQTYG  YVS
Subjt:  DNERILLPNQEGATWSNSTSYEGITITRSRNTNAVEIEVPGNFKIKAVVVPITEKESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYGNKYVS

Query:  RAKMGVAMPVLGGDKEFASSSIFATDCKVARFNGELDGKDSSLEAAAYANMSCGSDMDGQGVVCKR
        RAKMGVAMPVLGGDKEFASSSIFATDC+V RF  E+D K+S +EAAAYANMSCGSDMDGQGVVCKR
Subjt:  RAKMGVAMPVLGGDKEFASSSIFATDCKVARFNGELDGKDSSLEAAAYANMSCGSDMDGQGVVCKR

XP_008461680.1 PREDICTED: uncharacterized protein LOC103500222 [Cucumis melo]1.1e-22488.35Show/hide
Query:  MARIAIFLFFFFLFLSAMVEGAPKAKKVKCKDKKYPQCYKSEHYCPSDCLRTCVVDCASCQPVCTPPPPPPPSPPPPPPKPRKLKSPPPPYIYSS--PPP
        MARIAIFLFFFFLFLSA+VEG PKAKKVKCKDKK+PQCYKS+HYCP DCLRTCVVDC+SCQPVCT PPPPPPSPPPPPPKPRKL+SPPPPYIYSS  PPP
Subjt:  MARIAIFLFFFFLFLSAMVEGAPKAKKVKCKDKKYPQCYKSEHYCPSDCLRTCVVDCASCQPVCTPPPPPPPSPPPPPPKPRKLKSPPPPYIYSS--PPP

Query:  PHIYSS-PPPPPYVYSSPPPP---TAEPS---PSTPTPPT-----SPPPSSEASGQKKARCKNRGFPHCYGMELSCPSDCPSQCEVDCVTCSPVCNCNRP
        P +YSS PPPPPY+YSSPPPP   T EPS   P TPTPP+     SPPPSSEASGQKK RCKNRG+PHCYGMELSCPSDCPSQCEVDCVTCSPVCNCNRP
Subjt:  PHIYSS-PPPPPYVYSSPPPP---TAEPS---PSTPTPPT-----SPPPSSEASGQKKARCKNRGFPHCYGMELSCPSDCPSQCEVDCVTCSPVCNCNRP

Query:  GAVCQDPKFIGGDGITFYFHGKKDKDFCIVTDSNLHINAHFIGRRNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAIDRLYLSLDNERILLPNQE
        GAVCQDPKFIGGDGITFYFHGKKD+DFCIVTDSNLHINAHFIGRRNV+MKRDFTWVQSLGILF SHKLFI ARKT+TWDDA DRLY+SLD+E ILLPNQE
Subjt:  GAVCQDPKFIGGDGITFYFHGKKDKDFCIVTDSNLHINAHFIGRRNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAIDRLYLSLDNERILLPNQE

Query:  GATWSNSTSYEGITITRSRNTNAVEIEVPGNFKIKAVVVPITEKESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYGNKYVSRAKMGVAMPVL
        GATWSNSTSYEGI I+RSR TNAVEIEVPGNFKIKAVVVPITEKES IHKYGITQEDCFAHLDLSFKFYALSG+V+GVLGQTYGN YVSRAKMGVAMPVL
Subjt:  GATWSNSTSYEGITITRSRNTNAVEIEVPGNFKIKAVVVPITEKESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYGNKYVSRAKMGVAMPVL

Query:  GGDKEFASSSIFATDCKVARFNGELDGKDSSLEAAAYANMSCGSDMDGQGVVCKR
        GGDKEFASSSIFATDC+VARF+ ELDGK+SS+EAAAYANMSCG+DM+GQGVVCKR
Subjt:  GGDKEFASSSIFATDCKVARFNGELDGKDSSLEAAAYANMSCGSDMDGQGVVCKR

XP_022937516.1 uncharacterized protein LOC111443907 [Cucurbita moschata]1.0e-21181.53Show/hide
Query:  MARIAIFLFFFFLFLSAMVEGAPKAKKVKCKDKKYPQCYKSEHYCPSDCLRTCVVDCASCQPVCTPPPPPPPSPPPPPPKPRKLKS------------PP
        M +I   LF FFLFLSA VE  PK KKVKCKDK +PQCYKSEHYCP+DCLRTCVVDC+SC+PVCTPPPPPPPSPPPPPPKPRKLKS            PP
Subjt:  MARIAIFLFFFFLFLSAMVEGAPKAKKVKCKDKKYPQCYKSEHYCPSDCLRTCVVDCASCQPVCTPPPPPPPSPPPPPPKPRKLKS------------PP

Query:  PPYIYSSPPPP--HIYSSPPPPPYVYSSPPPP-----------TAEPS---PSTPTPPT--SPPPSSEASGQKKARCKNRGFPHCYGMELSCPSDCPSQC
        PPYIYSSPPPP  H+YSSPPPPPY+YSSPPPP           T EP    P TPTPPT  SPPPSSEASGQKK RCKNR FPHCYGMEL+CP+DCP QC
Subjt:  PPYIYSSPPPP--HIYSSPPPPPYVYSSPPPP-----------TAEPS---PSTPTPPT--SPPPSSEASGQKKARCKNRGFPHCYGMELSCPSDCPSQC

Query:  EVDCVTCSPVCNCNRPGAVCQDPKFIGGDGITFYFHGKKDKDFCIVTDSNLHINAHFIGRRNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAIDR
        EVDCVTCS VCNCNRPGAVCQDP+FIGGDGITFYFHGKKDKDFCIVTDSNLHINAHFIGRRNV+MKRDFTWVQSLGILFDSH+LFI ARKT+TWDDA DR
Subjt:  EVDCVTCSPVCNCNRPGAVCQDPKFIGGDGITFYFHGKKDKDFCIVTDSNLHINAHFIGRRNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAIDR

Query:  LYLSLDNERILLPNQEGATWSNSTSYEGITITRSRNTNAVEIEVPGNFKIKAVVVPITEKESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYG
        L L  +N+ I+L NQEGATWSNST+YEGITITR+RNTNAVEI VPGNFKIKAVVVPITEKES IHKYGITQEDCFAHLDLSFKFYALSG VNGVLGQTYG
Subjt:  LYLSLDNERILLPNQEGATWSNSTSYEGITITRSRNTNAVEIEVPGNFKIKAVVVPITEKESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYG

Query:  NKYVSRAKMGVAMPVLGGDKEFASSSIFATDCKVARFNGELDGKDSSLEAAAYANMSCGSDMDGQGVVCKR
        + YVSRAKMGVAMPVLGGDKEFASS  FATDC VARFNG+L+GKDSSLE  AY NMSCGSDM+G+GVVCKR
Subjt:  NKYVSRAKMGVAMPVLGGDKEFASSSIFATDCKVARFNGELDGKDSSLEAAAYANMSCGSDMDGQGVVCKR

XP_022969544.1 uncharacterized protein LOC111468530 [Cucurbita maxima]3.6e-21281.14Show/hide
Query:  MARIAIFLFFFFLFLSAMVEGAPKAKKVKCKDKKYPQCYKSEHYCPSDCLRTCVVDCASCQPVCTPPPPPPPSPPPPPPKPRKLKS------------PP
        M +I   LF FFLFLSA VE  PK KKVKCKDKK+PQCYKSEHYCP+DCLRTCVVDC+SC+PVCTPPPPPPPSPPPPPPKPRKLKS            PP
Subjt:  MARIAIFLFFFFLFLSAMVEGAPKAKKVKCKDKKYPQCYKSEHYCPSDCLRTCVVDCASCQPVCTPPPPPPPSPPPPPPKPRKLKS------------PP

Query:  PPYIYSSPPPP--HIYSSPPPPPYVYSSPPPP------------TAEPS---PSTPTPPT--SPPPSSEASGQKKARCKNRGFPHCYGMELSCPSDCPSQ
        PPYIYSSPPPP  H+YSSPPPPPY+YSSPPPP            T EP    P TP PPT  SPPPSSEASGQKK RCKNR FPHCYGMEL+CP+DCP Q
Subjt:  PPYIYSSPPPP--HIYSSPPPPPYVYSSPPPP------------TAEPS---PSTPTPPT--SPPPSSEASGQKKARCKNRGFPHCYGMELSCPSDCPSQ

Query:  CEVDCVTCSPVCNCNRPGAVCQDPKFIGGDGITFYFHGKKDKDFCIVTDSNLHINAHFIGRRNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAID
        CEVDCVTCS VCNCNRPGAVCQDP+FIGGDGITFYFHGKKD+DFCIVTDSNLHINAHFIGRRNV+MKRDFTWVQSLGILFDSH+LFI ARKT+TWDDA D
Subjt:  CEVDCVTCSPVCNCNRPGAVCQDPKFIGGDGITFYFHGKKDKDFCIVTDSNLHINAHFIGRRNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAID

Query:  RLYLSLDNERILLPNQEGATWSNSTSYEGITITRSRNTNAVEIEVPGNFKIKAVVVPITEKESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTY
        RL LS +N+ I+L N+EGATWSNST+YEGITITR+RNTNAVEI VPGNFKIKAVVVPITEKES IHKYGITQEDCFAHLDLSFKFYALSG+VNGVLGQTY
Subjt:  RLYLSLDNERILLPNQEGATWSNSTSYEGITITRSRNTNAVEIEVPGNFKIKAVVVPITEKESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTY

Query:  GNKYVSRAKMGVAMPVLGGDKEFASSSIFATDCKVARFNGELDGKDSSLEAAAYANMSCGSDMDGQGVVCKR
        G+ YVSRAKMGVAMPVLGGDKEFASS  FATDC VARFNG+L+GKDSSLE  AY NMSCGSDM+G+GVVCKR
Subjt:  GNKYVSRAKMGVAMPVLGGDKEFASSSIFATDCKVARFNGELDGKDSSLEAAAYANMSCGSDMDGQGVVCKR

XP_038894596.1 uncharacterized protein LOC120083110 [Benincasa hispida]1.1e-22490.74Show/hide
Query:  MARIAIFLFFFFLFLSAMVEGAPKAKKVKCKDKKYPQCYKSEHYCPSDCLRTCVVDCASCQPVCTPPPPPPPSPPPPPPKPRKLKSPPPPYIYSS-PPPP
        MA+IAIFL  F LFLSA+VEGAPKAKKVKCKDKKYPQCYKS+HYCP+DCLRTCVVDC+SC+PVCTPPPPPPPSPPPPPPKPRKLKSPPPPYIYSS PPPP
Subjt:  MARIAIFLFFFFLFLSAMVEGAPKAKKVKCKDKKYPQCYKSEHYCPSDCLRTCVVDCASCQPVCTPPPPPPPSPPPPPPKPRKLKSPPPPYIYSS-PPPP

Query:  HIYSSPPPPPYVYSSPPPPTAEPSPSTPTPPT-SPPPSSEASGQKKARCKNRGFPHCYGMELSCPSDCPSQCEVDCVTCSPVCNCNRPGAVCQDPKFIGG
        +IYSSPPPPP   S P PPT  P+P T  PP+ SPPPSSEASGQKK RCKNRG+PHCYGMELSCPSDCPSQCEVDCVTCSPVCNC+RPGAVCQDPKFIGG
Subjt:  HIYSSPPPPPYVYSSPPPPTAEPSPSTPTPPT-SPPPSSEASGQKKARCKNRGFPHCYGMELSCPSDCPSQCEVDCVTCSPVCNCNRPGAVCQDPKFIGG

Query:  DGITFYFHGKKDKDFCIVTDSNLHINAHFIGRRNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAIDRLYLSLDNERILLPNQEGATWSNSTSYEG
        DGITFYFHGKKDKDFCIVTDSNLHINAHFIGRRNV MKRDFTWVQSLGILFDSHKLFI A+KTTTWDDA DRLYLSLD+ERILLPNQEGATW NSTSYEG
Subjt:  DGITFYFHGKKDKDFCIVTDSNLHINAHFIGRRNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAIDRLYLSLDNERILLPNQEGATWSNSTSYEG

Query:  ITITRSRNTNAVEIEVPGNFKIKAVVVPITEKESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYGNKYVSRAKMGVAMPVLGGDKEFASSSIF
        ITITRSRNTNAVEIEV GNFKIKA VVPITEKES IHKYGITQEDCFAHLDLSFKFYALSG+VNGVLGQTYG  YVSRAKMGVAMPVLGGDKEFASSSIF
Subjt:  ITITRSRNTNAVEIEVPGNFKIKAVVVPITEKESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYGNKYVSRAKMGVAMPVLGGDKEFASSSIF

Query:  ATDCKVARFNGELDGKDSSLEAAAYANMSCGSDMDGQGVVCKR
        ATDC+VARF+GELDGKDSSLEAAAYANMSCGSDMDGQGVVCKR
Subjt:  ATDCKVARFNGELDGKDSSLEAAAYANMSCGSDMDGQGVVCKR

TrEMBL top hitse value%identityAlignment
A0A0A0LSM1 Uncharacterized protein8.8e-22586.05Show/hide
Query:  MARIAIFLFFFFLFLSAMVEGAPKAKKVKCKDKKYPQCYKSEHYCPSDCLRTCVVDCASCQPVCTPPPPPPPSPPPPPPKPRKLKSPPPPYIYSS-----
        MARIAIFLFF FLFLSA+VEGAPKAKKVKCKDKK+PQCYKSEHYCP+DCLRTCVVDC+SCQPVCTPPPPPPPSPPPPPPKPRKLKSPPPPYIYSS     
Subjt:  MARIAIFLFFFFLFLSAMVEGAPKAKKVKCKDKKYPQCYKSEHYCPSDCLRTCVVDCASCQPVCTPPPPPPPSPPPPPPKPRKLKSPPPPYIYSS-----

Query:  --------PPPPHIYSSPPPPPYVYSSPPPP---TAEPSPSTP---------TPPTSPPPSSEASGQKKARCKNRGFPHCYGMELSCPSDCPSQCEVDCV
                PPPP+IYSSPPPPP++YSSPPPP   T EPSP  P          PP SPPPSSEASGQKK RCKNRG+PHCYGMELSCPSDCPSQCEVDCV
Subjt:  --------PPPPHIYSSPPPPPYVYSSPPPP---TAEPSPSTP---------TPPTSPPPSSEASGQKKARCKNRGFPHCYGMELSCPSDCPSQCEVDCV

Query:  TCSPVCNCNRPGAVCQDPKFIGGDGITFYFHGKKDKDFCIVTDSNLHINAHFIGRRNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAIDRLYLSL
        TCSPVCNCNRPGAVCQDPKFIGGDGITFYFHGK+DKDFCIVTDSNLHINAHFIGRRNV+MKRDFTWVQSLGILFDSH+LFI ARKT+TWDDA DRLY+SL
Subjt:  TCSPVCNCNRPGAVCQDPKFIGGDGITFYFHGKKDKDFCIVTDSNLHINAHFIGRRNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAIDRLYLSL

Query:  DNERILLPNQEGATWSNSTSYEGITITRSRNTNAVEIEVPGNFKIKAVVVPITEKESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYGNKYVS
        D+E I+LPNQEGATWSNSTSYEGI ITRSR TNAVEIEVPGNFKIKAVVVPITEKES IHKYGITQEDCFAHLDLSFKFYALSG+VNGVLGQTYG  YVS
Subjt:  DNERILLPNQEGATWSNSTSYEGITITRSRNTNAVEIEVPGNFKIKAVVVPITEKESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYGNKYVS

Query:  RAKMGVAMPVLGGDKEFASSSIFATDCKVARFNGELDGKDSSLEAAAYANMSCGSDMDGQGVVCKR
        RAKMGVAMPVLGGDKEFASSSIFATDC+V RF  E+D K+S +EAAAYANMSCGSDMDGQGVVCKR
Subjt:  RAKMGVAMPVLGGDKEFASSSIFATDCKVARFNGELDGKDSSLEAAAYANMSCGSDMDGQGVVCKR

A0A1S3CF51 uncharacterized protein LOC1035002225.2e-22588.35Show/hide
Query:  MARIAIFLFFFFLFLSAMVEGAPKAKKVKCKDKKYPQCYKSEHYCPSDCLRTCVVDCASCQPVCTPPPPPPPSPPPPPPKPRKLKSPPPPYIYSS--PPP
        MARIAIFLFFFFLFLSA+VEG PKAKKVKCKDKK+PQCYKS+HYCP DCLRTCVVDC+SCQPVCT PPPPPPSPPPPPPKPRKL+SPPPPYIYSS  PPP
Subjt:  MARIAIFLFFFFLFLSAMVEGAPKAKKVKCKDKKYPQCYKSEHYCPSDCLRTCVVDCASCQPVCTPPPPPPPSPPPPPPKPRKLKSPPPPYIYSS--PPP

Query:  PHIYSS-PPPPPYVYSSPPPP---TAEPS---PSTPTPPT-----SPPPSSEASGQKKARCKNRGFPHCYGMELSCPSDCPSQCEVDCVTCSPVCNCNRP
        P +YSS PPPPPY+YSSPPPP   T EPS   P TPTPP+     SPPPSSEASGQKK RCKNRG+PHCYGMELSCPSDCPSQCEVDCVTCSPVCNCNRP
Subjt:  PHIYSS-PPPPPYVYSSPPPP---TAEPS---PSTPTPPT-----SPPPSSEASGQKKARCKNRGFPHCYGMELSCPSDCPSQCEVDCVTCSPVCNCNRP

Query:  GAVCQDPKFIGGDGITFYFHGKKDKDFCIVTDSNLHINAHFIGRRNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAIDRLYLSLDNERILLPNQE
        GAVCQDPKFIGGDGITFYFHGKKD+DFCIVTDSNLHINAHFIGRRNV+MKRDFTWVQSLGILF SHKLFI ARKT+TWDDA DRLY+SLD+E ILLPNQE
Subjt:  GAVCQDPKFIGGDGITFYFHGKKDKDFCIVTDSNLHINAHFIGRRNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAIDRLYLSLDNERILLPNQE

Query:  GATWSNSTSYEGITITRSRNTNAVEIEVPGNFKIKAVVVPITEKESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYGNKYVSRAKMGVAMPVL
        GATWSNSTSYEGI I+RSR TNAVEIEVPGNFKIKAVVVPITEKES IHKYGITQEDCFAHLDLSFKFYALSG+V+GVLGQTYGN YVSRAKMGVAMPVL
Subjt:  GATWSNSTSYEGITITRSRNTNAVEIEVPGNFKIKAVVVPITEKESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYGNKYVSRAKMGVAMPVL

Query:  GGDKEFASSSIFATDCKVARFNGELDGKDSSLEAAAYANMSCGSDMDGQGVVCKR
        GGDKEFASSSIFATDC+VARF+ ELDGK+SS+EAAAYANMSCG+DM+GQGVVCKR
Subjt:  GGDKEFASSSIFATDCKVARFNGELDGKDSSLEAAAYANMSCGSDMDGQGVVCKR

A0A6J1FAJ8 uncharacterized protein LOC1114439075.0e-21281.53Show/hide
Query:  MARIAIFLFFFFLFLSAMVEGAPKAKKVKCKDKKYPQCYKSEHYCPSDCLRTCVVDCASCQPVCTPPPPPPPSPPPPPPKPRKLKS------------PP
        M +I   LF FFLFLSA VE  PK KKVKCKDK +PQCYKSEHYCP+DCLRTCVVDC+SC+PVCTPPPPPPPSPPPPPPKPRKLKS            PP
Subjt:  MARIAIFLFFFFLFLSAMVEGAPKAKKVKCKDKKYPQCYKSEHYCPSDCLRTCVVDCASCQPVCTPPPPPPPSPPPPPPKPRKLKS------------PP

Query:  PPYIYSSPPPP--HIYSSPPPPPYVYSSPPPP-----------TAEPS---PSTPTPPT--SPPPSSEASGQKKARCKNRGFPHCYGMELSCPSDCPSQC
        PPYIYSSPPPP  H+YSSPPPPPY+YSSPPPP           T EP    P TPTPPT  SPPPSSEASGQKK RCKNR FPHCYGMEL+CP+DCP QC
Subjt:  PPYIYSSPPPP--HIYSSPPPPPYVYSSPPPP-----------TAEPS---PSTPTPPT--SPPPSSEASGQKKARCKNRGFPHCYGMELSCPSDCPSQC

Query:  EVDCVTCSPVCNCNRPGAVCQDPKFIGGDGITFYFHGKKDKDFCIVTDSNLHINAHFIGRRNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAIDR
        EVDCVTCS VCNCNRPGAVCQDP+FIGGDGITFYFHGKKDKDFCIVTDSNLHINAHFIGRRNV+MKRDFTWVQSLGILFDSH+LFI ARKT+TWDDA DR
Subjt:  EVDCVTCSPVCNCNRPGAVCQDPKFIGGDGITFYFHGKKDKDFCIVTDSNLHINAHFIGRRNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAIDR

Query:  LYLSLDNERILLPNQEGATWSNSTSYEGITITRSRNTNAVEIEVPGNFKIKAVVVPITEKESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYG
        L L  +N+ I+L NQEGATWSNST+YEGITITR+RNTNAVEI VPGNFKIKAVVVPITEKES IHKYGITQEDCFAHLDLSFKFYALSG VNGVLGQTYG
Subjt:  LYLSLDNERILLPNQEGATWSNSTSYEGITITRSRNTNAVEIEVPGNFKIKAVVVPITEKESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYG

Query:  NKYVSRAKMGVAMPVLGGDKEFASSSIFATDCKVARFNGELDGKDSSLEAAAYANMSCGSDMDGQGVVCKR
        + YVSRAKMGVAMPVLGGDKEFASS  FATDC VARFNG+L+GKDSSLE  AY NMSCGSDM+G+GVVCKR
Subjt:  NKYVSRAKMGVAMPVLGGDKEFASSSIFATDCKVARFNGELDGKDSSLEAAAYANMSCGSDMDGQGVVCKR

A0A6J1FCE2 uncharacterized protein LOC1114441161.5e-21181.32Show/hide
Query:  MARIAIFLFFFFLFLSAMVEGAPKAKKVKCKDKKYPQCYKSEHYCPSDCLRTCVVDCASCQPVCTPPPPPPPSPPPPPPKPRKLKS------------PP
        M +I   LF FFLFLSA VE  PK KKVKCKDK +PQCYKSEHYCP+DCLRTCVVDC+SC+PVCTPPPPPPPSPPPPPPKPRKLKS            PP
Subjt:  MARIAIFLFFFFLFLSAMVEGAPKAKKVKCKDKKYPQCYKSEHYCPSDCLRTCVVDCASCQPVCTPPPPPPPSPPPPPPKPRKLKS------------PP

Query:  PPYIYSSPPPP--HIYSSPPPPPYVYSSPPPP-----------TAEPS---PSTPTPPT--SPPPSSEASGQKKARCKNRGFPHCYGMELSCPSDCPSQC
        PPYIYSSPPPP  H+Y SPPPPPY+YSSPPPP           T EP    P TPTPPT  SPPPSSEASGQKK RCKNR FPHCYGMEL+CP+DCP QC
Subjt:  PPYIYSSPPPP--HIYSSPPPPPYVYSSPPPP-----------TAEPS---PSTPTPPT--SPPPSSEASGQKKARCKNRGFPHCYGMELSCPSDCPSQC

Query:  EVDCVTCSPVCNCNRPGAVCQDPKFIGGDGITFYFHGKKDKDFCIVTDSNLHINAHFIGRRNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAIDR
        EVDCVTCS VCNCNRPGAVCQDP+FIGGDGITFYFHGKKDKDFCIVTDSNLHINAHFIGRRNV+MKRDFTWVQSLGILFDSH+LFI ARKT+TWDDA DR
Subjt:  EVDCVTCSPVCNCNRPGAVCQDPKFIGGDGITFYFHGKKDKDFCIVTDSNLHINAHFIGRRNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAIDR

Query:  LYLSLDNERILLPNQEGATWSNSTSYEGITITRSRNTNAVEIEVPGNFKIKAVVVPITEKESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYG
        L LS +N+ I+L N+EGATWSNST+YEGITITR+RNTNAVEI VPGNFKIKAVVVPITEKES IHKYGITQEDCFAHLDLSFKFYALSG VNGVLGQTYG
Subjt:  LYLSLDNERILLPNQEGATWSNSTSYEGITITRSRNTNAVEIEVPGNFKIKAVVVPITEKESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYG

Query:  NKYVSRAKMGVAMPVLGGDKEFASSSIFATDCKVARFNGELDGKDSSLEAAAYANMSCGSDMDGQGVVCKR
        + YVSRAKMGVAMPVLGGDKEFASS  FATDC VARFNG+L+GKDSSLE  AY NMSCGSDM+G+GVVCKR
Subjt:  NKYVSRAKMGVAMPVLGGDKEFASSSIFATDCKVARFNGELDGKDSSLEAAAYANMSCGSDMDGQGVVCKR

A0A6J1I078 uncharacterized protein LOC1114685301.7e-21281.14Show/hide
Query:  MARIAIFLFFFFLFLSAMVEGAPKAKKVKCKDKKYPQCYKSEHYCPSDCLRTCVVDCASCQPVCTPPPPPPPSPPPPPPKPRKLKS------------PP
        M +I   LF FFLFLSA VE  PK KKVKCKDKK+PQCYKSEHYCP+DCLRTCVVDC+SC+PVCTPPPPPPPSPPPPPPKPRKLKS            PP
Subjt:  MARIAIFLFFFFLFLSAMVEGAPKAKKVKCKDKKYPQCYKSEHYCPSDCLRTCVVDCASCQPVCTPPPPPPPSPPPPPPKPRKLKS------------PP

Query:  PPYIYSSPPPP--HIYSSPPPPPYVYSSPPPP------------TAEPS---PSTPTPPT--SPPPSSEASGQKKARCKNRGFPHCYGMELSCPSDCPSQ
        PPYIYSSPPPP  H+YSSPPPPPY+YSSPPPP            T EP    P TP PPT  SPPPSSEASGQKK RCKNR FPHCYGMEL+CP+DCP Q
Subjt:  PPYIYSSPPPP--HIYSSPPPPPYVYSSPPPP------------TAEPS---PSTPTPPT--SPPPSSEASGQKKARCKNRGFPHCYGMELSCPSDCPSQ

Query:  CEVDCVTCSPVCNCNRPGAVCQDPKFIGGDGITFYFHGKKDKDFCIVTDSNLHINAHFIGRRNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAID
        CEVDCVTCS VCNCNRPGAVCQDP+FIGGDGITFYFHGKKD+DFCIVTDSNLHINAHFIGRRNV+MKRDFTWVQSLGILFDSH+LFI ARKT+TWDDA D
Subjt:  CEVDCVTCSPVCNCNRPGAVCQDPKFIGGDGITFYFHGKKDKDFCIVTDSNLHINAHFIGRRNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAID

Query:  RLYLSLDNERILLPNQEGATWSNSTSYEGITITRSRNTNAVEIEVPGNFKIKAVVVPITEKESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTY
        RL LS +N+ I+L N+EGATWSNST+YEGITITR+RNTNAVEI VPGNFKIKAVVVPITEKES IHKYGITQEDCFAHLDLSFKFYALSG+VNGVLGQTY
Subjt:  RLYLSLDNERILLPNQEGATWSNSTSYEGITITRSRNTNAVEIEVPGNFKIKAVVVPITEKESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTY

Query:  GNKYVSRAKMGVAMPVLGGDKEFASSSIFATDCKVARFNGELDGKDSSLEAAAYANMSCGSDMDGQGVVCKR
        G+ YVSRAKMGVAMPVLGGDKEFASS  FATDC VARFNG+L+GKDSSLE  AY NMSCGSDM+G+GVVCKR
Subjt:  GNKYVSRAKMGVAMPVLGGDKEFASSSIFATDCKVARFNGELDGKDSSLEAAAYANMSCGSDMDGQGVVCKR

SwissProt top hitse value%identityAlignment
O65375 Leucine-rich repeat extensin-like protein 14.5e-0867.53Show/hide
Query:  PPPPPPSPPPPPPKPRKLKSPPPPYIYSS-PPPPHIYSSPPPPPYVYSSPPPPTAEPSP-------STPTPPTSPPP
        PPPPPPSP PPP  P    SPPPPY+YSS PPPP++YSSPPPPPYVYSSPPPP    SP       S P PP SPPP
Subjt:  PPPPPPSPPPPPPKPRKLKSPPPPYIYSS-PPPPHIYSSPPPPPYVYSSPPPPTAEPSP-------STPTPPTSPPP

Q9LUI1 Leucine-rich repeat extensin-like protein 61.8e-0455.68Show/hide
Query:  VDCAS--CQPVC---TPPPPPPPSPPPPPPKPRKLKSPPPPYIYSSPPPPHIYSSPPPPPYVYSSPPPPTAEPSPSTPTPPTSPPPSS
        +DCAS  C P      PPPPPPP PPPPPP P     PPPPY+Y SPPPP     P PPPYVY  PPPP   P P +P     PPP S
Subjt:  VDCAS--CQPVC---TPPPPPPPSPPPPPPKPRKLKSPPPPYIYSSPPPPHIYSSPPPPPYVYSSPPPPTAEPSPSTPTPPTSPPPSS

Q9T0K5 Leucine-rich repeat extensin-like protein 33.9e-0449Show/hide
Query:  PVCTPPPPPP---PSPPPPPPKPRKLKSPPPPYIYSSPPPPHI-----------------------YSSPPPPPYVYSSPPPPTAEPSPSTPTPPTSPPP
        P   PPPPPP   P PPPPPP P  + SPPPP +YSSPPPP                         +S PPP PY YSSPPPP + P P +P PP SPPP
Subjt:  PVCTPPPPPP---PSPPPPPPKPRKLKSPPPPYIYSSPPPPHI-----------------------YSSPPPPPYVYSSPPPPTAEPSPSTPTPPTSPPP

Arabidopsis top hitse value%identityAlignment
AT3G19430.1 late embryogenesis abundant protein-related / LEA protein-related5.7e-10743.69Show/hide
Query:  CKDKKYPQCYKSEHYCPSDCLRTCVVDCASCQPVC---------------------TPPPPPPPSPPPPP----PKPRKLKSPPPPYIYSSPPPPHIYSS
        CK KKY  CY  EH CP  C  +C V+CASC+P+C                     TPP P PP  PPPP    P P    SPPPP    S P P    S
Subjt:  CKDKKYPQCYKSEHYCPSDCLRTCVVDCASCQPVC---------------------TPPPPPPPSPPPPP----PKPRKLKSPPPPYIYSSPPPPHIYSS

Query:  PPPP---PYVYS-----SPPPPTAEPSPSTPTPPTSPPP-------------------------------------------------------------
        PPPP   P V S     SPPPPT  PS  +PTPP SPPP                                                             
Subjt:  PPPP---PYVYS-----SPPPPTAEPSPSTPTPPTSPPP-------------------------------------------------------------

Query:  -------------------------------------SSEASGQKKARCKNRGFPHCYGMELSCPSDCPSQCEVDCVTCSPVCNCNRPGAVCQDPKFIGG
                                               EA+G K+ RCK +  P CYG+E +CP+DCP  C+VDCVTC PVCNC++PG+VCQDP+FIGG
Subjt:  -------------------------------------SSEASGQKKARCKNRGFPHCYGMELSCPSDCPSQCEVDCVTCSPVCNCNRPGAVCQDPKFIGG

Query:  DGITFYFHGKKDKDFCIVTDSNLHINAHFIGRRNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAIDRLYLSLDNERILLPNQEGATWSNSTS-YE
        DG+TFYFHGKKD +FC+++D NLHINAHFIG+R   M RDFTWVQS+ ILF +H+L++ A KT TWDD++DR+ +S D   I LP  +GA W++S   Y 
Subjt:  DGITFYFHGKKDKDFCIVTDSNLHINAHFIGRRNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAIDRLYLSLDNERILLPNQEGATWSNSTS-YE

Query:  GITITR-SRNTNAVEIEVPGNFKIKAVVVPITEKESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYGNKYVSRAKMGVAMPVLGGDKEFASSS
         +++ R + +TN +E+EV G  KI A VVPIT ++S IH Y + ++DC AHLDL FKF  LS +V+GVLGQTY + YVSR K+GV MPV+GGD+EF ++ 
Subjt:  GITITR-SRNTNAVEIEVPGNFKIKAVVVPITEKESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYGNKYVSRAKMGVAMPVLGGDKEFASSS

Query:  IFATDCKVARF--NGELDGKDSSLEAAAYANMSCGSDMDGQGVVCKR
        +FA DC  ARF  NG+ +   S LE      MSC S + G+GVVCKR
Subjt:  IFATDCKVARF--NGELDGKDSSLEAAAYANMSCGSDMDGQGVVCKR

AT4G27400.1 Late embryogenesis abundant (LEA) protein-related3.6e-6138.02Show/hide
Query:  CKNRGFPHCYGMELSCPSDCPSQ---------CEVDCV--TCSPVC-----NCNRPGAVCQDPKFIGGDGITFYFHGKKDKDFCIVTDSNLHINAHFIGR
        C     P C    + CP +CP++         C VDC    C  VC     NC   G++C DP+FIGGDGI FYFHGK ++ F IV+D +  INA F G 
Subjt:  CKNRGFPHCYGMELSCPSDCPSQ---------CEVDCV--TCSPVC-----NCNRPGAVCQDPKFIGGDGITFYFHGKKDKDFCIVTDSNLHINAHFIGR

Query:  RNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAIDRLYLSLDNERILLPNQEGATWSNSTSYEGITITRSRNTNAVEIEVPGNFKIKAVVVPITEK
        R     RDFTW+Q+LG LF+SHK  ++  K  TWD  +D L  ++D + +++P +  +TW +S   + I I R    N+V + +    +I   VVP+T++
Subjt:  RNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAIDRLYLSLDNERILLPNQEGATWSNSTSYEGITITRSRNTNAVEIEVPGNFKIKAVVVPITEK

Query:  ESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYGNKYVSRAKMGVAMPVLGGDKEFASSSIFATDCKVARFNGELDGKDSSLE-AAAYANMSCG
        +  IH Y +  +DCFAH ++ FKF  LS  V+G+LG+TY   + + AK GV MPV+GG+  F +SS+ +  CK   F+ +      S++  + YA + C 
Subjt:  ESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYGNKYVSRAKMGVAMPVLGGDKEFASSSIFATDCKVARFNGELDGKDSSLE-AAAYANMSCG

Query:  -SDMDGQGVVCKR
             G G+VC++
Subjt:  -SDMDGQGVVCKR

AT5G54370.1 Late embryogenesis abundant (LEA) protein-related2.6e-6741.27Show/hide
Query:  CKNRGFPHCYGMELSCPSDCPSQ---------CEVDC--VTCSPVC-----NCNRPGAVCQDPKFIGGDGITFYFHGKKDKDFCIVTDSNLHINAHFIGR
        C N  +  CY   + CP +CPS+         C  DC   TC   C     NCNRPG+ C DP+FIGGDGI FYFHGK +++F +V+DS+L IN  FIG 
Subjt:  CKNRGFPHCYGMELSCPSDCPSQ---------CEVDC--VTCSPVC-----NCNRPGAVCQDPKFIGGDGITFYFHGKKDKDFCIVTDSNLHINAHFIGR

Query:  RNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAIDRLYLSLDNERILLPNQEGATWSNSTSYEGITITRSRNTNAVEIEVPGNFKIKAVVVPITEK
        R     RDFTW+Q+LG LF+S+K  ++A KT +WD+ ID L  S D + + +P +  +TW +    + I I R    N+V + +    +I   VVP+T++
Subjt:  RNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAIDRLYLSLDNERILLPNQEGATWSNSTSYEGITITRSRNTNAVEIEVPGNFKIKAVVVPITEK

Query:  ESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYGNKYVSRAKMGVAMPVLGGDKEFASSSIFATDCKVARFN---GELDGKDSSLEAAAYANMS
        +  IH Y +  +DCFAHL++ F+F+ LS  V+G+LG+TY   + + AK GVAMPV+GG+  F +SS+ + DCK   F+    E+D   S +E   YA + 
Subjt:  ESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYGNKYVSRAKMGVAMPVLGGDKEFASSSIFATDCKVARFN---GELDGKDSSLEAAAYANMS

Query:  C-GSDMDGQGVVCKR
        C      G G+VC++
Subjt:  C-GSDMDGQGVVCKR

AT5G60520.1 Late embryogenesis abundant (LEA) protein-related7.0e-6542.16Show/hide
Query:  SGQKKARCKNRGFPHCYGMELSCPSDCPSQ----------CEVDCVT-CSPVC-----NCNRPGAVCQDPKFIGGDGITFYFHGKKDKDFCIVTDSNLHI
        SGQ++ +C  RG   C    L+CP +CP +          C +DC + C   C     NCN  G++C DP+F+GGDG+ FYFHG KD +F IV+D NL I
Subjt:  SGQKKARCKNRGFPHCYGMELSCPSDCPSQ----------CEVDCVT-CSPVC-----NCNRPGAVCQDPKFIGGDGITFYFHGKKDKDFCIVTDSNLHI

Query:  NAHFIGRRNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAIDRLYLSLDNERILLPNQEGATWSNSTSYEGITITRSRNTNAVEIEVPGNFKIKAV
        NAHFIG R     RDFTWVQ+  ++FDSH L I A+K  +WDD++D L +  + E + +P +  A W        + + R+   N V + V G  +I   
Subjt:  NAHFIGRRNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAIDRLYLSLDNERILLPNQEGATWSNSTSYEGITITRSRNTNAVEIEVPGNFKIKAV

Query:  VVPITEKESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYGNKYVSRAKMGVAMPVLGGDKEFASSSIFATDCKVARFNGE
        V PI ++E  +HKY + ++D FAHL+  FKF+ LS  V GVLG+TY   YVS  K GV MP++GG+ ++ + S+F+  C V RF G+
Subjt:  VVPITEKESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYGNKYVSRAKMGVAMPVLGGDKEFASSSIFATDCKVARFNGE

AT5G60530.1 late embryogenesis abundant protein-related / LEA protein-related5.9e-6443.51Show/hide
Query:  SGQKKARCKNRGFPHCYGMELSCPSDCPSQ----------CEVDCVT-CSPVC-----NCNRPGAVCQDPKFIGGDGITFYFHGKKDKDFCIVTDSNLHI
        +GQ++A C+ RG   CY   L CP +CP +          C +DC   C   C     NCN  G++C DP+F+GGDG+ FYFHG K  +F IV+D+NL I
Subjt:  SGQKKARCKNRGFPHCYGMELSCPSDCPSQ----------CEVDCVT-CSPVC-----NCNRPGAVCQDPKFIGGDGITFYFHGKKDKDFCIVTDSNLHI

Query:  NAHFIGRRNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAIDRLYLSLDNERILLPNQEGATWSN-STSYEGITITRSRNTNAVEIEVPGNFKIKA
        NAHFIG R V   RDFTWVQ+L ++F++HKL I A +   WD+  D   +  D E I LP  E + W   S   + I I R+   N+V + V    ++  
Subjt:  NAHFIGRRNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAIDRLYLSLDNERILLPNQEGATWSN-STSYEGITITRSRNTNAVEIEVPGNFKIKA

Query:  VVVPITEKESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYGNKYVSRAKMGVAMPVLGGDKEFASSSIFATDCKVARF
         V PI ++E+ +H Y + Q+D FAHL+  FKF  LS  V GVLG+TY   YVS AK GV MPVLGG+ ++ + S+F+  C++ RF
Subjt:  VVVPITEKESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYGNKYVSRAKMGVAMPVLGGDKEFASSSIFATDCKVARF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGGCGTCGGCGTCTTCCTCTTCCTCTCGTCCACAGTCGGAATATCCTCAGATAATCATGGTCCAGAAATTCATTGACGCTTTAAGCGAACAGGGATTTCTTGACGA
GAACTTTAGTAACTTGAGAACTCTCCCTGAGAATCAAAACTCCTTGGTTATATTCTGCAGTGACGCCGAACCTAAGCTCGAAACTATTGAGAAAGCTCTGGCAGATGAGC
CTCTGGATTTTCGTACTGTGATAAACACCGTGGAAATGATCAGATCTGGTGCTGTTAGAGTTGGTGGAACTCGATTAGCAGGGGCTTGTACTGTCTTACTACAACATTGC
AATGACAGTAACATTCAGCGTTCAAAAGATGCTTACAAGAAGCTCAATTGGGAATACTATGTTTTACGTGATTGTTTCCACCATATTCTTCAGACGTTAGACAAAAACAA
TTACAAGAGCCAAATGTTTGTAGCTTCCAATATGGCAAGAATCGCCATATTCCTCTTCTTTTTCTTTCTCTTCCTCTCAGCTATGGTTGAGGGAGCTCCAAAGGCAAAGA
AAGTTAAATGCAAAGACAAGAAATACCCTCAATGTTACAAATCCGAACACTATTGTCCATCCGATTGCCTTCGAACTTGCGTCGTAGATTGCGCATCTTGTCAACCTGTT
TGCACTCCGCCTCCGCCTCCGCCACCATCGCCTCCTCCACCACCACCGAAACCGCGCAAACTCAAATCTCCACCCCCACCATACATTTACTCTTCACCTCCGCCCCCACA
CATTTACTCTTCCCCACCACCTCCTCCCTATGTTTATTCTTCTCCCCCACCACCTACGGCAGAACCTTCACCTTCAACTCCGACACCTCCGACGTCTCCACCGCCGTCGT
CTGAGGCGTCGGGGCAAAAGAAAGCTAGGTGCAAGAATAGGGGGTTTCCACATTGCTATGGAATGGAGCTAAGTTGTCCAAGTGATTGTCCTAGCCAATGTGAGGTTGAT
TGTGTTACTTGCAGTCCTGTCTGTAATTGCAACCGTCCAGGTGCAGTATGCCAAGACCCAAAATTCATTGGAGGAGATGGAATCACCTTCTACTTCCATGGTAAAAAGGA
TAAAGATTTTTGCATCGTCACCGACTCGAACCTCCACATCAATGCCCATTTCATCGGCCGACGAAACGTCGAAATGAAGAGGGACTTCACTTGGGTCCAATCCCTTGGCA
TTCTCTTCGACTCTCACAAGCTCTTCATCAAAGCGAGAAAAACCACAACATGGGACGACGCCATTGACCGCCTCTACCTTTCCCTCGACAACGAAAGAATCCTCCTCCCT
AACCAAGAGGGCGCCACTTGGAGTAATTCAACTTCGTACGAGGGGATCACCATAACCCGGAGTAGAAACACAAACGCAGTCGAGATCGAAGTCCCTGGAAACTTCAAGAT
CAAAGCAGTCGTGGTCCCGATAACGGAAAAGGAATCAACGATCCACAAGTATGGGATTACACAAGAGGATTGCTTTGCACATTTGGACTTGAGCTTCAAGTTCTATGCAT
TGAGTGGGGATGTGAATGGAGTTTTGGGGCAAACTTATGGGAATAAGTATGTGAGTAGGGCTAAGATGGGAGTGGCAATGCCTGTTTTGGGTGGCGATAAGGAGTTTGCT
TCTTCAAGTATTTTTGCTACGGATTGCAAAGTTGCACGTTTCAATGGGGAGTTGGATGGAAAAGACAGTTCTTTGGAAGCTGCAGCCTATGCCAATATGAGCTGCGGCAG
TGACATGGATGGTCAAGGAGTTGTTTGCAAACGA
mRNA sequenceShow/hide mRNA sequence
ATTTCATAAACCAAAAACTTAGAACAATTCATTGATCCAAAGAAAACTTCTCCCAATGTCGGCGTCGGCGTCTTCCTCTTCCTCTCGTCCACAGTCGGAATATCCTCAGA
TAATCATGGTCCAGAAATTCATTGACGCTTTAAGCGAACAGGGATTTCTTGACGAGAACTTTAGTAACTTGAGAACTCTCCCTGAGAATCAAAACTCCTTGGTTATATTC
TGCAGTGACGCCGAACCTAAGCTCGAAACTATTGAGAAAGCTCTGGCAGATGAGCCTCTGGATTTTCGTACTGTGATAAACACCGTGGAAATGATCAGATCTGGTGCTGT
TAGAGTTGGTGGAACTCGATTAGCAGGGGCTTGTACTGTCTTACTACAACATTGCAATGACAGTAACATTCAGCGTTCAAAAGATGCTTACAAGAAGCTCAATTGGGAAT
ACTATGTTTTACGTGATTGTTTCCACCATATTCTTCAGACGTTAGACAAAAACAATTACAAGAGCCAAATGTTTGTAGCTTCCAATATGGCAAGAATCGCCATATTCCTC
TTCTTTTTCTTTCTCTTCCTCTCAGCTATGGTTGAGGGAGCTCCAAAGGCAAAGAAAGTTAAATGCAAAGACAAGAAATACCCTCAATGTTACAAATCCGAACACTATTG
TCCATCCGATTGCCTTCGAACTTGCGTCGTAGATTGCGCATCTTGTCAACCTGTTTGCACTCCGCCTCCGCCTCCGCCACCATCGCCTCCTCCACCACCACCGAAACCGC
GCAAACTCAAATCTCCACCCCCACCATACATTTACTCTTCACCTCCGCCCCCACACATTTACTCTTCCCCACCACCTCCTCCCTATGTTTATTCTTCTCCCCCACCACCT
ACGGCAGAACCTTCACCTTCAACTCCGACACCTCCGACGTCTCCACCGCCGTCGTCTGAGGCGTCGGGGCAAAAGAAAGCTAGGTGCAAGAATAGGGGGTTTCCACATTG
CTATGGAATGGAGCTAAGTTGTCCAAGTGATTGTCCTAGCCAATGTGAGGTTGATTGTGTTACTTGCAGTCCTGTCTGTAATTGCAACCGTCCAGGTGCAGTATGCCAAG
ACCCAAAATTCATTGGAGGAGATGGAATCACCTTCTACTTCCATGGTAAAAAGGATAAAGATTTTTGCATCGTCACCGACTCGAACCTCCACATCAATGCCCATTTCATC
GGCCGACGAAACGTCGAAATGAAGAGGGACTTCACTTGGGTCCAATCCCTTGGCATTCTCTTCGACTCTCACAAGCTCTTCATCAAAGCGAGAAAAACCACAACATGGGA
CGACGCCATTGACCGCCTCTACCTTTCCCTCGACAACGAAAGAATCCTCCTCCCTAACCAAGAGGGCGCCACTTGGAGTAATTCAACTTCGTACGAGGGGATCACCATAA
CCCGGAGTAGAAACACAAACGCAGTCGAGATCGAAGTCCCTGGAAACTTCAAGATCAAAGCAGTCGTGGTCCCGATAACGGAAAAGGAATCAACGATCCACAAGTATGGG
ATTACACAAGAGGATTGCTTTGCACATTTGGACTTGAGCTTCAAGTTCTATGCATTGAGTGGGGATGTGAATGGAGTTTTGGGGCAAACTTATGGGAATAAGTATGTGAG
TAGGGCTAAGATGGGAGTGGCAATGCCTGTTTTGGGTGGCGATAAGGAGTTTGCTTCTTCAAGTATTTTTGCTACGGATTGCAAAGTTGCACGTTTCAATGGGGAGTTGG
ATGGAAAAGACAGTTCTTTGGAAGCTGCAGCCTATGCCAATATGAGCTGCGGCAGTGACATGGATGGTCAAGGAGTTGTTTGCAAACGA
Protein sequenceShow/hide protein sequence
MSASASSSSSRPQSEYPQIIMVQKFIDALSEQGFLDENFSNLRTLPENQNSLVIFCSDAEPKLETIEKALADEPLDFRTVINTVEMIRSGAVRVGGTRLAGACTVLLQHC
NDSNIQRSKDAYKKLNWEYYVLRDCFHHILQTLDKNNYKSQMFVASNMARIAIFLFFFFLFLSAMVEGAPKAKKVKCKDKKYPQCYKSEHYCPSDCLRTCVVDCASCQPV
CTPPPPPPPSPPPPPPKPRKLKSPPPPYIYSSPPPPHIYSSPPPPPYVYSSPPPPTAEPSPSTPTPPTSPPPSSEASGQKKARCKNRGFPHCYGMELSCPSDCPSQCEVD
CVTCSPVCNCNRPGAVCQDPKFIGGDGITFYFHGKKDKDFCIVTDSNLHINAHFIGRRNVEMKRDFTWVQSLGILFDSHKLFIKARKTTTWDDAIDRLYLSLDNERILLP
NQEGATWSNSTSYEGITITRSRNTNAVEIEVPGNFKIKAVVVPITEKESTIHKYGITQEDCFAHLDLSFKFYALSGDVNGVLGQTYGNKYVSRAKMGVAMPVLGGDKEFA
SSSIFATDCKVARFNGELDGKDSSLEAAAYANMSCGSDMDGQGVVCKR