; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G12200 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G12200
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionlate embryogenesis abundant protein-related / LEA protein-related
Genome locationClcChr01:21096583..21099708
RNA-Seq ExpressionClc01G12200
SyntenyClc01G12200
Gene Ontology termsGO:0009664 - plant-type cell wall organization (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0065007 - biological regulation (biological process)
GO:0005199 - structural constituent of cell wall (molecular function)
GO:0016301 - kinase activity (molecular function)
InterPro domainsIPR006706 - Extensin domain
IPR009646 - Root cap


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008461654.1 PREDICTED: uncharacterized protein LOC103500203 [Cucumis melo]2.7e-17483.85Show/hide
Query:  PPPPYIYSSPPPPYIYSSPPPPPYIYSSPPPPPYIYSSPPPPPYIYSSPQPPPPARTEPSPPT---STPTPPTSPPP-SEAWGQKKARCKNRSYPQCYGM
        PPPP    SPPPPYIYSSPPPPPYIYSSP         PPPPPYIY+SP PPPPA   PSPPT   STPTPPTSPPP SEA GQKKARCKNRSYP CYGM
Subjt:  PPPPYIYSSPPPPYIYSSPPPPPYIYSSPPPPPYIYSSPPPPPYIYSSPQPPPPARTEPSPPT---STPTPPTSPPP-SEAWGQKKARCKNRSYPQCYGM

Query:  ELSCPSACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIVTDSNLHINAHFIGRRNNNMKRDFTWVQSLGILFDSHKLFIGA
        ELSCPS+CPD CEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHG+K++DFCIVTDSNLHINAHFIGRRN NMKRDFTWVQSLGILFDSHKLFIGA
Subjt:  ELSCPSACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIVTDSNLHINAHFIGRRNNNMKRDFTWVQSLGILFDSHKLFIGA

Query:  RKTATWNDAIDRLSISLDDETILLPNREGASWSNSSSYEGITITRSRNTNAVEIEVLGNFKIKAVVVPITEKESRVHNYGITQEDCFAHLDLSFKFYSLS
        +KTATWNDAIDRLS+SLDDETILL N+EGA+W NS+S + ITITR++NTNAVEIEV GNFKIKAVVVPITE +SR+HNYGITQEDCFAHLDLSFKFY+LS
Subjt:  RKTATWNDAIDRLSISLDDETILLPNREGASWSNSSSYEGITITRSRNTNAVEIEVLGNFKIKAVVVPITEKESRVHNYGITQEDCFAHLDLSFKFYSLS

Query:  GDVNGVLGQTYGSNYVSKAKMGVAMPVFGGVNEFASSNIFATDCKVARFNGQLDG-NDSSLEAETYAN-MDCGSDMEGGVVCKR
        GDVNGVLGQTY SNYVSK KMG AMPVFGGVNEFASSNIF+TDC+VARF+G+ DG +DSSLEAE YA+ M CGSD EGGVVCKR
Subjt:  GDVNGVLGQTYGSNYVSKAKMGVAMPVFGGVNEFASSNIFATDCKVARFNGQLDG-NDSSLEAETYAN-MDCGSDMEGGVVCKR

XP_011648458.1 uncharacterized protein LOC101207483 [Cucumis sativus]2.5e-18084.34Show/hide
Query:  NYSPPPSRKLKSPPPPYIYSS-PPPPYIYSS-PPPPPYIYSSPPPPPYIYSS-PPPPPYIYSSPQPPPPARTEPSPPTSTPTPPTSPPP-SEAWGQKKAR
        N  PPP RKLKSPPPPYIYSS PPPPYIYSS PPPPP +YSSPPPPPYIYSS PPPPPYI +SP PPPP+    SPPT TP+ PTSPPP SE  GQKKAR
Subjt:  NYSPPPSRKLKSPPPPYIYSS-PPPPYIYSS-PPPPPYIYSSPPPPPYIYSS-PPPPPYIYSSPQPPPPARTEPSPPTSTPTPPTSPPP-SEAWGQKKAR

Query:  CKNRSYPQCYGMELSCPSACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIVTDSNLHINAHFIGRRNNNMKRDFTWVQSLG
        CKNR YP CYGMELSCPS+CPD CEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHG+K++DFCIVTDSNLHINAHFIGRRN NMKRDFTWVQSLG
Subjt:  CKNRSYPQCYGMELSCPSACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIVTDSNLHINAHFIGRRNNNMKRDFTWVQSLG

Query:  ILFDSHKLFIGARKTATWNDAIDRLSISLDDETILLPNREGASWSNSSSYEGITITRSRNTNAVEIEVLGNFKIKAVVVPITEKESRVHNYGITQEDCFA
        ILFDSHKLFIGA+KTATWNDA DRLS+SLD+ETI+LPN+EGA+WSNS+S +GITITR++NTNAVEI+V GNFKIKAVVVPITE +SR+HNYGITQEDCFA
Subjt:  ILFDSHKLFIGARKTATWNDAIDRLSISLDDETILLPNREGASWSNSSSYEGITITRSRNTNAVEIEVLGNFKIKAVVVPITEKESRVHNYGITQEDCFA

Query:  HLDLSFKFYSLSGDVNGVLGQTYGSNYVSKAKMGVAMPVFGGVNEFASSNIFATDCKVARFNGQLD-GNDSSLEAETYAN-MDCGSDMEGGVVCKR
        HLDLSFKFY+LSGDVNGVLGQTY SNYVSK KMGVAMPVFGG+NEFASSNIFAT+C+VARF+G+LD  +DSSLEAE YAN M CGSD+EGGVVCKR
Subjt:  HLDLSFKFYSLSGDVNGVLGQTYGSNYVSKAKMGVAMPVFGGVNEFASSNIFATDCKVARFNGQLD-GNDSSLEAETYAN-MDCGSDMEGGVVCKR

XP_022937854.1 uncharacterized protein LOC111444116 [Cucurbita moschata]3.9e-17381.31Show/hide
Query:  PPPSRKLKS-PPPPYIYSS--PPPPYIYSSPPPPP-YIYSSPPPPPYIYSSPPPPPYIYSSPQPPPPARTEPSPP-TSTPTPPTS---PPPSEAWGQKKA
        PP  RKLKS PPPPY+YSS  PPPPYIYSSPPPPP ++Y SPPPPPYIYSSPPPPP+IYSS  PPPPA TEP PP   TPTPPTS   PP SEA GQKK 
Subjt:  PPPSRKLKS-PPPPYIYSS--PPPPYIYSSPPPPP-YIYSSPPPPPYIYSSPPPPPYIYSSPQPPPPARTEPSPP-TSTPTPPTS---PPPSEAWGQKKA

Query:  RCKNRSYPQCYGMELSCPSACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIVTDSNLHINAHFIGRRNNNMKRDFTWVQSL
        RCKNRS+P CYGMEL+CP+ CP QCEVDCVTCS VCNCNRPG+VCQDP+F+GGDGITFYFHG+K+KDFCIVTDSNLHINAHFIGRRN +MKRDFTWVQSL
Subjt:  RCKNRSYPQCYGMELSCPSACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIVTDSNLHINAHFIGRRNNNMKRDFTWVQSL

Query:  GILFDSHKLFIGARKTATWNDAIDRLSISLDDETILLPNREGASWSNSSSYEGITITRSRNTNAVEIEVLGNFKIKAVVVPITEKESRVHNYGITQEDCF
        GILFDSH+LFIGARKT+TW+DA DRLS+S +++TI+L NREGA+WSNS++YEGITITR+RNTNAVEI V GNFKIKAVVVPITEKESR+H YGITQEDCF
Subjt:  GILFDSHKLFIGARKTATWNDAIDRLSISLDDETILLPNREGASWSNSSSYEGITITRSRNTNAVEIEVLGNFKIKAVVVPITEKESRVHNYGITQEDCF

Query:  AHLDLSFKFYSLSGDVNGVLGQTYGSNYVSKAKMGVAMPVFGGVNEFASSNIFATDCKVARFNGQLDGNDSSLEAETYANMDCGSDMEG-GVVCKR
        AHLDLSFKFY+LSG VNGVLGQTYGSNYVS+AKMGVAMPV GG  EFASS  FATDC VARFNGQL+G DSSLE   Y NM CGSDMEG GVVCKR
Subjt:  AHLDLSFKFYSLSGDVNGVLGQTYGSNYVSKAKMGVAMPVFGGVNEFASSNIFATDCKVARFNGQLDGNDSSLEAETYANMDCGSDMEG-GVVCKR

XP_022969544.1 uncharacterized protein LOC111468530 [Cucurbita maxima]3.5e-17481.31Show/hide
Query:  PPPSRKLKS-PPPPYIYSS--PPPPYIYSSPPPPP-YIYSSPPPPPYIYSSPPPPPYIYSSPQPPPPARTEPSPP-TSTPTPPTS---PPPSEAWGQKKA
        PP  RKLKS PPPPY+YSS  PPPPYIYSSPPPPP ++YSSPPPPPYIYSSPPPPPY+YSSP PPPPA TEP PP   TP PPTS   PP SEA GQKK 
Subjt:  PPPSRKLKS-PPPPYIYSS--PPPPYIYSSPPPPP-YIYSSPPPPPYIYSSPPPPPYIYSSPQPPPPARTEPSPP-TSTPTPPTS---PPPSEAWGQKKA

Query:  RCKNRSYPQCYGMELSCPSACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIVTDSNLHINAHFIGRRNNNMKRDFTWVQSL
        RCKNRS+P CYGMEL+CP+ CP QCEVDCVTCS VCNCNRPG+VCQDP+F+GGDGITFYFHG+K++DFCIVTDSNLHINAHFIGRRN +MKRDFTWVQSL
Subjt:  RCKNRSYPQCYGMELSCPSACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIVTDSNLHINAHFIGRRNNNMKRDFTWVQSL

Query:  GILFDSHKLFIGARKTATWNDAIDRLSISLDDETILLPNREGASWSNSSSYEGITITRSRNTNAVEIEVLGNFKIKAVVVPITEKESRVHNYGITQEDCF
        GILFDSH+LFIGARKT+TW+DA DRLS+S +++TI+L NREGA+WSNS++YEGITITR+RNTNAVEI V GNFKIKAVVVPITEKESR+H YGITQEDCF
Subjt:  GILFDSHKLFIGARKTATWNDAIDRLSISLDDETILLPNREGASWSNSSSYEGITITRSRNTNAVEIEVLGNFKIKAVVVPITEKESRVHNYGITQEDCF

Query:  AHLDLSFKFYSLSGDVNGVLGQTYGSNYVSKAKMGVAMPVFGGVNEFASSNIFATDCKVARFNGQLDGNDSSLEAETYANMDCGSDMEG-GVVCKR
        AHLDLSFKFY+LSG+VNGVLGQTYGSNYVS+AKMGVAMPV GG  EFASS  FATDC VARFNGQL+G DSSLE   Y NM CGSDMEG GVVCKR
Subjt:  AHLDLSFKFYSLSGDVNGVLGQTYGSNYVSKAKMGVAMPVFGGVNEFASSNIFATDCKVARFNGQLDGNDSSLEAETYANMDCGSDMEG-GVVCKR

XP_038894866.1 uncharacterized protein LOC120083266 [Benincasa hispida]6.0e-18276.47Show/hide
Query:  MARMTIFLFFFFLLFSVLVEGAPNA-------NNY--------------------------------SPPPSRKLKSPPPPYIYSSPPPPYIYSSPPPPP
        MAR+ IFLF F L FS +VEGAPNA       +NY                                 PPPSRKLKSPPPPYIYSSP        PPPPP
Subjt:  MARMTIFLFFFFLLFSVLVEGAPNA-------NNY--------------------------------SPPPSRKLKSPPPPYIYSSPPPPYIYSSPPPPP

Query:  YIYSS-PPPPPYIYSSPPPPPYIYSSPQPPPPARTEPSPPTSTPTPPTSPPP-SEAWGQKKARCKNRSYPQCYGMELSCPSACPDQCEVDCVTCSPVCNC
        YIYSS PPPPPYIYSSPPPPP       P   A + P PP +TPTPPTSPPP SE  GQKKARCKNR YP CYGMELSCPSACPDQCEVDCVTCSPVCNC
Subjt:  YIYSS-PPPPPYIYSSPPPPPYIYSSPQPPPPARTEPSPPTSTPTPPTSPPP-SEAWGQKKARCKNRSYPQCYGMELSCPSACPDQCEVDCVTCSPVCNC

Query:  NRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIVTDSNLHINAHFIGRRNNNMKRDFTWVQSLGILFDSHKLFIGARKTATWNDAIDRLSISLDDETILLP
        NRPGSVCQDPKFVGGDGITFYFHGQK+++FCIVTDSNLHINAHFIGRRN +MKRDFTWVQSLGILFDSHKLFIGA+KTATWNDAIDRLSISLDDETILLP
Subjt:  NRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIVTDSNLHINAHFIGRRNNNMKRDFTWVQSLGILFDSHKLFIGARKTATWNDAIDRLSISLDDETILLP

Query:  NREGASWSNSSSYEGITITRSRNTNAVEIEVLGNFKIKAVVVPITEKESRVHNYGITQEDCFAHLDLSFKFYSLSGDVNGVLGQTYGSNYVSKAKMGVAM
         +EGA+WSNS+ Y+G+ ITRSRNTNAVEI+V GNFKIKAVVVPITEK+SRVHNYGITQEDCFAHLDLSFKFY+LSGDVNGVLGQTYGSNYVSK KMGVAM
Subjt:  NREGASWSNSSSYEGITITRSRNTNAVEIEVLGNFKIKAVVVPITEKESRVHNYGITQEDCFAHLDLSFKFYSLSGDVNGVLGQTYGSNYVSKAKMGVAM

Query:  PVFGGVNEFASSNIFATDCKVARFNGQLDG-NDSSLEAETYANMDCGSDME-GGVVCKR
        PVFGG NEFASSN+FATDC+VARF+GQL G +DSSLEAE +ANM CGSDME GGVVCKR
Subjt:  PVFGGVNEFASSNIFATDCKVARFNGQLDG-NDSSLEAETYANMDCGSDME-GGVVCKR

TrEMBL top hitse value%identityAlignment
A0A0A0LVQ3 Uncharacterized protein1.2e-18084.34Show/hide
Query:  NYSPPPSRKLKSPPPPYIYSS-PPPPYIYSS-PPPPPYIYSSPPPPPYIYSS-PPPPPYIYSSPQPPPPARTEPSPPTSTPTPPTSPPP-SEAWGQKKAR
        N  PPP RKLKSPPPPYIYSS PPPPYIYSS PPPPP +YSSPPPPPYIYSS PPPPPYI +SP PPPP+    SPPT TP+ PTSPPP SE  GQKKAR
Subjt:  NYSPPPSRKLKSPPPPYIYSS-PPPPYIYSS-PPPPPYIYSSPPPPPYIYSS-PPPPPYIYSSPQPPPPARTEPSPPTSTPTPPTSPPP-SEAWGQKKAR

Query:  CKNRSYPQCYGMELSCPSACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIVTDSNLHINAHFIGRRNNNMKRDFTWVQSLG
        CKNR YP CYGMELSCPS+CPD CEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHG+K++DFCIVTDSNLHINAHFIGRRN NMKRDFTWVQSLG
Subjt:  CKNRSYPQCYGMELSCPSACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIVTDSNLHINAHFIGRRNNNMKRDFTWVQSLG

Query:  ILFDSHKLFIGARKTATWNDAIDRLSISLDDETILLPNREGASWSNSSSYEGITITRSRNTNAVEIEVLGNFKIKAVVVPITEKESRVHNYGITQEDCFA
        ILFDSHKLFIGA+KTATWNDA DRLS+SLD+ETI+LPN+EGA+WSNS+S +GITITR++NTNAVEI+V GNFKIKAVVVPITE +SR+HNYGITQEDCFA
Subjt:  ILFDSHKLFIGARKTATWNDAIDRLSISLDDETILLPNREGASWSNSSSYEGITITRSRNTNAVEIEVLGNFKIKAVVVPITEKESRVHNYGITQEDCFA

Query:  HLDLSFKFYSLSGDVNGVLGQTYGSNYVSKAKMGVAMPVFGGVNEFASSNIFATDCKVARFNGQLD-GNDSSLEAETYAN-MDCGSDMEGGVVCKR
        HLDLSFKFY+LSGDVNGVLGQTY SNYVSK KMGVAMPVFGG+NEFASSNIFAT+C+VARF+G+LD  +DSSLEAE YAN M CGSD+EGGVVCKR
Subjt:  HLDLSFKFYSLSGDVNGVLGQTYGSNYVSKAKMGVAMPVFGGVNEFASSNIFATDCKVARFNGQLD-GNDSSLEAETYAN-MDCGSDMEGGVVCKR

A0A1S3CF01 uncharacterized protein LOC1035002031.3e-17483.85Show/hide
Query:  PPPPYIYSSPPPPYIYSSPPPPPYIYSSPPPPPYIYSSPPPPPYIYSSPQPPPPARTEPSPPT---STPTPPTSPPP-SEAWGQKKARCKNRSYPQCYGM
        PPPP    SPPPPYIYSSPPPPPYIYSSP         PPPPPYIY+SP PPPPA   PSPPT   STPTPPTSPPP SEA GQKKARCKNRSYP CYGM
Subjt:  PPPPYIYSSPPPPYIYSSPPPPPYIYSSPPPPPYIYSSPPPPPYIYSSPQPPPPARTEPSPPT---STPTPPTSPPP-SEAWGQKKARCKNRSYPQCYGM

Query:  ELSCPSACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIVTDSNLHINAHFIGRRNNNMKRDFTWVQSLGILFDSHKLFIGA
        ELSCPS+CPD CEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHG+K++DFCIVTDSNLHINAHFIGRRN NMKRDFTWVQSLGILFDSHKLFIGA
Subjt:  ELSCPSACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIVTDSNLHINAHFIGRRNNNMKRDFTWVQSLGILFDSHKLFIGA

Query:  RKTATWNDAIDRLSISLDDETILLPNREGASWSNSSSYEGITITRSRNTNAVEIEVLGNFKIKAVVVPITEKESRVHNYGITQEDCFAHLDLSFKFYSLS
        +KTATWNDAIDRLS+SLDDETILL N+EGA+W NS+S + ITITR++NTNAVEIEV GNFKIKAVVVPITE +SR+HNYGITQEDCFAHLDLSFKFY+LS
Subjt:  RKTATWNDAIDRLSISLDDETILLPNREGASWSNSSSYEGITITRSRNTNAVEIEVLGNFKIKAVVVPITEKESRVHNYGITQEDCFAHLDLSFKFYSLS

Query:  GDVNGVLGQTYGSNYVSKAKMGVAMPVFGGVNEFASSNIFATDCKVARFNGQLDG-NDSSLEAETYAN-MDCGSDMEGGVVCKR
        GDVNGVLGQTY SNYVSK KMG AMPVFGGVNEFASSNIF+TDC+VARF+G+ DG +DSSLEAE YA+ M CGSD EGGVVCKR
Subjt:  GDVNGVLGQTYGSNYVSKAKMGVAMPVFGGVNEFASSNIFATDCKVARFNGQLDG-NDSSLEAETYAN-MDCGSDMEGGVVCKR

A0A5D3DR05 TGF-beta-activated kinase 1 and MAP3K7-binding protein 3-like1.3e-17483.85Show/hide
Query:  PPPPYIYSSPPPPYIYSSPPPPPYIYSSPPPPPYIYSSPPPPPYIYSSPQPPPPARTEPSPPT---STPTPPTSPPP-SEAWGQKKARCKNRSYPQCYGM
        PPPP    SPPPPYIYSSPPPPPYIYSSP         PPPPPYIY+SP PPPPA   PSPPT   STPTPPTSPPP SEA GQKKARCKNRSYP CYGM
Subjt:  PPPPYIYSSPPPPYIYSSPPPPPYIYSSPPPPPYIYSSPPPPPYIYSSPQPPPPARTEPSPPT---STPTPPTSPPP-SEAWGQKKARCKNRSYPQCYGM

Query:  ELSCPSACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIVTDSNLHINAHFIGRRNNNMKRDFTWVQSLGILFDSHKLFIGA
        ELSCPS+CPD CEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHG+K++DFCIVTDSNLHINAHFIGRRN NMKRDFTWVQSLGILFDSHKLFIGA
Subjt:  ELSCPSACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIVTDSNLHINAHFIGRRNNNMKRDFTWVQSLGILFDSHKLFIGA

Query:  RKTATWNDAIDRLSISLDDETILLPNREGASWSNSSSYEGITITRSRNTNAVEIEVLGNFKIKAVVVPITEKESRVHNYGITQEDCFAHLDLSFKFYSLS
        +KTATWNDAIDRLS+SLDDETILL N+EGA+W NS+S + ITITR++NTNAVEIEV GNFKIKAVVVPITE +SR+HNYGITQEDCFAHLDLSFKFY+LS
Subjt:  RKTATWNDAIDRLSISLDDETILLPNREGASWSNSSSYEGITITRSRNTNAVEIEVLGNFKIKAVVVPITEKESRVHNYGITQEDCFAHLDLSFKFYSLS

Query:  GDVNGVLGQTYGSNYVSKAKMGVAMPVFGGVNEFASSNIFATDCKVARFNGQLDG-NDSSLEAETYAN-MDCGSDMEGGVVCKR
        GDVNGVLGQTY SNYVSK KMG AMPVFGGVNEFASSNIF+TDC+VARF+G+ DG +DSSLEAE YA+ M CGSD EGGVVCKR
Subjt:  GDVNGVLGQTYGSNYVSKAKMGVAMPVFGGVNEFASSNIFATDCKVARFNGQLDG-NDSSLEAETYAN-MDCGSDMEGGVVCKR

A0A6J1FCE2 uncharacterized protein LOC1114441161.9e-17381.31Show/hide
Query:  PPPSRKLKS-PPPPYIYSS--PPPPYIYSSPPPPP-YIYSSPPPPPYIYSSPPPPPYIYSSPQPPPPARTEPSPP-TSTPTPPTS---PPPSEAWGQKKA
        PP  RKLKS PPPPY+YSS  PPPPYIYSSPPPPP ++Y SPPPPPYIYSSPPPPP+IYSS  PPPPA TEP PP   TPTPPTS   PP SEA GQKK 
Subjt:  PPPSRKLKS-PPPPYIYSS--PPPPYIYSSPPPPP-YIYSSPPPPPYIYSSPPPPPYIYSSPQPPPPARTEPSPP-TSTPTPPTS---PPPSEAWGQKKA

Query:  RCKNRSYPQCYGMELSCPSACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIVTDSNLHINAHFIGRRNNNMKRDFTWVQSL
        RCKNRS+P CYGMEL+CP+ CP QCEVDCVTCS VCNCNRPG+VCQDP+F+GGDGITFYFHG+K+KDFCIVTDSNLHINAHFIGRRN +MKRDFTWVQSL
Subjt:  RCKNRSYPQCYGMELSCPSACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIVTDSNLHINAHFIGRRNNNMKRDFTWVQSL

Query:  GILFDSHKLFIGARKTATWNDAIDRLSISLDDETILLPNREGASWSNSSSYEGITITRSRNTNAVEIEVLGNFKIKAVVVPITEKESRVHNYGITQEDCF
        GILFDSH+LFIGARKT+TW+DA DRLS+S +++TI+L NREGA+WSNS++YEGITITR+RNTNAVEI V GNFKIKAVVVPITEKESR+H YGITQEDCF
Subjt:  GILFDSHKLFIGARKTATWNDAIDRLSISLDDETILLPNREGASWSNSSSYEGITITRSRNTNAVEIEVLGNFKIKAVVVPITEKESRVHNYGITQEDCF

Query:  AHLDLSFKFYSLSGDVNGVLGQTYGSNYVSKAKMGVAMPVFGGVNEFASSNIFATDCKVARFNGQLDGNDSSLEAETYANMDCGSDMEG-GVVCKR
        AHLDLSFKFY+LSG VNGVLGQTYGSNYVS+AKMGVAMPV GG  EFASS  FATDC VARFNGQL+G DSSLE   Y NM CGSDMEG GVVCKR
Subjt:  AHLDLSFKFYSLSGDVNGVLGQTYGSNYVSKAKMGVAMPVFGGVNEFASSNIFATDCKVARFNGQLDGNDSSLEAETYANMDCGSDMEG-GVVCKR

A0A6J1I078 uncharacterized protein LOC1114685301.7e-17481.31Show/hide
Query:  PPPSRKLKS-PPPPYIYSS--PPPPYIYSSPPPPP-YIYSSPPPPPYIYSSPPPPPYIYSSPQPPPPARTEPSPP-TSTPTPPTS---PPPSEAWGQKKA
        PP  RKLKS PPPPY+YSS  PPPPYIYSSPPPPP ++YSSPPPPPYIYSSPPPPPY+YSSP PPPPA TEP PP   TP PPTS   PP SEA GQKK 
Subjt:  PPPSRKLKS-PPPPYIYSS--PPPPYIYSSPPPPP-YIYSSPPPPPYIYSSPPPPPYIYSSPQPPPPARTEPSPP-TSTPTPPTS---PPPSEAWGQKKA

Query:  RCKNRSYPQCYGMELSCPSACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIVTDSNLHINAHFIGRRNNNMKRDFTWVQSL
        RCKNRS+P CYGMEL+CP+ CP QCEVDCVTCS VCNCNRPG+VCQDP+F+GGDGITFYFHG+K++DFCIVTDSNLHINAHFIGRRN +MKRDFTWVQSL
Subjt:  RCKNRSYPQCYGMELSCPSACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIVTDSNLHINAHFIGRRNNNMKRDFTWVQSL

Query:  GILFDSHKLFIGARKTATWNDAIDRLSISLDDETILLPNREGASWSNSSSYEGITITRSRNTNAVEIEVLGNFKIKAVVVPITEKESRVHNYGITQEDCF
        GILFDSH+LFIGARKT+TW+DA DRLS+S +++TI+L NREGA+WSNS++YEGITITR+RNTNAVEI V GNFKIKAVVVPITEKESR+H YGITQEDCF
Subjt:  GILFDSHKLFIGARKTATWNDAIDRLSISLDDETILLPNREGASWSNSSSYEGITITRSRNTNAVEIEVLGNFKIKAVVVPITEKESRVHNYGITQEDCF

Query:  AHLDLSFKFYSLSGDVNGVLGQTYGSNYVSKAKMGVAMPVFGGVNEFASSNIFATDCKVARFNGQLDGNDSSLEAETYANMDCGSDMEG-GVVCKR
        AHLDLSFKFY+LSG+VNGVLGQTYGSNYVS+AKMGVAMPV GG  EFASS  FATDC VARFNGQL+G DSSLE   Y NM CGSDMEG GVVCKR
Subjt:  AHLDLSFKFYSLSGDVNGVLGQTYGSNYVSKAKMGVAMPVFGGVNEFASSNIFATDCKVARFNGQLDGNDSSLEAETYANMDCGSDMEG-GVVCKR

SwissProt top hitse value%identityAlignment
O65375 Leucine-rich repeat extensin-like protein 12.8e-1756.76Show/hide
Query:  APNANNYSPPPSRKLKSPPPPYIYSSPPPPYIYSSPPPPPYIYSSPPPPPYIYSSP--------PPPPYIYSSPQPPPPARTEPSPPTSTPTP-------
        +P+   Y PPP     SPPPPY+YSSPPPPY+YSSPPPPPY+YSSPPPPPY+YSSP        PPPPY+YSSP PPPP+   P P +S P P       
Subjt:  APNANNYSPPPSRKLKSPPPPYIYSSPPPPYIYSSPPPPPYIYSSPPPPPYIYSSP--------PPPPYIYSSPQPPPPARTEPSPPTSTPTP-------

Query:  -PTSPPPSEAW
          + PPPS  +
Subjt:  -PTSPPPSEAW

P13983 Extensin9.4e-0546.55Show/hide
Query:  VEGAPNANNYSPPPSRKLK------------SPPPPYIYSSPPPPYIYSSPPPPPYIYSSPP----PPPYIYSSPPPPPYIYSSPQPPPPARTEPSPPTS
        V   P   +YSPPP   L             SPPPP    SPPPP  YS P P P  YS PP    PPP  Y+ PPP P  YS   PPPPA + P PPT 
Subjt:  VEGAPNANNYSPPPSRKLK------------SPPPPYIYSSPPPPYIYSSPPPPPYIYSSPP----PPPYIYSSPPPPPYIYSSPQPPPPARTEPSPPTS

Query:  TPTPPTSPPPSEAWGQ
        +P PPT  PP  A+ Q
Subjt:  TPTPPTSPPPSEAWGQ

Q9M1G9 Extensin-21.8e-0851.85Show/hide
Query:  VEGAPNANNYSPPPSRKLKSPPPPYIYSSPPPPYIYSSP------PPPPYIYSSPPPP-----PYIYSSPPPPPYIYSSPQPPPPARTEPSPPTSTPTPP
        V  +P    YSP P    KSPPPPY+YSSPPPPY   SP      PPPPY+Y+SPPPP     P +Y   PPPPY+YSSP PPP     P     +P PP
Subjt:  VEGAPNANNYSPPPSRKLKSPPPPYIYSSPPPPYIYSSP------PPPPYIYSSPPPP-----PYIYSSPPPPPYIYSSPQPPPPARTEPSPPTSTPTPP

Query:  ---TSPPP
           +SPPP
Subjt:  ---TSPPP

Q9T0K5 Leucine-rich repeat extensin-like protein 31.0e-0650.94Show/hide
Query:  VEGAPNANNYSPPPSRKLKSPPPP---YIYSSPPPPYI--YSSPPPPPYIYSSPPPPPYIYSSPPPPPYIYSSPQPPPPARTEPSPPT----STPTPPTS
        V   P    YSPPP      PPPP     YS PPPP +  YSSPPPPP  YSSPPPPP  YSSPPPPP ++ S  PPP       PP+    S+P PP S
Subjt:  VEGAPNANNYSPPPSRKLKSPPPP---YIYSSPPPPYI--YSSPPPPPYIYSSPPPPPYIYSSPPPPPYIYSSPQPPPPARTEPSPPT----STPTPPTS

Query:  PPPSEA
         P  E+
Subjt:  PPPSEA

Q9XIL9 Pollen-specific leucine-rich repeat extensin-like protein 31.0e-0654.84Show/hide
Query:  YSPPPSRKLKSPPPPYIYSSPPPPYIYSSPPPPPYIYSSPPPPPYIYSSPP----PPPYIYSSPQP---PPPARTEPSPPTSTPTPPT-SPPP
        +SPPP   + SPPPP +YS PPPP +YS PPPPP +YS PPPPP ++S PP    PPP ++S P P   PPP    P PP  +P PP  SPPP
Subjt:  YSPPPSRKLKSPPPPYIYSSPPPPYIYSSPPPPPYIYSSPPPPPYIYSSPP----PPPYIYSSPQP---PPPARTEPSPPTSTPTPPT-SPPP

Arabidopsis top hitse value%identityAlignment
AT3G19430.1 late embryogenesis abundant protein-related / LEA protein-related5.3e-10451.85Show/hide
Query:  PNANNYSPP-PSRKLKSPPPPYIYSSPPPPYIYSSPPPPPYIYSSPP----PPPYIYSSPPPPPYIYSSPQPPPPARTEPSPPTSTPTPPTSPPPS---E
        P+  + +PP P+  + SPPPP    SPPPP    S P PP +  +PP    P P   +  PP P + S P   P   T PS PT + +PP  PPPS   E
Subjt:  PNANNYSPP-PSRKLKSPPPPYIYSSPPPPYIYSSPPPPPYIYSSPP----PPPYIYSSPPPPPYIYSSPQPPPPARTEPSPPTSTPTPPTSPPPS---E

Query:  AWGQKKARCKNRSYPQCYGMELSCPSACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIVTDSNLHINAHFIGRRNNNMKRD
        A G K+ RCK +  P CYG+E +CP+ CP  C+VDCVTC PVCNC++PGSVCQDP+F+GGDG+TFYFHG+K+ +FC+++D NLHINAHFIG+R   M RD
Subjt:  AWGQKKARCKNRSYPQCYGMELSCPSACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIVTDSNLHINAHFIGRRNNNMKRD

Query:  FTWVQSLGILFDSHKLFIGARKTATWNDAIDRLSISLDDETILLPNREGASWSNSSS-YEGITITR-SRNTNAVEIEVLGNFKIKAVVVPITEKESRVHN
        FTWVQS+ ILF +H+L++GA KTATW+D++DR+++S D   I LP  +GA W++S   Y  +++ R + +TN +E+EV G  KI A VVPIT ++SR+H 
Subjt:  FTWVQSLGILFDSHKLFIGARKTATWNDAIDRLSISLDDETILLPNREGASWSNSSS-YEGITITR-SRNTNAVEIEVLGNFKIKAVVVPITEKESRVHN

Query:  YGITQEDCFAHLDLSFKFYSLSGDVNGVLGQTYGSNYVSKAKMGVAMPVFGGVNEFASSNIFATDCKVARFNGQLDGNDSSLEAETYANMDCGSDMEG-G
        Y + ++DC AHLDL FKF  LS +V+GVLGQTY SNYVS+ K+GV MPV GG  EF ++ +FA DC  ARF G  D N+   + E    M C S + G G
Subjt:  YGITQEDCFAHLDLSFKFYSLSGDVNGVLGQTYGSNYVSKAKMGVAMPVFGGVNEFASSNIFATDCKVARFNGQLDGNDSSLEAETYANMDCGSDMEG-G

Query:  VVCKR
        VVCKR
Subjt:  VVCKR

AT4G27400.1 Late embryogenesis abundant (LEA) protein-related1.5e-6137.89Show/hide
Query:  EAWGQKKARCKNRSYPQCYGMELSCPSACPDQ---------CEVDCV--TCSPVC-----NCNRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIVTDSNL
        +AW   +  C   + P+C    + CP  CP +         C VDC    C  VC     NC   GS+C DP+F+GGDGI FYFHG+  + F IV+D + 
Subjt:  EAWGQKKARCKNRSYPQCYGMELSCPSACPDQ---------CEVDCV--TCSPVC-----NCNRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIVTDSNL

Query:  HINAHFIGRRNNNMKRDFTWVQSLGILFDSHKLFIGARKTATWNDAIDRLSISLDDETILLPNREGASWSNSSSYEGITITRSRNTNAVEIEVLGNFKIK
         INA F G R     RDFTW+Q+LG LF+SHK  +   K ATW+  +D L  ++D + +++P    ++W   SS + I I R    N+V + +    +I 
Subjt:  HINAHFIGRRNNNMKRDFTWVQSLGILFDSHKLFIGARKTATWNDAIDRLSISLDDETILLPNREGASWSNSSSYEGITITRSRNTNAVEIEVLGNFKIK

Query:  AVVVPITEKESRVHNYGITQEDCFAHLDLSFKFYSLSGDVNGVLGQTYGSNYVSKAKMGVAMPVFGGVNEFASSNIFATDCKVARFNGQLDGNDSSLEAE
          VVP+T+++ R+HNY +  +DCFAH ++ FKF +LS  V+G+LG+TY  ++ + AK GV MPV GG + F +S++ +  CK   F+        S++ +
Subjt:  AVVVPITEKESRVHNYGITQEDCFAHLDLSFKFYSLSGDVNGVLGQTYGSNYVSKAKMGVAMPVFGGVNEFASSNIFATDCKVARFNGQLDGNDSSLEAE

Query:  -TYANMDC--GSDMEGGVVCKR
         TYA +DC  G+    G+VC++
Subjt:  -TYANMDC--GSDMEGGVVCKR

AT5G54370.1 Late embryogenesis abundant (LEA) protein-related8.9e-6740.71Show/hide
Query:  CKNRSYPQCYGMELSCPSACPDQ---------CEVDC--VTCSPVC-----NCNRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIVTDSNLHINAHFIGR
        C N  Y +CY   + CP  CP +         C  DC   TC   C     NCNRPGS C DP+F+GGDGI FYFHG+  ++F +V+DS+L IN  FIG 
Subjt:  CKNRSYPQCYGMELSCPSACPDQ---------CEVDC--VTCSPVC-----NCNRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIVTDSNLHINAHFIGR

Query:  RNNNMKRDFTWVQSLGILFDSHKLFIGARKTATWNDAIDRLSISLDDETILLPNREGASWSNSSSYEGITITRSRNTNAVEIEVLGNFKIKAVVVPITEK
        R     RDFTW+Q+LG LF+S+K  + A KTA+W++ ID L  S D + + +P    ++W   S  + I I R    N+V + +    +I   VVP+T++
Subjt:  RNNNMKRDFTWVQSLGILFDSHKLFIGARKTATWNDAIDRLSISLDDETILLPNREGASWSNSSSYEGITITRSRNTNAVEIEVLGNFKIKAVVVPITEK

Query:  ESRVHNYGITQEDCFAHLDLSFKFYSLSGDVNGVLGQTYGSNYVSKAKMGVAMPVFGGVNEFASSNIFATDCKVARFNGQLDGNDSSLEAETYANMDC--
        + R+H+Y +  +DCFAHL++ F+F++LS  V+G+LG+TY  ++ + AK GVAMPV GG + F +S++ + DCK   F+      DS      YA +DC  
Subjt:  ESRVHNYGITQEDCFAHLDLSFKFYSLSGDVNGVLGQTYGSNYVSKAKMGVAMPVFGGVNEFASSNIFATDCKVARFNGQLDGNDSSLEAETYANMDC--

Query:  GSDMEGGVVCKR
        G+    G+VC++
Subjt:  GSDMEGGVVCKR

AT5G60520.1 Late embryogenesis abundant (LEA) protein-related3.5e-6341.96Show/hide
Query:  GQKKARCKNRSYPQCYGMELSCPSACPDQ----------CEVDCVT-CSPVC-----NCNRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIVTDSNLHIN
        GQ++ +C  R    C    L+CP  CP++          C +DC + C   C     NCN  GS+C DP+FVGGDG+ FYFHG K+ +F IV+D NL IN
Subjt:  GQKKARCKNRSYPQCYGMELSCPSACPDQ----------CEVDCVT-CSPVC-----NCNRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIVTDSNLHIN

Query:  AHFIGRRNNNMKRDFTWVQSLGILFDSHKLFIGARKTATWNDAIDRLSISLDDETILLPNREGASWSNSSSYEGITITRSRNTNAVEIEVLGNFKIKAVV
        AHFIG R     RDFTWVQ+  ++FDSH L I A+K A+W+D++D L +  + E + +P    A W        + + R+   N V + V G  +I   V
Subjt:  AHFIGRRNNNMKRDFTWVQSLGILFDSHKLFIGARKTATWNDAIDRLSISLDDETILLPNREGASWSNSSSYEGITITRSRNTNAVEIEVLGNFKIKAVV

Query:  VPITEKESRVHNYGITQEDCFAHLDLSFKFYSLSGDVNGVLGQTYGSNYVSKAKMGVAMPVFGGVNEFASSNIFATDCKVARFNGQ
         PI ++E RVH Y + ++D FAHL+  FKF++LS  V GVLG+TY   YVS  K GV MP+ GG +++ + ++F+  C V RF G+
Subjt:  VPITEKESRVHNYGITQEDCFAHLDLSFKFYSLSGDVNGVLGQTYGSNYVSKAKMGVAMPVFGGVNEFASSNIFATDCKVARFNGQ

AT5G60530.1 late embryogenesis abundant protein-related / LEA protein-related7.8e-6343.14Show/hide
Query:  SPPPSEAWGQKKARCKNRSYPQCYGMELSCPSACPDQ----------CEVDCVT-CSPVC-----NCNRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIV
        SP P+   GQ++A C+ R    CY   L CP  CP +          C +DC   C   C     NCN  GS+C DP+FVGGDG+ FYFHG K  +F IV
Subjt:  SPPPSEAWGQKKARCKNRSYPQCYGMELSCPSACPDQ----------CEVDCVT-CSPVC-----NCNRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIV

Query:  TDSNLHINAHFIGRRNNNMKRDFTWVQSLGILFDSHKLFIGARKTATWNDAIDRLSISLDDETILLPNREGASWSN-SSSYEGITITRSRNTNAVEIEVL
        +D+NL INAHFIG R     RDFTWVQ+L ++F++HKL I A +   W++  D  +I  D E I LP  E + W   S   + I I R+   N+V + V 
Subjt:  TDSNLHINAHFIGRRNNNMKRDFTWVQSLGILFDSHKLFIGARKTATWNDAIDRLSISLDDETILLPNREGASWSN-SSSYEGITITRSRNTNAVEIEVL

Query:  GNFKIKAVVVPITEKESRVHNYGITQEDCFAHLDLSFKFYSLSGDVNGVLGQTYGSNYVSKAKMGVAMPVFGGVNEFASSNIFATDCKVARFNGQLDGND
           ++   V PI ++E+RVHNY + Q+D FAHL+  FKF  LS  V GVLG+TY  +YVS AK GV MPV GG +++ + ++F+  C++ RF  Q    +
Subjt:  GNFKIKAVVVPITEKESRVHNYGITQEDCFAHLDLSFKFYSLSGDVNGVLGQTYGSNYVSKAKMGVAMPVFGGVNEFASSNIFATDCKVARFNGQLDGND

Query:  SSLEAE
         SL A+
Subjt:  SSLEAE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAGAATGACCATATTCCTCTTCTTCTTCTTCCTCTTGTTCTCAGTCTTGGTTGAGGGAGCTCCAAACGCCAACAATTATTCTCCTCCGCCGTCACGTAAGCTCAA
ATCTCCACCACCACCGTACATTTATTCTTCCCCACCACCACCGTACATTTATTCTTCCCCACCACCACCACCGTACATCTATTCTTCCCCACCACCACCTCCGTACATTT
ACTCTTCCCCACCACCGCCTCCCTACATTTACTCTTCTCCCCAACCACCTCCGCCAGCTAGGACGGAGCCTTCACCTCCGACTTCGACTCCAACTCCTCCAACATCTCCT
CCGCCGTCCGAGGCGTGGGGCCAAAAGAAAGCTAGGTGCAAGAATAGGAGCTATCCACAATGCTACGGCATGGAGCTAAGTTGTCCAAGTGCTTGTCCTGACCAATGTGA
GGTAGACTGTGTTACTTGCAGCCCTGTTTGCAATTGCAACCGTCCAGGCTCGGTGTGCCAAGACCCAAAATTCGTTGGAGGAGACGGAATCACCTTCTACTTCCATGGCC
AAAAAGAAAAAGATTTCTGCATTGTCACCGATTCAAACCTCCACATCAACGCCCACTTCATTGGCCGACGAAACAACAACATGAAAAGGGACTTCACTTGGGTTCAATCC
CTTGGCATCCTCTTTGACTCCCACAAGCTCTTCATAGGTGCACGAAAAACAGCAACATGGAATGATGCTATCGACCGCCTCTCAATCTCCCTTGATGACGAAACTATCCT
CCTCCCTAACCGAGAGGGTGCTAGCTGGAGCAATTCAAGCTCGTACGAAGGAATCACCATAACCAGGAGTAGAAATACGAACGCGGTCGAAATCGAAGTACTTGGAAACT
TCAAGATCAAAGCCGTCGTGGTTCCGATAACAGAAAAGGAATCAAGGGTCCACAATTATGGGATTACACAAGAGGATTGCTTTGCCCATTTGGACTTGAGCTTCAAGTTC
TATTCATTGAGTGGGGATGTGAATGGGGTTCTGGGGCAGACTTATGGTAGCAACTATGTGAGCAAGGCCAAGATGGGAGTGGCAATGCCTGTTTTTGGTGGTGTCAATGA
GTTTGCTTCTTCAAATATTTTTGCTACGGATTGCAAAGTGGCACGTTTTAATGGGCAGTTGGATGGAAATGACAGTTCTTTAGAGGCTGAAACCTATGCCAATATGGACT
GTGGCAGTGACATGGAAGGTGGAGTTGTTTGCAAAAGATAA
mRNA sequenceShow/hide mRNA sequence
CTCAAAGCTATAAAGCTTGAAACCCAACTCAAAGGTTTTCATATGTATGATGTACAATAATTCCCCATTGCTGAAAGCTGCAGCAAAAGATATTTAATATCAATATATAT
TTTTTGGTTCTGCGCAAAGGAATTAAAGAGCTTCAAGTTTACACCAATTGGATATTGACTTTTGGGAAATTATTGGGGTCCATGTGAATTGTTTATGAATTAAATATGAA
GTAAAGTTGATGATTCATTTGCTTCCAAACCCAAATAGCTAAATTCAAGGCCACACATGTTGGAGAAATTCACTATAAAAAAGAGGCAATTCCATGCATTTGTTACAATA
ACACAAATTTGAGAATTAAATCAGCTTCCAAATATGGCAAGAATGACCATATTCCTCTTCTTCTTCTTCCTCTTGTTCTCAGTCTTGGTTGAGGGAGCTCCAAACGCCAA
CAATTATTCTCCTCCGCCGTCACGTAAGCTCAAATCTCCACCACCACCGTACATTTATTCTTCCCCACCACCACCGTACATTTATTCTTCCCCACCACCACCACCGTACA
TCTATTCTTCCCCACCACCACCTCCGTACATTTACTCTTCCCCACCACCGCCTCCCTACATTTACTCTTCTCCCCAACCACCTCCGCCAGCTAGGACGGAGCCTTCACCT
CCGACTTCGACTCCAACTCCTCCAACATCTCCTCCGCCGTCCGAGGCGTGGGGCCAAAAGAAAGCTAGGTGCAAGAATAGGAGCTATCCACAATGCTACGGCATGGAGCT
AAGTTGTCCAAGTGCTTGTCCTGACCAATGTGAGGTAGACTGTGTTACTTGCAGCCCTGTTTGCAATTGCAACCGTCCAGGCTCGGTGTGCCAAGACCCAAAATTCGTTG
GAGGAGACGGAATCACCTTCTACTTCCATGGCCAAAAAGAAAAAGATTTCTGCATTGTCACCGATTCAAACCTCCACATCAACGCCCACTTCATTGGCCGACGAAACAAC
AACATGAAAAGGGACTTCACTTGGGTTCAATCCCTTGGCATCCTCTTTGACTCCCACAAGCTCTTCATAGGTGCACGAAAAACAGCAACATGGAATGATGCTATCGACCG
CCTCTCAATCTCCCTTGATGACGAAACTATCCTCCTCCCTAACCGAGAGGGTGCTAGCTGGAGCAATTCAAGCTCGTACGAAGGAATCACCATAACCAGGAGTAGAAATA
CGAACGCGGTCGAAATCGAAGTACTTGGAAACTTCAAGATCAAAGCCGTCGTGGTTCCGATAACAGAAAAGGAATCAAGGGTCCACAATTATGGGATTACACAAGAGGAT
TGCTTTGCCCATTTGGACTTGAGCTTCAAGTTCTATTCATTGAGTGGGGATGTGAATGGGGTTCTGGGGCAGACTTATGGTAGCAACTATGTGAGCAAGGCCAAGATGGG
AGTGGCAATGCCTGTTTTTGGTGGTGTCAATGAGTTTGCTTCTTCAAATATTTTTGCTACGGATTGCAAAGTGGCACGTTTTAATGGGCAGTTGGATGGAAATGACAGTT
CTTTAGAGGCTGAAACCTATGCCAATATGGACTGTGGCAGTGACATGGAAGGTGGAGTTGTTTGCAAAAGATAAGTTTGATTTCCGTTAATTGATCAGATTCTTCATTCA
AGGAAGAATAAAATATATGCTCTAATTAATTATACTTTAATAAGTGTAAAATAAAGTGGCAACTTGAAATGTGAAAGGACGTTTGTGTTCTTTTTCAGAAATGAAAAAGG
GTTTATTTATGTATTATTTTCTCTTCTTAATTAATCATGAGATTACTTGTAAGGGAAAGGTATATTGATATTATGTGATGTCTAAAAATTATTTGCTATTTTCTTT
Protein sequenceShow/hide protein sequence
MARMTIFLFFFFLLFSVLVEGAPNANNYSPPPSRKLKSPPPPYIYSSPPPPYIYSSPPPPPYIYSSPPPPPYIYSSPPPPPYIYSSPQPPPPARTEPSPPTSTPTPPTSP
PPSEAWGQKKARCKNRSYPQCYGMELSCPSACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGQKEKDFCIVTDSNLHINAHFIGRRNNNMKRDFTWVQS
LGILFDSHKLFIGARKTATWNDAIDRLSISLDDETILLPNREGASWSNSSSYEGITITRSRNTNAVEIEVLGNFKIKAVVVPITEKESRVHNYGITQEDCFAHLDLSFKF
YSLSGDVNGVLGQTYGSNYVSKAKMGVAMPVFGGVNEFASSNIFATDCKVARFNGQLDGNDSSLEAETYANMDCGSDMEGGVVCKR