; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr030219 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr030219
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionLate embryogenesis abundant protein-related / LEA protein-related protein
Genome locationtig00153574:1485774..1488200
RNA-Seq ExpressionSgr030219
SyntenySgr030219
Gene Ontology termsGO:0001505 - regulation of neurotransmitter levels (biological process)
GO:0007186 - G protein-coupled receptor signaling pathway (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004969 - histamine receptor activity (molecular function)
InterPro domainsIPR009646 - Root cap


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585982.1 hypothetical protein SDJN03_18715, partial [Cucurbita argyrosperma subsp. sororia]1.9e-19477.33Show/hide
Query:  MARTTVVLFLFCLFLSSAVEGAPKAKKVKCKDKKYPNCYKSEHLCPADCLRTCVVDCSTCQPVCTPPPPPPPPPPPPPPKQRKLKSPPPPRYIYSS-PPP
        M +   +LFLF LFLS+AVE  PK KKVKCKDK +P CYKSEH CPADCLRTCVVDCS+C+PVCTPPPPPPP PPPPPPK RKLKSPPPP Y+YSS PPP
Subjt:  MARTTVVLFLFCLFLSSAVEGAPKAKKVKCKDKKYPNCYKSEHLCPADCLRTCVVDCSTCQPVCTPPPPPPPPPPPPPPKQRKLKSPPPPRYIYSS-PPP

Query:  PPHIYSSPPPPPIAAQPSPPLPPV--------------PTPPAST--PPSLPP-PTPPAS-GTPPSSEASGQKKVRCKNRLSFPHCYGMELTCPSDCPSQ
        PP+IYSSPPPPP     SPP PP               P PPA+T  PP +PP PTPP S   PPSSEASGQKKVRCKNR SFPHCYGMELTCP+DCP Q
Subjt:  PPHIYSSPPPPPIAAQPSPPLPPV--------------PTPPAST--PPSLPP-PTPPAS-GTPPSSEASGQKKVRCKNRLSFPHCYGMELTCPSDCPSQ

Query:  CEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGRKDRDFCILTDSNLHINAHFIGRRNADMNRDFTWVQSLGILFHTHKLFIAARKTATWNDAVD
        CEVDCVTCS VCNCNRPGAVCQDPRFIGGDGITFYFHG+KD+DFCI+TDSNLHINAHFIGRRN DM RDFTWVQSLGILF +H+LFI ARKT+TW+DA D
Subjt:  CEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGRKDRDFCILTDSNLHINAHFIGRRNADMNRDFTWVQSLGILFHTHKLFIAARKTATWNDAVD

Query:  RLSLSFNGATILLPDRDGATW-NST---GIVITRTRNTNAVEIDVPGNFKIKAIVVPITEKDSRIHKYGVTQEDCFAHLDLSFKFYALSGDVNGVLGQTY
        RLSL FN  TI+L +R+GATW NST   GI ITRTRNTNAVEI+VPGNFKIKA+VVPITEK+SRIHKYG+TQEDCFAHLDLSFKFYALSG VNGVLGQTY
Subjt:  RLSLSFNGATILLPDRDGATW-NST---GIVITRTRNTNAVEIDVPGNFKIKAIVVPITEKDSRIHKYGVTQEDCFAHLDLSFKFYALSGDVNGVLGQTY

Query:  ASNYVSRAKMGVAMPVLGGDKEFASSSLFASDCAVARFSGQMDEEDSSSEAAAYANMNCASELGGRGVVCKR
         SNYVSRAKMGVAMPVLGGDKEFASS  FA+DCAVARF+GQ++ +DSS E  AY NM+C S++ G GVVCKR
Subjt:  ASNYVSRAKMGVAMPVLGGDKEFASSSLFASDCAVARFSGQMDEEDSSSEAAAYANMNCASELGGRGVVCKR

XP_004139573.2 uncharacterized protein LOC101207232 [Cucumis sativus]4.2e-19775.79Show/hide
Query:  MARTTVVLFLFCLFLSSAVEGAPKAKKVKCKDKKYPNCYKSEHLCPADCLRTCVVDCSTCQPVCTPPPPPPPPPPPPPPKQRKLKS--------------
        MAR  + LF   LFLS+ VEGAPKAKKVKCKDKK+P CYKSEH CPADCLRTCVVDCS+CQPVCTPPPPPPP PPPPPPK RKLKS              
Subjt:  MARTTVVLFLFCLFLSSAVEGAPKAKKVKCKDKKYPNCYKSEHLCPADCLRTCVVDCSTCQPVCTPPPPPPPPPPPPPPKQRKLKS--------------

Query:  -------PPPPRYIYSSPPPPPHIYSS-PPPPPIAAQPSPPLPPVPTPPASTPPSLPPPTPPASGTPPSSEASGQKKVRCKNRLSFPHCYGMELTCPSDC
               PPPP YIYSSPPPPPHIYSS PPPPP   +PSPPLPP PTPP+S+PP L P        PPSSEASGQKKVRCKNR  +PHCYGMEL+CPSDC
Subjt:  -------PPPPRYIYSSPPPPPHIYSS-PPPPPIAAQPSPPLPPVPTPPASTPPSLPPPTPPASGTPPSSEASGQKKVRCKNRLSFPHCYGMELTCPSDC

Query:  PSQCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGRKDRDFCILTDSNLHINAHFIGRRNADMNRDFTWVQSLGILFHTHKLFIAARKTATWND
        PSQCEVDCVTCSPVCNCNRPGAVCQDP+FIGGDGITFYFHG++D+DFCI+TDSNLHINAHFIGRRN DM RDFTWVQSLGILF +H+LFI+ARKT+TW+D
Subjt:  PSQCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGRKDRDFCILTDSNLHINAHFIGRRNADMNRDFTWVQSLGILFHTHKLFIAARKTATWND

Query:  AVDRLSLSFNGATILLPDRDGATW-NST---GIVITRTRNTNAVEIDVPGNFKIKAIVVPITEKDSRIHKYGVTQEDCFAHLDLSFKFYALSGDVNGVLG
        A DRL +S +  TI+LP+++GATW NST   GI ITR+R TNAVEI+VPGNFKIKA+VVPITEK+S IHKYG+TQEDCFAHLDLSFKFYALSG+VNGVLG
Subjt:  AVDRLSLSFNGATILLPDRDGATW-NST---GIVITRTRNTNAVEIDVPGNFKIKAIVVPITEKDSRIHKYGVTQEDCFAHLDLSFKFYALSGDVNGVLG

Query:  QTYASNYVSRAKMGVAMPVLGGDKEFASSSLFASDCAVARFSGQMDEEDSSSEAAAYANMNCASELGGRGVVCKR
        QTY  NYVSRAKMGVAMPVLGGDKEFASSS+FA+DC V RF+ +MDE++S  EAAAYANM+C S++ G+GVVCKR
Subjt:  QTYASNYVSRAKMGVAMPVLGGDKEFASSSLFASDCAVARFSGQMDEEDSSSEAAAYANMNCASELGGRGVVCKR

XP_022937854.1 uncharacterized protein LOC111444116 [Cucurbita moschata]6.7e-19577.54Show/hide
Query:  MARTTVVLFLFCLFLSSAVEGAPKAKKVKCKDKKYPNCYKSEHLCPADCLRTCVVDCSTCQPVCTPPPPPPPPPPPPPPKQRKLKSPPPPRYIYSS-PPP
        M +   +LFLF LFLS+AVE  PK KKVKCKDK +P CYKSEH CPADCLRTCVVDCS+C+PVCTPPPPPPP PPPPPPK RKLKSPPPP Y+YSS PPP
Subjt:  MARTTVVLFLFCLFLSSAVEGAPKAKKVKCKDKKYPNCYKSEHLCPADCLRTCVVDCSTCQPVCTPPPPPPPPPPPPPPKQRKLKSPPPPRYIYSS-PPP

Query:  PPHIYSSPPPPPIAAQPSPPLPPV--------------PTPPAST--PPSLPP-PTPPAS-GTPPSSEASGQKKVRCKNRLSFPHCYGMELTCPSDCPSQ
        PP+IYSSPPPPP     SPP PP               P PPA+T  PP +PP PTPP S   PPSSEASGQKKVRCKNR SFPHCYGMELTCP+DCP Q
Subjt:  PPHIYSSPPPPPIAAQPSPPLPPV--------------PTPPAST--PPSLPP-PTPPAS-GTPPSSEASGQKKVRCKNRLSFPHCYGMELTCPSDCPSQ

Query:  CEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGRKDRDFCILTDSNLHINAHFIGRRNADMNRDFTWVQSLGILFHTHKLFIAARKTATWNDAVD
        CEVDCVTCS VCNCNRPGAVCQDPRFIGGDGITFYFHG+KD+DFCI+TDSNLHINAHFIGRRN DM RDFTWVQSLGILF +H+LFI ARKT+TW+DA D
Subjt:  CEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGRKDRDFCILTDSNLHINAHFIGRRNADMNRDFTWVQSLGILFHTHKLFIAARKTATWNDAVD

Query:  RLSLSFNGATILLPDRDGATW-NST---GIVITRTRNTNAVEIDVPGNFKIKAIVVPITEKDSRIHKYGVTQEDCFAHLDLSFKFYALSGDVNGVLGQTY
        RLSLSFN  TI+L +R+GATW NST   GI ITRTRNTNAVEI+VPGNFKIKA+VVPITEK+SRIHKYG+TQEDCFAHLDLSFKFYALSG VNGVLGQTY
Subjt:  RLSLSFNGATILLPDRDGATW-NST---GIVITRTRNTNAVEIDVPGNFKIKAIVVPITEKDSRIHKYGVTQEDCFAHLDLSFKFYALSGDVNGVLGQTY

Query:  ASNYVSRAKMGVAMPVLGGDKEFASSSLFASDCAVARFSGQMDEEDSSSEAAAYANMNCASELGGRGVVCKR
         SNYVSRAKMGVAMPVLGGDKEFASS  FA+DCAVARF+GQ++ +DSS E  AY NM+C S++ G GVVCKR
Subjt:  ASNYVSRAKMGVAMPVLGGDKEFASSSLFASDCAVARFSGQMDEEDSSSEAAAYANMNCASELGGRGVVCKR

XP_022969544.1 uncharacterized protein LOC111468530 [Cucurbita maxima]4.6e-19677.59Show/hide
Query:  MARTTVVLFLFCLFLSSAVEGAPKAKKVKCKDKKYPNCYKSEHLCPADCLRTCVVDCSTCQPVCTPPPPPPPPPPPPPPKQRKLKSPPPPRYIYSS-PPP
        M +   +LFLF LFLS+AVE  PK KKVKCKDKK+P CYKSEH CPADCLRTCVVDCS+C+PVCTPPPPPPP PPPPPPK RKLKSPPPP Y+YSS PPP
Subjt:  MARTTVVLFLFCLFLSSAVEGAPKAKKVKCKDKKYPNCYKSEHLCPADCLRTCVVDCSTCQPVCTPPPPPPPPPPPPPPKQRKLKSPPPPRYIYSS-PPP

Query:  PPHIYSSPPPPPIAAQPSPPLPPV---------------PTPPAST--PPSLPP-PTPPAS-GTPPSSEASGQKKVRCKNRLSFPHCYGMELTCPSDCPS
        PP+IYSSPPPPP     SPP PP                P PPA+T  PP +PP P PP S   PPSSEASGQKKVRCKNR SFPHCYGMELTCP+DCP 
Subjt:  PPHIYSSPPPPPIAAQPSPPLPPV---------------PTPPAST--PPSLPP-PTPPAS-GTPPSSEASGQKKVRCKNRLSFPHCYGMELTCPSDCPS

Query:  QCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGRKDRDFCILTDSNLHINAHFIGRRNADMNRDFTWVQSLGILFHTHKLFIAARKTATWNDAV
        QCEVDCVTCS VCNCNRPGAVCQDPRFIGGDGITFYFHG+KDRDFCI+TDSNLHINAHFIGRRN DM RDFTWVQSLGILF +H+LFI ARKT+TW+DA 
Subjt:  QCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGRKDRDFCILTDSNLHINAHFIGRRNADMNRDFTWVQSLGILFHTHKLFIAARKTATWNDAV

Query:  DRLSLSFNGATILLPDRDGATW-NST---GIVITRTRNTNAVEIDVPGNFKIKAIVVPITEKDSRIHKYGVTQEDCFAHLDLSFKFYALSGDVNGVLGQT
        DRLSLSFN  TI+L +R+GATW NST   GI ITRTRNTNAVEI+VPGNFKIKA+VVPITEK+SRIHKYG+TQEDCFAHLDLSFKFYALSG+VNGVLGQT
Subjt:  DRLSLSFNGATILLPDRDGATW-NST---GIVITRTRNTNAVEIDVPGNFKIKAIVVPITEKDSRIHKYGVTQEDCFAHLDLSFKFYALSGDVNGVLGQT

Query:  YASNYVSRAKMGVAMPVLGGDKEFASSSLFASDCAVARFSGQMDEEDSSSEAAAYANMNCASELGGRGVVCKR
        Y SNYVSRAKMGVAMPVLGGDKEFASS  FA+DCAVARF+GQ++ +DSS E  AY NM+C S++ G GVVCKR
Subjt:  YASNYVSRAKMGVAMPVLGGDKEFASSSLFASDCAVARFSGQMDEEDSSSEAAAYANMNCASELGGRGVVCKR

XP_038894596.1 uncharacterized protein LOC120083110 [Benincasa hispida]3.2e-19779.47Show/hide
Query:  MARTTVVLFLFCLFLSSAVEGAPKAKKVKCKDKKYPNCYKSEHLCPADCLRTCVVDCSTCQPVCTPPPPPPPPPPPPPPKQRKLKSPPPPRYIYSSPPPP
        MA+  + L LF LFLS+ VEGAPKAKKVKCKDKKYP CYKS+H CPADCLRTCVVDCS+C+PVCTPPPPPPP PPPPPPK RKLKSPPPP YIYSSPPPP
Subjt:  MARTTVVLFLFCLFLSSAVEGAPKAKKVKCKDKKYPNCYKSEHLCPADCLRTCVVDCSTCQPVCTPPPPPPPPPPPPPPKQRKLKSPPPPRYIYSSPPPP

Query:  PHIYSSPPPPPIAAQPSPPLPPVPTPPASTPPSLPPPTPPASGTPPSSEASGQKKVRCKNRLSFPHCYGMELTCPSDCPSQCEVDCVTCSPVCNCNRPGA
        P+IYSSPPPPP+  + SPPLPP PTP  S PPSL P        PPSSEASGQKKVRCKNR  +PHCYGMEL+CPSDCPSQCEVDCVTCSPVCNC+RPGA
Subjt:  PHIYSSPPPPPIAAQPSPPLPPVPTPPASTPPSLPPPTPPASGTPPSSEASGQKKVRCKNRLSFPHCYGMELTCPSDCPSQCEVDCVTCSPVCNCNRPGA

Query:  VCQDPRFIGGDGITFYFHGRKDRDFCILTDSNLHINAHFIGRRNADMNRDFTWVQSLGILFHTHKLFIAARKTATWNDAVDRLSLSFNGATILLPDRDGA
        VCQDP+FIGGDGITFYFHG+KD+DFCI+TDSNLHINAHFIGRRN +M RDFTWVQSLGILF +HKLFI+A+KT TW+DA DRL LS +   ILLP+++GA
Subjt:  VCQDPRFIGGDGITFYFHGRKDRDFCILTDSNLHINAHFIGRRNADMNRDFTWVQSLGILFHTHKLFIAARKTATWNDAVDRLSLSFNGATILLPDRDGA

Query:  TW-NST---GIVITRTRNTNAVEIDVPGNFKIKAIVVPITEKDSRIHKYGVTQEDCFAHLDLSFKFYALSGDVNGVLGQTYASNYVSRAKMGVAMPVLGG
        TW NST   GI ITR+RNTNAVEI+V GNFKIKA VVPITEK+S IHKYG+TQEDCFAHLDLSFKFYALSG+VNGVLGQTY  NYVSRAKMGVAMPVLGG
Subjt:  TW-NST---GIVITRTRNTNAVEIDVPGNFKIKAIVVPITEKDSRIHKYGVTQEDCFAHLDLSFKFYALSGDVNGVLGQTYASNYVSRAKMGVAMPVLGG

Query:  DKEFASSSLFASDCAVARFSGQMDEEDSSSEAAAYANMNCASELGGRGVVCKR
        DKEFASSS+FA+DC VARFSG++D +DSS EAAAYANM+C S++ G+GVVCKR
Subjt:  DKEFASSSLFASDCAVARFSGQMDEEDSSSEAAAYANMNCASELGGRGVVCKR

TrEMBL top hitse value%identityAlignment
A0A0A0LSM1 Uncharacterized protein2.0e-19775.79Show/hide
Query:  MARTTVVLFLFCLFLSSAVEGAPKAKKVKCKDKKYPNCYKSEHLCPADCLRTCVVDCSTCQPVCTPPPPPPPPPPPPPPKQRKLKS--------------
        MAR  + LF   LFLS+ VEGAPKAKKVKCKDKK+P CYKSEH CPADCLRTCVVDCS+CQPVCTPPPPPPP PPPPPPK RKLKS              
Subjt:  MARTTVVLFLFCLFLSSAVEGAPKAKKVKCKDKKYPNCYKSEHLCPADCLRTCVVDCSTCQPVCTPPPPPPPPPPPPPPKQRKLKS--------------

Query:  -------PPPPRYIYSSPPPPPHIYSS-PPPPPIAAQPSPPLPPVPTPPASTPPSLPPPTPPASGTPPSSEASGQKKVRCKNRLSFPHCYGMELTCPSDC
               PPPP YIYSSPPPPPHIYSS PPPPP   +PSPPLPP PTPP+S+PP L P        PPSSEASGQKKVRCKNR  +PHCYGMEL+CPSDC
Subjt:  -------PPPPRYIYSSPPPPPHIYSS-PPPPPIAAQPSPPLPPVPTPPASTPPSLPPPTPPASGTPPSSEASGQKKVRCKNRLSFPHCYGMELTCPSDC

Query:  PSQCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGRKDRDFCILTDSNLHINAHFIGRRNADMNRDFTWVQSLGILFHTHKLFIAARKTATWND
        PSQCEVDCVTCSPVCNCNRPGAVCQDP+FIGGDGITFYFHG++D+DFCI+TDSNLHINAHFIGRRN DM RDFTWVQSLGILF +H+LFI+ARKT+TW+D
Subjt:  PSQCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGRKDRDFCILTDSNLHINAHFIGRRNADMNRDFTWVQSLGILFHTHKLFIAARKTATWND

Query:  AVDRLSLSFNGATILLPDRDGATW-NST---GIVITRTRNTNAVEIDVPGNFKIKAIVVPITEKDSRIHKYGVTQEDCFAHLDLSFKFYALSGDVNGVLG
        A DRL +S +  TI+LP+++GATW NST   GI ITR+R TNAVEI+VPGNFKIKA+VVPITEK+S IHKYG+TQEDCFAHLDLSFKFYALSG+VNGVLG
Subjt:  AVDRLSLSFNGATILLPDRDGATW-NST---GIVITRTRNTNAVEIDVPGNFKIKAIVVPITEKDSRIHKYGVTQEDCFAHLDLSFKFYALSGDVNGVLG

Query:  QTYASNYVSRAKMGVAMPVLGGDKEFASSSLFASDCAVARFSGQMDEEDSSSEAAAYANMNCASELGGRGVVCKR
        QTY  NYVSRAKMGVAMPVLGGDKEFASSS+FA+DC V RF+ +MDE++S  EAAAYANM+C S++ G+GVVCKR
Subjt:  QTYASNYVSRAKMGVAMPVLGGDKEFASSSLFASDCAVARFSGQMDEEDSSSEAAAYANMNCASELGGRGVVCKR

A0A6J1CLE3 uncharacterized protein LOC1110126812.1e-19479.96Show/hide
Query:  MARTTVVLFLFCLFLSSAVEGAP-----KAKKVKCKDKKYPNCYKSEHLCPADCLRTCVVDCSTCQPVCTPPPPPPPPPPPPPPKQRKLKSPPPPRYIYS
        MAR  + LFLF L LS+AVE AP     K KKVKCKDKKYP CYKS+  CP +C RTCVVDCSTCQPVC PPPPPPP PPPPPPK RK KSPPPP YIYS
Subjt:  MARTTVVLFLFCLFLSSAVEGAP-----KAKKVKCKDKKYPNCYKSEHLCPADCLRTCVVDCSTCQPVCTPPPPPPPPPPPPPPKQRKLKSPPPPRYIYS

Query:  S-PPPPPHIYSSPPPPPIAAQPSPPLPPV---PTPPASTPPSLPPPTPPASGTPPSSEASGQKKVRCKNRLSFPHCYGMELTCPSDCPSQCEVDCVTCSP
        S PPPPP+IYSSPPPPP      PP PP    PTPP   PP+L PP PP+   PPSSEASGQK+VRCK+R SFPHCYGMEL+CPSDCPSQCEVDCVTCSP
Subjt:  S-PPPPPHIYSSPPPPPIAAQPSPPLPPV---PTPPASTPPSLPPPTPPASGTPPSSEASGQKKVRCKNRLSFPHCYGMELTCPSDCPSQCEVDCVTCSP

Query:  VCNCNRPGAVCQDPRFIGGDGITFYFHGRKDRDFCILTDSNLHINAHFIGRRNADMNRDFTWVQSLGILFHTHKLFIAARKTATWNDAVDRLSLSFNGAT
        VCNC+RPGAVCQDP+FIGGDGITFYFHG+KDRDFCI+TDSNLHINA FIGRRN DMNRDFTWVQSLGILF +HKLFI ARKTA W+DAVDRLSLS N  T
Subjt:  VCNCNRPGAVCQDPRFIGGDGITFYFHGRKDRDFCILTDSNLHINAHFIGRRNADMNRDFTWVQSLGILFHTHKLFIAARKTATWNDAVDRLSLSFNGAT

Query:  ILLPDRDGATWNSTGIVITRTRNTNAVEIDVPGNFKIKAIVVPITEKDSRIHKYGVTQEDCFAHLDLSFKFYALSGDVNGVLGQTYASNYVSRAKMGVAM
        ILLP++D +TWNSTGI ITRTRNTNAVEI+VPGNFKIKA+VVPITEK+SRIHKYG+TQEDCFAHLDLSFKFYALSGDVNGVLGQTYA+NYVSRAKMGVAM
Subjt:  ILLPDRDGATWNSTGIVITRTRNTNAVEIDVPGNFKIKAIVVPITEKDSRIHKYGVTQEDCFAHLDLSFKFYALSGDVNGVLGQTYASNYVSRAKMGVAM

Query:  PVLGGDKEFASSSLFASDCAVARFSGQMDEEDSSSEAAAYANMNCASE-LGGRGVVCKR
        PVLGGDKEFASSSLFA+DCAVA+FSG    E++S EA AYANMNC S+ LG +GVVCKR
Subjt:  PVLGGDKEFASSSLFASDCAVARFSGQMDEEDSSSEAAAYANMNCASE-LGGRGVVCKR

A0A6J1FAJ8 uncharacterized protein LOC1114439072.7e-19477.12Show/hide
Query:  MARTTVVLFLFCLFLSSAVEGAPKAKKVKCKDKKYPNCYKSEHLCPADCLRTCVVDCSTCQPVCTPPPPPPPPPPPPPPKQRKLKSPPPPRYIYSS-PPP
        M +   +LFLF LFLS+AVE  PK KKVKCKDK +P CYKSEH CPADCLRTCVVDCS+C+PVCTPPPPPPP PPPPPPK RKLKSPPPP Y+YSS PPP
Subjt:  MARTTVVLFLFCLFLSSAVEGAPKAKKVKCKDKKYPNCYKSEHLCPADCLRTCVVDCSTCQPVCTPPPPPPPPPPPPPPKQRKLKSPPPPRYIYSS-PPP

Query:  PPHIYSSPPPPPIAAQPSPPLPPV--------------PTPPAST--PPSLPP-PTPPAS-GTPPSSEASGQKKVRCKNRLSFPHCYGMELTCPSDCPSQ
        PP+IYSSPPPPP     SPP PP               P PPA+T  PP +PP PTPP S   PPSSEASGQKKVRCKNR SFPHCYGMELTCP+DCP Q
Subjt:  PPHIYSSPPPPPIAAQPSPPLPPV--------------PTPPAST--PPSLPP-PTPPAS-GTPPSSEASGQKKVRCKNRLSFPHCYGMELTCPSDCPSQ

Query:  CEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGRKDRDFCILTDSNLHINAHFIGRRNADMNRDFTWVQSLGILFHTHKLFIAARKTATWNDAVD
        CEVDCVTCS VCNCNRPGAVCQDPRFIGGDGITFYFHG+KD+DFCI+TDSNLHINAHFIGRRN DM RDFTWVQSLGILF +H+LFI ARKT+TW+DA D
Subjt:  CEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGRKDRDFCILTDSNLHINAHFIGRRNADMNRDFTWVQSLGILFHTHKLFIAARKTATWNDAVD

Query:  RLSLSFNGATILLPDRDGATW-NST---GIVITRTRNTNAVEIDVPGNFKIKAIVVPITEKDSRIHKYGVTQEDCFAHLDLSFKFYALSGDVNGVLGQTY
        RLSL FN  TI+L +++GATW NST   GI ITRTRNTNAVEI+VPGNFKIKA+VVPITEK+SRIHKYG+TQEDCFAHLDLSFKFYALSG VNGVLGQTY
Subjt:  RLSLSFNGATILLPDRDGATW-NST---GIVITRTRNTNAVEIDVPGNFKIKAIVVPITEKDSRIHKYGVTQEDCFAHLDLSFKFYALSGDVNGVLGQTY

Query:  ASNYVSRAKMGVAMPVLGGDKEFASSSLFASDCAVARFSGQMDEEDSSSEAAAYANMNCASELGGRGVVCKR
         SNYVSRAKMGVAMPVLGGDKEFASS  FA+DCAVARF+GQ++ +DSS E  AY NM+C S++ G GVVCKR
Subjt:  ASNYVSRAKMGVAMPVLGGDKEFASSSLFASDCAVARFSGQMDEEDSSSEAAAYANMNCASELGGRGVVCKR

A0A6J1FCE2 uncharacterized protein LOC1114441163.2e-19577.54Show/hide
Query:  MARTTVVLFLFCLFLSSAVEGAPKAKKVKCKDKKYPNCYKSEHLCPADCLRTCVVDCSTCQPVCTPPPPPPPPPPPPPPKQRKLKSPPPPRYIYSS-PPP
        M +   +LFLF LFLS+AVE  PK KKVKCKDK +P CYKSEH CPADCLRTCVVDCS+C+PVCTPPPPPPP PPPPPPK RKLKSPPPP Y+YSS PPP
Subjt:  MARTTVVLFLFCLFLSSAVEGAPKAKKVKCKDKKYPNCYKSEHLCPADCLRTCVVDCSTCQPVCTPPPPPPPPPPPPPPKQRKLKSPPPPRYIYSS-PPP

Query:  PPHIYSSPPPPPIAAQPSPPLPPV--------------PTPPAST--PPSLPP-PTPPAS-GTPPSSEASGQKKVRCKNRLSFPHCYGMELTCPSDCPSQ
        PP+IYSSPPPPP     SPP PP               P PPA+T  PP +PP PTPP S   PPSSEASGQKKVRCKNR SFPHCYGMELTCP+DCP Q
Subjt:  PPHIYSSPPPPPIAAQPSPPLPPV--------------PTPPAST--PPSLPP-PTPPAS-GTPPSSEASGQKKVRCKNRLSFPHCYGMELTCPSDCPSQ

Query:  CEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGRKDRDFCILTDSNLHINAHFIGRRNADMNRDFTWVQSLGILFHTHKLFIAARKTATWNDAVD
        CEVDCVTCS VCNCNRPGAVCQDPRFIGGDGITFYFHG+KD+DFCI+TDSNLHINAHFIGRRN DM RDFTWVQSLGILF +H+LFI ARKT+TW+DA D
Subjt:  CEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGRKDRDFCILTDSNLHINAHFIGRRNADMNRDFTWVQSLGILFHTHKLFIAARKTATWNDAVD

Query:  RLSLSFNGATILLPDRDGATW-NST---GIVITRTRNTNAVEIDVPGNFKIKAIVVPITEKDSRIHKYGVTQEDCFAHLDLSFKFYALSGDVNGVLGQTY
        RLSLSFN  TI+L +R+GATW NST   GI ITRTRNTNAVEI+VPGNFKIKA+VVPITEK+SRIHKYG+TQEDCFAHLDLSFKFYALSG VNGVLGQTY
Subjt:  RLSLSFNGATILLPDRDGATW-NST---GIVITRTRNTNAVEIDVPGNFKIKAIVVPITEKDSRIHKYGVTQEDCFAHLDLSFKFYALSGDVNGVLGQTY

Query:  ASNYVSRAKMGVAMPVLGGDKEFASSSLFASDCAVARFSGQMDEEDSSSEAAAYANMNCASELGGRGVVCKR
         SNYVSRAKMGVAMPVLGGDKEFASS  FA+DCAVARF+GQ++ +DSS E  AY NM+C S++ G GVVCKR
Subjt:  ASNYVSRAKMGVAMPVLGGDKEFASSSLFASDCAVARFSGQMDEEDSSSEAAAYANMNCASELGGRGVVCKR

A0A6J1I078 uncharacterized protein LOC1114685302.2e-19677.59Show/hide
Query:  MARTTVVLFLFCLFLSSAVEGAPKAKKVKCKDKKYPNCYKSEHLCPADCLRTCVVDCSTCQPVCTPPPPPPPPPPPPPPKQRKLKSPPPPRYIYSS-PPP
        M +   +LFLF LFLS+AVE  PK KKVKCKDKK+P CYKSEH CPADCLRTCVVDCS+C+PVCTPPPPPPP PPPPPPK RKLKSPPPP Y+YSS PPP
Subjt:  MARTTVVLFLFCLFLSSAVEGAPKAKKVKCKDKKYPNCYKSEHLCPADCLRTCVVDCSTCQPVCTPPPPPPPPPPPPPPKQRKLKSPPPPRYIYSS-PPP

Query:  PPHIYSSPPPPPIAAQPSPPLPPV---------------PTPPAST--PPSLPP-PTPPAS-GTPPSSEASGQKKVRCKNRLSFPHCYGMELTCPSDCPS
        PP+IYSSPPPPP     SPP PP                P PPA+T  PP +PP P PP S   PPSSEASGQKKVRCKNR SFPHCYGMELTCP+DCP 
Subjt:  PPHIYSSPPPPPIAAQPSPPLPPV---------------PTPPAST--PPSLPP-PTPPAS-GTPPSSEASGQKKVRCKNRLSFPHCYGMELTCPSDCPS

Query:  QCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGRKDRDFCILTDSNLHINAHFIGRRNADMNRDFTWVQSLGILFHTHKLFIAARKTATWNDAV
        QCEVDCVTCS VCNCNRPGAVCQDPRFIGGDGITFYFHG+KDRDFCI+TDSNLHINAHFIGRRN DM RDFTWVQSLGILF +H+LFI ARKT+TW+DA 
Subjt:  QCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFYFHGRKDRDFCILTDSNLHINAHFIGRRNADMNRDFTWVQSLGILFHTHKLFIAARKTATWNDAV

Query:  DRLSLSFNGATILLPDRDGATW-NST---GIVITRTRNTNAVEIDVPGNFKIKAIVVPITEKDSRIHKYGVTQEDCFAHLDLSFKFYALSGDVNGVLGQT
        DRLSLSFN  TI+L +R+GATW NST   GI ITRTRNTNAVEI+VPGNFKIKA+VVPITEK+SRIHKYG+TQEDCFAHLDLSFKFYALSG+VNGVLGQT
Subjt:  DRLSLSFNGATILLPDRDGATW-NST---GIVITRTRNTNAVEIDVPGNFKIKAIVVPITEKDSRIHKYGVTQEDCFAHLDLSFKFYALSGDVNGVLGQT

Query:  YASNYVSRAKMGVAMPVLGGDKEFASSSLFASDCAVARFSGQMDEEDSSSEAAAYANMNCASELGGRGVVCKR
        Y SNYVSRAKMGVAMPVLGGDKEFASS  FA+DCAVARF+GQ++ +DSS E  AY NM+C S++ G GVVCKR
Subjt:  YASNYVSRAKMGVAMPVLGGDKEFASSSLFASDCAVARFSGQMDEEDSSSEAAAYANMNCASELGGRGVVCKR

SwissProt top hitse value%identityAlignment
O65375 Leucine-rich repeat extensin-like protein 15.1e-0456.98Show/hide
Query:  PPPPPPPPPPPP-----PPKQRKLKSPPPPRYIYSSPPPPPHIYSSPPPPPIAAQPSPPLPPVPTPPASTPPSLPPPTPPASGTPP
        PPPPP P PPPP     PP      SPPPP Y+YSSPPPPP++YSSPPPP + +  SPP P V + P   PPS PPP P +S  PP
Subjt:  PPPPPPPPPPPP-----PPKQRKLKSPPPPRYIYSSPPPPPHIYSSPPPPPIAAQPSPPLPPVPTPPASTPPSLPPPTPPASGTPP

Arabidopsis top hitse value%identityAlignment
AT3G19430.1 late embryogenesis abundant protein-related / LEA protein-related2.3e-11346.52Show/hide
Query:  CKDKKYPNCYKSEHLCPADCLRTCVVDCSTCQPVCTP---------------------------------------------PPPP--------------
        CK KKY +CY  EH+CP  C  +C V+C++C+P+C P                                             PPPP              
Subjt:  CKDKKYPNCYKSEHLCPADCLRTCVVDCSTCQPVCTP---------------------------------------------PPPP--------------

Query:  PPPP--------------PPPP------PKQRKLKSPPPPRYIYSSPPPPPHIYSSP---PPPPIA---------------AQPSPPLPPVPTPPASTP-
        PPPP              PPPP      P      SPPPP    S P P P + + P   PPPP++                 P+PP P VP+PP  TP 
Subjt:  PPPP--------------PPPP------PKQRKLKSPPPPRYIYSSPPPPPHIYSSP---PPPPIA---------------AQPSPPLPPVPTPPASTP-

Query:  ---PSLP--------PPTPPA----SGTPP-------SSEASGQKKVRCKNRLSFPHCYGMELTCPSDCPSQCEVDCVTCSPVCNCNRPGAVCQDPRFIG
           PS+P        PPTPP+    SG+PP         EA+G K+VRCK + S   CYG+E TCP+DCP  C+VDCVTC PVCNC++PG+VCQDPRFIG
Subjt:  ---PSLP--------PPTPPA----SGTPP-------SSEASGQKKVRCKNRLSFPHCYGMELTCPSDCPSQCEVDCVTCSPVCNCNRPGAVCQDPRFIG

Query:  GDGITFYFHGRKDRDFCILTDSNLHINAHFIGRRNADMNRDFTWVQSLGILFHTHKLFIAARKTATWNDAVDRLSLSFNGATILLPDRDGATWNSTGIVI
        GDG+TFYFHG+KD +FC+++D NLHINAHFIG+R A M RDFTWVQS+ ILF TH+L++ A KTATW+D+VDR+++SF+G  I LP  DGA W S+  V 
Subjt:  GDGITFYFHGRKDRDFCILTDSNLHINAHFIGRRNADMNRDFTWVQSLGILFHTHKLFIAARKTATWNDAVDRLSLSFNGATILLPDRDGATWNSTGIVI

Query:  TR------TRNTNAVEIDVPGNFKIKAIVVPITEKDSRIHKYGVTQEDCFAHLDLSFKFYALSGDVNGVLGQTYASNYVSRAKMGVAMPVLGGDKEFASS
                  +TN +E++V G  KI A VVPIT +DSRIH Y V ++DC AHLDL FKF  LS +V+GVLGQTY SNYVSR K+GV MPV+GGD+EF ++
Subjt:  TR------TRNTNAVEIDVPGNFKIKAIVVPITEKDSRIHKYGVTQEDCFAHLDLSFKFYALSGDVNGVLGQTYASNYVSRAKMGVAMPVLGGDKEFASS

Query:  SLFASDCAVARFSGQMDEEDSSSEAAAYANMNCASELGGRGVVCKR
         LFA DC+ ARF+G  D  +  S+      M+CAS LGG+GVVCKR
Subjt:  SLFASDCAVARFSGQMDEEDSSSEAAAYANMNCASELGGRGVVCKR

AT4G27400.1 Late embryogenesis abundant (LEA) protein-related2.3e-6039.34Show/hide
Query:  PHCYGMELTCPSDCPSQ---------CEVDCV--TCSPVC-----NCNRPGAVCQDPRFIGGDGITFYFHGRKDRDFCILTDSNLHINAHFIGRRNADMN
        P C    + CP +CP++         C VDC    C  VC     NC   G++C DPRFIGGDGI FYFHG+ +  F I++D +  INA F G R A   
Subjt:  PHCYGMELTCPSDCPSQ---------CEVDCV--TCSPVC-----NCNRPGAVCQDPRFIGGDGITFYFHGRKDRDFCILTDSNLHINAHFIGRRNADMN

Query:  RDFTWVQSLGILFHTHKLFIAARKTATWNDAVDRLSLSFNGATILLPDRDGATWNST--GIVITRTRNTNAVEIDVPGNFKIKAIVVPITEKDSRIHKYG
        RDFTW+Q+LG LF++HK  +   K ATW+  +D L  + +G  +++P    +TW S+   I I R    N+V + +    +I   VVP+T++D RIH Y 
Subjt:  RDFTWVQSLGILFHTHKLFIAARKTATWNDAVDRLSLSFNGATILLPDRDGATWNST--GIVITRTRNTNAVEIDVPGNFKIKAIVVPITEKDSRIHKYG

Query:  VTQEDCFAHLDLSFKFYALSGDVNGVLGQTYASNYVSRAKMGVAMPVLGGDKEFASSSLFASDCAVARFSGQMDEEDSS-SEAAAYANMNCA-SELGGRG
        +  +DCFAH ++ FKF  LS  V+G+LG+TY  ++ + AK GV MPV+GG+  F +SSL +  C    FS        S    + YA ++C+     G G
Subjt:  VTQEDCFAHLDLSFKFYALSGDVNGVLGQTYASNYVSRAKMGVAMPVLGGDKEFASSSLFASDCAVARFSGQMDEEDSS-SEAAAYANMNCA-SELGGRG

Query:  VVCKR
        +VC++
Subjt:  VVCKR

AT5G54370.1 Late embryogenesis abundant (LEA) protein-related1.9e-7042.49Show/hide
Query:  VRCKNRLSFPHCYGMELTCPSDCPSQ---------CEVDC--VTCSPVC-----NCNRPGAVCQDPRFIGGDGITFYFHGRKDRDFCILTDSNLHINAHF
        V C N   +  CY   + CP +CPS+         C  DC   TC   C     NCNRPG+ C DPRFIGGDGI FYFHG+ + +F +++DS+L IN  F
Subjt:  VRCKNRLSFPHCYGMELTCPSDCPSQ---------CEVDC--VTCSPVC-----NCNRPGAVCQDPRFIGGDGITFYFHGRKDRDFCILTDSNLHINAHF

Query:  IGRRNADMNRDFTWVQSLGILFHTHKLFIAARKTATWNDAVDRLSLSFNGATILLPDRDGATWNSTG--IVITRTRNTNAVEIDVPGNFKIKAIVVPITE
        IG R A   RDFTW+Q+LG LF+++K  + A KTA+W++ +D L  S++G  + +P+   +TW S    I I R    N+V + +    +I   VVP+T+
Subjt:  IGRRNADMNRDFTWVQSLGILFHTHKLFIAARKTATWNDAVDRLSLSFNGATILLPDRDGATWNSTG--IVITRTRNTNAVEIDVPGNFKIKAIVVPITE

Query:  KDSRIHKYGVTQEDCFAHLDLSFKFYALSGDVNGVLGQTYASNYVSRAKMGVAMPVLGGDKEFASSSLFASDCAVARFSGQMDEEDSSSEAAAYANMNCA
        +D RIH Y V  +DCFAHL++ F+F+ LS  V+G+LG+TY  ++ + AK GVAMPV+GG+  F +SSL ++DC    FS    E DS      YA ++C 
Subjt:  KDSRIHKYGVTQEDCFAHLDLSFKFYALSGDVNGVLGQTYASNYVSRAKMGVAMPVLGGDKEFASSSLFASDCAVARFSGQMDEEDSSSEAAAYANMNCA

Query:  -SELGGRGVVCKR
             G G+VC++
Subjt:  -SELGGRGVVCKR

AT5G60520.1 Late embryogenesis abundant (LEA) protein-related2.8e-6644.1Show/hide
Query:  SGQKKVRCKNRLSFPHCYGMELTCPSDCPSQ----------CEVDCVT-CSPVC-----NCNRPGAVCQDPRFIGGDGITFYFHGRKDRDFCILTDSNLH
        SGQ++V+C  R S   C    LTCP +CP +          C +DC + C   C     NCN  G++C DPRF+GGDG+ FYFHG KD +F I++D NL 
Subjt:  SGQKKVRCKNRLSFPHCYGMELTCPSDCPSQ----------CEVDCVT-CSPVC-----NCNRPGAVCQDPRFIGGDGITFYFHGRKDRDFCILTDSNLH

Query:  INAHFIGRRNADMNRDFTWVQSLGILFHTHKLFIAARKTATWNDAVDRLSLSFNGATILLPDRDGATW----NSTGIVITRTRNTNAVEIDVPGNFKIKA
        INAHFIG R A   RDFTWVQ+  ++F +H L IAA+K A+W+D+VD L + +NG  + +P    A W    +   +++ RT   N V + V G  +I  
Subjt:  INAHFIGRRNADMNRDFTWVQSLGILFHTHKLFIAARKTATWNDAVDRLSLSFNGATILLPDRDGATW----NSTGIVITRTRNTNAVEIDVPGNFKIKA

Query:  IVVPITEKDSRIHKYGVTQEDCFAHLDLSFKFYALSGDVNGVLGQTYASNYVSRAKMGVAMPVLGGDKEFASSSLFASDCAVARFSGQ
         V PI +++ R+HKY + ++D FAHL+  FKF+ LS  V GVLG+TY   YVS  K GV MP++GG+ ++ + SLF+  C V RF G+
Subjt:  IVVPITEKDSRIHKYGVTQEDCFAHLDLSFKFYALSGDVNGVLGQTYASNYVSRAKMGVAMPVLGGDKEFASSSLFASDCAVARFSGQ

AT5G60530.1 late embryogenesis abundant protein-related / LEA protein-related6.5e-6341.61Show/hide
Query:  SGQKKVRCKNRLSFPHCYGMELTCPSDCPSQ----------CEVDCVT-CSPVC-----NCNRPGAVCQDPRFIGGDGITFYFHGRKDRDFCILTDSNLH
        +GQ++  C+ R S   CY   L CP +CP +          C +DC   C   C     NCN  G++C DPRF+GGDG+ FYFHG K  +F I++D+NL 
Subjt:  SGQKKVRCKNRLSFPHCYGMELTCPSDCPSQ----------CEVDCVT-CSPVC-----NCNRPGAVCQDPRFIGGDGITFYFHGRKDRDFCILTDSNLH

Query:  INAHFIGRRNADMNRDFTWVQSLGILFHTHKLFIAARKTATWNDAVDRLSLSFNGATILLPDRDGATWNSTG-----IVITRTRNTNAVEIDVPGNFKIK
        INAHFIG R     RDFTWVQ+L ++F  HKL I A +   W++  D  ++ ++G  I LP+ + + W         I+I RT   N+V + V    ++ 
Subjt:  INAHFIGRRNADMNRDFTWVQSLGILFHTHKLFIAARKTATWNDAVDRLSLSFNGATILLPDRDGATWNSTG-----IVITRTRNTNAVEIDVPGNFKIK

Query:  AIVVPITEKDSRIHKYGVTQEDCFAHLDLSFKFYALSGDVNGVLGQTYASNYVSRAKMGVAMPVLGGDKEFASSSLFASDCAVARFSGQMDEEDSSSE
          V PI ++++R+H Y + Q+D FAHL+  FKF  LS  V GVLG+TY  +YVS AK GV MPVLGG+ ++ + SLF+  C + RF  Q  EE  S++
Subjt:  AIVVPITEKDSRIHKYGVTQEDCFAHLDLSFKFYALSGDVNGVLGQTYASNYVSRAKMGVAMPVLGGDKEFASSSLFASDCAVARFSGQMDEEDSSSE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGGCGGCGAATGGCGCGAACCACCGTGGTTCTCTTTCTGTTCTGTCTCTTCCTCTCCTCCGCCGTCGAGGGAGCTCCCAAGGCCAAGAAAGTTAAATGCAAAGATAA
GAAGTACCCCAACTGTTACAAATCCGAGCACCTTTGCCCCGCCGATTGCCTTCGAACTTGTGTTGTTGACTGTTCAACTTGCCAACCTGTGTGTACTCCTCCCCCGCCTC
CTCCGCCGCCTCCCCCTCCGCCACCACCAAAACAACGCAAGCTGAAATCTCCACCACCACCGCGGTATATTTACTCTTCCCCTCCGCCGCCTCCGCACATTTACTCTTCT
CCACCACCTCCACCAATAGCGGCCCAACCTTCACCTCCGCTCCCACCAGTTCCTACTCCTCCAGCGTCGACACCGCCTTCTCTTCCCCCTCCAACTCCGCCGGCTTCAGG
AACACCGCCGTCGTCTGAAGCGTCGGGGCAAAAGAAAGTCAGGTGCAAGAATAGGCTGAGCTTTCCGCATTGCTATGGCATGGAGCTAACTTGTCCAAGTGATTGCCCTA
GCCAATGTGAGGTTGATTGTGTTACTTGCAGCCCTGTTTGCAATTGCAATCGTCCCGGCGCAGTGTGCCAAGACCCACGATTCATCGGAGGAGATGGAATCACCTTCTAC
TTCCATGGCAGAAAAGACAGAGATTTCTGCATCCTCACCGACTCCAACCTCCACATCAACGCCCACTTCATCGGCCGGCGAAACGCCGACATGAACAGAGACTTCACTTG
GGTCCAATCCCTCGGCATCCTCTTCCACACCCACAAACTCTTCATCGCCGCCCGCAAAACCGCCACCTGGAACGACGCCGTCGACCGCCTCTCCCTCTCCTTCAACGGCG
CAACCATCCTCCTCCCCGACCGCGACGGTGCAACGTGGAACTCGACGGGGATCGTCATAACCAGAACCCGAAACACCAACGCCGTCGAGATCGACGTCCCCGGGAACTTC
AAGATCAAAGCCATCGTGGTCCCGATAACGGAGAAGGATTCGAGGATACACAAGTATGGGGTTACGCAGGAGGATTGCTTCGCGCATTTGGATCTGAGCTTCAAGTTCTA
TGCTCTCAGTGGCGACGTCAATGGCGTTCTGGGGCAGACTTACGCGAGCAACTATGTCAGCAGGGCGAAGATGGGGGTGGCGATGCCGGTTCTCGGCGGGGATAAGGAGT
TCGCTTCTTCCAGCCTTTTTGCTTCAGATTGCGCCGTCGCCCGATTCAGTGGGCAGATGGATGAAGAAGACAGTTCTTCAGAGGCTGCGGCCTATGCGAATATGAACTGT
GCCAGTGAATTGGGAGGCCGAGGAGTTGTTTGCAAACGATAA
mRNA sequenceShow/hide mRNA sequence
ATGCGGCGGCGAATGGCGCGAACCACCGTGGTTCTCTTTCTGTTCTGTCTCTTCCTCTCCTCCGCCGTCGAGGGAGCTCCCAAGGCCAAGAAAGTTAAATGCAAAGATAA
GAAGTACCCCAACTGTTACAAATCCGAGCACCTTTGCCCCGCCGATTGCCTTCGAACTTGTGTTGTTGACTGTTCAACTTGCCAACCTGTGTGTACTCCTCCCCCGCCTC
CTCCGCCGCCTCCCCCTCCGCCACCACCAAAACAACGCAAGCTGAAATCTCCACCACCACCGCGGTATATTTACTCTTCCCCTCCGCCGCCTCCGCACATTTACTCTTCT
CCACCACCTCCACCAATAGCGGCCCAACCTTCACCTCCGCTCCCACCAGTTCCTACTCCTCCAGCGTCGACACCGCCTTCTCTTCCCCCTCCAACTCCGCCGGCTTCAGG
AACACCGCCGTCGTCTGAAGCGTCGGGGCAAAAGAAAGTCAGGTGCAAGAATAGGCTGAGCTTTCCGCATTGCTATGGCATGGAGCTAACTTGTCCAAGTGATTGCCCTA
GCCAATGTGAGGTTGATTGTGTTACTTGCAGCCCTGTTTGCAATTGCAATCGTCCCGGCGCAGTGTGCCAAGACCCACGATTCATCGGAGGAGATGGAATCACCTTCTAC
TTCCATGGCAGAAAAGACAGAGATTTCTGCATCCTCACCGACTCCAACCTCCACATCAACGCCCACTTCATCGGCCGGCGAAACGCCGACATGAACAGAGACTTCACTTG
GGTCCAATCCCTCGGCATCCTCTTCCACACCCACAAACTCTTCATCGCCGCCCGCAAAACCGCCACCTGGAACGACGCCGTCGACCGCCTCTCCCTCTCCTTCAACGGCG
CAACCATCCTCCTCCCCGACCGCGACGGTGCAACGTGGAACTCGACGGGGATCGTCATAACCAGAACCCGAAACACCAACGCCGTCGAGATCGACGTCCCCGGGAACTTC
AAGATCAAAGCCATCGTGGTCCCGATAACGGAGAAGGATTCGAGGATACACAAGTATGGGGTTACGCAGGAGGATTGCTTCGCGCATTTGGATCTGAGCTTCAAGTTCTA
TGCTCTCAGTGGCGACGTCAATGGCGTTCTGGGGCAGACTTACGCGAGCAACTATGTCAGCAGGGCGAAGATGGGGGTGGCGATGCCGGTTCTCGGCGGGGATAAGGAGT
TCGCTTCTTCCAGCCTTTTTGCTTCAGATTGCGCCGTCGCCCGATTCAGTGGGCAGATGGATGAAGAAGACAGTTCTTCAGAGGCTGCGGCCTATGCGAATATGAACTGT
GCCAGTGAATTGGGAGGCCGAGGAGTTGTTTGCAAACGATAA
Protein sequenceShow/hide protein sequence
MRRRMARTTVVLFLFCLFLSSAVEGAPKAKKVKCKDKKYPNCYKSEHLCPADCLRTCVVDCSTCQPVCTPPPPPPPPPPPPPPKQRKLKSPPPPRYIYSSPPPPPHIYSS
PPPPPIAAQPSPPLPPVPTPPASTPPSLPPPTPPASGTPPSSEASGQKKVRCKNRLSFPHCYGMELTCPSDCPSQCEVDCVTCSPVCNCNRPGAVCQDPRFIGGDGITFY
FHGRKDRDFCILTDSNLHINAHFIGRRNADMNRDFTWVQSLGILFHTHKLFIAARKTATWNDAVDRLSLSFNGATILLPDRDGATWNSTGIVITRTRNTNAVEIDVPGNF
KIKAIVVPITEKDSRIHKYGVTQEDCFAHLDLSFKFYALSGDVNGVLGQTYASNYVSRAKMGVAMPVLGGDKEFASSSLFASDCAVARFSGQMDEEDSSSEAAAYANMNC
ASELGGRGVVCKR