; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G00660 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G00660
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionUnknown protein
Genome locationClcChr08:1190494..1201306
RNA-Seq ExpressionClc08G00660
SyntenyClc08G00660
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022925493.1 uncharacterized protein LOC111432779 isoform X2 [Cucurbita moschata]1.3e-28289Show/hide
Query:  IFSSFCEMGSLFSPSFFFLLLFLLSQPFFLILTHCSPHASLEYISAIGDPGMKSPNVRVAFEAWNFCNEVGAEAPHMGSPRLADCADLRTPFASDQQDCS
        +F S+CEMGSLFS SFF LLL      FFL LTHCS H +LE+ISAIGDPGMK+PNVRVAFEAWNFCNEVGAEAP MGSPRLADCADLR P ASD+QDC 
Subjt:  IFSSFCEMGSLFSPSFFFLLLFLLSQPFFLILTHCSPHASLEYISAIGDPGMKSPNVRVAFEAWNFCNEVGAEAPHMGSPRLADCADLRTPFASDQQDCS

Query:  GLGSDSNCIVLQKVSESDNKLGAGEKFPSERFKSYLDPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGKKITKLITDRKFPCF
        G GSDSNCIVL KV+ESDNKLGAGEKFPSERFK Y+DPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLC ENGK + K+ITDR FPCF
Subjt:  GLGSDSNCIVLQKVSESDNKLGAGEKFPSERFKSYLDPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGKKITKLITDRKFPCF

Query:  GKGCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADLNNGIGKNSYFSVSWHKNVSTGSWIFSNQLTTSSKYPWLMLYLRSDATTGFNGGYHYDGR
        G+GCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADL+NGIGKNSYFSVSWHKNVS+GSWIFSN+LTTSSKYPWLMLYLRSDAT GFNGGYHYDGR
Subjt:  GKGCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADLNNGIGKNSYFSVSWHKNVSTGSWIFSNQLTTSSKYPWLMLYLRSDATTGFNGGYHYDGR

Query:  GIMRKLPESPNFKVRLTLDIKNGGGKNSQFYLIDIGSCWKNNGDACNGNTTTDVTRYSEMIINPETSSWCRPNNLASCPPYHVSASGEKIYRNETSRFPY
        GIMRKLPESPNFKVRLTLD+K+GGGKNSQFYLIDIGSCWKNNGDACNG+TTTDVTRYSEMIINPET+SWCRP+NL SCPPYHV ASGEKIYRNETSRFPY
Subjt:  GIMRKLPESPNFKVRLTLDIKNGGGKNSQFYLIDIGSCWKNNGDACNGNTTTDVTRYSEMIINPETSSWCRPNNLASCPPYHVSASGEKIYRNETSRFPY

Query:  SAYHLYCSPGNARHLEKPYDICDPYSNPQSQELLQILPHSEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGVEIYIS
        SAYHLYCSPGN  HLEKPYDICDPYSNPQ+QEL+QILPH EW VHGYP KQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARR+WTSINVG EIYIS
Subjt:  SAYHLYCSPGNARHLEKPYDICDPYSNPQSQELLQILPHSEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGVEIYIS

Query:  EGATAEWNV
        EG+TAEW+V
Subjt:  EGATAEWNV

XP_022973970.1 uncharacterized protein LOC111472586 isoform X2 [Cucurbita maxima]2.7e-28389.19Show/hide
Query:  IFSSFCEMGSLFSPSFFFLLLFLLSQPFFLILTHCSPHASLEYISAIGDPGMKSPNVRVAFEAWNFCNEVGAEAPHMGSPRLADCADLRTPFASDQQDCS
        +F S+CEMGSLFS SFF LLL      FFL LTHCS H SLE+ISAIGDPGMKSPNVRVAFEAWNFCNEVGAEAP MGSPRLADCADLR P ASD+QDC 
Subjt:  IFSSFCEMGSLFSPSFFFLLLFLLSQPFFLILTHCSPHASLEYISAIGDPGMKSPNVRVAFEAWNFCNEVGAEAPHMGSPRLADCADLRTPFASDQQDCS

Query:  GLGSDSNCIVLQKVSESDNKLGAGEKFPSERFKSYLDPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGKKITKLITDRKFPCF
        G GSDSNCIVL KV+ESDNKL AGEKFPS+RFK Y+DPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGK   K+ITDR FPCF
Subjt:  GLGSDSNCIVLQKVSESDNKLGAGEKFPSERFKSYLDPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGKKITKLITDRKFPCF

Query:  GKGCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADLNNGIGKNSYFSVSWHKNVSTGSWIFSNQLTTSSKYPWLMLYLRSDATTGFNGGYHYDGR
        G+GCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADL+NGIGKNSYFSVSWHKNVS+GSWIFSN+LTTSSKYPWLMLYLRSDAT GFNGGYHYDGR
Subjt:  GKGCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADLNNGIGKNSYFSVSWHKNVSTGSWIFSNQLTTSSKYPWLMLYLRSDATTGFNGGYHYDGR

Query:  GIMRKLPESPNFKVRLTLDIKNGGGKNSQFYLIDIGSCWKNNGDACNGNTTTDVTRYSEMIINPETSSWCRPNNLASCPPYHVSASGEKIYRNETSRFPY
        GIMRKLPESPNFKVRLTLD+K+GGGKNSQFYLIDIGSCWKNNGDACNG+TTTDVTRYSEMIINPET+SWCRP+NL SCPPYHV ASGEKIYRNETSRFPY
Subjt:  GIMRKLPESPNFKVRLTLDIKNGGGKNSQFYLIDIGSCWKNNGDACNGNTTTDVTRYSEMIINPETSSWCRPNNLASCPPYHVSASGEKIYRNETSRFPY

Query:  SAYHLYCSPGNARHLEKPYDICDPYSNPQSQELLQILPHSEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGVEIYIS
        SAYHLYCSPGNA HLEKPYD+CDPYSNPQ+QEL+QILPH EW VHGYP KQGDGW+GDPRTWELDVGALSNRLYFYQDPGTKPARR+WTSINVG EIYIS
Subjt:  SAYHLYCSPGNARHLEKPYDICDPYSNPQSQELLQILPHSEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGVEIYIS

Query:  EGATAEWNV
        EGATAEW+V
Subjt:  EGATAEWNV

XP_023535714.1 uncharacterized protein LOC111797062 isoform X1 [Cucurbita pepo subsp. pepo]5.1e-28289.02Show/hide
Query:  IFSSFCEMGSLFSPSFFFLLLFLLSQPFFLILTHCSPHASLEYISAIGDPGMKSPNVRVAFEAWNFCNEVGAEAPHMGSPRLADCADLRTPFAS-DQQDC
        +F S+CEMGSLFS SFF LLL      FFL LTHCS H +LE+ISAIGDPGMK+PNVRVAFEAWNFCNEVGAEAP MGSPRLADCADLR P AS D+QDC
Subjt:  IFSSFCEMGSLFSPSFFFLLLFLLSQPFFLILTHCSPHASLEYISAIGDPGMKSPNVRVAFEAWNFCNEVGAEAPHMGSPRLADCADLRTPFAS-DQQDC

Query:  SGLGSDSNCIVLQKVSESDNKLGAGEKFPSERFKSYLDPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGKKITKLITDRKFPC
         G GSDSNCIVL KV+ESDNKLGAGEKFPSERFK Y+DPDLYVVEKERYL SLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGKK+ K+ITDR FPC
Subjt:  SGLGSDSNCIVLQKVSESDNKLGAGEKFPSERFKSYLDPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGKKITKLITDRKFPC

Query:  FGKGCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADLNNGIGKNSYFSVSWHKNVSTGSWIFSNQLTTSSKYPWLMLYLRSDATTGFNGGYHYDG
        FG+GCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADL+NGIGKNSYFSVSWHKNVS+GSWIFSN+LTTSSKYPWLMLYLRSDAT GFNGGYHYDG
Subjt:  FGKGCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADLNNGIGKNSYFSVSWHKNVSTGSWIFSNQLTTSSKYPWLMLYLRSDATTGFNGGYHYDG

Query:  RGIMRKLPESPNFKVRLTLDIKNGGGKNSQFYLIDIGSCWKNNGDACNGNTTTDVTRYSEMIINPETSSWCRPNNLASCPPYHVSASGEKIYRNETSRFP
        RGIMRKLPESPNFKVRLTLD+++GGGKNSQFYLIDIGSCWKNNGDACNG+TTTDVTRYSEMIINPET+SWCRP+NL SCPPYHV ASGEKIYRNETSRFP
Subjt:  RGIMRKLPESPNFKVRLTLDIKNGGGKNSQFYLIDIGSCWKNNGDACNGNTTTDVTRYSEMIINPETSSWCRPNNLASCPPYHVSASGEKIYRNETSRFP

Query:  YSAYHLYCSPGNARHLEKPYDICDPYSNPQSQELLQILPHSEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGVEIYI
        YSAYHLYCSPGN  HLEKPYDICDPYSNPQ+QEL+QILPH EW VHGYP KQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARR+WTSINVG EIYI
Subjt:  YSAYHLYCSPGNARHLEKPYDICDPYSNPQSQELLQILPHSEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGVEIYI

Query:  SEGATAEWNV
        SEGATAEW+V
Subjt:  SEGATAEWNV

XP_023535715.1 uncharacterized protein LOC111797062 isoform X2 [Cucurbita pepo subsp. pepo]2.1e-28389.19Show/hide
Query:  IFSSFCEMGSLFSPSFFFLLLFLLSQPFFLILTHCSPHASLEYISAIGDPGMKSPNVRVAFEAWNFCNEVGAEAPHMGSPRLADCADLRTPFASDQQDCS
        +F S+CEMGSLFS SFF LLL      FFL LTHCS H +LE+ISAIGDPGMK+PNVRVAFEAWNFCNEVGAEAP MGSPRLADCADLR P ASD+QDC 
Subjt:  IFSSFCEMGSLFSPSFFFLLLFLLSQPFFLILTHCSPHASLEYISAIGDPGMKSPNVRVAFEAWNFCNEVGAEAPHMGSPRLADCADLRTPFASDQQDCS

Query:  GLGSDSNCIVLQKVSESDNKLGAGEKFPSERFKSYLDPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGKKITKLITDRKFPCF
        G GSDSNCIVL KV+ESDNKLGAGEKFPSERFK Y+DPDLYVVEKERYL SLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGKK+ K+ITDR FPCF
Subjt:  GLGSDSNCIVLQKVSESDNKLGAGEKFPSERFKSYLDPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGKKITKLITDRKFPCF

Query:  GKGCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADLNNGIGKNSYFSVSWHKNVSTGSWIFSNQLTTSSKYPWLMLYLRSDATTGFNGGYHYDGR
        G+GCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADL+NGIGKNSYFSVSWHKNVS+GSWIFSN+LTTSSKYPWLMLYLRSDAT GFNGGYHYDGR
Subjt:  GKGCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADLNNGIGKNSYFSVSWHKNVSTGSWIFSNQLTTSSKYPWLMLYLRSDATTGFNGGYHYDGR

Query:  GIMRKLPESPNFKVRLTLDIKNGGGKNSQFYLIDIGSCWKNNGDACNGNTTTDVTRYSEMIINPETSSWCRPNNLASCPPYHVSASGEKIYRNETSRFPY
        GIMRKLPESPNFKVRLTLD+++GGGKNSQFYLIDIGSCWKNNGDACNG+TTTDVTRYSEMIINPET+SWCRP+NL SCPPYHV ASGEKIYRNETSRFPY
Subjt:  GIMRKLPESPNFKVRLTLDIKNGGGKNSQFYLIDIGSCWKNNGDACNGNTTTDVTRYSEMIINPETSSWCRPNNLASCPPYHVSASGEKIYRNETSRFPY

Query:  SAYHLYCSPGNARHLEKPYDICDPYSNPQSQELLQILPHSEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGVEIYIS
        SAYHLYCSPGN  HLEKPYDICDPYSNPQ+QEL+QILPH EW VHGYP KQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARR+WTSINVG EIYIS
Subjt:  SAYHLYCSPGNARHLEKPYDICDPYSNPQSQELLQILPHSEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGVEIYIS

Query:  EGATAEWNV
        EGATAEW+V
Subjt:  EGATAEWNV

XP_038889615.1 uncharacterized protein LOC120079486 [Benincasa hispida]2.4e-30093.5Show/hide
Query:  MKFQ-LTFSIHIFSSFCEMGSLFSPS---FFFLLLFLLSQPFFLILTHCSPHASLEYISAIGDPGMKSPNVRVAFEAWNFCNEVGAEAPHMGSPRLADCA
        MKFQ  TFSIHIF SFCEMGSLFS S   FFF LL LLSQPFFL LT CSPH SLEYISAIGDPGMK+PNVRVAFEAWNFCNEVGAEAPHMGSPRLADCA
Subjt:  MKFQ-LTFSIHIFSSFCEMGSLFSPS---FFFLLLFLLSQPFFLILTHCSPHASLEYISAIGDPGMKSPNVRVAFEAWNFCNEVGAEAPHMGSPRLADCA

Query:  DLRTPFASDQQDCSGLGSDSNCIVLQKVSESDNKLGAGEKFPSERFKSYLDPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGK
        DLRTPFASDQQDCSGL  D+NC+VLQKVSE+DNKLGAGEKFPSERFKSY DPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGK
Subjt:  DLRTPFASDQQDCSGLGSDSNCIVLQKVSESDNKLGAGEKFPSERFKSYLDPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGK

Query:  KITKLITDRKFPCFGKGCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADLNNGIGKNSYFSVSWHKNVSTGSWIFSNQLTTSSKYPWLMLYLRSD
         ITKLITDRKFPCFGKGCMNQP++YHNYSRLVSF KRMVSLTGGFYGTYELDADL++GIGKNSYFSVSWHKNVSTGSWIFSN+L TSSKYPWLMLYLRSD
Subjt:  KITKLITDRKFPCFGKGCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADLNNGIGKNSYFSVSWHKNVSTGSWIFSNQLTTSSKYPWLMLYLRSD

Query:  ATTGFNGGYHYDGRGIMRKLPESPNFKVRLTLDIKNGGGKNSQFYLIDIGSCWKNNGDACNGNTTTDVTRYSEMIINPETSSWCRPNNLASCPPYHVSAS
        ATTGFNGGYHYDGRGIMRKLPESPNFKVRLTLDIKNGGGKNSQFYLIDIGSCWKNNGDACNG+TTTDVTRYSEMIINPETSSWCRPNNL SCPPYHVSAS
Subjt:  ATTGFNGGYHYDGRGIMRKLPESPNFKVRLTLDIKNGGGKNSQFYLIDIGSCWKNNGDACNGNTTTDVTRYSEMIINPETSSWCRPNNLASCPPYHVSAS

Query:  GEKIYRNETSRFPYSAYHLYCSPGNARHLEKPYDICDPYSNPQSQELLQILPHSEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARR
        GEKIYRNETSRFPYSAYHLYCSPGNA HLEKPYDICDPYSNPQ+QELLQILPH EWAVHGYPKKQGDGW+GDPRTWELDVGALSNRLYFYQDPGTKPARR
Subjt:  GEKIYRNETSRFPYSAYHLYCSPGNARHLEKPYDICDPYSNPQSQELLQILPHSEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARR

Query:  VWTSINVGVEIYISEGATAEWNV
        VWTSINVGVEIYISEGATAEW+V
Subjt:  VWTSINVGVEIYISEGATAEWNV

TrEMBL top hitse value%identityAlignment
A0A6J1D795 uncharacterized protein LOC111018276 isoform X12.4e-26184.22Show/hide
Query:  SFCEMGSLFSPSFFFLLLFLLSQPFFLILTHCSPHASLEYISAIGDPGMKSPNVRVAFEAWNFCNEVGAEAPHMGSPRLADCADLRTPFASDQQDCSGLG
        S  EMGSLFS SFFFL         FLIL H SPH S+EY SAIGDPGMK+PNVRV FEAWNFCNEVGAEA HMGSPR+ADCADLR P ASD +DC  L 
Subjt:  SFCEMGSLFSPSFFFLLLFLLSQPFFLILTHCSPHASLEYISAIGDPGMKSPNVRVAFEAWNFCNEVGAEAPHMGSPRLADCADLRTPFASDQQDCSGLG

Query:  SDSNCIVLQKVSESDNKLGAGEKFPSERFKSYLDPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGKKITKLITDRKFPCFGKG
        SD+ C+ LQKV+ESDNKLGAGEKFPSERFK Y DPDLY VEKERYLGSLCEVHDSS+PW FWMIMLKNGNFDKNSTLCPENGK ++K++TDR FPCFG+G
Subjt:  SDSNCIVLQKVSESDNKLGAGEKFPSERFKSYLDPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGKKITKLITDRKFPCFGKG

Query:  CMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADLNNGIGKNSYFSVSWHKNVSTGSWIFSNQLTTSSKYPWLMLYLRSDATTGFNGGYHYDGRGIM
        CMNQPLVYH  SRLVS  +RMVSLTGGFYGTYELDADL+NGIGKNSYFSV+W KNVSTGSWIF ++LTTSSKYPWLMLYLRSDA TGF+GGYHYDGRGIM
Subjt:  CMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADLNNGIGKNSYFSVSWHKNVSTGSWIFSNQLTTSSKYPWLMLYLRSDATTGFNGGYHYDGRGIM

Query:  RKLPESPNFKVRLTLDIKNGGGKNSQFYLIDIGSCWKNNGDACNGNTTTDVTRYSEMIINPETSSWCRPNNLASCPPYHVSASGEKIYRNETSRFPYSAY
        RKLPESPNFKVRLTL IK+GGG N+QFYLIDIGSCWKNNGD CNG+TTTDVTRYSEMIINPET+S C+P+NL +CPPYHVSA+GEKIYRNETSRFPYSAY
Subjt:  RKLPESPNFKVRLTLDIKNGGGKNSQFYLIDIGSCWKNNGDACNGNTTTDVTRYSEMIINPETSSWCRPNNLASCPPYHVSASGEKIYRNETSRFPYSAY

Query:  HLYCSPGNARHLEKPYDICDPYSNPQSQELLQILPHSEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGVEIYISE-G
        HLYCSPGNA HLEKPYDICDPYSNPQ+QEL+QILPH EWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVW SINVG EIYISE G
Subjt:  HLYCSPGNARHLEKPYDICDPYSNPQSQELLQILPHSEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGVEIYISE-G

Query:  ATAEWNV
         TAEW+V
Subjt:  ATAEWNV

A0A6J1EBU8 uncharacterized protein LOC111432779 isoform X11.6e-28188.82Show/hide
Query:  IFSSFCEMGSLFSPSFFFLLLFLLSQPFFLILTHCSPHASLEYISAIGDPGMKSPNVRVAFEAWNFCNEVGAEAPHMGSPRLADCADLRTPFAS-DQQDC
        +F S+CEMGSLFS SFF LLL      FFL LTHCS H +LE+ISAIGDPGMK+PNVRVAFEAWNFCNEVGAEAP MGSPRLADCADLR P AS D+QDC
Subjt:  IFSSFCEMGSLFSPSFFFLLLFLLSQPFFLILTHCSPHASLEYISAIGDPGMKSPNVRVAFEAWNFCNEVGAEAPHMGSPRLADCADLRTPFAS-DQQDC

Query:  SGLGSDSNCIVLQKVSESDNKLGAGEKFPSERFKSYLDPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGKKITKLITDRKFPC
         G GSDSNCIVL KV+ESDNKLGAGEKFPSERFK Y+DPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLC ENGK + K+ITDR FPC
Subjt:  SGLGSDSNCIVLQKVSESDNKLGAGEKFPSERFKSYLDPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGKKITKLITDRKFPC

Query:  FGKGCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADLNNGIGKNSYFSVSWHKNVSTGSWIFSNQLTTSSKYPWLMLYLRSDATTGFNGGYHYDG
        FG+GCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADL+NGIGKNSYFSVSWHKNVS+GSWIFSN+LTTSSKYPWLMLYLRSDAT GFNGGYHYDG
Subjt:  FGKGCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADLNNGIGKNSYFSVSWHKNVSTGSWIFSNQLTTSSKYPWLMLYLRSDATTGFNGGYHYDG

Query:  RGIMRKLPESPNFKVRLTLDIKNGGGKNSQFYLIDIGSCWKNNGDACNGNTTTDVTRYSEMIINPETSSWCRPNNLASCPPYHVSASGEKIYRNETSRFP
        RGIMRKLPESPNFKVRLTLD+K+GGGKNSQFYLIDIGSCWKNNGDACNG+TTTDVTRYSEMIINPET+SWCRP+NL SCPPYHV ASGEKIYRNETSRFP
Subjt:  RGIMRKLPESPNFKVRLTLDIKNGGGKNSQFYLIDIGSCWKNNGDACNGNTTTDVTRYSEMIINPETSSWCRPNNLASCPPYHVSASGEKIYRNETSRFP

Query:  YSAYHLYCSPGNARHLEKPYDICDPYSNPQSQELLQILPHSEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGVEIYI
        YSAYHLYCSPGN  HLEKPYDICDPYSNPQ+QEL+QILPH EW VHGYP KQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARR+WTSINVG EIYI
Subjt:  YSAYHLYCSPGNARHLEKPYDICDPYSNPQSQELLQILPHSEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGVEIYI

Query:  SEGATAEWNV
        SEG+TAEW+V
Subjt:  SEGATAEWNV

A0A6J1EFC7 uncharacterized protein LOC111432779 isoform X26.5e-28389Show/hide
Query:  IFSSFCEMGSLFSPSFFFLLLFLLSQPFFLILTHCSPHASLEYISAIGDPGMKSPNVRVAFEAWNFCNEVGAEAPHMGSPRLADCADLRTPFASDQQDCS
        +F S+CEMGSLFS SFF LLL      FFL LTHCS H +LE+ISAIGDPGMK+PNVRVAFEAWNFCNEVGAEAP MGSPRLADCADLR P ASD+QDC 
Subjt:  IFSSFCEMGSLFSPSFFFLLLFLLSQPFFLILTHCSPHASLEYISAIGDPGMKSPNVRVAFEAWNFCNEVGAEAPHMGSPRLADCADLRTPFASDQQDCS

Query:  GLGSDSNCIVLQKVSESDNKLGAGEKFPSERFKSYLDPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGKKITKLITDRKFPCF
        G GSDSNCIVL KV+ESDNKLGAGEKFPSERFK Y+DPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLC ENGK + K+ITDR FPCF
Subjt:  GLGSDSNCIVLQKVSESDNKLGAGEKFPSERFKSYLDPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGKKITKLITDRKFPCF

Query:  GKGCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADLNNGIGKNSYFSVSWHKNVSTGSWIFSNQLTTSSKYPWLMLYLRSDATTGFNGGYHYDGR
        G+GCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADL+NGIGKNSYFSVSWHKNVS+GSWIFSN+LTTSSKYPWLMLYLRSDAT GFNGGYHYDGR
Subjt:  GKGCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADLNNGIGKNSYFSVSWHKNVSTGSWIFSNQLTTSSKYPWLMLYLRSDATTGFNGGYHYDGR

Query:  GIMRKLPESPNFKVRLTLDIKNGGGKNSQFYLIDIGSCWKNNGDACNGNTTTDVTRYSEMIINPETSSWCRPNNLASCPPYHVSASGEKIYRNETSRFPY
        GIMRKLPESPNFKVRLTLD+K+GGGKNSQFYLIDIGSCWKNNGDACNG+TTTDVTRYSEMIINPET+SWCRP+NL SCPPYHV ASGEKIYRNETSRFPY
Subjt:  GIMRKLPESPNFKVRLTLDIKNGGGKNSQFYLIDIGSCWKNNGDACNGNTTTDVTRYSEMIINPETSSWCRPNNLASCPPYHVSASGEKIYRNETSRFPY

Query:  SAYHLYCSPGNARHLEKPYDICDPYSNPQSQELLQILPHSEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGVEIYIS
        SAYHLYCSPGN  HLEKPYDICDPYSNPQ+QEL+QILPH EW VHGYP KQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARR+WTSINVG EIYIS
Subjt:  SAYHLYCSPGNARHLEKPYDICDPYSNPQSQELLQILPHSEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGVEIYIS

Query:  EGATAEWNV
        EG+TAEW+V
Subjt:  EGATAEWNV

A0A6J1ICQ8 uncharacterized protein LOC111472586 isoform X21.3e-28389.19Show/hide
Query:  IFSSFCEMGSLFSPSFFFLLLFLLSQPFFLILTHCSPHASLEYISAIGDPGMKSPNVRVAFEAWNFCNEVGAEAPHMGSPRLADCADLRTPFASDQQDCS
        +F S+CEMGSLFS SFF LLL      FFL LTHCS H SLE+ISAIGDPGMKSPNVRVAFEAWNFCNEVGAEAP MGSPRLADCADLR P ASD+QDC 
Subjt:  IFSSFCEMGSLFSPSFFFLLLFLLSQPFFLILTHCSPHASLEYISAIGDPGMKSPNVRVAFEAWNFCNEVGAEAPHMGSPRLADCADLRTPFASDQQDCS

Query:  GLGSDSNCIVLQKVSESDNKLGAGEKFPSERFKSYLDPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGKKITKLITDRKFPCF
        G GSDSNCIVL KV+ESDNKL AGEKFPS+RFK Y+DPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGK   K+ITDR FPCF
Subjt:  GLGSDSNCIVLQKVSESDNKLGAGEKFPSERFKSYLDPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGKKITKLITDRKFPCF

Query:  GKGCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADLNNGIGKNSYFSVSWHKNVSTGSWIFSNQLTTSSKYPWLMLYLRSDATTGFNGGYHYDGR
        G+GCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADL+NGIGKNSYFSVSWHKNVS+GSWIFSN+LTTSSKYPWLMLYLRSDAT GFNGGYHYDGR
Subjt:  GKGCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADLNNGIGKNSYFSVSWHKNVSTGSWIFSNQLTTSSKYPWLMLYLRSDATTGFNGGYHYDGR

Query:  GIMRKLPESPNFKVRLTLDIKNGGGKNSQFYLIDIGSCWKNNGDACNGNTTTDVTRYSEMIINPETSSWCRPNNLASCPPYHVSASGEKIYRNETSRFPY
        GIMRKLPESPNFKVRLTLD+K+GGGKNSQFYLIDIGSCWKNNGDACNG+TTTDVTRYSEMIINPET+SWCRP+NL SCPPYHV ASGEKIYRNETSRFPY
Subjt:  GIMRKLPESPNFKVRLTLDIKNGGGKNSQFYLIDIGSCWKNNGDACNGNTTTDVTRYSEMIINPETSSWCRPNNLASCPPYHVSASGEKIYRNETSRFPY

Query:  SAYHLYCSPGNARHLEKPYDICDPYSNPQSQELLQILPHSEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGVEIYIS
        SAYHLYCSPGNA HLEKPYD+CDPYSNPQ+QEL+QILPH EW VHGYP KQGDGW+GDPRTWELDVGALSNRLYFYQDPGTKPARR+WTSINVG EIYIS
Subjt:  SAYHLYCSPGNARHLEKPYDICDPYSNPQSQELLQILPHSEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGVEIYIS

Query:  EGATAEWNV
        EGATAEW+V
Subjt:  EGATAEWNV

A0A6J1IG58 uncharacterized protein LOC111472586 isoform X13.2e-28289.02Show/hide
Query:  IFSSFCEMGSLFSPSFFFLLLFLLSQPFFLILTHCSPHASLEYISAIGDPGMKSPNVRVAFEAWNFCNEVGAEAPHMGSPRLADCADLRTPFAS-DQQDC
        +F S+CEMGSLFS SFF LLL      FFL LTHCS H SLE+ISAIGDPGMKSPNVRVAFEAWNFCNEVGAEAP MGSPRLADCADLR P AS D+QDC
Subjt:  IFSSFCEMGSLFSPSFFFLLLFLLSQPFFLILTHCSPHASLEYISAIGDPGMKSPNVRVAFEAWNFCNEVGAEAPHMGSPRLADCADLRTPFAS-DQQDC

Query:  SGLGSDSNCIVLQKVSESDNKLGAGEKFPSERFKSYLDPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGKKITKLITDRKFPC
         G GSDSNCIVL KV+ESDNKL AGEKFPS+RFK Y+DPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGK   K+ITDR FPC
Subjt:  SGLGSDSNCIVLQKVSESDNKLGAGEKFPSERFKSYLDPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGKKITKLITDRKFPC

Query:  FGKGCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADLNNGIGKNSYFSVSWHKNVSTGSWIFSNQLTTSSKYPWLMLYLRSDATTGFNGGYHYDG
        FG+GCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADL+NGIGKNSYFSVSWHKNVS+GSWIFSN+LTTSSKYPWLMLYLRSDAT GFNGGYHYDG
Subjt:  FGKGCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADLNNGIGKNSYFSVSWHKNVSTGSWIFSNQLTTSSKYPWLMLYLRSDATTGFNGGYHYDG

Query:  RGIMRKLPESPNFKVRLTLDIKNGGGKNSQFYLIDIGSCWKNNGDACNGNTTTDVTRYSEMIINPETSSWCRPNNLASCPPYHVSASGEKIYRNETSRFP
        RGIMRKLPESPNFKVRLTLD+K+GGGKNSQFYLIDIGSCWKNNGDACNG+TTTDVTRYSEMIINPET+SWCRP+NL SCPPYHV ASGEKIYRNETSRFP
Subjt:  RGIMRKLPESPNFKVRLTLDIKNGGGKNSQFYLIDIGSCWKNNGDACNGNTTTDVTRYSEMIINPETSSWCRPNNLASCPPYHVSASGEKIYRNETSRFP

Query:  YSAYHLYCSPGNARHLEKPYDICDPYSNPQSQELLQILPHSEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGVEIYI
        YSAYHLYCSPGNA HLEKPYD+CDPYSNPQ+QEL+QILPH EW VHGYP KQGDGW+GDPRTWELDVGALSNRLYFYQDPGTKPARR+WTSINVG EIYI
Subjt:  YSAYHLYCSPGNARHLEKPYDICDPYSNPQSQELLQILPHSEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGVEIYI

Query:  SEGATAEWNV
        SEGATAEW+V
Subjt:  SEGATAEWNV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G17030.1 unknown protein7.3e-16252.85Show/hide
Query:  LLLFLLSQPFFLILTHCSPHASLEYISAIGDPGMKSPNVRVAFEAWNFCNEVGAEAPHMGSPRLADCADLRTPFASDQQDCSGLGSDSNCIVLQKVSESD
        L++ L    +  + +  +   ++ Y+SA+GDPGM++ N+RVA EAWN CNEVG EA +MGSPR+ADC D+               S     ++ KV E D
Subjt:  LLLFLLSQPFFLILTHCSPHASLEYISAIGDPGMKSPNVRVAFEAWNFCNEVGAEAPHMGSPRLADCADLRTPFASDQQDCSGLGSDSNCIVLQKVSESD

Query:  NKLGAGEKFPSERFKSYLDPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGKKITKLITDRKFPCFGKGCMNQPLVYHNYSRLV
        N+LG G         +  + D+Y  +KE YLG+ C+V D  NPW FWMIMLKNGN D  + +CPENGKK        +FPCFGKGCMN P ++H Y+ LV
Subjt:  NKLGAGEKFPSERFKSYLDPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGKKITKLITDRKFPCFGKGCMNQPLVYHNYSRLV

Query:  SFDKRMVSLTGGFYGTYELDADLNNGIGKNSYFSVSWHKNV-STGSWIFSNQLTTSSKYPWLMLYLRSDATTGFNGGYHYDGRGIMRKLPESPNFKVRLT
          D     ++G FYGT++LD D  + +G NSY+ V W K +    SW+F + L TSSKYPWLMLYLR+DA+ GF+GGYHYD RG+M+   +SP+FKV+  
Subjt:  SFDKRMVSLTGGFYGTYELDADLNNGIGKNSYFSVSWHKNV-STGSWIFSNQLTTSSKYPWLMLYLRSDATTGFNGGYHYDGRGIMRKLPESPNFKVRLT

Query:  LDIKNGGGKNSQFYLIDIGSCWKNNGDACNGNTTTDVTRYSEMIINPETSSWCRPNNLASCPPYHVSASGEKIYRNETSRFPYSAYHLYCSPGNARHLEK
        L+I  GGG  SQFYL+D+GSCWKN+G  C+G+ TTDVTRYSEMIINP  ++ C  N L +CPP H   +G K++R +  +FP+ AYH YC PGNAR  E 
Subjt:  LDIKNGGGKNSQFYLIDIGSCWKNNGDACNGNTTTDVTRYSEMIINPETSSWCRPNNLASCPPYHVSASGEKIYRNETSRFPYSAYHLYCSPGNARHLEK

Query:  PYDICDPYSNPQSQELLQILPHSEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGVEIYISEGATAEWNV
        PY++CDPYSNPQ QE+LQILPH  W   GYP K+G GWIGDPRTWELDVG LS  L+FYQDPGTKP  R W+SI++G EIY+S+   AEW V
Subjt:  PYDICDPYSNPQSQELLQILPHSEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGVEIYISEGATAEWNV

AT2G47010.1 unknown protein4.6e-18060.64Show/hide
Query:  SAIGDPGMKSPNVRVAFEAWNFCNEVGAEAPHMGSPRLADCADLRT----PFASDQQDCSGLGSDSNCIVLQKVSESDNKLGAGEKFPSERFKSYL-DPD
        SA+GDPGMK   +RVAFEAWNFCNEVG EAPHMGSPR ADC DL +     +  DQ + +  GS     ++ KVS+SDN+LG G+  P    +S L +PD
Subjt:  SAIGDPGMKSPNVRVAFEAWNFCNEVGAEAPHMGSPRLADCADLRT----PFASDQQDCSGLGSDSNCIVLQKVSESDNKLGAGEKFPSERFKSYL-DPD

Query:  LYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGKKITKLITDRKFPCFGKGCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDA
        LY VEKE YLGSLC+V D  NPWSFWM+MLKNGN+D  S LCP+NGKKI        FPCFG GCMNQP + H  + L    +   ++ G F GTYE  A
Subjt:  LYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGKKITKLITDRKFPCFGKGCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDA

Query:  DLNNGIGKNSYFSVSWHKNVSTGSWIFSNQLTTSSKYPWLMLYLRSDATTGFNGGYHYDGRGIMRKLPESPNFKVRLTLDIKNGGGKNSQFYLIDIGSCW
        D  NG+   SY+ V W K V  G W+F ++L TS+KYPWLMLYLR+DAT GF+GGYHYD RG+++ LPESPNFKVRLTL++K GGG  SQFYL+DIGSCW
Subjt:  DLNNGIGKNSYFSVSWHKNVSTGSWIFSNQLTTSSKYPWLMLYLRSDATTGFNGGYHYDGRGIMRKLPESPNFKVRLTLDIKNGGGKNSQFYLIDIGSCW

Query:  KNNGDACNGNTTTDVTRYSEMIINPETSSWCRPNNLASCPPYHVSASGEKIYRNETSRFPYSAYHLYCSPGNARHLEKPYDICDPYSNPQSQELLQILPH
        KNNG  C+G+ TTDVTRYSEMIINPET  WC P +L +CPPYH   +G +++R +   FPY AYH+YC+PGNA HLE P   CD YSNPQ+QE+LQ+LPH
Subjt:  KNNGDACNGNTTTDVTRYSEMIINPETSSWCRPNNLASCPPYHVSASGEKIYRNETSRFPYSAYHLYCSPGNARHLEKPYDICDPYSNPQSQELLQILPH

Query:  SEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGVEIYISEGATAEWNV
          W  +GYP + GDGW+GDPRTW+LDVG LS+RL+FYQDPGT PARR+WTS++VG EIY  + A AEW++
Subjt:  SEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGVEIYISEGATAEWNV

AT2G47010.2 unknown protein4.6e-18060.64Show/hide
Query:  SAIGDPGMKSPNVRVAFEAWNFCNEVGAEAPHMGSPRLADCADLRT----PFASDQQDCSGLGSDSNCIVLQKVSESDNKLGAGEKFPSERFKSYL-DPD
        SA+GDPGMK   +RVAFEAWNFCNEVG EAPHMGSPR ADC DL +     +  DQ + +  GS     ++ KVS+SDN+LG G+  P    +S L +PD
Subjt:  SAIGDPGMKSPNVRVAFEAWNFCNEVGAEAPHMGSPRLADCADLRT----PFASDQQDCSGLGSDSNCIVLQKVSESDNKLGAGEKFPSERFKSYL-DPD

Query:  LYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGKKITKLITDRKFPCFGKGCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDA
        LY VEKE YLGSLC+V D  NPWSFWM+MLKNGN+D  S LCP+NGKKI        FPCFG GCMNQP + H  + L    +   ++ G F GTYE  A
Subjt:  LYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGKKITKLITDRKFPCFGKGCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDA

Query:  DLNNGIGKNSYFSVSWHKNVSTGSWIFSNQLTTSSKYPWLMLYLRSDATTGFNGGYHYDGRGIMRKLPESPNFKVRLTLDIKNGGGKNSQFYLIDIGSCW
        D  NG+   SY+ V W K V  G W+F ++L TS+KYPWLMLYLR+DAT GF+GGYHYD RG+++ LPESPNFKVRLTL++K GGG  SQFYL+DIGSCW
Subjt:  DLNNGIGKNSYFSVSWHKNVSTGSWIFSNQLTTSSKYPWLMLYLRSDATTGFNGGYHYDGRGIMRKLPESPNFKVRLTLDIKNGGGKNSQFYLIDIGSCW

Query:  KNNGDACNGNTTTDVTRYSEMIINPETSSWCRPNNLASCPPYHVSASGEKIYRNETSRFPYSAYHLYCSPGNARHLEKPYDICDPYSNPQSQELLQILPH
        KNNG  C+G+ TTDVTRYSEMIINPET  WC P +L +CPPYH   +G +++R +   FPY AYH+YC+PGNA HLE P   CD YSNPQ+QE+LQ+LPH
Subjt:  KNNGDACNGNTTTDVTRYSEMIINPETSSWCRPNNLASCPPYHVSASGEKIYRNETSRFPYSAYHLYCSPGNARHLEKPYDICDPYSNPQSQELLQILPH

Query:  SEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGVEIYISEGATAEWNV
          W  +GYP + GDGW+GDPRTW+LDVG LS+RL+FYQDPGT PARR+WTS++VG EIY  + A AEW++
Subjt:  SEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGVEIYISEGATAEWNV

AT4G09965.1 unknown protein4.5e-1869.35Show/hide
Query:  KQGDGWIGDPRTWELDVGALSNRLYFYQD-PGTKPARRVWTSINVGVEIYIS-EGATAEWNV
        KQG+GWIGD RTWE++ GALS+RLYFYQ+ PGTKPA+R+WTSINV  +IY+S    TAEW V
Subjt:  KQGDGWIGDPRTWELDVGALSNRLYFYQD-PGTKPARRVWTSINVGVEIYIS-EGATAEWNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATTCCAGCTCACGTTCTCGATTCACATTTTTTCATCATTTTGCGAAATGGGTTCTCTGTTTTCACCTTCATTCTTCTTCCTCCTCCTCTTCCTTCTATCCCAACC
ATTTTTTCTAATTTTGACCCATTGTTCGCCTCATGCATCGTTGGAGTATATATCCGCCATTGGAGACCCTGGAATGAAGAGCCCAAATGTTAGAGTTGCATTTGAGGCAT
GGAATTTTTGTAATGAAGTTGGAGCTGAAGCTCCTCACATGGGCAGCCCCAGATTGGCTGATTGTGCTGATTTGCGAACCCCATTTGCTTCTGATCAACAAGATTGTTCT
GGTCTTGGAAGTGATAGCAACTGTATAGTACTTCAGAAAGTAAGTGAGTCGGATAATAAACTTGGGGCTGGTGAAAAGTTTCCATCAGAGCGTTTTAAGTCATACCTGGA
TCCAGATTTGTATGTTGTGGAGAAGGAGCGCTATCTTGGTTCACTATGCGAGGTTCATGATTCTTCGAATCCATGGAGTTTCTGGATGATTATGCTAAAAAATGGGAATT
TTGACAAGAACTCTACTCTCTGCCCTGAAAATGGTAAAAAGATTACTAAACTTATAACTGACAGAAAGTTTCCGTGTTTCGGCAAAGGTTGTATGAACCAGCCTCTTGTT
TACCATAACTATTCGAGATTGGTGTCTTTTGACAAACGAATGGTGTCTTTAACTGGTGGTTTCTATGGAACCTATGAACTAGATGCTGATCTGAATAATGGTATAGGGAA
GAACTCTTACTTTTCTGTCTCCTGGCACAAGAATGTTAGTACAGGGAGTTGGATATTTTCAAATCAATTGACGACATCTTCCAAATATCCTTGGCTTATGCTGTACCTTC
GATCCGATGCAACAACAGGTTTCAATGGTGGATATCACTACGATGGTCGTGGCATTATGAGAAAGTTGCCCGAGTCCCCAAATTTCAAAGTGAGATTAACACTTGACATT
AAAAATGGAGGTGGAAAAAACAGCCAATTCTATCTCATTGACATAGGAAGCTGTTGGAAGAACAATGGAGATGCTTGCAACGGCAACACTACCACTGATGTAACTCGATA
CAGTGAAATGATTATCAACCCCGAAACTAGTAGCTGGTGTAGGCCGAACAATCTTGCATCGTGTCCGCCTTATCATGTTAGTGCTTCAGGTGAGAAGATATACAGGAATG
AGACATCAAGATTTCCATATTCAGCATATCACCTGTACTGCAGTCCTGGAAATGCTAGACATTTGGAGAAACCATATGACATTTGTGATCCATATAGCAACCCACAGTCT
CAAGAGTTGCTACAAATTCTTCCACATTCTGAATGGGCTGTACATGGCTATCCAAAGAAGCAAGGAGATGGATGGATTGGAGATCCTAGAACTTGGGAGCTCGACGTAGG
AGCTTTGTCGAACCGCTTATACTTCTACCAGGATCCGGGAACGAAGCCAGCAAGGCGGGTATGGACTTCGATCAATGTCGGTGTAGAAATATACATTAGCGAAGGGGCCA
CAGCAGAGTGGAATGTAATGAAAGAGTTCTCGAAGCTTAGTCTATTTAAGCCTTGTCTTCCTCAATTTCAGGACATCAGCATACTCATTTCCCCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAATTCCAGCTCACGTTCTCGATTCACATTTTTTCATCATTTTGCGAAATGGGTTCTCTGTTTTCACCTTCATTCTTCTTCCTCCTCCTCTTCCTTCTATCCCAACC
ATTTTTTCTAATTTTGACCCATTGTTCGCCTCATGCATCGTTGGAGTATATATCCGCCATTGGAGACCCTGGAATGAAGAGCCCAAATGTTAGAGTTGCATTTGAGGCAT
GGAATTTTTGTAATGAAGTTGGAGCTGAAGCTCCTCACATGGGCAGCCCCAGATTGGCTGATTGTGCTGATTTGCGAACCCCATTTGCTTCTGATCAACAAGATTGTTCT
GGTCTTGGAAGTGATAGCAACTGTATAGTACTTCAGAAAGTAAGTGAGTCGGATAATAAACTTGGGGCTGGTGAAAAGTTTCCATCAGAGCGTTTTAAGTCATACCTGGA
TCCAGATTTGTATGTTGTGGAGAAGGAGCGCTATCTTGGTTCACTATGCGAGGTTCATGATTCTTCGAATCCATGGAGTTTCTGGATGATTATGCTAAAAAATGGGAATT
TTGACAAGAACTCTACTCTCTGCCCTGAAAATGGTAAAAAGATTACTAAACTTATAACTGACAGAAAGTTTCCGTGTTTCGGCAAAGGTTGTATGAACCAGCCTCTTGTT
TACCATAACTATTCGAGATTGGTGTCTTTTGACAAACGAATGGTGTCTTTAACTGGTGGTTTCTATGGAACCTATGAACTAGATGCTGATCTGAATAATGGTATAGGGAA
GAACTCTTACTTTTCTGTCTCCTGGCACAAGAATGTTAGTACAGGGAGTTGGATATTTTCAAATCAATTGACGACATCTTCCAAATATCCTTGGCTTATGCTGTACCTTC
GATCCGATGCAACAACAGGTTTCAATGGTGGATATCACTACGATGGTCGTGGCATTATGAGAAAGTTGCCCGAGTCCCCAAATTTCAAAGTGAGATTAACACTTGACATT
AAAAATGGAGGTGGAAAAAACAGCCAATTCTATCTCATTGACATAGGAAGCTGTTGGAAGAACAATGGAGATGCTTGCAACGGCAACACTACCACTGATGTAACTCGATA
CAGTGAAATGATTATCAACCCCGAAACTAGTAGCTGGTGTAGGCCGAACAATCTTGCATCGTGTCCGCCTTATCATGTTAGTGCTTCAGGTGAGAAGATATACAGGAATG
AGACATCAAGATTTCCATATTCAGCATATCACCTGTACTGCAGTCCTGGAAATGCTAGACATTTGGAGAAACCATATGACATTTGTGATCCATATAGCAACCCACAGTCT
CAAGAGTTGCTACAAATTCTTCCACATTCTGAATGGGCTGTACATGGCTATCCAAAGAAGCAAGGAGATGGATGGATTGGAGATCCTAGAACTTGGGAGCTCGACGTAGG
AGCTTTGTCGAACCGCTTATACTTCTACCAGGATCCGGGAACGAAGCCAGCAAGGCGGGTATGGACTTCGATCAATGTCGGTGTAGAAATATACATTAGCGAAGGGGCCA
CAGCAGAGTGGAATGTAATGAAAGAGTTCTCGAAGCTTAGTCTATTTAAGCCTTGTCTTCCTCAATTTCAGGACATCAGCATACTCATTTCCCCCTAG
Protein sequenceShow/hide protein sequence
MKFQLTFSIHIFSSFCEMGSLFSPSFFFLLLFLLSQPFFLILTHCSPHASLEYISAIGDPGMKSPNVRVAFEAWNFCNEVGAEAPHMGSPRLADCADLRTPFASDQQDCS
GLGSDSNCIVLQKVSESDNKLGAGEKFPSERFKSYLDPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGKKITKLITDRKFPCFGKGCMNQPLV
YHNYSRLVSFDKRMVSLTGGFYGTYELDADLNNGIGKNSYFSVSWHKNVSTGSWIFSNQLTTSSKYPWLMLYLRSDATTGFNGGYHYDGRGIMRKLPESPNFKVRLTLDI
KNGGGKNSQFYLIDIGSCWKNNGDACNGNTTTDVTRYSEMIINPETSSWCRPNNLASCPPYHVSASGEKIYRNETSRFPYSAYHLYCSPGNARHLEKPYDICDPYSNPQS
QELLQILPHSEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGVEIYISEGATAEWNVMKEFSKLSLFKPCLPQFQDISILISP