CuGenDBv2

Clc04G00200 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc04G00200
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Transmembrane protein
Genome location: ClcChr04:575819..577853
RNA-Seq Expression: Clc04G00200
Synteny: Clc04G00200
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
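
The genome location above uses a chromosome:start..end form. As a minimal sketch of how such a string could be split into its parts, the Python below parses it and reports the span length. The helper names parse_location and span_length are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates
    (an assumption made for this sketch, not stated in the record)."""
    return end - start + 1


chrom, start, end = parse_location("ClcChr04:575819..577853")
print(chrom, start, end, span_length(start, end))
```

If the coordinates were instead 0-based and half-open, the length would be end - start, so the convention is worth checking against the database documentation before relying on the number.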


Homology
GenBank top hits | e value | % identity | Alignment
KAA0045443.1 uncharacterized protein E6C27_scaffold294G00460 [Cucumis melo var. makuwa] | 1.5e-232 | 83.46 | below
Query:  MGLTLTGKSKSTAGENWGMGLLLVFFSEDSPPSSIADHNNLFPSSSLPSSG-----------------RRSNYNLLTKAQSTISVCALLVFVSLLLFTLS
        MGLTLTGKSKSTAGENWGMGLLLVFFSEDS PS IADH+NLFPSSS  SS                  RRSNYNLLTKAQSTISVCALLVF+SLLLFTLS
Subjt:  MGLTLTGKSKSTAGENWGMGLLLVFFSEDSPPSSIADHNNLFPSSSLPSSG-----------------RRSNYNLLTKAQSTISVCALLVFVSLLLFTLS

Query:  TFEPTIKMNLAPPRRLLSQKSMPIEVRTPLENRWNWFGKMWKQKSPMGK-TTTDAVPTAALQRMGTLYMRGTRAMADLTVVHVSEDVGEEDLRLFLRLFH
        TFEPTIKMNL PPRRLL+QKSMPI+VR PL NRWNWFGKMWKQK  MGK TTTDAV T ALQRMGTLYMRGTRAM DLTVVHVSEDVGEED RLFLRLFH
Subjt:  TFEPTIKMNLAPPRRLLSQKSMPIEVRTPLENRWNWFGKMWKQKSPMGK-TTTDAVPTAALQRMGTLYMRGTRAMADLTVVHVSEDVGEEDLRLFLRLFH

Query:  RSGVTAKSDSVFVFPSPVFSSRFGSIIREENELFLKLLGRYRNLNGTANRSAAAGFDVTQFLKKREKKEPEEPIWGKKVKRLVNDSNGGEDELTRLSYGS
        RSGVTAKSDSVF+FPSP FS RFG IIREEN+ FLKLLGRYRNLN TA+RSAAAGFDVT+  K +EKKE EEPIWGK+VKR  N SNGGEDELTRLSYGS
Subjt:  RSGVTAKSDSVFVFPSPVFSSRFGSIIREENELFLKLLGRYRNLNGTANRSAAAGFDVTQFLKKREKKEPEEPIWGKKVKRLVNDSNGGEDELTRLSYGS

Query:  VVSFDAAEIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKGAESVILFPNKHSKRNSEKSNSHQIVNPAIVIG
        VVSFDA EIDPENSLSGFSDHIPMSLRRW+CYPMLLGRVRRNFKHVML+DAK+SLLLGDPL RVRNKG ESVI F NKH K+NSEKSNSH IVNP+IVIG
Subjt:  VVSFDAAEIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKGAESVILFPNKHSKRNSEKSNSHQIVNPAIVIG

Query:  GARGIRRLSNAAVVEIARILMQHKKKNWVSDSGVLSHLVNSEFLLKNVKVIMATESIPEASSLAGVELDSVGSWSAAEKKMFQRGNNGNSREINSIIMKK
        GARGIRR+SNAA+VEI R+LMQHKKKN VSDSGVLSHLVNSEFLLKNVKVIMA ESIPEASS  GVEL+SVG  SA EK MF +GNNGNS EINS+IMKK
Subjt:  GARGIRRLSNAAVVEIARILMQHKKKNWVSDSGVLSHLVNSEFLLKNVKVIMATESIPEASSLAGVELDSVGSWSAAEKKMFQRGNNGNSREINSIIMKK

Query:  ICSSEIDSCVYSDC
        ICSSEIDS VY+DC
Subjt:  ICSSEIDSCVYSDC

TYK19793.1 uncharacterized protein E5676_scaffold307G00200 [Cucumis melo var. makuwa] | 1.0e-233 | 84.95 | below
Query:  MGLTLTGKSKSTAGENWGMGLLLVFFSEDSPPSSIADHNNLFPSSSLPSSG--------RRSNYNLLTKAQSTISVCALLVFVSLLLFTLSTFEPTIKMN
        MGLTLTGKSKSTAGENWGMGLLLVFFSEDS PS IADH+NLFPSSS  SS         RRSNYNLLTKAQSTISVCALLVF+SLLLFTLSTFEPTIKMN
Subjt:  MGLTLTGKSKSTAGENWGMGLLLVFFSEDSPPSSIADHNNLFPSSSLPSSG--------RRSNYNLLTKAQSTISVCALLVFVSLLLFTLSTFEPTIKMN

Query:  LAPPRRLLSQKSMPIEVRTPLENRWNWFGKMWKQKSPMGK-TTTDAVPTAALQRMGTLYMRGTRAMADLTVVHVSEDVGEEDLRLFLRLFHRSGVTAKSD
        L PPRRLL+QKSMPI+VR PL NRWNWFGKMWKQK  MGK TTTDAV T ALQRMGTLYMRGTRAM DLTVVHVSEDVGEED RLFLRLFHRSGVTAKSD
Subjt:  LAPPRRLLSQKSMPIEVRTPLENRWNWFGKMWKQKSPMGK-TTTDAVPTAALQRMGTLYMRGTRAMADLTVVHVSEDVGEEDLRLFLRLFHRSGVTAKSD

Query:  SVFVFPSPVFSSRFGSIIREENELFLKLLGRYRNLNGTANRSAAAGFDVTQFLKKREKKEPEEPIWGKKVKRLVNDSNGGEDELTRLSYGSVVSFDAAEI
        SVF+FPSP FS RFG IIREEN+ FLKLLGRYRNLN TA+RSAAAGFDVT+  K +EKKE EEPIWGK+VKR  N SNGGEDELTRLSYGSVVSFDA EI
Subjt:  SVFVFPSPVFSSRFGSIIREENELFLKLLGRYRNLNGTANRSAAAGFDVTQFLKKREKKEPEEPIWGKKVKRLVNDSNGGEDELTRLSYGSVVSFDAAEI

Query:  DPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKGAESVILFPNKHSKRNSEKSNSHQIVNPAIVIGGARGIRRLS
        DPENSLSGFSDHIPMSLRRW+CYPMLLGRVRRNFKHVML+DAK+SLLLGDPL RVRNKG ESVI F NKH K+NSEKSNSH IVNP+IVIGGARGIRR+S
Subjt:  DPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKGAESVILFPNKHSKRNSEKSNSHQIVNPAIVIGGARGIRRLS

Query:  NAAVVEIARILMQHKKKNWVSDSGVLSHLVNSEFLLKNVKVIMATESIPEASSLAGVELDSVGSWSAAEKKMFQRGNNGNSREINSIIMKKICSSEIDSC
        NAA+VEI R+LMQHKKKN VSDSGVLSHLVNSEFLLKNVKVIMA+ESIPEASS  GVEL+SVG  SA EK MF +GNNGNS EINS+IMKKICSSEIDS 
Subjt:  NAAVVEIARILMQHKKKNWVSDSGVLSHLVNSEFLLKNVKVIMATESIPEASSLAGVELDSVGSWSAAEKKMFQRGNNGNSREINSIIMKKICSSEIDSC

Query:  VYSDC
        VY+DC
Subjt:  VYSDC

XP_004151150.1 uncharacterized protein LOC101208268 [Cucumis sativus] | 8.0e-234 | 86.2 | below
Query:  MGLTLTGKSKSTAGENWGMGLLLVFFSEDSPPSSIADHNNLFPSSSLPS---SGRRSNYNLLTKAQSTISVCALLVFVSLLLFTLSTFEPTIKMNLAPPR
        MGLTLTGKSKSTAG+NWGMGLLLVFFSEDS PS IADH NLFPSSS  S   SGRRSNYNLLTKAQSTISVCALLVF+SLLLFTLSTFEPTIKMNL PPR
Subjt:  MGLTLTGKSKSTAGENWGMGLLLVFFSEDSPPSSIADHNNLFPSSSLPS---SGRRSNYNLLTKAQSTISVCALLVFVSLLLFTLSTFEPTIKMNLAPPR

Query:  RLLSQKSMPIEVRTPLENRWNWFGKMWKQKSPMGK-TTTDAVPTAALQRMGTLYMRGTRAMADLTVVHVSEDVGEEDLRLFLRLFHRSGVTAKSDSVFVF
        RLL+QKSMPIE+R PL NRWNWF +MWKQK  MGK TTTDAV T ALQRMGTLYMRGTRAM DLTVVHVSED+GEED RLFLRLFHRSGVTAKSDSVFVF
Subjt:  RLLSQKSMPIEVRTPLENRWNWFGKMWKQKSPMGK-TTTDAVPTAALQRMGTLYMRGTRAMADLTVVHVSEDVGEEDLRLFLRLFHRSGVTAKSDSVFVF

Query:  PSPVFSSRFGSIIREENELFLKLLGRYRNLNGTANRSAAAGFDVTQFLKKREKKEPEEPIWGKKVKRLVNDSNGGEDELTRLSYGSVVSFDAAEIDPENS
        PSP FS RFG IIR+ENE FLKLLGRYRNLNGT +RSAAAGFDVTQ  K +EKKE EEPIWGK+VKRL N SNGGEDELTRLSYGSVVSFDA EIDPENS
Subjt:  PSPVFSSRFGSIIREENELFLKLLGRYRNLNGTANRSAAAGFDVTQFLKKREKKEPEEPIWGKKVKRLVNDSNGGEDELTRLSYGSVVSFDAAEIDPENS

Query:  LSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKGAESVILFPNKHSKRNSEKSNSHQIVNPAIVIGGARGIRRLSNAAVV
        LSGFSDHIPMSLRRW+CYPMLLGRVRRNFKHVML+DAK+SLLLGDPL RVRNKG ESVI F NKHSK+NSEKSNSH +VNP+IVIGGARGIRRLSNAA V
Subjt:  LSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKGAESVILFPNKHSKRNSEKSNSHQIVNPAIVIGGARGIRRLSNAAVV

Query:  EIARILMQHKKKNWVSDSGVLSHLVNSEFLLKNVKVIMATESIPEASSLAGVELDSVGSWSAAEKKMFQRGNNGNSREINSIIMKKICSSEIDSCVYSDC
        EI RILMQHKKKN VSDSGVLS LVNSEFLLKNVKVIMA+ESIPEASSL GVEL+SVGS SA EK MF +GNNGNS EINS+IMKKICSSEIDS VY+ C
Subjt:  EIARILMQHKKKNWVSDSGVLSHLVNSEFLLKNVKVIMATESIPEASSLAGVELDSVGSWSAAEKKMFQRGNNGNSREINSIIMKKICSSEIDSCVYSDC

XP_008460778.1 PREDICTED: uncharacterized protein LOC103499540 [Cucumis melo] | 8.9e-233 | 83.63 | below
Query:  MGLTLTGKSKSTAGENWGMGLLLVFFSEDSPPSSIADHNNLFPSSSLPSSG----------------RRSNYNLLTKAQSTISVCALLVFVSLLLFTLST
        MGLTLTGKSKSTAGENWGMGLLLVFFSEDS PS IADH+NLFPSSS  SS                 RRSNYNLLTKAQSTISVCALLVF+SLLLFTLST
Subjt:  MGLTLTGKSKSTAGENWGMGLLLVFFSEDSPPSSIADHNNLFPSSSLPSSG----------------RRSNYNLLTKAQSTISVCALLVFVSLLLFTLST

Query:  FEPTIKMNLAPPRRLLSQKSMPIEVRTPLENRWNWFGKMWKQKSPMGK-TTTDAVPTAALQRMGTLYMRGTRAMADLTVVHVSEDVGEEDLRLFLRLFHR
        FEPTIKMNL PPRRLL+QKSMPI+VR PL NRWNWFGKMWKQK  MGK TTTDAV T ALQRMGTLYMRGTRAM DLTVVHVSEDVGEED RLFLRLFHR
Subjt:  FEPTIKMNLAPPRRLLSQKSMPIEVRTPLENRWNWFGKMWKQKSPMGK-TTTDAVPTAALQRMGTLYMRGTRAMADLTVVHVSEDVGEEDLRLFLRLFHR

Query:  SGVTAKSDSVFVFPSPVFSSRFGSIIREENELFLKLLGRYRNLNGTANRSAAAGFDVTQFLKKREKKEPEEPIWGKKVKRLVNDSNGGEDELTRLSYGSV
        SGVTAKSDSVF+FPSP FS RFG IIREEN+ FLKLLGRYRNLN TA+RSAAAGFDVT+  K +EKKE EEPIWGK+VKR  N SNGGEDELTRLSYGSV
Subjt:  SGVTAKSDSVFVFPSPVFSSRFGSIIREENELFLKLLGRYRNLNGTANRSAAAGFDVTQFLKKREKKEPEEPIWGKKVKRLVNDSNGGEDELTRLSYGSV

Query:  VSFDAAEIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKGAESVILFPNKHSKRNSEKSNSHQIVNPAIVIGG
        VSFDA EIDPENSLSGFSDHIPMSLRRW+CYPMLLGRVRRNFKHVML+DAK+SLLLGDPL RVRNKG ESVI F NKH K+NSEKSNSH IVNP+IVIGG
Subjt:  VSFDAAEIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKGAESVILFPNKHSKRNSEKSNSHQIVNPAIVIGG

Query:  ARGIRRLSNAAVVEIARILMQHKKKNWVSDSGVLSHLVNSEFLLKNVKVIMATESIPEASSLAGVELDSVGSWSAAEKKMFQRGNNGNSREINSIIMKKI
        ARGIRR+SNAA+VEI R+LMQHKKKN VSDSGVLSHLVNSEFLLKNVKVIMA+ESIPEASS  GVEL+SVG  SA EK MF +GNNGNS EINS+IMKKI
Subjt:  ARGIRRLSNAAVVEIARILMQHKKKNWVSDSGVLSHLVNSEFLLKNVKVIMATESIPEASSLAGVELDSVGSWSAAEKKMFQRGNNGNSREINSIIMKKI

Query:  CSSEIDSCVYSDC
        CSSEIDS VY+DC
Subjt:  CSSEIDSCVYSDC

XP_038883664.1 uncharacterized protein LOC120074578 [Benincasa hispida] | 9.8e-240 | 87.4 | below
Query:  MGLTLTGKSKSTAGENWGMGLLLVFFSEDSPPSSIADHNNLF----PSSSLPSSGRRSNYNLLTKAQSTISVCALLVFVSLLLFTLSTFEPTIKMNLAPP
        MGLTLTGKSKS+AGENWGMGLLLVFFSEDS  S+IAD   LF    PSSS  SSGRRSNYNLL KAQSTISVCALLVFVSLLLFTLSTFEP IKMNL PP
Subjt:  MGLTLTGKSKSTAGENWGMGLLLVFFSEDSPPSSIADHNNLF----PSSSLPSSGRRSNYNLLTKAQSTISVCALLVFVSLLLFTLSTFEPTIKMNLAPP

Query:  RRLLSQKSMPIEVRTPLENRWNWFGKMWKQKSPMGKTTTDAVPTAALQRMGTLYMRGTRAMADLTVVHVSEDVGEEDLRLFLRLFHRSGVTAKSDSVFVF
        RRLLSQKSMPIEVRTP +N+WNWFGKMWKQK   GK T DAV TAALQRMGTLYMRGTRAM DLTVVHVSEDVGEEDLRLFLRLFHRSGVTAKSDSVFVF
Subjt:  RRLLSQKSMPIEVRTPLENRWNWFGKMWKQKSPMGKTTTDAVPTAALQRMGTLYMRGTRAMADLTVVHVSEDVGEEDLRLFLRLFHRSGVTAKSDSVFVF

Query:  PSPVFSSRFGSIIREENELFLKLLGRYRNLNGTANRSAAAGFDVTQFLKKREKKEPEEPIWGKKVKRLVNDSNGGEDELTRLSYGSVVSFDAAEIDPENS
        PSP  S RFG IIREENE FLKLLG+YRNLNGTA+RSAAAGFDVTQF+K +EKKE EEPIWGK+VKR+ NDSNG  DELTRLSYGSVV FDAAEIDPENS
Subjt:  PSPVFSSRFGSIIREENELFLKLLGRYRNLNGTANRSAAAGFDVTQFLKKREKKEPEEPIWGKKVKRLVNDSNGGEDELTRLSYGSVVSFDAAEIDPENS

Query:  LSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKGAESVILFPNKHSKRNSEKSNSHQIVNPAIVIGGARGIRRLSNAAVV
        LSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSL+LGDPL RVRNKG ESVILF NKH+K+NSE+SN+H +VNPAIV+GGARGIRRLSNAAVV
Subjt:  LSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKGAESVILFPNKHSKRNSEKSNSHQIVNPAIVIGGARGIRRLSNAAVV

Query:  EIARILMQHKKKNWVSDSGVLSHLVNSEFLLKNVKVIMATESIPEASSLAGVELDSVGSWSAAEKKMFQRGNNGNSREINSIIMKKICSSEIDSCVYSDC
        EIARILMQHKKKN VSDSGVLSHLVNSEFLLKNVKVI +TESIPE SSLAGVELDSVGS SA EK MFQRGNNGNSREINS+IMKKICSSEIDS VYSDC
Subjt:  EIARILMQHKKKNWVSDSGVLSHLVNSEFLLKNVKVIMATESIPEASSLAGVELDSVGSWSAAEKKMFQRGNNGNSREINSIIMKKICSSEIDSCVYSDC

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KWA6 Uncharacterized protein | 3.9e-234 | 86.2 | below
Query:  MGLTLTGKSKSTAGENWGMGLLLVFFSEDSPPSSIADHNNLFPSSSLPS---SGRRSNYNLLTKAQSTISVCALLVFVSLLLFTLSTFEPTIKMNLAPPR
        MGLTLTGKSKSTAG+NWGMGLLLVFFSEDS PS IADH NLFPSSS  S   SGRRSNYNLLTKAQSTISVCALLVF+SLLLFTLSTFEPTIKMNL PPR
Subjt:  MGLTLTGKSKSTAGENWGMGLLLVFFSEDSPPSSIADHNNLFPSSSLPS---SGRRSNYNLLTKAQSTISVCALLVFVSLLLFTLSTFEPTIKMNLAPPR

Query:  RLLSQKSMPIEVRTPLENRWNWFGKMWKQKSPMGK-TTTDAVPTAALQRMGTLYMRGTRAMADLTVVHVSEDVGEEDLRLFLRLFHRSGVTAKSDSVFVF
        RLL+QKSMPIE+R PL NRWNWF +MWKQK  MGK TTTDAV T ALQRMGTLYMRGTRAM DLTVVHVSED+GEED RLFLRLFHRSGVTAKSDSVFVF
Subjt:  RLLSQKSMPIEVRTPLENRWNWFGKMWKQKSPMGK-TTTDAVPTAALQRMGTLYMRGTRAMADLTVVHVSEDVGEEDLRLFLRLFHRSGVTAKSDSVFVF

Query:  PSPVFSSRFGSIIREENELFLKLLGRYRNLNGTANRSAAAGFDVTQFLKKREKKEPEEPIWGKKVKRLVNDSNGGEDELTRLSYGSVVSFDAAEIDPENS
        PSP FS RFG IIR+ENE FLKLLGRYRNLNGT +RSAAAGFDVTQ  K +EKKE EEPIWGK+VKRL N SNGGEDELTRLSYGSVVSFDA EIDPENS
Subjt:  PSPVFSSRFGSIIREENELFLKLLGRYRNLNGTANRSAAAGFDVTQFLKKREKKEPEEPIWGKKVKRLVNDSNGGEDELTRLSYGSVVSFDAAEIDPENS

Query:  LSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKGAESVILFPNKHSKRNSEKSNSHQIVNPAIVIGGARGIRRLSNAAVV
        LSGFSDHIPMSLRRW+CYPMLLGRVRRNFKHVML+DAK+SLLLGDPL RVRNKG ESVI F NKHSK+NSEKSNSH +VNP+IVIGGARGIRRLSNAA V
Subjt:  LSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKGAESVILFPNKHSKRNSEKSNSHQIVNPAIVIGGARGIRRLSNAAVV

Query:  EIARILMQHKKKNWVSDSGVLSHLVNSEFLLKNVKVIMATESIPEASSLAGVELDSVGSWSAAEKKMFQRGNNGNSREINSIIMKKICSSEIDSCVYSDC
        EI RILMQHKKKN VSDSGVLS LVNSEFLLKNVKVIMA+ESIPEASSL GVEL+SVGS SA EK MF +GNNGNS EINS+IMKKICSSEIDS VY+ C
Subjt:  EIARILMQHKKKNWVSDSGVLSHLVNSEFLLKNVKVIMATESIPEASSLAGVELDSVGSWSAAEKKMFQRGNNGNSREINSIIMKKICSSEIDSCVYSDC

A0A1S3CD81 uncharacterized protein LOC103499540 | 4.3e-233 | 83.63 | below
Query:  MGLTLTGKSKSTAGENWGMGLLLVFFSEDSPPSSIADHNNLFPSSSLPSSG----------------RRSNYNLLTKAQSTISVCALLVFVSLLLFTLST
        MGLTLTGKSKSTAGENWGMGLLLVFFSEDS PS IADH+NLFPSSS  SS                 RRSNYNLLTKAQSTISVCALLVF+SLLLFTLST
Subjt:  MGLTLTGKSKSTAGENWGMGLLLVFFSEDSPPSSIADHNNLFPSSSLPSSG----------------RRSNYNLLTKAQSTISVCALLVFVSLLLFTLST

Query:  FEPTIKMNLAPPRRLLSQKSMPIEVRTPLENRWNWFGKMWKQKSPMGK-TTTDAVPTAALQRMGTLYMRGTRAMADLTVVHVSEDVGEEDLRLFLRLFHR
        FEPTIKMNL PPRRLL+QKSMPI+VR PL NRWNWFGKMWKQK  MGK TTTDAV T ALQRMGTLYMRGTRAM DLTVVHVSEDVGEED RLFLRLFHR
Subjt:  FEPTIKMNLAPPRRLLSQKSMPIEVRTPLENRWNWFGKMWKQKSPMGK-TTTDAVPTAALQRMGTLYMRGTRAMADLTVVHVSEDVGEEDLRLFLRLFHR

Query:  SGVTAKSDSVFVFPSPVFSSRFGSIIREENELFLKLLGRYRNLNGTANRSAAAGFDVTQFLKKREKKEPEEPIWGKKVKRLVNDSNGGEDELTRLSYGSV
        SGVTAKSDSVF+FPSP FS RFG IIREEN+ FLKLLGRYRNLN TA+RSAAAGFDVT+  K +EKKE EEPIWGK+VKR  N SNGGEDELTRLSYGSV
Subjt:  SGVTAKSDSVFVFPSPVFSSRFGSIIREENELFLKLLGRYRNLNGTANRSAAAGFDVTQFLKKREKKEPEEPIWGKKVKRLVNDSNGGEDELTRLSYGSV

Query:  VSFDAAEIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKGAESVILFPNKHSKRNSEKSNSHQIVNPAIVIGG
        VSFDA EIDPENSLSGFSDHIPMSLRRW+CYPMLLGRVRRNFKHVML+DAK+SLLLGDPL RVRNKG ESVI F NKH K+NSEKSNSH IVNP+IVIGG
Subjt:  VSFDAAEIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKGAESVILFPNKHSKRNSEKSNSHQIVNPAIVIGG

Query:  ARGIRRLSNAAVVEIARILMQHKKKNWVSDSGVLSHLVNSEFLLKNVKVIMATESIPEASSLAGVELDSVGSWSAAEKKMFQRGNNGNSREINSIIMKKI
        ARGIRR+SNAA+VEI R+LMQHKKKN VSDSGVLSHLVNSEFLLKNVKVIMA+ESIPEASS  GVEL+SVG  SA EK MF +GNNGNS EINS+IMKKI
Subjt:  ARGIRRLSNAAVVEIARILMQHKKKNWVSDSGVLSHLVNSEFLLKNVKVIMATESIPEASSLAGVELDSVGSWSAAEKKMFQRGNNGNSREINSIIMKKI

Query:  CSSEIDSCVYSDC
        CSSEIDS VY+DC
Subjt:  CSSEIDSCVYSDC

A0A5A7TPI4 Uncharacterized protein | 7.3e-233 | 83.46 | below
Query:  MGLTLTGKSKSTAGENWGMGLLLVFFSEDSPPSSIADHNNLFPSSSLPSSG-----------------RRSNYNLLTKAQSTISVCALLVFVSLLLFTLS
        MGLTLTGKSKSTAGENWGMGLLLVFFSEDS PS IADH+NLFPSSS  SS                  RRSNYNLLTKAQSTISVCALLVF+SLLLFTLS
Subjt:  MGLTLTGKSKSTAGENWGMGLLLVFFSEDSPPSSIADHNNLFPSSSLPSSG-----------------RRSNYNLLTKAQSTISVCALLVFVSLLLFTLS

Query:  TFEPTIKMNLAPPRRLLSQKSMPIEVRTPLENRWNWFGKMWKQKSPMGK-TTTDAVPTAALQRMGTLYMRGTRAMADLTVVHVSEDVGEEDLRLFLRLFH
        TFEPTIKMNL PPRRLL+QKSMPI+VR PL NRWNWFGKMWKQK  MGK TTTDAV T ALQRMGTLYMRGTRAM DLTVVHVSEDVGEED RLFLRLFH
Subjt:  TFEPTIKMNLAPPRRLLSQKSMPIEVRTPLENRWNWFGKMWKQKSPMGK-TTTDAVPTAALQRMGTLYMRGTRAMADLTVVHVSEDVGEEDLRLFLRLFH

Query:  RSGVTAKSDSVFVFPSPVFSSRFGSIIREENELFLKLLGRYRNLNGTANRSAAAGFDVTQFLKKREKKEPEEPIWGKKVKRLVNDSNGGEDELTRLSYGS
        RSGVTAKSDSVF+FPSP FS RFG IIREEN+ FLKLLGRYRNLN TA+RSAAAGFDVT+  K +EKKE EEPIWGK+VKR  N SNGGEDELTRLSYGS
Subjt:  RSGVTAKSDSVFVFPSPVFSSRFGSIIREENELFLKLLGRYRNLNGTANRSAAAGFDVTQFLKKREKKEPEEPIWGKKVKRLVNDSNGGEDELTRLSYGS

Query:  VVSFDAAEIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKGAESVILFPNKHSKRNSEKSNSHQIVNPAIVIG
        VVSFDA EIDPENSLSGFSDHIPMSLRRW+CYPMLLGRVRRNFKHVML+DAK+SLLLGDPL RVRNKG ESVI F NKH K+NSEKSNSH IVNP+IVIG
Subjt:  VVSFDAAEIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKGAESVILFPNKHSKRNSEKSNSHQIVNPAIVIG

Query:  GARGIRRLSNAAVVEIARILMQHKKKNWVSDSGVLSHLVNSEFLLKNVKVIMATESIPEASSLAGVELDSVGSWSAAEKKMFQRGNNGNSREINSIIMKK
        GARGIRR+SNAA+VEI R+LMQHKKKN VSDSGVLSHLVNSEFLLKNVKVIMA ESIPEASS  GVEL+SVG  SA EK MF +GNNGNS EINS+IMKK
Subjt:  GARGIRRLSNAAVVEIARILMQHKKKNWVSDSGVLSHLVNSEFLLKNVKVIMATESIPEASSLAGVELDSVGSWSAAEKKMFQRGNNGNSREINSIIMKK

Query:  ICSSEIDSCVYSDC
        ICSSEIDS VY+DC
Subjt:  ICSSEIDSCVYSDC

A0A5D3D8F3 Uncharacterized protein | 5.1e-234 | 84.95 | below
Query:  MGLTLTGKSKSTAGENWGMGLLLVFFSEDSPPSSIADHNNLFPSSSLPSSG--------RRSNYNLLTKAQSTISVCALLVFVSLLLFTLSTFEPTIKMN
        MGLTLTGKSKSTAGENWGMGLLLVFFSEDS PS IADH+NLFPSSS  SS         RRSNYNLLTKAQSTISVCALLVF+SLLLFTLSTFEPTIKMN
Subjt:  MGLTLTGKSKSTAGENWGMGLLLVFFSEDSPPSSIADHNNLFPSSSLPSSG--------RRSNYNLLTKAQSTISVCALLVFVSLLLFTLSTFEPTIKMN

Query:  LAPPRRLLSQKSMPIEVRTPLENRWNWFGKMWKQKSPMGK-TTTDAVPTAALQRMGTLYMRGTRAMADLTVVHVSEDVGEEDLRLFLRLFHRSGVTAKSD
        L PPRRLL+QKSMPI+VR PL NRWNWFGKMWKQK  MGK TTTDAV T ALQRMGTLYMRGTRAM DLTVVHVSEDVGEED RLFLRLFHRSGVTAKSD
Subjt:  LAPPRRLLSQKSMPIEVRTPLENRWNWFGKMWKQKSPMGK-TTTDAVPTAALQRMGTLYMRGTRAMADLTVVHVSEDVGEEDLRLFLRLFHRSGVTAKSD

Query:  SVFVFPSPVFSSRFGSIIREENELFLKLLGRYRNLNGTANRSAAAGFDVTQFLKKREKKEPEEPIWGKKVKRLVNDSNGGEDELTRLSYGSVVSFDAAEI
        SVF+FPSP FS RFG IIREEN+ FLKLLGRYRNLN TA+RSAAAGFDVT+  K +EKKE EEPIWGK+VKR  N SNGGEDELTRLSYGSVVSFDA EI
Subjt:  SVFVFPSPVFSSRFGSIIREENELFLKLLGRYRNLNGTANRSAAAGFDVTQFLKKREKKEPEEPIWGKKVKRLVNDSNGGEDELTRLSYGSVVSFDAAEI

Query:  DPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKGAESVILFPNKHSKRNSEKSNSHQIVNPAIVIGGARGIRRLS
        DPENSLSGFSDHIPMSLRRW+CYPMLLGRVRRNFKHVML+DAK+SLLLGDPL RVRNKG ESVI F NKH K+NSEKSNSH IVNP+IVIGGARGIRR+S
Subjt:  DPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKGAESVILFPNKHSKRNSEKSNSHQIVNPAIVIGGARGIRRLS

Query:  NAAVVEIARILMQHKKKNWVSDSGVLSHLVNSEFLLKNVKVIMATESIPEASSLAGVELDSVGSWSAAEKKMFQRGNNGNSREINSIIMKKICSSEIDSC
        NAA+VEI R+LMQHKKKN VSDSGVLSHLVNSEFLLKNVKVIMA+ESIPEASS  GVEL+SVG  SA EK MF +GNNGNS EINS+IMKKICSSEIDS 
Subjt:  NAAVVEIARILMQHKKKNWVSDSGVLSHLVNSEFLLKNVKVIMATESIPEASSLAGVELDSVGSWSAAEKKMFQRGNNGNSREINSIIMKKICSSEIDSC

Query:  VYSDC
        VY+DC
Subjt:  VYSDC

A0A6J1EAJ1 uncharacterized protein LOC111432312 | 3.6e-216 | 81.49 | below
Query:  MGLTLTGKSKSTAGENWGMGLLLVFFSEDSPPSSIADHNNLFPSSSLPSSGRRSNYNLLTKAQSTISVCALLVFVSLLLFTLSTFEPTIKMNLAPPRRLL
        MGL +TGKSKSTA ENWGMGL LVFFSEDS PS+IADHN LFPSSS  SSGRRSNYNLL+KAQSTISVCALLVFVSLLLFTLSTFEP IKMNL PPRRLL
Subjt:  MGLTLTGKSKSTAGENWGMGLLLVFFSEDSPPSSIADHNNLFPSSSLPSSGRRSNYNLLTKAQSTISVCALLVFVSLLLFTLSTFEPTIKMNLAPPRRLL

Query:  SQKSMPIEVRTPLENRWNWFGKMWKQKSPMGKTTTDAVPTAALQRMGTLYMRGTRAMADLTVVHVSEDVGEEDLRLFLRLFHRSGVTAKSDSVFVFPSPV
        S+KS PIE+RTP  NRWNWF KMWKQK  +  T T ++  AALQRMGTLY+RGTRAMAD+TVVHV EDV E+D RLFLRLFHRSGVTAKSDSVF+F S  
Subjt:  SQKSMPIEVRTPLENRWNWFGKMWKQKSPMGKTTTDAVPTAALQRMGTLYMRGTRAMADLTVVHVSEDVGEEDLRLFLRLFHRSGVTAKSDSVFVFPSPV

Query:  FSSRFGSIIREENELFLKLLGRYRNLNGTANRSAAAGFDVTQFLKKREKKEPEEPIWGKKVKRLVNDSNGGEDELTRLSYGSVVSFDAAEIDPENSLSGF
        FS +FG IIREENE FLKLL R RN N TANR A AGFDV QF+K +EKKEPEEPIWGKK KR  NDS GGEDELTRLSYGSVVSFDAAEIDPENSLSGF
Subjt:  FSSRFGSIIREENELFLKLLGRYRNLNGTANRSAAAGFDVTQFLKKREKKEPEEPIWGKKVKRLVNDSNGGEDELTRLSYGSVVSFDAAEIDPENSLSGF

Query:  SDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKGAESVILFPNKHSKRNSEKSNSHQIVNPAIVIGGARGIRRLSNAAVVEIAR
        SDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNS+L+GDPLGR+RNKG ESVILF NKH+K+NSEK  SH +VNPA+VIGGARG+RRLSNA VVEIAR
Subjt:  SDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKGAESVILFPNKHSKRNSEKSNSHQIVNPAIVIGGARGIRRLSNAAVVEIAR

Query:  ILMQHKKKNWVSDSGVLSHLVNSEFLLKNVKVIMATESIPEASSLAGVELDSVGSWSAAEK-KMFQRGNNGNSREINSIIMKKICSSEIDSCVYSDC
         LMQH KKN VSDS VLSHLVNSEFLLKNVKVIMATESIP+AS LAGVE  SVGS SA EK  + +R N GN REINS+I+KKICSSEIDS VYSDC
Subjt:  ILMQHKKKNWVSDSGVLSHLVNSEFLLKNVKVIMATESIPEASSLAGVELDSVGSWSAAEK-KMFQRGNNGNSREINSIIMKKICSSEIDSCVYSDC

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT3G57400.1 unknown protein | 2.3e-109 | 49.22 | below
Query:  SKSTAGENWGMGLLLVFF------SEDSPPSSIAD--HNNLFPSSSLPSSGRRSNYNLLTKAQSTISVCALLVFVSLLLFTLSTFEPT----IKMNLAPP
        +K T     GMGLLLVFF      ++DSP SS +      LF S        RS+  LL+KAQSTIS+C LL+F++L LFTLSTFEP+       +  P 
Subjt:  SKSTAGENWGMGLLLVFF------SEDSPPSSIAD--HNNLFPSSSLPSSGRRSNYNLLTKAQSTISVCALLVFVSLLLFTLSTFEPT----IKMNLAPP

Query:  RRLLSQKSMPIEVRTPLENRWNWFGKMWKQKSPMGKTTTDAVPTAALQRMGTLYMRGTRAMADLTVVHVSEDVGEEDLRLFLRLFHRSGVTAKSDSVFVF
        RR L  +   I   +    R+N F                     ALQ MGTL++RGT++M DL VVH+S D  E+DLRLF+RL HRSGVT+KSD V +F
Subjt:  RRLLSQKSMPIEVRTPLENRWNWFGKMWKQKSPMGKTTTDAVPTAALQRMGTLYMRGTRAMADLTVVHVSEDVGEEDLRLFLRLFHRSGVTAKSDSVFVF

Query:  PSPVFSSRFGSIIREENELFLKLLGRYRNLNGTANRSAAAGFDVTQFLKKREKKEPEEPIWGKKVKRL-VNDS---NGGEDELTRLSYGSVVSFDAAEID
         S    +RF  +I EEN+ FLKL+  +RN +   +  +  GF++T+F+KK+ K    EPIWGKK  R   ND+   N   +    L++GSVV FD  E+D
Subjt:  PSPVFSSRFGSIIREENELFLKLLGRYRNLNGTANRSAAAGFDVTQFLKKREKKEPEEPIWGKKVKRL-VNDS---NGGEDELTRLSYGSVVSFDAAEID

Query:  PENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKGAESVILFPNKHSKRNSEKSNSHQIVNPAIVIGGARGIRRLSN
        PENSLSGF DH+P+SLRRWACYPMLLGRVRRNFKHVMLVDAK SL LGDPL R+RN+  ESV+ F +KHS  +S+KS+    VNPAI+IGGA+GIRRLS+
Subjt:  PENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKGAESVILFPNKHSKRNSEKSNSHQIVNPAIVIGGARGIRRLSN

Query:  AAVVEIAR--ILMQHKKKNWVSDSGVLSHLVNSEFLLKNVKVIMATESIPEASSLAGVELDSVGSWSAAEKKMFQR-GNNGNSR---EINSIIMKKICSS
        +   EI R  I  QHKKKN V++S VLS LV +  + KN +V+ +   +PEASSLA +   +  + S     + QR G N NS    +I +IIMK+ICS 
Subjt:  AAVVEIAR--ILMQHKKKNWVSDSGVLSHLVNSEFLLKNVKVIMATESIPEASSLAGVELDSVGSWSAAEKKMFQR-GNNGNSR---EINSIIMKKICSS

Query:  EIDSCVYSDC
        E+DS VY+ C
Subjt:  EIDSCVYSDC

AT5G52500.1 unknown protein | 3.0e-77 | 42.19 | below
Query:  MGLLLVFFSEDSPPSSIADHNNLFPSSSLPSSGRRSNYNLLTKAQSTISVCALLVFVSLLLFTLSTF-EPTIKMNL--APPRRLLSQKSMPIEVRTPLEN
        MGL LV F    P  +  D ++ +  +      + S+  LL+KA+STIS C +L+F++L LFTLSTF EP+ +     +P  R L  +++       +  
Subjt:  MGLLLVFFSEDSPPSSIADHNNLFPSSSLPSSGRRSNYNLLTKAQSTISVCALLVFVSLLLFTLSTF-EPTIKMNL--APPRRLLSQKSMPIEVRTPLEN

Query:  RWNWFGKMWKQKSPMGKTTTDAVPTAALQRMGTLYMRGTRAMADLTVVHVSEDVGEEDLRLFLRLFHRSGVTAKSDSVFVFPSPVFSSRFGSIIREENEL
        R+                        ALQ MGTL++RGT++M DL + H++    E DLRLF+RL HRSGVT+KSD V +F SP  ++RF  +I +EN  
Subjt:  RWNWFGKMWKQKSPMGKTTTDAVPTAALQRMGTLYMRGTRAMADLTVVHVSEDVGEEDLRLFLRLFHRSGVTAKSDSVFVFPSPVFSSRFGSIIREENEL

Query:  FLKLLGRYRNLNGTANRSAAAGFDVTQFLKKREKKEPEEPIWGKKVKRLVNDSNGGEDELTRLSYGSVVSFDAAEIDPENSLSGFSDHIPMSLRRWACYP
        FLKL+  +RN + T++ S++                 E  IWGKK +   N ++        L++GS+V FD  E+DPENSLSGF D +P+SLRRWACYP
Subjt:  FLKLLGRYRNLNGTANRSAAAGFDVTQFLKKREKKEPEEPIWGKKVKRLVNDSNGGEDELTRLSYGSVVSFDAAEIDPENSLSGFSDHIPMSLRRWACYP

Query:  MLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKGAESVILFPNKHSKRNSEKSNSHQIVNPAIVIGGARGIRRLSNAAVVEIARILMQHKKK-NWVSDS
        MLLGRVRR+FKHVMLVDAK S  +GDP  R+RN+  +SV+ F +KH K  SE       VNP I+IGGA+GIRRLS++   EI R  M  K     V++S
Subjt:  MLLGRVRRNFKHVMLVDAKNSLLLGDPLGRVRNKGAESVILFPNKHSKRNSEKSNSHQIVNPAIVIGGARGIRRLSNAAVVEIARILMQHKKK-NWVSDS

Query:  GVLSHLVNSEFLLKNVKVIMATESIPEAS
         VLS LV +  + KN +V+++   +PEA+
Subjt:  GVLSHLVNSEFLLKNVKVIMATESIPEAS


Sequences
CDS sequence
ATGGGTCTCACTCTCACCGGAAAATCCAAATCCACTGCTGGCGAAAATTGGGGAATGGGTCTTCTTCTCGTTTTCTTCTCCGAAGATTCACCTCCTTCCTCCATTGCTGA
CCACAACAACCTCTTTCCATCTTCCTCCCTTCCTTCTTCAGGTCGTCGGAGTAATTACAATCTTCTCACCAAAGCTCAGTCCACCATTTCTGTTTGTGCGCTCCTCGTCT
TTGTTTCTCTTCTTCTCTTCACTCTCTCCACATTCGAGCCCACCATTAAAATGAACCTCGCTCCCCCTCGGAGGCTCCTCTCCCAGAAATCAATGCCGATTGAAGTTCGT
ACGCCGTTGGAGAATCGGTGGAACTGGTTTGGCAAAATGTGGAAGCAGAAATCGCCGATGGGGAAGACGACGACTGACGCTGTTCCCACGGCCGCGCTGCAACGAATGGG
GACTCTGTACATGCGAGGTACTCGAGCTATGGCGGACCTGACGGTGGTCCACGTATCGGAAGATGTCGGTGAAGAAGACCTCCGCCTCTTTCTCCGACTGTTCCATCGGT
CCGGCGTCACCGCGAAATCTGATTCGGTCTTCGTCTTTCCCTCGCCGGTGTTCTCGTCGAGATTCGGTTCGATTATTCGAGAGGAAAACGAATTGTTTCTGAAACTACTT
GGTCGGTACCGGAATTTGAACGGAACGGCCAACCGGAGCGCGGCGGCGGGATTTGATGTGACTCAGTTTCTTAAGAAAAGAGAGAAGAAGGAGCCGGAGGAGCCGATTTG
GGGGAAGAAAGTGAAACGACTAGTGAACGATTCTAACGGCGGCGAGGACGAGTTGACTCGGCTGAGTTACGGCTCGGTGGTGAGTTTCGACGCGGCGGAAATTGATCCGG
AGAATTCACTTTCCGGCTTCTCAGATCATATTCCGATGAGTCTGCGGCGGTGGGCGTGTTATCCAATGCTCCTCGGCCGAGTCCGCCGGAATTTCAAGCACGTAATGCTG
GTTGACGCTAAAAACTCGCTTCTTCTCGGGGATCCACTCGGCCGCGTTAGAAACAAAGGAGCCGAGTCAGTAATTCTCTTCCCGAACAAGCACAGCAAAAGGAACTCGGA
AAAGTCAAACTCGCACCAAATTGTCAATCCGGCCATCGTGATCGGCGGAGCGCGCGGCATCCGGCGGCTATCGAACGCGGCGGTGGTAGAAATCGCCAGAATTCTCATGC
AGCACAAGAAGAAAAACTGGGTCTCCGACTCGGGAGTACTGAGTCACCTCGTTAACAGTGAGTTTCTATTAAAGAATGTGAAAGTGATTATGGCGACTGAGTCGATTCCA
GAAGCGAGTTCACTTGCCGGAGTGGAATTGGACTCTGTCGGTTCCTGGTCGGCGGCGGAGAAGAAGATGTTCCAGAGGGGCAATAATGGTAATTCACGTGAAATTAATTC
CATTATTATGAAGAAAATATGTTCTTCCGAAATTGATTCTTGTGTCTATAGTGATTGTTAG
mRNA sequence
CTGAAATTTCTTTTACTAAGAAAATCAAGATTTTTCAAAATATAGAAACTAAAATAAAAAGAAAATACAAAAATCATAATAGTATTTTGAATCAAGGAAAAAGAAAAAAA
TAATAAAATAAAAAAGAAAAGAAAATGAATTGAAAGAGACAGCGGCGGAGTGTAAATGGGAAAATGCAGCAGCACTCCACAAGAGAACTTCACTCACTCTTTCCTTCTCT
TTAAATTTTTTAAAAAAAATAAAATAAAATAAAAATAAATAATTAAATATAAAAGCCTTCGCCTCAAAATTCCCCAAAATGCCAATCCCAAATCCCCCGTTGAAGAACTC
CAAATAACAACCAAAAATGGGTCTCACTCTCACCGGAAAATCCAAATCCACTGCTGGCGAAAATTGGGGAATGGGTCTTCTTCTCGTTTTCTTCTCCGAAGATTCACCTC
CTTCCTCCATTGCTGACCACAACAACCTCTTTCCATCTTCCTCCCTTCCTTCTTCAGGTCGTCGGAGTAATTACAATCTTCTCACCAAAGCTCAGTCCACCATTTCTGTT
TGTGCGCTCCTCGTCTTTGTTTCTCTTCTTCTCTTCACTCTCTCCACATTCGAGCCCACCATTAAAATGAACCTCGCTCCCCCTCGGAGGCTCCTCTCCCAGAAATCAAT
GCCGATTGAAGTTCGTACGCCGTTGGAGAATCGGTGGAACTGGTTTGGCAAAATGTGGAAGCAGAAATCGCCGATGGGGAAGACGACGACTGACGCTGTTCCCACGGCCG
CGCTGCAACGAATGGGGACTCTGTACATGCGAGGTACTCGAGCTATGGCGGACCTGACGGTGGTCCACGTATCGGAAGATGTCGGTGAAGAAGACCTCCGCCTCTTTCTC
CGACTGTTCCATCGGTCCGGCGTCACCGCGAAATCTGATTCGGTCTTCGTCTTTCCCTCGCCGGTGTTCTCGTCGAGATTCGGTTCGATTATTCGAGAGGAAAACGAATT
GTTTCTGAAACTACTTGGTCGGTACCGGAATTTGAACGGAACGGCCAACCGGAGCGCGGCGGCGGGATTTGATGTGACTCAGTTTCTTAAGAAAAGAGAGAAGAAGGAGC
CGGAGGAGCCGATTTGGGGGAAGAAAGTGAAACGACTAGTGAACGATTCTAACGGCGGCGAGGACGAGTTGACTCGGCTGAGTTACGGCTCGGTGGTGAGTTTCGACGCG
GCGGAAATTGATCCGGAGAATTCACTTTCCGGCTTCTCAGATCATATTCCGATGAGTCTGCGGCGGTGGGCGTGTTATCCAATGCTCCTCGGCCGAGTCCGCCGGAATTT
CAAGCACGTAATGCTGGTTGACGCTAAAAACTCGCTTCTTCTCGGGGATCCACTCGGCCGCGTTAGAAACAAAGGAGCCGAGTCAGTAATTCTCTTCCCGAACAAGCACA
GCAAAAGGAACTCGGAAAAGTCAAACTCGCACCAAATTGTCAATCCGGCCATCGTGATCGGCGGAGCGCGCGGCATCCGGCGGCTATCGAACGCGGCGGTGGTAGAAATC
GCCAGAATTCTCATGCAGCACAAGAAGAAAAACTGGGTCTCCGACTCGGGAGTACTGAGTCACCTCGTTAACAGTGAGTTTCTATTAAAGAATGTGAAAGTGATTATGGC
GACTGAGTCGATTCCAGAAGCGAGTTCACTTGCCGGAGTGGAATTGGACTCTGTCGGTTCCTGGTCGGCGGCGGAGAAGAAGATGTTCCAGAGGGGCAATAATGGTAATT
CACGTGAAATTAATTCCATTATTATGAAGAAAATATGTTCTTCCGAAATTGATTCTTGTGTCTATAGTGATTGTTAGCAAATCCACAGATTGAAAGAATTCAGATGTATA
ATTTAAAATGAGTGAACTATAAACATTTTATTTTATTTTATTTTATTTTTTGTTCTCCTTTAATTTGTACATATACTTTGACACCTTGGAATAAAGATTGTAATGTTTCT
TTTAGGATTATTTAATACTTTTCATTACCTTAATTTTATTACTTGTTTGTTTCTT
Protein sequence
MGLTLTGKSKSTAGENWGMGLLLVFFSEDSPPSSIADHNNLFPSSSLPSSGRRSNYNLLTKAQSTISVCALLVFVSLLLFTLSTFEPTIKMNLAPPRRLLSQKSMPIEVR
TPLENRWNWFGKMWKQKSPMGKTTTDAVPTAALQRMGTLYMRGTRAMADLTVVHVSEDVGEEDLRLFLRLFHRSGVTAKSDSVFVFPSPVFSSRFGSIIREENELFLKLL
GRYRNLNGTANRSAAAGFDVTQFLKKREKKEPEEPIWGKKVKRLVNDSNGGEDELTRLSYGSVVSFDAAEIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHVML
VDAKNSLLLGDPLGRVRNKGAESVILFPNKHSKRNSEKSNSHQIVNPAIVIGGARGIRRLSNAAVVEIARILMQHKKKNWVSDSGVLSHLVNSEFLLKNVKVIMATESIP
EASSLAGVELDSVGSWSAAEKKMFQRGNNGNSREINSIIMKKICSSEIDSCVYSDC