CuGenDBv2

MC01g0186 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC01g0186
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Transmembrane protein
Genome location: MC01:8054438..8055955
RNA-Seq Expression: MC01g0186
Synteny: MC01g0186
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
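
The genome location field follows a `chromosome:start..end` pattern. As a minimal sketch of how such a string could be split into its parts (the function name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that this page does not state):

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("MC01:8054438..8055955")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span_length = end - start + 1
print(chrom, start, end, span_length)
```

Under that assumption the gene spans 1518 bp on MC01.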


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0045443.1 uncharacterized protein E6C27_scaffold294G00460 [Cucumis melo var. makuwa]; e-value 1.06e-264; identity 75.38%
Query:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF---------------------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLST
        MGL LTGKSKSTAGENWGMG LLVFFSEDSPS IAD   LF                     SS+S+RRSNYNLLT+AQS ISVCALLVF+SLLLFTLST
Subjt:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF---------------------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLST

Query:  FEPAIKMNLTPPRRFLSQKSAPIEVRTPSGKYVYRWSWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRL
        FEP IKMNLTPPRR L+QKS PI+VR P G    RW+W GKMWKQKPA  K T   A   VALQ MGTLYMRGTRAM DL VVHV+EDVGEED RLFLRL
Subjt:  FEPAIKMNLTPPRRFLSQKSAPIEVRTPSGKYVYRWSWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRL

Query:  FHRSGVTAKSDSVFIFPSATFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSY
        FHRSGVTAKSDSVF+FPS  FS RFG +IREEN+SFLKL+ RYRNLN TASR AA GFDVT+  KSKEKKETEEP+WGKRVKR +N S+GGEDELTRLSY
Subjt:  FHRSGVTAKSDSVFIFPSATFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSY

Query:  GSVVSFDAAEIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKS
        GSVVSFDA EIDPENSLSGFSDHIPMSLRRW+CYPMLLGRVRRNFKH+MLIDAKSSLLLGDPL RVR+K TESVI FT            KHGKKNS+KS
Subjt:  GSVVSFDAAEIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKS

Query:  NSRRHLVNPAVVIGGARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGN
        NS  H+VNP++VIGGARG+RR+SNAA+VEI R LMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVI A ESIPEASS  G + +SVG LSAPE  +F +GN
Subjt:  NSRRHLVNPAVVIGGARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGN

Query:  NGNFREINSIIMKRICSSEMDSSVYSGC
        NGN  EINS+IMK+ICSSE+DSSVY+ C
Subjt:  NGNFREINSIIMKRICSSEMDSSVYSGC

TYK19793.1 uncharacterized protein E5676_scaffold307G00200 [Cucumis melo var. makuwa]; e-value 1.14e-266; identity 76.69%
Query:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF------------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNL
        MGL LTGKSKSTAGENWGMG LLVFFSEDSPS IAD   LF            SS+S+RRSNYNLLT+AQS ISVCALLVF+SLLLFTLSTFEP IKMNL
Subjt:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF------------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNL

Query:  TPPRRFLSQKSAPIEVRTPSGKYVYRWSWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAK
        TPPRR L+QKS PI+VR P G    RW+W GKMWKQKPA  K T   A   VALQ MGTLYMRGTRAM DL VVHV+EDVGEED RLFLRLFHRSGVTAK
Subjt:  TPPRRFLSQKSAPIEVRTPSGKYVYRWSWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAK

Query:  SDSVFIFPSATFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAA
        SDSVF+FPS  FS RFG +IREEN+SFLKL+ RYRNLN TASR AA GFDVT+  KSKEKKETEEP+WGKRVKR +N S+GGEDELTRLSYGSVVSFDA 
Subjt:  SDSVFIFPSATFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAA

Query:  EIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNP
        EIDPENSLSGFSDHIPMSLRRW+CYPMLLGRVRRNFKH+MLIDAKSSLLLGDPL RVR+K TESVI FT            KHGKKNS+KSNS  H+VNP
Subjt:  EIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNP

Query:  AVVIGGARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNNGNFREINS
        ++VIGGARG+RR+SNAA+VEI R LMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVI A+ESIPEASS  G + +SVG LSAPE  +F +GNNGN  EINS
Subjt:  AVVIGGARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNNGNFREINS

Query:  IIMKRICSSEMDSSVYSGC
        +IMK+ICSSE+DSSVY+ C
Subjt:  IIMKRICSSEMDSSVYSGC

XP_004151150.1 uncharacterized protein LOC101208268 [Cucumis sativus]; e-value 5.01e-264; identity 77.24%
Query:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF-------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNLTPPRR
        MGL LTGKSKSTAG+NWGMG LLVFFSEDSPS IAD K LF       SS+S RRSNYNLLT+AQS ISVCALLVF+SLLLFTLSTFEP IKMNLTPPRR
Subjt:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF-------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNLTPPRR

Query:  FLSQKSAPIEVRTPSGKYVYRWSWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAKSDSVF
         L+QKS PIE+R P G    RW+W  +MWKQKPA  K T   A   VALQ MGTLYMRGTRAM DL VVHV+ED+GEED RLFLRLFHRSGVTAKSDSVF
Subjt:  FLSQKSAPIEVRTPSGKYVYRWSWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAKSDSVF

Query:  IFPSATFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAAEIDPE
        +FPS  FS RFG +IR+ENESFLKL+ RYRNLN T SR AA GFDVTQ  KSKEKKETEEP+WGKRVKR+ N S+GGEDELTRLSYGSVVSFDA EIDPE
Subjt:  IFPSATFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAAEIDPE

Query:  NSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNPAVVIG
        NSLSGFSDHIPMSLRRW+CYPMLLGRVRRNFKH+MLIDAKSSLLLGDPL RVR+K TESVI FT            KH KKNS+KSNS  HLVNP++VIG
Subjt:  NSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNPAVVIG

Query:  GARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNNGNFREINSIIMKR
        GARG+RRLSNAA VEI R LMQHKKKNSVSDSGVLS LVNSEFLLKNVKVI A+ESIPEASSL G + +SVGSLSAPE  +F +GNNGN  EINS+IMK+
Subjt:  GARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNNGNFREINSIIMKR

Query:  ICSSEMDSSVYSGC
        ICSSE+DSSVY+ C
Subjt:  ICSSEMDSSVYSGC

XP_008460778.1 PREDICTED: uncharacterized protein LOC103499540 [Cucumis melo]; e-value 2.53e-265; identity 75.52%
Query:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF--------------------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTF
        MGL LTGKSKSTAGENWGMG LLVFFSEDSPS IAD   LF                    SS+S+RRSNYNLLT+AQS ISVCALLVF+SLLLFTLSTF
Subjt:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF--------------------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTF

Query:  EPAIKMNLTPPRRFLSQKSAPIEVRTPSGKYVYRWSWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLF
        EP IKMNLTPPRR L+QKS PI+VR P G    RW+W GKMWKQKPA  K T   A   VALQ MGTLYMRGTRAM DL VVHV+EDVGEED RLFLRLF
Subjt:  EPAIKMNLTPPRRFLSQKSAPIEVRTPSGKYVYRWSWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLF

Query:  HRSGVTAKSDSVFIFPSATFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYG
        HRSGVTAKSDSVF+FPS  FS RFG +IREEN+SFLKL+ RYRNLN TASR AA GFDVT+  KSKEKKETEEP+WGKRVKR +N S+GGEDELTRLSYG
Subjt:  HRSGVTAKSDSVFIFPSATFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYG

Query:  SVVSFDAAEIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSN
        SVVSFDA EIDPENSLSGFSDHIPMSLRRW+CYPMLLGRVRRNFKH+MLIDAKSSLLLGDPL RVR+K TESVI FT            KHGKKNS+KSN
Subjt:  SVVSFDAAEIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSN

Query:  SRRHLVNPAVVIGGARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNN
        S  H+VNP++VIGGARG+RR+SNAA+VEI R LMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVI A+ESIPEASS  G + +SVG LSAPE  +F +GNN
Subjt:  SRRHLVNPAVVIGGARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNN

Query:  GNFREINSIIMKRICSSEMDSSVYSGC
        GN  EINS+IMK+ICSSE+DSSVY+ C
Subjt:  GNFREINSIIMKRICSSEMDSSVYSGC

XP_038883664.1 uncharacterized protein LOC120074578 [Benincasa hispida]; e-value 1.49e-277; identity 79.57%
Query:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLFSSSSV--------RRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNLTPPR
        MGL LTGKSKS+AGENWGMG LLVFFSEDS SAIADQKKLFSSSS         RRSNYNLL +AQS ISVCALLVFVSLLLFTLSTFEPAIKMNLTPPR
Subjt:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLFSSSSV--------RRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNLTPPR

Query:  RFLSQKSAPIEVRTPSGKYVYRWSWLGKMWKQKPATAKRTDAAAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAKSDSVF
        R LSQKS PIEVRTPS     +W+W GKMWKQKPA  K T+ A    ALQ MGTLYMRGTRAM DL VVHV+EDVGEEDLRLFLRLFHRSGVTAKSDSVF
Subjt:  RFLSQKSAPIEVRTPSGKYVYRWSWLGKMWKQKPATAKRTDAAAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAKSDSVF

Query:  IFPSATFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAAEIDPE
        +FPS T S RFG +IREENESFLKL+ +YRNLN TASR AA GFDVTQFVK+KEKKETEEP+WGKRVKR++N+S+G  DELTRLSYGSVV FDAAEIDPE
Subjt:  IFPSATFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAAEIDPE

Query:  NSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNPAVVIG
        NSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKH+ML+DAK+SL+LGDPL RVR+K TESVILFT            KH KKNS++SN+  HLVNPA+V+G
Subjt:  NSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNPAVVIG

Query:  GARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNNGNFREINSIIMKR
        GARG+RRLSNAA+VEIAR LMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVIT+ ESIPE SSLAG + DSVGS SAPE  +FQRGNNGN REINS+IMK+
Subjt:  GARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNNGNFREINSIIMKR

Query:  ICSSEMDSSVYSGC
        ICSSE+DSSVYS C
Subjt:  ICSSEMDSSVYSGC
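
Each alignment block carries a midline between the Query and Subjt rows. As a rough sketch of how identities and positives could be counted for a single block (the helper `count_matches` is hypothetical, and the reading of the midline as "letter = identical residue, `+` = conservative substitution, space = mismatch or gap" is the usual BLAST convention, assumed here rather than stated on this page):

```python
def count_matches(query: str, midline: str) -> tuple[int, int]:
    """Return (identities, positives) for one alignment block.

    Assumes a BLAST-style midline: a letter marks an identical residue,
    '+' a conservative substitution, and a space a mismatch or gap.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identities = sum(1 for q, m in zip(query, midline) if m == q and m != " ")
    positives = identities + midline.count("+")
    return identities, positives

# Final block of the Benincasa hispida alignment.
query = "ICSSEMDSSVYSGC"
midline = "ICSSE+DSSVYS C"
identities, positives = count_matches(query, midline)
print(identities, positives, len(query))
```

The reported % identity is computed over the whole alignment, so a single block such as this one will not reproduce the 79.57% figure by itself; summing the counts over all blocks would be needed.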

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KWA6 Uncharacterized protein; e-value 2.43e-264; identity 77.24%
Query:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF-------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNLTPPRR
        MGL LTGKSKSTAG+NWGMG LLVFFSEDSPS IAD K LF       SS+S RRSNYNLLT+AQS ISVCALLVF+SLLLFTLSTFEP IKMNLTPPRR
Subjt:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF-------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNLTPPRR

Query:  FLSQKSAPIEVRTPSGKYVYRWSWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAKSDSVF
         L+QKS PIE+R P G    RW+W  +MWKQKPA  K T   A   VALQ MGTLYMRGTRAM DL VVHV+ED+GEED RLFLRLFHRSGVTAKSDSVF
Subjt:  FLSQKSAPIEVRTPSGKYVYRWSWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAKSDSVF

Query:  IFPSATFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAAEIDPE
        +FPS  FS RFG +IR+ENESFLKL+ RYRNLN T SR AA GFDVTQ  KSKEKKETEEP+WGKRVKR+ N S+GGEDELTRLSYGSVVSFDA EIDPE
Subjt:  IFPSATFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAAEIDPE

Query:  NSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNPAVVIG
        NSLSGFSDHIPMSLRRW+CYPMLLGRVRRNFKH+MLIDAKSSLLLGDPL RVR+K TESVI FT            KH KKNS+KSNS  HLVNP++VIG
Subjt:  NSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNPAVVIG

Query:  GARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNNGNFREINSIIMKR
        GARG+RRLSNAA VEI R LMQHKKKNSVSDSGVLS LVNSEFLLKNVKVI A+ESIPEASSL G + +SVGSLSAPE  +F +GNNGN  EINS+IMK+
Subjt:  GARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNNGNFREINSIIMKR

Query:  ICSSEMDSSVYSGC
        ICSSE+DSSVY+ C
Subjt:  ICSSEMDSSVYSGC

A0A1S3CD81 uncharacterized protein LOC103499540; e-value 1.22e-265; identity 75.52%
Query:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF--------------------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTF
        MGL LTGKSKSTAGENWGMG LLVFFSEDSPS IAD   LF                    SS+S+RRSNYNLLT+AQS ISVCALLVF+SLLLFTLSTF
Subjt:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF--------------------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTF

Query:  EPAIKMNLTPPRRFLSQKSAPIEVRTPSGKYVYRWSWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLF
        EP IKMNLTPPRR L+QKS PI+VR P G    RW+W GKMWKQKPA  K T   A   VALQ MGTLYMRGTRAM DL VVHV+EDVGEED RLFLRLF
Subjt:  EPAIKMNLTPPRRFLSQKSAPIEVRTPSGKYVYRWSWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLF

Query:  HRSGVTAKSDSVFIFPSATFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYG
        HRSGVTAKSDSVF+FPS  FS RFG +IREEN+SFLKL+ RYRNLN TASR AA GFDVT+  KSKEKKETEEP+WGKRVKR +N S+GGEDELTRLSYG
Subjt:  HRSGVTAKSDSVFIFPSATFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYG

Query:  SVVSFDAAEIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSN
        SVVSFDA EIDPENSLSGFSDHIPMSLRRW+CYPMLLGRVRRNFKH+MLIDAKSSLLLGDPL RVR+K TESVI FT            KHGKKNS+KSN
Subjt:  SVVSFDAAEIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSN

Query:  SRRHLVNPAVVIGGARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNN
        S  H+VNP++VIGGARG+RR+SNAA+VEI R LMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVI A+ESIPEASS  G + +SVG LSAPE  +F +GNN
Subjt:  SRRHLVNPAVVIGGARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNN

Query:  GNFREINSIIMKRICSSEMDSSVYSGC
        GN  EINS+IMK+ICSSE+DSSVY+ C
Subjt:  GNFREINSIIMKRICSSEMDSSVYSGC

A0A5A7TPI4 Uncharacterized protein; e-value 5.16e-265; identity 75.38%
Query:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF---------------------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLST
        MGL LTGKSKSTAGENWGMG LLVFFSEDSPS IAD   LF                     SS+S+RRSNYNLLT+AQS ISVCALLVF+SLLLFTLST
Subjt:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF---------------------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLST

Query:  FEPAIKMNLTPPRRFLSQKSAPIEVRTPSGKYVYRWSWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRL
        FEP IKMNLTPPRR L+QKS PI+VR P G    RW+W GKMWKQKPA  K T   A   VALQ MGTLYMRGTRAM DL VVHV+EDVGEED RLFLRL
Subjt:  FEPAIKMNLTPPRRFLSQKSAPIEVRTPSGKYVYRWSWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRL

Query:  FHRSGVTAKSDSVFIFPSATFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSY
        FHRSGVTAKSDSVF+FPS  FS RFG +IREEN+SFLKL+ RYRNLN TASR AA GFDVT+  KSKEKKETEEP+WGKRVKR +N S+GGEDELTRLSY
Subjt:  FHRSGVTAKSDSVFIFPSATFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSY

Query:  GSVVSFDAAEIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKS
        GSVVSFDA EIDPENSLSGFSDHIPMSLRRW+CYPMLLGRVRRNFKH+MLIDAKSSLLLGDPL RVR+K TESVI FT            KHGKKNS+KS
Subjt:  GSVVSFDAAEIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKS

Query:  NSRRHLVNPAVVIGGARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGN
        NS  H+VNP++VIGGARG+RR+SNAA+VEI R LMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVI A ESIPEASS  G + +SVG LSAPE  +F +GN
Subjt:  NSRRHLVNPAVVIGGARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGN

Query:  NGNFREINSIIMKRICSSEMDSSVYSGC
        NGN  EINS+IMK+ICSSE+DSSVY+ C
Subjt:  NGNFREINSIIMKRICSSEMDSSVYSGC

A0A5D3D8F3 Uncharacterized protein; e-value 5.53e-267; identity 76.69%
Query:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF------------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNL
        MGL LTGKSKSTAGENWGMG LLVFFSEDSPS IAD   LF            SS+S+RRSNYNLLT+AQS ISVCALLVF+SLLLFTLSTFEP IKMNL
Subjt:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF------------SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNL

Query:  TPPRRFLSQKSAPIEVRTPSGKYVYRWSWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAK
        TPPRR L+QKS PI+VR P G    RW+W GKMWKQKPA  K T   A   VALQ MGTLYMRGTRAM DL VVHV+EDVGEED RLFLRLFHRSGVTAK
Subjt:  TPPRRFLSQKSAPIEVRTPSGKYVYRWSWLGKMWKQKPATAKRTDA-AAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAK

Query:  SDSVFIFPSATFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAA
        SDSVF+FPS  FS RFG +IREEN+SFLKL+ RYRNLN TASR AA GFDVT+  KSKEKKETEEP+WGKRVKR +N S+GGEDELTRLSYGSVVSFDA 
Subjt:  SDSVFIFPSATFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAA

Query:  EIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNP
        EIDPENSLSGFSDHIPMSLRRW+CYPMLLGRVRRNFKH+MLIDAKSSLLLGDPL RVR+K TESVI FT            KHGKKNS+KSNS  H+VNP
Subjt:  EIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNP

Query:  AVVIGGARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNNGNFREINS
        ++VIGGARG+RR+SNAA+VEI R LMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVI A+ESIPEASS  G + +SVG LSAPE  +F +GNNGN  EINS
Subjt:  AVVIGGARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNNGNFREINS

Query:  IIMKRICSSEMDSSVYSGC
        +IMK+ICSSE+DSSVY+ C
Subjt:  IIMKRICSSEMDSSVYSGC

A0A6J1EAJ1 uncharacterized protein LOC111432312; e-value 3.39e-253; identity 75.93%
Query:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF----SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNLTPPRRFLS
        MGL +TGKSKSTA ENWGMG  LVFFSEDSPSAIAD  KLF    SSSS RRSNYNLL++AQS ISVCALLVFVSLLLFTLSTFEPAIKMNLTPPRR LS
Subjt:  MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLF----SSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNLTPPRRFLS

Query:  QKSAPIEVRTPSGKYVYRWSWLGKMWKQKPATAKRTDAAAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAKSDSVFIFPS
        +KS PIE+RTPS   V RW+W  KMWKQKPA       +    ALQ MGTLY+RGTRAMAD+ VVHV EDV E+D RLFLRLFHRSGVTAKSDSVFIF S
Subjt:  QKSAPIEVRTPSGKYVYRWSWLGKMWKQKPATAKRTDAAAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAKSDSVFIFPS

Query:  ATFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAAEIDPENSLS
          FS +FG +IREENESFLKL+ R RN N TA+RRA  GFDV QFVK KEKKE EEP+WGK+ KR +N+S GGEDELTRLSYGSVVSFDAAEIDPENSLS
Subjt:  ATFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAAEIDPENSLS

Query:  GFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNPAVVIGGARG
        GFSDHIPMSLRRWACYPMLLGRVRRNFKH+ML+DAK+S+L+GDPLGR+R+K TESVILFT            KH KKNS+KS++   LVNPAVVIGGARG
Subjt:  GFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNPAVVIGGARG

Query:  VRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENA-IFQRGNNGNFREINSIIMKRICS
        VRRLSNA +VEIAR LMQHKK NSVSDS VLSHLVNSEFLLKNVKVI A ESIP+AS LAG +S SVGSLSAPE   I +R N GN REINS+I+K+ICS
Subjt:  VRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENA-IFQRGNNGNFREINSIIMKRICS

Query:  SEMDSSVYSGC
        SE+DSSVYS C
Subjt:  SEMDSSVYSGC
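
The TrEMBL hits can be ranked either by e-value or by percent identity, and the two orderings need not agree. A small sketch, with the accession, e-value and identity figures copied from the hit lines above (the tuple layout and variable names are illustrative only):

```python
# (accession, e-value, percent identity) as listed for the TrEMBL hits.
hits = [
    ("A0A0A0KWA6", 2.43e-264, 77.24),
    ("A0A1S3CD81", 1.22e-265, 75.52),
    ("A0A5A7TPI4", 5.16e-265, 75.38),
    ("A0A5D3D8F3", 5.53e-267, 76.69),
    ("A0A6J1EAJ1", 3.39e-253, 75.93),
]

# Smaller e-values indicate stronger hits, so sort ascending.
by_evalue = sorted(hits, key=lambda hit: hit[1])
# Percent identity can rank the hits differently, so keep a separate view.
by_identity = sorted(hits, key=lambda hit: hit[2], reverse=True)

print([acc for acc, _, _ in by_evalue])
print([acc for acc, _, _ in by_identity])
```

Here the smallest e-value belongs to A0A5D3D8F3, while the highest identity belongs to A0A0A0KWA6. These values still fit in a double-precision float; e-values far below roughly 1e-308 would underflow and would need to be compared by exponent instead.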

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT3G57400.1 unknown protein; e-value 5.7e-108; identity 47.04%
Query:  SKSTAGENWGMGFLLVFF------SEDSPSAIADQ---KKLFSSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPA----IKMNLTPPRRFL
        +K T     GMG LLVFF      ++DSPS+ +       LF S    RS+  LL++AQS IS+C LL+F++L LFTLSTFEP+       +  P RRFL
Subjt:  SKSTAGENWGMGFLLVFF------SEDSPSAIADQ---KKLFSSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPA----IKMNLTPPRRFL

Query:  SQK--SAPIEVRTPSGKYVYRWSWLGKMWKQKPATAKRTDAAAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAKSDSVFI
          +  SA  E R    ++                            ALQGMGTL++RGT++M DL VVH++ D  E+DLRLF+RL HRSGVT+KSD V +
Subjt:  SQK--SAPIEVRTPSGKYVYRWSWLGKMWKQKPATAKRTDAAAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAKSDSVFI

Query:  FPSATFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTR----LSYGSVVSFDAAEI
        F S T   RF  +I EEN+SFLKLV  +R  NS+    +  GF++T+F+K + K  + EP+WGK+  R +       +  T     L++GSVV FD  E+
Subjt:  FPSATFSSRFGSVIREENESFLKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTR----LSYGSVVSFDAAEI

Query:  DPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNPAV
        DPENSLSGF DH+P+SLRRWACYPMLLGRVRRNFKH+ML+DAK+SL LGDPL R+R++S ESV+ F+K       SSS+   KK+S+        VNPA+
Subjt:  DPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNPAV

Query:  VIGGARGVRRLSNAALVEIARTLM--QHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRG----NNGNFR
        +IGGA+G+RRLS++   EI R  +  QHKKKNSV++S VLS LV +  + KN +V+T+   +PEASSLA   + +  + S   + I QRG    N+ +  
Subjt:  VIGGARGVRRLSNAALVEIARTLM--QHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRG----NNGNFR

Query:  EINSIIMKRICSSEMDSSVYSGC
        +I +IIMKRICS E+DSSVY+ C
Subjt:  EINSIIMKRICSSEMDSSVYSGC

AT5G52500.1 unknown protein; e-value 1.7e-75; identity 40.23%
Query:  MGFLLVFF----SEDSPSAIADQKKLFSSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNLTPPRRFLSQKSAPIEVRTPSGKYVYR
        MG  LV F    + DS S  +  K  F S   + S+  LL++A+S IS C +L+F++L LFTLSTFE        P  RF +       + +P  +++  
Subjt:  MGFLLVFF----SEDSPSAIADQKKLFSSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNLTPPRRFLSQKSAPIEVRTPSGKYVYR

Query:  WSWLGKMWKQKPATAKRTDAAAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAKSDSVFIFPSATFSSRFGSVIREENESF
         + + + +                 ALQGMGTL++RGT++M DL + H+A    E DLRLF+RL HRSGVT+KSD V +F S + ++RF  +I +EN SF
Subjt:  WSWLGKMWKQKPATAKRTDAAAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAKSDSVFIFPSATFSSRFGSVIREENESF

Query:  LKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAAEIDPENSLSGFSDHIPMSLRRWACYPM
        LKLV  +RN + T+S                    +E  +WGK+ +  +N +         L++GS+V FD  E+DPENSLSGF D +P+SLRRWACYPM
Subjt:  LKLVRRYRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAAEIDPENSLSGFSDHIPMSLRRWACYPM

Query:  LLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNPAVVIGGARGVRRLSNAALVEIAR-TLM
        LLGRVRR+FKH+ML+DAK+S  +GDP  R+R++S +SV+ F            +KH  KN+ +       VNP ++IGGA+G+RRLS++   EI R T+M
Subjt:  LLGRVRRNFKHIMLIDAKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNPAVVIGGARGVRRLSNAALVEIAR-TLM

Query:  QHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEAS
        +      V++S VLS LV +  + KN +V+ +   +PEA+
Subjt:  QHKKKNSVSDSGVLSHLVNSEFLLKNVKVITAAESIPEAS


Sequences
CDS sequence
ATGGGTCTCCCTCTCACCGGAAAATCCAAATCCACCGCCGGCGAGAACTGGGGCATGGGCTTTCTTCTCGTCTTCTTCTCGGAGGACTCCCCCTCCGCCATTGCCGACCA
GAAGAAGCTCTTCTCTTCTTCCTCTGTTCGTCGGAGTAATTACAATCTTCTCACCAGAGCTCAGTCCGTTATCTCCGTCTGCGCTCTCCTCGTCTTCGTCTCTCTCCTTC
TCTTCACTCTCTCGACCTTCGAGCCCGCCATTAAAATGAACCTCACTCCGCCTCGGAGGTTTTTATCTCAGAAATCGGCGCCGATTGAAGTCCGTACGCCGTCGGGGAAG
TATGTGTATCGGTGGAGTTGGTTGGGGAAAATGTGGAAGCAGAAACCGGCAACGGCGAAGAGGACCGATGCGGCTGCTCCGGCGGTGGCGCTGCAAGGAATGGGGACTCT
GTACATGCGAGGTACTCGAGCCATGGCGGACCTCGCGGTGGTTCACGTGGCGGAGGATGTCGGAGAAGAAGACCTCCGACTCTTTCTCCGACTGTTTCATCGGTCCGGTG
TCACCGCGAAATCCGATTCCGTGTTCATCTTCCCTTCGGCGACGTTCTCCTCGAGATTCGGTTCAGTGATTCGGGAGGAAAACGAGTCGTTCTTGAAACTCGTTCGTCGG
TACCGGAATTTGAACAGCACAGCTAGCCGGAGAGCGGCGGTGGGTTTCGACGTGACTCAGTTTGTAAAGAGTAAAGAGAAGAAGGAAACGGAGGAGCCGCTGTGGGGGAA
GAGAGTGAAACGAGTCTCCAACGAGTCGGACGGCGGCGAGGACGAGTTGACTCGGCTGAGTTACGGCTCGGTAGTGAGTTTCGACGCGGCGGAGATAGATCCGGAGAATT
CGCTCTCCGGATTCTCAGATCACATTCCGATGAGTCTACGGCGGTGGGCGTGCTATCCGATGCTCCTCGGCCGAGTCCGCCGGAATTTCAAGCACATAATGCTCATCGAC
GCCAAAAGCTCGCTCCTACTCGGCGATCCACTCGGCCGAGTCAGAAGCAAAAGCACCGAGTCAGTAATTCTCTTCACAAAACCAGAACAAATCCAATTCCCATCCTCATC
AACCAAGCACGGCAAAAAGAACTCGGACAAATCCAATTCCCGCCGCCACCTGGTAAATCCGGCCGTCGTGATCGGCGGGGCTCGCGGCGTCCGGCGGCTATCGAACGCCG
CACTGGTGGAGATAGCCCGAACCCTAATGCAGCACAAGAAGAAGAACTCGGTCTCCGACTCGGGAGTACTGAGTCACCTCGTTAACAGCGAGTTCTTGTTGAAGAATGTG
AAGGTGATCACGGCGGCTGAGTCGATTCCCGAAGCGAGTTCACTCGCCGGGTTTGACTCGGATTCAGTCGGTTCCTTGTCGGCGCCGGAGAATGCGATATTCCAGCGGGG
TAATAATGGTAATTTTCGGGAAATTAATTCTATAATTATGAAGAGAATATGTTCGTCGGAAATGGATTCTTCTGTCTATAGTGGCTGC
mRNA sequence
ATGGGTCTCCCTCTCACCGGAAAATCCAAATCCACCGCCGGCGAGAACTGGGGCATGGGCTTTCTTCTCGTCTTCTTCTCGGAGGACTCCCCCTCCGCCATTGCCGACCA
GAAGAAGCTCTTCTCTTCTTCCTCTGTTCGTCGGAGTAATTACAATCTTCTCACCAGAGCTCAGTCCGTTATCTCCGTCTGCGCTCTCCTCGTCTTCGTCTCTCTCCTTC
TCTTCACTCTCTCGACCTTCGAGCCCGCCATTAAAATGAACCTCACTCCGCCTCGGAGGTTTTTATCTCAGAAATCGGCGCCGATTGAAGTCCGTACGCCGTCGGGGAAG
TATGTGTATCGGTGGAGTTGGTTGGGGAAAATGTGGAAGCAGAAACCGGCAACGGCGAAGAGGACCGATGCGGCTGCTCCGGCGGTGGCGCTGCAAGGAATGGGGACTCT
GTACATGCGAGGTACTCGAGCCATGGCGGACCTCGCGGTGGTTCACGTGGCGGAGGATGTCGGAGAAGAAGACCTCCGACTCTTTCTCCGACTGTTTCATCGGTCCGGTG
TCACCGCGAAATCCGATTCCGTGTTCATCTTCCCTTCGGCGACGTTCTCCTCGAGATTCGGTTCAGTGATTCGGGAGGAAAACGAGTCGTTCTTGAAACTCGTTCGTCGG
TACCGGAATTTGAACAGCACAGCTAGCCGGAGAGCGGCGGTGGGTTTCGACGTGACTCAGTTTGTAAAGAGTAAAGAGAAGAAGGAAACGGAGGAGCCGCTGTGGGGGAA
GAGAGTGAAACGAGTCTCCAACGAGTCGGACGGCGGCGAGGACGAGTTGACTCGGCTGAGTTACGGCTCGGTAGTGAGTTTCGACGCGGCGGAGATAGATCCGGAGAATT
CGCTCTCCGGATTCTCAGATCACATTCCGATGAGTCTACGGCGGTGGGCGTGCTATCCGATGCTCCTCGGCCGAGTCCGCCGGAATTTCAAGCACATAATGCTCATCGAC
GCCAAAAGCTCGCTCCTACTCGGCGATCCACTCGGCCGAGTCAGAAGCAAAAGCACCGAGTCAGTAATTCTCTTCACAAAACCAGAACAAATCCAATTCCCATCCTCATC
AACCAAGCACGGCAAAAAGAACTCGGACAAATCCAATTCCCGCCGCCACCTGGTAAATCCGGCCGTCGTGATCGGCGGGGCTCGCGGCGTCCGGCGGCTATCGAACGCCG
CACTGGTGGAGATAGCCCGAACCCTAATGCAGCACAAGAAGAAGAACTCGGTCTCCGACTCGGGAGTACTGAGTCACCTCGTTAACAGCGAGTTCTTGTTGAAGAATGTG
AAGGTGATCACGGCGGCTGAGTCGATTCCCGAAGCGAGTTCACTCGCCGGGTTTGACTCGGATTCAGTCGGTTCCTTGTCGGCGCCGGAGAATGCGATATTCCAGCGGGG
TAATAATGGTAATTTTCGGGAAATTAATTCTATAATTATGAAGAGAATATGTTCGTCGGAAATGGATTCTTCTGTCTATAGTGGCTGC
Protein sequence
MGLPLTGKSKSTAGENWGMGFLLVFFSEDSPSAIADQKKLFSSSSVRRSNYNLLTRAQSVISVCALLVFVSLLLFTLSTFEPAIKMNLTPPRRFLSQKSAPIEVRTPSGK
YVYRWSWLGKMWKQKPATAKRTDAAAPAVALQGMGTLYMRGTRAMADLAVVHVAEDVGEEDLRLFLRLFHRSGVTAKSDSVFIFPSATFSSRFGSVIREENESFLKLVRR
YRNLNSTASRRAAVGFDVTQFVKSKEKKETEEPLWGKRVKRVSNESDGGEDELTRLSYGSVVSFDAAEIDPENSLSGFSDHIPMSLRRWACYPMLLGRVRRNFKHIMLID
AKSSLLLGDPLGRVRSKSTESVILFTKPEQIQFPSSSTKHGKKNSDKSNSRRHLVNPAVVIGGARGVRRLSNAALVEIARTLMQHKKKNSVSDSGVLSHLVNSEFLLKNV
KVITAAESIPEASSLAGFDSDSVGSLSAPENAIFQRGNNGNFREINSIIMKRICSSEMDSSVYSGC
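
The CDS and protein sequences can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch using only the first 30 nucleotides of the CDS and the first 10 residues of the protein as listed above; the `translate` helper is hypothetical, and use of the standard nuclear code is an assumption:

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code: codons enumerated in TCAG order map onto AMINO_ACIDS.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, ignoring a trailing partial codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

cds_start = "ATGGGTCTCCCTCTCACCGGAAAATCCAAA"  # first 30 nt of the CDS
protein_start = "MGLPLTGKSK"                  # first 10 residues of the protein
print(translate(cds_start) == protein_start)
```

The same function could be applied to the full CDS after joining its lines and compared against the joined protein sequence.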