CuGenDBv2

Sgr022318 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr022318
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Protein of unknown function (DUF1005)
Genome location: tig00154107:543521..546346
RNA-Seq Expression: Sgr022318
Synteny: Sgr022318
Gene Ontology terms: NA
InterPro domains: IPR010410 - Protein of unknown function DUF1005
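
As a small illustration of how the genome location field can be read programmatically, the following Python sketch splits it into contig, start and end. The function name `parse_location` is hypothetical, and the span length assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)

contig, start, end = parse_location("tig00154107:543521..546346")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(contig, start, end, span)
```

Under that assumption the gene spans 2826 bp on contig tig00154107.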


Homology
GenBank top hits
KAA0061519.1 DUF1005 domain-containing protein [Cucumis melo var. makuwa] | e value: 2.6e-217 | identity: 88.42%
Query:  MDPCPFVRLMVESLALNLPQPTRPAGPGVHPSTTPCFCKIAIKNFPSQTALLPLSSISPGDSLPDSTASSSGFHLDPSSLRRLSCKPLLLCLSVFSGRMG
        MDPCPFVRLMV+SLALNLPQ TRPAG  VHPS TPCFCKI+IKNFPSQTALLPLSS+S GDS PDS ASS+GFHLDPSSLRRLS KP+++CLSVF+GRMG
Subjt:  MDPCPFVRLMVESLALNLPQPTRPAGPGVHPSTTPCFCKIAIKNFPSQTALLPLSSISPGDSLPDSTASSSGFHLDPSSLRRLSCKPLLLCLSVFSGRMG

Query:  HTCGVNSGKFLGRVRISVTLDGADSKPRVFQNGWVKLGKEDDKISARLHLVVRSQPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS
        HTCGVNSGK LGRVRI+V++DGA++KP+VFQNGWVKLGK++DKISARLHLVVRS+PDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS
Subjt:  HTCGVNSGKFLGRVRISVTLDGADSKPRVFQNGWVKLGKEDDKISARLHLVVRSQPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS

Query:  VTSDFSFNSTKGKWMRTFSGEREKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRDRGPVDG
        + SDFSFNSTKGKWMRTFSGEREK GRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWR+RGP+DG
Subjt:  VTSDFSFNSTKGKWMRTFSGEREKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRDRGPVDG

Query:  LGYKFDLVADTGLPTGIPIAEATMSVKKGGQFCIERKTVRDLSPNSRS---ASFVMASSVEGEGKVSKPVVEVGVQHVTCMADAALFVALAAAIDLSMDA
        LGYKF+LVADTGL TGIPIAEATMSVKKGGQFCI+RKTVRD + NS+S    SFVMASSVEGEGKVSKP+V+VGVQHVTCMADAALFVAL+AAIDLSMDA
Subjt:  LGYKFDLVADTGLPTGIPIAEATMSVKKGGQFCIERKTVRDLSPNSRS---ASFVMASSVEGEGKVSKPVVEVGVQHVTCMADAALFVALAAAIDLSMDA

Query:  CRHFTQKLRRELCHDYQHHSNFL
        CRHFTQKLRRELCHD +H S+FL
Subjt:  CRHFTQKLRRELCHDYQHHSNFL

XP_008458969.1 PREDICTED: uncharacterized protein LOC103498220 [Cucumis melo] | e value: 7.5e-217 | identity: 88.18%
Query:  MDPCPFVRLMVESLALNLPQPTRPAGPGVHPSTTPCFCKIAIKNFPSQTALLPLSSISPGDSLPDSTASSSGFHLDPSSLRRLSCKPLLLCLSVFSGRMG
        MDPCPFVRLMV+SLALNLPQ TRPAG  VHPS TPCFCKI+IKNFPSQTALLPLSS+S GDS PDS ASS+GFHLDPSSLRRLS KP+++CLSVF+GRMG
Subjt:  MDPCPFVRLMVESLALNLPQPTRPAGPGVHPSTTPCFCKIAIKNFPSQTALLPLSSISPGDSLPDSTASSSGFHLDPSSLRRLSCKPLLLCLSVFSGRMG

Query:  HTCGVNSGKFLGRVRISVTLDGADSKPRVFQNGWVKLGKEDDKISARLHLVVRSQPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS
        HTCGVNSGK LGRVRI+V++DGA++KP+VFQNGWVKLGK++DKISARLHLVVRS+PDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS
Subjt:  HTCGVNSGKFLGRVRISVTLDGADSKPRVFQNGWVKLGKEDDKISARLHLVVRSQPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS

Query:  VTSDFSFNSTKGKWMRTFSGEREKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRDRGPVDG
        + SDFSFNSTKGKWMRTFSGEREK GRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWR+RGP+DG
Subjt:  VTSDFSFNSTKGKWMRTFSGEREKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRDRGPVDG

Query:  LGYKFDLVADTGLPTGIPIAEATMSVKKGGQFCIERKTVRDLSPNSRS---ASFVMASSVEGEGKVSKPVVEVGVQHVTCMADAALFVALAAAIDLSMDA
        LGYKF+LVAD+GL TGIPIAEATMSVKKGGQFCI+RKTVRD + NS+S    SFVMASSVEGEGKVSKP+V+VGVQHVTCMADAALFVAL+AAIDLSMDA
Subjt:  LGYKFDLVADTGLPTGIPIAEATMSVKKGGQFCIERKTVRDLSPNSRS---ASFVMASSVEGEGKVSKPVVEVGVQHVTCMADAALFVALAAAIDLSMDA

Query:  CRHFTQKLRRELCHDYQHHSNFL
        CRHFTQKLRRELCHD +H S+FL
Subjt:  CRHFTQKLRRELCHDYQHHSNFL

XP_022154692.1 uncharacterized protein LOC111021889 isoform X1 [Momordica charantia] | e value: 1.1e-218 | identity: 89.86%
Query:  MDPCPFVRLMVESLALNLPQPTRPAGPGVHPSTTPCFCKIAIKNFPSQTALLPLSSISPGDSLPDSTASSSGFHLDPSSLRRLSCKPLLLCLSVFSGRMG
        MDPCPFVRLMVESLALNLPQ TRPAG  VHPS TPCFCKIAIKNFPSQTALLPLSSIS GDS PDS ASSSGFHLDPSSLRRLS KPL++CLSVF+GRMG
Subjt:  MDPCPFVRLMVESLALNLPQPTRPAGPGVHPSTTPCFCKIAIKNFPSQTALLPLSSISPGDSLPDSTASSSGFHLDPSSLRRLSCKPLLLCLSVFSGRMG

Query:  HTCGVNSGKFLGRVRISVTLDGADSKPRVFQNGWVKLGKEDDKISARLHLVVRSQPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS
        HTCGVNSGKFLGRVRI+V LDGADS+PRVF NGWVKLGKE+DKISARLHLVVRS+PDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS
Subjt:  HTCGVNSGKFLGRVRISVTLDGADSKPRVFQNGWVKLGKEDDKISARLHLVVRSQPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS

Query:  VTSDFSFNSTKGKWMRTFSGEREKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRDRGPVDG
        + SDFSFNSTKGKWMRTFSGEREK GRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWR+RGP+DG
Subjt:  VTSDFSFNSTKGKWMRTFSGEREKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRDRGPVDG

Query:  LGYKFDLVADTGLPTGIPIAEATMSVKKGGQFCIERKTVRDLSPNSRS----ASFVMASSVEGEGKVSKPVVEVGVQHVTCMADAALFVALAAAIDLSMD
        LGYKF+LVA+TGL TGI IAEATMSVKKGGQFCI+ +T+RD SPNSRS     +FVMASSVEGEGKVSKP+VEVGV+HVTCMADAALFVALAAAIDLSMD
Subjt:  LGYKFDLVADTGLPTGIPIAEATMSVKKGGQFCIERKTVRDLSPNSRS----ASFVMASSVEGEGKVSKPVVEVGVQHVTCMADAALFVALAAAIDLSMD

Query:  ACRHFTQKLRRELCHDYQHHSNFL
        ACRHFTQKLRRELCHD +H S+FL
Subjt:  ACRHFTQKLRRELCHDYQHHSNFL

XP_022154693.1 uncharacterized protein LOC111021889 isoform X2 [Momordica charantia] | e value: 8.1e-219 | identity: 90.07%
Query:  MDPCPFVRLMVESLALNLPQPTRPAGPGVHPSTTPCFCKIAIKNFPSQTALLPLSSISPGDSLPDSTASSSGFHLDPSSLRRLSCKPLLLCLSVFSGRMG
        MDPCPFVRLMVESLALNLPQ TRPAG  VHPS TPCFCKIAIKNFPSQTALLPLSSIS GDS PDS ASSSGFHLDPSSLRRLS KPL++CLSVF+GRMG
Subjt:  MDPCPFVRLMVESLALNLPQPTRPAGPGVHPSTTPCFCKIAIKNFPSQTALLPLSSISPGDSLPDSTASSSGFHLDPSSLRRLSCKPLLLCLSVFSGRMG

Query:  HTCGVNSGKFLGRVRISVTLDGADSKPRVFQNGWVKLGKEDDKISARLHLVVRSQPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS
        HTCGVNSGKFLGRVRI+V LDGADS+PRVF NGWVKLGKE+DKISARLHLVVRS+PDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS
Subjt:  HTCGVNSGKFLGRVRISVTLDGADSKPRVFQNGWVKLGKEDDKISARLHLVVRSQPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS

Query:  VTSDFSFNSTKGKWMRTFSGEREKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRDRGPVDG
        + SDFSFNSTKGKWMRTFSGEREK GRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWR+RGP+DG
Subjt:  VTSDFSFNSTKGKWMRTFSGEREKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRDRGPVDG

Query:  LGYKFDLVADTGLPTGIPIAEATMSVKKGGQFCIERKTVRDLSPNSRS---ASFVMASSVEGEGKVSKPVVEVGVQHVTCMADAALFVALAAAIDLSMDA
        LGYKF+LVA+TGL TGI IAEATMSVKKGGQFCI+ +T+RD SPNSRS    +FVMASSVEGEGKVSKP+VEVGV+HVTCMADAALFVALAAAIDLSMDA
Subjt:  LGYKFDLVADTGLPTGIPIAEATMSVKKGGQFCIERKTVRDLSPNSRS---ASFVMASSVEGEGKVSKPVVEVGVQHVTCMADAALFVALAAAIDLSMDA

Query:  CRHFTQKLRRELCHDYQHHSNFL
        CRHFTQKLRRELCHD +H S+FL
Subjt:  CRHFTQKLRRELCHDYQHHSNFL

XP_038889424.1 uncharacterized protein LOC120079336 [Benincasa hispida] | e value: 6.8e-218 | identity: 89.13%
Query:  MDPCPFVRLMVESLALNLPQPTRPAGPGVHPSTTPCFCKIAIKNFPSQTALLPLSSISPGDSLPDSTASSSGFHLDPSSLRRLSCKPLLLCLSVFSGRMG
        MDPCPFVRLMVESLALNLPQ TRPAG  VHPS TPCFCKIAIKNFPSQTALLPLSS+S GDS PDS ASS+GFHLDPSSLRR+S  P+++CLSVF+GRMG
Subjt:  MDPCPFVRLMVESLALNLPQPTRPAGPGVHPSTTPCFCKIAIKNFPSQTALLPLSSISPGDSLPDSTASSSGFHLDPSSLRRLSCKPLLLCLSVFSGRMG

Query:  HTCGVNSGKFLGRVRISVTLDGADSKPRVFQNGWVKLGKEDDKISARLHLVVRSQPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS
        HTCGVNSGK LGRVRI+V++DGA++KPRVFQNGWVKLGK+DDKISARLHLVVRS+PDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS
Subjt:  HTCGVNSGKFLGRVRISVTLDGADSKPRVFQNGWVKLGKEDDKISARLHLVVRSQPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS

Query:  VTSDFSFNSTKGKWMRTFSGEREKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRDRGPVDG
        + SDFSFNSTKGKWMRTFSGEREK GRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWR+RGP+DG
Subjt:  VTSDFSFNSTKGKWMRTFSGEREKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRDRGPVDG

Query:  LGYKFDLVADTGLPTGIPIAEATMSVKKGGQFCIERKTVRDLSPNSRS---ASFVMASSVEGEGKVSKPVVEVGVQHVTCMADAALFVALAAAIDLSMDA
        LGYKF+LVADTGL TGIPIAEATMSVKKGGQFCI+RKTVRD S NS+S    +FVMASSVEGEGKVSKPVV+VGVQHVTCMADAALFVAL+AAIDLSMDA
Subjt:  LGYKFDLVADTGLPTGIPIAEATMSVKKGGQFCIERKTVRDLSPNSRS---ASFVMASSVEGEGKVSKPVVEVGVQHVTCMADAALFVALAAAIDLSMDA

Query:  CRHFTQKLRRELCHDYQHHSNFL
        CRHFTQKLRRELCHD +H S+FL
Subjt:  CRHFTQKLRRELCHDYQHHSNFL

TrEMBL top hits
A0A0A0LIQ0 Uncharacterized protein | e value: 4.8e-217 | identity: 88.65%
Query:  MDPCPFVRLMVESLALNLPQPTRPAGPGVHPSTTPCFCKIAIKNFPSQTALLPLSSISPGDSLPDSTASSSGFHLDPSSLRRLSCKPLLLCLSVFSGRMG
        MDPCPFVRLMV+SLALNLPQ TRPAG  VHPS TPCFCKI+IKNFPSQTALLPLSS+S GDS PDS ASS+GFHLDPSSLRRLS KP+++CLSVF+GRMG
Subjt:  MDPCPFVRLMVESLALNLPQPTRPAGPGVHPSTTPCFCKIAIKNFPSQTALLPLSSISPGDSLPDSTASSSGFHLDPSSLRRLSCKPLLLCLSVFSGRMG

Query:  HTCGVNSGKFLGRVRISVTLDGADSKPRVFQNGWVKLGKEDDKISARLHLVVRSQPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS
        HTCGVNSGK LGRVRI+V++DGA+SKP+VFQNGWVKLGK +DKISARLHLVVRS+PDPRFVFQFG EPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS
Subjt:  HTCGVNSGKFLGRVRISVTLDGADSKPRVFQNGWVKLGKEDDKISARLHLVVRSQPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS

Query:  VTSDFSFNSTKGKWMRTFSGEREKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRDRGPVDG
        + SDFSFNSTKGKWMRTFSGEREK GRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWR+RGP+DG
Subjt:  VTSDFSFNSTKGKWMRTFSGEREKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRDRGPVDG

Query:  LGYKFDLVADTGLPTGIPIAEATMSVKKGGQFCIERKTVRDLSPNSRS---ASFVMASSVEGEGKVSKPVVEVGVQHVTCMADAALFVALAAAIDLSMDA
        LGYKF+LVADTGL TGIPIAEATMSVKKGGQFCI+RKTVRDL+ NS+S    SFVMASSVEGEGKVSKP+V+VGVQHVTCMADAALFVAL+AAIDLSMDA
Subjt:  LGYKFDLVADTGLPTGIPIAEATMSVKKGGQFCIERKTVRDLSPNSRS---ASFVMASSVEGEGKVSKPVVEVGVQHVTCMADAALFVALAAAIDLSMDA

Query:  CRHFTQKLRRELCHDYQHHSNFL
        CRHFTQKLRRELCHD +H S+FL
Subjt:  CRHFTQKLRRELCHDYQHHSNFL

A0A1S3C9N5 uncharacterized protein LOC103498220 | e value: 3.6e-217 | identity: 88.18%
Query:  MDPCPFVRLMVESLALNLPQPTRPAGPGVHPSTTPCFCKIAIKNFPSQTALLPLSSISPGDSLPDSTASSSGFHLDPSSLRRLSCKPLLLCLSVFSGRMG
        MDPCPFVRLMV+SLALNLPQ TRPAG  VHPS TPCFCKI+IKNFPSQTALLPLSS+S GDS PDS ASS+GFHLDPSSLRRLS KP+++CLSVF+GRMG
Subjt:  MDPCPFVRLMVESLALNLPQPTRPAGPGVHPSTTPCFCKIAIKNFPSQTALLPLSSISPGDSLPDSTASSSGFHLDPSSLRRLSCKPLLLCLSVFSGRMG

Query:  HTCGVNSGKFLGRVRISVTLDGADSKPRVFQNGWVKLGKEDDKISARLHLVVRSQPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS
        HTCGVNSGK LGRVRI+V++DGA++KP+VFQNGWVKLGK++DKISARLHLVVRS+PDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS
Subjt:  HTCGVNSGKFLGRVRISVTLDGADSKPRVFQNGWVKLGKEDDKISARLHLVVRSQPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS

Query:  VTSDFSFNSTKGKWMRTFSGEREKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRDRGPVDG
        + SDFSFNSTKGKWMRTFSGEREK GRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWR+RGP+DG
Subjt:  VTSDFSFNSTKGKWMRTFSGEREKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRDRGPVDG

Query:  LGYKFDLVADTGLPTGIPIAEATMSVKKGGQFCIERKTVRDLSPNSRS---ASFVMASSVEGEGKVSKPVVEVGVQHVTCMADAALFVALAAAIDLSMDA
        LGYKF+LVAD+GL TGIPIAEATMSVKKGGQFCI+RKTVRD + NS+S    SFVMASSVEGEGKVSKP+V+VGVQHVTCMADAALFVAL+AAIDLSMDA
Subjt:  LGYKFDLVADTGLPTGIPIAEATMSVKKGGQFCIERKTVRDLSPNSRS---ASFVMASSVEGEGKVSKPVVEVGVQHVTCMADAALFVALAAAIDLSMDA

Query:  CRHFTQKLRRELCHDYQHHSNFL
        CRHFTQKLRRELCHD +H S+FL
Subjt:  CRHFTQKLRRELCHDYQHHSNFL

A0A5A7V777 DUF1005 domain-containing protein | e value: 1.3e-217 | identity: 88.42%
Query:  MDPCPFVRLMVESLALNLPQPTRPAGPGVHPSTTPCFCKIAIKNFPSQTALLPLSSISPGDSLPDSTASSSGFHLDPSSLRRLSCKPLLLCLSVFSGRMG
        MDPCPFVRLMV+SLALNLPQ TRPAG  VHPS TPCFCKI+IKNFPSQTALLPLSS+S GDS PDS ASS+GFHLDPSSLRRLS KP+++CLSVF+GRMG
Subjt:  MDPCPFVRLMVESLALNLPQPTRPAGPGVHPSTTPCFCKIAIKNFPSQTALLPLSSISPGDSLPDSTASSSGFHLDPSSLRRLSCKPLLLCLSVFSGRMG

Query:  HTCGVNSGKFLGRVRISVTLDGADSKPRVFQNGWVKLGKEDDKISARLHLVVRSQPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS
        HTCGVNSGK LGRVRI+V++DGA++KP+VFQNGWVKLGK++DKISARLHLVVRS+PDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS
Subjt:  HTCGVNSGKFLGRVRISVTLDGADSKPRVFQNGWVKLGKEDDKISARLHLVVRSQPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS

Query:  VTSDFSFNSTKGKWMRTFSGEREKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRDRGPVDG
        + SDFSFNSTKGKWMRTFSGEREK GRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWR+RGP+DG
Subjt:  VTSDFSFNSTKGKWMRTFSGEREKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRDRGPVDG

Query:  LGYKFDLVADTGLPTGIPIAEATMSVKKGGQFCIERKTVRDLSPNSRS---ASFVMASSVEGEGKVSKPVVEVGVQHVTCMADAALFVALAAAIDLSMDA
        LGYKF+LVADTGL TGIPIAEATMSVKKGGQFCI+RKTVRD + NS+S    SFVMASSVEGEGKVSKP+V+VGVQHVTCMADAALFVAL+AAIDLSMDA
Subjt:  LGYKFDLVADTGLPTGIPIAEATMSVKKGGQFCIERKTVRDLSPNSRS---ASFVMASSVEGEGKVSKPVVEVGVQHVTCMADAALFVALAAAIDLSMDA

Query:  CRHFTQKLRRELCHDYQHHSNFL
        CRHFTQKLRRELCHD +H S+FL
Subjt:  CRHFTQKLRRELCHDYQHHSNFL

A0A6J1DL04 uncharacterized protein LOC111021889 isoform X2 | e value: 3.9e-219 | identity: 90.07%
Query:  MDPCPFVRLMVESLALNLPQPTRPAGPGVHPSTTPCFCKIAIKNFPSQTALLPLSSISPGDSLPDSTASSSGFHLDPSSLRRLSCKPLLLCLSVFSGRMG
        MDPCPFVRLMVESLALNLPQ TRPAG  VHPS TPCFCKIAIKNFPSQTALLPLSSIS GDS PDS ASSSGFHLDPSSLRRLS KPL++CLSVF+GRMG
Subjt:  MDPCPFVRLMVESLALNLPQPTRPAGPGVHPSTTPCFCKIAIKNFPSQTALLPLSSISPGDSLPDSTASSSGFHLDPSSLRRLSCKPLLLCLSVFSGRMG

Query:  HTCGVNSGKFLGRVRISVTLDGADSKPRVFQNGWVKLGKEDDKISARLHLVVRSQPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS
        HTCGVNSGKFLGRVRI+V LDGADS+PRVF NGWVKLGKE+DKISARLHLVVRS+PDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS
Subjt:  HTCGVNSGKFLGRVRISVTLDGADSKPRVFQNGWVKLGKEDDKISARLHLVVRSQPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS

Query:  VTSDFSFNSTKGKWMRTFSGEREKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRDRGPVDG
        + SDFSFNSTKGKWMRTFSGEREK GRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWR+RGP+DG
Subjt:  VTSDFSFNSTKGKWMRTFSGEREKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRDRGPVDG

Query:  LGYKFDLVADTGLPTGIPIAEATMSVKKGGQFCIERKTVRDLSPNSRS---ASFVMASSVEGEGKVSKPVVEVGVQHVTCMADAALFVALAAAIDLSMDA
        LGYKF+LVA+TGL TGI IAEATMSVKKGGQFCI+ +T+RD SPNSRS    +FVMASSVEGEGKVSKP+VEVGV+HVTCMADAALFVALAAAIDLSMDA
Subjt:  LGYKFDLVADTGLPTGIPIAEATMSVKKGGQFCIERKTVRDLSPNSRS---ASFVMASSVEGEGKVSKPVVEVGVQHVTCMADAALFVALAAAIDLSMDA

Query:  CRHFTQKLRRELCHDYQHHSNFL
        CRHFTQKLRRELCHD +H S+FL
Subjt:  CRHFTQKLRRELCHDYQHHSNFL

A0A6J1DMB6 uncharacterized protein LOC111021889 isoform X1 | e value: 5.1e-219 | identity: 89.86%
Query:  MDPCPFVRLMVESLALNLPQPTRPAGPGVHPSTTPCFCKIAIKNFPSQTALLPLSSISPGDSLPDSTASSSGFHLDPSSLRRLSCKPLLLCLSVFSGRMG
        MDPCPFVRLMVESLALNLPQ TRPAG  VHPS TPCFCKIAIKNFPSQTALLPLSSIS GDS PDS ASSSGFHLDPSSLRRLS KPL++CLSVF+GRMG
Subjt:  MDPCPFVRLMVESLALNLPQPTRPAGPGVHPSTTPCFCKIAIKNFPSQTALLPLSSISPGDSLPDSTASSSGFHLDPSSLRRLSCKPLLLCLSVFSGRMG

Query:  HTCGVNSGKFLGRVRISVTLDGADSKPRVFQNGWVKLGKEDDKISARLHLVVRSQPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS
        HTCGVNSGKFLGRVRI+V LDGADS+PRVF NGWVKLGKE+DKISARLHLVVRS+PDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS
Subjt:  HTCGVNSGKFLGRVRISVTLDGADSKPRVFQNGWVKLGKEDDKISARLHLVVRSQPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRS

Query:  VTSDFSFNSTKGKWMRTFSGEREKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRDRGPVDG
        + SDFSFNSTKGKWMRTFSGEREK GRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWR+RGP+DG
Subjt:  VTSDFSFNSTKGKWMRTFSGEREKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRDRGPVDG

Query:  LGYKFDLVADTGLPTGIPIAEATMSVKKGGQFCIERKTVRDLSPNSRS----ASFVMASSVEGEGKVSKPVVEVGVQHVTCMADAALFVALAAAIDLSMD
        LGYKF+LVA+TGL TGI IAEATMSVKKGGQFCI+ +T+RD SPNSRS     +FVMASSVEGEGKVSKP+VEVGV+HVTCMADAALFVALAAAIDLSMD
Subjt:  LGYKFDLVADTGLPTGIPIAEATMSVKKGGQFCIERKTVRDLSPNSRS----ASFVMASSVEGEGKVSKPVVEVGVQHVTCMADAALFVALAAAIDLSMD

Query:  ACRHFTQKLRRELCHDYQHHSNFL
        ACRHFTQKLRRELCHD +H S+FL
Subjt:  ACRHFTQKLRRELCHDYQHHSNFL

SwissProt top hits
No hits found
Arabidopsis top hits
AT1G10020.1 Protein of unknown function (DUF1005) | e value: 1.7e-121 | identity: 50.11%
Query:  MDPCPFVRLMVESLALNLPQPTRPAGPGVHPSTTPCFCKIAIKNFPSQTALLPLSSISPGDSLPDSTASSSGFHLDPSSLRRLSCKPLL-----LCLSVF
        MDPCPF+RL + +LAL +P   +     VHPS++PCFCKI +KNFP QTA +P   +      P+    ++ FHL  S ++RL+ + +      L + ++
Subjt:  MDPCPFVRLMVESLALNLPQPTRPAGPGVHPSTTPCFCKIAIKNFPSQTALLPLSSISPGDSLPDSTASSSGFHLDPSSLRRLSCKPLL-----LCLSVF

Query:  SGRMGHTCGVNSGKFLGRVRISVTLDGADSKPRVFQNGWVKLGKEDDK--ISARLHLVVRSQPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFS--
        +GR G  CGV+SG+ L +V + + L G  SKP VF NGW+ +GK   K   SA+ HL V+++PDPRFVFQF GEPECSP V QIQGNIRQPVF+CKFS  
Subjt:  SGRMGHTCGVNSGKFLGRVRISVTLDGADSKPRVFQNGWVKLGKEDDK--ISARLHLVVRSQPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFS--

Query:  --ADRNSRTRSVTSDFSFNSTKGKWMRTFSGEREKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRL
           DR  R+RS+ ++    S    W+ +F  ERE+ G+ERKGW I V+DLSGSPVA AS++TPFV SPGTDRVSRSNPG+WLILRP      +W+PWGRL
Subjt:  --ADRNSRTRSVTSDFSFNSTKGKWMRTFSGEREKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRL

Query:  EAWRDR-GPVDGLGYKFDLVADTGLPTGIPIAEATMSVKKGGQFCIE-RKTVRDLSPNS---RSAS----------------------------------
        EAWR+R G  DGLGY+F+L+ D     GI +AE+T+S  +GG+F IE   +    SP S   RS S                                  
Subjt:  EAWRDR-GPVDGLGYKFDLVADTGLPTGIPIAEATMSVKKGGQFCIE-RKTVRDLSPNS---RSAS----------------------------------

Query:  ---FVMASSVEGEGKVSKPVVEVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCHD
           FVM++SVEGEGK SKP VEV VQHV+CM DAA +VAL+AAIDLSMDACR F Q++R+ELCH+
Subjt:  ---FVMASSVEGEGKVSKPVVEVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCHD

AT1G50040.1 Protein of unknown function (DUF1005) | e value: 1.5e-85 | identity: 43.16%
Query:  MDPCPFVRLMVESLALNLPQ-------PTRPAGPGVHP-STTPCFCKIAIKNFPSQTALLPLSSISPGDS-----LPDSTASSSGFHLDPSSLRR--LSC
        MDPC FVR++V +LA+  P+        +  +GP V   S+  C+CKI  K+FP Q   +P+   +  +S       + +  ++ F L  S +       
Subjt:  MDPCPFVRLMVESLALNLPQ-------PTRPAGPGVHP-STTPCFCKIAIKNFPSQTALLPLSSISPGDS-----LPDSTASSSGFHLDPSSLRR--LSC

Query:  KPLLLCLSVFSGRMGHTCG---VNSGKFLGRVRISVTLDGADSKPRVFQNGWVKLG---KEDDKISA--RLHLVVRSQPDPRFVFQFGGEPECSPVVFQI
        K  +L + V+S R   +CG    +  K +GR ++++ L  A+SK  +  NGWV LG   K + K  +   LH+ VR +PD RFVFQF GEPECSP VFQ+
Subjt:  KPLLLCLSVFSGRMGHTCG---VNSGKFLGRVRISVTLDGADSKPRVFQNGWVKLG---KEDDKISA--RLHLVVRSQPDPRFVFQFGGEPECSPVVFQI

Query:  QGNIRQPVFSCKFSADRNSRTRSVTSDFSFNSTKGKWMRTFSGEREKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHG
        QGN +Q VF+CKF   RNS  R+++   S + T GK         E+  +ERKGW I ++DLSGSPVA ASM+TPFVPSPG++RVSRS+PGAWLILRP G
Subjt:  QGNIRQPVFSCKFSADRNSRTRSVTSDFSFNSTKGKWMRTFSGEREKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHG

Query:  FSVSSWKPWGRLEAWRDRGPVDGLGYKFDLVADTGLPTGIPIAEATMSVKKGGQFCIERKT----------------------VRDLSPNSRSAS-----
        +   +WKPW RL+AWR+ G  D LGY+F+L  D G+   +  A +++S K GG F I+  T                      +R    +S S S     
Subjt:  FSVSSWKPWGRLEAWRDRGPVDGLGYKFDLVADTGLPTGIPIAEATMSVKKGGQFCIERKT----------------------VRDLSPNSRSAS-----

Query:  ---------FVMASSVEGEGKVSKPVVEVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL
                 FVM++ V+G  K SKP VEVGV+HVTC  DAA  VALAAA+DLSMDACR F+QKLR EL
Subjt:  ---------FVMASSVEGEGKVSKPVVEVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL

AT3G19680.1 Protein of unknown function (DUF1005) | e value: 2.3e-91 | identity: 42.94%
Query:  MDPCPFVRLMVESLALNLPQPTR-------PAGPGVHPSTTPCFCKIAIKNFPSQTALLPLSSISPGDSLPDSTASSSG--------FHLDPSSLRRLSC
        MDPC FVR++V +LA+  P  +        P+  G++P+   C+CKI  KNFP +   +P+   +  +S  ++  SSSG        F L  + +     
Subjt:  MDPCPFVRLMVESLALNLPQPTR-------PAGPGVHPSTTPCFCKIAIKNFPSQTALLPLSSISPGDSLPDSTASSSG--------FHLDPSSLRRLSC

Query:  KPLLLCLSVFS----------GRMGHTCGVNSG--KFLGRVRISVTLDGADSKPRVFQNGWVKLGKEDDKISA----RLHLVVRSQPDPRFVFQFGGEPE
        KP    LSV +          G  G +CG+ +   K LGR  +S+ L  A++K  +  NGWV L  +  K        LH+ VR +PDPRFVFQF GEPE
Subjt:  KPLLLCLSVFS----------GRMGHTCGVNSG--KFLGRVRISVTLDGADSKPRVFQNGWVKLGKEDDKISA----RLHLVVRSQPDPRFVFQFGGEPE

Query:  CSPVVFQIQGNIRQPVFSCKF-SADRNSRTRSV---TSDFSFNSTKGKWMRTFSGEREKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRS
        CSP VFQ+QGN +Q VF+CKF S + NS  R++   +S  S  S+    + + + E+E+  +ERKGW I V+DLSGSPVA ASM+TPFVPSPG++RV+RS
Subjt:  CSPVVFQIQGNIRQPVFSCKF-SADRNSRTRSV---TSDFSFNSTKGKWMRTFSGEREKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRS

Query:  NPGAWLILRPHGFSVSSWKPWGRLEAWRDRGPVDGLGYKFDLVADTGLPTGIPIAEATMSVKKGGQFCIE-------RKTVRDLSP------------NS
        +PGAWLILRP G    +WKPWGRLEAWR+ G  D LGY+F+L  D G+ T +  A +++S+K GG F I+         +    SP             S
Subjt:  NPGAWLILRPHGFSVSSWKPWGRLEAWRDRGPVDGLGYKFDLVADTGLPTGIPIAEATMSVKKGGQFCIE-------RKTVRDLSP------------NS

Query:  RSAS--------------------------FVMASSVEGEGKVSKPVVEVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL
        R AS                          FVM+++VEG GK SKP VEVGV HVTC  DAA  VALAAA+DLS+DACR F+ KLR+EL
Subjt:  RSAS--------------------------FVMASSVEGEGKVSKPVVEVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRREL

AT4G29310.1 Protein of unknown function (DUF1005) | e value: 1.4e-157 | identity: 66.27%
Query:  MDPCPFVRLMVESLALNLPQ--PTRPAGPGVHPSTTPCFCKIAIKNFPSQTALLPLSSISPGDSLPDSTASSSGFHLDPSSLRRLSCKPLLLCLSVFSGR
        MDPCPFVRL ++SLAL LP+    +  G  VHPS+TPC+CK+ IK+FPSQ ALLPLSS S   S P+S+ S+ GFHLD  ++RR+S K + L +SV++GR
Subjt:  MDPCPFVRLMVESLALNLPQ--PTRPAGPGVHPSTTPCFCKIAIKNFPSQTALLPLSSISPGDSLPDSTASSSGFHLDPSSLRRLSCKPLLLCLSVFSGR

Query:  MGHTCGVNSGKFLGRVRISVTLDGADSKPRVFQNGWVKLGKEDDKISARLHLVVRSQPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRT
         GHTCGV SGK LG+V ++V L  A S+   F NGW KLG + DK SARLHL+V ++PDPRFVFQFGGEPECSPVV+QIQ N++QPVFSCKFS+DRN R+
Subjt:  MGHTCGVNSGKFLGRVRISVTLDGADSKPRVFQNGWVKLGKEDDKISARLHLVVRSQPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRT

Query:  RSVTSDFSFNSTKGKWMRTFSGER--EKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRDRG
        RS+ S F++ S++G   RT SG++  +K  RERKGWMI ++DLSGSPVAAASMITPFV SPG+DRVSRSNPGAWLILRPHG  VSSWKPWGRLEAWR+RG
Subjt:  RSVTSDFSFNSTKGKWMRTFSGER--EKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRDRG

Query:  PVDGLGYKFDLVADTGLPTGIPIAEATMSVKKGGQFCIERK-TVRDLSP--NSRSASFVMASSVEGEGKVSKPVVEVGVQHVTCMADAALFVALAAAIDL
         +DGLGYKF+LV D    TGIPIAE TMS K+GG+F I+R+ + +  SP  +S    FVM SSVEGEGKVSKPVV VG QHVTCMADAALFVAL+AA+DL
Subjt:  PVDGLGYKFDLVADTGLPTGIPIAEATMSVKKGGQFCIERK-TVRDLSP--NSRSASFVMASSVEGEGKVSKPVVEVGVQHVTCMADAALFVALAAAIDL

Query:  SMDACRHFTQKLRRELCHDYQ
        S+DAC+ F++KLR+ELCHD Q
Subjt:  SMDACRHFTQKLRRELCHDYQ

AT5G17640.1 Protein of unknown function (DUF1005) | e value: 9.9e-82 | identity: 40.82%
Query:  MDPCPFVRLMVESLALNLPQPTRPAGPGVHPS---TTPCFCKIAIKNFPSQTALLPLSSISPGDSLPDSTASSSGFHLDPSSLRRLSCKPLL------LC
        MDP  F+RL V SLAL +P+    +    +     ++ C C+I ++ FP QT  +PL  +   D+ PD  + S+ F+L+ S LR L            L 
Subjt:  MDPCPFVRLMVESLALNLPQPTRPAGPGVHPS---TTPCFCKIAIKNFPSQTALLPLSSISPGDSLPDSTASSSGFHLDPSSLRRLSCKPLL------LC

Query:  LSVFSGRMGHTCGVNSGK-FLGRVRISVTLDGADSKPRVFQNGWVKLGKEDDKISARLHLVVRSQPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKF
        +SVF+G+    CGV   +  +G  ++ V  +  + KP +  NGW+ +GK     +A LHL V+  PDPR+VFQF      SP + Q++G+++QP+FSCKF
Subjt:  LSVFSGRMGHTCGVNSGK-FLGRVRISVTLDGADSKPRVFQNGWVKLGKEDDKISARLHLVVRSQPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKF

Query:  SADRNSRTRSVTSDFSFNSTKGKWMRTFSG-EREKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRL
        S DR S+          +   G W  +  G E E   RERKGW + ++DLSGS VAAA + TPFVPS G D V++SNPGAWL++RP     +SW+PWG+L
Subjt:  SADRNSRTRSVTSDFSFNSTKGKWMRTFSG-EREKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRL

Query:  EAWRDRGPVDGLGYKFDLVADTGLPTG-IPIAEATMSVKKGGQFCIERK----TVRDL---SPNS-----------RSASFVMASSVEGEGKVSKPVVEV
        EAWR+RG  D +  +F L+++ GL  G + ++E  +S +KGG+F I+      TV      SP S               FVM+S V+GEGK SKPVV++
Subjt:  EAWRDRGPVDGLGYKFDLVADTGLPTG-IPIAEATMSVKKGGQFCIERK----TVRDL---SPNS-----------RSASFVMASSVEGEGKVSKPVVEV

Query:  GVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCH
         ++HVTC+ DAA+F+ALAAA+DLS+ AC+ F +  RR   H
Subjt:  GVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCH
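
The % identity values reported for each hit summarise alignments like those above. As a rough sketch, identities and positives can be counted from a BLAST-style match line, in which a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The helper name `midline_stats` is hypothetical, and the reported percentages are presumably computed over the whole alignment, so a single segment will not reproduce them.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_columns) for one match line."""
    # Letters in the match line are identical residues.
    identities = sum(ch.isalpha() for ch in midline)
    # '+' marks a conservative substitution, counted as a positive.
    positives = identities + midline.count("+")
    # Assumption: the match line has not had trailing spaces stripped.
    return identities, positives, len(midline)

# Last segment of the KAA0061519.1 alignment listed under GenBank top hits.
segment = "CRHFTQKLRRELCHD +H S+FL"
ident, pos, cols = midline_stats(segment)
print(f"{ident}/{cols} identical, {pos}/{cols} positive")
```

For this 23-column segment the sketch counts 19 identities and 21 positives.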


Sequences
CDS sequence
ATGGATCCGTGTCCGTTTGTCCGGCTGATGGTCGAGTCGCTCGCTCTCAACCTCCCTCAGCCCACCCGACCCGCCGGCCCCGGCGTCCACCCTTCGACCACGCCGTGCTT
CTGCAAGATTGCGATCAAGAATTTCCCCTCACAAACGGCTCTTCTTCCCCTTTCCTCCATTTCCCCCGGCGACTCTCTGCCGGACTCCACCGCGTCTTCCTCCGGCTTCC
ACCTCGACCCATCTTCTCTCCGCCGCCTCTCCTGCAAGCCACTCCTCCTCTGCTTGTCGGTTTTCTCCGGCCGCATGGGCCACACGTGCGGGGTGAATTCCGGCAAATTT
CTCGGCCGGGTTCGTATCTCTGTTACGCTCGACGGCGCTGATAGTAAACCGAGAGTGTTTCAGAACGGGTGGGTGAAACTGGGTAAGGAAGACGATAAAATCTCGGCTCG
GCTGCACTTGGTTGTCCGGTCTCAACCGGACCCCCGGTTCGTGTTCCAGTTCGGCGGCGAACCGGAATGTAGCCCCGTGGTTTTCCAGATCCAAGGCAATATCAGACAGC
CGGTTTTCAGCTGCAAGTTCAGTGCGGATCGGAATTCGCGAACCCGGTCAGTGACGTCGGATTTCAGCTTCAACAGCACGAAAGGAAAATGGATGAGAACATTTTCAGGG
GAGAGAGAGAAGGGGGGTAGAGAGAGAAAGGGGTGGATGATCATGGTTTACGATCTCTCGGGGTCCCCCGTGGCGGCCGCGTCGATGATCACGCCGTTCGTCCCATCCCC
CGGCACGGACCGAGTGTCCCGGTCGAACCCGGGCGCCTGGCTCATCCTCCGCCCCCACGGCTTCTCCGTCAGCAGCTGGAAGCCCTGGGGCCGCCTCGAGGCTTGGCGCG
ACCGAGGCCCCGTCGACGGCCTCGGCTACAAGTTCGACCTGGTCGCCGACACCGGATTACCCACGGGCATCCCCATCGCGGAAGCCACCATGAGCGTTAAAAAAGGTGGC
CAGTTTTGCATCGAGCGGAAAACTGTGAGGGATTTGAGTCCCAATTCCAGATCCGCCAGTTTTGTAATGGCATCGAGCGTGGAAGGAGAAGGGAAGGTGAGCAAGCCAGT
GGTAGAAGTGGGAGTTCAGCACGTGACATGCATGGCGGATGCTGCTTTATTCGTAGCCCTTGCAGCCGCCATTGATCTAAGCATGGACGCTTGCAGACACTTCACCCAGA
AGCTCAGGAGGGAGCTCTGCCACGACTACCAACACCATTCCAACTTTCTCTGA
mRNA sequence
ATGGATCCGTGTCCGTTTGTCCGGCTGATGGTCGAGTCGCTCGCTCTCAACCTCCCTCAGCCCACCCGACCCGCCGGCCCCGGCGTCCACCCTTCGACCACGCCGTGCTT
CTGCAAGATTGCGATCAAGAATTTCCCCTCACAAACGGCTCTTCTTCCCCTTTCCTCCATTTCCCCCGGCGACTCTCTGCCGGACTCCACCGCGTCTTCCTCCGGCTTCC
ACCTCGACCCATCTTCTCTCCGCCGCCTCTCCTGCAAGCCACTCCTCCTCTGCTTGTCGGTTTTCTCCGGCCGCATGGGCCACACGTGCGGGGTGAATTCCGGCAAATTT
CTCGGCCGGGTTCGTATCTCTGTTACGCTCGACGGCGCTGATAGTAAACCGAGAGTGTTTCAGAACGGGTGGGTGAAACTGGGTAAGGAAGACGATAAAATCTCGGCTCG
GCTGCACTTGGTTGTCCGGTCTCAACCGGACCCCCGGTTCGTGTTCCAGTTCGGCGGCGAACCGGAATGTAGCCCCGTGGTTTTCCAGATCCAAGGCAATATCAGACAGC
CGGTTTTCAGCTGCAAGTTCAGTGCGGATCGGAATTCGCGAACCCGGTCAGTGACGTCGGATTTCAGCTTCAACAGCACGAAAGGAAAATGGATGAGAACATTTTCAGGG
GAGAGAGAGAAGGGGGGTAGAGAGAGAAAGGGGTGGATGATCATGGTTTACGATCTCTCGGGGTCCCCCGTGGCGGCCGCGTCGATGATCACGCCGTTCGTCCCATCCCC
CGGCACGGACCGAGTGTCCCGGTCGAACCCGGGCGCCTGGCTCATCCTCCGCCCCCACGGCTTCTCCGTCAGCAGCTGGAAGCCCTGGGGCCGCCTCGAGGCTTGGCGCG
ACCGAGGCCCCGTCGACGGCCTCGGCTACAAGTTCGACCTGGTCGCCGACACCGGATTACCCACGGGCATCCCCATCGCGGAAGCCACCATGAGCGTTAAAAAAGGTGGC
CAGTTTTGCATCGAGCGGAAAACTGTGAGGGATTTGAGTCCCAATTCCAGATCCGCCAGTTTTGTAATGGCATCGAGCGTGGAAGGAGAAGGGAAGGTGAGCAAGCCAGT
GGTAGAAGTGGGAGTTCAGCACGTGACATGCATGGCGGATGCTGCTTTATTCGTAGCCCTTGCAGCCGCCATTGATCTAAGCATGGACGCTTGCAGACACTTCACCCAGA
AGCTCAGGAGGGAGCTCTGCCACGACTACCAACACCATTCCAACTTTCTCTGA
Protein sequence
MDPCPFVRLMVESLALNLPQPTRPAGPGVHPSTTPCFCKIAIKNFPSQTALLPLSSISPGDSLPDSTASSSGFHLDPSSLRRLSCKPLLLCLSVFSGRMGHTCGVNSGKF
LGRVRISVTLDGADSKPRVFQNGWVKLGKEDDKISARLHLVVRSQPDPRFVFQFGGEPECSPVVFQIQGNIRQPVFSCKFSADRNSRTRSVTSDFSFNSTKGKWMRTFSG
EREKGGRERKGWMIMVYDLSGSPVAAASMITPFVPSPGTDRVSRSNPGAWLILRPHGFSVSSWKPWGRLEAWRDRGPVDGLGYKFDLVADTGLPTGIPIAEATMSVKKGG
QFCIERKTVRDLSPNSRSASFVMASSVEGEGKVSKPVVEVGVQHVTCMADAALFVALAAAIDLSMDACRHFTQKLRRELCHDYQHHSNFL
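
The CDS and protein sequences above can be cross-checked by translation. The following Python sketch assumes the standard nuclear genetic code and reading frame 1; the function name `translate` is hypothetical, and only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed to apply here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGATCCGTGTCCGTTTGTCCGGCTGATG"))  # first 30 nt of the CDS
print(translate("CATTCCAACTTTCTCTGA"))              # last 18 nt of the CDS
```

The two fragments translate to MDPCPFVRLM and HSNFL, matching the start and end of the protein sequence listed above.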