; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0007139 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0007139
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionProtein of unknown function (DUF707)
Genome locationtig00000084_ERROPOS800000:591984..607808
RNA-Seq ExpressionIVF0007139
SyntenyIVF0007139
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR007877 - Protein of unknown function DUF707


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004142563.1 uncharacterized protein LOC101221459 isoform X2 [Cucumis sativus]5.93e-28896.65Show/hide
Query:  MKLSGCLPLLAEQKSRHSCLCSLLPTASLLCLALFVGSVYVAPNYREKISRWGIDGLVSSKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRS
        MK SGCLPLLAEQKSR+SCLCS LPTASLLCLALFVGSVYVAP+YREKISRWGIDGLV SKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRS
Subjt:  MKLSGCLPLLAEQKSRHSCLCSLLPTASLLCLALFVGSVYVAPNYREKISRWGIDGLVSSKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRS

Query:  YQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNF
        YQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWK F+WSNRVIHVTAVNQTKWWFAKRFLHPDIVEEY+YVFLWDEDLGVDNF
Subjt:  YQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNF

Query:  DPKLYVHIIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQ
        +PKLYV II+SEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQ
Subjt:  DPKLYVHIIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQ

Query:  LGYCAQGDRTKNVGVVDSEYVIHYGRPTLGGPEENETSSNSRVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPYPETVEGKTS
        LGYCAQGDRTKNVGVVDSEYVIHYGRPTLGGPEENETSS S VKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPYPETVEGKTS
Subjt:  LGYCAQGDRTKNVGVVDSEYVIHYGRPTLGGPEENETSSNSRVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPYPETVEGKTS

XP_008443712.1 PREDICTED: uncharacterized protein LOC103487236 isoform X1 [Cucumis melo]1.79e-298100Show/hide
Query:  MKLSGCLPLLAEQKSRHSCLCSLLPTASLLCLALFVGSVYVAPNYREKISRWGIDGLVSSKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRS
        MKLSGCLPLLAEQKSRHSCLCSLLPTASLLCLALFVGSVYVAPNYREKISRWGIDGLVSSKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRS
Subjt:  MKLSGCLPLLAEQKSRHSCLCSLLPTASLLCLALFVGSVYVAPNYREKISRWGIDGLVSSKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRS

Query:  YQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNF
        YQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNF
Subjt:  YQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNF

Query:  DPKLYVHIIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQ
        DPKLYVHIIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQ
Subjt:  DPKLYVHIIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQ

Query:  LGYCAQGDRTKNVGVVDSEYVIHYGRPTLGGPEENETSSNSRVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPYPETVEGKTS
        LGYCAQGDRTKNVGVVDSEYVIHYGRPTLGGPEENETSSNSRVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPYPETVEGKTS
Subjt:  LGYCAQGDRTKNVGVVDSEYVIHYGRPTLGGPEENETSSNSRVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPYPETVEGKTS

XP_023001869.1 uncharacterized protein LOC111495920 isoform X2 [Cucurbita maxima]2.75e-27090.44Show/hide
Query:  MKLSGCLPLLAEQKSRHSCLCSLLPTASLLCLALFVGSVYVAPNYREKISRWGIDGLVSSKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRS
        MK SGCLPLLAEQKSR+S LC + P ASLLCL LFVGS YVAP+YRE+I RWGIDGLVSSKFNKCE QCRPNGSEPLPKDIVVTASNLEMRPLWGASK S
Subjt:  MKLSGCLPLLAEQKSRHSCLCSLLPTASLLCLALFVGSVYVAPNYREKISRWGIDGLVSSKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRS

Query:  YQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNF
        YQNPVNSSSN+FA AVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTA+NQTKWWFAKRFLHPDIV EY+Y+FLWDEDLGV+ F
Subjt:  YQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNF

Query:  DPKLYVHIIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQ
        +PK YVHIIESEGLEISQPALDPY+SEVHHQITARGRRSTVHRRTF+ SNGGK CDVNS APPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIH WGLDMQ
Subjt:  DPKLYVHIIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQ

Query:  LGYCAQGDRTKNVGVVDSEYVIHYGRPTLGGPEENETSSNSRVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPYPETVEGKT
        LGYCAQGDRTKNVGVVD+EY+IHYGRPTLGGPEENETSS S VKDHRADVRRQSYIELDVFRKRWQKAA+QDECWQDPYPETVE  T
Subjt:  LGYCAQGDRTKNVGVVDSEYVIHYGRPTLGGPEENETSSNSRVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPYPETVEGKT

XP_031736155.1 uncharacterized protein LOC101221459 isoform X1 [Cucumis sativus]3.18e-28291.02Show/hide
Query:  MKLSGCLPLLAEQKSRHSCLCSLLPTASLLCLALFVGSVYVAPNYREKISRWGIDGLVSSKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRS
        MK SGCLPLLAEQKSR+SCLCS LPTASLLCLALFVGSVYVAP+YREKISRWGIDGLV SKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRS
Subjt:  MKLSGCLPLLAEQKSRHSCLCSLLPTASLLCLALFVGSVYVAPNYREKISRWGIDGLVSSKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRS

Query:  YQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNF
        YQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWK F+WSNRVIHVTAVNQTKWWFAKRFLHPDIVEEY+YVFLWDEDLGVDNF
Subjt:  YQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNF

Query:  DPKLYVHIIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQ
        +PKLYV II+SEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQ
Subjt:  DPKLYVHIIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQ

Query:  LGYCAQGDRTKNVGVVDSEYVIHYGRPTLGGPEENETSSNSRVKDHRAD------------------------VRRQSYIELDVFRKRWQKAAEQDECWQ
        LGYCAQGDRTKNVGVVDSEYVIHYGRPTLGGPEENETSS S VKDHRAD                        VRRQSYIELDVFRKRWQKAAEQDECWQ
Subjt:  LGYCAQGDRTKNVGVVDSEYVIHYGRPTLGGPEENETSSNSRVKDHRAD------------------------VRRQSYIELDVFRKRWQKAAEQDECWQ

Query:  DPYPETVEGKTS
        DPYPETVEGKTS
Subjt:  DPYPETVEGKTS

XP_038880303.1 uncharacterized protein LOC120071938 isoform X1 [Benincasa hispida]2.59e-28293.56Show/hide
Query:  MKLSGCLPLLAEQKSRHSCLCSLLPTASLLCLALFVGSVYVAPNYREKISRWGIDGLVSSKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRS
        MK SGCLPLL+EQKSR+SCLCSLLPTASL+CL LFVGSVYVAP+YREKISRWGIDGLV SKFNKCE QCRPNGSEPLPKDIVVTASNLEMRPLWGASKRS
Subjt:  MKLSGCLPLLAEQKSRHSCLCSLLPTASLLCLALFVGSVYVAPNYREKISRWGIDGLVSSKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRS

Query:  YQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNF
        YQNPVNSS N+FAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEW++FDWSNRV+HVTAVNQTKWWFAKRFLHPDIV EY+Y+FLWDEDLGVDNF
Subjt:  YQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNF

Query:  DPKLYVHIIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQ
        +P+ YVHII+SEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCD NSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQ
Subjt:  DPKLYVHIIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQ

Query:  LGYCAQGDRTKNVGVVDSEYVIHYGRPTLGGPEENETSSNSRVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPYPETVEGKTS
        LGYCAQGDRTKNVGVVDSEYVIHYGRPTLGGPEENETSS S VKDHRADVRRQSYIELDVFRKRWQKAAEQDECW DPYPETVEG TS
Subjt:  LGYCAQGDRTKNVGVVDSEYVIHYGRPTLGGPEENETSSNSRVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPYPETVEGKTS

TrEMBL top hitse value%identityAlignment
A0A0A0M0M3 Uncharacterized protein4.4e-22596.65Show/hide
Query:  MKLSGCLPLLAEQKSRHSCLCSLLPTASLLCLALFVGSVYVAPNYREKISRWGIDGLVSSKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRS
        MK SGCLPLLAEQKSR+SCLCS LPTASLLCLALFVGSVYVAP+YREKISRWGIDGLV SKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRS
Subjt:  MKLSGCLPLLAEQKSRHSCLCSLLPTASLLCLALFVGSVYVAPNYREKISRWGIDGLVSSKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRS

Query:  YQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNF
        YQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWK F+WSNRVIHVTAVNQTKWWFAKRFLHPDIVEEY+YVFLWDEDLGVDNF
Subjt:  YQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNF

Query:  DPKLYVHIIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQ
        +PKLYV II+SEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQ
Subjt:  DPKLYVHIIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQ

Query:  LGYCAQGDRTKNVGVVDSEYVIHYGRPTLGGPEENETSSNSRVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPYPETVEGKTS
        LGYCAQGDRTKNVGVVDSEYVIHYGRPTLGGPEENETSS S VKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPYPETVEGKTS
Subjt:  LGYCAQGDRTKNVGVVDSEYVIHYGRPTLGGPEENETSSNSRVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPYPETVEGKTS

A0A1S3B872 uncharacterized protein LOC103487236 isoform X14.4e-233100Show/hide
Query:  MKLSGCLPLLAEQKSRHSCLCSLLPTASLLCLALFVGSVYVAPNYREKISRWGIDGLVSSKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRS
        MKLSGCLPLLAEQKSRHSCLCSLLPTASLLCLALFVGSVYVAPNYREKISRWGIDGLVSSKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRS
Subjt:  MKLSGCLPLLAEQKSRHSCLCSLLPTASLLCLALFVGSVYVAPNYREKISRWGIDGLVSSKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRS

Query:  YQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNF
        YQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNF
Subjt:  YQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNF

Query:  DPKLYVHIIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQ
        DPKLYVHIIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQ
Subjt:  DPKLYVHIIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQ

Query:  LGYCAQGDRTKNVGVVDSEYVIHYGRPTLGGPEENETSSNSRVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPYPETVEGKTS
        LGYCAQGDRTKNVGVVDSEYVIHYGRPTLGGPEENETSSNSRVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPYPETVEGKTS
Subjt:  LGYCAQGDRTKNVGVVDSEYVIHYGRPTLGGPEENETSSNSRVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPYPETVEGKTS

A0A1S3B8N0 uncharacterized protein LOC103487236 isoform X24.4e-20992.27Show/hide
Query:  MKLSGCLPLLAEQKSRHSCLCSLLPTASLLCLALFVGSVYVAPNYREKISRWGIDGLVSSKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRS
        MKLSGCLPLLAEQKSRHSCLCSLLPTASLLCLALFVGSVYVAPNYREKISRWGIDGLVSSKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRS
Subjt:  MKLSGCLPLLAEQKSRHSCLCSLLPTASLLCLALFVGSVYVAPNYREKISRWGIDGLVSSKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRS

Query:  YQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNF
        YQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNF
Subjt:  YQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNF

Query:  DPKLYVHIIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQ
        DPKLYVHIIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQ
Subjt:  DPKLYVHIIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQ

Query:  LGYCAQGDRTKNVGVVDSEYVIHYGRPTLGGPEENETSSNSRVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPYPETVEGKTS
        LGYCAQ                              TSSNSRVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPYPETVEGKTS
Subjt:  LGYCAQGDRTKNVGVVDSEYVIHYGRPTLGGPEENETSSNSRVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPYPETVEGKTS

A0A6J1F2L3 uncharacterized protein LOC111439192 isoform X21.1e-21090.44Show/hide
Query:  MKLSGCLPLLAEQKSRHSCLCSLLPTASLLCLALFVGSVYVAPNYREKISRWGIDGLVSSKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRS
        MK SGCLPLLAEQKSR+S LC + PT SLLCL LFVGS YVAP+YRE+I RWGIDGLVSSKFNKCE QCRPNGSEPLPKDIVVTASNLEMRPLWGASK S
Subjt:  MKLSGCLPLLAEQKSRHSCLCSLLPTASLLCLALFVGSVYVAPNYREKISRWGIDGLVSSKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRS

Query:  YQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNF
        YQNPVNSSSN+FA AVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTA+NQTKWWFAKRFLHPDIV EY+YVFLWDEDLGV+ F
Subjt:  YQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNF

Query:  DPKLYVHIIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQ
        +PK YVHII SEGLEISQPALDPY+SEVHHQITARGRRSTVHRRTF+ SNGGK CDVNS APPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIH WGLDMQ
Subjt:  DPKLYVHIIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQ

Query:  LGYCAQGDRTKNVGVVDSEYVIHYGRPTLGGPEENETSSNSRVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPYPETVEGKT
        LGYCAQGDRTKNVGVVD+EY+IHYGRPTLGGPEENETSS S VKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPY ETVE  T
Subjt:  LGYCAQGDRTKNVGVVDSEYVIHYGRPTLGGPEENETSSNSRVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPYPETVEGKT

A0A6J1KNU9 uncharacterized protein LOC111495920 isoform X21.2e-21190.44Show/hide
Query:  MKLSGCLPLLAEQKSRHSCLCSLLPTASLLCLALFVGSVYVAPNYREKISRWGIDGLVSSKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRS
        MK SGCLPLLAEQKSR+S LC + P ASLLCL LFVGS YVAP+YRE+I RWGIDGLVSSKFNKCE QCRPNGSEPLPKDIVVTASNLEMRPLWGASK S
Subjt:  MKLSGCLPLLAEQKSRHSCLCSLLPTASLLCLALFVGSVYVAPNYREKISRWGIDGLVSSKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRS

Query:  YQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNF
        YQNPVNSSSN+FA AVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTA+NQTKWWFAKRFLHPDIV EY+Y+FLWDEDLGV+ F
Subjt:  YQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNF

Query:  DPKLYVHIIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQ
        +PK YVHIIESEGLEISQPALDPY+SEVHHQITARGRRSTVHRRTF+ SNGGK CDVNS APPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIH WGLDMQ
Subjt:  DPKLYVHIIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQ

Query:  LGYCAQGDRTKNVGVVDSEYVIHYGRPTLGGPEENETSSNSRVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPYPETVEGKT
        LGYCAQGDRTKNVGVVD+EY+IHYGRPTLGGPEENETSS S VKDHRADVRRQSYIELDVFRKRWQKAA+QDECWQDPYPETVE  T
Subjt:  LGYCAQGDRTKNVGVVDSEYVIHYGRPTLGGPEENETSSNSRVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPYPETVEGKT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G11170.1 Protein of unknown function (DUF707)2.8e-9953.05Show/hide
Query:  LPKDIVVTASNLEMRPLWGASKRSYQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRF
        LP+ I+ + S+LE++PLW       +    ++ N+ A+ VG+KQK  V+ +V KFL ++F ++LFHYDG +D+W D +WS++ IH+ A NQTKWWFAKRF
Subjt:  LPKDIVVTASNLEMRPLWGASKRSYQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRF

Query:  LHPDIVEEYDYVFLWDEDLGVDNFDPKLYVHIIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFS
        LHPD+V  YDY+FLWDEDLGV+NF+P+ Y+ I++S GLEISQPALD   +E+HH+IT R +    HRR +  + G K C   S+ PPCTG++E MAPVFS
Subjt:  LHPDIVEEYDYVFLWDEDLGVDNFDPKLYVHIIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFS

Query:  RAAWRCVWYMIQNDLIHAWGLDMQLGYCAQGDRTKNVGVVDSEYVIHYGRPTLGG--PEENETSSNSRVK-------DHRADVRRQSYIELDVFRKRWQK
        +AAW C W +IQNDL+H WG+DM+LGYCAQGDRTKNVG+VDSEY++H G  TLG   PE+ +T+ +   +       D R ++RRQS  EL  F++RW K
Subjt:  RAAWRCVWYMIQNDLIHAWGLDMQLGYCAQGDRTKNVGVVDSEYVIHYGRPTLGG--PEENETSSNSRVK-------DHRADVRRQSYIELDVFRKRWQK

Query:  AAEQDECWQDP
        A E+D  W DP
Subjt:  AAEQDECWQDP

AT1G61240.1 Protein of unknown function (DUF707)4.1e-9853.72Show/hide
Query:  LPKDIVVTASNLEMRPLWGASKRSYQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRF
        LP  I+   S+LE++PLW +S    ++   ++ N+ AM VG+KQKD V+ +V KFL ++F V+LFHYDG +D+W D +WS++ IH+ A NQTKWWFAKRF
Subjt:  LPKDIVVTASNLEMRPLWGASKRSYQNPVNSSSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRF

Query:  LHPDIVEEYDYVFLWDEDLGVDNFDPKLYVHIIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFS
        LHPDIV  YDYVFLWDEDLGV+NF+P+ Y+ I+++ GLEISQPAL P  +EVHH+IT R R    HRR +  S G   C   S  PPCTG++E MAPVFS
Subjt:  LHPDIVEEYDYVFLWDEDLGVDNFDPKLYVHIIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFS

Query:  RAAWRCVWYMIQNDLIHAWGLDMQLGYCAQGDRTKNVGVVDSEYVIHYGRPTLGG---PEENETSSN-------SRVKDHRADVRRQSYIELDVFRKRWQ
        R+AW C W +IQNDL+H WG+DM+LGYCAQGDR+K VG+VDSEY+ H G  TLGG   P++  ++ +       S   D R ++RRQS  EL  F++RW 
Subjt:  RAAWRCVWYMIQNDLIHAWGLDMQLGYCAQGDRTKNVGVVDSEYVIHYGRPTLGG---PEENETSSN-------SRVKDHRADVRRQSYIELDVFRKRWQ

Query:  KAAEQDECW
        +A  +D+ W
Subjt:  KAAEQDECW

AT4G12840.1 Protein of unknown function (DUF707)1.5e-12455.56Show/hide
Query:  LLAEQKSRHSCLCSLLPTASLLCLALFVGSVYVAPNYREKISRW-GIDGLVSSKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRSYQNPVNS
        L   QK +   L  L      L +   +G+ ++  +Y+E I+ W  I  L  +K   C+ Q RP GSE LP+ IV + S+LEMRPLWGA +     P   
Subjt:  LLAEQKSRHSCLCSLLPTASLLCLALFVGSVYVAPNYREKISRW-GIDGLVSSKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRSYQNPVNS

Query:  SSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNFDPKLYVH
          ++ AMAVGI+QK+ VNK+V KF SS+F VMLFHYDG VDEWK+F+WS+  IH++ VNQTKWWFAKRFLHPDIV  Y Y+FLWDEDLGVD+FD + YV 
Subjt:  SSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNFDPKLYVH

Query:  IIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQLGYCAQG
        II+ E LEISQPALDP  SEVHHQ+T+R ++S VHRRT++   G   C+ NST PPCTG++EMMAPVFSRAAWRC W+MIQNDL H WG+D QLGYCAQG
Subjt:  IIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQLGYCAQG

Query:  DRTKNVGVVDSEYVIHYGRPTL-GGPEENETSS--------------NSRVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPY
        DRTKN+G+VDSEY++H G PTL GG  EN+T S              +S V   R +VR+Q+Y+EL+ F+ RW+ A + DECW D +
Subjt:  DRTKNVGVVDSEYVIHYGRPTL-GGPEENETSS--------------NSRVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPY

AT4G12840.2 Protein of unknown function (DUF707)1.5e-12455.56Show/hide
Query:  LLAEQKSRHSCLCSLLPTASLLCLALFVGSVYVAPNYREKISRW-GIDGLVSSKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRSYQNPVNS
        L   QK +   L  L      L +   +G+ ++  +Y+E I+ W  I  L  +K   C+ Q RP GSE LP+ IV + S+LEMRPLWGA +     P   
Subjt:  LLAEQKSRHSCLCSLLPTASLLCLALFVGSVYVAPNYREKISRW-GIDGLVSSKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRSYQNPVNS

Query:  SSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNFDPKLYVH
          ++ AMAVGI+QK+ VNK+V KF SS+F VMLFHYDG VDEWK+F+WS+  IH++ VNQTKWWFAKRFLHPDIV  Y Y+FLWDEDLGVD+FD + YV 
Subjt:  SSNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNFDPKLYVH

Query:  IIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQLGYCAQG
        II+ E LEISQPALDP  SEVHHQ+T+R ++S VHRRT++   G   C+ NST PPCTG++EMMAPVFSRAAWRC W+MIQNDL H WG+D QLGYCAQG
Subjt:  IIESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQLGYCAQG

Query:  DRTKNVGVVDSEYVIHYGRPTL-GGPEENETSS--------------NSRVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPY
        DRTKN+G+VDSEY++H G PTL GG  EN+T S              +S V   R +VR+Q+Y+EL+ F+ RW+ A + DECW D +
Subjt:  DRTKNVGVVDSEYVIHYGRPTL-GGPEENETSS--------------NSRVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPY

AT4G18530.1 Protein of unknown function (DUF707)1.1e-12756.92Show/hide
Query:  SCLCSLLPTASLLCLALFVGSVYVAPNYREKISRWGIDGLVSSKFNK---------CEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRSYQNPVNSS
        SCLCS+L T +L+C A F+ + Y+A +++EK+ +W I   + +  +K         C+   +P G+E LP+ I+   SNLE + LW       + P N S
Subjt:  SCLCSLLPTASLLCLALFVGSVYVAPNYREKISRWGIDGLVSSKFNK---------CEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRSYQNPVNSS

Query:  SNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNFDPKLYVHI
         ++ AMAVGIKQK+LVNK++ KF   DFAVMLFHYDG+VD+WK + W+N  IHV+ +NQTKWWFAKRFLHPDIV EY+Y+FLWDEDLGV +F+P+ Y+ I
Subjt:  SNIFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNFDPKLYVHI

Query:  IESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQLGYCAQGD
        ++ EGLEISQPALD  KSEVHH ITAR ++S VHRR ++    G+ CD +ST PPC GW+EMMAPVFSRAAWRC WYMIQNDLIHAWGLD QLGYCAQGD
Subjt:  IESEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQLGYCAQGD

Query:  RTKNVGVVDSEYVIHYGRPTLGGPE------ENETSS------NSRVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPY
        R KNVGVVD+EY+IHYG PTLG  E       NET S       SR  D+R +VR +S++E+  F++RW+KA   D CW DPY
Subjt:  RTKNVGVVDSEYVIHYGRPTLGGPE------ENETSS------NSRVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCTCTCCGGTTGTTTGCCATTGCTTGCAGAGCAAAAAAGTAGGCATTCCTGTCTTTGTAGCCTTCTTCCAACTGCTTCTTTGCTTTGTCTTGCATTGTTTGTGGG
GAGTGTATATGTAGCACCCAACTATAGGGAGAAAATATCTAGATGGGGAATAGATGGTTTAGTGAGTTCAAAGTTCAATAAATGTGAGAAACAATGTAGGCCAAATGGAA
GTGAGCCACTACCTAAAGACATTGTTGTCACTGCATCTAACTTGGAAATGCGACCGCTGTGGGGCGCGTCAAAGCGTTCTTATCAGAATCCCGTTAACTCATCAAGTAAC
ATATTCGCCATGGCCGTTGGGATTAAACAAAAAGATCTTGTGAATAAAATGGTGACAAAGTTTCTTTCTAGCGACTTTGCTGTGATGCTTTTCCATTATGATGGTATTGT
GGACGAGTGGAAGGATTTTGATTGGAGTAACCGCGTAATACACGTAACTGCAGTCAATCAAACTAAGTGGTGGTTTGCCAAGCGCTTCTTACATCCTGATATTGTAGAAG
AATATGATTACGTCTTTCTTTGGGATGAGGACCTTGGAGTTGACAATTTCGATCCAAAACTGTATGTACATATTATTGAAAGTGAAGGGCTAGAGATTTCACAACCTGCA
CTTGATCCATACAAATCAGAGGTACACCATCAAATTACTGCACGCGGGAGGCGATCAACAGTCCACAGGAGAACTTTTAGACCTAGTAATGGTGGAAAGGGTTGTGATGT
CAACAGTACAGCCCCTCCATGCACTGGATGGATAGAAATGATGGCCCCTGTTTTTTCCCGAGCTGCATGGCGTTGTGTTTGGTATATGATCCAAAATGATTTGATCCATG
CTTGGGGCTTGGATATGCAACTGGGATACTGTGCTCAGGGTGATCGAACAAAGAATGTTGGTGTCGTTGACTCTGAATACGTAATCCATTACGGAAGACCTACACTAGGT
GGTCCAGAAGAAAATGAGACATCTTCCAATTCTCGTGTTAAGGATCATAGAGCTGATGTAAGAAGACAGTCCTATATCGAACTAGATGTATTTAGAAAAAGATGGCAAAA
GGCCGCTGAGCAAGACGAGTGTTGGCAAGATCCTTACCCAGAGACAGTGGAGGGTAAAACTTCATAA
mRNA sequenceShow/hide mRNA sequence
AAAAAGGAAAAGGAAAAGGAACTGAAGAATCCCTTTCATTTGCTCTCTTTTGAAACTGCTTTTCTTTCTCTCTCTATAATGACTTTCCTTCTCTCCCACGTCTGCTTTTA
AATTCATTTCCCCAAGAGGGTATCTAAATTTTGGATGTTATTGCCGAGTGTTTTCCGGTAAACGTTGTTGGGTATTGAAATTTGGGCAGTTTTTCTCTCTTTTACTAACC
ATCTCTCTGCAATTTCATGGAAATATGAAGCTCTCCGGTTGTTTGCCATTGCTTGCAGAGCAAAAAAGTAGGCATTCCTGTCTTTGTAGCCTTCTTCCAACTGCTTCTTT
GCTTTGTCTTGCATTGTTTGTGGGGAGTGTATATGTAGCACCCAACTATAGGGAGAAAATATCTAGATGGGGAATAGATGGTTTAGTGAGTTCAAAGTTCAATAAATGTG
AGAAACAATGTAGGCCAAATGGAAGTGAGCCACTACCTAAAGACATTGTTGTCACTGCATCTAACTTGGAAATGCGACCGCTGTGGGGCGCGTCAAAGCGTTCTTATCAG
AATCCCGTTAACTCATCAAGTAACATATTCGCCATGGCCGTTGGGATTAAACAAAAAGATCTTGTGAATAAAATGGTGACAAAGTTTCTTTCTAGCGACTTTGCTGTGAT
GCTTTTCCATTATGATGGTATTGTGGACGAGTGGAAGGATTTTGATTGGAGTAACCGCGTAATACACGTAACTGCAGTCAATCAAACTAAGTGGTGGTTTGCCAAGCGCT
TCTTACATCCTGATATTGTAGAAGAATATGATTACGTCTTTCTTTGGGATGAGGACCTTGGAGTTGACAATTTCGATCCAAAACTGTATGTACATATTATTGAAAGTGAA
GGGCTAGAGATTTCACAACCTGCACTTGATCCATACAAATCAGAGGTACACCATCAAATTACTGCACGCGGGAGGCGATCAACAGTCCACAGGAGAACTTTTAGACCTAG
TAATGGTGGAAAGGGTTGTGATGTCAACAGTACAGCCCCTCCATGCACTGGATGGATAGAAATGATGGCCCCTGTTTTTTCCCGAGCTGCATGGCGTTGTGTTTGGTATA
TGATCCAAAATGATTTGATCCATGCTTGGGGCTTGGATATGCAACTGGGATACTGTGCTCAGGGTGATCGAACAAAGAATGTTGGTGTCGTTGACTCTGAATACGTAATC
CATTACGGAAGACCTACACTAGGTGGTCCAGAAGAAAATGAGACATCTTCCAATTCTCGTGTTAAGGATCATAGAGCTGATGTAAGAAGACAGTCCTATATCGAACTAGA
TGTATTTAGAAAAAGATGGCAAAAGGCCGCTGAGCAAGACGAGTGTTGGCAAGATCCTTACCCAGAGACAGTGGAGGGTAAAACTTCATAAGATCATAATTCCAACGATT
TTATAACATTATACATCGTCATTGAAGTTGAGAATGGGGCAACTTACACGCCCATTGCTATTCAAGGATGAGGAGGAAAGAATGATTGTATATCAGCAATGAAATCGGGT
ACGGATTCTGATCTTATCGTGTTCTGTTCTGTATTTAAAATGCTGTGACAGTTTCAACTTGTTGTGTTGAAATGTGTTTTAAGTGTTGCTGCTGGGGGTATAGAGTTGTA
TTGTAAGTAAGTTAGTTTACTTTTAGGTGAACACCTTTCTTCCTATCTTTTATACACACTTCACCTTCAATCCCCTTGCCTTTCGGAAAAGAATATGTTTAAATACTATT
TCAGTAGGTTAATTTCAATTTGAAA
Protein sequenceShow/hide protein sequence
MKLSGCLPLLAEQKSRHSCLCSLLPTASLLCLALFVGSVYVAPNYREKISRWGIDGLVSSKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRSYQNPVNSSSN
IFAMAVGIKQKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKDFDWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYDYVFLWDEDLGVDNFDPKLYVHIIESEGLEISQPA
LDPYKSEVHHQITARGRRSTVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQLGYCAQGDRTKNVGVVDSEYVIHYGRPTLG
GPEENETSSNSRVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPYPETVEGKTS