; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0014776 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0014776
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionDNA glycosylase superfamily protein
Genome locationchr01:26849832..26852970
RNA-Seq ExpressionPI0014776
SyntenyPI0014776
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domainsIPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136097.2 uncharacterized protein LOC101205558 [Cucumis sativus]2.2e-20498.38Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQVVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQ VESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQVVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGTLAELTWP
        SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSG LAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGTLAELTWP

Query:  AILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
        AILNKRHLFREIFLDFDPT+VSKLNEKKMVAPGSAATSLLSELK+RAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
Subjt:  AILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK

Query:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERD-GEMKLNPNEKMPEALKNLEL
        DLVKRGFRSVGPTVIYTFMQVAGLTNDHLI CFRFTECIETQTAEKGERD GEMKLNPNEKMPEALKNLEL
Subjt:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERD-GEMKLNPNEKMPEALKNLEL

XP_008461179.1 PREDICTED: probable GMP synthase [glutamine-hydrolyzing] [Cucumis melo]1.5e-20598.92Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQVVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQ VESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQVVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGTLAELTWP
        SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVV DTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSG LAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGTLAELTWP

Query:  AILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
        AILNKRHLFREIFLDFDPT VSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
Subjt:  AILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK

Query:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL
        DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL
Subjt:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL

XP_022155202.1 uncharacterized protein LOC111022341 [Momordica charantia]5.1e-18591.62Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQVVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VE RKPG KPLKKLEKP Q  ESKDKRVPLSPPQCV+VPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQVVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGTLAELTWP
        SRASSARGTRQRGPNLRRKQ STVK A+KAVEKVGVESV VVVDTV  LE KKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSG LAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGTLAELTWP

Query:  AILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
        AILNKRHLFREIFLDFDP +VSKLNEKKMVA GSAATSLLSELK+RAIIENGRQMCKVIDEFGSF+VY+WNFVNHKPIISQFRYPRQVPDKTSKAEVISK
Subjt:  AILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK

Query:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL
        DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRF ECIE  TAE+GE+DGE+K   NEK+PEALKNLEL
Subjt:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL

XP_023514420.1 uncharacterized protein LOC111778684 [Cucurbita pepo subsp. pepo]1.5e-18189.22Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQVVESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRK GVKPLKKLEKP Q  ESKDKRVPLSPPQCV TVPSVLRQQDRHQAIL LSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQVVESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGTLAELTW
        NSRASSARGTRQRGPNLRRK  S+VK A+KAVEKVG ESV  V +TVGCLE KKRCAWVT NTDPCYAAFHDEEWGVPVHDDKKLFELLCLSG LAELTW
Subjt:  NSRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGTLAELTW

Query:  PAILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVIS
        P IL KRHLFRE FLDFDP +VSKLNEKKMVAPGSAATSLLSE K+RAIIENGRQMCKVIDEFGSFNVY+WNFVNHKP ISQFRYPRQVPDKTSKAEVIS
Subjt:  PAILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVIS

Query:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL
        KDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+SCFRF ECIE  T EKGERDG++K    EK+PEALKNLEL
Subjt:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL

XP_038900164.1 probable GMP synthase [glutamine-hydrolyzing] [Benincasa hispida]4.9e-19694.91Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQVVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VETRKPGVKPLKKLEKPRQ VESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQVVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVES---VAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGTLAEL
        SRASSARGTRQRGPNLRRKQ STVKGA K+VEKVGVES   VAVV DTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSG LAEL
Subjt:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVES---VAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGTLAEL

Query:  TWPAILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEV
        TWPAILNKR+LFREIFLDFDP  VSKLNEKKMVAPGSAATSLLSELK+RAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEV
Subjt:  TWPAILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEV

Query:  ISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL
        ISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRF ECIETQTAEKGE+DGE+KL  NEKMPEALKNLEL
Subjt:  ISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL

TrEMBL top hitse value%identityAlignment
A0A0A0K8L6 Uncharacterized protein1.1e-20498.38Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQVVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQ VESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQVVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGTLAELTWP
        SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSG LAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGTLAELTWP

Query:  AILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
        AILNKRHLFREIFLDFDPT+VSKLNEKKMVAPGSAATSLLSELK+RAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
Subjt:  AILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK

Query:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERD-GEMKLNPNEKMPEALKNLEL
        DLVKRGFRSVGPTVIYTFMQVAGLTNDHLI CFRFTECIETQTAEKGERD GEMKLNPNEKMPEALKNLEL
Subjt:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERD-GEMKLNPNEKMPEALKNLEL

A0A1S3CE52 probable GMP synthase [glutamine-hydrolyzing]7.4e-20698.92Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQVVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQ VESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQVVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGTLAELTWP
        SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVV DTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSG LAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGTLAELTWP

Query:  AILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
        AILNKRHLFREIFLDFDPT VSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
Subjt:  AILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK

Query:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL
        DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL
Subjt:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL

A0A5A7UYZ9 Putative GMP synthase7.4e-20698.92Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQVVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQ VESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQVVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGTLAELTWP
        SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVV DTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSG LAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGTLAELTWP

Query:  AILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
        AILNKRHLFREIFLDFDPT VSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
Subjt:  AILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK

Query:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL
        DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL
Subjt:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL

A0A6J1DNQ3 uncharacterized protein LOC1110223412.5e-18591.62Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQVVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VE RKPG KPLKKLEKP Q  ESKDKRVPLSPPQCV+VPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQVVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGTLAELTWP
        SRASSARGTRQRGPNLRRKQ STVK A+KAVEKVGVESV VVVDTV  LE KKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSG LAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGTLAELTWP

Query:  AILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
        AILNKRHLFREIFLDFDP +VSKLNEKKMVA GSAATSLLSELK+RAIIENGRQMCKVIDEFGSF+VY+WNFVNHKPIISQFRYPRQVPDKTSKAEVISK
Subjt:  AILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK

Query:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL
        DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRF ECIE  TAE+GE+DGE+K   NEK+PEALKNLEL
Subjt:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL

A0A6J1H7A2 uncharacterized protein LOC1114610811.7e-18188.95Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQVVESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRK GVKPLKKLEKP Q  ESKDKRVPLSPPQCV TVPSVLRQQDRHQAIL LSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQVVESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGTLAELTW
        NSRASSARGTRQRGPNLRRK  STVK A+KAVEKVG ESV    +TVGCLE KKRCAWVT NTDPCYAAFHDEEWGVPVHDDKKLFELLCLSG LAELTW
Subjt:  NSRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGTLAELTW

Query:  PAILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVIS
        P IL KRHLFRE FLDFDP +VSKLNEKKMVAPGSAATSLLSE K+RAIIENGRQMCKVIDEFGSFNVY+WNFVNHKP ISQFRYPRQVPDKTSKA+VIS
Subjt:  PAILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVIS

Query:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL
        KDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+SCFRF ECIE  T EKGERDG++K    EK+PEALKNLEL
Subjt:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL

SwissProt top hitse value%identityAlignment
P05100 DNA-3-methyladenine glycosylase 15.8e-3842.22Show/hide
Query:  KRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGTLAELTWPAILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENG
        +RC WV  + DP Y A+HD EWGVP  D KKLFE++CL G  A L+W  +L KR  +R  F  FDP  V+ + E+ +      A  +    KI+AII N 
Subjt:  KRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGTLAELTWPAILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENG

Query:  RQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISC
        R   ++      F  ++W+FVNH+P ++Q     ++P  TS ++ +SK L KRGF+ VG T+ Y+FMQ  GL NDH++ C
Subjt:  RQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISC

P44321 DNA-3-methyladenine glycosylase1.2e-3239.11Show/hide
Query:  RCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGTLAELTWPAILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGR
        RC WV       Y  +HD+EWG P  D +KLFE +CL G  A L+W  +L KR  +RE F  FDP  ++K+    + A    +  +    K+ AI++N +
Subjt:  RCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGTLAELTWPAILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGR

Query:  QMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISC
            +     +F+ ++W+FVNHKPI++     R VP KT  ++ +SK L KRGF  +G T  Y FMQ  GL +DHL  C
Subjt:  QMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing]6.8e-3943.15Show/hide
Query:  LESKKRCAWVTPNTDPC---YAAFHDEEWGVPVHDDKKLFELLCLSGTLAELTWPAILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKI
        +  K RCAW T   +     Y  +HD EWG P+H+DKKLFE L L G  A L+W  IL KR  FR  F DFDP  V+  +E K+         + +  KI
Subjt:  LESKKRCAWVTPNTDPC---YAAFHDEEWGVPVHDDKKLFELLCLSGTLAELTWPAILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKI

Query:  RAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQ
         A I N +    V  EFGSF+ Y+W FV  KPII+ F     +P  T  ++ I+KDL KRGF+ VG T +Y  MQ  G+ NDHL SCF+    +  Q
Subjt:  RAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQ

Arabidopsis top hitse value%identityAlignment
AT1G15970.1 DNA glycosylase superfamily protein3.8e-9353.24Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKP---RQVVESKDKR-----VPLSP----PQCVTV-PSVLRQQDRHQAILNLSMN
        MS PPR RS+N  + + R VLGPTGNK +    RKP   P  KLEKP   + +++SKD++      P SP     QC ++  S+LR+        + SM 
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKP---RQVVESKDKR-----VPLSP----PQCVTV-PSVLRQQDRHQAILNLSMN

Query:  ASCSSDASSDSFNSRASSARGTRQRGPNLRRKQCSTVK--GADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLF
        AS SSDASS   +S  S A  +  +    R    S+ +     K  EKV  +  A         + +KRCAW+TP  DPCY AFHDEEWGVPVHDDKKLF
Subjt:  ASCSSDASSDSFNSRASSARGTRQRGPNLRRKQCSTVK--GADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLF

Query:  ELLCLSGTLAELTWPAILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYP
        ELLCLSG LAEL+W  IL++RH+ RE+F+DFDP +V++LN+KK+ APG+AA SLLSE+KIR+I++N R + K+I E GS   YMWNFVN+KP  SQFRY 
Subjt:  ELLCLSGTLAELTWPAILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYP

Query:  RQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTEC---IETQTAEKGERDGE
        RQVP KTSKAE ISKDLV+RGFRSV PTVIY+FMQ AGLTNDHLI CFR+ +C    ET T  K ++  E
Subjt:  RQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTEC---IETQTAEKGERDGE

AT1G75090.1 DNA glycosylase superfamily protein3.5e-6743.44Show/hide
Query:  PLKKLEKPRQVVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVG--VE
        P+K +++ R ++ S   R  ++  +    P +  +  +  A      N S S+D SS S +S   S+  T   G         T       VEK+   V 
Subjt:  PLKKLEKPRQVVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVG--VE

Query:  SVAVVVDTVGCLESK-KRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGTLAELTWPAILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAA
        SVAVV D    +    KRC W+TPN+DP Y  FHDEEWGVPV DDKKLFELL  S  LAE +WP+IL +R  FR++F +FDP+++++  EK++++     
Subjt:  SVAVVVDTVGCLESK-KRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGTLAELTWPAILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAA

Query:  TSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFT
          +LSE K+RAI+EN + + KV  EFGSF+ Y W FVNHKP+ + +RY RQVP K+ KAE ISKD+++RGFR VGPTV+Y+F+Q +G+ NDHL +CFR+ 
Subjt:  TSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFT

Query:  ECIETQTAEKGERDGEMKLN
        EC      E    + E KL+
Subjt:  ECIETQTAEKGERDGEMKLN

AT1G80850.1 DNA glycosylase superfamily protein7.3e-8953.39Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQVVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MS PPR+RS++ +D + R VLGP GNK +     KP  KP+ +  K     E   +  PLSPP       +LR+         +SM AS SSDASS S  
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQVVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGTLAELTWP
        S   S   T      LRR    +V  +      +  E      D     + +KRCAW+TP +D CY AFHDEEWGVPVHDDK+LFELL LSG LAEL+W 
Subjt:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGTLAELTWP

Query:  AILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
         IL+KR LFRE+F+DFDP ++S+L  KK+ +P  AAT+LLSE K+R+I+EN  Q+CK+I  FGSF+ Y+WNFVN KP  SQFRYPRQVP KTSKAE+ISK
Subjt:  AILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK

Query:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECI
        DLV+RGFRSV PTVIY+FMQ AGLTNDHL  CFR  +C+
Subjt:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECI

AT5G57970.1 DNA glycosylase superfamily protein1.9e-9754.42Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQVVESKDKRVPLSPP----------QCVTVPSVLRQQDRHQAIL--NLSMNA
        MSG PR++SMNVA++++R  LG T  KA    T K   K L+KLE+        D++   + P            +   S+LR   RH+  L  NLS+NA
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQVVESKDKRVPLSPP----------QCVTVPSVLRQQDRHQAIL--NLSMNA

Query:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELL
        S SSDAS DSF+SRAS+ R  R      R K   +        +   V S   +       E+KKRC WVTPN+DPCY  FHDEEWGVPVHDDK+LFELL
Subjt:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELL

Query:  CLSGTLAELTWPAILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQV
         LSG LAE TWP IL+KR  FRE+F DFDP ++ K+NEKK++ PGS A++LLS+LK+RA+IEN RQ+ KVI+E+GSF+ Y+W+FV +K I+S+FRY RQV
Subjt:  CLSGTLAELTWPAILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQV

Query:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECI
        P KT KAEVISKDLV+RGFRSVGPTV+Y+FMQ AG+TNDHL SCFRF  CI
Subjt:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECI

AT5G57970.2 DNA glycosylase superfamily protein1.9e-9754.42Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQVVESKDKRVPLSPP----------QCVTVPSVLRQQDRHQAIL--NLSMNA
        MSG PR++SMNVA++++R  LG T  KA    T K   K L+KLE+        D++   + P            +   S+LR   RH+  L  NLS+NA
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQVVESKDKRVPLSPP----------QCVTVPSVLRQQDRHQAIL--NLSMNA

Query:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELL
        S SSDAS DSF+SRAS+ R  R      R K   +        +   V S   +       E+KKRC WVTPN+DPCY  FHDEEWGVPVHDDK+LFELL
Subjt:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELL

Query:  CLSGTLAELTWPAILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQV
         LSG LAE TWP IL+KR  FRE+F DFDP ++ K+NEKK++ PGS A++LLS+LK+RA+IEN RQ+ KVI+E+GSF+ Y+W+FV +K I+S+FRY RQV
Subjt:  CLSGTLAELTWPAILNKRHLFREIFLDFDPTSVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQV

Query:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECI
        P KT KAEVISKDLV+RGFRSVGPTV+Y+FMQ AG+TNDHL SCFRF  CI
Subjt:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAGGCCCTCCGAGAATCCGGTCTATGAATGTGGCGGATTCCGATTCACGACCGGTACTTGGGCCTACTGGGAACAAAGCGCGAACTGTAGAGACTAGAAAACCTGG
TGTGAAGCCATTGAAGAAGCTTGAAAAGCCTCGTCAAGTAGTTGAATCAAAGGACAAAAGGGTGCCATTGTCCCCGCCTCAATGTGTTACAGTCCCATCGGTTTTAAGAC
AACAGGATCGCCACCAGGCGATTCTCAATCTGTCGATGAATGCCTCATGTTCTTCGGATGCGTCATCTGATTCGTTTAATAGTCGGGCGTCCAGTGCAAGAGGTACGAGA
CAGCGTGGTCCGAATTTGAGAAGGAAGCAATGTAGTACGGTTAAGGGGGCTGACAAGGCTGTTGAAAAGGTTGGTGTTGAAAGTGTGGCCGTGGTGGTGGATACAGTTGG
TTGCTTAGAGTCCAAAAAACGATGTGCTTGGGTAACGCCTAATACAGATCCATGTTATGCTGCTTTTCATGACGAAGAATGGGGAGTACCAGTTCACGATGACAAAAAAT
TGTTTGAACTGCTTTGCCTATCGGGCACATTGGCTGAACTTACATGGCCTGCCATCCTCAACAAAAGACATCTATTTAGGGAAATTTTTTTGGACTTCGACCCAACTTCT
GTTTCAAAATTAAACGAGAAAAAGATGGTTGCTCCTGGAAGCGCTGCTACTTCTTTACTGTCAGAACTCAAGATTCGAGCTATCATTGAAAATGGTCGTCAAATGTGCAA
GGTAATTGATGAATTTGGTTCCTTCAACGTGTACATGTGGAACTTTGTGAACCATAAACCAATCATCAGTCAGTTCCGGTACCCACGTCAAGTCCCGGATAAGACATCGA
AAGCAGAGGTGATAAGCAAGGATCTCGTAAAGAGAGGGTTTCGAAGCGTAGGACCAACAGTCATCTATACATTCATGCAGGTGGCTGGGTTAACTAATGACCATCTCATC
AGTTGCTTTAGGTTTACAGAATGTATAGAGACACAAACAGCAGAGAAAGGAGAAAGAGATGGTGAAATGAAGCTTAATCCTAATGAGAAAATGCCAGAGGCTTTGAAAAA
CTTGGAACTATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCAGGCCCTCCGAGAATCCGGTCTATGAATGTGGCGGATTCCGATTCACGACCGGTACTTGGGCCTACTGGGAACAAAGCGCGAACTGTAGAGACTAGAAAACCTGG
TGTGAAGCCATTGAAGAAGCTTGAAAAGCCTCGTCAAGTAGTTGAATCAAAGGACAAAAGGGTGCCATTGTCCCCGCCTCAATGTGTTACAGTCCCATCGGTTTTAAGAC
AACAGGATCGCCACCAGGCGATTCTCAATCTGTCGATGAATGCCTCATGTTCTTCGGATGCGTCATCTGATTCGTTTAATAGTCGGGCGTCCAGTGCAAGAGGTACGAGA
CAGCGTGGTCCGAATTTGAGAAGGAAGCAATGTAGTACGGTTAAGGGGGCTGACAAGGCTGTTGAAAAGGTTGGTGTTGAAAGTGTGGCCGTGGTGGTGGATACAGTTGG
TTGCTTAGAGTCCAAAAAACGATGTGCTTGGGTAACGCCTAATACAGATCCATGTTATGCTGCTTTTCATGACGAAGAATGGGGAGTACCAGTTCACGATGACAAAAAAT
TGTTTGAACTGCTTTGCCTATCGGGCACATTGGCTGAACTTACATGGCCTGCCATCCTCAACAAAAGACATCTATTTAGGGAAATTTTTTTGGACTTCGACCCAACTTCT
GTTTCAAAATTAAACGAGAAAAAGATGGTTGCTCCTGGAAGCGCTGCTACTTCTTTACTGTCAGAACTCAAGATTCGAGCTATCATTGAAAATGGTCGTCAAATGTGCAA
GGTAATTGATGAATTTGGTTCCTTCAACGTGTACATGTGGAACTTTGTGAACCATAAACCAATCATCAGTCAGTTCCGGTACCCACGTCAAGTCCCGGATAAGACATCGA
AAGCAGAGGTGATAAGCAAGGATCTCGTAAAGAGAGGGTTTCGAAGCGTAGGACCAACAGTCATCTATACATTCATGCAGGTGGCTGGGTTAACTAATGACCATCTCATC
AGTTGCTTTAGGTTTACAGAATGTATAGAGACACAAACAGCAGAGAAAGGAGAAAGAGATGGTGAAATGAAGCTTAATCCTAATGAGAAAATGCCAGAGGCTTTGAAAAA
CTTGGAACTATAA
Protein sequenceShow/hide protein sequence
MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQVVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTR
QRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGTLAELTWPAILNKRHLFREIFLDFDPTS
VSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLI
SCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL