; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS009382 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS009382
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionDNA glycosylase superfamily protein
Genome locationscaffold813:755924..758764
RNA-Seq ExpressionMS009382
SyntenyMS009382
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domainsIPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593364.1 hypothetical protein SDJN03_12840, partial [Cucurbita argyrosperma subsp. sororia]2.1e-18391.08Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQEAESKDKRVPLSPPQCV-SVPSVLRQQDRHQAILNLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VE RK G KPLKKLEKPHQEAESKDKRVPLSPPQCV +VPSVLRQQDRHQAIL LSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQEAESKDKRVPLSPPQCV-SVPSVLRQQDRHQAILNLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRK-QSTVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
        NSRASSARGTRQRGPNLRRK  STVKRAEKAVEKVG ESVV   +TV  LEPKKRCAWVT NTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
Subjt:  NSRASSARGTRQRGPNLRRK-QSTVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW

Query:  PAILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI
        P IL KRHLFRE FLDFDPNAVSKLNEKKMVA GSAATSLLSE KVRAIIENGRQMCK VIDEFGSF+VY+WNFVNHKP ISQFRYPRQVPDKTSKAEVI
Subjt:  PAILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI

Query:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAERGEKDGEIKPIINEKIPEALKNLEL
        SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+SCFRFPECIET E+GE+DG+IKP I EKIPEALKNLEL
Subjt:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAERGEKDGEIKPIINEKIPEALKNLEL

XP_004136097.2 uncharacterized protein LOC101205558 [Cucumis sativus]2.1e-18391.94Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQEAESKDKRVPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VE RKPG KPLKKLEKP QE ESKDKRVPLSPPQCV+VPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQEAESKDKRVPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQ-STVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
        SRASSARGTRQRGPNLRRKQ STVK A+KAVEKVGVESV VVVDTV  LE KKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQ-STVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP

Query:  AILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVIS
        AILNKRHLFREIFLDFDP AVSKLNEKKMVA GSAATSLLSELKVRAIIENGRQMCK VIDEFGSF+VY+WNFVNHKPIISQFRYPRQVPDKTSKAEVIS
Subjt:  AILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVIS

Query:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIE--TAERGEKD-GEIKPIINEKIPEALKNLEL
        KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLI CFRF ECIE  TAE+GE+D GE+K   NEK+PEALKNLEL
Subjt:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIE--TAERGEKD-GEIKPIINEKIPEALKNLEL

XP_008461179.1 PREDICTED: probable GMP synthase [glutamine-hydrolyzing] [Cucumis melo]3.3e-18491.64Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQEAESKDKRVPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VE RKPG KPLKKLEKP QE ESKDKRVPLSPPQCV+VPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQEAESKDKRVPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQ-STVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
        SRASSARGTRQRGPNLRRKQ STVK A+KAVEKVGVESV VV DTV  LE KKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQ-STVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP

Query:  AILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVIS
        AILNKRHLFREIFLDFDP  VSKLNEKKMVA GSAATSLLSELK+RAIIENGRQMCK VIDEFGSF+VY+WNFVNHKPIISQFRYPRQVPDKTSKAEVIS
Subjt:  AILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVIS

Query:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIE--TAERGEKDGEIKPIINEKIPEALKNLEL
        KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRF ECIE  TAE+GE+DGE+K   NEK+PEALKNLEL
Subjt:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIE--TAERGEKDGEIKPIINEKIPEALKNLEL

XP_022155202.1 uncharacterized protein LOC111022341 [Momordica charantia]3.8e-20499.73Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQEAESKDKRVPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQEAESKDKRVPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQEAESKDKRVPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSTVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPA
        SRASSARGTRQRGPNLRRKQSTVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPA
Subjt:  SRASSARGTRQRGPNLRRKQSTVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPA

Query:  ILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
        ILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCK VIDEFGSFDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
Subjt:  ILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISK

Query:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAERGEKDGEIKPIINEKIPEALKNLEL
        DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAERGEKDGEIKPIINEKIPEALKNLEL
Subjt:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAERGEKDGEIKPIINEKIPEALKNLEL

XP_023514420.1 uncharacterized protein LOC111778684 [Cucurbita pepo subsp. pepo]2.1e-18391.08Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQEAESKDKRVPLSPPQCV-SVPSVLRQQDRHQAILNLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VE RK G KPLKKLEKPHQEAESKDKRVPLSPPQCV +VPSVLRQQDRHQAIL LSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQEAESKDKRVPLSPPQCV-SVPSVLRQQDRHQAILNLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRK-QSTVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
        NSRASSARGTRQRGPNLRRK  S+VKRAEKAVEKVG ESVV V +TV  LEPKKRCAWVT NTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
Subjt:  NSRASSARGTRQRGPNLRRK-QSTVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW

Query:  PAILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI
        P IL KRHLFRE FLDFDPNAVSKLNEKKMVA GSAATSLLSE KVRAIIENGRQMCK VIDEFGSF+VY+WNFVNHKP ISQFRYPRQVPDKTSKAEVI
Subjt:  PAILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI

Query:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAERGEKDGEIKPIINEKIPEALKNLEL
        SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+SCFRFPECIET E+GE+DG+IKP I EKIPEALKNLEL
Subjt:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAERGEKDGEIKPIINEKIPEALKNLEL

TrEMBL top hitse value%identityAlignment
A0A0A0K8L6 Uncharacterized protein1.0e-18391.94Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQEAESKDKRVPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VE RKPG KPLKKLEKP QE ESKDKRVPLSPPQCV+VPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQEAESKDKRVPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQ-STVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
        SRASSARGTRQRGPNLRRKQ STVK A+KAVEKVGVESV VVVDTV  LE KKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQ-STVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP

Query:  AILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVIS
        AILNKRHLFREIFLDFDP AVSKLNEKKMVA GSAATSLLSELKVRAIIENGRQMCK VIDEFGSF+VY+WNFVNHKPIISQFRYPRQVPDKTSKAEVIS
Subjt:  AILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVIS

Query:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIE--TAERGEKD-GEIKPIINEKIPEALKNLEL
        KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLI CFRF ECIE  TAE+GE+D GE+K   NEK+PEALKNLEL
Subjt:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIE--TAERGEKD-GEIKPIINEKIPEALKNLEL

A0A1S3CE52 probable GMP synthase [glutamine-hydrolyzing]1.6e-18491.64Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQEAESKDKRVPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VE RKPG KPLKKLEKP QE ESKDKRVPLSPPQCV+VPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQEAESKDKRVPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQ-STVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
        SRASSARGTRQRGPNLRRKQ STVK A+KAVEKVGVESV VV DTV  LE KKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQ-STVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP

Query:  AILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVIS
        AILNKRHLFREIFLDFDP  VSKLNEKKMVA GSAATSLLSELK+RAIIENGRQMCK VIDEFGSF+VY+WNFVNHKPIISQFRYPRQVPDKTSKAEVIS
Subjt:  AILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVIS

Query:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIE--TAERGEKDGEIKPIINEKIPEALKNLEL
        KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRF ECIE  TAE+GE+DGE+K   NEK+PEALKNLEL
Subjt:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIE--TAERGEKDGEIKPIINEKIPEALKNLEL

A0A5A7UYZ9 Putative GMP synthase1.6e-18491.64Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQEAESKDKRVPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VE RKPG KPLKKLEKP QE ESKDKRVPLSPPQCV+VPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQEAESKDKRVPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQ-STVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
        SRASSARGTRQRGPNLRRKQ STVK A+KAVEKVGVESV VV DTV  LE KKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQ-STVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP

Query:  AILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVIS
        AILNKRHLFREIFLDFDP  VSKLNEKKMVA GSAATSLLSELK+RAIIENGRQMCK VIDEFGSF+VY+WNFVNHKPIISQFRYPRQVPDKTSKAEVIS
Subjt:  AILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVIS

Query:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIE--TAERGEKDGEIKPIINEKIPEALKNLEL
        KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRF ECIE  TAE+GE+DGE+K   NEK+PEALKNLEL
Subjt:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIE--TAERGEKDGEIKPIINEKIPEALKNLEL

A0A6J1DNQ3 uncharacterized protein LOC1110223411.8e-20499.73Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQEAESKDKRVPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQEAESKDKRVPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQEAESKDKRVPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSTVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPA
        SRASSARGTRQRGPNLRRKQSTVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPA
Subjt:  SRASSARGTRQRGPNLRRKQSTVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPA

Query:  ILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
        ILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCK VIDEFGSFDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
Subjt:  ILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISK

Query:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAERGEKDGEIKPIINEKIPEALKNLEL
        DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAERGEKDGEIKPIINEKIPEALKNLEL
Subjt:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAERGEKDGEIKPIINEKIPEALKNLEL

A0A6J1H7A2 uncharacterized protein LOC1114610812.0e-18290.54Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQEAESKDKRVPLSPPQCV-SVPSVLRQQDRHQAILNLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VE RK G KPLKKLEKPHQEAESKDKRVPLSPPQCV +VPSVLRQQDRHQAIL LSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQEAESKDKRVPLSPPQCV-SVPSVLRQQDRHQAILNLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRK-QSTVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
        NSRASSARGTRQRGPNLRRK  STVKRAEKAVEKVG ESVV   +TV  LEPKKRCAWVT NTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
Subjt:  NSRASSARGTRQRGPNLRRK-QSTVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW

Query:  PAILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI
        P IL KRHLFRE FLDFDPNAVSKLNEKKMVA GSAATSLLSE KVRAIIENGRQMCK VIDEFGSF+VY+WNFVNHKP ISQFRYPRQVPDKTSKA+VI
Subjt:  PAILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI

Query:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAERGEKDGEIKPIINEKIPEALKNLEL
        SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+SCFRF ECIET E+GE+DG+IKP I EKIPEALKNLEL
Subjt:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAERGEKDGEIKPIINEKIPEALKNLEL

SwissProt top hitse value%identityAlignment
P05100 DNA-3-methyladenine glycosylase 11.3e-3741.4Show/hide
Query:  KRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENG
        +RC WV  + DP Y A+HD EWGVP  D KKLFE++CL G  A L+W  +L KR  +R  F  FDP  V+ + E+ +      A  +    K++AII N 
Subjt:  KRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENG

Query:  RQMCKVVIDEFGS-FDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFP
        R   +  +++ G  F  ++W+FVNH+P ++Q     ++P  TS ++ +SK L KRGF+ VG T+ Y+FMQ  GL NDH++ C  +P
Subjt:  RQMCKVVIDEFGS-FDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFP

P44321 DNA-3-methyladenine glycosylase7.2e-3339.78Show/hide
Query:  RCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGR
        RC WV       Y  +HD+EWG P  D +KLFE +CL G  A L+W  +L KR  +RE F  FDP  ++K+    + A    +  +    K+ AI++N +
Subjt:  RCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGR

Query:  QMCKVVIDEFG-SFDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISC
            + +++ G +F  +IW+FVNHKPI++     R VP KT  ++ +SK L KRGF  +G T  Y FMQ  GL +DHL  C
Subjt:  QMCKVVIDEFG-SFDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing]6.1e-4044.62Show/hide
Query:  DTVAGLEPKKRCAWVTPNTDPC---YAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLL
        D+  G+  K RCAW T   +     Y  +HD EWG P+H+DKKLFE L L G  A L+W  IL KR  FR  F DFDP+ V+  +E K+         + 
Subjt:  DTVAGLEPKKRCAWVTPNTDPC---YAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLL

Query:  SELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFR
        +  K+ A I N +     V  EFGSFD YIW FV  KPII+ F     +P  T  ++ I+KDL KRGF+ VG T +Y  MQ  G+ NDHL SCF+
Subjt:  SELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFR

Arabidopsis top hitse value%identityAlignment
AT1G15970.1 DNA glycosylase superfamily protein3.9e-9055.34Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQE---AESKDKR-----VPLSP----PQCVSV-PSVLRQQDRHQAILNLSMN
        MS PPR RS+N  + + R VLGPTGNK +    RKP G    KLEKP  E    +SKD++      P SP     QC S+  S+LR+        + SM 
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQE---AESKDKR-----VPLSP----PQCVSV-PSVLRQQDRHQAILNLSMN

Query:  ASCSSDASSDSFNSRASSARGTRQRGPNLRRKQSTVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELL
        AS SSDASS   +S  S A  +  +   + R+  +V    K    VG E   V  D  A  + +KRCAW+TP  DPCY AFHDEEWGVPVHDDKKLFELL
Subjt:  ASCSSDASSDSFNSRASSARGTRQRGPNLRRKQSTVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELL

Query:  CLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQ
        CLSGALAEL+W  IL++RH+ RE+F+DFDP AV++LN+KK+ A G+AA SLLSE+K+R+I++N R + K +I E GS   Y+WNFVN+KP  SQFRY RQ
Subjt:  CLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQ

Query:  VPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAE
        VP KTSKAE ISKDLV+RGFRSV PTVIY+FMQ AGLTNDHLI CFR+ +C   AE
Subjt:  VPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAE

AT1G75090.1 DNA glycosylase superfamily protein2.3e-6644.59Show/hide
Query:  PLKKLEKPHQEAESKDKRVPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTRQRGPNLRRKQSTVKRAEKAVEKVG--VES
        P+K +++      S   R  ++  +    P +  +  +  A      N S S+D SS S +S   S+  T   G      + T       VEK+   V S
Subjt:  PLKKLEKPHQEAESKDKRVPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTRQRGPNLRRKQSTVKRAEKAVEKVG--VES

Query:  VVVVVDTVAGLE-PKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAAT
        V VV D    +  P KRC W+TPN+DP Y  FHDEEWGVPV DDKKLFELL  S ALAE +WP+IL +R  FR++F +FDP+A+++  EK++++      
Subjt:  VVVVVDTVAGLE-PKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAAT

Query:  SLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFP
         +LSE K+RAI+EN + + KV   EFGSF  Y W FVNHKP+ + +RY RQVP K+ KAE ISKD+++RGFR VGPTV+Y+F+Q +G+ NDHL +CFR+ 
Subjt:  SLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFP

Query:  ECIETAERGEKDGE
        EC    ER  K  E
Subjt:  ECIETAERGEKDGE

AT1G80850.1 DNA glycosylase superfamily protein7.3e-8952.96Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQEAESKDKRVPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSD---
        MS PPR+RS++ +D + R VLGP GNK +     KP  KP+ +  K     E   +  PLSPP       +LR+         +SM AS SSDASS    
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQEAESKDKRVPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSD---

Query:  ---SFNSRASSARGTRQRG----PNLRRKQSTVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLS
           S  S +S  R  R+ G     +  R+  T +R EKA +                 + +KRCAW+TP +D CY AFHDEEWGVPVHDDK+LFELL LS
Subjt:  ---SFNSRASSARGTRQRG----PNLRRKQSTVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLS

Query:  GALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQVPD
        GALAEL+W  IL+KR LFRE+F+DFDP A+S+L  KK+ +   AAT+LLSE K+R+I+EN  Q+CK +I  FGSFD YIWNFVN KP  SQFRYPRQVP 
Subjt:  GALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQVPD

Query:  KTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAERG
        KTSKAE+ISKDLV+RGFRSV PTVIY+FMQ AGLTNDHL  CFR  +C+   E G
Subjt:  KTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAERG

AT5G57970.1 DNA glycosylase superfamily protein2.7e-9955.9Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQEAESKDKRVPLSPP----------QCVSVPSVLRQQDRHQAIL--NLSMNA
        MSG PR++SMNVA++++R  LG T  KA P    K   K L+KLE+        D++   + P            ++  S+LR   RH+  L  NLS+NA
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQEAESKDKRVPLSPP----------QCVSVPSVLRQQDRHQAIL--NLSMNA

Query:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSTVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLC
        S SSDAS DSF+SRAS+ R  R      R K S   +    V +  ++S         G E KKRC WVTPN+DPCY  FHDEEWGVPVHDDK+LFELL 
Subjt:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSTVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLC

Query:  LSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQV
        LSGALAE TWP IL+KR  FRE+F DFDPNA+ K+NEKK++  GS A++LLS+LK+RA+IEN RQ+ K VI+E+GSFD YIW+FV +K I+S+FRY RQV
Subjt:  LSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQV

Query:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAER
        P KT KAEVISKDLV+RGFRSVGPTV+Y+FMQ AG+TNDHL SCFRF  CI   ER
Subjt:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAER

AT5G57970.2 DNA glycosylase superfamily protein2.7e-9955.9Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQEAESKDKRVPLSPP----------QCVSVPSVLRQQDRHQAIL--NLSMNA
        MSG PR++SMNVA++++R  LG T  KA P    K   K L+KLE+        D++   + P            ++  S+LR   RH+  L  NLS+NA
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQEAESKDKRVPLSPP----------QCVSVPSVLRQQDRHQAIL--NLSMNA

Query:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSTVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLC
        S SSDAS DSF+SRAS+ R  R      R K S   +    V +  ++S         G E KKRC WVTPN+DPCY  FHDEEWGVPVHDDK+LFELL 
Subjt:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSTVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLC

Query:  LSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQV
        LSGALAE TWP IL+KR  FRE+F DFDPNA+ K+NEKK++  GS A++LLS+LK+RA+IEN RQ+ K VI+E+GSFD YIW+FV +K I+S+FRY RQV
Subjt:  LSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQV

Query:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAER
        P KT KAEVISKDLV+RGFRSVGPTV+Y+FMQ AG+TNDHL SCFRF  CI   ER
Subjt:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAER


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAGGCCCTCCCAGAATCCGGTCGATGAATGTGGCGGATTCCGACTCCCGGCCGGTTCTTGGGCCTACCGGAAACAAAGCTCGACCTGTCGAGCCCAGAAAACCTGG
TGGGAAGCCATTGAAGAAGCTTGAAAAGCCTCACCAGGAGGCTGAATCGAAGGACAAGAGGGTGCCATTGTCGCCGCCTCAATGCGTCTCGGTGCCATCGGTTCTGAGGC
AGCAGGACCGGCACCAGGCGATTCTCAATCTCTCGATGAATGCGTCGTGTTCTTCCGATGCGTCGTCTGATTCGTTCAATAGCCGGGCGTCTAGCGCGAGAGGTACGAGG
CAGCGCGGTCCCAATTTGAGGAGAAAGCAAAGTACGGTAAAGAGGGCTGAAAAGGCCGTTGAAAAGGTTGGCGTTGAAAGTGTGGTGGTGGTGGTGGATACAGTTGCTGG
ATTAGAGCCAAAAAAACGATGTGCTTGGGTAACACCTAATACAGATCCATGTTATGCTGCTTTTCATGATGAAGAGTGGGGAGTACCTGTTCACGATGACAAAAAATTGT
TTGAACTGCTCTGCCTATCGGGTGCTTTGGCTGAACTTACATGGCCTGCTATCCTTAACAAAAGACATCTATTTAGGGAAATCTTTTTGGACTTTGACCCAAATGCTGTT
TCAAAATTAAACGAGAAAAAGATGGTTGCAGCTGGAAGTGCTGCTACCTCTTTACTGTCAGAACTTAAGGTGCGAGCTATCATTGAAAATGGTCGTCAAATGTGCAAGGT
AGTAATTGATGAATTTGGTTCCTTCGACGTGTACATTTGGAACTTTGTCAACCACAAACCGATCATCAGTCAGTTTCGGTACCCACGCCAGGTCCCCGATAAGACTTCAA
AAGCAGAGGTGATTAGCAAGGATCTCGTTAAGAGAGGGTTTCGTAGCGTGGGACCGACAGTCATCTATACATTCATGCAGGTGGCAGGGTTAACCAACGACCATCTCATC
AGTTGCTTTAGGTTTCCAGAATGTATAGAGACAGCAGAGAGAGGAGAAAAGGATGGTGAAATCAAGCCTATTATTAACGAGAAAATACCAGAGGCTCTGAAAAACTTGGA
ACTA
mRNA sequenceShow/hide mRNA sequence
ATGTCAGGCCCTCCCAGAATCCGGTCGATGAATGTGGCGGATTCCGACTCCCGGCCGGTTCTTGGGCCTACCGGAAACAAAGCTCGACCTGTCGAGCCCAGAAAACCTGG
TGGGAAGCCATTGAAGAAGCTTGAAAAGCCTCACCAGGAGGCTGAATCGAAGGACAAGAGGGTGCCATTGTCGCCGCCTCAATGCGTCTCGGTGCCATCGGTTCTGAGGC
AGCAGGACCGGCACCAGGCGATTCTCAATCTCTCGATGAATGCGTCGTGTTCTTCCGATGCGTCGTCTGATTCGTTCAATAGCCGGGCGTCTAGCGCGAGAGGTACGAGG
CAGCGCGGTCCCAATTTGAGGAGAAAGCAAAGTACGGTAAAGAGGGCTGAAAAGGCCGTTGAAAAGGTTGGCGTTGAAAGTGTGGTGGTGGTGGTGGATACAGTTGCTGG
ATTAGAGCCAAAAAAACGATGTGCTTGGGTAACACCTAATACAGATCCATGTTATGCTGCTTTTCATGATGAAGAGTGGGGAGTACCTGTTCACGATGACAAAAAATTGT
TTGAACTGCTCTGCCTATCGGGTGCTTTGGCTGAACTTACATGGCCTGCTATCCTTAACAAAAGACATCTATTTAGGGAAATCTTTTTGGACTTTGACCCAAATGCTGTT
TCAAAATTAAACGAGAAAAAGATGGTTGCAGCTGGAAGTGCTGCTACCTCTTTACTGTCAGAACTTAAGGTGCGAGCTATCATTGAAAATGGTCGTCAAATGTGCAAGGT
AGTAATTGATGAATTTGGTTCCTTCGACGTGTACATTTGGAACTTTGTCAACCACAAACCGATCATCAGTCAGTTTCGGTACCCACGCCAGGTCCCCGATAAGACTTCAA
AAGCAGAGGTGATTAGCAAGGATCTCGTTAAGAGAGGGTTTCGTAGCGTGGGACCGACAGTCATCTATACATTCATGCAGGTGGCAGGGTTAACCAACGACCATCTCATC
AGTTGCTTTAGGTTTCCAGAATGTATAGAGACAGCAGAGAGAGGAGAAAAGGATGGTGAAATCAAGCCTATTATTAACGAGAAAATACCAGAGGCTCTGAAAAACTTGGA
ACTA
Protein sequenceShow/hide protein sequence
MSGPPRIRSMNVADSDSRPVLGPTGNKARPVEPRKPGGKPLKKLEKPHQEAESKDKRVPLSPPQCVSVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTR
QRGPNLRRKQSTVKRAEKAVEKVGVESVVVVVDTVAGLEPKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAV
SKLNEKKMVAAGSAATSLLSELKVRAIIENGRQMCKVVIDEFGSFDVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLI
SCFRFPECIETAERGEKDGEIKPIINEKIPEALKNLEL