; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc01g0033231 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc01g0033231
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionDNA glycosylase superfamily protein
Genome locationCMiso1.1chr01:33428408..33432321
RNA-Seq ExpressionCmc01g0033231
SyntenyCmc01g0033231
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domainsIPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136097.2 uncharacterized protein LOC101205558 [Cucumis sativus]4.5e-20598.65Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
        SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVV DTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP

Query:  AILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
        AILNKRHLFREIFLDFDPT VSKLNEKKMVAPGSAATSLLSELK+RAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
Subjt:  AILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK

Query:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERD-GEMKLNPNEKMPEALKNLEL
        DLVKRGFRSVGPTVIYTFMQVAGLTNDHLI CFRFTECIETQTAEKGERD GEMKLNPNEKMPEALKNLEL
Subjt:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERD-GEMKLNPNEKMPEALKNLEL

XP_008461179.1 PREDICTED: probable GMP synthase [glutamine-hydrolyzing] [Cucumis melo]5.6e-208100Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
        SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP

Query:  AILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
        AILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
Subjt:  AILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK

Query:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL
        DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL
Subjt:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL

XP_022155202.1 uncharacterized protein LOC111022341 [Momordica charantia]1.4e-18591.89Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VE RKPG KPLKKLEKP QE ESKDKRVPLSPPQCV+VPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
        SRASSARGTRQRGPNLRRKQ STVK A+KAVEKVGVESV VV DTV  LE KKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP

Query:  AILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
        AILNKRHLFREIFLDFDP  VSKLNEKKMVA GSAATSLLSELK+RAIIENGRQMCKVIDEFGSF+VY+WNFVNHKPIISQFRYPRQVPDKTSKAEVISK
Subjt:  AILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK

Query:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL
        DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRF ECIE  TAE+GE+DGE+K   NEK+PEALKNLEL
Subjt:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL

XP_023514420.1 uncharacterized protein LOC111778684 [Cucurbita pepo subsp. pepo]3.7e-18390.03Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRK GVKPLKKLEKP QE ESKDKRVPLSPPQCV TVPSVLRQQDRHQAIL LSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
        NSRASSARGTRQRGPNLRRK  S+VK A+KAVEKVG ESV  VA+TVGCLE KKRCAWVT NTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
Subjt:  NSRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW

Query:  PAILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVIS
        P IL KRHLFRE FLDFDP  VSKLNEKKMVAPGSAATSLLSE K+RAIIENGRQMCKVIDEFGSFNVY+WNFVNHKP ISQFRYPRQVPDKTSKAEVIS
Subjt:  PAILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVIS

Query:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL
        KDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+SCFRF ECIE  T EKGERDG++K    EK+PEALKNLEL
Subjt:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL

XP_038900164.1 probable GMP synthase [glutamine-hydrolyzing] [Benincasa hispida]1.8e-19895.98Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVES---VAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAEL
        SRASSARGTRQRGPNLRRKQ STVKGA K+VEKVGVES   VAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAEL
Subjt:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVES---VAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAEL

Query:  TWPAILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEV
        TWPAILNKR+LFREIFLDFDP VVSKLNEKKMVAPGSAATSLLSELK+RAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEV
Subjt:  TWPAILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEV

Query:  ISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL
        ISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRF ECIETQTAEKGE+DGE+KL  NEKMPEALKNLEL
Subjt:  ISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL

TrEMBL top hitse value%identityAlignment
A0A0A0K8L6 Uncharacterized protein2.2e-20598.65Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
        SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVV DTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP

Query:  AILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
        AILNKRHLFREIFLDFDPT VSKLNEKKMVAPGSAATSLLSELK+RAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
Subjt:  AILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK

Query:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERD-GEMKLNPNEKMPEALKNLEL
        DLVKRGFRSVGPTVIYTFMQVAGLTNDHLI CFRFTECIETQTAEKGERD GEMKLNPNEKMPEALKNLEL
Subjt:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERD-GEMKLNPNEKMPEALKNLEL

A0A1S3CE52 probable GMP synthase [glutamine-hydrolyzing]2.7e-208100Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
        SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP

Query:  AILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
        AILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
Subjt:  AILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK

Query:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL
        DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL
Subjt:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL

A0A5A7UYZ9 Putative GMP synthase2.7e-208100Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
        SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP

Query:  AILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
        AILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
Subjt:  AILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK

Query:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL
        DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL
Subjt:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL

A0A6J1DNQ3 uncharacterized protein LOC1110223416.5e-18691.89Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VE RKPG KPLKKLEKP QE ESKDKRVPLSPPQCV+VPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
        SRASSARGTRQRGPNLRRKQ STVK A+KAVEKVGVESV VV DTV  LE KKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
Subjt:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP

Query:  AILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
        AILNKRHLFREIFLDFDP  VSKLNEKKMVA GSAATSLLSELK+RAIIENGRQMCKVIDEFGSF+VY+WNFVNHKPIISQFRYPRQVPDKTSKAEVISK
Subjt:  AILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK

Query:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL
        DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRF ECIE  TAE+GE+DGE+K   NEK+PEALKNLEL
Subjt:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL

A0A6J1H7A2 uncharacterized protein LOC1114610811.2e-18289.49Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRK GVKPLKKLEKP QE ESKDKRVPLSPPQCV TVPSVLRQQDRHQAIL LSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
        NSRASSARGTRQRGPNLRRK  STVK A+KAVEKVG ESV    +TVGCLE KKRCAWVT NTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW
Subjt:  NSRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTW

Query:  PAILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVIS
        P IL KRHLFRE FLDFDP  VSKLNEKKMVAPGSAATSLLSE K+RAIIENGRQMCKVIDEFGSFNVY+WNFVNHKP ISQFRYPRQVPDKTSKA+VIS
Subjt:  PAILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVIS

Query:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL
        KDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+SCFRF ECIE  T EKGERDG++K    EK+PEALKNLEL
Subjt:  KDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL

SwissProt top hitse value%identityAlignment
P05100 DNA-3-methyladenine glycosylase 19.8e-3842.22Show/hide
Query:  KRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENG
        +RC WV  + DP Y A+HD EWGVP  D KKLFE++CL G  A L+W  +L KR  +R  F  FDP  V+ + E+ +      A  +    KI+AII N 
Subjt:  KRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENG

Query:  RQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISC
        R   ++      F  ++W+FVNH+P ++Q     ++P  TS ++ +SK L KRGF+ VG T+ Y+FMQ  GL NDH++ C
Subjt:  RQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISC

P44321 DNA-3-methyladenine glycosylase2.1e-3239.11Show/hide
Query:  RCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGR
        RC WV       Y  +HD+EWG P  D +KLFE +CL G  A L+W  +L KR  +RE F  FDP  ++K+    + A    +  +    K+ AI++N +
Subjt:  RCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGR

Query:  QMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISC
            +     +F+ ++W+FVNHKPI++     R VP KT  ++ +SK L KRGF  +G T  Y FMQ  GL +DHL  C
Subjt:  QMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing]1.8e-3943.15Show/hide
Query:  LESKKRCAWVTPNTDPC---YAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKI
        +  K RCAW T   +     Y  +HD EWG P+H+DKKLFE L L G  A L+W  IL KR  FR  F DFDP +V+  +E K+         + +  KI
Subjt:  LESKKRCAWVTPNTDPC---YAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKI

Query:  RAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQ
         A I N +    V  EFGSF+ Y+W FV  KPII+ F     +P  T  ++ I+KDL KRGF+ VG T +Y  MQ  G+ NDHL SCF+    +  Q
Subjt:  RAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQ

Arabidopsis top hitse value%identityAlignment
AT1G15970.1 DNA glycosylase superfamily protein2.2e-9354.08Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQE---VESKDKR-----VPLSP----PQCVTV-PSVLRQQDRHQAILNLSMN
        MS PPR RS+N  + + R VLGPTGNK +    RKP   P  KLEKP  E   ++SKD++      P SP     QC ++  S+LR+        + SM 
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQE---VESKDKR-----VPLSP----PQCVTV-PSVLRQQDRHQAILNLSMN

Query:  ASCSSDASSDSFNSRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFEL
        AS SSDASS   +S  S A  +  +    R    S+ +        VG E   V  D     + +KRCAW+TP  DPCY AFHDEEWGVPVHDDKKLFEL
Subjt:  ASCSSDASSDSFNSRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFEL

Query:  LCLSGALAELTWPAILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQ
        LCLSGALAEL+W  IL++RH+ RE+F+DFDP  V++LN+KK+ APG+AA SLLSE+KIR+I++N R + K+I E GS   YMWNFVN+KP  SQFRY RQ
Subjt:  LCLSGALAELTWPAILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQ

Query:  VPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTEC---IETQTAEKGERDGE
        VP KTSKAE ISKDLV+RGFRSV PTVIY+FMQ AGLTNDHLI CFR+ +C    ET T  K ++  E
Subjt:  VPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTEC---IETQTAEKGERDGE

AT1G75090.1 DNA glycosylase superfamily protein7.9e-6743.75Show/hide
Query:  PLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVG--VE
        P+K +++ R  + S   R  ++  +    P +  +  +  A      N S S+D SS S +S   S+  T   G         T       VEK+   V 
Subjt:  PLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVG--VE

Query:  SVAVVADTVGCLESK-KRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAA
        SVAVV D    +    KRC W+TPN+DP Y  FHDEEWGVPV DDKKLFELL  S ALAE +WP+IL +R  FR++F +FDP+ +++  EK++++     
Subjt:  SVAVVADTVGCLESK-KRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAA

Query:  TSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFT
          +LSE K+RAI+EN + + KV  EFGSF+ Y W FVNHKP+ + +RY RQVP K+ KAE ISKD+++RGFR VGPTV+Y+F+Q +G+ NDHL +CFR+ 
Subjt:  TSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFT

Query:  ECIETQTAEKGERDGEMKLN
        EC      E    + E KL+
Subjt:  ECIETQTAEKGERDGEMKLN

AT1G80850.1 DNA glycosylase superfamily protein3.3e-8953.69Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MS PPR+RS++ +D + R VLGP GNK +     KP  KP+ +  K     E   +  PLSPP       +LR+         +SM AS SSDASS S  
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP
        S   S   T      LRR    +V  +      +  E     +D     + +KRCAW+TP +D CY AFHDEEWGVPVHDDK+LFELL LSGALAEL+W 
Subjt:  SRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP

Query:  AILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
         IL+KR LFRE+F+DFDP  +S+L  KK+ +P  AAT+LLSE K+R+I+EN  Q+CK+I  FGSF+ Y+WNFVN KP  SQFRYPRQVP KTSKAE+ISK
Subjt:  AILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK

Query:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECI
        DLV+RGFRSV PTVIY+FMQ AGLTNDHL  CFR  +C+
Subjt:  DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECI

AT5G57970.1 DNA glycosylase superfamily protein5.1e-9854.7Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPP----------QCVTVPSVLRQQDRHQAIL--NLSMNA
        MSG PR++SMNVA++++R  LG T  KA    T K   K L+KLE+        D++   + P            +   S+LR   RH+  L  NLS+NA
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPP----------QCVTVPSVLRQQDRHQAIL--NLSMNA

Query:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELL
        S SSDAS DSF+SRAS+ R  R      R K   +        +   V S   +       E+KKRC WVTPN+DPCY  FHDEEWGVPVHDDK+LFELL
Subjt:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELL

Query:  CLSGALAELTWPAILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQV
         LSGALAE TWP IL+KR  FRE+F DFDP  + K+NEKK++ PGS A++LLS+LK+RA+IEN RQ+ KVI+E+GSF+ Y+W+FV +K I+S+FRY RQV
Subjt:  CLSGALAELTWPAILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQV

Query:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECI
        P KT KAEVISKDLV+RGFRSVGPTV+Y+FMQ AG+TNDHL SCFRF  CI
Subjt:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECI

AT5G57970.2 DNA glycosylase superfamily protein5.1e-9854.7Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPP----------QCVTVPSVLRQQDRHQAIL--NLSMNA
        MSG PR++SMNVA++++R  LG T  KA    T K   K L+KLE+        D++   + P            +   S+LR   RH+  L  NLS+NA
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPP----------QCVTVPSVLRQQDRHQAIL--NLSMNA

Query:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELL
        S SSDAS DSF+SRAS+ R  R      R K   +        +   V S   +       E+KKRC WVTPN+DPCY  FHDEEWGVPVHDDK+LFELL
Subjt:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELL

Query:  CLSGALAELTWPAILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQV
         LSGALAE TWP IL+KR  FRE+F DFDP  + K+NEKK++ PGS A++LLS+LK+RA+IEN RQ+ KVI+E+GSF+ Y+W+FV +K I+S+FRY RQV
Subjt:  CLSGALAELTWPAILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQV

Query:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECI
        P KT KAEVISKDLV+RGFRSVGPTV+Y+FMQ AG+TNDHL SCFRF  CI
Subjt:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAGGCCCTCCGAGAATCCGGTCTATGAATGTGGCGGATTCCGATTCACGACCGGTTCTTGGGCCTACTGGGAACAAAGCGCGAACTGTAGAGACTAGAAAACCTGG
TGTGAAGCCATTGAAGAAGCTTGAGAAGCCTCGTCAAGAAGTTGAATCAAAGGACAAAAGGGTGCCATTGTCTCCGCCTCAATGCGTTACAGTTCCATCGGTTTTAAGGC
AACAGGATCGCCACCAGGCGATTCTCAATCTGTCAATGAATGCCTCGTGTTCTTCGGATGCGTCGTCTGATTCGTTTAATAGTCGGGCGTCCAGTGCAAGAGGTACGAGA
CAGCGTGGTCCGAATTTGAGAAGGAAGCAATGTAGTACGGTTAAGGGGGCTGACAAGGCTGTTGAAAAGGTTGGTGTCGAAAGTGTGGCCGTGGTGGCGGATACAGTTGG
TTGCTTAGAGTCCAAAAAACGATGTGCTTGGGTAACGCCTAATACAGATCCATGTTATGCTGCTTTTCACGACGAAGAATGGGGAGTACCAGTTCACGATGACAAAAAAT
TGTTTGAATTGCTTTGCCTATCGGGTGCATTGGCTGAACTTACATGGCCTGCCATCCTCAACAAAAGACATCTATTTAGGGAAATTTTTTTGGACTTCGACCCAACTGTC
GTTTCGAAATTAAACGAGAAAAAAATGGTTGCTCCTGGAAGTGCTGCTACTTCTTTACTGTCAGAACTCAAGATTCGAGCTATCATTGAAAATGGTCGTCAAATGTGCAA
GGTAATTGATGAATTTGGTTCCTTCAACGTGTACATGTGGAACTTTGTGAACCATAAACCAATCATCAGTCAGTTCCGGTACCCACGTCAAGTCCCGGATAAGACATCGA
AAGCAGAGGTGATAAGCAAGGATCTCGTAAAGAGAGGGTTTCGAAGCGTAGGACCAACAGTCATCTATACATTCATGCAGGTGGCTGGGTTAACTAATGACCATCTCATC
AGTTGCTTTAGGTTTACAGAATGTATAGAGACACAAACAGCAGAGAAAGGAGAAAGAGATGGTGAAATGAAGCTTAATCCTAATGAGAAAATGCCAGAGGCTTTGAAAAA
CTTGGAACTATAA
mRNA sequenceShow/hide mRNA sequence
TTTTCCTTTCTCTCACCCCCACCCTTCTCTCTCTCGATTCGTTGCTTTCTTCAGCAGGAAAGTCGAACAAAAAATTTTGAGTCTCAACCCTCAAAAAAAATTTCTCCTCA
CTTTCTCATATTTTTCCCATTTCTCTCTCTTCTTTCCCTTTTGATTTTCATACTCTCAAACACAATCATGGCCGTTGCGAATTTTTCATCATCTTCTTCCAGATAAGTTT
TTCCCTCTTTGAGCTCTTTTTTCTTCATCAATGTCGCTCCATTTTTCAAACCCACTTAGGGTTCTGCCTCTTCTTCTTTCCCATTCACTCCCAAACCCTACTGTTTCAAA
CCCCTTTTGACCTTTTTTGCTCAAATGAAGTAAAGGGGTTCTTTTTTTTTTTTTTTGGTTTAGTCTGACCCCTGCTTGATTTCGACACCCAAAATTGTGGTTGTGGTGAT
TATTGATTTTGTTTGTGCATTTTTTAATTGAGAACAAGAAATTTCACTGAAATGTCAGGCCCTCCGAGAATCCGGTCTATGAATGTGGCGGATTCCGATTCACGACCGGT
TCTTGGGCCTACTGGGAACAAAGCGCGAACTGTAGAGACTAGAAAACCTGGTGTGAAGCCATTGAAGAAGCTTGAGAAGCCTCGTCAAGAAGTTGAATCAAAGGACAAAA
GGGTGCCATTGTCTCCGCCTCAATGCGTTACAGTTCCATCGGTTTTAAGGCAACAGGATCGCCACCAGGCGATTCTCAATCTGTCAATGAATGCCTCGTGTTCTTCGGAT
GCGTCGTCTGATTCGTTTAATAGTCGGGCGTCCAGTGCAAGAGGTACGAGACAGCGTGGTCCGAATTTGAGAAGGAAGCAATGTAGTACGGTTAAGGGGGCTGACAAGGC
TGTTGAAAAGGTTGGTGTCGAAAGTGTGGCCGTGGTGGCGGATACAGTTGGTTGCTTAGAGTCCAAAAAACGATGTGCTTGGGTAACGCCTAATACAGATCCATGTTATG
CTGCTTTTCACGACGAAGAATGGGGAGTACCAGTTCACGATGACAAAAAATTGTTTGAATTGCTTTGCCTATCGGGTGCATTGGCTGAACTTACATGGCCTGCCATCCTC
AACAAAAGACATCTATTTAGGGAAATTTTTTTGGACTTCGACCCAACTGTCGTTTCGAAATTAAACGAGAAAAAAATGGTTGCTCCTGGAAGTGCTGCTACTTCTTTACT
GTCAGAACTCAAGATTCGAGCTATCATTGAAAATGGTCGTCAAATGTGCAAGGTAATTGATGAATTTGGTTCCTTCAACGTGTACATGTGGAACTTTGTGAACCATAAAC
CAATCATCAGTCAGTTCCGGTACCCACGTCAAGTCCCGGATAAGACATCGAAAGCAGAGGTGATAAGCAAGGATCTCGTAAAGAGAGGGTTTCGAAGCGTAGGACCAACA
GTCATCTATACATTCATGCAGGTGGCTGGGTTAACTAATGACCATCTCATCAGTTGCTTTAGGTTTACAGAATGTATAGAGACACAAACAGCAGAGAAAGGAGAAAGAGA
TGGTGAAATGAAGCTTAATCCTAATGAGAAAATGCCAGAGGCTTTGAAAAACTTGGAACTATAAAAGAAACCATTGGTAGCTTTTGAACCTTGCCTCATTGTAATTAGCT
TCCAGAGTTCTTTTTTCTTTTCTTTTCTTTCTTTTTTGTAATGATGGCTTGTAAATTCCTTGATGGGATCCATTCGCCACTTCTTTCAATGGGGTAAATTTTATCAATGA
TTTTGTGTATAAACTGAATTGGATACAGAAGACAGCTAGAATGAGTTCTGTTGGTCGTCTATTACTTCAAGCAATGTGGTTGGTTATTTATATTTAGAATTTATATTTAG
AACCATGATGTTGGTTCCACTTCTACTTCTAACATGAGCTCTC
Protein sequenceShow/hide protein sequence
MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTR
QRGPNLRRKQCSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPTV
VSKLNEKKMVAPGSAATSLLSELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLI
SCFRFTECIETQTAEKGERDGEMKLNPNEKMPEALKNLEL