; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg014080 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg014080
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionDNA glycosylase superfamily protein
Genome locationscaffold3:47084784..47088616
RNA-Seq ExpressionSpg014080
SyntenySpg014080
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domainsIPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593364.1 hypothetical protein SDJN03_12840, partial [Cucurbita argyrosperma subsp. sororia]9.4e-19090.18Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRK GVKPLKKLEKP QEAESKDKRVPLSPPQCV TVPSVLRQQDRHQAIL LSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVGVESVVAVADTVGCLEPKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFHDEEWGVPVHDD
        NSRASSARGTRQRGPNLRRK S STVKRAEKAVEKVG ESVVA  +TVGCLEPKKRCAWVT N                   PCYAAFHDEEWGVPVHDD
Subjt:  NSRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVGVESVVAVADTVGCLEPKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFHDEEWGVPVHDD

Query:  KKLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQ
        KKLFELLCLSGALAELTWP+IL KRHLFRETFLDFDPNAVSKLNEKKMVAPGSAAT+LLSE KVRAIIENGRQMCKVIDEFGSFNVY+WNFVNHKP ISQ
Subjt:  KKLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQ

Query:  FRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETTEKGERDGDIKPTINEKIPEALKNLEL
        FRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+SCFRFPECIETTEKGERDGDIKPTI EKIPEALKNLEL
Subjt:  FRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETTEKGERDGDIKPTINEKIPEALKNLEL

XP_022155202.1 uncharacterized protein LOC111022341 [Momordica charantia]1.0e-18889.64Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VE RKPG KPLKKLEKP QEAESKDKRVPLSPPQCV+VPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVGVESVVAVADTVGCLEPKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFHDEEWGVPVHDDK
        SRASSARGTRQRGPNLRRKQ  STVKRAEKAVEKVGVESVV V DTV  LEPKKRCAWVTPN                   PCYAAFHDEEWGVPVHDDK
Subjt:  SRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVGVESVVAVADTVGCLEPKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFHDEEWGVPVHDDK

Query:  KLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQF
        KLFELLCLSGALAELTWP+ILNKRHLFRE FLDFDPNAVSKLNEKKMVA GSAAT+LLSELKVRAIIENGRQMCKVIDEFGSF+VYIWNFVNHKPIISQF
Subjt:  KLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQF

Query:  RYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETTEKGERDGDIKPTINEKIPEALKNLEL
        RYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIET E+GE+DG+IKP INEKIPEALKNLEL
Subjt:  RYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETTEKGERDGDIKPTINEKIPEALKNLEL

XP_022960311.1 uncharacterized protein LOC111461081 [Cucurbita moschata]1.8e-18889.66Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRK GVKPLKKLEKP QEAESKDKRVPLSPPQCV TVPSVLRQQDRHQAIL LSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVGVESVVAVADTVGCLEPKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFHDEEWGVPVHDD
        NSRASSARGTRQRGPNLRRK S STVKRAEKAVEKVG ESVVA  +TVGCLEPKKRCAWVT N                   PCYAAFHDEEWGVPVHDD
Subjt:  NSRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVGVESVVAVADTVGCLEPKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFHDEEWGVPVHDD

Query:  KKLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQ
        KKLFELLCLSGALAELTWP+IL KRHLFRETFLDFDPNAVSKLNEKKMVAPGSAAT+LLSE KVRAIIENGRQMCKVIDEFGSFNVY+WNFVNHKP ISQ
Subjt:  KKLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQ

Query:  FRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETTEKGERDGDIKPTINEKIPEALKNLEL
        FRYPRQVPDKTSKA+VISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+SCFRF ECIETTEKGERDGDIKPTI EKIPEALKNLEL
Subjt:  FRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETTEKGERDGDIKPTINEKIPEALKNLEL

XP_023004117.1 uncharacterized protein LOC111497544 [Cucurbita maxima]2.3e-18889.66Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRK GVKPLKKLEKP QEAESKDKRVPLSPPQCV TVPSVLRQQDRHQAIL LSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVGVESVVAVADTVGCLEPKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFHDEEWGVPVHDD
        NSRASSARGTRQRGPNLRRK S STVK+AEKA+EKVG ESVVAVA+TVGCLEPKKRCAWVT N                   PCYAAFHDEEWGVPVHDD
Subjt:  NSRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVGVESVVAVADTVGCLEPKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFHDEEWGVPVHDD

Query:  KKLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQ
        KKLFELLCLSGALAELTWP+IL KRHLFRETFLDFDPNAVSKLNEKKMVAPGSAAT+LLSE KVRAIIENGRQM KVIDEFGSFNVY+WNFVNHKP ISQ
Subjt:  KKLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQ

Query:  FRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETTEKGERDGDIKPTINEKIPEALKNLEL
        FRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+SCFRFPECIETTEKGERDGDIKP+I EKIPEALKNLEL
Subjt:  FRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETTEKGERDGDIKPTINEKIPEALKNLEL

XP_023514420.1 uncharacterized protein LOC111778684 [Cucurbita pepo subsp. pepo]3.2e-19090.44Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRK GVKPLKKLEKP QEAESKDKRVPLSPPQCV TVPSVLRQQDRHQAIL LSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVGVESVVAVADTVGCLEPKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFHDEEWGVPVHDD
        NSRASSARGTRQRGPNLRRK S S+VKRAEKAVEKVG ESVVAVA+TVGCLEPKKRCAWVT N                   PCYAAFHDEEWGVPVHDD
Subjt:  NSRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVGVESVVAVADTVGCLEPKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFHDEEWGVPVHDD

Query:  KKLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQ
        KKLFELLCLSGALAELTWP+IL KRHLFRETFLDFDPNAVSKLNEKKMVAPGSAAT+LLSE KVRAIIENGRQMCKVIDEFGSFNVY+WNFVNHKP ISQ
Subjt:  KKLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQ

Query:  FRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETTEKGERDGDIKPTINEKIPEALKNLEL
        FRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+SCFRFPECIETTEKGERDGDIKPTI EKIPEALKNLEL
Subjt:  FRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETTEKGERDGDIKPTINEKIPEALKNLEL

TrEMBL top hitse value%identityAlignment
A0A1S3CE52 probable GMP synthase [glutamine-hydrolyzing]2.1e-18788.66Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQE ESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVGVESVVAVADTVGCLEPKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFHDEEWGVPVHDDK
        SRASSARGTRQRGPNLRRKQ  STVK A+KAVEKVGVESV  VADTVGCLE KKRCAWVTPN                   PCYAAFHDEEWGVPVHDDK
Subjt:  SRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVGVESVVAVADTVGCLEPKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFHDEEWGVPVHDDK

Query:  KLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQF
        KLFELLCLSGALAELTWP+ILNKRHLFRE FLDFDP  VSKLNEKKMVAPGSAAT+LLSELK+RAIIENGRQMCKVIDEFGSFNVY+WNFVNHKPIISQF
Subjt:  KLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQF

Query:  RYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIE--TTEKGERDGDIKPTINEKIPEALKNLEL
        RYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRF ECIE  T EKGERDG++K   NEK+PEALKNLEL
Subjt:  RYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIE--TTEKGERDGDIKPTINEKIPEALKNLEL

A0A5A7UYZ9 Putative GMP synthase2.1e-18788.66Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQE ESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVGVESVVAVADTVGCLEPKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFHDEEWGVPVHDDK
        SRASSARGTRQRGPNLRRKQ  STVK A+KAVEKVGVESV  VADTVGCLE KKRCAWVTPN                   PCYAAFHDEEWGVPVHDDK
Subjt:  SRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVGVESVVAVADTVGCLEPKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFHDEEWGVPVHDDK

Query:  KLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQF
        KLFELLCLSGALAELTWP+ILNKRHLFRE FLDFDP  VSKLNEKKMVAPGSAAT+LLSELK+RAIIENGRQMCKVIDEFGSFNVY+WNFVNHKPIISQF
Subjt:  KLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQF

Query:  RYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIE--TTEKGERDGDIKPTINEKIPEALKNLEL
        RYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRF ECIE  T EKGERDG++K   NEK+PEALKNLEL
Subjt:  RYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIE--TTEKGERDGDIKPTINEKIPEALKNLEL

A0A6J1DNQ3 uncharacterized protein LOC1110223415.0e-18989.64Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VE RKPG KPLKKLEKP QEAESKDKRVPLSPPQCV+VPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVGVESVVAVADTVGCLEPKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFHDEEWGVPVHDDK
        SRASSARGTRQRGPNLRRKQ  STVKRAEKAVEKVGVESVV V DTV  LEPKKRCAWVTPN                   PCYAAFHDEEWGVPVHDDK
Subjt:  SRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVGVESVVAVADTVGCLEPKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFHDEEWGVPVHDDK

Query:  KLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQF
        KLFELLCLSGALAELTWP+ILNKRHLFRE FLDFDPNAVSKLNEKKMVA GSAAT+LLSELKVRAIIENGRQMCKVIDEFGSF+VYIWNFVNHKPIISQF
Subjt:  KLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQF

Query:  RYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETTEKGERDGDIKPTINEKIPEALKNLEL
        RYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIET E+GE+DG+IKP INEKIPEALKNLEL
Subjt:  RYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETTEKGERDGDIKPTINEKIPEALKNLEL

A0A6J1H7A2 uncharacterized protein LOC1114610818.6e-18989.66Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRK GVKPLKKLEKP QEAESKDKRVPLSPPQCV TVPSVLRQQDRHQAIL LSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVGVESVVAVADTVGCLEPKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFHDEEWGVPVHDD
        NSRASSARGTRQRGPNLRRK S STVKRAEKAVEKVG ESVVA  +TVGCLEPKKRCAWVT N                   PCYAAFHDEEWGVPVHDD
Subjt:  NSRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVGVESVVAVADTVGCLEPKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFHDEEWGVPVHDD

Query:  KKLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQ
        KKLFELLCLSGALAELTWP+IL KRHLFRETFLDFDPNAVSKLNEKKMVAPGSAAT+LLSE KVRAIIENGRQMCKVIDEFGSFNVY+WNFVNHKP ISQ
Subjt:  KKLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQ

Query:  FRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETTEKGERDGDIKPTINEKIPEALKNLEL
        FRYPRQVPDKTSKA+VISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+SCFRF ECIETTEKGERDGDIKPTI EKIPEALKNLEL
Subjt:  FRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETTEKGERDGDIKPTINEKIPEALKNLEL

A0A6J1KPI7 uncharacterized protein LOC1114975441.1e-18889.66Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRK GVKPLKKLEKP QEAESKDKRVPLSPPQCV TVPSVLRQQDRHQAIL LSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVGVESVVAVADTVGCLEPKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFHDEEWGVPVHDD
        NSRASSARGTRQRGPNLRRK S STVK+AEKA+EKVG ESVVAVA+TVGCLEPKKRCAWVT N                   PCYAAFHDEEWGVPVHDD
Subjt:  NSRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVGVESVVAVADTVGCLEPKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFHDEEWGVPVHDD

Query:  KKLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQ
        KKLFELLCLSGALAELTWP+IL KRHLFRETFLDFDPNAVSKLNEKKMVAPGSAAT+LLSE KVRAIIENGRQM KVIDEFGSFNVY+WNFVNHKP ISQ
Subjt:  KKLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQ

Query:  FRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETTEKGERDGDIKPTINEKIPEALKNLEL
        FRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+SCFRFPECIETTEKGERDGDIKP+I EKIPEALKNLEL
Subjt:  FRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETTEKGERDGDIKPTINEKIPEALKNLEL

SwissProt top hitse value%identityAlignment
P05100 DNA-3-methyladenine glycosylase 15.6e-3641.04Show/hide
Query:  PCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFG
        P Y A+HD EWGVP  D KKLFE++CL G  A L+W ++L KR  +R  F  FDP  V+ + E+ +      A  +    K++AII N R   ++     
Subjt:  PCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFG

Query:  SFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFP
         F  ++W+FVNH+P ++Q     ++P  TS ++ +SK L KRGF+ VG T+ Y+FMQ  GL NDH++ C  +P
Subjt:  SFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFP

P44321 DNA-3-methyladenine glycosylase6.4e-3240.12Show/hide
Query:  YAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSF
        Y  +HD+EWG P  D +KLFE +CL G  A L+W ++L KR  +RE F  FDP  ++K+    + A    +  +    K+ AI++N +    +     +F
Subjt:  YAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSF

Query:  NVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISC
        + +IW+FVNHKPI++     R VP KT  ++ +SK L KRGF  +G T  Y FMQ  GL +DHL  C
Subjt:  NVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing]3.5e-3843Show/hide
Query:  KKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGS
        K RCAW T +  E  R               Y  +HD EWG P+H+DKKLFE L L G  A L+W +IL KR  FR  F DFDP+ V+  +E K+     
Subjt:  KKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGS

Query:  AATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFR
            + +  K+ A I N +    V  EFGSF+ YIW FV  KPII+ F     +P  T  ++ I+KDL KRGF+ VG T +Y  MQ  G+ NDHL SCF+
Subjt:  AATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFR

Arabidopsis top hitse value%identityAlignment
AT1G15970.1 DNA glycosylase superfamily protein6.5e-8851.28Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQE---AESKDKR-----VPLSP----PQCVTV-PSVLRQQDRHQAILNLSMN
        MS PPR RS+N  + + R VLGPTGNK +    RKP   P  KLEKP  E    +SKD++      P SP     QC ++  S+LR+        + SM 
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQE---AESKDKR-----VPLSP----PQCVTV-PSVLRQQDRHQAILNLSMN

Query:  ASCSSDASSDSFNSRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVGVESVVAVADTVGCLEPKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAF
        AS SSDASS   +S  S A  +  +   +RR  S S+ ++       VG E      D     + +KRCAW+TP A                  PCY AF
Subjt:  ASCSSDASSDSFNSRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVGVESVVAVADTVGCLEPKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAF

Query:  HDEEWGVPVHDDKKLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYI
        HDEEWGVPVHDDKKLFELLCLSGALAEL+W  IL++RH+ RE F+DFDP AV++LN+KK+ APG+AA +LLSE+K+R+I++N R + K+I E GS   Y+
Subjt:  HDEEWGVPVHDDKKLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYI

Query:  WNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPEC---IETT------EKGERDGD
        WNFVN+KP  SQFRY RQVP KTSKAE ISKDLV+RGFRSV PTVIY+FMQ AGLTNDHLI CFR+ +C    ETT      +K ER+ D
Subjt:  WNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPEC---IETT------EKGERDGD

AT1G75090.1 DNA glycosylase superfamily protein5.5e-6340.64Show/hide
Query:  PLKKLEKPRQEAESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVG--V
        P+K +++ R    S   R  ++  +    P +  +  +  A      N S S+D SS S +S   S+  T   G          T       VEK+   V
Subjt:  PLKKLEKPRQEAESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVG--V

Query:  ESVVAVADTVGCLE-PKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPSILNKRHLFRETFLDFDP
         SV  V D    +  P KRC W+TPN+                  P Y  FHDEEWGVPV DDKKLFELL  S ALAE +WPSIL +R  FR+ F +FDP
Subjt:  ESVVAVADTVGCLE-PKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPSILNKRHLFRETFLDFDP

Query:  NAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTF
        +A+++  EK++++       +LSE K+RAI+EN + + KV  EFGSF+ Y W FVNHKP+ + +RY RQVP K+ KAE ISKD+++RGFR VGPTV+Y+F
Subjt:  NAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTF

Query:  MQVAGLTNDHLISCFRFPECIETTEKGERDGDIKPTINEKIP
        +Q +G+ NDHL +CFR+ EC   TE+  +  + +  ++   P
Subjt:  MQVAGLTNDHLISCFRFPECIETTEKGERDGDIKPTINEKIP

AT1G80850.1 DNA glycosylase superfamily protein9.4e-8751.52Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MS PPR+RS++ +D + R VLGP GNK +     KP  KP+ +  K     E   +  PLSPP       +LR+         +SM AS SSDASS S  
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVGVESVVAVADTVGCLEPKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFHDEEWGVPVHDDK
        S   S   T      LRR  S S+     + + +   E     A    C + +KRCAW+TP + +                 CY AFHDEEWGVPVHDDK
Subjt:  SRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVGVESVVAVADTVGCLEPKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFHDEEWGVPVHDDK

Query:  KLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQF
        +LFELL LSGALAEL+W  IL+KR LFRE F+DFDP A+S+L  KK+ +P  AAT LLSE K+R+I+EN  Q+CK+I  FGSF+ YIWNFVN KP  SQF
Subjt:  KLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQF

Query:  RYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETTEKG
        RYPRQVP KTSKAE+ISKDLV+RGFRSV PTVIY+FMQ AGLTNDHL  CFR  +C+   E G
Subjt:  RYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETTEKG

AT5G57970.1 DNA glycosylase superfamily protein1.9e-9552.67Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPP----------QCVTVPSVLRQQDRHQAIL--NLSMNA
        MSG PR++SMNVA++++R  LG T  KA    T K   K L+KLE+        D++   + P            +   S+LR   RH+  L  NLS+NA
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPP----------QCVTVPSVLRQQDRHQAIL--NLSMNA

Query:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVGVESVVAVADTVGCLEPKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFH
        S SSDAS DSF+SRAS+ R  R      R K   S  +          V S  A+       E KKRC WVTPN+                  PCY  FH
Subjt:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVGVESVVAVADTVGCLEPKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFH

Query:  DEEWGVPVHDDKKLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIW
        DEEWGVPVHDDK+LFELL LSGALAE TWP+IL+KR  FRE F DFDPNA+ K+NEKK++ PGS A+ LLS+LK+RA+IEN RQ+ KVI+E+GSF+ YIW
Subjt:  DEEWGVPVHDDKKLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIW

Query:  NFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETTEK
        +FV +K I+S+FRY RQVP KT KAEVISKDLV+RGFRSVGPTV+Y+FMQ AG+TNDHL SCFRF  CI   E+
Subjt:  NFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETTEK

AT5G57970.2 DNA glycosylase superfamily protein1.9e-9552.67Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPP----------QCVTVPSVLRQQDRHQAIL--NLSMNA
        MSG PR++SMNVA++++R  LG T  KA    T K   K L+KLE+        D++   + P            +   S+LR   RH+  L  NLS+NA
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPP----------QCVTVPSVLRQQDRHQAIL--NLSMNA

Query:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVGVESVVAVADTVGCLEPKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFH
        S SSDAS DSF+SRAS+ R  R      R K   S  +          V S  A+       E KKRC WVTPN+                  PCY  FH
Subjt:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSGSTVKRAEKAVEKVGVESVVAVADTVGCLEPKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFH

Query:  DEEWGVPVHDDKKLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIW
        DEEWGVPVHDDK+LFELL LSGALAE TWP+IL+KR  FRE F DFDPNA+ K+NEKK++ PGS A+ LLS+LK+RA+IEN RQ+ KVI+E+GSF+ YIW
Subjt:  DEEWGVPVHDDKKLFELLCLSGALAELTWPSILNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIW

Query:  NFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETTEK
        +FV +K I+S+FRY RQVP KT KAEVISKDLV+RGFRSVGPTV+Y+FMQ AG+TNDHL SCFRF  CI   E+
Subjt:  NFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETTEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAGGCCCTCCGAGAATCCGGTCTATGAATGTGGCGGATTCCGATTCCCGACCGGTACTTGGGCCTACTGGGAACAAAGCACGAACTGTTGAGACTAGAAAACCTGG
TGTGAAGCCATTGAAGAAGCTTGAAAAGCCTCGCCAAGAAGCTGAATCAAAGGACAAGAGGGTCCCATTGTCGCCGCCTCAATGTGTTACAGTGCCGTCGGTTTTGAGGC
AACAGGACCGCCACCAGGCGATTCTCAATCTGTCGATGAATGCCTCGTGTTCTTCTGATGCGTCGTCTGATTCGTTTAATAGTAGGGCATCTAGCGCAAGAGGTACGAGG
CAGCGCGGTCCGAATTTGAGGAGAAAGCAAAGTGGTAGTACGGTAAAGAGGGCTGAAAAGGCCGTTGAAAAGGTTGGTGTTGAAAGTGTGGTGGCGGTGGCGGATACAGT
TGGTTGCTTAGAGCCCAAAAAACGATGTGCTTGGGTTACGCCTAATGCAGGAGAAATGATTAGGATTGACGGCCCTGATGTAGTTGCCACTTTTCATGACTATCCATGTT
ATGCTGCTTTTCATGATGAAGAATGGGGAGTACCAGTTCACGATGACAAAAAATTGTTCGAACTGCTCTGCCTATCTGGTGCTTTGGCTGAACTTACGTGGCCTTCCATC
CTCAACAAAAGGCATCTATTTAGGGAAACCTTCTTGGACTTCGACCCAAATGCTGTTTCAAAATTAAACGAGAAAAAGATGGTTGCACCTGGAAGTGCTGCTACCGCTTT
ACTGTCAGAACTTAAGGTGCGAGCTATCATTGAAAATGGTCGTCAAATGTGCAAGGTAATTGATGAATTTGGTTCCTTCAATGTGTACATTTGGAACTTTGTCAACCACA
AACCTATCATCAGTCAGTTCCGGTACCCACGCCAGGTCCCTGATAAGACGTCAAAAGCAGAAGTGATTAGCAAGGATCTCGTTAAGAGAGGGTTTCGAAGCGTGGGACCA
ACAGTCATCTACACATTCATGCAGGTGGCAGGGTTAACTAACGACCATCTCATCAGTTGCTTTAGATTCCCAGAATGTATAGAGACAACAGAGAAAGGAGAAAGAGATGG
TGACATCAAGCCTACTATTAACGAGAAAATACCAGAGGCTCTGAAAAACTTGGAACTATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCAGGCCCTCCGAGAATCCGGTCTATGAATGTGGCGGATTCCGATTCCCGACCGGTACTTGGGCCTACTGGGAACAAAGCACGAACTGTTGAGACTAGAAAACCTGG
TGTGAAGCCATTGAAGAAGCTTGAAAAGCCTCGCCAAGAAGCTGAATCAAAGGACAAGAGGGTCCCATTGTCGCCGCCTCAATGTGTTACAGTGCCGTCGGTTTTGAGGC
AACAGGACCGCCACCAGGCGATTCTCAATCTGTCGATGAATGCCTCGTGTTCTTCTGATGCGTCGTCTGATTCGTTTAATAGTAGGGCATCTAGCGCAAGAGGTACGAGG
CAGCGCGGTCCGAATTTGAGGAGAAAGCAAAGTGGTAGTACGGTAAAGAGGGCTGAAAAGGCCGTTGAAAAGGTTGGTGTTGAAAGTGTGGTGGCGGTGGCGGATACAGT
TGGTTGCTTAGAGCCCAAAAAACGATGTGCTTGGGTTACGCCTAATGCAGGAGAAATGATTAGGATTGACGGCCCTGATGTAGTTGCCACTTTTCATGACTATCCATGTT
ATGCTGCTTTTCATGATGAAGAATGGGGAGTACCAGTTCACGATGACAAAAAATTGTTCGAACTGCTCTGCCTATCTGGTGCTTTGGCTGAACTTACGTGGCCTTCCATC
CTCAACAAAAGGCATCTATTTAGGGAAACCTTCTTGGACTTCGACCCAAATGCTGTTTCAAAATTAAACGAGAAAAAGATGGTTGCACCTGGAAGTGCTGCTACCGCTTT
ACTGTCAGAACTTAAGGTGCGAGCTATCATTGAAAATGGTCGTCAAATGTGCAAGGTAATTGATGAATTTGGTTCCTTCAATGTGTACATTTGGAACTTTGTCAACCACA
AACCTATCATCAGTCAGTTCCGGTACCCACGCCAGGTCCCTGATAAGACGTCAAAAGCAGAAGTGATTAGCAAGGATCTCGTTAAGAGAGGGTTTCGAAGCGTGGGACCA
ACAGTCATCTACACATTCATGCAGGTGGCAGGGTTAACTAACGACCATCTCATCAGTTGCTTTAGATTCCCAGAATGTATAGAGACAACAGAGAAAGGAGAAAGAGATGG
TGACATCAAGCCTACTATTAACGAGAAAATACCAGAGGCTCTGAAAAACTTGGAACTATAA
Protein sequenceShow/hide protein sequence
MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTR
QRGPNLRRKQSGSTVKRAEKAVEKVGVESVVAVADTVGCLEPKKRCAWVTPNAGEMIRIDGPDVVATFHDYPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPSI
LNKRHLFRETFLDFDPNAVSKLNEKKMVAPGSAATALLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGP
TVIYTFMQVAGLTNDHLISCFRFPECIETTEKGERDGDIKPTINEKIPEALKNLEL