CuGenDBv2

Tan0018591 (gene) of Snake gourd v1 genome

Gene ID: Tan0018591
Organism: Trichosanthes anguina (Snake gourd v1)
Description: DNA glycosylase superfamily protein
Genome location: LG06:3418463..3423073
RNA-Seq Expression: Tan0018591
Synteny: Tan0018591
Gene Ontology terms:
GO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domains:
IPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase
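The genome location above follows a sequence:start..end pattern. As a minimal sketch, the string can be split into its parts and the gene span computed. This assumes the coordinates are 1-based and inclusive, which the page does not state; the names Location and parse_location are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location such as 'LG06:3418463..3423073'."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, not stated by the page).
        return self.end - self.start + 1


# seqid, a colon, then start and end separated by two dots.
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse 'seqid:start..end' into a Location, rejecting malformed input."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"not a seqid:start..end location: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match["seqid"], start, end)


loc = parse_location("LG06:3418463..3423073")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the span covers the whole gene region on LG06, introns and untranslated regions included, so it is expected to be longer than the CDS listed further down.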


Homology
GenBank top hits (E-value, % identity, alignment)
KAG6593364.1 hypothetical protein SDJN03_12840, partial [Cucurbita argyrosperma subsp. sororia] | E-value: 5.9e-189 | Identity: 92.25%
Query:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVETRKPGVKPLKKLEKPREEAESKDKRVPLLSPPQCVTTVPSVLRQQDRHQAILNLSMNASCSSDASSDS
        MSGPPRIRSMNVADSDSRPVLGP GNKARTVETRK GVKPLKKLEKP +EAESKDKRVP LSPPQCVTTVPSVLRQQDRHQAIL LSMNASCSSDASSDS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVETRKPGVKPLKKLEKPREEAESKDKRVPLLSPPQCVTTVPSVLRQQDRHQAILNLSMNASCSSDASSDS

Query:  FNSRASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVESVVAVAVAVADTVGCLEPKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL
        FNSRASSARGTRQRGPNLRRKSS+TVKRAEKAVEKVG ESV    VA  +TVGCLEPKKRCAWVT N DPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL
Subjt:  FNSRASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVESVVAVAVAVADTVGCLEPKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL

Query:  AELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSK
        AELTWP IL KRHLFRE FLDFDPNAVSKLNEKKMVA GSAATSLLSE KVRAIIENGRQMCKVIDEFGSFNVY+WNFVNHKP ISQFRYPRQVPDKTSK
Subjt:  AELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSK

Query:  AEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFRFSECIETTEKGERDGEIKPTVNEKIPEALKNLEL
        AEVISKDLVKRGFRSVGPTVIYTFMQV+GLTNDHLVSCFRF ECIETTEKGERDG+IKPT+ EKIPEALKNLEL
Subjt:  AEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFRFSECIETTEKGERDGEIKPTVNEKIPEALKNLEL

XP_022960311.1 uncharacterized protein LOC111461081 [Cucurbita moschata] | E-value: 1.0e-188 | Identity: 91.98%
Query:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVETRKPGVKPLKKLEKPREEAESKDKRVPLLSPPQCVTTVPSVLRQQDRHQAILNLSMNASCSSDASSDS
        MSGPPRIRSMNVADSDSRPVLGP GNKARTVETRK GVKPLKKLEKP +EAESKDKRVP LSPPQCVTTVPSVLRQQDRHQAIL LSMNASCSSDASSDS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVETRKPGVKPLKKLEKPREEAESKDKRVPLLSPPQCVTTVPSVLRQQDRHQAILNLSMNASCSSDASSDS

Query:  FNSRASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVESVVAVAVAVADTVGCLEPKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL
        FNSRASSARGTRQRGPNLRRKSS+TVKRAEKAVEKVG ESV    VA  +TVGCLEPKKRCAWVT N DPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL
Subjt:  FNSRASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVESVVAVAVAVADTVGCLEPKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL

Query:  AELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSK
        AELTWP IL KRHLFRE FLDFDPNAVSKLNEKKMVA GSAATSLLSE KVRAIIENGRQMCKVIDEFGSFNVY+WNFVNHKP ISQFRYPRQVPDKTSK
Subjt:  AELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSK

Query:  AEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFRFSECIETTEKGERDGEIKPTVNEKIPEALKNLEL
        A+VISKDLVKRGFRSVGPTVIYTFMQV+GLTNDHLVSCFRF ECIETTEKGERDG+IKPT+ EKIPEALKNLEL
Subjt:  AEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFRFSECIETTEKGERDGEIKPTVNEKIPEALKNLEL

XP_023004117.1 uncharacterized protein LOC111497544 [Cucurbita maxima] | E-value: 1.5e-187 | Identity: 91.71%
Query:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVETRKPGVKPLKKLEKPREEAESKDKRVPLLSPPQCVTTVPSVLRQQDRHQAILNLSMNASCSSDASSDS
        MSGPPRIRSMNVADSDSRPVLGP GNKARTVETRK GVKPLKKLEKP +EAESKDKRVP LSPPQCVTTVPSVLRQQDRHQAIL LSMNASCSSDASSDS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVETRKPGVKPLKKLEKPREEAESKDKRVPLLSPPQCVTTVPSVLRQQDRHQAILNLSMNASCSSDASSDS

Query:  FNSRASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVESVVAVAVAVADTVGCLEPKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL
        FNSRASSARGTRQRGPNLRRKSS+TVK+AEKA+EKVG ESV    VAVA+TVGCLEPKKRCAWVT N DPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL
Subjt:  FNSRASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVESVVAVAVAVADTVGCLEPKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL

Query:  AELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSK
        AELTWP IL KRHLFRE FLDFDPNAVSKLNEKKMVA GSAATSLLSE KVRAIIENGRQM KVIDEFGSFNVY+WNFVNHKP ISQFRYPRQVPDKTSK
Subjt:  AELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSK

Query:  AEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFRFSECIETTEKGERDGEIKPTVNEKIPEALKNLEL
        AEVISKDLVKRGFRSVGPTVIYTFMQV+GLTNDHLVSCFRF ECIETTEKGERDG+IKP++ EKIPEALKNLEL
Subjt:  AEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFRFSECIETTEKGERDGEIKPTVNEKIPEALKNLEL

XP_023514420.1 uncharacterized protein LOC111778684 [Cucurbita pepo subsp. pepo] | E-value: 2.0e-189 | Identity: 92.51%
Query:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVETRKPGVKPLKKLEKPREEAESKDKRVPLLSPPQCVTTVPSVLRQQDRHQAILNLSMNASCSSDASSDS
        MSGPPRIRSMNVADSDSRPVLGP GNKARTVETRK GVKPLKKLEKP +EAESKDKRVP LSPPQCVTTVPSVLRQQDRHQAIL LSMNASCSSDASSDS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVETRKPGVKPLKKLEKPREEAESKDKRVPLLSPPQCVTTVPSVLRQQDRHQAILNLSMNASCSSDASSDS

Query:  FNSRASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVESVVAVAVAVADTVGCLEPKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL
        FNSRASSARGTRQRGPNLRRKSS++VKRAEKAVEKVG ESV    VAVA+TVGCLEPKKRCAWVT N DPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL
Subjt:  FNSRASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVESVVAVAVAVADTVGCLEPKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL

Query:  AELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSK
        AELTWP IL KRHLFRE FLDFDPNAVSKLNEKKMVA GSAATSLLSE KVRAIIENGRQMCKVIDEFGSFNVY+WNFVNHKP ISQFRYPRQVPDKTSK
Subjt:  AELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSK

Query:  AEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFRFSECIETTEKGERDGEIKPTVNEKIPEALKNLEL
        AEVISKDLVKRGFRSVGPTVIYTFMQV+GLTNDHLVSCFRF ECIETTEKGERDG+IKPT+ EKIPEALKNLEL
Subjt:  AEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFRFSECIETTEKGERDGEIKPTVNEKIPEALKNLEL

XP_038900164.1 probable GMP synthase [glutamine-hydrolyzing] [Benincasa hispida] | E-value: 1.2e-186 | Identity: 91.76%
Query:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVETRKPGVKPLKKLEKPREEAESKDKRVPLLSPPQCVTTVPSVLRQQDRHQAILNLSMNASCSSDASSDS
        MSGPPRIRSMNVADSDSRPVLGP GNKAR VETRKPGVKPLKKLEKPR+E ESKDKRVP LSPPQCV TVPSVLRQQDRHQAILNLSMNASCSSDASSDS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVETRKPGVKPLKKLEKPREEAESKDKRVPLLSPPQCVTTVPSVLRQQDRHQAILNLSMNASCSSDASSDS

Query:  FNSRASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVESVVAVAVAVADTVGCLEPKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL
        FNSRASSARGTRQRGPNLRRK ++TVK A K+VEKVGVES   VAV VADTVGCLE KKRCAWVTPN DPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL
Subjt:  FNSRASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVESVVAVAVAVADTVGCLEPKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL

Query:  AELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSK
        AELTWPAILNKR+LFREIFLDFDPN VSKLNEKKMVA GSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVY+WNFVNHKPIISQFRYPRQVPDKTSK
Subjt:  AELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSK

Query:  AEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFRFSECIE--TTEKGERDGEIKPTVNEKIPEALKNLEL
        AEVISKDLVKRGFRSVGPTVIYTFMQV+GLTNDHL+SCFRF ECIE  T EKGE+DGEIK TVNEK+PEALKNLEL
Subjt:  AEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFRFSECIE--TTEKGERDGEIKPTVNEKIPEALKNLEL

TrEMBL top hits (E-value, % identity, alignment)
A0A1S3CE52 probable GMP synthase [glutamine-hydrolyzing] | E-value: 6.6e-186 | Identity: 90.96%
Query:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVETRKPGVKPLKKLEKPREEAESKDKRVPLLSPPQCVTTVPSVLRQQDRHQAILNLSMNASCSSDASSDS
        MSGPPRIRSMNVADSDSRPVLGP GNKARTVETRKPGVKPLKKLEKPR+E ESKDKRVP LSPPQCV TVPSVLRQQDRHQAILNLSMNASCSSDASSDS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVETRKPGVKPLKKLEKPREEAESKDKRVPLLSPPQCVTTVPSVLRQQDRHQAILNLSMNASCSSDASSDS

Query:  FNSRASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVESVVAVAVAVADTVGCLEPKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL
        FNSRASSARGTRQRGPNLRRK  +TVK A+KAVEKVGVESV      VADTVGCLE KKRCAWVTPN DPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL
Subjt:  FNSRASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVESVVAVAVAVADTVGCLEPKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL

Query:  AELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSK
        AELTWPAILNKRHLFREIFLDFDP  VSKLNEKKMVA GSAATSLLSELK+RAIIENGRQMCKVIDEFGSFNVY+WNFVNHKPIISQFRYPRQVPDKTSK
Subjt:  AELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSK

Query:  AEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFRFSECIE--TTEKGERDGEIKPTVNEKIPEALKNLEL
        AEVISKDLVKRGFRSVGPTVIYTFMQV+GLTNDHL+SCFRF+ECIE  T EKGERDGE+K   NEK+PEALKNLEL
Subjt:  AEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFRFSECIE--TTEKGERDGEIKPTVNEKIPEALKNLEL

A0A5A7UYZ9 Putative GMP synthase | E-value: 6.6e-186 | Identity: 90.96%
Query:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVETRKPGVKPLKKLEKPREEAESKDKRVPLLSPPQCVTTVPSVLRQQDRHQAILNLSMNASCSSDASSDS
        MSGPPRIRSMNVADSDSRPVLGP GNKARTVETRKPGVKPLKKLEKPR+E ESKDKRVP LSPPQCV TVPSVLRQQDRHQAILNLSMNASCSSDASSDS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVETRKPGVKPLKKLEKPREEAESKDKRVPLLSPPQCVTTVPSVLRQQDRHQAILNLSMNASCSSDASSDS

Query:  FNSRASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVESVVAVAVAVADTVGCLEPKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL
        FNSRASSARGTRQRGPNLRRK  +TVK A+KAVEKVGVESV      VADTVGCLE KKRCAWVTPN DPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL
Subjt:  FNSRASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVESVVAVAVAVADTVGCLEPKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL

Query:  AELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSK
        AELTWPAILNKRHLFREIFLDFDP  VSKLNEKKMVA GSAATSLLSELK+RAIIENGRQMCKVIDEFGSFNVY+WNFVNHKPIISQFRYPRQVPDKTSK
Subjt:  AELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSK

Query:  AEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFRFSECIE--TTEKGERDGEIKPTVNEKIPEALKNLEL
        AEVISKDLVKRGFRSVGPTVIYTFMQV+GLTNDHL+SCFRF+ECIE  T EKGERDGE+K   NEK+PEALKNLEL
Subjt:  AEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFRFSECIE--TTEKGERDGEIKPTVNEKIPEALKNLEL

A0A6J1DNQ3 uncharacterized protein LOC111022341 | E-value: 2.3e-186 | Identity: 91.98%
Query:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVETRKPGVKPLKKLEKPREEAESKDKRVPLLSPPQCVTTVPSVLRQQDRHQAILNLSMNASCSSDASSDS
        MSGPPRIRSMNVADSDSRPVLGP GNKAR VE RKPG KPLKKLEKP +EAESKDKRVP LSPPQCV +VPSVLRQQDRHQAILNLSMNASCSSDASSDS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVETRKPGVKPLKKLEKPREEAESKDKRVPLLSPPQCVTTVPSVLRQQDRHQAILNLSMNASCSSDASSDS

Query:  FNSRASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVESVVAVAVAVADTVGCLEPKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL
        FNSRASSARGTRQRGPNLRRK S TVKRAEKAVEKVGVESV    V V DTV  LEPKKRCAWVTPN DPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL
Subjt:  FNSRASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVESVVAVAVAVADTVGCLEPKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL

Query:  AELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSK
        AELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVA+GSAATSLLSELKVRAIIENGRQMCKVIDEFGSF+VYIWNFVNHKPIISQFRYPRQVPDKTSK
Subjt:  AELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSK

Query:  AEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFRFSECIETTEKGERDGEIKPTVNEKIPEALKNLEL
        AEVISKDLVKRGFRSVGPTVIYTFMQV+GLTNDHL+SCFRF ECIET E+GE+DGEIKP +NEKIPEALKNLEL
Subjt:  AEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFRFSECIETTEKGERDGEIKPTVNEKIPEALKNLEL

A0A6J1H7A2 uncharacterized protein LOC111461081 | E-value: 4.9e-189 | Identity: 91.98%
Query:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVETRKPGVKPLKKLEKPREEAESKDKRVPLLSPPQCVTTVPSVLRQQDRHQAILNLSMNASCSSDASSDS
        MSGPPRIRSMNVADSDSRPVLGP GNKARTVETRK GVKPLKKLEKP +EAESKDKRVP LSPPQCVTTVPSVLRQQDRHQAIL LSMNASCSSDASSDS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVETRKPGVKPLKKLEKPREEAESKDKRVPLLSPPQCVTTVPSVLRQQDRHQAILNLSMNASCSSDASSDS

Query:  FNSRASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVESVVAVAVAVADTVGCLEPKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL
        FNSRASSARGTRQRGPNLRRKSS+TVKRAEKAVEKVG ESV    VA  +TVGCLEPKKRCAWVT N DPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL
Subjt:  FNSRASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVESVVAVAVAVADTVGCLEPKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL

Query:  AELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSK
        AELTWP IL KRHLFRE FLDFDPNAVSKLNEKKMVA GSAATSLLSE KVRAIIENGRQMCKVIDEFGSFNVY+WNFVNHKP ISQFRYPRQVPDKTSK
Subjt:  AELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSK

Query:  AEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFRFSECIETTEKGERDGEIKPTVNEKIPEALKNLEL
        A+VISKDLVKRGFRSVGPTVIYTFMQV+GLTNDHLVSCFRF ECIETTEKGERDG+IKPT+ EKIPEALKNLEL
Subjt:  AEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFRFSECIETTEKGERDGEIKPTVNEKIPEALKNLEL

A0A6J1KPI7 uncharacterized protein LOC111497544 | E-value: 7.1e-188 | Identity: 91.71%
Query:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVETRKPGVKPLKKLEKPREEAESKDKRVPLLSPPQCVTTVPSVLRQQDRHQAILNLSMNASCSSDASSDS
        MSGPPRIRSMNVADSDSRPVLGP GNKARTVETRK GVKPLKKLEKP +EAESKDKRVP LSPPQCVTTVPSVLRQQDRHQAIL LSMNASCSSDASSDS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVETRKPGVKPLKKLEKPREEAESKDKRVPLLSPPQCVTTVPSVLRQQDRHQAILNLSMNASCSSDASSDS

Query:  FNSRASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVESVVAVAVAVADTVGCLEPKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL
        FNSRASSARGTRQRGPNLRRKSS+TVK+AEKA+EKVG ESV    VAVA+TVGCLEPKKRCAWVT N DPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL
Subjt:  FNSRASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVESVVAVAVAVADTVGCLEPKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKKLFELLCLSGAL

Query:  AELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSK
        AELTWP IL KRHLFRE FLDFDPNAVSKLNEKKMVA GSAATSLLSE KVRAIIENGRQM KVIDEFGSFNVY+WNFVNHKP ISQFRYPRQVPDKTSK
Subjt:  AELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSK

Query:  AEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFRFSECIETTEKGERDGEIKPTVNEKIPEALKNLEL
        AEVISKDLVKRGFRSVGPTVIYTFMQV+GLTNDHLVSCFRF ECIETTEKGERDG+IKP++ EKIPEALKNLEL
Subjt:  AEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFRFSECIETTEKGERDGEIKPTVNEKIPEALKNLEL

SwissProt top hits (E-value, % identity, alignment)
P05100 DNA-3-methyladenine glycosylase 1 | E-value: 1.3e-37 | Identity: 42.22%
Query:  KRCAWVTPNADPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENG
        +RC WV  + DP Y A+HD EWGVP  D KKLFE++CL G  A L+W  +L KR  +R  F  FDP  V+ + E+ +      A  +    K++AII N 
Subjt:  KRCAWVTPNADPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENG

Query:  RQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSC
        R   ++      F  ++W+FVNH+P ++Q     ++P  TS ++ +SK L KRGF+ VG T+ Y+FMQ  GL NDH+V C
Subjt:  RQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSC

P44321 DNA-3-methyladenine glycosylase | E-value: 7.4e-33 | Identity: 39.66%
Query:  RCAWVTPNADPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGR
        RC WV       Y  +HD+EWG P  D +KLFE +CL G  A L+W  +L KR  +RE F  FDP  ++K+    + A    +  +    K+ AI++N +
Subjt:  RCAWVTPNADPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGR

Query:  QMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSC
            +     +F+ +IW+FVNHKPI++     R VP KT  ++ +SK L KRGF  +G T  Y FMQ  GL +DHL  C
Subjt:  QMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing] | E-value: 3.1e-39 | Identity: 45.7%
Query:  KKRCAWVTPN---ADPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAI
        K RCAW T     A   Y  +HD EWG P+H+DKKLFE L L G  A L+W  IL KR  FR  F DFDP+ V+  +E K+         + +  K+ A 
Subjt:  KKRCAWVTPN---ADPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAI

Query:  IENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFR
        I N +    V  EFGSF+ YIW FV  KPII+ F     +P  T  ++ I+KDL KRGF+ VG T +Y  MQ  G+ NDHL SCF+
Subjt:  IENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFR

Arabidopsis top hits (E-value, % identity, alignment)
AT1G15970.1 DNA glycosylase superfamily protein | E-value: 1.3e-88 | Identity: 51.61%
Query:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVETRKPGVKPLKKLEKPREE---AESKDKRV--------PLLSPPQCVTTVPSVLRQQDRHQAILNLSMN
        MS PPR RS+N  + + R VLGP GNK +    RKP   P  KLEKP  E    +SKD++         P  +  QC +   S+LR+        + SM 
Subjt:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVETRKPGVKPLKKLEKPREE---AESKDKRV--------PLLSPPQCVTTVPSVLRQQDRHQAILNLSMN

Query:  ASCSSDASSDSFNSRASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVESVVAVAVAVADTVGCLEPKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKK
        AS SSDASS   +S  S A  +  +    R  S ++ ++       VG E         AD       +KRCAW+TP ADPCY AFHDEEWGVPVHDDKK
Subjt:  ASCSSDASSDSFNSRASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVESVVAVAVAVADTVGCLEPKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKK

Query:  LFELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFR
        LFELLCLSGALAEL+W  IL++RH+ RE+F+DFDP AV++LN+KK+ A G+AA SLLSE+K+R+I++N R + K+I E GS   Y+WNFVN+KP  SQFR
Subjt:  LFELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFR

Query:  YPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFRFSEC-----IETTEKGERDGE
        Y RQVP KTSKAE ISKDLV+RGFRSV PTVIY+FMQ +GLTNDHL+ CFR+ +C       TT K ++  E
Subjt:  YPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFRFSEC-----IETTEKGERDGE

AT1G75090.1 DNA glycosylase superfamily protein | E-value: 4.0e-66 | Identity: 44.21%
Query:  PLKKLEKPREEAESKDKRVPLLSPPQCVTTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVE
        P+K +++ R    S   R  +      +T  P +  +  +  A      N S S+D SS S +S   S+  T   G     K +T  KR    VEK  + 
Subjt:  PLKKLEKPREEAESKDKRVPLLSPPQCVTTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVE

Query:  SVVAVAVAVADTVGCLE-PKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAS
        +VVA    V D    +  P KRC W+TPN+DP Y  FHDEEWGVPV DDKKLFELL  S ALAE +WP+IL +R  FR++F +FDP+A+++  EK++++ 
Subjt:  SVVAVAVAVADTVGCLE-PKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAS

Query:  GSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSC
              +LSE K+RAI+EN + + KV  EFGSF+ Y W FVNHKP+ + +RY RQVP K+ KAE ISKD+++RGFR VGPTV+Y+F+Q SG+ NDHL +C
Subjt:  GSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSC

Query:  FRFSECIETTEKGERDGEIKPTVNEKIP
        FR+ EC   TE+  +  E +  ++   P
Subjt:  FRFSECIETTEKGERDGEIKPTVNEKIP

AT1G80850.1 DNA glycosylase superfamily protein | E-value: 7.4e-89 | Identity: 51.97%
Query:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVETRKPGVKPLKKLEKPREEAESKDKRVPLLSPPQCVTTVPSVLRQQDRHQAILNLSMNASCSSDASSDS
        MS PPR+RS++ +D + R VLGPAGNK +     KP  KP+ +  K     E           PQC    P +LR+         +SM AS SSDASS  
Subjt:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVETRKPGVKPLKKLEKPREEAESKDKRVPLLSPPQCVTTVPSVLRQQDRHQAILNLSMNASCSSDASSDS

Query:  FNS-----RASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVESVVAVAVAVADTVGCLEPKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKKLFELLC
         +S       SS +   +R  ++   SS      E+  EK             A    C + +KRCAW+TP +D CY AFHDEEWGVPVHDDK+LFELL 
Subjt:  FNS-----RASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVESVVAVAVAVADTVGCLEPKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKKLFELLC

Query:  LSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVP
        LSGALAEL+W  IL+KR LFRE+F+DFDP A+S+L  KK+ +   AAT+LLSE K+R+I+EN  Q+CK+I  FGSF+ YIWNFVN KP  SQFRYPRQVP
Subjt:  LSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVP

Query:  DKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFRFSECIETTEKG
         KTSKAE+ISKDLV+RGFRSV PTVIY+FMQ +GLTNDHL  CFR  +C+   E G
Subjt:  DKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFRFSECIETTEKG

AT5G57970.1 DNA glycosylase superfamily protein | E-value: 2.3e-98 | Identity: 55%
Query:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVETRKPGVKPLKKLEKPREEAESKDKRVPLLSPPQCVTT--------VPSVLRQQDRHQAIL--NLSMNA
        MSG PR++SMNVA++++R  LG    KA    T K   K L+KLE+        D++    +P + V++          S+LR   RH+  L  NLS+NA
Subjt:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVETRKPGVKPLKKLEKPREEAESKDKRVPLLSPPQCVTT--------VPSVLRQQDRHQAIL--NLSMNA

Query:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVESVVAVAVAVADTVGCLEPKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKKL
        S SSDAS DSF+SRAS+ R  R      R KS  +  R           SVV+     +   G  E KKRC WVTPN+DPCY  FHDEEWGVPVHDDK+L
Subjt:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVESVVAVAVAVADTVGCLEPKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKKL

Query:  FELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRY
        FELL LSGALAE TWP IL+KR  FRE+F DFDPNA+ K+NEKK++  GS A++LLS+LK+RA+IEN RQ+ KVI+E+GSF+ YIW+FV +K I+S+FRY
Subjt:  FELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRY

Query:  PRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFRFSECIETTEK
         RQVP KT KAEVISKDLV+RGFRSVGPTV+Y+FMQ +G+TNDHL SCFRF  CI   E+
Subjt:  PRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFRFSECIETTEK

AT5G57970.2 DNA glycosylase superfamily protein | E-value: 2.3e-98 | Identity: 55%
Query:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVETRKPGVKPLKKLEKPREEAESKDKRVPLLSPPQCVTT--------VPSVLRQQDRHQAIL--NLSMNA
        MSG PR++SMNVA++++R  LG    KA    T K   K L+KLE+        D++    +P + V++          S+LR   RH+  L  NLS+NA
Subjt:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVETRKPGVKPLKKLEKPREEAESKDKRVPLLSPPQCVTT--------VPSVLRQQDRHQAIL--NLSMNA

Query:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVESVVAVAVAVADTVGCLEPKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKKL
        S SSDAS DSF+SRAS+ R  R      R KS  +  R           SVV+     +   G  E KKRC WVTPN+DPCY  FHDEEWGVPVHDDK+L
Subjt:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKSSTTVKRAEKAVEKVGVESVVAVAVAVADTVGCLEPKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKKL

Query:  FELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRY
        FELL LSGALAE TWP IL+KR  FRE+F DFDPNA+ K+NEKK++  GS A++LLS+LK+RA+IEN RQ+ KVI+E+GSF+ YIW+FV +K I+S+FRY
Subjt:  FELLCLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRY

Query:  PRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFRFSECIETTEK
         RQVP KT KAEVISKDLV+RGFRSVGPTV+Y+FMQ +G+TNDHL SCFRF  CI   E+
Subjt:  PRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVSGLTNDHLVSCFRFSECIETTEK
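
In the alignment blocks above, the unlabeled middle line follows the usual BLAST pairwise convention: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, the function below tallies those marks for one block. The name alignment_stats is hypothetical, and the strings in the usage line are a toy illustration rather than data from this page. The reported % identity values are computed over whole alignments, so tallies from a single block will generally not reproduce them.

```python
def alignment_stats(query: str, midline: str) -> dict:
    """Tally identities and positives from one BLAST-style alignment block.

    `query` is the aligned query segment; `midline` is the match line under it.
    Trailing spaces are often lost when such pages are copied, so the midline
    is padded back to the query length before counting.
    """
    length = len(query)
    padded = midline.ljust(length)[:length]
    identical = sum(ch.isalpha() for ch in padded)   # letters = identical residues
    positive = identical + padded.count("+")         # '+' = conservative substitution
    return {
        "length": length,
        "identical": identical,
        "positive": positive,
        "pct_identity": round(100.0 * identical / length, 2) if length else 0.0,
    }


# Toy example (not taken from the hits above).
print(alignment_stats("MSGPPRIR", "MSG PR+R"))
```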


Sequences
CDS sequence
ATGTCAGGCCCTCCGAGAATCCGGTCTATGAATGTGGCGGATTCCGATTCCCGACCGGTACTTGGGCCTGCCGGGAACAAAGCACGAACTGTAGAGACTAGGAAACCTGG
TGTGAAGCCATTGAAGAAGCTTGAAAAGCCTCGCGAAGAAGCTGAATCAAAGGACAAGAGGGTGCCATTGTTGTCGCCGCCTCAATGTGTTACTACAGTGCCATCGGTTT
TGAGGCAACAGGACCGCCACCAGGCGATTCTCAATCTGTCGATGAATGCCTCGTGTTCTTCTGATGCGTCGTCTGATTCGTTTAATAGTCGGGCATCTAGTGCTAGAGGT
ACGAGGCAGCGCGGTCCGAATTTGAGGAGAAAGTCAAGTACTACGGTAAAGAGGGCTGAAAAGGCCGTTGAAAAGGTTGGTGTTGAAAGTGTGGTGGCAGTAGCGGTGGC
GGTGGCAGATACAGTTGGTTGCTTAGAGCCCAAAAAACGATGTGCTTGGGTAACACCTAATGCAGATCCGTGTTATGCTGCTTTTCATGATGAAGAATGGGGAGTACCAG
TTCACGATGACAAAAAATTGTTTGAACTGCTTTGTCTATCGGGTGCTTTGGCTGAACTTACATGGCCTGCCATTCTCAACAAAAGACATCTATTTAGGGAAATCTTTCTG
GACTTCGACCCAAATGCCGTTTCAAAATTAAACGAGAAAAAGATGGTTGCTTCTGGAAGTGCTGCTACCTCTTTACTGTCAGAACTTAAGGTGCGAGCTATCATTGAAAA
TGGTCGTCAAATGTGCAAGGTAATTGATGAATTTGGTTCCTTCAACGTGTACATTTGGAACTTTGTTAACCACAAACCTATCATCAGTCAATTCCGATACCCCCGACAGG
TTCCTGATAAGACGTCAAAAGCAGAAGTGATTAGCAAGGATCTCGTAAAGAGAGGATTTCGAAGCGTGGGACCAACAGTCATCTATACATTCATGCAGGTGTCTGGGTTA
ACTAATGACCATCTTGTCAGCTGCTTTAGATTCTCAGAATGTATAGAGACAACAGAGAAAGGAGAAAGAGATGGTGAAATCAAGCCTACTGTTAACGAGAAAATACCAGA
GGCTCTGAAAAACTTGGAACTATAA
mRNA sequence
CAAAAATTTTGCTTTCACTTTCCCTTATTTTCTTTCTCTCTCTAAATTCTCTCTCTCTTTGTTTCTCTTTTGATTTTCATACTCACAAACACAATCATGGCCGTTGCGAA
TTTTTCATCATCCACTTCCAGATAAGTCTTTCGCTCTCTGAGCTATTTTTTGTTCATCAATGTGGCTCCATTTCTCGGCCCCACCTAGGGTTCTGCCTCTTCCTCTCACA
GTTCACTCCCAAACCCTACTGTTCAACTTCGAACCCCCTCTTTCCCCCTTGTTGCCTCGAATGAAGTAGTGGGGTTCTCTTGTTTTTTTTTTTTTTTTTTTTTTTTGTTT
TTGTTTCTTTAATCTGAATCCTGGTTGATTTTGGCACCACCCCAATTTGTGGCGATATCTGTGTGTTTCTGTTAGTGCATTTTTTGAATTTGAGAAAAGAATTTTCACTG
AAATGTCAGGCCCTCCGAGAATCCGGTCTATGAATGTGGCGGATTCCGATTCCCGACCGGTACTTGGGCCTGCCGGGAACAAAGCACGAACTGTAGAGACTAGGAAACCT
GGTGTGAAGCCATTGAAGAAGCTTGAAAAGCCTCGCGAAGAAGCTGAATCAAAGGACAAGAGGGTGCCATTGTTGTCGCCGCCTCAATGTGTTACTACAGTGCCATCGGT
TTTGAGGCAACAGGACCGCCACCAGGCGATTCTCAATCTGTCGATGAATGCCTCGTGTTCTTCTGATGCGTCGTCTGATTCGTTTAATAGTCGGGCATCTAGTGCTAGAG
GTACGAGGCAGCGCGGTCCGAATTTGAGGAGAAAGTCAAGTACTACGGTAAAGAGGGCTGAAAAGGCCGTTGAAAAGGTTGGTGTTGAAAGTGTGGTGGCAGTAGCGGTG
GCGGTGGCAGATACAGTTGGTTGCTTAGAGCCCAAAAAACGATGTGCTTGGGTAACACCTAATGCAGATCCGTGTTATGCTGCTTTTCATGATGAAGAATGGGGAGTACC
AGTTCACGATGACAAAAAATTGTTTGAACTGCTTTGTCTATCGGGTGCTTTGGCTGAACTTACATGGCCTGCCATTCTCAACAAAAGACATCTATTTAGGGAAATCTTTC
TGGACTTCGACCCAAATGCCGTTTCAAAATTAAACGAGAAAAAGATGGTTGCTTCTGGAAGTGCTGCTACCTCTTTACTGTCAGAACTTAAGGTGCGAGCTATCATTGAA
AATGGTCGTCAAATGTGCAAGGTAATTGATGAATTTGGTTCCTTCAACGTGTACATTTGGAACTTTGTTAACCACAAACCTATCATCAGTCAATTCCGATACCCCCGACA
GGTTCCTGATAAGACGTCAAAAGCAGAAGTGATTAGCAAGGATCTCGTAAAGAGAGGATTTCGAAGCGTGGGACCAACAGTCATCTATACATTCATGCAGGTGTCTGGGT
TAACTAATGACCATCTTGTCAGCTGCTTTAGATTCTCAGAATGTATAGAGACAACAGAGAAAGGAGAAAGAGATGGTGAAATCAAGCCTACTGTTAACGAGAAAATACCA
GAGGCTCTGAAAAACTTGGAACTATAAAAAAAGAAGCCATGGTCGCATTGCCTTGAACCTTGCCTCAGTGTAATTAACTTCAAGAGTTTTTTTCTTCTTCTTTTTTTGTA
ATGGCTTGTAAATTCCATGATGGGATCTCTGCCACTTCCTTTGATGGGGTAAATTTTAGCAATGTTTTTGTGTATAAACTGACTTGGATAGAGAAGACAGCT
Protein sequence
MSGPPRIRSMNVADSDSRPVLGPAGNKARTVETRKPGVKPLKKLEKPREEAESKDKRVPLLSPPQCVTTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARG
TRQRGPNLRRKSSTTVKRAEKAVEKVGVESVVAVAVAVADTVGCLEPKKRCAWVTPNADPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFL
DFDPNAVSKLNEKKMVASGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVSGL
TNDHLVSCFRFSECIETTEKGERDGEIKPTVNEKIPEALKNLEL
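
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The sketch below builds the standard codon table from its compact 64-letter form and translates until the first stop codon; translate is a hypothetical helper name, and the input is assumed to be an uppercase, in-frame DNA string with line breaks removed. Only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino
    for codon, amino in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and other whitespace
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGTCAGGCCCTCCGAGAATCCGG"))
# Last three codons of the CDS, ending in the TAA stop.
print(translate("GAACTATAA"))
```

Passing the full CDS, with its lines joined, should give a string that can be compared directly against the protein sequence listed above.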