; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0005397 (gene) of Chayote v1 genome

Gene IDSed0005397
OrganismSechium edule (Chayote v1)
DescriptionDNA glycosylase superfamily protein
Genome locationLG07:5610675..5614836
RNA-Seq ExpressionSed0005397
SyntenySed0005397
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domainsIPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593364.1 hypothetical protein SDJN03_12840, partial [Cucurbita argyrosperma subsp. sororia]2.9e-18088.11Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVESRKPGVKPLKKLEKPLLEAESKDKRVPASPPPQCVTTMPSVLRQQDRHQAILNMSMNASCSSDASTDS
        MSGPPRIRSMNVADSDSRPVLGP GNKARTVE+RK GVKPLKKLEKP  EAESKDKRVP S PPQCVTT+PSVLRQQDRHQAIL +SMNASCSSDAS+DS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVESRKPGVKPLKKLEKPLLEAESKDKRVPASPPPQCVTTMPSVLRQQDRHQAILNMSMNASCSSDASTDS

Query:  FNSRASSARATRQQGPSLRRRSSNTMKRAEKAVEKVGVESVVAVVDTVGCLDTKKRCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELLSLSGALAELT
        FNSRASSAR TRQ+GP+LRR+SS+T+KRAEKAVEKVG ESVVA  +TVGCL+ KKRCAWVT+N DPCYAAFHDEEWG+PV DDKKLFELL LSGALAELT
Subjt:  FNSRASSARATRQQGPSLRRRSSNTMKRAEKAVEKVGVESVVAVVDTVGCLDTKKRCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELLSLSGALAELT

Query:  WPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI
        WP IL KRHLFRE FLDFDPNAVSKLNEKKMVAPGSAATSLLSE KVRAIIENGRQMCKV+DEFGSFNVY+WNFVNHKP ISQFRYPRQVPDKTSKAEVI
Subjt:  WPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI

Query:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAVKGESDAEIKPIIDEKIPEDLKNLEL
        SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+SCFRFPECIET  KGE D +IKP I EKIPE LKNLEL
Subjt:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAVKGESDAEIKPIIDEKIPEDLKNLEL

XP_022155202.1 uncharacterized protein LOC111022341 [Momordica charantia]2.1e-17889.46Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVESRKPGVKPLKKLEKPLLEAESKDKRVPASPPPQCVTTMPSVLRQQDRHQAILNMSMNASCSSDASTDS
        MSGPPRIRSMNVADSDSRPVLGP GNKAR VE RKPG KPLKKLEKP  EAESKDKRVP S PPQCV ++PSVLRQQDRHQAILN+SMNASCSSDAS+DS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVESRKPGVKPLKKLEKPLLEAESKDKRVPASPPPQCVTTMPSVLRQQDRHQAILNMSMNASCSSDASTDS

Query:  FNSRASSARATRQQGPSLRRRSSNTMKRAEKAVEKVGVESVVAVVDTVGCLDTKKRCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELLSLSGALAELT
        FNSRASSAR TRQ+GP+LRR+ S T+KRAEKAVEKVGVESVV VVDTV  L+ KKRCAWVT N DPCYAAFHDEEWG+PV DDKKLFELL LSGALAELT
Subjt:  FNSRASSARATRQQGPSLRRRSSNTMKRAEKAVEKVGVESVVAVVDTVGCLDTKKRCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELLSLSGALAELT

Query:  WPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI
        WPAILNKRHLFREIFLDFDPNAVSKLNEKKMVA GSAATSLLSELKVRAIIENGRQMCKV+DEFGSF+VYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI
Subjt:  WPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI

Query:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAVKGESDAEIKPIIDEKIPEDLKNLEL
        SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETA +GE D EIKPII+EKIPE LKNLEL
Subjt:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAVKGESDAEIKPIIDEKIPEDLKNLEL

XP_022960311.1 uncharacterized protein LOC111461081 [Cucurbita moschata]5.5e-17987.57Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVESRKPGVKPLKKLEKPLLEAESKDKRVPASPPPQCVTTMPSVLRQQDRHQAILNMSMNASCSSDASTDS
        MSGPPRIRSMNVADSDSRPVLGP GNKARTVE+RK GVKPLKKLEKP  EAESKDKRVP S PPQCVTT+PSVLRQQDRHQAIL +SMNASCSSDAS+DS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVESRKPGVKPLKKLEKPLLEAESKDKRVPASPPPQCVTTMPSVLRQQDRHQAILNMSMNASCSSDASTDS

Query:  FNSRASSARATRQQGPSLRRRSSNTMKRAEKAVEKVGVESVVAVVDTVGCLDTKKRCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELLSLSGALAELT
        FNSRASSAR TRQ+GP+LRR+SS+T+KRAEKAVEKVG ESVVA  +TVGCL+ KKRCAWVT+N DPCYAAFHDEEWG+PV DDKKLFELL LSGALAELT
Subjt:  FNSRASSARATRQQGPSLRRRSSNTMKRAEKAVEKVGVESVVAVVDTVGCLDTKKRCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELLSLSGALAELT

Query:  WPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI
        WP IL KRHLFRE FLDFDPNAVSKLNEKKMVAPGSAATSLLSE KVRAIIENGRQMCKV+DEFGSFNVY+WNFVNHKP ISQFRYPRQVPDKTSKA+VI
Subjt:  WPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI

Query:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAVKGESDAEIKPIIDEKIPEDLKNLEL
        SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+SCFRF ECIET  KGE D +IKP I EKIPE LKNLEL
Subjt:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAVKGESDAEIKPIIDEKIPEDLKNLEL

XP_023004117.1 uncharacterized protein LOC111497544 [Cucurbita maxima]9.4e-17987.57Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVESRKPGVKPLKKLEKPLLEAESKDKRVPASPPPQCVTTMPSVLRQQDRHQAILNMSMNASCSSDASTDS
        MSGPPRIRSMNVADSDSRPVLGP GNKARTVE+RK GVKPLKKLEKP  EAESKDKRVP S PPQCVTT+PSVLRQQDRHQAIL +SMNASCSSDAS+DS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVESRKPGVKPLKKLEKPLLEAESKDKRVPASPPPQCVTTMPSVLRQQDRHQAILNMSMNASCSSDASTDS

Query:  FNSRASSARATRQQGPSLRRRSSNTMKRAEKAVEKVGVESVVAVVDTVGCLDTKKRCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELLSLSGALAELT
        FNSRASSAR TRQ+GP+LRR+SS+T+K+AEKA+EKVG ESVVAV +TVGCL+ KKRCAWVT+N DPCYAAFHDEEWG+PV DDKKLFELL LSGALAELT
Subjt:  FNSRASSARATRQQGPSLRRRSSNTMKRAEKAVEKVGVESVVAVVDTVGCLDTKKRCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELLSLSGALAELT

Query:  WPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI
        WP IL KRHLFRE FLDFDPNAVSKLNEKKMVAPGSAATSLLSE KVRAIIENGRQM KV+DEFGSFNVY+WNFVNHKP ISQFRYPRQVPDKTSKAEVI
Subjt:  WPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI

Query:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAVKGESDAEIKPIIDEKIPEDLKNLEL
        SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+SCFRFPECIET  KGE D +IKP I EKIPE LKNLEL
Subjt:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAVKGESDAEIKPIIDEKIPEDLKNLEL

XP_023514420.1 uncharacterized protein LOC111778684 [Cucurbita pepo subsp. pepo]2.9e-18088.11Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVESRKPGVKPLKKLEKPLLEAESKDKRVPASPPPQCVTTMPSVLRQQDRHQAILNMSMNASCSSDASTDS
        MSGPPRIRSMNVADSDSRPVLGP GNKARTVE+RK GVKPLKKLEKP  EAESKDKRVP S PPQCVTT+PSVLRQQDRHQAIL +SMNASCSSDAS+DS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVESRKPGVKPLKKLEKPLLEAESKDKRVPASPPPQCVTTMPSVLRQQDRHQAILNMSMNASCSSDASTDS

Query:  FNSRASSARATRQQGPSLRRRSSNTMKRAEKAVEKVGVESVVAVVDTVGCLDTKKRCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELLSLSGALAELT
        FNSRASSAR TRQ+GP+LRR+SS+++KRAEKAVEKVG ESVVAV +TVGCL+ KKRCAWVT+N DPCYAAFHDEEWG+PV DDKKLFELL LSGALAELT
Subjt:  FNSRASSARATRQQGPSLRRRSSNTMKRAEKAVEKVGVESVVAVVDTVGCLDTKKRCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELLSLSGALAELT

Query:  WPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI
        WP IL KRHLFRE FLDFDPNAVSKLNEKKMVAPGSAATSLLSE KVRAIIENGRQMCKV+DEFGSFNVY+WNFVNHKP ISQFRYPRQVPDKTSKAEVI
Subjt:  WPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI

Query:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAVKGESDAEIKPIIDEKIPEDLKNLEL
        SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+SCFRFPECIET  KGE D +IKP I EKIPE LKNLEL
Subjt:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAVKGESDAEIKPIIDEKIPEDLKNLEL

TrEMBL top hitse value%identityAlignment
A0A1S3CE52 probable GMP synthase [glutamine-hydrolyzing]6.1e-17686.83Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVESRKPGVKPLKKLEKPLLEAESKDKRVPASPPPQCVTTMPSVLRQQDRHQAILNMSMNASCSSDASTDS
        MSGPPRIRSMNVADSDSRPVLGP GNKARTVE+RKPGVKPLKKLEKP  E ESKDKRVP S PPQCV T+PSVLRQQDRHQAILN+SMNASCSSDAS+DS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVESRKPGVKPLKKLEKPLLEAESKDKRVPASPPPQCVTTMPSVLRQQDRHQAILNMSMNASCSSDASTDS

Query:  FNSRASSARATRQQGPSLRRRSSNTMKRAEKAVEKVGVESVVAVVDTVGCLDTKKRCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELLSLSGALAELT
        FNSRASSAR TRQ+GP+LRR+  +T+K A+KAVEKVGVESV  V DTVGCL++KKRCAWVT N DPCYAAFHDEEWG+PV DDKKLFELL LSGALAELT
Subjt:  FNSRASSARATRQQGPSLRRRSSNTMKRAEKAVEKVGVESVVAVVDTVGCLDTKKRCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELLSLSGALAELT

Query:  WPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI
        WPAILNKRHLFREIFLDFDP  VSKLNEKKMVAPGSAATSLLSELK+RAIIENGRQMCKV+DEFGSFNVY+WNFVNHKPIISQFRYPRQVPDKTSKAEVI
Subjt:  WPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI

Query:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIE--TAVKGESDAEIKPIIDEKIPEDLKNLEL
        SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRF ECIE  TA KGE D E+K   +EK+PE LKNLEL
Subjt:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIE--TAVKGESDAEIKPIIDEKIPEDLKNLEL

A0A5A7UYZ9 Putative GMP synthase6.1e-17686.83Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVESRKPGVKPLKKLEKPLLEAESKDKRVPASPPPQCVTTMPSVLRQQDRHQAILNMSMNASCSSDASTDS
        MSGPPRIRSMNVADSDSRPVLGP GNKARTVE+RKPGVKPLKKLEKP  E ESKDKRVP S PPQCV T+PSVLRQQDRHQAILN+SMNASCSSDAS+DS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVESRKPGVKPLKKLEKPLLEAESKDKRVPASPPPQCVTTMPSVLRQQDRHQAILNMSMNASCSSDASTDS

Query:  FNSRASSARATRQQGPSLRRRSSNTMKRAEKAVEKVGVESVVAVVDTVGCLDTKKRCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELLSLSGALAELT
        FNSRASSAR TRQ+GP+LRR+  +T+K A+KAVEKVGVESV  V DTVGCL++KKRCAWVT N DPCYAAFHDEEWG+PV DDKKLFELL LSGALAELT
Subjt:  FNSRASSARATRQQGPSLRRRSSNTMKRAEKAVEKVGVESVVAVVDTVGCLDTKKRCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELLSLSGALAELT

Query:  WPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI
        WPAILNKRHLFREIFLDFDP  VSKLNEKKMVAPGSAATSLLSELK+RAIIENGRQMCKV+DEFGSFNVY+WNFVNHKPIISQFRYPRQVPDKTSKAEVI
Subjt:  WPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI

Query:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIE--TAVKGESDAEIKPIIDEKIPEDLKNLEL
        SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRF ECIE  TA KGE D E+K   +EK+PE LKNLEL
Subjt:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIE--TAVKGESDAEIKPIIDEKIPEDLKNLEL

A0A6J1DNQ3 uncharacterized protein LOC1110223411.0e-17889.46Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVESRKPGVKPLKKLEKPLLEAESKDKRVPASPPPQCVTTMPSVLRQQDRHQAILNMSMNASCSSDASTDS
        MSGPPRIRSMNVADSDSRPVLGP GNKAR VE RKPG KPLKKLEKP  EAESKDKRVP S PPQCV ++PSVLRQQDRHQAILN+SMNASCSSDAS+DS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVESRKPGVKPLKKLEKPLLEAESKDKRVPASPPPQCVTTMPSVLRQQDRHQAILNMSMNASCSSDASTDS

Query:  FNSRASSARATRQQGPSLRRRSSNTMKRAEKAVEKVGVESVVAVVDTVGCLDTKKRCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELLSLSGALAELT
        FNSRASSAR TRQ+GP+LRR+ S T+KRAEKAVEKVGVESVV VVDTV  L+ KKRCAWVT N DPCYAAFHDEEWG+PV DDKKLFELL LSGALAELT
Subjt:  FNSRASSARATRQQGPSLRRRSSNTMKRAEKAVEKVGVESVVAVVDTVGCLDTKKRCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELLSLSGALAELT

Query:  WPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI
        WPAILNKRHLFREIFLDFDPNAVSKLNEKKMVA GSAATSLLSELKVRAIIENGRQMCKV+DEFGSF+VYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI
Subjt:  WPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI

Query:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAVKGESDAEIKPIIDEKIPEDLKNLEL
        SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETA +GE D EIKPII+EKIPE LKNLEL
Subjt:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAVKGESDAEIKPIIDEKIPEDLKNLEL

A0A6J1H7A2 uncharacterized protein LOC1114610812.7e-17987.57Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVESRKPGVKPLKKLEKPLLEAESKDKRVPASPPPQCVTTMPSVLRQQDRHQAILNMSMNASCSSDASTDS
        MSGPPRIRSMNVADSDSRPVLGP GNKARTVE+RK GVKPLKKLEKP  EAESKDKRVP S PPQCVTT+PSVLRQQDRHQAIL +SMNASCSSDAS+DS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVESRKPGVKPLKKLEKPLLEAESKDKRVPASPPPQCVTTMPSVLRQQDRHQAILNMSMNASCSSDASTDS

Query:  FNSRASSARATRQQGPSLRRRSSNTMKRAEKAVEKVGVESVVAVVDTVGCLDTKKRCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELLSLSGALAELT
        FNSRASSAR TRQ+GP+LRR+SS+T+KRAEKAVEKVG ESVVA  +TVGCL+ KKRCAWVT+N DPCYAAFHDEEWG+PV DDKKLFELL LSGALAELT
Subjt:  FNSRASSARATRQQGPSLRRRSSNTMKRAEKAVEKVGVESVVAVVDTVGCLDTKKRCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELLSLSGALAELT

Query:  WPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI
        WP IL KRHLFRE FLDFDPNAVSKLNEKKMVAPGSAATSLLSE KVRAIIENGRQMCKV+DEFGSFNVY+WNFVNHKP ISQFRYPRQVPDKTSKA+VI
Subjt:  WPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI

Query:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAVKGESDAEIKPIIDEKIPEDLKNLEL
        SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+SCFRF ECIET  KGE D +IKP I EKIPE LKNLEL
Subjt:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAVKGESDAEIKPIIDEKIPEDLKNLEL

A0A6J1KPI7 uncharacterized protein LOC1114975444.5e-17987.57Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVESRKPGVKPLKKLEKPLLEAESKDKRVPASPPPQCVTTMPSVLRQQDRHQAILNMSMNASCSSDASTDS
        MSGPPRIRSMNVADSDSRPVLGP GNKARTVE+RK GVKPLKKLEKP  EAESKDKRVP S PPQCVTT+PSVLRQQDRHQAIL +SMNASCSSDAS+DS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVESRKPGVKPLKKLEKPLLEAESKDKRVPASPPPQCVTTMPSVLRQQDRHQAILNMSMNASCSSDASTDS

Query:  FNSRASSARATRQQGPSLRRRSSNTMKRAEKAVEKVGVESVVAVVDTVGCLDTKKRCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELLSLSGALAELT
        FNSRASSAR TRQ+GP+LRR+SS+T+K+AEKA+EKVG ESVVAV +TVGCL+ KKRCAWVT+N DPCYAAFHDEEWG+PV DDKKLFELL LSGALAELT
Subjt:  FNSRASSARATRQQGPSLRRRSSNTMKRAEKAVEKVGVESVVAVVDTVGCLDTKKRCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELLSLSGALAELT

Query:  WPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI
        WP IL KRHLFRE FLDFDPNAVSKLNEKKMVAPGSAATSLLSE KVRAIIENGRQM KV+DEFGSFNVY+WNFVNHKP ISQFRYPRQVPDKTSKAEVI
Subjt:  WPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVI

Query:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAVKGESDAEIKPIIDEKIPEDLKNLEL
        SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+SCFRFPECIET  KGE D +IKP I EKIPE LKNLEL
Subjt:  SKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECIETAVKGESDAEIKPIIDEKIPEDLKNLEL

SwissProt top hitse value%identityAlignment
P05100 DNA-3-methyladenine glycosylase 14.9e-3740.22Show/hide
Query:  KRCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELLSLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENG
        +RC WV  + DP Y A+HD EWG+P  D KKLFE++ L G  A L+W  +L KR  +R  F  FDP  V+ + E+ +      A  +    K++AII N 
Subjt:  KRCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELLSLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENG

Query:  RQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFP
        R   ++      F  ++W+FVNH+P ++Q     ++P  TS ++ +SK L KRGF+ VG T+ Y+FMQ  GL NDH++ C  +P
Subjt:  RQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFP

P44321 DNA-3-methyladenine glycosylase1.4e-3139.11Show/hide
Query:  RCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELLSLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGR
        RC WV       Y  +HD+EWG P  D +KLFE + L G  A L+W  +L KR  +RE F  FDP  ++K+    + A    +  +    K+ AI++N +
Subjt:  RCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELLSLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGR

Query:  QMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISC
            +     +F+ +IW+FVNHKPI++     R VP KT  ++ +SK L KRGF  +G T  Y FMQ  GL +DHL  C
Subjt:  QMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing]2.0e-3845.16Show/hide
Query:  KKRCAWVTAN---ADPCYAAFHDEEWGLPVRDDKKLFELLSLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAI
        K RCAW T     A   Y  +HD EWG P+ +DKKLFE L L G  A L+W  IL KR  FR  F DFDP+ V+  +E K+         + +  K+ A 
Subjt:  KKRCAWVTAN---ADPCYAAFHDEEWGLPVRDDKKLFELLSLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAI

Query:  IENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFR
        I N +    V  EFGSF+ YIW FV  KPII+ F     +P  T  ++ I+KDL KRGF+ VG T +Y  MQ  G+ NDHL SCF+
Subjt:  IENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFR

Arabidopsis top hitse value%identityAlignment
AT1G15970.1 DNA glycosylase superfamily protein9.6e-8953.26Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVESRKPGVKPLKKLEKPLLE---AESKDKR-----VPASPP---PQCVTTMPSVLRQQDRHQAILNMSMN
        MS PPR RS+N  + + R VLGP GNK +    RKP   P  KLEKP++E    +SKD++      PASP     QC +   S+LR+        + SM 
Subjt:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVESRKPGVKPLKKLEKPLLE---AESKDKR-----VPASPP---PQCVTTMPSVLRQQDRHQAILNMSMN

Query:  ASCSSDASTDSFNSRASSARATRQQGPSLRRRSSNTMKRAE--KAVEKVGVESVVAVVDTVGCLDTKKRCAWVTANADPCYAAFHDEEWGLPVRDDKKLF
        AS SSDAS+   +S  S A ++  +    R  S ++ ++    K  EKV  +            D +KRCAW+T  ADPCY AFHDEEWG+PV DDKKLF
Subjt:  ASCSSDASTDSFNSRASSARATRQQGPSLRRRSSNTMKRAE--KAVEKVGVESVVAVVDTVGCLDTKKRCAWVTANADPCYAAFHDEEWGLPVRDDKKLF

Query:  ELLSLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYP
        ELL LSGALAEL+W  IL++RH+ RE+F+DFDP AV++LN+KK+ APG+AA SLLSE+K+R+I++N R + K++ E GS   Y+WNFVN+KP  SQFRY 
Subjt:  ELLSLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYP

Query:  RQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPEC
        RQVP KTSKAE ISKDLV+RGFRSV PTVIY+FMQ AGLTNDHLI CFR+ +C
Subjt:  RQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPEC

AT1G75090.1 DNA glycosylase superfamily protein3.7e-6446.64Show/hide
Query:  VTTMPSVLRQQDRHQAILNMSMNASCSSDASTDSFNSRASSARATRQQGPSLRRRSSNTMKRAEKAVEKVGVESVVAVVDTVGCLDTK-----KRCAWVT
        +T  P +  +  +  A      N S S+D S+ S +S   S+  T   G         T       VEK  + +VVA V  V  +  K     KRC W+T
Subjt:  VTTMPSVLRQQDRHQAILNMSMNASCSSDASTDSFNSRASSARATRQQGPSLRRRSSNTMKRAEKAVEKVGVESVVAVVDTVGCLDTK-----KRCAWVT

Query:  ANADPCYAAFHDEEWGLPVRDDKKLFELLSLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVM
         N+DP Y  FHDEEWG+PVRDDKKLFELL  S ALAE +WP+IL +R  FR++F +FDP+A+++  EK++++       +LSE K+RAI+EN + + KV 
Subjt:  ANADPCYAAFHDEEWGLPVRDDKKLFELLSLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVM

Query:  DEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPEC-IET
         EFGSF+ Y W FVNHKP+ + +RY RQVP K+ KAE ISKD+++RGFR VGPTV+Y+F+Q +G+ NDHL +CFR+ EC +ET
Subjt:  DEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPEC-IET

AT1G80850.1 DNA glycosylase superfamily protein1.9e-8952.99Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVESRKPGVKPLKKLEKPLLEAESKDKRVPASPPPQCVTTMPSVLRQQDRHQAILNMSMNASCSSDASTD-
        MS PPR+RS++ +D + R VLGPAGNK +     KP  KP+ +  K L   E           PQC    P +LR+         +SM AS SSDAS+  
Subjt:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVESRKPGVKPLKKLEKPLLEAESKDKRVPASPPPQCVTTMPSVLRQQDRHQAILNMSMNASCSSDASTD-

Query:  -----SFNSRASSARATRQQG---PSLRRRSSNTMKRAEKAVEKVGVESVVAVVDTVGCL-DTKKRCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELL
             S  S +S  R  R+ G    S   R + T +R EKA +               C  D +KRCAW+T  +D CY AFHDEEWG+PV DDK+LFELL
Subjt:  -----SFNSRASSARATRQQG---PSLRRRSSNTMKRAEKAVEKVGVESVVAVVDTVGCL-DTKKRCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELL

Query:  SLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQV
        SLSGALAEL+W  IL+KR LFRE+F+DFDP A+S+L  KK+ +P  AAT+LLSE K+R+I+EN  Q+CK++  FGSF+ YIWNFVN KP  SQFRYPRQV
Subjt:  SLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQV

Query:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECI
        P KTSKAE+ISKDLV+RGFRSV PTVIY+FMQ AGLTNDHL  CFR  +C+
Subjt:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECI

AT5G57970.1 DNA glycosylase superfamily protein9.6e-9754.42Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVESRKPGVKPLKKLEKPLLEAESKDKRVPASPPPQCVTT--------MPSVLRQQDRHQAIL--NMSMNA
        MSG PR++SMNVA++++R  LG    KA    + K   K L+KLE+        D++   + P + V++          S+LR   RH+  L  N+S+NA
Subjt:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVESRKPGVKPLKKLEKPLLEAESKDKRVPASPPPQCVTT--------MPSVLRQQDRHQAIL--NMSMNA

Query:  SCSSDASTDSFNSRASSARATRQQGPSLRRRSSNTMKRAEKAVEKVGVESVVAVVDTVGCLDTKKRCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELL
        S SSDAS DSF+SRAS+ R  R      R +S  +  R+        V S  A+       +TKKRC WVT N+DPCY  FHDEEWG+PV DDK+LFELL
Subjt:  SCSSDASTDSFNSRASSARATRQQGPSLRRRSSNTMKRAEKAVEKVGVESVVAVVDTVGCLDTKKRCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELL

Query:  SLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQV
         LSGALAE TWP IL+KR  FRE+F DFDPNA+ K+NEKK++ PGS A++LLS+LK+RA+IEN RQ+ KV++E+GSF+ YIW+FV +K I+S+FRY RQV
Subjt:  SLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQV

Query:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECI
        P KT KAEVISKDLV+RGFRSVGPTV+Y+FMQ AG+TNDHL SCFRF  CI
Subjt:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECI

AT5G57970.2 DNA glycosylase superfamily protein9.6e-9754.42Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVESRKPGVKPLKKLEKPLLEAESKDKRVPASPPPQCVTT--------MPSVLRQQDRHQAIL--NMSMNA
        MSG PR++SMNVA++++R  LG    KA    + K   K L+KLE+        D++   + P + V++          S+LR   RH+  L  N+S+NA
Subjt:  MSGPPRIRSMNVADSDSRPVLGPAGNKARTVESRKPGVKPLKKLEKPLLEAESKDKRVPASPPPQCVTT--------MPSVLRQQDRHQAIL--NMSMNA

Query:  SCSSDASTDSFNSRASSARATRQQGPSLRRRSSNTMKRAEKAVEKVGVESVVAVVDTVGCLDTKKRCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELL
        S SSDAS DSF+SRAS+ R  R      R +S  +  R+        V S  A+       +TKKRC WVT N+DPCY  FHDEEWG+PV DDK+LFELL
Subjt:  SCSSDASTDSFNSRASSARATRQQGPSLRRRSSNTMKRAEKAVEKVGVESVVAVVDTVGCLDTKKRCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELL

Query:  SLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQV
         LSGALAE TWP IL+KR  FRE+F DFDPNA+ K+NEKK++ PGS A++LLS+LK+RA+IEN RQ+ KV++E+GSF+ YIW+FV +K I+S+FRY RQV
Subjt:  SLSGALAELTWPAILNKRHLFREIFLDFDPNAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQV

Query:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECI
        P KT KAEVISKDLV+RGFRSVGPTV+Y+FMQ AG+TNDHL SCFRF  CI
Subjt:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFPECI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGGGCCCTCCTAGAATCCGGTCGATGAATGTGGCGGATTCTGATTCGCGACCGGTGCTTGGACCGGCTGGGAACAAAGCGAGAACTGTGGAGAGTAGAAAGCCTGG
TGTGAAGCCATTGAAGAAGCTTGAAAAGCCTCTCCTAGAAGCTGAGTCGAAGGACAAGAGGGTGCCGGCGTCGCCGCCGCCTCAGTGTGTTACTACAATGCCGTCGGTTT
TGAGGCAACAGGATCGCCACCAGGCGATTCTTAATATGTCTATGAATGCTTCGTGTTCTTCTGATGCGTCCACGGATTCGTTTAATAGTCGGGCTTCTAGCGCGAGAGCT
ACGAGGCAGCAGGGTCCGAGTTTGAGGAGAAGGTCGAGTAATACGATGAAGAGGGCTGAAAAGGCCGTTGAGAAAGTTGGTGTTGAAAGTGTGGTGGCGGTGGTGGATAC
AGTTGGTTGCTTGGATACCAAAAAACGATGTGCTTGGGTAACAGCTAATGCAGATCCATGTTATGCTGCTTTTCATGATGAAGAATGGGGATTACCAGTTCGTGATGACA
AAAAATTGTTTGAACTGCTTTCCCTATCGGGTGCTTTGGCTGAACTTACATGGCCTGCCATCCTCAACAAAAGACATCTATTTAGGGAAATCTTTCTGGACTTTGACCCA
AATGCCGTTTCAAAATTAAACGAGAAAAAAATGGTTGCTCCTGGAAGTGCTGCTACCTCTTTACTGTCAGAGCTTAAGGTGCGAGCTATCATTGAAAATGGTCGTCAAAT
GTGCAAGGTAATGGATGAATTTGGTTCCTTCAATGTCTATATTTGGAACTTCGTCAACCACAAACCCATCATCAGTCAGTTCCGGTATCCACGCCAGGTCCCCGACAAGA
CGTCGAAAGCAGAGGTGATTAGCAAGGATCTCGTAAAGAGAGGATTTCGAAGCGTTGGACCAACAGTCATCTACACATTCATGCAGGTGGCTGGCTTAACAAACGATCAT
CTCATCAGTTGCTTTAGATTCCCAGAATGTATAGAGACGGCAGTGAAAGGAGAAAGTGATGCTGAAATCAAGCCTATTATTGACGAGAAAATACCAGAGGATCTGAAAAA
TTTGGAACTATAA
mRNA sequenceShow/hide mRNA sequence
CTTTCTTCTAAACCCTCAAAAAAAAATTTCTTCCACTTTCCTGATTTTCTTTCTCTCTCTAAGTTCCAAAATTTTCTCTCTCTTGTTCCTCCATTGATTTTCACACTCAC
AAACCACAATCCATGGCCGTTGCAAATTCTTCATCATCCACTTCCAGATAAGAAGCGTCTTCACCTCTCTGAGCTCTCTTTTTTCTTCATCAATGGCGGCTCCATTTCTC
GGCCCCGCCTAGGGTTCTTCTCACTCCCATACCCTACTCTCCAACTTCAAATCCCTTTCCACATTGTTGCTTCCATTGATTTACTGAGGTTATCTTGTTTGAATCCTGGT
TGATTTTGCCAATTTGTGGCGATATCTGTTTTTTGAATTTGGGGAAAAAGGGTTTTACCTGAAATGTCGGGCCCTCCTAGAATCCGGTCGATGAATGTGGCGGATTCTGA
TTCGCGACCGGTGCTTGGACCGGCTGGGAACAAAGCGAGAACTGTGGAGAGTAGAAAGCCTGGTGTGAAGCCATTGAAGAAGCTTGAAAAGCCTCTCCTAGAAGCTGAGT
CGAAGGACAAGAGGGTGCCGGCGTCGCCGCCGCCTCAGTGTGTTACTACAATGCCGTCGGTTTTGAGGCAACAGGATCGCCACCAGGCGATTCTTAATATGTCTATGAAT
GCTTCGTGTTCTTCTGATGCGTCCACGGATTCGTTTAATAGTCGGGCTTCTAGCGCGAGAGCTACGAGGCAGCAGGGTCCGAGTTTGAGGAGAAGGTCGAGTAATACGAT
GAAGAGGGCTGAAAAGGCCGTTGAGAAAGTTGGTGTTGAAAGTGTGGTGGCGGTGGTGGATACAGTTGGTTGCTTGGATACCAAAAAACGATGTGCTTGGGTAACAGCTA
ATGCAGATCCATGTTATGCTGCTTTTCATGATGAAGAATGGGGATTACCAGTTCGTGATGACAAAAAATTGTTTGAACTGCTTTCCCTATCGGGTGCTTTGGCTGAACTT
ACATGGCCTGCCATCCTCAACAAAAGACATCTATTTAGGGAAATCTTTCTGGACTTTGACCCAAATGCCGTTTCAAAATTAAACGAGAAAAAAATGGTTGCTCCTGGAAG
TGCTGCTACCTCTTTACTGTCAGAGCTTAAGGTGCGAGCTATCATTGAAAATGGTCGTCAAATGTGCAAGGTAATGGATGAATTTGGTTCCTTCAATGTCTATATTTGGA
ACTTCGTCAACCACAAACCCATCATCAGTCAGTTCCGGTATCCACGCCAGGTCCCCGACAAGACGTCGAAAGCAGAGGTGATTAGCAAGGATCTCGTAAAGAGAGGATTT
CGAAGCGTTGGACCAACAGTCATCTACACATTCATGCAGGTGGCTGGCTTAACAAACGATCATCTCATCAGTTGCTTTAGATTCCCAGAATGTATAGAGACGGCAGTGAA
AGGAGAAAGTGATGCTGAAATCAAGCCTATTATTGACGAGAAAATACCAGAGGATCTGAAAAATTTGGAACTATAAAGTAGCCAGCCATGATAGCCCTGAACCTTGCCTC
AGTGTAATTAACTTCCAGAGTTATTTTCTTTTTCTTTTTTGTATTGGGTTGTAAATTCCATGATGGGATATCTGCCACTTCCTTTGATGGGGTAAATTTTAGCAATGTTT
TTGTGTATAAAACTGACTTGGATACAGAAGACAGCTAGAATCAGTTCTGTTAGTTTACTACTTCAAGCATGTGGATTAGTTATTACGTTGTTTAGTATTACATTTGTCGG
TAATAGCTCAATCGGTATTAGTATGTATCCATGACATTTGAATTTCTTTGCCTCGAATATTGTTGAATTAAAAAGAAAAATTAAATTTAG
Protein sequenceShow/hide protein sequence
MSGPPRIRSMNVADSDSRPVLGPAGNKARTVESRKPGVKPLKKLEKPLLEAESKDKRVPASPPPQCVTTMPSVLRQQDRHQAILNMSMNASCSSDASTDSFNSRASSARA
TRQQGPSLRRRSSNTMKRAEKAVEKVGVESVVAVVDTVGCLDTKKRCAWVTANADPCYAAFHDEEWGLPVRDDKKLFELLSLSGALAELTWPAILNKRHLFREIFLDFDP
NAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVMDEFGSFNVYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDH
LISCFRFPECIETAVKGESDAEIKPIIDEKIPEDLKNLEL