CuGenDBv2

Sed0018153 (gene) of Chayote v1 genome

Gene ID: Sed0018153
Organism: Sechium edule (Chayote v1)
Description: DNA glycosylase superfamily protein
Genome location: LG09:37906130..37911626
RNA-Seq Expression: Sed0018153
Synteny: Sed0018153
Gene Ontology terms:
    GO:0006284 - base-excision repair (biological process)
    GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domains:
    IPR005019 - Methyladenine glycosylase
    IPR011257 - DNA glycosylase
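
The genome location in this record follows a sequence:start..end pattern. As a minimal sketch, the Python below splits such a string and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the function name parse_location is hypothetical.

```python
import re


def parse_location(location):
    """Split a 'LG09:37906130..37911626' style string into (sequence, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seq_id, start, end = parse_location("LG09:37906130..37911626")
# Span length under the ASSUMED 1-based, inclusive coordinate convention.
span_length = end - start + 1
print(seq_id, start, end, span_length)
```

The span covers the whole gene on the linkage group, so it is expected to be longer than the mRNA sequence listed further down.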


Homology
GenBank top hits (hit | e value | % identity)
KAG6593364.1 hypothetical protein SDJN03_12840, partial [Cucurbita argyrosperma subsp. sororia] | 3.6e-186 | 90.35
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPAAAAQCVTTVPSVLRQQDRHQAILNLSMNASCSSDAS
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRK GVKPLKKLEKP QEAESKDKRVPLSPP    QCVTTVPSVLRQQDRHQAIL LSMNASCSSDAS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPAAAAQCVTTVPSVLRQQDRHQAILNLSMNASCSSDAS

Query:  SDSFNSRGSSARGTRPRGPNSRRKSSSAVKKAEKAVDKVGVESVVAVVDTVGCLELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA
        SDSFNSR SSARGTR RGPN RRKSSS VK+AEKAV+KVG ESVVA  +TVGCLE +KRC WVT NTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA
Subjt:  SDSFNSRGSSARGTRPRGPNSRRKSSSAVKKAEKAVDKVGVESVVAVVDTVGCLELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA

Query:  ELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQVPDKTSKA
        EL WP IL KRHLFRETFLDFDP+AVSKLNEKKMVA GSAAT+LLSEPKVRAIIENGRQMCKVIDEFGSFN+Y+WNFVNHKP ISQFRYPRQVPDKTSKA
Subjt:  ELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQVPDKTSKA

Query:  EVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTECIETTEKGERNGEIKPTINEKIPEALKNLEL
        EVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLV CFR+ ECIETTEKGER+G+IKPTI EKIPEALKNLEL
Subjt:  EVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTECIETTEKGERNGEIKPTINEKIPEALKNLEL

XP_004136097.2 uncharacterized protein LOC101205558 [Cucumis sativus] | 9.1e-182 | 89.36
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPAAAAQCVTTVPSVLRQQDRHQAILNLSMNASCSSDAS
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQE ESKDKRVPLSPP    QCV TVPSVLRQQDRHQAILNLSMNASCSSDAS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPAAAAQCVTTVPSVLRQQDRHQAILNLSMNASCSSDAS

Query:  SDSFNSRGSSARGTRPRGPNSRRKSSSAVKKAEKAVDKVGVESVVAVVDTVGCLELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA
        SDSFNSR SSARGTR RGPN RRK  S VK A+KAV+KVGVESV  VVDTVGCLE +KRC WVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA
Subjt:  SDSFNSRGSSARGTRPRGPNSRRKSSSAVKKAEKAVDKVGVESVVAVVDTVGCLELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA

Query:  ELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQVPDKTSKA
        EL WPAILNKRHLFRE FLDFDP+AVSKLNEKKMVA GSAAT+LLSE KVRAIIENGRQMCKVIDEFGSFN+Y+WNFVNHKPIISQFRYPRQVPDKTSKA
Subjt:  ELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQVPDKTSKA

Query:  EVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTECIE--TTEKGERN-GEIKPTINEKIPEALKNLEL
        EVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+GCFR+TECIE  T EKGER+ GE+K   NEK+PEALKNLEL
Subjt:  EVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTECIE--TTEKGERN-GEIKPTINEKIPEALKNLEL

XP_022960311.1 uncharacterized protein LOC111461081 [Cucurbita moschata] | 8.0e-186 | 90.08
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPAAAAQCVTTVPSVLRQQDRHQAILNLSMNASCSSDAS
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRK GVKPLKKLEKP QEAESKDKRVPLSPP    QCVTTVPSVLRQQDRHQAIL LSMNASCSSDAS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPAAAAQCVTTVPSVLRQQDRHQAILNLSMNASCSSDAS

Query:  SDSFNSRGSSARGTRPRGPNSRRKSSSAVKKAEKAVDKVGVESVVAVVDTVGCLELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA
        SDSFNSR SSARGTR RGPN RRKSSS VK+AEKAV+KVG ESVVA  +TVGCLE +KRC WVT NTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA
Subjt:  SDSFNSRGSSARGTRPRGPNSRRKSSSAVKKAEKAVDKVGVESVVAVVDTVGCLELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA

Query:  ELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQVPDKTSKA
        EL WP IL KRHLFRETFLDFDP+AVSKLNEKKMVA GSAAT+LLSEPKVRAIIENGRQMCKVIDEFGSFN+Y+WNFVNHKP ISQFRYPRQVPDKTSKA
Subjt:  ELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQVPDKTSKA

Query:  EVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTECIETTEKGERNGEIKPTINEKIPEALKNLEL
        +VISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLV CFR+ ECIETTEKGER+G+IKPTI EKIPEALKNLEL
Subjt:  EVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTECIETTEKGERNGEIKPTINEKIPEALKNLEL

XP_023004117.1 uncharacterized protein LOC111497544 [Cucurbita maxima] | 5.2e-185 | 90.08
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPAAAAQCVTTVPSVLRQQDRHQAILNLSMNASCSSDAS
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRK GVKPLKKLEKP QEAESKDKRVPLSPP    QCVTTVPSVLRQQDRHQAIL LSMNASCSSDAS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPAAAAQCVTTVPSVLRQQDRHQAILNLSMNASCSSDAS

Query:  SDSFNSRGSSARGTRPRGPNSRRKSSSAVKKAEKAVDKVGVESVVAVVDTVGCLELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA
        SDSFNSR SSARGTR RGPN RRKSSS VKKAEKA++KVG ESVVAV +TVGCLE +KRC WVT NTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA
Subjt:  SDSFNSRGSSARGTRPRGPNSRRKSSSAVKKAEKAVDKVGVESVVAVVDTVGCLELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA

Query:  ELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQVPDKTSKA
        EL WP IL KRHLFRETFLDFDP+AVSKLNEKKMVA GSAAT+LLSEPKVRAIIENGRQM KVIDEFGSFN+Y+WNFVNHKP ISQFRYPRQVPDKTSKA
Subjt:  ELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQVPDKTSKA

Query:  EVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTECIETTEKGERNGEIKPTINEKIPEALKNLEL
        EVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLV CFR+ ECIETTEKGER+G+IKP+I EKIPEALKNLEL
Subjt:  EVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTECIETTEKGERNGEIKPTINEKIPEALKNLEL

XP_023514420.1 uncharacterized protein LOC111778684 [Cucurbita pepo subsp. pepo] | 9.4e-187 | 90.62
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPAAAAQCVTTVPSVLRQQDRHQAILNLSMNASCSSDAS
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRK GVKPLKKLEKP QEAESKDKRVPLSPP    QCVTTVPSVLRQQDRHQAIL LSMNASCSSDAS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPAAAAQCVTTVPSVLRQQDRHQAILNLSMNASCSSDAS

Query:  SDSFNSRGSSARGTRPRGPNSRRKSSSAVKKAEKAVDKVGVESVVAVVDTVGCLELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA
        SDSFNSR SSARGTR RGPN RRKSSS+VK+AEKAV+KVG ESVVAV +TVGCLE +KRC WVT NTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA
Subjt:  SDSFNSRGSSARGTRPRGPNSRRKSSSAVKKAEKAVDKVGVESVVAVVDTVGCLELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA

Query:  ELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQVPDKTSKA
        EL WP IL KRHLFRETFLDFDP+AVSKLNEKKMVA GSAAT+LLSEPKVRAIIENGRQMCKVIDEFGSFN+Y+WNFVNHKP ISQFRYPRQVPDKTSKA
Subjt:  ELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQVPDKTSKA

Query:  EVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTECIETTEKGERNGEIKPTINEKIPEALKNLEL
        EVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLV CFR+ ECIETTEKGER+G+IKPTI EKIPEALKNLEL
Subjt:  EVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTECIETTEKGERNGEIKPTINEKIPEALKNLEL

TrEMBL top hits (hit | e value | % identity)
A0A0A0K8L6 Uncharacterized protein | 4.4e-182 | 89.36
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPAAAAQCVTTVPSVLRQQDRHQAILNLSMNASCSSDAS
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQE ESKDKRVPLSPP    QCV TVPSVLRQQDRHQAILNLSMNASCSSDAS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPAAAAQCVTTVPSVLRQQDRHQAILNLSMNASCSSDAS

Query:  SDSFNSRGSSARGTRPRGPNSRRKSSSAVKKAEKAVDKVGVESVVAVVDTVGCLELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA
        SDSFNSR SSARGTR RGPN RRK  S VK A+KAV+KVGVESV  VVDTVGCLE +KRC WVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA
Subjt:  SDSFNSRGSSARGTRPRGPNSRRKSSSAVKKAEKAVDKVGVESVVAVVDTVGCLELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA

Query:  ELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQVPDKTSKA
        EL WPAILNKRHLFRE FLDFDP+AVSKLNEKKMVA GSAAT+LLSE KVRAIIENGRQMCKVIDEFGSFN+Y+WNFVNHKPIISQFRYPRQVPDKTSKA
Subjt:  ELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQVPDKTSKA

Query:  EVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTECIE--TTEKGERN-GEIKPTINEKIPEALKNLEL
        EVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+GCFR+TECIE  T EKGER+ GE+K   NEK+PEALKNLEL
Subjt:  EVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTECIE--TTEKGERN-GEIKPTINEKIPEALKNLEL

A0A1S3CE52 probable GMP synthase [glutamine-hydrolyzing] | 9.8e-182 | 88.53
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPAAAAQCVTTVPSVLRQQDRHQAILNLSMNASCSSDAS
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQE ESKDKRVPLSPP    QCV TVPSVLRQQDRHQAILNLSMNASCSSDAS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPAAAAQCVTTVPSVLRQQDRHQAILNLSMNASCSSDAS

Query:  SDSFNSRGSSARGTRPRGPNSRRKSSSAVKKAEKAVDKVGVESVVAVVDTVGCLELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA
        SDSFNSR SSARGTR RGPN RRK  S VK A+KAV+KVGVESV  V DTVGCLE +KRC WVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA
Subjt:  SDSFNSRGSSARGTRPRGPNSRRKSSSAVKKAEKAVDKVGVESVVAVVDTVGCLELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA

Query:  ELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQVPDKTSKA
        EL WPAILNKRHLFRE FLDFDP+ VSKLNEKKMVA GSAAT+LLSE K+RAIIENGRQMCKVIDEFGSFN+Y+WNFVNHKPIISQFRYPRQVPDKTSKA
Subjt:  ELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQVPDKTSKA

Query:  EVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTECIE--TTEKGERNGEIKPTINEKIPEALKNLEL
        EVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+ CFR+TECIE  T EKGER+GE+K   NEK+PEALKNLEL
Subjt:  EVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTECIE--TTEKGERNGEIKPTINEKIPEALKNLEL

A0A5A7UYZ9 Putative GMP synthase | 9.8e-182 | 88.53
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPAAAAQCVTTVPSVLRQQDRHQAILNLSMNASCSSDAS
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQE ESKDKRVPLSPP    QCV TVPSVLRQQDRHQAILNLSMNASCSSDAS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPAAAAQCVTTVPSVLRQQDRHQAILNLSMNASCSSDAS

Query:  SDSFNSRGSSARGTRPRGPNSRRKSSSAVKKAEKAVDKVGVESVVAVVDTVGCLELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA
        SDSFNSR SSARGTR RGPN RRK  S VK A+KAV+KVGVESV  V DTVGCLE +KRC WVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA
Subjt:  SDSFNSRGSSARGTRPRGPNSRRKSSSAVKKAEKAVDKVGVESVVAVVDTVGCLELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA

Query:  ELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQVPDKTSKA
        EL WPAILNKRHLFRE FLDFDP+ VSKLNEKKMVA GSAAT+LLSE K+RAIIENGRQMCKVIDEFGSFN+Y+WNFVNHKPIISQFRYPRQVPDKTSKA
Subjt:  ELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQVPDKTSKA

Query:  EVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTECIE--TTEKGERNGEIKPTINEKIPEALKNLEL
        EVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHL+ CFR+TECIE  T EKGER+GE+K   NEK+PEALKNLEL
Subjt:  EVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTECIE--TTEKGERNGEIKPTINEKIPEALKNLEL

A0A6J1H7A2 uncharacterized protein LOC111461081 | 3.9e-186 | 90.08
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPAAAAQCVTTVPSVLRQQDRHQAILNLSMNASCSSDAS
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRK GVKPLKKLEKP QEAESKDKRVPLSPP    QCVTTVPSVLRQQDRHQAIL LSMNASCSSDAS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPAAAAQCVTTVPSVLRQQDRHQAILNLSMNASCSSDAS

Query:  SDSFNSRGSSARGTRPRGPNSRRKSSSAVKKAEKAVDKVGVESVVAVVDTVGCLELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA
        SDSFNSR SSARGTR RGPN RRKSSS VK+AEKAV+KVG ESVVA  +TVGCLE +KRC WVT NTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA
Subjt:  SDSFNSRGSSARGTRPRGPNSRRKSSSAVKKAEKAVDKVGVESVVAVVDTVGCLELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA

Query:  ELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQVPDKTSKA
        EL WP IL KRHLFRETFLDFDP+AVSKLNEKKMVA GSAAT+LLSEPKVRAIIENGRQMCKVIDEFGSFN+Y+WNFVNHKP ISQFRYPRQVPDKTSKA
Subjt:  ELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQVPDKTSKA

Query:  EVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTECIETTEKGERNGEIKPTINEKIPEALKNLEL
        +VISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLV CFR+ ECIETTEKGER+G+IKPTI EKIPEALKNLEL
Subjt:  EVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTECIETTEKGERNGEIKPTINEKIPEALKNLEL

A0A6J1KPI7 uncharacterized protein LOC111497544 | 2.5e-185 | 90.08
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPAAAAQCVTTVPSVLRQQDRHQAILNLSMNASCSSDAS
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRK GVKPLKKLEKP QEAESKDKRVPLSPP    QCVTTVPSVLRQQDRHQAIL LSMNASCSSDAS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPAAAAQCVTTVPSVLRQQDRHQAILNLSMNASCSSDAS

Query:  SDSFNSRGSSARGTRPRGPNSRRKSSSAVKKAEKAVDKVGVESVVAVVDTVGCLELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA
        SDSFNSR SSARGTR RGPN RRKSSS VKKAEKA++KVG ESVVAV +TVGCLE +KRC WVT NTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA
Subjt:  SDSFNSRGSSARGTRPRGPNSRRKSSSAVKKAEKAVDKVGVESVVAVVDTVGCLELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALA

Query:  ELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQVPDKTSKA
        EL WP IL KRHLFRETFLDFDP+AVSKLNEKKMVA GSAAT+LLSEPKVRAIIENGRQM KVIDEFGSFN+Y+WNFVNHKP ISQFRYPRQVPDKTSKA
Subjt:  ELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQVPDKTSKA

Query:  EVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTECIETTEKGERNGEIKPTINEKIPEALKNLEL
        EVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLV CFR+ ECIETTEKGER+G+IKP+I EKIPEALKNLEL
Subjt:  EVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTECIETTEKGERNGEIKPTINEKIPEALKNLEL

SwissProt top hits (hit | e value | % identity)
P05100 DNA-3-methyladenine glycosylase 1 | 6.2e-40 | 43.17
Query:  KRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENG
        +RCGWV  + DP Y A+HD EWGVP  D KKLFE++CL G  A L+W  +L KR  +R  F  FDP  V+ + E+ +      A  +    K++AII N 
Subjt:  KRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENG

Query:  RQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRY
        R   ++      F  ++W+FVNH+P ++Q     ++P  TS ++ +SK L KRGF+ VG T+ Y+FMQ  GL NDH+VGC  Y
Subjt:  RQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRY

P44321 DNA-3-methyladenine glycosylase | 5.6e-33 | 39.66
Query:  RCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGR
        RC WV       Y  +HD+EWG P  D +KLFE +CL G  A L+W  +L KR  +RE F  FDP  ++K+    + A    +  +    K+ AI++N +
Subjt:  RCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGR

Query:  QMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGC
            +     +F+ +IW+FVNHKPI++     R VP KT  ++ +SK L KRGF  +G T  Y FMQ  GL +DHL  C
Subjt:  QMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing] | 2.6e-38 | 44.02
Query:  RCGWVTPNTDPC---YAAFHDEEWGVPVHDDKKLFELLCLSGALAELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIE
        RC W T   +     Y  +HD EWG P+H+DKKLFE L L G  A L+W  IL KR  FR  F DFDP  V+  +E K+         + +  K+ A I 
Subjt:  RCGWVTPNTDPC---YAAFHDEEWGVPVHDDKKLFELLCLSGALAELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIE

Query:  NGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFR
        N +    V  EFGSF+ YIW FV  KPII+ F     +P  T  ++ I+KDL KRGF+ VG T +Y  MQ  G+ NDHL  CF+
Subjt:  NGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFR

Arabidopsis top hits (hit | e value | % identity)
AT1G15970.1 DNA glycosylase superfamily protein | 7.2e-92 | 53.53
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQE---AESKDKR-----VPLSPPAAAAQCVTTVPSVLRQQDRHQAILNLSMN
        MS PPR RS+N  + + R VLGPTGNK +    RKP   P  KLEKP  E    +SKD++      P SP     QC +   S+LR+        + SM 
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQE---AESKDKR-----VPLSPPAAAAQCVTTVPSVLRQQDRHQAILNLSMN

Query:  ASCSSDASSDSFNSRGSSARGTRPRGPNSRRKSSSAVKKAEKAVDKVGVESVVAVVDTVGCLELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFEL
        AS SSDASS   +S  S A  +  +    R  S S+ +K       VG E      D     + RKRC W+TP  DPCY AFHDEEWGVPVHDDKKLFEL
Subjt:  ASCSSDASSDSFNSRGSSARGTRPRGPNSRRKSSSAVKKAEKAVDKVGVESVVAVVDTVGCLELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFEL

Query:  LCLSGALAELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQ
        LCLSGALAEL+W  IL++RH+ RE F+DFDP AV++LN+KK+ A G+AA +LLSE K+R+I++N R + K+I E GS   Y+WNFVN+KP  SQFRY RQ
Subjt:  LCLSGALAELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQ

Query:  VPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTEC-----IETTEKGERNGE
        VP KTSKAE ISKDLV+RGFRSV PTVIY+FMQ AGLTNDHL+GCFRY +C       TT K ++  E
Subjt:  VPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTEC-----IETTEKGERNGE

AT1G75090.1 DNA glycosylase superfamily protein | 2.6e-65 | 40.81
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNK---ARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPAAAAQCVTTVPSVLRQQDRHQAILNLSMNASCSS
        MS   ++RS      +SR +L  TGN+    +T  T+KP + P                RV  SP                      A      N S S+
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNK---ARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPAAAAQCVTTVPSVLRQQDRHQAILNLSMNASCSS

Query:  DASSDSFNSRGSSARGTRPRGPNSRRKSSSAVKKAEKAVDKVGVESVVA--VVDTVGCLELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCL
        D SS S +S   S+  T   G  +     + V+K    V  V V   ++  +   V      KRC W+TPN+DP Y  FHDEEWGVPV DDKKLFELL  
Subjt:  DASSDSFNSRGSSARGTRPRGPNSRRKSSSAVKKAEKAVDKVGVESVVA--VVDTVGCLELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCL

Query:  SGALAELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQVPD
        S ALAE +WP+IL +R  FR+ F +FDPSA+++  EK++++       +LSE K+RAI+EN + + KV  EFGSF+ Y W FVNHKP+ + +RY RQVP 
Subjt:  SGALAELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQVPD

Query:  KTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTECIETTEKGERNGEIKPTINEKIP
        K+ KAE ISKD+++RGFR VGPTV+Y+F+Q +G+ NDHL  CFRY EC   TE+  ++ E +  ++   P
Subjt:  KTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTECIETTEKGERNGEIKPTINEKIP

AT1G80850.1 DNA glycosylase superfamily protein | 6.3e-88 | 51.94
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPAAAAQCVTTVPSVLRQQDRHQAILNLSMNASCSSDAS
        MS PPR+RS++ +D + R VLGP GNK +     KP  KP+ +  K     E   +  PLSPP            +LR+         +SM AS SSDAS
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPAAAAQCVTTVPSVLRQQDRHQAILNLSMNASCSSDAS

Query:  SD------SFNSRGSSARGTRPRG---PNSRRKSSSAVKKAEKAVDKVGVESVVAVVDTVGCL-ELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLF
        S       S  S  S  R  R  G    +S  + +   ++ EKA D               C  + RKRC W+TP +D CY AFHDEEWGVPVHDDK+LF
Subjt:  SD------SFNSRGSSARGTRPRG---PNSRRKSSSAVKKAEKAVDKVGVESVVAVVDTVGCL-ELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLF

Query:  ELLCLSGALAELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYP
        ELL LSGALAEL+W  IL+KR LFRE F+DFDP A+S+L  KK+ +   AAT LLSE K+R+I+EN  Q+CK+I  FGSF+ YIWNFVN KP  SQFRYP
Subjt:  ELLCLSGALAELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYP

Query:  RQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTECIETTEKG
        RQVP KTSKAE+ISKDLV+RGFRSV PTVIY+FMQ AGLTNDHL  CFR+ +C+   E G
Subjt:  RQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTECIETTEKG

AT5G57970.1 DNA glycosylase superfamily protein | 5.3e-95 | 53.65
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPP-----AAAAQCVTTVPSVLRQQDRHQAIL--NLSMNA
        MSG PR++SMNVA++++R  LG T  KA    T K   K L+KLE+        D++   + P     +++ +      S+LR   RH+  L  NLS+NA
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPP-----AAAAQCVTTVPSVLRQQDRHQAIL--NLSMNA

Query:  SCSSDASSDSFNSRGSSARGTRPRGPNSRRKSSSAVKKAEKAVDKVGVESVVAVVDTVGCLELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELL
        S SSDAS DSF+SR S+ R  R     SR KS  +  ++        V S  A+       E +KRC WVTPN+DPCY  FHDEEWGVPVHDDK+LFELL
Subjt:  SCSSDASSDSFNSRGSSARGTRPRGPNSRRKSSSAVKKAEKAVDKVGVESVVAVVDTVGCLELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELL

Query:  CLSGALAELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQV
         LSGALAE  WP IL+KR  FRE F DFDP+A+ K+NEKK++  GS A+ LLS+ K+RA+IEN RQ+ KVI+E+GSF+ YIW+FV +K I+S+FRY RQV
Subjt:  CLSGALAELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQV

Query:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTECIETTEK
        P KT KAEVISKDLV+RGFRSVGPTV+Y+FMQ AG+TNDHL  CFR+  CI   E+
Subjt:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTECIETTEK

AT5G57970.2 DNA glycosylase superfamily protein | 5.3e-95 | 53.65
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPP-----AAAAQCVTTVPSVLRQQDRHQAIL--NLSMNA
        MSG PR++SMNVA++++R  LG T  KA    T K   K L+KLE+        D++   + P     +++ +      S+LR   RH+  L  NLS+NA
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPP-----AAAAQCVTTVPSVLRQQDRHQAIL--NLSMNA

Query:  SCSSDASSDSFNSRGSSARGTRPRGPNSRRKSSSAVKKAEKAVDKVGVESVVAVVDTVGCLELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELL
        S SSDAS DSF+SR S+ R  R     SR KS  +  ++        V S  A+       E +KRC WVTPN+DPCY  FHDEEWGVPVHDDK+LFELL
Subjt:  SCSSDASSDSFNSRGSSARGTRPRGPNSRRKSSSAVKKAEKAVDKVGVESVVAVVDTVGCLELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELL

Query:  CLSGALAELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQV
         LSGALAE  WP IL+KR  FRE F DFDP+A+ K+NEKK++  GS A+ LLS+ K+RA+IEN RQ+ KVI+E+GSF+ YIW+FV +K I+S+FRY RQV
Subjt:  CLSGALAELAWPAILNKRHLFRETFLDFDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQV

Query:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTECIETTEK
        P KT KAEVISKDLV+RGFRSVGPTV+Y+FMQ AG+TNDHL  CFR+  CI   E+
Subjt:  PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLVGCFRYTECIETTEK


Sequences
CDS sequence
ATGTCAGGCCCTCCAAGAATCCGGTCGATGAATGTGGCGGATTCCGATTCCCGGCCGGTTCTTGGGCCGACCGGGAACAAAGCAAGAACTGTAGAGACTCGGAAACCTGG
CGTAAAGCCATTGAAGAAGCTTGAAAAACCTCGCCAAGAAGCTGAATCGAAGGACAAGAGGGTGCCATTGTCGCCCCCGGCCGCGGCGGCTCAATGTGTTACTACAGTGC
CGTCGGTTTTGAGGCAACAGGATCGCCATCAGGCGATTCTTAATCTCTCTATGAATGCTTCGTGTTCTTCGGATGCGTCGTCGGATTCGTTTAATAGTCGGGGTTCTAGT
GCTAGAGGTACGAGGCCGCGTGGGCCGAATTCGAGGCGAAAGTCGAGTAGTGCGGTGAAGAAGGCTGAGAAGGCTGTTGATAAGGTTGGTGTTGAAAGTGTGGTGGCTGT
TGTGGATACGGTTGGGTGCTTAGAGCTTAGAAAAAGATGTGGTTGGGTAACACCTAATACAGATCCATGTTATGCTGCCTTTCATGATGAAGAATGGGGAGTACCAGTTC
ACGATGACAAAAAACTGTTCGAACTGCTTTGCCTCTCGGGTGCTTTGGCGGAACTTGCGTGGCCTGCCATCCTCAACAAAAGACATCTATTTAGGGAAACCTTCTTGGAC
TTTGACCCAAGTGCTGTTTCAAAATTAAACGAGAAAAAGATGGTTGCATCTGGAAGTGCTGCTACTGCTTTACTGTCAGAACCTAAGGTGCGAGCTATCATTGAAAATGG
TCGTCAAATGTGCAAGGTAATTGATGAATTTGGTTCCTTCAACATGTACATTTGGAACTTTGTCAACCACAAACCCATCATCAGTCAGTTTCGGTATCCACGCCAGGTCC
CCGACAAGACGTCAAAAGCAGAAGTGATTAGCAAGGATCTCGTAAAGAGAGGATTTCGAAGCGTGGGACCGACAGTCATATATACGTTCATGCAGGTGGCCGGGTTAACT
AATGACCATCTCGTCGGTTGCTTTAGATACACAGAGTGTATAGAGACAACAGAGAAAGGAGAAAGAAATGGTGAGATCAAGCCTACAATTAATGAGAAAATACCAGAGGC
TCTGAAAAACTTGGAACTATAA
mRNA sequence
AAAAACCTCAAAAAAATTTCTCTTCCACTTTCCCTAATTTTTTTTTATTTCAATACACAATCATGGCCGTTTCTGATAAGTCTTTTGTTCTCTGAGCTCTCTTCTCCTTC
ATCAATGTCGCTCCATTTCACTTCCCTTTCAAACCCTATTCCCCAATTTCAAACCCCTTTTCCCATTCTTCCCTAATTTCATCTGGGGTTCTCTTAATTTCCTCTAATCC
GAATCCTGTTCGATTTTGGCGCCGACCCAGATAAACTTTTCGCCGGAAATGTCAGGCCCTCCAAGAATCCGGTCGATGAATGTGGCGGATTCCGATTCCCGGCCGGTTCT
TGGGCCGACCGGGAACAAAGCAAGAACTGTAGAGACTCGGAAACCTGGCGTAAAGCCATTGAAGAAGCTTGAAAAACCTCGCCAAGAAGCTGAATCGAAGGACAAGAGGG
TGCCATTGTCGCCCCCGGCCGCGGCGGCTCAATGTGTTACTACAGTGCCGTCGGTTTTGAGGCAACAGGATCGCCATCAGGCGATTCTTAATCTCTCTATGAATGCTTCG
TGTTCTTCGGATGCGTCGTCGGATTCGTTTAATAGTCGGGGTTCTAGTGCTAGAGGTACGAGGCCGCGTGGGCCGAATTCGAGGCGAAAGTCGAGTAGTGCGGTGAAGAA
GGCTGAGAAGGCTGTTGATAAGGTTGGTGTTGAAAGTGTGGTGGCTGTTGTGGATACGGTTGGGTGCTTAGAGCTTAGAAAAAGATGTGGTTGGGTAACACCTAATACAG
ATCCATGTTATGCTGCCTTTCATGATGAAGAATGGGGAGTACCAGTTCACGATGACAAAAAACTGTTCGAACTGCTTTGCCTCTCGGGTGCTTTGGCGGAACTTGCGTGG
CCTGCCATCCTCAACAAAAGACATCTATTTAGGGAAACCTTCTTGGACTTTGACCCAAGTGCTGTTTCAAAATTAAACGAGAAAAAGATGGTTGCATCTGGAAGTGCTGC
TACTGCTTTACTGTCAGAACCTAAGGTGCGAGCTATCATTGAAAATGGTCGTCAAATGTGCAAGGTAATTGATGAATTTGGTTCCTTCAACATGTACATTTGGAACTTTG
TCAACCACAAACCCATCATCAGTCAGTTTCGGTATCCACGCCAGGTCCCCGACAAGACGTCAAAAGCAGAAGTGATTAGCAAGGATCTCGTAAAGAGAGGATTTCGAAGC
GTGGGACCGACAGTCATATATACGTTCATGCAGGTGGCCGGGTTAACTAATGACCATCTCGTCGGTTGCTTTAGATACACAGAGTGTATAGAGACAACAGAGAAAGGAGA
AAGAAATGGTGAGATCAAGCCTACAATTAATGAGAAAATACCAGAGGCTCTGAAAAACTTGGAACTATAAAGAAGCCTTCAATCTTGCCTCAGTGTAATTAACTACCAGA
GTTCCTTTTTTTTTTTTGGTAATGGCTTGTAAATTCCATGATGGGATATCTGCCACTTCCCTTGATGGGGTAAATGTAGCAATGTTTTTTTTGTGTGCATACATTGACTT
GAATACAGAAGACAGGTAGAATCAGTTCTGTTAGTTTTACTACTTCAAGCATGTGGTTGGTTATTACATTTAGTATTAAA
Protein sequence
MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEAESKDKRVPLSPPAAAAQCVTTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRGSS
ARGTRPRGPNSRRKSSSAVKKAEKAVDKVGVESVVAVVDTVGCLELRKRCGWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELAWPAILNKRHLFRETFLD
FDPSAVSKLNEKKMVASGSAATALLSEPKVRAIIENGRQMCKVIDEFGSFNMYIWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLT
NDHLVGCFRYTECIETTEKGERNGEIKPTINEKIPEALKNLEL
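
The CDS and protein sequences above can be cross-checked by translating the CDS. The Python below is a minimal sketch that assumes the standard nuclear genetic code. The helper name translate and the two short fragments (the first 24 and last 9 nucleotides of the CDS listed above) are chosen for illustration only.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... ('*' marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # tolerate line breaks and lowercase input
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGTCAGGCCCTCCAAGAATCCGG"  # first 24 nt of the CDS above
cds_end = "GAACTATAA"                   # last 9 nt of the CDS above, including the stop
print(translate(cds_start))
print(translate(cds_end))
```

Passing the full CDS, with its line breaks, to the same function should give a string that can be compared directly against the protein sequence listed above.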