CuGenDBv2

CSPI03G42760 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI03G42760
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: DNA-3-methyladenine glycosylase 1
Genome location: Chr3:36718191..36722206
RNA-Seq Expression: CSPI03G42760
Synteny: CSPI03G42760
Gene Ontology terms:
  GO:0006285 - base-excision repair, AP site formation (biological process)
  GO:0006307 - DNA dealkylation involved in DNA repair (biological process)
  GO:0005634 - nucleus (cellular component)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0032993 - protein-DNA complex (cellular component)
  GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
  GO:0032131 - alkylated DNA binding (molecular function)
  GO:0043916 - DNA-7-methylguanine glycosylase activity (molecular function)
InterPro domains:
  IPR003265 - HhH-GPD domain
  IPR011257 - DNA glycosylase
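
The genome location in the record above uses a `chromosome:start..end` notation. As a minimal sketch, the following Python snippet parses such a string and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; the names `Location` and `parse_location` are hypothetical and not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: 1-based, inclusive coordinates.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'Chr:start..end' string into its three parts."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("Chr3:36718191..36722206")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 4016 bp on Chr3. This is the genomic span and may include introns, so it need not match the mRNA length.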


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0041602.1 DNA-3-methyladenine glycosylase 1 [Cucumis melo var. makuwa] (e-value: 8.4e-199; identity: 97.61%)
Query:  MGEQTQVQVQTQAQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNK
        MGEQTQVQVQTQ QSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNK
Subjt:  MGEQTQVQVQTQAQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNK

Query:  SKTAHQRAAFASATVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTA QRAAFASATVP ARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAHQRAAFASATVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        +LNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQHPQQPQLLDPLNSILNLG
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQ QDHHQEH   QHPQHPQQPQLLDPLN ILNLG
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQHPQQPQLLDPLNSILNLG

XP_004147864.1 DNA-3-methyladenine glycosylase 1 [Cucumis sativus] (e-value: 4.3e-211; identity: 99.74%)
Query:  MGEQTQVQVQTQAQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNK
        MGEQTQVQVQTQ QSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNK
Subjt:  MGEQTQVQVQTQAQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNK

Query:  SKTAHQRAAFASATVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTAHQRAAFASATVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAHQRAAFASATVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQHPQQPQLLDPLNSILNLGACAWGQ
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQHPQQPQLLDPLNSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQHPQQPQLLDPLNSILNLGACAWGQ

XP_008466558.1 PREDICTED: DNA-3-methyladenine glycosylase 1 [Cucumis melo] (e-value: 1.1e-203; identity: 97.64%)
Query:  MGEQTQVQVQTQAQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNK
        MGEQTQVQVQTQ QSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNK
Subjt:  MGEQTQVQVQTQAQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNK

Query:  SKTAHQRAAFASATVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTA QRAAFASATVP ARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAHQRAAFASATVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        +LNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQHPQQPQLLDPLNSILNLGACAWGQ
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQ QDHHQEH   QHPQHPQQPQLLDPLN ILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQHPQQPQLLDPLNSILNLGACAWGQ

XP_023536439.1 probable DNA-3-methyladenine glycosylase 2 [Cucurbita pepo subsp. pepo] (e-value: 3.0e-188; identity: 91.88%)
Query:  MGEQTQVQVQTQAQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNK
        MGE TQVQVQTQ QSQ Q QSQAQNTFHESSNSTT IAQATV LSEVMNAPSQ SSPPSKMPLRPRKIRKLSP+ESDPNSS VVAIPDGPKPIAT KSNK
Subjt:  MGEQTQVQVQTQAQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNK

Query:  SKTAHQRAAFASATVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTA QRAAFASA V  ARSLSCEGEVE+ALRHLRNADPLLA LIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAHQRAAFASATVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        AL+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQHPQQPQLLDPLNSILNLGACAWGQ
        D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQ Q+H  EH      QH QQPQLLDPLNSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQHPQQPQLLDPLNSILNLGACAWGQ

XP_038904569.1 DNA-3-methyladenine glycosylase 1-like [Benincasa hispida] (e-value: 1.3e-199; identity: 95.81%)
Query:  MGEQTQVQVQTQAQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNK
        MGEQTQVQVQTQ QSQ QPQSQ QNT HESSNSTTPIAQATVMLSEVMNAPSQ SSPPSKMPLRPRKIRKLSPEESDPNSSH +AIPDGPKPIAT KSNK
Subjt:  MGEQTQVQVQTQAQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNK

Query:  SKTAHQRAAFASATVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKT  QRAAFASA VPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAHQRAAFASATVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQHPQQPQLLDPLNSILNLGACAWGQ
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQ Q+HHQEH   QHPQHPQQPQLLDPLNSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQHPQQPQLLDPLNSILNLGACAWGQ

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LIY1 ENDO3c domain-containing protein (e-value: 2.1e-211; identity: 99.74%)
Query:  MGEQTQVQVQTQAQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNK
        MGEQTQVQVQTQ QSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNK
Subjt:  MGEQTQVQVQTQAQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNK

Query:  SKTAHQRAAFASATVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTAHQRAAFASATVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAHQRAAFASATVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQHPQQPQLLDPLNSILNLGACAWGQ
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQHPQQPQLLDPLNSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQHPQQPQLLDPLNSILNLGACAWGQ

A0A1S3CRJ5 DNA-3-methyladenine glycosylase 1 (e-value: 5.5e-204; identity: 97.64%)
Query:  MGEQTQVQVQTQAQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNK
        MGEQTQVQVQTQ QSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNK
Subjt:  MGEQTQVQVQTQAQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNK

Query:  SKTAHQRAAFASATVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTA QRAAFASATVP ARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAHQRAAFASATVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        +LNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQHPQQPQLLDPLNSILNLGACAWGQ
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQ QDHHQEH   QHPQHPQQPQLLDPLN ILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQHPQQPQLLDPLNSILNLGACAWGQ

A0A5A7TDX5 DNA-3-methyladenine glycosylase 1 (e-value: 4.1e-199; identity: 97.61%)
Query:  MGEQTQVQVQTQAQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNK
        MGEQTQVQVQTQ QSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNK
Subjt:  MGEQTQVQVQTQAQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNK

Query:  SKTAHQRAAFASATVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTA QRAAFASATVP ARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAHQRAAFASATVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        +LNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQHPQQPQLLDPLNSILNLG
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQ QDHHQEH   QHPQHPQQPQLLDPLN ILNLG
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQHPQQPQLLDPLNSILNLG

A0A6J1F7I4 probable DNA-3-methyladenine glycosylase 2 (e-value: 3.2e-188; identity: 92.15%)
Query:  MGEQTQVQVQTQAQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNK
        MGE TQVQVQTQ QSQ Q QSQAQNTFHESSNSTT IAQATV LSEVMNAPSQ SSPPSKMPLRPRKIRKLSP+ESDPNSS VVAIPDGPKPIAT KSNK
Subjt:  MGEQTQVQVQTQAQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNK

Query:  SKTAHQRAAFASATVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTA QRAAFASA V  ARSLSCEGEVE+ALRHLRNADPLLA LIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAHQRAAFASATVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        AL+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQHPQQPQLLDPLNSILNLGACAWGQ
        D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQ Q+H      PQH QH QQPQLLDPLNSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQHPQQPQLLDPLNSILNLGACAWGQ

A0A6J1IIA5 probable DNA-3-methyladenine glycosylase 2 (e-value: 1.5e-188; identity: 91.88%)
Query:  MGEQTQVQVQTQAQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNK
        MGE TQVQVQTQ QSQ Q QSQAQNTFHESSNSTT IAQATV LSEVMNAPSQ SSPPSKMPLRPRKIRKLSP+ESDPNSS VVAIPDGPKPIAT KSNK
Subjt:  MGEQTQVQVQTQAQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNK

Query:  SKTAHQRAAFASATVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTA QRAAFASA V  ARSLSCEGEVE+ALRHLRNADPLLA LIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAHQRAAFASATVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        AL+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQHPQQPQLLDPLNSILNLGACAWGQ
        D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQ Q+H  EH      QH QQPQLLDPLNSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQHPQQPQLLDPLNSILNLGACAWGQ

SwissProt top hits (e-value, % identity, alignment)
O31544 Putative DNA-3-methyladenine glycosylase YfjP (e-value: 2.4e-18; identity: 30.72%)
Query:  LTRSILYQQLAYKAGTSIYTRFIALCGGEA-GV----LPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGS
        + + I++QQL      ++  RF+   G +  GV     PET+  L+ Q LR +  S RK+ Y  D +R    G LS   + +M D+ +   L  + GIG 
Subjt:  LTRSILYQQLAYKAGTSIYTRFIALCGGEA-GV----LPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGS

Query:  WSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAE
        W+V   ++F L RP++ P+ D+ ++  ++  + L++ P    M  + ++W PY S  S Y+WR  E
Subjt:  WSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAE

O94468 Alkylbase DNA glycosidase-like protein mag2 (e-value: 2.6e-17; identity: 24.88%)
Query:  LSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSF--QTPFLALTRSILYQQLAYKAGTSIYTRFIALCG-GEAGVLPETVLALNPQQLRQIGISGRKS
        +S + + + A +HL + D   + L+      T        P+  + R+I  Q+L+  A  SI  +F   C   +    P+ ++  + + L + G S  KS
Subjt:  LSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSF--QTPFLALTRSILYQQLAYKAGTSIYTRFIALCG-GEAGVLPETVLALNPQQLRQIGISGRKS

Query:  SYLHDLARKYQN-GILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSW
          +H +A    N  I S   I  M ++ L   L+ + G+  W++ M+ IF+L R D++P +D  ++   +  + L   P+  ++++L +  +PYR++ +W
Subjt:  SYLHDLARKYQN-GILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSW

Query:  YMWRL
        Y+W++
Subjt:  YMWRL

P37878 DNA-3-methyladenine glycosylase (e-value: 5.6e-12; identity: 25.73%)
Query:  FLALTRSILYQQLAYKAGTSIYTRFIALCGG-------EAGVLP--ETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLT
        F AL   +L QQ+      S+  +F+   G        +  V P  E +  L P  L  I ++ +KS Y+  +AR   +G LS   ++ M+ K     L 
Subjt:  FLALTRSILYQQLAYKAGTSIYTRFIALCGG-------EAGVLP--ETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLT

Query:  MVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRL
         + GIG W+ +  ++  L  P   PI+D+ +   +++L N+   P   ++ ++   W+ ++S  ++Y+WR+
Subjt:  MVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRL

Q92383 DNA-3-methyladenine glycosylase 1 (e-value: 1.1e-20; identity: 29.74%)
Query:  HLRNADPLLAQLIDL--HQRPTFD-SFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQN
        HL   D    +L+ L  + RP      + P+  L R++  QQL  KA  +I+ RF ++        PE +  ++ + +R  G S RK   L  +A    +
Subjt:  HLRNADPLLAQLIDL--HQRPTFD-SFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQN

Query:  GIL-SDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAE
        G++ +      + ++ L   LT + GIG W+V M +IFSL+R DV+P +DL++R G + L+ L ++P    + +  E   P+R+  +WY+W+ ++
Subjt:  GIL-SDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAE

Arabidopsis top hits (e-value, % identity, alignment)
AT1G19480.1 DNA glycosylase superfamily protein (e-value: 8.1e-115; identity: 60.3%)
Query:  MGEQTQVQVQTQAQSQPQ-PQSQAQNTFH--------ESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPE---ESDPNSSHVVAIPD
        MGEQ+  Q  TQ QS PQ P+    N           +S+  +  I  +T + +  +     +SSPPSK+PLRPRKIRKL+ +     +   +  ++   
Subjt:  MGEQTQVQVQTQAQSQPQ-PQSQAQNTFH--------ESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPE---ESDPNSSHVVAIPD

Query:  GPKPIAT--VKSNKSKTAHQRAAFASATVP--PARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRF
           P+AT      K K +H RA     TVP   AR L+CEGE+E A+ +LRNADPLLA LID+H  PTF+SF+TPFLAL R+ILYQQLA KAG SIYTRF
Subjt:  GPKPIAT--VKSNKSKTAHQRAAFASATVP--PARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRF

Query:  IALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG
        ++LCGGE  V+PETVL+LNPQQLRQIG+SGRK+SYLHDLARKYQNGILSD AI+NMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL VRKG
Subjt:  IALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG

Query:  VQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQHPQQPQLLDPLNSILNLGACAWGQ
        VQLLY L++LPRPSQM+Q C KWRPYRSVGSWYMWRL EAK  S+S AAVAAG SL    +D  QEH         QQ QL+DPLN + ++G  AWGQ
Subjt:  VQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQHPQQPQLLDPLNSILNLGACAWGQ

AT1G19480.2 DNA glycosylase superfamily protein (e-value: 3.4e-113; identity: 60.2%)
Query:  MGEQTQVQVQTQAQSQPQ-PQSQAQNTFH--------ESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPE---ESDPNSSHVVAIPD
        MGEQ+  Q  TQ QS PQ P+    N           +S+  +  I  +T + +  +     +SSPPSK+PLRPRKIRKL+ +     +   +  ++   
Subjt:  MGEQTQVQVQTQAQSQPQ-PQSQAQNTFH--------ESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPE---ESDPNSSHVVAIPD

Query:  GPKPIAT--VKSNKSKTAHQRAAFASATVP--PARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRF
           P+AT      K K +H RA     TVP   AR L+CEGE+E A+ +LRNADPLLA LID+H  PTF+SF+TPFLAL R+ILYQQLA KAG SIYTRF
Subjt:  GPKPIAT--VKSNKSKTAHQRAAFASATVP--PARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRF

Query:  IALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG
        ++LCGGE  V+PETVL+LNPQQLRQIG+SGRK+SYLHDLARKYQNGILSD AI+NMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL VRKG
Subjt:  IALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG

Query:  VQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQHPQQPQLLDPLNSILNLG
        VQLLY L++LPRPSQM+Q C KWRPYRSVGSWYMWRL EAK  S+S AAVAAG SL    +D  QEH         QQ QL+DPLN + ++G
Subjt:  VQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQHPQQPQLLDPLNSILNLG

AT1G75230.1 DNA glycosylase superfamily protein (e-value: 1.7e-117; identity: 59.26%)
Query:  MGEQTQVQVQTQAQSQPQPQSQAQNTFHESSNSTTPI-----------AQATVMLSEVMNAP-----SQISSPPSKMPLRPRKIRKLSPEE--SDP-NSS
        MGE +     +Q  S   P +Q ++  HE+ N   P               +++ S  + AP       +SSPP+K+PLRPRKIRKLSP++  SD  N  
Subjt:  MGEQTQVQVQTQAQSQPQPQSQAQNTFHESSNSTTPI-----------AQATVMLSEVMNAP-----SQISSPPSKMPLRPRKIRKLSPEE--SDP-NSS

Query:  HVVAIPDGPKPIATVKSNKSKTAHQRAAFASATVP--PARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTS
        H ++     KP    K ++S+T          TVP   ARSL+CEGE+E AL HLR+ DPLLA LID+H  PTF++FQTPFLAL RSILYQQLA KAG S
Subjt:  HVVAIPDGPKPIATVKSNKSKTAHQRAAFASATVP--PARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTS

Query:  IYTRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDL
        IYTRF+ALCGGE GV+PE VL L PQQLRQIG+SGRK+SYLHDLARKYQNGILSD  IVNMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL
Subjt:  IYTRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDL

Query:  NVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQL-QHQDHHQEHQHPQHPQH-PQQPQLLDPLNSILNLGA
         VRKGVQ+L  +E+LPRPS+M+QLCEKWRPYRSV SWY+WRL E+K    +AAA  AGA+L   Q +D  Q+ Q  QH QH  QQPQL+DPLN++ ++G 
Subjt:  NVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQL-QHQDHHQEHQHPQHPQH-PQQPQLLDPLNSILNLGA

Query:  CAWGQ
         AWGQ
Subjt:  CAWGQ

AT1G75230.2 DNA glycosylase superfamily protein (e-value: 9.5e-116; identity: 59.15%)
Query:  MGEQTQVQVQTQAQSQPQPQSQAQNTFHESSNSTTPI-----------AQATVMLSEVMNAP-----SQISSPPSKMPLRPRKIRKLSPEE--SDP-NSS
        MGE +     +Q  S   P +Q ++  HE+ N   P               +++ S  + AP       +SSPP+K+PLRPRKIRKLSP++  SD  N  
Subjt:  MGEQTQVQVQTQAQSQPQPQSQAQNTFHESSNSTTPI-----------AQATVMLSEVMNAP-----SQISSPPSKMPLRPRKIRKLSPEE--SDP-NSS

Query:  HVVAIPDGPKPIATVKSNKSKTAHQRAAFASATVP--PARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTS
        H ++     KP    K ++S+T          TVP   ARSL+CEGE+E AL HLR+ DPLLA LID+H  PTF++FQTPFLAL RSILYQQLA KAG S
Subjt:  HVVAIPDGPKPIATVKSNKSKTAHQRAAFASATVP--PARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTS

Query:  IYTRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDL
        IYTRF+ALCGGE GV+PE VL L PQQLRQIG+SGRK+SYLHDLARKYQNGILSD  IVNMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL
Subjt:  IYTRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDL

Query:  NVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQL-QHQDHHQEHQHPQHPQH-PQQPQLLDPLNSILNLG
         VRKGVQ+L  +E+LPRPS+M+QLCEKWRPYRSV SWY+WRL E+K    +AAA  AGA+L   Q +D  Q+ Q  QH QH  QQPQL+DPLN++ ++G
Subjt:  NVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQL-QHQDHHQEHQHPQHPQH-PQQPQLLDPLNSILNLG

AT3G50880.1 DNA glycosylase superfamily protein (e-value: 6.2e-75; identity: 56.16%)
Query:  SQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNKSKTAHQRAAFASATVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLH-QR
        S++S   S++  RPRKIRK+S   SDP+          P+ I T                    PP   LS +  V+IALRHL+++D LL  LI  H   
Subjt:  SQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNKSKTAHQRAAFASATVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLH-QR

Query:  PTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALC-GGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTML
        P FDS  TPFL+L RSILYQQLA KA   IY RFI+L  GGEAGV+PE+V++L+   LR+IG+SGRK+SYLHDLA KY NG+LSD  I+ M D+ L   L
Subjt:  PTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALC-GGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTML

Query:  TMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAK
        T+V GIG W+VHMFMIFSLHRPDVLP+ DL VRKGV+ LY L+ LP P QM+QLCEKWRPYRSVGSWYMWRL E++
Subjt:  TMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAK
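
The % identity values listed with the hits can be related to the middle line of each alignment block. As a rough sketch, assuming the usual BLAST-style convention (a letter on the middle line marks an identical residue, `+` a conservative substitution, a space a mismatch or gap), the Python function below counts identical columns. The function name `percent_identity` is hypothetical, and the tool that produced the values above may use a different denominator, so the result is only an approximation of the reported figures.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percentage of aligned columns whose midline character is a letter."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    # Letters mark identities; '+' and spaces are not counted.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Toy example: the first ten columns of one alignment row in this record.
query = "ALNPQQLRQI"
midline = "+LNPQQLRQI"
print(percent_identity(midline, len(query)))
```

To apply this to a full hit, concatenate the middle lines of all its rows and pass the total number of aligned columns.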


Sequences
CDS sequence
ATGGGAGAGCAAACGCAAGTGCAAGTTCAGACACAAGCCCAAAGCCAACCTCAACCACAATCGCAGGCTCAGAATACGTTTCATGAATCCTCCAACTCCACAACCCCTAT
CGCTCAAGCCACCGTAATGCTAAGCGAGGTGATGAATGCGCCATCGCAAATCTCTTCTCCTCCATCGAAAATGCCTTTGCGACCCCGGAAGATCCGAAAGCTCTCGCCGG
AGGAATCAGATCCGAATTCCTCCCATGTTGTTGCCATCCCGGATGGACCGAAACCTATCGCCACCGTTAAATCTAACAAAAGCAAGACGGCCCATCAACGCGCCGCCTTC
GCGTCTGCCACAGTTCCGCCTGCACGATCACTTTCCTGCGAGGGCGAGGTGGAAATCGCGCTTCGGCATCTACGGAATGCGGATCCACTCCTTGCACAGTTGATTGACCT
CCATCAACGTCCTACCTTCGACAGTTTCCAAACCCCATTCCTTGCCCTAACTCGAAGTATCTTGTATCAGCAGCTGGCTTATAAGGCTGGCACTTCTATCTACACCCGTT
TCATCGCCCTTTGTGGTGGTGAGGCTGGTGTTCTTCCTGAAACCGTTCTTGCTTTGAACCCTCAACAACTCAGGCAAATTGGAATTTCGGGTCGTAAGTCTAGTTACCTT
CATGACCTTGCAAGGAAATATCAAAATGGGATTCTTTCAGACCCGGCAATTGTAAATATGGATGATAAATCGCTTTTCACGATGCTCACAATGGTCAATGGAATTGGGTC
TTGGTCTGTTCATATGTTCATGATTTTCTCGCTGCACAGACCAGATGTGCTTCCTATCAATGATCTTAATGTTCGCAAAGGGGTTCAGCTTCTTTACAATCTTGAAGAGT
TGCCTCGACCATCGCAGATGGATCAGTTATGCGAGAAGTGGAGGCCTTATCGATCGGTTGGGTCGTGGTATATGTGGAGGCTTGCTGAGGCAAAGGGGGCTTCTTCAAGC
GCTGCAGCAGTGGCTGCTGGTGCCAGTTTACAACTGCAGCATCAAGATCACCATCAGGAACACCAGCATCCACAGCATCCACAGCATCCTCAGCAGCCACAACTTCTTGA
CCCGCTTAATAGCATTCTCAATCTCGGGGCCTGTGCATGGGGACAGTGA
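
The CDS above can be checked against the protein sequence of this record by translating it. The following is a minimal Python sketch that assumes the standard nuclear genetic code (NCBI translation table 1); the helper name `translate` is hypothetical and is not a CuGenDB function.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGGGAGAGCAAACGCAAGTG"))
```

Passing the full CDS (with or without its line breaks) should give a string that can be compared directly with the protein sequence of this record.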
mRNA sequence
GGAAATCGAAAGCCAGAACTGAACGGAGCGACGGACAGAAACAGAGAAACAAAGAACAACGGCGGTGGGATTTCTTTCTCCTCCATTTCTATGCCATTGATCATCCACTT
CTGTACTCTGTAACATTATTCCAAATCCCCAATCTTCTCAGTTTCACAGTTGATTTTCAATTTCTCTTTCTGTTAGACTTCAACCCGTTCTTGCGGGCATCGAGTTTTGC
CCCCTTTCTGATTCTCCGCCGTTACCTTCCCCACCGATTCTTGGCTGTGTGTTTTGTTTCATTTCGACCGATTTGTCTGAGCAAAGGGGCAAATTACAGATTGAATTGAA
TCTCTTACATATGGGAGAGCAAACGCAAGTGCAAGTTCAGACACAAGCCCAAAGCCAACCTCAACCACAATCGCAGGCTCAGAATACGTTTCATGAATCCTCCAACTCCA
CAACCCCTATCGCTCAAGCCACCGTAATGCTAAGCGAGGTGATGAATGCGCCATCGCAAATCTCTTCTCCTCCATCGAAAATGCCTTTGCGACCCCGGAAGATCCGAAAG
CTCTCGCCGGAGGAATCAGATCCGAATTCCTCCCATGTTGTTGCCATCCCGGATGGACCGAAACCTATCGCCACCGTTAAATCTAACAAAAGCAAGACGGCCCATCAACG
CGCCGCCTTCGCGTCTGCCACAGTTCCGCCTGCACGATCACTTTCCTGCGAGGGCGAGGTGGAAATCGCGCTTCGGCATCTACGGAATGCGGATCCACTCCTTGCACAGT
TGATTGACCTCCATCAACGTCCTACCTTCGACAGTTTCCAAACCCCATTCCTTGCCCTAACTCGAAGTATCTTGTATCAGCAGCTGGCTTATAAGGCTGGCACTTCTATC
TACACCCGTTTCATCGCCCTTTGTGGTGGTGAGGCTGGTGTTCTTCCTGAAACCGTTCTTGCTTTGAACCCTCAACAACTCAGGCAAATTGGAATTTCGGGTCGTAAGTC
TAGTTACCTTCATGACCTTGCAAGGAAATATCAAAATGGGATTCTTTCAGACCCGGCAATTGTAAATATGGATGATAAATCGCTTTTCACGATGCTCACAATGGTCAATG
GAATTGGGTCTTGGTCTGTTCATATGTTCATGATTTTCTCGCTGCACAGACCAGATGTGCTTCCTATCAATGATCTTAATGTTCGCAAAGGGGTTCAGCTTCTTTACAAT
CTTGAAGAGTTGCCTCGACCATCGCAGATGGATCAGTTATGCGAGAAGTGGAGGCCTTATCGATCGGTTGGGTCGTGGTATATGTGGAGGCTTGCTGAGGCAAAGGGGGC
TTCTTCAAGCGCTGCAGCAGTGGCTGCTGGTGCCAGTTTACAACTGCAGCATCAAGATCACCATCAGGAACACCAGCATCCACAGCATCCACAGCATCCTCAGCAGCCAC
AACTTCTTGACCCGCTTAATAGCATTCTCAATCTCGGGGCCTGTGCATGGGGACAGTGACTCAGATTGAAAAGAGTACATCTCTGAATCCCAATCAATTTGTCCACTAAA
TGAAAAGTATGCCGACTATGTGGGAGTAGAAGACTCAACAATGAATATTTTGTTCCTCGAGGTTGGATACTCTAGACTTCCATACTAAATAATGAAATGACATGTGCTAG
TTTCTTTTCTAGGTTGGTTGACATCACTTTTAAGGCCGTCAACATATATCATGCGGTGGGTGGGAATTTATCCGTAGAATGTTCCTCTGAATCTCTGCTTTCAATGTGAT
TTAGCACAGTTGGACTTCAACCAAACCGCTTGTTCATTTGCGCATATTCAACGGCAGAATAGTCTGTAATGACTTTATGATCTATCTCTTACCCTTCTCTCCATTTCAAA
TTTACTGCCTCCTTGGTCATAGTTTGTATATAGAACAAGGATCATGAATGCGGATAATCAATGACTGACTGGCTGTGGCATTTGCTATTTGGACTCATATTCTTATCTTA
TCTTTCCCCATCTATTCAACATGGTGGGAGAAAAGACATCGCTCAATAGCCCCATTCATCTTTCTTTCTTTTGAAAATATAGCTGATGTTTATAATAATAAAGTGTAGTA
TTGTAGATTTTACCTCTGAGAACTTGATTTATGGGGATTGGGGACCATGAGGCTTGAATTCTG
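
Because the mRNA sequence contains the CDS together with flanking untranslated regions, the two can be separated by locating the CDS inside the mRNA. The sketch below is a minimal Python illustration that assumes the CDS occurs as one contiguous substring of the mRNA; `split_utrs` is a hypothetical helper name.

```python
def split_utrs(mrna: str, cds: str) -> tuple[str, str]:
    """Return (5' UTR, 3' UTR) given an mRNA and the CDS it contains."""
    mrna = "".join(mrna.split()).upper()  # drop line breaks and spaces
    cds = "".join(cds.split()).upper()
    start = mrna.find(cds)
    if start < 0:
        raise ValueError("CDS not found as a contiguous part of the mRNA")
    return mrna[:start], mrna[start + len(cds):]


# Toy sequences, not taken from this record.
five_prime, three_prime = split_utrs("GGAAATGAAATGACTC", "ATGAAATGA")
print(len(five_prime), len(three_prime))
```

Applied to the sequences of this record, the lengths of the two returned strings give the 5' and 3' UTR lengths.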
Protein sequence
MGEQTQVQVQTQAQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNKSKTAHQRAAF
ASATVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYL
HDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSS
AAAVAAGASLQLQHQDHHQEHQHPQHPQHPQQPQLLDPLNSILNLGACAWGQ