; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC05G098460 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC05G098460
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionDNA-3-methyladenine glycosylase 1
Genome locationCiama_Chr05:29653102..29658181
RNA-Seq ExpressionCaUC05G098460
SyntenyCaUC05G098460
Gene Ontology termsGO:0006285 - base-excision repair, AP site formation (biological process)
GO:0006307 - DNA dealkylation involved in DNA repair (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0032993 - protein-DNA complex (cellular component)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
GO:0032131 - alkylated DNA binding (molecular function)
GO:0043916 - DNA-7-methylguanine glycosylase activity (molecular function)
InterPro domainsIPR003265 - HhH-GPD domain
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0041602.1 DNA-3-methyladenine glycosylase 1 [Cucumis melo var. makuwa]6.7e-19295.98Show/hide
Query:  MGEQTQVQVQTQTQTQSQPQSQAQNTLHESSNSTTPIAQATVMLSEVMNVPSQTSSPPSKMPLRPRKIRKLSPEESDP-TSHVVAISDGPKPIATGKSNK
        MGEQTQVQVQTQTQ+Q QPQSQAQNT HESSNSTTPIAQATVMLSEVMN PSQ SSPPSKMPLRPRKIRKLSPEESDP +SHVVAI DGPKPIAT KSNK
Subjt:  MGEQTQVQVQTQTQTQSQPQSQAQNTLHESSNSTTPIAQATVMLSEVMNVPSQTSSPPSKMPLRPRKIRKLSPEESDP-TSHVVAISDGPKPIATGKSNK

Query:  SKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTAQQRAAFASA VP ARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        +LNPQQLRQIGISGRKSSYLHDLARKYQNGILSD AIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHHQEHQHPQHPQQPQLLDPLNSILNLG
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQ+HHQEHQHPQHPQQPQLLDPLN ILNLG
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHHQEHQHPQHPQQPQLLDPLNSILNLG

KAG7024195.1 mag1 [Cucurbita argyrosperma subsp. argyrosperma]3.2e-20288.68Show/hide
Query:  MGEQTQVQVQTQTQTQSQPQSQAQNTLHESSNSTTPIAQATVMLSEVMNVPSQTSSPPSKMPLRPRKIRKLSPEESDP-TSHVVAISDGPKPIATGKSNK
        MGE TQVQVQTQTQ+QSQ QSQAQNT HESSNSTT IAQATV LSEVMN PSQTSSPPSKMPLRPRKIRKLSP+ESDP +S VVAI DGPKPIAT KSNK
Subjt:  MGEQTQVQVQTQTQTQSQPQSQAQNTLHESSNSTTPIAQATVMLSEVMNVPSQTSSPPSKMPLRPRKIRKLSPEESDP-TSHVVAISDGPKPIATGKSNK

Query:  SKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTAQQRAAFASAPV  ARSLSCEGEVE+ALRHLRNADPLLA LIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        AL+PQQLRQIGISGRKSSYLHDLARKYQNGILSD AIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHHQEHQHPQHPQQPQLLDPLNSILNLG--LGTLFRKSKVEGLCLGAVTQIEKST
        D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQEH  EH   QH QQPQLLDPLNSILNLG   G++ +K    GLCLGAVT+IEKS 
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHHQEHQHPQHPQQPQLLDPLNSILNLG--LGTLFRKSKVEGLCLGAVTQIEKST

Query:  SLQISQSIYPLNEKYANYVGVEDSTMNIWFLEV
        SLQI+QS++ LN+KYANYVGVEDS M+IW LEV
Subjt:  SLQISQSIYPLNEKYANYVGVEDSTMNIWFLEV

XP_004147864.1 DNA-3-methyladenine glycosylase 1 [Cucumis sativus]4.3e-19195.48Show/hide
Query:  MGEQTQVQVQTQTQTQSQPQSQAQNTLHESSNSTTPIAQATVMLSEVMNVPSQTSSPPSKMPLRPRKIRKLSPEESDP-TSHVVAISDGPKPIATGKSNK
        MGEQTQVQVQTQTQ+Q QPQSQAQNT HESSNSTTPIAQATVMLSEVMN PSQ SSPPSKMPLRPRKIRKLSPEESDP +SHVVAI DGPKPIAT KSNK
Subjt:  MGEQTQVQVQTQTQTQSQPQSQAQNTLHESSNSTTPIAQATVMLSEVMNVPSQTSSPPSKMPLRPRKIRKLSPEESDP-TSHVVAISDGPKPIATGKSNK

Query:  SKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTA QRAAFASA VPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSD AIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHHQEH---QHPQHPQQPQLLDPLNSILNLG
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQ Q+HHQEH   QHPQHPQQPQLLDPLNSILNLG
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHHQEH---QHPQHPQQPQLLDPLNSILNLG

XP_008466558.1 PREDICTED: DNA-3-methyladenine glycosylase 1 [Cucumis melo]6.7e-19295.98Show/hide
Query:  MGEQTQVQVQTQTQTQSQPQSQAQNTLHESSNSTTPIAQATVMLSEVMNVPSQTSSPPSKMPLRPRKIRKLSPEESDP-TSHVVAISDGPKPIATGKSNK
        MGEQTQVQVQTQTQ+Q QPQSQAQNT HESSNSTTPIAQATVMLSEVMN PSQ SSPPSKMPLRPRKIRKLSPEESDP +SHVVAI DGPKPIAT KSNK
Subjt:  MGEQTQVQVQTQTQTQSQPQSQAQNTLHESSNSTTPIAQATVMLSEVMNVPSQTSSPPSKMPLRPRKIRKLSPEESDP-TSHVVAISDGPKPIATGKSNK

Query:  SKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTAQQRAAFASA VP ARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        +LNPQQLRQIGISGRKSSYLHDLARKYQNGILSD AIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHHQEHQHPQHPQQPQLLDPLNSILNLG
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQ+HHQEHQHPQHPQQPQLLDPLN ILNLG
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHHQEHQHPQHPQQPQLLDPLNSILNLG

XP_038904569.1 DNA-3-methyladenine glycosylase 1-like [Benincasa hispida]4.5e-19697.32Show/hide
Query:  MGEQTQVQVQTQTQTQSQPQSQAQNTLHESSNSTTPIAQATVMLSEVMNVPSQTSSPPSKMPLRPRKIRKLSPEESDP-TSHVVAISDGPKPIATGKSNK
        MGEQTQVQVQTQTQ+QSQPQSQ QNTLHESSNSTTPIAQATVMLSEVMN PSQTSSPPSKMPLRPRKIRKLSPEESDP +SH +AI DGPKPIATGKSNK
Subjt:  MGEQTQVQVQTQTQTQSQPQSQAQNTLHESSNSTTPIAQATVMLSEVMNVPSQTSSPPSKMPLRPRKIRKLSPEESDP-TSHVVAISDGPKPIATGKSNK

Query:  SKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKT QQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSD AIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHHQEHQHPQHPQQPQLLDPLNSILNLG
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHHQEHQHPQHPQQPQLLDPLNSILNLG
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHHQEHQHPQHPQQPQLLDPLNSILNLG

TrEMBL top hitse value%identityAlignment
A0A0A0LIY1 ENDO3c domain-containing protein2.1e-19195.48Show/hide
Query:  MGEQTQVQVQTQTQTQSQPQSQAQNTLHESSNSTTPIAQATVMLSEVMNVPSQTSSPPSKMPLRPRKIRKLSPEESDP-TSHVVAISDGPKPIATGKSNK
        MGEQTQVQVQTQTQ+Q QPQSQAQNT HESSNSTTPIAQATVMLSEVMN PSQ SSPPSKMPLRPRKIRKLSPEESDP +SHVVAI DGPKPIAT KSNK
Subjt:  MGEQTQVQVQTQTQTQSQPQSQAQNTLHESSNSTTPIAQATVMLSEVMNVPSQTSSPPSKMPLRPRKIRKLSPEESDP-TSHVVAISDGPKPIATGKSNK

Query:  SKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTA QRAAFASA VPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSD AIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHHQEH---QHPQHPQQPQLLDPLNSILNLG
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQ Q+HHQEH   QHPQHPQQPQLLDPLNSILNLG
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHHQEH---QHPQHPQQPQLLDPLNSILNLG

A0A1S3CRJ5 DNA-3-methyladenine glycosylase 13.2e-19295.98Show/hide
Query:  MGEQTQVQVQTQTQTQSQPQSQAQNTLHESSNSTTPIAQATVMLSEVMNVPSQTSSPPSKMPLRPRKIRKLSPEESDP-TSHVVAISDGPKPIATGKSNK
        MGEQTQVQVQTQTQ+Q QPQSQAQNT HESSNSTTPIAQATVMLSEVMN PSQ SSPPSKMPLRPRKIRKLSPEESDP +SHVVAI DGPKPIAT KSNK
Subjt:  MGEQTQVQVQTQTQTQSQPQSQAQNTLHESSNSTTPIAQATVMLSEVMNVPSQTSSPPSKMPLRPRKIRKLSPEESDP-TSHVVAISDGPKPIATGKSNK

Query:  SKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTAQQRAAFASA VP ARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        +LNPQQLRQIGISGRKSSYLHDLARKYQNGILSD AIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHHQEHQHPQHPQQPQLLDPLNSILNLG
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQ+HHQEHQHPQHPQQPQLLDPLN ILNLG
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHHQEHQHPQHPQQPQLLDPLNSILNLG

A0A5A7TDX5 DNA-3-methyladenine glycosylase 13.2e-19295.98Show/hide
Query:  MGEQTQVQVQTQTQTQSQPQSQAQNTLHESSNSTTPIAQATVMLSEVMNVPSQTSSPPSKMPLRPRKIRKLSPEESDP-TSHVVAISDGPKPIATGKSNK
        MGEQTQVQVQTQTQ+Q QPQSQAQNT HESSNSTTPIAQATVMLSEVMN PSQ SSPPSKMPLRPRKIRKLSPEESDP +SHVVAI DGPKPIAT KSNK
Subjt:  MGEQTQVQVQTQTQTQSQPQSQAQNTLHESSNSTTPIAQATVMLSEVMNVPSQTSSPPSKMPLRPRKIRKLSPEESDP-TSHVVAISDGPKPIATGKSNK

Query:  SKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTAQQRAAFASA VP ARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        +LNPQQLRQIGISGRKSSYLHDLARKYQNGILSD AIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHHQEHQHPQHPQQPQLLDPLNSILNLG
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQ+HHQEHQHPQHPQQPQLLDPLN ILNLG
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHHQEHQHPQHPQQPQLLDPLNSILNLG

A0A6J1CKY0 uncharacterized protein LOC1110122475.2e-18291.42Show/hide
Query:  MGEQTQVQVQTQTQTQSQPQSQAQNTLHESSNSTTPIAQATVMLSEVMNVPSQTSSPPSKMPLRPRKIRKLSPEESDPTSHVVA-ISDGPKPIATGKSNK
        MGEQTQVQVQTQTQ+QSQPQSQAQNT+H+SSNSTT IAQATV LSEVMN P+QTSSPPSKMPLRPRKIRKLSP+ESD  S  +A ++DGPKPI++GKSNK
Subjt:  MGEQTQVQVQTQTQTQSQPQSQAQNTLHESSNSTTPIAQATVMLSEVMNVPSQTSSPPSKMPLRPRKIRKLSPEESDPTSHVVA-ISDGPKPIATGKSNK

Query:  SKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        +KTAQQRAAFASAP+ PARSLSCEGEVEIALRHLRNADPLLA LIDLHQRP FDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        +LNPQQLRQIGISGRKSSYLHDLARKYQNGILSD AIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHHQEHQHPQHPQQPQLLDPLNSILNLG
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAA+AAGASLQLQQQEH QEH   QH QQPQLLDP+NSILNLG
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHHQEHQHPQHPQQPQLLDPLNSILNLG

A0A6J1IIA5 probable DNA-3-methyladenine glycosylase 21.2e-18192.49Show/hide
Query:  MGEQTQVQVQTQTQTQSQPQSQAQNTLHESSNSTTPIAQATVMLSEVMNVPSQTSSPPSKMPLRPRKIRKLSPEESDP-TSHVVAISDGPKPIATGKSNK
        MGE TQVQVQTQTQ+QSQ QSQAQNT HESSNSTT IAQATV LSEVMN PSQTSSPPSKMPLRPRKIRKLSP+ESDP +S VVAI DGPKPIAT KSNK
Subjt:  MGEQTQVQVQTQTQTQSQPQSQAQNTLHESSNSTTPIAQATVMLSEVMNVPSQTSSPPSKMPLRPRKIRKLSPEESDP-TSHVVAISDGPKPIATGKSNK

Query:  SKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTAQQRAAFASAPV  ARSLSCEGEVE+ALRHLRNADPLLA LIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        AL+PQQLRQIGISGRKSSYLHDLARKYQNGILSD AIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHHQEHQHPQHPQQPQLLDPLNSILNLG
        D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQEH  EH   QH QQPQLLDPLNSILNLG
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHHQEHQHPQHPQQPQLLDPLNSILNLG

SwissProt top hitse value%identityAlignment
O31544 Putative DNA-3-methyladenine glycosylase YfjP3.5e-1830.72Show/hide
Query:  LTRSILYQQLAYKAGTSIYTRFIALCGGEA-GV----LPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGS
        + + I++QQL      ++  RF+   G +  GV     PET+  L+ Q LR +  S RK+ Y  D +R    G LS   + +M D+ +   L  + GIG 
Subjt:  LTRSILYQQLAYKAGTSIYTRFIALCGGEA-GV----LPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGS

Query:  WSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAE
        W+V   ++F L RP++ P+ D+ ++  ++  + L++ P    M  + ++W PY S  S Y+WR  E
Subjt:  WSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAE

O94468 Alkylbase DNA glycosidase-like protein mag23.9e-1724.88Show/hide
Query:  LSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSF--QTPFLALTRSILYQQLAYKAGTSIYTRFIALCG-GEAGVLPETVLALNPQQLRQIGISGRKS
        +S + + + A +HL + D   + L+      T        P+  + R+I  Q+L+  A  SI  +F   C   +    P+ ++  + + L + G S  KS
Subjt:  LSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSF--QTPFLALTRSILYQQLAYKAGTSIYTRFIALCG-GEAGVLPETVLALNPQQLRQIGISGRKS

Query:  SYLHDLARKYQN-GILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSW
          +H +A    N  I S   I  M ++ L   L+ + G+  W++ M+ IF+L R D++P +D  ++   +  + L   P+  ++++L +  +PYR++ +W
Subjt:  SYLHDLARKYQN-GILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSW

Query:  YMWRL
        Y+W++
Subjt:  YMWRL

P37878 DNA-3-methyladenine glycosylase3.8e-1225.73Show/hide
Query:  FLALTRSILYQQLAYKAGTSIYTRFIALCGG-------EAGVLP--ETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLT
        F AL   +L QQ+      S+  +F+   G        +  V P  E +  L P  L  I ++ +KS Y+  +AR   +G LS + ++ M+ K     L 
Subjt:  FLALTRSILYQQLAYKAGTSIYTRFIALCGG-------EAGVLP--ETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLT

Query:  MVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRL
         + GIG W+ +  ++  L  P   PI+D+ +   +++L N+   P   ++ ++   W+ ++S  ++Y+WR+
Subjt:  MVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRL

Q92383 DNA-3-methyladenine glycosylase 19.9e-2129.74Show/hide
Query:  HLRNADPLLAQLIDL--HQRPTFD-SFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQN
        HL   D    +L+ L  + RP      + P+  L R++  QQL  KA  +I+ RF ++        PE +  ++ + +R  G S RK   L  +A    +
Subjt:  HLRNADPLLAQLIDL--HQRPTFD-SFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQN

Query:  GIL-SDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAE
        G++ + +    + ++ L   LT + GIG W+V M +IFSL+R DV+P +DL++R G + L+ L ++P    + +  E   P+R+  +WY+W+ ++
Subjt:  GIL-SDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAE

Arabidopsis top hitse value%identityAlignment
AT1G19480.1 DNA glycosylase superfamily protein2.8e-11160.93Show/hide
Query:  MGEQTQVQVQTQTQTQSQPQSQAQNTLH-----------ESSNSTTPIAQATVMLSEVMNVPSQTSSPPSKMPLRPRKIRKLSPE----ESDPTSHVVAI
        MGEQ+    Q  TQ QS PQS   +T +           +S+  +  I  +T + +  +      SSPPSK+PLRPRKIRKL+ +      D  +  ++ 
Subjt:  MGEQTQVQVQTQTQTQSQPQSQAQNTLH-----------ESSNSTTPIAQATVMLSEVMNVPSQTSSPPSKMPLRPRKIRKLSPE----ESDPTSHVVAI

Query:  SDGPKPIAT-GKS-NKSKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRF
        S    P+AT GKS  K K +  RA   + P   AR L+CEGE+E A+ +LRNADPLLA LID+H  PTF+SF+TPFLAL R+ILYQQLA KAG SIYTRF
Subjt:  SDGPKPIAT-GKS-NKSKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRF

Query:  IALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG
        ++LCGGE  V+PETVL+LNPQQLRQIG+SGRK+SYLHDLARKYQNGILSD AI+NMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL VRKG
Subjt:  IALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG

Query:  VQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHHQEHQHPQHPQQPQLLDPLNSILNLG
        VQLLY L++LPRPSQM+Q C KWRPYRSVGSWYMWRL EAK  S+S AAVAAG SL    ++  QEH      QQ QL+DPLN + ++G
Subjt:  VQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHHQEHQHPQHPQQPQLLDPLNSILNLG

AT1G19480.2 DNA glycosylase superfamily protein2.8e-11160.93Show/hide
Query:  MGEQTQVQVQTQTQTQSQPQSQAQNTLH-----------ESSNSTTPIAQATVMLSEVMNVPSQTSSPPSKMPLRPRKIRKLSPE----ESDPTSHVVAI
        MGEQ+    Q  TQ QS PQS   +T +           +S+  +  I  +T + +  +      SSPPSK+PLRPRKIRKL+ +      D  +  ++ 
Subjt:  MGEQTQVQVQTQTQTQSQPQSQAQNTLH-----------ESSNSTTPIAQATVMLSEVMNVPSQTSSPPSKMPLRPRKIRKLSPE----ESDPTSHVVAI

Query:  SDGPKPIAT-GKS-NKSKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRF
        S    P+AT GKS  K K +  RA   + P   AR L+CEGE+E A+ +LRNADPLLA LID+H  PTF+SF+TPFLAL R+ILYQQLA KAG SIYTRF
Subjt:  SDGPKPIAT-GKS-NKSKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRF

Query:  IALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG
        ++LCGGE  V+PETVL+LNPQQLRQIG+SGRK+SYLHDLARKYQNGILSD AI+NMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL VRKG
Subjt:  IALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKG

Query:  VQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHHQEHQHPQHPQQPQLLDPLNSILNLG
        VQLLY L++LPRPSQM+Q C KWRPYRSVGSWYMWRL EAK  S+S AAVAAG SL    ++  QEH      QQ QL+DPLN + ++G
Subjt:  VQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHHQEHQHPQHPQQPQLLDPLNSILNLG

AT1G75230.1 DNA glycosylase superfamily protein1.9e-11558.84Show/hide
Query:  MGEQTQVQVQTQTQTQSQPQSQAQNT-------LHESSNSTTPIAQATVMLSEVMNVPSQT-----SSPPSKMPLRPRKIRKLSPEESDPTSHVVAISDG
        MGE +  Q  + T   +QP+S    T        ++  ++++     +++ S  +  P  T     SSPP+K+PLRPRKIRKLSP++          SDG
Subjt:  MGEQTQVQVQTQTQTQSQPQSQAQNT-------LHESSNSTTPIAQATVMLSEVMNVPSQT-----SSPPSKMPLRPRKIRKLSPEESDPTSHVVAISDG

Query:  PKP-------IATGKSNKSKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYT
          P         T  + KSK +Q R    + P   ARSL+CEGE+E AL HLR+ DPLLA LID+H  PTF++FQTPFLAL RSILYQQLA KAG SIYT
Subjt:  PKP-------IATGKSNKSKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYT

Query:  RFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVR
        RF+ALCGGE GV+PE VL L PQQLRQIG+SGRK+SYLHDLARKYQNGILSD  IVNMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL VR
Subjt:  RFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVR

Query:  KGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQ----QEHHQEHQHPQH-PQQPQLLDPLNSILNLG
        KGVQ+L  +E+LPRPS+M+QLCEKWRPYRSV SWY+WRL E+K    +AAA  AGA+L   Q    Q+  QE QH QH  QQPQL+DPLN++ ++G
Subjt:  KGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQ----QEHHQEHQHPQH-PQQPQLLDPLNSILNLG

AT1G75230.2 DNA glycosylase superfamily protein1.1e-11558.5Show/hide
Query:  MGEQTQVQVQTQTQTQSQPQSQAQNT-------LHESSNSTTPIAQATVMLSEVMNVPSQT-----SSPPSKMPLRPRKIRKLSPEESDPTSHVVAISDG
        MGE +  Q  + T   +QP+S    T        ++  ++++     +++ S  +  P  T     SSPP+K+PLRPRKIRKLSP++          SDG
Subjt:  MGEQTQVQVQTQTQTQSQPQSQAQNT-------LHESSNSTTPIAQATVMLSEVMNVPSQT-----SSPPSKMPLRPRKIRKLSPEESDPTSHVVAISDG

Query:  PKP-------IATGKSNKSKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYT
          P         T  + KSK +Q R    + P   ARSL+CEGE+E AL HLR+ DPLLA LID+H  PTF++FQTPFLAL RSILYQQLA KAG SIYT
Subjt:  PKP-------IATGKSNKSKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYT

Query:  RFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVR
        RF+ALCGGE GV+PE VL L PQQLRQIG+SGRK+SYLHDLARKYQNGILSD  IVNMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL VR
Subjt:  RFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVR

Query:  KGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQ----QEHHQEHQHPQH-PQQPQLLDPLNSILNLGLGTL
        KGVQ+L  +E+LPRPS+M+QLCEKWRPYRSV SWY+WRL E+K    +AAA  AGA+L   Q    Q+  QE QH QH  QQPQL+DPLN++ ++G   L
Subjt:  KGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQ----QEHHQEHQHPQH-PQQPQLLDPLNSILNLGLGTL

AT3G50880.1 DNA glycosylase superfamily protein2.5e-7554.91Show/hide
Query:  SQTSSPPSKMPLRPRKIRKLSPEESDPTSHVVAISDGPKPIATGKSNKSKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLH-QRP
        S+ S   S++  RPRKIRK+S   SDP+  ++  +  P                              LS +  V+IALRHL+++D LL  LI  H   P
Subjt:  SQTSSPPSKMPLRPRKIRKLSPEESDPTSHVVAISDGPKPIATGKSNKSKTAQQRAAFASAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLH-QRP

Query:  TFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALC-GGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLT
         FDS  TPFL+L RSILYQQLA KA   IY RFI+L  GGEAGV+PE+V++L+   LR+IG+SGRK+SYLHDLA KY NG+LSD+ I+ M D+ L   LT
Subjt:  TFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALC-GGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDQAIVNMDDKSLFTMLT

Query:  MVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAK
        +V GIG W+VHMFMIFSLHRPDVLP+ DL VRKGV+ LY L+ LP P QM+QLCEKWRPYRSVGSWYMWRL E++
Subjt:  MVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGAGCAAACACAAGTGCAGGTTCAGACTCAGACCCAAACCCAATCTCAACCGCAATCGCAGGCTCAGAATACGCTTCATGAATCCTCCAACTCCACAACCCCTAT
CGCCCAAGCCACTGTAATGTTAAGCGAGGTGATGAATGTGCCATCGCAAACCTCTTCTCCCCCATCCAAAATGCCCTTGCGTCCGCGGAAGATCCGAAAGCTCTCGCCGG
AGGAATCAGATCCAACCTCTCATGTTGTTGCCATTTCGGATGGACCGAAACCTATCGCCACCGGCAAATCTAACAAGAGCAAGACGGCCCAACAACGCGCCGCCTTCGCA
TCTGCCCCAGTACCGCCTGCCCGATCCCTTTCCTGCGAGGGCGAGGTGGAAATCGCCCTTCGGCATCTTCGGAATGCGGATCCACTCCTTGCACAGTTGATCGACCTCCA
TCAACGTCCTACCTTCGACAGTTTTCAAACCCCATTTCTTGCCCTAACTAGAAGTATCCTATATCAGCAGTTAGCGTACAAAGCCGGCACTTCTATCTACACCCGTTTCA
TCGCCCTTTGTGGTGGCGAGGCTGGTGTTCTTCCTGAAACCGTTCTTGCCTTGAACCCTCAACAACTCAGGCAAATTGGAATTTCGGGTCGTAAATCTAGTTACCTTCAC
GACCTTGCGAGGAAATACCAAAATGGGATTCTTTCAGACCAGGCAATTGTAAATATGGATGATAAATCGCTTTTCACGATGCTCACGATGGTCAATGGAATTGGGTCTTG
GTCTGTTCATATGTTCATGATTTTCTCGCTGCACAGACCAGATGTTCTTCCTATCAACGATCTTAATGTTCGCAAAGGGGTTCAGCTTCTTTACAATCTCGAAGAGTTGC
CTCGGCCATCGCAGATGGATCAGTTATGCGAGAAGTGGAGGCCGTATCGATCGGTTGGGTCGTGGTATATGTGGAGGCTCGCTGAGGCCAAGGGGGCTTCTTCAAGTGCT
GCAGCAGTGGCTGCTGGTGCCAGTTTACAACTGCAGCAACAGGAGCACCACCAGGAACACCAGCATCCACAGCATCCACAGCAGCCGCAACTTCTTGACCCGCTTAATAG
CATTCTCAATCTTGGGCTAGGCACGCTTTTCAGAAAATCCAAAGTCGAGGGCCTGTGCTTGGGGGCAGTGACTCAGATTGAAAAGAGTACATCTCTGCAGATATCCCAAT
CAATTTATCCACTTAATGAAAAGTATGCCAATTATGTGGGAGTAGAAGACTCAACAATGAATATTTGGTTCCTCGAGGTTGGATACTCTAGACTTCCATACTAA
mRNA sequenceShow/hide mRNA sequence
GGGAAATCGAAAGCCAGTTGGGTTAGAGAACTGAAGGAGCGAGCGACAGGCAGAAACAGAGAAGAGAAAGTGCCTGAGAGAAAGAACAACAGCGCCGGGATTTCCTTCTC
CTCATTCGCTATGCCATTGATAACCTAATTCGAAATCCCTAACCTCCTCAATTTCACACTTAATTTTCAATTTCTCTTTCTGTTTGATCAATCAACTTCTTGCTCTTTCA
TCCGGGCATCGAGTTTTGCCCTCTTTCTGATTCTCCGCCGCTATCGTCGCCTCCGATTCTGGGCGGTGTATTGTGTTTCCTTTCGACCGAATTGTCTGAGTGAAGGGGAA
AAATACAGACTGATTTGAATCTCTTACATATGGGAGAGCAAACACAAGTGCAGGTTCAGACTCAGACCCAAACCCAATCTCAACCGCAATCGCAGGCTCAGAATACGCTT
CATGAATCCTCCAACTCCACAACCCCTATCGCCCAAGCCACTGTAATGTTAAGCGAGGTGATGAATGTGCCATCGCAAACCTCTTCTCCCCCATCCAAAATGCCCTTGCG
TCCGCGGAAGATCCGAAAGCTCTCGCCGGAGGAATCAGATCCAACCTCTCATGTTGTTGCCATTTCGGATGGACCGAAACCTATCGCCACCGGCAAATCTAACAAGAGCA
AGACGGCCCAACAACGCGCCGCCTTCGCATCTGCCCCAGTACCGCCTGCCCGATCCCTTTCCTGCGAGGGCGAGGTGGAAATCGCCCTTCGGCATCTTCGGAATGCGGAT
CCACTCCTTGCACAGTTGATCGACCTCCATCAACGTCCTACCTTCGACAGTTTTCAAACCCCATTTCTTGCCCTAACTAGAAGTATCCTATATCAGCAGTTAGCGTACAA
AGCCGGCACTTCTATCTACACCCGTTTCATCGCCCTTTGTGGTGGCGAGGCTGGTGTTCTTCCTGAAACCGTTCTTGCCTTGAACCCTCAACAACTCAGGCAAATTGGAA
TTTCGGGTCGTAAATCTAGTTACCTTCACGACCTTGCGAGGAAATACCAAAATGGGATTCTTTCAGACCAGGCAATTGTAAATATGGATGATAAATCGCTTTTCACGATG
CTCACGATGGTCAATGGAATTGGGTCTTGGTCTGTTCATATGTTCATGATTTTCTCGCTGCACAGACCAGATGTTCTTCCTATCAACGATCTTAATGTTCGCAAAGGGGT
TCAGCTTCTTTACAATCTCGAAGAGTTGCCTCGGCCATCGCAGATGGATCAGTTATGCGAGAAGTGGAGGCCGTATCGATCGGTTGGGTCGTGGTATATGTGGAGGCTCG
CTGAGGCCAAGGGGGCTTCTTCAAGTGCTGCAGCAGTGGCTGCTGGTGCCAGTTTACAACTGCAGCAACAGGAGCACCACCAGGAACACCAGCATCCACAGCATCCACAG
CAGCCGCAACTTCTTGACCCGCTTAATAGCATTCTCAATCTTGGGCTAGGCACGCTTTTCAGAAAATCCAAAGTCGAGGGCCTGTGCTTGGGGGCAGTGACTCAGATTGA
AAAGAGTACATCTCTGCAGATATCCCAATCAATTTATCCACTTAATGAAAAGTATGCCAATTATGTGGGAGTAGAAGACTCAACAATGAATATTTGGTTCCTCGAGGTTG
GATACTCTAGACTTCCATACTAAATAATGAAATGACGTGCTAGTTTTCTTTTCTAGGTTGGTTGACATCACTTTTAAGGCCGTCAACATGTATCATGGGGTGGGGATTTA
TTGATAGAATGTTCCTCTGAATGTCTGCTTTCAATGTGATCTAGCACAGTTGGACTTGGAGAAAATCGCTTGTTCATTTGCGCATATTCAACGGCAGAACAAACTATCTG
TAATGACTTTATGATCTATCTCTCACCCTTTTCTCCATTTCAACTTTACTGCCACCTTGGTCATAGTTTGTATATAGAAAAGGATCATGAATGCGGATAATCAATGACTG
ACTGGCTGTGGCATTTGCTATTCAGACTCGTATTCTTATCTTATCTTATCTTTCCTCCTTTGTTCATCATGGTGGGAGAAAAAAATATTGCCCAGTAGCCCCATTCATCC
TTCTTTCCTTTGAAAATATATATGATGTT
Protein sequenceShow/hide protein sequence
MGEQTQVQVQTQTQTQSQPQSQAQNTLHESSNSTTPIAQATVMLSEVMNVPSQTSSPPSKMPLRPRKIRKLSPEESDPTSHVVAISDGPKPIATGKSNKSKTAQQRAAFA
SAPVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLH
DLARKYQNGILSDQAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSA
AAVAAGASLQLQQQEHHQEHQHPQHPQQPQLLDPLNSILNLGLGTLFRKSKVEGLCLGAVTQIEKSTSLQISQSIYPLNEKYANYVGVEDSTMNIWFLEVGYSRLPY