; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036844 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036844
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDNA-3-methyladenine glycosylase 1
Genome locationchr2:1588891..1592797
RNA-Seq ExpressionLag0036844
SyntenyLag0036844
Gene Ontology termsGO:0006285 - base-excision repair, AP site formation (biological process)
GO:0006307 - DNA dealkylation involved in DNA repair (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0032993 - protein-DNA complex (cellular component)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
GO:0032131 - alkylated DNA binding (molecular function)
GO:0043916 - DNA-7-methylguanine glycosylase activity (molecular function)
InterPro domainsIPR003265 - HhH-GPD domain
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142016.1 uncharacterized protein LOC111012247 [Momordica charantia]1.0e-19394.41Show/hide
Query:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHESSNSTTPIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVNISDGPKPIAAGKSNK
        MGEQTQVQVQTQTQSQSQ QSQAQNT+H+SSNSTT IAQATVALSEVMNAP+QTSSPPSKMPLRPRKIRKLSPDESD NSSQI  ++DGPKPI++GKSNK
Subjt:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHESSNSTTPIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVNISDGPKPIAAGKSNK

Query:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        +KTAQQRAAFASAP+LPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRP FDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        +LNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQPEHQHPQQSQLLDPLNSILNLGACAWGQ
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAA+AAGASLQLQQQEHQ EHQH QQ QLLDP+NSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQPEHQHPQQSQLLDPLNSILNLGACAWGQ

XP_022936456.1 probable DNA-3-methyladenine glycosylase 2 [Cucurbita moschata]1.6e-19495.21Show/hide
Query:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHESSNSTTPIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVNISDGPKPIAAGKSNK
        MGE TQVQVQTQTQSQSQAQSQAQNT HESSNSTT IAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQ+V I DGPKPIA  KSNK
Subjt:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHESSNSTTPIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVNISDGPKPIAAGKSNK

Query:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTAQQRAAFASAPV+ ARSLSCEGEVE+ALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        AL+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQPEHQHPQQSQLLDPLNSILNLGACAWGQ
        D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQEHQP+HQH QQ QLLDPLNSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQPEHQHPQQSQLLDPLNSILNLGACAWGQ

XP_022976000.1 probable DNA-3-methyladenine glycosylase 2 [Cucurbita maxima]1.6e-19495.21Show/hide
Query:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHESSNSTTPIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVNISDGPKPIAAGKSNK
        MGE TQVQVQTQTQSQSQAQSQAQNT HESSNSTT IAQATV+LSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQ+V I DGPKPIA  KSNK
Subjt:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHESSNSTTPIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVNISDGPKPIAAGKSNK

Query:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTAQQRAAFASAPV+ ARSLSCEGEVE+ALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        AL+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQPEHQHPQQSQLLDPLNSILNLGACAWGQ
        D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQEHQPEHQH QQ QLLDPLNSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQPEHQHPQQSQLLDPLNSILNLGACAWGQ

XP_023536439.1 probable DNA-3-methyladenine glycosylase 2 [Cucurbita pepo subsp. pepo]7.3e-19595.48Show/hide
Query:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHESSNSTTPIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVNISDGPKPIAAGKSNK
        MGE TQVQVQTQTQSQSQAQSQAQNT HESSNSTT IAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQ+V I DGPKPIA  KSNK
Subjt:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHESSNSTTPIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVNISDGPKPIAAGKSNK

Query:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTAQQRAAFASAPV+ ARSLSCEGEVE+ALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        AL+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQPEHQHPQQSQLLDPLNSILNLGACAWGQ
        D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQEHQPEHQH QQ QLLDPLNSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQPEHQHPQQSQLLDPLNSILNLGACAWGQ

XP_038904569.1 DNA-3-methyladenine glycosylase 1-like [Benincasa hispida]8.9e-19394.99Show/hide
Query:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHESSNSTTPIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVNISDGPKPIAAGKSNK
        MGEQTQVQVQTQTQSQSQ QSQ QNTLHESSNSTTPIAQATV LSEVMNAPSQTSSPPSKMPLRPRKIRKLSP+ESDPNSS  + I DGPKPIA GKSNK
Subjt:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHESSNSTTPIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVNISDGPKPIAAGKSNK

Query:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKT QQRAAFASAPV PARSLSCEGEVEIALRHLRNADPLLA LIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQPEH---QHPQQSQLLDPLNSILNLGACAWGQ
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEH  EH   QHPQQ QLLDPLNSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQPEH---QHPQQSQLLDPLNSILNLGACAWGQ

TrEMBL top hitse value%identityAlignment
A0A0A0LIY1 ENDO3c domain-containing protein2.2e-18992.93Show/hide
Query:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHESSNSTTPIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVNISDGPKPIAAGKSNK
        MGEQTQVQVQTQTQSQ Q QSQAQNT HESSNSTTPIAQATV LSEVMNAPSQ SSPPSKMPLRPRKIRKLSP+ESDPNSS +V I DGPKPIA  KSNK
Subjt:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHESSNSTTPIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVNISDGPKPIAAGKSNK

Query:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTA QRAAFASA V PARSLSCEGEVEIALRHLRNADPLLA LIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQPEH------QHPQQSQLLDPLNSILNLGACAWGQ
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQ Q+H  EH      QHPQQ QLLDPLNSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQPEH------QHPQQSQLLDPLNSILNLGACAWGQ

A0A1S3CRJ5 DNA-3-methyladenine glycosylase 16.4e-18993.4Show/hide
Query:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHESSNSTTPIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVNISDGPKPIAAGKSNK
        MGEQTQVQVQTQTQSQ Q QSQAQNT HESSNSTTPIAQATV LSEVMNAPSQ SSPPSKMPLRPRKIRKLSP+ESDPNSS +V I DGPKPIA  KSNK
Subjt:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHESSNSTTPIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVNISDGPKPIAAGKSNK

Query:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTAQQRAAFASA V  ARSLSCEGEVEIALRHLRNADPLLA LIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        +LNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQPEH---QHPQQSQLLDPLNSILNLGACAWGQ
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQ+H  EH   QHPQQ QLLDPLN ILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQPEH---QHPQQSQLLDPLNSILNLGACAWGQ

A0A6J1CKY0 uncharacterized protein LOC1110122475.1e-19494.41Show/hide
Query:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHESSNSTTPIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVNISDGPKPIAAGKSNK
        MGEQTQVQVQTQTQSQSQ QSQAQNT+H+SSNSTT IAQATVALSEVMNAP+QTSSPPSKMPLRPRKIRKLSPDESD NSSQI  ++DGPKPI++GKSNK
Subjt:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHESSNSTTPIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVNISDGPKPIAAGKSNK

Query:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        +KTAQQRAAFASAP+LPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRP FDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        +LNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQPEHQHPQQSQLLDPLNSILNLGACAWGQ
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAA+AAGASLQLQQQEHQ EHQH QQ QLLDP+NSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQPEHQHPQQSQLLDPLNSILNLGACAWGQ

A0A6J1F7I4 probable DNA-3-methyladenine glycosylase 27.8e-19595.21Show/hide
Query:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHESSNSTTPIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVNISDGPKPIAAGKSNK
        MGE TQVQVQTQTQSQSQAQSQAQNT HESSNSTT IAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQ+V I DGPKPIA  KSNK
Subjt:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHESSNSTTPIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVNISDGPKPIAAGKSNK

Query:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTAQQRAAFASAPV+ ARSLSCEGEVE+ALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        AL+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQPEHQHPQQSQLLDPLNSILNLGACAWGQ
        D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQEHQP+HQH QQ QLLDPLNSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQPEHQHPQQSQLLDPLNSILNLGACAWGQ

A0A6J1IIA5 probable DNA-3-methyladenine glycosylase 27.8e-19595.21Show/hide
Query:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHESSNSTTPIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVNISDGPKPIAAGKSNK
        MGE TQVQVQTQTQSQSQAQSQAQNT HESSNSTT IAQATV+LSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQ+V I DGPKPIA  KSNK
Subjt:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHESSNSTTPIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVNISDGPKPIAAGKSNK

Query:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTAQQRAAFASAPV+ ARSLSCEGEVE+ALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        AL+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQPEHQHPQQSQLLDPLNSILNLGACAWGQ
        D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQEHQPEHQH QQ QLLDPLNSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQPEHQHPQQSQLLDPLNSILNLGACAWGQ

SwissProt top hitse value%identityAlignment
O31544 Putative DNA-3-methyladenine glycosylase YfjP2.3e-1830.72Show/hide
Query:  LTRSILYQQLAYKAGTSIYTRFIALCGGEA-GV----LPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGS
        + + I++QQL      ++  RF+   G +  GV     PET+  L+ Q LR +  S RK+ Y  D +R    G LS   + +M D+ +   L  + GIG 
Subjt:  LTRSILYQQLAYKAGTSIYTRFIALCGGEA-GV----LPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGS

Query:  WSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAE
        W+V   ++F L RP++ P+ D+ ++  ++  + L++ P    M  + ++W PY S  S Y+WR  E
Subjt:  WSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAE

O94468 Alkylbase DNA glycosidase-like protein mag23.3e-1724.88Show/hide
Query:  LSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSF--QTPFLALTRSILYQQLAYKAGTSIYTRFIALCG-GEAGVLPETVLALNPQQLRQIGISGRKS
        +S + + + A +HL + D   + L+      T        P+  + R+I  Q+L+  A  SI  +F   C   +    P+ ++  + + L + G S  KS
Subjt:  LSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSF--QTPFLALTRSILYQQLAYKAGTSIYTRFIALCG-GEAGVLPETVLALNPQQLRQIGISGRKS

Query:  SYLHDLARKYQN-GILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSW
          +H +A    N  I S   I  M ++ L   L+ + G+  W++ M+ IF+L R D++P +D  ++   +  + L   P+  ++++L +  +PYR++ +W
Subjt:  SYLHDLARKYQN-GILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSW

Query:  YMWRL
        Y+W++
Subjt:  YMWRL

P37878 DNA-3-methyladenine glycosylase5.5e-1225.73Show/hide
Query:  FLALTRSILYQQLAYKAGTSIYTRFIALCGG-------EAGVLP--ETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLT
        F AL   +L QQ+      S+  +F+   G        +  V P  E +  L P  L  I ++ +KS Y+  +AR   +G LS   ++ M+ K     L 
Subjt:  FLALTRSILYQQLAYKAGTSIYTRFIALCGG-------EAGVLP--ETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLT

Query:  MVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRL
         + GIG W+ +  ++  L  P   PI+D+ +   +++L N+   P   ++ ++   W+ ++S  ++Y+WR+
Subjt:  MVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRL

Q92383 DNA-3-methyladenine glycosylase 11.9e-2030.72Show/hide
Query:  PFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGIL-SDPAIVNMDDKSLFTMLTMVNGIGS
        P+  L R++  QQL  KA  +I+ RF ++        PE +  ++ + +R  G S RK   L  +A    +G++ +      + ++ L   LT + GIG 
Subjt:  PFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGIL-SDPAIVNMDDKSLFTMLTMVNGIGS

Query:  WSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAE
        W+V M +IFSL+R DV+P +DL++R G + L+ L ++P    + +  E   P+R+  +WY+W+ ++
Subjt:  WSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAE

Arabidopsis top hitse value%identityAlignment
AT1G19480.1 DNA glycosylase superfamily protein1.5e-11360.77Show/hide
Query:  MGEQTQVQVQTQTQSQSQAQS---------QAQNTLHESSNSTTPIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPD---ESDPNSSQIVNISD
        MGEQ+  Q  TQ QS  Q+           ++ +   +S+  +  I  +T   +  +      SSPPSK+PLRPRKIRKL+ D     +   ++ ++ S 
Subjt:  MGEQTQVQVQTQTQSQSQAQS---------QAQNTLHESSNSTTPIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPD---ESDPNSSQIVNISD

Query:  GPKPIAA-GKS-NKSKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIA
           P+A  GKS  K K +  RA   + P + AR L+CEGE+E A+ +LRNADPLLA LID+H  PTF+SF+TPFLAL R+ILYQQLA KAG SIYTRF++
Subjt:  GPKPIAA-GKS-NKSKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIA

Query:  LCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQ
        LCGGE  V+PETVL+LNPQQLRQIG+SGRK+SYLHDLARKYQNGILSD AI+NMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL VRKGVQ
Subjt:  LCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQ

Query:  LLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQPEHQHPQQSQLLDPLNSILNLGACAWGQ
        LLY L++LPRPSQM+Q C KWRPYRSVGSWYMWRL EAK  S+S AAVAAG SL    ++ Q EH   QQ QL+DPLN + ++G  AWGQ
Subjt:  LLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQPEHQHPQQSQLLDPLNSILNLGACAWGQ

AT1G19480.2 DNA glycosylase superfamily protein6.3e-11260.68Show/hide
Query:  MGEQTQVQVQTQTQSQSQAQS---------QAQNTLHESSNSTTPIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPD---ESDPNSSQIVNISD
        MGEQ+  Q  TQ QS  Q+           ++ +   +S+  +  I  +T   +  +      SSPPSK+PLRPRKIRKL+ D     +   ++ ++ S 
Subjt:  MGEQTQVQVQTQTQSQSQAQS---------QAQNTLHESSNSTTPIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPD---ESDPNSSQIVNISD

Query:  GPKPIAA-GKS-NKSKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIA
           P+A  GKS  K K +  RA   + P + AR L+CEGE+E A+ +LRNADPLLA LID+H  PTF+SF+TPFLAL R+ILYQQLA KAG SIYTRF++
Subjt:  GPKPIAA-GKS-NKSKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIA

Query:  LCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQ
        LCGGE  V+PETVL+LNPQQLRQIG+SGRK+SYLHDLARKYQNGILSD AI+NMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL VRKGVQ
Subjt:  LCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQ

Query:  LLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQPEHQHPQQSQLLDPLNSILNLG
        LLY L++LPRPSQM+Q C KWRPYRSVGSWYMWRL EAK  S+S AAVAAG SL    ++ Q EH   QQ QL+DPLN + ++G
Subjt:  LLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQPEHQHPQQSQLLDPLNSILNLG

AT1G75230.1 DNA glycosylase superfamily protein3.6e-11558.06Show/hide
Query:  MGEQTQVQVQTQTQSQSQAQSQAQNT-------LHESSNSTTPIAQATVALSEVMNAPSQT-----SSPPSKMPLRPRKIRKLSPDESDPNSSQIVNISD
        MGE +  Q  + T   +Q +S    T        ++  ++++     ++  S  + AP  T     SSPP+K+PLRPRKIRKLSPD+         + SD
Subjt:  MGEQTQVQVQTQTQSQSQAQSQAQNT-------LHESSNSTTPIAQATVALSEVMNAPSQT-----SSPPSKMPLRPRKIRKLSPDESDPNSSQIVNISD

Query:  GPKP-------IAAGKSNKSKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIY
        G  P            + KSK +Q R    + P + ARSL+CEGE+E AL HLR+ DPLLA LID+H  PTF++FQTPFLAL RSILYQQLA KAG SIY
Subjt:  GPKP-------IAAGKSNKSKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIY

Query:  TRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNV
        TRF+ALCGGE GV+PE VL L PQQLRQIG+SGRK+SYLHDLARKYQNGILSD  IVNMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL V
Subjt:  TRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNV

Query:  RKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASL------QLQQQEHQPEHQ--HPQQSQLLDPLNSILNLGACA
        RKGVQ+L  +E+LPRPS+M+QLCEKWRPYRSV SWY+WRL E+K    +AAA  AGA+L       +QQQE + +HQ    QQ QL+DPLN++ ++G  A
Subjt:  RKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASL------QLQQQEHQPEHQ--HPQQSQLLDPLNSILNLGACA

Query:  WGQ
        WGQ
Subjt:  WGQ

AT1G75230.2 DNA glycosylase superfamily protein1.5e-11357.93Show/hide
Query:  MGEQTQVQVQTQTQSQSQAQSQAQNT-------LHESSNSTTPIAQATVALSEVMNAPSQT-----SSPPSKMPLRPRKIRKLSPDESDPNSSQIVNISD
        MGE +  Q  + T   +Q +S    T        ++  ++++     ++  S  + AP  T     SSPP+K+PLRPRKIRKLSPD+         + SD
Subjt:  MGEQTQVQVQTQTQSQSQAQSQAQNT-------LHESSNSTTPIAQATVALSEVMNAPSQT-----SSPPSKMPLRPRKIRKLSPDESDPNSSQIVNISD

Query:  GPKP-------IAAGKSNKSKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIY
        G  P            + KSK +Q R    + P + ARSL+CEGE+E AL HLR+ DPLLA LID+H  PTF++FQTPFLAL RSILYQQLA KAG SIY
Subjt:  GPKP-------IAAGKSNKSKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIY

Query:  TRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNV
        TRF+ALCGGE GV+PE VL L PQQLRQIG+SGRK+SYLHDLARKYQNGILSD  IVNMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL V
Subjt:  TRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNV

Query:  RKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASL------QLQQQEHQPEHQ--HPQQSQLLDPLNSILNLG
        RKGVQ+L  +E+LPRPS+M+QLCEKWRPYRSV SWY+WRL E+K    +AAA  AGA+L       +QQQE + +HQ    QQ QL+DPLN++ ++G
Subjt:  RKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASL------QLQQQEHQPEHQ--HPQQSQLLDPLNSILNLG

AT3G50880.1 DNA glycosylase superfamily protein3.0e-7455.8Show/hide
Query:  SQTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVNISDGPKPIAAGKSNKSKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLH-QR
        S+ S   S++  RPRKIRK+S   SDP+   I+                          AS P      LS +  V+IALRHL+++D LL  LI  H   
Subjt:  SQTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVNISDGPKPIAAGKSNKSKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLH-QR

Query:  PTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALC-GGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTML
        P FDS  TPFL+L RSILYQQLA KA   IY RFI+L  GGEAGV+PE+V++L+   LR+IG+SGRK+SYLHDLA KY NG+LSD  I+ M D+ L   L
Subjt:  PTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALC-GGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTML

Query:  TMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAK
        T+V GIG W+VHMFMIFSLHRPDVLP+ DL VRKGV+ LY L+ LP P QM+QLCEKWRPYRSVGSWYMWRL E++
Subjt:  TMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGAGCAAACGCAAGTGCAGGTTCAGACTCAGACCCAAAGCCAATCTCAAGCGCAATCGCAGGCTCAGAACACGCTTCATGAATCCTCCAACTCCACAACCCCTAT
CGCCCAAGCCACTGTAGCACTAAGCGAGGTGATGAATGCTCCATCGCAAACCTCTTCTCCGCCATCTAAAATGCCCTTGCGTCCACGCAAGATCCGAAAGCTCTCGCCCG
ATGAATCGGATCCAAATTCCTCTCAGATTGTCAACATTTCGGATGGGCCGAAACCTATCGCCGCCGGAAAATCTAACAAGAGCAAGACGGCCCAACAACGCGCCGCCTTC
GCGTCTGCCCCGGTACTGCCTGCCCGATCACTCTCCTGCGAAGGCGAGGTGGAAATCGCGCTTCGGCACCTCCGGAATGCCGATCCGCTCCTTGCACCTTTGATCGACCT
CCATCAACGTCCTACCTTCGATAGTTTCCAAACCCCATTTCTTGCCCTAACTAGAAGTATCCTATATCAGCAGCTGGCTTACAAGGCTGGCACTTCAATCTACACCCGTT
TCATCGCTCTCTGTGGCGGCGAGGCTGGTGTTCTTCCCGAAACCGTTCTTGCCTTGAACCCTCAACAGCTGAGGCAAATTGGAATTTCGGGTCGTAAATCTAGTTACCTT
CATGACCTTGCGAGGAAATATCAAAATGGGATTCTTTCAGACCCAGCAATTGTAAATATGGATGATAAATCGCTTTTCACGATGCTTACAATGGTCAATGGAATTGGATC
TTGGTCTGTTCATATGTTCATGATTTTCTCGCTGCACAGACCAGATGTGCTTCCTATCAACGATCTTAATGTTCGGAAAGGTGTTCAGCTTCTCTACAATCTTGAAGAGT
TGCCTCGACCATCACAAATGGATCAGTTATGCGAGAAGTGGAGGCCGTATCGATCCGTTGGGTCGTGGTATATGTGGAGGCTTGCTGAGGCAAAGGGGGCTTCTTCAAGC
GCTGCAGCGGTGGCTGCTGGTGCTAGTTTACAACTGCAGCAACAAGAGCACCAGCCGGAGCACCAGCATCCACAGCAGTCGCAGCTTCTTGACCCACTCAATAGCATTCT
CAATCTTGGGGCCTGTGCTTGGGGGCAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGAGCAAACGCAAGTGCAGGTTCAGACTCAGACCCAAAGCCAATCTCAAGCGCAATCGCAGGCTCAGAACACGCTTCATGAATCCTCCAACTCCACAACCCCTAT
CGCCCAAGCCACTGTAGCACTAAGCGAGGTGATGAATGCTCCATCGCAAACCTCTTCTCCGCCATCTAAAATGCCCTTGCGTCCACGCAAGATCCGAAAGCTCTCGCCCG
ATGAATCGGATCCAAATTCCTCTCAGATTGTCAACATTTCGGATGGGCCGAAACCTATCGCCGCCGGAAAATCTAACAAGAGCAAGACGGCCCAACAACGCGCCGCCTTC
GCGTCTGCCCCGGTACTGCCTGCCCGATCACTCTCCTGCGAAGGCGAGGTGGAAATCGCGCTTCGGCACCTCCGGAATGCCGATCCGCTCCTTGCACCTTTGATCGACCT
CCATCAACGTCCTACCTTCGATAGTTTCCAAACCCCATTTCTTGCCCTAACTAGAAGTATCCTATATCAGCAGCTGGCTTACAAGGCTGGCACTTCAATCTACACCCGTT
TCATCGCTCTCTGTGGCGGCGAGGCTGGTGTTCTTCCCGAAACCGTTCTTGCCTTGAACCCTCAACAGCTGAGGCAAATTGGAATTTCGGGTCGTAAATCTAGTTACCTT
CATGACCTTGCGAGGAAATATCAAAATGGGATTCTTTCAGACCCAGCAATTGTAAATATGGATGATAAATCGCTTTTCACGATGCTTACAATGGTCAATGGAATTGGATC
TTGGTCTGTTCATATGTTCATGATTTTCTCGCTGCACAGACCAGATGTGCTTCCTATCAACGATCTTAATGTTCGGAAAGGTGTTCAGCTTCTCTACAATCTTGAAGAGT
TGCCTCGACCATCACAAATGGATCAGTTATGCGAGAAGTGGAGGCCGTATCGATCCGTTGGGTCGTGGTATATGTGGAGGCTTGCTGAGGCAAAGGGGGCTTCTTCAAGC
GCTGCAGCGGTGGCTGCTGGTGCTAGTTTACAACTGCAGCAACAAGAGCACCAGCCGGAGCACCAGCATCCACAGCAGTCGCAGCTTCTTGACCCACTCAATAGCATTCT
CAATCTTGGGGCCTGTGCTTGGGGGCAGTGA
Protein sequenceShow/hide protein sequence
MGEQTQVQVQTQTQSQSQAQSQAQNTLHESSNSTTPIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVNISDGPKPIAAGKSNKSKTAQQRAAF
ASAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYL
HDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSS
AAAVAAGASLQLQQQEHQPEHQHPQQSQLLDPLNSILNLGACAWGQ