CuGenDBv2

MC11g0162 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC11g0162
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: DNA-3-methyladenine glycosylase 1
Genome location: MC11:1193414..1199076
RNA-Seq Expression: MC11g0162
Synteny: MC11g0162
Gene Ontology terms:
GO:0006285 - base-excision repair, AP site formation (biological process)
GO:0006307 - DNA dealkylation involved in DNA repair (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0032993 - protein-DNA complex (cellular component)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
GO:0032131 - alkylated DNA binding (molecular function)
GO:0043916 - DNA-7-methylguanine glycosylase activity (molecular function)
InterPro domains:
IPR003265 - HhH-GPD domain
IPR011257 - DNA glycosylase
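
The genome location field follows a `sequence:start..end` pattern. As a minimal sketch of how such a string could be split into its parts, the Python below parses it and reports the genomic span. It assumes the coordinates are 1-based and inclusive, which this page does not state; `Location` and `parse_location` are hypothetical names introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive, so both ends count.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Split a string such as 'MC11:1193414..1199076' into its components."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(seqid, start, end)


loc = parse_location("MC11:1193414..1199076")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span covers the whole gene region on MC11, including any introns, so it need not equal the length of the mRNA or CDS listed further down.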


Homology
GenBank top hits | e value | % identity
XP_022142016.1 uncharacterized protein LOC111012247 [Momordica charantia] | 1.10e-261 | 100
Query:  MGEQTQVQVQTQTQSQSQPQSQAQNTIHDSSNSTTSIAQATVALSEVMNAPTQTSSPPSKMPLRPRKIRKLSPDESDANSSQIAPMTDGPKPISSGKSNK
        MGEQTQVQVQTQTQSQSQPQSQAQNTIHDSSNSTTSIAQATVALSEVMNAPTQTSSPPSKMPLRPRKIRKLSPDESDANSSQIAPMTDGPKPISSGKSNK
Subjt:  MGEQTQVQVQTQTQSQSQPQSQAQNTIHDSSNSTTSIAQATVALSEVMNAPTQTSSPPSKMPLRPRKIRKLSPDESDANSSQIAPMTDGPKPISSGKSNK

Query:  NKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        NKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  NKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  SLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        SLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  SLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASLQLQQQEHQQEHQHSQQPQLLDPINSILNLGACAWGQ
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASLQLQQQEHQQEHQHSQQPQLLDPINSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASLQLQQQEHQQEHQHSQQPQLLDPINSILNLGACAWGQ

XP_022936456.1 probable DNA-3-methyladenine glycosylase 2 [Cucurbita moschata] | 1.75e-242 | 92.02
Query:  MGEQTQVQVQTQTQSQSQPQSQAQNTIHDSSNSTTSIAQATVALSEVMNAPTQTSSPPSKMPLRPRKIRKLSPDESDANSSQIAPMTDGPKPISSGKSNK
        MGE TQVQVQTQTQSQSQ QSQAQNT H+SSNSTT+IAQATVALSEVMNAP+QTSSPPSKMPLRPRKIRKLSPDESD NSSQ+  + DGPKPI++ KSNK
Subjt:  MGEQTQVQVQTQTQSQSQPQSQAQNTIHDSSNSTTSIAQATVALSEVMNAPTQTSSPPSKMPLRPRKIRKLSPDESDANSSQIAPMTDGPKPISSGKSNK

Query:  NKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        +KTAQQRAAFASAP++ ARSLSCEGEVE+ALRHLRNADPLLAPLIDLHQRP FDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  NKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  SLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        +L+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  SLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASLQLQQQEHQQEHQHSQQPQLLDPINSILNLGACAWGQ
        D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQEHQ +HQH QQPQLLDP+NSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASLQLQQQEHQQEHQHSQQPQLLDPINSILNLGACAWGQ

XP_022976000.1 probable DNA-3-methyladenine glycosylase 2 [Cucurbita maxima] | 1.75e-242 | 92.02
Query:  MGEQTQVQVQTQTQSQSQPQSQAQNTIHDSSNSTTSIAQATVALSEVMNAPTQTSSPPSKMPLRPRKIRKLSPDESDANSSQIAPMTDGPKPISSGKSNK
        MGE TQVQVQTQTQSQSQ QSQAQNT H+SSNSTT+IAQATV+LSEVMNAP+QTSSPPSKMPLRPRKIRKLSPDESD NSSQ+  + DGPKPI++ KSNK
Subjt:  MGEQTQVQVQTQTQSQSQPQSQAQNTIHDSSNSTTSIAQATVALSEVMNAPTQTSSPPSKMPLRPRKIRKLSPDESDANSSQIAPMTDGPKPISSGKSNK

Query:  NKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        +KTAQQRAAFASAP++ ARSLSCEGEVE+ALRHLRNADPLLAPLIDLHQRP FDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  NKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  SLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        +L+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  SLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASLQLQQQEHQQEHQHSQQPQLLDPINSILNLGACAWGQ
        D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQEHQ EHQH QQPQLLDP+NSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASLQLQQQEHQQEHQHSQQPQLLDPINSILNLGACAWGQ

XP_023536439.1 probable DNA-3-methyladenine glycosylase 2 [Cucurbita pepo subsp. pepo] | 6.10e-243 | 92.29
Query:  MGEQTQVQVQTQTQSQSQPQSQAQNTIHDSSNSTTSIAQATVALSEVMNAPTQTSSPPSKMPLRPRKIRKLSPDESDANSSQIAPMTDGPKPISSGKSNK
        MGE TQVQVQTQTQSQSQ QSQAQNT H+SSNSTT+IAQATVALSEVMNAP+QTSSPPSKMPLRPRKIRKLSPDESD NSSQ+  + DGPKPI++ KSNK
Subjt:  MGEQTQVQVQTQTQSQSQPQSQAQNTIHDSSNSTTSIAQATVALSEVMNAPTQTSSPPSKMPLRPRKIRKLSPDESDANSSQIAPMTDGPKPISSGKSNK

Query:  NKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        +KTAQQRAAFASAP++ ARSLSCEGEVE+ALRHLRNADPLLAPLIDLHQRP FDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  NKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  SLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        +L+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  SLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASLQLQQQEHQQEHQHSQQPQLLDPINSILNLGACAWGQ
        D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQEHQ EHQH QQPQLLDP+NSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASLQLQQQEHQQEHQHSQQPQLLDPINSILNLGACAWGQ

XP_038904569.1 DNA-3-methyladenine glycosylase 1-like [Benincasa hispida] | 4.59e-241 | 92.08
Query:  MGEQTQVQVQTQTQSQSQPQSQAQNTIHDSSNSTTSIAQATVALSEVMNAPTQTSSPPSKMPLRPRKIRKLSPDESDANSSQIAPMTDGPKPISSGKSNK
        MGEQTQVQVQTQTQSQSQPQSQ QNT+H+SSNSTT IAQATV LSEVMNAP+QTSSPPSKMPLRPRKIRKLSP+ESD NSS    + DGPKPI++GKSNK
Subjt:  MGEQTQVQVQTQTQSQSQPQSQAQNTIHDSSNSTTSIAQATVALSEVMNAPTQTSSPPSKMPLRPRKIRKLSPDESDANSSQIAPMTDGPKPISSGKSNK

Query:  NKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        +KT QQRAAFASAP+ PARSLSCEGEVEIALRHLRNADPLLA LIDLHQRP FDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  NKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  SLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        +LNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  SLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASLQLQQQEHQQEHQHSQ---QPQLLDPINSILNLGACAWGQ
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAA+AAGASLQLQQQEH QEHQH Q   QPQLLDP+NSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASLQLQQQEHQQEHQHSQ---QPQLLDPINSILNLGACAWGQ

TrEMBL top hits | e value | % identity
A0A0A0LIY1 ENDO3c domain-containing protein | 1.87e-236 | 90.05
Query:  MGEQTQVQVQTQTQSQSQPQSQAQNTIHDSSNSTTSIAQATVALSEVMNAPTQTSSPPSKMPLRPRKIRKLSPDESDANSSQIAPMTDGPKPISSGKSNK
        MGEQTQVQVQTQTQSQ QPQSQAQNT H+SSNSTT IAQATV LSEVMNAP+Q SSPPSKMPLRPRKIRKLSP+ESD NSS +  + DGPKPI++ KSNK
Subjt:  MGEQTQVQVQTQTQSQSQPQSQAQNTIHDSSNSTTSIAQATVALSEVMNAPTQTSSPPSKMPLRPRKIRKLSPDESDANSSQIAPMTDGPKPISSGKSNK

Query:  NKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        +KTA QRAAFASA + PARSLSCEGEVEIALRHLRNADPLLA LIDLHQRP FDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  NKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  SLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        +LNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  SLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASLQLQQQEHQQEHQHSQ------QPQLLDPINSILNLGACAWGQ
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAA+AAGASLQLQ Q+H QEHQH Q      QPQLLDP+NSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASLQLQQQEHQQEHQHSQ------QPQLLDPINSILNLGACAWGQ

A0A1S3CRJ5 DNA-3-methyladenine glycosylase 1 | 8.27e-237 | 91.03
Query:  MGEQTQVQVQTQTQSQSQPQSQAQNTIHDSSNSTTSIAQATVALSEVMNAPTQTSSPPSKMPLRPRKIRKLSPDESDANSSQIAPMTDGPKPISSGKSNK
        MGEQTQVQVQTQTQSQ QPQSQAQNT H+SSNSTT IAQATV LSEVMNAP+Q SSPPSKMPLRPRKIRKLSP+ESD NSS +  + DGPKPI++ KSNK
Subjt:  MGEQTQVQVQTQTQSQSQPQSQAQNTIHDSSNSTTSIAQATVALSEVMNAPTQTSSPPSKMPLRPRKIRKLSPDESDANSSQIAPMTDGPKPISSGKSNK

Query:  NKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        +KTAQQRAAFASA +  ARSLSCEGEVEIALRHLRNADPLLA LIDLHQRP FDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  NKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  SLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        SLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  SLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASLQLQQQEHQQEHQHSQ---QPQLLDPINSILNLGACAWGQ
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAA+AAGASLQLQQQ+H QEHQH Q   QPQLLDP+N ILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASLQLQQQEHQQEHQHSQ---QPQLLDPINSILNLGACAWGQ

A0A6J1CKY0 uncharacterized protein LOC111012247 | 5.33e-262 | 100
Query:  MGEQTQVQVQTQTQSQSQPQSQAQNTIHDSSNSTTSIAQATVALSEVMNAPTQTSSPPSKMPLRPRKIRKLSPDESDANSSQIAPMTDGPKPISSGKSNK
        MGEQTQVQVQTQTQSQSQPQSQAQNTIHDSSNSTTSIAQATVALSEVMNAPTQTSSPPSKMPLRPRKIRKLSPDESDANSSQIAPMTDGPKPISSGKSNK
Subjt:  MGEQTQVQVQTQTQSQSQPQSQAQNTIHDSSNSTTSIAQATVALSEVMNAPTQTSSPPSKMPLRPRKIRKLSPDESDANSSQIAPMTDGPKPISSGKSNK

Query:  NKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        NKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  NKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  SLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        SLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  SLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASLQLQQQEHQQEHQHSQQPQLLDPINSILNLGACAWGQ
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASLQLQQQEHQQEHQHSQQPQLLDPINSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASLQLQQQEHQQEHQHSQQPQLLDPINSILNLGACAWGQ

A0A6J1F7I4 probable DNA-3-methyladenine glycosylase 2 | 8.46e-243 | 92.02
Query:  MGEQTQVQVQTQTQSQSQPQSQAQNTIHDSSNSTTSIAQATVALSEVMNAPTQTSSPPSKMPLRPRKIRKLSPDESDANSSQIAPMTDGPKPISSGKSNK
        MGE TQVQVQTQTQSQSQ QSQAQNT H+SSNSTT+IAQATVALSEVMNAP+QTSSPPSKMPLRPRKIRKLSPDESD NSSQ+  + DGPKPI++ KSNK
Subjt:  MGEQTQVQVQTQTQSQSQPQSQAQNTIHDSSNSTTSIAQATVALSEVMNAPTQTSSPPSKMPLRPRKIRKLSPDESDANSSQIAPMTDGPKPISSGKSNK

Query:  NKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        +KTAQQRAAFASAP++ ARSLSCEGEVE+ALRHLRNADPLLAPLIDLHQRP FDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  NKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  SLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        +L+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  SLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASLQLQQQEHQQEHQHSQQPQLLDPINSILNLGACAWGQ
        D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQEHQ +HQH QQPQLLDP+NSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASLQLQQQEHQQEHQHSQQPQLLDPINSILNLGACAWGQ

A0A6J1IIA5 probable DNA-3-methyladenine glycosylase 2 | 8.46e-243 | 92.02
Query:  MGEQTQVQVQTQTQSQSQPQSQAQNTIHDSSNSTTSIAQATVALSEVMNAPTQTSSPPSKMPLRPRKIRKLSPDESDANSSQIAPMTDGPKPISSGKSNK
        MGE TQVQVQTQTQSQSQ QSQAQNT H+SSNSTT+IAQATV+LSEVMNAP+QTSSPPSKMPLRPRKIRKLSPDESD NSSQ+  + DGPKPI++ KSNK
Subjt:  MGEQTQVQVQTQTQSQSQPQSQAQNTIHDSSNSTTSIAQATVALSEVMNAPTQTSSPPSKMPLRPRKIRKLSPDESDANSSQIAPMTDGPKPISSGKSNK

Query:  NKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        +KTAQQRAAFASAP++ ARSLSCEGEVE+ALRHLRNADPLLAPLIDLHQRP FDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  NKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  SLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        +L+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  SLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASLQLQQQEHQQEHQHSQQPQLLDPINSILNLGACAWGQ
        D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQEHQ EHQH QQPQLLDP+NSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASLQLQQQEHQQEHQHSQQPQLLDPINSILNLGACAWGQ

SwissProt top hits | e value | % identity
O31544 Putative DNA-3-methyladenine glycosylase YfjP | 1.8e-18 | 30.72
Query:  LTRSILYQQLAYKAGTSIYTRFIALCGGEA-GV----LPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGS
        + + I++QQL      ++  RF+   G +  GV     PET+  L+ Q LR +  S RK+ Y  D +R    G LS   + +M D+ +   L  + GIG 
Subjt:  LTRSILYQQLAYKAGTSIYTRFIALCGGEA-GV----LPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGS

Query:  WSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAE
        W+V   ++F L RP++ P+ D+ ++  ++  + L++ P    M  + ++W PY S  S Y+WR  E
Subjt:  WSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAE

O94468 Alkylbase DNA glycosidase-like protein mag2 | 5.7e-17 | 26.67
Query:  PFLALTRSILYQQLAYKAGTSIYTRFIALCG-GEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQN-GILSDPAIVNMDDKSLFTMLTMVNGIG
        P+  + R+I  Q+L+  A  SI  +F   C   +    P+ ++  + + L + G S  KS  +H +A    N  I S   I  M ++ L   L+ + G+ 
Subjt:  PFLALTRSILYQQLAYKAGTSIYTRFIALCG-GEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQN-GILSDPAIVNMDDKSLFTMLTMVNGIG

Query:  SWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRL
         W++ M+ IF+L R D++P +D  ++   +  + L   P+  ++++L +  +PYR++ +WY+W++
Subjt:  SWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRL

P37878 DNA-3-methyladenine glycosylase | 5.5e-12 | 25.73
Query:  FLALTRSILYQQLAYKAGTSIYTRFIALCGG-------EAGVLP--ETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLT
        F AL   +L QQ+      S+  +F+   G        +  V P  E +  L P  L  I ++ +KS Y+  +AR   +G LS   ++ M+ K     L 
Subjt:  FLALTRSILYQQLAYKAGTSIYTRFIALCGG-------EAGVLP--ETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLT

Query:  MVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRL
         + GIG W+ +  ++  L  P   PI+D+ +   +++L N+   P   ++ ++   W+ ++S  ++Y+WR+
Subjt:  MVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRL

Q92383 DNA-3-methyladenine glycosylase 1 | 1.1e-20 | 30.72
Query:  PFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGIL-SDPAIVNMDDKSLFTMLTMVNGIGS
        P+  L R++  QQL  KA  +I+ RF ++        PE +  ++ + +R  G S RK   L  +A    +G++ +      + ++ L   LT + GIG 
Subjt:  PFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGIL-SDPAIVNMDDKSLFTMLTMVNGIGS

Query:  WSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAE
        W+V M +IFSL+R DV+P +DL++R G + L+ L ++P    + +  E   P+R+  +WY+W+ ++
Subjt:  WSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAE

Arabidopsis top hits | e value | % identity
AT1G19480.1 DNA glycosylase superfamily protein | 6.1e-115 | 61.62
Query:  MGEQTQVQVQTQTQSQSQ-PQSQAQNTIH--------DSSNSTTSIAQATVALSEVMNAPTQTSSPPSKMPLRPRKIRKLSPD---------ESDANSSQ
        MGEQ+  Q  TQ QS  Q P+    N I         DS+  + SI  +T   +  +      SSPPSK+PLRPRKIRKL+ D           D +SSQ
Subjt:  MGEQTQVQVQTQTQSQSQ-PQSQAQNTIH--------DSSNSTTSIAQATVALSEVMNAPTQTSSPPSKMPLRPRKIRKLSPD---------ESDANSSQ

Query:  I-APM-TDGPKPISSGKSNKNKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSI
        + +P+ TDG  P       K K +  RA   + P + AR L+CEGE+E A+ +LRNADPLLA LID+H  P F+SF+TPFLAL R+ILYQQLA KAG SI
Subjt:  I-APM-TDGPKPISSGKSNKNKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSI

Query:  YTRFIALCGGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLN
        YTRF++LCGGE  V+PETVLSLNPQQLRQIG+SGRK+SYLHDLARKYQNGILSD AI+NMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL 
Subjt:  YTRFIALCGGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLN

Query:  VRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASLQLQQQEHQQEHQHSQQPQLLDPINSILNLGACAWGQ
        VRKGVQLLY L++LPRPSQM+Q C KWRPYRSVGSWYMWRL EAK  S+S AA+AAG SL    ++ QQEH   QQ QL+DP+N + ++G  AWGQ
Subjt:  VRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASLQLQQQEHQQEHQHSQQPQLLDPINSILNLGACAWGQ

AT1G19480.2 DNA glycosylase superfamily protein | 2.5e-113 | 61.54
Query:  MGEQTQVQVQTQTQSQSQ-PQSQAQNTIH--------DSSNSTTSIAQATVALSEVMNAPTQTSSPPSKMPLRPRKIRKLSPD---------ESDANSSQ
        MGEQ+  Q  TQ QS  Q P+    N I         DS+  + SI  +T   +  +      SSPPSK+PLRPRKIRKL+ D           D +SSQ
Subjt:  MGEQTQVQVQTQTQSQSQ-PQSQAQNTIH--------DSSNSTTSIAQATVALSEVMNAPTQTSSPPSKMPLRPRKIRKLSPD---------ESDANSSQ

Query:  I-APM-TDGPKPISSGKSNKNKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSI
        + +P+ TDG  P       K K +  RA   + P + AR L+CEGE+E A+ +LRNADPLLA LID+H  P F+SF+TPFLAL R+ILYQQLA KAG SI
Subjt:  I-APM-TDGPKPISSGKSNKNKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSI

Query:  YTRFIALCGGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLN
        YTRF++LCGGE  V+PETVLSLNPQQLRQIG+SGRK+SYLHDLARKYQNGILSD AI+NMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL 
Subjt:  YTRFIALCGGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLN

Query:  VRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASLQLQQQEHQQEHQHSQQPQLLDPINSILNLG
        VRKGVQLLY L++LPRPSQM+Q C KWRPYRSVGSWYMWRL EAK  S+S AA+AAG SL    ++ QQEH   QQ QL+DP+N + ++G
Subjt:  VRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASLQLQQQEHQQEHQHSQQPQLLDPINSILNLG

AT1G75230.1 DNA glycosylase superfamily protein | 2.2e-117 | 59.09
Query:  MGEQTQVQVQTQTQSQSQPQSQAQNT-------IHDSSNSTTSIAQATVALSEVMNAPTQT-----SSPPSKMPLRPRKIRKLSPDESDANSSQIAPMTD
        MGE +  Q  + T   +QP+S    T        +D  +++++    ++  S  + AP  T     SSPP+K+PLRPRKIRKLSPD  D  S    P  +
Subjt:  MGEQTQVQVQTQTQSQSQPQSQAQNT-------IHDSSNSTTSIAQATVALSEVMNAPTQT-----SSPPSKMPLRPRKIRKLSPDESDANSSQIAPMTD

Query:  GPKPISSGKSNKNKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALC
          +  ++  + K+K +Q R    + P + ARSL+CEGE+E AL HLR+ DPLLA LID+H  P F++FQTPFLAL RSILYQQLA KAG SIYTRF+ALC
Subjt:  GPKPISSGKSNKNKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALC

Query:  GGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLL
        GGE GV+PE VL L PQQLRQIG+SGRK+SYLHDLARKYQNGILSD  IVNMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL VRKGVQ+L
Subjt:  GGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLL

Query:  YNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASL------QLQQQEHQQEHQ--HSQQPQLLDPINSILNLGACAWGQ
          +E+LPRPS+M+QLCEKWRPYRSV SWY+WRL E+K    +AAA  AGA+L       +QQQE +Q+HQ    QQPQL+DP+N++ ++G  AWGQ
Subjt:  YNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASL------QLQQQEHQQEHQ--HSQQPQLLDPINSILNLGACAWGQ

AT1G75230.2 DNA glycosylase superfamily protein | 9.4e-116 | 58.97
Query:  MGEQTQVQVQTQTQSQSQPQSQAQNT-------IHDSSNSTTSIAQATVALSEVMNAPTQT-----SSPPSKMPLRPRKIRKLSPDESDANSSQIAPMTD
        MGE +  Q  + T   +QP+S    T        +D  +++++    ++  S  + AP  T     SSPP+K+PLRPRKIRKLSPD  D  S    P  +
Subjt:  MGEQTQVQVQTQTQSQSQPQSQAQNT-------IHDSSNSTTSIAQATVALSEVMNAPTQT-----SSPPSKMPLRPRKIRKLSPDESDANSSQIAPMTD

Query:  GPKPISSGKSNKNKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALC
          +  ++  + K+K +Q R    + P + ARSL+CEGE+E AL HLR+ DPLLA LID+H  P F++FQTPFLAL RSILYQQLA KAG SIYTRF+ALC
Subjt:  GPKPISSGKSNKNKTAQQRAAFASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALC

Query:  GGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLL
        GGE GV+PE VL L PQQLRQIG+SGRK+SYLHDLARKYQNGILSD  IVNMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL VRKGVQ+L
Subjt:  GGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLL

Query:  YNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASL------QLQQQEHQQEHQ--HSQQPQLLDPINSILNLG
          +E+LPRPS+M+QLCEKWRPYRSV SWY+WRL E+K    +AAA  AGA+L       +QQQE +Q+HQ    QQPQL+DP+N++ ++G
Subjt:  YNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASL------QLQQQEHQQEHQ--HSQQPQLLDPINSILNLG

AT3G50880.1 DNA glycosylase superfamily protein | 2.0e-73 | 52.86
Query:  TSIAQATVALSEVMNA----PTQTSSPPSKMPLRPRKIRKLSPDESDANSSQIAPMTDGPKPISSGKSNKNKTAQQRAAFASAPILPARSLSCEGEVEIA
        T++ Q+++    +++A     ++ S   S++  RPRKIRK+S D S             P+ I +               AS P      LS +  V+IA
Subjt:  TSIAQATVALSEVMNA----PTQTSSPPSKMPLRPRKIRKLSPDESDANSSQIAPMTDGPKPISSGKSNKNKTAQQRAAFASAPILPARSLSCEGEVEIA

Query:  LRHLRNADPLLAPLIDLH-QRPIFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALC-GGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQ
        LRHL+++D LL  LI  H   P+FDS  TPFL+L RSILYQQLA KA   IY RFI+L  GGEAGV+PE+V+SL+   LR+IG+SGRK+SYLHDLA KY 
Subjt:  LRHLRNADPLLAPLIDLH-QRPIFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALC-GGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQ

Query:  NGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAK
        NG+LSD  I+ M D+ L   LT+V GIG W+VHMFMIFSLHRPDVLP+ DL VRKGV+ LY L+ LP P QM+QLCEKWRPYRSVGSWYMWRL E++
Subjt:  NGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAK


Sequences
CDS sequence
ATGGGAGAGCAAACACAAGTGCAGGTTCAGACTCAGACCCAAAGCCAATCTCAACCGCAATCGCAGGCTCAGAACACGATTCATGACTCCTCGAATTCCACAACTTCTAT
CGCCCAAGCCACTGTTGCGTTGAGCGAGGTGATGAATGCGCCAACGCAAACGTCTTCTCCGCCATCCAAAATGCCCTTGCGTCCGCGGAAGATCCGAAAGCTCTCGCCCG
ATGAATCGGATGCAAATTCCTCCCAGATTGCCCCCATGACCGATGGGCCGAAACCAATCAGCTCCGGTAAATCGAACAAGAACAAGACGGCTCAGCAACGCGCCGCCTTC
GCGTCTGCCCCAATACTGCCAGCCCGATCACTTTCTTGTGAAGGCGAGGTGGAAATCGCGCTTCGGCATCTCCGGAATGCCGATCCGCTCCTTGCACCTTTGATCGACCT
CCATCAACGTCCCATCTTCGACAGTTTTCAGACCCCATTCCTTGCCCTAACTAGAAGTATCCTATATCAGCAGCTGGCTTACAAAGCTGGAACTTCTATCTACACCCGTT
TCATCGCCCTTTGTGGTGGCGAGGCCGGTGTTCTTCCTGAAACTGTTCTTTCCTTGAACCCTCAACAGCTAAGGCAAATTGGAATTTCGGGTCGTAAGTCTAGTTACCTT
CACGACCTTGCAAGGAAGTACCAAAATGGGATTCTTTCAGACCCGGCAATTGTAAATATGGATGATAAATCGCTTTTCACGATGCTGACAATGGTCAATGGAATTGGGTC
TTGGTCTGTTCATATGTTCATGATTTTCTCGCTGCACAGACCAGATGTGCTTCCTATCAACGATCTTAATGTTCGCAAAGGGGTTCAGCTTCTTTACAATCTTGAAGAGT
TGCCTCGGCCATCACAGATGGATCAGTTATGCGAGAAATGGAGGCCATATAGATCGGTTGGGTCGTGGTATATGTGGAGGCTTGCTGAAGCAAAGGGGGCTTCTTCAAGC
GCAGCTGCACTGGCTGCTGGTGCTAGTTTACAACTGCAGCAACAAGAGCACCAGCAGGAGCACCAGCATTCACAGCAGCCGCAGCTTCTTGACCCTATTAATAGCATTCT
CAATCTTGGGGCCTGTGCATGGGGGCAGTGA
mRNA sequence
GTGATTGTAACGGGCCGAGCCCACGGTGGGAAAAATATGAGTACGTGTACGTACGGCAAACGCCGACAAAGGAGCGTGAAGCAGCGGAAAAGTTGATTTCCCAGTCTGGA
CTCTGGAGAGGAAACCGAAAGCCAGTTGGGTTAGAGAAGTGAGGAACACAGAAGAGGAGAGAAGAGAAGAAGAAGAATAAGAATAAGAAGAAGAGCAGCGGGATTTCCTT
CTTCTGATTCGCTACACCATTGCCAATCCATTTGTCTTTTCTGTAACCTAATTCAAAATCCCTAAGCTGCTCAATTTCACCAATAATTTTGAATTTCTTCCTCTTCTACC
TGGGGCATCGAGTTTTCCCCTTTTTCTGATTCTCCGCCGCCGCCGTTGTTTTTCCGTCCGACTATTGGCTGTTTACCGTGTTTCGTTTCCACGGAATATTCTGAGTGAAG
GGGGAAAACGCAGATTGAATCTCTTACATATGGGAGAGCAAACACAAGTGCAGGTTCAGACTCAGACCCAAAGCCAATCTCAACCGCAATCGCAGGCTCAGAACACGATT
CATGACTCCTCGAATTCCACAACTTCTATCGCCCAAGCCACTGTTGCGTTGAGCGAGGTGATGAATGCGCCAACGCAAACGTCTTCTCCGCCATCCAAAATGCCCTTGCG
TCCGCGGAAGATCCGAAAGCTCTCGCCCGATGAATCGGATGCAAATTCCTCCCAGATTGCCCCCATGACCGATGGGCCGAAACCAATCAGCTCCGGTAAATCGAACAAGA
ACAAGACGGCTCAGCAACGCGCCGCCTTCGCGTCTGCCCCAATACTGCCAGCCCGATCACTTTCTTGTGAAGGCGAGGTGGAAATCGCGCTTCGGCATCTCCGGAATGCC
GATCCGCTCCTTGCACCTTTGATCGACCTCCATCAACGTCCCATCTTCGACAGTTTTCAGACCCCATTCCTTGCCCTAACTAGAAGTATCCTATATCAGCAGCTGGCTTA
CAAAGCTGGAACTTCTATCTACACCCGTTTCATCGCCCTTTGTGGTGGCGAGGCCGGTGTTCTTCCTGAAACTGTTCTTTCCTTGAACCCTCAACAGCTAAGGCAAATTG
GAATTTCGGGTCGTAAGTCTAGTTACCTTCACGACCTTGCAAGGAAGTACCAAAATGGGATTCTTTCAGACCCGGCAATTGTAAATATGGATGATAAATCGCTTTTCACG
ATGCTGACAATGGTCAATGGAATTGGGTCTTGGTCTGTTCATATGTTCATGATTTTCTCGCTGCACAGACCAGATGTGCTTCCTATCAACGATCTTAATGTTCGCAAAGG
GGTTCAGCTTCTTTACAATCTTGAAGAGTTGCCTCGGCCATCACAGATGGATCAGTTATGCGAGAAATGGAGGCCATATAGATCGGTTGGGTCGTGGTATATGTGGAGGC
TTGCTGAAGCAAAGGGGGCTTCTTCAAGCGCAGCTGCACTGGCTGCTGGTGCTAGTTTACAACTGCAGCAACAAGAGCACCAGCAGGAGCACCAGCATTCACAGCAGCCG
CAGCTTCTTGACCCTATTAATAGCATTCTCAATCTTGGGGCCTGTGCATGGGGGCAGTGACTCGGATTGAAAAGAGTACCATCTTTGCAACTATCCCAATCAATTTACTC
ACTGAATGAAAAATATGCCAATTATCTGGGATTAGAAGGCTCAGCAATGAATATTTGGTCCCTCGAGGATGGAAGAAAATGCTCTCTGAATCTCCGCCATTAAAGCAGCA
TCGGAGGCCGCCGACGCCAACAGTTCCGATCCAAATACCGGAATAATCTGCAGAAGAGCCGGAAGGACGAGGCTCCCCAACGGCGGCGCTGATAACAGTGATGGAGCCGT
ACAGATCCGCCATGGTTAATGTTAACGATGATGGAGAGAGAGATAGAGAGGAAAAGCTGTAGATATAAGCAAGGTTTTAATTGTAACTGTAATTTAACAAATAAACCTTT
CGCAGTTTCCAGAGAGATTAAAAATAATTAATATTAATATTCAATCTGTATTAGTCAG
Protein sequence
MGEQTQVQVQTQTQSQSQPQSQAQNTIHDSSNSTTSIAQATVALSEVMNAPTQTSSPPSKMPLRPRKIRKLSPDESDANSSQIAPMTDGPKPISSGKSNKNKTAQQRAAF
ASAPILPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLSLNPQQLRQIGISGRKSSYL
HDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSS
AAALAAGASLQLQQQEHQQEHQHSQQPQLLDPINSILNLGACAWGQ