; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014900 (gene) of Snake gourd v1 genome

Gene IDTan0014900
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDNA-3-methyladenine glycosylase 1
Genome locationLG01:115039821..115044388
RNA-Seq ExpressionTan0014900
SyntenyTan0014900
Gene Ontology termsGO:0006285 - base-excision repair, AP site formation (biological process)
GO:0006307 - DNA dealkylation involved in DNA repair (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0032993 - protein-DNA complex (cellular component)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
GO:0032131 - alkylated DNA binding (molecular function)
GO:0043916 - DNA-7-methylguanine glycosylase activity (molecular function)
InterPro domainsIPR003265 - HhH-GPD domain
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142016.1 uncharacterized protein LOC111012247 [Momordica charantia]3.6e-19494.15Show/hide
Query:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHDSSNSTTPIAQATVVLSEVMNAPSLTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVTIPDGPKPIAAGKSNK
        MGEQTQVQVQTQTQSQSQ QSQAQNT+HDSSNSTT IAQATV LSEVMNAP+ TSSPPSKMPLRPRKIRKLSPDESD NSSQI  + DGPKPI++GKSNK
Subjt:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHDSSNSTTPIAQATVVLSEVMNAPSLTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVTIPDGPKPIAAGKSNK

Query:  SKTAQQRAAFSSAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        +KTAQQRAAF+SAP+LPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRP FDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFSSAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQM
        +LNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLE+LPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEHQHPQQPQLLDPLNSILNLGACAWGQ
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAA+AAGASLQLQQQEHQQEHQH QQPQLLDP+NSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEHQHPQQPQLLDPLNSILNLGACAWGQ

XP_022936456.1 probable DNA-3-methyladenine glycosylase 2 [Cucurbita moschata]8.0e-19494.15Show/hide
Query:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHDSSNSTTPIAQATVVLSEVMNAPSLTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVTIPDGPKPIAAGKSNK
        MGE TQVQVQTQTQSQSQAQSQAQNT H+SSNSTT IAQATV LSEVMNAPS TSSPPSKMPLRPRKIRKLSPDESDPNSSQ+V IPDGPKPIA  KSNK
Subjt:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHDSSNSTTPIAQATVVLSEVMNAPSLTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVTIPDGPKPIAAGKSNK

Query:  SKTAQQRAAFSSAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTAQQRAAF+SAPV+ ARSLSCEGEVE+ALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFSSAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQM
        AL+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLE+LPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEHQHPQQPQLLDPLNSILNLGACAWGQ
        D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQEHQ +HQH QQPQLLDPLNSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEHQHPQQPQLLDPLNSILNLGACAWGQ

XP_022976000.1 probable DNA-3-methyladenine glycosylase 2 [Cucurbita maxima]6.1e-19494.41Show/hide
Query:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHDSSNSTTPIAQATVVLSEVMNAPSLTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVTIPDGPKPIAAGKSNK
        MGE TQVQVQTQTQSQSQAQSQAQNT H+SSNSTT IAQATV LSEVMNAPS TSSPPSKMPLRPRKIRKLSPDESDPNSSQ+V IPDGPKPIA  KSNK
Subjt:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHDSSNSTTPIAQATVVLSEVMNAPSLTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVTIPDGPKPIAAGKSNK

Query:  SKTAQQRAAFSSAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTAQQRAAF+SAPV+ ARSLSCEGEVE+ALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFSSAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQM
        AL+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLE+LPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEHQHPQQPQLLDPLNSILNLGACAWGQ
        D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQEHQ EHQH QQPQLLDPLNSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEHQHPQQPQLLDPLNSILNLGACAWGQ

XP_023536439.1 probable DNA-3-methyladenine glycosylase 2 [Cucurbita pepo subsp. pepo]3.6e-19494.41Show/hide
Query:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHDSSNSTTPIAQATVVLSEVMNAPSLTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVTIPDGPKPIAAGKSNK
        MGE TQVQVQTQTQSQSQAQSQAQNT H+SSNSTT IAQATV LSEVMNAPS TSSPPSKMPLRPRKIRKLSPDESDPNSSQ+V IPDGPKPIA  KSNK
Subjt:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHDSSNSTTPIAQATVVLSEVMNAPSLTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVTIPDGPKPIAAGKSNK

Query:  SKTAQQRAAFSSAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTAQQRAAF+SAPV+ ARSLSCEGEVE+ALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFSSAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQM
        AL+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLE+LPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEHQHPQQPQLLDPLNSILNLGACAWGQ
        D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQEHQ EHQH QQPQLLDPLNSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEHQHPQQPQLLDPLNSILNLGACAWGQ

XP_038904569.1 DNA-3-methyladenine glycosylase 1-like [Benincasa hispida]2.1e-19494.72Show/hide
Query:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHDSSNSTTPIAQATVVLSEVMNAPSLTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVTIPDGPKPIAAGKSNK
        MGEQTQVQVQTQTQSQSQ QSQ QNTLH+SSNSTTPIAQATV+LSEVMNAPS TSSPPSKMPLRPRKIRKLSP+ESDPNSS  + IPDGPKPIA GKSNK
Subjt:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHDSSNSTTPIAQATVVLSEVMNAPSLTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVTIPDGPKPIAAGKSNK

Query:  SKTAQQRAAFSSAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKT QQRAAF+SAPV PARSLSCEGEVEIALRHLRNADPLLA LIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFSSAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQM
        ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLE+LPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEH---QHPQQPQLLDPLNSILNLGACAWGQ
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEH QEH   QHPQQPQLLDPLNSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEH---QHPQQPQLLDPLNSILNLGACAWGQ

TrEMBL top hitse value%identityAlignment
A0A0A0LIY1 ENDO3c domain-containing protein5.3e-19192.67Show/hide
Query:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHDSSNSTTPIAQATVVLSEVMNAPSLTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVTIPDGPKPIAAGKSNK
        MGEQTQVQVQTQTQSQ Q QSQAQNT H+SSNSTTPIAQATV+LSEVMNAPS  SSPPSKMPLRPRKIRKLSP+ESDPNSS +V IPDGPKPIA  KSNK
Subjt:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHDSSNSTTPIAQATVVLSEVMNAPSLTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVTIPDGPKPIAAGKSNK

Query:  SKTAQQRAAFSSAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTA QRAAF+SA V PARSLSCEGEVEIALRHLRNADPLLA LIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFSSAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQM
        ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLE+LPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEH------QHPQQPQLLDPLNSILNLGACAWGQ
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQ Q+H QEH      QHPQQPQLLDPLNSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEH------QHPQQPQLLDPLNSILNLGACAWGQ

A0A1S3CRJ5 DNA-3-methyladenine glycosylase 11.5e-19093.14Show/hide
Query:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHDSSNSTTPIAQATVVLSEVMNAPSLTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVTIPDGPKPIAAGKSNK
        MGEQTQVQVQTQTQSQ Q QSQAQNT H+SSNSTTPIAQATV+LSEVMNAPS  SSPPSKMPLRPRKIRKLSP+ESDPNSS +V IPDGPKPIA  KSNK
Subjt:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHDSSNSTTPIAQATVVLSEVMNAPSLTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVTIPDGPKPIAAGKSNK

Query:  SKTAQQRAAFSSAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTAQQRAAF+SA V  ARSLSCEGEVEIALRHLRNADPLLA LIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFSSAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQM
        +LNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLE+LPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEH---QHPQQPQLLDPLNSILNLGACAWGQ
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQ+H QEH   QHPQQPQLLDPLN ILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEH---QHPQQPQLLDPLNSILNLGACAWGQ

A0A6J1CKY0 uncharacterized protein LOC1110122471.7e-19494.15Show/hide
Query:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHDSSNSTTPIAQATVVLSEVMNAPSLTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVTIPDGPKPIAAGKSNK
        MGEQTQVQVQTQTQSQSQ QSQAQNT+HDSSNSTT IAQATV LSEVMNAP+ TSSPPSKMPLRPRKIRKLSPDESD NSSQI  + DGPKPI++GKSNK
Subjt:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHDSSNSTTPIAQATVVLSEVMNAPSLTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVTIPDGPKPIAAGKSNK

Query:  SKTAQQRAAFSSAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        +KTAQQRAAF+SAP+LPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRP FDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFSSAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQM
        +LNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLE+LPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEHQHPQQPQLLDPLNSILNLGACAWGQ
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAA+AAGASLQLQQQEHQQEHQH QQPQLLDP+NSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEHQHPQQPQLLDPLNSILNLGACAWGQ

A0A6J1F7I4 probable DNA-3-methyladenine glycosylase 23.9e-19494.15Show/hide
Query:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHDSSNSTTPIAQATVVLSEVMNAPSLTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVTIPDGPKPIAAGKSNK
        MGE TQVQVQTQTQSQSQAQSQAQNT H+SSNSTT IAQATV LSEVMNAPS TSSPPSKMPLRPRKIRKLSPDESDPNSSQ+V IPDGPKPIA  KSNK
Subjt:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHDSSNSTTPIAQATVVLSEVMNAPSLTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVTIPDGPKPIAAGKSNK

Query:  SKTAQQRAAFSSAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTAQQRAAF+SAPV+ ARSLSCEGEVE+ALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFSSAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQM
        AL+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLE+LPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEHQHPQQPQLLDPLNSILNLGACAWGQ
        D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQEHQ +HQH QQPQLLDPLNSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEHQHPQQPQLLDPLNSILNLGACAWGQ

A0A6J1IIA5 probable DNA-3-methyladenine glycosylase 23.0e-19494.41Show/hide
Query:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHDSSNSTTPIAQATVVLSEVMNAPSLTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVTIPDGPKPIAAGKSNK
        MGE TQVQVQTQTQSQSQAQSQAQNT H+SSNSTT IAQATV LSEVMNAPS TSSPPSKMPLRPRKIRKLSPDESDPNSSQ+V IPDGPKPIA  KSNK
Subjt:  MGEQTQVQVQTQTQSQSQAQSQAQNTLHDSSNSTTPIAQATVVLSEVMNAPSLTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVTIPDGPKPIAAGKSNK

Query:  SKTAQQRAAFSSAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTAQQRAAF+SAPV+ ARSLSCEGEVE+ALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFSSAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQM
        AL+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLE+LPRPSQM
Subjt:  ALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEHQHPQQPQLLDPLNSILNLGACAWGQ
        D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQEHQ EHQH QQPQLLDPLNSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEHQHPQQPQLLDPLNSILNLGACAWGQ

SwissProt top hitse value%identityAlignment
O31544 Putative DNA-3-methyladenine glycosylase YfjP6.1e-1931.33Show/hide
Query:  LTRSILYQQLAYKAGTSIYTRFIALCGGEA-GV----LPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGS
        + + I++QQL      ++  RF+   G +  GV     PET+  L+ Q LR +  S RK+ Y  D +R    G LS   + +M D+ +   L  + GIG 
Subjt:  LTRSILYQQLAYKAGTSIYTRFIALCGGEA-GV----LPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGS

Query:  WSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQMDQLCEKWRPYRSVGSWYMWRLAE
        W+V   ++F L RP++ P+ D+ ++  ++  + L+D P    M  + ++W PY S  S Y+WR  E
Subjt:  WSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQMDQLCEKWRPYRSVGSWYMWRLAE

O94468 Alkylbase DNA glycosidase-like protein mag22.6e-1724.88Show/hide
Query:  LSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSF--QTPFLALTRSILYQQLAYKAGTSIYTRFIALCG-GEAGVLPETVLALNPQQLRQIGISGRKS
        +S + + + A +HL + D   + L+      T        P+  + R+I  Q+L+  A  SI  +F   C   +    P+ ++  + + L + G S  KS
Subjt:  LSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSF--QTPFLALTRSILYQQLAYKAGTSIYTRFIALCG-GEAGVLPETVLALNPQQLRQIGISGRKS

Query:  SYLHDLARKYQN-GILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQMDQLCEKWRPYRSVGSW
          +H +A    N  I S   I  M ++ L   L+ + G+  W++ M+ IF+L R D++P +D  ++   +  + L   P+  ++++L +  +PYR++ +W
Subjt:  SYLHDLARKYQN-GILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQMDQLCEKWRPYRSVGSW

Query:  YMWRL
        Y+W++
Subjt:  YMWRL

P37878 DNA-3-methyladenine glycosylase7.2e-1225.73Show/hide
Query:  FLALTRSILYQQLAYKAGTSIYTRFIALCGG-------EAGVLP--ETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLT
        F AL   +L QQ+      S+  +F+   G        +  V P  E +  L P  L  I ++ +KS Y+  +AR   +G LS   ++ M+ K     L 
Subjt:  FLALTRSILYQQLAYKAGTSIYTRFIALCGG-------EAGVLP--ETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLT

Query:  MVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQMDQLCEKWRPYRSVGSWYMWRL
         + GIG W+ +  ++  L  P   PI+D+ +   +++L N+   P   ++ ++   W+ ++S  ++Y+WR+
Subjt:  MVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQMDQLCEKWRPYRSVGSWYMWRL

Q92383 DNA-3-methyladenine glycosylase 12.5e-2030.72Show/hide
Query:  PFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGIL-SDPAIVNMDDKSLFTMLTMVNGIGS
        P+  L R++  QQL  KA  +I+ RF ++        PE +  ++ + +R  G S RK   L  +A    +G++ +      + ++ L   LT + GIG 
Subjt:  PFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGIL-SDPAIVNMDDKSLFTMLTMVNGIGS

Query:  WSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQMDQLCEKWRPYRSVGSWYMWRLAE
        W+V M +IFSL+R DV+P +DL++R G + L+ L  +P    + +  E   P+R+  +WY+W+ ++
Subjt:  WSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQMDQLCEKWRPYRSVGSWYMWRLAE

Arabidopsis top hitse value%identityAlignment
AT1G19480.1 DNA glycosylase superfamily protein4.6e-11561.28Show/hide
Query:  MGEQTQVQVQTQTQSQSQAQS---------QAQNTLHDSSNSTTPIAQATVVLSEVMNAPSLTSSPPSKMPLRPRKIRKLSPD---ESDPNSSQIVTIPD
        MGEQ+  Q  TQ QS  Q+           ++ +   DS+  +  I  +T + +  +      SSPPSK+PLRPRKIRKL+ D     +   ++ ++   
Subjt:  MGEQTQVQVQTQTQSQSQAQS---------QAQNTLHDSSNSTTPIAQATVVLSEVMNAPSLTSSPPSKMPLRPRKIRKLSPD---ESDPNSSQIVTIPD

Query:  GPKPIAA-GKS-NKSKTAQQRAAFSSAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIA
           P+A  GKS  K K +  RA   + P + AR L+CEGE+E A+ +LRNADPLLA LID+H  PTF+SF+TPFLAL R+ILYQQLA KAG SIYTRF++
Subjt:  GPKPIAA-GKS-NKSKTAQQRAAFSSAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIA

Query:  LCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQ
        LCGGE  V+PETVL+LNPQQLRQIG+SGRK+SYLHDLARKYQNGILSD AI+NMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL VRKGVQ
Subjt:  LCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQ

Query:  LLYNLEDLPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEHQHPQQPQLLDPLNSILNLGACAWGQ
        LLY L+DLPRPSQM+Q C KWRPYRSVGSWYMWRL EAK  S+S AAVAAG SL    ++ QQEH   QQ QL+DPLN + ++G  AWGQ
Subjt:  LLYNLEDLPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEHQHPQQPQLLDPLNSILNLGACAWGQ

AT1G19480.2 DNA glycosylase superfamily protein2.5e-11361.2Show/hide
Query:  MGEQTQVQVQTQTQSQSQAQS---------QAQNTLHDSSNSTTPIAQATVVLSEVMNAPSLTSSPPSKMPLRPRKIRKLSPD---ESDPNSSQIVTIPD
        MGEQ+  Q  TQ QS  Q+           ++ +   DS+  +  I  +T + +  +      SSPPSK+PLRPRKIRKL+ D     +   ++ ++   
Subjt:  MGEQTQVQVQTQTQSQSQAQS---------QAQNTLHDSSNSTTPIAQATVVLSEVMNAPSLTSSPPSKMPLRPRKIRKLSPD---ESDPNSSQIVTIPD

Query:  GPKPIAA-GKS-NKSKTAQQRAAFSSAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIA
           P+A  GKS  K K +  RA   + P + AR L+CEGE+E A+ +LRNADPLLA LID+H  PTF+SF+TPFLAL R+ILYQQLA KAG SIYTRF++
Subjt:  GPKPIAA-GKS-NKSKTAQQRAAFSSAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIA

Query:  LCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQ
        LCGGE  V+PETVL+LNPQQLRQIG+SGRK+SYLHDLARKYQNGILSD AI+NMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL VRKGVQ
Subjt:  LCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQ

Query:  LLYNLEDLPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEHQHPQQPQLLDPLNSILNLG
        LLY L+DLPRPSQM+Q C KWRPYRSVGSWYMWRL EAK  S+S AAVAAG SL    ++ QQEH   QQ QL+DPLN + ++G
Subjt:  LLYNLEDLPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEHQHPQQPQLLDPLNSILNLG

AT1G75230.1 DNA glycosylase superfamily protein2.4e-11959.8Show/hide
Query:  MGEQTQVQVQTQTQSQSQAQSQAQNT-------LHDSSNSTTPIAQATVVLSEVMNAPSLT-----SSPPSKMPLRPRKIRKLSPDES-------DPNSS
        MGE +  Q  + T   +Q +S    T        +D  ++++     ++V S  + AP +T     SSPP+K+PLRPRKIRKLSPD+        + N S
Subjt:  MGEQTQVQVQTQTQSQSQAQSQAQNT-------LHDSSNSTTPIAQATVVLSEVMNAPSLT-----SSPPSKMPLRPRKIRKLSPDES-------DPNSS

Query:  QIVTIPDGPKPIAAGKSNKSKTAQQRAAFSSAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIY
        Q+ T     KP     + KSK +Q R    + P + ARSL+CEGE+E AL HLR+ DPLLA LID+H  PTF++FQTPFLAL RSILYQQLA KAG SIY
Subjt:  QIVTIPDGPKPIAAGKSNKSKTAQQRAAFSSAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIY

Query:  TRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNV
        TRF+ALCGGE GV+PE VL L PQQLRQIG+SGRK+SYLHDLARKYQNGILSD  IVNMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL V
Subjt:  TRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNV

Query:  RKGVQLLYNLEDLPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASL------QLQQQEHQQEHQ--HPQQPQLLDPLNSILNLGACA
        RKGVQ+L  +EDLPRPS+M+QLCEKWRPYRSV SWY+WRL E+K    +AAA  AGA+L       +QQQE +Q+HQ    QQPQL+DPLN++ ++G  A
Subjt:  RKGVQLLYNLEDLPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASL------QLQQQEHQQEHQ--HPQQPQLLDPLNSILNLGACA

Query:  WGQ
        WGQ
Subjt:  WGQ

AT1G75230.2 DNA glycosylase superfamily protein1.0e-11759.7Show/hide
Query:  MGEQTQVQVQTQTQSQSQAQSQAQNT-------LHDSSNSTTPIAQATVVLSEVMNAPSLT-----SSPPSKMPLRPRKIRKLSPDES-------DPNSS
        MGE +  Q  + T   +Q +S    T        +D  ++++     ++V S  + AP +T     SSPP+K+PLRPRKIRKLSPD+        + N S
Subjt:  MGEQTQVQVQTQTQSQSQAQSQAQNT-------LHDSSNSTTPIAQATVVLSEVMNAPSLT-----SSPPSKMPLRPRKIRKLSPDES-------DPNSS

Query:  QIVTIPDGPKPIAAGKSNKSKTAQQRAAFSSAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIY
        Q+ T     KP     + KSK +Q R    + P + ARSL+CEGE+E AL HLR+ DPLLA LID+H  PTF++FQTPFLAL RSILYQQLA KAG SIY
Subjt:  QIVTIPDGPKPIAAGKSNKSKTAQQRAAFSSAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIY

Query:  TRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNV
        TRF+ALCGGE GV+PE VL L PQQLRQIG+SGRK+SYLHDLARKYQNGILSD  IVNMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL V
Subjt:  TRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNV

Query:  RKGVQLLYNLEDLPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASL------QLQQQEHQQEHQ--HPQQPQLLDPLNSILNLG
        RKGVQ+L  +EDLPRPS+M+QLCEKWRPYRSV SWY+WRL E+K    +AAA  AGA+L       +QQQE +Q+HQ    QQPQL+DPLN++ ++G
Subjt:  RKGVQLLYNLEDLPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASL------QLQQQEHQQEHQ--HPQQPQLLDPLNSILNLG

AT3G50880.1 DNA glycosylase superfamily protein1.0e-7453.2Show/hide
Query:  TPIAQATVVLSEVMNAPSLT----SSPPSKMPLRPRKIRKLSPDESDPNSSQIVTIPDGPKPIAAGKSNKSKTAQQRAAFSSAPVLPARSLSCEGEVEIA
        T + Q+++    +++A +LT    S   S++  RPRKIRK+S   SDP+   I+T                         +S P      LS +  V+IA
Subjt:  TPIAQATVVLSEVMNAPSLT----SSPPSKMPLRPRKIRKLSPDESDPNSSQIVTIPDGPKPIAAGKSNKSKTAQQRAAFSSAPVLPARSLSCEGEVEIA

Query:  LRHLRNADPLLAPLIDLH-QRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALC-GGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQ
        LRHL+++D LL  LI  H   P FDS  TPFL+L RSILYQQLA KA   IY RFI+L  GGEAGV+PE+V++L+   LR+IG+SGRK+SYLHDLA KY 
Subjt:  LRHLRNADPLLAPLIDLH-QRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALC-GGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQ

Query:  NGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAK
        NG+LSD  I+ M D+ L   LT+V GIG W+VHMFMIFSLHRPDVLP+ DL VRKGV+ LY L++LP P QM+QLCEKWRPYRSVGSWYMWRL E++
Subjt:  NGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGAGCAAACGCAAGTGCAGGTTCAGACTCAGACCCAAAGCCAATCTCAAGCGCAGTCGCAGGCTCAGAACACGCTTCATGATTCCTCCAACTCCACAACCCCTAT
CGCCCAAGCCACTGTTGTACTAAGCGAGGTGATGAATGCACCATCGCTAACCTCTTCTCCGCCATCCAAAATGCCCTTGCGTCCGCGGAAGATCCGAAAGCTCTCGCCCG
ACGAATCGGATCCAAATTCCTCACAGATTGTCACCATTCCGGATGGGCCGAAACCTATCGCCGCCGGAAAATCTAACAAGAGCAAGACGGCCCAACAACGCGCCGCCTTC
TCGTCTGCCCCAGTACTGCCTGCCCGATCACTCTCCTGCGAAGGCGAGGTGGAAATCGCGCTTCGGCATCTCCGGAATGCGGATCCGCTTCTTGCACCTTTGATCGACCT
CCATCAACGTCCTACCTTCGACAGTTTTCAAACCCCATTTCTTGCCCTGACTAGAAGTATCCTATATCAGCAGCTGGCTTACAAAGCTGGCACTTCTATCTACACCCGTT
TCATCGCCCTTTGTGGCGGCGAGGCTGGTGTTCTTCCCGAAACCGTTCTTGCCTTGAACCCTCAACAGCTCAGGCAAATTGGAATTTCGGGTCGTAAATCTAGTTACCTT
CATGACCTTGCGAGGAAATACCAAAATGGGATTCTTTCAGACCCGGCAATTGTAAATATGGACGATAAATCGCTTTTCACGATGCTCACAATGGTCAATGGAATTGGGTC
TTGGTCTGTTCATATGTTCATGATTTTCTCGCTGCACAGACCAGATGTGCTTCCTATCAACGATCTTAATGTTCGCAAAGGTGTTCAGCTTCTCTACAATCTTGAAGATT
TGCCTCGACCATCACAAATGGATCAGTTATGCGAGAAGTGGAGGCCGTATCGATCGGTTGGGTCATGGTATATGTGGAGGCTTGCTGAGGCAAAGGGGGCTTCTTCAAGC
GCAGCAGCAGTGGCTGCTGGTGCTAGTTTGCAACTGCAACAACAAGAGCACCAGCAGGAGCACCAGCATCCACAGCAGCCGCAGCTTCTTGACCCACTTAATAGCATTCT
CAATCTTGGGGCCTGTGCTTGGGGGCAGTGA
mRNA sequenceShow/hide mRNA sequence
GGGAAATCGAAAGCCAGTTGGGTTCGAGAACTGAGGAGCGAGCGGCCAGAGATAGAGACAGAGAAGAGAGAGCCTGAAGGAAGAACAACAGCACCGGAATTTCCTTCTCC
TCATTCGCTATAGCATCGACAATCAACATTTCTCTTTCCTGTATCCTATTTCAAATTCCCTAACCTGGTCAATTTCACTGTTAATTTCAATTTATCTTTCTGTTAGACTC
GAATCACTTCTCGCTCTTTCACCTGGGGCATCGAGTTTTGCCCTCTTTCTGATTCTCCTCCTCCGCCGCCGTATTGTCTCCGATTATTGGCTGTGTACTGTGTTTTCTGT
CAACCGAAAAGTCTGAGTGAAGGGGGAAGATACAGATTGAATTTAATCTCTTACATATGGGAGAGCAAACGCAAGTGCAGGTTCAGACTCAGACCCAAAGCCAATCTCAA
GCGCAGTCGCAGGCTCAGAACACGCTTCATGATTCCTCCAACTCCACAACCCCTATCGCCCAAGCCACTGTTGTACTAAGCGAGGTGATGAATGCACCATCGCTAACCTC
TTCTCCGCCATCCAAAATGCCCTTGCGTCCGCGGAAGATCCGAAAGCTCTCGCCCGACGAATCGGATCCAAATTCCTCACAGATTGTCACCATTCCGGATGGGCCGAAAC
CTATCGCCGCCGGAAAATCTAACAAGAGCAAGACGGCCCAACAACGCGCCGCCTTCTCGTCTGCCCCAGTACTGCCTGCCCGATCACTCTCCTGCGAAGGCGAGGTGGAA
ATCGCGCTTCGGCATCTCCGGAATGCGGATCCGCTTCTTGCACCTTTGATCGACCTCCATCAACGTCCTACCTTCGACAGTTTTCAAACCCCATTTCTTGCCCTGACTAG
AAGTATCCTATATCAGCAGCTGGCTTACAAAGCTGGCACTTCTATCTACACCCGTTTCATCGCCCTTTGTGGCGGCGAGGCTGGTGTTCTTCCCGAAACCGTTCTTGCCT
TGAACCCTCAACAGCTCAGGCAAATTGGAATTTCGGGTCGTAAATCTAGTTACCTTCATGACCTTGCGAGGAAATACCAAAATGGGATTCTTTCAGACCCGGCAATTGTA
AATATGGACGATAAATCGCTTTTCACGATGCTCACAATGGTCAATGGAATTGGGTCTTGGTCTGTTCATATGTTCATGATTTTCTCGCTGCACAGACCAGATGTGCTTCC
TATCAACGATCTTAATGTTCGCAAAGGTGTTCAGCTTCTCTACAATCTTGAAGATTTGCCTCGACCATCACAAATGGATCAGTTATGCGAGAAGTGGAGGCCGTATCGAT
CGGTTGGGTCATGGTATATGTGGAGGCTTGCTGAGGCAAAGGGGGCTTCTTCAAGCGCAGCAGCAGTGGCTGCTGGTGCTAGTTTGCAACTGCAACAACAAGAGCACCAG
CAGGAGCACCAGCATCCACAGCAGCCGCAGCTTCTTGACCCACTTAATAGCATTCTCAATCTTGGGGCCTGTGCTTGGGGGCAGTGATTCAGATTGAAAAGAGTACATCG
TTGCAGAGATCCCAATCAATTTACCCTCTGAATGAAAAGTATGCCAATTATGTGGGAATACAAGACTCAACAATGAATATTTGGTTCCTCGAGGTTGGATATTCTAGACT
TCCATACTAAATAATGAAATGACATGTGCTAGCTTTCTTTTCTAGGTTGGTTGACATCACTTTTATGGCCGTCAACGTGTATCATGTGGTGGGGATATTGATAGAATGTT
CCCCTGAATGTCTGCTTTCAATATGATTTAGCATGAGTTGGACTTCGAGAAAACCGCTTGTTCATTTGCGTATATTCAACGACAGAACAGTCTGTAATGACTTTATGATC
TATCTCTTACTCTTTTCTCCATTTCAAATTTTCTGCCACATTGGTCATAGTTTGTATATAGAAATACTCATGAATGCAGAGAATCAACAAGTGGCTGTGGCATTTGTTAT
TCGGACTCGTAATCTTATCTTATCTTTCCCCAACTGTTGGACATGGTGGGAGAAAAGACATTGCTCAGTAGCCTCGTTCGTCTTTCTTTCTTTTGTAAATATAGCATATA
TTGTTTATAATAGGCGTAGGATTGTAGATTTTACCACTGAGAACTTGATTTATAGGTACCATGAAGCTTGAATTCTGATGATGTGAAATGCTGGAGTTGGGTAACAGTCA
ACTTGGAATCAAATAACTTTGAATGTATGTCTGAAAGTGACAAGATGTAGGCCAAATACTTCCTCTTTTACATTGTTTTTGGTCTTTTTGGCAGGGCTAATTTCCTGCTG
TTCTCAAGTGATGGGCCATTGTATTCTGTCTCATGTATGTTTAGCAATAGCATCAAATGGCCCTTTGGGGTTTAATGGGTCAAAATCGACATCCAACTAACTTGTCGGTT
TTTTAAAATAAAACTGATCGTTTCTCTCTCTTTCCATAATTGGAC
Protein sequenceShow/hide protein sequence
MGEQTQVQVQTQTQSQSQAQSQAQNTLHDSSNSTTPIAQATVVLSEVMNAPSLTSSPPSKMPLRPRKIRKLSPDESDPNSSQIVTIPDGPKPIAAGKSNKSKTAQQRAAF
SSAPVLPARSLSCEGEVEIALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYL
HDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEDLPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSS
AAAVAAGASLQLQQQEHQQEHQHPQQPQLLDPLNSILNLGACAWGQ