; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0025948 (gene) of Chayote v1 genome

Gene IDSed0025948
OrganismSechium edule (Chayote v1)
DescriptionDNA-3-methyladenine glycosylase 1
Genome locationLG13:21553994..21557245
RNA-Seq ExpressionSed0025948
SyntenySed0025948
Gene Ontology termsGO:0006285 - base-excision repair, AP site formation (biological process)
GO:0006307 - DNA dealkylation involved in DNA repair (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0032993 - protein-DNA complex (cellular component)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
GO:0032131 - alkylated DNA binding (molecular function)
GO:0043916 - DNA-7-methylguanine glycosylase activity (molecular function)
InterPro domainsIPR003265 - HhH-GPD domain
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142016.1 uncharacterized protein LOC111012247 [Momordica charantia]1.6e-17887.83Show/hide
Query:  MGEQTQVQVQAQTQIQSQAQLQSQALNSLHEPSNSSTSIAQATVALSEVMNGPSQTSSPPSKMPLRPRKIRKLSPAETDPNSSRIAPVSDGAKPIATGKP
        MGEQTQVQVQ QT  QSQ+Q QSQA N++H+ SNS+TSIAQATVALSEVMN P+QTSSPPSKMPLRPRKIRKLSP E+D NSS+IAP++DG KPI++GK 
Subjt:  MGEQTQVQVQAQTQIQSQAQLQSQALNSLHEPSNSSTSIAQATVALSEVMNGPSQTSSPPSKMPLRPRKIRKLSPAETDPNSSRIAPVSDGAKPIATGKP

Query:  SKSRAAQQRA----APVLPARSLSCEGEVEISLRHLRNADPLLAPLIDLHQRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTRFISLCGGEAGVLPET
        +K++ AQQRA    AP+LPARSLSCEGEVEI+LRHLRNADPLLAPLIDLHQRPIFD+FQTPFLALTRSILYQQLAYKAGTSIYTRFI+LCGGEAGVLPET
Subjt:  SKSRAAQQRA----APVLPARSLSCEGEVEISLRHLRNADPLLAPLIDLHQRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTRFISLCGGEAGVLPET

Query:  VLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPS
        VLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLP+NDLNVRKGVQLLY+LE+LPRPS
Subjt:  VLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPS

Query:  QMDSLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEH----QPQLLDPLNSILNLGACGWGQ
        QMD LCEKWRPYRSVGSWYMWRLAEAKGASSSAAA+AAGASLQLQQQEHQQEH    QPQLLDP+NSILNLGAC WGQ
Subjt:  QMDSLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEH----QPQLLDPLNSILNLGACGWGQ

XP_022936456.1 probable DNA-3-methyladenine glycosylase 2 [Cucurbita moschata]1.8e-17486.24Show/hide
Query:  MGEQTQVQVQAQTQIQSQAQLQSQALNSLHEPSNSSTSIAQATVALSEVMNGPSQTSSPPSKMPLRPRKIRKLSPAETDPNSSRIAPVSDGAKPIATGKP
        MGE TQVQVQ QTQ QSQA  QSQA N+ HE SNS+T+IAQATVALSEVMN PSQTSSPPSKMPLRPRKIRKLSP E+DPNSS++  + DG KPIAT K 
Subjt:  MGEQTQVQVQAQTQIQSQAQLQSQALNSLHEPSNSSTSIAQATVALSEVMNGPSQTSSPPSKMPLRPRKIRKLSPAETDPNSSRIAPVSDGAKPIATGKP

Query:  SKSRAAQQRA----APVLPARSLSCEGEVEISLRHLRNADPLLAPLIDLHQRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTRFISLCGGEAGVLPET
        +KS+ AQQRA    APV+ ARSLSCEGEVE++LRHLRNADPLLAPLIDLHQRP FD+FQTPFLALTRSILYQQLAYKAGTSIYTRFI+LCGGEAGVLPET
Subjt:  SKSRAAQQRA----APVLPARSLSCEGEVEISLRHLRNADPLLAPLIDLHQRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTRFISLCGGEAGVLPET

Query:  VLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPS
        VL+L+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLP+NDLNVRKGVQLLY+LE+LPRPS
Subjt:  VLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPS

Query:  QMDSLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEH----QPQLLDPLNSILNLGACGWGQ
        QMD LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQEHQ +H    QPQLLDPLNSILNLGAC WGQ
Subjt:  QMDSLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEH----QPQLLDPLNSILNLGACGWGQ

XP_022976000.1 probable DNA-3-methyladenine glycosylase 2 [Cucurbita maxima]1.8e-17486.24Show/hide
Query:  MGEQTQVQVQAQTQIQSQAQLQSQALNSLHEPSNSSTSIAQATVALSEVMNGPSQTSSPPSKMPLRPRKIRKLSPAETDPNSSRIAPVSDGAKPIATGKP
        MGE TQVQVQ QTQ QSQA  QSQA N+ HE SNS+T+IAQATV+LSEVMN PSQTSSPPSKMPLRPRKIRKLSP E+DPNSS++  + DG KPIAT K 
Subjt:  MGEQTQVQVQAQTQIQSQAQLQSQALNSLHEPSNSSTSIAQATVALSEVMNGPSQTSSPPSKMPLRPRKIRKLSPAETDPNSSRIAPVSDGAKPIATGKP

Query:  SKSRAAQQRA----APVLPARSLSCEGEVEISLRHLRNADPLLAPLIDLHQRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTRFISLCGGEAGVLPET
        +KS+ AQQRA    APV+ ARSLSCEGEVE++LRHLRNADPLLAPLIDLHQRP FD+FQTPFLALTRSILYQQLAYKAGTSIYTRFI+LCGGEAGVLPET
Subjt:  SKSRAAQQRA----APVLPARSLSCEGEVEISLRHLRNADPLLAPLIDLHQRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTRFISLCGGEAGVLPET

Query:  VLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPS
        VL+L+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLP+NDLNVRKGVQLLY+LE+LPRPS
Subjt:  VLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPS

Query:  QMDSLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEH----QPQLLDPLNSILNLGACGWGQ
        QMD LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQEHQ EH    QPQLLDPLNSILNLGAC WGQ
Subjt:  QMDSLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEH----QPQLLDPLNSILNLGACGWGQ

XP_023536439.1 probable DNA-3-methyladenine glycosylase 2 [Cucurbita pepo subsp. pepo]8.2e-17586.51Show/hide
Query:  MGEQTQVQVQAQTQIQSQAQLQSQALNSLHEPSNSSTSIAQATVALSEVMNGPSQTSSPPSKMPLRPRKIRKLSPAETDPNSSRIAPVSDGAKPIATGKP
        MGE TQVQVQ QTQ QSQA  QSQA N+ HE SNS+T+IAQATVALSEVMN PSQTSSPPSKMPLRPRKIRKLSP E+DPNSS++  + DG KPIAT K 
Subjt:  MGEQTQVQVQAQTQIQSQAQLQSQALNSLHEPSNSSTSIAQATVALSEVMNGPSQTSSPPSKMPLRPRKIRKLSPAETDPNSSRIAPVSDGAKPIATGKP

Query:  SKSRAAQQRA----APVLPARSLSCEGEVEISLRHLRNADPLLAPLIDLHQRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTRFISLCGGEAGVLPET
        +KS+ AQQRA    APV+ ARSLSCEGEVE++LRHLRNADPLLAPLIDLHQRP FD+FQTPFLALTRSILYQQLAYKAGTSIYTRFI+LCGGEAGVLPET
Subjt:  SKSRAAQQRA----APVLPARSLSCEGEVEISLRHLRNADPLLAPLIDLHQRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTRFISLCGGEAGVLPET

Query:  VLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPS
        VL+L+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLP+NDLNVRKGVQLLY+LE+LPRPS
Subjt:  VLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPS

Query:  QMDSLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEH----QPQLLDPLNSILNLGACGWGQ
        QMD LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQEHQ EH    QPQLLDPLNSILNLGAC WGQ
Subjt:  QMDSLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEH----QPQLLDPLNSILNLGACGWGQ

XP_038904569.1 DNA-3-methyladenine glycosylase 1-like [Benincasa hispida]2.4e-17486.61Show/hide
Query:  MGEQTQVQVQAQTQIQSQAQLQSQALNSLHEPSNSSTSIAQATVALSEVMNGPSQTSSPPSKMPLRPRKIRKLSPAETDPNSSRIAPVSDGAKPIATGKP
        MGEQTQVQVQ QT  QSQ+Q QSQ  N+LHE SNS+T IAQATV LSEVMN PSQTSSPPSKMPLRPRKIRKLSP E+DPNSS    + DG KPIATGK 
Subjt:  MGEQTQVQVQAQTQIQSQAQLQSQALNSLHEPSNSSTSIAQATVALSEVMNGPSQTSSPPSKMPLRPRKIRKLSPAETDPNSSRIAPVSDGAKPIATGKP

Query:  SKSRAAQQRA----APVLPARSLSCEGEVEISLRHLRNADPLLAPLIDLHQRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTRFISLCGGEAGVLPET
        +KS+  QQRA    APV PARSLSCEGEVEI+LRHLRNADPLLA LIDLHQRP FD+FQTPFLALTRSILYQQLAYKAGTSIYTRFI+LCGGEAGVLPET
Subjt:  SKSRAAQQRA----APVLPARSLSCEGEVEISLRHLRNADPLLAPLIDLHQRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTRFISLCGGEAGVLPET

Query:  VLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPS
        VL+LNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLP+NDLNVRKGVQLLY+LE+LPRPS
Subjt:  VLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPS

Query:  QMDSLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEH-------QPQLLDPLNSILNLGACGWGQ
        QMD LCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEH QEH       QPQLLDPLNSILNLGAC WGQ
Subjt:  QMDSLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEH-------QPQLLDPLNSILNLGACGWGQ

TrEMBL top hitse value%identityAlignment
A0A1S3CRJ5 DNA-3-methyladenine glycosylase 12.7e-17185.56Show/hide
Query:  MGEQTQVQVQAQTQIQSQAQLQSQALNSLHEPSNSSTSIAQATVALSEVMNGPSQTSSPPSKMPLRPRKIRKLSPAETDPNSSRIAPVSDGAKPIATGKP
        MGEQTQVQVQ QT  QSQ Q QSQA N+ HE SNS+T IAQATV LSEVMN PSQ SSPPSKMPLRPRKIRKLSP E+DPNSS +  + DG KPIAT K 
Subjt:  MGEQTQVQVQAQTQIQSQAQLQSQALNSLHEPSNSSTSIAQATVALSEVMNGPSQTSSPPSKMPLRPRKIRKLSPAETDPNSSRIAPVSDGAKPIATGKP

Query:  SKSRAAQQRA----APVLPARSLSCEGEVEISLRHLRNADPLLAPLIDLHQRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTRFISLCGGEAGVLPET
        +KS+ AQQRA    A V  ARSLSCEGEVEI+LRHLRNADPLLA LIDLHQRP FD+FQTPFLALTRSILYQQLAYKAGTSIYTRFI+LCGGEAGVLPET
Subjt:  SKSRAAQQRA----APVLPARSLSCEGEVEISLRHLRNADPLLAPLIDLHQRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTRFISLCGGEAGVLPET

Query:  VLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPS
        VLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLP+NDLNVRKGVQLLY+LE+LPRPS
Subjt:  VLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPS

Query:  QMDSLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEH-------QPQLLDPLNSILNLGACGWGQ
        QMD LCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQ+H QEH       QPQLLDPLN ILNLGAC WGQ
Subjt:  QMDSLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEH-------QPQLLDPLNSILNLGACGWGQ

A0A6J1CKY0 uncharacterized protein LOC1110122477.7e-17987.83Show/hide
Query:  MGEQTQVQVQAQTQIQSQAQLQSQALNSLHEPSNSSTSIAQATVALSEVMNGPSQTSSPPSKMPLRPRKIRKLSPAETDPNSSRIAPVSDGAKPIATGKP
        MGEQTQVQVQ QT  QSQ+Q QSQA N++H+ SNS+TSIAQATVALSEVMN P+QTSSPPSKMPLRPRKIRKLSP E+D NSS+IAP++DG KPI++GK 
Subjt:  MGEQTQVQVQAQTQIQSQAQLQSQALNSLHEPSNSSTSIAQATVALSEVMNGPSQTSSPPSKMPLRPRKIRKLSPAETDPNSSRIAPVSDGAKPIATGKP

Query:  SKSRAAQQRA----APVLPARSLSCEGEVEISLRHLRNADPLLAPLIDLHQRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTRFISLCGGEAGVLPET
        +K++ AQQRA    AP+LPARSLSCEGEVEI+LRHLRNADPLLAPLIDLHQRPIFD+FQTPFLALTRSILYQQLAYKAGTSIYTRFI+LCGGEAGVLPET
Subjt:  SKSRAAQQRA----APVLPARSLSCEGEVEISLRHLRNADPLLAPLIDLHQRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTRFISLCGGEAGVLPET

Query:  VLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPS
        VLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLP+NDLNVRKGVQLLY+LE+LPRPS
Subjt:  VLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPS

Query:  QMDSLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEH----QPQLLDPLNSILNLGACGWGQ
        QMD LCEKWRPYRSVGSWYMWRLAEAKGASSSAAA+AAGASLQLQQQEHQQEH    QPQLLDP+NSILNLGAC WGQ
Subjt:  QMDSLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEH----QPQLLDPLNSILNLGACGWGQ

A0A6J1F7I4 probable DNA-3-methyladenine glycosylase 28.9e-17586.24Show/hide
Query:  MGEQTQVQVQAQTQIQSQAQLQSQALNSLHEPSNSSTSIAQATVALSEVMNGPSQTSSPPSKMPLRPRKIRKLSPAETDPNSSRIAPVSDGAKPIATGKP
        MGE TQVQVQ QTQ QSQA  QSQA N+ HE SNS+T+IAQATVALSEVMN PSQTSSPPSKMPLRPRKIRKLSP E+DPNSS++  + DG KPIAT K 
Subjt:  MGEQTQVQVQAQTQIQSQAQLQSQALNSLHEPSNSSTSIAQATVALSEVMNGPSQTSSPPSKMPLRPRKIRKLSPAETDPNSSRIAPVSDGAKPIATGKP

Query:  SKSRAAQQRA----APVLPARSLSCEGEVEISLRHLRNADPLLAPLIDLHQRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTRFISLCGGEAGVLPET
        +KS+ AQQRA    APV+ ARSLSCEGEVE++LRHLRNADPLLAPLIDLHQRP FD+FQTPFLALTRSILYQQLAYKAGTSIYTRFI+LCGGEAGVLPET
Subjt:  SKSRAAQQRA----APVLPARSLSCEGEVEISLRHLRNADPLLAPLIDLHQRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTRFISLCGGEAGVLPET

Query:  VLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPS
        VL+L+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLP+NDLNVRKGVQLLY+LE+LPRPS
Subjt:  VLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPS

Query:  QMDSLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEH----QPQLLDPLNSILNLGACGWGQ
        QMD LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQEHQ +H    QPQLLDPLNSILNLGAC WGQ
Subjt:  QMDSLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEH----QPQLLDPLNSILNLGACGWGQ

A0A6J1IIA5 probable DNA-3-methyladenine glycosylase 28.9e-17586.24Show/hide
Query:  MGEQTQVQVQAQTQIQSQAQLQSQALNSLHEPSNSSTSIAQATVALSEVMNGPSQTSSPPSKMPLRPRKIRKLSPAETDPNSSRIAPVSDGAKPIATGKP
        MGE TQVQVQ QTQ QSQA  QSQA N+ HE SNS+T+IAQATV+LSEVMN PSQTSSPPSKMPLRPRKIRKLSP E+DPNSS++  + DG KPIAT K 
Subjt:  MGEQTQVQVQAQTQIQSQAQLQSQALNSLHEPSNSSTSIAQATVALSEVMNGPSQTSSPPSKMPLRPRKIRKLSPAETDPNSSRIAPVSDGAKPIATGKP

Query:  SKSRAAQQRA----APVLPARSLSCEGEVEISLRHLRNADPLLAPLIDLHQRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTRFISLCGGEAGVLPET
        +KS+ AQQRA    APV+ ARSLSCEGEVE++LRHLRNADPLLAPLIDLHQRP FD+FQTPFLALTRSILYQQLAYKAGTSIYTRFI+LCGGEAGVLPET
Subjt:  SKSRAAQQRA----APVLPARSLSCEGEVEISLRHLRNADPLLAPLIDLHQRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTRFISLCGGEAGVLPET

Query:  VLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPS
        VL+L+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLP+NDLNVRKGVQLLY+LE+LPRPS
Subjt:  VLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPS

Query:  QMDSLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEH----QPQLLDPLNSILNLGACGWGQ
        QMD LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQEHQ EH    QPQLLDPLNSILNLGAC WGQ
Subjt:  QMDSLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEH----QPQLLDPLNSILNLGACGWGQ

A0A6J1J2W7 uncharacterized protein LOC1114807951.2e-17186.65Show/hide
Query:  QTQIQSQAQLQSQALNSLHEPSNSSTSIAQATVALSEVMNGPSQTSSPPSKMPLRPRKIRKLSPAETDPNSSRIAPVSDGAKPIATGKPSKSRAAQQRA-
        +TQ+Q Q Q QSQA N+LHE SNS+T+IAQAT+ALSEVMN PSQTSS PSKMPLRPRKIRKLSPAE+DPNSS+I  + DG KPI TGK  KS+ AQQRA 
Subjt:  QTQIQSQAQLQSQALNSLHEPSNSSTSIAQATVALSEVMNGPSQTSSPPSKMPLRPRKIRKLSPAETDPNSSRIAPVSDGAKPIATGKPSKSRAAQQRA-

Query:  ---APVLPARSLSCEGEVEISLRHLRNADPLLAPLIDLHQRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTRFISLCGGEAGVLPETVLSLNPQQLRQ
           APVLPARSLSCEGEVE++LRHLRNADPLLAPLIDLHQRP FD+FQTPFLALTRSILYQQLAYKAGTSIYTRFI+LCGG+AGVLPETVL+LN QQLRQ
Subjt:  ---APVLPARSLSCEGEVEISLRHLRNADPLLAPLIDLHQRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTRFISLCGGEAGVLPETVLSLNPQQLRQ

Query:  IGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPSQMDSLCEKWRP
        IGISGRKSSYLHDLARKYQNGILSD AIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLP+NDLNVRKGVQLLY+LE+LPRPSQMD LCEKW+P
Subjt:  IGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPSQMDSLCEKWRP

Query:  YRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQE----HQPQLLDPLNSILNLGACGWGQ
        YRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQE     QPQLLDPLNSILNLGAC WGQ
Subjt:  YRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQE----HQPQLLDPLNSILNLGACGWGQ

SwissProt top hitse value%identityAlignment
O31544 Putative DNA-3-methyladenine glycosylase YfjP2.1e-1931.33Show/hide
Query:  LTRSILYQQLAYKAGTSIYTRFISLCGGEA-GV----LPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGS
        + + I++QQL      ++  RF+   G +  GV     PET+  L+ Q LR +  S RK+ Y  D +R    G LS   + +M D+ +   L  + GIG 
Subjt:  LTRSILYQQLAYKAGTSIYTRFISLCGGEA-GV----LPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGS

Query:  WSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPSQMDSLCEKWRPYRSVGSWYMWRLAE
        W+V   ++F L RP++ P+ D+ ++  ++  + L+D P    M ++ ++W PY S  S Y+WR  E
Subjt:  WSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPSQMDSLCEKWRPYRSVGSWYMWRLAE

O94468 Alkylbase DNA glycosidase-like protein mag23.3e-1726.67Show/hide
Query:  PFLALTRSILYQQLAYKAGTSIYTRFISLCG-GEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQN-GILSDPAIVNMDDKSLFTMLTMVNGIG
        P+  + R+I  Q+L+  A  SI  +F + C   +    P+ ++  + + L + G S  KS  +H +A    N  I S   I  M ++ L   L+ + G+ 
Subjt:  PFLALTRSILYQQLAYKAGTSIYTRFISLCG-GEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQN-GILSDPAIVNMDDKSLFTMLTMVNGIG

Query:  SWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPSQMDSLCEKWRPYRSVGSWYMWRL
         W++ M+ IF+L R D++P +D  ++   +  + L   P+  +++ L +  +PYR++ +WY+W++
Subjt:  SWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPSQMDSLCEKWRPYRSVGSWYMWRL

P37878 DNA-3-methyladenine glycosylase6.0e-1124.56Show/hide
Query:  FLALTRSILYQQLAYKAGTSIYTRFISLCGG-------EAGVLP--ETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLT
        F AL   +L QQ+      S+  +F+   G        +  V P  E +  L P  L  I ++ +KS Y+  +AR   +G LS   ++ M+ K     L 
Subjt:  FLALTRSILYQQLAYKAGTSIYTRFISLCGG-------EAGVLP--ETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLT

Query:  MVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPSQMDSLCEKWRPYRSVGSWYMWRL
         + GIG W+ +  ++  L  P   P++D+ +   +++L ++   P   ++  +   W+ ++S  ++Y+WR+
Subjt:  MVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPSQMDSLCEKWRPYRSVGSWYMWRL

Q92383 DNA-3-methyladenine glycosylase 11.1e-2031.33Show/hide
Query:  PFLALTRSILYQQLAYKAGTSIYTRFISLCGGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGIL-SDPAIVNMDDKSLFTMLTMVNGIGS
        P+  L R++  QQL  KA  +I+ RF S+        PE +  ++ + +R  G S RK   L  +A    +G++ +      + ++ L   LT + GIG 
Subjt:  PFLALTRSILYQQLAYKAGTSIYTRFISLCGGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGIL-SDPAIVNMDDKSLFTMLTMVNGIGS

Query:  WSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPSQMDSLCEKWRPYRSVGSWYMWRLAE
        W+V M +IFSL+R DV+P +DL++R G + L+ L  +P    +    E   P+R+  +WY+W+ ++
Subjt:  WSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPSQMDSLCEKWRPYRSVGSWYMWRLAE

Arabidopsis top hitse value%identityAlignment
AT1G19480.1 DNA glycosylase superfamily protein1.5e-11360.41Show/hide
Query:  MGEQTQVQVQAQTQIQSQAQLQSQALNSLHEPSNS---------STSIAQATVALSEVMNGPSQTSSPPSKMPLRPRKIRKLS----------PAETDPN
        MGEQ+    Q  TQ QS  Q      ++L  P ++         S SI  +T   +  +      SSPPSK+PLRPRKIRKL+           AE   +
Subjt:  MGEQTQVQVQAQTQIQSQAQLQSQALNSLHEPSNS---------STSIAQATVALSEVMNGPSQTSSPPSKMPLRPRKIRKLS----------PAETDPN

Query:  SSRIAPVSDGAKPIATGKPSKSRAAQQRAAPVLPARSLSCEGEVEISLRHLRNADPLLAPLIDLHQRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTR
        S   +P++   K    GK S  RA      P + AR L+CEGE+E ++ +LRNADPLLA LID+H  P F++F+TPFLAL R+ILYQQLA KAG SIYTR
Subjt:  SSRIAPVSDGAKPIATGKPSKSRAAQQRAAPVLPARSLSCEGEVEISLRHLRNADPLLAPLIDLHQRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTR

Query:  FISLCGGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRK
        F+SLCGGE  V+PETVLSLNPQQLRQIG+SGRK+SYLHDLARKYQNGILSD AI+NMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL VRK
Subjt:  FISLCGGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRK

Query:  GVQLLYSLEDLPRPSQMDSLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEHQPQLLDPLNSILNLGACGWGQ
        GVQLLY L+DLPRPSQM+  C KWRPYRSVGSWYMWRL EAK  S+S AAVAAG SL   +   Q+  Q QL+DPLN + ++GA  WGQ
Subjt:  GVQLLYSLEDLPRPSQMDSLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEHQPQLLDPLNSILNLGACGWGQ

AT1G19480.2 DNA glycosylase superfamily protein6.2e-11260.31Show/hide
Query:  MGEQTQVQVQAQTQIQSQAQLQSQALNSLHEPSNS---------STSIAQATVALSEVMNGPSQTSSPPSKMPLRPRKIRKLS----------PAETDPN
        MGEQ+    Q  TQ QS  Q      ++L  P ++         S SI  +T   +  +      SSPPSK+PLRPRKIRKL+           AE   +
Subjt:  MGEQTQVQVQAQTQIQSQAQLQSQALNSLHEPSNS---------STSIAQATVALSEVMNGPSQTSSPPSKMPLRPRKIRKLS----------PAETDPN

Query:  SSRIAPVSDGAKPIATGKPSKSRAAQQRAAPVLPARSLSCEGEVEISLRHLRNADPLLAPLIDLHQRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTR
        S   +P++   K    GK S  RA      P + AR L+CEGE+E ++ +LRNADPLLA LID+H  P F++F+TPFLAL R+ILYQQLA KAG SIYTR
Subjt:  SSRIAPVSDGAKPIATGKPSKSRAAQQRAAPVLPARSLSCEGEVEISLRHLRNADPLLAPLIDLHQRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTR

Query:  FISLCGGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRK
        F+SLCGGE  V+PETVLSLNPQQLRQIG+SGRK+SYLHDLARKYQNGILSD AI+NMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL VRK
Subjt:  FISLCGGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRK

Query:  GVQLLYSLEDLPRPSQMDSLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEHQPQLLDPLNSILNLG
        GVQLLY L+DLPRPSQM+  C KWRPYRSVGSWYMWRL EAK  S+S AAVAAG SL   +   Q+  Q QL+DPLN + ++G
Subjt:  GVQLLYSLEDLPRPSQMDSLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEHQPQLLDPLNSILNLG

AT1G75230.1 DNA glycosylase superfamily protein3.0e-11458.63Show/hide
Query:  MGEQTQVQVQAQTQIQSQAQLQS-QALNSLHEPSNSSTSIAQA----TVALSEVMNGPSQT-----SSPPSKMPLRPRKIRKLSPAETDPNSSRIAPVSD
        MGE +  Q  + T   +Q +  + +  N +   +N   S + A    ++  S  +  P  T     SSPP+K+PLRPRKIRKLSP   D  S    P  +
Subjt:  MGEQTQVQVQAQTQIQSQAQLQS-QALNSLHEPSNSSTSIAQA----TVALSEVMNGPSQT-----SSPPSKMPLRPRKIRKLSPAETDPNSSRIAPVSD

Query:  GAKPIATGKPSKSRAAQQR--AAPVLPARSLSCEGEVEISLRHLRNADPLLAPLIDLHQRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTRFISLCGG
         ++   T   +KS+ +Q R    P + ARSL+CEGE+E +L HLR+ DPLLA LID+H  P F+TFQTPFLAL RSILYQQLA KAG SIYTRF++LCGG
Subjt:  GAKPIATGKPSKSRAAQQR--AAPVLPARSLSCEGEVEISLRHLRNADPLLAPLIDLHQRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTRFISLCGG

Query:  EAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYS
        E GV+PE VL L PQQLRQIG+SGRK+SYLHDLARKYQNGILSD  IVNMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL VRKGVQ+L  
Subjt:  EAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYS

Query:  LEDLPRPSQMDSLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASL------QLQQQEHQQEH------QPQLLDPLNSILNLGACGWGQ
        +EDLPRPS+M+ LCEKWRPYRSV SWY+WRL E+K    +AAA  AGA+L       +QQQE +Q+H      QPQL+DPLN++ ++GA  WGQ
Subjt:  LEDLPRPSQMDSLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASL------QLQQQEHQQEH------QPQLLDPLNSILNLGACGWGQ

AT1G75230.2 DNA glycosylase superfamily protein1.2e-11258.51Show/hide
Query:  MGEQTQVQVQAQTQIQSQAQLQS-QALNSLHEPSNSSTSIAQA----TVALSEVMNGPSQT-----SSPPSKMPLRPRKIRKLSPAETDPNSSRIAPVSD
        MGE +  Q  + T   +Q +  + +  N +   +N   S + A    ++  S  +  P  T     SSPP+K+PLRPRKIRKLSP   D  S    P  +
Subjt:  MGEQTQVQVQAQTQIQSQAQLQS-QALNSLHEPSNSSTSIAQA----TVALSEVMNGPSQT-----SSPPSKMPLRPRKIRKLSPAETDPNSSRIAPVSD

Query:  GAKPIATGKPSKSRAAQQR--AAPVLPARSLSCEGEVEISLRHLRNADPLLAPLIDLHQRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTRFISLCGG
         ++   T   +KS+ +Q R    P + ARSL+CEGE+E +L HLR+ DPLLA LID+H  P F+TFQTPFLAL RSILYQQLA KAG SIYTRF++LCGG
Subjt:  GAKPIATGKPSKSRAAQQR--AAPVLPARSLSCEGEVEISLRHLRNADPLLAPLIDLHQRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTRFISLCGG

Query:  EAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYS
        E GV+PE VL L PQQLRQIG+SGRK+SYLHDLARKYQNGILSD  IVNMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL VRKGVQ+L  
Subjt:  EAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYS

Query:  LEDLPRPSQMDSLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASL------QLQQQEHQQEH------QPQLLDPLNSILNLG
        +EDLPRPS+M+ LCEKWRPYRSV SWY+WRL E+K    +AAA  AGA+L       +QQQE +Q+H      QPQL+DPLN++ ++G
Subjt:  LEDLPRPSQMDSLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASL------QLQQQEHQQEH------QPQLLDPLNSILNLG

AT3G50880.1 DNA glycosylase superfamily protein1.3e-7252.63Show/hide
Query:  VALSEVMNGPSQTSSPPSKMPLRPRKIRKLSPAETDPNSSRIAPVSDGAKPIATGKPSKSRAAQQRAAP---VLPARSLSCEGEVEISLRHLRNADPLLA
        ++L + +   +Q+S PP  +         L+ +E   +SSRI             +P K R      +P   +  +  LS +  V+I+LRHL+++D LL 
Subjt:  VALSEVMNGPSQTSSPPSKMPLRPRKIRKLSPAETDPNSSRIAPVSDGAKPIATGKPSKSRAAQQRAAP---VLPARSLSCEGEVEISLRHLRNADPLLA

Query:  PLIDLH-QRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTRFISLC-GGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM
         LI  H   P+FD+  TPFL+L RSILYQQLA KA   IY RFISL  GGEAGV+PE+V+SL+   LR+IG+SGRK+SYLHDLA KY NG+LSD  I+ M
Subjt:  PLIDLH-QRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTRFISLC-GGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM

Query:  DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPSQMDSLCEKWRPYRSVGSWYMWRLAEAK
         D+ L   LT+V GIG W+VHMFMIFSLHRPDVLP+ DL VRKGV+ LY L++LP P QM+ LCEKWRPYRSVGSWYMWRL E++
Subjt:  DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPSQMDSLCEKWRPYRSVGSWYMWRLAEAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGAGCAAACGCAAGTTCAGGTTCAGGCTCAGACCCAAATCCAATCTCAAGCGCAATTGCAATCGCAGGCTTTGAATTCGCTTCATGAACCTTCGAATTCCTCAAC
CTCTATCGCTCAAGCCACTGTAGCTCTAAGCGAGGTGATGAATGGGCCATCACAAACCTCTTCTCCGCCGTCCAAAATGCCCTTGCGTCCACGGAAGATTCGAAAGCTCT
CTCCCGCTGAAACTGACCCGAATTCGTCTCGGATTGCCCCTGTTTCGGATGGGGCGAAACCGATCGCCACTGGAAAACCTAGCAAGAGCAGGGCAGCTCAACAACGGGCG
GCCCCGGTACTGCCTGCCCGATCGCTTTCTTGTGAAGGAGAAGTTGAAATTTCGCTTCGGCATCTTCGGAATGCTGATCCGCTCCTTGCACCGTTGATTGATCTCCATCA
ACGTCCTATTTTTGATACTTTTCAAACCCCATTCCTTGCCCTAACTAGAAGTATCCTGTATCAGCAATTGGCTTACAAAGCTGGCACTTCAATCTACACCCGTTTCATTT
CCCTTTGTGGCGGCGAGGCTGGTGTTCTACCCGAAACTGTTCTTTCCTTGAATCCTCAACAGCTTAGGCAAATTGGCATTTCGGGTCGGAAATCTAGTTACCTTCATGAT
CTTGCTAGGAAGTACCAAAATGGGATTCTTTCAGACCCGGCAATTGTTAATATGGATGATAAATCGCTTTTCACTATGCTCACAATGGTCAATGGAATTGGGTCTTGGTC
TGTTCATATGTTCATGATTTTCTCGCTGCACAGACCAGATGTGCTTCCTATGAATGATCTTAATGTTCGCAAAGGTGTTCAGCTTCTCTATAGTCTTGAAGACTTGCCTC
GACCATCGCAAATGGATTCGTTATGCGAGAAGTGGAGGCCGTATCGATCGGTTGGGTCGTGGTATATGTGGAGGCTTGCTGAGGCAAAGGGTGCTTCTTCAAGTGCAGCA
GCAGTGGCTGCTGGTGCTAGTTTACAACTGCAGCAACAAGAGCACCAGCAGGAGCACCAGCCGCAGCTTCTTGACCCACTTAATAGCATTCTCAATCTTGGGGCCTGTGG
TTGGGGGCAGTGA
mRNA sequenceShow/hide mRNA sequence
CCCAGTTGCGTTCAAGAACTGAGAAGCAAGCGGCCAGAAACAAGAGAAGAGAAAGAAGAAGAACGAGCCAAGGTTTCATTCTCTTCAATCGCTATACATTTGTATTTTCT
CTAACCCTAATTCAATTTCACTGTTGATTTTCAATTTCTCTTTCCGTTTCGATTCAATCCGCTTCTTGTTCTTTCATCGGAAGCATCGATTTTCGCCCTTTTTCTGAATC
TCCGCCGTCGCTTCCGCCGCCGTGATTCTTGGGTGAAGGGGAGAAAATATAGGTTGATTTGAGTTTGTTTTTGTTTTCATATGGGAGAGCAAACGCAAGTTCAGGTTCAG
GCTCAGACCCAAATCCAATCTCAAGCGCAATTGCAATCGCAGGCTTTGAATTCGCTTCATGAACCTTCGAATTCCTCAACCTCTATCGCTCAAGCCACTGTAGCTCTAAG
CGAGGTGATGAATGGGCCATCACAAACCTCTTCTCCGCCGTCCAAAATGCCCTTGCGTCCACGGAAGATTCGAAAGCTCTCTCCCGCTGAAACTGACCCGAATTCGTCTC
GGATTGCCCCTGTTTCGGATGGGGCGAAACCGATCGCCACTGGAAAACCTAGCAAGAGCAGGGCAGCTCAACAACGGGCGGCCCCGGTACTGCCTGCCCGATCGCTTTCT
TGTGAAGGAGAAGTTGAAATTTCGCTTCGGCATCTTCGGAATGCTGATCCGCTCCTTGCACCGTTGATTGATCTCCATCAACGTCCTATTTTTGATACTTTTCAAACCCC
ATTCCTTGCCCTAACTAGAAGTATCCTGTATCAGCAATTGGCTTACAAAGCTGGCACTTCAATCTACACCCGTTTCATTTCCCTTTGTGGCGGCGAGGCTGGTGTTCTAC
CCGAAACTGTTCTTTCCTTGAATCCTCAACAGCTTAGGCAAATTGGCATTTCGGGTCGGAAATCTAGTTACCTTCATGATCTTGCTAGGAAGTACCAAAATGGGATTCTT
TCAGACCCGGCAATTGTTAATATGGATGATAAATCGCTTTTCACTATGCTCACAATGGTCAATGGAATTGGGTCTTGGTCTGTTCATATGTTCATGATTTTCTCGCTGCA
CAGACCAGATGTGCTTCCTATGAATGATCTTAATGTTCGCAAAGGTGTTCAGCTTCTCTATAGTCTTGAAGACTTGCCTCGACCATCGCAAATGGATTCGTTATGCGAGA
AGTGGAGGCCGTATCGATCGGTTGGGTCGTGGTATATGTGGAGGCTTGCTGAGGCAAAGGGTGCTTCTTCAAGTGCAGCAGCAGTGGCTGCTGGTGCTAGTTTACAACTG
CAGCAACAAGAGCACCAGCAGGAGCACCAGCCGCAGCTTCTTGACCCACTTAATAGCATTCTCAATCTTGGGGCCTGTGGTTGGGGGCAGTGATTCAGACTGAAAAGATT
ACATCTTTGCAAATATCTCAATCAATCTATCCACTGAATGAAAAAGTATGCCAATCATGCGGGAGTAGAAGACCTCAACAATGAATATTTGGTTCCTCGAGGTTGGATAT
ATTCTAGACATCCATACTAAATAATGAAATGACATGTGCTAGTTTTCTTTTCTTAGGTTGGTTGACATCGCCTTACGGCCGTCAACGTGTATCATGTGATGGGGATATTG
ATAGAATTTTCCTCTGAATGTCTTCTGCTTTTGTTGTTATTTAGCATGACTTGGACGTCGAGAAAACTGCTCGTTGGTTGCGCATATTCAGCGATAGAACAGTTTGTATT
GACTTTATAATCTATCTCTTACCCTTTTCTCCGTTTCAATTTTTCTTCCACATTGGTCATAGCTTGTATATAGGAAATAATCACGAATGCAGATGATCAATTATGGACTG
TGG
Protein sequenceShow/hide protein sequence
MGEQTQVQVQAQTQIQSQAQLQSQALNSLHEPSNSSTSIAQATVALSEVMNGPSQTSSPPSKMPLRPRKIRKLSPAETDPNSSRIAPVSDGAKPIATGKPSKSRAAQQRA
APVLPARSLSCEGEVEISLRHLRNADPLLAPLIDLHQRPIFDTFQTPFLALTRSILYQQLAYKAGTSIYTRFISLCGGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHD
LARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPMNDLNVRKGVQLLYSLEDLPRPSQMDSLCEKWRPYRSVGSWYMWRLAEAKGASSSAA
AVAAGASLQLQQQEHQQEHQPQLLDPLNSILNLGACGWGQ