; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg08720 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg08720
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionDNA-3-methyladenine glycosylase 1
Genome locationCarg_Chr09:658507..663211
RNA-Seq ExpressionCarg08720
SyntenyCarg08720
Gene Ontology termsGO:0006285 - base-excision repair, AP site formation (biological process)
GO:0006307 - DNA dealkylation involved in DNA repair (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0032993 - protein-DNA complex (cellular component)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
GO:0032131 - alkylated DNA binding (molecular function)
GO:0043916 - DNA-7-methylguanine glycosylase activity (molecular function)
InterPro domainsIPR003265 - HhH-GPD domain
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7024195.1 mag1 [Cucurbita argyrosperma subsp. argyrosperma]3.9e-251100Show/hide
Query:  MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNK
        MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNK
Subjt:  MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNK

Query:  SKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        ALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLLDPLNSILNLGKRFGSVLKKFGLCLGAVTRIEKSKSLQIAQ
        DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLLDPLNSILNLGKRFGSVLKKFGLCLGAVTRIEKSKSLQIAQ
Subjt:  DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLLDPLNSILNLGKRFGSVLKKFGLCLGAVTRIEKSKSLQIAQ

Query:  SVHLLNDKYANYVGVEDSAMDIWLLEVDVTFKAVNMYHVARNIDIMFLLNVCAFNVF
        SVHLLNDKYANYVGVEDSAMDIWLLEVDVTFKAVNMYHVARNIDIMFLLNVCAFNVF
Subjt:  SVHLLNDKYANYVGVEDSAMDIWLLEVDVTFKAVNMYHVARNIDIMFLLNVCAFNVF

XP_022142016.1 uncharacterized protein LOC111012247 [Momordica charantia]2.2e-18592.16Show/hide
Query:  MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNK
        MGE TQVQVQTQTQSQSQ QSQAQNT H+SSNSTT+IAQATVALSEVMNAP+QTSSPPSKMPLRPRKIRKLSPDESD NSSQ+  + DGPKPI++ KSNK
Subjt:  MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNK

Query:  SKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        +KTAQQRAAFASAP++ ARSLSCEGEVE+ALRHLRNADPLLAPLIDLHQRP FDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        +L+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLLDPLNSILNLG
        D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQEHQ EHQH QQPQLLDP+NSILNLG
Subjt:  DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLLDPLNSILNLG

XP_022936456.1 probable DNA-3-methyladenine glycosylase 2 [Cucurbita moschata]9.1e-20099.73Show/hide
Query:  MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNK
        MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNK
Subjt:  MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNK

Query:  SKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        ALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLLDPLNSILNLG
        DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQP+HQHQQQPQLLDPLNSILNLG
Subjt:  DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLLDPLNSILNLG

XP_022976000.1 probable DNA-3-methyladenine glycosylase 2 [Cucurbita maxima]9.1e-20099.73Show/hide
Query:  MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNK
        MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATV+LSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNK
Subjt:  MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNK

Query:  SKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        ALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLLDPLNSILNLG
        DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLLDPLNSILNLG
Subjt:  DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLLDPLNSILNLG

XP_023536439.1 probable DNA-3-methyladenine glycosylase 2 [Cucurbita pepo subsp. pepo]4.1e-200100Show/hide
Query:  MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNK
        MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNK
Subjt:  MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNK

Query:  SKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        ALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLLDPLNSILNLG
        DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLLDPLNSILNLG
Subjt:  DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLLDPLNSILNLG

TrEMBL top hitse value%identityAlignment
A0A1S3CRJ5 DNA-3-methyladenine glycosylase 18.9e-18593.03Show/hide
Query:  MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNK
        MGE TQVQVQTQTQSQ Q QSQAQNTFHESSNSTT IAQATV LSEVMNAPSQ SSPPSKMPLRPRKIRKLSP+ESDPNSS VVAIPDGPKPIAT KSNK
Subjt:  MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNK

Query:  SKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTAQQRAAFASA V LARSLSCEGEVE+ALRHLRNADPLLA LIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        +L+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEH---QHQQQPQLLDPLNSILNLG
        D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQ+H  EH   QH QQPQLLDPLN ILNLG
Subjt:  DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEH---QHQQQPQLLDPLNSILNLG

A0A5A7TDX5 DNA-3-methyladenine glycosylase 18.9e-18593.03Show/hide
Query:  MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNK
        MGE TQVQVQTQTQSQ Q QSQAQNTFHESSNSTT IAQATV LSEVMNAPSQ SSPPSKMPLRPRKIRKLSP+ESDPNSS VVAIPDGPKPIAT KSNK
Subjt:  MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNK

Query:  SKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTAQQRAAFASA V LARSLSCEGEVE+ALRHLRNADPLLA LIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        +L+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEH---QHQQQPQLLDPLNSILNLG
        D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQ+H  EH   QH QQPQLLDPLN ILNLG
Subjt:  DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEH---QHQQQPQLLDPLNSILNLG

A0A6J1CKY0 uncharacterized protein LOC1110122471.1e-18592.16Show/hide
Query:  MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNK
        MGE TQVQVQTQTQSQSQ QSQAQNT H+SSNSTT+IAQATVALSEVMNAP+QTSSPPSKMPLRPRKIRKLSPDESD NSSQ+  + DGPKPI++ KSNK
Subjt:  MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNK

Query:  SKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        +KTAQQRAAFASAP++ ARSLSCEGEVE+ALRHLRNADPLLAPLIDLHQRP FDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        +L+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLLDPLNSILNLG
        D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQEHQ EHQH QQPQLLDP+NSILNLG
Subjt:  DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLLDPLNSILNLG

A0A6J1F7I4 probable DNA-3-methyladenine glycosylase 24.4e-20099.73Show/hide
Query:  MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNK
        MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNK
Subjt:  MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNK

Query:  SKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        ALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLLDPLNSILNLG
        DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQP+HQHQQQPQLLDPLNSILNLG
Subjt:  DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLLDPLNSILNLG

A0A6J1IIA5 probable DNA-3-methyladenine glycosylase 24.4e-20099.73Show/hide
Query:  MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNK
        MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATV+LSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNK
Subjt:  MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNK

Query:  SKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
        SKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL
Subjt:  SKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVL

Query:  ALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        ALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLLDPLNSILNLG
        DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLLDPLNSILNLG
Subjt:  DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLLDPLNSILNLG

SwissProt top hitse value%identityAlignment
O31544 Putative DNA-3-methyladenine glycosylase YfjP4.8e-1826.89Show/hide
Query:  LARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEA-GV----LPETVLALSPQQLRQIG
        + R    E  ++  L H       L+ + + H         + +  + + I++QQL      ++  RF+   G +  GV     PET+  L  Q LR + 
Subjt:  LARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEA-GV----LPETVLALSPQQLRQIG

Query:  ISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDHLCEKWRPYR
         S RK+ Y  D +R    G LS   + +M D+ +   L  + GIG W+V   ++F L RP++ P+ D+ ++  ++  + L++ P    M  + ++W PY 
Subjt:  ISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDHLCEKWRPYR

Query:  SVGSWYMWRFAE
        S  S Y+WR  E
Subjt:  SVGSWYMWRFAE

O94468 Alkylbase DNA glycosidase-like protein mag22.0e-1625Show/hide
Query:  LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSF--QTPFLALTRSILYQQLAYKAGTSIYTRFIALCG-GEAGVLPETVLALSPQQLRQIGISGRKS
        +S + + + A +HL + D   + L+      T        P+  + R+I  Q+L+  A  SI  +F   C   +    P+ ++    + L + G S  KS
Subjt:  LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSF--QTPFLALTRSILYQQLAYKAGTSIYTRFIALCG-GEAGVLPETVLALSPQQLRQIGISGRKS

Query:  SYLHDLARKYQN-GILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDHLCEKWRPYRSVGSW
          +H +A    N  I S   I  M ++ L   L+ + G+  W++ M+ IF+L R D++P +D  ++   +  + L   P+  +++ L +  +PYR++ +W
Subjt:  SYLHDLARKYQN-GILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDHLCEKWRPYRSVGSW

Query:  YMWR
        Y+W+
Subjt:  YMWR

Q92383 DNA-3-methyladenine glycosylase 16.7e-2031.29Show/hide
Query:  PFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGIL-SDPAIVNMDDKSLFTMLTMVNGIGS
        P+  L R++  QQL  KA  +I+ RF ++        PE +  +  + +R  G S RK   L  +A    +G++ +      + ++ L   LT + GIG 
Subjt:  PFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGIL-SDPAIVNMDDKSLFTMLTMVNGIGS

Query:  WSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDHLCEKWRPYRSVGSWYMWR
        W+V M +IFSL+R DV+P +DL++R G + L+ L ++P    +    E   P+R+  +WY+W+
Subjt:  WSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDHLCEKWRPYRSVGSWYMWR

Arabidopsis top hitse value%identityAlignment
AT1G19480.1 DNA glycosylase superfamily protein2.1e-10958.12Show/hide
Query:  MGEHTQVQVQTQTQSQSQAQS---------QAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPD---ESDPNSSQVVAIPD
        MGE +  Q  TQ QS  Q+           ++ +   +S+  + +I  +T   +  +      SSPPSK+PLRPRKIRKL+ D     +   ++ ++   
Subjt:  MGEHTQVQVQTQTQSQSQAQS---------QAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPD---ESDPNSSQVVAIPD

Query:  GPKPIATSKSNKSKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALC
           P+AT   +  K         + P + AR L+CEGE+E A+ +LRNADPLLA LID+H  PTF+SF+TPFLAL R+ILYQQLA KAG SIYTRF++LC
Subjt:  GPKPIATSKSNKSKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALC

Query:  GGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLL
        GGE  V+PETVL+L+PQQLRQIG+SGRK+SYLHDLARKYQNGILSD AI+NMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL VRKGVQLL
Subjt:  GGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLL

Query:  YNLEELPRPSQMDHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLLDPLNSILNLG
        Y L++LPRPSQM+  C KWRPYRSVGSWYMWR  EAK  S+S AA+AAG SL    ++ Q EHQ Q   QL+DPLN + ++G
Subjt:  YNLEELPRPSQMDHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLLDPLNSILNLG

AT1G19480.2 DNA glycosylase superfamily protein2.1e-10958.12Show/hide
Query:  MGEHTQVQVQTQTQSQSQAQS---------QAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPD---ESDPNSSQVVAIPD
        MGE +  Q  TQ QS  Q+           ++ +   +S+  + +I  +T   +  +      SSPPSK+PLRPRKIRKL+ D     +   ++ ++   
Subjt:  MGEHTQVQVQTQTQSQSQAQS---------QAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPD---ESDPNSSQVVAIPD

Query:  GPKPIATSKSNKSKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALC
           P+AT   +  K         + P + AR L+CEGE+E A+ +LRNADPLLA LID+H  PTF+SF+TPFLAL R+ILYQQLA KAG SIYTRF++LC
Subjt:  GPKPIATSKSNKSKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALC

Query:  GGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLL
        GGE  V+PETVL+L+PQQLRQIG+SGRK+SYLHDLARKYQNGILSD AI+NMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL VRKGVQLL
Subjt:  GGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLL

Query:  YNLEELPRPSQMDHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLLDPLNSILNLG
        Y L++LPRPSQM+  C KWRPYRSVGSWYMWR  EAK  S+S AA+AAG SL    ++ Q EHQ Q   QL+DPLN + ++G
Subjt:  YNLEELPRPSQMDHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLLDPLNSILNLG

AT1G75230.1 DNA glycosylase superfamily protein3.3e-11558.19Show/hide
Query:  MGEHTQVQVQTQTQSQSQAQSQAQNT-------FHESSNSTTTIAQATVALSEVMNAPSQT-----SSPPSKMPLRPRKIRKLSPDES-------DPNSS
        MGEH+  Q  + T   +Q +S    T        ++  ++++     ++  S  + AP  T     SSPP+K+PLRPRKIRKLSPD+        + N S
Subjt:  MGEHTQVQVQTQTQSQSQAQSQAQNT-------FHESSNSTTTIAQATVALSEVMNAPSQT-----SSPPSKMPLRPRKIRKLSPDES-------DPNSS

Query:  QVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIY
        Q+           T  + KSK +Q R    + P + ARSL+CEGE+E AL HLR+ DPLLA LID+H  PTF++FQTPFLAL RSILYQQLA KAG SIY
Subjt:  QVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIY

Query:  TRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNV
        TRF+ALCGGE GV+PE VL L+PQQLRQIG+SGRK+SYLHDLARKYQNGILSD  IVNMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL V
Subjt:  TRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNV

Query:  RKGVQLLYNLEELPRPSQMDHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASL------QLQQQEHQPEHQ--HQQQPQLLDPLNSILNLG
        RKGVQ+L  +E+LPRPS+M+ LCEKWRPYRSV SWY+WR  E+K    +AAA  AGA+L       +QQQE + +HQ   QQQPQL+DPLN++ ++G
Subjt:  RKGVQLLYNLEELPRPSQMDHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASL------QLQQQEHQPEHQ--HQQQPQLLDPLNSILNLG

AT1G75230.2 DNA glycosylase superfamily protein3.3e-11558.19Show/hide
Query:  MGEHTQVQVQTQTQSQSQAQSQAQNT-------FHESSNSTTTIAQATVALSEVMNAPSQT-----SSPPSKMPLRPRKIRKLSPDES-------DPNSS
        MGEH+  Q  + T   +Q +S    T        ++  ++++     ++  S  + AP  T     SSPP+K+PLRPRKIRKLSPD+        + N S
Subjt:  MGEHTQVQVQTQTQSQSQAQSQAQNT-------FHESSNSTTTIAQATVALSEVMNAPSQT-----SSPPSKMPLRPRKIRKLSPDES-------DPNSS

Query:  QVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIY
        Q+           T  + KSK +Q R    + P + ARSL+CEGE+E AL HLR+ DPLLA LID+H  PTF++FQTPFLAL RSILYQQLA KAG SIY
Subjt:  QVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIY

Query:  TRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNV
        TRF+ALCGGE GV+PE VL L+PQQLRQIG+SGRK+SYLHDLARKYQNGILSD  IVNMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL V
Subjt:  TRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNV

Query:  RKGVQLLYNLEELPRPSQMDHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASL------QLQQQEHQPEHQ--HQQQPQLLDPLNSILNLG
        RKGVQ+L  +E+LPRPS+M+ LCEKWRPYRSV SWY+WR  E+K    +AAA  AGA+L       +QQQE + +HQ   QQQPQL+DPLN++ ++G
Subjt:  RKGVQLLYNLEELPRPSQMDHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASL------QLQQQEHQPEHQ--HQQQPQLLDPLNSILNLG

AT3G50880.1 DNA glycosylase superfamily protein2.4e-7355.43Show/hide
Query:  SQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLH-QR
        S+ S   S++  RPRKIRK+S D S             P+ I T               AS P      LS +  V++ALRHL+++D LL  LI  H   
Subjt:  SQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLH-QR

Query:  PTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALC-GGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTML
        P FDS  TPFL+L RSILYQQLA KA   IY RFI+L  GGEAGV+PE+V++LS   LR+IG+SGRK+SYLHDLA KY NG+LSD  I+ M D+ L   L
Subjt:  PTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALC-GGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTML

Query:  TMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDHLCEKWRPYRSVGSWYMWRFAEAK
        T+V GIG W+VHMFMIFSLHRPDVLP+ DL VRKGV+ LY L+ LP P QM+ LCEKWRPYRSVGSWYMWR  E++
Subjt:  TMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDHLCEKWRPYRSVGSWYMWRFAEAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGAGCACACGCAAGTGCAGGTTCAGACTCAGACCCAAAGCCAATCTCAAGCGCAATCGCAGGCTCAGAACACGTTTCATGAATCCTCCAACTCCACAACCACTAT
CGCCCAAGCTACTGTGGCATTAAGCGAGGTGATGAATGCGCCATCGCAAACCTCTTCTCCGCCGTCCAAAATGCCCTTGCGTCCGCGGAAGATCCGGAAGCTCTCGCCTG
ATGAATCGGATCCAAATTCCTCTCAGGTTGTTGCCATTCCGGATGGGCCGAAACCTATAGCCACCAGCAAATCTAACAAGAGCAAGACGGCCCAACAACGCGCCGCATTC
GCGTCTGCCCCTGTAATGCTTGCCCGATCACTTTCCTGTGAAGGCGAGGTGGAAGTCGCGCTTCGGCATCTCCGGAATGCCGATCCGCTCCTTGCACCTTTGATCGACCT
TCATCAACGTCCTACCTTCGACAGTTTTCAAACCCCATTCCTTGCCCTTACTAGAAGTATCCTTTATCAGCAGCTGGCTTACAAAGCTGGCACCTCAATCTACACTCGTT
TCATCGCTCTCTGTGGCGGCGAGGCTGGCGTTCTTCCTGAAACTGTACTTGCCTTGAGCCCTCAACAGCTCAGGCAAATTGGAATTTCGGGTCGTAAGTCTAGTTACCTT
CATGATCTTGCGAGGAAGTATCAAAATGGGATTCTTTCAGACCCCGCAATTGTAAATATGGATGATAAATCGCTTTTCACGATGCTTACAATGGTCAATGGAATTGGGTC
TTGGTCTGTTCATATGTTCATGATTTTCTCGCTGCACAGACCAGACGTGCTTCCTATCAATGATCTTAATGTTCGCAAAGGTGTTCAACTTCTCTACAATCTTGAAGAGT
TGCCTCGACCATCGCAGATGGATCACTTATGCGAGAAGTGGAGGCCGTATCGATCGGTTGGGTCGTGGTATATGTGGAGGTTTGCTGAGGCAAAGGGGGCTTCTTCAAGC
GCAGCAGCAATAGCTGCTGGTGCTAGTTTACAACTGCAGCAACAAGAGCACCAGCCGGAGCACCAGCATCAACAGCAGCCGCAGCTTCTTGACCCACTTAATAGCATTCT
CAATCTTGGGAAGAGATTCGGAAGTGTTCTTAAAAAGTTCGGCCTGTGCTTGGGGGCAGTGACTCGGATCGAAAAGAGTAAATCTTTGCAGATAGCCCAATCAGTTCATC
TACTGAATGATAAGTATGCCAATTATGTGGGAGTAGAAGACTCAGCAATGGATATTTGGCTCCTTGAGGTTGATGTCACTTTTAAGGCCGTCAACATGTATCATGTGGCA
AGGAATATTGATATAATGTTCCTCTTGAATGTCTGTGCTTTCAATGTATTTTAA
mRNA sequenceShow/hide mRNA sequence
AGAGAACTGAGGAGCGAGCGGCCAGAGGCAGAGATAGAGAAGAGAAAGAGACTGAAAGAAGAACGTCACCGGGAATTCATTCTCCTCAGTCGCTGGTTATACCATTCACA
ATCCACTTGTCTATTCTTTAACCTAATTTAAAATCCCTAATCTGTTCAATTTCACATTTACTTTTCAATTTCTTTCTGTTAGCCTCAATAATCCACTTCTTGTTCTTTCA
CTTGGGGCATCGAGTTTTGCCCTTCTTCCGATTCTCCACCGCCGCTGATCTTGGTTTTGTACAGTGTTTTCTTTCAACTGACTTGTCTGGGTGAAAGGAGATAGCGCAGA
TTGAATCCTTTACATATGGGAGAGCACACGCAAGTGCAGGTTCAGACTCAGACCCAAAGCCAATCTCAAGCGCAATCGCAGGCTCAGAACACGTTTCATGAATCCTCCAA
CTCCACAACCACTATCGCCCAAGCTACTGTGGCATTAAGCGAGGTGATGAATGCGCCATCGCAAACCTCTTCTCCGCCGTCCAAAATGCCCTTGCGTCCGCGGAAGATCC
GGAAGCTCTCGCCTGATGAATCGGATCCAAATTCCTCTCAGGTTGTTGCCATTCCGGATGGGCCGAAACCTATAGCCACCAGCAAATCTAACAAGAGCAAGACGGCCCAA
CAACGCGCCGCATTCGCGTCTGCCCCTGTAATGCTTGCCCGATCACTTTCCTGTGAAGGCGAGGTGGAAGTCGCGCTTCGGCATCTCCGGAATGCCGATCCGCTCCTTGC
ACCTTTGATCGACCTTCATCAACGTCCTACCTTCGACAGTTTTCAAACCCCATTCCTTGCCCTTACTAGAAGTATCCTTTATCAGCAGCTGGCTTACAAAGCTGGCACCT
CAATCTACACTCGTTTCATCGCTCTCTGTGGCGGCGAGGCTGGCGTTCTTCCTGAAACTGTACTTGCCTTGAGCCCTCAACAGCTCAGGCAAATTGGAATTTCGGGTCGT
AAGTCTAGTTACCTTCATGATCTTGCGAGGAAGTATCAAAATGGGATTCTTTCAGACCCCGCAATTGTAAATATGGATGATAAATCGCTTTTCACGATGCTTACAATGGT
CAATGGAATTGGGTCTTGGTCTGTTCATATGTTCATGATTTTCTCGCTGCACAGACCAGACGTGCTTCCTATCAATGATCTTAATGTTCGCAAAGGTGTTCAACTTCTCT
ACAATCTTGAAGAGTTGCCTCGACCATCGCAGATGGATCACTTATGCGAGAAGTGGAGGCCGTATCGATCGGTTGGGTCGTGGTATATGTGGAGGTTTGCTGAGGCAAAG
GGGGCTTCTTCAAGCGCAGCAGCAATAGCTGCTGGTGCTAGTTTACAACTGCAGCAACAAGAGCACCAGCCGGAGCACCAGCATCAACAGCAGCCGCAGCTTCTTGACCC
ACTTAATAGCATTCTCAATCTTGGGAAGAGATTCGGAAGTGTTCTTAAAAAGTTCGGCCTGTGCTTGGGGGCAGTGACTCGGATCGAAAAGAGTAAATCTTTGCAGATAG
CCCAATCAGTTCATCTACTGAATGATAAGTATGCCAATTATGTGGGAGTAGAAGACTCAGCAATGGATATTTGGCTCCTTGAGGTTGATGTCACTTTTAAGGCCGTCAAC
ATGTATCATGTGGCAAGGAATATTGATATAATGTTCCTCTTGAATGTCTGTGCTTTCAATGTATTTTAA
Protein sequenceShow/hide protein sequence
MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAF
ASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYL
HDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDHLCEKWRPYRSVGSWYMWRFAEAKGASSS
AAAIAAGASLQLQQQEHQPEHQHQQQPQLLDPLNSILNLGKRFGSVLKKFGLCLGAVTRIEKSKSLQIAQSVHLLNDKYANYVGVEDSAMDIWLLEVDVTFKAVNMYHVA
RNIDIMFLLNVCAFNVF