; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg17150 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg17150
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionDNA-3-methyladenine glycosylase 1
Genome locationCarg_Chr01:12696569..12697663
RNA-Seq ExpressionCarg17150
SyntenyCarg17150
Gene Ontology termsGO:0006285 - base-excision repair, AP site formation (biological process)
GO:0006307 - DNA dealkylation involved in DNA repair (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0032993 - protein-DNA complex (cellular component)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
GO:0032131 - alkylated DNA binding (molecular function)
GO:0043916 - DNA-7-methylguanine glycosylase activity (molecular function)
InterPro domainsIPR003265 - HhH-GPD domain
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6608541.1 hypothetical protein SDJN03_01883, partial [Cucurbita argyrosperma subsp. sororia]1.8e-19599.45Show/hide
Query:  MGEQTQVQVRTQTQSQAQNALHQSSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTISDGPKPIVTGKSHKSKTAQQ
        MGEQTQVQV+TQTQSQAQNALH+SSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTISDGPKPIVTGKSHKSKTAQQ
Subjt:  MGEQTQVQVRTQTQSQAQNALHQSSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTISDGPKPIVTGKSHKSKTAQQ

Query:  RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ
        RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ
Subjt:  RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ

Query:  LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEK
        LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEK
Subjt:  LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEK

Query:  WRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEQQHQRQPQLLDPLNSILNLG
        WRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEQQHQRQPQLLDPLNSILNLG
Subjt:  WRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEQQHQRQPQLLDPLNSILNLG

KAG7037864.1 mag1, partial [Cucurbita argyrosperma subsp. argyrosperma]2.9e-196100Show/hide
Query:  MGEQTQVQVRTQTQSQAQNALHQSSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTISDGPKPIVTGKSHKSKTAQQ
        MGEQTQVQVRTQTQSQAQNALHQSSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTISDGPKPIVTGKSHKSKTAQQ
Subjt:  MGEQTQVQVRTQTQSQAQNALHQSSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTISDGPKPIVTGKSHKSKTAQQ

Query:  RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ
        RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ
Subjt:  RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ

Query:  LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEK
        LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEK
Subjt:  LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEK

Query:  WRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEQQHQRQPQLLDPLNSILNLG
        WRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEQQHQRQPQLLDPLNSILNLG
Subjt:  WRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEQQHQRQPQLLDPLNSILNLG

XP_022940509.1 probable DNA-3-methyladenine glycosylase 2 [Cucurbita moschata]1.7e-19398.63Show/hide
Query:  MGEQTQVQVRTQTQSQAQNALHQSSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTISDGPKPIVTGKSHKSKTAQQ
        MGEQTQVQV+TQTQSQAQNALH+SSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTI DGPKPIVTGKSHKSKTAQQ
Subjt:  MGEQTQVQVRTQTQSQAQNALHQSSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTISDGPKPIVTGKSHKSKTAQQ

Query:  RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ
        RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ
Subjt:  RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ

Query:  LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEK
        LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGV+LLYNLEELPRPSQMDQLCEK
Subjt:  LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEK

Query:  WRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEQQHQRQPQLLDPLNSILNLG
        WRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQE QQEQQHQRQPQLLDPLNSILNLG
Subjt:  WRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEQQHQRQPQLLDPLNSILNLG

XP_022981728.1 uncharacterized protein LOC111480795 [Cucurbita maxima]3.6e-19197.79Show/hide
Query:  EQTQVQVRTQTQSQAQNALHQSSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTISDGPKPIVTGKSHKSKTAQQRA
        E+TQVQV+TQTQSQAQNALH+SSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTI DGPKPIVTGKSHKSKTAQQRA
Subjt:  EQTQVQVRTQTQSQAQNALHQSSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTISDGPKPIVTGKSHKSKTAQQRA

Query:  AFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQLR
        AFASAPVLPARSLSCEGEVE+ALRHL NADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQLR
Subjt:  AFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQLR

Query:  QIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWR
        QIGISGRKSSYLHDLARKYQNGILSD AIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKW+
Subjt:  QIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWR

Query:  PYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEQQHQRQPQLLDPLNSILNLG
        PYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEQQHQRQPQLLDPLNSILNLG
Subjt:  PYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEQQHQRQPQLLDPLNSILNLG

XP_023525023.1 probable DNA-3-methyladenine glycosylase 2 [Cucurbita pepo subsp. pepo]1.3e-19398.63Show/hide
Query:  MGEQTQVQVRTQTQSQAQNALHQSSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTISDGPKPIVTGKSHKSKTAQQ
        MGEQTQVQV+TQTQSQAQNALH+SSNSTT IAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTI DGPKPIVTGKSHKSKTAQQ
Subjt:  MGEQTQVQVRTQTQSQAQNALHQSSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTISDGPKPIVTGKSHKSKTAQQ

Query:  RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ
        RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ
Subjt:  RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ

Query:  LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEK
        LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEK
Subjt:  LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEK

Query:  WRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEQQHQRQPQLLDPLNSILNLG
        WRPYRSVGSWYMWRLAEAKGASSSAAAVA GASLQLQQQEHQQEQQHQRQPQLLDPLNSILNLG
Subjt:  WRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEQQHQRQPQLLDPLNSILNLG

TrEMBL top hitse value%identityAlignment
A0A6J1CKY0 uncharacterized protein LOC1110122473.4e-17990.27Show/hide
Query:  MGEQTQVQVRTQT------QSQAQNALHQSSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTISDGPKPIVTGKSHK
        MGEQTQVQV+TQT      QSQAQN +H SSNSTT+IAQAT+ALSEVMNAP+QTSS PSKMPLRPRKIRKLSP ESD NSSQI  ++DGPKPI +GKS+K
Subjt:  MGEQTQVQVRTQT------QSQAQNALHQSSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTISDGPKPIVTGKSHK

Query:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVL
        +KTAQQRAAFASAP+LPARSLSCEGEVEIALRHL NADPLLAPLIDLHQRP FDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGG+AGVLPETVL
Subjt:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVL

Query:  ALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        +LN QQLRQIGISGRKSSYLHDLARKYQNGILSD AIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEQQHQRQPQLLDPLNSILNLG
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAA+AAGASLQLQQQEHQQE QH +QPQLLDP+NSILNLG
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEQQHQRQPQLLDPLNSILNLG

A0A6J1F7I4 probable DNA-3-methyladenine glycosylase 22.6e-17990.54Show/hide
Query:  MGEQTQVQVRTQT------QSQAQNALHQSSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTISDGPKPIVTGKSHK
        MGE TQVQV+TQT      QSQAQN  H+SSNSTTTIAQAT+ALSEVMNAPSQTSS PSKMPLRPRKIRKLSP ESDPNSSQ+V I DGPKPI T KS+K
Subjt:  MGEQTQVQVRTQT------QSQAQNALHQSSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTISDGPKPIVTGKSHK

Query:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVL
        SKTAQQRAAFASAPV+ ARSLSCEGEVE+ALRHL NADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGG+AGVLPETVL
Subjt:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVL

Query:  ALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        AL+ QQLRQIGISGRKSSYLHDLARKYQNGILSD AIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEQQHQRQPQLLDPLNSILNLG
        D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQEHQ + QHQ+QPQLLDPLNSILNLG
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEQQHQRQPQLLDPLNSILNLG

A0A6J1FKF6 probable DNA-3-methyladenine glycosylase 28.4e-19498.63Show/hide
Query:  MGEQTQVQVRTQTQSQAQNALHQSSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTISDGPKPIVTGKSHKSKTAQQ
        MGEQTQVQV+TQTQSQAQNALH+SSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTI DGPKPIVTGKSHKSKTAQQ
Subjt:  MGEQTQVQVRTQTQSQAQNALHQSSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTISDGPKPIVTGKSHKSKTAQQ

Query:  RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ
        RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ
Subjt:  RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ

Query:  LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEK
        LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGV+LLYNLEELPRPSQMDQLCEK
Subjt:  LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEK

Query:  WRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEQQHQRQPQLLDPLNSILNLG
        WRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQE QQEQQHQRQPQLLDPLNSILNLG
Subjt:  WRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEQQHQRQPQLLDPLNSILNLG

A0A6J1IIA5 probable DNA-3-methyladenine glycosylase 22.6e-17990.54Show/hide
Query:  MGEQTQVQVRTQT------QSQAQNALHQSSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTISDGPKPIVTGKSHK
        MGE TQVQV+TQT      QSQAQN  H+SSNSTTTIAQAT++LSEVMNAPSQTSS PSKMPLRPRKIRKLSP ESDPNSSQ+V I DGPKPI T KS+K
Subjt:  MGEQTQVQVRTQT------QSQAQNALHQSSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTISDGPKPIVTGKSHK

Query:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVL
        SKTAQQRAAFASAPV+ ARSLSCEGEVE+ALRHL NADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGG+AGVLPETVL
Subjt:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVL

Query:  ALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
        AL+ QQLRQIGISGRKSSYLHDLARKYQNGILSD AIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Subjt:  ALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEQQHQRQPQLLDPLNSILNLG
        D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQEHQ E QHQ+QPQLLDPLNSILNLG
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEQQHQRQPQLLDPLNSILNLG

A0A6J1J2W7 uncharacterized protein LOC1114807951.7e-19197.79Show/hide
Query:  EQTQVQVRTQTQSQAQNALHQSSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTISDGPKPIVTGKSHKSKTAQQRA
        E+TQVQV+TQTQSQAQNALH+SSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTI DGPKPIVTGKSHKSKTAQQRA
Subjt:  EQTQVQVRTQTQSQAQNALHQSSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTISDGPKPIVTGKSHKSKTAQQRA

Query:  AFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQLR
        AFASAPVLPARSLSCEGEVE+ALRHL NADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQLR
Subjt:  AFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQLR

Query:  QIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWR
        QIGISGRKSSYLHDLARKYQNGILSD AIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKW+
Subjt:  QIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWR

Query:  PYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEQQHQRQPQLLDPLNSILNLG
        PYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEQQHQRQPQLLDPLNSILNLG
Subjt:  PYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEHQQEQQHQRQPQLLDPLNSILNLG

SwissProt top hitse value%identityAlignment
O31544 Putative DNA-3-methyladenine glycosylase YfjP1.7e-1827.14Show/hide
Query:  RSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDA-GV----LPETVLALNTQQLRQIGIS
        R    E  ++  L H       L+ + + H         + +  + + I++QQL      ++  RF+   G    GV     PET+  L+ Q LR +  S
Subjt:  RSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDA-GV----LPETVLALNTQQLRQIGIS

Query:  GRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSV
         RK+ Y  D +R    G LS + + +M D+ +   L  + GIG W+V   ++F L RP++ P+ D+ ++  ++  + L++ P    M  + ++W PY S 
Subjt:  GRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSV

Query:  GSWYMWRLAE
         S Y+WR  E
Subjt:  GSWYMWRLAE

O94468 Alkylbase DNA glycosidase-like protein mag23.8e-1825.37Show/hide
Query:  LSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSF--QTPFLALTRSILYQQLAYKAGTSIYTRFIALCG-GDAGVLPETVLALNTQQLRQIGISGRKS
        +S + + + A +HL + D   + L+      T        P+  + R+I  Q+L+  A  SI  +F   C   D    P+ ++  + + L + G S  KS
Subjt:  LSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSF--QTPFLALTRSILYQQLAYKAGTSIYTRFIALCG-GDAGVLPETVLALNTQQLRQIGISGRKS

Query:  SYLHDLARKYQN-GILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSW
          +H +A    N  I S + I  M ++ L   L+ + G+  W++ M+ IF+L R D++P +D  ++   +  + L   P+  ++++L +  +PYR++ +W
Subjt:  SYLHDLARKYQN-GILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSW

Query:  YMWRL
        Y+W++
Subjt:  YMWRL

P37878 DNA-3-methyladenine glycosylase7.7e-1125.15Show/hide
Query:  FLALTRSILYQQLAYKAGTSIYTRFIALCGGDA-------GVLP--ETVLALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLT
        F AL   +L QQ+      S+  +F+   G           V P  E +  L    L  I ++ +KS Y+  +AR   +G LS   ++ M+ K     L 
Subjt:  FLALTRSILYQQLAYKAGTSIYTRFIALCGGDA-------GVLP--ETVLALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLT

Query:  MVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRL
         + GIG W+ +  ++  L  P   PI+D+ +   +++L N+   P   ++ ++   W+ ++S  ++Y+WR+
Subjt:  MVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRL

Q92383 DNA-3-methyladenine glycosylase 11.4e-2030.72Show/hide
Query:  PFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQLRQIGISGRKSSYLHDLARKYQNGIL-SDTAIVNMDDKSLFTMLTMVNGIGS
        P+  L R++  QQL  KA  +I+ RF ++        PE +  ++ + +R  G S RK   L  +A    +G++ +      + ++ L   LT + GIG 
Subjt:  PFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQLRQIGISGRKSSYLHDLARKYQNGIL-SDTAIVNMDDKSLFTMLTMVNGIGS

Query:  WSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAE
        W+V M +IFSL+R DV+P +DL++R G + L+ L ++P    + +  E   P+R+  +WY+W+ ++
Subjt:  WSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAE

Arabidopsis top hitse value%identityAlignment
AT1G19480.1 DNA glycosylase superfamily protein2.8e-10959.28Show/hide
Query:  MGEQTQVQVRTQTQSQAQN----------------ALHQSSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLS---PAESDPNSSQIVTIS
        MGEQ+  Q  TQ QS  Q+                 L  +  S + ++  TI    +       SS PSK+PLRPRKIRKL+       +   ++ ++ S
Subjt:  MGEQTQVQVRTQTQSQAQN----------------ALHQSSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLS---PAESDPNSSQIVTIS

Query:  DGPKPIVT-GKS-HKSKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFI
            P+ T GKS  K K +  RA   + P + AR L+CEGE+E A+ +L NADPLLA LID+H  PTF+SF+TPFLAL R+ILYQQLA KAG SIYTRF+
Subjt:  DGPKPIVT-GKS-HKSKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFI

Query:  ALCGGDAGVLPETVLALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGV
        +LCGG+  V+PETVL+LN QQLRQIG+SGRK+SYLHDLARKYQNGILSD+AI+NMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL VRKGV
Subjt:  ALCGGDAGVLPETVLALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGV

Query:  QLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGAS---LQLQQQEHQQEQQHQRQPQLLDPLNSILNLG
        QLLY L++LPRPSQM+Q C KWRPYRSVGSWYMWRL EAK  S+S AAVAAG S   L+  QQEHQQ+       QL+DPLN + ++G
Subjt:  QLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGAS---LQLQQQEHQQEQQHQRQPQLLDPLNSILNLG

AT1G19480.2 DNA glycosylase superfamily protein2.8e-10959.28Show/hide
Query:  MGEQTQVQVRTQTQSQAQN----------------ALHQSSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLS---PAESDPNSSQIVTIS
        MGEQ+  Q  TQ QS  Q+                 L  +  S + ++  TI    +       SS PSK+PLRPRKIRKL+       +   ++ ++ S
Subjt:  MGEQTQVQVRTQTQSQAQN----------------ALHQSSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLS---PAESDPNSSQIVTIS

Query:  DGPKPIVT-GKS-HKSKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFI
            P+ T GKS  K K +  RA   + P + AR L+CEGE+E A+ +L NADPLLA LID+H  PTF+SF+TPFLAL R+ILYQQLA KAG SIYTRF+
Subjt:  DGPKPIVT-GKS-HKSKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFI

Query:  ALCGGDAGVLPETVLALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGV
        +LCGG+  V+PETVL+LN QQLRQIG+SGRK+SYLHDLARKYQNGILSD+AI+NMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL VRKGV
Subjt:  ALCGGDAGVLPETVLALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGV

Query:  QLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGAS---LQLQQQEHQQEQQHQRQPQLLDPLNSILNLG
        QLLY L++LPRPSQM+Q C KWRPYRSVGSWYMWRL EAK  S+S AAVAAG S   L+  QQEHQQ+       QL+DPLN + ++G
Subjt:  QLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGAS---LQLQQQEHQQEQQHQRQPQLLDPLNSILNLG

AT1G75230.1 DNA glycosylase superfamily protein7.9e-11257.43Show/hide
Query:  MGEQTQVQVRTQT--QSQAQNALHQSSN-----------STTTIAQATIALSEVMNAPSQT-----SSLPSKMPLRPRKIRKLSPAES-------DPNSS
        MGE +  Q  + T   +Q ++  H++ N           +++     +I  S  + AP  T     SS P+K+PLRPRKIRKLSP +        + N S
Subjt:  MGEQTQVQVRTQT--QSQAQNALHQSSN-----------STTTIAQATIALSEVMNAPSQT-----SSLPSKMPLRPRKIRKLSPAES-------DPNSS

Query:  QIVTISDGPKPIVTGKSHKSKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIY
        Q+ T         T  + KSK +Q R    + P + ARSL+CEGE+E AL HL + DPLLA LID+H  PTF++FQTPFLAL RSILYQQLA KAG SIY
Subjt:  QIVTISDGPKPIVTGKSHKSKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIY

Query:  TRFIALCGGDAGVLPETVLALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNV
        TRF+ALCGG+ GV+PE VL L  QQLRQIG+SGRK+SYLHDLARKYQNGILSD+ IVNMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL V
Subjt:  TRFIALCGGDAGVLPETVLALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNV

Query:  RKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASL--------QLQQQEHQQEQQHQRQPQLLDPLNSILNLG
        RKGVQ+L  +E+LPRPS+M+QLCEKWRPYRSV SWY+WRL E+K    +AAA  AGA+L        Q Q+QE Q +Q  Q+QPQL+DPLN++ ++G
Subjt:  RKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASL--------QLQQQEHQQEQQHQRQPQLLDPLNSILNLG

AT1G75230.2 DNA glycosylase superfamily protein7.9e-11257.43Show/hide
Query:  MGEQTQVQVRTQT--QSQAQNALHQSSN-----------STTTIAQATIALSEVMNAPSQT-----SSLPSKMPLRPRKIRKLSPAES-------DPNSS
        MGE +  Q  + T   +Q ++  H++ N           +++     +I  S  + AP  T     SS P+K+PLRPRKIRKLSP +        + N S
Subjt:  MGEQTQVQVRTQT--QSQAQNALHQSSN-----------STTTIAQATIALSEVMNAPSQT-----SSLPSKMPLRPRKIRKLSPAES-------DPNSS

Query:  QIVTISDGPKPIVTGKSHKSKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIY
        Q+ T         T  + KSK +Q R    + P + ARSL+CEGE+E AL HL + DPLLA LID+H  PTF++FQTPFLAL RSILYQQLA KAG SIY
Subjt:  QIVTISDGPKPIVTGKSHKSKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIY

Query:  TRFIALCGGDAGVLPETVLALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNV
        TRF+ALCGG+ GV+PE VL L  QQLRQIG+SGRK+SYLHDLARKYQNGILSD+ IVNMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL V
Subjt:  TRFIALCGGDAGVLPETVLALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNV

Query:  RKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASL--------QLQQQEHQQEQQHQRQPQLLDPLNSILNLG
        RKGVQ+L  +E+LPRPS+M+QLCEKWRPYRSV SWY+WRL E+K    +AAA  AGA+L        Q Q+QE Q +Q  Q+QPQL+DPLN++ ++G
Subjt:  RKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASL--------QLQQQEHQQEQQHQRQPQLLDPLNSILNLG

AT3G50880.1 DNA glycosylase superfamily protein1.5e-7353.2Show/hide
Query:  TTIAQATIALSEVMNA----PSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTISDGPKPIVTGKSHKSKTAQQRAAFASAPVLPARSLSCEGEVEIA
        T + Q+++    +++A     S+ S   S++  RPRKIRK+S   SDP+          P+ I+T               AS P      LS +  V+IA
Subjt:  TTIAQATIALSEVMNA----PSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTISDGPKPIVTGKSHKSKTAQQRAAFASAPVLPARSLSCEGEVEIA

Query:  LRHLGNADPLLAPLIDLH-QRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALC-GGDAGVLPETVLALNTQQLRQIGISGRKSSYLHDLARKYQ
        LRHL ++D LL  LI  H   P FDS  TPFL+L RSILYQQLA KA   IY RFI+L  GG+AGV+PE+V++L+   LR+IG+SGRK+SYLHDLA KY 
Subjt:  LRHLGNADPLLAPLIDLH-QRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALC-GGDAGVLPETVLALNTQQLRQIGISGRKSSYLHDLARKYQ

Query:  NGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAK
        NG+LSD  I+ M D+ L   LT+V GIG W+VHMFMIFSLHRPDVLP+ DL VRKGV+ LY L+ LP P QM+QLCEKWRPYRSVGSWYMWRL E++
Subjt:  NGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGAGCAAACGCAAGTGCAGGTTCGGACTCAAACGCAATCGCAGGCTCAGAACGCGCTTCATCAATCCTCCAACTCTACAACCACTATTGCTCAAGCCACTATAGC
ACTAAGCGAGGTGATGAATGCGCCATCGCAAACCTCTTCTCTGCCATCCAAAATGCCTTTGCGTCCACGGAAGATTCGAAAGCTCTCGCCCGCTGAATCCGATCCGAATT
CCTCTCAGATTGTCACCATTTCGGATGGGCCGAAACCTATCGTCACCGGGAAATCTCACAAGAGCAAGACGGCCCAACAACGCGCCGCCTTCGCGTCTGCTCCAGTGCTG
CCAGCCCGATCACTTTCCTGTGAAGGCGAGGTGGAAATCGCGCTTCGGCATCTTGGGAATGCGGATCCGCTCCTTGCACCTTTGATCGACCTCCATCAACGTCCTACCTT
CGACAGTTTTCAAACCCCATTCCTTGCCCTAACTAGAAGTATCCTATATCAGCAGCTGGCTTACAAAGCTGGCACATCTATTTACACCCGTTTTATCGCCCTTTGTGGCG
GCGACGCTGGTGTTCTTCCCGAAACCGTTCTTGCCTTGAACACTCAACAGCTCAGGCAAATTGGAATTTCGGGTCGTAAATCTAGTTATCTTCATGACCTTGCTAGGAAA
TACCAAAATGGGATTCTTTCAGACACGGCAATTGTAAATATGGATGATAAATCGCTTTTCACGATGCTCACAATGGTGAATGGAATTGGGTCTTGGTCTGTTCATATGTT
CATGATTTTCTCGCTGCACAGACCAGATGTGCTTCCTATCAACGATCTTAATGTTCGCAAAGGTGTTCAGCTTCTCTACAATCTTGAAGAGTTGCCTCGACCATCACAAA
TGGATCAGTTATGCGAGAAGTGGAGGCCGTATCGATCGGTTGGGTCCTGGTATATGTGGAGGCTTGCTGAGGCAAAGGGAGCTTCTTCGAGCGCAGCAGCAGTGGCTGCT
GGTGCTAGCTTACAGCTGCAGCAACAAGAGCACCAGCAGGAGCAACAGCATCAACGGCAGCCGCAGCTTCTTGATCCACTCAATAGCATTCTCAATCTTGGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGAGCAAACGCAAGTGCAGGTTCGGACTCAAACGCAATCGCAGGCTCAGAACGCGCTTCATCAATCCTCCAACTCTACAACCACTATTGCTCAAGCCACTATAGC
ACTAAGCGAGGTGATGAATGCGCCATCGCAAACCTCTTCTCTGCCATCCAAAATGCCTTTGCGTCCACGGAAGATTCGAAAGCTCTCGCCCGCTGAATCCGATCCGAATT
CCTCTCAGATTGTCACCATTTCGGATGGGCCGAAACCTATCGTCACCGGGAAATCTCACAAGAGCAAGACGGCCCAACAACGCGCCGCCTTCGCGTCTGCTCCAGTGCTG
CCAGCCCGATCACTTTCCTGTGAAGGCGAGGTGGAAATCGCGCTTCGGCATCTTGGGAATGCGGATCCGCTCCTTGCACCTTTGATCGACCTCCATCAACGTCCTACCTT
CGACAGTTTTCAAACCCCATTCCTTGCCCTAACTAGAAGTATCCTATATCAGCAGCTGGCTTACAAAGCTGGCACATCTATTTACACCCGTTTTATCGCCCTTTGTGGCG
GCGACGCTGGTGTTCTTCCCGAAACCGTTCTTGCCTTGAACACTCAACAGCTCAGGCAAATTGGAATTTCGGGTCGTAAATCTAGTTATCTTCATGACCTTGCTAGGAAA
TACCAAAATGGGATTCTTTCAGACACGGCAATTGTAAATATGGATGATAAATCGCTTTTCACGATGCTCACAATGGTGAATGGAATTGGGTCTTGGTCTGTTCATATGTT
CATGATTTTCTCGCTGCACAGACCAGATGTGCTTCCTATCAACGATCTTAATGTTCGCAAAGGTGTTCAGCTTCTCTACAATCTTGAAGAGTTGCCTCGACCATCACAAA
TGGATCAGTTATGCGAGAAGTGGAGGCCGTATCGATCGGTTGGGTCCTGGTATATGTGGAGGCTTGCTGAGGCAAAGGGAGCTTCTTCGAGCGCAGCAGCAGTGGCTGCT
GGTGCTAGCTTACAGCTGCAGCAACAAGAGCACCAGCAGGAGCAACAGCATCAACGGCAGCCGCAGCTTCTTGATCCACTCAATAGCATTCTCAATCTTGGGTAA
Protein sequenceShow/hide protein sequence
MGEQTQVQVRTQTQSQAQNALHQSSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTISDGPKPIVTGKSHKSKTAQQRAAFASAPVL
PARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQLRQIGISGRKSSYLHDLARK
YQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAA
GASLQLQQQEHQQEQQHQRQPQLLDPLNSILNLG