CuGenDBv2

CmoCh01G019760 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh01G019760
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: DNA-3-methyladenine glycosylase 1
Genome location: Cmo_Chr01:14002842..14006051
RNA-Seq Expression: CmoCh01G019760
Synteny: CmoCh01G019760
Gene Ontology terms:
GO:0006285 - base-excision repair, AP site formation (biological process)
GO:0006307 - DNA dealkylation involved in DNA repair (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0032993 - protein-DNA complex (cellular component)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
GO:0032131 - alkylated DNA binding (molecular function)
GO:0043916 - DNA-7-methylguanine glycosylase activity (molecular function)
InterPro domains:
IPR003265 - HhH-GPD domain
IPR011257 - DNA glycosylase
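
The genome location in this record follows a `sequence:start..end` pattern. The sketch below splits such a string into its parts and computes the length of the span. It assumes the coordinates are 1-based and inclusive, which the page does not state; the function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'seqid:start..end' into (seqid, start, end)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("Cmo_Chr01:14002842..14006051")
print(seqid, span_length(start, end))  # prints: Cmo_Chr01 3210
```

Under that assumption the gene spans 3210 bp on Cmo_Chr01. With a 0-based, half-open convention the result would be one base shorter.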


Homology
GenBank top hits (e value, % identity, alignment)
KAG6608541.1 hypothetical protein SDJN03_01883, partial [Cucurbita argyrosperma subsp. sororia] | e value: 6.2e-199 | % identity: 99.19
Query:  MGEQTQVQVQTQTQSQAQNALHESSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTIPDGPKPIVTGKSHKSKTAQQ
        MGEQTQVQVQTQTQSQAQNALHESSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTI DGPKPIVTGKSHKSKTAQQ
Subjt:  MGEQTQVQVQTQTQSQAQNALHESSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTIPDGPKPIVTGKSHKSKTAQQ

Query:  RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ
        RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ
Subjt:  RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ

Query:  LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQMDQLCEK
        LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGV+LLYNLEELPRPSQMDQLCEK
Subjt:  LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQMDQLCEK

Query:  WRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEQQQEQQHQRQPQLLDPLNSILNLGACAWGQ
        WRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQE QQEQQHQRQPQLLDPLNSILNLGACAWGQ
Subjt:  WRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEQQQEQQHQRQPQLLDPLNSILNLGACAWGQ

KAG7037864.1 mag1, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 2.3e-193 | % identity: 98.63
Query:  MGEQTQVQVQTQTQSQAQNALHESSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTIPDGPKPIVTGKSHKSKTAQQ
        MGEQTQVQV+TQTQSQAQNALH+SSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTI DGPKPIVTGKSHKSKTAQQ
Subjt:  MGEQTQVQVQTQTQSQAQNALHESSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTIPDGPKPIVTGKSHKSKTAQQ

Query:  RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ
        RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ
Subjt:  RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ

Query:  LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQMDQLCEK
        LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGV+LLYNLEELPRPSQMDQLCEK
Subjt:  LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQMDQLCEK

Query:  WRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEQQQEQQHQRQPQLLDPLNSILNLG
        WRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQE QQEQQHQRQPQLLDPLNSILNLG
Subjt:  WRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEQQQEQQHQRQPQLLDPLNSILNLG

XP_022940509.1 probable DNA-3-methyladenine glycosylase 2 [Cucurbita moschata] | e value: 6.7e-201 | % identity: 100
Query:  MGEQTQVQVQTQTQSQAQNALHESSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTIPDGPKPIVTGKSHKSKTAQQ
        MGEQTQVQVQTQTQSQAQNALHESSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTIPDGPKPIVTGKSHKSKTAQQ
Subjt:  MGEQTQVQVQTQTQSQAQNALHESSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTIPDGPKPIVTGKSHKSKTAQQ

Query:  RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ
        RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ
Subjt:  RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ

Query:  LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQMDQLCEK
        LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQMDQLCEK
Subjt:  LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQMDQLCEK

Query:  WRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEQQQEQQHQRQPQLLDPLNSILNLGACAWGQ
        WRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEQQQEQQHQRQPQLLDPLNSILNLGACAWGQ
Subjt:  WRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEQQQEQQHQRQPQLLDPLNSILNLGACAWGQ

XP_022981728.1 uncharacterized protein LOC111480795 [Cucurbita maxima] | e value: 3.8e-196 | % identity: 98.1
Query:  EQTQVQVQTQTQSQAQNALHESSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTIPDGPKPIVTGKSHKSKTAQQRA
        E+TQVQVQTQTQSQAQNALHESSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTIPDGPKPIVTGKSHKSKTAQQRA
Subjt:  EQTQVQVQTQTQSQAQNALHESSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTIPDGPKPIVTGKSHKSKTAQQRA

Query:  AFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQLR
        AFASAPVLPARSLSCEGEVE+ALRHL NADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQLR
Subjt:  AFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQLR

Query:  QIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQMDQLCEKWR
        QIGISGRKSSYLHDLARKYQNGILSD AIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGV+LLYNLEELPRPSQMDQLCEKW+
Subjt:  QIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQMDQLCEKWR

Query:  PYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEQQQEQQHQRQPQLLDPLNSILNLGACAWGQ
        PYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQE QQEQQHQRQPQLLDPLNSILNLGACAWGQ
Subjt:  PYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEQQQEQQHQRQPQLLDPLNSILNLGACAWGQ

XP_023525023.1 probable DNA-3-methyladenine glycosylase 2 [Cucurbita pepo subsp. pepo] | e value: 1.4e-198 | % identity: 98.92
Query:  MGEQTQVQVQTQTQSQAQNALHESSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTIPDGPKPIVTGKSHKSKTAQQ
        MGEQTQVQVQTQTQSQAQNALHESSNSTT IAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTIPDGPKPIVTGKSHKSKTAQQ
Subjt:  MGEQTQVQVQTQTQSQAQNALHESSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTIPDGPKPIVTGKSHKSKTAQQ

Query:  RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ
        RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ
Subjt:  RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ

Query:  LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQMDQLCEK
        LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGV+LLYNLEELPRPSQMDQLCEK
Subjt:  LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQMDQLCEK

Query:  WRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEQQQEQQHQRQPQLLDPLNSILNLGACAWGQ
        WRPYRSVGSWYMWRLAEAKGASSSAAAVA GASLQLQQQE QQEQQHQRQPQLLDPLNSILNLGACAWGQ
Subjt:  WRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEQQQEQQHQRQPQLLDPLNSILNLGACAWGQ

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CKY0 uncharacterized protein LOC111012247 | e value: 6.8e-183 | % identity: 90.16
Query:  MGEQTQVQVQTQT------QSQAQNALHESSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTIPDGPKPIVTGKSHK
        MGEQTQVQVQTQT      QSQAQN +H+SSNSTT+IAQAT+ALSEVMNAP+QTSS PSKMPLRPRKIRKLSP ESD NSSQI  + DGPKPI +GKS+K
Subjt:  MGEQTQVQVQTQT------QSQAQNALHESSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTIPDGPKPIVTGKSHK

Query:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVL
        +KTAQQRAAFASAP+LPARSLSCEGEVEIALRHL NADPLLAPLIDLHQRP FDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGG+AGVLPETVL
Subjt:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVL

Query:  ALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQM
        +LN QQLRQIGISGRKSSYLHDLARKYQNGILSD AIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGV+LLYNLEELPRPSQM
Subjt:  ALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEQQQEQQHQRQPQLLDPLNSILNLGACAWGQ
        DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAA+AAGASLQLQQQE QQE QH +QPQLLDP+NSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEQQQEQQHQRQPQLLDPLNSILNLGACAWGQ

A0A6J1F7I4 probable DNA-3-methyladenine glycosylase 2 | e value: 2.7e-184 | % identity: 90.96
Query:  MGEQTQVQVQTQT------QSQAQNALHESSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTIPDGPKPIVTGKSHK
        MGE TQVQVQTQT      QSQAQN  HESSNSTTTIAQAT+ALSEVMNAPSQTSS PSKMPLRPRKIRKLSP ESDPNSSQ+V IPDGPKPI T KS+K
Subjt:  MGEQTQVQVQTQT------QSQAQNALHESSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTIPDGPKPIVTGKSHK

Query:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVL
        SKTAQQRAAFASAPV+ ARSLSCEGEVE+ALRHL NADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGG+AGVLPETVL
Subjt:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVL

Query:  ALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQM
        AL+ QQLRQIGISGRKSSYLHDLARKYQNGILSD AIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGV+LLYNLEELPRPSQM
Subjt:  ALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEQQQEQQHQRQPQLLDPLNSILNLGACAWGQ
        D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQE Q + QHQ+QPQLLDPLNSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEQQQEQQHQRQPQLLDPLNSILNLGACAWGQ

A0A6J1FKF6 probable DNA-3-methyladenine glycosylase 2 | e value: 3.2e-201 | % identity: 100
Query:  MGEQTQVQVQTQTQSQAQNALHESSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTIPDGPKPIVTGKSHKSKTAQQ
        MGEQTQVQVQTQTQSQAQNALHESSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTIPDGPKPIVTGKSHKSKTAQQ
Subjt:  MGEQTQVQVQTQTQSQAQNALHESSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTIPDGPKPIVTGKSHKSKTAQQ

Query:  RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ
        RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ
Subjt:  RAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQ

Query:  LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQMDQLCEK
        LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQMDQLCEK
Subjt:  LRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQMDQLCEK

Query:  WRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEQQQEQQHQRQPQLLDPLNSILNLGACAWGQ
        WRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEQQQEQQHQRQPQLLDPLNSILNLGACAWGQ
Subjt:  WRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEQQQEQQHQRQPQLLDPLNSILNLGACAWGQ

A0A6J1IIA5 probable DNA-3-methyladenine glycosylase 2 | e value: 2.7e-184 | % identity: 90.96
Query:  MGEQTQVQVQTQT------QSQAQNALHESSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTIPDGPKPIVTGKSHK
        MGE TQVQVQTQT      QSQAQN  HESSNSTTTIAQAT++LSEVMNAPSQTSS PSKMPLRPRKIRKLSP ESDPNSSQ+V IPDGPKPI T KS+K
Subjt:  MGEQTQVQVQTQT------QSQAQNALHESSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTIPDGPKPIVTGKSHK

Query:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVL
        SKTAQQRAAFASAPV+ ARSLSCEGEVE+ALRHL NADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGG+AGVLPETVL
Subjt:  SKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVL

Query:  ALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQM
        AL+ QQLRQIGISGRKSSYLHDLARKYQNGILSD AIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGV+LLYNLEELPRPSQM
Subjt:  ALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQM

Query:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEQQQEQQHQRQPQLLDPLNSILNLGACAWGQ
        D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQE Q E QHQ+QPQLLDPLNSILNLGACAWGQ
Subjt:  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEQQQEQQHQRQPQLLDPLNSILNLGACAWGQ

A0A6J1J2W7 uncharacterized protein LOC111480795 | e value: 1.8e-196 | % identity: 98.1
Query:  EQTQVQVQTQTQSQAQNALHESSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTIPDGPKPIVTGKSHKSKTAQQRA
        E+TQVQVQTQTQSQAQNALHESSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTIPDGPKPIVTGKSHKSKTAQQRA
Subjt:  EQTQVQVQTQTQSQAQNALHESSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTIPDGPKPIVTGKSHKSKTAQQRA

Query:  AFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQLR
        AFASAPVLPARSLSCEGEVE+ALRHL NADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQLR
Subjt:  AFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQLR

Query:  QIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQMDQLCEKWR
        QIGISGRKSSYLHDLARKYQNGILSD AIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGV+LLYNLEELPRPSQMDQLCEKW+
Subjt:  QIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQMDQLCEKWR

Query:  PYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEQQQEQQHQRQPQLLDPLNSILNLGACAWGQ
        PYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQE QQEQQHQRQPQLLDPLNSILNLGACAWGQ
Subjt:  PYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEQQQEQQHQRQPQLLDPLNSILNLGACAWGQ

SwissProt top hits (e value, % identity, alignment)
O31544 Putative DNA-3-methyladenine glycosylase YfjP | e value: 1.0e-18 | % identity: 27.14
Query:  RSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDA-GV----LPETVLALNTQQLRQIGIS
        R    E  ++  L H       L+ + + H         + +  + + I++QQL      ++  RF+   G    GV     PET+  L+ Q LR +  S
Subjt:  RSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDA-GV----LPETVLALNTQQLRQIGIS

Query:  GRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQMDQLCEKWRPYRSV
         RK+ Y  D +R    G LS + + +M D+ +   L  + GIG W+V   ++F L RP++ P+ D+ ++  ++  + L++ P    M  + ++W PY S 
Subjt:  GRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQMDQLCEKWRPYRSV

Query:  GSWYMWRLAE
         S Y+WR  E
Subjt:  GSWYMWRLAE
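
In these alignment blocks the unlabelled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts identities and positives for one segment. It assumes the caller has already removed the fixed-width label column (the 8 characters occupied by `Query:  `) from both lines by slicing, not by stripping whitespace, because the midline may legitimately begin with a space. The function name `midline_stats` is hypothetical.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one alignment segment.

    `query` and `midline` must be the same length, with the label column
    already sliced off both lines.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # An identity is a column where the midline repeats the query residue.
    identities = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    # Positives are identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Final segment of the O31544 alignment, label column removed.
ident, pos, length = midline_stats("GSWYMWRLAE", " S Y+WR  E")
print(ident, pos, length)
```

Summing identities and aligned lengths over all segments of a hit and dividing the two gives a percent identity that can be compared with the value listed next to the hit.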

O94468 Alkylbase DNA glycosidase-like protein mag2 | e value: 3.0e-18 | % identity: 25.37
Query:  LSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSF--QTPFLALTRSILYQQLAYKAGTSIYTRFIALCG-GDAGVLPETVLALNTQQLRQIGISGRKS
        +S + + + A +HL + D   + L+      T        P+  + R+I  Q+L+  A  SI  +F   C   D    P+ ++  + + L + G S  KS
Subjt:  LSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSF--QTPFLALTRSILYQQLAYKAGTSIYTRFIALCG-GDAGVLPETVLALNTQQLRQIGISGRKS

Query:  SYLHDLARKYQN-GILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQMDQLCEKWRPYRSVGSW
          +H +A    N  I S + I  M ++ L   L+ + G+  W++ M+ IF+L R D++P +D  ++   +  + L   P+  ++++L +  +PYR++ +W
Subjt:  SYLHDLARKYQN-GILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQMDQLCEKWRPYRSVGSW

Query:  YMWRL
        Y+W++
Subjt:  YMWRL

P37878 DNA-3-methyladenine glycosylase | e value: 4.6e-11 | % identity: 25.15
Query:  FLALTRSILYQQLAYKAGTSIYTRFIALCGGDA-------GVLP--ETVLALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLT
        F AL   +L QQ+      S+  +F+   G           V P  E +  L    L  I ++ +KS Y+  +AR   +G LS   ++ M+ K     L 
Subjt:  FLALTRSILYQQLAYKAGTSIYTRFIALCGGDA-------GVLP--ETVLALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLT

Query:  MVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRL
         + GIG W+ +  ++  L  P   PI+D+ +   +++L N+   P   ++ ++   W+ ++S  ++Y+WR+
Subjt:  MVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRL

Q92383 DNA-3-methyladenine glycosylase 1 | e value: 4.9e-21 | % identity: 31.33
Query:  PFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQLRQIGISGRKSSYLHDLARKYQNGIL-SDTAIVNMDDKSLFTMLTMVNGIGS
        P+  L R++  QQL  KA  +I+ RF ++        PE +  ++ + +R  G S RK   L  +A    +G++ +      + ++ L   LT + GIG 
Subjt:  PFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQLRQIGISGRKSSYLHDLARKYQNGIL-SDTAIVNMDDKSLFTMLTMVNGIGS

Query:  WSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAE
        W+V M +IFSL+R DV+P +DL++R G R L+ L ++P    + +  E   P+R+  +WY+W+ ++
Subjt:  WSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAE

Arabidopsis top hits (e value, % identity, alignment)
AT1G19480.1 DNA glycosylase superfamily protein | e value: 5.8e-110 | % identity: 58.82
Query:  MGEQTQVQVQTQTQSQAQNALHESSN----------------STTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLS---PAESDPNSSQIVTIP
        MGEQ+  Q  TQ QS  Q+   ++ N                S + ++  TI    +       SS PSK+PLRPRKIRKL+       +   ++ ++  
Subjt:  MGEQTQVQVQTQTQSQAQNALHESSN----------------STTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLS---PAESDPNSSQIVTIP

Query:  DGPKPIVT-GKS-HKSKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFI
            P+ T GKS  K K +  RA   + P + AR L+CEGE+E A+ +L NADPLLA LID+H  PTF+SF+TPFLAL R+ILYQQLA KAG SIYTRF+
Subjt:  DGPKPIVT-GKS-HKSKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFI

Query:  ALCGGDAGVLPETVLALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGV
        +LCGG+  V+PETVL+LN QQLRQIG+SGRK+SYLHDLARKYQNGILSD+AI+NMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL VRKGV
Subjt:  ALCGGDAGVLPETVLALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGV

Query:  RLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEQQQEQQHQRQPQLLDPLNSILNLGACAWGQ
        +LLY L++LPRPSQM+Q C KWRPYRSVGSWYMWRL EAK  S+S AAVAAG SL   +  QQ+ QQ     QL+DPLN + ++G  AWGQ
Subjt:  RLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEQQQEQQHQRQPQLLDPLNSILNLGACAWGQ

AT1G19480.2 DNA glycosylase superfamily protein | e value: 2.4e-108 | % identity: 58.7
Query:  MGEQTQVQVQTQTQSQAQNALHESSN----------------STTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLS---PAESDPNSSQIVTIP
        MGEQ+  Q  TQ QS  Q+   ++ N                S + ++  TI    +       SS PSK+PLRPRKIRKL+       +   ++ ++  
Subjt:  MGEQTQVQVQTQTQSQAQNALHESSN----------------STTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLS---PAESDPNSSQIVTIP

Query:  DGPKPIVT-GKS-HKSKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFI
            P+ T GKS  K K +  RA   + P + AR L+CEGE+E A+ +L NADPLLA LID+H  PTF+SF+TPFLAL R+ILYQQLA KAG SIYTRF+
Subjt:  DGPKPIVT-GKS-HKSKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFI

Query:  ALCGGDAGVLPETVLALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGV
        +LCGG+  V+PETVL+LN QQLRQIG+SGRK+SYLHDLARKYQNGILSD+AI+NMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL VRKGV
Subjt:  ALCGGDAGVLPETVLALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGV

Query:  RLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEQQQEQQHQRQPQLLDPLNSILNLG
        +LLY L++LPRPSQM+Q C KWRPYRSVGSWYMWRL EAK  S+S AAVAAG SL   +  QQ+ QQ     QL+DPLN + ++G
Subjt:  RLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQEQQQEQQHQRQPQLLDPLNSILNLG

AT1G75230.1 DNA glycosylase superfamily protein | e value: 5.1e-114 | % identity: 57.82
Query:  MGEQTQVQVQTQT--QSQAQNALHESSN-----------STTTIAQATIALSEVMNAPSQT-----SSLPSKMPLRPRKIRKLSPAES-------DPNSS
        MGE +  Q  + T   +Q ++  HE+ N           +++     +I  S  + AP  T     SS P+K+PLRPRKIRKLSP +        + N S
Subjt:  MGEQTQVQVQTQT--QSQAQNALHESSN-----------STTTIAQATIALSEVMNAPSQT-----SSLPSKMPLRPRKIRKLSPAES-------DPNSS

Query:  QIVTIPDGPKPIVTGKSHKSKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIY
        Q+ T         T  + KSK +Q R    + P + ARSL+CEGE+E AL HL + DPLLA LID+H  PTF++FQTPFLAL RSILYQQLA KAG SIY
Subjt:  QIVTIPDGPKPIVTGKSHKSKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIY

Query:  TRFIALCGGDAGVLPETVLALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNV
        TRF+ALCGG+ GV+PE VL L  QQLRQIG+SGRK+SYLHDLARKYQNGILSD+ IVNMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL V
Subjt:  TRFIALCGGDAGVLPETVLALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNV

Query:  RKGVRLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASL--------QLQQQEQQQEQQHQRQPQLLDPLNSILNLGACA
        RKGV++L  +E+LPRPS+M+QLCEKWRPYRSV SWY+WRL E+K    +AAA  AGA+L        Q Q+QEQQ +Q  Q+QPQL+DPLN++ ++G  A
Subjt:  RKGVRLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASL--------QLQQQEQQQEQQHQRQPQLLDPLNSILNLGACA

Query:  WGQ
        WGQ
Subjt:  WGQ

AT1G75230.2 DNA glycosylase superfamily protein | e value: 2.8e-112 | % identity: 57.68
Query:  MGEQTQVQVQTQT--QSQAQNALHESSN-----------STTTIAQATIALSEVMNAPSQT-----SSLPSKMPLRPRKIRKLSPAES-------DPNSS
        MGE +  Q  + T   +Q ++  HE+ N           +++     +I  S  + AP  T     SS P+K+PLRPRKIRKLSP +        + N S
Subjt:  MGEQTQVQVQTQT--QSQAQNALHESSN-----------STTTIAQATIALSEVMNAPSQT-----SSLPSKMPLRPRKIRKLSPAES-------DPNSS

Query:  QIVTIPDGPKPIVTGKSHKSKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIY
        Q+ T         T  + KSK +Q R    + P + ARSL+CEGE+E AL HL + DPLLA LID+H  PTF++FQTPFLAL RSILYQQLA KAG SIY
Subjt:  QIVTIPDGPKPIVTGKSHKSKTAQQRAAFASAPVLPARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIY

Query:  TRFIALCGGDAGVLPETVLALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNV
        TRF+ALCGG+ GV+PE VL L  QQLRQIG+SGRK+SYLHDLARKYQNGILSD+ IVNMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL V
Subjt:  TRFIALCGGDAGVLPETVLALNTQQLRQIGISGRKSSYLHDLARKYQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNV

Query:  RKGVRLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASL--------QLQQQEQQQEQQHQRQPQLLDPLNSILNLG
        RKGV++L  +E+LPRPS+M+QLCEKWRPYRSV SWY+WRL E+K    +AAA  AGA+L        Q Q+QEQQ +Q  Q+QPQL+DPLN++ ++G
Subjt:  RKGVRLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASL--------QLQQQEQQQEQQHQRQPQLLDPLNSILNLG

AT3G50880.1 DNA glycosylase superfamily protein | e value: 8.7e-74 | % identity: 53.2
Query:  TTIAQATIALSEVMNA----PSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTIPDGPKPIVTGKSHKSKTAQQRAAFASAPVLPARSLSCEGEVEIA
        T + Q+++    +++A     S+ S   S++  RPRKIRK+S   SDP+          P+ I+T               AS P      LS +  V+IA
Subjt:  TTIAQATIALSEVMNA----PSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTIPDGPKPIVTGKSHKSKTAQQRAAFASAPVLPARSLSCEGEVEIA

Query:  LRHLGNADPLLAPLIDLH-QRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALC-GGDAGVLPETVLALNTQQLRQIGISGRKSSYLHDLARKYQ
        LRHL ++D LL  LI  H   P FDS  TPFL+L RSILYQQLA KA   IY RFI+L  GG+AGV+PE+V++L+   LR+IG+SGRK+SYLHDLA KY 
Subjt:  LRHLGNADPLLAPLIDLH-QRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALC-GGDAGVLPETVLALNTQQLRQIGISGRKSSYLHDLARKYQ

Query:  NGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAK
        NG+LSD  I+ M D+ L   LT+V GIG W+VHMFMIFSLHRPDVLP+ DL VRKGV+ LY L+ LP P QM+QLCEKWRPYRSVGSWYMWRL E++
Subjt:  NGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAK


Sequences
CDS sequence
ATGGGAGAGCAAACGCAAGTGCAGGTTCAGACTCAAACGCAATCGCAGGCTCAGAACGCGCTTCATGAATCCTCCAACTCTACAACCACTATTGCTCAAGCCACTATAGC
ACTAAGCGAGGTGATGAATGCGCCATCGCAAACCTCTTCTCTGCCATCCAAAATGCCTTTGCGTCCACGGAAAATTCGAAAGCTCTCGCCCGCTGAATCGGATCCGAATT
CCTCTCAGATTGTCACCATTCCGGATGGGCCGAAACCTATCGTCACCGGGAAATCTCACAAGAGCAAGACGGCCCAACAACGCGCCGCCTTCGCGTCTGCTCCAGTGCTG
CCAGCCCGATCACTTTCCTGTGAAGGCGAGGTGGAAATCGCGCTTCGGCATCTTGGGAATGCGGATCCGCTCCTTGCACCTTTGATCGACCTCCATCAACGTCCTACCTT
CGACAGTTTTCAAACCCCATTCCTTGCCCTAACTAGAAGTATCCTATATCAGCAGCTGGCTTACAAAGCTGGCACATCTATTTACACCCGTTTTATCGCCCTTTGTGGCG
GCGACGCTGGTGTTCTTCCCGAAACCGTTCTTGCCTTGAACACTCAACAGCTCAGGCAAATTGGAATTTCGGGTCGTAAATCTAGTTATCTTCATGACCTTGCTAGGAAA
TACCAAAATGGGATTCTTTCAGACACGGCAATTGTAAATATGGATGATAAATCGCTTTTCACGATGCTCACAATGGTGAATGGAATTGGGTCTTGGTCTGTTCATATGTT
CATGATTTTCTCGCTGCACAGACCAGATGTGCTTCCTATCAACGATCTTAATGTTCGCAAAGGTGTTCGGCTTCTCTACAATCTTGAAGAGTTGCCTCGACCATCACAAA
TGGATCAATTATGCGAGAAGTGGAGGCCGTATCGATCGGTTGGGTCCTGGTATATGTGGAGGCTTGCTGAGGCAAAGGGAGCTTCTTCGAGCGCAGCAGCAGTGGCTGCT
GGTGCTAGCTTACAGCTGCAGCAACAAGAGCAGCAGCAGGAGCAACAGCATCAACGGCAGCCGCAGCTTCTTGATCCACTCAATAGCATTCTCAATCTTGGGGCCTGTGC
TTGGGGGCAGTGA
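
The CDS above can be checked against the listed protein sequence by translating it. The sketch below builds the standard genetic code table and translates a nucleotide string codon by codon. It assumes the standard nuclear code applies and that the input contains only A, C, G and T; the function name `translate` is a hypothetical helper.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence text
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First six codons and last four codons of the CDS shown above.
print(translate("ATGGGAGAGCAAACGCAA"))
print(translate("TGGGGGCAGTGA"))
```

The two fragments translate to the start (`MGEQTQ`) and the end (`WGQ` followed by a stop) of the protein sequence in this record. Passing the full wrapped CDS text works the same way, since line breaks are removed before translation.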
mRNA sequence
GAGAAATCGAAAGCCAGTTGGGATCAAGAACGGAGGAGCGAGCGACCGGAAACGGAGACGGAGAAGAGAAAGAGAGAGCATGAAAGAAGAACAGCAGCACCGGGATCTCC
TTCTTCATTCGCGATGCCATTGAAAATTCATTTGTGCTTTCTGTCCTGTTCTATTTCGCAATTGATTTTCAATTTCTCTTTCTGTTAGACTCCAAACACTTATGCTCTGC
ACCTTGGTTATCGAATTTCGCCCCTTTTCTGATTCTCCGCCGCCGTCGCCGCCGCCGTCTTGACTCCGACTCTTACACTGTGTTTCCTCTCAACCGAAGAGTCTGAGTGG
AGGGAAAATAGAGGTTGAATTGAATTTCTTACATATGGGAGAGCAAACGCAAGTGCAGGTTCAGACTCAAACGCAATCGCAGGCTCAGAACGCGCTTCATGAATCCTCCA
ACTCTACAACCACTATTGCTCAAGCCACTATAGCACTAAGCGAGGTGATGAATGCGCCATCGCAAACCTCTTCTCTGCCATCCAAAATGCCTTTGCGTCCACGGAAAATT
CGAAAGCTCTCGCCCGCTGAATCGGATCCGAATTCCTCTCAGATTGTCACCATTCCGGATGGGCCGAAACCTATCGTCACCGGGAAATCTCACAAGAGCAAGACGGCCCA
ACAACGCGCCGCCTTCGCGTCTGCTCCAGTGCTGCCAGCCCGATCACTTTCCTGTGAAGGCGAGGTGGAAATCGCGCTTCGGCATCTTGGGAATGCGGATCCGCTCCTTG
CACCTTTGATCGACCTCCATCAACGTCCTACCTTCGACAGTTTTCAAACCCCATTCCTTGCCCTAACTAGAAGTATCCTATATCAGCAGCTGGCTTACAAAGCTGGCACA
TCTATTTACACCCGTTTTATCGCCCTTTGTGGCGGCGACGCTGGTGTTCTTCCCGAAACCGTTCTTGCCTTGAACACTCAACAGCTCAGGCAAATTGGAATTTCGGGTCG
TAAATCTAGTTATCTTCATGACCTTGCTAGGAAATACCAAAATGGGATTCTTTCAGACACGGCAATTGTAAATATGGATGATAAATCGCTTTTCACGATGCTCACAATGG
TGAATGGAATTGGGTCTTGGTCTGTTCATATGTTCATGATTTTCTCGCTGCACAGACCAGATGTGCTTCCTATCAACGATCTTAATGTTCGCAAAGGTGTTCGGCTTCTC
TACAATCTTGAAGAGTTGCCTCGACCATCACAAATGGATCAATTATGCGAGAAGTGGAGGCCGTATCGATCGGTTGGGTCCTGGTATATGTGGAGGCTTGCTGAGGCAAA
GGGAGCTTCTTCGAGCGCAGCAGCAGTGGCTGCTGGTGCTAGCTTACAGCTGCAGCAACAAGAGCAGCAGCAGGAGCAACAGCATCAACGGCAGCCGCAGCTTCTTGATC
CACTCAATAGCATTCTCAATCTTGGGGCCTGTGCTTGGGGGCAGTGACTCCGTTTGAAAAGAGTACATGTTAGCAAATATCCCAATCAATGTACCCACTGAATGAAAAGT
ATGCCAATTATGTGGGAGCAGAGGACTCGACAACGTATATTCGTTCGGTACCTCGAGGTTGGATATTCTAAACATCTATGCTAAATAATGAAATGACATGTGCCATTTAT
TTTTCTTAGATTGGTTGACATCACTTGTATGGCCGTCAACATACATGTATCATATGGTGGTGATATTGATAGAATGTTCCTCTGAATGCCTGCTTTTAATGTGATTTAGA
AACCGCTTGTTCATTTGCGCATATTCAATGACAGAACAGTCTGTAATGACTTTATGATCCATCTCTTACCTTGTTCTTCATTTCAACTTTTCTGCCACATTGATCATAAT
AATCATAAATGCAGAGAATCAATATGTGGCTGTGGCATTTGTTGTGTAATCTTATCTTATGTCGCCCCATCATCTGTTTGACATGATGGGAGGAAACACAGAGCTCAGTT
TCTCATTCGTGTTTCTTCGTCTGAGAACTTGATCTATAGGTACACCAGGAAGCTTGAATTTTGATGATTTTAGCTGGAGAAATGCTGCGGTTGGGTGAGTCTTCTTCTCA
CTTTGGAACCAAATAACTTTGAATATCCGAAAGTGACGAGACGTTGGCCAACTACTTCCCTTTTTCATTGTGTTTGGTCTTTTGGGCAGAGCCGATTTCATTGGCAGTTC
TCAAGCGATGGGCCATTTTCTGTCCCATGCCATATGTTAGCATCAAATTGGCCTTTGGGGTGTAATGGGTCAAAATTGACATCCAACTAACTTACACGTAGGTTTTAAAG
ATGAAAGATGATATCGACATATTTAATATCCATCTGATTCGATTTTCATAGACA
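
Because the mRNA sequence contains the CDS plus flanking untranslated regions, the UTR lengths can be derived by locating the CDS inside the mRNA. The sketch below does this with a plain substring search. It assumes the CDS occurs as one contiguous match in the mRNA; the function name `utr_lengths` is hypothetical, and the sequences in the example are toy strings, not data from this record.

```python
def utr_lengths(mrna: str, cds: str) -> tuple[int, int]:
    """Return (five_prime_utr_length, three_prime_utr_length)."""
    # Remove whitespace so wrapped, multi-line sequence text can be passed directly.
    mrna = "".join(mrna.split()).upper()
    cds = "".join(cds.split()).upper()
    start = mrna.find(cds)
    if start == -1:
        raise ValueError("CDS not found as a contiguous substring of the mRNA")
    return start, len(mrna) - start - len(cds)


# Toy example: two bases of UTR on each side of a 9-base CDS.
print(utr_lengths("GGATGAAATGACC", "ATGAAATGA"))
```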
Protein sequence
MGEQTQVQVQTQTQSQAQNALHESSNSTTTIAQATIALSEVMNAPSQTSSLPSKMPLRPRKIRKLSPAESDPNSSQIVTIPDGPKPIVTGKSHKSKTAQQRAAFASAPVL
PARSLSCEGEVEIALRHLGNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGDAGVLPETVLALNTQQLRQIGISGRKSSYLHDLARK
YQNGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVRLLYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAA
GASLQLQQQEQQQEQQHQRQPQLLDPLNSILNLGACAWGQ