CuGenDBv2

CcUC02G036310 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC02G036310
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: DNA glycosylase superfamily protein
Genome location: CicolChr02:32100469..32103222
RNA-Seq Expression: CcUC02G036310
Synteny: CcUC02G036310
Gene Ontology terms: GO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domains: IPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase
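The genome location field above follows a `seqid:start..end` pattern. As a minimal sketch, it can be split into its parts as shown below. The names `Locus` and `parse_locus` are hypothetical helpers introduced for illustration, and the span length assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval as written in the 'Genome location' field."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_locus(text: str) -> Locus:
    """Parse a 'seqid:start..end' string into a Locus (hypothetical helper)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Locus(seqid, start, end)


locus = parse_locus("CicolChr02:32100469..32103222")
print(locus.seqid, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption the gene spans 2754 bp on CicolChr02.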


Homology
GenBank top hits | e value | % identity
KAA0054725.1 putative GMP synthase [Cucumis melo var. makuwa] | 2.3e-199 | 92.57
Query:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKK-HPSLKP--PAAAVSPTSPKSKSPRPPATKRANDGNNSMNCSSDKILIPAAANGGGSLS
        MCRSEEALEAT+VVVDSKFN+RPVLQPTCNRVLDRRNSLKK HPSLKP  PAAAVSPTSPKSKSPRPPATKRANDGNN MN SS+KILIPAAA      S
Subjt:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKK-HPSLKP--PAAAVSPTSPKSKSPRPPATKRANDGNNSMNCSSDKILIPAAANGGGSLS

Query:  RPRATLDRKKSKSFKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCSF
        RPRATLDRKKSKSFKLGGNGN VICDNGGFEVA   YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSA FEKIVPLDSKIKP+VEDRRCSF
Subjt:  RPRATLDRKKSKSFKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCSF

Query:  ITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIK
        ITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFS+KQMVSIS+EYGIDINRVRGVVDN+IRILQIK
Subjt:  ITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIK

Query:  KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRGTPA-VTTTVEVEE
        KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGR TPA  TTT EVEE
Subjt:  KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRGTPA-VTTTVEVEE

Query:  TAAS
          A+
Subjt:  TAAS

KAG6570606.1 hypothetical protein SDJN03_29521, partial [Cucurbita argyrosperma subsp. sororia] | 4.9e-189 | 87.47
Query:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKHPSLKPPAAAVSPTSPKSKSPRPPATKRANDGNNSMNCSSDKILIPAAANGGGSLSRPR
        MCRSE+ALEAT VVVDSKF ARPVLQPTCNRVLDRRNSLK     KPP+AAVSPTSPKSKSPRPPATKRAND  N MN SSDKILIPAAA     LSRP+
Subjt:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKHPSLKPPAAAVSPTSPKSKSPRPPATKRANDGNNSMNCSSDKILIPAAANGGGSLSRPR

Query:  ATLDRKKSKSFKLGGNGNVVICDN----GGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCS
        A LDRKKSKSFKL GNGNVVICDN    GGFEVA LSYASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSA F+K+VPLDSKIKPAVEDRRCS
Subjt:  ATLDRKKSKSFKLGGNGNVVICDN----GGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCS

Query:  FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI
        FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSF +E VA FSDKQM+SISSEYGIDINRVRGVVDNAIRIL+I
Subjt:  FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI

Query:  KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRGTPAVTTTVEVEE
        KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHLHC++ AA R  PAV   V  E 
Subjt:  KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRGTPAVTTTVEVEE

Query:  TAASETL
        T ASETL
Subjt:  TAASETL

XP_004139917.2 uncharacterized protein LOC101218536 [Cucumis sativus] | 4.0e-199 | 92.87
Query:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKK-HPSLKPP-AAAVSPTSPKSKSPRPPATKRANDGNNSMNCSSDKILIPAAANGGGSLSR
        MCRSEE LEAT+VVVDSKFN+RPVLQPT NRVLDRRNSLKK HPSLKPP AAAVSPTSPKSKSPRPPATKRANDGNN MN SS+KILIPAA      +SR
Subjt:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKK-HPSLKPP-AAAVSPTSPKSKSPRPPATKRANDGNNSMNCSSDKILIPAAANGGGSLSR

Query:  PRATLDRKKSKSFKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCSFI
        PRATLDRKKSKSFKLGGNGN VICDNGGFEVA   YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSA FEKIVPLDSKIKPAVEDRRCSFI
Subjt:  PRATLDRKKSKSFKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCSFI

Query:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIKK
        TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSIS+EYGIDINRVRGVVDNAIRILQIKK
Subjt:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIKK

Query:  EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRGTPA-VTTTVEVEET
        EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGR TPA  TTT EVE+T
Subjt:  EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRGTPA-VTTTVEVEET

Query:  AA-SETL
        AA  ETL
Subjt:  AA-SETL

XP_022943791.1 uncharacterized protein LOC111448434 [Cucurbita moschata] | 3.7e-189 | 87.47
Query:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKHPSLKPPAAAVSPTSPKSKSPRPPATKRANDGNNSMNCSSDKILIPAAANGGGSLSRPR
        MCRSE+ALEAT+VVVDSKF ARPVLQPTCNRVLDRRNSLK     KPP+AAVSPTSPKSKSPRPPATKRAND  N MN SSDKILIPAAA     LSRP+
Subjt:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKHPSLKPPAAAVSPTSPKSKSPRPPATKRANDGNNSMNCSSDKILIPAAANGGGSLSRPR

Query:  ATLDRKKSKSFKLGGNGNVVICDN----GGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCS
        A LDRKKSKSFKL GNGNVVICDN    GGFEVA LSYASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSA F+K+VPLDSKIKPAVEDRRCS
Subjt:  ATLDRKKSKSFKLGGNGNVVICDN----GGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCS

Query:  FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI
        FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSF +E VA FSDKQM+SISSEYGIDINRVRGVVDNAIRIL+I
Subjt:  FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI

Query:  KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRGTPAVTTTVEVEE
        KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHLHC++ AA R  PAV   V  E 
Subjt:  KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRGTPAVTTTVEVEE

Query:  TAASETL
        T ASETL
Subjt:  TAASETL

XP_038902889.1 uncharacterized protein LOC120089476 [Benincasa hispida] | 1.2e-211 | 94.63
Query:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKHPSLKPPA---AAVSPTSPKSKSPRPPATKRANDGNNSMNCSSDKILIPAAANGGGSLS
        MCRSEEALEA+TVVVDSKFNARPVLQPTCNRVLDRRNSLKK PSLKPP+   AAVSPTSPKSKSPRPPATKRANDGNN MN SSDKILIPAA NGGGS+S
Subjt:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKHPSLKPPA---AAVSPTSPKSKSPRPPATKRANDGNNSMNCSSDKILIPAAANGGGSLS

Query:  RPRATLDRKKSKSFKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCSF
        RPRATLDRKKSKSFKLGGNGNVVICDNGG+EVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSA FEKIVPLDSKIKPAVEDRRCSF
Subjt:  RPRATLDRKKSKSFKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCSF

Query:  ITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIK
        ITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVA FSDKQMVSISSEYGIDINRVRGVVDNAIRILQIK
Subjt:  ITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIK

Query:  KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRGTPAVTTTVEVEE-
        KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGR T A TTT EVEE 
Subjt:  KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRGTPAVTTTVEVEE-

Query:  ---TAASETL
           TA SETL
Subjt:  ---TAASETL
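In the alignment blocks above, the unlabelled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. As a rough sketch, per-block identity can be tallied from that line alone. The function name `midline_stats` is hypothetical, and it assumes the 8-character `Query:  ` prefix (and the matching indent of the middle line) has already been removed. The % identity listed for each hit refers to the whole alignment, so a single block will not generally reproduce it.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one alignment block.

    `query` is the aligned query row (may contain '-' gap characters) and
    `midline` is the match line printed beneath it. Trailing spaces on the
    match line are often lost in copied text, so it is padded to the query width.
    """
    width = len(query)
    padded = midline.ljust(width)[:width]
    identical = sum(ch.isalpha() for ch in padded)   # letters mark exact matches
    positives = identical + padded.count("+")        # '+' marks conservative swaps
    return {
        "length": width,
        "identical": identical,
        "positives": positives,
        "identity_pct": round(100.0 * identical / width, 2),
    }


# Final block of the Benincasa hispida hit, prefixes stripped.
print(midline_stats("---TAASETL", "   TA SETL"))
```

Summing `identical` and `length` over every block of a hit, then dividing, gives a figure comparable to the listed % identity.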

TrEMBL top hits | e value | % identity
A0A0A0KED6 Uncharacterized protein | 1.9e-199 | 92.87
Query:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKK-HPSLKPP-AAAVSPTSPKSKSPRPPATKRANDGNNSMNCSSDKILIPAAANGGGSLSR
        MCRSEE LEAT+VVVDSKFN+RPVLQPT NRVLDRRNSLKK HPSLKPP AAAVSPTSPKSKSPRPPATKRANDGNN MN SS+KILIPAA      +SR
Subjt:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKK-HPSLKPP-AAAVSPTSPKSKSPRPPATKRANDGNNSMNCSSDKILIPAAANGGGSLSR

Query:  PRATLDRKKSKSFKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCSFI
        PRATLDRKKSKSFKLGGNGN VICDNGGFEVA   YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSA FEKIVPLDSKIKPAVEDRRCSFI
Subjt:  PRATLDRKKSKSFKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCSFI

Query:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIKK
        TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSIS+EYGIDINRVRGVVDNAIRILQIKK
Subjt:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIKK

Query:  EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRGTPA-VTTTVEVEET
        EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGR TPA  TTT EVE+T
Subjt:  EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRGTPA-VTTTVEVEET

Query:  AA-SETL
        AA  ETL
Subjt:  AA-SETL

A0A5A7UM21 Putative GMP synthase | 1.1e-199 | 92.57
Query:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKK-HPSLKP--PAAAVSPTSPKSKSPRPPATKRANDGNNSMNCSSDKILIPAAANGGGSLS
        MCRSEEALEAT+VVVDSKFN+RPVLQPTCNRVLDRRNSLKK HPSLKP  PAAAVSPTSPKSKSPRPPATKRANDGNN MN SS+KILIPAAA      S
Subjt:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKK-HPSLKP--PAAAVSPTSPKSKSPRPPATKRANDGNNSMNCSSDKILIPAAANGGGSLS

Query:  RPRATLDRKKSKSFKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCSF
        RPRATLDRKKSKSFKLGGNGN VICDNGGFEVA   YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSA FEKIVPLDSKIKP+VEDRRCSF
Subjt:  RPRATLDRKKSKSFKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCSF

Query:  ITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIK
        ITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFS+KQMVSIS+EYGIDINRVRGVVDN+IRILQIK
Subjt:  ITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIK

Query:  KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRGTPA-VTTTVEVEE
        KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGR TPA  TTT EVEE
Subjt:  KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRGTPA-VTTTVEVEE

Query:  TAAS
          A+
Subjt:  TAAS

A0A6J1D778 uncharacterized protein LOC111017989 | 2.3e-168 | 81.48
Query:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKH-PSLKPPAAAVSPTSPKSKSPRPPATKRANDGNNSMNCSSDKILIPAAANGGGSLSRP
        MCRSE+ +EAT+VV       R VLQPTCNR L RRNSLKK  PS  PP +  SP SPKSKSPRPPATKRAND   +MN SSDK+++PAAA       RP
Subjt:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKH-PSLKPPAAAVSPTSPKSKSPRPPATKRANDGNNSMNCSSDKILIPAAANGGGSLSRP

Query:  RATLDRKKSKSFKLGGNGNVVICDNGGFEVAP-LSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCSFI
        RA LDRKKSKSFKLGG        +G  E AP LSYASSLITESPGSIAAVRREQVALQQAQRKM+IAHYGRSKSA FEKIVP+DSK KPAVEDRRCSFI
Subjt:  RATLDRKKSKSFKLGGNGNVVICDNGGFEVAP-LSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCSFI

Query:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIKK
        TPNSDPIYVAYHDEEWGVPVH+DK+LFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFD+E VANFSDKQMVSIS+EYGIDINRVRGVVDNAIRIL+IKK
Subjt:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIKK

Query:  EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRGTPAVTTTVEVEETA
        EFGSFDKYIWGFVN+KPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHL CTL+AAGR  P     VEVEET 
Subjt:  EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRGTPAVTTTVEVEETA

Query:  ASETL
         SETL
Subjt:  ASETL

A0A6J1FSP1 uncharacterized protein LOC111448434 | 1.8e-189 | 87.47
Query:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKHPSLKPPAAAVSPTSPKSKSPRPPATKRANDGNNSMNCSSDKILIPAAANGGGSLSRPR
        MCRSE+ALEAT+VVVDSKF ARPVLQPTCNRVLDRRNSLK     KPP+AAVSPTSPKSKSPRPPATKRAND  N MN SSDKILIPAAA     LSRP+
Subjt:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKHPSLKPPAAAVSPTSPKSKSPRPPATKRANDGNNSMNCSSDKILIPAAANGGGSLSRPR

Query:  ATLDRKKSKSFKLGGNGNVVICDN----GGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCS
        A LDRKKSKSFKL GNGNVVICDN    GGFEVA LSYASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSA F+K+VPLDSKIKPAVEDRRCS
Subjt:  ATLDRKKSKSFKLGGNGNVVICDN----GGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCS

Query:  FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI
        FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSF +E VA FSDKQM+SISSEYGIDINRVRGVVDNAIRIL+I
Subjt:  FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI

Query:  KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRGTPAVTTTVEVEE
        KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHLHC++ AA R  PAV   V  E 
Subjt:  KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRGTPAVTTTVEVEE

Query:  TAASETL
        T ASETL
Subjt:  TAASETL

A0A6J1J7H3 uncharacterized protein LOC111484173 | 4.5e-188 | 86.73
Query:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKHPSLKPPAAAVSPTSPKSKSPRPPATKRANDGNNSMNCSSDKILIPAAANGGGSLSRPR
        MCRSE+ALEAT+VVVDSKF ARPVLQPTCNRVLDRRNSLK     KPP+AAVSPTSPKSKSPRPPATKRAN+  N MN SSDKILIPAAA     LSRP+
Subjt:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKHPSLKPPAAAVSPTSPKSKSPRPPATKRANDGNNSMNCSSDKILIPAAANGGGSLSRPR

Query:  ATLDRKKSKSFKLGGNGNVVICDN----GGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCS
        A LDRKKSKSFKL GNGNVVICDN    GGFEVA LSYASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSA F+K+VPLDSKIKPAVE RRCS
Subjt:  ATLDRKKSKSFKLGGNGNVVICDN----GGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCS

Query:  FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI
        FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSF +E VA FSDKQM+SISSEYGIDINRVRGVVDNAIRIL+I
Subjt:  FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI

Query:  KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRGTPAVTTTVEVEE
        KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQ AGLTNDHLT+CHRHLHC++ AAGR  PAV   V  E 
Subjt:  KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRGTPAVTTTVEVEE

Query:  TAASETL
        T ASE+L
Subjt:  TAASETL

SwissProt top hits | e value | % identity
P05100 DNA-3-methyladenine glycosylase 1 | 3.2e-34 | 39.66
Query:  RCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINR--VRGVVDNAI
        RC ++  + DP+Y+AYHD EWGVP  D K LFE++ L   Q G  W ++LKKR+++R  F  FD   VA   ++ +  +  + GI  +R  ++ ++ NA 
Subjt:  RCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINR--VRGVVDNAI

Query:  RILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTC
          LQ+++    F  ++W FVN++P   Q  +  +IP  TS S+ +SK + +RGF+ VG T+ +SFMQA GL NDH+  C
Subjt:  RILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTC

P44321 DNA-3-methyladenine glycosylase | 2.0e-28 | 36.31
Query:  RCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVR--GVVDNAI
        RC ++   S  IY+ YHD+EWG P  D + LFE + L   Q G  W ++LKKR+ +R AF  FD + +A  +   + +     G+  +R +   +V NA 
Subjt:  RCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVR--GVVDNAI

Query:  RILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTC
          L ++K   +F  +IW FVN+KP          +P KT  S+ +SK + +RGF  +G T  ++FMQ+ GL +DHL  C
Subjt:  RILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing] | 3.9e-40 | 43.85
Query:  EDRRCSFITPNSD---PIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINR--VRG
        E  RC++ T   +    +Y  YHD EWG P+H+DK LFE LVL   Q G  W +ILKKR+ FR AF  FD  IVAN+ + ++  +    GI  NR  +  
Subjt:  EDRRCSFITPNSD---PIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINR--VRG

Query:  VVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHR
         + NA   + +++EFGSFDKYIWGFV  KP    ++S   +P  T  S+ I+KD+ +RGF+ VG T +++ MQ+ G+ NDHLT+C +
Subjt:  VVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHR

Arabidopsis top hits | e value | % identity
AT1G75090.1 DNA glycosylase superfamily protein | 2.3e-56 | 52.66
Query:  RRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDIN--RVRGVVDNA
        +RC +ITPNSDPIYV +HDEEWGVPV DDK LFELLV S A     W SIL++R DFR  F  FD   +A F++K+++S+     + ++  ++R +V+NA
Subjt:  RRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDIN--RVRGVVDNA

Query:  IRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTL
          +L++K+EFGSF  Y W FVN+KP    Y+ G ++PVK+ K+E ISKDM++RGFR VGPTV++SF+QA+G+ NDHLT C R+  C +
Subjt:  IRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTL

AT3G12710.1 DNA glycosylase superfamily protein | 2.9e-99 | 64.6
Query:  GGSLSRPRATLDRKKSKSFKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSA---HFEKIVPLDSKIKPA
        G   ++ R +L+RKKSKSFK G                  SY+S LITE+PGSIAAVRREQVA QQA RK++IAHYGRSKS       K+VPL +   P 
Subjt:  GGSLSRPRATLDRKKSKSFKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSA---HFEKIVPLDSKIKPA

Query:  VEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDN
           +RCSF+TP SDPIYVAYHDEEWGVPVHDDK LFELL LS AQVGSDWTS L+KR D+R AF  F++E+VA  ++K+M +IS EY I++++VRGVV+N
Subjt:  VEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDN

Query:  AIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIA
        A +I++IKK F S +KY+WGFVN+KP S  YK GHKIPVKTSKSE+ISKDMVRRGFR VGPTVVHSFMQAAGLTNDHL TC RH  CTL+A
Subjt:  AIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIA

AT5G44680.1 DNA glycosylase superfamily protein | 1.4e-93 | 51.95
Query:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRV--LDRRNSLKKHPSLKPPAAAVSPTSPKSKSPRPPATKRANDGNNSMNCSSDKILIPAAANGGGSLSR
        MC S+  L+  T    S+ N RPVLQP  N+V  LDRRNSLKK P  KP    ++P + K  SPRP +                 ++ P  +    SL +
Subjt:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRV--LDRRNSLKKHPSLKPPAAAVSPTSPKSKSPRPPATKRANDGNNSMNCSSDKILIPAAANGGGSLSR

Query:  PRATL-DRKKSKSFKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHF-EKIVPLDSKIKPAVEDRRCS
        P  +  +  +S S K     +    D G  EV P+     ++ + PGSIAA RRE+VA++Q +RK +I+HYGR KS    EK + ++ + K     +RCS
Subjt:  PRATL-DRKKSKSFKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHF-EKIVPLDSKIKPAVEDRRCS

Query:  FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI
        FIT +SDPIYVAYHD+EWGVPVHDD +LFELLVL+ AQVGSDWTS+LK+R  FR AFS F++E+VA+F++K++ SI ++YGI++++V  VVDNA +IL++
Subjt:  FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI

Query:  KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAA
        K++ GSF+KYIWGF+ +KP + +Y S  KIPVKTSKSETISKDMVRRGFR VGPTV+HS MQAAGLTNDHL TC RHL CT +AA
Subjt:  KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAA

AT5G57970.1 DNA glycosylase superfamily protein | 1.5e-58 | 53.3
Query:  LDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDIN
        LDS    +   +RC+++TPNSDP Y+ +HDEEWGVPVHDDK LFELLVLS A     W +IL KRQ FR  F+ FD   +   ++K+++   S     ++
Subjt:  LDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDIN

Query:  --RVRGVVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHC
          ++R V++NA +IL++ +E+GSFDKYIW FV NK    +++   ++P KT K+E ISKD+VRRGFRSVGPTVV+SFMQAAG+TNDHLT+C R  HC
Subjt:  --RVRGVVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHC

AT5G57970.2 DNA glycosylase superfamily protein | 1.5e-58 | 53.3
Query:  LDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDIN
        LDS    +   +RC+++TPNSDP Y+ +HDEEWGVPVHDDK LFELLVLS A     W +IL KRQ FR  F+ FD   +   ++K+++   S     ++
Subjt:  LDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDIN

Query:  --RVRGVVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHC
          ++R V++NA +IL++ +E+GSFDKYIW FV NK    +++   ++P KT K+E ISKD+VRRGFRSVGPTVV+SFMQAAG+TNDHLT+C R  HC
Subjt:  --RVRGVVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHC


Sequences
CDS sequence
ATGTGTCGTTCCGAGGAGGCCTTGGAAGCCACTACTGTCGTCGTTGATTCTAAATTCAACGCCCGTCCTGTCCTTCAACCCACTTGCAACCGTGTCCTCGACCGCCGTAA
TTCCCTCAAAAAACACCCTTCTCTCAAACCCCCCGCCGCCGCCGTCTCGCCCACCTCTCCCAAATCCAAATCCCCCCGTCCTCCGGCCACCAAGCGGGCCAATGATGGTA
ATAATTCCATGAACTGCAGCTCCGACAAGATCCTCATTCCGGCCGCCGCGAACGGTGGCGGGTCTCTGTCACGGCCGAGAGCTACCTTAGATAGGAAGAAATCGAAAAGC
TTCAAATTGGGTGGAAATGGGAATGTTGTGATTTGTGATAATGGTGGATTTGAGGTGGCGCCGTTGAGCTACGCTTCTTCTTTGATCACTGAGTCGCCGGGAAGTATCGC
CGCCGTGAGAAGAGAGCAGGTGGCTCTGCAACAGGCACAGAGGAAGATGAGAATTGCTCATTATGGAAGATCTAAATCTGCTCATTTTGAAAAAATTGTTCCTCTTGATT
CTAAAATTAAACCTGCTGTTGAAGATAGAAGATGTAGCTTCATCACTCCCAATTCAGATCCCATTTATGTTGCTTACCATGATGAAGAATGGGGCGTCCCTGTTCATGAT
GACAAGATGTTGTTTGAATTGCTGGTTCTAAGCGTAGCCCAGGTGGGTTCTGATTGGACTTCAATTTTGAAGAAACGCCAAGATTTCAGAAATGCTTTTTCAAGTTTCGA
TTCAGAAATTGTGGCAAATTTTTCCGACAAACAGATGGTTTCAATCAGCTCAGAATATGGCATCGACATTAACAGAGTCCGAGGAGTCGTGGATAACGCAATCCGAATCC
TCCAGATTAAGAAGGAATTTGGGTCATTCGACAAATACATTTGGGGATTTGTGAACAACAAACCGTTTTCACCGCAGTACAAATCCGGCCATAAAATTCCGGTGAAGACG
TCAAAATCAGAGACCATAAGCAAAGACATGGTCCGGCGAGGTTTCCGGTCGGTCGGACCGACGGTGGTTCACTCCTTCATGCAAGCCGCCGGTCTGACCAACGATCATCT
GACCACTTGCCACAGGCACCTCCACTGCACCTTAATCGCCGCCGGCCGCGGCACTCCGGCGGTAACGACGACGGTGGAAGTGGAGGAGACGGCGGCTTCTGAAACTCTCT
AG
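The CDS above can be checked against the protein sequence listed on this page by translating it codon by codon. The following is a minimal sketch that assumes the standard nuclear genetic code (the page does not name a translation table). `translate` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored so a wrapped sequence can be
    pasted directly. Codons with non-ACGT letters are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":          # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt and last 18 nt of the CDS shown above.
print(translate("ATGTGTCGTTCCGAGGAGGCCTTGGAAGCC"))  # MCRSEEALEA
print(translate("GCTTCTGAAACTCTCTAG"))              # ASETL
```

Both fragments agree with the start (MCRSEEALEA...) and end (...ASETL) of the protein sequence given for this gene.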
mRNA sequence
CATTTCTCCATTTCTCTCTCTTTCTCTCATTTTCCCCAAAATTAAAAACTGAAAAAAACGATGTGTCGTTCCGAGGAGGCCTTGGAAGCCACTACTGTCGTCGTTGATTC
TAAATTCAACGCCCGTCCTGTCCTTCAACCCACTTGCAACCGTGTCCTCGACCGCCGTAATTCCCTCAAAAAACACCCTTCTCTCAAACCCCCCGCCGCCGCCGTCTCGC
CCACCTCTCCCAAATCCAAATCCCCCCGTCCTCCGGCCACCAAGCGGGCCAATGATGGTAATAATTCCATGAACTGCAGCTCCGACAAGATCCTCATTCCGGCCGCCGCG
AACGGTGGCGGGTCTCTGTCACGGCCGAGAGCTACCTTAGATAGGAAGAAATCGAAAAGCTTCAAATTGGGTGGAAATGGGAATGTTGTGATTTGTGATAATGGTGGATT
TGAGGTGGCGCCGTTGAGCTACGCTTCTTCTTTGATCACTGAGTCGCCGGGAAGTATCGCCGCCGTGAGAAGAGAGCAGGTGGCTCTGCAACAGGCACAGAGGAAGATGA
GAATTGCTCATTATGGAAGATCTAAATCTGCTCATTTTGAAAAAATTGTTCCTCTTGATTCTAAAATTAAACCTGCTGTTGAAGATAGAAGATGTAGCTTCATCACTCCC
AATTCAGATCCCATTTATGTTGCTTACCATGATGAAGAATGGGGCGTCCCTGTTCATGATGACAAGATGTTGTTTGAATTGCTGGTTCTAAGCGTAGCCCAGGTGGGTTC
TGATTGGACTTCAATTTTGAAGAAACGCCAAGATTTCAGAAATGCTTTTTCAAGTTTCGATTCAGAAATTGTGGCAAATTTTTCCGACAAACAGATGGTTTCAATCAGCT
CAGAATATGGCATCGACATTAACAGAGTCCGAGGAGTCGTGGATAACGCAATCCGAATCCTCCAGATTAAGAAGGAATTTGGGTCATTCGACAAATACATTTGGGGATTT
GTGAACAACAAACCGTTTTCACCGCAGTACAAATCCGGCCATAAAATTCCGGTGAAGACGTCAAAATCAGAGACCATAAGCAAAGACATGGTCCGGCGAGGTTTCCGGTC
GGTCGGACCGACGGTGGTTCACTCCTTCATGCAAGCCGCCGGTCTGACCAACGATCATCTGACCACTTGCCACAGGCACCTCCACTGCACCTTAATCGCCGCCGGCCGCG
GCACTCCGGCGGTAACGACGACGGTGGAAGTGGAGGAGACGGCGGCTTCTGAAACTCTCTAGAATTGACTCGAGAATTTAATTAACAGACAAAAAGAAAAAGTGATAACC
TTTACGTGGAGTCCCATCAATGATGATTTGCTTGCTAATTAACTAGATAACTATTTTTTTTTTTTTTTTTTTTGTGGGGTTTGTGTATATTAATGTCTATATAAATAGAC
TTGTAAGGGGAAAAAAAATGAAAGAAAAAAAGAGATTGTGGGGTTGTGAATTTGTGTGAAAGTTTTTGTGTGAGAATTTTAGTATAAGTGTTTGTATAATTAAAAGAAAA
AAAGAAAAGGAAAAAAAGAAAATTATTTGAAGTGGTAGGGCTAGAATAGAAGACAGACAGCATGTGCTTGTGCAATTGGGAGGCAATGGCATGTGAGTGTGTCAGTTTGC
TTTTGTAAATTCCCATGTGATCCACCCAAATTTCAAACATTATTAATCATTATTATATTATTTTTATTTTATTTTAGGGGCC
Protein sequence
MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKHPSLKPPAAAVSPTSPKSKSPRPPATKRANDGNNSMNCSSDKILIPAAANGGGSLSRPRATLDRKKSKS
FKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHD
DKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKT
SKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRGTPAVTTTVEVEETAASETL