CuGenDBv2

CmUC02G044420 (gene) of Watermelon (USVL531) v1 genome

Gene ID: CmUC02G044420
Organism: Citrullus mucosospermus (Watermelon (USVL531) v1)
Description: DNA glycosylase superfamily protein
Genome location: CmU531Chr02:32441459..32444267
RNA-Seq Expression: CmUC02G044420
Synteny: CmUC02G044420
Gene Ontology terms:
GO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domains:
IPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase


Homology
GenBank top hits (columns: hit, e-value, % identity, alignment)
KAA0054725.1 putative GMP synthase [Cucumis melo var. makuwa] (e-value: 1.1e-199; identity: 93.07%)
Query:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKK-HPSLKPPS--AAVSPTSPKSKSPRPPATKRANDGNNSMNSSSDKILIPAAANGGGSLS
        MCRSEEALEAT+VVVDSKFN+RPVLQPTCNRVLDRRNSLKK HPSLKPPS  AAVSPTSPKSKSPRPPATKRANDGNN MNSSS+KILIPAAA      S
Subjt:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKK-HPSLKPPS--AAVSPTSPKSKSPRPPATKRANDGNNSMNSSSDKILIPAAANGGGSLS

Query:  RPRATLDRKKSKSFKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCSF
        RPRATLDRKKSKSFKLGGNGN VICDNGGFEVA   YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSA FEKIVPLDSKIKP+VEDRRCSF
Subjt:  RPRATLDRKKSKSFKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCSF

Query:  ITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIK
        ITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFS+KQMVSIS+EYGIDINRVRGVVDN+IRILQIK
Subjt:  ITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIK

Query:  KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRTPA-VTTTAEVEE
        KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRTPA  TTT EVEE
Subjt:  KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRTPA-VTTTAEVEE

Query:  TATA
           A
Subjt:  TATA

KAG6570606.1 hypothetical protein SDJN03_29521, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 9.9e-190; identity: 87.83%)
Query:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKHPSLKPPSAAVSPTSPKSKSPRPPATKRANDGNNSMNSSSDKILIPAAANGGGSLSRPR
        MCRSE+ALEAT VVVDSKF ARPVLQPTCNRVLDRRNSLK     KPPSAAVSPTSPKSKSPRPPATKRAND  N MNSSSDKILIPAAA     LSRP+
Subjt:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKHPSLKPPSAAVSPTSPKSKSPRPPATKRANDGNNSMNSSSDKILIPAAANGGGSLSRPR

Query:  ATLDRKKSKSFKLGGNGNVVICDN----GGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCS
        A LDRKKSKSFKL GNGNVVICDN    GGFEVA LSYASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSA F+K+VPLDSKIKPAVEDRRCS
Subjt:  ATLDRKKSKSFKLGGNGNVVICDN----GGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCS

Query:  FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI
        FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSF +E VA FSDKQM+SISSEYGIDINRVRGVVDNAIRIL+I
Subjt:  FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI

Query:  KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRTPAVTTTAEVEE
        KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHLHC++ AA RR PAV     VEE
Subjt:  KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRTPAVTTTAEVEE

Query:  TATATAASETL
        T   T ASETL
Subjt:  TATATAASETL

XP_004139917.2 uncharacterized protein LOC101218536 [Cucumis sativus] (e-value: 2.4e-199; identity: 92.68%)
Query:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKK-HPSLKPPS-AAVSPTSPKSKSPRPPATKRANDGNNSMNSSSDKILIPAAANGGGSLSR
        MCRSEE LEAT+VVVDSKFN+RPVLQPT NRVLDRRNSLKK HPSLKPPS AAVSPTSPKSKSPRPPATKRANDGNN MNSSS+KILIPAA      +SR
Subjt:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKK-HPSLKPPS-AAVSPTSPKSKSPRPPATKRANDGNNSMNSSSDKILIPAAANGGGSLSR

Query:  PRATLDRKKSKSFKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCSFI
        PRATLDRKKSKSFKLGGNGN VICDNGGFEVA   YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSA FEKIVPLDSKIKPAVEDRRCSFI
Subjt:  PRATLDRKKSKSFKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCSFI

Query:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIKK
        TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSIS+EYGIDINRVRGVVDNAIRILQIKK
Subjt:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIKK

Query:  EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRTPA-VTTTAEVEET
        EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRTPA  TTT EVE+T
Subjt:  EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRTPA-VTTTAEVEET

Query:  ATATAASETL
        A   A  ETL
Subjt:  ATATAASETL

XP_022943791.1 uncharacterized protein LOC111448434 [Cucurbita moschata] (e-value: 7.6e-190; identity: 87.83%)
Query:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKHPSLKPPSAAVSPTSPKSKSPRPPATKRANDGNNSMNSSSDKILIPAAANGGGSLSRPR
        MCRSE+ALEAT+VVVDSKF ARPVLQPTCNRVLDRRNSLK     KPPSAAVSPTSPKSKSPRPPATKRAND  N MNSSSDKILIPAAA     LSRP+
Subjt:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKHPSLKPPSAAVSPTSPKSKSPRPPATKRANDGNNSMNSSSDKILIPAAANGGGSLSRPR

Query:  ATLDRKKSKSFKLGGNGNVVICDN----GGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCS
        A LDRKKSKSFKL GNGNVVICDN    GGFEVA LSYASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSA F+K+VPLDSKIKPAVEDRRCS
Subjt:  ATLDRKKSKSFKLGGNGNVVICDN----GGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCS

Query:  FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI
        FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSF +E VA FSDKQM+SISSEYGIDINRVRGVVDNAIRIL+I
Subjt:  FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI

Query:  KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRTPAVTTTAEVEE
        KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHLHC++ AA RR PAV     VEE
Subjt:  KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRTPAVTTTAEVEE

Query:  TATATAASETL
        T   T ASETL
Subjt:  TATATAASETL

XP_038902889.1 uncharacterized protein LOC120089476 [Benincasa hispida] (e-value: 1.4e-215; identity: 96.34%)
Query:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKHPSLKPPS---AAVSPTSPKSKSPRPPATKRANDGNNSMNSSSDKILIPAAANGGGSLS
        MCRSEEALEA+TVVVDSKFNARPVLQPTCNRVLDRRNSLKK PSLKPPS   AAVSPTSPKSKSPRPPATKRANDGNN MNSSSDKILIPAA NGGGS+S
Subjt:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKHPSLKPPS---AAVSPTSPKSKSPRPPATKRANDGNNSMNSSSDKILIPAAANGGGSLS

Query:  RPRATLDRKKSKSFKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCSF
        RPRATLDRKKSKSFKLGGNGNVVICDNGG+EVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSA FEKIVPLDSKIKPAVEDRRCSF
Subjt:  RPRATLDRKKSKSFKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCSF

Query:  ITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIK
        ITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVA FSDKQMVSISSEYGIDINRVRGVVDNAIRILQIK
Subjt:  ITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIK

Query:  KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRTPAVTTTAEVEET
        KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRT A TTT EVEET
Subjt:  KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRTPAVTTTAEVEET

Query:  ATATAASETL
        ATATA SETL
Subjt:  ATATAASETL

TrEMBL top hits (columns: hit, e-value, % identity, alignment)
A0A0A0KED6 Uncharacterized protein (e-value: 1.1e-199; identity: 92.68%)
Query:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKK-HPSLKPPS-AAVSPTSPKSKSPRPPATKRANDGNNSMNSSSDKILIPAAANGGGSLSR
        MCRSEE LEAT+VVVDSKFN+RPVLQPT NRVLDRRNSLKK HPSLKPPS AAVSPTSPKSKSPRPPATKRANDGNN MNSSS+KILIPAA      +SR
Subjt:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKK-HPSLKPPS-AAVSPTSPKSKSPRPPATKRANDGNNSMNSSSDKILIPAAANGGGSLSR

Query:  PRATLDRKKSKSFKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCSFI
        PRATLDRKKSKSFKLGGNGN VICDNGGFEVA   YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSA FEKIVPLDSKIKPAVEDRRCSFI
Subjt:  PRATLDRKKSKSFKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCSFI

Query:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIKK
        TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSIS+EYGIDINRVRGVVDNAIRILQIKK
Subjt:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIKK

Query:  EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRTPA-VTTTAEVEET
        EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRTPA  TTT EVE+T
Subjt:  EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRTPA-VTTTAEVEET

Query:  ATATAASETL
        A   A  ETL
Subjt:  ATATAASETL

A0A5A7UM21 Putative GMP synthase (e-value: 5.1e-200; identity: 93.07%)
Query:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKK-HPSLKPPS--AAVSPTSPKSKSPRPPATKRANDGNNSMNSSSDKILIPAAANGGGSLS
        MCRSEEALEAT+VVVDSKFN+RPVLQPTCNRVLDRRNSLKK HPSLKPPS  AAVSPTSPKSKSPRPPATKRANDGNN MNSSS+KILIPAAA      S
Subjt:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKK-HPSLKPPS--AAVSPTSPKSKSPRPPATKRANDGNNSMNSSSDKILIPAAANGGGSLS

Query:  RPRATLDRKKSKSFKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCSF
        RPRATLDRKKSKSFKLGGNGN VICDNGGFEVA   YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSA FEKIVPLDSKIKP+VEDRRCSF
Subjt:  RPRATLDRKKSKSFKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCSF

Query:  ITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIK
        ITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFS+KQMVSIS+EYGIDINRVRGVVDN+IRILQIK
Subjt:  ITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIK

Query:  KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRTPA-VTTTAEVEE
        KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRTPA  TTT EVEE
Subjt:  KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRTPA-VTTTAEVEE

Query:  TATA
           A
Subjt:  TATA

A0A6J1D778 uncharacterized protein LOC111017989 (e-value: 4.0e-168; identity: 81.45%)
Query:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKH-PSLKPPSAAVSPTSPKSKSPRPPATKRANDGNNSMNSSSDKILIPAAANGGGSLSRP
        MCRSE+ +EAT+VV       R VLQPTCNR L RRNSLKK  PS  PP +  SP SPKSKSPRPPATKRAND   +MNSSSDK+++PAAA       RP
Subjt:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKH-PSLKPPSAAVSPTSPKSKSPRPPATKRANDGNNSMNSSSDKILIPAAANGGGSLSRP

Query:  RATLDRKKSKSFKLGGNGNVVICDNGGFEVAP-LSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCSFI
        RA LDRKKSKSFKLGG        +G  E AP LSYASSLITESPGSIAAVRREQVALQQAQRKM+IAHYGRSKSA FEKIVP+DSK KPAVEDRRCSFI
Subjt:  RATLDRKKSKSFKLGGNGNVVICDNGGFEVAP-LSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCSFI

Query:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIKK
        TPNSDPIYVAYHDEEWGVPVH+DK+LFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFD+E VANFSDKQMVSIS+EYGIDINRVRGVVDNAIRIL+IKK
Subjt:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIKK

Query:  EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRTPAVTTTAEVEET
        EFGSFDKYIWGFVN+KPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHL CTL+AAGRR P      E  ET
Subjt:  EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRTPAVTTTAEVEET

A0A6J1FSP1 uncharacterized protein LOC111448434 (e-value: 3.7e-190; identity: 87.83%)
Query:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKHPSLKPPSAAVSPTSPKSKSPRPPATKRANDGNNSMNSSSDKILIPAAANGGGSLSRPR
        MCRSE+ALEAT+VVVDSKF ARPVLQPTCNRVLDRRNSLK     KPPSAAVSPTSPKSKSPRPPATKRAND  N MNSSSDKILIPAAA     LSRP+
Subjt:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKHPSLKPPSAAVSPTSPKSKSPRPPATKRANDGNNSMNSSSDKILIPAAANGGGSLSRPR

Query:  ATLDRKKSKSFKLGGNGNVVICDN----GGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCS
        A LDRKKSKSFKL GNGNVVICDN    GGFEVA LSYASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSA F+K+VPLDSKIKPAVEDRRCS
Subjt:  ATLDRKKSKSFKLGGNGNVVICDN----GGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCS

Query:  FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI
        FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSF +E VA FSDKQM+SISSEYGIDINRVRGVVDNAIRIL+I
Subjt:  FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI

Query:  KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRTPAVTTTAEVEE
        KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHLHC++ AA RR PAV     VEE
Subjt:  KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRTPAVTTTAEVEE

Query:  TATATAASETL
        T   T ASETL
Subjt:  TATATAASETL

A0A6J1J7H3 uncharacterized protein LOC111484173 (e-value: 5.3e-189; identity: 87.65%)
Query:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKHPSLKPPSAAVSPTSPKSKSPRPPATKRANDGNNSMNSSSDKILIPAAANGGGSLSRPR
        MCRSE+ALEAT+VVVDSKF ARPVLQPTCNRVLDRRNSLK     KPPSAAVSPTSPKSKSPRPPATKRAN+  N MNSSSDKILIPAAA     LSRP+
Subjt:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKHPSLKPPSAAVSPTSPKSKSPRPPATKRANDGNNSMNSSSDKILIPAAANGGGSLSRPR

Query:  ATLDRKKSKSFKLGGNGNVVICDN----GGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCS
        A LDRKKSKSFKL GNGNVVICDN    GGFEVA LSYASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSA F+K+VPLDSKIKPAVE RRCS
Subjt:  ATLDRKKSKSFKLGGNGNVVICDN----GGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCS

Query:  FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI
        FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSF +E VA FSDKQM+SISSEYGIDINRVRGVVDNAIRIL+I
Subjt:  FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI

Query:  KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRTPAVTTTAEVEE
        KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQ AGLTNDHLT+CHRHLHC++ AAGRR PAV     VEE
Subjt:  KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRTPAVTTTAEVEE

Query:  TATAT
        T TA+
Subjt:  TATAT

SwissProt top hits (columns: hit, e-value, % identity, alignment)
P05100 DNA-3-methyladenine glycosylase 1 (e-value: 3.2e-34; identity: 39.66%)
Query:  RCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINR--VRGVVDNAI
        RC ++  + DP+Y+AYHD EWGVP  D K LFE++ L   Q G  W ++LKKR+++R  F  FD   VA   ++ +  +  + GI  +R  ++ ++ NA 
Subjt:  RCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINR--VRGVVDNAI

Query:  RILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTC
          LQ+++    F  ++W FVN++P   Q  +  +IP  TS S+ +SK + +RGF+ VG T+ +SFMQA GL NDH+  C
Subjt:  RILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTC

P44321 DNA-3-methyladenine glycosylase (e-value: 2.0e-28; identity: 36.31%)
Query:  RCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVR--GVVDNAI
        RC ++   S  IY+ YHD+EWG P  D + LFE + L   Q G  W ++LKKR+ +R AF  FD + +A  +   + +     G+  +R +   +V NA 
Subjt:  RCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVR--GVVDNAI

Query:  RILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTC
          L ++K   +F  +IW FVN+KP          +P KT  S+ +SK + +RGF  +G T  ++FMQ+ GL +DHL  C
Subjt:  RILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing] (e-value: 4.0e-40; identity: 43.85%)
Query:  EDRRCSFITPNSD---PIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINR--VRG
        E  RC++ T   +    +Y  YHD EWG P+H+DK LFE LVL   Q G  W +ILKKR+ FR AF  FD  IVAN+ + ++  +    GI  NR  +  
Subjt:  EDRRCSFITPNSD---PIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINR--VRG

Query:  VVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHR
         + NA   + +++EFGSFDKYIWGFV  KP    ++S   +P  T  S+ I+KD+ +RGF+ VG T +++ MQ+ G+ NDHLT+C +
Subjt:  VVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHR

Arabidopsis top hits (columns: hit, e-value, % identity, alignment)
AT1G75090.1 DNA glycosylase superfamily protein (e-value: 9.0e-56; identity: 49.76%)
Query:  RRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDIN--RVRGVVDNA
        +RC +ITPNSDPIYV +HDEEWGVPV DDK LFELLV S A     W SIL++R DFR  F  FD   +A F++K+++S+     + ++  ++R +V+NA
Subjt:  RRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDIN--RVRGVVDNA

Query:  IRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRTPAVTT
          +L++K+EFGSF  Y W FVN+KP    Y+ G ++PVK+ K+E ISKDM++RGFR VGPTV++SF+QA+G+ NDHLT C R+  C  +   R T +  T
Subjt:  IRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRTPAVTT

Query:  TAEVE
          +++
Subjt:  TAEVE

AT3G12710.1 DNA glycosylase superfamily protein (e-value: 7.3e-98; identity: 60.43%)
Query:  SKSPRPPATKRANDGNNSMNSSSDKI-LIPAAANGGGSLSRPRATLDRKKSKSFKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQ
        SK+     TKR     +S NS  D+   +   +  G   ++ R +L+RKKSKSFK G                  SY+S LITE+PGSIAAVRREQVA Q
Subjt:  SKSPRPPATKRANDGNNSMNSSSDKI-LIPAAANGGGSLSRPRATLDRKKSKSFKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQ

Query:  QAQRKMRIAHYGRSKSA---HFEKIVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFS
        QA RK++IAHYGRSKS       K+VPL +   P    +RCSF+TP SDPIYVAYHDEEWGVPVHDDK LFELL LS AQVGSDWTS L+KR D+R AF 
Subjt:  QAQRKMRIAHYGRSKSA---HFEKIVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFS

Query:  SFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVH
         F++E+VA  ++K+M +IS EY I++++VRGVV+NA +I++IKK F S +KY+WGFVN+KP S  YK GHKIPVKTSKSE+ISKDMVRRGFR VGPTVVH
Subjt:  SFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVH

Query:  SFMQAAGLTNDHLTTCHRHLHCTLIA
        SFMQAAGLTNDHL TC RH  CTL+A
Subjt:  SFMQAAGLTNDHLTTCHRHLHCTLIA

AT5G44680.1 DNA glycosylase superfamily protein (e-value: 3.2e-93; identity: 51.95%)
Query:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRV--LDRRNSLKKHPSLKPPSAAVSPTSPKSKSPRPPATKRANDGNNSMNSSSDKILIPAAANGGGSLSR
        MC S+  L+  T    S+ N RPVLQP  N+V  LDRRNSLKK P  KP    ++P + K  SPRP +                 ++ P  +    SL +
Subjt:  MCRSEEALEATTVVVDSKFNARPVLQPTCNRV--LDRRNSLKKHPSLKPPSAAVSPTSPKSKSPRPPATKRANDGNNSMNSSSDKILIPAAANGGGSLSR

Query:  PRATL-DRKKSKSFKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHF-EKIVPLDSKIKPAVEDRRCS
        P  +  +  +S S K     +    D G  EV P+     ++ + PGSIAA RRE+VA++Q +RK +I+HYGR KS    EK + ++ + K     +RCS
Subjt:  PRATL-DRKKSKSFKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHF-EKIVPLDSKIKPAVEDRRCS

Query:  FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI
        FIT +SDPIYVAYHD+EWGVPVHDD +LFELLVL+ AQVGSDWTS+LK+R  FR AFS F++E+VA+F++K++ SI ++YGI++++V  VVDNA +IL++
Subjt:  FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI

Query:  KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAA
        K++ GSF+KYIWGF+ +KP + +Y S  KIPVKTSKSETISKDMVRRGFR VGPTV+HS MQAAGLTNDHL TC RHL CT +AA
Subjt:  KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAA

AT5G57970.1 DNA glycosylase superfamily protein (e-value: 1.5e-58; identity: 53.3%)
Query:  LDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDIN
        LDS    +   +RC+++TPNSDP Y+ +HDEEWGVPVHDDK LFELLVLS A     W +IL KRQ FR  F+ FD   +   ++K+++   S     ++
Subjt:  LDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDIN

Query:  --RVRGVVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHC
          ++R V++NA +IL++ +E+GSFDKYIW FV NK    +++   ++P KT K+E ISKD+VRRGFRSVGPTVV+SFMQAAG+TNDHLT+C R  HC
Subjt:  --RVRGVVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHC

AT5G57970.2 DNA glycosylase superfamily protein (e-value: 1.5e-58; identity: 53.3%)
Query:  LDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDIN
        LDS    +   +RC+++TPNSDP Y+ +HDEEWGVPVHDDK LFELLVLS A     W +IL KRQ FR  F+ FD   +   ++K+++   S     ++
Subjt:  LDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDIN

Query:  --RVRGVVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHC
          ++R V++NA +IL++ +E+GSFDKYIW FV NK    +++   ++P KT K+E ISKD+VRRGFRSVGPTVV+SFMQAAG+TNDHLT+C R  HC
Subjt:  --RVRGVVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHC


Sequences
CDS sequence
ATGTGTCGTTCCGAGGAGGCCTTGGAAGCCACTACTGTCGTCGTTGATTCCAAATTCAACGCCCGTCCTGTCCTTCAACCCACTTGCAACCGTGTCCTCGACCGCCGTAA
TTCCCTCAAAAAACACCCTTCTCTCAAACCCCCCTCCGCCGCCGTCTCGCCCACCTCTCCCAAATCCAAATCCCCCCGTCCTCCGGCCACCAAGCGGGCCAATGACGGTA
ATAATTCCATGAACTCCAGCTCCGACAAGATCCTCATTCCGGCCGCCGCGAACGGTGGCGGGTCTCTGTCACGGCCGAGAGCTACCTTAGATAGGAAGAAATCGAAAAGC
TTCAAATTGGGTGGAAATGGGAATGTTGTGATTTGTGATAATGGTGGATTTGAGGTGGCGCCGTTGAGCTACGCTTCTTCTTTGATCACTGAGTCGCCGGGAAGTATCGC
CGCCGTGAGAAGAGAGCAGGTGGCTCTGCAACAGGCACAGAGGAAGATGAGAATTGCTCATTATGGAAGATCTAAATCTGCTCATTTTGAAAAAATTGTTCCTCTTGATT
CTAAAATTAAACCTGCTGTTGAAGATAGAAGATGTAGCTTCATCACTCCCAATTCAGATCCCATTTATGTTGCTTACCATGATGAAGAATGGGGCGTCCCTGTTCATGAT
GACAAGATGTTGTTTGAATTGCTGGTTCTAAGCGTAGCCCAGGTGGGTTCTGATTGGACTTCAATTTTGAAGAAACGCCAAGATTTCAGAAATGCTTTTTCAAGTTTCGA
TTCAGAAATTGTGGCAAATTTTTCCGACAAACAGATGGTTTCAATCAGCTCAGAATATGGCATCGACATTAACAGAGTCCGAGGAGTCGTGGATAACGCAATCCGGATCC
TCCAGATTAAGAAGGAATTTGGGTCATTCGACAAATACATTTGGGGATTTGTGAACAACAAACCGTTTTCACCGCAGTACAAATCCGGCCATAAAATTCCGGTGAAGACA
TCAAAATCAGAGACCATAAGCAAAGACATGGTCCGACGAGGTTTCCGGTCGGTCGGACCGACGGTGGTTCACTCCTTCATGCAAGCCGCCGGTCTGACCAACGATCATCT
GACCACTTGCCACAGGCACCTCCACTGTACCTTAATCGCCGCCGGCCGCCGCACTCCGGCGGTAACGACGACGGCGGAAGTGGAGGAGACGGCGACGGCGACGGCGGCTT
CTGAAACTCTCTAG
mRNA sequence
CATTCCCATTTCTCCATTTCTCTCTCTTTCTCTCATTTTCCCCAAAATTAAAAACTGAAAAAAACGATGTGTCGTTCCGAGGAGGCCTTGGAAGCCACTACTGTCGTCGT
TGATTCCAAATTCAACGCCCGTCCTGTCCTTCAACCCACTTGCAACCGTGTCCTCGACCGCCGTAATTCCCTCAAAAAACACCCTTCTCTCAAACCCCCCTCCGCCGCCG
TCTCGCCCACCTCTCCCAAATCCAAATCCCCCCGTCCTCCGGCCACCAAGCGGGCCAATGACGGTAATAATTCCATGAACTCCAGCTCCGACAAGATCCTCATTCCGGCC
GCCGCGAACGGTGGCGGGTCTCTGTCACGGCCGAGAGCTACCTTAGATAGGAAGAAATCGAAAAGCTTCAAATTGGGTGGAAATGGGAATGTTGTGATTTGTGATAATGG
TGGATTTGAGGTGGCGCCGTTGAGCTACGCTTCTTCTTTGATCACTGAGTCGCCGGGAAGTATCGCCGCCGTGAGAAGAGAGCAGGTGGCTCTGCAACAGGCACAGAGGA
AGATGAGAATTGCTCATTATGGAAGATCTAAATCTGCTCATTTTGAAAAAATTGTTCCTCTTGATTCTAAAATTAAACCTGCTGTTGAAGATAGAAGATGTAGCTTCATC
ACTCCCAATTCAGATCCCATTTATGTTGCTTACCATGATGAAGAATGGGGCGTCCCTGTTCATGATGACAAGATGTTGTTTGAATTGCTGGTTCTAAGCGTAGCCCAGGT
GGGTTCTGATTGGACTTCAATTTTGAAGAAACGCCAAGATTTCAGAAATGCTTTTTCAAGTTTCGATTCAGAAATTGTGGCAAATTTTTCCGACAAACAGATGGTTTCAA
TCAGCTCAGAATATGGCATCGACATTAACAGAGTCCGAGGAGTCGTGGATAACGCAATCCGGATCCTCCAGATTAAGAAGGAATTTGGGTCATTCGACAAATACATTTGG
GGATTTGTGAACAACAAACCGTTTTCACCGCAGTACAAATCCGGCCATAAAATTCCGGTGAAGACATCAAAATCAGAGACCATAAGCAAAGACATGGTCCGACGAGGTTT
CCGGTCGGTCGGACCGACGGTGGTTCACTCCTTCATGCAAGCCGCCGGTCTGACCAACGATCATCTGACCACTTGCCACAGGCACCTCCACTGTACCTTAATCGCCGCCG
GCCGCCGCACTCCGGCGGTAACGACGACGGCGGAAGTGGAGGAGACGGCGACGGCGACGGCGGCTTCTGAAACTCTCTAGAATTGACTCGAGACTTTAATTAACAGACAA
AAAGAAAAAGTGATAACCTTTACGTGGAGTCCCATCAATGATGATTTGCTTGCTAATTAACTAGATAACTTTTTTTTTTTTTTTGTGGGGTTTATGTATATTAATGTCTA
TATAAATAGACTTGTAAGAGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGAAAGAAAAAAAGAGATTGTGGGGTTGTGAATTTGTGTGAAAGTTTTTGTGTGAGAAT
TTTAGTGTAAGTGTTTGTATAATTAGAAGAAAAAAAGAAAAGGAAAAAAAGAAAATTATTTGAAGTGGTAGGGATAGAATAGAAAGACAGACAGCATGTGCTTGTGCAAT
TGGGAGGCAATGGCATGTGAGTGTGTCAGTTTGCTTTTGTAAATTCCCATGTGATCCACCCAAATTTCAAACATTATTAATCATTATTATATTATTTTTATTTTA
Protein sequence
MCRSEEALEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKHPSLKPPSAAVSPTSPKSKSPRPPATKRANDGNNSMNSSSDKILIPAAANGGGSLSRPRATLDRKKSKS
FKLGGNGNVVICDNGGFEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSAHFEKIVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHD
DKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKT
SKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRTPAVTTTAEVEETATATAASETL