; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007923 (gene) of Snake gourd v1 genome

Gene IDTan0007923
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDNA glycosylase superfamily protein
Genome locationLG05:4063227..4065319
RNA-Seq ExpressionTan0007923
SyntenyTan0007923
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domainsIPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6570606.1 hypothetical protein SDJN03_29521, partial [Cucurbita argyrosperma subsp. sororia]4.6e-18786.89Show/hide
Query:  MCRSDQALEATSVVVDSKFNARPVLQPTGNRVLDRRNSLKKHPSLKPPSAAVSPTSPKSKSPRPPATKRANDSNSMTSSSDKILLPAATAAAANSGSLSR
        MCRS+QALEAT+VVVDSKF ARPVLQPT NRVLDRRNSLK     KPPSAAVSPTSPKSKSPRPPATKRAND+N M SSSDKIL+PAA        +LSR
Subjt:  MCRSDQALEATSVVVDSKFNARPVLQPTGNRVLDRRNSLKKHPSLKPPSAAVSPTSPKSKSPRPPATKRANDSNSMTSSSDKILLPAATAAAANSGSLSR

Query:  PRAALDRKKSKSFKLGGNGN-VICDNVV-----DVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPIDSKIKPAVEDRR
        P+AALDRKKSKSFKL GNGN VICDNV      +VASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF+K+VP+DSKIKPAVEDRR
Subjt:  PRAALDRKKSKSFKLGGNGN-VICDNVV-----DVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPIDSKIKPAVEDRR

Query:  CSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDINRVRGVVDNAIRIL
        CSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFS+F AE VA FSDKQM+SISSEYGIDINRVRGVVDNAIRIL
Subjt:  CSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDINRVRGVVDNAIRIL

Query:  EIRKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRAPATLAAEVE
        EI+KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHC++ AA RRAPA +   VE
Subjt:  EIRKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRAPATLAAEVE

Query:  ETTTATAASETL
        ETTT   ASETL
Subjt:  ETTTATAASETL

XP_004139917.2 uncharacterized protein LOC101218536 [Cucumis sativus]1.9e-18887.1Show/hide
Query:  MCRSDQALEATSVVVDSKFNARPVLQPTGNRVLDRRNSLKK-HPSLKPPS-AAVSPTSPKSKSPRPPATKRANDSNS-MTSSSDKILLPAATAAAANSGS
        MCRS++ LEATSVVVDSKFN+RPVLQPTGNRVLDRRNSLKK HPSLKPPS AAVSPTSPKSKSPRPPATKRAND N+ M SSS+KIL+PAA         
Subjt:  MCRSDQALEATSVVVDSKFNARPVLQPTGNRVLDRRNSLKK-HPSLKPPS-AAVSPTSPKSKSPRPPATKRANDSNS-MTSSSDKILLPAATAAAANSGS

Query:  LSRPRAALDRKKSKSFKLGGNGNVICDNVVDVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPIDSKIKPAVEDRRCSF
        +SRPRA LDRKKSKSFKLGGNGNVICDN      ++YASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVP+DSKIKPAVEDRRCSF
Subjt:  LSRPRAALDRKKSKSFKLGGNGNVICDNVVDVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPIDSKIKPAVEDRRCSF

Query:  ITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDINRVRGVVDNAIRILEIR
        ITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFS+FD+EIVANFSDKQM+SIS+EYGIDINRVRGVVDNAIRIL+I+
Subjt:  ITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDINRVRGVVDNAIRILEIR

Query:  KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRR--APATLAAEVEE
        KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHLHCTLIAAGRR  AP T   EVE+
Subjt:  KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRR--APATLAAEVEE

Query:  TTTATAASETL
        T    A  ETL
Subjt:  TTTATAASETL

XP_022943791.1 uncharacterized protein LOC111448434 [Cucurbita moschata]2.1e-18787.14Show/hide
Query:  MCRSDQALEATSVVVDSKFNARPVLQPTGNRVLDRRNSLKKHPSLKPPSAAVSPTSPKSKSPRPPATKRANDSNSMTSSSDKILLPAATAAAANSGSLSR
        MCRS+QALEATSVVVDSKF ARPVLQPT NRVLDRRNSLK     KPPSAAVSPTSPKSKSPRPPATKRAND+N M SSSDKIL+PAA        +LSR
Subjt:  MCRSDQALEATSVVVDSKFNARPVLQPTGNRVLDRRNSLKKHPSLKPPSAAVSPTSPKSKSPRPPATKRANDSNSMTSSSDKILLPAATAAAANSGSLSR

Query:  PRAALDRKKSKSFKLGGNGN-VICDNVV-----DVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPIDSKIKPAVEDRR
        P+AALDRKKSKSFKL GNGN VICDNV      +VASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF+K+VP+DSKIKPAVEDRR
Subjt:  PRAALDRKKSKSFKLGGNGN-VICDNVV-----DVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPIDSKIKPAVEDRR

Query:  CSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDINRVRGVVDNAIRIL
        CSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFS+F AE VA FSDKQM+SISSEYGIDINRVRGVVDNAIRIL
Subjt:  CSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDINRVRGVVDNAIRIL

Query:  EIRKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRAPATLAAEVE
        EI+KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHC++ AA RRAPA +   VE
Subjt:  EIRKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRAPATLAAEVE

Query:  ETTTATAASETL
        ETTT   ASETL
Subjt:  ETTTATAASETL

XP_023511876.1 uncharacterized protein LOC111776761 [Cucurbita pepo subsp. pepo]3.5e-18787.44Show/hide
Query:  MCRSDQALEATSVVVDSKFNARPVLQPTGNRVLDRRNSLKKHPSLKPPSAAVSPTSPKSKSPRPPATKRANDSNSMTSSSDKILLPAATAAAANSGSLSR
        MCRS+QALEATSVVVDSKF ARPVLQPTGNRVLDRRNSLK     KPPSAAVSPTSPKSKSPRPPATKRAND+N M SSSDKIL+PAA        +LSR
Subjt:  MCRSDQALEATSVVVDSKFNARPVLQPTGNRVLDRRNSLKKHPSLKPPSAAVSPTSPKSKSPRPPATKRANDSNSMTSSSDKILLPAATAAAANSGSLSR

Query:  PRAALDRKKSKSFKLGGNGN-VICDNVV-----DVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPIDSKIKPAVEDRR
        P+AALDRKKSKSFKL GNGN VICDNV      +VASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF+K+VPIDSKIKPAVEDRR
Subjt:  PRAALDRKKSKSFKLGGNGN-VICDNVV-----DVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPIDSKIKPAVEDRR

Query:  CSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDINRVRGVVDNAIRIL
        CSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFS+F AE VA FSDKQM+SISSEYGIDINRVRGVVDNAIRIL
Subjt:  CSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDINRVRGVVDNAIRIL

Query:  EIRKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRAPATLAAEVE
        EI+KEF SFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTV+HSFMQAAGLTNDHLTSCHRHLHC++ AA RRAPA +   VE
Subjt:  EIRKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRAPATLAAEVE

Query:  ETTTAT
        ETTTA+
Subjt:  ETTTAT

XP_038902889.1 uncharacterized protein LOC120089476 [Benincasa hispida]4.3e-19389.1Show/hide
Query:  MCRSDQALEATSVVVDSKFNARPVLQPTGNRVLDRRNSLKKHPSLKPPS---AAVSPTSPKSKSPRPPATKRANDSNS-MTSSSDKILLPAATAAAANSG
        MCRS++ALEA++VVVDSKFNARPVLQPT NRVLDRRNSLKK PSLKPPS   AAVSPTSPKSKSPRPPATKRAND N+ M SSSDKIL+PAAT      G
Subjt:  MCRSDQALEATSVVVDSKFNARPVLQPTGNRVLDRRNSLKKHPSLKPPS---AAVSPTSPKSKSPRPPATKRANDSNS-MTSSSDKILLPAATAAAANSG

Query:  SLSRPRAALDRKKSKSFKLGGNGN-VICDN-VVDVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPIDSKIKPAVEDRR
        S+SRPRA LDRKKSKSFKLGGNGN VICDN   +VA LSYASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVP+DSKIKPAVEDRR
Subjt:  SLSRPRAALDRKKSKSFKLGGNGN-VICDN-VVDVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPIDSKIKPAVEDRR

Query:  CSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDINRVRGVVDNAIRIL
        CSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFS+FD+EIVA FSDKQM+SISSEYGIDINRVRGVVDNAIRIL
Subjt:  CSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDINRVRGVVDNAIRIL

Query:  EIRKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRAPA-TLAAEV
        +I+KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHLHCTLIAAGRR  A T   EV
Subjt:  EIRKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRAPA-TLAAEV

Query:  EETTTATAASETL
        EET TATA SETL
Subjt:  EETTTATAASETL

TrEMBL top hitse value%identityAlignment
A0A0A0KED6 Uncharacterized protein9.1e-18987.1Show/hide
Query:  MCRSDQALEATSVVVDSKFNARPVLQPTGNRVLDRRNSLKK-HPSLKPPS-AAVSPTSPKSKSPRPPATKRANDSNS-MTSSSDKILLPAATAAAANSGS
        MCRS++ LEATSVVVDSKFN+RPVLQPTGNRVLDRRNSLKK HPSLKPPS AAVSPTSPKSKSPRPPATKRAND N+ M SSS+KIL+PAA         
Subjt:  MCRSDQALEATSVVVDSKFNARPVLQPTGNRVLDRRNSLKK-HPSLKPPS-AAVSPTSPKSKSPRPPATKRANDSNS-MTSSSDKILLPAATAAAANSGS

Query:  LSRPRAALDRKKSKSFKLGGNGNVICDNVVDVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPIDSKIKPAVEDRRCSF
        +SRPRA LDRKKSKSFKLGGNGNVICDN      ++YASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVP+DSKIKPAVEDRRCSF
Subjt:  LSRPRAALDRKKSKSFKLGGNGNVICDNVVDVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPIDSKIKPAVEDRRCSF

Query:  ITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDINRVRGVVDNAIRILEIR
        ITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFS+FD+EIVANFSDKQM+SIS+EYGIDINRVRGVVDNAIRIL+I+
Subjt:  ITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDINRVRGVVDNAIRILEIR

Query:  KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRR--APATLAAEVEE
        KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHLHCTLIAAGRR  AP T   EVE+
Subjt:  KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRR--APATLAAEVEE

Query:  TTTATAASETL
        T    A  ETL
Subjt:  TTTATAASETL

A0A5A7UM21 Putative GMP synthase2.2e-18787.16Show/hide
Query:  MCRSDQALEATSVVVDSKFNARPVLQPTGNRVLDRRNSLKK-HPSLKPPS--AAVSPTSPKSKSPRPPATKRANDSNS-MTSSSDKILLPAATAAAANSG
        MCRS++ALEATSVVVDSKFN+RPVLQPT NRVLDRRNSLKK HPSLKPPS  AAVSPTSPKSKSPRPPATKRAND N+ M SSS+KIL+PAA        
Subjt:  MCRSDQALEATSVVVDSKFNARPVLQPTGNRVLDRRNSLKK-HPSLKPPS--AAVSPTSPKSKSPRPPATKRANDSNS-MTSSSDKILLPAATAAAANSG

Query:  SLSRPRAALDRKKSKSFKLGGNGNVICDNVVDVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPIDSKIKPAVEDRRCS
          SRPRA LDRKKSKSFKLGGNGNVICDN      ++YASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVP+DSKIKP+VEDRRCS
Subjt:  SLSRPRAALDRKKSKSFKLGGNGNVICDNVVDVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPIDSKIKPAVEDRRCS

Query:  FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDINRVRGVVDNAIRILEI
        FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFS+FD+EIVANFS+KQM+SIS+EYGIDINRVRGVVDN+IRIL+I
Subjt:  FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDINRVRGVVDNAIRILEI

Query:  RKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRR--APATLAAEVE
        +KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHLHCTLIAAGRR  AP T   EVE
Subjt:  RKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRR--APATLAAEVE

Query:  ETTTA
        E T A
Subjt:  ETTTA

A0A6J1D778 uncharacterized protein LOC1110179893.7e-16682.21Show/hide
Query:  MCRSDQALEATSVVVDSKFNARPVLQPTGNRVLDRRNSLKKH-PSLKPPSAAVSPTSPKSKSPRPPATKRAND-SNSMTSSSDKILLPAATAAAANSGSL
        MCRS+Q +EATSVV       R VLQPT NR L RRNSLKK  PS  PP +  SP SPKSKSPRPPATKRAND + +M SSSDK++LPAA          
Subjt:  MCRSDQALEATSVVVDSKFNARPVLQPTGNRVLDRRNSLKKH-PSLKPPSAAVSPTSPKSKSPRPPATKRAND-SNSMTSSSDKILLPAATAAAANSGSL

Query:  SRPRAALDRKKSKSFKLGGNGNVICDNVVDVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPIDSKIKPAVEDRRCSFI
        +RPR ALDRKKSKSFKLGG+G    D      SLSYASSLIT+SPGSIAAVRREQVALQQAQRKM+IAHYGRSKSARFEKIVPIDSK KPAVEDRRCSFI
Subjt:  SRPRAALDRKKSKSFKLGGNGNVICDNVVDVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPIDSKIKPAVEDRRCSFI

Query:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDINRVRGVVDNAIRILEIRK
        TPNSDPIYVAYHDEEWGVPVH+DK+LFELLVLSVAQVGSDWTSILKKRQDFRNAFS+FDAE VANFSDKQM+SIS+EYGIDINRVRGVVDNAIRILEI+K
Subjt:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDINRVRGVVDNAIRILEIRK

Query:  EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRAPATLAAEVEETT
        EFGSFDKYIWGFVN+KPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHL CTL+AAGRRAP   A EVEET+
Subjt:  EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRAPATLAAEVEETT

A0A6J1FSP1 uncharacterized protein LOC1114484341.0e-18787.14Show/hide
Query:  MCRSDQALEATSVVVDSKFNARPVLQPTGNRVLDRRNSLKKHPSLKPPSAAVSPTSPKSKSPRPPATKRANDSNSMTSSSDKILLPAATAAAANSGSLSR
        MCRS+QALEATSVVVDSKF ARPVLQPT NRVLDRRNSLK     KPPSAAVSPTSPKSKSPRPPATKRAND+N M SSSDKIL+PAA        +LSR
Subjt:  MCRSDQALEATSVVVDSKFNARPVLQPTGNRVLDRRNSLKKHPSLKPPSAAVSPTSPKSKSPRPPATKRANDSNSMTSSSDKILLPAATAAAANSGSLSR

Query:  PRAALDRKKSKSFKLGGNGN-VICDNVV-----DVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPIDSKIKPAVEDRR
        P+AALDRKKSKSFKL GNGN VICDNV      +VASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF+K+VP+DSKIKPAVEDRR
Subjt:  PRAALDRKKSKSFKLGGNGN-VICDNVV-----DVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPIDSKIKPAVEDRR

Query:  CSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDINRVRGVVDNAIRIL
        CSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFS+F AE VA FSDKQM+SISSEYGIDINRVRGVVDNAIRIL
Subjt:  CSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDINRVRGVVDNAIRIL

Query:  EIRKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRAPATLAAEVE
        EI+KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHC++ AA RRAPA +   VE
Subjt:  EIRKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRAPATLAAEVE

Query:  ETTTATAASETL
        ETTT   ASETL
Subjt:  ETTTATAASETL

A0A6J1J7H3 uncharacterized protein LOC1114841731.4e-18686.95Show/hide
Query:  MCRSDQALEATSVVVDSKFNARPVLQPTGNRVLDRRNSLKKHPSLKPPSAAVSPTSPKSKSPRPPATKRANDSNSMTSSSDKILLPAATAAAANSGSLSR
        MCRS+QALEATSVVVDSKF ARPVLQPT NRVLDRRNSLK     KPPSAAVSPTSPKSKSPRPPATKRAN++N M SSSDKIL+PAA        +LSR
Subjt:  MCRSDQALEATSVVVDSKFNARPVLQPTGNRVLDRRNSLKKHPSLKPPSAAVSPTSPKSKSPRPPATKRANDSNSMTSSSDKILLPAATAAAANSGSLSR

Query:  PRAALDRKKSKSFKLGGNGN-VICDNVV-----DVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPIDSKIKPAVEDRR
        P+AALDRKKSKSFKL GNGN VICDNV      +VASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF+K+VP+DSKIKPAVE RR
Subjt:  PRAALDRKKSKSFKLGGNGN-VICDNVV-----DVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPIDSKIKPAVEDRR

Query:  CSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDINRVRGVVDNAIRIL
        CSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFS+F AE VA FSDKQM+SISSEYGIDINRVRGVVDNAIRIL
Subjt:  CSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDINRVRGVVDNAIRIL

Query:  EIRKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRAPATLAAEVE
        EI+KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQ AGLTNDHLTSCHRHLHC++ AAGRRAPA +   VE
Subjt:  EIRKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRAPATLAAEVE

Query:  ETTTAT
        ETTTA+
Subjt:  ETTTAT

SwissProt top hitse value%identityAlignment
P05100 DNA-3-methyladenine glycosylase 13.2e-3439.11Show/hide
Query:  RCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDINR--VRGVVDNAI
        RC ++  + DP+Y+AYHD EWGVP  D K LFE++ L   Q G  W ++LKKR+++R  F  FD   VA   ++ +  +  + GI  +R  ++ ++ NA 
Subjt:  RCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDINR--VRGVVDNAI

Query:  RILEIRKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSC
          L++ +    F  ++W FVN++P   Q  +  +IP  TS S+ +SK + +RGF+ VG T+ +SFMQA GL NDH+  C
Subjt:  RILEIRKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSC

P44321 DNA-3-methyladenine glycosylase9.1e-2936.31Show/hide
Query:  RCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDINRVR--GVVDNAI
        RC ++   S  IY+ YHD+EWG P  D + LFE + L   Q G  W ++LKKR+ +R AF  FD + +A  +   + +     G+  +R +   +V NA 
Subjt:  RCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDINRVR--GVVDNAI

Query:  RILEIRKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSC
          L + K   +F  +IW FVN+KP          +P KT  S+ +SK + +RGF  +G T  ++FMQ+ GL +DHL  C
Subjt:  RILEIRKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing]1.0e-4044.39Show/hide
Query:  EDRRCSFITPNSD---PIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDINR--VRG
        E  RC++ T   +    +Y  YHD EWG P+H+DK LFE LVL   Q G  W +ILKKR+ FR AF +FD  IVAN+ + ++  +    GI  NR  +  
Subjt:  EDRRCSFITPNSD---PIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDINR--VRG

Query:  VVDNAIRILEIRKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHR
         + NA   + +++EFGSFDKYIWGFV  KP    ++S   +P  T  S+ I+KD+ +RGF+ VG T +++ MQ+ G+ NDHLTSC +
Subjt:  VVDNAIRILEIRKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHR

Arabidopsis top hitse value%identityAlignment
AT1G75090.1 DNA glycosylase superfamily protein6.9e-5641.45Show/hide
Query:  KKSKSFKLGGNGNVICDNVVDVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPIDSKIKPAVED--RRCSFITPNSDPI
        K   + K   N +V  D+    +S S  SS+ T + G +    +                 G  K       V +   I P +    +RC +ITPNSDPI
Subjt:  KKSKSFKLGGNGNVICDNVVDVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPIDSKIKPAVED--RRCSFITPNSDPI

Query:  YVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDIN--RVRGVVDNAIRILEIRKEFGSF
        YV +HDEEWGVPV DDK LFELLV S A     W SIL++R DFR  F  FD   +A F++K+++S+     + ++  ++R +V+NA  +L++++EFGSF
Subjt:  YVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDIN--RVRGVVDNAIRILEIRKEFGSF

Query:  DKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTL
          Y W FVN+KP    Y+ G ++PVK+ K+E ISKDM++RGFR VGPTV++SF+QA+G+ NDHLT+C R+  C +
Subjt:  DKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTL

AT3G12710.1 DNA glycosylase superfamily protein3.1e-9658.97Show/hide
Query:  SKSPRPPATKRA----NDSNSMTSSSDKILLPAATAAAANSGSLSRPRAALDRKKSKSFKLGGNGNVICDNVVDVASLSYASSLITDSPGSIAAVRREQV
        SK+     TKR     +  NS+   S+ +   +     A     ++ R +L+RKKSKSFK G                SY+S LIT++PGSIAAVRREQV
Subjt:  SKSPRPPATKRA----NDSNSMTSSSDKILLPAATAAAANSGSLSRPRAALDRKKSKSFKLGGNGNVICDNVVDVASLSYASSLITDSPGSIAAVRREQV

Query:  ALQQAQRKMRIAHYGRSKSA---RFEKIVPIDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRN
        A QQA RK++IAHYGRSKS       K+VP+ +   P    +RCSF+TP SDPIYVAYHDEEWGVPVHDDK LFELL LS AQVGSDWTS L+KR D+R 
Subjt:  ALQQAQRKMRIAHYGRSKSA---RFEKIVPIDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRN

Query:  AFSNFDAEIVANFSDKQMISISSEYGIDINRVRGVVDNAIRILEIRKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPT
        AF  F+AE+VA  ++K+M +IS EY I++++VRGVV+NA +I+EI+K F S +KY+WGFVN+KP S  YK GHKIPVKTSKSE+ISKDMVRRGFR VGPT
Subjt:  AFSNFDAEIVANFSDKQMISISSEYGIDINRVRGVVDNAIRILEIRKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPT

Query:  VVHSFMQAAGLTNDHLTSCHRHLHCTLIA
        VVHSFMQAAGLTNDHL +C RH  CTL+A
Subjt:  VVHSFMQAAGLTNDHLTSCHRHLHCTLIA

AT5G44680.1 DNA glycosylase superfamily protein1.1e-9051.63Show/hide
Query:  SKFNARPVLQPTGNRV--LDRRNSLKKHPSLKPPSAAVSPTSPKSKSPRPPATKRANDSNSMTSSSDKILLPAATAAAANSGSLSRPRAALDRKKSKSFK
        S+ N RPVLQP  N+V  LDRRNSLKK P  KP    ++P + K  SPRP +      S  ++ ++  +  PA +       S ++ +  +  + S    
Subjt:  SKFNARPVLQPTGNRV--LDRRNSLKKHPSLKPPSAAVSPTSPKSKSPRPPATKRANDSNSMTSSSDKILLPAATAAAANSGSLSRPRAALDRKKSKSFK

Query:  LGGNGNVICDNVVDVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF-EKIVPIDSKIKPAVEDRRCSFITPNSDPIYVAYHDEE
         GG   V+               ++   PGSIAA RRE+VA++Q +RK +I+HYGR KS +  EK + ++ + K     +RCSFIT +SDPIYVAYHD+E
Subjt:  LGGNGNVICDNVVDVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF-EKIVPIDSKIKPAVEDRRCSFITPNSDPIYVAYHDEE

Query:  WGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDINRVRGVVDNAIRILEIRKEFGSFDKYIWGFVNN
        WGVPVHDD +LFELLVL+ AQVGSDWTS+LK+R  FR AFS F+AE+VA+F++K++ SI ++YGI++++V  VVDNA +IL+++++ GSF+KYIWGF+ +
Subjt:  WGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDINRVRGVVDNAIRILEIRKEFGSFDKYIWGFVNN

Query:  KPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAA
        KP + +Y S  KIPVKTSKSETISKDMVRRGFR VGPTV+HS MQAAGLTNDHL +C RHL CT +AA
Subjt:  KPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAA

AT5G57970.1 DNA glycosylase superfamily protein5.1e-5953.81Show/hide
Query:  IDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDIN
        +DS    +   +RC+++TPNSDP Y+ +HDEEWGVPVHDDK LFELLVLS A     W +IL KRQ FR  F++FD   +   ++K++I   S     ++
Subjt:  IDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDIN

Query:  --RVRGVVDNAIRILEIRKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHC
          ++R V++NA +IL++ +E+GSFDKYIW FV NK    +++   ++P KT K+E ISKD+VRRGFRSVGPTVV+SFMQAAG+TNDHLTSC R  HC
Subjt:  --RVRGVVDNAIRILEIRKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHC

AT5G57970.2 DNA glycosylase superfamily protein5.1e-5953.81Show/hide
Query:  IDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDIN
        +DS    +   +RC+++TPNSDP Y+ +HDEEWGVPVHDDK LFELLVLS A     W +IL KRQ FR  F++FD   +   ++K++I   S     ++
Subjt:  IDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDIN

Query:  --RVRGVVDNAIRILEIRKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHC
          ++R V++NA +IL++ +E+GSFDKYIW FV NK    +++   ++P KT K+E ISKD+VRRGFRSVGPTVV+SFMQAAG+TNDHLTSC R  HC
Subjt:  --RVRGVVDNAIRILEIRKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTCGTTCCGACCAAGCCTTGGAAGCCACTTCTGTCGTCGTTGATTCCAAATTCAACGCCCGACCCGTCCTTCAACCCACTGGCAACCGCGTCCTCGACCGCCGTAA
TTCCCTCAAAAAACACCCCTCTCTCAAACCCCCCTCCGCCGCCGTCTCCCCCACTTCCCCGAAGTCCAAATCCCCCCGCCCTCCGGCCACCAAGCGTGCCAATGACAGCA
ATTCCATGACCTCAAGCTCTGACAAGATTCTCCTTCCTGCCGCCACCGCCGCCGCCGCTAACAGCGGCTCTCTGTCTCGCCCCCGGGCTGCCTTGGATAGGAAGAAATCC
AAAAGCTTCAAATTGGGTGGAAATGGGAATGTGATTTGTGATAATGTTGTTGACGTCGCGTCTTTGAGCTACGCTTCTTCTTTAATCACTGACTCGCCGGGGAGTATCGC
CGCCGTGAGAAGAGAACAGGTGGCGCTGCAGCAGGCGCAGAGGAAGATGAGAATTGCCCATTATGGAAGATCTAAATCTGCTCGGTTTGAGAAAATTGTTCCGATTGATT
CTAAAATTAAACCCGCTGTTGAAGATAGAAGATGTAGCTTCATCACTCCCAATTCAGATCCCATTTATGTTGCTTATCATGATGAAGAATGGGGCGTCCCTGTTCATGAT
GACAAAATGCTGTTTGAATTGCTGGTTCTGAGTGTGGCCCAAGTGGGTTCTGATTGGACTTCGATTTTAAAGAAACGCCAAGATTTCAGAAATGCATTTTCAAATTTTGA
TGCAGAAATTGTGGCGAATTTTTCCGACAAACAGATGATTTCAATCAGCTCAGAATATGGCATCGACATTAACAGAGTCCGAGGAGTCGTCGACAACGCAATCCGAATCC
TGGAGATTAGGAAGGAATTTGGTTCATTCGACAAATACATTTGGGGATTTGTGAACAACAAGCCATTTTCACCGCAATACAAATCCGGCCACAAAATTCCGGTCAAGACA
TCAAAATCAGAGACCATAAGCAAAGACATGGTCCGACGAGGATTCCGGTCGGTCGGTCCAACGGTGGTCCACTCCTTCATGCAAGCCGCCGGTCTAACCAACGACCACCT
CACCAGCTGCCACAGGCACCTCCACTGCACGTTAATCGCCGCCGGCCGCCGCGCTCCGGCAACACTGGCGGCGGAAGTGGAGGAGACGACGACGGCGACGGCAGCCTCTG
AAACTCTCTAG
mRNA sequenceShow/hide mRNA sequence
ATTTATTCTTCCCTAAACCAAAAGAAAAAAAAAAAGAAAAAAAGAAAAAAAAAACGATGTGTCGTTCCGACCAAGCCTTGGAAGCCACTTCTGTCGTCGTTGATTCCAAA
TTCAACGCCCGACCCGTCCTTCAACCCACTGGCAACCGCGTCCTCGACCGCCGTAATTCCCTCAAAAAACACCCCTCTCTCAAACCCCCCTCCGCCGCCGTCTCCCCCAC
TTCCCCGAAGTCCAAATCCCCCCGCCCTCCGGCCACCAAGCGTGCCAATGACAGCAATTCCATGACCTCAAGCTCTGACAAGATTCTCCTTCCTGCCGCCACCGCCGCCG
CCGCTAACAGCGGCTCTCTGTCTCGCCCCCGGGCTGCCTTGGATAGGAAGAAATCCAAAAGCTTCAAATTGGGTGGAAATGGGAATGTGATTTGTGATAATGTTGTTGAC
GTCGCGTCTTTGAGCTACGCTTCTTCTTTAATCACTGACTCGCCGGGGAGTATCGCCGCCGTGAGAAGAGAACAGGTGGCGCTGCAGCAGGCGCAGAGGAAGATGAGAAT
TGCCCATTATGGAAGATCTAAATCTGCTCGGTTTGAGAAAATTGTTCCGATTGATTCTAAAATTAAACCCGCTGTTGAAGATAGAAGATGTAGCTTCATCACTCCCAATT
CAGATCCCATTTATGTTGCTTATCATGATGAAGAATGGGGCGTCCCTGTTCATGATGACAAAATGCTGTTTGAATTGCTGGTTCTGAGTGTGGCCCAAGTGGGTTCTGAT
TGGACTTCGATTTTAAAGAAACGCCAAGATTTCAGAAATGCATTTTCAAATTTTGATGCAGAAATTGTGGCGAATTTTTCCGACAAACAGATGATTTCAATCAGCTCAGA
ATATGGCATCGACATTAACAGAGTCCGAGGAGTCGTCGACAACGCAATCCGAATCCTGGAGATTAGGAAGGAATTTGGTTCATTCGACAAATACATTTGGGGATTTGTGA
ACAACAAGCCATTTTCACCGCAATACAAATCCGGCCACAAAATTCCGGTCAAGACATCAAAATCAGAGACCATAAGCAAAGACATGGTCCGACGAGGATTCCGGTCGGTC
GGTCCAACGGTGGTCCACTCCTTCATGCAAGCCGCCGGTCTAACCAACGACCACCTCACCAGCTGCCACAGGCACCTCCACTGCACGTTAATCGCCGCCGGCCGCCGCGC
TCCGGCAACACTGGCGGCGGAAGTGGAGGAGACGACGACGGCGACGGCAGCCTCTGAAACTCTCTAGAATTGACTTAACAGACAAAAAGAAAAAATGATAACCTTTACAA
GGAGTCAATCAATGATGATTTGCTTGCTAATTAACTAGATAACTGTTTTTTTTTTTTTTTTTTGTGGGGTTTGTGTATATTAATGTCTATATAAAAAAATAGACTTGTAA
GAGGAAAAAAAAAAGATTGTGAGGTTGTGAATTTGTGTGAAAGTTTTATTATTATTATTTATTATTGTTTTTTTTTTTGTGGGAATTTTAGTGAAAGTGTTTGTATGAAT
AGAAAAGAAAAGAAAATTATTGAGGAGGTTTGAAGGGGGAAGTGGTAGGGCTAGAATAGGAGAC
Protein sequenceShow/hide protein sequence
MCRSDQALEATSVVVDSKFNARPVLQPTGNRVLDRRNSLKKHPSLKPPSAAVSPTSPKSKSPRPPATKRANDSNSMTSSSDKILLPAATAAAANSGSLSRPRAALDRKKS
KSFKLGGNGNVICDNVVDVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPIDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHD
DKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSNFDAEIVANFSDKQMISISSEYGIDINRVRGVVDNAIRILEIRKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKT
SKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRAPATLAAEVEETTTATAASETL