; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10001612 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10001612
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionDNA glycosylase superfamily protein
Genome locationChr09:18663174..18665278
RNA-Seq ExpressionHG10001612
SyntenyHG10001612
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domainsIPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054725.1 putative GMP synthase [Cucumis melo var. makuwa]2.4e-19992.59Show/hide
Query:  MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKK-NPSLKPPS--AAVSPTSPKSKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLS
        MCRSEE LEAT+VVVDSKFN+RPVLQPTCNRVLDRRNSLKK +PSLKPPS  AAVSPTSPKSKSPRPPATKRANDGNNPMNSSS+KILIPAAA      S
Subjt:  MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKK-NPSLKPPS--AAVSPTSPKSKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLS

Query:  RPRATLDRKKSKSFKLGGNGNVVICDNGGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKPAVEDRRCSF
        RPRATLDRKKSKSFKLGGNGN VICDNGG+EVA   YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKP+VEDRRCSF
Subjt:  RPRATLDRKKSKSFKLGGNGNVVICDNGGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKPAVEDRRCSF

Query:  ITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIK
        ITPNSDPIYVAYHDEEWGVPVHDD+MLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFS+KQMVSIS+EYGIDINRVRGVVDN+IRILQIK
Subjt:  ITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIK

Query:  KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRTPA-TTTTAEVEE
        KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHLHCTLIAAGRRTPA TTTT EVEE
Subjt:  KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRTPA-TTTTAEVEE

Query:  TTETV
         T  V
Subjt:  TTETV

KAG6570606.1 hypothetical protein SDJN03_29521, partial [Cucurbita argyrosperma subsp. sororia]4.5e-19087.83Show/hide
Query:  MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKNPSLKPPSAAVSPTSPKSKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLSRPR
        MCRSE+ LEAT VVVDSKF ARPVLQPTCNRVLDRRNSLK     KPPSAAVSPTSPKSKSPRPPATKRAND  NPMNSSSDKILIPAAA     LSRP+
Subjt:  MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKNPSLKPPSAAVSPTSPKSKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLSRPR

Query:  ATLDRKKSKSFKLGGNGNVVICDN----GGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKPAVEDRRCS
        A LDRKKSKSFKL GNGNVVICDN    GG+EVASLSYASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF+K+VPLDSKIKPAVEDRRCS
Subjt:  ATLDRKKSKSFKLGGNGNVVICDN----GGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKPAVEDRRCS

Query:  FITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI
        FITPNSDPIYVAYHDEEWGVPVHDD+MLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSF +E VA FSDKQM+SISSEYGIDINRVRGVVDNAIRIL+I
Subjt:  FITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI

Query:  KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRTPATTTTAEVEE
        KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHC++ AA RR PA      VEE
Subjt:  KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRTPATTTTAEVEE

Query:  TTETVAASETL
        TT    ASETL
Subjt:  TTETVAASETL

XP_004139917.2 uncharacterized protein LOC101218536 [Cucumis sativus]3.1e-19992.44Show/hide
Query:  MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKK-NPSLKPPS-AAVSPTSPKSKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLSR
        MCRSEETLEAT+VVVDSKFN+RPVLQPT NRVLDRRNSLKK +PSLKPPS AAVSPTSPKSKSPRPPATKRANDGNNPMNSSS+KILIPAA      +SR
Subjt:  MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKK-NPSLKPPS-AAVSPTSPKSKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLSR

Query:  PRATLDRKKSKSFKLGGNGNVVICDNGGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKPAVEDRRCSFI
        PRATLDRKKSKSFKLGGNGN VICDNGG+EVA   YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKPAVEDRRCSFI
Subjt:  PRATLDRKKSKSFKLGGNGNVVICDNGGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKPAVEDRRCSFI

Query:  TPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIKK
        TPNSDPIYVAYHDEEWGVPVHDD+MLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSIS+EYGIDINRVRGVVDNAIRILQIKK
Subjt:  TPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIKK

Query:  EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRTPA-TTTTAEVEET
        EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHLHCTLIAAGRRTPA TTTT EVE  
Subjt:  EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRTPA-TTTTAEVEET

Query:  TETVAASETL
         +T A  ETL
Subjt:  TETVAASETL

XP_022943791.1 uncharacterized protein LOC111448434 [Cucurbita moschata]3.4e-19087.83Show/hide
Query:  MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKNPSLKPPSAAVSPTSPKSKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLSRPR
        MCRSE+ LEAT+VVVDSKF ARPVLQPTCNRVLDRRNSLK     KPPSAAVSPTSPKSKSPRPPATKRAND  NPMNSSSDKILIPAAA     LSRP+
Subjt:  MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKNPSLKPPSAAVSPTSPKSKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLSRPR

Query:  ATLDRKKSKSFKLGGNGNVVICDN----GGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKPAVEDRRCS
        A LDRKKSKSFKL GNGNVVICDN    GG+EVASLSYASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF+K+VPLDSKIKPAVEDRRCS
Subjt:  ATLDRKKSKSFKLGGNGNVVICDN----GGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKPAVEDRRCS

Query:  FITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI
        FITPNSDPIYVAYHDEEWGVPVHDD+MLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSF +E VA FSDKQM+SISSEYGIDINRVRGVVDNAIRIL+I
Subjt:  FITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI

Query:  KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRTPATTTTAEVEE
        KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHC++ AA RR PA      VEE
Subjt:  KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRTPATTTTAEVEE

Query:  TTETVAASETL
        TT    ASETL
Subjt:  TTETVAASETL

XP_038902889.1 uncharacterized protein LOC120089476 [Benincasa hispida]1.7e-21395.37Show/hide
Query:  MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKNPSLKPPS---AAVSPTSPKSKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLS
        MCRSEE LEA+TVVVDSKFNARPVLQPTCNRVLDRRNSLKK PSLKPPS   AAVSPTSPKSKSPRPPATKRANDGNNPMNSSSDKILIPAA NGGGS+S
Subjt:  MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKNPSLKPPS---AAVSPTSPKSKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLS

Query:  RPRATLDRKKSKSFKLGGNGNVVICDNGGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKPAVEDRRCSF
        RPRATLDRKKSKSFKLGGNGNVVICDNGGYEVA LSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKPAVEDRRCSF
Subjt:  RPRATLDRKKSKSFKLGGNGNVVICDNGGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKPAVEDRRCSF

Query:  ITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIK
        ITPNSDPIYVAYHDEEWGVPVHDD+MLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVA FSDKQMVSISSEYGIDINRVRGVVDNAIRILQIK
Subjt:  ITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIK

Query:  KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRTPATTTTAEVEET
        KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHLHCTLIAAGRRT ATTTT EVEET
Subjt:  KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRTPATTTTAEVEET

Query:  TETVAASETL
            A SETL
Subjt:  TETVAASETL

TrEMBL top hitse value%identityAlignment
A0A0A0KED6 Uncharacterized protein1.5e-19992.44Show/hide
Query:  MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKK-NPSLKPPS-AAVSPTSPKSKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLSR
        MCRSEETLEAT+VVVDSKFN+RPVLQPT NRVLDRRNSLKK +PSLKPPS AAVSPTSPKSKSPRPPATKRANDGNNPMNSSS+KILIPAA      +SR
Subjt:  MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKK-NPSLKPPS-AAVSPTSPKSKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLSR

Query:  PRATLDRKKSKSFKLGGNGNVVICDNGGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKPAVEDRRCSFI
        PRATLDRKKSKSFKLGGNGN VICDNGG+EVA   YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKPAVEDRRCSFI
Subjt:  PRATLDRKKSKSFKLGGNGNVVICDNGGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKPAVEDRRCSFI

Query:  TPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIKK
        TPNSDPIYVAYHDEEWGVPVHDD+MLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSIS+EYGIDINRVRGVVDNAIRILQIKK
Subjt:  TPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIKK

Query:  EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRTPA-TTTTAEVEET
        EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHLHCTLIAAGRRTPA TTTT EVE  
Subjt:  EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRTPA-TTTTAEVEET

Query:  TETVAASETL
         +T A  ETL
Subjt:  TETVAASETL

A0A5A7UM21 Putative GMP synthase1.1e-19992.59Show/hide
Query:  MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKK-NPSLKPPS--AAVSPTSPKSKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLS
        MCRSEE LEAT+VVVDSKFN+RPVLQPTCNRVLDRRNSLKK +PSLKPPS  AAVSPTSPKSKSPRPPATKRANDGNNPMNSSS+KILIPAAA      S
Subjt:  MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKK-NPSLKPPS--AAVSPTSPKSKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLS

Query:  RPRATLDRKKSKSFKLGGNGNVVICDNGGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKPAVEDRRCSF
        RPRATLDRKKSKSFKLGGNGN VICDNGG+EVA   YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKP+VEDRRCSF
Subjt:  RPRATLDRKKSKSFKLGGNGNVVICDNGGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKPAVEDRRCSF

Query:  ITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIK
        ITPNSDPIYVAYHDEEWGVPVHDD+MLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFS+KQMVSIS+EYGIDINRVRGVVDN+IRILQIK
Subjt:  ITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIK

Query:  KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRTPA-TTTTAEVEE
        KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHLHCTLIAAGRRTPA TTTT EVEE
Subjt:  KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRTPA-TTTTAEVEE

Query:  TTETV
         T  V
Subjt:  TTETV

A0A6J1D778 uncharacterized protein LOC1110179891.0e-16881.59Show/hide
Query:  MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKN-PSLKPPSAAVSPTSPKSKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLSRP
        MCRSE+ +EAT+VV       R VLQPTCNR L RRNSLKK  PS  PP +  SP SPKSKSPRPPATKRAND    MNSSSDK+++PAAA       RP
Subjt:  MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKN-PSLKPPSAAVSPTSPKSKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLSRP

Query:  RATLDRKKSKSFKLGGNGNVVICDNGGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKPAVEDRRCSFIT
        RA LDRKKSKSFKLGG+G             SLSYASSLITESPGSIAAVRREQVALQQAQRKM+IAHYGRSKSARFEKIVP+DSK KPAVEDRRCSFIT
Subjt:  RATLDRKKSKSFKLGGNGNVVICDNGGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKPAVEDRRCSFIT

Query:  PNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIKKE
        PNSDPIYVAYHDEEWGVPVH+D++LFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFD+E VANFSDKQMVSIS+EYGIDINRVRGVVDNAIRIL+IKKE
Subjt:  PNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIKKE

Query:  FGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRTPATTTTAEVEETTE
        FGSFDKYIWGFVN+KPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHL CTL+AAGRR P      EVEET+E
Subjt:  FGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRTPATTTTAEVEETTE

Query:  TV
        T+
Subjt:  TV

A0A6J1FSP1 uncharacterized protein LOC1114484341.7e-19087.83Show/hide
Query:  MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKNPSLKPPSAAVSPTSPKSKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLSRPR
        MCRSE+ LEAT+VVVDSKF ARPVLQPTCNRVLDRRNSLK     KPPSAAVSPTSPKSKSPRPPATKRAND  NPMNSSSDKILIPAAA     LSRP+
Subjt:  MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKNPSLKPPSAAVSPTSPKSKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLSRPR

Query:  ATLDRKKSKSFKLGGNGNVVICDN----GGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKPAVEDRRCS
        A LDRKKSKSFKL GNGNVVICDN    GG+EVASLSYASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF+K+VPLDSKIKPAVEDRRCS
Subjt:  ATLDRKKSKSFKLGGNGNVVICDN----GGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKPAVEDRRCS

Query:  FITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI
        FITPNSDPIYVAYHDEEWGVPVHDD+MLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSF +E VA FSDKQM+SISSEYGIDINRVRGVVDNAIRIL+I
Subjt:  FITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI

Query:  KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRTPATTTTAEVEE
        KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHC++ AA RR PA      VEE
Subjt:  KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRTPATTTTAEVEE

Query:  TTETVAASETL
        TT    ASETL
Subjt:  TTETVAASETL

A0A6J1J7H3 uncharacterized protein LOC1114841734.1e-18987.1Show/hide
Query:  MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKNPSLKPPSAAVSPTSPKSKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLSRPR
        MCRSE+ LEAT+VVVDSKF ARPVLQPTCNRVLDRRNSLK     KPPSAAVSPTSPKSKSPRPPATKRAN+  NPMNSSSDKILIPAAA     LSRP+
Subjt:  MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKNPSLKPPSAAVSPTSPKSKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLSRPR

Query:  ATLDRKKSKSFKLGGNGNVVICDN----GGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKPAVEDRRCS
        A LDRKKSKSFKL GNGNVVICDN    GG+EVASLSYASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF+K+VPLDSKIKPAVE RRCS
Subjt:  ATLDRKKSKSFKLGGNGNVVICDN----GGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKPAVEDRRCS

Query:  FITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI
        FITPNSDPIYVAYHDEEWGVPVHDD+MLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSF +E VA FSDKQM+SISSEYGIDINRVRGVVDNAIRIL+I
Subjt:  FITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI

Query:  KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRTPATTTTAEVEE
        KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQ AGLTNDHLTSCHRHLHC++ AAGRR PA      VEE
Subjt:  KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRTPATTTTAEVEE

Query:  TTETVAASETL
        TT    ASE+L
Subjt:  TTETVAASETL

SwissProt top hitse value%identityAlignment
P05100 DNA-3-methyladenine glycosylase 13.2e-3439.11Show/hide
Query:  RCSFITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINR--VRGVVDNAI
        RC ++  + DP+Y+AYHD EWGVP  D + LFE++ L   Q G  W ++LKKR+++R  F  FD   VA   ++ +  +  + GI  +R  ++ ++ NA 
Subjt:  RCSFITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINR--VRGVVDNAI

Query:  RILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSC
          LQ+++    F  ++W FVN++P   Q  +  +IP  TS S+ +SK + +RGF+ VG T+ +SFMQA GL NDH+  C
Subjt:  RILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSC

P44321 DNA-3-methyladenine glycosylase1.2e-2836.31Show/hide
Query:  RCSFITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVR--GVVDNAI
        RC ++   S  IY+ YHD+EWG P  D + LFE + L   Q G  W ++LKKR+ +R AF  FD + +A  +   + +     G+  +R +   +V NA 
Subjt:  RCSFITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVR--GVVDNAI

Query:  RILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSC
          L ++K   +F  +IW FVN+KP          +P KT  S+ +SK + +RGF  +G T  ++FMQ+ GL +DHL  C
Subjt:  RILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing]3.0e-4043.85Show/hide
Query:  EDRRCSFITPNSD---PIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINR--VRG
        E  RC++ T   +    +Y  YHD EWG P+H+D+ LFE LVL   Q G  W +ILKKR+ FR AF  FD  IVAN+ + ++  +    GI  NR  +  
Subjt:  EDRRCSFITPNSD---PIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINR--VRG

Query:  VVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHR
         + NA   + +++EFGSFDKYIWGFV  KP    ++S   +P  T  S+ I+KD+ +RGF+ VG T +++ MQ+ G+ NDHLTSC +
Subjt:  VVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHR

Arabidopsis top hitse value%identityAlignment
AT1G75090.1 DNA glycosylase superfamily protein1.2e-5549.27Show/hide
Query:  RRCSFITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDIN--RVRGVVDNA
        +RC +ITPNSDPIYV +HDEEWGVPV DD+ LFELLV S A     W SIL++R DFR  F  FD   +A F++K+++S+     + ++  ++R +V+NA
Subjt:  RRCSFITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDIN--RVRGVVDNA

Query:  IRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRTPATTT
          +L++K+EFGSF  Y W FVN+KP    Y+ G ++PVK+ K+E ISKDM++RGFR VGPTV++SF+QA+G+ NDHLT+C R+  C  +   R T +  T
Subjt:  IRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRTPATTT

Query:  TAEVE
          +++
Subjt:  TAEVE

AT3G12710.1 DNA glycosylase superfamily protein2.5e-9863.92Show/hide
Query:  GGSLSRPRATLDRKKSKSFKLGGNGNVVICDNGGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSA---RFEKIVPLDSKIKPA
        G   ++ R +L+RKKSKSFK G                  SY+S LITE+PGSIAAVRREQVA QQA RK++IAHYGRSKS       K+VPL +   P 
Subjt:  GGSLSRPRATLDRKKSKSFKLGGNGNVVICDNGGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSA---RFEKIVPLDSKIKPA

Query:  VEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDN
           +RCSF+TP SDPIYVAYHDEEWGVPVHDD+ LFELL LS AQVGSDWTS L+KR D+R AF  F++E+VA  ++K+M +IS EY I++++VRGVV+N
Subjt:  VEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDN

Query:  AIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIA
        A +I++IKK F S +KY+WGFVN+KP S  YK GHKIPVKTSKSE+ISKDMVRRGFR VGPTVVHSFMQAAGLTNDHL +C RH  CTL+A
Subjt:  AIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIA

AT5G44680.1 DNA glycosylase superfamily protein1.0e-9152.08Show/hide
Query:  MCRSEETLEATTVVVDSKFNARPVLQPTCNRV--LDRRNSLKKNPSLKPPSAAVSPTSPKSKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLSR
        MC S+  L+  T    S+ N RPVLQP  N+V  LDRRNSLKK+P  KP    ++P + K  SPRP +       + P++ ++  +  PA +     L R
Subjt:  MCRSEETLEATTVVVDSKFNARPVLQPTCNRV--LDRRNSLKKNPSLKPPSAAVSPTSPKSKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLSR

Query:  PRATLDRKKSKSFKLGGNGNVVICDNGGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF-EKIVPLDSKIKPAVEDRRCSF
          +T    KSK      N       +GGY+         ++ + PGSIAA RRE+VA++Q +RK +I+HYGR KS +  EK + ++ + K     +RCSF
Subjt:  PRATLDRKKSKSFKLGGNGNVVICDNGGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF-EKIVPLDSKIKPAVEDRRCSF

Query:  ITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIK
        IT +SDPIYVAYHD+EWGVPVHDD +LFELLVL+ AQVGSDWTS+LK+R  FR AFS F++E+VA+F++K++ SI ++YGI++++V  VVDNA +IL++K
Subjt:  ITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIK

Query:  KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAA
        ++ GSF+KYIWGF+ +KP + +Y S  KIPVKTSKSETISKDMVRRGFR VGPTV+HS MQAAGLTNDHL +C RHL CT +AA
Subjt:  KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAA

AT5G57970.1 DNA glycosylase superfamily protein1.5e-5853.3Show/hide
Query:  LDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDIN
        LDS    +   +RC+++TPNSDP Y+ +HDEEWGVPVHDD+ LFELLVLS A     W +IL KRQ FR  F+ FD   +   ++K+++   S     ++
Subjt:  LDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDIN

Query:  --RVRGVVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHC
          ++R V++NA +IL++ +E+GSFDKYIW FV NK    +++   ++P KT K+E ISKD+VRRGFRSVGPTVV+SFMQAAG+TNDHLTSC R  HC
Subjt:  --RVRGVVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHC

AT5G57970.2 DNA glycosylase superfamily protein1.5e-5853.3Show/hide
Query:  LDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDIN
        LDS    +   +RC+++TPNSDP Y+ +HDEEWGVPVHDD+ LFELLVLS A     W +IL KRQ FR  F+ FD   +   ++K+++   S     ++
Subjt:  LDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDIN

Query:  --RVRGVVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHC
          ++R V++NA +IL++ +E+GSFDKYIW FV NK    +++   ++P KT K+E ISKD+VRRGFRSVGPTVV+SFMQAAG+TNDHLTSC R  HC
Subjt:  --RVRGVVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTCGTTCCGAGGAGACCTTGGAAGCTACTACTGTCGTCGTTGATTCCAAATTCAATGCCCGTCCTGTCCTTCAACCCACTTGCAACCGTGTCCTCGACCGCCGTAA
TTCCCTAAAAAAAAACCCTTCTCTCAAACCCCCTTCCGCCGCCGTCTCCCCCACCTCCCCCAAATCTAAATCCCCCCGTCCTCCGGCCACCAAGCGGGCGAATGACGGAA
ATAATCCCATGAACTCTAGCTCCGACAAGATCCTTATTCCGGCCGCCGCGAACGGTGGCGGGTCTCTGTCACGGCCGAGAGCTACCTTGGATAGAAAGAAATCGAAAAGC
TTCAAATTGGGTGGAAATGGGAATGTTGTGATTTGTGATAATGGTGGATATGAGGTGGCGTCGTTGAGCTACGCTTCTTCTTTGATCACTGAGTCGCCGGGAAGTATCGC
CGCCGTGAGAAGAGAGCAGGTGGCTCTGCAACAGGCGCAGAGGAAGATGAGAATTGCTCATTATGGAAGATCTAAATCTGCTCGTTTTGAAAAAATTGTTCCTCTTGATT
CTAAAATTAAACCCGCTGTTGAAGATAGAAGATGTAGCTTCATCACTCCCAATTCAGATCCCATTTATGTTGCTTACCATGATGAGGAATGGGGCGTCCCTGTTCATGAT
GACAGGATGTTGTTTGAATTGCTGGTTCTAAGCGTAGCCCAGGTGGGTTCGGATTGGACTTCAATTTTGAAGAAACGCCAAGATTTCAGAAATGCATTTTCAAGTTTCGA
TTCAGAAATTGTGGCAAATTTCTCCGACAAACAGATGGTTTCAATCAGCTCAGAATATGGCATCGACATTAACAGAGTCCGAGGAGTCGTCGACAACGCAATCCGAATCC
TCCAGATTAAGAAGGAATTTGGGTCATTCGACAAATACATTTGGGGATTTGTGAACAACAAGCCGTTTTCACCGCAGTACAAATCCGGCCACAAAATTCCGGTGAAAACA
TCAAAATCAGAGACCATAAGCAAAGACATGGTCCGACGAGGTTTCCGGTCGGTCGGACCGACGGTGGTCCACTCCTTCATGCAAGCCGCCGGTCTGACCAACGACCATCT
GACCAGTTGCCACAGGCACCTCCACTGCACCTTAATCGCCGCCGGCCGCCGCACTCCGGCGACGACGACGACGGCAGAAGTGGAGGAGACGACAGAGACGGTGGCAGCTT
CTGAAACTCTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTGTCGTTCCGAGGAGACCTTGGAAGCTACTACTGTCGTCGTTGATTCCAAATTCAATGCCCGTCCTGTCCTTCAACCCACTTGCAACCGTGTCCTCGACCGCCGTAA
TTCCCTAAAAAAAAACCCTTCTCTCAAACCCCCTTCCGCCGCCGTCTCCCCCACCTCCCCCAAATCTAAATCCCCCCGTCCTCCGGCCACCAAGCGGGCGAATGACGGAA
ATAATCCCATGAACTCTAGCTCCGACAAGATCCTTATTCCGGCCGCCGCGAACGGTGGCGGGTCTCTGTCACGGCCGAGAGCTACCTTGGATAGAAAGAAATCGAAAAGC
TTCAAATTGGGTGGAAATGGGAATGTTGTGATTTGTGATAATGGTGGATATGAGGTGGCGTCGTTGAGCTACGCTTCTTCTTTGATCACTGAGTCGCCGGGAAGTATCGC
CGCCGTGAGAAGAGAGCAGGTGGCTCTGCAACAGGCGCAGAGGAAGATGAGAATTGCTCATTATGGAAGATCTAAATCTGCTCGTTTTGAAAAAATTGTTCCTCTTGATT
CTAAAATTAAACCCGCTGTTGAAGATAGAAGATGTAGCTTCATCACTCCCAATTCAGATCCCATTTATGTTGCTTACCATGATGAGGAATGGGGCGTCCCTGTTCATGAT
GACAGGATGTTGTTTGAATTGCTGGTTCTAAGCGTAGCCCAGGTGGGTTCGGATTGGACTTCAATTTTGAAGAAACGCCAAGATTTCAGAAATGCATTTTCAAGTTTCGA
TTCAGAAATTGTGGCAAATTTCTCCGACAAACAGATGGTTTCAATCAGCTCAGAATATGGCATCGACATTAACAGAGTCCGAGGAGTCGTCGACAACGCAATCCGAATCC
TCCAGATTAAGAAGGAATTTGGGTCATTCGACAAATACATTTGGGGATTTGTGAACAACAAGCCGTTTTCACCGCAGTACAAATCCGGCCACAAAATTCCGGTGAAAACA
TCAAAATCAGAGACCATAAGCAAAGACATGGTCCGACGAGGTTTCCGGTCGGTCGGACCGACGGTGGTCCACTCCTTCATGCAAGCCGCCGGTCTGACCAACGACCATCT
GACCAGTTGCCACAGGCACCTCCACTGCACCTTAATCGCCGCCGGCCGCCGCACTCCGGCGACGACGACGACGGCAGAAGTGGAGGAGACGACAGAGACGGTGGCAGCTT
CTGAAACTCTCTAG
Protein sequenceShow/hide protein sequence
MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKNPSLKPPSAAVSPTSPKSKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLSRPRATLDRKKSKS
FKLGGNGNVVICDNGGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHD
DRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKT
SKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRTPATTTTAEVEETTETVAASETL