; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0011063 (gene) of Chayote v1 genome

Gene IDSed0011063
OrganismSechium edule (Chayote v1)
DescriptionDNA glycosylase superfamily protein
Genome locationLG05:42570507..42573056
RNA-Seq ExpressionSed0011063
SyntenySed0011063
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domainsIPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6570606.1 hypothetical protein SDJN03_29521, partial [Cucurbita argyrosperma subsp. sororia]5.7e-17483.25Show/hide
Query:  MCRSDQALEAASVVVVVDNSKFTPRHVLQPTGNRVIDRRGSLKKPPSAAAVAAVPAGPISPKSKSPRPPATKRGNDANPMSSSSDKVLIPAAA--RPRPA
        MCRS+QALEA +VVV   +SKFT R VLQPT NRV+DRR SLKKPPSAA        P SPKSKSPRPPATKR ND NPM+SSSDK+LIPAAA  RP+ A
Subjt:  MCRSDQALEAASVVVVVDNSKFTPRHVLQPTGNRVIDRRGSLKKPPSAAAVAAVPAGPISPKSKSPRPPATKRGNDANPMSSSSDKVLIPAAA--RPRPA

Query:  LERKKSKSFKLGGNGN-VISDNAV-----EVTSLRYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDQIVPIHSKIKPAVEDRRCSFI
        L+RKKSKSFKL GNGN VI DN       EV SL YASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFD++VP+ SKIKPAVEDRRCSFI
Subjt:  LERKKSKSFKLGGNGN-VISDNAV-----EVTSLRYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDQIVPIHSKIKPAVEDRRCSFI

Query:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDINRVRGVVDNAIRILEIKK
        TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILK+RQDFRNAFS+F +E VA FSDKQM+SIS+EYGIDINRVRGVVDNAIRILEIKK
Subjt:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDINRVRGVVDNAIRILEIKK

Query:  EFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHHHCSLISSGRRAPAEVAAE
        EFGSFDKYIWGFVNNK FSPQYKSGH+IPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRH HCS+ ++ RRAPA V  E
Subjt:  EFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHHHCSLISSGRRAPAEVAAE

XP_004139917.2 uncharacterized protein LOC101218536 [Cucumis sativus]4.4e-17482.34Show/hide
Query:  MCRSDQALEAASVVVVVDNSKFTPRHVLQPTGNRVIDRRGSLK------KPPSAAAVAAVPAGPISPKSKSPRPPATKRGNDA-NPMSSSSDKVLIPAA-
        MCRS++ LEA SVVV   +SKF  R VLQPTGNRV+DRR SLK      KPPSAAAV+     P SPKSKSPRPPATKR ND  NPM+SSS+K+LIPAA 
Subjt:  MCRSDQALEAASVVVVVDNSKFTPRHVLQPTGNRVIDRRGSLK------KPPSAAAVAAVPAGPISPKSKSPRPPATKRGNDA-NPMSSSSDKVLIPAA-

Query:  ARPRPALERKKSKSFKLGGNGNVISDNAVEVTSLRYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDQIVPIHSKIKPAVEDRRCSFI
        +RPR  L+RKKSKSFKLGGNGNVI DN      + YASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF++IVP+ SKIKPAVEDRRCSFI
Subjt:  ARPRPALERKKSKSFKLGGNGNVISDNAVEVTSLRYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDQIVPIHSKIKPAVEDRRCSFI

Query:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDINRVRGVVDNAIRILEIKK
        TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILK+RQDFRNAFS+FDSEIVANFSDKQM+SISTEYGIDINRVRGVVDNAIRIL+IKK
Subjt:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDINRVRGVVDNAIRILEIKK

Query:  EFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHHHCSLISSGRRAPAEVAA--EVEEK
        EFGSFDKYIWGFVNNK FSPQYKSGH+IPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRH HC+LI++GRR PA      EVE+ 
Subjt:  EFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHHHCSLISSGRRAPAEVAA--EVEEK

Query:  AA
        AA
Subjt:  AA

XP_022943791.1 uncharacterized protein LOC111448434 [Cucurbita moschata]2.6e-17483.5Show/hide
Query:  MCRSDQALEAASVVVVVDNSKFTPRHVLQPTGNRVIDRRGSLKKPPSAAAVAAVPAGPISPKSKSPRPPATKRGNDANPMSSSSDKVLIPAAA--RPRPA
        MCRS+QALEA SVVV   +SKFT R VLQPT NRV+DRR SLKKPPSAA        P SPKSKSPRPPATKR ND NPM+SSSDK+LIPAAA  RP+ A
Subjt:  MCRSDQALEAASVVVVVDNSKFTPRHVLQPTGNRVIDRRGSLKKPPSAAAVAAVPAGPISPKSKSPRPPATKRGNDANPMSSSSDKVLIPAAA--RPRPA

Query:  LERKKSKSFKLGGNGN-VISDNAV-----EVTSLRYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDQIVPIHSKIKPAVEDRRCSFI
        L+RKKSKSFKL GNGN VI DN       EV SL YASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFD++VP+ SKIKPAVEDRRCSFI
Subjt:  LERKKSKSFKLGGNGN-VISDNAV-----EVTSLRYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDQIVPIHSKIKPAVEDRRCSFI

Query:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDINRVRGVVDNAIRILEIKK
        TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILK+RQDFRNAFS+F +E VA FSDKQM+SIS+EYGIDINRVRGVVDNAIRILEIKK
Subjt:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDINRVRGVVDNAIRILEIKK

Query:  EFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHHHCSLISSGRRAPAEVAAE
        EFGSFDKYIWGFVNNK FSPQYKSGH+IPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRH HCS+ ++ RRAPA V  E
Subjt:  EFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHHHCSLISSGRRAPAEVAAE

XP_022986422.1 uncharacterized protein LOC111484173 [Cucurbita maxima]2.2e-17382.99Show/hide
Query:  MCRSDQALEAASVVVVVDNSKFTPRHVLQPTGNRVIDRRGSLKKPPSAAAVAAVPAGPISPKSKSPRPPATKRGNDANPMSSSSDKVLIPAAA--RPRPA
        MCRS+QALEA SVVV   +SKFT R VLQPT NRV+DRR SLKKPPSAA        P SPKSKSPRPPATKR N+ NPM+SSSDK+LIPAAA  RP+ A
Subjt:  MCRSDQALEAASVVVVVDNSKFTPRHVLQPTGNRVIDRRGSLKKPPSAAAVAAVPAGPISPKSKSPRPPATKRGNDANPMSSSSDKVLIPAAA--RPRPA

Query:  LERKKSKSFKLGGNGN-VISDNAV-----EVTSLRYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDQIVPIHSKIKPAVEDRRCSFI
        L+RKKSKSFKL GNGN VI DN       EV SL YASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFD++VP+ SKIKPAVE RRCSFI
Subjt:  LERKKSKSFKLGGNGN-VISDNAV-----EVTSLRYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDQIVPIHSKIKPAVEDRRCSFI

Query:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDINRVRGVVDNAIRILEIKK
        TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILK+RQDFRNAFS+F +E VA FSDKQM+SIS+EYGIDINRVRGVVDNAIRILEIKK
Subjt:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDINRVRGVVDNAIRILEIKK

Query:  EFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHHHCSLISSGRRAPAEVAAE
        EFGSFDKYIWGFVNNK FSPQYKSGH+IPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQ AGLTNDHLTSCHRH HCS+ ++GRRAPA V  E
Subjt:  EFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHHHCSLISSGRRAPAEVAAE

XP_023511876.1 uncharacterized protein LOC111776761 [Cucurbita pepo subsp. pepo]2.6e-17483.5Show/hide
Query:  MCRSDQALEAASVVVVVDNSKFTPRHVLQPTGNRVIDRRGSLKKPPSAAAVAAVPAGPISPKSKSPRPPATKRGNDANPMSSSSDKVLIPAAA--RPRPA
        MCRS+QALEA SVVV   +SKFT R VLQPTGNRV+DRR SLKKPPSAA        P SPKSKSPRPPATKR ND NPM+SSSDK+LIPAAA  RP+ A
Subjt:  MCRSDQALEAASVVVVVDNSKFTPRHVLQPTGNRVIDRRGSLKKPPSAAAVAAVPAGPISPKSKSPRPPATKRGNDANPMSSSSDKVLIPAAA--RPRPA

Query:  LERKKSKSFKLGGNGN-VISDNAV-----EVTSLRYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDQIVPIHSKIKPAVEDRRCSFI
        L+RKKSKSFKL GNGN VI DN       EV SL YASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFD++VPI SKIKPAVEDRRCSFI
Subjt:  LERKKSKSFKLGGNGN-VISDNAV-----EVTSLRYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDQIVPIHSKIKPAVEDRRCSFI

Query:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDINRVRGVVDNAIRILEIKK
        TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILK+RQDFRNAFS+F +E VA FSDKQM+SIS+EYGIDINRVRGVVDNAIRILEIKK
Subjt:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDINRVRGVVDNAIRILEIKK

Query:  EFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHHHCSLISSGRRAPAEVAAE
        EF SFDKYIWGFVNNK FSPQYKSGH+IPVKTSKSETISKDM+RRGFRSVGPTV+HSFMQAAGLTNDHLTSCHRH HCS+ ++ RRAPA V  E
Subjt:  EFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHHHCSLISSGRRAPAEVAAE

TrEMBL top hitse value%identityAlignment
A0A0A0KED6 Uncharacterized protein2.1e-17482.34Show/hide
Query:  MCRSDQALEAASVVVVVDNSKFTPRHVLQPTGNRVIDRRGSLK------KPPSAAAVAAVPAGPISPKSKSPRPPATKRGNDA-NPMSSSSDKVLIPAA-
        MCRS++ LEA SVVV   +SKF  R VLQPTGNRV+DRR SLK      KPPSAAAV+     P SPKSKSPRPPATKR ND  NPM+SSS+K+LIPAA 
Subjt:  MCRSDQALEAASVVVVVDNSKFTPRHVLQPTGNRVIDRRGSLK------KPPSAAAVAAVPAGPISPKSKSPRPPATKRGNDA-NPMSSSSDKVLIPAA-

Query:  ARPRPALERKKSKSFKLGGNGNVISDNAVEVTSLRYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDQIVPIHSKIKPAVEDRRCSFI
        +RPR  L+RKKSKSFKLGGNGNVI DN      + YASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF++IVP+ SKIKPAVEDRRCSFI
Subjt:  ARPRPALERKKSKSFKLGGNGNVISDNAVEVTSLRYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDQIVPIHSKIKPAVEDRRCSFI

Query:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDINRVRGVVDNAIRILEIKK
        TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILK+RQDFRNAFS+FDSEIVANFSDKQM+SISTEYGIDINRVRGVVDNAIRIL+IKK
Subjt:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDINRVRGVVDNAIRILEIKK

Query:  EFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHHHCSLISSGRRAPAEVAA--EVEEK
        EFGSFDKYIWGFVNNK FSPQYKSGH+IPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRH HC+LI++GRR PA      EVE+ 
Subjt:  EFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHHHCSLISSGRRAPAEVAA--EVEEK

Query:  AA
        AA
Subjt:  AA

A0A5A7UM21 Putative GMP synthase3.4e-17281.39Show/hide
Query:  MCRSDQALEAASVVVVVDNSKFTPRHVLQPTGNRVIDRRGSLK------KPPSAAAVAAVPAGPISPKSKSPRPPATKRGNDA-NPMSSSSDKVLIPAAA
        MCRS++ALEA SVVV   +SKF  R VLQPT NRV+DRR SLK      KPPS AA  +    P SPKSKSPRPPATKR ND  NPM+SSS+K+LIPAAA
Subjt:  MCRSDQALEAASVVVVVDNSKFTPRHVLQPTGNRVIDRRGSLK------KPPSAAAVAAVPAGPISPKSKSPRPPATKRGNDA-NPMSSSSDKVLIPAAA

Query:  -RPRPALERKKSKSFKLGGNGNVISDNAVEVTSLRYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDQIVPIHSKIKPAVEDRRCSFI
         RPR  L+RKKSKSFKLGGNGNVI DN      + YASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF++IVP+ SKIKP+VEDRRCSFI
Subjt:  -RPRPALERKKSKSFKLGGNGNVISDNAVEVTSLRYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDQIVPIHSKIKPAVEDRRCSFI

Query:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDINRVRGVVDNAIRILEIKK
        TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILK+RQDFRNAFS+FDSEIVANFS+KQM+SISTEYGIDINRVRGVVDN+IRIL+IKK
Subjt:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDINRVRGVVDNAIRILEIKK

Query:  EFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHHHCSLISSGRRAPAEVAA--EVEEK
        EFGSFDKYIWGFVNNK FSPQYKSGH+IPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRH HC+LI++GRR PA      EVEE 
Subjt:  EFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHHHCSLISSGRRAPAEVAA--EVEEK

Query:  AAA
         AA
Subjt:  AAA

A0A6J1D778 uncharacterized protein LOC1110179897.0e-16280Show/hide
Query:  MCRSDQALEAASVVVVVDNSKFTPRHVLQPTGNRVIDRRGSLKKPPSAAAVAAVPAGPISPKSKSPRPPATKRGND-ANPMSSSSDKVLIPAAARPRPAL
        MCRS+Q +EA SVV V        R VLQPT NR + RR SLKK P + +    P  P SPKSKSPRPPATKR ND A  M+SSSDK+++PAAARPR AL
Subjt:  MCRSDQALEAASVVVVVDNSKFTPRHVLQPTGNRVIDRRGSLKKPPSAAAVAAVPAGPISPKSKSPRPPATKRGND-ANPMSSSSDKVLIPAAARPRPAL

Query:  ERKKSKSFKLGGNGNVISDNAVEVTSLRYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDQIVPIHSKIKPAVEDRRCSFITPNSDPI
        +RKKSKSFKLGG+G   +D A    SL YASSLIT+SPGSIAAVRREQVALQQAQRKM+IAHYGRSKSARF++IVPI SK KPAVEDRRCSFITPNSDPI
Subjt:  ERKKSKSFKLGGNGNVISDNAVEVTSLRYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDQIVPIHSKIKPAVEDRRCSFITPNSDPI

Query:  YVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDK
        YVAYHDEEWGVPVH+DK+LFELLVLSVAQVGSDWTSILK+RQDFRNAFS+FD+E VANFSDKQM+SISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDK
Subjt:  YVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDK

Query:  YIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHHHCSLISSGRRAPAEVAAEVEE
        YIWGFVN+K FSPQYKSGH+IPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRH  C+L+++GRRAP   A EVEE
Subjt:  YIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHHHCSLISSGRRAPAEVAAEVEE

A0A6J1FSP1 uncharacterized protein LOC1114484341.2e-17483.5Show/hide
Query:  MCRSDQALEAASVVVVVDNSKFTPRHVLQPTGNRVIDRRGSLKKPPSAAAVAAVPAGPISPKSKSPRPPATKRGNDANPMSSSSDKVLIPAAA--RPRPA
        MCRS+QALEA SVVV   +SKFT R VLQPT NRV+DRR SLKKPPSAA        P SPKSKSPRPPATKR ND NPM+SSSDK+LIPAAA  RP+ A
Subjt:  MCRSDQALEAASVVVVVDNSKFTPRHVLQPTGNRVIDRRGSLKKPPSAAAVAAVPAGPISPKSKSPRPPATKRGNDANPMSSSSDKVLIPAAA--RPRPA

Query:  LERKKSKSFKLGGNGN-VISDNAV-----EVTSLRYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDQIVPIHSKIKPAVEDRRCSFI
        L+RKKSKSFKL GNGN VI DN       EV SL YASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFD++VP+ SKIKPAVEDRRCSFI
Subjt:  LERKKSKSFKLGGNGN-VISDNAV-----EVTSLRYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDQIVPIHSKIKPAVEDRRCSFI

Query:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDINRVRGVVDNAIRILEIKK
        TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILK+RQDFRNAFS+F +E VA FSDKQM+SIS+EYGIDINRVRGVVDNAIRILEIKK
Subjt:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDINRVRGVVDNAIRILEIKK

Query:  EFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHHHCSLISSGRRAPAEVAAE
        EFGSFDKYIWGFVNNK FSPQYKSGH+IPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRH HCS+ ++ RRAPA V  E
Subjt:  EFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHHHCSLISSGRRAPAEVAAE

A0A6J1J7H3 uncharacterized protein LOC1114841731.0e-17382.99Show/hide
Query:  MCRSDQALEAASVVVVVDNSKFTPRHVLQPTGNRVIDRRGSLKKPPSAAAVAAVPAGPISPKSKSPRPPATKRGNDANPMSSSSDKVLIPAAA--RPRPA
        MCRS+QALEA SVVV   +SKFT R VLQPT NRV+DRR SLKKPPSAA        P SPKSKSPRPPATKR N+ NPM+SSSDK+LIPAAA  RP+ A
Subjt:  MCRSDQALEAASVVVVVDNSKFTPRHVLQPTGNRVIDRRGSLKKPPSAAAVAAVPAGPISPKSKSPRPPATKRGNDANPMSSSSDKVLIPAAA--RPRPA

Query:  LERKKSKSFKLGGNGN-VISDNAV-----EVTSLRYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDQIVPIHSKIKPAVEDRRCSFI
        L+RKKSKSFKL GNGN VI DN       EV SL YASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFD++VP+ SKIKPAVE RRCSFI
Subjt:  LERKKSKSFKLGGNGN-VISDNAV-----EVTSLRYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDQIVPIHSKIKPAVEDRRCSFI

Query:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDINRVRGVVDNAIRILEIKK
        TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILK+RQDFRNAFS+F +E VA FSDKQM+SIS+EYGIDINRVRGVVDNAIRILEIKK
Subjt:  TPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDINRVRGVVDNAIRILEIKK

Query:  EFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHHHCSLISSGRRAPAEVAAE
        EFGSFDKYIWGFVNNK FSPQYKSGH+IPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQ AGLTNDHLTSCHRH HCS+ ++GRRAPA V  E
Subjt:  EFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHHHCSLISSGRRAPAEVAAE

SwissProt top hitse value%identityAlignment
P05100 DNA-3-methyladenine glycosylase 11.0e-3237.99Show/hide
Query:  RCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDINR--VRGVVDNAI
        RC ++  + DP+Y+AYHD EWGVP  D K LFE++ L   Q G  W ++LK+R+++R  F  FD   VA   ++ +  +  + GI  +R  ++ ++ NA 
Subjt:  RCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDINR--VRGVVDNAI

Query:  RILEIKKEFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSC
          L++++    F  ++W FVN++    Q  +   IP  TS S+ +SK + +RGF+ VG T+ +SFMQA GL NDH+  C
Subjt:  RILEIKKEFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSC

P44321 DNA-3-methyladenine glycosylase2.9e-2735.2Show/hide
Query:  RCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDINRVR--GVVDNAI
        RC ++   S  IY+ YHD+EWG P  D + LFE + L   Q G  W ++LK+R+ +R AF  FD + +A  +   + +     G+  +R +   +V NA 
Subjt:  RCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDINRVR--GVVDNAI

Query:  RILEIKKEFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSC
          L ++K   +F  +IW FVN+K           +P KT  S+ +SK + +RGF  +G T  ++FMQ+ GL +DHL  C
Subjt:  RILEIKKEFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing]3.3e-3943.32Show/hide
Query:  EDRRCSFITPNSD---PIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDINR--VRG
        E  RC++ T   +    +Y  YHD EWG P+H+DK LFE LVL   Q G  W +ILK+R+ FR AF +FD  IVAN+ + ++  +    GI  NR  +  
Subjt:  EDRRCSFITPNSD---PIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDINR--VRG

Query:  VVDNAIRILEIKKEFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHR
         + NA   + +++EFGSFDKYIWGFV  K     ++S   +P  T  S+ I+KD+ +RGF+ VG T +++ MQ+ G+ NDHLTSC +
Subjt:  VVDNAIRILEIKKEFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHR

Arabidopsis top hitse value%identityAlignment
AT1G75090.1 DNA glycosylase superfamily protein3.0e-5641.2Show/hide
Query:  RPALERKKSKS---FKLGGNGNVISDNAVEVTSLRYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDQIVPIHSKIKPAVED--RRCS
        +P L  + +KS    K   N +V +D++   +S    SS+ T + G +    +                 G  K       V +   I P +    +RC 
Subjt:  RPALERKKSKS---FKLGGNGNVISDNAVEVTSLRYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDQIVPIHSKIKPAVED--RRCS

Query:  FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDIN--RVRGVVDNAIRIL
        +ITPNSDPIYV +HDEEWGVPV DDK LFELLV S A     W SIL+RR DFR  F  FD   +A F++K+++S+     + ++  ++R +V+NA  +L
Subjt:  FITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDIN--RVRGVVDNAIRIL

Query:  EIKKEFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHHHCSL
        ++K+EFGSF  Y W FVN+K     Y+ G ++PVK+ K+E ISKDM++RGFR VGPTV++SF+QA+G+ NDHLT+C R+  C++
Subjt:  EIKKEFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHHHCSL

AT3G12710.1 DNA glycosylase superfamily protein5.1e-9658.75Show/hide
Query:  ISPKSKSPRPPATKRGNDANPMSSSSDKVLIPAAARPRPALERKKSKSFKLGGNGNVISDNAVEVTSLRYASSLITDSPGSIAAVRREQVALQQAQRKMR
        IS   +   PP++         S   D V+   AA+ R +LERKKSKSFK G +               Y+S LIT++PGSIAAVRREQVA QQA RK++
Subjt:  ISPKSKSPRPPATKRGNDANPMSSSSDKVLIPAAARPRPALERKKSKSFKLGGNGNVISDNAVEVTSLRYASSLITDSPGSIAAVRREQVALQQAQRKMR

Query:  IAHYGRSKSA---RFDQIVPIHSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIV
        IAHYGRSKS       ++VP+ +   P    +RCSF+TP SDPIYVAYHDEEWGVPVHDDK LFELL LS AQVGSDWTS L++R D+R AF  F++E+V
Subjt:  IAHYGRSKSA---RFDQIVPIHSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIV

Query:  ANFSDKQMISISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAG
        A  ++K+M +IS EY I++++VRGVV+NA +I+EIKK F S +KY+WGFVN+K  S  YK GH+IPVKTSKSE+ISKDMVRRGFR VGPTVVHSFMQAAG
Subjt:  ANFSDKQMISISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAG

Query:  LTNDHLTSCHRHHHCSLISS
        LTNDHL +C RH  C+L+++
Subjt:  LTNDHLTSCHRHHHCSLISS

AT5G44680.1 DNA glycosylase superfamily protein1.9e-8749.45Show/hide
Query:  SKFTPRHVLQPTGNRV--IDRRGSLKKPPSAAAVAAVPAGPISPKSKSPR------PPATKRGNDANPMSSSSDKVLIPAAARPRPALERKKSKSFKLGG
        S+   R VLQP  N+V  +DRR SLKK P        P  PI+ K  SPR      PP +         + S  ++L  ++ + +P +  + S     GG
Subjt:  SKFTPRHVLQPTGNRV--IDRRGSLKKPPSAAAVAAVPAGPISPKSKSPR------PPATKRGNDANPMSSSSDKVLIPAAARPRPALERKKSKSFKLGG

Query:  NGNVISDNAVEVTSLRYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDQ--IVPIHSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWG
           V+               ++   PGSIAA RRE+VA++Q +RK +I+HYGR KS + ++  +   H K K      RCSFIT +SDPIYVAYHD+EWG
Subjt:  NGNVISDNAVEVTSLRYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDQ--IVPIHSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWG

Query:  VPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWGFVNNKQ
        VPVHDD +LFELLVL+ AQVGSDWTS+LKRR  FR AFS F++E+VA+F++K++ SI  +YGI++++V  VVDNA +IL++K++ GSF+KYIWGF+ +K 
Subjt:  VPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWGFVNNKQ

Query:  FSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHHHCSLISS
         + +Y S  +IPVKTSKSETISKDMVRRGFR VGPTV+HS MQAAGLTNDHL +C RH  C+ +++
Subjt:  FSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHHHCSLISS

AT5G57970.1 DNA glycosylase superfamily protein5.0e-5955.38Show/hide
Query:  RRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDIN--RVRGVVDNA
        +RC+++TPNSDP Y+ +HDEEWGVPVHDDK LFELLVLS A     W +IL +RQ FR  F++FD   +   ++K++I   +     ++  ++R V++NA
Subjt:  RRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDIN--RVRGVVDNA

Query:  IRILEIKKEFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHHHC
         +IL++ +E+GSFDKYIW FV NK    +++   ++P KT K+E ISKD+VRRGFRSVGPTVV+SFMQAAG+TNDHLTSC R HHC
Subjt:  IRILEIKKEFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHHHC

AT5G57970.2 DNA glycosylase superfamily protein5.0e-5955.38Show/hide
Query:  RRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDIN--RVRGVVDNA
        +RC+++TPNSDP Y+ +HDEEWGVPVHDDK LFELLVLS A     W +IL +RQ FR  F++FD   +   ++K++I   +     ++  ++R V++NA
Subjt:  RRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDIN--RVRGVVDNA

Query:  IRILEIKKEFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHHHC
         +IL++ +E+GSFDKYIW FV NK    +++   ++P KT K+E ISKD+VRRGFRSVGPTVV+SFMQAAG+TNDHLTSC R HHC
Subjt:  IRILEIKKEFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHHHC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTCGTTCCGACCAAGCCTTGGAAGCCGCTTCTGTCGTCGTCGTCGTCGACAACTCCAAATTCACCCCCCGCCACGTCCTCCAACCCACCGGAAACCGCGTCATCGA
CCGCCGTGGTTCCCTCAAAAAACCCCCCTCCGCCGCCGCCGTCGCCGCCGTACCCGCCGGCCCGATTTCCCCGAAATCAAAATCCCCGCGGCCGCCGGCCACCAAGCGCG
GTAATGACGCCAACCCCATGAGCTCCAGCTCCGACAAGGTCCTCATTCCTGCCGCCGCCCGCCCCCGGCCGGCCCTGGAGAGGAAGAAATCAAAAAGCTTCAAACTTGGA
GGGAATGGGAACGTGATTAGTGATAACGCCGTTGAGGTCACGTCTTTGAGGTACGCGTCTTCTCTAATCACTGACTCGCCGGGAAGTATCGCCGCCGTGAGAAGAGAACA
GGTGGCGCTGCAACAGGCGCAGAGGAAAATGAGAATCGCTCATTATGGAAGGTCTAAATCTGCCAGATTTGATCAAATTGTTCCAATTCATTCCAAAATTAAACCCGCCG
TTGAAGATCGAAGATGCAGCTTCATCACCCCCAATTCAGATCCAATTTATGTTGCTTATCATGATGAAGAATGGGGCGTGCCTGTTCATGATGACAAAATGCTGTTTGAA
TTGCTGGTTCTGAGTGTTGCTCAAGTGGGTTCGGATTGGACTTCCATTTTGAAGAGACGTCAAGATTTCAGAAACGCATTTTCAAATTTCGATTCGGAAATTGTGGCAAA
TTTTTCGGACAAACAGATGATTTCAATCAGCACAGAATATGGAATCGACATTAACAGAGTCCGAGGAGTCGTTGACAACGCAATCCGAATTCTTGAGATTAAGAAGGAAT
TTGGGTCATTCGACAAATACATTTGGGGATTTGTGAACAACAAGCAATTTTCACCGCAGTACAAATCGGGCCACAGAATTCCGGTCAAGACATCAAAATCAGAGACCATA
AGCAAAGACATGGTTCGACGAGGATTCCGGTCGGTCGGTCCAACCGTGGTTCATTCCTTTATGCAAGCCGCCGGTCTGACCAACGACCATCTCACCAGCTGTCACAGACA
CCACCACTGCTCACTAATTTCCTCCGGCCGCCGTGCTCCGGCGGAGGTGGCGGCGGAAGTGGAGGAGAAGGCGGCGGCTGATAACCTCTAG
mRNA sequenceShow/hide mRNA sequence
CCCACTAAACTCCCCATTTCTCAATTCCTCTTTAATTTTTAAAAATTAAATTCCCAAAAAACCAAATCACAAAAACGATGTGTCGTTCCGACCAAGCCTTGGAAGCCGCT
TCTGTCGTCGTCGTCGTCGACAACTCCAAATTCACCCCCCGCCACGTCCTCCAACCCACCGGAAACCGCGTCATCGACCGCCGTGGTTCCCTCAAAAAACCCCCCTCCGC
CGCCGCCGTCGCCGCCGTACCCGCCGGCCCGATTTCCCCGAAATCAAAATCCCCGCGGCCGCCGGCCACCAAGCGCGGTAATGACGCCAACCCCATGAGCTCCAGCTCCG
ACAAGGTCCTCATTCCTGCCGCCGCCCGCCCCCGGCCGGCCCTGGAGAGGAAGAAATCAAAAAGCTTCAAACTTGGAGGGAATGGGAACGTGATTAGTGATAACGCCGTT
GAGGTCACGTCTTTGAGGTACGCGTCTTCTCTAATCACTGACTCGCCGGGAAGTATCGCCGCCGTGAGAAGAGAACAGGTGGCGCTGCAACAGGCGCAGAGGAAAATGAG
AATCGCTCATTATGGAAGGTCTAAATCTGCCAGATTTGATCAAATTGTTCCAATTCATTCCAAAATTAAACCCGCCGTTGAAGATCGAAGATGCAGCTTCATCACCCCCA
ATTCAGATCCAATTTATGTTGCTTATCATGATGAAGAATGGGGCGTGCCTGTTCATGATGACAAAATGCTGTTTGAATTGCTGGTTCTGAGTGTTGCTCAAGTGGGTTCG
GATTGGACTTCCATTTTGAAGAGACGTCAAGATTTCAGAAACGCATTTTCAAATTTCGATTCGGAAATTGTGGCAAATTTTTCGGACAAACAGATGATTTCAATCAGCAC
AGAATATGGAATCGACATTAACAGAGTCCGAGGAGTCGTTGACAACGCAATCCGAATTCTTGAGATTAAGAAGGAATTTGGGTCATTCGACAAATACATTTGGGGATTTG
TGAACAACAAGCAATTTTCACCGCAGTACAAATCGGGCCACAGAATTCCGGTCAAGACATCAAAATCAGAGACCATAAGCAAAGACATGGTTCGACGAGGATTCCGGTCG
GTCGGTCCAACCGTGGTTCATTCCTTTATGCAAGCCGCCGGTCTGACCAACGACCATCTCACCAGCTGTCACAGACACCACCACTGCTCACTAATTTCCTCCGGCCGCCG
TGCTCCGGCGGAGGTGGCGGCGGAAGTGGAGGAGAAGGCGGCGGCTGATAACCTCTAGAATTGACTTAACAGACAAAACGAAAAATGGTTTGTTTGCTAATTAACTAGAT
AACTGTTTTTGTTTTTTTTTTTTTTTTGGGTGTGGGGTTTGTGTATATTAATGTCTATATATAAAAATAGACTTGCAAAAGAGAAACAAAAAAAGTGGGGAAAAAAATAT
GGGGTTGTGAATTTGTGTGAAAGTTTTTTTTTTTTTTTTTTTTTTTGTGGGAATTTTAGTGAAAAGAGAAAATTATTGAGGTTTGAAGTGAAGTGAAGTGGTAGGGCTAG
AATAGGAGGCAGACAGCATGTGCAATTGGGAGGCCTTGGCATGTGAGTGTGTCAGTTTGCTTTTGTAAATTCCCATGTGATCCATC
Protein sequenceShow/hide protein sequence
MCRSDQALEAASVVVVVDNSKFTPRHVLQPTGNRVIDRRGSLKKPPSAAAVAAVPAGPISPKSKSPRPPATKRGNDANPMSSSSDKVLIPAAARPRPALERKKSKSFKLG
GNGNVISDNAVEVTSLRYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDQIVPIHSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFE
LLVLSVAQVGSDWTSILKRRQDFRNAFSNFDSEIVANFSDKQMISISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWGFVNNKQFSPQYKSGHRIPVKTSKSETI
SKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHHHCSLISSGRRAPAEVAAEVEEKAAADNL