; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC02g0655 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC02g0655
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionDNA glycosylase superfamily protein
Genome locationMC02:5302299..5304775
RNA-Seq ExpressionMC02g0655
SyntenyMC02g0655
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domainsIPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054725.1 putative GMP synthase [Cucumis melo var. makuwa]8.87e-21081.98Show/hide
Query:  MCRSEQVMEATSVVAVG----RAVLQPTCNR-LHRRNSLKKQPPSPSPPLSPPSPA------SPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAA-RP
        MCRSE+ +EATSVV       R VLQPTCNR L RRNSLKKQ PS    L PPSPA      SPKSKSPRPPATKRAND    MNSSS+K+++PAAA RP
Subjt:  MCRSEQVMEATSVVAVG----RAVLQPTCNR-LHRRNSLKKQPPSPSPPLSPPSPA------SPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAA-RP

Query:  RA-LDRKKSKSFKLGGSG---ADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSD
        RA LDRKKSKSFKLGG+G    D     ++YASSLITESPGSIAAVRREQVALQQAQRKM+IAHYGRSKSARFEKIVP+DSK KP+VEDRRCSFITPNSD
Subjt:  RA-LDRKKSKSFKLGGSG---ADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSD

Query:  PIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSF
        PIYVAYHDEEWGVPVH+DK+LFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFD+E VANFS+KQMVSISTEYGIDINRVRGVVDN+IRIL+IKKEFGSF
Subjt:  PIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSF

Query:  DKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRCTLLAAGRRAP-PAVEVEETSE
        DKYIWGFVN+KPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHL CTL+AAGRR P P     E  E
Subjt:  DKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRCTLLAAGRRAP-PAVEVEETSE

XP_004139917.2 uncharacterized protein LOC101218536 [Cucumis sativus]2.36e-20982.56Show/hide
Query:  MCRSEQVMEATSVVAVG----RAVLQPTCNR-LHRRNSLKKQPPSPSPPLSPP-SPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAA-ARPRA-LD
        MCRSE+ +EATSVV       R VLQPT NR L RRNSLKKQ PS  PP +   SP SPKSKSPRPPATKRAND    MNSSS+K+++PAA +RPRA LD
Subjt:  MCRSEQVMEATSVVAVG----RAVLQPTCNR-LHRRNSLKKQPPSPSPPLSPP-SPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAA-ARPRA-LD

Query:  RKKSKSFKLGGSG---ADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYVA
        RKKSKSFKLGG+G    D     ++YASSLITESPGSIAAVRREQVALQQAQRKM+IAHYGRSKSARFEKIVP+DSK KPAVEDRRCSFITPNSDPIYVA
Subjt:  RKKSKSFKLGGSG---ADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYVA

Query:  YHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIW
        YHDEEWGVPVH+DK+LFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFD+E VANFSDKQMVSISTEYGIDINRVRGVVDNAIRIL+IKKEFGSFDKYIW
Subjt:  YHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIW

Query:  GFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRCTLLAAGRRAP-PAVEVEETSET
        GFVN+KPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHL CTL+AAGRR P P     E  +T
Subjt:  GFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRCTLLAAGRRAP-PAVEVEETSET

XP_022149598.1 uncharacterized protein LOC111017989 [Momordica charantia]2.51e-264100Show/hide
Query:  MCRSEQVMEATSVVAVGRAVLQPTCNRLHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAARPRALDRKKSKSFK
        MCRSEQVMEATSVVAVGRAVLQPTCNRLHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAARPRALDRKKSKSFK
Subjt:  MCRSEQVMEATSVVAVGRAVLQPTCNRLHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAARPRALDRKKSKSFK

Query:  LGGSGADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVH
        LGGSGADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVH
Subjt:  LGGSGADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVH

Query:  EDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQ
        EDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQ
Subjt:  EDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQ

Query:  YKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRCTLLAAGRRAPPAVEVEETSETL
        YKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRCTLLAAGRRAPPAVEVEETSETL
Subjt:  YKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRCTLLAAGRRAPPAVEVEETSETL

XP_022943791.1 uncharacterized protein LOC111448434 [Cucurbita moschata]1.30e-20581.3Show/hide
Query:  MCRSEQVMEATSVVA----VGRAVLQPTCNR-LHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAA--RPRA-LD
        MCRSEQ +EATSVV       R VLQPTCNR L RRNSLKK      PP +  SP SPKSKSPRPPATKRAND    MNSSSDK+++PAAA  RP+A LD
Subjt:  MCRSEQVMEATSVVA----VGRAVLQPTCNR-LHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAA--RPRA-LD

Query:  RKKSKSFKLGGSG----ADEAA-------PSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITP
        RKKSKSFKL G+G     D  A        SLSYASSLIT+SPGSIAAVRREQVALQQAQRKM+IAHYGRSKSARF+K+VP+DSK KPAVEDRRCSFITP
Subjt:  RKKSKSFKLGGSG----ADEAA-------PSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITP

Query:  NSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEF
        NSDPIYVAYHDEEWGVPVH+DK+LFELLVLSVAQVGSDWTSILKKRQDFRNAFSSF AETVA FSDKQM+SIS+EYGIDINRVRGVVDNAIRILEIKKEF
Subjt:  NSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEF

Query:  GSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRCTLLAAGRRAPPAVEVEET---SET
        GSFDKYIWGFVN+KPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHL C++ AA RRAP AV VEET   SET
Subjt:  GSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRCTLLAAGRRAPPAVEVEET---SET

Query:  L
        L
Subjt:  L

XP_038902889.1 uncharacterized protein LOC120089476 [Benincasa hispida]7.27e-20780.3Show/hide
Query:  MCRSEQVMEATSVVAVG----RAVLQPTCNR-LHRRNSLKKQPP--SPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAA-------A
        MCRSE+ +EA++VV       R VLQPTCNR L RRNSLKKQP    PS  ++  SP SPKSKSPRPPATKRAND    MNSSSDK+++PAA       +
Subjt:  MCRSEQVMEATSVVAVG----RAVLQPTCNR-LHRRNSLKKQPP--SPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAA-------A

Query:  RPRA-LDRKKSKSFKLGGSG--------ADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCS
        RPRA LDRKKSKSFKLGG+G          E AP LSYASSLITESPGSIAAVRREQVALQQAQRKM+IAHYGRSKSARFEKIVP+DSK KPAVEDRRCS
Subjt:  RPRA-LDRKKSKSFKLGGSG--------ADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCS

Query:  FITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEI
        FITPNSDPIYVAYHDEEWGVPVH+DK+LFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFD+E VA FSDKQMVSIS+EYGIDINRVRGVVDNAIRIL+I
Subjt:  FITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEI

Query:  KKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRCTLLAAGRRAPPAVEVEETSE
        KKEFGSFDKYIWGFVN+KPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHL CTL+AAGRR        E  E
Subjt:  KKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRCTLLAAGRRAPPAVEVEETSE

Query:  T
        T
Subjt:  T

TrEMBL top hitse value%identityAlignment
A0A0A0KED6 Uncharacterized protein1.14e-20982.56Show/hide
Query:  MCRSEQVMEATSVVAVG----RAVLQPTCNR-LHRRNSLKKQPPSPSPPLSPP-SPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAA-ARPRA-LD
        MCRSE+ +EATSVV       R VLQPT NR L RRNSLKKQ PS  PP +   SP SPKSKSPRPPATKRAND    MNSSS+K+++PAA +RPRA LD
Subjt:  MCRSEQVMEATSVVAVG----RAVLQPTCNR-LHRRNSLKKQPPSPSPPLSPP-SPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAA-ARPRA-LD

Query:  RKKSKSFKLGGSG---ADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYVA
        RKKSKSFKLGG+G    D     ++YASSLITESPGSIAAVRREQVALQQAQRKM+IAHYGRSKSARFEKIVP+DSK KPAVEDRRCSFITPNSDPIYVA
Subjt:  RKKSKSFKLGGSG---ADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYVA

Query:  YHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIW
        YHDEEWGVPVH+DK+LFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFD+E VANFSDKQMVSISTEYGIDINRVRGVVDNAIRIL+IKKEFGSFDKYIW
Subjt:  YHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIW

Query:  GFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRCTLLAAGRRAP-PAVEVEETSET
        GFVN+KPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHL CTL+AAGRR P P     E  +T
Subjt:  GFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRCTLLAAGRRAP-PAVEVEETSET

A0A5A7UM21 Putative GMP synthase4.30e-21081.98Show/hide
Query:  MCRSEQVMEATSVVAVG----RAVLQPTCNR-LHRRNSLKKQPPSPSPPLSPPSPA------SPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAA-RP
        MCRSE+ +EATSVV       R VLQPTCNR L RRNSLKKQ PS    L PPSPA      SPKSKSPRPPATKRAND    MNSSS+K+++PAAA RP
Subjt:  MCRSEQVMEATSVVAVG----RAVLQPTCNR-LHRRNSLKKQPPSPSPPLSPPSPA------SPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAA-RP

Query:  RA-LDRKKSKSFKLGGSG---ADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSD
        RA LDRKKSKSFKLGG+G    D     ++YASSLITESPGSIAAVRREQVALQQAQRKM+IAHYGRSKSARFEKIVP+DSK KP+VEDRRCSFITPNSD
Subjt:  RA-LDRKKSKSFKLGGSG---ADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSD

Query:  PIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSF
        PIYVAYHDEEWGVPVH+DK+LFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFD+E VANFS+KQMVSISTEYGIDINRVRGVVDN+IRIL+IKKEFGSF
Subjt:  PIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSF

Query:  DKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRCTLLAAGRRAP-PAVEVEETSE
        DKYIWGFVN+KPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHL CTL+AAGRR P P     E  E
Subjt:  DKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRCTLLAAGRRAP-PAVEVEETSE

A0A6J1D778 uncharacterized protein LOC1110179891.21e-264100Show/hide
Query:  MCRSEQVMEATSVVAVGRAVLQPTCNRLHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAARPRALDRKKSKSFK
        MCRSEQVMEATSVVAVGRAVLQPTCNRLHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAARPRALDRKKSKSFK
Subjt:  MCRSEQVMEATSVVAVGRAVLQPTCNRLHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAARPRALDRKKSKSFK

Query:  LGGSGADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVH
        LGGSGADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVH
Subjt:  LGGSGADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVH

Query:  EDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQ
        EDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQ
Subjt:  EDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQ

Query:  YKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRCTLLAAGRRAPPAVEVEETSETL
        YKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRCTLLAAGRRAPPAVEVEETSETL
Subjt:  YKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRCTLLAAGRRAPPAVEVEETSETL

A0A6J1FSP1 uncharacterized protein LOC1114484346.30e-20681.3Show/hide
Query:  MCRSEQVMEATSVVA----VGRAVLQPTCNR-LHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAA--RPRA-LD
        MCRSEQ +EATSVV       R VLQPTCNR L RRNSLKK      PP +  SP SPKSKSPRPPATKRAND    MNSSSDK+++PAAA  RP+A LD
Subjt:  MCRSEQVMEATSVVA----VGRAVLQPTCNR-LHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAA--RPRA-LD

Query:  RKKSKSFKLGGSG----ADEAA-------PSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITP
        RKKSKSFKL G+G     D  A        SLSYASSLIT+SPGSIAAVRREQVALQQAQRKM+IAHYGRSKSARF+K+VP+DSK KPAVEDRRCSFITP
Subjt:  RKKSKSFKLGGSG----ADEAA-------PSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITP

Query:  NSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEF
        NSDPIYVAYHDEEWGVPVH+DK+LFELLVLSVAQVGSDWTSILKKRQDFRNAFSSF AETVA FSDKQM+SIS+EYGIDINRVRGVVDNAIRILEIKKEF
Subjt:  NSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEF

Query:  GSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRCTLLAAGRRAPPAVEVEET---SET
        GSFDKYIWGFVN+KPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHL C++ AA RRAP AV VEET   SET
Subjt:  GSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRCTLLAAGRRAPPAVEVEET---SET

Query:  L
        L
Subjt:  L

A0A6J1J7H3 uncharacterized protein LOC1114841732.98e-20481.01Show/hide
Query:  MCRSEQVMEATSVVA----VGRAVLQPTCNR-LHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAA--RPRA-LD
        MCRSEQ +EATSVV       R VLQPTCNR L RRNSLKK      PP +  SP SPKSKSPRPPATKRAN+    MNSSSDK+++PAAA  RP+A LD
Subjt:  MCRSEQVMEATSVVA----VGRAVLQPTCNR-LHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAA--RPRA-LD

Query:  RKKSKSFKLGGSG----ADEAA-------PSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITP
        RKKSKSFKL G+G     D  A        SLSYASSLIT+SPGSIAAVRREQVALQQAQRKM+IAHYGRSKSARF+K+VP+DSK KPAVE RRCSFITP
Subjt:  RKKSKSFKLGGSG----ADEAA-------PSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITP

Query:  NSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEF
        NSDPIYVAYHDEEWGVPVH+DK+LFELLVLSVAQVGSDWTSILKKRQDFRNAFSSF AETVA FSDKQM+SIS+EYGIDINRVRGVVDNAIRILEIKKEF
Subjt:  NSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEF

Query:  GSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRCTLLAAGRRAPPAVEVEETS
        GSFDKYIWGFVN+KPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQ AGLTNDHLTSCHRHL C++ AAGRRAP AV VEET+
Subjt:  GSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRCTLLAAGRRAPPAVEVEETS

SwissProt top hitse value%identityAlignment
P05100 DNA-3-methyladenine glycosylase 11.4e-3439.11Show/hide
Query:  RCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINR--VRGVVDNAI
        RC ++  + DP+Y+AYHD EWGVP  + K LFE++ L   Q G  W ++LKKR+++R  F  FD   VA   ++ +  +  + GI  +R  ++ ++ NA 
Subjt:  RCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINR--VRGVVDNAI

Query:  RILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSC
          L++++    F  ++W FVNH+P   Q  +  +IP  TS S+ +SK + +RGF+ VG T+ +SFMQA GL NDH+  C
Subjt:  RILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSC

P44321 DNA-3-methyladenine glycosylase5.0e-2936.31Show/hide
Query:  RCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVR--GVVDNAI
        RC ++   S  IY+ YHD+EWG P  + + LFE + L   Q G  W ++LKKR+ +R AF  FD + +A  +   + +     G+  +R +   +V NA 
Subjt:  RCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVR--GVVDNAI

Query:  RILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSC
          L ++K   +F  +IW FVNHKP          +P KT  S+ +SK + +RGF  +G T  ++FMQ+ GL +DHL  C
Subjt:  RILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing]4.8e-4044.39Show/hide
Query:  EDRRCSFITPNSD---PIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINR--VRG
        E  RC++ T   +    +Y  YHD EWG P+HEDK LFE LVL   Q G  W +ILKKR+ FR AF  FD   VAN+ + ++  +    GI  NR  +  
Subjt:  EDRRCSFITPNSD---PIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINR--VRG

Query:  VVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHR
         + NA   + +++EFGSFDKYIWGFV  KP    ++S   +P  T  S+ I+KD+ +RGF+ VG T +++ MQ+ G+ NDHLTSC +
Subjt:  VVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHR

Arabidopsis top hitse value%identityAlignment
AT1G75090.1 DNA glycosylase superfamily protein3.4e-5752.66Show/hide
Query:  RRCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDIN--RVRGVVDNA
        +RC +ITPNSDPIYV +HDEEWGVPV +DK LFELLV S A     W SIL++R DFR  F  FD   +A F++K+++S+     + ++  ++R +V+NA
Subjt:  RRCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDIN--RVRGVVDNA

Query:  IRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRCTL
          +L++K+EFGSF  Y W FVNHKP    Y+ G ++PVK+ K+E ISKDM++RGFR VGPTV++SF+QA+G+ NDHLT+C R+  C +
Subjt:  IRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRCTL

AT1G80850.1 DNA glycosylase superfamily protein1.8e-5345.35Show/hide
Query:  SGADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVED--RRCSFITPNSDPIYVAYHDEEWGVPVHE
        +G    A   S ASS    SP S+ +    +  L+++           S S R       D K      D  +RC++ITP SD  Y+A+HDEEWGVPVH+
Subjt:  SGADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVED--RRCSFITPNSDPIYVAYHDEEWGVPVHE

Query:  DKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVS--ISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSP
        DK LFELL LS A     W  IL KRQ FR  F  FD   ++  ++K++ S  I+    +   ++R +++NA ++ +I   FGSFDKYIW FVN KP   
Subjt:  DKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVS--ISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSP

Query:  QYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRC
        Q++   ++PVKTSK+E ISKD+VRRGFRSV PTV++SFMQ AGLTNDHLT C RH  C
Subjt:  QYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRC

AT3G12710.1 DNA glycosylase superfamily protein8.9e-9862.38Show/hide
Query:  SKSPRPPATKRANDAATAMNS--------SSDKLVLPAAARPR-ALDRKKSKSFKLGGSGADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMK
        SK+     TKR     ++ NS          D ++   AA+ R +L+RKKSKSFK G           SY+S LITE+PGSIAAVRREQVA QQA RK+K
Subjt:  SKSPRPPATKRANDAATAMNS--------SSDKLVLPAAARPR-ALDRKKSKSFKLGGSGADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMK

Query:  IAHYGRSKSA---RFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETV
        IAHYGRSKS       K+VP+     P    +RCSF+TP SDPIYVAYHDEEWGVPVH+DK LFELL LS AQVGSDWTS L+KR D+R AF  F+AE V
Subjt:  IAHYGRSKSA---RFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETV

Query:  ANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAG
        A  ++K+M +IS EY I++++VRGVV+NA +I+EIKK F S +KY+WGFVNHKP S  YK GHKIPVKTSKSE+ISKDMVRRGFR VGPTVVHSFMQAAG
Subjt:  ANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAG

Query:  LTNDHLTSCHRHLRCTLLA
        LTNDHL +C RH  CTLLA
Subjt:  LTNDHLTSCHRHLRCTLLA

AT5G44680.1 DNA glycosylase superfamily protein1.0e-9053.24Show/hide
Query:  GRAVLQPTCNR---LHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAARPRALDRKKSKSFKL-----GGSGADE
        GR VLQP  N+   L RRNSLKK PP P  P++   P      SPRP +       +  ++ ++  L  PA +    L    +KS  +        G  E
Subjt:  GRAVLQPTCNR---LHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAARPRALDRKKSKSFKL-----GGSGADE

Query:  AAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARF-EKIVPIDSKTKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFE
          P +     ++ + PGSIAA RRE+VA++Q +RK KI+HYGR KS +  EK + ++ + K     +RCSFIT +SDPIYVAYHD+EWGVPVH+D +LFE
Subjt:  AAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARF-EKIVPIDSKTKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFE

Query:  LLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKI
        LLVL+ AQVGSDWTS+LK+R  FR AFS F+AE VA+F++K++ SI  +YGI++++V  VVDNA +IL++K++ GSF+KYIWGF+ HKP + +Y S  KI
Subjt:  LLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKI

Query:  PVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRCTLLAA
        PVKTSKSETISKDMVRRGFR VGPTV+HS MQAAGLTNDHL +C RHL CT +AA
Subjt:  PVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRCTLLAA

AT5G57970.1 DNA glycosylase superfamily protein2.8e-5136.87Show/hide
Query:  RNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATA-------MNSSSDKLVLPAAARPRALDRKKSKSFKLGGSGADEAAPSLSYASSLITES
        R++L       SP ++  + +    K  R  + +  +D  T+       ++SSS K  L AA+  R  ++  + +  L  S + +A+    ++ +    S
Subjt:  RNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATA-------MNSSSDKLVLPAAARPRALDRKKSKSFKLGGSGADEAAPSLSYASSLITES

Query:  PGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIV---PIDSKTKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDW
         G +              R   +    +S  ++   +V    +DS    +   +RC+++TPNSDP Y+ +HDEEWGVPVH+DK LFELLVLS A     W
Subjt:  PGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIV---PIDSKTKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDW

Query:  TSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDIN--RVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETIS
         +IL KRQ FR  F+ FD   +   ++K+++   +     ++  ++R V++NA +IL++ +E+GSFDKYIW FV +K    +++   ++P KT K+E IS
Subjt:  TSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDIN--RVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETIS

Query:  KDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRC
        KD+VRRGFRSVGPTVV+SFMQAAG+TNDHLTSC R   C
Subjt:  KDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLRC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGCCGTTCGGAGCAGGTCATGGAGGCCACTTCTGTTGTTGCAGTTGGAAGAGCCGTCCTCCAACCCACCTGCAACCGTCTCCACCGCCGTAATTCCCTCAAAAAACA
ACCCCCATCTCCCTCTCCGCCTCTCTCTCCGCCGTCCCCCGCCTCTCCCAAGTCCAAGTCCCCCCGCCCCCCGGCCACCAAGCGGGCCAATGACGCCGCTACTGCCATGA
ACTCCAGCTCCGACAAGCTCGTTCTTCCCGCCGCCGCTCGACCCAGGGCTCTCGATAGGAAGAAATCCAAGAGCTTCAAATTGGGCGGGAGTGGGGCCGATGAGGCGGCG
CCGTCTTTGAGCTACGCTTCGTCTCTGATCACTGAGTCGCCGGGGAGTATCGCCGCCGTGAGGAGGGAGCAGGTGGCGCTGCAGCAGGCGCAGAGGAAGATGAAGATTGC
CCATTATGGAAGATCTAAATCTGCACGGTTTGAAAAAATTGTTCCTATTGATTCTAAAACTAAACCCGCTGTCGAAGATCGAAGATGCAGCTTCATCACACCTAATTCAG
ATCCCATCTATGTTGCTTACCATGATGAAGAATGGGGTGTCCCTGTTCATGAGGACAAGGTGCTGTTTGAATTGCTGGTTCTGAGCGTAGCCCAGGTGGGTTCAGATTGG
ACTTCAATATTGAAGAAACGCCAAGATTTCAGAAACGCATTTTCAAGCTTCGATGCAGAAACTGTGGCTAATTTTTCCGACAAACAGATGGTTTCCATCAGCACGGAATA
TGGCATCGACATTAACCGAGTCCGAGGAGTTGTCGACAACGCAATCCGAATCCTCGAGATTAAGAAGGAATTTGGGTCATTCGATAAATACATTTGGGGATTTGTGAACC
ACAAGCCCTTCTCGCCGCAGTACAAGTCCGGCCACAAAATTCCGGTGAAGACATCCAAATCGGAGACCATAAGCAAAGACATGGTCCGACGAGGTTTCCGGTCGGTCGGA
CCGACGGTGGTCCACTCGTTCATGCAAGCCGCCGGCCTGACCAACGACCATTTGACCAGCTGCCACAGGCACCTCCGCTGCACGCTACTCGCCGCCGGCCGCCGCGCTCC
GCCGGCCGTGGAAGTGGAGGAGACTTCCGAAACTCTCTAG
mRNA sequenceShow/hide mRNA sequence
CGAGAACTAAGACGAAAGAGTAGAGGCTAAAAGGAGAAAAACTAGAGATGGTGAAAAGTTAAGAAGATTAAGAGGAAGAAAAGAAAAAGGGAAGAAGAATAATTAAGTAG
AGGGAGAGGGTGTCATGATCATCCTCGTGGAGTAGTTCCCAAAAAGCAATGCCCCTCACTTGAGGCTGCCATAGCTATAGCCATAGCCAGAACCAGAGCTCTTCCTCTGT
CCCCACTCCCCTTTGATAAAACCCTCACCTCTCCCCTTCCTCTCTCTTCACCACTCCCCCAATTTCTCATTTCTTCTCTCTCTCTCTCTACAAAAATGTGCCGTTCGGAG
CAGGTCATGGAGGCCACTTCTGTTGTTGCAGTTGGAAGAGCCGTCCTCCAACCCACCTGCAACCGTCTCCACCGCCGTAATTCCCTCAAAAAACAACCCCCATCTCCCTC
TCCGCCTCTCTCTCCGCCGTCCCCCGCCTCTCCCAAGTCCAAGTCCCCCCGCCCCCCGGCCACCAAGCGGGCCAATGACGCCGCTACTGCCATGAACTCCAGCTCCGACA
AGCTCGTTCTTCCCGCCGCCGCTCGACCCAGGGCTCTCGATAGGAAGAAATCCAAGAGCTTCAAATTGGGCGGGAGTGGGGCCGATGAGGCGGCGCCGTCTTTGAGCTAC
GCTTCGTCTCTGATCACTGAGTCGCCGGGGAGTATCGCCGCCGTGAGGAGGGAGCAGGTGGCGCTGCAGCAGGCGCAGAGGAAGATGAAGATTGCCCATTATGGAAGATC
TAAATCTGCACGGTTTGAAAAAATTGTTCCTATTGATTCTAAAACTAAACCCGCTGTCGAAGATCGAAGATGCAGCTTCATCACACCTAATTCAGATCCCATCTATGTTG
CTTACCATGATGAAGAATGGGGTGTCCCTGTTCATGAGGACAAGGTGCTGTTTGAATTGCTGGTTCTGAGCGTAGCCCAGGTGGGTTCAGATTGGACTTCAATATTGAAG
AAACGCCAAGATTTCAGAAACGCATTTTCAAGCTTCGATGCAGAAACTGTGGCTAATTTTTCCGACAAACAGATGGTTTCCATCAGCACGGAATATGGCATCGACATTAA
CCGAGTCCGAGGAGTTGTCGACAACGCAATCCGAATCCTCGAGATTAAGAAGGAATTTGGGTCATTCGATAAATACATTTGGGGATTTGTGAACCACAAGCCCTTCTCGC
CGCAGTACAAGTCCGGCCACAAAATTCCGGTGAAGACATCCAAATCGGAGACCATAAGCAAAGACATGGTCCGACGAGGTTTCCGGTCGGTCGGACCGACGGTGGTCCAC
TCGTTCATGCAAGCCGCCGGCCTGACCAACGACCATTTGACCAGCTGCCACAGGCACCTCCGCTGCACGCTACTCGCCGCCGGCCGCCGCGCTCCGCCGGCCGTGGAAGT
GGAGGAGACTTCCGAAACTCTCTAGAATTTTCTCGACAATTTAATTAACAGACAAAAGAGAAAAGAAAAATGGTAACCTTTACGAGGAGTCAATCAATGATTTGTTTGCT
AATTAACTAGATAGCTGTTTTTTTTTCTTCCCTTTTTCAGATTTGTGGGGTTTGTGTATATTAATGTCTATATTAATAGGCTTGTAAGAGAAACCAAAAAAAAAAAAGTG
AGAACAGATTGTGGGGTTGTGAAATTGTGTGAAAGTTTAAGACTTTTTGGGTTTTTTTTGGTGGGGATTTTTCATTTTAGTAAGAGTGTTTGGATGACTAGAAAAGAAAA
GAAAAGAAAATAATTGGGAGGGGATTGTGGTGGTAGGGCTAGTGTAGAAGACAGACAGCATGTGCTTGTGCAATTGGGAGGCAATGGCATGTGAGTGTGTCAGTTTGCTT
TTGTAAATTCCCATGTGATCCATCAAAATTTCAACATTATTAATCAATATTATATTATTTCTAATTTCTATTTTGGATTTCTTTCTCTTGGGGCTGGCTTAACTTTGCTT
GCAATGCAATGCAATTCTCC
Protein sequenceShow/hide protein sequence
MCRSEQVMEATSVVAVGRAVLQPTCNRLHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAARPRALDRKKSKSFKLGGSGADEAA
PSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDW
TSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVG
PTVVHSFMQAAGLTNDHLTSCHRHLRCTLLAAGRRAPPAVEVEETSETL