CuGenDBv2

MS014143 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS014143
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: DNA glycosylase superfamily protein
Genome location: scaffold5:930235..931827
RNA-Seq Expression: MS014143
Synteny: MS014143
Gene Ontology terms:
GO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domains:
IPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase
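
The genome location field uses a `scaffold:start..end` notation. The following is a minimal sketch of how such a string could be split into its parts, assuming the coordinates are 1-based and inclusive (the record itself does not state this); `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re

def parse_location(loc: str):
    """Split 'seqid:start..end' into (seqid, start, end, span_length).

    Assumes 1-based, inclusive coordinates, so span = end - start + 1.
    """
    m = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    seqid, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return seqid, start, end, end - start + 1

print(parse_location("scaffold5:930235..931827"))
# ('scaffold5', 930235, 931827, 1593)
```

Under that assumption the gene spans 1593 bp on scaffold5.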


Homology
GenBank top hits (e value, % identity, alignment)
KAA0054725.1 putative GMP synthase [Cucumis melo var. makuwa] (e value 8.4e-167, 81.77% identity)
Query:  MCRSEQVMEATSVVA----VGRPVLQPTCNR-LHRRNSLKKQPPS--PSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAA-RPRA-L
        MCRSE+ +EATSVV       RPVLQPTCNR L RRNSLKKQ PS  P  P +  SP SPKSKSPRPPATKRAND    MNSSS+K+++PAAA RPRA L
Subjt:  MCRSEQVMEATSVVA----VGRPVLQPTCNR-LHRRNSLKKQPPS--PSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAA-RPRA-L

Query:  DRKKSKSFKLGGSG---ADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYV
        DRKKSKSFKLGG+G    D     ++YASSLITESPGSIAAVRREQVALQQAQRKM+IAHYGRSKSARFEKIVP+DSK KP+VEDRRCSFITPNSDPIYV
Subjt:  DRKKSKSFKLGGSG---ADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYV

Query:  AYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYI
        AYHDEEWGVPVH+DK+LFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFD+E VANFS+KQMVSISTEYGIDINRVRGVVDN+IRIL+IKKEFGSFDKYI
Subjt:  AYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYI

Query:  WAFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLLAAGRRAPPAV----EVEETSGTL
        W FVN+KPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHLHCTL+AAGRR P       EVEE +  +
Subjt:  WAFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLLAAGRRAPPAV----EVEETSGTL

XP_004139917.2 uncharacterized protein LOC101218536 [Cucumis sativus] (e value 1.9e-166, 82.86% identity)
Query:  MCRSEQVMEATSVVA----VGRPVLQPTCNR-LHRRNSLKKQPPSPSPP-LSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAA-ARPRA-LD
        MCRSE+ +EATSVV       RPVLQPT NR L RRNSLKKQ PS  PP  +  SP SPKSKSPRPPATKRAND    MNSSS+K+++PAA +RPRA LD
Subjt:  MCRSEQVMEATSVVA----VGRPVLQPTCNR-LHRRNSLKKQPPSPSPP-LSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAA-ARPRA-LD

Query:  RKKSKSFKLGGSG---ADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYVA
        RKKSKSFKLGG+G    D     ++YASSLITESPGSIAAVRREQVALQQAQRKM+IAHYGRSKSARFEKIVP+DSK KPAVEDRRCSFITPNSDPIYVA
Subjt:  RKKSKSFKLGGSG---ADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYVA

Query:  YHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIW
        YHDEEWGVPVH+DK+LFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFD+E VANFSDKQMVSISTEYGIDINRVRGVVDNAIRIL+IKKEFGSFDKYIW
Subjt:  YHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIW

Query:  AFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLLAAGRRAPPAV----EVEETS
         FVN+KPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHLHCTL+AAGRR P       EVE+T+
Subjt:  AFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLLAAGRRAPPAV----EVEETS

XP_022149598.1 uncharacterized protein LOC111017989 [Momordica charantia] (e value 3.0e-204, 98.94% identity)
Query:  MCRSEQVMEATSVVAVGRPVLQPTCNRLHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAARPRALDRKKSKSFK
        MCRSEQVMEATSVVAVGR VLQPTCNRLHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAARPRALDRKKSKSFK
Subjt:  MCRSEQVMEATSVVAVGRPVLQPTCNRLHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAARPRALDRKKSKSFK

Query:  LGGSGADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVH
        LGGSGADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVH
Subjt:  LGGSGADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVH

Query:  EDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWAFVNHKPFSPQ
        EDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIW FVNHKPFSPQ
Subjt:  EDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWAFVNHKPFSPQ

Query:  YKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLLAAGRRAPPAVEVEETSGTL
        YKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHL CTLLAAGRRAPPAVEVEETS TL
Subjt:  YKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLLAAGRRAPPAVEVEETSGTL

XP_022943791.1 uncharacterized protein LOC111448434 [Cucurbita moschata] (e value 1.9e-163, 81.77% identity)
Query:  MCRSEQVMEATSVVA----VGRPVLQPTCNR-LHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAA--RPR-ALD
        MCRSEQ +EATSVV       RPVLQPTCNR L RRNSLKK      PP +  SP SPKSKSPRPPATKRAND    MNSSSDK+++PAAA  RP+ ALD
Subjt:  MCRSEQVMEATSVVA----VGRPVLQPTCNR-LHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAA--RPR-ALD

Query:  RKKSKSFKLGGSG----ADEAA-------PSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITP
        RKKSKSFKL G+G     D  A        SLSYASSLIT+SPGSIAAVRREQVALQQAQRKM+IAHYGRSKSARF+K+VP+DSK KPAVEDRRCSFITP
Subjt:  RKKSKSFKLGGSG----ADEAA-------PSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITP

Query:  NSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEF
        NSDPIYVAYHDEEWGVPVH+DK+LFELLVLSVAQVGSDWTSILKKRQDFRNAFSSF AETVA FSDKQM+SIS+EYGIDINRVRGVVDNAIRILEIKKEF
Subjt:  NSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEF

Query:  GSFDKYIWAFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLLAAGRRAPPAVEVEETS
        GSFDKYIW FVN+KPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHC++ AA RRA PAV VEET+
Subjt:  GSFDKYIWAFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLLAAGRRAPPAVEVEETS

XP_038902889.1 uncharacterized protein LOC120089476 [Benincasa hispida] (e value 1.8e-164, 80.85% identity)
Query:  MCRSEQVMEATSVVA----VGRPVLQPTCNR-LHRRNSLKKQP--PSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAA-------A
        MCRSE+ +EA++VV       RPVLQPTCNR L RRNSLKKQP    PS  ++  SP SPKSKSPRPPATKRAND    MNSSSDK+++PAA       +
Subjt:  MCRSEQVMEATSVVA----VGRPVLQPTCNR-LHRRNSLKKQP--PSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAA-------A

Query:  RPRA-LDRKKSKSFKLGG--------SGADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCS
        RPRA LDRKKSKSFKLGG        +G  E AP LSYASSLITESPGSIAAVRREQVALQQAQRKM+IAHYGRSKSARFEKIVP+DSK KPAVEDRRCS
Subjt:  RPRA-LDRKKSKSFKLGG--------SGADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCS

Query:  FITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEI
        FITPNSDPIYVAYHDEEWGVPVH+DK+LFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFD+E VA FSDKQMVSIS+EYGIDINRVRGVVDNAIRIL+I
Subjt:  FITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEI

Query:  KKEFGSFDKYIWAFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLLAAGRR---APPAVEVEE
        KKEFGSFDKYIW FVN+KPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHLHCTL+AAGRR        EVEE
Subjt:  KKEFGSFDKYIWAFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLLAAGRR---APPAVEVEE

Query:  TS
        T+
Subjt:  TS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KED6 Uncharacterized protein (e value 9.1e-167, 82.86% identity)
Query:  MCRSEQVMEATSVVA----VGRPVLQPTCNR-LHRRNSLKKQPPSPSPP-LSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAA-ARPRA-LD
        MCRSE+ +EATSVV       RPVLQPT NR L RRNSLKKQ PS  PP  +  SP SPKSKSPRPPATKRAND    MNSSS+K+++PAA +RPRA LD
Subjt:  MCRSEQVMEATSVVA----VGRPVLQPTCNR-LHRRNSLKKQPPSPSPP-LSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAA-ARPRA-LD

Query:  RKKSKSFKLGGSG---ADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYVA
        RKKSKSFKLGG+G    D     ++YASSLITESPGSIAAVRREQVALQQAQRKM+IAHYGRSKSARFEKIVP+DSK KPAVEDRRCSFITPNSDPIYVA
Subjt:  RKKSKSFKLGGSG---ADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYVA

Query:  YHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIW
        YHDEEWGVPVH+DK+LFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFD+E VANFSDKQMVSISTEYGIDINRVRGVVDNAIRIL+IKKEFGSFDKYIW
Subjt:  YHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIW

Query:  AFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLLAAGRRAPPAV----EVEETS
         FVN+KPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHLHCTL+AAGRR P       EVE+T+
Subjt:  AFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLLAAGRRAPPAV----EVEETS

A0A5A7UM21 Putative GMP synthase (e value 4.1e-167, 81.77% identity)
Query:  MCRSEQVMEATSVVA----VGRPVLQPTCNR-LHRRNSLKKQPPS--PSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAA-RPRA-L
        MCRSE+ +EATSVV       RPVLQPTCNR L RRNSLKKQ PS  P  P +  SP SPKSKSPRPPATKRAND    MNSSS+K+++PAAA RPRA L
Subjt:  MCRSEQVMEATSVVA----VGRPVLQPTCNR-LHRRNSLKKQPPS--PSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAA-RPRA-L

Query:  DRKKSKSFKLGGSG---ADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYV
        DRKKSKSFKLGG+G    D     ++YASSLITESPGSIAAVRREQVALQQAQRKM+IAHYGRSKSARFEKIVP+DSK KP+VEDRRCSFITPNSDPIYV
Subjt:  DRKKSKSFKLGGSG---ADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYV

Query:  AYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYI
        AYHDEEWGVPVH+DK+LFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFD+E VANFS+KQMVSISTEYGIDINRVRGVVDN+IRIL+IKKEFGSFDKYI
Subjt:  AYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYI

Query:  WAFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLLAAGRRAPPAV----EVEETSGTL
        W FVN+KPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT+CHRHLHCTL+AAGRR P       EVEE +  +
Subjt:  WAFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLLAAGRRAPPAV----EVEETSGTL

A0A6J1D778 uncharacterized protein LOC111017989 (e value 1.4e-204, 98.94% identity)
Query:  MCRSEQVMEATSVVAVGRPVLQPTCNRLHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAARPRALDRKKSKSFK
        MCRSEQVMEATSVVAVGR VLQPTCNRLHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAARPRALDRKKSKSFK
Subjt:  MCRSEQVMEATSVVAVGRPVLQPTCNRLHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAARPRALDRKKSKSFK

Query:  LGGSGADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVH
        LGGSGADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVH
Subjt:  LGGSGADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVH

Query:  EDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWAFVNHKPFSPQ
        EDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIW FVNHKPFSPQ
Subjt:  EDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWAFVNHKPFSPQ

Query:  YKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLLAAGRRAPPAVEVEETSGTL
        YKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHL CTLLAAGRRAPPAVEVEETS TL
Subjt:  YKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLLAAGRRAPPAVEVEETSGTL

A0A6J1FSP1 uncharacterized protein LOC111448434 (e value 9.4e-164, 81.77% identity)
Query:  MCRSEQVMEATSVVA----VGRPVLQPTCNR-LHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAA--RPR-ALD
        MCRSEQ +EATSVV       RPVLQPTCNR L RRNSLKK      PP +  SP SPKSKSPRPPATKRAND    MNSSSDK+++PAAA  RP+ ALD
Subjt:  MCRSEQVMEATSVVA----VGRPVLQPTCNR-LHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAA--RPR-ALD

Query:  RKKSKSFKLGGSG----ADEAA-------PSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITP
        RKKSKSFKL G+G     D  A        SLSYASSLIT+SPGSIAAVRREQVALQQAQRKM+IAHYGRSKSARF+K+VP+DSK KPAVEDRRCSFITP
Subjt:  RKKSKSFKLGGSG----ADEAA-------PSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITP

Query:  NSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEF
        NSDPIYVAYHDEEWGVPVH+DK+LFELLVLSVAQVGSDWTSILKKRQDFRNAFSSF AETVA FSDKQM+SIS+EYGIDINRVRGVVDNAIRILEIKKEF
Subjt:  NSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEF

Query:  GSFDKYIWAFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLLAAGRRAPPAVEVEETS
        GSFDKYIW FVN+KPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHC++ AA RRA PAV VEET+
Subjt:  GSFDKYIWAFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLLAAGRRAPPAVEVEETS

A0A6J1J7H3 uncharacterized protein LOC111484173 (e value 8.0e-163, 81.27% identity)
Query:  MCRSEQVMEATSVVA----VGRPVLQPTCNR-LHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAA--RPR-ALD
        MCRSEQ +EATSVV       RPVLQPTCNR L RRNSLKK      PP +  SP SPKSKSPRPPATKRAN+    MNSSSDK+++PAAA  RP+ ALD
Subjt:  MCRSEQVMEATSVVA----VGRPVLQPTCNR-LHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAA--RPR-ALD

Query:  RKKSKSFKLGGSG----ADEAA-------PSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITP
        RKKSKSFKL G+G     D  A        SLSYASSLIT+SPGSIAAVRREQVALQQAQRKM+IAHYGRSKSARF+K+VP+DSK KPAVE RRCSFITP
Subjt:  RKKSKSFKLGGSG----ADEAA-------PSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITP

Query:  NSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEF
        NSDPIYVAYHDEEWGVPVH+DK+LFELLVLSVAQVGSDWTSILKKRQDFRNAFSSF AETVA FSDKQM+SIS+EYGIDINRVRGVVDNAIRILEIKKEF
Subjt:  NSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEF

Query:  GSFDKYIWAFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLLAAGRRAPPAVEVEETS
        GSFDKYIW FVN+KPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQ AGLTNDHLTSCHRHLHC++ AAGRRA PAV VEET+
Subjt:  GSFDKYIWAFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLLAAGRRAPPAVEVEETS

SwissProt top hits (e value, % identity, alignment)
P05100 DNA-3-methyladenine glycosylase 1 (e value 1.0e-34, 39.11% identity)
Query:  RCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINR--VRGVVDNAI
        RC ++  + DP+Y+AYHD EWGVP  + K LFE++ L   Q G  W ++LKKR+++R  F  FD   VA   ++ +  +  + GI  +R  ++ ++ NA 
Subjt:  RCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINR--VRGVVDNAI

Query:  RILEIKKEFGSFDKYIWAFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSC
          L++++    F  ++W+FVNH+P   Q  +  +IP  TS S+ +SK + +RGF+ VG T+ +SFMQA GL NDH+  C
Subjt:  RILEIKKEFGSFDKYIWAFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSC

P44321 DNA-3-methyladenine glycosylase (e value 5.0e-29, 36.31% identity)
Query:  RCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVR--GVVDNAI
        RC ++   S  IY+ YHD+EWG P  + + LFE + L   Q G  W ++LKKR+ +R AF  FD + +A  +   + +     G+  +R +   +V NA 
Subjt:  RCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVR--GVVDNAI

Query:  RILEIKKEFGSFDKYIWAFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSC
          L ++K   +F  +IW+FVNHKP          +P KT  S+ +SK + +RGF  +G T  ++FMQ+ GL +DHL  C
Subjt:  RILEIKKEFGSFDKYIWAFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing] (e value 3.1e-39, 43.85% identity)
Query:  EDRRCSFITPNSD---PIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINR--VRG
        E  RC++ T   +    +Y  YHD EWG P+HEDK LFE LVL   Q G  W +ILKKR+ FR AF  FD   VAN+ + ++  +    GI  NR  +  
Subjt:  EDRRCSFITPNSD---PIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINR--VRG

Query:  VVDNAIRILEIKKEFGSFDKYIWAFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHR
         + NA   + +++EFGSFDKYIW FV  KP    ++S   +P  T  S+ I+KD+ +RGF+ VG T +++ MQ+ G+ NDHLTSC +
Subjt:  VVDNAIRILEIKKEFGSFDKYIWAFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHR

Arabidopsis top hits (e value, % identity, alignment)
AT1G75090.1 DNA glycosylase superfamily protein (e value 3.4e-57, 52.66% identity)
Query:  RRCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDIN--RVRGVVDNA
        +RC +ITPNSDPIYV +HDEEWGVPV +DK LFELLV S A     W SIL++R DFR  F  FD   +A F++K+++S+     + ++  ++R +V+NA
Subjt:  RRCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDIN--RVRGVVDNA

Query:  IRILEIKKEFGSFDKYIWAFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTL
          +L++K+EFGSF  Y W FVNHKP    Y+ G ++PVK+ K+E ISKDM++RGFR VGPTV++SF+QA+G+ NDHLT+C R+  C +
Subjt:  IRILEIKKEFGSFDKYIWAFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTL

AT3G12710.1 DNA glycosylase superfamily protein (e value 5.7e-97, 62.07% identity)
Query:  SKSPRPPATKRANDAATAMNS--------SSDKLVLPAAARPR-ALDRKKSKSFKLGGSGADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMK
        SK+     TKR     ++ NS          D ++   AA+ R +L+RKKSKSFK G           SY+S LITE+PGSIAAVRREQVA QQA RK+K
Subjt:  SKSPRPPATKRANDAATAMNS--------SSDKLVLPAAARPR-ALDRKKSKSFKLGGSGADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMK

Query:  IAHYGRSKSA---RFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETV
        IAHYGRSKS       K+VP+     P    +RCSF+TP SDPIYVAYHDEEWGVPVH+DK LFELL LS AQVGSDWTS L+KR D+R AF  F+AE V
Subjt:  IAHYGRSKSA---RFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETV

Query:  ANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWAFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAG
        A  ++K+M +IS EY I++++VRGVV+NA +I+EIKK F S +KY+W FVNHKP S  YK GHKIPVKTSKSE+ISKDMVRRGFR VGPTVVHSFMQAAG
Subjt:  ANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWAFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAG

Query:  LTNDHLTSCHRHLHCTLLA
        LTNDHL +C RH  CTLLA
Subjt:  LTNDHLTSCHRHLHCTLLA

AT5G44680.1 DNA glycosylase superfamily protein (e value 8.0e-91, 53.24% identity)
Query:  GRPVLQPTCNR---LHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAARPRALDRKKSKSFKL-----GGSGADE
        GRPVLQP  N+   L RRNSLKK PP P  P++   P      SPRP +       +  ++ ++  L  PA +    L    +KS  +        G  E
Subjt:  GRPVLQPTCNR---LHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAARPRALDRKKSKSFKL-----GGSGADE

Query:  AAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARF-EKIVPIDSKTKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFE
          P +     ++ + PGSIAA RRE+VA++Q +RK KI+HYGR KS +  EK + ++ + K     +RCSFIT +SDPIYVAYHD+EWGVPVH+D +LFE
Subjt:  AAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARF-EKIVPIDSKTKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFE

Query:  LLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWAFVNHKPFSPQYKSGHKI
        LLVL+ AQVGSDWTS+LK+R  FR AFS F+AE VA+F++K++ SI  +YGI++++V  VVDNA +IL++K++ GSF+KYIW F+ HKP + +Y S  KI
Subjt:  LLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWAFVNHKPFSPQYKSGHKI

Query:  PVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLLAA
        PVKTSKSETISKDMVRRGFR VGPTV+HS MQAAGLTNDHL +C RHL CT +AA
Subjt:  PVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLLAA

AT5G57970.1 DNA glycosylase superfamily protein (e value 4.3e-52, 37.17% identity)
Query:  RNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATA-------MNSSSDKLVLPAAARPRALDRKKSKSFKLGGSGADEAAPSLSYASSLITES
        R++L       SP ++  + +    K  R  + +  +D  T+       ++SSS K  L AA+  R  ++  + +  L  S + +A+    ++ +    S
Subjt:  RNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATA-------MNSSSDKLVLPAAARPRALDRKKSKSFKLGGSGADEAAPSLSYASSLITES

Query:  PGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIV---PIDSKTKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDW
         G +              R   +    +S  ++   +V    +DS    +   +RC+++TPNSDP Y+ +HDEEWGVPVH+DK LFELLVLS A     W
Subjt:  PGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIV---PIDSKTKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDW

Query:  TSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDIN--RVRGVVDNAIRILEIKKEFGSFDKYIWAFVNHKPFSPQYKSGHKIPVKTSKSETIS
         +IL KRQ FR  F+ FD   +   ++K+++   +     ++  ++R V++NA +IL++ +E+GSFDKYIW+FV +K    +++   ++P KT K+E IS
Subjt:  TSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDIN--RVRGVVDNAIRILEIKKEFGSFDKYIWAFVNHKPFSPQYKSGHKIPVKTSKSETIS

Query:  KDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHC
        KD+VRRGFRSVGPTVV+SFMQAAG+TNDHLTSC R  HC
Subjt:  KDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHC

AT5G57970.2 DNA glycosylase superfamily protein (e value 4.3e-52, 37.17% identity)
Query:  RNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATA-------MNSSSDKLVLPAAARPRALDRKKSKSFKLGGSGADEAAPSLSYASSLITES
        R++L       SP ++  + +    K  R  + +  +D  T+       ++SSS K  L AA+  R  ++  + +  L  S + +A+    ++ +    S
Subjt:  RNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATA-------MNSSSDKLVLPAAARPRALDRKKSKSFKLGGSGADEAAPSLSYASSLITES

Query:  PGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIV---PIDSKTKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDW
         G +              R   +    +S  ++   +V    +DS    +   +RC+++TPNSDP Y+ +HDEEWGVPVH+DK LFELLVLS A     W
Subjt:  PGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIV---PIDSKTKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDW

Query:  TSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDIN--RVRGVVDNAIRILEIKKEFGSFDKYIWAFVNHKPFSPQYKSGHKIPVKTSKSETIS
         +IL KRQ FR  F+ FD   +   ++K+++   +     ++  ++R V++NA +IL++ +E+GSFDKYIW+FV +K    +++   ++P KT K+E IS
Subjt:  TSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDIN--RVRGVVDNAIRILEIKKEFGSFDKYIWAFVNHKPFSPQYKSGHKIPVKTSKSETIS

Query:  KDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHC
        KD+VRRGFRSVGPTVV+SFMQAAG+TNDHLTSC R  HC
Subjt:  KDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHC


Sequences
CDS sequence
ATGTGCCGTTCGGAGCAGGTCATGGAGGCCACTTCTGTTGTTGCAGTTGGAAGACCCGTCCTCCAACCCACCTGCAACCGTCTCCACCGCCGTAATTCCCTCAAAAAACA
ACCCCCATCTCCCTCTCCGCCTCTCTCTCCGCCGTCCCCCGCCTCTCCCAAGTCCAAGTCCCCCCGCCCCCCGGCCACCAAGCGGGCCAATGACGCCGCTACTGCCATGA
ACTCCAGCTCCGACAAGCTCGTTCTTCCCGCCGCCGCTCGACCCAGGGCTCTCGATAGGAAGAAATCCAAGAGCTTCAAATTGGGCGGGAGTGGGGCCGATGAGGCGGCG
CCGTCTTTGAGCTACGCTTCGTCTCTGATCACTGAGTCGCCGGGGAGTATCGCCGCCGTGAGGAGGGAGCAGGTGGCGCTGCAGCAGGCGCAGAGGAAGATGAAGATTGC
CCATTATGGAAGATCTAAATCTGCACGGTTTGAAAAAATTGTTCCTATTGATTCTAAAACTAAACCCGCTGTCGAAGATCGAAGATGCAGCTTCATCACACCTAATTCAG
ATCCCATCTATGTTGCTTACCATGATGAAGAATGGGGTGTCCCTGTTCATGAGGACAAGGTGCTGTTTGAATTGCTGGTTCTGAGCGTAGCCCAGGTGGGTTCAGATTGG
ACTTCAATATTGAAGAAACGCCAAGATTTCAGAAACGCATTTTCAAGCTTCGATGCAGAAACTGTGGCTAATTTTTCCGACAAACAGATGGTTTCCATCAGCACGGAATA
TGGCATCGACATTAACCGAGTCCGAGGAGTTGTCGACAACGCAATCCGAATCCTCGAGATTAAGAAGGAATTTGGGTCATTCGATAAATACATTTGGGCATTTGTGAACC
ACAAGCCCTTCTCGCCGCAGTACAAGTCCGGCCACAAAATTCCGGTGAAGACATCCAAATCGGAGACCATAAGCAAAGACATGGTCCGACGAGGTTTCCGGTCGGTCGGA
CCGACGGTGGTCCACTCGTTCATGCAAGCCGCCGGCCTGACCAACGACCATTTGACCAGCTGCCACAGGCACCTCCACTGCACGCTACTCGCCGCCGGCCGCCGCGCTCC
GCCGGCCGTGGAAGTGGAGGAGACTTCCGGAACTCTC
mRNA sequence
ATGTGCCGTTCGGAGCAGGTCATGGAGGCCACTTCTGTTGTTGCAGTTGGAAGACCCGTCCTCCAACCCACCTGCAACCGTCTCCACCGCCGTAATTCCCTCAAAAAACA
ACCCCCATCTCCCTCTCCGCCTCTCTCTCCGCCGTCCCCCGCCTCTCCCAAGTCCAAGTCCCCCCGCCCCCCGGCCACCAAGCGGGCCAATGACGCCGCTACTGCCATGA
ACTCCAGCTCCGACAAGCTCGTTCTTCCCGCCGCCGCTCGACCCAGGGCTCTCGATAGGAAGAAATCCAAGAGCTTCAAATTGGGCGGGAGTGGGGCCGATGAGGCGGCG
CCGTCTTTGAGCTACGCTTCGTCTCTGATCACTGAGTCGCCGGGGAGTATCGCCGCCGTGAGGAGGGAGCAGGTGGCGCTGCAGCAGGCGCAGAGGAAGATGAAGATTGC
CCATTATGGAAGATCTAAATCTGCACGGTTTGAAAAAATTGTTCCTATTGATTCTAAAACTAAACCCGCTGTCGAAGATCGAAGATGCAGCTTCATCACACCTAATTCAG
ATCCCATCTATGTTGCTTACCATGATGAAGAATGGGGTGTCCCTGTTCATGAGGACAAGGTGCTGTTTGAATTGCTGGTTCTGAGCGTAGCCCAGGTGGGTTCAGATTGG
ACTTCAATATTGAAGAAACGCCAAGATTTCAGAAACGCATTTTCAAGCTTCGATGCAGAAACTGTGGCTAATTTTTCCGACAAACAGATGGTTTCCATCAGCACGGAATA
TGGCATCGACATTAACCGAGTCCGAGGAGTTGTCGACAACGCAATCCGAATCCTCGAGATTAAGAAGGAATTTGGGTCATTCGATAAATACATTTGGGCATTTGTGAACC
ACAAGCCCTTCTCGCCGCAGTACAAGTCCGGCCACAAAATTCCGGTGAAGACATCCAAATCGGAGACCATAAGCAAAGACATGGTCCGACGAGGTTTCCGGTCGGTCGGA
CCGACGGTGGTCCACTCGTTCATGCAAGCCGCCGGCCTGACCAACGACCATTTGACCAGCTGCCACAGGCACCTCCACTGCACGCTACTCGCCGCCGGCCGCCGCGCTCC
GCCGGCCGTGGAAGTGGAGGAGACTTCCGGAACTCTC
Protein sequence
MCRSEQVMEATSVVAVGRPVLQPTCNRLHRRNSLKKQPPSPSPPLSPPSPASPKSKSPRPPATKRANDAATAMNSSSDKLVLPAAARPRALDRKKSKSFKLGGSGADEAA
PSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDW
TSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWAFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVG
PTVVHSFMQAAGLTNDHLTSCHRHLHCTLLAAGRRAPPAVEVEETSGTL
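
As a quick consistency check, the CDS sequence can be translated and compared with the protein sequence. The sketch below is only illustrative: it assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene, and `translate` is a hypothetical helper rather than a CuGenDB function. For brevity it checks only the first 30 nucleotides of the CDS against the first 10 residues of the protein; the full sequences, joined into single strings, can be passed in the same way.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence; unknown codons become 'X'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X")
                   for i in range(0, len(cds), 3))

cds_prefix = "ATGTGCCGTTCGGAGCAGGTCATGGAGGCC"  # first 30 nt of the CDS
protein_prefix = "MCRSEQVMEA"                  # first 10 residues of the protein
print(translate(cds_prefix) == protein_prefix)  # True
```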