CuGenDBv2

Sgr023519 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr023519
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: DNA glycosylase superfamily protein
Genome location: tig00000892:4020615..4022332
RNA-Seq Expression: Sgr023519
Synteny: Sgr023519
Gene Ontology terms: GO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domains: IPR005019 - Methyladenine glycosylase
IPR011257 - DNA glycosylase
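
The genome location field follows a contig:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the string can be split into its parts and the span of the gene computed. The names `Location` and `parse_location` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive

    @property
    def span(self) -> int:
        """Number of bases covered, under the inclusive-coordinate assumption."""
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into a Location."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Location(contig, start, end)


loc = parse_location("tig00000892:4020615..4022332")
print(loc.contig, loc.span)  # prints: tig00000892 1718
```

Under that assumption the gene spans 1718 bases on contig tig00000892.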


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6570606.1 hypothetical protein SDJN03_29521, partial [Cucurbita argyrosperma subsp. sororia] (e-value 2.4e-167, identity 79.47%)
Query:  MCRSEQQVMEATSVVLPTPTSSEAKINGRPVLQPTCNRVPTTLDRRNSLKKPSPKAAAPPISPPPSSSHMGPTSPNSKSPRPPATKRANE---MNSSSEK
        MCRSE Q +EAT+VV+      ++K   RPVLQPTCNRV   LDRRNSLKKP P AA            + PTSP SKSPRPPATKRAN+   MNSSS+K
Subjt:  MCRSEQQVMEATSVVLPTPTSSEAKINGRPVLQPTCNRVPTTLDRRNSLKKPSPKAAAPPISPPPSSSHMGPTSPNSKSPRPPATKRANE---MNSSSEK

Query:  VVIPAGNSNSSRACRALDRKKSKSFKLGGSCGSVVSDNV--GGGFEAAASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVP
        ++IPA  +  SR   ALDRKKSKSFKL G+   V+ DNV  GGGFE  ASLSYASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF+K+VP
Subjt:  VVIPAGNSNSSRACRALDRKKSKSFKLGGSCGSVVSDNV--GGGFEAAASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVP

Query:  IDSKNKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDIN
        +DSK KPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSF AETVA FSDKQM+SISSEYGIDIN
Subjt:  IDSKNKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDIN

Query:  RVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHRHLHCTLLPA
        RVRGVVDNAIRILEIKKEFGSFDKYIWGFVN+KPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQA GLTNDHL SCHRHLHC++  A
Subjt:  RVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHRHLHCTLLPA

Query:  GRRLPTVQVEE--TSDETL
         RR P V VEE  T+ ETL
Subjt:  GRRLPTVQVEE--TSDETL

XP_022149598.1 uncharacterized protein LOC111017989 [Momordica charantia] (e-value 2.0e-166, identity 80.43%)
Query:  MCRSEQQVMEATSVVLPTPTSSEAKINGRPVLQPTCNRVPTTLDRRNSLKKPSPKAAAPPISPPPSSSHMGPTSPNSKSPRPPATKRANE----MNSSSE
        MCRSE QVMEATSVV            GR VLQPTCNR    L RRNSLKK  P + +PP+SPP       P SP SKSPRPPATKRAN+    MNSSS+
Subjt:  MCRSEQQVMEATSVVLPTPTSSEAKINGRPVLQPTCNRVPTTLDRRNSLKKPSPKAAAPPISPPPSSSHMGPTSPNSKSPRPPATKRANE----MNSSSE

Query:  KVVIPAGNSNSSRACRALDRKKSKSFKLGGSCGSVVSDNVGGGFEAAASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPI
        K+V+PA     +   RALDRKKSKSFKLGGS          G  EAA SLSYASSLITESPGSIAAVRREQVALQQAQRKM+IAHYGRSKSARFEKIVPI
Subjt:  KVVIPAGNSNSSRACRALDRKKSKSFKLGGSCGSVVSDNVGGGFEAAASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPI

Query:  DSKNKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDINR
        DSK KPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVH+DK+LFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVA+FSDKQM+SIS+EYGIDINR
Subjt:  DSKNKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDINR

Query:  VRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHRHLHCTLLPAG
        VRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQA GLTNDHL SCHRHL CTLL AG
Subjt:  VRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHRHLHCTLLPAG

Query:  RRL-PTVQVEETSD
        RR  P V+VEETS+
Subjt:  RRL-PTVQVEETSD

XP_022943791.1 uncharacterized protein LOC111448434 [Cucurbita moschata] (e-value 1.1e-167, identity 79.71%)
Query:  MCRSEQQVMEATSVVLPTPTSSEAKINGRPVLQPTCNRVPTTLDRRNSLKKPSPKAAAPPISPPPSSSHMGPTSPNSKSPRPPATKRANE---MNSSSEK
        MCRSE Q +EATSVV+      ++K   RPVLQPTCNRV   LDRRNSLKKP P AA            + PTSP SKSPRPPATKRAN+   MNSSS+K
Subjt:  MCRSEQQVMEATSVVLPTPTSSEAKINGRPVLQPTCNRVPTTLDRRNSLKKPSPKAAAPPISPPPSSSHMGPTSPNSKSPRPPATKRANE---MNSSSEK

Query:  VVIPAGNSNSSRACRALDRKKSKSFKLGGSCGSVVSDNV--GGGFEAAASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVP
        ++IPA  +  SR   ALDRKKSKSFKL G+   V+ DNV  GGGFE  ASLSYASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF+K+VP
Subjt:  VVIPAGNSNSSRACRALDRKKSKSFKLGGSCGSVVSDNV--GGGFEAAASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVP

Query:  IDSKNKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDIN
        +DSK KPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSF AETVA FSDKQM+SISSEYGIDIN
Subjt:  IDSKNKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDIN

Query:  RVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHRHLHCTLLPA
        RVRGVVDNAIRILEIKKEFGSFDKYIWGFVN+KPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQA GLTNDHL SCHRHLHC++  A
Subjt:  RVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHRHLHCTLLPA

Query:  GRRLPTVQVEE--TSDETL
         RR P V VEE  T+ ETL
Subjt:  GRRLPTVQVEE--TSDETL

XP_022986422.1 uncharacterized protein LOC111484173 [Cucurbita maxima] (e-value 1.8e-167, identity 80.15%)
Query:  MCRSEQQVMEATSVVLPTPTSSEAKINGRPVLQPTCNRVPTTLDRRNSLKKPSPKAAAPPISPPPSSSHMGPTSPNSKSPRPPATKRANE---MNSSSEK
        MCRSE Q +EATSVV+      ++K   RPVLQPTCNRV   LDRRNSLKKP P AA            + PTSP SKSPRPPATKRANE   MNSSS+K
Subjt:  MCRSEQQVMEATSVVLPTPTSSEAKINGRPVLQPTCNRVPTTLDRRNSLKKPSPKAAAPPISPPPSSSHMGPTSPNSKSPRPPATKRANE---MNSSSEK

Query:  VVIPAGNSNSSRACRALDRKKSKSFKLGGSCGSVVSDNV--GGGFEAAASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVP
        ++IPA  +  SR   ALDRKKSKSFKL G+   V+ DNV  GGGFE  ASLSYASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF+K+VP
Subjt:  VVIPAGNSNSSRACRALDRKKSKSFKLGGSCGSVVSDNV--GGGFEAAASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVP

Query:  IDSKNKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDIN
        +DSK KPAVE RRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSF AETVA FSDKQM+SISSEYGIDIN
Subjt:  IDSKNKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDIN

Query:  RVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHRHLHCTLLPA
        RVRGVVDNAIRILEIKKEFGSFDKYIWGFVN+KPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQ  GLTNDHL SCHRHLHC++  A
Subjt:  RVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHRHLHCTLLPA

Query:  GRRLPTVQVEETS
        GRR P V VEET+
Subjt:  GRRLPTVQVEETS

XP_038902889.1 uncharacterized protein LOC120089476 [Benincasa hispida] (e-value 5.9e-166, identity 77.78%)
Query:  MCRSEQQVMEATSVVLPTPTSSEAKINGRPVLQPTCNRVPTTLDRRNSLKKPSPKAAAPPISPPPSS-SHMGPTSPNSKSPRPPATKRANE----MNSSS
        MCRSE + +EA++VV+      ++K N RPVLQPTCNRV   LDRRNSLKK       P + PP ++ + + PTSP SKSPRPPATKRAN+    MNSSS
Subjt:  MCRSEQQVMEATSVVLPTPTSSEAKINGRPVLQPTCNRVPTTLDRRNSLKKPSPKAAAPPISPPPSS-SHMGPTSPNSKSPRPPATKRANE----MNSSS

Query:  EKVVIPA---GNSNSSRACRALDRKKSKSFKLGGSCGSVVSDNVGGGFEAAASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEK
        +K++IPA   G  + SR    LDRKKSKSFKLGG+   V+ DN  GG+E  A LSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEK
Subjt:  EKVVIPA---GNSNSSRACRALDRKKSKSFKLGGSCGSVVSDNVGGGFEAAASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEK

Query:  IVPIDSKNKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGI
        IVP+DSK KPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFD+E VA FSDKQM+SISSEYGI
Subjt:  IVPIDSKNKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGI

Query:  DINRVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHRHLHCTL
        DINRVRGVVDNAIRIL+IKKEFGSFDKYIWGFVN+KPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQA GLTNDHL +CHRHLHCTL
Subjt:  DINRVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHRHLHCTL

Query:  LPAGRR----LPTVQVEETSDET
        + AGRR      T +VEET+  T
Subjt:  LPAGRR----LPTVQVEETSDET

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KED6 Uncharacterized protein (e-value 4.9e-166, identity 78.9%)
Query:  MCRSEQQVMEATSVVLPTPTSSEAKINGRPVLQPTCNRVPTTLDRRNSLKKPSPKAAAPPISPPPSSSHMGPTSPNSKSPRPPATKRANE----MNSSSE
        MCRSE + +EATSVV+      ++K N RPVLQPT NRV   LDRRNSLKK  P         PPS++ + PTSP SKSPRPPATKRAN+    MNSSSE
Subjt:  MCRSEQQVMEATSVVLPTPTSSEAKINGRPVLQPTCNRVPTTLDRRNSLKKPSPKAAAPPISPPPSSSHMGPTSPNSKSPRPPATKRANE----MNSSSE

Query:  KVVIPAGNSNSSRACRALDRKKSKSFKLGGSCGSVVSDNVGGGFEAAASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPI
        K++IPA     SR    LDRKKSKSFKLGG+ G+V+ DN  GGFE A    YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVP+
Subjt:  KVVIPAGNSNSSRACRALDRKKSKSFKLGGSCGSVVSDNVGGGFEAAASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPI

Query:  DSKNKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDINR
        DSK KPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFD+E VA+FSDKQM+SIS+EYGIDINR
Subjt:  DSKNKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDINR

Query:  VRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHRHLHCTLLPAG
        VRGVVDNAIRIL+IKKEFGSFDKYIWGFVN+KPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQA GLTNDHL +CHRHLHCTL+ AG
Subjt:  VRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHRHLHCTLLPAG

Query:  RRLP-----TVQVEETS
        RR P     T +VE+T+
Subjt:  RRLP-----TVQVEETS

A0A5A7UM21 Putative GMP synthase (e-value 2.9e-166, identity 77.94%)
Query:  MCRSEQQVMEATSVVLPTPTSSEAKINGRPVLQPTCNRVPTTLDRRNSLKKPSPKAAAPPISPPPSSSHMGPTSPNSKSPRPPATKRANE----MNSSSE
        MCRSE + +EATSVV+      ++K N RPVLQPTCNRV   LDRRNSLKK  P      + PP  ++ + PTSP SKSPRPPATKRAN+    MNSSSE
Subjt:  MCRSEQQVMEATSVVLPTPTSSEAKINGRPVLQPTCNRVPTTLDRRNSLKKPSPKAAAPPISPPPSSSHMGPTSPNSKSPRPPATKRANE----MNSSSE

Query:  KVVIPAGNSNSSRACRALDRKKSKSFKLGGSCGSVVSDNVGGGFEAAASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPI
        K++IPA    +SR    LDRKKSKSFKLGG+ G+V+ DN  GGFE A    YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVP+
Subjt:  KVVIPAGNSNSSRACRALDRKKSKSFKLGGSCGSVVSDNVGGGFEAAASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPI

Query:  DSKNKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDINR
        DSK KP+VEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFD+E VA+FS+KQM+SIS+EYGIDINR
Subjt:  DSKNKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDINR

Query:  VRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHRHLHCTLLPAG
        VRGVVDN+IRIL+IKKEFGSFDKYIWGFVN+KPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQA GLTNDHL +CHRHLHCTL+ AG
Subjt:  VRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHRHLHCTLLPAG

Query:  RR--LPTVQVEETSDET
        RR   PT    E  ++T
Subjt:  RR--LPTVQVEETSDET

A0A6J1D778 uncharacterized protein LOC111017989 (e-value 9.9e-167, identity 80.43%)
Query:  MCRSEQQVMEATSVVLPTPTSSEAKINGRPVLQPTCNRVPTTLDRRNSLKKPSPKAAAPPISPPPSSSHMGPTSPNSKSPRPPATKRANE----MNSSSE
        MCRSE QVMEATSVV            GR VLQPTCNR    L RRNSLKK  P + +PP+SPP       P SP SKSPRPPATKRAN+    MNSSS+
Subjt:  MCRSEQQVMEATSVVLPTPTSSEAKINGRPVLQPTCNRVPTTLDRRNSLKKPSPKAAAPPISPPPSSSHMGPTSPNSKSPRPPATKRANE----MNSSSE

Query:  KVVIPAGNSNSSRACRALDRKKSKSFKLGGSCGSVVSDNVGGGFEAAASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPI
        K+V+PA     +   RALDRKKSKSFKLGGS          G  EAA SLSYASSLITESPGSIAAVRREQVALQQAQRKM+IAHYGRSKSARFEKIVPI
Subjt:  KVVIPAGNSNSSRACRALDRKKSKSFKLGGSCGSVVSDNVGGGFEAAASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPI

Query:  DSKNKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDINR
        DSK KPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVH+DK+LFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVA+FSDKQM+SIS+EYGIDINR
Subjt:  DSKNKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDINR

Query:  VRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHRHLHCTLLPAG
        VRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQA GLTNDHL SCHRHL CTLL AG
Subjt:  VRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHRHLHCTLLPAG

Query:  RRL-PTVQVEETSD
        RR  P V+VEETS+
Subjt:  RRL-PTVQVEETSD

A0A6J1FSP1 uncharacterized protein LOC111448434 (e-value 5.2e-168, identity 79.71%)
Query:  MCRSEQQVMEATSVVLPTPTSSEAKINGRPVLQPTCNRVPTTLDRRNSLKKPSPKAAAPPISPPPSSSHMGPTSPNSKSPRPPATKRANE---MNSSSEK
        MCRSE Q +EATSVV+      ++K   RPVLQPTCNRV   LDRRNSLKKP P AA            + PTSP SKSPRPPATKRAN+   MNSSS+K
Subjt:  MCRSEQQVMEATSVVLPTPTSSEAKINGRPVLQPTCNRVPTTLDRRNSLKKPSPKAAAPPISPPPSSSHMGPTSPNSKSPRPPATKRANE---MNSSSEK

Query:  VVIPAGNSNSSRACRALDRKKSKSFKLGGSCGSVVSDNV--GGGFEAAASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVP
        ++IPA  +  SR   ALDRKKSKSFKL G+   V+ DNV  GGGFE  ASLSYASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF+K+VP
Subjt:  VVIPAGNSNSSRACRALDRKKSKSFKLGGSCGSVVSDNV--GGGFEAAASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVP

Query:  IDSKNKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDIN
        +DSK KPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSF AETVA FSDKQM+SISSEYGIDIN
Subjt:  IDSKNKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDIN

Query:  RVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHRHLHCTLLPA
        RVRGVVDNAIRILEIKKEFGSFDKYIWGFVN+KPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQA GLTNDHL SCHRHLHC++  A
Subjt:  RVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHRHLHCTLLPA

Query:  GRRLPTVQVEE--TSDETL
         RR P V VEE  T+ ETL
Subjt:  GRRLPTVQVEE--TSDETL

A0A6J1J7H3 uncharacterized protein LOC111484173 (e-value 8.9e-168, identity 80.15%)
Query:  MCRSEQQVMEATSVVLPTPTSSEAKINGRPVLQPTCNRVPTTLDRRNSLKKPSPKAAAPPISPPPSSSHMGPTSPNSKSPRPPATKRANE---MNSSSEK
        MCRSE Q +EATSVV+      ++K   RPVLQPTCNRV   LDRRNSLKKP P AA            + PTSP SKSPRPPATKRANE   MNSSS+K
Subjt:  MCRSEQQVMEATSVVLPTPTSSEAKINGRPVLQPTCNRVPTTLDRRNSLKKPSPKAAAPPISPPPSSSHMGPTSPNSKSPRPPATKRANE---MNSSSEK

Query:  VVIPAGNSNSSRACRALDRKKSKSFKLGGSCGSVVSDNV--GGGFEAAASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVP
        ++IPA  +  SR   ALDRKKSKSFKL G+   V+ DNV  GGGFE  ASLSYASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF+K+VP
Subjt:  VVIPAGNSNSSRACRALDRKKSKSFKLGGSCGSVVSDNV--GGGFEAAASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVP

Query:  IDSKNKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDIN
        +DSK KPAVE RRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSF AETVA FSDKQM+SISSEYGIDIN
Subjt:  IDSKNKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDIN

Query:  RVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHRHLHCTLLPA
        RVRGVVDNAIRILEIKKEFGSFDKYIWGFVN+KPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQ  GLTNDHL SCHRHLHC++  A
Subjt:  RVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHRHLHCTLLPA

Query:  GRRLPTVQVEETS
        GRR P V VEET+
Subjt:  GRRLPTVQVEETS

SwissProt top hits (e-value, % identity, alignment)
P05100 DNA-3-methyladenine glycosylase 1 (e-value 2.3e-35, identity 39.66%)
Query:  RCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDINR--VRGVVDNAI
        RC ++  + DP+Y+AYHD EWGVP  D K LFE++ L   Q G  W ++LKKR+++R  F  FD   VA+  ++ +  +  + GI  +R  ++ ++ NA 
Subjt:  RCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDINR--VRGVVDNAI

Query:  RILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASC
          L++++    F  ++W FVNH+P   Q  +  +IP  TS S+ +SK + +RGF+ VG T+ +SFMQA GL NDH+  C
Subjt:  RILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASC

P44321 DNA-3-methyladenine glycosylase (e-value 1.9e-29, identity 36.87%)
Query:  RCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDINRVR--GVVDNAI
        RC ++   S  IY+ YHD+EWG P  D + LFE + L   Q G  W ++LKKR+ +R AF  FD + +A  +   + +     G+  +R +   +V NA 
Subjt:  RCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDINRVR--GVVDNAI

Query:  RILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASC
          L ++K   +F  +IW FVNHKP          +P KT  S+ +SK + +RGF  +G T  ++FMQ+ GL +DHL  C
Subjt:  RILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASC

Q7VG78 Probable GMP synthase [glutamine-hydrolyzing] (e-value 1.3e-38, identity 42.19%)
Query:  NKPAVEDRRCSFITPNSD---PIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDINR
        N+   E  RC++ T   +    +Y  YHD EWG P+H+DK LFE LVL   Q G  W +ILKKR+ FR AF  FD   VA++ + ++  +    GI  NR
Subjt:  NKPAVEDRRCSFITPNSD---PIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDINR

Query:  --VRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHR
          +   + NA   + +++EFGSFDKYIWGFV  KP    ++S   +P  T  S+ I+KD+ +RGF+ VG T +++ MQ+ G+ NDHL SC +
Subjt:  --VRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHR

Arabidopsis top hits (e-value, % identity, alignment)
AT1G75090.1 DNA glycosylase superfamily protein (e-value 6.3e-57, identity 52.66%)
Query:  RRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDIN--RVRGVVDNA
        +RC +ITPNSDPIYV +HDEEWGVPV DDK LFELLV S A     W SIL++R DFR  F  FD   +A F++K+++S+     + ++  ++R +V+NA
Subjt:  RRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDIN--RVRGVVDNA

Query:  IRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHRHLHCTL
          +L++K+EFGSF  Y W FVNHKP    Y+ G ++PVK+ K+E ISKDM++RGFR VGPTV++SF+QA+G+ NDHL +C R+  C +
Subjt:  IRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHRHLHCTL

AT3G12710.1 DNA glycosylase superfamily protein (e-value 6.2e-97, identity 60.62%)
Query:  PTSPNSKSPRPPATKRANEMNSSSEKVVIPAGNSNSSRACRALDRKKSKSFKLGGSCGSVVSDNVGGGFEAAASLSYASSLITESPGSIAAVRREQVALQ
        P+S NS   R  + KR + M + + KV              +L+RKKSKSFK G                     SY+S LITE+PGSIAAVRREQVA Q
Subjt:  PTSPNSKSPRPPATKRANEMNSSSEKVVIPAGNSNSSRACRALDRKKSKSFKLGGSCGSVVSDNVGGGFEAAASLSYASSLITESPGSIAAVRREQVALQ

Query:  QAQRKMRIAHYGRSKSA---RFEKIVPIDSKNKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFS
        QA RK++IAHYGRSKS       K+VP+ + N P    +RCSF+TP SDPIYVAYHDEEWGVPVHDDK LFELL LS AQVGSDWTS L+KR D+R AF 
Subjt:  QAQRKMRIAHYGRSKSA---RFEKIVPIDSKNKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFS

Query:  SFDAETVASFSDKQMISISSEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVH
         F+AE VA  ++K+M +IS EY I++++VRGVV+NA +I+EIKK F S +KY+WGFVNHKP S  YK GHKIPVKTSKSE+ISKDMVRRGFR VGPTVVH
Subjt:  SFDAETVASFSDKQMISISSEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVH

Query:  SFMQATGLTNDHLASCHRHLHCTLL
        SFMQA GLTNDHL +C RH  CTLL
Subjt:  SFMQATGLTNDHLASCHRHLHCTLL

AT5G44680.1 DNA glycosylase superfamily protein (e-value 3.4e-95, identity 53.05%)
Query:  AKINGRPVLQPTCNRVPTTLDRRNSLKKPSPKAAAPPI----SPPPSSSHMGPTSPNSKSPRPPATKRANEMNSSSEKVVIPAGNSNSSRACRALDRKKS
        ++INGRPVLQP  N+VP TLDRRNSLKK  PK   P      SP P S    P SPN+KS R PA      + SSS K                      
Subjt:  AKINGRPVLQPTCNRVPTTLDRRNSLKKPSPKAAAPPI----SPPPSSSHMGPTSPNSKSPRPPATKRANEMNSSSEKVVIPAGNSNSSRACRALDRKKS

Query:  KSFKLGGSCGSVVSDNVGGGFEAAASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF-EKIVPIDSKNKPAVEDRRCSFITPNSDP
               S   +  +N  GG++    +     ++ + PGSIAA RRE+VA++Q +RK +I+HYGR KS +  EK + ++ + K     +RCSFIT +SDP
Subjt:  KSFKLGGSCGSVVSDNVGGGFEAAASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF-EKIVPIDSKNKPAVEDRRCSFITPNSDP

Query:  IYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDINRVRGVVDNAIRILEIKKEFGSFD
        IYVAYHD+EWGVPVHDD +LFELLVL+ AQVGSDWTS+LK+R  FR AFS F+AE VA F++K++ SI ++YGI++++V  VVDNA +IL++K++ GSF+
Subjt:  IYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDINRVRGVVDNAIRILEIKKEFGSFD

Query:  KYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHRHLHCTLLPA
        KYIWGF+ HKP + +Y S  KIPVKTSKSETISKDMVRRGFR VGPTV+HS MQA GLTNDHL +C RHL CT + A
Subjt:  KYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHRHLHCTLLPA

AT5G57970.1 DNA glycosylase superfamily protein (e-value 2.8e-57, identity 52.28%)
Query:  IDSKNKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDIN
        +DS    +   +RC+++TPNSDP Y+ +HDEEWGVPVHDDK LFELLVLS A     W +IL KRQ FR  F+ FD   +   ++K++I   S     ++
Subjt:  IDSKNKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDIN

Query:  --RVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHRHLHC
          ++R V++NA +IL++ +E+GSFDKYIW FV +K    +++   ++P KT K+E ISKD+VRRGFRSVGPTVV+SFMQA G+TNDHL SC R  HC
Subjt:  --RVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHRHLHC

AT5G57970.2 DNA glycosylase superfamily protein (e-value 2.8e-57, identity 52.28%)
Query:  IDSKNKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDIN
        +DS    +   +RC+++TPNSDP Y+ +HDEEWGVPVHDDK LFELLVLS A     W +IL KRQ FR  F+ FD   +   ++K++I   S     ++
Subjt:  IDSKNKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDIN

Query:  --RVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHRHLHC
          ++R V++NA +IL++ +E+GSFDKYIW FV +K    +++   ++P KT K+E ISKD+VRRGFRSVGPTVV+SFMQA G+TNDHL SC R  HC
Subjt:  --RVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHRHLHC
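
In the alignments above, the unlabeled line between Query and Subjt is a BLAST-style midline: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. A rough sketch of how identity could be tallied from those midlines follows. It assumes identity is identical positions divided by aligned length (gap columns included); whether the database computed its reported percentages exactly this way is not stated on the page. The function names are hypothetical.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one midline.

    Letters count as identities, '+' as conservative substitutions.
    Positives include identities.
    """
    identities = sum(ch.isalpha() for ch in midline)
    similar = midline.count("+")
    return identities, identities + similar, len(midline)


def percent_identity(midlines: list[str]) -> float:
    """Percent identity over all alignment blocks of one hit."""
    identities = sum(midline_stats(m)[0] for m in midlines)
    length = sum(len(m) for m in midlines)
    if length == 0:
        raise ValueError("empty alignment")
    return 100.0 * identities / length


# Final block of the first GenBank hit. The midline must be padded with
# spaces to the width of the query line, since trailing spaces are easily
# lost when the page is copied as text.
query = "GRRLPTVQVEE--TSDETL"
midline = " RR P V VEE  T+ ETL"
assert len(query) == len(midline)
print(midline_stats(midline))  # prints: (11, 12, 19)
```

To reproduce a per-hit figure, pass every midline block of that hit to `percent_identity`, each padded to the width of its query line.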


Sequences
CDS sequence
ATGTGTCGCTCCGAGCAGCAGGTCATGGAAGCCACTTCTGTTGTTCTCCCCACACCCACCTCCTCTGAAGCTAAAATCAATGGCAGACCTGTCCTTCAACCCACCTGCAA
CCGTGTCCCCACCACCCTCGACCGCCGCAATTCCCTCAAAAAACCCTCTCCCAAGGCCGCCGCCCCGCCCATATCGCCCCCGCCCTCTTCCAGCCACATGGGTCCGACCT
CTCCTAACTCCAAATCGCCCCGGCCTCCGGCGACGAAGCGTGCCAATGAGATGAACTCCAGCTCCGAGAAGGTGGTTATTCCGGCGGGGAATAGTAATAGCTCTCGTGCC
TGCAGGGCTTTGGACAGGAAGAAATCGAAAAGCTTTAAATTGGGTGGGAGCTGTGGGAGCGTTGTTTCTGATAATGTTGGTGGTGGGTTTGAGGCGGCGGCGTCTTTGAG
CTACGCTTCTTCTCTGATCACGGAGTCGCCCGGAAGCATCGCCGCCGTGAGGAGGGAGCAGGTGGCGCTGCAACAAGCGCAGAGGAAGATGAGGATTGCCCACTATGGAA
GATCCAAATCTGCCCGATTTGAAAAGATTGTTCCCATTGATTCTAAAAACAAACCGGCTGTGGAAGACAGAAGATGCAGCTTCATCACTCCCAACTCAGATCCCATCTAC
GTTGCTTACCATGATGAAGAATGGGGTGTCCCTGTTCATGATGACAAGATGCTGTTTGAATTGCTGGTTCTGAGCGTAGCCCAGGTGGGTTCAGACTGGACTTCAATATT
GAAGAAACGCCAAGATTTCAGAAATGCATTTTCGAGTTTCGATGCAGAAACTGTGGCCAGCTTCTCCGACAAACAGATGATATCGATCAGCTCAGAATATGGCATCGACA
TTAACAGAGTCCGAGGAGTCGTCGACAACGCAATCCGAATCCTTGAGATTAAGAAGGAATTTGGGTCATTCGATAAATACATTTGGGGATTTGTGAACCACAAGCCGTTC
TCGCCGCAGTACAAGTCCGGCCACAAAATTCCGGTGAAGACATCGAAATCGGAGACCATAAGCAAAGACATGGTCCGACGAGGTTTCCGGTCGGTTGGCCCGACGGTAGT
CCACTCCTTCATGCAAGCCACCGGCCTGACCAACGACCACCTGGCCAGCTGCCACAGGCACCTCCACTGCACCTTACTCCCCGCCGGCCGCCGCCTTCCGACGGTCCAAG
TGGAGGAGACTTCTGATGAAACTCTCTAG
mRNA sequence
ATGTGTCGCTCCGAGCAGCAGGTCATGGAAGCCACTTCTGTTGTTCTCCCCACACCCACCTCCTCTGAAGCTAAAATCAATGGCAGACCTGTCCTTCAACCCACCTGCAA
CCGTGTCCCCACCACCCTCGACCGCCGCAATTCCCTCAAAAAACCCTCTCCCAAGGCCGCCGCCCCGCCCATATCGCCCCCGCCCTCTTCCAGCCACATGGGTCCGACCT
CTCCTAACTCCAAATCGCCCCGGCCTCCGGCGACGAAGCGTGCCAATGAGATGAACTCCAGCTCCGAGAAGGTGGTTATTCCGGCGGGGAATAGTAATAGCTCTCGTGCC
TGCAGGGCTTTGGACAGGAAGAAATCGAAAAGCTTTAAATTGGGTGGGAGCTGTGGGAGCGTTGTTTCTGATAATGTTGGTGGTGGGTTTGAGGCGGCGGCGTCTTTGAG
CTACGCTTCTTCTCTGATCACGGAGTCGCCCGGAAGCATCGCCGCCGTGAGGAGGGAGCAGGTGGCGCTGCAACAAGCGCAGAGGAAGATGAGGATTGCCCACTATGGAA
GATCCAAATCTGCCCGATTTGAAAAGATTGTTCCCATTGATTCTAAAAACAAACCGGCTGTGGAAGACAGAAGATGCAGCTTCATCACTCCCAACTCAGATCCCATCTAC
GTTGCTTACCATGATGAAGAATGGGGTGTCCCTGTTCATGATGACAAGATGCTGTTTGAATTGCTGGTTCTGAGCGTAGCCCAGGTGGGTTCAGACTGGACTTCAATATT
GAAGAAACGCCAAGATTTCAGAAATGCATTTTCGAGTTTCGATGCAGAAACTGTGGCCAGCTTCTCCGACAAACAGATGATATCGATCAGCTCAGAATATGGCATCGACA
TTAACAGAGTCCGAGGAGTCGTCGACAACGCAATCCGAATCCTTGAGATTAAGAAGGAATTTGGGTCATTCGATAAATACATTTGGGGATTTGTGAACCACAAGCCGTTC
TCGCCGCAGTACAAGTCCGGCCACAAAATTCCGGTGAAGACATCGAAATCGGAGACCATAAGCAAAGACATGGTCCGACGAGGTTTCCGGTCGGTTGGCCCGACGGTAGT
CCACTCCTTCATGCAAGCCACCGGCCTGACCAACGACCACCTGGCCAGCTGCCACAGGCACCTCCACTGCACCTTACTCCCCGCCGGCCGCCGCCTTCCGACGGTCCAAG
TGGAGGAGACTTCTGATGAAACTCTCTAG
Protein sequence
MCRSEQQVMEATSVVLPTPTSSEAKINGRPVLQPTCNRVPTTLDRRNSLKKPSPKAAAPPISPPPSSSHMGPTSPNSKSPRPPATKRANEMNSSSEKVVIPAGNSNSSRA
CRALDRKKSKSFKLGGSCGSVVSDNVGGGFEAAASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPIDSKNKPAVEDRRCSFITPNSDPIY
VAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDAETVASFSDKQMISISSEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIWGFVNHKPF
SPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQATGLTNDHLASCHRHLHCTLLPAGRRLPTVQVEETSDETL
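
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), which is the usual choice for plant nuclear genes but is not stated on the page. `translate` is a hypothetical helper that joins the wrapped sequence lines, reads codons in frame from the first base, and stops at the first stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS (line breaks allowed) up to the first stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nucleotides of the CDS listed above.
print(translate("ATGTGTCGCTCCGAGCAGCAGGTCATGGAAGCC"))  # prints: MCRSEQQVMEA
```

The output matches the first eleven residues of the protein sequence. Passing the full CDS text (all wrapped lines) should give the complete protein for comparison.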