; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MELO3C034251 (gene) of Melon (DHL92) v4 genome

Gene IDMELO3C034251
OrganismCucumis melo DHL92 (Melon (DHL92) v4)
DescriptionEncodes a protein whose expression is responsive to nematode infection, putative
Genome locationchr10:18883852..18884761
RNA-Seq ExpressionMELO3C034251
SyntenyMELO3C034251
Gene Ontology termsNA
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140437.1 uncharacterized protein At1g66480 [Cucumis sativus]2.0e-8793.33Show/hide
Query:  MGNTFGVKKTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAPRRVRSAINMSAKDRLES
        MGNTFGVKKTVKVMKISGET+KL TPVQ GDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLP+LPKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLTEEGGGES-EGSGSGA-TRVKVRLPKAEVERLLKECKDEAEAAERIMGLYKTRESVCENDHKEKE-KKDIIKPRE
        LMLARRSASDLTIMKPKS+LTEEGGGES EGS SGA TRVK+RLPKAEVERLLKECKDEAEAAERIMGLYKTRE+  ENDHKEKE KKDIIKPRE
Subjt:  LMLARRSASDLTIMKPKSVLTEEGGGES-EGSGSGA-TRVKVRLPKAEVERLLKECKDEAEAAERIMGLYKTRESVCENDHKEKE-KKDIIKPRE

XP_008454771.1 PREDICTED: uncharacterized protein At1g66480 isoform X1 [Cucumis melo]3.6e-97100Show/hide
Query:  MGNTFGVKKTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAPRRVRSAINMSAKDRLES
        MGNTFGVKKTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEAAERIMGLYKTRESVCENDHKEKEKKDIIKPRE
        LMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEAAERIMGLYKTRESVCENDHKEKEKKDIIKPRE
Subjt:  LMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEAAERIMGLYKTRESVCENDHKEKEKKDIIKPRE

XP_008454773.1 PREDICTED: uncharacterized protein At1g66480 isoform X2 [Cucumis melo]3.6e-97100Show/hide
Query:  MGNTFGVKKTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAPRRVRSAINMSAKDRLES
        MGNTFGVKKTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEAAERIMGLYKTRESVCENDHKEKEKKDIIKPRE
        LMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEAAERIMGLYKTRESVCENDHKEKEKKDIIKPRE
Subjt:  LMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEAAERIMGLYKTRESVCENDHKEKEKKDIIKPRE

XP_022942788.1 uncharacterized protein At1g66480 isoform X1 [Cucurbita moschata]1.2e-7684.1Show/hide
Query:  MGNTFGVKKTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAPRRVRSAINMSAKDRLES
        MGN FG+KKTVKVM +SG+T+KL  PVQ  DVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLV+LPK+PKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEAAERIMGLY--KTRESVCENDHK-EKEKKDIIKPRE
        LML+RRSASDLTIMKPKSVL EEGG + E  GS ATRVKVRLPKAEVER+LKE KDEAEAAERIMGLY  K RESVC+N+ K EKEK  IIKPRE
Subjt:  LMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEAAERIMGLY--KTRESVCENDHK-EKEKKDIIKPRE

XP_038891934.1 uncharacterized protein At1g66480 [Benincasa hispida]1.9e-9092.82Show/hide
Query:  MGNTFGVKKTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAPRRVRSAINMSAKDRLES
        MGNTFGVKKTVKVM ISGET+KLNTPVQ GDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLV+LPK+ KEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLTEEGGGESEGSGSG---ATRVKVRLPKAEVERLLKECKDEAEAAERIMGLYKTRESVCENDHKEKEKKDIIKPRE
        LMLARRSASDLTIMKPKSVL EEGGGES+GSGSG   ATRVKVRLPKAEVERLLKECKDEAEAAERIMGLYKTRE+VCENDHK++EKKDIIKPRE
Subjt:  LMLARRSASDLTIMKPKSVLTEEGGGESEGSGSG---ATRVKVRLPKAEVERLLKECKDEAEAAERIMGLYKTRESVCENDHKEKEKKDIIKPRE

TrEMBL top hitse value%identityAlignment
A0A1S3BZC7 uncharacterized protein At1g66480 isoform X11.7e-97100Show/hide
Query:  MGNTFGVKKTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAPRRVRSAINMSAKDRLES
        MGNTFGVKKTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEAAERIMGLYKTRESVCENDHKEKEKKDIIKPRE
        LMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEAAERIMGLYKTRESVCENDHKEKEKKDIIKPRE
Subjt:  LMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEAAERIMGLYKTRESVCENDHKEKEKKDIIKPRE

A0A1S3C0K8 uncharacterized protein At1g66480 isoform X21.7e-97100Show/hide
Query:  MGNTFGVKKTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAPRRVRSAINMSAKDRLES
        MGNTFGVKKTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEAAERIMGLYKTRESVCENDHKEKEKKDIIKPRE
        LMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEAAERIMGLYKTRESVCENDHKEKEKKDIIKPRE
Subjt:  LMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEAAERIMGLYKTRESVCENDHKEKEKKDIIKPRE

A0A6J1FSC9 uncharacterized protein At1g66480 isoform X25.8e-7784.1Show/hide
Query:  MGNTFGVKKTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAPRRVRSAINMSAKDRLES
        MGN FG+KKTVKVM +SG+T+KL  PVQ  DVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLV+LPK+PKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEAAERIMGLY--KTRESVCENDHK-EKEKKDIIKPRE
        LML+RRSASDLTIMKPKSVL EEGG + E  GS ATRVKVRLPKAEVER+LKE KDEAEAAERIMGLY  K RESVC+N+ K EKEK  IIKPRE
Subjt:  LMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEAAERIMGLY--KTRESVCENDHK-EKEKKDIIKPRE

A0A6J1FVP5 uncharacterized protein At1g66480 isoform X15.8e-7784.1Show/hide
Query:  MGNTFGVKKTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAPRRVRSAINMSAKDRLES
        MGN FG+KKTVKVM +SG+T+KL  PVQ  DVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLV+LPK+PKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEAAERIMGLY--KTRESVCENDHK-EKEKKDIIKPRE
        LML+RRSASDLTIMKPKSVL EEGG + E  GS ATRVKVRLPKAEVER+LKE KDEAEAAERIMGLY  K RESVC+N+ K EKEK  IIKPRE
Subjt:  LMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEAAERIMGLY--KTRESVCENDHK-EKEKKDIIKPRE

A0A6J1JDY8 uncharacterized protein At1g66480 isoform X12.4e-7582.56Show/hide
Query:  MGNTFGVKKTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAPRRVRSAINMSAKDRLES
        MGNTFG+KKT KVM +SG+T+KL  PVQ  DVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLV+LPK+PKEQAPRRVRS INMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEAAERIMGLY--KTRESVCENDHK-EKEKKDIIKPRE
        LML+RRSASDLTIMKPKSVL EEG GE E  GS ATRVKVRLPKAEVER+LKE +DEAEAAERIMGLY  K  E+VC+N  K EKEK  IIKPRE
Subjt:  LMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEAAERIMGLY--KTRESVCENDHK-EKEKKDIIKPRE

SwissProt top hitse value%identityAlignment
Q6NLC8 Uncharacterized protein At1g664802.4e-3547.8Show/hide
Query:  MGNTFGVK-KTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAP---------RRVRSAI
        MGN+  VK K  KVMKI GET ++ TPV   +V  DYPG+VLL+S+AVKH+GVR+KPLE +Q L  K+ YFLV+LPKLP E            RRV S I
Subjt:  MGNTFGVK-KTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAP---------RRVRSAI

Query:  NMSAKDRLESLMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEA-AERIMGLYKTR
        ++ AK+RL+ LML+RR+ SD+TI +     ++ G G     G G T V++RLP++++ +L++E  ++A A AE+I+G+Y  R
Subjt:  NMSAKDRLESLMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEA-AERIMGLYKTR

Arabidopsis top hitse value%identityAlignment
AT1G66480.1 plastid movement impaired 21.7e-3647.8Show/hide
Query:  MGNTFGVK-KTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAP---------RRVRSAI
        MGN+  VK K  KVMKI GET ++ TPV   +V  DYPG+VLL+S+AVKH+GVR+KPLE +Q L  K+ YFLV+LPKLP E            RRV S I
Subjt:  MGNTFGVK-KTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAP---------RRVRSAI

Query:  NMSAKDRLESLMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEA-AERIMGLYKTR
        ++ AK+RL+ LML+RR+ SD+TI +     ++ G G     G G T V++RLP++++ +L++E  ++A A AE+I+G+Y  R
Subjt:  NMSAKDRLESLMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEA-AERIMGLYKTR

AT1G71015.1 unknown protein6.0e-4254.49Show/hide
Query:  MGNTFGVKKTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAPRRVRSAINMSAKDRLES
        MGN+ G KKT  +M I+GE+ KL TPV+ G VVKD+PG VLLESEAVK  G+RAKPLE HQ L +KR+YF+V+LP+  KE+ PRRVRS I MSAK+RLE+
Subjt:  MGNTFGVKKTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEAAERIMGL
        L L+RRS+SDL++MK K+ + +E   E E S      VK++LPK ++E+L KE +  ++ + +I  L
Subjt:  LMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEAAERIMGL

AT2G01340.1 Encodes a protein whose expression is responsive to nematode infection.4.4e-4558.14Show/hide
Query:  MGNTFGVKKTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAPRRVRSAINMSAKDRLES
        MGN+ G KKT KVMKI GET KL TPV   +V+KD+PG VLL+SE+VKHYG RAKPLE  Q+L  KRLYF+V+     KE  PRRVRS I++SAK+RLES
Subjt:  MGNTFGVKKTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEAAERIMGLYKTRE
        LMLARRS+SDL+I+KP       GG  +E       RVKVR+PKAE+E+L+KE   EAEA ++I  L+  ++
Subjt:  LMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEAAERIMGLYKTRE

AT5G37840.1 BEST Arabidopsis thaliana protein match is: plastid movement impaired 2 (TAIR:AT1G66480.1)8.1e-3146.51Show/hide
Query:  MGNTFGVKKT-VKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQ--APRRVRSA-INMSAKD
        MGNT  V++  VKVMKI G+  +L TPV   D  K+YPGFVLL+SE VK  GVRAKPLE +Q L     YFLVDLP + K      RRV S  I++ AK+
Subjt:  MGNTFGVKKT-VKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQ--APRRVRSA-INMSAKD

Query:  RLESLMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEAAERIMGLY
        RLE LML+RR+ SD+   +   V         +G   G TRV++RLP++++ +L++E  D +E A +I+  Y
Subjt:  RLESLMLARRSASDLTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEAAERIMGLY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAACACCTTTGGAGTCAAGAAGACGGTTAAGGTTATGAAGATCTCCGGCGAGACCCTGAAACTAAATACCCCGGTTCAAGTTGGAGATGTCGTCAAGGATTATCC
TGGCTTTGTTCTACTCGAATCTGAGGCTGTTAAGCACTACGGGGTTCGAGCAAAGCCTTTGGAGCTCCACCAGAAGCTCAGCACCAAGAGACTTTATTTCCTCGTCGATC
TTCCTAAACTTCCAAAAGAACAGGCTCCACGACGAGTACGGTCAGCGATCAACATGAGTGCAAAGGATAGGTTAGAGAGCTTGATGTTGGCACGACGATCAGCATCGGAC
CTAACTATCATGAAACCAAAGAGCGTGTTGACGGAGGAGGGAGGTGGAGAGAGTGAGGGATCGGGATCAGGAGCAACACGGGTGAAGGTACGGTTGCCGAAGGCCGAAGT
GGAAAGACTGTTGAAGGAGTGTAAAGATGAAGCAGAGGCAGCAGAAAGGATTATGGGATTGTACAAAACAAGAGAGAGTGTTTGTGAAAATGATCACAAGGAGAAGGAGA
AGAAGGATATCATCAAGCCCCGTGAG
mRNA sequenceShow/hide mRNA sequence
CGCCCACCCACCACCCTCTTCTTTCTCTCTCTAATTTTCAAAACCCTTTCCCTTTTGCCCGCCATGGGAAACACCTTTGGAGTCAAGAAGACGGTTAAGGTTATGAAGAT
CTCCGGCGAGACCCTGAAACTAAATACCCCGGTTCAAGTTGGAGATGTCGTCAAGGATTATCCTGGCTTTGTTCTACTCGAATCTGAGGCTGTTAAGCACTACGGGGTTC
GAGCAAAGCCTTTGGAGCTCCACCAGAAGCTCAGCACCAAGAGACTTTATTTCCTCGTCGATCTTCCTAAACTTCCAAAAGAACAGGCTCCACGACGAGTACGGTCAGCG
ATCAACATGAGTGCAAAGGATAGGTTAGAGAGCTTGATGTTGGCACGACGATCAGCATCGGACCTAACTATCATGAAACCAAAGAGCGTGTTGACGGAGGAGGGAGGTGG
AGAGAGTGAGGGATCGGGATCAGGAGCAACACGGGTGAAGGTACGGTTGCCGAAGGCCGAAGTGGAAAGACTGTTGAAGGAGTGTAAAGATGAAGCAGAGGCAGCAGAAA
GGATTATGGGATTGTACAAAACAAGAGAGAGTGTTTGTGAAAATGATCACAAGGAGAAGGAGAAGAAGGATATCATCAAGCCCCGTGAGGT
Protein sequenceShow/hide protein sequence
MGNTFGVKKTVKVMKISGETLKLNTPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPKLPKEQAPRRVRSAINMSAKDRLESLMLARRSASD
LTIMKPKSVLTEEGGGESEGSGSGATRVKVRLPKAEVERLLKECKDEAEAAERIMGLYKTRESVCENDHKEKEKKDIIKPRE