CuGenDBv2

PI0017657 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0017657
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Ty3/gypsy retrotransposon protein
Genome location: chr11:13106657..13107504
RNA-Seq Expression: PI0017657
Synteny: PI0017657
Gene Ontology terms:
    GO:0006259 - DNA metabolic process (biological process)
    GO:0005488 - binding (molecular function)
    GO:0016740 - transferase activity (molecular function)
InterPro domains: IPR021109 - Aspartic peptidase domain superfamily
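
The genome location value above follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common genome-browser convention, but not stated on this page), the span of the locus could be computed as follows. The function name `parse_location` is hypothetical and not part of any CuGenDB tooling.

```python
import re

def parse_location(location):
    """Split a 'chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("chr11:13106657..13107504")
# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 848 bp on chr11.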


Homology
GenBank top hits (e value, %identity, alignment)
KAA0039128.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]; e value: 1.8e-75; %identity: 52.48
Query:  MKQVTIPIKGNYGRGEPPLKRLSDTKFRDKLDKGLCFRCNDKYSLGHRCKVREKREMMLLILNEEEGEMEEEPRVERGEGIVELKNLEVLVDAEVELRTI
        MKQ+TIPIKGN+ +GEPP+KRLSD +FR +LD+GLCFRCNDKYS GHRCK +EKRE+M  I+NEEE + E +   E  EG VELK LE+  DA +EL+T+
Subjt:  MKQVTIPIKGNYGRGEPPLKRLSDTKFRDKLDKGLCFRCNDKYSLGHRCKVREKREMMLLILNEEEGEMEEEPRVERGEGIVELKNLEVLVDAEVELRTI

Query:  MGFSTKGTMKILGKIKGREVIVLIDSGATHNFIHHNLVEELKLPVTPRTTFGVTIGDGSDRKGRGVCKRVKLNC--------------------------
           S+K TMK+ G I+ +E+++LIDSGATHNFIH +L  +LKL +   T FG TIG+G+  KG+G+C+RV++                            
Subjt:  MGFSTKGTMKILGKIKGREVIVLIDSGATHNFIHHNLVEELKLPVTPRTTFGVTIGDGSDRKGRGVCKRVKLNC--------------------------

Query:  -TTGFIDIHWPSLTMLFMVGDKPIILKGDPALTRMECSLKTITKTWEEEDQAFLLKFQDREVEGELEEKLEQRKKGDERSCP
         TTG + IHWPSLTM F  G + IILKGDP+L R ECSL+T+ KTW+EEDQ FLL++ + EVE E   K ++++KGDE   P
Subjt:  -TTGFIDIHWPSLTMLFMVGDKPIILKGDPALTRMECSLKTITKTWEEEDQAFLLKFQDREVEGELEEKLEQRKKGDERSCP

KAA0055376.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]; e value: 1.0e-75; %identity: 51.77
Query:  MKQVTIPIKGNYGRGEPPLKRLSDTKFRDKLDKGLCFRCNDKYSLGHRCKVREKREMMLLILNEEEGEMEEEPRVERGEGIVELKNLEVLVDAEVELRTI
        MKQ+TIPIKGN+ +GEPP+KRLSD +FR +LD+GLCFRCNDKYS GHRCK +EKRE+M  I+NEEE + E + + E  EG VELK LE+  +A +EL+T+
Subjt:  MKQVTIPIKGNYGRGEPPLKRLSDTKFRDKLDKGLCFRCNDKYSLGHRCKVREKREMMLLILNEEEGEMEEEPRVERGEGIVELKNLEVLVDAEVELRTI

Query:  MGFSTKGTMKILGKIKGREVIVLIDSGATHNFIHHNLVEELKLPVTPRTTFGVTIGDGSDRKGRGVCKRVKLNC--------------------------
           S+KGTMK+ G I+ +EV+VLIDSGATHNFIHH+L  +L+L + P T FG TIG+G+  +G+G+C+RV++                            
Subjt:  MGFSTKGTMKILGKIKGREVIVLIDSGATHNFIHHNLVEELKLPVTPRTTFGVTIGDGSDRKGRGVCKRVKLNC--------------------------

Query:  -TTGFIDIHWPSLTMLFMVGDKPIILKGDPALTRMECSLKTITKTWEEEDQAFLLKFQDREVEGELEEKLEQRKKGDERSCP
         TTG + IHWPSLTM F  G + I+LKGDP+L R ECSL+T+ KTW+EEDQ FLL++ + +VE +   K +++++GDE   P
Subjt:  -TTGFIDIHWPSLTMLFMVGDKPIILKGDPALTRMECSLKTITKTWEEEDQAFLLKFQDREVEGELEEKLEQRKKGDERSCP

TYK28976.1 Retrotransposable element Tf2 [Cucumis melo var. makuwa]; e value: 1.8e-75; %identity: 51.77
Query:  MKQVTIPIKGNYGRGEPPLKRLSDTKFRDKLDKGLCFRCNDKYSLGHRCKVREKREMMLLILNEEEGEMEEEPRVERGEGIVELKNLEVLVDAEVELRTI
        MKQ+TIP+KG+Y +GEPP+KRLSD +FR +LDKGLCFRCN+KYS GHRCK++EKRE+ML ILNEEE   E E    +    +E+K LE L +A +E R I
Subjt:  MKQVTIPIKGNYGRGEPPLKRLSDTKFRDKLDKGLCFRCNDKYSLGHRCKVREKREMMLLILNEEEGEMEEEPRVERGEGIVELKNLEVLVDAEVELRTI

Query:  MGFSTKGTMKILGKIKGREVIVLIDSGATHNFIHHNLVEELKLPVTPRTTFGVTIGDGSDRKGRGVCKRVKLNC--------------------------
           +TKGTMK+ G +KG+E+IVLIDSGATHNFIH+ LV+E K+P+   T F VTIGDG+  KG+G+C RV++                            
Subjt:  MGFSTKGTMKILGKIKGREVIVLIDSGATHNFIHHNLVEELKLPVTPRTTFGVTIGDGSDRKGRGVCKRVKLNC--------------------------

Query:  -TTGFIDIHWPSLTMLFMVGDKPIILKGDPALTRMECSLKTITKTWEEEDQAFLLKFQDREVEGELEEKLEQRKKGDERSCP
         TTG + +HWPSLTM+F      ++LKGDPAL R ECS KT+ KTW++EDQ FL+ +Q+ E E E +E   Q + GDE   P
Subjt:  -TTGFIDIHWPSLTMLFMVGDKPIILKGDPALTRMECSLKTITKTWEEEDQAFLLKFQDREVEGELEEKLEQRKKGDERSCP

XP_008448087.1 PREDICTED: uncharacterized protein LOC103490375 [Cucumis melo]; e value: 2.7e-76; %identity: 52.84
Query:  MKQVTIPIKGNYGRGEPPLKRLSDTKFRDKLDKGLCFRCNDKYSLGHRCKVREKREMMLLILNEEEGEMEEEPRVERGEGIVELKNLEVLVDAEVELRTI
        MKQ+TIPIKGN+ +GEPP+KRLSD +FR +LD+GLCFRCNDKYS GHRCK +EKRE+M  I+NEEE + E E   E  EG VELK LE+  D  +E++T+
Subjt:  MKQVTIPIKGNYGRGEPPLKRLSDTKFRDKLDKGLCFRCNDKYSLGHRCKVREKREMMLLILNEEEGEMEEEPRVERGEGIVELKNLEVLVDAEVELRTI

Query:  MGFSTKGTMKILGKIKGREVIVLIDSGATHNFIHHNLVEELKLPVTPRTTFGVTIGDGSDRKGRGVCKRVKLNC--------------------------
           S+KGTMKI G I+ +E+++LIDSGATHNFIH +LV +LKL +   T FG TIG+G+  KG+G+C+RV++                            
Subjt:  MGFSTKGTMKILGKIKGREVIVLIDSGATHNFIHHNLVEELKLPVTPRTTFGVTIGDGSDRKGRGVCKRVKLNC--------------------------

Query:  -TTGFIDIHWPSLTMLFMVGDKPIILKGDPALTRMECSLKTITKTWEEEDQAFLLKFQDREVEGELEEKLEQRKKGDERSCP
         TTG + IHWPSLTM F  G + IILKGDP+L R ECSL+T+ KTW+E+DQ FLL++ + EVE E   K ++++KGDE   P
Subjt:  -TTGFIDIHWPSLTMLFMVGDKPIILKGDPALTRMECSLKTITKTWEEEDQAFLLKFQDREVEGELEEKLEQRKKGDERSCP

XP_031745528.1 uncharacterized protein LOC116405915 [Cucumis sativus]; e value: 9.3e-77; %identity: 54.58
Query:  MKQVTIPIKGNYGRGEPPLKRLSDTKFRDKLDKGLCFRCNDKYSLGHRCKVREKREMMLLILNEEEGEMEEEPRVERGEGIVELKNLEVLVDAEVELRTI
        +KQVTIPIKGNY + EPP+KRLSD +FR +LDKGLCF+CN++YS GHRCK+++KRE+ML I+NEEE   +E+   E  E ++EL  L +    E+EL+ I
Subjt:  MKQVTIPIKGNYGRGEPPLKRLSDTKFRDKLDKGLCFRCNDKYSLGHRCKVREKREMMLLILNEEEGEMEEEPRVERGEGIVELKNLEVLVDAEVELRTI

Query:  MGFSTKGTMKILGKIKGREVIVLIDSGATHNFIHHNLVEELKLPVTPRTTFGVTIGDGSDRKGRGVCKRVKLNC--------------------------
         G ++KGTMKI G+IKG+EV++LIDSGATHNFIH+ +VEE+ L +   T FGVTIGDG+  +GRGVC R++L                            
Subjt:  MGFSTKGTMKILGKIKGREVIVLIDSGATHNFIHHNLVEELKLPVTPRTTFGVTIGDGSDRKGRGVCKRVKLNC--------------------------

Query:  -TTGFIDIHWPSLTMLFMVGDKPIILKGDPALTRMECSLKTITKTWEEEDQAFLLKFQDREVE--GELEEKLEQRKKGDERSCP
         TTG + IHWPSLTM F +G K  ILKGDP+L R ECSLKTI KTWEE+DQ FLL+ Q+ E E  GEL+E   QR KGDE   P
Subjt:  -TTGFIDIHWPSLTMLFMVGDKPIILKGDPALTRMECSLKTITKTWEEEDQAFLLKFQDREVE--GELEEKLEQRKKGDERSCP

TrEMBL top hits (e value, %identity, alignment)
A0A1S3BJT6 uncharacterized protein LOC103490375; e value: 1.3e-76; %identity: 52.84
Query:  MKQVTIPIKGNYGRGEPPLKRLSDTKFRDKLDKGLCFRCNDKYSLGHRCKVREKREMMLLILNEEEGEMEEEPRVERGEGIVELKNLEVLVDAEVELRTI
        MKQ+TIPIKGN+ +GEPP+KRLSD +FR +LD+GLCFRCNDKYS GHRCK +EKRE+M  I+NEEE + E E   E  EG VELK LE+  D  +E++T+
Subjt:  MKQVTIPIKGNYGRGEPPLKRLSDTKFRDKLDKGLCFRCNDKYSLGHRCKVREKREMMLLILNEEEGEMEEEPRVERGEGIVELKNLEVLVDAEVELRTI

Query:  MGFSTKGTMKILGKIKGREVIVLIDSGATHNFIHHNLVEELKLPVTPRTTFGVTIGDGSDRKGRGVCKRVKLNC--------------------------
           S+KGTMKI G I+ +E+++LIDSGATHNFIH +LV +LKL +   T FG TIG+G+  KG+G+C+RV++                            
Subjt:  MGFSTKGTMKILGKIKGREVIVLIDSGATHNFIHHNLVEELKLPVTPRTTFGVTIGDGSDRKGRGVCKRVKLNC--------------------------

Query:  -TTGFIDIHWPSLTMLFMVGDKPIILKGDPALTRMECSLKTITKTWEEEDQAFLLKFQDREVEGELEEKLEQRKKGDERSCP
         TTG + IHWPSLTM F  G + IILKGDP+L R ECSL+T+ KTW+E+DQ FLL++ + EVE E   K ++++KGDE   P
Subjt:  -TTGFIDIHWPSLTMLFMVGDKPIILKGDPALTRMECSLKTITKTWEEEDQAFLLKFQDREVEGELEEKLEQRKKGDERSCP

A0A5A7T8C0 Ty3/gypsy retrotransposon protein; e value: 8.5e-76; %identity: 52.48
Query:  MKQVTIPIKGNYGRGEPPLKRLSDTKFRDKLDKGLCFRCNDKYSLGHRCKVREKREMMLLILNEEEGEMEEEPRVERGEGIVELKNLEVLVDAEVELRTI
        MKQ+TIPIKGN+ +GEPP+KRLSD +FR +LD+GLCFRCNDKYS GHRCK +EKRE+M  I+NEEE + E +   E  EG VELK LE+  DA +EL+T+
Subjt:  MKQVTIPIKGNYGRGEPPLKRLSDTKFRDKLDKGLCFRCNDKYSLGHRCKVREKREMMLLILNEEEGEMEEEPRVERGEGIVELKNLEVLVDAEVELRTI

Query:  MGFSTKGTMKILGKIKGREVIVLIDSGATHNFIHHNLVEELKLPVTPRTTFGVTIGDGSDRKGRGVCKRVKLNC--------------------------
           S+K TMK+ G I+ +E+++LIDSGATHNFIH +L  +LKL +   T FG TIG+G+  KG+G+C+RV++                            
Subjt:  MGFSTKGTMKILGKIKGREVIVLIDSGATHNFIHHNLVEELKLPVTPRTTFGVTIGDGSDRKGRGVCKRVKLNC--------------------------

Query:  -TTGFIDIHWPSLTMLFMVGDKPIILKGDPALTRMECSLKTITKTWEEEDQAFLLKFQDREVEGELEEKLEQRKKGDERSCP
         TTG + IHWPSLTM F  G + IILKGDP+L R ECSL+T+ KTW+EEDQ FLL++ + EVE E   K ++++KGDE   P
Subjt:  -TTGFIDIHWPSLTMLFMVGDKPIILKGDPALTRMECSLKTITKTWEEEDQAFLLKFQDREVEGELEEKLEQRKKGDERSCP

A0A5A7UM77 Ty3/gypsy retrotransposon protein; e value: 5.0e-76; %identity: 51.77
Query:  MKQVTIPIKGNYGRGEPPLKRLSDTKFRDKLDKGLCFRCNDKYSLGHRCKVREKREMMLLILNEEEGEMEEEPRVERGEGIVELKNLEVLVDAEVELRTI
        MKQ+TIPIKGN+ +GEPP+KRLSD +FR +LD+GLCFRCNDKYS GHRCK +EKRE+M  I+NEEE + E + + E  EG VELK LE+  +A +EL+T+
Subjt:  MKQVTIPIKGNYGRGEPPLKRLSDTKFRDKLDKGLCFRCNDKYSLGHRCKVREKREMMLLILNEEEGEMEEEPRVERGEGIVELKNLEVLVDAEVELRTI

Query:  MGFSTKGTMKILGKIKGREVIVLIDSGATHNFIHHNLVEELKLPVTPRTTFGVTIGDGSDRKGRGVCKRVKLNC--------------------------
           S+KGTMK+ G I+ +EV+VLIDSGATHNFIHH+L  +L+L + P T FG TIG+G+  +G+G+C+RV++                            
Subjt:  MGFSTKGTMKILGKIKGREVIVLIDSGATHNFIHHNLVEELKLPVTPRTTFGVTIGDGSDRKGRGVCKRVKLNC--------------------------

Query:  -TTGFIDIHWPSLTMLFMVGDKPIILKGDPALTRMECSLKTITKTWEEEDQAFLLKFQDREVEGELEEKLEQRKKGDERSCP
         TTG + IHWPSLTM F  G + I+LKGDP+L R ECSL+T+ KTW+EEDQ FLL++ + +VE +   K +++++GDE   P
Subjt:  -TTGFIDIHWPSLTMLFMVGDKPIILKGDPALTRMECSLKTITKTWEEEDQAFLLKFQDREVEGELEEKLEQRKKGDERSCP

A0A5D3DZ80 Retrotransposable element Tf2; e value: 8.5e-76; %identity: 51.77
Query:  MKQVTIPIKGNYGRGEPPLKRLSDTKFRDKLDKGLCFRCNDKYSLGHRCKVREKREMMLLILNEEEGEMEEEPRVERGEGIVELKNLEVLVDAEVELRTI
        MKQ+TIP+KG+Y +GEPP+KRLSD +FR +LDKGLCFRCN+KYS GHRCK++EKRE+ML ILNEEE   E E    +    +E+K LE L +A +E R I
Subjt:  MKQVTIPIKGNYGRGEPPLKRLSDTKFRDKLDKGLCFRCNDKYSLGHRCKVREKREMMLLILNEEEGEMEEEPRVERGEGIVELKNLEVLVDAEVELRTI

Query:  MGFSTKGTMKILGKIKGREVIVLIDSGATHNFIHHNLVEELKLPVTPRTTFGVTIGDGSDRKGRGVCKRVKLNC--------------------------
           +TKGTMK+ G +KG+E+IVLIDSGATHNFIH+ LV+E K+P+   T F VTIGDG+  KG+G+C RV++                            
Subjt:  MGFSTKGTMKILGKIKGREVIVLIDSGATHNFIHHNLVEELKLPVTPRTTFGVTIGDGSDRKGRGVCKRVKLNC--------------------------

Query:  -TTGFIDIHWPSLTMLFMVGDKPIILKGDPALTRMECSLKTITKTWEEEDQAFLLKFQDREVEGELEEKLEQRKKGDERSCP
         TTG + +HWPSLTM+F      ++LKGDPAL R ECS KT+ KTW++EDQ FL+ +Q+ E E E +E   Q + GDE   P
Subjt:  -TTGFIDIHWPSLTMLFMVGDKPIILKGDPALTRMECSLKTITKTWEEEDQAFLLKFQDREVEGELEEKLEQRKKGDERSCP

A0A5D3E325 Ty3/gypsy retrotransposon protein; e value: 1.9e-75; %identity: 52.48
Query:  MKQVTIPIKGNYGRGEPPLKRLSDTKFRDKLDKGLCFRCNDKYSLGHRCKVREKREMMLLILNEEEGEMEEEPRVERGEGIVELKNLEVLVDAEVELRTI
        MKQ+TIP+KG+Y +GEPP+KRLSD +FR +LDKGLCFRCN+KYS GHRCK++EKRE+ML ILNEEE   E E       G VE+  LE   +  +E R I
Subjt:  MKQVTIPIKGNYGRGEPPLKRLSDTKFRDKLDKGLCFRCNDKYSLGHRCKVREKREMMLLILNEEEGEMEEEPRVERGEGIVELKNLEVLVDAEVELRTI

Query:  MGFSTKGTMKILGKIKGREVIVLIDSGATHNFIHHNLVEELKLPVTPRTTFGVTIGDGSDRKGRGVCKRVKLNC--------------------------
           +TKGTMK+ G +KG+EVIVLIDSGATHNFIHH LV E K+P+   T FG+TIGDG+  KG G+C +V++                            
Subjt:  MGFSTKGTMKILGKIKGREVIVLIDSGATHNFIHHNLVEELKLPVTPRTTFGVTIGDGSDRKGRGVCKRVKLNC--------------------------

Query:  -TTGFIDIHWPSLTMLFMVGDKPIILKGDPALTRMECSLKTITKTWEEEDQAFLLKFQDREVEGELEEKLEQRKKGDERSCP
         TTG + IHWPSLTM+F      ++LKGDPAL R ECSLKT+ KTWE EDQ FLL +Q  E+E E  +     + GDE   P
Subjt:  -TTGFIDIHWPSLTMLFMVGDKPIILKGDPALTRMECSLKTITKTWEEEDQAFLLKFQDREVEGELEEKLEQRKKGDERSCP

SwissProt top hits (e value, %identity, alignment)
No hits found
Arabidopsis top hits (e value, %identity, alignment)
AT3G29750.1 Eukaryotic aspartyl protease family protein; e value: 2.9e-04; %identity: 30.1
Query:  VERGEGIVELKNLEVLVDAEVELR-----TIMGFSTKGTMKILGKIKGREVIVLIDSGATHNFIHHNLVEELKLPVTPRTTFGVTIGDGSDRKGRGVCKR
        V++ +G++    LE L      LR      ++  +    M+  G I   +V+V IDSGAT NFI   L   LKLP +      V +G     +  G C  
Subjt:  VERGEGIVELKNLEVLVDAEVELR-----TIMGFSTKGTMKILGKIKGREVIVLIDSGATHNFIHHNLVEELKLPVTPRTTFGVTIGDGSDRKGRGVCKR

Query:  VKL
        ++L
Subjt:  VKL

AT3G30770.1 Eukaryotic aspartyl protease family protein; e value: 3.5e-05; %identity: 32.91
Query:  EVELRTIMGFSTKGTMKILGKIKGREVIVLIDSGATHNFIHHNLVEELKLPVTPRTTFGVTIGDGSDRKGRGVCKRVKL
        +V+ ++   F+    M+  G I   +V+V+IDSGAT+NFI   L   LKLP +      V +G     +  G C  + L
Subjt:  EVELRTIMGFSTKGTMKILGKIKGREVIVLIDSGATHNFIHHNLVEELKLPVTPRTTFGVTIGDGSDRKGRGVCKRVKL
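
The %identity figure reported for each hit can be related to the middle line of its alignment. The sketch below assumes the usual BLAST-style convention: a letter on the midline marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap, with identity taken as identical positions divided by the number of aligned columns. The function name `percent_identity` is hypothetical; the strings are copied from the AT3G30770.1 alignment above.

```python
def percent_identity(query, midline):
    """Return (identities, alignment_length, percent) for one alignment block."""
    # A letter on the midline marks an identical residue; '+' and blanks do not count.
    identities = sum(ch.isalpha() for ch in midline)
    # The query row covers every aligned column, including any gap characters.
    length = len(query)
    return identities, length, 100.0 * identities / length

query = "EVELRTIMGFSTKGTMKILGKIKGREVIVLIDSGATHNFIHHNLVEELKLPVTPRTTFGVTIGDGSDRKGRGVCKRVKL"
midline = "+V+ ++   F+    M+  G I   +V+V+IDSGAT+NFI   L   LKLP +      V +G     +  G C  + L"

ident, length, pct = percent_identity(query, midline)
print(ident, length, round(pct, 2))
```

For this hit the count is 26 identical residues over 79 aligned columns, which agrees with the 32.91 listed for AT3G30770.1.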


Sequences
CDS sequence
ATGAAACAGGTTACTATTCCAATCAAGGGGAACTATGGAAGGGGAGAACCACCACTTAAAAGACTGTCCGATACGAAATTCAGGGATAAATTGGATAAGGGTCTGTGTTT
TAGATGTAATGACAAGTACTCACTAGGGCATCGCTGTAAAGTGAGAGAAAAAAGAGAAATGATGCTACTCATCTTGAACGAAGAGGAGGGAGAGATGGAGGAAGAACCCA
GGGTGGAAAGAGGAGAAGGAATCGTGGAGTTGAAAAATTTGGAGGTCCTAGTGGATGCTGAGGTAGAATTGAGAACGATTATGGGATTTTCAACAAAGGGCACTATGAAG
ATACTAGGAAAGATTAAGGGTAGGGAGGTCATTGTGCTGATTGACAGTGGAGCCACCCACAATTTCATTCACCATAATTTGGTGGAGGAATTAAAACTACCTGTTACCCC
AAGGACTACCTTTGGAGTAACAATAGGAGATGGATCTGATCGAAAAGGAAGGGGAGTTTGCAAAAGAGTGAAGTTAAACTGCACAACTGGCTTCATAGACATCCACTGGC
CATCACTGACCATGTTGTTCATGGTTGGGGATAAGCCTATTATTCTGAAAGGGGATCCAGCTTTAACAAGGATGGAATGTTCTTTGAAGACGATTACTAAAACATGGGAG
GAAGAAGACCAAGCTTTTCTCCTTAAATTCCAGGATAGGGAGGTAGAAGGGGAACTTGAAGAAAAGTTAGAGCAGAGGAAAAAGGGGGATGAAAGGAGTTGCCCATGA
mRNA sequence
ATGAAACAGGTTACTATTCCAATCAAGGGGAACTATGGAAGGGGAGAACCACCACTTAAAAGACTGTCCGATACGAAATTCAGGGATAAATTGGATAAGGGTCTGTGTTT
TAGATGTAATGACAAGTACTCACTAGGGCATCGCTGTAAAGTGAGAGAAAAAAGAGAAATGATGCTACTCATCTTGAACGAAGAGGAGGGAGAGATGGAGGAAGAACCCA
GGGTGGAAAGAGGAGAAGGAATCGTGGAGTTGAAAAATTTGGAGGTCCTAGTGGATGCTGAGGTAGAATTGAGAACGATTATGGGATTTTCAACAAAGGGCACTATGAAG
ATACTAGGAAAGATTAAGGGTAGGGAGGTCATTGTGCTGATTGACAGTGGAGCCACCCACAATTTCATTCACCATAATTTGGTGGAGGAATTAAAACTACCTGTTACCCC
AAGGACTACCTTTGGAGTAACAATAGGAGATGGATCTGATCGAAAAGGAAGGGGAGTTTGCAAAAGAGTGAAGTTAAACTGCACAACTGGCTTCATAGACATCCACTGGC
CATCACTGACCATGTTGTTCATGGTTGGGGATAAGCCTATTATTCTGAAAGGGGATCCAGCTTTAACAAGGATGGAATGTTCTTTGAAGACGATTACTAAAACATGGGAG
GAAGAAGACCAAGCTTTTCTCCTTAAATTCCAGGATAGGGAGGTAGAAGGGGAACTTGAAGAAAAGTTAGAGCAGAGGAAAAAGGGGGATGAAAGGAGTTGCCCATGA
Protein sequence
MKQVTIPIKGNYGRGEPPLKRLSDTKFRDKLDKGLCFRCNDKYSLGHRCKVREKREMMLLILNEEEGEMEEEPRVERGEGIVELKNLEVLVDAEVELRTIMGFSTKGTMK
ILGKIKGREVIVLIDSGATHNFIHHNLVEELKLPVTPRTTFGVTIGDGSDRKGRGVCKRVKLNCTTGFIDIHWPSLTMLFMVGDKPIILKGDPALTRMECSLKTITKTWE
EEDQAFLLKFQDREVEGELEEKLEQRKKGDERSCP
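
The protein sequence above can be checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code and reading frame 1; the names `CODON_TABLE` and `translate` are illustrative, and only the first 30 and last 12 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGAAACAGGTTACTATTCCAATCAAGGGG"  # first 30 nt of the CDS
cds_end = "AGTTGCCCATGA"                      # last 12 nt of the CDS
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first ten and last three residues of the listed protein (MKQVTIPIKG and SCP); passing the full CDS to `translate` would allow the whole sequence to be compared in the same way.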