; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g20160 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g20160
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionCACTA en-spm transposon protein
Genome locationchr3:13612467..13618157
RNA-Seq ExpressionMoc03g20160
SyntenyMoc03g20160
Gene Ontology termsNA
InterPro domainsIPR004252 - Probable transposase, Ptta/En/Spm, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK09483.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]4.7e-1724.76Show/hide
Query:  DFNESDRVMMKFIEHEMRSTHKAYRAKLRQHYLSYPKPEIARNNPPKRLLAVGSIRFCTVQRRRPKHSSLLIKLSQSNSPSLVPKKRSHKRVLEIQEDIK
        DFN  D+ M +F+EH+M +T K +RA   +H+  Y  PE AR NPP  L                                                   
Subjt:  DFNESDRVMMKFIEHEMRSTHKAYRAKLRQHYLSYPKPEIARNNPPKRLLAVGSIRFCTVQRRRPKHSSLLIKLSQSNSPSLVPKKRSHKRVLEIQEDIK

Query:  EDRLVVFVGKSLKKHPLPLRDWHPFMAKPDDWNMLCDRWETDEWKQKYETNKCNRAKMSFNHRAGLKALAIIAKEKKEEEGGDNFSKIDLFRETRYSDTK
                                 + + +DW+ LCD + +  ++++  TNK  R K  +NH +G K+  +  + +  E  G    +++LFRET      
Subjt:  EDRLVVFVGKSLKKHPLPLRDWHPFMAKPDDWNMLCDRWETDEWKQKYETNKCNRAKMSFNHRAGLKALAIIAKEKKEEEGGDNFSKIDLFRETRYSDTK

Query:  GWIEGVETACLDMMRVREESIQDGGEPMPDPEVLETVLGYRLGYVKSAGWGPKPKLVEDRFTQKNKELEDCLSQKDKKLEDRLSQKDKELDERLAAKDSE
           +  E A   M+ ++ + I +G +P+ + E+ + VLG R GY K  GWGPKPK    R    +     C     K++E  L  K  E  ER+  +D  
Subjt:  GWIEGVETACLDMMRVREESIQDGGEPMPDPEVLETVLGYRLGYVKSAGWGPKPKLVEDRFTQKNKELEDCLSQKDKKLEDRLSQKDKELDERLAAKDSE

Query:  IMGLHGEMSYLRLMVLQLI
           L  ++  ++ M+  LI
Subjt:  IMGLHGEMSYLRLMVLQLI

XP_022156286.1 uncharacterized protein LOC111023212 [Momordica charantia]1.2e-2037.14Show/hide
Query:  MAKPDDWNMLCDRWETDEWKQKYETNKCNRAKMSFNHRAGLKALAIIAKEKKEEEGGDNFSKIDLFRETRYSDTKGWI-EGVETACLDMMRVREESIQDG
        +  P+DWN LCDRWET EWK+  + NK NRAK+ FNHRAG K+   +  E K +EG D    +DLF E+ Y++  G + +  E A  +M  + +   Q+G
Subjt:  MAKPDDWNMLCDRWETDEWKQKYETNKCNRAKMSFNHRAGLKALAIIAKEKKEEEGGDNFSKIDLFRETRYSDTKGWI-EGVETACLDMMRVREESIQDG

Query:  GEPMPDPEVLETVLGYRLGYVKSAGWGPKPKLVEDRFTQKNKELEDCLSQKDKKLEDRLSQKDKELDERLAAKDS
         EP+  PE    VLG R  +VK  G+GP+P L +   +           + +KK+ED   +  +   E    K+S
Subjt:  GEPMPDPEVLETVLGYRLGYVKSAGWGPKPKLVEDRFTQKNKELEDCLSQKDKKLEDRLSQKDKELDERLAAKDS

XP_038887408.1 poly [ADP-ribose] polymerase 1-like isoform X1 [Benincasa hispida]3.6e-2538.46Show/hide
Query:  VPKKRSHKRVLE-IQEDIKEDRLVVFVGKSLKKHPLPLRDWHP-FMAKPDDWNMLCDRWETDEWKQKYETNKCNRAKMSFNHRAGLKALAIIAKEKKEEE
        V KK   K VL+ +Q   KE R  ++      K P   R   P  +    DWN+LC+RWET EWK+K ETNK +R+K+ + HR G K+   +  E K +E
Subjt:  VPKKRSHKRVLE-IQEDIKEDRLVVFVGKSLKKHPLPLRDWHP-FMAKPDDWNMLCDRWETDEWKQKYETNKCNRAKMSFNHRAGLKALAIIAKEKKEEE

Query:  GGDNFSKIDLFRETRYSDTKGWI-EGVETACLDMMRVREESIQDGGEPMPDPEVLETVLGYRLGYVKSAGWGPKPKLVED--RFTQKNKELEDCLSQKDK
        G D   ++DLFR++ + +  GW+    + A L+M R+ E S Q+   PM   EV + VLG+R GY+K  G  PKP        + Q  KELE    +K +
Subjt:  GGDNFSKIDLFRETRYSDTKGWI-EGVETACLDMMRVREESIQDGGEPMPDPEVLETVLGYRLGYVKSAGWGPKPKLVED--RFTQKNKELEDCLSQKDK

Query:  KLEDRLSQ
        K+ED + Q
Subjt:  KLEDRLSQ

XP_038887409.1 poly [ADP-ribose] polymerase 1-like isoform X2 [Benincasa hispida]3.6e-2538.46Show/hide
Query:  VPKKRSHKRVLE-IQEDIKEDRLVVFVGKSLKKHPLPLRDWHP-FMAKPDDWNMLCDRWETDEWKQKYETNKCNRAKMSFNHRAGLKALAIIAKEKKEEE
        V KK   K VL+ +Q   KE R  ++      K P   R   P  +    DWN+LC+RWET EWK+K ETNK +R+K+ + HR G K+   +  E K +E
Subjt:  VPKKRSHKRVLE-IQEDIKEDRLVVFVGKSLKKHPLPLRDWHP-FMAKPDDWNMLCDRWETDEWKQKYETNKCNRAKMSFNHRAGLKALAIIAKEKKEEE

Query:  GGDNFSKIDLFRETRYSDTKGWI-EGVETACLDMMRVREESIQDGGEPMPDPEVLETVLGYRLGYVKSAGWGPKPKLVED--RFTQKNKELEDCLSQKDK
        G D   ++DLFR++ + +  GW+    + A L+M R+ E S Q+   PM   EV + VLG+R GY+K  G  PKP        + Q  KELE    +K +
Subjt:  GGDNFSKIDLFRETRYSDTKGWI-EGVETACLDMMRVREESIQDGGEPMPDPEVLETVLGYRLGYVKSAGWGPKPKLVED--RFTQKNKELEDCLSQKDK

Query:  KLEDRLSQ
        K+ED + Q
Subjt:  KLEDRLSQ

XP_038887413.1 uncharacterized protein LOC120077557 isoform X5 [Benincasa hispida]3.6e-2538.46Show/hide
Query:  VPKKRSHKRVLE-IQEDIKEDRLVVFVGKSLKKHPLPLRDWHP-FMAKPDDWNMLCDRWETDEWKQKYETNKCNRAKMSFNHRAGLKALAIIAKEKKEEE
        V KK   K VL+ +Q   KE R  ++      K P   R   P  +    DWN+LC+RWET EWK+K ETNK +R+K+ + HR G K+   +  E K +E
Subjt:  VPKKRSHKRVLE-IQEDIKEDRLVVFVGKSLKKHPLPLRDWHP-FMAKPDDWNMLCDRWETDEWKQKYETNKCNRAKMSFNHRAGLKALAIIAKEKKEEE

Query:  GGDNFSKIDLFRETRYSDTKGWI-EGVETACLDMMRVREESIQDGGEPMPDPEVLETVLGYRLGYVKSAGWGPKPKLVED--RFTQKNKELEDCLSQKDK
        G D   ++DLFR++ + +  GW+    + A L+M R+ E S Q+   PM   EV + VLG+R GY+K  G  PKP        + Q  KELE    +K +
Subjt:  GGDNFSKIDLFRETRYSDTKGWI-EGVETACLDMMRVREESIQDGGEPMPDPEVLETVLGYRLGYVKSAGWGPKPKLVED--RFTQKNKELEDCLSQKDK

Query:  KLEDRLSQ
        K+ED + Q
Subjt:  KLEDRLSQ

TrEMBL top hitse value%identityAlignment
A0A5A7T4P0 CACTA en-spm transposon protein3.9e-1724.53Show/hide
Query:  DFNESDRVMMKFIEHEMRSTHKAYRAKLRQHYLSYPKPEIARNNPPKRLLAVGSIRFCTVQRRRPKHSSLLIKLSQSNSPSLVPKKRSHKRVLEIQEDIK
        DFN  D+ M +F+EH+M +T K +RA   +H+  Y  PE AR NPP  L                                                   
Subjt:  DFNESDRVMMKFIEHEMRSTHKAYRAKLRQHYLSYPKPEIARNNPPKRLLAVGSIRFCTVQRRRPKHSSLLIKLSQSNSPSLVPKKRSHKRVLEIQEDIK

Query:  EDRLVVFVGKSLKKHPLPLRDWHPFMAKPDDWNMLCDRWETDEWKQKYETNKCNRAKMSFNHRAGLKALAIIAKEKKEEEGGDNFSKIDLFRETRYSDTK
                                 + + +DW+ LCD + +  ++++  TNK  R K  +NH +G K+  +  + +  E  G    +++LFRET      
Subjt:  EDRLVVFVGKSLKKHPLPLRDWHPFMAKPDDWNMLCDRWETDEWKQKYETNKCNRAKMSFNHRAGLKALAIIAKEKKEEEGGDNFSKIDLFRETRYSDTK

Query:  GWIEGVETACLDMMRVREESIQDGGEPMPDPEVLETVLGYRLGYVKSAGWGPKPKLVEDRFTQKNKELEDCLSQKDKKLEDRLSQKDKELDERLAAKDSE
           +  E A   M+ ++ + I +G +P+ + E+ + VLG R GY K  GWGPKPK    R    +     C     K++E  L  K  E  ER+  +D  
Subjt:  GWIEGVETACLDMMRVREESIQDGGEPMPDPEVLETVLGYRLGYVKSAGWGPKPKLVEDRFTQKNKELEDCLSQKDKKLEDRLSQKDKELDERLAAKDSE

Query:  IMGLHGEMSYLRLMVLQL
           L  ++  ++ M+  L
Subjt:  IMGLHGEMSYLRLMVLQL

A0A5D3C6Z8 CACTA en-spm transposon protein3.9e-1724.53Show/hide
Query:  DFNESDRVMMKFIEHEMRSTHKAYRAKLRQHYLSYPKPEIARNNPPKRLLAVGSIRFCTVQRRRPKHSSLLIKLSQSNSPSLVPKKRSHKRVLEIQEDIK
        DFN  D+ M +F+EH+M +T K +RA   +H+  Y  PE AR NPP  L                                                   
Subjt:  DFNESDRVMMKFIEHEMRSTHKAYRAKLRQHYLSYPKPEIARNNPPKRLLAVGSIRFCTVQRRRPKHSSLLIKLSQSNSPSLVPKKRSHKRVLEIQEDIK

Query:  EDRLVVFVGKSLKKHPLPLRDWHPFMAKPDDWNMLCDRWETDEWKQKYETNKCNRAKMSFNHRAGLKALAIIAKEKKEEEGGDNFSKIDLFRETRYSDTK
                                 + + +DW+ LCD + +  ++++  TNK  R K  +NH +G K+  +  + +  E  G    +++LFRET      
Subjt:  EDRLVVFVGKSLKKHPLPLRDWHPFMAKPDDWNMLCDRWETDEWKQKYETNKCNRAKMSFNHRAGLKALAIIAKEKKEEEGGDNFSKIDLFRETRYSDTK

Query:  GWIEGVETACLDMMRVREESIQDGGEPMPDPEVLETVLGYRLGYVKSAGWGPKPKLVEDRFTQKNKELEDCLSQKDKKLEDRLSQKDKELDERLAAKDSE
           +  E A   M+ ++ + I +G +P+ + E+ + VLG R GY K  GWGPKPK    R    +     C     K++E  L  K  E  ER+  +D  
Subjt:  GWIEGVETACLDMMRVREESIQDGGEPMPDPEVLETVLGYRLGYVKSAGWGPKPKLVEDRFTQKNKELEDCLSQKDKKLEDRLSQKDKELDERLAAKDSE

Query:  IMGLHGEMSYLRLMVLQL
           L  ++  ++ M+  L
Subjt:  IMGLHGEMSYLRLMVLQL

A0A5D3CCA5 CACTA en-spm transposon protein2.3e-1724.76Show/hide
Query:  DFNESDRVMMKFIEHEMRSTHKAYRAKLRQHYLSYPKPEIARNNPPKRLLAVGSIRFCTVQRRRPKHSSLLIKLSQSNSPSLVPKKRSHKRVLEIQEDIK
        DFN  D+ M +F+EH+M +T K +RA   +H+  Y  PE AR NPP  L                                                   
Subjt:  DFNESDRVMMKFIEHEMRSTHKAYRAKLRQHYLSYPKPEIARNNPPKRLLAVGSIRFCTVQRRRPKHSSLLIKLSQSNSPSLVPKKRSHKRVLEIQEDIK

Query:  EDRLVVFVGKSLKKHPLPLRDWHPFMAKPDDWNMLCDRWETDEWKQKYETNKCNRAKMSFNHRAGLKALAIIAKEKKEEEGGDNFSKIDLFRETRYSDTK
                                 + + +DW+ LCD + +  ++++  TNK  R K  +NH +G K+  +  + +  E  G    +++LFRET      
Subjt:  EDRLVVFVGKSLKKHPLPLRDWHPFMAKPDDWNMLCDRWETDEWKQKYETNKCNRAKMSFNHRAGLKALAIIAKEKKEEEGGDNFSKIDLFRETRYSDTK

Query:  GWIEGVETACLDMMRVREESIQDGGEPMPDPEVLETVLGYRLGYVKSAGWGPKPKLVEDRFTQKNKELEDCLSQKDKKLEDRLSQKDKELDERLAAKDSE
           +  E A   M+ ++ + I +G +P+ + E+ + VLG R GY K  GWGPKPK    R    +     C     K++E  L  K  E  ER+  +D  
Subjt:  GWIEGVETACLDMMRVREESIQDGGEPMPDPEVLETVLGYRLGYVKSAGWGPKPKLVEDRFTQKNKELEDCLSQKDKKLEDRLSQKDKELDERLAAKDSE

Query:  IMGLHGEMSYLRLMVLQLI
           L  ++  ++ M+  LI
Subjt:  IMGLHGEMSYLRLMVLQLI

A0A5D3D5L1 CACTA en-spm transposon protein3.0e-1724.13Show/hide
Query:  DFNESDRVMMKFIEHEMRSTHKAYRAKLRQHYLSYPKPEIARNNPPKRLLAVGSIRFCTVQRRRPKHSSLLIKLSQSNSPSLVPKKRSHKRVLEIQEDIK
        DFN  D+ M +F+EH+M +T K +RA   +H+  Y  PE AR NPP  L                                                   
Subjt:  DFNESDRVMMKFIEHEMRSTHKAYRAKLRQHYLSYPKPEIARNNPPKRLLAVGSIRFCTVQRRRPKHSSLLIKLSQSNSPSLVPKKRSHKRVLEIQEDIK

Query:  EDRLVVFVGKSLKKHPLPLRDWHPFMAKPDDWNMLCDRWETDEWKQKYETNKCNRAKMSFNHRAGLKALAIIAKEKKEEEGGDNFSKIDLFRETRYSDTK
                                 + + +DW+ LCD + +  ++++  TNK  R K  +NH +G K+  +  + +  E  G++  +++LFRET      
Subjt:  EDRLVVFVGKSLKKHPLPLRDWHPFMAKPDDWNMLCDRWETDEWKQKYETNKCNRAKMSFNHRAGLKALAIIAKEKKEEEGGDNFSKIDLFRETRYSDTK

Query:  GWIEGVETACLDMMRVREESIQDGGEPMPDPEVLETVLGYRLGYVKSAGWGPKPKLVEDRFTQKNKELEDCLSQKDKKLEDRLSQKDKELDERLAAKDSE
           +  E A   M+ ++ +   +G +P+ + E+ + VLG R GY K  GWGPKPK    R    +     C    +K++E  L  K +E  ER+  +D  
Subjt:  GWIEGVETACLDMMRVREESIQDGGEPMPDPEVLETVLGYRLGYVKSAGWGPKPKLVEDRFTQKNKELEDCLSQKDKKLEDRLSQKDKELDERLAAKDSE

Query:  IMGLHGEMSYLRLMV
           L  ++  ++ M+
Subjt:  IMGLHGEMSYLRLMV

A0A6J1DUH3 uncharacterized protein LOC1110232125.8e-2137.14Show/hide
Query:  MAKPDDWNMLCDRWETDEWKQKYETNKCNRAKMSFNHRAGLKALAIIAKEKKEEEGGDNFSKIDLFRETRYSDTKGWI-EGVETACLDMMRVREESIQDG
        +  P+DWN LCDRWET EWK+  + NK NRAK+ FNHRAG K+   +  E K +EG D    +DLF E+ Y++  G + +  E A  +M  + +   Q+G
Subjt:  MAKPDDWNMLCDRWETDEWKQKYETNKCNRAKMSFNHRAGLKALAIIAKEKKEEEGGDNFSKIDLFRETRYSDTKGWI-EGVETACLDMMRVREESIQDG

Query:  GEPMPDPEVLETVLGYRLGYVKSAGWGPKPKLVEDRFTQKNKELEDCLSQKDKKLEDRLSQKDKELDERLAAKDS
         EP+  PE    VLG R  +VK  G+GP+P L +   +           + +KK+ED   +  +   E    K+S
Subjt:  GEPMPDPEVLETVLGYRLGYVKSAGWGPKPKLVEDRFTQKNKELEDCLSQKDKKLEDRLSQKDKELDERLAAKDS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGTCCGTCCATCCCCGTCACCGCTCGACCTGTGACGTCCATCCATCCCCGTCGCCGCCTGGTCCCGCACCCTCCCTTCTTCTATCTAACGTTGAGGGAGCGCCGCC
GCACCCGCTTCGCCGTCGCCCGCACTTGTTTCGCCACCGTGCCGACGCCCCAACAACCCGTGGCTTGAGACGTGAAACTCCTTCGCACAATAAATTAAGTAGAGAATCCA
ATGTTATGAATAATGAGGATGGTGATATGTCTGGACTTCTCAATGATTTACAGTACCCCATGGATCATTTCGATTTCAACGAATCAGATCGAGTTATGATGAAATTCATC
GAGCATGAGATGAGATCGACACATAAGGCGTATAGGGCAAAATTGCGCCAACACTACCTGAGTTATCCCAAACCTGAGATTGCGCGCAACAATCCTCCGAAGCGGTTGTT
GGCTGTTGGGTCCATTAGGTTCTGCACAGTACAGAGGAGAAGACCCAAGCATTCTTCTCTCTTGATAAAGCTCTCTCAATCAAACTCTCCCTCTCTCGTTCCAAAGAAAC
GCTCCCACAAGCGTGTTCTCGAAATCCAAGAGGATATCAAGGAAGATCGTTTGGTGGTGTTCGTTGGGAAATCGTTGAAGAAACATCCGCTTCCACTTCGGGATTGGCAT
CCCTTCATGGCAAAACCAGACGATTGGAACATGTTGTGCGATAGATGGGAGACAGATGAGTGGAAGCAAAAATATGAGACCAACAAATGCAACAGGGCGAAGATGTCATT
CAACCATCGTGCGGGGCTGAAGGCGCTTGCTATTATTGCAAAGGAGAAGAAAGAAGAAGAGGGAGGTGACAACTTTTCTAAAATCGATTTGTTCAGAGAGACGAGATACT
CTGACACTAAAGGTTGGATCGAAGGAGTGGAGACAGCTTGTCTAGACATGATGCGTGTTAGAGAAGAATCTATACAAGATGGAGGCGAACCGATGCCAGACCCAGAAGTA
TTGGAAACAGTTCTTGGTTATCGATTAGGCTACGTTAAGAGTGCTGGTTGGGGCCCAAAACCAAAGCTCGTAGAGGACCGTTTCACGCAGAAAAACAAAGAGCTTGAGGA
TTGTCTCTCTCAGAAAGACAAAAAGCTTGAGGATCGTCTCTCTCAGAAAGACAAGGAGTTAGATGAGCGTCTCGCAGCCAAAGACTCTGAGATTATGGGTCTACACGGTG
AAATGTCATATTTGAGATTAATGGTGCTACAACTAATTAATGAGTCGGTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGACGTCCGTCCATCCCCGTCACCGCTCGACCTGTGACGTCCATCCATCCCCGTCGCCGCCTGGTCCCGCACCCTCCCTTCTTCTATCTAACGTTGAGGGAGCGCCGCC
GCACCCGCTTCGCCGTCGCCCGCACTTGTTTCGCCACCGTGCCGACGCCCCAACAACCCGTGGCTTGAGACGTGAAACTCCTTCGCACAATAAATTAAGTAGAGAATCCA
ATGTTATGAATAATGAGGATGGTGATATGTCTGGACTTCTCAATGATTTACAGTACCCCATGGATCATTTCGATTTCAACGAATCAGATCGAGTTATGATGAAATTCATC
GAGCATGAGATGAGATCGACACATAAGGCGTATAGGGCAAAATTGCGCCAACACTACCTGAGTTATCCCAAACCTGAGATTGCGCGCAACAATCCTCCGAAGCGGTTGTT
GGCTGTTGGGTCCATTAGGTTCTGCACAGTACAGAGGAGAAGACCCAAGCATTCTTCTCTCTTGATAAAGCTCTCTCAATCAAACTCTCCCTCTCTCGTTCCAAAGAAAC
GCTCCCACAAGCGTGTTCTCGAAATCCAAGAGGATATCAAGGAAGATCGTTTGGTGGTGTTCGTTGGGAAATCGTTGAAGAAACATCCGCTTCCACTTCGGGATTGGCAT
CCCTTCATGGCAAAACCAGACGATTGGAACATGTTGTGCGATAGATGGGAGACAGATGAGTGGAAGCAAAAATATGAGACCAACAAATGCAACAGGGCGAAGATGTCATT
CAACCATCGTGCGGGGCTGAAGGCGCTTGCTATTATTGCAAAGGAGAAGAAAGAAGAAGAGGGAGGTGACAACTTTTCTAAAATCGATTTGTTCAGAGAGACGAGATACT
CTGACACTAAAGGTTGGATCGAAGGAGTGGAGACAGCTTGTCTAGACATGATGCGTGTTAGAGAAGAATCTATACAAGATGGAGGCGAACCGATGCCAGACCCAGAAGTA
TTGGAAACAGTTCTTGGTTATCGATTAGGCTACGTTAAGAGTGCTGGTTGGGGCCCAAAACCAAAGCTCGTAGAGGACCGTTTCACGCAGAAAAACAAAGAGCTTGAGGA
TTGTCTCTCTCAGAAAGACAAAAAGCTTGAGGATCGTCTCTCTCAGAAAGACAAGGAGTTAGATGAGCGTCTCGCAGCCAAAGACTCTGAGATTATGGGTCTACACGGTG
AAATGTCATATTTGAGATTAATGGTGCTACAACTAATTAATGAGTCGGTCTAG
Protein sequenceShow/hide protein sequence
MTSVHPRHRSTCDVHPSPSPPGPAPSLLLSNVEGAPPHPLRRRPHLFRHRADAPTTRGLRRETPSHNKLSRESNVMNNEDGDMSGLLNDLQYPMDHFDFNESDRVMMKFI
EHEMRSTHKAYRAKLRQHYLSYPKPEIARNNPPKRLLAVGSIRFCTVQRRRPKHSSLLIKLSQSNSPSLVPKKRSHKRVLEIQEDIKEDRLVVFVGKSLKKHPLPLRDWH
PFMAKPDDWNMLCDRWETDEWKQKYETNKCNRAKMSFNHRAGLKALAIIAKEKKEEEGGDNFSKIDLFRETRYSDTKGWIEGVETACLDMMRVREESIQDGGEPMPDPEV
LETVLGYRLGYVKSAGWGPKPKLVEDRFTQKNKELEDCLSQKDKKLEDRLSQKDKELDERLAAKDSEIMGLHGEMSYLRLMVLQLINESV