CuGenDBv2

Clc10G21870 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc10G21870
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: UPF0098 protein
Genome location: ClcChr10:35007313..35008432
RNA-Seq Expression: Clc10G21870
Synteny: Clc10G21870
Gene Ontology terms: NA
InterPro domains:
  IPR005247 - YbhB/YbcL
  IPR008914 - Phosphatidylethanolamine-binding protein
  IPR036610 - PEBP-like superfamily
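
The genome location string in the record can be split into its parts to get the span of the gene on the chromosome. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location: str):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


chrom, start, end = parse_location("ClcChr10:35007313..35008432")
print(chrom, span_length(start, end))
```

Under that assumption the genomic span is longer than the 504-nucleotide CDS listed in the record, which would be consistent with the gene containing non-coding sequence such as introns.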


Homology
GenBank top hits | e value | % identity
XP_004137699.1 uncharacterized protein LOC101207571 [Cucumis sativus] | 8.5e-87 | 92.81
Query:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDY
        M D+GEFRL SSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLP+GTKTLALVVQDIDAPDP+GPIVPWTVWVVVNIPPTLKGLPEDFSGNQ+GLGGDY
Subjt:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDY

Query:  AAIQEGNNDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF
        A IQEGNND+KVPGWRAPTLPSHGHRFEFKL ALDDHLNLGNK TK+KLL+AIEGHVLGEAVLMAVF
Subjt:  AAIQEGNNDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF

XP_008442384.1 PREDICTED: UPF0098 protein TC_0109 [Cucumis melo] | 1.2e-88 | 95.81
Query:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDY
        M D+GEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIP TLKGLPEDFSGNQ+GLGGDY
Subjt:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDY

Query:  AAIQEGNNDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF
        AAIQEGNNDDKVPGWRAPTLPSHGHRFEFKL ALDDHLNLGNK TKEKLL+AIEGHVLGEAVLMAVF
Subjt:  AAIQEGNNDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF

XP_022145739.1 uncharacterized protein LOC111015122 [Momordica charantia] | 1.4e-81 | 88.02
Query:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDY
        MADS +FRL SSAIDHEGRLPRKYTSEGQG QKNKSPPLEWYN+P+GTK+LALVVQDIDAPDP GPIVPWTVWVV+NIPPTLKGLPEDFSG  E LG DY
Subjt:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDY

Query:  AAIQEGNNDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF
        AAIQEGNND+K+PGWRAPTLPSHGHRFEFKL ALDD LNLG+K TK+KLLDAIEGHVLGEAVLMAVF
Subjt:  AAIQEGNNDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF

XP_023529183.1 uncharacterized protein LOC111791902 [Cucurbita pepo subsp. pepo] | 2.2e-82 | 89.22
Query:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDY
        MADS EFRLASSAIDHEGRLPRKYTSEGQGAQKN+SPPLEWYN+P+GTKTLALVVQDIDAP+P+GPIVPWTVWVVVNIPP+LKGLPEDFSGNQ+GLG DY
Subjt:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDY

Query:  AAIQEGNNDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF
        AAIQEGNNDDK+PGWR PTLPS GHRFEFKL ALDD +NLGNK TKEKLLD IEGHVLGEAVLMA+F
Subjt:  AAIQEGNNDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF

XP_038904153.1 UPF0098 protein TC_0109 [Benincasa hispida] | 1.8e-89 | 97.01
Query:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDY
        MADSG FRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDP+GPIVPWTVWVVVNIPPTLKGLPEDFSGNQ+GLGGDY
Subjt:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDY

Query:  AAIQEGNNDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF
        AAIQEGNND+KVPGWRAPTLPSHGHRFEFKL ALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF
Subjt:  AAIQEGNNDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF
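
The middle line of each alignment block follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch. As a rough sketch, a percent identity can be recomputed from a query line and its middle line. The function name is hypothetical, and dividing by the alignment length is an assumption, since the page does not say how its % identity column was derived.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of aligned positions where the midline repeats the query residue."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    if not query:
        raise ValueError("empty alignment")
    # A position counts as identical only when the midline shows the same
    # letter as the query; '+' and ' ' are treated as non-identical.
    identical = sum(1 for q, m in zip(query, midline) if q == m and q != "-")
    return round(100.0 * identical / len(query), 2)


# First ten columns of the Benincasa hispida alignment above.
print(percent_identity("MADSGEFRLA", "MADSG FRLA"))
```

When copying lines from the page, strip the `Query:` prefix and the matching leading spaces of the middle line so that both strings start at the same column.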

TrEMBL top hits | e value | % identity
A0A0A0L9T7 Uncharacterized protein | 4.1e-87 | 92.81
Query:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDY
        M D+GEFRL SSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLP+GTKTLALVVQDIDAPDP+GPIVPWTVWVVVNIPPTLKGLPEDFSGNQ+GLGGDY
Subjt:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDY

Query:  AAIQEGNNDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF
        A IQEGNND+KVPGWRAPTLPSHGHRFEFKL ALDDHLNLGNK TK+KLL+AIEGHVLGEAVLMAVF
Subjt:  AAIQEGNNDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF

A0A1S3B537 UPF0098 protein TC_0109 | 5.7e-89 | 95.81
Query:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDY
        M D+GEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIP TLKGLPEDFSGNQ+GLGGDY
Subjt:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDY

Query:  AAIQEGNNDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF
        AAIQEGNNDDKVPGWRAPTLPSHGHRFEFKL ALDDHLNLGNK TKEKLL+AIEGHVLGEAVLMAVF
Subjt:  AAIQEGNNDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF

A0A5A7TL06 UPF0098 protein | 5.7e-89 | 95.81
Query:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDY
        M D+GEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIP TLKGLPEDFSGNQ+GLGGDY
Subjt:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDY

Query:  AAIQEGNNDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF
        AAIQEGNNDDKVPGWRAPTLPSHGHRFEFKL ALDDHLNLGNK TKEKLL+AIEGHVLGEAVLMAVF
Subjt:  AAIQEGNNDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF

A0A6J1CW59 uncharacterized protein LOC111015122 | 6.8e-82 | 88.02
Query:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDY
        MADS +FRL SSAIDHEGRLPRKYTSEGQG QKNKSPPLEWYN+P+GTK+LALVVQDIDAPDP GPIVPWTVWVV+NIPPTLKGLPEDFSG  E LG DY
Subjt:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDY

Query:  AAIQEGNNDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF
        AAIQEGNND+K+PGWRAPTLPSHGHRFEFKL ALDD LNLG+K TK+KLLDAIEGHVLGEAVLMAVF
Subjt:  AAIQEGNNDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF

A0A6J1IX32 uncharacterized protein LOC111481441 | 8.8e-82 | 88.62
Query:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDY
        MADS EFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYN+P+GTKTLALVVQDIDAP+P+ PIVPWTVWVVVNIPP+LKGLPEDFSGNQ+GLG DY
Subjt:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDY

Query:  AAIQEGNNDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF
        AAIQEGNNDDK+PGWR PTLPS GHRFEFKL ALDD +NLGNK +KEKLLD IEGHVLGEAVLMA+F
Subjt:  AAIQEGNNDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF

SwissProt top hits | e value | % identity
O26373 UPF0098 protein MTH_273 | 1.4e-23 | 41.51
Query:  LASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDYAAIQEGNN
        L + A +  GR+P +YT +G+    N SPPL W  +P   K+LAL+  D DAP        WT WV+ NIPP   GL E    N    G       +G N
Subjt:  LASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDYAAIQEGNN

Query:  DDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF
        D    G+R P  PS  HR+ F+L ALD  L+L    +KE +L+A+EGHVLGEA L+ ++
Subjt:  DDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF

O28575 UPF0098 protein AF_1698 | 1.1e-12 | 32.65
Query:  SAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDYAAIQEGNNDDK
        SA ++ G++P KYT +G+    + SPPL    L E  K+L ++ +     DP+ P+  +T W+  N+ PT + +PE+    +     D   + +G ND  
Subjt:  SAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDYAAIQEGNNDDK

Query:  VPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVL
          G+  P  PS  HR+ F++ A+D  L      ++++LL AIEGH+L
Subjt:  VPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVL

O84741 UPF0098 protein CT_736 | 1.1e-15 | 33.12
Query:  RLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDYAAIQEGN
        +L S A  +   +P+KY+ +G G     SPPL + ++P   K+L L+V+D D P        W  W+V N+ P +  L E         G    A+Q  N
Subjt:  RLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDYAAIQEGN

Query:  NDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF
           ++ G+  P  P   HR+ F   ALD  L+    VTKE+LL+A++GH++  A LM  +
Subjt:  NDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF

Q9PLJ0 UPF0098 protein TC_0109 | 2.8e-16 | 35
Query:  RLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDYAAIQEGN
        +L S A  +   +P+KY+ +G G     SPPL + ++P   K+LAL+V+D D P        W  W+V N+ P +  L E         G    A+Q  N
Subjt:  RLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDYAAIQEGN

Query:  NDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF
           ++ G+  P  P   HR+ F   ALD  L     VTKE+LL+A+EGH+L  A LM  +
Subjt:  NDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF

Q9Z729 UPF0098 protein CPn_0877/CP_0992/CPj0877/CpB0906 | 2.4e-15 | 33.12
Query:  RLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDYAAIQEGN
        +L S A  +   +P+KYT +G G     SPPL + ++P   ++LAL+V+D D P        W  W+V N+  T+  L E         G +  A+Q G 
Subjt:  RLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDYAAIQEGN

Query:  NDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF
        N    P +  P  P   HR+ F L ALD  L     VT+++L +A+E H++ +A LM  +
Subjt:  NDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF

Arabidopsis top hits | e value | % identity
AT5G01300.1 PEBP (phosphatidylethanolamine-binding protein) family protein | 1.8e-63 | 70
Query:  SGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDYAAI
        S E RL S  ID++G+LPRKYT  GQG +K+ SPPLEWYN+PEGTKTLALVV+DIDAPDP+GP+VPWTVWVVV+IPP +KGLPE +SGN++   G    I
Subjt:  SGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDYAAI

Query:  QEGNNDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVL
        +EGNND K+PGWR P LPSHGHRF+FKL ALDD   +G+ VTKE+LL AIEGHVLGEA+L
Subjt:  QEGNNDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVL

AT5G01300.2 PEBP (phosphatidylethanolamine-binding protein) family protein | 1.3e-48 | 72.13
Query:  YNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDYAAIQEGNNDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLG
        YN+PEGTKTLALVV+DIDAPDP+GP+VPWTVWVVV+IPP +KGLPE +SGN++   G    I+EGNND K+PGWR P LPSHGHRF+FKL ALDD   +G
Subjt:  YNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDYAAIQEGNNDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLG

Query:  NKVTKEKLLDAIEGHVLGEAVL
        + VTKE+LL AIEGHVLGEA+L
Subjt:  NKVTKEKLLDAIEGHVLGEAVL


Sequences
CDS sequence
ATGGCTGATAGCGGGGAGTTCAGGCTGGCGTCTTCAGCAATAGACCACGAAGGAAGATTGCCACGGAAGTACACATCGGAAGGGCAAGGGGCGCAGAAGAACAAA
TCTCCGCCGTTGGAATGGTACAATCTGCCTGAAGGGACCAAGACCCTGGCTCTTGTGGTGCAGGACATTGACGCGCCGGACCCGAACGGGCCGATCGTGCCGTGG
ACGGTGTGGGTGGTTGTGAACATACCGCCCACCTTGAAGGGCCTGCCCGAAGATTTCTCTGGGAACCAAGAGGGGCTCGGGGGTGATTATGCGGCTATCCAAGAG
GGCAACAATGACGACAAAGTCCCTGGTTGGCGTGCCCCCACTTTGCCCTCACATGGCCACCGCTTTGAGTTCAAGCTCTGCGCTTTAGACGACCACTTGAACCTC
GGCAATAAGGTGACAAAGGAGAAGCTGTTAGACGCAATAGAAGGACACGTGCTGGGAGAAGCCGTATTAATGGCGGTCTTCTGA
mRNA sequence
ATGGCTGATAGCGGGGAGTTCAGGCTGGCGTCTTCAGCAATAGACCACGAAGGAAGATTGCCACGGAAGTACACATCGGAAGGGCAAGGGGCGCAGAAGAACAAA
TCTCCGCCGTTGGAATGGTACAATCTGCCTGAAGGGACCAAGACCCTGGCTCTTGTGGTGCAGGACATTGACGCGCCGGACCCGAACGGGCCGATCGTGCCGTGG
ACGGTGTGGGTGGTTGTGAACATACCGCCCACCTTGAAGGGCCTGCCCGAAGATTTCTCTGGGAACCAAGAGGGGCTCGGGGGTGATTATGCGGCTATCCAAGAG
GGCAACAATGACGACAAAGTCCCTGGTTGGCGTGCCCCCACTTTGCCCTCACATGGCCACCGCTTTGAGTTCAAGCTCTGCGCTTTAGACGACCACTTGAACCTC
GGCAATAAGGTGACAAAGGAGAAGCTGTTAGACGCAATAGAAGGACACGTGCTGGGAGAAGCCGTATTAATGGCGGTCTTCTGA
Protein sequence
MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPPTLKGLPEDFSGNQEGLGGDYAAIQE
GNNDDKVPGWRAPTLPSHGHRFEFKLCALDDHLNLGNKVTKEKLLDAIEGHVLGEAVLMAVF
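
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (nothing on the page states which translation table was used) and uses a hypothetical helper named `translate`. It is applied here to the first 21 nucleotides and to the last line of the CDS as listed.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; '*' marks a stop, 'X' an unknown codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


cds_start = "ATGGCTGATAGCGGGGAGTTC"
cds_end = (
    "GGCAATAAGGTGACAAAGGAGAAGCTGTTAGACGCAATAGAAGGACACGTG"
    "CTGGGAGAAGCCGTATTAATGGCGGTCTTCTGA"
)
print(translate(cds_start))  # MADSGEF
print(translate(cds_end))    # GNKVTKEKLLDAIEGHVLGEAVLMAVF*
```

Both fragments agree with the start and end of the protein sequence shown above, with the trailing `*` corresponding to the TGA stop codon.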