; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0008123 (gene) of Snake gourd v1 genome

Gene IDTan0008123
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUPF0098 protein
Genome locationLG08:73514080..73515531
RNA-Seq ExpressionTan0008123
SyntenyTan0008123
Gene Ontology termsNA
InterPro domainsIPR005247 - YbhB/YbcL
IPR008914 - Phosphatidylethanolamine-binding protein
IPR036610 - PEBP-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004137699.1 uncharacterized protein LOC101207571 [Cucumis sativus]8.2e-8287.43Show/hide
Query:  MAESGEFRLASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDY
        M ++GEFRL SSAIDH+GRLPRKYTSEGQGAQKNKSPPLEWYN+P+GTK+LA+VVQDIDAPDP+GPIVPWTVWVVVNIPP+LK LPEDFSG +  LGGDY
Subjt:  MAESGEFRLASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDY

Query:  AGIQEGYNDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF
        A IQEG ND+KVPGWRAPTLPSHGHRFEFKLYALD HLNLGNKATKDKLL+AIEGHVLGEAVLMAVF
Subjt:  AGIQEGYNDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF

XP_008442384.1 PREDICTED: UPF0098 protein TC_0109 [Cucumis melo]1.3e-8288.62Show/hide
Query:  MAESGEFRLASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDY
        M ++GEFRLASSAIDH+GRLPRKYTSEGQGAQKNKSPPLEWYN+PEGTK+LA+VVQDIDAPDPNGPIVPWTVWVVVNIP +LK LPEDFSG +  LGGDY
Subjt:  MAESGEFRLASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDY

Query:  AGIQEGYNDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF
        A IQEG NDDKVPGWRAPTLPSHGHRFEFKLYALD HLNLGNKATK+KLL+AIEGHVLGEAVLMAVF
Subjt:  AGIQEGYNDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF

XP_022145739.1 uncharacterized protein LOC111015122 [Momordica charantia]1.5e-8086.83Show/hide
Query:  MAESGEFRLASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDY
        MA+S +FRL SSAIDH+GRLPRKYTSEGQG QKNKSPPLEWYNVP+GTKSLA+VVQDIDAPDP GPIVPWTVWVV+NIPP+LK LPEDFSGK +ALG DY
Subjt:  MAESGEFRLASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDY

Query:  AGIQEGYNDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF
        A IQEG ND+K+PGWRAPTLPSHGHRFEFKLYALD  LNLG+KATKDKLLDAIEGHVLGEAVLMAVF
Subjt:  AGIQEGYNDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF

XP_023529183.1 uncharacterized protein LOC111791902 [Cucurbita pepo subsp. pepo]1.6e-7783.83Show/hide
Query:  MAESGEFRLASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDY
        MA+S EFRLASSAIDH+GRLPRKYTSEGQGAQKN+SPPLEWYNVP+GTK+LA+VVQDIDAP+P+GPIVPWTVWVVVNIPPSLK LPEDFSG +  LG DY
Subjt:  MAESGEFRLASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDY

Query:  AGIQEGYNDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF
        A IQEG NDDK+PGWR PTLPS GHRFEFKLYALD  +NLGNK TK+KLLD IEGHVLGEAVLMA+F
Subjt:  AGIQEGYNDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF

XP_038904153.1 UPF0098 protein TC_0109 [Benincasa hispida]1.7e-8288.62Show/hide
Query:  MAESGEFRLASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDY
        MA+SG FRLASSAIDH+GRLPRKYTSEGQGAQKNKSPPLEWYN+PEGTK+LA+VVQDIDAPDP+GPIVPWTVWVVVNIPP+LK LPEDFSG +  LGGDY
Subjt:  MAESGEFRLASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDY

Query:  AGIQEGYNDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF
        A IQEG ND+KVPGWRAPTLPSHGHRFEFKLYALD HLNLGNK TK+KLLDAIEGHVLGEAVLMAVF
Subjt:  AGIQEGYNDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF

TrEMBL top hitse value%identityAlignment
A0A0A0L9T7 Uncharacterized protein4.0e-8287.43Show/hide
Query:  MAESGEFRLASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDY
        M ++GEFRL SSAIDH+GRLPRKYTSEGQGAQKNKSPPLEWYN+P+GTK+LA+VVQDIDAPDP+GPIVPWTVWVVVNIPP+LK LPEDFSG +  LGGDY
Subjt:  MAESGEFRLASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDY

Query:  AGIQEGYNDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF
        A IQEG ND+KVPGWRAPTLPSHGHRFEFKLYALD HLNLGNKATKDKLL+AIEGHVLGEAVLMAVF
Subjt:  AGIQEGYNDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF

A0A1S3B537 UPF0098 protein TC_01096.1e-8388.62Show/hide
Query:  MAESGEFRLASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDY
        M ++GEFRLASSAIDH+GRLPRKYTSEGQGAQKNKSPPLEWYN+PEGTK+LA+VVQDIDAPDPNGPIVPWTVWVVVNIP +LK LPEDFSG +  LGGDY
Subjt:  MAESGEFRLASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDY

Query:  AGIQEGYNDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF
        A IQEG NDDKVPGWRAPTLPSHGHRFEFKLYALD HLNLGNKATK+KLL+AIEGHVLGEAVLMAVF
Subjt:  AGIQEGYNDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF

A0A5A7TL06 UPF0098 protein6.1e-8388.62Show/hide
Query:  MAESGEFRLASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDY
        M ++GEFRLASSAIDH+GRLPRKYTSEGQGAQKNKSPPLEWYN+PEGTK+LA+VVQDIDAPDPNGPIVPWTVWVVVNIP +LK LPEDFSG +  LGGDY
Subjt:  MAESGEFRLASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDY

Query:  AGIQEGYNDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF
        A IQEG NDDKVPGWRAPTLPSHGHRFEFKLYALD HLNLGNKATK+KLL+AIEGHVLGEAVLMAVF
Subjt:  AGIQEGYNDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF

A0A6J1CW59 uncharacterized protein LOC1110151227.5e-8186.83Show/hide
Query:  MAESGEFRLASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDY
        MA+S +FRL SSAIDH+GRLPRKYTSEGQG QKNKSPPLEWYNVP+GTKSLA+VVQDIDAPDP GPIVPWTVWVV+NIPP+LK LPEDFSGK +ALG DY
Subjt:  MAESGEFRLASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDY

Query:  AGIQEGYNDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF
        A IQEG ND+K+PGWRAPTLPSHGHRFEFKLYALD  LNLG+KATKDKLLDAIEGHVLGEAVLMAVF
Subjt:  AGIQEGYNDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF

A0A6J1IX32 uncharacterized protein LOC1114814416.6e-7783.23Show/hide
Query:  MAESGEFRLASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDY
        MA+S EFRLASSAIDH+GRLPRKYTSEGQGAQKNKSPPLEWYNVP+GTK+LA+VVQDIDAP+P+ PIVPWTVWVVVNIPPSLK LPEDFSG +  LG DY
Subjt:  MAESGEFRLASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDY

Query:  AGIQEGYNDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF
        A IQEG NDDK+PGWR PTLPS GHRFEFKLYALD  +NLGNK +K+KLLD IEGHVLGEAVLMA+F
Subjt:  AGIQEGYNDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF

SwissProt top hitse value%identityAlignment
O26373 UPF0098 protein MTH_2731.5e-2544.03Show/hide
Query:  LASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDYAGIQEGYN
        L + A +  GR+P +YT +G+    N SPPL W  VP   KSLA++  D DAP        WT WV+ NIPP    L E+     DA G    G  +GYN
Subjt:  LASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDYAGIQEGYN

Query:  DDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF
        D    G+R P  PS  HR+ F+LYALD  L+L   A+K+ +L+A+EGHVLGEA L+ ++
Subjt:  DDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF

O84741 UPF0098 protein CT_7361.4e-1533.12Show/hide
Query:  RLASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDYAGIQEGY
        +L S A  +   +P+KY+ +G G     SPPL + +VP   KSL ++V+D D P        W  W+V N+ P +  L E         G     +Q G 
Subjt:  RLASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDYAGIQEGY

Query:  NDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF
        N     G+  P  P   HR+ F  YALD  L+     TK++LL+A++GH++  A LM  +
Subjt:  NDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF

Q9PLJ0 UPF0098 protein TC_01094.8e-1634.38Show/hide
Query:  RLASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDYAGIQEGY
        +L S A  +   +P+KY+ +G G     SPPL + ++P   KSLA++V+D D P        W  W+V N+ P +  L E         G     +Q G 
Subjt:  RLASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDYAGIQEGY

Query:  NDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF
        N     G+  P  P   HR+ F  YALD  L      TK++LL+A+EGH+L  A LM  +
Subjt:  NDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF

Q9UZJ3 UPF0098 protein PYRAB115304.5e-1435.22Show/hide
Query:  SSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDYAGIQEGYNDD
        SS   +D  +P KYT EG     + +PPL    + E  KSL ++V      DP+ P+  +T W+  NIPP ++ +PE    K+  +      IQ G ND 
Subjt:  SSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDYAGIQEGYNDD

Query:  KVPGWRAPTLP-SHG-HRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF
           G+  P  P  HG H + FK+Y LD  LNL   AT+++L  A+EGH++    L+ ++
Subjt:  KVPGWRAPTLP-SHG-HRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF

Q9Z729 UPF0098 protein CPn_0877/CP_0992/CPj0877/CpB09061.4e-1532.5Show/hide
Query:  RLASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDYAGIQEGY
        +L S A  +   +P+KYT +G G     SPPL + +VP   +SLA++V+D D P        W  W+V N+  ++  L E         G +   +Q G 
Subjt:  RLASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDYAGIQEGY

Query:  NDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF
        N    P +  P  P   HR+ F L+ALD  L      T+D+L +A+E H++ +A LM  +
Subjt:  NDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF

Arabidopsis top hitse value%identityAlignment
AT5G01300.1 PEBP (phosphatidylethanolamine-binding protein) family protein1.7e-6167.5Show/hide
Query:  SGEFRLASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDYAGI
        S E RL S  ID+DG+LPRKYT  GQG +K+ SPPLEWYNVPEGTK+LA+VV+DIDAPDP+GP+VPWTVWVVV+IPP +K LPE +SG ED       GI
Subjt:  SGEFRLASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDYAGI

Query:  QEGYNDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVL
        +EG ND K+PGWR P LPSHGHRF+FKL+ALD    +G+  TK++LL AIEGHVLGEA+L
Subjt:  QEGYNDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVL

AT5G01300.2 PEBP (phosphatidylethanolamine-binding protein) family protein3.5e-4668.03Show/hide
Query:  YNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDYAGIQEGYNDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLG
        YNVPEGTK+LA+VV+DIDAPDP+GP+VPWTVWVVV+IPP +K LPE +SG ED       GI+EG ND K+PGWR P LPSHGHRF+FKL+ALD    +G
Subjt:  YNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDYAGIQEGYNDDKVPGWRAPTLPSHGHRFEFKLYALDQHLNLG

Query:  NKATKDKLLDAIEGHVLGEAVL
        +  TK++LL AIEGHVLGEA+L
Subjt:  NKATKDKLLDAIEGHVLGEAVL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGAGAGCGGCGAGTTCAGGCTGGCGTCATCAGCGATAGATCACGACGGAAGGTTGCCGAGAAAGTACACGTCGGAGGGCCAAGGGGCGCAGAAGAATAAATCGCC
GCCGTTGGAATGGTACAATGTGCCGGAAGGGACCAAGAGTTTGGCCGTTGTGGTTCAGGACATCGACGCGCCGGACCCTAATGGGCCGATCGTGCCGTGGACGGTGTGGG
TGGTTGTGAATATACCGCCCAGCTTGAAGGCCCTGCCCGAAGATTTCTCTGGGAAAGAAGACGCGCTCGGTGGGGATTATGCGGGCATTCAAGAAGGGTATAATGACGAT
AAAGTCCCTGGTTGGCGTGCTCCCACTCTACCCTCCCATGGTCACCGATTTGAGTTTAAGCTCTACGCTTTGGACCAACACTTGAACCTTGGCAATAAGGCGACAAAGGA
CAAGCTTTTAGATGCAATAGAAGGGCACGTGCTCGGAGAAGCAGTTCTAATGGCGGTCTTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGAGAGCGGCGAGTTCAGGCTGGCGTCATCAGCGATAGATCACGACGGAAGGTTGCCGAGAAAGTACACGTCGGAGGGCCAAGGGGCGCAGAAGAATAAATCGCC
GCCGTTGGAATGGTACAATGTGCCGGAAGGGACCAAGAGTTTGGCCGTTGTGGTTCAGGACATCGACGCGCCGGACCCTAATGGGCCGATCGTGCCGTGGACGGTGTGGG
TGGTTGTGAATATACCGCCCAGCTTGAAGGCCCTGCCCGAAGATTTCTCTGGGAAAGAAGACGCGCTCGGTGGGGATTATGCGGGCATTCAAGAAGGGTATAATGACGAT
AAAGTCCCTGGTTGGCGTGCTCCCACTCTACCCTCCCATGGTCACCGATTTGAGTTTAAGCTCTACGCTTTGGACCAACACTTGAACCTTGGCAATAAGGCGACAAAGGA
CAAGCTTTTAGATGCAATAGAAGGGCACGTGCTCGGAGAAGCAGTTCTAATGGCGGTCTTCTGA
Protein sequenceShow/hide protein sequence
MAESGEFRLASSAIDHDGRLPRKYTSEGQGAQKNKSPPLEWYNVPEGTKSLAVVVQDIDAPDPNGPIVPWTVWVVVNIPPSLKALPEDFSGKEDALGGDYAGIQEGYNDD
KVPGWRAPTLPSHGHRFEFKLYALDQHLNLGNKATKDKLLDAIEGHVLGEAVLMAVF