; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10020430 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10020430
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionUPF0098 protein
Genome locationChr04:31793005..31794258
RNA-Seq ExpressionHG10020430
SyntenyHG10020430
Gene Ontology termsNA
InterPro domainsIPR005247 - YbhB/YbcL
IPR008914 - Phosphatidylethanolamine-binding protein
IPR036610 - PEBP-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580458.1 hypothetical protein SDJN03_20460, partial [Cucurbita argyrosperma subsp. sororia]5.3e-8187.43Show/hide
Query:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADY
        MA S EFRLASSAIDHEGRLPRKYTSEGQGAQKN+SPPLEWYN+P+GTKTLALVVQDIDAP+P+GPIVPWTVWVVVNIP +LKGLPEDFSGNQQGLG DY
Subjt:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADY

Query:  APIQEGHNDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF
        A IQEG+NDDK+PGWR PTLPS GHRFEFKLYALDD +NLGNK TKEKLLD I+GHVLGEAVLMA+F
Subjt:  APIQEGHNDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF

XP_004137699.1 uncharacterized protein LOC101207571 [Cucumis sativus]3.2e-8692.22Show/hide
Query:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADY
        M D+GEFRL SSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLP+GTKTLALVVQDIDAPDP+GPIVPWTVWVVVNIP TLKGLPEDFSGNQQGLG DY
Subjt:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADY

Query:  APIQEGHNDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF
        A IQEG+ND+KVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATK+KLL+AI+GHVLGEAVLMAVF
Subjt:  APIQEGHNDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF

XP_008442384.1 PREDICTED: UPF0098 protein TC_0109 [Cucumis melo]4.1e-8995.81Show/hide
Query:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADY
        M D+GEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLG DY
Subjt:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADY

Query:  APIQEGHNDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF
        A IQEG+NDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLL+AI+GHVLGEAVLMAVF
Subjt:  APIQEGHNDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF

XP_023529183.1 uncharacterized protein LOC111791902 [Cucurbita pepo subsp. pepo]8.2e-8288.02Show/hide
Query:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADY
        MADS EFRLASSAIDHEGRLPRKYTSEGQGAQKN+SPPLEWYN+P+GTKTLALVVQDIDAP+P+GPIVPWTVWVVVNIP +LKGLPEDFSGNQQGLG DY
Subjt:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADY

Query:  APIQEGHNDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF
        A IQEG+NDDK+PGWR PTLPS GHRFEFKLYALDD +NLGNK TKEKLLD I+GHVLGEAVLMA+F
Subjt:  APIQEGHNDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF

XP_038904153.1 UPF0098 protein TC_0109 [Benincasa hispida]1.7e-8794.61Show/hide
Query:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADY
        MADSG FRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDP+GPIVPWTVWVVVNIP TLKGLPEDFSGNQQGLG DY
Subjt:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADY

Query:  APIQEGHNDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF
        A IQEG+ND+KVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNK TKEKLLDAI+GHVLGEAVLMAVF
Subjt:  APIQEGHNDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF

TrEMBL top hitse value%identityAlignment
A0A0A0L9T7 Uncharacterized protein1.6e-8692.22Show/hide
Query:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADY
        M D+GEFRL SSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLP+GTKTLALVVQDIDAPDP+GPIVPWTVWVVVNIP TLKGLPEDFSGNQQGLG DY
Subjt:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADY

Query:  APIQEGHNDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF
        A IQEG+ND+KVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATK+KLL+AI+GHVLGEAVLMAVF
Subjt:  APIQEGHNDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF

A0A1S3B537 UPF0098 protein TC_01092.0e-8995.81Show/hide
Query:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADY
        M D+GEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLG DY
Subjt:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADY

Query:  APIQEGHNDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF
        A IQEG+NDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLL+AI+GHVLGEAVLMAVF
Subjt:  APIQEGHNDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF

A0A5A7TL06 UPF0098 protein2.0e-8995.81Show/hide
Query:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADY
        M D+GEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLG DY
Subjt:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADY

Query:  APIQEGHNDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF
        A IQEG+NDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLL+AI+GHVLGEAVLMAVF
Subjt:  APIQEGHNDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF

A0A6J1CW59 uncharacterized protein LOC1110151222.6e-8186.83Show/hide
Query:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADY
        MADS +FRL SSAIDHEGRLPRKYTSEGQG QKNKSPPLEWYN+P+GTK+LALVVQDIDAPDP GPIVPWTVWVV+NIP TLKGLPEDFSG  + LGADY
Subjt:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADY

Query:  APIQEGHNDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF
        A IQEG+ND+K+PGWRAPTLPSHGHRFEFKLYALDD LNLG+KATK+KLLDAI+GHVLGEAVLMAVF
Subjt:  APIQEGHNDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF

A0A6J1IX32 uncharacterized protein LOC1114814413.4e-8187.43Show/hide
Query:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADY
        MADS EFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYN+P+GTKTLALVVQDIDAP+P+ PIVPWTVWVVVNIP +LKGLPEDFSGNQQGLG DY
Subjt:  MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADY

Query:  APIQEGHNDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF
        A IQEG+NDDK+PGWR PTLPS GHRFEFKLYALDD +NLGNK +KEKLLD I+GHVLGEAVLMA+F
Subjt:  APIQEGHNDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF

SwissProt top hitse value%identityAlignment
O26373 UPF0098 protein MTH_2732.8e-2441.51Show/hide
Query:  LASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADYAPIQEGHN
        L + A +  GR+P +YT +G+    N SPPL W  +P   K+LAL+  D DAP        WT WV+ NIP    GL E    N    G       +G+N
Subjt:  LASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADYAPIQEGHN

Query:  DDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF
        D    G+R P  PS  HR+ F+LYALD  L+L   A+KE +L+A++GHVLGEA L+ ++
Subjt:  DDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF

O84741 UPF0098 protein CT_7366.9e-1533.12Show/hide
Query:  RLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADYAPIQEGH
        +L S A  +   +P+KY+ +G G     SPPL + ++P   K+L L+V+D D P        W  W+V N+   +  L E         GA    +Q G 
Subjt:  RLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADYAPIQEGH

Query:  NDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF
        N     G+  P  P   HR+ F  YALD  L+     TKE+LL+A+ GH++  A LM  +
Subjt:  NDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF

Q9PLJ0 UPF0098 protein TC_01092.4e-1534.38Show/hide
Query:  RLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADYAPIQEGH
        +L S A  +   +P+KY+ +G G     SPPL + ++P   K+LAL+V+D D P        W  W+V N+   +  L E         GA    +Q G 
Subjt:  RLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADYAPIQEGH

Query:  NDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF
        N     G+  P  P   HR+ F  YALD  L      TKE+LL+A++GH+L  A LM  +
Subjt:  NDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF

Q9UZJ3 UPF0098 protein PYRAB115306.4e-1332.7Show/hide
Query:  SSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADYAPIQEGHNDD
        SS   ++  +P KYT EG     + +PPL    + E  K+L ++V      DP+ P+  +T W+  NIP  ++ +PE     +QG       + +G ND 
Subjt:  SSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADYAPIQEGHNDD

Query:  KVPGWRAPTLP-SHG-HRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF
           G+  P  P  HG H + FK+Y LD  LNL   AT+E+L  A++GH++    L+ ++
Subjt:  KVPGWRAPTLP-SHG-HRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF

Q9Z729 UPF0098 protein CPn_0877/CP_0992/CPj0877/CpB09063.1e-1531.87Show/hide
Query:  RLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADYAPIQEGH
        +L S A  +   +P+KYT +G G     SPPL + ++P   ++LAL+V+D D P        W  W+V N+  T+  L E         GA+   +Q G 
Subjt:  RLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADYAPIQEGH

Query:  NDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF
        N    P +  P  P   HR+ F L+ALD  L      T+++L +A++ H++ +A LM  +
Subjt:  NDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF

Arabidopsis top hitse value%identityAlignment
AT5G01300.1 PEBP (phosphatidylethanolamine-binding protein) family protein3.8e-6166.88Show/hide
Query:  SGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADYAPI
        S E RL S  ID++G+LPRKYT  GQG +K+ SPPLEWYN+PEGTKTLALVV+DIDAPDP+GP+VPWTVWVVV+IP  +KGLPE +SGN+         I
Subjt:  SGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADYAPI

Query:  QEGHNDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVL
        +EG+ND K+PGWR P LPSHGHRF+FKL+ALDD   +G+  TKE+LL AI+GHVLGEA+L
Subjt:  QEGHNDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVL

AT5G01300.2 PEBP (phosphatidylethanolamine-binding protein) family protein2.7e-4668.03Show/hide
Query:  YNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADYAPIQEGHNDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLG
        YN+PEGTKTLALVV+DIDAPDP+GP+VPWTVWVVV+IP  +KGLPE +SGN+         I+EG+ND K+PGWR P LPSHGHRF+FKL+ALDD   +G
Subjt:  YNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADYAPIQEGHNDDKVPGWRAPTLPSHGHRFEFKLYALDDHLNLG

Query:  NKATKEKLLDAIQGHVLGEAVL
        +  TKE+LL AI+GHVLGEA+L
Subjt:  NKATKEKLLDAIQGHVLGEAVL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGATAGTGGCGAGTTCAGGCTAGCGTCTTCAGCAATAGACCACGAAGGAAGATTGCCAAGAAAGTACACGTCAGAAGGACAAGGGGCGCAGAAGAACAAATCTCC
GCCGTTGGAATGGTACAATCTGCCTGAAGGGACCAAGACCCTGGCTCTAGTGGTGCAGGACATTGACGCGCCGGACCCTAACGGCCCGATCGTGCCGTGGACCGTCTGGG
TGGTTGTTAACATACCGGCTACCTTGAAGGGCCTCCCCGAAGATTTCTCTGGGAACCAACAGGGGCTCGGGGCTGATTATGCGCCTATCCAAGAGGGTCACAATGACGAT
AAAGTCCCTGGTTGGCGTGCCCCCACTTTGCCCTCACATGGCCACCGCTTTGAGTTCAAGCTTTACGCATTAGACGACCACTTGAACCTCGGCAATAAGGCGACAAAGGA
GAAGCTGTTAGACGCAATACAAGGGCACGTGCTGGGAGAAGCCGTATTAATGGCGGTCTTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGATAGTGGCGAGTTCAGGCTAGCGTCTTCAGCAATAGACCACGAAGGAAGATTGCCAAGAAAGTACACGTCAGAAGGACAAGGGGCGCAGAAGAACAAATCTCC
GCCGTTGGAATGGTACAATCTGCCTGAAGGGACCAAGACCCTGGCTCTAGTGGTGCAGGACATTGACGCGCCGGACCCTAACGGCCCGATCGTGCCGTGGACCGTCTGGG
TGGTTGTTAACATACCGGCTACCTTGAAGGGCCTCCCCGAAGATTTCTCTGGGAACCAACAGGGGCTCGGGGCTGATTATGCGCCTATCCAAGAGGGTCACAATGACGAT
AAAGTCCCTGGTTGGCGTGCCCCCACTTTGCCCTCACATGGCCACCGCTTTGAGTTCAAGCTTTACGCATTAGACGACCACTTGAACCTCGGCAATAAGGCGACAAAGGA
GAAGCTGTTAGACGCAATACAAGGGCACGTGCTGGGAGAAGCCGTATTAATGGCGGTCTTCTGA
Protein sequenceShow/hide protein sequence
MADSGEFRLASSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNLPEGTKTLALVVQDIDAPDPNGPIVPWTVWVVVNIPATLKGLPEDFSGNQQGLGADYAPIQEGHNDD
KVPGWRAPTLPSHGHRFEFKLYALDDHLNLGNKATKEKLLDAIQGHVLGEAVLMAVF