; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr014445 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr014445
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionUPF0098 protein
Genome locationtig00000589:442298..443879
RNA-Seq ExpressionSgr014445
SyntenySgr014445
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR005247 - YbhB/YbcL
IPR008914 - Phosphatidylethanolamine-binding protein
IPR036610 - PEBP-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004137699.1 uncharacterized protein LOC101207571 [Cucumis sativus]5.7e-7889.1Show/hide
Query:  MADSDEFRLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDY
        M D+ EFRLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYN+P+GTKTLALVVQDIDAPDP GPIVPWT+WVVVNIPP+LKGLPE+FSG Q+ LGGDY
Subjt:  MADSDEFRLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDY

Query:  AGIQEGNNDEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGH
        A IQEGNNDEKVPGWRAPT+PSHGHRFEFKLYALDD LNLGNKATK+KLL+AIEGH
Subjt:  AGIQEGNNDEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGH

XP_008442384.1 PREDICTED: UPF0098 protein TC_0109 [Cucumis melo]6.3e-7787.82Show/hide
Query:  MADSDEFRLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDY
        M D+ EFRL SSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYN+P+GTKTLALVVQDIDAPDP GPIVPWT+WVVVNIP +LKGLPE+FSG Q+ LGGDY
Subjt:  MADSDEFRLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDY

Query:  AGIQEGNNDEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGH
        A IQEGNND+KVPGWRAPT+PSHGHRFEFKLYALDD LNLGNKATKEKLL+AIEGH
Subjt:  AGIQEGNNDEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGH

XP_022145739.1 uncharacterized protein LOC111015122 [Momordica charantia]8.8e-7989.1Show/hide
Query:  MADSDEFRLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDY
        MADS++FRLVSSAIDHEGRLPRKYTSEGQG QKNKSPPLEWYNVP+GTK+LALVVQDIDAPDP+GPIVPWT+WVV+NIPP+LKGLPE+FSGK EALG DY
Subjt:  MADSDEFRLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDY

Query:  AGIQEGNNDEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGH
        A IQEGNNDEK+PGWRAPT+PSHGHRFEFKLYALDDQLNLG+KATK+KLLDAIEGH
Subjt:  AGIQEGNNDEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGH

XP_023529183.1 uncharacterized protein LOC111791902 [Cucurbita pepo subsp. pepo]7.0e-7686.54Show/hide
Query:  MADSDEFRLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDY
        MADS+EFRL SSAIDHEGRLPRKYTSEGQGAQKN+SPPLEWYNVPQGTKTLALVVQDIDAP+P+GPIVPWT+WVVVNIPPSLKGLPE+FSG Q+ LG DY
Subjt:  MADSDEFRLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDY

Query:  AGIQEGNNDEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGH
        A IQEGNND+K+PGWR PT+PS GHRFEFKLYALDD++NLGNK TKEKLLD IEGH
Subjt:  AGIQEGNNDEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGH

XP_038904153.1 UPF0098 protein TC_0109 [Benincasa hispida]3.3e-7889.74Show/hide
Query:  MADSDEFRLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDY
        MADS  FRL SSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYN+P+GTKTLALVVQDIDAPDP GPIVPWT+WVVVNIPP+LKGLPE+FSG Q+ LGGDY
Subjt:  MADSDEFRLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDY

Query:  AGIQEGNNDEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGH
        A IQEGNNDEKVPGWRAPT+PSHGHRFEFKLYALDD LNLGNK TKEKLLDAIEGH
Subjt:  AGIQEGNNDEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGH

TrEMBL top hitse value%identityAlignment
A0A0A0L9T7 Uncharacterized protein2.8e-7889.1Show/hide
Query:  MADSDEFRLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDY
        M D+ EFRLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYN+P+GTKTLALVVQDIDAPDP GPIVPWT+WVVVNIPP+LKGLPE+FSG Q+ LGGDY
Subjt:  MADSDEFRLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDY

Query:  AGIQEGNNDEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGH
        A IQEGNNDEKVPGWRAPT+PSHGHRFEFKLYALDD LNLGNKATK+KLL+AIEGH
Subjt:  AGIQEGNNDEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGH

A0A1S3B537 UPF0098 protein TC_01093.1e-7787.82Show/hide
Query:  MADSDEFRLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDY
        M D+ EFRL SSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYN+P+GTKTLALVVQDIDAPDP GPIVPWT+WVVVNIP +LKGLPE+FSG Q+ LGGDY
Subjt:  MADSDEFRLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDY

Query:  AGIQEGNNDEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGH
        A IQEGNND+KVPGWRAPT+PSHGHRFEFKLYALDD LNLGNKATKEKLL+AIEGH
Subjt:  AGIQEGNNDEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGH

A0A5A7TL06 UPF0098 protein3.1e-7787.82Show/hide
Query:  MADSDEFRLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDY
        M D+ EFRL SSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYN+P+GTKTLALVVQDIDAPDP GPIVPWT+WVVVNIP +LKGLPE+FSG Q+ LGGDY
Subjt:  MADSDEFRLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDY

Query:  AGIQEGNNDEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGH
        A IQEGNND+KVPGWRAPT+PSHGHRFEFKLYALDD LNLGNKATKEKLL+AIEGH
Subjt:  AGIQEGNNDEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGH

A0A6J1CW59 uncharacterized protein LOC1110151224.3e-7989.1Show/hide
Query:  MADSDEFRLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDY
        MADS++FRLVSSAIDHEGRLPRKYTSEGQG QKNKSPPLEWYNVP+GTK+LALVVQDIDAPDP+GPIVPWT+WVV+NIPP+LKGLPE+FSGK EALG DY
Subjt:  MADSDEFRLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDY

Query:  AGIQEGNNDEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGH
        A IQEGNNDEK+PGWRAPT+PSHGHRFEFKLYALDDQLNLG+KATK+KLLDAIEGH
Subjt:  AGIQEGNNDEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGH

A0A6J1IX32 uncharacterized protein LOC1114814411.3e-7586.54Show/hide
Query:  MADSDEFRLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDY
        MADSDEFRL SSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAP+P+ PIVPWT+WVVVNIPPSLKGLPE+FSG Q+ LG DY
Subjt:  MADSDEFRLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDY

Query:  AGIQEGNNDEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGH
        A IQEGNND+K+PGWR PT+PS GHRFEFKLYALDD++NLGNK +KEKLLD IEGH
Subjt:  AGIQEGNNDEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGH

SwissProt top hitse value%identityAlignment
O26373 UPF0098 protein MTH_2733.2e-2342.57Show/hide
Query:  LVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDYAGIQEGNN
        L + A +  GR+P +YT +G+    N SPPL W  VP   K+LAL+  D DAP        WT WV+ NIPP   GL EN         G      +G N
Subjt:  LVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDYAGIQEGNN

Query:  DEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGH
        D    G+R P  PS  HR+ F+LYALD  L+L   A+KE +L+A+EGH
Subjt:  DEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGH

O28575 UPF0098 protein AF_16982.7e-1430.82Show/hide
Query:  VSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDYAGIQEGNND
        V SA ++ G++P KYT +G+    + SPPL    + +  K+L ++ +     DP+ P+  +T W+  N+ P+ + +PEN    +     D   + +G ND
Subjt:  VSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDYAGIQEGNND

Query:  EKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGHSISLLAVHGVVS
            G+  P  PS  HR+ F++YA+D  L      ++++LL AIEGH +    ++G+ S
Subjt:  EKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGHSISLLAVHGVVS

O84741 UPF0098 protein CT_7367.1e-1533.54Show/hide
Query:  RLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDYAGIQEGN
        +L S A  +   +P+KY+ +G G     SPPL + +VP+  K+L L+V+D D P        W  W+V N+ P +  L E         G     +Q G 
Subjt:  RLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDYAGIQEGN

Query:  NDEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGHSISLLAVHG
        N     G+  P  P   HR+ F  YALD  L+     TKE+LL+A++GH I+   + G
Subjt:  NDEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGHSISLLAVHG

Q9PLJ0 UPF0098 protein TC_01099.3e-1533.54Show/hide
Query:  RLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDYAGIQEGN
        +L S A  +   +P+KY+ +G G     SPPL + ++P   K+LAL+V+D D P        W  W+V N+ P +  L E         G     +Q G 
Subjt:  RLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDYAGIQEGN

Query:  NDEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGHSISLLAVHG
        N     G+  P  P   HR+ F  YALD  L      TKE+LL+A+EGH ++   + G
Subjt:  NDEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGHSISLLAVHG

Q9UZJ3 UPF0098 protein PYRAB115302.7e-1436.54Show/hide
Query:  DEFRLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDYAGIQ
        +E   VSS   ++  +P KYT EG     + +PPL    + +  K+L ++V      DP+ P+  +T W+  NIPP ++ +PE    KQ  +      IQ
Subjt:  DEFRLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDYAGIQ

Query:  EGNNDEKVPGWRAPTMP-SHG-HRFEFKLYALDDQLNLGNKATKEKLLDAIEGHSI
         G ND    G+  P  P  HG H + FK+Y LD  LNL   AT+E+L  A+EGH I
Subjt:  EGNNDEKVPGWRAPTMP-SHG-HRFEFKLYALDDQLNLGNKATKEKLLDAIEGHSI

Arabidopsis top hitse value%identityAlignment
AT5G01300.1 PEBP (phosphatidylethanolamine-binding protein) family protein1.2e-6067.32Show/hide
Query:  SDEFRLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDYAGI
        S+E RLVS  ID++G+LPRKYT  GQG +K+ SPPLEWYNVP+GTKTLALVV+DIDAPDP GP+VPWT+WVVV+IPP +KGLPE +SG ++       GI
Subjt:  SDEFRLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDYAGI

Query:  QEGNNDEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGH
        +EGNND K+PGWR P +PSHGHRF+FKL+ALDD+  +G+  TKE+LL AIEGH
Subjt:  QEGNNDEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGH

AT5G01300.2 PEBP (phosphatidylethanolamine-binding protein) family protein8.9e-4567.83Show/hide
Query:  YNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDYAGIQEGNNDEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLG
        YNVP+GTKTLALVV+DIDAPDP GP+VPWT+WVVV+IPP +KGLPE +SG ++       GI+EGNND K+PGWR P +PSHGHRF+FKL+ALDD+  +G
Subjt:  YNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDYAGIQEGNNDEKVPGWRAPTMPSHGHRFEFKLYALDDQLNLG

Query:  NKATKEKLLDAIEGH
        +  TKE+LL AIEGH
Subjt:  NKATKEKLLDAIEGH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGATAGCGATGAGTTCAGGCTGGTGTCTTCGGCGATAGATCATGAAGGAAGATTGCCGAGAAAGTACACGTCGGAAGGTCAAGGTGCGCAGAAGAACAAATCTCC
GCCATTGGAGTGGTACAATGTACCTCAAGGAACCAAGACTCTTGCCCTCGTGGTGCAGGACATCGACGCACCGGACCCGGAGGGGCCGATCGTGCCGTGGACCTTGTGGG
TGGTCGTGAACATACCGCCGAGCTTGAAGGGCCTGCCGGAAAATTTCTCTGGGAAACAAGAGGCGCTCGGCGGTGATTACGCCGGCATCCAAGAGGGAAACAACGACGAG
AAAGTCCCTGGTTGGCGCGCCCCCACCATGCCTTCCCACGGCCACCGGTTCGAGTTCAAGCTCTACGCTTTGGACGACCAGTTGAACCTCGGCAATAAGGCGACGAAGGA
GAAGCTGTTAGACGCAATAGAAGGGCACAGCATAAGCCTCCTCGCCGTGCACGGCGTCGTCTCCGATCATCAGTTCCTCGTCCGGCATCCTCAATGGCATAAACAGCCTG
ATAAACAGGAGTATAATCGTCGTGGATACAATGTTCCATCCTATTATAAACATGGCTCCGGCCAGTTGTTTCAGAAACTGAACGCCGCCGGTACCTCCGTAAAAGGCACC
GCGTGTGTTGGGGACGG
mRNA sequenceShow/hide mRNA sequence
ATGGCCGATAGCGATGAGTTCAGGCTGGTGTCTTCGGCGATAGATCATGAAGGAAGATTGCCGAGAAAGTACACGTCGGAAGGTCAAGGTGCGCAGAAGAACAAATCTCC
GCCATTGGAGTGGTACAATGTACCTCAAGGAACCAAGACTCTTGCCCTCGTGGTGCAGGACATCGACGCACCGGACCCGGAGGGGCCGATCGTGCCGTGGACCTTGTGGG
TGGTCGTGAACATACCGCCGAGCTTGAAGGGCCTGCCGGAAAATTTCTCTGGGAAACAAGAGGCGCTCGGCGGTGATTACGCCGGCATCCAAGAGGGAAACAACGACGAG
AAAGTCCCTGGTTGGCGCGCCCCCACCATGCCTTCCCACGGCCACCGGTTCGAGTTCAAGCTCTACGCTTTGGACGACCAGTTGAACCTCGGCAATAAGGCGACGAAGGA
GAAGCTGTTAGACGCAATAGAAGGGCACAGCATAAGCCTCCTCGCCGTGCACGGCGTCGTCTCCGATCATCAGTTCCTCGTCCGGCATCCTCAATGGCATAAACAGCCTG
ATAAACAGGAGTATAATCGTCGTGGATACAATGTTCCATCCTATTATAAACATGGCTCCGGCCAGTTGTTTCAGAAACTGAACGCCGCCGGTACCTCCGTAAAAGGCACC
GCGTGTGTTGGGGACGG
Protein sequenceShow/hide protein sequence
MADSDEFRLVSSAIDHEGRLPRKYTSEGQGAQKNKSPPLEWYNVPQGTKTLALVVQDIDAPDPEGPIVPWTLWVVVNIPPSLKGLPENFSGKQEALGGDYAGIQEGNNDE
KVPGWRAPTMPSHGHRFEFKLYALDDQLNLGNKATKEKLLDAIEGHSISLLAVHGVVSDHQFLVRHPQWHKQPDKQEYNRRGYNVPSYYKHGSGQLFQKLNAAGTSVKGT
ACVGDG