; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi01G014840 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi01G014840
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionEncodes a protein involved in salt tolerance, names SIS (Salt Induced Serine rich).
Genome locationchr01:12937430..12939517
RNA-Seq ExpressionLsi01G014840
SyntenyLsi01G014840
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008448572.1 PREDICTED: uncharacterized protein LOC103490707 isoform X1 [Cucumis melo]7.1e-6676.22Show/hide
Query:  MEGKKHAGLGSSSSSLTTDLFGSKETPYSSTTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSIYQDQ
        MEGKKH GLGSSSSSLTTDLFGS ET YSSTTGIFGSIFAPSSKVLGR+SLLSQTKE ER SVNEPW PNAEAQDD+ANHTQKESQEMKNKDMSSIYQDQ
Subjt:  MEGKKHAGLGSSSSSLTTDLFGSKETPYSSTTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSIYQDQ

Query:  RAQPCHLSSSIYYGGQDVYTHPQNSHNSEVNPAVNSTDFLTTKHVSTFLFEKYLEWFPPCSNIMYKKEGGEDDSGSASRGNWWQG
         AQPCHLSSSIYYGGQDVYTHPQNS+NS  N A                               YKKEGGEDDSGSASRGNWWQG
Subjt:  RAQPCHLSSSIYYGGQDVYTHPQNSHNSEVNPAVNSTDFLTTKHVSTFLFEKYLEWFPPCSNIMYKKEGGEDDSGSASRGNWWQG

XP_008448581.1 PREDICTED: uncharacterized protein LOC103490707 isoform X2 [Cucumis melo]7.1e-6676.22Show/hide
Query:  MEGKKHAGLGSSSSSLTTDLFGSKETPYSSTTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSIYQDQ
        MEGKKH GLGSSSSSLTTDLFGS ET YSSTTGIFGSIFAPSSKVLGR+SLLSQTKE ER SVNEPW PNAEAQDD+ANHTQKESQEMKNKDMSSIYQDQ
Subjt:  MEGKKHAGLGSSSSSLTTDLFGSKETPYSSTTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSIYQDQ

Query:  RAQPCHLSSSIYYGGQDVYTHPQNSHNSEVNPAVNSTDFLTTKHVSTFLFEKYLEWFPPCSNIMYKKEGGEDDSGSASRGNWWQG
         AQPCHLSSSIYYGGQDVYTHPQNS+NS  N A                               YKKEGGEDDSGSASRGNWWQG
Subjt:  RAQPCHLSSSIYYGGQDVYTHPQNSHNSEVNPAVNSTDFLTTKHVSTFLFEKYLEWFPPCSNIMYKKEGGEDDSGSASRGNWWQG

XP_011653659.1 uncharacterized protein LOC101215701 isoform X2 [Cucumis sativus]1.7e-6475.14Show/hide
Query:  MEGKKHAGLGSSSSSLTTDLFGSKETPYSSTTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSIYQDQ
        MEGKKH GLGSSSSSLTTDLFGS ET YSSTTGIFGSIFAPSSKVLGR+SLLS TKE ER SVNEPW PNA AQDD+ANHTQKESQE KNKDMSSIYQDQ
Subjt:  MEGKKHAGLGSSSSSLTTDLFGSKETPYSSTTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSIYQDQ

Query:  RAQPCHLSSSIYYGGQDVYTHPQNSHNSEVNPAVNSTDFLTTKHVSTFLFEKYLEWFPPCSNIMYKKEGGEDDSGSASRGNWWQG
        RAQPCHLSSSIYYGGQDVYTHPQNS+NS  N A                               YKKEGGEDDSGSASRGNWWQG
Subjt:  RAQPCHLSSSIYYGGQDVYTHPQNSHNSEVNPAVNSTDFLTTKHVSTFLFEKYLEWFPPCSNIMYKKEGGEDDSGSASRGNWWQG

XP_038882075.1 uncharacterized protein LOC120073353 isoform X1 [Benincasa hispida]8.1e-7079.46Show/hide
Query:  MEGKKHAGLGSSSSSLTTDLFGSKETPYSSTTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSIYQDQ
        MEGKKH GLGSSSSSLTTDLFGSKET YSSTTGIFGSIFAPSSKVLGRDSLLSQTKEGER SVNEPWIPNAEAQDD+ANHTQKES EMKNKDMSSIYQDQ
Subjt:  MEGKKHAGLGSSSSSLTTDLFGSKETPYSSTTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSIYQDQ

Query:  RAQPCHLSSSIYYGGQDVYTHPQNSHNSEVNPAVNSTDFLTTKHVSTFLFEKYLEWFPPCSNIMYKKEGGEDDSGSASRGNWWQG
        RAQPCHLSSSIYYGGQDVYTHPQNS+NSEVN A                               YKKEGGEDDSGSASRGNWWQG
Subjt:  RAQPCHLSSSIYYGGQDVYTHPQNSHNSEVNPAVNSTDFLTTKHVSTFLFEKYLEWFPPCSNIMYKKEGGEDDSGSASRGNWWQG

XP_038882076.1 uncharacterized protein LOC120073353 isoform X2 [Benincasa hispida]7.5e-6877.84Show/hide
Query:  MEGKKHAGLGSSSSSLTTDLFGSKETPYSSTTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSIYQDQ
        MEGKKH GLGSSSSSLTTDLFGSKET YSSTTGIFGSIFAPSSKVLGRDSLLSQTKEGER SVNEPWIPNAEAQDD+ANHTQKES EMKNKDMSSIYQDQ
Subjt:  MEGKKHAGLGSSSSSLTTDLFGSKETPYSSTTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSIYQDQ

Query:  RAQPCHLSSSIYYGGQDVYTHPQNSHNSEVNPAVNSTDFLTTKHVSTFLFEKYLEWFPPCSNIMYKKEGGEDDSGSASRGNWWQG
        RAQPCHLSSSIYYGGQDVYTHPQNS+NSE                                   YKKEGGEDDSGSASRGNWWQG
Subjt:  RAQPCHLSSSIYYGGQDVYTHPQNSHNSEVNPAVNSTDFLTTKHVSTFLFEKYLEWFPPCSNIMYKKEGGEDDSGSASRGNWWQG

TrEMBL top hitse value%identityAlignment
A0A0A0L143 Uncharacterized protein8.4e-6575.14Show/hide
Query:  MEGKKHAGLGSSSSSLTTDLFGSKETPYSSTTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSIYQDQ
        MEGKKH GLGSSSSSLTTDLFGS ET YSSTTGIFGSIFAPSSKVLGR+SLLS TKE ER SVNEPW PNA AQDD+ANHTQKESQE KNKDMSSIYQDQ
Subjt:  MEGKKHAGLGSSSSSLTTDLFGSKETPYSSTTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSIYQDQ

Query:  RAQPCHLSSSIYYGGQDVYTHPQNSHNSEVNPAVNSTDFLTTKHVSTFLFEKYLEWFPPCSNIMYKKEGGEDDSGSASRGNWWQG
        RAQPCHLSSSIYYGGQDVYTHPQNS+NS  N A                               YKKEGGEDDSGSASRGNWWQG
Subjt:  RAQPCHLSSSIYYGGQDVYTHPQNSHNSEVNPAVNSTDFLTTKHVSTFLFEKYLEWFPPCSNIMYKKEGGEDDSGSASRGNWWQG

A0A1S3BK03 uncharacterized protein LOC103490707 isoform X23.4e-6676.22Show/hide
Query:  MEGKKHAGLGSSSSSLTTDLFGSKETPYSSTTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSIYQDQ
        MEGKKH GLGSSSSSLTTDLFGS ET YSSTTGIFGSIFAPSSKVLGR+SLLSQTKE ER SVNEPW PNAEAQDD+ANHTQKESQEMKNKDMSSIYQDQ
Subjt:  MEGKKHAGLGSSSSSLTTDLFGSKETPYSSTTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSIYQDQ

Query:  RAQPCHLSSSIYYGGQDVYTHPQNSHNSEVNPAVNSTDFLTTKHVSTFLFEKYLEWFPPCSNIMYKKEGGEDDSGSASRGNWWQG
         AQPCHLSSSIYYGGQDVYTHPQNS+NS  N A                               YKKEGGEDDSGSASRGNWWQG
Subjt:  RAQPCHLSSSIYYGGQDVYTHPQNSHNSEVNPAVNSTDFLTTKHVSTFLFEKYLEWFPPCSNIMYKKEGGEDDSGSASRGNWWQG

A0A1S3BKM1 uncharacterized protein LOC103490707 isoform X13.4e-6676.22Show/hide
Query:  MEGKKHAGLGSSSSSLTTDLFGSKETPYSSTTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSIYQDQ
        MEGKKH GLGSSSSSLTTDLFGS ET YSSTTGIFGSIFAPSSKVLGR+SLLSQTKE ER SVNEPW PNAEAQDD+ANHTQKESQEMKNKDMSSIYQDQ
Subjt:  MEGKKHAGLGSSSSSLTTDLFGSKETPYSSTTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSIYQDQ

Query:  RAQPCHLSSSIYYGGQDVYTHPQNSHNSEVNPAVNSTDFLTTKHVSTFLFEKYLEWFPPCSNIMYKKEGGEDDSGSASRGNWWQG
         AQPCHLSSSIYYGGQDVYTHPQNS+NS  N A                               YKKEGGEDDSGSASRGNWWQG
Subjt:  RAQPCHLSSSIYYGGQDVYTHPQNSHNSEVNPAVNSTDFLTTKHVSTFLFEKYLEWFPPCSNIMYKKEGGEDDSGSASRGNWWQG

A0A5D3DIY4 Uncharacterized protein3.4e-6676.22Show/hide
Query:  MEGKKHAGLGSSSSSLTTDLFGSKETPYSSTTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSIYQDQ
        MEGKKH GLGSSSSSLTTDLFGS ET YSSTTGIFGSIFAPSSKVLGR+SLLSQTKE ER SVNEPW PNAEAQDD+ANHTQKESQEMKNKDMSSIYQDQ
Subjt:  MEGKKHAGLGSSSSSLTTDLFGSKETPYSSTTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSIYQDQ

Query:  RAQPCHLSSSIYYGGQDVYTHPQNSHNSEVNPAVNSTDFLTTKHVSTFLFEKYLEWFPPCSNIMYKKEGGEDDSGSASRGNWWQG
         AQPCHLSSSIYYGGQDVYTHPQNS+NS  N A                               YKKEGGEDDSGSASRGNWWQG
Subjt:  RAQPCHLSSSIYYGGQDVYTHPQNSHNSEVNPAVNSTDFLTTKHVSTFLFEKYLEWFPPCSNIMYKKEGGEDDSGSASRGNWWQG

A0A6J1DDV5 uncharacterized protein LOC1110194805.8e-5869.73Show/hide
Query:  MEGKKHAGLGSSSSSLTTDLFGSKETPYSSTTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSIYQDQ
        MEGKK  G   SSSSLT DLFGSKET YSSTTGIFGSIFAPSSKVLG +SLLSQ KEGER SVNEPWIPN EA+DD+ANH QKESQEMKNKD+SSIYQ+Q
Subjt:  MEGKKHAGLGSSSSSLTTDLFGSKETPYSSTTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSIYQDQ

Query:  RAQPCHLSSSIYYGGQDVYTHPQNSHNSEVNPAVNSTDFLTTKHVSTFLFEKYLEWFPPCSNIMYKKEGGEDDSGSASRGNWWQG
        RAQPCHLSSSIYYGGQDVY+  QNSHNS VN                                ++KK+GGEDDSGSASRGNWWQG
Subjt:  RAQPCHLSSSIYYGGQDVYTHPQNSHNSEVNPAVNSTDFLTTKHVSTFLFEKYLEWFPPCSNIMYKKEGGEDDSGSASRGNWWQG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G39855.2 unknown protein4.8e-1234.24Show/hide
Query:  KKHAGLGSSSSSLTTDLFGSK--ETPYSSTTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSIYQDQR
        K  +G  SSSSS    +FG +   +  SSTTG+F SIF P S V           +G   S N      A+ Q  +     +  +  KNK+  S   ++ 
Subjt:  KKHAGLGSSSSSLTTDLFGSK--ETPYSSTTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSIYQDQR

Query:  AQPCHLSSSIYYGGQDVYTHPQNSHNSEVNPAVNSTDFLTTKHVSTFLFEKYLEWFPPCSNIMYKKEGGEDDSGSASRGNWWQG
          PC+LSSSIYYGGQD Y+      +S  NP                                YKK+G E DS SASRGNWW+G
Subjt:  AQPCHLSSSIYYGGQDVYTHPQNSHNSEVNPAVNSTDFLTTKHVSTFLFEKYLEWFPPCSNIMYKKEGGEDDSGSASRGNWWQG

AT3G55646.1 unknown protein8.2e-1232.8Show/hide
Query:  KKHAGLGSSSSSLTT--DLFGSK--ETPYSSTTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSIYQD
        KK     SSSSSL++   +FG +   +  SS TG+F SIF P S     D L  Q     +G   +   PNA            + +    K+  S Y +
Subjt:  KKHAGLGSSSSSLTT--DLFGSK--ETPYSSTTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSIYQD

Query:  QRAQPCHLSSSIYYGGQDVYTHPQNSHNSEVNPAVNSTDFLTTKHVSTFLFEKYLEWFPPCSNIMYKKEGGEDDSGSASRGNWWQG
        +   PCHLSSS+YYGGQ+ Y                 +   TT H +                  YKK+G E DS  ASRGNWW+G
Subjt:  QRAQPCHLSSSIYYGGQDVYTHPQNSHNSEVNPAVNSTDFLTTKHVSTFLFEKYLEWFPPCSNIMYKKEGGEDDSGSASRGNWWQG

AT5G02020.1 Encodes a protein involved in salt tolerance, names SIS (Salt Induced Serine rich).3.1e-2744.44Show/hide
Query:  MEGKKHAGLGS---SSSSLTTDLFGSKETPYS-STTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSI
        MEG+K     S   SSSSLT++LFGS+E P S S++GI GSIF P SKVLGR+S+  +T  G  G  NE        +          ++E +    S  
Subjt:  MEGKKHAGLGS---SSSSLTTDLFGSKETPYS-STTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSI

Query:  YQDQRAQPCHLSSSIYYGGQDVYTHPQNSHNSEVNPAVNSTDFLTTKHVSTFLFEKYLEWFPPCSNIMYKKEGGEDDSGSASRGNWWQG
         QDQR QPCHLSSSIYYGG DVY  PQNS ++  N                                  KK+GGEDDSGSASRGNWWQG
Subjt:  YQDQRAQPCHLSSSIYYGGQDVYTHPQNSHNSEVNPAVNSTDFLTTKHVSTFLFEKYLEWFPPCSNIMYKKEGGEDDSGSASRGNWWQG

AT5G02020.2 Encodes a protein involved in salt tolerance, names SIS (Salt Induced Serine rich).1.4e-1949.61Show/hide
Query:  MEGKKHAGLGS---SSSSLTTDLFGSKETPYS-STTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSI
        MEG+K     S   SSSSLT++LFGS+E P S S++GI GSIF P SKVLGR+S+  +T  G  G  NE        +          ++E +    S  
Subjt:  MEGKKHAGLGS---SSSSLTTDLFGSKETPYS-STTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSI

Query:  YQDQRAQPCHLSSSIYYGGQDVYTHPQNS
         QDQR QPCHLSSSIYYGG DVY  PQNS
Subjt:  YQDQRAQPCHLSSSIYYGGQDVYTHPQNS

AT5G59080.1 unknown protein1.9e-1333.33Show/hide
Query:  MEGKKHAGLGSS-SSSLTTDLFGSKE-TPYSSTTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSIYQ
        MEGK   G  SS SSS T +LFGSK+ +P SS++GIF ++F   SK   RD   S +K G                          SQ  + + +++  Q
Subjt:  MEGKKHAGLGSS-SSSLTTDLFGSKE-TPYSSTTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSIYQ

Query:  DQRAQPCHLSSSIYYGGQDVYTHPQNSHNSEVNPAVNSTDFLTTKHVSTFLFEKYLEWFPPCSNIMYKKEGGEDDSG-----SASRGNWWQG
        + R +PCHLSSS+YYGGQDVY     +                             + +PP  N   ++  GEDD+        SRGNWWQG
Subjt:  DQRAQPCHLSSSIYYGGQDVYTHPQNSHNSEVNPAVNSTDFLTTKHVSTFLFEKYLEWFPPCSNIMYKKEGGEDDSG-----SASRGNWWQG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGAAAGAAGCATGCGGGTCTCGGATCCTCTTCTTCTTCTCTCACTACTGACCTCTTTGGCTCCAAAGAGACTCCCTATTCTTCCACCACTGGGATTTTTGGTTC
TATTTTTGCACCTTCTTCCAAGGTGTTGGGGAGAGACTCTCTGCTCTCTCAGACCAAAGAGGGAGAGAGGGGTTCTGTAAATGAGCCATGGATCCCCAACGCTGAAGCTC
AAGATGATTCTGCTAATCATACACAAAAGGAGAGTCAGGAGATGAAGAATAAAGATATGAGTTCCATTTATCAGGATCAAAGAGCACAACCATGTCATCTTAGCTCATCA
ATCTATTATGGTGGCCAAGATGTTTACACTCATCCTCAAAATTCCCACAATTCCGAGGTGAACCCTGCGGTGAATTCAACCGATTTCCTAACCACTAAACATGTTAGTAC
TTTTCTGTTTGAAAAATACTTGGAATGGTTTCCCCCTTGTTCAAACATAATGTACAAGAAGGAAGGGGGAGAAGATGATTCTGGGAGTGCTTCAAGAGGAAATTGGTGGC
AAGGTATGAAATAG
mRNA sequenceShow/hide mRNA sequence
AAAAAAAAAAGAAAAAAAAAAAAAACTTTTAGAGGGAGAGAGAAAGAGCGAGAGATCGAGGCAAATGAGAAATTCGCAGAGATTTTGAAAGAGGTTTTAGTTTTGATTCT
GTTTTCTCCATTTTCACAAACCTCTTTTCTGCCTTATAAATTGGTGTAAGGAAGTGGCAGTTTCATCATCATTTCAACTCTGTATCCCAGCATTCACGTGTTCTTCCTTC
TCTTTTCTTCTTTTTCCTGCTCCTCCTCTGTTTTCTTCCCATTTACGTTCCTCCTACTTTTCTTTCCCGGTGAAACAACACAAACCCATTAATCTTTGTCCCTCCATTAA
AGGGGTTCTTCAGGACTCAAGTTTTATTGAGGGGTAACTCTGTTTTCCACTGATAACAATCAAAATGGAAGGAAAGAAGCATGCGGGTCTCGGATCCTCTTCTTCTTCTC
TCACTACTGACCTCTTTGGCTCCAAAGAGACTCCCTATTCTTCCACCACTGGGATTTTTGGTTCTATTTTTGCACCTTCTTCCAAGGTGTTGGGGAGAGACTCTCTGCTC
TCTCAGACCAAAGAGGGAGAGAGGGGTTCTGTAAATGAGCCATGGATCCCCAACGCTGAAGCTCAAGATGATTCTGCTAATCATACACAAAAGGAGAGTCAGGAGATGAA
GAATAAAGATATGAGTTCCATTTATCAGGATCAAAGAGCACAACCATGTCATCTTAGCTCATCAATCTATTATGGTGGCCAAGATGTTTACACTCATCCTCAAAATTCCC
ACAATTCCGAGGTGAACCCTGCGGTGAATTCAACCGATTTCCTAACCACTAAACATGTTAGTACTTTTCTGTTTGAAAAATACTTGGAATGGTTTCCCCCTTGTTCAAAC
ATAATGTACAAGAAGGAAGGGGGAGAAGATGATTCTGGGAGTGCTTCAAGAGGAAATTGGTGGCAAGGTATGAAATAG
Protein sequenceShow/hide protein sequence
MEGKKHAGLGSSSSSLTTDLFGSKETPYSSTTGIFGSIFAPSSKVLGRDSLLSQTKEGERGSVNEPWIPNAEAQDDSANHTQKESQEMKNKDMSSIYQDQRAQPCHLSSS
IYYGGQDVYTHPQNSHNSEVNPAVNSTDFLTTKHVSTFLFEKYLEWFPPCSNIMYKKEGGEDDSGSASRGNWWQGMK