CuGenDBv2

Cla97C02G036150 (gene) of Watermelon (97103) v2.5 genome

Gene ID: Cla97C02G036150
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Metal-independent phosphoserine phosphatase
Genome location: Cla97Chr02:15988812..15998518
RNA-Seq Expression: Cla97C02G036150
Synteny: Cla97C02G036150
Gene Ontology terms: NA
InterPro domains: IPR013078 - Histidine phosphatase superfamily, clade-1
                  IPR029033 - Histidine phosphatase superfamily
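
The genome location in this record is a "chromosome:start..end" string. As a minimal sketch of how such a string could be split and the locus span computed, the Python below assumes the coordinates are 1-based and inclusive; the page does not state its coordinate convention, so treat that as an assumption. The helper names parse_location and span_length are hypothetical and are not part of any CuGenDB tool.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Cla97Chr02:15988812..15998518")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the locus spans 9707 bp on Cla97Chr02. That is much longer than the mRNA sequence listed in this record, which is what one would expect if the gene contains introns.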


Homology

GenBank top hits
KAG6604968.1 hypothetical protein SDJN03_02285, partial [Cucurbita argyrosperma subsp. sororia] | e value: 7.4e-91 | % identity: 76.29
Query:  MATASFLRNRYWILRHGKSIPNEKGLIVSSIENGILPEYQLAPEGVGQAHLAGEQFLKATQIQFMSLGFSSPEKYIFIPQKVKTIHTAKVASSVLNLPFE
        MATA FLRN+YWILRHGKSIPNEKGLIVSSIENG LPEYQLA EGVGQA LAGEQFLK  +   + L      +  + P   +TIHTAKVA+S LNLPFE
Subjt:  MATASFLRNRYWILRHGKSIPNEKGLIVSSIENGILPEYQLAPEGVGQAHLAGEQFLKATQIQFMSLGFSSPEKYIFIPQKVKTIHTAKVASSVLNLPFE

Query:  GPQCKMMEDLRERYFGPSFELLSHNKYAEIWALDEEDPFKRPEGGESVADVASRFAKAILQIESLFQGCAILVVSHGDPLQIFQTVVGSVVKQEDESSSD
         PQCKMM+DLRERYFGPSFELLSH+KYAEIWALDEEDPFKRPEGGESV DVASR AKA+LQ+ES FQGCA+LVVSHGDPLQIFQTV+G+ +  E+ S SD
Subjt:  GPQCKMMEDLRERYFGPSFELLSHNKYAEIWALDEEDPFKRPEGGESVADVASRFAKAILQIESLFQGCAILVVSHGDPLQIFQTVVGSVVKQEDESSSD

Query:  DLSSTLQAVITKPILSKHRKFALLTGELRSVI
        +L+S LQA ITKPILS+HRKFALLTGELR+V+
Subjt:  DLSSTLQAVITKPILSKHRKFALLTGELRSVI

XP_008460226.1 PREDICTED: uncharacterized protein LOC103499109 [Cucumis melo] | e value: 6.7e-92 | % identity: 75
Query:  IKNTEEKQRG-GVQGMATASFLRNRYWILRHGKSIPNEKGLIVSSIENGILPEYQLAPEGVGQAHLAGEQFLKATQIQFMSLGFSSPEKYIFIPQKVKTI
        I NT E QRG   QGMATASFLRNRYWILRHGKSIPNEKGLIVSSIENGILPEYQL PEGV QA LAG QFLK  +   + L      +  + P   +TI
Subjt:  IKNTEEKQRG-GVQGMATASFLRNRYWILRHGKSIPNEKGLIVSSIENGILPEYQLAPEGVGQAHLAGEQFLKATQIQFMSLGFSSPEKYIFIPQKVKTI

Query:  HTAKVASSVLNLPFEGPQCKMMEDLRERYFGPSFELLSHNKYAEIWALDEEDPFKRPEGGESVADVASRFAKAILQIESLFQGCAILVVSHGDPLQIFQT
        HTAKVA+SVLNLPFE PQCKM+E+LRERYFGPSFEL SH KY +IWALDEEDPFKRPEGGESV DVASR A+AIL+IESLFQGCAILVVSHGDPLQIFQ 
Subjt:  HTAKVASSVLNLPFEGPQCKMMEDLRERYFGPSFELLSHNKYAEIWALDEEDPFKRPEGGESVADVASRFAKAILQIESLFQGCAILVVSHGDPLQIFQT

Query:  VVGSVVKQEDESSSDDLSSTLQAVITKPILSKHRKFALLTGELR
        ++GS  KQ+  +SS+DLSS  QA+ITKP+LS HR+FALLTGELR
Subjt:  VVGSVVKQEDESSSDDLSSTLQAVITKPILSKHRKFALLTGELR

XP_022140404.1 uncharacterized protein LOC111011089 [Momordica charantia] | e value: 4.7e-93 | % identity: 78.54
Query:  MATASFLRNRYWILRHGKSIPNEKGLIVSSIENGILPEYQLAPEGVGQAHLAGEQFLKATQIQFMSLGFSSPEKYIFIPQKVKTIHTAKVASSVLNLPFE
        M TASFLRNRYW+LRHGKSIPNEKGLIVSSIENG LPEYQLA EGVGQA LAGEQFLK  +   +SL      +  + P   +TIHTAKVA+S LN+PFE
Subjt:  MATASFLRNRYWILRHGKSIPNEKGLIVSSIENGILPEYQLAPEGVGQAHLAGEQFLKATQIQFMSLGFSSPEKYIFIPQKVKTIHTAKVASSVLNLPFE

Query:  GPQCKMMEDLRERYFGPSFELLSHNKYAEIWALDEEDPFKRP-EGGESVADVASRFAKAILQIESLFQGCAILVVSHGDPLQIFQTVVGSVVKQEDESSS
        GPQCKM+EDLRERYFGPSFEL+SH+KYA+IWALDEEDPFKRP EGGESV DVASR AKAILQIES FQGCAIL+VSHGDPLQIFQTVVG+  KQEDESSS
Subjt:  GPQCKMMEDLRERYFGPSFELLSHNKYAEIWALDEEDPFKRP-EGGESVADVASRFAKAILQIESLFQGCAILVVSHGDPLQIFQTVVGSVVKQEDESSS

Query:  DDLSSTLQAVITKPILSKHRKFALLTGELRSVI
        D+L S LQAVITK +LS+HRKFALLTGELR+V+
Subjt:  DDLSSTLQAVITKPILSKHRKFALLTGELRSVI

XP_023532426.1 uncharacterized protein LOC111794603 [Cucurbita pepo subsp. pepo] | e value: 5.1e-92 | % identity: 77.59
Query:  MATASFLRNRYWILRHGKSIPNEKGLIVSSIENGILPEYQLAPEGVGQAHLAGEQFLKATQIQFMSLGFSSPEKYIFIPQKVKTIHTAKVASSVLNLPFE
        MATA FLRNRYWILRHGKSIPNEKGLIVSSIENG LPEYQLA EGVGQA LAGEQFLK  +   + L      +  + P   +TIHTAKVA+S LNLPFE
Subjt:  MATASFLRNRYWILRHGKSIPNEKGLIVSSIENGILPEYQLAPEGVGQAHLAGEQFLKATQIQFMSLGFSSPEKYIFIPQKVKTIHTAKVASSVLNLPFE

Query:  GPQCKMMEDLRERYFGPSFELLSHNKYAEIWALDEEDPFKRPEGGESVADVASRFAKAILQIESLFQGCAILVVSHGDPLQIFQTVVGSVVKQEDESSSD
         PQCKMMEDLRERYFGPSFELLSH+KYAEIWALDEEDPFKRPEGGESV DVASR AKA+LQ+ES FQGCA+LVVSHGDPLQIFQTV+G+    E+ SSSD
Subjt:  GPQCKMMEDLRERYFGPSFELLSHNKYAEIWALDEEDPFKRPEGGESVADVASRFAKAILQIESLFQGCAILVVSHGDPLQIFQTVVGSVVKQEDESSSD

Query:  DLSSTLQAVITKPILSKHRKFALLTGELRSVI
        +L+S LQA ITKPILS+HRKFALLTGELR+V+
Subjt:  DLSSTLQAVITKPILSKHRKFALLTGELRSVI

XP_038901554.1 uncharacterized protein LOC120088380 isoform X2 [Benincasa hispida] | e value: 4.5e-96 | % identity: 81.9
Query:  MATASFLRNRYWILRHGKSIPNEKGLIVSSIENGILPEYQLAPEGVGQAHLAGEQFLKATQIQFMSLGFSSPEKYIFIPQKVKTIHTAKVASSVLNLPFE
        MATASFL N+YWILRHGKSIPNEKGLIVSSIENGILPEYQLAPEGV QA LAGEQFLK  +    SL      +  + P   +TIHTAKVA+SVLNL FE
Subjt:  MATASFLRNRYWILRHGKSIPNEKGLIVSSIENGILPEYQLAPEGVGQAHLAGEQFLKATQIQFMSLGFSSPEKYIFIPQKVKTIHTAKVASSVLNLPFE

Query:  GPQCKMMEDLRERYFGPSFELLSHNKYAEIWALDEEDPFKRPEGGESVADVASRFAKAILQIESLFQGCAILVVSHGDPLQIFQTVVGSVVKQEDESSSD
        GPQCKMMEDLRER FGPSFELLSH+KYAEIWALDEEDPFKRPEGGESV DVASR AKAILQIESLFQGCAILVVSHGDPLQIFQ VVGS  KQED S+S+
Subjt:  GPQCKMMEDLRERYFGPSFELLSHNKYAEIWALDEEDPFKRPEGGESVADVASRFAKAILQIESLFQGCAILVVSHGDPLQIFQTVVGSVVKQEDESSSD

Query:  DLSSTLQAVITKPILSKHRKFALLTGELRSVI
        DL STLQA ITK ILSKHRKFALLTGELRSV+
Subjt:  DLSSTLQAVITKPILSKHRKFALLTGELRSVI

TrEMBL top hits
A0A0A0KQV4 Uncharacterized protein | e value: 4.7e-91 | % identity: 74.79
Query:  EKQRG-GVQGMATASFLRNRYWILRHGKSIPNEKGLIVSSIENGILPEYQLAPEGVGQAHLAGEQFLKATQIQFMSLGFSSPEKYIFIPQKVKTIHTAKV
        E +RG  VQGMATASFLRNRYWILRHGKSIPNEKGLIVSS ENGILPEYQLAPEGV QA LAG QFLK  +   + L      +  + P   +TIHTAKV
Subjt:  EKQRG-GVQGMATASFLRNRYWILRHGKSIPNEKGLIVSSIENGILPEYQLAPEGVGQAHLAGEQFLKATQIQFMSLGFSSPEKYIFIPQKVKTIHTAKV

Query:  ASSVLNLPFEGPQCKMMEDLRERYFGPSFELLSHNKYAEIWALDEEDPFKRPEGGESVADVASRFAKAILQIESLFQGCAILVVSHGDPLQIFQTVVGSV
        A+SVLNLPFEGPQCKM+E+LRERYFGPSFELLSH+KY EIWALDEED FKRPEGGESV DVASR AKAIL+IESLFQGCAILVVSHGDPLQI Q ++GS 
Subjt:  ASSVLNLPFEGPQCKMMEDLRERYFGPSFELLSHNKYAEIWALDEEDPFKRPEGGESVADVASRFAKAILQIESLFQGCAILVVSHGDPLQIFQTVVGSV

Query:  VKQEDESSSDDLSSTLQAVITKPILSKHRKFALLTGELRSVI
         KQ+  + S+DLSS L+A++TKPILS HR+FALLTGELR ++
Subjt:  VKQEDESSSDDLSSTLQAVITKPILSKHRKFALLTGELRSVI

A0A1S3CBK1 uncharacterized protein LOC103499109 | e value: 3.3e-92 | % identity: 75
Query:  IKNTEEKQRG-GVQGMATASFLRNRYWILRHGKSIPNEKGLIVSSIENGILPEYQLAPEGVGQAHLAGEQFLKATQIQFMSLGFSSPEKYIFIPQKVKTI
        I NT E QRG   QGMATASFLRNRYWILRHGKSIPNEKGLIVSSIENGILPEYQL PEGV QA LAG QFLK  +   + L      +  + P   +TI
Subjt:  IKNTEEKQRG-GVQGMATASFLRNRYWILRHGKSIPNEKGLIVSSIENGILPEYQLAPEGVGQAHLAGEQFLKATQIQFMSLGFSSPEKYIFIPQKVKTI

Query:  HTAKVASSVLNLPFEGPQCKMMEDLRERYFGPSFELLSHNKYAEIWALDEEDPFKRPEGGESVADVASRFAKAILQIESLFQGCAILVVSHGDPLQIFQT
        HTAKVA+SVLNLPFE PQCKM+E+LRERYFGPSFEL SH KY +IWALDEEDPFKRPEGGESV DVASR A+AIL+IESLFQGCAILVVSHGDPLQIFQ 
Subjt:  HTAKVASSVLNLPFEGPQCKMMEDLRERYFGPSFELLSHNKYAEIWALDEEDPFKRPEGGESVADVASRFAKAILQIESLFQGCAILVVSHGDPLQIFQT

Query:  VVGSVVKQEDESSSDDLSSTLQAVITKPILSKHRKFALLTGELR
        ++GS  KQ+  +SS+DLSS  QA+ITKP+LS HR+FALLTGELR
Subjt:  VVGSVVKQEDESSSDDLSSTLQAVITKPILSKHRKFALLTGELR

A0A6J1CFM2 uncharacterized protein LOC111011089 | e value: 2.3e-93 | % identity: 78.54
Query:  MATASFLRNRYWILRHGKSIPNEKGLIVSSIENGILPEYQLAPEGVGQAHLAGEQFLKATQIQFMSLGFSSPEKYIFIPQKVKTIHTAKVASSVLNLPFE
        M TASFLRNRYW+LRHGKSIPNEKGLIVSSIENG LPEYQLA EGVGQA LAGEQFLK  +   +SL      +  + P   +TIHTAKVA+S LN+PFE
Subjt:  MATASFLRNRYWILRHGKSIPNEKGLIVSSIENGILPEYQLAPEGVGQAHLAGEQFLKATQIQFMSLGFSSPEKYIFIPQKVKTIHTAKVASSVLNLPFE

Query:  GPQCKMMEDLRERYFGPSFELLSHNKYAEIWALDEEDPFKRP-EGGESVADVASRFAKAILQIESLFQGCAILVVSHGDPLQIFQTVVGSVVKQEDESSS
        GPQCKM+EDLRERYFGPSFEL+SH+KYA+IWALDEEDPFKRP EGGESV DVASR AKAILQIES FQGCAIL+VSHGDPLQIFQTVVG+  KQEDESSS
Subjt:  GPQCKMMEDLRERYFGPSFELLSHNKYAEIWALDEEDPFKRP-EGGESVADVASRFAKAILQIESLFQGCAILVVSHGDPLQIFQTVVGSVVKQEDESSS

Query:  DDLSSTLQAVITKPILSKHRKFALLTGELRSVI
        D+L S LQAVITK +LS+HRKFALLTGELR+V+
Subjt:  DDLSSTLQAVITKPILSKHRKFALLTGELRSVI

A0A6J1G8R3 uncharacterized protein LOC111451758 isoform X1 | e value: 8.0e-91 | % identity: 76.29
Query:  MATASFLRNRYWILRHGKSIPNEKGLIVSSIENGILPEYQLAPEGVGQAHLAGEQFLKATQIQFMSLGFSSPEKYIFIPQKVKTIHTAKVASSVLNLPFE
        MATA FLRNRYWILRHGKSIPNEKGLIVSSIENG LPEYQLA EGVGQA LAGEQFLK  +   + L      +  + P   +TIHTAKVA+S LNLPFE
Subjt:  MATASFLRNRYWILRHGKSIPNEKGLIVSSIENGILPEYQLAPEGVGQAHLAGEQFLKATQIQFMSLGFSSPEKYIFIPQKVKTIHTAKVASSVLNLPFE

Query:  GPQCKMMEDLRERYFGPSFELLSHNKYAEIWALDEEDPFKRPEGGESVADVASRFAKAILQIESLFQGCAILVVSHGDPLQIFQTVVGSVVKQEDESSSD
         P CKMM+DLRERYFGPSFELLSH+KYAEIWALDEEDPFKRPEGGESV DVASR AKA+LQ+ES FQGCA+LVVSHGDPLQIFQTV+G+ +  E+ S SD
Subjt:  GPQCKMMEDLRERYFGPSFELLSHNKYAEIWALDEEDPFKRPEGGESVADVASRFAKAILQIESLFQGCAILVVSHGDPLQIFQTVVGSVVKQEDESSSD

Query:  DLSSTLQAVITKPILSKHRKFALLTGELRSVI
        +L+S LQA ITKPILS+HRKFALLTGELR+V+
Subjt:  DLSSTLQAVITKPILSKHRKFALLTGELRSVI

A0A6J1I4S5 uncharacterized protein LOC111469867 | e value: 6.1e-91 | % identity: 76.72
Query:  MATASFLRNRYWILRHGKSIPNEKGLIVSSIENGILPEYQLAPEGVGQAHLAGEQFLKATQIQFMSLGFSSPEKYIFIPQKVKTIHTAKVASSVLNLPFE
        MATA FLRNRYWILRHGKSIPNEKGLIVSSIENG LPEYQLA EGVGQA LAGEQFLK  +   + L      +  + P   +TIHTAKV +S LNLPFE
Subjt:  MATASFLRNRYWILRHGKSIPNEKGLIVSSIENGILPEYQLAPEGVGQAHLAGEQFLKATQIQFMSLGFSSPEKYIFIPQKVKTIHTAKVASSVLNLPFE

Query:  GPQCKMMEDLRERYFGPSFELLSHNKYAEIWALDEEDPFKRPEGGESVADVASRFAKAILQIESLFQGCAILVVSHGDPLQIFQTVVGSVVKQEDESSSD
         PQCKMMEDLRERYFGPSFELLSH+KYAEIWALDEEDPF RPEGGESV DVASR AKA+LQ+ES FQGCAILVVSHGDPLQIFQTV+G+ +  E+ S SD
Subjt:  GPQCKMMEDLRERYFGPSFELLSHNKYAEIWALDEEDPFKRPEGGESVADVASRFAKAILQIESLFQGCAILVVSHGDPLQIFQTVVGSVVKQEDESSSD

Query:  DLSSTLQAVITKPILSKHRKFALLTGELRSVI
        +L+S LQA ITKPILS+HRKFALLTGELR+V+
Subjt:  DLSSTLQAVITKPILSKHRKFALLTGELRSVI

SwissProt top hits
No hits found

Arabidopsis top hits
AT4G38370.1 Phosphoglycerate mutase family protein | e value: 2.4e-71 | % identity: 60.71
Query:  NRYWILRHGKSIPNEKGLIVSSIENGILPEYQLAPEGVGQAHLAGEQFLKATQIQFMSLGFSSPEKYIFIPQKVKTIHTAKVASSVLNLPFEGPQCKMME
        NRYW+LRHGKSIPNE+GL+VSS+ENG+LPEYQLAP+GV QA LAGE FL+    Q         +  I      +T HTA+V + VLNLPF+ PQCKMME
Subjt:  NRYWILRHGKSIPNEKGLIVSSIENGILPEYQLAPEGVGQAHLAGEQFLKATQIQFMSLGFSSPEKYIFIPQKVKTIHTAKVASSVLNLPFEGPQCKMME

Query:  DLRERYFGPSFELLSHNKYAEIWALDEEDPFKRPEGGESVADVASRFAKAILQIESLFQGCAILVVSHGDPLQIFQTVVGSVVKQEDESSSDDLSSTLQA
        DLRERYFGP+FEL SH+KY EIWALDE+DPF  PEGGES  DV SR A A+  +E+ +Q CAILVVSHGDPLQ+ Q V  S  +QE     D L+   Q 
Subjt:  DLRERYFGPSFELLSHNKYAEIWALDEEDPFKRPEGGESVADVASRFAKAILQIESLFQGCAILVVSHGDPLQIFQTVVGSVVKQEDESSSDDLSSTLQA

Query:  VITKPILSKHRKFALLTGELRSVI
             +LS+HRKFALLTGELR +I
Subjt:  VITKPILSKHRKFALLTGELRSVI


Sequences

CDS sequence
ATGGCGATTCGTAGCCCAGTAAAATTCAGTCCCCTTCTTTTCCAGAGGGAATCGAAAACTTCGGTTTTGGAGGATTCGTTCTTTCAATTCGCCGAATCAGAGTTCTCCAA
ATGCGACGGGATTAAAAATACGGAAGAGAAGCAAAGAGGGGGAGTCCAGGGAATGGCAACGGCGTCGTTTTTGCGGAACAGATACTGGATTCTCAGGCACGGCAAGAGCA
TCCCAAATGAGAAGGGCCTCATTGTTTCCTCAATAGAAAATGGTATCCTTCCGGAGTATCAACTGGCCCCAGAGGGTGTTGGACAAGCGCATTTGGCTGGAGAACAGTTC
TTAAAGGCTACACAAATCCAATTCATGAGCCTTGGTTTTTCTTCCCCAGAAAAGTATATATTCATACCACAGAAAGTAAAAACAATTCATACAGCTAAAGTTGCTTCATC
TGTATTGAATCTTCCATTTGAAGGCCCCCAGTGTAAGATGATGGAAGATCTTAGGGAACGCTACTTTGGTCCTTCATTTGAACTCTTGTCTCATAACAAATATGCAGAAA
TCTGGGCTCTTGATGAGGAAGATCCATTCAAGCGGCCTGAAGGTGGAGAAAGCGTTGCAGATGTTGCTTCAAGGTTTGCCAAAGCAATTCTTCAAATAGAATCGCTATTT
CAAGGGTGTGCGATCTTGGTGGTCAGCCATGGGGATCCCCTACAAATTTTTCAGACGGTGGTGGGATCAGTCGTCAAGCAAGAAGATGAATCGAGTTCTGATGATTTGTC
ATCAACATTACAAGCCGTCATTACCAAACCTATTCTCTCCAAGCACCGGAAATTCGCACTCCTCACCGGAGAGCTTCGATCTGTCATTTGA
mRNA sequence
ATGGCGATTCGTAGCCCAGTAAAATTCAGTCCCCTTCTTTTCCAGAGGGAATCGAAAACTTCGGTTTTGGAGGATTCGTTCTTTCAATTCGCCGAATCAGAGTTCTCCAA
ATGCGACGGGATTAAAAATACGGAAGAGAAGCAAAGAGGGGGAGTCCAGGGAATGGCAACGGCGTCGTTTTTGCGGAACAGATACTGGATTCTCAGGCACGGCAAGAGCA
TCCCAAATGAGAAGGGCCTCATTGTTTCCTCAATAGAAAATGGTATCCTTCCGGAGTATCAACTGGCCCCAGAGGGTGTTGGACAAGCGCATTTGGCTGGAGAACAGTTC
TTAAAGGCTACACAAATCCAATTCATGAGCCTTGGTTTTTCTTCCCCAGAAAAGTATATATTCATACCACAGAAAGTAAAAACAATTCATACAGCTAAAGTTGCTTCATC
TGTATTGAATCTTCCATTTGAAGGCCCCCAGTGTAAGATGATGGAAGATCTTAGGGAACGCTACTTTGGTCCTTCATTTGAACTCTTGTCTCATAACAAATATGCAGAAA
TCTGGGCTCTTGATGAGGAAGATCCATTCAAGCGGCCTGAAGGTGGAGAAAGCGTTGCAGATGTTGCTTCAAGGTTTGCCAAAGCAATTCTTCAAATAGAATCGCTATTT
CAAGGGTGTGCGATCTTGGTGGTCAGCCATGGGGATCCCCTACAAATTTTTCAGACGGTGGTGGGATCAGTCGTCAAGCAAGAAGATGAATCGAGTTCTGATGATTTGTC
ATCAACATTACAAGCCGTCATTACCAAACCTATTCTCTCCAAGCACCGGAAATTCGCACTCCTCACCGGAGAGCTTCGATCTGTCATTTGAATTCCGATAACATCTCGCC
AGACTTATTGAAGATTTCAATAAACGCCTAAAAAGCTTCTTCTTCTTCCCTTTTTCTTTTTGGTAGTTATATTACTTTTATTTTTAAGTTGGCTTTTAGGTTTAAGCCGT
GAATGTTGAATTATTGAGATAAGCTCCTACTCTCGACTTCGACTTCGACGTTAAAGGTTCAATTTCTCCACCTTCAAATTGTCAAAATATATAATATTAACAATAACAAT
ATTATTATTATTATAAATTTAA
Protein sequence
MAIRSPVKFSPLLFQRESKTSVLEDSFFQFAESEFSKCDGIKNTEEKQRGGVQGMATASFLRNRYWILRHGKSIPNEKGLIVSSIENGILPEYQLAPEGVGQAHLAGEQF
LKATQIQFMSLGFSSPEKYIFIPQKVKTIHTAKVASSVLNLPFEGPQCKMMEDLRERYFGPSFELLSHNKYAEIWALDEEDPFKRPEGGESVADVASRFAKAILQIESLF
QGCAILVVSHGDPLQIFQTVVGSVVKQEDESSSDDLSSTLQAVITKPILSKHRKFALLTGELRSVI
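
The CDS and protein sequences listed in this record can be cross-checked against each other by translating the CDS. The sketch below is a minimal, dependency-free Python illustration that assumes the standard nuclear genetic code (NCBI translation table 1). The function name translate and the variables cds_start and cds_end are hypothetical names chosen for this example. Only the first 30 and last 24 nucleotides of the CDS are used, to keep the example short.

```python
# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate an in-frame DNA string; stop at the first stop codon if to_stop."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        # Raises KeyError for codons containing anything other than A, C, G, T.
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 24 nt of the CDS listed in this record.
cds_start = "ATGGCGATTCGTAGCCCAGTAAAATTCAGT"
cds_end = "GGAGAGCTTCGATCTGTCATTTGA"

print(translate(cds_start))  # MAIRSPVKFS
print(translate(cds_end))    # GELRSVI
```

Both fragments match the corresponding ends of the listed protein sequence (MAIRSPVKFS... and ...GELRSVI), and the CDS ends in the stop codon TGA. Running the full CDS through the same function, after joining its lines, would give a complete check.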