; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0012425 (gene) of Chayote v1 genome

Gene IDSed0012425
OrganismSechium edule (Chayote v1)
DescriptionUnknown protein
Genome locationLG09:37875126..37876062
RNA-Seq ExpressionSed0012425
SyntenySed0012425
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575681.1 Zinc finger A20 and AN1 domain-containing stress-associated protein 8, partial [Cucurbita argyrosperma subsp. sororia]4.0e-5856.43Show/hide
Query:  MAAIVTRRLSSKFLKPFPSSTFF-----KDPFPQISPPNSNPPFLQS----LTNPNFPLFHSPTPFP-----------RSTN------------------
        MAAIVTRRLSSKFL+P PSSTF      ++ F +I   +S+P F QS       P   L  SPT  P           RS N                  
Subjt:  MAAIVTRRLSSKFLKPFPSSTFF-----KDPFPQISPPNSNPPFLQS----LTNPNFPLFHSPTPFP-----------RSTN------------------

Query:  ---SSLNLIQRSRNPSFLTPDFNQKPGFSSNHGGE-------FKHQEIEGPTVERDLSALAGETREVIDAMMKNVYRLSTALAVLGLVQLGIGAWISYS-
           SS  LI R RNPSF   D +QK GFSS    E       FKHQ+IEGPTVERDLSALAGETREV++AMMKNVY LS A+A+LGLVQLGIGAWISY+ 
Subjt:  ---SSLNLIQRSRNPSFLTPDFNQKPGFSSNHGGE-------FKHQEIEGPTVERDLSALAGETREVIDAMMKNVYRLSTALAVLGLVQLGIGAWISYS-

Query:  -VSDAAAVSIQSVAAFGLPFSVAFTLRQCLKPMVFFRKMEKEGRLQILTLTLQIAKNLNVLFVRFRIVCFLCVAGLSVGG
          S    VSIQS  +FG PFS+AF LRQ LKPM+FF+KME++GRLQILTLTLQIAKNLN LFVR RIV FLCV G    G
Subjt:  -VSDAAAVSIQSVAAFGLPFSVAFTLRQCLKPMVFFRKMEKEGRLQILTLTLQIAKNLNVLFVRFRIVCFLCVAGLSVGG

XP_004135922.1 uncharacterized protein LOC101204591 [Cucumis sativus]6.9e-5857.41Show/hide
Query:  MAAIVTRRLSSKFLKPFPSSTFF-----KDPFPQISPPNSNPPFLQS--LTNPN--FPLFHSPTP--------------FPRSTNSSLNLIQRSRNPSFL
        MAAIVTRRLSS   +PF  STF       +P  +   P+S+P FL S   T+PN    LF+S +               F     S    I + RNPSF+
Subjt:  MAAIVTRRLSSKFLKPFPSSTFF-----KDPFPQISPPNSNPPFLQS--LTNPN--FPLFHSPTP--------------FPRSTNSSLNLIQRSRNPSFL

Query:  T-PDFNQKPGFSSNHGGE-------FKHQEIEGPTVERDLSALAGETREVIDAMMKNVYRLSTALAVLGLVQLGIGAWISYSV--SDAAAVSIQSVAAFG
        +  DF++K  FS+    E       FKHQ+IEGPTVERDLSALA ETR+VI+AMMKNVYRLS A+AVLGLVQLGIGAWISY    S    VSIQS  AFG
Subjt:  T-PDFNQKPGFSSNHGGE-------FKHQEIEGPTVERDLSALAGETREVIDAMMKNVYRLSTALAVLGLVQLGIGAWISYSV--SDAAAVSIQSVAAFG

Query:  LPFSVAFTLRQCLKPMVFFRKMEKEGRLQILTLTLQIAKNLNVLFVRFRIVCFLCVAGLSVGGLFALISR
         PFS+AF LRQ LKPM+FF+KME++GRLQILTL+LQI KNLN LFVR R V FLCV GLSVG LFAL+SR
Subjt:  LPFSVAFTLRQCLKPMVFFRKMEKEGRLQILTLTLQIAKNLNVLFVRFRIVCFLCVAGLSVGGLFALISR

XP_022954073.1 uncharacterized protein LOC111456447 [Cucurbita moschata]1.5e-6559.43Show/hide
Query:  MAAIVTRRLSSKFLKPFPSSTFF-----KDPFPQISPPNSNPPFLQS----LTNPNFPLFHSPTPFP---------RSTN-----------------SSL
        MAAIVTRRLSSKFL+P PSSTF      ++ F +I   +S+P F QS       P   L +S TPF          RS N                 SS 
Subjt:  MAAIVTRRLSSKFLKPFPSSTFF-----KDPFPQISPPNSNPPFLQS----LTNPNFPLFHSPTPFP---------RSTN-----------------SSL

Query:  NLIQRSRNPSFLTPDFNQKPGFSSNHGGE-------FKHQEIEGPTVERDLSALAGETREVIDAMMKNVYRLSTALAVLGLVQLGIGAWISYS--VSDAA
         LI R RNPSF   D +QK GFSS    E       FKHQ+IEGPTVERDLSALAGETREV++AMMKNVY LS A+A+LGLVQLGIGAWISY+   S   
Subjt:  NLIQRSRNPSFLTPDFNQKPGFSSNHGGE-------FKHQEIEGPTVERDLSALAGETREVIDAMMKNVYRLSTALAVLGLVQLGIGAWISYS--VSDAA

Query:  AVSIQSVAAFGLPFSVAFTLRQCLKPMVFFRKMEKEGRLQILTLTLQIAKNLNVLFVRFRIVCFLCVAGLSVGGLFALISR
         VSIQS  +FG PFS+AF LRQ LKPM+FF+KME++GRLQILTLTLQIAKNLN LFVR RIV FLCV GLSVG LFAL+SR
Subjt:  AVSIQSVAAFGLPFSVAFTLRQCLKPMVFFRKMEKEGRLQILTLTLQIAKNLNVLFVRFRIVCFLCVAGLSVGGLFALISR

XP_022991920.1 uncharacterized protein LOC111488415 [Cucurbita maxima]4.9e-6458.04Show/hide
Query:  MAAIVTRRLSSKFLKPFPSSTFF-----KDPFPQISPPNSNPPFLQS----LTNPNFPLFHSPTPFP---------RSTNSSLN----------------
        MAAIVTRRLSSK+L+PFPSST        + F +I   +S+P F QS       P   L +S TPF          RS N +LN                
Subjt:  MAAIVTRRLSSKFLKPFPSSTFF-----KDPFPQISPPNSNPPFLQS----LTNPNFPLFHSPTPFP---------RSTNSSLN----------------

Query:  ------LIQRSRNPSFLTPDFNQKPGFSSNHGGE-------FKHQEIEGPTVERDLSALAGETREVIDAMMKNVYRLSTALAVLGLVQLGIGAWISYS--
              LI R RNPSF   D +QK GFSS    E       FKHQ+IEGPTVERDLSALAGETREV++AMMKNVY LS A+A+LGLVQLGIGAWISY+  
Subjt:  ------LIQRSRNPSFLTPDFNQKPGFSSNHGGE-------FKHQEIEGPTVERDLSALAGETREVIDAMMKNVYRLSTALAVLGLVQLGIGAWISYS--

Query:  VSDAAAVSIQSVAAFGLPFSVAFTLRQCLKPMVFFRKMEKEGRLQILTLTLQIAKNLNVLFVRFRIVCFLCVAGLSVGGLFALISR
         S    VSIQS  +FG PFS+AF LRQ LKPM+FF+KME++GRLQILTLTLQIAKNLN LFVR RIV FLCV GLSVG LFAL+SR
Subjt:  VSDAAAVSIQSVAAFGLPFSVAFTLRQCLKPMVFFRKMEKEGRLQILTLTLQIAKNLNVLFVRFRIVCFLCVAGLSVGGLFALISR

XP_023549384.1 uncharacterized protein LOC111807746 [Cucurbita pepo subsp. pepo]9.9e-6558.6Show/hide
Query:  MAAIVTRRLSSKFLKPFPSSTFF-----KDPFPQISPPNSNPPFLQS----LTNPNFPLFHSPTPFP---------RSTN--------------------
        MAAIVTRRLSSKFL+P PSSTF       + F +I   +S+P F QS       P   L +S TPF          RS N                    
Subjt:  MAAIVTRRLSSKFLKPFPSSTFF-----KDPFPQISPPNSNPPFLQS----LTNPNFPLFHSPTPFP---------RSTN--------------------

Query:  -SSLNLIQRSRNPSFLTPDFNQKPGFSSNHGGE-------FKHQEIEGPTVERDLSALAGETREVIDAMMKNVYRLSTALAVLGLVQLGIGAWISYS--V
         SS  LI R RNPSF   D +QK GFSS    E       FKHQ+IEGPTVERDLSALAGETREV++AMMKNVY LS A+A+LGLVQLGIGAWISY+   
Subjt:  -SSLNLIQRSRNPSFLTPDFNQKPGFSSNHGGE-------FKHQEIEGPTVERDLSALAGETREVIDAMMKNVYRLSTALAVLGLVQLGIGAWISYS--V

Query:  SDAAAVSIQSVAAFGLPFSVAFTLRQCLKPMVFFRKMEKEGRLQILTLTLQIAKNLNVLFVRFRIVCFLCVAGLSVGGLFALISR
        S    VSIQS  +FG PFS+AF LRQ LKPM+FF+KME++GRLQILTLTLQIAKNLN LFVR RIV FLCV GLSVG LFAL+SR
Subjt:  SDAAAVSIQSVAAFGLPFSVAFTLRQCLKPMVFFRKMEKEGRLQILTLTLQIAKNLNVLFVRFRIVCFLCVAGLSVGGLFALISR

TrEMBL top hitse value%identityAlignment
A0A0A0K8D3 Uncharacterized protein3.3e-5857.41Show/hide
Query:  MAAIVTRRLSSKFLKPFPSSTFF-----KDPFPQISPPNSNPPFLQS--LTNPN--FPLFHSPTP--------------FPRSTNSSLNLIQRSRNPSFL
        MAAIVTRRLSS   +PF  STF       +P  +   P+S+P FL S   T+PN    LF+S +               F     S    I + RNPSF+
Subjt:  MAAIVTRRLSSKFLKPFPSSTFF-----KDPFPQISPPNSNPPFLQS--LTNPN--FPLFHSPTP--------------FPRSTNSSLNLIQRSRNPSFL

Query:  T-PDFNQKPGFSSNHGGE-------FKHQEIEGPTVERDLSALAGETREVIDAMMKNVYRLSTALAVLGLVQLGIGAWISYSV--SDAAAVSIQSVAAFG
        +  DF++K  FS+    E       FKHQ+IEGPTVERDLSALA ETR+VI+AMMKNVYRLS A+AVLGLVQLGIGAWISY    S    VSIQS  AFG
Subjt:  T-PDFNQKPGFSSNHGGE-------FKHQEIEGPTVERDLSALAGETREVIDAMMKNVYRLSTALAVLGLVQLGIGAWISYSV--SDAAAVSIQSVAAFG

Query:  LPFSVAFTLRQCLKPMVFFRKMEKEGRLQILTLTLQIAKNLNVLFVRFRIVCFLCVAGLSVGGLFALISR
         PFS+AF LRQ LKPM+FF+KME++GRLQILTL+LQI KNLN LFVR R V FLCV GLSVG LFAL+SR
Subjt:  LPFSVAFTLRQCLKPMVFFRKMEKEGRLQILTLTLQIAKNLNVLFVRFRIVCFLCVAGLSVGGLFALISR

A0A1S3CFJ7 uncharacterized protein LOC1034999119.7e-5856.83Show/hide
Query:  MAAIVTRRLSSKFLKPFPSSTFF-----KDPFPQISPPNSNPPFLQS--LTNPN---FPLFH----SPTPFPRSTNSSLNL----------IQRSRNPSF
        MAAIVTRRLSS   +PF  STF        PF +I  P+S+P FL+S   T+PN     LF+    S TP  ++ N S             I +  NP+F
Subjt:  MAAIVTRRLSSKFLKPFPSSTFF-----KDPFPQISPPNSNPPFLQS--LTNPN---FPLFH----SPTPFPRSTNSSLNL----------IQRSRNPSF

Query:  L-TPDFNQKPGFSSNHGGE-------FKHQEIEGPTVERDLSALAGETREVIDAMMKNVYRLSTALAVLGLVQLGIGAWISYSV--SDAAAVSIQSVAAF
        +   DF++K  FS+    E       FKHQ+IEGPTVERDLSALA ETR+V++AMMKNVYRLS A+AVLGLVQLG+GAWISY    S    VSIQS  AF
Subjt:  L-TPDFNQKPGFSSNHGGE-------FKHQEIEGPTVERDLSALAGETREVIDAMMKNVYRLSTALAVLGLVQLGIGAWISYSV--SDAAAVSIQSVAAF

Query:  GLPFSVAFTLRQCLKPMVFFRKMEKEGRLQILTLTLQIAKNLNVLFVRFRIVCFLCVAGLSVGGLFALISR
        G PFS+AF LRQ LKPM+FF+KME++GRLQILTL+LQI KNLN LFVR R V  LCV GLSVG LFAL+SR
Subjt:  GLPFSVAFTLRQCLKPMVFFRKMEKEGRLQILTLTLQIAKNLNVLFVRFRIVCFLCVAGLSVGGLFALISR

A0A5A7UUD1 Uncharacterized protein9.7e-5856.83Show/hide
Query:  MAAIVTRRLSSKFLKPFPSSTFF-----KDPFPQISPPNSNPPFLQS--LTNPN---FPLFH----SPTPFPRSTNSSLNL----------IQRSRNPSF
        MAAIVTRRLSS   +PF  STF        PF +I  P+S+P FL+S   T+PN     LF+    S TP  ++ N S             I +  NP+F
Subjt:  MAAIVTRRLSSKFLKPFPSSTFF-----KDPFPQISPPNSNPPFLQS--LTNPN---FPLFH----SPTPFPRSTNSSLNL----------IQRSRNPSF

Query:  L-TPDFNQKPGFSSNHGGE-------FKHQEIEGPTVERDLSALAGETREVIDAMMKNVYRLSTALAVLGLVQLGIGAWISYSV--SDAAAVSIQSVAAF
        +   DF++K  FS+    E       FKHQ+IEGPTVERDLSALA ETR+V++AMMKNVYRLS A+AVLGLVQLG+GAWISY    S    VSIQS  AF
Subjt:  L-TPDFNQKPGFSSNHGGE-------FKHQEIEGPTVERDLSALAGETREVIDAMMKNVYRLSTALAVLGLVQLGIGAWISYSV--SDAAAVSIQSVAAF

Query:  GLPFSVAFTLRQCLKPMVFFRKMEKEGRLQILTLTLQIAKNLNVLFVRFRIVCFLCVAGLSVGGLFALISR
        G PFS+AF LRQ LKPM+FF+KME++GRLQILTL+LQI KNLN LFVR R V  LCV GLSVG LFAL+SR
Subjt:  GLPFSVAFTLRQCLKPMVFFRKMEKEGRLQILTLTLQIAKNLNVLFVRFRIVCFLCVAGLSVGGLFALISR

A0A6J1GQ34 uncharacterized protein LOC1114564477.4e-6659.43Show/hide
Query:  MAAIVTRRLSSKFLKPFPSSTFF-----KDPFPQISPPNSNPPFLQS----LTNPNFPLFHSPTPFP---------RSTN-----------------SSL
        MAAIVTRRLSSKFL+P PSSTF      ++ F +I   +S+P F QS       P   L +S TPF          RS N                 SS 
Subjt:  MAAIVTRRLSSKFLKPFPSSTFF-----KDPFPQISPPNSNPPFLQS----LTNPNFPLFHSPTPFP---------RSTN-----------------SSL

Query:  NLIQRSRNPSFLTPDFNQKPGFSSNHGGE-------FKHQEIEGPTVERDLSALAGETREVIDAMMKNVYRLSTALAVLGLVQLGIGAWISYS--VSDAA
         LI R RNPSF   D +QK GFSS    E       FKHQ+IEGPTVERDLSALAGETREV++AMMKNVY LS A+A+LGLVQLGIGAWISY+   S   
Subjt:  NLIQRSRNPSFLTPDFNQKPGFSSNHGGE-------FKHQEIEGPTVERDLSALAGETREVIDAMMKNVYRLSTALAVLGLVQLGIGAWISYS--VSDAA

Query:  AVSIQSVAAFGLPFSVAFTLRQCLKPMVFFRKMEKEGRLQILTLTLQIAKNLNVLFVRFRIVCFLCVAGLSVGGLFALISR
         VSIQS  +FG PFS+AF LRQ LKPM+FF+KME++GRLQILTLTLQIAKNLN LFVR RIV FLCV GLSVG LFAL+SR
Subjt:  AVSIQSVAAFGLPFSVAFTLRQCLKPMVFFRKMEKEGRLQILTLTLQIAKNLNVLFVRFRIVCFLCVAGLSVGGLFALISR

A0A6J1JU98 uncharacterized protein LOC1114884152.4e-6458.04Show/hide
Query:  MAAIVTRRLSSKFLKPFPSSTFF-----KDPFPQISPPNSNPPFLQS----LTNPNFPLFHSPTPFP---------RSTNSSLN----------------
        MAAIVTRRLSSK+L+PFPSST        + F +I   +S+P F QS       P   L +S TPF          RS N +LN                
Subjt:  MAAIVTRRLSSKFLKPFPSSTFF-----KDPFPQISPPNSNPPFLQS----LTNPNFPLFHSPTPFP---------RSTNSSLN----------------

Query:  ------LIQRSRNPSFLTPDFNQKPGFSSNHGGE-------FKHQEIEGPTVERDLSALAGETREVIDAMMKNVYRLSTALAVLGLVQLGIGAWISYS--
              LI R RNPSF   D +QK GFSS    E       FKHQ+IEGPTVERDLSALAGETREV++AMMKNVY LS A+A+LGLVQLGIGAWISY+  
Subjt:  ------LIQRSRNPSFLTPDFNQKPGFSSNHGGE-------FKHQEIEGPTVERDLSALAGETREVIDAMMKNVYRLSTALAVLGLVQLGIGAWISYS--

Query:  VSDAAAVSIQSVAAFGLPFSVAFTLRQCLKPMVFFRKMEKEGRLQILTLTLQIAKNLNVLFVRFRIVCFLCVAGLSVGGLFALISR
         S    VSIQS  +FG PFS+AF LRQ LKPM+FF+KME++GRLQILTLTLQIAKNLN LFVR RIV FLCV GLSVG LFAL+SR
Subjt:  VSDAAAVSIQSVAAFGLPFSVAFTLRQCLKPMVFFRKMEKEGRLQILTLTLQIAKNLNVLFVRFRIVCFLCVAGLSVGGLFALISR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G12650.1 unknown protein2.9e-3846.25Show/hide
Query:  IVTRRLSSKFLKPFPSSTF-FKDPFPQISPPNSNPPFLQSLTNPNFPLFHSPTPFPRSTNSSLNLIQRSRNPSFLTPDFNQKPGFSSNHGGE----FKHQ
        +++RRL SKFLKP  S +F       Q S        +         L  S + F      S     R     F TP    +P        E     KHQ
Subjt:  IVTRRLSSKFLKPFPSSTF-FKDPFPQISPPNSNPPFLQSLTNPNFPLFHSPTPFPRSTNSSLNLIQRSRNPSFLTPDFNQKPGFSSNHGGE----FKHQ

Query:  EIEGPTVERDLSALAGETREVIDAMMKNVYRLSTALAVLGLVQLGIGAWISYS--VSDAAAVSIQSVAAFGLPFSVAFTLRQCLKPMVFFRKMEKEGRLQ
        EIEGPTVERDLSAL  ETR+V++ MMKN+Y LS A+  LGL QL +GA I Y+        ++IQS  AFG PF++A  +R+ LKPM FF+KME+ GRLQ
Subjt:  EIEGPTVERDLSALAGETREVIDAMMKNVYRLSTALAVLGLVQLGIGAWISYS--VSDAAAVSIQSVAAFGLPFSVAFTLRQCLKPMVFFRKMEKEGRLQ

Query:  ILTLTLQIAKNLNVLFVRFRIVCFLCVAGLSVGGLFALIS
        ILTLTLQ+AKNLN+LFVR R+V  LCV  L  G LF L+S
Subjt:  ILTLTLQIAKNLNVLFVRFRIVCFLCVAGLSVGGLFALIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCCATTGTTACGCGCAGGTTAAGCTCCAAATTTCTCAAACCTTTTCCTTCTTCCACCTTCTTCAAAGATCCATTTCCCCAAATCTCACCCCCAAATTCCAATCC
CCCATTTCTCCAATCCCTCACAAACCCTAATTTCCCCCTCTTCCATTCCCCCACACCCTTCCCACGATCCACGAATTCGAGTCTCAATCTGATTCAAAGATCGCGAAACC
CTAGCTTCTTGACCCCGGATTTCAATCAGAAGCCTGGGTTTTCTTCGAATCATGGCGGCGAGTTCAAGCACCAAGAGATCGAAGGGCCGACGGTCGAGAGAGATCTCTCG
GCGTTGGCCGGCGAAACCAGGGAGGTGATTGACGCGATGATGAAGAATGTGTATAGATTAAGCACAGCGCTGGCGGTTCTTGGCCTGGTTCAGCTTGGAATTGGGGCTTG
GATTTCGTATTCGGTTTCGGATGCGGCCGCGGTTTCGATCCAGAGCGTCGCGGCTTTCGGGCTGCCGTTCTCGGTCGCTTTTACTCTGCGGCAGTGCTTGAAGCCGATGG
TGTTCTTCAGGAAAATGGAGAAGGAAGGTCGGTTGCAGATTCTTACTCTTACTCTTCAGATTGCTAAGAATTTGAATGTTTTGTTTGTTAGGTTTCGGATTGTTTGTTTC
TTGTGTGTTGCTGGATTGTCTGTTGGTGGTTTGTTTGCTTTGATTTCTAGATGA
mRNA sequenceShow/hide mRNA sequence
CTTGCCGAAGACCTTAAAAATCGATCAAAACCAAACCAGTTCAGAAATGGCCGCCATTGTTACGCGCAGGTTAAGCTCCAAATTTCTCAAACCTTTTCCTTCTTCCACCT
TCTTCAAAGATCCATTTCCCCAAATCTCACCCCCAAATTCCAATCCCCCATTTCTCCAATCCCTCACAAACCCTAATTTCCCCCTCTTCCATTCCCCCACACCCTTCCCA
CGATCCACGAATTCGAGTCTCAATCTGATTCAAAGATCGCGAAACCCTAGCTTCTTGACCCCGGATTTCAATCAGAAGCCTGGGTTTTCTTCGAATCATGGCGGCGAGTT
CAAGCACCAAGAGATCGAAGGGCCGACGGTCGAGAGAGATCTCTCGGCGTTGGCCGGCGAAACCAGGGAGGTGATTGACGCGATGATGAAGAATGTGTATAGATTAAGCA
CAGCGCTGGCGGTTCTTGGCCTGGTTCAGCTTGGAATTGGGGCTTGGATTTCGTATTCGGTTTCGGATGCGGCCGCGGTTTCGATCCAGAGCGTCGCGGCTTTCGGGCTG
CCGTTCTCGGTCGCTTTTACTCTGCGGCAGTGCTTGAAGCCGATGGTGTTCTTCAGGAAAATGGAGAAGGAAGGTCGGTTGCAGATTCTTACTCTTACTCTTCAGATTGC
TAAGAATTTGAATGTTTTGTTTGTTAGGTTTCGGATTGTTTGTTTCTTGTGTGTTGCTGGATTGTCTGTTGGTGGTTTGTTTGCTTTGATTTCTAGATGAATTCATTGTT
GTGGAGTTTTTTGGATTTGGGATTTGCTTGGAATAGATTTCTGATTTTGTTTTGTTGTTTAGTTTTTCTGTTCTTCTCTGCTTCATTGATATTGTTTAATTACTAGTGGG
TGTGATAAAAGGCATTTGGTGAGTGGGTATGATTCTCCCTTTTGAGTGCTAGAGGTC
Protein sequenceShow/hide protein sequence
MAAIVTRRLSSKFLKPFPSSTFFKDPFPQISPPNSNPPFLQSLTNPNFPLFHSPTPFPRSTNSSLNLIQRSRNPSFLTPDFNQKPGFSSNHGGEFKHQEIEGPTVERDLS
ALAGETREVIDAMMKNVYRLSTALAVLGLVQLGIGAWISYSVSDAAAVSIQSVAAFGLPFSVAFTLRQCLKPMVFFRKMEKEGRLQILTLTLQIAKNLNVLFVRFRIVCF
LCVAGLSVGGLFALISR