; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0017626 (gene) of Chayote v1 genome

Gene IDSed0017626
OrganismSechium edule (Chayote v1)
DescriptionAnkyrin repeat-containing protein
Genome locationLG13:25902981..25903918
RNA-Seq ExpressionSed0017626
SyntenySed0017626
Gene Ontology termsNA
InterPro domainsIPR026961 - PGG domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004141217.1 uncharacterized protein LOC101204214 [Cucumis sativus]4.5e-3543.29Show/hide
Query:  KRHKQEHASLLSPSSKETRSWYWNILQKKLKYKGDWVEEVQGTMMLVATVIATVTFQGGHNPPGGLWQQDTQLNCSSIKN----MQLGFSRLKIYTHLCF
        +R ++E  SL +    + R+  ++ + KKL+Y+GDWV EVQ TMMLVATVIATVTFQGG NPPGG+WQQDT  N S   N        F  L +Y  L  
Subjt:  KRHKQEHASLLSPSSKETRSWYWNILQKKLKYKGDWVEEVQGTMMLVATVIATVTFQGGHNPPGGLWQQDTQLNCSSIKN----MQLGFSRLKIYTHLCF

Query:  ----NENSTVIFPAGTPIMRYQQSDTYWPYVVVNSLSLMA-----------------------SMTMCAAVVFLAIGYFLGTQMVNQVRVFSRDLSYGAF
            N N TV+FPAGT +M YQQ   YW Y+ VN++S +A                       S+TMC AVV LAIGY +G +M+N + +    + +  F
Subjt:  ----NENSTVIFPAGTPIMRYQQSDTYWPYVVVNSLSLMA-----------------------SMTMCAAVVFLAIGYFLGTQMVNQVRVFSRDLSYGAF

Query:  YFLFMS----WLGLVGMVALWHIIRFLVWLF
          +  S    WLG+VGMV LW +  FL  LF
Subjt:  YFLFMS----WLGLVGMVALWHIIRFLVWLF

XP_004141217.1 uncharacterized protein LOC101204214 [Cucumis sativus]7.7e-0348.15Show/hide
Query:  MEEKHKDHTSLFSSSMARSNEITIAMSSTEEDIRKLYDASKMGSIQILKSFIKE
        ME  H++ T L SSS A + ++ I+MS  EEDI KLY+ASK+G ++ LK+ I++
Subjt:  MEEKHKDHTSLFSSSMARSNEITIAMSSTEEDIRKLYDASKMGSIQILKSFIKE

XP_004141217.1 uncharacterized protein LOC101204214 [Cucumis sativus]2.2e-3443.46Show/hide
Query:  KETRSWYWNILQKKLKYKGDWVEEVQGTMMLVATVIATVTFQGGHNPPGGLWQQDTQLNCSSIKNMQ--LGFSRLKIYTHLCFNEN----STVIFPAGTP
        ++T+    N+L KKLKYKGDWV + Q T+MLVATVIAT+TFQGG NPPGG+WQQDT  N S+  +      F RL +Y +L  N+N    +T++FPAGT 
Subjt:  KETRSWYWNILQKKLKYKGDWVEEVQGTMMLVATVIATVTFQGGHNPPGGLWQQDTQLNCSSIKNMQ--LGFSRLKIYTHLCFNEN----STVIFPAGTP

Query:  IMRYQQSDTYWPYVVVNSLSLMAS-----------------------MTMCAAVVFLAIGYFLGTQMVNQVRVFSRDLSYGAFYFLFMS----WLGLVGM
        +M YQQ   YW Y+ VN++S +AS                       + MC AVV LAIGY +G +MVN + + +  + +  +  +  S    WL +VGM
Subjt:  IMRYQQSDTYWPYVVVNSLSLMAS-----------------------MTMCAAVVFLAIGYFLGTQMVNQVRVFSRDLSYGAFYFLFMS----WLGLVGM

Query:  VALWHIIRFLVWLF
        V LW +  FL  LF
Subjt:  VALWHIIRFLVWLF

XP_008447612.1 PREDICTED: uncharacterized protein LOC103490026 [Cucumis melo]9.0e-3643.15Show/hide
Query:  TRFSNSKRHKQEHASLLSPSSKETRSWYWNILQKKLKYKGDWVEEVQGTMMLVATVIATVTFQGGHNPPGGLWQQDTQLNCSSIKN-MQLGFSR----LK
        T+  N K  ++E  SL   S+K+     W + +KKLKY+GDWV+EVQGTMMLVATVIATVTFQGG NPPGG+WQQDT    SSI N  + GFS       
Subjt:  TRFSNSKRHKQEHASLLSPSSKETRSWYWNILQKKLKYKGDWVEEVQGTMMLVATVIATVTFQGGHNPPGGLWQQDTQLNCSSIKN-MQLGFSR----LK

Query:  IYTHLCFNENST-VIFPAGTPIMRYQQSDTYWPYVVVNSLSLMASMT-----------------------MCAAVVFLAIGYFLGTQMVNQVRVFSRD--
        +Y    +  N+T V+FPAGT +MR+QQ      Y+ VN++S +ASM+                       MC AVV LAIGY LG +MVN +    ++  
Subjt:  IYTHLCFNENST-VIFPAGTPIMRYQQSDTYWPYVVVNSLSLMASMT-----------------------MCAAVVFLAIGYFLGTQMVNQVRVFSRD--

Query:  LSYGAFYFLFMSWLGLVGMVALWHIIRFLVWLFKKLAYALS
         +   F    + W G+VG+V LW I   L+W+ K L ++ +
Subjt:  LSYGAFYFLFMSWLGLVGMVALWHIIRFLVWLFKKLAYALS

XP_011649355.1 uncharacterized protein LOC101212496 [Cucumis sativus]2.6e-3540.83Show/hide
Query:  RFSNSKRHKQEHASLLSPSSKETRSWYWNILQKKLKYKGDWVEEVQGTMMLVATVIATVTFQGGHNPPGGLWQQDTQLNCSS--------IKNMQLGFSR
        + + ++  K +   L+S  +K+     W + +KKLKYKGDWV+EVQGTMMLVATVIATVTFQGG NPPGG+WQQDT    SS        +    + F  
Subjt:  RFSNSKRHKQEHASLLSPSSKETRSWYWNILQKKLKYKGDWVEEVQGTMMLVATVIATVTFQGGHNPPGGLWQQDTQLNCSS--------IKNMQLGFSR

Query:  LKIYTHLCFNENSTVIFPAGTPIMRYQQSDTYWPYVVVNSLSLMASMT-----------------------MCAAVVFLAIGYFLGTQMVNQV-----RV
          ++++     N+TV+F AGT +M+ QQ + Y  Y+ VN++S +ASMT                       MC AV+ LAIGY LG +MV+ +     R+
Subjt:  LKIYTHLCFNENSTVIFPAGTPIMRYQQSDTYWPYVVVNSLSLMASMT-----------------------MCAAVVFLAIGYFLGTQMVNQV-----RV

Query:  FSRDLSYGAFYFLFMSWLGLVGMVALWHIIRFLVWLFKKL
        F+    Y  F    + WLG+VG+V L  I R L+W+ K L
Subjt:  FSRDLSYGAFYFLFMSWLGLVGMVALWHIIRFLVWLFKKL

XP_016903124.1 PREDICTED: uncharacterized protein LOC107992044 [Cucumis melo]2.1e-3243.62Show/hide
Query:  TRFSNSKRHKQEHASLLSPSSKETRSW-YWNILQKKLKYKGDWVEEVQGTMMLVATVIATVTFQGGHNPPGGLWQQDTQLNCSSIKNMQLGFSRLKIYTH
        T+  NSKR ++E  +L   + K T  W  W   +K LKYKG+W+EEVQ TMMLVATVIATVTFQ G N PGG+WQQDT  + +S +      + + +Y+ 
Subjt:  TRFSNSKRHKQEHASLLSPSSKETRSW-YWNILQKKLKYKGDWVEEVQGTMMLVATVIATVTFQGGHNPPGGLWQQDTQLNCSSIKNMQLGFSRLKIYTH

Query:  LCFNENSTVIFPAGTPIMRYQQSDTYWPYVVVNSLSLMASMTM-----------------------CAAVVFLAIGYFLGTQMVNQVRVFSRDLSY----
        L    N TV+ PAGT IM YQQ + YW Y+ +N++S +ASM++                       C AVV LAIGY LG +MVN +  FS  +      
Subjt:  LCFNENSTVIFPAGTPIMRYQQSDTYWPYVVVNSLSLMASMTM-----------------------CAAVVFLAIGYFLGTQMVNQVRVFSRDLSY----

Query:  GAFYFLFMSWLGLVGMVALWHIIRFLVWLFKKLAYALSHLKSL
         AF    M  LG+VGMV LW +  FL  LF       S LKSL
Subjt:  GAFYFLFMSWLGLVGMVALWHIIRFLVWLFKKLAYALSHLKSL

TrEMBL top hitse value%identityAlignment
A0A0A0LCQ0 ANK_REP_REGION domain-containing protein2.2e-3543.29Show/hide
Query:  KRHKQEHASLLSPSSKETRSWYWNILQKKLKYKGDWVEEVQGTMMLVATVIATVTFQGGHNPPGGLWQQDTQLNCSSIKN----MQLGFSRLKIYTHLCF
        +R ++E  SL +    + R+  ++ + KKL+Y+GDWV EVQ TMMLVATVIATVTFQGG NPPGG+WQQDT  N S   N        F  L +Y  L  
Subjt:  KRHKQEHASLLSPSSKETRSWYWNILQKKLKYKGDWVEEVQGTMMLVATVIATVTFQGGHNPPGGLWQQDTQLNCSSIKN----MQLGFSRLKIYTHLCF

Query:  ----NENSTVIFPAGTPIMRYQQSDTYWPYVVVNSLSLMA-----------------------SMTMCAAVVFLAIGYFLGTQMVNQVRVFSRDLSYGAF
            N N TV+FPAGT +M YQQ   YW Y+ VN++S +A                       S+TMC AVV LAIGY +G +M+N + +    + +  F
Subjt:  ----NENSTVIFPAGTPIMRYQQSDTYWPYVVVNSLSLMA-----------------------SMTMCAAVVFLAIGYFLGTQMVNQVRVFSRDLSYGAF

Query:  YFLFMS----WLGLVGMVALWHIIRFLVWLF
          +  S    WLG+VGMV LW +  FL  LF
Subjt:  YFLFMS----WLGLVGMVALWHIIRFLVWLF

A0A0A0LCQ0 ANK_REP_REGION domain-containing protein3.7e-0348.15Show/hide
Query:  MEEKHKDHTSLFSSSMARSNEITIAMSSTEEDIRKLYDASKMGSIQILKSFIKE
        ME  H++ T L SSS A + ++ I+MS  EEDI KLY+ASK+G ++ LK+ I++
Subjt:  MEEKHKDHTSLFSSSMARSNEITIAMSSTEEDIRKLYDASKMGSIQILKSFIKE

A0A0A0LCQ0 ANK_REP_REGION domain-containing protein1.1e-3443.46Show/hide
Query:  KETRSWYWNILQKKLKYKGDWVEEVQGTMMLVATVIATVTFQGGHNPPGGLWQQDTQLNCSSIKNMQ--LGFSRLKIYTHLCFNEN----STVIFPAGTP
        ++T+    N+L KKLKYKGDWV + Q T+MLVATVIAT+TFQGG NPPGG+WQQDT  N S+  +      F RL +Y +L  N+N    +T++FPAGT 
Subjt:  KETRSWYWNILQKKLKYKGDWVEEVQGTMMLVATVIATVTFQGGHNPPGGLWQQDTQLNCSSIKNMQ--LGFSRLKIYTHLCFNEN----STVIFPAGTP

Query:  IMRYQQSDTYWPYVVVNSLSLMAS-----------------------MTMCAAVVFLAIGYFLGTQMVNQVRVFSRDLSYGAFYFLFMS----WLGLVGM
        +M YQQ   YW Y+ VN++S +AS                       + MC AVV LAIGY +G +MVN + + +  + +  +  +  S    WL +VGM
Subjt:  IMRYQQSDTYWPYVVVNSLSLMAS-----------------------MTMCAAVVFLAIGYFLGTQMVNQVRVFSRDLSYGAFYFLFMS----WLGLVGM

Query:  VALWHIIRFLVWLF
        V LW +  FL  LF
Subjt:  VALWHIIRFLVWLF

A0A1S3BIS1 uncharacterized protein LOC1034900264.4e-3643.15Show/hide
Query:  TRFSNSKRHKQEHASLLSPSSKETRSWYWNILQKKLKYKGDWVEEVQGTMMLVATVIATVTFQGGHNPPGGLWQQDTQLNCSSIKN-MQLGFSR----LK
        T+  N K  ++E  SL   S+K+     W + +KKLKY+GDWV+EVQGTMMLVATVIATVTFQGG NPPGG+WQQDT    SSI N  + GFS       
Subjt:  TRFSNSKRHKQEHASLLSPSSKETRSWYWNILQKKLKYKGDWVEEVQGTMMLVATVIATVTFQGGHNPPGGLWQQDTQLNCSSIKN-MQLGFSR----LK

Query:  IYTHLCFNENST-VIFPAGTPIMRYQQSDTYWPYVVVNSLSLMASMT-----------------------MCAAVVFLAIGYFLGTQMVNQVRVFSRD--
        +Y    +  N+T V+FPAGT +MR+QQ      Y+ VN++S +ASM+                       MC AVV LAIGY LG +MVN +    ++  
Subjt:  IYTHLCFNENST-VIFPAGTPIMRYQQSDTYWPYVVVNSLSLMASMT-----------------------MCAAVVFLAIGYFLGTQMVNQVRVFSRD--

Query:  LSYGAFYFLFMSWLGLVGMVALWHIIRFLVWLFKKLAYALS
         +   F    + W G+VG+V LW I   L+W+ K L ++ +
Subjt:  LSYGAFYFLFMSWLGLVGMVALWHIIRFLVWLFKKLAYALS

A0A1S4E4H3 uncharacterized protein LOC1079920441.0e-3243.62Show/hide
Query:  TRFSNSKRHKQEHASLLSPSSKETRSW-YWNILQKKLKYKGDWVEEVQGTMMLVATVIATVTFQGGHNPPGGLWQQDTQLNCSSIKNMQLGFSRLKIYTH
        T+  NSKR ++E  +L   + K T  W  W   +K LKYKG+W+EEVQ TMMLVATVIATVTFQ G N PGG+WQQDT  + +S +      + + +Y+ 
Subjt:  TRFSNSKRHKQEHASLLSPSSKETRSW-YWNILQKKLKYKGDWVEEVQGTMMLVATVIATVTFQGGHNPPGGLWQQDTQLNCSSIKNMQLGFSRLKIYTH

Query:  LCFNENSTVIFPAGTPIMRYQQSDTYWPYVVVNSLSLMASMTM-----------------------CAAVVFLAIGYFLGTQMVNQVRVFSRDLSY----
        L    N TV+ PAGT IM YQQ + YW Y+ +N++S +ASM++                       C AVV LAIGY LG +MVN +  FS  +      
Subjt:  LCFNENSTVIFPAGTPIMRYQQSDTYWPYVVVNSLSLMASMTM-----------------------CAAVVFLAIGYFLGTQMVNQVRVFSRDLSY----

Query:  GAFYFLFMSWLGLVGMVALWHIIRFLVWLFKKLAYALSHLKSL
         AF    M  LG+VGMV LW +  FL  LF       S LKSL
Subjt:  GAFYFLFMSWLGLVGMVALWHIIRFLVWLFKKLAYALSHLKSL

A0A5D3BCC9 Ankyrin repeat-containing protein1.4e-3145.58Show/hide
Query:  WNILQKK-LKYKGDWVEEVQGTMMLVATVIATVTFQGGHNPPGGLWQQDTQLNCSSIKNMQLGFSRLKIYTHLCFNENSTVIFPAGTPIMRYQQSDTYWP
        W I +KK LKYKG+W+EEVQ TMMLVATVIATVTFQ G N PGG+WQQDT  + +S +      + + +Y+ L    N TV+ PAGT IM YQQ + YW 
Subjt:  WNILQKK-LKYKGDWVEEVQGTMMLVATVIATVTFQGGHNPPGGLWQQDTQLNCSSIKNMQLGFSRLKIYTHLCFNENSTVIFPAGTPIMRYQQSDTYWP

Query:  YVVVNSLSLMASMTM-----------------------CAAVVFLAIGYFLGTQMVNQVRVFSRDLSY----GAFYFLFMSWLGLVGMVALWHIIRFLVW
        Y+ +N++S +ASM++                       C AVV LAIGY LG +MVN +  FS  +       AF    M  LG+VGMV LW +  FL  
Subjt:  YVVVNSLSLMASMTM-----------------------CAAVVFLAIGYFLGTQMVNQVRVFSRDLSY----GAFYFLFMSWLGLVGMVALWHIIRFLVW

Query:  LFKKLAYALSHLKSL
        LF       S LKSL
Subjt:  LFKKLAYALSHLKSL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G13950.1 unknown protein1.0e-0828.93Show/hide
Query:  KKLKYKGDWVEEVQGTMMLVATVIATVTFQGGHNPPGGLWQQDTQLNCSSIKNMQLGFSRLKIYTHLCFNENSTVIF---PAGTPIMRYQQSD--TYWPY
        K LK +GDW+E+ +G +M+ ATVIA ++FQ   NPPGG+WQ D   NCS                    N+  T  F    AGT ++ Y+ S    Y   
Subjt:  KKLKYKGDWVEEVQGTMMLVATVIATVTFQGGHNPPGGLWQQDTQLNCSSIKNMQLGFSRLKIYTHLCFNENSTVIF---PAGTPIMRYQQSD--TYWPY

Query:  VVVNSLSLMASMTMCAAVVF------LAIGYFLGTQMVNQVRVFSRDLSYGAFYFLFM------------------SWLGLVGMVALWHIIRFLVWL
        ++ +++S   SM++   V+         I   LGT MV  V   S      AF+F  +                   W+    ++ L  ++RF+ WL
Subjt:  VVVNSLSLMASMTMCAAVVF------LAIGYFLGTQMVNQVRVFSRDLSYGAFYFLFM------------------SWLGLVGMVALWHIIRFLVWL

AT4G13266.1 unknown protein3.5e-0624.55Show/hide
Query:  NSKRHKQEHASLLSPSSKETRSWYWNILQKKLKYKGDWVEEVQGTMMLVATVIATVTFQGGHNPPGGLWQQDTQLNCSSIKNMQLGFSRLKIYTHLCFNE
        N++  ++EH        +++ +W+     + L ++GDW+E+ +G +++ ATVIA ++F    NPPGG+WQ +   +CSS         +    T  C  +
Subjt:  NSKRHKQEHASLLSPSSKETRSWYWNILQKKLKYKGDWVEEVQGTMMLVATVIATVTFQGGHNPPGGLWQQDTQLNCSSIKNMQLGFSRLKIYTHLCFNE

Query:  NSTVIFPAGTPIMRYQQSDT--YWPYVVVNSLSLMASMTMCAAVVFLA-IGYFLGTQMVNQVRVFSRDLSY----GAFYF-----------------LFM
                GT I+ +  S    Y   V+ N +S  ASM     ++FL  IG+    +++  + V    ++      AF+F                 +++
Subjt:  NSTVIFPAGTPIMRYQQSDT--YWPYVVVNSLSLMASMTMCAAVVFLA-IGYFLGTQMVNQVRVFSRDLSY----GAFYF-----------------LFM

Query:  S-WLGLVGMVALWHIIRFLVWLFK
          W+ L  +V L  ++RFL W+ +
Subjt:  S-WLGLVGMVALWHIIRFLVWLFK

AT5G54700.1 Ankyrin repeat family protein4.3e-0427.2Show/hide
Query:  VEEVQGTMMLVATVIATVTFQGGHNPPGGLWQQDTQLNCSSIKNMQLGFSRLKIYTHLCFNENSTVIFPAGTPIMRYQQSDTYWPYVVVN---------S
        ++  + T+ +VA +IA+VTF  G NPPGG++Q+ T     S+    + F   KI+    +  NS  +F +   ++       + P  + N         S
Subjt:  VEEVQGTMMLVATVIATVTFQGGHNPPGGLWQQDTQLNCSSIKNMQLGFSRLKIYTHLCFNENSTVIFPAGTPIMRYQQSDTYWPYVVVN---------S

Query:  LSLMASMTMCAAVVFLAIGYFLGTQ
        +S+ A  T   AV ++ + +F GT+
Subjt:  LSLMASMTMCAAVVFLAIGYFLGTQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAGAAACACAAAGATCATACATCACTTTTCAGTTCTTCAATGGCAAGAAGCAATGAAATTACTATAGCCATGTCGTCGACCGAAGAAGATATAAGAAAGCTCTA
TGATGCATCAAAGATGGGATCTATACAAATTTTAAAAAGCTTTATAAAAGAAAGTTCTTTGAAGACCATATTGAAAAAGTGGGAAAGGAATTGTTTGTTAAAAACCGGAA
CAAGGTTTTCTAATTCCAAAAGACACAAACAAGAACATGCATCGTTGCTGTCGCCGTCCTCCAAAGAGACAAGATCATGGTATTGGAACATTCTACAAAAGAAACTGAAA
TATAAAGGAGATTGGGTGGAAGAAGTGCAAGGGACAATGATGTTAGTGGCTACGGTTATCGCAACCGTGACTTTCCAAGGTGGACACAACCCTCCAGGCGGTCTTTGGCA
ACAAGACACTCAACTAAATTGTTCAAGTATTAAGAATATGCAATTAGGATTTTCGAGGTTAAAGATATATACCCATTTATGCTTCAATGAAAATTCGACGGTCATTTTCC
CAGCTGGAACTCCAATAATGAGATACCAACAATCCGATACGTATTGGCCTTACGTGGTGGTTAACTCGTTGTCGTTGATGGCATCAATGACCATGTGTGCGGCTGTGGTG
TTCTTAGCAATTGGGTATTTCCTTGGAACTCAAATGGTTAACCAAGTGAGAGTATTTTCTCGAGATTTGAGCTATGGGGCATTTTATTTTTTATTTATGTCTTGGCTCGG
GTTGGTCGGAATGGTTGCTTTGTGGCACATAATTCGATTTCTTGTTTGGCTGTTCAAAAAGCTAGCTTACGCGCTTTCACATCTCAAAAGCCTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAGAAACACAAAGATCATACATCACTTTTCAGTTCTTCAATGGCAAGAAGCAATGAAATTACTATAGCCATGTCGTCGACCGAAGAAGATATAAGAAAGCTCTA
TGATGCATCAAAGATGGGATCTATACAAATTTTAAAAAGCTTTATAAAAGAAAGTTCTTTGAAGACCATATTGAAAAAGTGGGAAAGGAATTGTTTGTTAAAAACCGGAA
CAAGGTTTTCTAATTCCAAAAGACACAAACAAGAACATGCATCGTTGCTGTCGCCGTCCTCCAAAGAGACAAGATCATGGTATTGGAACATTCTACAAAAGAAACTGAAA
TATAAAGGAGATTGGGTGGAAGAAGTGCAAGGGACAATGATGTTAGTGGCTACGGTTATCGCAACCGTGACTTTCCAAGGTGGACACAACCCTCCAGGCGGTCTTTGGCA
ACAAGACACTCAACTAAATTGTTCAAGTATTAAGAATATGCAATTAGGATTTTCGAGGTTAAAGATATATACCCATTTATGCTTCAATGAAAATTCGACGGTCATTTTCC
CAGCTGGAACTCCAATAATGAGATACCAACAATCCGATACGTATTGGCCTTACGTGGTGGTTAACTCGTTGTCGTTGATGGCATCAATGACCATGTGTGCGGCTGTGGTG
TTCTTAGCAATTGGGTATTTCCTTGGAACTCAAATGGTTAACCAAGTGAGAGTATTTTCTCGAGATTTGAGCTATGGGGCATTTTATTTTTTATTTATGTCTTGGCTCGG
GTTGGTCGGAATGGTTGCTTTGTGGCACATAATTCGATTTCTTGTTTGGCTGTTCAAAAAGCTAGCTTACGCGCTTTCACATCTCAAAAGCCTTTAG
Protein sequenceShow/hide protein sequence
MEEKHKDHTSLFSSSMARSNEITIAMSSTEEDIRKLYDASKMGSIQILKSFIKESSLKTILKKWERNCLLKTGTRFSNSKRHKQEHASLLSPSSKETRSWYWNILQKKLK
YKGDWVEEVQGTMMLVATVIATVTFQGGHNPPGGLWQQDTQLNCSSIKNMQLGFSRLKIYTHLCFNENSTVIFPAGTPIMRYQQSDTYWPYVVVNSLSLMASMTMCAAVV
FLAIGYFLGTQMVNQVRVFSRDLSYGAFYFLFMSWLGLVGMVALWHIIRFLVWLFKKLAYALSHLKSL