; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0011717 (gene) of Chayote v1 genome

Gene IDSed0011717
OrganismSechium edule (Chayote v1)
DescriptionTransmembrane protein
Genome locationLG08:39948717..39949758
RNA-Seq ExpressionSed0011717
SyntenySed0011717
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8650098.1 hypothetical protein Csa_010636 [Cucumis sativus]9.4e-8285.86Show/hide
Query:  MGFSLGTTSSSSMRRGHYHTHKLFLYTNYVLLGAASSCIFLTLSLRLLPSLCGLSLIFLHALTIAAAVSGCAMASTDSARWFGVHMVFTVLTAIFQGSVA
        MGFS+ +T S+SM R HYHTHKLFLYTNY+LLGAASSCIFLTLSLRLLPSLCGLSLIFLH LTIAAAVSGCAMA+  S RWFGVHMVFTVLTAIFQGSVA
Subjt:  MGFSLGTTSSSSMRRGHYHTHKLFLYTNYVLLGAASSCIFLTLSLRLLPSLCGLSLIFLHALTIAAAVSGCAMASTDSARWFGVHMVFTVLTAIFQGSVA

Query:  VLVYTRTGDFLGELKSYVREDDGAVILKMAGGLSVLMFVLEWVVLTLAFFLKYYAFVEGGEGTANAGVAMRSSSKVQQDEDLKDWPWPFQL
        +LVYTRTGDFL ELKSYVRE+DGAVILK+AGGLSV+MFVLEWVVLTLAFFL+YYAFVE  EG+ N G AMR S+KVQQDEDLKDWPWPFQL
Subjt:  VLVYTRTGDFLGELKSYVREDDGAVILKMAGGLSVLMFVLEWVVLTLAFFLKYYAFVEGGEGTANAGVAMRSSSKVQQDEDLKDWPWPFQL

KAG6596776.1 hypothetical protein SDJN03_09956, partial [Cucurbita argyrosperma subsp. sororia]8.5e-8386.46Show/hide
Query:  MGFSLG--TTSSSSMRRGHYHTHKLFLYTNYVLLGAASSCIFLTLSLRLLPSLCGLSLIFLHALTIAAAVSGCAMASTDSARWFGVHMVFTVLTAIFQGS
        MGFS+     SS+SM R HYHTHKLFLYTNY+LLGAASSCIFLTLSLRLLPSLCGLSLIFLH LTIAAAVSGCAMAST S RWFG+HMVFTVLTAIFQGS
Subjt:  MGFSLG--TTSSSSMRRGHYHTHKLFLYTNYVLLGAASSCIFLTLSLRLLPSLCGLSLIFLHALTIAAAVSGCAMASTDSARWFGVHMVFTVLTAIFQGS

Query:  VAVLVYTRTGDFLGELKSYVREDDGAVILKMAGGLSVLMFVLEWVVLTLAFFLKYYAFVEGGEGTANAGVAMRSSSKVQQDEDLKDWPWPFQ
        VAVLVYTRTGDFLGELKSYVRE+DGAVILK+AGGLSV MFVLEWVVLTLAFFLKYYAFVEGG G+ N   AMR S+KV+QDEDLKDWPWPFQ
Subjt:  VAVLVYTRTGDFLGELKSYVREDDGAVILKMAGGLSVLMFVLEWVVLTLAFFLKYYAFVEGGEGTANAGVAMRSSSKVQQDEDLKDWPWPFQ

XP_022958196.1 uncharacterized protein LOC111459467 [Cucurbita moschata]5.0e-8386.01Show/hide
Query:  MGFSLG--TTSSSSMRRGHYHTHKLFLYTNYVLLGAASSCIFLTLSLRLLPSLCGLSLIFLHALTIAAAVSGCAMASTDSARWFGVHMVFTVLTAIFQGS
        MGFS+     SS+SM R HYHTHKLFLYTNY+LLGAASSCIFLTLSLRLLPSLCGLSLIFLH LTIAAAVSGCAMAST S RWFG+HMVFTVLTAIFQGS
Subjt:  MGFSLG--TTSSSSMRRGHYHTHKLFLYTNYVLLGAASSCIFLTLSLRLLPSLCGLSLIFLHALTIAAAVSGCAMASTDSARWFGVHMVFTVLTAIFQGS

Query:  VAVLVYTRTGDFLGELKSYVREDDGAVILKMAGGLSVLMFVLEWVVLTLAFFLKYYAFVEGGEGTANAGVAMRSSSKVQQDEDLKDWPWPFQL
        VAVLVYTRTGDFLGELKSYVRE+DGAVILK+AGGLSV MFVLEWVVLTLAFFLKYYAFVEGG G+ N   AMR S+KV+QDEDLKDWPWPFQ+
Subjt:  VAVLVYTRTGDFLGELKSYVREDDGAVILKMAGGLSVLMFVLEWVVLTLAFFLKYYAFVEGGEGTANAGVAMRSSSKVQQDEDLKDWPWPFQL

XP_023005568.1 uncharacterized protein LOC111498512 [Cucurbita maxima]6.5e-8386.01Show/hide
Query:  MGFSLG--TTSSSSMRRGHYHTHKLFLYTNYVLLGAASSCIFLTLSLRLLPSLCGLSLIFLHALTIAAAVSGCAMASTDSARWFGVHMVFTVLTAIFQGS
        MGFS+     SS+SM R HYHTHKLFLYTNY+LLGAASSCIFLTLSLRLLPSLCGLSLIFLH LTIAAAVSGCAMAST S RWFG HMVFTVLTAIFQGS
Subjt:  MGFSLG--TTSSSSMRRGHYHTHKLFLYTNYVLLGAASSCIFLTLSLRLLPSLCGLSLIFLHALTIAAAVSGCAMASTDSARWFGVHMVFTVLTAIFQGS

Query:  VAVLVYTRTGDFLGELKSYVREDDGAVILKMAGGLSVLMFVLEWVVLTLAFFLKYYAFVEGGEGTANAGVAMRSSSKVQQDEDLKDWPWPFQL
        VAVLVYTRTGDFLGELKSYVRE+DGAVILK+AGGLSV+MFVLEWVVLTLAFFLKYYAFVEGG G+ N   AMR S+KV+QDEDLKDWPWPFQ+
Subjt:  VAVLVYTRTGDFLGELKSYVREDDGAVILKMAGGLSVLMFVLEWVVLTLAFFLKYYAFVEGGEGTANAGVAMRSSSKVQQDEDLKDWPWPFQL

XP_023540529.1 uncharacterized protein LOC111800866 [Cucurbita pepo subsp. pepo]2.9e-8386.01Show/hide
Query:  MGFSLG--TTSSSSMRRGHYHTHKLFLYTNYVLLGAASSCIFLTLSLRLLPSLCGLSLIFLHALTIAAAVSGCAMASTDSARWFGVHMVFTVLTAIFQGS
        MGFS+     SS+SM R HYHTHKLFLYTNY+LLGAASSCIFLTLSLRLLPSLCGLSLIFLH LTIAAAVSGCAMAST S RWFG+HMVFTVLTAIFQGS
Subjt:  MGFSLG--TTSSSSMRRGHYHTHKLFLYTNYVLLGAASSCIFLTLSLRLLPSLCGLSLIFLHALTIAAAVSGCAMASTDSARWFGVHMVFTVLTAIFQGS

Query:  VAVLVYTRTGDFLGELKSYVREDDGAVILKMAGGLSVLMFVLEWVVLTLAFFLKYYAFVEGGEGTANAGVAMRSSSKVQQDEDLKDWPWPFQL
        VAVLVYTRTGDFLGELKSYVRE+DGAVILK+AGGLSV+MFVLEWVVLTLAFFLKYYAFVEGG G+ N   AMR S+KV+QDEDLKDWPWPFQ+
Subjt:  VAVLVYTRTGDFLGELKSYVREDDGAVILKMAGGLSVLMFVLEWVVLTLAFFLKYYAFVEGGEGTANAGVAMRSSSKVQQDEDLKDWPWPFQL

TrEMBL top hitse value%identityAlignment
A0A0A0L253 Uncharacterized protein1.0e-8185.34Show/hide
Query:  MGFSLGTTSSSSMRRGHYHTHKLFLYTNYVLLGAASSCIFLTLSLRLLPSLCGLSLIFLHALTIAAAVSGCAMASTDSARWFGVHMVFTVLTAIFQGSVA
        MGFS+ +T S+SM R HYHTHKLFLYTNY+LLGAASSCIFLTLSLRLLPSLCGLSLIFLH LTIAAAVSGCAMA+  S RWFGVHMVFTVLTAIFQGSVA
Subjt:  MGFSLGTTSSSSMRRGHYHTHKLFLYTNYVLLGAASSCIFLTLSLRLLPSLCGLSLIFLHALTIAAAVSGCAMASTDSARWFGVHMVFTVLTAIFQGSVA

Query:  VLVYTRTGDFLGELKSYVREDDGAVILKMAGGLSVLMFVLEWVVLTLAFFLKYYAFVEGGEGTANAGVAMRSSSKVQQDEDLKDWPWPFQL
        +LVYTRTGDFL ELKSYVRE+DGAVILK+AGGLSV+MFVLEWVVLTLAFFL+YYAFVE  EG+ N G AMR S+KVQQDEDLKDWPWPFQ+
Subjt:  VLVYTRTGDFLGELKSYVREDDGAVILKMAGGLSVLMFVLEWVVLTLAFFLKYYAFVEGGEGTANAGVAMRSSSKVQQDEDLKDWPWPFQL

A0A1S3BL21 uncharacterized protein LOC1034910041.0e-8185.34Show/hide
Query:  MGFSLGTTSSSSMRRGHYHTHKLFLYTNYVLLGAASSCIFLTLSLRLLPSLCGLSLIFLHALTIAAAVSGCAMASTDSARWFGVHMVFTVLTAIFQGSVA
        MGFS+ +T S+SM R HYHTHKLFLYTNY+LLGAASSCIFLTLSLRLLPSLCGLSLIFLH LTIAAAVSGCAMA+  S RWFGVHMVFTVLTAIFQGSVA
Subjt:  MGFSLGTTSSSSMRRGHYHTHKLFLYTNYVLLGAASSCIFLTLSLRLLPSLCGLSLIFLHALTIAAAVSGCAMASTDSARWFGVHMVFTVLTAIFQGSVA

Query:  VLVYTRTGDFLGELKSYVREDDGAVILKMAGGLSVLMFVLEWVVLTLAFFLKYYAFVEGGEGTANAGVAMRSSSKVQQDEDLKDWPWPFQL
        +LVYTRTGDFL ELKSYVRE+DGAVILK+AGGLSV+MFVLEWVVLTLAFFL+YYAFVE G+G+ N G AMR S+KVQQDEDLKDWPWPFQ+
Subjt:  VLVYTRTGDFLGELKSYVREDDGAVILKMAGGLSVLMFVLEWVVLTLAFFLKYYAFVEGGEGTANAGVAMRSSSKVQQDEDLKDWPWPFQL

A0A5D3D734 Uncharacterized protein1.0e-8185.34Show/hide
Query:  MGFSLGTTSSSSMRRGHYHTHKLFLYTNYVLLGAASSCIFLTLSLRLLPSLCGLSLIFLHALTIAAAVSGCAMASTDSARWFGVHMVFTVLTAIFQGSVA
        MGFS+ +T S+SM R HYHTHKLFLYTNY+LLGAASSCIFLTLSLRLLPSLCGLSLIFLH LTIAAAVSGCAMA+  S RWFGVHMVFTVLTAIFQGSVA
Subjt:  MGFSLGTTSSSSMRRGHYHTHKLFLYTNYVLLGAASSCIFLTLSLRLLPSLCGLSLIFLHALTIAAAVSGCAMASTDSARWFGVHMVFTVLTAIFQGSVA

Query:  VLVYTRTGDFLGELKSYVREDDGAVILKMAGGLSVLMFVLEWVVLTLAFFLKYYAFVEGGEGTANAGVAMRSSSKVQQDEDLKDWPWPFQL
        +LVYTRTGDFL ELKSYVRE+DGAVILK+AGGLSV+MFVLEWVVLTLAFFL+YYAFVE G+G+ N G AMR S+KVQQDEDLKDWPWPFQ+
Subjt:  VLVYTRTGDFLGELKSYVREDDGAVILKMAGGLSVLMFVLEWVVLTLAFFLKYYAFVEGGEGTANAGVAMRSSSKVQQDEDLKDWPWPFQL

A0A6J1H1G8 uncharacterized protein LOC1114594672.4e-8386.01Show/hide
Query:  MGFSLG--TTSSSSMRRGHYHTHKLFLYTNYVLLGAASSCIFLTLSLRLLPSLCGLSLIFLHALTIAAAVSGCAMASTDSARWFGVHMVFTVLTAIFQGS
        MGFS+     SS+SM R HYHTHKLFLYTNY+LLGAASSCIFLTLSLRLLPSLCGLSLIFLH LTIAAAVSGCAMAST S RWFG+HMVFTVLTAIFQGS
Subjt:  MGFSLG--TTSSSSMRRGHYHTHKLFLYTNYVLLGAASSCIFLTLSLRLLPSLCGLSLIFLHALTIAAAVSGCAMASTDSARWFGVHMVFTVLTAIFQGS

Query:  VAVLVYTRTGDFLGELKSYVREDDGAVILKMAGGLSVLMFVLEWVVLTLAFFLKYYAFVEGGEGTANAGVAMRSSSKVQQDEDLKDWPWPFQL
        VAVLVYTRTGDFLGELKSYVRE+DGAVILK+AGGLSV MFVLEWVVLTLAFFLKYYAFVEGG G+ N   AMR S+KV+QDEDLKDWPWPFQ+
Subjt:  VAVLVYTRTGDFLGELKSYVREDDGAVILKMAGGLSVLMFVLEWVVLTLAFFLKYYAFVEGGEGTANAGVAMRSSSKVQQDEDLKDWPWPFQL

A0A6J1L2J4 uncharacterized protein LOC1114985123.1e-8386.01Show/hide
Query:  MGFSLG--TTSSSSMRRGHYHTHKLFLYTNYVLLGAASSCIFLTLSLRLLPSLCGLSLIFLHALTIAAAVSGCAMASTDSARWFGVHMVFTVLTAIFQGS
        MGFS+     SS+SM R HYHTHKLFLYTNY+LLGAASSCIFLTLSLRLLPSLCGLSLIFLH LTIAAAVSGCAMAST S RWFG HMVFTVLTAIFQGS
Subjt:  MGFSLG--TTSSSSMRRGHYHTHKLFLYTNYVLLGAASSCIFLTLSLRLLPSLCGLSLIFLHALTIAAAVSGCAMASTDSARWFGVHMVFTVLTAIFQGS

Query:  VAVLVYTRTGDFLGELKSYVREDDGAVILKMAGGLSVLMFVLEWVVLTLAFFLKYYAFVEGGEGTANAGVAMRSSSKVQQDEDLKDWPWPFQL
        VAVLVYTRTGDFLGELKSYVRE+DGAVILK+AGGLSV+MFVLEWVVLTLAFFLKYYAFVEGG G+ N   AMR S+KV+QDEDLKDWPWPFQ+
Subjt:  VAVLVYTRTGDFLGELKSYVREDDGAVILKMAGGLSVLMFVLEWVVLTLAFFLKYYAFVEGGEGTANAGVAMRSSSKVQQDEDLKDWPWPFQL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G02640.1 unknown protein5.3e-5965Show/hide
Query:  SMRRGHYHTHKLFLYTNYVLLGAASSCIFLTLSLRLLPSLCGLSLIFLHALTIAAAVSGCAMASTDSARWFGVHMVFTVLTAIFQGSVAVLVYTRTGDFL
        S++  HY+THKLFL  NYVLLGA+SSCIFLTLSLRL+PSLCG  LI LHA TIAAAVSGCA AS    RW+  HM+ TVLTAIFQGSV+VL++T T +FL
Subjt:  SMRRGHYHTHKLFLYTNYVLLGAASSCIFLTLSLRLLPSLCGLSLIFLHALTIAAAVSGCAMASTDSARWFGVHMVFTVLTAIFQGSVAVLVYTRTGDFL

Query:  GELKSYVREDDGAVILKMAGGLSVLMFVLEWVVLTLAFFLKYYAFVEGGEGTANAGVAMRSSSKVQQDEDLKDWPWPFQL
          L SYVRE + ++ILK+AGGL V++F LEW+VL LAFFLKYYA+V+G     N GVAM+ + KVQ +E LK+ PW FQ+
Subjt:  GELKSYVREDDGAVILKMAGGLSVLMFVLEWVVLTLAFFLKYYAFVEGGEGTANAGVAMRSSSKVQQDEDLKDWPWPFQL

AT5G16250.1 unknown protein1.7e-6565.45Show/hide
Query:  MGFSLGTTSSSSMRRGHYHTHKLFLYTNYVLLGAASSCIFLTLSLRLLPSLCGLSLIFLHALTIAAAVSGCAMASTDSARWFGVHMVFTVLTAIFQGSVA
        MGF    +SSS +   HYHTHK+FL++NY+LLGAASSCIFLTLSLRL+PS+CG  LI LHA TIAAAVSGCA AS    RW+  HMV TVLTAIFQGSV+
Subjt:  MGFSLGTTSSSSMRRGHYHTHKLFLYTNYVLLGAASSCIFLTLSLRLLPSLCGLSLIFLHALTIAAAVSGCAMASTDSARWFGVHMVFTVLTAIFQGSVA

Query:  VLVYTRTGDFLGELKSYVREDDGAVILKMAGGLSVLMFVLEWVVLTLAFFLKYYAFVEGGEGTANAGVAMRSSSKVQQDEDLKDWPWPFQL
        VL++T T  FLG LKSYVRE+D AVILK+ GGL +++F L+W+VL  AFFLKYYA+V+GG+     GVAM+ + KVQ +E+ KDWPWPFQ+
Subjt:  VLVYTRTGDFLGELKSYVREDDGAVILKMAGGLSVLMFVLEWVVLTLAFFLKYYAFVEGGEGTANAGVAMRSSSKVQQDEDLKDWPWPFQL

AT5G36710.1 unknown protein7.7e-5060.75Show/hide
Query:  MRRGHYHTHKLFLYTNYVLLGAASSCIFLTLSLRLLPSLCGLSLIFLHALTIAAAVSGCAM-----ASTDSARWFGVHMVFTVLTAIFQGSVAVLVYTRT
        M +   +TH +FL  NY+LLG+ASSCIFLT+SLRL PSL GLSLIFL+ LTIA AVSGC++     ++T S R +G HMV TVLTAIFQG+V+VL++TRT
Subjt:  MRRGHYHTHKLFLYTNYVLLGAASSCIFLTLSLRLLPSLCGLSLIFLHALTIAAAVSGCAM-----ASTDSARWFGVHMVFTVLTAIFQGSVAVLVYTRT

Query:  GDFLGELKSYVREDDGAVILKMAGGLSVLMFVLEWVVLTLAFFLKYYAFVEGGEGTANAGVAMRSSSKV-QQDEDLKDWP-WPFQL
        GDFL  LKSYVRE+DG VILK++GGL VLMF LEW+VL LAF LKY  +++         V      KV +Q+EDLKDWP +PFQL
Subjt:  GDFLGELKSYVREDDGAVILKMAGGLSVLMFVLEWVVLTLAFFLKYYAFVEGGEGTANAGVAMRSSSKV-QQDEDLKDWP-WPFQL

AT5G36800.1 unknown protein7.7e-5060.75Show/hide
Query:  MRRGHYHTHKLFLYTNYVLLGAASSCIFLTLSLRLLPSLCGLSLIFLHALTIAAAVSGCAM-----ASTDSARWFGVHMVFTVLTAIFQGSVAVLVYTRT
        M +   +TH +FL  NY+LLG+ASSCIFLT+SLRL PSL GLSLIFL+ LTIA AVSGC++     ++T S R +G HMV TVLTAIFQG+V+VL++TRT
Subjt:  MRRGHYHTHKLFLYTNYVLLGAASSCIFLTLSLRLLPSLCGLSLIFLHALTIAAAVSGCAM-----ASTDSARWFGVHMVFTVLTAIFQGSVAVLVYTRT

Query:  GDFLGELKSYVREDDGAVILKMAGGLSVLMFVLEWVVLTLAFFLKYYAFVEGGEGTANAGVAMRSSSKV-QQDEDLKDWP-WPFQL
        GDFL  LKSYVRE+DG VILK++GGL VLMF LEW+VL LAF LKY  +++         V      KV +Q+EDLKDWP +PFQL
Subjt:  GDFLGELKSYVREDDGAVILKMAGGLSVLMFVLEWVVLTLAFFLKYYAFVEGGEGTANAGVAMRSSSKV-QQDEDLKDWP-WPFQL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTCTCGTTGGGCACAACAAGCAGCAGTTCCATGAGGCGAGGCCATTACCACACCCACAAGCTGTTCCTGTACACAAACTACGTCCTCCTCGGGGCGGCCTCGAG
CTGCATCTTCCTCACCCTCTCCCTCCGCCTCCTCCCGTCTCTCTGCGGCCTCTCCCTCATCTTCCTCCACGCCCTCACCATCGCCGCCGCCGTCTCCGGCTGCGCAATGG
CCTCCACCGACTCCGCTCGCTGGTTCGGCGTCCACATGGTGTTCACCGTGCTCACCGCCATCTTCCAGGGCTCCGTCGCGGTGTTGGTGTACACCAGAACTGGGGATTTC
CTGGGCGAGCTCAAATCGTACGTGAGGGAGGATGACGGGGCGGTGATCCTGAAAATGGCGGGTGGGTTGAGTGTGCTTATGTTTGTGTTGGAGTGGGTTGTTCTCACTTT
GGCGTTCTTTCTCAAGTATTATGCGTTTGTGGAAGGAGGAGAAGGGACTGCAAATGCGGGAGTGGCTATGAGGAGTTCTTCTAAGGTTCAGCAGGATGAGGATTTGAAGG
ATTGGCCATGGCCTTTTCAACTTTAG
mRNA sequenceShow/hide mRNA sequence
CTCCACACCTTATCACGCAATCTCAAAAAAATTGATCACAATTCAATTGAAATTGTTTCTGTAAGAAATTGGAAAATCCCGAAAATGGGTTTCTCGTTGGGCACAACAAG
CAGCAGTTCCATGAGGCGAGGCCATTACCACACCCACAAGCTGTTCCTGTACACAAACTACGTCCTCCTCGGGGCGGCCTCGAGCTGCATCTTCCTCACCCTCTCCCTCC
GCCTCCTCCCGTCTCTCTGCGGCCTCTCCCTCATCTTCCTCCACGCCCTCACCATCGCCGCCGCCGTCTCCGGCTGCGCAATGGCCTCCACCGACTCCGCTCGCTGGTTC
GGCGTCCACATGGTGTTCACCGTGCTCACCGCCATCTTCCAGGGCTCCGTCGCGGTGTTGGTGTACACCAGAACTGGGGATTTCCTGGGCGAGCTCAAATCGTACGTGAG
GGAGGATGACGGGGCGGTGATCCTGAAAATGGCGGGTGGGTTGAGTGTGCTTATGTTTGTGTTGGAGTGGGTTGTTCTCACTTTGGCGTTCTTTCTCAAGTATTATGCGT
TTGTGGAAGGAGGAGAAGGGACTGCAAATGCGGGAGTGGCTATGAGGAGTTCTTCTAAGGTTCAGCAGGATGAGGATTTGAAGGATTGGCCATGGCCTTTTCAACTTTAG
TTCTTCTTCATTCTGTTGTCATTCTTTTGTTTCTTGGCTTTGGCTGTCATTTTATTTCTGTATGTGTTTGTTTTCTGGAAATTTCTTAACTGAAATCATCAATATCTCAA
CTGCCTTATGATCAATATAATAGGATACTCCTTTCTCCTCTGTTTTTTTCTTTTGTACTCGAGAATGAATTTGTGCTAACAAAGTACTCCTGAACCAATTGGGATTTTTT
TTCAGGATAAACCAAATGGGAATTGGATCAGGGATGAGACTCCTCAACATTTTAACTTTTTTTTACATCAAATTGTGAACCAACTTCGTCAAGGGTTACAAGAGAATGAT
TCAATTTAATTTAACAAGTTTAGCAGCACTCAAATTAGGTCACTCAAATAAG
Protein sequenceShow/hide protein sequence
MGFSLGTTSSSSMRRGHYHTHKLFLYTNYVLLGAASSCIFLTLSLRLLPSLCGLSLIFLHALTIAAAVSGCAMASTDSARWFGVHMVFTVLTAIFQGSVAVLVYTRTGDF
LGELKSYVREDDGAVILKMAGGLSVLMFVLEWVVLTLAFFLKYYAFVEGGEGTANAGVAMRSSSKVQQDEDLKDWPWPFQL