; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr022093 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr022093
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionTransmembrane protein
Genome locationtig00153874:1101685..1105554
RNA-Seq ExpressionSgr022093
SyntenySgr022093
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8650098.1 hypothetical protein Csa_010636 [Cucumis sativus]1.2e-7983.6Show/hide
Query:  MGFSMNGDPSSSMTTRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGS
        MGFS+   PS+SM TRSHYHTHK+FLY NYILLGAASSCIFLTLSLRL+PSLCG+S++FLHILTIA+AVSGCAM A+A +TRWFGVHMVFTVLTAIFQGS
Subjt:  MGFSMNGDPSSSMTTRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGS

Query:  VAVLIFTRTGEFLGQLKSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGVNNGVAMRSAKVQQDEDLKDWPWPFQ
        VA+L++TRTG+FL +LKSYVREEDGAVI+KLAGGLSVVMF LEWVVLTLAFFLRYYA+VE    NNG AMRSAKVQQDEDLKDWPWPFQ
Subjt:  VAVLIFTRTGEFLGQLKSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGVNNGVAMRSAKVQQDEDLKDWPWPFQ

XP_004148268.1 uncharacterized protein LOC101206234 [Cucumis sativus]1.2e-7983.6Show/hide
Query:  MGFSMNGDPSSSMTTRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGS
        MGFS+   PS+SM TRSHYHTHK+FLY NYILLGAASSCIFLTLSLRL+PSLCG+S++FLHILTIA+AVSGCAM A+A +TRWFGVHMVFTVLTAIFQGS
Subjt:  MGFSMNGDPSSSMTTRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGS

Query:  VAVLIFTRTGEFLGQLKSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGVNNGVAMRSAKVQQDEDLKDWPWPFQ
        VA+L++TRTG+FL +LKSYVREEDGAVI+KLAGGLSVVMF LEWVVLTLAFFLRYYA+VE    NNG AMRSAKVQQDEDLKDWPWPFQ
Subjt:  VAVLIFTRTGEFLGQLKSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGVNNGVAMRSAKVQQDEDLKDWPWPFQ

XP_008449000.1 PREDICTED: uncharacterized protein LOC103491004 [Cucumis melo]5.5e-8084.21Show/hide
Query:  MGFSMNGDPSSSMTTRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGS
        MGFS+   PS+SM TRSHYHTHK+FLY NYILLGAASSCIFLTLSLRL+PSLCG+S++FLHILTIA+AVSGCAM A+A +TRWFGVHMVFTVLTAIFQGS
Subjt:  MGFSMNGDPSSSMTTRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGS

Query:  VAVLIFTRTGEFLGQLKSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGV-NNGVAMRSAKVQQDEDLKDWPWPFQ
        VA+L++TRTG+FL +LKSYVREEDGAVI+KLAGGLSVVMF LEWVVLTLAFFLRYYA+VEG+G  NNG AMRSAKVQQDEDLKDWPWPFQ
Subjt:  VAVLIFTRTGEFLGQLKSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGV-NNGVAMRSAKVQQDEDLKDWPWPFQ

XP_023540529.1 uncharacterized protein LOC111800866 [Cucurbita pepo subsp. pepo]2.5e-7780.73Show/hide
Query:  MGFSMNGDPSSSMT-TRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQG
        MGFS++G P SS + +RSHYHTHK+FLY NYILLGAASSCIFLTLSLRL+PSLCG+S++FLHILTIA+AVSGCAM AS  + RWFG+HMVFTVLTAIFQG
Subjt:  MGFSMNGDPSSSMT-TRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQG

Query:  SVAVLIFTRTGEFLGQLKSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEG--VNNGVAMRSAKVQQDEDLKDWPWPFQ
        SVAVL++TRTG+FLG+LKSYVREEDGAVI+KLAGGLSVVMF LEWVVLTLAFFL+YYA+VEG G   N   AMRSAKV+QDEDLKDWPWPFQ
Subjt:  SVAVLIFTRTGEFLGQLKSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEG--VNNGVAMRSAKVQQDEDLKDWPWPFQ

XP_038904159.1 uncharacterized protein LOC120090516 [Benincasa hispida]7.1e-8083.77Show/hide
Query:  MGFSMNGDPSSSMTTRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGS
        MGFS+  +PS+SM +RSHYHTHK+FLY NYILLGAASSCIFLTLSLRL+PSLCG+S++FLHILTIA+AVSGCAM A+A A RWFGVHMVFTVLTAIFQGS
Subjt:  MGFSMNGDPSSSMTTRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGS

Query:  VAVLIFTRTGEFLGQLKSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGVN--NGVAMRSAKVQQDEDLKDWPWPFQ
        VA+L+FTRTG+FL +LKSYVREEDGAVI+KLAGGLSVVMF LEWVVLTLAFFL+YYA+VEGEG N  NGV MRSAKVQQDEDLKDWPWPFQ
Subjt:  VAVLIFTRTGEFLGQLKSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGVN--NGVAMRSAKVQQDEDLKDWPWPFQ

TrEMBL top hitse value%identityAlignment
A0A0A0L253 Uncharacterized protein5.9e-8083.6Show/hide
Query:  MGFSMNGDPSSSMTTRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGS
        MGFS+   PS+SM TRSHYHTHK+FLY NYILLGAASSCIFLTLSLRL+PSLCG+S++FLHILTIA+AVSGCAM A+A +TRWFGVHMVFTVLTAIFQGS
Subjt:  MGFSMNGDPSSSMTTRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGS

Query:  VAVLIFTRTGEFLGQLKSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGVNNGVAMRSAKVQQDEDLKDWPWPFQ
        VA+L++TRTG+FL +LKSYVREEDGAVI+KLAGGLSVVMF LEWVVLTLAFFLRYYA+VE    NNG AMRSAKVQQDEDLKDWPWPFQ
Subjt:  VAVLIFTRTGEFLGQLKSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGVNNGVAMRSAKVQQDEDLKDWPWPFQ

A0A1S3BL21 uncharacterized protein LOC1034910042.6e-8084.21Show/hide
Query:  MGFSMNGDPSSSMTTRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGS
        MGFS+   PS+SM TRSHYHTHK+FLY NYILLGAASSCIFLTLSLRL+PSLCG+S++FLHILTIA+AVSGCAM A+A +TRWFGVHMVFTVLTAIFQGS
Subjt:  MGFSMNGDPSSSMTTRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGS

Query:  VAVLIFTRTGEFLGQLKSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGV-NNGVAMRSAKVQQDEDLKDWPWPFQ
        VA+L++TRTG+FL +LKSYVREEDGAVI+KLAGGLSVVMF LEWVVLTLAFFLRYYA+VEG+G  NNG AMRSAKVQQDEDLKDWPWPFQ
Subjt:  VAVLIFTRTGEFLGQLKSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGV-NNGVAMRSAKVQQDEDLKDWPWPFQ

A0A5D3D734 Uncharacterized protein2.6e-8084.21Show/hide
Query:  MGFSMNGDPSSSMTTRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGS
        MGFS+   PS+SM TRSHYHTHK+FLY NYILLGAASSCIFLTLSLRL+PSLCG+S++FLHILTIA+AVSGCAM A+A +TRWFGVHMVFTVLTAIFQGS
Subjt:  MGFSMNGDPSSSMTTRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGS

Query:  VAVLIFTRTGEFLGQLKSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGV-NNGVAMRSAKVQQDEDLKDWPWPFQ
        VA+L++TRTG+FL +LKSYVREEDGAVI+KLAGGLSVVMF LEWVVLTLAFFLRYYA+VEG+G  NNG AMRSAKVQQDEDLKDWPWPFQ
Subjt:  VAVLIFTRTGEFLGQLKSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGV-NNGVAMRSAKVQQDEDLKDWPWPFQ

A0A6J1H1G8 uncharacterized protein LOC1114594673.6e-7780.21Show/hide
Query:  MGFSMNGDPSSSMT-TRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQG
        MGFS++G P SS + +RSHYHTHK+FLY NYILLGAASSCIFLTLSLRL+PSLCG+S++FLHILTIA+AVSGCAM AS  + RWFG+HMVFTVLTAIFQG
Subjt:  MGFSMNGDPSSSMT-TRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQG

Query:  SVAVLIFTRTGEFLGQLKSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEG--VNNGVAMRSAKVQQDEDLKDWPWPFQ
        SVAVL++TRTG+FLG+LKSYVREEDGAVI+KLAGGLSV MF LEWVVLTLAFFL+YYA+VEG G   N   AMRSAKV+QDEDLKDWPWPFQ
Subjt:  SVAVLIFTRTGEFLGQLKSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEG--VNNGVAMRSAKVQQDEDLKDWPWPFQ

A0A6J1L2J4 uncharacterized protein LOC1114985122.7e-7780.73Show/hide
Query:  MGFSMNGDPSSSMT-TRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQG
        MGFS++G P SS + +RSHYHTHK+FLY NYILLGAASSCIFLTLSLRL+PSLCG+S++FLHILTIA+AVSGCAM AS  + RWFG HMVFTVLTAIFQG
Subjt:  MGFSMNGDPSSSMT-TRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQG

Query:  SVAVLIFTRTGEFLGQLKSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEG--VNNGVAMRSAKVQQDEDLKDWPWPFQ
        SVAVL++TRTG+FLG+LKSYVREEDGAVI+KLAGGLSVVMF LEWVVLTLAFFL+YYA+VEG G   N   AMRSAKV+QDEDLKDWPWPFQ
Subjt:  SVAVLIFTRTGEFLGQLKSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEG--VNNGVAMRSAKVQQDEDLKDWPWPFQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G02640.1 unknown protein6.3e-5867.24Show/hide
Query:  SHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGSVAVLIFTRTGEFLGQL
        SHY+THK+FL  NY+LLGA+SSCIFLTLSLRL+PSLCG  ++ LH  TIA+AVSGCA  AS G  RW+  HM+ TVLTAIFQGSV+VLIFT T  FL  L
Subjt:  SHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGSVAVLIFTRTGEFLGQL

Query:  KSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGVNNGVAM-RSAKVQQDEDLKDWPWPFQ
         SYVRE++ ++I+KLAGGL VV+F LEW+VL LAFFL+YYAYV+G+  NNGVAM R+ KVQ +E LK+ PW FQ
Subjt:  KSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGVNNGVAM-RSAKVQQDEDLKDWPWPFQ

AT5G16250.1 unknown protein1.3e-6369.61Show/hide
Query:  SSSMTTRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGSVAVLIFTRT
        SSS    SHYHTHKIFL+ NYILLGAASSCIFLTLSLRL+PS+CG  ++ LH  TIA+AVSGCA  AS G  RW+  HMV TVLTAIFQGSV+VLIFT T
Subjt:  SSSMTTRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGSVAVLIFTRT

Query:  GEFLGQLKSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGVNNGVAM-RSAKVQQDEDLKDWPWPFQ
         +FLG LKSYVREED AVI+KL GGL +V+F L+W+VL  AFFL+YYAYV+G    +GVAM R+ KVQ +E+ KDWPWPFQ
Subjt:  GEFLGQLKSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGVNNGVAM-RSAKVQQDEDLKDWPWPFQ

AT5G36710.1 unknown protein1.3e-5062.43Show/hide
Query:  TRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASA----GATRWFGVHMVFTVLTAIFQGSVAVLIFTRTG
        ++S  +TH IFL CNYILLG+ASSCIFLT+SLRL PSL G+S++FL+ LTIA+AVSGC++ AS+     + R +G HMV TVLTAIFQG+V+VLIFTRTG
Subjt:  TRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASA----GATRWFGVHMVFTVLTAIFQGSVAVLIFTRTG

Query:  EFLGQLKSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGVNNGVAMRSAKV-QQDEDLKDWP-WPFQ
        +FL  LKSYVREEDG VI+KL+GGL V+MF LEW+VL LAF L+Y  Y++   V++       KV +Q+EDLKDWP +PFQ
Subjt:  EFLGQLKSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGVNNGVAMRSAKV-QQDEDLKDWP-WPFQ

AT5G36800.1 unknown protein1.3e-5062.43Show/hide
Query:  TRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASA----GATRWFGVHMVFTVLTAIFQGSVAVLIFTRTG
        ++S  +TH IFL CNYILLG+ASSCIFLT+SLRL PSL G+S++FL+ LTIA+AVSGC++ AS+     + R +G HMV TVLTAIFQG+V+VLIFTRTG
Subjt:  TRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASA----GATRWFGVHMVFTVLTAIFQGSVAVLIFTRTG

Query:  EFLGQLKSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGVNNGVAMRSAKV-QQDEDLKDWP-WPFQ
        +FL  LKSYVREEDG VI+KL+GGL V+MF LEW+VL LAF L+Y  Y++   V++       KV +Q+EDLKDWP +PFQ
Subjt:  EFLGQLKSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGVNNGVAMRSAKV-QQDEDLKDWP-WPFQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTCTCCATGAACGGCGACCCATCAAGCTCCATGACTACCCGATCCCACTACCACACCCACAAGATCTTCCTCTACTGCAACTACATCCTCCTCGGTGCCGCCTC
CAGCTGCATCTTCCTGACGCTCTCCCTCCGCCTGGTCCCCTCCCTGTGCGGCGTCTCCATCGTCTTCCTCCACATCCTCACCATCGCCAGCGCCGTCTCGGGGTGTGCCA
TGGTGGCCTCCGCCGGCGCCACCCGGTGGTTCGGAGTGCACATGGTGTTCACCGTCCTCACCGCTATCTTCCAAGGGTCGGTGGCGGTGCTGATCTTCACGAGAACGGGG
GAGTTTCTGGGGCAGCTGAAGTCGTACGTGAGGGAGGAGGATGGGGCGGTGATAGTGAAGTTGGCGGGGGGGCTGAGCGTGGTGATGTTCTGGTTGGAGTGGGTGGTGCT
CACGCTGGCTTTTTTCTTGAGGTATTATGCGTATGTTGAAGGAGAGGGAGTGAATAATGGGGTGGCGATGAGGAGTGCGAAAGTGCAGCAGGATGAGGATTTGAAGGACT
GGCCATGGCCATTCCAAGCAACGGCCGACAGACGACATACAAGTGAGGGTGTAGGCAACAGTAAAGGGGAGGCTAAAGGCGACACTGCGATGGGTGAGGCTGAAGGTGTT
GCGGCAGTGATGGCAAAGGCGAGGTTGACAGCAATGGAGACAGGTGAGGCTGAAGGTCTTGCAATGGTGACAGGTGAGGCAATTGCGACAGCGAGGAGTGGTCTCGTGGT
CATGGCGTGGGCAACGAAAAGCAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTTTCTCCATGAACGGCGACCCATCAAGCTCCATGACTACCCGATCCCACTACCACACCCACAAGATCTTCCTCTACTGCAACTACATCCTCCTCGGTGCCGCCTC
CAGCTGCATCTTCCTGACGCTCTCCCTCCGCCTGGTCCCCTCCCTGTGCGGCGTCTCCATCGTCTTCCTCCACATCCTCACCATCGCCAGCGCCGTCTCGGGGTGTGCCA
TGGTGGCCTCCGCCGGCGCCACCCGGTGGTTCGGAGTGCACATGGTGTTCACCGTCCTCACCGCTATCTTCCAAGGGTCGGTGGCGGTGCTGATCTTCACGAGAACGGGG
GAGTTTCTGGGGCAGCTGAAGTCGTACGTGAGGGAGGAGGATGGGGCGGTGATAGTGAAGTTGGCGGGGGGGCTGAGCGTGGTGATGTTCTGGTTGGAGTGGGTGGTGCT
CACGCTGGCTTTTTTCTTGAGGTATTATGCGTATGTTGAAGGAGAGGGAGTGAATAATGGGGTGGCGATGAGGAGTGCGAAAGTGCAGCAGGATGAGGATTTGAAGGACT
GGCCATGGCCATTCCAAGCAACGGCCGACAGACGACATACAAGTGAGGGTGTAGGCAACAGTAAAGGGGAGGCTAAAGGCGACACTGCGATGGGTGAGGCTGAAGGTGTT
GCGGCAGTGATGGCAAAGGCGAGGTTGACAGCAATGGAGACAGGTGAGGCTGAAGGTCTTGCAATGGTGACAGGTGAGGCAATTGCGACAGCGAGGAGTGGTCTCGTGGT
CATGGCGTGGGCAACGAAAAGCAGTTGA
Protein sequenceShow/hide protein sequence
MGFSMNGDPSSSMTTRSHYHTHKIFLYCNYILLGAASSCIFLTLSLRLVPSLCGVSIVFLHILTIASAVSGCAMVASAGATRWFGVHMVFTVLTAIFQGSVAVLIFTRTG
EFLGQLKSYVREEDGAVIVKLAGGLSVVMFWLEWVVLTLAFFLRYYAYVEGEGVNNGVAMRSAKVQQDEDLKDWPWPFQATADRRHTSEGVGNSKGEAKGDTAMGEAEGV
AAVMAKARLTAMETGEAEGLAMVTGEAIATARSGLVVMAWATKSS