CuGenDBv2

Sed0009153 (gene) of Chayote v1 genome

Gene ID: Sed0009153
Organism: Sechium edule (Chayote v1)
Description: DUF4050 family protein
Genome location: LG12:33647780..33652150
RNA-Seq Expression: Sed0009153
Synteny: Sed0009153
Gene Ontology terms: NA
InterPro domains: IPR025124 - Domain of unknown function DUF4050
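
The genome location above uses a "linkage group:start..end" form. As a minimal sketch, the string can be split into its parts and the gene span computed as follows. The helper names are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end).

    Hypothetical helper for illustration; not part of any database API.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1

seqid, start, end = parse_location("LG12:33647780..33652150")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 4371 bp on LG12, which is longer than the mRNA listed further down.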


Homology
GenBank top hits (e value, % identity, alignment)
XP_004150590.1 uncharacterized protein LOC101203426 isoform X2 [Cucumis sativus] (e value 4.4e-70, 84.76% identity)
Query:  SSFAAWISRFFACMGGCFGCCTKPTPITAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNH
        SSFAAWISR FACMGGCFGCCTKPTPI AVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNV  S S+ SEFVNH
Subjt:  SSFAAWISRFFACMGGCFGCCTKPTPITAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNH

Query:  GLLLWNQNRMQWIGSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPLS--VSLLLKV
        GLLLWNQ RMQWIGSG++ T D+T+QR+K KIS RATYDSLL TRQPFPH IPLS  V+ L++V
Subjt:  GLLLWNQNRMQWIGSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPLS--VSLLLKV

XP_022144929.1 uncharacterized protein LOC111014486 [Momordica charantia] (e value 3.4e-70, 86.59% identity)
Query:  SSFAAWISRFFACMGGCFGCCTKPTPITAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNH
        SSFAAWISR FACMGGCFGCCTKPTPI AVDEPSKGLRIQGR+VKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTL+ SNVG S SNPSEFVNH
Subjt:  SSFAAWISRFFACMGGCFGCCTKPTPITAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNH

Query:  GLLLWNQNRMQWIGSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPLS--VSLLLKV
        GLLLWNQNR+QW GS SS T DQTQQRRK KIS RATYDSLL TRQPFPH IPLS  V+ L++V
Subjt:  GLLLWNQNRMQWIGSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPLS--VSLLLKV

XP_022933439.1 uncharacterized protein LOC111440853 [Cucurbita moschata] (e value 5.8e-70, 86.06% identity)
Query:  SSFAAWISRFFACMGGCFGCCTKPTPITAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNH
        SSFAAWISR FACMGGCFGCCTKPTPI AVDEPSKGLRIQGRVVKK SISDGFWSTSTCDLDNSTIQSQ SISSISTSNLTL+HSNVG S SNPSEFVNH
Subjt:  SSFAAWISRFFACMGGCFGCCTKPTPITAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNH

Query:  GLLLWNQNRMQWI-GSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPLS--VSLLLKV
        GLLLWNQNR+QWI GS SSNT DQTQQ+RK KIS RATYDSLL+TRQ FPH IPLS  V  L++V
Subjt:  GLLLWNQNRMQWI-GSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPLS--VSLLLKV

XP_031739168.1 uncharacterized protein LOC101203426 isoform X1 [Cucumis sativus] (e value 5.8e-70, 87.74% identity)
Query:  SSFAAWISRFFACMGGCFGCCTKPTPITAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNH
        SSFAAWISR FACMGGCFGCCTKPTPI AVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNV  S S+ SEFVNH
Subjt:  SSFAAWISRFFACMGGCFGCCTKPTPITAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNH

Query:  GLLLWNQNRMQWIGSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPLS
        GLLLWNQ RMQWIGSG++ T D+T+QR+K KIS RATYDSLL TRQPFPH IPLS
Subjt:  GLLLWNQNRMQWIGSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPLS

XP_038879580.1 uncharacterized protein LOC120071390 [Benincasa hispida] (e value 4.4e-70, 84.76% identity)
Query:  SSFAAWISRFFACMGGCFGCCTKPTPITAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNH
        SSFAAWISR FACMGGCFGCCTKPTPI AVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVG S S+ SEFVNH
Subjt:  SSFAAWISRFFACMGGCFGCCTKPTPITAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNH

Query:  GLLLWNQNRMQWIGSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPLS--VSLLLKV
        GLLLWNQ R+QWIGS ++ T D+TQQR+K KIS RATYDSLL TRQPFPH IPLS  V+ L++V
Subjt:  GLLLWNQNRMQWIGSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPLS--VSLLLKV

TrEMBL top hits (e value, % identity, alignment)
A0A5D3C7W3 DUF4050 family protein (e value 2.0e-68, 82.93% identity)
Query:  SSFAAWISRFFACMGGCFGCCTKPTPITAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNH
        SSFAAWISR FACMGGCFGCCTKPTPI AVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSIST NLTLS+SNV  S S+ SEF+NH
Subjt:  SSFAAWISRFFACMGGCFGCCTKPTPITAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNH

Query:  GLLLWNQNRMQWIGSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPLS--VSLLLKV
        GLLLWNQ RMQWIGSG++   D+TQQR+K KIS RATYDSLL TRQPFPH IPLS  V+ L++V
Subjt:  GLLLWNQNRMQWIGSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPLS--VSLLLKV

A0A6J1CTQ5 uncharacterized protein LOC111014486 (e value 1.6e-70, 86.59% identity)
Query:  SSFAAWISRFFACMGGCFGCCTKPTPITAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNH
        SSFAAWISR FACMGGCFGCCTKPTPI AVDEPSKGLRIQGR+VKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTL+ SNVG S SNPSEFVNH
Subjt:  SSFAAWISRFFACMGGCFGCCTKPTPITAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNH

Query:  GLLLWNQNRMQWIGSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPLS--VSLLLKV
        GLLLWNQNR+QW GS SS T DQTQQRRK KIS RATYDSLL TRQPFPH IPLS  V+ L++V
Subjt:  GLLLWNQNRMQWIGSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPLS--VSLLLKV

A0A6J1EZS6 uncharacterized protein LOC111440853 (e value 2.8e-70, 86.06% identity)
Query:  SSFAAWISRFFACMGGCFGCCTKPTPITAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNH
        SSFAAWISR FACMGGCFGCCTKPTPI AVDEPSKGLRIQGRVVKK SISDGFWSTSTCDLDNSTIQSQ SISSISTSNLTL+HSNVG S SNPSEFVNH
Subjt:  SSFAAWISRFFACMGGCFGCCTKPTPITAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNH

Query:  GLLLWNQNRMQWI-GSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPLS--VSLLLKV
        GLLLWNQNR+QWI GS SSNT DQTQQ+RK KIS RATYDSLL+TRQ FPH IPLS  V  L++V
Subjt:  GLLLWNQNRMQWI-GSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPLS--VSLLLKV

A0A6J1H9M2 uncharacterized protein LOC111461800 (e value 1.8e-69, 83.54% identity)
Query:  SSFAAWISRFFACMGGCFGCCTKPTPITAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNH
        SSFAAWISR FACMGGCFGCCTKPTPI AVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLT S SNVG S SNPSE+VNH
Subjt:  SSFAAWISRFFACMGGCFGCCTKPTPITAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNH

Query:  GLLLWNQNRMQWIGSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPLS--VSLLLKV
        GLLLWNQ R+QWIGS ++NT D+T +R+K KIS RATYDSLL TRQPFPH IPLS  V+ L++V
Subjt:  GLLLWNQNRMQWIGSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPLS--VSLLLKV

A0A6J1JKP7 uncharacterized protein LOC111485294 (e value 4.0e-69, 82.93% identity)
Query:  SSFAAWISRFFACMGGCFGCCTKPTPITAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNH
        SSFAAWISR FACMGGCFGCCTKPT I AVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLT S SNVG S SNPSE+VNH
Subjt:  SSFAAWISRFFACMGGCFGCCTKPTPITAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNH

Query:  GLLLWNQNRMQWIGSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPLS--VSLLLKV
        GLLLWNQ R+QWIGS ++NT D+TQ+R+K KIS RATYD+LL TRQPFPH IPLS  V+ L++V
Subjt:  GLLLWNQNRMQWIGSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPLS--VSLLLKV

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G15350.1 unknown protein (e value 2.8e-22, 43.75% identity)
Query:  MGGCFGCCT--KPTPITAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNHGLLLWNQNRMQ
        MGGC GC    + T  +  D PS  +    R  KKPS+S+ FWSTST D+DN T  SQ S+SS   SN T    +   +++ P E+VN GLLLWNQ R +
Subjt:  MGGCFGCCT--KPTPITAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNHGLLLWNQNRMQ

Query:  WIGSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPLS
        W+G    N      Q  K   +  ATYDSLL + + FP  IPL+
Subjt:  WIGSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPLS

AT1G15350.2 unknown protein (e value 2.8e-22, 43.75% identity)
Query:  MGGCFGCCT--KPTPITAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNHGLLLWNQNRMQ
        MGGC GC    + T  +  D PS  +    R  KKPS+S+ FWSTST D+DN T  SQ S+SS   SN T    +   +++ P E+VN GLLLWNQ R +
Subjt:  MGGCFGCCT--KPTPITAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNHGLLLWNQNRMQ

Query:  WIGSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPLS
        W+G    N      Q  K   +  ATYDSLL + + FP  IPL+
Subjt:  WIGSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPLS

AT4G32342.1 unknown protein (e value 2.1e-25, 49.65% identity)
Query:  CFGCCTKPTP-ITAVDEPSKGLRIQGRVVKKPSI-SDGFWSTSTCDLD-NSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNHGLLLWNQNRMQWI
        CFGCC +    +  VDEPSKGL+IQG++VKK S  SD FWSTSTCD+D N TIQSQ S                 CS SN +EFVNHGL+LWN  R QW 
Subjt:  CFGCCTKPTP-ITAVDEPSKGLRIQGRVVKKPSI-SDGFWSTSTCDLD-NSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNHGLLLWNQNRMQWI

Query:  GSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPL
                 Q     +  IS  +TYDSLL+T + FP  IPL
Subjt:  GSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPL

AT5G25360.1 unknown protein (e value 9.0e-45, 59.33% identity)
Query:  AWISRFFACMGGCFGCCTKPTPITAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNHGLLL
        +WI + F CMGGCFGCC KP  I AVDEPSKGLRIQGR+VKKPS+S+ FWSTSTC++DNST+QSQRS+SSIS +N    +++   S SNP+EFVNHGL L
Subjt:  AWISRFFACMGGCFGCCTKPTPITAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNHGLLL

Query:  WNQNRMQWIGSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPL
        WNQ R QW+ +G   T+ +  + R+  IS  ATY+SLL   + F   IPL
Subjt:  WNQNRMQWIGSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPL

AT5G25360.2 unknown protein (e value 9.0e-45, 59.33% identity)
Query:  AWISRFFACMGGCFGCCTKPTPITAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNHGLLL
        +WI + F CMGGCFGCC KP  I AVDEPSKGLRIQGR+VKKPS+S+ FWSTSTC++DNST+QSQRS+SSIS +N    +++   S SNP+EFVNHGL L
Subjt:  AWISRFFACMGGCFGCCTKPTPITAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNHGLLL

Query:  WNQNRMQWIGSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPL
        WNQ R QW+ +G   T+ +  + R+  IS  ATY+SLL   + F   IPL
Subjt:  WNQNRMQWIGSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPL


Sequences
CDS sequence
ATGGTTAGCTCCTTCGCTGCCTGGATCAGCCGCTTCTTTGCCTGCATGGGGGGTTGTTTTGGATGCTGCACTAAACCCACACCTATTACTGCTGTGGACGAGCCATCTAA
GGGATTAAGAATTCAAGGACGAGTTGTTAAGAAACCTAGCATCTCTGACGGTTTTTGGAGCACGAGCACGTGCGATTTGGATAATAGCACCATTCAATCTCAACGAAGCA
TCTCGTCTATTAGTACATCAAACCTCACACTCAGTCATAGCAATGTTGGTTGCAGTGCGAGCAACCCTTCTGAATTTGTAAACCATGGTCTCCTTCTCTGGAATCAGAAT
AGGATGCAGTGGATTGGAAGTGGTAGTAGCAATACAGCGGATCAAACTCAACAAAGACGAAAGGAAAAGATCAGTTTGCGTGCAACATATGACAGTTTACTGGCTACAAG
ACAGCCTTTTCCCCATTCAATTCCTTTGTCTGTAAGTTTGCTGTTGAAAGTTTTGTCATTTCATTCTCCCCACTATTACATTCTATGA
mRNA sequence
ATCTACCATTATATTCCCATTTCTCTTCCCTGAATTTCCCAGAAACAACCCACCCATTTCCTTTTTTGGTATTTGCTTCTCATTTTCAGCTTTTTTGACCTTTTTCCACT
CTTATCTTTTCCCAATTCCAGGCTTTTGCTTTTGGTCTTTGCCGTTCCTCAATCGATGCCCTTTTCTGCGTTTTTCAATCATCGCCGGCATGGTTAGCTCCTTCGCTGCC
TGGATCAGCCGCTTCTTTGCCTGCATGGGGGGTTGTTTTGGATGCTGCACTAAACCCACACCTATTACTGCTGTGGACGAGCCATCTAAGGGATTAAGAATTCAAGGACG
AGTTGTTAAGAAACCTAGCATCTCTGACGGTTTTTGGAGCACGAGCACGTGCGATTTGGATAATAGCACCATTCAATCTCAACGAAGCATCTCGTCTATTAGTACATCAA
ACCTCACACTCAGTCATAGCAATGTTGGTTGCAGTGCGAGCAACCCTTCTGAATTTGTAAACCATGGTCTCCTTCTCTGGAATCAGAATAGGATGCAGTGGATTGGAAGT
GGTAGTAGCAATACAGCGGATCAAACTCAACAAAGACGAAAGGAAAAGATCAGTTTGCGTGCAACATATGACAGTTTACTGGCTACAAGACAGCCTTTTCCCCATTCAAT
TCCTTTGTCTGTAAGTTTGCTGTTGAAAGTTTTGTCATTTCATTCTCCCCACTATTACATTCTATGAATAAGTTCAAAATCTTCATACAGGAAATGGTGAACTTTCTTGT
GGAAGTATGGGAACAGGAGGGCCTATATGATTGAAAATGCTTTGTTTATTTTGGATAAATTCCTTGAATTGCAAAAAGAAAGGGAAAAAAAGGTGTTTTTTTTTTTCCTT
CTTACCTTTATCAATCATCCTACATTTTTGGTCTGAAACAGCAGTAAAATCAGCTGCCTTTGGTTGTGTTAATTCAATGCATGTGCATTCTAAGTGTTATTTTCTTTGTA
AATCTCCTTTCTTTTTCTTCATTGTTCATCCCATGAATAAGAGCTGGAAAATCACCATCATTCACTAGTTTCTGTATGCTGCCCAGATAAAAAGGAAACATTTTGATGTA
TTTATAGAGTGATGACTTGGGTTATACTGTGAATTTGTATCTATATTTTGATGACTTGGCTTTTTGTGCCTTTTTTTCCTTCTTTTTTGTTATTATTATATTATTATTTT
GTAAAATAATGTTTGTATGATTTGCCATGTGTTGGATGCGGTAGAGATGAAAATTGTCATAACGTGATACAAGGCAGGAATCTTCAGGGAAAATAAGAGGGGTAGGAGGA
TAAAGATTCTGAGAGGAAAATATATTTTTCATCACGTTTGTCTCTTTTATTTATTTTGTTTAATTTAAATTAGTTTACATGTAAACAATTTAAATGGTTTTTTTTTACTT
CATTAGAAACTAAAAACTAGCCTTGCAACAACCAATCCAAATGAGGGGAAACAACCCCAACCACCACTTTCGACTTGACCCACAAATCGTACCGAGATCAACCGTACTTC
AAAGAGCCATCGATACAGAACAAGAAGAGGAAAATATTGAGTTCATAGAGGACCAATACTTCCCTTGAACCTGAATGACTTTTATCAACCAACCCAAAGAAGTTAATTCT
ATATATCCAATCCCAGTCCTTATATAAAGTAAATAAACCAATCTGTCATAATAAAACAACAACAAAAAACTGTAATTAAGAACTTTAATACTGAGCAGGGCCAGCTCAAA
ACAACCTTCTGAACTTAAATACTTGGTCCAACTGCCTGTTCAGGCCCCTCGGTTTAATGACATATGCTCAGCCTTCGTATGCCTCTTGGAGATCTCGATTGGAGCCTGCT
CCTTGTCCACTCCAAGACTACCCAACCCGACCACCACGGTCGACATCGTGTTCTTAGCTTGCAACTTCAGAAATCTGCTAATCCTCTCATTGTCCAGGCTGTTGTGGGAT
AACAAGTAACTTCTGGGCAAGACTCTGGCTGTGATGAACCTTCAACAGCAGTTGCTCAGTTGAAGGGGTTGTAGAGACATCCTTGTACATAAAAAAGGGACCAAACAGAC
CATCGTCTGAATAAATGCAAAAAAGAAAATAGTTCTTGAGGGCTTTGTTGTTGTCCATTTTATGAACACAAATGCTTGTTTCTATCTTACCTTCCACT
Protein sequence
MVSSFAAWISRFFACMGGCFGCCTKPTPITAVDEPSKGLRIQGRVVKKPSISDGFWSTSTCDLDNSTIQSQRSISSISTSNLTLSHSNVGCSASNPSEFVNHGLLLWNQN
RMQWIGSGSSNTADQTQQRRKEKISLRATYDSLLATRQPFPHSIPLSVSLLLKVLSFHSPHYYIL
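
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below is one way to do that, assuming the standard genetic code (NCBI translation table 1), which the record does not state explicitly. The function name `translate` is hypothetical, and only the first and last codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
# Assumption: the record's protein was derived with this table.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons of the CDS listed above.
print(translate("ATGGTTAGCTCCTTCGCTGCCTGG"))
# Last three codons of the CDS listed above (ends in the stop codon TGA).
print(translate("ATTCTATGA"))
```

The two fragments translate to MVSSFAAW and IL, matching the start and end of the protein sequence. Passing the full CDS with its line breaks removed would extend the same check to the whole protein.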