CuGenDBv2

Spg031797 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg031797
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Unknown protein
Genome location: scaffold11:39357359..39367380
RNA-Seq Expression: Spg031797
Synteny: Spg031797
Gene Ontology terms: NA
InterPro domains: NA
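
The "Genome location" field above uses a `scaffold:start..end` notation. As a minimal sketch, it can be split into its parts and a span length computed. The `Region` type and `parse_region` helper are hypothetical names introduced here for illustration, and the length assumes 1-based inclusive coordinates, which this page does not state.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of CuGenDB)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_region(text: str) -> Region:
    """Parse a 'seqid:start..end' string into a Region."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Region(seqid, start, end)


region = parse_region("scaffold11:39357359..39367380")
print(region.seqid, region.start, region.end, region.length)
```

Under the inclusive-coordinate assumption, the locus spans 10,022 bp.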


Homology
GenBank top hits (e-value, % identity, alignment)
KAG7018715.1 hypothetical protein SDJN02_20586, partial [Cucurbita argyrosperma subsp. argyrosperma]; e-value 1.3e-80; identity 64.62%
Query:  MAEALLELKQVLRSKQNGLTFEEANLLQTCVSKAVRDGRFEALFGGGVTWAAGGA----LGV--VSARISWIPMAF---KFALAGTWRLNKFIRLILSGG
        M EAL EL+QVLRSKQN LT EEAN+LQTC SKAVRD  F  L GGGVTWA  G     LG+   S   +    A     FAL GTWRLNKF+RL LSGG
Subjt:  MAEALLELKQVLRSKQNGLTFEEANLLQTCVSKAVRDGRFEALFGGGVTWAAGGA----LGV--VSARISWIPMAF---KFALAGTWRLNKFIRLILSGG

Query:  AAGIFGIWRFSRSLNSCVDHILAMDGSRMQKELANIVVTKYHNDPRTMQHISKHFYCEKVFDDSTLDRPKIRLRYRNFFSDDVAHAQRAHDNDPKNNLHG
        A  +FG+ RFSRSL+SCVDHILA+DGSRMQKELANIVVTKYHNDPRTMQ ISKHF+ E+VFDDSTLDRPKIR RYRNFFSDDVAHAQR H NDPK+NLHG
Subjt:  AAGIFGIWRFSRSLNSCVDHILAMDGSRMQKELANIVVTKYHNDPRTMQHISKHFYCEKVFDDSTLDRPKIRLRYRNFFSDDVAHAQRAHDNDPKNNLHG

Query:  NSHDDSSNRDSSSNQGDSYGESDDKGNALEFKPVLVSAIIQLSWARMLPQTLWILFSVHWQEKKKFNNRVPLAHHRN
        N H DSSNRDS+ NQ DSYG+ DDKGNA EF PVL       + A  L      L      +    ++  P +HHR+
Subjt:  NSHDDSSNRDSSSNQGDSYGESDDKGNALEFKPVLVSAIIQLSWARMLPQTLWILFSVHWQEKKKFNNRVPLAHHRN

XP_022138115.1 uncharacterized protein LOC111009363 isoform X1 [Momordica charantia]; e-value 4.2e-84; identity 76.11%
Query:  MAEALLELKQVLRSKQNGLTFEEANLLQTCVSKAVRDGRFEALFGGGVTWAAGGALGVVSARISWIPMAFKFALAGTWRLNKFIRLILSGGAAGIFGIWR
        M EAL EL+QVLRSKQN LT EEA LLQTC SKAVRD  F AL GGGVTW                        AGTWRLNKFIRL LSGGAA +FG+WR
Subjt:  MAEALLELKQVLRSKQNGLTFEEANLLQTCVSKAVRDGRFEALFGGGVTWAAGGALGVVSARISWIPMAFKFALAGTWRLNKFIRLILSGGAAGIFGIWR

Query:  FSRSLNSCVDHILAMDGSRMQKELANIVVTKYHNDPRTMQHISKHFYCEKVFDDSTLDRPKIRLRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDDSSNR
        FSRSLNSCVDHILA+DGSRMQKELANIVVTKYHNDPRTMQHISKHFY EKVFDDSTLDRP+IR RYRNFFSDDVAH QR HDND KNNLHGNSH  SSN 
Subjt:  FSRSLNSCVDHILAMDGSRMQKELANIVVTKYHNDPRTMQHISKHFYCEKVFDDSTLDRPKIRLRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDDSSNR

Query:  DSSSNQGDSYGESDDKGNALEFKPVL
        DS+SNQ  SY E DDKGNALEFKPVL
Subjt:  DSSSNQGDSYGESDDKGNALEFKPVL

XP_022138116.1 uncharacterized protein LOC111009363 isoform X2 [Momordica charantia]; e-value 4.2e-84; identity 76.11%
Query:  MAEALLELKQVLRSKQNGLTFEEANLLQTCVSKAVRDGRFEALFGGGVTWAAGGALGVVSARISWIPMAFKFALAGTWRLNKFIRLILSGGAAGIFGIWR
        M EAL EL+QVLRSKQN LT EEA LLQTC SKAVRD  F AL GGGVTW                        AGTWRLNKFIRL LSGGAA +FG+WR
Subjt:  MAEALLELKQVLRSKQNGLTFEEANLLQTCVSKAVRDGRFEALFGGGVTWAAGGALGVVSARISWIPMAFKFALAGTWRLNKFIRLILSGGAAGIFGIWR

Query:  FSRSLNSCVDHILAMDGSRMQKELANIVVTKYHNDPRTMQHISKHFYCEKVFDDSTLDRPKIRLRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDDSSNR
        FSRSLNSCVDHILA+DGSRMQKELANIVVTKYHNDPRTMQHISKHFY EKVFDDSTLDRP+IR RYRNFFSDDVAH QR HDND KNNLHGNSH  SSN 
Subjt:  FSRSLNSCVDHILAMDGSRMQKELANIVVTKYHNDPRTMQHISKHFYCEKVFDDSTLDRPKIRLRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDDSSNR

Query:  DSSSNQGDSYGESDDKGNALEFKPVL
        DS+SNQ  SY E DDKGNALEFKPVL
Subjt:  DSSSNQGDSYGESDDKGNALEFKPVL

XP_022956077.1 uncharacterized protein LOC111457878 [Cucurbita moschata]; e-value 9.6e-81; identity 64.18%
Query:  MAEALLELKQVLRSKQNGLTFEEANLLQTCVSKAVRDGRFEALFGGGVTWAAGGALGVVSARISWIPMAFKFALAGTWRLNKFIRLILSGGAAGIFGIWR
        M EAL EL+QVLRSKQN LT EEAN+LQTC SKAVRD  F  L GGGVTW                        AGTWRLNKF+RL LSGGA  +FG+ R
Subjt:  MAEALLELKQVLRSKQNGLTFEEANLLQTCVSKAVRDGRFEALFGGGVTWAAGGALGVVSARISWIPMAFKFALAGTWRLNKFIRLILSGGAAGIFGIWR

Query:  FSRSLNSCVDHILAMDGSRMQKELANIVVTKYHNDPRTMQHISKHFYCEKVFDDSTLDRPKIRLRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDDSSNR
        FSRSL+SCVDHILA+DGSRMQKELANIVVTKYHNDPRTMQHISKHF+ E+VFDDSTLDRPKIR RYRNFFSDDVAHAQR H NDPK+NLHGN H DSSNR
Subjt:  FSRSLNSCVDHILAMDGSRMQKELANIVVTKYHNDPRTMQHISKHFYCEKVFDDSTLDRPKIRLRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDDSSNR

Query:  DSSSNQGDSYGESDDKGNALEFKPVLVSAIIQLSWARMLPQTLWILFSVHWQEKKKFNNRVPLAHHRN
        DS+ NQ DSYG+ DDKGNA EF PVL       + A  L      L      +    ++  P +HHR+
Subjt:  DSSSNQGDSYGESDDKGNALEFKPVLVSAIIQLSWARMLPQTLWILFSVHWQEKKKFNNRVPLAHHRN

XP_023527180.1 uncharacterized protein LOC111790494 isoform X1 [Cucurbita pepo subsp. pepo]; e-value 1.6e-80; identity 63.81%
Query:  MAEALLELKQVLRSKQNGLTFEEANLLQTCVSKAVRDGRFEALFGGGVTWAAGGALGVVSARISWIPMAFKFALAGTWRLNKFIRLILSGGAAGIFGIWR
        M EAL EL+QVLRSKQN LT EEAN+LQTC SKAVRD  F  L GGGVTW                        AGTWRLNKF+RL LSGGA  +FG+ R
Subjt:  MAEALLELKQVLRSKQNGLTFEEANLLQTCVSKAVRDGRFEALFGGGVTWAAGGALGVVSARISWIPMAFKFALAGTWRLNKFIRLILSGGAAGIFGIWR

Query:  FSRSLNSCVDHILAMDGSRMQKELANIVVTKYHNDPRTMQHISKHFYCEKVFDDSTLDRPKIRLRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDDSSNR
        FSRSL+SCVDHILA+DGSRMQKELANIVVTKYHNDPRTMQHISKHF+ E+VFDDSTLDRPKIR RYRNFFSDDVAHAQR H NDPK+NLHGN H DSSNR
Subjt:  FSRSLNSCVDHILAMDGSRMQKELANIVVTKYHNDPRTMQHISKHFYCEKVFDDSTLDRPKIRLRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDDSSNR

Query:  DSSSNQGDSYGESDDKGNALEFKPVLVSAIIQLSWARMLPQTLWILFSVHWQEKKKFNNRVPLAHHRN
        DS+ NQ DSYG+ DDKGNA EF PVL       + A  L      +      +    ++  P +HHR+
Subjt:  DSSSNQGDSYGESDDKGNALEFKPVLVSAIIQLSWARMLPQTLWILFSVHWQEKKKFNNRVPLAHHRN

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3AWL2 uncharacterized protein LOC103483703 isoform X1; e-value 9.7e-71; identity 67.26%
Query:  MAEALLELKQVLRSKQNGLTFEEANLLQTCVSKAVRDGRFEALFGGGVTWAAGGALGVVSARISWIPMAFKFALAGTWRLNKFIRLILSGGAAGIFGIWR
        M E L EL+ VLRSK NGLT EEA LLQTC SKAVRD  F  + GGG+TW                        AGTWRLNKF RL LSGGAA + G WR
Subjt:  MAEALLELKQVLRSKQNGLTFEEANLLQTCVSKAVRDGRFEALFGGGVTWAAGGALGVVSARISWIPMAFKFALAGTWRLNKFIRLILSGGAAGIFGIWR

Query:  FSRSLNSCVDHILAMDGSRMQKELANIVVTKYHNDPRTMQHISKHFYCEKVFDDSTLDRPKIRLRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDDSSNR
        FSRSLNSCVD+IL++DGSRMQKELANIVVT+YHNDPR MQ+ISKHF+ E+VFDDST DRPKIR RYRNFFSDDVAH+QR H ND  NN+H NSH     R
Subjt:  FSRSLNSCVDHILAMDGSRMQKELANIVVTKYHNDPRTMQHISKHFYCEKVFDDSTLDRPKIRLRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDDSSNR

Query:  DSSSNQGDSYGESDDKGNALEFKPVL
        DSS++Q DSYG+SDDKGNA EFKPVL
Subjt:  DSSSNQGDSYGESDDKGNALEFKPVL

A0A6J1C8I6 uncharacterized protein LOC111009363 isoform X1; e-value 2.0e-84; identity 76.11%
Query:  MAEALLELKQVLRSKQNGLTFEEANLLQTCVSKAVRDGRFEALFGGGVTWAAGGALGVVSARISWIPMAFKFALAGTWRLNKFIRLILSGGAAGIFGIWR
        M EAL EL+QVLRSKQN LT EEA LLQTC SKAVRD  F AL GGGVTW                        AGTWRLNKFIRL LSGGAA +FG+WR
Subjt:  MAEALLELKQVLRSKQNGLTFEEANLLQTCVSKAVRDGRFEALFGGGVTWAAGGALGVVSARISWIPMAFKFALAGTWRLNKFIRLILSGGAAGIFGIWR

Query:  FSRSLNSCVDHILAMDGSRMQKELANIVVTKYHNDPRTMQHISKHFYCEKVFDDSTLDRPKIRLRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDDSSNR
        FSRSLNSCVDHILA+DGSRMQKELANIVVTKYHNDPRTMQHISKHFY EKVFDDSTLDRP+IR RYRNFFSDDVAH QR HDND KNNLHGNSH  SSN 
Subjt:  FSRSLNSCVDHILAMDGSRMQKELANIVVTKYHNDPRTMQHISKHFYCEKVFDDSTLDRPKIRLRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDDSSNR

Query:  DSSSNQGDSYGESDDKGNALEFKPVL
        DS+SNQ  SY E DDKGNALEFKPVL
Subjt:  DSSSNQGDSYGESDDKGNALEFKPVL

A0A6J1C8T0 uncharacterized protein LOC111009363 isoform X2; e-value 2.0e-84; identity 76.11%
Query:  MAEALLELKQVLRSKQNGLTFEEANLLQTCVSKAVRDGRFEALFGGGVTWAAGGALGVVSARISWIPMAFKFALAGTWRLNKFIRLILSGGAAGIFGIWR
        M EAL EL+QVLRSKQN LT EEA LLQTC SKAVRD  F AL GGGVTW                        AGTWRLNKFIRL LSGGAA +FG+WR
Subjt:  MAEALLELKQVLRSKQNGLTFEEANLLQTCVSKAVRDGRFEALFGGGVTWAAGGALGVVSARISWIPMAFKFALAGTWRLNKFIRLILSGGAAGIFGIWR

Query:  FSRSLNSCVDHILAMDGSRMQKELANIVVTKYHNDPRTMQHISKHFYCEKVFDDSTLDRPKIRLRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDDSSNR
        FSRSLNSCVDHILA+DGSRMQKELANIVVTKYHNDPRTMQHISKHFY EKVFDDSTLDRP+IR RYRNFFSDDVAH QR HDND KNNLHGNSH  SSN 
Subjt:  FSRSLNSCVDHILAMDGSRMQKELANIVVTKYHNDPRTMQHISKHFYCEKVFDDSTLDRPKIRLRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDDSSNR

Query:  DSSSNQGDSYGESDDKGNALEFKPVL
        DS+SNQ  SY E DDKGNALEFKPVL
Subjt:  DSSSNQGDSYGESDDKGNALEFKPVL

A0A6J1GVC2 uncharacterized protein LOC111457878; e-value 4.6e-81; identity 64.18%
Query:  MAEALLELKQVLRSKQNGLTFEEANLLQTCVSKAVRDGRFEALFGGGVTWAAGGALGVVSARISWIPMAFKFALAGTWRLNKFIRLILSGGAAGIFGIWR
        M EAL EL+QVLRSKQN LT EEAN+LQTC SKAVRD  F  L GGGVTW                        AGTWRLNKF+RL LSGGA  +FG+ R
Subjt:  MAEALLELKQVLRSKQNGLTFEEANLLQTCVSKAVRDGRFEALFGGGVTWAAGGALGVVSARISWIPMAFKFALAGTWRLNKFIRLILSGGAAGIFGIWR

Query:  FSRSLNSCVDHILAMDGSRMQKELANIVVTKYHNDPRTMQHISKHFYCEKVFDDSTLDRPKIRLRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDDSSNR
        FSRSL+SCVDHILA+DGSRMQKELANIVVTKYHNDPRTMQHISKHF+ E+VFDDSTLDRPKIR RYRNFFSDDVAHAQR H NDPK+NLHGN H DSSNR
Subjt:  FSRSLNSCVDHILAMDGSRMQKELANIVVTKYHNDPRTMQHISKHFYCEKVFDDSTLDRPKIRLRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDDSSNR

Query:  DSSSNQGDSYGESDDKGNALEFKPVLVSAIIQLSWARMLPQTLWILFSVHWQEKKKFNNRVPLAHHRN
        DS+ NQ DSYG+ DDKGNA EF PVL       + A  L      L      +    ++  P +HHR+
Subjt:  DSSSNQGDSYGESDDKGNALEFKPVLVSAIIQLSWARMLPQTLWILFSVHWQEKKKFNNRVPLAHHRN

A0A6J1IXZ4 uncharacterized protein LOC111479542; e-value 1.8e-80; identity 64.18%
Query:  MAEALLELKQVLRSKQNGLTFEEANLLQTCVSKAVRDGRFEALFGGGVTWAAGGALGVVSARISWIPMAFKFALAGTWRLNKFIRLILSGGAAGIFGIWR
        M EAL EL+QVLRSKQN LT EEAN+LQTC SKAVRD  F  L GGGVTW                        AGTWRLNKF+RL LSGGA  +FG+ R
Subjt:  MAEALLELKQVLRSKQNGLTFEEANLLQTCVSKAVRDGRFEALFGGGVTWAAGGALGVVSARISWIPMAFKFALAGTWRLNKFIRLILSGGAAGIFGIWR

Query:  FSRSLNSCVDHILAMDGSRMQKELANIVVTKYHNDPRTMQHISKHFYCEKVFDDSTLDRPKIRLRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDDSSNR
        FSRSL+SCVDHILA+DGSRMQKELANI+VTK HNDPRTMQHISKHF+ E+VFDDSTLDRPKIR RYRNFFSDDVAHAQRAH NDPK+NLHGN H DSSNR
Subjt:  FSRSLNSCVDHILAMDGSRMQKELANIVVTKYHNDPRTMQHISKHFYCEKVFDDSTLDRPKIRLRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDDSSNR

Query:  DSSSNQGDSYGESDDKGNALEFKPVLVSAIIQLSWARMLPQTLWILFSVHWQEKKKFNNRVPLAHHRN
        DS+ NQ DSYGE DDKGNA EF PVL       + A  L      L      +    ++  P +HHR+
Subjt:  DSSSNQGDSYGESDDKGNALEFKPVLVSAIIQLSWARMLPQTLWILFSVHWQEKKKFNNRVPLAHHRN

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT1G05430.1 unknown protein; e-value 8.5e-11; identity 32%
Query:  ALLELKQVLRSK--QNGLTFEEANLLQTCVSKAVRDGRFEALFGGGVTWAAGGALGVVSARISWIPMAFKFALAGTWRLNKFIRLILSGGAAGIFGI---
        AL +L  VL SK  Q  +T EE+  + +C  KA+    F +  GGG+TW        V+ ++       + ALA             +G AA  F +   
Subjt:  ALLELKQVLRSK--QNGLTFEEANLLQTCVSKAVRDGRFEALFGGGVTWAAGGALGVVSARISWIPMAFKFALAGTWRLNKFIRLILSGGAAGIFGI---

Query:  WRFSRSLNSCVDHILAMDGSRMQKELANIVVTKYHNDPRTMQHISKHFYCEKVFDDSTLDRPKIRLRYRNFFS------DDV-AHAQRAHDNDPKNNLH-
        W  S+   S +DHIL+ D +RMQKEL N++V     +    Q +SKHFY E V+ D   D+P++R R R  F+      DDV A   + + N   N  H 
Subjt:  WRFSRSLNSCVDHILAMDGSRMQKELANIVVTKYHNDPRTMQHISKHFYCEKVFDDSTLDRPKIRLRYRNFFS------DDV-AHAQRAHDNDPKNNLH-

Query:  ---GNSHDDSSNRDSSSNQGDSYGE
           G S    + +   ++ G+S GE
Subjt:  ---GNSHDDSSNRDSSSNQGDSYGE


Sequences
CDS sequence
ATGGCCGAAGCTTTATTAGAACTTAAACAAGTTCTCAGGTCCAAACAGAACGGCTTGACGTTCGAGGAAGCGAATTTGCTCCAAACATGTGTGTCTAAGGCTGTTCGAGA
TGGTAGATTTGAAGCTCTCTTTGGAGGTGGTGTGACATGGGCAGCTGGTGGGGCCTTAGGAGTAGTTTCCGCTAGAATTTCATGGATACCAATGGCATTCAAGTTTGCCC
TGGCAGGAACATGGAGGCTGAATAAGTTCATTCGGCTAATTCTTTCTGGAGGAGCTGCTGGGATATTTGGAATATGGAGATTTAGCAGGTCCCTAAATTCATGCGTCGAT
CATATTCTTGCAATGGATGGAAGTAGAATGCAAAAGGAGTTGGCAAATATTGTAGTGACGAAATATCACAACGATCCTCGTACAATGCAGCACATATCCAAGCATTTTTA
TTGTGAGAAAGTGTTTGACGATTCAACATTGGACCGGCCAAAAATAAGGTTGCGTTATCGAAATTTCTTTAGTGATGATGTTGCTCATGCTCAGAGGGCACATGACAATG
ACCCTAAGAATAACTTGCATGGAAATTCCCACGATGACTCATCCAACCGCGACTCCAGTTCCAACCAGGGTGACTCCTATGGTGAGTCTGATGACAAAGGAAATGCACTT
GAGTTCAAGCCAGTCCTTGTAAGTGCAATAATTCAACTAAGCTGGGCACGGATGCTACCGCAGACCCTCTGGATTTTATTTTCGGTTCACTGGCAAGAGAAGAAGAAATT
CAACAATCGAGTACCTCTAGCACATCACCGAAATCTCACTCTCGTAGTAGAAGATACCACCGTCGGCATCGAAGACATAACGAGACAATGCCAACAAGCTTTGAACATGT
GTAGTTCCAGGTTCAATTCATTATAA
mRNA sequence
ATGGCCGAAGCTTTATTAGAACTTAAACAAGTTCTCAGGTCCAAACAGAACGGCTTGACGTTCGAGGAAGCGAATTTGCTCCAAACATGTGTGTCTAAGGCTGTTCGAGA
TGGTAGATTTGAAGCTCTCTTTGGAGGTGGTGTGACATGGGCAGCTGGTGGGGCCTTAGGAGTAGTTTCCGCTAGAATTTCATGGATACCAATGGCATTCAAGTTTGCCC
TGGCAGGAACATGGAGGCTGAATAAGTTCATTCGGCTAATTCTTTCTGGAGGAGCTGCTGGGATATTTGGAATATGGAGATTTAGCAGGTCCCTAAATTCATGCGTCGAT
CATATTCTTGCAATGGATGGAAGTAGAATGCAAAAGGAGTTGGCAAATATTGTAGTGACGAAATATCACAACGATCCTCGTACAATGCAGCACATATCCAAGCATTTTTA
TTGTGAGAAAGTGTTTGACGATTCAACATTGGACCGGCCAAAAATAAGGTTGCGTTATCGAAATTTCTTTAGTGATGATGTTGCTCATGCTCAGAGGGCACATGACAATG
ACCCTAAGAATAACTTGCATGGAAATTCCCACGATGACTCATCCAACCGCGACTCCAGTTCCAACCAGGGTGACTCCTATGGTGAGTCTGATGACAAAGGAAATGCACTT
GAGTTCAAGCCAGTCCTTGTAAGTGCAATAATTCAACTAAGCTGGGCACGGATGCTACCGCAGACCCTCTGGATTTTATTTTCGGTTCACTGGCAAGAGAAGAAGAAATT
CAACAATCGAGTACCTCTAGCACATCACCGAAATCTCACTCTCGTAGTAGAAGATACCACCGTCGGCATCGAAGACATAACGAGACAATGCCAACAAGCTTTGAACATGT
GTAGTTCCAGGTTCAATTCATTATAA
Protein sequence
MAEALLELKQVLRSKQNGLTFEEANLLQTCVSKAVRDGRFEALFGGGVTWAAGGALGVVSARISWIPMAFKFALAGTWRLNKFIRLILSGGAAGIFGIWRFSRSLNSCVD
HILAMDGSRMQKELANIVVTKYHNDPRTMQHISKHFYCEKVFDDSTLDRPKIRLRYRNFFSDDVAHAQRAHDNDPKNNLHGNSHDDSSNRDSSSNQGDSYGESDDKGNAL
EFKPVLVSAIIQLSWARMLPQTLWILFSVHWQEKKKFNNRVPLAHHRNLTLVVEDTTVGIEDITRQCQQALNMCSSRFNSL
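
The protein sequence above appears to be the conceptual translation of the CDS. A minimal sketch for checking this follows. It assumes the standard nuclear genetic code (the page does not say which table was used), and the `translate` helper is a hypothetical name written for illustration. Only the first 30 and last 18 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
# Assumption: the standard nuclear code applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Codons containing characters other than A/C/G/T translate to 'X'.
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


# First 30 and last 18 nucleotides of the CDS listed in this record.
cds_head = "ATGGCCGAAGCTTTATTAGAACTTAAACAA"
cds_tail = "AGGTTCAATTCATTATAA"

print(translate(cds_head))  # MAEALLELKQ
print(translate(cds_tail))  # RFNSL*
```

Both fragments agree with the start (MAEALLELKQ) and end (RFNSL) of the listed protein sequence, with the final TAA codon read as the stop.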