; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr017673 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr017673
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionIntegral membrane HPP family protein
Genome locationtig00153054:661616..670459
RNA-Seq ExpressionSgr017673
SyntenySgr017673
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7019853.1 hypothetical protein SDJN02_18818, partial [Cucurbita argyrosperma subsp. argyrosperma]8.6e-4865.24Show/hide
Query:  MSLQLKPIHHHHHLRHRGRGHCHCQQQYQLSSNVRLQASSTPPHPNQSFMSLLPNCHLLYRKRGIAAEGSVGSL-RLFSDRRRRRSSRS----SYRNIVA
        MSLQLKPIHH        RG    QQ YQ S  V           N SF+SLLPNCHLL  KRG++ +GSV  L  L +DRRRRR+        YR+IVA
Subjt:  MSLQLKPIHHHHHLRHRGRGHCHCQQQYQLSSNVRLQASSTPPHPNQSFMSLLPNCHLLYRKRGIAAEGSVGSL-RLFSDRRRRRSSRS----SYRNIVA

Query:  SGIVGAPLSDGWKPDKGFASPPLSDILWPSAGAFAAMAILGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARVLALSLSLSLM
        SGI  AP+SDG KPDKGF SPPLSDILWPSAGAFAAMA+LGKMDQILAPKGLSMTIAPLGAVCA+LFA PSSPAARVL  SL+  L+
Subjt:  SGIVGAPLSDGWKPDKGFASPPLSDILWPSAGAFAAMAILGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARVLALSLSLSLM

XP_011651222.2 uncharacterized protein LOC105434855 [Cucumis sativus]6.6e-4863.73Show/hide
Query:  MSLQLKPIHHHHHLRHRGRGHCHCQQQYQLSSNVRLQASSTPPHPNQSFMSLLPNCHLLYRKRGIAAEGSVGSLRLFSDRRRRR---SSRSSYRNIVASG
        MSLQLKPIHHH H  H G  HCH  + YQ S    +Q  S     N SF+SLLP+CHLL  KRGI+A     SL LF+D RRRR   S R  +R+IVAS 
Subjt:  MSLQLKPIHHHHHLRHRGRGHCHCQQQYQLSSNVRLQASSTPPHPNQSFMSLLPNCHLLYRKRGIAAEGSVGSLRLFSDRRRRR---SSRSSYRNIVASG

Query:  IVGAPLSDGWKPDKGFASPPLSDILWPSAGAFAAMAILGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAAR---VLALSLSLSLMGPLLF
        I G P+SDG KP+KGF SPPLSDILWPSAGAFAAMA+LGKMDQILAPKGLSMTIAPLGAVCAVLFATPS+PAAR   +    +  + +G L F
Subjt:  IVGAPLSDGWKPDKGFASPPLSDILWPSAGAFAAMAILGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAAR---VLALSLSLSLMGPLLF

XP_022137446.1 uncharacterized protein LOC111008888 [Momordica charantia]2.0e-5273.84Show/hide
Query:  MSLQLKPIHHHHHLRHRGRGHCHCQQQY-QLSSNVRLQASSTPPHPNQSFMSLLPNCHLLYRKRGIAAEGSVGSLRLFSDRRRRRSSRSSYRNIVASGIV
        MSLQLKPI  HHHLRHRGR H H QQQY Q SSNVRLQASS  P PNQSF+SLLPN HL    RG         +RLF DRRR    RS +R I ASGIV
Subjt:  MSLQLKPIHHHHHLRHRGRGHCHCQQQY-QLSSNVRLQASSTPPHPNQSFMSLLPNCHLLYRKRGIAAEGSVGSLRLFSDRRRRRSSRSSYRNIVASGIV

Query:  GAPLSDGWKPDKGFASPPLSDILWPSAGAFAAMAILGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAAR
        G  +SDG KP+KG ASP LSDILWPSAGAFAAMA+LGKMDQILA KGLSMTIAPLGAVCAVLFATPS+PAAR
Subjt:  GAPLSDGWKPDKGFASPPLSDILWPSAGAFAAMAILGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAAR

XP_023519271.1 uncharacterized protein LOC111782708 isoform X1 [Cucurbita pepo subsp. pepo]2.5e-4767.61Show/hide
Query:  MSLQLKPIHHHHHLRHRGRGHCHCQQQYQLSSNVRLQASSTPPHPNQSFMSLLPNCHLLYRKRGIAAEGSVGSL-RLFSDRRRRRSSRS----SYRNIVA
        MSLQLKPIHH        RG    QQ YQ S  V           N SF+SLLPNCHLL  KRG++ +GSV  L  L +DRRRRR+       SYR+IVA
Subjt:  MSLQLKPIHHHHHLRHRGRGHCHCQQQYQLSSNVRLQASSTPPHPNQSFMSLLPNCHLLYRKRGIAAEGSVGSL-RLFSDRRRRRSSRS----SYRNIVA

Query:  SGIVGAPLSDGWKPDKGFASPPLSDILWPSAGAFAAMAILGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAAR
        SGI GAP+SDG KPDKGF SPPLSDILWPSAGAFAAMA+LGKMDQILAPKGLSMTIAPLGAVCA+LFA PSSPAAR
Subjt:  SGIVGAPLSDGWKPDKGFASPPLSDILWPSAGAFAAMAILGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAAR

XP_038894638.1 uncharacterized protein LOC120083135 [Benincasa hispida]1.2e-5265.28Show/hide
Query:  MSLQLKPIHHHHHLRHRGRGHCHCQQQYQLSSNVRLQASSTPPHPNQSFMSLLPNCHLLYRKRGIAAEGSVGSLRLFSDRRRRR---SSRSSYRNIVASG
        MSLQLKPIHHH H  H GR HCH Q+ YQ S  V++QA S   H N SF+SLLPNCHLL   RG+       SL LF++RR+RR     R  +R IVASG
Subjt:  MSLQLKPIHHHHHLRHRGRGHCHCQQQYQLSSNVRLQASSTPPHPNQSFMSLLPNCHLLYRKRGIAAEGSVGSLRLFSDRRRRR---SSRSSYRNIVASG

Query:  IVGAPLSDGWKPDKGFASPPLSDILWPSAGAFAAMAILGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAAR---VLALSLSLSLMGPLLF
        I G P+SDG K +KGF SPPLSDILWPSAGAFAAMA+LGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAAR   V    +  + +G L F
Subjt:  IVGAPLSDGWKPDKGFASPPLSDILWPSAGAFAAMAILGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAAR---VLALSLSLSLMGPLLF

TrEMBL top hitse value%identityAlignment
A0A0A0LR37 Uncharacterized protein4.6e-4763.21Show/hide
Query:  MSLQLKPIHHHHHLRHRGRGHCHCQQQYQLSSNVRLQASSTPPHPNQSFMSLLPNCHLLYRKRGIAAEGSVGSLRLFSDRRRRR---SSRSSYRNIVASG
        MSLQLKPIHHH H  H G   CH  + YQ S    +Q  S     N SF+SLLP+CHLL  KRGI+A     SL LF+D RRRR   S R  +R+IVAS 
Subjt:  MSLQLKPIHHHHHLRHRGRGHCHCQQQYQLSSNVRLQASSTPPHPNQSFMSLLPNCHLLYRKRGIAAEGSVGSLRLFSDRRRRR---SSRSSYRNIVASG

Query:  IVGAPLSDGWKPDKGFASPPLSDILWPSAGAFAAMAILGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAAR---VLALSLSLSLMGPLLF
        I G P+SDG KP+KGF SPPLSDILWPSAGAFAAMA+LGKMDQILAPKGLSMTIAPLGAVCAVLFATPS+PAAR   +    +  + +G L F
Subjt:  IVGAPLSDGWKPDKGFASPPLSDILWPSAGAFAAMAILGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAAR---VLALSLSLSLMGPLLF

A0A1S3AUM8 uncharacterized protein LOC1034829851.0e-4663.73Show/hide
Query:  MSLQLKPIHHHHHLRHRGRGHCHCQQQYQLSSNVRLQASSTPPHPNQSFMSLLPNCHLLYRKRGIAAEGSVGSLRLFSDRRRRRSSRS---SYRNIVASG
        MSLQLKPIHHH H  H G  HCH  + YQ S   ++QA S     N S +SLLP CHLL  KRGI     V SL LF+D RRRRS  S    +R+IVAS 
Subjt:  MSLQLKPIHHHHHLRHRGRGHCHCQQQYQLSSNVRLQASSTPPHPNQSFMSLLPNCHLLYRKRGIAAEGSVGSLRLFSDRRRRRSSRS---SYRNIVASG

Query:  IVGAPLSDGWKPDKGFASPPLSDILWPSAGAFAAMAILGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAAR---VLALSLSLSLMGPLLF
        I G P+SDG KP+KGF SPPLSDILWPSAGAFAAMA+LGKMDQILAPKGLSMTIAPLGAVCAVLFATPS+PAAR   +    +  + +G L F
Subjt:  IVGAPLSDGWKPDKGFASPPLSDILWPSAGAFAAMAILGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAAR---VLALSLSLSLMGPLLF

A0A6J1C6M6 uncharacterized protein LOC1110088889.6e-5373.84Show/hide
Query:  MSLQLKPIHHHHHLRHRGRGHCHCQQQY-QLSSNVRLQASSTPPHPNQSFMSLLPNCHLLYRKRGIAAEGSVGSLRLFSDRRRRRSSRSSYRNIVASGIV
        MSLQLKPI  HHHLRHRGR H H QQQY Q SSNVRLQASS  P PNQSF+SLLPN HL    RG         +RLF DRRR    RS +R I ASGIV
Subjt:  MSLQLKPIHHHHHLRHRGRGHCHCQQQY-QLSSNVRLQASSTPPHPNQSFMSLLPNCHLLYRKRGIAAEGSVGSLRLFSDRRRRRSSRSSYRNIVASGIV

Query:  GAPLSDGWKPDKGFASPPLSDILWPSAGAFAAMAILGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAAR
        G  +SDG KP+KG ASP LSDILWPSAGAFAAMA+LGKMDQILA KGLSMTIAPLGAVCAVLFATPS+PAAR
Subjt:  GAPLSDGWKPDKGFASPPLSDILWPSAGAFAAMAILGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAAR

A0A6J1E7R0 uncharacterized protein LOC111431576 isoform X13.0e-4666.48Show/hide
Query:  MSLQLKPIHHHHHLRHRGRGHCHCQQQYQLSSNVRLQASSTPPHPNQSFMSLLPNCHLLYRKRGIAAEGSVGSL-RLFSDRRRRRSSRS----SYRNIVA
        MSLQLKPIHH        RG    QQ YQ S  V           N SF+SLLPNCHLL  KRG++ +GSV  L  L +DRRRRR+        YR+IVA
Subjt:  MSLQLKPIHHHHHLRHRGRGHCHCQQQYQLSSNVRLQASSTPPHPNQSFMSLLPNCHLLYRKRGIAAEGSVGSL-RLFSDRRRRRSSRS----SYRNIVA

Query:  SGIVGAPLSDGWKPDKGFASPPLSDILWPSAGAFAAMAILGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAAR
        SGI  AP+SDG KPDKGF SPPLSDILWPSAGAFAAMA+LGKMDQILAPKGLSMTIAPLGAVCA+LFA PSSPAAR
Subjt:  SGIVGAPLSDGWKPDKGFASPPLSDILWPSAGAFAAMAILGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAAR

A0A6J1EB70 uncharacterized protein LOC111431576 isoform X21.1e-4561.86Show/hide
Query:  MSLQLKPIHHHHHLRHRGRGHCHCQQQYQLSSNVRLQASSTPPHPNQSFMSLLPNCHLLYRKRGIAAEGSVGSL-RLFSDRRRRRSSRS----SYRNIVA
        MSLQLKPIHH        RG    QQ YQ S  V           N SF+SLLPNCHLL  KRG++ +GSV  L  L +DRRRRR+        YR+IVA
Subjt:  MSLQLKPIHHHHHLRHRGRGHCHCQQQYQLSSNVRLQASSTPPHPNQSFMSLLPNCHLLYRKRGIAAEGSVGSL-RLFSDRRRRRSSRS----SYRNIVA

Query:  SGIVGAPLSDGWKPDKGFASPPLSDILWPSAGAFAAMAILGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAAR--VLALSLSLSLMGPLLF
        SGI  AP+SDG KPDKGF SPPLSDILWPSAGAFAAMA+LGKMDQILAPKGLSMTIAPLGAVCA+LFA PSSPAAR  +    +  + +G L F
Subjt:  SGIVGAPLSDGWKPDKGFASPPLSDILWPSAGAFAAMAILGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAAR--VLALSLSLSLMGPLLF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G47980.1 Integral membrane HPP family protein5.1e-2243.48Show/hide
Query:  SLRLFSDRRRRRSSRSSYRNIVASGIVGAPLSDGWKPDKGFASPPLSDILWPSAGAFAAMAILGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAAR-
        SLR  S+RRR   S      + +S    A   + WKP+K   +P LSD++WP+AGAFAAMAI+G++DQ+L PKG+SM++APLGAV A+LF TPS+PAAR 
Subjt:  SLRLFSDRRRRRSSRSSYRNIVASGIVGAPLSDGWKPDKGFASPPLSDILWPSAGAFAAMAILGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAAR-

Query:  --VLALSLSLSLMGPLLFNQKVHVVFAEVQYVHGPDWV
          +    +  + +G L F+              GP W+
Subjt:  --VLALSLSLSLMGPLLFNQKVHVVFAEVQYVHGPDWV

AT5G62720.1 Integral membrane HPP family protein8.7e-2246.67Show/hide
Query:  IVASGIVGAPLSDGWKPDKGFASPP--LSDILWPSAGAFAAMAILGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAAR---VLALSLSLSLMGPLLF
        + ++G + AP  D WKPDK  A+    LSD++WP+AGAFAAMA+LG+MDQ+L+PKG+SM++APLGAV A+LF TPS+PAAR   +    +  + +G + F
Subjt:  IVASGIVGAPLSDGWKPDKGFASPP--LSDILWPSAGAFAAMAILGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAAR---VLALSLSLSLMGPLLF

Query:  NQKVHVVFAEVQYVHGPDWV
        +            V GP W+
Subjt:  NQKVHVVFAEVQYVHGPDWV

AT5G62720.2 Integral membrane HPP family protein8.7e-2246.67Show/hide
Query:  IVASGIVGAPLSDGWKPDKGFASPP--LSDILWPSAGAFAAMAILGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAAR---VLALSLSLSLMGPLLF
        + ++G + AP  D WKPDK  A+    LSD++WP+AGAFAAMA+LG+MDQ+L+PKG+SM++APLGAV A+LF TPS+PAAR   +    +  + +G + F
Subjt:  IVASGIVGAPLSDGWKPDKGFASPP--LSDILWPSAGAFAAMAILGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAAR---VLALSLSLSLMGPLLF

Query:  NQKVHVVFAEVQYVHGPDWV
        +            V GP W+
Subjt:  NQKVHVVFAEVQYVHGPDWV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCCTGCAACTGAAGCCAATTCACCACCACCACCACCTCCGCCACCGTGGTCGCGGCCATTGCCACTGTCAGCAGCAATATCAACTCAGTTCCAATGTACGATTACA
GGCTTCATCGACACCGCCGCACCCGAACCAATCATTCATGTCGTTGCTGCCGAATTGCCATTTATTGTATAGAAAACGAGGGATTGCGGCAGAAGGGTCCGTCGGATCGC
TGAGATTATTCAGCGATCGAAGGAGAAGACGAAGTAGCAGAAGCAGTTACCGGAATATTGTGGCGTCTGGCATTGTTGGTGCGCCGCTTTCAGATGGCTGGAAACCAGAC
AAAGGCTTTGCTTCTCCTCCCCTCAGTGACATCCTTTGGCCTTCAGCAGGGGCATTTGCAGCAATGGCAATACTGGGGAAAATGGATCAGATTCTAGCGCCTAAGGGCCT
CTCCATGACAATTGCTCCATTGGGCGCTGTCTGTGCTGTCCTCTTCGCCACTCCCTCCTCCCCTGCAGCTCGAGTACTCGCCCTCTCTCTCTCTCTCTCTCTTATGGGTC
CGTTGCTTTTTAACCAAAAAGTTCATGTTGTGTTTGCAGAAGTACAATATGTTCATGGCCCAGATTGGGTGTGCGGCAATTGGTGTATTGGCGTTTACTTTGTTTGGGCC
AGGATGGCTAGCTCGGAGCTCTGCTCTTGCTGCATCCATGGCGTTTATGATCTACACTGGTTCGACGCACCCACCTGCTGCAAGCTTGCCGATTCTGTTCATCGATGGAG
CCAAGTTGCAGCATCTGAATTTCTGGTACGCTCTGTTTCCAGGTGCCGCTGGCTGCATTCTCCTTTGCTTCATAGTGTCAAGCATGTCCACGTCAGTGGGCCCCCGCACG
ATGGAGATTCCAAGACGAGGCTTTGGGGTTTTTTTTTTTTGCCTAGTGCAGGGGCTGTCCCCAATTTGATGGTTTGTGCCCATCATTGCACACGTGGCTCTAATCTATCG
AAGCCCCTGACTTCCGGGCTTTCGTGGTCGGTCGATCTTAACCGGCTGACCCCGGAAGATGACGGTACGGGAAAGCAAAACACAGAAAAGGACTCACTAAGACCAGGACC
AAAGGGCATTGTCTTCAATGGGGAATACAAAATTACAGGTACTAGTGGGCGGGTCAGAGTAATCGACATCTTCGAGCTTTGGGAAGATCCAGCTTTGAAGCTTTGGAGAC
TTCATCTTCCAAATGTGAATCAGCAACCAGATTATGTTGTGGTTCTTCAGCAAATTTGTCAACTTCAGACAGCACAGAGTTTCCCCTTTCTTTTTCTTTGTGTAGCAGTT
TGTCTGTTAGGAGAAGAATCTGTTTTCATAGTCAACCTTGAGGCTTCCATAGCTGGATTGAAGAGCCTCATAATCCTTCTCCAGCTGTTTGGTTTTCCACCGTGCCCGAC
GATTCTGAAACCATATAGCAACCTGACGAGGCTGCAATCCGAGTCTTCATCCACATTATCTTCACTATCGAACGGGCAAAAAAACGATCTATTCGTTCCATTTCCTCCAC
GGATATCTTCGAAACTCATCATGGATCTTGAACCTACACCAGAACAATCTGTTATGAAGATCAAGTTGGGCATGGAGATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGCCTGCAACTGAAGCCAATTCACCACCACCACCACCTCCGCCACCGTGGTCGCGGCCATTGCCACTGTCAGCAGCAATATCAACTCAGTTCCAATGTACGATTACA
GGCTTCATCGACACCGCCGCACCCGAACCAATCATTCATGTCGTTGCTGCCGAATTGCCATTTATTGTATAGAAAACGAGGGATTGCGGCAGAAGGGTCCGTCGGATCGC
TGAGATTATTCAGCGATCGAAGGAGAAGACGAAGTAGCAGAAGCAGTTACCGGAATATTGTGGCGTCTGGCATTGTTGGTGCGCCGCTTTCAGATGGCTGGAAACCAGAC
AAAGGCTTTGCTTCTCCTCCCCTCAGTGACATCCTTTGGCCTTCAGCAGGGGCATTTGCAGCAATGGCAATACTGGGGAAAATGGATCAGATTCTAGCGCCTAAGGGCCT
CTCCATGACAATTGCTCCATTGGGCGCTGTCTGTGCTGTCCTCTTCGCCACTCCCTCCTCCCCTGCAGCTCGAGTACTCGCCCTCTCTCTCTCTCTCTCTCTTATGGGTC
CGTTGCTTTTTAACCAAAAAGTTCATGTTGTGTTTGCAGAAGTACAATATGTTCATGGCCCAGATTGGGTGTGCGGCAATTGGTGTATTGGCGTTTACTTTGTTTGGGCC
AGGATGGCTAGCTCGGAGCTCTGCTCTTGCTGCATCCATGGCGTTTATGATCTACACTGGTTCGACGCACCCACCTGCTGCAAGCTTGCCGATTCTGTTCATCGATGGAG
CCAAGTTGCAGCATCTGAATTTCTGGTACGCTCTGTTTCCAGGTGCCGCTGGCTGCATTCTCCTTTGCTTCATAGTGTCAAGCATGTCCACGTCAGTGGGCCCCCGCACG
ATGGAGATTCCAAGACGAGGCTTTGGGGTTTTTTTTTTTTGCCTAGTGCAGGGGCTGTCCCCAATTTGATGGTTTGTGCCCATCATTGCACACGTGGCTCTAATCTATCG
AAGCCCCTGACTTCCGGGCTTTCGTGGTCGGTCGATCTTAACCGGCTGACCCCGGAAGATGACGGTACGGGAAAGCAAAACACAGAAAAGGACTCACTAAGACCAGGACC
AAAGGGCATTGTCTTCAATGGGGAATACAAAATTACAGGTACTAGTGGGCGGGTCAGAGTAATCGACATCTTCGAGCTTTGGGAAGATCCAGCTTTGAAGCTTTGGAGAC
TTCATCTTCCAAATGTGAATCAGCAACCAGATTATGTTGTGGTTCTTCAGCAAATTTGTCAACTTCAGACAGCACAGAGTTTCCCCTTTCTTTTTCTTTGTGTAGCAGTT
TGTCTGTTAGGAGAAGAATCTGTTTTCATAGTCAACCTTGAGGCTTCCATAGCTGGATTGAAGAGCCTCATAATCCTTCTCCAGCTGTTTGGTTTTCCACCGTGCCCGAC
GATTCTGAAACCATATAGCAACCTGACGAGGCTGCAATCCGAGTCTTCATCCACATTATCTTCACTATCGAACGGGCAAAAAAACGATCTATTCGTTCCATTTCCTCCAC
GGATATCTTCGAAACTCATCATGGATCTTGAACCTACACCAGAACAATCTGTTATGAAGATCAAGTTGGGCATGGAGATTTAA
Protein sequenceShow/hide protein sequence
MSLQLKPIHHHHHLRHRGRGHCHCQQQYQLSSNVRLQASSTPPHPNQSFMSLLPNCHLLYRKRGIAAEGSVGSLRLFSDRRRRRSSRSSYRNIVASGIVGAPLSDGWKPD
KGFASPPLSDILWPSAGAFAAMAILGKMDQILAPKGLSMTIAPLGAVCAVLFATPSSPAARVLALSLSLSLMGPLLFNQKVHVVFAEVQYVHGPDWVCGNWCIGVYFVWA
RMASSELCSCCIHGVYDLHWFDAPTCCKLADSVHRWSQVAASEFLVRSVSRCRWLHSPLLHSVKHVHVSGPPHDGDSKTRLWGFFFLPSAGAVPNLMVCAHHCTRGSNLS
KPLTSGLSWSVDLNRLTPEDDGTGKQNTEKDSLRPGPKGIVFNGEYKITGTSGRVRVIDIFELWEDPALKLWRLHLPNVNQQPDYVVVLQQICQLQTAQSFPFLFLCVAV
CLLGEESVFIVNLEASIAGLKSLIILLQLFGFPPCPTILKPYSNLTRLQSESSSTLSSLSNGQKNDLFVPFPPRISSKLIMDLEPTPEQSVMKIKLGMEI