CuGenDBv2

Spg016725 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg016725
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Retrotransposon gag protein
Genome location: scaffold9:39220681..39225596
RNA-Seq Expression: Spg016725
Synteny: Spg016725
Gene Ontology terms:
    GO:0006508 - proteolysis (biological process)
    GO:0008233 - peptidase activity (molecular function)
InterPro domains: NA
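
The genome location in this record follows a `scaffold:start..end` pattern. As a minimal sketch, the Python below splits such a string into its parts and computes the locus span. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re

def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)

scaffold, start, end = parse_location("scaffold9:39220681..39225596")
# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end - start + 1
print(scaffold, start, end, span)
```

Under a 0-based, half-open convention the span would instead be `end - start`, so the convention should be confirmed before relying on the value.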


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0036008.1 Retrotransposon gag protein [Cucumis melo var. makuwa] | e-value: 3.0e-35 | % identity: 68.8
Query:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSD
        EEVDNS + +QRTSVFDRIKP TTR  VFQR+SMA  EEENQC +ST+ R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M + + K F E N D
Subjt:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSD

Query:  KKLHSRIPSRMKRKFSVLINTEGSL
         K+HSR+PSRMKRK SV INTEGSL
Subjt:  KKLHSRIPSRMKRKFSVLINTEGSL

KAA0044978.1 retrotransposon gag protein [Cucumis melo var. makuwa] | e-value: 3.5e-36 | % identity: 66.67
Query:  KKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMD
        K +N   SY    EEVDNS + +QRTSVFDRIKP TTR SVFQR+SMA  EEENQC  ST+ R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M 
Subjt:  KKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMD

Query:  NLEVKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL
        +L+ K F E N D K+HSR+PSRMKRK SV INTEGSL
Subjt:  NLEVKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL

KAA0050734.1 gag protease polyprotein [Cucumis melo var. makuwa] | e-value: 1.8e-35 | % identity: 68.8
Query:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSD
        EEVDNS + +QRTS+FDRIKP TTR  VFQR+SMA  EEENQC  ST+ R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N D
Subjt:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSD

Query:  KKLHSRIPSRMKRKFSVLINTEGSL
         K+HSR+PSRMKRK SV INTEGSL
Subjt:  KKLHSRIPSRMKRKFSVLINTEGSL

KAA0055462.1 retrotransposon gag protein [Cucumis melo var. makuwa] | e-value: 1.8e-35 | % identity: 68.8
Query:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSD
        EEVDNS + +QRTSVFDRIKP TTR SVFQR+SMA  EE+NQC  ST+ R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N D
Subjt:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSD

Query:  KKLHSRIPSRMKRKFSVLINTEGSL
         K+H+R+PSRMKRK SV INTEGSL
Subjt:  KKLHSRIPSRMKRKFSVLINTEGSL

TYK08944.1 retrotransposon gag protein [Cucumis melo var. makuwa] | e-value: 1.8e-35 | % identity: 68.8
Query:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSD
        EEVDNS + +QRTSVFDRIKP TTR SVFQR+SMA  EE+NQC  ST+ R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N D
Subjt:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSD

Query:  KKLHSRIPSRMKRKFSVLINTEGSL
         K+H+R+PSRMKRK SV INTEGSL
Subjt:  KKLHSRIPSRMKRKFSVLINTEGSL

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7SZJ7 Retrotransposon gag protein | e-value: 1.5e-35 | % identity: 68.8
Query:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSD
        EEVDNS + +QRTSVFDRIKP TTR  VFQR+SMA  EEENQC +ST+ R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M + + K F E N D
Subjt:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSD

Query:  KKLHSRIPSRMKRKFSVLINTEGSL
         K+HSR+PSRMKRK SV INTEGSL
Subjt:  KKLHSRIPSRMKRKFSVLINTEGSL

A0A5A7TQ06 Retrotransposon gag protein | e-value: 1.7e-36 | % identity: 66.67
Query:  KKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMD
        K +N   SY    EEVDNS + +QRTSVFDRIKP TTR SVFQR+SMA  EEENQC  ST+ R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M 
Subjt:  KKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMD

Query:  NLEVKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL
        +L+ K F E N D K+HSR+PSRMKRK SV INTEGSL
Subjt:  NLEVKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL

A0A5A7UI09 Retrotransposon gag protein | e-value: 8.5e-36 | % identity: 68.8
Query:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSD
        EEVDNS + +QRTSVFDRIKP TTR SVFQR+SMA  EE+NQC  ST+ R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N D
Subjt:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSD

Query:  KKLHSRIPSRMKRKFSVLINTEGSL
         K+H+R+PSRMKRK SV INTEGSL
Subjt:  KKLHSRIPSRMKRKFSVLINTEGSL

A0A5D3BBF9 Gag protease polyprotein | e-value: 8.5e-36 | % identity: 68.8
Query:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSD
        EEVDNS + +QRTS+FDRIKP TTR  VFQR+SMA  EEENQC  ST+ R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N D
Subjt:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSD

Query:  KKLHSRIPSRMKRKFSVLINTEGSL
         K+HSR+PSRMKRK SV INTEGSL
Subjt:  KKLHSRIPSRMKRKFSVLINTEGSL

A0A5D3CCI8 Retrotransposon gag protein | e-value: 8.5e-36 | % identity: 68.8
Query:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSD
        EEVDNS + +QRTSVFDRIKP TTR SVFQR+SMA  EE+NQC  ST+ R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N D
Subjt:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSD

Query:  KKLHSRIPSRMKRKFSVLINTEGSL
         K+H+R+PSRMKRK SV INTEGSL
Subjt:  KKLHSRIPSRMKRKFSVLINTEGSL

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGGTTCTACTGCCCATCGTTGCTTCAACGGACTGACGTTGCAAGAAGATAAAGCTTCTGTCGTTGCAGGCCAAGAAACAACCTTGCAGGGGGCATATACTAATGACAA
GTTTTTTGTCAAGTATAACCCTTTGTTTGAACCTGATTCTGACGTAGTGACTGTTATGATGACTGAGACAAAAACTATGGAAGAAAGAATGGCTGAGATGCAAGAACACA
TCAACAACTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATTGAGCAACTAAAGAGCCAAATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAGAGATCTTC
TCCAAAACTTTCCACAAAAAGAAAAAAGAGAACCTTGCAACTTCCTACTGCATCGACGTAGAAGAAGTTGACAATTCCAAGAAAAGTGAACAAAGGACTTCTGTCTTCGA
TCGCATCAAGCCTCCAACTACTCGTCCTTCAGTATTCCAAAGAATGAGTATGGCCGCGACAGAAGAAGAAAATCAATGTTCGGTGTCCACCTTCACTCGACCTTCAGCTT
TCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCAACCTTCGACATCTGTTTTTGATCGCCTCAAAGTAACAAGCGATCAACCTAAAAGAAAGATGGATAACTTGGAG
GTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGAATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATACGGAAGGTTCCTTGAAGTG
GGGGCAACACAGCGAATGGAAGTTGCTTCCTCCAAGTTCGAAGGTTTCCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGT
TCCTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGATCCTTCCCTCCAAGTTCGAAGGTTCTCACCTCGCTTCGCTGCAGTTCCCTCCTCCAAGTTCGAAGGT
TCTCACGCGTTTCACTGCAGTTCCTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCACTGAAGTTCCTTCCCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGT
TCCTTCCTCCAAGTTTGAAGCTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATT
CCTTCCCCAAGATCGAAGGTTCTCATGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCCCCAAATTTGAAGGTTCTC
ACGTGCTTCGCTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACACGCGCTTCGCTGACGCCGCTTTGCGCTGTAGTTCCTTCCCTCCAAGTTTGAAGGTTCTCACATC
GCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGTTCCTTCCCTCCAAGTTCGAAGGTGTTCTCACG
CGCTTCGTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAGTTCCTTCCC
CCGAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGTCGCTTCGCTGCGCTCATGCGCTTCGCTACAGTTCCTTCCTCCAA
GTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGATCGAAGATTCTCATGCGCTTC
GTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTC
GAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAATTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTG
CAGTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGCACTTCGCTGCAGTTCCTTCCTCCAAGTTCAA
AGGTTCTCACGCGCTTCGCTGCACTCCAGCGCTACTTCCTAAAGTCCAAAGACGTCAATTGTCCTCACGCTGCGCTGCTTCCTTCTCCAAGTTCAAGGGTCCTCATGCTA
CGCTCGGCTACATTGCTGCGCTACTTCCTAAAGTCCAAAGACGTCAATTGTCCCTGCACTCATGCTGTAAAGGGCATGGCGGCGACACAAGTCCAAGGACATGCGTGGCA
GCGACACAACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGG
AACATGTCCGTGCACTCGTGCTGAAAGGCGCGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGTGGCGGCGACACAAGTCTA
AGGAACATGTCCCAGCTCAAGGAACATGTCCGTGCACTCGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAG
GCGTGACGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCTTGCACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCTCAACT
CAAGGAACATGTCCGTGCACTCGTGCTGA
mRNA sequence
ATGGGTTCTACTGCCCATCGTTGCTTCAACGGACTGACGTTGCAAGAAGATAAAGCTTCTGTCGTTGCAGGCCAAGAAACAACCTTGCAGGGGGCATATACTAATGACAA
GTTTTTTGTCAAGTATAACCCTTTGTTTGAACCTGATTCTGACGTAGTGACTGTTATGATGACTGAGACAAAAACTATGGAAGAAAGAATGGCTGAGATGCAAGAACACA
TCAACAACTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATTGAGCAACTAAAGAGCCAAATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAGAGATCTTC
TCCAAAACTTTCCACAAAAAGAAAAAAGAGAACCTTGCAACTTCCTACTGCATCGACGTAGAAGAAGTTGACAATTCCAAGAAAAGTGAACAAAGGACTTCTGTCTTCGA
TCGCATCAAGCCTCCAACTACTCGTCCTTCAGTATTCCAAAGAATGAGTATGGCCGCGACAGAAGAAGAAAATCAATGTTCGGTGTCCACCTTCACTCGACCTTCAGCTT
TCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCAACCTTCGACATCTGTTTTTGATCGCCTCAAAGTAACAAGCGATCAACCTAAAAGAAAGATGGATAACTTGGAG
GTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGAATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATACGGAAGGTTCCTTGAAGTG
GGGGCAACACAGCGAATGGAAGTTGCTTCCTCCAAGTTCGAAGGTTTCCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGT
TCCTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGATCCTTCCCTCCAAGTTCGAAGGTTCTCACCTCGCTTCGCTGCAGTTCCCTCCTCCAAGTTCGAAGGT
TCTCACGCGTTTCACTGCAGTTCCTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCACTGAAGTTCCTTCCCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGT
TCCTTCCTCCAAGTTTGAAGCTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATT
CCTTCCCCAAGATCGAAGGTTCTCATGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCCCCAAATTTGAAGGTTCTC
ACGTGCTTCGCTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACACGCGCTTCGCTGACGCCGCTTTGCGCTGTAGTTCCTTCCCTCCAAGTTTGAAGGTTCTCACATC
GCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGTTCCTTCCCTCCAAGTTCGAAGGTGTTCTCACG
CGCTTCGTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAGTTCCTTCCC
CCGAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGTCGCTTCGCTGCGCTCATGCGCTTCGCTACAGTTCCTTCCTCCAA
GTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGATCGAAGATTCTCATGCGCTTC
GTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTC
GAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAATTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTG
CAGTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGCACTTCGCTGCAGTTCCTTCCTCCAAGTTCAA
AGGTTCTCACGCGCTTCGCTGCACTCCAGCGCTACTTCCTAAAGTCCAAAGACGTCAATTGTCCTCACGCTGCGCTGCTTCCTTCTCCAAGTTCAAGGGTCCTCATGCTA
CGCTCGGCTACATTGCTGCGCTACTTCCTAAAGTCCAAAGACGTCAATTGTCCCTGCACTCATGCTGTAAAGGGCATGGCGGCGACACAAGTCCAAGGACATGCGTGGCA
GCGACACAACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGG
AACATGTCCGTGCACTCGTGCTGAAAGGCGCGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGTGGCGGCGACACAAGTCTA
AGGAACATGTCCCAGCTCAAGGAACATGTCCGTGCACTCGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAG
GCGTGACGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCTTGCACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCTCAACT
CAAGGAACATGTCCGTGCACTCGTGCTGA
Protein sequence
MGSTAHRCFNGLTLQEDKASVVAGQETTLQGAYTNDKFFVKYNPLFEPDSDVVTVMMTETKTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQEIF
SKTFHKKKKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLE
VKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKWGQHSEWKLLPPSSKVSTRFAAVPSPQIRRFSRASLQFLPHSSKVLTRFAADPSLQVRRFSPRFAAVPSSKFEG
SHAFHCSSFLTVRRFSRASLKFLPPQVRRFSRRFAAVPSSKFEAPSSKFEGSHIASLRSFLQVRRFSRASLCNSFPKIEGSHALRAVPSSKFEGSHALRCISFPPNLKVL
TCFAAVPSSKFEGSHTRFADAALRCSSFPPSLKVLTSLRCDPSSKFEGSHALRSAIPSPKFEVPSLQVRRCSHALRAVPSPKFEGSHALRAVPLPPSSKVLTRFALQFLP
PKFEGSHALRCSSFLQVQRFSRRFAALMRFATVPSSKFEGSHIASLRSFLQVRRFSRASLCNSFPKIEDSHALRAVPSSKFEGSHALRCISFPPNSKVLTRFAAVPSSKF
EGSHALRSAIPSPKFEGSHALRAIPSSKFEGSHALRCSSFPPNSKVLTRFAAVPSPQVRRFSRTSLQFLPPSSKVLTRFAALQRYFLKSKDVNCPHAALLPSPSSRVLML
RSATLLRYFLKSKDVNCPCTHAVKGMAATQVQGHAWQRHNKSKEHVPTQGTCPCTRAERRGGDTSPRNMSQLKEHVRALVLKGAAAAQVQGTCPNSRNMSVHSCGGDTSL
RNMSQLKEHVRALAWRRHKSKEHVPTQGTCPCTRAERRDGGTSPRNMSQLKEHVLALVLKGVAATQVQGTCLNSRNMSVHSC
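
The CDS and protein sequences listed in this record can be cross-checked by translating the CDS. The sketch below is illustrative only: it assumes the standard nuclear genetic code (the page does not say which table was used), and `translate` and `CODON_TABLE` are hypothetical names, not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS string, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 24 bases of the CDS sequence listed in this record.
print(translate("ATGGGTTCTACTGCCCATCGTTGC"))
```

Passing the full CDS, with its line breaks left in, should reproduce the listed protein sequence if the assumed genetic code matches the one used for the annotation.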