; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0023088 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0023088
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr7:43956287..43961616
RNA-Seq ExpressionLag0023088
SyntenyLag0023088
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036008.1 Retrotransposon gag protein [Cucumis melo var. makuwa]1.9e-3267.72Show/hide
Query:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSD
        EEVDNS + +QRTSVFDRIKP TTR  VF R+SMA  EEENQC MST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M + + K F E N D
Subjt:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSD

Query:  KKLHSSISSRMKRKFSVLINTEGSLKV
         K+HS + SRMKRK SV INTEGSL V
Subjt:  KKLHSSISSRMKRKFSVLINTEGSLKV

KAA0044978.1 retrotransposon gag protein [Cucumis melo var. makuwa]6.4e-3368.5Show/hide
Query:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSD
        EEVDNS + +QRTSVFDRIKP TTR SVF R+SMA  EEENQC  ST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M +L+ K F E N D
Subjt:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSD

Query:  KKLHSSISSRMKRKFSVLINTEGSLKV
         K+HS + SRMKRK SV INTEGSL V
Subjt:  KKLHSSISSRMKRKFSVLINTEGSLKV

KAA0050736.1 retrotransposon gag protein [Cucumis melo var. makuwa]3.2e-3266.93Show/hide
Query:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSD
        EEVDNS + +QRTSVFDRIKP TTR SVF R+SMA  EEENQC MST TR SAF+RLS+S SKK R STS FDRLK+TNDQ +R+M +L+ K F E N D
Subjt:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSD

Query:  KKLHSSISSRMKRKFSVLINTEGSLKV
         K++S + SR+KRK S+ INTEGSL V
Subjt:  KKLHSSISSRMKRKFSVLINTEGSLKV

KAA0055462.1 retrotransposon gag protein [Cucumis melo var. makuwa]4.2e-3266.93Show/hide
Query:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSD
        EEVDNS + +QRTSVFDRIKP TTR SVF R+SMA  EE+NQC  ST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M +L+ K F E N D
Subjt:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSD

Query:  KKLHSSISSRMKRKFSVLINTEGSLKV
         K+H+ + SRMKRK SV INTEGSL V
Subjt:  KKLHSSISSRMKRKFSVLINTEGSLKV

TYK08944.1 retrotransposon gag protein [Cucumis melo var. makuwa]4.2e-3266.93Show/hide
Query:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSD
        EEVDNS + +QRTSVFDRIKP TTR SVF R+SMA  EE+NQC  ST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M +L+ K F E N D
Subjt:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSD

Query:  KKLHSSISSRMKRKFSVLINTEGSLKV
         K+H+ + SRMKRK SV INTEGSL V
Subjt:  KKLHSSISSRMKRKFSVLINTEGSLKV

TrEMBL top hitse value%identityAlignment
A0A5A7SMQ5 Retrotransposon gag protein2.0e-3267.72Show/hide
Query:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSD
        EEVDNS + +QRTSVFDRIKP TTR S F R+SMA  EEENQC  ST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M +L+ K F E N D
Subjt:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSD

Query:  KKLHSSISSRMKRKFSVLINTEGSLKV
         K+ S +SSRMKRK SV INTEGSL V
Subjt:  KKLHSSISSRMKRKFSVLINTEGSLKV

A0A5A7SZJ7 Retrotransposon gag protein9.1e-3367.72Show/hide
Query:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSD
        EEVDNS + +QRTSVFDRIKP TTR  VF R+SMA  EEENQC MST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M + + K F E N D
Subjt:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSD

Query:  KKLHSSISSRMKRKFSVLINTEGSLKV
         K+HS + SRMKRK SV INTEGSL V
Subjt:  KKLHSSISSRMKRKFSVLINTEGSLKV

A0A5A7TQ06 Retrotransposon gag protein3.1e-3368.5Show/hide
Query:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSD
        EEVDNS + +QRTSVFDRIKP TTR SVF R+SMA  EEENQC  ST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M +L+ K F E N D
Subjt:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSD

Query:  KKLHSSISSRMKRKFSVLINTEGSLKV
         K+HS + SRMKRK SV INTEGSL V
Subjt:  KKLHSSISSRMKRKFSVLINTEGSLKV

A0A5A7U974 Retrotransposon gag protein1.5e-3266.93Show/hide
Query:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSD
        EEVDNS + +QRTSVFDRIKP TTR SVF R+SMA  EEENQC MST TR SAF+RLS+S SKK R STS FDRLK+TNDQ +R+M +L+ K F E N D
Subjt:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSD

Query:  KKLHSSISSRMKRKFSVLINTEGSLKV
         K++S + SR+KRK S+ INTEGSL V
Subjt:  KKLHSSISSRMKRKFSVLINTEGSLKV

A0A5D3BBF9 Gag protease polyprotein2.0e-3266.93Show/hide
Query:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSD
        EEVDNS + +QRTS+FDRIKP TTR  VF R+SMA  EEENQC  ST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M +L+ K F E N D
Subjt:  EEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSD

Query:  KKLHSSISSRMKRKFSVLINTEGSLKV
         K+HS + SRMKRK SV INTEGSL V
Subjt:  KKLHSSISSRMKRKFSVLINTEGSLKV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGAATAAATCGTTCTCCAAAAATTTCCACAAAAAGGAAAAAAAGAACCTTGCAACTTCCTACTGCATCAACGTAGAAGAAGTTGACAATTCTAAGAAGAGTGAACA
AAGGACTTCCGTCTTTGATCGCATCAAGCCTCCAACTACTCGTCCTTCGGTATTCCATAGAATGAGTATGGCCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCT
CCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTCAACATCTGTCTTTGATCGTCTCAAAGTAACAAACGATCAACCTAAAAGA
AAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGCATCTCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATAC
GGAAGGTTCCTTGAAGGTTCCCACATTGCGCTGTTGTGCTGCTTCCTTCTCCAAGTTCGAAGGTTCTGACGCTGCGCTGCTACCTTCCTCCAAGTTCGAAGGTTTTCATG
CGCTTTGTTGCAGTTCCTTCTCTCCAAACTCGAAGTTCCTTCCTCCAAGTTCGAAGGTTTTCATGCGCTTTGTTGCAGTTCCTTCTCTCCAAGTTCGAAGGTGTTCTCGC
GCACTTTGCTGCCGTTCCTTCCTCTCAAATTCGAAGGTTCTCTCACGCGCTTCGTTGCAGTTCCTTCCTCCCAAATTTGAAGGTTCTCACGACGCTCCGCTGCAGTTCCT
TCTCTCTCCAAATTCGAAGGTTCTCACGCGCTCCGCTGCAGTTCCTTCGCTTCCGCTGCAGTTCATTCTCTCTCCAAATTCGAAGGTTCTCACGCGCTCCGCTGCATTCC
TTCGCTTTCGCTGCAATTCCTTCTCTCCGAGTTCGAAGGTTCTCACGACGTTTCGTTGCAGTTCCTTCCTCCCAAATTCGAAGGTTCTCACGACGCTCCGCTATAGTTCC
TTCTCTCCAAATTTGAAGGTGTTCTCACGCGCGCCGCTGCAGTTCCATCTCTCCAAGTTCGAAGGTGTTCTCGCGCGCTTCGCTGCAGTTCCTTCCTCCCAAATTCGAAG
GTTCTCTCACGCGCTTCGTTACAGTTCCTTCTCTCCAAGTATGAAGGTTCTCTCCTCCAAGTCGAAGGTTCTCACGTTGCTTCACTGCAGTTCCTTCCTCCAAGTTCGAA
GGTTCTCACGTTGCTTCGTCGTAGTTCCTTCTCTCCAAGTACGAAGGTTCTCTCCTCCAAGTCTGAAGGTGCTCACGTGCTTCGGTAAAGTTCCTTCCTCCCAAGTTCGA
AGTTTCTTCTCCCTAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCCTAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAATTCGA
AAGTTTCAAAGGCCCTCACGCGCTGCGCTTCGTTGCAGTTCCTTCTTCCAAGTTCGAAGGTTCTCATGCGTTTCGATGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCAC
GCGCTGCTGCAGTTCCTGCCTCCAAGTTTGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACTTCTCCAAGTTCGAA
GGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCTACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCTGTTGCTACCTTCCTCCAAGTTCGAA
GTTCCTTCCTCCAAGTTTGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACTTCTCCAAGTTCGAAGGCGCTTCTCT
CCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGTGCTTCTCTCCACCCCTCTTTTTGAAGGTTCGCCACTGAGGTTCTCC
TTCTCCAAGTTCGAAGGTTCACCGTTGCTCCTTTTCAAATGTTTGGCGGAGGTTGACGTCCTCGTTCCGCTTCATCTTCAAATGTTGGTAGTTGACGGCGTCTGCTGCGC
TTCATCTTCAAATGTTGGCAGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAGTCACTGCAATTGAATCTGATGACGACCGTTGAAGGCGAGTCGGGTC
TGGTGACCACCCCTGCAGGTTACTCAAATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCATGAGGCGAATCTGGTGACTACCCCTGCAGGTTACTCAGATCAC
CCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCATGAAGGCGAATCTGGTGACTACCCCTGCAGGTTACTCAGATCACCCAATGAAATAGGGGACTGGTCTAGCAGGAG
TGATATCACTGCAAGCGAATTTGGGGGGTTCACCACCATTTCAAGGGGTCAGAATTTTGAAGCTCAAAGCCAGAGTCAGAGAATTCAGATAACTTCATCAAGATTGAAGA
CTGAAGACTCCTTCAAGATTGGAAGACTTCAAGCTCCAAGAGATCAACAAGCCAACCGACGATCAAGAAGATCACAAGTCAGCAGCGATCATCCAAGAAGATCAACAAGC
CAACCGATCGAACAGATCATCAAGCTAACCGACCATCAAGAAGATCAACAAGTCAGCAGGCTGATCATCCAAGAGGATCAACAAGCTAACAAGCCGATCCAACAAATCAT
CAAGCGACAGGCAGATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAGAAGATCAACAAGTCAGCAGGCCGATCATC
CAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACAGGCCGATCATCCAAGAGGATCAACAAGC
TAA
mRNA sequenceShow/hide mRNA sequence
ATGTTGAATAAATCGTTCTCCAAAAATTTCCACAAAAAGGAAAAAAAGAACCTTGCAACTTCCTACTGCATCAACGTAGAAGAAGTTGACAATTCTAAGAAGAGTGAACA
AAGGACTTCCGTCTTTGATCGCATCAAGCCTCCAACTACTCGTCCTTCGGTATTCCATAGAATGAGTATGGCCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCT
CCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTCAACATCTGTCTTTGATCGTCTCAAAGTAACAAACGATCAACCTAAAAGA
AAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGCATCTCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATAC
GGAAGGTTCCTTGAAGGTTCCCACATTGCGCTGTTGTGCTGCTTCCTTCTCCAAGTTCGAAGGTTCTGACGCTGCGCTGCTACCTTCCTCCAAGTTCGAAGGTTTTCATG
CGCTTTGTTGCAGTTCCTTCTCTCCAAACTCGAAGTTCCTTCCTCCAAGTTCGAAGGTTTTCATGCGCTTTGTTGCAGTTCCTTCTCTCCAAGTTCGAAGGTGTTCTCGC
GCACTTTGCTGCCGTTCCTTCCTCTCAAATTCGAAGGTTCTCTCACGCGCTTCGTTGCAGTTCCTTCCTCCCAAATTTGAAGGTTCTCACGACGCTCCGCTGCAGTTCCT
TCTCTCTCCAAATTCGAAGGTTCTCACGCGCTCCGCTGCAGTTCCTTCGCTTCCGCTGCAGTTCATTCTCTCTCCAAATTCGAAGGTTCTCACGCGCTCCGCTGCATTCC
TTCGCTTTCGCTGCAATTCCTTCTCTCCGAGTTCGAAGGTTCTCACGACGTTTCGTTGCAGTTCCTTCCTCCCAAATTCGAAGGTTCTCACGACGCTCCGCTATAGTTCC
TTCTCTCCAAATTTGAAGGTGTTCTCACGCGCGCCGCTGCAGTTCCATCTCTCCAAGTTCGAAGGTGTTCTCGCGCGCTTCGCTGCAGTTCCTTCCTCCCAAATTCGAAG
GTTCTCTCACGCGCTTCGTTACAGTTCCTTCTCTCCAAGTATGAAGGTTCTCTCCTCCAAGTCGAAGGTTCTCACGTTGCTTCACTGCAGTTCCTTCCTCCAAGTTCGAA
GGTTCTCACGTTGCTTCGTCGTAGTTCCTTCTCTCCAAGTACGAAGGTTCTCTCCTCCAAGTCTGAAGGTGCTCACGTGCTTCGGTAAAGTTCCTTCCTCCCAAGTTCGA
AGTTTCTTCTCCCTAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCCTAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAATTCGA
AAGTTTCAAAGGCCCTCACGCGCTGCGCTTCGTTGCAGTTCCTTCTTCCAAGTTCGAAGGTTCTCATGCGTTTCGATGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCAC
GCGCTGCTGCAGTTCCTGCCTCCAAGTTTGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACTTCTCCAAGTTCGAA
GGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCTACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCTGTTGCTACCTTCCTCCAAGTTCGAA
GTTCCTTCCTCCAAGTTTGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACTTCTCCAAGTTCGAAGGCGCTTCTCT
CCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGTGCTTCTCTCCACCCCTCTTTTTGAAGGTTCGCCACTGAGGTTCTCC
TTCTCCAAGTTCGAAGGTTCACCGTTGCTCCTTTTCAAATGTTTGGCGGAGGTTGACGTCCTCGTTCCGCTTCATCTTCAAATGTTGGTAGTTGACGGCGTCTGCTGCGC
TTCATCTTCAAATGTTGGCAGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAGTCACTGCAATTGAATCTGATGACGACCGTTGAAGGCGAGTCGGGTC
TGGTGACCACCCCTGCAGGTTACTCAAATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCATGAGGCGAATCTGGTGACTACCCCTGCAGGTTACTCAGATCAC
CCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCATGAAGGCGAATCTGGTGACTACCCCTGCAGGTTACTCAGATCACCCAATGAAATAGGGGACTGGTCTAGCAGGAG
TGATATCACTGCAAGCGAATTTGGGGGGTTCACCACCATTTCAAGGGGTCAGAATTTTGAAGCTCAAAGCCAGAGTCAGAGAATTCAGATAACTTCATCAAGATTGAAGA
CTGAAGACTCCTTCAAGATTGGAAGACTTCAAGCTCCAAGAGATCAACAAGCCAACCGACGATCAAGAAGATCACAAGTCAGCAGCGATCATCCAAGAAGATCAACAAGC
CAACCGATCGAACAGATCATCAAGCTAACCGACCATCAAGAAGATCAACAAGTCAGCAGGCTGATCATCCAAGAGGATCAACAAGCTAACAAGCCGATCCAACAAATCAT
CAAGCGACAGGCAGATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAGAAGATCAACAAGTCAGCAGGCCGATCATC
CAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACAGGCCGATCATCCAAGAGGATCAACAAGC
TAA
Protein sequenceShow/hide protein sequence
MLNKSFSKNFHKKEKKNLATSYCINVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFHRMSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKR
KMNNLELKLFDEVNSDKKLHSSISSRMKRKFSVLINTEGSLKVPTLRCCAASFSKFEGSDAALLPSSKFEGFHALCCSSFSPNSKFLPPSSKVFMRFVAVPSLQVRRCSR
ALCCRSFLSNSKVLSRASLQFLPPKFEGSHDAPLQFLLSPNSKVLTRSAAVPSLPLQFILSPNSKVLTRSAAFLRFRCNSFSPSSKVLTTFRCSSFLPNSKVLTTLRYSS
FSPNLKVFSRAPLQFHLSKFEGVLARFAAVPSSQIRRFSHALRYSSFSPSMKVLSSKSKVLTLLHCSSFLQVRRFSRCFVVVPSLQVRRFSPPSLKVLTCFGKVPSSQVR
SFFSLSSKVLTRFAAVPSSLSSKVLTRFAAVPSPKFESFKGPHALRFVAVPSSKFEGSHAFRCYLPPSSKVLSRAAAVPASKFEGSLTRFARSFSKFEGASLRCYFSKFE
GASLHCSFSKFEGASLYCSFSKFEGASLCCYLPPSSKFLPPSLKVPSRASLAPSPSSKALLSVATSPSSKALLSTAPSPSSKALLSTAPSPSSKVLLSTPLFEGSPLRFS
FSKFEGSPLLLFKCLAEVDVLVPLHLQMLVVDGVCCASSSNVGRNYSHQSDWSRQVVKSLQLNLMTTVEGESGLVTTPAGYSNHPIKWGLGLAGVHEANLVTTPAGYSDH
PIKWGLGLAGVHEGESGDYPCRLLRSPNEIGDWSSRSDITASEFGGFTTISRGQNFEAQSQSQRIQITSSRLKTEDSFKIGRLQAPRDQQANRRSRRSQVSSDHPRRSTS
QPIEQIIKLTDHQEDQQVSRLIIQEDQQANKPIQQIIKRQADPRDQQANRPIKKINKSAGRSSRRSTSQQADHPRDQQANRPIKKINKSAGRSSKRSTSQQADHPRGSTS