CuGenDBv2

Spg011043 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg011043
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Retrotransposon gag protein
Genome location: scaffold4:29499948..29505824
RNA-Seq Expression: Spg011043
Synteny: Spg011043
Gene Ontology terms: NA
InterPro domains: NA
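
The genome location string above can be split into a scaffold name and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention); `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re


def parse_location(loc: str):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive,
    # so both endpoints count toward the length.
    return end - start + 1


seqid, start, end = parse_location("scaffold4:29499948..29505824")
print(seqid, span_length(start, end))
```

If the coordinates were instead 0-based and half-open, the length would be `end - start` without the `+ 1`.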


Homology
GenBank top hits (e value, % identity, alignment)
KAA0040811.1 retrotransposon gag protein [Cucumis melo var. makuwa] | e value: 3.1e-33 | % identity: 51.52
Query:  IEHLKSQIENQHIAESSQTQRQRSKKFSQPRQPVTVKELFSRTF---HKKEKENFA---TSYCIE-------EEEVDNSKKGEQRTSIFDRIKPPTTCPS
        I H K    N+ + +S    + + + F QPRQ +T+ E F R+F   H KE        T+  +E        EEVDNS + +QRTS+FDRIKP TT  S
Subjt:  IEHLKSQIENQHIAESSQTQRQRSKKFSQPRQPVTVKELFSRTF---HKKEKENFA---TSYCIE-------EEEVDNSKKGEQRTSIFDRIKPPTTCPS

Query:  VFQRMSMTATEEENQCVVSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPRRKIDNLDVKLFNEVGSDKKLQSSIPSRMKRKFSVLINTEGSL
        VFQR+S+T  EEENQC  ST TR SAF+ LS+STSKK R STS FDRLK+ NDQ +R++ +L VK F+E   D K+ S +PSRMKRK SV INTEGSL
Subjt:  VFQRMSMTATEEENQCVVSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPRRKIDNLDVKLFNEVGSDKKLQSSIPSRMKRKFSVLINTEGSL

KAA0044978.1 retrotransposon gag protein [Cucumis melo var. makuwa] | e value: 2.4e-33 | % identity: 55.29
Query:  QPRQPVTVKELFSRTF---HKKEKENFATSYCI----------EEEEVDNSKKGEQRTSIFDRIKPPTTCPSVFQRMSMTATEEENQCVVSTSTRPSAFQ
        QPRQ +T+ E F R+F   H +E     T +              EEVDNS + +QRTS+FDRIKP TT  SVFQR+SM   EEENQC  ST  R SAF+
Subjt:  QPRQPVTVKELFSRTF---HKKEKENFATSYCI----------EEEEVDNSKKGEQRTSIFDRIKPPTTCPSVFQRMSMTATEEENQCVVSTSTRPSAFQ

Query:  RLSVSTSKKSRSSTSVFDRLKVTNDQPRRKIDNLDVKLFNEVGSDKKLQSSIPSRMKRKFSVLINTEGSL
        RLS+STSKK R STS FDRLK+TNDQ +R++ +L  K F+E   D K+ S +PSRMKRK SV INTEGSL
Subjt:  RLSVSTSKKSRSSTSVFDRLKVTNDQPRRKIDNLDVKLFNEVGSDKKLQSSIPSRMKRKFSVLINTEGSL

KAA0050734.1 gag protease polyprotein [Cucumis melo var. makuwa] | e value: 4.5e-32 | % identity: 52.84
Query:  RSKKFSQPRQPVTVKELFSRTFHKKEKENFA------TSYCIE-------EEEVDNSKKGEQRTSIFDRIKPPTTCPSVFQRMSMTATEEENQCVVSTST
        + + F QPR+ +T+ E   R+F +   E         T+  +E        EEVDNS + +QRTSIFDRIKP TT   VFQR+SM   EEENQC  ST  
Subjt:  RSKKFSQPRQPVTVKELFSRTFHKKEKENFA------TSYCIE-------EEEVDNSKKGEQRTSIFDRIKPPTTCPSVFQRMSMTATEEENQCVVSTST

Query:  RPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPRRKIDNLDVKLFNEVGSDKKLQSSIPSRMKRKFSVLINTEGSL
        R SAF+RLS+STSKK R STS FDRLK+TNDQ +R++ +L  K F+E   D K+ S +PSRMKRK SV INTEGSL
Subjt:  RPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPRRKIDNLDVKLFNEVGSDKKLQSSIPSRMKRKFSVLINTEGSL

KAA0050736.1 retrotransposon gag protein [Cucumis melo var. makuwa] | e value: 8.2e-34 | % identity: 52.25
Query:  RQRSKKFSQPRQPVTVKELFSRTFHKKEKENFA------TSYCIE-------EEEVDNSKKGEQRTSIFDRIKPPTTCPSVFQRMSMTATEEENQCVVST
        +++ +KF QPR+ +T+ E F R+F +   E         T+  +E        EEVDNS + +QRTS+FDRIKP TT  SVFQR+SM   EEENQC +ST
Subjt:  RQRSKKFSQPRQPVTVKELFSRTFHKKEKENFA------TSYCIE-------EEEVDNSKKGEQRTSIFDRIKPPTTCPSVFQRMSMTATEEENQCVVST

Query:  STRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPRRKIDNLDVKLFNEVGSDKKLQSSIPSRMKRKFSVLINTEGSL
         TR SAF+RLS+S SKK R STS FDRLK+TNDQ +R++ +L  K F+E   D K+ S +PSR+KRK S+ INTEGSL
Subjt:  STRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPRRKIDNLDVKLFNEVGSDKKLQSSIPSRMKRKFSVLINTEGSL

TYK08944.1 retrotransposon gag protein [Cucumis melo var. makuwa] | e value: 2.0e-32 | % identity: 52.27
Query:  RSKKFSQPRQPVTVKELFSRTFHKKEKENFA------TSYCIE-------EEEVDNSKKGEQRTSIFDRIKPPTTCPSVFQRMSMTATEEENQCVVSTST
        + + F QPR+ +T+ E  SR+F +   E         T+  +E        EEVDNS + +QRTS+FDRIKP TT  SVFQR+SM   EE+NQC  ST  
Subjt:  RSKKFSQPRQPVTVKELFSRTFHKKEKENFA------TSYCIE-------EEEVDNSKKGEQRTSIFDRIKPPTTCPSVFQRMSMTATEEENQCVVSTST

Query:  RPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPRRKIDNLDVKLFNEVGSDKKLQSSIPSRMKRKFSVLINTEGSL
        R SAF+RLS+STSKK R STS FDRLK+TNDQ +R++ +L  K F+E   D K+ + +PSRMKRK SV INTEGSL
Subjt:  RPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPRRKIDNLDVKLFNEVGSDKKLQSSIPSRMKRKFSVLINTEGSL

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TGM1 Retrotransposon gag protein | e value: 1.5e-33 | % identity: 51.52
Query:  IEHLKSQIENQHIAESSQTQRQRSKKFSQPRQPVTVKELFSRTF---HKKEKENFA---TSYCIE-------EEEVDNSKKGEQRTSIFDRIKPPTTCPS
        I H K    N+ + +S    + + + F QPRQ +T+ E F R+F   H KE        T+  +E        EEVDNS + +QRTS+FDRIKP TT  S
Subjt:  IEHLKSQIENQHIAESSQTQRQRSKKFSQPRQPVTVKELFSRTF---HKKEKENFA---TSYCIE-------EEEVDNSKKGEQRTSIFDRIKPPTTCPS

Query:  VFQRMSMTATEEENQCVVSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPRRKIDNLDVKLFNEVGSDKKLQSSIPSRMKRKFSVLINTEGSL
        VFQR+S+T  EEENQC  ST TR SAF+ LS+STSKK R STS FDRLK+ NDQ +R++ +L VK F+E   D K+ S +PSRMKRK SV INTEGSL
Subjt:  VFQRMSMTATEEENQCVVSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPRRKIDNLDVKLFNEVGSDKKLQSSIPSRMKRKFSVLINTEGSL

A0A5A7TQ06 Retrotransposon gag protein | e value: 1.2e-33 | % identity: 55.29
Query:  QPRQPVTVKELFSRTF---HKKEKENFATSYCI----------EEEEVDNSKKGEQRTSIFDRIKPPTTCPSVFQRMSMTATEEENQCVVSTSTRPSAFQ
        QPRQ +T+ E F R+F   H +E     T +              EEVDNS + +QRTS+FDRIKP TT  SVFQR+SM   EEENQC  ST  R SAF+
Subjt:  QPRQPVTVKELFSRTF---HKKEKENFATSYCI----------EEEEVDNSKKGEQRTSIFDRIKPPTTCPSVFQRMSMTATEEENQCVVSTSTRPSAFQ

Query:  RLSVSTSKKSRSSTSVFDRLKVTNDQPRRKIDNLDVKLFNEVGSDKKLQSSIPSRMKRKFSVLINTEGSL
        RLS+STSKK R STS FDRLK+TNDQ +R++ +L  K F+E   D K+ S +PSRMKRK SV INTEGSL
Subjt:  RLSVSTSKKSRSSTSVFDRLKVTNDQPRRKIDNLDVKLFNEVGSDKKLQSSIPSRMKRKFSVLINTEGSL

A0A5A7U974 Retrotransposon gag protein | e value: 4.0e-34 | % identity: 52.25
Query:  RQRSKKFSQPRQPVTVKELFSRTFHKKEKENFA------TSYCIE-------EEEVDNSKKGEQRTSIFDRIKPPTTCPSVFQRMSMTATEEENQCVVST
        +++ +KF QPR+ +T+ E F R+F +   E         T+  +E        EEVDNS + +QRTS+FDRIKP TT  SVFQR+SM   EEENQC +ST
Subjt:  RQRSKKFSQPRQPVTVKELFSRTFHKKEKENFA------TSYCIE-------EEEVDNSKKGEQRTSIFDRIKPPTTCPSVFQRMSMTATEEENQCVVST

Query:  STRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPRRKIDNLDVKLFNEVGSDKKLQSSIPSRMKRKFSVLINTEGSL
         TR SAF+RLS+S SKK R STS FDRLK+TNDQ +R++ +L  K F+E   D K+ S +PSR+KRK S+ INTEGSL
Subjt:  STRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPRRKIDNLDVKLFNEVGSDKKLQSSIPSRMKRKFSVLINTEGSL

A0A5D3BBF9 Gag protease polyprotein | e value: 2.2e-32 | % identity: 52.84
Query:  RSKKFSQPRQPVTVKELFSRTFHKKEKENFA------TSYCIE-------EEEVDNSKKGEQRTSIFDRIKPPTTCPSVFQRMSMTATEEENQCVVSTST
        + + F QPR+ +T+ E   R+F +   E         T+  +E        EEVDNS + +QRTSIFDRIKP TT   VFQR+SM   EEENQC  ST  
Subjt:  RSKKFSQPRQPVTVKELFSRTFHKKEKENFA------TSYCIE-------EEEVDNSKKGEQRTSIFDRIKPPTTCPSVFQRMSMTATEEENQCVVSTST

Query:  RPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPRRKIDNLDVKLFNEVGSDKKLQSSIPSRMKRKFSVLINTEGSL
        R SAF+RLS+STSKK R STS FDRLK+TNDQ +R++ +L  K F+E   D K+ S +PSRMKRK SV INTEGSL
Subjt:  RPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPRRKIDNLDVKLFNEVGSDKKLQSSIPSRMKRKFSVLINTEGSL

A0A5D3CCI8 Retrotransposon gag protein | e value: 9.8e-33 | % identity: 52.27
Query:  RSKKFSQPRQPVTVKELFSRTFHKKEKENFA------TSYCIE-------EEEVDNSKKGEQRTSIFDRIKPPTTCPSVFQRMSMTATEEENQCVVSTST
        + + F QPR+ +T+ E  SR+F +   E         T+  +E        EEVDNS + +QRTS+FDRIKP TT  SVFQR+SM   EE+NQC  ST  
Subjt:  RSKKFSQPRQPVTVKELFSRTFHKKEKENFA------TSYCIE-------EEEVDNSKKGEQRTSIFDRIKPPTTCPSVFQRMSMTATEEENQCVVSTST

Query:  RPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPRRKIDNLDVKLFNEVGSDKKLQSSIPSRMKRKFSVLINTEGSL
        R SAF+RLS+STSKK R STS FDRLK+TNDQ +R++ +L  K F+E   D K+ + +PSRMKRK SV INTEGSL
Subjt:  RPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPRRKIDNLDVKLFNEVGSDKKLQSSIPSRMKRKFSVLINTEGSL

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
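
For readers unfamiliar with the "% identity" column in the hit tables above, the sketch below shows one common way such a figure is derived from a pairwise alignment: identical aligned residues divided by the alignment length, gaps included. This is an illustrative assumption about the definition, not a statement of how this database computed its values. `percent_identity` is a hypothetical helper, and the two sequences are a toy pair, not taken from this record.

```python
def percent_identity(query: str, subject: str, gap: str = "-") -> float:
    """Percent of alignment columns where both rows carry the same residue.

    Both strings must be rows of the same alignment (equal length).
    Columns containing a gap never count as identical.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    if not query:
        raise ValueError("empty alignment")
    identical = sum(
        1 for q, s in zip(query, subject) if q == s and q != gap
    )
    return 100.0 * identical / len(query)


# Toy alignment (hypothetical data): 8 identical columns out of 11.
q = "HEAG-AWGHEE"
s = "HEAGYAW-HEA"
print(round(percent_identity(q, s), 2))
```

Some tools divide by the number of ungapped columns or by the shorter sequence length instead, so figures from different sources are not always directly comparable.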

Sequences
CDS sequence
ATGGGTTCTACTGCCCATTGTTGCTTCAATGAACTGAGGTTACAAGAAGATAAAGCTTCTATCATTGCAAGCGAAGAAACAACATTGCAGGGGGCATGTACCAATGACAA
GTTTCTTGCTAAGTATAACCCTTTGTTTGAACCTGATTCTGACATAGTGACCGTTATGATGACTGAGACAAGAACTATGGAAGAAAGAATGGCTGAGATGCAAGAGCACA
TCAACACCTTGATGAAGGCAATTGAAGAAAAAGATTCTCAAATTGAGCACCTAAAGAGTCAGATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAAGGCAGAGA
AGTAAAAAGTTTTCTCAACCTCGACAACCGGTGACAGTGAAGGAACTCTTCTCCAGAACTTTCCACAAAAAAGAAAAAGAAAACTTTGCAACTTCCTACTGCATCGAGGA
GGAAGAAGTTGACAATTCCAAGAAGGGTGAACAAAGGACCTCCATCTTCGATCGCATCAAGCCTCCAACTACTTGTCCTTCGGTATTCCAAAGAATGAGTATGACCGCGA
CAGAAGAAGAAAATCAATGTGTGGTGTCCACCTCCACTCGACCTTCGGCTTTCCAAAGACTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTCAACATCTGTCTTTGAT
CGCCTCAAAGTAACAAACGATCAACCTCGAAGAAAGATAGATAACTTAGATGTGAAATTGTTCAATGAAGTAGGCAGTGACAAGAAGCTTCAAAGTAGCATCCCGTCACG
TATGAAGAGGAAGTTCTCTGTTCTCATAAATACAGAAGGTTCCTTGAAGCAAATGGAGGTTATGCATCGTTATGGATGTGAAGCTACGAGTTGGATGAATAAAAGAAAAC
TTCATTCCTCCAAGTTCAATGTCTCGCTAGCCTCAAGTTCGGTGTTTCACTCACCCTATGTTCGTTGTTCTCTCTTCTTCAAGTTTGAAGGTTCTTACGCTGCACTGCTT
CCTTCACCAAGTTTGAAGGTTATCATGCTGCGCTGTTTCGCTGTTCCTTCTCCAAGTTCGAAGGTTCCCACATTGCGCTGTTGTGCTGCTTCCTTCTCCAAGTTTGAAGG
TCCTGACACTGGATCCTCCAAGTCGAAGGTTCTCAAGTGGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTTGCTTCGCTGCAGTTTCCTTCCTCCAAG
TTCGAAGGTTCTCAGTTCCTTCCTCCAAGTTCGAGAAGGTTCTCATCCGCTTCGCTGGAGTTCTTTCTCCCCAAGTTTTAAACTTCTCATGTGCTTCGTTGCAGTTCGAA
GGTTTCAAGTTGCTTCGCTGCAGTTTCCTTCCTCCAACTTCGAAGGATCCTCCAGGTCGAAGGTTCTCAGGTTCTCATCCGCTTCACTGGAGTTCTTTCTCCCCAAGTTT
GAAACTTCTCATGCGCTCGTGTTCTCACGTTGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAGAAGGTTCTCATCCGCTTTGCTGGAGTTCTTCTTTCTCCCCAAGTTTG
AAACTTCTCATGTGCTTCGTTGCAGTTCGAAGGTTTCAAAGTTGCTTCGCTGCGTGTTTCTCACGTTGCTTCGCTGTAGTTCCTTCCTCCAAGTTCGAGAAGGTTCTCAT
CCGCTTCGCTGGAGTTCTTCTTTCTCCCCAAGTTTGAAACTTCTCATGTGCTTCGTTGAAGGTTTCAAGTTGCTTCGCTGCAGTTTCCTTCCTCCAAAGGCAAATCTGGT
GACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTAAAATCACTACAAGTGAAGTTGATGACGACCGTGGTGACCACCCCTGCAGGAAACT
ACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTACAAGAGAAGTTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGG
TCTAGACAGGTGGTGAAATCACTGCAAGTGAAGTTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGGCTGGTCTAGACAGGTGGTGGTGAA
ATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGA
TGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCATTGCAAGTGAAGCTGATGACGACCGTGGTGACCACC
CCTGGAGGAAACTACAGTCATCAAAGTGACTGGGAAAGAGGAGGAGTTGGAAGCTCTCAGCCAGAGTCAGAGAATTCAGAGAAACTCCACCAAGACTTCTTGAAGACTGA
AGACCCTTCAAGACTAGAAGACTTCAACGATCCTTGA
mRNA sequence
ATGGGTTCTACTGCCCATTGTTGCTTCAATGAACTGAGGTTACAAGAAGATAAAGCTTCTATCATTGCAAGCGAAGAAACAACATTGCAGGGGGCATGTACCAATGACAA
GTTTCTTGCTAAGTATAACCCTTTGTTTGAACCTGATTCTGACATAGTGACCGTTATGATGACTGAGACAAGAACTATGGAAGAAAGAATGGCTGAGATGCAAGAGCACA
TCAACACCTTGATGAAGGCAATTGAAGAAAAAGATTCTCAAATTGAGCACCTAAAGAGTCAGATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAAGGCAGAGA
AGTAAAAAGTTTTCTCAACCTCGACAACCGGTGACAGTGAAGGAACTCTTCTCCAGAACTTTCCACAAAAAAGAAAAAGAAAACTTTGCAACTTCCTACTGCATCGAGGA
GGAAGAAGTTGACAATTCCAAGAAGGGTGAACAAAGGACCTCCATCTTCGATCGCATCAAGCCTCCAACTACTTGTCCTTCGGTATTCCAAAGAATGAGTATGACCGCGA
CAGAAGAAGAAAATCAATGTGTGGTGTCCACCTCCACTCGACCTTCGGCTTTCCAAAGACTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTCAACATCTGTCTTTGAT
CGCCTCAAAGTAACAAACGATCAACCTCGAAGAAAGATAGATAACTTAGATGTGAAATTGTTCAATGAAGTAGGCAGTGACAAGAAGCTTCAAAGTAGCATCCCGTCACG
TATGAAGAGGAAGTTCTCTGTTCTCATAAATACAGAAGGTTCCTTGAAGCAAATGGAGGTTATGCATCGTTATGGATGTGAAGCTACGAGTTGGATGAATAAAAGAAAAC
TTCATTCCTCCAAGTTCAATGTCTCGCTAGCCTCAAGTTCGGTGTTTCACTCACCCTATGTTCGTTGTTCTCTCTTCTTCAAGTTTGAAGGTTCTTACGCTGCACTGCTT
CCTTCACCAAGTTTGAAGGTTATCATGCTGCGCTGTTTCGCTGTTCCTTCTCCAAGTTCGAAGGTTCCCACATTGCGCTGTTGTGCTGCTTCCTTCTCCAAGTTTGAAGG
TCCTGACACTGGATCCTCCAAGTCGAAGGTTCTCAAGTGGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTTGCTTCGCTGCAGTTTCCTTCCTCCAAG
TTCGAAGGTTCTCAGTTCCTTCCTCCAAGTTCGAGAAGGTTCTCATCCGCTTCGCTGGAGTTCTTTCTCCCCAAGTTTTAAACTTCTCATGTGCTTCGTTGCAGTTCGAA
GGTTTCAAGTTGCTTCGCTGCAGTTTCCTTCCTCCAACTTCGAAGGATCCTCCAGGTCGAAGGTTCTCAGGTTCTCATCCGCTTCACTGGAGTTCTTTCTCCCCAAGTTT
GAAACTTCTCATGCGCTCGTGTTCTCACGTTGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAGAAGGTTCTCATCCGCTTTGCTGGAGTTCTTCTTTCTCCCCAAGTTTG
AAACTTCTCATGTGCTTCGTTGCAGTTCGAAGGTTTCAAAGTTGCTTCGCTGCGTGTTTCTCACGTTGCTTCGCTGTAGTTCCTTCCTCCAAGTTCGAGAAGGTTCTCAT
CCGCTTCGCTGGAGTTCTTCTTTCTCCCCAAGTTTGAAACTTCTCATGTGCTTCGTTGAAGGTTTCAAGTTGCTTCGCTGCAGTTTCCTTCCTCCAAAGGCAAATCTGGT
GACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTAAAATCACTACAAGTGAAGTTGATGACGACCGTGGTGACCACCCCTGCAGGAAACT
ACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTACAAGAGAAGTTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGG
TCTAGACAGGTGGTGAAATCACTGCAAGTGAAGTTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGGCTGGTCTAGACAGGTGGTGGTGAA
ATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGA
TGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCATTGCAAGTGAAGCTGATGACGACCGTGGTGACCACC
CCTGGAGGAAACTACAGTCATCAAAGTGACTGGGAAAGAGGAGGAGTTGGAAGCTCTCAGCCAGAGTCAGAGAATTCAGAGAAACTCCACCAAGACTTCTTGAAGACTGA
AGACCCTTCAAGACTAGAAGACTTCAACGATCCTTGA
Protein sequence
MGSTAHCCFNELRLQEDKASIIASEETTLQGACTNDKFLAKYNPLFEPDSDIVTVMMTETRTMEERMAEMQEHINTLMKAIEEKDSQIEHLKSQIENQHIAESSQTQRQR
SKKFSQPRQPVTVKELFSRTFHKKEKENFATSYCIEEEEVDNSKKGEQRTSIFDRIKPPTTCPSVFQRMSMTATEEENQCVVSTSTRPSAFQRLSVSTSKKSRSSTSVFD
RLKVTNDQPRRKIDNLDVKLFNEVGSDKKLQSSIPSRMKRKFSVLINTEGSLKQMEVMHRYGCEATSWMNKRKLHSSKFNVSLASSSVFHSPYVRCSLFFKFEGSYAALL
PSPSLKVIMLRCFAVPSPSSKVPTLRCCAASFSKFEGPDTGSSKSKVLKWLRCSSFLQVRRFSRCFAAVSFLQVRRFSVPSSKFEKVLIRFAGVLSPQVLNFSCASLQFE
GFKLLRCSFLPPTSKDPPGRRFSGSHPLHWSSFSPSLKLLMRSCSHVASLQFLPPSSRRFSSALLEFFFLPKFETSHVLRCSSKVSKLLRCVFLTLLRCSSFLQVREGSH
PLRWSSSFSPSLKLLMCFVEGFKLLRCSFLPPKANLVTTPAGNYSHQSDWSRQVVKSLQVKLMTTVVTTPAGNYSHQSDWSRQVVKSLQEKLMTTVVTTPAGNYSHQSDW
SRQVVKSLQVKLMTTVVTTPAGNYSHQSGWSRQVVVKSLQVKLMTTVVTTPAGNYSHQSDWSRQVVKSLQVKLMTTVVTTPAGNYSHQSDWSRQVVKSLQVKLMTTVVTT
PGGNYSHQSDWERGGVGSSQPESENSEKLHQDFLKTEDPSRLEDFNDP
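
As a quick consistency check between the CDS and the protein sequence listed above, the CDS can be translated codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used here.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an ambiguous codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 9 nt of the CDS shown in this record.
print(translate("ATGGGTTCTACTGCCCATTGTTGCTTCAAT"))
print(translate("GATCCTTGA"))
```

Running `translate` on the full CDS with the line breaks removed would be expected to reproduce the protein sequence if the two were derived from the same gene model.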