; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg023905 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg023905
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRetrotransposon gag protein
Genome locationscaffold13:1820145..1826615
RNA-Seq ExpressionSpg023905
SyntenySpg023905
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036008.1 Retrotransposon gag protein [Cucumis melo var. makuwa]9.8e-3754.55Show/hide
Query:  RSKKFSQPRKPVTVKEIFSKTFHKKKKENLA------TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFT
        + + F QPR+ +T+ E   ++F +   E +       T+  ++V       EEVDNS + +QRTSVFDRIKP TTR  VFQR+SMA  EEENQC +ST+ 
Subjt:  RSKKFSQPRKPVTVKEIFSKTFHKKKKENLA------TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFT

Query:  RPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL
        R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M + + K F E N D K+HSR+PSRMKRK SV INTEGSL
Subjt:  RPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL

KAA0044978.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.7e-3648.4Show/hide
Query:  EEKDSQIEQLKSQIENQH----IAESSQTQRSKKF-------------SQPRKPVTVKEIFSKTF---HKKKKENLATSYCIDV----------EEVDNS
        +E+ +   Q KS    +H    I+   + +R+KK               QPR+ +T+ E F ++F   H ++   + T +   +          EEVDNS
Subjt:  EEKDSQIEQLKSQIENQH----IAESSQTQRSKKF-------------SQPRKPVTVKEIFSKTF---HKKKKENLATSYCIDV----------EEVDNS

Query:  KKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSR
         + +QRTSVFDRIKP TTR SVFQR+SMA  EEENQC  ST+ R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N D K+HSR
Subjt:  KKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSR

Query:  IPSRMKRKFSVLINTEGSL
        +PSRMKRK SV INTEGSL
Subjt:  IPSRMKRKFSVLINTEGSL

KAA0050734.1 gag protease polyprotein [Cucumis melo var. makuwa]5.7e-3754.55Show/hide
Query:  RSKKFSQPRKPVTVKEIFSKTFHKKKKENLA------TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFT
        + + F QPR+ +T+ E   ++F +   E +       T+  ++V       EEVDNS + +QRTS+FDRIKP TTR  VFQR+SMA  EEENQC  ST+ 
Subjt:  RSKKFSQPRKPVTVKEIFSKTFHKKKKENLA------TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFT

Query:  RPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL
        R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N D K+HSR+PSRMKRK SV INTEGSL
Subjt:  RPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL

KAA0050736.1 retrotransposon gag protein [Cucumis melo var. makuwa]4.0e-3854.8Show/hide
Query:  QRSKKFSQPRKPVTVKEIFSKTFHKKKKENLA------TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTF
        ++ +KF QPR+ +T+ E F ++F +   E +       T+  ++V       EEVDNS + +QRTSVFDRIKP TTR SVFQR+SMA  EEENQC +ST+
Subjt:  QRSKKFSQPRKPVTVKEIFSKTFHKKKKENLA------TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTF

Query:  TRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL
        TR SAF+RLS+S SKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N D K++SR+PSR+KRK S+ INTEGSL
Subjt:  TRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL

TYK08944.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.3e-3648.4Show/hide
Query:  EEKDSQIEQLKSQIENQH----IAESSQTQRSKK-------------FSQPRKPVTVKEIFSKTFHKKKKENLA------TSYCIDV-------EEVDNS
        +E+ +   Q KS    +H    I+   + +R+KK             F QPR+ +T+ E  S++F +   E +       T+  ++V       EEVDNS
Subjt:  EEKDSQIEQLKSQIENQH----IAESSQTQRSKK-------------FSQPRKPVTVKEIFSKTFHKKKKENLA------TSYCIDV-------EEVDNS

Query:  KKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSR
         + +QRTSVFDRIKP TTR SVFQR+SMA  EE+NQC  ST+ R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N D K+H+R
Subjt:  KKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSR

Query:  IPSRMKRKFSVLINTEGSL
        +PSRMKRK SV INTEGSL
Subjt:  IPSRMKRKFSVLINTEGSL

TrEMBL top hitse value%identityAlignment
A0A5A7SZJ7 Retrotransposon gag protein4.7e-3754.55Show/hide
Query:  RSKKFSQPRKPVTVKEIFSKTFHKKKKENLA------TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFT
        + + F QPR+ +T+ E   ++F +   E +       T+  ++V       EEVDNS + +QRTSVFDRIKP TTR  VFQR+SMA  EEENQC +ST+ 
Subjt:  RSKKFSQPRKPVTVKEIFSKTFHKKKKENLA------TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFT

Query:  RPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL
        R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M + + K F E N D K+HSR+PSRMKRK SV INTEGSL
Subjt:  RPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL

A0A5A7TQ06 Retrotransposon gag protein8.1e-3748.4Show/hide
Query:  EEKDSQIEQLKSQIENQH----IAESSQTQRSKKF-------------SQPRKPVTVKEIFSKTF---HKKKKENLATSYCIDV----------EEVDNS
        +E+ +   Q KS    +H    I+   + +R+KK               QPR+ +T+ E F ++F   H ++   + T +   +          EEVDNS
Subjt:  EEKDSQIEQLKSQIENQH----IAESSQTQRSKKF-------------SQPRKPVTVKEIFSKTF---HKKKKENLATSYCIDV----------EEVDNS

Query:  KKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSR
         + +QRTSVFDRIKP TTR SVFQR+SMA  EEENQC  ST+ R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N D K+HSR
Subjt:  KKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSR

Query:  IPSRMKRKFSVLINTEGSL
        +PSRMKRK SV INTEGSL
Subjt:  IPSRMKRKFSVLINTEGSL

A0A5A7U974 Retrotransposon gag protein1.9e-3854.8Show/hide
Query:  QRSKKFSQPRKPVTVKEIFSKTFHKKKKENLA------TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTF
        ++ +KF QPR+ +T+ E F ++F +   E +       T+  ++V       EEVDNS + +QRTSVFDRIKP TTR SVFQR+SMA  EEENQC +ST+
Subjt:  QRSKKFSQPRKPVTVKEIFSKTFHKKKKENLA------TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTF

Query:  TRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL
        TR SAF+RLS+S SKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N D K++SR+PSR+KRK S+ INTEGSL
Subjt:  TRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL

A0A5D3BBF9 Gag protease polyprotein2.8e-3754.55Show/hide
Query:  RSKKFSQPRKPVTVKEIFSKTFHKKKKENLA------TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFT
        + + F QPR+ +T+ E   ++F +   E +       T+  ++V       EEVDNS + +QRTS+FDRIKP TTR  VFQR+SMA  EEENQC  ST+ 
Subjt:  RSKKFSQPRKPVTVKEIFSKTFHKKKKENLA------TSYCIDV-------EEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFT

Query:  RPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL
        R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N D K+HSR+PSRMKRK SV INTEGSL
Subjt:  RPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL

A0A5D3CCI8 Retrotransposon gag protein6.2e-3748.4Show/hide
Query:  EEKDSQIEQLKSQIENQH----IAESSQTQRSKK-------------FSQPRKPVTVKEIFSKTFHKKKKENLA------TSYCIDV-------EEVDNS
        +E+ +   Q KS    +H    I+   + +R+KK             F QPR+ +T+ E  S++F +   E +       T+  ++V       EEVDNS
Subjt:  EEKDSQIEQLKSQIENQH----IAESSQTQRSKK-------------FSQPRKPVTVKEIFSKTFHKKKKENLA------TSYCIDV-------EEVDNS

Query:  KKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSR
         + +QRTSVFDRIKP TTR SVFQR+SMA  EE+NQC  ST+ R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N D K+H+R
Subjt:  KKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNSDKKLHSR

Query:  IPSRMKRKFSVLINTEGSL
        +PSRMKRK SV INTEGSL
Subjt:  IPSRMKRKFSVLINTEGSL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTCTACTGCCCATCGTTGCTTCAACGGACTGACGTTGCAAGAAGATAAAGCTTCTGTCGTTGCAGGCCAAGAAACAACCTTGCAGGGGGCATATACTAATGACAA
GTTTTTTGTCAAGTATAACCCTTTGTTTGAACCTGATTCTGACGTAGTGACTGTTATGATGACTGAGACAAAAACTATGGAAGAAAGAATGGCTGAGATGCAAGAACACA
TCAACAACTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATTGAGCAACTAAAGAGCCAAATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAAGAAGTAAA
AAGTTTTCTCAACCTCGAAAACCGGTGACTGTGAAGGAGATCTTCTCCAAAACTTTCCACAAAAAGAAAAAAGAGAACCTTGCAACTTCCTACTGCATCGACGTAGAAGA
AGTTGACAATTCCAAGAAAAGTGAACAAAGGACTTCTGTCTTCGATCGCATCAAGCCTCCAACTACTCGTCCTTCAGTATTCCAAAGAATGAGTATGGCCGCGACAGAAG
AAGAAAATCAATGTTCGGTGTCCACCTTCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCAACCTTCGACATCTGTTTTTGATCGCCTC
AAAGTAACAAGCGATCAACCTAAAAGAAAGATGGATAACTTGGAGGTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGAATCCCGTCACGTATGAA
GAGGAAGTTTTCTGTTCTCATAAATACGGAAGGTTCCTTGAAGTGGGGGCAACACAGCGAATGGAAGTTGCTTCCTCCAAGTTCGAAGGTTTCCACGCGCTTCGCTGCAG
TTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGATCCTTCCCTCCAAGTTCGAAGG
TTCTCACCTCGCTTCGCTGCTGTTCCCTCCTCCAAGTTCGAAGGTTCTCACGCGTTTCACTGCAGTTCCTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCACTGAAGTT
CCTTCCCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGCTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGAT
CCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGATCGAAGGTTCTCATGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCT
CACGCGCTTCGTTGCATTTCCTTCCCCCCAAATTTGAAGGTTCTCACGTGCTTCGCTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACACGCGCTTCGCTGACGCCGC
TTTGCGCTGTAGTTCCTTCCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCC
CCAAGTTCGAAGTTCCTTCCCTCCAAGTTCGAAGGTGTTCTCACGCGCTTCGTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCCCTTCCT
CCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAGTTCCTTCCCCCGAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACG
TCGCTTCGCTGCGCTCATGCGCTTCGCTACAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCGTCCAAGTTCGAAGGTTCTCACGCGCTT
CGCTCTGCAATTCCTTCCCCAAGATCGAAGATTCTCATGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCCCCAAAT
TCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCG
TGCAATTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAGTTC
GAAGGTTCTCACGCACTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGCTGCACTCCAGCGCTACTTCCTAAAGTCCAAAGACGTCAATTGTCCT
CACGCTGCGCTGCTTCCTTCTCCAAGTTCAAGGGTCCTCATGCTACGCTCGGCTACATTGCTGCGCTACTTTCTAAAGTCCAAAGACGTCAATTGTCCCTGCACTCATGC
TGTAAAGGGCATGGCGGCGACACAAGTCCAAGGACATGCGTGGCAGCGGCACAACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAA
GGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGCGCGGCGGAGGCACAAGTCCAAGGAACATGTCCCAAC
TCAAGGAACATGTCCGTGCACTCGTGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAGCTCAAGGAACATGTCCGTGCACTCGCGTGGCGGCGACACAAGTCCAAGGA
ACATGTCTCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGCGTGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCCTGTACTCATGC
TGAAAAGGGCGTGGCGGCGACACAAGTCCAAGGAACATGTTCCAACTCAAGGAACATGTCCTTGCACTCGTGTTGAAAGGCGTGGCAGCGGCACAACAAGTCCAAGGAAC
ATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGCGTGG
CGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCAACTCGTGTTGAAAGGTGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAGCTCAAGG
AACATGTCCGTGCACTCGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGAGTGACGGCGGCACAAGTCCA
AGGAACATGTCCCAACTCAAGGAACATGTCCTTGCACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCTCAACTCAAGGAACATGTCCGTGCACTC
GGCGTGGCGGCGACACAAGTCCAAGGAACATGTTCCAACTCAAGGAACATGTCCTTGCACTCGTGTTGAAAGGCGTGGCAGCGGCACAACAAGTCCAAGGAACATGTCCC
AACTCAAGGAACATGTCCGTGCACTCGCGCGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGCGTGGCGGCGGC
ACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCAACTCGTGCTGAAAGGTGTGGCGGCGACACAAGTCTAAGGAACATGTCCCAGCTCAAGGAACATGT
CCGTGCACTCGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGCGTGACGGCGGCACAAGTCCAAGGAACA
TGTCCCAACTCAAGGAACATGTCCTTGCACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCTCAACTCAAGGAACATGTCCGTGCACTCGTGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTTCTACTGCCCATCGTTGCTTCAACGGACTGACGTTGCAAGAAGATAAAGCTTCTGTCGTTGCAGGCCAAGAAACAACCTTGCAGGGGGCATATACTAATGACAA
GTTTTTTGTCAAGTATAACCCTTTGTTTGAACCTGATTCTGACGTAGTGACTGTTATGATGACTGAGACAAAAACTATGGAAGAAAGAATGGCTGAGATGCAAGAACACA
TCAACAACTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATTGAGCAACTAAAGAGCCAAATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAAGAAGTAAA
AAGTTTTCTCAACCTCGAAAACCGGTGACTGTGAAGGAGATCTTCTCCAAAACTTTCCACAAAAAGAAAAAAGAGAACCTTGCAACTTCCTACTGCATCGACGTAGAAGA
AGTTGACAATTCCAAGAAAAGTGAACAAAGGACTTCTGTCTTCGATCGCATCAAGCCTCCAACTACTCGTCCTTCAGTATTCCAAAGAATGAGTATGGCCGCGACAGAAG
AAGAAAATCAATGTTCGGTGTCCACCTTCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCAACCTTCGACATCTGTTTTTGATCGCCTC
AAAGTAACAAGCGATCAACCTAAAAGAAAGATGGATAACTTGGAGGTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGAATCCCGTCACGTATGAA
GAGGAAGTTTTCTGTTCTCATAAATACGGAAGGTTCCTTGAAGTGGGGGCAACACAGCGAATGGAAGTTGCTTCCTCCAAGTTCGAAGGTTTCCACGCGCTTCGCTGCAG
TTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGATCCTTCCCTCCAAGTTCGAAGG
TTCTCACCTCGCTTCGCTGCTGTTCCCTCCTCCAAGTTCGAAGGTTCTCACGCGTTTCACTGCAGTTCCTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCACTGAAGTT
CCTTCCCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGCTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGAT
CCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGATCGAAGGTTCTCATGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCT
CACGCGCTTCGTTGCATTTCCTTCCCCCCAAATTTGAAGGTTCTCACGTGCTTCGCTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACACGCGCTTCGCTGACGCCGC
TTTGCGCTGTAGTTCCTTCCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCC
CCAAGTTCGAAGTTCCTTCCCTCCAAGTTCGAAGGTGTTCTCACGCGCTTCGTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCCCTTCCT
CCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAGTTCCTTCCCCCGAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACG
TCGCTTCGCTGCGCTCATGCGCTTCGCTACAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCGTCCAAGTTCGAAGGTTCTCACGCGCTT
CGCTCTGCAATTCCTTCCCCAAGATCGAAGATTCTCATGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCCCCAAAT
TCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCG
TGCAATTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAGTTC
GAAGGTTCTCACGCACTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGCTGCACTCCAGCGCTACTTCCTAAAGTCCAAAGACGTCAATTGTCCT
CACGCTGCGCTGCTTCCTTCTCCAAGTTCAAGGGTCCTCATGCTACGCTCGGCTACATTGCTGCGCTACTTTCTAAAGTCCAAAGACGTCAATTGTCCCTGCACTCATGC
TGTAAAGGGCATGGCGGCGACACAAGTCCAAGGACATGCGTGGCAGCGGCACAACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAA
GGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGCGCGGCGGAGGCACAAGTCCAAGGAACATGTCCCAAC
TCAAGGAACATGTCCGTGCACTCGTGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAGCTCAAGGAACATGTCCGTGCACTCGCGTGGCGGCGACACAAGTCCAAGGA
ACATGTCTCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGCGTGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCCTGTACTCATGC
TGAAAAGGGCGTGGCGGCGACACAAGTCCAAGGAACATGTTCCAACTCAAGGAACATGTCCTTGCACTCGTGTTGAAAGGCGTGGCAGCGGCACAACAAGTCCAAGGAAC
ATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGCGTGG
CGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCAACTCGTGTTGAAAGGTGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAGCTCAAGG
AACATGTCCGTGCACTCGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGAGTGACGGCGGCACAAGTCCA
AGGAACATGTCCCAACTCAAGGAACATGTCCTTGCACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCTCAACTCAAGGAACATGTCCGTGCACTC
GGCGTGGCGGCGACACAAGTCCAAGGAACATGTTCCAACTCAAGGAACATGTCCTTGCACTCGTGTTGAAAGGCGTGGCAGCGGCACAACAAGTCCAAGGAACATGTCCC
AACTCAAGGAACATGTCCGTGCACTCGCGCGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGCGTGGCGGCGGC
ACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCAACTCGTGCTGAAAGGTGTGGCGGCGACACAAGTCTAAGGAACATGTCCCAGCTCAAGGAACATGT
CCGTGCACTCGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGCGTGACGGCGGCACAAGTCCAAGGAACA
TGTCCCAACTCAAGGAACATGTCCTTGCACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCTCAACTCAAGGAACATGTCCGTGCACTCGTGCTGA
Protein sequenceShow/hide protein sequence
MGSTAHRCFNGLTLQEDKASVVAGQETTLQGAYTNDKFFVKYNPLFEPDSDVVTVMMTETKTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQRSK
KFSQPRKPVTVKEIFSKTFHKKKKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAATEEENQCSVSTFTRPSAFQRLSVSTSKKSQPSTSVFDRL
KVTSDQPKRKMDNLEVKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKWGQHSEWKLLPPSSKVSTRFAAVPSPQIRRFSRASLQFLPHSSKVLTRFAADPSLQVRR
FSPRFAAVPSSKFEGSHAFHCSSFLTVRRFSRASLKFLPPQVRRFSRRFAAVPSSKFEAPSSKFEGSHIASLRSFLQVRRFSRASLCNSFPKIEGSHALRAVPSSKFEGS
HALRCISFPPNLKVLTCFAAVPSSKFEGSHTRFADAALRCSSFPPSLKVLTSLRCDPSSKFEGSHALRSAIPSPKFEVPSLQVRRCSHALRAVPSPKFEGSHALRAVPLP
PSSKVLTRFALQFLPPKFEGSHALRCSSFLQVQRFSRRFAALMRFATVPSSKFEGSHIASLRSFVQVRRFSRASLCNSFPKIEDSHALRAVPSSKFEGSHALRCISFPPN
SKVLTRFAAVPSSKFEGSHALRSAIPSPKFEGSHALRAIPSSKFEGSHALRCSSFPPNSKVLTRFAAVPSPQVRRFSRTSLQFLPPSSKVLTRFAALQRYFLKSKDVNCP
HAALLPSPSSRVLMLRSATLLRYFLKSKDVNCPCTHAVKGMAATQVQGHAWQRHNKSKEHVPTQGTCPCTRAERRGGDTSPRNMSQLKEHVRALVLKGAAEAQVQGTCPN
SRNMSVHSCGGDTSPRNMSQLKEHVRALAWRRHKSKEHVSTQGTCPCTRAERRGGGTSPRNMSQLKEHVPVLMLKRAWRRHKSKEHVPTQGTCPCTRVERRGSGTTSPRN
MSQLKEHVRALVLKGVAATQVQGTCPNSRNMSVHSRGGGTSPRNMSQLKEHVLATRVERCGGDTSPRNMSQLKEHVRALAWRRHKSKEHVPTQGTCPCTRAERSDGGTSP
RNMSQLKEHVLALVLKGVAATQVQGTCLNSRNMSVHSAWRRHKSKEHVPTQGTCPCTRVERRGSGTTSPRNMSQLKEHVRALARRRHKSKEHVPTQGTCPCTRAERRGGG
TSPRNMSQLKEHVLATRAERCGGDTSLRNMSQLKEHVRALAWRRHKSKEHVPTQGTCPCTRAERRDGGTSPRNMSQLKEHVLALVLKGVAATQVQGTCLNSRNMSVHSC