; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg002397 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg002397
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationscaffold6:396373..400918
RNA-Seq ExpressionSpg002397
SyntenySpg002397
Gene Ontology termsGO:0044237 - cellular metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0003824 - catalytic activity (molecular function)
GO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047189.1 hypothetical protein E6C27_scaffold83G00690 [Cucumis melo var. makuwa]7.2e-1630.37Show/hide
Query:  WVRKEKEVVDLKLDEFCVVSRMFAHNAWKEVKQVLEDYFHSEVILNPFMADNALVKLND---------------------KFSEKKFDGNISSQTL----
        WV +  EV+    +   +++++FA +  ++++++LE+YF +++++NP   +NAL+ L++                     KF  +K+D +  S+ L    
Subjt:  WVRKEKEVVDLKLDEFCVVSRMFAHNAWKEVKQVLEDYFHSEVILNPFMADNALVKLND---------------------KFSEKKFDGNISSQTL----

Query:  --------NLL-------DCSEARIEVKRNFCGFIPAEIVVTDKIHGNFALRFGDISSLDPSCFIPMDLSLSDFDNEIDLKRVSQVLMDEG
                NLL         SEARI+VK N CGF+P+ I + D   GN  L FGD   L+P       + +SDF   I L R+ +VL DEG
Subjt:  --------NLL-------DCSEARIEVKRNFCGFIPAEIVVTDKIHGNFALRFGDISSLDPSCFIPMDLSLSDFDNEIDLKRVSQVLMDEG

KAA0063414.1 uncharacterized protein E6C27_scaffold508G00510 [Cucumis melo var. makuwa]4.0e-2232.17Show/hide
Query:  GWFLECSIWPPSGGKKKVQVPVGYTKIGWSIFWEMIRDFLLKFGEKKSVENISKKSSYEELSKLVFNANKNLDGSFDDEVKFKSGEPLLGSHSKKLPVMS
        GW L C++WP SGG+  + +PVG  + GW  F  MI+DFL     K+ +    +    + L    F+       S  +   FK                S
Subjt:  GWFLECSIWPPSGGKKKVQVPVGYTKIGWSIFWEMIRDFLLKFGEKKSVENISKKSSYEELSKLVFNANKNLDGSFDDEVKFKSGEPLLGSHSKKLPVMS

Query:  SFWVRKEKEVVDLKLDEFCVVSRMFAHNAWKEVKQVLEDYFHSEVILNPFMADNALVKLNDKFSEKKFDG--NISSQTLNLLDCSEARIEVKRNFCGFIP
        S WV K  EV  L+ D   ++  +  +  W  +K +  DY+  +V                K     F G  +IS +T+NL++CSEA+I+V +N CGF+P
Subjt:  SFWVRKEKEVVDLKLDEFCVVSRMFAHNAWKEVKQVLEDYFHSEVILNPFMADNALVKLNDKFSEKKFDG--NISSQTLNLLDCSEARIEVKRNFCGFIP

Query:  AEIVVTDKIHGNFALRFGDISSLDPSCFIPMDLSLSDFDNEIDLKRVSQVLMDEGFSS
        A + + D    N  L FGDI  L+    I   L +S  +N IDL R++QVL+DEG  S
Subjt:  AEIVVTDKIHGNFALRFGDISSLDPSCFIPMDLSLSDFDNEIDLKRVSQVLMDEGFSS

TYJ99818.1 kinesin heavy chain-like [Cucumis melo var. makuwa]3.6e-1526.94Show/hide
Query:  RDFLLKFGEKKSVENISKKSSYEELSKLVFNANKNLDGSFDDEVKFKSGEPLLGSHSKKLPVMSSFWVRKEKEVVDLKLDEFCVVSRMFAHNAWKEVKQV
        +D   K  EK+     + K   E+  ++    N+NL      + +FKS E +  +   +       WV +  EV     +   +V+++FA + W+ +++ 
Subjt:  RDFLLKFGEKKSVENISKKSSYEELSKLVFNANKNLDGSFDDEVKFKSGEPLLGSHSKKLPVMSSFWVRKEKEVVDLKLDEFCVVSRMFAHNAWKEVKQV

Query:  LEDYFHSEVILNPFMADNALVKLNDKFSEKKFDGNISSQTLNLLDCSEARIEVKRNFCGFIPAEIVVTDKIHGNFALRFGDISSLDPSCFIPM---DLSL
        LE+YF +++++NP   +NA + ++    + +       Q +      EARI+VK+N CGF+P+ I +T+   GN    FGD       C + +    L +
Subjt:  LEDYFHSEVILNPFMADNALVKLNDKFSEKKFDGNISSQTLNLLDCSEARIEVKRNFCGFIPAEIVVTDKIHGNFALRFGDISSLDPSCFIPM---DLSL

Query:  SDFDNEIDLKRVSQVLMDE
          F N  DL R+ +VL DE
Subjt:  SDFDNEIDLKRVSQVLMDE

TYK16775.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]5.4e-1145.78Show/hide
Query:  NISSQTLNLLDCSEARIEVKRNFCGFIPAEIVVTDKIHGNFALRFGDISSLDPSCFIPMDLSLSDFDNEIDLKRVSQVLMDEG
        NI+ +T+N ++ SEA+I+ K N CGFIPA I ++D+  GN  L FG IS  +P   +  DL   +F N +D+ R+SQV+ DEG
Subjt:  NISSQTLNLLDCSEARIEVKRNFCGFIPAEIVVTDKIHGNFALRFGDISSLDPSCFIPMDLSLSDFDNEIDLKRVSQVLMDEG

TYK28792.1 pseudouridylate synthase 7-like protein isoform X1 [Cucumis melo var. makuwa]5.9e-1050.62Show/hide
Query:  ISSQTLNLLDCSEARIEVKRNFCGFIPAEIVVTDKIHGNFALRFGDISSLDPSCFIPMDLSLSDFDNEIDLKRVSQVLMDE
        IS +TLN  +  EA+I++K N CGF+PA I + D+  G F L +GDI  L PS  +  DL L DF N IDL RV QV+ DE
Subjt:  ISSQTLNLLDCSEARIEVKRNFCGFIPAEIVVTDKIHGNFALRFGDISSLDPSCFIPMDLSLSDFDNEIDLKRVSQVLMDE

TrEMBL top hitse value%identityAlignment
A0A5A7U128 Uncharacterized protein3.5e-1630.37Show/hide
Query:  WVRKEKEVVDLKLDEFCVVSRMFAHNAWKEVKQVLEDYFHSEVILNPFMADNALVKLND---------------------KFSEKKFDGNISSQTL----
        WV +  EV+    +   +++++FA +  ++++++LE+YF +++++NP   +NAL+ L++                     KF  +K+D +  S+ L    
Subjt:  WVRKEKEVVDLKLDEFCVVSRMFAHNAWKEVKQVLEDYFHSEVILNPFMADNALVKLND---------------------KFSEKKFDGNISSQTL----

Query:  --------NLL-------DCSEARIEVKRNFCGFIPAEIVVTDKIHGNFALRFGDISSLDPSCFIPMDLSLSDFDNEIDLKRVSQVLMDEG
                NLL         SEARI+VK N CGF+P+ I + D   GN  L FGD   L+P       + +SDF   I L R+ +VL DEG
Subjt:  --------NLL-------DCSEARIEVKRNFCGFIPAEIVVTDKIHGNFALRFGDISSLDPSCFIPMDLSLSDFDNEIDLKRVSQVLMDEG

A0A5A7V878 DUF4283 domain-containing protein1.9e-2232.17Show/hide
Query:  GWFLECSIWPPSGGKKKVQVPVGYTKIGWSIFWEMIRDFLLKFGEKKSVENISKKSSYEELSKLVFNANKNLDGSFDDEVKFKSGEPLLGSHSKKLPVMS
        GW L C++WP SGG+  + +PVG  + GW  F  MI+DFL     K+ +    +    + L    F+       S  +   FK                S
Subjt:  GWFLECSIWPPSGGKKKVQVPVGYTKIGWSIFWEMIRDFLLKFGEKKSVENISKKSSYEELSKLVFNANKNLDGSFDDEVKFKSGEPLLGSHSKKLPVMS

Query:  SFWVRKEKEVVDLKLDEFCVVSRMFAHNAWKEVKQVLEDYFHSEVILNPFMADNALVKLNDKFSEKKFDG--NISSQTLNLLDCSEARIEVKRNFCGFIP
        S WV K  EV  L+ D   ++  +  +  W  +K +  DY+  +V                K     F G  +IS +T+NL++CSEA+I+V +N CGF+P
Subjt:  SFWVRKEKEVVDLKLDEFCVVSRMFAHNAWKEVKQVLEDYFHSEVILNPFMADNALVKLNDKFSEKKFDG--NISSQTLNLLDCSEARIEVKRNFCGFIP

Query:  AEIVVTDKIHGNFALRFGDISSLDPSCFIPMDLSLSDFDNEIDLKRVSQVLMDEGFSS
        A + + D    N  L FGDI  L+    I   L +S  +N IDL R++QVL+DEG  S
Subjt:  AEIVVTDKIHGNFALRFGDISSLDPSCFIPMDLSLSDFDNEIDLKRVSQVLMDEGFSS

A0A5D3BNW8 Kinesin heavy chain-like1.7e-1526.94Show/hide
Query:  RDFLLKFGEKKSVENISKKSSYEELSKLVFNANKNLDGSFDDEVKFKSGEPLLGSHSKKLPVMSSFWVRKEKEVVDLKLDEFCVVSRMFAHNAWKEVKQV
        +D   K  EK+     + K   E+  ++    N+NL      + +FKS E +  +   +       WV +  EV     +   +V+++FA + W+ +++ 
Subjt:  RDFLLKFGEKKSVENISKKSSYEELSKLVFNANKNLDGSFDDEVKFKSGEPLLGSHSKKLPVMSSFWVRKEKEVVDLKLDEFCVVSRMFAHNAWKEVKQV

Query:  LEDYFHSEVILNPFMADNALVKLNDKFSEKKFDGNISSQTLNLLDCSEARIEVKRNFCGFIPAEIVVTDKIHGNFALRFGDISSLDPSCFIPM---DLSL
        LE+YF +++++NP   +NA + ++    + +       Q +      EARI+VK+N CGF+P+ I +T+   GN    FGD       C + +    L +
Subjt:  LEDYFHSEVILNPFMADNALVKLNDKFSEKKFDGNISSQTLNLLDCSEARIEVKRNFCGFIPAEIVVTDKIHGNFALRFGDISSLDPSCFIPM---DLSL

Query:  SDFDNEIDLKRVSQVLMDE
          F N  DL R+ +VL DE
Subjt:  SDFDNEIDLKRVSQVLMDE

A0A5D3CZ35 DNA/RNA polymerases superfamily protein2.6e-1145.78Show/hide
Query:  NISSQTLNLLDCSEARIEVKRNFCGFIPAEIVVTDKIHGNFALRFGDISSLDPSCFIPMDLSLSDFDNEIDLKRVSQVLMDEG
        NI+ +T+N ++ SEA+I+ K N CGFIPA I ++D+  GN  L FG IS  +P   +  DL   +F N +D+ R+SQV+ DEG
Subjt:  NISSQTLNLLDCSEARIEVKRNFCGFIPAEIVVTDKIHGNFALRFGDISSLDPSCFIPMDLSLSDFDNEIDLKRVSQVLMDEG

A0A5D3DYX0 Pseudouridylate synthase 7-like protein isoform X12.9e-1050.62Show/hide
Query:  ISSQTLNLLDCSEARIEVKRNFCGFIPAEIVVTDKIHGNFALRFGDISSLDPSCFIPMDLSLSDFDNEIDLKRVSQVLMDE
        IS +TLN  +  EA+I++K N CGF+PA I + D+  G F L +GDI  L PS  +  DL L DF N IDL RV QV+ DE
Subjt:  ISSQTLNLLDCSEARIEVKRNFCGFIPAEIVVTDKIHGNFALRFGDISSLDPSCFIPMDLSLSDFDNEIDLKRVSQVLMDE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGACTCTTCTTCGGTTGGAAAAGAGGCGGAGCCTACCACCGCCCTCTCACCACAATCAACAATCGTACGCTTGTTGTCGGTTGAACAAGATATGAAGTCATTAAA
GAGTGATGTCGGTGAGATCAAAAAGATCTTGGAAATGATTTGCGAAAAGATGGGAAACAAAGGGGAACCAAGTTGTAACCTAATGGGGCAATCTACCATGGAGAGAACTT
ATCAAGAACAAGATAAAACTTCTCGGGAATTGGAAGAAAGAATGGGACAGCGACAAGAAAGGCAAACAATGGAGCAAAGAATAGTTCAAGAAACACATTTAGCTCCAAGA
AGTTCTCATTTGAATTGGCTTGAGCTAGCTTTAGTCGATTTATTGCAATGCCCGGTTCATTTGTTCTTTCGCAAGAAGTTAAGAGATTCAAATGGAACAATTCAGTTATC
AAAGTTCAATTCCCAGCAAGGTTGGTTCCTTGAGTGCTCTATTTGGCCCCCCTCGGGTGGAAAGAAAAAAGTGCAAGTTCCAGTTGGTTACACAAAGATTGGTTGGTCAA
TATTTTGGGAAATGATAAGAGACTTTCTTTTGAAATTTGGGGAAAAAAAATCTGTTGAGAATATTTCAAAGAAGTCCAGTTATGAAGAATTATCCAAGCTGGTTTTTAAT
GCTAATAAGAATTTGGATGGAAGTTTTGATGATGAAGTGAAGTTTAAATCTGGGGAACCTCTTCTTGGTTCTCATTCTAAGAAGTTGCCAGTAATGTCTTCATTCTGGGT
TAGAAAAGAAAAAGAAGTGGTGGATTTAAAGCTAGATGAATTTTGTGTGGTTTCTAGAATGTTTGCACATAATGCTTGGAAGGAAGTAAAGCAAGTTTTGGAAGATTATT
TTCATTCTGAAGTTATTCTCAACCCTTTTATGGCAGATAACGCTTTGGTTAAACTGAATGATAAATTTTCTGAAAAGAAGTTTGATGGCAACATTTCTTCTCAAACATTG
AATCTTTTGGATTGTTCAGAAGCTCGTATAGAGGTGAAAAGGAATTTTTGTGGATTTATACCAGCTGAAATTGTGGTTACGGACAAGATTCACGGAAACTTTGCTCTTCG
TTTTGGTGATATATCTTCTTTGGACCCTTCATGTTTTATTCCTATGGACTTATCATTGAGTGATTTTGATAATGAAATTGATTTAAAAAGGGTTTCTCAGGTTTTGATGG
ATGAAGGTTTTTCCTCATCACAAGAAGATCTTAATTCCTTCAATGATCAAGGATTAAACTTTCCAGCCTTGCAAACTTGGGTAAATCAAGAAAACTTTCCATCTTCTAAA
GTTCCCAACGACATGATTAATGATTCATTGGAAACATTAAATGATGGAGGTTTAAATTTAGAAAAGTTGGTTGAGTTACCAAAGGAAAAGGCTGAATTTTCACGCTCCAA
GGAAGCATTAATGGGAGAAGTTAATCATCATTTAATTGGGCCGTTTGAATTTTCAAAGAATAAAAGTGCTTTGTTGCTGGAGAAGAATTTTAATGCTAACGGTAAGGAAT
TTAATGCCATCCATTCAGATTTTAATGGAGCATTTAATGAAGGTGCTGTGCATAAGTCCCTAAATTTTTCAGCCTTGCAAACTTGGGAAAATCAAGAAAACTTTCCATCT
TCTAAAGCTCCCAACGGCATGATCAATGATTCATTAGAAATATTAAATGATGGAGATTTAAATTTAGAAAAGTTGGTTGAGTTACCAAAGGAAAAGGCTGACTATTCACG
TTCAAATGAAGCATTAATGGGAGAAGTTAATTGTCATTTAATTGGGCTGGTTGAATTTTCAAAGAAGAAAAGTGCTTTGTTGCTGGAGAAGAATTTTAATGACAACGTAG
AAGATGAATCGATTGTTCCTAAGGTTTTACATTTAGAAAAGTCGGTTGAGTTACCAAAGGAAAAGGCTGAGATTTCACGTTCCAAGGAAGCATTAATGGAAGAAGTTAGT
GTTAATTTCATTGGGCCGGTTGAGTTTTCAAAGGAGAAAAGTGCTTTGTTGCTGGAGAATGATTTTAATGCCAACGGTAAGGTTTTTAATGGCATCAAATCAGAGATTAG
CAAAGCATTTACTGATGGTGCTTTGCATGAGTCCCAGGTTTTATTATTCTCGCCTATTCAAGACATTCCTTCGGGTTTGAAGTGCTGTAATGCAGTGGGCTTGGAACCAA
ATGAACTGTTTGTTCCTAAGGCTTTAAAGAAGAAATATGAATCATTTCCTCTTCATTATTCTCGAAGGAAATATGAAAAGTCAGAAATTTTGGACTCAATTCCCATTAAT
TCCAATTATAACCCTGATGTTATTGAAGAATCTTGTTCTCAATCTTTGCTCCCTGCTTTGAATCAGTCTAGATGCTGCCAAACTAATCTTAATGAGTTATCAAATTCCAC
ATCATCCAATCAGTATATTCTTTCAAACATTCAATCTGACCGTTCTTTAACAAAGGGGGTTTTTATTCCTTCATCCAAAGTTGAAAACAAAGTTGATCAATCATATTCAT
CTCCTATTGATTCTGATAATGATTCAGTGGTGAGTATTAGTAGTGTAGAGGCTGAAAATCAGTATTTGAATGATGAAATCAATGAATTGTTGGAGGAAGATTCTTTTGCA
CTGGCTTTTAATCGGATTTTCCAGAATAATGAAGATATTTCTGAAGTTCAGTTGAATGATTGTGATGTTTCAGCAACACCCTTAGTATCTGTTCCAAGTAAATTTTCATC
TCTGCTAAAAGATTGTGACATTCAGTTGAAGGAAATTCAGCCCTTTTTACCCCCTGAGCAATCTAAAAATTGTGGAATTTCTTCAAGATTTCTGATTTCCATGGAATGGG
ATGTGATGTTTGATAAATCTAGAGTCTCCAAACAGTTTTCTTGTGTGAGAGATCAATTTAATGAGGTGTTGGGCTCTCCAAAAGGTACTTCATTGCATGAAAAGGCCCTC
ATGAAGTCCAGCCGTTCATGCCAAGCGGAGATAGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAGACTCTTCTTCGGTTGGAAAAGAGGCGGAGCCTACCACCGCCCTCTCACCACAATCAACAATCGTACGCTTGTTGTCGGTTGAACAAGATATGAAGTCATTAAA
GAGTGATGTCGGTGAGATCAAAAAGATCTTGGAAATGATTTGCGAAAAGATGGGAAACAAAGGGGAACCAAGTTGTAACCTAATGGGGCAATCTACCATGGAGAGAACTT
ATCAAGAACAAGATAAAACTTCTCGGGAATTGGAAGAAAGAATGGGACAGCGACAAGAAAGGCAAACAATGGAGCAAAGAATAGTTCAAGAAACACATTTAGCTCCAAGA
AGTTCTCATTTGAATTGGCTTGAGCTAGCTTTAGTCGATTTATTGCAATGCCCGGTTCATTTGTTCTTTCGCAAGAAGTTAAGAGATTCAAATGGAACAATTCAGTTATC
AAAGTTCAATTCCCAGCAAGGTTGGTTCCTTGAGTGCTCTATTTGGCCCCCCTCGGGTGGAAAGAAAAAAGTGCAAGTTCCAGTTGGTTACACAAAGATTGGTTGGTCAA
TATTTTGGGAAATGATAAGAGACTTTCTTTTGAAATTTGGGGAAAAAAAATCTGTTGAGAATATTTCAAAGAAGTCCAGTTATGAAGAATTATCCAAGCTGGTTTTTAAT
GCTAATAAGAATTTGGATGGAAGTTTTGATGATGAAGTGAAGTTTAAATCTGGGGAACCTCTTCTTGGTTCTCATTCTAAGAAGTTGCCAGTAATGTCTTCATTCTGGGT
TAGAAAAGAAAAAGAAGTGGTGGATTTAAAGCTAGATGAATTTTGTGTGGTTTCTAGAATGTTTGCACATAATGCTTGGAAGGAAGTAAAGCAAGTTTTGGAAGATTATT
TTCATTCTGAAGTTATTCTCAACCCTTTTATGGCAGATAACGCTTTGGTTAAACTGAATGATAAATTTTCTGAAAAGAAGTTTGATGGCAACATTTCTTCTCAAACATTG
AATCTTTTGGATTGTTCAGAAGCTCGTATAGAGGTGAAAAGGAATTTTTGTGGATTTATACCAGCTGAAATTGTGGTTACGGACAAGATTCACGGAAACTTTGCTCTTCG
TTTTGGTGATATATCTTCTTTGGACCCTTCATGTTTTATTCCTATGGACTTATCATTGAGTGATTTTGATAATGAAATTGATTTAAAAAGGGTTTCTCAGGTTTTGATGG
ATGAAGGTTTTTCCTCATCACAAGAAGATCTTAATTCCTTCAATGATCAAGGATTAAACTTTCCAGCCTTGCAAACTTGGGTAAATCAAGAAAACTTTCCATCTTCTAAA
GTTCCCAACGACATGATTAATGATTCATTGGAAACATTAAATGATGGAGGTTTAAATTTAGAAAAGTTGGTTGAGTTACCAAAGGAAAAGGCTGAATTTTCACGCTCCAA
GGAAGCATTAATGGGAGAAGTTAATCATCATTTAATTGGGCCGTTTGAATTTTCAAAGAATAAAAGTGCTTTGTTGCTGGAGAAGAATTTTAATGCTAACGGTAAGGAAT
TTAATGCCATCCATTCAGATTTTAATGGAGCATTTAATGAAGGTGCTGTGCATAAGTCCCTAAATTTTTCAGCCTTGCAAACTTGGGAAAATCAAGAAAACTTTCCATCT
TCTAAAGCTCCCAACGGCATGATCAATGATTCATTAGAAATATTAAATGATGGAGATTTAAATTTAGAAAAGTTGGTTGAGTTACCAAAGGAAAAGGCTGACTATTCACG
TTCAAATGAAGCATTAATGGGAGAAGTTAATTGTCATTTAATTGGGCTGGTTGAATTTTCAAAGAAGAAAAGTGCTTTGTTGCTGGAGAAGAATTTTAATGACAACGTAG
AAGATGAATCGATTGTTCCTAAGGTTTTACATTTAGAAAAGTCGGTTGAGTTACCAAAGGAAAAGGCTGAGATTTCACGTTCCAAGGAAGCATTAATGGAAGAAGTTAGT
GTTAATTTCATTGGGCCGGTTGAGTTTTCAAAGGAGAAAAGTGCTTTGTTGCTGGAGAATGATTTTAATGCCAACGGTAAGGTTTTTAATGGCATCAAATCAGAGATTAG
CAAAGCATTTACTGATGGTGCTTTGCATGAGTCCCAGGTTTTATTATTCTCGCCTATTCAAGACATTCCTTCGGGTTTGAAGTGCTGTAATGCAGTGGGCTTGGAACCAA
ATGAACTGTTTGTTCCTAAGGCTTTAAAGAAGAAATATGAATCATTTCCTCTTCATTATTCTCGAAGGAAATATGAAAAGTCAGAAATTTTGGACTCAATTCCCATTAAT
TCCAATTATAACCCTGATGTTATTGAAGAATCTTGTTCTCAATCTTTGCTCCCTGCTTTGAATCAGTCTAGATGCTGCCAAACTAATCTTAATGAGTTATCAAATTCCAC
ATCATCCAATCAGTATATTCTTTCAAACATTCAATCTGACCGTTCTTTAACAAAGGGGGTTTTTATTCCTTCATCCAAAGTTGAAAACAAAGTTGATCAATCATATTCAT
CTCCTATTGATTCTGATAATGATTCAGTGGTGAGTATTAGTAGTGTAGAGGCTGAAAATCAGTATTTGAATGATGAAATCAATGAATTGTTGGAGGAAGATTCTTTTGCA
CTGGCTTTTAATCGGATTTTCCAGAATAATGAAGATATTTCTGAAGTTCAGTTGAATGATTGTGATGTTTCAGCAACACCCTTAGTATCTGTTCCAAGTAAATTTTCATC
TCTGCTAAAAGATTGTGACATTCAGTTGAAGGAAATTCAGCCCTTTTTACCCCCTGAGCAATCTAAAAATTGTGGAATTTCTTCAAGATTTCTGATTTCCATGGAATGGG
ATGTGATGTTTGATAAATCTAGAGTCTCCAAACAGTTTTCTTGTGTGAGAGATCAATTTAATGAGGTGTTGGGCTCTCCAAAAGGTACTTCATTGCATGAAAAGGCCCTC
ATGAAGTCCAGCCGTTCATGCCAAGCGGAGATAGTTTAG
Protein sequenceShow/hide protein sequence
MGDSSSVGKEAEPTTALSPQSTIVRLLSVEQDMKSLKSDVGEIKKILEMICEKMGNKGEPSCNLMGQSTMERTYQEQDKTSRELEERMGQRQERQTMEQRIVQETHLAPR
SSHLNWLELALVDLLQCPVHLFFRKKLRDSNGTIQLSKFNSQQGWFLECSIWPPSGGKKKVQVPVGYTKIGWSIFWEMIRDFLLKFGEKKSVENISKKSSYEELSKLVFN
ANKNLDGSFDDEVKFKSGEPLLGSHSKKLPVMSSFWVRKEKEVVDLKLDEFCVVSRMFAHNAWKEVKQVLEDYFHSEVILNPFMADNALVKLNDKFSEKKFDGNISSQTL
NLLDCSEARIEVKRNFCGFIPAEIVVTDKIHGNFALRFGDISSLDPSCFIPMDLSLSDFDNEIDLKRVSQVLMDEGFSSSQEDLNSFNDQGLNFPALQTWVNQENFPSSK
VPNDMINDSLETLNDGGLNLEKLVELPKEKAEFSRSKEALMGEVNHHLIGPFEFSKNKSALLLEKNFNANGKEFNAIHSDFNGAFNEGAVHKSLNFSALQTWENQENFPS
SKAPNGMINDSLEILNDGDLNLEKLVELPKEKADYSRSNEALMGEVNCHLIGLVEFSKKKSALLLEKNFNDNVEDESIVPKVLHLEKSVELPKEKAEISRSKEALMEEVS
VNFIGPVEFSKEKSALLLENDFNANGKVFNGIKSEISKAFTDGALHESQVLLFSPIQDIPSGLKCCNAVGLEPNELFVPKALKKKYESFPLHYSRRKYEKSEILDSIPIN
SNYNPDVIEESCSQSLLPALNQSRCCQTNLNELSNSTSSNQYILSNIQSDRSLTKGVFIPSSKVENKVDQSYSSPIDSDNDSVVSISSVEAENQYLNDEINELLEEDSFA
LAFNRIFQNNEDISEVQLNDCDVSATPLVSVPSKFSSLLKDCDIQLKEIQPFLPPEQSKNCGISSRFLISMEWDVMFDKSRVSKQFSCVRDQFNEVLGSPKGTSLHEKAL
MKSSRSCQAEIV