; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg011094 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg011094
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionNucleolar protein 58-like
Genome locationscaffold4:27701732..27704364
RNA-Seq ExpressionSpg011094
SyntenySpg011094
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB49850.1 hypothetical protein L484_000844 [Morus notabilis]1.9e-3030.81Show/hide
Query:  PKAKSSKAASPKNPFPEVFRDVNF------QEMMEIMRKRDFLKEKGF---SNRAGALPEFVSRVISQYKWQEFCAHPQEVVVPLVREFYASLREESISM
        P    + AA P +   +V  +  F      +   E +  R+ +KEKGF    +     P F+S VI    WQ FC HP + +VPLV+EFYA+L+ +  + 
Subjt:  PKAKSSKAASPKNPFPEVFRDVNF------QEMMEIMRKRDFLKEKGF---SNRAGALPEFVSRVISQYKWQEFCAHPQEVVVPLVREFYASLREESISM

Query:  AVVRGKMVSFSSMDINRVYRIKAPLHPRGND----VIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISV
          V    ++F+S  IN V  I     P  +D    +I +   +Q+KE LK +A  G QW  S     T    +L+P + VW HFL +RL+ +TH  TIS 
Subjt:  AVVRGKMVSFSSMDINRVYRIKAPLHPRGND----VIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISV

Query:  DRVMLLYCIMKGLEINIGSIIREEILACGRKRAGKLFFGSLITQLCQRVKIVSGKDEEHHFFKPTIDLSLIGKLQPNSLQRKDKASTSQ--------ATP
        +R +LLY ++ G  IN+G +I ++I AC  K  G L+F SLI++LC +  +     E        +DL  I ++     ++ +K    +        +T 
Subjt:  DRVMLLYCIMKGLEINIGSIIREEILACGRKRAGKLFFGSLITQLCQRVKIVSGKDEEHHFFKPTIDLSLIGKLQPNSLQRKDKASTSQ--------ATP

Query:  PSGSNMASPSQQTPFTRPSPSFEALAIAYRQLDQIRENLKTYWAYTKERDEAIREFY
         + S  A+ SQ+    R S         +  L Q +E L  +W Y+++RD A+++ +
Subjt:  PSGSNMASPSQQTPFTRPSPSFEALAIAYRQLDQIRENLKTYWAYTKERDEAIREFY

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]3.0e-3636.88Show/hide
Query:  MRKRDFLKEKGF----SNRAGALPEFVSRVISQYKWQEFCAHPQEVVVPLVREFYASLREESISMAVVRGKMVSFSSMDINRVYRIKAPLHPRGNDVIRN
        ++ R    EKGF    S   G LP F+++VI+Q+ W++FCAHP++ +VPLVREFYA+L +   +   VRG  VS+S   IN V+ +  P+    ++ I N
Subjt:  MRKRDFLKEKGF----SNRAGALPEFVSRVISQYKWQEFCAHPQEVVVPLVREFYASLREESISMAVVRGKMVSFSSMDINRVYRIKAPLHPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKRAGKLFFG
         +   +   L+ VA  G +W  S     T + S L P + VW HFLK+ L+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  ++ G LFF 
Subjt:  PSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKRAGKLFFG

Query:  SLITQLCQRVKIVSGKDEEHHFFKPTIDLSLIGKLQPNSLQRKDKASTSQATPPSGSNMASPS
        SLIT+LC+  +     +EE       ID   + ++         +  T     PS S  A+ S
Subjt:  SLITQLCQRVKIVSGKDEEHHFFKPTIDLSLIGKLQPNSLQRKDKASTSQATPPSGSNMASPS

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]1.5e-4034.46Show/hide
Query:  MRKRDFLKEKGF----SNRAGALPEFVSRVISQYKWQEFCAHPQEVVVPLVREFYASLREESISMAVVRGKMVSFSSMDINRVYRIKAPLHPRGNDVIRN
        ++ R    EKGF    S   G LP F+++VI+Q+ W++FCAHP++ +VPLVREFYA+L +   +   VRG  VS+S   IN V+ +  P+    ++ I+N
Subjt:  MRKRDFLKEKGF----SNRAGALPEFVSRVISQYKWQEFCAHPQEVVVPLVREFYASLREESISMAVVRGKMVSFSSMDINRVYRIKAPLHPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKRAGKLFFG
         + + +   L+ VA  G +W  S     T + S L P + VW HFLK+RL+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  ++ G LFF 
Subjt:  PSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKRAGKLFFG

Query:  SLITQLCQRVKIVSGKDEEHHFFKPTIDLSLIGKL-QPNSLQRKDKASTSQ-ATPPSGSNMASPSQQTPFTRPSPSFEALAIAYRQ--LDQIRENLKTYW
        SLIT+LC+  +     +EE       ID   + ++ Q    +   + S+S+ AT  S        QQ        S + +   +    L    +  + +W
Subjt:  SLITQLCQRVKIVSGKDEEHHFFKPTIDLSLIGKL-QPNSLQRKDKASTSQ-ATPPSGSNMASPSQQTPFTRPSPSFEALAIAYRQ--LDQIRENLKTYW

Query:  AYTKERDEAIREFYLFISPSIAPVFPNFPQSLLPQEDKDSDEEEDENDDEEDEE
        AY+KERD A+++          P FP FPQ +L   D + + E D++   E  E
Subjt:  AYTKERDEAIREFYLFISPSIAPVFPNFPQSLLPQEDKDSDEEEDENDDEEDEE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]4.4e-3234.47Show/hide
Query:  SKAASPKNPFPEVFRDVNFQEMMEIMRKRDFLKEKGF---SNRAGALPEFVSRVISQYKWQEFCAHPQEVVVPLVREFYASLREESISMAVVRGKMVSFS
        ++ AS    F     ++ ++E ++    R    EK F   +++    P F++ VI Q+ WQ FCAHP++ +VPLVREFY ++         +RG  V  S
Subjt:  SKAASPKNPFPEVFRDVNFQEMMEIMRKRDFLKEKGF---SNRAGALPEFVSRVISQYKWQEFCAHPQEVVVPLVREFYASLREESISMAVVRGKMVSFS

Query:  SMDINRVYRIKAPLHPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLEI
           IN ++ +  P+    ++ + + +  ++   L+ VA  G +W  S     T + S L P + VW HFLK+RL+PTTH  T+S + V LLY ++ G  I
Subjt:  SMDINRVYRIKAPLHPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLEI

Query:  NIGSIIREEILACGRKRAGKLFFGSLITQLCQRVK
        N+G +I  EI AC  +++G LFF SLIT +C+  +
Subjt:  NIGSIIREEILACGRKRAGKLFFGSLITQLCQRVK

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]1.4e-3032.67Show/hide
Query:  VPLVREFYASLREESISMAVVRGKMVSFSSMDINRVYRIKAPLHPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLK
        +PLVREFYA+L +   +   VRG  VS+S   IN V+ +  P+    ++ I N +  ++   L+ VA  G +W  S     T + S L P + VW HFLK
Subjt:  VPLVREFYASLREESISMAVVRGKMVSFSSMDINRVYRIKAPLHPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLK

Query:  NRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKRAGKLFFGSLITQLCQRVKIVSGKDEEHHFFKPTIDLSLIGKLQPNSLQRKDKAS
        +RL+PTTH   +S DR++LL+ ++ G  IN+G +I  EI AC  ++ G LFF SLIT+LC+    +  +++ H+     ID   + ++         +  
Subjt:  NRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKRAGKLFFGSLITQLCQRVKIVSGKDEEHHFFKPTIDLSLIGKLQPNSLQRKDKAS

Query:  TSQATPPSGSNMASPSQQTPFTRPSPSFEALAIAYRQLDQIRENLKTYWAYTKERDEAIREFYLFISPSIAPVFPNFPQSLLPQEDKDSDEEEDENDDEE
        T     PS S  A+ S            +AL     Q +   +  + +WAY+KERD A+++          P FP FPQ +L   D + + E D++   E
Subjt:  TSQATPPSGSNMASPSQQTPFTRPSPSFEALAIAYRQLDQIRENLKTYWAYTKERDEAIREFYLFISPSIAPVFPNFPQSLLPQEDKDSDEEEDENDDEE

Query:  DEE
          E
Subjt:  DEE

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)1.4e-3636.88Show/hide
Query:  MRKRDFLKEKGF----SNRAGALPEFVSRVISQYKWQEFCAHPQEVVVPLVREFYASLREESISMAVVRGKMVSFSSMDINRVYRIKAPLHPRGNDVIRN
        ++ R    EKGF    S   G LP F+++VI+Q+ W++FCAHP++ +VPLVREFYA+L +   +   VRG  VS+S   IN V+ +  P+    ++ I N
Subjt:  MRKRDFLKEKGF----SNRAGALPEFVSRVISQYKWQEFCAHPQEVVVPLVREFYASLREESISMAVVRGKMVSFSSMDINRVYRIKAPLHPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKRAGKLFFG
         +   +   L+ VA  G +W  S     T + S L P + VW HFLK+ L+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  ++ G LFF 
Subjt:  PSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKRAGKLFFG

Query:  SLITQLCQRVKIVSGKDEEHHFFKPTIDLSLIGKLQPNSLQRKDKASTSQATPPSGSNMASPS
        SLIT+LC+  +     +EE       ID   + ++         +  T     PS S  A+ S
Subjt:  SLITQLCQRVKIVSGKDEEHHFFKPTIDLSLIGKLQPNSLQRKDKASTSQATPPSGSNMASPS

A0A2P5BCG4 Uncharacterized protein (Fragment)7.4e-4134.46Show/hide
Query:  MRKRDFLKEKGF----SNRAGALPEFVSRVISQYKWQEFCAHPQEVVVPLVREFYASLREESISMAVVRGKMVSFSSMDINRVYRIKAPLHPRGNDVIRN
        ++ R    EKGF    S   G LP F+++VI+Q+ W++FCAHP++ +VPLVREFYA+L +   +   VRG  VS+S   IN V+ +  P+    ++ I+N
Subjt:  MRKRDFLKEKGF----SNRAGALPEFVSRVISQYKWQEFCAHPQEVVVPLVREFYASLREESISMAVVRGKMVSFSSMDINRVYRIKAPLHPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKRAGKLFFG
         + + +   L+ VA  G +W  S     T + S L P + VW HFLK+RL+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  ++ G LFF 
Subjt:  PSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKRAGKLFFG

Query:  SLITQLCQRVKIVSGKDEEHHFFKPTIDLSLIGKL-QPNSLQRKDKASTSQ-ATPPSGSNMASPSQQTPFTRPSPSFEALAIAYRQ--LDQIRENLKTYW
        SLIT+LC+  +     +EE       ID   + ++ Q    +   + S+S+ AT  S        QQ        S + +   +    L    +  + +W
Subjt:  SLITQLCQRVKIVSGKDEEHHFFKPTIDLSLIGKL-QPNSLQRKDKASTSQ-ATPPSGSNMASPSQQTPFTRPSPSFEALAIAYRQ--LDQIRENLKTYW

Query:  AYTKERDEAIREFYLFISPSIAPVFPNFPQSLLPQEDKDSDEEEDENDDEEDEE
        AY+KERD A+++          P FP FPQ +L   D + + E D++   E  E
Subjt:  AYTKERDEAIREFYLFISPSIAPVFPNFPQSLLPQEDKDSDEEEDENDDEEDEE

A0A2P5DAQ2 Uncharacterized protein2.1e-3234.47Show/hide
Query:  SKAASPKNPFPEVFRDVNFQEMMEIMRKRDFLKEKGF---SNRAGALPEFVSRVISQYKWQEFCAHPQEVVVPLVREFYASLREESISMAVVRGKMVSFS
        ++ AS    F     ++ ++E ++    R    EK F   +++    P F++ VI Q+ WQ FCAHP++ +VPLVREFY ++         +RG  V  S
Subjt:  SKAASPKNPFPEVFRDVNFQEMMEIMRKRDFLKEKGF---SNRAGALPEFVSRVISQYKWQEFCAHPQEVVVPLVREFYASLREESISMAVVRGKMVSFS

Query:  SMDINRVYRIKAPLHPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLEI
           IN ++ +  P+    ++ + + +  ++   L+ VA  G +W  S     T + S L P + VW HFLK+RL+PTTH  T+S + V LLY ++ G  I
Subjt:  SMDINRVYRIKAPLHPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLEI

Query:  NIGSIIREEILACGRKRAGKLFFGSLITQLCQRVK
        N+G +I  EI AC  +++G LFF SLIT +C+  +
Subjt:  NIGSIIREEILACGRKRAGKLFFGSLITQLCQRVK

A0A2P5DXM3 Uncharacterized protein6.9e-3132.67Show/hide
Query:  VPLVREFYASLREESISMAVVRGKMVSFSSMDINRVYRIKAPLHPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLK
        +PLVREFYA+L +   +   VRG  VS+S   IN V+ +  P+    ++ I N +  ++   L+ VA  G +W  S     T + S L P + VW HFLK
Subjt:  VPLVREFYASLREESISMAVVRGKMVSFSSMDINRVYRIKAPLHPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLK

Query:  NRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKRAGKLFFGSLITQLCQRVKIVSGKDEEHHFFKPTIDLSLIGKLQPNSLQRKDKAS
        +RL+PTTH   +S DR++LL+ ++ G  IN+G +I  EI AC  ++ G LFF SLIT+LC+    +  +++ H+     ID   + ++         +  
Subjt:  NRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKRAGKLFFGSLITQLCQRVKIVSGKDEEHHFFKPTIDLSLIGKLQPNSLQRKDKAS

Query:  TSQATPPSGSNMASPSQQTPFTRPSPSFEALAIAYRQLDQIRENLKTYWAYTKERDEAIREFYLFISPSIAPVFPNFPQSLLPQEDKDSDEEEDENDDEE
        T     PS S  A+ S            +AL     Q +   +  + +WAY+KERD A+++          P FP FPQ +L   D + + E D++   E
Subjt:  TSQATPPSGSNMASPSQQTPFTRPSPSFEALAIAYRQLDQIRENLKTYWAYTKERDEAIREFYLFISPSIAPVFPNFPQSLLPQEDKDSDEEEDENDDEE

Query:  DEE
          E
Subjt:  DEE

W9RBS1 Uncharacterized protein9.0e-3130.81Show/hide
Query:  PKAKSSKAASPKNPFPEVFRDVNF------QEMMEIMRKRDFLKEKGF---SNRAGALPEFVSRVISQYKWQEFCAHPQEVVVPLVREFYASLREESISM
        P    + AA P +   +V  +  F      +   E +  R+ +KEKGF    +     P F+S VI    WQ FC HP + +VPLV+EFYA+L+ +  + 
Subjt:  PKAKSSKAASPKNPFPEVFRDVNF------QEMMEIMRKRDFLKEKGF---SNRAGALPEFVSRVISQYKWQEFCAHPQEVVVPLVREFYASLREESISM

Query:  AVVRGKMVSFSSMDINRVYRIKAPLHPRGND----VIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISV
          V    ++F+S  IN V  I     P  +D    +I +   +Q+KE LK +A  G QW  S     T    +L+P + VW HFL +RL+ +TH  TIS 
Subjt:  AVVRGKMVSFSSMDINRVYRIKAPLHPRGND----VIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISV

Query:  DRVMLLYCIMKGLEINIGSIIREEILACGRKRAGKLFFGSLITQLCQRVKIVSGKDEEHHFFKPTIDLSLIGKLQPNSLQRKDKASTSQ--------ATP
        +R +LLY ++ G  IN+G +I ++I AC  K  G L+F SLI++LC +  +     E        +DL  I ++     ++ +K    +        +T 
Subjt:  DRVMLLYCIMKGLEINIGSIIREEILACGRKRAGKLFFGSLITQLCQRVKIVSGKDEEHHFFKPTIDLSLIGKLQPNSLQRKDKASTSQ--------ATP

Query:  PSGSNMASPSQQTPFTRPSPSFEALAIAYRQLDQIRENLKTYWAYTKERDEAIREFY
         + S  A+ SQ+    R S         +  L Q +E L  +W Y+++RD A+++ +
Subjt:  PSGSNMASPSQQTPFTRPSPSFEALAIAYRQLDQIRENLKTYWAYTKERDEAIREFY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAACTCGCCCAAATCATCATCATCTCGCAAGATCACTCGAGCCCAGAATGCTCAAACCGCCCAAGAAGTTGAAGCAAACGTTCAACGTCAAGAAGAGCACCCCGA
CGCCCCCATGCACGGCACGAGAAGGACGAGACCTTCGGGTTTCTCGCCGGTGATCGTGAACCAAGGAACCGCTGCTCAAACTCCCTCTTCCTCGACAATGTCGGCCACCT
CGAGGGAGAATCCGAGTTCGTCTCAACTTAGGAGGTCCACGCGCGCCAATGCCGTCCATAAAACCCAAAAACCCGCAACCCAACAATTCAGAAAATGCTCGCAGGAGTGG
TTTTCAATGATCCGGGCGATGAGAGCTCAAAGATGCGCAACTCTTGAAGAAGAAGCGAATAGGCGAGATGAAGAAGAAGCCACCAAGGCAGCAGAAAGCTCTCGGCAAAG
AGAGGCTTCAACGGGTAAACCTTCTGAACCTTCAACTAACCCCTCTTCATCTTGCAGGAACAAACCATTCGTTACTTACAGTGCAAGGAAGGGGAGTCCCAAGAAAGTTG
TGCCCGAAAAGCCGCTTGTAATCGAGCCCCTTAAAACTGCAAGAATGCCCCCGGATGTGTTCGAGGACATAATCCGCCAAGCTGTGGCAAAGGCTCTAGTGATTTTCGAA
GGCTACAAGGCTGAACAAGAAGCCTTGAAAGATATTGAGGCTGAAAGAGAACTTGAAAACCAGCATATGAGGAAAGAGGATGAGGTTGCGAGAAAAAGAGATCTTGAAGA
TGAAAAGAAGAAAGAAGAGGAAAGGCAAGAGGCCGAGAGGGCCAAGTTAGCTGAAGAAGAGGAAAGAAAGTTAGGTGAAAACCTTAGGAGGGCAGCAGTTGAATTGCAAC
TTCTTGAGGAAGAAAAACAAAGAAAAGAAAAGGCCTATCAGGGGCCACATGGAGAAAATTCAGAGAAAAAGAAAGAAAGAGAAGTAGTGGATGAAGGCCAGAATGCGACC
GCATCTAGGCCGCATTCTGGTGAAAGCCACGAAGAGGCCACTGAAGCTCAGCCAGCTGATGAGGTTTTCGAACCTCTATTCAAAGATGACCCACCAGTAGTTGACAGCAC
CTCTTCGGGAGAGAAGAGGGATGAAGAAGAGAAAGAAAGCAAGGAGGTCGAGACCTCCAGTGACTCTGAAACAGAATCTGACTTGGAGATCAAGGAATTGGATGACGACC
AAGTCCATGACGCCTCTCCTGCAGCTCCTCCTACCCTCTCACCCACCAAGCCAAAAGCCAAATCTTCTAAGGCTGCATCTCCCAAAAATCCTTTCCCCGAAGTATTCAGA
GATGTAAATTTTCAGGAAATGATGGAGATAATGAGAAAAAGAGATTTCCTCAAGGAGAAGGGATTCTCTAACAGAGCTGGAGCACTACCAGAGTTCGTAAGCAGAGTTAT
CTCACAATACAAGTGGCAGGAGTTCTGTGCTCACCCTCAGGAGGTCGTGGTGCCTTTAGTGCGAGAGTTTTACGCCAGCCTGAGGGAGGAAAGCATCAGTATGGCAGTGG
TGAGAGGCAAAATGGTCAGCTTCTCTTCAATGGACATCAACCGGGTGTACAGAATCAAAGCACCCTTACATCCAAGAGGGAATGATGTCATTAGGAACCCCTCGGCCAAG
CAGATGAAAGAAGCGTTGAAATTAGTGGCCAACAAGGGAGTTCAGTGGAAAGAGTCCCAGACGAAGGTGAAGACTTTAGTGCCCAGCGATCTCAAGCCAGAATCGGCAGT
TTGGCTTCACTTTCTGAAGAACCGTTTGATGCCGACCACCCACGACAACACCATCTCAGTAGATAGAGTCATGCTCCTCTACTGTATTATGAAGGGGTTGGAGATCAATA
TTGGGAGCATAATCAGGGAGGAGATTCTAGCCTGTGGAAGGAAAAGAGCAGGTAAACTTTTCTTTGGATCACTTATCACCCAGCTTTGTCAGAGGGTGAAGATAGTTTCG
GGCAAGGATGAGGAGCATCATTTCTTCAAGCCTACCATTGACCTGTCCTTGATTGGGAAGCTTCAACCGAATAGCCTCCAAAGGAAAGATAAAGCCTCCACATCTCAGGC
CACTCCACCATCAGGGTCGAACATGGCTTCTCCATCCCAACAAACTCCTTTTACAAGGCCCTCACCATCATTTGAAGCCCTAGCCATTGCCTACCGTCAACTAGATCAAA
TCAGGGAAAACCTGAAGACTTATTGGGCATATACAAAGGAGAGGGATGAAGCCATTAGAGAGTTTTATCTCTTTATCTCCCCGAGTATTGCCCCGGTCTTTCCCAATTTC
CCTCAATCGCTGCTGCCTCAAGAAGACAAGGATTCTGATGAAGAAGAAGATGAAAATGATGATGAAGAAGATGAAGAGAAAGAGAGTTCCTTGGACGAGGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGAACTCGCCCAAATCATCATCATCTCGCAAGATCACTCGAGCCCAGAATGCTCAAACCGCCCAAGAAGTTGAAGCAAACGTTCAACGTCAAGAAGAGCACCCCGA
CGCCCCCATGCACGGCACGAGAAGGACGAGACCTTCGGGTTTCTCGCCGGTGATCGTGAACCAAGGAACCGCTGCTCAAACTCCCTCTTCCTCGACAATGTCGGCCACCT
CGAGGGAGAATCCGAGTTCGTCTCAACTTAGGAGGTCCACGCGCGCCAATGCCGTCCATAAAACCCAAAAACCCGCAACCCAACAATTCAGAAAATGCTCGCAGGAGTGG
TTTTCAATGATCCGGGCGATGAGAGCTCAAAGATGCGCAACTCTTGAAGAAGAAGCGAATAGGCGAGATGAAGAAGAAGCCACCAAGGCAGCAGAAAGCTCTCGGCAAAG
AGAGGCTTCAACGGGTAAACCTTCTGAACCTTCAACTAACCCCTCTTCATCTTGCAGGAACAAACCATTCGTTACTTACAGTGCAAGGAAGGGGAGTCCCAAGAAAGTTG
TGCCCGAAAAGCCGCTTGTAATCGAGCCCCTTAAAACTGCAAGAATGCCCCCGGATGTGTTCGAGGACATAATCCGCCAAGCTGTGGCAAAGGCTCTAGTGATTTTCGAA
GGCTACAAGGCTGAACAAGAAGCCTTGAAAGATATTGAGGCTGAAAGAGAACTTGAAAACCAGCATATGAGGAAAGAGGATGAGGTTGCGAGAAAAAGAGATCTTGAAGA
TGAAAAGAAGAAAGAAGAGGAAAGGCAAGAGGCCGAGAGGGCCAAGTTAGCTGAAGAAGAGGAAAGAAAGTTAGGTGAAAACCTTAGGAGGGCAGCAGTTGAATTGCAAC
TTCTTGAGGAAGAAAAACAAAGAAAAGAAAAGGCCTATCAGGGGCCACATGGAGAAAATTCAGAGAAAAAGAAAGAAAGAGAAGTAGTGGATGAAGGCCAGAATGCGACC
GCATCTAGGCCGCATTCTGGTGAAAGCCACGAAGAGGCCACTGAAGCTCAGCCAGCTGATGAGGTTTTCGAACCTCTATTCAAAGATGACCCACCAGTAGTTGACAGCAC
CTCTTCGGGAGAGAAGAGGGATGAAGAAGAGAAAGAAAGCAAGGAGGTCGAGACCTCCAGTGACTCTGAAACAGAATCTGACTTGGAGATCAAGGAATTGGATGACGACC
AAGTCCATGACGCCTCTCCTGCAGCTCCTCCTACCCTCTCACCCACCAAGCCAAAAGCCAAATCTTCTAAGGCTGCATCTCCCAAAAATCCTTTCCCCGAAGTATTCAGA
GATGTAAATTTTCAGGAAATGATGGAGATAATGAGAAAAAGAGATTTCCTCAAGGAGAAGGGATTCTCTAACAGAGCTGGAGCACTACCAGAGTTCGTAAGCAGAGTTAT
CTCACAATACAAGTGGCAGGAGTTCTGTGCTCACCCTCAGGAGGTCGTGGTGCCTTTAGTGCGAGAGTTTTACGCCAGCCTGAGGGAGGAAAGCATCAGTATGGCAGTGG
TGAGAGGCAAAATGGTCAGCTTCTCTTCAATGGACATCAACCGGGTGTACAGAATCAAAGCACCCTTACATCCAAGAGGGAATGATGTCATTAGGAACCCCTCGGCCAAG
CAGATGAAAGAAGCGTTGAAATTAGTGGCCAACAAGGGAGTTCAGTGGAAAGAGTCCCAGACGAAGGTGAAGACTTTAGTGCCCAGCGATCTCAAGCCAGAATCGGCAGT
TTGGCTTCACTTTCTGAAGAACCGTTTGATGCCGACCACCCACGACAACACCATCTCAGTAGATAGAGTCATGCTCCTCTACTGTATTATGAAGGGGTTGGAGATCAATA
TTGGGAGCATAATCAGGGAGGAGATTCTAGCCTGTGGAAGGAAAAGAGCAGGTAAACTTTTCTTTGGATCACTTATCACCCAGCTTTGTCAGAGGGTGAAGATAGTTTCG
GGCAAGGATGAGGAGCATCATTTCTTCAAGCCTACCATTGACCTGTCCTTGATTGGGAAGCTTCAACCGAATAGCCTCCAAAGGAAAGATAAAGCCTCCACATCTCAGGC
CACTCCACCATCAGGGTCGAACATGGCTTCTCCATCCCAACAAACTCCTTTTACAAGGCCCTCACCATCATTTGAAGCCCTAGCCATTGCCTACCGTCAACTAGATCAAA
TCAGGGAAAACCTGAAGACTTATTGGGCATATACAAAGGAGAGGGATGAAGCCATTAGAGAGTTTTATCTCTTTATCTCCCCGAGTATTGCCCCGGTCTTTCCCAATTTC
CCTCAATCGCTGCTGCCTCAAGAAGACAAGGATTCTGATGAAGAAGAAGATGAAAATGATGATGAAGAAGATGAAGAGAAAGAGAGTTCCTTGGACGAGGACTAG
Protein sequenceShow/hide protein sequence
MKNSPKSSSSRKITRAQNAQTAQEVEANVQRQEEHPDAPMHGTRRTRPSGFSPVIVNQGTAAQTPSSSTMSATSRENPSSSQLRRSTRANAVHKTQKPATQQFRKCSQEW
FSMIRAMRAQRCATLEEEANRRDEEEATKAAESSRQREASTGKPSEPSTNPSSSCRNKPFVTYSARKGSPKKVVPEKPLVIEPLKTARMPPDVFEDIIRQAVAKALVIFE
GYKAEQEALKDIEAERELENQHMRKEDEVARKRDLEDEKKKEEERQEAERAKLAEEEERKLGENLRRAAVELQLLEEEKQRKEKAYQGPHGENSEKKKEREVVDEGQNAT
ASRPHSGESHEEATEAQPADEVFEPLFKDDPPVVDSTSSGEKRDEEEKESKEVETSSDSETESDLEIKELDDDQVHDASPAAPPTLSPTKPKAKSSKAASPKNPFPEVFR
DVNFQEMMEIMRKRDFLKEKGFSNRAGALPEFVSRVISQYKWQEFCAHPQEVVVPLVREFYASLREESISMAVVRGKMVSFSSMDINRVYRIKAPLHPRGNDVIRNPSAK
QMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKRAGKLFFGSLITQLCQRVKIVS
GKDEEHHFFKPTIDLSLIGKLQPNSLQRKDKASTSQATPPSGSNMASPSQQTPFTRPSPSFEALAIAYRQLDQIRENLKTYWAYTKERDEAIREFYLFISPSIAPVFPNF
PQSLLPQEDKDSDEEEDENDDEEDEEKESSLDED