; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg005684 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg005684
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionNucleolar protein 58-like
Genome locationscaffold8:20226517..20232166
RNA-Seq ExpressionSpg005684
SyntenySpg005684
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB49850.1 hypothetical protein L484_000844 [Morus notabilis]7.1e-2931.09Show/hide
Query:  KPKAKSPKAASPKNPFPEVFRDVNFQERM-EIMRKRDFLNEKGF-SNRAGTL--PEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVSLREESISMAMVR
        K  A  P +++ +      F D   ++R  E +  R+ + EKGF  + + TL  P F+S VI    WQ FC HP + +VPLV+EFY +L+ +  +   V 
Subjt:  KPKAKSPKAASPKNPFPEVFRDVNFQERM-EIMRKRDFLNEKGF-SNRAGTL--PEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVSLREESISMAMVR

Query:  GKMVSFSSVDINRVYRLKAPLNPRGND----VIRNPSAKQMKEALKLVANKGVKWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVM
           ++F+S  IN V  +     P  +D    +I +   +Q+KE LK +A  G +W  S     T    +L+P + VW HFL +RL+ +TH  TIS +R +
Subjt:  GKMVSFSSVDINRVYRLKAPLNPRGND----VIRNPSAKQMKEALKLVANKGVKWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVM

Query:  LLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKRQQNSIQRKDKA-FTSQATPPSGSSMAFPSQ
        LLY ++ G  IN+G +I D+I AC  K  G L+F SLI++LC +  +     E R      +DL  I +      ++ +K     +   PS  S    + 
Subjt:  LLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKRQQNSIQRKDKA-FTSQATPPSGSSMAFPSQ

Query:  HTPFIGPSPSSEALA-----------IAYRQLDQIRENLKTYWAYAKERDEAIREFY
        HT     + S E L              +  L Q +E L  +W Y+++RD A+++ +
Subjt:  HTPFIGPSPSSEALA-----------IAYRQLDQIRENLKTYWAYAKERDEAIREFY

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]6.0e-3637.21Show/hide
Query:  MRKRDFLNEKGF----SNRAGTLPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVSLREESISMAMVRGKMVSFSSVDINRVYRLKAPLNPRGNDVIRN
        ++ R    EKGF    S   G LP F+++VI+Q+ W++FCAHP++ +VPLVREFY +L +   +   VRG  VS+S   IN V+ L  P++   ++ I N
Subjt:  MRKRDFLNEKGF----SNRAGTLPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVSLREESISMAMVRGKMVSFSSVDINRVYRLKAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVKWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFG
         +   +   L+ VA  G +W  S     T + S L P + VW HFLK+ L+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  ++ G LFF 
Subjt:  PSAKQMKEALKLVANKGVKWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFG

Query:  SLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKRQQNSIQRKDKAFTSQATPPSGSS
        SLIT+LC+  +     +EE+      ID   + +  Q       +   S + P + SS
Subjt:  SLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKRQQNSIQRKDKAFTSQATPPSGSS

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]6.2e-4133.9Show/hide
Query:  MRKRDFLNEKGF----SNRAGTLPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVSLREESISMAMVRGKMVSFSSVDINRVYRLKAPLNPRGNDVIRN
        ++ R    EKGF    S   G LP F+++VI+Q+ W++FCAHP++ +VPLVREFY +L +   +   VRG  VS+S   IN V+ L  P++   ++ I+N
Subjt:  MRKRDFLNEKGF----SNRAGTLPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVSLREESISMAMVRGKMVSFSSVDINRVYRLKAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVKWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFG
         + + +   L+ VA  G +W  S     T + S L P + VW HFLK+RL+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  ++ G LFF 
Subjt:  PSAKQMKEALKLVANKGVKWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFG

Query:  SLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGK-RQQNSIQRKDKAFTSQ-ATPPSGSSMAFPSQHTPFIGPSPSSEALAIAYRQ--LDQIRENLKTYW
        SLIT+LC+  +     +EE+      ID   + +  Q+   +   +  +S+ AT  S  +     Q    +    S + +   +    L    +  + +W
Subjt:  SLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGK-RQQNSIQRKDKAFTSQ-ATPPSGSSMAFPSQHTPFIGPSPSSEALAIAYRQ--LDQIRENLKTYW

Query:  AYAKERDEAIREFYLSIASSIAPVFPNFPQSLLPQEDKDSDDEEDENDDEENEE
        AY+KERD A+++   +  +   P FP FPQ +L   D + + E D++   E  E
Subjt:  AYAKERDEAIREFYLSIASSIAPVFPNFPQSLLPQEDKDSDDEEDENDDEENEE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]1.8e-3234.85Show/hide
Query:  ASPKNPFPEVFRDVNFQERMEIMRKRDFLNEKGF---SNRAGTLPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVSLREESISMAMVRGKMVSFSSVD
        AS    F     ++ ++E ++    R    EK F   +++    P F++ VI Q+ WQ FCAHP++ +VPLVREFY ++         +RG  V  S   
Subjt:  ASPKNPFPEVFRDVNFQERMEIMRKRDFLNEKGF---SNRAGTLPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVSLREESISMAMVRGKMVSFSSVD

Query:  INRVYRLKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVKWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINIG
        IN ++ L  P++   ++ + + +  ++   L+ VA  G +W  S     T + S L P + VW HFLK+RL+PTTH  T+S + V LLY ++ G  IN+G
Subjt:  INRVYRLKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVKWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINIG

Query:  SIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEER
         +I  EI AC  +++G LFF SLIT +C+  +     +EE+
Subjt:  SIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEER

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]4.9e-3033.33Show/hide
Query:  VPLVREFYVSLREESISMAMVRGKMVSFSSVDINRVYRLKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVKWKESQTKVKTLVPSDLKPESAVWLHFLK
        +PLVREFY +L +   +   VRG  VS+S   IN V+ L  P++   ++ I N +  ++   L+ VA  G +W  S     T + S L P + VW HFLK
Subjt:  VPLVREFYVSLREESISMAMVRGKMVSFSSVDINRVYRLKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVKWKESQTKVKTLVPSDLKPESAVWLHFLK

Query:  NRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKRQQNSIQRKDKAF
        +RL+PTTH   +S DR++LL+ ++ G  IN+G +I  EI AC  ++ G LFF SLIT+LC+    +   +EE+      ID   + +  Q       +  
Subjt:  NRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKRQQNSIQRKDKAF

Query:  TSQATPPSGSSMAFPSQHTPFIGPSPSSEALAIAYRQLDQIRENLKTYWAYAKERDEAIREFYLSIASSIAPVFPNFPQSLLPQEDKDSDDEEDENDDEE
        T     PS S  A  S            +AL     Q +   +  + +WAY+KERD A+++   +  +   P FP FPQ +L   D + + E D++   E
Subjt:  TSQATPPSGSSMAFPSQHTPFIGPSPSSEALAIAYRQLDQIRENLKTYWAYAKERDEAIREFYLSIASSIAPVFPNFPQSLLPQEDKDSDDEEDENDDEE

Query:  NEE
          E
Subjt:  NEE

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)2.9e-3637.21Show/hide
Query:  MRKRDFLNEKGF----SNRAGTLPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVSLREESISMAMVRGKMVSFSSVDINRVYRLKAPLNPRGNDVIRN
        ++ R    EKGF    S   G LP F+++VI+Q+ W++FCAHP++ +VPLVREFY +L +   +   VRG  VS+S   IN V+ L  P++   ++ I N
Subjt:  MRKRDFLNEKGF----SNRAGTLPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVSLREESISMAMVRGKMVSFSSVDINRVYRLKAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVKWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFG
         +   +   L+ VA  G +W  S     T + S L P + VW HFLK+ L+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  ++ G LFF 
Subjt:  PSAKQMKEALKLVANKGVKWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFG

Query:  SLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKRQQNSIQRKDKAFTSQATPPSGSS
        SLIT+LC+  +     +EE+      ID   + +  Q       +   S + P + SS
Subjt:  SLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKRQQNSIQRKDKAFTSQATPPSGSS

A0A2P5BCG4 Uncharacterized protein (Fragment)3.0e-4133.9Show/hide
Query:  MRKRDFLNEKGF----SNRAGTLPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVSLREESISMAMVRGKMVSFSSVDINRVYRLKAPLNPRGNDVIRN
        ++ R    EKGF    S   G LP F+++VI+Q+ W++FCAHP++ +VPLVREFY +L +   +   VRG  VS+S   IN V+ L  P++   ++ I+N
Subjt:  MRKRDFLNEKGF----SNRAGTLPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVSLREESISMAMVRGKMVSFSSVDINRVYRLKAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVKWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFG
         + + +   L+ VA  G +W  S     T + S L P + VW HFLK+RL+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  ++ G LFF 
Subjt:  PSAKQMKEALKLVANKGVKWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFG

Query:  SLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGK-RQQNSIQRKDKAFTSQ-ATPPSGSSMAFPSQHTPFIGPSPSSEALAIAYRQ--LDQIRENLKTYW
        SLIT+LC+  +     +EE+      ID   + +  Q+   +   +  +S+ AT  S  +     Q    +    S + +   +    L    +  + +W
Subjt:  SLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGK-RQQNSIQRKDKAFTSQ-ATPPSGSSMAFPSQHTPFIGPSPSSEALAIAYRQ--LDQIRENLKTYW

Query:  AYAKERDEAIREFYLSIASSIAPVFPNFPQSLLPQEDKDSDDEEDENDDEENEE
        AY+KERD A+++   +  +   P FP FPQ +L   D + + E D++   E  E
Subjt:  AYAKERDEAIREFYLSIASSIAPVFPNFPQSLLPQEDKDSDDEEDENDDEENEE

A0A2P5DAQ2 Uncharacterized protein8.8e-3334.85Show/hide
Query:  ASPKNPFPEVFRDVNFQERMEIMRKRDFLNEKGF---SNRAGTLPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVSLREESISMAMVRGKMVSFSSVD
        AS    F     ++ ++E ++    R    EK F   +++    P F++ VI Q+ WQ FCAHP++ +VPLVREFY ++         +RG  V  S   
Subjt:  ASPKNPFPEVFRDVNFQERMEIMRKRDFLNEKGF---SNRAGTLPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVSLREESISMAMVRGKMVSFSSVD

Query:  INRVYRLKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVKWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINIG
        IN ++ L  P++   ++ + + +  ++   L+ VA  G +W  S     T + S L P + VW HFLK+RL+PTTH  T+S + V LLY ++ G  IN+G
Subjt:  INRVYRLKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVKWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINIG

Query:  SIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEER
         +I  EI AC  +++G LFF SLIT +C+  +     +EE+
Subjt:  SIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEER

A0A2P5DXM3 Uncharacterized protein2.4e-3033.33Show/hide
Query:  VPLVREFYVSLREESISMAMVRGKMVSFSSVDINRVYRLKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVKWKESQTKVKTLVPSDLKPESAVWLHFLK
        +PLVREFY +L +   +   VRG  VS+S   IN V+ L  P++   ++ I N +  ++   L+ VA  G +W  S     T + S L P + VW HFLK
Subjt:  VPLVREFYVSLREESISMAMVRGKMVSFSSVDINRVYRLKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVKWKESQTKVKTLVPSDLKPESAVWLHFLK

Query:  NRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKRQQNSIQRKDKAF
        +RL+PTTH   +S DR++LL+ ++ G  IN+G +I  EI AC  ++ G LFF SLIT+LC+    +   +EE+      ID   + +  Q       +  
Subjt:  NRLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKRQQNSIQRKDKAF

Query:  TSQATPPSGSSMAFPSQHTPFIGPSPSSEALAIAYRQLDQIRENLKTYWAYAKERDEAIREFYLSIASSIAPVFPNFPQSLLPQEDKDSDDEEDENDDEE
        T     PS S  A  S            +AL     Q +   +  + +WAY+KERD A+++   +  +   P FP FPQ +L   D + + E D++   E
Subjt:  TSQATPPSGSSMAFPSQHTPFIGPSPSSEALAIAYRQLDQIRENLKTYWAYAKERDEAIREFYLSIASSIAPVFPNFPQSLLPQEDKDSDDEEDENDDEE

Query:  NEE
          E
Subjt:  NEE

W9RBS1 Uncharacterized protein3.5e-2931.09Show/hide
Query:  KPKAKSPKAASPKNPFPEVFRDVNFQERM-EIMRKRDFLNEKGF-SNRAGTL--PEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVSLREESISMAMVR
        K  A  P +++ +      F D   ++R  E +  R+ + EKGF  + + TL  P F+S VI    WQ FC HP + +VPLV+EFY +L+ +  +   V 
Subjt:  KPKAKSPKAASPKNPFPEVFRDVNFQERM-EIMRKRDFLNEKGF-SNRAGTL--PEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVSLREESISMAMVR

Query:  GKMVSFSSVDINRVYRLKAPLNPRGND----VIRNPSAKQMKEALKLVANKGVKWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVM
           ++F+S  IN V  +     P  +D    +I +   +Q+KE LK +A  G +W  S     T    +L+P + VW HFL +RL+ +TH  TIS +R +
Subjt:  GKMVSFSSVDINRVYRLKAPLNPRGND----VIRNPSAKQMKEALKLVANKGVKWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVM

Query:  LLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKRQQNSIQRKDKA-FTSQATPPSGSSMAFPSQ
        LLY ++ G  IN+G +I D+I AC  K  G L+F SLI++LC +  +     E R      +DL  I +      ++ +K     +   PS  S    + 
Subjt:  LLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKRQQNSIQRKDKA-FTSQATPPSGSSMAFPSQ

Query:  HTPFIGPSPSSEALA-----------IAYRQLDQIRENLKTYWAYAKERDEAIREFY
        HT     + S E L              +  L Q +E L  +W Y+++RD A+++ +
Subjt:  HTPFIGPSPSSEALA-----------IAYRQLDQIRENLKTYWAYAKERDEAIREFY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGAGAAAGTTTTGCCCAAACAGAGGGAAGACAAGGGCAAAGGTATTGCTGAAGCATCGGTTGAGGCTGAGAATCTTGATGCTGAGGAGCCACGGTTGCCATACAA
CCGCTTCATCAATAATCTTGCCCGAGCAAACTGTCCGGGAAGTTGGCATCGAGGGGGTTCAGTGGAGACTGTCGAAGCATTTCAAGCAGCTTCCCTAAAGAGCGAAGCCA
ATACATGGATGGGCTTCATCAAGTTGCGCGTACTGCCGACAACTCACGATTCAACGGTGTCTCGGGTTAGGGTACTTCTTGTGTTTGCTAGTCTGCGTTCCATGAGCATT
GATGTGGGCAAAATAATTTCCAGTGAGATTTTCGATTTCTGGCGGAAAAAGGTTGGGAAGCTGTTTTTCCCGAACACCATAACGATGTTATGCCAAAGGGCAGGGGTTCC
AATGAATGCGGATGATGTCACTCTAATAGACAAGGGAATAATTGACACGCCAAACTTGGCTAGGCTTCAAAGGACACAAGAGGCACGCCAAGGTGGTTTGGTGTGCGGCA
TCCATCAAATACAAGAGCAATTACAGATGCATTCCAGCAGGATGGAGTTTGTCGAAAGCTTGGACTTGTTAAGCTTAATTAGATTAAGGTGTTATGATCTCCAGAAACCT
TTTGCTTGGGCATTTCTTCTAGCCTGGTCATTGCTGCGGCAAGAAGATTCTGAGAATTATTTTGCTGCAGCAGAACTTGGTTTTGCAGAATGCTCAGACCCTGGATTCGA
TAAATACTTATTACTTGCTGCGGATTGTGTATACTTGCACAATGTTCTAGAGTTTCTTTTTGCTATATTTTCACCCACTCTGGAAACCCAACACCCATCTTCGTCTTGTA
GGAACAAACCATTCGTTACCAATAGTGCAAGGAAGATGAGTCCTAAGAAAGTTGTGATCGAAAAGCCGCTTGTTATTGAGCCTCTCAAAGTAGCAAGAATTCCCCCGGAC
GTGTTCGAGGACATAATTTGCCAAGCTGTGGCAAAGGCCCTTGTGATTGCTGAAGGATATAAGGTTGAACAAGAAGCCTTGAAGGATATTGAGGCAGAGAGAGAGATGGA
AAATCAACACATGAGGGAAGACGATGAAGGTGCAAGAGAAAGAGATTTTGAAGAAGAGAGGAAAAAGGAAGAGGAAAGGCAAGAGGTCGAGAGGGCCTTAAAAGCTGAAA
AAGAAAGAAAGTTAGATGAAGACCTCAGGAGGGTAGTTGCTGATTTGCAACTTCTTGAGGAAGAAAAACACGGAAGGGAAGAGTTGAAAGAAGACGAAAAAAGAAGGAAG
GAAGTTGAAGACTTCCTTGCAGCTTTTGAACCACTCCACAAGGCTCAAAGCCTGGCAGAGGCCACTGAAGTTCAGCCTGCTGATGAGGTTTTCGAACCTCTATTCAAATA
TGATCCACCAGCAGCTGATAGCACCTCTTCGGGAGAGAAGAGGGATGAAGAAGAAAAAGAAAGCAAGGAGGCCGAGACCTCTAGTGACTCTGAAACAGAATCCGATTCAG
AGATCAAGGAACTGGATGATGACCAAGTTCCTATCTCTGCAGCGTTGAGGAGAAAGAAGAGAAGAGAGATTAAAGTTGAACGGAGGACCAAAAACAAGAATGACCCGATA
TTTTCCAAGAGGTCGAGGACTAGGTCCATGGATGCCTCTCCAGCAGTACCTCCTACCATCTCACCCGCCAAGCCAAAGGCTAAATCACCTAAGGCTGCATCTCCCAAAAA
TCCATTCCCCGAGGTATTTAGAGATGTAAATTTTCAGGAACGGATGGAGATCATGAGAAAGAGAGATTTCCTCAACGAGAAGGGATTCTCTAACAGAGCTGGAACACTGC
CAGAGTTCGTAAGCAGAGTTATCTCACAGTACAAGTGGCAGGAGTTCTGTGCTCACCCTCAGGAGGCTGTCGTGCCTCTAGTTCGTGAATTTTACGTCAGCTTGAGGGAG
GAAAGTATAAGTATGGCGATGGTGAGAGGCAAGATGGTCAGCTTCTCTTCAGTAGACATTAACAGGGTGTACAGACTCAAAGCACCCTTGAATCCAAGAGGGAACGATGT
TATCAGGAACCCCTCGGCCAAGCAGATGAAAGAAGCACTTAAACTCGTGGCCAACAAGGGAGTTAAGTGGAAAGAGTCTCAGACGAAGGTGAAGACTCTAGTGCCAAGCG
ACTTAAAGCCAGAATCGGCAGTTTGGCTTCACTTTCTGAAGAACCGTTTGATGCCAACCACCCACGACAGCACGATCTCAGTGGATAGAGTTATGCTACTCTATTGCATT
ATGAAGGGGTTGGAGATCAACATTGGGAGCATAATCAGGGATGAGATTCTAGCCTGTGGAAGAAAACGAGCAGGTAAACTTTTCTTTGGATCACTCATCACCCAGCTCTG
TCAGAGGGTGAAGATAGTTCCAGGCAAGGACGAGGAGCGTCATTTCTTCAAGCCGACCATTGACCTGTCTTTGATCGGGAAACGCCAGCAGAATAGCATCCAGAGGAAAG
ATAAAGCCTTCACATCTCAGGCCACTCCACCATCAGGGTCGAGCATGGCTTTTCCATCCCAGCACACTCCTTTTATAGGGCCCTCACCATCATCGGAAGCCCTAGCCATT
GCCTACCGCCAGCTAGATCAAATCAGGGAAAACCTGAAGACTTATTGGGCATATGCAAAGGAGAGGGATGAGGCCATTAGAGAGTTCTATCTCTCTATCGCCTCGAGTAT
TGCTCCAGTTTTTCCCAATTTCCCTCAGTCGCTTTTGCCTCAAGAAGACAAGGATTCTGATGATGAAGAAGATGAAAATGATGATGAAGAGAATGAAGAGAAAAAGAGTT
CCTCGAATGAGGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGGAGAAAGTTTTGCCCAAACAGAGGGAAGACAAGGGCAAAGGTATTGCTGAAGCATCGGTTGAGGCTGAGAATCTTGATGCTGAGGAGCCACGGTTGCCATACAA
CCGCTTCATCAATAATCTTGCCCGAGCAAACTGTCCGGGAAGTTGGCATCGAGGGGGTTCAGTGGAGACTGTCGAAGCATTTCAAGCAGCTTCCCTAAAGAGCGAAGCCA
ATACATGGATGGGCTTCATCAAGTTGCGCGTACTGCCGACAACTCACGATTCAACGGTGTCTCGGGTTAGGGTACTTCTTGTGTTTGCTAGTCTGCGTTCCATGAGCATT
GATGTGGGCAAAATAATTTCCAGTGAGATTTTCGATTTCTGGCGGAAAAAGGTTGGGAAGCTGTTTTTCCCGAACACCATAACGATGTTATGCCAAAGGGCAGGGGTTCC
AATGAATGCGGATGATGTCACTCTAATAGACAAGGGAATAATTGACACGCCAAACTTGGCTAGGCTTCAAAGGACACAAGAGGCACGCCAAGGTGGTTTGGTGTGCGGCA
TCCATCAAATACAAGAGCAATTACAGATGCATTCCAGCAGGATGGAGTTTGTCGAAAGCTTGGACTTGTTAAGCTTAATTAGATTAAGGTGTTATGATCTCCAGAAACCT
TTTGCTTGGGCATTTCTTCTAGCCTGGTCATTGCTGCGGCAAGAAGATTCTGAGAATTATTTTGCTGCAGCAGAACTTGGTTTTGCAGAATGCTCAGACCCTGGATTCGA
TAAATACTTATTACTTGCTGCGGATTGTGTATACTTGCACAATGTTCTAGAGTTTCTTTTTGCTATATTTTCACCCACTCTGGAAACCCAACACCCATCTTCGTCTTGTA
GGAACAAACCATTCGTTACCAATAGTGCAAGGAAGATGAGTCCTAAGAAAGTTGTGATCGAAAAGCCGCTTGTTATTGAGCCTCTCAAAGTAGCAAGAATTCCCCCGGAC
GTGTTCGAGGACATAATTTGCCAAGCTGTGGCAAAGGCCCTTGTGATTGCTGAAGGATATAAGGTTGAACAAGAAGCCTTGAAGGATATTGAGGCAGAGAGAGAGATGGA
AAATCAACACATGAGGGAAGACGATGAAGGTGCAAGAGAAAGAGATTTTGAAGAAGAGAGGAAAAAGGAAGAGGAAAGGCAAGAGGTCGAGAGGGCCTTAAAAGCTGAAA
AAGAAAGAAAGTTAGATGAAGACCTCAGGAGGGTAGTTGCTGATTTGCAACTTCTTGAGGAAGAAAAACACGGAAGGGAAGAGTTGAAAGAAGACGAAAAAAGAAGGAAG
GAAGTTGAAGACTTCCTTGCAGCTTTTGAACCACTCCACAAGGCTCAAAGCCTGGCAGAGGCCACTGAAGTTCAGCCTGCTGATGAGGTTTTCGAACCTCTATTCAAATA
TGATCCACCAGCAGCTGATAGCACCTCTTCGGGAGAGAAGAGGGATGAAGAAGAAAAAGAAAGCAAGGAGGCCGAGACCTCTAGTGACTCTGAAACAGAATCCGATTCAG
AGATCAAGGAACTGGATGATGACCAAGTTCCTATCTCTGCAGCGTTGAGGAGAAAGAAGAGAAGAGAGATTAAAGTTGAACGGAGGACCAAAAACAAGAATGACCCGATA
TTTTCCAAGAGGTCGAGGACTAGGTCCATGGATGCCTCTCCAGCAGTACCTCCTACCATCTCACCCGCCAAGCCAAAGGCTAAATCACCTAAGGCTGCATCTCCCAAAAA
TCCATTCCCCGAGGTATTTAGAGATGTAAATTTTCAGGAACGGATGGAGATCATGAGAAAGAGAGATTTCCTCAACGAGAAGGGATTCTCTAACAGAGCTGGAACACTGC
CAGAGTTCGTAAGCAGAGTTATCTCACAGTACAAGTGGCAGGAGTTCTGTGCTCACCCTCAGGAGGCTGTCGTGCCTCTAGTTCGTGAATTTTACGTCAGCTTGAGGGAG
GAAAGTATAAGTATGGCGATGGTGAGAGGCAAGATGGTCAGCTTCTCTTCAGTAGACATTAACAGGGTGTACAGACTCAAAGCACCCTTGAATCCAAGAGGGAACGATGT
TATCAGGAACCCCTCGGCCAAGCAGATGAAAGAAGCACTTAAACTCGTGGCCAACAAGGGAGTTAAGTGGAAAGAGTCTCAGACGAAGGTGAAGACTCTAGTGCCAAGCG
ACTTAAAGCCAGAATCGGCAGTTTGGCTTCACTTTCTGAAGAACCGTTTGATGCCAACCACCCACGACAGCACGATCTCAGTGGATAGAGTTATGCTACTCTATTGCATT
ATGAAGGGGTTGGAGATCAACATTGGGAGCATAATCAGGGATGAGATTCTAGCCTGTGGAAGAAAACGAGCAGGTAAACTTTTCTTTGGATCACTCATCACCCAGCTCTG
TCAGAGGGTGAAGATAGTTCCAGGCAAGGACGAGGAGCGTCATTTCTTCAAGCCGACCATTGACCTGTCTTTGATCGGGAAACGCCAGCAGAATAGCATCCAGAGGAAAG
ATAAAGCCTTCACATCTCAGGCCACTCCACCATCAGGGTCGAGCATGGCTTTTCCATCCCAGCACACTCCTTTTATAGGGCCCTCACCATCATCGGAAGCCCTAGCCATT
GCCTACCGCCAGCTAGATCAAATCAGGGAAAACCTGAAGACTTATTGGGCATATGCAAAGGAGAGGGATGAGGCCATTAGAGAGTTCTATCTCTCTATCGCCTCGAGTAT
TGCTCCAGTTTTTCCCAATTTCCCTCAGTCGCTTTTGCCTCAAGAAGACAAGGATTCTGATGATGAAGAAGATGAAAATGATGATGAAGAGAATGAAGAGAAAAAGAGTT
CCTCGAATGAGGACTAG
Protein sequenceShow/hide protein sequence
MEEKVLPKQREDKGKGIAEASVEAENLDAEEPRLPYNRFINNLARANCPGSWHRGGSVETVEAFQAASLKSEANTWMGFIKLRVLPTTHDSTVSRVRVLLVFASLRSMSI
DVGKIISSEIFDFWRKKVGKLFFPNTITMLCQRAGVPMNADDVTLIDKGIIDTPNLARLQRTQEARQGGLVCGIHQIQEQLQMHSSRMEFVESLDLLSLIRLRCYDLQKP
FAWAFLLAWSLLRQEDSENYFAAAELGFAECSDPGFDKYLLLAADCVYLHNVLEFLFAIFSPTLETQHPSSSCRNKPFVTNSARKMSPKKVVIEKPLVIEPLKVARIPPD
VFEDIICQAVAKALVIAEGYKVEQEALKDIEAEREMENQHMREDDEGARERDFEEERKKEEERQEVERALKAEKERKLDEDLRRVVADLQLLEEEKHGREELKEDEKRRK
EVEDFLAAFEPLHKAQSLAEATEVQPADEVFEPLFKYDPPAADSTSSGEKRDEEEKESKEAETSSDSETESDSEIKELDDDQVPISAALRRKKRREIKVERRTKNKNDPI
FSKRSRTRSMDASPAVPPTISPAKPKAKSPKAASPKNPFPEVFRDVNFQERMEIMRKRDFLNEKGFSNRAGTLPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYVSLRE
ESISMAMVRGKMVSFSSVDINRVYRLKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVKWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCI
MKGLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKPTIDLSLIGKRQQNSIQRKDKAFTSQATPPSGSSMAFPSQHTPFIGPSPSSEALAI
AYRQLDQIRENLKTYWAYAKERDEAIREFYLSIASSIAPVFPNFPQSLLPQEDKDSDDEEDENDDEENEEKKSSSNED