; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg013594 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg013594
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionNucleolar protein 58-like
Genome locationscaffold2:24322394..24337386
RNA-Seq ExpressionSpg013594
SyntenySpg013594
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB49850.1 hypothetical protein L484_000844 [Morus notabilis]3.0e-2930.87Show/hide
Query:  FAKRPRTRSMDASPAVPPTVSAAKPKGKSPKAASPKNPFPEVFKDVNFQERMEIMRKRDFLNEKGF---SNRAGALPEFVSRVISQYKWQEFCARPQEAV
        FAKRP + S    PA+    +AA     S +  S    F +   +  ++E    +  R+ + EKGF    +     P F+S VI    WQ FC  P + +
Subjt:  FAKRPRTRSMDASPAVPPTVSAAKPKGKSPKAASPKNPFPEVFKDVNFQERMEIMRKRDFLNEKGF---SNRAGALPEFVSRVISQYKWQEFCARPQEAV

Query:  VPLVREFYAGLREESISMAVARGKMINFSSVDINRVYRIKAPLNPRGND----VIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLMPSDLKPESAVWL
        VPLV+EFYA L+ +  +        I F+S  IN V  I     P  +D    +I +   +Q+KE LK +A  G QW  S     T    +L+P + VW 
Subjt:  VPLVREFYAGLREESISMAVARGKMINFSSVDINRVYRIKAPLNPRGND----VIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLMPSDLKPESAVWL

Query:  HFLKNCLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKQAGKLFFGSLITQLCQRVKIVPRKDEERHFFKPTIDLFLIGKLQQNIIQRK
        HFL + L+ +TH  TIS +R +LLY ++ G  IN+G +I D+I AC  K  G L+F SLI++LC +  +     E R      +DL  I ++     ++ 
Subjt:  HFLKNCLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKQAGKLFFGSLITQLCQRVKIVPRKDEERHFFKPTIDLFLIGKLQQNIIQRK

Query:  DK-TSTSQATPPSGPSMASPSQHISFTGPSPSSEALA-----------IAYRQLDQIRENLKTYWAYAKERDEAIREFY
        +K     +   PS PS    + H      + S E L              +  L Q +E L  +W Y+++RD A+++ +
Subjt:  DK-TSTSQATPPSGPSMASPSQHISFTGPSPSSEALA-----------IAYRQLDQIRENLKTYWAYAKERDEAIREFY

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]9.7e-3635.98Show/hide
Query:  MRKRDFLNEKGF----SNRAGALPEFVSRVISQYKWQEFCARPQEAVVPLVREFYAGLREESISMAVARGKMINFSSVDINRVYRIKAPLNPRGNDVIRN
        ++ R    EKGF    S   G LP F+++VI+Q+ W++FCA P++ +VPLVREFYA L +   +    RG  +++S   IN V+ +  P++   ++ I N
Subjt:  MRKRDFLNEKGF----SNRAGALPEFVSRVISQYKWQEFCARPQEAVVPLVREFYAGLREESISMAVARGKMINFSSVDINRVYRIKAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKESQTKVKTLMPSDLKPESAVWLHFLKNCLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKQAGKLFFG
         +   +   L+ VA  G +W  S     T + S L P + VW HFLK+ L+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  ++ G LFF 
Subjt:  PSAKQMKEALKLVANKGVQWKESQTKVKTLMPSDLKPESAVWLHFLKNCLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKQAGKLFFG

Query:  SLITQLCQRVKIVPRKDEERHFFKPTIDLFLIGKLQQNIIQRKDKTSTSQATPPSGPSMASPSQ
        SLIT+LC+  +     +EE+      ID   + ++ Q     +  T ++Q    S P+ AS S+
Subjt:  SLITQLCQRVKIVPRKDEERHFFKPTIDLFLIGKLQQNIIQRKDKTSTSQATPPSGPSMASPSQ

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]5.0e-4033.15Show/hide
Query:  MRKRDFLNEKGF----SNRAGALPEFVSRVISQYKWQEFCARPQEAVVPLVREFYAGLREESISMAVARGKMINFSSVDINRVYRIKAPLNPRGNDVIRN
        ++ R    EKGF    S   G LP F+++VI+Q+ W++FCA P++ +VPLVREFYA L +   +    RG  +++S   IN V+ +  P++   ++ I+N
Subjt:  MRKRDFLNEKGF----SNRAGALPEFVSRVISQYKWQEFCARPQEAVVPLVREFYAGLREESISMAVARGKMINFSSVDINRVYRIKAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKESQTKVKTLMPSDLKPESAVWLHFLKNCLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKQAGKLFFG
         + + +   L+ VA  G +W  S     T + S L P + VW HFLK+ L+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  ++ G LFF 
Subjt:  PSAKQMKEALKLVANKGVQWKESQTKVKTLMPSDLKPESAVWLHFLKNCLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKQAGKLFFG

Query:  SLITQLCQRVKIVPRKDEERHFFKPTIDLFLIGKLQQNIIQRKDKTSTSQATPPSGPSMASPS-------QHISFTGPSPSSEALAIAYRQ--LDQIREN
        SLIT+LC+  +     +EE+      ID   + ++ Q     +  T ++Q    S P+ AS +       Q +       S + +   +    L    + 
Subjt:  SLITQLCQRVKIVPRKDEERHFFKPTIDLFLIGKLQQNIIQRKDKTSTSQATPPSGPSMASPS-------QHISFTGPSPSSEALAIAYRQ--LDQIREN

Query:  LKTYWAYAKERDEAIREFYLFIALSIAPVFPDFPQSLLPQENKDSDEEDDENNDED
         + +WAY+KERD A+++          P FP FPQ +L    KD D E +  +D+D
Subjt:  LKTYWAYAKERDEAIREFYLFIALSIAPVFPDFPQSLLPQENKDSDEEDDENNDED

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]3.6e-3033.2Show/hide
Query:  ASPKNPFPEVFKDVNFQERMEIMRKRDFLNEKGF---SNRAGALPEFVSRVISQYKWQEFCARPQEAVVPLVREFYAGLREESISMAVARGKMINFSSVD
        AS    F     ++ ++E ++    R    EK F   +++    P F++ VI Q+ WQ FCA P++ +VPLVREFY  +          RG  +  S   
Subjt:  ASPKNPFPEVFKDVNFQERMEIMRKRDFLNEKGF---SNRAGALPEFVSRVISQYKWQEFCARPQEAVVPLVREFYAGLREESISMAVARGKMINFSSVD

Query:  INRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLMPSDLKPESAVWLHFLKNCLMPTTHDSTISVDRVMLLYCIMKGLEINIG
        IN ++ +  P++   ++ + + +  ++   L+ VA  G +W  S     T + S L P + VW HFLK+ L+PTTH  T+S + V LLY ++ G  IN+G
Subjt:  INRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLMPSDLKPESAVWLHFLKNCLMPTTHDSTISVDRVMLLYCIMKGLEINIG

Query:  SIIRDEILACGRKQAGKLFFGSLITQLCQRVKIVPRKDEER
         +I  EI AC  +++G LFF SLIT +C+  +     +EE+
Subjt:  SIIRDEILACGRKQAGKLFFGSLITQLCQRVKIVPRKDEER

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]1.1e-2831.8Show/hide
Query:  VPLVREFYAGLREESISMAVARGKMINFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLMPSDLKPESAVWLHFLK
        +PLVREFYA L +   +    RG  +++S   IN V+ +  P++   ++ I N +  ++   L+ VA  G +W  S     T + S L P + VW HFLK
Subjt:  VPLVREFYAGLREESISMAVARGKMINFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLMPSDLKPESAVWLHFLK

Query:  NCLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKQAGKLFFGSLITQLCQRVKIVPRKDEERHFFKPTIDLFLIGKLQQNIIQRKDKTS
        + L+PTTH   +S DR++LL+ ++ G  IN+G +I  EI AC  ++ G LFF SLIT+LC+    +   +EE+      ID   + ++ Q     +  T 
Subjt:  NCLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKQAGKLFFGSLITQLCQRVKIVPRKDEERHFFKPTIDLFLIGKLQQNIIQRKDKTS

Query:  TSQATPPSGPSMASPSQHISFTGPSPSSEALAIAYR--QLDQIRENLKTYWAYAKERDEAIREFYLFIALSIAPVFPDFPQSLLPQENKDSDEEDDENND
        ++Q    S P+ AS S+    T      +  A+  R  Q +   +  + +WAY+KERD A+++          P FP FPQ +L   + + + E D++  
Subjt:  TSQATPPSGPSMASPSQHISFTGPSPSSEALAIAYR--QLDQIRENLKTYWAYAKERDEAIREFYLFIALSIAPVFPDFPQSLLPQENKDSDEEDDENND

Query:  EDDEE
         +  E
Subjt:  EDDEE

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)4.7e-3635.98Show/hide
Query:  MRKRDFLNEKGF----SNRAGALPEFVSRVISQYKWQEFCARPQEAVVPLVREFYAGLREESISMAVARGKMINFSSVDINRVYRIKAPLNPRGNDVIRN
        ++ R    EKGF    S   G LP F+++VI+Q+ W++FCA P++ +VPLVREFYA L +   +    RG  +++S   IN V+ +  P++   ++ I N
Subjt:  MRKRDFLNEKGF----SNRAGALPEFVSRVISQYKWQEFCARPQEAVVPLVREFYAGLREESISMAVARGKMINFSSVDINRVYRIKAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKESQTKVKTLMPSDLKPESAVWLHFLKNCLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKQAGKLFFG
         +   +   L+ VA  G +W  S     T + S L P + VW HFLK+ L+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  ++ G LFF 
Subjt:  PSAKQMKEALKLVANKGVQWKESQTKVKTLMPSDLKPESAVWLHFLKNCLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKQAGKLFFG

Query:  SLITQLCQRVKIVPRKDEERHFFKPTIDLFLIGKLQQNIIQRKDKTSTSQATPPSGPSMASPSQ
        SLIT+LC+  +     +EE+      ID   + ++ Q     +  T ++Q    S P+ AS S+
Subjt:  SLITQLCQRVKIVPRKDEERHFFKPTIDLFLIGKLQQNIIQRKDKTSTSQATPPSGPSMASPSQ

A0A2P5BCG4 Uncharacterized protein (Fragment)2.4e-4033.15Show/hide
Query:  MRKRDFLNEKGF----SNRAGALPEFVSRVISQYKWQEFCARPQEAVVPLVREFYAGLREESISMAVARGKMINFSSVDINRVYRIKAPLNPRGNDVIRN
        ++ R    EKGF    S   G LP F+++VI+Q+ W++FCA P++ +VPLVREFYA L +   +    RG  +++S   IN V+ +  P++   ++ I+N
Subjt:  MRKRDFLNEKGF----SNRAGALPEFVSRVISQYKWQEFCARPQEAVVPLVREFYAGLREESISMAVARGKMINFSSVDINRVYRIKAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKESQTKVKTLMPSDLKPESAVWLHFLKNCLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKQAGKLFFG
         + + +   L+ VA  G +W  S     T + S L P + VW HFLK+ L+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  ++ G LFF 
Subjt:  PSAKQMKEALKLVANKGVQWKESQTKVKTLMPSDLKPESAVWLHFLKNCLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKQAGKLFFG

Query:  SLITQLCQRVKIVPRKDEERHFFKPTIDLFLIGKLQQNIIQRKDKTSTSQATPPSGPSMASPS-------QHISFTGPSPSSEALAIAYRQ--LDQIREN
        SLIT+LC+  +     +EE+      ID   + ++ Q     +  T ++Q    S P+ AS +       Q +       S + +   +    L    + 
Subjt:  SLITQLCQRVKIVPRKDEERHFFKPTIDLFLIGKLQQNIIQRKDKTSTSQATPPSGPSMASPS-------QHISFTGPSPSSEALAIAYRQ--LDQIREN

Query:  LKTYWAYAKERDEAIREFYLFIALSIAPVFPDFPQSLLPQENKDSDEEDDENNDED
         + +WAY+KERD A+++          P FP FPQ +L    KD D E +  +D+D
Subjt:  LKTYWAYAKERDEAIREFYLFIALSIAPVFPDFPQSLLPQENKDSDEEDDENNDED

A0A2P5DAQ2 Uncharacterized protein1.7e-3033.2Show/hide
Query:  ASPKNPFPEVFKDVNFQERMEIMRKRDFLNEKGF---SNRAGALPEFVSRVISQYKWQEFCARPQEAVVPLVREFYAGLREESISMAVARGKMINFSSVD
        AS    F     ++ ++E ++    R    EK F   +++    P F++ VI Q+ WQ FCA P++ +VPLVREFY  +          RG  +  S   
Subjt:  ASPKNPFPEVFKDVNFQERMEIMRKRDFLNEKGF---SNRAGALPEFVSRVISQYKWQEFCARPQEAVVPLVREFYAGLREESISMAVARGKMINFSSVD

Query:  INRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLMPSDLKPESAVWLHFLKNCLMPTTHDSTISVDRVMLLYCIMKGLEINIG
        IN ++ +  P++   ++ + + +  ++   L+ VA  G +W  S     T + S L P + VW HFLK+ L+PTTH  T+S + V LLY ++ G  IN+G
Subjt:  INRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLMPSDLKPESAVWLHFLKNCLMPTTHDSTISVDRVMLLYCIMKGLEINIG

Query:  SIIRDEILACGRKQAGKLFFGSLITQLCQRVKIVPRKDEER
         +I  EI AC  +++G LFF SLIT +C+  +     +EE+
Subjt:  SIIRDEILACGRKQAGKLFFGSLITQLCQRVKIVPRKDEER

A0A2P5DXM3 Uncharacterized protein5.6e-2931.8Show/hide
Query:  VPLVREFYAGLREESISMAVARGKMINFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLMPSDLKPESAVWLHFLK
        +PLVREFYA L +   +    RG  +++S   IN V+ +  P++   ++ I N +  ++   L+ VA  G +W  S     T + S L P + VW HFLK
Subjt:  VPLVREFYAGLREESISMAVARGKMINFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLMPSDLKPESAVWLHFLK

Query:  NCLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKQAGKLFFGSLITQLCQRVKIVPRKDEERHFFKPTIDLFLIGKLQQNIIQRKDKTS
        + L+PTTH   +S DR++LL+ ++ G  IN+G +I  EI AC  ++ G LFF SLIT+LC+    +   +EE+      ID   + ++ Q     +  T 
Subjt:  NCLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKQAGKLFFGSLITQLCQRVKIVPRKDEERHFFKPTIDLFLIGKLQQNIIQRKDKTS

Query:  TSQATPPSGPSMASPSQHISFTGPSPSSEALAIAYR--QLDQIRENLKTYWAYAKERDEAIREFYLFIALSIAPVFPDFPQSLLPQENKDSDEEDDENND
        ++Q    S P+ AS S+    T      +  A+  R  Q +   +  + +WAY+KERD A+++          P FP FPQ +L   + + + E D++  
Subjt:  TSQATPPSGPSMASPSQHISFTGPSPSSEALAIAYR--QLDQIRENLKTYWAYAKERDEAIREFYLFIALSIAPVFPDFPQSLLPQENKDSDEEDDENND

Query:  EDDEE
         +  E
Subjt:  EDDEE

W9RBS1 Uncharacterized protein1.5e-2930.87Show/hide
Query:  FAKRPRTRSMDASPAVPPTVSAAKPKGKSPKAASPKNPFPEVFKDVNFQERMEIMRKRDFLNEKGF---SNRAGALPEFVSRVISQYKWQEFCARPQEAV
        FAKRP + S    PA+    +AA     S +  S    F +   +  ++E    +  R+ + EKGF    +     P F+S VI    WQ FC  P + +
Subjt:  FAKRPRTRSMDASPAVPPTVSAAKPKGKSPKAASPKNPFPEVFKDVNFQERMEIMRKRDFLNEKGF---SNRAGALPEFVSRVISQYKWQEFCARPQEAV

Query:  VPLVREFYAGLREESISMAVARGKMINFSSVDINRVYRIKAPLNPRGND----VIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLMPSDLKPESAVWL
        VPLV+EFYA L+ +  +        I F+S  IN V  I     P  +D    +I +   +Q+KE LK +A  G QW  S     T    +L+P + VW 
Subjt:  VPLVREFYAGLREESISMAVARGKMINFSSVDINRVYRIKAPLNPRGND----VIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLMPSDLKPESAVWL

Query:  HFLKNCLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKQAGKLFFGSLITQLCQRVKIVPRKDEERHFFKPTIDLFLIGKLQQNIIQRK
        HFL + L+ +TH  TIS +R +LLY ++ G  IN+G +I D+I AC  K  G L+F SLI++LC +  +     E R      +DL  I ++     ++ 
Subjt:  HFLKNCLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKQAGKLFFGSLITQLCQRVKIVPRKDEERHFFKPTIDLFLIGKLQQNIIQRK

Query:  DK-TSTSQATPPSGPSMASPSQHISFTGPSPSSEALA-----------IAYRQLDQIRENLKTYWAYAKERDEAIREFY
        +K     +   PS PS    + H      + S E L              +  L Q +E L  +W Y+++RD A+++ +
Subjt:  DK-TSTSQATPPSGPSMASPSQHISFTGPSPSSEALA-----------IAYRQLDQIRENLKTYWAYAKERDEAIREFY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAATCAGCATATGAGGGAAGATGATGACTTTGCGAGAGAAAGAGATCTTGAAGAGGAAAGGAAAAGAGAAGAGGAAAGACAAGAGGCCGAGAGGGCCAAGATAGC
TGAGGAAAAAGAGAGAAAATTAGCTTTTGAGCCACTCCACAAGGCTCAAAGTGAAGCGGAAGTGCTGCAAGGGAGAGAAGAAGAGGCCCAACAGGGGCTAACTGAAGAAT
GTTCAGAAAAAGAAAAAGAAAGAGAGGTAGAGATTGAAGGCCAGAATGCGACCGCATCTGGGCCGCATTCTGAAGAAGGCCTAGCCGAGGCCACCAATGAGCAACCTGCT
GATGAGGTTTTCAAACCTCTATTCAAAGATGACCCACCAGCAGCTGACAGCAGCTCTTCGGGAGAGAAGAGGAATGAAGAAGAGAAAGAAAGCAAGGAGGCCGAGACCTC
CAGGGACTCTGAAACAGAATCCGAATCAGAGATTAGGAAATTGGATGATGACCAAGTTCCTATCTCTGCAGCATTGAGAAGAAAGAGAAAAAGAAAGATCAAAGCTGAAA
GGAGGACCAAAAACAAGAATGATCCTATTTTTGCCAAGAGGCCGAGGACTAGGTCCATGGACGCCTCTCCTGCAGTTCCTCCTACTGTCTCAGCCGCCAAGCCAAAGGGA
AAATCACCTAAGGCTGCATCTCCTAAAAATCCATTCCCCGAGGTATTCAAAGATGTTAATTTTCAGGAAAGGATGGAGATTATGAGAAAAAGGGATTTCCTAAACGAGAA
GGGATTCTCTAACAGAGCAGGAGCACTGCCAGAGTTCGTAAGCAGAGTTATCTCCCAGTACAAGTGGCAGGAGTTCTGTGCTCGCCCTCAGGAGGCTGTAGTGCCTTTAG
TTCGTGAATTTTACGCCGGCCTGAGGGAGGAAAGCATTAGTATGGCGGTGGCGAGAGGCAAAATGATCAACTTCTCTTCAGTAGACATCAACAGGGTGTACAGAATCAAG
GCACCCCTGAATCCAAGAGGGAACGATGTTATTAGGAACCCCTCGGCCAAGCAGATGAAAGAAGCATTAAAACTTGTTGCCAACAAGGGTGTTCAGTGGAAAGAGTCCCA
GACGAAGGTGAAGACTTTAATGCCAAGCGATCTAAAGCCAGAATCGGCTGTTTGGCTTCACTTTCTGAAGAACTGCTTGATGCCAACCACCCACGATAGCACGATCTCAG
TGGATAGAGTGATGCTACTCTATTGCATTATGAAGGGGTTGGAGATCAACATTGGGAGCATAATCAGGGATGAGATTCTAGCCTGTGGAAGAAAACAAGCAGGTAAACTT
TTCTTTGGATCACTTATCACCCAGCTTTGTCAGAGGGTGAAGATAGTTCCAAGAAAGGACGAGGAACGTCATTTCTTCAAGCCGACTATTGACCTGTTCTTGATCGGGAA
GCTTCAACAGAACATCATCCAAAGGAAAGATAAAACCTCCACATCTCAGGCCACTCCACCATCAGGGCCGAGCATGGCTTCTCCATCCCAGCACATTTCTTTTACAGGGC
CCTCACCGTCATCAGAAGCCCTAGCTATTGCCTACCGCCAACTTGATCAAATCAGGGAAAACCTAAAGACATATTGGGCATATGCAAAGGAGAGGGATGAGGCCATTAGA
GAGTTCTATCTCTTTATCGCCCTAAGTATTGCTCCGGTCTTTCCCGATTTTCCTCAGTCGCTGCTACCTCAAGAAAACAAGGATTCTGATGAAGAGGATGATGAAAATAA
TGATGAAGATGATGAAGAGAAAGAGAGTTCCTCGGACGAGGACTACGGGAGTTTTCTGACCCCTTTACCTGCTGTTTTTCTTTGCAAGATGGAATTCACTGCTACCCGTG
TTAGGGTTAGTAAAGGAAAACAGAGGAAAAGCTGGATTCTGCCCAGAAATGCGACCGCATTTCTGGGAAGGCAAAATGAAATGCGACCACATTTCTGGAAAATAGAGGCA
GTTTCGAGTCGTCTGCGGGTCGTTGTTGACGAGTCTTCTTCGCACCTAACCGGCTACTTTGCATGCTATGTGAGTTTGATCTCGAAAGGGACAAACGATATTGGTTCTCA
TAAGGATACCCCCACTCGCATGTCTACTACATGGACGCTTTGGATCAATACGTCTGTATCAAATACAAAGCAAGTCGTATCACATAGTGTTACCAGGATAAGGTTGCAAC
CAGGCTACAACGATCGTCTCAGCAGGATACAACGACTAATTCCGTATAGGGATAAGGCTGGGTACCTTATCCTGGTGACACTATGTGATACGGCCCACCTTGTATTCGAT
ACAGATGCAATGATCCAAAGCATCCATGTAGAAGACATGCGGGTGGAGGGGCAAGACCGAATGGGGGCTGGGAACATAACAGTACAAGATGGAATTCACTCCTTCTCCTA
TTGGGAAATATTCTATAGTGAGAAGAGTGCAGCTAGAATTCTCCCAAAGGCTCCCACAAGTCTCCTGCCTCTAGAAGTCTCAGAGTCATACCGGTCACGCGCTAGAAATT
CGCAACAGGACGGTTCAAGGCTGTTGGGCGGTCCGATTCACGCGGTCTGGTTGGGCTGGGACCGATTTGGTCCGGTTCAGCTGGAATTTAGCTTGGTTCGTGGTTTTCAT
AGCTGGTTCGAAGTGGTTCGGGTCGGTTCGGGCGGTCTAGACCAGTATTTGGAGCTGCAGAAGGATTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAAATCAGCATATGAGGGAAGATGATGACTTTGCGAGAGAAAGAGATCTTGAAGAGGAAAGGAAAAGAGAAGAGGAAAGACAAGAGGCCGAGAGGGCCAAGATAGC
TGAGGAAAAAGAGAGAAAATTAGCTTTTGAGCCACTCCACAAGGCTCAAAGTGAAGCGGAAGTGCTGCAAGGGAGAGAAGAAGAGGCCCAACAGGGGCTAACTGAAGAAT
GTTCAGAAAAAGAAAAAGAAAGAGAGGTAGAGATTGAAGGCCAGAATGCGACCGCATCTGGGCCGCATTCTGAAGAAGGCCTAGCCGAGGCCACCAATGAGCAACCTGCT
GATGAGGTTTTCAAACCTCTATTCAAAGATGACCCACCAGCAGCTGACAGCAGCTCTTCGGGAGAGAAGAGGAATGAAGAAGAGAAAGAAAGCAAGGAGGCCGAGACCTC
CAGGGACTCTGAAACAGAATCCGAATCAGAGATTAGGAAATTGGATGATGACCAAGTTCCTATCTCTGCAGCATTGAGAAGAAAGAGAAAAAGAAAGATCAAAGCTGAAA
GGAGGACCAAAAACAAGAATGATCCTATTTTTGCCAAGAGGCCGAGGACTAGGTCCATGGACGCCTCTCCTGCAGTTCCTCCTACTGTCTCAGCCGCCAAGCCAAAGGGA
AAATCACCTAAGGCTGCATCTCCTAAAAATCCATTCCCCGAGGTATTCAAAGATGTTAATTTTCAGGAAAGGATGGAGATTATGAGAAAAAGGGATTTCCTAAACGAGAA
GGGATTCTCTAACAGAGCAGGAGCACTGCCAGAGTTCGTAAGCAGAGTTATCTCCCAGTACAAGTGGCAGGAGTTCTGTGCTCGCCCTCAGGAGGCTGTAGTGCCTTTAG
TTCGTGAATTTTACGCCGGCCTGAGGGAGGAAAGCATTAGTATGGCGGTGGCGAGAGGCAAAATGATCAACTTCTCTTCAGTAGACATCAACAGGGTGTACAGAATCAAG
GCACCCCTGAATCCAAGAGGGAACGATGTTATTAGGAACCCCTCGGCCAAGCAGATGAAAGAAGCATTAAAACTTGTTGCCAACAAGGGTGTTCAGTGGAAAGAGTCCCA
GACGAAGGTGAAGACTTTAATGCCAAGCGATCTAAAGCCAGAATCGGCTGTTTGGCTTCACTTTCTGAAGAACTGCTTGATGCCAACCACCCACGATAGCACGATCTCAG
TGGATAGAGTGATGCTACTCTATTGCATTATGAAGGGGTTGGAGATCAACATTGGGAGCATAATCAGGGATGAGATTCTAGCCTGTGGAAGAAAACAAGCAGGTAAACTT
TTCTTTGGATCACTTATCACCCAGCTTTGTCAGAGGGTGAAGATAGTTCCAAGAAAGGACGAGGAACGTCATTTCTTCAAGCCGACTATTGACCTGTTCTTGATCGGGAA
GCTTCAACAGAACATCATCCAAAGGAAAGATAAAACCTCCACATCTCAGGCCACTCCACCATCAGGGCCGAGCATGGCTTCTCCATCCCAGCACATTTCTTTTACAGGGC
CCTCACCGTCATCAGAAGCCCTAGCTATTGCCTACCGCCAACTTGATCAAATCAGGGAAAACCTAAAGACATATTGGGCATATGCAAAGGAGAGGGATGAGGCCATTAGA
GAGTTCTATCTCTTTATCGCCCTAAGTATTGCTCCGGTCTTTCCCGATTTTCCTCAGTCGCTGCTACCTCAAGAAAACAAGGATTCTGATGAAGAGGATGATGAAAATAA
TGATGAAGATGATGAAGAGAAAGAGAGTTCCTCGGACGAGGACTACGGGAGTTTTCTGACCCCTTTACCTGCTGTTTTTCTTTGCAAGATGGAATTCACTGCTACCCGTG
TTAGGGTTAGTAAAGGAAAACAGAGGAAAAGCTGGATTCTGCCCAGAAATGCGACCGCATTTCTGGGAAGGCAAAATGAAATGCGACCACATTTCTGGAAAATAGAGGCA
GTTTCGAGTCGTCTGCGGGTCGTTGTTGACGAGTCTTCTTCGCACCTAACCGGCTACTTTGCATGCTATGTGAGTTTGATCTCGAAAGGGACAAACGATATTGGTTCTCA
TAAGGATACCCCCACTCGCATGTCTACTACATGGACGCTTTGGATCAATACGTCTGTATCAAATACAAAGCAAGTCGTATCACATAGTGTTACCAGGATAAGGTTGCAAC
CAGGCTACAACGATCGTCTCAGCAGGATACAACGACTAATTCCGTATAGGGATAAGGCTGGGTACCTTATCCTGGTGACACTATGTGATACGGCCCACCTTGTATTCGAT
ACAGATGCAATGATCCAAAGCATCCATGTAGAAGACATGCGGGTGGAGGGGCAAGACCGAATGGGGGCTGGGAACATAACAGTACAAGATGGAATTCACTCCTTCTCCTA
TTGGGAAATATTCTATAGTGAGAAGAGTGCAGCTAGAATTCTCCCAAAGGCTCCCACAAGTCTCCTGCCTCTAGAAGTCTCAGAGTCATACCGGTCACGCGCTAGAAATT
CGCAACAGGACGGTTCAAGGCTGTTGGGCGGTCCGATTCACGCGGTCTGGTTGGGCTGGGACCGATTTGGTCCGGTTCAGCTGGAATTTAGCTTGGTTCGTGGTTTTCAT
AGCTGGTTCGAAGTGGTTCGGGTCGGTTCGGGCGGTCTAGACCAGTATTTGGAGCTGCAGAAGGATTTTTAG
Protein sequenceShow/hide protein sequence
MENQHMREDDDFARERDLEEERKREEERQEAERAKIAEEKERKLAFEPLHKAQSEAEVLQGREEEAQQGLTEECSEKEKEREVEIEGQNATASGPHSEEGLAEATNEQPA
DEVFKPLFKDDPPAADSSSSGEKRNEEEKESKEAETSRDSETESESEIRKLDDDQVPISAALRRKRKRKIKAERRTKNKNDPIFAKRPRTRSMDASPAVPPTVSAAKPKG
KSPKAASPKNPFPEVFKDVNFQERMEIMRKRDFLNEKGFSNRAGALPEFVSRVISQYKWQEFCARPQEAVVPLVREFYAGLREESISMAVARGKMINFSSVDINRVYRIK
APLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLMPSDLKPESAVWLHFLKNCLMPTTHDSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKQAGKL
FFGSLITQLCQRVKIVPRKDEERHFFKPTIDLFLIGKLQQNIIQRKDKTSTSQATPPSGPSMASPSQHISFTGPSPSSEALAIAYRQLDQIRENLKTYWAYAKERDEAIR
EFYLFIALSIAPVFPDFPQSLLPQENKDSDEEDDENNDEDDEEKESSSDEDYGSFLTPLPAVFLCKMEFTATRVRVSKGKQRKSWILPRNATAFLGRQNEMRPHFWKIEA
VSSRLRVVVDESSSHLTGYFACYVSLISKGTNDIGSHKDTPTRMSTTWTLWINTSVSNTKQVVSHSVTRIRLQPGYNDRLSRIQRLIPYRDKAGYLILVTLCDTAHLVFD
TDAMIQSIHVEDMRVEGQDRMGAGNITVQDGIHSFSYWEIFYSEKSAARILPKAPTSLLPLEVSESYRSRARNSQQDGSRLLGGPIHAVWLGWDRFGPVQLEFSLVRGFH
SWFEVVRVGSGGLDQYLELQKDF