; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg028660 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg028660
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionNucleolar protein 58-like
Genome locationscaffold7:16833187..16843328
RNA-Seq ExpressionSpg028660
SyntenySpg028660
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB49850.1 hypothetical protein L484_000844 [Morus notabilis]2.1e-2830.42Show/hide
Query:  FAKRPRTR-----AMDASPAVPPTISPAKPKGKSPKAASPKNPFPEVFRDVNFQERMEIMRKRDFLNEKGF-SNRAGTL--PEFISRVISQNKWQKFCAH
        FAKRP +      A+D + A  P+         S +  S    F +   +  ++E    +  R+ + EKGF  + + TL  P FIS VI    WQ FC H
Subjt:  FAKRPRTR-----AMDASPAVPPTISPAKPKGKSPKAASPKNPFPEVFRDVNFQERMEIMRKRDFLNEKGF-SNRAGTL--PEFISRVISQNKWQKFCAH

Query:  PQEVVVPLVREFYVGLREESISMAVVRGKMVSFSSVDINRVYRLKEPLNPRGNDVIRNPSAKQMKETLKLMANKGVQWKKSQTNVKTLMSSDLKPESTVW
        P + +VPLV+EFY  L+ +  +   V    ++F+S  IN V  +    +    ++I +   +Q+KE LK +A  G QW  S     T    +L+P + VW
Subjt:  PQEVVVPLVREFYVGLREESISMAVVRGKMVSFSSVDINRVYRLKEPLNPRGNDVIRNPSAKQMKETLKLMANKGVQWKKSQTNVKTLMSSDLKPESTVW

Query:  LHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIRDEILDCGRKRADKLFFGSLITQLCQRVKIVPGKDEELHFFKPTIDLSLIGKLRQNNIQR
         HFL +RL+ +TH  TIS +R +LLY ++ G  INVG +I D+I  C  K    L+F SLI++LC +  +     E        +DL  I ++     ++
Subjt:  LHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIRDEILDCGRKRADKLFFGSLITQLCQRVKIVPGKDEELHFFKPTIDLSLIGKLRQNNIQR

Query:  KDKA-STSQATPPSGPSMASPSHHTSFTGPSPSSEAL--AIAYHQPDQI---------KENLKTYWAYAKERDEAIRE
         +K     +   PS PS    + HT     + S E L   ++  +  Q          +E L  +W Y+++RD A+++
Subjt:  KDKA-STSQATPPSGPSMASPSHHTSFTGPSPSSEAL--AIAYHQPDQI---------KENLKTYWAYAKERDEAIRE

PIN01433.1 hypothetical protein CDL12_26059 [Handroanthus impetiginosus]4.7e-2835.32Show/hide
Query:  EKGFSNRAGTLPEFISRVISQNKWQKFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVSFSSVDINRVYRLKEPLNPRGNDVIRN--PSAKQMKETL
        E+GF  +     E I   + + KW+ F A P+  V+PLVREFY    E      +VRG+ V F SV IN +Y +     P   D   N   +    +E  
Subjt:  EKGFSNRAGTLPEFISRVISQNKWQKFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVSFSSVDINRVYRLKEPLNPRGNDVIRN--PSAKQMKETL

Query:  KLMANKGVQWKKSQTNVKTLMSSDLKPESTVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIRDEILDCGRKRADKLFFGSLITQLCQRV
        + +   G QWK ++    +  S+ L   + +WL F+  R++PT H   ++ DR +LLYCIM G   +VG II D I+       D L+F SLIT+LC R 
Subjt:  KLMANKGVQWKKSQTNVKTLMSSDLKPESTVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIRDEILDCGRKRADKLFFGSLITQLCQRV

Query:  KIVPGKDEELHFFKPTID
         +   + EEL F +  ID
Subjt:  KIVPGKDEELHFFKPTID

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]8.8e-3536.88Show/hide
Query:  MRKRDFLNEKGF----SNRAGTLPEFISRVISQNKWQKFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVSFSSVDINRVYRLKEPLNPRGNDVIRN
        ++ R    EKGF    S   G LP FI++VI+Q+ W++FCAHP++ +VPLVREFY  L +   +   VRG  VS+S   IN V+ L +P++   ++ I N
Subjt:  MRKRDFLNEKGF----SNRAGTLPEFISRVISQNKWQKFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVSFSSVDINRVYRLKEPLNPRGNDVIRN

Query:  PSAKQMKETLKLMANKGVQWKKSQTNVKTLMSSDLKPESTVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIRDEILDCGRKRADKLFFG
         +   +   L+ +A  G +W  S     T + S L P + VW HFLK+ L+PTTH  T+S DR++LL+ ++ G  INVG +I  EI  C  ++   LFF 
Subjt:  PSAKQMKETLKLMANKGVQWKKSQTNVKTLMSSDLKPESTVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIRDEILDCGRKRADKLFFG

Query:  SLITQLCQRVKIVPGKDEELHFFKPTIDLSLIGKLRQNNIQRKDKASTSQATPPSGPSMASPS
        SLIT+LC+  +     +EE       ID   + ++ Q     +    ++Q    S P+ AS S
Subjt:  SLITQLCQRVKIVPGKDEELHFFKPTIDLSLIGKLRQNNIQRKDKASTSQATPPSGPSMASPS

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]8.5e-3834.48Show/hide
Query:  MRKRDFLNEKGF----SNRAGTLPEFISRVISQNKWQKFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVSFSSVDINRVYRLKEPLNPRGNDVIRN
        ++ R    EKGF    S   G LP FI++VI+Q+ W++FCAHP++ +VPLVREFY  L +   +   VRG  VS+S   IN V+ L +P++   ++ I+N
Subjt:  MRKRDFLNEKGF----SNRAGTLPEFISRVISQNKWQKFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVSFSSVDINRVYRLKEPLNPRGNDVIRN

Query:  PSAKQMKETLKLMANKGVQWKKSQTNVKTLMSSDLKPESTVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIRDEILDCGRKRADKLFFG
         + + +   L+ +A  G +W  S     T + S L P + VW HFLK+RL+PTTH  T+S DR++LL+ ++ G  INVG +I  EI  C  ++   LFF 
Subjt:  PSAKQMKETLKLMANKGVQWKKSQTNVKTLMSSDLKPESTVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIRDEILDCGRKRADKLFFG

Query:  SLITQLCQRVKIVPGKDEELHFFKPTIDLSLIGKLRQNNIQRKDKASTSQATPPSGPSMASPSHHTSFTGPSPSSEAL--------AIAYHQPDQIKENL
        SLIT+LC+  +     +EE       ID   + ++ Q       +  T     PS    A+ S + +        +AL           YH    ++   
Subjt:  SLITQLCQRVKIVPGKDEELHFFKPTIDLSLIGKLRQNNIQRKDKASTSQATPPSGPSMASPSHHTSFTGPSPSSEAL--------AIAYHQPDQIKENL

Query:  K---TYWAYAKERDEAIRE
        K    +WAY+KERD A+++
Subjt:  K---TYWAYAKERDEAIRE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]1.5e-3134.91Show/hide
Query:  ASPKNPFPEVFRDVNFQERMEIMRKRDFLNEKGF---SNRAGTLPEFISRVISQNKWQKFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVSFSSVD
        AS    F     ++ ++E ++    R    EK F   +++    P FI+ VI Q+ WQ FCAHP++ +VPLVREFY  +         +RG  V  S   
Subjt:  ASPKNPFPEVFRDVNFQERMEIMRKRDFLNEKGF---SNRAGTLPEFISRVISQNKWQKFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVSFSSVD

Query:  INRVYRLKEPLNPRGNDVIRNPSAKQMKETLKLMANKGVQWKKSQTNVKTLMSSDLKPESTVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVG
        IN ++ L +P++   ++ + + +  ++   L+ +A  G +W  S     T + S L P + VW HFLK+RL+PTTH  T+S + V LLY ++ G  INVG
Subjt:  INRVYRLKEPLNPRGNDVIRNPSAKQMKETLKLMANKGVQWKKSQTNVKTLMSSDLKPESTVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVG

Query:  SIIRDEILDCGRKRADKLFFGSLITQLCQRVK
         +I  EI  C  +++  LFF SLIT +C+  +
Subjt:  SIIRDEILDCGRKRADKLFFGSLITQLCQRVK

TrEMBL top hitse value%identityAlignment
A0A2G9G807 Uncharacterized protein2.3e-2835.32Show/hide
Query:  EKGFSNRAGTLPEFISRVISQNKWQKFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVSFSSVDINRVYRLKEPLNPRGNDVIRN--PSAKQMKETL
        E+GF  +     E I   + + KW+ F A P+  V+PLVREFY    E      +VRG+ V F SV IN +Y +     P   D   N   +    +E  
Subjt:  EKGFSNRAGTLPEFISRVISQNKWQKFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVSFSSVDINRVYRLKEPLNPRGNDVIRN--PSAKQMKETL

Query:  KLMANKGVQWKKSQTNVKTLMSSDLKPESTVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIRDEILDCGRKRADKLFFGSLITQLCQRV
        + +   G QWK ++    +  S+ L   + +WL F+  R++PT H   ++ DR +LLYCIM G   +VG II D I+       D L+F SLIT+LC R 
Subjt:  KLMANKGVQWKKSQTNVKTLMSSDLKPESTVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIRDEILDCGRKRADKLFFGSLITQLCQRV

Query:  KIVPGKDEELHFFKPTID
         +   + EEL F +  ID
Subjt:  KIVPGKDEELHFFKPTID

A0A2P5AGA5 Uncharacterized protein (Fragment)4.2e-3536.88Show/hide
Query:  MRKRDFLNEKGF----SNRAGTLPEFISRVISQNKWQKFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVSFSSVDINRVYRLKEPLNPRGNDVIRN
        ++ R    EKGF    S   G LP FI++VI+Q+ W++FCAHP++ +VPLVREFY  L +   +   VRG  VS+S   IN V+ L +P++   ++ I N
Subjt:  MRKRDFLNEKGF----SNRAGTLPEFISRVISQNKWQKFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVSFSSVDINRVYRLKEPLNPRGNDVIRN

Query:  PSAKQMKETLKLMANKGVQWKKSQTNVKTLMSSDLKPESTVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIRDEILDCGRKRADKLFFG
         +   +   L+ +A  G +W  S     T + S L P + VW HFLK+ L+PTTH  T+S DR++LL+ ++ G  INVG +I  EI  C  ++   LFF 
Subjt:  PSAKQMKETLKLMANKGVQWKKSQTNVKTLMSSDLKPESTVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIRDEILDCGRKRADKLFFG

Query:  SLITQLCQRVKIVPGKDEELHFFKPTIDLSLIGKLRQNNIQRKDKASTSQATPPSGPSMASPS
        SLIT+LC+  +     +EE       ID   + ++ Q     +    ++Q    S P+ AS S
Subjt:  SLITQLCQRVKIVPGKDEELHFFKPTIDLSLIGKLRQNNIQRKDKASTSQATPPSGPSMASPS

A0A2P5BCG4 Uncharacterized protein (Fragment)4.1e-3834.48Show/hide
Query:  MRKRDFLNEKGF----SNRAGTLPEFISRVISQNKWQKFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVSFSSVDINRVYRLKEPLNPRGNDVIRN
        ++ R    EKGF    S   G LP FI++VI+Q+ W++FCAHP++ +VPLVREFY  L +   +   VRG  VS+S   IN V+ L +P++   ++ I+N
Subjt:  MRKRDFLNEKGF----SNRAGTLPEFISRVISQNKWQKFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVSFSSVDINRVYRLKEPLNPRGNDVIRN

Query:  PSAKQMKETLKLMANKGVQWKKSQTNVKTLMSSDLKPESTVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIRDEILDCGRKRADKLFFG
         + + +   L+ +A  G +W  S     T + S L P + VW HFLK+RL+PTTH  T+S DR++LL+ ++ G  INVG +I  EI  C  ++   LFF 
Subjt:  PSAKQMKETLKLMANKGVQWKKSQTNVKTLMSSDLKPESTVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIRDEILDCGRKRADKLFFG

Query:  SLITQLCQRVKIVPGKDEELHFFKPTIDLSLIGKLRQNNIQRKDKASTSQATPPSGPSMASPSHHTSFTGPSPSSEAL--------AIAYHQPDQIKENL
        SLIT+LC+  +     +EE       ID   + ++ Q       +  T     PS    A+ S + +        +AL           YH    ++   
Subjt:  SLITQLCQRVKIVPGKDEELHFFKPTIDLSLIGKLRQNNIQRKDKASTSQATPPSGPSMASPSHHTSFTGPSPSSEAL--------AIAYHQPDQIKENL

Query:  K---TYWAYAKERDEAIRE
        K    +WAY+KERD A+++
Subjt:  K---TYWAYAKERDEAIRE

A0A2P5DAQ2 Uncharacterized protein7.5e-3234.91Show/hide
Query:  ASPKNPFPEVFRDVNFQERMEIMRKRDFLNEKGF---SNRAGTLPEFISRVISQNKWQKFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVSFSSVD
        AS    F     ++ ++E ++    R    EK F   +++    P FI+ VI Q+ WQ FCAHP++ +VPLVREFY  +         +RG  V  S   
Subjt:  ASPKNPFPEVFRDVNFQERMEIMRKRDFLNEKGF---SNRAGTLPEFISRVISQNKWQKFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVSFSSVD

Query:  INRVYRLKEPLNPRGNDVIRNPSAKQMKETLKLMANKGVQWKKSQTNVKTLMSSDLKPESTVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVG
        IN ++ L +P++   ++ + + +  ++   L+ +A  G +W  S     T + S L P + VW HFLK+RL+PTTH  T+S + V LLY ++ G  INVG
Subjt:  INRVYRLKEPLNPRGNDVIRNPSAKQMKETLKLMANKGVQWKKSQTNVKTLMSSDLKPESTVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVG

Query:  SIIRDEILDCGRKRADKLFFGSLITQLCQRVK
         +I  EI  C  +++  LFF SLIT +C+  +
Subjt:  SIIRDEILDCGRKRADKLFFGSLITQLCQRVK

W9RBS1 Uncharacterized protein1.0e-2830.42Show/hide
Query:  FAKRPRTR-----AMDASPAVPPTISPAKPKGKSPKAASPKNPFPEVFRDVNFQERMEIMRKRDFLNEKGF-SNRAGTL--PEFISRVISQNKWQKFCAH
        FAKRP +      A+D + A  P+         S +  S    F +   +  ++E    +  R+ + EKGF  + + TL  P FIS VI    WQ FC H
Subjt:  FAKRPRTR-----AMDASPAVPPTISPAKPKGKSPKAASPKNPFPEVFRDVNFQERMEIMRKRDFLNEKGF-SNRAGTL--PEFISRVISQNKWQKFCAH

Query:  PQEVVVPLVREFYVGLREESISMAVVRGKMVSFSSVDINRVYRLKEPLNPRGNDVIRNPSAKQMKETLKLMANKGVQWKKSQTNVKTLMSSDLKPESTVW
        P + +VPLV+EFY  L+ +  +   V    ++F+S  IN V  +    +    ++I +   +Q+KE LK +A  G QW  S     T    +L+P + VW
Subjt:  PQEVVVPLVREFYVGLREESISMAVVRGKMVSFSSVDINRVYRLKEPLNPRGNDVIRNPSAKQMKETLKLMANKGVQWKKSQTNVKTLMSSDLKPESTVW

Query:  LHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIRDEILDCGRKRADKLFFGSLITQLCQRVKIVPGKDEELHFFKPTIDLSLIGKLRQNNIQR
         HFL +RL+ +TH  TIS +R +LLY ++ G  INVG +I D+I  C  K    L+F SLI++LC +  +     E        +DL  I ++     ++
Subjt:  LHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIRDEILDCGRKRADKLFFGSLITQLCQRVKIVPGKDEELHFFKPTIDLSLIGKLRQNNIQR

Query:  KDKA-STSQATPPSGPSMASPSHHTSFTGPSPSSEAL--AIAYHQPDQI---------KENLKTYWAYAKERDEAIRE
         +K     +   PS PS    + HT     + S E L   ++  +  Q          +E L  +W Y+++RD A+++
Subjt:  KDKA-STSQATPPSGPSMASPSHHTSFTGPSPSSEAL--AIAYHQPDQI---------KENLKTYWAYAKERDEAIRE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAATTCGCCCAAGCCTTCATCATCACGCAAGATCACTCGATCTCAGAGTAATCCAACCGCCCACGAAGCTGAAGCAAGTGTTCGATGGCAAGAGGAGCAACCCGA
AACACCCATGCACGGCACGAGAAGGACGAGACCCACGAGTTTCTCACAGACAATCGTGAACCAAGGTGCTTCCGTCGTTCAAACTCCTTCTTCTACAATGCCGGCCAGTT
CGAGGGAGAATTCGAGTTCATCTGCACCTAGGAGGTCCACGCACGCCACTGCCGTTCGCCAAAACCAAAAACCCGTCGCCCAACAATTCAAGAAACGCTCGCGGGAGTGG
TTTTCAATGATTGGGCAATGGGAGCTCAGAGACGTGCTGCTCTTAAAGAAGAAGGAAATAGGCGAGATGAAGAAGAAGCCGCCAAAGCAGCAAGAAGCTCTCGGCAAGGA
GAGGCTTCAACGGAGGAGTCATATGAAAGTTGTGCCCAAAAAGTCGCTTGTTATTAAGCCTCTCAAAGTAGCAAGAATGCCACTGGACATGTTCGAGGACATAATTCACC
AAGCTGTGGCAAAGGCCCTTGTGATTGCTGAAGGGTATAAGGCTGAACAAGAAGCCTTGAAGGATATTGAGGCTGAGAGAGAGATGGAAAATCAGCATATGAGGGAAGAA
GATGAAGGATTAGATGAAGACCTCAGGAGGGTAGCTGCTGATTTGCAACTCCTTGAGGAAGAAAAACACAGAAGGGAAGAATTGAAAGAAGACGAAGAAAGAAGGAAGGA
AGCTGAAGACTTCCTTGCAGCTTTTGATCCACTCCACAAGGCTCAAAGTGAGGCTGAAGTGCTGCAAGGAAGGGAAGAAAAGGCCCAACCGGGGCCAAGTGATGAGAATT
TAGAAAAAGAAAAAGAAACAGAAGAAGTACTTGAAGCCCAAAATGCAACCGCATCTGGGCCGCATTCTGAAGAAGGCCTGGCAGAGGCCACTGAAGTTCAGCCTGCTGAT
GAGGTTTTCGAACCTCTATTCAAATATGATCTACTAGCAGCTGATAGCACCTCTTCGGGAGAGAAGAGGGATGAAGAAGAGAAAGAAATTAAGGAGGTCGAGACCTCTAG
TGACTCTGAAACAGAATCCGACTCAGAGATCAAGGAGCTGGATGACGACCAAGTTCCTATCTCTGCAGCTTTGAGGAGAAAGAGAAGAAGAGATATTAAAGCTGAAAGGA
GGACTAAGAACAAAAATGACCCGATATTTGCCAAGAGGCCGAGGACTAGGGCCATGGACGCCTCTCCAGCAGTTCCTCCTACCATCTCACCTGCCAAGCCAAAGGGTAAA
TCACCCAAGGCTGCATCTCCTAAAAATCCATTCCCCGAGGTATTCAGAGATGTTAATTTTCAGGAACGGATGGAGATCATGAGAAAGAGAGATTTCCTCAACGAGAAGGG
ATTCTCTAACAGAGCGGGAACACTGCCAGAGTTCATAAGCAGAGTTATCTCACAGAACAAGTGGCAGAAGTTCTGTGCTCACCCTCAGGAGGTTGTCGTGCCTTTAGTTC
GTGAATTTTACGTCGGCTTGAGGGAGGAAAGCATCAGTATGGCGGTGGTGAGAGGCAAAATGGTCAGCTTCTCTTCTGTAGACATTAACAGGGTGTACAGACTCAAAGAA
CCCTTGAATCCAAGAGGGAACGATGTTATCAGGAACCCCTCGGCCAAGCAAATGAAAGAAACACTTAAACTCATGGCCAACAAGGGAGTTCAGTGGAAAAAGTCCCAAAC
GAATGTGAAGACTCTAATGTCAAGCGATCTAAAGCCAGAATCGACAGTTTGGCTTCACTTTCTGAAGAACCGTTTGATGCCAACCACCCACGACAGCACGATCTCAGTAG
ATAGAGTTATGCTACTCTACTGCATCATGAAGGGGTTGGAGATCAATGTTGGGAGCATAATCAGGGATGAGATTCTAGACTGTGGAAGAAAAAGAGCAGATAAACTTTTC
TTTGGATCACTCATCACCCAGCTCTGTCAGAGGGTGAAGATAGTTCCAGGCAAGGACGAGGAGCTTCATTTCTTCAAGCCGACCATTGACCTGTCCTTGATCGGGAAGCT
CCGACAGAACAACATCCAGAGGAAAGACAAAGCCTCCACATCTCAGGCCACTCCACCATCAGGGCCGAGCATGGCTTCTCCATCCCATCACACTTCTTTTACAGGACCCT
CACCGTCATCGGAAGCCCTAGCTATTGCCTACCACCAGCCAGATCAAATCAAGGAAAACCTGAAGACATATTGGGCATATGCAAAGGAGAGGGATGAAGCCATTAGAGAA
GGAAATGCTGAAATCTGCCCAGAAATGCGACCGCATTTCTGGGAAGGAAAAATGAAATGCGATCGCATTTCTGGAAAAACAGAGGCAGTTCCGAGTCGTCTACAGTGTAT
ACTTGCACTAGGATTGGCTGAATTTATTTTACTATTATCTATCATCTTATCTCACATCCATCATTCAGCCGACGACGCACCACAGGTGCAGGCAGTGACTCTAAGGAGTG
GTAAGCCACTAGAAGAAAGAAGAGAGTCTAGTAAACCCCAGGATGTAGAGGAAAATAGTAATAAAAATGTTGTTGTTGAGAAAGAGTTGGAGTCTGGTAAAAATGATGGA
GGCAACAATAGTAATGCTGGAGCATCTGATGCAATGATCCAAAGCATCCATGTAGGAGACATGCGAGTGTGGGGGTTGAAGGATTGGAATTCCCAATTCCGGACCAAATG
TGAAATCTTGGGACACCACCACAAGATTACCCCGCTATTCTCTGAGTTTGGAAATGAGATTGTGGGATCTCATATTCTTGAGAGGGAATGTTTTACAGAGAAGTTCTTCA
TTGTATTTCTTGTTGAGAGAGAAAATGAGAGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAAAATTCGCCCAAGCCTTCATCATCACGCAAGATCACTCGATCTCAGAGTAATCCAACCGCCCACGAAGCTGAAGCAAGTGTTCGATGGCAAGAGGAGCAACCCGA
AACACCCATGCACGGCACGAGAAGGACGAGACCCACGAGTTTCTCACAGACAATCGTGAACCAAGGTGCTTCCGTCGTTCAAACTCCTTCTTCTACAATGCCGGCCAGTT
CGAGGGAGAATTCGAGTTCATCTGCACCTAGGAGGTCCACGCACGCCACTGCCGTTCGCCAAAACCAAAAACCCGTCGCCCAACAATTCAAGAAACGCTCGCGGGAGTGG
TTTTCAATGATTGGGCAATGGGAGCTCAGAGACGTGCTGCTCTTAAAGAAGAAGGAAATAGGCGAGATGAAGAAGAAGCCGCCAAAGCAGCAAGAAGCTCTCGGCAAGGA
GAGGCTTCAACGGAGGAGTCATATGAAAGTTGTGCCCAAAAAGTCGCTTGTTATTAAGCCTCTCAAAGTAGCAAGAATGCCACTGGACATGTTCGAGGACATAATTCACC
AAGCTGTGGCAAAGGCCCTTGTGATTGCTGAAGGGTATAAGGCTGAACAAGAAGCCTTGAAGGATATTGAGGCTGAGAGAGAGATGGAAAATCAGCATATGAGGGAAGAA
GATGAAGGATTAGATGAAGACCTCAGGAGGGTAGCTGCTGATTTGCAACTCCTTGAGGAAGAAAAACACAGAAGGGAAGAATTGAAAGAAGACGAAGAAAGAAGGAAGGA
AGCTGAAGACTTCCTTGCAGCTTTTGATCCACTCCACAAGGCTCAAAGTGAGGCTGAAGTGCTGCAAGGAAGGGAAGAAAAGGCCCAACCGGGGCCAAGTGATGAGAATT
TAGAAAAAGAAAAAGAAACAGAAGAAGTACTTGAAGCCCAAAATGCAACCGCATCTGGGCCGCATTCTGAAGAAGGCCTGGCAGAGGCCACTGAAGTTCAGCCTGCTGAT
GAGGTTTTCGAACCTCTATTCAAATATGATCTACTAGCAGCTGATAGCACCTCTTCGGGAGAGAAGAGGGATGAAGAAGAGAAAGAAATTAAGGAGGTCGAGACCTCTAG
TGACTCTGAAACAGAATCCGACTCAGAGATCAAGGAGCTGGATGACGACCAAGTTCCTATCTCTGCAGCTTTGAGGAGAAAGAGAAGAAGAGATATTAAAGCTGAAAGGA
GGACTAAGAACAAAAATGACCCGATATTTGCCAAGAGGCCGAGGACTAGGGCCATGGACGCCTCTCCAGCAGTTCCTCCTACCATCTCACCTGCCAAGCCAAAGGGTAAA
TCACCCAAGGCTGCATCTCCTAAAAATCCATTCCCCGAGGTATTCAGAGATGTTAATTTTCAGGAACGGATGGAGATCATGAGAAAGAGAGATTTCCTCAACGAGAAGGG
ATTCTCTAACAGAGCGGGAACACTGCCAGAGTTCATAAGCAGAGTTATCTCACAGAACAAGTGGCAGAAGTTCTGTGCTCACCCTCAGGAGGTTGTCGTGCCTTTAGTTC
GTGAATTTTACGTCGGCTTGAGGGAGGAAAGCATCAGTATGGCGGTGGTGAGAGGCAAAATGGTCAGCTTCTCTTCTGTAGACATTAACAGGGTGTACAGACTCAAAGAA
CCCTTGAATCCAAGAGGGAACGATGTTATCAGGAACCCCTCGGCCAAGCAAATGAAAGAAACACTTAAACTCATGGCCAACAAGGGAGTTCAGTGGAAAAAGTCCCAAAC
GAATGTGAAGACTCTAATGTCAAGCGATCTAAAGCCAGAATCGACAGTTTGGCTTCACTTTCTGAAGAACCGTTTGATGCCAACCACCCACGACAGCACGATCTCAGTAG
ATAGAGTTATGCTACTCTACTGCATCATGAAGGGGTTGGAGATCAATGTTGGGAGCATAATCAGGGATGAGATTCTAGACTGTGGAAGAAAAAGAGCAGATAAACTTTTC
TTTGGATCACTCATCACCCAGCTCTGTCAGAGGGTGAAGATAGTTCCAGGCAAGGACGAGGAGCTTCATTTCTTCAAGCCGACCATTGACCTGTCCTTGATCGGGAAGCT
CCGACAGAACAACATCCAGAGGAAAGACAAAGCCTCCACATCTCAGGCCACTCCACCATCAGGGCCGAGCATGGCTTCTCCATCCCATCACACTTCTTTTACAGGACCCT
CACCGTCATCGGAAGCCCTAGCTATTGCCTACCACCAGCCAGATCAAATCAAGGAAAACCTGAAGACATATTGGGCATATGCAAAGGAGAGGGATGAAGCCATTAGAGAA
GGAAATGCTGAAATCTGCCCAGAAATGCGACCGCATTTCTGGGAAGGAAAAATGAAATGCGATCGCATTTCTGGAAAAACAGAGGCAGTTCCGAGTCGTCTACAGTGTAT
ACTTGCACTAGGATTGGCTGAATTTATTTTACTATTATCTATCATCTTATCTCACATCCATCATTCAGCCGACGACGCACCACAGGTGCAGGCAGTGACTCTAAGGAGTG
GTAAGCCACTAGAAGAAAGAAGAGAGTCTAGTAAACCCCAGGATGTAGAGGAAAATAGTAATAAAAATGTTGTTGTTGAGAAAGAGTTGGAGTCTGGTAAAAATGATGGA
GGCAACAATAGTAATGCTGGAGCATCTGATGCAATGATCCAAAGCATCCATGTAGGAGACATGCGAGTGTGGGGGTTGAAGGATTGGAATTCCCAATTCCGGACCAAATG
TGAAATCTTGGGACACCACCACAAGATTACCCCGCTATTCTCTGAGTTTGGAAATGAGATTGTGGGATCTCATATTCTTGAGAGGGAATGTTTTACAGAGAAGTTCTTCA
TTGTATTTCTTGTTGAGAGAGAAAATGAGAGTTAG
Protein sequenceShow/hide protein sequence
MKNSPKPSSSRKITRSQSNPTAHEAEASVRWQEEQPETPMHGTRRTRPTSFSQTIVNQGASVVQTPSSTMPASSRENSSSSAPRRSTHATAVRQNQKPVAQQFKKRSREW
FSMIGQWELRDVLLLKKKEIGEMKKKPPKQQEALGKERLQRRSHMKVVPKKSLVIKPLKVARMPLDMFEDIIHQAVAKALVIAEGYKAEQEALKDIEAEREMENQHMREE
DEGLDEDLRRVAADLQLLEEEKHRREELKEDEERRKEAEDFLAAFDPLHKAQSEAEVLQGREEKAQPGPSDENLEKEKETEEVLEAQNATASGPHSEEGLAEATEVQPAD
EVFEPLFKYDLLAADSTSSGEKRDEEEKEIKEVETSSDSETESDSEIKELDDDQVPISAALRRKRRRDIKAERRTKNKNDPIFAKRPRTRAMDASPAVPPTISPAKPKGK
SPKAASPKNPFPEVFRDVNFQERMEIMRKRDFLNEKGFSNRAGTLPEFISRVISQNKWQKFCAHPQEVVVPLVREFYVGLREESISMAVVRGKMVSFSSVDINRVYRLKE
PLNPRGNDVIRNPSAKQMKETLKLMANKGVQWKKSQTNVKTLMSSDLKPESTVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLEINVGSIIRDEILDCGRKRADKLF
FGSLITQLCQRVKIVPGKDEELHFFKPTIDLSLIGKLRQNNIQRKDKASTSQATPPSGPSMASPSHHTSFTGPSPSSEALAIAYHQPDQIKENLKTYWAYAKERDEAIRE
GNAEICPEMRPHFWEGKMKCDRISGKTEAVPSRLQCILALGLAEFILLLSIILSHIHHSADDAPQVQAVTLRSGKPLEERRESSKPQDVEENSNKNVVVEKELESGKNDG
GNNSNAGASDAMIQSIHVGDMRVWGLKDWNSQFRTKCEILGHHHKITPLFSEFGNEIVGSHILERECFTEKFFIVFLVERENES