; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg000124 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg000124
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionNucleolar protein 58-like
Genome locationscaffold6:21122150..21124786
RNA-Seq ExpressionSpg000124
SyntenySpg000124
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB49850.1 hypothetical protein L484_000844 [Morus notabilis]1.8e-2531.27Show/hide
Query:  WQEFCAHPQEVVVPLVREFYAGLREESISMAVVRAKMVSFSSVDINWVYRLKAPLNPRGND----VIRNLSANQMKEALKLVANKGVKWKKSQMKVKTLV
        WQ FC HP + +VPLV+EFYA L+ +  +   V    ++F+S  IN V  +     P  +D    +I +    Q+KE LK +A  G +W  S     T  
Subjt:  WQEFCAHPQEVVVPLVREFYAGLREESISMAVVRAKMVSFSSVDINWVYRLKAPLNPRGND----VIRNLSANQMKEALKLVANKGVKWKKSQMKVKTLV

Query:  PSDLKSESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKWLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKLTIELSL
          +L+  + VW HFL +RL+ +TH  TIS +R +LLY ++    IN+G +I D+I AC  K  G L+F SLI++LC +  +     E R      ++L  
Subjt:  PSDLKSESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKWLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKLTIELSL

Query:  IGKLQHKSLQRKDKASTSQD-TPPSGPSMASPSQHTPFTGPSPSSEALS-----------IAYRQLDQIKENLKTYWAYAKERDKAIREFY
        I ++     ++ +K    ++   PS PS    + HT     + S E L              +  L Q +E L  +W Y+++RD A+++ +
Subjt:  IGKLQHKSLQRKDKASTSQD-TPPSGPSMASPSQHTPFTGPSPSSEALS-----------IAYRQLDQIKENLKTYWAYAKERDKAIREFY

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]1.9e-3036.17Show/hide
Query:  LISQYKWQEFCAHPQEVVVPLVREFYAGLREESISMAVVRAKMVSFSSVDINWVYRLKAPLNPRGNDVIRNLSANQMKEALKLVANKGVKWKKSQMKVKT
        +I+Q+ W++FCAHP++ +VPLVREFYA L +   +   VR   VS+S   IN V+ L  P++   ++ I N++ + +   L+ VA  G +W  S     T
Subjt:  LISQYKWQEFCAHPQEVVVPLVREFYAGLREESISMAVVRAKMVSFSSVDINWVYRLKAPLNPRGNDVIRNLSANQMKEALKLVANKGVKWKKSQMKVKT

Query:  LVPSDLKSESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKWLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKLTIEL
         + S L   + VW HFLK+ L+PTTH  T+S DR++LL+ ++    IN+G +I  EI AC  ++ G LFF SLIT+LC+  +     +EE+     T E+
Subjt:  LVPSDLKSESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKWLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKLTIEL

Query:  SLIGKLQHKSLQRKDKASTSQDTPPSGPSMASPSQ
          I   +   + ++    ++Q    S P+ AS S+
Subjt:  SLIGKLQHKSLQRKDKASTSQDTPPSGPSMASPSQ

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]3.1e-3332.53Show/hide
Query:  LISQYKWQEFCAHPQEVVVPLVREFYAGLREESISMAVVRAKMVSFSSVDINWVYRLKAPLNPRGNDVIRNLSANQMKEALKLVANKGVKWKKSQMKVKT
        +I+Q+ W++FCAHP++ +VPLVREFYA L +   +   VR   VS+S   IN V+ L  P++   ++ I+N++   +   L+ VA  G +W  S     T
Subjt:  LISQYKWQEFCAHPQEVVVPLVREFYAGLREESISMAVVRAKMVSFSSVDINWVYRLKAPLNPRGNDVIRNLSANQMKEALKLVANKGVKWKKSQMKVKT

Query:  LVPSDLKSESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKWLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKLTIEL
         + S L   + VW HFLK+RL+PTTH  T+S DR++LL+ ++    IN+G +I  EI AC  ++ G LFF SLIT+LC+  +     +EE+      I+ 
Subjt:  LVPSDLKSESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKWLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKLTIEL

Query:  SLIGKLQHKSLQRKDKASTSQDTPPSGPSMASPSQHTPFTGPSPSSEALSIAYRQ-----------LDQIKENLKTYWAYAKERDKAIREFYLSIAPSTA
          + ++     Q     ST Q   PS    A+ S +          +AL     Q           L    +  + +WAY+KERD A+++   +      
Subjt:  SLIGKLQHKSLQRKDKASTSQDTPPSGPSMASPSQHTPFTGPSPSSEALSIAYRQ-----------LDQIKENLKTYWAYAKERDKAIREFYLSIAPSTA

Query:  LVFPNFPQSSMPQEDKDSDEEEDENNDDEYEE
          FP FPQ  +   D + + E D++  +E  E
Subjt:  LVFPNFPQSSMPQEDKDSDEEEDENNDDEYEE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]1.8e-2836.65Show/hide
Query:  LISQYKWQEFCAHPQEVVVPLVREFYAGLREESISMAVVRAKMVSFSSVDINWVYRLKAPLNPRGNDVIRNLSANQMKEALKLVANKGVKWKKSQMKVKT
        +I Q+ WQ FCAHP++ +VPLVREFY  +         +R   V  S   IN ++ L  P++   ++ + +++  ++   L+ VA  G +W  S     T
Subjt:  LISQYKWQEFCAHPQEVVVPLVREFYAGLREESISMAVVRAKMVSFSSVDINWVYRLKAPLNPRGNDVIRNLSANQMKEALKLVANKGVKWKKSQMKVKT

Query:  LVPSDLKSESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKWLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEER
         + S L   + VW HFLK+RL+PTTH  T+S + V LLY ++    IN+G +I  EI AC  +++G LFF SLIT +C+  +     +EE+
Subjt:  LVPSDLKSESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKWLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEER

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]2.8e-2631.35Show/hide
Query:  VPLVREFYAGLREESISMAVVRAKMVSFSSVDINWVYRLKAPLNPRGNDVIRNLSANQMKEALKLVANKGVKWKKSQMKVKTLVPSDLKSESAVWLHFLK
        +PLVREFYA L +   +   VR   VS+S   IN V+ L  P++   ++ I N++  ++   L+ VA  G +W  S     T + S L   + VW HFLK
Subjt:  VPLVREFYAGLREESISMAVVRAKMVSFSSVDINWVYRLKAPLNPRGNDVIRNLSANQMKEALKLVANKGVKWKKSQMKVKTLVPSDLKSESAVWLHFLK

Query:  NRLMPTTHDSTISVDRVMLLYCIMKWLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKLTIELSLIGKLQHKSLQRKDKAS
        +RL+PTTH   +S DR++LL+ ++    IN+G +I  EI AC  ++ G LFF SLIT+LC+    +  +++  +    T E+  I   +   + ++    
Subjt:  NRLMPTTHDSTISVDRVMLLYCIMKWLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKLTIELSLIGKLQHKSLQRKDKAS

Query:  TSQDTPPSGPSMASPSQHTPFTGPSPSSEALSIAYRQLDQIKENLKTYWAYAKERDKAIREFYLSIAPSTALVFPNFPQSSMPQEDKDSDEEEDENNDDE
        ++Q    S P+ AS S+           +AL     Q +   +  + +WAY+KERD A+++   +        FP FPQ  +   D + + E D++  +E
Subjt:  TSQDTPPSGPSMASPSQHTPFTGPSPSSEALSIAYRQLDQIKENLKTYWAYAKERDKAIREFYLSIAPSTALVFPNFPQSSMPQEDKDSDEEEDENNDDE

Query:  YEE
          E
Subjt:  YEE

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)9.2e-3136.17Show/hide
Query:  LISQYKWQEFCAHPQEVVVPLVREFYAGLREESISMAVVRAKMVSFSSVDINWVYRLKAPLNPRGNDVIRNLSANQMKEALKLVANKGVKWKKSQMKVKT
        +I+Q+ W++FCAHP++ +VPLVREFYA L +   +   VR   VS+S   IN V+ L  P++   ++ I N++ + +   L+ VA  G +W  S     T
Subjt:  LISQYKWQEFCAHPQEVVVPLVREFYAGLREESISMAVVRAKMVSFSSVDINWVYRLKAPLNPRGNDVIRNLSANQMKEALKLVANKGVKWKKSQMKVKT

Query:  LVPSDLKSESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKWLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKLTIEL
         + S L   + VW HFLK+ L+PTTH  T+S DR++LL+ ++    IN+G +I  EI AC  ++ G LFF SLIT+LC+  +     +EE+     T E+
Subjt:  LVPSDLKSESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKWLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKLTIEL

Query:  SLIGKLQHKSLQRKDKASTSQDTPPSGPSMASPSQ
          I   +   + ++    ++Q    S P+ AS S+
Subjt:  SLIGKLQHKSLQRKDKASTSQDTPPSGPSMASPSQ

A0A2P5BCG4 Uncharacterized protein (Fragment)1.5e-3332.53Show/hide
Query:  LISQYKWQEFCAHPQEVVVPLVREFYAGLREESISMAVVRAKMVSFSSVDINWVYRLKAPLNPRGNDVIRNLSANQMKEALKLVANKGVKWKKSQMKVKT
        +I+Q+ W++FCAHP++ +VPLVREFYA L +   +   VR   VS+S   IN V+ L  P++   ++ I+N++   +   L+ VA  G +W  S     T
Subjt:  LISQYKWQEFCAHPQEVVVPLVREFYAGLREESISMAVVRAKMVSFSSVDINWVYRLKAPLNPRGNDVIRNLSANQMKEALKLVANKGVKWKKSQMKVKT

Query:  LVPSDLKSESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKWLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKLTIEL
         + S L   + VW HFLK+RL+PTTH  T+S DR++LL+ ++    IN+G +I  EI AC  ++ G LFF SLIT+LC+  +     +EE+      I+ 
Subjt:  LVPSDLKSESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKWLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKLTIEL

Query:  SLIGKLQHKSLQRKDKASTSQDTPPSGPSMASPSQHTPFTGPSPSSEALSIAYRQ-----------LDQIKENLKTYWAYAKERDKAIREFYLSIAPSTA
          + ++     Q     ST Q   PS    A+ S +          +AL     Q           L    +  + +WAY+KERD A+++   +      
Subjt:  SLIGKLQHKSLQRKDKASTSQDTPPSGPSMASPSQHTPFTGPSPSSEALSIAYRQ-----------LDQIKENLKTYWAYAKERDKAIREFYLSIAPSTA

Query:  LVFPNFPQSSMPQEDKDSDEEEDENNDDEYEE
          FP FPQ  +   D + + E D++  +E  E
Subjt:  LVFPNFPQSSMPQEDKDSDEEEDENNDDEYEE

A0A2P5DAQ2 Uncharacterized protein8.6e-2936.65Show/hide
Query:  LISQYKWQEFCAHPQEVVVPLVREFYAGLREESISMAVVRAKMVSFSSVDINWVYRLKAPLNPRGNDVIRNLSANQMKEALKLVANKGVKWKKSQMKVKT
        +I Q+ WQ FCAHP++ +VPLVREFY  +         +R   V  S   IN ++ L  P++   ++ + +++  ++   L+ VA  G +W  S     T
Subjt:  LISQYKWQEFCAHPQEVVVPLVREFYAGLREESISMAVVRAKMVSFSSVDINWVYRLKAPLNPRGNDVIRNLSANQMKEALKLVANKGVKWKKSQMKVKT

Query:  LVPSDLKSESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKWLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEER
         + S L   + VW HFLK+RL+PTTH  T+S + V LLY ++    IN+G +I  EI AC  +++G LFF SLIT +C+  +     +EE+
Subjt:  LVPSDLKSESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKWLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEER

A0A2P5DXM3 Uncharacterized protein1.4e-2631.35Show/hide
Query:  VPLVREFYAGLREESISMAVVRAKMVSFSSVDINWVYRLKAPLNPRGNDVIRNLSANQMKEALKLVANKGVKWKKSQMKVKTLVPSDLKSESAVWLHFLK
        +PLVREFYA L +   +   VR   VS+S   IN V+ L  P++   ++ I N++  ++   L+ VA  G +W  S     T + S L   + VW HFLK
Subjt:  VPLVREFYAGLREESISMAVVRAKMVSFSSVDINWVYRLKAPLNPRGNDVIRNLSANQMKEALKLVANKGVKWKKSQMKVKTLVPSDLKSESAVWLHFLK

Query:  NRLMPTTHDSTISVDRVMLLYCIMKWLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKLTIELSLIGKLQHKSLQRKDKAS
        +RL+PTTH   +S DR++LL+ ++    IN+G +I  EI AC  ++ G LFF SLIT+LC+    +  +++  +    T E+  I   +   + ++    
Subjt:  NRLMPTTHDSTISVDRVMLLYCIMKWLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKLTIELSLIGKLQHKSLQRKDKAS

Query:  TSQDTPPSGPSMASPSQHTPFTGPSPSSEALSIAYRQLDQIKENLKTYWAYAKERDKAIREFYLSIAPSTALVFPNFPQSSMPQEDKDSDEEEDENNDDE
        ++Q    S P+ AS S+           +AL     Q +   +  + +WAY+KERD A+++   +        FP FPQ  +   D + + E D++  +E
Subjt:  TSQDTPPSGPSMASPSQHTPFTGPSPSSEALSIAYRQLDQIKENLKTYWAYAKERDKAIREFYLSIAPSTALVFPNFPQSSMPQEDKDSDEEEDENNDDE

Query:  YEE
          E
Subjt:  YEE

W9RBS1 Uncharacterized protein8.9e-2631.27Show/hide
Query:  WQEFCAHPQEVVVPLVREFYAGLREESISMAVVRAKMVSFSSVDINWVYRLKAPLNPRGND----VIRNLSANQMKEALKLVANKGVKWKKSQMKVKTLV
        WQ FC HP + +VPLV+EFYA L+ +  +   V    ++F+S  IN V  +     P  +D    +I +    Q+KE LK +A  G +W  S     T  
Subjt:  WQEFCAHPQEVVVPLVREFYAGLREESISMAVVRAKMVSFSSVDINWVYRLKAPLNPRGND----VIRNLSANQMKEALKLVANKGVKWKKSQMKVKTLV

Query:  PSDLKSESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKWLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKLTIELSL
          +L+  + VW HFL +RL+ +TH  TIS +R +LLY ++    IN+G +I D+I AC  K  G L+F SLI++LC +  +     E R      ++L  
Subjt:  PSDLKSESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKWLEINIGSIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPGKDEERHFFKLTIELSL

Query:  IGKLQHKSLQRKDKASTSQD-TPPSGPSMASPSQHTPFTGPSPSSEALS-----------IAYRQLDQIKENLKTYWAYAKERDKAIREFY
        I ++     ++ +K    ++   PS PS    + HT     + S E L              +  L Q +E L  +W Y+++RD A+++ +
Subjt:  IGKLQHKSLQRKDKASTSQD-TPPSGPSMASPSQHTPFTGPSPSSEALS-----------IAYRQLDQIKENLKTYWAYAKERDKAIREFY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAACAATCCAAAATCATCATCATCACTTAAGATCACTCGATCTCAGAGTGCTCAAACCGCCCAAGAAGCTGAAGCACATGCGCAACGCCAAGAGGAGCAACCCGA
AACACCCATGCAAGGCACGAGAAGGACGAGATGCACGGGTTTCTCGCTGGCGATTGTGGACCAAGGTACTTCCAGAGTTCAAACTTCTTCTTCCTCGACAATTTCGGCCA
CCTTGAGGGAGAATCTGAGTTCCTCTCAACCTAGGAGGTCCACGCGTGCTACTGCCGTCCACAAAACCCAAAAACCCACAACTCGACAGTACAGAAAACGCTCGCAGGAG
TGGTTTTCAATGATCCGAGCAATGGGAGCTCAGAGACGGGCTGCTGTTGAAGAAGAAGTGAATAGGCAAGATGAAGAAGAAGCCGCCAAGGCAGCAGGAAGCTCTCGGCA
AAGGGAGGCTTCAACGGGTAAACATTCTAAACCTTCAACTAACCCCTCTTCGTCTTGCAGGAACAAACCATTTGTTACCTACAGTGCAAGGAGGAGGAGTCCCAAGAAAG
TTGTTCCCAAAAAGCCACTTGTAATTGAGTCTCTCAAAGTAGCAAGAATGCCCCCGGACGTGTTCGAAGGCATAATTTGCCAAGCCGTGGCAAAGGCCCTTGTGATTGCC
GAAGGGTATAAGGCTGAACAAGAAGCCTTGAAGGATATTGAGGCTGAGAGAGAGATGGAAAATCAACACATGAGGGAAGAAGATGAGGGTGCAAGAGAAAGAGATCTTGA
AGAAGAGAGGAAAAAGGAAGAAGAAAAGCCAAGAGAAAAACAAAGAAGGAAAGAGTTAAAGAAAGATGAAGAAAGAAGGAAGGAAGCGGAAGACTTCATTGCAGCTTTTG
AGCCACTCCACAAGGCTCAAAGTGAGGCTGAGATGCTGCAAGGGAAAGAAGAAAAGGCCCAACAAGGGCCAACTGAAGGAAATTCAGAAAAAGGAAAAGAAAGAGAAGTA
GAGGATGAAGGCCAGAATGTGACCGCATCTGGGTCGCATTCTGAGGAAGGCCAAAGAACGACCACTGAAGCTCAGCCAGCTGATGAGGTTTTCGAACCTCTATTCAAATA
TGATCCACCAGCAGATGATAGCACCTCTTCGAGAGAGAAGAGGGATGAAAAAAAAGAGAACAAGGAGGCCGAGACCTCTAGTGATTCAGAAACAGAATCCGACTCAGAGA
TCAAGGAGCTGGATGATGACCAAGTTCCCATCTCTGCGGCTTTGACTAGAAAGAGAAGAAGAGAGATTAAAGTTGAACGGAGGACCAAAAACAAGAATGACCCTATTTTT
TCCAAGAGGCCGAGGACGAGGTCTATGGACGCCTCTCCAACAGCTCCTCCTACCGTCTCGCCTGCCAAGTCAAAAGCCAAATCACCCAAGGCTGCATCTCTTAAAAATCC
ATTCCCCGGGAGTGGGAACACTGCCAGAGTTCGTAAGCAGAGTTATCTCATCTCACAATACAAGTGGCAGGAGTTCTGTGCTCACCCTCAGGAGGTTGTGGTACCTTTAG
TTCGAGAATTTTACGCCGGCCTGAGGGAGGAAAGCATAAGTATGGCGGTGGTGAGAGCCAAAATGGTCAGCTTCTCTTCTGTAGACATTAACTGGGTGTACAGACTCAAA
GCACCCTTGAATCCAAGAGGGAACGATGTTATCAGGAACCTCTCGGCCAACCAAATGAAAGAAGCATTGAAACTAGTGGCCAACAAGGGAGTTAAGTGGAAAAAATCCCA
AATGAAGGTGAAGACTCTAGTGCCAAGCGATCTAAAGTCAGAATCGGCAGTTTGGCTTCACTTTCTGAAGAATAGATTGATGCCAACCACCCACGATAGTACCATCTCAG
TAGATAGAGTTATGCTACTCTACTGTATTATGAAGTGGTTGGAGATCAATATTGGGAGCATAATCAGGGATGAAATTCTAGCCTGTGGAAGAAAAAGAGCAGGTAAACTT
TTCTTTGGATCACTCATCACCCAACTTTGTCAAAGGGTGAAGATAGTTCCTGGCAAGGACGAGGAGCGTCATTTCTTCAAGCTTACCATTGAATTGTCCTTAATCGGGAA
GCTTCAACATAAGAGCCTCCAAAGGAAAGACAAAGCCTCCACATCTCAGGACACTCCACCATCAGGGCCGAGCATGGCTTCTCCATCCCAACACACTCCTTTTACAGGGC
CCTCACCATCGTCTGAAGCCCTATCCATTGCCTACCGTCAGCTAGATCAAATCAAGGAAAACCTGAAGACATATTGGGCATATGCAAAGGAGAGAGATAAAGCCATTAGA
GAGTTTTATCTCTCTATCGCCCCGAGTACTGCTCTGGTCTTTCCCAATTTCCCTCAATCGTCGATGCCTCAAGAAGATAAGGACTCTGATGAAGAAGAAGATGAGAATAA
TGATGATGAATATGAAGAGAAAGAGAGTTCCTCAAACGAGGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGAACAATCCAAAATCATCATCATCACTTAAGATCACTCGATCTCAGAGTGCTCAAACCGCCCAAGAAGCTGAAGCACATGCGCAACGCCAAGAGGAGCAACCCGA
AACACCCATGCAAGGCACGAGAAGGACGAGATGCACGGGTTTCTCGCTGGCGATTGTGGACCAAGGTACTTCCAGAGTTCAAACTTCTTCTTCCTCGACAATTTCGGCCA
CCTTGAGGGAGAATCTGAGTTCCTCTCAACCTAGGAGGTCCACGCGTGCTACTGCCGTCCACAAAACCCAAAAACCCACAACTCGACAGTACAGAAAACGCTCGCAGGAG
TGGTTTTCAATGATCCGAGCAATGGGAGCTCAGAGACGGGCTGCTGTTGAAGAAGAAGTGAATAGGCAAGATGAAGAAGAAGCCGCCAAGGCAGCAGGAAGCTCTCGGCA
AAGGGAGGCTTCAACGGGTAAACATTCTAAACCTTCAACTAACCCCTCTTCGTCTTGCAGGAACAAACCATTTGTTACCTACAGTGCAAGGAGGAGGAGTCCCAAGAAAG
TTGTTCCCAAAAAGCCACTTGTAATTGAGTCTCTCAAAGTAGCAAGAATGCCCCCGGACGTGTTCGAAGGCATAATTTGCCAAGCCGTGGCAAAGGCCCTTGTGATTGCC
GAAGGGTATAAGGCTGAACAAGAAGCCTTGAAGGATATTGAGGCTGAGAGAGAGATGGAAAATCAACACATGAGGGAAGAAGATGAGGGTGCAAGAGAAAGAGATCTTGA
AGAAGAGAGGAAAAAGGAAGAAGAAAAGCCAAGAGAAAAACAAAGAAGGAAAGAGTTAAAGAAAGATGAAGAAAGAAGGAAGGAAGCGGAAGACTTCATTGCAGCTTTTG
AGCCACTCCACAAGGCTCAAAGTGAGGCTGAGATGCTGCAAGGGAAAGAAGAAAAGGCCCAACAAGGGCCAACTGAAGGAAATTCAGAAAAAGGAAAAGAAAGAGAAGTA
GAGGATGAAGGCCAGAATGTGACCGCATCTGGGTCGCATTCTGAGGAAGGCCAAAGAACGACCACTGAAGCTCAGCCAGCTGATGAGGTTTTCGAACCTCTATTCAAATA
TGATCCACCAGCAGATGATAGCACCTCTTCGAGAGAGAAGAGGGATGAAAAAAAAGAGAACAAGGAGGCCGAGACCTCTAGTGATTCAGAAACAGAATCCGACTCAGAGA
TCAAGGAGCTGGATGATGACCAAGTTCCCATCTCTGCGGCTTTGACTAGAAAGAGAAGAAGAGAGATTAAAGTTGAACGGAGGACCAAAAACAAGAATGACCCTATTTTT
TCCAAGAGGCCGAGGACGAGGTCTATGGACGCCTCTCCAACAGCTCCTCCTACCGTCTCGCCTGCCAAGTCAAAAGCCAAATCACCCAAGGCTGCATCTCTTAAAAATCC
ATTCCCCGGGAGTGGGAACACTGCCAGAGTTCGTAAGCAGAGTTATCTCATCTCACAATACAAGTGGCAGGAGTTCTGTGCTCACCCTCAGGAGGTTGTGGTACCTTTAG
TTCGAGAATTTTACGCCGGCCTGAGGGAGGAAAGCATAAGTATGGCGGTGGTGAGAGCCAAAATGGTCAGCTTCTCTTCTGTAGACATTAACTGGGTGTACAGACTCAAA
GCACCCTTGAATCCAAGAGGGAACGATGTTATCAGGAACCTCTCGGCCAACCAAATGAAAGAAGCATTGAAACTAGTGGCCAACAAGGGAGTTAAGTGGAAAAAATCCCA
AATGAAGGTGAAGACTCTAGTGCCAAGCGATCTAAAGTCAGAATCGGCAGTTTGGCTTCACTTTCTGAAGAATAGATTGATGCCAACCACCCACGATAGTACCATCTCAG
TAGATAGAGTTATGCTACTCTACTGTATTATGAAGTGGTTGGAGATCAATATTGGGAGCATAATCAGGGATGAAATTCTAGCCTGTGGAAGAAAAAGAGCAGGTAAACTT
TTCTTTGGATCACTCATCACCCAACTTTGTCAAAGGGTGAAGATAGTTCCTGGCAAGGACGAGGAGCGTCATTTCTTCAAGCTTACCATTGAATTGTCCTTAATCGGGAA
GCTTCAACATAAGAGCCTCCAAAGGAAAGACAAAGCCTCCACATCTCAGGACACTCCACCATCAGGGCCGAGCATGGCTTCTCCATCCCAACACACTCCTTTTACAGGGC
CCTCACCATCGTCTGAAGCCCTATCCATTGCCTACCGTCAGCTAGATCAAATCAAGGAAAACCTGAAGACATATTGGGCATATGCAAAGGAGAGAGATAAAGCCATTAGA
GAGTTTTATCTCTCTATCGCCCCGAGTACTGCTCTGGTCTTTCCCAATTTCCCTCAATCGTCGATGCCTCAAGAAGATAAGGACTCTGATGAAGAAGAAGATGAGAATAA
TGATGATGAATATGAAGAGAAAGAGAGTTCCTCAAACGAGGACTAG
Protein sequenceShow/hide protein sequence
MKNNPKSSSSLKITRSQSAQTAQEAEAHAQRQEEQPETPMQGTRRTRCTGFSLAIVDQGTSRVQTSSSSTISATLRENLSSSQPRRSTRATAVHKTQKPTTRQYRKRSQE
WFSMIRAMGAQRRAAVEEEVNRQDEEEAAKAAGSSRQREASTGKHSKPSTNPSSSCRNKPFVTYSARRRSPKKVVPKKPLVIESLKVARMPPDVFEGIICQAVAKALVIA
EGYKAEQEALKDIEAEREMENQHMREEDEGARERDLEEERKKEEEKPREKQRRKELKKDEERRKEAEDFIAAFEPLHKAQSEAEMLQGKEEKAQQGPTEGNSEKGKEREV
EDEGQNVTASGSHSEEGQRTTTEAQPADEVFEPLFKYDPPADDSTSSREKRDEKKENKEAETSSDSETESDSEIKELDDDQVPISAALTRKRRREIKVERRTKNKNDPIF
SKRPRTRSMDASPTAPPTVSPAKSKAKSPKAASLKNPFPGSGNTARVRKQSYLISQYKWQEFCAHPQEVVVPLVREFYAGLREESISMAVVRAKMVSFSSVDINWVYRLK
APLNPRGNDVIRNLSANQMKEALKLVANKGVKWKKSQMKVKTLVPSDLKSESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKWLEINIGSIIRDEILACGRKRAGKL
FFGSLITQLCQRVKIVPGKDEERHFFKLTIELSLIGKLQHKSLQRKDKASTSQDTPPSGPSMASPSQHTPFTGPSPSSEALSIAYRQLDQIKENLKTYWAYAKERDKAIR
EFYLSIAPSTALVFPNFPQSSMPQEDKDSDEEEDENNDDEYEEKESSSNED