; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg032855 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg032855
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionNucleolar protein 58-like
Genome locationscaffold11:15568892..15576662
RNA-Seq ExpressionSpg032855
SyntenySpg032855
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]1.0e-2536.89Show/hide
Query:  ALPEFEAVVPLVREFYAGLREESISMAVVRGKMVSLSSVDINRVYRLKAPLNSRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPES
        A PE + +VPLVREFYA L +   +   VRG  VS S   IN V+ L  P++   ++ I N +   +   L+ VA  G +W  S     T + S L P +
Subjt:  ALPEFEAVVPLVREFYAGLREESISMAVVRGKMVSLSSVDINRVYRLKAPLNSRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPES

Query:  TVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLQINIGNIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPDKDEECHFFKPTIDLSLIGKLQQNS
         VW HFLK+ L+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  ++ G LFF SLIT+LC+  +     +EE       ID   + ++ Q  
Subjt:  TVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLQINIGNIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPDKDEECHFFKPTIDLSLIGKLQQNS

Query:  IQRKDKASTSQATPPSGPSMASPSQ
           +    ++Q    S P+ AS S+
Subjt:  IQRKDKASTSQATPPSGPSMASPSQ

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]6.7e-3033.02Show/hide
Query:  ALPEFEAVVPLVREFYAGLREESISMAVVRGKMVSLSSVDINRVYRLKAPLNSRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPES
        A PE + +VPLVREFYA L +   +   VRG  VS S   IN V+ L  P++   ++ I+N + + +   L+ VA  G +W  S     T + S L P +
Subjt:  ALPEFEAVVPLVREFYAGLREESISMAVVRGKMVSLSSVDINRVYRLKAPLNSRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPES

Query:  TVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLQINIGNIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPDKDEECHFFKPTIDLSLIGKLQQNS
         VW HFLK+RL+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  ++ G LFF SLIT+LC+  +     +EE       ID   + ++ Q  
Subjt:  TVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLQINIGNIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPDKDEECHFFKPTIDLSLIGKLQQNS

Query:  IQRKDKASTSQATPPSGPSMASPSQHTSFTGPSPSSEALAISYRQLDQ---------IKENLKTYWAYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLP
           +    ++Q    S P+ AS ++          +    +S +++ Q           +  + +WAY+KERD A+++   +      P FP FPQ +L 
Subjt:  IQRKDKASTSQATPPSGPSMASPSQHTSFTGPSPSSEALAISYRQLDQ---------IKENLKTYWAYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLP

Query:  QEEQDSDNEEDDENDEENDEE
           +D D E + E+D++   E
Subjt:  QEEQDSDNEEDDENDEENDEE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]1.9e-2438.37Show/hide
Query:  ALPEFEAVVPLVREFYAGLREESISMAVVRGKMVSLSSVDINRVYRLKAPLNSRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPES
        A PE + +VPLVREFY  +         +RG  V LS   IN ++ L  P++   ++ + + +  ++   L+ VA  G +W  S     T + S L P +
Subjt:  ALPEFEAVVPLVREFYAGLREESISMAVVRGKMVSLSSVDINRVYRLKAPLNSRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPES

Query:  TVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLQINIGNIIRDEILACGRKRAGKLFFGSLITQLCQRVK
         VW HFLK+RL+PTTH  T+S + V LLY ++ G  IN+G +I  EI AC  +++G LFF SLIT +C+  +
Subjt:  TVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLQINIGNIIRDEILACGRKRAGKLFFGSLITQLCQRVK

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]5.2e-3033.88Show/hide
Query:  VPLVREFYAGLREESISMAVVRGKMVSLSSVDINRVYRLKAPLNSRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESTVWLHFLK
        +PLVREFYA L +   +   VRG  VS S   IN V+ L  P++   ++ I N +  ++   L+ VA  G +W  S     T + S L P + VW HFLK
Subjt:  VPLVREFYAGLREESISMAVVRGKMVSLSSVDINRVYRLKAPLNSRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESTVWLHFLK

Query:  NRLMPTTHDNTISVDRVMLLYCIMKGLQINIGNIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPDKDEECHFFKPTIDLSLIGKLQQNSIQRKDKAS
        +RL+PTTH   +S DR++LL+ ++ G  IN+G +I  EI AC  ++ G LFF SLIT+LC+    + + +E+ H     ID   + ++ Q       +  
Subjt:  NRLMPTTHDNTISVDRVMLLYCIMKGLQINIGNIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPDKDEECHFFKPTIDLSLIGKLQQNSIQRKDKAS

Query:  TSQATPPSGPSMASPSQHTSFTGPSPSSEALAISYRQLDQIKENLKTYWAYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEEQDSDNEEDDENDEE
        T     PS    A+ S   +        +AL     Q +   +  + +WAY+KERD A+++   +      P FP FPQ +L    QD D E + E+D++
Subjt:  TSQATPPSGPSMASPSQHTSFTGPSPSSEALAISYRQLDQIKENLKTYWAYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEEQDSDNEEDDENDEE

Query:  NDEE
           E
Subjt:  NDEE

XP_024971944.1 uncharacterized protein LOC112510826 [Cynara cardunculus var. scolymus]3.6e-2333.66Show/hide
Query:  REFYAGLREESISMAVVRGKMVSLSSVDINRVYRLKAPLNSRGNDVIRNPSAKQMKEALKLVANKGVQW-KESQTKVKTLVPSDLKPESTVWLHFLKNRL
        REFYA     S +   VRG  VS+ +  IN++  L  P  +    + ++ S  +++E  + +   G++W   S   ++T   S+LKP + VW++F++  L
Subjt:  REFYAGLREESISMAVVRGKMVSLSSVDINRVYRLKAPLNSRGNDVIRNPSAKQMKEALKLVANKGVQW-KESQTKVKTLVPSDLKPESTVWLHFLKNRL

Query:  MPTTHDNTISVDRVMLLYCIMKGLQINIGNIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPDKDE----ECHFFKPTIDLSLIGKLQQNSIQRKDKA
         PTTHD++ISV++++LLYC++ G  IN+G ++   IL C ++R GKLFF SLI +L  +  +    D+    EC   K TID+  + KL++ S + ++  
Subjt:  MPTTHDNTISVDRVMLLYCIMKGLQINIGNIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPDKDE----ECHFFKPTIDLSLIGKLQQNSIQRKDKA

Query:  STSQA
          ++A
Subjt:  STSQA

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)4.9e-2636.89Show/hide
Query:  ALPEFEAVVPLVREFYAGLREESISMAVVRGKMVSLSSVDINRVYRLKAPLNSRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPES
        A PE + +VPLVREFYA L +   +   VRG  VS S   IN V+ L  P++   ++ I N +   +   L+ VA  G +W  S     T + S L P +
Subjt:  ALPEFEAVVPLVREFYAGLREESISMAVVRGKMVSLSSVDINRVYRLKAPLNSRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPES

Query:  TVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLQINIGNIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPDKDEECHFFKPTIDLSLIGKLQQNS
         VW HFLK+ L+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  ++ G LFF SLIT+LC+  +     +EE       ID   + ++ Q  
Subjt:  TVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLQINIGNIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPDKDEECHFFKPTIDLSLIGKLQQNS

Query:  IQRKDKASTSQATPPSGPSMASPSQ
           +    ++Q    S P+ AS S+
Subjt:  IQRKDKASTSQATPPSGPSMASPSQ

A0A2P5BCG4 Uncharacterized protein (Fragment)3.3e-3033.02Show/hide
Query:  ALPEFEAVVPLVREFYAGLREESISMAVVRGKMVSLSSVDINRVYRLKAPLNSRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPES
        A PE + +VPLVREFYA L +   +   VRG  VS S   IN V+ L  P++   ++ I+N + + +   L+ VA  G +W  S     T + S L P +
Subjt:  ALPEFEAVVPLVREFYAGLREESISMAVVRGKMVSLSSVDINRVYRLKAPLNSRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPES

Query:  TVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLQINIGNIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPDKDEECHFFKPTIDLSLIGKLQQNS
         VW HFLK+RL+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  ++ G LFF SLIT+LC+  +     +EE       ID   + ++ Q  
Subjt:  TVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLQINIGNIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPDKDEECHFFKPTIDLSLIGKLQQNS

Query:  IQRKDKASTSQATPPSGPSMASPSQHTSFTGPSPSSEALAISYRQLDQ---------IKENLKTYWAYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLP
           +    ++Q    S P+ AS ++          +    +S +++ Q           +  + +WAY+KERD A+++   +      P FP FPQ +L 
Subjt:  IQRKDKASTSQATPPSGPSMASPSQHTSFTGPSPSSEALAISYRQLDQ---------IKENLKTYWAYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLP

Query:  QEEQDSDNEEDDENDEENDEE
           +D D E + E+D++   E
Subjt:  QEEQDSDNEEDDENDEENDEE

A0A2P5BNT0 Uncharacterized protein (Fragment)2.3e-2334.03Show/hide
Query:  LVREFYAGLREESISMAVVRGKMVSLSSVDINRVYRLKAPLNSRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESTVWLHFLKNR
        LVREFY  L         VRG  V LS+  IN +Y L   L    ++ + + +  ++   L+ VA  G +W  S   V T + S L P + +W HFLK+R
Subjt:  LVREFYAGLREESISMAVVRGKMVSLSSVDINRVYRLKAPLNSRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESTVWLHFLKNR

Query:  LMPTTHDNTISVDRVMLLYCIMKGLQINIGNIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPDKDEECHFFKPTIDLSLIGKLQQNSIQRKDKASTS
        L+PTTH   +S +RV+LLY ++ G  IN+G +I  EI AC  +++G LFF SLI ++C+  +     +EE       ID   + ++ Q     +  A  S
Subjt:  LMPTTHDNTISVDRVMLLYCIMKGLQINIGNIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPDKDEECHFFKPTIDLSLIGKLQQNSIQRKDKASTS

Query:  QATPPSGPSMASPSQHTSFTGPSPSSEALAISYRQLDQ
             S P+  S S  TS T     S    IS +++ Q
Subjt:  QATPPSGPSMASPSQHTSFTGPSPSSEALAISYRQLDQ

A0A2P5DAQ2 Uncharacterized protein9.2e-2538.37Show/hide
Query:  ALPEFEAVVPLVREFYAGLREESISMAVVRGKMVSLSSVDINRVYRLKAPLNSRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPES
        A PE + +VPLVREFY  +         +RG  V LS   IN ++ L  P++   ++ + + +  ++   L+ VA  G +W  S     T + S L P +
Subjt:  ALPEFEAVVPLVREFYAGLREESISMAVVRGKMVSLSSVDINRVYRLKAPLNSRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPES

Query:  TVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLQINIGNIIRDEILACGRKRAGKLFFGSLITQLCQRVK
         VW HFLK+RL+PTTH  T+S + V LLY ++ G  IN+G +I  EI AC  +++G LFF SLIT +C+  +
Subjt:  TVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLQINIGNIIRDEILACGRKRAGKLFFGSLITQLCQRVK

A0A2P5DXM3 Uncharacterized protein2.5e-3033.88Show/hide
Query:  VPLVREFYAGLREESISMAVVRGKMVSLSSVDINRVYRLKAPLNSRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESTVWLHFLK
        +PLVREFYA L +   +   VRG  VS S   IN V+ L  P++   ++ I N +  ++   L+ VA  G +W  S     T + S L P + VW HFLK
Subjt:  VPLVREFYAGLREESISMAVVRGKMVSLSSVDINRVYRLKAPLNSRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESTVWLHFLK

Query:  NRLMPTTHDNTISVDRVMLLYCIMKGLQINIGNIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPDKDEECHFFKPTIDLSLIGKLQQNSIQRKDKAS
        +RL+PTTH   +S DR++LL+ ++ G  IN+G +I  EI AC  ++ G LFF SLIT+LC+    + + +E+ H     ID   + ++ Q       +  
Subjt:  NRLMPTTHDNTISVDRVMLLYCIMKGLQINIGNIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPDKDEECHFFKPTIDLSLIGKLQQNSIQRKDKAS

Query:  TSQATPPSGPSMASPSQHTSFTGPSPSSEALAISYRQLDQIKENLKTYWAYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEEQDSDNEEDDENDEE
        T     PS    A+ S   +        +AL     Q +   +  + +WAY+KERD A+++   +      P FP FPQ +L    QD D E + E+D++
Subjt:  TSQATPPSGPSMASPSQHTSFTGPSPSSEALAISYRQLDQIKENLKTYWAYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEEQDSDNEEDDENDEE

Query:  NDEE
           E
Subjt:  NDEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAAGGGGTGTCCACTCCTTGACATTGATCTGGAGATAGAAAGAACCTTTCGTCGTCGAAGGAAGGAAAAAAGACGAAAGAGAAAGGAACAACAAGAGTTGAGCGC
ACAGGAACCTCTAGAAGAAGCTTCTTACATACAAGAGTTTCCAATGGAACCTCCTGGAGTCGATCCTCAAGTTGATCCACAGAATCGTGGAAGAGAGCAAAATGGTGGGA
GAATTCCTCCTGTTCCTCCAGTTCCACCGGTGCCACAACAAGAAAACCCAATTTACATTGCTGAAGACCGCAATAGAGCAATAATGGATTACACTTTACCAGTTTTTCAG
ACATTAAGGGGAACACCTTTTCCTGGGGATTTACCTGGGAGTTTGAAAAGTCCTTACTATTTTTGGAACAAACCATTCGTTACCTATAGTGCAAGGAAGAGGAGTCCTAA
GAAAGTTGTGCCCGACAAGCCGCTTGTTATTGAGCCTCTCAAAGTAGCAAGAATGCCACCAGACGTGTTCGAGGACATAATTCGCCAAGCTGTGGCAAAGGCTCTCGTGA
TTGCTGAAGGGTATAAGGCTGAACAGGAAGCCTTGAAGGATATTGAGGCTGAGAGAGAGATGGAAAATCAGCACATGATGGAGGAAGATGAGTCTGTGAGAAAAAGAGAT
CTTGAAAAAGAGAAAAGACAAGAGGCCGAGAGGGCCTTAGAAGCTGAAAAAGAGAGAAAATTAGATGAAGACCTCATGAGAGTAGCAGCTGATTTGCAACTCCTTGAGGA
AGAAAAACACAGAAGGGAAGAGTTGAAAGAAGACGAGGAAAGAAGGAAGGAAGCTGAAGACTTCCTTGCAGCTTTTGAGCCACTCCACAAGGCTCAAAAAGGCCTAGCAG
AGGCCACTGAAGTTCAGCCTGCTGATGAGGTTTTCGAACCTCTATTCAAAGATGACCCACCAACAGCTGATAGCACCTCTTCGGGAGAGAAGAGGGATGAAGAAAAAAGC
AAGGAGGCCAAGACCGCCAGTGACTCTGAAACAGAATCCGATTCAGAGATCAAGGAACTGGATGATGACCAAGTTCCTATCTCTGCGACATTGAGGAGAAAGAGAAAAAG
AGAGATCAAAGCTGAACGGAGGACTAAGAACAAAAATGACCCGATATTTGCCAAGAGGCCGAGGACAAGGTCCATGGACGCCTCTCCTACAGTTCCTCCTACCATCTCAC
CCGCCAAGCCAAAGGACAAATTATCTAAGGCTGCATTGATAGCGAGCAGAAATGTCGCTATTTTAGAACGGATGGAGATCATGAAGAAAAGGGATTTCCTTAATGAGAAG
GGATTCTCTAACAGAGCAGGAGCACTACCAGAGTTCGAGGCAGTAGTGCCTTTAGTTCGTGAATTTTATGCCGGCCTGAGGGAGGAAAGTATAAGTATGGCGGTGGTGAG
AGGCAAGATGGTTAGCTTATCTTCAGTAGACATTAACAGGGTGTACAGACTCAAAGCACCCTTGAATTCAAGAGGGAACGATGTTATCAGGAACCCCTCGGCCAAGCAGA
TGAAGGAAGCATTAAAACTCGTGGCCAACAAGGGAGTTCAGTGGAAAGAGTCTCAGACGAAGGTGAAGACTCTAGTGCCAAGCGATCTAAAGCCAGAATCGACAGTTTGG
CTTCACTTTCTGAAGAACCGCTTGATGCCAACCACCCACGACAACACGATCTCAGTGGATAGAGTTATGCTACTCTATTGCATTATGAAGGGGTTGCAGATCAACATTGG
GAACATAATTAGGGATGAGATTCTAGCCTGTGGAAGAAAAAGAGCAGGTAAACTTTTCTTTGGATCACTCATCACCCAGCTTTGCCAGAGGGTGAAGATAGTTCCAGACA
AGGACGAGGAGTGTCATTTCTTCAAGCCGACCATTGACCTGTCCTTGATCGGGAAGCTCCAACAGAATAGCATCCAAAGGAAAGATAAAGCCTCCACATCTCAGGCCACT
CCACCATCAGGGCCGAGCATGGCTTCTCCATCCCAGCACACTTCTTTTACAGGGCCCTCACCATCATCGGAAGCCCTAGCTATTTCCTACCGCCAGCTTGATCAAATCAA
GGAAAACCTGAAGACATATTGGGCATATGCAAAAGAGAGGGATGAAGCCATTAGAGAGTTCTATCTCTCTATCGCCCCGAGTATTGCTCCGGTCTTTCCCAATTTCCCTC
AATCGCTACTGCCTCAAGAAGAACAGGATTCTGATAATGAAGAAGATGATGAGAATGATGAAGAAAATGATGAAGAGAAAGAGAGTTCCTCGAACGAGGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGAACAAGGGGTGTCCACTCCTTGACATTGATCTGGAGATAGAAAGAACCTTTCGTCGTCGAAGGAAGGAAAAAAGACGAAAGAGAAAGGAACAACAAGAGTTGAGCGC
ACAGGAACCTCTAGAAGAAGCTTCTTACATACAAGAGTTTCCAATGGAACCTCCTGGAGTCGATCCTCAAGTTGATCCACAGAATCGTGGAAGAGAGCAAAATGGTGGGA
GAATTCCTCCTGTTCCTCCAGTTCCACCGGTGCCACAACAAGAAAACCCAATTTACATTGCTGAAGACCGCAATAGAGCAATAATGGATTACACTTTACCAGTTTTTCAG
ACATTAAGGGGAACACCTTTTCCTGGGGATTTACCTGGGAGTTTGAAAAGTCCTTACTATTTTTGGAACAAACCATTCGTTACCTATAGTGCAAGGAAGAGGAGTCCTAA
GAAAGTTGTGCCCGACAAGCCGCTTGTTATTGAGCCTCTCAAAGTAGCAAGAATGCCACCAGACGTGTTCGAGGACATAATTCGCCAAGCTGTGGCAAAGGCTCTCGTGA
TTGCTGAAGGGTATAAGGCTGAACAGGAAGCCTTGAAGGATATTGAGGCTGAGAGAGAGATGGAAAATCAGCACATGATGGAGGAAGATGAGTCTGTGAGAAAAAGAGAT
CTTGAAAAAGAGAAAAGACAAGAGGCCGAGAGGGCCTTAGAAGCTGAAAAAGAGAGAAAATTAGATGAAGACCTCATGAGAGTAGCAGCTGATTTGCAACTCCTTGAGGA
AGAAAAACACAGAAGGGAAGAGTTGAAAGAAGACGAGGAAAGAAGGAAGGAAGCTGAAGACTTCCTTGCAGCTTTTGAGCCACTCCACAAGGCTCAAAAAGGCCTAGCAG
AGGCCACTGAAGTTCAGCCTGCTGATGAGGTTTTCGAACCTCTATTCAAAGATGACCCACCAACAGCTGATAGCACCTCTTCGGGAGAGAAGAGGGATGAAGAAAAAAGC
AAGGAGGCCAAGACCGCCAGTGACTCTGAAACAGAATCCGATTCAGAGATCAAGGAACTGGATGATGACCAAGTTCCTATCTCTGCGACATTGAGGAGAAAGAGAAAAAG
AGAGATCAAAGCTGAACGGAGGACTAAGAACAAAAATGACCCGATATTTGCCAAGAGGCCGAGGACAAGGTCCATGGACGCCTCTCCTACAGTTCCTCCTACCATCTCAC
CCGCCAAGCCAAAGGACAAATTATCTAAGGCTGCATTGATAGCGAGCAGAAATGTCGCTATTTTAGAACGGATGGAGATCATGAAGAAAAGGGATTTCCTTAATGAGAAG
GGATTCTCTAACAGAGCAGGAGCACTACCAGAGTTCGAGGCAGTAGTGCCTTTAGTTCGTGAATTTTATGCCGGCCTGAGGGAGGAAAGTATAAGTATGGCGGTGGTGAG
AGGCAAGATGGTTAGCTTATCTTCAGTAGACATTAACAGGGTGTACAGACTCAAAGCACCCTTGAATTCAAGAGGGAACGATGTTATCAGGAACCCCTCGGCCAAGCAGA
TGAAGGAAGCATTAAAACTCGTGGCCAACAAGGGAGTTCAGTGGAAAGAGTCTCAGACGAAGGTGAAGACTCTAGTGCCAAGCGATCTAAAGCCAGAATCGACAGTTTGG
CTTCACTTTCTGAAGAACCGCTTGATGCCAACCACCCACGACAACACGATCTCAGTGGATAGAGTTATGCTACTCTATTGCATTATGAAGGGGTTGCAGATCAACATTGG
GAACATAATTAGGGATGAGATTCTAGCCTGTGGAAGAAAAAGAGCAGGTAAACTTTTCTTTGGATCACTCATCACCCAGCTTTGCCAGAGGGTGAAGATAGTTCCAGACA
AGGACGAGGAGTGTCATTTCTTCAAGCCGACCATTGACCTGTCCTTGATCGGGAAGCTCCAACAGAATAGCATCCAAAGGAAAGATAAAGCCTCCACATCTCAGGCCACT
CCACCATCAGGGCCGAGCATGGCTTCTCCATCCCAGCACACTTCTTTTACAGGGCCCTCACCATCATCGGAAGCCCTAGCTATTTCCTACCGCCAGCTTGATCAAATCAA
GGAAAACCTGAAGACATATTGGGCATATGCAAAAGAGAGGGATGAAGCCATTAGAGAGTTCTATCTCTCTATCGCCCCGAGTATTGCTCCGGTCTTTCCCAATTTCCCTC
AATCGCTACTGCCTCAAGAAGAACAGGATTCTGATAATGAAGAAGATGATGAGAATGATGAAGAAAATGATGAAGAGAAAGAGAGTTCCTCGAACGAGGACTAG
Protein sequenceShow/hide protein sequence
MNKGCPLLDIDLEIERTFRRRRKEKRRKRKEQQELSAQEPLEEASYIQEFPMEPPGVDPQVDPQNRGREQNGGRIPPVPPVPPVPQQENPIYIAEDRNRAIMDYTLPVFQ
TLRGTPFPGDLPGSLKSPYYFWNKPFVTYSARKRSPKKVVPDKPLVIEPLKVARMPPDVFEDIIRQAVAKALVIAEGYKAEQEALKDIEAEREMENQHMMEEDESVRKRD
LEKEKRQEAERALEAEKERKLDEDLMRVAADLQLLEEEKHRREELKEDEERRKEAEDFLAAFEPLHKAQKGLAEATEVQPADEVFEPLFKDDPPTADSTSSGEKRDEEKS
KEAKTASDSETESDSEIKELDDDQVPISATLRRKRKREIKAERRTKNKNDPIFAKRPRTRSMDASPTVPPTISPAKPKDKLSKAALIASRNVAILERMEIMKKRDFLNEK
GFSNRAGALPEFEAVVPLVREFYAGLREESISMAVVRGKMVSLSSVDINRVYRLKAPLNSRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESTVW
LHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLQINIGNIIRDEILACGRKRAGKLFFGSLITQLCQRVKIVPDKDEECHFFKPTIDLSLIGKLQQNSIQRKDKASTSQAT
PPSGPSMASPSQHTSFTGPSPSSEALAISYRQLDQIKENLKTYWAYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEEQDSDNEEDDENDEENDEEKESSSNED