; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg019481 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg019481
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionNucleolar protein 58-like
Genome locationscaffold1:41072198..41074822
RNA-Seq ExpressionSpg019481
SyntenySpg019481
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB49850.1 hypothetical protein L484_000844 [Morus notabilis]1.5e-2226.39Show/hide
Query:  FAKRSWTGSMDASPAAPPTVSPAKSKAKSPKAASPKNPFLEVFRDVNFQEMMEIMRKKDFLNDKGF---SNRAGALPEFVSKVISQYKWQEFCAHPQEGV
        F+   W  ++D + AA P        + S +  S    F++   +  ++E    +  ++ + +KGF    +     P F+S VI    WQ FC HP + +
Subjt:  FAKRSWTGSMDASPAAPPTVSPAKSKAKSPKAASPKNPFLEVFRDVNFQEMMEIMRKKDFLNDKGF---SNRAGALPEFVSKVISQYKWQEFCAHPQEGV

Query:  VPLVREFYANLREESISMAVVRGKMVRFSSVDINTVYRIKAPLHPKGND----VISNPSAKQMREALKLVANKGVQWKESQTKVKTLVPNDLKPESAMWL
        VPLV+EFYANL+ +  +   V    + F+S  IN V  I     P  +D    +I++   +Q++E LK +A  G QW  S     T   ++L+P + +W 
Subjt:  VPLVREFYANLREESISMAVVRGKMVRFSSVDINTVYRIKAPLHPKGND----VISNPSAKQMREALKLVANKGVQWKESQTKVKTLVPNDLKPESAMWL

Query:  HFLKKRLMPTTHDSTISVDRVMLLYCIMTGLEINIGSIIREEILAYGRK------------------QAVSGKDEKRHFFKPTIDLSLIVKLRQNSLQRK
        HFL  RL+ +TH  TIS +R +LLY ++ G  IN+G +I ++I A   K                         E R      +DL  I ++     ++ 
Subjt:  HFLKKRLMPTTHDSTISVDRVMLLYCIMTGLEINIGSIIREEILAYGRK------------------QAVSGKDEKRHFFKPTIDLSLIVKLRQNSLQRK

Query:  DKA-STSQATPPSGPNMDFPSQHTSFTGPSPSSEALAIAYCQLE-----------QIRDNMRTYWAYAKERDEAIREFY
        +K     +   PS P+    + HT     + S E L       E           Q ++ +  +W Y+++RD A+++ +
Subjt:  DKA-STSQATPPSGPNMDFPSQHTSFTGPSPSSEALAIAYCQLE-----------QIRDNMRTYWAYAKERDEAIREFY

EXB53755.1 hypothetical protein L484_022412 [Morus notabilis]2.3e-2334.29Show/hide
Query:  PEFVSKVISQYKWQEFCAHPQEGVVPLVREFYANLREESISMAVVRGKMVRFSSVDINTVYRIKAPLHPKGNDVISNPSAKQMREALKLVANKGVQWKES
        P F+++VI Q+ W++FC HP   +VPLVREFYANL + +     V+   V F++  IN+++ ++  +  +  D  S  + +Q+   L  VA +G  W+ S
Subjt:  PEFVSKVISQYKWQEFCAHPQEGVVPLVREFYANLREESISMAVVRGKMVRFSSVDINTVYRIKAPLHPKGNDVISNPSAKQMREALKLVANKGVQWKES

Query:  QTKVKTLVPNDLKPESAMWLHFLKKRLMPTTHDSTISVDRVMLLYCIMTGLEINIGSIIREEILAYGRKQAVSGKDEKRHFFKPTIDLSLIVKLRQNSLQ
             T +  +LK  + +W HFL  R MP+TH  T++ DRV+LLY I+TG+ +NI  I  +EI      +A S   ++   + P++   L   L+ N   
Subjt:  QTKVKTLVPNDLKPESAMWLHFLKKRLMPTTHDSTISVDRVMLLYCIMTGLEINIGSIIREEILAYGRKQAVSGKDEKRHFFKPTIDLSLIVKLRQNSLQ

Query:  RKDKASTSQA
         KD+A    A
Subjt:  RKDKASTSQA

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]9.2e-2840Show/hide
Query:  DKGF----SNRAGALPEFVSKVISQYKWQEFCAHPQEGVVPLVREFYANLREESISMAVVRGKMVRFSSVDINTVYRIKAPLHPKGNDVISNPSAKQMRE
        +KGF    S   G LP F+++VI+Q+ W++FCAHP++ +VPLVREFYANL +   +   VRG  V +S   IN V+ +  P+  + ++ I N +   +  
Subjt:  DKGF----SNRAGALPEFVSKVISQYKWQEFCAHPQEGVVPLVREFYANLREESISMAVVRGKMVRFSSVDINTVYRIKAPLHPKGNDVISNPSAKQMRE

Query:  ALKLVANKGVQWKESQTKVKTLVPNDLKPESAMWLHFLKKRLMPTTHDSTISVDRVMLLYCIMTGLEINIGSIIREEILA
         L+ VA  G +W  S     T + + L P + +W HFLK  L+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI A
Subjt:  ALKLVANKGVQWKESQTKVKTLVPNDLKPESAMWLHFLKKRLMPTTHDSTISVDRVMLLYCIMTGLEINIGSIIREEILA

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]6.8e-3131.1Show/hide
Query:  DKGF----SNRAGALPEFVSKVISQYKWQEFCAHPQEGVVPLVREFYANLREESISMAVVRGKMVRFSSVDINTVYRIKAPLHPKGNDVISNPSAKQMRE
        +KGF    S   G LP F+++VI+Q+ W++FCAHP++ +VPLVREFYANL +   +   VRG  V +S   IN V+ +  P+  + ++ I N + + +  
Subjt:  DKGF----SNRAGALPEFVSKVISQYKWQEFCAHPQEGVVPLVREFYANLREESISMAVVRGKMVRFSSVDINTVYRIKAPLHPKGNDVISNPSAKQMRE

Query:  ALKLVANKGVQWKESQTKVKTLVPNDLKPESAMWLHFLKKRLMPTTHDSTISVDRVMLLYCIMTGLEINIGSIIREEILAYGRKQAVS------------
         L+ VA  G +W  S     T + + L P + +W HFLK RL+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI A   ++  +            
Subjt:  ALKLVANKGVQWKESQTKVKTLVPNDLKPESAMWLHFLKKRLMPTTHDSTISVDRVMLLYCIMTGLEINIGSIIREEILAYGRKQAVS------------

Query:  -------GKDEKRHFFKPTIDLSLIVKLRQN--SLQRKDKASTSQATPPSG-PNMDFPSQHTSFTGPSPSSEALAIAYCQ-LEQIRDNMRTYWAYAKERD
                 +EK H     ID   + ++ Q   +   +  +S+  AT  S   N D   Q  +        E         L+      + +WAY+KERD
Subjt:  -------GKDEKRHFFKPTIDLSLIVKLRQN--SLQRKDKASTSQATPPSG-PNMDFPSQHTSFTGPSPSSEALAIAYCQ-LEQIRDNMRTYWAYAKERD

Query:  EAIREFYLSIAPSIAPVFPNFPQSLLPQEDKDSDEEEDENDDEE
         A+++   +      P FP FPQ +L   D + + E D++   E
Subjt:  EAIREFYLSIAPSIAPVFPNFPQSLLPQEDKDSDEEEDENDDEE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]3.9e-2637.43Show/hide
Query:  PEFVSKVISQYKWQEFCAHPQEGVVPLVREFYANLREESISMAVVRGKMVRFSSVDINTVYRIKAPLHPKGNDVISNPSAKQMREALKLVANKGVQWKES
        P F++ VI Q+ WQ FCAHP++ +VPLVREFY N+         +RG  V  S   INT++ +  P+  + ++ + + +  ++   L+ VA  G +W  S
Subjt:  PEFVSKVISQYKWQEFCAHPQEGVVPLVREFYANLREESISMAVVRGKMVRFSSVDINTVYRIKAPLHPKGNDVISNPSAKQMREALKLVANKGVQWKES

Query:  QTKVKTLVPNDLKPESAMWLHFLKKRLMPTTHDSTISVDRVMLLYCIMTGLEINIGSIIREEILAYGRKQA
             T + + L P + +W HFLK RL+PTTH  T+S + V LLY ++TG  IN+G +I  EI A   +++
Subjt:  QTKVKTLVPNDLKPESAMWLHFLKKRLMPTTHDSTISVDRVMLLYCIMTGLEINIGSIIREEILAYGRKQA

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)4.5e-2840Show/hide
Query:  DKGF----SNRAGALPEFVSKVISQYKWQEFCAHPQEGVVPLVREFYANLREESISMAVVRGKMVRFSSVDINTVYRIKAPLHPKGNDVISNPSAKQMRE
        +KGF    S   G LP F+++VI+Q+ W++FCAHP++ +VPLVREFYANL +   +   VRG  V +S   IN V+ +  P+  + ++ I N +   +  
Subjt:  DKGF----SNRAGALPEFVSKVISQYKWQEFCAHPQEGVVPLVREFYANLREESISMAVVRGKMVRFSSVDINTVYRIKAPLHPKGNDVISNPSAKQMRE

Query:  ALKLVANKGVQWKESQTKVKTLVPNDLKPESAMWLHFLKKRLMPTTHDSTISVDRVMLLYCIMTGLEINIGSIIREEILA
         L+ VA  G +W  S     T + + L P + +W HFLK  L+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI A
Subjt:  ALKLVANKGVQWKESQTKVKTLVPNDLKPESAMWLHFLKKRLMPTTHDSTISVDRVMLLYCIMTGLEINIGSIIREEILA

A0A2P5BCG4 Uncharacterized protein (Fragment)3.3e-3131.1Show/hide
Query:  DKGF----SNRAGALPEFVSKVISQYKWQEFCAHPQEGVVPLVREFYANLREESISMAVVRGKMVRFSSVDINTVYRIKAPLHPKGNDVISNPSAKQMRE
        +KGF    S   G LP F+++VI+Q+ W++FCAHP++ +VPLVREFYANL +   +   VRG  V +S   IN V+ +  P+  + ++ I N + + +  
Subjt:  DKGF----SNRAGALPEFVSKVISQYKWQEFCAHPQEGVVPLVREFYANLREESISMAVVRGKMVRFSSVDINTVYRIKAPLHPKGNDVISNPSAKQMRE

Query:  ALKLVANKGVQWKESQTKVKTLVPNDLKPESAMWLHFLKKRLMPTTHDSTISVDRVMLLYCIMTGLEINIGSIIREEILAYGRKQAVS------------
         L+ VA  G +W  S     T + + L P + +W HFLK RL+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI A   ++  +            
Subjt:  ALKLVANKGVQWKESQTKVKTLVPNDLKPESAMWLHFLKKRLMPTTHDSTISVDRVMLLYCIMTGLEINIGSIIREEILAYGRKQAVS------------

Query:  -------GKDEKRHFFKPTIDLSLIVKLRQN--SLQRKDKASTSQATPPSG-PNMDFPSQHTSFTGPSPSSEALAIAYCQ-LEQIRDNMRTYWAYAKERD
                 +EK H     ID   + ++ Q   +   +  +S+  AT  S   N D   Q  +        E         L+      + +WAY+KERD
Subjt:  -------GKDEKRHFFKPTIDLSLIVKLRQN--SLQRKDKASTSQATPPSG-PNMDFPSQHTSFTGPSPSSEALAIAYCQ-LEQIRDNMRTYWAYAKERD

Query:  EAIREFYLSIAPSIAPVFPNFPQSLLPQEDKDSDEEEDENDDEE
         A+++   +      P FP FPQ +L   D + + E D++   E
Subjt:  EAIREFYLSIAPSIAPVFPNFPQSLLPQEDKDSDEEEDENDDEE

A0A2P5DAQ2 Uncharacterized protein1.9e-2637.43Show/hide
Query:  PEFVSKVISQYKWQEFCAHPQEGVVPLVREFYANLREESISMAVVRGKMVRFSSVDINTVYRIKAPLHPKGNDVISNPSAKQMREALKLVANKGVQWKES
        P F++ VI Q+ WQ FCAHP++ +VPLVREFY N+         +RG  V  S   INT++ +  P+  + ++ + + +  ++   L+ VA  G +W  S
Subjt:  PEFVSKVISQYKWQEFCAHPQEGVVPLVREFYANLREESISMAVVRGKMVRFSSVDINTVYRIKAPLHPKGNDVISNPSAKQMREALKLVANKGVQWKES

Query:  QTKVKTLVPNDLKPESAMWLHFLKKRLMPTTHDSTISVDRVMLLYCIMTGLEINIGSIIREEILAYGRKQA
             T + + L P + +W HFLK RL+PTTH  T+S + V LLY ++TG  IN+G +I  EI A   +++
Subjt:  QTKVKTLVPNDLKPESAMWLHFLKKRLMPTTHDSTISVDRVMLLYCIMTGLEINIGSIIREEILAYGRKQA

W9QTD9 Uncharacterized protein1.1e-2334.29Show/hide
Query:  PEFVSKVISQYKWQEFCAHPQEGVVPLVREFYANLREESISMAVVRGKMVRFSSVDINTVYRIKAPLHPKGNDVISNPSAKQMREALKLVANKGVQWKES
        P F+++VI Q+ W++FC HP   +VPLVREFYANL + +     V+   V F++  IN+++ ++  +  +  D  S  + +Q+   L  VA +G  W+ S
Subjt:  PEFVSKVISQYKWQEFCAHPQEGVVPLVREFYANLREESISMAVVRGKMVRFSSVDINTVYRIKAPLHPKGNDVISNPSAKQMREALKLVANKGVQWKES

Query:  QTKVKTLVPNDLKPESAMWLHFLKKRLMPTTHDSTISVDRVMLLYCIMTGLEINIGSIIREEILAYGRKQAVSGKDEKRHFFKPTIDLSLIVKLRQNSLQ
             T +  +LK  + +W HFL  R MP+TH  T++ DRV+LLY I+TG+ +NI  I  +EI      +A S   ++   + P++   L   L+ N   
Subjt:  QTKVKTLVPNDLKPESAMWLHFLKKRLMPTTHDSTISVDRVMLLYCIMTGLEINIGSIIREEILAYGRKQAVSGKDEKRHFFKPTIDLSLIVKLRQNSLQ

Query:  RKDKASTSQA
         KD+A    A
Subjt:  RKDKASTSQA

W9RBS1 Uncharacterized protein7.4e-2326.39Show/hide
Query:  FAKRSWTGSMDASPAAPPTVSPAKSKAKSPKAASPKNPFLEVFRDVNFQEMMEIMRKKDFLNDKGF---SNRAGALPEFVSKVISQYKWQEFCAHPQEGV
        F+   W  ++D + AA P        + S +  S    F++   +  ++E    +  ++ + +KGF    +     P F+S VI    WQ FC HP + +
Subjt:  FAKRSWTGSMDASPAAPPTVSPAKSKAKSPKAASPKNPFLEVFRDVNFQEMMEIMRKKDFLNDKGF---SNRAGALPEFVSKVISQYKWQEFCAHPQEGV

Query:  VPLVREFYANLREESISMAVVRGKMVRFSSVDINTVYRIKAPLHPKGND----VISNPSAKQMREALKLVANKGVQWKESQTKVKTLVPNDLKPESAMWL
        VPLV+EFYANL+ +  +   V    + F+S  IN V  I     P  +D    +I++   +Q++E LK +A  G QW  S     T   ++L+P + +W 
Subjt:  VPLVREFYANLREESISMAVVRGKMVRFSSVDINTVYRIKAPLHPKGND----VISNPSAKQMREALKLVANKGVQWKESQTKVKTLVPNDLKPESAMWL

Query:  HFLKKRLMPTTHDSTISVDRVMLLYCIMTGLEINIGSIIREEILAYGRK------------------QAVSGKDEKRHFFKPTIDLSLIVKLRQNSLQRK
        HFL  RL+ +TH  TIS +R +LLY ++ G  IN+G +I ++I A   K                         E R      +DL  I ++     ++ 
Subjt:  HFLKKRLMPTTHDSTISVDRVMLLYCIMTGLEINIGSIIREEILAYGRK------------------QAVSGKDEKRHFFKPTIDLSLIVKLRQNSLQRK

Query:  DKA-STSQATPPSGPNMDFPSQHTSFTGPSPSSEALAIAYCQLE-----------QIRDNMRTYWAYAKERDEAIREFY
        +K     +   PS P+    + HT     + S E L       E           Q ++ +  +W Y+++RD A+++ +
Subjt:  DKA-STSQATPPSGPNMDFPSQHTSFTGPSPSSEALAIAYCQLE-----------QIRDNMRTYWAYAKERDEAIREFY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAACTCGCCCAAATCATCATCATATCGCAAGATCACTCGATCCCATAGTGCTCAAACTGCCCAAGAAGCTGAAGCACACATTCAACGTCAAGAAGAGCAACCCGA
CGCCCCCATGCACGGCACGAGAAGGACGAGACCTTCGGGTTTCTCACTGGCGATCGTGAACCAAGAACCCGCCGCTCAAACTCCTTCTTCCTCGACAATGCCGGCCACCT
CAAGGGAGAATCCGGGTTCGTCTCAACTGAGAAGGTCCACGCGCGCCAATGTCGTACATAAAACCCAAAAACCCGCAACCCAACAATTCAGAAAACGCTCACGGGAGTGG
TTTTCAATGATCCGAGCGATGGGAGCTCAAAGATGCGCAGCTCTTGAAGAAGAAGCGAATAGGCGAGATGAAGAAGAAGCCGCCAAGGCAGCAGAAAGCTCTCGGCAAAG
AGAGTCTTCAACGGGTAAAGCTTCTGTACCTTCAACTAACCCCTTTTCGTCTTGCAGGAACAAACCATTTGTTACTTACAGTGCAAGGAAGATAAGTCCCAAGAAAGTTG
TGCCCAAAAAGCCGCTTGTAATTGAGCCCCTTAAAACCGCAAGAATGCCCTCAAATGTGTTCGAGGACATAATCCGCCAAGCTGTGGCAAAGGCTCTGGTCATTGTCGAA
GGCTACAAGGCTGAACAAGAAGCCTTGAGGGATATTGACGCTGAAAGAGAAGTTGAAAATCAGCATATGAGGGAAGAGGATGAGGTTGCAAGAAAAAGAGATCTTGAAGA
TGAAAAGAAGAAAGAAAAGGAAAGGCAAGAGGCAGAGAGGGCCAAGTTAGCTGAAGAAGAGGAAATAAAGTTAGGAGAAAACCTCAGGAGGGCAGCAGTTGAATTGCAAC
TTCTTGAGGAAGAAAAACAAAGAAGGGAAAAATTAAAAGAAGATGAGAAAAGAAGAAAGGAAGCCGAAGACTTCCTTGCAGCTTTTGAGCCACTCCACAAGGCTCAAAGT
GAGACTAAGATGCTGCAAGGATTAGAAGAAAAGGCCCATCAGGGGCTAAATGAAGAAAATAAAGAAGAAGAAAAAGAAACAGAAGTAGTGAATGAAGGCCAGAATGCGAC
CGCATCTGGGCCGCATTCTGAAGAAAGCCAGGAAATGGCTACGGAAGCTCAGCCAGCTGATGAGGTTTTGGAACCTCTATTCAAATATGACCCACCAGCAGTTGATAGCA
CCTCTTCGGGAGAGAATAGGGATGAAGAAGAGAAAGAAAGCAAGGAGGCCGAAACCTCCAGTGACTCTGAAACAGAATCTAACTTAGAGATCAAGGAATTGGATGACGAC
CAAGTTCCCATCCCTGCAGCATTGGGGAGAAAGAGAAGAAGAGAGATTAAAGCTAAAAGGAGGACTAAAAACAAGAATGACCCGATATTTGCCAAGAGGTCGTGGACTGG
GTCCATGGACGCCTCTCCTGCAGCTCCTCCTACTGTCTCACCCGCGAAGTCGAAAGCCAAATCTCCTAAGGCTGCATCTCCTAAAAATCCATTCCTCGAAGTATTCAGAG
ATGTAAATTTTCAGGAAATGATGGAGATAATGAGAAAAAAAGATTTCCTCAACGATAAGGGATTCTCTAACAGAGCTGGAGCACTGCCAGAGTTCGTAAGCAAAGTTATC
TCACAGTACAAGTGGCAGGAGTTCTGTGCTCACCCTCAGGAGGGCGTGGTGCCTTTAGTGAGAGAGTTTTATGCCAACCTGAGGGAGGAAAGCATCAGTATGGCGGTAGT
GAGAGGCAAAATGGTCCGCTTCTCTTCAGTAGACATCAACACGGTGTACAGAATCAAAGCACCCTTACATCCAAAAGGGAATGATGTCATTAGCAACCCCTCGGCCAAGC
AGATGAGAGAAGCGCTAAAATTAGTGGCCAACAAGGGAGTTCAGTGGAAAGAATCCCAGACGAAGGTGAAGACTTTAGTGCCAAACGATCTCAAGCCAGAATCAGCAATG
TGGCTTCACTTTCTGAAAAAACGATTGATGCCGACCACCCACGATAGCACCATATCAGTAGATAGAGTCATGCTCCTCTACTGTATTATGACAGGGTTGGAGATCAATAT
TGGGAGCATAATCAGGGAGGAGATTCTAGCCTATGGAAGGAAACAAGCAGTTTCGGGCAAGGACGAGAAGCGTCATTTCTTCAAGCCTACCATTGACCTGTCCTTGATTG
TGAAGCTTCGACAGAATAGCCTCCAAAGGAAAGATAAAGCCTCCACATCTCAGGCCACTCCACCATCAGGGCCGAACATGGATTTTCCATCCCAGCACACTTCTTTTACA
GGGCCCTCACCATCATCTGAAGCCCTAGCCATTGCCTACTGTCAGCTAGAACAAATCAGGGACAACATGAGGACTTATTGGGCATATGCAAAGGAGAGGGATGAAGCCAT
TAGAGAGTTCTATCTCTCTATCGCCCCGAGTATTGCTCCGGTCTTTCCCAATTTCCCTCAGTCGCTCCTGCCTCAAGAAGATAAGGATTCTGATGAAGAAGAAGATGAGA
ATGATGATGAAGAGGAAGAGAGTTCCTCAGATGAGGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGAACTCGCCCAAATCATCATCATATCGCAAGATCACTCGATCCCATAGTGCTCAAACTGCCCAAGAAGCTGAAGCACACATTCAACGTCAAGAAGAGCAACCCGA
CGCCCCCATGCACGGCACGAGAAGGACGAGACCTTCGGGTTTCTCACTGGCGATCGTGAACCAAGAACCCGCCGCTCAAACTCCTTCTTCCTCGACAATGCCGGCCACCT
CAAGGGAGAATCCGGGTTCGTCTCAACTGAGAAGGTCCACGCGCGCCAATGTCGTACATAAAACCCAAAAACCCGCAACCCAACAATTCAGAAAACGCTCACGGGAGTGG
TTTTCAATGATCCGAGCGATGGGAGCTCAAAGATGCGCAGCTCTTGAAGAAGAAGCGAATAGGCGAGATGAAGAAGAAGCCGCCAAGGCAGCAGAAAGCTCTCGGCAAAG
AGAGTCTTCAACGGGTAAAGCTTCTGTACCTTCAACTAACCCCTTTTCGTCTTGCAGGAACAAACCATTTGTTACTTACAGTGCAAGGAAGATAAGTCCCAAGAAAGTTG
TGCCCAAAAAGCCGCTTGTAATTGAGCCCCTTAAAACCGCAAGAATGCCCTCAAATGTGTTCGAGGACATAATCCGCCAAGCTGTGGCAAAGGCTCTGGTCATTGTCGAA
GGCTACAAGGCTGAACAAGAAGCCTTGAGGGATATTGACGCTGAAAGAGAAGTTGAAAATCAGCATATGAGGGAAGAGGATGAGGTTGCAAGAAAAAGAGATCTTGAAGA
TGAAAAGAAGAAAGAAAAGGAAAGGCAAGAGGCAGAGAGGGCCAAGTTAGCTGAAGAAGAGGAAATAAAGTTAGGAGAAAACCTCAGGAGGGCAGCAGTTGAATTGCAAC
TTCTTGAGGAAGAAAAACAAAGAAGGGAAAAATTAAAAGAAGATGAGAAAAGAAGAAAGGAAGCCGAAGACTTCCTTGCAGCTTTTGAGCCACTCCACAAGGCTCAAAGT
GAGACTAAGATGCTGCAAGGATTAGAAGAAAAGGCCCATCAGGGGCTAAATGAAGAAAATAAAGAAGAAGAAAAAGAAACAGAAGTAGTGAATGAAGGCCAGAATGCGAC
CGCATCTGGGCCGCATTCTGAAGAAAGCCAGGAAATGGCTACGGAAGCTCAGCCAGCTGATGAGGTTTTGGAACCTCTATTCAAATATGACCCACCAGCAGTTGATAGCA
CCTCTTCGGGAGAGAATAGGGATGAAGAAGAGAAAGAAAGCAAGGAGGCCGAAACCTCCAGTGACTCTGAAACAGAATCTAACTTAGAGATCAAGGAATTGGATGACGAC
CAAGTTCCCATCCCTGCAGCATTGGGGAGAAAGAGAAGAAGAGAGATTAAAGCTAAAAGGAGGACTAAAAACAAGAATGACCCGATATTTGCCAAGAGGTCGTGGACTGG
GTCCATGGACGCCTCTCCTGCAGCTCCTCCTACTGTCTCACCCGCGAAGTCGAAAGCCAAATCTCCTAAGGCTGCATCTCCTAAAAATCCATTCCTCGAAGTATTCAGAG
ATGTAAATTTTCAGGAAATGATGGAGATAATGAGAAAAAAAGATTTCCTCAACGATAAGGGATTCTCTAACAGAGCTGGAGCACTGCCAGAGTTCGTAAGCAAAGTTATC
TCACAGTACAAGTGGCAGGAGTTCTGTGCTCACCCTCAGGAGGGCGTGGTGCCTTTAGTGAGAGAGTTTTATGCCAACCTGAGGGAGGAAAGCATCAGTATGGCGGTAGT
GAGAGGCAAAATGGTCCGCTTCTCTTCAGTAGACATCAACACGGTGTACAGAATCAAAGCACCCTTACATCCAAAAGGGAATGATGTCATTAGCAACCCCTCGGCCAAGC
AGATGAGAGAAGCGCTAAAATTAGTGGCCAACAAGGGAGTTCAGTGGAAAGAATCCCAGACGAAGGTGAAGACTTTAGTGCCAAACGATCTCAAGCCAGAATCAGCAATG
TGGCTTCACTTTCTGAAAAAACGATTGATGCCGACCACCCACGATAGCACCATATCAGTAGATAGAGTCATGCTCCTCTACTGTATTATGACAGGGTTGGAGATCAATAT
TGGGAGCATAATCAGGGAGGAGATTCTAGCCTATGGAAGGAAACAAGCAGTTTCGGGCAAGGACGAGAAGCGTCATTTCTTCAAGCCTACCATTGACCTGTCCTTGATTG
TGAAGCTTCGACAGAATAGCCTCCAAAGGAAAGATAAAGCCTCCACATCTCAGGCCACTCCACCATCAGGGCCGAACATGGATTTTCCATCCCAGCACACTTCTTTTACA
GGGCCCTCACCATCATCTGAAGCCCTAGCCATTGCCTACTGTCAGCTAGAACAAATCAGGGACAACATGAGGACTTATTGGGCATATGCAAAGGAGAGGGATGAAGCCAT
TAGAGAGTTCTATCTCTCTATCGCCCCGAGTATTGCTCCGGTCTTTCCCAATTTCCCTCAGTCGCTCCTGCCTCAAGAAGATAAGGATTCTGATGAAGAAGAAGATGAGA
ATGATGATGAAGAGGAAGAGAGTTCCTCAGATGAGGATTAG
Protein sequenceShow/hide protein sequence
MKNSPKSSSYRKITRSHSAQTAQEAEAHIQRQEEQPDAPMHGTRRTRPSGFSLAIVNQEPAAQTPSSSTMPATSRENPGSSQLRRSTRANVVHKTQKPATQQFRKRSREW
FSMIRAMGAQRCAALEEEANRRDEEEAAKAAESSRQRESSTGKASVPSTNPFSSCRNKPFVTYSARKISPKKVVPKKPLVIEPLKTARMPSNVFEDIIRQAVAKALVIVE
GYKAEQEALRDIDAEREVENQHMREEDEVARKRDLEDEKKKEKERQEAERAKLAEEEEIKLGENLRRAAVELQLLEEEKQRREKLKEDEKRRKEAEDFLAAFEPLHKAQS
ETKMLQGLEEKAHQGLNEENKEEEKETEVVNEGQNATASGPHSEESQEMATEAQPADEVLEPLFKYDPPAVDSTSSGENRDEEEKESKEAETSSDSETESNLEIKELDDD
QVPIPAALGRKRRREIKAKRRTKNKNDPIFAKRSWTGSMDASPAAPPTVSPAKSKAKSPKAASPKNPFLEVFRDVNFQEMMEIMRKKDFLNDKGFSNRAGALPEFVSKVI
SQYKWQEFCAHPQEGVVPLVREFYANLREESISMAVVRGKMVRFSSVDINTVYRIKAPLHPKGNDVISNPSAKQMREALKLVANKGVQWKESQTKVKTLVPNDLKPESAM
WLHFLKKRLMPTTHDSTISVDRVMLLYCIMTGLEINIGSIIREEILAYGRKQAVSGKDEKRHFFKPTIDLSLIVKLRQNSLQRKDKASTSQATPPSGPNMDFPSQHTSFT
GPSPSSEALAIAYCQLEQIRDNMRTYWAYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEDKDSDEEEDENDDEEEESSSDED