; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg026904 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg026904
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC111022007
Genome locationscaffold13:27490466..27493094
RNA-Seq ExpressionSpg026904
SyntenySpg026904
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB53755.1 hypothetical protein L484_022412 [Morus notabilis]1.1e-2236.78Show/hide
Query:  PEFVSRVISQYKWQEFCAHPQEAVVPLVREFYDGLREESISMAVVRGKMASFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKES
        P F++RVI Q+ W++FC HP   +VPLVREFY  L + +     V+     F++  IN ++ ++  ++    D     + +Q++  L  VA +G  W+ S
Subjt:  PEFVSRVISQYKWQEFCAHPQEAVVPLVREFYDGLREESISMAVVRGKMASFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKES

Query:  QTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLKINIGSIIRDEILAC--GRKRAG
             T +  +LK  + +W HFL  R MP+TH  T++ DRV+LLY I+ G+ +NI  I   EI AC   RKR G
Subjt:  QTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLKINIGSIIRDEILAC--GRKRAG

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]1.0e-2838.46Show/hide
Query:  MRKRDFLNEKGF----SNRAEALPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYDGLREESISMAVVRGKMASFSSVDINRVYRIKAPLNPRGNDVIRN
        ++ R    EKGF    S     LP F+++VI+Q+ W++FCAHP++ +VPLVREFY  L +   +   VRG   S+S   IN V+ +  P++   ++ I N
Subjt:  MRKRDFLNEKGF----SNRAEALPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYDGLREESISMAVVRGKMASFSSVDINRVYRIKAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLKINIGSIIRDEILACGRKRAG
         +   +   L+ VA  G +W  S     T + S L P + VW HFLK+ L+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  ++ G
Subjt:  PSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLKINIGSIIRDEILACGRKRAG

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]1.1e-3030.2Show/hide
Query:  MRKRDFLNEKGF----SNRAEALPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYDGLREESISMAVVRGKMASFSSVDINRVYRIKAPLNPRGNDVIRN
        ++ R    EKGF    S     LP F+++VI+Q+ W++FCAHP++ +VPLVREFY  L +   +   VRG   S+S   IN V+ +  P++   ++ I+N
Subjt:  MRKRDFLNEKGF----SNRAEALPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYDGLREESISMAVVRGKMASFSSVDINRVYRIKAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLKINIGSIIRDEILACGRKRAG-----
         + + +   L+ VA  G +W  S     T + S L P + VW HFLK+RL+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  ++ G     
Subjt:  PSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLKINIGSIIRDEILACGRKRAG-----

Query:  ----------------------------------------------PSPSSEALAIAYRQLDQIRENLKT----------------------------YW
                                                      PS S  A A + R    I + LK                             +W
Subjt:  ----------------------------------------------PSPSSEALAIAYRQLDQIRENLKT----------------------------YW

Query:  AYAKERDEAIREFYLSIGLSIAPVFPNFPQSLLPQEDKDSDEEEDGNNDED
        AY+KERD A+++   +      P FP FPQ +L    KD D E +  +D+D
Subjt:  AYAKERDEAIREFYLSIGLSIAPVFPNFPQSLLPQEDKDSDEEEDGNNDED

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]2.8e-2633.33Show/hide
Query:  ASPKNPFPEVFRDVNFQERMEIMRKRDFLNEKGF---SNRAEALPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYDGLREESISMAVVRGKMASFSSVD
        AS    F     ++ ++E ++    R    EK F   +++    P F++ VI Q+ WQ FCAHP++ +VPLVREFY  +         +RG     S   
Subjt:  ASPKNPFPEVFRDVNFQERMEIMRKRDFLNEKGF---SNRAEALPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYDGLREESISMAVVRGKMASFSSVD

Query:  INRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLKINIG
        IN ++ +  P++   ++ + + +  ++   L+ VA  G +W  S     T + S L P + VW HFLK+RL+PTTH  T+S + V LLY ++ G  IN+G
Subjt:  INRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLKINIG

Query:  SIIRDEILACGRKRAG
         +I  EI AC  +++G
Subjt:  SIIRDEILACGRKRAG

XP_038904385.1 uncharacterized protein LOC120090747 [Benincasa hispida]1.1e-2229.13Show/hide
Query:  RGEEEKESKEAETSSNSETESDSEMKELDDDQVPIFAALRRKRRREIKAEKRTKNKNDPIFAKRPRTRSMDASPAAPPTVSPAKSKAKSPKAASPKNPFP
        +G+++++SKE       E     E KE    +       +R+RRRE   EKR K +     A+ P       + +    VSP + K+  P+ AS  +   
Subjt:  RGEEEKESKEAETSSNSETESDSEMKELDDDQVPIFAALRRKRRREIKAEKRTKNKNDPIFAKRPRTRSMDASPAAPPTVSPAKSKAKSPKAASPKNPFP

Query:  EVFRDVNFQERMEIMRKR-----------------------DFLNEKGFSNRAEALPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYDGLREESISMAV
         V  D    +   +MR+R                       D + E GF   +  LP+F S V+ ++ W+ F       +  +VR FY G    +    +
Subjt:  EVFRDVNFQERMEIMRKR-----------------------DFLNEKGFSNRAEALPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYDGLREESISMAV

Query:  VRGKMASFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLL
        ++G +  FS+ DIN +Y++K   +  GN +I +P  ++M++AL+ +   G QW  S   +KTL  S L PE+ +W++ +K R++PT+HD T+S DRVM  
Subjt:  VRGKMASFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLL

Query:  YCIMKGLKINIGSIIRDEI--LACGRKRAGPSP
        YCI  G+ I++  +I  +    A  ++R G SP
Subjt:  YCIMKGLKINIGSIIRDEI--LACGRKRAGPSP

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)4.9e-2938.46Show/hide
Query:  MRKRDFLNEKGF----SNRAEALPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYDGLREESISMAVVRGKMASFSSVDINRVYRIKAPLNPRGNDVIRN
        ++ R    EKGF    S     LP F+++VI+Q+ W++FCAHP++ +VPLVREFY  L +   +   VRG   S+S   IN V+ +  P++   ++ I N
Subjt:  MRKRDFLNEKGF----SNRAEALPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYDGLREESISMAVVRGKMASFSSVDINRVYRIKAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLKINIGSIIRDEILACGRKRAG
         +   +   L+ VA  G +W  S     T + S L P + VW HFLK+ L+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  ++ G
Subjt:  PSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLKINIGSIIRDEILACGRKRAG

A0A2P5BCG4 Uncharacterized protein (Fragment)5.3e-3130.2Show/hide
Query:  MRKRDFLNEKGF----SNRAEALPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYDGLREESISMAVVRGKMASFSSVDINRVYRIKAPLNPRGNDVIRN
        ++ R    EKGF    S     LP F+++VI+Q+ W++FCAHP++ +VPLVREFY  L +   +   VRG   S+S   IN V+ +  P++   ++ I+N
Subjt:  MRKRDFLNEKGF----SNRAEALPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYDGLREESISMAVVRGKMASFSSVDINRVYRIKAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLKINIGSIIRDEILACGRKRAG-----
         + + +   L+ VA  G +W  S     T + S L P + VW HFLK+RL+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  ++ G     
Subjt:  PSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLKINIGSIIRDEILACGRKRAG-----

Query:  ----------------------------------------------PSPSSEALAIAYRQLDQIRENLKT----------------------------YW
                                                      PS S  A A + R    I + LK                             +W
Subjt:  ----------------------------------------------PSPSSEALAIAYRQLDQIRENLKT----------------------------YW

Query:  AYAKERDEAIREFYLSIGLSIAPVFPNFPQSLLPQEDKDSDEEEDGNNDED
        AY+KERD A+++   +      P FP FPQ +L    KD D E +  +D+D
Subjt:  AYAKERDEAIREFYLSIGLSIAPVFPNFPQSLLPQEDKDSDEEEDGNNDED

A0A2P5DAQ2 Uncharacterized protein1.3e-2633.33Show/hide
Query:  ASPKNPFPEVFRDVNFQERMEIMRKRDFLNEKGF---SNRAEALPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYDGLREESISMAVVRGKMASFSSVD
        AS    F     ++ ++E ++    R    EK F   +++    P F++ VI Q+ WQ FCAHP++ +VPLVREFY  +         +RG     S   
Subjt:  ASPKNPFPEVFRDVNFQERMEIMRKRDFLNEKGF---SNRAEALPEFVSRVISQYKWQEFCAHPQEAVVPLVREFYDGLREESISMAVVRGKMASFSSVD

Query:  INRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLKINIG
        IN ++ +  P++   ++ + + +  ++   L+ VA  G +W  S     T + S L P + VW HFLK+RL+PTTH  T+S + V LLY ++ G  IN+G
Subjt:  INRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLKINIG

Query:  SIIRDEILACGRKRAG
         +I  EI AC  +++G
Subjt:  SIIRDEILACGRKRAG

W9QTD9 Uncharacterized protein5.3e-2336.78Show/hide
Query:  PEFVSRVISQYKWQEFCAHPQEAVVPLVREFYDGLREESISMAVVRGKMASFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKES
        P F++RVI Q+ W++FC HP   +VPLVREFY  L + +     V+     F++  IN ++ ++  ++    D     + +Q++  L  VA +G  W+ S
Subjt:  PEFVSRVISQYKWQEFCAHPQEAVVPLVREFYDGLREESISMAVVRGKMASFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKES

Query:  QTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLKINIGSIIRDEILAC--GRKRAG
             T +  +LK  + +W HFL  R MP+TH  T++ DRV+LLY I+ G+ +NI  I   EI AC   RKR G
Subjt:  QTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLKINIGSIIRDEILAC--GRKRAG

W9RBS1 Uncharacterized protein1.3e-2133.85Show/hide
Query:  FAKRPRTR-----SMDASPAAPPTVSPAKSKAKSPKAASPKNPFPEVFRDVNFQERMEIMRKRDFLNEKGF---SNRAEALPEFVSRVISQYKWQEFCAH
        FAKRP +      ++D + AA P        + S +  S    F +   +  ++E    +  R+ + EKGF    +     P F+S VI    WQ FC H
Subjt:  FAKRPRTR-----SMDASPAAPPTVSPAKSKAKSPKAASPKNPFPEVFRDVNFQERMEIMRKRDFLNEKGF---SNRAEALPEFVSRVISQYKWQEFCAH

Query:  PQEAVVPLVREFYDGLREESISMAVVRGKMASFSSVDINRVYRIKAPLNPRGND----VIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPE
        P + +VPLV+EFY  L+ +  +   V     +F+S  IN V  I     P  +D    +I +   +Q+KE LK +A  G QW  S     T    +L+P 
Subjt:  PQEAVVPLVREFYDGLREESISMAVVRGKMASFSSVDINRVYRIKAPLNPRGND----VIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPE

Query:  SAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLKINIGSIIRDEILACGRKRAG
        + VW HFL +RL+ +TH  TIS +R +LLY ++ G  IN+G +I D+I AC  K  G
Subjt:  SAVWLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLKINIGSIIRDEILACGRKRAG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAACTTGCCCAAATCTTCATCATCACGCAAGATCACTCGATCTCAGAGTGCTCAAACCGCCCATGAAGCTGAAGCAAGTGTTCGACGACAAGAAGAGAACCCCGA
AACACCCATGCACGACACGAGAAGGACGAGACCCTTGGGTTTCTCACCGGTGATCGTGAACCAAGAGCCCAATGCTCAAACTCCGTCTTCCTCGACAATGCAGGCCACTT
TGAGGGAGAATCCGAGTTCGTCTCAACTCAGGAGGTCCACGTGCTCAAGTGTCGTCCACAAAACCCAAAAATCCGCAACTCAACAATTCAGAAAACGCTCACGGGAGTGG
TTTTCAATGATCTGGACGATGGGAGCTGAGAGACGTGCTACTCTTGAAGAAGAAGTGAGTAGGCGAGATGAAGAAGAAATCGCCAAGGCAGCAGGAAACTCTCGACAAGG
AGAGGCTTCAACGGGTAAACATTCTGAACCTTCAACTAACCCCTCTTCGTCTTGCAGGAACAAACCATTCGTTACCTACAGTGCAAGGAGGAGGAGTCCCAAGAAAGTTG
TGCCCGAGAAACCACTTGTAATTGAGCCCCTTAAAACCGAGAGAATGCCCCCGAACGTGTTCGAGGACATAATCCGTCAAGCTGTGGCAAAGGCTCTAGTGATTTCCGAA
GGCTATAGGGTTGAACAAGAAGCTTTGAAGGATATTGAGGCTGAGAGAGAGATGGAAAATCAGCACATGATGAAAGAGGATGAGGTTGCAAGAGAAAGAGAGCTTGAAGA
AGAAAAGAAAAAGGAAGAGAAAAGGCAGGAGGCCGAGAGGGCCAAGTTAGCTGAAGAAGAGGAAAGAAAGTTAGGAGAAAACCTCAGGAGGGCAGCAGTTGATTTGCAGC
TCCTTGAGGAAGAAAAACAGAGAAAAAAAGAATCGAAAGAAGATGAGAAAAGAAGAAAGGATGCTGAAGACTTCCTTGCAGCTTTTGAGCCACTCCACAAGGCTCAAAGC
GAGGCTGAGATGTTGCGAGGAAGAGAAGAAAAGGCCCAACAGGGGCCAACTGAAGGAAGTTCAGAAAAAAAAAAAGAAATAGAAGAAGTGGATAAAGGCCAGAATGCGAC
CGCATCTGGACCGCATTCTGAAGAAGGCCTGACAGAGGCCACTGAAGCACAACCTGCTGATGAGGTTTTTGAACCTCTATTCAAAGATGACCCACCAGCAGCTGATAGCA
CCTCTTCGGGAGAGAAGAGAGGTGAAGAAGAGAAAGAAAGCAAGGAGGCCGAGACCTCCAGTAATTCAGAGACAGAATCTGATTCAGAGATGAAGGAGCTGGATGACGAC
CAAGTTCCTATCTTTGCAGCATTGAGAAGAAAGAGAAGAAGAGAGATTAAAGCTGAAAAGAGGACCAAGAACAAGAATGACCCCATATTTGCCAAGAGGCCGAGGACTAG
GTCCATGGACGCCTCTCCTGCAGCTCCTCCTACCGTCTCACCCGCCAAGTCAAAAGCCAAATCTCCTAAGGCTGCATCTCCTAAAAATCCATTCCCCGAAGTATTCAGAG
ATGTTAATTTTCAGGAAAGGATGGAGATAATGAGAAAAAGAGATTTCCTCAACGAGAAAGGATTCTCTAACAGAGCAGAAGCACTGCCAGAATTTGTAAGCAGAGTTATC
TCCCAGTACAAGTGGCAGGAGTTCTGTGCTCACCCTCAGGAGGCCGTAGTGCCTTTAGTTCGTGAATTTTACGATGGCCTGAGGGAGGAAAGCATCAGTATGGCGGTGGT
GAGAGGCAAAATGGCCAGCTTCTCTTCTGTAGACATTAACCGGGTGTACAGAATCAAAGCACCCTTGAACCCAAGAGGGAACGATGTCATTAGGAACCCCTCGGCCAAGC
AAATGAAAGAGGCACTAAAACTAGTGGCCAACAAGGGAGTTCAGTGGAAAGAGTCCCAAACAAAGGTGAAGACTCTAGTGCCCAGTGATTTAAAGCCAGAATCGGCAGTT
TGGCTTCACTTTCTGAAGAACCGATTGATGCCAACCACCCACGATAGCACCATCTCAGTAGATAGAGTTATGCTACTCTACTGTATTATGAAAGGGTTGAAGATCAATAT
TGGAAGCATAATCAGGGATGAGATTCTTGCCTGTGGAAGGAAACGAGCAGGGCCCTCACCATCATCGGAAGCCTTAGCCATTGCCTACCGTCAGCTTGATCAGATCAGGG
AAAACCTGAAGACATATTGGGCATATGCAAAGGAGAGGGATGAAGCCATTAGAGAGTTCTATCTCTCTATCGGCCTAAGTATTGCACCGGTCTTTCCCAATTTCCCTCAA
TCGCTGCTGCCTCAAGAAGACAAGGATTCTGATGAAGAAGAAGATGGAAATAATGATGAAGATGATGACGAGAAAGAGAGTTCCTCGGACGAGGACTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAACTTGCCCAAATCTTCATCATCACGCAAGATCACTCGATCTCAGAGTGCTCAAACCGCCCATGAAGCTGAAGCAAGTGTTCGACGACAAGAAGAGAACCCCGA
AACACCCATGCACGACACGAGAAGGACGAGACCCTTGGGTTTCTCACCGGTGATCGTGAACCAAGAGCCCAATGCTCAAACTCCGTCTTCCTCGACAATGCAGGCCACTT
TGAGGGAGAATCCGAGTTCGTCTCAACTCAGGAGGTCCACGTGCTCAAGTGTCGTCCACAAAACCCAAAAATCCGCAACTCAACAATTCAGAAAACGCTCACGGGAGTGG
TTTTCAATGATCTGGACGATGGGAGCTGAGAGACGTGCTACTCTTGAAGAAGAAGTGAGTAGGCGAGATGAAGAAGAAATCGCCAAGGCAGCAGGAAACTCTCGACAAGG
AGAGGCTTCAACGGGTAAACATTCTGAACCTTCAACTAACCCCTCTTCGTCTTGCAGGAACAAACCATTCGTTACCTACAGTGCAAGGAGGAGGAGTCCCAAGAAAGTTG
TGCCCGAGAAACCACTTGTAATTGAGCCCCTTAAAACCGAGAGAATGCCCCCGAACGTGTTCGAGGACATAATCCGTCAAGCTGTGGCAAAGGCTCTAGTGATTTCCGAA
GGCTATAGGGTTGAACAAGAAGCTTTGAAGGATATTGAGGCTGAGAGAGAGATGGAAAATCAGCACATGATGAAAGAGGATGAGGTTGCAAGAGAAAGAGAGCTTGAAGA
AGAAAAGAAAAAGGAAGAGAAAAGGCAGGAGGCCGAGAGGGCCAAGTTAGCTGAAGAAGAGGAAAGAAAGTTAGGAGAAAACCTCAGGAGGGCAGCAGTTGATTTGCAGC
TCCTTGAGGAAGAAAAACAGAGAAAAAAAGAATCGAAAGAAGATGAGAAAAGAAGAAAGGATGCTGAAGACTTCCTTGCAGCTTTTGAGCCACTCCACAAGGCTCAAAGC
GAGGCTGAGATGTTGCGAGGAAGAGAAGAAAAGGCCCAACAGGGGCCAACTGAAGGAAGTTCAGAAAAAAAAAAAGAAATAGAAGAAGTGGATAAAGGCCAGAATGCGAC
CGCATCTGGACCGCATTCTGAAGAAGGCCTGACAGAGGCCACTGAAGCACAACCTGCTGATGAGGTTTTTGAACCTCTATTCAAAGATGACCCACCAGCAGCTGATAGCA
CCTCTTCGGGAGAGAAGAGAGGTGAAGAAGAGAAAGAAAGCAAGGAGGCCGAGACCTCCAGTAATTCAGAGACAGAATCTGATTCAGAGATGAAGGAGCTGGATGACGAC
CAAGTTCCTATCTTTGCAGCATTGAGAAGAAAGAGAAGAAGAGAGATTAAAGCTGAAAAGAGGACCAAGAACAAGAATGACCCCATATTTGCCAAGAGGCCGAGGACTAG
GTCCATGGACGCCTCTCCTGCAGCTCCTCCTACCGTCTCACCCGCCAAGTCAAAAGCCAAATCTCCTAAGGCTGCATCTCCTAAAAATCCATTCCCCGAAGTATTCAGAG
ATGTTAATTTTCAGGAAAGGATGGAGATAATGAGAAAAAGAGATTTCCTCAACGAGAAAGGATTCTCTAACAGAGCAGAAGCACTGCCAGAATTTGTAAGCAGAGTTATC
TCCCAGTACAAGTGGCAGGAGTTCTGTGCTCACCCTCAGGAGGCCGTAGTGCCTTTAGTTCGTGAATTTTACGATGGCCTGAGGGAGGAAAGCATCAGTATGGCGGTGGT
GAGAGGCAAAATGGCCAGCTTCTCTTCTGTAGACATTAACCGGGTGTACAGAATCAAAGCACCCTTGAACCCAAGAGGGAACGATGTCATTAGGAACCCCTCGGCCAAGC
AAATGAAAGAGGCACTAAAACTAGTGGCCAACAAGGGAGTTCAGTGGAAAGAGTCCCAAACAAAGGTGAAGACTCTAGTGCCCAGTGATTTAAAGCCAGAATCGGCAGTT
TGGCTTCACTTTCTGAAGAACCGATTGATGCCAACCACCCACGATAGCACCATCTCAGTAGATAGAGTTATGCTACTCTACTGTATTATGAAAGGGTTGAAGATCAATAT
TGGAAGCATAATCAGGGATGAGATTCTTGCCTGTGGAAGGAAACGAGCAGGGCCCTCACCATCATCGGAAGCCTTAGCCATTGCCTACCGTCAGCTTGATCAGATCAGGG
AAAACCTGAAGACATATTGGGCATATGCAAAGGAGAGGGATGAAGCCATTAGAGAGTTCTATCTCTCTATCGGCCTAAGTATTGCACCGGTCTTTCCCAATTTCCCTCAA
TCGCTGCTGCCTCAAGAAGACAAGGATTCTGATGAAGAAGAAGATGGAAATAATGATGAAGATGATGACGAGAAAGAGAGTTCCTCGGACGAGGACTAA
Protein sequenceShow/hide protein sequence
MKNLPKSSSSRKITRSQSAQTAHEAEASVRRQEENPETPMHDTRRTRPLGFSPVIVNQEPNAQTPSSSTMQATLRENPSSSQLRRSTCSSVVHKTQKSATQQFRKRSREW
FSMIWTMGAERRATLEEEVSRRDEEEIAKAAGNSRQGEASTGKHSEPSTNPSSSCRNKPFVTYSARRRSPKKVVPEKPLVIEPLKTERMPPNVFEDIIRQAVAKALVISE
GYRVEQEALKDIEAEREMENQHMMKEDEVARERELEEEKKKEEKRQEAERAKLAEEEERKLGENLRRAAVDLQLLEEEKQRKKESKEDEKRRKDAEDFLAAFEPLHKAQS
EAEMLRGREEKAQQGPTEGSSEKKKEIEEVDKGQNATASGPHSEEGLTEATEAQPADEVFEPLFKDDPPAADSTSSGEKRGEEEKESKEAETSSNSETESDSEMKELDDD
QVPIFAALRRKRRREIKAEKRTKNKNDPIFAKRPRTRSMDASPAAPPTVSPAKSKAKSPKAASPKNPFPEVFRDVNFQERMEIMRKRDFLNEKGFSNRAEALPEFVSRVI
SQYKWQEFCAHPQEAVVPLVREFYDGLREESISMAVVRGKMASFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVANKGVQWKESQTKVKTLVPSDLKPESAV
WLHFLKNRLMPTTHDSTISVDRVMLLYCIMKGLKINIGSIIRDEILACGRKRAGPSPSSEALAIAYRQLDQIRENLKTYWAYAKERDEAIREFYLSIGLSIAPVFPNFPQ
SLLPQEDKDSDEEEDGNNDEDDDEKESSSDED