CuGenDBv2

Spg024946 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg024946
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Nucleolar protein 58-like
Genome location: scaffold12:13062878..13071381
RNA-Seq Expression: Spg024946
Synteny: Spg024946
Gene Ontology terms: NA
InterPro domains: NA
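
The genome location field above follows a `scaffold:start..end` pattern. As a minimal sketch, the snippet below splits such a string into its parts and computes the span of the locus. It assumes the coordinates are 1-based and inclusive, which this record does not state. `Location` and `parse_location` are hypothetical names introduced only for illustration and are not part of any CuGenDB tooling.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'scaffold:start..end' genome location."""
    scaffold: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive,
        # so both endpoints count toward the length.
        return self.end - self.start + 1


# Scaffold name, a colon, then two integers separated by '..'
_LOCATION_RE = re.compile(r"^(?P<scaffold>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a location string; raise ValueError if it does not match."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match["scaffold"], int(match["start"]), int(match["end"]))


loc = parse_location("scaffold12:13062878..13071381")
print(loc.scaffold, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the locus for Spg024946 spans 8504 positions on scaffold12.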


Homology
GenBank top hits (e value, % identity, alignment)
EXB49850.1 hypothetical protein L484_000844 [Morus notabilis]; e value 1.9e-27; identity 30.21%
Query:  GQNATASRPHSDEEKRDEEEKERKEAETPSDSEIESDSEIEELDDDQEQM--EIMRKRDFLNEKGF---SNRAGALPEFVSRVILQYKWQKFCAHSQEFV
        G+ + A RP S    R +   ++  A  PS S  +  +  + +D+  E+   E +  R+ + EKGF    +     P F+S VI+   WQ FC H  + +
Subjt:  GQNATASRPHSDEEKRDEEEKERKEAETPSDSEIESDSEIEELDDDQEQM--EIMRKRDFLNEKGF---SNRAGALPEFVSRVILQYKWQKFCAHSQEFV

Query:  VPLVREFYAGLREESISMAVVRGKKVSFSSVDINRVYKIKTSLHPRGNDVIRNPSAKQMKEALKLVANKGVQWKEAQTKVKTLVPSDLKPESAVWLHFLK
        VPLV+EFYA L+ +  +   V    ++F+S  IN V  I         ++I +   +Q+KE LK +A  G QW  +     T    +L+P + VW HFL 
Subjt:  VPLVREFYAGLREESISMAVVRGKKVSFSSVDINRVYKIKTSLHPRGNDVIRNPSAKQMKEALKLVANKGVQWKEAQTKVKTLVPSDLKPESAVWLHFLK

Query:  NRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKGAGKLFFGSLITSLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQNSLQRKDKAS
        +RL+ +TH  TIS +R +LLY ++ G  IN+G +I ++I AC  KG G L+F SLI+ LC +  +     E R      +DL  I ++     ++ +K  
Subjt:  NRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKGAGKLFFGSLITSLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQNSLQRKDKAS

Query:  TSQATPPSGPNRASPSQHTSFTGPSPSSEALA-----------IAYRQLDQIRDNLNTYWAYAKEKDEAIREFY
          +      P+R S S HT     + S E L              +  L Q ++ L  +W Y++++D A+++ +
Subjt:  TSQATPPSGPNRASPSQHTSFTGPSPSSEALA-----------IAYRQLDQIRDNLNTYWAYAKEKDEAIREFY

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]; e value 5.0e-33; identity 35.98%
Query:  MRKRDFLNEKGF----SNRAGALPEFVSRVILQYKWQKFCAHSQEFVVPLVREFYAGLREESISMAVVRGKKVSFSSVDINRVYKIKTSLHPRGNDVIRN
        ++ R    EKGF    S   G LP F+++VI Q+ W++FCAH ++ +VPLVREFYA L +   +   VRG +VS+S   IN V+ +   +    ++ I N
Subjt:  MRKRDFLNEKGF----SNRAGALPEFVSRVILQYKWQKFCAHSQEFVVPLVREFYAGLREESISMAVVRGKKVSFSSVDINRVYKIKTSLHPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKEAQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKGAGKLFFG
         +   +   L+ VA  G +W  +     T + S L P + VW HFLK+ L+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  +  G LFF 
Subjt:  PSAKQMKEALKLVANKGVQWKEAQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKGAGKLFFG

Query:  SLITSLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQNSLQRKDKASTSQATPPSGPNRASPSQ
        SLIT LC+  +     +EE+      ID   + ++ Q     +    ++Q    S P  AS S+
Subjt:  SLITSLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQNSLQRKDKASTSQATPPSGPNRASPSQ

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]; e value 1.7e-36; identity 32.2%
Query:  MRKRDFLNEKGF----SNRAGALPEFVSRVILQYKWQKFCAHSQEFVVPLVREFYAGLREESISMAVVRGKKVSFSSVDINRVYKIKTSLHPRGNDVIRN
        ++ R    EKGF    S   G LP F+++VI Q+ W++FCAH ++ +VPLVREFYA L +   +   VRG +VS+S   IN V+ +   +    ++ I+N
Subjt:  MRKRDFLNEKGF----SNRAGALPEFVSRVILQYKWQKFCAHSQEFVVPLVREFYAGLREESISMAVVRGKKVSFSSVDINRVYKIKTSLHPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKEAQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKGAGKLFFG
         + + +   L+ VA  G +W  +     T + S L P + VW HFLK+RL+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  +  G LFF 
Subjt:  PSAKQMKEALKLVANKGVQWKEAQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKGAGKLFFG

Query:  SLITSLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQN--SLQRKDKASTSQATPPSGPNRASPSQHTSFTGPSPSSEALAIAYRQ--LDQIRDNLNTYW
        SLIT LC+  +     +EE+      ID   + ++ Q   +   +  +S+  AT  S        Q         S + +   +    L         +W
Subjt:  SLITSLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQN--SLQRKDKASTSQATPPSGPNRASPSQHTSFTGPSPSSEALAIAYRQ--LDQIRDNLNTYW

Query:  AYAKEKDEAIREFYLSIAPSIAPIFLDFPRSLLPQEDKDSDEEDDENDDEENEE
        AY+KE+D A+++   +      P F  FP+ +L   D + + E D++   E  E
Subjt:  AYAKEKDEAIREFYLSIAPSIAPIFLDFPRSLLPQEDKDSDEEDDENDDEENEE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]; e value 6.2e-31; identity 37.06%
Query:  PEFVSRVILQYKWQKFCAHSQEFVVPLVREFYAGLREESISMAVVRGKKVSFSSVDINRVYKIKTSLHPRGNDVIRNPSAKQMKEALKLVANKGVQWKEA
        P F++ VI+Q+ WQ FCAH ++ +VPLVREFY  +         +RG +V  S   IN ++ +   +    ++ + + +  ++   L+ VA  G +W  +
Subjt:  PEFVSRVILQYKWQKFCAHSQEFVVPLVREFYAGLREESISMAVVRGKKVSFSSVDINRVYKIKTSLHPRGNDVIRNPSAKQMKEALKLVANKGVQWKEA

Query:  QTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKGAGKLFFGSLITSLCQRVKIVPGKDEER
             T + S L P + VW HFLK+RL+PTTH  T+S + V LLY ++ G  IN+G +I  EI AC  + +G LFF SLITS+C+  +     +EE+
Subjt:  QTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKGAGKLFFGSLITSLCQRVKIVPGKDEER

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]; e value 1.9e-27; identity 31.41%
Query:  FCAHSQEFVVPLVREFYAGLREESISMAVVRGKKVSFSSVDINRVYKIKTSLHPRGNDVIRNPSAKQMKEALKLVANKGVQWKEAQTKVKTLVPSDLKPE
        F A   +F +PLVREFYA L +   +   VRG +VS+S   IN V+ +   +    ++ I N +  ++   L+ VA  G +W  +     T + S L P 
Subjt:  FCAHSQEFVVPLVREFYAGLREESISMAVVRGKKVSFSSVDINRVYKIKTSLHPRGNDVIRNPSAKQMKEALKLVANKGVQWKEAQTKVKTLVPSDLKPE

Query:  SAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKGAGKLFFGSLITSLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQN
        + VW HFLK+RL+PTTH   +S DR++LL+ ++ G  IN+G +I  EI AC  +  G LFF SLIT LC+    +   +EE+      ID   + ++ Q 
Subjt:  SAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKGAGKLFFGSLITSLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQN

Query:  SLQRKDKASTSQATPPSGPNRASPSQHTSFTGPSPSSEALAIAYRQLDQIRDNLNTYWAYAKEKDEAIREFYLSIAPSIAPIFLDFPRSLLPQEDKDSDE
              +  T     PS    A+ S   +        +AL     Q +        +WAY+KE+D A+++   +      P F  FP+ +L   D + + 
Subjt:  SLQRKDKASTSQATPPSGPNRASPSQHTSFTGPSPSSEALAIAYRQLDQIRDNLNTYWAYAKEKDEAIREFYLSIAPSIAPIFLDFPRSLLPQEDKDSDE

Query:  EDDENDDEENEE
        E D++   E  E
Subjt:  EDDENDDEENEE

TrEMBL top hits (e value, % identity, alignment)
A0A2P5AGA5 Uncharacterized protein (Fragment); e value 2.4e-33; identity 35.98%
Query:  MRKRDFLNEKGF----SNRAGALPEFVSRVILQYKWQKFCAHSQEFVVPLVREFYAGLREESISMAVVRGKKVSFSSVDINRVYKIKTSLHPRGNDVIRN
        ++ R    EKGF    S   G LP F+++VI Q+ W++FCAH ++ +VPLVREFYA L +   +   VRG +VS+S   IN V+ +   +    ++ I N
Subjt:  MRKRDFLNEKGF----SNRAGALPEFVSRVILQYKWQKFCAHSQEFVVPLVREFYAGLREESISMAVVRGKKVSFSSVDINRVYKIKTSLHPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKEAQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKGAGKLFFG
         +   +   L+ VA  G +W  +     T + S L P + VW HFLK+ L+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  +  G LFF 
Subjt:  PSAKQMKEALKLVANKGVQWKEAQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKGAGKLFFG

Query:  SLITSLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQNSLQRKDKASTSQATPPSGPNRASPSQ
        SLIT LC+  +     +EE+      ID   + ++ Q     +    ++Q    S P  AS S+
Subjt:  SLITSLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQNSLQRKDKASTSQATPPSGPNRASPSQ

A0A2P5BCG4 Uncharacterized protein (Fragment); e value 8.1e-37; identity 32.2%
Query:  MRKRDFLNEKGF----SNRAGALPEFVSRVILQYKWQKFCAHSQEFVVPLVREFYAGLREESISMAVVRGKKVSFSSVDINRVYKIKTSLHPRGNDVIRN
        ++ R    EKGF    S   G LP F+++VI Q+ W++FCAH ++ +VPLVREFYA L +   +   VRG +VS+S   IN V+ +   +    ++ I+N
Subjt:  MRKRDFLNEKGF----SNRAGALPEFVSRVILQYKWQKFCAHSQEFVVPLVREFYAGLREESISMAVVRGKKVSFSSVDINRVYKIKTSLHPRGNDVIRN

Query:  PSAKQMKEALKLVANKGVQWKEAQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKGAGKLFFG
         + + +   L+ VA  G +W  +     T + S L P + VW HFLK+RL+PTTH  T+S DR++LL+ ++ G  IN+G +I  EI AC  +  G LFF 
Subjt:  PSAKQMKEALKLVANKGVQWKEAQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKGAGKLFFG

Query:  SLITSLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQN--SLQRKDKASTSQATPPSGPNRASPSQHTSFTGPSPSSEALAIAYRQ--LDQIRDNLNTYW
        SLIT LC+  +     +EE+      ID   + ++ Q   +   +  +S+  AT  S        Q         S + +   +    L         +W
Subjt:  SLITSLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQN--SLQRKDKASTSQATPPSGPNRASPSQHTSFTGPSPSSEALAIAYRQ--LDQIRDNLNTYW

Query:  AYAKEKDEAIREFYLSIAPSIAPIFLDFPRSLLPQEDKDSDEEDDENDDEENEE
        AY+KE+D A+++   +      P F  FP+ +L   D + + E D++   E  E
Subjt:  AYAKEKDEAIREFYLSIAPSIAPIFLDFPRSLLPQEDKDSDEEDDENDDEENEE

A0A2P5DAQ2 Uncharacterized protein; e value 3.0e-31; identity 37.06%
Query:  PEFVSRVILQYKWQKFCAHSQEFVVPLVREFYAGLREESISMAVVRGKKVSFSSVDINRVYKIKTSLHPRGNDVIRNPSAKQMKEALKLVANKGVQWKEA
        P F++ VI+Q+ WQ FCAH ++ +VPLVREFY  +         +RG +V  S   IN ++ +   +    ++ + + +  ++   L+ VA  G +W  +
Subjt:  PEFVSRVILQYKWQKFCAHSQEFVVPLVREFYAGLREESISMAVVRGKKVSFSSVDINRVYKIKTSLHPRGNDVIRNPSAKQMKEALKLVANKGVQWKEA

Query:  QTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKGAGKLFFGSLITSLCQRVKIVPGKDEER
             T + S L P + VW HFLK+RL+PTTH  T+S + V LLY ++ G  IN+G +I  EI AC  + +G LFF SLITS+C+  +     +EE+
Subjt:  QTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKGAGKLFFGSLITSLCQRVKIVPGKDEER

A0A2P5DXM3 Uncharacterized protein; e value 9.0e-28; identity 31.41%
Query:  FCAHSQEFVVPLVREFYAGLREESISMAVVRGKKVSFSSVDINRVYKIKTSLHPRGNDVIRNPSAKQMKEALKLVANKGVQWKEAQTKVKTLVPSDLKPE
        F A   +F +PLVREFYA L +   +   VRG +VS+S   IN V+ +   +    ++ I N +  ++   L+ VA  G +W  +     T + S L P 
Subjt:  FCAHSQEFVVPLVREFYAGLREESISMAVVRGKKVSFSSVDINRVYKIKTSLHPRGNDVIRNPSAKQMKEALKLVANKGVQWKEAQTKVKTLVPSDLKPE

Query:  SAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKGAGKLFFGSLITSLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQN
        + VW HFLK+RL+PTTH   +S DR++LL+ ++ G  IN+G +I  EI AC  +  G LFF SLIT LC+    +   +EE+      ID   + ++ Q 
Subjt:  SAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKGAGKLFFGSLITSLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQN

Query:  SLQRKDKASTSQATPPSGPNRASPSQHTSFTGPSPSSEALAIAYRQLDQIRDNLNTYWAYAKEKDEAIREFYLSIAPSIAPIFLDFPRSLLPQEDKDSDE
              +  T     PS    A+ S   +        +AL     Q +        +WAY+KE+D A+++   +      P F  FP+ +L   D + + 
Subjt:  SLQRKDKASTSQATPPSGPNRASPSQHTSFTGPSPSSEALAIAYRQLDQIRDNLNTYWAYAKEKDEAIREFYLSIAPSIAPIFLDFPRSLLPQEDKDSDE

Query:  EDDENDDEENEE
        E D++   E  E
Subjt:  EDDENDDEENEE

W9RBS1 Uncharacterized protein; e value 9.0e-28; identity 30.21%
Query:  GQNATASRPHSDEEKRDEEEKERKEAETPSDSEIESDSEIEELDDDQEQM--EIMRKRDFLNEKGF---SNRAGALPEFVSRVILQYKWQKFCAHSQEFV
        G+ + A RP S    R +   ++  A  PS S  +  +  + +D+  E+   E +  R+ + EKGF    +     P F+S VI+   WQ FC H  + +
Subjt:  GQNATASRPHSDEEKRDEEEKERKEAETPSDSEIESDSEIEELDDDQEQM--EIMRKRDFLNEKGF---SNRAGALPEFVSRVILQYKWQKFCAHSQEFV

Query:  VPLVREFYAGLREESISMAVVRGKKVSFSSVDINRVYKIKTSLHPRGNDVIRNPSAKQMKEALKLVANKGVQWKEAQTKVKTLVPSDLKPESAVWLHFLK
        VPLV+EFYA L+ +  +   V    ++F+S  IN V  I         ++I +   +Q+KE LK +A  G QW  +     T    +L+P + VW HFL 
Subjt:  VPLVREFYAGLREESISMAVVRGKKVSFSSVDINRVYKIKTSLHPRGNDVIRNPSAKQMKEALKLVANKGVQWKEAQTKVKTLVPSDLKPESAVWLHFLK

Query:  NRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKGAGKLFFGSLITSLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQNSLQRKDKAS
        +RL+ +TH  TIS +R +LLY ++ G  IN+G +I ++I AC  KG G L+F SLI+ LC +  +     E R      +DL  I ++     ++ +K  
Subjt:  NRLMPTTHDNTISVDRVMLLYCIMKGLEINIGSIIREEILACGRKGAGKLFFGSLITSLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQNSLQRKDKAS

Query:  TSQATPPSGPNRASPSQHTSFTGPSPSSEALA-----------IAYRQLDQIRDNLNTYWAYAKEKDEAIREFY
          +      P+R S S HT     + S E L              +  L Q ++ L  +W Y++++D A+++ +
Subjt:  TSQATPPSGPNRASPSQHTSFTGPSPSSEALA-----------IAYRQLDQIRDNLNTYWAYAKEKDEAIREFY

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGATTCGGGCGATGGGATCTCAAAGACGTGCAACTCTTGAAGAAGTAGAAAATAGATTAGATGAAGAAGAAGCTGCCAAGGCAGCAGAGAGCTCTCGGCAAAGAGAGGC
TTCAACGGGTAAAACTTCTGAACTTTCAACTAACCCTTCTTCGTCTTGCAGGAACAAACCATTCGTTACTTACAGTGCAAGGAAGAGGAGTCCAAAGAAAGTTGCACCCG
AAAAGCCGCTTGTCATCGAGCCCCTCAAAACCGCAAGAATGCCCCTGGACGTGTTCGAAGACATAATCCGCCAAGCTGTGGCAAAGGCTCTTGTGATTGCCGAAGGGTAT
AAGGCTGAACAAGAAGCCTTGAGGGAGATTGAGGCTGAAAGGGAAATTGAAAACCAGCATATGAGGGAAGAGGATGAGTTTTTGAGAAAACGAGATCTTGATGAAGAAAA
GAAAAAGGAAGAGGAAAGGCAAGAGGCCGAGAAGACCAAGTTAGCTGAAGAAGAAGAAAGAAAACTAGGTGAAAACCTCAGGAGGGCAGCGCTTTTGAGCCACTCCACAA
GTCTCAAAGTGAGGCTGAGATGTTGCAAGGAAGAGAAGAAGAGGCCCATCAGGGGCCAAATGAAGAAAATCAAGAAAAAGAAAAAAGAAAAAGAAGCAGTGGATGAAGGC
CAGAATGCGACCGCATCTAGGCCACATTCTGATGAAGAGAAGAGGGATGAAGAAGAGAAAGAAAGAAAGGAGGCCGAGACCCCCAGTGATTCAGAGATAGAATCTGATTC
AGAGATCGAGGAACTGGATGACGACCAAGAACAAATGGAGATCATGAGAAAAAGAGATTTCCTCAACGAGAAAGGATTCTCTAATAGAGCAGGAGCACTGCCAGAGTTCG
TAAGCAGAGTTATCTTACAGTATAAGTGGCAGAAGTTCTGTGCTCACTCTCAGGAGTTCGTGGTGCCTTTAGTTCGTGAATTTTACGCCGGCCTAAGGGAGGAGAGCATC
AGTATGGCGGTAGTGAGAGGCAAGAAGGTCAGCTTCTCTTCAGTAGACATCAATAGGGTGTACAAAATCAAAACATCCCTACATCCAAGAGGGAATGATGTCATTAGGAA
CCCTTCGGCCAAACAGATGAAAGAAGCATTGAAATTAGTGGCCAACAAGGGTGTTCAGTGGAAAGAAGCCCAGACGAAGGTGAAGACTCTAGTGCCAAGCGACCTAAAGC
CAGAATCGGCAGTATGGCTTCACTTTCTGAAGAATCGATTGATGCCAACCACCCACGATAACACCATCTCAGTAGATAGAGTCATGCTCCTCTACTGTATTATGAAGGGG
TTGGAGATCAACATAGGGAGTATTATTAGGGAGGAGATTCTTGCCTGTGGAAGGAAAGGAGCAGGAAAACTTTTCTTTGGATCGCTTATCACCTCGCTCTGTCAAAGAGT
GAAGATAGTTCCTGGCAAGGATGAAGAGCGTCACTTCTTCAAGCCAACCATTGACCTATCCTTGATCGGGAAGCTTCAACAGAATAGCCTCCAAAGGAAAGACAAAGCCT
CCACATCTCAGGCCACTCCACCATCAGGGCCAAACAGGGCTTCTCCATCCCAACACACTTCTTTTACAGGGCCCTCACCATCATCTGAAGCCCTAGCTATTGCCTACCGT
CAGCTAGATCAAATCAGGGACAACCTGAATACTTATTGGGCATATGCAAAGGAGAAGGATGAAGCCATTAGAGAGTTCTATCTCTCTATTGCCCCGAGTATTGCCCCGAT
TTTTCTCGATTTCCCTCGATCGTTGCTGCCTCAAGAAGACAAGGATTCTGATGAAGAAGATGATGAGAACGATGATGAAGAAAATGAAGAGAAAGAGAAAAAATCAGAGG
AAGCTGGAATTTGCCCAGAAATGCGACGCATTTCTGGAAAAACAGAGGCAGTTCCGAGTCTATCGCGGGTCGGCAAGCTGCCTAAGGCACCCTATATCAACTATAGGGTT
ATAGATTGCGTTCGTGAATTATCTATTTCAGAGTCGAGTCACAGAAATGTGCATAACGATAAAAGGATGGAGAAGAAAAGCCAGAATCCAAACATAGAACCTTTAGAAAA
TAACCGCAATCCGCAGAGGTCACGAGAAGAAAATCACTTGCAGAGATCGCGAGAAGAAGGTCGAGGTCGGCCTCGGTACCTTCGGCCTTCATCTCGAGACAGGCAGACTG
ATGTGAAAATTGCTGCCCTCGAGGACAAAGTAAGTGCGATGGATCACAATTTATCTAGGATACTTCATATCTTTGATAAACCTGGTCGTAGCACTAAAACCCATGATGAG
AGGTTGGTTAGGGATCCGAGGAAGGAGAAGGAACCAATAGAGTACACTGTAGAGTCAGAAACAAGGTCGAAGGGAAAGAAAACTGATAGCGTGACCAGCAAGGTCAGGGG
GCTGAAGCATGGATTGGAGGTAGCGTCGCGACACTATATTGCACAGCATCGCGATGCTGCTATGCACGCGTCGTGTTGCCCAAGAAATGGACAGTGTCGCGACGCTGTCT
TCACAGCGTCTCGACGCTGTGCCATTTTCCAGCAATTCCAGCTTCGGAATTTCCAAGGGGACCTGAACAAATCAAAAGGTTCGGAGGACCAAGATTTGGAAGCCTTGATC
GATCAGGTTGATCCGCCCTTCACTGATGAAGTTATGAAAGTCGAGGTGCCCCAAAAGTTCAAGGAAGCAACCTTAGTAGCCATAACAGCAGGACTGGAGGACAAGAGGTT
GCTTAATTTGATAGGTAAGAGCCAACCTCGAACCTATGCTGAATTTGTTTCCAGGGCACAGAAGTATATGAGCGTAGAGGAGCTACTGAAATCAAAGAGGTCAGAACGAA
AGCACAAGAGACATTCCTCATTCGACCAAGACAGAAAGAATGACAAAAAACCATGGACAGACGATGGTGGCCAAGGTCGAGCCGACCATGACCATGGCCGAGGTCGGGCA
CATCCCTTTGGTAAGTTTGAGAAATACACGCCAACTGCTGTCCGGCAAGAGCAAGTTCTGATGGAGTTCTGA
mRNA sequence
ATGATTCGGGCGATGGGATCTCAAAGACGTGCAACTCTTGAAGAAGTAGAAAATAGATTAGATGAAGAAGAAGCTGCCAAGGCAGCAGAGAGCTCTCGGCAAAGAGAGGC
TTCAACGGGTAAAACTTCTGAACTTTCAACTAACCCTTCTTCGTCTTGCAGGAACAAACCATTCGTTACTTACAGTGCAAGGAAGAGGAGTCCAAAGAAAGTTGCACCCG
AAAAGCCGCTTGTCATCGAGCCCCTCAAAACCGCAAGAATGCCCCTGGACGTGTTCGAAGACATAATCCGCCAAGCTGTGGCAAAGGCTCTTGTGATTGCCGAAGGGTAT
AAGGCTGAACAAGAAGCCTTGAGGGAGATTGAGGCTGAAAGGGAAATTGAAAACCAGCATATGAGGGAAGAGGATGAGTTTTTGAGAAAACGAGATCTTGATGAAGAAAA
GAAAAAGGAAGAGGAAAGGCAAGAGGCCGAGAAGACCAAGTTAGCTGAAGAAGAAGAAAGAAAACTAGGTGAAAACCTCAGGAGGGCAGCGCTTTTGAGCCACTCCACAA
GTCTCAAAGTGAGGCTGAGATGTTGCAAGGAAGAGAAGAAGAGGCCCATCAGGGGCCAAATGAAGAAAATCAAGAAAAAGAAAAAAGAAAAAGAAGCAGTGGATGAAGGC
CAGAATGCGACCGCATCTAGGCCACATTCTGATGAAGAGAAGAGGGATGAAGAAGAGAAAGAAAGAAAGGAGGCCGAGACCCCCAGTGATTCAGAGATAGAATCTGATTC
AGAGATCGAGGAACTGGATGACGACCAAGAACAAATGGAGATCATGAGAAAAAGAGATTTCCTCAACGAGAAAGGATTCTCTAATAGAGCAGGAGCACTGCCAGAGTTCG
TAAGCAGAGTTATCTTACAGTATAAGTGGCAGAAGTTCTGTGCTCACTCTCAGGAGTTCGTGGTGCCTTTAGTTCGTGAATTTTACGCCGGCCTAAGGGAGGAGAGCATC
AGTATGGCGGTAGTGAGAGGCAAGAAGGTCAGCTTCTCTTCAGTAGACATCAATAGGGTGTACAAAATCAAAACATCCCTACATCCAAGAGGGAATGATGTCATTAGGAA
CCCTTCGGCCAAACAGATGAAAGAAGCATTGAAATTAGTGGCCAACAAGGGTGTTCAGTGGAAAGAAGCCCAGACGAAGGTGAAGACTCTAGTGCCAAGCGACCTAAAGC
CAGAATCGGCAGTATGGCTTCACTTTCTGAAGAATCGATTGATGCCAACCACCCACGATAACACCATCTCAGTAGATAGAGTCATGCTCCTCTACTGTATTATGAAGGGG
TTGGAGATCAACATAGGGAGTATTATTAGGGAGGAGATTCTTGCCTGTGGAAGGAAAGGAGCAGGAAAACTTTTCTTTGGATCGCTTATCACCTCGCTCTGTCAAAGAGT
GAAGATAGTTCCTGGCAAGGATGAAGAGCGTCACTTCTTCAAGCCAACCATTGACCTATCCTTGATCGGGAAGCTTCAACAGAATAGCCTCCAAAGGAAAGACAAAGCCT
CCACATCTCAGGCCACTCCACCATCAGGGCCAAACAGGGCTTCTCCATCCCAACACACTTCTTTTACAGGGCCCTCACCATCATCTGAAGCCCTAGCTATTGCCTACCGT
CAGCTAGATCAAATCAGGGACAACCTGAATACTTATTGGGCATATGCAAAGGAGAAGGATGAAGCCATTAGAGAGTTCTATCTCTCTATTGCCCCGAGTATTGCCCCGAT
TTTTCTCGATTTCCCTCGATCGTTGCTGCCTCAAGAAGACAAGGATTCTGATGAAGAAGATGATGAGAACGATGATGAAGAAAATGAAGAGAAAGAGAAAAAATCAGAGG
AAGCTGGAATTTGCCCAGAAATGCGACGCATTTCTGGAAAAACAGAGGCAGTTCCGAGTCTATCGCGGGTCGGCAAGCTGCCTAAGGCACCCTATATCAACTATAGGGTT
ATAGATTGCGTTCGTGAATTATCTATTTCAGAGTCGAGTCACAGAAATGTGCATAACGATAAAAGGATGGAGAAGAAAAGCCAGAATCCAAACATAGAACCTTTAGAAAA
TAACCGCAATCCGCAGAGGTCACGAGAAGAAAATCACTTGCAGAGATCGCGAGAAGAAGGTCGAGGTCGGCCTCGGTACCTTCGGCCTTCATCTCGAGACAGGCAGACTG
ATGTGAAAATTGCTGCCCTCGAGGACAAAGTAAGTGCGATGGATCACAATTTATCTAGGATACTTCATATCTTTGATAAACCTGGTCGTAGCACTAAAACCCATGATGAG
AGGTTGGTTAGGGATCCGAGGAAGGAGAAGGAACCAATAGAGTACACTGTAGAGTCAGAAACAAGGTCGAAGGGAAAGAAAACTGATAGCGTGACCAGCAAGGTCAGGGG
GCTGAAGCATGGATTGGAGGTAGCGTCGCGACACTATATTGCACAGCATCGCGATGCTGCTATGCACGCGTCGTGTTGCCCAAGAAATGGACAGTGTCGCGACGCTGTCT
TCACAGCGTCTCGACGCTGTGCCATTTTCCAGCAATTCCAGCTTCGGAATTTCCAAGGGGACCTGAACAAATCAAAAGGTTCGGAGGACCAAGATTTGGAAGCCTTGATC
GATCAGGTTGATCCGCCCTTCACTGATGAAGTTATGAAAGTCGAGGTGCCCCAAAAGTTCAAGGAAGCAACCTTAGTAGCCATAACAGCAGGACTGGAGGACAAGAGGTT
GCTTAATTTGATAGGTAAGAGCCAACCTCGAACCTATGCTGAATTTGTTTCCAGGGCACAGAAGTATATGAGCGTAGAGGAGCTACTGAAATCAAAGAGGTCAGAACGAA
AGCACAAGAGACATTCCTCATTCGACCAAGACAGAAAGAATGACAAAAAACCATGGACAGACGATGGTGGCCAAGGTCGAGCCGACCATGACCATGGCCGAGGTCGGGCA
CATCCCTTTGGTAAGTTTGAGAAATACACGCCAACTGCTGTCCGGCAAGAGCAAGTTCTGATGGAGTTCTGA
Protein sequence
MIRAMGSQRRATLEEVENRLDEEEAAKAAESSRQREASTGKTSELSTNPSSSCRNKPFVTYSARKRSPKKVAPEKPLVIEPLKTARMPLDVFEDIIRQAVAKALVIAEGY
KAEQEALREIEAEREIENQHMREEDEFLRKRDLDEEKKKEEERQEAEKTKLAEEEERKLGENLRRAALLSHSTSLKVRLRCCKEEKKRPIRGQMKKIKKKKKEKEAVDEG
QNATASRPHSDEEKRDEEEKERKEAETPSDSEIESDSEIEELDDDQEQMEIMRKRDFLNEKGFSNRAGALPEFVSRVILQYKWQKFCAHSQEFVVPLVREFYAGLREESI
SMAVVRGKKVSFSSVDINRVYKIKTSLHPRGNDVIRNPSAKQMKEALKLVANKGVQWKEAQTKVKTLVPSDLKPESAVWLHFLKNRLMPTTHDNTISVDRVMLLYCIMKG
LEINIGSIIREEILACGRKGAGKLFFGSLITSLCQRVKIVPGKDEERHFFKPTIDLSLIGKLQQNSLQRKDKASTSQATPPSGPNRASPSQHTSFTGPSPSSEALAIAYR
QLDQIRDNLNTYWAYAKEKDEAIREFYLSIAPSIAPIFLDFPRSLLPQEDKDSDEEDDENDDEENEEKEKKSEEAGICPEMRRISGKTEAVPSLSRVGKLPKAPYINYRV
IDCVRELSISESSHRNVHNDKRMEKKSQNPNIEPLENNRNPQRSREENHLQRSREEGRGRPRYLRPSSRDRQTDVKIAALEDKVSAMDHNLSRILHIFDKPGRSTKTHDE
RLVRDPRKEKEPIEYTVESETRSKGKKTDSVTSKVRGLKHGLEVASRHYIAQHRDAAMHASCCPRNGQCRDAVFTASRRCAIFQQFQLRNFQGDLNKSKGSEDQDLEALI
DQVDPPFTDEVMKVEVPQKFKEATLVAITAGLEDKRLLNLIGKSQPRTYAEFVSRAQKYMSVEELLKSKRSERKHKRHSSFDQDRKNDKKPWTDDGGQGRADHDHGRGRA
HPFGKFEKYTPTAVRQEQVLMEF