CuGenDBv2

Spg015619 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg015619
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Nucleolar protein 58-like
Genome location: scaffold10:16784636..16787137
RNA-Seq Expression: Spg015619
Synteny: Spg015619
Gene Ontology terms: NA
InterPro domains: NA
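
The genome location field in this record has the form `scaffold:start..end`. A minimal sketch of how such a string could be split into its parts follows. It assumes the coordinates are 1-based and inclusive, which this page does not state; the helper names `parse_location` and `span_length` are hypothetical and not part of CuGenDB.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


scaffold, start, end = parse_location("scaffold10:16784636..16787137")
print(scaffold, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the interval for Spg015619 spans 2502 bases on scaffold10.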


Homology
GenBank top hits (e value, % identity, alignment)
EXB49850.1 hypothetical protein L484_000844 [Morus notabilis]; e value: 2.5e-27; % identity: 28.8
Query:  FAKRPRTRSMDASPAVPPTISPAKPKGKSPKAASPRNPFPEVFRDVNFQERMEIMKKRDFLNEKGF---SNRAGALPEFVSRIISQYKWQDFCAHPQEAV
        FAKRP + S    PA+    + A     S +  S    F +   +  ++E    +  R+ + EKGF    +     P F+S +I    WQ FC HP + +
Subjt:  FAKRPRTRSMDASPAVPPTISPAKPKGKSPKAASPRNPFPEVFRDVNFQERMEIMKKRDFLNEKGF---SNRAGALPEFVSRIISQYKWQDFCAHPQEAV

Query:  VPLVREFYAGLREESISMEVVRGQMVGFSSVDINRVYRIKAPLNPRGND----VIRNPSAKQMKEALKLVVNKGVQWKESQTKVKSLVPSDLKPKSAVWL
        VPLV+EFYA L+ +  +   V    + F+S  IN V  I     P  +D    +I +   +Q+KE LK +   G QW  S     +    +L+P + VW 
Subjt:  VPLVREFYAGLREESISMEVVRGQMVGFSSVDINRVYRIKAPLNPRGND----VIRNPSAKQMKEALKLVVNKGVQWKESQTKVKSLVPSDLKPKSAVWL

Query:  HFIKNRLMPTTHDNTISVDRVMLLYCLMNGLEINVGSIIRDDILACGRKLASKLFFGSLITQLCQRVKIVPGKDEERHFFKSTIDLSLIGKLQQNSIQRK
        HF+ +RL+ +TH  TIS +R +LLY ++ G  INVG +I D I AC  K    L+F SLI++LC +  +     E R      +DL  I ++     ++ 
Subjt:  HFIKNRLMPTTHDNTISVDRVMLLYCLMNGLEINVGSIIRDDILACGRKLASKLFFGSLITQLCQRVKIVPGKDEERHFFKSTIDLSLIGKLQQNSIQRK

Query:  DKA-STSQATPPTGPNVASPSQHTHFTGPSPSSKALAI-------AYRRLDQIRENVKTYWAYAKERDEAIREFY
        +K     +   P+ P+ +     +         K +++        +  L Q +E +  +W Y+++RD A+++ +
Subjt:  DKA-STSQATPPTGPNVASPSQHTHFTGPSPSSKALAI-------AYRRLDQIRENVKTYWAYAKERDEAIREFY

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]; e value: 8.1e-34; % identity: 34.47
Query:  MKKRDFLNEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEAVVPLVREFYAGLREESISMEVVRGQMVGFSSVDINRVYRIKAPLNPRGNDVIRN
        ++ R    EKGF    S   G LP F++++I+Q+ W+ FCAHP++ +VPLVREFYA L +   +   VRG  V +S   IN V+ +  P++   ++ I N
Subjt:  MKKRDFLNEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEAVVPLVREFYAGLREESISMEVVRGQMVGFSSVDINRVYRIKAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVVNKGVQWKESQTKVKSLVPSDLKPKSAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMNGLEINVGSIIRDDILACGRKLASKLFFG
         +   +   L+ V   G +W  S     + + S L P + VW HF+K+ L+PTTH  T+S DR++LL+ ++ G  INVG +I  +I AC  +    LFF 
Subjt:  PSAKQMKEALKLVVNKGVQWKESQTKVKSLVPSDLKPKSAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMNGLEINVGSIIRDDILACGRKLASKLFFG

Query:  SLITQLCQRVKIVPGKDEERHFFKSTIDLSLIGKLQQNSIQRKDKASTSQATPPTGPNVASPSQ
        SLIT+LC+  +     +EE+      ID   + ++ Q     +    ++Q    + P  AS S+
Subjt:  SLITQLCQRVKIVPGKDEERHFFKSTIDLSLIGKLQQNSIQRKDKASTSQATPPTGPNVASPSQ

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]; e value: 1.1e-38; % identity: 31.58
Query:  MKKRDFLNEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEAVVPLVREFYAGLREESISMEVVRGQMVGFSSVDINRVYRIKAPLNPRGNDVIRN
        ++ R    EKGF    S   G LP F++++I+Q+ W+ FCAHP++ +VPLVREFYA L +   +   VRG  V +S   IN V+ +  P++   ++ I+N
Subjt:  MKKRDFLNEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEAVVPLVREFYAGLREESISMEVVRGQMVGFSSVDINRVYRIKAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVVNKGVQWKESQTKVKSLVPSDLKPKSAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMNGLEINVGSIIRDDILACGRKLASKLFFG
         + + +   L+ V   G +W  S     + + S L P + VW HF+K+RL+PTTH  T+S DR++LL+ ++ G  INVG +I  +I AC  +    LFF 
Subjt:  PSAKQMKEALKLVVNKGVQWKESQTKVKSLVPSDLKPKSAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMNGLEINVGSIIRDDILACGRKLASKLFFG

Query:  SLITQLCQRVKIVPGKDEERHFFKSTIDLSLIGKLQQNSIQRKDKASTSQATPPTGPNVASPSQHTHFTGPSPSSKALAIAYRR-----------LDQIR
        SLIT+LC+  +     +EE+      ID   + ++ Q       +  T     P+    A+ S +          KAL     +           L    
Subjt:  SLITQLCQRVKIVPGKDEERHFFKSTIDLSLIGKLQQNSIQRKDKASTSQATPPTGPNVASPSQHTHFTGPSPSSKALAIAYRR-----------LDQIR

Query:  ENVKTYWAYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEEEDSDEEEDEEKENSSDE
        +  + +WAY+KERD A+++   +      P FP FPQ +L   + + + E D++  N + E
Subjt:  ENVKTYWAYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEEEDSDEEEDEEKENSSDE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]; e value: 5.5e-30; % identity: 32.78
Query:  KAASPRNPFPEVFRDVNFQER-MEIMKKRDFLNEKGFSNRAGALPEFVSRIISQYKWQDFCAHPQEAVVPLVREFYAGLREESISMEVVRGQMVGFSSVD
        KA    +   E+  + N Q R + + K+  + N K         P F++ +I Q+ WQ FCAHP++ +VPLVREFY  +         +RG  V  S   
Subjt:  KAASPRNPFPEVFRDVNFQER-MEIMKKRDFLNEKGFSNRAGALPEFVSRIISQYKWQDFCAHPQEAVVPLVREFYAGLREESISMEVVRGQMVGFSSVD

Query:  INRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVVNKGVQWKESQTKVKSLVPSDLKPKSAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMNGLEINVG
        IN ++ +  P++   ++ + + +  ++   L+ V   G +W  S     + + S L P + VW HF+K+RL+PTTH  T+S + V LLY ++ G  INVG
Subjt:  INRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVVNKGVQWKESQTKVKSLVPSDLKPKSAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMNGLEINVG

Query:  SIIRDDILACGRKLASKLFFGSLITQLCQRVKIVPGKDEER
         +I  +I AC  + +  LFF SLIT +C+  +     +EE+
Subjt:  SIIRDDILACGRKLASKLFFGSLITQLCQRVKIVPGKDEER

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]; e value: 1.0e-28; % identity: 31.02
Query:  VPLVREFYAGLREESISMEVVRGQMVGFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVVNKGVQWKESQTKVKSLVPSDLKPKSAVWLHFIK
        +PLVREFYA L +   +   VRG  V +S   IN V+ +  P++   ++ I N +  ++   L+ V   G +W  S     + + S L P + VW HF+K
Subjt:  VPLVREFYAGLREESISMEVVRGQMVGFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVVNKGVQWKESQTKVKSLVPSDLKPKSAVWLHFIK

Query:  NRLMPTTHDNTISVDRVMLLYCLMNGLEINVGSIIRDDILACGRKLASKLFFGSLITQLCQRVKIVPGKDEERHFFKSTIDLSLIGKLQQNSIQRKDKAS
        +RL+PTTH   +S DR++LL+ ++NG  INVG +I  +I AC  +    LFF SLIT+LC+    +   +EE+      ID   + ++ Q       +  
Subjt:  NRLMPTTHDNTISVDRVMLLYCLMNGLEINVGSIIRDDILACGRKLASKLFFGSLITQLCQRVKIVPGKDEERHFFKSTIDLSLIGKLQQNSIQRKDKAS

Query:  TSQATPPTGPNVASPSQHTHFTGPSPSSKALAIAYRRLDQIRENVKTYWAYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEEEDSDEEEDEEKENS
        T     P+    A+ S            KAL     + +   +  + +WAY+KERD A+++   +      P FP FPQ +L   + + + E D++  N 
Subjt:  TSQATPPTGPNVASPSQHTHFTGPSPSSKALAIAYRRLDQIRENVKTYWAYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEEEDSDEEEDEEKENS

Query:  SDE
        + E
Subjt:  SDE

TrEMBL top hits (e value, % identity, alignment)
A0A2P5AGA5 Uncharacterized protein (Fragment); e value: 3.9e-34; % identity: 34.47
Query:  MKKRDFLNEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEAVVPLVREFYAGLREESISMEVVRGQMVGFSSVDINRVYRIKAPLNPRGNDVIRN
        ++ R    EKGF    S   G LP F++++I+Q+ W+ FCAHP++ +VPLVREFYA L +   +   VRG  V +S   IN V+ +  P++   ++ I N
Subjt:  MKKRDFLNEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEAVVPLVREFYAGLREESISMEVVRGQMVGFSSVDINRVYRIKAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVVNKGVQWKESQTKVKSLVPSDLKPKSAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMNGLEINVGSIIRDDILACGRKLASKLFFG
         +   +   L+ V   G +W  S     + + S L P + VW HF+K+ L+PTTH  T+S DR++LL+ ++ G  INVG +I  +I AC  +    LFF 
Subjt:  PSAKQMKEALKLVVNKGVQWKESQTKVKSLVPSDLKPKSAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMNGLEINVGSIIRDDILACGRKLASKLFFG

Query:  SLITQLCQRVKIVPGKDEERHFFKSTIDLSLIGKLQQNSIQRKDKASTSQATPPTGPNVASPSQ
        SLIT+LC+  +     +EE+      ID   + ++ Q     +    ++Q    + P  AS S+
Subjt:  SLITQLCQRVKIVPGKDEERHFFKSTIDLSLIGKLQQNSIQRKDKASTSQATPPTGPNVASPSQ

A0A2P5BCG4 Uncharacterized protein (Fragment); e value: 5.3e-39; % identity: 31.58
Query:  MKKRDFLNEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEAVVPLVREFYAGLREESISMEVVRGQMVGFSSVDINRVYRIKAPLNPRGNDVIRN
        ++ R    EKGF    S   G LP F++++I+Q+ W+ FCAHP++ +VPLVREFYA L +   +   VRG  V +S   IN V+ +  P++   ++ I+N
Subjt:  MKKRDFLNEKGF----SNRAGALPEFVSRIISQYKWQDFCAHPQEAVVPLVREFYAGLREESISMEVVRGQMVGFSSVDINRVYRIKAPLNPRGNDVIRN

Query:  PSAKQMKEALKLVVNKGVQWKESQTKVKSLVPSDLKPKSAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMNGLEINVGSIIRDDILACGRKLASKLFFG
         + + +   L+ V   G +W  S     + + S L P + VW HF+K+RL+PTTH  T+S DR++LL+ ++ G  INVG +I  +I AC  +    LFF 
Subjt:  PSAKQMKEALKLVVNKGVQWKESQTKVKSLVPSDLKPKSAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMNGLEINVGSIIRDDILACGRKLASKLFFG

Query:  SLITQLCQRVKIVPGKDEERHFFKSTIDLSLIGKLQQNSIQRKDKASTSQATPPTGPNVASPSQHTHFTGPSPSSKALAIAYRR-----------LDQIR
        SLIT+LC+  +     +EE+      ID   + ++ Q       +  T     P+    A+ S +          KAL     +           L    
Subjt:  SLITQLCQRVKIVPGKDEERHFFKSTIDLSLIGKLQQNSIQRKDKASTSQATPPTGPNVASPSQHTHFTGPSPSSKALAIAYRR-----------LDQIR

Query:  ENVKTYWAYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEEEDSDEEEDEEKENSSDE
        +  + +WAY+KERD A+++   +      P FP FPQ +L   + + + E D++  N + E
Subjt:  ENVKTYWAYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEEEDSDEEEDEEKENSSDE

A0A2P5DAQ2 Uncharacterized protein; e value: 2.6e-30; % identity: 32.78
Query:  KAASPRNPFPEVFRDVNFQER-MEIMKKRDFLNEKGFSNRAGALPEFVSRIISQYKWQDFCAHPQEAVVPLVREFYAGLREESISMEVVRGQMVGFSSVD
        KA    +   E+  + N Q R + + K+  + N K         P F++ +I Q+ WQ FCAHP++ +VPLVREFY  +         +RG  V  S   
Subjt:  KAASPRNPFPEVFRDVNFQER-MEIMKKRDFLNEKGFSNRAGALPEFVSRIISQYKWQDFCAHPQEAVVPLVREFYAGLREESISMEVVRGQMVGFSSVD

Query:  INRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVVNKGVQWKESQTKVKSLVPSDLKPKSAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMNGLEINVG
        IN ++ +  P++   ++ + + +  ++   L+ V   G +W  S     + + S L P + VW HF+K+RL+PTTH  T+S + V LLY ++ G  INVG
Subjt:  INRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVVNKGVQWKESQTKVKSLVPSDLKPKSAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMNGLEINVG

Query:  SIIRDDILACGRKLASKLFFGSLITQLCQRVKIVPGKDEER
         +I  +I AC  + +  LFF SLIT +C+  +     +EE+
Subjt:  SIIRDDILACGRKLASKLFFGSLITQLCQRVKIVPGKDEER

A0A2P5DXM3 Uncharacterized protein; e value: 5.0e-29; % identity: 31.02
Query:  VPLVREFYAGLREESISMEVVRGQMVGFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVVNKGVQWKESQTKVKSLVPSDLKPKSAVWLHFIK
        +PLVREFYA L +   +   VRG  V +S   IN V+ +  P++   ++ I N +  ++   L+ V   G +W  S     + + S L P + VW HF+K
Subjt:  VPLVREFYAGLREESISMEVVRGQMVGFSSVDINRVYRIKAPLNPRGNDVIRNPSAKQMKEALKLVVNKGVQWKESQTKVKSLVPSDLKPKSAVWLHFIK

Query:  NRLMPTTHDNTISVDRVMLLYCLMNGLEINVGSIIRDDILACGRKLASKLFFGSLITQLCQRVKIVPGKDEERHFFKSTIDLSLIGKLQQNSIQRKDKAS
        +RL+PTTH   +S DR++LL+ ++NG  INVG +I  +I AC  +    LFF SLIT+LC+    +   +EE+      ID   + ++ Q       +  
Subjt:  NRLMPTTHDNTISVDRVMLLYCLMNGLEINVGSIIRDDILACGRKLASKLFFGSLITQLCQRVKIVPGKDEERHFFKSTIDLSLIGKLQQNSIQRKDKAS

Query:  TSQATPPTGPNVASPSQHTHFTGPSPSSKALAIAYRRLDQIRENVKTYWAYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEEEDSDEEEDEEKENS
        T     P+    A+ S            KAL     + +   +  + +WAY+KERD A+++   +      P FP FPQ +L   + + + E D++  N 
Subjt:  TSQATPPTGPNVASPSQHTHFTGPSPSSKALAIAYRRLDQIRENVKTYWAYAKERDEAIREFYLSIAPSIAPVFPNFPQSLLPQEEEDSDEEEDEEKENS

Query:  SDE
        + E
Subjt:  SDE

W9RBS1 Uncharacterized protein; e value: 1.2e-27; % identity: 28.8
Query:  FAKRPRTRSMDASPAVPPTISPAKPKGKSPKAASPRNPFPEVFRDVNFQERMEIMKKRDFLNEKGF---SNRAGALPEFVSRIISQYKWQDFCAHPQEAV
        FAKRP + S    PA+    + A     S +  S    F +   +  ++E    +  R+ + EKGF    +     P F+S +I    WQ FC HP + +
Subjt:  FAKRPRTRSMDASPAVPPTISPAKPKGKSPKAASPRNPFPEVFRDVNFQERMEIMKKRDFLNEKGF---SNRAGALPEFVSRIISQYKWQDFCAHPQEAV

Query:  VPLVREFYAGLREESISMEVVRGQMVGFSSVDINRVYRIKAPLNPRGND----VIRNPSAKQMKEALKLVVNKGVQWKESQTKVKSLVPSDLKPKSAVWL
        VPLV+EFYA L+ +  +   V    + F+S  IN V  I     P  +D    +I +   +Q+KE LK +   G QW  S     +    +L+P + VW 
Subjt:  VPLVREFYAGLREESISMEVVRGQMVGFSSVDINRVYRIKAPLNPRGND----VIRNPSAKQMKEALKLVVNKGVQWKESQTKVKSLVPSDLKPKSAVWL

Query:  HFIKNRLMPTTHDNTISVDRVMLLYCLMNGLEINVGSIIRDDILACGRKLASKLFFGSLITQLCQRVKIVPGKDEERHFFKSTIDLSLIGKLQQNSIQRK
        HF+ +RL+ +TH  TIS +R +LLY ++ G  INVG +I D I AC  K    L+F SLI++LC +  +     E R      +DL  I ++     ++ 
Subjt:  HFIKNRLMPTTHDNTISVDRVMLLYCLMNGLEINVGSIIRDDILACGRKLASKLFFGSLITQLCQRVKIVPGKDEERHFFKSTIDLSLIGKLQQNSIQRK

Query:  DKA-STSQATPPTGPNVASPSQHTHFTGPSPSSKALAI-------AYRRLDQIRENVKTYWAYAKERDEAIREFY
        +K     +   P+ P+ +     +         K +++        +  L Q +E +  +W Y+++RD A+++ +
Subjt:  DKA-STSQATPPTGPNVASPSQHTHFTGPSPSSKALAI-------AYRRLDQIRENVKTYWAYAKERDEAIREFY

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCAAGGCACACGAAAAACGAGACCCACGGGATTCTCATCGGCGGTCGTGAACCAAGCATCCAACTCTCCAACTCCATCTTCTTCGGCAATGCCGGCTAGTTCGAGGGA
GATGCCGAGTTCGTCTACGCCAAGGAGGTTCACGCGCGCCGCCGCCGTCCATCAAACCCAAAAACCCGCCGCTCAACAGTTCAAGAAACGTTCGCGGGAGTGGTTTGCGA
TGATCCGAGAGATGGGTGCTCAGAGACGTGTTGTCCTTGAAGAAGAAGGGAATCGACAAGATGAAAAAGAAGTCGCCAAGGCAGCTGAAAGCTCTCGACAAGGAGAAGCT
TCAATGGATAAGGTTTACGAACCTTCAACTAACCCTTCTTTTTCTTGCAGGACCAAACCCGTTGTTACTTACAGTGCAAGAAAGAGGAGCCCGAAGAAGGTTGTGTCTGA
AAGGCCGCTAAAAATTGAGCCCCTCAAAATCACAAGGATGCCTCCAGAGATATTCGAAGGAATAATTCGCCAAGCAGTGGCAAAGGCTCTTGCGATTGCAGAAGGGTACA
AGGCTGAACAGGATGCTTTGAAAGAGATTGAAATTGAGAGAGAGATGGAAAATAGGAAAATGGTTGAGGAAGACGAGCTTGCAAAAGAAAGAGATCGTAAAGAGGAGAAA
AGAAGGAGAGAAGAAGAGCCAGAGCCCGAGAGGGCCTTAGAAGCTGAGGAAGAAAGAAAATATGAGGAAAACCTCAGGAGGGCAGCTATGGATTTGCAGCTCCTTGAGGA
AGAGAAAAAGAGAAGAGAAGAAATAAAAGAAGATGAAAAAAGAAGGAAGGAAGCTGAAGACTTCCTTGCAGCCTTTGAGCCACTCCACAAGGCTCAAAGTGAGGCTGAAG
CACTGCAAGGAAGGGTAGAAGAAAAGGCCCAACAGGGGCCAACTGAAGAAAATTTAGAAAAAGAAAAAGAAAGAGAAGTAGAGGAAGAAGGACAGACTGTGACCGCATCT
GGGCCGCAATCTGAAGAAGGCCTAACCGAGGCCACCGTTGATCATCCAGCTGAAGAGAGGGTGGAAGAAGAAAAGGAAGACGAGGAGGCCAAGACCTCTAGTGATTCTGA
TTCTGAAACAGAATCTGATTCAGAGATAAGGGAGCTAGATGATGACCAAGTCCCTATCTCTGCAGCATTGAGAAGAAAGAGGAAGAGAGAGATAAAGGCCGAGAGGAGGA
CAAAGGACAAGAATGACCCGATATTTGCCAAGAGGCCGAGGACAAGGTCCATGGACGCCTCTCCTGCAGTTCCTCCTACTATCTCACCCGCCAAGCCAAAGGGAAAATCA
CCCAAGGCTGCATCTCCCAGAAATCCGTTCCCTGAGGTATTTAGAGATGTTAATTTTCAGGAACGAATGGAGATCATGAAGAAAAGAGATTTCCTCAATGAAAAGGGATT
CTCTAATAGAGCAGGAGCACTGCCAGAGTTCGTGAGCAGGATCATATCTCAATACAAGTGGCAGGACTTCTGTGCTCACCCTCAGGAGGCTGTTGTGCCTTTAGTTCGAG
AGTTTTACGCTGGCCTGAGGGAGGAGAGTATCAGCATGGAGGTTGTGAGGGGGCAGATGGTCGGTTTCTCCTCAGTCGACATTAATAGGGTGTACAGGATCAAGGCACCC
TTGAATCCGAGAGGGAATGATGTGATAAGGAACCCTTCGGCCAAGCAGATGAAGGAAGCATTGAAGCTTGTGGTCAACAAGGGGGTTCAATGGAAAGAATCGCAAACGAA
AGTGAAGTCTTTAGTGCCAAGCGACCTAAAGCCAAAATCGGCAGTTTGGCTTCACTTCATCAAGAACCGTTTGATGCCAACCACCCACGACAACACGATTTCAGTGGATA
GAGTGATGCTACTCTATTGCCTTATGAACGGGTTGGAGATTAATGTAGGGAGCATTATTAGGGACGATATTTTAGCCTGTGGGAGAAAGCTAGCAAGCAAGCTTTTCTTT
GGATCACTCATCACCCAGCTCTGCCAAAGGGTGAAGATCGTTCCAGGCAAGGACGAGGAGCGTCACTTCTTCAAGTCGACCATCGACCTGTCCTTGATTGGAAAGCTCCA
GCAGAATAGCATCCAGAGGAAAGACAAAGCCTCGACATCTCAGGCTACTCCACCTACAGGGCCGAATGTAGCTTCTCCATCCCAGCACACTCATTTCACAGGGCCTTCAC
CATCATCGAAAGCCCTAGCTATTGCCTACCGCCGGCTAGATCAAATCAGGGAGAACGTGAAGACGTATTGGGCATATGCAAAGGAGCGGGATGAAGCCATTAGAGAGTTC
TATCTCTCGATTGCCCCAAGTATTGCTCCGGTCTTTCCAAATTTCCCTCAGTCGCTGCTGCCCCAAGAAGAAGAGGATTCTGATGAAGAGGAAGATGAAGAGAAAGAGAA
TTCCTCGGACGAGGAATAG
mRNA sequence
ATGCAAGGCACACGAAAAACGAGACCCACGGGATTCTCATCGGCGGTCGTGAACCAAGCATCCAACTCTCCAACTCCATCTTCTTCGGCAATGCCGGCTAGTTCGAGGGA
GATGCCGAGTTCGTCTACGCCAAGGAGGTTCACGCGCGCCGCCGCCGTCCATCAAACCCAAAAACCCGCCGCTCAACAGTTCAAGAAACGTTCGCGGGAGTGGTTTGCGA
TGATCCGAGAGATGGGTGCTCAGAGACGTGTTGTCCTTGAAGAAGAAGGGAATCGACAAGATGAAAAAGAAGTCGCCAAGGCAGCTGAAAGCTCTCGACAAGGAGAAGCT
TCAATGGATAAGGTTTACGAACCTTCAACTAACCCTTCTTTTTCTTGCAGGACCAAACCCGTTGTTACTTACAGTGCAAGAAAGAGGAGCCCGAAGAAGGTTGTGTCTGA
AAGGCCGCTAAAAATTGAGCCCCTCAAAATCACAAGGATGCCTCCAGAGATATTCGAAGGAATAATTCGCCAAGCAGTGGCAAAGGCTCTTGCGATTGCAGAAGGGTACA
AGGCTGAACAGGATGCTTTGAAAGAGATTGAAATTGAGAGAGAGATGGAAAATAGGAAAATGGTTGAGGAAGACGAGCTTGCAAAAGAAAGAGATCGTAAAGAGGAGAAA
AGAAGGAGAGAAGAAGAGCCAGAGCCCGAGAGGGCCTTAGAAGCTGAGGAAGAAAGAAAATATGAGGAAAACCTCAGGAGGGCAGCTATGGATTTGCAGCTCCTTGAGGA
AGAGAAAAAGAGAAGAGAAGAAATAAAAGAAGATGAAAAAAGAAGGAAGGAAGCTGAAGACTTCCTTGCAGCCTTTGAGCCACTCCACAAGGCTCAAAGTGAGGCTGAAG
CACTGCAAGGAAGGGTAGAAGAAAAGGCCCAACAGGGGCCAACTGAAGAAAATTTAGAAAAAGAAAAAGAAAGAGAAGTAGAGGAAGAAGGACAGACTGTGACCGCATCT
GGGCCGCAATCTGAAGAAGGCCTAACCGAGGCCACCGTTGATCATCCAGCTGAAGAGAGGGTGGAAGAAGAAAAGGAAGACGAGGAGGCCAAGACCTCTAGTGATTCTGA
TTCTGAAACAGAATCTGATTCAGAGATAAGGGAGCTAGATGATGACCAAGTCCCTATCTCTGCAGCATTGAGAAGAAAGAGGAAGAGAGAGATAAAGGCCGAGAGGAGGA
CAAAGGACAAGAATGACCCGATATTTGCCAAGAGGCCGAGGACAAGGTCCATGGACGCCTCTCCTGCAGTTCCTCCTACTATCTCACCCGCCAAGCCAAAGGGAAAATCA
CCCAAGGCTGCATCTCCCAGAAATCCGTTCCCTGAGGTATTTAGAGATGTTAATTTTCAGGAACGAATGGAGATCATGAAGAAAAGAGATTTCCTCAATGAAAAGGGATT
CTCTAATAGAGCAGGAGCACTGCCAGAGTTCGTGAGCAGGATCATATCTCAATACAAGTGGCAGGACTTCTGTGCTCACCCTCAGGAGGCTGTTGTGCCTTTAGTTCGAG
AGTTTTACGCTGGCCTGAGGGAGGAGAGTATCAGCATGGAGGTTGTGAGGGGGCAGATGGTCGGTTTCTCCTCAGTCGACATTAATAGGGTGTACAGGATCAAGGCACCC
TTGAATCCGAGAGGGAATGATGTGATAAGGAACCCTTCGGCCAAGCAGATGAAGGAAGCATTGAAGCTTGTGGTCAACAAGGGGGTTCAATGGAAAGAATCGCAAACGAA
AGTGAAGTCTTTAGTGCCAAGCGACCTAAAGCCAAAATCGGCAGTTTGGCTTCACTTCATCAAGAACCGTTTGATGCCAACCACCCACGACAACACGATTTCAGTGGATA
GAGTGATGCTACTCTATTGCCTTATGAACGGGTTGGAGATTAATGTAGGGAGCATTATTAGGGACGATATTTTAGCCTGTGGGAGAAAGCTAGCAAGCAAGCTTTTCTTT
GGATCACTCATCACCCAGCTCTGCCAAAGGGTGAAGATCGTTCCAGGCAAGGACGAGGAGCGTCACTTCTTCAAGTCGACCATCGACCTGTCCTTGATTGGAAAGCTCCA
GCAGAATAGCATCCAGAGGAAAGACAAAGCCTCGACATCTCAGGCTACTCCACCTACAGGGCCGAATGTAGCTTCTCCATCCCAGCACACTCATTTCACAGGGCCTTCAC
CATCATCGAAAGCCCTAGCTATTGCCTACCGCCGGCTAGATCAAATCAGGGAGAACGTGAAGACGTATTGGGCATATGCAAAGGAGCGGGATGAAGCCATTAGAGAGTTC
TATCTCTCGATTGCCCCAAGTATTGCTCCGGTCTTTCCAAATTTCCCTCAGTCGCTGCTGCCCCAAGAAGAAGAGGATTCTGATGAAGAGGAAGATGAAGAGAAAGAGAA
TTCCTCGGACGAGGAATAG
Protein sequence
MQGTRKTRPTGFSSAVVNQASNSPTPSSSAMPASSREMPSSSTPRRFTRAAAVHQTQKPAAQQFKKRSREWFAMIREMGAQRRVVLEEEGNRQDEKEVAKAAESSRQGEA
SMDKVYEPSTNPSFSCRTKPVVTYSARKRSPKKVVSERPLKIEPLKITRMPPEIFEGIIRQAVAKALAIAEGYKAEQDALKEIEIEREMENRKMVEEDELAKERDRKEEK
RRREEEPEPERALEAEEERKYEENLRRAAMDLQLLEEEKKRREEIKEDEKRRKEAEDFLAAFEPLHKAQSEAEALQGRVEEKAQQGPTEENLEKEKEREVEEEGQTVTAS
GPQSEEGLTEATVDHPAEERVEEEKEDEEAKTSSDSDSETESDSEIRELDDDQVPISAALRRKRKREIKAERRTKDKNDPIFAKRPRTRSMDASPAVPPTISPAKPKGKS
PKAASPRNPFPEVFRDVNFQERMEIMKKRDFLNEKGFSNRAGALPEFVSRIISQYKWQDFCAHPQEAVVPLVREFYAGLREESISMEVVRGQMVGFSSVDINRVYRIKAP
LNPRGNDVIRNPSAKQMKEALKLVVNKGVQWKESQTKVKSLVPSDLKPKSAVWLHFIKNRLMPTTHDNTISVDRVMLLYCLMNGLEINVGSIIRDDILACGRKLASKLFF
GSLITQLCQRVKIVPGKDEERHFFKSTIDLSLIGKLQQNSIQRKDKASTSQATPPTGPNVASPSQHTHFTGPSPSSKALAIAYRRLDQIRENVKTYWAYAKERDEAIREF
YLSIAPSIAPVFPNFPQSLLPQEEEDSDEEEDEEKENSSDEE