; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg030999 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg030999
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUnknown protein
Genome locationscaffold10:23142716..23147438
RNA-Seq ExpressionSpg030999
SyntenySpg030999
Gene Ontology termsNA
InterPro domainsIPR045045 - Small heat shock protein RTM2-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB53755.1 hypothetical protein L484_022412 [Morus notabilis]1.0e-3035.81Show/hide
Query:  PHFLRTGIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLS
        P F+   I  HGW +FC  P      +VREFYAN+   +   V V+ V+V ++  AIN+++ L+      Y + A   ++EQL   + EV IEGA WQ+S
Subjt:  PHFLRTGIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLS

Query:  KTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGC-LKKKVGKLFFPNTITMLYRRAGVLVDEGDVILF
          G  T     LKR A  W  F+  R +P+TH  TV+++RVLL ++IL  +S+++ ++ + EI  C   +K G L+FP+ IT L+ +A V   + + I+ 
Subjt:  KTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGC-LKKKVGKLFFPNTITMLYRRAGVLVDEGDVILF

Query:  DKGIIDTPNLARLQR
        + G I T +++R+ +
Subjt:  DKGIIDTPNLARLQR

KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus]2.5e-2928.34Show/hide
Query:  RFINNFARAKYAELLKRDFLFEKGF------SGNLPHFLRTGIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQN
        +F N+ A+A++     R+  FE GF       G     +   +    W +F   P +VNA +V+EFYANI K +   + VRG ++ ++  AIN  ++LQ 
Subjt:  RFINNFARAKYAELLKRDFLFEKGF------SGNLPHFLRTGIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQN

Query:  F--PHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEI
            HA + E A    + +    + ++  E  +W   +T + +     L+  A  W  F++ +++PT+H++TVS  R+LL  +++ S  IDVG++IV ++
Subjt:  F--PHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEI

Query:  SGCLKKKVGKLFFPNTITMLYRRAGVLVDEGDVILFDKGIIDTPNLARLQRTQEARQGGLIY-------GINTVLEQLAL-SASRQEFAERQAL-----T
          CL KK   L FPN IT L R+  V  +  D IL     I    L  L   +  +    ++         N  +  LAL  A  Q  A+  AL      
Subjt:  SGCLKKKVGKLFFPNTITMLYRRAGVLVDEGDVILFDKGIIDTPNLARLQRTQEARQGGLIY-------GINTVLEQLAL-SASRQEFAERQAL-----T

Query:  FWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPW-----------IPPPPVEREEEDDEEQET
        F+ YV++RD  ++   QE         P FP+++L  +            P PP      D    ET
Subjt:  FWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPW-----------IPPPPVEREEEDDEEQET

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]5.3e-3238.28Show/hide
Query:  RFINNFARAKYA-ELLKRDFLFEKGF-------SGNLPHFLRTGIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL
        +F    A  +Y   +  R    EKGF        G LP F+   I  H W++FC+ PE     +VREFYAN+       V VRGV+V WS  AINA++ L
Subjt:  RFINNFARAKYA-ELLKRDFLFEKGF-------SGNLPHFLRTGIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL

Query:  QNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEI
         + P   ++E     +   L   +  V + GA+W +S  G  T   + L   A  W  F++  +LPTTH  TVS++R+LL  ++L   SI+VG+MI +EI
Subjt:  QNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEI

Query:  SGCLKKKVGKLFFPNTITMLYR--RAGVLVDEGDVILFDKGIIDTPNLARLQRTQE
          C  +K G LFFP+ IT L R  RA  LV+E    L + G ID   +AR+  TQE
Subjt:  SGCLKKKVGKLFFPNTITMLYR--RAGVLVDEGDVILFDKGIIDTPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]8.8e-4334.33Show/hide
Query:  RFINNFARAKYA-ELLKRDFLFEKGF-------SGNLPHFLRTGIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL
        +F    A  +Y   +  R    EKGF        G LP F+   I  H W++FC+ PE     +VREFYAN+   +   V VRGV+V WS  AINA++ L
Subjt:  RFINNFARAKYA-ELLKRDFLFEKGF-------SGNLPHFLRTGIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL

Query:  QNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEI
         + P   ++E     + + L   +  V   GA+W +S  G  T   + L   A  W  F++ R+LPTTH  TVS++R+LL  ++L   SI+VG+MI +EI
Subjt:  QNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEI

Query:  SGCLKKKVGKLFFPNTITMLYR--RAGVLVDEGDVILFDKGIIDTPNLARLQR---TQEARQ---------------GGLIYGINTVLEQLALSASRQ--
          C  +K G LFFP+ IT L R  RA  LV+E    L + G ID   +AR+ +   T+  +Q               G ++  +  + ++L+    +Q  
Subjt:  SGCLKKKVGKLFFPNTITMLYR--RAGVLVDEGDVILFDKGIIDTPNLARLQR---TQEARQ---------------GGLIYGINTVLEQLALSASRQ--

Query:  -----EFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE
             +   +Q   FW Y + RD  LKKALQ NF++P P  PAFP+++L         E E E D++
Subjt:  -----EFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]1.1e-3737.07Show/hide
Query:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQR
        +VREFYAN+   +   + VRGV+V WS  AINA++ L + P   ++E     +  +L   +  V   GA+W +S  G  T   + L   A  W  F++ R
Subjt:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQR

Query:  MLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCLKKKVGKLFFPNTITMLYRRAGVLVDEGDVILFDKGIIDTPNLARL------QRTQE---
        +LPTTH   VS++R+LL  ++L   SI+VG+MI +EI  C  +K G LFFP+ IT L R A  LV+E    L + G ID   +AR+      + TQ+   
Subjt:  MLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCLKKKVGKLFFPNTITMLYRRAGVLVDEGDVILFDKGIIDTPNLARL------QRTQE---

Query:  --------ARQGGLIYGINTVLEQLALSASRQEFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE
                +R  G +      LEQ     S+QE   +Q   FW Y + RD  LKKALQ NF++P P  PAFP+++L         E E E D++
Subjt:  --------ARQGGLIYGINTVLEQLALSASRQEFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)2.6e-3238.28Show/hide
Query:  RFINNFARAKYA-ELLKRDFLFEKGF-------SGNLPHFLRTGIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL
        +F    A  +Y   +  R    EKGF        G LP F+   I  H W++FC+ PE     +VREFYAN+       V VRGV+V WS  AINA++ L
Subjt:  RFINNFARAKYA-ELLKRDFLFEKGF-------SGNLPHFLRTGIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL

Query:  QNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEI
         + P   ++E     +   L   +  V + GA+W +S  G  T   + L   A  W  F++  +LPTTH  TVS++R+LL  ++L   SI+VG+MI +EI
Subjt:  QNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEI

Query:  SGCLKKKVGKLFFPNTITMLYR--RAGVLVDEGDVILFDKGIIDTPNLARLQRTQE
          C  +K G LFFP+ IT L R  RA  LV+E    L + G ID   +AR+  TQE
Subjt:  SGCLKKKVGKLFFPNTITMLYR--RAGVLVDEGDVILFDKGIIDTPNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment)4.2e-4334.33Show/hide
Query:  RFINNFARAKYA-ELLKRDFLFEKGF-------SGNLPHFLRTGIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL
        +F    A  +Y   +  R    EKGF        G LP F+   I  H W++FC+ PE     +VREFYAN+   +   V VRGV+V WS  AINA++ L
Subjt:  RFINNFARAKYA-ELLKRDFLFEKGF-------SGNLPHFLRTGIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL

Query:  QNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEI
         + P   ++E     + + L   +  V   GA+W +S  G  T   + L   A  W  F++ R+LPTTH  TVS++R+LL  ++L   SI+VG+MI +EI
Subjt:  QNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEI

Query:  SGCLKKKVGKLFFPNTITMLYR--RAGVLVDEGDVILFDKGIIDTPNLARLQR---TQEARQ---------------GGLIYGINTVLEQLALSASRQ--
          C  +K G LFFP+ IT L R  RA  LV+E    L + G ID   +AR+ +   T+  +Q               G ++  +  + ++L+    +Q  
Subjt:  SGCLKKKVGKLFFPNTITMLYR--RAGVLVDEGDVILFDKGIIDTPNLARLQR---TQEARQ---------------GGLIYGINTVLEQLALSASRQ--

Query:  -----EFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE
             +   +Q   FW Y + RD  LKKALQ NF++P P  PAFP+++L         E E E D++
Subjt:  -----EFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE

A0A2P5DXM3 Uncharacterized protein5.4e-3837.07Show/hide
Query:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQR
        +VREFYAN+   +   + VRGV+V WS  AINA++ L + P   ++E     +  +L   +  V   GA+W +S  G  T   + L   A  W  F++ R
Subjt:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQR

Query:  MLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCLKKKVGKLFFPNTITMLYRRAGVLVDEGDVILFDKGIIDTPNLARL------QRTQE---
        +LPTTH   VS++R+LL  ++L   SI+VG+MI +EI  C  +K G LFFP+ IT L R A  LV+E    L + G ID   +AR+      + TQ+   
Subjt:  MLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCLKKKVGKLFFPNTITMLYRRAGVLVDEGDVILFDKGIIDTPNLARL------QRTQE---

Query:  --------ARQGGLIYGINTVLEQLALSASRQEFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE
                +R  G +      LEQ     S+QE   +Q   FW Y + RD  LKKALQ NF++P P  PAFP+++L         E E E D++
Subjt:  --------ARQGGLIYGINTVLEQLALSASRQEFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE

A0A6A3BU96 Uncharacterized protein1.2e-2928.34Show/hide
Query:  RFINNFARAKYAELLKRDFLFEKGF------SGNLPHFLRTGIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQN
        +F N+ A+A++     R+  FE GF       G     +   +    W +F   P +VNA +V+EFYANI K +   + VRG ++ ++  AIN  ++LQ 
Subjt:  RFINNFARAKYAELLKRDFLFEKGF------SGNLPHFLRTGIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQN

Query:  F--PHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEI
            HA + E A    + +    + ++  E  +W   +T + +     L+  A  W  F++ +++PT+H++TVS  R+LL  +++ S  IDVG++IV ++
Subjt:  F--PHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEI

Query:  SGCLKKKVGKLFFPNTITMLYRRAGVLVDEGDVILFDKGIIDTPNLARLQRTQEARQGGLIY-------GINTVLEQLAL-SASRQEFAERQAL-----T
          CL KK   L FPN IT L R+  V  +  D IL     I    L  L   +  +    ++         N  +  LAL  A  Q  A+  AL      
Subjt:  SGCLKKKVGKLFFPNTITMLYRRAGVLVDEGDVILFDKGIIDTPNLARLQRTQEARQGGLIY-------GINTVLEQLAL-SASRQEFAERQAL-----T

Query:  FWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPW-----------IPPPPVEREEEDDEEQET
        F+ YV++RD  ++   QE         P FP+++L  +            P PP      D    ET
Subjt:  FWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPW-----------IPPPPVEREEEDDEEQET

W9QTD9 Uncharacterized protein4.9e-3135.81Show/hide
Query:  PHFLRTGIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLS
        P F+   I  HGW +FC  P      +VREFYAN+   +   V V+ V+V ++  AIN+++ L+      Y + A   ++EQL   + EV IEGA WQ+S
Subjt:  PHFLRTGIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLS

Query:  KTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGC-LKKKVGKLFFPNTITMLYRRAGVLVDEGDVILF
          G  T     LKR A  W  F+  R +P+TH  TV+++RVLL ++IL  +S+++ ++ + EI  C   +K G L+FP+ IT L+ +A V   + + I+ 
Subjt:  KTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGC-LKKKVGKLFFPNTITMLYRRAGVLVDEGDVILF

Query:  DKGIIDTPNLARLQR
        + G I T +++R+ +
Subjt:  DKGIIDTPNLARLQR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCAAAACGAGAGCTAGAAAAGAAAGGGAGGATGAGGACGAAGAGGTACTTGTTACCCCTGTAGCACAGAAAGCGAAAACGAAAAAGAAGAAGACGCCAGAGGAGAA
AGAAGCGAAAAGGAGGAGAAAGCAACAGAGGGCTGAGGAACAAGAAAAGGCAACAGAGGTTGCGACTGTTACTGACACAGTAGAAGAAGAAAGCCCGAAACAACCAGAGG
AAAATACCGAGGAAAATACCGAGCAGAGGGTCGCGGATACAGAAGAAGAACGAACAGAAGAGGCAGAAGATGTTCAGGTAACGGATAATGAGCCAGTGCAGGAGGCTCAA
GTGGAGGTGATCATGTCGGAGGTACCAAAGCGTCGTCGCGTTAAGAGGAAAGCAGGCCGCGTTAGGGTTGTCCGAACTGATACTCCTTCGCCTCCAACCACGGATTCTGA
AAGAGAAAATGCAGAGAGAGAGGAGCGTGAGAAGAAGGAAGCCGAAGAAAGAGCAAGAGAAGAGGCAGAGAAAAAGGCTGAGGAAGAGCGGTTGCTCAAGCGAAGGGCGG
AAAAGGGCAAAAGTGTTGCTGAAGCAGCGGAGGAACCTGATGAAATAGAAGAACATGGTCGCTTCATCAACAATTTTGCCAGAGCAAAATACGCTGAGCTGCTGAAAAGA
GACTTCCTGTTTGAAAAAGGATTTAGCGGTAATCTTCCACATTTTCTGAGGACCGGCATTGCAGACCACGGCTGGGAGCGGTTTTGTTCAAAGCCTGAGGCTGTAAACGC
ACAGGTGGTGCGTGAATTTTATGCTAATATTGACAAGGAAGATGGTTTCCAGGTGATTGTTCGAGGAGTCGAGGTAGACTGGAGTCCTAGTGCTATCAATGCACTGTATA
ACCTTCAGAATTTCCCCCATGCAGCTTATAATGAGATGGCTGTAGCGCCATCTAATGAGCAGTTAAGTGATGCTGTGCGGGAGGTGGGTATTGAAGGGGCACAGTGGCAG
CTGTCCAAGACAGGGAAAAGGACATTTCAGTCAGCTTATCTGAAGAGGGAAGCGAACACGTGGATGGGATTTATCAGACAGAGGATGCTTCCAACGACTCATGACTCGAC
GGTCTCGAGGGAACGGGTTCTTCTGGCTTTCGCGATTTTGCGGTCTCTTAGCATTGATGTAGGGAAGATGATTGTTAATGAAATTTCTGGTTGTTTGAAGAAGAAGGTGG
GGAAACTGTTCTTTCCGAACACAATCACGATGCTTTACAGAAGAGCAGGGGTTCTAGTGGATGAGGGAGATGTTATCCTGTTTGACAAGGGAATTATAGATACGCCTAAC
TTGGCGCGGCTTCAGCGTACGCAGGAGGCACGTCAAGGTGGGCTTATCTACGGCATCAACACGGTTTTAGAACAACTGGCACTTTCGGCCAGCAGGCAAGAGTTTGCCGA
GAGGCAAGCTTTAACCTTCTGGAACTATGTTAGAAATCGTGATGCCAATCTGAAGAAGGCGCTGCAAGAAAATTTTTCCAAGCCATATCCAGCCCTTCCAGCATTCCCTG
AGGATCTGTTGAACCCTTGGATTCCACCCCCACCTGTTGAAAGAGAAGAGGAGGATGATGAAGAGCAGGAAACCTTTTGCTTGAGCATTTTCTCTGGCCTGGTCGTTGCT
ACGGCAAAGAAAATTCTGGAGCAGAGCTTGGTTTTGCAGAGTGCTCAGAATCTGTTGCTGGGCGACTTGAGGGAGCAAATTCTGTGCTGCAGCAAAACTGGGAGTAGAAC
TGCCACGTCACAGCTCAGGGTGAAGATAGTTCCGGGTAAGGATGGGGAGCGTCATTTCTTCAAGCCGACCATTGACCTATCCTTGATTGGGAAGCTTCAGCAGAATAGCC
TCCAAAGGAAAGACAAAGCCTCCACATCTCAGGCCACTCCACCATCAGGGTCGAACATAGCTTCTTCGTCCCAGAACAGTCCTTTTTCAGGGCCCTCACACTCATCTGAA
GCCCTAGCCATTGCCTACCGTCAGCTTGATCAAATCAGGGACAACCTGAGGACTTATTGGGCATATGCAAAGGAGAGGGATGAAACCATTAGAGAGTTCAATCTCTCTAT
CGCCCCGAGTATTGCCCCGGTTTTTCCCAATTTCCCTCGATCGCTGCTGCCTCAAGAAGACAAGGATTCTGATGAAGATGAAGAAGAAAATGATGATGAAGATGAAGAGA
AAGAGAGTTCCTCAGACGAGGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCCAAAACGAGAGCTAGAAAAGAAAGGGAGGATGAGGACGAAGAGGTACTTGTTACCCCTGTAGCACAGAAAGCGAAAACGAAAAAGAAGAAGACGCCAGAGGAGAA
AGAAGCGAAAAGGAGGAGAAAGCAACAGAGGGCTGAGGAACAAGAAAAGGCAACAGAGGTTGCGACTGTTACTGACACAGTAGAAGAAGAAAGCCCGAAACAACCAGAGG
AAAATACCGAGGAAAATACCGAGCAGAGGGTCGCGGATACAGAAGAAGAACGAACAGAAGAGGCAGAAGATGTTCAGGTAACGGATAATGAGCCAGTGCAGGAGGCTCAA
GTGGAGGTGATCATGTCGGAGGTACCAAAGCGTCGTCGCGTTAAGAGGAAAGCAGGCCGCGTTAGGGTTGTCCGAACTGATACTCCTTCGCCTCCAACCACGGATTCTGA
AAGAGAAAATGCAGAGAGAGAGGAGCGTGAGAAGAAGGAAGCCGAAGAAAGAGCAAGAGAAGAGGCAGAGAAAAAGGCTGAGGAAGAGCGGTTGCTCAAGCGAAGGGCGG
AAAAGGGCAAAAGTGTTGCTGAAGCAGCGGAGGAACCTGATGAAATAGAAGAACATGGTCGCTTCATCAACAATTTTGCCAGAGCAAAATACGCTGAGCTGCTGAAAAGA
GACTTCCTGTTTGAAAAAGGATTTAGCGGTAATCTTCCACATTTTCTGAGGACCGGCATTGCAGACCACGGCTGGGAGCGGTTTTGTTCAAAGCCTGAGGCTGTAAACGC
ACAGGTGGTGCGTGAATTTTATGCTAATATTGACAAGGAAGATGGTTTCCAGGTGATTGTTCGAGGAGTCGAGGTAGACTGGAGTCCTAGTGCTATCAATGCACTGTATA
ACCTTCAGAATTTCCCCCATGCAGCTTATAATGAGATGGCTGTAGCGCCATCTAATGAGCAGTTAAGTGATGCTGTGCGGGAGGTGGGTATTGAAGGGGCACAGTGGCAG
CTGTCCAAGACAGGGAAAAGGACATTTCAGTCAGCTTATCTGAAGAGGGAAGCGAACACGTGGATGGGATTTATCAGACAGAGGATGCTTCCAACGACTCATGACTCGAC
GGTCTCGAGGGAACGGGTTCTTCTGGCTTTCGCGATTTTGCGGTCTCTTAGCATTGATGTAGGGAAGATGATTGTTAATGAAATTTCTGGTTGTTTGAAGAAGAAGGTGG
GGAAACTGTTCTTTCCGAACACAATCACGATGCTTTACAGAAGAGCAGGGGTTCTAGTGGATGAGGGAGATGTTATCCTGTTTGACAAGGGAATTATAGATACGCCTAAC
TTGGCGCGGCTTCAGCGTACGCAGGAGGCACGTCAAGGTGGGCTTATCTACGGCATCAACACGGTTTTAGAACAACTGGCACTTTCGGCCAGCAGGCAAGAGTTTGCCGA
GAGGCAAGCTTTAACCTTCTGGAACTATGTTAGAAATCGTGATGCCAATCTGAAGAAGGCGCTGCAAGAAAATTTTTCCAAGCCATATCCAGCCCTTCCAGCATTCCCTG
AGGATCTGTTGAACCCTTGGATTCCACCCCCACCTGTTGAAAGAGAAGAGGAGGATGATGAAGAGCAGGAAACCTTTTGCTTGAGCATTTTCTCTGGCCTGGTCGTTGCT
ACGGCAAAGAAAATTCTGGAGCAGAGCTTGGTTTTGCAGAGTGCTCAGAATCTGTTGCTGGGCGACTTGAGGGAGCAAATTCTGTGCTGCAGCAAAACTGGGAGTAGAAC
TGCCACGTCACAGCTCAGGGTGAAGATAGTTCCGGGTAAGGATGGGGAGCGTCATTTCTTCAAGCCGACCATTGACCTATCCTTGATTGGGAAGCTTCAGCAGAATAGCC
TCCAAAGGAAAGACAAAGCCTCCACATCTCAGGCCACTCCACCATCAGGGTCGAACATAGCTTCTTCGTCCCAGAACAGTCCTTTTTCAGGGCCCTCACACTCATCTGAA
GCCCTAGCCATTGCCTACCGTCAGCTTGATCAAATCAGGGACAACCTGAGGACTTATTGGGCATATGCAAAGGAGAGGGATGAAACCATTAGAGAGTTCAATCTCTCTAT
CGCCCCGAGTATTGCCCCGGTTTTTCCCAATTTCCCTCGATCGCTGCTGCCTCAAGAAGACAAGGATTCTGATGAAGATGAAGAAGAAAATGATGATGAAGATGAAGAGA
AAGAGAGTTCCTCAGACGAGGACTAG
Protein sequenceShow/hide protein sequence
MPKTRARKEREDEDEEVLVTPVAQKAKTKKKKTPEEKEAKRRRKQQRAEEQEKATEVATVTDTVEEESPKQPEENTEENTEQRVADTEEERTEEAEDVQVTDNEPVQEAQ
VEVIMSEVPKRRRVKRKAGRVRVVRTDTPSPPTTDSERENAEREEREKKEAEERAREEAEKKAEEERLLKRRAEKGKSVAEAAEEPDEIEEHGRFINNFARAKYAELLKR
DFLFEKGFSGNLPHFLRTGIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQ
LSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCLKKKVGKLFFPNTITMLYRRAGVLVDEGDVILFDKGIIDTPN
LARLQRTQEARQGGLIYGINTVLEQLALSASRQEFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEEQETFCLSIFSGLVVA
TAKKILEQSLVLQSAQNLLLGDLREQILCCSKTGSRTATSQLRVKIVPGKDGERHFFKPTIDLSLIGKLQQNSLQRKDKASTSQATPPSGSNIASSSQNSPFSGPSHSSE
ALAIAYRQLDQIRDNLRTYWAYAKERDETIREFNLSIAPSIAPVFPNFPRSLLPQEDKDSDEDEEENDDEDEEKESSSDED