; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg024556 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg024556
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUnknown protein
Genome locationscaffold12:16402307..16413221
RNA-Seq ExpressionSpg024556
SyntenySpg024556
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8680640.1 hypothetical protein F3Y22_tig00111372pilonHSYRG00020 [Hibiscus syriacus]6.4e-3130.49Show/hide
Query:  ARAKYQEMLKRDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNVVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFDLQDF--PH
        A+A++Q   K+   FE GF       G   P  +   +  L W +F   P  VN++VV+EFYAN+    +  + VRG  + ++P A+   F LQD    H
Subjt:  ARAKYQEMLKRDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNVVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFDLQDF--PH

Query:  AAFNEMVVAPSNDQLSAAVREVGIEGAQWRLSQTRKRTFQPAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWR
        A F E   + + D++   + ++  E  +W   QT + +     L+  A  W  F+K +L+PT++++TVS  R+LL  +I  S  IDVG+II  ++ DC  
Subjt:  AAFNEMVVAPSNDQLSAAVREVGIEGAQWRLSQTRKRTFQPAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWR

Query:  KKVGKLFFPNTITMLCRRARVPESEDDMILPDKGIIDTPNLARL-------------QRTQEARQGGLVCGIQQIQELLQLHSSRMEFAERQFQTFWDYT
        KK   L FPN IT LCR+ +V E+  D ILP    I    L  L             +++    Q      +  ++E++    + +     + + F+ Y 
Subjt:  KKVGKLFFPNTITMLCRRARVPESEDDMILPDKGIIDTPNLARL-------------QRTQEARQGGLVCGIQQIQELLQLHSSRMEFAERQFQTFWDYT

Query:  KKRDV
        K RDV
Subjt:  KKRDV

KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus]2.2e-3128.61Show/hide
Query:  RFVNYLARAKYQEMLKRDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNVVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFDLQ
        +F N  A+A++Q    R+  FE GF       G   P  +   +  L W +F   P  VN+++V+EFYAN+    +  + VRG  + ++  A+N  F LQ
Subjt:  RFVNYLARAKYQEMLKRDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNVVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFDLQ

Query:  DF--PHAAFNEMVVAPSNDQLSAAVREVGIEGAQWRLSQTRKRTFQPAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
        +    HA F E      +++    + ++  E  +W   QT + +     L+  A  W  F+K +L+PT+H++TVS  R+LL  +++ S  IDVG+II  +
Subjt:  DF--PHAAFNEMVVAPSNDQLSAAVREVGIEGAQWRLSQTRKRTFQPAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  ILDCWRKKVGKLFFPNTITMLCRRARVPESEDDMILPDKGIIDTPNLARL-------------QRTQEARQGGLVCGIQQIQELLQLHSSRMEFAERQFQ
        + DC  KK   L FPN IT LCR+ +V E+  D ILP    I    L  L             +++    +      +  ++E +    +++       +
Subjt:  ILDCWRKKVGKLFFPNTITMLCRRARVPESEDDMILPDKGIIDTPNLARL-------------QRTQEARQGGLVCGIQQIQELLQLHSSRMEFAERQFQ

Query:  TFWDYTKKRDVALRVALQSNFSEPYPALPVFPDDLLNPWIPPAPMEGGEEEDGNEPGQED
         F+ Y K RDV +    Q          P FPD++L  +   A  E   E D  +P   D
Subjt:  TFWDYTKKRDVALRVALQSNFSEPYPALPVFPDDLLNPWIPPAPMEGGEEEDGNEPGQED

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]5.1e-3640Show/hide
Query:  IRFVNYLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNVVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFD
        ++F    A  +Y+  ++ R    E+GF         +LP F+   I    W QFCA PE     +VREFYANL D  E  V VRGV V WS EA+N +F 
Subjt:  IRFVNYLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNVVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFD

Query:  LQDFPHAAFNEMVVAPSNDQLSAAVREVGIEGAQWRLSQTRKRTFQPAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
        L D P    +E +   +   L   +  V + GA+W +S     T   + L   A  W  F+K  LLPTTH  TVS+DR+LL  ++L   SI+VG++I SE
Subjt:  LQDFPHAAFNEMVVAPSNDQLSAAVREVGIEGAQWRLSQTRKRTFQPAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  ILDCWRKKVGKLFFPNTITMLCRRARVPESEDDMILPDKGIIDTPNLARLQRTQE
        I  C  +K G LFFP+ IT LCR AR P   ++  L + G ID   +AR+  TQE
Subjt:  ILDCWRKKVGKLFFPNTITMLCRRARVPESEDDMILPDKGIIDTPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]1.1e-4636.12Show/hide
Query:  IRFVNYLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNVVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFD
        ++F    A  +Y+  ++ R    E+GF         +LP F+   I    W QFCA PE     +VREFYANL D EE  V VRGV V WS EA+N +F 
Subjt:  IRFVNYLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNVVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFD

Query:  LQDFPHAAFNEMVVAPSNDQLSAAVREVGIEGAQWRLSQTRKRTFQPAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
        L D P    +E +   +   L   +  V   GA+W +S     T   + L   A  W  F+K RLLPTTH  TVS+DR+LL  ++L   SI+VG++I SE
Subjt:  LQDFPHAAFNEMVVAPSNDQLSAAVREVGIEGAQWRLSQTRKRTFQPAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  ILDCWRKKVGKLFFPNTITMLCRRARVPESEDDMILPDKGIIDTPNLARLQR---TQEARQ---------------GGLVCGIQQI------QELLQLH-
        I  C  +K G LFFP+ IT LCR AR P   ++  L + G ID   +AR+ +   T+  +Q               G ++  ++ +      QE+ Q H 
Subjt:  ILDCWRKKVGKLFFPNTITMLCRRARVPESEDDMILPDKGIIDTPNLARLQR---TQEARQ---------------GGLVCGIQQI------QELLQLH-

Query:  SSRMEFAERQFQTFWDYTKKRDVALRVALQSNFSEPYPALPVFPDDLLNPWIPPAPMEGGEEEDGNEPGQE
         S ++   +Q Q FW Y+K+RD AL+ ALQ+NF+ P P  P FP ++L         E   ++DG+    E
Subjt:  SSRMEFAERQFQTFWDYTKKRDVALRVALQSNFSEPYPALPVFPDDLLNPWIPPAPMEGGEEEDGNEPGQE

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]7.6e-4039.2Show/hide
Query:  VVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFDLQD--FPHAAFNEMVVAPSNDQLSAAVREVGIEGAQWRLSQTRKRTFQPAYLKSEANTWMGFIK
        +VREFYANL D EE  + VRGV V WS EA+N +F L D    H+ F E +  P   +L   +  V   GA+W +S     T   + L   A  W  F+K
Subjt:  VVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFDLQD--FPHAAFNEMVVAPSNDQLSAAVREVGIEGAQWRLSQTRKRTFQPAYLKSEANTWMGFIK

Query:  LRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRARVPESEDDMILPDKGIIDTPNLARL------QRTQE-
         RLLPTTH   VS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A    +E+   L + G ID   +AR+      + TQ+ 
Subjt:  LRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRARVPESEDDMILPDKGIIDTPNLARL------QRTQE-

Query:  ----------ARQGGLVCGIQQIQELLQLHSSRMEFAERQFQTFWDYTKKRDVALRVALQSNFSEPYPALPVFPDDLLNPWIPPAPMEGGEEEDGNEPGQ
                  +R  G V  +QQ++ L Q   S+ E   +Q Q FW Y+K+RD AL+ ALQ+NF+ P P  P FP ++L         E   ++DG+    
Subjt:  ----------ARQGGLVCGIQQIQELLQLHSSRMEFAERQFQTFWDYTKKRDVALRVALQSNFSEPYPALPVFPDDLLNPWIPPAPMEGGEEEDGNEPGQ

Query:  E
        E
Subjt:  E

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)2.5e-3640Show/hide
Query:  IRFVNYLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNVVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFD
        ++F    A  +Y+  ++ R    E+GF         +LP F+   I    W QFCA PE     +VREFYANL D  E  V VRGV V WS EA+N +F 
Subjt:  IRFVNYLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNVVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFD

Query:  LQDFPHAAFNEMVVAPSNDQLSAAVREVGIEGAQWRLSQTRKRTFQPAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
        L D P    +E +   +   L   +  V + GA+W +S     T   + L   A  W  F+K  LLPTTH  TVS+DR+LL  ++L   SI+VG++I SE
Subjt:  LQDFPHAAFNEMVVAPSNDQLSAAVREVGIEGAQWRLSQTRKRTFQPAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  ILDCWRKKVGKLFFPNTITMLCRRARVPESEDDMILPDKGIIDTPNLARLQRTQE
        I  C  +K G LFFP+ IT LCR AR P   ++  L + G ID   +AR+  TQE
Subjt:  ILDCWRKKVGKLFFPNTITMLCRRARVPESEDDMILPDKGIIDTPNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment)5.3e-4736.12Show/hide
Query:  IRFVNYLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNVVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFD
        ++F    A  +Y+  ++ R    E+GF         +LP F+   I    W QFCA PE     +VREFYANL D EE  V VRGV V WS EA+N +F 
Subjt:  IRFVNYLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNVVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFD

Query:  LQDFPHAAFNEMVVAPSNDQLSAAVREVGIEGAQWRLSQTRKRTFQPAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
        L D P    +E +   +   L   +  V   GA+W +S     T   + L   A  W  F+K RLLPTTH  TVS+DR+LL  ++L   SI+VG++I SE
Subjt:  LQDFPHAAFNEMVVAPSNDQLSAAVREVGIEGAQWRLSQTRKRTFQPAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  ILDCWRKKVGKLFFPNTITMLCRRARVPESEDDMILPDKGIIDTPNLARLQR---TQEARQ---------------GGLVCGIQQI------QELLQLH-
        I  C  +K G LFFP+ IT LCR AR P   ++  L + G ID   +AR+ +   T+  +Q               G ++  ++ +      QE+ Q H 
Subjt:  ILDCWRKKVGKLFFPNTITMLCRRARVPESEDDMILPDKGIIDTPNLARLQR---TQEARQ---------------GGLVCGIQQI------QELLQLH-

Query:  SSRMEFAERQFQTFWDYTKKRDVALRVALQSNFSEPYPALPVFPDDLLNPWIPPAPMEGGEEEDGNEPGQE
         S ++   +Q Q FW Y+K+RD AL+ ALQ+NF+ P P  P FP ++L         E   ++DG+    E
Subjt:  SSRMEFAERQFQTFWDYTKKRDVALRVALQSNFSEPYPALPVFPDDLLNPWIPPAPMEGGEEEDGNEPGQE

A0A2P5DAQ2 Uncharacterized protein3.1e-3136.12Show/hide
Query:  IRFVNYLARAKYQE-------MLKRDFLFERGFGNELPRFLRTGIENLGWSQFCAKPEPVNSNVVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFDL
        ++F +  A  +Y+E        ++++F+++     E P F+   I    W  FCA PE     +VREFY N+ + ++  V +RGV V  S EA+N +F L
Subjt:  IRFVNYLARAKYQE-------MLKRDFLFERGFGNELPRFLRTGIENLGWSQFCAKPEPVNSNVVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFDL

Query:  QDFPHAAFNEMVVAPSNDQLSAAVREVGIEGAQWRLSQTRKRTFQPAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
         D P    +E V   +  +L   +  V I GA+W +S     T   + L   A  W  F+K RLLPTTH  TVS++ V L +++L   SI+VG++I  EI
Subjt:  QDFPHAAFNEMVVAPSNDQLSAAVREVGIEGAQWRLSQTRKRTFQPAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  LDCWRKKVGKLFFPNTITMLCRRARVP
          C  +K G LFFP+ IT +CR  R P
Subjt:  LDCWRKKVGKLFFPNTITMLCRRARVP

A0A2P5DXM3 Uncharacterized protein3.7e-4039.2Show/hide
Query:  VVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFDLQD--FPHAAFNEMVVAPSNDQLSAAVREVGIEGAQWRLSQTRKRTFQPAYLKSEANTWMGFIK
        +VREFYANL D EE  + VRGV V WS EA+N +F L D    H+ F E +  P   +L   +  V   GA+W +S     T   + L   A  W  F+K
Subjt:  VVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFDLQD--FPHAAFNEMVVAPSNDQLSAAVREVGIEGAQWRLSQTRKRTFQPAYLKSEANTWMGFIK

Query:  LRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRARVPESEDDMILPDKGIIDTPNLARL------QRTQE-
         RLLPTTH   VS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A    +E+   L + G ID   +AR+      + TQ+ 
Subjt:  LRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRARVPESEDDMILPDKGIIDTPNLARL------QRTQE-

Query:  ----------ARQGGLVCGIQQIQELLQLHSSRMEFAERQFQTFWDYTKKRDVALRVALQSNFSEPYPALPVFPDDLLNPWIPPAPMEGGEEEDGNEPGQ
                  +R  G V  +QQ++ L Q   S+ E   +Q Q FW Y+K+RD AL+ ALQ+NF+ P P  P FP ++L         E   ++DG+    
Subjt:  ----------ARQGGLVCGIQQIQELLQLHSSRMEFAERQFQTFWDYTKKRDVALRVALQSNFSEPYPALPVFPDDLLNPWIPPAPMEGGEEEDGNEPGQ

Query:  E
        E
Subjt:  E

A0A6A3BU96 Uncharacterized protein1.1e-3128.61Show/hide
Query:  RFVNYLARAKYQEMLKRDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNVVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFDLQ
        +F N  A+A++Q    R+  FE GF       G   P  +   +  L W +F   P  VN+++V+EFYAN+    +  + VRG  + ++  A+N  F LQ
Subjt:  RFVNYLARAKYQEMLKRDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNVVREFYANLDDKEEFQVIVRGVPVDWSPEAVNDLFDLQ

Query:  DF--PHAAFNEMVVAPSNDQLSAAVREVGIEGAQWRLSQTRKRTFQPAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
        +    HA F E      +++    + ++  E  +W   QT + +     L+  A  W  F+K +L+PT+H++TVS  R+LL  +++ S  IDVG+II  +
Subjt:  DF--PHAAFNEMVVAPSNDQLSAAVREVGIEGAQWRLSQTRKRTFQPAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  ILDCWRKKVGKLFFPNTITMLCRRARVPESEDDMILPDKGIIDTPNLARL-------------QRTQEARQGGLVCGIQQIQELLQLHSSRMEFAERQFQ
        + DC  KK   L FPN IT LCR+ +V E+  D ILP    I    L  L             +++    +      +  ++E +    +++       +
Subjt:  ILDCWRKKVGKLFFPNTITMLCRRARVPESEDDMILPDKGIIDTPNLARL-------------QRTQEARQGGLVCGIQQIQELLQLHSSRMEFAERQFQ

Query:  TFWDYTKKRDVALRVALQSNFSEPYPALPVFPDDLLNPWIPPAPMEGGEEEDGNEPGQED
         F+ Y K RDV +    Q          P FPD++L  +   A  E   E D  +P   D
Subjt:  TFWDYTKKRDVALRVALQSNFSEPYPALPVFPDDLLNPWIPPAPMEGGEEEDGNEPGQED

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTGGGGGAGTCCAATTGGTGGGATTTTAGCCCTAATTTCGACTAGAGACCTTCTGTGGGAACTGAAGGCAAGGCCGAGGAGCGGGAATCAAGCGAGAAGACGTGG
AGATCGTGACGGGGACTCTAGAGCGAGGAGTCTAAAATTTGATGAAATTATAAGAGGACTTCAAACCCCAAAGGAAGTTAGTGAGGTGGGATCCACTTCCATAGGATGTC
TAGTGAGTCATGTGGCCGGTGATTCATTGGCACCCATTCATGAGCTTAATTCTTTCGATCTTGCTAAGGATGAGCAGTTAGGGGTGGGTTGCATTAATAGTGGGGAAGAA
TTGGAGAGTTGTAGCACCTATCAAGAACATGTTTGTGAGGAAGAAAAAGAAAATGAGCTTGCAGTGACAGAGGAAGTTCGAGAGGTAGAGTTTGAGGTTGAAAAGCCTTC
GTCTGATTTATCTTCTCCATCCTTTGTAGATTTTGATTGTTCTTTTTTCTATTCTCACGAATTTTCTATAAATTCATCCTTTTGTGCTATGGATTGGGGGAGTCCAATTG
GTGGGATTTTAGCCCTAATTTCGACTAGAGACCTTCTGTGGGAACTGAAGGCAAGGCCGAGGAGCGGGAATCAAGCGAGAAGACGTGGAGATCGTGACGGGGACAATCTG
TTGCTGGGCGACTTGAGGGAGCAGATTTTATGTTGGAGCAAACCCGGAGGCAAAACTGCCACGTCACAACTCGTTATCCAATTTAGTAAACCGACTTCTACGGTTGGTGC
AGCAGAGTTGTGGCTGAAGCAAATCGTCCGAATAGGAAGGGATTGTAAGATCTTTCTGTTAGGATTCTATTTTGCTCAAGCCTTATTTATTGGCAACCCACGGTACGTTT
CTCTTACTTCATCATTCTTGCTTCAATCTTTTGCTTTCTTTTTCGTTTTCATTTTCTGTAAATCCCTTAAGTTTTCCATGGCGAAAACGAGAGCGAGGAAAGAAAGAGAA
AGTGAGGAGGAAGAGATACCCGTTACCCCCGAAAAAGCAGCCGAAGAAGTGGTTGAAGAAGATCCGCAAGAACCTGTCATACAGAACCCCGAGCAGGATGAGCCAAGAGT
TGCGGATACAGAGGAAGTCCAAGAAACGGGACACACTGAGGAAAGTCAAGAGCAACAGAATAAGGATATACAGGCAGAGGGTGCGACTGAAGAGAAGCCAGTTCAAAAGG
CTCGTGTTGAGGTTATCATGCCCGAACCGCCGAAACGTCGCCGCATAAAGCGAAAGGCCGGCCGTATTTCGGAAAAAGCGCGAGAAGAAGCAAAGAAGGCTGAGGAAGAG
ATTTTGAGCAAGCAAAGAGAAGACAAGGGCAAAGGTATTGCCGAGGCATCAGGTGCGGCTGACGAGGTTGAAGCACAAGGGTTACCTTTTATTCGCTTCGTCAACTACCT
TGCTCGAGCAAAATACCAGGAGATGCTGAAACGGGACTTTCTGTTTGAACGAGGATTTGGCAATGAGTTGCCACGGTTCTTGAGGACGGGAATAGAGAACCTCGGCTGGA
GCCAATTTTGTGCGAAACCAGAGCCTGTAAATTCCAACGTTGTTCGGGAATTTTACGCAAATCTTGACGATAAGGAAGAATTTCAGGTTATAGTTCGAGGAGTCCCAGTG
GATTGGAGCCCAGAAGCTGTTAATGACTTGTTTGATCTCCAGGATTTTCCGCATGCAGCCTTCAATGAGATGGTGGTTGCCCCATCTAACGACCAGTTAAGTGCGGCTGT
CCGAGAGGTTGGCATTGAGGGGGCCCAGTGGAGGTTGTCGCAGACGCGGAAGCGCACATTTCAGCCAGCTTATTTGAAAAGTGAGGCCAACACCTGGATGGGTTTTATTA
AGTTGCGCTTACTACCGACTACGCATGACTCCACAGTATCTCGGGACAGGGTATTGCTTGCCTTTGCTATTCTTCGCTCAATGAGTATTGATGTAGGAAAAATAATTTCG
TCTGAAATTCTTGACTGCTGGCGGAAAAAGGTGGGGAAGCTGTTTTTCCCCAACACTATCACGATGCTATGCCGAAGGGCACGGGTGCCAGAGAGTGAGGATGATATGAT
ATTACCAGATAAGGGAATAATTGATACGCCAAATTTGGCTAGGCTTCAGAGGACACAGGAAGCACGCCAAGGGGGTTTGGTGTGCGGCATCCAACAAATTCAGGAGCTGT
TGCAATTGCATTCCAGCAGAATGGAATTTGCTGAAAGACAATTTCAGACTTTCTGGGACTATACAAAGAAAAGGGATGTCGCCTTAAGGGTGGCCTTGCAATCAAATTTT
TCTGAACCATACCCGGCTTTACCCGTATTCCCTGATGACCTACTGAACCCCTGGATTCCGCCCGCACCAATGGAAGGAGGAGAAGAGGAAGATGGAAATGAACCGGGCCA
AGAGGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATTGGGGGAGTCCAATTGGTGGGATTTTAGCCCTAATTTCGACTAGAGACCTTCTGTGGGAACTGAAGGCAAGGCCGAGGAGCGGGAATCAAGCGAGAAGACGTGG
AGATCGTGACGGGGACTCTAGAGCGAGGAGTCTAAAATTTGATGAAATTATAAGAGGACTTCAAACCCCAAAGGAAGTTAGTGAGGTGGGATCCACTTCCATAGGATGTC
TAGTGAGTCATGTGGCCGGTGATTCATTGGCACCCATTCATGAGCTTAATTCTTTCGATCTTGCTAAGGATGAGCAGTTAGGGGTGGGTTGCATTAATAGTGGGGAAGAA
TTGGAGAGTTGTAGCACCTATCAAGAACATGTTTGTGAGGAAGAAAAAGAAAATGAGCTTGCAGTGACAGAGGAAGTTCGAGAGGTAGAGTTTGAGGTTGAAAAGCCTTC
GTCTGATTTATCTTCTCCATCCTTTGTAGATTTTGATTGTTCTTTTTTCTATTCTCACGAATTTTCTATAAATTCATCCTTTTGTGCTATGGATTGGGGGAGTCCAATTG
GTGGGATTTTAGCCCTAATTTCGACTAGAGACCTTCTGTGGGAACTGAAGGCAAGGCCGAGGAGCGGGAATCAAGCGAGAAGACGTGGAGATCGTGACGGGGACAATCTG
TTGCTGGGCGACTTGAGGGAGCAGATTTTATGTTGGAGCAAACCCGGAGGCAAAACTGCCACGTCACAACTCGTTATCCAATTTAGTAAACCGACTTCTACGGTTGGTGC
AGCAGAGTTGTGGCTGAAGCAAATCGTCCGAATAGGAAGGGATTGTAAGATCTTTCTGTTAGGATTCTATTTTGCTCAAGCCTTATTTATTGGCAACCCACGGTACGTTT
CTCTTACTTCATCATTCTTGCTTCAATCTTTTGCTTTCTTTTTCGTTTTCATTTTCTGTAAATCCCTTAAGTTTTCCATGGCGAAAACGAGAGCGAGGAAAGAAAGAGAA
AGTGAGGAGGAAGAGATACCCGTTACCCCCGAAAAAGCAGCCGAAGAAGTGGTTGAAGAAGATCCGCAAGAACCTGTCATACAGAACCCCGAGCAGGATGAGCCAAGAGT
TGCGGATACAGAGGAAGTCCAAGAAACGGGACACACTGAGGAAAGTCAAGAGCAACAGAATAAGGATATACAGGCAGAGGGTGCGACTGAAGAGAAGCCAGTTCAAAAGG
CTCGTGTTGAGGTTATCATGCCCGAACCGCCGAAACGTCGCCGCATAAAGCGAAAGGCCGGCCGTATTTCGGAAAAAGCGCGAGAAGAAGCAAAGAAGGCTGAGGAAGAG
ATTTTGAGCAAGCAAAGAGAAGACAAGGGCAAAGGTATTGCCGAGGCATCAGGTGCGGCTGACGAGGTTGAAGCACAAGGGTTACCTTTTATTCGCTTCGTCAACTACCT
TGCTCGAGCAAAATACCAGGAGATGCTGAAACGGGACTTTCTGTTTGAACGAGGATTTGGCAATGAGTTGCCACGGTTCTTGAGGACGGGAATAGAGAACCTCGGCTGGA
GCCAATTTTGTGCGAAACCAGAGCCTGTAAATTCCAACGTTGTTCGGGAATTTTACGCAAATCTTGACGATAAGGAAGAATTTCAGGTTATAGTTCGAGGAGTCCCAGTG
GATTGGAGCCCAGAAGCTGTTAATGACTTGTTTGATCTCCAGGATTTTCCGCATGCAGCCTTCAATGAGATGGTGGTTGCCCCATCTAACGACCAGTTAAGTGCGGCTGT
CCGAGAGGTTGGCATTGAGGGGGCCCAGTGGAGGTTGTCGCAGACGCGGAAGCGCACATTTCAGCCAGCTTATTTGAAAAGTGAGGCCAACACCTGGATGGGTTTTATTA
AGTTGCGCTTACTACCGACTACGCATGACTCCACAGTATCTCGGGACAGGGTATTGCTTGCCTTTGCTATTCTTCGCTCAATGAGTATTGATGTAGGAAAAATAATTTCG
TCTGAAATTCTTGACTGCTGGCGGAAAAAGGTGGGGAAGCTGTTTTTCCCCAACACTATCACGATGCTATGCCGAAGGGCACGGGTGCCAGAGAGTGAGGATGATATGAT
ATTACCAGATAAGGGAATAATTGATACGCCAAATTTGGCTAGGCTTCAGAGGACACAGGAAGCACGCCAAGGGGGTTTGGTGTGCGGCATCCAACAAATTCAGGAGCTGT
TGCAATTGCATTCCAGCAGAATGGAATTTGCTGAAAGACAATTTCAGACTTTCTGGGACTATACAAAGAAAAGGGATGTCGCCTTAAGGGTGGCCTTGCAATCAAATTTT
TCTGAACCATACCCGGCTTTACCCGTATTCCCTGATGACCTACTGAACCCCTGGATTCCGCCCGCACCAATGGAAGGAGGAGAAGAGGAAGATGGAAATGAACCGGGCCA
AGAGGACTGA
Protein sequenceShow/hide protein sequence
MDWGSPIGGILALISTRDLLWELKARPRSGNQARRRGDRDGDSRARSLKFDEIIRGLQTPKEVSEVGSTSIGCLVSHVAGDSLAPIHELNSFDLAKDEQLGVGCINSGEE
LESCSTYQEHVCEEEKENELAVTEEVREVEFEVEKPSSDLSSPSFVDFDCSFFYSHEFSINSSFCAMDWGSPIGGILALISTRDLLWELKARPRSGNQARRRGDRDGDNL
LLGDLREQILCWSKPGGKTATSQLVIQFSKPTSTVGAAELWLKQIVRIGRDCKIFLLGFYFAQALFIGNPRYVSLTSSFLLQSFAFFFVFIFCKSLKFSMAKTRARKERE
SEEEEIPVTPEKAAEEVVEEDPQEPVIQNPEQDEPRVADTEEVQETGHTEESQEQQNKDIQAEGATEEKPVQKARVEVIMPEPPKRRRIKRKAGRISEKAREEAKKAEEE
ILSKQREDKGKGIAEASGAADEVEAQGLPFIRFVNYLARAKYQEMLKRDFLFERGFGNELPRFLRTGIENLGWSQFCAKPEPVNSNVVREFYANLDDKEEFQVIVRGVPV
DWSPEAVNDLFDLQDFPHAAFNEMVVAPSNDQLSAAVREVGIEGAQWRLSQTRKRTFQPAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIIS
SEILDCWRKKVGKLFFPNTITMLCRRARVPESEDDMILPDKGIIDTPNLARLQRTQEARQGGLVCGIQQIQELLQLHSSRMEFAERQFQTFWDYTKKRDVALRVALQSNF
SEPYPALPVFPDDLLNPWIPPAPMEGGEEEDGNEPGQED