; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg018506 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg018506
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUnknown protein
Genome locationscaffold3:10126245..10139841
RNA-Seq ExpressionSpg018506
SyntenySpg018506
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8707640.1 hypothetical protein F3Y22_tig00110378pilonHSYRG00039 [Hibiscus syriacus]3.1e-3129.39Show/hide
Query:  EEEPKYQEVLKRDFLFERGFGSDLPRFLESGIASLG------WREFCAKPDPVNANIVREFYANLDVKDDFEVIVRGVPVQWSPEAINNLFDLQDF--PH
        E +P++Q V  R   FE GF      F E      G       +EF   P  VNA++V+EFYAN+   +   + VRG  + ++  AIN  F LQ+    H
Subjt:  EEEPKYQEVLKRDFLFERGFGSDLPRFLESGIASLG------WREFCAKPDPVNANIVREFYANLDVKDDFEVIVRGVPVQWSPEAINNLFDLQDF--PH

Query:  AVFNEMMVAPSSDQLSAAVREVGVEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTISPDRVLLAFAILRSMSIDVGKIIATEIADCWR
        A+  E      S+     + ++  E  +W   QT +++     L+  A  W  F++ +L+PT+H++ +S  R+LL  +++ S  IDVG+II  ++ DC  
Subjt:  AVFNEMMVAPSSDQLSAAVREVGVEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTISPDRVLLAFAILRSMSIDVGKIIATEIADCWR

Query:  KKVGKLFFPNTITMLCRRAGVPESEDDVILPDKGIIDTPNLARLQRTQEVRQGGLIY----GINTIVEQLALLASRQEFAERQA---------LTFWTYF
        KK   L FPN IT LC +  V E+  D ILP    I    L  L   +  +    ++    G      +  LLA  ++  + QA           F+ Y 
Subjt:  KKVGKLFFPNTITMLCRRAGVPESEDDVILPDKGIIDTPNLARLQRTQEVRQGGLIY----GINTIVEQLALLASRQEFAERQA---------LTFWTYF

Query:  KNRDAGLRRALQENFSNPYPALPAFPEDLLNPWIPPPPAEKENEEED
        K+RDA +    QE   +     P FP+++L  +      E E+   D
Subjt:  KNRDAGLRRALQENFSNPYPALPAFPEDLLNPWIPPPPAEKENEEED

KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus]3.7e-3228.16Show/hide
Query:  VIRNTPSPPTSDSEEEKREAENKAKEEEARKAEEEPKYQEVLKRDFLFERGF-------GSDLPRFLESGIASLGWREFCAKPDPVNANIVREFYANLDV
        ++R+ P+P  + + ++    E KA            ++Q    R+  FE GF       G   P  ++  +  L W +F   P  VNA++V+EFYAN+  
Subjt:  VIRNTPSPPTSDSEEEKREAENKAKEEEARKAEEEPKYQEVLKRDFLFERGF-------GSDLPRFLESGIASLGWREFCAKPDPVNANIVREFYANLDV

Query:  KDDFEVIVRGVPVQWSPEAINNLFDLQDF--PHAVFNEMMVAPSSDQLSAAVREVGVEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDST
         +   + VRG  ++++  AIN  F LQ+    HA+F E      S++    + ++  E  +W   QT +++     L+  A  W  F++ +L+PT+H++T
Subjt:  KDDFEVIVRGVPVQWSPEAINNLFDLQDF--PHAVFNEMMVAPSSDQLSAAVREVGVEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDST

Query:  ISPDRVLLAFAILRSMSIDVGKIIATEIADCWRKKVGKLFFPNTITMLCRRAGVPESEDDVILPDKGIIDTPNLARLQRTQEVRQGGLIY----GINTIV
        +S  R+LL  +++ S  IDVG+II  ++ DC  KK   L FPN IT LCR+  V E+  D ILP    I    L  L   +  +    ++    G     
Subjt:  ISPDRVLLAFAILRSMSIDVGKIIATEIADCWRKKVGKLFFPNTITMLCRRAGVPESEDDVILPDKGIIDTPNLARLQRTQEVRQGGLIY----GINTIV

Query:  EQLALLASRQEFAERQA---------LTFWTYFKNRDAGLRRALQENFSNPYPALPAFPEDLLNPWIPPPPAEKENEEED
         ++ LLA  +   + QA           F+ Y K+RD  +    QE         P FP+++L  +      E E++  D
Subjt:  EQLALLASRQEFAERQA---------LTFWTYFKNRDAGLRRALQENFSNPYPALPAFPEDLLNPWIPPPPAEKENEEED

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]2.4e-3138.11Show/hide
Query:  KREAENKAKEEEARKAEEEPKYQ-EVLKRDFLFERGFGSD-------LPRFLESGIASLGWREFCAKPDPVNANIVREFYANLDVKDDFEVIVRGVPVQW
        KR A    K  +      E +Y+  +  R    E+GF  D       LP F+   I    W++FCA P+     +VREFYANL    +  V VRGV V W
Subjt:  KREAENKAKEEEARKAEEEPKYQ-EVLKRDFLFERGFGSD-------LPRFLESGIASLGWREFCAKPDPVNANIVREFYANLDVKDDFEVIVRGVPVQW

Query:  SPEAINNLFDLQDFPHAVFNEMMVAPSSDQLSAAVREVGVEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTISPDRVLLAFAILRSMS
        S EAIN +F L D P    +E +   +   L   +  V V GA+W VS    +T   + L   A  W  F++  LLPTTH  T+S DR+LL  ++L   S
Subjt:  SPEAINNLFDLQDFPHAVFNEMMVAPSSDQLSAAVREVGVEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTISPDRVLLAFAILRSMS

Query:  IDVGKIIATEIADCWRKKVGKLFFPNTITMLCRRAGVPESEDDVILPDKGIIDTPNLARLQRTQE
        I+VG++I +EI  C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+  TQE
Subjt:  IDVGKIIATEIADCWRKKVGKLFFPNTITMLCRRAGVPESEDDVILPDKGIIDTPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]1.2e-4334.37Show/hide
Query:  ARKAEEEPKYQ----------EVLKRDFLFERGFGSD-------LPRFLESGIASLGWREFCAKPDPVNANIVREFYANLDVKDDFEVIVRGVPVQWSPE
        ARKA +  K++           +  R    E+GF  D       LP F+   I    W++FCA P+     +VREFYANL   ++  V VRGV V WS E
Subjt:  ARKAEEEPKYQ----------EVLKRDFLFERGFGSD-------LPRFLESGIASLGWREFCAKPDPVNANIVREFYANLDVKDDFEVIVRGVPVQWSPE

Query:  AINNLFDLQDFPHAVFNEMMVAPSSDQLSAAVREVGVEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTISPDRVLLAFAILRSMSIDV
        AIN +F L D P    +E +   +   L   +  V   GA+W VS    +T   + L   A  W  F++ RLLPTTH  T+S DR+LL  ++L   SI+V
Subjt:  AINNLFDLQDFPHAVFNEMMVAPSSDQLSAAVREVGVEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTISPDRVLLAFAILRSMSIDV

Query:  GKIIATEIADCWRKKVGKLFFPNTITMLCRRAGVPESEDDVILPDKGIIDTPNLARLQR---TQEVRQ---------------GGLIYGINTIVEQLAL-
        G++I +EI  C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+ +   T+  +Q               G ++  +  + ++L+  
Subjt:  GKIIATEIADCWRKKVGKLFFPNTITMLCRRAGVPESEDDVILPDKGIIDTPNLARLQR---TQEVRQ---------------GGLIYGINTIVEQLAL-

Query:  ------LASRQEFAERQALTFWTYFKNRDAGLRRALQENFSNPYPALPAFPEDLL
              + S  +   +Q   FW Y K RD  L++ALQ NF+ P P  PAFP+++L
Subjt:  ------LASRQEFAERQALTFWTYFKNRDAGLRRALQENFSNPYPALPAFPEDLL

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]4.0e-3937.63Show/hide
Query:  IVREFYANLDVKDDFEVIVRGVPVQWSPEAINNLFDLQD--FPHAVFNEMMVAPSSDQLSAAVREVGVEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIR
        +VREFYANL   ++  + VRGV V WS EAIN +F L D    H+ F E +  P   +L   +  V   GA+W VS    +T   + L   A  W  F++
Subjt:  IVREFYANLDVKDDFEVIVRGVPVQWSPEAINNLFDLQD--FPHAVFNEMMVAPSSDQLSAAVREVGVEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIR

Query:  LRLLPTTHDSTISPDRVLLAFAILRSMSIDVGKIIATEIADCWRKKVGKLFFPNTITMLCRRAGVPESEDDVILPDKGIIDTPNLARLQR---TQEVRQ-
         RLLPTTH   +S DR+LL  ++L   SI+VG++I +EI  C  +K G LFFP+ IT LCR A    +E+   L + G ID   +AR+ +   T+  +Q 
Subjt:  LRLLPTTHDSTISPDRVLLAFAILRSMSIDVGKIIATEIADCWRKKVGKLFFPNTITMLCRRAGVPESEDDVILPDKGIIDTPNLARLQR---TQEVRQ-

Query:  --------------GGLIYGINTIVEQLALLASRQEFAERQALTFWTYFKNRDAGLRRALQENFSNPYPALPAFPEDLL
                      G ++  +  + ++L    S+QE   +Q   FW Y K RD  L++ALQ NF+ P P  PAFP+++L
Subjt:  --------------GGLIYGINTIVEQLALLASRQEFAERQALTFWTYFKNRDAGLRRALQENFSNPYPALPAFPEDLL

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)1.2e-3138.11Show/hide
Query:  KREAENKAKEEEARKAEEEPKYQ-EVLKRDFLFERGFGSD-------LPRFLESGIASLGWREFCAKPDPVNANIVREFYANLDVKDDFEVIVRGVPVQW
        KR A    K  +      E +Y+  +  R    E+GF  D       LP F+   I    W++FCA P+     +VREFYANL    +  V VRGV V W
Subjt:  KREAENKAKEEEARKAEEEPKYQ-EVLKRDFLFERGFGSD-------LPRFLESGIASLGWREFCAKPDPVNANIVREFYANLDVKDDFEVIVRGVPVQW

Query:  SPEAINNLFDLQDFPHAVFNEMMVAPSSDQLSAAVREVGVEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTISPDRVLLAFAILRSMS
        S EAIN +F L D P    +E +   +   L   +  V V GA+W VS    +T   + L   A  W  F++  LLPTTH  T+S DR+LL  ++L   S
Subjt:  SPEAINNLFDLQDFPHAVFNEMMVAPSSDQLSAAVREVGVEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTISPDRVLLAFAILRSMS

Query:  IDVGKIIATEIADCWRKKVGKLFFPNTITMLCRRAGVPESEDDVILPDKGIIDTPNLARLQRTQE
        I+VG++I +EI  C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+  TQE
Subjt:  IDVGKIIATEIADCWRKKVGKLFFPNTITMLCRRAGVPESEDDVILPDKGIIDTPNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment)5.9e-4434.37Show/hide
Query:  ARKAEEEPKYQ----------EVLKRDFLFERGFGSD-------LPRFLESGIASLGWREFCAKPDPVNANIVREFYANLDVKDDFEVIVRGVPVQWSPE
        ARKA +  K++           +  R    E+GF  D       LP F+   I    W++FCA P+     +VREFYANL   ++  V VRGV V WS E
Subjt:  ARKAEEEPKYQ----------EVLKRDFLFERGFGSD-------LPRFLESGIASLGWREFCAKPDPVNANIVREFYANLDVKDDFEVIVRGVPVQWSPE

Query:  AINNLFDLQDFPHAVFNEMMVAPSSDQLSAAVREVGVEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTISPDRVLLAFAILRSMSIDV
        AIN +F L D P    +E +   +   L   +  V   GA+W VS    +T   + L   A  W  F++ RLLPTTH  T+S DR+LL  ++L   SI+V
Subjt:  AINNLFDLQDFPHAVFNEMMVAPSSDQLSAAVREVGVEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTISPDRVLLAFAILRSMSIDV

Query:  GKIIATEIADCWRKKVGKLFFPNTITMLCRRAGVPESEDDVILPDKGIIDTPNLARLQR---TQEVRQ---------------GGLIYGINTIVEQLAL-
        G++I +EI  C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+ +   T+  +Q               G ++  +  + ++L+  
Subjt:  GKIIATEIADCWRKKVGKLFFPNTITMLCRRAGVPESEDDVILPDKGIIDTPNLARLQR---TQEVRQ---------------GGLIYGINTIVEQLAL-

Query:  ------LASRQEFAERQALTFWTYFKNRDAGLRRALQENFSNPYPALPAFPEDLL
              + S  +   +Q   FW Y K RD  L++ALQ NF+ P P  PAFP+++L
Subjt:  ------LASRQEFAERQALTFWTYFKNRDAGLRRALQENFSNPYPALPAFPEDLL

A0A2P5DXM3 Uncharacterized protein2.0e-3937.63Show/hide
Query:  IVREFYANLDVKDDFEVIVRGVPVQWSPEAINNLFDLQD--FPHAVFNEMMVAPSSDQLSAAVREVGVEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIR
        +VREFYANL   ++  + VRGV V WS EAIN +F L D    H+ F E +  P   +L   +  V   GA+W VS    +T   + L   A  W  F++
Subjt:  IVREFYANLDVKDDFEVIVRGVPVQWSPEAINNLFDLQD--FPHAVFNEMMVAPSSDQLSAAVREVGVEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIR

Query:  LRLLPTTHDSTISPDRVLLAFAILRSMSIDVGKIIATEIADCWRKKVGKLFFPNTITMLCRRAGVPESEDDVILPDKGIIDTPNLARLQR---TQEVRQ-
         RLLPTTH   +S DR+LL  ++L   SI+VG++I +EI  C  +K G LFFP+ IT LCR A    +E+   L + G ID   +AR+ +   T+  +Q 
Subjt:  LRLLPTTHDSTISPDRVLLAFAILRSMSIDVGKIIATEIADCWRKKVGKLFFPNTITMLCRRAGVPESEDDVILPDKGIIDTPNLARLQR---TQEVRQ-

Query:  --------------GGLIYGINTIVEQLALLASRQEFAERQALTFWTYFKNRDAGLRRALQENFSNPYPALPAFPEDLL
                      G ++  +  + ++L    S+QE   +Q   FW Y K RD  L++ALQ NF+ P P  PAFP+++L
Subjt:  --------------GGLIYGINTIVEQLALLASRQEFAERQALTFWTYFKNRDAGLRRALQENFSNPYPALPAFPEDLL

A0A6A3ASF6 Uncharacterized protein1.5e-3129.39Show/hide
Query:  EEEPKYQEVLKRDFLFERGFGSDLPRFLESGIASLG------WREFCAKPDPVNANIVREFYANLDVKDDFEVIVRGVPVQWSPEAINNLFDLQDF--PH
        E +P++Q V  R   FE GF      F E      G       +EF   P  VNA++V+EFYAN+   +   + VRG  + ++  AIN  F LQ+    H
Subjt:  EEEPKYQEVLKRDFLFERGFGSDLPRFLESGIASLG------WREFCAKPDPVNANIVREFYANLDVKDDFEVIVRGVPVQWSPEAINNLFDLQDF--PH

Query:  AVFNEMMVAPSSDQLSAAVREVGVEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTISPDRVLLAFAILRSMSIDVGKIIATEIADCWR
        A+  E      S+     + ++  E  +W   QT +++     L+  A  W  F++ +L+PT+H++ +S  R+LL  +++ S  IDVG+II  ++ DC  
Subjt:  AVFNEMMVAPSSDQLSAAVREVGVEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTISPDRVLLAFAILRSMSIDVGKIIATEIADCWR

Query:  KKVGKLFFPNTITMLCRRAGVPESEDDVILPDKGIIDTPNLARLQRTQEVRQGGLIY----GINTIVEQLALLASRQEFAERQA---------LTFWTYF
        KK   L FPN IT LC +  V E+  D ILP    I    L  L   +  +    ++    G      +  LLA  ++  + QA           F+ Y 
Subjt:  KKVGKLFFPNTITMLCRRAGVPESEDDVILPDKGIIDTPNLARLQRTQEVRQGGLIY----GINTIVEQLALLASRQEFAERQA---------LTFWTYF

Query:  KNRDAGLRRALQENFSNPYPALPAFPEDLLNPWIPPPPAEKENEEED
        K+RDA +    QE   +     P FP+++L  +      E E+   D
Subjt:  KNRDAGLRRALQENFSNPYPALPAFPEDLLNPWIPPPPAEKENEEED

A0A6A3BU96 Uncharacterized protein1.8e-3228.16Show/hide
Query:  VIRNTPSPPTSDSEEEKREAENKAKEEEARKAEEEPKYQEVLKRDFLFERGF-------GSDLPRFLESGIASLGWREFCAKPDPVNANIVREFYANLDV
        ++R+ P+P  + + ++    E KA            ++Q    R+  FE GF       G   P  ++  +  L W +F   P  VNA++V+EFYAN+  
Subjt:  VIRNTPSPPTSDSEEEKREAENKAKEEEARKAEEEPKYQEVLKRDFLFERGF-------GSDLPRFLESGIASLGWREFCAKPDPVNANIVREFYANLDV

Query:  KDDFEVIVRGVPVQWSPEAINNLFDLQDF--PHAVFNEMMVAPSSDQLSAAVREVGVEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDST
         +   + VRG  ++++  AIN  F LQ+    HA+F E      S++    + ++  E  +W   QT +++     L+  A  W  F++ +L+PT+H++T
Subjt:  KDDFEVIVRGVPVQWSPEAINNLFDLQDF--PHAVFNEMMVAPSSDQLSAAVREVGVEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDST

Query:  ISPDRVLLAFAILRSMSIDVGKIIATEIADCWRKKVGKLFFPNTITMLCRRAGVPESEDDVILPDKGIIDTPNLARLQRTQEVRQGGLIY----GINTIV
        +S  R+LL  +++ S  IDVG+II  ++ DC  KK   L FPN IT LCR+  V E+  D ILP    I    L  L   +  +    ++    G     
Subjt:  ISPDRVLLAFAILRSMSIDVGKIIATEIADCWRKKVGKLFFPNTITMLCRRAGVPESEDDVILPDKGIIDTPNLARLQRTQEVRQGGLIY----GINTIV

Query:  EQLALLASRQEFAERQA---------LTFWTYFKNRDAGLRRALQENFSNPYPALPAFPEDLLNPWIPPPPAEKENEEED
         ++ LLA  +   + QA           F+ Y K+RD  +    QE         P FP+++L  +      E E++  D
Subjt:  EQLALLASRQEFAERQA---------LTFWTYFKNRDAGLRRALQENFSNPYPALPAFPEDLLNPWIPPPPAEKENEEED

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAAGACTCGAGCTAGGAAAGAAAGAGAGAGTGAAGAGGAGGAAGTGCCAGTCATGCCGGAAGTGCAAAAAGGGAAAACTAAGAAAAAGAGAACGCCAGAAGAAAA
AGAGGCTAAGCGAAGAAGGAGGCAGCAGAGGGCTGCGGAGCAAGAAGCCACTCAAGAAACGGAGAATGTTGTGGATACGGAGGGAATTCAAAATCTTGAGGAAGAATCGA
TAATTTCTGCTACGGTTCAAGAAGGGAATACTGAGAAGAATCAGGAAACGGAGGCTGAAGAGCAGGCCGCTGGTGAGCCTGACAAGGAGAAAACACTGGAGCCGGAGGCT
CATGTTGAAGTCGTAATGCCGGAACCGCCCAAGCGCCGCCGCATCAAAAGGAAGGCGGGTCGCGTGAGGGTGATTCGGAACACTCCATCGCCTCCGACGTCGGACTCTGA
GGAAGAAAAGAGGGAAGCTGAAAATAAGGCAAAGGAAGAAGAGGCAAGGAAGGCAGAAGAAGAGCCAAAATATCAAGAAGTGCTGAAGCGTGATTTCTTGTTCGAGCGAG
GATTTGGCAGTGATTTGCCAAGGTTCTTAGAGTCTGGAATAGCGAGCCTTGGGTGGAGGGAGTTTTGTGCGAAGCCTGATCCTGTCAATGCCAACATCGTTCGGGAATTC
TACGCCAATCTTGACGTTAAGGATGATTTTGAAGTTATAGTGCGAGGAGTGCCTGTCCAATGGAGCCCAGAGGCCATTAATAATTTGTTTGATCTCCAGGACTTTCCACA
TGCAGTTTTCAATGAGATGATGGTTGCCCCATCGAGCGACCAATTAAGTGCGGCTGTTCGAGAGGTAGGTGTTGAGGGGGCTCAATGGAGGGTGTCGCAGACGCGCAAGC
ACACATTTCAAGCTGCCTATTTGAAAAGTGAAGCCAACACTTGGATGGGTTTCATTAGGCTACGCTTACTGCCGACAACACACGACTCCACTATATCTCCGGACAGAGTA
TTGCTTGCCTTTGCTATCCTTCGCTCAATGAGTATAGATGTTGGAAAAATAATTGCTACTGAGATTGCTGACTGTTGGCGTAAGAAGGTGGGGAAGCTGTTTTTCCCCAA
CACTATCACGATGTTGTGCCGAAGGGCAGGGGTACCAGAGAGTGAAGATGATGTGATTTTACCGGATAAGGGAATCATCGATACGCCCAATCTGGCACGGCTCCAGCGTA
CGCAGGAGGTACGCCAAGGTGGGCTTATCTACGGCATCAACACGATTGTCGAACAACTGGCACTTTTGGCCAGCAGGCAGGAGTTTGCTGAAAGGCAAGCTTTAACCTTC
TGGACCTATTTTAAGAATCGTGATGCCGGATTGAGAAGGGCGCTGCAGGAAAATTTTTCAAACCCATATCCAGCCCTTCCTGCATTCCCTGAGGACCTACTGAACCCCTG
GATTCCACCACCGCCTGCTGAGAAGGAAAATGAAGAAGAAGATTTGGCATGTCTTCTAGCCTGGACTGTTGCTGCGGCAAAGAAGAATTCTGGAGGAATCATTTTGCTGC
AGCAGAGCTTGGTTTTGCAGAGTGCTCAGAATCTGTTGCTGGGCGACTTGAGGGAGCAAATTTTATGCTGGAGCAAACCTGGAAGCAAAACTGCCACGTCACAGCTCGTT
ATCCAATTTAGTAAACCGACTTCTGAACACGAGACGGCATGCGCGAGGAGAGAGAGCCAAGTTGATTGGCTTCGTGGTCTGCGGTTCGTCGGCGTGGAGGAGACAACGAG
GGAAGAGGGAGACGGCGCCAGAGAGAAGAAGAAACGACAGCTGAACAGAGGACGTGGCTCACCTTCCTTTCAGAATGGCCCTATTCTCCCTTCTCCGCCGCCGAATACGG
GTAGAGTTGTTTGCCAAATCTGCCTTCGCCCCGGTCATTCCGCCTTGGACTGCTACAATCGTATGAACTATAGCTTCCAAGGGCGTCATCCTCCAGCCCAACTTGCTGCA
CTCGTTGCCTCTCAAAACTCAACTCATCATGTTGTTTTTCCATCTTCTTCTACTTGGCTCACAGACTTCGGATGCAATGCTCATATTACTACTGACCTGAACAATTTGTC
CACTGCCTCTGAATACAATGGAGAAGAACAAGTCTCAATCGGTAGTGATGACCTAGCTGTCTCCCGCCGCCGTGAACCACCGCCACCCGCCGCCACCGTCGTCATACCGT
CGTGCCTCGCCGTGAGCCGCAGTCCGTCACTCCCTCTGATCACCACCATCGCTGCCTTAAAACAAACAGCCGCGCGCCTCTTGAATGTTCGACGATTTCAGTTAGTGGGT
AAGGGAGTTGTGGTGGTCGAATCATCAAAGGTGGAGCGAAACTTGGAAGCTGGGCAGCAAGCATTAGTTTCGTTTGGCTCCACTGACTGGGTGCCTTCTACCATCAGTAG
GAGTAGAGATCTCCCAAAATACCCACCTGTGTACTACCAGGCGAAAATCTACCGAAAAACCCGCAAGTGGAACAGCCGGGGCCTGCGGCAGAACCTAGCACAATGGATAT
CCAAGCAAAAGTTTAGTCAGCTGTTGCCGGAGCGGTACAGTCTGCAGTTCAGACCGCAGTTCAGGCGGCTATTGCAGGCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAAGACTCGAGCTAGGAAAGAAAGAGAGAGTGAAGAGGAGGAAGTGCCAGTCATGCCGGAAGTGCAAAAAGGGAAAACTAAGAAAAAGAGAACGCCAGAAGAAAA
AGAGGCTAAGCGAAGAAGGAGGCAGCAGAGGGCTGCGGAGCAAGAAGCCACTCAAGAAACGGAGAATGTTGTGGATACGGAGGGAATTCAAAATCTTGAGGAAGAATCGA
TAATTTCTGCTACGGTTCAAGAAGGGAATACTGAGAAGAATCAGGAAACGGAGGCTGAAGAGCAGGCCGCTGGTGAGCCTGACAAGGAGAAAACACTGGAGCCGGAGGCT
CATGTTGAAGTCGTAATGCCGGAACCGCCCAAGCGCCGCCGCATCAAAAGGAAGGCGGGTCGCGTGAGGGTGATTCGGAACACTCCATCGCCTCCGACGTCGGACTCTGA
GGAAGAAAAGAGGGAAGCTGAAAATAAGGCAAAGGAAGAAGAGGCAAGGAAGGCAGAAGAAGAGCCAAAATATCAAGAAGTGCTGAAGCGTGATTTCTTGTTCGAGCGAG
GATTTGGCAGTGATTTGCCAAGGTTCTTAGAGTCTGGAATAGCGAGCCTTGGGTGGAGGGAGTTTTGTGCGAAGCCTGATCCTGTCAATGCCAACATCGTTCGGGAATTC
TACGCCAATCTTGACGTTAAGGATGATTTTGAAGTTATAGTGCGAGGAGTGCCTGTCCAATGGAGCCCAGAGGCCATTAATAATTTGTTTGATCTCCAGGACTTTCCACA
TGCAGTTTTCAATGAGATGATGGTTGCCCCATCGAGCGACCAATTAAGTGCGGCTGTTCGAGAGGTAGGTGTTGAGGGGGCTCAATGGAGGGTGTCGCAGACGCGCAAGC
ACACATTTCAAGCTGCCTATTTGAAAAGTGAAGCCAACACTTGGATGGGTTTCATTAGGCTACGCTTACTGCCGACAACACACGACTCCACTATATCTCCGGACAGAGTA
TTGCTTGCCTTTGCTATCCTTCGCTCAATGAGTATAGATGTTGGAAAAATAATTGCTACTGAGATTGCTGACTGTTGGCGTAAGAAGGTGGGGAAGCTGTTTTTCCCCAA
CACTATCACGATGTTGTGCCGAAGGGCAGGGGTACCAGAGAGTGAAGATGATGTGATTTTACCGGATAAGGGAATCATCGATACGCCCAATCTGGCACGGCTCCAGCGTA
CGCAGGAGGTACGCCAAGGTGGGCTTATCTACGGCATCAACACGATTGTCGAACAACTGGCACTTTTGGCCAGCAGGCAGGAGTTTGCTGAAAGGCAAGCTTTAACCTTC
TGGACCTATTTTAAGAATCGTGATGCCGGATTGAGAAGGGCGCTGCAGGAAAATTTTTCAAACCCATATCCAGCCCTTCCTGCATTCCCTGAGGACCTACTGAACCCCTG
GATTCCACCACCGCCTGCTGAGAAGGAAAATGAAGAAGAAGATTTGGCATGTCTTCTAGCCTGGACTGTTGCTGCGGCAAAGAAGAATTCTGGAGGAATCATTTTGCTGC
AGCAGAGCTTGGTTTTGCAGAGTGCTCAGAATCTGTTGCTGGGCGACTTGAGGGAGCAAATTTTATGCTGGAGCAAACCTGGAAGCAAAACTGCCACGTCACAGCTCGTT
ATCCAATTTAGTAAACCGACTTCTGAACACGAGACGGCATGCGCGAGGAGAGAGAGCCAAGTTGATTGGCTTCGTGGTCTGCGGTTCGTCGGCGTGGAGGAGACAACGAG
GGAAGAGGGAGACGGCGCCAGAGAGAAGAAGAAACGACAGCTGAACAGAGGACGTGGCTCACCTTCCTTTCAGAATGGCCCTATTCTCCCTTCTCCGCCGCCGAATACGG
GTAGAGTTGTTTGCCAAATCTGCCTTCGCCCCGGTCATTCCGCCTTGGACTGCTACAATCGTATGAACTATAGCTTCCAAGGGCGTCATCCTCCAGCCCAACTTGCTGCA
CTCGTTGCCTCTCAAAACTCAACTCATCATGTTGTTTTTCCATCTTCTTCTACTTGGCTCACAGACTTCGGATGCAATGCTCATATTACTACTGACCTGAACAATTTGTC
CACTGCCTCTGAATACAATGGAGAAGAACAAGTCTCAATCGGTAGTGATGACCTAGCTGTCTCCCGCCGCCGTGAACCACCGCCACCCGCCGCCACCGTCGTCATACCGT
CGTGCCTCGCCGTGAGCCGCAGTCCGTCACTCCCTCTGATCACCACCATCGCTGCCTTAAAACAAACAGCCGCGCGCCTCTTGAATGTTCGACGATTTCAGTTAGTGGGT
AAGGGAGTTGTGGTGGTCGAATCATCAAAGGTGGAGCGAAACTTGGAAGCTGGGCAGCAAGCATTAGTTTCGTTTGGCTCCACTGACTGGGTGCCTTCTACCATCAGTAG
GAGTAGAGATCTCCCAAAATACCCACCTGTGTACTACCAGGCGAAAATCTACCGAAAAACCCGCAAGTGGAACAGCCGGGGCCTGCGGCAGAACCTAGCACAATGGATAT
CCAAGCAAAAGTTTAGTCAGCTGTTGCCGGAGCGGTACAGTCTGCAGTTCAGACCGCAGTTCAGGCGGCTATTGCAGGCGTGA
Protein sequenceShow/hide protein sequence
MAKTRARKERESEEEEVPVMPEVQKGKTKKKRTPEEKEAKRRRRQQRAAEQEATQETENVVDTEGIQNLEEESIISATVQEGNTEKNQETEAEEQAAGEPDKEKTLEPEA
HVEVVMPEPPKRRRIKRKAGRVRVIRNTPSPPTSDSEEEKREAENKAKEEEARKAEEEPKYQEVLKRDFLFERGFGSDLPRFLESGIASLGWREFCAKPDPVNANIVREF
YANLDVKDDFEVIVRGVPVQWSPEAINNLFDLQDFPHAVFNEMMVAPSSDQLSAAVREVGVEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTISPDRV
LLAFAILRSMSIDVGKIIATEIADCWRKKVGKLFFPNTITMLCRRAGVPESEDDVILPDKGIIDTPNLARLQRTQEVRQGGLIYGINTIVEQLALLASRQEFAERQALTF
WTYFKNRDAGLRRALQENFSNPYPALPAFPEDLLNPWIPPPPAEKENEEEDLACLLAWTVAAAKKNSGGIILLQQSLVLQSAQNLLLGDLREQILCWSKPGSKTATSQLV
IQFSKPTSEHETACARRESQVDWLRGLRFVGVEETTREEGDGAREKKKRQLNRGRGSPSFQNGPILPSPPPNTGRVVCQICLRPGHSALDCYNRMNYSFQGRHPPAQLAA
LVASQNSTHHVVFPSSSTWLTDFGCNAHITTDLNNLSTASEYNGEEQVSIGSDDLAVSRRREPPPPAATVVIPSCLAVSRSPSLPLITTIAALKQTAARLLNVRRFQLVG
KGVVVVESSKVERNLEAGQQALVSFGSTDWVPSTISRSRDLPKYPPVYYQAKIYRKTRKWNSRGLRQNLAQWISKQKFSQLLPERYSLQFRPQFRRLLQA