CuGenDBv2

Spg024644 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg024644
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Unknown protein
Genome location: scaffold12:20221939..20239788
RNA-Seq Expression: Spg024644
Synteny: Spg024644
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus] | e value: 5.8e-25 | % identity: 30.07 | Alignment:
Query:  PEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLKDF--PHAAFNEMVG------------IEGAQWRLSKTEKHTFQAAYLKSEANAWM
        P  VN+++V+EFYAN+    +  + VRG  + ++  AIN  F+L++    HA F E                E  +W   +T +++     L+  A  W 
Subjt:  PEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLKDF--PHAAFNEMVG------------IEGAQWRLSKTEKHTFQAAYLKSEANAWM

Query:  GFIKLRLLPTTHDSTVSRDRVFLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPED-------------EDDVPLI-------D
         F+K +L+PT+H++TVS  R+ L  +++ S  IDVG+II  ++ DC  KK   L FPN IT LCR+  V E+              D +PL+        
Subjt:  GFIKLRLLPTTHDSTVSRDRVFLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPED-------------EDDVPLI-------D

Query:  KGIIDTPNLARLQRTQEARQGGLVCGIHQMQEQLQ-LHSSRMEFVERQLQTFWSYVKTRDVALRVALQSNFSKPYPALPVFPDDLL
        K  +   ++   +   E R   L   + Q Q QL  LH          ++ F+ YVK RDV +    Q          P FPD++L
Subjt:  KGIIDTPNLARLQRTQEARQGGLVCGIHQMQEQLQ-LHSSRMEFVERQLQTFWSYVKTRDVALRVALQSNFSKPYPALPVFPDDLL

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii] | e value: 6.5e-24 | % identity: 36.84 | Alignment:
Query:  RRVKRKAGRARVVRTDTPSPPTTDSERENARREEREKKEVEDKAREEEAKKAEEEILRNEPRLESILCKPEPVNSNIVREFYANLDDQEEFQVIVRGMPV
        +RV RKA +A  V+ +T +  T        R    EK  V D +          +++  +   +     PE     +VREFYANL D  E  V VRG+ V
Subjt:  RRVKRKAGRARVVRTDTPSPPTTDSERENARREEREKKEVEDKAREEEAKKAEEEILRNEPRLESILCKPEPVNSNIVREFYANLDDQEEFQVIVRGMPV

Query:  DWSPEAINDLFNLKD--FPHAAF------------NEMVGIEGAQWRLSKTEKHTFQAAYLKSEANAWMGFIKLRLLPTTHDSTVSRDRVFLAFAILRSM
         WS EAIN +F L D    H+ F             E V + GA+W +S    +T   + L   A  W  F+K  LLPTTH  TVS+DR+ L  ++L   
Subjt:  DWSPEAINDLFNLKD--FPHAAF------------NEMVGIEGAQWRLSKTEKHTFQAAYLKSEANAWMGFIKLRLLPTTHDSTVSRDRVFLAFAILRSM

Query:  SIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQE
        SI+VG++I SEI  C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+  TQE
Subjt:  SIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii] | e value: 3.7e-35 | % identity: 34.82 | Alignment:
Query:  RRVKRKAGRARVVRTDTPSPPTTDSERENARREEREKKEVEDKAREEEAKKAEEEILRNEPRLESILCKPEPVNSNIVREFYANLDDQEEFQVIVRGMPV
        +RV RKA +A  V+ +T +  T        R    EK  V D +          +++  +   +     PE     +VREFYANL D EE  V VRG+ V
Subjt:  RRVKRKAGRARVVRTDTPSPPTTDSERENARREEREKKEVEDKAREEEAKKAEEEILRNEPRLESILCKPEPVNSNIVREFYANLDDQEEFQVIVRGMPV

Query:  DWSPEAINDLFNLKD--FPHAAF------------NEMVGIEGAQWRLSKTEKHTFQAAYLKSEANAWMGFIKLRLLPTTHDSTVSRDRVFLAFAILRSM
         WS EAIN +F L D    H+ F             E V   GA+W +S    +T   + L   A  W  F+K RLLPTTH  TVS+DR+ L  ++L   
Subjt:  DWSPEAINDLFNLKD--FPHAAF------------NEMVGIEGAQWRLSKTEKHTFQAAYLKSEANAWMGFIKLRLLPTTHDSTVSRDRVFLAFAILRSM

Query:  SIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQR---TQEARQ---------------GGLVCGIHQMQEQ
        SI+VG++I SEI  C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+ +   T+  +Q               G ++  +  ++++
Subjt:  SIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQR---TQEARQ---------------GGLVCGIHQMQEQ

Query:  L------QLH-SSRMEFVERQLQTFWSYVKTRDVALRVALQSNFSKPYPALPVFPDDLL
        L      Q H  S ++   +Q Q FW+Y K RD AL+ ALQ+NF++P P  P FP ++L
Subjt:  L------QLH-SSRMEFVERQLQTFWSYVKTRDVALRVALQSNFSKPYPALPVFPDDLL

PON59596.1 hypothetical protein PanWU01x14_158080 [Parasponia andersonii] | e value: 2.2e-24 | % identity: 38.31 | Alignment:
Query:  LKSEANAWMGFIKLRLLPTTHDSTVSRDRVFLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLAR
        L   A  W  F+K RLLPTTH  TVS+DR+ L +++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A  P   ++  L   G ID   +AR
Subjt:  LKSEANAWMGFIKLRLLPTTHDSTVSRDRVFLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLAR

Query:  LQRTQEAR--------------------QGGLVCGIHQMQEQL------QLH-SSRMEFVERQLQTFWSYVKTRDVALRVALQSNFSKPYPALPVFPDDL
        +  TQE +                     G ++  +  ++++L      Q H  S ++   +Q Q FW+Y K RD AL+ ALQ+NF++P P  P FP +L
Subjt:  LQRTQEAR--------------------QGGLVCGIHQMQEQL------QLH-SSRMEFVERQLQTFWSYVKTRDVALRVALQSNFSKPYPALPVFPDDL

Query:  L
        L
Subjt:  L

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii] | e value: 2.1e-38 | % identity: 38.97 | Alignment:
Query:  IVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLKD--FPHAAF------------NEMVGIEGAQWRLSKTEKHTFQAAYLKSEANAWMGFIKLRL
        +VREFYANL D EE  + VRG+ V WS EAIN +F L D    H+ F             E V   GA+W +S    +T   + L   A  W  F+K RL
Subjt:  IVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLKD--FPHAAF------------NEMVGIEGAQWRLSKTEKHTFQAAYLKSEANAWMGFIKLRL

Query:  LPTTHDSTVSRDRVFLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARL--------------Q
        LPTTH   VS+DR+ L  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A    +E+   L + G ID   +AR+               
Subjt:  LPTTHDSTVSRDRVFLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARL--------------Q

Query:  RTQEARQGGLVCGIHQMQEQLQLHSSRMEFVERQLQTFWSYVKTRDVALRVALQSNFSKPYPALPVFPDDLL
        R   A        + Q  + L+   S+ E   +Q Q FW+Y K RD AL+ ALQ+NF++P P  P FP ++L
Subjt:  RTQEARQGGLVCGIHQMQEQLQLHSSRMEFVERQLQTFWSYVKTRDVALRVALQSNFSKPYPALPVFPDDLL

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A2P5AGA5 Uncharacterized protein (Fragment) | e value: 3.1e-24 | % identity: 36.84 | Alignment:
Query:  RRVKRKAGRARVVRTDTPSPPTTDSERENARREEREKKEVEDKAREEEAKKAEEEILRNEPRLESILCKPEPVNSNIVREFYANLDDQEEFQVIVRGMPV
        +RV RKA +A  V+ +T +  T        R    EK  V D +          +++  +   +     PE     +VREFYANL D  E  V VRG+ V
Subjt:  RRVKRKAGRARVVRTDTPSPPTTDSERENARREEREKKEVEDKAREEEAKKAEEEILRNEPRLESILCKPEPVNSNIVREFYANLDDQEEFQVIVRGMPV

Query:  DWSPEAINDLFNLKD--FPHAAF------------NEMVGIEGAQWRLSKTEKHTFQAAYLKSEANAWMGFIKLRLLPTTHDSTVSRDRVFLAFAILRSM
         WS EAIN +F L D    H+ F             E V + GA+W +S    +T   + L   A  W  F+K  LLPTTH  TVS+DR+ L  ++L   
Subjt:  DWSPEAINDLFNLKD--FPHAAF------------NEMVGIEGAQWRLSKTEKHTFQAAYLKSEANAWMGFIKLRLLPTTHDSTVSRDRVFLAFAILRSM

Query:  SIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQE
        SI+VG++I SEI  C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+  TQE
Subjt:  SIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment) | e value: 1.8e-35 | % identity: 34.82 | Alignment:
Query:  RRVKRKAGRARVVRTDTPSPPTTDSERENARREEREKKEVEDKAREEEAKKAEEEILRNEPRLESILCKPEPVNSNIVREFYANLDDQEEFQVIVRGMPV
        +RV RKA +A  V+ +T +  T        R    EK  V D +          +++  +   +     PE     +VREFYANL D EE  V VRG+ V
Subjt:  RRVKRKAGRARVVRTDTPSPPTTDSERENARREEREKKEVEDKAREEEAKKAEEEILRNEPRLESILCKPEPVNSNIVREFYANLDDQEEFQVIVRGMPV

Query:  DWSPEAINDLFNLKD--FPHAAF------------NEMVGIEGAQWRLSKTEKHTFQAAYLKSEANAWMGFIKLRLLPTTHDSTVSRDRVFLAFAILRSM
         WS EAIN +F L D    H+ F             E V   GA+W +S    +T   + L   A  W  F+K RLLPTTH  TVS+DR+ L  ++L   
Subjt:  DWSPEAINDLFNLKD--FPHAAF------------NEMVGIEGAQWRLSKTEKHTFQAAYLKSEANAWMGFIKLRLLPTTHDSTVSRDRVFLAFAILRSM

Query:  SIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQR---TQEARQ---------------GGLVCGIHQMQEQ
        SI+VG++I SEI  C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+ +   T+  +Q               G ++  +  ++++
Subjt:  SIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQR---TQEARQ---------------GGLVCGIHQMQEQ

Query:  L------QLH-SSRMEFVERQLQTFWSYVKTRDVALRVALQSNFSKPYPALPVFPDDLL
        L      Q H  S ++   +Q Q FW+Y K RD AL+ ALQ+NF++P P  P FP ++L
Subjt:  L------QLH-SSRMEFVERQLQTFWSYVKTRDVALRVALQSNFSKPYPALPVFPDDLL

A0A2P5CEY2 Uncharacterized protein | e value: 1.1e-24 | % identity: 38.31 | Alignment:
Query:  LKSEANAWMGFIKLRLLPTTHDSTVSRDRVFLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLAR
        L   A  W  F+K RLLPTTH  TVS+DR+ L +++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A  P   ++  L   G ID   +AR
Subjt:  LKSEANAWMGFIKLRLLPTTHDSTVSRDRVFLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLAR

Query:  LQRTQEAR--------------------QGGLVCGIHQMQEQL------QLH-SSRMEFVERQLQTFWSYVKTRDVALRVALQSNFSKPYPALPVFPDDL
        +  TQE +                     G ++  +  ++++L      Q H  S ++   +Q Q FW+Y K RD AL+ ALQ+NF++P P  P FP +L
Subjt:  LQRTQEAR--------------------QGGLVCGIHQMQEQL------QLH-SSRMEFVERQLQTFWSYVKTRDVALRVALQSNFSKPYPALPVFPDDL

Query:  L
        L
Subjt:  L

A0A2P5DXM3 Uncharacterized protein | e value: 1.0e-38 | % identity: 38.97 | Alignment:
Query:  IVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLKD--FPHAAF------------NEMVGIEGAQWRLSKTEKHTFQAAYLKSEANAWMGFIKLRL
        +VREFYANL D EE  + VRG+ V WS EAIN +F L D    H+ F             E V   GA+W +S    +T   + L   A  W  F+K RL
Subjt:  IVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLKD--FPHAAF------------NEMVGIEGAQWRLSKTEKHTFQAAYLKSEANAWMGFIKLRL

Query:  LPTTHDSTVSRDRVFLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARL--------------Q
        LPTTH   VS+DR+ L  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A    +E+   L + G ID   +AR+               
Subjt:  LPTTHDSTVSRDRVFLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARL--------------Q

Query:  RTQEARQGGLVCGIHQMQEQLQLHSSRMEFVERQLQTFWSYVKTRDVALRVALQSNFSKPYPALPVFPDDLL
        R   A        + Q  + L+   S+ E   +Q Q FW+Y K RD AL+ ALQ+NF++P P  P FP ++L
Subjt:  RTQEARQGGLVCGIHQMQEQLQLHSSRMEFVERQLQTFWSYVKTRDVALRVALQSNFSKPYPALPVFPDDLL

A0A6A3BU96 Uncharacterized protein | e value: 2.8e-25 | % identity: 30.07 | Alignment:
Query:  PEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLKDF--PHAAFNEMVG------------IEGAQWRLSKTEKHTFQAAYLKSEANAWM
        P  VN+++V+EFYAN+    +  + VRG  + ++  AIN  F+L++    HA F E                E  +W   +T +++     L+  A  W 
Subjt:  PEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLKDF--PHAAFNEMVG------------IEGAQWRLSKTEKHTFQAAYLKSEANAWM

Query:  GFIKLRLLPTTHDSTVSRDRVFLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPED-------------EDDVPLI-------D
         F+K +L+PT+H++TVS  R+ L  +++ S  IDVG+II  ++ DC  KK   L FPN IT LCR+  V E+              D +PL+        
Subjt:  GFIKLRLLPTTHDSTVSRDRVFLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPED-------------EDDVPLI-------D

Query:  KGIIDTPNLARLQRTQEARQGGLVCGIHQMQEQLQ-LHSSRMEFVERQLQTFWSYVKTRDVALRVALQSNFSKPYPALPVFPDDLL
        K  +   ++   +   E R   L   + Q Q QL  LH          ++ F+ YVK RDV +    Q          P FPD++L
Subjt:  KGIIDTPNLARLQRTQEARQGGLVCGIHQMQEQLQ-LHSSRMEFVERQLQTFWSYVKTRDVALRVALQSNFSKPYPALPVFPDDLL

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGCACTGAGTAATTCAAACTTAACTGATTTTCTTCTGAATTTTTTAAGCTTGGACTGGTTAAGCTTAATTAGATCAAGAGCTAGGCTATGGCAAGTTCTTAGAATTGA
GTTAAAAGTGGTGATTATTTGTCCATGCCGGAAGAACTATTTTGCTCCAGCAGAGCTTGGTTTTGCAGCATGCTCAGAATGTATTGCTGAGCAACTTGAGGGAGCAAATT
CTGTTCTGCAGCAAAGCTGGGAGCAAAACTGCCACGTCACAGCTCGGCTCCAATCTTTGGCTTTTGGTGTCATAATCCAGGCCCGCTTCAGTACAATTGGGTCTCTTCAC
CTCCTCTCCGCTCAAGACATCGAGATGGCTCCAAAAACTCCGTCCTATCAGGGAGAGCAGCCAAAAGAACTGGAAGAAGGGAAAAATTCAGAGCAGGGTGATCAACCTAC
CGAAACTCAGCAGGAAGTTCAGGAAAAACAAGCAGAAGATGTGCCGGAACAAGGGAATGGTAAAGGAACTCAAGACCACAAAGCTGGGAATAAAACTGCCACGTCACAGC
TCGTTAGCCAATTTAATGAACCGAATTCTTGTTCTAGAGCATTTTTATGTCGCATTTTTACTCACTCTAGGAACCCAACAGAGTCTGAGAAGGAGAATACAGAAAGGGTG
GATCAGGAGAAAGAGGAAACTGAAAAGAAGGTTGAAGAAGAGGCCTCTACGAAGCAACAAGAAAACAGGGGCAAAGGAGTTGCTAAAGCAGCAGTCGAAGTAGAGGAGGC
TAAGATTGAGGAGCCAAGAATGTCGTACGCGTGCTTCATCAACGATCGTGCCAGAGCAAAATATCTGGAGATGCTGAAGAGAGACTTCTTGTTTGAAAGGGAATTCAGTG
ATGACCTGCCACATTTCTTGCGAGCCAGGATTAGAAACCACGGATGGGATCAGTGCCGACTGGAGCCCAGGACTCTTCAGGTTCAGAAAGTTGTTGCGGCAAAGTTATGG
CTGAAGCGAATCGTCCGAATAAGAAGGGAAAAACAGAACCTAGAACAGACTGAGCAGAGAGTCGCGGGTACAGAAGAAGTTCAAGAGGAGCAAACAGAGGAAGTTCAAGA
GGATCGGACCGAGGAAGTTCGAGAAGAAATTACAGAGGAAGTTCAAGAAAAGCAGGCCGAGGATGTACAAATGCAACAGGCAGAAGATGTTCAGGTACCGGATAATGAGC
CAGTGCAGGAGGCTCAATTGGAGGTGATCATGCCGGAGGTACCAAAGCGTCGCCGCGTTAAGAGGAAAGCAGGCCGCGCTAGGGTTGTCCGAACTGATACTCCTTCGCCT
CCGACAACTGATTCTGAAAGGGAAAATGCAAGAAGAGAGGAACGGGAAAAGAAGGAAGTTGAGGACAAGGCAAGAGAAGAAGAAGCGAAGAAAGCGGAAGAGGAGATTTT
GCGCAATGAACCTCGGCTGGAGTCAATTTTGTGCAAGCCGGAGCCTGTTAATTCCAACATTGTTCGGGAGTTTTACGCAAATCTTGACGATCAGGAAGAATTTCAGGTTA
TAGTTCGAGGAATGCCCGTGGACTGGAGCCCAGAAGCCATTAATGATTTGTTCAATCTCAAGGATTTTCCGCATGCGGCCTTTAATGAGATGGTTGGCATTGAGGGCGCT
CAGTGGAGACTGTCGAAGACGGAAAAGCACACATTTCAGGCTGCTTATTTGAAAAGCGAGGCCAATGCATGGATGGGTTTCATCAAGCTGCGCTTACTGCCGACAACTCA
CGACTCAACGGTATCTCGAGACCGGGTTTTTCTTGCCTTTGCTATTCTTCGTTCCATGAGTATTGATGTGGGTAAGATAATTTCTTCTGAGATTCTGGATTGCTGGAGGA
AAAAGGTGGGGAAGTTGTTTTTCCCCAACACTATCACGATGCTATGCCGAAGGGCAGGGGTGCCAGAAGATGAGGATGATGTGCCGTTAATAGACAAGGGGATAATTGAC
ACACCAAATCTGGCTAGGCTCCAGAGGACGCAGGAAGCACGCCAAGGAGGTTTGGTGTGCGGCATCCACCAAATGCAGGAGCAATTACAGCTGCATTCCAGCAGGATGGA
ATTTGTTGAAAGACAATTGCAGACTTTCTGGAGCTATGTCAAAACGAGGGATGTCGCGTTGAGGGTAGCCTTGCAGTCGAATTTTTCCAAGCCATATCCGGCTTTACCCG
TATTCCCTGACGACCTACTGAACCCCTGGATCCCACCCCCACCTATTGAACGAGAGGATGTTGATGAAGAGCAGGAAACCTTTTGCTTGAGCATTTTCTCTGACCTGGTC
ATTGCTGCGGCAAAGAAAATTCTGGAGGTAGTGTTGACTTATGTGATCCGCTTTAAGCTTAGGTCTAGTCCCGCGCTTACAGAGCTTGGTTTTGCAGAGTGCTCAGATAA
AGAATCTGTTGCTGGGCGCGACTTGAGGGAGCAAATGTTGTGCTGCAGCAAAGCTGGGAGTAAAACTGCGACGTCACAGCTCGTTAGCCAATTTAATGAACCGAATTCTT
GTTCTAGAGCATTTTTATGTCGCATTTTTACTCACTCTAGGAACCCAACAGAGTCTGAGAAGGAGAATACAGAAAGGGTGGATCAGGAGAAAGAGGAAACTGAAAAGAAG
GGTGAAGAAGAGGCCTCGACGAAGCAACAAGAAGACAGGGGCAAAGGAGTTGCTAAAGCAGCAGTCGAAGTAGAGGAGGCTAAGACTGAGGAGCCAAGAATGTCTTACGC
GTGCTTCATCAACGATCGTGCCAGAGCAAAATATCTGGAGATGCTGAAGAGAGACTTCTTTGCCAGTAAATTCGATGTTGTCCGCGAGTTCTACGCGAACATAGATGAAG
AAGAAGGGGCTCAGTGGAGGTTGTTGAAAACTGAGAAGCGCACATTTCAGGCAACCTATTTGAATAGAAAGGCCAACACCTGGTTGGGCTTCATCAAATTTCGTCTTCTA
CCGACAACTCACGATTCTACAGTCTCCCGAGATCGAGTGCTCCTGGTTTTTGCAATCTTGAGGTCCCTAAGTATTGATGTTGGTAAGGTAATCTCCAGTGAAATTCACAA
CTACTGGCGCAAAAAGGTGGACAAATTATCCTTCCCAAATACGATTACTATGCTGTGTAATCGGGCAGGGGTCCCCACAACTCCAGACGATGTCATTTTGCTTGACAAGG
GGATTATTGACACGTCCAACCTAGCAAGGCTTCAACATACACAAGAAGCTCGGCCAGGGGGGCTATTGTGTGGCATCCATCAAATTTTAGAGCAACTGCAACTTTCAGCC
AGTAGGCAGGAGTATGCTGAGAGGCAAGCTCGGACCTACTGGACTTATGCTAAAAGGAGGGATGCCACCCTAAGGAGGGCGTTGCAGTCAAATTTTTCAAAACCATATCA
GGCCTTCCCAGTTTTCTCTGATGACTTGTTTAATCCTTGGATCCCGCCACCGCCAGTCGAAAGAGAGGAGAAGGAAGAAGATGATCGGGTTCAGGAAGATTGA
mRNA sequence
ATGGCACTGAGTAATTCAAACTTAACTGATTTTCTTCTGAATTTTTTAAGCTTGGACTGGTTAAGCTTAATTAGATCAAGAGCTAGGCTATGGCAAGTTCTTAGAATTGA
GTTAAAAGTGGTGATTATTTGTCCATGCCGGAAGAACTATTTTGCTCCAGCAGAGCTTGGTTTTGCAGCATGCTCAGAATGTATTGCTGAGCAACTTGAGGGAGCAAATT
CTGTTCTGCAGCAAAGCTGGGAGCAAAACTGCCACGTCACAGCTCGGCTCCAATCTTTGGCTTTTGGTGTCATAATCCAGGCCCGCTTCAGTACAATTGGGTCTCTTCAC
CTCCTCTCCGCTCAAGACATCGAGATGGCTCCAAAAACTCCGTCCTATCAGGGAGAGCAGCCAAAAGAACTGGAAGAAGGGAAAAATTCAGAGCAGGGTGATCAACCTAC
CGAAACTCAGCAGGAAGTTCAGGAAAAACAAGCAGAAGATGTGCCGGAACAAGGGAATGGTAAAGGAACTCAAGACCACAAAGCTGGGAATAAAACTGCCACGTCACAGC
TCGTTAGCCAATTTAATGAACCGAATTCTTGTTCTAGAGCATTTTTATGTCGCATTTTTACTCACTCTAGGAACCCAACAGAGTCTGAGAAGGAGAATACAGAAAGGGTG
GATCAGGAGAAAGAGGAAACTGAAAAGAAGGTTGAAGAAGAGGCCTCTACGAAGCAACAAGAAAACAGGGGCAAAGGAGTTGCTAAAGCAGCAGTCGAAGTAGAGGAGGC
TAAGATTGAGGAGCCAAGAATGTCGTACGCGTGCTTCATCAACGATCGTGCCAGAGCAAAATATCTGGAGATGCTGAAGAGAGACTTCTTGTTTGAAAGGGAATTCAGTG
ATGACCTGCCACATTTCTTGCGAGCCAGGATTAGAAACCACGGATGGGATCAGTGCCGACTGGAGCCCAGGACTCTTCAGGTTCAGAAAGTTGTTGCGGCAAAGTTATGG
CTGAAGCGAATCGTCCGAATAAGAAGGGAAAAACAGAACCTAGAACAGACTGAGCAGAGAGTCGCGGGTACAGAAGAAGTTCAAGAGGAGCAAACAGAGGAAGTTCAAGA
GGATCGGACCGAGGAAGTTCGAGAAGAAATTACAGAGGAAGTTCAAGAAAAGCAGGCCGAGGATGTACAAATGCAACAGGCAGAAGATGTTCAGGTACCGGATAATGAGC
CAGTGCAGGAGGCTCAATTGGAGGTGATCATGCCGGAGGTACCAAAGCGTCGCCGCGTTAAGAGGAAAGCAGGCCGCGCTAGGGTTGTCCGAACTGATACTCCTTCGCCT
CCGACAACTGATTCTGAAAGGGAAAATGCAAGAAGAGAGGAACGGGAAAAGAAGGAAGTTGAGGACAAGGCAAGAGAAGAAGAAGCGAAGAAAGCGGAAGAGGAGATTTT
GCGCAATGAACCTCGGCTGGAGTCAATTTTGTGCAAGCCGGAGCCTGTTAATTCCAACATTGTTCGGGAGTTTTACGCAAATCTTGACGATCAGGAAGAATTTCAGGTTA
TAGTTCGAGGAATGCCCGTGGACTGGAGCCCAGAAGCCATTAATGATTTGTTCAATCTCAAGGATTTTCCGCATGCGGCCTTTAATGAGATGGTTGGCATTGAGGGCGCT
CAGTGGAGACTGTCGAAGACGGAAAAGCACACATTTCAGGCTGCTTATTTGAAAAGCGAGGCCAATGCATGGATGGGTTTCATCAAGCTGCGCTTACTGCCGACAACTCA
CGACTCAACGGTATCTCGAGACCGGGTTTTTCTTGCCTTTGCTATTCTTCGTTCCATGAGTATTGATGTGGGTAAGATAATTTCTTCTGAGATTCTGGATTGCTGGAGGA
AAAAGGTGGGGAAGTTGTTTTTCCCCAACACTATCACGATGCTATGCCGAAGGGCAGGGGTGCCAGAAGATGAGGATGATGTGCCGTTAATAGACAAGGGGATAATTGAC
ACACCAAATCTGGCTAGGCTCCAGAGGACGCAGGAAGCACGCCAAGGAGGTTTGGTGTGCGGCATCCACCAAATGCAGGAGCAATTACAGCTGCATTCCAGCAGGATGGA
ATTTGTTGAAAGACAATTGCAGACTTTCTGGAGCTATGTCAAAACGAGGGATGTCGCGTTGAGGGTAGCCTTGCAGTCGAATTTTTCCAAGCCATATCCGGCTTTACCCG
TATTCCCTGACGACCTACTGAACCCCTGGATCCCACCCCCACCTATTGAACGAGAGGATGTTGATGAAGAGCAGGAAACCTTTTGCTTGAGCATTTTCTCTGACCTGGTC
ATTGCTGCGGCAAAGAAAATTCTGGAGGTAGTGTTGACTTATGTGATCCGCTTTAAGCTTAGGTCTAGTCCCGCGCTTACAGAGCTTGGTTTTGCAGAGTGCTCAGATAA
AGAATCTGTTGCTGGGCGCGACTTGAGGGAGCAAATGTTGTGCTGCAGCAAAGCTGGGAGTAAAACTGCGACGTCACAGCTCGTTAGCCAATTTAATGAACCGAATTCTT
GTTCTAGAGCATTTTTATGTCGCATTTTTACTCACTCTAGGAACCCAACAGAGTCTGAGAAGGAGAATACAGAAAGGGTGGATCAGGAGAAAGAGGAAACTGAAAAGAAG
GGTGAAGAAGAGGCCTCGACGAAGCAACAAGAAGACAGGGGCAAAGGAGTTGCTAAAGCAGCAGTCGAAGTAGAGGAGGCTAAGACTGAGGAGCCAAGAATGTCTTACGC
GTGCTTCATCAACGATCGTGCCAGAGCAAAATATCTGGAGATGCTGAAGAGAGACTTCTTTGCCAGTAAATTCGATGTTGTCCGCGAGTTCTACGCGAACATAGATGAAG
AAGAAGGGGCTCAGTGGAGGTTGTTGAAAACTGAGAAGCGCACATTTCAGGCAACCTATTTGAATAGAAAGGCCAACACCTGGTTGGGCTTCATCAAATTTCGTCTTCTA
CCGACAACTCACGATTCTACAGTCTCCCGAGATCGAGTGCTCCTGGTTTTTGCAATCTTGAGGTCCCTAAGTATTGATGTTGGTAAGGTAATCTCCAGTGAAATTCACAA
CTACTGGCGCAAAAAGGTGGACAAATTATCCTTCCCAAATACGATTACTATGCTGTGTAATCGGGCAGGGGTCCCCACAACTCCAGACGATGTCATTTTGCTTGACAAGG
GGATTATTGACACGTCCAACCTAGCAAGGCTTCAACATACACAAGAAGCTCGGCCAGGGGGGCTATTGTGTGGCATCCATCAAATTTTAGAGCAACTGCAACTTTCAGCC
AGTAGGCAGGAGTATGCTGAGAGGCAAGCTCGGACCTACTGGACTTATGCTAAAAGGAGGGATGCCACCCTAAGGAGGGCGTTGCAGTCAAATTTTTCAAAACCATATCA
GGCCTTCCCAGTTTTCTCTGATGACTTGTTTAATCCTTGGATCCCGCCACCGCCAGTCGAAAGAGAGGAGAAGGAAGAAGATGATCGGGTTCAGGAAGATTGA
Protein sequence
MALSNSNLTDFLLNFLSLDWLSLIRSRARLWQVLRIELKVVIICPCRKNYFAPAELGFAACSECIAEQLEGANSVLQQSWEQNCHVTARLQSLAFGVIIQARFSTIGSLH
LLSAQDIEMAPKTPSYQGEQPKELEEGKNSEQGDQPTETQQEVQEKQAEDVPEQGNGKGTQDHKAGNKTATSQLVSQFNEPNSCSRAFLCRIFTHSRNPTESEKENTERV
DQEKEETEKKVEEEASTKQQENRGKGVAKAAVEVEEAKIEEPRMSYACFINDRARAKYLEMLKRDFLFEREFSDDLPHFLRARIRNHGWDQCRLEPRTLQVQKVVAAKLW
LKRIVRIRREKQNLEQTEQRVAGTEEVQEEQTEEVQEDRTEEVREEITEEVQEKQAEDVQMQQAEDVQVPDNEPVQEAQLEVIMPEVPKRRRVKRKAGRARVVRTDTPSP
PTTDSERENARREEREKKEVEDKAREEEAKKAEEEILRNEPRLESILCKPEPVNSNIVREFYANLDDQEEFQVIVRGMPVDWSPEAINDLFNLKDFPHAAFNEMVGIEGA
QWRLSKTEKHTFQAAYLKSEANAWMGFIKLRLLPTTHDSTVSRDRVFLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIID
TPNLARLQRTQEARQGGLVCGIHQMQEQLQLHSSRMEFVERQLQTFWSYVKTRDVALRVALQSNFSKPYPALPVFPDDLLNPWIPPPPIEREDVDEEQETFCLSIFSDLV
IAAAKKILEVVLTYVIRFKLRSSPALTELGFAECSDKESVAGRDLREQMLCCSKAGSKTATSQLVSQFNEPNSCSRAFLCRIFTHSRNPTESEKENTERVDQEKEETEKK
GEEEASTKQQEDRGKGVAKAAVEVEEAKTEEPRMSYACFINDRARAKYLEMLKRDFFASKFDVVREFYANIDEEEGAQWRLLKTEKRTFQATYLNRKANTWLGFIKFRLL
PTTHDSTVSRDRVLLVFAILRSLSIDVGKVISSEIHNYWRKKVDKLSFPNTITMLCNRAGVPTTPDDVILLDKGIIDTSNLARLQHTQEARPGGLLCGIHQILEQLQLSA
SRQEYAERQARTYWTYAKRRDATLRRALQSNFSKPYQAFPVFSDDLFNPWIPPPPVEREEKEEDDRVQED