CuGenDBv2

Spg024121 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg024121
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Unknown protein
Genome location: scaffold4:20087907..20092399
RNA-Seq Expression: Spg024121
Synteny: Spg024121
Gene Ontology terms: NA
InterPro domains: NA
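
The genome location above follows a `sequence:start..end` pattern. A minimal sketch of splitting it into its parts is shown below; the function names `parse_location` and `span_length` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end).

    The pattern is inferred from the single example on this page; other
    records may use a different layout.
    """
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return seqid, start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    seqid, start, end = parse_location("scaffold4:20087907..20092399")
    print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 4,493 bases on scaffold4.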


Homology
GenBank top hits (e value, % identity, alignment)
KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus] (e value: 1.6e-29; % identity: 27.61)
Query:  LPYNRFINNLARAKYVEMLRRDFLFE----------RGFGDDLPRFLRTGIVNLGWSQFCAKPEPINSNIVREFYANIDDHEEFQVIVRGVPVDWSPGAI
        + + +F N+ A+A++     R+  FE           GFG D+       ++ L W +F   P  +N+++V+EFYANI    +  + VRG  + ++  AI
Subjt:  LPYNRFINNLARAKYVEMLRRDFLFE----------RGFGDDLPRFLRTGIVNLGWSQFCAKPEPINSNIVREFYANIDDHEEFQVIVRGVPVDWSPGAI

Query:  NSLFNLQDF--PHTGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSMVSRDRVLLAFAILHSMSIDV
        N  F+LQ+    H  F E      +++ +  + ++  E  +W   +T + +     L+  A  W  F+K +L+PT+H++ VS  R+LL  +++ S  IDV
Subjt:  NSLFNLQDF--PHTGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSMVSRDRVLLAFAILHSMSIDV

Query:  GKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPED-------------EDDVPLI-------DKGIIDTPNLARLQRTQEARQGGLVFGIHQMQEQLQ
        G+II  ++ DC  KK   L FPN IT LCR+  V E+              D +PL+        K  +   ++   +   E R   L   + Q Q QL 
Subjt:  GKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPED-------------EDDVPLI-------DKGIIDTPNLARLQRTQEARQGGLVFGIHQMQEQLQ

Query:  -LHSSRMEFAERQFQTFWNYVKRRDAALRVALQSNFSEPYPALPVFPDDLLNPWVPPPPVEREEDGEEQGQED
         LH           + F+ YVK RD  +    Q          P FPD++L  +      E E D  +    D
Subjt:  -LHSSRMEFAERQFQTFWNYVKRRDAALRVALQSNFSEPYPALPVFPDDLLNPWVPPPPVEREEDGEEQGQED

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii] (e value: 6.4e-34; % identity: 39.37)
Query:  RFINNLARAKYVEMLR-RDFLFERGFGDD-------LPRFLRTGIVNLGWSQFCAKPEPINSNIVREFYANIDDHEEFQVIVRGVPVDWSPGAINSLFNL
        +F    A  +Y   ++ R    E+GF  D       LP F+   I    W QFCA PE     +VREFYAN+ D  E  V VRGV V WS  AIN++F L
Subjt:  RFINNLARAKYVEMLR-RDFLFERGFGDD-------LPRFLRTGIVNLGWSQFCAKPEPINSNIVREFYANIDDHEEFQVIVRGVPVDWSPGAINSLFNL

Query:  QDFPHTGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSMVSRDRVLLAFAILHSMSIDVGKIISSEI
         D P    +E +   +   L   +  V + GA+W +S     T   + L   A  W  F+K  LLPTTH   VS+DR+LL  ++L   SI+VG++I SEI
Subjt:  QDFPHTGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSMVSRDRVLLAFAILHSMSIDVGKIISSEI

Query:  LDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQE
          C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+  TQE
Subjt:  LDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii] (e value: 2.3e-44; % identity: 35.6)
Query:  RFINNLARAKYVEMLR-RDFLFERGFGDD-------LPRFLRTGIVNLGWSQFCAKPEPINSNIVREFYANIDDHEEFQVIVRGVPVDWSPGAINSLFNL
        +F    A  +Y   ++ R    E+GF  D       LP F+   I    W QFCA PE     +VREFYAN+ D EE  V VRGV V WS  AIN++F L
Subjt:  RFINNLARAKYVEMLR-RDFLFERGFGDD-------LPRFLRTGIVNLGWSQFCAKPEPINSNIVREFYANIDDHEEFQVIVRGVPVDWSPGAINSLFNL

Query:  QDFPHTGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSMVSRDRVLLAFAILHSMSIDVGKIISSEI
         D P    +E +   +   L   +  V   GA+W +S     T   + L   A  W  F+K RLLPTTH   VS+DR+LL  ++L   SI+VG++I SEI
Subjt:  QDFPHTGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSMVSRDRVLLAFAILHSMSIDVGKIISSEI

Query:  LDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQR---TQEARQ---------------GGLVFGIHQMQEQL------QLH-S
          C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+ +   T+  +Q               G ++  +  ++++L      Q H  
Subjt:  LDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQR---TQEARQ---------------GGLVFGIHQMQEQL------QLH-S

Query:  SRMEFAERQFQTFWNYVKRRDAALRVALQSNFSEPYPALPVFPDDLLNPWVPPPPVEREEDGEEQGQE
        S ++   +Q Q FW Y K RD AL+ ALQ+NF+ P P  P FP ++L         E ++DG  +  E
Subjt:  SRMEFAERQFQTFWNYVKRRDAALRVALQSNFSEPYPALPVFPDDLLNPWVPPPPVEREEDGEEQGQE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii] (e value: 4.8e-29; % identity: 34.96)
Query:  RFINNLARAKYVEMLR-------RDFLFERGFGDDLPRFLRTGIVNLGWSQFCAKPEPINSNIVREFYANIDDHEEFQVIVRGVPVDWSPGAINSLFNLQ
        +F +  A  +Y E ++       ++F+++     + P F+   I+   W  FCA PE     +VREFY N+ + ++  V +RGV V  S  AIN++F+L 
Subjt:  RFINNLARAKYVEMLR-------RDFLFERGFGDDLPRFLRTGIVNLGWSQFCAKPEPINSNIVREFYANIDDHEEFQVIVRGVPVDWSPGAINSLFNLQ

Query:  DFPHTGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSMVSRDRVLLAFAILHSMSIDVGKIISSEIL
        D P    +E V   +  +L   +  V I GA+W +S     T   + L   A  W  F+K RLLPTTH   VS++ V L +++L   SI+VG++I  EI 
Subjt:  DFPHTGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSMVSRDRVLLAFAILHSMSIDVGKIISSEIL

Query:  DCWRKKVGKLFFPNTITMLCRRAGVP
         C  +K G LFFP+ IT +CR    P
Subjt:  DCWRKKVGKLFFPNTITMLCRRAGVP

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii] (e value: 1.0e-39; % identity: 38.46)
Query:  IVREFYANIDDHEEFQVIVRGVPVDWSPGAINSLFNLQD--FPHTGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIK
        +VREFYAN+ D EE  + VRGV V WS  AIN++F L D    H+ F E +  P   +L   +  V   GA+W +S     T   + L   A  W  F+K
Subjt:  IVREFYANIDDHEEFQVIVRGVPVDWSPGAINSLFNLQD--FPHTGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIK

Query:  LRLLPTTHDSMVSRDRVLLAFAILHSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARL------QRTQE-
         RLLPTTH  +VS+DR+LL  ++L+  SI+VG++I SEI  C  +K G LFFP+ IT LCR A    +E+   L + G ID   +AR+      + TQ+ 
Subjt:  LRLLPTTHDSMVSRDRVLLAFAILHSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARL------QRTQE-

Query:  ----------ARQGGLVFGIHQMQEQLQLHSSRMEFAERQFQTFWNYVKRRDAALRVALQSNFSEPYPALPVFPDDLLNPWVPPPPVEREEDGEEQGQE
                  +R  G V    Q  + L+   S+ E   +Q Q FW Y K RD AL+ ALQ+NF+ P P  P FP ++L         E ++DG  +  E
Subjt:  ----------ARQGGLVFGIHQMQEQLQLHSSRMEFAERQFQTFWNYVKRRDAALRVALQSNFSEPYPALPVFPDDLLNPWVPPPPVEREEDGEEQGQE

TrEMBL top hits (e value, % identity, alignment)
A0A2C9KDU0 Uncharacterized protein (e value: 2.0e-04; % identity: 31.13)
Query:  IKERENEDKEVPVTPEVQKAKTKKKKTLEEKEAKRRRRQQRAEDQEALQK----ATDDVAATVVEENPKEPEEQNLEQTEQRVADTEEVQEEQTEEVQEE
        +KE E E+KE       ++ + K+++  +EKE K+ +  +  ED+E  +K      D++     EE  KE EE+ +++ E++  + +EV+E + EEV E+
Subjt:  IKERENEDKEVPVTPEVQKAKTKKKKTLEEKEAKRRRRQQRAEDQEALQK----ATDDVAATVVEENPKEPEEQNLEQTEQRVADTEEVQEEQTEEVQEE

Query:  QTEEVQEKQTEDTQEGRVEDVQVTDNEPVQEARVEVIMPEVPKRRRVKRKAGRVRVVRTDTPSPPTTDSERENAGREEREKKEAEDKAREEGAKKAEEEI
        + EEV+EK+ +D +    EDV+  + E   E + EV + EV +   VK K  +      +         E+E  G +E+E +E  +K  EE  +K E EI
Subjt:  QTEEVQEKQTEDTQEGRVEDVQVTDNEPVQEARVEVIMPEVPKRRRVKRKAGRVRVVRTDTPSPPTTDSERENAGREEREKKEAEDKAREEGAKKAEEEI

Query:  LPKQAED---KGKGIAEASGEADEIEEPRLPYNRFINNLARAKKERDNE-----EEEVLVTPEAPKVKAKKKKTPEEKEAKRRRRQQRAEDQEVVEKTEP
          K+ E+   K +  A+A  E D  E+        +       KE++ E     EEEV    E  K K K+ K  EE+E K +      E++EV EK E 
Subjt:  LPKQAED---KGKGIAEASGEADEIEEPRLPYNRFINNLARAKKERDNE-----EEEVLVTPEAPKVKAKKKKTPEEKEAKRRRRQQRAEDQEVVEKTEP

Query:  GVAD-TEEVREENTEEVREENTKEVREEITEEVQEKQAEDTQEGRAEDVQIRVVRTDTPSPPTTDSKRENAGREE---REKKEAEDKAREEEAKKAKEEI
         V +  EEV+E   EEV+E+  +EV+E+  EEV+EK+ E+ +E +  +++   V+    +    + + +   +EE    E+KE EDK +EEE  + KEE 
Subjt:  GVAD-TEEVREENTEEVREENTKEVREEITEEVQEKQAEDTQEGRAEDVQIRVVRTDTPSPPTTDSKRENAGREE---REKKEAEDKAREEEAKKAKEEI

Query:  LPKQAEDKGKGIAEASGEADEIEE
        + ++ E K K + E   E +++EE
Subjt:  LPKQAEDKGKGIAEASGEADEIEE

A0A2P5AGA5 Uncharacterized protein (Fragment) (e value: 3.1e-34; % identity: 39.37)
Query:  RFINNLARAKYVEMLR-RDFLFERGFGDD-------LPRFLRTGIVNLGWSQFCAKPEPINSNIVREFYANIDDHEEFQVIVRGVPVDWSPGAINSLFNL
        +F    A  +Y   ++ R    E+GF  D       LP F+   I    W QFCA PE     +VREFYAN+ D  E  V VRGV V WS  AIN++F L
Subjt:  RFINNLARAKYVEMLR-RDFLFERGFGDD-------LPRFLRTGIVNLGWSQFCAKPEPINSNIVREFYANIDDHEEFQVIVRGVPVDWSPGAINSLFNL

Query:  QDFPHTGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSMVSRDRVLLAFAILHSMSIDVGKIISSEI
         D P    +E +   +   L   +  V + GA+W +S     T   + L   A  W  F+K  LLPTTH   VS+DR+LL  ++L   SI+VG++I SEI
Subjt:  QDFPHTGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSMVSRDRVLLAFAILHSMSIDVGKIISSEI

Query:  LDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQE
          C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+  TQE
Subjt:  LDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment) (e value: 1.1e-44; % identity: 35.6)
Query:  RFINNLARAKYVEMLR-RDFLFERGFGDD-------LPRFLRTGIVNLGWSQFCAKPEPINSNIVREFYANIDDHEEFQVIVRGVPVDWSPGAINSLFNL
        +F    A  +Y   ++ R    E+GF  D       LP F+   I    W QFCA PE     +VREFYAN+ D EE  V VRGV V WS  AIN++F L
Subjt:  RFINNLARAKYVEMLR-RDFLFERGFGDD-------LPRFLRTGIVNLGWSQFCAKPEPINSNIVREFYANIDDHEEFQVIVRGVPVDWSPGAINSLFNL

Query:  QDFPHTGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSMVSRDRVLLAFAILHSMSIDVGKIISSEI
         D P    +E +   +   L   +  V   GA+W +S     T   + L   A  W  F+K RLLPTTH   VS+DR+LL  ++L   SI+VG++I SEI
Subjt:  QDFPHTGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSMVSRDRVLLAFAILHSMSIDVGKIISSEI

Query:  LDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQR---TQEARQ---------------GGLVFGIHQMQEQL------QLH-S
          C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+ +   T+  +Q               G ++  +  ++++L      Q H  
Subjt:  LDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQR---TQEARQ---------------GGLVFGIHQMQEQL------QLH-S

Query:  SRMEFAERQFQTFWNYVKRRDAALRVALQSNFSEPYPALPVFPDDLLNPWVPPPPVEREEDGEEQGQE
        S ++   +Q Q FW Y K RD AL+ ALQ+NF+ P P  P FP ++L         E ++DG  +  E
Subjt:  SRMEFAERQFQTFWNYVKRRDAALRVALQSNFSEPYPALPVFPDDLLNPWVPPPPVEREEDGEEQGQE

A0A2P5DXM3 Uncharacterized protein (e value: 4.9e-40; % identity: 38.46)
Query:  IVREFYANIDDHEEFQVIVRGVPVDWSPGAINSLFNLQD--FPHTGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIK
        +VREFYAN+ D EE  + VRGV V WS  AIN++F L D    H+ F E +  P   +L   +  V   GA+W +S     T   + L   A  W  F+K
Subjt:  IVREFYANIDDHEEFQVIVRGVPVDWSPGAINSLFNLQD--FPHTGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIK

Query:  LRLLPTTHDSMVSRDRVLLAFAILHSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARL------QRTQE-
         RLLPTTH  +VS+DR+LL  ++L+  SI+VG++I SEI  C  +K G LFFP+ IT LCR A    +E+   L + G ID   +AR+      + TQ+ 
Subjt:  LRLLPTTHDSMVSRDRVLLAFAILHSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARL------QRTQE-

Query:  ----------ARQGGLVFGIHQMQEQLQLHSSRMEFAERQFQTFWNYVKRRDAALRVALQSNFSEPYPALPVFPDDLLNPWVPPPPVEREEDGEEQGQE
                  +R  G V    Q  + L+   S+ E   +Q Q FW Y K RD AL+ ALQ+NF+ P P  P FP ++L         E ++DG  +  E
Subjt:  ----------ARQGGLVFGIHQMQEQLQLHSSRMEFAERQFQTFWNYVKRRDAALRVALQSNFSEPYPALPVFPDDLLNPWVPPPPVEREEDGEEQGQE

A0A6A3BU96 Uncharacterized protein (e value: 7.9e-30; % identity: 27.61)
Query:  LPYNRFINNLARAKYVEMLRRDFLFE----------RGFGDDLPRFLRTGIVNLGWSQFCAKPEPINSNIVREFYANIDDHEEFQVIVRGVPVDWSPGAI
        + + +F N+ A+A++     R+  FE           GFG D+       ++ L W +F   P  +N+++V+EFYANI    +  + VRG  + ++  AI
Subjt:  LPYNRFINNLARAKYVEMLRRDFLFE----------RGFGDDLPRFLRTGIVNLGWSQFCAKPEPINSNIVREFYANIDDHEEFQVIVRGVPVDWSPGAI

Query:  NSLFNLQDF--PHTGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSMVSRDRVLLAFAILHSMSIDV
        N  F+LQ+    H  F E      +++ +  + ++  E  +W   +T + +     L+  A  W  F+K +L+PT+H++ VS  R+LL  +++ S  IDV
Subjt:  NSLFNLQDF--PHTGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSMVSRDRVLLAFAILHSMSIDV

Query:  GKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPED-------------EDDVPLI-------DKGIIDTPNLARLQRTQEARQGGLVFGIHQMQEQLQ
        G+II  ++ DC  KK   L FPN IT LCR+  V E+              D +PL+        K  +   ++   +   E R   L   + Q Q QL 
Subjt:  GKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPED-------------EDDVPLI-------DKGIIDTPNLARLQRTQEARQGGLVFGIHQMQEQLQ

Query:  -LHSSRMEFAERQFQTFWNYVKRRDAALRVALQSNFSEPYPALPVFPDDLLNPWVPPPPVEREEDGEEQGQED
         LH           + F+ YVK RD  +    Q          P FPD++L  +      E E D  +    D
Subjt:  -LHSSRMEFAERQFQTFWNYVKRRDAALRVALQSNFSEPYPALPVFPDDLLNPWVPPPPVEREEDGEEQGQED

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCGTAATCTGATGAGGCCACGTGTCGAACAGAGATCATCCAAGGGCGTTGATGGTGGACAGCACATGATTATTTACCATGGGCGATTAAAATTCAGGAATTTATCGTT
GATGGATTTAAATTTCAGGATTGCATGGGGTTTGAACGTTTCACTTCATCTTCTTGATTCAAATCTTTGCCTGTTACATTCTTCTTTCTTTGTTGTTATTCTCTGTCAAC
CCCTTGCGTCTTCAATGGCCAAAACAAGAGCTATAAAAGAGAGGGAGAATGAGGACAAAGAGGTACCTGTTACCCCTGAAGTACAGAAAGCGAAAACGAAAAAGAAGAAG
ACGCTAGAGGAGAAAGAAGCGAAAAGGAGGAGAAGGCAACAGAGGGCCGAGGATCAGGAGGCTCTACAGAAAGCGACGGATGATGTGGCTGCCACAGTAGTTGAAGAAAA
CCCGAAAGAACCAGAAGAACAGAACCTAGAGCAAACTGAGCAGAGAGTCGCGGATACAGAAGAAGTTCAAGAGGAGCAAACAGAAGAAGTTCAAGAGGAGCAAACAGAGG
AAGTTCAAGAAAAACAGACCGAAGATACGCAAGAAGGTAGGGTAGAAGATGTTCAGGTAACGGATAATGAGCCAGTGCAGGAGGCTCGAGTGGAGGTGATCATGCCAGAA
GTACCAAAGCGTCGCCGCGTTAAGAGGAAAGCAGGTCGGGTTAGGGTTGTCCGAACTGATACTCCTTCGCCTCCAACCACTGATTCTGAAAGAGAAAATGCAGGAAGAGA
GGAACGAGAAAAGAAGGAAGCTGAGGACAAGGCTAGAGAAGAAGGAGCAAAGAAAGCGGAAGAAGAGATTTTGCCCAAACAAGCGGAAGACAAGGGAAAAGGTATTGCTG
AGGCATCAGGTGAGGCTGACGAGATTGAGGAACCGAGATTACCGTACAATCGCTTCATCAATAACCTTGCTCGGGCAAAAAAAGAGAGAGATAATGAGGAAGAGGAGGTA
CTCGTGACCCCCGAAGCACCGAAAGTGAAGGCAAAAAAGAAGAAGACACCAGAAGAAAAAGAAGCTAAAAGAAGAAGAAGACAGCAGAGGGCTGAGGATCAAGAAGTGGT
AGAGAAGACTGAGCCAGGTGTTGCGGATACAGAGGAAGTTCGAGAAGAAAATACAGAGGAAGTCCGAGAAGAAAATACAAAGGAAGTTCGAGAGGAAATTACAGAGGAAG
TTCAAGAAAAGCAGGCCGAGGATACGCAAGAAGGTAGGGCAGAAGATGTTCAGATTAGGGTTGTCCGAACTGATACTCCTTCGCCTCCAACCACTGATTCTAAAAGAGAA
AATGCAGGAAGAGAGGAACGAGAAAAGAAGGAAGCTGAGGACAAGGCTAGAGAAGAAGAAGCAAAGAAAGCGAAAGAAGAGATTTTGCCCAAACAAGCGGAAGACAAGGG
CAAAGGTATTGCTGAGGCATCGGGTGAGGCTGACGAGATTGAGGAACCGAGATTACCGTACAATCGCTTCATCAATAACCTTGCTCGGGCAAAGTATGTTGAGATGCTGA
GACGGGACTTCCTGTTTGAACGAGGATTTGGCGATGATCTGCCACGGTTCTTGAGGACTGGAATAGTGAACCTCGGCTGGAGTCAATTTTGTGCAAAGCCGGAACCTATT
AATTCAAACATTGTTCGGGAATTTTACGCAAATATTGACGATCACGAAGAATTTCAGGTTATCGTTCGAGGAGTGCCCGTTGACTGGAGCCCAGGAGCCATTAATTCTTT
GTTTAACCTCCAGGACTTTCCACACACAGGCTTTAATGAGATGGTGGTCGCACCATCTAACGACCAACTAAATGCGGCTGTCCGAGAGGTTGGCATCGAGGGGGCCCAGT
GGAGATTGTCGAAGACGGAAAAGCGCACATTTCAGGCTGCTTATTTGAAGAGCGAGGCCAATACATGGATGGGTTTCATCAAGCTGCGCTTATTGCCGACAACTCACGAC
TCAATGGTGTCTCGAGACCGGGTTTTGCTTGCCTTTGCTATTCTTCATTCGATGAGTATTGATGTGGGTAAAATAATTTCTTCTGAGATTCTTGATTGCTGGCGGAAAAA
GGTGGGGAAGCTGTTTTTCCCCAACACTATCACGATGCTATGCCGAAGGGCAGGGGTGCCAGAGGATGAGGATGATGTGCCACTAATAGACAAGGGAATAATTGACACAC
CAAATCTGGCTAGGCTTCAGAGGACGCAGGAAGCACGCCAAGGAGGTTTGGTGTTCGGCATCCACCAAATGCAGGAGCAATTACAGCTGCATTCCAGTAGGATGGAGTTT
GCTGAAAGGCAATTTCAGACTTTCTGGAACTATGTGAAAAGGAGGGATGCCGCGTTGAGGGTAGCTTTGCAGTCGAATTTTTCCGAGCCATATCCGGCTTTACCCGTATT
CCCTGATGACCTACTGAACCCTTGGGTTCCGCCCCCACCTGTTGAACGAGAAGAAGATGGTGAAGAGCAGGGTCAGGAAGATTAA
mRNA sequence
ATGCGTAATCTGATGAGGCCACGTGTCGAACAGAGATCATCCAAGGGCGTTGATGGTGGACAGCACATGATTATTTACCATGGGCGATTAAAATTCAGGAATTTATCGTT
GATGGATTTAAATTTCAGGATTGCATGGGGTTTGAACGTTTCACTTCATCTTCTTGATTCAAATCTTTGCCTGTTACATTCTTCTTTCTTTGTTGTTATTCTCTGTCAAC
CCCTTGCGTCTTCAATGGCCAAAACAAGAGCTATAAAAGAGAGGGAGAATGAGGACAAAGAGGTACCTGTTACCCCTGAAGTACAGAAAGCGAAAACGAAAAAGAAGAAG
ACGCTAGAGGAGAAAGAAGCGAAAAGGAGGAGAAGGCAACAGAGGGCCGAGGATCAGGAGGCTCTACAGAAAGCGACGGATGATGTGGCTGCCACAGTAGTTGAAGAAAA
CCCGAAAGAACCAGAAGAACAGAACCTAGAGCAAACTGAGCAGAGAGTCGCGGATACAGAAGAAGTTCAAGAGGAGCAAACAGAAGAAGTTCAAGAGGAGCAAACAGAGG
AAGTTCAAGAAAAACAGACCGAAGATACGCAAGAAGGTAGGGTAGAAGATGTTCAGGTAACGGATAATGAGCCAGTGCAGGAGGCTCGAGTGGAGGTGATCATGCCAGAA
GTACCAAAGCGTCGCCGCGTTAAGAGGAAAGCAGGTCGGGTTAGGGTTGTCCGAACTGATACTCCTTCGCCTCCAACCACTGATTCTGAAAGAGAAAATGCAGGAAGAGA
GGAACGAGAAAAGAAGGAAGCTGAGGACAAGGCTAGAGAAGAAGGAGCAAAGAAAGCGGAAGAAGAGATTTTGCCCAAACAAGCGGAAGACAAGGGAAAAGGTATTGCTG
AGGCATCAGGTGAGGCTGACGAGATTGAGGAACCGAGATTACCGTACAATCGCTTCATCAATAACCTTGCTCGGGCAAAAAAAGAGAGAGATAATGAGGAAGAGGAGGTA
CTCGTGACCCCCGAAGCACCGAAAGTGAAGGCAAAAAAGAAGAAGACACCAGAAGAAAAAGAAGCTAAAAGAAGAAGAAGACAGCAGAGGGCTGAGGATCAAGAAGTGGT
AGAGAAGACTGAGCCAGGTGTTGCGGATACAGAGGAAGTTCGAGAAGAAAATACAGAGGAAGTCCGAGAAGAAAATACAAAGGAAGTTCGAGAGGAAATTACAGAGGAAG
TTCAAGAAAAGCAGGCCGAGGATACGCAAGAAGGTAGGGCAGAAGATGTTCAGATTAGGGTTGTCCGAACTGATACTCCTTCGCCTCCAACCACTGATTCTAAAAGAGAA
AATGCAGGAAGAGAGGAACGAGAAAAGAAGGAAGCTGAGGACAAGGCTAGAGAAGAAGAAGCAAAGAAAGCGAAAGAAGAGATTTTGCCCAAACAAGCGGAAGACAAGGG
CAAAGGTATTGCTGAGGCATCGGGTGAGGCTGACGAGATTGAGGAACCGAGATTACCGTACAATCGCTTCATCAATAACCTTGCTCGGGCAAAGTATGTTGAGATGCTGA
GACGGGACTTCCTGTTTGAACGAGGATTTGGCGATGATCTGCCACGGTTCTTGAGGACTGGAATAGTGAACCTCGGCTGGAGTCAATTTTGTGCAAAGCCGGAACCTATT
AATTCAAACATTGTTCGGGAATTTTACGCAAATATTGACGATCACGAAGAATTTCAGGTTATCGTTCGAGGAGTGCCCGTTGACTGGAGCCCAGGAGCCATTAATTCTTT
GTTTAACCTCCAGGACTTTCCACACACAGGCTTTAATGAGATGGTGGTCGCACCATCTAACGACCAACTAAATGCGGCTGTCCGAGAGGTTGGCATCGAGGGGGCCCAGT
GGAGATTGTCGAAGACGGAAAAGCGCACATTTCAGGCTGCTTATTTGAAGAGCGAGGCCAATACATGGATGGGTTTCATCAAGCTGCGCTTATTGCCGACAACTCACGAC
TCAATGGTGTCTCGAGACCGGGTTTTGCTTGCCTTTGCTATTCTTCATTCGATGAGTATTGATGTGGGTAAAATAATTTCTTCTGAGATTCTTGATTGCTGGCGGAAAAA
GGTGGGGAAGCTGTTTTTCCCCAACACTATCACGATGCTATGCCGAAGGGCAGGGGTGCCAGAGGATGAGGATGATGTGCCACTAATAGACAAGGGAATAATTGACACAC
CAAATCTGGCTAGGCTTCAGAGGACGCAGGAAGCACGCCAAGGAGGTTTGGTGTTCGGCATCCACCAAATGCAGGAGCAATTACAGCTGCATTCCAGTAGGATGGAGTTT
GCTGAAAGGCAATTTCAGACTTTCTGGAACTATGTGAAAAGGAGGGATGCCGCGTTGAGGGTAGCTTTGCAGTCGAATTTTTCCGAGCCATATCCGGCTTTACCCGTATT
CCCTGATGACCTACTGAACCCTTGGGTTCCGCCCCCACCTGTTGAACGAGAAGAAGATGGTGAAGAGCAGGGTCAGGAAGATTAA
Protein sequence
MRNLMRPRVEQRSSKGVDGGQHMIIYHGRLKFRNLSLMDLNFRIAWGLNVSLHLLDSNLCLLHSSFFVVILCQPLASSMAKTRAIKERENEDKEVPVTPEVQKAKTKKKK
TLEEKEAKRRRRQQRAEDQEALQKATDDVAATVVEENPKEPEEQNLEQTEQRVADTEEVQEEQTEEVQEEQTEEVQEKQTEDTQEGRVEDVQVTDNEPVQEARVEVIMPE
VPKRRRVKRKAGRVRVVRTDTPSPPTTDSERENAGREEREKKEAEDKAREEGAKKAEEEILPKQAEDKGKGIAEASGEADEIEEPRLPYNRFINNLARAKKERDNEEEEV
LVTPEAPKVKAKKKKTPEEKEAKRRRRQQRAEDQEVVEKTEPGVADTEEVREENTEEVREENTKEVREEITEEVQEKQAEDTQEGRAEDVQIRVVRTDTPSPPTTDSKRE
NAGREEREKKEAEDKAREEEAKKAKEEILPKQAEDKGKGIAEASGEADEIEEPRLPYNRFINNLARAKYVEMLRRDFLFERGFGDDLPRFLRTGIVNLGWSQFCAKPEPI
NSNIVREFYANIDDHEEFQVIVRGVPVDWSPGAINSLFNLQDFPHTGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHD
SMVSRDRVLLAFAILHSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQEARQGGLVFGIHQMQEQLQLHSSRMEF
AERQFQTFWNYVKRRDAALRVALQSNFSEPYPALPVFPDDLLNPWVPPPPVEREEDGEEQGQED
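
The protein sequence above can be checked against the CDS by translating the CDS codon by codon. The sketch below is one way to do that. It assumes the standard genetic code (NCBI translation table 1) applies to this gene, which the page does not state, and the names `CODON_TABLE` and `translate` are hypothetical rather than part of any CuGenDB tool.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position. ASSUMPTION: this table applies to
# the gene on this page.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped lines of a
    record can be passed in as one string. Codons containing characters
    other than A/C/G/T are rendered as 'X'. A trailing partial codon is
    ignored.
    """
    sequence = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(sequence) - len(sequence) % 3, 3):
        aa = CODON_TABLE.get(sequence[i:i + 3], "X")
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 48 bases of the CDS listed on this page.
    print(translate("ATGCGTAATCTGATGAGGCCACGTGTCGAACAGAGATCATCCAAGGGC"))
```

Translating the first 48 bases of the CDS gives `MRNLMRPRVEQRSSKG`, matching the start of the listed protein. The full CDS can be passed in the same way, with its line breaks left in place, and the result compared to the protein sequence.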