CuGenDBv2

Lag0041703 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0041703
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Protein CHROMATIN REMODELING
Genome location: chr13:24307589..24314591
RNA-Seq Expression: Lag0041703
Synteny: Lag0041703
Gene Ontology terms:
GO:0051716 - cellular response to stimulus (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003682 - chromatin binding (molecular function)
GO:0004386 - helicase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0140658 - ATP-dependent chromatin remodeler activity (molecular function)
InterPro domains:
IPR000330 - SNF2, N-terminal
IPR005162 - Retrotransposon gag domain
IPR014001 - Helicase superfamily 1/2, ATP-binding domain
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR038718 - SNF2-like, N-terminal domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0067170.1 protein CHROMATIN REMODELING 19 isoform X1 [Cucumis melo var. makuwa] (e value: 9.2e-78; % identity: 92.9)
Query:  LGQPFMAILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLF
        LG  F +ILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENW RELKKWCPSFSVL YHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLF
Subjt:  LGQPFMAILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLF

Query:  ERHSSQQKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS
        ERHSSQQKDERKILKRWQWSCVLMDEAHALKD+NSYRWKNLMSLARNAKQR +++
Subjt:  ERHSSQQKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS

XP_004140399.1 protein CHROMATIN REMODELING 19 [Cucumis sativus] (e value: 1.6e-77; % identity: 95.97)
Query:  AILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQ
        AILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVL YHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQ
Subjt:  AILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQ

Query:  QKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS
        QKDERKILKRWQWSCVLMDEAHALKD+NSYRWKNLMSLARNAKQR +++
Subjt:  QKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS

XP_008460214.1 PREDICTED: protein CHROMATIN REMODELING 19 isoform X2 [Cucumis melo] (e value: 1.0e-76; % identity: 95.3)
Query:  AILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQ
        AILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENW RELKKWCPSFSVL YHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQ
Subjt:  AILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQ

Query:  QKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS
        QKDERKILKRWQWSCVLMDEAHALKD+NSYRWKNLMSLARNAKQR +++
Subjt:  QKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS

XP_016902524.1 PREDICTED: protein CHROMATIN REMODELING 19 isoform X3 [Cucumis melo] (e value: 1.0e-76; % identity: 95.3)
Query:  AILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQ
        AILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENW RELKKWCPSFSVL YHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQ
Subjt:  AILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQ

Query:  QKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS
        QKDERKILKRWQWSCVLMDEAHALKD+NSYRWKNLMSLARNAKQR +++
Subjt:  QKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS

XP_022157601.1 protein CHROMATIN REMODELING 19 [Momordica charantia] (e value: 2.7e-77; % identity: 95.97)
Query:  AILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQ
        AILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGA RSAYAKEL SLAKSGLPPPFNVLLVCYSLFERHSSQ
Subjt:  AILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQ

Query:  QKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS
        QKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQR +++
Subjt:  QKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KSU2 Uncharacterized protein (e value: 1.7e-77; % identity: 95.3)
Query:  AILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQ
        +ILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVL YHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQ
Subjt:  AILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQ

Query:  QKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS
        QKDERKILKRWQWSCVLMDEAHALKD+NSYRWKNLMSLARNAKQR +++
Subjt:  QKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS

A0A1S3CC04 protein CHROMATIN REMODELING 19 isoform X1 (e value: 4.9e-77; % identity: 95.3)
Query:  AILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQ
        AILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENW RELKKWCPSFSVL YHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQ
Subjt:  AILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQ

Query:  QKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS
        QKDERKILKRWQWSCVLMDEAHALKD+NSYRWKNLMSLARNAKQR +++
Subjt:  QKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS

A0A1S3CC36 protein CHROMATIN REMODELING 19 isoform X2 (e value: 4.9e-77; % identity: 95.3)
Query:  AILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQ
        AILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENW RELKKWCPSFSVL YHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQ
Subjt:  AILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQ

Query:  QKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS
        QKDERKILKRWQWSCVLMDEAHALKD+NSYRWKNLMSLARNAKQR +++
Subjt:  QKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS

A0A5D3DR37 Protein CHROMATIN REMODELING 19 isoform X1 (e value: 4.4e-78; % identity: 92.9)
Query:  LGQPFMAILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLF
        LG  F +ILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENW RELKKWCPSFSVL YHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLF
Subjt:  LGQPFMAILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLF

Query:  ERHSSQQKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS
        ERHSSQQKDERKILKRWQWSCVLMDEAHALKD+NSYRWKNLMSLARNAKQR +++
Subjt:  ERHSSQQKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS

A0A6J1DYN8 protein CHROMATIN REMODELING 19 (e value: 1.3e-77; % identity: 95.97)
Query:  AILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQ
        AILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGA RSAYAKEL SLAKSGLPPPFNVLLVCYSLFERHSSQ
Subjt:  AILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQ

Query:  QKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS
        QKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQR +++
Subjt:  QKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS

SwissProt top hits (e value, % identity, alignment)
C0H4W3 Probable ATP-dependent helicase PF08_0048 (e value: 1.4e-23; % identity: 40)
Query:  ILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQQ
        ILADEMGLGKT+Q I+ L  L Y  N  GPHL++ P S+L NWE ELK++CP F +L Y+G     Y K +    K      F++ +  YS   +     
Subjt:  ILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQQ

Query:  KDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARN
          +  + KR +W  +++DEAH +K+ N+ RW  ++SL R+
Subjt:  KDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARN

E7F1C4 SWI/SNF-related matrix-associated actin-dependent regulator of chromatin subfamily A containing DEAD/H box 1B (e value: 6.7e-23; % identity: 42.38)
Query:  ILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAA---RSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHS
        ILADEMGLGKTIQAI++L  L Y   + GPHLI  PAS L+NW REL  WCPSF VL Y+G+A   +    + LN + +      +N+++  Y+L   +S
Subjt:  ILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAA---RSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHS

Query:  SQQKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS
        S    +R +  + +    + DE H LK+ NS R+++LM++  NAK R +++
Subjt:  SQQKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS

Q5ARK3 Helicase swr1 (e value: 2.6e-22; % identity: 39.44)
Query:  ILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQQ
        ILADEMGLGKTIQ I  L  L   +   GPHL+V P SV+ NWE E KKWCP F ++ Y+G       K    +  +     +NVL+  Y L        
Subjt:  ILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQQ

Query:  KDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAK
          ++++LKR  W  +++DEAH +K+  S RW+ L++    A+
Subjt:  KDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAK

Q5FWR0 SWI/SNF-related matrix-associated actin-dependent regulator of chromatin subfamily A containing DEAD/H box 1 (e value: 3.9e-23; % identity: 39.19)
Query:  ILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQQ
        ILADEMGLGKT+QAI +L  L Y+  DSGPHL+V PAS ++NW RE  +WCPS ++L Y+G+         + L K      FNV++  Y+     +   
Subjt:  ILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQQ

Query:  KDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS
         ++R + +R + +  + DE H LK+ ++ R+++LM+L  NA+ R +++
Subjt:  KDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS

Q9ZUL5 Protein CHROMATIN REMODELING 19 (e value: 9.2e-73; % identity: 83.89)
Query:  AILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQ
        AILADEMGLGKTIQAITYL +L  LNND GPHL+VCPASVLENWEREL+KWCPSF+VLQYHGAAR+AY++ELNSL+K+G PPPFNVLLVCYSLFERHS Q
Subjt:  AILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQ

Query:  QKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS
        QKD+RK+LKRW+WSCVLMDEAHALKDKNSYRWKNLMS+ARNA QR +++
Subjt:  QKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS

Arabidopsis top hits (e value, % identity, alignment)
AT2G02090.1 SNF2 domain-containing protein / helicase domain-containing protein (e value: 6.6e-74; % identity: 83.89)
Query:  AILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQ
        AILADEMGLGKTIQAITYL +L  LNND GPHL+VCPASVLENWEREL+KWCPSF+VLQYHGAAR+AY++ELNSL+K+G PPPFNVLLVCYSLFERHS Q
Subjt:  AILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQ

Query:  QKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS
        QKD+RK+LKRW+WSCVLMDEAHALKDKNSYRWKNLMS+ARNA QR +++
Subjt:  QKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS

AT2G13370.1 chromatin remodeling 5 (e value: 3.9e-18; % identity: 27.51)
Query:  ILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARS---AYAKELNSLAKSGLPPPFNVLLVCYSLFERHS
        ILADEMGLGKT+Q+++ L  L+      GP L+V P S L NW +E +KW P  +++ Y G   S       E  +  K G P  FN LL  Y +  +  
Subjt:  ILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARS---AYAKELNSLAKSGLPPPFNVLLVCYSLFERHS

Query:  SQQKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVISWGMVGKMEGRVVALEEKLAEIAVKQVDLEVNLGSHMAEMGEKQTS---V
             ++ +L + +W  +++DEAH LK+  +  +  L+    + K + +I+   +      + AL   L     K  D  V    +++   E + +   +
Subjt:  SQQKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVISWGMVGKMEGRVVALEEKLAEIAVKQVDLEVNLGSHMAEMGEKQTS---V

Query:  EAKLDLQFTLVREEMKAMFTRLEGIMNVE
        E +  +   ++++  K++  ++E I+ VE
Subjt:  EAKLDLQFTLVREEMKAMFTRLEGIMNVE

AT3G12810.1 SNF2 domain-containing protein / helicase domain-containing protein (e value: 3.1e-23; % identity: 39.86)
Query:  ILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQQ
        ILADEMGLGKTI  I  L  L       GPHLIV P SV+ NWE E  KWCP+F +L Y G+A+    K    +  +     F+V +  Y L  + S   
Subjt:  ILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPFNVLLVCYSLFERHSSQQ

Query:  KDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS
            K+ KR +W  +++DEAH +K+  S RW+ L++   N+K+R +++
Subjt:  KDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS

AT3G57300.1 INO80 ortholog (e value: 4.5e-22; % identity: 37.42)
Query:  QPFMAILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGA--ARSAYAKELNSLAKSGLPPPFNVLLVCYSLF
        Q    ILADEMGLGKTIQA+ +L  L    N  GP L+V PASVL NW  E+ ++CP    L Y G    R+   K +N          F++L+  Y L 
Subjt:  QPFMAILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGA--ARSAYAKELNSLAKSGLPPPFNVLLVCYSLF

Query:  ERHSSQQKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS
                 + K  +R +W  +++DEA A+K  +S RWK L+S   N + R +++
Subjt:  ERHSSQQKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS

AT3G57300.2 INO80 ortholog (e value: 3.0e-18; % identity: 31.38)
Query:  QPFMAILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGA--ARSAYAKELNS--------------------
        Q    ILADEMGLGKTIQA+ +L  L    N  GP L+V PASVL NW  E+ ++CP    L Y G    R+   K +N                     
Subjt:  QPFMAILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGA--ARSAYAKELNS--------------------

Query:  -------------LAKSGLPPPFNVLLVCYSLFERHSSQQKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS
                        S +   F++L+  Y L          + K  +R +W  +++DEA A+K  +S RWK L+S   N + R +++
Subjt:  -------------LAKSGLPPPFNVLLVCYSLFERHSSQQKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVIS


Sequences
CDS sequence
ATGTGTGGATTGGTGATTGAGCCCTCAAAGCTTCCTTCCCAAGAATATTCAATATTGCTTCTCCAAAGGATAACCTTATTATTCAATATTGCTTCTCCAAAGGACAGCCT
TATTAATCAGTGCTGGTTTGTTTCTTCTAATGGGTGGGTTGTTCGTTTAAGGAGGAATTTGACTGATAATGTAACTCTCGAGTGGATTGCACTCCATAGCCTCCTTGAGA
CAAAATGTCCTTCAAATATAGAGGACTCCAAATCTTGGACACTCAACAGATGTGGCTGGATTTTGGGAATGTACTTCAAAGAAGATGGCCAAATTGCTCCATGTCCCCAA
GTTTGTGCCCGATGTGGTAGATTGGATGGAGAAATCAGCATCATCTCTTCTTCCATTGCCCTTGGGCAACCTTTTATGGCAATACTTGCAGACGAAATGGGTCTGGGGAA
GACAATACAGGCTATTACATATCTTGTGATGTTGAAATATTTGAACAATGATTCGGGGCCTCATCTAATTGTATGCCCTGCATCTGTTCTGGAGAATTGGGAAAGAGAAC
TCAAAAAGTGGTGCCCATCATTTTCTGTACTCCAGTATCATGGGGCTGCGCGATCGGCATATGCAAAGGAATTGAATTCTCTAGCCAAGTCGGGGTTGCCTCCTCCATTT
AATGTTCTTCTTGTTTGTTATTCTCTCTTTGAAAGACACAGTTCCCAGCAGAAAGATGAACGCAAAATTCTGAAACGCTGGCAGTGGAGCTGTGTTCTTATGGATGAGGC
TCATGCCTTGAAAGATAAAAACAGCTATCGGTGGAAAAATTTAATGTCTCTTGCACGTAATGCAAAGCAACGAGCAGTGATTTCTTGGGGAATGGTGGGAAAGATGGAAG
GAAGGGTAGTCGCGTTGGAAGAGAAATTGGCCGAGATAGCTGTTAAGCAGGTCGATTTGGAGGTGAACTTGGGATCTCATATGGCGGAAATGGGAGAGAAACAAACGAGT
GTGGAGGCAAAGCTGGACTTACAGTTCACGCTTGTGCGGGAGGAGATGAAGGCCATGTTCACGCGATTGGAAGGAATAATGAACGTTGAGAAAGGGTCACCATCCTCAGA
GCGAACAATCACCGATAAAGGGAAGCGCATAGTCGATGATGAGTCGGTGTTGAAGGCTCCGGAACCGCGAGATGACGACGAGAAGAAGGCCACGAGTAGTGACGGAATCC
AGGCTCCTAGTAGTCGTGAAGTGCCCCTGTTCGACATGCGCCTAAGGAAGTTAGAGGTGCCCATATTTAAGGGGGAAGATGAGGAAGACCCAGATGGTTGGTTGCATCGG
GTGGAGCGGTATTTCGTAGTCAATCGCTTGTCCGAGAGGGACAAATTGGAAGCTGCCGTCATGTGTCTTGAGGGAGAAGCCCTAAACTGGCATCAGTATGAAGAAGAGAG
AACACCGATGGGCACTTGGAAGGAATTCCGAAGGTTATTGTTGGAACGATTTCGACCGACGTCCCAAGGAGATCGGTATGCTCGTTTAATGAAGTTGCAACAGGAAACCA
CCGTGAGGGAATATCGCCGGCGTTTTGAGCAATACGCGGCGACACTCAAGGACGTGAGTGACGACGTGCTAGAGAGTAAGTTTGAATGTGGGTTGAAGGAGGAGATCCAA
AGTGAGATGAGGAAGTTTCAGCCCGTGGGCCTGAAGGCGAAGATGTTGATGGCCCAATTGATTGAAGATGACAACGCCGTCCAAGAAAAGAAAAGAACGGGAAAAGCCCA
AGGCCAAAACGCAAGCCCAAAAAATAACACAAACCCGAATGGGGCAAGCGGTGGATCCAGTACGTCTGGCGGGTCAAGTGGGTCGACACTAGAACGATTTCCTTAA
mRNA sequence
ATGTGTGGATTGGTGATTGAGCCCTCAAAGCTTCCTTCCCAAGAATATTCAATATTGCTTCTCCAAAGGATAACCTTATTATTCAATATTGCTTCTCCAAAGGACAGCCT
TATTAATCAGTGCTGGTTTGTTTCTTCTAATGGGTGGGTTGTTCGTTTAAGGAGGAATTTGACTGATAATGTAACTCTCGAGTGGATTGCACTCCATAGCCTCCTTGAGA
CAAAATGTCCTTCAAATATAGAGGACTCCAAATCTTGGACACTCAACAGATGTGGCTGGATTTTGGGAATGTACTTCAAAGAAGATGGCCAAATTGCTCCATGTCCCCAA
GTTTGTGCCCGATGTGGTAGATTGGATGGAGAAATCAGCATCATCTCTTCTTCCATTGCCCTTGGGCAACCTTTTATGGCAATACTTGCAGACGAAATGGGTCTGGGGAA
GACAATACAGGCTATTACATATCTTGTGATGTTGAAATATTTGAACAATGATTCGGGGCCTCATCTAATTGTATGCCCTGCATCTGTTCTGGAGAATTGGGAAAGAGAAC
TCAAAAAGTGGTGCCCATCATTTTCTGTACTCCAGTATCATGGGGCTGCGCGATCGGCATATGCAAAGGAATTGAATTCTCTAGCCAAGTCGGGGTTGCCTCCTCCATTT
AATGTTCTTCTTGTTTGTTATTCTCTCTTTGAAAGACACAGTTCCCAGCAGAAAGATGAACGCAAAATTCTGAAACGCTGGCAGTGGAGCTGTGTTCTTATGGATGAGGC
TCATGCCTTGAAAGATAAAAACAGCTATCGGTGGAAAAATTTAATGTCTCTTGCACGTAATGCAAAGCAACGAGCAGTGATTTCTTGGGGAATGGTGGGAAAGATGGAAG
GAAGGGTAGTCGCGTTGGAAGAGAAATTGGCCGAGATAGCTGTTAAGCAGGTCGATTTGGAGGTGAACTTGGGATCTCATATGGCGGAAATGGGAGAGAAACAAACGAGT
GTGGAGGCAAAGCTGGACTTACAGTTCACGCTTGTGCGGGAGGAGATGAAGGCCATGTTCACGCGATTGGAAGGAATAATGAACGTTGAGAAAGGGTCACCATCCTCAGA
GCGAACAATCACCGATAAAGGGAAGCGCATAGTCGATGATGAGTCGGTGTTGAAGGCTCCGGAACCGCGAGATGACGACGAGAAGAAGGCCACGAGTAGTGACGGAATCC
AGGCTCCTAGTAGTCGTGAAGTGCCCCTGTTCGACATGCGCCTAAGGAAGTTAGAGGTGCCCATATTTAAGGGGGAAGATGAGGAAGACCCAGATGGTTGGTTGCATCGG
GTGGAGCGGTATTTCGTAGTCAATCGCTTGTCCGAGAGGGACAAATTGGAAGCTGCCGTCATGTGTCTTGAGGGAGAAGCCCTAAACTGGCATCAGTATGAAGAAGAGAG
AACACCGATGGGCACTTGGAAGGAATTCCGAAGGTTATTGTTGGAACGATTTCGACCGACGTCCCAAGGAGATCGGTATGCTCGTTTAATGAAGTTGCAACAGGAAACCA
CCGTGAGGGAATATCGCCGGCGTTTTGAGCAATACGCGGCGACACTCAAGGACGTGAGTGACGACGTGCTAGAGAGTAAGTTTGAATGTGGGTTGAAGGAGGAGATCCAA
AGTGAGATGAGGAAGTTTCAGCCCGTGGGCCTGAAGGCGAAGATGTTGATGGCCCAATTGATTGAAGATGACAACGCCGTCCAAGAAAAGAAAAGAACGGGAAAAGCCCA
AGGCCAAAACGCAAGCCCAAAAAATAACACAAACCCGAATGGGGCAAGCGGTGGATCCAGTACGTCTGGCGGGTCAAGTGGGTCGACACTAGAACGATTTCCTTAA
Protein sequence
MCGLVIEPSKLPSQEYSILLLQRITLLFNIASPKDSLINQCWFVSSNGWVVRLRRNLTDNVTLEWIALHSLLETKCPSNIEDSKSWTLNRCGWILGMYFKEDGQIAPCPQ
VCARCGRLDGEISIISSSIALGQPFMAILADEMGLGKTIQAITYLVMLKYLNNDSGPHLIVCPASVLENWERELKKWCPSFSVLQYHGAARSAYAKELNSLAKSGLPPPF
NVLLVCYSLFERHSSQQKDERKILKRWQWSCVLMDEAHALKDKNSYRWKNLMSLARNAKQRAVISWGMVGKMEGRVVALEEKLAEIAVKQVDLEVNLGSHMAEMGEKQTS
VEAKLDLQFTLVREEMKAMFTRLEGIMNVEKGSPSSERTITDKGKRIVDDESVLKAPEPRDDDEKKATSSDGIQAPSSREVPLFDMRLRKLEVPIFKGEDEEDPDGWLHR
VERYFVVNRLSERDKLEAAVMCLEGEALNWHQYEEERTPMGTWKEFRRLLLERFRPTSQGDRYARLMKLQQETTVREYRRRFEQYAATLKDVSDDVLESKFECGLKEEIQ
SEMRKFQPVGLKAKMLMAQLIEDDNAVQEKKRTGKAQGQNASPKNNTNPNGASGGSSTSGGSSGSTLERFP