; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011893 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011893
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr1:34579577..34581275
RNA-Seq ExpressionLag0011893
SyntenyLag0011893
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG48193.1 hypothetical protein EZV62_027487 [Acer yangbiense]1.5e-2527.03Show/hide
Query:  EVQSKLERLGLEEEEGGRVVDIEDDDIDETDKDFQNSLACKIFSPRAINAEGFINLMPKIWGIEGNVKIEKAGMNMFLCKFKNQRAKNRVQRNGPWSFDD
        E+    E L LE+E+   V +I +D I + D+D    L  K+ + + +N E F  L+ +IW   G V++E  G N F+  F N+  +N+V   GPW F  
Subjt:  EVQSKLERLGLEEEEGGRVVDIEDDDIDETDKDFQNSLACKIFSPRAINAEGFINLMPKIWGIEGNVKIEKAGMNMFLCKFKNQRAKNRVQRNGPWSFDD

Query:  AIMVFEDPKGNSSVEEIDYRFVEFWVDFHGLPRGSALSTSISTVIPNGLIESCNSGFLSHIILHRRIGANPPLKGWKYAEALGNSIGVFVAAETDENGKM
        +++V E PKG   + ++ +   +FWV  H +P                 I   N                      +  + L   IG  V   T E+ + 
Subjt:  AIMVFEDPKGNSSVEEIDYRFVEFWVDFHGLPRGSALSTSISTVIPNGLIESCNSGFLSHIILHRRIGANPPLKGWKYAEALGNSIGVFVAAETDENGKM

Query:  EGESLRVRVKLDIHNPLRRGTNIKTGSMADKKWIKATYEKLPDSCYFCGKLGHTIQDCEEEGCE----EGKKNDYGIELRNTQGSKGFYKGEKPSNRDTQ
         G+ +RV+V++DI  PL+R   IK G   +   +   YE+LPD C+ CG++GH++++C +E  +    +G++  +G  +R T   K   K    +   + 
Subjt:  EGESLRVRVKLDIHNPLRRGTNIKTGSMADKKWIKATYEKLPDSCYFCGKLGHTIQDCEEEGCE----EGKKNDYGIELRNTQGSKGFYKGEKPSNRDTQ

Query:  NRGRG--RGNFFHGRGRADWNRGRSNEEEEESD
         RGR         G G      G    ++  SD
Subjt:  NRGRG--RGNFFHGRGRADWNRGRSNEEEEESD

TXG54013.1 hypothetical protein EZV62_019269 [Acer yangbiense]1.8e-2624.8Show/hide
Query:  ADEVQSKLERLGLEEEEGGRVVDIEDDDIDETDKDFQNSLACKIFSPRAINAEGFINLMPKIWGIEGNVKIEKAGMNMFLCKFKNQRAKNRVQRNGPWSF
        +D++  K E+L L +++ G +  I+    +  ++    SL  K  + + IN E F + +  IW  +  V +E  G+N+F  +F+N   + R+   GPW F
Subjt:  ADEVQSKLERLGLEEEEGGRVVDIEDDDIDETDKDFQNSLACKIFSPRAINAEGFINLMPKIWGIEGNVKIEKAGMNMFLCKFKNQRAKNRVQRNGPWSF

Query:  DDAIMVFEDPKGNSSVEEIDYRFVEFWVDFHGLPRGSALSTSISTVIPNGLIESCNSGFLSHIILHRRIGANPPLKGWKYAEALGNSIGVFVAAETDENG
        D  ++V  +  G+  V ++ +R+V FW+  H LP                              L+R IG +           LG  +G     +  E+G
Subjt:  DDAIMVFEDPKGNSSVEEIDYRFVEFWVDFHGLPRGSALSTSISTVIPNGLIESCNSGFLSHIILHRRIGANPPLKGWKYAEALGNSIGVFVAAETDENG

Query:  KMEGESLRVRVKLDIHNPLRRGTNIKTGSMADKKWIKATYEKLPDSCYFCGKLGHTIQDC--EEEGCEEGKKNDYGIELRNTQGSKGFYKGEKPSNRDTQ
        +  G+ +R+RV +D+ NPL+RG  +  G       +   YE+LP+ CY+CGK+GH ++DC    +         +G  +R    ++    GEK ++ +  
Subjt:  KMEGESLRVRVKLDIHNPLRRGTNIKTGSMADKKWIKATYEKLPDSCYFCGKLGHTIQDC--EEEGCEEGKKNDYGIELRNTQGSKGFYKGEKPSNRDTQ

Query:  NRGRGRGNF--FHGRGRADWNRGRSNEEEEESDEERKLEEACSEREAEGQRELVNQIPRGMVATKDE
          G           +G   WN G+ +       E+  L    +E ++    E    + R MV +K E
Subjt:  NRGRGRGNF--FHGRGRADWNRGRSNEEEEESDEERKLEEACSEREAEGQRELVNQIPRGMVATKDE

TXG71426.1 hypothetical protein EZV62_000005 [Acer yangbiense]4.0e-2629.12Show/hide
Query:  EVQSKLERLGLEEEEGGRVVDIEDDDID-ETDKDFQNSLACKIFSPRAINAEGFINLMPKIWGIEGNVKIEKAGMNMFLCKFKNQRAKNRVQRNGPWSFD
        E+    E L L +E+G      E D  + E + +    L  KI S + +N E FI+L+ ++W   G VKIE    N+F+ KF NQ  +NR+ + GPW F+
Subjt:  EVQSKLERLGLEEEEGGRVVDIEDDDID-ETDKDFQNSLACKIFSPRAINAEGFINLMPKIWGIEGNVKIEKAGMNMFLCKFKNQRAKNRVQRNGPWSFD

Query:  DAIMVFEDPKGNSSVEEIDYRFVEFWVDFHGLPRGSALSTSISTVIPNGLIESCNSGFLSHIILHRRIGANPPLKGWKYAEALGNSIGVFVAAETDENGK
         +++V E P+G  S+ ++ +  VE W+  H +P                            I +++R         W  AE +G  I +       E+  
Subjt:  DAIMVFEDPKGNSSVEEIDYRFVEFWVDFHGLPRGSALSTSISTVIPNGLIESCNSGFLSHIILHRRIGANPPLKGWKYAEALGNSIGVFVAAETDENGK

Query:  MEGESLRVRVKLDIHNPLRRGTNIKTGSMADKKWIKATYEKLPDSCYFCGKLGHTIQDCEE
          G+ L+V+V++DI  PL+R  ++K     D   +   YE+LP+ CY CG++GH  ++C +
Subjt:  MEGESLRVRVKLDIHNPLRRGTNIKTGSMADKKWIKATYEKLPDSCYFCGKLGHTIQDCEE

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]3.0e-2929.39Show/hide
Query:  ERLGLEEEEGGRVVDIEDDDIDETDKDFQNSLACKIFSPRAINAEGFINLMPKIWGIEGNVKIEKAGMNMFLCKFKNQRAKNRVQRNGPWSFDDAIMVFE
        ++  L  EE    +D++ D +   ++    SL  K+ + R I+A+    ++   W +E  + +E  G N+FL  F  +   NRV + GPW FD A++V +
Subjt:  ERLGLEEEEGGRVVDIEDDDIDETDKDFQNSLACKIFSPRAINAEGFINLMPKIWGIEGNVKIEKAGMNMFLCKFKNQRAKNRVQRNGPWSFDDAIMVFE

Query:  DPKGNSSVEEIDYRFVEFWVDFHGLPRGSALSTSISTVIPNGLIESCNSGFLSHIILHRRIGANPPLKGWKYAEALGNSIGVFVAAETDENGKMEGESLR
         P  + ++ E+++  V FW+    LP      T                                       A  LGN+IG FV  + +E G   G SLR
Subjt:  DPKGNSSVEEIDYRFVEFWVDFHGLPRGSALSTSISTVIPNGLIESCNSGFLSHIILHRRIGANPPLKGWKYAEALGNSIGVFVAAETDENGKMEGESLR

Query:  VRVKLDIHNPLRRGTNIKTGSMADKKWIKATYEKLPDSCYFCGKLGHTIQDCEEEGC----EEGKKNDYGIELRNTQGSKGFYKGEK---PSNRDT
        +RV +DI  PLRRG  I         WI   YE+LPD CYFCG +GH+  DC+        +    ++YG  LR      G  KG K   P+  D+
Subjt:  VRVKLDIHNPLRRGTNIKTGSMADKKWIKATYEKLPDSCYFCGKLGHTIQDCEEEGC----EEGKKNDYGIELRNTQGSKGFYKGEK---PSNRDT

XP_042988686.1 uncharacterized protein LOC122316216 [Carya illinoinensis]5.3e-2630.59Show/hide
Query:  ERLGLEEEEGGRVVDIEDDDIDETDKDFQNSLACKIFSPRAINAEGFINLMPKIWGIEGNVKIEKAGMNMFLCKFKNQRAKNRVQRNGPWSFDDAIMVFE
        +RL L E+E   VV +E  D++E+       L   +F+ +  N E F   M + W +   VK      N+FL  F++ R K +V R GPWSFD  +++ +
Subjt:  ERLGLEEEEGGRVVDIEDDDIDETDKDFQNSLACKIFSPRAINAEGFINLMPKIWGIEGNVKIEKAGMNMFLCKFKNQRAKNRVQRNGPWSFDDAIMVFE

Query:  DPKGNSSVEEIDYRFVEFWVDFHGLPRGSALSTSISTVIPNGLIESCNSGFLSHIILHRRIGANPPLKGWKYAEALGNSIGVFVAAETDENGKMEGESLR
        +  GN  V++I  R   FWV  H LP  +                            + R+G             +G  IG  +  + D      G+ LR
Subjt:  DPKGNSSVEEIDYRFVEFWVDFHGLPRGSALSTSISTVIPNGLIESCNSGFLSHIILHRRIGANPPLKGWKYAEALGNSIGVFVAAETDENGKMEGESLR

Query:  VRVKLDIHNPLRRGTNIKTGSMADKKWIKATYEKLPDSCYFCGKLGHTIQDCEEE
        VRV LDI  PL RGT    G   D  W++ +YE+L   C++CG LGH  ++CE +
Subjt:  VRVKLDIHNPLRRGTNIKTGSMADKKWIKATYEKLPDSCYFCGKLGHTIQDCEEE

TrEMBL top hitse value%identityAlignment
A0A5C7GU64 CCHC-type domain-containing protein7.4e-2627.03Show/hide
Query:  EVQSKLERLGLEEEEGGRVVDIEDDDIDETDKDFQNSLACKIFSPRAINAEGFINLMPKIWGIEGNVKIEKAGMNMFLCKFKNQRAKNRVQRNGPWSFDD
        E+    E L LE+E+   V +I +D I + D+D    L  K+ + + +N E F  L+ +IW   G V++E  G N F+  F N+  +N+V   GPW F  
Subjt:  EVQSKLERLGLEEEEGGRVVDIEDDDIDETDKDFQNSLACKIFSPRAINAEGFINLMPKIWGIEGNVKIEKAGMNMFLCKFKNQRAKNRVQRNGPWSFDD

Query:  AIMVFEDPKGNSSVEEIDYRFVEFWVDFHGLPRGSALSTSISTVIPNGLIESCNSGFLSHIILHRRIGANPPLKGWKYAEALGNSIGVFVAAETDENGKM
        +++V E PKG   + ++ +   +FWV  H +P                 I   N                      +  + L   IG  V   T E+ + 
Subjt:  AIMVFEDPKGNSSVEEIDYRFVEFWVDFHGLPRGSALSTSISTVIPNGLIESCNSGFLSHIILHRRIGANPPLKGWKYAEALGNSIGVFVAAETDENGKM

Query:  EGESLRVRVKLDIHNPLRRGTNIKTGSMADKKWIKATYEKLPDSCYFCGKLGHTIQDCEEEGCE----EGKKNDYGIELRNTQGSKGFYKGEKPSNRDTQ
         G+ +RV+V++DI  PL+R   IK G   +   +   YE+LPD C+ CG++GH++++C +E  +    +G++  +G  +R T   K   K    +   + 
Subjt:  EGESLRVRVKLDIHNPLRRGTNIKTGSMADKKWIKATYEKLPDSCYFCGKLGHTIQDCEEEGCE----EGKKNDYGIELRNTQGSKGFYKGEKPSNRDTQ

Query:  NRGRG--RGNFFHGRGRADWNRGRSNEEEEESD
         RGR         G G      G    ++  SD
Subjt:  NRGRG--RGNFFHGRGRADWNRGRSNEEEEESD

A0A5C7H9Y2 CCHC-type domain-containing protein8.8e-2724.8Show/hide
Query:  ADEVQSKLERLGLEEEEGGRVVDIEDDDIDETDKDFQNSLACKIFSPRAINAEGFINLMPKIWGIEGNVKIEKAGMNMFLCKFKNQRAKNRVQRNGPWSF
        +D++  K E+L L +++ G +  I+    +  ++    SL  K  + + IN E F + +  IW  +  V +E  G+N+F  +F+N   + R+   GPW F
Subjt:  ADEVQSKLERLGLEEEEGGRVVDIEDDDIDETDKDFQNSLACKIFSPRAINAEGFINLMPKIWGIEGNVKIEKAGMNMFLCKFKNQRAKNRVQRNGPWSF

Query:  DDAIMVFEDPKGNSSVEEIDYRFVEFWVDFHGLPRGSALSTSISTVIPNGLIESCNSGFLSHIILHRRIGANPPLKGWKYAEALGNSIGVFVAAETDENG
        D  ++V  +  G+  V ++ +R+V FW+  H LP                              L+R IG +           LG  +G     +  E+G
Subjt:  DDAIMVFEDPKGNSSVEEIDYRFVEFWVDFHGLPRGSALSTSISTVIPNGLIESCNSGFLSHIILHRRIGANPPLKGWKYAEALGNSIGVFVAAETDENG

Query:  KMEGESLRVRVKLDIHNPLRRGTNIKTGSMADKKWIKATYEKLPDSCYFCGKLGHTIQDC--EEEGCEEGKKNDYGIELRNTQGSKGFYKGEKPSNRDTQ
        +  G+ +R+RV +D+ NPL+RG  +  G       +   YE+LP+ CY+CGK+GH ++DC    +         +G  +R    ++    GEK ++ +  
Subjt:  KMEGESLRVRVKLDIHNPLRRGTNIKTGSMADKKWIKATYEKLPDSCYFCGKLGHTIQDC--EEEGCEEGKKNDYGIELRNTQGSKGFYKGEKPSNRDTQ

Query:  NRGRGRGNF--FHGRGRADWNRGRSNEEEEESDEERKLEEACSEREAEGQRELVNQIPRGMVATKDE
          G           +G   WN G+ +       E+  L    +E ++    E    + R MV +K E
Subjt:  NRGRGRGNF--FHGRGRADWNRGRSNEEEEESDEERKLEEACSEREAEGQRELVNQIPRGMVATKDE

A0A5C7ISG5 CCHC-type domain-containing protein2.0e-2629.12Show/hide
Query:  EVQSKLERLGLEEEEGGRVVDIEDDDID-ETDKDFQNSLACKIFSPRAINAEGFINLMPKIWGIEGNVKIEKAGMNMFLCKFKNQRAKNRVQRNGPWSFD
        E+    E L L +E+G      E D  + E + +    L  KI S + +N E FI+L+ ++W   G VKIE    N+F+ KF NQ  +NR+ + GPW F+
Subjt:  EVQSKLERLGLEEEEGGRVVDIEDDDID-ETDKDFQNSLACKIFSPRAINAEGFINLMPKIWGIEGNVKIEKAGMNMFLCKFKNQRAKNRVQRNGPWSFD

Query:  DAIMVFEDPKGNSSVEEIDYRFVEFWVDFHGLPRGSALSTSISTVIPNGLIESCNSGFLSHIILHRRIGANPPLKGWKYAEALGNSIGVFVAAETDENGK
         +++V E P+G  S+ ++ +  VE W+  H +P                            I +++R         W  AE +G  I +       E+  
Subjt:  DAIMVFEDPKGNSSVEEIDYRFVEFWVDFHGLPRGSALSTSISTVIPNGLIESCNSGFLSHIILHRRIGANPPLKGWKYAEALGNSIGVFVAAETDENGK

Query:  MEGESLRVRVKLDIHNPLRRGTNIKTGSMADKKWIKATYEKLPDSCYFCGKLGHTIQDCEE
          G+ L+V+V++DI  PL+R  ++K     D   +   YE+LP+ CY CG++GH  ++C +
Subjt:  MEGESLRVRVKLDIHNPLRRGTNIKTGSMADKKWIKATYEKLPDSCYFCGKLGHTIQDCEE

A0A5C7IW83 CCHC-type domain-containing protein1.7e-2527.22Show/hide
Query:  ADEVQSKLERLGLEEEEGGRVVDIEDDDIDETDKDFQNSLACKIFSPRAINAEGFINLMPKIWGIEGNVKIEKAGMNMFLCKFKNQRAKNRVQRNGPWSF
        A E+    E L + EE+ G V++  + +I++  KD    L  K+ + + +N E F +L+ +IW   G V++E    N+F+  F  Q  +NRV + GPW F
Subjt:  ADEVQSKLERLGLEEEEGGRVVDIEDDDIDETDKDFQNSLACKIFSPRAINAEGFINLMPKIWGIEGNVKIEKAGMNMFLCKFKNQRAKNRVQRNGPWSF

Query:  DDAIMVFEDPKGNSSVEEIDYRFVEFWVDFHGLPRGSALSTSISTVIPNGLIESCNSGFLSHIILHRRIGANPPLKGWKYAEALGNSIGVFVAAETDENG
          +++V E PKG  +  ++ +    FWV  H  P                            I ++RR+           A+ +   IG  V    D   
Subjt:  DDAIMVFEDPKGNSSVEEIDYRFVEFWVDFHGLPRGSALSTSISTVIPNGLIESCNSGFLSHIILHRRIGANPPLKGWKYAEALGNSIGVFVAAETDENG

Query:  KMEGESLRVRVKLDIHNPLRRGTNIKTGSMADKKWIKATYEKLPDSCYFCGKLGHTIQDCEEEGCE----EGKKNDYGIELRNTQGSKGFYKGEKPSNRD
        +  G+ +RV+V +DI  PLRR   +K G   +   +   YE+LP+ CY CG++GH I +C +        E     YG  L+   G K + +        
Subjt:  KMEGESLRVRVKLDIHNPLRRGTNIKTGSMADKKWIKATYEKLPDSCYFCGKLGHTIQDCEEEGCE----EGKKNDYGIELRNTQGSKGFYKGEKPSNRD

Query:  TQNRGRGRGNFFHGRG
        + +RGR       G G
Subjt:  TQNRGRGRGNFFHGRG

A0A6J1DU55 uncharacterized protein LOC1110231351.4e-2929.39Show/hide
Query:  ERLGLEEEEGGRVVDIEDDDIDETDKDFQNSLACKIFSPRAINAEGFINLMPKIWGIEGNVKIEKAGMNMFLCKFKNQRAKNRVQRNGPWSFDDAIMVFE
        ++  L  EE    +D++ D +   ++    SL  K+ + R I+A+    ++   W +E  + +E  G N+FL  F  +   NRV + GPW FD A++V +
Subjt:  ERLGLEEEEGGRVVDIEDDDIDETDKDFQNSLACKIFSPRAINAEGFINLMPKIWGIEGNVKIEKAGMNMFLCKFKNQRAKNRVQRNGPWSFDDAIMVFE

Query:  DPKGNSSVEEIDYRFVEFWVDFHGLPRGSALSTSISTVIPNGLIESCNSGFLSHIILHRRIGANPPLKGWKYAEALGNSIGVFVAAETDENGKMEGESLR
         P  + ++ E+++  V FW+    LP      T                                       A  LGN+IG FV  + +E G   G SLR
Subjt:  DPKGNSSVEEIDYRFVEFWVDFHGLPRGSALSTSISTVIPNGLIESCNSGFLSHIILHRRIGANPPLKGWKYAEALGNSIGVFVAAETDENGKMEGESLR

Query:  VRVKLDIHNPLRRGTNIKTGSMADKKWIKATYEKLPDSCYFCGKLGHTIQDCEEEGC----EEGKKNDYGIELRNTQGSKGFYKGEK---PSNRDT
        +RV +DI  PLRRG  I         WI   YE+LPD CYFCG +GH+  DC+        +    ++YG  LR      G  KG K   P+  D+
Subjt:  VRVKLDIHNPLRRGTNIKTGSMADKKWIKATYEKLPDSCYFCGKLGHTIQDCEEEGC----EEGKKNDYGIELRNTQGSKGFYKGEK---PSNRDT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G36228.1 nucleic acid binding;zinc ion binding5.3e-0820.27Show/hide
Query:  SLACKIFSPRAINAEGFINLMPKIWGIEGNVKIEKAGMNMFLCKFKNQRAKNRVQRNGPWSFDDAIMVFEDPKGNSSVEEIDYRFVEFWVDFHGLPRGSA
        SL  +I +P+  + E  I  +P  WG+   V         F  +F+++       R  PW F++  +  +  +      E    F++ WV   G+P    
Subjt:  SLACKIFSPRAINAEGFINLMPKIWGIEGNVKIEKAGMNMFLCKFKNQRAKNRVQRNGPWSFDDAIMVFEDPKGNSSVEEIDYRFVEFWVDFHGLPRGSA

Query:  LSTSISTVIPNGLIESCNSGFLSHIILHRRIGANPPLKGWKYAEALGNSIGVFVAAETDENGKMEGESLRVRVKLDIHNPLRRGTNIKTGSMADKKWIKA
                                           P    +  E + +++G  VA + +E    +   +RV+V++D   PLR    ++  S  ++  I  
Subjt:  LSTSISTVIPNGLIESCNSGFLSHIILHRRIGANPPLKGWKYAEALGNSIGVFVAAETDENGKMEGESLRVRVKLDIHNPLRRGTNIKTGSMADKKWIKA

Query:  TYEKLPDSCYFCGKLGHTIQDC
         YEKL   C  C ++ H +  C
Subjt:  TYEKLPDSCYFCGKLGHTIQDC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATCTAAGGATGTTGCAGACGAGGTTCAAAGCAAGCTGGAAAGACTTGGGTTGGAAGAGGAGGAAGGAGGCAGGGTGGTAGACATCGAGGACGACGACATCGATGA
GACGGATAAGGATTTTCAGAACTCACTTGCGTGCAAAATATTCTCCCCTAGAGCTATAAATGCAGAAGGATTCATAAACTTGATGCCGAAAATTTGGGGGATTGAAGGAA
ATGTCAAAATAGAAAAGGCTGGAATGAATATGTTTCTCTGTAAATTCAAAAATCAGCGTGCGAAAAACAGAGTACAACGAAACGGACCTTGGTCCTTCGATGATGCGATA
ATGGTGTTTGAAGACCCAAAAGGAAACAGTAGCGTGGAGGAGATTGACTACAGGTTTGTCGAATTTTGGGTTGACTTCCATGGCTTACCTCGCGGCTCAGCTCTAAGTAC
TTCTATTTCGACGGTCATCCCGAATGGCCTCATTGAAAGCTGCAATAGTGGTTTCCTCAGTCACATTATCCTTCATCGTCGAATTGGGGCTAATCCCCCTCTTAAAGGTT
GGAAATACGCGGAGGCCCTCGGAAATTCCATTGGCGTGTTCGTGGCGGCTGAAACGGATGAGAATGGGAAAATGGAAGGGGAATCGCTTAGGGTCAGAGTGAAATTGGAC
ATCCATAACCCTTTGAGGAGAGGAACCAATATTAAAACCGGATCCATGGCGGATAAAAAATGGATAAAGGCCACCTACGAGAAGCTGCCAGACTCCTGCTACTTCTGTGG
CAAGTTGGGACATACTATACAGGACTGTGAGGAAGAAGGTTGCGAGGAGGGAAAGAAAAATGACTATGGGATTGAGTTAAGAAATACCCAAGGTAGTAAGGGCTTTTATA
AGGGGGAAAAACCGAGTAACAGAGACACACAAAACAGGGGAAGAGGAAGAGGGAATTTTTTCCATGGAAGAGGAAGAGCGGATTGGAATAGGGGAAGAAGTAATGAGGAA
GAAGAAGAAAGTGATGAAGAAAGAAAATTAGAGGAAGCTTGTTCTGAAAGAGAGGCAGAGGGGCAGAGGGAATTGGTTAACCAAATACCAAGGGGAATGGTGGCCACCAA
GGATGAAGGGCCCTCGCAAAAAGAGAAAGGTAAACCAAATGATGGAATGATAAACTTACCAAGCCGAGAAGTTGCCCTCTTGGATAAGGAAGGTACTGGAGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAATCTAAGGATGTTGCAGACGAGGTTCAAAGCAAGCTGGAAAGACTTGGGTTGGAAGAGGAGGAAGGAGGCAGGGTGGTAGACATCGAGGACGACGACATCGATGA
GACGGATAAGGATTTTCAGAACTCACTTGCGTGCAAAATATTCTCCCCTAGAGCTATAAATGCAGAAGGATTCATAAACTTGATGCCGAAAATTTGGGGGATTGAAGGAA
ATGTCAAAATAGAAAAGGCTGGAATGAATATGTTTCTCTGTAAATTCAAAAATCAGCGTGCGAAAAACAGAGTACAACGAAACGGACCTTGGTCCTTCGATGATGCGATA
ATGGTGTTTGAAGACCCAAAAGGAAACAGTAGCGTGGAGGAGATTGACTACAGGTTTGTCGAATTTTGGGTTGACTTCCATGGCTTACCTCGCGGCTCAGCTCTAAGTAC
TTCTATTTCGACGGTCATCCCGAATGGCCTCATTGAAAGCTGCAATAGTGGTTTCCTCAGTCACATTATCCTTCATCGTCGAATTGGGGCTAATCCCCCTCTTAAAGGTT
GGAAATACGCGGAGGCCCTCGGAAATTCCATTGGCGTGTTCGTGGCGGCTGAAACGGATGAGAATGGGAAAATGGAAGGGGAATCGCTTAGGGTCAGAGTGAAATTGGAC
ATCCATAACCCTTTGAGGAGAGGAACCAATATTAAAACCGGATCCATGGCGGATAAAAAATGGATAAAGGCCACCTACGAGAAGCTGCCAGACTCCTGCTACTTCTGTGG
CAAGTTGGGACATACTATACAGGACTGTGAGGAAGAAGGTTGCGAGGAGGGAAAGAAAAATGACTATGGGATTGAGTTAAGAAATACCCAAGGTAGTAAGGGCTTTTATA
AGGGGGAAAAACCGAGTAACAGAGACACACAAAACAGGGGAAGAGGAAGAGGGAATTTTTTCCATGGAAGAGGAAGAGCGGATTGGAATAGGGGAAGAAGTAATGAGGAA
GAAGAAGAAAGTGATGAAGAAAGAAAATTAGAGGAAGCTTGTTCTGAAAGAGAGGCAGAGGGGCAGAGGGAATTGGTTAACCAAATACCAAGGGGAATGGTGGCCACCAA
GGATGAAGGGCCCTCGCAAAAAGAGAAAGGTAAACCAAATGATGGAATGATAAACTTACCAAGCCGAGAAGTTGCCCTCTTGGATAAGGAAGGTACTGGAGAGTAG
Protein sequenceShow/hide protein sequence
MESKDVADEVQSKLERLGLEEEEGGRVVDIEDDDIDETDKDFQNSLACKIFSPRAINAEGFINLMPKIWGIEGNVKIEKAGMNMFLCKFKNQRAKNRVQRNGPWSFDDAI
MVFEDPKGNSSVEEIDYRFVEFWVDFHGLPRGSALSTSISTVIPNGLIESCNSGFLSHIILHRRIGANPPLKGWKYAEALGNSIGVFVAAETDENGKMEGESLRVRVKLD
IHNPLRRGTNIKTGSMADKKWIKATYEKLPDSCYFCGKLGHTIQDCEEEGCEEGKKNDYGIELRNTQGSKGFYKGEKPSNRDTQNRGRGRGNFFHGRGRADWNRGRSNEE
EEESDEERKLEEACSEREAEGQRELVNQIPRGMVATKDEGPSQKEKGKPNDGMINLPSREVALLDKEGTGE