CuGenDBv2

ClCG09G014660 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG09G014660
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: general transcription factor 3C polypeptide 5-like
Genome location: CG_Chr09:26922936..26930454
RNA-Seq Expression: ClCG09G014660
Synteny: ClCG09G014660
Gene Ontology terms:
GO:0006384 - transcription initiation from RNA polymerase III promoter (biological process)
GO:0000127 - transcription factor TFIIIC complex (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR040454 - Transcription factor IIIC subunit Tfc1/Sfc1
IPR041499 - Transcription factor IIIC subunit Tfc1/Sfc1, triple barrel domain
IPR042536 - TFIIIC, subcomplex tauA subunit Sfc1, triple barrel domain superfamily
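
The genome location field can be split into a chromosome name and a coordinate pair. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (an assumption; the record does not state the convention). The helper names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tool.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end = parse_location("CG_Chr09:26922936..26930454")
print(chrom, start, end, span_length(start, end))
```

Under that convention the gene spans 7519 bp on CG_Chr09.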


Homology
GenBank top hits | e value | % identity
KAG6588837.1 General transcription factor 3C polypeptide 5, partial [Cucurbita argyrosperma subsp. sororia] | 1.5e-88 | 84.49
Query:  MGKLKDNTISGLLPTAQIFAIHYPGYPSSKRRAIESLGGSQSILKVRGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKICHSKSDTIEGIMKVEKEV
        MGKLKDNTISG LPTAQ+FA+HYPGYPSSKRRA+E+LGG QSILKVR LQSNKLELRFRPEDPYSHPT+GELRPCSCFLLKICH KS++ EGI KVEKEV
Subjt:  MGKLKDNTISGLLPTAQIFAIHYPGYPSSKRRAIESLGGSQSILKVRGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKICHSKSDTIEGIMKVEKEV

Query:  PREDEINLDFEMVARVPEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGKGNTVDVDKEDTMILVPPLFAIKDVPENLVKK
        PREDEINLDFEMVA VPEAYHFEGM DYQHVVA HADA +RK+G+W EM +PCLGK N +DVDKEDTMILVPPLFA+KDVPE+LV K
Subjt:  PREDEINLDFEMVARVPEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGKGNTVDVDKEDTMILVPPLFAIKDVPENLVKK

XP_004142476.1 general transcription factor 3C polypeptide 5 isoform X2 [Cucumis sativus] | 2.8e-87 | 86.63
Query:  MGKLKDNTISGLLPTAQIFAIHYPGYPSSKRRAIESLGGSQSILKVRGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKICHSKSDTIEGIMKVEKEV
        MGKLKDNTISG LPTAQ FA+HYP YPSSK +AIESLGG+QSILKVRGLQSNKLELRFRP DPYSHPTYGELRPCS FLLKICHSKSDT EGIMKVE EV
Subjt:  MGKLKDNTISGLLPTAQIFAIHYPGYPSSKRRAIESLGGSQSILKVRGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKICHSKSDTIEGIMKVEKEV

Query:  PREDEINLDFEMVARVPEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGKGNTVDVDKEDTMILVPPLFAIKDVPENLVKK
        P EDE+NLDFEMVARVPEAYHFEGMVDYQHVVA+HADA +RK+GNWAEM +P LGK N +DVDKEDTMILVPPLF+IKDVPENLV K
Subjt:  PREDEINLDFEMVARVPEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGKGNTVDVDKEDTMILVPPLFAIKDVPENLVKK

XP_022927790.1 general transcription factor 3C polypeptide 5-like [Cucurbita moschata] | 5.1e-89 | 85.03
Query:  MGKLKDNTISGLLPTAQIFAIHYPGYPSSKRRAIESLGGSQSILKVRGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKICHSKSDTIEGIMKVEKEV
        MGKLKDNTISG LPTAQ+FA+HYPGYPSSKRRA+E+LGG QSILKVR LQSNKLELRFRPEDPYSHPT+GELRPCSCFLLKICH KSD+ EGI KVEKEV
Subjt:  MGKLKDNTISGLLPTAQIFAIHYPGYPSSKRRAIESLGGSQSILKVRGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKICHSKSDTIEGIMKVEKEV

Query:  PREDEINLDFEMVARVPEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGKGNTVDVDKEDTMILVPPLFAIKDVPENLVKK
        PREDEINLDFEMVA VPEAYHFEGM DYQHVVA HADA +RK+G+W EM +PCLGK N +DVDKEDTMILVPPLFA+KDVPE+LV K
Subjt:  PREDEINLDFEMVARVPEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGKGNTVDVDKEDTMILVPPLFAIKDVPENLVKK

XP_023530657.1 general transcription factor 3C polypeptide 5-like [Cucurbita pepo subsp. pepo] | 1.1e-88 | 84.49
Query:  MGKLKDNTISGLLPTAQIFAIHYPGYPSSKRRAIESLGGSQSILKVRGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKICHSKSDTIEGIMKVEKEV
        MGKLKDNTISG LPTAQ+FA+HYPGYPSSKRRA+E+LGG QSILKVR LQSNKLELRFRPEDPYSHPT+GELRPCSCFLLKICH KSD+ EGI KVEKEV
Subjt:  MGKLKDNTISGLLPTAQIFAIHYPGYPSSKRRAIESLGGSQSILKVRGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKICHSKSDTIEGIMKVEKEV

Query:  PREDEINLDFEMVARVPEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGKGNTVDVDKEDTMILVPPLFAIKDVPENLVKK
        PREDEINLDFEMVA VPEAYHFEGM DYQHVVA HADA +RK+G+W E+ +PCLGK N +DVDKEDTMILVPPLFA+KDVPE+LV K
Subjt:  PREDEINLDFEMVARVPEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGKGNTVDVDKEDTMILVPPLFAIKDVPENLVKK

XP_038888320.1 general transcription factor 3C polypeptide 5-like [Benincasa hispida] | 1.5e-93 | 89.3
Query:  MGKLKDNTISGLLPTAQIFAIHYPGYPSSKRRAIESLGGSQSILKVRGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKICHSKSDTIEGIMKVEKEV
        MGKLKDN+ISG LPTAQ FA+HYPGYP SKRRAIESLGG+QSILKVR +QS+KLELRFRPEDPYSHPTYGELRPCSCFLLKICHSKSDTIEGIMKVEKEV
Subjt:  MGKLKDNTISGLLPTAQIFAIHYPGYPSSKRRAIESLGGSQSILKVRGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKICHSKSDTIEGIMKVEKEV

Query:  PREDEINLDFEMVARVPEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGKGNTVDVDKEDTMILVPPLFAIKDVPENLVKK
        PREDE NLDFEMVARVPEAYHFEGM+DYQHV+A+HADA RRK+GNWAEM +PCLGKGN VDVDKEDTMILVPPLF+IKDVPENLV K
Subjt:  PREDEINLDFEMVARVPEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGKGNTVDVDKEDTMILVPPLFAIKDVPENLVKK

TrEMBL top hits | e value | % identity
A0A0A0KV91 Uncharacterized protein | 5.2e-87 | 68.38
Query:  MGKLKDNTISGLLPTAQIFAIHYPGYPSSKRRAIESLGGSQSILKVRGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKICHSKSDTIEGIMKVEKEV
        MGKLKDNTISG LPTAQ FA+HYP YPSSK +AIESLGG+QSILKVRGLQSNKLELRFRP DPYSHPTYGELRPCS FLLKICHSKSDT EGIMKVE EV
Subjt:  MGKLKDNTISGLLPTAQIFAIHYPGYPSSKRRAIESLGGSQSILKVRGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKICHSKSDTIEGIMKVEKEV

Query:  PREDEINLDFEMVARVPEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGKGNTVDVDKEDTMILVPPLFAIKDVPENLVKK-------RKEKDL
        P EDE+NLDFEMVARVPEAYHFEGMVDYQHVVA+HADA +RK+GNWAEM +P LGK N +DVDKEDTMILVPPLF+IKDVPENLV K       RK+ + 
Subjt:  PREDEINLDFEMVARVPEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGKGNTVDVDKEDTMILVPPLFAIKDVPENLVKK-------RKEKDL

Query:  LSELEFFDLKAKMEDLSTTEMDIQLAIKGDLLNLYLPEERNLIQKSKLNWLKL
        +            E +   +++  LAI  ++ ++ +  E NL     L +L +
Subjt:  LSELEFFDLKAKMEDLSTTEMDIQLAIKGDLLNLYLPEERNLIQKSKLNWLKL

A0A1S4E516 general transcription factor 3C polypeptide 5-like isoform X1 | 1.2e-86 | 86.7
Query:  MGKLKDNTISGLLPTAQIFAIHYPGYPSSKRRAIESLGGSQSILKVRGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKICHSKSDTIEGIMKVEKEV
        MGKLKDNTISG LP AQ FA+HYPGYPSSK RAIESLGG+QSILKVRGLQSNKLELRFRP DPYSHPTYGELRPCSC LLKICHSK DT EGIMKVE  V
Subjt:  MGKLKDNTISGLLPTAQIFAIHYPGYPSSKRRAIESLGGSQSILKVRGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKICHSKSDTIEGIMKVEKEV

Query:  PREDEINLDFEMVARVPEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGKG-NTVDVDKEDTMILVPPLFAIKDVPENLVKK
        P EDE+NLDFEMVARVPEAYHFEGMVDYQHVVA+HADA  RK+GNWAEM +P LGKG N VDVDKEDTMILVPPLF+IKDVPENLV K
Subjt:  PREDEINLDFEMVARVPEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGKG-NTVDVDKEDTMILVPPLFAIKDVPENLVKK

A0A5A7VI24 General transcription factor 3C polypeptide 5-like isoform X1 | 2.0e-86 | 87.1
Query:  MGKLKDNTISGLLPTAQIFAIHYPGYPSSKRRAIESLGGSQSILKVRGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKICHSKSDTIEGIMKVEKEV
        MGKLKDNTISG LP AQ FA+HYPGYPSSK RAIESLGG+QSILKVRGLQSNKLELRFRP DPYSHPTYGELRPCSC LLKICHSK DT EGIMKVE  V
Subjt:  MGKLKDNTISGLLPTAQIFAIHYPGYPSSKRRAIESLGGSQSILKVRGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKICHSKSDTIEGIMKVEKEV

Query:  PREDEINLDFEMVARVPEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGKG-NTVDVDKEDTMILVPPLFAIKDVPENLV
        P EDE+NLDFEMVARVPEAYHFEGMVDYQHVVA+HADA  RK+GNWAEM +P LGKG N VDVDKEDTMILVPPLF+IKDVPENLV
Subjt:  PREDEINLDFEMVARVPEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGKG-NTVDVDKEDTMILVPPLFAIKDVPENLV

A0A6J1EPX9 general transcription factor 3C polypeptide 5-like | 2.5e-89 | 85.03
Query:  MGKLKDNTISGLLPTAQIFAIHYPGYPSSKRRAIESLGGSQSILKVRGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKICHSKSDTIEGIMKVEKEV
        MGKLKDNTISG LPTAQ+FA+HYPGYPSSKRRA+E+LGG QSILKVR LQSNKLELRFRPEDPYSHPT+GELRPCSCFLLKICH KSD+ EGI KVEKEV
Subjt:  MGKLKDNTISGLLPTAQIFAIHYPGYPSSKRRAIESLGGSQSILKVRGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKICHSKSDTIEGIMKVEKEV

Query:  PREDEINLDFEMVARVPEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGKGNTVDVDKEDTMILVPPLFAIKDVPENLVKK
        PREDEINLDFEMVA VPEAYHFEGM DYQHVVA HADA +RK+G+W EM +PCLGK N +DVDKEDTMILVPPLFA+KDVPE+LV K
Subjt:  PREDEINLDFEMVARVPEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGKGNTVDVDKEDTMILVPPLFAIKDVPENLVKK

A0A6J1JKL2 general transcription factor 3C polypeptide 5-like | 8.8e-87 | 83.96
Query:  MGKLKDNTISGLLPTAQIFAIHYPGYPSSKRRAIESLGGSQSILKVRGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKICHSKSDTIEGIMKVEKEV
        MGKLKDNTISG LPTAQ+FA+HYPGYPSSKRRAIE+LGG QSILKVR LQSNKLELRFRPED YSHPT+GELRPCSCFLLKI H KSD+ EGI  VEKEV
Subjt:  MGKLKDNTISGLLPTAQIFAIHYPGYPSSKRRAIESLGGSQSILKVRGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKICHSKSDTIEGIMKVEKEV

Query:  PREDEINLDFEMVARVPEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGKGNTVDVDKEDTMILVPPLFAIKDVPENLVKK
        PREDEINLDFEMVA VPEAYHFEGM DYQHVVA HADA +RK+GNW EM +PCLGK N +DVDKEDTMILVPP+FA+KDVPE+LV K
Subjt:  PREDEINLDFEMVARVPEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGKGNTVDVDKEDTMILVPPLFAIKDVPENLVKK

SwissProt top hits | e value | % identity
Q54GS8 General transcription factor 3C polypeptide 5 | 2.0e-11 | 23.35
Query:  DNTISGLLPTAQIFAIHYPGYPSSKRRAIESLGGSQSILKV-RGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKI----------------------
        D  I   LPT   + + YP    +  +AIES+GG   I  V +  +   L+L+FRP +P   PT+G   P    LL++                      
Subjt:  DNTISGLLPTAQIFAIHYPGYPSSKRRAIESLGGSQSILKV-RGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKI----------------------

Query:  ------CHSKSDTIEGIMKVEKEVPREDEINLDFEMVARVPEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGK----------------GNTV
                S   T     + +++ P+E+E   D  ++A VP    F+G+ D+Q+++    +  + +  N  +       K                 +  
Subjt:  ------CHSKSDTIEGIMKVEKEVPREDEINLDFEMVARVPEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGK----------------GNTV

Query:  DVDKEDTMILVPPLFAIKDVPENLVKK
        ++ K++ M L+PPLF+  D P+N + K
Subjt:  DVDKEDTMILVPPLFAIKDVPENLVKK

Q8R2T8 General transcription factor 3C polypeptide 5 | 4.4e-11 | 25.45
Query:  QIFAIHYPGYPSSKRRAIESLGGSQSILKVRGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKICHSKSDTIEGIMKVEKEVPREDEINLDFEMVARV
        ++  + YPG   ++ + +++LGG +S+ ++    + +LEL FRP+DPY HP        S  LL+I   ++    G++  E       ++  + E++  +
Subjt:  QIFAIHYPGYPSSKRRAIESLGGSQSILKVRGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKICHSKSDTIEGIMKVEKEVPREDEINLDFEMVARV

Query:  PEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGKGNTVDVDKEDTMILVPPLFAIKDVP
           Y F+GM D+Q+ +A+H +A  +    +  +L     K       +E  + + PP+F+  D P
Subjt:  PEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGKGNTVDVDKEDTMILVPPLFAIKDVP

Q9Y5Q8 General transcription factor 3C polypeptide 5 | 5.3e-12 | 26.06
Query:  QIFAIHYPGYPSSKRRAIESLGGSQSILKVRGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKICHSKSDTIEGIMKVEKEVPREDEINLDFEMVARV
        ++  + YPG      + + +LGG + + ++    + +LEL FRP+DPY HP        S  LL+I   ++   +G++  E       E+  D E++  +
Subjt:  QIFAIHYPGYPSSKRRAIESLGGSQSILKVRGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKICHSKSDTIEGIMKVEKEVPREDEINLDFEMVARV

Query:  PEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGKGNTVDVDKEDTMILVPPLFAIKDVP
           Y F+GM D+Q+ +A+H +A  +    + ++L   L         +E  + + PP+F+  D P
Subjt:  PEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGKGNTVDVDKEDTMILVPPLFAIKDVP

Arabidopsis top hits | e value | % identity
AT1G43760.1 DNAse I-like superfamily protein | 1.1e-04 | 32.2
Query:  QKSKLNWLKLGDENNKFFHRFLSAKRRRYMITELSADSNMVLHSFRDIEAEILGYFTEL
        QKS++ WL+ GD N +FFH+ + A + + +I  L  D ++ + +   ++  I+ Y+T L
Subjt:  QKSKLNWLKLGDENNKFFHRFLSAKRRRYMITELSADSNMVLHSFRDIEAEILGYFTEL

AT3G49410.1 Transcription factor IIIC, subunit 5 | 7.5e-46 | 44.55
Query:  MGKLKDNTISGLLPTAQIFAIHYPGYPSSKRRAIESLGGSQSILKVRGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKICHS--KSDTIEGIMKVEK
        MG +++ TISG LP+ + F +H+PGYPSS  RAIE+LGG Q I + R   SNKLELRFRPEDPY+HP  GE RPCS FLL+I     K    + ++   +
Subjt:  MGKLKDNTISGLLPTAQIFAIHYPGYPSSKRRAIESLGGSQSILKVRGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKICHS--KSDTIEGIMKVEK

Query:  EVPREDEIN-LDFEMVARVPEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGKGNTVDVDKEDTMILVPPLFAIKDVPENLVKK------RKEK
        +V  E+    L  ++VAR+ E++HF+GM DYQHV+ IHAD  ++K+  W + + P  GK + + +  ED M+L+P  FA KD+P+N+  K       K+K
Subjt:  EVPREDEIN-LDFEMVARVPEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGKGNTVDVDKEDTMILVPPLFAIKDVPENLVKK------RKEK

Query:  DLLSELEFFDL
        D  +   F+++
Subjt:  DLLSELEFFDL

AT4G20520.1 RNA binding;RNA-directed DNA polymerases | 9.9e-06 | 32.38
Query:  ILQAVKSLAVEDNRAKKKKGW-ILKLDMEKAFNRVDWSFLEKNLALMGFHPKWVEWISGFLVGKDKV-------HLSILQFADDTLLFCKYDDDIVNCTA
        + +AV S+     R K  KGW +LKLD+EKA++R+ W +LE  L   GF   W+  I+    G  +V         S      D     +YDD     T+
Subjt:  ILQAVKSLAVEDNRAKKKKGW-ILKLDMEKAFNRVDWSFLEKNLALMGFHPKWVEWISGFLVGKDKV-------HLSILQFADDTLLFCKYDDDIVNCTA

Query:  NRLKC
        N + C
Subjt:  NRLKC

AT5G24450.1 Transcription factor IIIC, subunit 5 | 5.3e-44 | 45.12
Query:  MGKLKDNTISGLLPTAQIFAIHYPGYPSSKRRAIESLGGSQSILKVRGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKICHS--KSDTI---EGIMK
        MG +++ TISG LP+ + F +HYPGYPSS  RA+E+LGG Q I   R   SNKLEL FRPEDP +HP YGE R C+ FLLKI     K D++   + ++ 
Subjt:  MGKLKDNTISGLLPTAQIFAIHYPGYPSSKRRAIESLGGSQSILKVRGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKICHS--KSDTI---EGIMK

Query:  VEKEVPREDEINLDFEMVARVPEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGKGNTVDVDKEDTMILVPPLFAIKDVPENLV--------KK
               E    L  ++VARV E+Y F+GMVDYQHV+ IHAD  ++K+  W E +K   GK + +D+  ED M+L+P  F+ KD P+NLV         K
Subjt:  VEKEVPREDEINLDFEMVARVPEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGKGNTVDVDKEDTMILVPPLFAIKDVPENLV--------KK

Query:  RKEKDLLSELEFFDL
        +K+++L   L   D+
Subjt:  RKEKDLLSELEFFDL
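
The % identity values in the hit tables can be related to the alignment midlines. The sketch below is a rough Python illustration, assuming the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a blank a mismatch or gap) and assuming identity is taken over the full alignment length including gap columns. The function name `percent_identity` is hypothetical. The input is the query line and midline of the AT1G43760.1 hit.

```python
def percent_identity(query_line, midline):
    """Identical positions (letters in the midline) over the alignment length.

    The alignment length is taken from the query line, which includes any
    gap characters, so trailing whitespace lost from the midline does not
    change the result.
    """
    if not query_line:
        raise ValueError("empty alignment")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query_line)

# Query line and midline of the AT1G43760.1 alignment in this record.
query = "QKSKLNWLKLGDENNKFFHRFLSAKRRRYMITELSADSNMVLHSFRDIEAEILGYFTEL"
midline = "QKS++ WL+ GD N +FFH+ + A + + +I  L  D ++ + +   ++  I+ Y+T L"

print(round(percent_identity(query, midline), 2))
```

For this hit the midline carries 19 identical residues over 59 aligned columns, which agrees with the 32.2 listed in the table. For alignments that wrap over several blocks, the blocks would need to be concatenated before counting.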


Sequences
CDS sequence
ATGGGGAAGCTGAAAGACAACACTATCTCAGGGCTCTTACCTACTGCCCAGATTTTTGCAATTCACTATCCAGGTTATCCTTCGTCCAAGCGTCGGGCAATTGAAAGTCT
AGGAGGATCACAATCAATTCTCAAGGTTCGTGGTCTTCAATCAAATAAGCTAGAGCTTCGATTTAGACCTGAAGATCCATATTCACATCCAACATATGGGGAGCTTCGCC
CATGCAGTTGCTTTCTTTTGAAAATATGTCATTCAAAAAGTGATACAATCGAGGGAATCATGAAAGTTGAAAAAGAAGTACCTAGAGAAGATGAAATAAACCTTGATTTT
GAGATGGTTGCTCGTGTTCCTGAGGCATATCATTTTGAGGGCATGGTTGACTACCAACATGTCGTTGCTATTCATGCGGATGCTATCCGAAGGAAGAGGGGGAATTGGGC
AGAAATGCTCAAGCCATGTTTAGGGAAGGGTAACACTGTTGATGTGGACAAGGAGGATACTATGATTTTAGTTCCTCCATTATTTGCGATTAAGGATGTGCCTGAAAATT
TAGTAAAGAAGAGAAAGGAGAAGGATCTTCTATCGGAGCTAGAATTCTTTGATTTGAAAGCAAAAATGGAAGATCTTTCAACGACGGAGATGGATATTCAGCTGGCCATT
AAAGGGGATTTATTGAATTTATACCTTCCGGAGGAAAGGAATTTGATCCAAAAGAGTAAATTGAATTGGTTGAAATTGGGGGACGAAAATAATAAATTCTTCCACAGATT
CCTCTCAGCAAAAAGAAGAAGATATATGATTACAGAATTGAGTGCAGATAGTAATATGGTGCTGCATTCTTTTCGGGATATAGAAGCTGAAATATTGGGGTATTTCACTG
AGTTATACTCTAAAATCCTAGGTTCCAGATTCATTCCAGCAGGTTTACCTTGGCAGCAAGTCTCTATTTCACAAAATCATGCATTAATAGCTCCATTTTGGATTGAAGAA
ATACTACAGGCTGTCAAATCACTCGCTGTGGAAGATAATAGGGCTAAAAAGAAGAAAGGGTGGATTCTGAAATTAGATATGGAGAAGGCCTTCAATCGAGTTGATTGGAG
TTTTTTGGAAAAAAATCTTGCATTGATGGGATTCCATCCCAAATGGGTTGAATGGATAAGTGGCTTCTTGGTTGGTAAAGATAAGGTACATTTGTCCATTTTGCAATTTG
CAGATGATACACTTCTTTTTTGCAAATATGACGATGACATTGTGAATTGCACGGCGAATAGATTGAAGTGCCAAGCTGGGAAATTACCTTTCTTATACTTGGGATTGCCT
CTTGGAGGATATTCAAAGCAATATTCTTTTTGGCAACCAATAATTGACAAAGTACATTTGAAGCTTGACAGATGGAAGAGTTTCTGTAGAGATCGGAAGGATAATGTGCA
GATTTTTATGGGAAGGTCAAAGAGGCAGCAGAATCCTAACAATTGGCATCAGAGTAATGTCTTGGTCTCAAAAGAGTCGTTGCAGGGCAGTGGACGTGACAATGCTTCAA
TTGGAGGTGGAAACCAAATAGGGGAAGATGAAGAAGTCAAGGAGAAGAACAAACAACAAACGACAACGTGGTCGGAGATGGAGTTGCGGGGAAAGCGTTGTTGCAAGTTT
TGTGAAGAGGTGATGAAGAAATTGGAGAATGATAACCATGAAGAGAAGATTGAGGAAGGTCAACAACGAGTGTTTATGTGGTTGGAGATGAAGACTTGGGTGAAGAAAAG
GATGTTGCTCAAAGACGCAGAGGATGGTTGGCCGCAATGGAGAAAGAAGACGTCGACAGTTCATGGGTTGGAATGGTGA
mRNA sequence
ATGGGGAAGCTGAAAGACAACACTATCTCAGGGCTCTTACCTACTGCCCAGATTTTTGCAATTCACTATCCAGGTTATCCTTCGTCCAAGCGTCGGGCAATTGAAAGTCT
AGGAGGATCACAATCAATTCTCAAGGTTCGTGGTCTTCAATCAAATAAGCTAGAGCTTCGATTTAGACCTGAAGATCCATATTCACATCCAACATATGGGGAGCTTCGCC
CATGCAGTTGCTTTCTTTTGAAAATATGTCATTCAAAAAGTGATACAATCGAGGGAATCATGAAAGTTGAAAAAGAAGTACCTAGAGAAGATGAAATAAACCTTGATTTT
GAGATGGTTGCTCGTGTTCCTGAGGCATATCATTTTGAGGGCATGGTTGACTACCAACATGTCGTTGCTATTCATGCGGATGCTATCCGAAGGAAGAGGGGGAATTGGGC
AGAAATGCTCAAGCCATGTTTAGGGAAGGGTAACACTGTTGATGTGGACAAGGAGGATACTATGATTTTAGTTCCTCCATTATTTGCGATTAAGGATGTGCCTGAAAATT
TAGTAAAGAAGAGAAAGGAGAAGGATCTTCTATCGGAGCTAGAATTCTTTGATTTGAAAGCAAAAATGGAAGATCTTTCAACGACGGAGATGGATATTCAGCTGGCCATT
AAAGGGGATTTATTGAATTTATACCTTCCGGAGGAAAGGAATTTGATCCAAAAGAGTAAATTGAATTGGTTGAAATTGGGGGACGAAAATAATAAATTCTTCCACAGATT
CCTCTCAGCAAAAAGAAGAAGATATATGATTACAGAATTGAGTGCAGATAGTAATATGGTGCTGCATTCTTTTCGGGATATAGAAGCTGAAATATTGGGGTATTTCACTG
AGTTATACTCTAAAATCCTAGGTTCCAGATTCATTCCAGCAGGTTTACCTTGGCAGCAAGTCTCTATTTCACAAAATCATGCATTAATAGCTCCATTTTGGATTGAAGAA
ATACTACAGGCTGTCAAATCACTCGCTGTGGAAGATAATAGGGCTAAAAAGAAGAAAGGGTGGATTCTGAAATTAGATATGGAGAAGGCCTTCAATCGAGTTGATTGGAG
TTTTTTGGAAAAAAATCTTGCATTGATGGGATTCCATCCCAAATGGGTTGAATGGATAAGTGGCTTCTTGGTTGGTAAAGATAAGGTACATTTGTCCATTTTGCAATTTG
CAGATGATACACTTCTTTTTTGCAAATATGACGATGACATTGTGAATTGCACGGCGAATAGATTGAAGTGCCAAGCTGGGAAATTACCTTTCTTATACTTGGGATTGCCT
CTTGGAGGATATTCAAAGCAATATTCTTTTTGGCAACCAATAATTGACAAAGTACATTTGAAGCTTGACAGATGGAAGAGTTTCTGTAGAGATCGGAAGGATAATGTGCA
GATTTTTATGGGAAGGTCAAAGAGGCAGCAGAATCCTAACAATTGGCATCAGAGTAATGTCTTGGTCTCAAAAGAGTCGTTGCAGGGCAGTGGACGTGACAATGCTTCAA
TTGGAGGTGGAAACCAAATAGGGGAAGATGAAGAAGTCAAGGAGAAGAACAAACAACAAACGACAACGTGGTCGGAGATGGAGTTGCGGGGAAAGCGTTGTTGCAAGTTT
TGTGAAGAGGTGATGAAGAAATTGGAGAATGATAACCATGAAGAGAAGATTGAGGAAGGTCAACAACGAGTGTTTATGTGGTTGGAGATGAAGACTTGGGTGAAGAAAAG
GATGTTGCTCAAAGACGCAGAGGATGGTTGGCCGCAATGGAGAAAGAAGACGTCGACAGTTCATGGGTTGGAATGGTGA
Protein sequence
MGKLKDNTISGLLPTAQIFAIHYPGYPSSKRRAIESLGGSQSILKVRGLQSNKLELRFRPEDPYSHPTYGELRPCSCFLLKICHSKSDTIEGIMKVEKEVPREDEINLDF
EMVARVPEAYHFEGMVDYQHVVAIHADAIRRKRGNWAEMLKPCLGKGNTVDVDKEDTMILVPPLFAIKDVPENLVKKRKEKDLLSELEFFDLKAKMEDLSTTEMDIQLAI
KGDLLNLYLPEERNLIQKSKLNWLKLGDENNKFFHRFLSAKRRRYMITELSADSNMVLHSFRDIEAEILGYFTELYSKILGSRFIPAGLPWQQVSISQNHALIAPFWIEE
ILQAVKSLAVEDNRAKKKKGWILKLDMEKAFNRVDWSFLEKNLALMGFHPKWVEWISGFLVGKDKVHLSILQFADDTLLFCKYDDDIVNCTANRLKCQAGKLPFLYLGLP
LGGYSKQYSFWQPIIDKVHLKLDRWKSFCRDRKDNVQIFMGRSKRQQNPNNWHQSNVLVSKESLQGSGRDNASIGGGNQIGEDEEVKEKNKQQTTTWSEMELRGKRCCKF
CEEVMKKLENDNHEEKIEEGQQRVFMWLEMKTWVKKRMLLKDAEDGWPQWRKKTSTVHGLEW
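
The CDS and protein sequences above can be checked against each other by translating the CDS. Here is a short Python sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The `translate` helper is hypothetical, and the two inputs are the first 30 and last 21 nucleotides of the CDS listed in this record.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon.

    Codons containing anything other than A, C, G, T raise KeyError.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGGGAAGCTGAAAGACAACACTATCTCA"  # first 30 nt of the CDS
cds_end = "GTTCATGGGTTGGAATGGTGA"             # last 21 nt, including the stop codon

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MGKLKDNTIS and VHGLEW, matching the first and last residues of the protein sequence listed above.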