; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0006780 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0006780
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionDNA-directed RNA polymerases IV and V subunit 4-like
Genome locationchr04:3209356..3213022
RNA-Seq ExpressionPI0006780
SyntenyPI0006780
Gene Ontology termsGO:0006352 - DNA-templated transcription, initiation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0030880 - RNA polymerase complex (cellular component)
GO:0000166 - nucleotide binding (molecular function)
GO:0003899 - DNA-directed 5'-3' RNA polymerase activity (molecular function)
InterPro domainsIPR005574 - RNA polymerase subunit RPB4/RPC9
IPR006590 - RNA polymerase Rpb4/RPC9, core
IPR010997 - HRDC-like superfamily
IPR038324 - Rpb4/RPC9 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004134991.2 DNA-directed RNA polymerases IV and V subunit 4 isoform X1 [Cucumis sativus]1.4e-10595Show/hide
Query:  MSEKGEKGFSVQKKPAKSSLKSSALKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNTFSMKYSGKNGDLGKGGK-ANMKASVVKEPQALELKIEQELP
        MSEKGEKGFSVQK+PAKSSLKSSALKDASLKGKDDSLSK KKGRKVQFDAQGSVDA NTFSMKYSGKNGDLGKGGK AN KAS+ KEPQALELKIEQELP
Subjt:  MSEKGEKGFSVQKKPAKSSLKSSALKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNTFSMKYSGKNGDLGKGGK-ANMKASVVKEPQALELKIEQELP

Query:  KNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKKYGVTDSEICVIANVCPDTTDEVFALLPSLKSKR
        KNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYV+AESVRPVLE LKKYGVTDSEICVIANVCPDTTDEVFALLPSLK KR
Subjt:  KNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKKYGVTDSEICVIANVCPDTTDEVFALLPSLKSKR

Query:  SKLSEPLNNVLSELAKVKSS
        SKLSEP+NNVLSELAKVKSS
Subjt:  SKLSEPLNNVLSELAKVKSS

XP_008440955.1 PREDICTED: DNA-directed RNA polymerases IV and V subunit 4 isoform X1 [Cucumis melo]2.4e-10594.55Show/hide
Query:  MSEKGEKGFSVQKKPAKSSLKSSALKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNTFSMKYSGKNGDLGKGGK-ANMKASVVKEPQALELKIEQELP
        MSEKGEKGFSVQKKPAKSSLKSSALKDASLKGKDDSLSK KKGRKVQFDAQGSVDA N+FSMKYSGKNGD+GKGGK AN KASV KEPQ+LELKIEQELP
Subjt:  MSEKGEKGFSVQKKPAKSSLKSSALKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNTFSMKYSGKNGDLGKGGK-ANMKASVVKEPQALELKIEQELP

Query:  KNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKKYGVTDSEICVIANVCPDTTDEVFALLPSLKSKR
        KNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYV+AESVRPVLE LKKYG+TDSEICVIANVCPDTTDEVFALLPSLK+KR
Subjt:  KNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKKYGVTDSEICVIANVCPDTTDEVFALLPSLKSKR

Query:  SKLSEPLNNVLSELAKVKSS
        SKLSEPLNNVLSELAKVKSS
Subjt:  SKLSEPLNNVLSELAKVKSS

XP_008440960.1 PREDICTED: DNA-directed RNA polymerases IV and V subunit 4 isoform X2 [Cucumis melo]7.2e-10292.73Show/hide
Query:  MSEKGEKGFSVQKKPAKSSLKSSALKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNTFSMKYSGKNGDLGKGGK-ANMKASVVKEPQALELKIEQELP
        MSEKGEKGFSVQKKPAKSSLKSSALKD    GKDDSLSK KKGRKVQFDAQGSVDA N+FSMKYSGKNGD+GKGGK AN KASV KEPQ+LELKIEQELP
Subjt:  MSEKGEKGFSVQKKPAKSSLKSSALKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNTFSMKYSGKNGDLGKGGK-ANMKASVVKEPQALELKIEQELP

Query:  KNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKKYGVTDSEICVIANVCPDTTDEVFALLPSLKSKR
        KNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYV+AESVRPVLE LKKYG+TDSEICVIANVCPDTTDEVFALLPSLK+KR
Subjt:  KNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKKYGVTDSEICVIANVCPDTTDEVFALLPSLKSKR

Query:  SKLSEPLNNVLSELAKVKSS
        SKLSEPLNNVLSELAKVKSS
Subjt:  SKLSEPLNNVLSELAKVKSS

XP_011658103.1 DNA-directed RNA polymerases IV and V subunit 4 isoform X2 [Cucumis sativus]4.2e-10293.18Show/hide
Query:  MSEKGEKGFSVQKKPAKSSLKSSALKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNTFSMKYSGKNGDLGKGGK-ANMKASVVKEPQALELKIEQELP
        MSEKGEKGFSVQK+PAKSSLKSSALKD    GKDDSLSK KKGRKVQFDAQGSVDA NTFSMKYSGKNGDLGKGGK AN KAS+ KEPQALELKIEQELP
Subjt:  MSEKGEKGFSVQKKPAKSSLKSSALKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNTFSMKYSGKNGDLGKGGK-ANMKASVVKEPQALELKIEQELP

Query:  KNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKKYGVTDSEICVIANVCPDTTDEVFALLPSLKSKR
        KNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYV+AESVRPVLE LKKYGVTDSEICVIANVCPDTTDEVFALLPSLK KR
Subjt:  KNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKKYGVTDSEICVIANVCPDTTDEVFALLPSLKSKR

Query:  SKLSEPLNNVLSELAKVKSS
        SKLSEP+NNVLSELAKVKSS
Subjt:  SKLSEPLNNVLSELAKVKSS

XP_038881037.1 DNA-directed RNA polymerases IV and V subunit 4-like isoform X1 [Benincasa hispida]6.1e-10191.36Show/hide
Query:  MSEKGEKGFSVQKKPAKSSLKSSALKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNTFSMKYSGKNGDLGKGGK-ANMKASVVKEPQALELKIEQELP
        MS+KGEKGF VQKKPAKSSLKSSALKDASLKGKDDSLSK KKGRKVQFDAQGSVDAQ+TFSMKYSGKNGDLGKGGK  N KAS  KEPQ LELKIEQELP
Subjt:  MSEKGEKGFSVQKKPAKSSLKSSALKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNTFSMKYSGKNGDLGKGGK-ANMKASVVKEPQALELKIEQELP

Query:  KNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKKYGVTDSEICVIANVCPDTTDEVFALLPSLKSKR
        KNVKCQCLMDCEAAQLLQGIQDQMV LSADPTIKIPTSFDRGLQYAKRANHYV++ESVRPVLE LKK+G+T+SEICVIANVCPDTTDEVFALLPSLK KR
Subjt:  KNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKKYGVTDSEICVIANVCPDTTDEVFALLPSLKSKR

Query:  SKLSEPLNNVLSELAKVKSS
        SKLSEPLNNVL ELAKVKSS
Subjt:  SKLSEPLNNVLSELAKVKSS

TrEMBL top hitse value%identityAlignment
A0A0A0KH45 RPOL4c domain-containing protein6.8e-10695Show/hide
Query:  MSEKGEKGFSVQKKPAKSSLKSSALKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNTFSMKYSGKNGDLGKGGK-ANMKASVVKEPQALELKIEQELP
        MSEKGEKGFSVQK+PAKSSLKSSALKDASLKGKDDSLSK KKGRKVQFDAQGSVDA NTFSMKYSGKNGDLGKGGK AN KAS+ KEPQALELKIEQELP
Subjt:  MSEKGEKGFSVQKKPAKSSLKSSALKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNTFSMKYSGKNGDLGKGGK-ANMKASVVKEPQALELKIEQELP

Query:  KNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKKYGVTDSEICVIANVCPDTTDEVFALLPSLKSKR
        KNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYV+AESVRPVLE LKKYGVTDSEICVIANVCPDTTDEVFALLPSLK KR
Subjt:  KNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKKYGVTDSEICVIANVCPDTTDEVFALLPSLKSKR

Query:  SKLSEPLNNVLSELAKVKSS
        SKLSEP+NNVLSELAKVKSS
Subjt:  SKLSEPLNNVLSELAKVKSS

A0A1S3B2C2 DNA-directed RNA polymerases IV and V subunit 4 isoform X11.2e-10594.55Show/hide
Query:  MSEKGEKGFSVQKKPAKSSLKSSALKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNTFSMKYSGKNGDLGKGGK-ANMKASVVKEPQALELKIEQELP
        MSEKGEKGFSVQKKPAKSSLKSSALKDASLKGKDDSLSK KKGRKVQFDAQGSVDA N+FSMKYSGKNGD+GKGGK AN KASV KEPQ+LELKIEQELP
Subjt:  MSEKGEKGFSVQKKPAKSSLKSSALKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNTFSMKYSGKNGDLGKGGK-ANMKASVVKEPQALELKIEQELP

Query:  KNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKKYGVTDSEICVIANVCPDTTDEVFALLPSLKSKR
        KNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYV+AESVRPVLE LKKYG+TDSEICVIANVCPDTTDEVFALLPSLK+KR
Subjt:  KNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKKYGVTDSEICVIANVCPDTTDEVFALLPSLKSKR

Query:  SKLSEPLNNVLSELAKVKSS
        SKLSEPLNNVLSELAKVKSS
Subjt:  SKLSEPLNNVLSELAKVKSS

A0A1S3B2C7 DNA-directed RNA polymerases IV and V subunit 4 isoform X23.5e-10292.73Show/hide
Query:  MSEKGEKGFSVQKKPAKSSLKSSALKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNTFSMKYSGKNGDLGKGGK-ANMKASVVKEPQALELKIEQELP
        MSEKGEKGFSVQKKPAKSSLKSSALKD    GKDDSLSK KKGRKVQFDAQGSVDA N+FSMKYSGKNGD+GKGGK AN KASV KEPQ+LELKIEQELP
Subjt:  MSEKGEKGFSVQKKPAKSSLKSSALKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNTFSMKYSGKNGDLGKGGK-ANMKASVVKEPQALELKIEQELP

Query:  KNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKKYGVTDSEICVIANVCPDTTDEVFALLPSLKSKR
        KNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYV+AESVRPVLE LKKYG+TDSEICVIANVCPDTTDEVFALLPSLK+KR
Subjt:  KNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKKYGVTDSEICVIANVCPDTTDEVFALLPSLKSKR

Query:  SKLSEPLNNVLSELAKVKSS
        SKLSEPLNNVLSELAKVKSS
Subjt:  SKLSEPLNNVLSELAKVKSS

A0A6J1IMM9 DNA-directed RNA polymerases IV and V subunit 4-like isoform X13.1e-9586.76Show/hide
Query:  MSEKGEKGFSVQKKPAKSSLKSSALKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNTFSMKYSGKNGDLGKGGKANMKASVVKEPQALELKIEQELPK
        MSEKGEKG  + KKP KSSLKSS+ KDASLKGKDDSL KPKKGRKVQFDAQGSVDAQ  FSMKYSGKNG+LGKGGK     S  KEPQ LELKIEQELPK
Subjt:  MSEKGEKGFSVQKKPAKSSLKSSALKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNTFSMKYSGKNGDLGKGGKANMKASVVKEPQALELKIEQELPK

Query:  NVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKKYGVTDSEICVIANVCPDTTDEVFALLPSLKSKRS
        NVKCQCLMDCEAAQ+LQGIQDQMV LSADPTIKIPTSFDRGLQYAKRANHYV+ ESVRPVL+ LKKYGV DSE+CV+ANVCPDT DEVFALLPSLKSKRS
Subjt:  NVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKKYGVTDSEICVIANVCPDTTDEVFALLPSLKSKRS

Query:  KLSEPLNNVLSELAKVKSS
        KLSEPLNNVLSELAKVKSS
Subjt:  KLSEPLNNVLSELAKVKSS

A0A6J1KMD1 DNA-directed RNA polymerases IV and V subunit 4-like4.1e-9587.33Show/hide
Query:  MSEKGEKGFSVQKKPAKSSLKSSALKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNTFSMKYSGKNGDLGKGGKA--NMKASVVKEPQALELKIEQEL
        MSEKGEKGF VQKKP KSSLKS A KDASLKGKDDSLSK KKGRKVQFDAQGSVDA    SMK+ GKNGDLGKGGK     KASV KE Q LELKIEQEL
Subjt:  MSEKGEKGFSVQKKPAKSSLKSSALKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNTFSMKYSGKNGDLGKGGKA--NMKASVVKEPQALELKIEQEL

Query:  PKNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKKYGVTDSEICVIANVCPDTTDEVFALLPSLKSK
        PKNVKCQCLMDCEAAQLLQGIQDQMV LSADPTIKIPTSFDRGLQYAKRANHYV+ ESVRPVLE LKKYGV DSEICVIANVCPDT DEVFAL+PSLKSK
Subjt:  PKNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKKYGVTDSEICVIANVCPDTTDEVFALLPSLKSK

Query:  RSKLSEPLNNVLSELAKVKSS
        RSKL+EPLNNVLSELAK+KSS
Subjt:  RSKLSEPLNNVLSELAKVKSS

SwissProt top hitse value%identityAlignment
O48890 DNA-directed RNA polymerase II subunit 43.5e-1938.06Show/hide
Query:  KEPQALELKIEQELPKNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKKYGVTDSEICVIANVCPDT
        +E  A ELKI  E    +K +CLM+CE + +L+   +Q+  +S DP  ++   F++ LQY KR + Y + ++VR V E L ++ +T+ E+CV+ N+CP+T
Subjt:  KEPQALELKIEQELPKNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKKYGVTDSEICVIANVCPDT

Query:  TDEVFALLPSLKSK-RSKLSEPLNNVLSELAKVK
         +E  A++PSLK+K R+   E +  +L++L+ VK
Subjt:  TDEVFALLPSLKSK-RSKLSEPLNNVLSELAKVK

Q54S04 DNA-directed RNA polymerase II subunit rpb42.2e-0527.15Show/hide
Query:  KASVVKEPQALELKIEQELPKNVKCQCLMDCEAAQLLQ---GIQDQ----------MVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKK
        ++ + +E     LK  ++L K+ K   L++ E A LL+   GI +           ++ L  +    I  +F + L YA++ + Y +  S++ V   L K
Subjt:  KASVVKEPQALELKIEQELPKNVKCQCLMDCEAAQLLQ---GIQDQ----------MVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKK

Query:  YGVTDSEICVIANVCPDTTDEVFALLPSLKSKRSKLSEPLNNVLSELAKVK
          + + EI  +AN+CP+ +DE  +L+PSLK       + L  +L EL+ ++
Subjt:  YGVTDSEICVIANVCPDTTDEVFALLPSLKSKRSKLSEPLNNVLSELAKVK

Q6DBA5 DNA-directed RNA polymerases IV and V subunit 47.8e-3543.75Show/hide
Query:  MSEKGEKGFSVQKKPAKSSLKSSALKDASLKGKDDSLSKPKKGRKVQFD-----AQGSVDAQNTFSMKYSGKNGDLGKGGKANMKASVVKEPQALELKIE
        MSEKG KG        KSSLKS   KD    GKD S +K KKGRK+ FD     A   +   ++    +       GK  K    +       + ELK  
Subjt:  MSEKGEKGFSVQKKPAKSSLKSSALKDASLKGKDDSLSKPKKGRKVQFD-----AQGSVDAQNTFSMKYSGKNGDLGKGGKANMKASVVKEPQALELKIE

Query:  QELPKNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKKYGVTDSEICVIANVCPDTTDEVFALLPSL
         +LP+N   +C+MDCEA Q+L GI+ Q+V LS DP+IKIP S+DR L Y +   HY + +SVR VLE LK YG++D E+CVIAN   ++ DEV A +PSL
Subjt:  QELPKNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKKYGVTDSEICVIANVCPDTTDEVFALLPSL

Query:  KSKRSKLSEPLNNVLSELAKVKSS
        K+K+  +++PL + L EL+K+K S
Subjt:  KSKRSKLSEPLNNVLSELAKVKSS

Arabidopsis top hitse value%identityAlignment
AT4G15950.1 RNA polymerase II, Rpb4, core protein5.6e-3643.75Show/hide
Query:  MSEKGEKGFSVQKKPAKSSLKSSALKDASLKGKDDSLSKPKKGRKVQFD-----AQGSVDAQNTFSMKYSGKNGDLGKGGKANMKASVVKEPQALELKIE
        MSEKG KG        KSSLKS   KD    GKD S +K KKGRK+ FD     A   +   ++    +       GK  K    +       + ELK  
Subjt:  MSEKGEKGFSVQKKPAKSSLKSSALKDASLKGKDDSLSKPKKGRKVQFD-----AQGSVDAQNTFSMKYSGKNGDLGKGGKANMKASVVKEPQALELKIE

Query:  QELPKNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKKYGVTDSEICVIANVCPDTTDEVFALLPSL
         +LP+N   +C+MDCEA Q+L GI+ Q+V LS DP+IKIP S+DR L Y +   HY + +SVR VLE LK YG++D E+CVIAN   ++ DEV A +PSL
Subjt:  QELPKNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKKYGVTDSEICVIANVCPDTTDEVFALLPSL

Query:  KSKRSKLSEPLNNVLSELAKVKSS
        K+K+  +++PL + L EL+K+K S
Subjt:  KSKRSKLSEPLNNVLSELAKVKSS

AT5G09920.1 RNA polymerase II, Rpb4, core protein2.5e-2038.06Show/hide
Query:  KEPQALELKIEQELPKNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKKYGVTDSEICVIANVCPDT
        +E  A ELKI  E    +K +CLM+CE + +L+   +Q+  +S DP  ++   F++ LQY KR + Y + ++VR V E L ++ +T+ E+CV+ N+CP+T
Subjt:  KEPQALELKIEQELPKNVKCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKKYGVTDSEICVIANVCPDT

Query:  TDEVFALLPSLKSK-RSKLSEPLNNVLSELAKVK
         +E  A++PSLK+K R+   E +  +L++L+ VK
Subjt:  TDEVFALLPSLKSK-RSKLSEPLNNVLSELAKVK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGGAGAAAGGCGAAAAGGGTTTTTCCGTGCAGAAAAAACCTGCGAAGTCTTCCCTCAAGTCTTCCGCTCTCAAGGATGCTTCTCTAAAAGGAAAGGATGATAGTTT
GTCAAAGCCAAAGAAGGGAAGGAAAGTCCAGTTCGATGCTCAAGGATCAGTTGATGCGCAGAATACTTTTTCGATGAAATACAGTGGCAAAAATGGTGACTTGGGTAAAG
GAGGAAAAGCAAACATGAAGGCTTCTGTTGTGAAGGAACCCCAAGCACTAGAATTGAAGATTGAGCAAGAACTGCCCAAGAATGTTAAATGCCAATGCCTTATGGACTGT
GAGGCTGCACAACTTTTACAGGGAATCCAAGATCAGATGGTTTTTCTATCCGCTGATCCAACCATAAAAATTCCCACGTCATTTGATAGGGGATTGCAATATGCTAAAAG
AGCCAACCACTATGTAGATGCCGAGTCAGTTAGACCAGTTCTTGAAAACCTCAAGAAATATGGCGTAACGGATAGTGAGATATGTGTGATTGCTAATGTCTGCCCAGACA
CTACTGATGAAGTTTTTGCTCTTCTTCCATCCTTGAAGAGCAAAAGAAGCAAGCTAAGTGAACCTCTCAACAATGTCTTGAGTGAGCTAGCCAAGGTAAAATCATCCTGA
mRNA sequenceShow/hide mRNA sequence
AGATAAAAAGTAATAAATTTAGAAGTAAGATAGTTTTGTAGGCTTGATTTTTAAAAACCCCAATAACTTTTTGCATATTTATTGAATAAGAGAATTGAAAGCTTGAAACG
AAGGCTCGTCGTCAGTCGACCCCGTTGCGCAAGACGAAAGGGAAGATTCTCCCATCTTCCGATCAACCAACAGTAATGTCGGAGAAAGGCGAAAAGGGTTTTTCCGTGCA
GAAAAAACCTGCGAAGTCTTCCCTCAAGTCTTCCGCTCTCAAGGATGCTTCTCTAAAAGGAAAGGATGATAGTTTGTCAAAGCCAAAGAAGGGAAGGAAAGTCCAGTTCG
ATGCTCAAGGATCAGTTGATGCGCAGAATACTTTTTCGATGAAATACAGTGGCAAAAATGGTGACTTGGGTAAAGGAGGAAAAGCAAACATGAAGGCTTCTGTTGTGAAG
GAACCCCAAGCACTAGAATTGAAGATTGAGCAAGAACTGCCCAAGAATGTTAAATGCCAATGCCTTATGGACTGTGAGGCTGCACAACTTTTACAGGGAATCCAAGATCA
GATGGTTTTTCTATCCGCTGATCCAACCATAAAAATTCCCACGTCATTTGATAGGGGATTGCAATATGCTAAAAGAGCCAACCACTATGTAGATGCCGAGTCAGTTAGAC
CAGTTCTTGAAAACCTCAAGAAATATGGCGTAACGGATAGTGAGATATGTGTGATTGCTAATGTCTGCCCAGACACTACTGATGAAGTTTTTGCTCTTCTTCCATCCTTG
AAGAGCAAAAGAAGCAAGCTAAGTGAACCTCTCAACAATGTCTTGAGTGAGCTAGCCAAGGTAAAATCATCCTGAAGTCATTGAGTTGCTCCCTGTTGCTGTTGCTGTTG
CTCAGGTCCGGTTGCCCTCTCCCCTTCCTGTGAAGATTGTTTTATGGCATTTGAAAATTAGGACCTTATCTTTTCAATGGCATTCAATAGATATAACTTCGGGGGAGTAA
GTATTAGCCCATTGGAATATTTTGTTCTTGAAGAGATCTGTATCTGCTACTGGCATTGATGATATAGTGTAGAAATATGGGTTAACTTTACTGAGAATGCCTTTCCCAGT
TGAATCAACTCCGATGGTCTGAAAATTAAACTAGAAAACCACTATTATTAATTTTAATGTAAAGTGAGAATGTCTGATGTCAAGGGAGTAATTTCTACTGGATGCACATG
TTTGACTGCCTGTTTATATTGTTGATATCAAGTACTGTTGGGCAACAGTAGAAACTGGTTATTGAAGAGTAAGAAAAATTATAGATATTAAATTTCAATTA
Protein sequenceShow/hide protein sequence
MSEKGEKGFSVQKKPAKSSLKSSALKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNTFSMKYSGKNGDLGKGGKANMKASVVKEPQALELKIEQELPKNVKCQCLMDC
EAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVDAESVRPVLENLKKYGVTDSEICVIANVCPDTTDEVFALLPSLKSKRSKLSEPLNNVLSELAKVKSS