; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G21550 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G21550
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionDNA-directed RNA polymerases IV and V subunit 4-like
Genome locationClcChr01:33084315..33090011
RNA-Seq ExpressionClc01G21550
SyntenyClc01G21550
Gene Ontology termsGO:0006352 - DNA-templated transcription, initiation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0030880 - RNA polymerase complex (cellular component)
GO:0000166 - nucleotide binding (molecular function)
GO:0003899 - DNA-directed 5'-3' RNA polymerase activity (molecular function)
InterPro domainsIPR005574 - RNA polymerase subunit RPB4/RPC9
IPR006590 - RNA polymerase Rpb4/RPC9, core
IPR010997 - HRDC-like superfamily
IPR038324 - Rpb4/RPC9 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6594716.1 DNA-directed RNA polymerases IV and V subunit 4, partial [Cucurbita argyrosperma subsp. sororia]3.2e-9089.9Show/hide
Query:  MSEKGDKGFPVQKKPAKSSLKSSTLKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNNLSMKYSGKNGDLGKGGKGVS-VKASVAKEPQPLELKIEQEL
        MSEKG+KGFPVQKKP KSSLKSS  KDASLKGKDDSLSK KKGRKVQFDAQGSVDAQ NLS+K+SGKNGDLGKGGKG++  KASV+KE QPLELKIEQEL
Subjt:  MSEKGDKGFPVQKKPAKSSLKSSTLKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNNLSMKYSGKNGDLGKGGKGVS-VKASVAKEPQPLELKIEQEL

Query:  PKNVKCQCLMDCEAAQLLQGIHDQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDTTDEVFALLPTLK
        PKNVKCQCLMDCEAAQLLQGI DQMVLLSADPTIKIPTSFDRGLQYAKRANHYVN+ESVRPVLETLKK+GV DSEICVIANVCPDTTDEVFAL+P+LK
Subjt:  PKNVKCQCLMDCEAAQLLQGIHDQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDTTDEVFALLPTLK

XP_004134991.2 DNA-directed RNA polymerases IV and V subunit 4 isoform X1 [Cucumis sativus]3.5e-9290.4Show/hide
Query:  MSEKGDKGFPVQKKPAKSSLKSSTLKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNNLSMKYSGKNGDLGKGGKGVSVKASVAKEPQPLELKIEQELP
        MSEKG+KGF VQK+PAKSSLKSS LKDASLKGKDDSLSK KKGRKVQFDAQGSVDA N  SMKYSGKNGDLGKGGKG + KAS+AKEPQ LELKIEQELP
Subjt:  MSEKGDKGFPVQKKPAKSSLKSSTLKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNNLSMKYSGKNGDLGKGGKGVSVKASVAKEPQPLELKIEQELP

Query:  KNVKCQCLMDCEAAQLLQGIHDQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDTTDEVFALLPTLKK
        KNVKCQCLMDCEAAQLLQGI DQMV LSADPTIKIPTSFDRGLQYAKRANHYVN+ESVRPVLETLKK+GVTDSEICVIANVCPDTTDEVFALLP+LK+
Subjt:  KNVKCQCLMDCEAAQLLQGIHDQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDTTDEVFALLPTLKK

XP_008440955.1 PREDICTED: DNA-directed RNA polymerases IV and V subunit 4 isoform X1 [Cucumis melo]3.5e-9290.86Show/hide
Query:  MSEKGDKGFPVQKKPAKSSLKSSTLKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNNLSMKYSGKNGDLGKGGKGVSVKASVAKEPQPLELKIEQELP
        MSEKG+KGF VQKKPAKSSLKSS LKDASLKGKDDSLSK KKGRKVQFDAQGSVDA N+ SMKYSGKNGD+GKGGKG + KASVAKEPQ LELKIEQELP
Subjt:  MSEKGDKGFPVQKKPAKSSLKSSTLKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNNLSMKYSGKNGDLGKGGKGVSVKASVAKEPQPLELKIEQELP

Query:  KNVKCQCLMDCEAAQLLQGIHDQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDTTDEVFALLPTLK
        KNVKCQCLMDCEAAQLLQGI DQMV LSADPTIKIPTSFDRGLQYAKRANHYVN+ESVRPVLETLKK+G+TDSEICVIANVCPDTTDEVFALLP+LK
Subjt:  KNVKCQCLMDCEAAQLLQGIHDQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDTTDEVFALLPTLK

XP_038881037.1 DNA-directed RNA polymerases IV and V subunit 4-like isoform X1 [Benincasa hispida]9.8e-9592.89Show/hide
Query:  MSEKGDKGFPVQKKPAKSSLKSSTLKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNNLSMKYSGKNGDLGKGGKGVSVKASVAKEPQPLELKIEQELP
        MS+KG+KGFPVQKKPAKSSLKSS LKDASLKGKDDSLSK KKGRKVQFDAQGSVDAQ+  SMKYSGKNGDLGKGGKGV+ KAS AKEPQPLELKIEQELP
Subjt:  MSEKGDKGFPVQKKPAKSSLKSSTLKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNNLSMKYSGKNGDLGKGGKGVSVKASVAKEPQPLELKIEQELP

Query:  KNVKCQCLMDCEAAQLLQGIHDQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDTTDEVFALLPTLK
        KNVKCQCLMDCEAAQLLQGI DQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHG+T+SEICVIANVCPDTTDEVFALLP+LK
Subjt:  KNVKCQCLMDCEAAQLLQGIHDQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDTTDEVFALLPTLK

XP_038881040.1 DNA-directed RNA polymerases IV and V subunit 4-like isoform X2 [Benincasa hispida]3.8e-9190.86Show/hide
Query:  MSEKGDKGFPVQKKPAKSSLKSSTLKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNNLSMKYSGKNGDLGKGGKGVSVKASVAKEPQPLELKIEQELP
        MS+KG+KGFPVQKKPAKSSLKSS LKD    GKDDSLSK KKGRKVQFDAQGSVDAQ+  SMKYSGKNGDLGKGGKGV+ KAS AKEPQPLELKIEQELP
Subjt:  MSEKGDKGFPVQKKPAKSSLKSSTLKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNNLSMKYSGKNGDLGKGGKGVSVKASVAKEPQPLELKIEQELP

Query:  KNVKCQCLMDCEAAQLLQGIHDQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDTTDEVFALLPTLK
        KNVKCQCLMDCEAAQLLQGI DQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHG+T+SEICVIANVCPDTTDEVFALLP+LK
Subjt:  KNVKCQCLMDCEAAQLLQGIHDQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDTTDEVFALLPTLK

TrEMBL top hitse value%identityAlignment
A0A0A0KH45 RPOL4c domain-containing protein1.7e-9290.4Show/hide
Query:  MSEKGDKGFPVQKKPAKSSLKSSTLKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNNLSMKYSGKNGDLGKGGKGVSVKASVAKEPQPLELKIEQELP
        MSEKG+KGF VQK+PAKSSLKSS LKDASLKGKDDSLSK KKGRKVQFDAQGSVDA N  SMKYSGKNGDLGKGGKG + KAS+AKEPQ LELKIEQELP
Subjt:  MSEKGDKGFPVQKKPAKSSLKSSTLKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNNLSMKYSGKNGDLGKGGKGVSVKASVAKEPQPLELKIEQELP

Query:  KNVKCQCLMDCEAAQLLQGIHDQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDTTDEVFALLPTLKK
        KNVKCQCLMDCEAAQLLQGI DQMV LSADPTIKIPTSFDRGLQYAKRANHYVN+ESVRPVLETLKK+GVTDSEICVIANVCPDTTDEVFALLP+LK+
Subjt:  KNVKCQCLMDCEAAQLLQGIHDQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDTTDEVFALLPTLKK

A0A1S3B2C2 DNA-directed RNA polymerases IV and V subunit 4 isoform X11.7e-9290.86Show/hide
Query:  MSEKGDKGFPVQKKPAKSSLKSSTLKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNNLSMKYSGKNGDLGKGGKGVSVKASVAKEPQPLELKIEQELP
        MSEKG+KGF VQKKPAKSSLKSS LKDASLKGKDDSLSK KKGRKVQFDAQGSVDA N+ SMKYSGKNGD+GKGGKG + KASVAKEPQ LELKIEQELP
Subjt:  MSEKGDKGFPVQKKPAKSSLKSSTLKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNNLSMKYSGKNGDLGKGGKGVSVKASVAKEPQPLELKIEQELP

Query:  KNVKCQCLMDCEAAQLLQGIHDQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDTTDEVFALLPTLK
        KNVKCQCLMDCEAAQLLQGI DQMV LSADPTIKIPTSFDRGLQYAKRANHYVN+ESVRPVLETLKK+G+TDSEICVIANVCPDTTDEVFALLP+LK
Subjt:  KNVKCQCLMDCEAAQLLQGIHDQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDTTDEVFALLPTLK

A0A1S3B2C7 DNA-directed RNA polymerases IV and V subunit 4 isoform X26.6e-8988.83Show/hide
Query:  MSEKGDKGFPVQKKPAKSSLKSSTLKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNNLSMKYSGKNGDLGKGGKGVSVKASVAKEPQPLELKIEQELP
        MSEKG+KGF VQKKPAKSSLKSS LKD    GKDDSLSK KKGRKVQFDAQGSVDA N+ SMKYSGKNGD+GKGGKG + KASVAKEPQ LELKIEQELP
Subjt:  MSEKGDKGFPVQKKPAKSSLKSSTLKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNNLSMKYSGKNGDLGKGGKGVSVKASVAKEPQPLELKIEQELP

Query:  KNVKCQCLMDCEAAQLLQGIHDQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDTTDEVFALLPTLK
        KNVKCQCLMDCEAAQLLQGI DQMV LSADPTIKIPTSFDRGLQYAKRANHYVN+ESVRPVLETLKK+G+TDSEICVIANVCPDTTDEVFALLP+LK
Subjt:  KNVKCQCLMDCEAAQLLQGIHDQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDTTDEVFALLPTLK

A0A6J1EJD9 DNA-directed RNA polymerases IV and V subunit 4-like isoform X15.1e-8988.38Show/hide
Query:  MSEKGDKGFPVQKKPAKSSLKSSTLKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNNLSMKYSGKNGDLGKGGKGVS-VKASVAKEPQPLELKIEQEL
        MSEKG+KGFPVQKKP KSSLKSS  KDASLKGKDDSLSK KKGRKVQFDAQGSVDAQ NLS+K+SGKNGDLGKGGKG++  KASV+KE QPLELKIEQEL
Subjt:  MSEKGDKGFPVQKKPAKSSLKSSTLKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNNLSMKYSGKNGDLGKGGKGVS-VKASVAKEPQPLELKIEQEL

Query:  PKNVKCQCLMDCEAAQLLQGIHDQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDTTDEVFALLPTLK
        PKNVKCQCLMDCEAAQLLQGI DQM LLSADPTIKIPTSFDRGLQYAKRANHYVN+E VRPVLETLKK+GV DSEICVIANVCPDTTDEVF+L+P+LK
Subjt:  PKNVKCQCLMDCEAAQLLQGIHDQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDTTDEVFALLPTLK

A0A6J1KMD1 DNA-directed RNA polymerases IV and V subunit 4-like1.9e-8888.38Show/hide
Query:  MSEKGDKGFPVQKKPAKSSLKSSTLKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNNLSMKYSGKNGDLGKGGKGVS-VKASVAKEPQPLELKIEQEL
        MSEKG+KGFPVQKKP KSSLKS   KDASLKGKDDSLSK KKGRKVQFDAQGSVDA  NLSMK+ GKNGDLGKGGKG++  KASV+KE QPLELKIEQEL
Subjt:  MSEKGDKGFPVQKKPAKSSLKSSTLKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQNNLSMKYSGKNGDLGKGGKGVS-VKASVAKEPQPLELKIEQEL

Query:  PKNVKCQCLMDCEAAQLLQGIHDQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDTTDEVFALLPTLK
        PKNVKCQCLMDCEAAQLLQGI DQMVLLSADPTIKIPTSFDRGLQYAKRANHYVN+ESVRPVLETLKK+GV DSEICVIANVCPDT DEVFAL+P+LK
Subjt:  PKNVKCQCLMDCEAAQLLQGIHDQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDTTDEVFALLPTLK

SwissProt top hitse value%identityAlignment
O48890 DNA-directed RNA polymerase II subunit 45.9e-1839.29Show/hide
Query:  KEPQPLELKIEQELPKNVKCQCLMDCEAAQLLQGIHDQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDT
        +E    ELKI  E    +K +CLM+CE + +L+   +Q+  +S DP  ++   F++ LQY KR + Y N ++VR V E L +H +T+ E+CV+ N+CP+T
Subjt:  KEPQPLELKIEQELPKNVKCQCLMDCEAAQLLQGIHDQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDT

Query:  TDEVFALLPTLK
         +E  A++P+LK
Subjt:  TDEVFALLPTLK

Q54S04 DNA-directed RNA polymerase II subunit rpb46.7e-0637.7Show/hide
Query:  SFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDTTDEVFALLPTLKK
        +F + L YA++ + Y N  S++ V   L K  + + EI  +AN+CP+ +DE  +L+P+LKK
Subjt:  SFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDTTDEVFALLPTLKK

Q6DBA5 DNA-directed RNA polymerases IV and V subunit 41.1e-2945.23Show/hide
Query:  MSEKGDKGFPVQKKPAKSSLKSSTLKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQN--NLSMKYSGKNGDLGKGGKGVSVKASVAKEPQPLELKIEQE
        MSEKG KG        KSSLKS   KD    GKD S +K KKGRK+ FD +         N+S           K GK      S        ELK   +
Subjt:  MSEKGDKGFPVQKKPAKSSLKSSTLKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQN--NLSMKYSGKNGDLGKGGKGVSVKASVAKEPQPLELKIEQE

Query:  LPKNVKCQCLMDCEAAQLLQGIHDQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDTTDEVFALLPTLK
        LP+N   +C+MDCEA Q+L GI  Q+V LS DP+IKIP S+DR L Y +   HY N +SVR VLE LK +G++D E+CVIAN   ++ DEV A +P+LK
Subjt:  LPKNVKCQCLMDCEAAQLLQGIHDQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDTTDEVFALLPTLK

Arabidopsis top hitse value%identityAlignment
AT4G15950.1 RNA polymerase II, Rpb4, core protein8.1e-3145.23Show/hide
Query:  MSEKGDKGFPVQKKPAKSSLKSSTLKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQN--NLSMKYSGKNGDLGKGGKGVSVKASVAKEPQPLELKIEQE
        MSEKG KG        KSSLKS   KD    GKD S +K KKGRK+ FD +         N+S           K GK      S        ELK   +
Subjt:  MSEKGDKGFPVQKKPAKSSLKSSTLKDASLKGKDDSLSKPKKGRKVQFDAQGSVDAQN--NLSMKYSGKNGDLGKGGKGVSVKASVAKEPQPLELKIEQE

Query:  LPKNVKCQCLMDCEAAQLLQGIHDQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDTTDEVFALLPTLK
        LP+N   +C+MDCEA Q+L GI  Q+V LS DP+IKIP S+DR L Y +   HY N +SVR VLE LK +G++D E+CVIAN   ++ DEV A +P+LK
Subjt:  LPKNVKCQCLMDCEAAQLLQGIHDQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDTTDEVFALLPTLK

AT5G09920.1 RNA polymerase II, Rpb4, core protein4.2e-1939.29Show/hide
Query:  KEPQPLELKIEQELPKNVKCQCLMDCEAAQLLQGIHDQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDT
        +E    ELKI  E    +K +CLM+CE + +L+   +Q+  +S DP  ++   F++ LQY KR + Y N ++VR V E L +H +T+ E+CV+ N+CP+T
Subjt:  KEPQPLELKIEQELPKNVKCQCLMDCEAAQLLQGIHDQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLKKHGVTDSEICVIANVCPDT

Query:  TDEVFALLPTLK
         +E  A++P+LK
Subjt:  TDEVFALLPTLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAAGCAAACTATTGAATCCTTCAGAATTGGAGGTTGGAACCAGGCTCGTCAGTCGTCCCATTGCGCAAGACGAAAGGAAAGATTCTCCCATTTTCCGATCAACCAA
CAGGTTTTGCTTGGAGCTTAATCGAACTGAAACGGGGGAACAATTTTCAGCAATGTCGGAGAAAGGCGATAAGGGTTTTCCCGTGCAGAAAAAACCTGCAAAGTCTTCCC
TCAAATCTTCCACTCTCAAGGATGCTTCTCTAAAAGGAAAGGATGATAGTTTGTCAAAGCCAAAGAAGGGAAGGAAAGTCCAGTTCGATGCTCAAGGATCAGTTGATGCG
CAGAATAATCTTTCAATGAAATACAGTGGCAAAAATGGTGACTTGGGTAAAGGAGGAAAAGGTGTAAGCGTGAAGGCTTCTGTTGCAAAGGAACCCCAACCACTAGAATT
GAAGATTGAGCAAGAACTTCCCAAGAATGTTAAATGCCAATGCCTTATGGACTGTGAGGCTGCACAACTTTTACAGGGAATCCATGATCAGATGGTTCTTCTATCTGCAG
ATCCAACCATAAAAATTCCTACGTCATTTGATAGGGGATTGCAATACGCTAAAAGAGCCAACCACTATGTAAATAGCGAGTCAGTTAGACCAGTTCTCGAAACCCTCAAG
AAACATGGCGTAACAGATAGTGAGATATGTGTGATTGCTAATGTCTGCCCAGACACTACTGATGAAGTTTTTGCTCTTCTTCCAACCTTGAAGAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGAAGCAAACTATTGAATCCTTCAGAATTGGAGGTTGGAACCAGGCTCGTCAGTCGTCCCATTGCGCAAGACGAAAGGAAAGATTCTCCCATTTTCCGATCAACCAA
CAGGTTTTGCTTGGAGCTTAATCGAACTGAAACGGGGGAACAATTTTCAGCAATGTCGGAGAAAGGCGATAAGGGTTTTCCCGTGCAGAAAAAACCTGCAAAGTCTTCCC
TCAAATCTTCCACTCTCAAGGATGCTTCTCTAAAAGGAAAGGATGATAGTTTGTCAAAGCCAAAGAAGGGAAGGAAAGTCCAGTTCGATGCTCAAGGATCAGTTGATGCG
CAGAATAATCTTTCAATGAAATACAGTGGCAAAAATGGTGACTTGGGTAAAGGAGGAAAAGGTGTAAGCGTGAAGGCTTCTGTTGCAAAGGAACCCCAACCACTAGAATT
GAAGATTGAGCAAGAACTTCCCAAGAATGTTAAATGCCAATGCCTTATGGACTGTGAGGCTGCACAACTTTTACAGGGAATCCATGATCAGATGGTTCTTCTATCTGCAG
ATCCAACCATAAAAATTCCTACGTCATTTGATAGGGGATTGCAATACGCTAAAAGAGCCAACCACTATGTAAATAGCGAGTCAGTTAGACCAGTTCTCGAAACCCTCAAG
AAACATGGCGTAACAGATAGTGAGATATGTGTGATTGCTAATGTCTGCCCAGACACTACTGATGAAGTTTTTGCTCTTCTTCCAACCTTGAAGAAGTAG
Protein sequenceShow/hide protein sequence
MRSKLLNPSELEVGTRLVSRPIAQDERKDSPIFRSTNRFCLELNRTETGEQFSAMSEKGDKGFPVQKKPAKSSLKSSTLKDASLKGKDDSLSKPKKGRKVQFDAQGSVDA
QNNLSMKYSGKNGDLGKGGKGVSVKASVAKEPQPLELKIEQELPKNVKCQCLMDCEAAQLLQGIHDQMVLLSADPTIKIPTSFDRGLQYAKRANHYVNSESVRPVLETLK
KHGVTDSEICVIANVCPDTTDEVFALLPTLKK