; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC09G162700 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC09G162700
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionDNA-directed RNA polymerase III subunit RPC7-like isoform X1
Genome locationCicolChr09:218391..230547
RNA-Seq ExpressionCcUC09G162700
SyntenyCcUC09G162700
Gene Ontology termsGO:0006383 - transcription by RNA polymerase III (biological process)
GO:0005634 - nucleus (cellular component)
InterPro domainsIPR024661 - DNA-directed RNA polymerase III, subunit Rpc31


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575499.1 hypothetical protein SDJN03_26138, partial [Cucurbita argyrosperma subsp. sororia]1.5e-6869.07Show/hide
Query:  CYSHNSRKSSKR-----GMAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKVQRTEIE
        C+SHN  KSS+R     GMAFRGRGRGR GGGG+FQYAKQEPFELFPE    NVTLP+VSD+PE + L + N++ LNYWKASPFYLEENVMKK+Q+TEIE
Subjt:  CYSHNSRKSSKR-----GMAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKVQRTEIE

Query:  KFSDRSKSNCTLKRDSLAQILQLTSRNFPEELVEDTLLLNFQVSKGSCGPKEKFNGIP----------------AQGQDKDDKEKKEGEEGEDEDEDEDD
        +FSDR+KSN TLKRDSLAQILQLTSRNFPEELVE          KG    K K    P                 +GQ+KDDKEKKEGE  EDEDE++DD
Subjt:  KFSDRSKSNCTLKRDSLAQILQLTSRNFPEELVEDTLLLNFQVSKGSCGPKEKFNGIP----------------AQGQDKDDKEKKEGEEGEDEDEDEDD

Query:  AQSEELTDDDYYQNEYFDDDEDDYNMEEEGGDEPEY
        AQSEELTDDDYYQNEYFDDDEDDYNMEEEGGDEPEY
Subjt:  AQSEELTDDDYYQNEYFDDDEDDYNMEEEGGDEPEY

XP_008462407.1 PREDICTED: DNA-directed RNA polymerase III subunit RPC7-like isoform X1 [Cucumis melo]7.7e-6872.94Show/hide
Query:  MAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKVQRTEIEKFSDRSKSNCTLKRDSLA
        MAFRGRGRGRGGGGG+FQYAKQEPFELFPE    NVTLPSVS+MPEE +LA+    FL YWKASPFYLEENVMKK+QRTEIEKFSDR K N TLKRDSLA
Subjt:  MAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKVQRTEIEKFSDRSKSNCTLKRDSLA

Query:  QILQLTSRNFPEELVEDTLLLNFQVSKGSCGPKEKFNGIP----------------AQGQDKDDKEKKEGEEGEDEDEDEDDAQSEELTDDDYYQNEYFD
        QI+QLTSRNFPEELVE          KG    K K    P                 +GQDK+DKEKKEGEEGEDEDE+E+DAQSEELTDDDYYQNEYFD
Subjt:  QILQLTSRNFPEELVEDTLLLNFQVSKGSCGPKEKFNGIP----------------AQGQDKDDKEKKEGEEGEDEDEDEDDAQSEELTDDDYYQNEYFD

Query:  DDEDDYNMEEEGGDEPEY
        DDEDDYNMEEEGGDEPEY
Subjt:  DDEDDYNMEEEGGDEPEY

XP_022154143.1 ribosomal L1 domain-containing protein CG13096-like [Momordica charantia]1.5e-6671.69Show/hide
Query:  MAFR-GRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKVQRTEIEKFSDRSKSNCTLKRDSL
        MAFR GRGRGRGGGGGAFQYAKQEPFELFPE    NVTLPSVSD+PEE+ L + N++ LNYWKASPF+LEENV+KK+QRTEIEKFSDRSK N TLKRDSL
Subjt:  MAFR-GRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKVQRTEIEKFSDRSKSNCTLKRDSL

Query:  AQILQLTSRNFPEELVEDTLLLNFQVSKGSCGPKEKFNGIP----------------AQGQDKDDKEKKEGEEGEDEDEDEDDAQSEELTDDDYYQNEYF
        AQILQLTSRNFPEELVE          KG    K K    P                 +GQDKDDKEKKEGEEGEDE+++EDDAQSEELTDDDYYQNEYF
Subjt:  AQILQLTSRNFPEELVEDTLLLNFQVSKGSCGPKEKFNGIP----------------AQGQDKDDKEKKEGEEGEDEDEDEDDAQSEELTDDDYYQNEYF

Query:  DDDEDDYNMEEEGGDEPEY
        DDDEDDYNME++GGDEP Y
Subjt:  DDDEDDYNMEEEGGDEPEY

XP_038898860.1 glutamic acid-rich protein-like isoform X1 [Benincasa hispida]2.2e-7065.64Show/hide
Query:  MAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGS-----------------------------------------SNVTLPSVSDMPEERSLALRNNKFLN
        MAFRGRGRGRGGGGGAFQYAKQEPFELFPEV                                            NVTLP VSDMPEE+SLA+RNNKFLN
Subjt:  MAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGS-----------------------------------------SNVTLPSVSDMPEERSLALRNNKFLN

Query:  YWKASPFYLEENVMKKVQRTEIEKFSDRSKSNCTLKRDSLAQILQLTSRNFPEELVEDTLLLNFQVSKGSCGPKEKFNGIP----------------AQG
        YWKASPFYLEENVMKK+QRTE+EKFSDRSKSN TLKRDSLAQILQLTSRNFPEELV+          KG    K K    P                 +G
Subjt:  YWKASPFYLEENVMKKVQRTEIEKFSDRSKSNCTLKRDSLAQILQLTSRNFPEELVEDTLLLNFQVSKGSCGPKEKFNGIP----------------AQG

Query:  QDKDDKEKKEGEEGEDEDEDEDDAQSEELTDDDYYQNEYFDDDEDDYNMEEEGGDEPEY
        QDKDDKEKKEGEEGEDEDE+EDDAQSEELTDDDYYQNEYFDDDEDDYNMEEEGGDEPEY
Subjt:  QDKDDKEKKEGEEGEDEDEDEDDAQSEELTDDDYYQNEYFDDDEDDYNMEEEGGDEPEY

XP_038898864.1 glutamic acid-rich protein-like isoform X2 [Benincasa hispida]1.4e-7477.52Show/hide
Query:  MAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKVQRTEIEKFSDRSKSNCTLKRDSLA
        MAFRGRGRGRGGGGGAFQYAKQEPFELFPE    NVTLP VSDMPEE+SLA+RNNKFLNYWKASPFYLEENVMKK+QRTE+EKFSDRSKSN TLKRDSLA
Subjt:  MAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKVQRTEIEKFSDRSKSNCTLKRDSLA

Query:  QILQLTSRNFPEELVEDTLLLNFQVSKGSCGPKEKFNGIP----------------AQGQDKDDKEKKEGEEGEDEDEDEDDAQSEELTDDDYYQNEYFD
        QILQLTSRNFPEELV+          KG    K K    P                 +GQDKDDKEKKEGEEGEDEDE+EDDAQSEELTDDDYYQNEYFD
Subjt:  QILQLTSRNFPEELVEDTLLLNFQVSKGSCGPKEKFNGIP----------------AQGQDKDDKEKKEGEEGEDEDEDEDDAQSEELTDDDYYQNEYFD

Query:  DDEDDYNMEEEGGDEPEY
        DDEDDYNMEEEGGDEPEY
Subjt:  DDEDDYNMEEEGGDEPEY

TrEMBL top hitse value%identityAlignment
A0A0A0KCT3 Uncharacterized protein8.3e-6869.53Show/hide
Query:  YSHNSRKSSKRGMAFRG---RGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKVQRTEIEKFS
        Y HN  KSSKR   FRG   RGRGRGGGGG+FQYAKQEPFELFPE    NVTLPS+ +MPEE +LA+ +  F+ YWKASPFYLEENVMKK+QRTEIEKFS
Subjt:  YSHNSRKSSKRGMAFRG---RGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKVQRTEIEKFS

Query:  DRSKSNCTLKRDSLAQILQLTSRNFPEELVEDTLLLNFQVSKGSCGPKEKFNGIP----------------AQGQDKDDKEKKEGEEGEDEDEDEDDAQS
        DR+K N TLKRDSLAQI+QLTSRNFPEELVE          KG    K K    P                 +GQDK+DKEKKEGEEGEDEDE+E+DAQS
Subjt:  DRSKSNCTLKRDSLAQILQLTSRNFPEELVEDTLLLNFQVSKGSCGPKEKFNGIP----------------AQGQDKDDKEKKEGEEGEDEDEDEDDAQS

Query:  EELTDDDYYQNEYFDDDEDDYNMEEEGGDEPEY
        EELTDDDYYQNEYFDDDEDDYNMEEEGGDEPEY
Subjt:  EELTDDDYYQNEYFDDDEDDYNMEEEGGDEPEY

A0A1S3CGV4 DNA-directed RNA polymerase III subunit3.7e-6872.94Show/hide
Query:  MAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKVQRTEIEKFSDRSKSNCTLKRDSLA
        MAFRGRGRGRGGGGG+FQYAKQEPFELFPE    NVTLPSVS+MPEE +LA+    FL YWKASPFYLEENVMKK+QRTEIEKFSDR K N TLKRDSLA
Subjt:  MAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKVQRTEIEKFSDRSKSNCTLKRDSLA

Query:  QILQLTSRNFPEELVEDTLLLNFQVSKGSCGPKEKFNGIP----------------AQGQDKDDKEKKEGEEGEDEDEDEDDAQSEELTDDDYYQNEYFD
        QI+QLTSRNFPEELVE          KG    K K    P                 +GQDK+DKEKKEGEEGEDEDE+E+DAQSEELTDDDYYQNEYFD
Subjt:  QILQLTSRNFPEELVEDTLLLNFQVSKGSCGPKEKFNGIP----------------AQGQDKDDKEKKEGEEGEDEDEDEDDAQSEELTDDDYYQNEYFD

Query:  DDEDDYNMEEEGGDEPEY
        DDEDDYNMEEEGGDEPEY
Subjt:  DDEDDYNMEEEGGDEPEY

A0A5D3BVU1 DNA-directed RNA polymerase III subunit RPC7-like isoform X18.6e-6567.37Show/hide
Query:  MAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKVQRTEIEKFSDRSKSNCTLKRDSLA
        MAFRGRGRGRGGGGG+FQYAKQEPFELFPE    NVTLPSVS+MPEE +LA+    FL YWKASPFYLEENVMKK+QRTEIEKFSDR K N TLKRDSLA
Subjt:  MAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKVQRTEIEKFSDRSKSNCTLKRDSLA

Query:  QILQLTSRNFPEELVEDTLLLNFQVSKGSCGPKEKFNGIP----------------AQGQDKDDKEKKEGEEGEDEDEDEDDAQSEELTDDDYYQNEYFD
        QI+QLTSRNFPEELVE          KG    K K    P                 +GQDK+DKEKKEGEEGEDEDE+E+DAQSEELTDDDYYQNEYFD
Subjt:  QILQLTSRNFPEELVEDTLLLNFQVSKGSCGPKEKFNGIP----------------AQGQDKDDKEKKEGEEGEDEDEDEDDAQSEELTDDDYYQNEYFD

Query:  DDEDDYNMEEEGG------------------DEPEY
        DDEDDYNMEEEGG                  DEPEY
Subjt:  DDEDDYNMEEEGG------------------DEPEY

A0A6J1DMV9 ribosomal L1 domain-containing protein CG13096-like7.0e-6771.69Show/hide
Query:  MAFR-GRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKVQRTEIEKFSDRSKSNCTLKRDSL
        MAFR GRGRGRGGGGGAFQYAKQEPFELFPE    NVTLPSVSD+PEE+ L + N++ LNYWKASPF+LEENV+KK+QRTEIEKFSDRSK N TLKRDSL
Subjt:  MAFR-GRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKVQRTEIEKFSDRSKSNCTLKRDSL

Query:  AQILQLTSRNFPEELVEDTLLLNFQVSKGSCGPKEKFNGIP----------------AQGQDKDDKEKKEGEEGEDEDEDEDDAQSEELTDDDYYQNEYF
        AQILQLTSRNFPEELVE          KG    K K    P                 +GQDKDDKEKKEGEEGEDE+++EDDAQSEELTDDDYYQNEYF
Subjt:  AQILQLTSRNFPEELVEDTLLLNFQVSKGSCGPKEKFNGIP----------------AQGQDKDDKEKKEGEEGEDEDEDEDDAQSEELTDDDYYQNEYF

Query:  DDDEDDYNMEEEGGDEPEY
        DDDEDDYNME++GGDEP Y
Subjt:  DDDEDDYNMEEEGGDEPEY

A0A6J1GNY2 glutamic acid-rich protein-like5.0e-6570.64Show/hide
Query:  MAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKVQRTEIEKFSDRSKSNCTLKRDSLA
        MAFRGRGRGR GGGG+FQYAKQEPFELFPE    NVTLP+VSD+PE + L + N++ LNYWKASPFYLEENVMKK+Q+TEIE+FSDR+KSN TLKRDSLA
Subjt:  MAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKVQRTEIEKFSDRSKSNCTLKRDSLA

Query:  QILQLTSRNFPEELVEDTLLLNFQVSKGSCGPKEKFNGIP----------------AQGQDKDDKEKKEGEEGEDEDEDEDDAQSEELTDDDYYQNEYFD
        QILQLTSRNFPEELVE          KG    K K    P                 +GQ+KDDKEKKEGE  EDEDE++DDAQSEELTDDDYYQNEYFD
Subjt:  QILQLTSRNFPEELVEDTLLLNFQVSKGSCGPKEKFNGIP----------------AQGQDKDDKEKKEGEEGEDEDEDEDDAQSEELTDDDYYQNEYFD

Query:  DDEDDYNMEEEGGDEPEY
        DDEDDYNMEEEGGDEPEY
Subjt:  DDEDDYNMEEEGGDEPEY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G01590.1 unknown protein3.2e-1133.02Show/hide
Query:  MAFRG-RGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKVQR--TEIEKFSD----RSKSNCT
        M+++G RG+ +G GG    Y K EPF +FPE     +TLP    +  +  L      F  +W+ SP++L +  + K ++    IE++SD    + KSN  
Subjt:  MAFRG-RGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKVQR--TEIEKFSD----RSKSNCT

Query:  LKRDSLAQILQLTSRNFPEELVEDTLLLNFQVSKGSCGPK------EKFNGIPAQGQDKDDKEKKEGEEGEDEDEDEDDAQSEELTDDDYYQNEYFDDDE
         K  S    L L   NFP+EL+ DT      V +     +      + F  + A+ + +  +EK+EGE    +DE+  +++ EE  + DY QN+ FDDD+
Subjt:  LKRDSLAQILQLTSRNFPEELVEDTLLLNFQVSKGSCGPK------EKFNGIPAQGQDKDDKEKKEGEEGEDEDEDEDDAQSEELTDDDYYQNEYFDDDE

Query:  DDYNMEEEGGDEPEY
        DDYN E++G  E  Y
Subjt:  DDYNMEEEGGDEPEY

AT4G01590.2 unknown protein1.4e-1133.02Show/hide
Query:  MAFRG-RGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKVQR--TEIEKFSD----RSKSNCT
        M+++G RG+ +G GG    Y K EPF +FPE     +TLP    +  +  L      F  +W+ SP++L +  + K ++    IE++SD    + KSN  
Subjt:  MAFRG-RGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKVQR--TEIEKFSD----RSKSNCT

Query:  LKRDSLAQILQLTSRNFPEELVEDTLLLNFQVSKGSCGPK------EKFNGIPAQGQDKDDKEKKEGEEGEDEDEDEDDAQSEELTDDDYYQNEYFDDDE
         K  S    L L   NFP+EL+ DT      V +     +      + F  + A+ + +  +EK+EGE    +DE+  +++ EE  + DY QN+ FDDD+
Subjt:  LKRDSLAQILQLTSRNFPEELVEDTLLLNFQVSKGSCGPK------EKFNGIPAQGQDKDDKEKKEGEEGEDEDEDEDDAQSEELTDDDYYQNEYFDDDE

Query:  DDYNMEEEGGDE
        DDYN E++G +E
Subjt:  DDYNMEEEGGDE

AT4G01590.3 unknown protein3.2e-1133.02Show/hide
Query:  MAFRG-RGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKVQR--TEIEKFSD----RSKSNCT
        M+++G RG+ +G GG    Y K EPF +FPE     +TLP    +  +  L      F  +W+ SP++L +  + K ++    IE++SD    + KSN  
Subjt:  MAFRG-RGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKVQR--TEIEKFSD----RSKSNCT

Query:  LKRDSLAQILQLTSRNFPEELVEDTLLLNFQVSKGSCGPK------EKFNGIPAQGQDKDDKEKKEGEEGEDEDEDEDDAQSEELTDDDYYQNEYFDDDE
         K  S    L L   NFP+EL+ DT      V +     +      + F  + A+ + +  +EK+EGE    +DE+  +++ EE  + DY QN+ FDDD+
Subjt:  LKRDSLAQILQLTSRNFPEELVEDTLLLNFQVSKGSCGPK------EKFNGIPAQGQDKDDKEKKEGEEGEDEDEDEDDAQSEELTDDDYYQNEYFDDDE

Query:  DDYNMEEEGGDEPEY
        DDYN E++G  E  Y
Subjt:  DDYNMEEEGGDEPEY

AT4G35680.1 Arabidopsis protein of unknown function (DUF241)6.2e-0729.86Show/hide
Query:  HNSRKSSKRGMAFR-GRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFL--NYWKASPFYLEENVMKKVQRTEIEKFSDR
        H+SR+     M+++ GRG+ +G GG    Y K EPF +FPE     +TLP    +  +  L +  + F    +W  SP++L +  + K           +
Subjt:  HNSRKSSKRGMAFR-GRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFL--NYWKASPFYLEENVMKKVQRTEIEKFSDR

Query:  SKSNCTLKRDSLAQILQLTSRNFPEELVEDTLLLNFQVSKGSCGPK------EKFNGIPAQGQDKDDKEKKEGEEGEDEDEDEDDAQSEELTDDDYYQNE
         K++  ++R            NF +ELV DT      V +     +      + F  + ++ + + ++EK++GE    +DE   +++ EE  + DY QN+
Subjt:  SKSNCTLKRDSLAQILQLTSRNFPEELVEDTLLLNFQVSKGSCGPK------EKFNGIPAQGQDKDDKEKKEGEEGEDEDEDEDDAQSEELTDDDYYQNE

Query:  YFDDDEDDYNMEEEGGDEPEY
         FDDDEDDYN EE+GG E  Y
Subjt:  YFDDDEDDYNMEEEGGDEPEY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAACGATCAACTGCTGCAGAAAGAAGGAATACTTCGAAAAATCAGCCGAAACCCATTTCAGATTGCTTTCCCTCTTTCACACTGCCGTCCACCGGCCCCGACTGCCA
CCGCTCGCCTCCATTCTCGCCACCGATCCTAGTCCAACGCTGCCTGACCGCCGCCGCCGTCGCTAACGTCAAAGCCTCTCTTTTCAACTCTCGCGCGATTCTCTCTCTCT
CATACTTCATTCGATCGCCGCCTGTAGCCAGCACTGTGCCTCCGCCGCTTAGGGATAAGTGTTTAAGAAGATACCAGGGTGCAGTGTCGCATTTTGTTAGGAAGCATTTT
TGTTACAGCCATAATTCGAGGAAGTCATCAAAAAGGGGGATGGCATTTAGAGGGCGAGGACGAGGACGAGGTGGCGGCGGTGGGGCCTTTCAGTATGCCAAGCAAGAACC
GTTTGAGCTTTTTCCAGAGGTGGGTAGTTCTAATGTAACTCTACCGAGCGTCAGTGATATGCCTGAAGAAAGAAGCTTGGCTTTACGTAACAACAAGTTTCTGAATTATT
GGAAGGCCTCTCCTTTTTATCTAGAGGAAAATGTTATGAAAAAGGTGCAAAGAACTGAGATAGAGAAATTTTCCGATAGATCCAAGTCGAATTGTACGTTGAAGCGTGAT
TCCCTTGCACAAATTCTACAGCTCACATCCAGGAACTTTCCTGAAGAATTGGTTGAAGATACTCTACTGTTAAATTTTCAGGTTTCAAAGGGAAGTTGCGGACCAAAAGA
AAAGTTCAATGGAATCCCGGCTCAGGGACAGGATAAGGATGATAAGGAGAAGAAAGAAGGAGAAGAAGGTGAAGATGAAGATGAAGACGAAGACGATGCACAGTCCGAGG
AACTTACTGATGATGATTATTATCAGAATGAATACTTCGATGACGATGAAGATGATTACAACATGGAAGAAGAAGGTGGAGATGAACCAGAATATTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAACGATCAACTGCTGCAGAAAGAAGGAATACTTCGAAAAATCAGCCGAAACCCATTTCAGATTGCTTTCCCTCTTTCACACTGCCGTCCACCGGCCCCGACTGCCA
CCGCTCGCCTCCATTCTCGCCACCGATCCTAGTCCAACGCTGCCTGACCGCCGCCGCCGTCGCTAACGTCAAAGCCTCTCTTTTCAACTCTCGCGCGATTCTCTCTCTCT
CATACTTCATTCGATCGCCGCCTGTAGCCAGCACTGTGCCTCCGCCGCTTAGGGATAAGTGTTTAAGAAGATACCAGGGTGCAGTGTCGCATTTTGTTAGGAAGCATTTT
TGTTACAGCCATAATTCGAGGAAGTCATCAAAAAGGGGGATGGCATTTAGAGGGCGAGGACGAGGACGAGGTGGCGGCGGTGGGGCCTTTCAGTATGCCAAGCAAGAACC
GTTTGAGCTTTTTCCAGAGGTGGGTAGTTCTAATGTAACTCTACCGAGCGTCAGTGATATGCCTGAAGAAAGAAGCTTGGCTTTACGTAACAACAAGTTTCTGAATTATT
GGAAGGCCTCTCCTTTTTATCTAGAGGAAAATGTTATGAAAAAGGTGCAAAGAACTGAGATAGAGAAATTTTCCGATAGATCCAAGTCGAATTGTACGTTGAAGCGTGAT
TCCCTTGCACAAATTCTACAGCTCACATCCAGGAACTTTCCTGAAGAATTGGTTGAAGATACTCTACTGTTAAATTTTCAGGTTTCAAAGGGAAGTTGCGGACCAAAAGA
AAAGTTCAATGGAATCCCGGCTCAGGGACAGGATAAGGATGATAAGGAGAAGAAAGAAGGAGAAGAAGGTGAAGATGAAGATGAAGACGAAGACGATGCACAGTCCGAGG
AACTTACTGATGATGATTATTATCAGAATGAATACTTCGATGACGATGAAGATGATTACAACATGGAAGAAGAAGGTGGAGATGAACCAGAATATTAGCAGAATGGAAAG
TAGGTGGTAAGATTTGGGGCTTGTTTCTCCAACTCAAAGAATTTAATTGGTGTAGGGCATTGGATTGGTTAGTCAGGTTTTAAAATTTTTAAATTTTAATTTTTTTTTTT
TTTTTGAGACTGTAAAGTATTCTAATTTCTAAGTAATTAAATTTATGTTTAAATCATATTTTAGTTCTTCAATTTTCATGTTTATTCTTTCTGGTCTATGAAATTTGCAT
AATGGACGATTTGTGTCTAATGTTTTTTTAAAAAAAACTATTTAATGGAAATTTGAAGTGACATGCATTTTGTTTTACTTTATTTATATGAGCAAATTTATGTATGCATT
AATATTTGAAATCATGGGTGTGGTGATGAGTTTTGAAAAATCTATGAAGGCTTCTATTTTAGCAAAAATTAGCAACAAATACAAGTGCAAAATAGATTTTTATGTAGAAT
TTTATGATAAAAGTAGAAAACTTTGGAAGTATTGGAACATAAGCATAATATTCAGAGGTTTAAACAGAAAATATAATGATAATGTTACTAAAATAAATTTATTATCATAA
GTTTGATTATATAATATTGTCATACAGCAAAGGTGGAAAAAAAAATGAAGTATTGAACACGCTGTTGACTTTTTTTAGAAAAAAAGATACTTAGTGATATGTGAACCAAG
CTATTTTTTCTCTAGTTTAGGTCTGTTTTGATAAATTTTTTAACCAAATACTTTCTCTTAGGATTTGTTTGTT
Protein sequenceShow/hide protein sequence
MKRSTAAERRNTSKNQPKPISDCFPSFTLPSTGPDCHRSPPFSPPILVQRCLTAAAVANVKASLFNSRAILSLSYFIRSPPVASTVPPPLRDKCLRRYQGAVSHFVRKHF
CYSHNSRKSSKRGMAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKVQRTEIEKFSDRSKSNCTLKRD
SLAQILQLTSRNFPEELVEDTLLLNFQVSKGSCGPKEKFNGIPAQGQDKDDKEKKEGEEGEDEDEDEDDAQSEELTDDDYYQNEYFDDDEDDYNMEEEGGDEPEY