CuGenDBv2

Spg014620 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg014620
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: DNA-directed RNA polymerase III subunit RPC7-like isoform X1
Genome location: scaffold3:48538920..48546726
RNA-Seq Expression: Spg014620
Synteny: Spg014620
Gene Ontology terms:
  GO:0006383 - transcription by RNA polymerase III (biological process)
  GO:0005666 - RNA polymerase III complex (cellular component)
  GO:0003899 - DNA-directed 5'-3' RNA polymerase activity (molecular function)
InterPro domains: IPR024661 - DNA-directed RNA polymerase III, subunit Rpc31


Homology

GenBank top hits (e value, % identity, alignment)
KAG6575499.1 hypothetical protein SDJN03_26138, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.1e-89; % identity: 89.05)
Query:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLA
        MAFRGRGRGR GGGG+FQYAKQEPFELFPE    NVTLP+VSD+PE K LVICNS+LLNYWKASPFYLEENVMKKMQ+TEIE+FSDR+KS+STLKRDSLA
Subjt:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLA

Query:  QILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNM
        QILQLTSRNFPEELVEGFKGKLR+KRKVQWNPESGL KLDFLEKREES KGQ+KDDKEKKEGE  EDEDEE+DDAQSEELTDDDYYQNEYFDDDEDDYNM
Subjt:  QILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNM

Query:  EDEGGDEPEY
        E+EGGDEPEY
Subjt:  EDEGGDEPEY

XP_022154143.1 ribosomal L1 domain-containing protein CG13096-like [Momordica charantia] (e value: 2.2e-93; % identity: 91.94)
Query:  MAFR-GRGRGRGGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSL
        MAFR GRGRGRGGGGG FQYAKQEPFELFPE    NVTLPSVSDVPEEK LVICNS+LLNYWKASPF+LEENV+KKMQRTEIEKFSDRSK +STLKRDSL
Subjt:  MAFR-GRGRGRGGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSL

Query:  AQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYN
        AQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREES KGQDKDDKEKKEGEEGEDE++EEDDAQSEELTDDDYYQNEYFDDDEDDYN
Subjt:  AQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYN

Query:  MEDEGGDEPEY
        MED+GGDEP Y
Subjt:  MEDEGGDEPEY

XP_022953194.1 glutamic acid-rich protein-like [Cucurbita moschata] (e value: 1.1e-89; % identity: 89.05)
Query:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLA
        MAFRGRGRGR GGGG+FQYAKQEPFELFPE    NVTLP+VSD+PE K LVICNS+LLNYWKASPFYLEENVMKKMQ+TEIE+FSDR+KS+STLKRDSLA
Subjt:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLA

Query:  QILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNM
        QILQLTSRNFPEELVEGFKGKLR+KRKVQWNPESGL KLDFLEKREES KGQ+KDDKEKKEGE  EDEDEE+DDAQSEELTDDDYYQNEYFDDDEDDYNM
Subjt:  QILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNM

Query:  EDEGGDEPEY
        E+EGGDEPEY
Subjt:  EDEGGDEPEY

XP_023548752.1 DNA-directed RNA polymerase III subunit rpc31-like [Cucurbita pepo subsp. pepo] (e value: 2.3e-90; % identity: 90)
Query:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLA
        MAFRGRGRGR GGGG+FQYAKQEPFELFPE    NVTLP+VSD+PE K LVICNS+LLNYWKASPFYLEENVMKKMQRTEIE+FSDR+KS+STLKRDSLA
Subjt:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLA

Query:  QILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNM
        QILQLTSRNFPEELVEGFKGKLR+KRKVQWNPESGL KLDFLEKREES KGQ+KDDKEKKEGE  EDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNM
Subjt:  QILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNM

Query:  EDEGGDEPEY
        E+EGGDEPEY
Subjt:  EDEGGDEPEY

XP_038898864.1 glutamic acid-rich protein-like isoform X2 [Benincasa hispida] (e value: 1.3e-93; % identity: 91.9)
Query:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLA
        MAFRGRGRGRGGGGG FQYAKQEPFELFPE    NVTLP VSD+PEEKSL I N+K LNYWKASPFYLEENVMKKMQRTE+EKFSDRSKS+STLKRDSLA
Subjt:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLA

Query:  QILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNM
        QILQLTSRNFPEELV+GFKGKLR KRKVQWNPESGLQKLDFLEKREES KGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNM
Subjt:  QILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNM

Query:  EDEGGDEPEY
        E+EGGDEPEY
Subjt:  EDEGGDEPEY

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CGV4 DNA-directed RNA polymerase III subunit (e value: 8.9e-88; % identity: 86.67)
Query:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLA
        MAFRGRGRGRGGGGG+FQYAKQEPFELFPE    NVTLPSVS++PEE +L +     L YWKASPFYLEENVMKKMQRTEIEKFSDR K +STLKRDSLA
Subjt:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLA

Query:  QILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNM
        QI+QLTSRNFPEELVEGFKGKLR KRKVQWNPESGL+K+DFLEKREES KGQDK+DKEKKEGEEGEDEDEEE+DAQSEELTDDDYYQNEYFDDDEDDYNM
Subjt:  QILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNM

Query:  EDEGGDEPEY
        E+EGGDEPEY
Subjt:  EDEGGDEPEY

A0A5D3BVU1 DNA-directed RNA polymerase III subunit RPC7-like isoform X1 (e value: 2.0e-84; % identity: 79.82)
Query:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLA
        MAFRGRGRGRGGGGG+FQYAKQEPFELFPE    NVTLPSVS++PEE +L +     L YWKASPFYLEENVMKKMQRTEIEKFSDR K +STLKRDSLA
Subjt:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLA

Query:  QILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNM
        QI+QLTSRNFPEELVEGFKGKLR KRKVQWNPESGL+K+DFLEKREES KGQDK+DKEKKEGEEGEDEDEEE+DAQSEELTDDDYYQNEYFDDDEDDYNM
Subjt:  QILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNM

Query:  EDEGG------------------DEPEY
        E+EGG                  DEPEY
Subjt:  EDEGG------------------DEPEY

A0A6J1DMV9 ribosomal L1 domain-containing protein CG13096-like (e value: 1.1e-93; % identity: 91.94)
Query:  MAFR-GRGRGRGGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSL
        MAFR GRGRGRGGGGG FQYAKQEPFELFPE    NVTLPSVSDVPEEK LVICNS+LLNYWKASPF+LEENV+KKMQRTEIEKFSDRSK +STLKRDSL
Subjt:  MAFR-GRGRGRGGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSL

Query:  AQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYN
        AQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREES KGQDKDDKEKKEGEEGEDE++EEDDAQSEELTDDDYYQNEYFDDDEDDYN
Subjt:  AQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYN

Query:  MEDEGGDEPEY
        MED+GGDEP Y
Subjt:  MEDEGGDEPEY

A0A6J1GNY2 glutamic acid-rich protein-like (e value: 5.6e-90; % identity: 89.05)
Query:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLA
        MAFRGRGRGR GGGG+FQYAKQEPFELFPE    NVTLP+VSD+PE K LVICNS+LLNYWKASPFYLEENVMKKMQ+TEIE+FSDR+KS+STLKRDSLA
Subjt:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLA

Query:  QILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNM
        QILQLTSRNFPEELVEGFKGKLR+KRKVQWNPESGL KLDFLEKREES KGQ+KDDKEKKEGE  EDEDEE+DDAQSEELTDDDYYQNEYFDDDEDDYNM
Subjt:  QILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNM

Query:  EDEGGDEPEY
        E+EGGDEPEY
Subjt:  EDEGGDEPEY

A0A6J1JVL5 DNA-directed RNA polymerase III subunit rpc31-like (e value: 1.2e-89; % identity: 89.05)
Query:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLA
        MAFRGRGRGR GGGG+FQYAKQEPFELFPE    NVTLP+VSD+PE K LVICNS+LLNYWKASPFYLEENVMKKMQ+ EIE+FSDR+KS+STLKRDSLA
Subjt:  MAFRGRGRGRGGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLA

Query:  QILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNM
        QILQLTSRNFPEELVEGFKGKLR+KRKVQWNPESGL KLDFLEKREES KGQ+KDDKEKKEGE  EDEDEEED+AQSEELTDDDYYQNEYFDDDEDDYNM
Subjt:  QILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNM

Query:  EDEGGDEPEY
        EDEGGDEPEY
Subjt:  EDEGGDEPEY

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT4G01590.1 unknown protein (e value: 8.7e-19; % identity: 37.85)
Query:  MAFRG-RGRGRGGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQR--TEIEKFSDRSK-SSSTLKR
        M+++G RG+ +G GG    Y K EPF +FPE     +TLP    +  +  LV        +W+ SP++L +  + K ++    IE++SD  K    + K 
Subjt:  MAFRG-RGRGRGGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQR--TEIEKFSDRSK-SSSTLKR

Query:  DSLAQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDED
         S    L L   NFP+EL+   + + R  ++ +W+ E+ LQKLD  EK E  FK + K++K     EEGED DEE  +++ EE  + DY QN+ FDDD+D
Subjt:  DSLAQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDED

Query:  DYNMEDEGGDEPEY
        DYN ED+G  E  Y
Subjt:  DYNMEDEGGDEPEY

AT4G01590.2 unknown protein (e value: 3.0e-19; % identity: 37.91)
Query:  MAFRG-RGRGRGGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQR--TEIEKFSDRSK-SSSTLKR
        M+++G RG+ +G GG    Y K EPF +FPE     +TLP    +  +  LV        +W+ SP++L +  + K ++    IE++SD  K    + K 
Subjt:  MAFRG-RGRGRGGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQR--TEIEKFSDRSK-SSSTLKR

Query:  DSLAQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDED
         S    L L   NFP+EL+   + + R  ++ +W+ E+ LQKLD  EK E  FK + K++K     EEGED DEE  +++ EE  + DY QN+ FDDD+D
Subjt:  DSLAQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDED

Query:  DYNMEDEGGDE
        DYN ED+G +E
Subjt:  DYNMEDEGGDE

AT4G01590.3 unknown protein (e value: 8.7e-19; % identity: 37.85)
Query:  MAFRG-RGRGRGGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQR--TEIEKFSDRSK-SSSTLKR
        M+++G RG+ +G GG    Y K EPF +FPE     +TLP    +  +  LV        +W+ SP++L +  + K ++    IE++SD  K    + K 
Subjt:  MAFRG-RGRGRGGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQR--TEIEKFSDRSK-SSSTLKR

Query:  DSLAQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDED
         S    L L   NFP+EL+   + + R  ++ +W+ E+ LQKLD  EK E  FK + K++K     EEGED DEE  +++ EE  + DY QN+ FDDD+D
Subjt:  DSLAQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDED

Query:  DYNMEDEGGDEPEY
        DYN ED+G  E  Y
Subjt:  DYNMEDEGGDEPEY

AT4G35680.1 Arabidopsis protein of unknown function (DUF241) (e value: 9.9e-15; % identity: 35.58)
Query:  GRGRGRGGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLL--NYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLAQI
        GRG+ +G GG    Y K EPF +FPE     +TLP    +  +  LV+  S      +W  SP++L +  + K           + K+S  ++R      
Subjt:  GRGRGRGGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLL--NYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLAQI

Query:  LQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMED
              NF +ELV   + + R  ++ +W+ E+ LQKLD  EK E  FK Q  ++K     E+GED DE+  +++ EE  + DY QN+ FDDDEDDYN E+
Subjt:  LQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMED

Query:  EGGDEPEY
        +GG E  Y
Subjt:  EGGDEPEY


Sequences

CDS sequence
ATGAAATCAGCGGGAGAGTTACGCTTTGACGCAGGCCTGATCGCTACACACCCCTTGCCTCTCCCAACCGTATCTCCCTCTCTCGCTCACGCTGAACTAGCCGCCGCACG
TCTGAGTAGCAGCTTCATTCCCGCGCCGCCGCCGCTTGTCCGAGCAGCAGCCTTTATCCACCTTCTTCTTTATCTCTCTCTCTCTCGGCCGCATCTTTCCTTCCCTCTCT
CTCCGTCGCCGGCAACAACTGCTGGCAGCAGCGAATCGCCGGGCATCGCTAGAAGGGCGAACCTGAGAGCGAGGCAAGCTGGGAACGGGCTGAAGCTGACGATTCGAAGA
CAAAGTCCCTTCTCGTCATCGAAGACCCCAGTCGTCAAGCCCCGTGTCTGTTGGGAGAGGGTTTGGTCGCGTCCAGTCCCGGTTCTGCCACTATCCGTTTGGTTTCCGTT
CGCGTTTAAATTCGGGAATTGGCATCATTTAGCCGCTGTCCGGAAGAGGATTGAAGGATTTGGGTTGTGTGAAGGGCGTCCTAGGAGAAATTCCTCTTTAATTGCTTTGG
GTTCAAAGCCAGGTAAGGAGATGGCATTTAGAGGGCGAGGGCGAGGACGAGGTGGCGGTGGTGGGACCTTTCAGTATGCCAAGCAAGAACCCTTCGAGCTCTTCCCGGAG
GTGGGTAGTTCTAATGTAACTCTACCGAGCGTCAGTGATGTGCCTGAAGAAAAAAGCTTGGTTATTTGTAACTCCAAGTTGCTGAATTATTGGAAGGCCTCTCCTTTTTA
TCTAGAGGAGAATGTCATGAAAAAGATGCAAAGAACTGAGATAGAGAAATTTTCCGATAGATCCAAGTCGAGTAGTACTTTGAAACGTGATTCCCTTGCACAAATTCTAC
AGCTCACATCTAGGAACTTTCCTGAAGAATTGGTCGAAGGTTTCAAAGGGAAATTGCGGAACAAACGAAAAGTTCAATGGAATCCCGAGTCAGGGCTGCAAAAATTGGAC
TTTCTGGAGAAGCGTGAAGAATCTTTCAAAGGACAGGATAAGGATGATAAGGAGAAGAAAGAAGGAGAAGAAGGTGAAGACGAAGATGAAGAAGAAGACGATGCACAGTC
CGAGGAACTTACTGACGATGATTATTATCAGAACGAATACTTCGACGATGATGAAGATGATTACAACATGGAAGATGAAGGTGGAGACGAACCAGAATATTAG
mRNA sequence
ATGAAATCAGCGGGAGAGTTACGCTTTGACGCAGGCCTGATCGCTACACACCCCTTGCCTCTCCCAACCGTATCTCCCTCTCTCGCTCACGCTGAACTAGCCGCCGCACG
TCTGAGTAGCAGCTTCATTCCCGCGCCGCCGCCGCTTGTCCGAGCAGCAGCCTTTATCCACCTTCTTCTTTATCTCTCTCTCTCTCGGCCGCATCTTTCCTTCCCTCTCT
CTCCGTCGCCGGCAACAACTGCTGGCAGCAGCGAATCGCCGGGCATCGCTAGAAGGGCGAACCTGAGAGCGAGGCAAGCTGGGAACGGGCTGAAGCTGACGATTCGAAGA
CAAAGTCCCTTCTCGTCATCGAAGACCCCAGTCGTCAAGCCCCGTGTCTGTTGGGAGAGGGTTTGGTCGCGTCCAGTCCCGGTTCTGCCACTATCCGTTTGGTTTCCGTT
CGCGTTTAAATTCGGGAATTGGCATCATTTAGCCGCTGTCCGGAAGAGGATTGAAGGATTTGGGTTGTGTGAAGGGCGTCCTAGGAGAAATTCCTCTTTAATTGCTTTGG
GTTCAAAGCCAGGTAAGGAGATGGCATTTAGAGGGCGAGGGCGAGGACGAGGTGGCGGTGGTGGGACCTTTCAGTATGCCAAGCAAGAACCCTTCGAGCTCTTCCCGGAG
GTGGGTAGTTCTAATGTAACTCTACCGAGCGTCAGTGATGTGCCTGAAGAAAAAAGCTTGGTTATTTGTAACTCCAAGTTGCTGAATTATTGGAAGGCCTCTCCTTTTTA
TCTAGAGGAGAATGTCATGAAAAAGATGCAAAGAACTGAGATAGAGAAATTTTCCGATAGATCCAAGTCGAGTAGTACTTTGAAACGTGATTCCCTTGCACAAATTCTAC
AGCTCACATCTAGGAACTTTCCTGAAGAATTGGTCGAAGGTTTCAAAGGGAAATTGCGGAACAAACGAAAAGTTCAATGGAATCCCGAGTCAGGGCTGCAAAAATTGGAC
TTTCTGGAGAAGCGTGAAGAATCTTTCAAAGGACAGGATAAGGATGATAAGGAGAAGAAAGAAGGAGAAGAAGGTGAAGACGAAGATGAAGAAGAAGACGATGCACAGTC
CGAGGAACTTACTGACGATGATTATTATCAGAACGAATACTTCGACGATGATGAAGATGATTACAACATGGAAGATGAAGGTGGAGACGAACCAGAATATTAG
Protein sequence
MKSAGELRFDAGLIATHPLPLPTVSPSLAHAELAAARLSSSFIPAPPPLVRAAAFIHLLLYLSLSRPHLSFPLSPSPATTAGSSESPGIARRANLRARQAGNGLKLTIRR
QSPFSSSKTPVVKPRVCWERVWSRPVPVLPLSVWFPFAFKFGNWHHLAAVRKRIEGFGLCEGRPRRNSSLIALGSKPGKEMAFRGRGRGRGGGGGTFQYAKQEPFELFPE
VGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLAQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLD
FLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEDEGGDEPEY