; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0003054 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0003054
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDNA-directed RNA polymerase III subunit RPC7-like isoform X1
Genome locationchr4:47681443..47685963
RNA-Seq ExpressionLag0003054
SyntenyLag0003054
Gene Ontology termsGO:0006383 - transcription by RNA polymerase III (biological process)
GO:0005666 - RNA polymerase III complex (cellular component)
GO:0003899 - DNA-directed 5'-3' RNA polymerase activity (molecular function)
InterPro domainsIPR024661 - DNA-directed RNA polymerase III, subunit Rpc31


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575499.1 hypothetical protein SDJN03_26138, partial [Cucurbita argyrosperma subsp. sororia]1.2e-9188.32Show/hide
Query:  KPGKGMAFRGRGRGRGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKR
        K  +GMAFRGRGRGRGGGG+FQYAKQEPFELFPE    NVTLP+VSD+PE K LVICNS+LLNYWKASPFYLEENVMKKMQ+TEIE+FSDR+KS+STLKR
Subjt:  KPGKGMAFRGRGRGRGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKR

Query:  DSLAQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDED
        DSLAQILQLTSRNFPEELVEGFKGKLR+KRKVQWNPESGL KLDFLEKREES KGQ+KDDKEKKEGE  EDEDEE+DDAQSEELTDDDYYQNEYFDDDED
Subjt:  DSLAQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDED

Query:  DYNMEDEGGDEPEY
        DYNME+EGGDEPEY
Subjt:  DYNMEDEGGDEPEY

XP_022154143.1 ribosomal L1 domain-containing protein CG13096-like [Momordica charantia]2.0e-9191.47Show/hide
Query:  MAFR-GRGRGR-GGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSL
        MAFR GRGRGR GGGG FQYAKQEPFELFPE    NVTLPSVSDVPEEK LVICNS+LLNYWKASPF+LEENV+KKMQRTEIEKFSDRSK +STLKRDSL
Subjt:  MAFR-GRGRGR-GGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSL

Query:  AQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYN
        AQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREES KGQDKDDKEKKEGEEGEDE++EEDDAQSEELTDDDYYQNEYFDDDEDDYN
Subjt:  AQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYN

Query:  MEDEGGDEPEY
        MED+GGDEP Y
Subjt:  MEDEGGDEPEY

XP_022953194.1 glutamic acid-rich protein-like [Cucurbita moschata]4.5e-9189.47Show/hide
Query:  MAFRGRGRGRGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLAQ
        MAFRGRGRGRGGGG+FQYAKQEPFELFPE    NVTLP+VSD+PE K LVICNS+LLNYWKASPFYLEENVMKKMQ+TEIE+FSDR+KS+STLKRDSLAQ
Subjt:  MAFRGRGRGRGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLAQ

Query:  ILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNME
        ILQLTSRNFPEELVEGFKGKLR+KRKVQWNPESGL KLDFLEKREES KGQ+KDDKEKKEGE  EDEDEE+DDAQSEELTDDDYYQNEYFDDDEDDYNME
Subjt:  ILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNME

Query:  DEGGDEPEY
        +EGGDEPEY
Subjt:  DEGGDEPEY

XP_023548752.1 DNA-directed RNA polymerase III subunit rpc31-like [Cucurbita pepo subsp. pepo]9.0e-9290.43Show/hide
Query:  MAFRGRGRGRGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLAQ
        MAFRGRGRGRGGGG+FQYAKQEPFELFPE    NVTLP+VSD+PE K LVICNS+LLNYWKASPFYLEENVMKKMQRTEIE+FSDR+KS+STLKRDSLAQ
Subjt:  MAFRGRGRGRGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLAQ

Query:  ILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNME
        ILQLTSRNFPEELVEGFKGKLR+KRKVQWNPESGL KLDFLEKREES KGQ+KDDKEKKEGE  EDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNME
Subjt:  ILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNME

Query:  DEGGDEPEY
        +EGGDEPEY
Subjt:  DEGGDEPEY

XP_038898864.1 glutamic acid-rich protein-like isoform X2 [Benincasa hispida]1.2e-9191.43Show/hide
Query:  MAFRGRGRGR-GGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLA
        MAFRGRGRGR GGGG FQYAKQEPFELFPE    NVTLP VSD+PEEKSL I N+K LNYWKASPFYLEENVMKKMQRTE+EKFSDRSKS+STLKRDSLA
Subjt:  MAFRGRGRGR-GGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLA

Query:  QILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNM
        QILQLTSRNFPEELV+GFKGKLR KRKVQWNPESGLQKLDFLEKREES KGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNM
Subjt:  QILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNM

Query:  EDEGGDEPEY
        E+EGGDEPEY
Subjt:  EDEGGDEPEY

TrEMBL top hitse value%identityAlignment
A0A0A0KCT3 Uncharacterized protein3.4e-8482.94Show/hide
Query:  KGMAFRGRGRGRGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSL
        +GMAFRGRGRG GGGG+FQYAKQEPFELFPE    NVTLPS+ ++PEE +L + +   + YWKASPFYLEENVMKKMQRTEIEKFSDR+K ++TLKRDSL
Subjt:  KGMAFRGRGRGRGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSL

Query:  AQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYN
        AQI+QLTSRNFPEELVEGFKGKLR KRKVQWNP+SGL+K+D LEKREES KGQDK+DKEKKEGEEGEDEDEEE+DAQSEELTDDDYYQNEYFDDDEDDYN
Subjt:  AQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYN

Query:  MEDEGGDEPEY
        ME+EGGDEPEY
Subjt:  MEDEGGDEPEY

A0A1S3CGV4 DNA-directed RNA polymerase III subunit8.0e-8686.19Show/hide
Query:  MAFRGRGRGR-GGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLA
        MAFRGRGRGR GGGG+FQYAKQEPFELFPE    NVTLPSVS++PEE +L +     L YWKASPFYLEENVMKKMQRTEIEKFSDR K +STLKRDSLA
Subjt:  MAFRGRGRGR-GGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLA

Query:  QILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNM
        QI+QLTSRNFPEELVEGFKGKLR KRKVQWNPESGL+K+DFLEKREES KGQDK+DKEKKEGEEGEDEDEEE+DAQSEELTDDDYYQNEYFDDDEDDYNM
Subjt:  QILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNM

Query:  EDEGGDEPEY
        E+EGGDEPEY
Subjt:  EDEGGDEPEY

A0A6J1DMV9 ribosomal L1 domain-containing protein CG13096-like9.7e-9291.47Show/hide
Query:  MAFR-GRGRGR-GGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSL
        MAFR GRGRGR GGGG FQYAKQEPFELFPE    NVTLPSVSDVPEEK LVICNS+LLNYWKASPF+LEENV+KKMQRTEIEKFSDRSK +STLKRDSL
Subjt:  MAFR-GRGRGR-GGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSL

Query:  AQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYN
        AQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREES KGQDKDDKEKKEGEEGEDE++EEDDAQSEELTDDDYYQNEYFDDDEDDYN
Subjt:  AQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYN

Query:  MEDEGGDEPEY
        MED+GGDEP Y
Subjt:  MEDEGGDEPEY

A0A6J1GNY2 glutamic acid-rich protein-like2.2e-9189.47Show/hide
Query:  MAFRGRGRGRGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLAQ
        MAFRGRGRGRGGGG+FQYAKQEPFELFPE    NVTLP+VSD+PE K LVICNS+LLNYWKASPFYLEENVMKKMQ+TEIE+FSDR+KS+STLKRDSLAQ
Subjt:  MAFRGRGRGRGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLAQ

Query:  ILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNME
        ILQLTSRNFPEELVEGFKGKLR+KRKVQWNPESGL KLDFLEKREES KGQ+KDDKEKKEGE  EDEDEE+DDAQSEELTDDDYYQNEYFDDDEDDYNME
Subjt:  ILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNME

Query:  DEGGDEPEY
        +EGGDEPEY
Subjt:  DEGGDEPEY

A0A6J1JVL5 DNA-directed RNA polymerase III subunit rpc31-like4.8e-9189.47Show/hide
Query:  MAFRGRGRGRGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLAQ
        MAFRGRGRGRGGGG+FQYAKQEPFELFPE    NVTLP+VSD+PE K LVICNS+LLNYWKASPFYLEENVMKKMQ+ EIE+FSDR+KS+STLKRDSLAQ
Subjt:  MAFRGRGRGRGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLAQ

Query:  ILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNME
        ILQLTSRNFPEELVEGFKGKLR+KRKVQWNPESGL KLDFLEKREES KGQ+KDDKEKKEGE  EDEDEEED+AQSEELTDDDYYQNEYFDDDEDDYNME
Subjt:  ILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNME

Query:  DEGGDEPEY
        DEGGDEPEY
Subjt:  DEGGDEPEY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G01590.1 unknown protein6.4e-1938.03Show/hide
Query:  MAFRG-RGRGRGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQR--TEIEKFSDRSK-SSSTLKRD
        M+++G RG+ +G GG   Y K EPF +FPE     +TLP    +  +  LV        +W+ SP++L +  + K ++    IE++SD  K    + K  
Subjt:  MAFRG-RGRGRGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQR--TEIEKFSDRSK-SSSTLKRD

Query:  SLAQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDD
        S    L L   NFP+EL+   + + R  ++ +W+ E+ LQKLD  EK E  FK + K++K     EEGED DEE  +++ EE  + DY QN+ FDDD+DD
Subjt:  SLAQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDD

Query:  YNMEDEGGDEPEY
        YN ED+G  E  Y
Subjt:  YNMEDEGGDEPEY

AT4G01590.2 unknown protein2.2e-1938.1Show/hide
Query:  MAFRG-RGRGRGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQR--TEIEKFSDRSK-SSSTLKRD
        M+++G RG+ +G GG   Y K EPF +FPE     +TLP    +  +  LV        +W+ SP++L +  + K ++    IE++SD  K    + K  
Subjt:  MAFRG-RGRGRGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQR--TEIEKFSDRSK-SSSTLKRD

Query:  SLAQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDD
        S    L L   NFP+EL+   + + R  ++ +W+ E+ LQKLD  EK E  FK + K++K     EEGED DEE  +++ EE  + DY QN+ FDDD+DD
Subjt:  SLAQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDD

Query:  YNMEDEGGDE
        YN ED+G +E
Subjt:  YNMEDEGGDE

AT4G01590.3 unknown protein6.4e-1938.03Show/hide
Query:  MAFRG-RGRGRGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQR--TEIEKFSDRSK-SSSTLKRD
        M+++G RG+ +G GG   Y K EPF +FPE     +TLP    +  +  LV        +W+ SP++L +  + K ++    IE++SD  K    + K  
Subjt:  MAFRG-RGRGRGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLLNYWKASPFYLEENVMKKMQR--TEIEKFSDRSK-SSSTLKRD

Query:  SLAQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDD
        S    L L   NFP+EL+   + + R  ++ +W+ E+ LQKLD  EK E  FK + K++K     EEGED DEE  +++ EE  + DY QN+ FDDD+DD
Subjt:  SLAQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDD

Query:  YNMEDEGGDEPEY
        YN ED+G  E  Y
Subjt:  YNMEDEGGDEPEY

AT4G35680.1 Arabidopsis protein of unknown function (DUF241)1.6e-1435.78Show/hide
Query:  GSKPGKGMAFRGRGRGRGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLL--NYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSS
        GS   KG    GRG+ +G GG   Y K EPF +FPE     +TLP    +  +  LV+  S      +W  SP++L +  + K           + K+S 
Subjt:  GSKPGKGMAFRGRGRGRGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPEEKSLVICNSKLL--NYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSS

Query:  TLKRDSLAQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFD
         ++R            NF +ELV   + + R  ++ +W+ E+ LQKLD  EK E  FK Q  ++K     E+GED DE+  +++ EE  + DY QN+ FD
Subjt:  TLKRDSLAQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFD

Query:  DDEDDYNMEDEGGDEPEY
        DDEDDYN E++GG E  Y
Subjt:  DDEDDYNMEDEGGDEPEY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAACGTGGCTACCTCCACCCCTTGCCTCTCCCAACCGTATCTCTCTCTCTCTCGCTCACGCTGAACTAGCCGCCGCACGTCTGAGTAGCAGCTTCAGTCCCGCGCC
GCCGCCGCTTGTCCGAGCAGCAGCCATTATCCACCTTCTTCTTTATCTCTCTCTCTCTCGGCCGCATCTTTCCTTCCCTCTCTCTCCGTCGCCGGCAACAACTGCTGGCA
GCAGCGAATCGCCGGGCATCGCTAGAAGGGCGAACCTAAGAGCGAGGCAAGCTGGGAACGGGCTGAAGCTGACGATTCGAAGACAAAGTCCCTTCCCGTCATCGAAGACC
CCAGTCGTCACGCCCCGTCCTTGGTCGCGGATTGATGATCTTTTGGGTAAGCTTCAAATCGATTCCTTGGCCATTAAAGTTTTTAGTGATTTTTCTGCCGGTGTCCGGAA
GAGGATTGAAGGATTTGGGTTGTGTGAAGGGCGTCCTAGGAGAAATTCCTCTTTAATTGCTTTGGGTTCAAAGCCAGGTAAGGGGATGGCATTTAGAGGGCGAGGGCGAG
GACGAGGTGGTGGTGGGACCTTTCAGTATGCCAAGCAAGAACCCTTCGAGCTCTTCCCGGAGGTGGGTAGTTCTAATGTAACTCTACCGAGCGTCAGTGATGTGCCTGAA
GAAAAAAGCTTGGTTATTTGTAACTCCAAGTTGCTGAATTATTGGAAGGCCTCTCCTTTTTATCTAGAGGAGAATGTCATGAAAAAGATGCAAAGAACTGAGATAGAGAA
ATTTTCCGATAGATCCAAGTCGAGTAGTACGTTGAAACGTGATTCCCTTGCACAAATTCTACAGCTCACATCTAGGAACTTTCCTGAAGAATTGGTTGAAGGTTTCAAAG
GGAAATTGCGGAACAAACGAAAAGTTCAATGGAATCCCGAGTCAGGGCTGCAAAAATTGGACTTTCTGGAGAAGCGTGAAGAATCTTTCAAAGGACAGGATAAGGATGAT
AAGGAGAAGAAAGAAGGAGAAGAAGGTGAAGACGAAGATGAAGAAGAAGACGACGCACAGTCCGAGGAACTTACGGACGATGATTATTATCAGAACGAATACTTCGACGA
CGACGAAGATGATTACAACATGGAAGATGAAGGTGGAGACGAACCAGAATATTAA
mRNA sequenceShow/hide mRNA sequence
ATGACAACGTGGCTACCTCCACCCCTTGCCTCTCCCAACCGTATCTCTCTCTCTCTCGCTCACGCTGAACTAGCCGCCGCACGTCTGAGTAGCAGCTTCAGTCCCGCGCC
GCCGCCGCTTGTCCGAGCAGCAGCCATTATCCACCTTCTTCTTTATCTCTCTCTCTCTCGGCCGCATCTTTCCTTCCCTCTCTCTCCGTCGCCGGCAACAACTGCTGGCA
GCAGCGAATCGCCGGGCATCGCTAGAAGGGCGAACCTAAGAGCGAGGCAAGCTGGGAACGGGCTGAAGCTGACGATTCGAAGACAAAGTCCCTTCCCGTCATCGAAGACC
CCAGTCGTCACGCCCCGTCCTTGGTCGCGGATTGATGATCTTTTGGGTAAGCTTCAAATCGATTCCTTGGCCATTAAAGTTTTTAGTGATTTTTCTGCCGGTGTCCGGAA
GAGGATTGAAGGATTTGGGTTGTGTGAAGGGCGTCCTAGGAGAAATTCCTCTTTAATTGCTTTGGGTTCAAAGCCAGGTAAGGGGATGGCATTTAGAGGGCGAGGGCGAG
GACGAGGTGGTGGTGGGACCTTTCAGTATGCCAAGCAAGAACCCTTCGAGCTCTTCCCGGAGGTGGGTAGTTCTAATGTAACTCTACCGAGCGTCAGTGATGTGCCTGAA
GAAAAAAGCTTGGTTATTTGTAACTCCAAGTTGCTGAATTATTGGAAGGCCTCTCCTTTTTATCTAGAGGAGAATGTCATGAAAAAGATGCAAAGAACTGAGATAGAGAA
ATTTTCCGATAGATCCAAGTCGAGTAGTACGTTGAAACGTGATTCCCTTGCACAAATTCTACAGCTCACATCTAGGAACTTTCCTGAAGAATTGGTTGAAGGTTTCAAAG
GGAAATTGCGGAACAAACGAAAAGTTCAATGGAATCCCGAGTCAGGGCTGCAAAAATTGGACTTTCTGGAGAAGCGTGAAGAATCTTTCAAAGGACAGGATAAGGATGAT
AAGGAGAAGAAAGAAGGAGAAGAAGGTGAAGACGAAGATGAAGAAGAAGACGACGCACAGTCCGAGGAACTTACGGACGATGATTATTATCAGAACGAATACTTCGACGA
CGACGAAGATGATTACAACATGGAAGATGAAGGTGGAGACGAACCAGAATATTAA
Protein sequenceShow/hide protein sequence
MTTWLPPPLASPNRISLSLAHAELAAARLSSSFSPAPPPLVRAAAIIHLLLYLSLSRPHLSFPLSPSPATTAGSSESPGIARRANLRARQAGNGLKLTIRRQSPFPSSKT
PVVTPRPWSRIDDLLGKLQIDSLAIKVFSDFSAGVRKRIEGFGLCEGRPRRNSSLIALGSKPGKGMAFRGRGRGRGGGGTFQYAKQEPFELFPEVGSSNVTLPSVSDVPE
EKSLVICNSKLLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSSSTLKRDSLAQILQLTSRNFPEELVEGFKGKLRNKRKVQWNPESGLQKLDFLEKREESFKGQDKDD
KEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEDEGGDEPEY