CuGenDBv2

Lsi02G029790 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi02G029790
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: DNA-directed RNA polymerase III subunit RPC7-like isoform X1
Genome location: chr02:35882419..35885394
RNA-Seq Expression: Lsi02G029790
Synteny: Lsi02G029790
Gene Ontology terms:
    GO:0006383 - transcription by RNA polymerase III (biological process)
    GO:0005666 - RNA polymerase III complex (cellular component)
    GO:0003899 - DNA-directed 5'-3' RNA polymerase activity (molecular function)
InterPro domains: IPR024661 - DNA-directed RNA polymerase III, subunit Rpc31
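
The genome location field can be split into its parts and used to estimate the length of the gene region. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention); the function name `parse_location` is hypothetical and not part of any database API.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr02:35882419..35885394")

# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene region spans 2976 bp on chr02.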


Homology
GenBank top hits (e value, % identity, alignment)
KAG6575499.1 hypothetical protein SDJN03_26138, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.7e-73; % identity: 71.97)
Query:  CYSHNLSKSSKRLKEFQSSLLICSTISGMAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENV
        C+SHNLSKSS+R KEFQ          GMAFRGRGRGR GGGG+FQYAKQEPFELFPE    NVTLP+VSD+PE + L + N++ LNYWKASPFYLEENV
Subjt:  CYSHNLSKSSKRLKEFQSSLLICSTISGMAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENV

Query:  MKKMQRTEIEKFSDRSKSNSTLKRDSLAQILQLTSRNFPEELVEGMQSQI-SQRGMRVILRVSKGSCGPNEKFNGIPSQGQDKDDKEKKEGEEGEDEDEE
        MKKMQ+TEIE+FSDR+KSNSTLKRDSLAQILQLTSRNFPEELVEG + ++ S+R ++             EK      +GQ+KDDKEKKEGE  EDEDEE
Subjt:  MKKMQRTEIEKFSDRSKSNSTLKRDSLAQILQLTSRNFPEELVEGMQSQI-SQRGMRVILRVSKGSCGPNEKFNGIPSQGQDKDDKEKKEGEEGEDEDEE

Query:  EDDAQSEELTDDDYYQNEYFDDDEDDYNMEEEGGDEPEY
        +DDAQSEELTDDDYYQNEYFDDDEDDYNMEEEGGDEPEY
Subjt:  EDDAQSEELTDDDYYQNEYFDDDEDDYNMEEEGGDEPEY

XP_008462407.1 PREDICTED: DNA-directed RNA polymerase III subunit RPC7-like isoform X1 [Cucumis melo] (e value: 5.1e-70; % identity: 74.88)
Query:  MAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSNSTLKRDSLA
        MAFRGRGRGRGGGGG+FQYAKQEPFELFPE    NVTLPSVS+MPEE +LA+    FL YWKASPFYLEENVMKKMQRTEIEKFSDR K NSTLKRDSLA
Subjt:  MAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSNSTLKRDSLA

Query:  QILQLTSRNFPEELVEGMQSQISQRGMRVILRVSKGSCGPNEKFNGIPS-----QGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDE
        QI+QLTSRNFPEELVEG + +     +R   +V        +K + +       +GQDK+DKEKKEGEEGEDEDEEE+DAQSEELTDDDYYQNEYFDDDE
Subjt:  QILQLTSRNFPEELVEGMQSQISQRGMRVILRVSKGSCGPNEKFNGIPS-----QGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDE

Query:  DDYNMEEEGGDEPEY
        DDYNMEEEGGDEPEY
Subjt:  DDYNMEEEGGDEPEY

XP_022154143.1 ribosomal L1 domain-containing protein CG13096-like [Momordica charantia] (e value: 9.6e-69; % identity: 73.15)
Query:  MAFR-GRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSNSTLKRDSL
        MAFR GRGRGRGGGGGAFQYAKQEPFELFPE    NVTLPSVSD+PEE+ L + N++ LNYWKASPF+LEENV+KKMQRTEIEKFSDRSK NSTLKRDSL
Subjt:  MAFR-GRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSNSTLKRDSL

Query:  AQILQLTSRNFPEELVEGMQSQISQRGMRVILRVSKGSCGPNEKFNGIPS-----QGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDD
        AQILQLTSRNFPEELVEG + ++  +      +V        +K + +       +GQDKDDKEKKEGEEGEDE++EEDDAQSEELTDDDYYQNEYFDDD
Subjt:  AQILQLTSRNFPEELVEGMQSQISQRGMRVILRVSKGSCGPNEKFNGIPS-----QGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDD

Query:  EDDYNMEEEGGDEPEY
        EDDYNME++GGDEP Y
Subjt:  EDDYNMEEEGGDEPEY

XP_038898860.1 glutamic acid-rich protein-like isoform X1 [Benincasa hispida] (e value: 1.1e-72; % identity: 67.19)
Query:  MAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGS-----------------------------------------SNVTLPSVSDMPEERSLALRNNKFLN
        MAFRGRGRGRGGGGGAFQYAKQEPFELFPEV                                            NVTLP VSDMPEE+SLA+RNNKFLN
Subjt:  MAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGS-----------------------------------------SNVTLPSVSDMPEERSLALRNNKFLN

Query:  YWKASPFYLEENVMKKMQRTEIEKFSDRSKSNSTLKRDSLAQILQLTSRNFPEELVEGMQSQISQRGMRVILRVSKGSCGPNEKFNGIPS-----QGQDK
        YWKASPFYLEENVMKKMQRTE+EKFSDRSKSNSTLKRDSLAQILQLTSRNFPEELV+G + +     +R   +V        +K + +       +GQDK
Subjt:  YWKASPFYLEENVMKKMQRTEIEKFSDRSKSNSTLKRDSLAQILQLTSRNFPEELVEGMQSQISQRGMRVILRVSKGSCGPNEKFNGIPS-----QGQDK

Query:  DDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEEEGGDEPEY
        DDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEEEGGDEPEY
Subjt:  DDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEEEGGDEPEY

XP_038898864.1 glutamic acid-rich protein-like isoform X2 [Benincasa hispida] (e value: 7.3e-77; % identity: 79.53)
Query:  MAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSNSTLKRDSLA
        MAFRGRGRGRGGGGGAFQYAKQEPFELFPE    NVTLP VSDMPEE+SLA+RNNKFLNYWKASPFYLEENVMKKMQRTE+EKFSDRSKSNSTLKRDSLA
Subjt:  MAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSNSTLKRDSLA

Query:  QILQLTSRNFPEELVEGMQSQISQRGMRVILRVSKGSCGPNEKFNGIPS-----QGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDE
        QILQLTSRNFPEELV+G + +     +R   +V        +K + +       +GQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDE
Subjt:  QILQLTSRNFPEELVEGMQSQISQRGMRVILRVSKGSCGPNEKFNGIPS-----QGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDE

Query:  DDYNMEEEGGDEPEY
        DDYNMEEEGGDEPEY
Subjt:  DDYNMEEEGGDEPEY

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KCT3 Uncharacterized protein (e value: 1.0e-71; % identity: 65.77)
Query:  FFSGYRGVLSHCARKHFCYSHNLSKSSKRLKEFQSSLLICSTISGMAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNN
        F  GYR +     ++   Y HNLSKSSKR   F+          GMAF  RGRGRGGGGG+FQYAKQEPFELFPE    NVTLPS+ +MPEE +LA+ + 
Subjt:  FFSGYRGVLSHCARKHFCYSHNLSKSSKRLKEFQSSLLICSTISGMAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNN

Query:  KFLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSNSTLKRDSLAQILQLTSRNFPEELVEGMQSQISQRGMRVILRVSKGSCGPNEKFNGIPS-----Q
         F+ YWKASPFYLEENVMKKMQRTEIEKFSDR+K N+TLKRDSLAQI+QLTSRNFPEELVEG + +     +R   +V        +K + +       +
Subjt:  KFLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSNSTLKRDSLAQILQLTSRNFPEELVEGMQSQISQRGMRVILRVSKGSCGPNEKFNGIPS-----Q

Query:  GQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEEEGGDEPEY
        GQDK+DKEKKEGEEGEDEDEEE+DAQSEELTDDDYYQNEYFDDDEDDYNMEEEGGDEPEY
Subjt:  GQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEEEGGDEPEY

A0A1S3CGV4 DNA-directed RNA polymerase III subunit (e value: 2.5e-70; % identity: 74.88)
Query:  MAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSNSTLKRDSLA
        MAFRGRGRGRGGGGG+FQYAKQEPFELFPE    NVTLPSVS+MPEE +LA+    FL YWKASPFYLEENVMKKMQRTEIEKFSDR K NSTLKRDSLA
Subjt:  MAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSNSTLKRDSLA

Query:  QILQLTSRNFPEELVEGMQSQISQRGMRVILRVSKGSCGPNEKFNGIPS-----QGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDE
        QI+QLTSRNFPEELVEG + +     +R   +V        +K + +       +GQDK+DKEKKEGEEGEDEDEEE+DAQSEELTDDDYYQNEYFDDDE
Subjt:  QILQLTSRNFPEELVEGMQSQISQRGMRVILRVSKGSCGPNEKFNGIPS-----QGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDE

Query:  DDYNMEEEGGDEPEY
        DDYNMEEEGGDEPEY
Subjt:  DDYNMEEEGGDEPEY

A0A5D3BVU1 DNA-directed RNA polymerase III subunit RPC7-like isoform X1 (e value: 5.7e-67; % identity: 69.1)
Query:  MAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSNSTLKRDSLA
        MAFRGRGRGRGGGGG+FQYAKQEPFELFPE    NVTLPSVS+MPEE +LA+    FL YWKASPFYLEENVMKKMQRTEIEKFSDR K NSTLKRDSLA
Subjt:  MAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSNSTLKRDSLA

Query:  QILQLTSRNFPEELVEGMQSQISQRGMRVILRVSKGSCGPNEKFNGIPS-----QGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDE
        QI+QLTSRNFPEELVEG + +     +R   +V        +K + +       +GQDK+DKEKKEGEEGEDEDEEE+DAQSEELTDDDYYQNEYFDDDE
Subjt:  QILQLTSRNFPEELVEGMQSQISQRGMRVILRVSKGSCGPNEKFNGIPS-----QGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDE

Query:  DDYNMEEEGG------------------DEPEY
        DDYNMEEEGG                  DEPEY
Subjt:  DDYNMEEEGG------------------DEPEY

A0A6J1DMV9 ribosomal L1 domain-containing protein CG13096-like (e value: 4.6e-69; % identity: 73.15)
Query:  MAFR-GRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSNSTLKRDSL
        MAFR GRGRGRGGGGGAFQYAKQEPFELFPE    NVTLPSVSD+PEE+ L + N++ LNYWKASPF+LEENV+KKMQRTEIEKFSDRSK NSTLKRDSL
Subjt:  MAFR-GRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSNSTLKRDSL

Query:  AQILQLTSRNFPEELVEGMQSQISQRGMRVILRVSKGSCGPNEKFNGIPS-----QGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDD
        AQILQLTSRNFPEELVEG + ++  +      +V        +K + +       +GQDKDDKEKKEGEEGEDE++EEDDAQSEELTDDDYYQNEYFDDD
Subjt:  AQILQLTSRNFPEELVEGMQSQISQRGMRVILRVSKGSCGPNEKFNGIPS-----QGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDD

Query:  EDDYNMEEEGGDEPEY
        EDDYNME++GGDEP Y
Subjt:  EDDYNMEEEGGDEPEY

A0A6J1GNY2 glutamic acid-rich protein-like (e value: 9.7e-67; % identity: 74.41)
Query:  MAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSNSTLKRDSLA
        MAFRGRGRGR GGGG+FQYAKQEPFELFPE    NVTLP+VSD+PE + L + N++ LNYWKASPFYLEENVMKKMQ+TEIE+FSDR+KSNSTLKRDSLA
Subjt:  MAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKMQRTEIEKFSDRSKSNSTLKRDSLA

Query:  QILQLTSRNFPEELVEGMQSQI-SQRGMRVILRVSKGSCGPNEKFNGIPSQGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYN
        QILQLTSRNFPEELVEG + ++ S+R ++             EK      +GQ+KDDKEKKEGE  EDEDEE+DDAQSEELTDDDYYQNEYFDDDEDDYN
Subjt:  QILQLTSRNFPEELVEGMQSQI-SQRGMRVILRVSKGSCGPNEKFNGIPSQGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYN

Query:  MEEEGGDEPEY
        MEEEGGDEPEY
Subjt:  MEEEGGDEPEY

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT4G01590.1 unknown protein (e value: 2.8e-10; % identity: 33.18)
Query:  MAFRG-RGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKMQR--TEIEKFSDRSKSN-STLKR
        M+++G RG+ +G GG    Y K EPF +FPE     +TLP    +  +  L      F  +W+ SP++L +  + K ++    IE++SD  K    + K 
Subjt:  MAFRG-RGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKMQR--TEIEKFSDRSKSN-STLKR

Query:  DSLAQILQLTSRNFPEELVEGMQSQISQRGMRVILRVSKGSCGPNEKFNGIPSQGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDED
         S    L L   NFP+EL+       ++R  R + R         +K +           + K+E EEGED DEE  +++ EE  + DY QN+ FDDD+D
Subjt:  DSLAQILQLTSRNFPEELVEGMQSQISQRGMRVILRVSKGSCGPNEKFNGIPSQGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDED

Query:  DYNMEEEGGDEPEY
        DYN E++G  E  Y
Subjt:  DYNMEEEGGDEPEY

AT4G01590.2 unknown protein (e value: 9.7e-11; % identity: 33.18)
Query:  MAFRG-RGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKMQR--TEIEKFSDRSKSN-STLKR
        M+++G RG+ +G GG    Y K EPF +FPE     +TLP    +  +  L      F  +W+ SP++L +  + K ++    IE++SD  K    + K 
Subjt:  MAFRG-RGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKMQR--TEIEKFSDRSKSN-STLKR

Query:  DSLAQILQLTSRNFPEELVEGMQSQISQRGMRVILRVSKGSCGPNEKFNGIPSQGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDED
         S    L L   NFP+EL+       ++R  R + R         +K +           + K+E EEGED DEE  +++ EE  + DY QN+ FDDD+D
Subjt:  DSLAQILQLTSRNFPEELVEGMQSQISQRGMRVILRVSKGSCGPNEKFNGIPSQGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDED

Query:  DYNMEEEGGDE
        DYN E++G +E
Subjt:  DYNMEEEGGDE

AT4G01590.3 unknown protein (e value: 2.8e-10; % identity: 33.18)
Query:  MAFRG-RGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKMQR--TEIEKFSDRSKSN-STLKR
        M+++G RG+ +G GG    Y K EPF +FPE     +TLP    +  +  L      F  +W+ SP++L +  + K ++    IE++SD  K    + K 
Subjt:  MAFRG-RGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFLNYWKASPFYLEENVMKKMQR--TEIEKFSDRSKSN-STLKR

Query:  DSLAQILQLTSRNFPEELVEGMQSQISQRGMRVILRVSKGSCGPNEKFNGIPSQGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDED
         S    L L   NFP+EL+       ++R  R + R         +K +           + K+E EEGED DEE  +++ EE  + DY QN+ FDDD+D
Subjt:  DSLAQILQLTSRNFPEELVEGMQSQISQRGMRVILRVSKGSCGPNEKFNGIPSQGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDED

Query:  DYNMEEEGGDEPEY
        DYN E++G  E  Y
Subjt:  DYNMEEEGGDEPEY

AT4G35680.1 Arabidopsis protein of unknown function (DUF241) (e value: 3.8e-07; % identity: 29.33)
Query:  GRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFL--NYWKASPFYLEENVMKKMQRTEIEKFSDRSKSNSTLKRDSLAQI
        GRG+ +G GG    Y K EPF +FPE     +TLP    +  +  L +  + F    +W  SP++L +  + K ++  ++           ++R      
Subjt:  GRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFL--NYWKASPFYLEENVMKKMQRTEIEKFSDRSKSNSTLKRDSLAQI

Query:  LQLTSRNFPEELVEGMQSQISQRGMRVILRVSKGSCGPNEKFNGIPSQGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEE
              NF +ELV   + +  QR ++      +      + F  + S+ + + ++EK++GE    +DE+  +++ EE  + DY QN+ FDDDEDDYN EE
Subjt:  LQLTSRNFPEELVEGMQSQISQRGMRVILRVSKGSCGPNEKFNGIPSQGQDKDDKEKKEGEEGEDEDEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEE

Query:  EGGDEPEY
        +GG E  Y
Subjt:  EGGDEPEY


Sequences
CDS sequence
ATGTTGCCGCCTCCATCTTTGTTCTTCTCAGGATACCGGGGGGTACTGTCGCATTGTGCTAGGAAGCATTTTTGTTACAGCCATAATTTGAGCAAGTCATCTAAAAGGTT
GAAGGAATTTCAGAGTAGTTTACTCATTTGTTCTACAATCAGTGGGATGGCATTTAGAGGGCGAGGGCGAGGACGAGGTGGCGGTGGTGGGGCCTTTCAGTATGCCAAGC
AAGAACCCTTCGAGCTTTTCCCAGAGGTGGGTAGTTCTAATGTAACTCTACCAAGCGTCAGTGATATGCCTGAAGAAAGAAGCTTGGCTTTACGTAACAACAAGTTTCTG
AATTATTGGAAGGCCTCTCCTTTTTATCTAGAGGAAAATGTTATGAAAAAGATGCAAAGAACTGAGATAGAGAAATTTTCTGATAGATCCAAGTCGAATAGTACATTGAA
GCGTGATTCCCTCGCACAAATTCTACAGCTCACATCCAGGAACTTTCCTGAAGAATTGGTTGAAGGTATGCAATCTCAGATCTCTCAGAGAGGGATGAGAGTTATACTAA
GAGTTTCAAAGGGAAGTTGCGGACCAAACGAAAAGTTCAATGGAATCCCGAGTCAGGGACAGGATAAGGATGATAAGGAGAAGAAAGAAGGAGAAGAAGGTGAAGATGAA
GACGAAGAAGAAGACGATGCACAATCCGAGGAACTTACCGATGATGATTATTATCAGAATGAATACTTCGACGACGATGAAGACGATTACAACATGGAAGAAGAAGGTGG
AGATGAACCAGAATATTAG
mRNA sequence
ATGTTGCCGCCTCCATCTTTGTTCTTCTCAGGATACCGGGGGGTACTGTCGCATTGTGCTAGGAAGCATTTTTGTTACAGCCATAATTTGAGCAAGTCATCTAAAAGGTT
GAAGGAATTTCAGAGTAGTTTACTCATTTGTTCTACAATCAGTGGGATGGCATTTAGAGGGCGAGGGCGAGGACGAGGTGGCGGTGGTGGGGCCTTTCAGTATGCCAAGC
AAGAACCCTTCGAGCTTTTCCCAGAGGTGGGTAGTTCTAATGTAACTCTACCAAGCGTCAGTGATATGCCTGAAGAAAGAAGCTTGGCTTTACGTAACAACAAGTTTCTG
AATTATTGGAAGGCCTCTCCTTTTTATCTAGAGGAAAATGTTATGAAAAAGATGCAAAGAACTGAGATAGAGAAATTTTCTGATAGATCCAAGTCGAATAGTACATTGAA
GCGTGATTCCCTCGCACAAATTCTACAGCTCACATCCAGGAACTTTCCTGAAGAATTGGTTGAAGGTATGCAATCTCAGATCTCTCAGAGAGGGATGAGAGTTATACTAA
GAGTTTCAAAGGGAAGTTGCGGACCAAACGAAAAGTTCAATGGAATCCCGAGTCAGGGACAGGATAAGGATGATAAGGAGAAGAAAGAAGGAGAAGAAGGTGAAGATGAA
GACGAAGAAGAAGACGATGCACAATCCGAGGAACTTACCGATGATGATTATTATCAGAATGAATACTTCGACGACGATGAAGACGATTACAACATGGAAGAAGAAGGTGG
AGATGAACCAGAATATTAGGCACTATGGAAAGTGGTGGTAAGATTTGGGGCTTGTTTCTCCAACTCAAAGAATTTAACTGGTGTAGGGCATTGGATTGTTTAATCAGGTT
TTAAATTGTTAAATTTTTAATTTTAATTTTTTTGGTTATGAGACTGTAAAGTGTTCTAATTTCTAAATAATTAAATTTAAGTTTAAATCATATTTTAGTTTATTCAATTT
TAATGTTTATTCTTT
Protein sequence
MLPPPSLFFSGYRGVLSHCARKHFCYSHNLSKSSKRLKEFQSSLLICSTISGMAFRGRGRGRGGGGGAFQYAKQEPFELFPEVGSSNVTLPSVSDMPEERSLALRNNKFL
NYWKASPFYLEENVMKKMQRTEIEKFSDRSKSNSTLKRDSLAQILQLTSRNFPEELVEGMQSQISQRGMRVILRVSKGSCGPNEKFNGIPSQGQDKDDKEKKEGEEGEDE
DEEEDDAQSEELTDDDYYQNEYFDDDEDDYNMEEEGGDEPEY
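
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The following is a minimal sketch that assumes the standard nuclear genetic code applies to this gene; the function name `translate` is hypothetical, and only the first and last in-frame codons of the CDS are used as examples rather than the full sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First seven codons of the CDS listed above.
print(translate("ATGTTGCCGCCTCCATCTTTG"))
# Last six codons of the CDS, including the terminal stop codon.
print(translate("GATGAACCAGAATATTAG"))
```

The two fragments translate to `MLPPPSL` and `DEPEY*`, matching the start and end of the protein sequence listed above.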