CuGenDBv2

Sgr025442 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr025442
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: DNA-directed RNA polymerase II subunit 1-like
Genome location: tig00006406:1356736..1371498
RNA-Seq Expression: Sgr025442
Synteny: Sgr025442
Gene Ontology terms: NA
InterPro domains: NA
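
The genome location field can be split into a contig name and a coordinate pair. The following is a minimal sketch, assuming the `contig:start..end` layout shown in this record and assuming 1-based, inclusive coordinates (the page does not state the convention). The names `Location` and `parse_location` are hypothetical and not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A contig name plus a start/end coordinate pair."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into a Location."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("tig00006406:1356736..1371498")
print(loc.contig, loc.start, loc.end, loc.length)
# prints: tig00006406 1356736 1371498 14763
```

Under the inclusive-coordinate assumption, the gene spans 14,763 bases on contig tig00006406.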


Homology
GenBank top hits (e value, % identity, alignment)
XP_004133795.1 DNA-directed RNA polymerase II subunit RPB1 [Cucumis sativus]; e value: 6.8e-63; % identity: 60.22
Query:  GRSF---YRFSSVNRPVAPGGPTAGQDSAQYDGRQLPSVTRDQSAEPRLHSPPYSPRRES--LPASPTYSVKKTTSPPSSPPYRAPAAAARTISSPPKPA
        GR+F   YRFSS NRP+AP   T+GQDSAQYDGR+ PS  RD S EPR   P  + RR+   LPASPTYS+KK TSPPSSP YRAP  AAR ISSP K  
Subjt:  GRSF---YRFSSVNRPVAPGGPTAGQDSAQYDGRQLPSVTRDQSAEPRLHSPPYSPRRES--LPASPTYSVKKTTSPPSSPPYRAPAAAARTISSPPKPA

Query:  DEYPK-------SPPKVEQKPVIY--KAVEKVAKSDRHPESGKSVSSH--SKQPNAINIAGENLGAVMEIVQSPKREGGHIIKR-KET-RGTHNTASENN
        DEYPK         P+ +QKP I+    VEKV KSDR+ ES K++SSH    QPNAINI GEN+GAVMEIV+S KREGGH+IK+ KET RG  N    NN
Subjt:  DEYPK-------SPPKVEQKPVIY--KAVEKVAKSDRHPESGKSVSSH--SKQPNAINIAGENLGAVMEIVQSPKREGGHIIKR-KET-RGTHNTASENN

Query:  EASKDKAAGQKGKEETSLPASTFLNSNFQSVNNSLLYNASLAHRDPGLHLAFSRKPTGGGFTIDEKKQQ
        + + D+         +S+P +TFLNSNFQSVNNSLLYNA+L HRDPGLHLAFSR PTG    +D+ K+Q
Subjt:  EASKDKAAGQKGKEETSLPASTFLNSNFQSVNNSLLYNASLAHRDPGLHLAFSRKPTGGGFTIDEKKQQ

XP_008437848.1 PREDICTED: uncharacterized protein LOC103483156 [Cucumis melo]; e value: 1.5e-65; % identity: 61.42
Query:  GRSF---YRFSSVNRPVAPGGPTAGQDSAQYDGRQLPSVTRDQSAEPRLHSPPYSPRRES--LPASPTYSVKKTTSPPSSPPYRAPAAAARTISSPPKPA
        GR+F   YRFSS NRP+AP   T+GQDSAQYD RQ PS TRD S EPR   P  S R++   LPASPTYS+KK TSPPSSPPYR    AAR ISSPPK  
Subjt:  GRSF---YRFSSVNRPVAPGGPTAGQDSAQYDGRQLPSVTRDQSAEPRLHSPPYSPRRES--LPASPTYSVKKTTSPPSSPPYRAPAAAARTISSPPKPA

Query:  DEYPK-------SPPKVEQKPVIYK--AVEKVAKSDRHPESGKSVSSH-SKQPNAINIAGENLGAVMEIVQSPKREGGHIIKR-KET-RGTHNTASENNE
        DEYPK         P+ +QKP I+K   VEKV KSDR+ E  K+VSSH  +QPNAINI GEN+GAVMEIV+S KREGGH+IK+ KET RG  N    NN+
Subjt:  DEYPK-------SPPKVEQKPVIYK--AVEKVAKSDRHPESGKSVSSH-SKQPNAINIAGENLGAVMEIVQSPKREGGHIIKR-KET-RGTHNTASENNE

Query:  ASKDKAAGQKGKEETSLPASTFLNSNFQSVNNSLLYNASLAHRDPGLHLAFSRKPTGGGFTIDEKKQ
         + D+         +S+P +TFLNSNFQSVNNSLLYNA+L +RDPGLHL+FSR PTG  F +D+KKQ
Subjt:  ASKDKAAGQKGKEETSLPASTFLNSNFQSVNNSLLYNASLAHRDPGLHLAFSRKPTGGGFTIDEKKQ

XP_022147459.1 serine/arginine repetitive matrix protein 1-like [Momordica charantia]; e value: 8.0e-88; % identity: 70.33
Query:  GRLGRSFYRFSSVNRPVAPGGPTAGQDSAQYDGRQLPSVTRDQSAEPRLHSPPYSPRRESLPASPTYSVKKTTSPPSSPPYRAPAAAARTISSPPKPADE
        GR GRSFYRFSSVNRP+APGG T+ QDSAQYDGRQ PS TRD S EPR  SPP+SPRRES PASPTYSVKK  SPPSSPPYRAP  AART+SSPP+  DE
Subjt:  GRLGRSFYRFSSVNRPVAPGGPTAGQDSAQYDGRQLPSVTRDQSAEPRLHSPPYSPRRESLPASPTYSVKKTTSPPSSPPYRAPAAAARTISSPPKPADE

Query:  YPK-------SPPKVEQKPVIYKAVEK-VAKSDRHPESGKSVSSHSKQ---PNAINIAGENLGAVMEIVQSPKREGGHIIKRKETRG-THNTASENNEAS
        YPK         P+ +QKPVIYKA+EK   KSDR+PE+GK+ SS  +Q   PNAINI+GENLGAVMEIVQSPKREGGHII++KE+RG T ++  +NNE S
Subjt:  YPK-------SPPKVEQKPVIYKAVEK-VAKSDRHPESGKSVSSHSKQ---PNAINIAGENLGAVMEIVQSPKREGGHIIKRKETRG-THNTASENNEAS

Query:  KDKAAGQKGKEET------SLPASTFLNSNFQSVNNSLLYNASLAHRDPGLHLAFSRKPTGGGFTIDEKKQQP
        KD    QKGKE T      SLPA+TFLNSNFQSVNNSLLYNASLAHRDPGLHLAF+R P G  F  D+KK  P
Subjt:  KDKAAGQKGKEET------SLPASTFLNSNFQSVNNSLLYNASLAHRDPGLHLAFSRKPTGGGFTIDEKKQQP

XP_038879892.1 uncharacterized protein At1g10890-like isoform X1 [Benincasa hispida]; e value: 4.1e-68; % identity: 63.33
Query:  GRSF---YRFSSVNRPVAPGGP-TAGQDSAQYDGRQLPSVTRDQSAEPRLHS-PPYSPRRES-LPASPTYSVKKTTSPPSSPPYRAPAAAARTISSPPKP
        GR+F   YRFSS NRP+AP    T+GQDSAQYDGRQ  S TRD S EPR  S PP SPRR+  LPASPTYS+KK TSPP SPPYRAP  AAR ISSPPK 
Subjt:  GRSF---YRFSSVNRPVAPGGP-TAGQDSAQYDGRQLPSVTRDQSAEPRLHS-PPYSPRRES-LPASPTYSVKKTTSPPSSPPYRAPAAAARTISSPPKP

Query:  ADEYPK-------SPPKVEQKPVIYKAVEKVAKSDRHPESGKSVSSHS-KQPNAINIAGENLGAVMEIVQSPKREG-GHIIKRKETR----GTHNTASEN
         DEY K         P+ +QKP I K V+KV KSDRH ES K+VSSH  +QPNAINI G+N+GAVMEIV+S KREG GH+IK+KET        N A++ 
Subjt:  ADEYPK-------SPPKVEQKPVIYKAVEKVAKSDRHPESGKSVSSHS-KQPNAINIAGENLGAVMEIVQSPKREG-GHIIKRKETR----GTHNTASEN

Query:  -NEASKDKAAGQKGKEETSLPASTFLNSNFQSVNNSLLYNASLAHRDPGLHLAFSRKPTGGGFTIDEKKQ
         NEASK           +S+P STFLNSNFQSVNNSLL+NA+LAHRDPGLHLAFS  PTG   T+D+KKQ
Subjt:  -NEASKDKAAGQKGKEETSLPASTFLNSNFQSVNNSLLYNASLAHRDPGLHLAFSRKPTGGGFTIDEKKQ

XP_038879901.1 uncharacterized protein LOC120071611 isoform X2 [Benincasa hispida]; e value: 1.3e-69; % identity: 65.4
Query:  GRSF---YRFSSVNRPVAPGGP-TAGQDSAQYDGRQLPSVTRDQSAEPRLHS-PPYSPRRES-LPASPTYSVKKTTSPPSSPPYRAPAAAARTISSPPKP
        GR+F   YRFSS NRP+AP    T+GQDSAQYDGRQ  S TRD S EPR  S PP SPRR+  LPASPTYS+KK TSPP SPPYRAP  AAR ISSPPK 
Subjt:  GRSF---YRFSSVNRPVAPGGP-TAGQDSAQYDGRQLPSVTRDQSAEPRLHS-PPYSPRRES-LPASPTYSVKKTTSPPSSPPYRAPAAAARTISSPPKP

Query:  ADEYPKSPPKVEQKPVIYKAVEKVAKSDRHPESGKSVSSHS-KQPNAINIAGENLGAVMEIVQSPKREG-GHIIKRKETR----GTHNTASEN-NEASKD
         DEY KS P+ +QKP I K V+KV KSDRH ES K+VSSH  +QPNAINI G+N+GAVMEIV+S KREG GH+IK+KET        N A++  NEASK 
Subjt:  ADEYPKSPPKVEQKPVIYKAVEKVAKSDRHPESGKSVSSHS-KQPNAINIAGENLGAVMEIVQSPKREG-GHIIKRKETR----GTHNTASEN-NEASKD

Query:  KAAGQKGKEETSLPASTFLNSNFQSVNNSLLYNASLAHRDPGLHLAFSRKPTGGGFTIDEKKQ
                  +S+P STFLNSNFQSVNNSLL+NA+LAHRDPGLHLAFS  PTG   T+D+KKQ
Subjt:  KAAGQKGKEETSLPASTFLNSNFQSVNNSLLYNASLAHRDPGLHLAFSRKPTGGGFTIDEKKQ

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L3R5 Uncharacterized protein; e value: 3.3e-63; % identity: 60.22
Query:  GRSF---YRFSSVNRPVAPGGPTAGQDSAQYDGRQLPSVTRDQSAEPRLHSPPYSPRRES--LPASPTYSVKKTTSPPSSPPYRAPAAAARTISSPPKPA
        GR+F   YRFSS NRP+AP   T+GQDSAQYDGR+ PS  RD S EPR   P  + RR+   LPASPTYS+KK TSPPSSP YRAP  AAR ISSP K  
Subjt:  GRSF---YRFSSVNRPVAPGGPTAGQDSAQYDGRQLPSVTRDQSAEPRLHSPPYSPRRES--LPASPTYSVKKTTSPPSSPPYRAPAAAARTISSPPKPA

Query:  DEYPK-------SPPKVEQKPVIY--KAVEKVAKSDRHPESGKSVSSH--SKQPNAINIAGENLGAVMEIVQSPKREGGHIIKR-KET-RGTHNTASENN
        DEYPK         P+ +QKP I+    VEKV KSDR+ ES K++SSH    QPNAINI GEN+GAVMEIV+S KREGGH+IK+ KET RG  N    NN
Subjt:  DEYPK-------SPPKVEQKPVIY--KAVEKVAKSDRHPESGKSVSSH--SKQPNAINIAGENLGAVMEIVQSPKREGGHIIKR-KET-RGTHNTASENN

Query:  EASKDKAAGQKGKEETSLPASTFLNSNFQSVNNSLLYNASLAHRDPGLHLAFSRKPTGGGFTIDEKKQQ
        + + D+         +S+P +TFLNSNFQSVNNSLLYNA+L HRDPGLHLAFSR PTG    +D+ K+Q
Subjt:  EASKDKAAGQKGKEETSLPASTFLNSNFQSVNNSLLYNASLAHRDPGLHLAFSRKPTGGGFTIDEKKQQ

A0A1S3AVL1 uncharacterized protein LOC103483156; e value: 7.1e-66; % identity: 61.42
Query:  GRSF---YRFSSVNRPVAPGGPTAGQDSAQYDGRQLPSVTRDQSAEPRLHSPPYSPRRES--LPASPTYSVKKTTSPPSSPPYRAPAAAARTISSPPKPA
        GR+F   YRFSS NRP+AP   T+GQDSAQYD RQ PS TRD S EPR   P  S R++   LPASPTYS+KK TSPPSSPPYR    AAR ISSPPK  
Subjt:  GRSF---YRFSSVNRPVAPGGPTAGQDSAQYDGRQLPSVTRDQSAEPRLHSPPYSPRRES--LPASPTYSVKKTTSPPSSPPYRAPAAAARTISSPPKPA

Query:  DEYPK-------SPPKVEQKPVIYK--AVEKVAKSDRHPESGKSVSSH-SKQPNAINIAGENLGAVMEIVQSPKREGGHIIKR-KET-RGTHNTASENNE
        DEYPK         P+ +QKP I+K   VEKV KSDR+ E  K+VSSH  +QPNAINI GEN+GAVMEIV+S KREGGH+IK+ KET RG  N    NN+
Subjt:  DEYPK-------SPPKVEQKPVIYK--AVEKVAKSDRHPESGKSVSSH-SKQPNAINIAGENLGAVMEIVQSPKREGGHIIKR-KET-RGTHNTASENNE

Query:  ASKDKAAGQKGKEETSLPASTFLNSNFQSVNNSLLYNASLAHRDPGLHLAFSRKPTGGGFTIDEKKQ
         + D+         +S+P +TFLNSNFQSVNNSLLYNA+L +RDPGLHL+FSR PTG  F +D+KKQ
Subjt:  ASKDKAAGQKGKEETSLPASTFLNSNFQSVNNSLLYNASLAHRDPGLHLAFSRKPTGGGFTIDEKKQ

A0A5A7U5I8 Zyxin-like; e value: 7.1e-66; % identity: 61.42
Query:  GRSF---YRFSSVNRPVAPGGPTAGQDSAQYDGRQLPSVTRDQSAEPRLHSPPYSPRRES--LPASPTYSVKKTTSPPSSPPYRAPAAAARTISSPPKPA
        GR+F   YRFSS NRP+AP   T+GQDSAQYD RQ PS TRD S EPR   P  S R++   LPASPTYS+KK TSPPSSPPYR    AAR ISSPPK  
Subjt:  GRSF---YRFSSVNRPVAPGGPTAGQDSAQYDGRQLPSVTRDQSAEPRLHSPPYSPRRES--LPASPTYSVKKTTSPPSSPPYRAPAAAARTISSPPKPA

Query:  DEYPK-------SPPKVEQKPVIYK--AVEKVAKSDRHPESGKSVSSH-SKQPNAINIAGENLGAVMEIVQSPKREGGHIIKR-KET-RGTHNTASENNE
        DEYPK         P+ +QKP I+K   VEKV KSDR+ E  K+VSSH  +QPNAINI GEN+GAVMEIV+S KREGGH+IK+ KET RG  N    NN+
Subjt:  DEYPK-------SPPKVEQKPVIYK--AVEKVAKSDRHPESGKSVSSH-SKQPNAINIAGENLGAVMEIVQSPKREGGHIIKR-KET-RGTHNTASENNE

Query:  ASKDKAAGQKGKEETSLPASTFLNSNFQSVNNSLLYNASLAHRDPGLHLAFSRKPTGGGFTIDEKKQ
         + D+         +S+P +TFLNSNFQSVNNSLLYNA+L +RDPGLHL+FSR PTG  F +D+KKQ
Subjt:  ASKDKAAGQKGKEETSLPASTFLNSNFQSVNNSLLYNASLAHRDPGLHLAFSRKPTGGGFTIDEKKQ

A0A6J1D126 serine/arginine repetitive matrix protein 1-like; e value: 3.9e-88; % identity: 70.33
Query:  GRLGRSFYRFSSVNRPVAPGGPTAGQDSAQYDGRQLPSVTRDQSAEPRLHSPPYSPRRESLPASPTYSVKKTTSPPSSPPYRAPAAAARTISSPPKPADE
        GR GRSFYRFSSVNRP+APGG T+ QDSAQYDGRQ PS TRD S EPR  SPP+SPRRES PASPTYSVKK  SPPSSPPYRAP  AART+SSPP+  DE
Subjt:  GRLGRSFYRFSSVNRPVAPGGPTAGQDSAQYDGRQLPSVTRDQSAEPRLHSPPYSPRRESLPASPTYSVKKTTSPPSSPPYRAPAAAARTISSPPKPADE

Query:  YPK-------SPPKVEQKPVIYKAVEK-VAKSDRHPESGKSVSSHSKQ---PNAINIAGENLGAVMEIVQSPKREGGHIIKRKETRG-THNTASENNEAS
        YPK         P+ +QKPVIYKA+EK   KSDR+PE+GK+ SS  +Q   PNAINI+GENLGAVMEIVQSPKREGGHII++KE+RG T ++  +NNE S
Subjt:  YPK-------SPPKVEQKPVIYKAVEK-VAKSDRHPESGKSVSSHSKQ---PNAINIAGENLGAVMEIVQSPKREGGHIIKRKETRG-THNTASENNEAS

Query:  KDKAAGQKGKEET------SLPASTFLNSNFQSVNNSLLYNASLAHRDPGLHLAFSRKPTGGGFTIDEKKQQP
        KD    QKGKE T      SLPA+TFLNSNFQSVNNSLLYNASLAHRDPGLHLAF+R P G  F  D+KK  P
Subjt:  KDKAAGQKGKEET------SLPASTFLNSNFQSVNNSLLYNASLAHRDPGLHLAFSRKPTGGGFTIDEKKQQP

A0A6J1INF1 uncharacterized protein LOC111479075 isoform X1; e value: 1.6e-46; % identity: 50.19
Query:  RLGRSFYRFSSVNRPVAPGGPTAGQDSAQYDGRQLPSVTRDQSAEPRLHSPPYSPRRESLPASPTYSVKKTTSPPSSPPYRAPAAAARTISSPPKPADEY
        R GRS YRFSSVNRP AP                     R  S EPR  SP   PR                  P+SPP      ++R I+SPP P  +Y
Subjt:  RLGRSFYRFSSVNRPVAPGGPTAGQDSAQYDGRQLPSVTRDQSAEPRLHSPPYSPRRESLPASPTYSVKKTTSPPSSPPYRAPAAAARTISSPPKPADEY

Query:  PKSPPKVEQKPVIYKAVEKVAKSDRHPESGKSVSSHSKQPNAINIAGENLGAVMEIVQSPKREGGHIIKRKET-RGTHNTASEN-NEASKD-----KAAG
        PKS P+ +QK +++K VEK AKS+R+ +SG+  +  ++QPNAINIAGEN+GAVMEIV+S K EGGH++K+KET RG  + A +  N+  KD     KA  
Subjt:  PKSPPKVEQKPVIYKAVEKVAKSDRHPESGKSVSSHSKQPNAINIAGENLGAVMEIVQSPKREGGHIIKRKET-RGTHNTASEN-NEASKD-----KAAG

Query:  QKGKEETSLPASTFLNSNFQSVNNSLLYNASLAHRDPGLHLAFSRKPTGGGFTIDEKKQ
        QK    +SLP +TFLN+NFQSVNNSLL++ASLAHRDPGLHLAFSR  TG  FT+D+KKQ
Subjt:  QKGKEETSLPASTFLNSNFQSVNNSLLYNASLAHRDPGLHLAFSRKPTGGGFTIDEKKQ

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G63310.1 unknown protein; e value: 7.6e-04; % identity: 33.68
Query:  INIAGENLGAVMEIVQSPKREGGHIIKRKETRGTHNTASENNEASKDKAAGQKGKEETSLPASTFLNSNFQSVNNSLLYNASLAHRDPGLHLAFS
        I ++G NLGA M+                       T  +NN   +D    Q G  E     ST++NSNFQ+VNNS++  A     DPG+HL  S
Subjt:  INIAGENLGAVMEIVQSPKREGGHIIKRKETRGTHNTASENNEASKDKAAGQKGKEETSLPASTFLNSNFQSVNNSLLYNASLAHRDPGLHLAFS

AT2G46630.1 unknown protein; e value: 6.2e-14; % identity: 34.78
Query:  HPESGKSVSSHSKQPNAINIAGENLGAVMEIVQSPK--REGGHII-----------KRKETRGTHNTASENNEASKDKAAGQKGKEETSLPASTFLNSNF
        H +   S S +      I IAGEN GAVMEI++SP+  + GG              K +  + + +++S+  E  K        K  ++LP   F+NSN 
Subjt:  HPESGKSVSSHSKQPNAINIAGENLGAVMEIVQSPK--REGGHII-----------KRKETRGTHNTASENNEASKDKAAGQKGKEETSLPASTFLNSNF

Query:  QSVNNSLLYNASLAHRDPGLHLAFSRKP-TGGGFTIDE
        Q +NNS++YN++ +H DPG+HL  SRKP +  GF + +
Subjt:  QSVNNSLLYNASLAHRDPGLHLAFSRKP-TGGGFTIDE
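
The middle line of each alignment block above follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, assuming that convention holds for this page, the identities and positives in one segment can be tallied as below. The function name `midline_stats` is hypothetical, and the `% identity` values in the hit lines are computed by the database over the full alignment, so a single segment will not reproduce them.

```python
def midline_stats(midline: str, aligned_length: int) -> dict:
    """Tally identities and positives from a BLAST-style midline.

    aligned_length is the number of alignment columns, taken from the
    Query line, because trailing spaces of the midline may be trimmed.
    """
    identities = sum(ch.isalpha() for ch in midline)  # letters = exact matches
    conservative = midline.count("+")                 # '+' = similar residues
    return {
        "identities": identities,
        "positives": identities + conservative,
        "percent_identity": round(100.0 * identities / aligned_length, 2),
    }


# First ten columns of the first GenBank alignment (query "GRSF---YRF").
print(midline_stats("GR+F   YRF", aligned_length=10))
# prints: {'identities': 6, 'positives': 7, 'percent_identity': 60.0}
```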


Sequences
CDS sequence
ATGTGGATATGTAAATCGTGTCGAACACATGCGATTTATAGACATATCGCGGCACACGATTTTATGCCCCGATCTATGGATATATCGCGGCACTCAAGATCGACAAGAAA
TTTATGGAGCAATCGCGGCACAATATCGCATGCCACTACCTCGCACAATATCGCGGCACACGATCTAAAGCGGTTGTCAGATGCCCAAAAGTCGCCTGGCTCTGGCTGCG
CCGCCGCCTCCTTAGGCTTCTTGGCCAAGCAGCTGCCGTTCGCCGTCTCAACTTCTGGTTTCTTCTGCAAATCTGGCGCTGCCACTTTCTCCGACGCCTGCTTTGGCTTC
CCATCAGCATCCGCCTTGAACAGACGGCCATTGATCCTCCAATCCTCAAGAACATCAACGATCTCATCAACCTGCTTTGCTAATCCTTGCCTGTCGGCCTCCTTCGTGAT
AGATTCAGTGTTCTTTCTCAACACATTTCGAGTGGACTCAGCTGTGCAACTTCATATAAGCAAAGAAGCAAAAAAATTGGAGACAAAACCACATGAAGAAATGAAAAGAG
ATGCCGAAAAGGCCCCATGGGTACTATTTGCAGTTGCAAGAATTGGTCGTTTGGGTCGTTCATTTTACCGTTTTTCCTCCGTCAACCGACCCGTCGCCCCCGGCGGCCCC
ACTGCAGGCCAGGATTCGGCTCAGTATGACGGCAGGCAATTGCCTTCAGTCACCAGAGACCAGTCGGCGGAACCCCGCCTTCACTCGCCGCCATATTCACCTAGGAGAGA
GTCTCTTCCTGCTTCACCGACGTATTCCGTCAAGAAAACCACTTCGCCACCTTCTTCTCCGCCCTACAGAGCTCCTGCGGCTGCGGCACGTACGATCAGCTCGCCGCCGA
AGCCTGCAGACGAATACCCCAAGTCACCACCGAAGGTTGAGCAGAAACCTGTGATCTACAAGGCCGTCGAGAAGGTGGCGAAATCCGACCGTCATCCGGAGTCGGGTAAG
TCGGTGTCGTCCCACAGCAAGCAGCCTAATGCGATAAACATCGCCGGAGAAAACCTCGGCGCCGTCATGGAAATCGTTCAGTCCCCCAAACGCGAAGGCGGACATATCAT
CAAGAGGAAGGAGACCAGAGGAACCCACAACACTGCCAGTGAAAACAACGAAGCTTCTAAAGATAAGGCCGCCGGCCAGAAAGGAAAGGAAGAAACTTCACTGCCGGCGA
GCACTTTCTTGAACAGCAACTTCCAGAGCGTGAACAACTCCCTTCTGTACAACGCATCTCTGGCTCACCGCGACCCCGGTCTGCACCTTGCCTTCTCCCGGAAGCCGACC
GGCGGCGGGTTCACCATCGATGAGAAGAAGCAGCAACCGAACATGAAAGATAGAGCAGGCTCAGCTTTTGCTTTCTCTAGCGCCTGGAATCAAGCTTGGGTTCCCCATGG
CTCGTCTTCCTTGAGAAACCAAGATCTTGCCGGCTTCTCGGTGATCTTCTGGAAAAGTATGGTTTTCTGCTCGACCGCCGGTGGGCTCATGGATCGAGTAGTAGGCCCAG
CTACAGGTTTCGCCTCCGTGCGTGGTGGAGATAGTGGTGGGGCCGCGGCGGCGGCGGGGTGTGACAGAGGCGGAGAGGTTTGGGTGGGCTTGGAGGGCAAAGTGGCAGTG
GGTCTATATTTTGGAGAAGAAGGCGGGGAGGCAAGTTTCTTCGGCGGCGGAGAGGGAAGGCGAGAAGATGGAGTAGGTTCTTCAAAAGGTGGGAGAGTTTGGGTGGTCGG
CGGAGCGAACAAGGCGGCTGCCGGAGCAGGGCGGGGAAGGGAAGAGAAACGTTGCCATGTACGACCAAAGCGAGGAAGATTTGCCATGACGAAGAAGGAGACTGTTTGGA
AGATTGTCGTGGCTCTGGTTCGTCACCTTCAACACTGTATAAAAGGCCACAGAAATCGCCCTTTCAACTCCATCTCTCTTTTCTCTTCCATTAATTCTTCTCAACTTCCT
GTAGTTTTCTCTTTCTTCACCATGGCAAATCGCTTGGGTCGTTCATTTTACCGTTTTTCTACAGTCAACCGGCCCACCCCTGCCCCCGCCGCCGGTCCACCCCCAGAACC
GGTTCAGCCTGACTTCCTTCCACCCACCACAGAATCCCTTCCGCCTTCCCGTTCGGTCAAGAAAGTTGCTTCGCCGCCGCCTTATTCACCAGTCGCGAAACCGGCGAAAT
CCGACCGCCTCTCCGACTTCGGTAAGCAGTCCCACAAGCTGCCGGACGACATAAAAATTGCAGGGCAAAACATTGGCTCTGTCATGGAAATCACTCAGTCCGGCGGCAAA
CGGGAAGGTGGACAGATCGTCAAAAAGAAGGACGGGAGAGGAATCCATAACGTCGGCCATGCAAACGATGAAGCTTCGAAGGATAAGAGCAGCCATAATTATAAAGGTGA
CGAAACGACGTCGTTGCCTTCCGACACATTCATGAACACCAACTTCCAGAGCGTCAACAATTCTCTTGTGTACGGCGCTTCTCTCTCTCACCGCGACCCGGGGCTTCACG
TTGCTTTCTCCAAGTCGCCGGCCGGTGATGGATCCACCAAAGCCGACGAGAAGCTCATCAAGAACTGA
mRNA sequence
ATGTGGATATGTAAATCGTGTCGAACACATGCGATTTATAGACATATCGCGGCACACGATTTTATGCCCCGATCTATGGATATATCGCGGCACTCAAGATCGACAAGAAA
TTTATGGAGCAATCGCGGCACAATATCGCATGCCACTACCTCGCACAATATCGCGGCACACGATCTAAAGCGGTTGTCAGATGCCCAAAAGTCGCCTGGCTCTGGCTGCG
CCGCCGCCTCCTTAGGCTTCTTGGCCAAGCAGCTGCCGTTCGCCGTCTCAACTTCTGGTTTCTTCTGCAAATCTGGCGCTGCCACTTTCTCCGACGCCTGCTTTGGCTTC
CCATCAGCATCCGCCTTGAACAGACGGCCATTGATCCTCCAATCCTCAAGAACATCAACGATCTCATCAACCTGCTTTGCTAATCCTTGCCTGTCGGCCTCCTTCGTGAT
AGATTCAGTGTTCTTTCTCAACACATTTCGAGTGGACTCAGCTGTGCAACTTCATATAAGCAAAGAAGCAAAAAAATTGGAGACAAAACCACATGAAGAAATGAAAAGAG
ATGCCGAAAAGGCCCCATGGGTACTATTTGCAGTTGCAAGAATTGGTCGTTTGGGTCGTTCATTTTACCGTTTTTCCTCCGTCAACCGACCCGTCGCCCCCGGCGGCCCC
ACTGCAGGCCAGGATTCGGCTCAGTATGACGGCAGGCAATTGCCTTCAGTCACCAGAGACCAGTCGGCGGAACCCCGCCTTCACTCGCCGCCATATTCACCTAGGAGAGA
GTCTCTTCCTGCTTCACCGACGTATTCCGTCAAGAAAACCACTTCGCCACCTTCTTCTCCGCCCTACAGAGCTCCTGCGGCTGCGGCACGTACGATCAGCTCGCCGCCGA
AGCCTGCAGACGAATACCCCAAGTCACCACCGAAGGTTGAGCAGAAACCTGTGATCTACAAGGCCGTCGAGAAGGTGGCGAAATCCGACCGTCATCCGGAGTCGGGTAAG
TCGGTGTCGTCCCACAGCAAGCAGCCTAATGCGATAAACATCGCCGGAGAAAACCTCGGCGCCGTCATGGAAATCGTTCAGTCCCCCAAACGCGAAGGCGGACATATCAT
CAAGAGGAAGGAGACCAGAGGAACCCACAACACTGCCAGTGAAAACAACGAAGCTTCTAAAGATAAGGCCGCCGGCCAGAAAGGAAAGGAAGAAACTTCACTGCCGGCGA
GCACTTTCTTGAACAGCAACTTCCAGAGCGTGAACAACTCCCTTCTGTACAACGCATCTCTGGCTCACCGCGACCCCGGTCTGCACCTTGCCTTCTCCCGGAAGCCGACC
GGCGGCGGGTTCACCATCGATGAGAAGAAGCAGCAACCGAACATGAAAGATAGAGCAGGCTCAGCTTTTGCTTTCTCTAGCGCCTGGAATCAAGCTTGGGTTCCCCATGG
CTCGTCTTCCTTGAGAAACCAAGATCTTGCCGGCTTCTCGGTGATCTTCTGGAAAAGTATGGTTTTCTGCTCGACCGCCGGTGGGCTCATGGATCGAGTAGTAGGCCCAG
CTACAGGTTTCGCCTCCGTGCGTGGTGGAGATAGTGGTGGGGCCGCGGCGGCGGCGGGGTGTGACAGAGGCGGAGAGGTTTGGGTGGGCTTGGAGGGCAAAGTGGCAGTG
GGTCTATATTTTGGAGAAGAAGGCGGGGAGGCAAGTTTCTTCGGCGGCGGAGAGGGAAGGCGAGAAGATGGAGTAGGTTCTTCAAAAGGTGGGAGAGTTTGGGTGGTCGG
CGGAGCGAACAAGGCGGCTGCCGGAGCAGGGCGGGGAAGGGAAGAGAAACGTTGCCATGTACGACCAAAGCGAGGAAGATTTGCCATGACGAAGAAGGAGACTGTTTGGA
AGATTGTCGTGGCTCTGGTTCGTCACCTTCAACACTGTATAAAAGGCCACAGAAATCGCCCTTTCAACTCCATCTCTCTTTTCTCTTCCATTAATTCTTCTCAACTTCCT
GTAGTTTTCTCTTTCTTCACCATGGCAAATCGCTTGGGTCGTTCATTTTACCGTTTTTCTACAGTCAACCGGCCCACCCCTGCCCCCGCCGCCGGTCCACCCCCAGAACC
GGTTCAGCCTGACTTCCTTCCACCCACCACAGAATCCCTTCCGCCTTCCCGTTCGGTCAAGAAAGTTGCTTCGCCGCCGCCTTATTCACCAGTCGCGAAACCGGCGAAAT
CCGACCGCCTCTCCGACTTCGGTAAGCAGTCCCACAAGCTGCCGGACGACATAAAAATTGCAGGGCAAAACATTGGCTCTGTCATGGAAATCACTCAGTCCGGCGGCAAA
CGGGAAGGTGGACAGATCGTCAAAAAGAAGGACGGGAGAGGAATCCATAACGTCGGCCATGCAAACGATGAAGCTTCGAAGGATAAGAGCAGCCATAATTATAAAGGTGA
CGAAACGACGTCGTTGCCTTCCGACACATTCATGAACACCAACTTCCAGAGCGTCAACAATTCTCTTGTGTACGGCGCTTCTCTCTCTCACCGCGACCCGGGGCTTCACG
TTGCTTTCTCCAAGTCGCCGGCCGGTGATGGATCCACCAAAGCCGACGAGAAGCTCATCAAGAACTGA
Protein sequence
MWICKSCRTHAIYRHIAAHDFMPRSMDISRHSRSTRNLWSNRGTISHATTSHNIAAHDLKRLSDAQKSPGSGCAAASLGFLAKQLPFAVSTSGFFCKSGAATFSDACFGF
PSASALNRRPLILQSSRTSTISSTCFANPCLSASFVIDSVFFLNTFRVDSAVQLHISKEAKKLETKPHEEMKRDAEKAPWVLFAVARIGRLGRSFYRFSSVNRPVAPGGP
TAGQDSAQYDGRQLPSVTRDQSAEPRLHSPPYSPRRESLPASPTYSVKKTTSPPSSPPYRAPAAAARTISSPPKPADEYPKSPPKVEQKPVIYKAVEKVAKSDRHPESGK
SVSSHSKQPNAINIAGENLGAVMEIVQSPKREGGHIIKRKETRGTHNTASENNEASKDKAAGQKGKEETSLPASTFLNSNFQSVNNSLLYNASLAHRDPGLHLAFSRKPT
GGGFTIDEKKQQPNMKDRAGSAFAFSSAWNQAWVPHGSSSLRNQDLAGFSVIFWKSMVFCSTAGGLMDRVVGPATGFASVRGGDSGGAAAAAGCDRGGEVWVGLEGKVAV
GLYFGEEGGEASFFGGGEGRREDGVGSSKGGRVWVVGGANKAAAGAGRGREEKRCHVRPKRGRFAMTKKETVWKIVVALVRHLQHCIKGHRNRPFNSISLFSSINSSQLP
VVFSFFTMANRLGRSFYRFSTVNRPTPAPAAGPPPEPVQPDFLPPTTESLPPSRSVKKVASPPPYSPVAKPAKSDRLSDFGKQSHKLPDDIKIAGQNIGSVMEITQSGGK
REGGQIVKKKDGRGIHNVGHANDEASKDKSSHNYKGDETTSLPSDTFMNTNFQSVNNSLVYGASLSHRDPGLHVAFSKSPAGDGSTKADEKLIKN
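
The CDS and protein sequences above are related by codon translation. The sketch below assumes the standard nuclear genetic code (the record does not name a translation table), and the function name `translate` is hypothetical. It translates codon by codon and stops at the first stop codon, which is enough to spot-check the two ends of the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence up to (not including) the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTGGATATGTAAATCG"))  # first 18 bases of the CDS
# prints: MWICKS
print(translate("CTCATCAAGAACTGA"))     # last 15 bases of the CDS
# prints: LIKN
```

Both fragments agree with the start (`MWICKS`) and end (`LIKN`) of the protein sequence listed above.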