CuGenDBv2

Sgr022936 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr022936
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Transmembrane protein
Genome location: tig00000729:1138353..1142037
RNA-Seq Expression: Sgr022936
Synteny: Sgr022936
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: NA
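
The genome location above follows a `contig:start..end` pattern. A minimal sketch of how such a string might be split into its parts is shown below; the names `Location` and `parse_location` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'contig:start..end' genome location."""
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into contig, start and end."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    contig, start, end = match.groups()
    return Location(contig, int(start), int(end))


loc = parse_location("tig00000729:1138353..1142037")
print(loc.contig, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the gene spans 3685 positions on contig tig00000729.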


Homology
GenBank top hits (e value, % identity, alignment)
XP_008445046.1 PREDICTED: uncharacterized protein LOC103488205 [Cucumis melo] (e value: 9.8e-87; % identity: 67.12)
Query:  MAANPRETRRRRILERGSDRLALITGQIQSLPSSSASPSPYDESKDSSSQPLISNHQDLQ-PRFSDQPTVSHDKDKLVGSALQHHDPQIDARSSTHYGTS
        MAAN RE RRRRILERGSDRLALITGQIQSLPSSSASP PYD++ DSSSQPLISN QDL+ P  SDQPTVSHDKDK VGS+L H+DPQI  RSST+ GTS
Subjt:  MAANPRETRRRRILERGSDRLALITGQIQSLPSSSASPSPYDESKDSSSQPLISNHQDLQ-PRFSDQPTVSHDKDKLVGSALQHHDPQIDARSSTHYGTS

Query:  SAPLLSKCNPIEIAVASAPEDGGKAQSHLAPSE--DPPL-------------------------------------FSATIAFLVVASYVGFPFLGQSLM
        +APL+ K N IE AVAS PED G+A    +PSE  D PL                                     FS  IAFLVVAS V FPFLGQS+M
Subjt:  SAPLLSKCNPIEIAVASAPEDGGKAQSHLAPSE--DPPL-------------------------------------FSATIAFLVVASYVGFPFLGQSLM

Query:  RIIFRSRPLYLLLLTNVTVVLGRLIFTEQKGFRVSDRGEGQVNPPEGQSSVEQIGKVLEAGLVVQKAMGAIFMDCSVFAVIVVSGLSFVQWL
        R +F  RPLYL+LLTN T+VLGRL+FT+QKGFRVSDR + QVNPPEGQSSVEQIGKVLEA LV QKAMGAI MDCSVFAVIVVSGLS +Q L
Subjt:  RIIFRSRPLYLLLLTNVTVVLGRLIFTEQKGFRVSDRGEGQVNPPEGQSSVEQIGKVLEAGLVVQKAMGAIFMDCSVFAVIVVSGLSFVQWL

XP_022131304.1 uncharacterized protein LOC111004568 [Momordica charantia] (e value: 1.4e-88; % identity: 69.07)
Query:  MAANPRETRRRRILERGSDRLALITGQIQSLPSSSASPSPYDESKDSSSQPLISNHQDLQPRFSDQPTVSHDKDKLVGSALQHHDPQIDARSSTHYGTSS
        MAANPRE+RRRRILERGSDRLALITGQIQSLPS S SPSPY  ++DSS+QPLIS+ QDLQPR SDQ TVS +K KLVGS LQH DPQIDARSS ++GTSS
Subjt:  MAANPRETRRRRILERGSDRLALITGQIQSLPSSSASPSPYDESKDSSSQPLISNHQDLQPRFSDQPTVSHDKDKLVGSALQHHDPQIDARSSTHYGTSS

Query:  APLLSKCNPIEIAVASAPEDGGKAQSHLAPSED------------------PPL---------------------FSATIAFLVVASYVGFPFLGQSLMR
        APL SK   +E AVAS  ED GKA   LA SE                   PP+                     FSATIAFLVVASYVGFPFLGQSL R
Subjt:  APLLSKCNPIEIAVASAPEDGGKAQSHLAPSED------------------PPL---------------------FSATIAFLVVASYVGFPFLGQSLMR

Query:  IIFRSRPLYLLLLTNVTVVLGRLIFTEQKGFRVSDRGEGQVNPPEGQSSVEQIGKVLEAGLVVQKAMGAIFMDCSVFAVIVVSGLSFVQWL
        I F SRPLYLLLLTNVTVVLGRL+FT++KGFR   RG+GQV P  GQSS+EQIGKVLEAGL+VQKAMGAIFMDCSVFAVIVVSGLSFVQ L
Subjt:  IIFRSRPLYLLLLTNVTVVLGRLIFTEQKGFRVSDRGEGQVNPPEGQSSVEQIGKVLEAGLVVQKAMGAIFMDCSVFAVIVVSGLSFVQWL

XP_022996481.1 uncharacterized protein LOC111491715 [Cucurbita maxima] (e value: 8.9e-88; % identity: 68.28)
Query:  MAANPRETRRRRILERGSDRLALITGQIQSLPSSSASPSPYDESKDSSSQPLISNHQDLQPRFSDQPTVSHDKDKLVGSALQHHDPQIDARSSTHYGTSS
        MAAN RE+RRRRILERGS+RLALITGQIQSLPS S SP P DE+ DS  QP ISN QDL+PR SDQPTVSHD DKL+GSALQH DP+I  RSS + G S+
Subjt:  MAANPRETRRRRILERGSDRLALITGQIQSLPSSSASPSPYDESKDSSSQPLISNHQDLQPRFSDQPTVSHDKDKLVGSALQHHDPQIDARSSTHYGTSS

Query:  APLLSKCNPIEIAVASAPEDGGKAQSHLAPSE--DPPL------------------------------------FSATIAFLVVASYVGFPFLGQSLMRI
        APLLSK N IE AVAS P+DGG+A SH+  SE  D  L                                    FSA IAFLVVASYVGFPFLGQSLMRI
Subjt:  APLLSKCNPIEIAVASAPEDGGKAQSHLAPSE--DPPL------------------------------------FSATIAFLVVASYVGFPFLGQSLMRI

Query:  IFRSRPLYLLLLTNVTVVLGRLIFTEQKGFRVSDRGEGQVNPPEGQSSVEQIGKVLEAGLVVQKAMGAIFMDCSVFAVIVVSGLSFVQWL
        +F SRP+YL+LLTN TVVLGRL+F +QKG RVSDRGEGQV PPEGQSS EQIG VLEAGLV QKAMGAIFMD SVFAVIVVSGLSFVQ L
Subjt:  IFRSRPLYLLLLTNVTVVLGRLIFTEQKGFRVSDRGEGQVNPPEGQSSVEQIGKVLEAGLVVQKAMGAIFMDCSVFAVIVVSGLSFVQWL

XP_023545800.1 uncharacterized protein LOC111805127 [Cucurbita pepo subsp. pepo] (e value: 2.0e-87; % identity: 67.7)
Query:  MAANPRETRRRRILERGSDRLALITGQIQSLPSSSASPSPYDESKDSSSQPLISNHQDLQPRFSDQPTVSHDKDKLVGSALQHHDPQIDARSSTHYGTSS
        MAAN RE+RRRRILERGS+RLALITGQIQSLPS S SP P +E+ DS  QP ISN QDL+PR SDQPTVSHD DKL+ S LQH DPQI ARSS + G S 
Subjt:  MAANPRETRRRRILERGSDRLALITGQIQSLPSSSASPSPYDESKDSSSQPLISNHQDLQPRFSDQPTVSHDKDKLVGSALQHHDPQIDARSSTHYGTSS

Query:  APLLSKCNPIEIAVASAPEDGGKAQSHLAPSED------------------PPL---------------------FSATIAFLVVASYVGFPFLGQSLMR
        APLLSK N IE AVAS P+DGG+A S++ PSE                    PL                     FSA IAFLVVASYVGFPFLGQSLMR
Subjt:  APLLSKCNPIEIAVASAPEDGGKAQSHLAPSED------------------PPL---------------------FSATIAFLVVASYVGFPFLGQSLMR

Query:  IIFRSRPLYLLLLTNVTVVLGRLIFTEQKGFRVSDRGEGQVNPPEGQSSVEQIGKVLEAGLVVQKAMGAIFMDCSVFAVIVVSGLSFVQWL
        I+F SRP+YL+LLTN TVVLGRL+FT+QKG RVSDRGEGQV PPEGQ S EQIG VLEAGLV QKAMGAIFMD SVFAVIVVSGLSFVQ L
Subjt:  IIFRSRPLYLLLLTNVTVVLGRLIFTEQKGFRVSDRGEGQVNPPEGQSSVEQIGKVLEAGLVVQKAMGAIFMDCSVFAVIVVSGLSFVQWL

XP_038885578.1 uncharacterized protein LOC120075907 [Benincasa hispida] (e value: 1.2e-92; % identity: 69.18)
Query:  MAANPRETRRRRILERGSDRLALITGQIQSLPSSSASPSPYDESKDSSSQPLISNHQDLQP-RFSDQPTVSHDKDKLVGSALQHHDPQIDARSSTHYGTS
        MAANPRE RRRRILERGSDRLALITGQIQSLPSSSASP  YDE  DSSSQPLISN QDL+P R S QPTVSHDKDKL+GS LQH+DPQI ARSS + GTS
Subjt:  MAANPRETRRRRILERGSDRLALITGQIQSLPSSSASPSPYDESKDSSSQPLISNHQDLQP-RFSDQPTVSHDKDKLVGSALQHHDPQIDARSSTHYGTS

Query:  SAPLLSKCNPIEIAVASAPEDGGKAQSHLAPSE---------------DPPL------------------------FSATIAFLVVASYVGFPFLGQSLM
        + PL  K N IE AVAS PEDGG A SH  PS+                P L                        FSA IAFLVVASYVGFPFLGQS+M
Subjt:  SAPLLSKCNPIEIAVASAPEDGGKAQSHLAPSE---------------DPPL------------------------FSATIAFLVVASYVGFPFLGQSLM

Query:  RIIFRSRPLYLLLLTNVTVVLGRLIFTEQKGFRVSDRGEGQVNPPEGQSSVEQIGKVLEAGLVVQKAMGAIFMDCSVFAVIVVSGLSFVQWL
        R +F S+PLYL+L TN TVVLGRL+FT+QKGFRVSDRG+GQVNPPEGQSSVEQIGKVLEAG+V QKAMGAIFMDCSVFAVI+V GL F+Q L
Subjt:  RIIFRSRPLYLLLLTNVTVVLGRLIFTEQKGFRVSDRGEGQVNPPEGQSSVEQIGKVLEAGLVVQKAMGAIFMDCSVFAVIVVSGLSFVQWL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LP83 Uncharacterized protein (e value: 6.2e-87; % identity: 66.55)
Query:  MAANPRETRRRRILERGSDRLALITGQIQSLP-SSSASPSPYDESKDSSSQPLISNHQDLQ-PRFSDQPTVSHDKDKLVGSALQHHDPQIDARSSTHYGT
        MAAN RE RRRRILERGSDRLALITGQIQSLP SSSASP P+D++ +SSSQPLISN QDL+ P  SDQPTVSHD DK VGS L H+DPQI ARSST+YGT
Subjt:  MAANPRETRRRRILERGSDRLALITGQIQSLP-SSSASPSPYDESKDSSSQPLISNHQDLQ-PRFSDQPTVSHDKDKLVGSALQHHDPQIDARSSTHYGT

Query:  SSAPLLSKCNPIEIAVASAPEDGGKAQS--HLAPSEDPPL-------------------------------------FSATIAFLVVASYVGFPFLGQSL
        S+APLLSK N IE AVAS PED G+A     L+  +D PL                                     FS  IAFLVVA YVGFPFLGQS+
Subjt:  SSAPLLSKCNPIEIAVASAPEDGGKAQS--HLAPSEDPPL-------------------------------------FSATIAFLVVASYVGFPFLGQSL

Query:  MRIIFRSRPLYLLLLTNVTVVLGRLIFTEQKGFRVSDRGEGQVNPPEGQSSVEQIGKVLEAGLVVQKAMGAIFMDCSVFAVIVVSGLSFVQWL
        MRI+F  RPLYL+LLTN T+VLG+L+FT+QKG+RV++RG+GQVNPPE QSSVEQIGKVLEA LV QKAMGAIFMDCSV+AVIVVSGLS VQ L
Subjt:  MRIIFRSRPLYLLLLTNVTVVLGRLIFTEQKGFRVSDRGEGQVNPPEGQSSVEQIGKVLEAGLVVQKAMGAIFMDCSVFAVIVVSGLSFVQWL

A0A1S3BBA5 uncharacterized protein LOC103488205 (e value: 4.8e-87; % identity: 67.12)
Query:  MAANPRETRRRRILERGSDRLALITGQIQSLPSSSASPSPYDESKDSSSQPLISNHQDLQ-PRFSDQPTVSHDKDKLVGSALQHHDPQIDARSSTHYGTS
        MAAN RE RRRRILERGSDRLALITGQIQSLPSSSASP PYD++ DSSSQPLISN QDL+ P  SDQPTVSHDKDK VGS+L H+DPQI  RSST+ GTS
Subjt:  MAANPRETRRRRILERGSDRLALITGQIQSLPSSSASPSPYDESKDSSSQPLISNHQDLQ-PRFSDQPTVSHDKDKLVGSALQHHDPQIDARSSTHYGTS

Query:  SAPLLSKCNPIEIAVASAPEDGGKAQSHLAPSE--DPPL-------------------------------------FSATIAFLVVASYVGFPFLGQSLM
        +APL+ K N IE AVAS PED G+A    +PSE  D PL                                     FS  IAFLVVAS V FPFLGQS+M
Subjt:  SAPLLSKCNPIEIAVASAPEDGGKAQSHLAPSE--DPPL-------------------------------------FSATIAFLVVASYVGFPFLGQSLM

Query:  RIIFRSRPLYLLLLTNVTVVLGRLIFTEQKGFRVSDRGEGQVNPPEGQSSVEQIGKVLEAGLVVQKAMGAIFMDCSVFAVIVVSGLSFVQWL
        R +F  RPLYL+LLTN T+VLGRL+FT+QKGFRVSDR + QVNPPEGQSSVEQIGKVLEA LV QKAMGAI MDCSVFAVIVVSGLS +Q L
Subjt:  RIIFRSRPLYLLLLTNVTVVLGRLIFTEQKGFRVSDRGEGQVNPPEGQSSVEQIGKVLEAGLVVQKAMGAIFMDCSVFAVIVVSGLSFVQWL

A0A6J1BPV3 uncharacterized protein LOC111004568 (e value: 6.6e-89; % identity: 69.07)
Query:  MAANPRETRRRRILERGSDRLALITGQIQSLPSSSASPSPYDESKDSSSQPLISNHQDLQPRFSDQPTVSHDKDKLVGSALQHHDPQIDARSSTHYGTSS
        MAANPRE+RRRRILERGSDRLALITGQIQSLPS S SPSPY  ++DSS+QPLIS+ QDLQPR SDQ TVS +K KLVGS LQH DPQIDARSS ++GTSS
Subjt:  MAANPRETRRRRILERGSDRLALITGQIQSLPSSSASPSPYDESKDSSSQPLISNHQDLQPRFSDQPTVSHDKDKLVGSALQHHDPQIDARSSTHYGTSS

Query:  APLLSKCNPIEIAVASAPEDGGKAQSHLAPSED------------------PPL---------------------FSATIAFLVVASYVGFPFLGQSLMR
        APL SK   +E AVAS  ED GKA   LA SE                   PP+                     FSATIAFLVVASYVGFPFLGQSL R
Subjt:  APLLSKCNPIEIAVASAPEDGGKAQSHLAPSED------------------PPL---------------------FSATIAFLVVASYVGFPFLGQSLMR

Query:  IIFRSRPLYLLLLTNVTVVLGRLIFTEQKGFRVSDRGEGQVNPPEGQSSVEQIGKVLEAGLVVQKAMGAIFMDCSVFAVIVVSGLSFVQWL
        I F SRPLYLLLLTNVTVVLGRL+FT++KGFR   RG+GQV P  GQSS+EQIGKVLEAGL+VQKAMGAIFMDCSVFAVIVVSGLSFVQ L
Subjt:  IIFRSRPLYLLLLTNVTVVLGRLIFTEQKGFRVSDRGEGQVNPPEGQSSVEQIGKVLEAGLVVQKAMGAIFMDCSVFAVIVVSGLSFVQWL

A0A6J1HDN9 uncharacterized protein LOC111462641 (e value: 1.4e-86; % identity: 67.35)
Query:  MAANPRETRRRRILERGSDRLALITGQIQSLPSSSASPSPYDESKDSSSQPLISNHQDLQPRFSDQPTVSHDKDKLVGSALQHHDPQIDARSSTHYGTSS
        MAAN RE+RRRRILERGS+RLALITGQIQSLPS S SP P DE+ +S  QP ISN QDL+PR SDQPTVS D DKL+GS LQ  DPQI ARSS + G S 
Subjt:  MAANPRETRRRRILERGSDRLALITGQIQSLPSSSASPSPYDESKDSSSQPLISNHQDLQPRFSDQPTVSHDKDKLVGSALQHHDPQIDARSSTHYGTSS

Query:  APLLSKCNPIEIAVASAPEDGGKAQSHLAPSED------------------PPL---------------------FSATIAFLVVASYVGFPFLGQSLMR
        APLLSK N IE AVAS P+DGG+A SH+ PSE                    PL                     FSA IAFLVVASYVGFPFLGQSLMR
Subjt:  APLLSKCNPIEIAVASAPEDGGKAQSHLAPSED------------------PPL---------------------FSATIAFLVVASYVGFPFLGQSLMR

Query:  IIFRSRPLYLLLLTNVTVVLGRLIFTEQKGFRVSDRGEGQVNPPEGQSSVEQIGKVLEAGLVVQKAMGAIFMDCSVFAVIVVSGLSFVQWL
        I+F SRP+YL+LLTN TVVLGRL+F +QKG RVSDRGEGQV PPEGQ S EQIG VLEAGLV QKAMGAIFMD SVFAVIVVSGLSFVQ L
Subjt:  IIFRSRPLYLLLLTNVTVVLGRLIFTEQKGFRVSDRGEGQVNPPEGQSSVEQIGKVLEAGLVVQKAMGAIFMDCSVFAVIVVSGLSFVQWL

A0A6J1K4V4 uncharacterized protein LOC111491715 (e value: 4.3e-88; % identity: 68.28)
Query:  MAANPRETRRRRILERGSDRLALITGQIQSLPSSSASPSPYDESKDSSSQPLISNHQDLQPRFSDQPTVSHDKDKLVGSALQHHDPQIDARSSTHYGTSS
        MAAN RE+RRRRILERGS+RLALITGQIQSLPS S SP P DE+ DS  QP ISN QDL+PR SDQPTVSHD DKL+GSALQH DP+I  RSS + G S+
Subjt:  MAANPRETRRRRILERGSDRLALITGQIQSLPSSSASPSPYDESKDSSSQPLISNHQDLQPRFSDQPTVSHDKDKLVGSALQHHDPQIDARSSTHYGTSS

Query:  APLLSKCNPIEIAVASAPEDGGKAQSHLAPSE--DPPL------------------------------------FSATIAFLVVASYVGFPFLGQSLMRI
        APLLSK N IE AVAS P+DGG+A SH+  SE  D  L                                    FSA IAFLVVASYVGFPFLGQSLMRI
Subjt:  APLLSKCNPIEIAVASAPEDGGKAQSHLAPSE--DPPL------------------------------------FSATIAFLVVASYVGFPFLGQSLMRI

Query:  IFRSRPLYLLLLTNVTVVLGRLIFTEQKGFRVSDRGEGQVNPPEGQSSVEQIGKVLEAGLVVQKAMGAIFMDCSVFAVIVVSGLSFVQWL
        +F SRP+YL+LLTN TVVLGRL+F +QKG RVSDRGEGQV PPEGQSS EQIG VLEAGLV QKAMGAIFMD SVFAVIVVSGLSFVQ L
Subjt:  IFRSRPLYLLLLTNVTVVLGRLIFTEQKGFRVSDRGEGQVNPPEGQSSVEQIGKVLEAGLVVQKAMGAIFMDCSVFAVIVVSGLSFVQWL

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G52343.1 unknown protein (e value: 6.5e-12; % identity: 33.33)
Query:  RETRRRRILERGSDRLALITGQIQSL----PSSSASPS-----PYDES---KDSSSQPLISNHQDLQPRFSDQPTVSHDKDKLVGSALQHHDPQIDARSS
        RE RRRRI+ERGSDRLALITGQ+ +L    PSSS+S S      Y ES   +  S    I     L+ +F ++     ++ KL  S + H   +I+    
Subjt:  RETRRRRILERGSDRLALITGQIQSL----PSSSASPS-----PYDES---KDSSSQPLISNHQDLQPRFSDQPTVSHDKDKLVGSALQHHDPQIDARSS

Query:  THYGTSSAPLLSKCNPIEIAVASAPEDGGKAQSHLAPSEDPPLFSATIAFLVVASYVGFPFLGQSLMRIIFRSRPLYLLLLTNVTVVLGRLIFTEQKG--
             S        N   I   S+ +      S ++      L S TIA  VV      P L  +    I   RPL+LL+LT+  +V+  L  TE  G  
Subjt:  THYGTSSAPLLSKCNPIEIAVASAPEDGGKAQSHLAPSEDPPLFSATIAFLVVASYVGFPFLGQSLMRIIFRSRPLYLLLLTNVTVVLGRLIFTEQKG--

Query:  --FRVSDRGEGQ-VNPPEGQSSVEQIGKVLEAGLVVQKAMGAIFMDCSVFAVIVV
            + + G+G+  N  E  S  E   ++LE G+VV +A+  +F+DCS++ V+VV
Subjt:  --FRVSDRGEGQ-VNPPEGQSSVEQIGKVLEAGLVVQKAMGAIFMDCSVFAVIVV

AT4G32680.1 unknown protein (e value: 3.5e-18; % identity: 32.65)
Query:  MAANPRETRRRRILERGSDRLALITGQIQSLPS-----SSASPSPYDESKDSSSQPLI--------------SNHQDLQPRFSDQPTVSHDKDKLVGSAL
        MA+N RE RRR+IL+RGSDRLA ITGQI  +PS     S++S S  D   D S    I              ++HQD     +    V H   +     L
Subjt:  MAANPRETRRRRILERGSDRLALITGQIQSLPS-----SSASPSPYDESKDSSSQPLI--------------SNHQDLQPRFSDQPTVSHDKDKLVGSAL

Query:  Q---HHDPQIDARSSTHYGTSS---APLLSKC-NP--------------IEIAVASAPEDGGKAQSHLAPSEDPPLFSA-TIAFLVVASYVGFPFLGQSL
        Q   H +   +A +S    T++    P  S   NP              +    A  P+  G A   +  SE   +F+A  IA +V+ S++GF  LG   
Subjt:  Q---HHDPQIDARSSTHYGTSS---APLLSKC-NP--------------IEIAVASAPEDGGKAQSHLAPSEDPPLFSA-TIAFLVVASYVGFPFLGQSL

Query:  MRIIFRSRPLYLLLLTNVTVVLGRLIFTEQKGFRVSDRGEGQVNPPEGQSSVEQIGKVLEAGLVVQKAMGAIFMDCSVFAVIVVSGLSFVQ
           I   RP++LL+LT+ T+VLGR++ + +      D          GQ  V+Q+G  LE  ++V+K M A+ MD S++AVI++ GL   Q
Subjt:  MRIIFRSRPLYLLLLTNVTVVLGRLIFTEQKGFRVSDRGEGQVNPPEGQSSVEQIGKVLEAGLVVQKAMGAIFMDCSVFAVIVVSGLSFVQ
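
In the alignment blocks above, the unlabeled row between each Query and Subjt line is the match line: a letter marks an identical residue, `+` a conservative substitution, and a blank a mismatch or gap. As a rough sketch, a percent identity can be estimated from those rows by counting letters and dividing by the number of alignment columns. This assumes the usual BLAST convention; the page does not say how its % identity values were computed, so the result need not agree with them exactly. The function name and the toy input are hypothetical.

```python
from typing import Iterable, Tuple


def percent_identity(blocks: Iterable[Tuple[str, str]]) -> float:
    """Estimate % identity from (query_row, match_row) pairs.

    The alignment length of each block is taken from the query row,
    because trailing blanks in the match row are often stripped.
    """
    identical = 0
    columns = 0
    for query_row, match_row in blocks:
        # Letters in the match row mark identical residues;
        # '+' and blanks are not counted as identities.
        identical += sum(1 for ch in match_row if ch.isalpha())
        columns += len(query_row)
    if columns == 0:
        raise ValueError("no alignment columns supplied")
    return 100.0 * identical / columns


# Toy input for illustration only, not taken from a full alignment.
toy_blocks = [("MAANPRET", "MAAN RE+"), ("RRRR", "RR R")]
print(percent_identity(toy_blocks))
```

The sequence rows must be passed without their `Query:` prefix, and all blocks belonging to one hit should be passed together.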


Sequences
CDS sequence
ATGCAGGAAGTGCAGAAAAAATCAGTGGCTTCTTCTTCTTCTGATCAGTCTACTTTGTCGCTTACTAGCCGTCGCAGCGAAGTCCCAAACCAAACCCATGGACTCTCAAG
ATATTCAGGAAGTTGGCGGCTAAAGATTGATATGCCCATCCCCGAATTTTGGTATGTGCTGGGGAATGGGATAATCTTGCAGAAGGAAACAAAACCGACCCCGATTCCGG
CGGTGCTTGAAGATATGGCGGCGAACCCCAGAGAAACCAGACGGCGGCGGATCCTGGAGCGAGGTTCCGACCGTTTAGCCCTAATCACCGGTCAGATCCAGTCCCTCCCT
TCTTCCTCGGCATCTCCCTCTCCTTACGACGAAAGTAAGGATTCATCATCCCAGCCGTTGATCTCGAATCATCAGGATCTTCAACCTCGTTTTTCGGATCAACCCACCGT
TTCTCATGATAAGGACAAGCTGGTCGGTTCTGCATTGCAACATCATGATCCTCAGATTGATGCTAGATCATCTACACATTATGGAACCAGTTCTGCTCCTCTCTTGAGCA
AATGTAACCCAATTGAAATCGCAGTAGCCTCTGCTCCAGAAGATGGTGGAAAAGCACAGTCCCATCTCGCCCCATCTGAAGACCCGCCTTTGTTTTCGGCTACTATAGCC
TTTCTAGTAGTTGCTTCATATGTAGGATTTCCTTTCTTAGGCCAGAGTCTTATGAGAATTATTTTCAGATCTAGACCACTCTATCTTCTTTTGCTCACTAATGTAACAGT
TGTACTTGGGAGACTTATTTTCACCGAACAGAAGGGTTTTAGAGTATCTGACAGAGGAGAGGGTCAAGTAAATCCACCTGAAGGACAAAGTTCAGTTGAACAAATTGGTA
AGGTTTTAGAGGCAGGTTTAGTGGTTCAGAAGGCAATGGGTGCAATTTTCATGGACTGCAGTGTGTTTGCGGTAATCGTCGTATCGGGACTTTCATTTGTGCAGTGGCTT
TAG
mRNA sequence
ATGCAGGAAGTGCAGAAAAAATCAGTGGCTTCTTCTTCTTCTGATCAGTCTACTTTGTCGCTTACTAGCCGTCGCAGCGAAGTCCCAAACCAAACCCATGGACTCTCAAG
ATATTCAGGAAGTTGGCGGCTAAAGATTGATATGCCCATCCCCGAATTTTGGTATGTGCTGGGGAATGGGATAATCTTGCAGAAGGAAACAAAACCGACCCCGATTCCGG
CGGTGCTTGAAGATATGGCGGCGAACCCCAGAGAAACCAGACGGCGGCGGATCCTGGAGCGAGGTTCCGACCGTTTAGCCCTAATCACCGGTCAGATCCAGTCCCTCCCT
TCTTCCTCGGCATCTCCCTCTCCTTACGACGAAAGTAAGGATTCATCATCCCAGCCGTTGATCTCGAATCATCAGGATCTTCAACCTCGTTTTTCGGATCAACCCACCGT
TTCTCATGATAAGGACAAGCTGGTCGGTTCTGCATTGCAACATCATGATCCTCAGATTGATGCTAGATCATCTACACATTATGGAACCAGTTCTGCTCCTCTCTTGAGCA
AATGTAACCCAATTGAAATCGCAGTAGCCTCTGCTCCAGAAGATGGTGGAAAAGCACAGTCCCATCTCGCCCCATCTGAAGACCCGCCTTTGTTTTCGGCTACTATAGCC
TTTCTAGTAGTTGCTTCATATGTAGGATTTCCTTTCTTAGGCCAGAGTCTTATGAGAATTATTTTCAGATCTAGACCACTCTATCTTCTTTTGCTCACTAATGTAACAGT
TGTACTTGGGAGACTTATTTTCACCGAACAGAAGGGTTTTAGAGTATCTGACAGAGGAGAGGGTCAAGTAAATCCACCTGAAGGACAAAGTTCAGTTGAACAAATTGGTA
AGGTTTTAGAGGCAGGTTTAGTGGTTCAGAAGGCAATGGGTGCAATTTTCATGGACTGCAGTGTGTTTGCGGTAATCGTCGTATCGGGACTTTCATTTGTGCAGTGGCTT
TAG
Protein sequence
MQEVQKKSVASSSSDQSTLSLTSRRSEVPNQTHGLSRYSGSWRLKIDMPIPEFWYVLGNGIILQKETKPTPIPAVLEDMAANPRETRRRRILERGSDRLALITGQIQSLP
SSSASPSPYDESKDSSSQPLISNHQDLQPRFSDQPTVSHDKDKLVGSALQHHDPQIDARSSTHYGTSSAPLLSKCNPIEIAVASAPEDGGKAQSHLAPSEDPPLFSATIA
FLVVASYVGFPFLGQSLMRIIFRSRPLYLLLLTNVTVVLGRLIFTEQKGFRVSDRGEGQVNPPEGQSSVEQIGKVLEAGLVVQKAMGAIFMDCSVFAVIVVSGLSFVQWL
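
The protein sequence corresponds to the CDS read three bases at a time. The sketch below translates a CDS with the standard genetic code; that this gene uses the standard nuclear code is an assumption, and `translate` is a hypothetical helper rather than anything provided by the database. The two inputs are the first seven codons and the last five codons of the CDS shown above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCAGGAAGTGCAGAAAAAA"))  # start of the CDS
print(translate("GTGCAGTGGCTTTAG"))        # end of the CDS, including TAG
```

The outputs `MQEVQKK` and `VQWL` match the first and last residues of the protein sequence listed above.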