; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr014414 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr014414
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionElongation factor P
Genome locationtig00000589:171374..181109
RNA-Seq ExpressionSgr014414
SyntenySgr014414
Gene Ontology termsGO:0006414 - translational elongation (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0003746 - translation elongation factor activity (molecular function)
InterPro domainsIPR001059 - Translation elongation factor P/YeiP, central
IPR012340 - Nucleic acid-binding, OB-fold
IPR020599 - Translation elongation factor P/YeiP


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7027859.1 efp, partial [Cucurbita argyrosperma subsp. argyrosperma]1.5e-6561.04Show/hide
Query:  VAQRTPSVRFPL-GRNSTEGSEKPLTVALTPQEMAHIVFCNASSASFFLGCLSSSTTSLSMPSKPSVAQIRRFSSTSLRPGFSSNPSILALEIFISTRIY
        + +R P++      R   E SEKPLTVA T  +MA  + CNAS +S FL   SSS  SLS+ SK SV + RRFSS++     +S    L        RIY
Subjt:  VAQRTPSVRFPL-GRNSTEGSEKPLTVALTPQEMAHIVFCNASSASFFLGCLSSSTTSLSMPSKPSVAQIRRFSSTSLRPGFSSNPSILALEIFISTRIY

Query:  ALSSNDIKVGTNIEVDGAPWR--------------DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGD
        AL+SNDIKVGTNIEVDGAPWR                +T   N +      +   +   LDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGD
Subjt:  ALSSNDIKVGTNIEVDGAPWR--------------DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGD

Query:  RTKWLKEGMDCIVLFWNGKVIDFEVPITIQLTVVDVDPGLKGDTAQGRS
        RTKWLKEGMDCIVLFWNGKVIDFEVP TIQLTVVDVDPGLKGDTAQG S
Subjt:  RTKWLKEGMDCIVLFWNGKVIDFEVPITIQLTVVDVDPGLKGDTAQGRS

XP_022157330.1 uncharacterized protein LOC111024058 [Momordica charantia]2.7e-6767.59Show/hide
Query:  MAHIVFCNASSASFFLGCLSSSTTSLSMPSKPSVAQIRRFSSTSLRPGFSSNPSILALEIFISTRIYALSSNDIKVGTNIEVDGAPWR------------
        MA IV C+ASSAS FLG  SSSTTSLS+P KPS+  IR+ S  S RPGFS              RIYALSSNDIKVGTN+EVDGAPWR            
Subjt:  MAHIVFCNASSASFFLGCLSSSTTSLSMPSKPSVAQIRRFSSTSLRPGFSSNPSILALEIFISTRIYALSSNDIKVGTNIEVDGAPWR------------

Query:  --DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDCIVLFWNGKVIDFEVPITIQLTV
            +T   N +      +   +   LDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDCIVLFWNGKVIDFEVPIT+QLTV
Subjt:  --DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDCIVLFWNGKVIDFEVPITIQLTV

Query:  VDVDPGLKGDTAQGRS
        V+VDPGLKGDTAQG S
Subjt:  VDVDPGLKGDTAQGRS

XP_022971212.1 uncharacterized protein LOC111470003 [Cucurbita maxima]5.2e-6365.28Show/hide
Query:  MAHIVFCNASSASFFLGCLSSSTTSLSMPSKPSVAQIRRFSSTSLRPGFSSNPSILALEIFISTRIYALSSNDIKVGTNIEVDGAPWR------------
        MA  + CNAS +S FL   SSS TSLS+ SKPSV + RRFSS + R  FS              RIYAL+SNDIKVGTNIEVDG PWR            
Subjt:  MAHIVFCNASSASFFLGCLSSSTTSLSMPSKPSVAQIRRFSSTSLRPGFSSNPSILALEIFISTRIYALSSNDIKVGTNIEVDGAPWR------------

Query:  --DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDCIVLFWNGKVIDFEVPITIQLTV
            +T   N +      +   +   LDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDCIVLFWNGKVIDFEVP TIQLTV
Subjt:  --DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDCIVLFWNGKVIDFEVPITIQLTV

Query:  VDVDPGLKGDTAQGRS
        VDVDPGLKGDTAQG S
Subjt:  VDVDPGLKGDTAQGRS

XP_038904606.1 elongation factor P isoform X1 [Benincasa hispida]9.5e-6567.13Show/hide
Query:  MAHIVFCNASSASFFLGCLSSSTTSLSMPSKPSVAQIRRFSSTSLRPGFSSNPSILALEIFISTRIYALSSNDIKVGTNIEVDGAPWR------------
        MA I+ CNASSAS FL   SS  TSL +P KPSV  IRR SS S R GF               RIYALSSNDIKVGTNIEVDGAPWR            
Subjt:  MAHIVFCNASSASFFLGCLSSSTTSLSMPSKPSVAQIRRFSSTSLRPGFSSNPSILALEIFISTRIYALSSNDIKVGTNIEVDGAPWR------------

Query:  --DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDCIVLFWNGKVIDFEVPITIQLTV
            +T   N +      +   +   L+EANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDCIVLFWNGKVIDFEVPITIQLTV
Subjt:  --DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDCIVLFWNGKVIDFEVPITIQLTV

Query:  VDVDPGLKGDTAQGRS
        VDVDPGLKGDTAQG S
Subjt:  VDVDPGLKGDTAQGRS

XP_038904607.1 elongation factor P isoform X2 [Benincasa hispida]8.9e-6365.74Show/hide
Query:  MAHIVFCNASSASFFLGCLSSSTTSLSMPSKPSVAQIRRFSSTSLRPGFSSNPSILALEIFISTRIYALSSNDIKVGTNIEVDGAPWR------------
        MA I+ CNASSAS FL   SS  TSL +P KPSV  IRR SS                    S RIYALSSNDIKVGTNIEVDGAPWR            
Subjt:  MAHIVFCNASSASFFLGCLSSSTTSLSMPSKPSVAQIRRFSSTSLRPGFSSNPSILALEIFISTRIYALSSNDIKVGTNIEVDGAPWR------------

Query:  --DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDCIVLFWNGKVIDFEVPITIQLTV
            +T   N +      +   +   L+EANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDCIVLFWNGKVIDFEVPITIQLTV
Subjt:  --DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDCIVLFWNGKVIDFEVPITIQLTV

Query:  VDVDPGLKGDTAQGRS
        VDVDPGLKGDTAQG S
Subjt:  VDVDPGLKGDTAQGRS

TrEMBL top hitse value%identityAlignment
A0A0A0LD89 Uncharacterized protein1.1e-6165.28Show/hide
Query:  MAHIVFCNASSASFFLGCLSSSTTSLSMPSKPSVAQIRRFSSTSLRPGFSSNPSILALEIFISTRIYALSSNDIKVGTNIEVDGAPWR------------
        MA  + CNASSAS FL   SSS TSLS+P KP    +R FS +S R GF               RIYAL+SNDIKVGTNIEVDGAPWR            
Subjt:  MAHIVFCNASSASFFLGCLSSSTTSLSMPSKPSVAQIRRFSSTSLRPGFSSNPSILALEIFISTRIYALSSNDIKVGTNIEVDGAPWR------------

Query:  --DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDCIVLFWNGKVIDFEVPITIQLTV
            +T   N +      +   +   L+EANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDCIVLFWNGKVIDFEVPITIQLTV
Subjt:  --DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDCIVLFWNGKVIDFEVPITIQLTV

Query:  VDVDPGLKGDTAQGRS
        VDVDPGLKGDTAQG S
Subjt:  VDVDPGLKGDTAQGRS

A0A5A7TQ46 Elongation factor P1.6e-6264.49Show/hide
Query:  MAHIVFCNASSASFFLGCLSSSTTSLSMPSKPSVAQIRRFSSTSLRPGFSSNPSILALEIFISTRIYALSSNDIKVGTNIEVDGAPWR------------
        MA  + CNAS  S FL   SSS TSLS+P K  V +  R SS S R GF S P           RIYAL+SNDIKVGTN+EVDGAPWR            
Subjt:  MAHIVFCNASSASFFLGCLSSSTTSLSMPSKPSVAQIRRFSSTSLRPGFSSNPSILALEIFISTRIYALSSNDIKVGTNIEVDGAPWR------------

Query:  --DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDCIVLFWNGKVIDFEVPITIQLTV
            +T   N +      +   +   L+EANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDC+VLFWNGKVIDFEVPITIQLTV
Subjt:  --DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDCIVLFWNGKVIDFEVPITIQLTV

Query:  VDVDPGLKGDTAQG
        VDVDPGLKGDTAQG
Subjt:  VDVDPGLKGDTAQG

A0A6J1DSR9 uncharacterized protein LOC1110240581.3e-6767.59Show/hide
Query:  MAHIVFCNASSASFFLGCLSSSTTSLSMPSKPSVAQIRRFSSTSLRPGFSSNPSILALEIFISTRIYALSSNDIKVGTNIEVDGAPWR------------
        MA IV C+ASSAS FLG  SSSTTSLS+P KPS+  IR+ S  S RPGFS              RIYALSSNDIKVGTN+EVDGAPWR            
Subjt:  MAHIVFCNASSASFFLGCLSSSTTSLSMPSKPSVAQIRRFSSTSLRPGFSSNPSILALEIFISTRIYALSSNDIKVGTNIEVDGAPWR------------

Query:  --DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDCIVLFWNGKVIDFEVPITIQLTV
            +T   N +      +   +   LDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDCIVLFWNGKVIDFEVPIT+QLTV
Subjt:  --DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDCIVLFWNGKVIDFEVPITIQLTV

Query:  VDVDPGLKGDTAQGRS
        V+VDPGLKGDTAQG S
Subjt:  VDVDPGLKGDTAQGRS

A0A6J1FIJ6 uncharacterized protein LOC1114442511.2e-6265.28Show/hide
Query:  MAHIVFCNASSASFFLGCLSSSTTSLSMPSKPSVAQIRRFSSTSLRPGFSSNPSILALEIFISTRIYALSSNDIKVGTNIEVDGAPWR------------
        MA  + CNAS +S FL   SSST SLS+ SKPSV + RRFSS++     +S    L        RIYAL+SNDIKVGTNIEVDGAPWR            
Subjt:  MAHIVFCNASSASFFLGCLSSSTTSLSMPSKPSVAQIRRFSSTSLRPGFSSNPSILALEIFISTRIYALSSNDIKVGTNIEVDGAPWR------------

Query:  --DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDCIVLFWNGKVIDFEVPITIQLTV
            +T   N +      +   +   LDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDCIVLFWNGKVIDFEVP TIQLTV
Subjt:  --DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDCIVLFWNGKVIDFEVPITIQLTV

Query:  VDVDPGLKGDTAQGRS
        VDVDPGLKGDTAQG S
Subjt:  VDVDPGLKGDTAQGRS

A0A6J1I1D0 uncharacterized protein LOC1114700032.5e-6365.28Show/hide
Query:  MAHIVFCNASSASFFLGCLSSSTTSLSMPSKPSVAQIRRFSSTSLRPGFSSNPSILALEIFISTRIYALSSNDIKVGTNIEVDGAPWR------------
        MA  + CNAS +S FL   SSS TSLS+ SKPSV + RRFSS + R  FS              RIYAL+SNDIKVGTNIEVDG PWR            
Subjt:  MAHIVFCNASSASFFLGCLSSSTTSLSMPSKPSVAQIRRFSSTSLRPGFSSNPSILALEIFISTRIYALSSNDIKVGTNIEVDGAPWR------------

Query:  --DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDCIVLFWNGKVIDFEVPITIQLTV
            +T   N +      +   +   LDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDCIVLFWNGKVIDFEVP TIQLTV
Subjt:  --DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDCIVLFWNGKVIDFEVPITIQLTV

Query:  VDVDPGLKGDTAQGRS
        VDVDPGLKGDTAQG S
Subjt:  VDVDPGLKGDTAQGRS

SwissProt top hitse value%identityAlignment
B0JHV3 Elongation factor P6.2e-2741.1Show/hide
Query:  LSSNDIKVGTNIEVDGAPWR--------------DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDR
        +SSND + GT IE+DG+ WR                +T   N     +  +   + + L  A + K   Q TYK+G QFVFMD+ T+EE  L    +GDR
Subjt:  LSSNDIKVGTNIEVDGAPWR--------------DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDR

Query:  TKWLKEGMDCIVLFWNGKVIDFEVPITIQLTVVDVDPGLKGDTAQG
         K+LKEGM+  +LFWN +V+D E+P ++ L + D DPG+KGDTA G
Subjt:  TKWLKEGMDCIVLFWNGKVIDFEVPITIQLTVVDVDPGLKGDTAQG

B1XKV1 Elongation factor P2.1e-2743.15Show/hide
Query:  LSSNDIKVGTNIEVDGAPWR--------------DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDR
        +SSND + GT+IE+DG+ WR                +T   N     +  +   + + + +A + K   Q TYK+G QFVFMD+ TYEE+RL  A +GDR
Subjt:  LSSNDIKVGTNIEVDGAPWR--------------DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDR

Query:  TKWLKEGMDCIVLFWNGKVIDFEVPITIQLTVVDVDPGLKGDTAQG
         K+L E M+  VLFWN +VID E+P T+ L V + DPG+KGDTA G
Subjt:  TKWLKEGMDCIVLFWNGKVIDFEVPITIQLTVVDVDPGLKGDTAQG

B7KGU8 Elongation factor P5.8e-2538.51Show/hide
Query:  LSSNDIKVGTNIEVDGAPWR--------------DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDR
        +SSND + G +IE++G+ W+                +TT  N    ++  +   + + + +A + K   Q TYK+G QFVFMD+ TYEE RLN   +GD 
Subjt:  LSSNDIKVGTNIEVDGAPWR--------------DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDR

Query:  TKWLKEGMDCIVLFWNGKVIDFEVPITIQLTVVDVDPGLKGDTAQGRS
         K++KE M+  VL+W  +V++ E+P ++ L V D DPG+KGDTA G S
Subjt:  TKWLKEGMDCIVLFWNGKVIDFEVPITIQLTVVDVDPGLKGDTAQGRS

Q54760 Elongation factor P1.5e-2843.84Show/hide
Query:  LSSNDIKVGTNIEVDGAPWR--------------DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDR
        +SSND + GT IE+DGA WR                +T   N     +  +   + + + +A + K   Q+TYKDG  FVFMD+ TYEE RL AA +GDR
Subjt:  LSSNDIKVGTNIEVDGAPWR--------------DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDR

Query:  TKWLKEGMDCIVLFWNGKVIDFEVPITIQLTVVDVDPGLKGDTAQG
         K+LKEGM+  V+ WNG+VI+ E+P ++ L V++ DPG+KGDTA G
Subjt:  TKWLKEGMDCIVLFWNGKVIDFEVPITIQLTVVDVDPGLKGDTAQG

Q5N1T5 Elongation factor P1.5e-2843.84Show/hide
Query:  LSSNDIKVGTNIEVDGAPWR--------------DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDR
        +SSND + GT IE+DGA WR                +T   N     +  +   + + + +A + K   Q+TYKDG  FVFMD+ TYEE RL AA +GDR
Subjt:  LSSNDIKVGTNIEVDGAPWR--------------DAKTTTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDR

Query:  TKWLKEGMDCIVLFWNGKVIDFEVPITIQLTVVDVDPGLKGDTAQG
         K+LKEGM+  V+ WNG+VI+ E+P ++ L V++ DPG+KGDTA G
Subjt:  TKWLKEGMDCIVLFWNGKVIDFEVPITIQLTVVDVDPGLKGDTAQG

Arabidopsis top hitse value%identityAlignment
AT3G08740.1 elongation factor P (EF-P) family protein4.0e-4549.05Show/hide
Query:  ASSASFFLGCLSSSTTSLSMP-SKPSVAQIRRFSSTSLRPGFSSNPSILALEIFISTRIY-ALSSNDIKVGTNIEVDGAPWR--------------DAKT
        A  A F + C  SST SL +P S  S + + R +  ++R    +N            RI+ ++S+NDIK GTNIEVDGAPWR                +T
Subjt:  ASSASFFLGCLSSSTTSLSMP-SKPSVAQIRRFSSTSLRPGFSSNPSILALEIFISTRIY-ALSSNDIKVGTNIEVDGAPWR--------------DAKT

Query:  TTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDCIVLFWNGKVIDFEVPITIQLTVVDVDPG
           N +      +   +   ++EAN+YKE KQFTYKDGSQFVFMDL TYEE RLN +D+G++TKWLKEGMDCI+L+W  KVIDF++PIT++L VVDVDPG
Subjt:  TTGNELCPQMFSQDLGSVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDCIVLFWNGKVIDFEVPITIQLTVVDVDPG

Query:  LKGDTAQGRS
        L+GDT QG S
Subjt:  LKGDTAQGRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGCAGAGCTAAGATAAAAGCTAGTTCTCTATCTCGAAGAACAACTGAAACTGCTAAACTCGTCGACTACCATAGAAAGCTCAGGACTGAAGCTTTGAAAATAGA
GACAGAGAAAGGGATAATCGGAGAGACGGCGGAGTGCGAGGGGAGGACACCAACCGCGCTGAAAGAGAGAGGCCAACAATCACATCCGGAGGCCGATCATAGGCGACAGC
TGACATCGCCAGGGCCAGTGGTCGCGCAGCGCACGCCTTCCGTCCGATTCCCACTCGGTCGGAACTCGACGGAAGGCTCAGAAAAACCACTTACAGTAGCCTTGACGCCC
CAAGAAATGGCTCACATCGTCTTCTGTAATGCCTCGTCGGCTTCTTTTTTTCTCGGCTGCTTGTCCTCCTCAACAACTTCGCTTTCGATGCCTTCTAAGCCGTCCGTTGC
CCAAATCAGGCGGTTTTCCTCGACAAGTTTGCGTCCCGGATTTTCGAGTAACCCTTCGATTCTCGCACTGGAAATTTTCATTTCTACACGGATTTATGCGTTATCTAGCA
ACGACATTAAAGTTGGCACCAATATTGAAGTGGATGGAGCTCCTTGGCGGGATGCTAAAACCACAACAGGGAATGAATTATGCCCTCAAATGTTTTCTCAAGACCTTGGT
TCTGTAAAAGATCTTGACGAGGCCAATGTATACAAGGAAGTCAAGCAGTTCACTTACAAAGACGGTTCCCAGTTTGTTTTCATGGACCTGAATACATATGAAGAAATACG
CCTAAATGCAGCAGACGTTGGTGATAGGACAAAGTGGCTGAAAGAGGGAATGGACTGCATTGTATTGTTTTGGAATGGGAAGGTTATTGATTTTGAGGTTCCCATCACAA
TTCAATTGACTGTGGTTGACGTTGATCCTGGACTGAAAGGTGACACGGCACAAGGCAGGAGCATTTCTCATGGATATACTGGCTTGAGTGACTATGCATATTTGTTCATT
GAGGCCTCTTTCTGGGTCGTAAGAAACTTTGCTGGCAGGGATTCCATGCCGGTCTCCCACAAGTCCCAAAACACACAGTGCTTCGTCGGCGCTTCCCCTATGACAAAGTG
CTTGAATTATAACGTCAACGAAAGCATCCACGAGAACTGTGATGGTTTCATCGTTTGGGTTACAACCGCTACTCAGCATTTGGTCATTGAATTCAAATATGTTTCGAATC
TCACTATCTTTTTGAGACATACTGAGAATTTCAGTCCGTTCTTTCGAGTTCGACGGCGTTGGCAGCGATCTAATTCTGTCAGCAGCGCTTTTCACGGTTATTCTACCGGG
GTTCTTAATCTCCAGTTTGAAGGTTCACACCAAAAGCGAAAGGTGCATCTGTGGAAGAGAGTGGAAGAGTTGCGAGGGAAGACGGCTGAACGGCCAGATGCATTCGTTTA
TGCTTGGATTTCCAACCAAGGAAAGATTGATTTTGCAGAGAAGATTGGATGGGAAGCTGAAGTTGCTGGAGCGCATTGCAGGGACAAGGCATCGCTGCGTTACCGAATGA
ACGGAAGTTTGGTCCCTAGCAATTCCTCCTCCTTGTGTTTTTATACATTAACATTACATCATACGCTCGCGTTGCCCGACAAGACAAGGGAAAGCAAGGAGGCAGATGAG
ATGTCAACATTAAATATCAATTTTAGCATTTCAAGATCAACAGCTCAGGGTCAGAGGCGCATGGATTTACTGAGGGCTTTCCAAGACATGGGATTTTTAAAATCTGGAAA
CTCGTTCTTCGCCCTACTCTTTCTCTCCAAAATGGGTTTGAGTAACTTGTTGATAATCTTGTTGAAGTTGAAGGTGGATCCACGACCTGTTGCGGTTGAGACCTCATCGT
CCTCGGCTCCGAAAGACTCCGCCCGAGCATCGCTTGGTTTAGTCTCGCTTCAAAGCTCAGTCTCTCATATTGATCCAACAAGTTGCCGTGTTCATCCCCAAATTTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAGCAGAGCTAAGATAAAAGCTAGTTCTCTATCTCGAAGAACAACTGAAACTGCTAAACTCGTCGACTACCATAGAAAGCTCAGGACTGAAGCTTTGAAAATAGA
GACAGAGAAAGGGATAATCGGAGAGACGGCGGAGTGCGAGGGGAGGACACCAACCGCGCTGAAAGAGAGAGGCCAACAATCACATCCGGAGGCCGATCATAGGCGACAGC
TGACATCGCCAGGGCCAGTGGTCGCGCAGCGCACGCCTTCCGTCCGATTCCCACTCGGTCGGAACTCGACGGAAGGCTCAGAAAAACCACTTACAGTAGCCTTGACGCCC
CAAGAAATGGCTCACATCGTCTTCTGTAATGCCTCGTCGGCTTCTTTTTTTCTCGGCTGCTTGTCCTCCTCAACAACTTCGCTTTCGATGCCTTCTAAGCCGTCCGTTGC
CCAAATCAGGCGGTTTTCCTCGACAAGTTTGCGTCCCGGATTTTCGAGTAACCCTTCGATTCTCGCACTGGAAATTTTCATTTCTACACGGATTTATGCGTTATCTAGCA
ACGACATTAAAGTTGGCACCAATATTGAAGTGGATGGAGCTCCTTGGCGGGATGCTAAAACCACAACAGGGAATGAATTATGCCCTCAAATGTTTTCTCAAGACCTTGGT
TCTGTAAAAGATCTTGACGAGGCCAATGTATACAAGGAAGTCAAGCAGTTCACTTACAAAGACGGTTCCCAGTTTGTTTTCATGGACCTGAATACATATGAAGAAATACG
CCTAAATGCAGCAGACGTTGGTGATAGGACAAAGTGGCTGAAAGAGGGAATGGACTGCATTGTATTGTTTTGGAATGGGAAGGTTATTGATTTTGAGGTTCCCATCACAA
TTCAATTGACTGTGGTTGACGTTGATCCTGGACTGAAAGGTGACACGGCACAAGGCAGGAGCATTTCTCATGGATATACTGGCTTGAGTGACTATGCATATTTGTTCATT
GAGGCCTCTTTCTGGGTCGTAAGAAACTTTGCTGGCAGGGATTCCATGCCGGTCTCCCACAAGTCCCAAAACACACAGTGCTTCGTCGGCGCTTCCCCTATGACAAAGTG
CTTGAATTATAACGTCAACGAAAGCATCCACGAGAACTGTGATGGTTTCATCGTTTGGGTTACAACCGCTACTCAGCATTTGGTCATTGAATTCAAATATGTTTCGAATC
TCACTATCTTTTTGAGACATACTGAGAATTTCAGTCCGTTCTTTCGAGTTCGACGGCGTTGGCAGCGATCTAATTCTGTCAGCAGCGCTTTTCACGGTTATTCTACCGGG
GTTCTTAATCTCCAGTTTGAAGGTTCACACCAAAAGCGAAAGGTGCATCTGTGGAAGAGAGTGGAAGAGTTGCGAGGGAAGACGGCTGAACGGCCAGATGCATTCGTTTA
TGCTTGGATTTCCAACCAAGGAAAGATTGATTTTGCAGAGAAGATTGGATGGGAAGCTGAAGTTGCTGGAGCGCATTGCAGGGACAAGGCATCGCTGCGTTACCGAATGA
ACGGAAGTTTGGTCCCTAGCAATTCCTCCTCCTTGTGTTTTTATACATTAACATTACATCATACGCTCGCGTTGCCCGACAAGACAAGGGAAAGCAAGGAGGCAGATGAG
ATGTCAACATTAAATATCAATTTTAGCATTTCAAGATCAACAGCTCAGGGTCAGAGGCGCATGGATTTACTGAGGGCTTTCCAAGACATGGGATTTTTAAAATCTGGAAA
CTCGTTCTTCGCCCTACTCTTTCTCTCCAAAATGGGTTTGAGTAACTTGTTGATAATCTTGTTGAAGTTGAAGGTGGATCCACGACCTGTTGCGGTTGAGACCTCATCGT
CCTCGGCTCCGAAAGACTCCGCCCGAGCATCGCTTGGTTTAGTCTCGCTTCAAAGCTCAGTCTCTCATATTGATCCAACAAGTTGCCGTGTTCATCCCCAAATTTCTTAA
Protein sequenceShow/hide protein sequence
MGSRAKIKASSLSRRTTETAKLVDYHRKLRTEALKIETEKGIIGETAECEGRTPTALKERGQQSHPEADHRRQLTSPGPVVAQRTPSVRFPLGRNSTEGSEKPLTVALTP
QEMAHIVFCNASSASFFLGCLSSSTTSLSMPSKPSVAQIRRFSSTSLRPGFSSNPSILALEIFISTRIYALSSNDIKVGTNIEVDGAPWRDAKTTTGNELCPQMFSQDLG
SVKDLDEANVYKEVKQFTYKDGSQFVFMDLNTYEEIRLNAADVGDRTKWLKEGMDCIVLFWNGKVIDFEVPITIQLTVVDVDPGLKGDTAQGRSISHGYTGLSDYAYLFI
EASFWVVRNFAGRDSMPVSHKSQNTQCFVGASPMTKCLNYNVNESIHENCDGFIVWVTTATQHLVIEFKYVSNLTIFLRHTENFSPFFRVRRRWQRSNSVSSAFHGYSTG
VLNLQFEGSHQKRKVHLWKRVEELRGKTAERPDAFVYAWISNQGKIDFAEKIGWEAEVAGAHCRDKASLRYRMNGSLVPSNSSSLCFYTLTLHHTLALPDKTRESKEADE
MSTLNINFSISRSTAQGQRRMDLLRAFQDMGFLKSGNSFFALLFLSKMGLSNLLIILLKLKVDPRPVAVETSSSSAPKDSARASLGLVSLQSSVSHIDPTSCRVHPQIS