; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr025793 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr025793
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionNuclear transport factor 2 (NTF2) family protein
Genome locationtig00152936:3082137..3084725
RNA-Seq ExpressionSgr025793
SyntenySgr025793
Gene Ontology termsGO:0071586 - CAAX-box protein processing (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004222 - metalloendopeptidase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR018790 - Protein of unknown function DUF2358
IPR032710 - NTF2-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575195.1 hypothetical protein SDJN03_25834, partial [Cucurbita argyrosperma subsp. sororia]1.9e-11182.01Show/hide
Query:  MATIFSFRSFTSLQSSRNVFPPNRPLKICRIRCHEDNPATASPKNQESEPENALLKVAWYGSELLGIAASFLRSPSSDVEAPDRARVLARDFSGAIHRAV
        MATIF F+S TSLQ+S NV  PNRPLKIC IRC  DNPAT SPK +ES+PENA+LKVAWYGSELLGIAASFLR P +DVE P RA+ L RD SGAI R V
Subjt:  MATIFSFRSFTSLQSSRNVFPPNRPLKICRIRCHEDNPATASPKNQESEPENALLKVAWYGSELLGIAASFLRSPSSDVEAPDRARVLARDFSGAIHRAV

Query:  IVESIKEDFGRSYFVTGNLTLDAYEEECEFADPAGSFKGLRRFKRNCTNFGSLVEKSNMKLTKWEDFEDKGIGHWKFSCILSFPWKPILSATGYTEYYFD
        IVE+IK+DF RSYFVTGNLT++AYEE+CEFADPAGSFKGL RFKRNCTNFGSLV+K NMKLTKWEDFEDKGIGHWKFSCILSFPW+PILSATGYTEYYFD
Subjt:  IVESIKEDFGRSYFVTGNLTLDAYEEECEFADPAGSFKGLRRFKRNCTNFGSLVEKSNMKLTKWEDFEDKGIGHWKFSCILSFPWKPILSATGYTEYYFD

Query:  AGSGKVCRHIEHWNVPKMALLKQIVRPTRGWLWFKKAGG
        AGSGKV RH+EHWNVPKMALL QI+RPTR WLWFKK  G
Subjt:  AGSGKVCRHIEHWNVPKMALLKQIVRPTRGWLWFKKAGG

XP_022147215.1 uncharacterized protein LOC111016220 isoform X1 [Momordica charantia]3.8e-11284.02Show/hide
Query:  MATIFSFRS--FTSLQSSR-NVFPPNRPLKICRIRCHED--NPATASPKNQESEPENALLKVAWYGSELLGIAASFLRSPSSDVEAPDRARVLARDFSGA
        MA I SF+S   T LQ+SR N   PN PLKICRIRC  +  NPAT S KN+ESEPENALLKVAWYGSELLGIAASFLRSP SD EAP RA  LA D SGA
Subjt:  MATIFSFRS--FTSLQSSR-NVFPPNRPLKICRIRCHED--NPATASPKNQESEPENALLKVAWYGSELLGIAASFLRSPSSDVEAPDRARVLARDFSGA

Query:  IHRAVIVESIKEDFGRSYFVTGNLTLDAYEEECEFADPAGSFKGLRRFKRNCTNFGSLVEKSNMKLTKWEDFEDKGIGHWKFSCILSFPWKPILSATGYT
        I R +IVE+IKEDFGRSYFVTGNLTLDAYEEECEFADPAGSFKGLRRF+RNCTNFGSLVE SNMKLTKWEDFEDKGIGHWKFSC+LSFPW+PILSATGYT
Subjt:  IHRAVIVESIKEDFGRSYFVTGNLTLDAYEEECEFADPAGSFKGLRRFKRNCTNFGSLVEKSNMKLTKWEDFEDKGIGHWKFSCILSFPWKPILSATGYT

Query:  EYYFDAGSGKVCRHIEHWNVPKMALLKQIVRPTRGWLWFKKAGG
        EYYFDAGSGKVCRH+EHWNVPKMALLKQI+RPTR WLWFKKAGG
Subjt:  EYYFDAGSGKVCRHIEHWNVPKMALLKQIVRPTRGWLWFKKAGG

XP_022959200.1 uncharacterized protein LOC111460260 [Cucurbita moschata]4.4e-11382.77Show/hide
Query:  MATIFSFRSFTSLQSSRNVFPPNRPLKICRIRCHEDNPATASPKNQESEPENALLKVAWYGSELLGIAASFLRSPSSDVEAPDRARVLARDFSGAIHRAV
        MATIF F+S TSLQ+S N   PNRPLKIC+IRC  +NPAT SPKN+ES+PENA+LKVAWYGSELLGIAASFLR P +DVE P RA+ LARD SGAI R V
Subjt:  MATIFSFRSFTSLQSSRNVFPPNRPLKICRIRCHEDNPATASPKNQESEPENALLKVAWYGSELLGIAASFLRSPSSDVEAPDRARVLARDFSGAIHRAV

Query:  IVESIKEDFGRSYFVTGNLTLDAYEEECEFADPAGSFKGLRRFKRNCTNFGSLVEKSNMKLTKWEDFEDKGIGHWKFSCILSFPWKPILSATGYTEYYFD
        IVE+IK+DF RSYFVTGNLT++AYEE+CEFADPAGSFKGL RFKRNCTNFGSLV+KSNMKLTKWEDFEDKGIGHWKFSCILSFPW+PILSATGYTEYYFD
Subjt:  IVESIKEDFGRSYFVTGNLTLDAYEEECEFADPAGSFKGLRRFKRNCTNFGSLVEKSNMKLTKWEDFEDKGIGHWKFSCILSFPWKPILSATGYTEYYFD

Query:  AGSGKVCRHIEHWNVPKMALLKQIVRPTRGWLWFKKAG
        AGSGKV RH+EHWNVPKMALL QI+RPTR WLWFKK G
Subjt:  AGSGKVCRHIEHWNVPKMALLKQIVRPTRGWLWFKKAG

XP_023547468.1 uncharacterized protein LOC111806404 [Cucurbita pepo subsp. pepo]2.0e-11383.61Show/hide
Query:  MATIFSFRSFTSLQSSRNVFPPNRPLKICRIRCHEDNPATASPKNQESEPENALLKVAWYGSELLGIAASFLRSPSSDVEAPDRARVLARDFSGAIHRAV
        MATIF F+S TSLQ+S N   PNRPLKIC IRC  DNPAT SPKNQES+PENA+LKVAWYGSELLGIAASFLR P +DVE P RA+ LARD SGAI R V
Subjt:  MATIFSFRSFTSLQSSRNVFPPNRPLKICRIRCHEDNPATASPKNQESEPENALLKVAWYGSELLGIAASFLRSPSSDVEAPDRARVLARDFSGAIHRAV

Query:  IVESIKEDFGRSYFVTGNLTLDAYEEECEFADPAGSFKGLRRFKRNCTNFGSLVEKSNMKLTKWEDFEDKGIGHWKFSCILSFPWKPILSATGYTEYYFD
        IVE+IK+DF RSYFVTGNLT++AYEE+CEFADPAGSFKGL RFKRNCTNFGSLV+KSNMKLTKWEDFEDKGIGHWKFSCILSFPW+PILSATGYTEYYFD
Subjt:  IVESIKEDFGRSYFVTGNLTLDAYEEECEFADPAGSFKGLRRFKRNCTNFGSLVEKSNMKLTKWEDFEDKGIGHWKFSCILSFPWKPILSATGYTEYYFD

Query:  AGSGKVCRHIEHWNVPKMALLKQIVRPTRGWLWFKKAG
        AGSGKV RH+EHWNVPKMALL QI+RPTR WLWFKK G
Subjt:  AGSGKVCRHIEHWNVPKMALLKQIVRPTRGWLWFKKAG

XP_038906789.1 uncharacterized protein LOC120092709 isoform X1 [Benincasa hispida]1.2e-11584.17Show/hide
Query:  MATIFSFRSFTSLQSSRNVFPPNRPLKICRIRCHEDNPATASPKNQESEPENALLKVAWYGSELLGIAASFLRSPSSDVEAPDRARVLARDFSGAIHRAV
        MATIFS +S +SL++S N   PN PL+I RIRC  +NPAT SPKNQES+PENA+LKVAWYGSELLGIAASFLR P SDVE P RA+ LARD SGAIHR +
Subjt:  MATIFSFRSFTSLQSSRNVFPPNRPLKICRIRCHEDNPATASPKNQESEPENALLKVAWYGSELLGIAASFLRSPSSDVEAPDRARVLARDFSGAIHRAV

Query:  IVESIKEDFGRSYFVTGNLTLDAYEEECEFADPAGSFKGLRRFKRNCTNFGSLVEKSNMKLTKWEDFEDKGIGHWKFSCILSFPWKPILSATGYTEYYFD
        IVE+IKEDFGRSYFVTGNLTL+AYEE+CEFADPAGSFKGLRRFKRNCTNFGSLV+KSNMKLTKWEDFEDKGIGHW+FSCILSFPW+PILSATGYTEYYFD
Subjt:  IVESIKEDFGRSYFVTGNLTLDAYEEECEFADPAGSFKGLRRFKRNCTNFGSLVEKSNMKLTKWEDFEDKGIGHWKFSCILSFPWKPILSATGYTEYYFD

Query:  AGSGKVCRHIEHWNVPKMALLKQIVRPTRGWLWFKKAGGG
        A SGKVCRH+EHWNVPKMALLKQI+RPTR WLWFKKAGGG
Subjt:  AGSGKVCRHIEHWNVPKMALLKQIVRPTRGWLWFKKAGGG

TrEMBL top hitse value%identityAlignment
A0A1S3C8H3 uncharacterized protein LOC1034976903.2e-10980.33Show/hide
Query:  MATIFSFRSFTSLQSSRNVFPPNRPLKICRIRCHEDNPATASPKNQESEPENALLKVAWYGSELLGIAASFLRSPSSDVEAPDRARVLARDFSGAIHRAV
        MATI SF+  +SLQ+S     PN  L+ICRI C  +NP T SP NQES+PENA+LKVAWYGSELLGIAASFLR P SDV+ P RA+ L  D SGAI R +
Subjt:  MATIFSFRSFTSLQSSRNVFPPNRPLKICRIRCHEDNPATASPKNQESEPENALLKVAWYGSELLGIAASFLRSPSSDVEAPDRARVLARDFSGAIHRAV

Query:  IVESIKEDFGRSYFVTGNLTLDAYEEECEFADPAGSFKGLRRFKRNCTNFGSLVEKSNMKLTKWEDFEDKGIGHWKFSCILSFPWKPILSATGYTEYYFD
        IVE+IKEDF RSYFVTGNLTL+AYEE+CEFADPAGSFKGLRRFKRNCTNFGSLV+KSNMKLTKWEDFEDKGIGHWKFSCILSFPW+PILSATGYT+YYFD
Subjt:  IVESIKEDFGRSYFVTGNLTLDAYEEECEFADPAGSFKGLRRFKRNCTNFGSLVEKSNMKLTKWEDFEDKGIGHWKFSCILSFPWKPILSATGYTEYYFD

Query:  AGSGKVCRHIEHWNVPKMALLKQIVRPTRGWLWFKKAGG
        A SGKVCRH+EHWNVPKMALLKQI+RPTR WLWFKKAGG
Subjt:  AGSGKVCRHIEHWNVPKMALLKQIVRPTRGWLWFKKAGG

A0A5D3BY85 Nuclear transport factor 2 family protein3.2e-10980.33Show/hide
Query:  MATIFSFRSFTSLQSSRNVFPPNRPLKICRIRCHEDNPATASPKNQESEPENALLKVAWYGSELLGIAASFLRSPSSDVEAPDRARVLARDFSGAIHRAV
        MATI SF+  +SLQ+S     PN  L+ICRI C  +NP T SP NQES+PENA+LKVAWYGSELLGIAASFLR P SDV+ P RA+ L  D SGAI R +
Subjt:  MATIFSFRSFTSLQSSRNVFPPNRPLKICRIRCHEDNPATASPKNQESEPENALLKVAWYGSELLGIAASFLRSPSSDVEAPDRARVLARDFSGAIHRAV

Query:  IVESIKEDFGRSYFVTGNLTLDAYEEECEFADPAGSFKGLRRFKRNCTNFGSLVEKSNMKLTKWEDFEDKGIGHWKFSCILSFPWKPILSATGYTEYYFD
        IVE+IKEDF RSYFVTGNLTL+AYEE+CEFADPAGSFKGLRRFKRNCTNFGSLV+KSNMKLTKWEDFEDKGIGHWKFSCILSFPW+PILSATGYT+YYFD
Subjt:  IVESIKEDFGRSYFVTGNLTLDAYEEECEFADPAGSFKGLRRFKRNCTNFGSLVEKSNMKLTKWEDFEDKGIGHWKFSCILSFPWKPILSATGYTEYYFD

Query:  AGSGKVCRHIEHWNVPKMALLKQIVRPTRGWLWFKKAGG
        A SGKVCRH+EHWNVPKMALLKQI+RPTR WLWFKKAGG
Subjt:  AGSGKVCRHIEHWNVPKMALLKQIVRPTRGWLWFKKAGG

A0A6J1CZI8 uncharacterized protein LOC111016220 isoform X11.8e-11284.02Show/hide
Query:  MATIFSFRS--FTSLQSSR-NVFPPNRPLKICRIRCHED--NPATASPKNQESEPENALLKVAWYGSELLGIAASFLRSPSSDVEAPDRARVLARDFSGA
        MA I SF+S   T LQ+SR N   PN PLKICRIRC  +  NPAT S KN+ESEPENALLKVAWYGSELLGIAASFLRSP SD EAP RA  LA D SGA
Subjt:  MATIFSFRS--FTSLQSSR-NVFPPNRPLKICRIRCHED--NPATASPKNQESEPENALLKVAWYGSELLGIAASFLRSPSSDVEAPDRARVLARDFSGA

Query:  IHRAVIVESIKEDFGRSYFVTGNLTLDAYEEECEFADPAGSFKGLRRFKRNCTNFGSLVEKSNMKLTKWEDFEDKGIGHWKFSCILSFPWKPILSATGYT
        I R +IVE+IKEDFGRSYFVTGNLTLDAYEEECEFADPAGSFKGLRRF+RNCTNFGSLVE SNMKLTKWEDFEDKGIGHWKFSC+LSFPW+PILSATGYT
Subjt:  IHRAVIVESIKEDFGRSYFVTGNLTLDAYEEECEFADPAGSFKGLRRFKRNCTNFGSLVEKSNMKLTKWEDFEDKGIGHWKFSCILSFPWKPILSATGYT

Query:  EYYFDAGSGKVCRHIEHWNVPKMALLKQIVRPTRGWLWFKKAGG
        EYYFDAGSGKVCRH+EHWNVPKMALLKQI+RPTR WLWFKKAGG
Subjt:  EYYFDAGSGKVCRHIEHWNVPKMALLKQIVRPTRGWLWFKKAGG

A0A6J1H5M4 uncharacterized protein LOC1114602602.1e-11382.77Show/hide
Query:  MATIFSFRSFTSLQSSRNVFPPNRPLKICRIRCHEDNPATASPKNQESEPENALLKVAWYGSELLGIAASFLRSPSSDVEAPDRARVLARDFSGAIHRAV
        MATIF F+S TSLQ+S N   PNRPLKIC+IRC  +NPAT SPKN+ES+PENA+LKVAWYGSELLGIAASFLR P +DVE P RA+ LARD SGAI R V
Subjt:  MATIFSFRSFTSLQSSRNVFPPNRPLKICRIRCHEDNPATASPKNQESEPENALLKVAWYGSELLGIAASFLRSPSSDVEAPDRARVLARDFSGAIHRAV

Query:  IVESIKEDFGRSYFVTGNLTLDAYEEECEFADPAGSFKGLRRFKRNCTNFGSLVEKSNMKLTKWEDFEDKGIGHWKFSCILSFPWKPILSATGYTEYYFD
        IVE+IK+DF RSYFVTGNLT++AYEE+CEFADPAGSFKGL RFKRNCTNFGSLV+KSNMKLTKWEDFEDKGIGHWKFSCILSFPW+PILSATGYTEYYFD
Subjt:  IVESIKEDFGRSYFVTGNLTLDAYEEECEFADPAGSFKGLRRFKRNCTNFGSLVEKSNMKLTKWEDFEDKGIGHWKFSCILSFPWKPILSATGYTEYYFD

Query:  AGSGKVCRHIEHWNVPKMALLKQIVRPTRGWLWFKKAG
        AGSGKV RH+EHWNVPKMALL QI+RPTR WLWFKK G
Subjt:  AGSGKVCRHIEHWNVPKMALLKQIVRPTRGWLWFKKAG

A0A6J1KYC4 uncharacterized protein LOC1114993121.5e-11181.93Show/hide
Query:  MATIFSFRSFTSLQSSRNVFPPNRPLKICRIRCHEDNPATASPKNQESEPENALLKVAWYGSELLGIAASFLRSPSSDVEAPDRARVLARDFSGAIHRAV
        MATIF F+S TSL++S N   PNRPLKIC IRC  DNPAT SPKN+ES+PENA+LKVAWYGSELLGIAASFLR P +DVE P RA+ L RD SGAI R V
Subjt:  MATIFSFRSFTSLQSSRNVFPPNRPLKICRIRCHEDNPATASPKNQESEPENALLKVAWYGSELLGIAASFLRSPSSDVEAPDRARVLARDFSGAIHRAV

Query:  IVESIKEDFGRSYFVTGNLTLDAYEEECEFADPAGSFKGLRRFKRNCTNFGSLVEKSNMKLTKWEDFEDKGIGHWKFSCILSFPWKPILSATGYTEYYFD
        IVE+IK+DF RSYFVTGNLT++AYEE+CEFADPAGSFKGL RFKRNCTNFGSLV+KSNMKLTKW DFEDKGIGHWKFSCILSFPW+PILSATGYTEYYFD
Subjt:  IVESIKEDFGRSYFVTGNLTLDAYEEECEFADPAGSFKGLRRFKRNCTNFGSLVEKSNMKLTKWEDFEDKGIGHWKFSCILSFPWKPILSATGYTEYYFD

Query:  AGSGKVCRHIEHWNVPKMALLKQIVRPTRGWLWFKKAG
        AGSGKV RH+EHWNVPKMALL QI+RPTR WLWFKK G
Subjt:  AGSGKVCRHIEHWNVPKMALLKQIVRPTRGWLWFKKAG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G46100.1 Nuclear transport factor 2 (NTF2) family protein1.0e-8361.6Show/hide
Query:  MATIFSFRSFTSL------QSSRNVFPPNRPLKICRIRCHEDNPATASPKNQESEPENALLKVAWYGSELLGIAASFLRSP-SSDVEAPDRARVLARDFS
        M    SF S  ++         R++F  NR  +   + C   NP      ++  EP+N LLK+AWYGSELLGIAAS  RSP +S +       V   D S
Subjt:  MATIFSFRSFTSL------QSSRNVFPPNRPLKICRIRCHEDNPATASPKNQESEPENALLKVAWYGSELLGIAASFLRSP-SSDVEAPDRARVLARDFS

Query:  GAIHRAVIVESIKEDFGRSYFVTGNLTLDAYEEECEFADPAGSFKGLRRFKRNCTNFGSLVEKSNMKLTKWEDFEDKGIGHWKFSCILSFPWKPILSATG
        G   R  +V+SIK+DF RSYFVTGNLT + YEE+CEFADPAGSFKGL RFKRNCTNFGSL+EKSNMKL KWE+FEDKGIGHWKFSC++SFPWKPILSATG
Subjt:  GAIHRAVIVESIKEDFGRSYFVTGNLTLDAYEEECEFADPAGSFKGLRRFKRNCTNFGSLVEKSNMKLTKWEDFEDKGIGHWKFSCILSFPWKPILSATG

Query:  YTEYYFDAGSGKVCRHIEHWNVPKMALLKQIVRPTRG
        YTEYYFD  SGK+CRH+EHWNVPK+AL KQ++RP+RG
Subjt:  YTEYYFDAGSGKVCRHIEHWNVPKMALLKQIVRPTRG

AT2G46100.2 Nuclear transport factor 2 (NTF2) family protein1.1e-4554.29Show/hide
Query:  MATIFSFRSFTSL------QSSRNVFPPNRPLKICRIRCHEDNPATASPKNQESEPENALLKVAWYGSELLGIAASFLRSP-SSDVEAPDRARVLARDFS
        M    SF S  ++         R++F  NR  +   + C   NP      ++  EP+N LLK+AWYGSELLGIAAS  RSP +S +       V   D S
Subjt:  MATIFSFRSFTSL------QSSRNVFPPNRPLKICRIRCHEDNPATASPKNQESEPENALLKVAWYGSELLGIAASFLRSP-SSDVEAPDRARVLARDFS

Query:  GAIHRAVIVESIKEDFGRSYFVTGNLTLDAYEEECEFADPAGSFKGLRRFKRNCTNFGSLVEKSNMKLTKWEDFE
        G   R  +V+SIK+DF RSYFVTGNLT + YEE+CEFADPAGSFKGL RFKRNCTNFGSL+EKSNMKL KWE+FE
Subjt:  GAIHRAVIVESIKEDFGRSYFVTGNLTLDAYEEECEFADPAGSFKGLRRFKRNCTNFGSLVEKSNMKLTKWEDFE

AT3G04890.1 Uncharacterized conserved protein (DUF2358)1.7e-1729.8Show/hide
Query:  RCHEDNPATASPKNQESEPENALLKVAWYG-SELLGIAASFLRSPSSDVEAPDRARVLARDFSGAIHRAVIVESIKEDFGRSYFVTGNLTLDAYEEECEF
        RC   +P   +P         A+LK A  G +E L + +    +PSS   A ++ R      +G +    ++  ++ D+   YFVTG LT   Y ++C F
Subjt:  RCHEDNPATASPKNQESEPENALLKVAWYG-SELLGIAASFLRSPSSDVEAPDRARVLARDFSGAIHRAVIVESIKEDFGRSYFVTGNLTLDAYEEECEF

Query:  ADPAGSFKGLRRFKRNCTNFGSLVEKSNMKLTKWEDFEDKG----IGHWKFSCILSFPWKPILSATGYTEYYFDAGSGKVCRHIEHWNVPKMALLKQI
         DP  SF+G   ++RN       +E ++++L   E  E       +  WK    L  PW+P++S  G T Y  D    K+ RH+E WNV  +  + QI
Subjt:  ADPAGSFKGLRRFKRNCTNFGSLVEKSNMKLTKWEDFEDKG----IGHWKFSCILSFPWKPILSATGYTEYYFDAGSGKVCRHIEHWNVPKMALLKQI

AT3G04890.2 Uncharacterized conserved protein (DUF2358)2.1e-1228.74Show/hide
Query:  RCHEDNPATASPKNQESEPENALLKVAWYG-SELLGIAASFLRSPSSDVEAPDRARVLARDFSGAIHRAVIVESIKEDFGRSYFVTGNLTLDAYEEECEF
        RC   +P   +P         A+LK A  G +E L + +    +PSS   A ++ R      +G +    ++  ++ D+   YFVTG LT   Y ++C F
Subjt:  RCHEDNPATASPKNQESEPENALLKVAWYG-SELLGIAASFLRSPSSDVEAPDRARVLARDFSGAIHRAVIVESIKEDFGRSYFVTGNLTLDAYEEECEF

Query:  ADPAGSFKGLRRFKRNCTNFGSLVEKSNMKLTKWEDFEDKG----IGHWKFSCILSFPWKPILSATGYTEYYFD
         DP  SF+G   ++RN       +E ++++L   E  E       +  WK    L  PW+P++S  G T Y  D
Subjt:  ADPAGSFKGLRRFKRNCTNFGSLVEKSNMKLTKWEDFEDKG----IGHWKFSCILSFPWKPILSATGYTEYYFD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGACTATCTTCTCTTTTCGATCATTTACTTCCCTCCAAAGCTCTAGGAACGTCTTCCCACCAAATCGCCCTCTCAAAATCTGCAGAATTCGGTGCCATGAGGACAA
TCCCGCTACCGCTTCGCCGAAAAATCAAGAATCCGAACCCGAGAATGCGCTGCTCAAAGTAGCTTGGTATGGCTCTGAGCTTTTGGGGATTGCCGCTTCATTTCTCCGGT
CGCCGTCGTCGGATGTCGAAGCTCCCGATAGGGCTCGTGTGCTTGCGAGAGATTTCTCCGGTGCAATTCATCGCGCTGTGATTGTGGAATCAATCAAGGAAGACTTTGGG
CGGTCGTATTTCGTCACAGGGAACCTTACTCTTGATGCATATGAAGAGGAGTGTGAATTTGCTGATCCTGCTGGTTCGTTCAAAGGGCTTCGCCGATTCAAAAGAAACTG
TACAAACTTTGGATCCCTTGTGGAGAAGTCAAACATGAAGCTTACCAAATGGGAGGACTTTGAGGACAAGGGCATTGGACATTGGAAGTTCAGTTGTATCTTGTCGTTTC
CTTGGAAACCAATTCTATCCGCAACTGGATATACAGAGTATTATTTTGATGCAGGATCTGGAAAAGTATGCAGGCATATAGAGCACTGGAATGTTCCCAAAATGGCTTTA
CTGAAGCAAATTGTAAGACCCACTAGAGGGTGGTTGTGGTTTAAGAAAGCAGGTGGTGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGACTATCTTCTCTTTTCGATCATTTACTTCCCTCCAAAGCTCTAGGAACGTCTTCCCACCAAATCGCCCTCTCAAAATCTGCAGAATTCGGTGCCATGAGGACAA
TCCCGCTACCGCTTCGCCGAAAAATCAAGAATCCGAACCCGAGAATGCGCTGCTCAAAGTAGCTTGGTATGGCTCTGAGCTTTTGGGGATTGCCGCTTCATTTCTCCGGT
CGCCGTCGTCGGATGTCGAAGCTCCCGATAGGGCTCGTGTGCTTGCGAGAGATTTCTCCGGTGCAATTCATCGCGCTGTGATTGTGGAATCAATCAAGGAAGACTTTGGG
CGGTCGTATTTCGTCACAGGGAACCTTACTCTTGATGCATATGAAGAGGAGTGTGAATTTGCTGATCCTGCTGGTTCGTTCAAAGGGCTTCGCCGATTCAAAAGAAACTG
TACAAACTTTGGATCCCTTGTGGAGAAGTCAAACATGAAGCTTACCAAATGGGAGGACTTTGAGGACAAGGGCATTGGACATTGGAAGTTCAGTTGTATCTTGTCGTTTC
CTTGGAAACCAATTCTATCCGCAACTGGATATACAGAGTATTATTTTGATGCAGGATCTGGAAAAGTATGCAGGCATATAGAGCACTGGAATGTTCCCAAAATGGCTTTA
CTGAAGCAAATTGTAAGACCCACTAGAGGGTGGTTGTGGTTTAAGAAAGCAGGTGGTGGTTGA
Protein sequenceShow/hide protein sequence
MATIFSFRSFTSLQSSRNVFPPNRPLKICRIRCHEDNPATASPKNQESEPENALLKVAWYGSELLGIAASFLRSPSSDVEAPDRARVLARDFSGAIHRAVIVESIKEDFG
RSYFVTGNLTLDAYEEECEFADPAGSFKGLRRFKRNCTNFGSLVEKSNMKLTKWEDFEDKGIGHWKFSCILSFPWKPILSATGYTEYYFDAGSGKVCRHIEHWNVPKMAL
LKQIVRPTRGWLWFKKAGGG