CuGenDBv2

Sgr016938 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr016938
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Unknown protein
Genome location: tig00153016:784505..794207
RNA-Seq Expression: Sgr016938
Synteny: Sgr016938
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
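
The genome location in this record has the form contig:start..end. As a minimal sketch of how such a string could be split into its parts, the Python below parses it and computes the span of the locus. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

# Pattern for "contig:start..end"; the contig name is anything before the colon.
LOCATION_RE = re.compile(r"^(?P<contig>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match["contig"], int(match["start"]), int(match["end"])


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end = parse_location("tig00153016:784505..794207")
print(contig, start, end, span_length(start, end))
```

Under that assumption the locus spans 9703 bp on contig tig00153016.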


Homology
GenBank top hits | e value | % identity (alignment follows each hit)
XP_022141349.1 uncharacterized protein LOC111011770 [Momordica charantia] | 7.3e-280 | 87.14
Query:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGD-DVEESQARSGMERRQSSCVTVEQMGEAFKD--------------
        MRHGGSRRKRP  F RYVVVLCAVGAAIGFLMLN+ MRLEAR+S+SSSDQ GNGD  V ESQ RSG+E R+SSC TVEQMGEAFK+              
Subjt:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGD-DVEESQARSGMERRQSSCVTVEQMGEAFKD--------------

Query:  -------GASRVRHLPPELFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQL
               GASRVR+LPPE FCKHGFV+GKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKE+KHLWRL GCGRK+ R L
Subjt:  -------GASRVRHLPPELFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQL

Query:  IMRIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPD
        IMR DNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQ EVLESRPNVFGELM+VLISPS+DVQEA+YSVLKSG DPD
Subjt:  IMRIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPD

Query:  ISLHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFV
        ISLHMRMLMNRS RGLQAALQCIRKAMLNL+   KPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNIS MHD FHKLDFRVKDWGP+PRWVAFV
Subjt:  ISLHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFV

Query:  DFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAW
        DFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGN+S GS FSFLSSFQSNLL EGLKNQVGWGH+WNRFAGPLSCP+QPNQCALTPLLPPAW
Subjt:  DFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAW

Query:  WDGLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNVVRTIPFIL
        WDGLWQSPIPRDIKRM NYGV+LSG GTV+EDSLRSFCNA+KNVVRTIPFIL
Subjt:  WDGLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNVVRTIPFIL

XP_022938779.1 uncharacterized protein LOC111444894 isoform X1 [Cucurbita moschata] | 4.7e-279 | 86.73
Query:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD---------------
        MRHGGS+RKR SS VRYVVVLCAVGAAIGFLMLNVL RLE+R S+ SSDQFGNGDDVEES ARSG+E R+ SC TVE+MGE F D               
Subjt:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD---------------

Query:  -----GASRVRHLPPELFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQLIM
             GASRVRHLPPE FCKHGFV+GK+SEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKE+KHLWRLKGC RKFKR LIM
Subjt:  -----GASRVRHLPPELFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQLIM

Query:  RIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPDIS
        RID+FEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKN+HP MRAAASNLFGQ EVLESRPNVFGELMR+LISPSKDV+EAV SVLKSGADPDIS
Subjt:  RIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPDIS

Query:  LHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFVDF
        LHMRMLMNRS+RGLQAA+QCIRKAMLNLTT  KPRLVLVSDTP+FVKSIMPILGEFAEVIHFDYEHFRGNIS  HD FHKLDFRVKDWGP+PRWVAFVDF
Subjt:  LHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFVDF

Query:  FLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAWWD
        FLASRAKHAVISGAHRR+GTTYAQLIAALAAAHNLD+ GN+S GS FSFLSSFQSNLL EGLKNQVGWGH+WNRFAGPLSCP QPNQCALTPLLPPAWWD
Subjt:  FLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAWWD

Query:  GLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNVVRTIPFIL
        GLWQSPIPRDIKRMENYGVHLS  G ++EDSLRSFCNAKKNVVRTIPFIL
Subjt:  GLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNVVRTIPFIL

XP_022992741.1 uncharacterized protein LOC111488989 isoform X1 [Cucurbita maxima] | 6.2e-279 | 85.08
Query:  SLNPYQIVKQRRKMRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD--
        S +P    + + KMRHGG +RKR SS VRYVVVLCAVGAAIGFLMLNVL RLE+R S+ SSDQFGNGDDVEES ARSG+E R+ SC TVEQMGE F D  
Subjt:  SLNPYQIVKQRRKMRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD--

Query:  ------------------GASRVRHLPPELFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRL
                          GASRVRHLPPE FCKHGFV+GK+SEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKE+KHLWRL
Subjt:  ------------------GASRVRHLPPELFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRL

Query:  KGCGRKFKRQLIMRIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAV
        KGC RKFKR LIMRID+FEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKN+HP MRAAASNLFGQ EVLESRPNVFGELMR+LISPSKDV+EAV
Subjt:  KGCGRKFKRQLIMRIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAV

Query:  YSVLKSGADPDISLHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKD
         SVLKSGADPDISLHMRMLMNRS+RGLQAA+QCIRKA+LNLTT  KPRLVLVSDTP+FV SIMPILGEFAEVIHFDYEHFRGNIS  HD FHKLDFRVKD
Subjt:  YSVLKSGADPDISLHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKD

Query:  WGPAPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQ
        WGP+PRWVAFVDFFLASRAKHAVISGAHRR+GTTYAQLIAALAAAHNLD+ GN+S GS FSFLSSFQSNLL EGLKNQVGWGH+WNRFAGPLSCP QPNQ
Subjt:  WGPAPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQ

Query:  CALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNVVRTIPFIL
        CALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLS  G V+EDSLRSFCNAKKNVVRTIPFIL
Subjt:  CALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNVVRTIPFIL

XP_023549723.1 uncharacterized protein LOC111808143 isoform X1 [Cucurbita pepo subsp. pepo] | 6.2e-279 | 86.91
Query:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD---------------
        MRHGGSRRKR SS VRYVVVLCAVGAAIGFLMLNVL RLE+R S+  SDQFGNGDDVEES ARSG+E R+ SC TVE+MGE F D               
Subjt:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD---------------

Query:  -----GASRVRHLPPELFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQLIM
             GASRVRHLPPE FCKHGFV+GK+SEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKE+KHLWRLKGC RKFKR LIM
Subjt:  -----GASRVRHLPPELFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQLIM

Query:  RIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPDIS
        RID+FEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKN+HP MRAAASNLFGQ EVLESRPNVFGELMR+LISPSKDV+EAV SVLKSGADPDIS
Subjt:  RIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPDIS

Query:  LHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFVDF
        LHMRMLMNRS+RGLQAA+QCIRKAMLNLTT  KPRLVLVSDTP+FVKSIMPILGEFAEVIHFDYEHFRGNIS  HD FHKLDFRVKDWGP+PRWVAFVDF
Subjt:  LHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFVDF

Query:  FLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAWWD
        FLASRAKHAVISGAHRR+GTTYAQLIAALAAAHNLD+LGN+S GS FSFLSSFQSNLL EGLKNQVGWGH+WNRFAGPLSCP QPNQCALTPLLPPAWWD
Subjt:  FLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAWWD

Query:  GLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNVVRTIPFIL
        G WQSPIPRDIKRMENYGVHLS  G V+EDSLRSFCNAKKNVVRTIPFIL
Subjt:  GLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNVVRTIPFIL

XP_038906660.1 uncharacterized protein LOC120092597 isoform X1 [Benincasa hispida] | 5.0e-281 | 87.64
Query:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD---------------
        MRHGGSRRKR SSFVRYVVVLCAVGAAIGFLMLN+LMRLEAR+S+S+SDQFGNGDDVEE+ A+SGME  +SSC TVEQMGE+FKD               
Subjt:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD---------------

Query:  -----GASRVRHLPPELFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQLIM
             GASRVR LPPE FCKHGFV+GK+SEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDI+FTLKE+KHLWRL GC RKF R LIM
Subjt:  -----GASRVRHLPPELFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQLIM

Query:  RIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPDIS
        RID+FEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKN+HP MRAAASNLFG  EVLESRPNVFGELMRVLISPSKDV+EAV SVLKSGADPDIS
Subjt:  RIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPDIS

Query:  LHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFVDF
        LHMRMLMNRSVRGLQAA+QCIRKAMLNLTT SKPRLVLVSDTPNFVKSIM ILGEFAEVIHFDYEHFRGNIS  HD FHKLDFRVKDWGP+PRWVAFVDF
Subjt:  LHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFVDF

Query:  FLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAWWD
        FLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLD+LGNSS GS FSFLSS+QSNLLREGLKNQVGWGH+WNRFAGPLSCPSQPNQCA TP+LPPAWWD
Subjt:  FLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAWWD

Query:  GLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNVVRTIPFIL
        GLWQSPIPRDIKRMENYGVHLS  G V+EDSLRSFCNAKKNVVRTIPFIL
Subjt:  GLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNVVRTIPFIL

TrEMBL top hits | e value | % identity (alignment follows each hit)
A0A6J1CIE2 uncharacterized protein LOC111011770 | 3.5e-280 | 87.14
Query:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGD-DVEESQARSGMERRQSSCVTVEQMGEAFKD--------------
        MRHGGSRRKRP  F RYVVVLCAVGAAIGFLMLN+ MRLEAR+S+SSSDQ GNGD  V ESQ RSG+E R+SSC TVEQMGEAFK+              
Subjt:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGD-DVEESQARSGMERRQSSCVTVEQMGEAFKD--------------

Query:  -------GASRVRHLPPELFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQL
               GASRVR+LPPE FCKHGFV+GKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKE+KHLWRL GCGRK+ R L
Subjt:  -------GASRVRHLPPELFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQL

Query:  IMRIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPD
        IMR DNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQ EVLESRPNVFGELM+VLISPS+DVQEA+YSVLKSG DPD
Subjt:  IMRIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPD

Query:  ISLHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFV
        ISLHMRMLMNRS RGLQAALQCIRKAMLNL+   KPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNIS MHD FHKLDFRVKDWGP+PRWVAFV
Subjt:  ISLHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFV

Query:  DFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAW
        DFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGN+S GS FSFLSSFQSNLL EGLKNQVGWGH+WNRFAGPLSCP+QPNQCALTPLLPPAW
Subjt:  DFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAW

Query:  WDGLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNVVRTIPFIL
        WDGLWQSPIPRDIKRM NYGV+LSG GTV+EDSLRSFCNA+KNVVRTIPFIL
Subjt:  WDGLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNVVRTIPFIL

A0A6J1E7F2 uncharacterized protein LOC111430593 | 4.0e-276 | 85.82
Query:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD---------------
        MRHGGSR+KR SSF RYVVVLCAVGA+IGFLMLN LMR+EA++S+SSSDQ GNGDDVEES+  S M+ R+ SC TVEQMGEAFKD               
Subjt:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD---------------

Query:  -----GASRVRHLPPELFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQLIM
             GASRVR LPPE FCKHGFV+GK+SEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYS+++FT+KE+KHLWRLKGC RKF R LIM
Subjt:  -----GASRVRHLPPELFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQLIM

Query:  RIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPDIS
        R D+FEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKN+HP MRAAASNLFG  EVLESRPNVFGELMRVLISPSKDV+EAV+SVLKSG DPDIS
Subjt:  RIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPDIS

Query:  LHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFVDF
        LHMRMLMNRSVRGLQAALQCIRK + NLTTDSKPRLVLVSDTPNFVKSI+P+LGEFAEVIHFDYEHFRG IS  HD FHKLDFRVKDWGP+PRWVAFVDF
Subjt:  LHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFVDF

Query:  FLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAWWD
        FLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLD+LGN+S GS F FLSSFQSNLLREGLKNQVGWGH+WNRFAGPLSCPSQPNQCALTPLLPPAWWD
Subjt:  FLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAWWD

Query:  GLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNVVRTIPFIL
        GLWQSPIPRDIKRMENYGVHLSGFGT++EDSLRSFCNAKKNVVRTIPFIL
Subjt:  GLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNVVRTIPFIL

A0A6J1FF37 uncharacterized protein LOC111444894 isoform X1 | 2.3e-279 | 86.73
Query:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD---------------
        MRHGGS+RKR SS VRYVVVLCAVGAAIGFLMLNVL RLE+R S+ SSDQFGNGDDVEES ARSG+E R+ SC TVE+MGE F D               
Subjt:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD---------------

Query:  -----GASRVRHLPPELFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQLIM
             GASRVRHLPPE FCKHGFV+GK+SEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKE+KHLWRLKGC RKFKR LIM
Subjt:  -----GASRVRHLPPELFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQLIM

Query:  RIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPDIS
        RID+FEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKN+HP MRAAASNLFGQ EVLESRPNVFGELMR+LISPSKDV+EAV SVLKSGADPDIS
Subjt:  RIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPDIS

Query:  LHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFVDF
        LHMRMLMNRS+RGLQAA+QCIRKAMLNLTT  KPRLVLVSDTP+FVKSIMPILGEFAEVIHFDYEHFRGNIS  HD FHKLDFRVKDWGP+PRWVAFVDF
Subjt:  LHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFVDF

Query:  FLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAWWD
        FLASRAKHAVISGAHRR+GTTYAQLIAALAAAHNLD+ GN+S GS FSFLSSFQSNLL EGLKNQVGWGH+WNRFAGPLSCP QPNQCALTPLLPPAWWD
Subjt:  FLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAWWD

Query:  GLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNVVRTIPFIL
        GLWQSPIPRDIKRMENYGVHLS  G ++EDSLRSFCNAKKNVVRTIPFIL
Subjt:  GLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNVVRTIPFIL

A0A6J1FKR5 uncharacterized protein LOC111444894 isoform X2 | 7.6e-275 | 86.53
Query:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD---------------
        MRHGGS+RKR SS VRYVVVLCAVGAAIGFLMLNVL RLE+R S+ SSDQFGNGDDVEES ARSG+E R+ SC TVE+MGE F D               
Subjt:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD---------------

Query:  -----GASRVRHLPPELFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQLIM
             GASRVRHLPPE FCKHGFV+GK+SEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKE+KHLWRLKGC RKFKR LIM
Subjt:  -----GASRVRHLPPELFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQLIM

Query:  RIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPDIS
        RID+FEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKN+HP MRAAASNLFGQ EVLESRPNVFGELMR+LISPSKDV+EAV SVLKSGADPDIS
Subjt:  RIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPDIS

Query:  LHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFVDF
        LHMRMLMNRS+RGLQAA+QCIRKAMLNLTT  KPRLVLVSDTP+FVKSIMPILGEFAEVIHFDYEHFRGNIS  HD FHKLDFRVKDWGP+PRWVAFVDF
Subjt:  LHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFVDF

Query:  FLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAWWD
        FLASRAKHAVISGAHRR+GTTYAQLIAALAAAHNLD+ GN+S GS FSFLSSFQSNLL EGLKNQVGWGH+WNRFAGPLSCP QPNQCALTPLLPPAWWD
Subjt:  FLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAWWD

Query:  GLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNV
        GLWQSPIPRDIKRMENYGVHLS  G ++EDSLRSFCNAKKNV
Subjt:  GLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNV

A0A6J1JUE3 uncharacterized protein LOC111488989 isoform X1 | 3.0e-279 | 85.08
Query:  SLNPYQIVKQRRKMRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD--
        S +P    + + KMRHGG +RKR SS VRYVVVLCAVGAAIGFLMLNVL RLE+R S+ SSDQFGNGDDVEES ARSG+E R+ SC TVEQMGE F D  
Subjt:  SLNPYQIVKQRRKMRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD--

Query:  ------------------GASRVRHLPPELFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRL
                          GASRVRHLPPE FCKHGFV+GK+SEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKE+KHLWRL
Subjt:  ------------------GASRVRHLPPELFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRL

Query:  KGCGRKFKRQLIMRIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAV
        KGC RKFKR LIMRID+FEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKN+HP MRAAASNLFGQ EVLESRPNVFGELMR+LISPSKDV+EAV
Subjt:  KGCGRKFKRQLIMRIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAV

Query:  YSVLKSGADPDISLHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKD
         SVLKSGADPDISLHMRMLMNRS+RGLQAA+QCIRKA+LNLTT  KPRLVLVSDTP+FV SIMPILGEFAEVIHFDYEHFRGNIS  HD FHKLDFRVKD
Subjt:  YSVLKSGADPDISLHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKD

Query:  WGPAPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQ
        WGP+PRWVAFVDFFLASRAKHAVISGAHRR+GTTYAQLIAALAAAHNLD+ GN+S GS FSFLSSFQSNLL EGLKNQVGWGH+WNRFAGPLSCP QPNQ
Subjt:  WGPAPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQ

Query:  CALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNVVRTIPFIL
        CALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLS  G V+EDSLRSFCNAKKNVVRTIPFIL
Subjt:  CALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNVVRTIPFIL

SwissProt top hits | e value | % identity (alignment follows each hit)
No hits found

Arabidopsis top hits | e value | % identity (alignment follows each hit)
AT3G26950.1 unknown protein | 3.1e-188 | 59.75
Query:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQS----------SCVTVEQMGEAFK------
        M+ GG+RRKR        ++L +V   IGF     L+ L  R    +S    + DD  ES+  S      S           C TVE+MG  F       
Subjt:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQS----------SCVTVEQMGEAFK------

Query:  --------------DGASRVRHLPPELFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGC
                      +GAS +R LPPE FC+HG+V+GK +EAGFGNEMYKILT+ ALSIMLNRSLIIGQTRGK+PFGDYI+YS+ +FT+ EVKHLWR  GC
Subjt:  --------------DGASRVRHLPPELFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGC

Query:  GRKFKRQLIMRIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSV
         +K+KR+L+MR+D+FEKPA++NVLCSNWK+WE  IIWFQGTTDAVAAQFFLKN+HP MRAAA  LFG+      R NVFGELM  LISP+KDV+EAV  V
Subjt:  GRKFKRQLIMRIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSV

Query:  LKSGADPDISLHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGP
        L    DPDIS+HMRMLM++SVR ++AA+ C+ KA +N      PR+V+VSDTP+ VK I   +   AEV+HFDY+ FRG+I++   G   LDFR+KDWGP
Subjt:  LKSGADPDISLHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGP

Query:  APRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCAL
        APRWVAFVDFFLA RAKHAVISGA+RRVGTTYAQL+AALAAA   +SL + S+ S F+FLSSFQSNLL +GLKNQVGWGHVWNR+AGPLSCP QPNQCA 
Subjt:  APRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCAL

Query:  TPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNVVRTIPFI
        TPL PP WWDG+WQSPIPRD +R+  +G+ LSGFGTVNED   ++C+AKK  V T+  I
Subjt:  TPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNVVRTIPFI
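
The alignment blocks above use the usual BLAST pairwise layout, in which the unlabeled middle line repeats a residue where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. As a rough sketch of how that middle line can be tallied, the Python below counts identities and positives for a short excerpt taken from the start of the last Arabidopsis segment. The function name `midline_stats` is hypothetical, and the result describes the excerpt only. The % identity values in the hit lists come from the search tool over each full alignment and are not recomputed here.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Tally a BLAST-style midline against its query segment.

    Letters in the midline mark identical residues, '+' marks a
    conservative substitution, and a space marks a mismatch or gap.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline segments must have equal length")
    identical = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    length = len(midline)
    return {
        "length": length,
        "identical": identical,
        "positives": identical + conservative,
        "percent_identity": 100 * identical / length,
    }


# First 20 columns of the final segment of the AT3G26950.1 alignment.
query_excerpt = "TPLLPPAWWDGLWQSPIPRD"
midline_excerpt = "TPL PP WWDG+WQSPIPRD"

stats = midline_stats(query_excerpt, midline_excerpt)
print(stats)
```

For this 20-column excerpt the tally is 17 identical positions and 18 positives, which is 85% identity over the excerpt.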


Sequences
CDS sequence
ATGGTGTATCGAGTCAATGGATGGACGGCGCCGCCTGGTACCGGAGTTTTTCCGGTGTCTGCTTTGATCGAGTTTTCAAGATCGCAGAATATGCAAATCGCCGCTTCTCC
GAGACGAGTGCGAGCAGTTGTGGTCTCTGTCTTGAGCTCGGTGGCGCACTCCTCGGGGAATAAGCCCTCGATTTTTGGGACCATATCGCGCAGAGCCTCGTACATGCTGG
CGGCGAGTGCGGGATACCCTGAATATACAGCTTCGGCGAGATTCCGTTCGTCGGAGAACAAAACCGTGGCGCATTGTTTGAAGGTTTTGATCCACGTAGCGATCTCCCTC
TCCATTAATTCCCAATTCATCTTCTGGATATCATCAATGCTGTGTTTCTCGAATCCTAGCTTCTGCAAGCTCTCCTCGAATATATTTCTCCTCGCAACCATGATCCGCGT
CGCTACTTCTGGATTCCTCCAGAAAGTACCTGAATTCTTCCTCCAAATAAGATATTGCTCGCTGTTGAGTGCTCCCGATTCGGTTAATCAACGAATTCTTCTCCTCTTCC
AATTTCATTTCATGGAATGCAATATTCAGCTTCGATATCCGATCAACAGCTTCCAGAAACGACGATTCGCCCTCCGGCACCCGAGACTCCTTCGGATTCGATTCTGCATT
ATTATATTGCGCAATCTTTGCATCCAAGAGATCCAAGAATTTGTCGATAAACTGCGGAATCTCCACCGCCGCATCCTCCTGTCGGGGAGGAATTTTACATCGGCGTCGGT
GTCTGCAGCTGCTTCAGGATCTGGAACCGTCACATTGGCATCATCAACGCCCAATTTGGGGTCGTCCTCCTTGGTGGCGAAGCTACTGGTCTTCTCCAGAAGTTTGAATC
CCTACCAGATTGTGAAGCAAAGGAGGAAAATGAGGCATGGTGGATCCAGGAGGAAGAGACCATCATCGTTTGTACGATATGTCGTCGTTCTATGTGCAGTCGGTGCTGCA
ATTGGATTTCTAATGCTCAATGTTCTTATGAGGCTGGAAGCTCGACAATCAAAATCGAGTTCTGATCAGTTTGGTAATGGCGACGACGTTGAAGAAAGTCAGGCTCGGAG
TGGAATGGAGAGAAGGCAGAGCTCCTGCGTGACGGTGGAACAGATGGGAGAGGCCTTTAAAGATGGTGCTTCAAGAGTGCGACATCTTCCTCCTGAGCTGTTCTGCAAAC
ATGGTTTTGTCATAGGCAAAGCTTCAGAGGCAGGCTTTGGTAATGAGATGTACAAGATTTTAACTGCTGGAGCTTTAAGTATAATGCTGAACCGATCCTTGATCATTGGG
CAAACCAGGGGCAAGTTTCCTTTCGGGGATTACATTTCTTATTCTGATATTTCGTTTACCTTGAAAGAAGTCAAGCATTTGTGGAGACTTAAAGGTTGTGGTAGGAAATT
CAAAAGGCAATTGATTATGCGAATTGATAACTTTGAAAAGCCTGCACAGACAAATGTTCTATGTAGTAATTGGAAGGAATGGGAGCATCCTATCATATGGTTCCAAGGTA
CAACGGATGCTGTCGCTGCTCAATTTTTCTTGAAGAATTTACATCCCACTATGAGGGCTGCTGCGTCTAATTTATTTGGACAGGCAGAGGTTTTGGAATCTAGACCTAAT
GTATTTGGAGAGCTCATGAGAGTTCTGATATCTCCTTCAAAGGATGTTCAAGAAGCAGTGTACTCGGTCCTTAAAAGTGGGGCTGATCCTGATATTTCATTGCACATGCG
CATGCTTATGAATAGGTCCGTCAGAGGTTTACAGGCAGCATTGCAATGCATCAGAAAAGCCATGCTTAATCTAACCACGGACTCGAAACCCAGATTGGTTTTAGTATCGG
ATACCCCAAATTTTGTGAAAAGTATCATGCCTATCTTAGGTGAATTTGCAGAGGTCATTCATTTTGATTATGAACATTTCAGAGGAAATATTTCTGAAATGCACGATGGA
TTCCATAAATTGGATTTCAGAGTAAAGGACTGGGGCCCAGCACCGAGATGGGTTGCCTTTGTGGATTTCTTTCTTGCATCCCGTGCCAAGCATGCTGTTATTTCTGGTGC
TCACAGGCGTGTAGGTACTACCTATGCTCAGCTAATCGCAGCACTGGCTGCAGCACACAATCTCGACAGTCTCGGGAATAGTTCAGCTGGTTCAAAATTTTCATTCTTGA
GTAGCTTCCAAAGTAATTTGTTGAGAGAAGGTTTAAAGAACCAGGTTGGGTGGGGGCATGTCTGGAACAGATTTGCAGGTCCTTTAAGCTGCCCTAGCCAGCCTAATCAG
TGTGCCTTAACCCCTCTTCTCCCTCCAGCATGGTGGGATGGACTTTGGCAATCTCCCATTCCACGAGATATTAAAAGAATGGAAAATTATGGAGTTCATTTATCGGGCTT
CGGCACTGTTAACGAAGACAGTCTTCGATCATTCTGTAATGCAAAGAAGAATGTTGTGAGGACTATCCCTTTCATACTATAG
mRNA sequence
ATGGTGTATCGAGTCAATGGATGGACGGCGCCGCCTGGTACCGGAGTTTTTCCGGTGTCTGCTTTGATCGAGTTTTCAAGATCGCAGAATATGCAAATCGCCGCTTCTCC
GAGACGAGTGCGAGCAGTTGTGGTCTCTGTCTTGAGCTCGGTGGCGCACTCCTCGGGGAATAAGCCCTCGATTTTTGGGACCATATCGCGCAGAGCCTCGTACATGCTGG
CGGCGAGTGCGGGATACCCTGAATATACAGCTTCGGCGAGATTCCGTTCGTCGGAGAACAAAACCGTGGCGCATTGTTTGAAGGTTTTGATCCACGTAGCGATCTCCCTC
TCCATTAATTCCCAATTCATCTTCTGGATATCATCAATGCTGTGTTTCTCGAATCCTAGCTTCTGCAAGCTCTCCTCGAATATATTTCTCCTCGCAACCATGATCCGCGT
CGCTACTTCTGGATTCCTCCAGAAAGTACCTGAATTCTTCCTCCAAATAAGATATTGCTCGCTGTTGAGTGCTCCCGATTCGGTTAATCAACGAATTCTTCTCCTCTTCC
AATTTCATTTCATGGAATGCAATATTCAGCTTCGATATCCGATCAACAGCTTCCAGAAACGACGATTCGCCCTCCGGCACCCGAGACTCCTTCGGATTCGATTCTGCATT
ATTATATTGCGCAATCTTTGCATCCAAGAGATCCAAGAATTTGTCGATAAACTGCGGAATCTCCACCGCCGCATCCTCCTGTCGGGGAGGAATTTTACATCGGCGTCGGT
GTCTGCAGCTGCTTCAGGATCTGGAACCGTCACATTGGCATCATCAACGCCCAATTTGGGGTCGTCCTCCTTGGTGGCGAAGCTACTGGTCTTCTCCAGAAGTTTGAATC
CCTACCAGATTGTGAAGCAAAGGAGGAAAATGAGGCATGGTGGATCCAGGAGGAAGAGACCATCATCGTTTGTACGATATGTCGTCGTTCTATGTGCAGTCGGTGCTGCA
ATTGGATTTCTAATGCTCAATGTTCTTATGAGGCTGGAAGCTCGACAATCAAAATCGAGTTCTGATCAGTTTGGTAATGGCGACGACGTTGAAGAAAGTCAGGCTCGGAG
TGGAATGGAGAGAAGGCAGAGCTCCTGCGTGACGGTGGAACAGATGGGAGAGGCCTTTAAAGATGGTGCTTCAAGAGTGCGACATCTTCCTCCTGAGCTGTTCTGCAAAC
ATGGTTTTGTCATAGGCAAAGCTTCAGAGGCAGGCTTTGGTAATGAGATGTACAAGATTTTAACTGCTGGAGCTTTAAGTATAATGCTGAACCGATCCTTGATCATTGGG
CAAACCAGGGGCAAGTTTCCTTTCGGGGATTACATTTCTTATTCTGATATTTCGTTTACCTTGAAAGAAGTCAAGCATTTGTGGAGACTTAAAGGTTGTGGTAGGAAATT
CAAAAGGCAATTGATTATGCGAATTGATAACTTTGAAAAGCCTGCACAGACAAATGTTCTATGTAGTAATTGGAAGGAATGGGAGCATCCTATCATATGGTTCCAAGGTA
CAACGGATGCTGTCGCTGCTCAATTTTTCTTGAAGAATTTACATCCCACTATGAGGGCTGCTGCGTCTAATTTATTTGGACAGGCAGAGGTTTTGGAATCTAGACCTAAT
GTATTTGGAGAGCTCATGAGAGTTCTGATATCTCCTTCAAAGGATGTTCAAGAAGCAGTGTACTCGGTCCTTAAAAGTGGGGCTGATCCTGATATTTCATTGCACATGCG
CATGCTTATGAATAGGTCCGTCAGAGGTTTACAGGCAGCATTGCAATGCATCAGAAAAGCCATGCTTAATCTAACCACGGACTCGAAACCCAGATTGGTTTTAGTATCGG
ATACCCCAAATTTTGTGAAAAGTATCATGCCTATCTTAGGTGAATTTGCAGAGGTCATTCATTTTGATTATGAACATTTCAGAGGAAATATTTCTGAAATGCACGATGGA
TTCCATAAATTGGATTTCAGAGTAAAGGACTGGGGCCCAGCACCGAGATGGGTTGCCTTTGTGGATTTCTTTCTTGCATCCCGTGCCAAGCATGCTGTTATTTCTGGTGC
TCACAGGCGTGTAGGTACTACCTATGCTCAGCTAATCGCAGCACTGGCTGCAGCACACAATCTCGACAGTCTCGGGAATAGTTCAGCTGGTTCAAAATTTTCATTCTTGA
GTAGCTTCCAAAGTAATTTGTTGAGAGAAGGTTTAAAGAACCAGGTTGGGTGGGGGCATGTCTGGAACAGATTTGCAGGTCCTTTAAGCTGCCCTAGCCAGCCTAATCAG
TGTGCCTTAACCCCTCTTCTCCCTCCAGCATGGTGGGATGGACTTTGGCAATCTCCCATTCCACGAGATATTAAAAGAATGGAAAATTATGGAGTTCATTTATCGGGCTT
CGGCACTGTTAACGAAGACAGTCTTCGATCATTCTGTAATGCAAAGAAGAATGTTGTGAGGACTATCCCTTTCATACTATAG
Protein sequence
MVYRVNGWTAPPGTGVFPVSALIEFSRSQNMQIAASPRRVRAVVVSVLSSVAHSSGNKPSIFGTISRRASYMLAASAGYPEYTASARFRSSENKTVAHCLKVLIHVAISL
SINSQFIFWISSMLCFSNPSFCKLSSNIFLLATMIRVATSGFLQKVPEFFLQIRYCSLLSAPDSVNQRILLLFQFHFMECNIQLRYPINSFQKRRFALRHPRLLRIRFCI
IILRNLCIQEIQEFVDKLRNLHRRILLSGRNFTSASVSAAASGSGTVTLASSTPNLGSSSLVAKLLVFSRSLNPYQIVKQRRKMRHGGSRRKRPSSFVRYVVVLCAVGAA
IGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKDGASRVRHLPPELFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIG
QTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQLIMRIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPN
VFGELMRVLISPSKDVQEAVYSVLKSGADPDISLHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDG
FHKLDFRVKDWGPAPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQ
CALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNVVRTIPFIL