; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr001908 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr001908
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionUnknown protein
Genome locationtig00001202:9134..28800
RNA-Seq ExpressionSgr001908
SyntenySgr001908
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022141349.1 uncharacterized protein LOC111011770 [Momordica charantia]1.3e-27587.13Show/hide
Query:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGD-DVEESQARSGMERRQSSCVTVEQMGEAFKD--------------
        MRHGGSRRKRP  F RYVVVLCAVGAAIGFLMLN+ MRLEAR+S+SSSDQ GNGD  V ESQ RSG+E R+SSC TVEQMGEAFK+              
Subjt:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGD-DVEESQARSGMERRQSSCVTVEQMGEAFKD--------------

Query:  -------GASRVRHLPPEQFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQL
               GASRVR+LPPEQFCKHGFV+GKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKE+KHLWRL GCGRK+ R L
Subjt:  -------GASRVRHLPPEQFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQL

Query:  IMRIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPD
        IMR DNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQ EVLESRPNVFGELM+VLISPS+DVQEA+YSVLKSG DPD
Subjt:  IMRIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPD

Query:  ISLHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFV
        ISLHMRMLMNRS RGLQAALQCIRKAMLNL+   KPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNIS MHD FHKLDFRVKDWGP+PRWVAFV
Subjt:  ISLHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFV

Query:  DFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAW
        DFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGN+S GS FSFLSSFQSNLL EGLKNQVGWGH+WNRFAGPLSCP+QPNQCALTPLLPPAW
Subjt:  DFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAW

Query:  WDGLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNV
        WDGLWQSPIPRDIKRM NYGV+LSG GTV+EDSLRSFCNA+KNV
Subjt:  WDGLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNV

XP_022938779.1 uncharacterized protein LOC111444894 isoform X1 [Cucurbita moschata]8.1e-27586.72Show/hide
Query:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD---------------
        MRHGGS+RKR SS VRYVVVLCAVGAAIGFLMLNVL RLE+R S+ SSDQFGNGDDVEES ARSG+E R+ SC TVE+MGE F D               
Subjt:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD---------------

Query:  -----GASRVRHLPPEQFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQLIM
             GASRVRHLPPEQFCKHGFV+GK+SEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKE+KHLWRLKGC RKFKR LIM
Subjt:  -----GASRVRHLPPEQFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQLIM

Query:  RIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPDIS
        RID+FEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKN+HP MRAAASNLFGQ EVLESRPNVFGELMR+LISPSKDV+EAV SVLKSGADPDIS
Subjt:  RIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPDIS

Query:  LHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFVDF
        LHMRMLMNRS+RGLQAA+QCIRKAMLNLTT  KPRLVLVSDTP+FVKSIMPILGEFAEVIHFDYEHFRGNIS  HD FHKLDFRVKDWGP+PRWVAFVDF
Subjt:  LHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFVDF

Query:  FLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAWWD
        FLASRAKHAVISGAHRR+GTTYAQLIAALAAAHNLD+ GN+S GS FSFLSSFQSNLL EGLKNQVGWGH+WNRFAGPLSCP QPNQCALTPLLPPAWWD
Subjt:  FLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAWWD

Query:  GLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNV
        GLWQSPIPRDIKRMENYGVHLS  G ++EDSLRSFCNAKKNV
Subjt:  GLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNV

XP_022938780.1 uncharacterized protein LOC111444894 isoform X2 [Cucurbita moschata]8.1e-27586.72Show/hide
Query:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD---------------
        MRHGGS+RKR SS VRYVVVLCAVGAAIGFLMLNVL RLE+R S+ SSDQFGNGDDVEES ARSG+E R+ SC TVE+MGE F D               
Subjt:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD---------------

Query:  -----GASRVRHLPPEQFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQLIM
             GASRVRHLPPEQFCKHGFV+GK+SEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKE+KHLWRLKGC RKFKR LIM
Subjt:  -----GASRVRHLPPEQFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQLIM

Query:  RIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPDIS
        RID+FEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKN+HP MRAAASNLFGQ EVLESRPNVFGELMR+LISPSKDV+EAV SVLKSGADPDIS
Subjt:  RIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPDIS

Query:  LHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFVDF
        LHMRMLMNRS+RGLQAA+QCIRKAMLNLTT  KPRLVLVSDTP+FVKSIMPILGEFAEVIHFDYEHFRGNIS  HD FHKLDFRVKDWGP+PRWVAFVDF
Subjt:  LHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFVDF

Query:  FLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAWWD
        FLASRAKHAVISGAHRR+GTTYAQLIAALAAAHNLD+ GN+S GS FSFLSSFQSNLL EGLKNQVGWGH+WNRFAGPLSCP QPNQCALTPLLPPAWWD
Subjt:  FLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAWWD

Query:  GLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNV
        GLWQSPIPRDIKRMENYGVHLS  G ++EDSLRSFCNAKKNV
Subjt:  GLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNV

XP_023549723.1 uncharacterized protein LOC111808143 isoform X1 [Cucurbita pepo subsp. pepo]1.1e-27486.9Show/hide
Query:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD---------------
        MRHGGSRRKR SS VRYVVVLCAVGAAIGFLMLNVL RLE+R S+  SDQFGNGDDVEES ARSG+E R+ SC TVE+MGE F D               
Subjt:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD---------------

Query:  -----GASRVRHLPPEQFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQLIM
             GASRVRHLPPEQFCKHGFV+GK+SEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKE+KHLWRLKGC RKFKR LIM
Subjt:  -----GASRVRHLPPEQFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQLIM

Query:  RIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPDIS
        RID+FEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKN+HP MRAAASNLFGQ EVLESRPNVFGELMR+LISPSKDV+EAV SVLKSGADPDIS
Subjt:  RIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPDIS

Query:  LHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFVDF
        LHMRMLMNRS+RGLQAA+QCIRKAMLNLTT  KPRLVLVSDTP+FVKSIMPILGEFAEVIHFDYEHFRGNIS  HD FHKLDFRVKDWGP+PRWVAFVDF
Subjt:  LHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFVDF

Query:  FLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAWWD
        FLASRAKHAVISGAHRR+GTTYAQLIAALAAAHNLD+LGN+S GS FSFLSSFQSNLL EGLKNQVGWGH+WNRFAGPLSCP QPNQCALTPLLPPAWWD
Subjt:  FLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAWWD

Query:  GLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNV
        G WQSPIPRDIKRMENYGVHLS  G V+EDSLRSFCNAKKNV
Subjt:  GLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNV

XP_038906660.1 uncharacterized protein LOC120092597 isoform X1 [Benincasa hispida]8.7e-27787.64Show/hide
Query:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD---------------
        MRHGGSRRKR SSFVRYVVVLCAVGAAIGFLMLN+LMRLEAR+S+S+SDQFGNGDDVEE+ A+SGME  +SSC TVEQMGE+FKD               
Subjt:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD---------------

Query:  -----GASRVRHLPPEQFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQLIM
             GASRVR LPPEQFCKHGFV+GK+SEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDI+FTLKE+KHLWRL GC RKF R LIM
Subjt:  -----GASRVRHLPPEQFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQLIM

Query:  RIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPDIS
        RID+FEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKN+HP MRAAASNLFG  EVLESRPNVFGELMRVLISPSKDV+EAV SVLKSGADPDIS
Subjt:  RIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPDIS

Query:  LHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFVDF
        LHMRMLMNRSVRGLQAA+QCIRKAMLNLTT SKPRLVLVSDTPNFVKSIM ILGEFAEVIHFDYEHFRGNIS  HD FHKLDFRVKDWGP+PRWVAFVDF
Subjt:  LHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFVDF

Query:  FLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAWWD
        FLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLD+LGNSS GS FSFLSS+QSNLLREGLKNQVGWGH+WNRFAGPLSCPSQPNQCA TP+LPPAWWD
Subjt:  FLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAWWD

Query:  GLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNV
        GLWQSPIPRDIKRMENYGVHLS  G V+EDSLRSFCNAKKNV
Subjt:  GLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNV

TrEMBL top hitse value%identityAlignment
A0A6J1CIE2 uncharacterized protein LOC1110117706.1e-27687.13Show/hide
Query:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGD-DVEESQARSGMERRQSSCVTVEQMGEAFKD--------------
        MRHGGSRRKRP  F RYVVVLCAVGAAIGFLMLN+ MRLEAR+S+SSSDQ GNGD  V ESQ RSG+E R+SSC TVEQMGEAFK+              
Subjt:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGD-DVEESQARSGMERRQSSCVTVEQMGEAFKD--------------

Query:  -------GASRVRHLPPEQFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQL
               GASRVR+LPPEQFCKHGFV+GKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKE+KHLWRL GCGRK+ R L
Subjt:  -------GASRVRHLPPEQFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQL

Query:  IMRIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPD
        IMR DNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQ EVLESRPNVFGELM+VLISPS+DVQEA+YSVLKSG DPD
Subjt:  IMRIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPD

Query:  ISLHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFV
        ISLHMRMLMNRS RGLQAALQCIRKAMLNL+   KPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNIS MHD FHKLDFRVKDWGP+PRWVAFV
Subjt:  ISLHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFV

Query:  DFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAW
        DFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGN+S GS FSFLSSFQSNLL EGLKNQVGWGH+WNRFAGPLSCP+QPNQCALTPLLPPAW
Subjt:  DFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAW

Query:  WDGLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNV
        WDGLWQSPIPRDIKRM NYGV+LSG GTV+EDSLRSFCNA+KNV
Subjt:  WDGLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNV

A0A6J1FF37 uncharacterized protein LOC111444894 isoform X13.9e-27586.72Show/hide
Query:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD---------------
        MRHGGS+RKR SS VRYVVVLCAVGAAIGFLMLNVL RLE+R S+ SSDQFGNGDDVEES ARSG+E R+ SC TVE+MGE F D               
Subjt:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD---------------

Query:  -----GASRVRHLPPEQFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQLIM
             GASRVRHLPPEQFCKHGFV+GK+SEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKE+KHLWRLKGC RKFKR LIM
Subjt:  -----GASRVRHLPPEQFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQLIM

Query:  RIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPDIS
        RID+FEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKN+HP MRAAASNLFGQ EVLESRPNVFGELMR+LISPSKDV+EAV SVLKSGADPDIS
Subjt:  RIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPDIS

Query:  LHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFVDF
        LHMRMLMNRS+RGLQAA+QCIRKAMLNLTT  KPRLVLVSDTP+FVKSIMPILGEFAEVIHFDYEHFRGNIS  HD FHKLDFRVKDWGP+PRWVAFVDF
Subjt:  LHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFVDF

Query:  FLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAWWD
        FLASRAKHAVISGAHRR+GTTYAQLIAALAAAHNLD+ GN+S GS FSFLSSFQSNLL EGLKNQVGWGH+WNRFAGPLSCP QPNQCALTPLLPPAWWD
Subjt:  FLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAWWD

Query:  GLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNV
        GLWQSPIPRDIKRMENYGVHLS  G ++EDSLRSFCNAKKNV
Subjt:  GLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNV

A0A6J1FKR5 uncharacterized protein LOC111444894 isoform X23.9e-27586.72Show/hide
Query:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD---------------
        MRHGGS+RKR SS VRYVVVLCAVGAAIGFLMLNVL RLE+R S+ SSDQFGNGDDVEES ARSG+E R+ SC TVE+MGE F D               
Subjt:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD---------------

Query:  -----GASRVRHLPPEQFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQLIM
             GASRVRHLPPEQFCKHGFV+GK+SEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKE+KHLWRLKGC RKFKR LIM
Subjt:  -----GASRVRHLPPEQFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQLIM

Query:  RIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPDIS
        RID+FEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKN+HP MRAAASNLFGQ EVLESRPNVFGELMR+LISPSKDV+EAV SVLKSGADPDIS
Subjt:  RIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSVLKSGADPDIS

Query:  LHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFVDF
        LHMRMLMNRS+RGLQAA+QCIRKAMLNLTT  KPRLVLVSDTP+FVKSIMPILGEFAEVIHFDYEHFRGNIS  HD FHKLDFRVKDWGP+PRWVAFVDF
Subjt:  LHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGPAPRWVAFVDF

Query:  FLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAWWD
        FLASRAKHAVISGAHRR+GTTYAQLIAALAAAHNLD+ GN+S GS FSFLSSFQSNLL EGLKNQVGWGH+WNRFAGPLSCP QPNQCALTPLLPPAWWD
Subjt:  FLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCALTPLLPPAWWD

Query:  GLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNV
        GLWQSPIPRDIKRMENYGVHLS  G ++EDSLRSFCNAKKNV
Subjt:  GLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNV

A0A6J1JQR8 uncharacterized protein LOC111488989 isoform X26.7e-27585.05Show/hide
Query:  SLNPYQIVKQRRKMRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD--
        S +P    + + KMRHGG +RKR SS VRYVVVLCAVGAAIGFLMLNVL RLE+R S+ SSDQFGNGDDVEES ARSG+E R+ SC TVEQMGE F D  
Subjt:  SLNPYQIVKQRRKMRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD--

Query:  ------------------GASRVRHLPPEQFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRL
                          GASRVRHLPPEQFCKHGFV+GK+SEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKE+KHLWRL
Subjt:  ------------------GASRVRHLPPEQFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRL

Query:  KGCGRKFKRQLIMRIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAV
        KGC RKFKR LIMRID+FEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKN+HP MRAAASNLFGQ EVLESRPNVFGELMR+LISPSKDV+EAV
Subjt:  KGCGRKFKRQLIMRIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAV

Query:  YSVLKSGADPDISLHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKD
         SVLKSGADPDISLHMRMLMNRS+RGLQAA+QCIRKA+LNLTT  KPRLVLVSDTP+FV SIMPILGEFAEVIHFDYEHFRGNIS  HD FHKLDFRVKD
Subjt:  YSVLKSGADPDISLHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKD

Query:  WGPAPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQ
        WGP+PRWVAFVDFFLASRAKHAVISGAHRR+GTTYAQLIAALAAAHNLD+ GN+S GS FSFLSSFQSNLL EGLKNQVGWGH+WNRFAGPLSCP QPNQ
Subjt:  WGPAPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQ

Query:  CALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNV
        CALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLS  G V+EDSLRSFCNAKKNV
Subjt:  CALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNV

A0A6J1JUE3 uncharacterized protein LOC111488989 isoform X16.7e-27585.05Show/hide
Query:  SLNPYQIVKQRRKMRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD--
        S +P    + + KMRHGG +RKR SS VRYVVVLCAVGAAIGFLMLNVL RLE+R S+ SSDQFGNGDDVEES ARSG+E R+ SC TVEQMGE F D  
Subjt:  SLNPYQIVKQRRKMRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKD--

Query:  ------------------GASRVRHLPPEQFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRL
                          GASRVRHLPPEQFCKHGFV+GK+SEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKE+KHLWRL
Subjt:  ------------------GASRVRHLPPEQFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRL

Query:  KGCGRKFKRQLIMRIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAV
        KGC RKFKR LIMRID+FEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKN+HP MRAAASNLFGQ EVLESRPNVFGELMR+LISPSKDV+EAV
Subjt:  KGCGRKFKRQLIMRIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAV

Query:  YSVLKSGADPDISLHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKD
         SVLKSGADPDISLHMRMLMNRS+RGLQAA+QCIRKA+LNLTT  KPRLVLVSDTP+FV SIMPILGEFAEVIHFDYEHFRGNIS  HD FHKLDFRVKD
Subjt:  YSVLKSGADPDISLHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKD

Query:  WGPAPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQ
        WGP+PRWVAFVDFFLASRAKHAVISGAHRR+GTTYAQLIAALAAAHNLD+ GN+S GS FSFLSSFQSNLL EGLKNQVGWGH+WNRFAGPLSCP QPNQ
Subjt:  WGPAPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQ

Query:  CALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNV
        CALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLS  G V+EDSLRSFCNAKKNV
Subjt:  CALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G26950.1 unknown protein5.2e-18760.36Show/hide
Query:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQS----------SCVTVEQMGEAFK------
        M+ GG+RRKR        ++L +V   IGF     L+ L  R    +S    + DD  ES+  S      S           C TVE+MG  F       
Subjt:  MRHGGSRRKRPSSFVRYVVVLCAVGAAIGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQS----------SCVTVEQMGEAFK------

Query:  --------------DGASRVRHLPPEQFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGC
                      +GAS +R LPPEQFC+HG+V+GK +EAGFGNEMYKILT+ ALSIMLNRSLIIGQTRGK+PFGDYI+YS+ +FT+ EVKHLWR  GC
Subjt:  --------------DGASRVRHLPPEQFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKEVKHLWRLKGC

Query:  GRKFKRQLIMRIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSV
         +K+KR+L+MR+D+FEKPA++NVLCSNWK+WE  IIWFQGTTDAVAAQFFLKN+HP MRAAA  LFG+      R NVFGELM  LISP+KDV+EAV  V
Subjt:  GRKFKRQLIMRIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPNVFGELMRVLISPSKDVQEAVYSV

Query:  LKSGADPDISLHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGP
        L    DPDIS+HMRMLM++SVR ++AA+ C+ KA +N      PR+V+VSDTP+ VK I   +   AEV+HFDY+ FRG+I++   G   LDFR+KDWGP
Subjt:  LKSGADPDISLHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDGFHKLDFRVKDWGP

Query:  APRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCAL
        APRWVAFVDFFLA RAKHAVISGA+RRVGTTYAQL+AALAAA   +SL + S+ S F+FLSSFQSNLL +GLKNQVGWGHVWNR+AGPLSCP QPNQCA 
Subjt:  APRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQCAL

Query:  TPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKK
        TPL PP WWDG+WQSPIPRD +R+  +G+ LSGFGTVNED   ++C+AKK
Subjt:  TPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGTATCGAGTCAATGGATGGACGGCGCCGCCTGGTACTGGAGTTTTTCCGGTGTCTGCTTTGATCGAGTTTTCAAGATCGCAGAATATGCAAATCGCCGCTTCTCC
GAGACGAGTGCGAGCAGTTGTGGTCTCTGTCTTGAGCTCGGTGGCGCACTCCTCGGGGAATAAGCCCTCGATTTTTGGGACCATATCGCGCAGAGCCTCGTACATGCTGG
CGGCGAGTGCGGGATACCCTGAATATACAGCTTCGGCGAGATTCCGTTCGTCGGAGAACAAAACCGTGGCGCATTGTTTGAAGGTTTTGATCCACGTAGCGATCTCCCTC
TCCATTAATTCCCAATTCATCTTCTGGATATCATCAATGCTGTGTTTCTCGAATCCTAGCTTCTGCAAGCTCTCCTCGAATATATTTCTCCTCGCAACCATGATCCGCGT
CGCTACTTCTGGATTCCTCCAGAAAGTACCTGAATTCTTCCTCCAAATAAGATATTGCTCGCTGTTGAGTGCTCCCGATTCGGTTAATCAACGAATTCTTCTCCTCTTCC
AATTTCATTTCATGGAATGCAATATTCAGCTTCGATATCCGATCAACAGCTTCCAGAAACGACGATTCGCCCTCCGGCACCCGAGACTCCTTCGGATTCGATTCTGCATT
ATTATATTGCGCAATCTTTGCATCCAAGAGATCCAAGAATTTGTCGATAAACTGCGGAATCTCCACCGCCGCATCCTCCTCTCGGGGAGGAATATTACATCGGCGTCGGT
GTCTGCAGCTGCATCAGGATCTGGAACCGTCACATTGGCATCATCAACGCCCAATTTGGGGTCGTCCTCCTTGGTGGCGAAGCTACTGGTCTTCTCCAGAAGTTTGAATC
CCTACCAGATTGTGAAGCAAAGGAGGAAAATGAGGCATGGTGGATCCAGGAGGAAGAGACCATCATCGTTTGTACGATATGTCGTCGTTCTATGTGCAGTCGGTGCTGCA
ATTGGATTTCTAATGCTCAATGTTCTTATGAGGCTGGAAGCTCGACAATCAAAATCGAGTTCTGATCAGTTTGGTAATGGCGACGACGTTGAAGAAAGTCAGGCTCGGAG
TGGAATGGAGAGAAGGCAGAGCTCCTGCGTGACGGTGGAACAGATGGGAGAGGCCTTTAAAGATGGTGCTTCAAGAGTGCGACATCTTCCTCCTGAGCAGTTCTGCAAAC
ATGGTTTTGTCATAGGCAAAGCTTCAGAGGCAGGCTTTGGTAATGAGATGTACAAGATTTTAACTGCTGGAGCTTTAAGTATAATGCTGAACCGATCCTTGATCATTGGG
CAAACCAGGGGCAAGTTTCCTTTTGGGGATTACATTTCTTATTCTGATATTTCGTTTACCTTGAAAGAAGTGAAGCATTTGTGGAGACTTAAAGGTTGTGGTAGGAAATT
CAAAAGGCAATTGATTATGCGAATTGATAACTTTGAAAAGCCTGCACAGACAAATGTTCTATGTAGTAATTGGAAGGAATGGGAGCATCCTATCATATGGTTCCAAGGTA
CAACGGATGCTGTCGCCGCTCAATTTTTCTTGAAGAATTTACATCCCACTATGAGGGCTGCTGCATCTAATTTATTTGGACAGGCAGAGGTTTTGGAATCTAGACCTAAT
GTATTTGGAGAGCTCATGAGAGTTCTGATATCTCCTTCAAAGGATGTTCAAGAAGCAGTGTACTCTGTCCTTAAAAGTGGGGCTGATCCTGATATTTCATTGCACATGCG
CATGCTTATGAATAGGTCCGTCAGAGGTTTACAGGCAGCATTGCAATGCATCAGAAAAGCCATGCTTAATCTAACCACGGACTCGAAACCCAGATTGGTTTTAGTATCGG
ATACCCCAAATTTTGTGAAAAGCATCATGCCTATCTTAGGTGAATTTGCAGAGGTCATTCATTTTGATTATGAACATTTCAGAGGAAATATTTCTGAAATGCACGATGGA
TTCCATAAATTGGATTTCAGAGTAAAGGACTGGGGCCCAGCACCGAGATGGGTTGCCTTTGTGGATTTCTTTCTTGCATCCCGTGCCAAGCATGCTGTTATTTCTGGTGC
TCACAGGCGTGTAGGTACTACCTATGCTCAGCTAATCGCAGCACTGGCTGCAGCACACAATCTCGACAGTCTCGGGAATAGTTCAGCTGGTTCAAAATTTTCATTCTTGA
GTAGCTTCCAAAGTAATTTGTTGAGAGAAGGTTTAAAGAACCAGGTTGGGTGGGGGCATGTCTGGAACAGATTTGCAGGTCCTTTAAGCTGCCCTAGCCAGCCTAATCAG
TGTGCCTTAACCCCTCTTCTCCCTCCAGCATGGTGGGATGGACTTTGGCAATCTCCCATTCCACGAGATATTAAACGAATGGAAAATTATGGAGTTCATTTATCGGGCTT
CGGCACTGTTAACGAAGACAGTCTTCGATCATTCTGTAATGCAAAGAAGAATGTTGGTCAAGAACAATTACATATAAACACTATTTTGCAAGAAGTAAATGGGAATATAA
CTTGCAAAGCCAAAGGAGATCTTTACATTAGAGAGAGTTTTCATCTCCCATCTGCAGATGTGAAAGATTCTGCAAATCCATATGGCCGCGTCGGCAACGACATCTCGAGG
TCAATGCCGTACATATCTTCAGCATTCTGGCCTTTGATATGGCTCCCTGGCACAACAATGACCCGCCAACTCAGCAACCGTGCTGATGCTGGCGAACGTTTCTTCGGTAA
GATTGATAGTTGGATCAATGGCCTTGTGGAAGGAGTCCTTGTTGATTTGCATACGACGGAACCAAGTCACAAGGTGCATACTCTCCTCAGGCTGGCTTTCGTCGAGGGCC
CTTCTCCCAGTGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTGTATCGAGTCAATGGATGGACGGCGCCGCCTGGTACTGGAGTTTTTCCGGTGTCTGCTTTGATCGAGTTTTCAAGATCGCAGAATATGCAAATCGCCGCTTCTCC
GAGACGAGTGCGAGCAGTTGTGGTCTCTGTCTTGAGCTCGGTGGCGCACTCCTCGGGGAATAAGCCCTCGATTTTTGGGACCATATCGCGCAGAGCCTCGTACATGCTGG
CGGCGAGTGCGGGATACCCTGAATATACAGCTTCGGCGAGATTCCGTTCGTCGGAGAACAAAACCGTGGCGCATTGTTTGAAGGTTTTGATCCACGTAGCGATCTCCCTC
TCCATTAATTCCCAATTCATCTTCTGGATATCATCAATGCTGTGTTTCTCGAATCCTAGCTTCTGCAAGCTCTCCTCGAATATATTTCTCCTCGCAACCATGATCCGCGT
CGCTACTTCTGGATTCCTCCAGAAAGTACCTGAATTCTTCCTCCAAATAAGATATTGCTCGCTGTTGAGTGCTCCCGATTCGGTTAATCAACGAATTCTTCTCCTCTTCC
AATTTCATTTCATGGAATGCAATATTCAGCTTCGATATCCGATCAACAGCTTCCAGAAACGACGATTCGCCCTCCGGCACCCGAGACTCCTTCGGATTCGATTCTGCATT
ATTATATTGCGCAATCTTTGCATCCAAGAGATCCAAGAATTTGTCGATAAACTGCGGAATCTCCACCGCCGCATCCTCCTCTCGGGGAGGAATATTACATCGGCGTCGGT
GTCTGCAGCTGCATCAGGATCTGGAACCGTCACATTGGCATCATCAACGCCCAATTTGGGGTCGTCCTCCTTGGTGGCGAAGCTACTGGTCTTCTCCAGAAGTTTGAATC
CCTACCAGATTGTGAAGCAAAGGAGGAAAATGAGGCATGGTGGATCCAGGAGGAAGAGACCATCATCGTTTGTACGATATGTCGTCGTTCTATGTGCAGTCGGTGCTGCA
ATTGGATTTCTAATGCTCAATGTTCTTATGAGGCTGGAAGCTCGACAATCAAAATCGAGTTCTGATCAGTTTGGTAATGGCGACGACGTTGAAGAAAGTCAGGCTCGGAG
TGGAATGGAGAGAAGGCAGAGCTCCTGCGTGACGGTGGAACAGATGGGAGAGGCCTTTAAAGATGGTGCTTCAAGAGTGCGACATCTTCCTCCTGAGCAGTTCTGCAAAC
ATGGTTTTGTCATAGGCAAAGCTTCAGAGGCAGGCTTTGGTAATGAGATGTACAAGATTTTAACTGCTGGAGCTTTAAGTATAATGCTGAACCGATCCTTGATCATTGGG
CAAACCAGGGGCAAGTTTCCTTTTGGGGATTACATTTCTTATTCTGATATTTCGTTTACCTTGAAAGAAGTGAAGCATTTGTGGAGACTTAAAGGTTGTGGTAGGAAATT
CAAAAGGCAATTGATTATGCGAATTGATAACTTTGAAAAGCCTGCACAGACAAATGTTCTATGTAGTAATTGGAAGGAATGGGAGCATCCTATCATATGGTTCCAAGGTA
CAACGGATGCTGTCGCCGCTCAATTTTTCTTGAAGAATTTACATCCCACTATGAGGGCTGCTGCATCTAATTTATTTGGACAGGCAGAGGTTTTGGAATCTAGACCTAAT
GTATTTGGAGAGCTCATGAGAGTTCTGATATCTCCTTCAAAGGATGTTCAAGAAGCAGTGTACTCTGTCCTTAAAAGTGGGGCTGATCCTGATATTTCATTGCACATGCG
CATGCTTATGAATAGGTCCGTCAGAGGTTTACAGGCAGCATTGCAATGCATCAGAAAAGCCATGCTTAATCTAACCACGGACTCGAAACCCAGATTGGTTTTAGTATCGG
ATACCCCAAATTTTGTGAAAAGCATCATGCCTATCTTAGGTGAATTTGCAGAGGTCATTCATTTTGATTATGAACATTTCAGAGGAAATATTTCTGAAATGCACGATGGA
TTCCATAAATTGGATTTCAGAGTAAAGGACTGGGGCCCAGCACCGAGATGGGTTGCCTTTGTGGATTTCTTTCTTGCATCCCGTGCCAAGCATGCTGTTATTTCTGGTGC
TCACAGGCGTGTAGGTACTACCTATGCTCAGCTAATCGCAGCACTGGCTGCAGCACACAATCTCGACAGTCTCGGGAATAGTTCAGCTGGTTCAAAATTTTCATTCTTGA
GTAGCTTCCAAAGTAATTTGTTGAGAGAAGGTTTAAAGAACCAGGTTGGGTGGGGGCATGTCTGGAACAGATTTGCAGGTCCTTTAAGCTGCCCTAGCCAGCCTAATCAG
TGTGCCTTAACCCCTCTTCTCCCTCCAGCATGGTGGGATGGACTTTGGCAATCTCCCATTCCACGAGATATTAAACGAATGGAAAATTATGGAGTTCATTTATCGGGCTT
CGGCACTGTTAACGAAGACAGTCTTCGATCATTCTGTAATGCAAAGAAGAATGTTGGTCAAGAACAATTACATATAAACACTATTTTGCAAGAAGTAAATGGGAATATAA
CTTGCAAAGCCAAAGGAGATCTTTACATTAGAGAGAGTTTTCATCTCCCATCTGCAGATGTGAAAGATTCTGCAAATCCATATGGCCGCGTCGGCAACGACATCTCGAGG
TCAATGCCGTACATATCTTCAGCATTCTGGCCTTTGATATGGCTCCCTGGCACAACAATGACCCGCCAACTCAGCAACCGTGCTGATGCTGGCGAACGTTTCTTCGGTAA
GATTGATAGTTGGATCAATGGCCTTGTGGAAGGAGTCCTTGTTGATTTGCATACGACGGAACCAAGTCACAAGGTGCATACTCTCCTCAGGCTGGCTTTCGTCGAGGGCC
CTTCTCCCAGTGATTAG
Protein sequenceShow/hide protein sequence
MVYRVNGWTAPPGTGVFPVSALIEFSRSQNMQIAASPRRVRAVVVSVLSSVAHSSGNKPSIFGTISRRASYMLAASAGYPEYTASARFRSSENKTVAHCLKVLIHVAISL
SINSQFIFWISSMLCFSNPSFCKLSSNIFLLATMIRVATSGFLQKVPEFFLQIRYCSLLSAPDSVNQRILLLFQFHFMECNIQLRYPINSFQKRRFALRHPRLLRIRFCI
IILRNLCIQEIQEFVDKLRNLHRRILLSGRNITSASVSAAASGSGTVTLASSTPNLGSSSLVAKLLVFSRSLNPYQIVKQRRKMRHGGSRRKRPSSFVRYVVVLCAVGAA
IGFLMLNVLMRLEARQSKSSSDQFGNGDDVEESQARSGMERRQSSCVTVEQMGEAFKDGASRVRHLPPEQFCKHGFVIGKASEAGFGNEMYKILTAGALSIMLNRSLIIG
QTRGKFPFGDYISYSDISFTLKEVKHLWRLKGCGRKFKRQLIMRIDNFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNLHPTMRAAASNLFGQAEVLESRPN
VFGELMRVLISPSKDVQEAVYSVLKSGADPDISLHMRMLMNRSVRGLQAALQCIRKAMLNLTTDSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEHFRGNISEMHDG
FHKLDFRVKDWGPAPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDSLGNSSAGSKFSFLSSFQSNLLREGLKNQVGWGHVWNRFAGPLSCPSQPNQ
CALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTVNEDSLRSFCNAKKNVGQEQLHINTILQEVNGNITCKAKGDLYIRESFHLPSADVKDSANPYGRVGNDISR
SMPYISSAFWPLIWLPGTTMTRQLSNRADAGERFFGKIDSWINGLVEGVLVDLHTTEPSHKVHTLLRLAFVEGPSPSD