; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC01G009120 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC01G009120
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionUnknown protein
Genome locationCicolChr01:10284222..10289125
RNA-Seq ExpressionCcUC01G009120
SyntenyCcUC01G009120
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022922653.1 uncharacterized protein LOC111430593 [Cucurbita moschata]2.2e-26779.26Show/hide
Query:  MRHGGSRRKRSSSFVRYVVLLCAVAAVIGFFMLNVLMRLEARESESSSDQFGNGDDVEETLVQTGMEGSRSSCATVEQMGEPFKDGVWKESLRISHEDFL
        MRHGGSR+KR SSF RYVV+LCAV A IGF MLN LMR+EA+ESESSSDQ GNGDDVEE+ V + M+G R SCATVEQMGE FKDGVWKESLR+      
Subjt:  MRHGGSRRKRSSSFVRYVVLLCAVAAVIGFFMLNVLMRLEARESESSSDQFGNGDDVEETLVQTGMEGSRSSCATVEQMGEPFKDGVWKESLRISHEDFL

Query:  LSQAVVPKTSKWNFLNLTSYGVPSDEIIHKGGNPCNSCFRIRIGHRNIGASRVRQLPPEQFCKHGFVMGKSSEAGFGNEM--------------------
               +T   N   L                               GASRVRQLPPEQFCKHGFVMGKSSEAGFGNEM                    
Subjt:  LSQAVVPKTSKWNFLNLTSYGVPSDEIIHKGGNPCNSCFRIRIGHRNIGASRVRQLPPEQFCKHGFVMGKSSEAGFGNEM--------------------

Query:  ---GKFPFGDYISYSDISFTLNEIKHLWRLNGCVRKFNRHLIMRIDDFEKPSQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFG
           GKFPFGDYISYS+++FT+ EIKHLWRL GC+RKFNRHLIMR DDFEKP+QTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFG
Subjt:  ---GKFPFGDYISYSDISFTLNEIKHLWRLNGCVRKFNRHLIMRIDDFEKPSQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFG

Query:  WPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGADPDITLHMRMLMNRSVRGLQAAVQCIRKAMLNLTTVSKPRLVLVSDTPNFVKSIMPILGEFA
         PEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSG DPDI+LHMRMLMNRSVRGLQAA+QCIRK + NLTT SKPRLVLVSDTPNFVKSI+P+LGEFA
Subjt:  WPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGADPDITLHMRMLMNRSVRGLQAAVQCIRKAMLNLTTVSKPRLVLVSDTPNFVKSIMPILGEFA

Query:  EVIHFDYEQFRGNISGTHDEFQKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDNLGNNSTGSDFSFLSSFQSNL
        EVIHFDYE FRG ISGTHDEF KLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDNLGNNSTGSDF FLSSFQSNL
Subjt:  EVIHFDYEQFRGNISGTHDEFQKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDNLGNNSTGSDFSFLSSFQSNL

Query:  LREGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPSAWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKKNVVRTIPFIL
        LREGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLP AWWDGLWQSPIPRDIKRMENYGVHLS  G +DEDSLRSFCNAKKNVVRTIPFIL
Subjt:  LREGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPSAWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKKNVVRTIPFIL

XP_022938779.1 uncharacterized protein LOC111444894 isoform X1 [Cucurbita moschata]2.5e-27180.1Show/hide
Query:  MRHGGSRRKRSSSFVRYVVLLCAVAAVIGFFMLNVLMRLEARESESSSDQFGNGDDVEETLVQTGMEGSRSSCATVEQMGEPFKDGVWKESLRISHEDFL
        MRHGGS+RKRSSS VRYVV+LCAV A IGF MLNVL RLE+R SE SSDQFGNGDDVEE+  ++G+EG R SCATVE+MGE F DGVWKESLR+      
Subjt:  MRHGGSRRKRSSSFVRYVVLLCAVAAVIGFFMLNVLMRLEARESESSSDQFGNGDDVEETLVQTGMEGSRSSCATVEQMGEPFKDGVWKESLRISHEDFL

Query:  LSQAVVPKTSKWNFLNLTSYGVPSDEIIHKGGNPCNSCFRIRIGHRNIGASRVRQLPPEQFCKHGFVMGKSSEAGFGNEM--------------------
               +T   N   L                               GASRVR LPPEQFCKHGFVMGKSSEAGFGNEM                    
Subjt:  LSQAVVPKTSKWNFLNLTSYGVPSDEIIHKGGNPCNSCFRIRIGHRNIGASRVRQLPPEQFCKHGFVMGKSSEAGFGNEM--------------------

Query:  ---GKFPFGDYISYSDISFTLNEIKHLWRLNGCVRKFNRHLIMRIDDFEKPSQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFG
           GKFPFGDYISYSDISFTL EIKHLWRL GCVRKF RHLIMRIDDFEKP+QTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFG
Subjt:  ---GKFPFGDYISYSDISFTLNEIKHLWRLNGCVRKFNRHLIMRIDDFEKPSQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFG

Query:  WPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGADPDITLHMRMLMNRSVRGLQAAVQCIRKAMLNLTTVSKPRLVLVSDTPNFVKSIMPILGEFA
         PEVLESRPNVFGELMR+LISPSKDVEEAV SVLKSGADPDI+LHMRMLMNRS+RGLQAAVQCIRKAMLNLTTV KPRLVLVSDTP+FVKSIMPILGEFA
Subjt:  WPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGADPDITLHMRMLMNRSVRGLQAAVQCIRKAMLNLTTVSKPRLVLVSDTPNFVKSIMPILGEFA

Query:  EVIHFDYEQFRGNISGTHDEFQKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDNLGNNSTGSDFSFLSSFQSNL
        EVIHFDYE FRGNIS THDEF KLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRR+GTTYAQLIAALAAAHNLDN GNNSTGSDFSFLSSFQSNL
Subjt:  EVIHFDYEQFRGNISGTHDEFQKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDNLGNNSTGSDFSFLSSFQSNL

Query:  LREGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPSAWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKKNVVRTIPFIL
        L EGLKNQVGWGHIWNRFAGPLSCP QPNQCALTPLLP AWWDGLWQSPIPRDIKRMENYGVHLSS GI+DEDSLRSFCNAKKNVVRTIPFIL
Subjt:  LREGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPSAWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKKNVVRTIPFIL

XP_022992741.1 uncharacterized protein LOC111488989 isoform X1 [Cucurbita maxima]2.7e-27079.93Show/hide
Query:  MRHGGSRRKRSSSFVRYVVLLCAVAAVIGFFMLNVLMRLEARESESSSDQFGNGDDVEETLVQTGMEGSRSSCATVEQMGEPFKDGVWKESLRISHEDFL
        MRHGG +RKRSSS VRYVV+LCAV A IGF MLNVL RLE+R SE SSDQFGNGDDVEE+  ++G+EG R SCATVEQMGE F DGVWKESLR+      
Subjt:  MRHGGSRRKRSSSFVRYVVLLCAVAAVIGFFMLNVLMRLEARESESSSDQFGNGDDVEETLVQTGMEGSRSSCATVEQMGEPFKDGVWKESLRISHEDFL

Query:  LSQAVVPKTSKWNFLNLTSYGVPSDEIIHKGGNPCNSCFRIRIGHRNIGASRVRQLPPEQFCKHGFVMGKSSEAGFGNEM--------------------
               +T   N   L                               GASRVR LPPEQFCKHGFVMGKSSEAGFGNEM                    
Subjt:  LSQAVVPKTSKWNFLNLTSYGVPSDEIIHKGGNPCNSCFRIRIGHRNIGASRVRQLPPEQFCKHGFVMGKSSEAGFGNEM--------------------

Query:  ---GKFPFGDYISYSDISFTLNEIKHLWRLNGCVRKFNRHLIMRIDDFEKPSQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFG
           GKFPFGDYISYSDISFTL EIKHLWRL GCVRKF RHLIMRIDDFEKP+QTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFG
Subjt:  ---GKFPFGDYISYSDISFTLNEIKHLWRLNGCVRKFNRHLIMRIDDFEKPSQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFG

Query:  WPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGADPDITLHMRMLMNRSVRGLQAAVQCIRKAMLNLTTVSKPRLVLVSDTPNFVKSIMPILGEFA
         PEVLESRPNVFGELMR+LISPSKDVEEAV SVLKSGADPDI+LHMRMLMNRS+RGLQAAVQCIRKA+LNLTTV KPRLVLVSDTP+FV SIMPILGEFA
Subjt:  WPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGADPDITLHMRMLMNRSVRGLQAAVQCIRKAMLNLTTVSKPRLVLVSDTPNFVKSIMPILGEFA

Query:  EVIHFDYEQFRGNISGTHDEFQKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDNLGNNSTGSDFSFLSSFQSNL
        EVIHFDYE FRGNIS THDEF KLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRR+GTTYAQLIAALAAAHNLDN GNNSTGSDFSFLSSFQSNL
Subjt:  EVIHFDYEQFRGNISGTHDEFQKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDNLGNNSTGSDFSFLSSFQSNL

Query:  LREGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPSAWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKKNVVRTIPFIL
        L EGLKNQVGWGHIWNRFAGPLSCP QPNQCALTPLLP AWWDGLWQSPIPRDIKRMENYGVHLSS GIVDEDSLRSFCNAKKNVVRTIPFIL
Subjt:  LREGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPSAWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKKNVVRTIPFIL

XP_023549723.1 uncharacterized protein LOC111808143 isoform X1 [Cucurbita pepo subsp. pepo]1.5e-27180.27Show/hide
Query:  MRHGGSRRKRSSSFVRYVVLLCAVAAVIGFFMLNVLMRLEARESESSSDQFGNGDDVEETLVQTGMEGSRSSCATVEQMGEPFKDGVWKESLRISHEDFL
        MRHGGSRRKRSSS VRYVV+LCAV A IGF MLNVL RLE+R SE  SDQFGNGDDVEE+  ++G+EG R SCATVE+MGE F DGVWKESLR+      
Subjt:  MRHGGSRRKRSSSFVRYVVLLCAVAAVIGFFMLNVLMRLEARESESSSDQFGNGDDVEETLVQTGMEGSRSSCATVEQMGEPFKDGVWKESLRISHEDFL

Query:  LSQAVVPKTSKWNFLNLTSYGVPSDEIIHKGGNPCNSCFRIRIGHRNIGASRVRQLPPEQFCKHGFVMGKSSEAGFGNEM--------------------
               +T   N   L                               GASRVR LPPEQFCKHGFVMGKSSEAGFGNEM                    
Subjt:  LSQAVVPKTSKWNFLNLTSYGVPSDEIIHKGGNPCNSCFRIRIGHRNIGASRVRQLPPEQFCKHGFVMGKSSEAGFGNEM--------------------

Query:  ---GKFPFGDYISYSDISFTLNEIKHLWRLNGCVRKFNRHLIMRIDDFEKPSQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFG
           GKFPFGDYISYSDISFTL EIKHLWRL GCVRKF RHLIMRIDDFEKP+QTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFG
Subjt:  ---GKFPFGDYISYSDISFTLNEIKHLWRLNGCVRKFNRHLIMRIDDFEKPSQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFG

Query:  WPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGADPDITLHMRMLMNRSVRGLQAAVQCIRKAMLNLTTVSKPRLVLVSDTPNFVKSIMPILGEFA
         PEVLESRPNVFGELMR+LISPSKDVEEAV SVLKSGADPDI+LHMRMLMNRS+RGLQAAVQCIRKAMLNLTT  KPRLVLVSDTP+FVKSIMPILGEFA
Subjt:  WPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGADPDITLHMRMLMNRSVRGLQAAVQCIRKAMLNLTTVSKPRLVLVSDTPNFVKSIMPILGEFA

Query:  EVIHFDYEQFRGNISGTHDEFQKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDNLGNNSTGSDFSFLSSFQSNL
        EVIHFDYE FRGNISGTHDEF KLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRR+GTTYAQLIAALAAAHNLDNLGNNSTGSDFSFLSSFQSNL
Subjt:  EVIHFDYEQFRGNISGTHDEFQKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDNLGNNSTGSDFSFLSSFQSNL

Query:  LREGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPSAWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKKNVVRTIPFIL
        L EGLKNQVGWGHIWNRFAGPLSCP QPNQCALTPLLP AWWDG WQSPIPRDIKRMENYGVHLSS GIVDEDSLRSFCNAKKNVVRTIPFIL
Subjt:  LREGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPSAWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKKNVVRTIPFIL

XP_038906660.1 uncharacterized protein LOC120092597 isoform X1 [Benincasa hispida]6.3e-28383.47Show/hide
Query:  MRHGGSRRKRSSSFVRYVVLLCAVAAVIGFFMLNVLMRLEARESESSSDQFGNGDDVEETLVQTGMEGSRSSCATVEQMGEPFKDGVWKESLRISHEDFL
        MRHGGSRRKRSSSFVRYVV+LCAV A IGF MLN+LMRLEARESES+SDQFGNGDDVEET  Q+GMEGSRSSCATVEQMGE FKDGVWKESLR+      
Subjt:  MRHGGSRRKRSSSFVRYVVLLCAVAAVIGFFMLNVLMRLEARESESSSDQFGNGDDVEETLVQTGMEGSRSSCATVEQMGEPFKDGVWKESLRISHEDFL

Query:  LSQAVVPKTSKWNFLNLTSYGVPSDEIIHKGGNPCNSCFRIRIGHRNIGASRVRQLPPEQFCKHGFVMGKSSEAGFGNEM--------------------
               +T   N   L                               GASRVRQLPPEQFCKHGFVMGKSSEAGFGNEM                    
Subjt:  LSQAVVPKTSKWNFLNLTSYGVPSDEIIHKGGNPCNSCFRIRIGHRNIGASRVRQLPPEQFCKHGFVMGKSSEAGFGNEM--------------------

Query:  ---GKFPFGDYISYSDISFTLNEIKHLWRLNGCVRKFNRHLIMRIDDFEKPSQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFG
           GKFPFGDYISYSDI+FTL EIKHLWRLNGCVRKFNRHLIMRIDDFEKP+QTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFG
Subjt:  ---GKFPFGDYISYSDISFTLNEIKHLWRLNGCVRKFNRHLIMRIDDFEKPSQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFG

Query:  WPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGADPDITLHMRMLMNRSVRGLQAAVQCIRKAMLNLTTVSKPRLVLVSDTPNFVKSIMPILGEFA
        WPEVLESRPNVFGELMRVLISPSKDVEEAV SVLKSGADPDI+LHMRMLMNRSVRGLQAAVQCIRKAMLNLTTVSKPRLVLVSDTPNFVKSIM ILGEFA
Subjt:  WPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGADPDITLHMRMLMNRSVRGLQAAVQCIRKAMLNLTTVSKPRLVLVSDTPNFVKSIMPILGEFA

Query:  EVIHFDYEQFRGNISGTHDEFQKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDNLGNNSTGSDFSFLSSFQSNL
        EVIHFDYE FRGNISGTHDEF KLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDNLGN+STGSDFSFLSS+QSNL
Subjt:  EVIHFDYEQFRGNISGTHDEFQKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDNLGNNSTGSDFSFLSSFQSNL

Query:  LREGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPSAWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKKNVVRTIPFIL
        LREGLKNQVGWGHIWNRFAGPLSCPSQPNQCA TP+LP AWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKKNVVRTIPFIL
Subjt:  LREGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPSAWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKKNVVRTIPFIL

TrEMBL top hitse value%identityAlignment
A0A5A7TVU3 Uncharacterized protein3.1e-26778.2Show/hide
Query:  MRHGGSRRKRSSSFVRYVVLLCAVAAVIGFFMLNVLMRLEARESESSSDQFGNGDDVEETLVQ-TGMEGSRSSCATVEQMGEPFKDGVWKESLR---ISH
        MRHGGSRRKRSSSFVRY+++LCAV A I F MLNVLMR+EA     SSDQFG+G+  EE   Q TGMEG R+SCATVEQMG+PFKDGV KESLR   I  
Subjt:  MRHGGSRRKRSSSFVRYVVLLCAVAAVIGFFMLNVLMRLEARESESSSDQFGNGDDVEETLVQ-TGMEGSRSSCATVEQMGEPFKDGVWKESLR---ISH

Query:  EDFLLSQAVVPKTSKWNFLNLTSYGVPSDEIIH-----KGGNPC--------NSCFRIRIGHRNIGASRVRQLPPEQFCKHGFVMGKSSEAGFGNEM---
            + +    + SK  F +L   G     I+      K  + C        N    +   HRN+GASRVRQLPPEQFCKHGFVMGKSSEAGFGNEM   
Subjt:  EDFLLSQAVVPKTSKWNFLNLTSYGVPSDEIIH-----KGGNPC--------NSCFRIRIGHRNIGASRVRQLPPEQFCKHGFVMGKSSEAGFGNEM---

Query:  --------------------GKFPFGDYISYSDISFTLNEIKHLWRLNGCVRKFNRHLIMRIDDFEKPSQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF
                            GKFPFGDYISYSDISFTL EIKHLWRLNGCV+KFNR LIMRIDDFEKP+QTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF
Subjt:  --------------------GKFPFGDYISYSDISFTLNEIKHLWRLNGCVRKFNRHLIMRIDDFEKPSQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF

Query:  LKNVHPAMRAAASNLFGWPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGADPDITLHMRMLMNRSVRGLQAAVQCIRKAMLNLTTVSKPRLVLVS
        LKN+HP MRAAASNLFGWPEVLESRPNVFGELMRVLISPSK+VEEAVFSVLKSGADPDI+LHMRMLMNRSVRGLQAAVQCIRKAMLNLT VSKPRLVLVS
Subjt:  LKNVHPAMRAAASNLFGWPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGADPDITLHMRMLMNRSVRGLQAAVQCIRKAMLNLTTVSKPRLVLVS

Query:  DTPNFVKSIMPILGEFAEVIHFDYEQFRGNISGTHDEFQKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDNLGN
        DTPNFVKSI+PIL EFAEVIHFDYE FRGNISGT DEF KLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAA+NLD LGN
Subjt:  DTPNFVKSIMPILGEFAEVIHFDYEQFRGNISGTHDEFQKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDNLGN

Query:  NSTGSDFSFLSSFQSNLLREGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPSAWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKK
         STGSDFSFLSSFQSNLLREGLKNQVGWGHIWNRFAGPLSC SQPNQCA+TPLLP AWWDG+WQSPIPRDIKRMENYGVHL+S G VDED LRSFC AKK
Subjt:  NSTGSDFSFLSSFQSNLLREGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPSAWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKK

Query:  NVVRTIPFIL
        NV+RTIPFIL
Subjt:  NVVRTIPFIL

A0A6J1E7F2 uncharacterized protein LOC1114305931.1e-26779.26Show/hide
Query:  MRHGGSRRKRSSSFVRYVVLLCAVAAVIGFFMLNVLMRLEARESESSSDQFGNGDDVEETLVQTGMEGSRSSCATVEQMGEPFKDGVWKESLRISHEDFL
        MRHGGSR+KR SSF RYVV+LCAV A IGF MLN LMR+EA+ESESSSDQ GNGDDVEE+ V + M+G R SCATVEQMGE FKDGVWKESLR+      
Subjt:  MRHGGSRRKRSSSFVRYVVLLCAVAAVIGFFMLNVLMRLEARESESSSDQFGNGDDVEETLVQTGMEGSRSSCATVEQMGEPFKDGVWKESLRISHEDFL

Query:  LSQAVVPKTSKWNFLNLTSYGVPSDEIIHKGGNPCNSCFRIRIGHRNIGASRVRQLPPEQFCKHGFVMGKSSEAGFGNEM--------------------
               +T   N   L                               GASRVRQLPPEQFCKHGFVMGKSSEAGFGNEM                    
Subjt:  LSQAVVPKTSKWNFLNLTSYGVPSDEIIHKGGNPCNSCFRIRIGHRNIGASRVRQLPPEQFCKHGFVMGKSSEAGFGNEM--------------------

Query:  ---GKFPFGDYISYSDISFTLNEIKHLWRLNGCVRKFNRHLIMRIDDFEKPSQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFG
           GKFPFGDYISYS+++FT+ EIKHLWRL GC+RKFNRHLIMR DDFEKP+QTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFG
Subjt:  ---GKFPFGDYISYSDISFTLNEIKHLWRLNGCVRKFNRHLIMRIDDFEKPSQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFG

Query:  WPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGADPDITLHMRMLMNRSVRGLQAAVQCIRKAMLNLTTVSKPRLVLVSDTPNFVKSIMPILGEFA
         PEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSG DPDI+LHMRMLMNRSVRGLQAA+QCIRK + NLTT SKPRLVLVSDTPNFVKSI+P+LGEFA
Subjt:  WPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGADPDITLHMRMLMNRSVRGLQAAVQCIRKAMLNLTTVSKPRLVLVSDTPNFVKSIMPILGEFA

Query:  EVIHFDYEQFRGNISGTHDEFQKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDNLGNNSTGSDFSFLSSFQSNL
        EVIHFDYE FRG ISGTHDEF KLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDNLGNNSTGSDF FLSSFQSNL
Subjt:  EVIHFDYEQFRGNISGTHDEFQKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDNLGNNSTGSDFSFLSSFQSNL

Query:  LREGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPSAWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKKNVVRTIPFIL
        LREGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLP AWWDGLWQSPIPRDIKRMENYGVHLS  G +DEDSLRSFCNAKKNVVRTIPFIL
Subjt:  LREGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPSAWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKKNVVRTIPFIL

A0A6J1FF37 uncharacterized protein LOC111444894 isoform X11.2e-27180.1Show/hide
Query:  MRHGGSRRKRSSSFVRYVVLLCAVAAVIGFFMLNVLMRLEARESESSSDQFGNGDDVEETLVQTGMEGSRSSCATVEQMGEPFKDGVWKESLRISHEDFL
        MRHGGS+RKRSSS VRYVV+LCAV A IGF MLNVL RLE+R SE SSDQFGNGDDVEE+  ++G+EG R SCATVE+MGE F DGVWKESLR+      
Subjt:  MRHGGSRRKRSSSFVRYVVLLCAVAAVIGFFMLNVLMRLEARESESSSDQFGNGDDVEETLVQTGMEGSRSSCATVEQMGEPFKDGVWKESLRISHEDFL

Query:  LSQAVVPKTSKWNFLNLTSYGVPSDEIIHKGGNPCNSCFRIRIGHRNIGASRVRQLPPEQFCKHGFVMGKSSEAGFGNEM--------------------
               +T   N   L                               GASRVR LPPEQFCKHGFVMGKSSEAGFGNEM                    
Subjt:  LSQAVVPKTSKWNFLNLTSYGVPSDEIIHKGGNPCNSCFRIRIGHRNIGASRVRQLPPEQFCKHGFVMGKSSEAGFGNEM--------------------

Query:  ---GKFPFGDYISYSDISFTLNEIKHLWRLNGCVRKFNRHLIMRIDDFEKPSQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFG
           GKFPFGDYISYSDISFTL EIKHLWRL GCVRKF RHLIMRIDDFEKP+QTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFG
Subjt:  ---GKFPFGDYISYSDISFTLNEIKHLWRLNGCVRKFNRHLIMRIDDFEKPSQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFG

Query:  WPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGADPDITLHMRMLMNRSVRGLQAAVQCIRKAMLNLTTVSKPRLVLVSDTPNFVKSIMPILGEFA
         PEVLESRPNVFGELMR+LISPSKDVEEAV SVLKSGADPDI+LHMRMLMNRS+RGLQAAVQCIRKAMLNLTTV KPRLVLVSDTP+FVKSIMPILGEFA
Subjt:  WPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGADPDITLHMRMLMNRSVRGLQAAVQCIRKAMLNLTTVSKPRLVLVSDTPNFVKSIMPILGEFA

Query:  EVIHFDYEQFRGNISGTHDEFQKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDNLGNNSTGSDFSFLSSFQSNL
        EVIHFDYE FRGNIS THDEF KLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRR+GTTYAQLIAALAAAHNLDN GNNSTGSDFSFLSSFQSNL
Subjt:  EVIHFDYEQFRGNISGTHDEFQKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDNLGNNSTGSDFSFLSSFQSNL

Query:  LREGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPSAWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKKNVVRTIPFIL
        L EGLKNQVGWGHIWNRFAGPLSCP QPNQCALTPLLP AWWDGLWQSPIPRDIKRMENYGVHLSS GI+DEDSLRSFCNAKKNVVRTIPFIL
Subjt:  LREGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPSAWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKKNVVRTIPFIL

A0A6J1FKR5 uncharacterized protein LOC111444894 isoform X24.0e-26779.83Show/hide
Query:  MRHGGSRRKRSSSFVRYVVLLCAVAAVIGFFMLNVLMRLEARESESSSDQFGNGDDVEETLVQTGMEGSRSSCATVEQMGEPFKDGVWKESLRISHEDFL
        MRHGGS+RKRSSS VRYVV+LCAV A IGF MLNVL RLE+R SE SSDQFGNGDDVEE+  ++G+EG R SCATVE+MGE F DGVWKESLR+      
Subjt:  MRHGGSRRKRSSSFVRYVVLLCAVAAVIGFFMLNVLMRLEARESESSSDQFGNGDDVEETLVQTGMEGSRSSCATVEQMGEPFKDGVWKESLRISHEDFL

Query:  LSQAVVPKTSKWNFLNLTSYGVPSDEIIHKGGNPCNSCFRIRIGHRNIGASRVRQLPPEQFCKHGFVMGKSSEAGFGNEM--------------------
               +T   N   L                               GASRVR LPPEQFCKHGFVMGKSSEAGFGNEM                    
Subjt:  LSQAVVPKTSKWNFLNLTSYGVPSDEIIHKGGNPCNSCFRIRIGHRNIGASRVRQLPPEQFCKHGFVMGKSSEAGFGNEM--------------------

Query:  ---GKFPFGDYISYSDISFTLNEIKHLWRLNGCVRKFNRHLIMRIDDFEKPSQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFG
           GKFPFGDYISYSDISFTL EIKHLWRL GCVRKF RHLIMRIDDFEKP+QTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFG
Subjt:  ---GKFPFGDYISYSDISFTLNEIKHLWRLNGCVRKFNRHLIMRIDDFEKPSQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFG

Query:  WPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGADPDITLHMRMLMNRSVRGLQAAVQCIRKAMLNLTTVSKPRLVLVSDTPNFVKSIMPILGEFA
         PEVLESRPNVFGELMR+LISPSKDVEEAV SVLKSGADPDI+LHMRMLMNRS+RGLQAAVQCIRKAMLNLTTV KPRLVLVSDTP+FVKSIMPILGEFA
Subjt:  WPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGADPDITLHMRMLMNRSVRGLQAAVQCIRKAMLNLTTVSKPRLVLVSDTPNFVKSIMPILGEFA

Query:  EVIHFDYEQFRGNISGTHDEFQKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDNLGNNSTGSDFSFLSSFQSNL
        EVIHFDYE FRGNIS THDEF KLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRR+GTTYAQLIAALAAAHNLDN GNNSTGSDFSFLSSFQSNL
Subjt:  EVIHFDYEQFRGNISGTHDEFQKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDNLGNNSTGSDFSFLSSFQSNL

Query:  LREGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPSAWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKKNV
        L EGLKNQVGWGHIWNRFAGPLSCP QPNQCALTPLLP AWWDGLWQSPIPRDIKRMENYGVHLSS GI+DEDSLRSFCNAKKNV
Subjt:  LREGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPSAWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKKNV

A0A6J1JUE3 uncharacterized protein LOC111488989 isoform X11.3e-27079.93Show/hide
Query:  MRHGGSRRKRSSSFVRYVVLLCAVAAVIGFFMLNVLMRLEARESESSSDQFGNGDDVEETLVQTGMEGSRSSCATVEQMGEPFKDGVWKESLRISHEDFL
        MRHGG +RKRSSS VRYVV+LCAV A IGF MLNVL RLE+R SE SSDQFGNGDDVEE+  ++G+EG R SCATVEQMGE F DGVWKESLR+      
Subjt:  MRHGGSRRKRSSSFVRYVVLLCAVAAVIGFFMLNVLMRLEARESESSSDQFGNGDDVEETLVQTGMEGSRSSCATVEQMGEPFKDGVWKESLRISHEDFL

Query:  LSQAVVPKTSKWNFLNLTSYGVPSDEIIHKGGNPCNSCFRIRIGHRNIGASRVRQLPPEQFCKHGFVMGKSSEAGFGNEM--------------------
               +T   N   L                               GASRVR LPPEQFCKHGFVMGKSSEAGFGNEM                    
Subjt:  LSQAVVPKTSKWNFLNLTSYGVPSDEIIHKGGNPCNSCFRIRIGHRNIGASRVRQLPPEQFCKHGFVMGKSSEAGFGNEM--------------------

Query:  ---GKFPFGDYISYSDISFTLNEIKHLWRLNGCVRKFNRHLIMRIDDFEKPSQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFG
           GKFPFGDYISYSDISFTL EIKHLWRL GCVRKF RHLIMRIDDFEKP+QTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFG
Subjt:  ---GKFPFGDYISYSDISFTLNEIKHLWRLNGCVRKFNRHLIMRIDDFEKPSQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFG

Query:  WPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGADPDITLHMRMLMNRSVRGLQAAVQCIRKAMLNLTTVSKPRLVLVSDTPNFVKSIMPILGEFA
         PEVLESRPNVFGELMR+LISPSKDVEEAV SVLKSGADPDI+LHMRMLMNRS+RGLQAAVQCIRKA+LNLTTV KPRLVLVSDTP+FV SIMPILGEFA
Subjt:  WPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGADPDITLHMRMLMNRSVRGLQAAVQCIRKAMLNLTTVSKPRLVLVSDTPNFVKSIMPILGEFA

Query:  EVIHFDYEQFRGNISGTHDEFQKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDNLGNNSTGSDFSFLSSFQSNL
        EVIHFDYE FRGNIS THDEF KLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRR+GTTYAQLIAALAAAHNLDN GNNSTGSDFSFLSSFQSNL
Subjt:  EVIHFDYEQFRGNISGTHDEFQKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDNLGNNSTGSDFSFLSSFQSNL

Query:  LREGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPSAWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKKNVVRTIPFIL
        L EGLKNQVGWGHIWNRFAGPLSCP QPNQCALTPLLP AWWDGLWQSPIPRDIKRMENYGVHLSS GIVDEDSLRSFCNAKKNVVRTIPFIL
Subjt:  LREGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPSAWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKKNVVRTIPFIL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G26950.1 unknown protein7.4e-17352.07Show/hide
Query:  MRHGGSRRKRSSSFVRYVVLLCAVAAVIGFFMLNVLMR--------LEARESESSSDQ---FGNGDDVEETLVQTGMEGSRSSCATVEQMGEPFKDGVWK
        M+ GG+RRKR        +LL +V   IGF +L + +R        ++  + ES S++   + N   + E +V    +G++  CATVE+MG  F  G   
Subjt:  MRHGGSRRKRSSSFVRYVVLLCAVAAVIGFFMLNVLMR--------LEARESESSSDQ---FGNGDDVEETLVQTGMEGSRSSCATVEQMGEPFKDGVWK

Query:  ESLRISHEDFLLSQAVVPKTSKWNFLNLTSYGVPSDEIIHKGGNPCNSCFRIRIGHRNIGASRVRQLPPEQFCKHGFVMGKSSEAGFGNEM---------
        +SLR+                               ++IH+        F+I       GAS +R+LPPEQFC+HG+V+GK++EAGFGNEM         
Subjt:  ESLRISHEDFLLSQAVVPKTSKWNFLNLTSYGVPSDEIIHKGGNPCNSCFRIRIGHRNIGASRVRQLPPEQFCKHGFVMGKSSEAGFGNEM---------

Query:  --------------GKFPFGDYISYSDISFTLNEIKHLWRLNGCVRKFNRHLIMRIDDFEKPSQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHP
                      GK+PFGDYI+YS+ +FT++E+KHLWR NGCV+K+ R L+MR+DDFEKP+++NVLCSNWK+WE  IIWFQGTTDAVAAQFFLKNVHP
Subjt:  --------------GKFPFGDYISYSDISFTLNEIKHLWRLNGCVRKFNRHLIMRIDDFEKPSQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHP

Query:  AMRAAASNLFGWPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGADPDITLHMRMLMNRSVRGLQAAVQCIRKAMLNLTTVSKPRLVLVSDTPNFV
         MRAAA  LFG       R NVFGELM  LISP+KDV+EAV  VL    DPDI++HMRMLM++SVR ++AA+ C+ KA +N   +  PR+V+VSDTP+ V
Subjt:  AMRAAASNLFGWPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGADPDITLHMRMLMNRSVRGLQAAVQCIRKAMLNLTTVSKPRLVLVSDTPNFV

Query:  KSIMPILGEFAEVIHFDYEQFRGNISGTHDEFQKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDNLGNNSTGSD
        K I   +   AEV+HFDY+ FRG+I+        LDFR+KDWGP+PRWVAFVDFFLA RAKHAVISGA+RRVGTTYAQL+AALAAA++L    + S+ S 
Subjt:  KSIMPILGEFAEVIHFDYEQFRGNISGTHDEFQKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDNLGNNSTGSD

Query:  FSFLSSFQSNLLREGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPSAWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKKNVVRTI
        F+FLSSFQSNLL +GLKNQVGWGH+WNR+AGPLSCP QPNQCA TPL P  WWDG+WQSPIPRD +R+  +G+ LS  G V+ED   ++C+AKK  V T+
Subjt:  FSFLSSFQSNLLREGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPSAWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKKNVVRTI

Query:  PFI
          I
Subjt:  PFI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGCATGGTGGATCCAGGAGGAAGAGATCCTCATCCTTTGTGCGCTATGTCGTCCTTCTATGCGCTGTCGCTGCTGTAATTGGATTCTTTATGCTCAATGTTCTTAT
GAGGCTGGAAGCCCGAGAATCGGAATCCAGCTCTGATCAGTTTGGTAATGGCGACGACGTTGAGGAAACTCTGGTTCAGACTGGAATGGAAGGAAGCCGGAGCTCCTGCG
CCACGGTGGAGCAGATGGGAGAGCCCTTTAAAGATGGTGTCTGGAAGGAAAGCCTGAGAATTTCACATGAGGACTTTTTGTTAAGCCAAGCAGTTGTACCGAAGACCTCA
AAGTGGAACTTCTTGAATCTTACAAGTTATGGTGTACCCTCCGATGAAATAATCCACAAGGGAGGAAACCCATGTAATTCGTGCTTTAGAATTAGAATAGGTCATAGGAA
CATTGGTGCTTCAAGAGTGCGACAGCTTCCTCCTGAGCAGTTTTGCAAACATGGTTTTGTCATGGGCAAATCCTCCGAGGCAGGCTTTGGGAATGAGATGGGCAAGTTTC
CTTTTGGGGACTACATTTCTTATTCTGATATCTCGTTTACCTTGAATGAAATCAAGCATTTGTGGAGACTTAACGGTTGTGTTAGGAAATTCAATAGGCATTTGATTATG
CGAATTGATGATTTTGAAAAGCCTTCACAGACAAATGTTCTATGCAGTAATTGGAAGGAATGGGAGCATCCCATCATATGGTTCCAAGGTACAACTGATGCTGTGGCTGC
TCAATTTTTCTTGAAGAATGTACATCCCGCTATGAGGGCTGCCGCTTCTAATTTATTTGGATGGCCAGAGGTTTTAGAATCTAGACCTAATGTATTTGGAGAGCTGATGA
GAGTTCTTATATCTCCTTCAAAGGATGTTGAAGAAGCAGTGTTCTCGGTCCTTAAAAGTGGGGCTGATCCTGATATTACCTTGCACATGCGGATGCTTATGAATAGGTCT
GTCAGAGGTTTACAGGCCGCAGTGCAGTGCATCAGAAAAGCCATGCTTAATCTAACCACTGTCTCGAAACCCAGATTGGTTTTAGTATCAGATACCCCAAATTTTGTAAA
AAGTATCATGCCCATCTTAGGCGAATTTGCAGAGGTTATTCATTTTGATTATGAACAGTTCAGAGGAAACATCTCTGGAACTCACGATGAATTCCAGAAATTGGACTTCA
GAGTGAAGGACTGGGGTCCATCGCCAAGATGGGTTGCTTTTGTAGATTTCTTTCTTGCATCTCGTGCCAAACATGCTGTTATATCTGGTGCTCACCGGCGTGTAGGTACT
ACCTACGCTCAGCTAATTGCGGCATTGGCTGCAGCACACAATCTAGACAATCTCGGGAACAATTCTACTGGTTCAGACTTTTCATTCTTGAGTAGCTTCCAAAGTAATTT
GCTGAGAGAAGGTTTAAAGAACCAGGTTGGCTGGGGGCATATCTGGAACAGATTTGCAGGTCCTTTAAGCTGTCCGAGCCAGCCTAATCAGTGTGCCTTAACCCCTCTTC
TCCCTTCGGCTTGGTGGGATGGACTTTGGCAATCTCCCATTCCACGAGACATTAAAAGAATGGAGAATTATGGAGTTCATTTATCGAGCTTGGGCATTGTTGATGAAGAT
AGTCTACGATCATTCTGTAACGCAAAGAAGAATGTCGTGAGGACTATCCCTTTCATACTATAG
mRNA sequenceShow/hide mRNA sequence
GGGAAACCCCTAGCTTTAAGCTGGATTAGATATGAAGTATTAAACCACGAAGTAGAAAAATTCACACTGAGCTTCAATTCGTATTTCCGTTTGGGGAATGATAATCTGAA
TCTCTAGCAGCAGATCGTGATAGCAAAGGAAGAATGAGGCATGGTGGATCCAGGAGGAAGAGATCCTCATCCTTTGTGCGCTATGTCGTCCTTCTATGCGCTGTCGCTGC
TGTAATTGGATTCTTTATGCTCAATGTTCTTATGAGGCTGGAAGCCCGAGAATCGGAATCCAGCTCTGATCAGTTTGGTAATGGCGACGACGTTGAGGAAACTCTGGTTC
AGACTGGAATGGAAGGAAGCCGGAGCTCCTGCGCCACGGTGGAGCAGATGGGAGAGCCCTTTAAAGATGGTGTCTGGAAGGAAAGCCTGAGAATTTCACATGAGGACTTT
TTGTTAAGCCAAGCAGTTGTACCGAAGACCTCAAAGTGGAACTTCTTGAATCTTACAAGTTATGGTGTACCCTCCGATGAAATAATCCACAAGGGAGGAAACCCATGTAA
TTCGTGCTTTAGAATTAGAATAGGTCATAGGAACATTGGTGCTTCAAGAGTGCGACAGCTTCCTCCTGAGCAGTTTTGCAAACATGGTTTTGTCATGGGCAAATCCTCCG
AGGCAGGCTTTGGGAATGAGATGGGCAAGTTTCCTTTTGGGGACTACATTTCTTATTCTGATATCTCGTTTACCTTGAATGAAATCAAGCATTTGTGGAGACTTAACGGT
TGTGTTAGGAAATTCAATAGGCATTTGATTATGCGAATTGATGATTTTGAAAAGCCTTCACAGACAAATGTTCTATGCAGTAATTGGAAGGAATGGGAGCATCCCATCAT
ATGGTTCCAAGGTACAACTGATGCTGTGGCTGCTCAATTTTTCTTGAAGAATGTACATCCCGCTATGAGGGCTGCCGCTTCTAATTTATTTGGATGGCCAGAGGTTTTAG
AATCTAGACCTAATGTATTTGGAGAGCTGATGAGAGTTCTTATATCTCCTTCAAAGGATGTTGAAGAAGCAGTGTTCTCGGTCCTTAAAAGTGGGGCTGATCCTGATATT
ACCTTGCACATGCGGATGCTTATGAATAGGTCTGTCAGAGGTTTACAGGCCGCAGTGCAGTGCATCAGAAAAGCCATGCTTAATCTAACCACTGTCTCGAAACCCAGATT
GGTTTTAGTATCAGATACCCCAAATTTTGTAAAAAGTATCATGCCCATCTTAGGCGAATTTGCAGAGGTTATTCATTTTGATTATGAACAGTTCAGAGGAAACATCTCTG
GAACTCACGATGAATTCCAGAAATTGGACTTCAGAGTGAAGGACTGGGGTCCATCGCCAAGATGGGTTGCTTTTGTAGATTTCTTTCTTGCATCTCGTGCCAAACATGCT
GTTATATCTGGTGCTCACCGGCGTGTAGGTACTACCTACGCTCAGCTAATTGCGGCATTGGCTGCAGCACACAATCTAGACAATCTCGGGAACAATTCTACTGGTTCAGA
CTTTTCATTCTTGAGTAGCTTCCAAAGTAATTTGCTGAGAGAAGGTTTAAAGAACCAGGTTGGCTGGGGGCATATCTGGAACAGATTTGCAGGTCCTTTAAGCTGTCCGA
GCCAGCCTAATCAGTGTGCCTTAACCCCTCTTCTCCCTTCGGCTTGGTGGGATGGACTTTGGCAATCTCCCATTCCACGAGACATTAAAAGAATGGAGAATTATGGAGTT
CATTTATCGAGCTTGGGCATTGTTGATGAAGATAGTCTACGATCATTCTGTAACGCAAAGAAGAATGTCGTGAGGACTATCCCTTTCATACTATAGTCATTTTATGCTCC
TGTCTCTCATGTAAGCTCCATCACTAACATAGTTCTTATAACTGTTCCCAAACTGTGCTTATTTAGTCAACTTTGGCACAGTCCAGGAATTTGTTTCTGTTAAATCAAAT
AGCCCAATAAATAGTGTATTCTTCTATTTATTGGGCCATTTGATTTACAAAACAAGATCTTATATGCCGTTAAGAATGGCCAATTCCATTTTCTCGATTTCCATCAGATC
AAATAGGTTTAAACAGACATTTTCCCATTTTACTCCCGAGTTAAAAAATGTGCATATACAATAGAAAAATTATCATATTACACATTTTTTTCTG
Protein sequenceShow/hide protein sequence
MRHGGSRRKRSSSFVRYVVLLCAVAAVIGFFMLNVLMRLEARESESSSDQFGNGDDVEETLVQTGMEGSRSSCATVEQMGEPFKDGVWKESLRISHEDFLLSQAVVPKTS
KWNFLNLTSYGVPSDEIIHKGGNPCNSCFRIRIGHRNIGASRVRQLPPEQFCKHGFVMGKSSEAGFGNEMGKFPFGDYISYSDISFTLNEIKHLWRLNGCVRKFNRHLIM
RIDDFEKPSQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPAMRAAASNLFGWPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGADPDITLHMRMLMNRS
VRGLQAAVQCIRKAMLNLTTVSKPRLVLVSDTPNFVKSIMPILGEFAEVIHFDYEQFRGNISGTHDEFQKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGT
TYAQLIAALAAAHNLDNLGNNSTGSDFSFLSSFQSNLLREGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPSAWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDED
SLRSFCNAKKNVVRTIPFIL