; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC05g0815 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC05g0815
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description30S ribosomal protein S1-like
Genome locationMC05:7012692..7018086
RNA-Seq ExpressionMC05g0815
SyntenyMC05g0815
Gene Ontology termsGO:0071840 - cellular component organization or biogenesis (biological process)
GO:0005840 - ribosome (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR003029 - S1 domain
IPR012340 - Nucleic acid-binding, OB-fold
IPR022967 - RNA-binding domain, S1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582039.1 30S ribosomal protein S1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]6.89e-21281.12Show/hide
Query:  MPICTAT-LGSLSPYSFLSHFASTDSSPS--SILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEG
        MPI  AT LGSLS +SFLS  ASTD+S S  S     H SPSKR  NF  R+SL   P+PIAGVL++ SP+SPES+ R+RRS DWK AREYLDSGFI++G
Subjt:  MPICTAT-LGSLSPYSFLSHFASTDSSPS--SILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEG

Query:  RIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAF
        RIEGSNAGGLLVRF+SLVGFLPFP LSP+HSCKEPYKSIQDIAKSL+GS++PVK+IQADE+NK LIFSEKEAAWSKFSE+V VG+VY+ARVGSVEDYGAF
Subjt:  RIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAF

Query:  VHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEI
        VHLRFSDGLYHLTGLVH+SEVSWDLVQDVRDILSEGDEVRVKVI+VDR        +KSRITLSIKQLEEDPLLETLDKVIPQD SAEPDSFGP+SDSEI
Subjt:  VHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEI

Query:  IPLPGLETIFEELLQEDGLNISIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP
        IPLPGL+TIFEELLQE+G    IEDV +NRQGFEKRVVSQDLQLWLSNAPPVEKKF LLARAGRQVQEIQLTTSLDQEGIK ALQ VLERVP
Subjt:  IPLPGLETIFEELLQEDGLNISIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP

XP_022145209.1 uncharacterized protein LOC111014714 [Momordica charantia]3.59e-26296.92Show/hide
Query:  MPICTATLGSLSPYSFLSHFASTDSSPSSILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRIE
        MPICTATLGSLSPYSFLSHFASTDSSPSSILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRIE
Subjt:  MPICTATLGSLSPYSFLSHFASTDSSPSSILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRIE

Query:  GSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVHL
        GSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVHL
Subjt:  GSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVHL

Query:  RFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPL
        RFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDR        EKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPL
Subjt:  RFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPL

Query:  PGLETIFEELLQEDGLNISIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP
        PGLETIFEELLQEDG    IEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP
Subjt:  PGLETIFEELLQEDGLNISIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP

XP_022955545.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111457527 [Cucurbita moschata]1.23e-21676.14Show/hide
Query:  MFIRHEKIRWVLIGFAAVGEFPFGAQS---------AIAIVIVIINMPICTAT-LGSLSPYSFLSHFASTDSS----PSSILTHNHNSPSKRSPNFPPRL
        M +RH    ++ IGFAAV  FPF AQS         AI+  +   NMPI  AT LGSLS +SFLS  AS D+S     S +LTH   SPSKR  NF  R+
Subjt:  MFIRHEKIRWVLIGFAAVGEFPFGAQS---------AIAIVIVIINMPICTAT-LGSLSPYSFLSHFASTDSS----PSSILTHNHNSPSKRSPNFPPRL

Query:  SLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVP
        SL   P+PIAGVL++ SP+SPES+ R+RRS DWK AREYLDSGFI++GRIEGSNAGGLLVRF+SLVGFLPFP LSP+HSCKEPYKSIQDIAKSL+GS++P
Subjt:  SLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVP

Query:  VKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLS
        VK+IQADE+NK LIFSEKEAAWSKFSE+V VG+VY+ARVGS+EDYGAFVHLRFSDGLYHLTGLVH+SEVSWDLVQDVRDILSEGDEVRVKVI+VDR    
Subjt:  VKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLS

Query:  GSFLEKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGLETIFEELLQEDGLNISIEDVRVNRQGFEKRVVSQDLQLWLSNAPPV
            +KSRITLSIKQLEEDPLLETLDKVIPQD SAEPDSFGP+SDSEIIPLPGL+TIFEELLQE+G    IEDV +NRQGFEKRVVSQDLQLWLSNAPPV
Subjt:  GSFLEKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGLETIFEELLQEDGLNISIEDVRVNRQGFEKRVVSQDLQLWLSNAPPV

Query:  EKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP
        EKKF LLARAGRQVQEIQLTTSLDQEGIK ALQ VLERVP
Subjt:  EKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP

XP_023526022.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111789621 [Cucurbita pepo subsp. pepo]1.51e-21776.48Show/hide
Query:  MFIRHEKIRWVLIGFAAVGEFPFGAQS---------AIAIVIVIINMPICTAT-LGSLSPYSFLSHFASTDSSPSSILTH--NHNSPSKRSPNFPPRLSL
        M + H    ++ IGFAAV  FPF AQS         AI+  +   NMPI  AT LGSLS +SFL+  ASTD+S S   +    H SPSKR  NF  R+SL
Subjt:  MFIRHEKIRWVLIGFAAVGEFPFGAQS---------AIAIVIVIINMPICTAT-LGSLSPYSFLSHFASTDSSPSSILTH--NHNSPSKRSPNFPPRLSL

Query:  PANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVK
           PDPIAGVL++ SP+SPES+ R+RRS DWK AREYLDSGFI++GRIEGSNAGGLLVRF+SLVGFLPFP LSP+HSCKEPYKSIQDIAKSL+GS++PVK
Subjt:  PANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVK

Query:  IIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGS
        +IQADE+NK LIFSEKEAAWSKFSEQV VG+VY+ARVGSVEDYGAFVHLRFSDGLYHLTGLVH+SEVSWDLVQDVRDILSEGDEVRVKVI+VDR      
Subjt:  IIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGS

Query:  FLEKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGLETIFEELLQEDGLNISIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEK
          +KSRITLSIKQLEEDPLLETLDKVIPQD SAEPDSFGP+SDSEIIPLPGL+TIFEELLQE+G    IEDV +NRQGFEKRVVSQDLQLWLSNAPPVEK
Subjt:  FLEKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGLETIFEELLQEDGLNISIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEK

Query:  KFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP
        KF LLARAGRQVQEIQLTTSLDQEGIK ALQ VLERVP
Subjt:  KFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP

XP_038897871.1 30S ribosomal protein S1 homolog B [Benincasa hispida]2.35e-21482.82Show/hide
Query:  MPICTATLGSLSPYSFLSHFAST-DSSPSSILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRI
        MPI  ATLGS+  +SFLS  AST D + S+       SPSKR  NFP R+SL   PDPIAGVLDT SP+SPESL R+RRS DWKAAREYLD+GFIYEGRI
Subjt:  MPICTATLGSLSPYSFLSHFAST-DSSPSSILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRI

Query:  EGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVH
        EGSNAGGLLVRF+SL+GFLPFPQLSPSHSCKEPYKSIQDIAKSL+GS++ VK+IQADERNKKLIFSEKEAAWSKFSEQV VG+VY+ARVGSVEDYGAFVH
Subjt:  EGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVH

Query:  LRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIP
        LRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEV VKVINVDR        +KSRITLSIKQLEEDPLLETLDKVIPQ GSAEPDSFGP+SDSEI+P
Subjt:  LRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIP

Query:  LPGLETIFEELLQEDGLNISIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP
        LPGLETI EELLQEDG    I D+ VNRQGFEKRVVSQDLQLWLSNAPPVEKKF LLARAGRQVQEIQLTTSLDQEGIK ALQ VLERVP
Subjt:  LPGLETIFEELLQEDGLNISIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP

TrEMBL top hitse value%identityAlignment
A0A1S3BXL4 30S ribosomal protein S1 isoform X11.44e-20775.23Show/hide
Query:  MFIRHEKIRWVLIGFAA-----VGE-FPFGAQSAIAIVIVI-INMPICTATLGSLSPYSFLSHFASTD-------SSPSSILTHNHNSPSKRSPNFPPRL
        MF+RH   RWV I   +     + E FP     AI   + I I MPI  AT+ S+S +SFLS  AST        SS SSIL     SPSKR   FP R+
Subjt:  MFIRHEKIRWVLIGFAA-----VGE-FPFGAQSAIAIVIVI-INMPICTATLGSLSPYSFLSHFASTD-------SSPSSILTHNHNSPSKRSPNFPPRL

Query:  SLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVP
        SL   PDPIAGVLDT    SPES+ R+RRS DWKAAREYLDSGFIYEGRIEGSNAGGLLVRF+SL+GFLPFPQLSPSHSCKEP KSIQDIAKSL GS++ 
Subjt:  SLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVP

Query:  VKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLS
        VK+IQADERNKKLIFSEKEA WSKFS QV VG+VY+A+VGS+EDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEV VKVINVDR    
Subjt:  VKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLS

Query:  GSFLEKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGLETIFEELLQEDGLNISIEDVRVNRQGFEKRVVSQDLQLWLSNAPPV
            +KSRITLSI+QLEEDPLLETLDKVIPQD SAEPDSFGP+SDSEIIPLPGL TI EEL QE+G    I DVRVNRQGFEKRVVSQDLQLWLSNAPP+
Subjt:  GSFLEKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGLETIFEELLQEDGLNISIEDVRVNRQGFEKRVVSQDLQLWLSNAPPV

Query:  EKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP
        EKKF LLARAGRQVQEIQLTTSLDQEGIK ALQ VLERVP
Subjt:  EKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP

A0A5A7TNY0 30S ribosomal protein S1 isoform X11.45e-20480.1Show/hide
Query:  MPICTATLGSLSPYSFLSHFASTD-------SSPSSILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGF
        MPI  AT+ S+S +SFLS  AST        SS SSIL     SPSKR   FP R+SL   PDPIAGVLDT    SPES+ R+RRS DWKAAREYLDSGF
Subjt:  MPICTATLGSLSPYSFLSHFASTD-------SSPSSILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGF

Query:  IYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKI-IQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVE
        IYEGRIEGSNAGGLLVRF+SL+GFLPFPQLSPSHSCKEP KSIQDIAKSL GS++ VK+ IQADERNKKLIFSEKEA WSKFS QV VG+VY+A+VGS+E
Subjt:  IYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKI-IQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVE

Query:  DYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPR
        DYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEV VKVINVDRQV      +KSRITLSI+QLEEDPLLETLDKVIPQD SAEPDSFGP+
Subjt:  DYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPR

Query:  SDSEIIPLPGLETIFEELLQEDGLNISIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP
        SDSEIIPLPGL TI EEL QE+G    I DVRVNRQGFEKRVVSQDLQLWLSNAPP+EKKF LLARAGRQVQEIQLTTSLDQEGIK ALQ VLERVP
Subjt:  SDSEIIPLPGLETIFEELLQEDGLNISIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP

A0A6J1CUJ6 uncharacterized protein LOC1110147141.74e-26296.92Show/hide
Query:  MPICTATLGSLSPYSFLSHFASTDSSPSSILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRIE
        MPICTATLGSLSPYSFLSHFASTDSSPSSILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRIE
Subjt:  MPICTATLGSLSPYSFLSHFASTDSSPSSILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRIE

Query:  GSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVHL
        GSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVHL
Subjt:  GSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVHL

Query:  RFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPL
        RFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDR        EKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPL
Subjt:  RFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPL

Query:  PGLETIFEELLQEDGLNISIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP
        PGLETIFEELLQEDG    IEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP
Subjt:  PGLETIFEELLQEDGLNISIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP

A0A6J1GVD7 LOW QUALITY PROTEIN: uncharacterized protein LOC1114575275.97e-21776.14Show/hide
Query:  MFIRHEKIRWVLIGFAAVGEFPFGAQS---------AIAIVIVIINMPICTAT-LGSLSPYSFLSHFASTDSS----PSSILTHNHNSPSKRSPNFPPRL
        M +RH    ++ IGFAAV  FPF AQS         AI+  +   NMPI  AT LGSLS +SFLS  AS D+S     S +LTH   SPSKR  NF  R+
Subjt:  MFIRHEKIRWVLIGFAAVGEFPFGAQS---------AIAIVIVIINMPICTAT-LGSLSPYSFLSHFASTDSS----PSSILTHNHNSPSKRSPNFPPRL

Query:  SLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVP
        SL   P+PIAGVL++ SP+SPES+ R+RRS DWK AREYLDSGFI++GRIEGSNAGGLLVRF+SLVGFLPFP LSP+HSCKEPYKSIQDIAKSL+GS++P
Subjt:  SLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVP

Query:  VKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLS
        VK+IQADE+NK LIFSEKEAAWSKFSE+V VG+VY+ARVGS+EDYGAFVHLRFSDGLYHLTGLVH+SEVSWDLVQDVRDILSEGDEVRVKVI+VDR    
Subjt:  VKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLS

Query:  GSFLEKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGLETIFEELLQEDGLNISIEDVRVNRQGFEKRVVSQDLQLWLSNAPPV
            +KSRITLSIKQLEEDPLLETLDKVIPQD SAEPDSFGP+SDSEIIPLPGL+TIFEELLQE+G    IEDV +NRQGFEKRVVSQDLQLWLSNAPPV
Subjt:  GSFLEKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGLETIFEELLQEDGLNISIEDVRVNRQGFEKRVVSQDLQLWLSNAPPV

Query:  EKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP
        EKKF LLARAGRQVQEIQLTTSLDQEGIK ALQ VLERVP
Subjt:  EKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP

A0A6J1IUF3 uncharacterized protein LOC1114794061.35e-21180.87Show/hide
Query:  MPICTAT-LGSLSPYSFLSHFASTDSSPS--SILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEG
        MPI  AT LGSLS +SFLS  ASTD+S S  S     H SPSKR  NF  R+SL   P+PIAGVL++ SP+SPES+ R+RRS DWK AREYLDSGFI++G
Subjt:  MPICTAT-LGSLSPYSFLSHFASTDSSPS--SILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEG

Query:  RIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAF
        RIEGSNAGGLLVRF+SLVGFLPFP LSP+HSCKEPYKSIQDIAKSL+GS++PVK+IQADE+NK LIFSEKEAAWSKFSEQV VG+VY+ARVGSVEDYGAF
Subjt:  RIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAF

Query:  VHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEI
        VHLRFSDG YHLTGLVH+SEVSWDLVQDVRDILSEGDEVRVKV++VDR        +KSRITLSIKQLEEDPLLETLDKVIPQD SAEPDSFGP+SDSEI
Subjt:  VHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEI

Query:  IPLPGLETIFEELLQEDGLNISIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP
        IPLPGL+TIFEELLQE+G    IEDV +NRQGFEKRVVSQDLQLWLSNAPPVEKKF LLARAGRQVQEIQLTTSLDQEGIK ALQ VLERVP
Subjt:  IPLPGLETIFEELLQEDGLNISIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP

SwissProt top hitse value%identityAlignment
O33698 30S ribosomal protein S12.1e-1227.09Show/hide
Query:  WKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVG
        W    E  + G   + ++ GSN GG+      L  F+P   L+            ++   SL G  + V  ++ +  +KKL+ SE++AA +    ++ VG
Subjt:  WKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVG

Query:  EVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLEEDPLLETLDKVIPQD
        ++ + +V  ++ +G FV L  +      T L+ ++++S   V DV  I   GD ++  V+ +D          K RI+LS K LE  P  E L+ V    
Subjt:  EVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLEEDPLLETLDKVIPQD

Query:  GSA
         SA
Subjt:  GSA

P29344 30S ribosomal protein S1, chloroplastic4.8e-2030.85Show/hide
Query:  WKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVG
        W+  R+      + +G+I G+N GG++     L GF+PF Q+S   S +E           LL   +P+K ++ DE   +L+ S ++ A +    Q+ +G
Subjt:  WKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVG

Query:  EVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLEEDP
         V    V S++ YGAF+ +        + GL+HVS++S D V D+  +L  GD ++V +++ DR        E+ R++LS K+LE  P
Subjt:  EVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLEEDP

P50889 30S ribosomal protein S13.5e-1530.81Show/hide
Query:  SRRSEDWKAAREYLD--SGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSK
        S++  D + A E L    G   + ++  +  GGL+V  + + GF+P   ++         + + D+        +  ++I+ D  N +LI S K  A  +
Subjt:  SRRSEDWKAAREYLD--SGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSK

Query:  FSEQ-------VSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLE
         + Q       +SVGEV +  V  + D+GAFV L   D      GLVHVSE+S D V++  D+L++GD+V VK++ +D         EK RI+LSIK  +
Subjt:  FSEQ-------VSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLE

Query:  EDPLLETLDKV
          P  E  D++
Subjt:  EDPLLETLDKV

Q8RI52 4-hydroxy-3-methylbut-2-enyl diphosphate reductase2.8e-1229.33Show/hide
Query:  ESLPRSRR----SEDWKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLP--FPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIF
        E +  SRR     ++W+   +   +  I + ++     GG LV      GFLP    ++S S   K   K IQ I K        +K+   D++N+K+ +
Subjt:  ESLPRSRR----SEDWKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLP--FPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIF

Query:  SEKE---AAWSKFSEQVSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLS
        S K+   A   K    ++VG++ D  V  V D+G  V +        L G +H+SEVSW  +  + D    GD+++  V+++D          K  + LS
Subjt:  SEKE---AAWSKFSEQVSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLS

Query:  IKQLEEDP
        IK+LEEDP
Subjt:  IKQLEEDP

Q93VC7 30S ribosomal protein S1, chloroplastic2.0e-1830.32Show/hide
Query:  WKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVG
        W+  R+      I + ++ G+N GGL+     L GF+PF Q+S   + +E           LL   +P+K ++ DE   KL+ S ++A  +    Q+ +G
Subjt:  WKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVG

Query:  EVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLEEDP
         V    V S++ YGAF+ +        + GL+HVS++S D V D+  +L  GD ++V +++ DR        ++ R++LS K+LE  P
Subjt:  EVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLEEDP

Arabidopsis top hitse value%identityAlignment
AT1G12800.1 Nucleic acid-binding, OB-fold-like protein4.3e-1629.5Show/hide
Query:  QDIAKSLLGSIVPVKIIQADERNKKLIFS----EKEAAWSK---FSEQVSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDI
        Q    S +G  + V ++ A+  ++KLIFS    E E    K      ++ VG+V    +  +  +G F  L        +  LVH SEVSWD   D    
Subjt:  QDIAKSLLGSIVPVKIIQADERNKKLIFS----EKEAAWSK---FSEQVSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDI

Query:  LSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPL--PGLETIFEELLQEDGLNISIEDVRVNR
           G  V  KV  +D            RI LS+K++  DPL E L+ V+  D     D  G R  +  +    P +E++ +EL   +G    I+ V  +R
Subjt:  LSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPL--PGLETIFEELLQEDGLNISIEDVRVNR

Query:  QGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERV
          F    ++   Q+++  AP  E ++ LLARAG +VQE+ +  SL +E +K  +     RV
Subjt:  QGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERV

AT1G71720.1 Nucleic acid-binding proteins superfamily7.3e-0828.21Show/hide
Query:  WKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVG
        W   R+        E +I   N GGLL R   L  F+P  +L    +     K  +++ +  L     V+I + +E    LI SEK  AW K    +  G
Subjt:  WKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVG

Query:  EVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLEEDPLLETLDK
         + +  V  +  YGA V L    G    +GL+H+S ++   +  V D+L   + V+V V+           L   +I+LSI  LE +P L   D+
Subjt:  EVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLEEDPLLETLDK

AT3G23700.1 Nucleic acid-binding proteins superfamily2.5e-11760.2Show/hide
Query:  TLGSLSPYSFLSHFASTD-SSPSSILTHNHNSPSKRS------PNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSE-------DWKAAREYLDSG
        TLGS+S  S L    ST  S P  +L  + +S S R+       +F    +   + +       +   AS  SL R    E       DWK A+ Y  SG
Subjt:  TLGSLSPYSFLSHFASTD-SSPSSILTHNHNSPSKRS------PNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSE-------DWKAAREYLDSG

Query:  FIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVE
          +EG ++G N GGLL+RFHSLVGFLP+PQLSPS SCKEP KSI +IAK+L+GS +PVK++QADE N+KLI SEK A W K+S+ V+VG+V++ RVGSVE
Subjt:  FIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVE

Query:  DYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPR
        DYGAF+HLRF DGLYHLTGLVHVSEVSWD VQDVRD+L +GDEVRV V N+D+        EKSRITLSIKQLE+DPLLETLDKVI +D S    S    
Subjt:  DYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPR

Query:  SDSEIIPLPGLETIFEELLQEDGLNISIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP
        +   I PLPGLETI EELL+EDG    IE V++NRQGFEKRVVSQDLQLWLSN PP + KF+LLARAGRQVQEI LTTSL+Q GIK ALQHVLERVP
Subjt:  SDSEIIPLPGLETIFEELLQEDGLNISIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP

AT4G29060.1 elongation factor Ts family protein4.7e-0729.27Show/hide
Query:  SEQVSVGEVYDARVGSVEDYGAFVHL-RFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLEEDPLLET
        +E++  G  +  +V +++ +GAFV    F+D      GLVHVS++S + V+DV  +++ G EV+V+++  D        +E  RI+L++++ ++ P  ++
Subjt:  SEQVSVGEVYDARVGSVEDYGAFVHL-RFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLEEDPLLET

Query:  LDKVIPQDGSAEPDSFGPRSDSE
                GS +P S G R  S+
Subjt:  LDKVIPQDGSAEPDSFGPRSDSE

AT5G30510.1 ribosomal protein S11.4e-1930.32Show/hide
Query:  WKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVG
        W+  R+      I + ++ G+N GGL+     L GF+PF Q+S   + +E           LL   +P+K ++ DE   KL+ S ++A  +    Q+ +G
Subjt:  WKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVG

Query:  EVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLEEDP
         V    V S++ YGAF+ +        + GL+HVS++S D V D+  +L  GD ++V +++ DR        ++ R++LS K+LE  P
Subjt:  EVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLEEDP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTATCCGTCACGAAAAAATCCGATGGGTTCTTATCGGATTTGCAGCAGTTGGTGAGTTTCCATTTGGAGCCCAATCCGCCATTGCCATCGTCATCGTCATC
ATCAACATGCCAATCTGCACTGCAACTCTCGGATCTCTCTCCCCTTATTCCTTTCTCTCACACTTCGCTTCCACTGATTCTTCACCCTCCTCCATTCTCACCCAC
AACCACAACTCCCCCTCTAAACGCTCTCCCAACTTCCCCCCCAGACTCTCCCTCCCCGCAAATCCCGACCCCATTGCCGGAGTTCTTGATACTACTTCCCCTGCC
TCGCCGGAATCGCTTCCACGTTCTCGGAGATCTGAGGATTGGAAGGCAGCGAGGGAATACCTTGATAGTGGATTTATCTATGAGGGTAGGATTGAAGGTTCAAAT
GCTGGAGGTTTACTTGTCAGATTTCATTCTCTAGTTGGCTTTCTTCCATTCCCTCAATTGAGCCCATCTCATTCTTGTAAAGAACCATACAAAAGTATCCAAGAT
ATTGCAAAAAGCTTACTTGGTTCGATTGTACCAGTGAAGATTATCCAAGCAGATGAGAGAAACAAAAAATTGATATTTTCAGAGAAAGAAGCTGCGTGGTCAAAG
TTTTCTGAGCAAGTTAGTGTGGGAGAGGTTTATGATGCTCGAGTTGGTTCTGTGGAGGATTATGGTGCCTTTGTACATTTACGTTTCTCTGATGGTCTTTATCAT
CTTACCGGGCTAGTACACGTATCAGAAGTTTCATGGGATCTAGTTCAGGATGTAAGAGACATCTTGAGTGAGGGTGACGAAGTGAGGGTGAAAGTTATTAATGTT
GATAGGCAAGTTCTCTCAGGTTCCTTTTTGGAAAAGTCCAGGATCACACTGTCAATTAAACAACTGGAGGAAGATCCACTGTTGGAAACGCTGGACAAAGTAATA
CCGCAGGATGGTTCTGCTGAACCTGATTCTTTCGGACCTAGAAGTGATAGCGAAATTATACCACTTCCTGGACTTGAAACAATATTTGAAGAGCTACTGCAGGAA
GATGGTTTGAATATCAGTATAGAAGATGTTCGTGTCAACCGACAAGGATTTGAGAAGCGGGTGGTTTCACAAGATCTACAGCTTTGGCTATCAAATGCTCCTCCC
GTTGAAAAGAAGTTCATTCTTCTTGCTCGTGCCGGTAGGCAGGTTCAAGAAATACAACTGACGACATCGCTTGATCAGGAAGGTATAAAAATGGCATTGCAGCAC
GTATTGGAGCGTGTCCCGTGA
mRNA sequenceShow/hide mRNA sequence
CGTCGACCCTGAGTTCAAATTCTGTCTTCTATTTAAAAAAAAAAATTGTATTAAAAAAACATTAAAAAAAAAAGGGAAAAAAAAGGGAAAAATGATACCGATACG
TGTATGTTTATCCGTCACGAAAAAATCCGATGGGTTCTTATCGGATTTGCAGCAGTTGGTGAGTTTCCATTTGGAGCCCAATCCGCCATTGCCATCGTCATCGTC
ATCATCAACATGCCAATCTGCACTGCAACTCTCGGATCTCTCTCCCCTTATTCCTTTCTCTCACACTTCGCTTCCACTGATTCTTCACCCTCCTCCATTCTCACC
CACAACCACAACTCCCCCTCTAAACGCTCTCCCAACTTCCCCCCCAGACTCTCCCTCCCCGCAAATCCCGACCCCATTGCCGGAGTTCTTGATACTACTTCCCCT
GCCTCGCCGGAATCGCTTCCACGTTCTCGGAGATCTGAGGATTGGAAGGCAGCGAGGGAATACCTTGATAGTGGATTTATCTATGAGGGTAGGATTGAAGGTTCA
AATGCTGGAGGTTTACTTGTCAGATTTCATTCTCTAGTTGGCTTTCTTCCATTCCCTCAATTGAGCCCATCTCATTCTTGTAAAGAACCATACAAAAGTATCCAA
GATATTGCAAAAAGCTTACTTGGTTCGATTGTACCAGTGAAGATTATCCAAGCAGATGAGAGAAACAAAAAATTGATATTTTCAGAGAAAGAAGCTGCGTGGTCA
AAGTTTTCTGAGCAAGTTAGTGTGGGAGAGGTTTATGATGCTCGAGTTGGTTCTGTGGAGGATTATGGTGCCTTTGTACATTTACGTTTCTCTGATGGTCTTTAT
CATCTTACCGGGCTAGTACACGTATCAGAAGTTTCATGGGATCTAGTTCAGGATGTAAGAGACATCTTGAGTGAGGGTGACGAAGTGAGGGTGAAAGTTATTAAT
GTTGATAGGCAAGTTCTCTCAGGTTCCTTTTTGGAAAAGTCCAGGATCACACTGTCAATTAAACAACTGGAGGAAGATCCACTGTTGGAAACGCTGGACAAAGTA
ATACCGCAGGATGGTTCTGCTGAACCTGATTCTTTCGGACCTAGAAGTGATAGCGAAATTATACCACTTCCTGGACTTGAAACAATATTTGAAGAGCTACTGCAG
GAAGATGGTTTGAATATCAGTATAGAAGATGTTCGTGTCAACCGACAAGGATTTGAGAAGCGGGTGGTTTCACAAGATCTACAGCTTTGGCTATCAAATGCTCCT
CCCGTTGAAAAGAAGTTCATTCTTCTTGCTCGTGCCGGTAGGCAGGTTCAAGAAATACAACTGACGACATCGCTTGATCAGGAAGGTATAAAAATGGCATTGCAG
CACGTATTGGAGCGTGTCCCGTGATTTGAATCCGGAGTTCGTGTTCGATTTTGTATGATGATGTCTGTAAAGAAGATGCTGTCAATTCAATTGTTTCCAGTAAGA
TGTATATTTTGTTTGGTCTGTTCATCTATTCTAATTTGCCAAGGATCTTGTAAGCATGAACGCTCGATTGTTGTCTGACAGCGTTCTGCTTGGAAGATTTAGTCT
TTGTTTTCCATATTCAATATAATAGTATCATTTTTGTGTTCCATTGAATTGAAACACATGTAAATTAGTTTCAATTTGATTCTTATACCTCTAAAAGTTGCATTA
TCATTTATATCTATAGATTAAAGCTTAGCAAACATTGCTATTGTTACTGAAAAATAACAGAAATCCA
Protein sequenceShow/hide protein sequence
MFIRHEKIRWVLIGFAAVGEFPFGAQSAIAIVIVIINMPICTATLGSLSPYSFLSHFASTDSSPSSILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPA
SPESLPRSRRSEDWKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSK
FSEQVSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDRQVLSGSFLEKSRITLSIKQLEEDPLLETLDKVI
PQDGSAEPDSFGPRSDSEIIPLPGLETIFEELLQEDGLNISIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQH
VLERVP