; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g09100 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g09100
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description30S ribosomal protein S1-like
Genome locationchr5:7002665..7007593
RNA-Seq ExpressionMoc05g09100
SyntenyMoc05g09100
Gene Ontology termsGO:0000481 - maturation of 5S rRNA (biological process)
GO:0009737 - response to abscisic acid (biological process)
GO:0032508 - DNA duplex unwinding (biological process)
GO:0034337 - RNA folding (biological process)
GO:0005840 - ribosome (cellular component)
GO:0009507 - chloroplast (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003729 - mRNA binding (molecular function)
InterPro domainsIPR003029 - S1 domain
IPR012340 - Nucleic acid-binding, OB-fold
IPR022967 - RNA-binding domain, S1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582039.1 30S ribosomal protein S1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]5.2e-17083.68Show/hide
Query:  MPICTA-TLGSLSPYSFLSHFASTDSSP--SSILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEG
        MPI  A TLGSLS +SFLS  ASTD+S   SS     H SPSKR  NF  R+SL   P+PIAGVL+ +SP+SPES+ R+RRS DWK AREYLDSGFI++G
Subjt:  MPICTA-TLGSLSPYSFLSHFASTDSSP--SSILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEG

Query:  RIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAF
        RIEGSNAGGLLVRF+SLVGFLPFP LSP+HSCKEPYKSIQDIAKSL+GS++PVK+IQADE+NK LIFSEKEAAWSKFSE+V VG+VY+ARVGSVEDYGAF
Subjt:  RIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAF

Query:  VHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGLET
        VHLRFSDGLYHLTGLVH+SEVSWDLVQDVRDILSEGDEVRVKVI+VDR+KSRITLSIKQLEEDPLLETLDKVIPQD SAEPDSFGP+SDSEIIPLPGL+T
Subjt:  VHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGLET

Query:  IFEELLQEDGIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP
        IFEELLQE+GIEDV +NRQGFEKRVVSQDLQLWLSNAPPVEKKF LLARAGRQVQEIQLTTSLDQEGIK ALQ VLERVP
Subjt:  IFEELLQEDGIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP

XP_022145209.1 uncharacterized protein LOC111014714 [Momordica charantia]2.8e-208100Show/hide
Query:  MPICTATLGSLSPYSFLSHFASTDSSPSSILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRIE
        MPICTATLGSLSPYSFLSHFASTDSSPSSILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRIE
Subjt:  MPICTATLGSLSPYSFLSHFASTDSSPSSILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRIE

Query:  GSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVHL
        GSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVHL
Subjt:  GSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVHL

Query:  RFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGLETIFE
        RFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGLETIFE
Subjt:  RFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGLETIFE

Query:  ELLQEDGIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP
        ELLQEDGIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP
Subjt:  ELLQEDGIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP

XP_022955545.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111457527 [Cucurbita moschata]4.6e-17478.17Show/hide
Query:  MFIRHEKIRWVLIGFAAVGEFPFGAQS---------AIAIVIVIINMPICTA-TLGSLSPYSFLSHFASTDSSP--SSILTHNHNSPSKRSPNFPPRLSL
        M +RH    ++ IGFAAV  FPF AQS         AI+  +   NMPI  A TLGSLS +SFLS  AS D+S   S+     H SPSKR  NF  R+SL
Subjt:  MFIRHEKIRWVLIGFAAVGEFPFGAQS---------AIAIVIVIINMPICTA-TLGSLSPYSFLSHFASTDSSP--SSILTHNHNSPSKRSPNFPPRLSL

Query:  PANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVK
           P+PIAGVL+ +SP+SPES+ R+RRS DWK AREYLDSGFI++GRIEGSNAGGLLVRF+SLVGFLPFP LSP+HSCKEPYKSIQDIAKSL+GS++PVK
Subjt:  PANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVK

Query:  IIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRIT
        +IQADE+NK LIFSEKEAAWSKFSE+V VG+VY+ARVGS+EDYGAFVHLRFSDGLYHLTGLVH+SEVSWDLVQDVRDILSEGDEVRVKVI+VDR+KSRIT
Subjt:  IIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRIT

Query:  LSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGLETIFEELLQEDGIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQV
        LSIKQLEEDPLLETLDKVIPQD SAEPDSFGP+SDSEIIPLPGL+TIFEELLQE+GIEDV +NRQGFEKRVVSQDLQLWLSNAPPVEKKF LLARAGRQV
Subjt:  LSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGLETIFEELLQEDGIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQV

Query:  QEIQLTTSLDQEGIKMALQHVLERVP
        QEIQLTTSLDQEGIK ALQ VLERVP
Subjt:  QEIQLTTSLDQEGIKMALQHVLERVP

XP_023526022.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111789621 [Cucurbita pepo subsp. pepo]7.0e-17578.64Show/hide
Query:  MFIRHEKIRWVLIGFAAVGEFPFGAQS---------AIAIVIVIINMPICTA-TLGSLSPYSFLSHFASTDSSPSSILTH--NHNSPSKRSPNFPPRLSL
        M + H    ++ IGFAAV  FPF AQS         AI+  +   NMPI  A TLGSLS +SFL+  ASTD+S S   +    H SPSKR  NF  R+SL
Subjt:  MFIRHEKIRWVLIGFAAVGEFPFGAQS---------AIAIVIVIINMPICTA-TLGSLSPYSFLSHFASTDSSPSSILTH--NHNSPSKRSPNFPPRLSL

Query:  PANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVK
           PDPIAGVL+ +SP+SPES+ R+RRS DWK AREYLDSGFI++GRIEGSNAGGLLVRF+SLVGFLPFP LSP+HSCKEPYKSIQDIAKSL+GS++PVK
Subjt:  PANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVK

Query:  IIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRIT
        +IQADE+NK LIFSEKEAAWSKFSEQV VG+VY+ARVGSVEDYGAFVHLRFSDGLYHLTGLVH+SEVSWDLVQDVRDILSEGDEVRVKVI+VDR+KSRIT
Subjt:  IIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRIT

Query:  LSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGLETIFEELLQEDGIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQV
        LSIKQLEEDPLLETLDKVIPQD SAEPDSFGP+SDSEIIPLPGL+TIFEELLQE+GIEDV +NRQGFEKRVVSQDLQLWLSNAPPVEKKF LLARAGRQV
Subjt:  LSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGLETIFEELLQEDGIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQV

Query:  QEIQLTTSLDQEGIKMALQHVLERVP
        QEIQLTTSLDQEGIK ALQ VLERVP
Subjt:  QEIQLTTSLDQEGIKMALQHVLERVP

XP_038897871.1 30S ribosomal protein S1 homolog B [Benincasa hispida]7.3e-17285.45Show/hide
Query:  MPICTATLGSLSPYSFLSHFAS-TDSSPSSILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRI
        MPI  ATLGS+  +SFLS  AS TD + S+       SPSKR  NFP R+SL   PDPIAGVLD TSP+SPESL R+RRS DWKAAREYLD+GFIYEGRI
Subjt:  MPICTATLGSLSPYSFLSHFAS-TDSSPSSILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRI

Query:  EGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVH
        EGSNAGGLLVRF+SL+GFLPFPQLSPSHSCKEPYKSIQDIAKSL+GS++ VK+IQADERNKKLIFSEKEAAWSKFSEQV VG+VY+ARVGSVEDYGAFVH
Subjt:  EGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVH

Query:  LRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGLETIF
        LRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEV VKVINVDR+KSRITLSIKQLEEDPLLETLDKVIPQ GSAEPDSFGP+SDSEI+PLPGLETI 
Subjt:  LRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGLETIF

Query:  EELLQEDGIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP
        EELLQEDGI D+ VNRQGFEKRVVSQDLQLWLSNAPPVEKKF LLARAGRQVQEIQLTTSLDQEGIK ALQ VLERVP
Subjt:  EELLQEDGIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP

TrEMBL top hitse value%identityAlignment
A0A0A0KTX2 Uncharacterized protein4.6e-16481.94Show/hide
Query:  MPICTATLGSLSPYSFLSHFASTD-----SSPSSILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIY
        MPI  AT+ S+S +SFLS  AST      SS SS       SPSKRS  FP R+SL   PDPIAGVLDT    SPES+ R+RRS DWKAAREYLDSGFIY
Subjt:  MPICTATLGSLSPYSFLSHFASTD-----SSPSSILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIY

Query:  EGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYG
        EGRIEGSNAGGLLVRF+SLVGFLPFPQLSPSHSCKEPYKSIQDIAKSL+GS++ VK+IQADE+N+KLIFSEKEAA SKFS QV+VG+VY+ +VGSVEDYG
Subjt:  EGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYG

Query:  AFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGL
        AFVHLR SDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEV VKVINV++ KSRITLSI+QLEEDPLLETLDKVIPQ+ SAEPDSFGP+ DSEIIPLPGL
Subjt:  AFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGL

Query:  ETIFEELLQEDGIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP
        ETI EELLQE+GI DVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKF LLARAGRQVQEIQLTTSLDQEGIK ALQ VLERVP
Subjt:  ETIFEELLQEDGIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP

A0A1S3BXL4 30S ribosomal protein S1 isoform X12.0e-16777.34Show/hide
Query:  MFIRHEKIRWVLIGFA-----AVGE-FPFGAQSAIAIVIVI-INMPICTATLGSLSPYSFLSHFAST-------DSSPSSILTHNHNSPSKRSPNFPPRL
        MF+RH   RWV I        ++ E FP     AI   + I I MPI  AT+ S+S +SFLS  AST        SS SSIL     SPSKR   FP R+
Subjt:  MFIRHEKIRWVLIGFA-----AVGE-FPFGAQSAIAIVIVI-INMPICTATLGSLSPYSFLSHFAST-------DSSPSSILTHNHNSPSKRSPNFPPRL

Query:  SLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVP
        SL   PDPIAGVLDT    SPES+ R+RRS DWKAAREYLDSGFIYEGRIEGSNAGGLLVRF+SL+GFLPFPQLSPSHSCKEP KSIQDIAKSL GS++ 
Subjt:  SLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVP

Query:  VKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSR
        VK+IQADERNKKLIFSEKEA WSKFS QV VG+VY+A+VGS+EDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEV VKVINVDR+KSR
Subjt:  VKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSR

Query:  ITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGLETIFEELLQEDGIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGR
        ITLSI+QLEEDPLLETLDKVIPQD SAEPDSFGP+SDSEIIPLPGL TI EEL QE+GI DVRVNRQGFEKRVVSQDLQLWLSNAPP+EKKF LLARAGR
Subjt:  ITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGLETIFEELLQEDGIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGR

Query:  QVQEIQLTTSLDQEGIKMALQHVLERVP
        QVQEIQLTTSLDQEGIK ALQ VLERVP
Subjt:  QVQEIQLTTSLDQEGIKMALQHVLERVP

A0A6J1CUJ6 uncharacterized protein LOC1110147141.4e-208100Show/hide
Query:  MPICTATLGSLSPYSFLSHFASTDSSPSSILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRIE
        MPICTATLGSLSPYSFLSHFASTDSSPSSILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRIE
Subjt:  MPICTATLGSLSPYSFLSHFASTDSSPSSILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRIE

Query:  GSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVHL
        GSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVHL
Subjt:  GSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVHL

Query:  RFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGLETIFE
        RFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGLETIFE
Subjt:  RFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGLETIFE

Query:  ELLQEDGIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP
        ELLQEDGIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP
Subjt:  ELLQEDGIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP

A0A6J1GVD7 LOW QUALITY PROTEIN: uncharacterized protein LOC1114575272.2e-17478.17Show/hide
Query:  MFIRHEKIRWVLIGFAAVGEFPFGAQS---------AIAIVIVIINMPICTA-TLGSLSPYSFLSHFASTDSSP--SSILTHNHNSPSKRSPNFPPRLSL
        M +RH    ++ IGFAAV  FPF AQS         AI+  +   NMPI  A TLGSLS +SFLS  AS D+S   S+     H SPSKR  NF  R+SL
Subjt:  MFIRHEKIRWVLIGFAAVGEFPFGAQS---------AIAIVIVIINMPICTA-TLGSLSPYSFLSHFASTDSSP--SSILTHNHNSPSKRSPNFPPRLSL

Query:  PANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVK
           P+PIAGVL+ +SP+SPES+ R+RRS DWK AREYLDSGFI++GRIEGSNAGGLLVRF+SLVGFLPFP LSP+HSCKEPYKSIQDIAKSL+GS++PVK
Subjt:  PANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVK

Query:  IIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRIT
        +IQADE+NK LIFSEKEAAWSKFSE+V VG+VY+ARVGS+EDYGAFVHLRFSDGLYHLTGLVH+SEVSWDLVQDVRDILSEGDEVRVKVI+VDR+KSRIT
Subjt:  IIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRIT

Query:  LSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGLETIFEELLQEDGIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQV
        LSIKQLEEDPLLETLDKVIPQD SAEPDSFGP+SDSEIIPLPGL+TIFEELLQE+GIEDV +NRQGFEKRVVSQDLQLWLSNAPPVEKKF LLARAGRQV
Subjt:  LSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGLETIFEELLQEDGIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQV

Query:  QEIQLTTSLDQEGIKMALQHVLERVP
        QEIQLTTSLDQEGIK ALQ VLERVP
Subjt:  QEIQLTTSLDQEGIKMALQHVLERVP

A0A6J1IUF3 uncharacterized protein LOC1114794067.3e-17083.42Show/hide
Query:  MPICTA-TLGSLSPYSFLSHFASTDSSP--SSILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEG
        MPI  A TLGSLS +SFLS  ASTD+S   SS     H SPSKR  NF  R+SL   P+PIAGVL+ +SP+SPES+ R+RRS DWK AREYLDSGFI++G
Subjt:  MPICTA-TLGSLSPYSFLSHFASTDSSP--SSILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSEDWKAAREYLDSGFIYEG

Query:  RIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAF
        RIEGSNAGGLLVRF+SLVGFLPFP LSP+HSCKEPYKSIQDIAKSL+GS++PVK+IQADE+NK LIFSEKEAAWSKFSEQV VG+VY+ARVGSVEDYGAF
Subjt:  RIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVEDYGAF

Query:  VHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGLET
        VHLRFSDG YHLTGLVH+SEVSWDLVQDVRDILSEGDEVRVKV++VDR+KSRITLSIKQLEEDPLLETLDKVIPQD SAEPDSFGP+SDSEIIPLPGL+T
Subjt:  VHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLPGLET

Query:  IFEELLQEDGIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP
        IFEELLQE+GIEDV +NRQGFEKRVVSQDLQLWLSNAPPVEKKF LLARAGRQVQEIQLTTSLDQEGIK ALQ VLERVP
Subjt:  IFEELLQEDGIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP

SwissProt top hitse value%identityAlignment
P29344 30S ribosomal protein S1, chloroplastic3.8e-2232.22Show/hide
Query:  WKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVG
        W+  R+      + +G+I G+N GG++     L GF+PF Q+S   S +E           LL   +P+K ++ DE   +L+ S ++ A +    Q+ +G
Subjt:  WKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVG

Query:  EVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDP
         V    V S++ YGAF+ +        + GL+HVS++S D V D+  +L  GD ++V +++ DRE+ R++LS K+LE  P
Subjt:  EVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDP

P46228 30S ribosomal protein S11.3e-1930.53Show/hide
Query:  SLPRSRRSEDWKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAW
        S+ R      W+  R+           +  +N GG LVR   L GF+P   +S +   KE           L+G  +P+K ++ DE   +L+ S + A  
Subjt:  SLPRSRRSEDWKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAW

Query:  SKFSEQVSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDP
         +   ++ VGEV    V  ++ YGAF+ +        ++GL+H+SE+S D ++    + +  DEV+V +I++D E+ RI+LS KQLE +P
Subjt:  SKFSEQVSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDP

P73530 30S ribosomal protein S1 homolog A7.4e-1827.89Show/hide
Query:  SLPRSRRSEDWKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAW
        S+ R      W+  R+           +  +N GG LVR   L GF+P   +           S ++  + L+G  +P+K ++ DE   +L+ S + A  
Subjt:  SLPRSRRSEDWKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAW

Query:  SKFSEQVSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDP
         +    + V +V    V  ++ YGAF+ +        ++GL+H+SE+S D +     + +  DE++V +I++D E+ RI+LS KQLE +P
Subjt:  SKFSEQVSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDP

Q93VC7 30S ribosomal protein S1, chloroplastic1.2e-2031.67Show/hide
Query:  WKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVG
        W+  R+      I + ++ G+N GGL+     L GF+PF Q+S   + +E           LL   +P+K ++ DE   KL+ S ++A  +    Q+ +G
Subjt:  WKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVG

Query:  EVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDP
         V    V S++ YGAF+ +        + GL+HVS++S D V D+  +L  GD ++V +++ DR++ R++LS K+LE  P
Subjt:  EVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDP

Q9JZ44 30S ribosomal protein S11.6e-1730.46Show/hide
Query:  SLPRSRRSEDWKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAW
        S  +++R+ DW A  E +++G I  G I G   GGL V   S+  FLP   +          + ++D      G  +  K+I+ D++   ++ S +    
Subjt:  SLPRSRRSEDWKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAW

Query:  SKFSEQ-------VSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDP
        +   E+       +  G V    V ++ DYGAFV L   DGL H+T      +++W  V+   ++L  G EV  KV+  D+EK R++L +KQL EDP
Subjt:  SKFSEQ-------VSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDP

Arabidopsis top hitse value%identityAlignment
AT1G12800.1 Nucleic acid-binding, OB-fold-like protein1.4e-1930.92Show/hide
Query:  QDIAKSLLGSIVPVKIIQADERNKKLIFS----EKEAAWSK---FSEQVSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDI
        Q    S +G  + V ++ A+  ++KLIFS    E E    K      ++ VG+V    +  +  +G F  L        +  LVH SEVSWD   D    
Subjt:  QDIAKSLLGSIVPVKIIQADERNKKLIFS----EKEAAWSK---FSEQVSVGEVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDI

Query:  LSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPL--PGLETIFEELLQEDGIEDVRVNRQGFEKRVVSQDL
           G  V  KV  +D    RI LS+K++  DPL E L+ V+  D     D  G R  +  +    P +E++ +EL   +GI+ V  +R  F    ++   
Subjt:  LSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPL--PGLETIFEELLQEDGIEDVRVNRQGFEKRVVSQDL

Query:  QLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERV
        Q+++  AP  E ++ LLARAG +VQE+ +  SL +E +K  +     RV
Subjt:  QLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERV

AT3G23700.1 Nucleic acid-binding proteins superfamily3.7e-12162.08Show/hide
Query:  TLGSLSPYSFLSHFASTD-SSPSSILTHNHNSPSKRS------PNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSE-------DWKAAREYLDSG
        TLGS+S  S L    ST  S P  +L  + +S S R+       +F    +   + +       +   AS  SL R    E       DWK A+ Y  SG
Subjt:  TLGSLSPYSFLSHFASTD-SSPSSILTHNHNSPSKRS------PNFPPRLSLPANPDPIAGVLDTTSPASPESLPRSRRSE-------DWKAAREYLDSG

Query:  FIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVE
          +EG ++G N GGLL+RFHSLVGFLP+PQLSPS SCKEP KSI +IAK+L+GS +PVK++QADE N+KLI SEK A W K+S+ V+VG+V++ RVGSVE
Subjt:  FIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEVYDARVGSVE

Query:  DYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPL
        DYGAF+HLRF DGLYHLTGLVHVSEVSWD VQDVRD+L +GDEVRV V N+D+EKSRITLSIKQLE+DPLLETLDKVI +D S    S    +   I PL
Subjt:  DYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPL

Query:  PGLETIFEELLQEDGIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP
        PGLETI EELL+EDGIE V++NRQGFEKRVVSQDLQLWLSN PP + KF+LLARAGRQVQEI LTTSL+Q GIK ALQHVLERVP
Subjt:  PGLETIFEELLQEDGIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP

AT4G29060.1 elongation factor Ts family protein1.1e-0831.3Show/hide
Query:  SEQVSVGEVYDARVGSVEDYGAFVHL-RFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQD
        +E++  G  +  +V +++ +GAFV    F+D      GLVHVS++S + V+DV  +++ G EV+V+++  D E  RI+L++++ ++ P  ++        
Subjt:  SEQVSVGEVYDARVGSVEDYGAFVHL-RFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQD

Query:  GSAEPDSFGPRSDSE
        GS +P S G R  S+
Subjt:  GSAEPDSFGPRSDSE

AT4G29060.2 elongation factor Ts family protein1.1e-0831.3Show/hide
Query:  SEQVSVGEVYDARVGSVEDYGAFVHL-RFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQD
        +E++  G  +  +V +++ +GAFV    F+D      GLVHVS++S + V+DV  +++ G EV+V+++  D E  RI+L++++ ++ P  ++        
Subjt:  SEQVSVGEVYDARVGSVEDYGAFVHL-RFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQD

Query:  GSAEPDSFGPRSDSE
        GS +P S G R  S+
Subjt:  GSAEPDSFGPRSDSE

AT5G30510.1 ribosomal protein S18.7e-2231.67Show/hide
Query:  WKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVG
        W+  R+      I + ++ G+N GGL+     L GF+PF Q+S   + +E           LL   +P+K ++ DE   KL+ S ++A  +    Q+ +G
Subjt:  WKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVG

Query:  EVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDP
         V    V S++ YGAF+ +        + GL+HVS++S D V D+  +L  GD ++V +++ DR++ R++LS K+LE  P
Subjt:  EVYDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTATCCGTCACGAAAAAATCCGATGGGTTCTTATCGGATTTGCAGCAGTTGGTGAGTTTCCATTTGGAGCCCAATCCGCCATTGCCATCGTCATCGTCATCATCAA
CATGCCAATCTGCACTGCAACTCTCGGATCTCTCTCCCCTTATTCCTTTCTCTCACACTTCGCTTCCACTGATTCTTCACCCTCCTCCATTCTCACCCACAACCACAACT
CCCCCTCTAAACGCTCTCCCAACTTCCCCCCCAGACTCTCCCTCCCCGCAAATCCCGACCCCATTGCCGGAGTTCTTGATACTACTTCCCCTGCCTCGCCGGAATCGCTT
CCACGTTCTCGGAGATCTGAGGATTGGAAGGCAGCGAGGGAATACCTTGATAGTGGATTTATCTATGAGGGTAGGATTGAAGGTTCAAATGCTGGAGGTTTACTTGTCAG
ATTTCATTCTCTAGTTGGCTTTCTTCCATTCCCTCAATTGAGCCCATCTCATTCTTGTAAAGAACCATACAAAAGTATCCAAGATATTGCAAAAAGCTTACTTGGTTCGA
TTGTACCAGTGAAGATTATCCAAGCAGATGAGAGAAACAAAAAATTGATATTTTCAGAGAAAGAAGCTGCGTGGTCAAAGTTTTCTGAGCAAGTTAGTGTGGGAGAGGTT
TATGATGCTCGAGTTGGTTCTGTGGAGGATTATGGTGCCTTTGTACATTTACGTTTCTCTGATGGTCTTTATCATCTTACCGGGCTAGTACACGTATCAGAAGTTTCATG
GGATCTAGTTCAGGATGTAAGAGACATCTTGAGTGAGGGTGACGAAGTGAGGGTGAAAGTTATTAATGTTGATAGGGAAAAGTCCAGGATCACACTGTCAATTAAACAAC
TGGAGGAAGATCCACTGTTGGAAACGCTGGACAAAGTAATACCGCAGGATGGTTCTGCTGAACCTGATTCTTTCGGACCTAGAAGTGATAGCGAAATTATACCACTTCCT
GGACTTGAAACAATATTTGAAGAGCTACTGCAGGAAGATGGTATAGAAGATGTTCGTGTCAACCGACAAGGATTTGAGAAGCGGGTGGTTTCACAAGATCTACAGCTTTG
GCTATCAAATGCTCCTCCCGTTGAAAAGAAGTTCATTCTTCTTGCTCGTGCCGGTAGGCAGGTTCAAGAAATACAACTGACGACATCGCTTGATCAGGAAGGTATAAAAA
TGGCATTGCAGCACGTATTGGAGCGTGTCCCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTTATCCGTCACGAAAAAATCCGATGGGTTCTTATCGGATTTGCAGCAGTTGGTGAGTTTCCATTTGGAGCCCAATCCGCCATTGCCATCGTCATCGTCATCATCAA
CATGCCAATCTGCACTGCAACTCTCGGATCTCTCTCCCCTTATTCCTTTCTCTCACACTTCGCTTCCACTGATTCTTCACCCTCCTCCATTCTCACCCACAACCACAACT
CCCCCTCTAAACGCTCTCCCAACTTCCCCCCCAGACTCTCCCTCCCCGCAAATCCCGACCCCATTGCCGGAGTTCTTGATACTACTTCCCCTGCCTCGCCGGAATCGCTT
CCACGTTCTCGGAGATCTGAGGATTGGAAGGCAGCGAGGGAATACCTTGATAGTGGATTTATCTATGAGGGTAGGATTGAAGGTTCAAATGCTGGAGGTTTACTTGTCAG
ATTTCATTCTCTAGTTGGCTTTCTTCCATTCCCTCAATTGAGCCCATCTCATTCTTGTAAAGAACCATACAAAAGTATCCAAGATATTGCAAAAAGCTTACTTGGTTCGA
TTGTACCAGTGAAGATTATCCAAGCAGATGAGAGAAACAAAAAATTGATATTTTCAGAGAAAGAAGCTGCGTGGTCAAAGTTTTCTGAGCAAGTTAGTGTGGGAGAGGTT
TATGATGCTCGAGTTGGTTCTGTGGAGGATTATGGTGCCTTTGTACATTTACGTTTCTCTGATGGTCTTTATCATCTTACCGGGCTAGTACACGTATCAGAAGTTTCATG
GGATCTAGTTCAGGATGTAAGAGACATCTTGAGTGAGGGTGACGAAGTGAGGGTGAAAGTTATTAATGTTGATAGGGAAAAGTCCAGGATCACACTGTCAATTAAACAAC
TGGAGGAAGATCCACTGTTGGAAACGCTGGACAAAGTAATACCGCAGGATGGTTCTGCTGAACCTGATTCTTTCGGACCTAGAAGTGATAGCGAAATTATACCACTTCCT
GGACTTGAAACAATATTTGAAGAGCTACTGCAGGAAGATGGTATAGAAGATGTTCGTGTCAACCGACAAGGATTTGAGAAGCGGGTGGTTTCACAAGATCTACAGCTTTG
GCTATCAAATGCTCCTCCCGTTGAAAAGAAGTTCATTCTTCTTGCTCGTGCCGGTAGGCAGGTTCAAGAAATACAACTGACGACATCGCTTGATCAGGAAGGTATAAAAA
TGGCATTGCAGCACGTATTGGAGCGTGTCCCGTGA
Protein sequenceShow/hide protein sequence
MFIRHEKIRWVLIGFAAVGEFPFGAQSAIAIVIVIINMPICTATLGSLSPYSFLSHFASTDSSPSSILTHNHNSPSKRSPNFPPRLSLPANPDPIAGVLDTTSPASPESL
PRSRRSEDWKAAREYLDSGFIYEGRIEGSNAGGLLVRFHSLVGFLPFPQLSPSHSCKEPYKSIQDIAKSLLGSIVPVKIIQADERNKKLIFSEKEAAWSKFSEQVSVGEV
YDARVGSVEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVRVKVINVDREKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPRSDSEIIPLP
GLETIFEELLQEDGIEDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFILLARAGRQVQEIQLTTSLDQEGIKMALQHVLERVP