CuGenDBv2

IVF0004718 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0004718
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: 30S ribosomal protein S1
Genome location: chr07:22445825..22451651
RNA-Seq Expression: IVF0004718
Synteny: IVF0004718
Gene Ontology terms:
GO:0000481 - maturation of 5S rRNA (biological process)
GO:0009737 - response to abscisic acid (biological process)
GO:0032508 - DNA duplex unwinding (biological process)
GO:0034337 - RNA folding (biological process)
GO:0005840 - ribosome (cellular component)
GO:0009507 - chloroplast (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003729 - mRNA binding (molecular function)
InterPro domains:
IPR003029 - S1 domain
IPR012340 - Nucleic acid-binding, OB-fold
IPR022967 - RNA-binding domain, S1
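
The genome location above follows a `chromosome:start..end` pattern. As a rough illustration, the sketch below parses such a string and derives the span length. The names `Locus` and `parse_location` are hypothetical helpers, not part of any database API, and the length calculation assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a 'chrom:start..end' string into a Locus."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Locus(chrom, start, end)


locus = parse_location("chr07:22445825..22451651")
print(locus.chrom, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the locus of IVF0004718 spans 5827 bp on chr07.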


Homology
GenBank top hits (e value, % identity, alignment)
KAA0044578.1 30S ribosomal protein S1 isoform X1 [Cucumis melo var. makuwa] (e value: 1.36e-254, identity: 99.21%)
Query:  MPIFLATIASVSTHSFLSLLASTSDASSTSSSSSSSILPLKSPSKRPSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGRI
        MPIFLATIASVSTHSFLSLLASTSDASSTSSSSSSSILPLKSPSKRPSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGRI
Subjt:  MPIFLATIASVSTHSFLSLLASTSDASSTSSSSSSSILPLKSPSKRPSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGRI

Query:  EGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKV-IQADERNKKLIFSEKEATWSKFSGQVGVGDVYEAKVGSLEDYGAFV
        EGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKV IQADERNKKLIFSEKEATWSKFSGQVGVGDVYEAKVGSLEDYGAFV
Subjt:  EGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKV-IQADERNKKLIFSEKEATWSKFSGQVGVGDVYEAKVGSLEDYGAFV

Query:  HLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDR--DKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKSDSEIIPLPGLG
        HLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDR  DKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKSDSEIIPLPGLG
Subjt:  HLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDR--DKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKSDSEIIPLPGLG

Query:  TIIEELQQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPIEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
        TIIEELQQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPIEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
Subjt:  TIIEELQQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPIEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

XP_004152109.1 uncharacterized protein LOC101213559 isoform X2 [Cucumis sativus] (e value: 5.11e-240, identity: 93.92%)
Query:  MPIFLATIASVSTHSFLSLLASTSDASSTSSSSSSSILPLKSPSKRPSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGRI
        MPIF+ATIASVS HSFLSLLASTSDASSTSSSSSS ILPLKSPSKR SIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGRI
Subjt:  MPIFLATIASVSTHSFLSLLASTSDASSTSSSSSSSILPLKSPSKRPSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGRI

Query:  EGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADERNKKLIFSEKEATWSKFSGQVGVGDVYEAKVGSLEDYGAFVH
        EGSNAGGLLVRFYSL+GFLPFPQLSPSHSCKEP KSIQDIAKSL GSLISVKVIQADE+N+KLIFSEKEA  SKFSGQV VGDVYE KVGS+EDYGAFVH
Subjt:  EGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADERNKKLIFSEKEATWSKFSGQVGVGDVYEAKVGSLEDYGAFVH

Query:  LRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKSDSEIIPLPGLGTII
        LR SDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINV+++KSRITLSIRQLEEDPLLETLDKVIPQ+SSAEPDSFGPK DSEIIPLPGL TII
Subjt:  LRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKSDSEIIPLPGLGTII

Query:  EELQQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPIEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
        EEL QEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPP+EKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
Subjt:  EELQQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPIEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

XP_008454000.1 PREDICTED: 30S ribosomal protein S1 isoform X1 [Cucumis melo] (e value: 2.97e-291, identity: 100%)
Query:  MRHANKRWVPIRICSSWWHSISESFPITLLPAIVFPLSIPIIMPIFLATIASVSTHSFLSLLASTSDASSTSSSSSSSILPLKSPSKRPSIFPSRVSLSG
        MRHANKRWVPIRICSSWWHSISESFPITLLPAIVFPLSIPIIMPIFLATIASVSTHSFLSLLASTSDASSTSSSSSSSILPLKSPSKRPSIFPSRVSLSG
Subjt:  MRHANKRWVPIRICSSWWHSISESFPITLLPAIVFPLSIPIIMPIFLATIASVSTHSFLSLLASTSDASSTSSSSSSSILPLKSPSKRPSIFPSRVSLSG

Query:  KPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADE
        KPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADE
Subjt:  KPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADE

Query:  RNKKLIFSEKEATWSKFSGQVGVGDVYEAKVGSLEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQL
        RNKKLIFSEKEATWSKFSGQVGVGDVYEAKVGSLEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQL
Subjt:  RNKKLIFSEKEATWSKFSGQVGVGDVYEAKVGSLEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQL

Query:  EEDPLLETLDKVIPQDSSAEPDSFGPKSDSEIIPLPGLGTIIEELQQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPIEKKFTLLARAGRQVQEIQLT
        EEDPLLETLDKVIPQDSSAEPDSFGPKSDSEIIPLPGLGTIIEELQQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPIEKKFTLLARAGRQVQEIQLT
Subjt:  EEDPLLETLDKVIPQDSSAEPDSFGPKSDSEIIPLPGLGTIIEELQQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPIEKKFTLLARAGRQVQEIQLT

Query:  TSLDQEGIKRALQRVLERVP
        TSLDQEGIKRALQRVLERVP
Subjt:  TSLDQEGIKRALQRVLERVP

XP_011653045.1 uncharacterized protein LOC101213559 isoform X1 [Cucumis sativus] (e value: 1.14e-237, identity: 92.93%)
Query:  MPIFLATIASVSTHSFLSLLASTSDASSTSSSSSSSILPLKSPSKRPSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGRI
        MPIF+ATIASVS HSFLSLLASTSDASSTSSSSSS ILPLKSPSKR SIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGRI
Subjt:  MPIFLATIASVSTHSFLSLLASTSDASSTSSSSSSSILPLKSPSKRPSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGRI

Query:  EGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADERNKKLIFSEKEATWSKFSGQVGVGDVYEAKVGSLEDYGAFVH
        EGSNAGGLLVRFYSL+GFLPFPQLSPSHSCKEP KSIQDIAKSL GSLISVKVIQADE+N+KLIFSEKEA  SKFSGQV VGDVYE KVGS+EDYGAFVH
Subjt:  EGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADERNKKLIFSEKEATWSKFSGQVGVGDVYEAKVGSLEDYGAFVH

Query:  LRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKSDSEIIPLPGLGTII
        LR SDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINV+++KSRITLSIRQLEEDPLLETLDKVIPQ+SSAEPDSFGPK DSEIIPLPGL TII
Subjt:  LRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKSDSEIIPLPGLGTII

Query:  EELQQEEG----IVDVRVNRQGFEKRVVSQDLQLWLSNAPPIEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
        EEL QEEG    IVDVRVNRQGFEKRVVSQDLQLWLSNAPP+EKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
Subjt:  EELQQEEG----IVDVRVNRQGFEKRVVSQDLQLWLSNAPPIEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

XP_038897871.1 30S ribosomal protein S1 homolog B [Benincasa hispida] (e value: 9.58e-231, identity: 90.03%)
Query:  MPIFLATIASVSTHSFLSLLASTSDASSTSSSSSSSILPLKSPSKRPSIFPSRVSLSGKPDPIAGVLDTSP---ESVRRARRSADWKAAREYLDSGFIYE
        MPIF AT+ SV  HSFLSLLAST+D +S    S+S ILP KSPSKRPS FP+RVSLSGKPDPIAGVLDTSP   ES+RRARRSADWKAAREYLD+GFIYE
Subjt:  MPIFLATIASVSTHSFLSLLASTSDASSTSSSSSSSILPLKSPSKRPSIFPSRVSLSGKPDPIAGVLDTSP---ESVRRARRSADWKAAREYLDSGFIYE

Query:  GRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADERNKKLIFSEKEATWSKFSGQVGVGDVYEAKVGSLEDYGA
        GRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEP KSIQDIAKSL GSLISVKVIQADERNKKLIFSEKEA WSKFS QVGVGDVYEA+VGS+EDYGA
Subjt:  GRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADERNKKLIFSEKEATWSKFSGQVGVGDVYEAKVGSLEDYGA

Query:  FVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKSDSEIIPLPGLG
        FVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSI+QLEEDPLLETLDKVIPQ  SAEPDSFGPKSDSEI+PLPGL 
Subjt:  FVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKSDSEIIPLPGLG

Query:  TIIEELQQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPIEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
        TIIEEL QE+GIVD+ VNRQGFEKRVVSQDLQLWLSNAPP+EKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
Subjt:  TIIEELQQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPIEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KTX2 Uncharacterized protein (e value: 3.6e-188, identity: 93.92%)
Query:  MPIFLATIASVSTHSFLSLLASTSDASSTSSSSSSSILPLKSPSKRPSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGRI
        MPIF+ATIASVS HSFLSLLASTSDASSTSSSSSS ILPLKSPSKR SIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGRI
Subjt:  MPIFLATIASVSTHSFLSLLASTSDASSTSSSSSSSILPLKSPSKRPSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGRI

Query:  EGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADERNKKLIFSEKEATWSKFSGQVGVGDVYEAKVGSLEDYGAFVH
        EGSNAGGLLVRFYSL+GFLPFPQLSPSHSCKEP KSIQDIAKSL GSLISVKVIQADE+N+KLIFSEKEA  SKFSGQV VGDVYE KVGS+EDYGAFVH
Subjt:  EGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADERNKKLIFSEKEATWSKFSGQVGVGDVYEAKVGSLEDYGAFVH

Query:  LRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKSDSEIIPLPGLGTII
        LR SDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINV+++KSRITLSIRQLEEDPLLETLDKVIPQ+SSAEPDSFGPK DSEIIPLPGL TII
Subjt:  LRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKSDSEIIPLPGLGTII

Query:  EELQQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPIEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
        EEL QEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPP+EKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
Subjt:  EELQQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPIEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

A0A1S3BXL4 30S ribosomal protein S1 isoform X1 (e value: 3.5e-228, identity: 100%)
Query:  MRHANKRWVPIRICSSWWHSISESFPITLLPAIVFPLSIPIIMPIFLATIASVSTHSFLSLLASTSDASSTSSSSSSSILPLKSPSKRPSIFPSRVSLSG
        MRHANKRWVPIRICSSWWHSISESFPITLLPAIVFPLSIPIIMPIFLATIASVSTHSFLSLLASTSDASSTSSSSSSSILPLKSPSKRPSIFPSRVSLSG
Subjt:  MRHANKRWVPIRICSSWWHSISESFPITLLPAIVFPLSIPIIMPIFLATIASVSTHSFLSLLASTSDASSTSSSSSSSILPLKSPSKRPSIFPSRVSLSG

Query:  KPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADE
        KPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADE
Subjt:  KPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADE

Query:  RNKKLIFSEKEATWSKFSGQVGVGDVYEAKVGSLEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQL
        RNKKLIFSEKEATWSKFSGQVGVGDVYEAKVGSLEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQL
Subjt:  RNKKLIFSEKEATWSKFSGQVGVGDVYEAKVGSLEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQL

Query:  EEDPLLETLDKVIPQDSSAEPDSFGPKSDSEIIPLPGLGTIIEELQQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPIEKKFTLLARAGRQVQEIQLT
        EEDPLLETLDKVIPQDSSAEPDSFGPKSDSEIIPLPGLGTIIEELQQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPIEKKFTLLARAGRQVQEIQLT
Subjt:  EEDPLLETLDKVIPQDSSAEPDSFGPKSDSEIIPLPGLGTIIEELQQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPIEKKFTLLARAGRQVQEIQLT

Query:  TSLDQEGIKRALQRVLERVP
        TSLDQEGIKRALQRVLERVP
Subjt:  TSLDQEGIKRALQRVLERVP

A0A5A7TNY0 30S ribosomal protein S1 isoform X1 (e value: 2.6e-199, identity: 99.21%)
Query:  MPIFLATIASVSTHSFLSLLASTSDASSTSSSSSSSILPLKSPSKRPSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGRI
        MPIFLATIASVSTHSFLSLLASTSDASSTSSSSSSSILPLKSPSKRPSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGRI
Subjt:  MPIFLATIASVSTHSFLSLLASTSDASSTSSSSSSSILPLKSPSKRPSIFPSRVSLSGKPDPIAGVLDTSPESVRRARRSADWKAAREYLDSGFIYEGRI

Query:  EGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVK-VIQADERNKKLIFSEKEATWSKFSGQVGVGDVYEAKVGSLEDYGAFV
        EGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVK VIQADERNKKLIFSEKEATWSKFSGQVGVGDVYEAKVGSLEDYGAFV
Subjt:  EGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVK-VIQADERNKKLIFSEKEATWSKFSGQVGVGDVYEAKVGSLEDYGAFV

Query:  HLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDR--DKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKSDSEIIPLPGLG
        HLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDR  DKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKSDSEIIPLPGLG
Subjt:  HLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDR--DKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKSDSEIIPLPGLG

Query:  TIIEELQQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPIEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
        TIIEELQQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPIEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
Subjt:  TIIEELQQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPIEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

A0A6J1GVD7 LOW QUALITY PROTEIN: uncharacterized protein LOC111457527 (e value: 5.9e-175, identity: 80.33%)
Query:  MRHANKRWVPIRICSSW---WHSISESFPITLLPAIVFPLSIPIIMPIFLA-TIASVSTHSFLSLLASTSDASSTSSSSSSSILPLKSPSKRPSIFPSRV
        +RH N  ++PI   +     + + SE F   LLPAI FPL  P  MPIF A T+ S+S HSFLSL AS   A +++S S+S +L  KSPSKRPS F +RV
Subjt:  MRHANKRWVPIRICSSW---WHSISESFPITLLPAIVFPLSIPIIMPIFLA-TIASVSTHSFLSLLASTSDASSTSSSSSSSILPLKSPSKRPSIFPSRV

Query:  SLSGKPDPIAGVLD---TSPESVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISV
        SLSGKP+PIAGVL+   +SPESVRRARRSADWK AREYLDSGFI++GRIEGSNAGGLLVRFYSL+GFLPFP LSP+HSCKEP KSIQDIAKSL GSLI V
Subjt:  SLSGKPDPIAGVLD---TSPESVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISV

Query:  KVIQADERNKKLIFSEKEATWSKFSGQVGVGDVYEAKVGSLEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRI
        KVIQADE+NK LIFSEKEA WSKFS +VGVGDVYEA+VGSLEDYGAFVHLRFSDGLYHLTGLVH+SEVSWDLVQDVRDILSEGDEV VKVI+VDRDKSRI
Subjt:  KVIQADERNKKLIFSEKEATWSKFSGQVGVGDVYEAKVGSLEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRI

Query:  TLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKSDSEIIPLPGLGTIIEELQQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPIEKKFTLLARAGRQ
        TLSI+QLEEDPLLETLDKVIPQD SAEPDSFGPKSDSEIIPLPGL TI EEL QEEGI DV +NRQGFEKRVVSQDLQLWLSNAPP+EKKFTLLARAGRQ
Subjt:  TLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKSDSEIIPLPGLGTIIEELQQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPIEKKFTLLARAGRQ

Query:  VQEIQLTTSLDQEGIKRALQRVLERVP
        VQEIQLTTSLDQEGIKRALQRVLERVP
Subjt:  VQEIQLTTSLDQEGIKRALQRVLERVP

A0A6J1IUF3 uncharacterized protein LOC111479406 (e value: 6.7e-171, identity: 86.13%)
Query:  MPIFLA-TIASVSTHSFLSLLASTSDASSTSSSSSSSILPLKSPSKRPSIFPSRVSLSGKPDPIAGVLD---TSPESVRRARRSADWKAAREYLDSGFIY
        MPIF A T+ S+S HSFLSLLAST DAS   S SSS +L  KSPSKRPS F +RVSLSGKP+PIAGVL+   +SPESVRRARRSADWK AREYLDSGFI+
Subjt:  MPIFLA-TIASVSTHSFLSLLASTSDASSTSSSSSSSILPLKSPSKRPSIFPSRVSLSGKPDPIAGVLD---TSPESVRRARRSADWKAAREYLDSGFIY

Query:  EGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADERNKKLIFSEKEATWSKFSGQVGVGDVYEAKVGSLEDYG
        +GRIEGSNAGGLLVRFYSL+GFLPFP LSP+HSCKEP KSIQDIAKSL GSLI VKVIQADE+NK LIFSEKEA WSKFS QVGVGDVYEA+VGS+EDYG
Subjt:  EGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADERNKKLIFSEKEATWSKFSGQVGVGDVYEAKVGSLEDYG

Query:  AFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKSDSEIIPLPGL
        AFVHLRFSDG YHLTGLVH+SEVSWDLVQDVRDILSEGDEV VKV++VDRDKSRITLSI+QLEEDPLLETLDKVIPQD SAEPDSFGPKSDSEIIPLPGL
Subjt:  AFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKSDSEIIPLPGL

Query:  GTIIEELQQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPIEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
         TI EEL QEEGI DV +NRQGFEKRVVSQDLQLWLSNAPP+EKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
Subjt:  GTIIEELQQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPIEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

SwissProt top hits (e value, % identity, alignment)
P29344 30S ribosomal protein S1, chloroplastic (e value: 1.9e-21, identity: 31.05%)
Query:  SVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADERNKKLIFSEKEATW
        S+R+ +    W+  R+      + +G+I G+N GG++     L GF+PF Q+S   S +E           L    I +K ++ DE   +L+ S ++A  
Subjt:  SVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADERNKKLIFSEKEATW

Query:  SKFSGQVGVGDVYEAKVGSLEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQLEEDP
        +    Q+G+G V    V SL+ YGAF+ +        + GL+HVS++S D V D+  +L  GD + V +++ DR++ R++LS ++LE  P
Subjt:  SKFSGQVGVGDVYEAKVGSLEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQLEEDP

P46228 30S ribosomal protein S1 (e value: 5.7e-18, identity: 28.95%)
Query:  SVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADERNKKLIFSEKEATW
        S+RR      W+  R+           +  +N GG LVR   L GF+P   +S +   KE           L G  + +K ++ DE   +L+ S + A  
Subjt:  SVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADERNKKLIFSEKEATW

Query:  SKFSGQVGVGDVYEAKVGSLEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQLEEDP
         +   ++ VG+V    V  ++ YGAF+ +        ++GL+H+SE+S D ++    + +  DEV V +I++D ++ RI+LS +QLE +P
Subjt:  SKFSGQVGVGDVYEAKVGSLEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQLEEDP

P74142 30S ribosomal protein S1 homolog B (e value: 3.4e-18, identity: 29.35%)
Query:  SVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADERNKKLIFSEKEATW
        S R+ +    W+   E  +SG   E  + G+N GG++     L GF+P      SH   + N        +L G ++   +++A++ N KL+ +++    
Subjt:  SVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADERNKKLIFSEKEATW

Query:  SKFSGQVGVGDVYEAKVGSLEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQLEEDP--LLETLDKV
        ++  G++  G++YE KV  ++ YG FV +        +TGL+HVS+VS   V  +  + + G  ++V V  +D  K+RI+LS R LE  P  L+E  D++
Subjt:  SKFSGQVGVGDVYEAKVGSLEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQLEEDP--LLETLDKV

Query:  I
        +
Subjt:  I

Q93VC7 30S ribosomal protein S1, chloroplastic (e value: 7.3e-21, identity: 31.58%)
Query:  SVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADERNKKLIFSEKEATW
        S+R  +    W+  R+      I + ++ G+N GGL+     L GF+PF Q+S   + +E           L    I +K ++ DE   KL+ S ++A  
Subjt:  SVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADERNKKLIFSEKEATW

Query:  SKFSGQVGVGDVYEAKVGSLEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQLEEDP
        +    Q+G+G V    V SL+ YGAF+ +        + GL+HVS++S D V D+  +L  GD + V +++ DRD+ R++LS ++LE  P
Subjt:  SKFSGQVGVGDVYEAKVGSLEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQLEEDP

Q9JZ44 30S ribosomal protein S1 (e value: 2.6e-18, identity: 30.77%)
Query:  SVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADERNKKLIFSEK---E
        S  +A+R+ADW A  E +++G I  G I G   GGL V   S+  FLP   +          + ++D      G  I  KVI+ D++   ++ S +   E
Subjt:  SVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADERNKKLIFSEK---E

Query:  ATWSK----FSGQVGVGDVYEAKVGSLEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQLEEDPLLE
        AT  +        +  G V +  V ++ DYGAFV L   DGL H+T      +++W  V+   ++L  G EV  KV+  D++K R++L ++QL EDP   
Subjt:  ATWSK----FSGQVGVGDVYEAKVGSLEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQLEEDPLLE

Query:  TLDKVIPQDSSAEPDSFGPKSDSEIIPLPGLGTIIEELQQEEGIVDV
         L +  PQ         G +   ++  L   G  +E  Q  EG+V V
Subjt:  TLDKVIPQDSSAEPDSFGPKSDSEIIPLPGLGTIIEELQQEEGIVDV

Arabidopsis top hits (e value, % identity, alignment)
AT1G12800.1 Nucleic acid-binding, OB-fold-like protein (e value: 4.8e-20, identity: 31.73%)
Query:  QDIAKSLTGSLISVKVIQADERNKKLIFS----EKEATWSK---FSGQVGVGDVYEAKVGSLEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDI
        Q    S  G  I V V+ A+  ++KLIFS    E E    K      ++ VGDV +  +  +  +G F  L        +  LVH SEVSWD   D    
Subjt:  QDIAKSLTGSLISVKVIQADERNKKLIFS----EKEATWSK---FSGQVGVGDVYEAKVGSLEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDI

Query:  LSEGDEVTVKVINVDRDKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKSDSEIIPL--PGLGTIIEELQQEEGIVDVRVNRQGFEKRVVSQDL
           G  V  KV  +D    RI LS++++  DPL E L+ V+  D+    D  G +  +  +    P + ++I+EL+  EGI  V  +R  F    ++   
Subjt:  LSEGDEVTVKVINVDRDKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKSDSEIIPL--PGLGTIIEELQQEEGIVDVRVNRQGFEKRVVSQDL

Query:  QLWLSNAPPIEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERV
        Q+++  AP  E ++ LLARAG +VQE+ +  SL +E +K  +     RV
Subjt:  QLWLSNAPPIEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERV

AT3G23700.1 Nucleic acid-binding proteins superfamily (e value: 2.6e-114, identity: 61.08%)
Query:  SFLSL-LASTSDASSTSSSSSSSILPLKSPSKRPSIFPSRVSLSGKPDPIAG-----VLDTSPESVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGL
        +FLS  L     +SS+SS+ +S +  +KS S   +    R S       +       + DTS E+   A   +DWK A+ Y  SG  +EG ++G N GGL
Subjt:  SFLSL-LASTSDASSTSSSSSSSILPLKSPSKRPSIFPSRVSLSGKPDPIAG-----VLDTSPESVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGL

Query:  LVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADERNKKLIFSEKEATWSKFSGQVGVGDVYEAKVGSLEDYGAFVHLRFSDGLY
        L+RF+SL+GFLP+PQLSPS SCKEP KSI +IAK+L GS + VKV+QADE N+KLI SEK A W K+S  V VGDV+  +VGS+EDYGAF+HLRF DGLY
Subjt:  LVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADERNKKLIFSEKEATWSKFSGQVGVGDVYEAKVGSLEDYGAFVHLRFSDGLY

Query:  HLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKSDSEIIPLPGLGTIIEELQQEEG
        HLTGLVHVSEVSWD VQDVRD+L +GDEV V V N+D++KSRITLSI+QLE+DPLLETLDKVI +DSS    S    +   I PLPGL TI+EEL +E+G
Subjt:  HLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKSDSEIIPLPGLGTIIEELQQEEG

Query:  IVDVRVNRQGFEKRVVSQDLQLWLSNAPPIEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
        I  V++NRQGFEKRVVSQDLQLWLSN PP + KF LLARAGRQVQEI LTTSL+Q GIK+ALQ VLERVP
Subjt:  IVDVRVNRQGFEKRVVSQDLQLWLSNAPPIEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

AT4G29060.1 elongation factor Ts family protein (e value: 3.6e-07, identity: 35.37%)
Query:  GDVYEAKVGSLEDYGAFVHL-RFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQLEEDP
        G  +  KV +++ +GAFV    F+D      GLVHVS++S + V+DV  +++ G EV V+++  D +  RI+L++R+ ++ P
Subjt:  GDVYEAKVGSLEDYGAFVHL-RFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQLEEDP

AT4G29060.2 elongation factor Ts family protein (e value: 3.6e-07, identity: 35.37%)
Query:  GDVYEAKVGSLEDYGAFVHL-RFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQLEEDP
        G  +  KV +++ +GAFV    F+D      GLVHVS++S + V+DV  +++ G EV V+++  D +  RI+L++R+ ++ P
Subjt:  GDVYEAKVGSLEDYGAFVHL-RFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQLEEDP

AT5G30510.1 ribosomal protein S1 (e value: 5.2e-22, identity: 31.58%)
Query:  SVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADERNKKLIFSEKEATW
        S+R  +    W+  R+      I + ++ G+N GGL+     L GF+PF Q+S   + +E           L    I +K ++ DE   KL+ S ++A  
Subjt:  SVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADERNKKLIFSEKEATW

Query:  SKFSGQVGVGDVYEAKVGSLEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQLEEDP
        +    Q+G+G V    V SL+ YGAF+ +        + GL+HVS++S D V D+  +L  GD + V +++ DRD+ R++LS ++LE  P
Subjt:  SKFSGQVGVGDVYEAKVGSLEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQLEEDP
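
Each hit above reports a % identity next to a three-row alignment whose middle row follows the usual BLAST convention: a letter marks an identical residue, `+` a conservative substitution, and a blank a mismatch or gap. The sketch below shows one way such a percentage could be recomputed from a single alignment block. The function name `percent_identity` is hypothetical, and the choice of denominator (all alignment columns, gaps included) is an assumption. The values reported on this page are computed over the full alignment, so a single block will not generally reproduce them.

```python
def percent_identity(query_row: str, midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    query_row sets the number of columns; midline may be shorter when
    trailing blanks were stripped, and is treated as blank-padded.
    """
    columns = len(query_row)
    if columns == 0:
        raise ValueError("empty alignment row")
    if len(midline) > columns:
        raise ValueError("midline is longer than the query row")
    # Letters mark identities; '+' and blanks are not counted.
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / columns, 2)


# Toy rows for illustration only (not taken from a full alignment).
print(percent_identity("MPIFLATIAS", "MPIF+ATIAS"))
```

Taking the column count from the query row rather than the midline keeps the result stable when trailing whitespace has been lost in copying.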


Sequences
CDS sequence
ATGCGTCACGCAAATAAGCGATGGGTTCCCATTCGGATTTGCAGCAGTTGGTGGCATTCCATATCCGAATCCTTTCCCATTACGCTCCTCCCCGCCATTGTCTTCCCTCT
TTCCATTCCCATAATCATGCCAATCTTTCTTGCAACCATCGCCTCTGTCTCTACTCATTCCTTCCTCTCACTCCTTGCTTCCACTTCTGATGCTTCTTCAACCTCCTCCT
CCTCCTCCTCCTCCATTTTACCACTCAAATCCCCCTCTAAACGCCCTTCCATTTTCCCCTCCAGAGTTTCCCTCTCCGGAAAACCTGACCCCATTGCCGGAGTTCTGGAC
ACCTCCCCGGAATCCGTTCGACGTGCTCGGAGATCTGCTGATTGGAAGGCAGCGAGGGAATACCTTGATAGTGGATTTATCTATGAGGGCAGGATCGAAGGTTCAAATGC
TGGAGGCTTACTTGTCCGATTTTATTCTCTTATGGGGTTTCTTCCATTCCCTCAATTGAGTCCGTCTCATTCTTGTAAAGAACCAAACAAGAGTATTCAAGATATTGCAA
AAAGCTTAACTGGTTCGCTTATATCAGTAAAGGTAATCCAAGCAGATGAGAGAAACAAGAAATTGATATTTTCAGAGAAGGAAGCTACGTGGTCAAAGTTTTCTGGGCAA
GTGGGTGTGGGAGATGTCTATGAAGCTAAAGTTGGATCTCTGGAGGATTATGGTGCTTTTGTACATCTACGTTTCTCTGATGGTCTTTATCATCTTACTGGGCTAGTACA
TGTATCAGAAGTTTCATGGGATCTAGTTCAGGATGTACGAGACATATTAAGTGAGGGTGACGAAGTGACAGTGAAAGTCATTAATGTTGATAGGGATAAGTCTAGGATCA
CATTGTCGATCAGACAACTCGAGGAAGATCCACTTTTAGAAACATTGGACAAAGTAATACCGCAGGACAGTTCTGCTGAACCTGATTCTTTCGGACCTAAAAGCGACAGC
GAAATAATACCCCTCCCTGGACTTGGAACAATAATTGAAGAGCTACAGCAGGAAGAGGGTATAGTAGATGTTCGTGTCAACCGACAAGGATTTGAGAAACGGGTGGTTTC
ACAAGACCTACAGCTTTGGCTATCGAATGCACCTCCCATTGAAAAGAAGTTCACTCTCCTTGCTCGTGCCGGGAGGCAGGTTCAAGAAATTCAGCTGACAACATCACTCG
ATCAGGAAGGTATAAAGAGGGCATTGCAGCGAGTGTTGGAACGTGTCCCATGA
mRNA sequence
GTTTATGCGTCACGCAAATAAGCGATGGGTTCCCATTCGGATTTGCAGCAGTTGGTGGCATTCCATATCCGAATCCTTTCCCATTACGCTCCTCCCCGCCATTGTCTTCC
CTCTTTCCATTCCCATAATCATGCCAATCTTTCTTGCAACCATCGCCTCTGTCTCTACTCATTCCTTCCTCTCACTCCTTGCTTCCACTTCTGATGCTTCTTCAACCTCC
TCCTCCTCCTCCTCCTCCATTTTACCACTCAAATCCCCCTCTAAACGCCCTTCCATTTTCCCCTCCAGAGTTTCCCTCTCCGGAAAACCTGACCCCATTGCCGGAGTTCT
GGACACCTCCCCGGAATCCGTTCGACGTGCTCGGAGATCTGCTGATTGGAAGGCAGCGAGGGAATACCTTGATAGTGGATTTATCTATGAGGGCAGGATCGAAGGTTCAA
ATGCTGGAGGCTTACTTGTCCGATTTTATTCTCTTATGGGGTTTCTTCCATTCCCTCAATTGAGTCCGTCTCATTCTTGTAAAGAACCAAACAAGAGTATTCAAGATATT
GCAAAAAGCTTAACTGGTTCGCTTATATCAGTAAAGGTAATCCAAGCAGATGAGAGAAACAAGAAATTGATATTTTCAGAGAAGGAAGCTACGTGGTCAAAGTTTTCTGG
GCAAGTGGGTGTGGGAGATGTCTATGAAGCTAAAGTTGGATCTCTGGAGGATTATGGTGCTTTTGTACATCTACGTTTCTCTGATGGTCTTTATCATCTTACTGGGCTAG
TACATGTATCAGAAGTTTCATGGGATCTAGTTCAGGATGTACGAGACATATTAAGTGAGGGTGACGAAGTGACAGTGAAAGTCATTAATGTTGATAGGGATAAGTCTAGG
ATCACATTGTCGATCAGACAACTCGAGGAAGATCCACTTTTAGAAACATTGGACAAAGTAATACCGCAGGACAGTTCTGCTGAACCTGATTCTTTCGGACCTAAAAGCGA
CAGCGAAATAATACCCCTCCCTGGACTTGGAACAATAATTGAAGAGCTACAGCAGGAAGAGGGTATAGTAGATGTTCGTGTCAACCGACAAGGATTTGAGAAACGGGTGG
TTTCACAAGACCTACAGCTTTGGCTATCGAATGCACCTCCCATTGAAAAGAAGTTCACTCTCCTTGCTCGTGCCGGGAGGCAGGTTCAAGAAATTCAGCTGACAACATCA
CTCGATCAGGAAGGTATAAAGAGGGCATTGCAGCGAGTGTTGGAACGTGTCCCATGATTTCGGAAACAAAGTTCGTTTTTGTTCGATTTTGTATGAAGTTTTCTGTAAAG
AAGATGATGCCAACTCAATTGTTTCCAGTAAGTTGTATATTTTGTCTGATCTCTACATCTATTCTAATGGTAAAAGTAAAAGGTGTTGTAAGCATGAATGCTCGATCCTC
GTATAACAACAGCCTTCTGCTTGGAAGACTTGTAGTTTTTGCTTTTCGTAGTCAATATTATATATATGTATGTATGTACATATTCTTTGTTTCATTGAATTCAAAGA
Protein sequence
MRHANKRWVPIRICSSWWHSISESFPITLLPAIVFPLSIPIIMPIFLATIASVSTHSFLSLLASTSDASSTSSSSSSSILPLKSPSKRPSIFPSRVSLSGKPDPIAGVLD
TSPESVRRARRSADWKAAREYLDSGFIYEGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPNKSIQDIAKSLTGSLISVKVIQADERNKKLIFSEKEATWSKFSGQ
VGVGDVYEAKVGSLEDYGAFVHLRFSDGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRDKSRITLSIRQLEEDPLLETLDKVIPQDSSAEPDSFGPKSDS
EIIPLPGLGTIIEELQQEEGIVDVRVNRQGFEKRVVSQDLQLWLSNAPPIEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
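
The CDS and protein sequences above are related by translation. As a minimal sketch, assuming the standard genetic code applies to this nuclear gene, the snippet below translates a coding sequence codon by codon and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration, not functions from any library.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE.get(codon)
        if aa is None:
            raise ValueError(f"unrecognised codon: {codon}")
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
print(translate("ATGCGTCACGCAAATAAGCGATGGGTTCCC"))  # MRHANKRWVP
```

The ten residues printed match the start of the protein sequence listed above (`MRHANKRWVP`). Passing the full CDS, with its line breaks, to `translate` is one way to check the two listings against each other.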