; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc07G15340 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc07G15340
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description30S ribosomal protein S1
Genome locationClcChr07:29923184..29929077
RNA-Seq ExpressionClc07G15340
SyntenyClc07G15340
Gene Ontology termsGO:0009987 - cellular process (biological process)
GO:0016020 - membrane (cellular component)
GO:0043229 - intracellular organelle (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR003029 - S1 domain
IPR012340 - Nucleic acid-binding, OB-fold
IPR022967 - RNA-binding domain, S1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7018465.1 30S ribosomal protein S1, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]2.2e-16471.22Show/hide
Query:  MPIF-ATTLPSLSAPSFLSLFPSIDASHSPSTSFLLPLKSPSTRPSNFPSRVSLSGKPDPLAAGLLDTSPSSPESLRRARRSADWKAAREFLDNGFIYEG
        MPIF ATTL SLSA SFLSL  S DAS+S STSFLL  KSPS RPSNF +RVSLSGKP+P+ AG+L++SPSSPES+RRARRSADWK ARE+LD+GFI++G
Subjt:  MPIF-ATTLPSLSAPSFLSLFPSIDASHSPSTSFLLPLKSPSTRPSNFPSRVSLSGKPDPLAAGLLDTSPSSPESLRRARRSADWKAAREFLDNGFIYEG

Query:  RIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLIPVKRVPDVSSFHLAPSKPLPNCTDSSKPVNTAHLAAGTIIHCLVPSWDS
        RIEGSNAGGLLVRFYSL+GFLPFP LSP+HSCKEPYKSIQDIAKSLIGSLIPVK                                              
Subjt:  RIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLIPVKRVPDVSSFHLAPSKPLPNCTDSSKPVNTAHLAAGTIIHCLVPSWDS

Query:  EVISSNQPLVIQADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTGLV
                 VIQADE+NK LIFSEKEAAWSKFSE+VGVGDVYEARVGSVEDYGAFVHLRFSD                             GLYHLTGLV
Subjt:  EVISSNQPLVIQADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTGLV

Query:  HVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPKSDSEIIPLPGLETIIEELLQE
        H+SEVSWDLVQDVRDILSEGDEV VKVI+VDR        DKSRITLSIKQLEEDPLLETLDKVIPQD SAEPDSFGPKSDSEIIPLPGL+TI EELLQE
Subjt:  HVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPKSDSEIIPLPGLETIIEELLQE

Query:  DGLNISIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
        +G    I DV +NRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
Subjt:  DGLNISIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

XP_022955545.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111457527 [Cucurbita moschata]4.9e-16471.01Show/hide
Query:  MPIF-ATTLPSLSAPSFLSLFPSIDASHSPSTSFLLPLKSPSTRPSNFPSRVSLSGKPDPLAAGLLDTSPSSPESLRRARRSADWKAAREFLDNGFIYEG
        MPIF ATTL SLSA SFLSL  S DAS+S STSFLL  KSPS RPSNF +RVSLSGKP+P+ AG+L++SPSSPES+RRARRSADWK ARE+LD+GFI++G
Subjt:  MPIF-ATTLPSLSAPSFLSLFPSIDASHSPSTSFLLPLKSPSTRPSNFPSRVSLSGKPDPLAAGLLDTSPSSPESLRRARRSADWKAAREFLDNGFIYEG

Query:  RIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLIPVKRVPDVSSFHLAPSKPLPNCTDSSKPVNTAHLAAGTIIHCLVPSWDS
        RIEGSNAGGLLVRFYSL+GFLPFP LSP+HSCKEPYKSIQDIAKSLIGSLIPVK                                              
Subjt:  RIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLIPVKRVPDVSSFHLAPSKPLPNCTDSSKPVNTAHLAAGTIIHCLVPSWDS

Query:  EVISSNQPLVIQADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTGLV
                 VIQADE+NK LIFSEKEAAWSKFSE+VGVGDVYEARVGS+EDYGAFVHLRFSD                             GLYHLTGLV
Subjt:  EVISSNQPLVIQADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTGLV

Query:  HVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPKSDSEIIPLPGLETIIEELLQE
        H+SEVSWDLVQDVRDILSEGDEV VKVI+VDR        DKSRITLSIKQLEEDPLLETLDKVIPQD SAEPDSFGPKSDSEIIPLPGL+TI EELLQE
Subjt:  HVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPKSDSEIIPLPGLETIIEELLQE

Query:  DGLNISIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
        +G    I DV +NRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
Subjt:  DGLNISIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

XP_022979820.1 uncharacterized protein LOC111479406 [Cucurbita maxima]2.2e-16470.8Show/hide
Query:  MPIF-ATTLPSLSAPSFLSLFPSIDASHSPSTSFLLPLKSPSTRPSNFPSRVSLSGKPDPLAAGLLDTSPSSPESLRRARRSADWKAAREFLDNGFIYEG
        MPIF ATTL SLSA SFLSL  S DASHS S+SF+L  KSPS RPSNF +RVSLSGKP+P+ AG+L++SPSSPES+RRARRSADWK ARE+LD+GFI++G
Subjt:  MPIF-ATTLPSLSAPSFLSLFPSIDASHSPSTSFLLPLKSPSTRPSNFPSRVSLSGKPDPLAAGLLDTSPSSPESLRRARRSADWKAAREFLDNGFIYEG

Query:  RIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLIPVKRVPDVSSFHLAPSKPLPNCTDSSKPVNTAHLAAGTIIHCLVPSWDS
        RIEGSNAGGLLVRFYSL+GFLPFP LSP+HSCKEPYKSIQDIAKSLIGSLIPVK                                              
Subjt:  RIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLIPVKRVPDVSSFHLAPSKPLPNCTDSSKPVNTAHLAAGTIIHCLVPSWDS

Query:  EVISSNQPLVIQADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTGLV
                 VIQADE+NK LIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSD                             G YHLTGLV
Subjt:  EVISSNQPLVIQADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTGLV

Query:  HVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPKSDSEIIPLPGLETIIEELLQE
        H+SEVSWDLVQDVRDILSEGDEV VKV++VDR        DKSRITLSIKQLEEDPLLETLDKVIPQD SAEPDSFGPKSDSEIIPLPGL+TI EELLQE
Subjt:  HVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPKSDSEIIPLPGLETIIEELLQE

Query:  DGLNISIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
        +G    I DV +NRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
Subjt:  DGLNISIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

XP_023526022.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111789621 [Cucurbita pepo subsp. pepo]1.7e-16471.22Show/hide
Query:  MPIF-ATTLPSLSAPSFLSLFPSIDASHSPSTSFLLPLKSPSTRPSNFPSRVSLSGKPDPLAAGLLDTSPSSPESLRRARRSADWKAAREFLDNGFIYEG
        MPIF ATTL SLSA SFL+L  S DAS+S STSF L  KSPS RPSNF +RVSLSGKPDP+ AG+L++SPSSPES+RRARRSADWK ARE+LD+GFI++G
Subjt:  MPIF-ATTLPSLSAPSFLSLFPSIDASHSPSTSFLLPLKSPSTRPSNFPSRVSLSGKPDPLAAGLLDTSPSSPESLRRARRSADWKAAREFLDNGFIYEG

Query:  RIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLIPVKRVPDVSSFHLAPSKPLPNCTDSSKPVNTAHLAAGTIIHCLVPSWDS
        RIEGSNAGGLLVRFYSL+GFLPFP LSP+HSCKEPYKSIQDIAKSLIGSLIPVK                                              
Subjt:  RIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLIPVKRVPDVSSFHLAPSKPLPNCTDSSKPVNTAHLAAGTIIHCLVPSWDS

Query:  EVISSNQPLVIQADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTGLV
                 VIQADE+NK LIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSD                             GLYHLTGLV
Subjt:  EVISSNQPLVIQADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTGLV

Query:  HVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPKSDSEIIPLPGLETIIEELLQE
        H+SEVSWDLVQDVRDILSEGDEV VKVI+VDR        DKSRITLSIKQLEEDPLLETLDKVIPQD SAEPDSFGPKSDSEIIPLPGL+TI EELLQE
Subjt:  HVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPKSDSEIIPLPGLETIIEELLQE

Query:  DGLNISIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
        +G    I DV +NRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
Subjt:  DGLNISIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

XP_038897871.1 30S ribosomal protein S1 homolog B [Benincasa hispida]2.4e-17174.32Show/hide
Query:  MPIFATTLPSLSAPSFLSLFPSIDASHSPSTSFLLPLKSPSTRPSNFPSRVSLSGKPDPLAAGLLDTSPSSPESLRRARRSADWKAAREFLDNGFIYEGR
        MPIFA TL S+ A SFLSL  S     S STSF+LP KSPS RPSNFP+RVSLSGKPDP+ AG+LDTSPSSPESLRRARRSADWKAARE+LDNGFIYEGR
Subjt:  MPIFATTLPSLSAPSFLSLFPSIDASHSPSTSFLLPLKSPSTRPSNFPSRVSLSGKPDPLAAGLLDTSPSSPESLRRARRSADWKAAREFLDNGFIYEGR

Query:  IEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLIPVKRVPDVSSFHLAPSKPLPNCTDSSKPVNTAHLAAGTIIHCLVPSWDSE
        IEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLI VK                                               
Subjt:  IEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLIPVKRVPDVSSFHLAPSKPLPNCTDSSKPVNTAHLAAGTIIHCLVPSWDSE

Query:  VISSNQPLVIQADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTGLVH
                VIQADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSD                             GLYHLTGLVH
Subjt:  VISSNQPLVIQADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTGLVH

Query:  VSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPKSDSEIIPLPGLETIIEELLQED
        VSEVSWDLVQDVRDILSEGDEVTVKVINVDR        DKSRITLSIKQLEEDPLLETLDKVIPQ GSAEPDSFGPKSDSEI+PLPGLETIIEELLQED
Subjt:  VSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPKSDSEIIPLPGLETIIEELLQED

Query:  GLNISIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
        G    IVD+ VNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
Subjt:  GLNISIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

TrEMBL top hitse value%identityAlignment
A0A0A0KTX2 Uncharacterized protein1.8e-15970.5Show/hide
Query:  MPIFATTLPSLSAPSFLSLFPSI-DAS--HSPSTSFLLPLKSPSTRPSNFPSRVSLSGKPDPLAAGLLDTSPSSPESLRRARRSADWKAAREFLDNGFIY
        MPIF  T+ S+SA SFLSL  S  DAS   S S+SF+LPLKSPS R S FPSRVSLSGKPDP+ AG+LDT   SPES+RRARRSADWKAARE+LD+GFIY
Subjt:  MPIFATTLPSLSAPSFLSLFPSI-DAS--HSPSTSFLLPLKSPSTRPSNFPSRVSLSGKPDPLAAGLLDTSPSSPESLRRARRSADWKAAREFLDNGFIY

Query:  EGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLIPVKRVPDVSSFHLAPSKPLPNCTDSSKPVNTAHLAAGTIIHCLVPSW
        EGRIEGSNAGGLLVRFYSL+GFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLI VK                                            
Subjt:  EGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLIPVKRVPDVSSFHLAPSKPLPNCTDSSKPVNTAHLAAGTIIHCLVPSW

Query:  DSEVISSNQPLVIQADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTG
                   VIQADE+N+KLIFSEKEAA SKFS QV VGDVYE +VGSVEDYGAFVHLR SD                             GLYHLTG
Subjt:  DSEVISSNQPLVIQADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTG

Query:  LVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPKSDSEIIPLPGLETIIEELL
        LVHVSEVSWDLVQDVRDILSEGDEVTVKVIN        V+K+KSRITLSI+QLEEDPLLETLDKVIPQ+ SAEPDSFGPK DSEIIPLPGLETIIEELL
Subjt:  LVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPKSDSEIIPLPGLETIIEELL

Query:  QEDGLNISIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
        QE+G    IVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
Subjt:  QEDGLNISIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

A0A1S3BXL4 30S ribosomal protein S1 isoform X11.9e-16171.13Show/hide
Query:  MPIFATTLPSLSAPSFLSLFPSI-DAS--HSPSTSFLLPLKSPSTRPSNFPSRVSLSGKPDPLAAGLLDTSPSSPESLRRARRSADWKAAREFLDNGFIY
        MPIF  T+ S+S  SFLSL  S  DAS   S S+S +LPLKSPS RPS FPSRVSLSGKPDP+ AG+LDT   SPES+RRARRSADWKAARE+LD+GFIY
Subjt:  MPIFATTLPSLSAPSFLSLFPSI-DAS--HSPSTSFLLPLKSPSTRPSNFPSRVSLSGKPDPLAAGLLDTSPSSPESLRRARRSADWKAAREFLDNGFIY

Query:  EGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLIPVKRVPDVSSFHLAPSKPLPNCTDSSKPVNTAHLAAGTIIHCLVPSW
        EGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEP KSIQDIAKSL GSLI VK                                            
Subjt:  EGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLIPVKRVPDVSSFHLAPSKPLPNCTDSSKPVNTAHLAAGTIIHCLVPSW

Query:  DSEVISSNQPLVIQADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTG
                   VIQADERNKKLIFSEKEA WSKFS QVGVGDVYEA+VGS+EDYGAFVHLRFSD                             GLYHLTG
Subjt:  DSEVISSNQPLVIQADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTG

Query:  LVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPKSDSEIIPLPGLETIIEELL
        LVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDR        DKSRITLSI+QLEEDPLLETLDKVIPQD SAEPDSFGPKSDSEIIPLPGL TIIEEL 
Subjt:  LVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPKSDSEIIPLPGLETIIEELL

Query:  QEDGLNISIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
        QE+G    IVDVRVNRQGFEKRVVSQDLQLWLSNAPP+EKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
Subjt:  QEDGLNISIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

A0A5A7TNY0 30S ribosomal protein S1 isoform X17.6e-16371.34Show/hide
Query:  MPIFATTLPSLSAPSFLSLFPSI-DAS--HSPSTSFLLPLKSPSTRPSNFPSRVSLSGKPDPLAAGLLDTSPSSPESLRRARRSADWKAAREFLDNGFIY
        MPIF  T+ S+S  SFLSL  S  DAS   S S+S +LPLKSPS RPS FPSRVSLSGKPDP+ AG+LDT   SPES+RRARRSADWKAARE+LD+GFIY
Subjt:  MPIFATTLPSLSAPSFLSLFPSI-DAS--HSPSTSFLLPLKSPSTRPSNFPSRVSLSGKPDPLAAGLLDTSPSSPESLRRARRSADWKAAREFLDNGFIY

Query:  EGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLIPVKRVPDVSSFHLAPSKPLPNCTDSSKPVNTAHLAAGTIIHCLVPSW
        EGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEP KSIQDIAKSL GSLI VK                                            
Subjt:  EGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLIPVKRVPDVSSFHLAPSKPLPNCTDSSKPVNTAHLAAGTIIHCLVPSW

Query:  DSEVISSNQPLVIQADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTG
                  +VIQADERNKKLIFSEKEA WSKFS QVGVGDVYEA+VGS+EDYGAFVHLRFSD                             GLYHLTG
Subjt:  DSEVISSNQPLVIQADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTG

Query:  LVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPKSDSEIIPLPGLETIIEELL
        LVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQ+      DKSRITLSI+QLEEDPLLETLDKVIPQD SAEPDSFGPKSDSEIIPLPGL TIIEEL 
Subjt:  LVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPKSDSEIIPLPGLETIIEELL

Query:  QEDGLNISIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
        QE+G    IVDVRVNRQGFEKRVVSQDLQLWLSNAPP+EKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
Subjt:  QEDGLNISIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

A0A6J1GVD7 LOW QUALITY PROTEIN: uncharacterized protein LOC1114575272.4e-16471.01Show/hide
Query:  MPIF-ATTLPSLSAPSFLSLFPSIDASHSPSTSFLLPLKSPSTRPSNFPSRVSLSGKPDPLAAGLLDTSPSSPESLRRARRSADWKAAREFLDNGFIYEG
        MPIF ATTL SLSA SFLSL  S DAS+S STSFLL  KSPS RPSNF +RVSLSGKP+P+ AG+L++SPSSPES+RRARRSADWK ARE+LD+GFI++G
Subjt:  MPIF-ATTLPSLSAPSFLSLFPSIDASHSPSTSFLLPLKSPSTRPSNFPSRVSLSGKPDPLAAGLLDTSPSSPESLRRARRSADWKAAREFLDNGFIYEG

Query:  RIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLIPVKRVPDVSSFHLAPSKPLPNCTDSSKPVNTAHLAAGTIIHCLVPSWDS
        RIEGSNAGGLLVRFYSL+GFLPFP LSP+HSCKEPYKSIQDIAKSLIGSLIPVK                                              
Subjt:  RIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLIPVKRVPDVSSFHLAPSKPLPNCTDSSKPVNTAHLAAGTIIHCLVPSWDS

Query:  EVISSNQPLVIQADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTGLV
                 VIQADE+NK LIFSEKEAAWSKFSE+VGVGDVYEARVGS+EDYGAFVHLRFSD                             GLYHLTGLV
Subjt:  EVISSNQPLVIQADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTGLV

Query:  HVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPKSDSEIIPLPGLETIIEELLQE
        H+SEVSWDLVQDVRDILSEGDEV VKVI+VDR        DKSRITLSIKQLEEDPLLETLDKVIPQD SAEPDSFGPKSDSEIIPLPGL+TI EELLQE
Subjt:  HVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPKSDSEIIPLPGLETIIEELLQE

Query:  DGLNISIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
        +G    I DV +NRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
Subjt:  DGLNISIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

A0A6J1IUF3 uncharacterized protein LOC1114794061.1e-16470.8Show/hide
Query:  MPIF-ATTLPSLSAPSFLSLFPSIDASHSPSTSFLLPLKSPSTRPSNFPSRVSLSGKPDPLAAGLLDTSPSSPESLRRARRSADWKAAREFLDNGFIYEG
        MPIF ATTL SLSA SFLSL  S DASHS S+SF+L  KSPS RPSNF +RVSLSGKP+P+ AG+L++SPSSPES+RRARRSADWK ARE+LD+GFI++G
Subjt:  MPIF-ATTLPSLSAPSFLSLFPSIDASHSPSTSFLLPLKSPSTRPSNFPSRVSLSGKPDPLAAGLLDTSPSSPESLRRARRSADWKAAREFLDNGFIYEG

Query:  RIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLIPVKRVPDVSSFHLAPSKPLPNCTDSSKPVNTAHLAAGTIIHCLVPSWDS
        RIEGSNAGGLLVRFYSL+GFLPFP LSP+HSCKEPYKSIQDIAKSLIGSLIPVK                                              
Subjt:  RIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLIPVKRVPDVSSFHLAPSKPLPNCTDSSKPVNTAHLAAGTIIHCLVPSWDS

Query:  EVISSNQPLVIQADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTGLV
                 VIQADE+NK LIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSD                             G YHLTGLV
Subjt:  EVISSNQPLVIQADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTGLV

Query:  HVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPKSDSEIIPLPGLETIIEELLQE
        H+SEVSWDLVQDVRDILSEGDEV VKV++VDR        DKSRITLSIKQLEEDPLLETLDKVIPQD SAEPDSFGPKSDSEIIPLPGL+TI EELLQE
Subjt:  HVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPKSDSEIIPLPGLETIIEELLQE

Query:  DGLNISIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
        +G    I DV +NRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
Subjt:  DGLNISIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

SwissProt top hitse value%identityAlignment
P50889 30S ribosomal protein S11.2e-0830.86Show/hide
Query:  VIQADERNKKLIFSEKEAAWSKFSEQ-------VGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTGLVHV
        VI+ D  N +LI S K  A  + + Q       + VG+V E  V  + D+GAFV L   D                                   GLVHV
Subjt:  VIQADERNKKLIFSEKEAAWSKFSEQ-------VGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTGLVHV

Query:  SEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDPLLETLDKV
        SE+S D V++  D+L++GD+V VK++ +D         +K RI+LSIK  +  P  E  D++
Subjt:  SEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDPLLETLDKV

Q8DWB2 Polyribonucleotide nucleotidyltransferase2.3e-0730.72Show/hide
Query:  IQADERNKKLIFSEKEAAWSKFSE-------QVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTGLVHVS
        I  DE     I+S  + A ++  E       +  VG++YEA V  +E +GAFVHL                            F  +        LVH+S
Subjt:  IQADERNKKLIFSEKEAAWSKFSE-------QVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTGLVHVS

Query:  EVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDP
        E++W     V D+L+ GD+VTVKV+ VD         DK RI  S+K L   P
Subjt:  EVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDP

Q93VC7 30S ribosomal protein S1, chloroplastic2.0e-0628.03Show/hide
Query:  EVISSNQPL-VIQADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTGL
        E++    PL  ++ DE   KL+ S ++A  +    Q+G+G V    V S++ YGAF+ +    G I                                GL
Subjt:  EVISSNQPL-VIQADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTGL

Query:  VHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDP
        +HVS++S D V D+  +L  GD + V +++ DR        D+ R++LS K+LE  P
Subjt:  VHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDP

Q9HZ71 30S ribosomal protein S11.1e-0625.42Show/hide
Query:  VIQADERNKKLIFSEKEA---AWSKFSEQVGVGDVYEARVGSVEDYGAF----------VHL--------------RFSDGNILCSIKLLILFADVFRNK
        V+  DE  +++    K+     W  FS Q   GD     + S+ D+G F          VHL              RF  G+    ++ +IL  D  R +
Subjt:  VIQADERNKKLIFSEKEA---AWSKFSEQVGVGDVYEARVGSVEDYGAF----------VHL--------------RFSDGNILCSIKLLILFADVFRNK

Query:  LPL--------TFQNSAGLYH------------------------LTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSR-ITLSI
        + L         F N A L+                         + G++  SE+S D V+D R++L EG+EV  K+I++DR         KSR I+LS+
Subjt:  LPL--------TFQNSAGLYH------------------------LTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSR-ITLSI

Query:  KQLEEDPLLETLDKVIPQDGSAEPDSFGPKSDSEII
        K  + D   + + ++  Q    E +S GP +  ++I
Subjt:  KQLEEDPLLETLDKVIPQDGSAEPDSFGPKSDSEII

Q9JZ44 30S ribosomal protein S13.8e-1025.52Show/hide
Query:  SLRRARRSADWKAAREFLDNGFIYEGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLIPVKRVPDVSSFHLAPSKPLPNCT
        S  +A+R+ADW A  E ++NG I  G I G   GGL V   S+  FLP                         GSL+ V+ V D S F            
Subjt:  SLRRARRSADWKAAREFLDNGFIYEGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLIPVKRVPDVSSFHLAPSKPLPNCT

Query:  DSSKPVNTAHLAAGTIIHCLVPSWDSE----VISSNQPLVIQADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIK
                     G  I   V   D +    V+S    L     E  K L+            E +  G V +  V ++ DYGAFV L   D        
Subjt:  DSSKPVNTAHLAAGTIIHCLVPSWDSE----VISSNQPLVIQADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIK

Query:  LLILFADVFRNKLPLTFQNSAGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDP
                                   GL+H+++++W  V+   ++L  G EV  KV+  D        ++K R++L +KQL EDP
Subjt:  LLILFADVFRNKLPLTFQNSAGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDP

Arabidopsis top hitse value%identityAlignment
AT1G12800.1 Nucleic acid-binding, OB-fold-like protein1.3e-1324.29Show/hide
Query:  RRARRSADWKAAREFLDNGFIYEGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHS--CKEPYKSIQDIAKSLIGSLIPVKRVPDVSSFHLAPSKPLPNCT
        R +    DW  A   +      +  +  S+  G  V + SL+GFLP+  L+        E +   + +  S     + V    DV+S       P P+ +
Subjt:  RRARRSADWKAAREFLDNGFIYEGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHS--CKEPYKSIQDIAKSLIGSLIPVKRVPDVSSFHLAPSKPLPNCT

Query:  DSSKPVNT--AHLAAGTIIHCLVPSWDSE--------VISSNQPLVIQADERNKKLIFS----EKEAAWSK---FSEQVGVGDVYEARVGSVEDYGAFVH
          S+   T    +++   +  L+  +D E        V    +  V+ A+  ++KLIFS    E E    K      ++ VGDV +  +  +  +G F  
Subjt:  DSSKPVNT--AHLAAGTIIHCLVPSWDSE--------VISSNQPLVIQADERNKKLIFS----EKEAAWSK---FSEQVGVGDVYEARVGSVEDYGAFVH

Query:  LRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDPL
        L                                     +  LVH SEVSWD   D       G  V  KV  +D  +         RI LS+K++  DPL
Subjt:  LRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDPL

Query:  LETLDKVIPQDGSAEPDSFGPKSDSEIIPL--PGLETIIEELLQEDGLNISIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQL
         E L+ V+  D     D  G +  +  +    P +E++I+EL   +G+  S+   R     F    ++   Q+++  AP  E ++ LLARAG +VQE+ +
Subjt:  LETLDKVIPQDGSAEPDSFGPKSDSEIIPL--PGLETIIEELLQEDGLNISIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQL

Query:  TTSLDQEGIKRALQRVLERV
          SL +E +K  +     RV
Subjt:  TTSLDQEGIKRALQRVLERV

AT3G23700.1 Nucleic acid-binding proteins superfamily3.9e-10349.28Show/hide
Query:  ATTLPSLSAPS--------FLS----LFPSIDASHSPSTSFLLPLKSPSTRPSNFPSRVS-LSGKPDPLAAG---LLDTSPSSPESLRRARRSADWKAAR
        ATTL S+S  S        FLS    L PS  +S S   S +  +KS S+  +    R S  S     L+A    L DTS  +      A   +DWK A+
Subjt:  ATTLPSLSAPS--------FLS----LFPSIDASHSPSTSFLLPLKSPSTRPSNFPSRVS-LSGKPDPLAAG---LLDTSPSSPESLRRARRSADWKAAR

Query:  EFLDNGFIYEGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLIPVKRVPDVSSFHLAPSKPLPNCTDSSKPVNTAHLAAGT
         +  +G  +EG ++G N GGLL+RF+SL+GFLP+PQLSPS SCKEP KSI +IAK+L+GS +PVK                                   
Subjt:  EFLDNGFIYEGRIEGSNAGGLLVRFYSLMGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLIPVKRVPDVSSFHLAPSKPLPNCTDSSKPVNTAHLAAGT

Query:  IIHCLVPSWDSEVISSNQPLVIQADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQN
                            V+QADE N+KLI SEK A W K+S+ V VGDV+  RVGSVEDYGAF+HLRF D                           
Subjt:  IIHCLVPSWDSEVISSNQPLVIQADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQN

Query:  SAGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPKSDSEIIPLPG
          GLYHLTGLVHVSEVSWD VQDVRD+L +GDEV V V N+D        K+KSRITLSIKQLE+DPLLETLDKVI +D S    S    +   I PLPG
Subjt:  SAGLYHLTGLVHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPKSDSEIIPLPG

Query:  LETIIEELLQEDGLNISIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP
        LETI+EELL+EDG+      V++NRQGFEKRVVSQDLQLWLSN PP + KF LLARAGRQVQEI LTTSL+Q GIK+ALQ VLERVP
Subjt:  LETIIEELLQEDGLNISIVDVRVNRQGFEKRVVSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP

AT5G30510.1 ribosomal protein S11.4e-0728.03Show/hide
Query:  EVISSNQPL-VIQADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTGL
        E++    PL  ++ DE   KL+ S ++A  +    Q+G+G V    V S++ YGAF+ +    G I                                GL
Subjt:  EVISSNQPL-VIQADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTGL

Query:  VHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDP
        +HVS++S D V D+  +L  GD + V +++ DR        D+ R++LS K+LE  P
Subjt:  VHVSEVSWDLVQDVRDILSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAATCTTTGCTACAACTCTCCCATCTCTCTCTGCTCCTTCCTTTCTCTCACTCTTCCCCTCCATTGATGCTTCTCACTCCCCTTCAACCTCCTTCCTTTTA
CCCCTTAAATCCCCTTCTACACGCCCTTCTAACTTCCCTTCCAGAGTTTCCCTCTCCGGAAAACCCGACCCCCTTGCTGCCGGACTTCTAGACACTTCCCCTTCC
TCCCCGGAATCACTTCGACGTGCTCGGAGATCTGCTGATTGGAAGGCAGCGAGGGAATTCCTTGATAATGGATTTATCTACGAGGGTAGGATTGAAGGTTCAAAC
GCTGGAGGTTTACTTGTTCGATTCTATTCTCTTATGGGGTTTCTTCCATTCCCTCAATTGAGCCCGTCTCATTCTTGTAAAGAACCATACAAGAGTATTCAAGAT
ATTGCAAAAAGCTTAATTGGTTCGCTTATACCAGTGAAGAGAGTCCCAGACGTGTCAAGCTTCCACCTGGCACCATCAAAGCCCCTCCCAAATTGCACCGACTCA
AGTAAGCCTGTGAACACAGCCCATCTTGCTGCAGGAACAATAATACATTGCTTAGTGCCATCATGGGATAGTGAAGTGATTTCTTCAAATCAACCACTGGTAATC
CAAGCAGATGAGAGAAACAAGAAATTGATATTTTCAGAGAAGGAAGCTGCGTGGTCAAAGTTTTCTGAGCAAGTTGGTGTAGGAGATGTGTATGAAGCTAGAGTT
GGATCTGTGGAGGATTATGGTGCCTTTGTACATCTACGTTTCTCTGATGGTAATATTTTGTGTTCTATTAAATTATTAATCTTATTCGCTGATGTATTTAGAAAC
AAATTGCCACTTACTTTCCAAAATTCTGCAGGTCTTTATCATCTTACTGGGCTAGTACATGTATCAGAAGTTTCATGGGATCTAGTTCAGGATGTAAGAGACATA
TTAAGTGAGGGTGACGAAGTGACAGTGAAAGTCATTAATGTTGATAGGCAAATAGGGAAGTTAGTTAGTAAAGATAAGTCCAGGATCACATTGTCAATCAAGCAA
CTCGAGGAAGATCCACTTTTAGAAACATTGGACAAAGTAATACCCCAGGATGGTTCTGCTGAACCTGATTCTTTCGGACCTAAAAGCGACAGTGAAATTATACCC
CTACCTGGACTTGAAACAATAATTGAAGAGCTACTGCAGGAAGATGGTTTGAATATCAGTATAGTAGATGTTCGTGTCAACCGACAAGGATTTGAGAAACGGGTG
GTTTCACAAGACCTACAGCTTTGGCTATCAAATGCACCTCCAGTTGAAAAGAAGTTCACTCTCCTTGCTCGTGCCGGGAGGCAGGTTCAAGAAATACAGCTGACA
ACATCACTCGATCAGGAAGGTATAAAAAGAGCATTGCAGCGAGTGTTGGAACGTGTCCCATGA
mRNA sequenceShow/hide mRNA sequence
GGACGTGTATGTTTTAGCGTCACGCAAATAAGCGATGGGTTCCGATGGGATTTGGAACAGTTGGTGGGTTTCCATTAGTCACTGCGCTCTTCCCCGCCATTTCCT
TCTCTCTCAACTCTCAAAATGCCAATCTTTGCTACAACTCTCCCATCTCTCTCTGCTCCTTCCTTTCTCTCACTCTTCCCCTCCATTGATGCTTCTCACTCCCCT
TCAACCTCCTTCCTTTTACCCCTTAAATCCCCTTCTACACGCCCTTCTAACTTCCCTTCCAGAGTTTCCCTCTCCGGAAAACCCGACCCCCTTGCTGCCGGACTT
CTAGACACTTCCCCTTCCTCCCCGGAATCACTTCGACGTGCTCGGAGATCTGCTGATTGGAAGGCAGCGAGGGAATTCCTTGATAATGGATTTATCTACGAGGGT
AGGATTGAAGGTTCAAACGCTGGAGGTTTACTTGTTCGATTCTATTCTCTTATGGGGTTTCTTCCATTCCCTCAATTGAGCCCGTCTCATTCTTGTAAAGAACCA
TACAAGAGTATTCAAGATATTGCAAAAAGCTTAATTGGTTCGCTTATACCAGTGAAGAGAGTCCCAGACGTGTCAAGCTTCCACCTGGCACCATCAAAGCCCCTC
CCAAATTGCACCGACTCAAGTAAGCCTGTGAACACAGCCCATCTTGCTGCAGGAACAATAATACATTGCTTAGTGCCATCATGGGATAGTGAAGTGATTTCTTCA
AATCAACCACTGGTAATCCAAGCAGATGAGAGAAACAAGAAATTGATATTTTCAGAGAAGGAAGCTGCGTGGTCAAAGTTTTCTGAGCAAGTTGGTGTAGGAGAT
GTGTATGAAGCTAGAGTTGGATCTGTGGAGGATTATGGTGCCTTTGTACATCTACGTTTCTCTGATGGTAATATTTTGTGTTCTATTAAATTATTAATCTTATTC
GCTGATGTATTTAGAAACAAATTGCCACTTACTTTCCAAAATTCTGCAGGTCTTTATCATCTTACTGGGCTAGTACATGTATCAGAAGTTTCATGGGATCTAGTT
CAGGATGTAAGAGACATATTAAGTGAGGGTGACGAAGTGACAGTGAAAGTCATTAATGTTGATAGGCAAATAGGGAAGTTAGTTAGTAAAGATAAGTCCAGGATC
ACATTGTCAATCAAGCAACTCGAGGAAGATCCACTTTTAGAAACATTGGACAAAGTAATACCCCAGGATGGTTCTGCTGAACCTGATTCTTTCGGACCTAAAAGC
GACAGTGAAATTATACCCCTACCTGGACTTGAAACAATAATTGAAGAGCTACTGCAGGAAGATGGTTTGAATATCAGTATAGTAGATGTTCGTGTCAACCGACAA
GGATTTGAGAAACGGGTGGTTTCACAAGACCTACAGCTTTGGCTATCAAATGCACCTCCAGTTGAAAAGAAGTTCACTCTCCTTGCTCGTGCCGGGAGGCAGGTT
CAAGAAATACAGCTGACAACATCACTCGATCAGGAAGGTATAAAAAGAGCATTGCAGCGAGTGTTGGAACGTGTCCCATGATTTTGCAAACAAAGTTTATTTTGT
TCGATTTTGTATGAAGCTGTCTGTAAAGAAGATGTCAATTCAATTGTTTCCAGTAAGTTGTATATTTTGTCTGATCTCTACGGTAAAAGTAAAAGGTGTTGTAAG
CATGAACGCTCGATCATTGTCCGACAGCCTTCTGCTTGGAAGATTTGTAGTTTTTGCTTTTCATAGCCAATATTATTATATACACGTATGTATGTATATATTCTT
TGTTTCATCGAATTCAAACATAGGTAAATTTCAATTAAC
Protein sequenceShow/hide protein sequence
MPIFATTLPSLSAPSFLSLFPSIDASHSPSTSFLLPLKSPSTRPSNFPSRVSLSGKPDPLAAGLLDTSPSSPESLRRARRSADWKAAREFLDNGFIYEGRIEGSN
AGGLLVRFYSLMGFLPFPQLSPSHSCKEPYKSIQDIAKSLIGSLIPVKRVPDVSSFHLAPSKPLPNCTDSSKPVNTAHLAAGTIIHCLVPSWDSEVISSNQPLVI
QADERNKKLIFSEKEAAWSKFSEQVGVGDVYEARVGSVEDYGAFVHLRFSDGNILCSIKLLILFADVFRNKLPLTFQNSAGLYHLTGLVHVSEVSWDLVQDVRDI
LSEGDEVTVKVINVDRQIGKLVSKDKSRITLSIKQLEEDPLLETLDKVIPQDGSAEPDSFGPKSDSEIIPLPGLETIIEELLQEDGLNISIVDVRVNRQGFEKRV
VSQDLQLWLSNAPPVEKKFTLLARAGRQVQEIQLTTSLDQEGIKRALQRVLERVP