CuGenDBv2

Clc09G11190 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc09G11190
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: 40S ribosomal protein S4
Genome location: ClcChr09:9902545..9910392
RNA-Seq Expression: Clc09G11190
Synteny: Clc09G11190
Gene Ontology terms:
GO:0006412 - translation (biological process)
GO:0022627 - cytosolic small ribosomal subunit (cellular component)
GO:0003735 - structural constituent of ribosome (molecular function)
GO:0019843 - rRNA binding (molecular function)
InterPro domains:
IPR000876 - Ribosomal protein S4e
IPR002942 - RNA-binding S4 domain
IPR005824 - KOW
IPR013843 - Ribosomal protein S4e, N-terminal
IPR013845 - Ribosomal protein S4e, central region
IPR014722 - Ribosomal protein L2, domain 2
IPR018199 - Ribosomal protein S4e, N-terminal, conserved site
IPR032277 - 40S ribosomal protein S4, C-terminal domain
IPR036986 - RNA-binding S4 domain superfamily
IPR038237 - Ribosomal protein S4e, central domain superfamily
IPR041982 - Ribosomal protein S4, KOW domain
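
The genome location in the record above is written as chromosome:start..end. As a minimal sketch, the string can be split into its parts and the locus span computed. The function names `parse_location` and `span_length` are hypothetical, and the span assumes 1-based, inclusive coordinates, which the page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Locus length, ASSUMING 1-based inclusive coordinates (not stated on the page)."""
    return end - start + 1


chrom, start, end = parse_location("ClcChr09:9902545..9910392")
print(chrom, start, end, span_length(start, end))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` needs to change.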


Homology
GenBank top hits    e value    % identity    Alignment
KAF3538276.1 hypothetical protein F2Q69_00018504, partial [Brassica cretica]    2.3e-152    66.82
Query:  ARGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRL
        ARGLKKHLKRLNAPKHW LDKLGGAFAPKPSSGPHKSRECLPL+LI+RN+L                                             N+ +
Subjt:  ARGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRL

Query:  LYDTKGRFRLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLESNKIADFIKGLKKHLKRLNAPKHWMLDKLGGAFAPK
        +                   +LC              ++ Y                 I++ LE  +     +GLKKHLKRLNAPKHW LDKLGGAFAPK
Subjt:  LYDTKGRFRLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLESNKIADFIKGLKKHLKRLNAPKHWMLDKLGGAFAPK

Query:  PSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLLYDTKGRFHLHSVRDEEAKYKLCKVRSVQ
        PSSGPHKSRECLPL+LI+RN+LKYALTYREVI+ILMQRH+ VDGKVRTDKT+PAGFMDVVSIPKTNENFRLLYDTKGRF LHS+RDEEAK+KLCKVR++Q
Subjt:  PSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLLYDTKGRFHLHSVRDEEAKYKLCKVRSVQ

Query:  FGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHKGSFETIHIQDATGHEFATRLGNVFTIGK
        FGQKGIPYLNTYDGRTIRYPDPLIK NDTIKLDL+ NKI +FIKFDVGNVVMVTGGRNRGRVGVIKNREKHKGSFETIHIQD+TGHEFATRLGNVFT+GK
Subjt:  FGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHKGSFETIHIQDATGHEFATRLGNVFTIGK

Query:  GTKPWVSLPKGKGIKLSIIEEARKRLASQAA
        GTKPWVSLPKGKGIKL+IIEEARKRL++Q A
Subjt:  GTKPWVSLPKGKGIKLSIIEEARKRLASQAA

PHT44647.1 40S ribosomal protein S4, partial [Capsicum baccatum]    8.9e-229    79.65
Query:  ARGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRL
        ARGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLI+ILRNRLKYALTYREVI+ILMQR VLVDGKVRTDKT+PAGFMDVVSIPKTN +FR 
Subjt:  ARGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRL

Query:  LYDTKGRFRLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLESNKIADFIK---------------------------
        LYDTKGRFRLHS+RDEEAK+KLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLES+KI DFIK                           
Subjt:  LYDTKGRFRLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLESNKIADFIK---------------------------

Query:  --------------------------------------------------GLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLK
                                                          GLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLI+ILRNRLK
Subjt:  --------------------------------------------------GLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLK

Query:  YALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLLYDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPL
        YALTYREVI+ILMQR VLVDGKVRTDKT+PAGFMDVVSIPKTNE+FRLLYDTKGRF LHS+RDEEAK+KLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPL
Subjt:  YALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLLYDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPL

Query:  IKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHKGSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEAR
        IKANDTIKLDL+SNKI DFIKFDVGNVVMVTGGRNRGRVGVIKNREKHKGSFET+HIQDA GHEFATRLGNVFT+GKGTKPWVSLPKGKGIKLSIIEEAR
Subjt:  IKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHKGSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEAR

Query:  KRLASQAAVTA
        KRLA+Q+A TA
Subjt:  KRLASQAAVTA

XP_022132117.1 40S ribosomal protein S4-3 [Momordica charantia]    3.6e-145    98.85
Query:  KGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL
        +GLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKT+PAGFMDVVSIPKTNENFRLL
Subjt:  KGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL

Query:  YDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK
        YDTKGRF LHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK
Subjt:  YDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK

Query:  GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAAVTA
        GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAAVTA
Subjt:  GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAAVTA

XP_038899765.1 40S ribosomal protein S4-3-like [Benincasa hispida]    1.6e-145    99.24
Query:  KGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL
        +GLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL
Subjt:  KGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL

Query:  YDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK
        YDTKGRF LHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK
Subjt:  YDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK

Query:  GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAAVTA
        GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAAVTA
Subjt:  GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAAVTA

XP_038900121.1 40S ribosomal protein S4-3 [Benincasa hispida]    4.7e-145    98.85
Query:  KGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL
        +GLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL
Subjt:  KGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL

Query:  YDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK
        YDTKGRF LHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDL+SNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK
Subjt:  YDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK

Query:  GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAAVTA
        GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAAVTA
Subjt:  GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAAVTA

TrEMBL top hits    e value    % identity    Alignment
A0A2G2WHB8 40S ribosomal protein S4 (Fragment)    4.3e-229    79.65
Query:  ARGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRL
        ARGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLI+ILRNRLKYALTYREVI+ILMQR VLVDGKVRTDKT+PAGFMDVVSIPKTN +FR 
Subjt:  ARGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRL

Query:  LYDTKGRFRLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLESNKIADFIK---------------------------
        LYDTKGRFRLHS+RDEEAK+KLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLES+KI DFIK                           
Subjt:  LYDTKGRFRLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLESNKIADFIK---------------------------

Query:  --------------------------------------------------GLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLK
                                                          GLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLI+ILRNRLK
Subjt:  --------------------------------------------------GLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLK

Query:  YALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLLYDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPL
        YALTYREVI+ILMQR VLVDGKVRTDKT+PAGFMDVVSIPKTNE+FRLLYDTKGRF LHS+RDEEAK+KLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPL
Subjt:  YALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLLYDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPL

Query:  IKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHKGSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEAR
        IKANDTIKLDL+SNKI DFIKFDVGNVVMVTGGRNRGRVGVIKNREKHKGSFET+HIQDA GHEFATRLGNVFT+GKGTKPWVSLPKGKGIKLSIIEEAR
Subjt:  IKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHKGSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEAR

Query:  KRLASQAAVTA
        KRLA+Q+A TA
Subjt:  KRLASQAAVTA

A0A3Q7F1W7 Uncharacterized protein    2.5e-229    80.08
Query:  ARGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRL
        ARGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPL++ILRNRLKYALTYREVI+ILMQR V+VDGKVRTDKT+PAGFMDVV+IPKTNE+FRL
Subjt:  ARGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRL

Query:  LYDTKGRFRLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLESNKIADFIK---------------------------
        LYDTKGRFRLHS+RDEEAK+KLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDL+SNKI DFIK                           
Subjt:  LYDTKGRFRLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLESNKIADFIK---------------------------

Query:  ------------------------------------GLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQ
                                            GLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPL++ILRNRLKYALTYREVI+ILMQ
Subjt:  ------------------------------------GLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQ

Query:  RHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLLYDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSN
        R V+VDGKVRTDKT+PAGFMDVV+IPKTNE+FRLLYDTKGRF LHS+RDEEAK+KLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSN
Subjt:  RHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLLYDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSN

Query:  KIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHKGSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAAVTA
        KI DFIKFDVGNVVMVTGGRNRGRVG++KNREKHKGSFET+HIQD+ GHEFATRLGNVFT+GKG+KPWVSLPKGKGIKLSIIEEARKR+A+Q+A  A
Subjt:  KIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHKGSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAAVTA

A0A5A7V1R9 40S ribosomal protein S4    8.6e-145    98.47
Query:  KGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL
        +GLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKT+ENFRLL
Subjt:  KGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL

Query:  YDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK
        YDTKGRF LHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDL+SNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK
Subjt:  YDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK

Query:  GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAAVTA
        GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAAVTA
Subjt:  GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAAVTA

A0A5D3BXA6 40S ribosomal protein S4    8.6e-145    95.19
Query:  SNKIADFIKGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPK
        S  ++   +GLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMD++SIPK
Subjt:  SNKIADFIKGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPK

Query:  TNENFRLLYDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGV
        TNENFRLLYDTKGRF LHSVRDEEAKYKLCKVRSVQFGQKGIPYLNT+DGRTIRYPDPLIKANDTIKLDL+SNKIADFIKFDVGNVVMVTGGRNRGRVGV
Subjt:  TNENFRLLYDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGV

Query:  IKNREKHKGSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAAVTA
        IKNREKHKGSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAAVTA
Subjt:  IKNREKHKGSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAAVTA

A0A6J1BSY6 40S ribosomal protein S4    1.7e-145    98.85
Query:  KGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL
        +GLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKT+PAGFMDVVSIPKTNENFRLL
Subjt:  KGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL

Query:  YDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK
        YDTKGRF LHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK
Subjt:  YDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK

Query:  GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAAVTA
        GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAAVTA
Subjt:  GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAAVTA

SwissProt top hits    e value    % identity    Alignment
P46299 40S ribosomal protein S4    2.6e-138    92.28
Query:  KGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL
        +GLKKHLKRLNAP+HWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHV+VDGKVRTDKT+PAGFMDVVSIPKTNE+FRLL
Subjt:  KGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL

Query:  YDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK
        YDTKGRF LH++  +E K+KLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDL+SNKI DFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK
Subjt:  YDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK

Query:  GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAA
        GSFETIH+QDA GHEFATRLGNVFTIGKGTKPWVSLPK KGIKLSIIEEARKRLA+Q A
Subjt:  GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAA

P46300 40S ribosomal protein S4    8.0e-140    90.84
Query:  KGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL
        +GLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPL++I+RNRLKYALTYREVI+ILMQR V+VDGKVRTDKT+PAGFMDVVSIPKTNENFRLL
Subjt:  KGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL

Query:  YDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK
        YDTKGRF LHS+RDEE+K+KLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDL+SNKI DFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK
Subjt:  YDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK

Query:  GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAAVTA
        GSFET+HIQD+ GHEFATRLGNVFT+GKGTKPWVSLPKGKGIKL+IIE+ARKRLA+Q+A  A
Subjt:  GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAAVTA

P49204 40S ribosomal protein S4-2    4.7e-140    91.89
Query:  KGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL
        +GLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPL+LI+RNRLKYALTYREVI+ILMQRH+ VDGKVRTDKT+PAGFMDVVSIPKTNENFRLL
Subjt:  KGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL

Query:  YDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK
        YDTKGRF LHS++DEEAK+KLCKVRS+QFGQKGIPYLNTYDGRTIRYPDPLIK NDTIKLDL+ NKI +FIKFDVGNVVMVTGGRNRGRVGVIKNREKHK
Subjt:  YDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK

Query:  GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAA
        GSFETIHIQD+TGHEFATRLGNV+TIGKGTKPWVSLPKGKGIKL+IIEEARKRLASQ A
Subjt:  GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAA

Q8VYK6 40S ribosomal protein S4-3    3.6e-140    91.89
Query:  KGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL
        +GLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPL+LI+RNRLKYALTYREVI+ILMQRH+ VDGKVRTDKT+PAGFMDVVSIPKTNENFRLL
Subjt:  KGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL

Query:  YDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK
        YDTKGRF LHS++DEEAK+KLCKVRS+QFGQKGIPYLNTYDGRTIRYPDPLIK NDTIKLDL++NKI +FIKFDVGNVVMVTGGRNRGRVGVIKNREKHK
Subjt:  YDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK

Query:  GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAA
        GSFETIHIQD+TGHEFATRLGNV+TIGKGTKPWVSLPKGKGIKL+IIEEARKRLASQ A
Subjt:  GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAA

Q93VH9 40S ribosomal protein S4-1    2.3e-139    91.12
Query:  KGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL
        +GLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPL+LI+RNRLKYALTYREVI+ILMQRH+ VDGKVRTDKT+PAGFMDVVSIPKTNENFRLL
Subjt:  KGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL

Query:  YDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK
        YDTKGRF LHS++DEEAK+KLCKVRS+QFGQKGIPYLNTYDGRTIRYPDPLIK NDTIKLDL+ NKI +FIKFDVGNVVMVTGGRNRGRVGVIKNREKHK
Subjt:  YDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK

Query:  GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAA
        GSFETIHIQD+TGHEFATRLGNV+TIGKGTKPWVSLPKGKGIKL+IIEEARKRL++Q A
Subjt:  GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAA

Arabidopsis top hits    e value    % identity    Alignment
AT2G17360.1 Ribosomal protein S4 (RPS4A) family protein    1.7e-140    91.12
Query:  KGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL
        +GLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPL+LI+RNRLKYALTYREVI+ILMQRH+ VDGKVRTDKT+PAGFMDVVSIPKTNENFRLL
Subjt:  KGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL

Query:  YDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK
        YDTKGRF LHS++DEEAK+KLCKVRS+QFGQKGIPYLNTYDGRTIRYPDPLIK NDTIKLDL+ NKI +FIKFDVGNVVMVTGGRNRGRVGVIKNREKHK
Subjt:  YDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK

Query:  GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAA
        GSFETIHIQD+TGHEFATRLGNV+TIGKGTKPWVSLPKGKGIKL+IIEEARKRL++Q A
Subjt:  GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAA

AT2G17360.2 Ribosomal protein S4 (RPS4A) family protein    4.0e-94    89.39
Query:  KGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL
        +GLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPL+LI+RNRLKYALTYREVI+ILMQRH+ VDGKVRTDKT+PAGFMDVVSIPKTNENFRLL
Subjt:  KGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL

Query:  YDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVV
        YDTKGRF LHS++DEEAK+KLCKVRS+QFGQKGIPYLNTYDGRTIRYPDPLIK NDTIKLDL+ NKI +FIKFDVG  V
Subjt:  YDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVV

AT5G07090.1 Ribosomal protein S4 (RPS4A) family protein    3.3e-141    91.89
Query:  KGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL
        +GLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPL+LI+RNRLKYALTYREVI+ILMQRH+ VDGKVRTDKT+PAGFMDVVSIPKTNENFRLL
Subjt:  KGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL

Query:  YDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK
        YDTKGRF LHS++DEEAK+KLCKVRS+QFGQKGIPYLNTYDGRTIRYPDPLIK NDTIKLDL+ NKI +FIKFDVGNVVMVTGGRNRGRVGVIKNREKHK
Subjt:  YDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK

Query:  GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAA
        GSFETIHIQD+TGHEFATRLGNV+TIGKGTKPWVSLPKGKGIKL+IIEEARKRLASQ A
Subjt:  GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAA

AT5G07090.2 Ribosomal protein S4 (RPS4A) family protein    5.3e-131    91.77
Query:  MLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLLYDTKGRFHLHSVRDEE
        MLDKLGGAFAPKPSSGPHKSRECLPL+LI+RNRLKYALTYREVI+ILMQRH+ VDGKVRTDKT+PAGFMDVVSIPKTNENFRLLYDTKGRF LHS++DEE
Subjt:  MLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLLYDTKGRFHLHSVRDEE

Query:  AKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHKGSFETIHIQDATGHEF
        AK+KLCKVRS+QFGQKGIPYLNTYDGRTIRYPDPLIK NDTIKLDL+ NKI +FIKFDVGNVVMVTGGRNRGRVGVIKNREKHKGSFETIHIQD+TGHEF
Subjt:  AKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHKGSFETIHIQDATGHEF

Query:  ATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAA
        ATRLGNV+TIGKGTKPWVSLPKGKGIKL+IIEEARKRLASQ A
Subjt:  ATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAA

AT5G58420.1 Ribosomal protein S4 (RPS4A) family protein    2.6e-141    91.89
Query:  KGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL
        +GLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPL+LI+RNRLKYALTYREVI+ILMQRH+ VDGKVRTDKT+PAGFMDVVSIPKTNENFRLL
Subjt:  KGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLL

Query:  YDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK
        YDTKGRF LHS++DEEAK+KLCKVRS+QFGQKGIPYLNTYDGRTIRYPDPLIK NDTIKLDL++NKI +FIKFDVGNVVMVTGGRNRGRVGVIKNREKHK
Subjt:  YDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHK

Query:  GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAA
        GSFETIHIQD+TGHEFATRLGNV+TIGKGTKPWVSLPKGKGIKL+IIEEARKRLASQ A
Subjt:  GSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAA
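
In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` a conservative substitution, and a blank a mismatch or gap. As a rough sketch, the identical and positive positions in a single block can be counted from that line. The helper `midline_stats` is a hypothetical name, and it works on one block at a time, whereas the % identity column presumably refers to the whole alignment.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identical, positives, length) for one alignment block.

    `query` is the residue string of a 'Query:' line and `midline` is the
    match line beneath it, both with the leading label and padding removed.
    """
    length = len(query)
    # Trailing blanks are often stripped from the midline; pad it back.
    midline = midline.ljust(length)[:length]
    identical = sum(ch.isalpha() for ch in midline)
    positives = identical + midline.count("+")
    return identical, positives, length


# First ten columns of the Momordica charantia alignment shown earlier.
identical, positives, length = midline_stats("KGLKKHLKRL", "+GLKKHLKRL")
print(f"{identical}/{length} identical, {positives}/{length} positive")
```

Summing the three counts over every block of a hit and dividing identical by length gives a per-hit percentage that can be compared with the table value.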


Sequences
CDS sequence
ATGGCGAGAGGACTGAAGAAACATTTGAAGAGGCTCAATGCTCCAAAGCATTGGATGCTTGATAAACTTGGTGGTGCATTTGCCCCCAAACCTTCATCTGGACCTCATAA
GTCGAGGGAATGCCTCCCTTTGATCCTTATATTGAGGAACCGACTGAAATATGCTCTCACATATCGTGAGGTAATTGCAATTTTAATGCAAAGACATGTTTTGGTCGATG
GGAAGGTCAGGACAGACAAGACTTTCCCTGCTGGTTTCATGGACGTCGTGTCAATTCCCAAGACAAATGAGAATTTCCGGCTTCTTTATGACACAAAAGGTCGATTCCGT
CTACACTCAGTTAGGGATGAAGAGGCTAAGTATAAGCTGTGCAAAGTTCGCTCGGTGCAGTTCGGGCAAAAGGGTATCCCTTATCTGAACACTTACGACGGACGCACAAT
CCGGTATCCTGACCCTCTGATCAAGGCGAATGACACCATCAAGCTAGACCTCGAGTCCAACAAGATTGCTGATTTCATCAAAGGACTGAAGAAACATTTGAAGAGGCTCA
ATGCTCCAAAGCATTGGATGCTTGACAAACTTGGTGGTGCATTTGCCCCCAAACCTTCATCTGGACCTCATAAGTCAAGGGAATGCCTCCCATTGATCCTTATATTGAGG
AACCGACTGAAATATGCTCTCACATATCGTGAGGTAATTGCCATTTTAATGCAAAGGCATGTTTTGGTGGATGGGAAGGTCAGGACAGACAAGACTTTCCCTGCTGGTTT
CATGGACGTTGTGTCAATTCCCAAGACAAACGAGAATTTTCGGCTTCTTTATGACACAAAAGGTCGTTTCCATCTACACTCAGTTAGGGATGAAGAGGCTAAGTATAAGC
TGTGCAAAGTTCGCTCAGTGCAGTTTGGGCAAAAGGGCATCCCTTATCTGAACACATACGATGGGCGCACAATCCGTTACCCCGACCCTCTGATCAAAGCGAATGACACC
ATCAAGCTGGACCTTGACTCCAACAAGATTGCAGATTTCATCAAGTTTGATGTAGGAAATGTTGTGATGGTGACTGGTGGAAGAAACAGGGGCAGAGTTGGAGTGATAAA
GAACAGGGAGAAGCATAAGGGAAGCTTTGAAACAATCCACATTCAGGATGCAACCGGACACGAATTCGCTACTCGGCTGGGCAACGTGTTCACAATTGGCAAGGGGACTA
AGCCATGGGTGTCCCTACCCAAGGGCAAGGGTATTAAATTATCCATCATCGAAGAAGCTAGGAAGAGGCTGGCAAGCCAAGCCGCAGTTACTGCTTAA
mRNA sequence
TTTTAAAATAAGAAATAAAATTCCCTCACTCGCTTCCTCTCTCTCCCTCCAAGCCGTTTTTTATCTTTCGTTCTCACACGCAGAGCGAAACCCTAGCTCTCTGCCACAGA
CGCCGGCGCCGAATCCATTTTAGCTCCCTCGAGAAAATATGGCGAGAGGACTGAAGAAACATTTGAAGAGGCTCAATGCTCCAAAGCATTGGATGCTTGATAAACTTGGT
GGTGCATTTGCCCCCAAACCTTCATCTGGACCTCATAAGTCGAGGGAATGCCTCCCTTTGATCCTTATATTGAGGAACCGACTGAAATATGCTCTCACATATCGTGAGGT
AATTGCAATTTTAATGCAAAGACATGTTTTGGTCGATGGGAAGGTCAGGACAGACAAGACTTTCCCTGCTGGTTTCATGGACGTCGTGTCAATTCCCAAGACAAATGAGA
ATTTCCGGCTTCTTTATGACACAAAAGGTCGATTCCGTCTACACTCAGTTAGGGATGAAGAGGCTAAGTATAAGCTGTGCAAAGTTCGCTCGGTGCAGTTCGGGCAAAAG
GGTATCCCTTATCTGAACACTTACGACGGACGCACAATCCGGTATCCTGACCCTCTGATCAAGGCGAATGACACCATCAAGCTAGACCTCGAGTCCAACAAGATTGCTGA
TTTCATCAAAGGACTGAAGAAACATTTGAAGAGGCTCAATGCTCCAAAGCATTGGATGCTTGACAAACTTGGTGGTGCATTTGCCCCCAAACCTTCATCTGGACCTCATA
AGTCAAGGGAATGCCTCCCATTGATCCTTATATTGAGGAACCGACTGAAATATGCTCTCACATATCGTGAGGTAATTGCCATTTTAATGCAAAGGCATGTTTTGGTGGAT
GGGAAGGTCAGGACAGACAAGACTTTCCCTGCTGGTTTCATGGACGTTGTGTCAATTCCCAAGACAAACGAGAATTTTCGGCTTCTTTATGACACAAAAGGTCGTTTCCA
TCTACACTCAGTTAGGGATGAAGAGGCTAAGTATAAGCTGTGCAAAGTTCGCTCAGTGCAGTTTGGGCAAAAGGGCATCCCTTATCTGAACACATACGATGGGCGCACAA
TCCGTTACCCCGACCCTCTGATCAAAGCGAATGACACCATCAAGCTGGACCTTGACTCCAACAAGATTGCAGATTTCATCAAGTTTGATGTAGGAAATGTTGTGATGGTG
ACTGGTGGAAGAAACAGGGGCAGAGTTGGAGTGATAAAGAACAGGGAGAAGCATAAGGGAAGCTTTGAAACAATCCACATTCAGGATGCAACCGGACACGAATTCGCTAC
TCGGCTGGGCAACGTGTTCACAATTGGCAAGGGGACTAAGCCATGGGTGTCCCTACCCAAGGGCAAGGGTATTAAATTATCCATCATCGAAGAAGCTAGGAAGAGGCTGG
CAAGCCAAGCCGCAGTTACTGCTTAA
Protein sequence
MARGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILRNRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLLYDTKGRFR
LHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDTIKLDLESNKIADFIKGLKKHLKRLNAPKHWMLDKLGGAFAPKPSSGPHKSRECLPLILILR
NRLKYALTYREVIAILMQRHVLVDGKVRTDKTFPAGFMDVVSIPKTNENFRLLYDTKGRFHLHSVRDEEAKYKLCKVRSVQFGQKGIPYLNTYDGRTIRYPDPLIKANDT
IKLDLDSNKIADFIKFDVGNVVMVTGGRNRGRVGVIKNREKHKGSFETIHIQDATGHEFATRLGNVFTIGKGTKPWVSLPKGKGIKLSIIEEARKRLASQAAVTA
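
The protein sequence above should correspond to the CDS read codon by codon. The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nucleotides of the CDS sequence listed above.
print(translate("ATGGCGAGAGGACTGAAGAAACATTTGAAGAGG"))  # MARGLKKHLKR
```

Passing the full CDS (line breaks included) to `translate` and comparing the result with the protein sequence is a quick consistency check on the record.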