; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr022580 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr022580
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description30S ribosomal protein S1 homolog A isoform X2
Genome locationtig00000289:1230285..1234177
RNA-Seq ExpressionSgr022580
SyntenySgr022580
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578864.1 Protein PIGMENT DEFECTIVE 338, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]2.1e-15463.21Show/hide
Query:  LCVPNSSFEPRDFLVSSNPVRNSALLNVPRFLTPPGISNSKFFSTSNWSNIHNKSKKLTFRERNSVAFCSRNDVFDTFSSTQSPDRPQEDGIQEIDEL--
        LC  NSS EP+DFLVSSN +RNSA L        P ISN K +STSNWSN  NKSKKL+ R R+SV FCSRN++FD FSSTQ P++P ED IQ +DE+  
Subjt:  LCVPNSSFEPRDFLVSSNPVRNSALLNVPRFLTPPGISNSKFFSTSNWSNIHNKSKKLTFRERNSVAFCSRNDVFDTFSSTQSPDRPQEDGIQEIDEL--

Query:  --------------------VEKPDENEALAPFMKFFKTGDCIEEEEEGEEEEKMLHVSEEKVDRDNETEKANKLNVEYYEPKPGDAVVGVVVSGNENKL
                            VE PDENEALAPFMKFFK  D +EEE   EEEEK +   EEK+D D E  KANKLNVEYYEPKPGD VVGVVVSGNENKL
Subjt:  --------------------VEKPDENEALAPFMKFFKTGDCIEEEEEGEEEEKMLHVSEEKVDRDNETEKANKLNVEYYEPKPGDAVVGVVVSGNENKL

Query:  DINVGADLLGTMLTKEVLPLYDKEMEFLMCDFDKDAESFMVNGKMGLVKYDDAVSHGPGPGRPVVETGTVLFAEVLGRTLGGRPLLSTRRLFRRIA----
        DINVGADLLGTMLTKEVLPL+DKE+EFLMCD DKDAESFM+NGKMGLVKYDDA S GPGPGRPVVETGTVLFAEVLGRTL GRPLLSTRRLFRR+A    
Subjt:  DINVGADLLGTMLTKEVLPLYDKEMEFLMCDFDKDAESFMVNGKMGLVKYDDAVSHGPGPGRPVVETGTVLFAEVLGRTLGGRPLLSTRRLFRRIA----

Query:  ----------------------------------GIEDLSTLNNTLYLLQHLLD-----IIGRKEGMDDLGSLNWLLL--------------------CS
                                            E L+ +NN   L +++ +     I+  ++  + L      LL                     S
Subjt:  ----------------------------------GIEDLSTLNNTLYLLQHLLD-----IIGRKEGMDDLGSLNWLLL--------------------CS

Query:  GLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQKLPDVEGIPRPEPARTTDLSFENEASM
        GLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLFISNKEKVFSEA+IMAKKYRQ+LP VEG  RPEPA TTDL FENE+SM
Subjt:  GLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQKLPDVEGIPRPEPARTTDLSFENEASM

Query:  YANWKWFKFER
        YANWKWFKFER
Subjt:  YANWKWFKFER

XP_022134276.1 uncharacterized protein LOC111006579 isoform X1 [Momordica charantia]2.3e-16464.5Show/hide
Query:  EYLCVPNSSFEPRDFLVSSNPVRNSALLNVPRFLTPPGISNSKFFSTSNWSNIHNKSKKLTFRERNSVAFCSRNDVFDTFSSTQSPDRPQEDGIQEIDEL
        ++ C+PNSSFEP+DFL+SSNP RNS+ +NVPR +TP  IS+ K FSTSNWSN HNKSKK  FRERNSV FCSRNDVFD FSSTQ PDRP+E+GIQEIDEL
Subjt:  EYLCVPNSSFEPRDFLVSSNPVRNSALLNVPRFLTPPGISNSKFFSTSNWSNIHNKSKKLTFRERNSVAFCSRNDVFDTFSSTQSPDRPQEDGIQEIDEL

Query:  ----------------------VEKPDENEALAPFMKFFKTGDCIEEEEEGEEEEKMLHVSEEKVDRDNETEKANKLNVEYYEPKPGDAVVGVVVSGNEN
                               +KPDENEALAPFMKFF+T D +      EEEEK L V EEKV+R++ETEKANKLNVEYYEPKPGD VVGVVVSGNEN
Subjt:  ----------------------VEKPDENEALAPFMKFFKTGDCIEEEEEGEEEEKMLHVSEEKVDRDNETEKANKLNVEYYEPKPGDAVVGVVVSGNEN

Query:  KLDINVGADLLGTMLTKEVLPLYDKEMEFLMCDFDKDAESFMVNGKMGLVKYDDAVSHGPGPGRPVVETGTVLFAEVLGRTLGGRPLLSTRRLFRRIA--
        KLDINVGADLLGTMLTKEVLPLYDKEM+FLMCDFDKDAESFMVNGKMGLVKY+DAVS GPG GRPVVETGTVLFAEVLGRTL GRPLLSTRRLFRRIA  
Subjt:  KLDINVGADLLGTMLTKEVLPLYDKEMEFLMCDFDKDAESFMVNGKMGLVKYDDAVSHGPGPGRPVVETGTVLFAEVLGRTLGGRPLLSTRRLFRRIA--

Query:  --------------------------GIEDL------STLNNTLYLLQHLLDIIGRK---------EGMDDL--------------------GSLNWLLL
                                   IE L      + L N +     L + +GR+         E  +DL                    G++  +  
Subjt:  --------------------------GIEDL------STLNNTLYLLQHLLDIIGRK---------EGMDDL--------------------GSLNWLLL

Query:  C-----------SGLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQKLPDVEGIPRPEPA
                    SGLLHVSNITR+RVTSVSDLLAVGE VKVLVVKSMFPDKISLSIADLESEPGLFI NKEKVF+EAE+MAKKYRQKLP +EGIP  EP 
Subjt:  C-----------SGLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQKLPDVEGIPRPEPA

Query:  RTTDLSFENEASMYANWKWFKFER
         TTDL FENE+SMYANWKWFKFER
Subjt:  RTTDLSFENEASMYANWKWFKFER

XP_022134277.1 uncharacterized protein LOC111006579 isoform X2 [Momordica charantia]2.1e-16264.31Show/hide
Query:  EYLCVPNSSFEPRDFLVSSNPVRNSALLNVPRFLTPPGISNSKFFSTSNWSNIHNKSKKLTFRERNSVAFCSRNDVFDTFSSTQSPDRPQEDGIQEIDEL
        ++ C+PNSSFEP+DFL+SSNP RNS+ +NVPR +TP  IS+ K FSTSNWSN HNKSKK  FRERNSV FCSRNDVFD FSSTQ PDRP+E+GIQEIDEL
Subjt:  EYLCVPNSSFEPRDFLVSSNPVRNSALLNVPRFLTPPGISNSKFFSTSNWSNIHNKSKKLTFRERNSVAFCSRNDVFDTFSSTQSPDRPQEDGIQEIDEL

Query:  ----------------------VEKPDENEALAPFMKFFKTGDCIEEEEEGEEEEKMLHVSEEKVDRDNETEKANKLNVEYYEPKPGDAVVGVVVSGNEN
                               +KPDENEALAPFMKFF+T D +      EEEEK L V EEKV+R++ETEKANKLNVEYYEPKPGD VVGVVVSGNEN
Subjt:  ----------------------VEKPDENEALAPFMKFFKTGDCIEEEEEGEEEEKMLHVSEEKVDRDNETEKANKLNVEYYEPKPGDAVVGVVVSGNEN

Query:  KLDINVGADLLGTMLTKEVLPLYDKEMEFLMCDFDKDAESFMVNGKMGLVKYDDAVSHGPGPGRPVVETGTVLFAEVLGRTLGGRPLLSTRRLFRRIA--
        KLDINVGADLLGTMLTKEVLPLYDKEM+FLMCDFDKDAESFMVNGKMGLVKY+DAVS GPG GRPVVETGTVLFAEVLGRTL GRPLLSTRRLFRRIA  
Subjt:  KLDINVGADLLGTMLTKEVLPLYDKEMEFLMCDFDKDAESFMVNGKMGLVKYDDAVSHGPGPGRPVVETGTVLFAEVLGRTLGGRPLLSTRRLFRRIA--

Query:  --------------------------GIEDL------STLNNTLYLLQHLLDIIGRK---------EGMDDL--------------------GSLNWLLL
                                   IE L      + L N +     L + +GR+         E  +DL                    G++  +  
Subjt:  --------------------------GIEDL------STLNNTLYLLQHLLDIIGRK---------EGMDDL--------------------GSLNWLLL

Query:  C-----------SGLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQKLPDVEGIPRPEPA
                    SGLLHVSNITR+RVTSVSDLLAVGE VKVLVVKSMFPDKISLSIADLESEPGLFI NKE VF+EAE+MAKKYRQKLP +EGIP  EP 
Subjt:  C-----------SGLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQKLPDVEGIPRPEPA

Query:  RTTDLSFENEASMYANWKWFKFER
         TTDL FENE+SMYANWKWFKFER
Subjt:  RTTDLSFENEASMYANWKWFKFER

XP_023551503.1 uncharacterized protein LOC111809291 [Cucurbita pepo subsp. pepo]1.8e-15363.03Show/hide
Query:  LCVPNSSFEPRDFLVSSNPVRNSALLNVPRFLTPPGISNSKFFSTSNWSNIHNKSKKLTFRERNSVAFCSRNDVFDTFSSTQSPDRPQEDGIQEIDEL--
        LC  NSS EP+DFLVSSN +RNS+      FLT P ISN K +STSNWSN  NKSKKL  R R+SV FCSRN++FD FSSTQ P++P ED IQ IDE+  
Subjt:  LCVPNSSFEPRDFLVSSNPVRNSALLNVPRFLTPPGISNSKFFSTSNWSNIHNKSKKLTFRERNSVAFCSRNDVFDTFSSTQSPDRPQEDGIQEIDEL--

Query:  --------------------VEKPDENEALAPFMKFFKTGDCIEEEEEGEEEEKMLHVSEEKVDRDNETEKANKLNVEYYEPKPGDAVVGVVVSGNENKL
                            VE PDENEALAPFMKFFK  D +EEE   EEEEK +   EEK+D D E  KA KLNVEYYEPKPGD VVGVVVSGNENKL
Subjt:  --------------------VEKPDENEALAPFMKFFKTGDCIEEEEEGEEEEKMLHVSEEKVDRDNETEKANKLNVEYYEPKPGDAVVGVVVSGNENKL

Query:  DINVGADLLGTMLTKEVLPLYDKEMEFLMCDFDKDAESFMVNGKMGLVKYDDAVSHGPGPGRPVVETGTVLFAEVLGRTLGGRPLLSTRRLFRRIA----
        DINVGADLLGTMLTKEVLPLYDKE+EFLMCD DKDAESFM+NGKMGLVKYDDAVS GPGPGRPVVETGTVLFAEVLGRTL GRPLLSTRRLFRR+A    
Subjt:  DINVGADLLGTMLTKEVLPLYDKEMEFLMCDFDKDAESFMVNGKMGLVKYDDAVSHGPGPGRPVVETGTVLFAEVLGRTLGGRPLLSTRRLFRRIA----

Query:  ------------------------GIEDL------STLNNTLYLLQHLLDIIGRK---------EGMDDL--------------------GSLNWLLLC-
                                 IE L      + L N +     L + +GR+         E  +DL                    G++  +    
Subjt:  ------------------------GIEDL------STLNNTLYLLQHLLDIIGRK---------EGMDDL--------------------GSLNWLLLC-

Query:  ----------SGLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQKLPDVEGIPRPEPART
                  SGLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLFISNKEKVFSEA+IMAKKYRQ+LP VEG  RPEPA T
Subjt:  ----------SGLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQKLPDVEGIPRPEPART

Query:  TDLSFENEASMYANWKWFKFER
        TDL FENE+SMYANWKWFKFER
Subjt:  TDLSFENEASMYANWKWFKFER

XP_038886053.1 protein PIGMENT DEFECTIVE 338, chloroplastic [Benincasa hispida]1.6e-15763.55Show/hide
Query:  EYLCVPNSSFEPRDFLVSSNPVRNSALLNVPRFLTPPGISNSKFFSTSNWSNIHNKSKKLTFRERNSVAFCSRNDVFDTFSSTQSPDRPQEDGIQEIDEL
        +YLCVPNSSFE RDFLVSSN V+NSA L VPR L  P ISNS   ST N SN    SKKLTFR R+SV FCSRND+FD FSSTQ PD+ +EDGIQEIDE+
Subjt:  EYLCVPNSSFEPRDFLVSSNPVRNSALLNVPRFLTPPGISNSKFFSTSNWSNIHNKSKKLTFRERNSVAFCSRNDVFDTFSSTQSPDRPQEDGIQEIDEL

Query:  ----------------------VEKPDENEALAPFMKFFKTGDCIEEEEEGEEEEKMLHVSEEKVDRDNETEKANKLNVEYYEPKPGDAVVGVVVSGNEN
                              VE  DENEALAPFMKFFKT D  +EE   EEEEK L   EEK+D D+ETEK NKLNVEYYEPKPGD VVGVVVSGNEN
Subjt:  ----------------------VEKPDENEALAPFMKFFKTGDCIEEEEEGEEEEKMLHVSEEKVDRDNETEKANKLNVEYYEPKPGDAVVGVVVSGNEN

Query:  KLDINVGADLLGTMLTKEVLPLYDKEMEFLMCDFDKDAESFMVNGKMGLVKYDDAVSHGPGPGRPVVETGTVLFAEVLGRTLGGRPLLSTRRLFRRIA--
        KLDINVGADLLGTMLTKEVLPLYDKEMEFLMCD DKDAESFM+NGKMGLVKY+DA S GPGPGRPVVETGTVLFAEVLGRTL GRPLLSTRRLFRR+A  
Subjt:  KLDINVGADLLGTMLTKEVLPLYDKEMEFLMCDFDKDAESFMVNGKMGLVKYDDAVSHGPGPGRPVVETGTVLFAEVLGRTLGGRPLLSTRRLFRRIA--

Query:  ------------------------------------GIEDLSTLNNTLYLLQHL----------LD------IIGRKEGMDDLGSLNWLLL---------
                                              E L+ +NN   L +++          +D      I+  ++  + L      LL         
Subjt:  ------------------------------------GIEDLSTLNNTLYLLQHL----------LD------IIGRKEGMDDLGSLNWLLL---------

Query:  -----------CSGLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQKLPDVEGIPRPEPA
                    SGLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLFISNKEKVFSEA+IMA++YRQ+LP ++GI RPEPA
Subjt:  -----------CSGLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQKLPDVEGIPRPEPA

Query:  RTTDLSFENEASMYANWKWFKFER
         TTDL FENE+SMYANWKWFKFER
Subjt:  RTTDLSFENEASMYANWKWFKFER

TrEMBL top hitse value%identityAlignment
A0A1S3C3I1 30S ribosomal protein S1 homolog A isoform X13.3e-15362.21Show/hide
Query:  EYLCVPNSSFEPRDFLVSSNPVRNSALLNVPRFLTPPGISNSKFFSTSNWSNIHNKSKKLTFRERNSVAFCSRNDVFDTFSSTQSPDRPQEDGIQEIDEL
        +YLCVPNSSFE +DFLVSSN VRNSA L VPR L    ISNS   STSN S+  N+SKKLTFR R+SV  CSRND+FD  SSTQ PD+P  DGIQEIDE+
Subjt:  EYLCVPNSSFEPRDFLVSSNPVRNSALLNVPRFLTPPGISNSKFFSTSNWSNIHNKSKKLTFRERNSVAFCSRNDVFDTFSSTQSPDRPQEDGIQEIDEL

Query:  ----------------------VEKPDENEALAPFMKFFKTGDCIEEEEEGEEEEKMLHVSEEKVDRDNETEKANKLNVEYYEPKPGDAVVGVVVSGNEN
                              VE PDENEALAPFMKFFK  D  +E+   EEE+K L   EEK+D D+ETE ANKLNVEYYEPKPGD VVGVVVSGNEN
Subjt:  ----------------------VEKPDENEALAPFMKFFKTGDCIEEEEEGEEEEKMLHVSEEKVDRDNETEKANKLNVEYYEPKPGDAVVGVVVSGNEN

Query:  KLDINVGADLLGTMLTKEVLPLYDKEMEFLMCDFDKDAESFMVNGKMGLVKYDDAVSHGPGPGRPVVETGTVLFAEVLGRTLGGRPLLSTRRLFRRIA--
        KLDINVGADLLGTMLTKEVLPLYDKEMEFLMCD DKDAESFM+NGKMGLVKY+DA S G GPGRPVVE GTVLFAEVLGRTL GRPLLSTRRLFRR+A  
Subjt:  KLDINVGADLLGTMLTKEVLPLYDKEMEFLMCDFDKDAESFMVNGKMGLVKYDDAVSHGPGPGRPVVETGTVLFAEVLGRTLGGRPLLSTRRLFRRIA--

Query:  ------GIED------------------------------LSTLNNTLYLLQHL----------LD------IIGRKEGMDDLGSLNWLLL---------
              G+++                              L+ +NN   L +++          +D      I+  K+  + L      LL         
Subjt:  ------GIED------------------------------LSTLNNTLYLLQHL----------LD------IIGRKEGMDDLGSLNWLLL---------

Query:  -----------CSGLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQKLPDVEGIPRPEPA
                    SGLLHVSNITRARVTSVSDLL VGEKVKVLVVKSMFPDKISLSIADLESEPGLFI+NKEKVFSEA+IMAKKYRQ+LP ++GIPR + A
Subjt:  -----------CSGLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQKLPDVEGIPRPEPA

Query:  RTTDLSFENEASMYANWKWFKFER
         TTDL FENE+SMYANWKWFKFER
Subjt:  RTTDLSFENEASMYANWKWFKFER

A0A6J1BXF7 uncharacterized protein LOC111006579 isoform X11.1e-16464.5Show/hide
Query:  EYLCVPNSSFEPRDFLVSSNPVRNSALLNVPRFLTPPGISNSKFFSTSNWSNIHNKSKKLTFRERNSVAFCSRNDVFDTFSSTQSPDRPQEDGIQEIDEL
        ++ C+PNSSFEP+DFL+SSNP RNS+ +NVPR +TP  IS+ K FSTSNWSN HNKSKK  FRERNSV FCSRNDVFD FSSTQ PDRP+E+GIQEIDEL
Subjt:  EYLCVPNSSFEPRDFLVSSNPVRNSALLNVPRFLTPPGISNSKFFSTSNWSNIHNKSKKLTFRERNSVAFCSRNDVFDTFSSTQSPDRPQEDGIQEIDEL

Query:  ----------------------VEKPDENEALAPFMKFFKTGDCIEEEEEGEEEEKMLHVSEEKVDRDNETEKANKLNVEYYEPKPGDAVVGVVVSGNEN
                               +KPDENEALAPFMKFF+T D +      EEEEK L V EEKV+R++ETEKANKLNVEYYEPKPGD VVGVVVSGNEN
Subjt:  ----------------------VEKPDENEALAPFMKFFKTGDCIEEEEEGEEEEKMLHVSEEKVDRDNETEKANKLNVEYYEPKPGDAVVGVVVSGNEN

Query:  KLDINVGADLLGTMLTKEVLPLYDKEMEFLMCDFDKDAESFMVNGKMGLVKYDDAVSHGPGPGRPVVETGTVLFAEVLGRTLGGRPLLSTRRLFRRIA--
        KLDINVGADLLGTMLTKEVLPLYDKEM+FLMCDFDKDAESFMVNGKMGLVKY+DAVS GPG GRPVVETGTVLFAEVLGRTL GRPLLSTRRLFRRIA  
Subjt:  KLDINVGADLLGTMLTKEVLPLYDKEMEFLMCDFDKDAESFMVNGKMGLVKYDDAVSHGPGPGRPVVETGTVLFAEVLGRTLGGRPLLSTRRLFRRIA--

Query:  --------------------------GIEDL------STLNNTLYLLQHLLDIIGRK---------EGMDDL--------------------GSLNWLLL
                                   IE L      + L N +     L + +GR+         E  +DL                    G++  +  
Subjt:  --------------------------GIEDL------STLNNTLYLLQHLLDIIGRK---------EGMDDL--------------------GSLNWLLL

Query:  C-----------SGLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQKLPDVEGIPRPEPA
                    SGLLHVSNITR+RVTSVSDLLAVGE VKVLVVKSMFPDKISLSIADLESEPGLFI NKEKVF+EAE+MAKKYRQKLP +EGIP  EP 
Subjt:  C-----------SGLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQKLPDVEGIPRPEPA

Query:  RTTDLSFENEASMYANWKWFKFER
         TTDL FENE+SMYANWKWFKFER
Subjt:  RTTDLSFENEASMYANWKWFKFER

A0A6J1BZ75 uncharacterized protein LOC111006579 isoform X21.0e-16264.31Show/hide
Query:  EYLCVPNSSFEPRDFLVSSNPVRNSALLNVPRFLTPPGISNSKFFSTSNWSNIHNKSKKLTFRERNSVAFCSRNDVFDTFSSTQSPDRPQEDGIQEIDEL
        ++ C+PNSSFEP+DFL+SSNP RNS+ +NVPR +TP  IS+ K FSTSNWSN HNKSKK  FRERNSV FCSRNDVFD FSSTQ PDRP+E+GIQEIDEL
Subjt:  EYLCVPNSSFEPRDFLVSSNPVRNSALLNVPRFLTPPGISNSKFFSTSNWSNIHNKSKKLTFRERNSVAFCSRNDVFDTFSSTQSPDRPQEDGIQEIDEL

Query:  ----------------------VEKPDENEALAPFMKFFKTGDCIEEEEEGEEEEKMLHVSEEKVDRDNETEKANKLNVEYYEPKPGDAVVGVVVSGNEN
                               +KPDENEALAPFMKFF+T D +      EEEEK L V EEKV+R++ETEKANKLNVEYYEPKPGD VVGVVVSGNEN
Subjt:  ----------------------VEKPDENEALAPFMKFFKTGDCIEEEEEGEEEEKMLHVSEEKVDRDNETEKANKLNVEYYEPKPGDAVVGVVVSGNEN

Query:  KLDINVGADLLGTMLTKEVLPLYDKEMEFLMCDFDKDAESFMVNGKMGLVKYDDAVSHGPGPGRPVVETGTVLFAEVLGRTLGGRPLLSTRRLFRRIA--
        KLDINVGADLLGTMLTKEVLPLYDKEM+FLMCDFDKDAESFMVNGKMGLVKY+DAVS GPG GRPVVETGTVLFAEVLGRTL GRPLLSTRRLFRRIA  
Subjt:  KLDINVGADLLGTMLTKEVLPLYDKEMEFLMCDFDKDAESFMVNGKMGLVKYDDAVSHGPGPGRPVVETGTVLFAEVLGRTLGGRPLLSTRRLFRRIA--

Query:  --------------------------GIEDL------STLNNTLYLLQHLLDIIGRK---------EGMDDL--------------------GSLNWLLL
                                   IE L      + L N +     L + +GR+         E  +DL                    G++  +  
Subjt:  --------------------------GIEDL------STLNNTLYLLQHLLDIIGRK---------EGMDDL--------------------GSLNWLLL

Query:  C-----------SGLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQKLPDVEGIPRPEPA
                    SGLLHVSNITR+RVTSVSDLLAVGE VKVLVVKSMFPDKISLSIADLESEPGLFI NKE VF+EAE+MAKKYRQKLP +EGIP  EP 
Subjt:  C-----------SGLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQKLPDVEGIPRPEPA

Query:  RTTDLSFENEASMYANWKWFKFER
         TTDL FENE+SMYANWKWFKFER
Subjt:  RTTDLSFENEASMYANWKWFKFER

A0A6J1FMS2 uncharacterized protein LOC111445605 isoform X11.7e-15262.84Show/hide
Query:  LCVPNSSFEPRDFLVSSNPVRNSALLNVPRFLTPPGISNSKFFSTSNWSNIHNKSKKLTFRERNSVAFCSRNDVFDTFSSTQSPDRPQEDGIQEIDEL--
        LC  NSS EP+DF VSSN +RNSA      FLT P ISN K +STSNWSN  +KSKKL+ R R+SV FCSRN++FD FSSTQ P++P ED IQ IDE+  
Subjt:  LCVPNSSFEPRDFLVSSNPVRNSALLNVPRFLTPPGISNSKFFSTSNWSNIHNKSKKLTFRERNSVAFCSRNDVFDTFSSTQSPDRPQEDGIQEIDEL--

Query:  --------------------VEKPDENEALAPFMKFFKTGDCIEEEEEGEEEEKMLHVSEEKVDRDNETEKANKLNVEYYEPKPGDAVVGVVVSGNENKL
                            VE PDENEALAPFMKFFK  D +EEE   EEEEK +   EEK+D D E  KANKLNVEYYEPKPGD VVGVVVSGNENKL
Subjt:  --------------------VEKPDENEALAPFMKFFKTGDCIEEEEEGEEEEKMLHVSEEKVDRDNETEKANKLNVEYYEPKPGDAVVGVVVSGNENKL

Query:  DINVGADLLGTMLTKEVLPLYDKEMEFLMCDFDKDAESFMVNGKMGLVKYDDAVSHGPGPGRPVVETGTVLFAEVLGRTLGGRPLLSTRRLFRRIA----
        DINVGADLLGTMLTKEVLPLYDKE+EFLMCD DKDAESFM+NGKMGLVKYDDAVS G GPGRPVVETGTVLFAEVLGRTL GRPLLSTRRLFRR+A    
Subjt:  DINVGADLLGTMLTKEVLPLYDKEMEFLMCDFDKDAESFMVNGKMGLVKYDDAVSHGPGPGRPVVETGTVLFAEVLGRTLGGRPLLSTRRLFRRIA----

Query:  ------------------------GIEDL------STLNNTLYLLQHLLDIIGRK---------EGMDDL--------------------GSLNWLLLC-
                                 IE L      + L N +     L + +GR+         E  +DL                    G++  +    
Subjt:  ------------------------GIEDL------STLNNTLYLLQHLLDIIGRK---------EGMDDL--------------------GSLNWLLLC-

Query:  ----------SGLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQKLPDVEGIPRPEPART
                  SGLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLFISNKEKVFSEA+IMAKKYRQ+LP VEG  RPEPA T
Subjt:  ----------SGLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQKLPDVEGIPRPEPART

Query:  TDLSFENEASMYANWKWFKFER
        TDL FENE+SMYANWKWFKFER
Subjt:  TDLSFENEASMYANWKWFKFER

A0A6J1JY23 uncharacterized protein LOC1114898624.3e-15362.84Show/hide
Query:  LCVPNSSFEPRDFLVSSNPVRNSALLNVPRFLTPPGISNSKFFSTSNWSNIHNKSKKLTFRERNSVAFCSRNDVFDTFSSTQSPDRPQEDGIQEIDEL--
        LC  NSS EP+DFLVSSN +RNSA      FLT P ISN K +STSNWSN  NKSKKLT + R+SV FCSRN++FD FSSTQ P++P ED IQ IDE+  
Subjt:  LCVPNSSFEPRDFLVSSNPVRNSALLNVPRFLTPPGISNSKFFSTSNWSNIHNKSKKLTFRERNSVAFCSRNDVFDTFSSTQSPDRPQEDGIQEIDEL--

Query:  --------------------VEKPDENEALAPFMKFFKTGDCIEEEEEGEEEEKMLHVSEEKVDRDNETEKANKLNVEYYEPKPGDAVVGVVVSGNENKL
                            VE P ENEALAPFMKFFK  D +EEE   EEEEK +   EEK+D D E  KANKLNVEYYEPKPGD VVGVVVSGNENKL
Subjt:  --------------------VEKPDENEALAPFMKFFKTGDCIEEEEEGEEEEKMLHVSEEKVDRDNETEKANKLNVEYYEPKPGDAVVGVVVSGNENKL

Query:  DINVGADLLGTMLTKEVLPLYDKEMEFLMCDFDKDAESFMVNGKMGLVKYDDAVSHGPGPGRPVVETGTVLFAEVLGRTLGGRPLLSTRRLFRRIA----
        DINVGADLLGTMLTKEVLPLYDKE+EFLMCD DKDAESFM+NGKMGLVKYDDAVS GPGPGRPVVETGTVLFAEVLGRTL GRPLLSTRRLFRR+A    
Subjt:  DINVGADLLGTMLTKEVLPLYDKEMEFLMCDFDKDAESFMVNGKMGLVKYDDAVSHGPGPGRPVVETGTVLFAEVLGRTLGGRPLLSTRRLFRRIA----

Query:  ------------------------GIEDL------STLNNTLYLLQHLLDIIGRK---------EGMDDL--------------------GSLNWLLLC-
                                 IE L      + L N +     L + +GR+         E  +DL                    G++  +    
Subjt:  ------------------------GIEDL------STLNNTLYLLQHLLDIIGRK---------EGMDDL--------------------GSLNWLLLC-

Query:  ----------SGLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQKLPDVEGIPRPEPART
                  SGLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLFISNKEKVF EA+IMAKKYRQ+LP VEG  RPEPA T
Subjt:  ----------SGLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQKLPDVEGIPRPEPART

Query:  TDLSFENEASMYANWKWFKFER
        TDL F+NE+SMYANWKWFKFER
Subjt:  TDLSFENEASMYANWKWFKFER

SwissProt top hitse value%identityAlignment
P29344 30S ribosomal protein S1, chloroplastic1.4e-0735.64Show/hide
Query:  DLGSLNWLLLCSGLLHVSNITRARVTSVSDLLAVGEKVKVLVVK-SMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQKLPDVEGIPRPEPA
        D+G +N      GLLHVS I+  RV+ ++ +L  G+ +KV+++       ++SLS   LE  PG  I N + VF +AE MA+ +RQ++   E + R +  
Subjt:  DLGSLNWLLLCSGLLHVSNITRARVTSVSDLLAVGEKVKVLVVK-SMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQKLPDVEGIPRPEPA

Query:  R
        R
Subjt:  R

P46228 30S ribosomal protein S13.1e-0735.05Show/hide
Query:  IIGRKEGMDDLGSLNWLLLCSGLLHVSNITRARVTSVSDLLAVGEKVKVLVVK-SMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQKL
        ++G   G+   G+   +   SGLLH+S I+   + +   +  V ++VKV+++       +ISLS   LE EPG  + N E V+ +AE MA +YR+KL
Subjt:  IIGRKEGMDDLGSLNWLLLCSGLLHVSNITRARVTSVSDLLAVGEKVKVLVVK-SMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQKL

P73530 30S ribosomal protein S1 homolog A6.3e-0830.08Show/hide
Query:  IIGRKEGMDDLGSLNWLLLCSGLLHVSNITRARVTSVSDLLAVGEKVKVLVVK-SMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQK-LPD
        ++G   G+   G+   +   SGLLH+S I+   + +   +  V +++KV+++       +ISLS   LE EPG  + +++ V   A+ MA+ +RQK L +
Subjt:  IIGRKEGMDDLGSLNWLLLCSGLLHVSNITRARVTSVSDLLAVGEKVKVLVVK-SMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQK-LPD

Query:  VEGIPRPEPARTTDLSFENEASM
         +GIP   P    D   E + S+
Subjt:  VEGIPRPEPARTTDLSFENEASM

Q93VC7 30S ribosomal protein S1, chloroplastic1.4e-0735.64Show/hide
Query:  DLGSLNWLLLCSGLLHVSNITRARVTSVSDLLAVGEKVKVLVVK-SMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQKLPDVEGIPRPEPA
        D+G +N      GLLHVS I+  RV+ ++ +L  G+ +KV+++       ++SLS   LE  PG  I N + VF +AE MA+ +RQ++   E + R +  
Subjt:  DLGSLNWLLLCSGLLHVSNITRARVTSVSDLLAVGEKVKVLVVK-SMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQKLPDVEGIPRPEPA

Query:  R
        R
Subjt:  R

Q9M9H4 Protein PIGMENT DEFECTIVE 338, chloroplastic7.3e-8145.63Show/hide
Query:  KKLTFRERNSVAFCSRNDVFDTFSSTQSPDRPQEDGIQEI----------DELVEKPDENEALAPFMKFFKTGDCIEEEEEGEEEEKMLHVSEEKVDRDN
        ++   R    + FCSR DV    S   +     E+ I+ +          +E   K D++  L PF+KFFK     EEE EG E E    VS+E      
Subjt:  KKLTFRERNSVAFCSRNDVFDTFSSTQSPDRPQEDGIQEI----------DELVEKPDENEALAPFMKFFKTGDCIEEEEEGEEEEKMLHVSEEKVDRDN

Query:  ETEKANKLNVEYYEPKPGDAVVGVVVSGNENKLDINVGADLLGTMLTKEVLPLYDKEMEFLMCDFDKDAESFMVNGKMGLVKYDD---AVSHGPGPGRPV
             ++++VEYY+PKPGD VVGVVVSGNENKLD+N+GAD+LGTMLTKE+LPLYDKE+++L+CD   DAE F+VNGKMG+VK DD    ++     GRPV
Subjt:  ETEKANKLNVEYYEPKPGDAVVGVVVSGNENKLDINVGADLLGTMLTKEVLPLYDKEMEFLMCDFDKDAESFMVNGKMGLVKYDD---AVSHGPGPGRPV

Query:  VETGTVLFAEVLGRTLGGRPLLSTRRLFRRIA--GIEDLSTLN----------NTLYLL----------------------QHLLDIIGRK---------
        VE GTV+FAEVLGRTL GRPLLS+RR FRRIA   +  +  LN          NT  LL                        L + +GR+         
Subjt:  VETGTVLFAEVLGRTLGGRPLLSTRRLFRRIA--GIEDLSTLN----------NTLYLL----------------------QHLLDIIGRK---------

Query:  EGMDDL---GSLNWLLLC----------------------------SGLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLF
        E  +DL     + W  L                             SGLLH+SNITR R+ SVSD+L V E VKVLVVKS+FPDKISLSIADLESEPGLF
Subjt:  EGMDDL---GSLNWLLLC----------------------------SGLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLF

Query:  ISNKEKVFSEAEIMAKKYRQKLPDVEGIP-RPEPARTTDLSFENEASMYANWKWFKFE
        IS++EKVF+EAE MAKKYR+K+P V   P    P  T+      +  +YANW+WFKFE
Subjt:  ISNKEKVFSEAEIMAKKYRQKLPDVEGIP-RPEPARTTDLSFENEASMYANWKWFKFE

Arabidopsis top hitse value%identityAlignment
AT1G71720.1 Nucleic acid-binding proteins superfamily5.2e-8245.63Show/hide
Query:  KKLTFRERNSVAFCSRNDVFDTFSSTQSPDRPQEDGIQEI----------DELVEKPDENEALAPFMKFFKTGDCIEEEEEGEEEEKMLHVSEEKVDRDN
        ++   R    + FCSR DV    S   +     E+ I+ +          +E   K D++  L PF+KFFK     EEE EG E E    VS+E      
Subjt:  KKLTFRERNSVAFCSRNDVFDTFSSTQSPDRPQEDGIQEI----------DELVEKPDENEALAPFMKFFKTGDCIEEEEEGEEEEKMLHVSEEKVDRDN

Query:  ETEKANKLNVEYYEPKPGDAVVGVVVSGNENKLDINVGADLLGTMLTKEVLPLYDKEMEFLMCDFDKDAESFMVNGKMGLVKYDD---AVSHGPGPGRPV
             ++++VEYY+PKPGD VVGVVVSGNENKLD+N+GAD+LGTMLTKE+LPLYDKE+++L+CD   DAE F+VNGKMG+VK DD    ++     GRPV
Subjt:  ETEKANKLNVEYYEPKPGDAVVGVVVSGNENKLDINVGADLLGTMLTKEVLPLYDKEMEFLMCDFDKDAESFMVNGKMGLVKYDD---AVSHGPGPGRPV

Query:  VETGTVLFAEVLGRTLGGRPLLSTRRLFRRIA--GIEDLSTLN----------NTLYLL----------------------QHLLDIIGRK---------
        VE GTV+FAEVLGRTL GRPLLS+RR FRRIA   +  +  LN          NT  LL                        L + +GR+         
Subjt:  VETGTVLFAEVLGRTLGGRPLLSTRRLFRRIA--GIEDLSTLN----------NTLYLL----------------------QHLLDIIGRK---------

Query:  EGMDDL---GSLNWLLLC----------------------------SGLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLF
        E  +DL     + W  L                             SGLLH+SNITR R+ SVSD+L V E VKVLVVKS+FPDKISLSIADLESEPGLF
Subjt:  EGMDDL---GSLNWLLLC----------------------------SGLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLF

Query:  ISNKEKVFSEAEIMAKKYRQKLPDVEGIP-RPEPARTTDLSFENEASMYANWKWFKFE
        IS++EKVF+EAE MAKKYR+K+P V   P    P  T+      +  +YANW+WFKFE
Subjt:  ISNKEKVFSEAEIMAKKYRQKLPDVEGIP-RPEPARTTDLSFENEASMYANWKWFKFE

AT5G30510.1 ribosomal protein S11.0e-0835.64Show/hide
Query:  DLGSLNWLLLCSGLLHVSNITRARVTSVSDLLAVGEKVKVLVVK-SMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQKLPDVEGIPRPEPA
        D+G +N      GLLHVS I+  RV+ ++ +L  G+ +KV+++       ++SLS   LE  PG  I N + VF +AE MA+ +RQ++   E + R +  
Subjt:  DLGSLNWLLLCSGLLHVSNITRARVTSVSDLLAVGEKVKVLVVK-SMFPDKISLSIADLESEPGLFISNKEKVFSEAEIMAKKYRQKLPDVEGIPRPEPA

Query:  R
        R
Subjt:  R


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGGGCCTACTTTTGAGACCCATCGGGCGCCTCCGCCCATGGTCCGGCCCCTCCCACTGCACCTCCTCAGGCCCAGAAACAGTCGCGTCGAATTTCTCCAACATGA
ACGGATAATGCACACTCATGGCAAGCCTTCCTCTTCGGCTCTGTCAATCCCAGTAGCCTACGACGCCGAGTCAGCTCATGAGTATCTGTGTGTTCCCAACTCGTCGTTTG
AGCCTCGAGATTTCTTAGTTTCATCCAATCCCGTTCGAAATTCAGCACTTCTCAACGTTCCCAGATTTCTAACTCCTCCAGGAATCTCAAATTCTAAGTTTTTTTCCACT
TCTAATTGGTCTAATATCCATAATAAATCTAAGAAGTTGACATTTCGGGAAAGAAATAGCGTTGCGTTTTGCTCTAGGAATGACGTTTTTGATACTTTCTCGAGCACCCA
GTCGCCGGATAGGCCTCAAGAGGATGGAATTCAAGAGATTGATGAGCTGGTTGAAAAGCCGGACGAGAATGAGGCCTTGGCGCCATTTATGAAATTCTTTAAGACTGGAG
ATTGTATAGAAGAAGAGGAAGAAGGGGAAGAAGAGGAAAAAATGCTACATGTTTCTGAAGAAAAAGTTGACAGAGATAATGAGACCGAGAAAGCCAATAAGTTAAATGTG
GAGTACTATGAGCCCAAACCTGGAGATGCTGTGGTTGGCGTAGTTGTCTCAGGTAACGAAAACAAGCTTGATATCAACGTGGGAGCGGACTTATTGGGAACAATGTTGAC
AAAGGAGGTGCTTCCCTTGTATGACAAAGAGATGGAATTCTTGATGTGTGATTTTGACAAGGATGCTGAGTCTTTTATGGTGAATGGAAAGATGGGGTTGGTGAAATACG
ATGACGCTGTTAGCCATGGACCGGGGCCGGGGCGGCCTGTCGTGGAGACTGGCACAGTTTTATTTGCTGAGGTTCTGGGAAGAACACTCGGTGGTCGGCCATTGCTCTCG
ACCAGAAGGTTGTTTCGGCGGATAGCTGGCATCGAAGACCTCAGTACCTTGAACAACACTCTTTACTTGCTGCAACATTTGCTTGACATAATTGGGAGAAAGGAAGGTAT
GGACGATCTCGGCTCACTGAATTGGCTCCTATTGTGCAGCGGGTTACTCCATGTTTCGAACATCACCCGTGCCCGAGTTACCTCGGTGAGCGACTTACTTGCAGTGGGTG
AAAAGGTCAAAGTTCTTGTTGTGAAGTCAATGTTTCCTGACAAGATATCTCTAAGTATTGCAGACCTTGAAAGCGAGCCCGGCCTGTTTATATCGAACAAAGAGAAAGTA
TTTTCAGAGGCCGAGATCATGGCGAAGAAGTACAGGCAAAAGCTACCTGATGTTGAAGGCATTCCCAGGCCCGAACCTGCTCGAACTACAGATCTGTCGTTTGAAAACGA
AGCGAGCATGTACGCGAATTGGAAGTGGTTCAAATTCGAAAGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTGGGCCTACTTTTGAGACCCATCGGGCGCCTCCGCCCATGGTCCGGCCCCTCCCACTGCACCTCCTCAGGCCCAGAAACAGTCGCGTCGAATTTCTCCAACATGA
ACGGATAATGCACACTCATGGCAAGCCTTCCTCTTCGGCTCTGTCAATCCCAGTAGCCTACGACGCCGAGTCAGCTCATGAGTATCTGTGTGTTCCCAACTCGTCGTTTG
AGCCTCGAGATTTCTTAGTTTCATCCAATCCCGTTCGAAATTCAGCACTTCTCAACGTTCCCAGATTTCTAACTCCTCCAGGAATCTCAAATTCTAAGTTTTTTTCCACT
TCTAATTGGTCTAATATCCATAATAAATCTAAGAAGTTGACATTTCGGGAAAGAAATAGCGTTGCGTTTTGCTCTAGGAATGACGTTTTTGATACTTTCTCGAGCACCCA
GTCGCCGGATAGGCCTCAAGAGGATGGAATTCAAGAGATTGATGAGCTGGTTGAAAAGCCGGACGAGAATGAGGCCTTGGCGCCATTTATGAAATTCTTTAAGACTGGAG
ATTGTATAGAAGAAGAGGAAGAAGGGGAAGAAGAGGAAAAAATGCTACATGTTTCTGAAGAAAAAGTTGACAGAGATAATGAGACCGAGAAAGCCAATAAGTTAAATGTG
GAGTACTATGAGCCCAAACCTGGAGATGCTGTGGTTGGCGTAGTTGTCTCAGGTAACGAAAACAAGCTTGATATCAACGTGGGAGCGGACTTATTGGGAACAATGTTGAC
AAAGGAGGTGCTTCCCTTGTATGACAAAGAGATGGAATTCTTGATGTGTGATTTTGACAAGGATGCTGAGTCTTTTATGGTGAATGGAAAGATGGGGTTGGTGAAATACG
ATGACGCTGTTAGCCATGGACCGGGGCCGGGGCGGCCTGTCGTGGAGACTGGCACAGTTTTATTTGCTGAGGTTCTGGGAAGAACACTCGGTGGTCGGCCATTGCTCTCG
ACCAGAAGGTTGTTTCGGCGGATAGCTGGCATCGAAGACCTCAGTACCTTGAACAACACTCTTTACTTGCTGCAACATTTGCTTGACATAATTGGGAGAAAGGAAGGTAT
GGACGATCTCGGCTCACTGAATTGGCTCCTATTGTGCAGCGGGTTACTCCATGTTTCGAACATCACCCGTGCCCGAGTTACCTCGGTGAGCGACTTACTTGCAGTGGGTG
AAAAGGTCAAAGTTCTTGTTGTGAAGTCAATGTTTCCTGACAAGATATCTCTAAGTATTGCAGACCTTGAAAGCGAGCCCGGCCTGTTTATATCGAACAAAGAGAAAGTA
TTTTCAGAGGCCGAGATCATGGCGAAGAAGTACAGGCAAAAGCTACCTGATGTTGAAGGCATTCCCAGGCCCGAACCTGCTCGAACTACAGATCTGTCGTTTGAAAACGA
AGCGAGCATGTACGCGAATTGGAAGTGGTTCAAATTCGAAAGGTAG
Protein sequenceShow/hide protein sequence
MAGPTFETHRAPPPMVRPLPLHLLRPRNSRVEFLQHERIMHTHGKPSSSALSIPVAYDAESAHEYLCVPNSSFEPRDFLVSSNPVRNSALLNVPRFLTPPGISNSKFFST
SNWSNIHNKSKKLTFRERNSVAFCSRNDVFDTFSSTQSPDRPQEDGIQEIDELVEKPDENEALAPFMKFFKTGDCIEEEEEGEEEEKMLHVSEEKVDRDNETEKANKLNV
EYYEPKPGDAVVGVVVSGNENKLDINVGADLLGTMLTKEVLPLYDKEMEFLMCDFDKDAESFMVNGKMGLVKYDDAVSHGPGPGRPVVETGTVLFAEVLGRTLGGRPLLS
TRRLFRRIAGIEDLSTLNNTLYLLQHLLDIIGRKEGMDDLGSLNWLLLCSGLLHVSNITRARVTSVSDLLAVGEKVKVLVVKSMFPDKISLSIADLESEPGLFISNKEKV
FSEAEIMAKKYRQKLPDVEGIPRPEPARTTDLSFENEASMYANWKWFKFER