CuGenDBv2

Sgr028172 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr028172
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: 30S ribosomal protein S20
Genome location: tig00153056:4197699..4216787
RNA-Seq Expression: Sgr028172
Synteny: Sgr028172
Gene Ontology terms:
GO:0006412 - translation (biological process)
GO:0043248 - proteasome assembly (biological process)
GO:0005840 - ribosome (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0003735 - structural constituent of ribosome (molecular function)
InterPro domains:
IPR002583 - Ribosomal protein S20
IPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold
IPR019538 - 26S proteasome non-ATPase regulatory subunit 5
IPR036510 - Ribosomal protein S20 superfamily
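
The genome location above is written as contig:start..end. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention), the string can be parsed and the locus span computed as follows. The function name parse_location is hypothetical and not part of any CuGenDB tool.

```python
import re

def parse_location(loc):
    """Split a 'contig:start..end' string into (contig, start, end).

    Hypothetical helper for illustration only.
    """
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

contig, start, end = parse_location("tig00153056:4197699..4216787")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(contig, start, end, span)
```

Under that assumption the locus spans 19089 bp on the contig.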


Homology
GenBank top hits (columns: e value, %identity, alignment)
KAG6607893.1 26S proteasome non-ATPase regulatory subunit 5, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.1e-239, %identity: 88.33)
Query:  MEEFAVDDPTRLLEASADFANYPGVRTDASVKEFLDRFPLPVIINALQTKAESPGLETTLVACLDRIFKTKYGASHIPHYMPFVQVGLRADSQTVRSLAC
        MEEFAVDDPT+LLEA+ADFANYPGVRTDASVKEF  RFPLPV+INALQ KAE PGLE TLVACLDRIFKTKYGAS IPHYMPFVQVGL+A+SQ VRSLAC
Subjt:  MEEFAVDDPTRLLEASADFANYPGVRTDASVKEFLDRFPLPVIINALQTKAESPGLETTLVACLDRIFKTKYGASHIPHYMPFVQVGLRADSQTVRSLAC

Query:  KTVTHLLEESDENAVLAIQLIIDYDIYPLLLDCLLHGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKREATHLGTLASTCSSLGRVRVMALVVKLFSISS
        KTVT LLEE+D    LA QLI+DY+IYPLL++CLL+GNEQVANSSMDAIKKLAAFPKGMEIIFPTNK EATHLGT+ASTCSSLGRVRVMAL+VKLFS+SS
Subjt:  KTVTHLLEESDENAVLAIQLIIDYDIYPLLLDCLLHGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKREATHLGTLASTCSSLGRVRVMALVVKLFSISS

Query:  SVASAIYNSNLLSLLESEINNSNDTLVTLSVLELLYELVEIEHGTEFLPRTSFLQLLSSIISNSSGESILRSRAMVIGGRLLSKENIFLLVDESCVRILI
        SVASA+YNSNLL+LLESEI+NSNDTLVTLSVLELLYELVEIEHGT FLPRTSFLQLLSSIISN S ESILRSRAMVI GRLLSKENIF LVDESCVRILI
Subjt:  SVASAIYNSNLLSLLESEINNSNDTLVTLSVLELLYELVEIEHGTEFLPRTSFLQLLSSIISNSSGESILRSRAMVIGGRLLSKENIFLLVDESCVRILI

Query:  SAIDEILRSS-GQDVNVCESACEALGQIGSTVWGATLLQSSFPTCVKHVIDAAFVQHEHGKQLAAMHALGNICGETRSENDIMLNDNAEENLRDLIYQIA
        SAIDEIL SS GQDVNVCESA EALGQIGS+ WGATLL SS+PTCVK+VI+AAF +HEHGKQLAAMHALGNI GETRSENDIMLNDNAEENLRDLIYQ A
Subjt:  SAIDEILRSS-GQDVNVCESACEALGQIGSTVWGATLLQSSFPTCVKHVIDAAFVQHEHGKQLAAMHALGNICGETRSENDIMLNDNAEENLRDLIYQIA

Query:  SRGSKMMPSGLFLAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQDIINIVTDASMETTKIGMEARYNCCMAIHKAFMSSTRLTGDPALAGIASK
        SR  KM PSGL LAVLQQDSEIRLASYRMITGLV RPWCLMEICSKQDIINIV+DAS ETTKIGMEARYNCC+AIHK FMSSTRLTGDPALAGIASK
Subjt:  SRGSKMMPSGLFLAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQDIINIVTDASMETTKIGMEARYNCCMAIHKAFMSSTRLTGDPALAGIASK

XP_022139463.1 uncharacterized protein LOC111010386 isoform X1 [Momordica charantia] (e value: 2.5e-244, %identity: 90.74)
Query:  MEEFAVDDPTRLLEASADFANYPGVRTDASVKEFLDRFPLPVIINALQTKAESPGLETTLVACLDRIFKTKYGASHIPHYMPFVQVGLRADSQTVRSLAC
        MEEFAVDDPT+LLEA+ADFA+YPGVRTDASVKEFLDRFPLPVIINALQTKAE+PGLE TLVACLDRIFKTKYGAS IPH+MPF+QVGLRADSQTVR LAC
Subjt:  MEEFAVDDPTRLLEASADFANYPGVRTDASVKEFLDRFPLPVIINALQTKAESPGLETTLVACLDRIFKTKYGASHIPHYMPFVQVGLRADSQTVRSLAC

Query:  KTVTHLLEESDENAVLAIQLIIDYDIYPLLLDCLLHGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKREATHLGTLASTCSSLGRVRVMALVVKLFSISS
        KTVT LLEESD +AVLAIQLIIDY IYPLLL+CLL+GNEQVANSSMDAIKKLAAFPKGME+IFPTN+ EATHLGTLASTCSSLGRVRVMALVVKLFS+S 
Subjt:  KTVTHLLEESDENAVLAIQLIIDYDIYPLLLDCLLHGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKREATHLGTLASTCSSLGRVRVMALVVKLFSISS

Query:  SVASAIYNSNLLSLLESEINNSNDTLVTLSVLELLYELVEIEHGTEFLPRTSFLQLLSSIISNSSGESILRSRAMVIGGRLLSKENIFLLVDESCVRILI
        SVASAIYNSNLL+LLESEINNSNDTLVTLSVLELLYELVEIEHGT+FLPRTS LQLLSSIISNSS ESILRSRAMVI GRLLSKEN++LLVDESCVRILI
Subjt:  SVASAIYNSNLLSLLESEINNSNDTLVTLSVLELLYELVEIEHGTEFLPRTSFLQLLSSIISNSSGESILRSRAMVIGGRLLSKENIFLLVDESCVRILI

Query:  SAIDEILRSS-GQDVNVCESACEALGQIGSTVWGATLLQSSFPTCVKHVIDAAFVQHEHGKQLAAMHALGNICGETRSENDIMLNDNAEENLRDLIYQIA
        SAIDE L SS GQDVNVCESA EALGQIGST  GATLL SSF TCVK +I AAF +HEHGKQLAAMHALGNICGETRSENDIMLND AEENLRDL+YQIA
Subjt:  SAIDEILRSS-GQDVNVCESACEALGQIGSTVWGATLLQSSFPTCVKHVIDAAFVQHEHGKQLAAMHALGNICGETRSENDIMLNDNAEENLRDLIYQIA

Query:  SRGSKMMPSGLFLAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQDIINIVTDASMETTKIGMEARYNCCMAIHKAFMSSTRLTGDPALAGIASK
        SR SK+MPSGLFLAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQ+IINIVTDAS ETTKIGMEARYNCCMAIHKAFMSSTRLTGDPALAGIASK
Subjt:  SRGSKMMPSGLFLAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQDIINIVTDASMETTKIGMEARYNCCMAIHKAFMSSTRLTGDPALAGIASK

XP_022940916.1 uncharacterized protein LOC111446360 [Cucurbita moschata] (e value: 2.5e-241, %identity: 88.93)
Query:  MEEFAVDDPTRLLEASADFANYPGVRTDASVKEFLDRFPLPVIINALQTKAESPGLETTLVACLDRIFKTKYGASHIPHYMPFVQVGLRADSQTVRSLAC
        MEEFAVDDPT+LLEA+ADFANYPGVRTDASVKEF  RFPLPV+INALQ KAE PGLE TLVACLDRIFKTKYGAS IPHYMPFVQVGL+ADSQ VRSLAC
Subjt:  MEEFAVDDPTRLLEASADFANYPGVRTDASVKEFLDRFPLPVIINALQTKAESPGLETTLVACLDRIFKTKYGASHIPHYMPFVQVGLRADSQTVRSLAC

Query:  KTVTHLLEESDENAVLAIQLIIDYDIYPLLLDCLLHGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKREATHLGTLASTCSSLGRVRVMALVVKLFSISS
        KTVT LLEE+D    LA QLI+DY+IYPLL++CLL+GNEQVANSSMDAIKKLAAFPKGMEIIFPTNK EATHLGT+ASTCSSLGRVRVMAL+VKLFS+SS
Subjt:  KTVTHLLEESDENAVLAIQLIIDYDIYPLLLDCLLHGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKREATHLGTLASTCSSLGRVRVMALVVKLFSISS

Query:  SVASAIYNSNLLSLLESEINNSNDTLVTLSVLELLYELVEIEHGTEFLPRTSFLQLLSSIISNSSGESILRSRAMVIGGRLLSKENIFLLVDESCVRILI
        SVASA+YNSNLL+LLESEI+NSNDTLVTLSVLELLYELVEIEHGT FLPRTSFLQLLSSIISN S ESILRSRAMVI GRLLSKENIF LVDESCVRILI
Subjt:  SVASAIYNSNLLSLLESEINNSNDTLVTLSVLELLYELVEIEHGTEFLPRTSFLQLLSSIISNSSGESILRSRAMVIGGRLLSKENIFLLVDESCVRILI

Query:  SAIDEILRSS-GQDVNVCESACEALGQIGSTVWGATLLQSSFPTCVKHVIDAAFVQHEHGKQLAAMHALGNICGETRSENDIMLNDNAEENLRDLIYQIA
        SAIDEIL SS GQDVNVCESA EALGQIGS+ WGATLL SS+PTCVK+VI+AAF +HEHGKQLAAMHALGNI GETRSENDIMLNDNAEENLRDLIYQ A
Subjt:  SAIDEILRSS-GQDVNVCESACEALGQIGSTVWGATLLQSSFPTCVKHVIDAAFVQHEHGKQLAAMHALGNICGETRSENDIMLNDNAEENLRDLIYQIA

Query:  SRGSKMMPSGLFLAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQDIINIVTDASMETTKIGMEARYNCCMAIHKAFMSSTRLTGDPALAGIASK
        SR  KM PSGL LAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQDIINIV+DAS ETTKIGMEARYNCC+AIHKAFMSSTRLTGDPALAGIASK
Subjt:  SRGSKMMPSGLFLAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQDIINIVTDASMETTKIGMEARYNCCMAIHKAFMSSTRLTGDPALAGIASK

XP_022981236.1 uncharacterized protein LOC111480433 [Cucurbita maxima] (e value: 6.9e-239, %identity: 87.93)
Query:  MEEFAVDDPTRLLEASADFANYPGVRTDASVKEFLDRFPLPVIINALQTKAESPGLETTLVACLDRIFKTKYGASHIPHYMPFVQVGLRADSQTVRSLAC
        MEEFAVDDPT+LLEA+ADFANYPGVRTD SVKEF  RFPLPV+INALQ KAE PGLE TLVACLDRIFKTKYGAS IPHYMPFVQVGL+ADSQ VRSLAC
Subjt:  MEEFAVDDPTRLLEASADFANYPGVRTDASVKEFLDRFPLPVIINALQTKAESPGLETTLVACLDRIFKTKYGASHIPHYMPFVQVGLRADSQTVRSLAC

Query:  KTVTHLLEESDENAVLAIQLIIDYDIYPLLLDCLLHGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKREATHLGTLASTCSSLGRVRVMALVVKLFSISS
        KTVT LLEE+D    LA QLI+DY+IYPLL++CLL+GNEQVANSSMDA+KKLAAFPKGMEIIFPTNK EATHLGT+ASTCSSLGRVRVMAL+VKLFS+SS
Subjt:  KTVTHLLEESDENAVLAIQLIIDYDIYPLLLDCLLHGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKREATHLGTLASTCSSLGRVRVMALVVKLFSISS

Query:  SVASAIYNSNLLSLLESEINNSNDTLVTLSVLELLYELVEIEHGTEFLPRTSFLQLLSSIISNSSGESILRSRAMVIGGRLLSKENIFLLVDESCVRILI
        SVASA+YNSNLL+LLESEI+NSNDTLVTLSVLELLYELVEIEHGT FLPRTSFLQLLSSIISN S ESILRSRAMVI GRLLSKENIF LVDESCVRILI
Subjt:  SVASAIYNSNLLSLLESEINNSNDTLVTLSVLELLYELVEIEHGTEFLPRTSFLQLLSSIISNSSGESILRSRAMVIGGRLLSKENIFLLVDESCVRILI

Query:  SAIDEILRSS-GQDVNVCESACEALGQIGSTVWGATLLQSSFPTCVKHVIDAAFVQHEHGKQLAAMHALGNICGETRSENDIMLNDNAEENLRDLIYQIA
        S+IDEIL SS GQDVNVCESA EALGQIGS+ WGATLL SS+PTCVK+VI+AAF +HEHGKQLAAMHALGNI GETRSENDIMLNDNAEENL DLIYQ A
Subjt:  SAIDEILRSS-GQDVNVCESACEALGQIGSTVWGATLLQSSFPTCVKHVIDAAFVQHEHGKQLAAMHALGNICGETRSENDIMLNDNAEENLRDLIYQIA

Query:  SRGSKMMPSGLFLAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQDIINIVTDASMETTKIGMEARYNCCMAIHKAFMSSTRLTGDPALAGIASK
        SR  K+ PSGL LAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQDIINIV+DAS ETTKIGMEARYNCC+AIHKAFMSSTRLTGDPALAGIASK
Subjt:  SRGSKMMPSGLFLAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQDIINIVTDASMETTKIGMEARYNCCMAIHKAFMSSTRLTGDPALAGIASK

XP_038898234.1 uncharacterized protein LOC120085959 isoform X1 [Benincasa hispida] (e value: 2.3e-242, %identity: 89.74)
Query:  MEEFAVDDPTRLLEASADFANYPGVRTDASVKEFLDRFPLPVIINALQTKAESPGLETTLVACLDRIFKTKYGASHIPHYMPFVQVGLRADSQTVRSLAC
        MEEF+VDDPTRLLEA+ADFANYPGVRTDASVKEFLDRFPLP IINALQTKAE P LE TLVACLDRIFKTKYGAS IPHYMPFVQVGLRADSQ VRSLAC
Subjt:  MEEFAVDDPTRLLEASADFANYPGVRTDASVKEFLDRFPLPVIINALQTKAESPGLETTLVACLDRIFKTKYGASHIPHYMPFVQVGLRADSQTVRSLAC

Query:  KTVTHLLEESDENAVLAIQLIIDYDIYPLLLDCLLHGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKREATHLGTLASTCSSLGRVRVMALVVKLFSISS
        KTVT LL+ESDE  +LAIQLIIDY IYPLLL CL++GNEQVANSSMDAIK LAAFP GMEIIFPTNK EATHLGT+ASTCSSLGRVRVMALVVKLFS+SS
Subjt:  KTVTHLLEESDENAVLAIQLIIDYDIYPLLLDCLLHGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKREATHLGTLASTCSSLGRVRVMALVVKLFSISS

Query:  SVASAIYNSNLLSLLESEINNSNDTLVTLSVLELLYELVEIEHGTEFLPRTSFLQLLSSIISNSSGESILRSRAMVIGGRLLSKENIFLLVDESCVRILI
        SVASA+YN+NLL LLESEINNSNDTLVTLSVLELLYELVEIEHGT+FLPRTSF+ LLSSIISNSS ESILRSRAMVI GRLLSK+NIF LVDESCVR LI
Subjt:  SVASAIYNSNLLSLLESEINNSNDTLVTLSVLELLYELVEIEHGTEFLPRTSFLQLLSSIISNSSGESILRSRAMVIGGRLLSKENIFLLVDESCVRILI

Query:  SAIDEILRSS-GQDVNVCESACEALGQIGSTVWGATLLQSSFPTCVKHVIDAAFVQHEHGKQLAAMHALGNICGETRSENDIMLNDNAEENLRDLIYQIA
        SAID IL SS GQDVNVCE+A EALGQIGS++WGATLL SSFPTCVKHVI AAF +HEHGKQLAAMHALGNI GETRSEND+MLNDNAEENLRDLIYQIA
Subjt:  SAIDEILRSS-GQDVNVCESACEALGQIGSTVWGATLLQSSFPTCVKHVIDAAFVQHEHGKQLAAMHALGNICGETRSENDIMLNDNAEENLRDLIYQIA

Query:  SRGSKMMPSGLFLAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQDIINIVTDASMETTKIGMEARYNCCMAIHKAFMSSTRLTGDPALAGIASK
        SR SKMMPSGLFLAVLQQDSEIRLASYRM+TGLVARPWCLMEICSKQDIINIV+DAS ETTKIGMEARYNCC+AIHKAFMSSTRLTGDPALAGIASK
Subjt:  SRGSKMMPSGLFLAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQDIINIVTDASMETTKIGMEARYNCCMAIHKAFMSSTRLTGDPALAGIASK

TrEMBL top hits (columns: e value, %identity, alignment)
A0A0A0L246 Uncharacterized protein (e value: 9.1e-237, %identity: 87.73)
Query:  MEEFAVDDPTRLLEASADFANYPGVRTDASVKEFLDRFPLPVIINALQTKAESPGLETTLVACLDRIFKTKYGASHIPHYMPFVQVGLRADSQTVRSLAC
        MEEF+V+DPTRLL+A+A+FANYPGVRTDASVKEFLDRFPLP IINALQTKAE PGLE TLVACLDRIFKTKYGAS IPHYMPFVQVGL+ADSQTVR+LAC
Subjt:  MEEFAVDDPTRLLEASADFANYPGVRTDASVKEFLDRFPLPVIINALQTKAESPGLETTLVACLDRIFKTKYGASHIPHYMPFVQVGLRADSQTVRSLAC

Query:  KTVTHLLEESDENAVLAIQLIIDYDIYPLLLDCLLHGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKREATHLGTLASTCSSLGRVRVMALVVKLFSISS
        KTVT LL+ESDE A+  IQLIIDY IYPLLLDCLL+GNEQVANSSMD+IK LAAFP+GMEII P+NK EATHLGT+ASTCSSLGRVRVMALVVKLFS+SS
Subjt:  KTVTHLLEESDENAVLAIQLIIDYDIYPLLLDCLLHGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKREATHLGTLASTCSSLGRVRVMALVVKLFSISS

Query:  SVASAIYNSNLLSLLESEINNSNDTLVTLSVLELLYELVEIEHGTEFLPRTSFLQLLSSIISNSSGESILRSRAMVIGGRLLSKENIFLLVDESCVRILI
        SVASA+YN+NLLSLLESEINNS DTLVTLSVLELLYELVEIEHGT+FLPRTSFLQLL SIISNSS ESILRSRAMVI GRLLSKENIF LVDESC+R LI
Subjt:  SVASAIYNSNLLSLLESEINNSNDTLVTLSVLELLYELVEIEHGTEFLPRTSFLQLLSSIISNSSGESILRSRAMVIGGRLLSKENIFLLVDESCVRILI

Query:  SAIDEILRSS-GQDVNVCESACEALGQIGSTVWGATLLQSSFPTCVKHVIDAAFVQHEHGKQLAAMHALGNICGETRSENDIMLNDNAEENLRDLIYQIA
        SA+D IL SS G+DVNV E+A EALGQIGS+ WGATLL SSFPTCVKHVI AAF +HEHGKQLAAMHALGNI GE RSENDIMLNDNAEENLRDLIYQIA
Subjt:  SAIDEILRSS-GQDVNVCESACEALGQIGSTVWGATLLQSSFPTCVKHVIDAAFVQHEHGKQLAAMHALGNICGETRSENDIMLNDNAEENLRDLIYQIA

Query:  SRGSKMMPSGLFLAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQDIINIVTDASMETTKIGMEARYNCCMAIHKAFMSSTRLTGDPALAGIASK
        SR SKM PSGLFLAVLQQDSEIRLASYRMITGLVARPWCL EICSKQDI+NIV DAS ETTKIGMEARYNCC+AIHKAFMSS RLTGDPALAGIASK
Subjt:  SRGSKMMPSGLFLAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQDIINIVTDASMETTKIGMEARYNCCMAIHKAFMSSTRLTGDPALAGIASK

A0A1S3CKC0 uncharacterized protein LOC103501781 isoform X1 (e value: 9.4e-234, %identity: 86.32)
Query:  MEEFAVDDPTRLLEASADFANYPGVRTDASVKEFLDRFPLPVIINALQTKAESPGLETTLVACLDRIFKTKYGASHIPHYMPFVQVGLRADSQTVRSLAC
        ME+F+V+DPT+LL+A+A+FANYPGVRTDASVKEFLDRFPLP IINALQTKAE PG+E TLVACLDRIFKTKYGAS IPHYMPFVQVGL+ADSQTVR+LAC
Subjt:  MEEFAVDDPTRLLEASADFANYPGVRTDASVKEFLDRFPLPVIINALQTKAESPGLETTLVACLDRIFKTKYGASHIPHYMPFVQVGLRADSQTVRSLAC

Query:  KTVTHLLEESDENAVLAIQLIIDYDIYPLLLDCLLHGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKREATHLGTLASTCSSLGRVRVMALVVKLFSISS
        KTVT LL+ESDE    AIQLIIDY IYPLLLDCLL+GNEQVANSSMD+IK LAAFP+GMEII P+NK EATHLG +ASTCSSLGRVRVMALVVKLFS+SS
Subjt:  KTVTHLLEESDENAVLAIQLIIDYDIYPLLLDCLLHGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKREATHLGTLASTCSSLGRVRVMALVVKLFSISS

Query:  SVASAIYNSNLLSLLESEINNSNDTLVTLSVLELLYELVEIEHGTEFLPRTSFLQLLSSIISNSSGESILRSRAMVIGGRLLSKENIFLLVDESCVRILI
        SVASA+YN+NLLSLLESEINNS DTLVTLSVLELLYELVEIEHGT+FLPRTSFLQLLSSIISNSS ESILRSRAMVI GRLLSKENIF LVDESCVR LI
Subjt:  SVASAIYNSNLLSLLESEINNSNDTLVTLSVLELLYELVEIEHGTEFLPRTSFLQLLSSIISNSSGESILRSRAMVIGGRLLSKENIFLLVDESCVRILI

Query:  SAIDEILRSS-GQDVNVCESACEALGQIGSTVWGATLLQSSFPTCVKHVIDAAFVQHEHGKQLAAMHALGNICGETRSENDIMLNDNAEENLRDLIYQIA
        SA+D IL SS G+DVNV E+A EALGQIGS+ WGATLL SSFPTCVKH I  AF +HEHGKQLAAMHALGNI GE+RSENDI+LNDNAEENLRDLIYQIA
Subjt:  SAIDEILRSS-GQDVNVCESACEALGQIGSTVWGATLLQSSFPTCVKHVIDAAFVQHEHGKQLAAMHALGNICGETRSENDIMLNDNAEENLRDLIYQIA

Query:  SRGSKMMPSGLFLAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQDIINIVTDASMETTKIGMEARYNCCMAIHKAFMSSTRLTGDPALAGIASK
        SR SKM PSGLFLAVLQQDSEIRLASYRMITGLVARPWCL EICSKQ+I+NIV DAS ETTKIGMEARYNCC++IHKAFMSS RLTGDPALAGIASK
Subjt:  SRGSKMMPSGLFLAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQDIINIVTDASMETTKIGMEARYNCCMAIHKAFMSSTRLTGDPALAGIASK

A0A6J1CFN3 uncharacterized protein LOC111010386 isoform X1 (e value: 1.2e-244, %identity: 90.74)
Query:  MEEFAVDDPTRLLEASADFANYPGVRTDASVKEFLDRFPLPVIINALQTKAESPGLETTLVACLDRIFKTKYGASHIPHYMPFVQVGLRADSQTVRSLAC
        MEEFAVDDPT+LLEA+ADFA+YPGVRTDASVKEFLDRFPLPVIINALQTKAE+PGLE TLVACLDRIFKTKYGAS IPH+MPF+QVGLRADSQTVR LAC
Subjt:  MEEFAVDDPTRLLEASADFANYPGVRTDASVKEFLDRFPLPVIINALQTKAESPGLETTLVACLDRIFKTKYGASHIPHYMPFVQVGLRADSQTVRSLAC

Query:  KTVTHLLEESDENAVLAIQLIIDYDIYPLLLDCLLHGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKREATHLGTLASTCSSLGRVRVMALVVKLFSISS
        KTVT LLEESD +AVLAIQLIIDY IYPLLL+CLL+GNEQVANSSMDAIKKLAAFPKGME+IFPTN+ EATHLGTLASTCSSLGRVRVMALVVKLFS+S 
Subjt:  KTVTHLLEESDENAVLAIQLIIDYDIYPLLLDCLLHGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKREATHLGTLASTCSSLGRVRVMALVVKLFSISS

Query:  SVASAIYNSNLLSLLESEINNSNDTLVTLSVLELLYELVEIEHGTEFLPRTSFLQLLSSIISNSSGESILRSRAMVIGGRLLSKENIFLLVDESCVRILI
        SVASAIYNSNLL+LLESEINNSNDTLVTLSVLELLYELVEIEHGT+FLPRTS LQLLSSIISNSS ESILRSRAMVI GRLLSKEN++LLVDESCVRILI
Subjt:  SVASAIYNSNLLSLLESEINNSNDTLVTLSVLELLYELVEIEHGTEFLPRTSFLQLLSSIISNSSGESILRSRAMVIGGRLLSKENIFLLVDESCVRILI

Query:  SAIDEILRSS-GQDVNVCESACEALGQIGSTVWGATLLQSSFPTCVKHVIDAAFVQHEHGKQLAAMHALGNICGETRSENDIMLNDNAEENLRDLIYQIA
        SAIDE L SS GQDVNVCESA EALGQIGST  GATLL SSF TCVK +I AAF +HEHGKQLAAMHALGNICGETRSENDIMLND AEENLRDL+YQIA
Subjt:  SAIDEILRSS-GQDVNVCESACEALGQIGSTVWGATLLQSSFPTCVKHVIDAAFVQHEHGKQLAAMHALGNICGETRSENDIMLNDNAEENLRDLIYQIA

Query:  SRGSKMMPSGLFLAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQDIINIVTDASMETTKIGMEARYNCCMAIHKAFMSSTRLTGDPALAGIASK
        SR SK+MPSGLFLAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQ+IINIVTDAS ETTKIGMEARYNCCMAIHKAFMSSTRLTGDPALAGIASK
Subjt:  SRGSKMMPSGLFLAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQDIINIVTDASMETTKIGMEARYNCCMAIHKAFMSSTRLTGDPALAGIASK

A0A6J1FJQ7 uncharacterized protein LOC111446360 (e value: 1.2e-241, %identity: 88.93)
Query:  MEEFAVDDPTRLLEASADFANYPGVRTDASVKEFLDRFPLPVIINALQTKAESPGLETTLVACLDRIFKTKYGASHIPHYMPFVQVGLRADSQTVRSLAC
        MEEFAVDDPT+LLEA+ADFANYPGVRTDASVKEF  RFPLPV+INALQ KAE PGLE TLVACLDRIFKTKYGAS IPHYMPFVQVGL+ADSQ VRSLAC
Subjt:  MEEFAVDDPTRLLEASADFANYPGVRTDASVKEFLDRFPLPVIINALQTKAESPGLETTLVACLDRIFKTKYGASHIPHYMPFVQVGLRADSQTVRSLAC

Query:  KTVTHLLEESDENAVLAIQLIIDYDIYPLLLDCLLHGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKREATHLGTLASTCSSLGRVRVMALVVKLFSISS
        KTVT LLEE+D    LA QLI+DY+IYPLL++CLL+GNEQVANSSMDAIKKLAAFPKGMEIIFPTNK EATHLGT+ASTCSSLGRVRVMAL+VKLFS+SS
Subjt:  KTVTHLLEESDENAVLAIQLIIDYDIYPLLLDCLLHGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKREATHLGTLASTCSSLGRVRVMALVVKLFSISS

Query:  SVASAIYNSNLLSLLESEINNSNDTLVTLSVLELLYELVEIEHGTEFLPRTSFLQLLSSIISNSSGESILRSRAMVIGGRLLSKENIFLLVDESCVRILI
        SVASA+YNSNLL+LLESEI+NSNDTLVTLSVLELLYELVEIEHGT FLPRTSFLQLLSSIISN S ESILRSRAMVI GRLLSKENIF LVDESCVRILI
Subjt:  SVASAIYNSNLLSLLESEINNSNDTLVTLSVLELLYELVEIEHGTEFLPRTSFLQLLSSIISNSSGESILRSRAMVIGGRLLSKENIFLLVDESCVRILI

Query:  SAIDEILRSS-GQDVNVCESACEALGQIGSTVWGATLLQSSFPTCVKHVIDAAFVQHEHGKQLAAMHALGNICGETRSENDIMLNDNAEENLRDLIYQIA
        SAIDEIL SS GQDVNVCESA EALGQIGS+ WGATLL SS+PTCVK+VI+AAF +HEHGKQLAAMHALGNI GETRSENDIMLNDNAEENLRDLIYQ A
Subjt:  SAIDEILRSS-GQDVNVCESACEALGQIGSTVWGATLLQSSFPTCVKHVIDAAFVQHEHGKQLAAMHALGNICGETRSENDIMLNDNAEENLRDLIYQIA

Query:  SRGSKMMPSGLFLAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQDIINIVTDASMETTKIGMEARYNCCMAIHKAFMSSTRLTGDPALAGIASK
        SR  KM PSGL LAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQDIINIV+DAS ETTKIGMEARYNCC+AIHKAFMSSTRLTGDPALAGIASK
Subjt:  SRGSKMMPSGLFLAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQDIINIVTDASMETTKIGMEARYNCCMAIHKAFMSSTRLTGDPALAGIASK

A0A6J1IVZ7 uncharacterized protein LOC111480433 (e value: 3.4e-239, %identity: 87.93)
Query:  MEEFAVDDPTRLLEASADFANYPGVRTDASVKEFLDRFPLPVIINALQTKAESPGLETTLVACLDRIFKTKYGASHIPHYMPFVQVGLRADSQTVRSLAC
        MEEFAVDDPT+LLEA+ADFANYPGVRTD SVKEF  RFPLPV+INALQ KAE PGLE TLVACLDRIFKTKYGAS IPHYMPFVQVGL+ADSQ VRSLAC
Subjt:  MEEFAVDDPTRLLEASADFANYPGVRTDASVKEFLDRFPLPVIINALQTKAESPGLETTLVACLDRIFKTKYGASHIPHYMPFVQVGLRADSQTVRSLAC

Query:  KTVTHLLEESDENAVLAIQLIIDYDIYPLLLDCLLHGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKREATHLGTLASTCSSLGRVRVMALVVKLFSISS
        KTVT LLEE+D    LA QLI+DY+IYPLL++CLL+GNEQVANSSMDA+KKLAAFPKGMEIIFPTNK EATHLGT+ASTCSSLGRVRVMAL+VKLFS+SS
Subjt:  KTVTHLLEESDENAVLAIQLIIDYDIYPLLLDCLLHGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKREATHLGTLASTCSSLGRVRVMALVVKLFSISS

Query:  SVASAIYNSNLLSLLESEINNSNDTLVTLSVLELLYELVEIEHGTEFLPRTSFLQLLSSIISNSSGESILRSRAMVIGGRLLSKENIFLLVDESCVRILI
        SVASA+YNSNLL+LLESEI+NSNDTLVTLSVLELLYELVEIEHGT FLPRTSFLQLLSSIISN S ESILRSRAMVI GRLLSKENIF LVDESCVRILI
Subjt:  SVASAIYNSNLLSLLESEINNSNDTLVTLSVLELLYELVEIEHGTEFLPRTSFLQLLSSIISNSSGESILRSRAMVIGGRLLSKENIFLLVDESCVRILI

Query:  SAIDEILRSS-GQDVNVCESACEALGQIGSTVWGATLLQSSFPTCVKHVIDAAFVQHEHGKQLAAMHALGNICGETRSENDIMLNDNAEENLRDLIYQIA
        S+IDEIL SS GQDVNVCESA EALGQIGS+ WGATLL SS+PTCVK+VI+AAF +HEHGKQLAAMHALGNI GETRSENDIMLNDNAEENL DLIYQ A
Subjt:  SAIDEILRSS-GQDVNVCESACEALGQIGSTVWGATLLQSSFPTCVKHVIDAAFVQHEHGKQLAAMHALGNICGETRSENDIMLNDNAEENLRDLIYQIA

Query:  SRGSKMMPSGLFLAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQDIINIVTDASMETTKIGMEARYNCCMAIHKAFMSSTRLTGDPALAGIASK
        SR  K+ PSGL LAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQDIINIV+DAS ETTKIGMEARYNCC+AIHKAFMSSTRLTGDPALAGIASK
Subjt:  SRGSKMMPSGLFLAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQDIINIVTDASMETTKIGMEARYNCCMAIHKAFMSSTRLTGDPALAGIASK

SwissProt top hits (columns: e value, %identity, alignment)
B0C2C8 30S ribosomal protein S20 (e value: 3.1e-08, %identity: 48.31)
Query:  SAAKRARQAEKRRIYNKARKSEIKTRMKKVLEALDDLKKKSEAQSEEVLPIEKLIAEAYSVIDKGVKVGTLHRNTAARRKSRLARRKKA
        SA KR   AE+ R+ NKA KS IKT  K+   A+DD   KS    +++  I+  ++  YS IDK VKVG  HRNT AR+K+ LAR  KA
Subjt:  SAAKRARQAEKRRIYNKARKSEIKTRMKKVLEALDDLKKKSEAQSEEVLPIEKLIAEAYSVIDKGVKVGTLHRNTAARRKSRLARRKKA

P62661 30S ribosomal protein S20 (e value: 1.1e-08, %identity: 46.88)
Query:  VCEAAPKKADSAAKRARQAEKRRIYNKARKSEIKTRMKKVLEALDDLKKKSEAQSEEVLPIEKLIAEAYSVIDKGVKVGTLHRNTAARRKSRLARR
        + +  PK+  SA KR RQ+ KRR+ NKA+KS IKT  KK ++         E ++EE L   K++ +A S+IDK  K  TLH+N AARRKSRL R+
Subjt:  VCEAAPKKADSAAKRARQAEKRRIYNKARKSEIKTRMKKVLEALDDLKKKSEAQSEEVLPIEKLIAEAYSVIDKGVKVGTLHRNTAARRKSRLARR

P80380 30S ribosomal protein S20 (e value: 8.2e-09, %identity: 46.88)
Query:  VCEAAPKKADSAAKRARQAEKRRIYNKARKSEIKTRMKKVLEALDDLKKKSEAQSEEVLPIEKLIAEAYSVIDKGVKVGTLHRNTAARRKSRLARR
        + +  PK+  SA KR RQ+ KRR+ NKA+KS IKT  KK ++         E ++EE L   K++ +A S+IDK  K  TLH+N AARRKSRL R+
Subjt:  VCEAAPKKADSAAKRARQAEKRRIYNKARKSEIKTRMKKVLEALDDLKKKSEAQSEEVLPIEKLIAEAYSVIDKGVKVGTLHRNTAARRKSRLARR

P82130 30S ribosomal protein S20, chloroplastic (e value: 5.1e-43, %identity: 62.5)
Query:  MAAAPICCFVLSSKFRNLSLNASSSCLPPCSSSSTLRSLSFSSNLSACAFSNGDYLWGCLSVSKAQRPLRYSVVCEAAP-KKADSAAKRARQAEKRRIYN
        MA     C  +SSK  NLS   SS+  P    +S+L+ L+FS+NLS   FS      GC S+   QR   +SVVCE A  KKADSAAKR RQAE RR+ N
Subjt:  MAAAPICCFVLSSKFRNLSLNASSSCLPPCSSSSTLRSLSFSSNLSACAFSNGDYLWGCLSVSKAQRPLRYSVVCEAAP-KKADSAAKRARQAEKRRIYN

Query:  KARKSEIKTRMKKVLEALDDLKKKSEAQSEEVLPIEKLIAEAYSVIDKGVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPA
        KARKSE+KTRM+KV EALD LKKKS A +EE++PI+ LIAEAYS IDK V  GTLHRNTAARRKSRLAR KK VEIHHGWYTP+
Subjt:  KARKSEIKTRMKKVLEALDDLKKKSEAQSEEVLPIEKLIAEAYSVIDKGVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPA

Q9ASV6 30S ribosomal protein S20, chloroplastic (e value: 1.4e-45, %identity: 61.46)
Query:  CFVLSSKFRNLSLNASSSCLPPCSSSSTLR--------SLSFSSNLSAC-AFSNGDYLWGCLSVSKAQRPLRYSVVCEAA--PKKADSAAKRARQAEKRR
        C  L S+F+ LSL    SC  P SS S  R        SLSFS ++S C AFS G+ LW        Q+P+R  +VCEAA   KKADSAAKRARQAEKRR
Subjt:  CFVLSSKFRNLSLNASSSCLPPCSSSSTLR--------SLSFSSNLSAC-AFSNGDYLWGCLSVSKAQRPLRYSVVCEAA--PKKADSAAKRARQAEKRR

Query:  IYNKARKSEIKTRMKKVLEALDDLKKKSEAQSEEVLPIEKLIAEAYSVIDKGVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA
        +YNK++KSE +TRMKKVLEAL+ LKKK++AQ++E++ +EKLI EAYS IDK VKV  LH+NT ARRKSRLARRKKAVEIHHGWY P + AAA
Subjt:  IYNKARKSEIKTRMKKVLEALDDLKKKSEAQSEEVLPIEKLIAEAYSVIDKGVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA

Arabidopsis top hits (columns: e value, %identity, alignment)
AT3G15180.1 ARM repeat superfamily protein (e value: 1.5e-151, %identity: 57.86)
Query:  VDDPTRLLEASADFANYPGVRTDASVKEFLDRFPLPVIINALQTKAESPGLETTLVACLDRIFKTKYGASHIPHYMPFVQVGLRADSQTVRSLACKTVTH
        ++D  +L +A+ +FA+YPG + + SVKEFLDRFPLPVI NALQT  + PG E TLV CL+R+FKTKYGAS IP YMP +QVGL+ADS  V+SLACKTV  
Subjt:  VDDPTRLLEASADFANYPGVRTDASVKEFLDRFPLPVIINALQTKAESPGLETTLVACLDRIFKTKYGASHIPHYMPFVQVGLRADSQTVRSLACKTVTH

Query:  LLEESDENAVLAIQLIIDYDIYPLLLDCLLHGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKREATHLGTLASTCSSLGRVRVMALVVKLFSISSSVASA
        LLE+ D N V ++QL+++  IYPLLLD +++ +++VAN++ + IK LA FP  M +IFP+   + THL  LA+ CSSL RVRV++L+VKLFSIS  VAS 
Subjt:  LLEESDENAVLAIQLIIDYDIYPLLLDCLLHGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKREATHLGTLASTCSSLGRVRVMALVVKLFSISSSVASA

Query:  IYNSNLLSLLESEINNSNDTLVTLSVLELLYELVEIEHGTEFLPRTSFLQLLSSIISNSSGESILRSRAMVIGGRLLSKENIFLLVDESCVRILISAIDE
        +  S LL LLE+E+  + DTLV L+VLEL YEL+E+EH +EF+P+TS +QLL SIIS +S     + RAM+I GRLLSKENI+ +V+E+ V+ LISAID 
Subjt:  IYNSNLLSLLESEINNSNDTLVTLSVLELLYELVEIEHGTEFLPRTSFLQLLSSIISNSSGESILRSRAMVIGGRLLSKENIFLLVDESCVRILISAIDE

Query:  ILRS-SGQDVNVCESACEALGQIGSTVWGATLLQSSFPTCVKHVIDAAFVQHEHGKQLAAMHALGNICGETRSENDIMLNDNAEENLRDLIYQIASRGSK
         L S    D +  E+A +ALGQ+GST  GA L+ S+ P   +HV+ +AF ++ HGKQLAA+HAL NI GETR +++ +++  AEE+LR LIY  A++ +K
Subjt:  ILRS-SGQDVNVCESACEALGQIGSTVWGATLLQSSFPTCVKHVIDAAFVQHEHGKQLAAMHALGNICGETRSENDIMLNDNAEENLRDLIYQIASRGSK

Query:  MMPSGLFLAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQDIINIVTDASMETTKIGMEARYNCCMAIHKAFMSS
        + PSGLFL+VLQQ SEIRLA YR +T LVARPWCL+EI +K++IINIVTDA+ ET KI MEARYNCC AIH+AF+ S
Subjt:  MMPSGLFLAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQDIINIVTDASMETTKIGMEARYNCCMAIHKAFMSS

AT3G15180.2 ARM repeat superfamily protein (e value: 1.0e-147, %identity: 54.42)
Query:  VDDPTRLLEASADFANYPGVRTDASVKEFLDRFPLPVIINALQTKAESPGLETTLVACLDRIFKTKYGASHIPHYMPFVQVGLRADSQTVRSLACKTVTH
        ++D  +L +A+ +FA+YPG + + SVKEFLDRFPLPVI NALQT  + PG E TLV CL+R+FKTKYGAS IP YMP +QVGL+ADS  V+SLACKTV  
Subjt:  VDDPTRLLEASADFANYPGVRTDASVKEFLDRFPLPVIINALQTKAESPGLETTLVACLDRIFKTKYGASHIPHYMPFVQVGLRADSQTVRSLACKTVTH

Query:  LLEESDENAVLAIQLIIDYDIYPLLLDCLLHGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKREATHLGTLASTCSSLGRVRVMALVVKLFSISSSVASA
        LLE+ D N V ++QL+++  IYPLLLD +++ +++VAN++ + IK LA FP  M +IFP+   + THL  LA+ CSSL RVRV++L+VKLFSIS  VAS 
Subjt:  LLEESDENAVLAIQLIIDYDIYPLLLDCLLHGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKREATHLGTLASTCSSLGRVRVMALVVKLFSISSSVASA

Query:  IYNSNLLSLLESEINNSNDTLVTLSVLELLYELVEIEHGTEFLPRTSFLQLLSSIISNSSGESILRSRAMVIGGRLLSKENIFLLVDES-----------
        +  S LL LLE+E+  + DTLV L+VLEL YEL+E+EH +EF+P+TS +QLL SIIS +S     + RAM+I GRLLSKENI+ +V+E+           
Subjt:  IYNSNLLSLLESEINNSNDTLVTLSVLELLYELVEIEHGTEFLPRTSFLQLLSSIISNSSGESILRSRAMVIGGRLLSKENIFLLVDES-----------

Query:  ---------------------CVRILISAIDEILRS-SGQDVNVCESACEALGQIGSTVWGATLLQSSFPTCVKHVIDAAFVQHEHGKQLAAMHALGNIC
                             CV+ LISAID  L S    D +  E+A +ALGQ+GST  GA L+ S+ P   +HV+ +AF ++ HGKQLAA+HAL NI 
Subjt:  ---------------------CVRILISAIDEILRS-SGQDVNVCESACEALGQIGSTVWGATLLQSSFPTCVKHVIDAAFVQHEHGKQLAAMHALGNIC

Query:  GETRSENDIMLNDNAEENLRDLIYQIASRGSKMMPSGLFLAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQDIINIVTDASMETTKIGMEARYNCCM
        GETR +++ +++  AEE+LR LIY  A++ +K+ PSGLFL+VLQQ SEIRLA YR +T LVARPWCL+EI +K++IINIVTDA+ ET KI MEARYNCC 
Subjt:  GETRSENDIMLNDNAEENLRDLIYQIASRGSKMMPSGLFLAVLQQDSEIRLASYRMITGLVARPWCLMEICSKQDIINIVTDASMETTKIGMEARYNCCM

Query:  AIHKAFMSS
        AIH+AF+ S
Subjt:  AIHKAFMSS

AT3G15190.1 chloroplast 30S ribosomal protein S20, putative (e value: 1.0e-46, %identity: 61.46)
Query:  CFVLSSKFRNLSLNASSSCLPPCSSSSTLR--------SLSFSSNLSAC-AFSNGDYLWGCLSVSKAQRPLRYSVVCEAA--PKKADSAAKRARQAEKRR
        C  L S+F+ LSL    SC  P SS S  R        SLSFS ++S C AFS G+ LW        Q+P+R  +VCEAA   KKADSAAKRARQAEKRR
Subjt:  CFVLSSKFRNLSLNASSSCLPPCSSSSTLR--------SLSFSSNLSAC-AFSNGDYLWGCLSVSKAQRPLRYSVVCEAA--PKKADSAAKRARQAEKRR

Query:  IYNKARKSEIKTRMKKVLEALDDLKKKSEAQSEEVLPIEKLIAEAYSVIDKGVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA
        +YNK++KSE +TRMKKVLEAL+ LKKK++AQ++E++ +EKLI EAYS IDK VKV  LH+NT ARRKSRLARRKKAVEIHHGWY P + AAA
Subjt:  IYNKARKSEIKTRMKKVLEALDDLKKKSEAQSEEVLPIEKLIAEAYSVIDKGVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA
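
In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch of how a %identity figure relates to that line, the helper below counts letters in the middle line and divides by the aligned length. The function name percent_identity is hypothetical, and the database's reported values are computed over the full alignment by its own pipeline, so this is illustrative only.

```python
def percent_identity(midline, aligned_length=None):
    """Percent of aligned columns whose midline character is a letter.

    Hypothetical helper. Pass aligned_length (for example len(query_row))
    when trailing spaces may have been stripped from the midline.
    """
    if aligned_length is None:
        aligned_length = len(midline)
    if aligned_length == 0:
        raise ValueError("alignment has zero length")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length

# First 16 columns of the first GenBank alignment block in this record.
query_row = "MEEFAVDDPTRLLEAS"
mid_row = "MEEFAVDDPT+LLEA+"
identity = percent_identity(mid_row, len(query_row))
print(f"{identity:.2f}")
```

For this 16-column excerpt, 14 columns are identical and 2 are conservative substitutions.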


Sequences
CDS sequence
ATGGAGGAATTTGCTGTGGATGATCCGACTCGGCTACTCGAAGCAAGTGCAGACTTCGCAAACTATCCTGGTGTTCGGACTGATGCTTCAGTGAAGGAATTTCTCGACCG
CTTCCCTCTTCCCGTCATCATCAATGCTTTACAAACAAAAGCGGAATCTCCTGGTCTGGAAACCACCTTGGTTGCTTGTCTCGACAGGATATTCAAAACCAAGTATGGTG
CTTCTCACATACCACATTATATGCCTTTTGTACAGGTTGGACTACGAGCAGATTCTCAAACAGTTAGAAGTTTAGCTTGTAAAACGGTCACTCACCTTTTAGAGGAGTCC
GATGAGAATGCTGTTTTGGCCATACAACTTATTATTGACTATGATATTTATCCACTTTTGCTCGATTGCCTTCTGCATGGTAATGAACAAGTTGCTAACTCATCAATGGA
TGCAATAAAGAAGTTGGCTGCATTTCCAAAGGGAATGGAAATTATCTTCCCAACAAATAAAAGAGAAGCAACACACCTAGGAACTTTAGCTTCAACATGTTCATCTCTGG
GGAGAGTTCGAGTTATGGCTCTGGTAGTGAAACTATTTTCCATTTCTAGCTCCGTGGCATCGGCAATATACAATTCAAATTTACTAAGTTTGCTGGAAAGTGAAATCAAC
AACTCAAATGACACACTTGTAACTTTAAGTGTGTTGGAACTCTTGTATGAGTTAGTGGAGATTGAACATGGTACAGAGTTCTTGCCAAGGACCAGCTTTCTCCAACTACT
AAGTTCTATAATCAGCAACTCATCAGGGGAGTCTATTTTAAGATCAAGAGCAATGGTCATTGGTGGAAGACTTTTGTCTAAAGAGAATATTTTCTTGCTTGTTGATGAAT
CTTGTGTAAGGATTTTAATATCTGCTATAGATGAAATTCTTCGGTCATCAGGTCAAGATGTAAATGTATGTGAATCTGCATGTGAAGCACTGGGTCAAATAGGGTCAACT
GTATGGGGAGCTACTTTACTGCAGTCAAGTTTTCCAACTTGTGTGAAGCATGTAATTGATGCAGCATTTGTTCAGCATGAACATGGTAAACAGCTGGCAGCCATGCATGC
TCTTGGCAATATCTGTGGAGAAACTCGATCTGAGAATGATATTATGCTCAATGATAATGCAGAAGAAAATTTGCGGGACTTAATATATCAAATTGCATCCAGAGGTTCAA
AAATGATGCCATCAGGCCTTTTTCTAGCTGTCCTTCAGCAGGACTCTGAGATTCGCTTGGCGAGTTATAGAATGATAACTGGATTGGTTGCTCGACCTTGGTGCCTTATG
GAAATCTGCTCAAAACAAGACATAATAAATATAGTGACTGATGCAAGCATGGAGACTACGAAAATAGGAATGGAAGCTAGATATAATTGTTGTATGGCTATCCACAAGGC
CTTCATGTCTTCAACTAGGCTTACTGGCGATCCTGCTCTTGCTGGAATAGCTTCGAAGCGGATAGCGCCAGCAAGGTGCGAGAACCACCATTCTTCAAGCTGTGGAGCTT
TCAGAAACCCCCAGAGAAAAATGGCGGCAGCTCCAATTTGTTGCTTCGTTCTTTCTTCTAAATTCAGAAATCTTTCTCTTAATGCTTCTTCCTCATGTTTGCCTCCTTGC
TCATCTTCCTCGACCCTCAGATCTCTCAGTTTCTCCTCCAACCTTTCAGCCTGTGCCTTCTCCAATGGTGATTACTTATGGGGGTGCCTGTCAGTGAGTAAAGCTCAGAG
GCCACTTCGTTACTCTGTTGTCTGCGAGGCAGCTCCTAAGAAGGCCGATTCTGCTGCAAAGAGAGCTCGGCAGGCTGAGAAAAGACGGATTTACAACAAAGCCCGGAAGT
CTGAAATCAAAACCAGGATGAAGAAGGTTTTGGAAGCTCTAGATGATCTTAAGAAGAAATCTGAAGCACAATCAGAGGAAGTGCTTCCAATTGAGAAGCTCATTGCAGAG
GCGTACTCAGTGATTGACAAAGGCGTGAAAGTGGGAACATTGCACCGAAACACTGCAGCACGTCGAAAATCTCGGCTTGCCAGAAGAAAGAAAGCCGTCGAAATCCATCA
TGGCTGGTACACCCCTGCTTCACCGGCAGCTGCCTGA
mRNA sequence
ATGGAGGAATTTGCTGTGGATGATCCGACTCGGCTACTCGAAGCAAGTGCAGACTTCGCAAACTATCCTGGTGTTCGGACTGATGCTTCAGTGAAGGAATTTCTCGACCG
CTTCCCTCTTCCCGTCATCATCAATGCTTTACAAACAAAAGCGGAATCTCCTGGTCTGGAAACCACCTTGGTTGCTTGTCTCGACAGGATATTCAAAACCAAGTATGGTG
CTTCTCACATACCACATTATATGCCTTTTGTACAGGTTGGACTACGAGCAGATTCTCAAACAGTTAGAAGTTTAGCTTGTAAAACGGTCACTCACCTTTTAGAGGAGTCC
GATGAGAATGCTGTTTTGGCCATACAACTTATTATTGACTATGATATTTATCCACTTTTGCTCGATTGCCTTCTGCATGGTAATGAACAAGTTGCTAACTCATCAATGGA
TGCAATAAAGAAGTTGGCTGCATTTCCAAAGGGAATGGAAATTATCTTCCCAACAAATAAAAGAGAAGCAACACACCTAGGAACTTTAGCTTCAACATGTTCATCTCTGG
GGAGAGTTCGAGTTATGGCTCTGGTAGTGAAACTATTTTCCATTTCTAGCTCCGTGGCATCGGCAATATACAATTCAAATTTACTAAGTTTGCTGGAAAGTGAAATCAAC
AACTCAAATGACACACTTGTAACTTTAAGTGTGTTGGAACTCTTGTATGAGTTAGTGGAGATTGAACATGGTACAGAGTTCTTGCCAAGGACCAGCTTTCTCCAACTACT
AAGTTCTATAATCAGCAACTCATCAGGGGAGTCTATTTTAAGATCAAGAGCAATGGTCATTGGTGGAAGACTTTTGTCTAAAGAGAATATTTTCTTGCTTGTTGATGAAT
CTTGTGTAAGGATTTTAATATCTGCTATAGATGAAATTCTTCGGTCATCAGGTCAAGATGTAAATGTATGTGAATCTGCATGTGAAGCACTGGGTCAAATAGGGTCAACT
GTATGGGGAGCTACTTTACTGCAGTCAAGTTTTCCAACTTGTGTGAAGCATGTAATTGATGCAGCATTTGTTCAGCATGAACATGGTAAACAGCTGGCAGCCATGCATGC
TCTTGGCAATATCTGTGGAGAAACTCGATCTGAGAATGATATTATGCTCAATGATAATGCAGAAGAAAATTTGCGGGACTTAATATATCAAATTGCATCCAGAGGTTCAA
AAATGATGCCATCAGGCCTTTTTCTAGCTGTCCTTCAGCAGGACTCTGAGATTCGCTTGGCGAGTTATAGAATGATAACTGGATTGGTTGCTCGACCTTGGTGCCTTATG
GAAATCTGCTCAAAACAAGACATAATAAATATAGTGACTGATGCAAGCATGGAGACTACGAAAATAGGAATGGAAGCTAGATATAATTGTTGTATGGCTATCCACAAGGC
CTTCATGTCTTCAACTAGGCTTACTGGCGATCCTGCTCTTGCTGGAATAGCTTCGAAGCGGATAGCGCCAGCAAGGTGCGAGAACCACCATTCTTCAAGCTGTGGAGCTT
TCAGAAACCCCCAGAGAAAAATGGCGGCAGCTCCAATTTGTTGCTTCGTTCTTTCTTCTAAATTCAGAAATCTTTCTCTTAATGCTTCTTCCTCATGTTTGCCTCCTTGC
TCATCTTCCTCGACCCTCAGATCTCTCAGTTTCTCCTCCAACCTTTCAGCCTGTGCCTTCTCCAATGGTGATTACTTATGGGGGTGCCTGTCAGTGAGTAAAGCTCAGAG
GCCACTTCGTTACTCTGTTGTCTGCGAGGCAGCTCCTAAGAAGGCCGATTCTGCTGCAAAGAGAGCTCGGCAGGCTGAGAAAAGACGGATTTACAACAAAGCCCGGAAGT
CTGAAATCAAAACCAGGATGAAGAAGGTTTTGGAAGCTCTAGATGATCTTAAGAAGAAATCTGAAGCACAATCAGAGGAAGTGCTTCCAATTGAGAAGCTCATTGCAGAG
GCGTACTCAGTGATTGACAAAGGCGTGAAAGTGGGAACATTGCACCGAAACACTGCAGCACGTCGAAAATCTCGGCTTGCCAGAAGAAAGAAAGCCGTCGAAATCCATCA
TGGCTGGTACACCCCTGCTTCACCGGCAGCTGCCTGA
Protein sequence
MEEFAVDDPTRLLEASADFANYPGVRTDASVKEFLDRFPLPVIINALQTKAESPGLETTLVACLDRIFKTKYGASHIPHYMPFVQVGLRADSQTVRSLACKTVTHLLEES
DENAVLAIQLIIDYDIYPLLLDCLLHGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKREATHLGTLASTCSSLGRVRVMALVVKLFSISSSVASAIYNSNLLSLLESEIN
NSNDTLVTLSVLELLYELVEIEHGTEFLPRTSFLQLLSSIISNSSGESILRSRAMVIGGRLLSKENIFLLVDESCVRILISAIDEILRSSGQDVNVCESACEALGQIGST
VWGATLLQSSFPTCVKHVIDAAFVQHEHGKQLAAMHALGNICGETRSENDIMLNDNAEENLRDLIYQIASRGSKMMPSGLFLAVLQQDSEIRLASYRMITGLVARPWCLM
EICSKQDIINIVTDASMETTKIGMEARYNCCMAIHKAFMSSTRLTGDPALAGIASKRIAPARCENHHSSSCGAFRNPQRKMAAAPICCFVLSSKFRNLSLNASSSCLPPC
SSSSTLRSLSFSSNLSACAFSNGDYLWGCLSVSKAQRPLRYSVVCEAAPKKADSAAKRARQAEKRRIYNKARKSEIKTRMKKVLEALDDLKKKSEAQSEEVLPIEKLIAE
AYSVIDKGVKVGTLHRNTAARRKSRLARRKKAVEIHHGWYTPASPAAA
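
As a consistency check between the CDS and protein sequences listed above, a coding sequence can be translated codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1). The names CODON_TABLE and translate are hypothetical, and only short excerpts of the CDS are used here rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 15 nt of the CDS sequence in this record.
cds_start = "ATGGAGGAATTTGCTGTGGATGATCCGACT"
cds_end = "CCGGCAGCTGCCTGA"
print(translate(cds_start))
print(translate(cds_end))
```

The two excerpts translate to MEEFAVDDPT and PAAA, which match the first and last residues of the protein sequence above.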