; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr029717 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr029717
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionprotein MID1-COMPLEMENTING ACTIVITY 1
Genome locationtig00153449:2122627..2126707
RNA-Seq ExpressionSgr029717
SyntenySgr029717
Gene Ontology termsGO:0007166 - cell surface receptor signaling pathway (biological process)
InterPro domainsIPR006461 - PLAC8 motif-containing protein
IPR036537 - Adaptor protein Cbl, N-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004146155.1 protein MID1-COMPLEMENTING ACTIVITY 1 isoform X2 [Cucumis sativus]7.5e-15679.53Show/hide
Query:  MSSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLL
        MSSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLE+ALRRSYILVNSCQDRSYLYLL
Subjt:  MSSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLL

Query:  AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEAMVLKKSLSCSYPHLPFN
        AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVR             L       R+         IH+VILNPEPS DEAMVLKKSLSCSYP+LPFN
Subjt:  AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEAMVLKKSLSCSYPHLPFN

Query:  KALQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATSRHEKIHLKKPIK------------------------------KSSSISSGHDLLSTR
        KALQKENEKLQLELQRSQANLNVGECEVIQRLI VTE+VAATS  EK   +K  K                              KSSS+SSGHDLLSTR
Subjt:  KALQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATSRHEKIHLKKPIK------------------------------KSSSISSGHDLLSTR

Query:  GSDRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT
        GSDRYGEWHTDLLDCCSEPSLCLKT FYPCGT SRIATVAT+RHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT
Subjt:  GSDRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT

XP_008448524.1 PREDICTED: protein MID1-COMPLEMENTING ACTIVITY 1 [Cucumis melo]1.3e-15579.27Show/hide
Query:  MSSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLL
        MSSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLE+ALRRSYILVNSCQDRSYLYLL
Subjt:  MSSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLL

Query:  AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEAMVLKKSLSCSYPHLPFN
        AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVR             L       R+         IH+VILNPEPS DEAMVLKKSLSCSYP+LPFN
Subjt:  AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEAMVLKKSLSCSYPHLPFN

Query:  KALQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATSRHEKIHLKKPIK------------------------------KSSSISSGHDLLSTR
        KALQKENEKLQLELQRSQANLNVGECEVIQRL+ VTE+VAATS  EK   +K  K                              KSSS+SSGHDLLSTR
Subjt:  KALQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATSRHEKIHLKKPIK------------------------------KSSSISSGHDLLSTR

Query:  GSDRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT
        GSDRYGEWHTDLLDCCSEPSLCLKT FYPCGT SRIATVAT+RHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT
Subjt:  GSDRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT

XP_022145341.1 protein MID1-COMPLEMENTING ACTIVITY 1-like isoform X1 [Momordica charantia]1.5e-15680.05Show/hide
Query:  MSSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLL
        MSSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLE+ALRRSYILVNSCQDRSYLYLL
Subjt:  MSSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLL

Query:  AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEAMVLKKSLSCSYPHLPFN
        AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVR             L       R+         IH+VILNPEPS DEAM LKKSLSCSYP+LPFN
Subjt:  AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEAMVLKKSLSCSYPHLPFN

Query:  KALQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATSRHEKIHLKKPIK------------------------------KSSSISSGHDLLSTR
        KALQKENEKLQLELQRSQA+LNVGECEVIQRLI VTEAVAA+S HEK    K  K                              KSSSISSGHDLLSTR
Subjt:  KALQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATSRHEKIHLKKPIK------------------------------KSSSISSGHDLLSTR

Query:  GSDRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT
        GSDRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNR TSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT
Subjt:  GSDRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT

XP_023539882.1 protein MID1-COMPLEMENTING ACTIVITY 1-like [Cucurbita pepo subsp. pepo]2.4e-15477.69Show/hide
Query:  MSSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLL
        MSSW+TLGE+ANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALR+SYILVNSCQDRSYLYLL
Subjt:  MSSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLL

Query:  AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEAMVLKKSLSCSYPHLPFN
        AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVR             L       R+         IH+VILNPEPSTDEAMVLKKSLSCSYP+LPFN
Subjt:  AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEAMVLKKSLSCSYPHLPFN

Query:  KALQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATSRHEK-----IH-------------------------LKKPIKKSSSISSGHDLLSTR
        KALQKENEKLQLELQRSQANLNVGECEVIQRLI VTE+VAATSRHEK     +H                          +   +K+SS+SSGH LLS R
Subjt:  KALQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATSRHEK-----IH-------------------------LKKPIKKSSSISSGHDLLSTR

Query:  GSDRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT
        GSD YGEWHTDLLDCC++PSLCLKT FYPCGTFSRIATVAT+RHTSP EACNDL+AYTLILSCCCYTCCVRRKLRKMLNIT
Subjt:  GSDRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT

XP_031738428.1 protein MID1-COMPLEMENTING ACTIVITY 1 isoform X1 [Cucumis sativus]1.2e-16169.81Show/hide
Query:  MSSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLL
        MSSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLE+ALRRSYILVNSCQDRSYLYLL
Subjt:  MSSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLL

Query:  AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEAMVLKKSLSCSYPHLPFN
        AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVR             L       R+         IH+VILNPEPS DEAMVLKKSLSCSYP+LPFN
Subjt:  AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEAMVLKKSLSCSYPHLPFN

Query:  KALQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATSRHEKIHLKKPIK------------------------------KSSSISSGHDLLSTR
        KALQKENEKLQLELQRSQANLNVGECEVIQRLI VTE+VAATS  EK   +K  K                              KSSS+SSGHDLLSTR
Subjt:  KALQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATSRHEKIHLKKPIK------------------------------KSSSISSGHDLLSTR

Query:  GSDRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNITTHNLGSAYYITGDPTLQIC
        GSDRYGEWHTDLLDCCSEPSLCLKT FYPCGT SRIATVAT+RHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT                   
Subjt:  GSDRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNITTHNLGSAYYITGDPTLQIC

Query:  AIILDQLLPMGLKSVFGREVLLMTFSPILCAAAVPWSKNGEKSKYVGFMVLKRPKQALHPPNTWNPK
                  G    F     L       CA    W +   +  Y    VLKR KQALHPPNTW PK
Subjt:  AIILDQLLPMGLKSVFGREVLLMTFSPILCAAAVPWSKNGEKSKYVGFMVLKRPKQALHPPNTWNPK

TrEMBL top hitse value%identityAlignment
A0A0A0L3N0 Uncharacterized protein3.6e-15679.53Show/hide
Query:  MSSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLL
        MSSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLE+ALRRSYILVNSCQDRSYLYLL
Subjt:  MSSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLL

Query:  AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEAMVLKKSLSCSYPHLPFN
        AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVR             L       R+         IH+VILNPEPS DEAMVLKKSLSCSYP+LPFN
Subjt:  AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEAMVLKKSLSCSYPHLPFN

Query:  KALQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATSRHEKIHLKKPIK------------------------------KSSSISSGHDLLSTR
        KALQKENEKLQLELQRSQANLNVGECEVIQRLI VTE+VAATS  EK   +K  K                              KSSS+SSGHDLLSTR
Subjt:  KALQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATSRHEKIHLKKPIK------------------------------KSSSISSGHDLLSTR

Query:  GSDRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT
        GSDRYGEWHTDLLDCCSEPSLCLKT FYPCGT SRIATVAT+RHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT
Subjt:  GSDRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT

A0A1S3BKI0 protein MID1-COMPLEMENTING ACTIVITY 16.2e-15679.27Show/hide
Query:  MSSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLL
        MSSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLE+ALRRSYILVNSCQDRSYLYLL
Subjt:  MSSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLL

Query:  AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEAMVLKKSLSCSYPHLPFN
        AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVR             L       R+         IH+VILNPEPS DEAMVLKKSLSCSYP+LPFN
Subjt:  AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEAMVLKKSLSCSYPHLPFN

Query:  KALQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATSRHEKIHLKKPIK------------------------------KSSSISSGHDLLSTR
        KALQKENEKLQLELQRSQANLNVGECEVIQRL+ VTE+VAATS  EK   +K  K                              KSSS+SSGHDLLSTR
Subjt:  KALQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATSRHEKIHLKKPIK------------------------------KSSSISSGHDLLSTR

Query:  GSDRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT
        GSDRYGEWHTDLLDCCSEPSLCLKT FYPCGT SRIATVAT+RHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT
Subjt:  GSDRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT

A0A6J1CW21 protein MID1-COMPLEMENTING ACTIVITY 1-like isoform X17.3e-15780.05Show/hide
Query:  MSSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLL
        MSSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLE+ALRRSYILVNSCQDRSYLYLL
Subjt:  MSSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLL

Query:  AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEAMVLKKSLSCSYPHLPFN
        AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVR             L       R+         IH+VILNPEPS DEAM LKKSLSCSYP+LPFN
Subjt:  AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEAMVLKKSLSCSYPHLPFN

Query:  KALQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATSRHEKIHLKKPIK------------------------------KSSSISSGHDLLSTR
        KALQKENEKLQLELQRSQA+LNVGECEVIQRLI VTEAVAA+S HEK    K  K                              KSSSISSGHDLLSTR
Subjt:  KALQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATSRHEKIHLKKPIK------------------------------KSSSISSGHDLLSTR

Query:  GSDRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT
        GSDRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNR TSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT
Subjt:  GSDRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT

A0A6J1FSL9 protein MID1-COMPLEMENTING ACTIVITY 1-like7.5e-15477.43Show/hide
Query:  MSSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLL
        MSSW+TLGE+ANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALR+SYILVNSCQDRSYLYLL
Subjt:  MSSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLL

Query:  AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEAMVLKKSLSCSYPHLPFN
        AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVR             L       R+         IH+VILNPEPSTDEAMVLKKSLSCSYP+LPFN
Subjt:  AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEAMVLKKSLSCSYPHLPFN

Query:  KALQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATSRHEK------------------------------IHLKKPIKKSSSISSGHDLLSTR
        KALQKENEKLQLELQRSQANLNVGECEVIQRLI VTE+VAATSRHEK                              I  +   +K+SS+SSGH LLS R
Subjt:  KALQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATSRHEK------------------------------IHLKKPIKKSSSISSGHDLLSTR

Query:  GSDRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT
        GSD YGEWHTDLLDCC++PSLCLKT FYPCGTFSRIATVAT+RHTSP EA NDL+AYTLILSCCCYTCCVRRKLRKMLNIT
Subjt:  GSDRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT

A0A6J1I3K3 protein MID1-COMPLEMENTING ACTIVITY 1-like1.5e-15476.62Show/hide
Query:  MSSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLL
        MSSW+TLGE+ANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALR+SYILVNSCQDRSYLYLL
Subjt:  MSSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLL

Query:  AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRV-FDSVSFPSVIIYLFIYTTGSRKMHFYSI---YATIHDVILNPEPSTDEAMVLKKSLSCSYPH
        AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVR   + +                +  H Y++      IH+VILNPEPSTDEAMVLKKSLSCSYP+
Subjt:  AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRV-FDSVSFPSVIIYLFIYTTGSRKMHFYSI---YATIHDVILNPEPSTDEAMVLKKSLSCSYPH

Query:  LPFNKALQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATSRHEK------------------------------IHLKKPIKKSSSISSGHDL
        LPFNKALQKENEKLQLELQRSQANLNVGECEVIQRLI VTE+VAATS HEK                              I      +K+SS+SSGH L
Subjt:  LPFNKALQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATSRHEK------------------------------IHLKKPIKKSSSISSGHDL

Query:  LSTRGSDRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT
        LS RGSD YGEWHTDLLDCC++PSLCLKT FYPCGTFSRIATVAT+RHTSP EACNDL+AYTLILSCCCYTCCVRRKLRKMLNIT
Subjt:  LSTRGSDRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT

SwissProt top hitse value%identityAlignment
B6SJQ0 Cell number regulator 131.6e-12458.93Show/hide
Query:  MSSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLL
        M+SW+ LGE++N+AQLTG+DAV+LIS+IV++ASTAR+HK+NCR+FAQHLKLIG LLEQL++SEL++YPETREPLEQLEDALRR Y+LVNSCQDRSYLYLL
Subjt:  MSSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLL

Query:  AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEAMVLKKSLSCSYPHLPFN
        AMGWNIVYQFRKAQ+EID YLRLVPLITLVDNAR+R  D + +       + +    +K         + D +LNP+P T+  +VLKK+LSCSYP+LPFN
Subjt:  AMGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEAMVLKKSLSCSYPHLPFN

Query:  KALQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATSRHEKIHLKKPIKK------------------------------------------SS
        +AL+KE+EKLQ+ELQRSQ+N+++G CEVIQ L+GVT+ V +T   ++ ++K P KK                                          SS
Subjt:  KALQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATSRHEKIHLKKPIKK------------------------------------------SS

Query:  SISSGHDLLSTRGSDRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNI
         +  GHDL+S+RGS    EWH DLL CCS+P+LCLKTLF+PCGTFSRIA++A +R  S  EACND+MAY+LILSCCCYTCCVRRKLR+ L+I
Subjt:  SISSGHDLLSTRGSDRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNI

Q3EBY6 Protein MID1-COMPLEMENTING ACTIVITY 26.9e-10455.61Show/hide
Query:  SSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLLA
        +SW+ LGEIA+VAQLTGIDA++LI MIV +A+TARMHKKNCRQFA HLKLI NLLEQ+K SE+ +  E  EPL+ L+DALRRSYILV SCQ++SYLYLLA
Subjt:  SSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLLA

Query:  MGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEA-MVLKKSLSCSYPHLPFN
        MGWNIV QF KAQNEID +L++VPLI + DNAR+R             L       R+         + DVIL  E + + A  VLKK+LS SYP++ F 
Subjt:  MGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEA-MVLKKSLSCSYPHLPFN

Query:  KALQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATSRH-EKIHLKK----------------------PIKKSSSISSGHDLLSTRGSDRYGE
        +AL+ E EKLQLELQRS+A  +  +CEVIQRLI VT+  A    + EK+  KK                        + +S +SSGH+LLS R     G 
Subjt:  KALQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATSRH-EKIHLKK----------------------PIKKSSSISSGHDLLSTRGSDRYGE

Query:  WHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT
        WH DLLDCCSEP LCLKTLF+PCGT ++I+TVAT+R  S  E C +L+ Y+LILSCCCYTCC+R+KLRK LNIT
Subjt:  WHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT

Q8L7E9 Protein MID1-COMPLEMENTING ACTIVITY 14.6e-12458.95Show/hide
Query:  SWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLLAM
        SW+ LGEIA+VAQLTG+DAV+LI +IVK+A+TA MHKKNCRQFAQHLKLIGNLLEQLKISE+K+YPETREPLE LEDALRRSY+LVNSC+DRSYLYLLAM
Subjt:  SWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLLAM

Query:  GWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEAMVLKKSLSCSYPHLPFNKA
        GWNIVYQFRK Q+EIDR+L+++PLITLVDNAR+R      F  +      YT      H       + DVIL  E + + A VLKK+LSCSYP+L F +A
Subjt:  GWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEAMVLKKSLSCSYPHLPFNKA

Query:  LQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATS-RHEKIHLKKPIKKS-----------------------------SSISSGHDLLSTRGS
        L+ ENEKLQ+ELQRSQ + +V +CEVIQRLIGVT+A AA     EK   KK  KKS                             S++SSGHDLLS R S
Subjt:  LQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATS-RHEKIHLKKPIKKS-----------------------------SSISSGHDLLSTRGS

Query:  --DRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNITTHNLGSAYYITGDPTLQIC
            + EWHTDLL CCSEPSLC KT F+PCGT ++IAT A+NRH S AEACN+LMAY+LILSCCCYTCCVRRKLRK LNIT    G          +  C
Subjt:  --DRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNITTHNLGSAYYITGDPTLQIC

Query:  AIILDQLLPMGLKSVFGRE
          ++ +L  + ++  +G E
Subjt:  AIILDQLLPMGLKSVFGRE

Arabidopsis top hitse value%identityAlignment
AT2G17780.1 PLAC8 family protein4.9e-10555.61Show/hide
Query:  SSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLLA
        +SW+ LGEIA+VAQLTGIDA++LI MIV +A+TARMHKKNCRQFA HLKLI NLLEQ+K SE+ +  E  EPL+ L+DALRRSYILV SCQ++SYLYLLA
Subjt:  SSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLLA

Query:  MGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEA-MVLKKSLSCSYPHLPFN
        MGWNIV QF KAQNEID +L++VPLI + DNAR+R             L       R+         + DVIL  E + + A  VLKK+LS SYP++ F 
Subjt:  MGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEA-MVLKKSLSCSYPHLPFN

Query:  KALQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATSRH-EKIHLKK----------------------PIKKSSSISSGHDLLSTRGSDRYGE
        +AL+ E EKLQLELQRS+A  +  +CEVIQRLI VT+  A    + EK+  KK                        + +S +SSGH+LLS R     G 
Subjt:  KALQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATSRH-EKIHLKK----------------------PIKKSSSISSGHDLLSTRGSDRYGE

Query:  WHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT
        WH DLLDCCSEP LCLKTLF+PCGT ++I+TVAT+R  S  E C +L+ Y+LILSCCCYTCC+R+KLRK LNIT
Subjt:  WHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT

AT2G17780.2 PLAC8 family protein4.9e-10555.61Show/hide
Query:  SSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLLA
        +SW+ LGEIA+VAQLTGIDA++LI MIV +A+TARMHKKNCRQFA HLKLI NLLEQ+K SE+ +  E  EPL+ L+DALRRSYILV SCQ++SYLYLLA
Subjt:  SSWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLLA

Query:  MGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEA-MVLKKSLSCSYPHLPFN
        MGWNIV QF KAQNEID +L++VPLI + DNAR+R             L       R+         + DVIL  E + + A  VLKK+LS SYP++ F 
Subjt:  MGWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEA-MVLKKSLSCSYPHLPFN

Query:  KALQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATSRH-EKIHLKK----------------------PIKKSSSISSGHDLLSTRGSDRYGE
        +AL+ E EKLQLELQRS+A  +  +CEVIQRLI VT+  A    + EK+  KK                        + +S +SSGH+LLS R     G 
Subjt:  KALQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATSRH-EKIHLKK----------------------PIKKSSSISSGHDLLSTRGSDRYGE

Query:  WHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT
        WH DLLDCCSEP LCLKTLF+PCGT ++I+TVAT+R  S  E C +L+ Y+LILSCCCYTCC+R+KLRK LNIT
Subjt:  WHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNIT

AT4G35920.1 PLAC8 family protein3.3e-12558.95Show/hide
Query:  SWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLLAM
        SW+ LGEIA+VAQLTG+DAV+LI +IVK+A+TA MHKKNCRQFAQHLKLIGNLLEQLKISE+K+YPETREPLE LEDALRRSY+LVNSC+DRSYLYLLAM
Subjt:  SWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLLAM

Query:  GWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEAMVLKKSLSCSYPHLPFNKA
        GWNIVYQFRK Q+EIDR+L+++PLITLVDNAR+R      F  +      YT      H       + DVIL  E + + A VLKK+LSCSYP+L F +A
Subjt:  GWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEAMVLKKSLSCSYPHLPFNKA

Query:  LQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATS-RHEKIHLKKPIKKS-----------------------------SSISSGHDLLSTRGS
        L+ ENEKLQ+ELQRSQ + +V +CEVIQRLIGVT+A AA     EK   KK  KKS                             S++SSGHDLLS R S
Subjt:  LQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATS-RHEKIHLKKPIKKS-----------------------------SSISSGHDLLSTRGS

Query:  --DRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNITTHNLGSAYYITGDPTLQIC
            + EWHTDLL CCSEPSLC KT F+PCGT ++IAT A+NRH S AEACN+LMAY+LILSCCCYTCCVRRKLRK LNIT    G          +  C
Subjt:  --DRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNITTHNLGSAYYITGDPTLQIC

Query:  AIILDQLLPMGLKSVFGRE
          ++ +L  + ++  +G E
Subjt:  AIILDQLLPMGLKSVFGRE

AT4G35920.2 PLAC8 family protein3.3e-12558.95Show/hide
Query:  SWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLLAM
        SW+ LGEIA+VAQLTG+DAV+LI +IVK+A+TA MHKKNCRQFAQHLKLIGNLLEQLKISE+K+YPETREPLE LEDALRRSY+LVNSC+DRSYLYLLAM
Subjt:  SWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLLAM

Query:  GWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEAMVLKKSLSCSYPHLPFNKA
        GWNIVYQFRK Q+EIDR+L+++PLITLVDNAR+R      F  +      YT      H       + DVIL  E + + A VLKK+LSCSYP+L F +A
Subjt:  GWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEAMVLKKSLSCSYPHLPFNKA

Query:  LQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATS-RHEKIHLKKPIKKS-----------------------------SSISSGHDLLSTRGS
        L+ ENEKLQ+ELQRSQ + +V +CEVIQRLIGVT+A AA     EK   KK  KKS                             S++SSGHDLLS R S
Subjt:  LQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATS-RHEKIHLKKPIKKS-----------------------------SSISSGHDLLSTRGS

Query:  --DRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNITTHNLGSAYYITGDPTLQIC
            + EWHTDLL CCSEPSLC KT F+PCGT ++IAT A+NRH S AEACN+LMAY+LILSCCCYTCCVRRKLRK LNIT    G          +  C
Subjt:  --DRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNITTHNLGSAYYITGDPTLQIC

Query:  AIILDQLLPMGLKSVFGRE
          ++ +L  + ++  +G E
Subjt:  AIILDQLLPMGLKSVFGRE

AT4G35920.3 PLAC8 family protein3.3e-12558.95Show/hide
Query:  SWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLLAM
        SW+ LGEIA+VAQLTG+DAV+LI +IVK+A+TA MHKKNCRQFAQHLKLIGNLLEQLKISE+K+YPETREPLE LEDALRRSY+LVNSC+DRSYLYLLAM
Subjt:  SWETLGEIANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLLAM

Query:  GWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEAMVLKKSLSCSYPHLPFNKA
        GWNIVYQFRK Q+EIDR+L+++PLITLVDNAR+R      F  +      YT      H       + DVIL  E + + A VLKK+LSCSYP+L F +A
Subjt:  GWNIVYQFRKAQNEIDRYLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEAMVLKKSLSCSYPHLPFNKA

Query:  LQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATS-RHEKIHLKKPIKKS-----------------------------SSISSGHDLLSTRGS
        L+ ENEKLQ+ELQRSQ + +V +CEVIQRLIGVT+A AA     EK   KK  KKS                             S++SSGHDLLS R S
Subjt:  LQKENEKLQLELQRSQANLNVGECEVIQRLIGVTEAVAATS-RHEKIHLKKPIKKS-----------------------------SSISSGHDLLSTRGS

Query:  --DRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNITTHNLGSAYYITGDPTLQIC
            + EWHTDLL CCSEPSLC KT F+PCGT ++IAT A+NRH S AEACN+LMAY+LILSCCCYTCCVRRKLRK LNIT    G          +  C
Subjt:  --DRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCCVRRKLRKMLNITTHNLGSAYYITGDPTLQIC

Query:  AIILDQLLPMGLKSVFGRE
          ++ +L  + ++  +G E
Subjt:  AIILDQLLPMGLKSVFGRE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTGACCACTTTGAATGCTTTTACAGATCAAATGCAGTTTGTTACCTTTTATTTTTTTATTTTTTTATTTTATCTGTTGAAACCGACATGCATTCTGGTGAGTTTGG
GGCGCCGACGTTGAAGATCTTTGAATCCAGTGCTTACGTCGAAGGTCTTTATACTGCCAATCGACTGAGGAAGTTTGTACTTCATTCTCCTATAGCGTGTTGTAATCGGT
GGCGGTGCTTTAAGATTTCGTACTATCCGTCCGTGCGCTTCTGTTGTTTCAGAATTCAGGCCAGAATTCCGAGTTTGGCAAATATGTCTTCGTGGGAAACGCTTGGTGAG
ATAGCAAATGTGGCCCAGCTTACGGGCATTGACGCAGTTCGTTTGATTTCCATGATAGTGAAATCGGCGAGCACAGCACGAATGCACAAAAAGAACTGTAGGCAATTTGC
TCAGCACCTGAAGTTGATCGGAAACCTGCTGGAGCAGCTCAAGATCTCGGAGCTTAAGAGGTATCCCGAGACTCGGGAGCCTCTGGAGCAGCTCGAAGATGCTTTGAGGA
GGTCATACATTTTGGTCAACAGCTGCCAGGACCGCAGCTATCTCTATCTGCTCGCCATGGGATGGAACATTGTATACCAATTCAGGAAGGCCCAGAATGAAATTGACAGA
TATCTGCGCCTTGTTCCTCTGATTACACTCGTTGACAATGCTCGAGTCAGGGTATTTGATTCTGTTTCTTTCCCAAGTGTTATTATTTATTTGTTTATTTACACAACAGG
GTCTCGAAAGATGCATTTCTACAGCATTTACGCCACGATTCATGATGTAATTCTGAACCCAGAACCTTCAACAGATGAGGCAATGGTGTTGAAGAAATCTCTTTCTTGTT
CTTATCCACACTTGCCTTTTAACAAAGCACTACAAAAAGAAAATGAAAAGCTTCAGCTAGAACTACAACGGTCTCAAGCTAATCTGAATGTGGGTGAATGTGAAGTTATT
CAACGTTTGATAGGTGTCACTGAAGCTGTAGCTGCAACTTCTCGTCACGAGAAAATTCACCTGAAAAAGCCCATAAAAAAAAGTTCTTCCATTTCATCAGGACATGATCT
GCTCTCGACCAGAGGTTCTGATCGGTATGGAGAATGGCACACTGACTTACTTGATTGTTGTTCAGAACCTTCTCTGTGTCTTAAGACTCTTTTCTACCCTTGTGGGACTT
TTTCAAGGATTGCTACTGTTGCAACTAATAGGCATACATCACCAGCAGAAGCATGTAATGATTTGATGGCGTATACATTGATACTATCGTGCTGTTGCTATACATGCTGT
GTCAGGAGAAAGCTTCGCAAGATGTTGAACATCACGACACATAACTTAGGCAGTGCATATTATATTACTGGAGATCCAACATTGCAGATATGTGCAATTATTCTTGACCA
ATTGCTGCCAATGGGACTGAAATCTGTGTTTGGTAGGGAGGTTTTGTTGATGACTTTCTCTCCCATCTTATGTGCTGCTGCTGTGCCCTGGTCCAAGAATGGCGAGAAGT
CGAAATACGTGGGGTTTATGGTCCTGAAAAGACCAAAACAAGCCCTCCACCCTCCCAATACATGGAATCCTAAAGTACAGTCGAAGCATATTCAATCGTCACTAGTCTCG
GAATAG
mRNA sequenceShow/hide mRNA sequence
ATGCCTGACCACTTTGAATGCTTTTACAGATCAAATGCAGTTTGTTACCTTTTATTTTTTTATTTTTTTATTTTATCTGTTGAAACCGACATGCATTCTGGTGAGTTTGG
GGCGCCGACGTTGAAGATCTTTGAATCCAGTGCTTACGTCGAAGGTCTTTATACTGCCAATCGACTGAGGAAGTTTGTACTTCATTCTCCTATAGCGTGTTGTAATCGGT
GGCGGTGCTTTAAGATTTCGTACTATCCGTCCGTGCGCTTCTGTTGTTTCAGAATTCAGGCCAGAATTCCGAGTTTGGCAAATATGTCTTCGTGGGAAACGCTTGGTGAG
ATAGCAAATGTGGCCCAGCTTACGGGCATTGACGCAGTTCGTTTGATTTCCATGATAGTGAAATCGGCGAGCACAGCACGAATGCACAAAAAGAACTGTAGGCAATTTGC
TCAGCACCTGAAGTTGATCGGAAACCTGCTGGAGCAGCTCAAGATCTCGGAGCTTAAGAGGTATCCCGAGACTCGGGAGCCTCTGGAGCAGCTCGAAGATGCTTTGAGGA
GGTCATACATTTTGGTCAACAGCTGCCAGGACCGCAGCTATCTCTATCTGCTCGCCATGGGATGGAACATTGTATACCAATTCAGGAAGGCCCAGAATGAAATTGACAGA
TATCTGCGCCTTGTTCCTCTGATTACACTCGTTGACAATGCTCGAGTCAGGGTATTTGATTCTGTTTCTTTCCCAAGTGTTATTATTTATTTGTTTATTTACACAACAGG
GTCTCGAAAGATGCATTTCTACAGCATTTACGCCACGATTCATGATGTAATTCTGAACCCAGAACCTTCAACAGATGAGGCAATGGTGTTGAAGAAATCTCTTTCTTGTT
CTTATCCACACTTGCCTTTTAACAAAGCACTACAAAAAGAAAATGAAAAGCTTCAGCTAGAACTACAACGGTCTCAAGCTAATCTGAATGTGGGTGAATGTGAAGTTATT
CAACGTTTGATAGGTGTCACTGAAGCTGTAGCTGCAACTTCTCGTCACGAGAAAATTCACCTGAAAAAGCCCATAAAAAAAAGTTCTTCCATTTCATCAGGACATGATCT
GCTCTCGACCAGAGGTTCTGATCGGTATGGAGAATGGCACACTGACTTACTTGATTGTTGTTCAGAACCTTCTCTGTGTCTTAAGACTCTTTTCTACCCTTGTGGGACTT
TTTCAAGGATTGCTACTGTTGCAACTAATAGGCATACATCACCAGCAGAAGCATGTAATGATTTGATGGCGTATACATTGATACTATCGTGCTGTTGCTATACATGCTGT
GTCAGGAGAAAGCTTCGCAAGATGTTGAACATCACGACACATAACTTAGGCAGTGCATATTATATTACTGGAGATCCAACATTGCAGATATGTGCAATTATTCTTGACCA
ATTGCTGCCAATGGGACTGAAATCTGTGTTTGGTAGGGAGGTTTTGTTGATGACTTTCTCTCCCATCTTATGTGCTGCTGCTGTGCCCTGGTCCAAGAATGGCGAGAAGT
CGAAATACGTGGGGTTTATGGTCCTGAAAAGACCAAAACAAGCCCTCCACCCTCCCAATACATGGAATCCTAAAGTACAGTCGAAGCATATTCAATCGTCACTAGTCTCG
GAATAG
Protein sequenceShow/hide protein sequence
MPDHFECFYRSNAVCYLLFFYFFILSVETDMHSGEFGAPTLKIFESSAYVEGLYTANRLRKFVLHSPIACCNRWRCFKISYYPSVRFCCFRIQARIPSLANMSSWETLGE
IANVAQLTGIDAVRLISMIVKSASTARMHKKNCRQFAQHLKLIGNLLEQLKISELKRYPETREPLEQLEDALRRSYILVNSCQDRSYLYLLAMGWNIVYQFRKAQNEIDR
YLRLVPLITLVDNARVRVFDSVSFPSVIIYLFIYTTGSRKMHFYSIYATIHDVILNPEPSTDEAMVLKKSLSCSYPHLPFNKALQKENEKLQLELQRSQANLNVGECEVI
QRLIGVTEAVAATSRHEKIHLKKPIKKSSSISSGHDLLSTRGSDRYGEWHTDLLDCCSEPSLCLKTLFYPCGTFSRIATVATNRHTSPAEACNDLMAYTLILSCCCYTCC
VRRKLRKMLNITTHNLGSAYYITGDPTLQICAIILDQLLPMGLKSVFGREVLLMTFSPILCAAAVPWSKNGEKSKYVGFMVLKRPKQALHPPNTWNPKVQSKHIQSSLVS
E