; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy3G008640 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy3G008640
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Descriptionprotein REVEILLE 6 isoform X1
Genome locationGy14Chr3:7120328..7124485
RNA-Seq ExpressionCsGy3G008640
SyntenyCsGy3G008640
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR006447 - Myb domain, plants
IPR009057 - Homeobox-like domain superfamily
IPR017884 - SANT domain
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048947.1 protein REVEILLE 6 isoform X1 [Cucumis melo var. makuwa]1.10e-21096.19Show/hide
Query:  MSHFPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKYFL
        MSHFPGIDSVRTPTP  LRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKYFL
Subjt:  MSHFPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKYFL

Query:  KIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPK-GGSTLAHSSSS
        KIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSP EPRYIYIPDS AGFGLPSPNATFSSWSCSP+PTIDVSQVPK GG TLAHSSSS
Subjt:  KIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPK-GGSTLAHSSSS

Query:  ESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGSLF
        ESTPRTWKLGEISDQGNQSMRNRVMPDFAQVY FIGSVFDPTVSGHIQRLRKMDPINLET LLLMQNLAINLISPEFE+HRKLISSYDED KKAKSGSLF
Subjt:  ESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGSLF

Query:  NSLYNVRSDNTILSA
        NSL+NVRSDNTILSA
Subjt:  NSLYNVRSDNTILSA

XP_004133848.1 protein REVEILLE 6 isoform X2 [Cucumis sativus]6.24e-22298.73Show/hide
Query:  MNMSHFPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKY
        MNMSHFPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKY
Subjt:  MNMSHFPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKY

Query:  FLKIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPKGGSTLAHSSS
        FLKIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSP EPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPKGGSTLAHSSS
Subjt:  FLKIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPKGGSTLAHSSS

Query:  SESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGSL
        SESTPRTWKLGEISDQGNQSMRNRVMPDFAQVY FIGSVFDPTVSGHIQRLRKMDPINLET LLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGSL
Subjt:  SESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGSL

Query:  FNSLYNVRSDNTILSA
        FNSLYNVRSD+TILSA
Subjt:  FNSLYNVRSDNTILSA

XP_008437994.1 PREDICTED: protein REVEILLE 6 isoform X1 [Cucumis melo]2.50e-21296.21Show/hide
Query:  MNMSHFPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKY
        MNMSHFPGIDSVRTPTP  LRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKY
Subjt:  MNMSHFPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKY

Query:  FLKIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPK-GGSTLAHSS
        FLKIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSP EPRYIYIPDS AGFGLPSPNATFSSWSCSP+PTIDVSQVPK GG TLAHSS
Subjt:  FLKIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPK-GGSTLAHSS

Query:  SSESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGS
        SSESTPRTWKLGEISDQGNQSMRNRVMPDFAQVY FIGSVFDPTVSGHIQRLRKMDPINLET LLLMQNLAINLISPEFE+HRKLISSYDED KKAKSGS
Subjt:  SSESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGS

Query:  LFNSLYNVRSDNTILSA
        LFNSL+NVRSDNTILSA
Subjt:  LFNSLYNVRSDNTILSA

XP_008437998.1 PREDICTED: protein REVEILLE 6 isoform X2 [Cucumis melo]3.57e-21496.52Show/hide
Query:  MNMSHFPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKY
        MNMSHFPGIDSVRTPTP  LRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKY
Subjt:  MNMSHFPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKY

Query:  FLKIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPKGGSTLAHSSS
        FLKIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSP EPRYIYIPDS AGFGLPSPNATFSSWSCSP+PTIDVSQVPKGG TLAHSSS
Subjt:  FLKIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPKGGSTLAHSSS

Query:  SESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGSL
        SESTPRTWKLGEISDQGNQSMRNRVMPDFAQVY FIGSVFDPTVSGHIQRLRKMDPINLET LLLMQNLAINLISPEFE+HRKLISSYDED KKAKSGSL
Subjt:  SESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGSL

Query:  FNSLYNVRSDNTILSA
        FNSL+NVRSDNTILSA
Subjt:  FNSLYNVRSDNTILSA

XP_011650733.1 protein REVEILLE 6 isoform X1 [Cucumis sativus]4.37e-22098.42Show/hide
Query:  MNMSHFPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKY
        MNMSHFPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKY
Subjt:  MNMSHFPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKY

Query:  FLKIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPK-GGSTLAHSS
        FLKIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSP EPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPK GGSTLAHSS
Subjt:  FLKIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPK-GGSTLAHSS

Query:  SSESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGS
        SSESTPRTWKLGEISDQGNQSMRNRVMPDFAQVY FIGSVFDPTVSGHIQRLRKMDPINLET LLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGS
Subjt:  SSESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGS

Query:  LFNSLYNVRSDNTILSA
        LFNSLYNVRSD+TILSA
Subjt:  LFNSLYNVRSDNTILSA

TrEMBL top hitse value%identityAlignment
A0A0A0L436 HTH myb-type domain-containing protein3.02e-22298.73Show/hide
Query:  MNMSHFPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKY
        MNMSHFPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKY
Subjt:  MNMSHFPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKY

Query:  FLKIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPKGGSTLAHSSS
        FLKIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSP EPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPKGGSTLAHSSS
Subjt:  FLKIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPKGGSTLAHSSS

Query:  SESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGSL
        SESTPRTWKLGEISDQGNQSMRNRVMPDFAQVY FIGSVFDPTVSGHIQRLRKMDPINLET LLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGSL
Subjt:  SESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGSL

Query:  FNSLYNVRSDNTILSA
        FNSLYNVRSD+TILSA
Subjt:  FNSLYNVRSDNTILSA

A0A1S3AUZ0 protein REVEILLE 6 isoform X11.21e-21296.21Show/hide
Query:  MNMSHFPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKY
        MNMSHFPGIDSVRTPTP  LRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKY
Subjt:  MNMSHFPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKY

Query:  FLKIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPK-GGSTLAHSS
        FLKIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSP EPRYIYIPDS AGFGLPSPNATFSSWSCSP+PTIDVSQVPK GG TLAHSS
Subjt:  FLKIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPK-GGSTLAHSS

Query:  SSESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGS
        SSESTPRTWKLGEISDQGNQSMRNRVMPDFAQVY FIGSVFDPTVSGHIQRLRKMDPINLET LLLMQNLAINLISPEFE+HRKLISSYDED KKAKSGS
Subjt:  SSESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGS

Query:  LFNSLYNVRSDNTILSA
        LFNSL+NVRSDNTILSA
Subjt:  LFNSLYNVRSDNTILSA

A0A1S3AVA9 protein REVEILLE 6 isoform X21.73e-21496.52Show/hide
Query:  MNMSHFPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKY
        MNMSHFPGIDSVRTPTP  LRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKY
Subjt:  MNMSHFPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKY

Query:  FLKIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPKGGSTLAHSSS
        FLKIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSP EPRYIYIPDS AGFGLPSPNATFSSWSCSP+PTIDVSQVPKGG TLAHSSS
Subjt:  FLKIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPKGGSTLAHSSS

Query:  SESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGSL
        SESTPRTWKLGEISDQGNQSMRNRVMPDFAQVY FIGSVFDPTVSGHIQRLRKMDPINLET LLLMQNLAINLISPEFE+HRKLISSYDED KKAKSGSL
Subjt:  SESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGSL

Query:  FNSLYNVRSDNTILSA
        FNSL+NVRSDNTILSA
Subjt:  FNSLYNVRSDNTILSA

A0A5A7U3R5 Protein REVEILLE 6 isoform X15.34e-21196.19Show/hide
Query:  MSHFPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKYFL
        MSHFPGIDSVRTPTP  LRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKYFL
Subjt:  MSHFPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKYFL

Query:  KIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPK-GGSTLAHSSSS
        KIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSP EPRYIYIPDS AGFGLPSPNATFSSWSCSP+PTIDVSQVPK GG TLAHSSSS
Subjt:  KIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPK-GGSTLAHSSSS

Query:  ESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGSLF
        ESTPRTWKLGEISDQGNQSMRNRVMPDFAQVY FIGSVFDPTVSGHIQRLRKMDPINLET LLLMQNLAINLISPEFE+HRKLISSYDED KKAKSGSLF
Subjt:  ESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGSLF

Query:  NSLYNVRSDNTILSA
        NSL+NVRSDNTILSA
Subjt:  NSLYNVRSDNTILSA

A0A5D3D070 Protein REVEILLE 6 isoform X11.26e-20995.87Show/hide
Query:  MSHFPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKYFL
        MSHFPGIDSVRTPTP  LRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKYFL
Subjt:  MSHFPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKYFL

Query:  KIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPK-GGSTLAHSSSS
        KIQKSGKSEHVPPPRPKKKASHP PQKAPKNATTQHPGMYQPLSSP EPRYIYIPDS AGFGLPSPNATFSSWSCSP+PTIDVSQVPK GG TLAHSSSS
Subjt:  KIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPK-GGSTLAHSSSS

Query:  ESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGSLF
        ESTPRTWKLGEISDQGNQSMRNRVMPDFAQVY FIGSVFDPTVSGHIQRLRKMDPINLET LLLMQNLAINLISPEFE+HRKLISSYDED KKAKSGSLF
Subjt:  ESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGSLF

Query:  NSLYNVRSDNTILSA
        NSL+NVRSDNTILSA
Subjt:  NSLYNVRSDNTILSA

SwissProt top hitse value%identityAlignment
C0SVG5 Protein REVEILLE 54.7e-7355.44Show/hide
Query:  FPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKYFLKIQ
        FP  DS     P      ++P +   S  +F  SED + KIRKPYTI KSRE+WT+QEHDKFLEAL LFDRDWKKIEAFVGSKTV+QIRSHAQKYFLK+Q
Subjt:  FPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKYFLKIQ

Query:  KSGKSEHVPPPRPKKKASHPYPQKAPKNA--TTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNA-TFSSWSCS----PMPTIDVSQVPKGGSTLA--
        KSG +EH+PPPRPK+KASHPYP KAPKN   T+       PL   LEP Y+Y  DS +  G  +  A T SSW+      P P I+V +   G S  A  
Subjt:  KSGKSEHVPPPRPKKKASHPYPQKAPKNA--TTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNA-TFSSWSCS----PMPTIDVSQVPKGGSTLA--

Query:  --HSSSSESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSY
          +    E T R   + + +++ +    +RVMP+FA+VY FIGSVFDP  SGH+QRL++MDPIN+ET LLLMQNL++NL SPEF   R+LISSY
Subjt:  --HSSSSESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSY

Q6R0G4 Protein REVEILLE 48.6e-5949.1Show/hide
Query:  STSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKYFLKIQKSGKSEHVPPPRPKKKASHPYP
        +T  ++A     E   KK+RK YTITKSRESWTE EHDKFLEALQLFDRDWKKIE FVGSKTVIQIRSHAQKYFLK+QK+G   HVPPPRPK+KA+HPYP
Subjt:  STSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKYFLKIQKSGKSEHVPPPRPKKKASHPYP

Query:  QKAPKNA-TTQHPGMYQPLSSPLEPRYIYIPDSTAGF------GLPSPNATFSSWSCSPMPTIDVSQVPKGGSTLAHSSSSESTPRTWKLGEISDQGNQS
        QKA KNA  + H  M  P      P Y    D T+        G+  P     +  C     +DV        T   +S   S+ RT    +      Q+
Subjt:  QKAPKNA-TTQHPGMYQPLSSPLEPRYIYIPDSTAGF------GLPSPNATFSSWSCSPMPTIDVSQVPKGGSTLAHSSSSESTPRTWKLGEISDQGNQS

Query:  MRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYDEDRKKAKS
             +PDFA+VY FIGSVFDP   G +++L++MDPIN ET LLLM+NL +NL +P+FE   + + + +E  +   S
Subjt:  MRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYDEDRKKAKS

Q6R0H0 Protein REVEILLE 33.7e-7052.7Show/hide
Query:  MNMSHFPGIDSVRTPTPPPLRTAALPTS-TSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQK
        M MS  PG +++      P     +P S  SN   +F   ED +KK+RKPYTITKSRE+WTEQEHDKFLEAL LFDRDWKKI+AFVGSKTVIQIRSHAQK
Subjt:  MNMSHFPGIDSVRTPTPPPLRTAALPTS-TSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQK

Query:  YFLKIQKSGKSEHVPPPRPKKKASHPYPQKAPK------NATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPKGGS
        YFLK+QK+G  EH+PPPRPK+KA+HPYPQKAPK      NA  QH  +Y   S P       +  +T   GL   + +        +P+  + +      
Subjt:  YFLKIQKSGKSEHVPPPRPKKKASHPYPQKAPK------NATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPKGGS

Query:  TLAHSSSSESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYD
            +SSS    RT  + E +DQ +    +RV P+FA+VY FIGSVFDP  +GH++RL++MDPINLET LLLM+NL++NL SPEF+  RKLISSY+
Subjt:  TLAHSSSSESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYD

Q8H0W3 Protein REVEILLE 61.5e-8256.8Show/hide
Query:  TAALPTSTSNSVAAFPVS-------EDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKYFLKIQKSGKSEHVP
        +++ PT++S +VA   V+       ED SKKIRKPYTITKSRESWTE EHDKFLEALQLFDRDWKKIEAF+GSKTVIQIRSHAQKYFLK+QKSG  EH+P
Subjt:  TAALPTSTSNSVAAFPVS-------EDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKYFLKIQKSGKSEHVP

Query:  PPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPKGGSTLAH--SSSSESTPRTWKLGE
        PPRPK+KA+HPYPQKA KN   Q PG ++  S P +P +++ P+S++        A  + W+ +   TI  + +PK G+   +  SSSSE+TPR     +
Subjt:  PPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPKGGSTLAH--SSSSESTPRTWKLGE

Query:  ISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGSLFNSLYN
          D GN     RV+PDFAQVY FIGSVFDP  S H+Q+L+KMDPI++ET LLLM+NL+INL SP+FE+HR+L+SSYD   + A      N   N
Subjt:  ISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGSLFNSLYN

Q8RWU3 Protein REVEILLE 85.8e-6351.53Show/hide
Query:  MSHFPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKYFL
        MS  P  +      PPP      PTST        V+E +SKK+RKPYTITKSRESWTE+EHDKFLEALQLFDRDWKKIE FVGSKTVIQIRSHAQKYFL
Subjt:  MSHFPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKYFL

Query:  KIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSP---MPTIDVSQVPKGGSTLAHSS
        K+QK+G   HVPPPRPK+KA+HPYPQKA KNA  Q P   Q  +S    R   +P    G+      +   +   SP   + T+  ++   G   L + S
Subjt:  KIQKSGKSEHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSP---MPTIDVSQVPKGGSTLAHSS

Query:  SSE-----STPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYD
        S       S+ RT    EI  +  Q      +PDFA+VY FIGSVFDP   GH+++L++MDPIN ET LLLM+NL +NL +P+ E+ RK++ SYD
Subjt:  SSE-----STPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYD

Arabidopsis top hitse value%identityAlignment
AT1G01520.1 Homeodomain-like superfamily protein2.7e-7152.7Show/hide
Query:  MNMSHFPGIDSVRTPTPPPLRTAALPTS-TSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQK
        M MS  PG +++      P     +P S  SN   +F   ED +KK+RKPYTITKSRE+WTEQEHDKFLEAL LFDRDWKKI+AFVGSKTVIQIRSHAQK
Subjt:  MNMSHFPGIDSVRTPTPPPLRTAALPTS-TSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQK

Query:  YFLKIQKSGKSEHVPPPRPKKKASHPYPQKAPK------NATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPKGGS
        YFLK+QK+G  EH+PPPRPK+KA+HPYPQKAPK      NA  QH  +Y   S P       +  +T   GL   + +        +P+  + +      
Subjt:  YFLKIQKSGKSEHVPPPRPKKKASHPYPQKAPK------NATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPKGGS

Query:  TLAHSSSSESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYD
            +SSS    RT  + E +DQ +    +RV P+FA+VY FIGSVFDP  +GH++RL++MDPINLET LLLM+NL++NL SPEF+  RKLISSY+
Subjt:  TLAHSSSSESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYD

AT4G01280.1 Homeodomain-like superfamily protein7.5e-7454.98Show/hide
Query:  FPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKYFLKIQ
        FP  DS     P      ++P +   S  +F  SED + KIRKPYTI KSRE+WT+QEHDKFLEAL LFDRDWKKIEAFVGSKTV+QIRSHAQKYFLK+Q
Subjt:  FPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKYFLKIQ

Query:  KSGKSEHVPPPRPKKKASHPYPQKAPKNA--TTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNA-TFSSWSCS----PMPTIDVSQVPKGGSTLAHS
        KSG +EH+PPPRPK+KASHPYP KAPKN   T+       PL   LEP Y+Y  DS +  G  +  A T SSW+      P P I+        + L ++
Subjt:  KSGKSEHVPPPRPKKKASHPYPQKAPKNA--TTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNA-TFSSWSCS----PMPTIDVSQVPKGGSTLAHS

Query:  -SSSESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSY
            E T R   + + +++ +    +RVMP+FA+VY FIGSVFDP  SGH+QRL++MDPIN+ET LLLMQNL++NL SPEF   R+LISSY
Subjt:  -SSSESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSY

AT4G01280.2 Homeodomain-like superfamily protein3.4e-7455.44Show/hide
Query:  FPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKYFLKIQ
        FP  DS     P      ++P +   S  +F  SED + KIRKPYTI KSRE+WT+QEHDKFLEAL LFDRDWKKIEAFVGSKTV+QIRSHAQKYFLK+Q
Subjt:  FPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKYFLKIQ

Query:  KSGKSEHVPPPRPKKKASHPYPQKAPKNA--TTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNA-TFSSWSCS----PMPTIDVSQVPKGGSTLA--
        KSG +EH+PPPRPK+KASHPYP KAPKN   T+       PL   LEP Y+Y  DS +  G  +  A T SSW+      P P I+V +   G S  A  
Subjt:  KSGKSEHVPPPRPKKKASHPYPQKAPKNA--TTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNA-TFSSWSCS----PMPTIDVSQVPKGGSTLA--

Query:  --HSSSSESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSY
          +    E T R   + + +++ +    +RVMP+FA+VY FIGSVFDP  SGH+QRL++MDPIN+ET LLLMQNL++NL SPEF   R+LISSY
Subjt:  --HSSSSESTPRTWKLGEISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSY

AT5G52660.1 Homeodomain-like superfamily protein4.7e-8457.34Show/hide
Query:  TAALPTSTSNSVAAFPVS-------EDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKYFLKIQKSGKSEHVP
        +++ PT++S +VA   V+       ED SKKIRKPYTITKSRESWTE EHDKFLEALQLFDRDWKKIEAF+GSKTVIQIRSHAQKYFLK+QKSG  EH+P
Subjt:  TAALPTSTSNSVAAFPVS-------EDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKYFLKIQKSGKSEHVP

Query:  PPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPKG-GSTLAHSSSSESTPRTWKLGEI
        PPRPK+KA+HPYPQKA KN   Q PG ++  S P +P +++ P+S++        A  + W+ +   TI  + +PKG G+    SSSSE+TPR     + 
Subjt:  PPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPKG-GSTLAHSSSSESTPRTWKLGEI

Query:  SDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGSLFNSLYN
         D GN     RV+PDFAQVY FIGSVFDP  S H+Q+L+KMDPI++ET LLLM+NL+INL SP+FE+HR+L+SSYD   + A      N   N
Subjt:  SDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGSLFNSLYN

AT5G52660.2 Homeodomain-like superfamily protein1.0e-8356.8Show/hide
Query:  TAALPTSTSNSVAAFPVS-------EDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKYFLKIQKSGKSEHVP
        +++ PT++S +VA   V+       ED SKKIRKPYTITKSRESWTE EHDKFLEALQLFDRDWKKIEAF+GSKTVIQIRSHAQKYFLK+QKSG  EH+P
Subjt:  TAALPTSTSNSVAAFPVS-------EDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKYFLKIQKSGKSEHVP

Query:  PPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPKGGSTLAH--SSSSESTPRTWKLGE
        PPRPK+KA+HPYPQKA KN   Q PG ++  S P +P +++ P+S++        A  + W+ +   TI  + +PK G+   +  SSSSE+TPR     +
Subjt:  PPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPKGGSTLAH--SSSSESTPRTWKLGE

Query:  ISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGSLFNSLYN
          D GN     RV+PDFAQVY FIGSVFDP  S H+Q+L+KMDPI++ET LLLM+NL+INL SP+FE+HR+L+SSYD   + A      N   N
Subjt:  ISDQGNQSMRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGSLFNSLYN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATATGAGCCATTTTCCTGGTATTGATTCTGTTCGAACTCCAACTCCTCCTCCACTAAGAACTGCTGCTCTGCCTACTTCGACTTCTAATTCTGTTGCTGCTTTCCC
GGTTTCGGAGGATGCGAGTAAGAAGATTCGAAAGCCTTATACAATTACTAAGTCGAGAGAGAGTTGGACGGAGCAGGAGCACGATAAGTTTCTTGAAGCTCTTCAATTAT
TTGATCGTGACTGGAAGAAGATTGAAGCATTTGTTGGCTCAAAAACAGTTATCCAGATTCGCAGTCATGCTCAAAAATATTTTCTAAAGATTCAAAAAAGTGGGAAAAGT
GAGCATGTACCTCCTCCTCGACCAAAGAAGAAAGCATCTCACCCATACCCACAAAAAGCTCCTAAGAATGCGACCACTCAGCATCCTGGGATGTATCAACCCTTATCTTC
TCCACTCGAACCAAGATATATTTACATTCCAGACTCAACAGCAGGGTTTGGACTTCCCAGTCCAAATGCTACCTTCTCTTCTTGGAGCTGTAGCCCTATGCCGACTATTG
ATGTGTCACAAGTACCCAAAGGTGGATCAACATTGGCACATAGTAGCAGTAGTGAGAGCACTCCACGGACATGGAAACTTGGAGAAATTTCTGACCAAGGAAATCAGAGC
ATGCGAAACAGAGTCATGCCGGATTTTGCTCAAGTTTACCGCTTCATCGGCAGTGTATTTGATCCCACTGTGTCGGGTCATATTCAGAGACTAAGAAAGATGGACCCAAT
AAATCTAGAGACGACGCTGCTTTTAATGCAAAATCTTGCCATCAACTTGATCAGTCCAGAATTTGAAAACCACAGAAAACTGATTTCATCATATGATGAGGACAGGAAGA
AGGCCAAATCTGGTAGCCTCTTTAACAGTCTATACAATGTCAGATCAGACAATACCATTCTATCAGCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATATGAGCCATTTTCCTGGTATTGATTCTGTTCGAACTCCAACTCCTCCTCCACTAAGAACTGCTGCTCTGCCTACTTCGACTTCTAATTCTGTTGCTGCTTTCCC
GGTTTCGGAGGATGCGAGTAAGAAGATTCGAAAGCCTTATACAATTACTAAGTCGAGAGAGAGTTGGACGGAGCAGGAGCACGATAAGTTTCTTGAAGCTCTTCAATTAT
TTGATCGTGACTGGAAGAAGATTGAAGCATTTGTTGGCTCAAAAACAGTTATCCAGATTCGCAGTCATGCTCAAAAATATTTTCTAAAGATTCAAAAAAGTGGGAAAAGT
GAGCATGTACCTCCTCCTCGACCAAAGAAGAAAGCATCTCACCCATACCCACAAAAAGCTCCTAAGAATGCGACCACTCAGCATCCTGGGATGTATCAACCCTTATCTTC
TCCACTCGAACCAAGATATATTTACATTCCAGACTCAACAGCAGGGTTTGGACTTCCCAGTCCAAATGCTACCTTCTCTTCTTGGAGCTGTAGCCCTATGCCGACTATTG
ATGTGTCACAAGTACCCAAAGGTGGATCAACATTGGCACATAGTAGCAGTAGTGAGAGCACTCCACGGACATGGAAACTTGGAGAAATTTCTGACCAAGGAAATCAGAGC
ATGCGAAACAGAGTCATGCCGGATTTTGCTCAAGTTTACCGCTTCATCGGCAGTGTATTTGATCCCACTGTGTCGGGTCATATTCAGAGACTAAGAAAGATGGACCCAAT
AAATCTAGAGACGACGCTGCTTTTAATGCAAAATCTTGCCATCAACTTGATCAGTCCAGAATTTGAAAACCACAGAAAACTGATTTCATCATATGATGAGGACAGGAAGA
AGGCCAAATCTGGTAGCCTCTTTAACAGTCTATACAATGTCAGATCAGACAATACCATTCTATCAGCTTAG
Protein sequenceShow/hide protein sequence
MNMSHFPGIDSVRTPTPPPLRTAALPTSTSNSVAAFPVSEDASKKIRKPYTITKSRESWTEQEHDKFLEALQLFDRDWKKIEAFVGSKTVIQIRSHAQKYFLKIQKSGKS
EHVPPPRPKKKASHPYPQKAPKNATTQHPGMYQPLSSPLEPRYIYIPDSTAGFGLPSPNATFSSWSCSPMPTIDVSQVPKGGSTLAHSSSSESTPRTWKLGEISDQGNQS
MRNRVMPDFAQVYRFIGSVFDPTVSGHIQRLRKMDPINLETTLLLMQNLAINLISPEFENHRKLISSYDEDRKKAKSGSLFNSLYNVRSDNTILSA