; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr029755 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr029755
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionWD repeat-containing protein 55
Genome locationtig00153449:2494269..2511200
RNA-Seq ExpressionSgr029755
SyntenySgr029755
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001680 - WD40 repeat
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
IPR019775 - WD40 repeat, conserved site
IPR024977 - Anaphase-promoting complex subunit 4, WD40 domain
IPR036322 - WD40-repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593090.1 WD repeat-containing protein 55, partial [Cucurbita argyrosperma subsp. sororia]2.9e-16892.83Show/hide
Query:  MEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL
        MEVNLGGIPFDLDFHPS+QLVAAGLIDGNLLLYRY+ NALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL
Subjt:  MEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL

Query:  INLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT
        +N+TSETIASGDDNGCIKVWDTRQRSCC+SFEAHEDYISDMTYS+DSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT
Subjt:  INLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT

Query:  LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDMNLLQSSGN
        LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRV+A SESGLISLVG+LPNRIIQPIAEHSDYPVERLAFSHD+K+LGSISHDYMLKLWDM+LLQSS N
Subjt:  LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDMNLLQSSGN

Query:  ILNGQTTV--DASASDNLFTF
        + NG   V   ASASDNLF F
Subjt:  ILNGQTTV--DASASDNLFTF

KAG7025499.1 WD repeat-containing protein 55, partial [Cucurbita argyrosperma subsp. argyrosperma]1.7e-16892.83Show/hide
Query:  MEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL
        MEVNLGGIPFDLDFHPS+QLVAAGLIDGNLLLYRY+ NALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL
Subjt:  MEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL

Query:  INLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT
        +N+TSETIASGDDNGCIKVWDTRQRSCC+SFEAHEDYISDMTYS+DSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT
Subjt:  INLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT

Query:  LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDMNLLQSSGN
        LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRV+A SESGLISLVG+LPNRIIQPIAEHSDYPVERLAFSHD+K+LGSISHDYMLKLWDM+LLQSS N
Subjt:  LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDMNLLQSSGN

Query:  ILNGQTTV--DASASDNLFTF
        + NG+  V   ASASDNLF F
Subjt:  ILNGQTTV--DASASDNLFTF

XP_022151710.1 WD repeat-containing protein 55 [Momordica charantia]7.9e-17495.92Show/hide
Query:  MEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL
        MEVNLGGIPFDLDFHPS+QLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL
Subjt:  MEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL

Query:  INLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT
        INLTSETIASGDDNGCIKVWDTRQRSCC++FEAHEDYISDMT+S+DSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT
Subjt:  INLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT

Query:  LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDMNLLQSSGN
        LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRV+A SESGLISLVGILPNR+IQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDM+LLQSSGN
Subjt:  LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDMNLLQSSGN

Query:  ILNGQTTVDASASDNLFTF
          NGQTTVDA  SDNLFTF
Subjt:  ILNGQTTVDASASDNLFTF

XP_022960342.1 WD repeat-containing protein 55 [Cucurbita moschata]2.9e-16892.83Show/hide
Query:  MEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL
        MEVNLGGIPFDLDFHPS+QLVAAGLIDGNLLLYRY+ NALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL
Subjt:  MEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL

Query:  INLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT
        +N+TSETIASGDDNGCIKVWDTRQRSCC+SFEAHEDYISDMTYS+DSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT
Subjt:  INLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT

Query:  LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDMNLLQSSGN
        LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRV+A SESGLISLVG+LPNRIIQPIAEHSDYPVERLAFSHD+K+LGSISHDYMLKLWDM+LLQSS N
Subjt:  LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDMNLLQSSGN

Query:  ILNGQTTV--DASASDNLFTF
        + NG   V   ASASDNLF F
Subjt:  ILNGQTTV--DASASDNLFTF

XP_023515082.1 WD repeat-containing protein 55 [Cucurbita pepo subsp. pepo]8.4e-16892.83Show/hide
Query:  MEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL
        MEVNLGGIPFDLDFHPS+QLVAAGLIDGNLLLYRY+ NALPQRLLKVRAHSESCRAVRFIN GRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL
Subjt:  MEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL

Query:  INLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT
        +N+TSETIASGDDNGCIKVWDTRQRSCC+SFEAHEDYISDMTYS+DSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT
Subjt:  INLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT

Query:  LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDMNLLQSSGN
        LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRV+A SESGLISLVG+LPNRIIQPIAEHSDYPVERLAFSHD+KFLGSISHDYMLKLWDM+LLQSS N
Subjt:  LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDMNLLQSSGN

Query:  ILNGQTTV--DASASDNLFTF
        + NG   V   ASASDNLF F
Subjt:  ILNGQTTV--DASASDNLFTF

TrEMBL top hitse value%identityAlignment
A0A1S3CPA6 WD repeat-containing protein 551.3e-15386.52Show/hide
Query:  MEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL
        ME+NL  IPFDLDFHPSDQLVAAG+I GNL LYRY+ANALPQ+L KVRAH +SCRAVRFINDGRAILTGS DHSIL+TDVETGSVIARLEDAHD+AV++L
Subjt:  MEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL

Query:  INLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT
        IN+TSETIASGDDNG IKVWDTRQRSCCSSF+AHEDYISDMTYS+DS KLLATSGDG+LSV NLRRNK+HARSEFSE ELLSVV MKNGRKVICGSQ GT
Subjt:  INLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT

Query:  LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDMNLLQSSGN
        LLLYSWG F+DCSDRFVD SQNPVNALLKLDEDRVI  SESGLISLVGILPNR+IQPIAEHSDYPVERLAFSHDRKFLGSISHDYM+KLWDM+LLQSSG 
Subjt:  LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDMNLLQSSGN

Query:  ILNGQTTVDASASDNLFTF
          N  TTV+ASAS N FTF
Subjt:  ILNGQTTVDASASDNLFTF

A0A5A7UUH0 WD repeat-containing protein 551.3e-15386.52Show/hide
Query:  MEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL
        ME+NL  IPFDLDFHPSDQLVAAG+I GNL LYRY+ANALPQ+L KVRAH +SCRAVRFINDGRAILTGS DHSIL+TDVETGSVIARLEDAHD+AV++L
Subjt:  MEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL

Query:  INLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT
        IN+TSETIASGDDNG IKVWDTRQRSCCSSF+AHEDYISDMTYS+DS KLLATSGDG+LSV NLRRNK+HARSEFSE ELLSVV MKNGRKVICGSQ GT
Subjt:  INLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT

Query:  LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDMNLLQSSGN
        LLLYSWG F+DCSDRFVD SQNPVNALLKLDEDRVI  SESGLISLVGILPNR+IQPIAEHSDYPVERLAFSHDRKFLGSISHDYM+KLWDM+LLQSSG 
Subjt:  LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDMNLLQSSGN

Query:  ILNGQTTVDASASDNLFTF
          N  TTV+ASAS N FTF
Subjt:  ILNGQTTVDASASDNLFTF

A0A6J1DBY5 WD repeat-containing protein 553.8e-17495.92Show/hide
Query:  MEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL
        MEVNLGGIPFDLDFHPS+QLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL
Subjt:  MEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL

Query:  INLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT
        INLTSETIASGDDNGCIKVWDTRQRSCC++FEAHEDYISDMT+S+DSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT
Subjt:  INLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT

Query:  LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDMNLLQSSGN
        LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRV+A SESGLISLVGILPNR+IQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDM+LLQSSGN
Subjt:  LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDMNLLQSSGN

Query:  ILNGQTTVDASASDNLFTF
          NGQTTVDA  SDNLFTF
Subjt:  ILNGQTTVDASASDNLFTF

A0A6J1H752 WD repeat-containing protein 551.4e-16892.83Show/hide
Query:  MEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL
        MEVNLGGIPFDLDFHPS+QLVAAGLIDGNLLLYRY+ NALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL
Subjt:  MEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL

Query:  INLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT
        +N+TSETIASGDDNGCIKVWDTRQRSCC+SFEAHEDYISDMTYS+DSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT
Subjt:  INLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT

Query:  LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDMNLLQSSGN
        LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRV+A SESGLISLVG+LPNRIIQPIAEHSDYPVERLAFSHD+K+LGSISHDYMLKLWDM+LLQSS N
Subjt:  LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDMNLLQSSGN

Query:  ILNGQTTV--DASASDNLFTF
        + NG   V   ASASDNLF F
Subjt:  ILNGQTTV--DASASDNLFTF

A0A6J1KPY0 WD repeat-containing protein 555.3e-16892.52Show/hide
Query:  MEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL
        MEVNLGGIPFDLDFHPS+QLVAAGLIDGNLLLYRY+ NALPQRLLKVRAHSESCRAVRFIN GRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL
Subjt:  MEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL

Query:  INLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT
        +N+TSETIASGDDNGCIKVWDTRQRSCC+SFEAHEDYISDMTYS+DSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKV+CGSQTGT
Subjt:  INLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT

Query:  LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDMNLLQSSGN
        LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRV+A SESGLISLVG+LPNRIIQPIAEHSDYPVERLAFSHD+KFLGSISHDYMLKLWDM+LLQSS N
Subjt:  LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDMNLLQSSGN

Query:  ILNGQTTV--DASASDNLFTF
        + NG   V   ASASDNLF F
Subjt:  ILNGQTTV--DASASDNLFTF

SwissProt top hitse value%identityAlignment
A1L112 WD repeat-containing protein 551.3e-5437.22Show/hide
Query:  TIRVPENKFTGMEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYNA-NALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIAR
        T   P  + T  ++ L      L FHP+  L+AAG +DG++ ++ Y+      + L     H +SCRAV F  DG+ ++T S D +I   DVE G +  R
Subjt:  TIRVPENKFTGMEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYNA-NALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIAR

Query:  LEDAHDDAVNRLINLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKN
        +  AH   +N L+ +    + +GDD G I++WD R+         HE+YI+DM        LL  SGDG L V N++R +    SE    +L SV +MK 
Subjt:  LEDAHDDAVNRLINLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKN

Query:  GRKVICGSQTGTLLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLK
        G+KV CGS  GT+ L++W  F   SDRF  L    ++ ++ + E+ +  GS  G+I  V ILPNR++  + +H+  PVE LA SH   FL S  HD  LK
Subjt:  GRKVICGSQTGTLLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLK

Query:  LWDMNLLQS
         WDM  L++
Subjt:  LWDMNLLQS

O80775 WD repeat-containing protein 556.1e-12168.25Show/hide
Query:  MEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL
        ME++LG   F +DFHPS  LVAAGLIDG+L LYRY++++   R  KVRAH ESCRAVRFI+DG+ I+T S D SILATDVETG+ +A LE+AH+DAVN L
Subjt:  MEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL

Query:  INLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT
        IN+T  TIASGDD GC+K+WDTRQRSC   F AHEDYIS MT+++DSMKL+ TSGDGTLSVCNLR +KV ++SEFSE ELLSVVIMKNGRKVICG+Q GT
Subjt:  INLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT

Query:  LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDMNLLQSSGN
        LLLYSWG FKDCSDRFVDL+ N V+ALLKLDEDR+I G ++G+ISLVGILPNRIIQPI  H DYP+E LA SHD+KFLGS +HD MLKLW++  +    N
Subjt:  LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDMNLLQSSGN

Query:  ILNGQTTVDASASDN
        + +G  +  A  SD+
Subjt:  ILNGQTTVDASASDN

Q54SA5 WD repeat-containing protein 55 homolog1.3e-5737.05Show/hide
Query:  LDTIRVPENKFTGMEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIA
        +DT    +++    +++   IPF L+FHP++ L+     +G L L++Y+ +    + L +R H   CR   F +DG+ I T S D S+   D+ TGS++ 
Subjt:  LDTIRVPENKFTGMEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIA

Query:  RLEDAHDDAVNRLINLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMK
          E+AHD  +N L++     + +GDD G IKVWD RQ++    F+ H D+ISD+T + D   + ATSGDG +S+ N  R  +   SE S+ ELLS + + 
Subjt:  RLEDAHDDAVNRLINLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMK

Query:  NGRKVICGSQTGTLLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYML
        NG+K++CGSQ G++L+Y     ++   +F    Q+ V+AL+K++ +   +GS  G+I  +G+ P +++  + EHS +P+ER+A S D ++LGSISHD+ L
Subjt:  NGRKVICGSQTGTLLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYML

Query:  KLWDM
        K W++
Subjt:  KLWDM

Q58DT8 WD repeat-containing protein 552.6e-5540.07Show/hide
Query:  LDFHPSDQLVAAGLIDGNLLLYRYNA-NALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRLINLTSETIAS
        L FHP+  L+AAG +DG++ ++ Y+      + L     H +SCRAV F  DG+ ++T S D +I   DVE G +  R+  AH   +N L+ +    +A+
Subjt:  LDFHPSDQLVAAGLIDGNLLLYRYNA-NALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRLINLTSETIAS

Query:  GDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGTLLLYSWGCFK
        GDD G I++WD R+         HE+YI+DM    D   LL  SGDG L V N++R +    SE    +L SV +MK GRKV CGS  GT+ L++W  F 
Subjt:  GDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGTLLLYSWGCFK

Query:  DCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDMNLLQS
          SDRF  L    ++ ++ + E  + AGS  G+I  V ILPNR++  + +H++ PVE LA SH   FL S  HD  LK WDM  L++
Subjt:  DCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDMNLLQS

Q6DRF9 WD repeat-containing protein 552.6e-5536.36Show/hide
Query:  DTIRVPENKFTGMEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYN-ANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIA
        D  + P+ + T  ++ L  I   + FHP   ++AAG IDG++ L+ Y+      + L     H +SCR V F +DG+ + + S D +I   DVE G +  
Subjt:  DTIRVPENKFTGMEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYN-ANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIA

Query:  RLEDAHDDAVNRLINLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMK
        R+  AH   +N ++ +     A+GDD G +KVWD R+ +     + HEDYISD+T       LL +SGDGTL V N++R +    SE    +L SV IMK
Subjt:  RLEDAHDDAVNRLINLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMK

Query:  NGRKVICGSQTGTLLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYML
         GRKV+CGS  GT+ +++W  F   SDRF  +    V+ ++ + +  + A S  G+I  + ILPNR++  I +H    +E +A   D  FL S +HD ++
Subjt:  NGRKVICGSQTGTLLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYML

Query:  KLWDMNLL
        K WD++ L
Subjt:  KLWDMNLL

Arabidopsis top hitse value%identityAlignment
AT1G73720.1 transducin family protein / WD-40 repeat family protein2.0e-1025.47Show/hide
Query:  RAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLE-------DAHDDAVNRL-INLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYIS
        ++H+E     RF  DG+ + + S D  I   D  +G +   L+         HDD V  +  +  SE +ASG  +G IK+W  R   C   F+AH   ++
Subjt:  RAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLE-------DAHDDAVNRL-INLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYIS

Query:  DMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGTLLLYSWGCFKDCSDRF--------VDLSQNPVNALLKLD
         +++S D  +LL+TS D T  +  L+  K+          +   +   +G ++I  S   T+ ++      DC   F         D S N ++   K  
Subjt:  DMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGTLLLYSWGCFKDCSDRF--------VDLSQNPVNALLKLD

Query:  EDRVIAGSESGL
        E  V+    S +
Subjt:  EDRVIAGSESGL

AT2G34260.1 transducin family protein / WD-40 repeat family protein4.4e-12268.25Show/hide
Query:  MEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL
        ME++LG   F +DFHPS  LVAAGLIDG+L LYRY++++   R  KVRAH ESCRAVRFI+DG+ I+T S D SILATDVETG+ +A LE+AH+DAVN L
Subjt:  MEVNLGGIPFDLDFHPSDQLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL

Query:  INLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT
        IN+T  TIASGDD GC+K+WDTRQRSC   F AHEDYIS MT+++DSMKL+ TSGDGTLSVCNLR +KV ++SEFSE ELLSVVIMKNGRKVICG+Q GT
Subjt:  INLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGT

Query:  LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDMNLLQSSGN
        LLLYSWG FKDCSDRFVDL+ N V+ALLKLDEDR+I G ++G+ISLVGILPNRIIQPI  H DYP+E LA SHD+KFLGS +HD MLKLW++  +    N
Subjt:  LLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDMNLLQSSGN

Query:  ILNGQTTVDASASDN
        + +G  +  A  SD+
Subjt:  ILNGQTTVDASASDN

AT2G34260.2 transducin family protein / WD-40 repeat family protein1.3e-9769.6Show/hide
Query:  ILTGSPDHSILATDVETGSVIARLEDAHDDAVNRLINLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLR
        I+T S D SILATDVETG+ +A LE+AH+DAVN LIN+T  TIASGDD GC+K+WDTRQRSC   F AHEDYIS MT+++DSMKL+ TSGDGTLSVCNLR
Subjt:  ILTGSPDHSILATDVETGSVIARLEDAHDDAVNRLINLTSETIASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLR

Query:  RNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGTLLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYP
         +KV ++SEFSE ELLSVVIMKNGRKVICG+Q GTLLLYSWG FKDCSDRFVDL+ N V+ALLKLDEDR+I G ++G+ISLVGILPNRIIQPI  H DYP
Subjt:  RNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGTLLLYSWGCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYP

Query:  VERLAFSHDRKFLGSISHDYMLKLWDMNLLQSSGNILNGQTTVDASASDN
        +E LA SHD+KFLGS +HD MLKLW++  +    N+ +G  +  A  SD+
Subjt:  VERLAFSHDRKFLGSISHDYMLKLWDMNLLQSSGNILNGQTTVDASASDN

AT2G43770.1 Transducin/WD40 repeat-like superfamily protein4.2e-1622.26Show/hide
Query:  FDLDFHPSDQLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL--INLTSET
        + + F+P+  L+A+G  D  + L+R + +   +  + ++ H  +   + + +DG  I++ SPD ++ A DVETG  I ++ + H   VN           
Subjt:  FDLDFHPSDQLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRL--INLTSET

Query:  IASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGTLLLYSW-
        I SG D+G  K+WD RQR    +F   +  I+ +++S  + K+     D  + V +LR+ +     E  +  +  + +  +G  ++       L ++   
Subjt:  IASGDDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGTLLLYSW-

Query:  ------GCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDM
               C K       +  +N +      D  +V AGS   ++ +      R I  +  H+   V    F      +GS S D  + L ++
Subjt:  ------GCFKDCSDRFVDLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDM

AT3G49660.1 Transducin/WD40 repeat-like superfamily protein5.1e-1421.95Show/hide
Query:  FHPSDQLVAAGLIDGNLLLYRYNA--NALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRLINLTSETIASG
        F    +L+A+   D  +  Y  N   + + + + +   H      V F +D R I++ S D ++   DVETGS+I  L    + A     N  S  I SG
Subjt:  FHPSDQLVAAGLIDGNLLLYRYNA--NALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRLINLTSETIASG

Query:  DDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNK-VHARSEFSEVELLSVVIMKNGRKVICGSQTGTLLLYSWGCFK
          +  +++WD     C     AH D ++ + ++ D   ++++S DG   + +      V    +     +  V    NG+ ++ G+   TL L++     
Subjt:  DDNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNK-VHARSEFSEVELLSVVIMKNGRKVICGSQTGTLLLYSWGCFK

Query:  DCSDRFVDLSQNPVNALLKLDE-------DRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLW
          S +F+      VNA   +          R+++GSE   + +  +   +++Q +  H++  V  +A       + S S D  +++W
Subjt:  DCSDRFVDLSQNPVNALLKLDE-------DRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGTGCCGGCGGCGGGCGAGGGACAGCTGGCAGATTGGCGGGGCGCCGCCGAGTCCGAGATAATCCTCCCTCTCTTTCGCAGTGGGCCTGTACTGCCAGAC
CCTGCAGCAGCGGCAGCCGCAGCGGCGCACTCCAAATTTGGCCACCGCCACTCCGCCATCATCAGTATCTCCTCTCTCTTTCTCTCGGTACGTAGCCGGCCTTCT
CTAGTCTTCTCACCCTGTGTTGCCAATCCCTCTCTTCTACAATTTTTGGATACGATCCGAGTGCCAGAAAACAAGTTTACAGGCATGGAAGTAAACCTGGGAGGC
ATTCCTTTCGACTTGGACTTCCATCCATCCGACCAGTTGGTTGCAGCTGGTCTCATCGACGGCAATCTTCTCCTATACCGTTACAACGCAAATGCTTTACCCCAA
AGGCTATTGAAAGTTCGTGCACACAGTGAATCTTGCAGAGCCGTTCGGTTCATCAATGACGGACGTGCAATTTTGACGGGTTCTCCAGACCATTCCATTCTTGCG
ACGGATGTGGAGACTGGTTCTGTTATTGCTCGTCTCGAAGATGCACATGATGATGCAGTCAATAGATTGATCAACTTAACCTCGGAAACCATTGCTTCAGGAGAT
GACAATGGGTGCATCAAGGTATGGGATACCAGACAACGTTCTTGCTGCAGTTCTTTTGAAGCTCATGAAGATTATATTTCAGATATGACCTATTCAGCTGATTCC
ATGAAGCTTTTGGCGACAAGTGGAGATGGGACTCTATCTGTTTGCAATCTTCGGAGAAACAAGGTCCATGCTCGATCTGAGTTTTCAGAAGTTGAGCTGCTATCT
GTTGTTATCATGAAGAATGGACGTAAAGTCATCTGTGGATCGCAAACTGGAACTCTATTATTGTATTCATGGGGCTGCTTCAAGGACTGCAGTGATCGCTTTGTT
GATCTCTCTCAAAATCCTGTGAATGCTTTGCTAAAGCTTGATGAAGATCGAGTCATTGCTGGATCTGAGAGTGGACTCATCAGTCTGGTAGGCATATTACCGAAC
AGAATAATCCAACCAATTGCAGAACACTCTGACTATCCTGTGGAGCGGCTTGCTTTCTCCCATGACAGAAAATTTCTCGGCAGTATTTCACACGATTATATGTTG
AAGCTATGGGATATGAATTTATTGCAAAGTTCTGGAAACATTTTAAATGGTCAGACTACGGTAGATGCAAGTGCCAGTGACAATCTATTTACTTTCAT
mRNA sequenceShow/hide mRNA sequence
ATGGCCGTGCCGGCGGCGGGCGAGGGACAGCTGGCAGATTGGCGGGGCGCCGCCGAGTCCGAGATAATCCTCCCTCTCTTTCGCAGTGGGCCTGTACTGCCAGAC
CCTGCAGCAGCGGCAGCCGCAGCGGCGCACTCCAAATTTGGCCACCGCCACTCCGCCATCATCAGTATCTCCTCTCTCTTTCTCTCGGTACGTAGCCGGCCTTCT
CTAGTCTTCTCACCCTGTGTTGCCAATCCCTCTCTTCTACAATTTTTGGATACGATCCGAGTGCCAGAAAACAAGTTTACAGGCATGGAAGTAAACCTGGGAGGC
ATTCCTTTCGACTTGGACTTCCATCCATCCGACCAGTTGGTTGCAGCTGGTCTCATCGACGGCAATCTTCTCCTATACCGTTACAACGCAAATGCTTTACCCCAA
AGGCTATTGAAAGTTCGTGCACACAGTGAATCTTGCAGAGCCGTTCGGTTCATCAATGACGGACGTGCAATTTTGACGGGTTCTCCAGACCATTCCATTCTTGCG
ACGGATGTGGAGACTGGTTCTGTTATTGCTCGTCTCGAAGATGCACATGATGATGCAGTCAATAGATTGATCAACTTAACCTCGGAAACCATTGCTTCAGGAGAT
GACAATGGGTGCATCAAGGTATGGGATACCAGACAACGTTCTTGCTGCAGTTCTTTTGAAGCTCATGAAGATTATATTTCAGATATGACCTATTCAGCTGATTCC
ATGAAGCTTTTGGCGACAAGTGGAGATGGGACTCTATCTGTTTGCAATCTTCGGAGAAACAAGGTCCATGCTCGATCTGAGTTTTCAGAAGTTGAGCTGCTATCT
GTTGTTATCATGAAGAATGGACGTAAAGTCATCTGTGGATCGCAAACTGGAACTCTATTATTGTATTCATGGGGCTGCTTCAAGGACTGCAGTGATCGCTTTGTT
GATCTCTCTCAAAATCCTGTGAATGCTTTGCTAAAGCTTGATGAAGATCGAGTCATTGCTGGATCTGAGAGTGGACTCATCAGTCTGGTAGGCATATTACCGAAC
AGAATAATCCAACCAATTGCAGAACACTCTGACTATCCTGTGGAGCGGCTTGCTTTCTCCCATGACAGAAAATTTCTCGGCAGTATTTCACACGATTATATGTTG
AAGCTATGGGATATGAATTTATTGCAAAGTTCTGGAAACATTTTAAATGGTCAGACTACGGTAGATGCAAGTGCCAGTGACAATCTATTTACTTTCAT
Protein sequenceShow/hide protein sequence
MAVPAAGEGQLADWRGAAESEIILPLFRSGPVLPDPAAAAAAAAHSKFGHRHSAIISISSLFLSVRSRPSLVFSPCVANPSLLQFLDTIRVPENKFTGMEVNLGG
IPFDLDFHPSDQLVAAGLIDGNLLLYRYNANALPQRLLKVRAHSESCRAVRFINDGRAILTGSPDHSILATDVETGSVIARLEDAHDDAVNRLINLTSETIASGD
DNGCIKVWDTRQRSCCSSFEAHEDYISDMTYSADSMKLLATSGDGTLSVCNLRRNKVHARSEFSEVELLSVVIMKNGRKVICGSQTGTLLLYSWGCFKDCSDRFV
DLSQNPVNALLKLDEDRVIAGSESGLISLVGILPNRIIQPIAEHSDYPVERLAFSHDRKFLGSISHDYMLKLWDMNLLQSSGNILNGQTTVDASASDNLFTFX