; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG09G004430 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG09G004430
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionF-box/WD-40 repeat-containing protein
Genome locationCG_Chr09:3930674..3935209
RNA-Seq ExpressionClCG09G004430
SyntenyClCG09G004430
Gene Ontology termsGO:0051568 - histone H3-K4 methylation (biological process)
GO:0048188 - Set1C/COMPASS complex (cellular component)
GO:0042393 - histone binding (molecular function)
InterPro domainsIPR001680 - WD40 repeat
IPR001810 - F-box domain
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
IPR036047 - F-box-like domain superfamily
IPR036322 - WD40-repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0058585.1 F-box/WD-40 repeat-containing protein [Cucumis melo var. makuwa]5.0e-21782.59Show/hide
Query:  MTPPPAADRSSKRRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKSEILRTFCLRYQKQETNSASTSEVSFSLEKPLVECLEEIAM
        MTPPP ADRSS RRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRC  VCKSWNYAIYKSEILRTFCLRYQKQE NSASTS+VSFSLEKPL++CLEEIAM
Subjt:  MTPPPAADRSSKRRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKSEILRTFCLRYQKQETNSASTSEVSFSLEKPLVECLEEIAM

Query:  ERHKLALEEGRIRVSQWIGHSVRSLIAEVSDKIVSLFYCLGTCVFLSTAEDSLSSDNVMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGKIVGLVGRQLC
        ERHKLALE+GRIRV QWIGHSVR+   +   K+  +   +G              D VMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGKIVGLVG+QLC
Subjt:  ERHKLALEEGRIRVSQWIGHSVRSLIAEVSDKIVSLFYCLGTCVFLSTAEDSLSSDNVMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGKIVGLVGRQLC

Query:  IWSRSGKRSIFPSRECTFVKGLCMRLKVLLNWFSINLYDESSKPASSYFDAEAVVGSEDGTAHVFDMYSRRCSRIIRMLPGPVTCLCVSDDQLILGGSLL
        IWSRSGKRSIFPSRECTF KGLCMR                      YFDAEAVVG EDGTAHVFDMYSRRCSRIIRMLPGPVTCLCV+DDQL+ GGSLL
Subjt:  IWSRSGKRSIFPSRECTFVKGLCMRLKVLLNWFSINLYDESSKPASSYFDAEAVVGSEDGTAHVFDMYSRRCSRIIRMLPGPVTCLCVSDDQLILGGSLL

Query:  GNIGVSGLRSDQRVAMLRSRNTVGIRTLCYNDSTHLVFAGSTAGHVYCWDLRTMKSLWESRVSPNVVYSLQHLQNDRSSLAVGGIDGILRILDQNTGTVR
        GNIGVSGLRSDQRV MLRSRNTVGIRTLC NDS+ LVFAGSTAGHVYCWDLRTMKSLWESRVSPNV+YSLQHLQNDRSSLAVGGIDGILRILDQNTGTVR
Subjt:  GNIGVSGLRSDQRVAMLRSRNTVGIRTLCYNDSTHLVFAGSTAGHVYCWDLRTMKSLWESRVSPNVVYSLQHLQNDRSSLAVGGIDGILRILDQNTGTVR

Query:  SCCIMDRRLLSTYRNGLEVVEERIGKRLSDETPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQS
        SCC+MD RLLST++NGL  VEER GKRLSDETPIDAI+RRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQS
Subjt:  SCCIMDRRLLSTYRNGLEVVEERIGKRLSDETPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQS

XP_004135950.1 F-box/WD-40 repeat-containing protein At3g52030 isoform X1 [Cucumis sativus]3.2e-21681.53Show/hide
Query:  MTPPPAADRSSKRRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKSEILRTFCLRYQKQETNSASTSEVSFSLEKPLVECLEEIAM
        MTPPP ADRSS RRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRC  VCKSWNYAIYKSEILRTFCLRYQKQE NSASTSEVSFSLEKPL+ECLEEIAM
Subjt:  MTPPPAADRSSKRRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKSEILRTFCLRYQKQETNSASTSEVSFSLEKPLVECLEEIAM

Query:  ERHKLALEEGRIRVSQWIGHSVRSLIAEVSDKIVSLFYCLGTCVFLSTAEDSLSSDNVMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGKIVGLVGRQLC
        ERHKLALE+GRIRVSQWIGHSVR  + +   K+  +   +G              D VMRLWSPENFRCLEEYS+PEK+PL+DFDFDVGKIVGL+GRQLC
Subjt:  ERHKLALEEGRIRVSQWIGHSVRSLIAEVSDKIVSLFYCLGTCVFLSTAEDSLSSDNVMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGKIVGLVGRQLC

Query:  IWSRSGKRSIFPSRECTFVKGLCMRLKVLLNWFSINLYDESSKPASSYFDAEAVVGSEDGTAHVFDMYSRRCSRIIRMLPGPVTCLCVSDDQLILGGSLL
        IWSRSGKRSIFPSRECTF KGLCMR                      YFDAEAVVG EDGTAHVFDMYSRRCSRIIRMLPGPVTCLCV+DDQL+ GGSLL
Subjt:  IWSRSGKRSIFPSRECTFVKGLCMRLKVLLNWFSINLYDESSKPASSYFDAEAVVGSEDGTAHVFDMYSRRCSRIIRMLPGPVTCLCVSDDQLILGGSLL

Query:  GNIGVSGLRSDQRVAMLRSRNTVGIRTLCYNDSTHLVFAGSTAGHVYCWDLRTMKSLWESRVSPNVVYSLQHLQNDRSSLAVGGIDGILRILDQNTGTVR
        GNIGVSG+RSDQRV MLRSRNTVGIRTLCYN S+ LVFAGSTAGHVYCWDLRTMKSLWESRVSPNV+YSLQHLQNDRSSLAVGGIDGILRILDQNTGTV+
Subjt:  GNIGVSGLRSDQRVAMLRSRNTVGIRTLCYNDSTHLVFAGSTAGHVYCWDLRTMKSLWESRVSPNVVYSLQHLQNDRSSLAVGGIDGILRILDQNTGTVR

Query:  SCCIMDRRLLSTYRNGLEVVEERIGKRLSDETPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQS
        SCC+MD RLLST+++GL +VEER GKRLSDETPID I+RRNRPSITSLAVGMNKIVTTHNDKFI+LWKFQS
Subjt:  SCCIMDRRLLSTYRNGLEVVEERIGKRLSDETPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQS

XP_008461347.1 PREDICTED: F-box/WD-40 repeat-containing protein At3g52030 [Cucumis melo]8.5e-21782.38Show/hide
Query:  MTPPPAADRSSKRRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKSEILRTFCLRYQKQETNSASTSEVSFSLEKPLVECLEEIAM
        M PPP ADRSS RRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRC  VCKSWNYAIYKSEILRTFCLRYQKQE NSASTS+VSFSLEKPL++CLEEIAM
Subjt:  MTPPPAADRSSKRRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKSEILRTFCLRYQKQETNSASTSEVSFSLEKPLVECLEEIAM

Query:  ERHKLALEEGRIRVSQWIGHSVRSLIAEVSDKIVSLFYCLGTCVFLSTAEDSLSSDNVMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGKIVGLVGRQLC
        ERHKLALE+GRIRV QWIGHSVR+   +   K+  +   +G              D VMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGKIVGLVG+QLC
Subjt:  ERHKLALEEGRIRVSQWIGHSVRSLIAEVSDKIVSLFYCLGTCVFLSTAEDSLSSDNVMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGKIVGLVGRQLC

Query:  IWSRSGKRSIFPSRECTFVKGLCMRLKVLLNWFSINLYDESSKPASSYFDAEAVVGSEDGTAHVFDMYSRRCSRIIRMLPGPVTCLCVSDDQLILGGSLL
        IWSRSGKRSIFPSRECTF KGLCMR                      Y DAEAVVG EDGTAHVFDMYSRRCSRIIRMLPGPVTCLCV+DDQL+ GGSLL
Subjt:  IWSRSGKRSIFPSRECTFVKGLCMRLKVLLNWFSINLYDESSKPASSYFDAEAVVGSEDGTAHVFDMYSRRCSRIIRMLPGPVTCLCVSDDQLILGGSLL

Query:  GNIGVSGLRSDQRVAMLRSRNTVGIRTLCYNDSTHLVFAGSTAGHVYCWDLRTMKSLWESRVSPNVVYSLQHLQNDRSSLAVGGIDGILRILDQNTGTVR
        GNIGVSGLRSDQRV MLRSRNTVGIRTLCYNDS+ LVFAGSTAGHVYCWDLRTMKSLWESRVSPNV+YSLQHLQNDRSSLAVGGIDGILRILDQNTGTVR
Subjt:  GNIGVSGLRSDQRVAMLRSRNTVGIRTLCYNDSTHLVFAGSTAGHVYCWDLRTMKSLWESRVSPNVVYSLQHLQNDRSSLAVGGIDGILRILDQNTGTVR

Query:  SCCIMDRRLLSTYRNGLEVVEERIGKRLSDETPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQS
        SCC+MD RLLST++NGL  VEER GKRLSDETPIDAI+RRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQS
Subjt:  SCCIMDRRLLSTYRNGLEVVEERIGKRLSDETPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQS

XP_022149888.1 F-box/WD-40 repeat-containing protein At3g52030 isoform X1 [Momordica charantia]3.7e-20477.92Show/hide
Query:  MTPPPAADRSSKRRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKSEILRTFCLRYQKQETNSASTSEVSFSLEKPLVECLEEIAM
        M PPPAADRSS R+RS +D+K V SLSHD+LCIIFSFLDLFDLVRCSVVCKSWNYA++KSEIL+ FC R+QKQ+ NS S+SE+SFS EKPL  CLEEIAM
Subjt:  MTPPPAADRSSKRRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKSEILRTFCLRYQKQETNSASTSEVSFSLEKPLVECLEEIAM

Query:  ERHKLALEEGRIRVSQWIGHSVRSLIAEVSDKIVSLFYCLGTCVFLSTAEDSLSSDNVMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGKIVGLVGRQLC
        ERHKLA EE  I+VS W GHSVR+   +   K+  +   +G              D VMRLWS ENFRCLEEYSIP KVPL+DFDFD GKIVGLVGRQ+C
Subjt:  ERHKLALEEGRIRVSQWIGHSVRSLIAEVSDKIVSLFYCLGTCVFLSTAEDSLSSDNVMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGKIVGLVGRQLC

Query:  IWSRSGKRSIFPSRECTFVKGLCMRLKVLLNWFSINLYDESSKPASSYFDAEAVVGSEDGTAHVFDMYSRRCSRIIRMLPGPVTCLCVSDDQLILGGSLL
        IWSRSGKRSIFPSRECTF KGLCMR                      YFD EAVVG EDGTAHVFDMYSRRCSRIIRMLPGPVTCLCVSDDQLILGGS+L
Subjt:  IWSRSGKRSIFPSRECTFVKGLCMRLKVLLNWFSINLYDESSKPASSYFDAEAVVGSEDGTAHVFDMYSRRCSRIIRMLPGPVTCLCVSDDQLILGGSLL

Query:  GNIGVSGLRSDQRVAMLRSRNTVGIRTLCYNDSTHLVFAGSTAGHVYCWDLRTMKSLWESRVSPNVVYSLQHLQNDRSSLAVGGIDGILRILDQNTGTVR
        GNIGVSGL SDQRVAMLRSRNTVGIRTLCYN S+HLVFAGSTAGHVYCWDLRTMKSLWESRVSPN+VYSL+HLQNDRSSLAVGGIDGILRIL+QNTGTVR
Subjt:  GNIGVSGLRSDQRVAMLRSRNTVGIRTLCYNDSTHLVFAGSTAGHVYCWDLRTMKSLWESRVSPNVVYSLQHLQNDRSSLAVGGIDGILRILDQNTGTVR

Query:  SCCIMDRRLLSTYRNGLEVVEERIGKRLSDETPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQS
        SCCIMD RLLSTY+NGL VVEERIGKR+S ETPIDAINRRNRP ITSLAVGMNKIVTTHNDKFIRLWKFQ+
Subjt:  SCCIMDRRLLSTYRNGLEVVEERIGKRLSDETPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQS

XP_038899781.1 F-box/WD-40 repeat-containing protein At3g52030 [Benincasa hispida]2.8e-22083.65Show/hide
Query:  MTPPPAADRSSKRRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKSEILRTFCLRYQKQETNSASTSEVSFSLEKPLVECLEEIAM
        MTPPPAA+RSS RRRSD+DAKPVHSLS+DILCIIFSFLDLFDLVRCSVVCKSWNYAIYKSEILRTFC RYQKQE N+ASTSEVSFSLEKPL+ECLEEIAM
Subjt:  MTPPPAADRSSKRRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKSEILRTFCLRYQKQETNSASTSEVSFSLEKPLVECLEEIAM

Query:  ERHKLALEEGRIRVSQWIGHSVRSLIAEVSDKIVSLFYCLGTCVFLSTAEDSLSSDNVMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGKIVGLVGRQLC
        ERHKLAL+EGRIRVSQWIGHSVR+   +   K+  +   +G              D VMRLWSPE FRCLEEYSIPEKVPL+DFDFD+GKIVGLVGRQLC
Subjt:  ERHKLALEEGRIRVSQWIGHSVRSLIAEVSDKIVSLFYCLGTCVFLSTAEDSLSSDNVMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGKIVGLVGRQLC

Query:  IWSRSGKRSIFPSRECTFVKGLCMRLKVLLNWFSINLYDESSKPASSYFDAEAVVGSEDGTAHVFDMYSRRCSRIIRMLPGPVTCLCVSDDQLILGGSLL
        IWSRSG RSIFPSRECTFVKGLCMR                      YFDAEAVVG EDGTAHVFDMYSRRCSRIIRMLPGPVTCLCVSDDQLILGGSLL
Subjt:  IWSRSGKRSIFPSRECTFVKGLCMRLKVLLNWFSINLYDESSKPASSYFDAEAVVGSEDGTAHVFDMYSRRCSRIIRMLPGPVTCLCVSDDQLILGGSLL

Query:  GNIGVSGLRSDQRVAMLRSRNTVGIRTLCYNDSTHLVFAGSTAGHVYCWDLRTMKSLWESRVSPNVVYSLQHLQNDRSSLAVGGIDGILRILDQNTGTVR
        GNIGVSGLRSDQRVAMLRSRNTVGIR+LCYNDS+HLVFAGSTAGHVYCWDLR MKSLWESRVSPNVVYSLQHLQNDRSSLAVGGIDGILRILDQNTGTVR
Subjt:  GNIGVSGLRSDQRVAMLRSRNTVGIRTLCYNDSTHLVFAGSTAGHVYCWDLRTMKSLWESRVSPNVVYSLQHLQNDRSSLAVGGIDGILRILDQNTGTVR

Query:  SCCIMDRRLLSTYRNGLEVVEERIGKRLSDETPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQS
        SCCIMD RLLST++N L  VEERIGKRLSDETPIDAINRRNRPSITSLAVGMNKI TTHNDKFIRLWKFQS
Subjt:  SCCIMDRRLLSTYRNGLEVVEERIGKRLSDETPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQS

TrEMBL top hitse value%identityAlignment
A0A0A0KBD6 F-box domain-containing protein1.6e-21681.53Show/hide
Query:  MTPPPAADRSSKRRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKSEILRTFCLRYQKQETNSASTSEVSFSLEKPLVECLEEIAM
        MTPPP ADRSS RRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRC  VCKSWNYAIYKSEILRTFCLRYQKQE NSASTSEVSFSLEKPL+ECLEEIAM
Subjt:  MTPPPAADRSSKRRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKSEILRTFCLRYQKQETNSASTSEVSFSLEKPLVECLEEIAM

Query:  ERHKLALEEGRIRVSQWIGHSVRSLIAEVSDKIVSLFYCLGTCVFLSTAEDSLSSDNVMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGKIVGLVGRQLC
        ERHKLALE+GRIRVSQWIGHSVR  + +   K+  +   +G              D VMRLWSPENFRCLEEYS+PEK+PL+DFDFDVGKIVGL+GRQLC
Subjt:  ERHKLALEEGRIRVSQWIGHSVRSLIAEVSDKIVSLFYCLGTCVFLSTAEDSLSSDNVMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGKIVGLVGRQLC

Query:  IWSRSGKRSIFPSRECTFVKGLCMRLKVLLNWFSINLYDESSKPASSYFDAEAVVGSEDGTAHVFDMYSRRCSRIIRMLPGPVTCLCVSDDQLILGGSLL
        IWSRSGKRSIFPSRECTF KGLCMR                      YFDAEAVVG EDGTAHVFDMYSRRCSRIIRMLPGPVTCLCV+DDQL+ GGSLL
Subjt:  IWSRSGKRSIFPSRECTFVKGLCMRLKVLLNWFSINLYDESSKPASSYFDAEAVVGSEDGTAHVFDMYSRRCSRIIRMLPGPVTCLCVSDDQLILGGSLL

Query:  GNIGVSGLRSDQRVAMLRSRNTVGIRTLCYNDSTHLVFAGSTAGHVYCWDLRTMKSLWESRVSPNVVYSLQHLQNDRSSLAVGGIDGILRILDQNTGTVR
        GNIGVSG+RSDQRV MLRSRNTVGIRTLCYN S+ LVFAGSTAGHVYCWDLRTMKSLWESRVSPNV+YSLQHLQNDRSSLAVGGIDGILRILDQNTGTV+
Subjt:  GNIGVSGLRSDQRVAMLRSRNTVGIRTLCYNDSTHLVFAGSTAGHVYCWDLRTMKSLWESRVSPNVVYSLQHLQNDRSSLAVGGIDGILRILDQNTGTVR

Query:  SCCIMDRRLLSTYRNGLEVVEERIGKRLSDETPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQS
        SCC+MD RLLST+++GL +VEER GKRLSDETPID I+RRNRPSITSLAVGMNKIVTTHNDKFI+LWKFQS
Subjt:  SCCIMDRRLLSTYRNGLEVVEERIGKRLSDETPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQS

A0A1S3CEH9 F-box/WD-40 repeat-containing protein At3g520304.1e-21782.38Show/hide
Query:  MTPPPAADRSSKRRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKSEILRTFCLRYQKQETNSASTSEVSFSLEKPLVECLEEIAM
        M PPP ADRSS RRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRC  VCKSWNYAIYKSEILRTFCLRYQKQE NSASTS+VSFSLEKPL++CLEEIAM
Subjt:  MTPPPAADRSSKRRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKSEILRTFCLRYQKQETNSASTSEVSFSLEKPLVECLEEIAM

Query:  ERHKLALEEGRIRVSQWIGHSVRSLIAEVSDKIVSLFYCLGTCVFLSTAEDSLSSDNVMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGKIVGLVGRQLC
        ERHKLALE+GRIRV QWIGHSVR+   +   K+  +   +G              D VMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGKIVGLVG+QLC
Subjt:  ERHKLALEEGRIRVSQWIGHSVRSLIAEVSDKIVSLFYCLGTCVFLSTAEDSLSSDNVMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGKIVGLVGRQLC

Query:  IWSRSGKRSIFPSRECTFVKGLCMRLKVLLNWFSINLYDESSKPASSYFDAEAVVGSEDGTAHVFDMYSRRCSRIIRMLPGPVTCLCVSDDQLILGGSLL
        IWSRSGKRSIFPSRECTF KGLCMR                      Y DAEAVVG EDGTAHVFDMYSRRCSRIIRMLPGPVTCLCV+DDQL+ GGSLL
Subjt:  IWSRSGKRSIFPSRECTFVKGLCMRLKVLLNWFSINLYDESSKPASSYFDAEAVVGSEDGTAHVFDMYSRRCSRIIRMLPGPVTCLCVSDDQLILGGSLL

Query:  GNIGVSGLRSDQRVAMLRSRNTVGIRTLCYNDSTHLVFAGSTAGHVYCWDLRTMKSLWESRVSPNVVYSLQHLQNDRSSLAVGGIDGILRILDQNTGTVR
        GNIGVSGLRSDQRV MLRSRNTVGIRTLCYNDS+ LVFAGSTAGHVYCWDLRTMKSLWESRVSPNV+YSLQHLQNDRSSLAVGGIDGILRILDQNTGTVR
Subjt:  GNIGVSGLRSDQRVAMLRSRNTVGIRTLCYNDSTHLVFAGSTAGHVYCWDLRTMKSLWESRVSPNVVYSLQHLQNDRSSLAVGGIDGILRILDQNTGTVR

Query:  SCCIMDRRLLSTYRNGLEVVEERIGKRLSDETPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQS
        SCC+MD RLLST++NGL  VEER GKRLSDETPIDAI+RRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQS
Subjt:  SCCIMDRRLLSTYRNGLEVVEERIGKRLSDETPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQS

A0A5D3CFV2 F-box/WD-40 repeat-containing protein2.4e-21782.59Show/hide
Query:  MTPPPAADRSSKRRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKSEILRTFCLRYQKQETNSASTSEVSFSLEKPLVECLEEIAM
        MTPPP ADRSS RRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRC  VCKSWNYAIYKSEILRTFCLRYQKQE NSASTS+VSFSLEKPL++CLEEIAM
Subjt:  MTPPPAADRSSKRRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKSEILRTFCLRYQKQETNSASTSEVSFSLEKPLVECLEEIAM

Query:  ERHKLALEEGRIRVSQWIGHSVRSLIAEVSDKIVSLFYCLGTCVFLSTAEDSLSSDNVMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGKIVGLVGRQLC
        ERHKLALE+GRIRV QWIGHSVR+   +   K+  +   +G              D VMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGKIVGLVG+QLC
Subjt:  ERHKLALEEGRIRVSQWIGHSVRSLIAEVSDKIVSLFYCLGTCVFLSTAEDSLSSDNVMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGKIVGLVGRQLC

Query:  IWSRSGKRSIFPSRECTFVKGLCMRLKVLLNWFSINLYDESSKPASSYFDAEAVVGSEDGTAHVFDMYSRRCSRIIRMLPGPVTCLCVSDDQLILGGSLL
        IWSRSGKRSIFPSRECTF KGLCMR                      YFDAEAVVG EDGTAHVFDMYSRRCSRIIRMLPGPVTCLCV+DDQL+ GGSLL
Subjt:  IWSRSGKRSIFPSRECTFVKGLCMRLKVLLNWFSINLYDESSKPASSYFDAEAVVGSEDGTAHVFDMYSRRCSRIIRMLPGPVTCLCVSDDQLILGGSLL

Query:  GNIGVSGLRSDQRVAMLRSRNTVGIRTLCYNDSTHLVFAGSTAGHVYCWDLRTMKSLWESRVSPNVVYSLQHLQNDRSSLAVGGIDGILRILDQNTGTVR
        GNIGVSGLRSDQRV MLRSRNTVGIRTLC NDS+ LVFAGSTAGHVYCWDLRTMKSLWESRVSPNV+YSLQHLQNDRSSLAVGGIDGILRILDQNTGTVR
Subjt:  GNIGVSGLRSDQRVAMLRSRNTVGIRTLCYNDSTHLVFAGSTAGHVYCWDLRTMKSLWESRVSPNVVYSLQHLQNDRSSLAVGGIDGILRILDQNTGTVR

Query:  SCCIMDRRLLSTYRNGLEVVEERIGKRLSDETPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQS
        SCC+MD RLLST++NGL  VEER GKRLSDETPIDAI+RRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQS
Subjt:  SCCIMDRRLLSTYRNGLEVVEERIGKRLSDETPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQS

A0A6J1D7Z0 F-box/WD-40 repeat-containing protein At3g52030 isoform X11.8e-20477.92Show/hide
Query:  MTPPPAADRSSKRRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKSEILRTFCLRYQKQETNSASTSEVSFSLEKPLVECLEEIAM
        M PPPAADRSS R+RS +D+K V SLSHD+LCIIFSFLDLFDLVRCSVVCKSWNYA++KSEIL+ FC R+QKQ+ NS S+SE+SFS EKPL  CLEEIAM
Subjt:  MTPPPAADRSSKRRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKSEILRTFCLRYQKQETNSASTSEVSFSLEKPLVECLEEIAM

Query:  ERHKLALEEGRIRVSQWIGHSVRSLIAEVSDKIVSLFYCLGTCVFLSTAEDSLSSDNVMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGKIVGLVGRQLC
        ERHKLA EE  I+VS W GHSVR+   +   K+  +   +G              D VMRLWS ENFRCLEEYSIP KVPL+DFDFD GKIVGLVGRQ+C
Subjt:  ERHKLALEEGRIRVSQWIGHSVRSLIAEVSDKIVSLFYCLGTCVFLSTAEDSLSSDNVMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGKIVGLVGRQLC

Query:  IWSRSGKRSIFPSRECTFVKGLCMRLKVLLNWFSINLYDESSKPASSYFDAEAVVGSEDGTAHVFDMYSRRCSRIIRMLPGPVTCLCVSDDQLILGGSLL
        IWSRSGKRSIFPSRECTF KGLCMR                      YFD EAVVG EDGTAHVFDMYSRRCSRIIRMLPGPVTCLCVSDDQLILGGS+L
Subjt:  IWSRSGKRSIFPSRECTFVKGLCMRLKVLLNWFSINLYDESSKPASSYFDAEAVVGSEDGTAHVFDMYSRRCSRIIRMLPGPVTCLCVSDDQLILGGSLL

Query:  GNIGVSGLRSDQRVAMLRSRNTVGIRTLCYNDSTHLVFAGSTAGHVYCWDLRTMKSLWESRVSPNVVYSLQHLQNDRSSLAVGGIDGILRILDQNTGTVR
        GNIGVSGL SDQRVAMLRSRNTVGIRTLCYN S+HLVFAGSTAGHVYCWDLRTMKSLWESRVSPN+VYSL+HLQNDRSSLAVGGIDGILRIL+QNTGTVR
Subjt:  GNIGVSGLRSDQRVAMLRSRNTVGIRTLCYNDSTHLVFAGSTAGHVYCWDLRTMKSLWESRVSPNVVYSLQHLQNDRSSLAVGGIDGILRILDQNTGTVR

Query:  SCCIMDRRLLSTYRNGLEVVEERIGKRLSDETPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQS
        SCCIMD RLLSTY+NGL VVEERIGKR+S ETPIDAINRRNRP ITSLAVGMNKIVTTHNDKFIRLWKFQ+
Subjt:  SCCIMDRRLLSTYRNGLEVVEERIGKRLSDETPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQS

A0A6J1HB56 F-box/WD-40 repeat-containing protein At3g52030 isoform X13.7e-20277.07Show/hide
Query:  MTPPPAADRSSKRRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKSEILRTFCLRYQKQETNSASTSEVSFSLEKPLVECLEEIAM
        M PP  ADRSS RRRS++DAKPV+SLSHDILCIIFSFLDLFDLVRCSVVCKSWN AI+ SE+LRTFC+++QKQE  S+S+ EVS S EKPL+ECLEEIAM
Subjt:  MTPPPAADRSSKRRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKSEILRTFCLRYQKQETNSASTSEVSFSLEKPLVECLEEIAM

Query:  ERHKLALEEGRIRVSQWIGHSVRSLIAEVSDKIVSLFYCLGTCVFLSTAEDSLSSDNVMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGKIVGLVGRQLC
        ERHKLALEEGRIRVSQW+GHSVR+   +   K+  +   +G              D VMRLWS ENFRCLEEYSIPEK+PLIDFDFD  KIVGLVGR LC
Subjt:  ERHKLALEEGRIRVSQWIGHSVRSLIAEVSDKIVSLFYCLGTCVFLSTAEDSLSSDNVMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGKIVGLVGRQLC

Query:  IWSRSGKRSIFPSRECTFVKGLCMRLKVLLNWFSINLYDESSKPASSYFDAEAVVGSEDGTAHVFDMYSRRCSRIIRMLPGPVTCLCVSDDQLILGGSLL
        IWSRSGKRSIFPSRECTFV+G CMR                      YFD EAVVG  DGTAHVFDMYSRRCSRI+RMLPGPVTCLCV DDQLILGGSL 
Subjt:  IWSRSGKRSIFPSRECTFVKGLCMRLKVLLNWFSINLYDESSKPASSYFDAEAVVGSEDGTAHVFDMYSRRCSRIIRMLPGPVTCLCVSDDQLILGGSLL

Query:  GNIGVSGLRSDQRVAMLRSRNTVGIRTLCYNDSTHLVFAGSTAGHVYCWDLRTMKSLWESRVSPNVVYSLQHLQNDRSSLAVGGIDGILRILDQNTGTVR
        GNIGVSGLRSDQRVAMLRSRNT+GI+T+CYN S+HLVFAGSTAGHVYCWDLRTMK LWESRVSPNVVYSL+HLQNDRSSLAVGGIDGILRILDQNTGTVR
Subjt:  GNIGVSGLRSDQRVAMLRSRNTVGIRTLCYNDSTHLVFAGSTAGHVYCWDLRTMKSLWESRVSPNVVYSLQHLQNDRSSLAVGGIDGILRILDQNTGTVR

Query:  SCCIMDRRLLSTYRNGLEVVEERIGKRLSDETPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQS
        S CIMD RLLSTY++G+ VVEERIG RLSDETPIDAI+RR+RP ITSLAVGMNKIVTTHNDKFIRLWKF++
Subjt:  SCCIMDRRLLSTYRNGLEVVEERIGKRLSDETPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQS

SwissProt top hitse value%identityAlignment
Q9SV01 F-box/WD-40 repeat-containing protein At3g520302.5e-11046.8Show/hide
Query:  DRSSKRRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKSEILRTFCLRYQKQETNSASTSEVSFSLEKPLVECLEEIAMERHKLAL
        D SS R+        + SL  DILCIIFSFLDLFDLV C+VVC SWN  I + ++L+  C +     ++S S+S    SL++P    +E+ AM+ HK+AL
Subjt:  DRSSKRRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKSEILRTFCLRYQKQETNSASTSEVSFSLEKPLVECLEEIAMERHKLAL

Query:  EEGRIRVSQWIGHSVRSLIAEVSDKIVSLFYCLGTCVFLSTAEDSLSSDNVMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGK-----------------
          GRI + +W  HS R  +++   K   L   +G              D VMRLWS ++++C+EEYS+P+   LIDFDFD  K                 
Subjt:  EEGRIRVSQWIGHSVRSLIAEVSDKIVSLFYCLGTCVFLSTAEDSLSSDNVMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGK-----------------

Query:  ----IVGLVGRQLCIWSRSGKRSIFPSRECTFVKGLCMRLKVLLNWFSINLYDESSKPASSYFDAEAVVGSEDGTAHVFDMYSRRCSRIIRMLPGPVTCL
            IVGLVG ++ IW R+G+RSIFPSR  TF KGLCMR                      Y D EAVVG EDGTA VFDMYS+ CS+IIR   GP+TCL
Subjt:  ----IVGLVGRQLCIWSRSGKRSIFPSRECTFVKGLCMRLKVLLNWFSINLYDESSKPASSYFDAEAVVGSEDGTAHVFDMYSRRCSRIIRMLPGPVTCL

Query:  CVSDDQLILGGSLLGNIGVSGLRSDQRVAMLRSRNTV-GIRTLCYNDSTHLVFAGSTAGHVYCWDLRTMKSLWESRVSPNVVYSLQHLQNDRSSLAVGGI
         +SD+QL L GS LG + VS    DQ VA L+S  T  GI+T+C+N  T+L F G+T G+V CWDLR M  LWE RVSPNVVYS+Q L+ND S +  GGI
Subjt:  CVSDDQLILGGSLLGNIGVSGLRSDQRVAMLRSRNTV-GIRTLCYNDSTHLVFAGSTAGHVYCWDLRTMKSLWESRVSPNVVYSLQHLQNDRSSLAVGGI

Query:  DGILRILDQNTGTVRSCCIMDRRLLST-YRNGLEVVEERIGKRLSDETPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKF
        DG+LR++DQ +G V S  IMD +  +T  RN   V+E+R GKR+S +  ID I R+ RP I+ +A+GM K+VT HN K I +WKF
Subjt:  DGILRILDQNTGTVRSCCIMDRRLLST-YRNGLEVVEERIGKRLSDETPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKF

Arabidopsis top hitse value%identityAlignment
AT3G52030.1 F-box family protein with WD40/YVTN repeat doamin1.8e-11146.8Show/hide
Query:  DRSSKRRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKSEILRTFCLRYQKQETNSASTSEVSFSLEKPLVECLEEIAMERHKLAL
        D SS R+        + SL  DILCIIFSFLDLFDLV C+VVC SWN  I + ++L+  C +     ++S S+S    SL++P    +E+ AM+ HK+AL
Subjt:  DRSSKRRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKSEILRTFCLRYQKQETNSASTSEVSFSLEKPLVECLEEIAMERHKLAL

Query:  EEGRIRVSQWIGHSVRSLIAEVSDKIVSLFYCLGTCVFLSTAEDSLSSDNVMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGK-----------------
          GRI + +W  HS R  +++   K   L   +G              D VMRLWS ++++C+EEYS+P+   LIDFDFD  K                 
Subjt:  EEGRIRVSQWIGHSVRSLIAEVSDKIVSLFYCLGTCVFLSTAEDSLSSDNVMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGK-----------------

Query:  ----IVGLVGRQLCIWSRSGKRSIFPSRECTFVKGLCMRLKVLLNWFSINLYDESSKPASSYFDAEAVVGSEDGTAHVFDMYSRRCSRIIRMLPGPVTCL
            IVGLVG ++ IW R+G+RSIFPSR  TF KGLCMR                      Y D EAVVG EDGTA VFDMYS+ CS+IIR   GP+TCL
Subjt:  ----IVGLVGRQLCIWSRSGKRSIFPSRECTFVKGLCMRLKVLLNWFSINLYDESSKPASSYFDAEAVVGSEDGTAHVFDMYSRRCSRIIRMLPGPVTCL

Query:  CVSDDQLILGGSLLGNIGVSGLRSDQRVAMLRSRNTV-GIRTLCYNDSTHLVFAGSTAGHVYCWDLRTMKSLWESRVSPNVVYSLQHLQNDRSSLAVGGI
         +SD+QL L GS LG + VS    DQ VA L+S  T  GI+T+C+N  T+L F G+T G+V CWDLR M  LWE RVSPNVVYS+Q L+ND S +  GGI
Subjt:  CVSDDQLILGGSLLGNIGVSGLRSDQRVAMLRSRNTV-GIRTLCYNDSTHLVFAGSTAGHVYCWDLRTMKSLWESRVSPNVVYSLQHLQNDRSSLAVGGI

Query:  DGILRILDQNTGTVRSCCIMDRRLLST-YRNGLEVVEERIGKRLSDETPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKF
        DG+LR++DQ +G V S  IMD +  +T  RN   V+E+R GKR+S +  ID I R+ RP I+ +A+GM K+VT HN K I +WKF
Subjt:  DGILRILDQNTGTVRSCCIMDRRLLST-YRNGLEVVEERIGKRLSDETPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKF

AT3G52030.2 F-box family protein with WD40/YVTN repeat doamin3.4e-11548.92Show/hide
Query:  DRSSKRRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKSEILRTFCLRYQKQETNSASTSEVSFSLEKPLVECLEEIAMERHKLAL
        D SS R+        + SL  DILCIIFSFLDLFDLV C+VVC SWN  I + ++L+  C +     ++S S+S    SL++P    +E+ AM+ HK+AL
Subjt:  DRSSKRRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKSEILRTFCLRYQKQETNSASTSEVSFSLEKPLVECLEEIAMERHKLAL

Query:  EEGRIRVSQWIGHSVRSLIAEVSDKIVSLFYCLGTCVFLSTAEDSLSSDNVMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGKIVGLVGRQLCIWSRSGK
          GRI + +W  HS R  +++   K   L   +G              D VMRLWS ++++C+EEYS+P+   LIDFDFD  KIVGLVG ++ IW R+G+
Subjt:  EEGRIRVSQWIGHSVRSLIAEVSDKIVSLFYCLGTCVFLSTAEDSLSSDNVMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGKIVGLVGRQLCIWSRSGK

Query:  RSIFPSRECTFVKGLCMRLKVLLNWFSINLYDESSKPASSYFDAEAVVGSEDGTAHVFDMYSRRCSRIIRMLPGPVTCLCVSDDQLILGGSLLGNIGVSG
        RSIFPSR  TF KGLCMR                      Y D EAVVG EDGTA VFDMYS+ CS+IIR   GP+TCL +SD+QL L GS LG + VS 
Subjt:  RSIFPSRECTFVKGLCMRLKVLLNWFSINLYDESSKPASSYFDAEAVVGSEDGTAHVFDMYSRRCSRIIRMLPGPVTCLCVSDDQLILGGSLLGNIGVSG

Query:  LRSDQRVAMLRSRNTV-GIRTLCYNDSTHLVFAGSTAGHVYCWDLRTMKSLWESRVSPNVVYSLQHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCIMD
           DQ VA L+S  T  GI+T+C+N  T+L F G+T G+V CWDLR M  LWE RVSPNVVYS+Q L+ND S +  GGIDG+LR++DQ +G V S  IMD
Subjt:  LRSDQRVAMLRSRNTV-GIRTLCYNDSTHLVFAGSTAGHVYCWDLRTMKSLWESRVSPNVVYSLQHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCIMD

Query:  RRLLST-YRNGLEVVEERIGKRLSDETPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKF
         +  +T  RN   V+E+R GKR+S +  ID I R+ RP I+ +A+GM K+VT HN K I +WKF
Subjt:  RRLLST-YRNGLEVVEERIGKRLSDETPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTCCTCCACCGGCCGCCGACAGGTCATCGAAGAGAAGACGGAGTGATGTTGATGCAAAACCAGTTCACTCCCTAAGCCACGACATCTTGTGCATAATTTTC
TCGTTTCTTGACCTTTTCGACTTGGTTCGATGCTCAGTTGTTTGTAAATCCTGGAATTATGCTATTTATAAGTCCGAGATATTGCGGACGTTTTGCTTGAGGTAT
CAGAAGCAGGAGACGAACTCCGCTAGTACTTCCGAAGTTTCATTTTCTTTGGAGAAACCATTGGTGGAATGTTTAGAGGAAATAGCGATGGAACGACACAAGTTG
GCATTGGAGGAAGGTCGTATTAGAGTTTCTCAATGGATTGGCCATTCAGTGAGGAGTTTGATTGCTGAAGTATCAGATAAAATCGTTTCCTTATTTTACTGTCTT
GGCACTTGTGTTTTCTTATCTACAGCTGAGGATTCTTTATCCTCAGACAATGTTATGAGACTTTGGTCGCCAGAGAACTTTAGATGTCTAGAAGAATATTCAATT
CCTGAGAAAGTGCCATTAATCGATTTTGATTTTGATGTGGGCAAGATTGTTGGTTTGGTTGGCAGACAGTTGTGCATATGGAGCCGGAGTGGGAAAAGAAGTATA
TTTCCTTCGCGTGAATGTACTTTCGTGAAGGGGTTGTGTATGCGGCTTAAAGTGCTTCTCAATTGGTTTTCAATTAATTTGTATGATGAGTCTTCCAAACCTGCT
TCCAGTTACTTTGATGCAGAGGCTGTTGTGGGTTCTGAAGATGGAACAGCTCATGTATTTGACATGTACAGTAGGAGATGCTCTAGAATTATCAGGATGCTTCCT
GGGCCCGTGACTTGCTTATGTGTGAGTGATGATCAGCTCATACTTGGTGGTTCCCTACTTGGGAACATTGGAGTATCGGGTCTTCGGTCTGATCAGCGGGTAGCA
ATGCTCAGATCAAGAAATACCGTAGGCATAAGGACTTTGTGTTACAACGATTCTACACATTTAGTGTTCGCGGGATCAACTGCCGGCCATGTCTATTGTTGGGAC
CTCAGGACTATGAAATCCTTATGGGAATCTCGAGTGAGCCCGAACGTCGTATATTCTTTGCAACATCTCCAAAATGACAGGTCAAGTTTGGCCGTTGGTGGAATA
GATGGCATTCTACGTATTTTAGATCAGAATACAGGCACGGTGCGGTCGTGTTGTATTATGGATCGTAGATTGTTATCAACATACCGGAATGGTCTCGAAGTTGTC
GAAGAAAGGATAGGGAAAAGATTGTCAGATGAGACTCCAATTGATGCCATAAACAGAAGGAATAGGCCTTCGATCACAAGCTTGGCCGTTGGAATGAATAAGATA
GTCACAACGCACAATGATAAGTTCATTAGATTGTGGAAGTTTCAAAGCTAA
mRNA sequenceShow/hide mRNA sequence
GGCGAATGGGCCTCTTGAAGAACGTATCCATCAGCAGGTCCAAAGCCCACACAGATCCTTCTCCAAGTAGCTACCGGCCTCCTTCTCCAGACGTCGGCGTTTTGC
TTCAATCCTTCTCCGTCCGTGAAACCGCCGCCGGTGATTGCCGGCATGACTCCTCCACCGGCCGCCGACAGGTCATCGAAGAGAAGACGGAGTGATGTTGATGCA
AAACCAGTTCACTCCCTAAGCCACGACATCTTGTGCATAATTTTCTCGTTTCTTGACCTTTTCGACTTGGTTCGATGCTCAGTTGTTTGTAAATCCTGGAATTAT
GCTATTTATAAGTCCGAGATATTGCGGACGTTTTGCTTGAGGTATCAGAAGCAGGAGACGAACTCCGCTAGTACTTCCGAAGTTTCATTTTCTTTGGAGAAACCA
TTGGTGGAATGTTTAGAGGAAATAGCGATGGAACGACACAAGTTGGCATTGGAGGAAGGTCGTATTAGAGTTTCTCAATGGATTGGCCATTCAGTGAGGAGTTTG
ATTGCTGAAGTATCAGATAAAATCGTTTCCTTATTTTACTGTCTTGGCACTTGTGTTTTCTTATCTACAGCTGAGGATTCTTTATCCTCAGACAATGTTATGAGA
CTTTGGTCGCCAGAGAACTTTAGATGTCTAGAAGAATATTCAATTCCTGAGAAAGTGCCATTAATCGATTTTGATTTTGATGTGGGCAAGATTGTTGGTTTGGTT
GGCAGACAGTTGTGCATATGGAGCCGGAGTGGGAAAAGAAGTATATTTCCTTCGCGTGAATGTACTTTCGTGAAGGGGTTGTGTATGCGGCTTAAAGTGCTTCTC
AATTGGTTTTCAATTAATTTGTATGATGAGTCTTCCAAACCTGCTTCCAGTTACTTTGATGCAGAGGCTGTTGTGGGTTCTGAAGATGGAACAGCTCATGTATTT
GACATGTACAGTAGGAGATGCTCTAGAATTATCAGGATGCTTCCTGGGCCCGTGACTTGCTTATGTGTGAGTGATGATCAGCTCATACTTGGTGGTTCCCTACTT
GGGAACATTGGAGTATCGGGTCTTCGGTCTGATCAGCGGGTAGCAATGCTCAGATCAAGAAATACCGTAGGCATAAGGACTTTGTGTTACAACGATTCTACACAT
TTAGTGTTCGCGGGATCAACTGCCGGCCATGTCTATTGTTGGGACCTCAGGACTATGAAATCCTTATGGGAATCTCGAGTGAGCCCGAACGTCGTATATTCTTTG
CAACATCTCCAAAATGACAGGTCAAGTTTGGCCGTTGGTGGAATAGATGGCATTCTACGTATTTTAGATCAGAATACAGGCACGGTGCGGTCGTGTTGTATTATG
GATCGTAGATTGTTATCAACATACCGGAATGGTCTCGAAGTTGTCGAAGAAAGGATAGGGAAAAGATTGTCAGATGAGACTCCAATTGATGCCATAAACAGAAGG
AATAGGCCTTCGATCACAAGCTTGGCCGTTGGAATGAATAAGATAGTCACAACGCACAATGATAAGTTCATTAGATTGTGGAAGTTTCAAAGCTAATTTTATTTC
CTATGCTCTTTATTTAAAGTGTATATATATATTTGCATATTGCCGTACGAGATTTATGATGAGTAATTTCAAGTTCTGATCAGATACAACTTTCGCTTTCAAGTA
AAACTTATAGATTTCAACTCAGTGGATGGGGATAAATTTTTTGAGTTCAT
Protein sequenceShow/hide protein sequence
MTPPPAADRSSKRRRSDVDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKSEILRTFCLRYQKQETNSASTSEVSFSLEKPLVECLEEIAMERHKL
ALEEGRIRVSQWIGHSVRSLIAEVSDKIVSLFYCLGTCVFLSTAEDSLSSDNVMRLWSPENFRCLEEYSIPEKVPLIDFDFDVGKIVGLVGRQLCIWSRSGKRSI
FPSRECTFVKGLCMRLKVLLNWFSINLYDESSKPASSYFDAEAVVGSEDGTAHVFDMYSRRCSRIIRMLPGPVTCLCVSDDQLILGGSLLGNIGVSGLRSDQRVA
MLRSRNTVGIRTLCYNDSTHLVFAGSTAGHVYCWDLRTMKSLWESRVSPNVVYSLQHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCIMDRRLLSTYRNGLEVV
EERIGKRLSDETPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQS