; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017928 (gene) of Snake gourd v1 genome

Gene IDTan0017928
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionF-box/WD-40 repeat-containing protein
Genome locationLG06:4805064..4811591
RNA-Seq ExpressionTan0017928
SyntenyTan0017928
Gene Ontology termsGO:0051568 - histone H3-K4 methylation (biological process)
GO:0048188 - Set1C/COMPASS complex (cellular component)
GO:0042393 - histone binding (molecular function)
InterPro domainsIPR001680 - WD40 repeat
IPR001810 - F-box domain
IPR011047 - Quinoprotein alcohol dehydrogenase-like superfamily
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
IPR036047 - F-box-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593280.1 F-box/WD-40 repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]5.7e-22086.61Show/hide
Query:  MAPPAAADRSSTRIRSDIDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKYEILRTFLLRHQKQALQSANTSEVSFSSKKPLLECLEEIAM
        MAPP  ADRSS R RS+IDAKPV+SLSHDILCIIFSFLDLFDLVRCSVVCKSWN AI+  E+LRTF ++HQKQ ++S+++ EVS SS+KPLLECLEEIAM
Subjt:  MAPPAAADRSSTRIRSDIDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKYEILRTFLLRHQKQALQSANTSEVSFSSKKPLLECLEEIAM

Query:  ERHKFALEEGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCLDEYSIPEKLPLIDFDFDESKIVGLVGRRLCIWNRSGKRSIFPLREC
        ERHK ALEEGRIRVSQW+GHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCL+EYSIPEKLPLIDFDFDESKIVGLVGR LCIW+RSGKRSIFP REC
Subjt:  ERHKFALEEGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCLDEYSIPEKLPLIDFDFDESKIVGLVGRRLCIWNRSGKRSIFPLREC

Query:  TFVEGSCMRYFDPEAIVGCEDGTAHVFDMYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAMLRSRNTVGIRSLCYNPSSHLVF
        TFVEGSCMRYFDPEA+VGC DGTAHVFDMYSRRCSRI+R+L GPVTCLCV DDQLILGGSL GNIGV GLRSDQRVAMLRSRNT+GI+++CYN SSHLVF
Subjt:  TFVEGSCMRYFDPEAIVGCEDGTAHVFDMYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAMLRSRNTVGIRSLCYNPSSHLVF

Query:  AGSTAGHVYCWDLRTLKSLWGYRVSPNVIYSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCITDSRLLSTYQNSLGVVEERIGKRLSDEAPIDAIN
        AGSTAGHVYCWDLRT+K LW  RVSPNV+YSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRS CI DSRLLSTYQ+ +GVVEERIG RLSDE PIDAI+
Subjt:  AGSTAGHVYCWDLRTLKSLWGYRVSPNVIYSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCITDSRLLSTYQNSLGVVEERIGKRLSDEAPIDAIN

Query:  RRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQN
        RR+RP ITSLAVGMNKIVTTHNDKFIRLWKF+N
Subjt:  RRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQN

XP_022960484.1 F-box/WD-40 repeat-containing protein At3g52030 isoform X1 [Cucurbita moschata]7.5e-22086.61Show/hide
Query:  MAPPAAADRSSTRIRSDIDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKYEILRTFLLRHQKQALQSANTSEVSFSSKKPLLECLEEIAM
        MAPP  ADRSS R RS+IDAKPV+SLSHDILCIIFSFLDLFDLVRCSVVCKSWN AI+  E+LRTF ++HQKQ ++S+++ EVS SS+KPLLECLEEIAM
Subjt:  MAPPAAADRSSTRIRSDIDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKYEILRTFLLRHQKQALQSANTSEVSFSSKKPLLECLEEIAM

Query:  ERHKFALEEGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCLDEYSIPEKLPLIDFDFDESKIVGLVGRRLCIWNRSGKRSIFPLREC
        ERHK ALEEGRIRVSQW+GHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCL+EYSIPEKLPLIDFDFDESKIVGLVGR LCIW+RSGKRSIFP REC
Subjt:  ERHKFALEEGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCLDEYSIPEKLPLIDFDFDESKIVGLVGRRLCIWNRSGKRSIFPLREC

Query:  TFVEGSCMRYFDPEAIVGCEDGTAHVFDMYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAMLRSRNTVGIRSLCYNPSSHLVF
        TFVEGSCMRYFDPEA+VGC DGTAHVFDMYSRRCSRI+R+L GPVTCLCV DDQLILGGSL GNIGV GLRSDQRVAMLRSRNT+GI+++CYN SSHLVF
Subjt:  TFVEGSCMRYFDPEAIVGCEDGTAHVFDMYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAMLRSRNTVGIRSLCYNPSSHLVF

Query:  AGSTAGHVYCWDLRTLKSLWGYRVSPNVIYSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCITDSRLLSTYQNSLGVVEERIGKRLSDEAPIDAIN
        AGSTAGHVYCWDLRT+K LW  RVSPNV+YSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRS CI DSRLLSTYQ+ +GVVEERIG RLSDE PIDAI+
Subjt:  AGSTAGHVYCWDLRTLKSLWGYRVSPNVIYSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCITDSRLLSTYQNSLGVVEERIGKRLSDEAPIDAIN

Query:  RRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQN
        RR+RP ITSLAVGMNKIVTTHNDKFIRLWKF+N
Subjt:  RRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQN

XP_023004546.1 F-box/WD-40 repeat-containing protein At3g52030 isoform X1 [Cucurbita maxima]2.9e-21986.14Show/hide
Query:  MAPPAAADRSSTRIRSDIDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKYEILRTFLLRHQKQALQSANTSEVSFSSKKPLLECLEEIAM
        MAPP  ADRSS + RS+IDAKPV+SLSHDILCIIFSFLDLFDLVRCSVVCKSWN AI+  E+LRTF ++HQKQ ++S+++ +VS SS+KPLLECLEEIAM
Subjt:  MAPPAAADRSSTRIRSDIDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKYEILRTFLLRHQKQALQSANTSEVSFSSKKPLLECLEEIAM

Query:  ERHKFALEEGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCLDEYSIPEKLPLIDFDFDESKIVGLVGRRLCIWNRSGKRSIFPLREC
        ERHK ALEEGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCL+EYSIPEKLPLIDFDFDESKIVGLVGR +CIW+RSGKRSIFP REC
Subjt:  ERHKFALEEGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCLDEYSIPEKLPLIDFDFDESKIVGLVGRRLCIWNRSGKRSIFPLREC

Query:  TFVEGSCMRYFDPEAIVGCEDGTAHVFDMYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAMLRSRNTVGIRSLCYNPSSHLVF
        TFVEGSCMRYFDPEA+VGC DGTAHVFDMYSRRCSRI+R+L GPVTCLCV DDQLILGGSL GNIGV GLRSDQRVAMLRSRNT+GI+++CYN SSHLVF
Subjt:  TFVEGSCMRYFDPEAIVGCEDGTAHVFDMYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAMLRSRNTVGIRSLCYNPSSHLVF

Query:  AGSTAGHVYCWDLRTLKSLWGYRVSPNVIYSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCITDSRLLSTYQNSLGVVEERIGKRLSDEAPIDAIN
        AGSTAGHVYCWDLRT+K LW  RVSPNV+YSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRS CI DSRLLSTYQ+ +GVVEERIG RLSDE PIDAI+
Subjt:  AGSTAGHVYCWDLRTLKSLWGYRVSPNVIYSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCITDSRLLSTYQNSLGVVEERIGKRLSDEAPIDAIN

Query:  RRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQN
        RR+RP ITSLAVGMNKIVTTHNDKFIRLWKF+N
Subjt:  RRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQN

XP_023513731.1 F-box/WD-40 repeat-containing protein At3g52030 isoform X1 [Cucurbita pepo subsp. pepo]3.4e-22086.84Show/hide
Query:  MAPPAAADRSSTRIRSDIDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKYEILRTFLLRHQKQALQSANTSEVSFSSKKPLLECLEEIAM
        MAPP  ADRSS R RS+IDAKPV+SLSHDILCIIFSFLDLFDLVRCSVVCKSWN AI+  E+LRTF ++HQKQ ++S+++ EVS SS+KPLLECLEEIAM
Subjt:  MAPPAAADRSSTRIRSDIDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKYEILRTFLLRHQKQALQSANTSEVSFSSKKPLLECLEEIAM

Query:  ERHKFALEEGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCLDEYSIPEKLPLIDFDFDESKIVGLVGRRLCIWNRSGKRSIFPLREC
        ERHK ALEEGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCL+EYSIPEKLPLIDFDFDESKIVGLVGR LCIW+RSGKRSIFP REC
Subjt:  ERHKFALEEGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCLDEYSIPEKLPLIDFDFDESKIVGLVGRRLCIWNRSGKRSIFPLREC

Query:  TFVEGSCMRYFDPEAIVGCEDGTAHVFDMYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAMLRSRNTVGIRSLCYNPSSHLVF
        TFVEGSCMRYFDPEA+VGC DGTAHVFDMYSRRCSRI+R+L GPVTCLCV DDQLILGGSL GNIGV GLRSDQRVAMLRSRNT+GI+++CYN SSHLVF
Subjt:  TFVEGSCMRYFDPEAIVGCEDGTAHVFDMYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAMLRSRNTVGIRSLCYNPSSHLVF

Query:  AGSTAGHVYCWDLRTLKSLWGYRVSPNVIYSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCITDSRLLSTYQNSLGVVEERIGKRLSDEAPIDAIN
        AGSTAGHVYCWDLRT+K LW  RVSPNV+YSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRS CI DSRLLSTYQ+ +GVVEERIG RLSDE PIDAI+
Subjt:  AGSTAGHVYCWDLRTLKSLWGYRVSPNVIYSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCITDSRLLSTYQNSLGVVEERIGKRLSDEAPIDAIN

Query:  RRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQN
        RR+RP ITSLAVGMNKIVTTHNDKFIRLWKF+N
Subjt:  RRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQN

XP_038899781.1 F-box/WD-40 repeat-containing protein At3g52030 [Benincasa hispida]1.5e-22388.68Show/hide
Query:  MAPPAAADRSSTRIRSDIDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKYEILRTFLLRHQKQALQSANTSEVSFSSKKPLLECLEEIAM
        M PP AA+RSSTR RSDIDAKPVHSLS+DILCIIFSFLDLFDLVRCSVVCKSWNYAIYK EILRTF  R+QKQ + +A+TSEVSFS +KPLLECLEEIAM
Subjt:  MAPPAAADRSSTRIRSDIDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKYEILRTFLLRHQKQALQSANTSEVSFSSKKPLLECLEEIAM

Query:  ERHKFALEEGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCLDEYSIPEKLPLIDFDFDESKIVGLVGRRLCIWNRSGKRSIFPLREC
        ERHK AL+EGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWS E FRCL+EYSIPEK+PL+DFDFD  KIVGLVGR+LCIW+RSG RSIFP REC
Subjt:  ERHKFALEEGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCLDEYSIPEKLPLIDFDFDESKIVGLVGRRLCIWNRSGKRSIFPLREC

Query:  TFVEGSCMRYFDPEAIVGCEDGTAHVFDMYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAMLRSRNTVGIRSLCYNPSSHLVF
        TFV+G CMRYFD EA+VGCEDGTAHVFDMYSRRCSRIIR+L GPVTCLCVSDDQLILGGSLLGNIGV GLRSDQRVAMLRSRNTVGIRSLCYN SSHLVF
Subjt:  TFVEGSCMRYFDPEAIVGCEDGTAHVFDMYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAMLRSRNTVGIRSLCYNPSSHLVF

Query:  AGSTAGHVYCWDLRTLKSLWGYRVSPNVIYSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCITDSRLLSTYQNSLGVVEERIGKRLSDEAPIDAIN
        AGSTAGHVYCWDLR +KSLW  RVSPNV+YSL+HLQNDRSSLAVGGIDGILRILDQNTGTVRSCCI DSRLLST+QNSLG VEERIGKRLSDE PIDAIN
Subjt:  AGSTAGHVYCWDLRTLKSLWGYRVSPNVIYSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCITDSRLLSTYQNSLGVVEERIGKRLSDEAPIDAIN

Query:  RRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQN
        RRNRPSITSLAVGMNKI TTHNDKFIRLWKFQ+
Subjt:  RRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQN

TrEMBL top hitse value%identityAlignment
A0A0A0KBD6 F-box domain-containing protein9.9e-21885.22Show/hide
Query:  MAPPAAADRSSTRIRSDIDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKYEILRTFLLRHQKQALQSANTSEVSFSSKKPLLECLEEIAM
        M PP  ADRSS R RSD+DAKPVHSLSHDILCIIFSFLDLFDLVRC  VCKSWNYAIYK EILRTF LR+QKQ + SA+TSEVSFS +KPLLECLEEIAM
Subjt:  MAPPAAADRSSTRIRSDIDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKYEILRTFLLRHQKQALQSANTSEVSFSSKKPLLECLEEIAM

Query:  ERHKFALEEGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCLDEYSIPEKLPLIDFDFDESKIVGLVGRRLCIWNRSGKRSIFPLREC
        ERHK ALE+GRIRVSQWIGHSVR EQCRMKMGLILTGVGDKVMRLWS ENFRCL+EYS+PEK+PL+DFDFD  KIVGL+GR+LCIW+RSGKRSIFP REC
Subjt:  ERHKFALEEGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCLDEYSIPEKLPLIDFDFDESKIVGLVGRRLCIWNRSGKRSIFPLREC

Query:  TFVEGSCMRYFDPEAIVGCEDGTAHVFDMYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAMLRSRNTVGIRSLCYNPSSHLVF
        TF +G CMRYFD EA+VGCEDGTAHVFDMYSRRCSRIIR+L GPVTCLCV+DDQL+ GGSLLGNIGV G+RSDQRV MLRSRNTVGIR+LCYN SS LVF
Subjt:  TFVEGSCMRYFDPEAIVGCEDGTAHVFDMYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAMLRSRNTVGIRSLCYNPSSHLVF

Query:  AGSTAGHVYCWDLRTLKSLWGYRVSPNVIYSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCITDSRLLSTYQNSLGVVEERIGKRLSDEAPIDAIN
        AGSTAGHVYCWDLRT+KSLW  RVSPNVIYSL+HLQNDRSSLAVGGIDGILRILDQNTGTV+SCC+ DSRLLST+Q+ LG+VEER GKRLSDE PID I+
Subjt:  AGSTAGHVYCWDLRTLKSLWGYRVSPNVIYSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCITDSRLLSTYQNSLGVVEERIGKRLSDEAPIDAIN

Query:  RRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQN
        RRNRPSITSLAVGMNKIVTTHNDKFI+LWKFQ+
Subjt:  RRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQN

A0A1S3CEH9 F-box/WD-40 repeat-containing protein At3g520302.0e-21886.37Show/hide
Query:  MAPPAAADRSSTRIRSDIDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKYEILRTFLLRHQKQALQSANTSEVSFSSKKPLLECLEEIAM
        MAPP  ADRSS R RSD+DAKPVHSLSHDILCIIFSFLDLFDLVRC  VCKSWNYAIYK EILRTF LR+QKQ + SA+TS+VSFS +KPLL+CLEEIAM
Subjt:  MAPPAAADRSSTRIRSDIDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKYEILRTFLLRHQKQALQSANTSEVSFSSKKPLLECLEEIAM

Query:  ERHKFALEEGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCLDEYSIPEKLPLIDFDFDESKIVGLVGRRLCIWNRSGKRSIFPLREC
        ERHK ALE+GRIRV QWIGHSVRAEQCRMKMGLILTGVGDKVMRLWS ENFRCL+EYSIPEK+PLIDFDFD  KIVGLVG++LCIW+RSGKRSIFP REC
Subjt:  ERHKFALEEGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCLDEYSIPEKLPLIDFDFDESKIVGLVGRRLCIWNRSGKRSIFPLREC

Query:  TFVEGSCMRYFDPEAIVGCEDGTAHVFDMYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAMLRSRNTVGIRSLCYNPSSHLVF
        TF +G CMRY D EA+VGCEDGTAHVFDMYSRRCSRIIR+L GPVTCLCV+DDQL+ GGSLLGNIGV GLRSDQRV MLRSRNTVGIR+LCYN SS LVF
Subjt:  TFVEGSCMRYFDPEAIVGCEDGTAHVFDMYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAMLRSRNTVGIRSLCYNPSSHLVF

Query:  AGSTAGHVYCWDLRTLKSLWGYRVSPNVIYSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCITDSRLLSTYQNSLGVVEERIGKRLSDEAPIDAIN
        AGSTAGHVYCWDLRT+KSLW  RVSPNVIYSL+HLQNDRSSLAVGGIDGILRILDQNTGTVRSCC+ DSRLLST+QN LG VEER GKRLSDE PIDAI+
Subjt:  AGSTAGHVYCWDLRTLKSLWGYRVSPNVIYSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCITDSRLLSTYQNSLGVVEERIGKRLSDEAPIDAIN

Query:  RRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQN
        RRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQ+
Subjt:  RRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQN

A0A5D3CFV2 F-box/WD-40 repeat-containing protein1.3e-21786.14Show/hide
Query:  MAPPAAADRSSTRIRSDIDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKYEILRTFLLRHQKQALQSANTSEVSFSSKKPLLECLEEIAM
        M PP  ADRSS R RSD+DAKPVHSLSHDILCIIFSFLDLFDLVRC  VCKSWNYAIYK EILRTF LR+QKQ + SA+TS+VSFS +KPLL+CLEEIAM
Subjt:  MAPPAAADRSSTRIRSDIDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKYEILRTFLLRHQKQALQSANTSEVSFSSKKPLLECLEEIAM

Query:  ERHKFALEEGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCLDEYSIPEKLPLIDFDFDESKIVGLVGRRLCIWNRSGKRSIFPLREC
        ERHK ALE+GRIRV QWIGHSVRAEQCRMKMGLILTGVGDKVMRLWS ENFRCL+EYSIPEK+PLIDFDFD  KIVGLVG++LCIW+RSGKRSIFP REC
Subjt:  ERHKFALEEGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCLDEYSIPEKLPLIDFDFDESKIVGLVGRRLCIWNRSGKRSIFPLREC

Query:  TFVEGSCMRYFDPEAIVGCEDGTAHVFDMYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAMLRSRNTVGIRSLCYNPSSHLVF
        TF +G CMRYFD EA+VGCEDGTAHVFDMYSRRCSRIIR+L GPVTCLCV+DDQL+ GGSLLGNIGV GLRSDQRV MLRSRNTVGIR+LC N SS LVF
Subjt:  TFVEGSCMRYFDPEAIVGCEDGTAHVFDMYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAMLRSRNTVGIRSLCYNPSSHLVF

Query:  AGSTAGHVYCWDLRTLKSLWGYRVSPNVIYSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCITDSRLLSTYQNSLGVVEERIGKRLSDEAPIDAIN
        AGSTAGHVYCWDLRT+KSLW  RVSPNVIYSL+HLQNDRSSLAVGGIDGILRILDQNTGTVRSCC+ DSRLLST+QN LG VEER GKRLSDE PIDAI+
Subjt:  AGSTAGHVYCWDLRTLKSLWGYRVSPNVIYSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCITDSRLLSTYQNSLGVVEERIGKRLSDEAPIDAIN

Query:  RRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQN
        RRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQ+
Subjt:  RRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQN

A0A6J1HB56 F-box/WD-40 repeat-containing protein At3g52030 isoform X13.6e-22086.61Show/hide
Query:  MAPPAAADRSSTRIRSDIDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKYEILRTFLLRHQKQALQSANTSEVSFSSKKPLLECLEEIAM
        MAPP  ADRSS R RS+IDAKPV+SLSHDILCIIFSFLDLFDLVRCSVVCKSWN AI+  E+LRTF ++HQKQ ++S+++ EVS SS+KPLLECLEEIAM
Subjt:  MAPPAAADRSSTRIRSDIDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKYEILRTFLLRHQKQALQSANTSEVSFSSKKPLLECLEEIAM

Query:  ERHKFALEEGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCLDEYSIPEKLPLIDFDFDESKIVGLVGRRLCIWNRSGKRSIFPLREC
        ERHK ALEEGRIRVSQW+GHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCL+EYSIPEKLPLIDFDFDESKIVGLVGR LCIW+RSGKRSIFP REC
Subjt:  ERHKFALEEGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCLDEYSIPEKLPLIDFDFDESKIVGLVGRRLCIWNRSGKRSIFPLREC

Query:  TFVEGSCMRYFDPEAIVGCEDGTAHVFDMYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAMLRSRNTVGIRSLCYNPSSHLVF
        TFVEGSCMRYFDPEA+VGC DGTAHVFDMYSRRCSRI+R+L GPVTCLCV DDQLILGGSL GNIGV GLRSDQRVAMLRSRNT+GI+++CYN SSHLVF
Subjt:  TFVEGSCMRYFDPEAIVGCEDGTAHVFDMYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAMLRSRNTVGIRSLCYNPSSHLVF

Query:  AGSTAGHVYCWDLRTLKSLWGYRVSPNVIYSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCITDSRLLSTYQNSLGVVEERIGKRLSDEAPIDAIN
        AGSTAGHVYCWDLRT+K LW  RVSPNV+YSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRS CI DSRLLSTYQ+ +GVVEERIG RLSDE PIDAI+
Subjt:  AGSTAGHVYCWDLRTLKSLWGYRVSPNVIYSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCITDSRLLSTYQNSLGVVEERIGKRLSDEAPIDAIN

Query:  RRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQN
        RR+RP ITSLAVGMNKIVTTHNDKFIRLWKF+N
Subjt:  RRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQN

A0A6J1KSE3 F-box/WD-40 repeat-containing protein At3g52030 isoform X11.4e-21986.14Show/hide
Query:  MAPPAAADRSSTRIRSDIDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKYEILRTFLLRHQKQALQSANTSEVSFSSKKPLLECLEEIAM
        MAPP  ADRSS + RS+IDAKPV+SLSHDILCIIFSFLDLFDLVRCSVVCKSWN AI+  E+LRTF ++HQKQ ++S+++ +VS SS+KPLLECLEEIAM
Subjt:  MAPPAAADRSSTRIRSDIDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKYEILRTFLLRHQKQALQSANTSEVSFSSKKPLLECLEEIAM

Query:  ERHKFALEEGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCLDEYSIPEKLPLIDFDFDESKIVGLVGRRLCIWNRSGKRSIFPLREC
        ERHK ALEEGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCL+EYSIPEKLPLIDFDFDESKIVGLVGR +CIW+RSGKRSIFP REC
Subjt:  ERHKFALEEGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCLDEYSIPEKLPLIDFDFDESKIVGLVGRRLCIWNRSGKRSIFPLREC

Query:  TFVEGSCMRYFDPEAIVGCEDGTAHVFDMYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAMLRSRNTVGIRSLCYNPSSHLVF
        TFVEGSCMRYFDPEA+VGC DGTAHVFDMYSRRCSRI+R+L GPVTCLCV DDQLILGGSL GNIGV GLRSDQRVAMLRSRNT+GI+++CYN SSHLVF
Subjt:  TFVEGSCMRYFDPEAIVGCEDGTAHVFDMYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAMLRSRNTVGIRSLCYNPSSHLVF

Query:  AGSTAGHVYCWDLRTLKSLWGYRVSPNVIYSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCITDSRLLSTYQNSLGVVEERIGKRLSDEAPIDAIN
        AGSTAGHVYCWDLRT+K LW  RVSPNV+YSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRS CI DSRLLSTYQ+ +GVVEERIG RLSDE PIDAI+
Subjt:  AGSTAGHVYCWDLRTLKSLWGYRVSPNVIYSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCITDSRLLSTYQNSLGVVEERIGKRLSDEAPIDAIN

Query:  RRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQN
        RR+RP ITSLAVGMNKIVTTHNDKFIRLWKF+N
Subjt:  RRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQN

SwissProt top hitse value%identityAlignment
F4IIK6 Non-functional target of rapamycin complex subunit LST8-22.8e-0723.5Show/hide
Query:  ILTGVGDKVMRLWSSENFRCLDEYSIPE-KLPLIDFDFDESKIVGLVGRRLCIWNRSGKRSIFPLRECTFVEGSCM----RYFDPEAIVGCEDGTAHVFD
        + T   D+ +RLW +   RC   +  P+  +  ++   ++ K+V      + +++        P+R       + M    +Y       G EDG+  ++D
Subjt:  ILTGVGDKVMRLWSSENFRCLDEYSIPE-KLPLIDFDFDESKIVGLVGRRLCIWNRSGKRSIFPLRECTFVEGSCM----RYFDPEAIVGCEDGTAHVFD

Query:  MYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAMLRSRNTVGIRSLCYNPSSHLVFAGSTAGHVYCW
        +  R C R  R +S   T +   +   ++ G   GNI V  LR+D     L       IRSL       +V A +  G  Y W
Subjt:  MYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAMLRSRNTVGIRSLCYNPSSHLVFAGSTAGHVYCW

Q9SV01 F-box/WD-40 repeat-containing protein At3g520301.8e-12049.78Show/hide
Query:  MAPPAAADRSSTRIRSDIDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKYEILRTFLLRHQKQALQSANTSEVSFSSKKPLLECLEEIAM
        M      D SS R         + SL  DILCIIFSFLDLFDLV C+VVC SWN  I + ++L+      +K     +++   S S  +P    +E+ AM
Subjt:  MAPPAAADRSSTRIRSDIDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKYEILRTFLLRHQKQALQSANTSEVSFSSKKPLLECLEEIAM

Query:  ERHKFALEEGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCLDEYSIPEKLPLIDFDFDESK---------------------IVGLV
        + HK AL  GRI + +W  HS R  QCRMK GL+LTGVGDKVMRLWS ++++C++EYS+P+   LIDFDFDESK                     IVGLV
Subjt:  ERHKFALEEGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCLDEYSIPEKLPLIDFDFDESK---------------------IVGLV

Query:  GRRLCIWNRSGKRSIFPLRECTFVEGSCMRYFDPEAIVGCEDGTAHVFDMYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAML
        G R+ IW R+G+RSIFP R  TF +G CMRY DPEA+VGCEDGTA VFDMYS+ CS+IIR   GP+TCL +SD+QL L GS LG + V     DQ VA L
Subjt:  GRRLCIWNRSGKRSIFPLRECTFVEGSCMRYFDPEAIVGCEDGTAHVFDMYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAML

Query:  RSRNTV-GIRSLCYNPSSHLVFAGSTAGHVYCWDLRTLKSLWGYRVSPNVIYSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCITDSRLLST-YQN
        +S  T  GI+++C+N  ++L F G+T G+V CWDLR +  LW  RVSPNV+YS++ L+ND S +  GGIDG+LR++DQ +G V S  I D +  +T  +N
Subjt:  RSRNTV-GIRSLCYNPSSHLVFAGSTAGHVYCWDLRTLKSLWGYRVSPNVIYSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCITDSRLLST-YQN

Query:  SLGVVEERIGKRLSDEAPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKF
        +  V+E+R GKR+S +  ID I R+ RP I+ +A+GM K+VT HN K I +WKF
Subjt:  SLGVVEERIGKRLSDEAPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKF

Arabidopsis top hitse value%identityAlignment
AT2G22040.1 Transducin/WD40 repeat-like superfamily protein2.0e-0823.5Show/hide
Query:  ILTGVGDKVMRLWSSENFRCLDEYSIPE-KLPLIDFDFDESKIVGLVGRRLCIWNRSGKRSIFPLRECTFVEGSCM----RYFDPEAIVGCEDGTAHVFD
        + T   D+ +RLW +   RC   +  P+  +  ++   ++ K+V      + +++        P+R       + M    +Y       G EDG+  ++D
Subjt:  ILTGVGDKVMRLWSSENFRCLDEYSIPE-KLPLIDFDFDESKIVGLVGRRLCIWNRSGKRSIFPLRECTFVEGSCM----RYFDPEAIVGCEDGTAHVFD

Query:  MYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAMLRSRNTVGIRSLCYNPSSHLVFAGSTAGHVYCW
        +  R C R  R +S   T +   +   ++ G   GNI V  LR+D     L       IRSL       +V A +  G  Y W
Subjt:  MYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAMLRSRNTVGIRSLCYNPSSHLVFAGSTAGHVYCW

AT3G52030.1 F-box family protein with WD40/YVTN repeat doamin1.3e-12149.78Show/hide
Query:  MAPPAAADRSSTRIRSDIDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKYEILRTFLLRHQKQALQSANTSEVSFSSKKPLLECLEEIAM
        M      D SS R         + SL  DILCIIFSFLDLFDLV C+VVC SWN  I + ++L+      +K     +++   S S  +P    +E+ AM
Subjt:  MAPPAAADRSSTRIRSDIDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKYEILRTFLLRHQKQALQSANTSEVSFSSKKPLLECLEEIAM

Query:  ERHKFALEEGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCLDEYSIPEKLPLIDFDFDESK---------------------IVGLV
        + HK AL  GRI + +W  HS R  QCRMK GL+LTGVGDKVMRLWS ++++C++EYS+P+   LIDFDFDESK                     IVGLV
Subjt:  ERHKFALEEGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCLDEYSIPEKLPLIDFDFDESK---------------------IVGLV

Query:  GRRLCIWNRSGKRSIFPLRECTFVEGSCMRYFDPEAIVGCEDGTAHVFDMYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAML
        G R+ IW R+G+RSIFP R  TF +G CMRY DPEA+VGCEDGTA VFDMYS+ CS+IIR   GP+TCL +SD+QL L GS LG + V     DQ VA L
Subjt:  GRRLCIWNRSGKRSIFPLRECTFVEGSCMRYFDPEAIVGCEDGTAHVFDMYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAML

Query:  RSRNTV-GIRSLCYNPSSHLVFAGSTAGHVYCWDLRTLKSLWGYRVSPNVIYSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCITDSRLLST-YQN
        +S  T  GI+++C+N  ++L F G+T G+V CWDLR +  LW  RVSPNV+YS++ L+ND S +  GGIDG+LR++DQ +G V S  I D +  +T  +N
Subjt:  RSRNTV-GIRSLCYNPSSHLVFAGSTAGHVYCWDLRTLKSLWGYRVSPNVIYSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCITDSRLLST-YQN

Query:  SLGVVEERIGKRLSDEAPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKF
        +  V+E+R GKR+S +  ID I R+ RP I+ +A+GM K+VT HN K I +WKF
Subjt:  SLGVVEERIGKRLSDEAPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKF

AT3G52030.2 F-box family protein with WD40/YVTN repeat doamin2.6e-12552.19Show/hide
Query:  MAPPAAADRSSTRIRSDIDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKYEILRTFLLRHQKQALQSANTSEVSFSSKKPLLECLEEIAM
        M      D SS R         + SL  DILCIIFSFLDLFDLV C+VVC SWN  I + ++L+      +K     +++   S S  +P    +E+ AM
Subjt:  MAPPAAADRSSTRIRSDIDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKYEILRTFLLRHQKQALQSANTSEVSFSSKKPLLECLEEIAM

Query:  ERHKFALEEGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCLDEYSIPEKLPLIDFDFDESKIVGLVGRRLCIWNRSGKRSIFPLREC
        + HK AL  GRI + +W  HS R  QCRMK GL+LTGVGDKVMRLWS ++++C++EYS+P+   LIDFDFDESKIVGLVG R+ IW R+G+RSIFP R  
Subjt:  ERHKFALEEGRIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCLDEYSIPEKLPLIDFDFDESKIVGLVGRRLCIWNRSGKRSIFPLREC

Query:  TFVEGSCMRYFDPEAIVGCEDGTAHVFDMYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAMLRSRNTV-GIRSLCYNPSSHLV
        TF +G CMRY DPEA+VGCEDGTA VFDMYS+ CS+IIR   GP+TCL +SD+QL L GS LG + V     DQ VA L+S  T  GI+++C+N  ++L 
Subjt:  TFVEGSCMRYFDPEAIVGCEDGTAHVFDMYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAMLRSRNTV-GIRSLCYNPSSHLV

Query:  FAGSTAGHVYCWDLRTLKSLWGYRVSPNVIYSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCITDSRLLST-YQNSLGVVEERIGKRLSDEAPIDA
        F G+T G+V CWDLR +  LW  RVSPNV+YS++ L+ND S +  GGIDG+LR++DQ +G V S  I D +  +T  +N+  V+E+R GKR+S +  ID 
Subjt:  FAGSTAGHVYCWDLRTLKSLWGYRVSPNVIYSLRHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCITDSRLLST-YQNSLGVVEERIGKRLSDEAPIDA

Query:  INRRNRPSITSLAVGMNKIVTTHNDKFIRLWKF
        I R+ RP I+ +A+GM K+VT HN K I +WKF
Subjt:  INRRNRPSITSLAVGMNKIVTTHNDKFIRLWKF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCCTCCTGCGGCTGCCGACAGGTCCTCGACGAGAATACGGAGTGATATTGATGCAAAACCAGTTCACTCCCTAAGCCACGACATCTTGTGCATAATTTTTTCGTT
CCTTGACCTTTTCGACTTGGTTCGATGCTCAGTTGTTTGTAAATCCTGGAATTATGCTATTTATAAGTACGAAATACTGCGAACGTTTTTATTGAGGCATCAGAAGCAGG
CGCTGCAGTCTGCTAATACTTCCGAAGTATCATTTTCTTCAAAGAAACCATTGCTGGAATGCTTAGAGGAAATAGCGATGGAACGACACAAGTTTGCCTTGGAAGAAGGT
CGTATTAGAGTTTCTCAATGGATAGGCCACTCAGTGAGGGCTGAACAATGCCGAATGAAGATGGGCTTGATTCTTACAGGAGTGGGTGATAAAGTTATGAGACTTTGGTC
ATCAGAGAACTTCAGATGTCTGGATGAATATTCCATTCCGGAGAAACTGCCTCTAATTGACTTTGATTTTGATGAGAGCAAGATTGTTGGTTTGGTTGGTAGAAGGTTGT
GCATATGGAATCGGAGTGGGAAAAGAAGTATATTTCCTTTGCGTGAATGTACATTTGTGGAGGGTTCGTGCATGCGTTACTTTGATCCAGAGGCCATTGTTGGTTGTGAA
GATGGAACAGCTCATGTATTTGACATGTATAGTAGGAGATGCTCTAGAATTATCAGGTTGCTTTCTGGGCCAGTGACATGCTTATGTGTGAGTGATGATCAGCTCATACT
TGGGGGTTCCCTTCTTGGGAACATTGGAGTATTGGGTCTTCGGTCTGATCAGCGGGTAGCAATGCTTAGATCAAGAAATACCGTAGGCATAAGGTCTTTGTGTTATAACC
CCTCTTCACATTTAGTATTTGCGGGATCAACTGCCGGACATGTCTATTGTTGGGACCTCAGGACACTGAAATCGTTATGGGGATACCGAGTGAGCCCGAATGTCATATAT
TCTTTGCGACATCTTCAAAATGACAGGTCAAGTTTGGCTGTTGGTGGAATAGATGGCATTCTACGTATTTTAGACCAGAATACAGGCACGGTGCGGTCGTGCTGTATTAC
GGATAGTAGACTGTTATCGACATATCAGAACAGTCTCGGAGTTGTCGAAGAAAGGATAGGAAAAAGACTGTCAGATGAGGCTCCTATTGATGCCATAAACAGAAGGAATA
GGCCTTCGATCACAAGCTTGGCCGTTGGGATGAATAAGATAGTCACAACGCACAACGATAAGTTCATTAGATTATGGAAGTTTCAAAACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCCTCCTGCGGCTGCCGACAGGTCCTCGACGAGAATACGGAGTGATATTGATGCAAAACCAGTTCACTCCCTAAGCCACGACATCTTGTGCATAATTTTTTCGTT
CCTTGACCTTTTCGACTTGGTTCGATGCTCAGTTGTTTGTAAATCCTGGAATTATGCTATTTATAAGTACGAAATACTGCGAACGTTTTTATTGAGGCATCAGAAGCAGG
CGCTGCAGTCTGCTAATACTTCCGAAGTATCATTTTCTTCAAAGAAACCATTGCTGGAATGCTTAGAGGAAATAGCGATGGAACGACACAAGTTTGCCTTGGAAGAAGGT
CGTATTAGAGTTTCTCAATGGATAGGCCACTCAGTGAGGGCTGAACAATGCCGAATGAAGATGGGCTTGATTCTTACAGGAGTGGGTGATAAAGTTATGAGACTTTGGTC
ATCAGAGAACTTCAGATGTCTGGATGAATATTCCATTCCGGAGAAACTGCCTCTAATTGACTTTGATTTTGATGAGAGCAAGATTGTTGGTTTGGTTGGTAGAAGGTTGT
GCATATGGAATCGGAGTGGGAAAAGAAGTATATTTCCTTTGCGTGAATGTACATTTGTGGAGGGTTCGTGCATGCGTTACTTTGATCCAGAGGCCATTGTTGGTTGTGAA
GATGGAACAGCTCATGTATTTGACATGTATAGTAGGAGATGCTCTAGAATTATCAGGTTGCTTTCTGGGCCAGTGACATGCTTATGTGTGAGTGATGATCAGCTCATACT
TGGGGGTTCCCTTCTTGGGAACATTGGAGTATTGGGTCTTCGGTCTGATCAGCGGGTAGCAATGCTTAGATCAAGAAATACCGTAGGCATAAGGTCTTTGTGTTATAACC
CCTCTTCACATTTAGTATTTGCGGGATCAACTGCCGGACATGTCTATTGTTGGGACCTCAGGACACTGAAATCGTTATGGGGATACCGAGTGAGCCCGAATGTCATATAT
TCTTTGCGACATCTTCAAAATGACAGGTCAAGTTTGGCTGTTGGTGGAATAGATGGCATTCTACGTATTTTAGACCAGAATACAGGCACGGTGCGGTCGTGCTGTATTAC
GGATAGTAGACTGTTATCGACATATCAGAACAGTCTCGGAGTTGTCGAAGAAAGGATAGGAAAAAGACTGTCAGATGAGGCTCCTATTGATGCCATAAACAGAAGGAATA
GGCCTTCGATCACAAGCTTGGCCGTTGGGATGAATAAGATAGTCACAACGCACAACGATAAGTTCATTAGATTATGGAAGTTTCAAAACTAA
Protein sequenceShow/hide protein sequence
MAPPAAADRSSTRIRSDIDAKPVHSLSHDILCIIFSFLDLFDLVRCSVVCKSWNYAIYKYEILRTFLLRHQKQALQSANTSEVSFSSKKPLLECLEEIAMERHKFALEEG
RIRVSQWIGHSVRAEQCRMKMGLILTGVGDKVMRLWSSENFRCLDEYSIPEKLPLIDFDFDESKIVGLVGRRLCIWNRSGKRSIFPLRECTFVEGSCMRYFDPEAIVGCE
DGTAHVFDMYSRRCSRIIRLLSGPVTCLCVSDDQLILGGSLLGNIGVLGLRSDQRVAMLRSRNTVGIRSLCYNPSSHLVFAGSTAGHVYCWDLRTLKSLWGYRVSPNVIY
SLRHLQNDRSSLAVGGIDGILRILDQNTGTVRSCCITDSRLLSTYQNSLGVVEERIGKRLSDEAPIDAINRRNRPSITSLAVGMNKIVTTHNDKFIRLWKFQN