; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC10G200210 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC10G200210
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionWD_REPEATS_REGION domain-containing protein
Genome locationCicolChr10:26988320..26997550
RNA-Seq ExpressionCcUC10G200210
SyntenyCcUC10G200210
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR001680 - WD40 repeat
IPR001810 - F-box domain
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
IPR019775 - WD40 repeat, conserved site
IPR020472 - G-protein beta WD-40 repeat
IPR036047 - F-box-like domain superfamily
IPR036322 - WD40-repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588383.1 F-box/WD repeat-containing protein sel-10, partial [Cucurbita argyrosperma subsp. sororia]8.5e-19971.85Show/hide
Query:  MDRNQKFPTTITDLDEDSLAHCASFLKRHDIFNLASTCKYLQQVANSDSIWQRLFRERWQHQLPPLGSSLASGGARNAYFARLSDLQLCKFEDPLVASIL
        MD+NQ  PT ITDL+EDSLAHCA+FL  HDIFNLA+TCKYL+Q A SDSIWQRLFRERWQH LP L SS+ASGGAR AY ARLS LQ  KFEDPLV  +L
Subjt:  MDRNQKFPTTITDLDEDSLAHCASFLKRHDIFNLASTCKYLQQVANSDSIWQRLFRERWQHQLPPLGSSLASGGARNAYFARLSDLQLCKFEDPLVASIL

Query:  TQPEPYGPMLLDTDNIFVSQGSSIQMVTITKNVSRDFSLATLNDHNARITCMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLWWKGYCQKCFRGHNG
        T+PEPY  MLLD D++FVS+GSSI+M+ I K   R  SL TLNDHNARITCMR FPL ETS  RSEG++SGNFLVTSSSDHSIRLWWKG CQKCFRGH+G
Subjt:  TQPEPYGPMLLDTDNIFVSQGSSIQMVTITKNVSRDFSLATLNDHNARITCMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLWWKGYCQKCFRGHNG

Query:  PVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVT--STIRSSCCVGLTSLPG
        PVS LSDKLLGDG+ K+LASGGEDGTVRLWSL SSGKRGKSALK TLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVT  STIRSSCCVGLTSLPG
Subjt:  PVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVT--STIRSSCCVGLTSLPG

Query:  GPINLKCHESLVYVATTSSVVAIDLRTMQKVLTAATYQPFLYSFEMIPSKSLLCTGGSGS-------------------------GPVSFLHMDPYKVVT
         PIN+KCHESL+Y AT+SSVVA+DLRTMQKVLTAA YQPFLYSFEMIPSKSL+CTGG GS                         G V+FLHMDPYK+VT
Subjt:  GPINLKCHESLVYVATTSSVVAIDLRTMQKVLTAATYQPFLYSFEMIPSKSLLCTGGSGS-------------------------GPVSFLHMDPYKVVT

Query:  GCPNDVYVHVWEVDSGTPVNSLSCWFNAYTEGSTTLSSMAVDGCRIATASYAGDIGLLRYIDYTNALRPIGR----AMRNPLC---TFVSSSKPLLPYLS
        GCP++V V+VWE DSG   NSLSCWF    + STTLSSMAV+GCR+ TA YA D+GLLR  DYTNA RPI R    +  +P     TF+S S  L   LS
Subjt:  GCPNDVYVHVWEVDSGTPVNSLSCWFNAYTEGSTTLSSMAVDGCRIATASYAGDIGLLRYIDYTNALRPIGR----AMRNPLC---TFVSSSKPLLPYLS

Query:  RNFSLPKL
         + S P L
Subjt:  RNFSLPKL

KAG7022229.1 hypothetical protein SDJN02_15959 [Cucurbita argyrosperma subsp. argyrosperma]1.7e-19474.36Show/hide
Query:  MDRNQKFPTTITDLDEDSLAHCASFLKRHDIFNLASTCKYLQQVANSDSIWQRLFRERWQHQLPPLGSSLASGGARNAYFARLSDLQLCKFEDPLVASIL
        MD+NQ  PT ITDL+EDSLAHCA+FL  HDIFNLA+TCKYL+Q A SDSIWQRLFRERWQH LP L SS+ASGGAR AY ARLS LQ  KFEDPLV  +L
Subjt:  MDRNQKFPTTITDLDEDSLAHCASFLKRHDIFNLASTCKYLQQVANSDSIWQRLFRERWQHQLPPLGSSLASGGARNAYFARLSDLQLCKFEDPLVASIL

Query:  TQPEPYGPMLLDTDNIFVSQGSSIQMVTITKNVSRDFSLATLNDHNARITCMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLWWKGYCQKCFRGHNG
        T+PEPY  MLLD D++FVS+GSSI+M+ I K   R  SL TLNDHNARITCMR FPL ETS  RSEG++SGNFLVTSSSDHSIRLWWK    KCFRGH+G
Subjt:  TQPEPYGPMLLDTDNIFVSQGSSIQMVTITKNVSRDFSLATLNDHNARITCMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLWWKGYCQKCFRGHNG

Query:  PVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVT--STIRSSCCVGLTSLPG
        PVS LSDKLLGDG+ K+LASGGEDGTVRLWSL SSGKRGKSALK TLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVT  STIRSSCCVGLTSLPG
Subjt:  PVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVT--STIRSSCCVGLTSLPG

Query:  GPINLKCHESLVYVATTSSVVAIDLRTMQKVLTAATYQPFLYSFEMIPSKSLLCTGGSGS-------------------------GPVSFLHMDPYKVVT
         PIN+KCHESL+Y AT+SSVVA+DLRTMQKVLTAA YQPFLYSFEMIPSKSL+CTGG GS                         G V+FLHMDPYK+VT
Subjt:  GPINLKCHESLVYVATTSSVVAIDLRTMQKVLTAATYQPFLYSFEMIPSKSLLCTGGSGS-------------------------GPVSFLHMDPYKVVT

Query:  GCPNDVYVHVWEVDSGTPVNSLSCWFNAYTEGSTTLSSMAVDGCRIATASYAGDIGLLRYIDYTNALRPIGR
        GCP++V V+VWE DSG   NSLSCWF    + STTLSSMAV+GCR+ TA YA D+GLLR  DYTNA RPI R
Subjt:  GCPNDVYVHVWEVDSGTPVNSLSCWFNAYTEGSTTLSSMAVDGCRIATASYAGDIGLLRYIDYTNALRPIGR

XP_022928620.1 nuclear distribution protein nudF-like isoform X1 [Cucurbita moschata]2.5e-19874.79Show/hide
Query:  MDRNQKFPTTITDLDEDSLAHCASFLKRHDIFNLASTCKYLQQVANSDSIWQRLFRERWQHQLPPLGSSLASGGARNAYFARLSDLQLCKFEDPLVASIL
        MD+NQ  PT ITDL+EDSLAHCA+FL  HDIFNLA+TCKYL+Q A S+SIWQRLFRERWQ  LPPL SS+ASGGAR+AY ARLS LQ  KFEDPLV  +L
Subjt:  MDRNQKFPTTITDLDEDSLAHCASFLKRHDIFNLASTCKYLQQVANSDSIWQRLFRERWQHQLPPLGSSLASGGARNAYFARLSDLQLCKFEDPLVASIL

Query:  TQPEPYGPMLLDTDNIFVSQGSSIQMVTITKNVSRDFSLATLNDHNARITCMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLWWKGYCQKCFRGHNG
        T+PEPY  MLLD D++FVS+GSSI+M+ I K   R  SL TLNDHNARITCMR FPL ETS  RSEG++SGNFLVTSSSDHSIRLWWKG CQKCFRGH+G
Subjt:  TQPEPYGPMLLDTDNIFVSQGSSIQMVTITKNVSRDFSLATLNDHNARITCMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLWWKGYCQKCFRGHNG

Query:  PVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVT--STIRSSCCVGLTSLPG
        PVS LSDKLLGDG+ K+LASGGEDGTVRLWSL SSGKRGKSALK TLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVT  STIRSSCCVGLTSLPG
Subjt:  PVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVT--STIRSSCCVGLTSLPG

Query:  GPINLKCHESLVYVATTSSVVAIDLRTMQKVLTAATYQPFLYSFEMIPSKSLLCTGGSGS-------------------------GPVSFLHMDPYKVVT
         PIN+KCHESL+Y AT+SSVVA+DLRTMQKVLTAA YQPFLYSFEMIPSKSL+CTGG GS                         G V+FLHMDPYK+VT
Subjt:  GPINLKCHESLVYVATTSSVVAIDLRTMQKVLTAATYQPFLYSFEMIPSKSLLCTGGSGS-------------------------GPVSFLHMDPYKVVT

Query:  GCPNDVYVHVWEVDSGTPVNSLSCWFNAYTEGSTTLSSMAVDGCRIATASYAGDIGLLRYIDYTNALRPIGR
        GCP++V V+VWE DSG   NSLSCWF    + STTLSSMAV+GCR+ TA YA D+GLLR  DYTNA RPI R
Subjt:  GCPNDVYVHVWEVDSGTPVNSLSCWFNAYTEGSTTLSSMAVDGCRIATASYAGDIGLLRYIDYTNALRPIGR

XP_023530008.1 uncharacterized protein LOC111792690 [Cucurbita pepo subsp. pepo]5.9e-20075.42Show/hide
Query:  MDRNQKFPTTITDLDEDSLAHCASFLKRHDIFNLASTCKYLQQVANSDSIWQRLFRERWQHQLPPLGSSLASGGARNAYFARLSDLQLCKFEDPLVASIL
        MD+NQ  PT ITDL+EDSLAHCA+FL  HDIFNLA+TCKYL+Q A SDSIWQRLFRERWQH LPPL SS+ASGGAR+AY ARLS LQ  KFEDPLV  +L
Subjt:  MDRNQKFPTTITDLDEDSLAHCASFLKRHDIFNLASTCKYLQQVANSDSIWQRLFRERWQHQLPPLGSSLASGGARNAYFARLSDLQLCKFEDPLVASIL

Query:  TQPEPYGPMLLDTDNIFVSQGSSIQMVTITKNVSRDFSLATLNDHNARITCMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLWWKGYCQKCFRGHNG
        T+PEPY  MLLD D++FVS+GSSI+M+ I K   R  SL TLNDHNARITCMR FPL ETS  RSEG++SGNFLVTSSSDHSIRLWWKG CQKCFRGH+G
Subjt:  TQPEPYGPMLLDTDNIFVSQGSSIQMVTITKNVSRDFSLATLNDHNARITCMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLWWKGYCQKCFRGHNG

Query:  PVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVT--STIRSSCCVGLTSLPG
        PVS LSDKLLGDG+ K+LASGGEDGTVRLWSL SSGKRGKSALK TLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVT  STIRSSCCVGLTSLPG
Subjt:  PVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVT--STIRSSCCVGLTSLPG

Query:  GPINLKCHESLVYVATTSSVVAIDLRTMQKVLTAATYQPFLYSFEMIPSKSLLCTGGSGS-------------------------GPVSFLHMDPYKVVT
         PIN+KCHESL+Y AT+SSVVA+DLRTMQKVLTAA YQPFLYSFEMIPSKSL+CTGG GS                         G V+FLHMDPYK+VT
Subjt:  GPINLKCHESLVYVATTSSVVAIDLRTMQKVLTAATYQPFLYSFEMIPSKSLLCTGGSGS-------------------------GPVSFLHMDPYKVVT

Query:  GCPNDVYVHVWEVDSGTPVNSLSCWFNAYTEGSTTLSSMAVDGCRIATASYAGDIGLLRYIDYTNALRPIGR
        GCP++V V+VW+ DSG   NSLSCWF   T+ STTLSSMAV+GCRI TA YA D+GLLR  DYTNA RPI R
Subjt:  GCPNDVYVHVWEVDSGTPVNSLSCWFNAYTEGSTTLSSMAVDGCRIATASYAGDIGLLRYIDYTNALRPIGR

XP_038904795.1 uncharacterized WD repeat-containing protein alr2800-like [Benincasa hispida]1.3e-22683.97Show/hide
Query:  MDRNQKFPTTITDLDEDSLAHCASFLKRHDIFNLASTCKYLQQVANSDSIWQRLFRERWQHQLPPLGSSLASGGARNAYFARLSDLQLCKFEDPLVASIL
        MDRNQ FPTTITDLDEDSLAHCASFLKRHDIFNLA TCKYLQQVANSDSIWQRL+RERWQHQLPPL SSLASGGARNAYFARLS L  CKFEDP V+SI 
Subjt:  MDRNQKFPTTITDLDEDSLAHCASFLKRHDIFNLASTCKYLQQVANSDSIWQRLFRERWQHQLPPLGSSLASGGARNAYFARLSDLQLCKFEDPLVASIL

Query:  TQPEPYGPMLLDTDNIFVSQGSSIQMVTITKNVSRDFSLATLNDHNARITCMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLWWKGYCQKCFRGHNG
        TQ +PYGP LLD+DN+FVSQGSSI MVT +KN+SR FSLATL+DHNARITCMRSFPL+ETSFLRSEGQRSGNFL TSSSDHSIRLWWKG CQKCFRGHNG
Subjt:  TQPEPYGPMLLDTDNIFVSQGSSIQMVTITKNVSRDFSLATLNDHNARITCMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLWWKGYCQKCFRGHNG

Query:  PVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVTSTIRSSCCVGLTSLPGGP
        PVSILSDKLLGDG+GK LASGGEDGTVR+WSLSSSGKRGK ALKVTLHGHEKPIKLMSV GHKTSLLVSIARDSKVRVWDVTSTIRSSCCVGLTSL G P
Subjt:  PVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVTSTIRSSCCVGLTSLPGGP

Query:  INLKCHESLVYVATTSSVVAIDLRTMQKVLTAATYQPFLYSFEMIPSKSLLCTGGSG-------------------------SGPVSFLHMDPYKVVTGC
        INLKCHESLVYVATTSSV+AIDLRTMQKVLTAA YQPFLYSF+MIPSKS+LCTGGSG                         SGPV+FLHMDPYKVVTGC
Subjt:  INLKCHESLVYVATTSSVVAIDLRTMQKVLTAATYQPFLYSFEMIPSKSLLCTGGSG-------------------------SGPVSFLHMDPYKVVTGC

Query:  PNDVYVHVWEVDSGTPVNSLSCWFNAYTEGSTTLSSMAVDGCRIATASYAGDIGLLRYIDYTNALRPI
        P+DVYVHVWEV+SG P NSLSCWF  Y EGSTTLSS+A DGCRI TASY GDIG++R IDYTNALRPI
Subjt:  PNDVYVHVWEVDSGTPVNSLSCWFNAYTEGSTTLSSMAVDGCRIATASYAGDIGLLRYIDYTNALRPI

TrEMBL top hitse value%identityAlignment
A0A6J1DGX1 F-box/WD repeat-containing protein sel-10-like1.5e-18872.13Show/hide
Query:  MDRNQKFPTTITDLDEDSLAHCASFLKRHDIFNLASTCKYLQQVANSDSIWQRLFRERWQHQLPPLGSSLASGGARNAYFARLSDLQLCKFEDPLVASIL
        MD+NQ  P TITDL+EDSLA CA FL   DI N+A TCK L+QVA SDSIWQ LFRERWQH +PPL SS+ASGGAR+AY +RL+ LQ  KFEDPLVA + 
Subjt:  MDRNQKFPTTITDLDEDSLAHCASFLKRHDIFNLASTCKYLQQVANSDSIWQRLFRERWQHQLPPLGSSLASGGARNAYFARLSDLQLCKFEDPLVASIL

Query:  TQPEPYGPMLLDTDNIFVSQGSSIQMVTITKNVSRDFSLATLNDHNARITCMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLWWKGYCQKCFRGHNG
        TQ EPYG MLLD D+IF+S+GSSIQM+ I K+ SR  SL TL+DHNARITCMR FPLY TS  RSEGQRS NFLVTSSSDHSIR WWKG CQ+CFRGHNG
Subjt:  TQPEPYGPMLLDTDNIFVSQGSSIQMVTITKNVSRDFSLATLNDHNARITCMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLWWKGYCQKCFRGHNG

Query:  PVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVT--STIRSSCCVGLTSLPG
        P+S LSDKLLGD + K+LASGGEDGTVRLWSLSSSGKRGKSAL VTLHGH +PIK +SVAGHKTSLLVSI+RDSKVRVWDVT  ST RSSCCVGLTSL G
Subjt:  PVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVT--STIRSSCCVGLTSLPG

Query:  GPINLKCHESLVYVATTSSVVAIDLRTMQKVLTAATYQPFLYSFEMIPSKSLLCTGGSGS-------------------------GPVSFLHMDPYKVVT
         P+N+KCHESL+YV T+SSVVAIDLRTMQK  TAA YQPFLYSFEM+PSKSL+CTGG GS                         GPVSFLHMDPYK+VT
Subjt:  GPINLKCHESLVYVATTSSVVAIDLRTMQKVLTAATYQPFLYSFEMIPSKSLLCTGGSGS-------------------------GPVSFLHMDPYKVVT

Query:  GCPNDVYVHVWEVDSGTPVNSLSCWFNAYTEGSTTLSSMAVDGCRIATASYAGDIGLLRYIDYTNALRPI
        G P D+YV++WEVDSGT  NSLSCWFN     STTLSS AV+GCRI TA Y  D+GLLR  DYT+A RPI
Subjt:  GCPNDVYVHVWEVDSGTPVNSLSCWFNAYTEGSTTLSSMAVDGCRIATASYAGDIGLLRYIDYTNALRPI

A0A6J1EKG1 nuclear distribution protein nudF-like isoform X22.6e-16966.74Show/hide
Query:  MDRNQKFPTTITDLDEDSLAHCASFLKRHDIFNLASTCKYLQQVANSDSIWQRLFRERWQHQLPPLGSSLASGGARNAYFARLSDLQLCKFEDPLVASIL
        MD+NQ  PT ITDL+EDSLAHCA+FL  HDIFNLA+T                                                   CKFEDPLV  +L
Subjt:  MDRNQKFPTTITDLDEDSLAHCASFLKRHDIFNLASTCKYLQQVANSDSIWQRLFRERWQHQLPPLGSSLASGGARNAYFARLSDLQLCKFEDPLVASIL

Query:  TQPEPYGPMLLDTDNIFVSQGSSIQMVTITKNVSRDFSLATLNDHNARITCMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLWWKGYCQKCFRGHNG
        T+PEPY  MLLD D++FVS+GSSI+M+ I K   R  SL TLNDHNARITCMR FPL ETS  RSEG++SGNFLVTSSSDHSIRLWWKG CQKCFRGH+G
Subjt:  TQPEPYGPMLLDTDNIFVSQGSSIQMVTITKNVSRDFSLATLNDHNARITCMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLWWKGYCQKCFRGHNG

Query:  PVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVT--STIRSSCCVGLTSLPG
        PVS LSDKLLGDG+ K+LASGGEDGTVRLWSL SSGKRGKSALK TLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVT  STIRSSCCVGLTSLPG
Subjt:  PVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVT--STIRSSCCVGLTSLPG

Query:  GPINLKCHESLVYVATTSSVVAIDLRTMQKVLTAATYQPFLYSFEMIPSKSLLCTGGSGS-------------------------GPVSFLHMDPYKVVT
         PIN+KCHESL+Y AT+SSVVA+DLRTMQKVLTAA YQPFLYSFEMIPSKSL+CTGG GS                         G V+FLHMDPYK+VT
Subjt:  GPINLKCHESLVYVATTSSVVAIDLRTMQKVLTAATYQPFLYSFEMIPSKSLLCTGGSGS-------------------------GPVSFLHMDPYKVVT

Query:  GCPNDVYVHVWEVDSGTPVNSLSCWFNAYTEGSTTLSSMAVDGCRIATASYAGDIGLLRYIDYTNALRPIGR
        GCP++V V+VWE DSG   NSLSCWF    + STTLSSMAV+GCR+ TA YA D+GLLR  DYTNA RPI R
Subjt:  GCPNDVYVHVWEVDSGTPVNSLSCWFNAYTEGSTTLSSMAVDGCRIATASYAGDIGLLRYIDYTNALRPIGR

A0A6J1ES54 nuclear distribution protein nudF-like isoform X11.2e-19874.79Show/hide
Query:  MDRNQKFPTTITDLDEDSLAHCASFLKRHDIFNLASTCKYLQQVANSDSIWQRLFRERWQHQLPPLGSSLASGGARNAYFARLSDLQLCKFEDPLVASIL
        MD+NQ  PT ITDL+EDSLAHCA+FL  HDIFNLA+TCKYL+Q A S+SIWQRLFRERWQ  LPPL SS+ASGGAR+AY ARLS LQ  KFEDPLV  +L
Subjt:  MDRNQKFPTTITDLDEDSLAHCASFLKRHDIFNLASTCKYLQQVANSDSIWQRLFRERWQHQLPPLGSSLASGGARNAYFARLSDLQLCKFEDPLVASIL

Query:  TQPEPYGPMLLDTDNIFVSQGSSIQMVTITKNVSRDFSLATLNDHNARITCMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLWWKGYCQKCFRGHNG
        T+PEPY  MLLD D++FVS+GSSI+M+ I K   R  SL TLNDHNARITCMR FPL ETS  RSEG++SGNFLVTSSSDHSIRLWWKG CQKCFRGH+G
Subjt:  TQPEPYGPMLLDTDNIFVSQGSSIQMVTITKNVSRDFSLATLNDHNARITCMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLWWKGYCQKCFRGHNG

Query:  PVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVT--STIRSSCCVGLTSLPG
        PVS LSDKLLGDG+ K+LASGGEDGTVRLWSL SSGKRGKSALK TLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVT  STIRSSCCVGLTSLPG
Subjt:  PVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVT--STIRSSCCVGLTSLPG

Query:  GPINLKCHESLVYVATTSSVVAIDLRTMQKVLTAATYQPFLYSFEMIPSKSLLCTGGSGS-------------------------GPVSFLHMDPYKVVT
         PIN+KCHESL+Y AT+SSVVA+DLRTMQKVLTAA YQPFLYSFEMIPSKSL+CTGG GS                         G V+FLHMDPYK+VT
Subjt:  GPINLKCHESLVYVATTSSVVAIDLRTMQKVLTAATYQPFLYSFEMIPSKSLLCTGGSGS-------------------------GPVSFLHMDPYKVVT

Query:  GCPNDVYVHVWEVDSGTPVNSLSCWFNAYTEGSTTLSSMAVDGCRIATASYAGDIGLLRYIDYTNALRPIGR
        GCP++V V+VWE DSG   NSLSCWF    + STTLSSMAV+GCR+ TA YA D+GLLR  DYTNA RPI R
Subjt:  GCPNDVYVHVWEVDSGTPVNSLSCWFNAYTEGSTTLSSMAVDGCRIATASYAGDIGLLRYIDYTNALRPIGR

A0A6J1ES67 transcription initiation factor TFIID subunit 5-like isoform X35.1e-16579.62Show/hide
Query:  MDRNQKFPTTITDLDEDSLAHCASFLKRHDIFNLASTCKYLQQVANSDSIWQRLFRERWQHQLPPLGSSLASGGARNAYFARLSDLQLCKFEDPLVASIL
        MD+NQ  PT ITDL+EDSLAHCA+FL  HDIFNLA+TCKYL+Q A S+SIWQRLFRERWQ  LPPL SS+ASGGAR+AY ARLS LQ  KFEDPLV  +L
Subjt:  MDRNQKFPTTITDLDEDSLAHCASFLKRHDIFNLASTCKYLQQVANSDSIWQRLFRERWQHQLPPLGSSLASGGARNAYFARLSDLQLCKFEDPLVASIL

Query:  TQPEPYGPMLLDTDNIFVSQGSSIQMVTITKNVSRDFSLATLNDHNARITCMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLWWKGYCQKCFRGHNG
        T+PEPY  MLLD D++FVS+GSSI+M+ I K   R  SL TLNDHNARITCMR FPL ETS  RSEG++SGNFLVTSSSDHSIRLWWKG CQKCFRGH+G
Subjt:  TQPEPYGPMLLDTDNIFVSQGSSIQMVTITKNVSRDFSLATLNDHNARITCMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLWWKGYCQKCFRGHNG

Query:  PVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVT--STIRSSCCVGLTSLPG
        PVS LSDKLLGDG+ K+LASGGEDGTVRLWSL SSGKRGKSALK TLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVT  STIRSSCCVGLTSLPG
Subjt:  PVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVT--STIRSSCCVGLTSLPG

Query:  GPINLKCHESLVYVATTSSVVAIDLRTMQKVLTAATYQPFLYSFEMIPSKSLLCTGGSGSGPVSFLHM
         PIN+KCHESL+Y AT+SSVVA+DLRTMQKVLTAA YQPFLYSFEMIPSKSL+CTGG G   +  L +
Subjt:  GPINLKCHESLVYVATTSSVVAIDLRTMQKVLTAATYQPFLYSFEMIPSKSLLCTGGSGSGPVSFLHM

A0A7N2L0R8 WD_REPEATS_REGION domain-containing protein6.7e-14960.17Show/hide
Query:  TTITDLDEDSLAHCASFLKR-HDIFNLASTCKYLQQVANSDSIWQRLFRERWQHQLPPLGSSLASGGARNAYFARLSDLQLCKFEDPLVASILTQPEPYG
        T ITD+DEDSLAHCA++L    D+ NLA +CKY ++VA S+SIW R FRE W  Q P   SS  +   R AY AR + LQ  KF DPLVA I T P+PY 
Subjt:  TTITDLDEDSLAHCASFLKR-HDIFNLASTCKYLQQVANSDSIWQRLFRERWQHQLPPLGSSLASGGARNAYFARLSDLQLCKFEDPLVASILTQPEPYG

Query:  PMLLDTDNIFVSQGSSIQMVTITKNVSRDFSLATLNDHNARITCMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLWWKGYCQKCFRGHNGPVSILSD
         +LL+ ++   SQGS I+M  I   +    SL  L+DH+ARITCMR FPL ETS  RSE Q   N LVTSS DH+IRLWW+G CQ+CFRGHNGPV+ LSD
Subjt:  PMLLDTDNIFVSQGSSIQMVTITKNVSRDFSLATLNDHNARITCMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLWWKGYCQKCFRGHNGPVSILSD

Query:  KLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVT--STIRSSCCVGLTSLPGGPINLKC
        KLLGDG GK+LASGGEDGTVRLWSL+SSGKRGKSALK TL+GHEKP+KLMSVAGHKTSLLV+++RDSKVRVWD T  S++RSSCCVG+ SLPG P+N+KC
Subjt:  KLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVT--STIRSSCCVGLTSLPGGPINLKC

Query:  HESLVYVATTSSVVAIDLRTMQKVLTAATYQPFLYSFEMIPSKSLLCTGGSG------------------------SGPVSFLHMDPYKVVTGCPNDVYV
        HESL+YVAT SSV+AIDLRTM+KVLTAA YQP L+SFEM+PSKSL+CTG SG                        +G V+F+HMDPYK+VTG P D ++
Subjt:  HESLVYVATTSSVVAIDLRTMQKVLTAATYQPFLYSFEMIPSKSLLCTGGSG------------------------SGPVSFLHMDPYKVVTGCPNDVYV

Query:  HVWEVDSGTPVNSLSCWFNAYTEGSTTLSSMAVDGCRIATASYAGDIGLLRYIDYTNALRPI
        +VWE D+GT +NSLSC  ++  E S+  +++AV+GCRI T S    +  LR+ D+ NA  P+
Subjt:  HVWEVDSGTPVNSLSCWFNAYTEGSTTLSSMAVDGCRIATASYAGDIGLLRYIDYTNALRPI

SwissProt top hitse value%identityAlignment
C4Q0P6 Lissencephaly-1 homolog1.2e-0934.92Show/hide
Query:  SGNFLVTSSSDHSIRLW--WKGYCQKCFRGHNGPVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGH-----
        SG+FLV++S D +I++W    GYC K F GH   +      +     G LLAS   D T+R+WS+ S         +V L GHE  ++ ++ A H     
Subjt:  SGNFLVTSSSDHSIRLW--WKGYCQKCFRGHNGPVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGH-----

Query:  -------KTSLLVSIARDSKVRVWDV
                + LLVS +RD  +R WDV
Subjt:  -------KTSLLVSIARDSKVRVWDV

O75529 TAF5-like RNA polymerase II p300/CBP-associated factor-associated factor 65 kDa subunit 5L4.8e-1127.16Show/hide
Query:  SILTQPEPYGPMLLDTD--NIFVSQGSSIQMVTITKNVSRDFSLATLNDHNARITCMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLW--WKGYCQK
        ++L Q   Y    LD    +++ + GS  +   +  +  R + L     H A + C++  P             + N+L T S+D ++RLW   +G   +
Subjt:  SILTQPEPYGPMLLDTD--NIFVSQGSSIQMVTITKNVSRDFSLATLNDHNARITCMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLW--WKGYCQK

Query:  CFRGHNGPVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVTSTIRSSCCVGL
         F GH GPV      L    NGK LAS GED  ++LW L+S        L   L GH   I  ++ +   + L+ S + D+ VRVWD+ +T  S+   G 
Subjt:  CFRGHNGPVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVTSTIRSSCCVGL

Query:  TSLPGGPINLKCHESLVYVATTSSVVAIDLRTMQKVLTAATYQ
        +S   G          VY    S+V+++       +L     Q
Subjt:  TSLPGGPINLKCHESLVYVATTSSVVAIDLRTMQKVLTAATYQ

Q6S7B0 Transcription initiation factor TFIID subunit 59.1e-1035.9Show/hide
Query:  NFLVTSSSDHSIRLW--WKGYCQKCFRGHNGPVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVS
        N++ T SSD ++RLW    G C + F GH   V      L    +G+ +ASG EDGT+ +W LS+      +     L GH   +  +S +G + SLL S
Subjt:  NFLVTSSSDHSIRLW--WKGYCQKCFRGHNGPVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVS

Query:  IARDSKVRVWDVTSTIR
         + D  V++WDVTS+ +
Subjt:  IARDSKVRVWDVTSTIR

Q91WQ5 TAF5-like RNA polymerase II p300/CBP-associated factor-associated factor 65 kDa subunit 5L2.0e-1234.18Show/hide
Query:  RDFSLATLNDHNARITCMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLW--WKGYCQKCFRGHNGPVSILSDKLLGDGNGKLLASGGEDGTVRLWSL
        R + L     H A + C++  P             + N+L T S+D ++RLW   +G   + F GH GPV  LS       NGK LAS GED  ++LW L
Subjt:  RDFSLATLNDHNARITCMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLW--WKGYCQKCFRGHNGPVSILSDKLLGDGNGKLLASGGEDGTVRLWSL

Query:  SSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVTSTIRSSCC
        +S        L   L GH   I  ++ +   + L+ S + D+ VRVWD    IRS+CC
Subjt:  SSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVTSTIRSSCC

Q93794 F-box/WD repeat-containing protein sel-103.7e-1122.95Show/hide
Query:  LKRHDIFNLASTCKYLQQVANSDSIWQRLFRERWQHQLPPLGSSLASGGARNAYFARLSDLQLCKFEDPLVASILTQPEPYGPMLLDT---------DNI
        L  +D+  +A   K  + ++  D IW+ L  E ++H   P                  +D     ++   +A+ +T P+   P  L+           +I
Subjt:  LKRHDIFNLASTCKYLQQVANSDSIWQRLFRERWQHQLPPLGSSLASGGARNAYFARLSDLQLCKFEDPLVASILTQPEPYGPMLLDT---------DNI

Query:  FVSQGSSIQMVTITK-----NVSRDFSLATLNDHNAR-ITCMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLWW--KGYCQKCFRGHNGPV--SILS
        F       + +   K     N +     A L  H    ITCM               Q   + LVT S D+++++W   KG       GH G V  S +S
Subjt:  FVSQGSSIQMVTITK-----NVSRDFSLATLNDHNAR-ITCMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLWW--KGYCQKCFRGHNGPV--SILS

Query:  DKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVTSTIRSSCCVGLTSLPGGPINLKCH
                G+ + SG  D TV++WS          +L  TL GH   ++ M++AG   S+LV+ +RD+ +RVWDV S         L +L G    ++C 
Subjt:  DKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVTSTIRSSCCVGLTSLPGGPINLKCH

Query:  E----SLVYVATTSSVVAIDLRTMQKVLTAATYQPFLYSFEMIPSKSLLCTGG-------------SGSGPVSFLHMDPY---------KVVTGCPNDVY
        +    ++V      +V   +  T + + T   +   +YS      +S++C+G               G   V+ L               ++  C  D +
Subjt:  E----SLVYVATTSSVVAIDLRTMQKVLTAATYQPFLYSFEMIPSKSLLCTGG-------------SGSGPVSFLHMDPY---------KVVTGCPNDVY

Query:  VHVWEVDSGTPVNSLSCWFNAYTE----GSTTLSSMAVDG
        V VW++  GT V+ LS   +A T     G   +++ + DG
Subjt:  VHVWEVDSGTPVNSLSCWFNAYTE----GSTTLSSMAVDG

Arabidopsis top hitse value%identityAlignment
AT5G16750.1 Transducin family protein / WD-40 repeat family protein1.6e-0931.62Show/hide
Query:  ITCMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLW--WKGYCQKCFRGHNGPVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVT
        + C+RS+  +E   +      SG  L T+ +D  + +W    G+C   FRGH G VS  S     D N  +L SG +D TVR+W L++     K    + 
Subjt:  ITCMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLW--WKGYCQKCFRGHNGPVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVT

Query:  LHGHEKPIKLMSVAGHKTSL-LVSIARDSKVRVWDV
         H       + S+A  +  L L S  RD  V +WD+
Subjt:  LHGHEKPIKLMSVAGHKTSL-LVSIARDSKVRVWDV

AT5G24710.1 Transducin/WD40 repeat-like superfamily protein1.2e-0427.78Show/hide
Query:  FLRSEGQRSGNFLVTSSSDHSIR----LWWKGYCQKCFRGHNGPVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHE---KPI
        FL       G  +   S+D  IR    + WK   ++   GH G +  L + +   G   LL SGG DG + LWS        +   K++L  H+     +
Subjt:  FLRSEGQRSGNFLVTSSSDHSIR----LWWKGYCQKCFRGHNGPVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHE---KPI

Query:  KLMSVAGHKTSLLVSIARDSKVRVWD
        +L  V+G     L++I  D  + +WD
Subjt:  KLMSVAGHKTSLLVSIARDSKVRVWD

AT5G25150.1 TBP-associated factor 56.4e-1135.9Show/hide
Query:  NFLVTSSSDHSIRLW--WKGYCQKCFRGHNGPVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVS
        N++ T SSD ++RLW    G C + F GH   V      L    +G+ +ASG EDGT+ +W LS+      +     L GH   +  +S +G + SLL S
Subjt:  NFLVTSSSDHSIRLW--WKGYCQKCFRGHNGPVSILSDKLLGDGNGKLLASGGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVS

Query:  IARDSKVRVWDVTSTIR
         + D  V++WDVTS+ +
Subjt:  IARDSKVRVWDVTSTIR

AT5G50120.1 Transducin/WD40 repeat-like superfamily protein1.4e-0526.67Show/hide
Query:  CMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLWWKGYCQ---KCFRGHNGPVSILSDK-------LLGDGNGKLLASGGEDGTVRLWSLSSSGKRGK
        C+ SF       + +        + T SSD  I++W K   +   K  R H+  V+ILS+         L   NG LL SGG DG++ +W        G 
Subjt:  CMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLWWKGYCQ---KCFRGHNGPVSILSDK-------LLGDGNGKLLASGGEDGTVRLWSLSSSGKRGK

Query:  SALKVTLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVTSTIRSSCCVGLTSLPGGPINLKC
          +   L GH + +  ++V    + +L S + D  VR+W  ++  +   C+ +     GP+  KC
Subjt:  SALKVTLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVTSTIRSSCCVGLTSLPGGPINLKC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACCGGAATCAGAAATTTCCGACCACCATAACCGACTTGGATGAAGATTCTCTCGCTCACTGCGCGTCATTCCTTAAGCGCCACGACATCTTCAATCTCGCCAGCAC
CTGCAAATATCTTCAACAAGTTGCAAATTCCGACTCCATTTGGCAACGCCTTTTCAGGGAGCGATGGCAGCATCAGTTACCACCTCTTGGTTCTTCCTTAGCTTCAGGAG
GAGCTAGGAATGCCTACTTTGCTAGGCTTTCCGATTTGCAACTGTGCAAGTTTGAAGATCCTTTGGTTGCTAGTATCCTCACCCAACCCGAGCCTTATGGCCCCATGCTT
TTGGATACAGATAATATATTCGTTTCTCAGGGCTCCTCAATTCAAATGGTGACTATCACCAAAAATGTTAGCAGAGATTTTTCTCTTGCCACTCTGAATGATCACAATGC
GCGTATCACTTGTATGAGGTCGTTTCCCCTTTATGAAACTTCATTTCTTAGAAGTGAAGGACAAAGATCAGGGAATTTTTTGGTAACCTCAAGTTCTGATCACTCAATCC
GTTTGTGGTGGAAGGGTTATTGTCAGAAATGTTTTCGAGGTCATAATGGCCCAGTTTCAATTCTGTCAGATAAACTGTTAGGTGATGGTAACGGCAAATTATTGGCCAGT
GGAGGGGAAGATGGTACAGTTCGCCTTTGGTCTCTTAGTTCCAGTGGCAAGCGAGGGAAGAGTGCTTTAAAGGTTACACTACATGGACATGAAAAACCAATTAAATTAAT
GTCAGTTGCGGGGCACAAGACTTCTCTTTTGGTGAGCATTGCAAGAGACTCCAAGGTAAGAGTTTGGGATGTTACATCAACTATCCGTTCGTCTTGTTGCGTTGGATTAA
CTTCTCTTCCTGGTGGTCCTATAAACCTTAAGTGCCATGAATCATTGGTCTATGTTGCTACAACTTCCTCCGTTGTTGCAATCGATTTAAGGACTATGCAGAAAGTTCTT
ACGGCTGCAACTTATCAACCGTTTTTATATTCATTCGAGATGATTCCTTCTAAATCCTTACTATGCACAGGTGGTAGTGGCAGCGGACCAGTGAGCTTCCTACACATGGA
TCCCTACAAAGTTGTTACGGGATGTCCAAACGATGTTTATGTACACGTTTGGGAGGTCGATTCAGGCACACCAGTGAATTCTTTGAGTTGCTGGTTCAATGCATATACAG
AGGGCAGCACAACATTGTCCTCTATGGCTGTTGATGGGTGTAGAATTGCCACTGCGTCGTATGCTGGTGATATAGGACTCTTACGATATATAGACTATACCAATGCTCTT
CGTCCTATTGGGAGGGCCATGAGAAATCCACTTTGTACCTTCGTTTCGTCATCAAAGCCTCTTCTCCCTTATTTAAGCCGTAACTTCTCCCTACCGAAGTTGAAGCTATT
TGGACTCCTTGAAAGGGTGATGTCTGAACACAGCAAGAAGCCAATCTTGGAGGTCTTGAGGGGACTCGAATGCCATTTGAAAGGAAGTTAA
mRNA sequenceShow/hide mRNA sequence
ATTTCATTAACTTTTGATCGTATTTGTGTTTCTCTTTCTATCCCTCAAATCCAATTTCTCATCATTGGAGTCAAAGTTACTAATTTGTCGATTTCTCAAGCCTTTGTAAA
TTGTAAACCCTGCGAAACCAAACCAATCACCATCATCATCGATGGACCGGAATCAGAAATTTCCGACCACCATAACCGACTTGGATGAAGATTCTCTCGCTCACTGCGCG
TCATTCCTTAAGCGCCACGACATCTTCAATCTCGCCAGCACCTGCAAATATCTTCAACAAGTTGCAAATTCCGACTCCATTTGGCAACGCCTTTTCAGGGAGCGATGGCA
GCATCAGTTACCACCTCTTGGTTCTTCCTTAGCTTCAGGAGGAGCTAGGAATGCCTACTTTGCTAGGCTTTCCGATTTGCAACTGTGCAAGTTTGAAGATCCTTTGGTTG
CTAGTATCCTCACCCAACCCGAGCCTTATGGCCCCATGCTTTTGGATACAGATAATATATTCGTTTCTCAGGGCTCCTCAATTCAAATGGTGACTATCACCAAAAATGTT
AGCAGAGATTTTTCTCTTGCCACTCTGAATGATCACAATGCGCGTATCACTTGTATGAGGTCGTTTCCCCTTTATGAAACTTCATTTCTTAGAAGTGAAGGACAAAGATC
AGGGAATTTTTTGGTAACCTCAAGTTCTGATCACTCAATCCGTTTGTGGTGGAAGGGTTATTGTCAGAAATGTTTTCGAGGTCATAATGGCCCAGTTTCAATTCTGTCAG
ATAAACTGTTAGGTGATGGTAACGGCAAATTATTGGCCAGTGGAGGGGAAGATGGTACAGTTCGCCTTTGGTCTCTTAGTTCCAGTGGCAAGCGAGGGAAGAGTGCTTTA
AAGGTTACACTACATGGACATGAAAAACCAATTAAATTAATGTCAGTTGCGGGGCACAAGACTTCTCTTTTGGTGAGCATTGCAAGAGACTCCAAGGTAAGAGTTTGGGA
TGTTACATCAACTATCCGTTCGTCTTGTTGCGTTGGATTAACTTCTCTTCCTGGTGGTCCTATAAACCTTAAGTGCCATGAATCATTGGTCTATGTTGCTACAACTTCCT
CCGTTGTTGCAATCGATTTAAGGACTATGCAGAAAGTTCTTACGGCTGCAACTTATCAACCGTTTTTATATTCATTCGAGATGATTCCTTCTAAATCCTTACTATGCACA
GGTGGTAGTGGCAGCGGACCAGTGAGCTTCCTACACATGGATCCCTACAAAGTTGTTACGGGATGTCCAAACGATGTTTATGTACACGTTTGGGAGGTCGATTCAGGCAC
ACCAGTGAATTCTTTGAGTTGCTGGTTCAATGCATATACAGAGGGCAGCACAACATTGTCCTCTATGGCTGTTGATGGGTGTAGAATTGCCACTGCGTCGTATGCTGGTG
ATATAGGACTCTTACGATATATAGACTATACCAATGCTCTTCGTCCTATTGGGAGGGCCATGAGAAATCCACTTTGTACCTTCGTTTCGTCATCAAAGCCTCTTCTCCCT
TATTTAAGCCGTAACTTCTCCCTACCGAAGTTGAAGCTATTTGGACTCCTTGAAAGGGTGATGTCTGAACACAGCAAGAAGCCAATCTTGGAGGTCTTGAGGGGACTCGA
ATGCCATTTGAAAGGAAGTTAACCCAAATTTTGTAACCATAAATTTCCAAACCATATTGCATAAGATATAGAATAGAGTGAGGAAGAAAAAGTGTACATGTAATTCTCTC
TAAATAATATAGAAAAATGGAAGGAAAAAAGAAGCTGCTCAAATTGATGAAAAGGAAAGAAGAAAATGATGAGAAACCAATAATGTAATTTTGATGCAAATTCCAATGTC
CCGGTTGACTCGTCAAGAGACAACACTGGACTCGCTCTCACTCAATACGATCGTCATCCCCCCTTCTTCTTCCTCAATTCCTCCTCAATTCTCTTCAAATCTTTCTTAGT
TCTAATGGGTTTTCTTCGGAATTCGTAGAGAATCGTAATCTTCTACAATCACATCCATTTCTCCATCCTCCTTATTTCCAATCAATCAATCCAATCCAATCTAAAACCCT
AAAAATGCCCACCACCACCTCCGCCGCCGTCGCCGCCGCGGCAATCCTCTTTCTGTTCATCATCGTCACCGCCACCTCCGCTCCGATCCTCGGCCTCGATTCATTTCTCG
CTCAGCAATCTCGCTTCGACCCACATGCCTCCAACGACACATTTCTCTCCCTCTCATCGTCCCTCAAGAAATCTCTTTCTGTATCTTCTCCTCCTCCTCCTCTCATCCCT
TCTTTCATCTCTTCTCTCCTCTCTCTCTCCCTTTCTTTCTCTCTCCATGTCCGTCTTGTTGGTGACTTTCCCTCCGATTCATCGACCCATCTCTCATCTTTCCTCTCTGC
TTCTCTCCCTTCCGACCATTTCCATGTCATTGCTCCTTTTGATTCGTATCAACACCGTCTTGCCGTTAAGCATTCTCTTCATCTCGATGTATCTCATGCCCCCTCTTTGG
CTTCTCATCTCTCGGAGATCTTGAAATCTGAAATCTCTAATACTGCTTCTAGCCTCCGATCTTCGCTTCTTGCTGTTCCTTATGAGTCTGTGGACCGTGTCATAAAACAG
GATTTTGAGAAGGAGAAATCCGGTCAAGGGGTTTACATATATTTGCTTAATTTGGGCTCTCAGTCGAAGCCGTATGCTTACAATTATGGCCATGGGGATTCATCCCCTGG
TTTTACCAAGTGTTTAGGAAGCATTTGGAGTGGTGGAGAAAGGTATTTATGGGTTGATTTAGGTGCAGGTCCTGTTGATTATGGACCGTCGCTGTCTGGAGATGGGGTTC
TTCCTAGAGGAGAGTTTCATCCTTTGGCGACCTTGCATGGCCGGCCGAAGTCCCAGAAGGCGCTGCTAGCGGATTTGGCTTCGTTGGTTTGGAGTGCTTATCAGGTTCAT
TTAGTTCCTTCTATGAGAATTCCTGTTCCTTTTGAAAGTTCATTGGTTGTTCAATTTGTACACATATATGGGTCTGAGAGTAGTGAGGGAGGTGATTTGGATTGGAAGTC
TATTGAGAGAACCTTGAGAGATGGTGGGCTGTTGTTGGGTGAGCAGTTTTTGAGCTTCAAGACTTATAGTGTGAGCTATGCTAAGTGCCCAATTTGTGCTTTTGCCGTTT
CTCGGTCTACGAATTCATATACTTCAAGGTTTTTGTTTGATAACTACACTTTGATTGTAAATGAGTATTTGGATTCTAAAAGATTGCATCAGATACTGTCTGATTCTGCT
GAGGAGTTTAGAAGGGCTGGATTCCCTGAGGAGGAGGAGATGGCCAGAGTGGTTCCAGTCTATGTTTTCGATTTGAACTTGAATACAATCTTGTTGCTTGATCGTTACCA
TCAATCTGTGGCCTTTACAGACATGGTTATTGCTGTAAGGACTAAGAATACTCAGACTGTGAGTGATTATAGCTGTAATGGTCGCCATGTATTTACACATACAAGGGACC
TTGAAAGGCCACTTATTGGTTCAATTTTACAAAGTATGTGGGGAGTGTCACCTACCCACTTAGCTTGGAGCTCGAGGCACAACGACACCATTGTCGATTACTCATGGAGC
ATTGGGCAAACTCCTTTTGGTCCATTCTCAGAGGTTTCATCCTTATCATTTGTTCAGAAGGATGCAGCGAGGAGAAACCTTATATTGACAGCATTGAATAGCAGTATCAC
TAGTGCGATTGATGTTCTTAACTCCGTAGCTGCACATGGTGGTGATAGAAATTTGCTAAAACCGAAACAACGTACTGAGTTCATACAACGATGGAACCTTTTCAAATACA
AGCTGGACAAAGTGGTGTCTGCAATGTCGCATTTTGACTTCGAGATGGCTTTGTATTATGTAAGATCTTCAGATCATGACCTTTACATGCTTCACTCCATCGTCTACAAC
GCATCTCAAGAACTGGAGGCATCGTTGGTTTGCTTCAAAGACCCTCCATTCCCTTGGGGTTCTGTGTCAGTGTCTGTTGTACTTTTCTTTGCTTTCTTATATGTTTACAC
AAAAAGGGATAGAATTTTCAAGAACAAAAGGAAGCAATTTTGATGCTAATTAATGGGTTGCAGAGTATGTATTACCGACAGCACATTATAAATGTAATTTCTTTGCAGAG
TTAGATTGAAGCATTAGGATGTTGAATTCGGATCACTGAACTGGTTACTTTTTTGATTTGGCAATTTGCTTGAGTTCTGGGTTATTACTTTTCTATGATTCTGCAATATC
TGAGATTCACGTAAGTTATTTTCTCTTGGTTTATCAATGTTCGTTCTAGAACAGATCTGAAAAGAGAAAAGAGAGAAATGTTGAGTAGTTGGTTTACTGATGTCCACAAA
AATACAAAGAAAACAGTTTACTGATCGAGTTGAGTTCTAACTCCCATATTAACATGTTTGATACGATGATATTAACAATGACATGAAGCACACTTTCAGGCTGCTTTGAA
GCGATGAAATGTGCCATTAAAGTAAGAATGAAGCTCGCTCTCTTGCAGCTCTTTGTTGGAGCTGCAAACCAACGGTACATCTGTAATTCTGTCACAGAAGACATCTTCAA
ACTCATCAAATACACTGTCGAGCTGCACTCGAGTCTTCGCCTCTTCCCCATTATCAGCCTCATCCACAGAGAGATTCGAGTTGGTCTCCACCAGTTTCTTGAAAAATGAT
GATCGAAGGAGGAGGTCGAGTGCTGATGCAGAAGAAGATCTGGCGTAAGCATTGAGCGGGGCATTGCAGCTCTGAAGAAGATCTTGCTTTTGAAGGGAATGAAAGGAATC
AATGTCGTAGTGATGGTTTTTGAAGAGTGAAGTTTCTTCAGCTGGGATGGAGCTGGAGTTGCAGAGCTCCTTGTGGGGGTCTGTTGCCATATGGGCTTCACTTGGGGCGG
CGGATGGCGGCGGAGGAGGCTTCAACCAGGCCATGTAATTGCTCCAGTCGAAGTTGGTGACTGCATTGATCCCTCGGTATTCAATGGCGGCCATATCATAAGCTCGAGCA
GCTTCTTCTTGAGTGCCTGTGGTGGAAGATGAAACAAATATGTGATTAGTATTCAAATTTAACATGACAAATCTCCTACTCTTTACTCTATAAAAGAGAGTATTAAGAGT
AACATAAATTTAATGGCTTTATTAATTGTATTTCTAAATGAGATTTTTCCCACCACTCTTCTCATTTTGGTCTGAAACAATGGCATGACTTATTAGAACAAACAATGCCT
CATTCTTTACCTTTCACCTATTACTAAGACCACTTATTTATACTTTCTAGAACATTCGGGTAATGAAGAACATAATTATGTTAATATTTTGTTAAAAGAATTATTTTATA
GAAAATGTTTATTGGCCCAGAGATTTGTATACCGTTTTTATTTGGTTTATACCTTTACTCACGAG
Protein sequenceShow/hide protein sequence
MDRNQKFPTTITDLDEDSLAHCASFLKRHDIFNLASTCKYLQQVANSDSIWQRLFRERWQHQLPPLGSSLASGGARNAYFARLSDLQLCKFEDPLVASILTQPEPYGPML
LDTDNIFVSQGSSIQMVTITKNVSRDFSLATLNDHNARITCMRSFPLYETSFLRSEGQRSGNFLVTSSSDHSIRLWWKGYCQKCFRGHNGPVSILSDKLLGDGNGKLLAS
GGEDGTVRLWSLSSSGKRGKSALKVTLHGHEKPIKLMSVAGHKTSLLVSIARDSKVRVWDVTSTIRSSCCVGLTSLPGGPINLKCHESLVYVATTSSVVAIDLRTMQKVL
TAATYQPFLYSFEMIPSKSLLCTGGSGSGPVSFLHMDPYKVVTGCPNDVYVHVWEVDSGTPVNSLSCWFNAYTEGSTTLSSMAVDGCRIATASYAGDIGLLRYIDYTNAL
RPIGRAMRNPLCTFVSSSKPLLPYLSRNFSLPKLKLFGLLERVMSEHSKKPILEVLRGLECHLKGS