; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0021215 (gene) of Chayote v1 genome

Gene IDSed0021215
OrganismSechium edule (Chayote v1)
Descriptionprotein SICKLE isoform X2
Genome locationLG05:44992526..44999273
RNA-Seq ExpressionSed0021215
SyntenySed0021215
Gene Ontology termsGO:0000398 - mRNA splicing, via spliceosome (biological process)
GO:0035196 - production of miRNAs involved in gene silencing by miRNA (biological process)
GO:1903730 - regulation of phosphatidate phosphatase activity (biological process)
InterPro domainsIPR028265 - TTDN1/Protein SICKLE
IPR039292 - Protein SICKLE


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022947202.1 protein SICKLE isoform X1 [Cucurbita moschata]8.8e-13673.96Show/hide
Query:  MEESEKRRERLRAMRMEASEADAANHAETSLPNQLSNPLVESSDTMAGQSETCTAPRFDYYTNPMAAFSSTRKRGN---SHN------------YVPSYR
        MEESEKRRERLRAMRMEAS+AD AN+ ETSLPN LSNPLVESS  M GQSE CTAPRFDYYTNPMAAFSS++KRGN   SH+            YVPS+ 
Subjt:  MEESEKRRERLRAMRMEASEADAANHAETSLPNQLSNPLVESSDTMAGQSETCTAPRFDYYTNPMAAFSSTRKRGN---SHN------------YVPSYR

Query:  TNSPATYVPSNFPGLRNPEMSSSPTHQFHQHSPDQRTFHARGFSGSGDHGSPAMARPFSMDQRTPEMWHGPRSPFVNHFPGPPPWGMSSPRSPFINRFPS
         +SP  YVPSNFPG+RNPEMS S  HQFHQHSPDQR F+ARG+SGSG HGSPAM RPF MDQR+P MW GPRSP+VNHFPGPPP GM+SPR PF+N+FPS
Subjt:  TNSPATYVPSNFPGLRNPEMSSSPTHQFHQHSPDQRTFHARGFSGSGDHGSPAMARPFSMDQRTPEMWHGPRSPFVNHFPGPPPWGMSSPRSPFINRFPS

Query:  HLPREASSPGVVSGPRGNYH-----------SPSPSPGYQGR-----GRHGHRGGMTPSPRFGSGRGSGSHSRRFSSNESPRPEQFYNASMLEDPWKVLQ
         LPR+ SSP  VSGPRGN +           SPSPSPGYQG      G HGHRG MTPSPRFGSGRG GSH RRFSS+ES RPEQFYN SMLEDPWKVLQ
Subjt:  HLPREASSPGVVSGPRGNYH-----------SPSPSPGYQGR-----GRHGHRGGMTPSPRFGSGRGSGSHSRRFSSNESPRPEQFYNASMLEDPWKVLQ

Query:  PGIWTTVAPSSSSANPSESWISKYGTKKARASDSSSGRSGSQPSLAEYLAASFNEAVNEAP
        PGIW  VAP  SSANPSESWISK+ TKKAR SD+SSGRSGSQPSLAEYLAASFNEA N  P
Subjt:  PGIWTTVAPSSSSANPSESWISKYGTKKARASDSSSGRSGSQPSLAEYLAASFNEAVNEAP

XP_022947209.1 protein SICKLE isoform X2 [Cucurbita moschata]1.9e-13876.5Show/hide
Query:  MEESEKRRERLRAMRMEASEADAANHAETSLPNQLSNPLVESSDTMAGQSETCTAPRFDYYTNPMAAFSSTRKRGN---SHNYVPSYRTNSPATYVPSNF
        MEESEKRRERLRAMRMEAS+AD AN+ ETSLPN LSNPLVESS  M GQSE CTAPRFDYYTNPMAAFSS++KRGN   SH+YVPS+  +SP  YVPSNF
Subjt:  MEESEKRRERLRAMRMEASEADAANHAETSLPNQLSNPLVESSDTMAGQSETCTAPRFDYYTNPMAAFSSTRKRGN---SHNYVPSYRTNSPATYVPSNF

Query:  PGLRNPEMSSSPTHQFHQHSPDQRTFHARGFSGSGDHGSPAMARPFSMDQRTPEMWHGPRSPFVNHFPGPPPWGMSSPRSPFINRFPSHLPREASSPGVV
        PG+RNPEMS S  HQFHQHSPDQR F+ARG+SGSG HGSPAM RPF MDQR+P MW GPRSP+VNHFPGPPP GM+SPR PF+N+FPS LPR+ SSP  V
Subjt:  PGLRNPEMSSSPTHQFHQHSPDQRTFHARGFSGSGDHGSPAMARPFSMDQRTPEMWHGPRSPFVNHFPGPPPWGMSSPRSPFINRFPSHLPREASSPGVV

Query:  SGPRGNYH-----------SPSPSPGYQGR-----GRHGHRGGMTPSPRFGSGRGSGSHSRRFSSNESPRPEQFYNASMLEDPWKVLQPGIWTTVAPSSS
        SGPRGN +           SPSPSPGYQG      G HGHRG MTPSPRFGSGRG GSH RRFSS+ES RPEQFYN SMLEDPWKVLQPGIW  VAP  S
Subjt:  SGPRGNYH-----------SPSPSPGYQGR-----GRHGHRGGMTPSPRFGSGRGSGSHSRRFSSNESPRPEQFYNASMLEDPWKVLQPGIWTTVAPSSS

Query:  SANPSESWISKYGTKKARASDSSSGRSGSQPSLAEYLAASFNEAVNEAP
        SANPSESWISK+ TKKAR SD+SSGRSGSQPSLAEYLAASFNEA N  P
Subjt:  SANPSESWISKYGTKKARASDSSSGRSGSQPSLAEYLAASFNEAVNEAP

XP_023007497.1 protein SICKLE isoform X1 [Cucurbita maxima]4.1e-13373.73Show/hide
Query:  MEESEKRRERLRAMRMEASEADAANHAETSLPNQLSNPLVESSDTMAGQSETCTAPRFDYYTNPMAAFSSTRKRGN---SHNYVPSYRTNSPATYVPSNF
        MEESEKRRERLRAMRMEAS+AD AN+ ETSLPN LSNPLVESS  M GQSE CTAPRFDYYTNPMAAFSS++KRGN   SH YVPS+  +SP  YVPSNF
Subjt:  MEESEKRRERLRAMRMEASEADAANHAETSLPNQLSNPLVESSDTMAGQSETCTAPRFDYYTNPMAAFSSTRKRGN---SHNYVPSYRTNSPATYVPSNF

Query:  PGLRNPEMSSSPTHQFHQHSPDQRTFHARGFSGSGDHGSPAMARPFSMDQRTPEMWHGPRSPFVNHFPGPPPWGMSSPRSPFINRFPSHLPREASSPGVV
        PG+RNPEMS S  HQFHQHSPDQR F+ARG+SGSG HGSPAM RPF MDQR+P MW GPRSP+VNHFPGPPP  M+SPR PF+N+FPS LPR+ SSP  V
Subjt:  PGLRNPEMSSSPTHQFHQHSPDQRTFHARGFSGSGDHGSPAMARPFSMDQRTPEMWHGPRSPFVNHFPGPPPWGMSSPRSPFINRFPSHLPREASSPGVV

Query:  SGPRGNYH-----------SPSPSPGYQGR-------------GRHGHRGGMTPSPRFGSGRGSGSHSRRFSSNESPRPEQFYNASMLEDPWKVLQPGIW
        SGPRGN +           SPSPSPGYQG              G HGHRG MTPSPRFG GRG GSH RRFSS++S RPEQFY+ SMLEDPWKVLQPGIW
Subjt:  SGPRGNYH-----------SPSPSPGYQGR-------------GRHGHRGGMTPSPRFGSGRGSGSHSRRFSSNESPRPEQFYNASMLEDPWKVLQPGIW

Query:  TTVAPSSSSANPSESWISKYGTKKARASDSSSGRSGSQPSLAEYLAASFNEAVN
          VAP  SSANPSESWISK+ TKKAR  D+SSGRSGSQPSLAEYLAASFNEA N
Subjt:  TTVAPSSSSANPSESWISKYGTKKARASDSSSGRSGSQPSLAEYLAASFNEAVN

XP_023007500.1 protein SICKLE isoform X2 [Cucurbita maxima]4.1e-13373.73Show/hide
Query:  MEESEKRRERLRAMRMEASEADAANHAETSLPNQLSNPLVESSDTMAGQSETCTAPRFDYYTNPMAAFSSTRKRGN---SHNYVPSYRTNSPATYVPSNF
        MEESEKRRERLRAMRMEAS+AD AN+ ETSLPN LSNPLVESS  M GQSE CTAPRFDYYTNPMAAFSS++KRGN   SH YVPS+  +SP  YVPSNF
Subjt:  MEESEKRRERLRAMRMEASEADAANHAETSLPNQLSNPLVESSDTMAGQSETCTAPRFDYYTNPMAAFSSTRKRGN---SHNYVPSYRTNSPATYVPSNF

Query:  PGLRNPEMSSSPTHQFHQHSPDQRTFHARGFSGSGDHGSPAMARPFSMDQRTPEMWHGPRSPFVNHFPGPPPWGMSSPRSPFINRFPSHLPREASSPGVV
        PG+RNPEMS S  HQFHQHSPDQR F+ARG+SGSG HGSPAM RPF MDQR+P MW GPRSP+VNHFPGPPP  M+SPR PF+N+FPS LPR+ SSP  V
Subjt:  PGLRNPEMSSSPTHQFHQHSPDQRTFHARGFSGSGDHGSPAMARPFSMDQRTPEMWHGPRSPFVNHFPGPPPWGMSSPRSPFINRFPSHLPREASSPGVV

Query:  SGPRGNYH-----------SPSPSPGYQGR-------------GRHGHRGGMTPSPRFGSGRGSGSHSRRFSSNESPRPEQFYNASMLEDPWKVLQPGIW
        SGPRGN +           SPSPSPGYQG              G HGHRG MTPSPRFG GRG GSH RRFSS++S RPEQFY+ SMLEDPWKVLQPGIW
Subjt:  SGPRGNYH-----------SPSPSPGYQGR-------------GRHGHRGGMTPSPRFGSGRGSGSHSRRFSSNESPRPEQFYNASMLEDPWKVLQPGIW

Query:  TTVAPSSSSANPSESWISKYGTKKARASDSSSGRSGSQPSLAEYLAASFNEAVN
          VAP  SSANPSESWISK+ TKKAR  D+SSGRSGSQPSLAEYLAASFNEA N
Subjt:  TTVAPSSSSANPSESWISKYGTKKARASDSSSGRSGSQPSLAEYLAASFNEAVN

XP_023533680.1 protein SICKLE [Cucurbita pepo subsp. pepo]1.4e-13373.13Show/hide
Query:  MEESEKRRERLRAMRMEASEADAANHAETSLPNQLSNPLVESSDTMAGQSETCTAPRFDYYTNPMAAFSSTRKRGN---SHN------------YVPSYR
        MEESEKRRERLRAMRMEAS+AD AN+ ETSLPN LSNPLVESS  M GQSE CTAPRFDYYTNPMAAFSS++KRGN   SH+            YVPS+ 
Subjt:  MEESEKRRERLRAMRMEASEADAANHAETSLPNQLSNPLVESSDTMAGQSETCTAPRFDYYTNPMAAFSSTRKRGN---SHN------------YVPSYR

Query:  TNSPATYVPSNFPGLRNPEMSSSPTHQFHQHSPDQRTFHARGFSGSGDHGSPAMARPFSMDQRTPEMWHGPRSPFVNHFPGPPPWGMSSPRSPFINRFPS
         +SP  YVPSNFPG+RNPEMS S  HQFHQHSPDQR F+ARG+SGSG HGSPAM RPF MDQR+P MW GPRSP+VNHFPGPPP GM+SPR PF+N+FPS
Subjt:  TNSPATYVPSNFPGLRNPEMSSSPTHQFHQHSPDQRTFHARGFSGSGDHGSPAMARPFSMDQRTPEMWHGPRSPFVNHFPGPPPWGMSSPRSPFINRFPS

Query:  HLPREASSPGVVSGPRGNYH-----------SPSPSPGYQGR-----GRHGHRGGMTPSPRFGSGRGSGSHSRRFSSNESPRPEQFYNASMLEDPWKVLQ
         LPR+  SP  VSGPRGN +           SPSPSPGYQG      G HGHRG MTPSPRFGSGRG GSH RRFSS+ES RPEQFYN SMLEDPWKVLQ
Subjt:  HLPREASSPGVVSGPRGNYH-----------SPSPSPGYQGR-----GRHGHRGGMTPSPRFGSGRGSGSHSRRFSSNESPRPEQFYNASMLEDPWKVLQ

Query:  PGIWTTVAPSSSSANPSESWISKYGTKKARASDSSSGRSGSQPSLAEYLAASFNEAVNEAP
        PGIW  VAP  SSAN SESWISK+ TKKAR SD+SSGRS SQPSLAEYLAASFNEA N  P
Subjt:  PGIWTTVAPSSSSANPSESWISKYGTKKARASDSSSGRSGSQPSLAEYLAASFNEAVNEAP

TrEMBL top hitse value%identityAlignment
A0A5D3C828 ACT11D09.51.5e-10462.43Show/hide
Query:  MEESEKRRERLRAMRMEASEADAANHAETSLPNQLSNPLVESSDTMAGQSETCTAPRFDYYTNPMAAFSSTRKRGNSHN------YVPSYRTNSPATYVP
        MEESEKRRERLRAMRMEA++AD  N+ ETSLPN LSNPLVESS TM GQ   CTAPRFDYYTNPMAAFS+++K+G   N      +VP +   S  TY+P
Subjt:  MEESEKRRERLRAMRMEASEADAANHAETSLPNQLSNPLVESSDTMAGQSETCTAPRFDYYTNPMAAFSSTRKRGNSHN------YVPSYRTNSPATYVP

Query:  SNFPGLRNPEMSSSPTHQFHQHSPDQRTFHARGFSGSGDHGSPAMARPFSMDQRTPEMWHGPRSPFVNHFPGPPPWGMSSPRSPFINRFPSHLPREASSP
          FPGLRNPEMS S THQFHQ+SPDQRTF+ARG S +G HGSP M RP++++Q  P MW GPR PFV                   N+FP+H PRE +S 
Subjt:  SNFPGLRNPEMSSSPTHQFHQHSPDQRTFHARGFSGSGDHGSPAMARPFSMDQRTPEMWHGPRSPFVNHFPGPPPWGMSSPRSPFINRFPSHLPREASSP

Query:  GVVSGPRGN-----------YHSPSPSPGYQ-----GRGRHGHRGGMTPSPRFGSGRGSGSHSRRFSSNESPRPEQFYNASMLEDPWKVLQPGIWTTVAP
          VSGPRGN           Y S SP+PG+      GRG HGH G MTPSPRFG GRG+G H R  S  +   PEQFYN SMLEDPWKVLQP IWTT+  
Subjt:  GVVSGPRGN-----------YHSPSPSPGYQ-----GRGRHGHRGGMTPSPRFGSGRGSGSHSRRFSSNESPRPEQFYNASMLEDPWKVLQPGIWTTVAP

Query:  SSSSANPSESWISKYGTKKARASDSSSGRSGS-QPSLAEYLAASFNEAVNEAPN
        SS+SA PSESWISK+GTKKAR SDSSSGRS S QPSLAEYLAASF EA+ +APN
Subjt:  SSSSANPSESWISKYGTKKARASDSSSGRSGS-QPSLAEYLAASFNEAVNEAPN

A0A6J1G5T4 protein SICKLE isoform X14.3e-13673.96Show/hide
Query:  MEESEKRRERLRAMRMEASEADAANHAETSLPNQLSNPLVESSDTMAGQSETCTAPRFDYYTNPMAAFSSTRKRGN---SHN------------YVPSYR
        MEESEKRRERLRAMRMEAS+AD AN+ ETSLPN LSNPLVESS  M GQSE CTAPRFDYYTNPMAAFSS++KRGN   SH+            YVPS+ 
Subjt:  MEESEKRRERLRAMRMEASEADAANHAETSLPNQLSNPLVESSDTMAGQSETCTAPRFDYYTNPMAAFSSTRKRGN---SHN------------YVPSYR

Query:  TNSPATYVPSNFPGLRNPEMSSSPTHQFHQHSPDQRTFHARGFSGSGDHGSPAMARPFSMDQRTPEMWHGPRSPFVNHFPGPPPWGMSSPRSPFINRFPS
         +SP  YVPSNFPG+RNPEMS S  HQFHQHSPDQR F+ARG+SGSG HGSPAM RPF MDQR+P MW GPRSP+VNHFPGPPP GM+SPR PF+N+FPS
Subjt:  TNSPATYVPSNFPGLRNPEMSSSPTHQFHQHSPDQRTFHARGFSGSGDHGSPAMARPFSMDQRTPEMWHGPRSPFVNHFPGPPPWGMSSPRSPFINRFPS

Query:  HLPREASSPGVVSGPRGNYH-----------SPSPSPGYQGR-----GRHGHRGGMTPSPRFGSGRGSGSHSRRFSSNESPRPEQFYNASMLEDPWKVLQ
         LPR+ SSP  VSGPRGN +           SPSPSPGYQG      G HGHRG MTPSPRFGSGRG GSH RRFSS+ES RPEQFYN SMLEDPWKVLQ
Subjt:  HLPREASSPGVVSGPRGNYH-----------SPSPSPGYQGR-----GRHGHRGGMTPSPRFGSGRGSGSHSRRFSSNESPRPEQFYNASMLEDPWKVLQ

Query:  PGIWTTVAPSSSSANPSESWISKYGTKKARASDSSSGRSGSQPSLAEYLAASFNEAVNEAP
        PGIW  VAP  SSANPSESWISK+ TKKAR SD+SSGRSGSQPSLAEYLAASFNEA N  P
Subjt:  PGIWTTVAPSSSSANPSESWISKYGTKKARASDSSSGRSGSQPSLAEYLAASFNEAVNEAP

A0A6J1G649 protein SICKLE isoform X29.2e-13976.5Show/hide
Query:  MEESEKRRERLRAMRMEASEADAANHAETSLPNQLSNPLVESSDTMAGQSETCTAPRFDYYTNPMAAFSSTRKRGN---SHNYVPSYRTNSPATYVPSNF
        MEESEKRRERLRAMRMEAS+AD AN+ ETSLPN LSNPLVESS  M GQSE CTAPRFDYYTNPMAAFSS++KRGN   SH+YVPS+  +SP  YVPSNF
Subjt:  MEESEKRRERLRAMRMEASEADAANHAETSLPNQLSNPLVESSDTMAGQSETCTAPRFDYYTNPMAAFSSTRKRGN---SHNYVPSYRTNSPATYVPSNF

Query:  PGLRNPEMSSSPTHQFHQHSPDQRTFHARGFSGSGDHGSPAMARPFSMDQRTPEMWHGPRSPFVNHFPGPPPWGMSSPRSPFINRFPSHLPREASSPGVV
        PG+RNPEMS S  HQFHQHSPDQR F+ARG+SGSG HGSPAM RPF MDQR+P MW GPRSP+VNHFPGPPP GM+SPR PF+N+FPS LPR+ SSP  V
Subjt:  PGLRNPEMSSSPTHQFHQHSPDQRTFHARGFSGSGDHGSPAMARPFSMDQRTPEMWHGPRSPFVNHFPGPPPWGMSSPRSPFINRFPSHLPREASSPGVV

Query:  SGPRGNYH-----------SPSPSPGYQGR-----GRHGHRGGMTPSPRFGSGRGSGSHSRRFSSNESPRPEQFYNASMLEDPWKVLQPGIWTTVAPSSS
        SGPRGN +           SPSPSPGYQG      G HGHRG MTPSPRFGSGRG GSH RRFSS+ES RPEQFYN SMLEDPWKVLQPGIW  VAP  S
Subjt:  SGPRGNYH-----------SPSPSPGYQGR-----GRHGHRGGMTPSPRFGSGRGSGSHSRRFSSNESPRPEQFYNASMLEDPWKVLQPGIWTTVAPSSS

Query:  SANPSESWISKYGTKKARASDSSSGRSGSQPSLAEYLAASFNEAVNEAP
        SANPSESWISK+ TKKAR SD+SSGRSGSQPSLAEYLAASFNEA N  P
Subjt:  SANPSESWISKYGTKKARASDSSSGRSGSQPSLAEYLAASFNEAVNEAP

A0A6J1KYW0 protein SICKLE isoform X12.0e-13373.73Show/hide
Query:  MEESEKRRERLRAMRMEASEADAANHAETSLPNQLSNPLVESSDTMAGQSETCTAPRFDYYTNPMAAFSSTRKRGN---SHNYVPSYRTNSPATYVPSNF
        MEESEKRRERLRAMRMEAS+AD AN+ ETSLPN LSNPLVESS  M GQSE CTAPRFDYYTNPMAAFSS++KRGN   SH YVPS+  +SP  YVPSNF
Subjt:  MEESEKRRERLRAMRMEASEADAANHAETSLPNQLSNPLVESSDTMAGQSETCTAPRFDYYTNPMAAFSSTRKRGN---SHNYVPSYRTNSPATYVPSNF

Query:  PGLRNPEMSSSPTHQFHQHSPDQRTFHARGFSGSGDHGSPAMARPFSMDQRTPEMWHGPRSPFVNHFPGPPPWGMSSPRSPFINRFPSHLPREASSPGVV
        PG+RNPEMS S  HQFHQHSPDQR F+ARG+SGSG HGSPAM RPF MDQR+P MW GPRSP+VNHFPGPPP  M+SPR PF+N+FPS LPR+ SSP  V
Subjt:  PGLRNPEMSSSPTHQFHQHSPDQRTFHARGFSGSGDHGSPAMARPFSMDQRTPEMWHGPRSPFVNHFPGPPPWGMSSPRSPFINRFPSHLPREASSPGVV

Query:  SGPRGNYH-----------SPSPSPGYQGR-------------GRHGHRGGMTPSPRFGSGRGSGSHSRRFSSNESPRPEQFYNASMLEDPWKVLQPGIW
        SGPRGN +           SPSPSPGYQG              G HGHRG MTPSPRFG GRG GSH RRFSS++S RPEQFY+ SMLEDPWKVLQPGIW
Subjt:  SGPRGNYH-----------SPSPSPGYQGR-------------GRHGHRGGMTPSPRFGSGRGSGSHSRRFSSNESPRPEQFYNASMLEDPWKVLQPGIW

Query:  TTVAPSSSSANPSESWISKYGTKKARASDSSSGRSGSQPSLAEYLAASFNEAVN
          VAP  SSANPSESWISK+ TKKAR  D+SSGRSGSQPSLAEYLAASFNEA N
Subjt:  TTVAPSSSSANPSESWISKYGTKKARASDSSSGRSGSQPSLAEYLAASFNEAVN

A0A6J1L0Q3 protein SICKLE isoform X22.0e-13373.73Show/hide
Query:  MEESEKRRERLRAMRMEASEADAANHAETSLPNQLSNPLVESSDTMAGQSETCTAPRFDYYTNPMAAFSSTRKRGN---SHNYVPSYRTNSPATYVPSNF
        MEESEKRRERLRAMRMEAS+AD AN+ ETSLPN LSNPLVESS  M GQSE CTAPRFDYYTNPMAAFSS++KRGN   SH YVPS+  +SP  YVPSNF
Subjt:  MEESEKRRERLRAMRMEASEADAANHAETSLPNQLSNPLVESSDTMAGQSETCTAPRFDYYTNPMAAFSSTRKRGN---SHNYVPSYRTNSPATYVPSNF

Query:  PGLRNPEMSSSPTHQFHQHSPDQRTFHARGFSGSGDHGSPAMARPFSMDQRTPEMWHGPRSPFVNHFPGPPPWGMSSPRSPFINRFPSHLPREASSPGVV
        PG+RNPEMS S  HQFHQHSPDQR F+ARG+SGSG HGSPAM RPF MDQR+P MW GPRSP+VNHFPGPPP  M+SPR PF+N+FPS LPR+ SSP  V
Subjt:  PGLRNPEMSSSPTHQFHQHSPDQRTFHARGFSGSGDHGSPAMARPFSMDQRTPEMWHGPRSPFVNHFPGPPPWGMSSPRSPFINRFPSHLPREASSPGVV

Query:  SGPRGNYH-----------SPSPSPGYQGR-------------GRHGHRGGMTPSPRFGSGRGSGSHSRRFSSNESPRPEQFYNASMLEDPWKVLQPGIW
        SGPRGN +           SPSPSPGYQG              G HGHRG MTPSPRFG GRG GSH RRFSS++S RPEQFY+ SMLEDPWKVLQPGIW
Subjt:  SGPRGNYH-----------SPSPSPGYQGR-------------GRHGHRGGMTPSPRFGSGRGSGSHSRRFSSNESPRPEQFYNASMLEDPWKVLQPGIW

Query:  TTVAPSSSSANPSESWISKYGTKKARASDSSSGRSGSQPSLAEYLAASFNEAVN
          VAP  SSANPSESWISK+ TKKAR  D+SSGRSGSQPSLAEYLAASFNEA N
Subjt:  TTVAPSSSSANPSESWISKYGTKKARASDSSSGRSGSQPSLAEYLAASFNEAVN

SwissProt top hitse value%identityAlignment
Q9SB47 Protein SICKLE5.0e-2536.76Show/hide
Query:  MEESEKRRERLRAMRMEA---SEADAANHAETSL-PNQLSNPLVESSDTMAGQSETCTAPRFDYYTNPMAAFSSTRK-RGNSHNYV--PSYRTNSPATYV
        ME+SEKR++ L+AMRMEA   ++ DA    ETS+    LSNPL E+S+      ET    RFDYYT+PMAA+SS +K +     Y+  PS++ +SP   V
Subjt:  MEESEKRRERLRAMRMEA---SEADAANHAETSL-PNQLSNPLVESSDTMAGQSETCTAPRFDYYTNPMAAFSSTRK-RGNSHNYV--PSYRTNSPATYV

Query:  PSNFPGLRNPEMSSSPTHQFHQHSPDQRTFHARGFSGSG-DHGSPAMARPFSMDQRTPEMWHGP-RSPFVNHFPGPPPWGMSSPRSPFINRFPSHLPREA
        P  FP    P +        +Q   +   FHA  +   G  H SP+   P       P  W+   R P VNH  GPP W    PR PF   F   +P   
Subjt:  PSNFPGLRNPEMSSSPTHQFHQHSPDQRTFHARGFSGSG-DHGSPAMARPFSMDQRTPEMWHGP-RSPFVNHFPGPPPWGMSSPRSPFINRFPSHLPREA

Query:  SSPGVVSGPRGNYHSPSPSPGYQGRGRHGHRGGMTPSPRFGSGRGSGSHSRRFSSNESPRP-----EQFYNASMLEDPWKVLQPGIWTTVAPSSSSANPS
        ++     G RG+Y++  P     GR      G   P+   G  RG G ++  F  +   RP     E+FY+ SM EDPWK L+P +W   + +SSS++  
Subjt:  SSPGVVSGPRGNYHSPSPSPGYQGRGRHGHRGGMTPSPRFGSGRGSGSHSRRFSSNESPRP-----EQFYNASMLEDPWKVLQPGIWTTVAPSSSSANPS

Query:  ESWISK-YGTKKARASDSSSGRSGSQPSLAEYLAASFNEA
        ++W+ K    KK+  S+++   S +Q SLAEYLAAS + A
Subjt:  ESWISK-YGTKKARASDSSSGRSGSQPSLAEYLAASFNEA

Arabidopsis top hitse value%identityAlignment
AT4G24500.1 hydroxyproline-rich glycoprotein family protein3.6e-2636.76Show/hide
Query:  MEESEKRRERLRAMRMEA---SEADAANHAETSL-PNQLSNPLVESSDTMAGQSETCTAPRFDYYTNPMAAFSSTRK-RGNSHNYV--PSYRTNSPATYV
        ME+SEKR++ L+AMRMEA   ++ DA    ETS+    LSNPL E+S+      ET    RFDYYT+PMAA+SS +K +     Y+  PS++ +SP   V
Subjt:  MEESEKRRERLRAMRMEA---SEADAANHAETSL-PNQLSNPLVESSDTMAGQSETCTAPRFDYYTNPMAAFSSTRK-RGNSHNYV--PSYRTNSPATYV

Query:  PSNFPGLRNPEMSSSPTHQFHQHSPDQRTFHARGFSGSG-DHGSPAMARPFSMDQRTPEMWHGP-RSPFVNHFPGPPPWGMSSPRSPFINRFPSHLPREA
        P  FP    P +        +Q   +   FHA  +   G  H SP+   P       P  W+   R P VNH  GPP W    PR PF   F   +P   
Subjt:  PSNFPGLRNPEMSSSPTHQFHQHSPDQRTFHARGFSGSG-DHGSPAMARPFSMDQRTPEMWHGP-RSPFVNHFPGPPPWGMSSPRSPFINRFPSHLPREA

Query:  SSPGVVSGPRGNYHSPSPSPGYQGRGRHGHRGGMTPSPRFGSGRGSGSHSRRFSSNESPRP-----EQFYNASMLEDPWKVLQPGIWTTVAPSSSSANPS
        ++     G RG+Y++  P     GR      G   P+   G  RG G ++  F  +   RP     E+FY+ SM EDPWK L+P +W   + +SSS++  
Subjt:  SSPGVVSGPRGNYHSPSPSPGYQGRGRHGHRGGMTPSPRFGSGRGSGSHSRRFSSNESPRP-----EQFYNASMLEDPWKVLQPGIWTTVAPSSSSANPS

Query:  ESWISK-YGTKKARASDSSSGRSGSQPSLAEYLAASFNEA
        ++W+ K    KK+  S+++   S +Q SLAEYLAAS + A
Subjt:  ESWISK-YGTKKARASDSSSGRSGSQPSLAEYLAASFNEA

AT4G24500.2 hydroxyproline-rich glycoprotein family protein2.6e-1632.64Show/hide
Query:  MEESEKRRERLRAMRMEA---SEADAANHAETSL-PNQLSNPLVESSDTMAGQSETCTAPRFDYYTNPMAAFSSTRKRGNSHNYVPSYRTNSPATYVPSN
        ME+SEKR++ L+AMRMEA   ++ DA    ETS+    LSNPL E+S+      ET                              S++ +SP   VP  
Subjt:  MEESEKRRERLRAMRMEA---SEADAANHAETSL-PNQLSNPLVESSDTMAGQSETCTAPRFDYYTNPMAAFSSTRKRGNSHNYVPSYRTNSPATYVPSN

Query:  FPGLRNPEMSSSPTHQFHQHSPDQRTFHARGFSGSG-DHGSPAMARPFSMDQRTPEMWHGP-RSPFVNHFPGPPPWGMSSPRSPFINRFPSHLPREASSP
        FP    P +        +Q   +   FHA  +   G  H SP+   P       P  W+   R P VNH  GPP W    PR PF   F   +P   ++ 
Subjt:  FPGLRNPEMSSSPTHQFHQHSPDQRTFHARGFSGSG-DHGSPAMARPFSMDQRTPEMWHGP-RSPFVNHFPGPPPWGMSSPRSPFINRFPSHLPREASSP

Query:  GVVSGPRGNYHSPSPSPGYQGRGRHGHRGGMTPSPRFGSGRGSGSHSRRFSSNESPRP-----EQFYNASMLEDPWKVLQPGIWTTVAPSSSSANPSESW
            G RG+Y++  P     GR      G   P+   G  RG G ++  F  +   RP     E+FY+ SM EDPWK L+P +W   + +SSS++  ++W
Subjt:  GVVSGPRGNYHSPSPSPGYQGRGRHGHRGGMTPSPRFGSGRGSGSHSRRFSSNESPRP-----EQFYNASMLEDPWKVLQPGIWTTVAPSSSSANPSESW

Query:  ISK-YGTKKARASDSSSGRSGSQPSLAEYLAASFNEA
        + K    KK+  S+++   S +Q SLAEYLAAS + A
Subjt:  ISK-YGTKKARASDSSSGRSGSQPSLAEYLAASFNEA

AT4G24500.3 hydroxyproline-rich glycoprotein family protein3.6e-2636.76Show/hide
Query:  MEESEKRRERLRAMRMEA---SEADAANHAETSL-PNQLSNPLVESSDTMAGQSETCTAPRFDYYTNPMAAFSSTRK-RGNSHNYV--PSYRTNSPATYV
        ME+SEKR++ L+AMRMEA   ++ DA    ETS+    LSNPL E+S+      ET    RFDYYT+PMAA+SS +K +     Y+  PS++ +SP   V
Subjt:  MEESEKRRERLRAMRMEA---SEADAANHAETSL-PNQLSNPLVESSDTMAGQSETCTAPRFDYYTNPMAAFSSTRK-RGNSHNYV--PSYRTNSPATYV

Query:  PSNFPGLRNPEMSSSPTHQFHQHSPDQRTFHARGFSGSG-DHGSPAMARPFSMDQRTPEMWHGP-RSPFVNHFPGPPPWGMSSPRSPFINRFPSHLPREA
        P  FP    P +        +Q   +   FHA  +   G  H SP+   P       P  W+   R P VNH  GPP W    PR PF   F   +P   
Subjt:  PSNFPGLRNPEMSSSPTHQFHQHSPDQRTFHARGFSGSG-DHGSPAMARPFSMDQRTPEMWHGP-RSPFVNHFPGPPPWGMSSPRSPFINRFPSHLPREA

Query:  SSPGVVSGPRGNYHSPSPSPGYQGRGRHGHRGGMTPSPRFGSGRGSGSHSRRFSSNESPRP-----EQFYNASMLEDPWKVLQPGIWTTVAPSSSSANPS
        ++     G RG+Y++  P     GR      G   P+   G  RG G ++  F  +   RP     E+FY+ SM EDPWK L+P +W   + +SSS++  
Subjt:  SSPGVVSGPRGNYHSPSPSPGYQGRGRHGHRGGMTPSPRFGSGRGSGSHSRRFSSNESPRP-----EQFYNASMLEDPWKVLQPGIWTTVAPSSSSANPS

Query:  ESWISK-YGTKKARASDSSSGRSGSQPSLAEYLAASFNEA
        ++W+ K    KK+  S+++   S +Q SLAEYLAAS + A
Subjt:  ESWISK-YGTKKARASDSSSGRSGSQPSLAEYLAASFNEA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAATCTGAGAAACGAAGGGAGAGACTCAGAGCAATGCGAATGGAAGCTTCTGAGGCTGATGCCGCTAATCATGCCGAAACTTCTTTGCCTAATCAACTTTCGAA
TCCATTGGTGGAGTCGTCGGATACCATGGCAGGGCAATCGGAGACTTGTACCGCCCCGAGATTCGACTATTATACGAACCCTATGGCTGCATTTTCCTCTACCAGGAAGA
GAGGGAATTCTCATAATTATGTTCCTTCCTACCGTACTAATTCTCCTGCAACTTATGTACCATCTAATTTTCCAGGATTGAGAAACCCTGAAATGTCTTCCTCGCCGACT
CATCAATTCCATCAACATTCACCTGACCAGAGAACGTTCCATGCACGAGGTTTTAGTGGATCTGGTGACCATGGCAGCCCAGCAATGGCTAGACCTTTTTCAATGGATCA
AAGAACTCCTGAAATGTGGCATGGACCTAGAAGTCCATTTGTCAACCACTTTCCTGGCCCTCCTCCATGGGGAATGAGCTCCCCCAGGAGCCCATTTATCAATCGATTCC
CTAGCCATCTACCAAGGGAAGCGAGCTCCCCCGGCGTTGTCTCTGGACCAAGAGGTAATTACCATAGTCCAAGCCCCAGTCCAGGGTATCAAGGCAGAGGCAGACATGGT
CATCGTGGCGGTATGACCCCTAGTCCAAGATTTGGCTCTGGTCGAGGTTCTGGTTCTCACAGTCGTCGTTTTTCATCAAATGAATCACCCAGACCAGAACAATTTTACAA
TGCATCCATGCTTGAAGATCCCTGGAAGGTTCTGCAACCTGGTATTTGGACGACAGTTGCTCCATCGAGCAGTTCTGCCAACCCTTCAGAATCTTGGATTTCGAAGTACG
GTACGAAAAAGGCAAGAGCTTCAGATTCTTCTTCTGGCAGGTCAGGGTCTCAACCCAGCCTCGCCGAATACCTGGCTGCTTCCTTCAATGAAGCAGTCAACGAGGCACCA
AATGTGTAA
mRNA sequenceShow/hide mRNA sequence
AATTTGTCTGAAGAATGTTATACAGAGAAGGAAGCAGTTTCTAGGGTTTTTTTCTTCGGAGTCTCCATTTCAATTCTCTTCAATCTTTATCTTGATTAAGAATCGTTATT
GTTCGATTTAATGGAAGAATCTGAGAAACGAAGGGAGAGACTCAGAGCAATGCGAATGGAAGCTTCTGAGGCTGATGCCGCTAATCATGCCGAAACTTCTTTGCCTAATC
AACTTTCGAATCCATTGGTGGAGTCGTCGGATACCATGGCAGGGCAATCGGAGACTTGTACCGCCCCGAGATTCGACTATTATACGAACCCTATGGCTGCATTTTCCTCT
ACCAGGAAGAGAGGGAATTCTCATAATTATGTTCCTTCCTACCGTACTAATTCTCCTGCAACTTATGTACCATCTAATTTTCCAGGATTGAGAAACCCTGAAATGTCTTC
CTCGCCGACTCATCAATTCCATCAACATTCACCTGACCAGAGAACGTTCCATGCACGAGGTTTTAGTGGATCTGGTGACCATGGCAGCCCAGCAATGGCTAGACCTTTTT
CAATGGATCAAAGAACTCCTGAAATGTGGCATGGACCTAGAAGTCCATTTGTCAACCACTTTCCTGGCCCTCCTCCATGGGGAATGAGCTCCCCCAGGAGCCCATTTATC
AATCGATTCCCTAGCCATCTACCAAGGGAAGCGAGCTCCCCCGGCGTTGTCTCTGGACCAAGAGGTAATTACCATAGTCCAAGCCCCAGTCCAGGGTATCAAGGCAGAGG
CAGACATGGTCATCGTGGCGGTATGACCCCTAGTCCAAGATTTGGCTCTGGTCGAGGTTCTGGTTCTCACAGTCGTCGTTTTTCATCAAATGAATCACCCAGACCAGAAC
AATTTTACAATGCATCCATGCTTGAAGATCCCTGGAAGGTTCTGCAACCTGGTATTTGGACGACAGTTGCTCCATCGAGCAGTTCTGCCAACCCTTCAGAATCTTGGATT
TCGAAGTACGGTACGAAAAAGGCAAGAGCTTCAGATTCTTCTTCTGGCAGGTCAGGGTCTCAACCCAGCCTCGCCGAATACCTGGCTGCTTCCTTCAATGAAGCAGTCAA
CGAGGCACCAAATGTGTAAAACATGTATCCAATATCCAGACGTTTCATCCTACTGTAATTCTTAAACTAGGCATTTTCTAAATCCCGGCCCATTTTTTTTCTATAGACTG
GTTTAATGTGGTGATTTCTAGGGAATTTTGTAGTGCTTGTTAATGACAGTTATCTCTCTTTGTTAGGTTTTGACTGTTATAAGTTCTTTTCAGCTATATATGTTTAGACT
ATGGA
Protein sequenceShow/hide protein sequence
MEESEKRRERLRAMRMEASEADAANHAETSLPNQLSNPLVESSDTMAGQSETCTAPRFDYYTNPMAAFSSTRKRGNSHNYVPSYRTNSPATYVPSNFPGLRNPEMSSSPT
HQFHQHSPDQRTFHARGFSGSGDHGSPAMARPFSMDQRTPEMWHGPRSPFVNHFPGPPPWGMSSPRSPFINRFPSHLPREASSPGVVSGPRGNYHSPSPSPGYQGRGRHG
HRGGMTPSPRFGSGRGSGSHSRRFSSNESPRPEQFYNASMLEDPWKVLQPGIWTTVAPSSSSANPSESWISKYGTKKARASDSSSGRSGSQPSLAEYLAASFNEAVNEAP
NV