CuGenDBv2

MC02g1037 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC02g1037
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: protein SICKLE isoform X2
Genome location: MC02:8385588..8389501
RNA-Seq Expression: MC02g1037
Synteny: MC02g1037
Gene Ontology terms:
GO:0000398 - mRNA splicing, via spliceosome (biological process)
GO:0035196 - production of miRNAs involved in gene silencing by miRNA (biological process)
GO:1903730 - regulation of phosphatidate phosphatase activity (biological process)
GO:0016020 - membrane (cellular component)
InterPro domains: IPR039292 - Protein SICKLE
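
The genome location field above follows a `chromosome:start..end` pattern. As a minimal sketch, the snippet below splits such a string and reports the span length. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re

def parse_location(location: str):
    """Split a 'chrom:start..end' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1. The record itself does not confirm this.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return chrom, start, end, end - start + 1

# Location string copied from this record.
chrom, start, end, span = parse_location("MC02:8385588..8389501")
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption the gene spans 3914 bp on MC02, which covers introns and untranslated regions as well as the coding sequence.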


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022148514.1 M-phase-specific PLK1-interacting protein [Momordica charantia] | e-value: 1.57e-176 | identity: 100%
Query:  MSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSPSIVPGPRGNS
        MSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSPSIVPGPRGNS
Subjt:  MSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSPSIVPGPRGNS

Query:  YTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFCGNMTPTSRFGSGRGSSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKDLQPGIWRTTARANSSESWI
        YTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFCGNMTPTSRFGSGRGSSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKDLQPGIWRTTARANSSESWI
Subjt:  YTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFCGNMTPTSRFGSGRGSSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKDLQPGIWRTTARANSSESWI

Query:  SRSRMKKARVSEPFSSSSSQPSLAEYLAASFNEAVDDAPSV
        SRSRMKKARVSEPFSSSSSQPSLAEYLAASFNEAVDDAPSV
Subjt:  SRSRMKKARVSEPFSSSSSQPSLAEYLAASFNEAVDDAPSV

XP_022947202.1 protein SICKLE isoform X1 [Cucurbita moschata] | e-value: 8.02e-164 | identity: 70.52%
Query:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAALLEQSEPCTAPRFDYYTNPMAAFSATKKRGN-------FDNHGVS--DNYVPSHH
        MEESEKRRERLRAMRMEA+QA+V NY ETSLPNHLSNPLVESSA +L QSEPCTAPRFDYYTNPMAAFS++KKRGN         +H  S  D YVPSH 
Subjt:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAALLEQSEPCTAPRFDYYTNPMAAFSATKKRGN-------FDNHGVS--DNYVPSHH

Query:  NNSPATFVPSNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPS
        N+SP  +VPSNF G+RNPEMSPS  HQFHQHSPDQR F+ARG++GSG HG PA+PRPFPMD+ +P +W GPRSP+VN FPG PPRGM+SPR PFVN FPS
Subjt:  NNSPATFVPSNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPS

Query:  HPPRDMSSPSIVPGPRGNSYTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFCGNMTPTSRFGSGRGSSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKD
          PRDMSSPS V GPRGNSY N   D VN+ +PSPS GY+GS SPG GSHG  GNMTP+ RFGSGRG   HGR FSS++S RPE F  YN SML+DPWK 
Subjt:  HPPRDMSSPSIVPGPRGNSYTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFCGNMTPTSRFGSGRGSSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKD

Query:  LQPGIWRTTA----RANSSESWISRSRMKKARVSEPFSS-SSSQPSLAEYLAASFNEAVDDAP
        LQPGIWR  A     AN SESWIS+   KKARVS+  S  S SQPSLAEYLAASFNEA ++ P
Subjt:  LQPGIWRTTA----RANSSESWISRSRMKKARVSEPFSS-SSSQPSLAEYLAASFNEAVDDAP

XP_022947209.1 protein SICKLE isoform X2 [Cucurbita moschata] | e-value: 1.93e-166 | identity: 72.03%
Query:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAALLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPATFVP
        MEESEKRRERLRAMRMEA+QA+V NY ETSLPNHLSNPLVESSA +L QSEPCTAPRFDYYTNPMAAFS++KKRGN     VS +YVPSH N+SP  +VP
Subjt:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAALLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPATFVP

Query:  SNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSP
        SNF G+RNPEMSPS  HQFHQHSPDQR F+ARG++GSG HG PA+PRPFPMD+ +P +W GPRSP+VN FPG PPRGM+SPR PFVN FPS  PRDMSSP
Subjt:  SNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSP

Query:  SIVPGPRGNSYTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFCGNMTPTSRFGSGRGSSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKDLQPGIWRTT
        S V GPRGNSY N   D VN+ +PSPS GY+GS SPG GSHG  GNMTP+ RFGSGRG   HGR FSS++S RPE F  YN SML+DPWK LQPGIWR  
Subjt:  SIVPGPRGNSYTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFCGNMTPTSRFGSGRGSSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKDLQPGIWRTT

Query:  A----RANSSESWISRSRMKKARVSEPFSS-SSSQPSLAEYLAASFNEAVDDAP
        A     AN SESWIS+   KKARVS+  S  S SQPSLAEYLAASFNEA ++ P
Subjt:  A----RANSSESWISRSRMKKARVSEPFSS-SSSQPSLAEYLAASFNEAVDDAP

XP_023007500.1 protein SICKLE isoform X2 [Cucurbita maxima] | e-value: 7.23e-159 | identity: 69.72%
Query:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAALLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPATFVP
        MEESEKRRERLRAMRMEA+QA+V NY ETSLPNHLSNPLVESSA +L QSEPCTAPRFDYYTNPMAAFS++KKRGN     VS  YVPSH N+SP  +VP
Subjt:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAALLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPATFVP

Query:  SNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSP
        SNF G+RNPEMSPS  HQFHQHSPDQR F+ARG++GSG HG PA+PRPFPMD+ +P +W GPRSP+VN FPG PPR M+SPR PFVN FPS  PRDMSSP
Subjt:  SNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSP

Query:  SIVPGPRGNSYTNPMQDRVNYHTPSPSSGYEGSPSPGR--------GSHGFCGNMTPTSRFGSGRGSSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKDL
        S V GPRGNSY +  QD VN+ +PSPS GY+GS SPG         GSHG  GNMTP+ RFG GRG   HGR FSS+KS RPE F  Y+ SML+DPWK L
Subjt:  SIVPGPRGNSYTNPMQDRVNYHTPSPSSGYEGSPSPGR--------GSHGFCGNMTPTSRFGSGRGSSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKDL

Query:  QPGIWRTTA----RANSSESWISRSRMKKARVSEPFSS-SSSQPSLAEYLAASFNEAVDD
        QPGIWR  A     AN SESWIS+   KKARV +  S  S SQPSLAEYLAASFNEA ++
Subjt:  QPGIWRTTA----RANSSESWISRSRMKKARVSEPFSS-SSSQPSLAEYLAASFNEAVDD

XP_023533680.1 protein SICKLE [Cucurbita pepo subsp. pepo] | e-value: 1.39e-164 | identity: 70.8%
Query:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAALLEQSEPCTAPRFDYYTNPMAAFSATKKRGN-------FDNHGVS--DNYVPSHH
        MEESEKRRERLRAMRMEA+QA+V NY ETSLPNHLSNPLVESSA +L QSEPCTAPRFDYYTNPMAAFS++KKRGN         +H  S  D YVPSH 
Subjt:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAALLEQSEPCTAPRFDYYTNPMAAFSATKKRGN-------FDNHGVS--DNYVPSHH

Query:  NNSPATFVPSNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPS
        N+SP  +VPSNF G+RNPEMSPS  HQFHQHSPDQR F+ARG++GSG HG PA+PRPFPMD+ +P +W GPRSP+VN FPG PPRGM+SPR PFVN FPS
Subjt:  NNSPATFVPSNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPS

Query:  HPPRDMSSPSIVPGPRGNSYTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFCGNMTPTSRFGSGRGSSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKD
          PRDM SPS V GPRGNSY N  QD VN+ +PSPS GY+GS SPG GSHG  GNMTP+ RFGSGRG   HGR FSS++S RPE F  YN SML+DPWK 
Subjt:  HPPRDMSSPSIVPGPRGNSYTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFCGNMTPTSRFGSGRGSSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKD

Query:  LQPGIWRTTA----RANSSESWISRSRMKKARVSEPFSS-SSSQPSLAEYLAASFNEAVDDAP
        LQPGIWR  A     AN+SESWIS+   KKARVS+  S  SSSQPSLAEYLAASFNEA ++ P
Subjt:  LQPGIWRTTA----RANSSESWISRSRMKKARVSEPFSS-SSSQPSLAEYLAASFNEAVDDAP

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1D482 M-phase-specific PLK1-interacting protein | e-value: 7.60e-177 | identity: 100%
Query:  MSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSPSIVPGPRGNS
        MSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSPSIVPGPRGNS
Subjt:  MSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSPSIVPGPRGNS

Query:  YTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFCGNMTPTSRFGSGRGSSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKDLQPGIWRTTARANSSESWI
        YTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFCGNMTPTSRFGSGRGSSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKDLQPGIWRTTARANSSESWI
Subjt:  YTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFCGNMTPTSRFGSGRGSSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKDLQPGIWRTTARANSSESWI

Query:  SRSRMKKARVSEPFSSSSSQPSLAEYLAASFNEAVDDAPSV
        SRSRMKKARVSEPFSSSSSQPSLAEYLAASFNEAVDDAPSV
Subjt:  SRSRMKKARVSEPFSSSSSQPSLAEYLAASFNEAVDDAPSV

A0A6J1G5T4 protein SICKLE isoform X1 | e-value: 3.88e-164 | identity: 70.52%
Query:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAALLEQSEPCTAPRFDYYTNPMAAFSATKKRGN-------FDNHGVS--DNYVPSHH
        MEESEKRRERLRAMRMEA+QA+V NY ETSLPNHLSNPLVESSA +L QSEPCTAPRFDYYTNPMAAFS++KKRGN         +H  S  D YVPSH 
Subjt:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAALLEQSEPCTAPRFDYYTNPMAAFSATKKRGN-------FDNHGVS--DNYVPSHH

Query:  NNSPATFVPSNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPS
        N+SP  +VPSNF G+RNPEMSPS  HQFHQHSPDQR F+ARG++GSG HG PA+PRPFPMD+ +P +W GPRSP+VN FPG PPRGM+SPR PFVN FPS
Subjt:  NNSPATFVPSNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPS

Query:  HPPRDMSSPSIVPGPRGNSYTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFCGNMTPTSRFGSGRGSSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKD
          PRDMSSPS V GPRGNSY N   D VN+ +PSPS GY+GS SPG GSHG  GNMTP+ RFGSGRG   HGR FSS++S RPE F  YN SML+DPWK 
Subjt:  HPPRDMSSPSIVPGPRGNSYTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFCGNMTPTSRFGSGRGSSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKD

Query:  LQPGIWRTTA----RANSSESWISRSRMKKARVSEPFSS-SSSQPSLAEYLAASFNEAVDDAP
        LQPGIWR  A     AN SESWIS+   KKARVS+  S  S SQPSLAEYLAASFNEA ++ P
Subjt:  LQPGIWRTTA----RANSSESWISRSRMKKARVSEPFSS-SSSQPSLAEYLAASFNEAVDDAP

A0A6J1G649 protein SICKLE isoform X2 | e-value: 9.34e-167 | identity: 72.03%
Query:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAALLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPATFVP
        MEESEKRRERLRAMRMEA+QA+V NY ETSLPNHLSNPLVESSA +L QSEPCTAPRFDYYTNPMAAFS++KKRGN     VS +YVPSH N+SP  +VP
Subjt:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAALLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPATFVP

Query:  SNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSP
        SNF G+RNPEMSPS  HQFHQHSPDQR F+ARG++GSG HG PA+PRPFPMD+ +P +W GPRSP+VN FPG PPRGM+SPR PFVN FPS  PRDMSSP
Subjt:  SNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSP

Query:  SIVPGPRGNSYTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFCGNMTPTSRFGSGRGSSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKDLQPGIWRTT
        S V GPRGNSY N   D VN+ +PSPS GY+GS SPG GSHG  GNMTP+ RFGSGRG   HGR FSS++S RPE F  YN SML+DPWK LQPGIWR  
Subjt:  SIVPGPRGNSYTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFCGNMTPTSRFGSGRGSSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKDLQPGIWRTT

Query:  A----RANSSESWISRSRMKKARVSEPFSS-SSSQPSLAEYLAASFNEAVDDAP
        A     AN SESWIS+   KKARVS+  S  S SQPSLAEYLAASFNEA ++ P
Subjt:  A----RANSSESWISRSRMKKARVSEPFSS-SSSQPSLAEYLAASFNEAVDDAP

A0A6J1KYW0 protein SICKLE isoform X1 | e-value: 4.18e-159 | identity: 69.72%
Query:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAALLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPATFVP
        MEESEKRRERLRAMRMEA+QA+V NY ETSLPNHLSNPLVESSA +L QSEPCTAPRFDYYTNPMAAFS++KKRGN     VS  YVPSH N+SP  +VP
Subjt:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAALLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPATFVP

Query:  SNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSP
        SNF G+RNPEMSPS  HQFHQHSPDQR F+ARG++GSG HG PA+PRPFPMD+ +P +W GPRSP+VN FPG PPR M+SPR PFVN FPS  PRDMSSP
Subjt:  SNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSP

Query:  SIVPGPRGNSYTNPMQDRVNYHTPSPSSGYEGSPSPGR--------GSHGFCGNMTPTSRFGSGRGSSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKDL
        S V GPRGNSY +  QD VN+ +PSPS GY+GS SPG         GSHG  GNMTP+ RFG GRG   HGR FSS+KS RPE F  Y+ SML+DPWK L
Subjt:  SIVPGPRGNSYTNPMQDRVNYHTPSPSSGYEGSPSPGR--------GSHGFCGNMTPTSRFGSGRGSSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKDL

Query:  QPGIWRTTA----RANSSESWISRSRMKKARVSEPFSS-SSSQPSLAEYLAASFNEAVDD
        QPGIWR  A     AN SESWIS+   KKARV +  S  S SQPSLAEYLAASFNEA ++
Subjt:  QPGIWRTTA----RANSSESWISRSRMKKARVSEPFSS-SSSQPSLAEYLAASFNEAVDD

A0A6J1L0Q3 protein SICKLE isoform X2 | e-value: 3.50e-159 | identity: 69.72%
Query:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAALLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPATFVP
        MEESEKRRERLRAMRMEA+QA+V NY ETSLPNHLSNPLVESSA +L QSEPCTAPRFDYYTNPMAAFS++KKRGN     VS  YVPSH N+SP  +VP
Subjt:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAALLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPATFVP

Query:  SNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSP
        SNF G+RNPEMSPS  HQFHQHSPDQR F+ARG++GSG HG PA+PRPFPMD+ +P +W GPRSP+VN FPG PPR M+SPR PFVN FPS  PRDMSSP
Subjt:  SNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSP

Query:  SIVPGPRGNSYTNPMQDRVNYHTPSPSSGYEGSPSPGR--------GSHGFCGNMTPTSRFGSGRGSSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKDL
        S V GPRGNSY +  QD VN+ +PSPS GY+GS SPG         GSHG  GNMTP+ RFG GRG   HGR FSS+KS RPE F  Y+ SML+DPWK L
Subjt:  SIVPGPRGNSYTNPMQDRVNYHTPSPSSGYEGSPSPGR--------GSHGFCGNMTPTSRFGSGRGSSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKDL

Query:  QPGIWRTTA----RANSSESWISRSRMKKARVSEPFSS-SSSQPSLAEYLAASFNEAVDD
        QPGIWR  A     AN SESWIS+   KKARV +  S  S SQPSLAEYLAASFNEA ++
Subjt:  QPGIWRTTA----RANSSESWISRSRMKKARVSEPFSS-SSSQPSLAEYLAASFNEAVDD

SwissProt top hits (e-value, % identity, alignment)
Q9SB47 Protein SICKLE | e-value: 6.9e-25 | identity: 35.16%
Query:  MEESEKRRERLRAMRMEAAQAN---VGNYAETSL-PNHLSNPLVESSAALLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPA
        ME+SEKR++ L+AMRMEAA  N        ETS+   HLSNPL E+S     Q +     RFDYYT+PMAA+S+ KK        +S    PSH  +SP 
Subjt:  MEESEKRRERLRAMRMEAAQAN---VGNYAETSL-PNHLSNPLVESSAALLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPA

Query:  TFVPSNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRD
          VP  F     P + P      +Q   +   FHA  +   G      +    P  RG P  W+       N F           R P VNH  S PP+ 
Subjt:  TFVPSNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRD

Query:  MSSPSIVPGPRGNSYTNPMQDRVNY-HTPSPSSGYEGSPSPGRGSHGFCGNMTPTSRFGSGRG---SSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKDL
        +  P        N   N    R +Y +TP   S Y      GR +  + GN  P S  G  RG   ++  GR     + + P    FY+ SM +DPWK L
Subjt:  MSSPSIVPGPRGNSYTNPMQDRVNY-HTPSPSSGYEGSPSPGRGSHGFCGNMTPTSRFGSGRG---SSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKDL

Query:  QPGIWRTTARANSSES----WISRS-RMKKARVSE-PFSSSSSQPSLAEYLAASFNEAVDDAPS
        +P +W+  + A+SS S    W+ +S   KK+  SE    +SS+Q SLAEYLAAS + A  D  S
Subjt:  QPGIWRTTARANSSES----WISRS-RMKKARVSE-PFSSSSSQPSLAEYLAASFNEAVDDAPS

Arabidopsis top hits (e-value, % identity, alignment)
AT4G24500.1 hydroxyproline-rich glycoprotein family protein | e-value: 4.9e-26 | identity: 35.16%
Query:  MEESEKRRERLRAMRMEAAQAN---VGNYAETSL-PNHLSNPLVESSAALLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPA
        ME+SEKR++ L+AMRMEAA  N        ETS+   HLSNPL E+S     Q +     RFDYYT+PMAA+S+ KK        +S    PSH  +SP 
Subjt:  MEESEKRRERLRAMRMEAAQAN---VGNYAETSL-PNHLSNPLVESSAALLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPA

Query:  TFVPSNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRD
          VP  F     P + P      +Q   +   FHA  +   G      +    P  RG P  W+       N F           R P VNH  S PP+ 
Subjt:  TFVPSNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRD

Query:  MSSPSIVPGPRGNSYTNPMQDRVNY-HTPSPSSGYEGSPSPGRGSHGFCGNMTPTSRFGSGRG---SSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKDL
        +  P        N   N    R +Y +TP   S Y      GR +  + GN  P S  G  RG   ++  GR     + + P    FY+ SM +DPWK L
Subjt:  MSSPSIVPGPRGNSYTNPMQDRVNY-HTPSPSSGYEGSPSPGRGSHGFCGNMTPTSRFGSGRG---SSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKDL

Query:  QPGIWRTTARANSSES----WISRS-RMKKARVSE-PFSSSSSQPSLAEYLAASFNEAVDDAPS
        +P +W+  + A+SS S    W+ +S   KK+  SE    +SS+Q SLAEYLAAS + A  D  S
Subjt:  QPGIWRTTARANSSES----WISRS-RMKKARVSE-PFSSSSSQPSLAEYLAASFNEAVDDAPS

AT4G24500.2 hydroxyproline-rich glycoprotein family protein | e-value: 1.3e-15 | identity: 31.32%
Query:  MEESEKRRERLRAMRMEAAQAN---VGNYAETSL-PNHLSNPLVESSAALLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPA
        ME+SEKR++ L+AMRMEAA  N        ETS+   HLSNPL E+S                                   NH        SH  +SP 
Subjt:  MEESEKRRERLRAMRMEAAQAN---VGNYAETSL-PNHLSNPLVESSAALLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPA

Query:  TFVPSNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRD
          VP  F     P + P      +Q   +   FHA  +   G      +    P  RG P  W+       N F           R P VNH  S PP+ 
Subjt:  TFVPSNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRD

Query:  MSSPSIVPGPRGNSYTNPMQDRVNY-HTPSPSSGYEGSPSPGRGSHGFCGNMTPTSRFGSGRG---SSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKDL
        +  P        N   N    R +Y +TP   S Y      GR +  + GN  P S  G  RG   ++  GR     + + P    FY+ SM +DPWK L
Subjt:  MSSPSIVPGPRGNSYTNPMQDRVNY-HTPSPSSGYEGSPSPGRGSHGFCGNMTPTSRFGSGRG---SSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKDL

Query:  QPGIWRTTARANSSES----WISRS-RMKKARVSE-PFSSSSSQPSLAEYLAASFNEAVDDAPS
        +P +W+  + A+SS S    W+ +S   KK+  SE    +SS+Q SLAEYLAAS + A  D  S
Subjt:  QPGIWRTTARANSSES----WISRS-RMKKARVSE-PFSSSSSQPSLAEYLAASFNEAVDDAPS

AT4G24500.3 hydroxyproline-rich glycoprotein family protein | e-value: 4.9e-26 | identity: 35.16%
Query:  MEESEKRRERLRAMRMEAAQAN---VGNYAETSL-PNHLSNPLVESSAALLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPA
        ME+SEKR++ L+AMRMEAA  N        ETS+   HLSNPL E+S     Q +     RFDYYT+PMAA+S+ KK        +S    PSH  +SP 
Subjt:  MEESEKRRERLRAMRMEAAQAN---VGNYAETSL-PNHLSNPLVESSAALLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPA

Query:  TFVPSNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRD
          VP  F     P + P      +Q   +   FHA  +   G      +    P  RG P  W+       N F           R P VNH  S PP+ 
Subjt:  TFVPSNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRD

Query:  MSSPSIVPGPRGNSYTNPMQDRVNY-HTPSPSSGYEGSPSPGRGSHGFCGNMTPTSRFGSGRG---SSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKDL
        +  P        N   N    R +Y +TP   S Y      GR +  + GN  P S  G  RG   ++  GR     + + P    FY+ SM +DPWK L
Subjt:  MSSPSIVPGPRGNSYTNPMQDRVNY-HTPSPSSGYEGSPSPGRGSHGFCGNMTPTSRFGSGRG---SSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKDL

Query:  QPGIWRTTARANSSES----WISRS-RMKKARVSE-PFSSSSSQPSLAEYLAASFNEAVDDAPS
        +P +W+  + A+SS S    W+ +S   KK+  SE    +SS+Q SLAEYLAAS + A  D  S
Subjt:  QPGIWRTTARANSSES----WISRS-RMKKARVSE-PFSSSSSQPSLAEYLAASFNEAVDDAPS
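
The % identity values listed for the hits above relate to the middle line of each alignment block, where a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below shows that relationship for a single block. It assumes the usual BLAST-style convention (identical positions divided by alignment length); the function name `percent_identity` and the toy strings are illustrative and not taken from the database.

```python
def percent_identity(query_line: str, midline: str) -> float:
    """Percent identity of one alignment block.

    query_line: the aligned query row (may contain '-' gap characters).
    midline:    the row between query and subject; letters are identities,
                '+' is a conservative substitution, ' ' is a mismatch or gap.
    The alignment length is taken from the query row because trailing
    spaces of the midline are often trimmed in copied text.
    """
    if not query_line:
        raise ValueError("empty alignment block")
    if len(midline) > len(query_line):
        raise ValueError("midline is longer than the query row")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query_line)

# Toy block: 6 identical positions out of 8 aligned columns.
print(percent_identity("MEESEKRR", "ME+SEK R"))
```

For a hit that spans several blocks, the identical positions and the column counts of all blocks would need to be summed before dividing, since the reported value covers the whole alignment.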


Sequences
CDS sequence
ATGGAAGAATCTGAGAAACGAAGGGAGAGGCTAAGAGCAATGCGAATGGAAGCTGCCCAGGCTAATGTGGGTAATTATGCCGAGACTTCTCTGCCTAATCATCTTTCCAA
TCCACTGGTCGAGTCCTCAGCAGCCTTGTTAGAGCAATCAGAACCATGTACCGCCCCAAGATTTGACTATTACACAAACCCTATGGCTGCATTTTCTGCTACCAAGAAGA
GAGGGAACTTTGATAATCATGGCGTGTCAGATAATTATGTTCCTTCTCACCACAATAATTCTCCAGCAACTTTTGTTCCATCAAATTTTGCAGGATTGAGAAACCCTGAA
ATGTCTCCCTCTCCAACTCATCAATTCCATCAACATTCACCTGACCAGAGAATGTTTCATGCACGAGGGTTTAATGGATCTGGTTGCCATGGTGGCCCAGCGATTCCCAG
GCCGTTTCCTATGGATCGAGGAACTCCTGGTATCTGGAGCGGACCGAGAAGCCCATTTGTCAACCAATTCCCTGGCCATCCTCCAAGGGGGATGAGCTCCCCCAGAAGCC
CATTTGTCAACCACTTCCCTAGCCATCCACCAAGGGATATGAGCTCCCCCAGCATTGTTCCTGGACCAAGAGGTAATTCTTACACCAATCCAATGCAAGACAGGGTTAAT
TACCATACTCCTAGCCCTAGTTCAGGGTATGAAGGCAGTCCTAGCCCAGGTCGAGGCAGCCATGGTTTTTGTGGCAATATGACCCCTACTTCAAGATTTGGCTCTGGACG
AGGTTCCAGTTATCATGGTCGTCACTTTTCATCGAACAAATCGCTCAGGCCCGAGCATTTTCCATTTTATAATGAATCCATGCTTGACGATCCCTGGAAAGATTTGCAAC
CTGGTATTTGGAGGACAACCGCTCGTGCCAACTCTTCGGAATCTTGGATTTCAAGATCCCGTATGAAAAAAGCAAGAGTTTCAGAACCTTTCAGCAGCTCAAGCTCTCAA
CCTAGCCTCGCCGAGTACCTGGCTGCCTCGTTCAATGAAGCAGTGGACGATGCACCAAGTGTGTGA
mRNA sequence
ATAAAATTTTATCTTCCTCTCAGCCGTAGACCTCCCCCCTTCGTGCCCTTCCCTCTCTTGATTTCTATCACCGCGCACCCGCCTTCTCCTTTTCCTCCGGCGAGCAGCAA
ACTCCACGGCGGCGGCAGCAGCGGCGGACGGTCTCCAGCGAGCTCAGCCTCACGCCGATACCCACGAGCGGCGGTCCTCGAGAACTCGATCGGCGGCATCTTCACGCATC
TTCAGCAACTACTTTTCTTAAACCCAGTCCTCTTTGCTCTTTCTCTGTTAAATGGAAGAATCTGAGAAACGAAGGGAGAGGCTAAGAGCAATGCGAATGGAAGCTGCCCA
GGCTAATGTGGGTAATTATGCCGAGACTTCTCTGCCTAATCATCTTTCCAATCCACTGGTCGAGTCCTCAGCAGCCTTGTTAGAGCAATCAGAACCATGTACCGCCCCAA
GATTTGACTATTACACAAACCCTATGGCTGCATTTTCTGCTACCAAGAAGAGAGGGAACTTTGATAATCATGGCGTGTCAGATAATTATGTTCCTTCTCACCACAATAAT
TCTCCAGCAACTTTTGTTCCATCAAATTTTGCAGGATTGAGAAACCCTGAAATGTCTCCCTCTCCAACTCATCAATTCCATCAACATTCACCTGACCAGAGAATGTTTCA
TGCACGAGGGTTTAATGGATCTGGTTGCCATGGTGGCCCAGCGATTCCCAGGCCGTTTCCTATGGATCGAGGAACTCCTGGTATCTGGAGCGGACCGAGAAGCCCATTTG
TCAACCAATTCCCTGGCCATCCTCCAAGGGGGATGAGCTCCCCCAGAAGCCCATTTGTCAACCACTTCCCTAGCCATCCACCAAGGGATATGAGCTCCCCCAGCATTGTT
CCTGGACCAAGAGGTAATTCTTACACCAATCCAATGCAAGACAGGGTTAATTACCATACTCCTAGCCCTAGTTCAGGGTATGAAGGCAGTCCTAGCCCAGGTCGAGGCAG
CCATGGTTTTTGTGGCAATATGACCCCTACTTCAAGATTTGGCTCTGGACGAGGTTCCAGTTATCATGGTCGTCACTTTTCATCGAACAAATCGCTCAGGCCCGAGCATT
TTCCATTTTATAATGAATCCATGCTTGACGATCCCTGGAAAGATTTGCAACCTGGTATTTGGAGGACAACCGCTCGTGCCAACTCTTCGGAATCTTGGATTTCAAGATCC
CGTATGAAAAAAGCAAGAGTTTCAGAACCTTTCAGCAGCTCAAGCTCTCAACCTAGCCTCGCCGAGTACCTGGCTGCCTCGTTCAATGAAGCAGTGGACGATGCACCAAG
TGTGTGAAACTTTTATCCGGAGCTCCGTTGAAGCCATTTCATCTTACTGCAGATTCCAACCTAGTACAACTGCAACCAAAAGGAGAGTAATTTTTCTCTATATACTGGTT
ATAAATGTGGTAAAATTCAAAGGTAATCTATGGTGATCATGGCACAGTTATCTCTCCATGTTGGGTTACGACTTCTATGTACACTTTCTTCGGTTAGTTGCATGTTTTAT
CAACCATGTTTAGAGGATAATGGACATGATTTAGCTTTGTTTTGTTGTCTTATGATCCTCTCTATAAACAATTTAATATGATTGAGTTTCAAATCATT
Protein sequence
MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAALLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPATFVPSNFAGLRNPE
MSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDRGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSPSIVPGPRGNSYTNPMQDRVN
YHTPSPSSGYEGSPSPGRGSHGFCGNMTPTSRFGSGRGSSYHGRHFSSNKSLRPEHFPFYNESMLDDPWKDLQPGIWRTTARANSSESWISRSRMKKARVSEPFSSSSSQ
PSLAEYLAASFNEAVDDAPSV
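
The protein sequence above should correspond to a codon-by-codon translation of the CDS sequence. The sketch below illustrates that check on the first 24 and last 15 nucleotides of the CDS, assuming the standard genetic code (the record does not name a translation table). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequences
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)

# First 24 nt and last 15 nt of the CDS listed in this record.
print(translate("ATGGAAGAATCTGAGAAACGAAGG"))  # start of the protein
print(translate("GCACCAAGTGTGTGA"))           # end of the protein, before the stop
```

The two fragments translate to MEESEKRR and APSV, which match the first and last residues of the protein sequence shown above. Passing the full CDS (with its line breaks) to the same function would allow the whole protein to be compared.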