; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS020444 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS020444
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionprotein SICKLE isoform X2
Genome locationscaffold375:182755..185873
RNA-Seq ExpressionMS020444
SyntenyMS020444
Gene Ontology termsGO:0000398 - mRNA splicing, via spliceosome (biological process)
GO:0035196 - production of miRNAs involved in gene silencing by miRNA (biological process)
GO:1903730 - regulation of phosphatidate phosphatase activity (biological process)
GO:0016020 - membrane (cellular component)
InterPro domainsIPR039292 - Protein SICKLE


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022148514.1 M-phase-specific PLK1-interacting protein [Momordica charantia]2.1e-13297.51Show/hide
Query:  MSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDQGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSPSIVPGPRGNS
        MSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMD+GTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSPSIVPGPRGNS
Subjt:  MSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDQGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSPSIVPGPRGNS

Query:  YTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFRGNMTPTSRFGSGRGSSSHGRHFSSNKSPRPEHFPFYNESMLDDPWKDLQPGIWRTTARANSSESWI
        YTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGF GNMTPTSRFGSGRGSS HGRHFSSNKS RPEHFPFYNESMLDDPWKDLQPGIWRTTARANSSESWI
Subjt:  YTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFRGNMTPTSRFGSGRGSSSHGRHFSSNKSPRPEHFPFYNESMLDDPWKDLQPGIWRTTARANSSESWI

Query:  SKSRMKKARVSEPFSRSSSQPSLAEYLAASFNEAVDDAPSV
        S+SRMKKARVSEPFS SSSQPSLAEYLAASFNEAVDDAPSV
Subjt:  SKSRMKKARVSEPFSRSSSQPSLAEYLAASFNEAVDDAPSV

XP_022947202.1 protein SICKLE isoform X1 [Cucurbita moschata]2.8e-13271.9Show/hide
Query:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAAMLEQSEPCTAPRFDYYTNPMAAFSATKKRGN-------FDNH--GVSDNYVPSHH
        MEESEKRRERLRAMRMEA+QA+V NY ETSLPNHLSNPLVESSA ML QSEPCTAPRFDYYTNPMAAFS++KKRGN         +H     D YVPSH 
Subjt:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAAMLEQSEPCTAPRFDYYTNPMAAFSATKKRGN-------FDNH--GVSDNYVPSHH

Query:  NNSPATFVPSNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDQGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPS
        N+SP  +VPSNF G+RNPEMSPS  HQFHQHSPDQR F+ARG++GSG HG PA+PRPFPMDQ +P +W GPRSP+VN FPG PPRGM+SPR PFVN FPS
Subjt:  NNSPATFVPSNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDQGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPS

Query:  HPPRDMSSPSIVPGPRGNSYTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFRGNMTPTSRFGSGRGSSSHGRHFSSNKSPRPEHFPFYNESMLDDPWKD
          PRDMSSPS V GPRGNSY N   D VN+ +PSPS GY+GS SPG GSHG RGNMTP+ RFGSGRG  SHGR FSS++S RPE   FYN SML+DPWK 
Subjt:  HPPRDMSSPSIVPGPRGNSYTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFRGNMTPTSRFGSGRGSSSHGRHFSSNKSPRPEHFPFYNESMLDDPWKD

Query:  LQPGIWRTTA----RANSSESWISKSRMKKARVSEPFS-RSSSQPSLAEYLAASFNEAVDDAP
        LQPGIWR  A     AN SESWISK   KKARVS+  S RS SQPSLAEYLAASFNEA ++ P
Subjt:  LQPGIWRTTA----RANSSESWISKSRMKKARVSEPFS-RSSSQPSLAEYLAASFNEAVDDAP

XP_022947209.1 protein SICKLE isoform X2 [Cucurbita moschata]3.9e-13473.73Show/hide
Query:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAAMLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPATFVP
        MEESEKRRERLRAMRMEA+QA+V NY ETSLPNHLSNPLVESSA ML QSEPCTAPRFDYYTNPMAAFS++KKRG   N  VS +YVPSH N+SP  +VP
Subjt:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAAMLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPATFVP

Query:  SNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDQGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSP
        SNF G+RNPEMSPS  HQFHQHSPDQR F+ARG++GSG HG PA+PRPFPMDQ +P +W GPRSP+VN FPG PPRGM+SPR PFVN FPS  PRDMSSP
Subjt:  SNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDQGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSP

Query:  SIVPGPRGNSYTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFRGNMTPTSRFGSGRGSSSHGRHFSSNKSPRPEHFPFYNESMLDDPWKDLQPGIWRTT
        S V GPRGNSY N   D VN+ +PSPS GY+GS SPG GSHG RGNMTP+ RFGSGRG  SHGR FSS++S RPE   FYN SML+DPWK LQPGIWR  
Subjt:  SIVPGPRGNSYTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFRGNMTPTSRFGSGRGSSSHGRHFSSNKSPRPEHFPFYNESMLDDPWKDLQPGIWRTT

Query:  A----RANSSESWISKSRMKKARVSEPFS-RSSSQPSLAEYLAASFNEAVDDAP
        A     AN SESWISK   KKARVS+  S RS SQPSLAEYLAASFNEA ++ P
Subjt:  A----RANSSESWISKSRMKKARVSEPFS-RSSSQPSLAEYLAASFNEAVDDAP

XP_023007497.1 protein SICKLE isoform X1 [Cucurbita maxima]1.9e-12871.39Show/hide
Query:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAAMLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPATFVP
        MEESEKRRERLRAMRMEA+QA+V NY ETSLPNHLSNPLVESSA ML QSEPCTAPRFDYYTNPMAAFS++KKRG   N  VS  YVPSH N+SP  +VP
Subjt:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAAMLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPATFVP

Query:  SNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDQGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSP
        SNF G+RNPEMSPS  HQFHQHSPDQR F+ARG++GSG HG PA+PRPFPMDQ +P +W GPRSP+VN FPG PPR M+SPR PFVN FPS  PRDMSSP
Subjt:  SNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDQGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSP

Query:  SIVPGPRGNSYTNPMQDRVNYHTPSP--------SSGYEGSPSPGRGSHGFRGNMTPTSRFGSGRGSSSHGRHFSSNKSPRPEHFPFYNESMLDDPWKDL
        S V GPRGNSY +  QD VN+ +PSP        S GY+GS SPG GSHG RGNMTP+ RFG GRG  SHGR FSS+KS RPE   FY+ SML+DPWK L
Subjt:  SIVPGPRGNSYTNPMQDRVNYHTPSP--------SSGYEGSPSPGRGSHGFRGNMTPTSRFGSGRGSSSHGRHFSSNKSPRPEHFPFYNESMLDDPWKDL

Query:  QPGIWRTTA----RANSSESWISKSRMKKARVSEPFS-RSSSQPSLAEYLAASFNEAVDD
        QPGIWR  A     AN SESWISK   KKARV +  S RS SQPSLAEYLAASFNEA ++
Subjt:  QPGIWRTTA----RANSSESWISKSRMKKARVSEPFS-RSSSQPSLAEYLAASFNEAVDD

XP_023533680.1 protein SICKLE [Cucurbita pepo subsp. pepo]7.4e-13372.18Show/hide
Query:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAAMLEQSEPCTAPRFDYYTNPMAAFSATKKRGN-------FDNH--GVSDNYVPSHH
        MEESEKRRERLRAMRMEA+QA+V NY ETSLPNHLSNPLVESSA ML QSEPCTAPRFDYYTNPMAAFS++KKRGN         +H     D YVPSH 
Subjt:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAAMLEQSEPCTAPRFDYYTNPMAAFSATKKRGN-------FDNH--GVSDNYVPSHH

Query:  NNSPATFVPSNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDQGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPS
        N+SP  +VPSNF G+RNPEMSPS  HQFHQHSPDQR F+ARG++GSG HG PA+PRPFPMDQ +P +W GPRSP+VN FPG PPRGM+SPR PFVN FPS
Subjt:  NNSPATFVPSNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDQGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPS

Query:  HPPRDMSSPSIVPGPRGNSYTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFRGNMTPTSRFGSGRGSSSHGRHFSSNKSPRPEHFPFYNESMLDDPWKD
          PRDM SPS V GPRGNSY N  QD VN+ +PSPS GY+GS SPG GSHG RGNMTP+ RFGSGRG  SHGR FSS++S RPE   FYN SML+DPWK 
Subjt:  HPPRDMSSPSIVPGPRGNSYTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFRGNMTPTSRFGSGRGSSSHGRHFSSNKSPRPEHFPFYNESMLDDPWKD

Query:  LQPGIWRTTA----RANSSESWISKSRMKKARVSEPFS-RSSSQPSLAEYLAASFNEAVDDAP
        LQPGIWR  A     AN+SESWISK   KKARVS+  S RSSSQPSLAEYLAASFNEA ++ P
Subjt:  LQPGIWRTTA----RANSSESWISKSRMKKARVSEPFS-RSSSQPSLAEYLAASFNEAVDDAP

TrEMBL top hitse value%identityAlignment
A0A6J1D482 M-phase-specific PLK1-interacting protein1.0e-13297.51Show/hide
Query:  MSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDQGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSPSIVPGPRGNS
        MSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMD+GTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSPSIVPGPRGNS
Subjt:  MSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDQGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSPSIVPGPRGNS

Query:  YTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFRGNMTPTSRFGSGRGSSSHGRHFSSNKSPRPEHFPFYNESMLDDPWKDLQPGIWRTTARANSSESWI
        YTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGF GNMTPTSRFGSGRGSS HGRHFSSNKS RPEHFPFYNESMLDDPWKDLQPGIWRTTARANSSESWI
Subjt:  YTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFRGNMTPTSRFGSGRGSSSHGRHFSSNKSPRPEHFPFYNESMLDDPWKDLQPGIWRTTARANSSESWI

Query:  SKSRMKKARVSEPFSRSSSQPSLAEYLAASFNEAVDDAPSV
        S+SRMKKARVSEPFS SSSQPSLAEYLAASFNEAVDDAPSV
Subjt:  SKSRMKKARVSEPFSRSSSQPSLAEYLAASFNEAVDDAPSV

A0A6J1G5T4 protein SICKLE isoform X11.4e-13271.9Show/hide
Query:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAAMLEQSEPCTAPRFDYYTNPMAAFSATKKRGN-------FDNH--GVSDNYVPSHH
        MEESEKRRERLRAMRMEA+QA+V NY ETSLPNHLSNPLVESSA ML QSEPCTAPRFDYYTNPMAAFS++KKRGN         +H     D YVPSH 
Subjt:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAAMLEQSEPCTAPRFDYYTNPMAAFSATKKRGN-------FDNH--GVSDNYVPSHH

Query:  NNSPATFVPSNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDQGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPS
        N+SP  +VPSNF G+RNPEMSPS  HQFHQHSPDQR F+ARG++GSG HG PA+PRPFPMDQ +P +W GPRSP+VN FPG PPRGM+SPR PFVN FPS
Subjt:  NNSPATFVPSNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDQGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPS

Query:  HPPRDMSSPSIVPGPRGNSYTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFRGNMTPTSRFGSGRGSSSHGRHFSSNKSPRPEHFPFYNESMLDDPWKD
          PRDMSSPS V GPRGNSY N   D VN+ +PSPS GY+GS SPG GSHG RGNMTP+ RFGSGRG  SHGR FSS++S RPE   FYN SML+DPWK 
Subjt:  HPPRDMSSPSIVPGPRGNSYTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFRGNMTPTSRFGSGRGSSSHGRHFSSNKSPRPEHFPFYNESMLDDPWKD

Query:  LQPGIWRTTA----RANSSESWISKSRMKKARVSEPFS-RSSSQPSLAEYLAASFNEAVDDAP
        LQPGIWR  A     AN SESWISK   KKARVS+  S RS SQPSLAEYLAASFNEA ++ P
Subjt:  LQPGIWRTTA----RANSSESWISKSRMKKARVSEPFS-RSSSQPSLAEYLAASFNEAVDDAP

A0A6J1G649 protein SICKLE isoform X21.9e-13473.73Show/hide
Query:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAAMLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPATFVP
        MEESEKRRERLRAMRMEA+QA+V NY ETSLPNHLSNPLVESSA ML QSEPCTAPRFDYYTNPMAAFS++KKRG   N  VS +YVPSH N+SP  +VP
Subjt:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAAMLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPATFVP

Query:  SNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDQGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSP
        SNF G+RNPEMSPS  HQFHQHSPDQR F+ARG++GSG HG PA+PRPFPMDQ +P +W GPRSP+VN FPG PPRGM+SPR PFVN FPS  PRDMSSP
Subjt:  SNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDQGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSP

Query:  SIVPGPRGNSYTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFRGNMTPTSRFGSGRGSSSHGRHFSSNKSPRPEHFPFYNESMLDDPWKDLQPGIWRTT
        S V GPRGNSY N   D VN+ +PSPS GY+GS SPG GSHG RGNMTP+ RFGSGRG  SHGR FSS++S RPE   FYN SML+DPWK LQPGIWR  
Subjt:  SIVPGPRGNSYTNPMQDRVNYHTPSPSSGYEGSPSPGRGSHGFRGNMTPTSRFGSGRGSSSHGRHFSSNKSPRPEHFPFYNESMLDDPWKDLQPGIWRTT

Query:  A----RANSSESWISKSRMKKARVSEPFS-RSSSQPSLAEYLAASFNEAVDDAP
        A     AN SESWISK   KKARVS+  S RS SQPSLAEYLAASFNEA ++ P
Subjt:  A----RANSSESWISKSRMKKARVSEPFS-RSSSQPSLAEYLAASFNEAVDDAP

A0A6J1KYW0 protein SICKLE isoform X19.1e-12971.39Show/hide
Query:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAAMLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPATFVP
        MEESEKRRERLRAMRMEA+QA+V NY ETSLPNHLSNPLVESSA ML QSEPCTAPRFDYYTNPMAAFS++KKRG   N  VS  YVPSH N+SP  +VP
Subjt:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAAMLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPATFVP

Query:  SNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDQGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSP
        SNF G+RNPEMSPS  HQFHQHSPDQR F+ARG++GSG HG PA+PRPFPMDQ +P +W GPRSP+VN FPG PPR M+SPR PFVN FPS  PRDMSSP
Subjt:  SNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDQGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSP

Query:  SIVPGPRGNSYTNPMQDRVNYHTPSP--------SSGYEGSPSPGRGSHGFRGNMTPTSRFGSGRGSSSHGRHFSSNKSPRPEHFPFYNESMLDDPWKDL
        S V GPRGNSY +  QD VN+ +PSP        S GY+GS SPG GSHG RGNMTP+ RFG GRG  SHGR FSS+KS RPE   FY+ SML+DPWK L
Subjt:  SIVPGPRGNSYTNPMQDRVNYHTPSP--------SSGYEGSPSPGRGSHGFRGNMTPTSRFGSGRGSSSHGRHFSSNKSPRPEHFPFYNESMLDDPWKDL

Query:  QPGIWRTTA----RANSSESWISKSRMKKARVSEPFS-RSSSQPSLAEYLAASFNEAVDD
        QPGIWR  A     AN SESWISK   KKARV +  S RS SQPSLAEYLAASFNEA ++
Subjt:  QPGIWRTTA----RANSSESWISKSRMKKARVSEPFS-RSSSQPSLAEYLAASFNEAVDD

A0A6J1L0Q3 protein SICKLE isoform X29.1e-12971.39Show/hide
Query:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAAMLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPATFVP
        MEESEKRRERLRAMRMEA+QA+V NY ETSLPNHLSNPLVESSA ML QSEPCTAPRFDYYTNPMAAFS++KKRG   N  VS  YVPSH N+SP  +VP
Subjt:  MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAAMLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPATFVP

Query:  SNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDQGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSP
        SNF G+RNPEMSPS  HQFHQHSPDQR F+ARG++GSG HG PA+PRPFPMDQ +P +W GPRSP+VN FPG PPR M+SPR PFVN FPS  PRDMSSP
Subjt:  SNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDQGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSP

Query:  SIVPGPRGNSYTNPMQDRVNYHTPSP--------SSGYEGSPSPGRGSHGFRGNMTPTSRFGSGRGSSSHGRHFSSNKSPRPEHFPFYNESMLDDPWKDL
        S V GPRGNSY +  QD VN+ +PSP        S GY+GS SPG GSHG RGNMTP+ RFG GRG  SHGR FSS+KS RPE   FY+ SML+DPWK L
Subjt:  SIVPGPRGNSYTNPMQDRVNYHTPSP--------SSGYEGSPSPGRGSHGFRGNMTPTSRFGSGRGSSSHGRHFSSNKSPRPEHFPFYNESMLDDPWKDL

Query:  QPGIWRTTA----RANSSESWISKSRMKKARVSEPFS-RSSSQPSLAEYLAASFNEAVDD
        QPGIWR  A     AN SESWISK   KKARV +  S RS SQPSLAEYLAASFNEA ++
Subjt:  QPGIWRTTA----RANSSESWISKSRMKKARVSEPFS-RSSSQPSLAEYLAASFNEAVDD

SwissProt top hitse value%identityAlignment
Q9SB47 Protein SICKLE1.5e-2435.44Show/hide
Query:  MEESEKRRERLRAMRMEAAQAN---VGNYAETSL-PNHLSNPLVESSAAMLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPA
        ME+SEKR++ L+AMRMEAA  N        ETS+   HLSNPL E+S     Q +     RFDYYT+PMAA+S+ KK        +S    PSH  +SP 
Subjt:  MEESEKRRERLRAMRMEAAQAN---VGNYAETSL-PNHLSNPLVESSAAMLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPA

Query:  TFVPSNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDQGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRD
          VP  F     P + P      +Q   +   FHA  +   G      +    P  +G P  W+       N F           R P VNH  S PP+ 
Subjt:  TFVPSNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDQGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRD

Query:  MSSPSIVPGPRGNSYTNPMQDRVNY-HTPSPSSGYEGSPSPGRGSHGFRGNMTPTSRFGSGRG---SSSHGRHFSSNKSPRPEHFPFYNESMLDDPWKDL
        +  P        N   N    R +Y +TP   S Y      GR +  + GN  P S  G  RG   ++S GR     +   P    FY+ SM +DPWK L
Subjt:  MSSPSIVPGPRGNSYTNPMQDRVNY-HTPSPSSGYEGSPSPGRGSHGFRGNMTPTSRFGSGRG---SSSHGRHFSSNKSPRPEHFPFYNESMLDDPWKDL

Query:  QPGIWRTTARANSSES----WISKS-RMKKARVSEPFSR-SSSQPSLAEYLAASFNEAVDDAPS
        +P +W+  + A+SS S    W+ KS   KK+  SE   + SS+Q SLAEYLAAS + A  D  S
Subjt:  QPGIWRTTARANSSES----WISKS-RMKKARVSEPFSR-SSSQPSLAEYLAASFNEAVDDAPS

Arabidopsis top hitse value%identityAlignment
AT4G24500.1 hydroxyproline-rich glycoprotein family protein1.1e-2535.44Show/hide
Query:  MEESEKRRERLRAMRMEAAQAN---VGNYAETSL-PNHLSNPLVESSAAMLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPA
        ME+SEKR++ L+AMRMEAA  N        ETS+   HLSNPL E+S     Q +     RFDYYT+PMAA+S+ KK        +S    PSH  +SP 
Subjt:  MEESEKRRERLRAMRMEAAQAN---VGNYAETSL-PNHLSNPLVESSAAMLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPA

Query:  TFVPSNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDQGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRD
          VP  F     P + P      +Q   +   FHA  +   G      +    P  +G P  W+       N F           R P VNH  S PP+ 
Subjt:  TFVPSNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDQGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRD

Query:  MSSPSIVPGPRGNSYTNPMQDRVNY-HTPSPSSGYEGSPSPGRGSHGFRGNMTPTSRFGSGRG---SSSHGRHFSSNKSPRPEHFPFYNESMLDDPWKDL
        +  P        N   N    R +Y +TP   S Y      GR +  + GN  P S  G  RG   ++S GR     +   P    FY+ SM +DPWK L
Subjt:  MSSPSIVPGPRGNSYTNPMQDRVNY-HTPSPSSGYEGSPSPGRGSHGFRGNMTPTSRFGSGRG---SSSHGRHFSSNKSPRPEHFPFYNESMLDDPWKDL

Query:  QPGIWRTTARANSSES----WISKS-RMKKARVSEPFSR-SSSQPSLAEYLAASFNEAVDDAPS
        +P +W+  + A+SS S    W+ KS   KK+  SE   + SS+Q SLAEYLAAS + A  D  S
Subjt:  QPGIWRTTARANSSES----WISKS-RMKKARVSEPFSR-SSSQPSLAEYLAASFNEAVDDAPS

AT4G24500.2 hydroxyproline-rich glycoprotein family protein3.0e-1531.59Show/hide
Query:  MEESEKRRERLRAMRMEAAQAN---VGNYAETSL-PNHLSNPLVESSAAMLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPA
        ME+SEKR++ L+AMRMEAA  N        ETS+   HLSNPL E+S                                   NH        SH  +SP 
Subjt:  MEESEKRRERLRAMRMEAAQAN---VGNYAETSL-PNHLSNPLVESSAAMLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPA

Query:  TFVPSNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDQGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRD
          VP  F     P + P      +Q   +   FHA  +   G      +    P  +G P  W+       N F           R P VNH  S PP+ 
Subjt:  TFVPSNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDQGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRD

Query:  MSSPSIVPGPRGNSYTNPMQDRVNY-HTPSPSSGYEGSPSPGRGSHGFRGNMTPTSRFGSGRG---SSSHGRHFSSNKSPRPEHFPFYNESMLDDPWKDL
        +  P        N   N    R +Y +TP   S Y      GR +  + GN  P S  G  RG   ++S GR     +   P    FY+ SM +DPWK L
Subjt:  MSSPSIVPGPRGNSYTNPMQDRVNY-HTPSPSSGYEGSPSPGRGSHGFRGNMTPTSRFGSGRG---SSSHGRHFSSNKSPRPEHFPFYNESMLDDPWKDL

Query:  QPGIWRTTARANSSES----WISKS-RMKKARVSEPFSR-SSSQPSLAEYLAASFNEAVDDAPS
        +P +W+  + A+SS S    W+ KS   KK+  SE   + SS+Q SLAEYLAAS + A  D  S
Subjt:  QPGIWRTTARANSSES----WISKS-RMKKARVSEPFSR-SSSQPSLAEYLAASFNEAVDDAPS

AT4G24500.3 hydroxyproline-rich glycoprotein family protein1.1e-2535.44Show/hide
Query:  MEESEKRRERLRAMRMEAAQAN---VGNYAETSL-PNHLSNPLVESSAAMLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPA
        ME+SEKR++ L+AMRMEAA  N        ETS+   HLSNPL E+S     Q +     RFDYYT+PMAA+S+ KK        +S    PSH  +SP 
Subjt:  MEESEKRRERLRAMRMEAAQAN---VGNYAETSL-PNHLSNPLVESSAAMLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPA

Query:  TFVPSNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDQGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRD
          VP  F     P + P      +Q   +   FHA  +   G      +    P  +G P  W+       N F           R P VNH  S PP+ 
Subjt:  TFVPSNFAGLRNPEMSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDQGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRD

Query:  MSSPSIVPGPRGNSYTNPMQDRVNY-HTPSPSSGYEGSPSPGRGSHGFRGNMTPTSRFGSGRG---SSSHGRHFSSNKSPRPEHFPFYNESMLDDPWKDL
        +  P        N   N    R +Y +TP   S Y      GR +  + GN  P S  G  RG   ++S GR     +   P    FY+ SM +DPWK L
Subjt:  MSSPSIVPGPRGNSYTNPMQDRVNY-HTPSPSSGYEGSPSPGRGSHGFRGNMTPTSRFGSGRG---SSSHGRHFSSNKSPRPEHFPFYNESMLDDPWKDL

Query:  QPGIWRTTARANSSES----WISKS-RMKKARVSEPFSR-SSSQPSLAEYLAASFNEAVDDAPS
        +P +W+  + A+SS S    W+ KS   KK+  SE   + SS+Q SLAEYLAAS + A  D  S
Subjt:  QPGIWRTTARANSSES----WISKS-RMKKARVSEPFSR-SSSQPSLAEYLAASFNEAVDDAPS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAATCTGAGAAACGAAGGGAGAGGCTAAGAGCAATGCGAATGGAAGCTGCCCAGGCTAATGTGGGTAATTATGCCGAGACTTCTCTGCCTAATCATCTTTCCAA
TCCACTGGTCGAGTCCTCAGCAGCCATGTTAGAGCAATCAGAACCATGTACCGCCCCAAGATTTGACTATTACACAAACCCTATGGCTGCATTTTCTGCTACCAAGAAGA
GAGGGAACTTTGATAATCATGGCGTGTCAGATAATTATGTTCCTTCTCACCACAATAATTCTCCAGCAACTTTTGTTCCATCAAATTTTGCAGGATTGAGAAACCCTGAA
ATGTCTCCCTCTCCAACTCATCAATTCCATCAACATTCACCTGACCAGAGAATGTTTCATGCACGAGGGTTTAATGGATCTGGTTGCCATGGTGGCCCAGCGATTCCCAG
GCCGTTTCCTATGGATCAAGGAACTCCTGGTATCTGGAGCGGACCGAGAAGCCCATTTGTCAACCAATTCCCTGGCCATCCTCCAAGGGGGATGAGCTCCCCCAGAAGCC
CATTTGTCAACCACTTCCCTAGCCATCCACCAAGGGATATGAGCTCCCCCAGCATTGTTCCTGGACCAAGAGGTAATTCTTACACCAATCCAATGCAAGACAGGGTTAAT
TACCATACTCCTAGCCCTAGTTCAGGGTATGAAGGCAGTCCTAGCCCAGGTCGAGGCAGCCATGGTTTTCGTGGCAATATGACCCCTACTTCAAGATTTGGCTCTGGACG
AGGTTCCAGTTCTCATGGTCGTCACTTTTCATCGAACAAATCGCCCAGGCCCGAGCATTTTCCATTTTATAATGAATCCATGCTTGACGATCCCTGGAAAGATTTGCAAC
CTGGTATTTGGAGGACAACCGCTCGTGCCAACTCTTCGGAATCTTGGATTTCAAAATCCCGTATGAAAAAAGCAAGAGTTTCAGAACCTTTCAGCAGGTCAAGCTCTCAA
CCTAGCCTCGCCGAGTACCTGGCTGCCTCGTTCAACGAAGCAGTGGACGATGCACCAAGTGTG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAATCTGAGAAACGAAGGGAGAGGCTAAGAGCAATGCGAATGGAAGCTGCCCAGGCTAATGTGGGTAATTATGCCGAGACTTCTCTGCCTAATCATCTTTCCAA
TCCACTGGTCGAGTCCTCAGCAGCCATGTTAGAGCAATCAGAACCATGTACCGCCCCAAGATTTGACTATTACACAAACCCTATGGCTGCATTTTCTGCTACCAAGAAGA
GAGGGAACTTTGATAATCATGGCGTGTCAGATAATTATGTTCCTTCTCACCACAATAATTCTCCAGCAACTTTTGTTCCATCAAATTTTGCAGGATTGAGAAACCCTGAA
ATGTCTCCCTCTCCAACTCATCAATTCCATCAACATTCACCTGACCAGAGAATGTTTCATGCACGAGGGTTTAATGGATCTGGTTGCCATGGTGGCCCAGCGATTCCCAG
GCCGTTTCCTATGGATCAAGGAACTCCTGGTATCTGGAGCGGACCGAGAAGCCCATTTGTCAACCAATTCCCTGGCCATCCTCCAAGGGGGATGAGCTCCCCCAGAAGCC
CATTTGTCAACCACTTCCCTAGCCATCCACCAAGGGATATGAGCTCCCCCAGCATTGTTCCTGGACCAAGAGGTAATTCTTACACCAATCCAATGCAAGACAGGGTTAAT
TACCATACTCCTAGCCCTAGTTCAGGGTATGAAGGCAGTCCTAGCCCAGGTCGAGGCAGCCATGGTTTTCGTGGCAATATGACCCCTACTTCAAGATTTGGCTCTGGACG
AGGTTCCAGTTCTCATGGTCGTCACTTTTCATCGAACAAATCGCCCAGGCCCGAGCATTTTCCATTTTATAATGAATCCATGCTTGACGATCCCTGGAAAGATTTGCAAC
CTGGTATTTGGAGGACAACCGCTCGTGCCAACTCTTCGGAATCTTGGATTTCAAAATCCCGTATGAAAAAAGCAAGAGTTTCAGAACCTTTCAGCAGGTCAAGCTCTCAA
CCTAGCCTCGCCGAGTACCTGGCTGCCTCGTTCAACGAAGCAGTGGACGATGCACCAAGTGTG
Protein sequenceShow/hide protein sequence
MEESEKRRERLRAMRMEAAQANVGNYAETSLPNHLSNPLVESSAAMLEQSEPCTAPRFDYYTNPMAAFSATKKRGNFDNHGVSDNYVPSHHNNSPATFVPSNFAGLRNPE
MSPSPTHQFHQHSPDQRMFHARGFNGSGCHGGPAIPRPFPMDQGTPGIWSGPRSPFVNQFPGHPPRGMSSPRSPFVNHFPSHPPRDMSSPSIVPGPRGNSYTNPMQDRVN
YHTPSPSSGYEGSPSPGRGSHGFRGNMTPTSRFGSGRGSSSHGRHFSSNKSPRPEHFPFYNESMLDDPWKDLQPGIWRTTARANSSESWISKSRMKKARVSEPFSRSSSQ
PSLAEYLAASFNEAVDDAPSV