; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020724 (gene) of Snake gourd v1 genome

Gene IDTan0020724
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein SICKLE isoform X2
Genome locationLG09:53274309..53284002
RNA-Seq ExpressionTan0020724
SyntenyTan0020724
Gene Ontology termsGO:0000398 - mRNA splicing, via spliceosome (biological process)
GO:0035196 - production of miRNAs involved in gene silencing by miRNA (biological process)
GO:1903730 - regulation of phosphatidate phosphatase activity (biological process)
GO:0016020 - membrane (cellular component)
InterPro domainsIPR039292 - Protein SICKLE


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022947202.1 protein SICKLE isoform X1 [Cucurbita moschata]8.4e-14576.71Show/hide
Query:  MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSATMLGQLEPCTTPRFDYYTNPMAAFSASKKRGNQTVS------------DNYVPSHH
        MEESEKRRERLRAMRMEA+QADVANY+ETSLPNHLSNPLVESSA MLGQ EPCT PRFDYYTNPMAAFS+SKKRGNQTVS            D YVPSH 
Subjt:  MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSATMLGQLEPCTTPRFDYYTNPMAAFSASKKRGNQTVS------------DNYVPSHH

Query:  NNSPATYVPSNFPGLRNPEMSPSPTQQFHQHSPDQRTFYAGGFSGSGGPGTPTMPRFFPMDQGNPGMWNGPRSPFVNHFPGPTPRGMSSPRSPFVNQFPS
        N+SP  YVPSNFPG+RNPEMSPS   QFHQHSPDQR FYA G+SGSGG G+P MPR FPMDQ +P MW GPRSP+VNHFPGP PRGM+SPR PFVNQFPS
Subjt:  NNSPATYVPSNFPGLRNPEMSPSPTQQFHQHSPDQRTFYAGGFSGSGGPGTPTMPRFFPMDQGNPGMWNGPRSPFVNHFPGPTPRGMSSPRSPFVNQFPS

Query:  HL-----SPSVLSGQRGNSYSNPTQDRVNYHSPSPSPSPGYQGSSSPGGSSHGHRGNMTPRPRFGSGRGSGSHGRRFSLDESPRPEPFYNASMLDDPWKD
         L     SPS +SG RGNSY N T D VN+   SPSPSPGYQGSSSPGG SHGHRGNMTP PRFGSGRG GSHGRRFS DES RPE FYN SML+DPWK 
Subjt:  HL-----SPSVLSGQRGNSYSNPTQDRVNYHSPSPSPSPGYQGSSSPGGSSHGHRGNMTPRPRFGSGRGSGSHGRRFSLDESPRPEPFYNASMLDDPWKD

Query:  LQPGIWRTVAPV-----STESWISKFGTKKARISDSSSGSGRSISQPSLAEYLAASFNEAVNNAP
        LQPGIWR VAP+      +ESWISKF TKKAR+SD+S  SGRS SQPSLAEYLAASFNEA NN P
Subjt:  LQPGIWRTVAPV-----STESWISKFGTKKARISDSSSGSGRSISQPSLAEYLAASFNEAVNNAP

XP_022947209.1 protein SICKLE isoform X2 [Cucurbita moschata]5.2e-14779.04Show/hide
Query:  MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSATMLGQLEPCTTPRFDYYTNPMAAFSASKKRGNQTVSDNYVPSHHNNSPATYVPSNF
        MEESEKRRERLRAMRMEA+QADVANY+ETSLPNHLSNPLVESSA MLGQ EPCT PRFDYYTNPMAAFS+SKKRGNQTVS +YVPSH N+SP  YVPSNF
Subjt:  MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSATMLGQLEPCTTPRFDYYTNPMAAFSASKKRGNQTVSDNYVPSHHNNSPATYVPSNF

Query:  PGLRNPEMSPSPTQQFHQHSPDQRTFYAGGFSGSGGPGTPTMPRFFPMDQGNPGMWNGPRSPFVNHFPGPTPRGMSSPRSPFVNQFPSHL-----SPSVL
        PG+RNPEMSPS   QFHQHSPDQR FYA G+SGSGG G+P MPR FPMDQ +P MW GPRSP+VNHFPGP PRGM+SPR PFVNQFPS L     SPS +
Subjt:  PGLRNPEMSPSPTQQFHQHSPDQRTFYAGGFSGSGGPGTPTMPRFFPMDQGNPGMWNGPRSPFVNHFPGPTPRGMSSPRSPFVNQFPSHL-----SPSVL

Query:  SGQRGNSYSNPTQDRVNYHSPSPSPSPGYQGSSSPGGSSHGHRGNMTPRPRFGSGRGSGSHGRRFSLDESPRPEPFYNASMLDDPWKDLQPGIWRTVAPV
        SG RGNSY N T D VN+   SPSPSPGYQGSSSPGG SHGHRGNMTP PRFGSGRG GSHGRRFS DES RPE FYN SML+DPWK LQPGIWR VAP+
Subjt:  SGQRGNSYSNPTQDRVNYHSPSPSPSPGYQGSSSPGGSSHGHRGNMTPRPRFGSGRGSGSHGRRFSLDESPRPEPFYNASMLDDPWKDLQPGIWRTVAPV

Query:  -----STESWISKFGTKKARISDSSSGSGRSISQPSLAEYLAASFNEAVNNAP
              +ESWISKF TKKAR+SD+S  SGRS SQPSLAEYLAASFNEA NN P
Subjt:  -----STESWISKFGTKKARISDSSSGSGRSISQPSLAEYLAASFNEAVNNAP

XP_023007497.1 protein SICKLE isoform X1 [Cucurbita maxima]3.5e-14377.31Show/hide
Query:  MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSATMLGQLEPCTTPRFDYYTNPMAAFSASKKRGNQTVSDNYVPSHHNNSPATYVPSNF
        MEESEKRRERLRAMRMEA+QADVANYVETSLPNHLSNPLVESSA MLGQ EPCT PRFDYYTNPMAAFS+SKKRGNQTVS  YVPSH N+SP  YVPSNF
Subjt:  MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSATMLGQLEPCTTPRFDYYTNPMAAFSASKKRGNQTVSDNYVPSHHNNSPATYVPSNF

Query:  PGLRNPEMSPSPTQQFHQHSPDQRTFYAGGFSGSGGPGTPTMPRFFPMDQGNPGMWNGPRSPFVNHFPGPTPRGMSSPRSPFVNQFPSHL-----SPSVL
        PG+RNPEMSPS   QFHQHSPDQR FYA G+SGSGG G+P MPR FPMDQ +P MW GPRSP+VNHFPGP PR M+SPR PFVNQFPS L     SPS +
Subjt:  PGLRNPEMSPSPTQQFHQHSPDQRTFYAGGFSGSGGPGTPTMPRFFPMDQGNPGMWNGPRSPFVNHFPGPTPRGMSSPRSPFVNQFPSHL-----SPSVL

Query:  SGQRGNSYSNPTQDRVNYHSPSPSP------SPGYQGSSSPGGSSHGHRGNMTPRPRFGSGRGSGSHGRRFSLDESPRPEPFYNASMLDDPWKDLQPGIW
        SG RGNSY + TQD VN+ SPSPSP      SPGYQGSSSPGG SHGHRGNMTP PRFG GRG GSHGRRFS D+S RPE FY+ SML+DPWK LQPGIW
Subjt:  SGQRGNSYSNPTQDRVNYHSPSPSP------SPGYQGSSSPGGSSHGHRGNMTPRPRFGSGRGSGSHGRRFSLDESPRPEPFYNASMLDDPWKDLQPGIW

Query:  RTVAPV-----STESWISKFGTKKARISDSSSGSGRSISQPSLAEYLAASFNEAVNN
        R VAP+      +ESWISKF TKKAR+ D+S  SGRS SQPSLAEYLAASFNEA NN
Subjt:  RTVAPV-----STESWISKFGTKKARISDSSSGSGRSISQPSLAEYLAASFNEAVNN

XP_023007500.1 protein SICKLE isoform X2 [Cucurbita maxima]3.5e-14377.31Show/hide
Query:  MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSATMLGQLEPCTTPRFDYYTNPMAAFSASKKRGNQTVSDNYVPSHHNNSPATYVPSNF
        MEESEKRRERLRAMRMEA+QADVANYVETSLPNHLSNPLVESSA MLGQ EPCT PRFDYYTNPMAAFS+SKKRGNQTVS  YVPSH N+SP  YVPSNF
Subjt:  MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSATMLGQLEPCTTPRFDYYTNPMAAFSASKKRGNQTVSDNYVPSHHNNSPATYVPSNF

Query:  PGLRNPEMSPSPTQQFHQHSPDQRTFYAGGFSGSGGPGTPTMPRFFPMDQGNPGMWNGPRSPFVNHFPGPTPRGMSSPRSPFVNQFPSHL-----SPSVL
        PG+RNPEMSPS   QFHQHSPDQR FYA G+SGSGG G+P MPR FPMDQ +P MW GPRSP+VNHFPGP PR M+SPR PFVNQFPS L     SPS +
Subjt:  PGLRNPEMSPSPTQQFHQHSPDQRTFYAGGFSGSGGPGTPTMPRFFPMDQGNPGMWNGPRSPFVNHFPGPTPRGMSSPRSPFVNQFPSHL-----SPSVL

Query:  SGQRGNSYSNPTQDRVNYHSPSPSP------SPGYQGSSSPGGSSHGHRGNMTPRPRFGSGRGSGSHGRRFSLDESPRPEPFYNASMLDDPWKDLQPGIW
        SG RGNSY + TQD VN+ SPSPSP      SPGYQGSSSPGG SHGHRGNMTP PRFG GRG GSHGRRFS D+S RPE FY+ SML+DPWK LQPGIW
Subjt:  SGQRGNSYSNPTQDRVNYHSPSPSP------SPGYQGSSSPGGSSHGHRGNMTPRPRFGSGRGSGSHGRRFSLDESPRPEPFYNASMLDDPWKDLQPGIW

Query:  RTVAPV-----STESWISKFGTKKARISDSSSGSGRSISQPSLAEYLAASFNEAVNN
        R VAP+      +ESWISKF TKKAR+ D+S  SGRS SQPSLAEYLAASFNEA NN
Subjt:  RTVAPV-----STESWISKFGTKKARISDSSSGSGRSISQPSLAEYLAASFNEAVNN

XP_023533680.1 protein SICKLE [Cucurbita pepo subsp. pepo]7.6e-14677.26Show/hide
Query:  MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSATMLGQLEPCTTPRFDYYTNPMAAFSASKKRGNQTVS------------DNYVPSHH
        MEESEKRRERLRAMRMEA+QADVANYVETSLPNHLSNPLVESSA MLGQ EPCT PRFDYYTNPMAAFS+SKKRGNQTVS            D YVPSH 
Subjt:  MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSATMLGQLEPCTTPRFDYYTNPMAAFSASKKRGNQTVS------------DNYVPSHH

Query:  NNSPATYVPSNFPGLRNPEMSPSPTQQFHQHSPDQRTFYAGGFSGSGGPGTPTMPRFFPMDQGNPGMWNGPRSPFVNHFPGPTPRGMSSPRSPFVNQFPS
        N+SP  YVPSNFPG+RNPEMSPS   QFHQHSPDQR FYA G+SGSGG G+P MPR FPMDQ +P MW GPRSP+VNHFPGP PRGM+SPR PFVNQFPS
Subjt:  NNSPATYVPSNFPGLRNPEMSPSPTQQFHQHSPDQRTFYAGGFSGSGGPGTPTMPRFFPMDQGNPGMWNGPRSPFVNHFPGPTPRGMSSPRSPFVNQFPS

Query:  HL-----SPSVLSGQRGNSYSNPTQDRVNYHSPSPSPSPGYQGSSSPGGSSHGHRGNMTPRPRFGSGRGSGSHGRRFSLDESPRPEPFYNASMLDDPWKD
         L     SPS +SG RGNSY N TQD VN+   SPSPSPGYQGSSSPGG SHGHRGNMTP PRFGSGRG GSHGRRFS DES RPE FYN SML+DPWK 
Subjt:  HL-----SPSVLSGQRGNSYSNPTQDRVNYHSPSPSPSPGYQGSSSPGGSSHGHRGNMTPRPRFGSGRGSGSHGRRFSLDESPRPEPFYNASMLDDPWKD

Query:  LQPGIWRTVAPV-----STESWISKFGTKKARISDSSSGSGRSISQPSLAEYLAASFNEAVNNAP
        LQPGIWR VAP+     ++ESWISKF TKKAR+SD+S  SGRS SQPSLAEYLAASFNEA NN P
Subjt:  LQPGIWRTVAPV-----STESWISKFGTKKARISDSSSGSGRSISQPSLAEYLAASFNEAVNNAP

TrEMBL top hitse value%identityAlignment
A0A5D3C828 ACT11D09.59.4e-11867.61Show/hide
Query:  MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSATMLGQLEPCTTPRFDYYTNPMAAFSASKKRG---NQTVSDNYVPSHHNNSPATYVP
        MEESEKRRERLRAMRMEAAQADV NY+ETSLPNHLSNPLVESSATM+GQL PCT PRFDYYTNPMAAFS SKK+G   NQ VSD +VP HHN S  TY+P
Subjt:  MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSATMLGQLEPCTTPRFDYYTNPMAAFSASKKRG---NQTVSDNYVPSHHNNSPATYVP

Query:  SNFPGLRNPEMSPSPTQQFHQHSPDQRTFYAGGFSGSGGPGTPTMPRFFPMDQGNPGMWNGPRSPFVNHFPGPTPRGMSSPRSPFVNQFPSHLSPSVLSG
          FPGLRNPEMSPS T QFHQ+SPDQRTFYA G S +GG G+P MPR + ++QG+P MW GPR PFVN FP   PR M+              S S +SG
Subjt:  SNFPGLRNPEMSPSPTQQFHQHSPDQRTFYAGGFSGSGGPGTPTMPRFFPMDQGNPGMWNGPRSPFVNHFPGPTPRGMSSPRSPFVNQFPSHLSPSVLSG

Query:  QRGNSYSNPTQDRVNYHSPSPSPSPGYQGSSSPGGSSHGHRGNMTPRPRFGSGRGSGSHGRRFSLDESPRPEPFYNASMLDDPWKDLQPGIWRTVAPVS-
         RGNSY+NPTQDR  Y   S SP+PG+ GS SPG  SHGH GNMTP PRFG GRG+G HGR   LD+S  PE FYN SML+DPWK LQP IW T+   S 
Subjt:  QRGNSYSNPTQDRVNYHSPSPSPSPGYQGSSSPGGSSHGHRGNMTPRPRFGSGRGSGSHGRRFSLDESPRPEPFYNASMLDDPWKDLQPGIWRTVAPVS-

Query:  ----TESWISKFGTKKARISDSSSGSGRSISQPSLAEYLAASFNEAVNNAPS
            +ESWISKFGTKKAR+SDSSSG   S  QPSLAEYLAASF EA+ +AP+
Subjt:  ----TESWISKFGTKKARISDSSSGSGRSISQPSLAEYLAASFNEAVNNAPS

A0A6J1G5T4 protein SICKLE isoform X14.0e-14576.71Show/hide
Query:  MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSATMLGQLEPCTTPRFDYYTNPMAAFSASKKRGNQTVS------------DNYVPSHH
        MEESEKRRERLRAMRMEA+QADVANY+ETSLPNHLSNPLVESSA MLGQ EPCT PRFDYYTNPMAAFS+SKKRGNQTVS            D YVPSH 
Subjt:  MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSATMLGQLEPCTTPRFDYYTNPMAAFSASKKRGNQTVS------------DNYVPSHH

Query:  NNSPATYVPSNFPGLRNPEMSPSPTQQFHQHSPDQRTFYAGGFSGSGGPGTPTMPRFFPMDQGNPGMWNGPRSPFVNHFPGPTPRGMSSPRSPFVNQFPS
        N+SP  YVPSNFPG+RNPEMSPS   QFHQHSPDQR FYA G+SGSGG G+P MPR FPMDQ +P MW GPRSP+VNHFPGP PRGM+SPR PFVNQFPS
Subjt:  NNSPATYVPSNFPGLRNPEMSPSPTQQFHQHSPDQRTFYAGGFSGSGGPGTPTMPRFFPMDQGNPGMWNGPRSPFVNHFPGPTPRGMSSPRSPFVNQFPS

Query:  HL-----SPSVLSGQRGNSYSNPTQDRVNYHSPSPSPSPGYQGSSSPGGSSHGHRGNMTPRPRFGSGRGSGSHGRRFSLDESPRPEPFYNASMLDDPWKD
         L     SPS +SG RGNSY N T D VN+   SPSPSPGYQGSSSPGG SHGHRGNMTP PRFGSGRG GSHGRRFS DES RPE FYN SML+DPWK 
Subjt:  HL-----SPSVLSGQRGNSYSNPTQDRVNYHSPSPSPSPGYQGSSSPGGSSHGHRGNMTPRPRFGSGRGSGSHGRRFSLDESPRPEPFYNASMLDDPWKD

Query:  LQPGIWRTVAPV-----STESWISKFGTKKARISDSSSGSGRSISQPSLAEYLAASFNEAVNNAP
        LQPGIWR VAP+      +ESWISKF TKKAR+SD+S  SGRS SQPSLAEYLAASFNEA NN P
Subjt:  LQPGIWRTVAPV-----STESWISKFGTKKARISDSSSGSGRSISQPSLAEYLAASFNEAVNNAP

A0A6J1G649 protein SICKLE isoform X22.5e-14779.04Show/hide
Query:  MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSATMLGQLEPCTTPRFDYYTNPMAAFSASKKRGNQTVSDNYVPSHHNNSPATYVPSNF
        MEESEKRRERLRAMRMEA+QADVANY+ETSLPNHLSNPLVESSA MLGQ EPCT PRFDYYTNPMAAFS+SKKRGNQTVS +YVPSH N+SP  YVPSNF
Subjt:  MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSATMLGQLEPCTTPRFDYYTNPMAAFSASKKRGNQTVSDNYVPSHHNNSPATYVPSNF

Query:  PGLRNPEMSPSPTQQFHQHSPDQRTFYAGGFSGSGGPGTPTMPRFFPMDQGNPGMWNGPRSPFVNHFPGPTPRGMSSPRSPFVNQFPSHL-----SPSVL
        PG+RNPEMSPS   QFHQHSPDQR FYA G+SGSGG G+P MPR FPMDQ +P MW GPRSP+VNHFPGP PRGM+SPR PFVNQFPS L     SPS +
Subjt:  PGLRNPEMSPSPTQQFHQHSPDQRTFYAGGFSGSGGPGTPTMPRFFPMDQGNPGMWNGPRSPFVNHFPGPTPRGMSSPRSPFVNQFPSHL-----SPSVL

Query:  SGQRGNSYSNPTQDRVNYHSPSPSPSPGYQGSSSPGGSSHGHRGNMTPRPRFGSGRGSGSHGRRFSLDESPRPEPFYNASMLDDPWKDLQPGIWRTVAPV
        SG RGNSY N T D VN+   SPSPSPGYQGSSSPGG SHGHRGNMTP PRFGSGRG GSHGRRFS DES RPE FYN SML+DPWK LQPGIWR VAP+
Subjt:  SGQRGNSYSNPTQDRVNYHSPSPSPSPGYQGSSSPGGSSHGHRGNMTPRPRFGSGRGSGSHGRRFSLDESPRPEPFYNASMLDDPWKDLQPGIWRTVAPV

Query:  -----STESWISKFGTKKARISDSSSGSGRSISQPSLAEYLAASFNEAVNNAP
              +ESWISKF TKKAR+SD+S  SGRS SQPSLAEYLAASFNEA NN P
Subjt:  -----STESWISKFGTKKARISDSSSGSGRSISQPSLAEYLAASFNEAVNNAP

A0A6J1KYW0 protein SICKLE isoform X11.7e-14377.31Show/hide
Query:  MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSATMLGQLEPCTTPRFDYYTNPMAAFSASKKRGNQTVSDNYVPSHHNNSPATYVPSNF
        MEESEKRRERLRAMRMEA+QADVANYVETSLPNHLSNPLVESSA MLGQ EPCT PRFDYYTNPMAAFS+SKKRGNQTVS  YVPSH N+SP  YVPSNF
Subjt:  MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSATMLGQLEPCTTPRFDYYTNPMAAFSASKKRGNQTVSDNYVPSHHNNSPATYVPSNF

Query:  PGLRNPEMSPSPTQQFHQHSPDQRTFYAGGFSGSGGPGTPTMPRFFPMDQGNPGMWNGPRSPFVNHFPGPTPRGMSSPRSPFVNQFPSHL-----SPSVL
        PG+RNPEMSPS   QFHQHSPDQR FYA G+SGSGG G+P MPR FPMDQ +P MW GPRSP+VNHFPGP PR M+SPR PFVNQFPS L     SPS +
Subjt:  PGLRNPEMSPSPTQQFHQHSPDQRTFYAGGFSGSGGPGTPTMPRFFPMDQGNPGMWNGPRSPFVNHFPGPTPRGMSSPRSPFVNQFPSHL-----SPSVL

Query:  SGQRGNSYSNPTQDRVNYHSPSPSP------SPGYQGSSSPGGSSHGHRGNMTPRPRFGSGRGSGSHGRRFSLDESPRPEPFYNASMLDDPWKDLQPGIW
        SG RGNSY + TQD VN+ SPSPSP      SPGYQGSSSPGG SHGHRGNMTP PRFG GRG GSHGRRFS D+S RPE FY+ SML+DPWK LQPGIW
Subjt:  SGQRGNSYSNPTQDRVNYHSPSPSP------SPGYQGSSSPGGSSHGHRGNMTPRPRFGSGRGSGSHGRRFSLDESPRPEPFYNASMLDDPWKDLQPGIW

Query:  RTVAPV-----STESWISKFGTKKARISDSSSGSGRSISQPSLAEYLAASFNEAVNN
        R VAP+      +ESWISKF TKKAR+ D+S  SGRS SQPSLAEYLAASFNEA NN
Subjt:  RTVAPV-----STESWISKFGTKKARISDSSSGSGRSISQPSLAEYLAASFNEAVNN

A0A6J1L0Q3 protein SICKLE isoform X21.7e-14377.31Show/hide
Query:  MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSATMLGQLEPCTTPRFDYYTNPMAAFSASKKRGNQTVSDNYVPSHHNNSPATYVPSNF
        MEESEKRRERLRAMRMEA+QADVANYVETSLPNHLSNPLVESSA MLGQ EPCT PRFDYYTNPMAAFS+SKKRGNQTVS  YVPSH N+SP  YVPSNF
Subjt:  MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSATMLGQLEPCTTPRFDYYTNPMAAFSASKKRGNQTVSDNYVPSHHNNSPATYVPSNF

Query:  PGLRNPEMSPSPTQQFHQHSPDQRTFYAGGFSGSGGPGTPTMPRFFPMDQGNPGMWNGPRSPFVNHFPGPTPRGMSSPRSPFVNQFPSHL-----SPSVL
        PG+RNPEMSPS   QFHQHSPDQR FYA G+SGSGG G+P MPR FPMDQ +P MW GPRSP+VNHFPGP PR M+SPR PFVNQFPS L     SPS +
Subjt:  PGLRNPEMSPSPTQQFHQHSPDQRTFYAGGFSGSGGPGTPTMPRFFPMDQGNPGMWNGPRSPFVNHFPGPTPRGMSSPRSPFVNQFPSHL-----SPSVL

Query:  SGQRGNSYSNPTQDRVNYHSPSPSP------SPGYQGSSSPGGSSHGHRGNMTPRPRFGSGRGSGSHGRRFSLDESPRPEPFYNASMLDDPWKDLQPGIW
        SG RGNSY + TQD VN+ SPSPSP      SPGYQGSSSPGG SHGHRGNMTP PRFG GRG GSHGRRFS D+S RPE FY+ SML+DPWK LQPGIW
Subjt:  SGQRGNSYSNPTQDRVNYHSPSPSP------SPGYQGSSSPGGSSHGHRGNMTPRPRFGSGRGSGSHGRRFSLDESPRPEPFYNASMLDDPWKDLQPGIW

Query:  RTVAPV-----STESWISKFGTKKARISDSSSGSGRSISQPSLAEYLAASFNEAVNN
        R VAP+      +ESWISKF TKKAR+ D+S  SGRS SQPSLAEYLAASFNEA NN
Subjt:  RTVAPV-----STESWISKFGTKKARISDSSSGSGRSISQPSLAEYLAASFNEAVNN

SwissProt top hitse value%identityAlignment
Q9SB47 Protein SICKLE4.0e-2535.14Show/hide
Query:  MEESEKRRERLRAMRMEAA---QADVANYVETSL-PNHLSNPLVESSATMLGQLEPCTTPRFDYYTNPMAAFSASKKRGNQTVSDNYV--PSHHNNSPAT
        ME+SEKR++ L+AMRMEAA     D     ETS+   HLSNPL E+S       E   T RFDYYT+PMAA+S+ KK  N+T    Y+  PSH  +SP  
Subjt:  MEESEKRRERLRAMRMEAA---QADVANYVETSL-PNHLSNPLVESSATMLGQLEPCTTPRFDYYTNPMAAFSASKKRGNQTVSDNYV--PSHHNNSPAT

Query:  YVPSNFPGLRNPEMSPSPTQQFHQHSPDQRTFYAGGFSGSGGPGTPTMPRFFPMDQGNPGMWNGP-RSPFVNHFPGPTPRGMSSPRSPFVNQFPSHLSPS
         VP  FP    P + P      +Q   +   F+A  +   G      M    P  +G P  WN   R P VNH  GP P+ +  P  PF  + P      
Subjt:  YVPSNFPGLRNPEMSPSPTQQFHQHSPDQRTFYAGGFSGSGGPGTPTMPRFFPMDQGNPGMWNGP-RSPFVNHFPGPTPRGMSSPRSPFVNQFPSHLSPS

Query:  VLSGQRGNSYSNPTQDRVNYHSPSPSPSPGYQGSSSPGGSSHGHRGNMTPRPRFGSGRGSGSHGRRFSLDESPRPEPFYNASMLDDPWKDLQPGIWRTVA
               N  +N    R +Y++  P  S   + +++ GG+++ + G    R R G     G  G R  ++  P  E FY+ SM +DPWK L+P +W+  +
Subjt:  VLSGQRGNSYSNPTQDRVNYHSPSPSPSPGYQGSSSPGGSSHGHRGNMTPRPRFGSGRGSGSHGRRFSLDESPRPEPFYNASMLDDPWKDLQPGIWRTVA

Query:  PVSTES-----WISKFGTKKARISDSSSGSGRSISQPSLAEYLAASFNEA
          S+ S     W+ K    K  ++ S +    S +Q SLAEYLAAS + A
Subjt:  PVSTES-----WISKFGTKKARISDSSSGSGRSISQPSLAEYLAASFNEA

Arabidopsis top hitse value%identityAlignment
AT4G24500.1 hydroxyproline-rich glycoprotein family protein2.9e-2635.14Show/hide
Query:  MEESEKRRERLRAMRMEAA---QADVANYVETSL-PNHLSNPLVESSATMLGQLEPCTTPRFDYYTNPMAAFSASKKRGNQTVSDNYV--PSHHNNSPAT
        ME+SEKR++ L+AMRMEAA     D     ETS+   HLSNPL E+S       E   T RFDYYT+PMAA+S+ KK  N+T    Y+  PSH  +SP  
Subjt:  MEESEKRRERLRAMRMEAA---QADVANYVETSL-PNHLSNPLVESSATMLGQLEPCTTPRFDYYTNPMAAFSASKKRGNQTVSDNYV--PSHHNNSPAT

Query:  YVPSNFPGLRNPEMSPSPTQQFHQHSPDQRTFYAGGFSGSGGPGTPTMPRFFPMDQGNPGMWNGP-RSPFVNHFPGPTPRGMSSPRSPFVNQFPSHLSPS
         VP  FP    P + P      +Q   +   F+A  +   G      M    P  +G P  WN   R P VNH  GP P+ +  P  PF  + P      
Subjt:  YVPSNFPGLRNPEMSPSPTQQFHQHSPDQRTFYAGGFSGSGGPGTPTMPRFFPMDQGNPGMWNGP-RSPFVNHFPGPTPRGMSSPRSPFVNQFPSHLSPS

Query:  VLSGQRGNSYSNPTQDRVNYHSPSPSPSPGYQGSSSPGGSSHGHRGNMTPRPRFGSGRGSGSHGRRFSLDESPRPEPFYNASMLDDPWKDLQPGIWRTVA
               N  +N    R +Y++  P  S   + +++ GG+++ + G    R R G     G  G R  ++  P  E FY+ SM +DPWK L+P +W+  +
Subjt:  VLSGQRGNSYSNPTQDRVNYHSPSPSPSPGYQGSSSPGGSSHGHRGNMTPRPRFGSGRGSGSHGRRFSLDESPRPEPFYNASMLDDPWKDLQPGIWRTVA

Query:  PVSTES-----WISKFGTKKARISDSSSGSGRSISQPSLAEYLAASFNEA
          S+ S     W+ K    K  ++ S +    S +Q SLAEYLAAS + A
Subjt:  PVSTES-----WISKFGTKKARISDSSSGSGRSISQPSLAEYLAASFNEA

AT4G24500.2 hydroxyproline-rich glycoprotein family protein1.6e-1330.17Show/hide
Query:  MEESEKRRERLRAMRMEAA---QADVANYVETSL-PNHLSNPLVESSATMLGQLEPCTTPRFDYYTNPMAAFSASKKRGNQTVSDNYVPSHHNNSPATYV
        ME+SEKR++ L+AMRMEAA     D     ETS+   HLSNPL E+S                                N         SH  +SP   V
Subjt:  MEESEKRRERLRAMRMEAA---QADVANYVETSL-PNHLSNPLVESSATMLGQLEPCTTPRFDYYTNPMAAFSASKKRGNQTVSDNYVPSHHNNSPATYV

Query:  PSNFPGLRNPEMSPSPTQQFHQHSPDQRTFYAGGFSGSGGPGTPTMPRFFPMDQGNPGMWNGP-RSPFVNHFPGPTPRGMSSPRSPFVNQFPSHLSPSVL
        P  FP    P + P      +Q   +   F+A  +   G      M    P  +G P  WN   R P VNH  GP P+ +  P  PF  + P        
Subjt:  PSNFPGLRNPEMSPSPTQQFHQHSPDQRTFYAGGFSGSGGPGTPTMPRFFPMDQGNPGMWNGP-RSPFVNHFPGPTPRGMSSPRSPFVNQFPSHLSPSVL

Query:  SGQRGNSYSNPTQDRVNYHSPSPSPSPGYQGSSSPGGSSHGHRGNMTPRPRFGSGRGSGSHGRRFSLDESPRPEPFYNASMLDDPWKDLQPGIWRTVAPV
             N  +N    R +Y++  P  S   + +++ GG+++ + G    R R G     G  G R  ++  P  E FY+ SM +DPWK L+P +W+  +  
Subjt:  SGQRGNSYSNPTQDRVNYHSPSPSPSPGYQGSSSPGGSSHGHRGNMTPRPRFGSGRGSGSHGRRFSLDESPRPEPFYNASMLDDPWKDLQPGIWRTVAPV

Query:  STES-----WISKFGTKKARISDSSSGSGRSISQPSLAEYLAASFNEA
        S+ S     W+ K    K  ++ S +    S +Q SLAEYLAAS + A
Subjt:  STES-----WISKFGTKKARISDSSSGSGRSISQPSLAEYLAASFNEA

AT4G24500.3 hydroxyproline-rich glycoprotein family protein2.9e-2635.14Show/hide
Query:  MEESEKRRERLRAMRMEAA---QADVANYVETSL-PNHLSNPLVESSATMLGQLEPCTTPRFDYYTNPMAAFSASKKRGNQTVSDNYV--PSHHNNSPAT
        ME+SEKR++ L+AMRMEAA     D     ETS+   HLSNPL E+S       E   T RFDYYT+PMAA+S+ KK  N+T    Y+  PSH  +SP  
Subjt:  MEESEKRRERLRAMRMEAA---QADVANYVETSL-PNHLSNPLVESSATMLGQLEPCTTPRFDYYTNPMAAFSASKKRGNQTVSDNYV--PSHHNNSPAT

Query:  YVPSNFPGLRNPEMSPSPTQQFHQHSPDQRTFYAGGFSGSGGPGTPTMPRFFPMDQGNPGMWNGP-RSPFVNHFPGPTPRGMSSPRSPFVNQFPSHLSPS
         VP  FP    P + P      +Q   +   F+A  +   G      M    P  +G P  WN   R P VNH  GP P+ +  P  PF  + P      
Subjt:  YVPSNFPGLRNPEMSPSPTQQFHQHSPDQRTFYAGGFSGSGGPGTPTMPRFFPMDQGNPGMWNGP-RSPFVNHFPGPTPRGMSSPRSPFVNQFPSHLSPS

Query:  VLSGQRGNSYSNPTQDRVNYHSPSPSPSPGYQGSSSPGGSSHGHRGNMTPRPRFGSGRGSGSHGRRFSLDESPRPEPFYNASMLDDPWKDLQPGIWRTVA
               N  +N    R +Y++  P  S   + +++ GG+++ + G    R R G     G  G R  ++  P  E FY+ SM +DPWK L+P +W+  +
Subjt:  VLSGQRGNSYSNPTQDRVNYHSPSPSPSPGYQGSSSPGGSSHGHRGNMTPRPRFGSGRGSGSHGRRFSLDESPRPEPFYNASMLDDPWKDLQPGIWRTVA

Query:  PVSTES-----WISKFGTKKARISDSSSGSGRSISQPSLAEYLAASFNEA
          S+ S     W+ K    K  ++ S +    S +Q SLAEYLAAS + A
Subjt:  PVSTES-----WISKFGTKKARISDSSSGSGRSISQPSLAEYLAASFNEA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAATCTGAGAAACGAAGGGAGAGACTGAGAGCAATGCGAATGGAAGCTGCTCAGGCTGATGTCGCTAATTATGTCGAAACTTCTCTGCCTAATCATCTTTCCAA
TCCATTGGTCGAGTCCTCAGCTACGATGTTAGGGCAATTAGAACCATGTACTACCCCAAGATTTGACTACTACACAAACCCTATGGCTGCATTTTCTGCTAGCAAGAAGA
GAGGGAATCAGACTGTGTCCGATAATTATGTTCCTTCCCACCACAATAATTCTCCAGCAACTTATGTACCATCAAATTTCCCAGGATTGAGAAACCCTGAAATGTCTCCC
TCTCCGACTCAACAATTCCATCAACATTCACCTGACCAGAGAACGTTCTATGCAGGAGGGTTTAGTGGGTCTGGTGGCCCTGGTACCCCAACAATGCCCAGATTTTTTCC
TATGGATCAAGGAAATCCTGGTATGTGGAATGGACCTAGAAGCCCATTTGTCAACCACTTCCCAGGCCCTACTCCAAGGGGGATGAGCTCCCCCAGAAGCCCATTTGTCA
ACCAATTCCCTAGCCATCTCTCCCCCAGCGTTCTCTCTGGACAAAGAGGCAATTCTTACTCCAATCCAACGCAAGACAGGGTTAATTACCATAGTCCAAGCCCTAGTCCT
AGTCCGGGGTATCAAGGCAGTTCGAGTCCTGGTGGAAGCAGCCATGGTCATCGTGGCAATATGACCCCCAGACCAAGATTTGGCTCTGGACGAGGTTCTGGTTCTCATGG
TCGTCGTTTTTCATTAGATGAATCACCCAGACCGGAACCATTTTACAATGCGTCCATGCTTGATGATCCCTGGAAGGATCTGCAACCTGGTATTTGGAGGACAGTTGCTC
CAGTAAGCACTGAATCTTGGATTTCAAAGTTTGGTACTAAGAAAGCAAGAATTTCAGATTCTTCTTCTGGCTCTGGCAGGTCAATCTCTCAACCTAGCCTCGCAGAGTAC
CTGGCTGCCTCCTTCAACGAAGCAGTCAACAATGCACCAAGTTGGCCACTACTGTTCTGA
mRNA sequenceShow/hide mRNA sequence
AAAGAAAGGAAAAAAAAAAGAAGAAAGGAAAAAAGCCCAAACCCCTTTCTTTCTCCCTCTTCTTCACGAGCCTTCACCACCCTCCCAACACCACATCATCACCATCCTCG
TTTTCTTTTTCTTCTTCTTCCCCGTCGACAGCCTCCACCGCTCGAACGCCGCCGGAGAAGTCACAGCGCCGAACGCCAGCCGTCGATCTGTCCCTCGACGTGCGCTGTAA
CCGCGCGTTTGAGCCCCGAACCGCCGCACCATCACTCAATCCGGGTCATCACTGCAGCCCCGAACCACCGCGCGCAGGTTTTCTGAAGCACGATCTGCCATCAGCCGTGT
GTCAAGGTCGCCGGAGAACCCAACGCGAGCGCGTCTCGAGTGTGTTGGTCCGGAGGCTGTGATTTGTGGGATTCCCATTGGATTTGGAGGCTGGACCATAAAAAATTGTA
TCACTCTAATACCGAACCAGCAAGTGCGTCGTTATGTTTTGGTAAGGTTTAAGAGCTGAGGCTTCCAATTTATGACCTCTCTGGGCAGGTTGTAAATTGAACCAATCAAA
GTGAGGCGTTCATAATTCGAGGATTTCTCTTGGGTGGTGCAAGTGACTTTCCTAGGAGAAGGAAATTTTAAATATTCTAGTTTTGGCTCACATAGGTTAAATGGAAGAAT
CTGAGAAACGAAGGGAGAGACTGAGAGCAATGCGAATGGAAGCTGCTCAGGCTGATGTCGCTAATTATGTCGAAACTTCTCTGCCTAATCATCTTTCCAATCCATTGGTC
GAGTCCTCAGCTACGATGTTAGGGCAATTAGAACCATGTACTACCCCAAGATTTGACTACTACACAAACCCTATGGCTGCATTTTCTGCTAGCAAGAAGAGAGGGAATCA
GACTGTGTCCGATAATTATGTTCCTTCCCACCACAATAATTCTCCAGCAACTTATGTACCATCAAATTTCCCAGGATTGAGAAACCCTGAAATGTCTCCCTCTCCGACTC
AACAATTCCATCAACATTCACCTGACCAGAGAACGTTCTATGCAGGAGGGTTTAGTGGGTCTGGTGGCCCTGGTACCCCAACAATGCCCAGATTTTTTCCTATGGATCAA
GGAAATCCTGGTATGTGGAATGGACCTAGAAGCCCATTTGTCAACCACTTCCCAGGCCCTACTCCAAGGGGGATGAGCTCCCCCAGAAGCCCATTTGTCAACCAATTCCC
TAGCCATCTCTCCCCCAGCGTTCTCTCTGGACAAAGAGGCAATTCTTACTCCAATCCAACGCAAGACAGGGTTAATTACCATAGTCCAAGCCCTAGTCCTAGTCCGGGGT
ATCAAGGCAGTTCGAGTCCTGGTGGAAGCAGCCATGGTCATCGTGGCAATATGACCCCCAGACCAAGATTTGGCTCTGGACGAGGTTCTGGTTCTCATGGTCGTCGTTTT
TCATTAGATGAATCACCCAGACCGGAACCATTTTACAATGCGTCCATGCTTGATGATCCCTGGAAGGATCTGCAACCTGGTATTTGGAGGACAGTTGCTCCAGTAAGCAC
TGAATCTTGGATTTCAAAGTTTGGTACTAAGAAAGCAAGAATTTCAGATTCTTCTTCTGGCTCTGGCAGGTCAATCTCTCAACCTAGCCTCGCAGAGTACCTGGCTGCCT
CCTTCAACGAAGCAGTCAACAATGCACCAAGTTGGCCACTACTGTTCTGATTTGAACACTGAAACTATCTTTAGCCTCTTCATTTGAGTCATGTTCCTGTTCCCCTGGGA
ATTTACAACTGGCCCTCTGTCTTCTCAGAAGTTTTTTTTCCCCTACAATATGAATCTGATAGTTATTCAATTATAA
Protein sequenceShow/hide protein sequence
MEESEKRRERLRAMRMEAAQADVANYVETSLPNHLSNPLVESSATMLGQLEPCTTPRFDYYTNPMAAFSASKKRGNQTVSDNYVPSHHNNSPATYVPSNFPGLRNPEMSP
SPTQQFHQHSPDQRTFYAGGFSGSGGPGTPTMPRFFPMDQGNPGMWNGPRSPFVNHFPGPTPRGMSSPRSPFVNQFPSHLSPSVLSGQRGNSYSNPTQDRVNYHSPSPSP
SPGYQGSSSPGGSSHGHRGNMTPRPRFGSGRGSGSHGRRFSLDESPRPEPFYNASMLDDPWKDLQPGIWRTVAPVSTESWISKFGTKKARISDSSSGSGRSISQPSLAEY
LAASFNEAVNNAPSWPLLF