; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10014647 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10014647
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionS-protein homolog
Genome locationChr02:17230082..17230528
RNA-Seq ExpressionHG10014647
SyntenyHG10014647
Gene Ontology termsNA
InterPro domainsIPR010264 - Plant self-incompatibility S1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8648168.1 hypothetical protein Csa_018354 [Cucumis sativus]8.1e-4174.75Show/hide
Query:  LDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFYVDETFIDQFCPNLKCVWIAKQEGLYVVNNARQHVLIHRWKMLRMP
        +D HCWSKD+DLGL ILLPDE+Q W FRRNF+GTTLFHCRLEWERGFREFDAF+VDE F++QFCPN  CVWIAKQ+GLY++N A Q V  H WK+LR+P
Subjt:  LDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFYVDETFIDQFCPNLKCVWIAKQEGLYVVNNARQHVLIHRWKMLRMP

KAE8648174.1 hypothetical protein Csa_018375 [Cucumis sativus]2.1e-3677.91Show/hide
Query:  LDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFYVDETFIDQFCPNLKCVWIAKQEGLYVVNNARQ
        +D HCWSKDDDLGL ILLPDE+Q W FRRNF+GTTLFHCRLEWERGFREFDAF+VDE F++QFCPN  CVWIA+Q+GLY++N A Q
Subjt:  LDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFYVDETFIDQFCPNLKCVWIAKQEGLYVVNNARQ

KAG6600849.1 S-protein-like 74, partial [Cucurbita argyrosperma subsp. sororia]2.8e-4966.67Show/hide
Query:  SRPTVG--RAIRRPNRSLIYIPKFHVEIHNELKMYILDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFYVDETFIDQFC
        S PT G  +  ++  RSL+ IPKFHVEI N LKMYILDSHC+SKDDDLGL+IL PD++Q WSFR NF G TLFHCRLEWERGF+EFDAF+VDE F D++C
Subjt:  SRPTVG--RAIRRPNRSLIYIPKFHVEIHNELKMYILDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFYVDETFIDQFC

Query:  PNLKCVWIAKQEGLYVVNNARQHVLIHRWKMLRMP
         N +C+WIAKQ+G+Y++N A Q V  HRWK+LR+P
Subjt:  PNLKCVWIAKQEGLYVVNNARQHVLIHRWKMLRMP

KAG7031483.1 S-protein-like 1, partial [Cucurbita argyrosperma subsp. argyrosperma]1.7e-5165.13Show/hide
Query:  MHQLVLLLLCLLLIASRPTVG----RAIRRPNRSLIYIPKFHVEIHNELKMYILDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGF
        M +LVLL L LLL+ S PT G    +  ++  RSL+ IPKFHVEI N LKMYILDSHC+SKDDDLGL+IL PD++Q WSFR NF G TLFHCRLEWERGF
Subjt:  MHQLVLLLLCLLLIASRPTVG----RAIRRPNRSLIYIPKFHVEIHNELKMYILDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGF

Query:  REFDAFYVDETFIDQFCPNLKCVWIAKQEGLYVVNNARQHVLIHRWKMLRMP
        +EFDAF+VDE F D++C N +C+WIAKQ+G+Y++N A Q V  HRWK+LR+P
Subjt:  REFDAFYVDETFIDQFCPNLKCVWIAKQEGLYVVNNARQHVLIHRWKMLRMP

XP_023546939.1 S-protein homolog 74-like [Cucurbita pepo subsp. pepo]9.9e-4767.74Show/hide
Query:  RRPNRSLIYIPKFHVEIHNELKMYILDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFYVDETFIDQFCPNLKCVWIAKQ
        ++  RSLI IPKF VEI N L+MYILDSHC+SKDDDLGL+IL PD++Q WSFR N  G TLFHCRLEWERGF+EFDAF+VDE F D++C N +C+WIAKQ
Subjt:  RRPNRSLIYIPKFHVEIHNELKMYILDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFYVDETFIDQFCPNLKCVWIAKQ

Query:  EGLYVVNNARQHVLIHRWKMLRMP
        +G+Y++N A Q V  HRWK+LR+P
Subjt:  EGLYVVNNARQHVLIHRWKMLRMP

TrEMBL top hitse value%identityAlignment
A0A0A0KL79 S-protein homolog1.2e-3454.17Show/hide
Query:  PNRSLIYIPKFHVEIHNELKMYILDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFYVDETFIDQFCPNLKCVWIAKQEG
        PN  ++   +F VEIHN+L+M+ILDSHC+SKDDDLGL+IL PDEKQ WSF+ N++ TT FHCRLEWE G+ EFD+F     F+  +C N  C+W A+Q+G
Subjt:  PNRSLIYIPKFHVEIHNELKMYILDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFYVDETFIDQFCPNLKCVWIAKQEG

Query:  LYVVNNARQHVLIHRWKMLR
        +Y+ N A + V  + W+M+R
Subjt:  LYVVNNARQHVLIHRWKMLR

A0A0A0KSN2 S-protein homolog3.1e-3046.21Show/hide
Query:  HQLVLLLLCLLLIASRPTVGRAIRRPNR-SLIYIPKFHVEIHNELKMYILDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFD
        H LVLL+   L++    ++ R    P + S+I +  + +EIHN+L+MY+LDSHC+SKD+DLGL+IL P E Q WSF+ N   TT F C LEWE G  EFD
Subjt:  HQLVLLLLCLLLIASRPTVGRAIRRPNR-SLIYIPKFHVEIHNELKMYILDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFD

Query:  AFYVDETFIDQFCPNLKCVWIAKQEGLYVVNNARQHVLIHRWKML
        +F  +  F++ FC NL C W A+Q+G+Y+ N   ++V    W ML
Subjt:  AFYVDETFIDQFCPNLKCVWIAKQEGLYVVNNARQHVLIHRWKML

A0A1S3BUQ9 S-protein homolog1.0e-3350.79Show/hide
Query:  GRAIRRPNRSLIYIPKFHVEIHNELKMYILDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFYVDETFIDQFCPNLKCVW
        G  + +P+  L+   KF++ I+NEL+MY+LDSHC+SKDDDLG  +L P+++Q WSFR N+LGTT FHC+LEWE G+ EFDAF  D  F+  FC    C W
Subjt:  GRAIRRPNRSLIYIPKFHVEIHNELKMYILDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFYVDETFIDQFCPNLKCVW

Query:  IAKQEGLYVVNNARQHVLIHRWKMLR
         A+Q+G+Y+ N   Q V    W+M+R
Subjt:  IAKQEGLYVVNNARQHVLIHRWKMLR

A0A6J1F5H5 S-protein homolog4.1e-3054.1Show/hide
Query:  RPNRSLIYIPKFHVEIHNELKMYILDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFY--VDETFIDQFCPNLKCVWIAK
        +PN       +F VEIHN+L+M+ILD+HC SKDDDLGL+IL PDE+Q WSF  N+LGTT FHCRLEW+ G  EFDAF+   D   ID +C N  C+W A+
Subjt:  RPNRSLIYIPKFHVEIHNELKMYILDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFY--VDETFIDQFCPNLKCVWIAK

Query:  QEGLYVVNNARQHVLIHRWKML
        Q+G+Y+ N     V    W ML
Subjt:  QEGLYVVNNARQHVLIHRWKML

A0A6J1FMU2 S-protein homolog1.7e-3148.7Show/hide
Query:  MHQLVLLLLCLL-------LIAS--RPTVGRAIRRPNRSLIYIPKFHVEIHNELKMYILDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLE
        M +L LLLL LL        +AS  +   G     PN+  I + KF V I N L+MY+LDSHC SKDDDLG+ ++ PD++QRWSFR N+LG+T FHC+LE
Subjt:  MHQLVLLLLCLL-------LIAS--RPTVGRAIRRPNRSLIYIPKFHVEIHNELKMYILDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLE

Query:  WERGFREFDAFYVDETFIDQFCPNLKCVWIAKQEGLYVVNNARQHVLIHRWKML
        W  GF EFDAF  D  F+  FC +  CVW AKQ+GLY+ +   Q V    W+++
Subjt:  WERGFREFDAFYVDETFIDQFCPNLKCVWIAKQEGLYVVNNARQHVLIHRWKML

SwissProt top hitse value%identityAlignment
F4JLQ5 S-protein homolog 25.9e-1037.96Show/hide
Query:  VEIHNEL-KMYILDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFYVD--ETFIDQFCPNLKCVWIAKQEGLYVVNN-AR
        VEI+N+L     L  HC SKDDDLG   L P E   +SF R F G TL+ C   W      FD  Y D  ++  D  C + +CVW  ++ G    N+  +
Subjt:  VEIHNEL-KMYILDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFYVD--ETFIDQFCPNLKCVWIAKQEGLYVVNN-AR

Query:  QHVLIHRW
        Q  L + W
Subjt:  QHVLIHRW

F4JLS0 S-protein homolog 13.8e-0934.41Show/hide
Query:  HCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFYVDETFIDQFCPNLKCVWIAKQEGLYVVNNAR-QHVLIHRWKM
        HC SK+DDLG   L    +  W+F  N L +T F C +  + G    + F+ D+  +   C    C+W AK +GLY+ N+A  + VL  +W++
Subjt:  HCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFYVDETFIDQFCPNLKCVWIAKQEGLYVVNNAR-QHVLIHRWKM

Q2HQ46 S-protein homolog 741.7e-0934.04Show/hide
Query:  LLLLCLLLIASRPTVGRAIRRPNRSLIYIPKFHVEIHNELKM-YILDSHCWSKDDDLG-LNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFY
        L+L C   + +R T  R I  P      I ++ V + N L     L  HC SK++DLG +N+   D +  W+F  N L +TLF C +  + G      F+
Subjt:  LLLLCLLLIASRPTVGRAIRRPNRSLIYIPKFHVEIHNELKM-YILDSHCWSKDDDLG-LNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFY

Query:  VDETFIDQFCPNLKCVWIAKQEGLYVVNNA-RQHVLIHRWK
         D+  +   C    CVW AK +GLY+ N+A  + VL  +WK
Subjt:  VDETFIDQFCPNLKCVWIAKQEGLYVVNNA-RQHVLIHRWK

Q3E9W6 S-protein homolog 202.1e-0736.96Show/hide
Query:  HVEIHNELKMYI-LDSHCWSKDDDLGLNILLPDEKQRWSFRR--NFLGTTLFHCRLEWERGFREFDAFYVDETFIDQFCPNLKCVWIAKQEG
        +V+I N++   + L  HC SKD DLG   L P  +Q W FR+  +F G TLF C  EWE   + FD            C    CVW  +  G
Subjt:  HVEIHNELKMYI-LDSHCWSKDDDLGLNILLPDEKQRWSFRR--NFLGTTLFHCRLEWERGFREFDAFYVDETFIDQFCPNLKCVWIAKQEG

Q9FMQ4 S-protein homolog 31.7e-0936.36Show/hide
Query:  LDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFYVDETFIDQFCPNLKCVWIAKQEG
        L+ HC S DDDLGL IL P+    + FR + +GTTLF+C   W    + FD +  D   +      + C+W    +G
Subjt:  LDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFYVDETFIDQFCPNLKCVWIAKQEG

Arabidopsis top hitse value%identityAlignment
AT4G16195.1 Plant self-incompatibility protein S1 family4.2e-1137.96Show/hide
Query:  VEIHNEL-KMYILDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFYVD--ETFIDQFCPNLKCVWIAKQEGLYVVNN-AR
        VEI+N+L     L  HC SKDDDLG   L P E   +SF R F G TL+ C   W      FD  Y D  ++  D  C + +CVW  ++ G    N+  +
Subjt:  VEIHNEL-KMYILDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFYVD--ETFIDQFCPNLKCVWIAKQEGLYVVNN-AR

Query:  QHVLIHRW
        Q  L + W
Subjt:  QHVLIHRW

AT4G16295.1 S-protein homologue 12.7e-1034.41Show/hide
Query:  HCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFYVDETFIDQFCPNLKCVWIAKQEGLYVVNNAR-QHVLIHRWKM
        HC SK+DDLG   L    +  W+F  N L +T F C +  + G    + F+ D+  +   C    C+W AK +GLY+ N+A  + VL  +W++
Subjt:  HCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFYVDETFIDQFCPNLKCVWIAKQEGLYVVNNAR-QHVLIHRWKM

AT4G29035.1 Plant self-incompatibility protein S1 family1.2e-1034.04Show/hide
Query:  LLLLCLLLIASRPTVGRAIRRPNRSLIYIPKFHVEIHNELKM-YILDSHCWSKDDDLG-LNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFY
        L+L C   + +R T  R I  P      I ++ V + N L     L  HC SK++DLG +N+   D +  W+F  N L +TLF C +  + G      F+
Subjt:  LLLLCLLLIASRPTVGRAIRRPNRSLIYIPKFHVEIHNELKM-YILDSHCWSKDDDLG-LNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFY

Query:  VDETFIDQFCPNLKCVWIAKQEGLYVVNNA-RQHVLIHRWK
         D+  +   C    CVW AK +GLY+ N+A  + VL  +WK
Subjt:  VDETFIDQFCPNLKCVWIAKQEGLYVVNNA-RQHVLIHRWK

AT5G06020.1 Plant self-incompatibility protein S1 family3.9e-0938.46Show/hide
Query:  YILDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFYVDETFIDQF-CPNLKCVWIAKQEGLYVVNNARQHV
        Y+L  HC SKDDDLG +I    E   W F  NF  +TL+ C   + +G      F +D    D + C N  C W AK++ LY  +N  Q V
Subjt:  YILDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFYVDETFIDQF-CPNLKCVWIAKQEGLYVVNNARQHV

AT5G12060.1 Plant self-incompatibility protein S1 family1.2e-1036.36Show/hide
Query:  LDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFYVDETFIDQFCPNLKCVWIAKQEG
        L+ HC S DDDLGL IL P+    + FR + +GTTLF+C   W    + FD +  D   +      + C+W    +G
Subjt:  LDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFYVDETFIDQFCPNLKCVWIAKQEG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATCAACTCGTATTATTATTGTTATGCCTCTTACTTATCGCATCTCGACCAACGGTCGGAAGGGCAATAAGACGACCAAATAGATCTCTTATTTATATCCCAAAATT
TCATGTAGAAATTCACAACGAGTTGAAAATGTATATATTGGATAGCCATTGTTGGTCGAAAGATGATGATTTAGGGTTAAATATACTTCTTCCGGATGAAAAACAAAGGT
GGTCATTCAGAAGGAATTTTTTGGGAACAACCTTATTCCATTGCAGATTGGAATGGGAAAGAGGATTTAGGGAGTTTGATGCATTTTATGTTGATGAAACTTTCATTGAT
CAATTCTGTCCCAATCTAAAATGTGTTTGGATAGCCAAACAAGAAGGGCTTTATGTGGTGAATAATGCTCGTCAACATGTTCTTATTCATCGTTGGAAAATGCTTCGTAT
GCCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCATCAACTCGTATTATTATTGTTATGCCTCTTACTTATCGCATCTCGACCAACGGTCGGAAGGGCAATAAGACGACCAAATAGATCTCTTATTTATATCCCAAAATT
TCATGTAGAAATTCACAACGAGTTGAAAATGTATATATTGGATAGCCATTGTTGGTCGAAAGATGATGATTTAGGGTTAAATATACTTCTTCCGGATGAAAAACAAAGGT
GGTCATTCAGAAGGAATTTTTTGGGAACAACCTTATTCCATTGCAGATTGGAATGGGAAAGAGGATTTAGGGAGTTTGATGCATTTTATGTTGATGAAACTTTCATTGAT
CAATTCTGTCCCAATCTAAAATGTGTTTGGATAGCCAAACAAGAAGGGCTTTATGTGGTGAATAATGCTCGTCAACATGTTCTTATTCATCGTTGGAAAATGCTTCGTAT
GCCCTGA
Protein sequenceShow/hide protein sequence
MHQLVLLLLCLLLIASRPTVGRAIRRPNRSLIYIPKFHVEIHNELKMYILDSHCWSKDDDLGLNILLPDEKQRWSFRRNFLGTTLFHCRLEWERGFREFDAFYVDETFID
QFCPNLKCVWIAKQEGLYVVNNARQHVLIHRWKMLRMP