; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG07G006080 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG07G006080
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionS-protein homolog
Genome locationCG_Chr07:10089623..10090063
RNA-Seq ExpressionClCG07G006080
SyntenyClCG07G006080
Gene Ontology termsNA
InterPro domainsIPR010264 - Plant self-incompatibility S1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8648119.1 hypothetical protein Csa_004728 [Cucumis sativus]5.9e-5279.49Show/hide
Query:  GSITPQCSALSMGSKYPRSFSEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWREKGEWLGS
        GSIT QCSA SM SKY  SFS WT+ I+N+L SNQ+LF HCKSKDD+LGDHTVE GQTYQW FKENAL TTLFWCTLRTP NLH +FEVFWRE+GEWL S
Subjt:  GSITPQCSALSMGSKYPRSFSEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWREKGEWLGS

Query:  RCNFRACLWYARDDGIY
        RCNFRAC+WYARDDG Y
Subjt:  RCNFRACLWYARDDGIY

KAG2712058.1 hypothetical protein I3760_04G107600 [Carya illinoinensis]2.5e-3452Show/hide
Query:  ALSMGSKYPRSFSEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWREKGEWLGSRCNFRACL
        +L+  +K   S  +  + +VN L   Q LF HCKS+DDDLG H + VG  Y W F+ N L+TTLFWC +RT   LH  F+VFW  KG+WL  RCN++ C+
Subjt:  ALSMGSKYPRSFSEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWREKGEWLGSRCNFRACL

Query:  WYARDDGIYLLNIPQSSYELIHKWE
        W A+DDGIYL NIP++  EL+HKWE
Subjt:  WYARDDGIYLLNIPQSSYELIHKWE

XP_022143694.1 S-protein homolog 1-like [Momordica charantia]3.3e-3448.92Show/hide
Query:  LTIAGSITPQCSALSMGSKYPRSFSEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWRE--K
        + +   +   CS L+  S +P S+  W + IVN+L S Q LF HC+SKDDDLG+H + VG  Y + FK+N   TTLFWC LR P N H  F+V+W +  K
Subjt:  LTIAGSITPQCSALSMGSKYPRSFSEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWRE--K

Query:  GEWLGSRCNFRACLWYARDDGIYLLNIPQSSYELIHKWE
        G WL +RCN++ C+W A+DDGIY+ +IP +  +LIHKWE
Subjt:  GEWLGSRCNFRACLWYARDDGIYLLNIPQSSYELIHKWE

XP_022152463.1 S-protein homolog 1-like [Momordica charantia]3.1e-3245.83Show/hide
Query:  LVLILTIAGSITPQCSALSMGSKYPRSFSEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWR
        +V+ L + G     CS +++ + +P     WT+ IVN+L + Q+LF HCKSKDDDLG+H ++ G  Y + FK+N   TTLFWC LR P N H +F+V+W 
Subjt:  LVLILTIAGSITPQCSALSMGSKYPRSFSEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWR

Query:  E--KGEWLGSRCNFRACLWYARDDGIYLLNIPQSSYELIHKWEH
        +  KG WL +RC+++ C+W A+ DGIY+ NIP +  EL+H WE+
Subjt:  E--KGEWLGSRCNFRACLWYARDDGIYLLNIPQSSYELIHKWEH

XP_042977762.1 S-protein homolog 1-like [Carya illinoinensis]1.1e-3449.64Show/hide
Query:  LILTIAGSITPQCSALSMGSKYPRSFSEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWREK
        L++ I    TP   +L+  +K   S  +  + +VN L   Q LF HCKS+DDDLG H + VG  Y W F+ N L+TTLFWC +RT   LH  F+VFW  K
Subjt:  LILTIAGSITPQCSALSMGSKYPRSFSEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWREK

Query:  GEWLGSRCNFRACLWYARDDGIYLLNIPQSSYELIHKWE
        G+WL  RCN++ C+W A+DDGIYL NIP++  EL+HKWE
Subjt:  GEWLGSRCNFRACLWYARDDGIYLLNIPQSSYELIHKWE

TrEMBL top hitse value%identityAlignment
A0A2P6QC93 S-protein homolog6.2e-3153.85Show/hide
Query:  PRSFSEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWREKG-EWLGSRCNFRACLWYARDDG
        P S   W + IVN L S + LF HCKSKD+DLG H + VG    W FKEN + TTLFWC LRT    +  F+VFW E   +WL +RCN++ C+W ARDDG
Subjt:  PRSFSEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWREKG-EWLGSRCNFRACLWYARDDG

Query:  IYLLNIPQSSYELIHKW
        +Y+ NIP +S ELIH+W
Subjt:  IYLLNIPQSSYELIHKW

A0A5A7UB41 S-protein homolog1.3e-3141.78Show/hide
Query:  MIPSLVLILTIAGSITPQCSALSMGSKYPRSFSEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFE
        ++P +V+++ + G      S L  GSK    FS W + + NQL   Q L  HCKSKD+DLG+H++ VG+ + W FKEN  STT FWC+L + +   ++ +
Subjt:  MIPSLVLILTIAGSITPQCSALSMGSKYPRSFSEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFE

Query:  VFWREKGEWLGSRCNFRACLWYARDDGIYLLNIPQSSYELIHKWEH
        VFW E+ +WL  RCN+  C+W A+DDGIY++N+  +  E + KW++
Subjt:  VFWREKGEWLGSRCNFRACLWYARDDGIYLLNIPQSSYELIHKWEH

A0A6J1CQ33 S-protein homolog1.6e-3448.92Show/hide
Query:  LTIAGSITPQCSALSMGSKYPRSFSEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWRE--K
        + +   +   CS L+  S +P S+  W + IVN+L S Q LF HC+SKDDDLG+H + VG  Y + FK+N   TTLFWC LR P N H  F+V+W +  K
Subjt:  LTIAGSITPQCSALSMGSKYPRSFSEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWRE--K

Query:  GEWLGSRCNFRACLWYARDDGIYLLNIPQSSYELIHKWE
        G WL +RCN++ C+W A+DDGIY+ +IP +  +LIHKWE
Subjt:  GEWLGSRCNFRACLWYARDDGIYLLNIPQSSYELIHKWE

A0A6J1CQW3 S-protein homolog9.0e-3045.32Show/hide
Query:  ILTIAGSITPQCSALSMGSKYPRSFSEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFW--RE
        +L +   +    S  +  S +P   ++W + IVN L S Q LF HCKSKDDDLG H + VG  + W+F++N   TTLFWC +R P N + SFEV+W  + 
Subjt:  ILTIAGSITPQCSALSMGSKYPRSFSEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFW--RE

Query:  KGEWLGSRCNFRACLWYARDDGIYLLNIPQSSYELIHKW
        K +WL + CN+  C+W A+DDGIY+ NI  +  EL HKW
Subjt:  KGEWLGSRCNFRACLWYARDDGIYLLNIPQSSYELIHKW

A0A6J1DEX4 S-protein homolog1.5e-3245.83Show/hide
Query:  LVLILTIAGSITPQCSALSMGSKYPRSFSEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWR
        +V+ L + G     CS +++ + +P     WT+ IVN+L + Q+LF HCKSKDDDLG+H ++ G  Y + FK+N   TTLFWC LR P N H +F+V+W 
Subjt:  LVLILTIAGSITPQCSALSMGSKYPRSFSEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWR

Query:  E--KGEWLGSRCNFRACLWYARDDGIYLLNIPQSSYELIHKWEH
        +  KG WL +RC+++ C+W A+ DGIY+ NIP +  EL+H WE+
Subjt:  E--KGEWLGSRCNFRACLWYARDDGIYLLNIPQSSYELIHKWEH

SwissProt top hitse value%identityAlignment
F2Q9V4 S-protein homolog 61.7e-0935.42Show/hide
Query:  LFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWREKGEWLGSRCNFRACLWYARDDGIYLLNIPQSSYELIHKW
        L  HCKS+DDD G H ++ G  Y W F  N +++TL++C           F+++   K     SRC  R C W A++DGIY          L +KW
Subjt:  LFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWREKGEWLGSRCNFRACLWYARDDGIYLLNIPQSSYELIHKW

F4JLQ5 S-protein homolog 25.1e-1431.15Show/hide
Query:  SMGSKYPRSFSEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWREKGEWLGSRCNFRACLWY
        S  S +P   S+ T+ I N LG+   L  HCKSKDDDLG+ T++ G+++ + F       TL++C+   P   H SF+++   +     ++C    C+W 
Subjt:  SMGSKYPRSFSEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWREKGEWLGSRCNFRACLWY

Query:  ARDDGIYLLNIPQSSYELIHKW
         R +G    N     ++L + W
Subjt:  ARDDGIYLLNIPQSSYELIHKW

F4JLS0 S-protein homolog 13.3e-2140.71Show/hide
Query:  SEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWREKGEWLGSRCNFRACLWYARDDGIYLLN
        SEW + +VN L + + LF HCKSK+DDLG+  ++    + W+F EN L +T FWC +    N H++  VFW +    L  RC ++ C+W A+ DG+YL N
Subjt:  SEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWREKGEWLGSRCNFRACLWYARDDGIYLLN

Query:  IPQSSYELIHKWE
               L  KWE
Subjt:  IPQSSYELIHKWE

Q2HQ46 S-protein homolog 746.2e-2038.94Show/hide
Query:  SEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWREKGEWLGSRCNFRACLWYARDDGIYLLN
        SEW + + N L + + LF HCKSK++DLGD  ++    + W+F EN L +TLFWC + +  + H++ +VFW +    L  RC+++ C+W A++DG+YL N
Subjt:  SEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWREKGEWLGSRCNFRACLWYARDDGIYLLN

Query:  IPQSSYELIHKWE
               L  KW+
Subjt:  IPQSSYELIHKWE

Q40975 Self-incompatibility protein S12.6e-1029.06Show/hide
Query:  SKYPRSFSEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWREKGEWLGSRCNFRACLWYARD
        SK    F    + I+N+ G+ + +  HC+SKD+DL + TV  G    + F+E+   TT F+C L+        F  +  ++ +    RC+ + CLW   D
Subjt:  SKYPRSFSEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWREKGEWLGSRCNFRACLWYARD

Query:  DGIYLLNIPQSSYELIH
        DG+Y  +     +++ H
Subjt:  DGIYLLNIPQSSYELIH

Arabidopsis top hitse value%identityAlignment
AT4G16195.1 Plant self-incompatibility protein S1 family3.6e-1531.15Show/hide
Query:  SMGSKYPRSFSEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWREKGEWLGSRCNFRACLWY
        S  S +P   S+ T+ I N LG+   L  HCKSKDDDLG+ T++ G+++ + F       TL++C+   P   H SF+++   +     ++C    C+W 
Subjt:  SMGSKYPRSFSEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWREKGEWLGSRCNFRACLWY

Query:  ARDDGIYLLNIPQSSYELIHKW
         R +G    N     ++L + W
Subjt:  ARDDGIYLLNIPQSSYELIHKW

AT4G16295.1 S-protein homologue 12.3e-2240.71Show/hide
Query:  SEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWREKGEWLGSRCNFRACLWYARDDGIYLLN
        SEW + +VN L + + LF HCKSK+DDLG+  ++    + W+F EN L +T FWC +    N H++  VFW +    L  RC ++ C+W A+ DG+YL N
Subjt:  SEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWREKGEWLGSRCNFRACLWYARDDGIYLLN

Query:  IPQSSYELIHKWE
               L  KWE
Subjt:  IPQSSYELIHKWE

AT4G29035.1 Plant self-incompatibility protein S1 family4.4e-2138.94Show/hide
Query:  SEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWREKGEWLGSRCNFRACLWYARDDGIYLLN
        SEW + + N L + + LF HCKSK++DLGD  ++    + W+F EN L +TLFWC + +  + H++ +VFW +    L  RC+++ C+W A++DG+YL N
Subjt:  SEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWREKGEWLGSRCNFRACLWYARDDGIYLLN

Query:  IPQSSYELIHKWE
               L  KW+
Subjt:  IPQSSYELIHKWE

AT5G04347.1 Plant self-incompatibility protein S1 family5.2e-1436.21Show/hide
Query:  SFSEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTL-RTPT-NLHLSFEVFWREKGEWLGSRCNFRACLWYARDDGI
        +F   T+ + N+L +N+ L   C+SKDD+LGDH + VGQ  + +F +N    TLFWC L + P   LH++F+ +  +    +G R      LW AR+DGI
Subjt:  SFSEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTL-RTPT-NLHLSFEVFWREKGEWLGSRCNFRACLWYARDDGI

Query:  YLLNIPQSSYELIHKW
        Y    P++  +  + W
Subjt:  YLLNIPQSSYELIHKW

AT5G04350.1 Plant self-incompatibility protein S1 family3.4e-1337.76Show/hide
Query:  EWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNL--HLSFEVFWREKGEWLGSRCNFRACLWYARDDGIY
        E  + + NQL  ++ L  HC+SKDDDLG+H +++GQ Y++ F +N   TT F C +    N   HL F  +     E   S+    +C W  R+DGIY
Subjt:  EWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNL--HLSFEVFWREKGEWLGSRCNFRACLWYARDDGIY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTCCATCTCTTGTTTTGATTTTGACGATTGCAGGATCAATAACACCACAATGTTCAGCTTTGTCGATGGGTTCAAAGTATCCACGATCTTTCAGTGAATGGACAAT
AGGAATTGTGAACCAACTAGGTTCAAATCAGAAACTCTTCGCTCATTGCAAGTCAAAAGATGATGATTTGGGAGATCATACAGTAGAAGTTGGGCAAACCTACCAATGGC
ATTTTAAGGAAAATGCACTTTCAACAACGCTCTTTTGGTGCACTTTAAGAACTCCAACAAACCTACACCTATCATTCGAAGTGTTTTGGAGAGAAAAAGGAGAGTGGTTG
GGTTCTCGTTGCAATTTCCGGGCTTGTCTTTGGTATGCTCGAGATGATGGCATTTATTTGCTTAATATTCCTCAAAGTTCTTATGAGCTTATACATAAATGGGAACATTA
G
mRNA sequenceShow/hide mRNA sequence
ATGATTCCATCTCTTGTTTTGATTTTGACGATTGCAGGATCAATAACACCACAATGTTCAGCTTTGTCGATGGGTTCAAAGTATCCACGATCTTTCAGTGAATGGACAAT
AGGAATTGTGAACCAACTAGGTTCAAATCAGAAACTCTTCGCTCATTGCAAGTCAAAAGATGATGATTTGGGAGATCATACAGTAGAAGTTGGGCAAACCTACCAATGGC
ATTTTAAGGAAAATGCACTTTCAACAACGCTCTTTTGGTGCACTTTAAGAACTCCAACAAACCTACACCTATCATTCGAAGTGTTTTGGAGAGAAAAAGGAGAGTGGTTG
GGTTCTCGTTGCAATTTCCGGGCTTGTCTTTGGTATGCTCGAGATGATGGCATTTATTTGCTTAATATTCCTCAAAGTTCTTATGAGCTTATACATAAATGGGAACATTA
G
Protein sequenceShow/hide protein sequence
MIPSLVLILTIAGSITPQCSALSMGSKYPRSFSEWTIGIVNQLGSNQKLFAHCKSKDDDLGDHTVEVGQTYQWHFKENALSTTLFWCTLRTPTNLHLSFEVFWREKGEWL
GSRCNFRACLWYARDDGIYLLNIPQSSYELIHKWEH