; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019365 (gene) of Snake gourd v1 genome

Gene IDTan0019365
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionS-protein homolog
Genome locationLG06:75309247..75309660
RNA-Seq ExpressionTan0019365
SyntenyTan0019365
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR010264 - Plant self-incompatibility S1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN61530.1 hypothetical protein Csa_006387 [Cucumis sativus]9.9e-3357.25Show/hide
Query:  LVTLMSLFFASLFTAEG-IFDRERVTVNITNVLES-HNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRDKEL
        L   +SL FA++FT EG  F    VTVNITN L+  +NQLT+HCKSG+DDLG+H LS   SY F+FRPNF+GSTLFYC+F W GS HYF IY++ RD+  
Subjt:  LVTLMSLFFASLFTAEG-IFDRERVTVNITNVLES-HNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRDKEL

Query:  C--TKCLWTVSEKGPCLSKPAGRNYDICYGW
        C  T CLW V E+G C+       YDICY W
Subjt:  C--TKCLWTVSEKGPCLSKPAGRNYDICYGW

ONI26926.1 hypothetical protein PRUPE_1G055500 [Prunus persica]1.0e-2948.51Show/hide
Query:  GCSFLILVTLMSLFFASLFTAEGIFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPN-FFGSTLFYCSFQWPGSFHYFEIYKNK
        G S ++L+T M+LF  ++    G     +  V +T+ LE    LTVHCKS +DDLG+H+L P  SY+FSF+PN F  STLF+CSFQWPG+FH+F+IY + 
Subjt:  GCSFLILVTLMSLFFASLFTAEGIFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPN-FFGSTLFYCSFQWPGSFHYFEIYKNK

Query:  RDKELCTKCLWTVSEKGPCLSKPAGRNYDICYGW
        RD ++C+KC W V E GPC+   + + Y+IC+ W
Subjt:  RDKELCTKCLWTVSEKGPCLSKPAGRNYDICYGW

XP_008465559.1 PREDICTED: pumilio homolog 15-like, partial [Cucumis melo]1.7e-3257.85Show/hide
Query:  LFTAEG-IFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRDKELCT-KCLWTVSEK
        LFT EG  F    VTVNITN L   +QLTVHCKSG+DDLGIH L P   Y F+FRPNF G+TLFYC+FQWPG  H F+IYK+ RD++ C   CLW V E+
Subjt:  LFTAEG-IFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRDKELCT-KCLWTVSEK

Query:  GPCLSKPAGRNYDICYGWLSK
        G C+     + YD CY W+ K
Subjt:  GPCLSKPAGRNYDICYGWLSK

XP_022138779.1 S-protein homolog 5 [Momordica charantia]4.3e-3657.46Show/hide
Query:  SFLILVTLMSLFFASLFTAEGIFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRDK
        S ++ ++ + LF       E +    R +VNITN+LESH QLTVHCKS +DDLG H L P  SY F FRPN + STLF+CSFQWPGSFHYFEIY  KRD+
Subjt:  SFLILVTLMSLFFASLFTAEGIFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRDK

Query:  ELCTKCLWTVSEKGPCLSKPAGRNYDICYGWLSK
        +LCT CLW V EKG CL       YDICY W S+
Subjt:  ELCTKCLWTVSEKGPCLSKPAGRNYDICYGWLSK

XP_022975612.1 S-protein homolog 3-like, partial [Cucurbita maxima]2.1e-3064.55Show/hide
Query:  MSLFFASLFTA--EGIFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRDKELCTK-
        MSLFF +LFT   E IF    +TVNITNVLESHNQLTVHCKSG+DDLGIH L     Y F+FRPNF+GST FYC+FQWPG   YF+IYK+ RD+  C K 
Subjt:  MSLFFASLFTA--EGIFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRDKELCTK-

Query:  -CLWTVSEKG
         CLW V ++G
Subjt:  -CLWTVSEKG

TrEMBL top hitse value%identityAlignment
A0A0A0LI28 S-protein homolog4.8e-3357.25Show/hide
Query:  LVTLMSLFFASLFTAEG-IFDRERVTVNITNVLES-HNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRDKEL
        L   +SL FA++FT EG  F    VTVNITN L+  +NQLT+HCKSG+DDLG+H LS   SY F+FRPNF+GSTLFYC+F W GS HYF IY++ RD+  
Subjt:  LVTLMSLFFASLFTAEG-IFDRERVTVNITNVLES-HNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRDKEL

Query:  C--TKCLWTVSEKGPCLSKPAGRNYDICYGW
        C  T CLW V E+G C+       YDICY W
Subjt:  C--TKCLWTVSEKGPCLSKPAGRNYDICYGW

A0A1S3CP53 S-protein homolog8.1e-3357.85Show/hide
Query:  LFTAEG-IFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRDKELCT-KCLWTVSEK
        LFT EG  F    VTVNITN L   +QLTVHCKSG+DDLGIH L P   Y F+FRPNF G+TLFYC+FQWPG  H F+IYK+ RD++ C   CLW V E+
Subjt:  LFTAEG-IFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRDKELCT-KCLWTVSEK

Query:  GPCLSKPAGRNYDICYGWLSK
        G C+     + YD CY W+ K
Subjt:  GPCLSKPAGRNYDICYGWLSK

A0A6J1CB24 S-protein homolog2.1e-3657.46Show/hide
Query:  SFLILVTLMSLFFASLFTAEGIFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRDK
        S ++ ++ + LF       E +    R +VNITN+LESH QLTVHCKS +DDLG H L P  SY F FRPN + STLF+CSFQWPGSFHYFEIY  KRD+
Subjt:  SFLILVTLMSLFFASLFTAEGIFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRDK

Query:  ELCTKCLWTVSEKGPCLSKPAGRNYDICYGWLSK
        +LCT CLW V EKG CL       YDICY W S+
Subjt:  ELCTKCLWTVSEKGPCLSKPAGRNYDICYGWLSK

A0A6J1IDI1 S-protein homolog9.9e-3164.55Show/hide
Query:  MSLFFASLFTA--EGIFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRDKELCTK-
        MSLFF +LFT   E IF    +TVNITNVLESHNQLTVHCKSG+DDLGIH L     Y F+FRPNF+GST FYC+FQWPG   YF+IYK+ RD+  C K 
Subjt:  MSLFFASLFTA--EGIFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRDKELCTK-

Query:  -CLWTVSEKG
         CLW V ++G
Subjt:  -CLWTVSEKG

M5XJY5 S-protein homolog4.9e-3048.51Show/hide
Query:  GCSFLILVTLMSLFFASLFTAEGIFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPN-FFGSTLFYCSFQWPGSFHYFEIYKNK
        G S ++L+T M+LF  ++    G     +  V +T+ LE    LTVHCKS +DDLG+H+L P  SY+FSF+PN F  STLF+CSFQWPG+FH+F+IY + 
Subjt:  GCSFLILVTLMSLFFASLFTAEGIFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPN-FFGSTLFYCSFQWPGSFHYFEIYKNK

Query:  RDKELCTKCLWTVSEKGPCLSKPAGRNYDICYGW
        RD ++C+KC W V E GPC+   + + Y+IC+ W
Subjt:  RDKELCTKCLWTVSEKGPCLSKPAGRNYDICYGW

SwissProt top hitse value%identityAlignment
F4JLQ5 S-protein homolog 26.2e-2240.34Show/hide
Query:  TAEGIFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRD-----KELCTKCLWTVSE
        +   +F   + TV I N L +   L  HCKS +DDLG   L P  S+ FSF   FFG TL++CSF WP   H F+IYK+ RD     K    +C+W +  
Subjt:  TAEGIFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRD-----KELCTKCLWTVSE

Query:  KGPCLSKPAGRNYDICYGW
         GPC      + +D+CY W
Subjt:  KGPCLSKPAGRNYDICYGW

F4JZG1 S-protein homolog 41.7e-1942.06Show/hide
Query:  VNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFF-GSTLFYCSFQWPGSFHYFEIYKNKRDKEL----CTKCLWTVSEKGPCLSKPAGRN
        V ITN L   + L +HCKS +DDLG+ +L+P+ S+ F FRP+   G TLF+C F WPG   +F IY + RD       C  C+W + + GPC        
Subjt:  VNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFF-GSTLFYCSFQWPGSFHYFEIYKNKRDKEL----CTKCLWTVSEKGPCLSKPAGRN

Query:  YDICYGW
        ++ICY W
Subjt:  YDICYGW

O23020 S-protein homolog 52.2e-1941.09Show/hide
Query:  VTLMSLFFASLFTA--EGIFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRDKELC
        V+++  FF  LF +   G+    R TV     L     LT+HCKS  DDLGIH++     Y F F+PN + STLF+CSFQW   F  F+IY  +RD+ +C
Subjt:  VTLMSLFFASLFTA--EGIFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRDKELC

Query:  TKCLWTVSEKGPC-LSKPAGRNYDICYGW
          C W +   GPC L K A      C+ W
Subjt:  TKCLWTVSEKGPC-LSKPAGRNYDICYGW

P0DN93 S-protein homolog 291.4e-1832.81Show/hide
Query:  ILVTLMSLFFASLFTAEGIFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRDKELC
        I V L  + F  + +  G     +  V +TN +     LT+ C+S +DDLG HLL    ++ + FRP++F +TLF C F W  +  +F+ Y++ RD+  C
Subjt:  ILVTLMSLFFASLFTAEGIFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRDKELC

Query:  TKCLWTVSEKGPCLSKPAGRNYDICYGW
          C W+++    C+S    + +D CY W
Subjt:  TKCLWTVSEKGPCLSKPAGRNYDICYGW

Q9FMQ4 S-protein homolog 32.4e-2136.84Show/hide
Query:  LILVTLMSLFFASLFTAEGIFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRD---
        +++  L+ + F+ + T   +       V ITN L     L +HCKS +DDLG+ +L+P+ S+ F FR +  G+TLFYC F WPG    F+IY + RD   
Subjt:  LILVTLMSLFFASLFTAEGIFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRD---

Query:  -KELCTKCLWTVSEKGPCLSKPAGRNYDICYGW
            C  C+W +S +GPC+   +   ++ICY W
Subjt:  -KELCTKCLWTVSEKGPCLSKPAGRNYDICYGW

Arabidopsis top hitse value%identityAlignment
AT3G16970.1 Plant self-incompatibility protein S1 family4.0e-2442.24Show/hide
Query:  FDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRDKEL----CTKCLWTVSEKGPCLS
        FD  R TV I N L  H  L  HCKS NDDLG   ++ + ++ F FRP+ FG TLF+C F W    H+F+IYK  RD+E     C +C W + + GPC  
Subjt:  FDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRDKEL----CTKCLWTVSEKGPCLS

Query:  KPAGRNYDICYGWLSK
              +D+C  W S+
Subjt:  KPAGRNYDICYGWLSK

AT3G17080.1 Plant self-incompatibility protein S1 family1.5e-2342.4Show/hide
Query:  LFFASLFTAEGIFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRDKEL----CTKC
        LFF  +     I  R   +V I N L     L  HCKS  DDLG   L+P  S+ F F P+ FG TLFYC F W    H F+IYK  RDKE     C KC
Subjt:  LFFASLFTAEGIFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRDKEL----CTKC

Query:  LWTVSEKGPCLSKPAGRNYDICYGW
         W + + GPC        +D CY W
Subjt:  LWTVSEKGPCLSKPAGRNYDICYGW

AT4G16195.1 Plant self-incompatibility protein S1 family4.4e-2340.34Show/hide
Query:  TAEGIFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRD-----KELCTKCLWTVSE
        +   +F   + TV I N L +   L  HCKS +DDLG   L P  S+ FSF   FFG TL++CSF WP   H F+IYK+ RD     K    +C+W +  
Subjt:  TAEGIFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRD-----KELCTKCLWTVSE

Query:  KGPCLSKPAGRNYDICYGW
         GPC      + +D+CY W
Subjt:  KGPCLSKPAGRNYDICYGW

AT5G12060.1 Plant self-incompatibility protein S1 family1.7e-2236.84Show/hide
Query:  LILVTLMSLFFASLFTAEGIFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRD---
        +++  L+ + F+ + T   +       V ITN L     L +HCKS +DDLG+ +L+P+ S+ F FR +  G+TLFYC F WPG    F+IY + RD   
Subjt:  LILVTLMSLFFASLFTAEGIFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRD---

Query:  -KELCTKCLWTVSEKGPCLSKPAGRNYDICYGW
            C  C+W +S +GPC+   +   ++ICY W
Subjt:  -KELCTKCLWTVSEKGPCLSKPAGRNYDICYGW

AT5G12070.1 Plant self-incompatibility protein S1 family1.2e-2042.06Show/hide
Query:  VNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFF-GSTLFYCSFQWPGSFHYFEIYKNKRDKEL----CTKCLWTVSEKGPCLSKPAGRN
        V ITN L   + L +HCKS +DDLG+ +L+P+ S+ F FRP+   G TLF+C F WPG   +F IY + RD       C  C+W + + GPC        
Subjt:  VNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFF-GSTLFYCSFQWPGSFHYFEIYKNKRDKEL----CTKCLWTVSEKGPCLSKPAGRN

Query:  YDICYGW
        ++ICY W
Subjt:  YDICYGW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGTGTTCATTTTTAATACTTGTAACGTTAATGTCGTTATTTTTTGCTAGCCTGTTTACTGCAGAAGGAATCTTCGATAGGGAAAGAGTTACAGTCAATATAACAAA
TGTTCTGGAAAGTCACAATCAGCTCACGGTTCACTGCAAATCTGGCAATGACGATTTAGGAATTCACTTGTTGTCACCTTCGACCAGCTATGACTTTAGTTTTCGTCCAA
ATTTCTTTGGTTCGACGTTGTTCTACTGCAGCTTCCAATGGCCTGGCTCGTTTCATTATTTCGAAATCTACAAGAATAAAAGAGACAAAGAGCTTTGTACAAAATGTTTA
TGGACTGTGAGTGAAAAAGGTCCCTGTTTGTCCAAACCTGCTGGCCGCAACTACGATATTTGCTATGGGTGGCTGTCTAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGTGTTCATTTTTAATACTTGTAACGTTAATGTCGTTATTTTTTGCTAGCCTGTTTACTGCAGAAGGAATCTTCGATAGGGAAAGAGTTACAGTCAATATAACAAA
TGTTCTGGAAAGTCACAATCAGCTCACGGTTCACTGCAAATCTGGCAATGACGATTTAGGAATTCACTTGTTGTCACCTTCGACCAGCTATGACTTTAGTTTTCGTCCAA
ATTTCTTTGGTTCGACGTTGTTCTACTGCAGCTTCCAATGGCCTGGCTCGTTTCATTATTTCGAAATCTACAAGAATAAAAGAGACAAAGAGCTTTGTACAAAATGTTTA
TGGACTGTGAGTGAAAAAGGTCCCTGTTTGTCCAAACCTGCTGGCCGCAACTACGATATTTGCTATGGGTGGCTGTCTAAATAA
Protein sequenceShow/hide protein sequence
MGCSFLILVTLMSLFFASLFTAEGIFDRERVTVNITNVLESHNQLTVHCKSGNDDLGIHLLSPSTSYDFSFRPNFFGSTLFYCSFQWPGSFHYFEIYKNKRDKELCTKCL
WTVSEKGPCLSKPAGRNYDICYGWLSK