CuGenDBv2

Tan0021087 (gene) of Snake gourd v1 genome

Gene ID: Tan0021087
Organism: Trichosanthes anguina (Snake gourd v1)
Description: S-protein homolog
Genome location: LG06:16739576..16740010
RNA-Seq Expression: Tan0021087
Synteny: Tan0021087
Gene Ontology terms:
  GO:0060320 - rejection of self pollen (biological process)
  GO:0005576 - extracellular region (cellular component)
InterPro domains: IPR010264 - Plant self-incompatibility S1
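The genome location above can be turned into a span length with a few lines of Python. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (the record does not state the convention); the helper name `parse_location` is hypothetical and not part of any CuGenDB tooling.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split 'SEQID:START..END' into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("LG06:16739576..16740010")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the span is 435 bp, the same length as the CDS listed in this record.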


Homology
GenBank top hits | E-value | % identity
XP_016902699.1 PREDICTED: uncharacterized protein LOC107991826 [Cucumis melo] | 1.5e-52 | 65.03
Query:  MKSMKKHFSL--FLIVLSLAMLESARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAF
        M+SMK HF +  FL VLSLA+++  +A  L++WHIH++NGLSN Q L VHC+SKDDDLG+  +SVG EF+WTF++NFW+TTLFWCYL+K NA+ VSFEAF
Subjt:  MKSMKKHFSL--FLIVLSLAMLESARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAF

Query:  WIESRSVWLFHKCFDSNCIWTAKDDGIYLKDNTTQVDVLIHQW
        W+E +S+WLF++CF SNCIWTAKDDGIYLKDN    D L+H W
Subjt:  WIESRSVWLFHKCFDSNCIWTAKDDGIYLKDNTTQVDVLIHQW

XP_022143772.1 S-protein homolog 74-like [Momordica charantia] | 1.3e-54 | 66.21
Query:  MEMKSMKKHFSLFLIVLSLAMLESARA-AELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEA
        M ++S KKHF +FL+VLSL +LE   A +ELKKW IHV+NGLSNGQ L VHCKSKD+DLGEHN++ G EF+WTFRVN WNTTLFWCYL K + +  SF+ 
Subjt:  MEMKSMKKHFSLFLIVLSLAMLESARA-AELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEA

Query:  FWIESRSVWLFHKCFDSNCIWTAKDDGIYLKDNTTQVDVLIHQWK
        FW+E +S+WLF++C+ SNCIWTAKDDGIYL+DN  Q D+L+H+WK
Subjt:  FWIESRSVWLFHKCFDSNCIWTAKDDGIYLKDNTTQVDVLIHQWK

XP_022143829.1 S-protein homolog 74-like [Momordica charantia] | 1.1e-50 | 61.81
Query:  MEMKSMKKHFSLFLIVLSLAMLESARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAF
        M  + +KKHF + L+ LSLA++E   + ELK+W+IHV+NGL NG++L VHCKS+DDDLGE N+  GAEF WTFRVN  +TTLFWC+L+K +AQ VSF+AF
Subjt:  MEMKSMKKHFSLFLIVLSLAMLESARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAF

Query:  WIESRSVWLFHKCFDSNCIWTAKDDGIYLKDNTTQVDVLIHQWK
        W+E  S+WLF++C+D+NCIWTAKDDG+YL+DN  Q DVL+H+W+
Subjt:  WIESRSVWLFHKCFDSNCIWTAKDDGIYLKDNTTQVDVLIHQWK

XP_031745090.1 S-protein homolog 1-like [Cucumis sativus] | 9.0e-53 | 67.59
Query:  MEMKSMKKHFSLFLIVLSLAMLESARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAF
        +E+  MKK   + L VL LA+LE  +A EL+KWHIHV+NGLSNGQ+LL HCKSKD+DLGE  +  G EF+W FRVNFWNTTLFWCYL+K N Q  SFE+F
Subjt:  MEMKSMKKHFSLFLIVLSLAMLESARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAF

Query:  WIESRSVWLFHKCFDSNCIWTAKDDGIYLKDN-TTQVDVLIHQWK
        WIESRSVWL+  CF+ NCIWTAKDDGIYLKDN  T  D+LIH+W+
Subjt:  WIESRSVWLFHKCFDSNCIWTAKDDGIYLKDN-TTQVDVLIHQWK

XP_038896594.1 S-protein homolog 1-like [Benincasa hispida] | 1.5e-55 | 70.14
Query:  MEMKSMKKHFSLFLIVLSLAMLESARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAF
        MEM+ M K   +   VL LAML+  +AAEL KW IHV+NGLSNGQ+L VHCKSKD+DLGEH +SVG EF+W FRVNFWNTTLFWCYL+K NAQ  SF+AF
Subjt:  MEMKSMKKHFSLFLIVLSLAMLESARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAF

Query:  WIESRSVWLFHKCFDSNCIWTAKDDGIYLKDNTTQVDVLIHQWK
        WIES SVWL++ C+DSNCIW AKDDG+YLKDNT   DVLIH+W+
Subjt:  WIESRSVWLFHKCFDSNCIWTAKDDGIYLKDNTTQVDVLIHQWK

TrEMBL top hits | E-value | % identity
A0A1S4E390 S-protein homolog | 7.4e-53 | 65.03
Query:  MKSMKKHFSL--FLIVLSLAMLESARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAF
        M+SMK HF +  FL VLSLA+++  +A  L++WHIH++NGLSN Q L VHC+SKDDDLG+  +SVG EF+WTF++NFW+TTLFWCYL+K NA+ VSFEAF
Subjt:  MKSMKKHFSL--FLIVLSLAMLESARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAF

Query:  WIESRSVWLFHKCFDSNCIWTAKDDGIYLKDNTTQVDVLIHQW
        W+E +S+WLF++CF SNCIWTAKDDGIYLKDN    D L+H W
Subjt:  WIESRSVWLFHKCFDSNCIWTAKDDGIYLKDNTTQVDVLIHQW

A0A6J1CPC6 S-protein homolog | 1.2e-39 | 62.79
Query:  VLSLAMLESARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHN-ISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAFWIESRSVWLFHKCF
        VL L    S  +A L KWHIHV+NGLS    L VHCKSKDDDLG HN ++ G EF WTF+VNFW TTL+WCYLKK NA  VSFE+FW+E   +WL ++C 
Subjt:  VLSLAMLESARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHN-ISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAFWIESRSVWLFHKCF

Query:  DSNCIWTAKDDGIYLKDNTTQVDVLIHQW
        D NCIWTAKDDGIYL++N   VD  IH+W
Subjt:  DSNCIWTAKDDGIYLKDNTTQVDVLIHQW

A0A6J1CPR8 S-protein homolog | 6.5e-49 | 63.97
Query:  KHFSLFLIVLSLAMLESARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAFWIESRSV
        KHF +FL V SLA++E   A  L KW IHV N LSN QML VHCKSK+DDLGEHN+SVG EF+W FRVN W+TTL+WCYL+K N Q VSF+AFW+E  S+
Subjt:  KHFSLFLIVLSLAMLESARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAFWIESRSV

Query:  WLFHKCFDSNCIWTAKDDGIYLKDNTTQVDVLIHQW
        WL++KC +SNC W AKDDGIYL++N    DV +H+W
Subjt:  WLFHKCFDSNCIWTAKDDGIYLKDNTTQVDVLIHQW

A0A6J1CQH6 S-protein homolog | 5.3e-51 | 61.81
Query:  MEMKSMKKHFSLFLIVLSLAMLESARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAF
        M  + +KKHF + L+ LSLA++E   + ELK+W+IHV+NGL NG++L VHCKS+DDDLGE N+  GAEF WTFRVN  +TTLFWC+L+K +AQ VSF+AF
Subjt:  MEMKSMKKHFSLFLIVLSLAMLESARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAF

Query:  WIESRSVWLFHKCFDSNCIWTAKDDGIYLKDNTTQVDVLIHQWK
        W+E  S+WLF++C+D+NCIWTAKDDG+YL+DN  Q DVL+H+W+
Subjt:  WIESRSVWLFHKCFDSNCIWTAKDDGIYLKDNTTQVDVLIHQWK

A0A6J1CRU0 S-protein homolog | 6.1e-55 | 66.21
Query:  MEMKSMKKHFSLFLIVLSLAMLESARA-AELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEA
        M ++S KKHF +FL+VLSL +LE   A +ELKKW IHV+NGLSNGQ L VHCKSKD+DLGEHN++ G EF+WTFRVN WNTTLFWCYL K + +  SF+ 
Subjt:  MEMKSMKKHFSLFLIVLSLAMLESARA-AELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEA

Query:  FWIESRSVWLFHKCFDSNCIWTAKDDGIYLKDNTTQVDVLIHQWK
        FW+E +S+WLF++C+ SNCIWTAKDDGIYL+DN  Q D+L+H+WK
Subjt:  FWIESRSVWLFHKCFDSNCIWTAKDDGIYLKDNTTQVDVLIHQWK

SwissProt top hits | E-value | % identity
F4JLS0 S-protein homolog 1 | 9.8e-26 | 43.97
Query:  ELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAFWIESRSVWLFHKCFDSNCIWTAKDDGIY
        ++ +W + V+NGL+ G+ L +HCKSK+DDLGE N+     F W F  N  ++T FWCY+ K N   ++   FW +   V LFH+C   NCIWTAK DG+Y
Subjt:  ELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAFWIESRSVWLFHKCFDSNCIWTAKDDGIY

Query:  LKDNTTQVDVLIHQWK
        L ++ +  DVL  +W+
Subjt:  LKDNTTQVDVLIHQWK

P0DN92 S-protein homolog 24 | 1.2e-12 | 33.33
Query:  FSLFLIVLSLAMLESARAAELK---KWHI-HVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAFWIESR
        F + ++V+SL   E+ +  E K   + H+  V     N  +L +HCKS+DDDLG H ++ G  F W F VNF  +TL++C   +   +   FE +    R
Subjt:  FSLFLIVLSLAMLESARAAELK---KWHI-HVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAFWIESR

Query:  SVWLFHKCFDSNCIWTAKDDGIYLKDNTTQVDVLIHQW
        +   F++C  +NC W A+ DGIY          L + W
Subjt:  SVWLFHKCFDSNCIWTAKDDGIYLKDNTTQVDVLIHQW

Q2HQ46 S-protein homolog 74 | 1.1e-24 | 42.24
Query:  ELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAFWIESRSVWLFHKCFDSNCIWTAKDDGIY
        ++ +W + V NGL+ G+ L +HCKSK++DLG+ N+     F W F  N  ++TLFWCY+ K +   ++ + FW +   V LFH+C   NC+WTAK+DG+Y
Subjt:  ELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAFWIESRSVWLFHKCFDSNCIWTAKDDGIY

Query:  LKDNTTQVDVLIHQWK
        L ++    DVL  +WK
Subjt:  LKDNTTQVDVLIHQWK

Q9FI84 S-protein homolog 27 | 1.0e-11 | 36.27
Query:  SNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAFWIESRSVWLFHKCFDSNCIWTAKDDGIYLKDNTTQVDVLIH
        +N  +L +HCKSKDDDLG H    G  + W F VNF N+TL++C   +       F+      R+   F++C   NC W AK D +Y   N  Q      
Subjt:  SNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAFWIESRSVWLFHKCFDSNCIWTAKDDGIYLKDNTTQVDVLIH

Query:  QW
        +W
Subjt:  QW

Q9LW22 S-protein homolog 21 | 2.1e-12 | 37.6
Query:  KHFSLFLIVLSLAMLESARAAELKKWHIHVMNGLS--NGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKK--SNAQFVSFEAFWIE
        K+ S+FL V+ L M+        KK  I V N L+  N  +L VHCKSK++D+G   + +G    ++F+ NFW TT FWC L K     ++    A+   
Subjt:  KHFSLFLIVLSLAMLESARAAELKKWHIHVMNGLS--NGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKK--SNAQFVSFEAFWIE

Query:  SRSVWLFHKCFDSNCIWTAKDDGIY
         +++ LF K   S+  W A+DDGIY
Subjt:  SRSVWLFHKCFDSNCIWTAKDDGIY

Arabidopsis top hits | E-value | % identity
AT3G26880.1 Plant self-incompatibility protein S1 family | 1.5e-13 | 37.6
Query:  KHFSLFLIVLSLAMLESARAAELKKWHIHVMNGLS--NGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKK--SNAQFVSFEAFWIE
        K+ S+FL V+ L M+        KK  I V N L+  N  +L VHCKSK++D+G   + +G    ++F+ NFW TT FWC L K     ++    A+   
Subjt:  KHFSLFLIVLSLAMLESARAAELKKWHIHVMNGLS--NGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKK--SNAQFVSFEAFWIE

Query:  SRSVWLFHKCFDSNCIWTAKDDGIY
         +++ LF K   S+  W A+DDGIY
Subjt:  SRSVWLFHKCFDSNCIWTAKDDGIY

AT4G16295.1 S-protein homologue 1 | 6.9e-27 | 43.97
Query:  ELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAFWIESRSVWLFHKCFDSNCIWTAKDDGIY
        ++ +W + V+NGL+ G+ L +HCKSK+DDLGE N+     F W F  N  ++T FWCY+ K N   ++   FW +   V LFH+C   NCIWTAK DG+Y
Subjt:  ELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAFWIESRSVWLFHKCFDSNCIWTAKDDGIY

Query:  LKDNTTQVDVLIHQWK
        L ++ +  DVL  +W+
Subjt:  LKDNTTQVDVLIHQWK

AT4G29035.1 Plant self-incompatibility protein S1 family | 7.7e-26 | 42.24
Query:  ELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAFWIESRSVWLFHKCFDSNCIWTAKDDGIY
        ++ +W + V NGL+ G+ L +HCKSK++DLG+ N+     F W F  N  ++TLFWCY+ K +   ++ + FW +   V LFH+C   NC+WTAK+DG+Y
Subjt:  ELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAFWIESRSVWLFHKCFDSNCIWTAKDDGIY

Query:  LKDNTTQVDVLIHQWK
        L ++    DVL  +WK
Subjt:  LKDNTTQVDVLIHQWK

AT5G04350.1 Plant self-incompatibility protein S1 family | 1.9e-16 | 33.33
Query:  LFLIVLSLAMLESARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLK-----KSNAQFVSFEAFWIESRS
        +F IV+ L +  S    E+ +  + + N L + ++L VHC+SKDDDLGEH + +G ++++TF  N W TT F C +      K +  FV++E  W     
Subjt:  LFLIVLSLAMLESARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLK-----KSNAQFVSFEAFWIESRS

Query:  VWLFHKCFDSNCIWTAKDDGIYLKDN
             K  +++C W  ++DGIY   +
Subjt:  VWLFHKCFDSNCIWTAKDDGIYLKDN

AT5G06020.1 Plant self-incompatibility protein S1 family | 7.4e-13 | 36.27
Query:  SNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAFWIESRSVWLFHKCFDSNCIWTAKDDGIYLKDNTTQVDVLIH
        +N  +L +HCKSKDDDLG H    G  + W F VNF N+TL++C   +       F+      R+   F++C   NC W AK D +Y   N  Q      
Subjt:  SNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAFWIESRSVWLFHKCFDSNCIWTAKDDGIYLKDNTTQVDVLIH

Query:  QW
        +W
Subjt:  QW
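The % identity values in these tables can be recomputed from the alignment rows. The sketch below uses the AT5G06020.1 alignment directly above and assumes the usual BLAST convention for the middle row: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. It also assumes identity is reported over all aligned columns. The function name `percent_identity` is illustrative only.

```python
from typing import Sequence


def percent_identity(query_rows: Sequence[str], midline_rows: Sequence[str]) -> float:
    """Identical columns (letters in the midline) over total aligned columns."""
    aligned_columns = sum(len(row) for row in query_rows)
    identical = sum(ch.isalpha() for row in midline_rows for ch in row)
    return 100.0 * identical / aligned_columns


# Query and midline rows copied from the AT5G06020.1 alignment.
query_rows = [
    "SNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAF"
    "WIESRSVWLFHKCFDSNCIWTAKDDGIYLKDNTTQVDVLIH",
    "QW",
]
midline_rows = [
    "+N  +L +HCKSKDDDLG H    G  + W F VNF N+TL++C   +       F+ "
    "     R+   F++C   NC W AK D +Y   N  Q      ",
    "+W",
]

identity = percent_identity(query_rows, midline_rows)
print(round(identity, 2))
```

Under these assumptions the result rounds to 36.27, the value listed for this hit.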


Sequences
CDS sequence
ATGGAAATGAAATCTATGAAAAAGCACTTTTCGCTTTTCTTGATTGTCTTGTCATTGGCAATGCTTGAGTCAGCAAGGGCTGCCGAGCTGAAAAAATGGCACATCCACGT
TATGAATGGGCTAAGCAACGGCCAAATGTTGTTGGTGCACTGCAAGTCAAAGGACGATGATCTAGGCGAACACAATATTAGCGTTGGAGCTGAATTCGATTGGACTTTTA
GAGTAAACTTTTGGAATACAACGTTGTTTTGGTGTTACTTGAAAAAGTCGAATGCTCAATTTGTTTCATTTGAAGCTTTTTGGATTGAGAGCAGATCTGTTTGGTTGTTT
CATAAATGCTTTGATTCTAATTGCATTTGGACAGCAAAAGATGATGGAATCTATTTGAAAGACAACACGACTCAAGTAGATGTTTTGATTCATCAATGGAAATAG
mRNA sequence
ATGGAAATGAAATCTATGAAAAAGCACTTTTCGCTTTTCTTGATTGTCTTGTCATTGGCAATGCTTGAGTCAGCAAGGGCTGCCGAGCTGAAAAAATGGCACATCCACGT
TATGAATGGGCTAAGCAACGGCCAAATGTTGTTGGTGCACTGCAAGTCAAAGGACGATGATCTAGGCGAACACAATATTAGCGTTGGAGCTGAATTCGATTGGACTTTTA
GAGTAAACTTTTGGAATACAACGTTGTTTTGGTGTTACTTGAAAAAGTCGAATGCTCAATTTGTTTCATTTGAAGCTTTTTGGATTGAGAGCAGATCTGTTTGGTTGTTT
CATAAATGCTTTGATTCTAATTGCATTTGGACAGCAAAAGATGATGGAATCTATTTGAAAGACAACACGACTCAAGTAGATGTTTTGATTCATCAATGGAAATAG
Protein sequence
MEMKSMKKHFSLFLIVLSLAMLESARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAFWIESRSVWLF
HKCFDSNCIWTAKDDGIYLKDNTTQVDVLIHQWK
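As a consistency check, the CDS can be translated and compared with the protein sequence. This is a minimal sketch assuming the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and do not come from the database.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# CDS and protein copied from this record.
CDS = (
    "ATGGAAATGAAATCTATGAAAAAGCACTTTTCGCTTTTCTTGATTGTCTTGTCATTGGCAATGCTTGAGTCAGCAAGGGCTGCCGAGCTGAAAAAATGGCACATCCACGT"
    "TATGAATGGGCTAAGCAACGGCCAAATGTTGTTGGTGCACTGCAAGTCAAAGGACGATGATCTAGGCGAACACAATATTAGCGTTGGAGCTGAATTCGATTGGACTTTTA"
    "GAGTAAACTTTTGGAATACAACGTTGTTTTGGTGTTACTTGAAAAAGTCGAATGCTCAATTTGTTTCATTTGAAGCTTTTTGGATTGAGAGCAGATCTGTTTGGTTGTTT"
    "CATAAATGCTTTGATTCTAATTGCATTTGGACAGCAAAAGATGATGGAATCTATTTGAAAGACAACACGACTCAAGTAGATGTTTTGATTCATCAATGGAAATAG"
)
PROTEIN = (
    "MEMKSMKKHFSLFLIVLSLAMLESARAAELKKWHIHVMNGLSNGQMLLVHCKSKDDDLGEHNISVGAEFDWTFRVNFWNTTLFWCYLKKSNAQFVSFEAFWIESRSVWLF"
    "HKCFDSNCIWTAKDDGIYLKDNTTQVDVLIHQWK"
)

translated = translate(CDS)
# The final codon (TAG) is a stop, so drop the trailing '*' before comparing.
print(translated[:-1] == PROTEIN)
```

The 435-nt CDS encodes 144 residues followed by a TAG stop codon, which agrees with the protein sequence listed above.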