; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007087 (gene) of Snake gourd v1 genome

Gene IDTan0007087
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionS-protein homolog
Genome locationLG06:17084707..17085162
RNA-Seq ExpressionTan0007087
SyntenyTan0007087
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR010264 - Plant self-incompatibility S1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0067970.1 uncharacterized protein E6C27_scaffold138G001560 [Cucumis melo var. makuwa]4.3e-5268.28Show/hide
Query:  YEEKKHLAIVLLLVLAALAVVQPSTAVPLPLPTWRIHVVNGLSNETLLAHCKSKDDDLGNQTLAAKGDEYHWTFK--------------KPNFDVSFESF
        YEEK +L +V+LLVL A+AVVQP TAVPLPLP WRIHVVNGL+NETLL HCKSKDDDLG   L +KG+E+ WTFK              KPNF VSFESF
Subjt:  YEEKKHLAIVLLLVLAALAVVQPSTAVPLPLPTWRIHVVNGLSNETLLAHCKSKDDDLGNQTLAAKGDEYHWTFK--------------KPNFDVSFESF

Query:  WVEKTHPWLNSRCYDKDCFWIAKDDGIYLRNNPANVDELVHQWNK
        WVEK+HPWLNSRC+  DC WIAKDD +YLRNN  NVDE +H+WNK
Subjt:  WVEKTHPWLNSRCYDKDCFWIAKDDGIYLRNNPANVDELVHQWNK

KAE8646232.1 hypothetical protein Csa_016625, partial [Cucumis sativus]2.6e-4167.91Show/hide
Query:  HLAIV-LLLVLAALAVVQPSTAVPLPLPTWRIHVVNGLSNETLLAHCKSKDDDLGNQTLAAKGDEYHWTFK--------------KPNFDVSFESFWVEK
        HLA+V LL+VL A+ VVQP  AVP+P P W IHVVNGLSNETLL HCKS DDDLG Q L  +G E+HWTF+              KPNF VSFESFWVEK
Subjt:  HLAIV-LLLVLAALAVVQPSTAVPLPLPTWRIHVVNGLSNETLLAHCKSKDDDLGNQTLAAKGDEYHWTFK--------------KPNFDVSFESFWVEK

Query:  THPWLNSRCYDKDCFWIAKDDGIYLRNNPANVDE
         H WLNSRCYDK+C WIAKDDGIYLRNNP N+DE
Subjt:  THPWLNSRCYDKDCFWIAKDDGIYLRNNPANVDE

XP_004139722.1 S-protein homolog 1-like [Cucumis sativus]3.9e-4568.09Show/hide
Query:  HLAIV-LLLVLAALAVVQPSTAVPLPLPTWRIHVVNGLSNETLLAHCKSKDDDLGNQTLAAKGDEYHWTFK--------------KPNFDVSFESFWVEK
        HLA+V LL+VL A+ VVQP  AVP+P P W IHVVNGLSNETLL HCKS DDDLG Q L  +G E+HWTF+              KPNF VSFESFWVEK
Subjt:  HLAIV-LLLVLAALAVVQPSTAVPLPLPTWRIHVVNGLSNETLLAHCKSKDDDLGNQTLAAKGDEYHWTFK--------------KPNFDVSFESFWVEK

Query:  THPWLNSRCYDKDCFWIAKDDGIYLRNNPANVDELVHQWNK
         H WLNSRCYDK+C WIAKDDGIYLRNNP N+DE VH WNK
Subjt:  THPWLNSRCYDKDCFWIAKDDGIYLRNNPANVDELVHQWNK

XP_008461539.1 PREDICTED: uncharacterized protein LOC103500111 [Cucumis melo]6.1e-5168.97Show/hide
Query:  YEEKKHLAIVLLLVLAALAVVQPSTAVPLPLPTWRIHVVNGLSNETLLAHCKSKDDDLGNQTLAAKGDEYHWTF--------------KKPNFDVSFESF
        YE+   LA+ LLLVL A+ +VQPSTAVPLPLP W IHVVNGL N+TL  HCKSKDDDLGN TL+ KG E  WTF              KKPNF V+FESF
Subjt:  YEEKKHLAIVLLLVLAALAVVQPSTAVPLPLPTWRIHVVNGLSNETLLAHCKSKDDDLGNQTLAAKGDEYHWTF--------------KKPNFDVSFESF

Query:  WVEKTHPWLNSRCYDKDCFWIAKDDGIYLRNNPANVDELVHQWNK
        WVEKTHPWL SRC+DK+C WIAKDDGIYLRNN  NVDELVH WNK
Subjt:  WVEKTHPWLNSRCYDKDCFWIAKDDGIYLRNNPANVDELVHQWNK

XP_008462114.1 PREDICTED: uncharacterized protein LOC103500542 [Cucumis melo]4.1e-4769.29Show/hide
Query:  HLAIVLLLVLAALAVVQPSTAVPLPLPTWRIHVVNGLSNETLLAHCKSKDDDLGNQTLAAKGDEYHWTFK--------------KPNFDVSFESFWVEKT
        HLA+V LLVL A+ VVQP TAVP+P P W IHVVNGLSNETLL HCKS+DDDLG Q L  KG E+HWTF+              KPNF VSFESFWVEK 
Subjt:  HLAIVLLLVLAALAVVQPSTAVPLPLPTWRIHVVNGLSNETLLAHCKSKDDDLGNQTLAAKGDEYHWTFK--------------KPNFDVSFESFWVEKT

Query:  HPWLNSRCYDKDCFWIAKDDGIYLRNNPANVDELVHQWNK
        H WLNSRCYDK+C WIAKDDGIYLRNNP N++E VH WNK
Subjt:  HPWLNSRCYDKDCFWIAKDDGIYLRNNPANVDELVHQWNK

TrEMBL top hitse value%identityAlignment
A0A1S3CEQ0 S-protein homolog3.0e-5168.97Show/hide
Query:  YEEKKHLAIVLLLVLAALAVVQPSTAVPLPLPTWRIHVVNGLSNETLLAHCKSKDDDLGNQTLAAKGDEYHWTF--------------KKPNFDVSFESF
        YE+   LA+ LLLVL A+ +VQPSTAVPLPLP W IHVVNGL N+TL  HCKSKDDDLGN TL+ KG E  WTF              KKPNF V+FESF
Subjt:  YEEKKHLAIVLLLVLAALAVVQPSTAVPLPLPTWRIHVVNGLSNETLLAHCKSKDDDLGNQTLAAKGDEYHWTF--------------KKPNFDVSFESF

Query:  WVEKTHPWLNSRCYDKDCFWIAKDDGIYLRNNPANVDELVHQWNK
        WVEKTHPWL SRC+DK+C WIAKDDGIYLRNN  NVDELVH WNK
Subjt:  WVEKTHPWLNSRCYDKDCFWIAKDDGIYLRNNPANVDELVHQWNK

A0A1S3CG89 S-protein homolog2.0e-4769.29Show/hide
Query:  HLAIVLLLVLAALAVVQPSTAVPLPLPTWRIHVVNGLSNETLLAHCKSKDDDLGNQTLAAKGDEYHWTFK--------------KPNFDVSFESFWVEKT
        HLA+V LLVL A+ VVQP TAVP+P P W IHVVNGLSNETLL HCKS+DDDLG Q L  KG E+HWTF+              KPNF VSFESFWVEK 
Subjt:  HLAIVLLLVLAALAVVQPSTAVPLPLPTWRIHVVNGLSNETLLAHCKSKDDDLGNQTLAAKGDEYHWTFK--------------KPNFDVSFESFWVEKT

Query:  HPWLNSRCYDKDCFWIAKDDGIYLRNNPANVDELVHQWNK
        H WLNSRCYDK+C WIAKDDGIYLRNNP N++E VH WNK
Subjt:  HPWLNSRCYDKDCFWIAKDDGIYLRNNPANVDELVHQWNK

A0A5A7VL75 S-protein homolog2.1e-5268.28Show/hide
Query:  YEEKKHLAIVLLLVLAALAVVQPSTAVPLPLPTWRIHVVNGLSNETLLAHCKSKDDDLGNQTLAAKGDEYHWTFK--------------KPNFDVSFESF
        YEEK +L +V+LLVL A+AVVQP TAVPLPLP WRIHVVNGL+NETLL HCKSKDDDLG   L +KG+E+ WTFK              KPNF VSFESF
Subjt:  YEEKKHLAIVLLLVLAALAVVQPSTAVPLPLPTWRIHVVNGLSNETLLAHCKSKDDDLGNQTLAAKGDEYHWTFK--------------KPNFDVSFESF

Query:  WVEKTHPWLNSRCYDKDCFWIAKDDGIYLRNNPANVDELVHQWNK
        WVEK+HPWLNSRC+  DC WIAKDD +YLRNN  NVDE +H+WNK
Subjt:  WVEKTHPWLNSRCYDKDCFWIAKDDGIYLRNNPANVDELVHQWNK

A0A6J1CPC6 S-protein homolog2.9e-3854.67Show/hide
Query:  YEEKKHLAIVLLLVLAALAVVQPSTAVPLP---LPTWRIHVVNGLSNETLLAHCKSKDDDLGNQTLAAKGDEYHWTF--------------KKPNFDVSF
        Y + + + +V+ LV    AV+Q  TA  L    LP W IHVVNGLS  TL  HCKSKDDDLG   L  +GDE+ WTF              KKPN DVSF
Subjt:  YEEKKHLAIVLLLVLAALAVVQPSTAVPLP---LPTWRIHVVNGLSNETLLAHCKSKDDDLGNQTLAAKGDEYHWTF--------------KKPNFDVSF

Query:  ESFWVEKTHPWLNSRCYDKDCFWIAKDDGIYLRNNPANVDELVHQWNKRT
        ESFWVE+TH WL  RC DK+C W AKDDGIYLRNNP  VDE +H+W   T
Subjt:  ESFWVEKTHPWLNSRCYDKDCFWIAKDDGIYLRNNPANVDELVHQWNKRT

A0A6J1L0E8 S-protein homolog6.0e-3652.67Show/hide
Query:  MGATYEEKKHL-AIVLLLVLAALAVVQPSTAV--PLPLPTWRIHVVNGLSNETLLAHCKSKDDDLGNQTLAAKGDEYHWTFK--------------KPNF
        M +TY +K+HL A   LL L A A+ QP       +P+  WR+HVVN L+N TL  HCKSKDDDLG   L   G E+ W+FK              KPN 
Subjt:  MGATYEEKKHL-AIVLLLVLAALAVVQPSTAV--PLPLPTWRIHVVNGLSNETLLAHCKSKDDDLGNQTLAAKGDEYHWTFK--------------KPNF

Query:  DVSFESFWVEKTHPWLNSRCYDKDCFWIAKDDGIYLRNNPANVDELVHQW
         VSFE+FW+EKTH WLN RCY ++C W AKDDG+YLRNNP  VDE VH+W
Subjt:  DVSFESFWVEKTHPWLNSRCYDKDCFWIAKDDGIYLRNNPANVDELVHQW

SwissProt top hitse value%identityAlignment
F4JLQ5 S-protein homolog 27.4e-0727.1Show/hide
Query:  KKHLAIVLLLVLAALAVVQPSTAVPLPLPT-------------WRIHVVNGLSNE-TLLAHCKSKDDDLGNQTLAAKGDEYHWTFKKPNF--DVSFESF-
        K++L++ +L++     + Q      +P+P                + + N L N+ TLL HCKSKDDDLGN+TL   G+ + ++F +  F   + F SF 
Subjt:  KKHLAIVLLLVLAALAVVQPSTAVPLPLPT-------------WRIHVVNGLSNE-TLLAHCKSKDDDLGNQTLAAKGDEYHWTFKKPNF--DVSFESF-

Query:  WVEKTHPW----------LNSRCYDKDCFWIAKDDGIYLRNNPANVDELVHQWNK
        W  ++H +           +++C    C W  + +G    N+     +L + WNK
Subjt:  WVEKTHPW----------LNSRCYDKDCFWIAKDDGIYLRNNPANVDELVHQWNK

F4JLS0 S-protein homolog 14.5e-1236.11Show/hide
Query:  WRIHVVNGL-SNETLLAHCKSKDDDLGNQTLAAKGDEYHWTFKKPNFDVSFESFWVEKTHPWLN-----------SRCYDKDCFWIAKDDGIYLRNNPAN
        W++ VVNGL + ETL  HCKSK+DDLG   L  + + + W F +     +F   ++ K +  +N            RC  K+C W AK DG+YL N+ + 
Subjt:  WRIHVVNGL-SNETLLAHCKSKDDDLGNQTLAAKGDEYHWTFKKPNFDVSFESFWVEKTHPWLN-----------SRCYDKDCFWIAKDDGIYLRNNPAN

Query:  VDELVHQW
         D L  +W
Subjt:  VDELVHQW

Q2HQ46 S-protein homolog 741.6e-0932.87Show/hide
Query:  LAIVLLLVLAAL--AVVQPSTAVPLPLP---TWRIHVVNGL-SNETLLAHCKSKDDDLGNQTLAAKGDEYHWTF--------------KKPNFDVSFESF
        LAI   LVL      + + +T   + +P    W++ V NGL + ETL  HCKSK++DLG+  L    D + W F               K +  ++ + F
Subjt:  LAIVLLLVLAAL--AVVQPSTAVPLPLP---TWRIHVVNGL-SNETLLAHCKSKDDDLGNQTLAAKGDEYHWTF--------------KKPNFDVSFESF

Query:  WVEKTHPWLNSRCYDKDCFWIAKDDGIYLRNNPANVDELVHQW
        W +     L  RC  K+C W AK+DG+YL N+    D L  +W
Subjt:  WVEKTHPWLNSRCYDKDCFWIAKDDGIYLRNNPANVDELVHQW

Arabidopsis top hitse value%identityAlignment
AT4G10895.1 Plant self-incompatibility protein S1 family7.1e-0534.92Show/hide
Query:  FKKPNF--DVSFESFWVEKTHPWLNSRCYDKD---CFWIAKDDGIYLRNNPANVDELVHQWNK
        +K P+F   VSF++F  +++  +++  C       CFW  +DDG++ RNNP    +L+++WNK
Subjt:  FKKPNF--DVSFESFWVEKTHPWLNSRCYDKD---CFWIAKDDGIYLRNNPANVDELVHQWNK

AT4G16195.1 Plant self-incompatibility protein S1 family5.2e-0827.1Show/hide
Query:  KKHLAIVLLLVLAALAVVQPSTAVPLPLPT-------------WRIHVVNGLSNE-TLLAHCKSKDDDLGNQTLAAKGDEYHWTFKKPNF--DVSFESF-
        K++L++ +L++     + Q      +P+P                + + N L N+ TLL HCKSKDDDLGN+TL   G+ + ++F +  F   + F SF 
Subjt:  KKHLAIVLLLVLAALAVVQPSTAVPLPLPT-------------WRIHVVNGLSNE-TLLAHCKSKDDDLGNQTLAAKGDEYHWTFKKPNF--DVSFESF-

Query:  WVEKTHPW----------LNSRCYDKDCFWIAKDDGIYLRNNPANVDELVHQWNK
        W  ++H +           +++C    C W  + +G    N+     +L + WNK
Subjt:  WVEKTHPW----------LNSRCYDKDCFWIAKDDGIYLRNNPANVDELVHQWNK

AT4G16295.1 S-protein homologue 13.2e-1336.11Show/hide
Query:  WRIHVVNGL-SNETLLAHCKSKDDDLGNQTLAAKGDEYHWTFKKPNFDVSFESFWVEKTHPWLN-----------SRCYDKDCFWIAKDDGIYLRNNPAN
        W++ VVNGL + ETL  HCKSK+DDLG   L  + + + W F +     +F   ++ K +  +N            RC  K+C W AK DG+YL N+ + 
Subjt:  WRIHVVNGL-SNETLLAHCKSKDDDLGNQTLAAKGDEYHWTFKKPNFDVSFESFWVEKTHPWLN-----------SRCYDKDCFWIAKDDGIYLRNNPAN

Query:  VDELVHQW
         D L  +W
Subjt:  VDELVHQW

AT4G29035.1 Plant self-incompatibility protein S1 family1.1e-1032.87Show/hide
Query:  LAIVLLLVLAAL--AVVQPSTAVPLPLP---TWRIHVVNGL-SNETLLAHCKSKDDDLGNQTLAAKGDEYHWTF--------------KKPNFDVSFESF
        LAI   LVL      + + +T   + +P    W++ V NGL + ETL  HCKSK++DLG+  L    D + W F               K +  ++ + F
Subjt:  LAIVLLLVLAAL--AVVQPSTAVPLPLP---TWRIHVVNGL-SNETLLAHCKSKDDDLGNQTLAAKGDEYHWTF--------------KKPNFDVSFESF

Query:  WVEKTHPWLNSRCYDKDCFWIAKDDGIYLRNNPANVDELVHQW
        W +     L  RC  K+C W AK+DG+YL N+    D L  +W
Subjt:  WVEKTHPWLNSRCYDKDCFWIAKDDGIYLRNNPANVDELVHQW

AT5G04350.1 Plant self-incompatibility protein S1 family6.4e-0631.68Show/hide
Query:  RIHVVNGLSNETLL-AHCKSKDDDLGNQTLAAKGDEYHWTF---------------KKPNFD-----VSFESFWVEKTHPWLNSRCYDKDCFWIAKDDGI
        ++ + N L +  LL  HC+SKDDDLG   L   G +Y +TF               + PNF      V++E+ W         S+  +  C WI ++DGI
Subjt:  RIHVVNGLSNETLL-AHCKSKDDDLGNQTLAAKGDEYHWTF---------------KKPNFD-----VSFESFWVEKTHPWLNSRCYDKDCFWIAKDDGI

Query:  Y
        Y
Subjt:  Y


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGCAACGTACGAGGAAAAGAAGCATTTGGCAATTGTTCTCTTACTTGTCTTGGCGGCCCTGGCGGTGGTTCAGCCGTCCACGGCGGTTCCACTGCCACTCCCAAC
GTGGCGCATTCATGTGGTCAACGGGCTGAGCAACGAAACCCTATTGGCTCATTGTAAGTCAAAAGATGATGATTTGGGCAACCAAACTTTGGCTGCCAAAGGGGATGAAT
ATCATTGGACTTTTAAGAAGCCAAATTTTGATGTGTCGTTTGAATCGTTTTGGGTTGAAAAAACTCACCCTTGGCTCAATTCTAGATGCTATGATAAAGATTGCTTTTGG
ATTGCTAAAGATGATGGGATTTACTTGAGAAACAATCCTGCTAATGTTGATGAACTTGTACATCAGTGGAACAAACGTACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGCAACGTACGAGGAAAAGAAGCATTTGGCAATTGTTCTCTTACTTGTCTTGGCGGCCCTGGCGGTGGTTCAGCCGTCCACGGCGGTTCCACTGCCACTCCCAAC
GTGGCGCATTCATGTGGTCAACGGGCTGAGCAACGAAACCCTATTGGCTCATTGTAAGTCAAAAGATGATGATTTGGGCAACCAAACTTTGGCTGCCAAAGGGGATGAAT
ATCATTGGACTTTTAAGAAGCCAAATTTTGATGTGTCGTTTGAATCGTTTTGGGTTGAAAAAACTCACCCTTGGCTCAATTCTAGATGCTATGATAAAGATTGCTTTTGG
ATTGCTAAAGATGATGGGATTTACTTGAGAAACAATCCTGCTAATGTTGATGAACTTGTACATCAGTGGAACAAACGTACTTGA
Protein sequenceShow/hide protein sequence
MGATYEEKKHLAIVLLLVLAALAVVQPSTAVPLPLPTWRIHVVNGLSNETLLAHCKSKDDDLGNQTLAAKGDEYHWTFKKPNFDVSFESFWVEKTHPWLNSRCYDKDCFW
IAKDDGIYLRNNPANVDELVHQWNKRT