; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014151 (gene) of Snake gourd v1 genome

Gene IDTan0014151
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptioncysteine-rich and transmembrane domain-containing protein WIH1-like
Genome locationLG05:44890678..44894501
RNA-Seq ExpressionTan0014151
SyntenyTan0014151
Gene Ontology termsGO:0005886 - plasma membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR028144 - Cysteine-rich transmembrane CYSTM domain
IPR044850 - Cysteine-rich and transmembrane domain-containing protein WIH1/2-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605218.1 hypothetical protein SDJN03_02535, partial [Cucurbita argyrosperma subsp. sororia]5.5e-3489.66Show/hide
Query:  MSYYDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYPAAQGYPPPPPPG-YAPQYAP-AQQPPPKHETGCLQGCLAALCCCCLLDACF
        MSYYDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYP AQGYPPPPPP  YAP YAP  QQPPPK +TGCL+GCLAALCCCCLLDACF
Subjt:  MSYYDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYPAAQGYPPPPPPG-YAPQYAP-AQQPPPKHETGCLQGCLAALCCCCLLDACF

XP_008457737.2 PREDICTED: cysteine-rich and transmembrane domain-containing protein A-like [Cucumis melo]3.6e-3388.64Show/hide
Query:  MSYYDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYPAAQGY--PPPPPPGYAPQYAPAQQPPPKHE-TGCLQGCLAALCCCCLLDACF
        MSYYDHQQPPVGAPPPQGFPPKD+YPPPGYPVQGYPAAQGY  PPPPPPGYAPQY+   QPPPK+E TGCL+GCLAALCCCCLLDACF
Subjt:  MSYYDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYPAAQGY--PPPPPPGYAPQYAPAQQPPPKHE-TGCLQGCLAALCCCCLLDACF

XP_011649307.1 cysteine-rich and transmembrane domain-containing protein WIH1 [Cucumis sativus]1.6e-3389.77Show/hide
Query:  MSYYDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYPAAQGY--PPPPPPGYAPQYAPAQQPPPKHE-TGCLQGCLAALCCCCLLDACF
        MSYYDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYPAAQGY  PPPPPPGYAPQY+   QPPPK+E TGCL+GCLAALCCCCLLDACF
Subjt:  MSYYDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYPAAQGY--PPPPPPGYAPQYAPAQQPPPKHE-TGCLQGCLAALCCCCLLDACF

XP_022947085.1 cysteine-rich and transmembrane domain-containing protein WIH1-like isoform X2 [Cucurbita moschata]1.1e-3490.8Show/hide
Query:  MSYYDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYPAAQGYPPPPPPG-YAPQYAP-AQQPPPKHETGCLQGCLAALCCCCLLDACF
        MSYYDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYP AQGYPPPPPP  YAPQYAP  QQPPPK +TGCL+GCLAALCCCCLLDACF
Subjt:  MSYYDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYPAAQGYPPPPPPG-YAPQYAP-AQQPPPKHETGCLQGCLAALCCCCLLDACF

XP_038901964.1 cysteine-rich and transmembrane domain-containing protein WIH1-like [Benincasa hispida]8.0e-3388.76Show/hide
Query:  MSYYDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYPAAQGY---PPPPPPGYAPQYAPAQQPPPKHE-TGCLQGCLAALCCCCLLDACF
        MSYYDHQQPPVGAPPPQGFP KDAYPPPGYPVQGYPAAQGY   PPPPPPGYAPQYA   QPPPK+E TGCL+GCLAALCCCCLLDACF
Subjt:  MSYYDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYPAAQGY---PPPPPPGYAPQYAPAQQPPPKHE-TGCLQGCLAALCCCCLLDACF

TrEMBL top hitse value%identityAlignment
A0A0A0LJA6 CYSTM domain-containing protein7.8e-3489.77Show/hide
Query:  MSYYDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYPAAQGY--PPPPPPGYAPQYAPAQQPPPKHE-TGCLQGCLAALCCCCLLDACF
        MSYYDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYPAAQGY  PPPPPPGYAPQY+   QPPPK+E TGCL+GCLAALCCCCLLDACF
Subjt:  MSYYDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYPAAQGY--PPPPPPGYAPQYAPAQQPPPKHE-TGCLQGCLAALCCCCLLDACF

A0A1S3C5S7 cysteine-rich and transmembrane domain-containing protein A-like1.7e-3388.64Show/hide
Query:  MSYYDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYPAAQGY--PPPPPPGYAPQYAPAQQPPPKHE-TGCLQGCLAALCCCCLLDACF
        MSYYDHQQPPVGAPPPQGFPPKD+YPPPGYPVQGYPAAQGY  PPPPPPGYAPQY+   QPPPK+E TGCL+GCLAALCCCCLLDACF
Subjt:  MSYYDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYPAAQGY--PPPPPPGYAPQYAPAQQPPPKHE-TGCLQGCLAALCCCCLLDACF

A0A5N5HK71 Rhodopsin2.7e-2677.53Show/hide
Query:  MSYYDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYPAAQGYPP---PPPPGYAPQYAPAQQPPPKHET-GCLQGCLAALCCCCLLDACF
        MSYY+ QQ PVG PPPQG+PPKDAYPP GYP QGYP  QGYPP   PP  GYAPQY  AQQPPP+ E+ GCLQGCLAALCCCCLLDACF
Subjt:  MSYYDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYPAAQGYPP---PPPPGYAPQYAPAQQPPPKHET-GCLQGCLAALCCCCLLDACF

A0A6J1G5U7 cysteine-rich and transmembrane domain-containing protein WIH1-like isoform X25.4e-3590.8Show/hide
Query:  MSYYDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYPAAQGYPPPPPPG-YAPQYAP-AQQPPPKHETGCLQGCLAALCCCCLLDACF
        MSYYDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYP AQGYPPPPPP  YAPQYAP  QQPPPK +TGCL+GCLAALCCCCLLDACF
Subjt:  MSYYDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYPAAQGYPPPPPPG-YAPQYAP-AQQPPPKHETGCLQGCLAALCCCCLLDACF

A0A6J1L666 cysteine-rich and transmembrane domain-containing protein WIH1-like1.1e-3287.36Show/hide
Query:  MSYYDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYPAAQGYPPPPPPG-YAPQYAP-AQQPPPKHETGCLQGCLAALCCCCLLDACF
        MSYYDHQQPPVGAPPPQG PPKDAYPPPGYPVQGYP AQ YPPPPPP  YAPQYAP  QQP PK +TGCL+GCLAALCCCCLLDACF
Subjt:  MSYYDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYPAAQGYPPPPPPG-YAPQYAP-AQQPPPKHETGCLQGCLAALCCCCLLDACF

SwissProt top hitse value%identityAlignment
O16005 Rhodopsin8.6e-0659.68Show/hide
Query:  PPVGAPPPQGFPPKDA---YPPPGY--PVQGYPAAQGYPP---PPPPGYAPQYAPAQQPPPK
        PP G  PPQG+PP  A   YPP GY  P QGYP AQGYPP   PPP G  PQ AP Q  PP+
Subjt:  PPVGAPPPQGFPPKDA---YPPPGY--PVQGYPAAQGYPP---PPPPGYAPQYAPAQQPPPK

P31356 Rhodopsin5.6e-0566.04Show/hide
Query:  PPVG-APPPQGFPPKDAYPPPGYPVQGYPAAQGYPPPPPPGYAPQYAPAQQPP
        PP G APPPQG+PP+  YPP GYP QGYP  QGY PPPP G  PQ AP   PP
Subjt:  PPVG-APPPQGFPPKDAYPPPGYPVQGYPAAQGYPPPPPPGYAPQYAPAQQPP

Q8LCL8 Cysteine-rich and transmembrane domain-containing protein B5.1e-1455.56Show/hide
Query:  QQPP-VGAPPPQGF----PPKDAYPPPG--YPVQGYPAAQGYPPP--PPPGYAPQ-----------YAPAQQPPPKHETGCLQGCLAALCCCCLLDACF
        QQPP VG PP   +    PPKDAYPPPG  YP QGYP  QGYP    PP GY PQ           Y P QQ   KH  G L+GC+AALCC C+LDACF
Subjt:  QQPP-VGAPPPQGF----PPKDAYPPPG--YPVQGYPAAQGYPPP--PPPGYAPQ-----------YAPAQQPPPKHETGCLQGCLAALCCCCLLDACF

Q8S8M0 Cysteine-rich and transmembrane domain-containing protein WIH28.1e-2061.39Show/hide
Query:  MSYYDHQQPPVGAPPPQGFP----PKDAYPPPGYPVQGYPAAQGYPPP-------PPPGYAPQYAPAQQPPPKHE-----TGCLQGCLAALCCCCLLDAC
        MS Y+  QPPVG PPPQG+P    PKDAYPP GYP QGYP  QGYPP        P  GY P YAP   PPP+H+      G L+GCLAALCCCCLLDAC
Subjt:  MSYYDHQQPPVGAPPPQGFP----PKDAYPPPGYPVQGYPAAQGYPPP-------PPPGYAPQYAPAQQPPPKHE-----TGCLQGCLAALCCCCLLDAC

Query:  F
        F
Subjt:  F

Q9FJW3 Cysteine-rich and transmembrane domain-containing protein WIH12.2e-1762.07Show/hide
Query:  YDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYPA---AQGYPPP--PPPGYAPQYAPAQQPPPKHETGCLQGCLAALCCCCLLDACF
        Y  +Q PVGAPPPQG+PPKD YPP GYP  GYP    AQGYP    PPP Y      +Q P  K   G L+GCLAALCCCCLLDACF
Subjt:  YDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYPA---AQGYPPP--PPPGYAPQYAPAQQPPPKHETGCLQGCLAALCCCCLLDACF

Arabidopsis top hitse value%identityAlignment
AT2G41420.1 proline-rich family protein5.7e-2161.39Show/hide
Query:  MSYYDHQQPPVGAPPPQGFP----PKDAYPPPGYPVQGYPAAQGYPPP-------PPPGYAPQYAPAQQPPPKHE-----TGCLQGCLAALCCCCLLDAC
        MS Y+  QPPVG PPPQG+P    PKDAYPP GYP QGYP  QGYPP        P  GY P YAP   PPP+H+      G L+GCLAALCCCCLLDAC
Subjt:  MSYYDHQQPPVGAPPPQGFP----PKDAYPPPGYPVQGYPAAQGYPPP-------PPPGYAPQYAPAQQPPPKHE-----TGCLQGCLAALCCCCLLDAC

Query:  F
        F
Subjt:  F

AT3G49845.1 unknown protein6.8e-1445.67Show/hide
Query:  MSYYDHQQPPVGAPPPQGFPPKDAYPPPGYPV---------------------------QGYPAAQGYPPPP-PPGYAPQYAPAQQPPPKH---------
        MSY D Q  PV APPPQG+PPK+ YPP GYP                            QGYP AQGYPPP  P G+ PQY P Q PPP H         
Subjt:  MSYYDHQQPPVGAPPPQGFPPKDAYPPPGYPV---------------------------QGYPAAQGYPPPP-PPGYAPQYAPAQQPPPKH---------

Query:  -----ETGCLQGCLAALCCCCLLDACF
               G ++GCLA LCCC LL+ACF
Subjt:  -----ETGCLQGCLAALCCCCLLDACF

AT4G19200.1 proline-rich family protein2.6e-0549.25Show/hide
Query:  PPVGAPPPQGFPPKDAYPPPGYPVQGYPAAQGYPPPPPPGYAPQYAPAQQPPPKHETGCLQGCLAAL
        PP G PP QG+PP   YPP GYP   YPAA G  PP P GY P   PA   P  H +G   G L  +
Subjt:  PPVGAPPPQGFPPKDAYPPPGYPVQGYPAAQGYPPPPPPGYAPQYAPAQQPPPKHETGCLQGCLAAL

AT4G33660.1 unknown protein4.7e-0750.7Show/hide
Query:  PKDAYPPPGYPVQGYPAAQGYP--PPPPPGYAPQY-----APAQQPPPKHETGCLQGCLAALCCCCLLDAC
        PK AYP        YPA   YP  PPPP G  PQY      P   PPP  + G L+G LAALCCCCL+D C
Subjt:  PKDAYPPPGYPVQGYPAAQGYP--PPPPPGYAPQY-----APAQQPPPKHETGCLQGCLAALCCCCLLDAC

AT5G67600.1 unknown protein1.6e-1862.07Show/hide
Query:  YDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYPA---AQGYPPP--PPPGYAPQYAPAQQPPPKHETGCLQGCLAALCCCCLLDACF
        Y  +Q PVGAPPPQG+PPKD YPP GYP  GYP    AQGYP    PPP Y      +Q P  K   G L+GCLAALCCCCLLDACF
Subjt:  YDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYPA---AQGYPPP--PPPGYAPQYAPAQQPPPKHETGCLQGCLAALCCCCLLDACF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTACTACGACCACCAACAGCCGCCGGTTGGGGCTCCACCGCCGCAAGGATTCCCTCCGAAGGACGCTTACCCTCCGCCAGGGTACCCCGTTCAGGGATACCCTGC
GGCTCAAGGGTACCCCCCTCCTCCGCCGCCGGGGTATGCCCCACAGTACGCTCCAGCGCAGCAGCCTCCTCCCAAGCACGAAACTGGTTGCCTTCAAGGATGTTTAGCAG
CACTTTGCTGTTGTTGCCTTTTGGATGCTTGCTTCTGA
mRNA sequenceShow/hide mRNA sequence
CTTTATTCTATTCCCACTCTTTTCTCTAAACAATTTAAGCATTATTATTCACAATCTCTCTTCTTCCAATTGAACCAAACCCCCTCGCCGGAATCCTCCGATTATCATGA
GTTACTACGACCACCAACAGCCGCCGGTTGGGGCTCCACCGCCGCAAGGATTCCCTCCGAAGGACGCTTACCCTCCGCCAGGGTACCCCGTTCAGGGATACCCTGCGGCT
CAAGGGTACCCCCCTCCTCCGCCGCCGGGGTATGCCCCACAGTACGCTCCAGCGCAGCAGCCTCCTCCCAAGCACGAAACTGGTTGCCTTCAAGGATGTTTAGCAGCACT
TTGCTGTTGTTGCCTTTTGGATGCTTGCTTCTGATAAGGATGGTCCATGATTCTAATTTCATTGTAGTATGACAATCATCAAATCTTGTTATATTAATAATGCTTTTAGA
TAACATTCAAAATTATAGTGTGTTTGAATAATTTGTGAAAAAATGCCTTTGGAACAAAAATCAACTTTTCTACAAGTTTCTTTAAAAACTGATCATTTACATTA
Protein sequenceShow/hide protein sequence
MSYYDHQQPPVGAPPPQGFPPKDAYPPPGYPVQGYPAAQGYPPPPPPGYAPQYAPAQQPPPKHETGCLQGCLAALCCCCLLDACF