; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018621 (gene) of Snake gourd v1 genome

Gene IDTan0018621
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTransmembrane protein
Genome locationLG05:78917817..78918528
RNA-Seq ExpressionTan0018621
SyntenyTan0018621
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583852.1 hypothetical protein SDJN03_19784, partial [Cucurbita argyrosperma subsp. sororia]1.4e-5984.08Show/hide
Query:  MPIPSKIAALAAVYSSLALIFLAGASNARDLRPSEHGLEYQNPGATENSSPDMQSFFKGNSWSSSEVALPKAMNTSLPSQWWNRSRSHGNRSGAA----G
        M I S+IAA+AAVY S+A+IFLAG SNARDLRPSEHGLEYQNPGA ENSSPDMQSFFKGNSWSSS+VALPKAMNTSLPSQWWN  RSH +RSGAA    G
Subjt:  MPIPSKIAALAAVYSSLALIFLAGASNARDLRPSEHGLEYQNPGATENSSPDMQSFFKGNSWSSSEVALPKAMNTSLPSQWWNRSRSHGNRSGAA----G

Query:  GDHIRGALLVASLVCGIIGASLLIASAFIYIFKFRKQNIRSSSSLSVAETANIIVNK
        GDHIRGALLVASLVCG+IG SLLIASAFIY+FKFRKQN RSSSSLSV+ET NII NK
Subjt:  GDHIRGALLVASLVCGIIGASLLIASAFIYIFKFRKQNIRSSSSLSVAETANIIVNK

XP_022927447.1 uncharacterized protein LOC111434271 [Cucurbita moschata]1.8e-5983.44Show/hide
Query:  MPIPSKIAALAAVYSSLALIFLAGASNARDLRPSEHGLEYQNPGATENSSPDMQSFFKGNSWSSSEVALPKAMNTSLPSQWWNRSRSHGNRSGAA----G
        M I S+IAA+AA+Y S+A+IFLAG SNARDLRPSEHGLEYQNPGA ENSSPDMQSFFKGNSWSSS+VALPKAMNTSLPSQWWN  RSH +RSGAA    G
Subjt:  MPIPSKIAALAAVYSSLALIFLAGASNARDLRPSEHGLEYQNPGATENSSPDMQSFFKGNSWSSSEVALPKAMNTSLPSQWWNRSRSHGNRSGAA----G

Query:  GDHIRGALLVASLVCGIIGASLLIASAFIYIFKFRKQNIRSSSSLSVAETANIIVNK
        GDHIRGALLVASLVCG+IG SLLIASAFIY+FKFRKQN RSSSSLSV+ET NII NK
Subjt:  GDHIRGALLVASLVCGIIGASLLIASAFIYIFKFRKQNIRSSSSLSVAETANIIVNK

XP_023000949.1 uncharacterized protein LOC111495232 [Cucurbita maxima]1.3e-5783.23Show/hide
Query:  IPSKIAALAAVYSSLALIFLAGASNARDLRPSEHGLEYQNPGATENSSPDMQSFFKGNSWSSSEVALPKAMNTSLPSQWWNRSRSHGNRSGAA----GGD
        I S+IAA AA+Y SLA+IFLAG SNARDLRPSEHGLEYQNPGA ENSSPDMQSFFKGNSWSSS+VALPKAMNTSLPSQWWN  RSH +RS AA     GD
Subjt:  IPSKIAALAAVYSSLALIFLAGASNARDLRPSEHGLEYQNPGATENSSPDMQSFFKGNSWSSSEVALPKAMNTSLPSQWWNRSRSHGNRSGAA----GGD

Query:  HIRGALLVASLVCGIIGASLLIASAFIYIFKFRKQNIRSSSSLSVAETANIIVNK
        HIRGALLVASLVCG+IG SLLIASAFIY+FKFRKQN RSSSSLSV+ET NII NK
Subjt:  HIRGALLVASLVCGIIGASLLIASAFIYIFKFRKQNIRSSSSLSVAETANIIVNK

XP_023519911.1 uncharacterized protein LOC111783236 [Cucurbita pepo subsp. pepo]8.1e-6084.08Show/hide
Query:  MPIPSKIAALAAVYSSLALIFLAGASNARDLRPSEHGLEYQNPGATENSSPDMQSFFKGNSWSSSEVALPKAMNTSLPSQWWNRSRSHGNRSGAA----G
        M I S+IAA+AA+Y SLA+IFLAG SNARDLRPSEHGLEYQNPGA ENSSPDMQSFFKGNSWSSS+VALPKAMNTSLPSQWWN  RSH +RSGAA    G
Subjt:  MPIPSKIAALAAVYSSLALIFLAGASNARDLRPSEHGLEYQNPGATENSSPDMQSFFKGNSWSSSEVALPKAMNTSLPSQWWNRSRSHGNRSGAA----G

Query:  GDHIRGALLVASLVCGIIGASLLIASAFIYIFKFRKQNIRSSSSLSVAETANIIVNK
        GDHIRGALLVASLVCG+IG SLLIASAFIY+FKFRKQN RSSSSLSV+ET NII NK
Subjt:  GDHIRGALLVASLVCGIIGASLLIASAFIYIFKFRKQNIRSSSSLSVAETANIIVNK

XP_038895122.1 uncharacterized protein LOC120083432 [Benincasa hispida]8.4e-5783.23Show/hide
Query:  IPSKIAALAAVYSSLALIFLAGASNARDLRPSEHGLEYQNPGATENSSPDMQSFFKGNSWSSSEVALPKAMNTSLPSQWWNRSRSHGNRSGAA----GGD
        IPS+I + AAV+SSLA+I LAG SNARDLRPSEHGLEYQNPGA ENSSP MQSFFKGNSWSSS+VALPKAMNTSLP QWWN  RSH NRSGAA    GGD
Subjt:  IPSKIAALAAVYSSLALIFLAGASNARDLRPSEHGLEYQNPGATENSSPDMQSFFKGNSWSSSEVALPKAMNTSLPSQWWNRSRSHGNRSGAA----GGD

Query:  HIRGALLVASLVCGIIGASLLIASAFIYIFKFRKQNIRSSSSLSVAETANIIVNK
        HIRGALLVASLVCGIIG SLLIASAFIY FKFRKQ+ RSSSSLSV+ET NII NK
Subjt:  HIRGALLVASLVCGIIGASLLIASAFIYIFKFRKQNIRSSSSLSVAETANIIVNK

TrEMBL top hitse value%identityAlignment
A0A5D3C2L6 Uncharacterized protein2.8e-5076.77Show/hide
Query:  SKIAALAAVYSSLALI--FLAGASNARDLRPSEHGLEYQNPGATENSSPDMQSFFKGNSWSSSEVALPKAMNTSLPSQWWNRSRSHGNRSG--AAGGDHI
        S I + A +YSSLALI   LA  SNARDLRPSEHGL+YQNP A +NSSP MQSFFKGNSWS+S+VALPKAMNTSLP QWWN  RSH NRSG  AAGGDHI
Subjt:  SKIAALAAVYSSLALI--FLAGASNARDLRPSEHGLEYQNPGATENSSPDMQSFFKGNSWSSSEVALPKAMNTSLPSQWWNRSRSHGNRSG--AAGGDHI

Query:  RGALLVASLVCGIIGASLLIASAFIYIFKFRKQ--NIRSSSSLSVAETANIIVNK
        RGALLVASLVCGIIG SLL+ASAFIY FKFRKQ  +  SSSSLS +ET NII NK
Subjt:  RGALLVASLVCGIIGASLLIASAFIYIFKFRKQ--NIRSSSSLSVAETANIIVNK

A0A6J1EH69 uncharacterized protein LOC1114342718.7e-6083.44Show/hide
Query:  MPIPSKIAALAAVYSSLALIFLAGASNARDLRPSEHGLEYQNPGATENSSPDMQSFFKGNSWSSSEVALPKAMNTSLPSQWWNRSRSHGNRSGAA----G
        M I S+IAA+AA+Y S+A+IFLAG SNARDLRPSEHGLEYQNPGA ENSSPDMQSFFKGNSWSSS+VALPKAMNTSLPSQWWN  RSH +RSGAA    G
Subjt:  MPIPSKIAALAAVYSSLALIFLAGASNARDLRPSEHGLEYQNPGATENSSPDMQSFFKGNSWSSSEVALPKAMNTSLPSQWWNRSRSHGNRSGAA----G

Query:  GDHIRGALLVASLVCGIIGASLLIASAFIYIFKFRKQNIRSSSSLSVAETANIIVNK
        GDHIRGALLVASLVCG+IG SLLIASAFIY+FKFRKQN RSSSSLSV+ET NII NK
Subjt:  GDHIRGALLVASLVCGIIGASLLIASAFIYIFKFRKQNIRSSSSLSVAETANIIVNK

A0A6J1GT11 uncharacterized protein LOC1114567133.8e-5580.39Show/hide
Query:  MPIPSKIAALAAVYSSLALIFLAGASNARDLRPSEHGLEYQNPGATENSSPDMQSFFKGNSWSSSEVALPKAMNTSLPSQWWNRSRSHGNRSGAAGGDHI
        M IPS+IAA AAVY  LA IF  G SNARDLRPSEHGLEYQNPGAT NSS +MQSFF+GNSWSSSEV LPKA+NTSLPSQWWN  RSH NRSG AGGDHI
Subjt:  MPIPSKIAALAAVYSSLALIFLAGASNARDLRPSEHGLEYQNPGATENSSPDMQSFFKGNSWSSSEVALPKAMNTSLPSQWWNRSRSHGNRSGAAGGDHI

Query:  RGALLVASLVCGIIGASLLIASAFIYIFKFRKQNIRSSSSLSVAETANIIVNK
        RGAL+V SLVCGIIG SLLIASAFIY FKF+KQN RS SSLS +ETA+IIV+K
Subjt:  RGALLVASLVCGIIGASLLIASAFIYIFKFRKQNIRSSSSLSVAETANIIVNK

A0A6J1K0B3 uncharacterized protein LOC1114899331.6e-5075.82Show/hide
Query:  MPIPSKIAALAAVYSSLALIFLAGASNARDLRPSEHGLEYQNPGATENSSPDMQSFFKGNSWSSSEVALPKAMNTSLPSQWWNRSRSHGNRSGAAGGDHI
        M IPS+IAA AAV   LA IF  G SNARDLRPSEHGLEYQNPGAT NSS +MQSFF+GN+ +SS V LPKA+NTSLPSQWWN  RSH NRSG +GGDHI
Subjt:  MPIPSKIAALAAVYSSLALIFLAGASNARDLRPSEHGLEYQNPGATENSSPDMQSFFKGNSWSSSEVALPKAMNTSLPSQWWNRSRSHGNRSGAAGGDHI

Query:  RGALLVASLVCGIIGASLLIASAFIYIFKFRKQNIRSSSSLSVAETANIIVNK
        RGAL+V SLVCGIIG SLLIASAFIY FKF+KQN RS SSLS +ETA+I+V+K
Subjt:  RGALLVASLVCGIIGASLLIASAFIYIFKFRKQNIRSSSSLSVAETANIIVNK

A0A6J1KLE3 uncharacterized protein LOC1114952326.2e-5883.23Show/hide
Query:  IPSKIAALAAVYSSLALIFLAGASNARDLRPSEHGLEYQNPGATENSSPDMQSFFKGNSWSSSEVALPKAMNTSLPSQWWNRSRSHGNRSGAA----GGD
        I S+IAA AA+Y SLA+IFLAG SNARDLRPSEHGLEYQNPGA ENSSPDMQSFFKGNSWSSS+VALPKAMNTSLPSQWWN  RSH +RS AA     GD
Subjt:  IPSKIAALAAVYSSLALIFLAGASNARDLRPSEHGLEYQNPGATENSSPDMQSFFKGNSWSSSEVALPKAMNTSLPSQWWNRSRSHGNRSGAA----GGD

Query:  HIRGALLVASLVCGIIGASLLIASAFIYIFKFRKQNIRSSSSLSVAETANIIVNK
        HIRGALLVASLVCG+IG SLLIASAFIY+FKFRKQN RSSSSLSV+ET NII NK
Subjt:  HIRGALLVASLVCGIIGASLLIASAFIYIFKFRKQNIRSSSSLSVAETANIIVNK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30515.1 unknown protein3.2e-0637.96Show/hide
Query:  NARDLRPSEHGLE-YQNPGATENSSPDMQSFFKGNSWSSSEVALPKAMNTSLPSQWWNRSRSHGNRSGAAGGDHIRG-ALLVASLVCGIIGASLLIASAF
        NAR+LRPS+HGLE Y  PG     S +M SFF G   S+   ++    ++ LPS   +  ++    S     D +    L+V SLVCG+ G +L++ASA 
Subjt:  NARDLRPSEHGLE-YQNPGATENSSPDMQSFFKGNSWSSSEVALPKAMNTSLPSQWWNRSRSHGNRSGAAGGDHIRG-ALLVASLVCGIIGASLLIASAF

Query:  IYIFKFRK
        IY   + K
Subjt:  IYIFKFRK

AT4G21740.1 unknown protein2.2e-1037.4Show/hide
Query:  LIFLAGASNARDLRPSEHGLEYQ--NPGATENSSPDMQSFFKGNSWSSS-----EVALPK--AMNTSLPSQWWNRSRSHGNRSGAA-GGDHI-RGALLVA
        L+   G S A +LRPS+HGL+YQ  +P    +S P     F G+S SSS        LPK  A +      WW        R GA    DH+ R   L A
Subjt:  LIFLAGASNARDLRPSEHGLEYQ--NPGATENSSPDMQSFFKGNSWSSS-----EVALPK--AMNTSLPSQWWNRSRSHGNRSGAA-GGDHI-RGALLVA

Query:  SLVCGIIGASLLIASAFIYIFKFRKQNIRSS
        S++CG+ G +LL+    IY F++RK N  +S
Subjt:  SLVCGIIGASLLIASAFIYIFKFRKQNIRSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGATTCCCTCGAAGATTGCTGCACTCGCCGCCGTCTACTCCTCTCTCGCCCTCATTTTCCTCGCCGGAGCCTCAAATGCGAGGGACCTGCGTCCATCGGAGCACGG
CTTGGAGTACCAAAACCCTGGCGCTACTGAAAATTCCTCGCCGGATATGCAATCATTCTTCAAAGGAAACTCCTGGTCCTCCTCCGAGGTCGCGCTTCCGAAGGCGATGA
ACACCAGCCTGCCGTCGCAGTGGTGGAATCGCAGTCGCAGCCACGGCAACCGCTCTGGAGCCGCTGGCGGAGATCACATTCGCGGCGCATTGCTTGTGGCGAGTTTGGTA
TGTGGAATTATAGGCGCTTCTTTGCTGATTGCTTCTGCTTTTATTTACATCTTCAAGTTCAGGAAGCAGAATATTAGATCGTCGTCCTCGCTCTCTGTCGCTGAAACTGC
TAACATTATCGTTAACAAGTGA
mRNA sequenceShow/hide mRNA sequence
CTCACAGATAAAATCAAATCAACCACAAACGCAAAGCACTGTAAATCCATACCAGAAAAATGCCGATTCCCTCGAAGATTGCTGCACTCGCCGCCGTCTACTCCTCTCTC
GCCCTCATTTTCCTCGCCGGAGCCTCAAATGCGAGGGACCTGCGTCCATCGGAGCACGGCTTGGAGTACCAAAACCCTGGCGCTACTGAAAATTCCTCGCCGGATATGCA
ATCATTCTTCAAAGGAAACTCCTGGTCCTCCTCCGAGGTCGCGCTTCCGAAGGCGATGAACACCAGCCTGCCGTCGCAGTGGTGGAATCGCAGTCGCAGCCACGGCAACC
GCTCTGGAGCCGCTGGCGGAGATCACATTCGCGGCGCATTGCTTGTGGCGAGTTTGGTATGTGGAATTATAGGCGCTTCTTTGCTGATTGCTTCTGCTTTTATTTACATC
TTCAAGTTCAGGAAGCAGAATATTAGATCGTCGTCCTCGCTCTCTGTCGCTGAAACTGCTAACATTATCGTTAACAAGTGATTTGAACACGGAATTGCATGTACATAGCT
CTGAGATCTGGTCGCTGATCTTATTTTTGTGAGAGATTCATCTCTTTTTCTCTGATTTTTCCTCTGATATGTTTGAATTGAATTGAGTTCCTGTACTACTAGTGTAATAT
TAGATCTTCAGATACTCTAATAGATTGTTGCCAATTTCAGATTCATATAGAG
Protein sequenceShow/hide protein sequence
MPIPSKIAALAAVYSSLALIFLAGASNARDLRPSEHGLEYQNPGATENSSPDMQSFFKGNSWSSSEVALPKAMNTSLPSQWWNRSRSHGNRSGAAGGDHIRGALLVASLV
CGIIGASLLIASAFIYIFKFRKQNIRSSSSLSVAETANIIVNK