; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021854 (gene) of Snake gourd v1 genome

Gene IDTan0021854
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTransmembrane protein
Genome locationLG02:788983..790118
RNA-Seq ExpressionTan0021854
SyntenyTan0021854
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7023009.1 hypothetical protein SDJN02_16745, partial [Cucurbita argyrosperma subsp. argyrosperma]3.3e-10194.06Show/hide
Query:  DDNNKTLSIMESKPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFFVFHSTAILLLFNASAKDFNGAACKRSWIPSLCS
        D+NN++LSIMESKPSHPLHQIA+TPTHKLLLKQWLKEEELIFGR+SLKETQIDSVRKEITMLHIFFFVFHST+ILLLFNASAKDFNGAAC RSWIPS+CS
Subjt:  DDNNKTLSIMESKPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFFVFHSTAILLLFNASAKDFNGAACKRSWIPSLCS

Query:  LLFSLGIIWAVRYKTDLEAHLEKLLEREKEDRNLLSKCVDELKRKGVEFDLLKEVDALRRAKSLRVESKAVRKWSSRDFITLFFFSVSCMFLGVIRIVLC
        LLFSLGIIWAVRYKTDLEAHLEKLLEREKEDRNLLSKCV+ELKRKGVEFDLLKEVDALRRAKSLRVESKAVRKWSSRDF+TLFFFSVSCMFLG+IRIVLC
Subjt:  LLFSLGIIWAVRYKTDLEAHLEKLLEREKEDRNLLSKCVDELKRKGVEFDLLKEVDALRRAKSLRVESKAVRKWSSRDFITLFFFSVSCMFLGVIRIVLC

Query:  NK
        +K
Subjt:  NK

XP_022952110.1 uncharacterized protein LOC111454873 [Cucurbita moschata]4.3e-10193.56Show/hide
Query:  MDDNNKTLSIMESKPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFFVFHSTAILLLFNASAKDFNGAACKRSWIPSLC
        MDDNNK LS+ME KPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFF+FHSTA+LLLFNAS KDF+G AC RSWIPSLC
Subjt:  MDDNNKTLSIMESKPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFFVFHSTAILLLFNASAKDFNGAACKRSWIPSLC

Query:  SLLFSLGIIWAVRYKTDLEAHLEKLLEREKEDRNLLSKCVDELKRKGVEFDLLKEVDALRRAKSLRVESKAVRKWSSRDFITLFFFSVSCMFLGVIRIVL
        SLLFSLGIIWAVRYKTDLEAHLEKLLEREKEDRNLL KCVDELKRKGVEFDLLKEVDALRRAKSLRVE+KAVRKWSSRDFITLFFFSVSCMFLG+IR+VL
Subjt:  SLLFSLGIIWAVRYKTDLEAHLEKLLEREKEDRNLLSKCVDELKRKGVEFDLLKEVDALRRAKSLRVESKAVRKWSSRDFITLFFFSVSCMFLGVIRIVL

Query:  CN
        CN
Subjt:  CN

XP_023529232.1 uncharacterized protein LOC111792131 [Cucurbita pepo subsp. pepo]1.5e-10194.55Show/hide
Query:  DDNNKTLSIMESKPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFFVFHSTAILLLFNASAKDFNGAACKRSWIPSLCS
        D+NNK+LSIMESKPSHPLHQIA+TPTHKLLLKQWLKEEELIFGR+SLKETQIDSVRKEITMLHIFFFVFHST+ILLLFNASAKDFNGAAC RSWIPS+CS
Subjt:  DDNNKTLSIMESKPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFFVFHSTAILLLFNASAKDFNGAACKRSWIPSLCS

Query:  LLFSLGIIWAVRYKTDLEAHLEKLLEREKEDRNLLSKCVDELKRKGVEFDLLKEVDALRRAKSLRVESKAVRKWSSRDFITLFFFSVSCMFLGVIRIVLC
        LLFSLGIIWAVRYKTDLEAHLEKLLEREKEDRNLLSKCV+ELKRKGVEFDLLKEVDALRRAKSLRVESKAVRKWSSRDF+TLFFFSVSCMFLG+IRIVLC
Subjt:  LLFSLGIIWAVRYKTDLEAHLEKLLEREKEDRNLLSKCVDELKRKGVEFDLLKEVDALRRAKSLRVESKAVRKWSSRDFITLFFFSVSCMFLGVIRIVLC

Query:  NK
        +K
Subjt:  NK

XP_023553637.1 uncharacterized protein LOC111811128 [Cucurbita pepo subsp. pepo]4.3e-10193.56Show/hide
Query:  MDDNNKTLSIMESKPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFFVFHSTAILLLFNASAKDFNGAACKRSWIPSLC
        MDDNNK LS+ME KPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFF+FHSTA+LLLFNAS KDF+G AC RSWIPSLC
Subjt:  MDDNNKTLSIMESKPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFFVFHSTAILLLFNASAKDFNGAACKRSWIPSLC

Query:  SLLFSLGIIWAVRYKTDLEAHLEKLLEREKEDRNLLSKCVDELKRKGVEFDLLKEVDALRRAKSLRVESKAVRKWSSRDFITLFFFSVSCMFLGVIRIVL
        SLLFSLGIIWAVRYKTDLEAHLEKLLEREKEDRNLL KCVDELKRKGVEFDLLKEVDALRRAKSLRVE+KAVRKWSSRDFITLFFFSVSCMFLG+IR+VL
Subjt:  SLLFSLGIIWAVRYKTDLEAHLEKLLEREKEDRNLLSKCVDELKRKGVEFDLLKEVDALRRAKSLRVESKAVRKWSSRDFITLFFFSVSCMFLGVIRIVL

Query:  CN
        CN
Subjt:  CN

XP_038888007.1 uncharacterized protein LOC120077946 [Benincasa hispida]5.6e-10194.06Show/hide
Query:  MDDNNKTLSIMESKPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFFVFHSTAILLLFNASAKDFNGAACKRSWIPSLC
        MDDNNK LSIME KPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVR+EITMLHIFFFVFHSTAILLLFNAS KDF+G ACKRSWIPSLC
Subjt:  MDDNNKTLSIMESKPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFFVFHSTAILLLFNASAKDFNGAACKRSWIPSLC

Query:  SLLFSLGIIWAVRYKTDLEAHLEKLLEREKEDRNLLSKCVDELKRKGVEFDLLKEVDALRRAKSLRVESKAVRKWSSRDFITLFFFSVSCMFLGVIRIVL
        SLLFSLGIIWAVRYKTD+EAHLEKLLEREKEDRNLLSKCVDELKRKG+EFDLLKEVDALRRAKSLRVE+KAVRKWSSRDFITLFFF VSCMFLG+IR+VL
Subjt:  SLLFSLGIIWAVRYKTDLEAHLEKLLEREKEDRNLLSKCVDELKRKGVEFDLLKEVDALRRAKSLRVESKAVRKWSSRDFITLFFFSVSCMFLGVIRIVL

Query:  CN
        CN
Subjt:  CN

TrEMBL top hitse value%identityAlignment
A0A0A0K424 Uncharacterized protein7.9e-10194.03Show/hide
Query:  DDNNKTLSIMESKPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFFVFHSTAILLLFNASAKDFNGAACKRSWIPSLCS
        D NNK LS+ME KPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFFVFHSTAILLLFNAS KDF+G ACKRSWIPSLCS
Subjt:  DDNNKTLSIMESKPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFFVFHSTAILLLFNASAKDFNGAACKRSWIPSLCS

Query:  LLFSLGIIWAVRYKTDLEAHLEKLLEREKEDRNLLSKCVDELKRKGVEFDLLKEVDALRRAKSLRVESKAVRKWSSRDFITLFFFSVSCMFLGVIRIVLC
        LLFSLGIIWAVRYKTD+EAHLEKLLEREKEDRNLLSKCVDELKRKG+EFDLLKEVDALRRAKSLRVE+KAVRKWSSRDFITLFFFSVSCMFLG+IR+VLC
Subjt:  LLFSLGIIWAVRYKTDLEAHLEKLLEREKEDRNLLSKCVDELKRKGVEFDLLKEVDALRRAKSLRVESKAVRKWSSRDFITLFFFSVSCMFLGVIRIVLC

Query:  N
        N
Subjt:  N

A0A1S3BHA3 uncharacterized protein LOC1034896443.6e-10195.02Show/hide
Query:  DDNNKTLSIMESKPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFFVFHSTAILLLFNASAKDFNGAACKRSWIPSLCS
        D NNK LSIME KPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFFVFHSTAILLLFNAS KDF+G ACKRSWIPSLCS
Subjt:  DDNNKTLSIMESKPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFFVFHSTAILLLFNASAKDFNGAACKRSWIPSLCS

Query:  LLFSLGIIWAVRYKTDLEAHLEKLLEREKEDRNLLSKCVDELKRKGVEFDLLKEVDALRRAKSLRVESKAVRKWSSRDFITLFFFSVSCMFLGVIRIVLC
        LLFSLGIIWAVRYKTD+EAHLEKLLEREKEDRNLLSKCVDELKRKG+EFDLLKEVDALRRAKSLRVE+KAVRKWSSRDFITLFFFSVSCMFLGVIR+VLC
Subjt:  LLFSLGIIWAVRYKTDLEAHLEKLLEREKEDRNLLSKCVDELKRKGVEFDLLKEVDALRRAKSLRVESKAVRKWSSRDFITLFFFSVSCMFLGVIRIVLC

Query:  N
        N
Subjt:  N

A0A5A7U7C4 Putative transmembrane protein3.6e-10195.02Show/hide
Query:  DDNNKTLSIMESKPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFFVFHSTAILLLFNASAKDFNGAACKRSWIPSLCS
        D NNK LSIME KPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFFVFHSTAILLLFNAS KDF+G ACKRSWIPSLCS
Subjt:  DDNNKTLSIMESKPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFFVFHSTAILLLFNASAKDFNGAACKRSWIPSLCS

Query:  LLFSLGIIWAVRYKTDLEAHLEKLLEREKEDRNLLSKCVDELKRKGVEFDLLKEVDALRRAKSLRVESKAVRKWSSRDFITLFFFSVSCMFLGVIRIVLC
        LLFSLGIIWAVRYKTD+EAHLEKLLEREKEDRNLLSKCVDELKRKG+EFDLLKEVDALRRAKSLRVE+KAVRKWSSRDFITLFFFSVSCMFLGVIR+VLC
Subjt:  LLFSLGIIWAVRYKTDLEAHLEKLLEREKEDRNLLSKCVDELKRKGVEFDLLKEVDALRRAKSLRVESKAVRKWSSRDFITLFFFSVSCMFLGVIRIVLC

Query:  N
        N
Subjt:  N

A0A6J1GKU6 uncharacterized protein LOC1114548732.1e-10193.56Show/hide
Query:  MDDNNKTLSIMESKPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFFVFHSTAILLLFNASAKDFNGAACKRSWIPSLC
        MDDNNK LS+ME KPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFF+FHSTA+LLLFNAS KDF+G AC RSWIPSLC
Subjt:  MDDNNKTLSIMESKPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFFVFHSTAILLLFNASAKDFNGAACKRSWIPSLC

Query:  SLLFSLGIIWAVRYKTDLEAHLEKLLEREKEDRNLLSKCVDELKRKGVEFDLLKEVDALRRAKSLRVESKAVRKWSSRDFITLFFFSVSCMFLGVIRIVL
        SLLFSLGIIWAVRYKTDLEAHLEKLLEREKEDRNLL KCVDELKRKGVEFDLLKEVDALRRAKSLRVE+KAVRKWSSRDFITLFFFSVSCMFLG+IR+VL
Subjt:  SLLFSLGIIWAVRYKTDLEAHLEKLLEREKEDRNLLSKCVDELKRKGVEFDLLKEVDALRRAKSLRVESKAVRKWSSRDFITLFFFSVSCMFLGVIRIVL

Query:  CN
        CN
Subjt:  CN

A0A6J1I0X4 uncharacterized protein LOC1114680534.6e-10193.07Show/hide
Query:  MDDNNKTLSIMESKPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFFVFHSTAILLLFNASAKDFNGAACKRSWIPSLC
        MDDNNK LS+ME KPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFF+FHSTA+LLLFNAS KDF+G AC RSWIPSLC
Subjt:  MDDNNKTLSIMESKPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFFVFHSTAILLLFNASAKDFNGAACKRSWIPSLC

Query:  SLLFSLGIIWAVRYKTDLEAHLEKLLEREKEDRNLLSKCVDELKRKGVEFDLLKEVDALRRAKSLRVESKAVRKWSSRDFITLFFFSVSCMFLGVIRIVL
        SLLFSLGIIWAVRYKTDLEAHL+KLLEREKEDRNLL KCVDELKRKGVEFDLLKEVDALRRAKSLRVE+KAVRKWSSRDFITLFFFSVSCMFLG+IR+VL
Subjt:  SLLFSLGIIWAVRYKTDLEAHLEKLLEREKEDRNLLSKCVDELKRKGVEFDLLKEVDALRRAKSLRVESKAVRKWSSRDFITLFFFSVSCMFLGVIRIVL

Query:  CN
        CN
Subjt:  CN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G12870.1 unknown protein5.8e-8072.91Show/hide
Query:  DNNKTLSIMESKPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFFVFHSTAILLLFNASAKDFNGA---ACKRSWIPSL
        D +    IME+K  HPLHQIA+TPTHKLLLKQWLKEEELI  R+S KE+QIDSVR+EIT L+IFFF+FHS ++LLLF+AS+   + A   ACKRSWIPSL
Subjt:  DNNKTLSIMESKPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFFVFHSTAILLLFNASAKDFNGA---ACKRSWIPSL

Query:  CSLLFSLGIIWAVRYKTDLEAHLEKLLEREKEDRNLLSKCVDELKRKGVEFDLLKEVDALRRAKSLRVESKAVRKWSSRDFITLFFFSVSCMFLGVIRIV
        C+LL SLGIIWAVRYK+++E+HLEKLLEREKED  LL KCV+ELK+KG+EFDLLKEVDALRRAKSLRVESK V+KWS+RDF+TLFFFSVSC+ L +IR++
Subjt:  CSLLFSLGIIWAVRYKTDLEAHLEKLLEREKEDRNLLSKCVDELKRKGVEFDLLKEVDALRRAKSLRVESKAVRKWSSRDFITLFFFSVSCMFLGVIRIV

Query:  LCN
        LC+
Subjt:  LCN

AT5G56120.1 unknown protein1.2e-2736.76Show/hide
Query:  NNKTLSIMESKPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFFVFHSTAILLLFNASAKD-----FNGAACKRSWIPS
        +N  + I++    HPL +I+E+P H LLLK W +EE+L   R+ LKE++++S+++EI  L  FF VFH     L++++S  D      + A CK+ WIPS
Subjt:  NNKTLSIMESKPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFFVFHSTAILLLFNASAKD-----FNGAACKRSWIPS

Query:  LCSLLFSLGIIWAVRYKTDLEAHLEKLLEREKEDRNLLSKCVDELKRKGVEFDLLKEVDALRRAKSLRVESK-AVRKWSSRDFITLFFFSVSCMFLGVIR
          SL  SL +++ V+ K  +   + + + RE+ D   L++CV EL+ KG  FDL KE  + +R KS  VE K     W S+  IT+     + +F  V +
Subjt:  LCSLLFSLGIIWAVRYKTDLEAHLEKLLEREKEDRNLLSKCVDELKRKGVEFDLLKEVDALRRAKSLRVESK-AVRKWSSRDFITLFFFSVSCMFLGVIR

Query:  IVLC
         +LC
Subjt:  IVLC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGATAACAACAAAACCTTATCAATCATGGAATCAAAACCTTCACACCCACTTCACCAAATTGCCGAAACCCCAACCCACAAACTTCTTCTCAAGCAGTGGCTCAA
AGAAGAAGAACTAATCTTCGGCAGAATCTCTCTCAAAGAAACCCAAATCGACTCTGTTCGAAAAGAAATCACTATGCTTCACATTTTCTTCTTCGTCTTCCACTCCACAG
CTATCCTCCTCCTCTTCAACGCCTCAGCCAAGGACTTCAATGGCGCAGCCTGCAAGAGATCGTGGATTCCATCGCTTTGTTCTCTGTTATTCTCTCTGGGAATCATTTGG
GCCGTCAGATACAAGACCGATCTCGAGGCCCATTTGGAGAAATTGTTAGAAAGGGAAAAGGAAGATAGAAATTTGCTGTCGAAGTGTGTGGATGAATTGAAGAGGAAAGG
GGTTGAATTTGATTTGTTGAAGGAGGTGGATGCGCTGCGGAGAGCAAAAAGTTTGCGGGTTGAAAGCAAGGCAGTGAGAAAATGGTCTTCAAGAGATTTCATCACTCTGT
TTTTCTTCTCTGTTTCTTGTATGTTTCTTGGGGTTATTAGGATTGTTTTATGTAATAAGTGA
mRNA sequenceShow/hide mRNA sequence
GCCATTGCCAGTTGCCCGTTGGGAGGGCGGTGGAGTGAAACACAAACGCGCACGCGCCTATTTACGGGAAAAACGCAATTCAAACGCTTTTCCAGATCCGACCGTTACAA
TTATTTGAAATAATCAGAGAGAGCCTCCTTTGAAAAACCTCCATTCCTGTATCTTTCAAGCTCTCCATCTTTCACTTCATCTCTTTCCTTTCTTCTTCTTCTTCTTCTTC
TTCTTCTTTGTTTTGTGATCCATGGATGATAACAACAAAACCTTATCAATCATGGAATCAAAACCTTCACACCCACTTCACCAAATTGCCGAAACCCCAACCCACAAACT
TCTTCTCAAGCAGTGGCTCAAAGAAGAAGAACTAATCTTCGGCAGAATCTCTCTCAAAGAAACCCAAATCGACTCTGTTCGAAAAGAAATCACTATGCTTCACATTTTCT
TCTTCGTCTTCCACTCCACAGCTATCCTCCTCCTCTTCAACGCCTCAGCCAAGGACTTCAATGGCGCAGCCTGCAAGAGATCGTGGATTCCATCGCTTTGTTCTCTGTTA
TTCTCTCTGGGAATCATTTGGGCCGTCAGATACAAGACCGATCTCGAGGCCCATTTGGAGAAATTGTTAGAAAGGGAAAAGGAAGATAGAAATTTGCTGTCGAAGTGTGT
GGATGAATTGAAGAGGAAAGGGGTTGAATTTGATTTGTTGAAGGAGGTGGATGCGCTGCGGAGAGCAAAAAGTTTGCGGGTTGAAAGCAAGGCAGTGAGAAAATGGTCTT
CAAGAGATTTCATCACTCTGTTTTTCTTCTCTGTTTCTTGTATGTTTCTTGGGGTTATTAGGATTGTTTTATGTAATAAGTGAAGGGTTTTATAGAAATTGGTGATTTTG
TTGGTGATGGGGCATTCATGTGATGCCAGGGATTTCGTTCTTGTCCTGAAATTTGGATGTTGTTTCAGTGCAGATAACTCTGTTCTGCACTTGTATTTTGTTGCTTTGAT
GGGATAAATTGTCTAACAATTTTAAGTTCTTTCCAAAAATGGTTCTTTTGTTATCATTCACTCTCACTCAGTTGATCCTTGCTATATCTGTGCTCAAACTCTTCGTCCTA
CTTCCTCAATCCTCTGTTATTGACAGCCCATGATTA
Protein sequenceShow/hide protein sequence
MDDNNKTLSIMESKPSHPLHQIAETPTHKLLLKQWLKEEELIFGRISLKETQIDSVRKEITMLHIFFFVFHSTAILLLFNASAKDFNGAACKRSWIPSLCSLLFSLGIIW
AVRYKTDLEAHLEKLLEREKEDRNLLSKCVDELKRKGVEFDLLKEVDALRRAKSLRVESKAVRKWSSRDFITLFFFSVSCMFLGVIRIVLCNK