; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004787 (gene) of Snake gourd v1 genome

Gene IDTan0004787
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF 3339)
Genome locationLG11:1327517..1328695
RNA-Seq ExpressionTan0004787
SyntenyTan0004787
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR021775 - Protein of unknown function DUF3339


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0056546.1 hypothetical protein E6C27_scaffold288G00480 [Cucumis melo var. makuwa]5.1e-4672.3Show/hide
Query:  MSADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILVIAIERREEEVTGSSSVDVDMKSENEDGVEDEEDESVNGHS
        MSADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAIL                 ER+EEEVTGS+SVDVDM+SENE+ +EDEEDESV  +S
Subjt:  MSADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILVIAIERREEEVTGSSSVDVDMKSENEDGVEDEEDESVNGHS

Query:  FTVSLHAPELKLPIIPRKLEQKPRGEQHEQYNSDHHWRPIRHTFRDIV
        F+V LHAPELKL IIP KL+QKP  +Q+E+YNSD H  PIRH+F DIV
Subjt:  FTVSLHAPELKLPIIPRKLEQKPRGEQHEQYNSDHHWRPIRHTFRDIV

KAE8652261.1 hypothetical protein Csa_022581 [Cucumis sativus]1.3e-4976.35Show/hide
Query:  MSADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILVIAIERREEEVTGSSSVDVDMKSENEDGVEDEEDESVNGHS
        MSADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTIL+IAI          +SVDVDM+SENED +EDEEDESV  +S
Subjt:  MSADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILVIAIERREEEVTGSSSVDVDMKSENEDGVEDEEDESVNGHS

Query:  FTVSLHAPELKLPIIPRKLEQKPRGEQHEQYNSDHHWRPIRHTFRDIV
        F++ LHAP+LKL II RKL+QK R +Q+E+YNSD H  PIRH+FRD V
Subjt:  FTVSLHAPELKLPIIPRKLEQKPRGEQHEQYNSDHHWRPIRHTFRDIV

KAG6578364.1 hypothetical protein SDJN03_22812, partial [Cucurbita argyrosperma subsp. sororia]7.5e-5075.33Show/hide
Query:  MSADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILVIAIERREEEVTGSSSVDVDMKSENEDGVEDEEDESVNGHS
        M+ADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILVIAI          +S+DVD +SENEDGV+D+E+E VNG  
Subjt:  MSADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILVIAIERREEEVTGSSSVDVDMKSENEDGVEDEEDESVNGHS

Query:  FTVSLHAPELKLPIIPRKLEQKPRGEQHEQYNSDHHWRPIRHTFRDIVIH
         +V LH P+LKL IIPRKLEQKP  EQ+EQYNSDH+  PIRH F D V+H
Subjt:  FTVSLHAPELKLPIIPRKLEQKPRGEQHEQYNSDHHWRPIRHTFRDIVIH

KAG6602686.1 Actin-related protein 2/3 complex subunit 2A, partial [Cucurbita argyrosperma subsp. sororia]1.3e-5769.84Show/hide
Query:  MSADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILVIAI-------------------------------------
        MSADWGPVVVAVALFI+LSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILVIAI                                     
Subjt:  MSADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILVIAI-------------------------------------

Query:  -----ERREEEVTGSSSVDVDMKSENEDGVEDEEDESVNGHSFTVSLHAPELKLPIIPRKLEQKPRGEQHEQYNSDHHWRPIRHTFRDI
             E REEEVTGSS VDVDMKSENEDG+EDEEDES+N +SFTVSLH PELKLPIIPR+LEQKP  +QHEQ NSDHHWRPIRH  RDI
Subjt:  -----ERREEEVTGSSSVDVDMKSENEDGVEDEEDESVNGHSFTVSLHAPELKLPIIPRKLEQKPRGEQHEQYNSDHHWRPIRHTFRDI

RZB59728.1 hypothetical protein D0Y65_042794 [Glycine soja]5.1e-3865.07Show/hide
Query:  MSADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILVIAIERREEEVTGSSSVDVDMKSENEDGVEDEEDESVNGHS
        M ADWGPV++AVALFILLSPGLLFQ PAR RVVEFGNM+TSGIAILVHAIIFFCILTIL+          TGS+ VD+D   E+EDGVE EEDE V+ + 
Subjt:  MSADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILVIAIERREEEVTGSSSVDVDMKSENEDGVEDEEDESVNGHS

Query:  FTVSLHAPELKLPIIPRKLEQKPRGEQHEQYNSDHHWRPIRHTFRD
        F V LHA ELKL  +PRK E++PR EQHEQ   D +  PI H   D
Subjt:  FTVSLHAPELKLPIIPRKLEQKPRGEQHEQYNSDHHWRPIRHTFRD

TrEMBL top hitse value%identityAlignment
A0A445GEX9 Uncharacterized protein2.5e-3865.07Show/hide
Query:  MSADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILVIAIERREEEVTGSSSVDVDMKSENEDGVEDEEDESVNGHS
        M ADWGPV++AVALFILLSPGLLFQ PAR RVVEFGNM+TSGIAILVHAIIFFCILTIL+          TGS+ VD+D   E+EDGVE EEDE V+ + 
Subjt:  MSADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILVIAIERREEEVTGSSSVDVDMKSENEDGVEDEEDESVNGHS

Query:  FTVSLHAPELKLPIIPRKLEQKPRGEQHEQYNSDHHWRPIRHTFRD
        F V LHA ELKL  +PRK E++PR EQHEQ   D +  PI H   D
Subjt:  FTVSLHAPELKLPIIPRKLEQKPRGEQHEQYNSDHHWRPIRHTFRD

A0A498HDU7 Uncharacterized protein4.8e-3452.87Show/hide
Query:  MSADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILV------------------IAIERREEEVTGSSSVDVDMKS
        MSADWGPVVVAV +FILLSPGLLFQLPAR RV+EFGNM+TSGIAILVHA+I+FCI+TIL                   I  +     VTGS+ +DVD+  
Subjt:  MSADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILV------------------IAIERREEEVTGSSSVDVDMKS

Query:  ENEDGVEDEEDESVNGHSFTVSLHAPELKLPIIPRKLEQKPRGEQHEQYNSDHHWRP
        ++E+G+E+EE+ESV+     V LHAP+L+L  + R+ E++PR +Q +Q++S  +W P
Subjt:  ENEDGVEDEEDESVNGHSFTVSLHAPELKLPIIPRKLEQKPRGEQHEQYNSDHHWRP

A0A4D8YM52 Uncharacterized protein2.3e-2852.74Show/hide
Query:  MSADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILVIAIERREEEVTGSSSVDVDMKSENEDGVEDEEDESVNGHS
        M  DWGPVVVAVA+FILLSPGLLFQLPAR RV+EFGNM TSGI+IL+HAI++FCI TI+V+AI                       GV+ EEDE V   S
Subjt:  MSADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILVIAIERREEEVTGSSSVDVDMKSENEDGVEDEEDESVNGHS

Query:  FTVSLHAPELKLPIIPRKLEQKPRGEQHEQYNSDHHWRPIRHTFRD
        F V   A +L L  +  +LE+KPR +++E+YNS  H  P+RH  R+
Subjt:  FTVSLHAPELKLPIIPRKLEQKPRGEQHEQYNSDHHWRPIRHTFRD

A0A5A7QQI9 Uncharacterized protein2.5e-3560.71Show/hide
Query:  DWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFC-ILTILVIAIERREEEVTGSSSVDVDMKSENEDGVEDEEDESVNGHSFT
        DWGPVVVAVA+FILLSPGLLFQLPAR RV+EFGNM TSGI+ILVHAI++F  ++ +    I    E  TGS+ VDVD+  E+E+ VE EEDE V+G+S  
Subjt:  DWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFC-ILTILVIAIERREEEVTGSSSVDVDMKSENEDGVEDEEDESVNGHSFT

Query:  VSLHAPELKLPIIPRKLEQKPRGEQHEQYNSDHHWRPIRH
        V LHA EL+L ++PR L+Q+PR +Q+E+YNSD H  P+ H
Subjt:  VSLHAPELKLPIIPRKLEQKPRGEQHEQYNSDHHWRPIRH

A0A5D3BWV4 Uncharacterized protein2.4e-4672.3Show/hide
Query:  MSADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILVIAIERREEEVTGSSSVDVDMKSENEDGVEDEEDESVNGHS
        MSADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAIL                 ER+EEEVTGS+SVDVDM+SENE+ +EDEEDESV  +S
Subjt:  MSADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILVIAIERREEEVTGSSSVDVDMKSENEDGVEDEEDESVNGHS

Query:  FTVSLHAPELKLPIIPRKLEQKPRGEQHEQYNSDHHWRPIRHTFRDIV
        F+V LHAPELKL IIP KL+QKP  +Q+E+YNSD H  PIRH+F DIV
Subjt:  FTVSLHAPELKLPIIPRKLEQKPRGEQHEQYNSDHHWRPIRHTFRDIV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G01940.1 Protein of unknown function (DUF 3339)3.9e-2079.03Show/hide
Query:  ADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILVIAIE
        ADWGPV VAV LF++LSPGLLFQLPAR RV+E GNM TSGI+ILVHAI+FF I+TILVIAI+
Subjt:  ADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILVIAIE

AT3G27027.1 Protein of unknown function (DUF 3339)3.4e-2487.5Show/hide
Query:  MSADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILVIAIE
        MS DWGPV+VAV+LFILLSPGLLFQLPAR RVVEFGNM TSGIAILVHA I+FCILTILVIAI+
Subjt:  MSADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILVIAIE

AT3G48660.1 Protein of unknown function (DUF 3339)7.3e-1977.05Show/hide
Query:  ADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILVIAI
        ADWGPVVVAV LF+LL+PGLLFQ+PAR RVVEFGNM TSG +ILVH IIFF ++TI  IAI
Subjt:  ADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILVIAI

AT5G40980.1 Protein of unknown function (DUF 3339)8.9e-2587.5Show/hide
Query:  MSADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILVIAIE
        MSADWGPV+VAVALFILLSPGLLFQLPAR RV+EFGNM+TSGI+ILVHAII+FCILTIL+IAI+
Subjt:  MSADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILVIAIE

AT5G63500.1 Protein of unknown function (DUF 3339)4.8e-1873.77Show/hide
Query:  ADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILVIAI
        ADWGPV++AV LF++LSPGLLFQ+PA  RVVEFGNM TSG +ILVHAIIFF ++TI  IAI
Subjt:  ADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILVIAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGCTGACTGGGGACCGGTTGTCGTGGCAGTGGCGCTGTTCATTCTCCTGTCGCCAGGGCTGCTTTTTCAGTTGCCGGCGAGAATCAGGGTGGTGGAGTTTGGGAA
CATGAACACCAGTGGGATTGCCATTTTGGTGCACGCCATCATTTTCTTCTGCATACTCACCATATTGGTCATCGCTATTGAGAGACGAGAAGAAGAGGTTACCGGATCAT
CCAGTGTTGATGTGGATATGAAGAGCGAGAATGAAGACGGCGTAGAGGATGAAGAAGATGAGAGTGTGAATGGCCACAGCTTTACCGTTAGTCTTCATGCTCCCGAACTC
AAGCTGCCGATTATTCCCCGGAAACTGGAACAGAAGCCCCGGGGAGAGCAGCACGAACAGTACAACTCCGATCACCACTGGCGCCCAATCCGACATACTTTCAGAGATAT
CGTGATTCACTTAACCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTGCTGACTGGGGACCGGTTGTCGTGGCAGTGGCGCTGTTCATTCTCCTGTCGCCAGGGCTGCTTTTTCAGTTGCCGGCGAGAATCAGGGTGGTGGAGTTTGGGAA
CATGAACACCAGTGGGATTGCCATTTTGGTGCACGCCATCATTTTCTTCTGCATACTCACCATATTGGTCATCGCTATTGAGAGACGAGAAGAAGAGGTTACCGGATCAT
CCAGTGTTGATGTGGATATGAAGAGCGAGAATGAAGACGGCGTAGAGGATGAAGAAGATGAGAGTGTGAATGGCCACAGCTTTACCGTTAGTCTTCATGCTCCCGAACTC
AAGCTGCCGATTATTCCCCGGAAACTGGAACAGAAGCCCCGGGGAGAGCAGCACGAACAGTACAACTCCGATCACCACTGGCGCCCAATCCGACATACTTTCAGAGATAT
CGTGATTCACTTAACCTAA
Protein sequenceShow/hide protein sequence
MSADWGPVVVAVALFILLSPGLLFQLPARIRVVEFGNMNTSGIAILVHAIIFFCILTILVIAIERREEEVTGSSSVDVDMKSENEDGVEDEEDESVNGHSFTVSLHAPEL
KLPIIPRKLEQKPRGEQHEQYNSDHHWRPIRHTFRDIVIHLT