; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002124 (gene) of Snake gourd v1 genome

Gene IDTan0002124
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF1118)
Genome locationLG05:71252417..71254034
RNA-Seq ExpressionTan0002124
SyntenyTan0002124
Gene Ontology termsGO:0032774 - RNA biosynthetic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003899 - DNA-directed 5'-3' RNA polymerase activity (molecular function)
InterPro domainsIPR009500 - Protein of unknown function DUF1118


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152441.1 uncharacterized protein LOC101214681 [Cucumis sativus]1.3e-7082.44Show/hide
Query:  MAVTSPSSSAATAAPSHLPN---LLRPNHFSS-SRFRPSSSNHGRRPVTIVSMASQKKVNKYDGAWEKKWFGAGIFYESSEDVEVDVFKKLETKKVLSNV
        MAVTS S S++TA  SHL N   + R +H S+ +R RP SS+    P+TI +MA QKKVNKYD AWEKKWFGAGIFYES+EDVEVDVFKKLETKKVLSNV
Subjt:  MAVTSPSSSAATAAPSHLPN---LLRPNHFSS-SRFRPSSSNHGRRPVTIVSMASQKKVNKYDGAWEKKWFGAGIFYESSEDVEVDVFKKLETKKVLSNV

Query:  EKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSVLASLALPILVAALAAIVLIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGG
        EKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPS LASLALPILVAAL AIV+IPDDSVALV LQAVVGGGLALGAAGL VGSVVLGG
Subjt:  EKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSVLASLALPILVAALAAIVLIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGG

Query:  LQEAD
        LQEAD
Subjt:  LQEAD

XP_008437595.1 PREDICTED: uncharacterized protein LOC103482961 [Cucumis melo]1.5e-6981.19Show/hide
Query:  VTSPSSSAATAAPSHLPN---LLRPNHFSSSRFRPSSSNHGRRPVTIVSMASQKKVNKYDGAWEKKWFGAGIFYESSEDVEVDVFKKLETKKVLSNVEKA
        +TS S S++TA+ SH PN   + R +H S+   RP  S+    P+TIV+MA QKKVNKYD AWEKKWFGAGIFYES+EDVEVDVFKKLETKKVLSNVEKA
Subjt:  VTSPSSSAATAAPSHLPN---LLRPNHFSSSRFRPSSSNHGRRPVTIVSMASQKKVNKYDGAWEKKWFGAGIFYESSEDVEVDVFKKLETKKVLSNVEKA

Query:  GLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSVLASLALPILVAALAAIVLIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGGLQE
        GLLSKAEELG TLSSIEKLGVFSKAEELGLLSLLEKVASSSPS LASLALPILVAAL AIV+IPDDSVALV LQAVVGGGLALGAAGL VGSVVLGGLQE
Subjt:  GLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSVLASLALPILVAALAAIVLIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGGLQE

Query:  AD
        AD
Subjt:  AD

XP_022137512.1 uncharacterized protein LOC111008941 [Momordica charantia]5.4e-7282.78Show/hide
Query:  MAVTSPSS--SAATAAPSHLPNLLRPNHFSSSRFRPSSSNHGR------RPVTIVSMASQKKVNKYDGAWEKKWFGAGIFYESSEDVEVDVFKKLETKKV
        MAV SPSS  +AATAAPSHL     PN F SS+FR  + NH R        +TIV+MA +KKVNKYD  W+KKWFGAGIFYESSEDVEVDVFKKLETKKV
Subjt:  MAVTSPSS--SAATAAPSHLPNLLRPNHFSSSRFRPSSSNHGR------RPVTIVSMASQKKVNKYDGAWEKKWFGAGIFYESSEDVEVDVFKKLETKKV

Query:  LSNVEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSVLASLALPILVAALAAIVLIPDDSVALVVLQAVVGGGLALGAAGLFVGSV
        LSNVEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPS LASLALPILVAALAAIVLIPDDS ALV LQAVVGGGLALGAAGL VGSV
Subjt:  LSNVEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSVLASLALPILVAALAAIVLIPDDSVALVVLQAVVGGGLALGAAGLFVGSV

Query:  VLGGLQEAD
        VLGGLQEAD
Subjt:  VLGGLQEAD

XP_023000865.1 uncharacterized protein LOC111495182 [Cucurbita maxima]1.1e-6981.43Show/hide
Query:  MAVT--SPSSSAATAAPSHLPNLLRPNHFSSSRFRPSSSNHGR-------RPVTIVSMASQKKVNKYDGAWEKKWFGAGIFYESSEDVEVDVFKKLETKK
        MAVT  SPSSSAA       P+LL PN FS SRFR  +SNH R        P+ I++MASQKKVNKYD  WEKKWFGAGIFYESSEDVEVDVFKKLETKK
Subjt:  MAVT--SPSSSAATAAPSHLPNLLRPNHFSSSRFRPSSSNHGR-------RPVTIVSMASQKKVNKYDGAWEKKWFGAGIFYESSEDVEVDVFKKLETKK

Query:  VLSNVEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSVLASLALPILVAALAAIVLIPDDSVALVVLQAVVGGGLALGAAGLFVGS
        VLSNVEKAGLLSKAEELGFTLSSIEKLGVFSKAEE GLLSLLEKVAS+SPS LASLALPILVAALAAIVLIPDDS  LV LQAVV GGL LGAAGLFVGS
Subjt:  VLSNVEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSVLASLALPILVAALAAIVLIPDDSVALVVLQAVVGGGLALGAAGLFVGS

Query:  VVLGGLQEAD
        VVLGGLQEAD
Subjt:  VVLGGLQEAD

XP_038895665.1 uncharacterized protein LOC120083851 [Benincasa hispida]7.6e-7484.29Show/hide
Query:  MAVT--SPSSSAATAAPSHLPNLLRPNHFSSSRFRPSSSNHGR-------RPVTIVSMASQKKVNKYDGAWEKKWFGAGIFYESSEDVEVDVFKKLETKK
        MAVT  SPSSSAATAAPSHL      N FS  RFR   SNH R        P+TIV+MA QKKVNKYD AWEKKWFGAGIFYES+EDVEVDVFKKLETKK
Subjt:  MAVT--SPSSSAATAAPSHLPNLLRPNHFSSSRFRPSSSNHGR-------RPVTIVSMASQKKVNKYDGAWEKKWFGAGIFYESSEDVEVDVFKKLETKK

Query:  VLSNVEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSVLASLALPILVAALAAIVLIPDDSVALVVLQAVVGGGLALGAAGLFVGS
        VLSNVEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEK+ SSSPS LASLALPILV ALAAIVLIPDDSVALVVLQAVVGGGLALGAAGL VGS
Subjt:  VLSNVEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSVLASLALPILVAALAAIVLIPDDSVALVVLQAVVGGGLALGAAGLFVGS

Query:  VVLGGLQEAD
        VVLGGLQEAD
Subjt:  VVLGGLQEAD

TrEMBL top hitse value%identityAlignment
A0A0A0LW40 Uncharacterized protein6.5e-7182.44Show/hide
Query:  MAVTSPSSSAATAAPSHLPN---LLRPNHFSS-SRFRPSSSNHGRRPVTIVSMASQKKVNKYDGAWEKKWFGAGIFYESSEDVEVDVFKKLETKKVLSNV
        MAVTS S S++TA  SHL N   + R +H S+ +R RP SS+    P+TI +MA QKKVNKYD AWEKKWFGAGIFYES+EDVEVDVFKKLETKKVLSNV
Subjt:  MAVTSPSSSAATAAPSHLPN---LLRPNHFSS-SRFRPSSSNHGRRPVTIVSMASQKKVNKYDGAWEKKWFGAGIFYESSEDVEVDVFKKLETKKVLSNV

Query:  EKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSVLASLALPILVAALAAIVLIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGG
        EKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPS LASLALPILVAAL AIV+IPDDSVALV LQAVVGGGLALGAAGL VGSVVLGG
Subjt:  EKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSVLASLALPILVAALAAIVLIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGG

Query:  LQEAD
        LQEAD
Subjt:  LQEAD

A0A1S3AV09 uncharacterized protein LOC1034829617.2e-7081.19Show/hide
Query:  VTSPSSSAATAAPSHLPN---LLRPNHFSSSRFRPSSSNHGRRPVTIVSMASQKKVNKYDGAWEKKWFGAGIFYESSEDVEVDVFKKLETKKVLSNVEKA
        +TS S S++TA+ SH PN   + R +H S+   RP  S+    P+TIV+MA QKKVNKYD AWEKKWFGAGIFYES+EDVEVDVFKKLETKKVLSNVEKA
Subjt:  VTSPSSSAATAAPSHLPN---LLRPNHFSSSRFRPSSSNHGRRPVTIVSMASQKKVNKYDGAWEKKWFGAGIFYESSEDVEVDVFKKLETKKVLSNVEKA

Query:  GLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSVLASLALPILVAALAAIVLIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGGLQE
        GLLSKAEELG TLSSIEKLGVFSKAEELGLLSLLEKVASSSPS LASLALPILVAAL AIV+IPDDSVALV LQAVVGGGLALGAAGL VGSVVLGGLQE
Subjt:  GLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSVLASLALPILVAALAAIVLIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGGLQE

Query:  AD
        AD
Subjt:  AD

A0A5D3BL78 DUF1118 domain-containing protein7.2e-7081.19Show/hide
Query:  VTSPSSSAATAAPSHLPN---LLRPNHFSSSRFRPSSSNHGRRPVTIVSMASQKKVNKYDGAWEKKWFGAGIFYESSEDVEVDVFKKLETKKVLSNVEKA
        +TS S S++TA+ SH PN   + R +H S+   RP  S+    P+TIV+MA QKKVNKYD AWEKKWFGAGIFYES+EDVEVDVFKKLETKKVLSNVEKA
Subjt:  VTSPSSSAATAAPSHLPN---LLRPNHFSSSRFRPSSSNHGRRPVTIVSMASQKKVNKYDGAWEKKWFGAGIFYESSEDVEVDVFKKLETKKVLSNVEKA

Query:  GLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSVLASLALPILVAALAAIVLIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGGLQE
        GLLSKAEELG TLSSIEKLGVFSKAEELGLLSLLEKVASSSPS LASLALPILVAAL AIV+IPDDSVALV LQAVVGGGLALGAAGL VGSVVLGGLQE
Subjt:  GLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSVLASLALPILVAALAAIVLIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGGLQE

Query:  AD
        AD
Subjt:  AD

A0A6J1C8G3 uncharacterized protein LOC1110089412.6e-7282.78Show/hide
Query:  MAVTSPSS--SAATAAPSHLPNLLRPNHFSSSRFRPSSSNHGR------RPVTIVSMASQKKVNKYDGAWEKKWFGAGIFYESSEDVEVDVFKKLETKKV
        MAV SPSS  +AATAAPSHL     PN F SS+FR  + NH R        +TIV+MA +KKVNKYD  W+KKWFGAGIFYESSEDVEVDVFKKLETKKV
Subjt:  MAVTSPSS--SAATAAPSHLPNLLRPNHFSSSRFRPSSSNHGR------RPVTIVSMASQKKVNKYDGAWEKKWFGAGIFYESSEDVEVDVFKKLETKKV

Query:  LSNVEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSVLASLALPILVAALAAIVLIPDDSVALVVLQAVVGGGLALGAAGLFVGSV
        LSNVEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPS LASLALPILVAALAAIVLIPDDS ALV LQAVVGGGLALGAAGL VGSV
Subjt:  LSNVEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSVLASLALPILVAALAAIVLIPDDSVALVVLQAVVGGGLALGAAGLFVGSV

Query:  VLGGLQEAD
        VLGGLQEAD
Subjt:  VLGGLQEAD

A0A6J1KJH4 uncharacterized protein LOC1114951825.5e-7081.43Show/hide
Query:  MAVT--SPSSSAATAAPSHLPNLLRPNHFSSSRFRPSSSNHGR-------RPVTIVSMASQKKVNKYDGAWEKKWFGAGIFYESSEDVEVDVFKKLETKK
        MAVT  SPSSSAA       P+LL PN FS SRFR  +SNH R        P+ I++MASQKKVNKYD  WEKKWFGAGIFYESSEDVEVDVFKKLETKK
Subjt:  MAVT--SPSSSAATAAPSHLPNLLRPNHFSSSRFRPSSSNHGR-------RPVTIVSMASQKKVNKYDGAWEKKWFGAGIFYESSEDVEVDVFKKLETKK

Query:  VLSNVEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSVLASLALPILVAALAAIVLIPDDSVALVVLQAVVGGGLALGAAGLFVGS
        VLSNVEKAGLLSKAEELGFTLSSIEKLGVFSKAEE GLLSLLEKVAS+SPS LASLALPILVAALAAIVLIPDDS  LV LQAVV GGL LGAAGLFVGS
Subjt:  VLSNVEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSVLASLALPILVAALAAIVLIPDDSVALVVLQAVVGGGLALGAAGLFVGS

Query:  VVLGGLQEAD
        VVLGGLQEAD
Subjt:  VVLGGLQEAD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G74730.1 Protein of unknown function (DUF1118)3.9e-5263.96Show/hide
Query:  SSSAATAAPSHLPNLLRPNHFSSSRFRPSSSNHGRRPVT--IVSMASQKKVNKYDGAWEKKWFGAGIFYESSEDVEVDVFKKLETKKVLSNVEKAGLLSK
        SS AA      L N + P      RFR S S  G+ P T  +V+MA QKKVNKYD  W+K+W+GAG+F+E SE + VDVFKKLE +KVLSNVEK+GLLSK
Subjt:  SSSAATAAPSHLPNLLRPNHFSSSRFRPSSSNHGRRPVT--IVSMASQKKVNKYDGAWEKKWFGAGIFYESSEDVEVDVFKKLETKKVLSNVEKAGLLSK

Query:  AEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSVLASLALPILVAALAAIVLIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGGLQEAD
        AE LG TLSS+EKL VFSKAE+LGLLSLLE +A +SP+VLAS ALP L AA+ A+VLIPDDS  LVV QAV+ G LAL    L VGSVVL GLQEAD
Subjt:  AEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSVLASLALPILVAALAAIVLIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGGLQEAD

AT5G08050.1 Protein of unknown function (DUF1118)4.8e-1036.72Show/hide
Query:  ESSEDVEVDVFKKLETKKVLSNVEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSVLASLALPILVAALAAIVLIPDDSVALVVLQ
        +S+   +V +  ++E  K+L+  EKAGLLS AE+ GF+LS+IE+LG+ +KAEE G+LS        +P  L +L+L +L+       ++P+D    VV+Q
Subjt:  ESSEDVEVDVFKKLETKKVLSNVEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSVLASLALPILVAALAAIVLIPDDSVALVVLQ

Query:  AVVGGGLALGAAGLFVGSVVLGGLQEAD
         +V     LG +  F  S  +  LQ++D
Subjt:  AVVGGGLALGAAGLFVGSVVLGGLQEAD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGTCACTTCACCTTCTTCCTCCGCTGCCACTGCCGCTCCTTCTCACCTTCCCAACCTCCTCCGACCCAACCATTTCTCCAGTTCCAGATTCCGCCCATCCTCTTC
TAACCATGGCCGCCGCCCAGTCACGATCGTTTCCATGGCTTCTCAGAAGAAGGTGAACAAATACGACGGCGCTTGGGAGAAGAAGTGGTTCGGAGCCGGAATCTTCTACG
AGAGCTCGGAGGATGTGGAGGTTGACGTGTTCAAGAAGCTTGAGACGAAGAAAGTCCTTAGCAATGTCGAAAAAGCAGGGCTACTATCGAAAGCAGAGGAATTAGGGTTT
ACGCTCTCTTCGATTGAGAAATTGGGGGTTTTTTCAAAGGCCGAGGAATTAGGGCTTTTGAGCTTGCTGGAGAAGGTTGCGAGCTCTTCTCCATCGGTCTTAGCCTCCCT
TGCTCTCCCCATTCTCGTGGCGGCGCTGGCGGCGATTGTTTTGATTCCCGACGACTCGGTGGCGCTTGTGGTTTTGCAGGCGGTGGTCGGCGGCGGACTGGCGCTTGGGG
CTGCCGGATTGTTTGTTGGATCGGTGGTGCTGGGTGGGTTGCAGGAAGCTGATTGA
mRNA sequenceShow/hide mRNA sequence
TGATAGAGAGGGAGAGTTGTTGGCAAATTGAGATAAAGAAATGGAAGAAATGATTGGTGGATACTAAGATTACACTCAGATCTTTCATCCCCACATGGCAAAACCTGGCT
GAACCTGAACCACAACAAAATTCAAAACTCCTTCTCAATCTCTTCCTTCAACAATGGCAGTCACTTCACCTTCTTCCTCCGCTGCCACTGCCGCTCCTTCTCACCTTCCC
AACCTCCTCCGACCCAACCATTTCTCCAGTTCCAGATTCCGCCCATCCTCTTCTAACCATGGCCGCCGCCCAGTCACGATCGTTTCCATGGCTTCTCAGAAGAAGGTGAA
CAAATACGACGGCGCTTGGGAGAAGAAGTGGTTCGGAGCCGGAATCTTCTACGAGAGCTCGGAGGATGTGGAGGTTGACGTGTTCAAGAAGCTTGAGACGAAGAAAGTCC
TTAGCAATGTCGAAAAAGCAGGGCTACTATCGAAAGCAGAGGAATTAGGGTTTACGCTCTCTTCGATTGAGAAATTGGGGGTTTTTTCAAAGGCCGAGGAATTAGGGCTT
TTGAGCTTGCTGGAGAAGGTTGCGAGCTCTTCTCCATCGGTCTTAGCCTCCCTTGCTCTCCCCATTCTCGTGGCGGCGCTGGCGGCGATTGTTTTGATTCCCGACGACTC
GGTGGCGCTTGTGGTTTTGCAGGCGGTGGTCGGCGGCGGACTGGCGCTTGGGGCTGCCGGATTGTTTGTTGGATCGGTGGTGCTGGGTGGGTTGCAGGAAGCTGATTGAA
TGAACGAATTGCTGGCTGCAACTCTACCTTTGGTTATGTAGGTGAATCGTGATTCGTGAATAACTACTTTTCAAATAGTTTTAATTTGTTTGTGTAATTAAAAACCATAA
ATATGAAGAGTTTTTTTTTTCATGGTTTCAAAATGTGGATTAGTAAATACTAACACCACCCTTTCATTTTTAGTTTTTTATTTTTGAATTTTAAGTTTATTTTCTCTTTA
TTTCTTTATGAGTGTTTTTATCTCCCTCCTACAAACATTTGAATTCTTAGCTAAATTTAAAAAATAAGAATAAATTTTTAAAAATTACATCTTTTAATTTTCAATAAAAA
AAAACTAAATGATTATCAATAGCCTAGTTGATTTTGAAACAATATTTTTATTTATTTAAATATATTCGCAAGGTGAAATGAGACTATTTTTCTTGGCATTGGTACTTCAA
ACTGGGCTTGATGCTTTTAATTAAATTGTATTTGTGCATTTGTTATTAAGTCTGTTTTTTCTTTAAAAATGATTGGTTGATTTTGTAGCTAGATTGAATATTTTGTGGTT
GACTTTGGACATTTGAGTCACATGATTTTGGGTATTTTGATAATTGAATTGGGTAGATCAATTACAAGTTTAGTGTTCTCTAGACTTTTAAAGCTGTATTTAGTAGGCTA
TTGAGCTTTCAAATGTGTCAATATACTTCTAAACTTTCAATATTTATCTTATTTGACATTTTTTTTAATCTATGGATGTAAAAATATCAAGGTCCTAT
Protein sequenceShow/hide protein sequence
MAVTSPSSSAATAAPSHLPNLLRPNHFSSSRFRPSSSNHGRRPVTIVSMASQKKVNKYDGAWEKKWFGAGIFYESSEDVEVDVFKKLETKKVLSNVEKAGLLSKAEELGF
TLSSIEKLGVFSKAEELGLLSLLEKVASSSPSVLASLALPILVAALAAIVLIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGGLQEAD