CuGenDBv2

Tan0004073 (gene) of Snake gourd v1 genome

Gene ID: Tan0004073
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: LG09:55357740..55359395
RNA-Seq Expression: Tan0004073
Synteny: Tan0004073
Gene Ontology terms:
    GO:0044237 - cellular metabolic process (biological process)
    GO:0044238 - primary metabolic process (biological process)
    GO:0071704 - organic substance metabolic process (biological process)
    GO:0046872 - metal ion binding (molecular function)
InterPro domains: NA
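
The genome location above is written as sequence ID, start, and end. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not say so), the span of the gene can be computed as follows. The helper name `parse_location` is hypothetical.

```python
import re

def parse_location(location):
    """Split a string like 'LG09:55357740..55359395' into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)

seqid, start, end = parse_location("LG09:55357740..55359395")
# Assumes 1-based inclusive coordinates, hence the +1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the span is 1656 bp.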


Homology
GenBank top hits (e value, % identity, alignment)
KAA0050070.1 putative polyprotein [Cucumis melo var. makuwa] (e value: 4.2e-39; identity: 45.56%)
Query:  STKFEVERFDGKEDFGLWKIKMLAILRQQKVDYALLEAKDIPDSIITDKLKEINKVAHSLIILYLSDTIIRQMRSEDTVIKLWTKLESTYQVASISSKIF
        ST+FEV +F+G  DF LW+ K+ AIL Q KV   +L+ + +P++I   + ++++++ +S I+LYLSD ++R +    T+ +LW KLES Y   S+S+KI+
Subjt:  STKFEVERFDGKEDFGLWKIKMLAILRQQKVDYALLEAKDIPDSIITDKLKEINKVAHSLIILYLSDTIIRQMRSEDTVIKLWTKLESTYQVASISSKIF

Query:  LKSNFFSFKMDSSKSMKDNLDEFHRVVFEMENIGEIFSDENHAVVLINSLPESYNDVKSAMKFGRDTLSTEIVLNAIRVK
        +K  FF +KMD SK +++NLDEF +++ ++ NIGE  SDEN AV+L+NSLPE Y +VK+A+K+GRD+L+  I+L+A++ +
Subjt:  LKSNFFSFKMDSSKSMKDNLDEFHRVVFEMENIGEIFSDENHAVVLINSLPESYNDVKSAMKFGRDTLSTEIVLNAIRVK

KAA0050719.1 putative gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 3.8e-40; identity: 47.78%)
Query:  STKFEVERFDGKEDFGLWKIKMLAILRQQKVDYALLEAKDIPDSIITDKLKEINKVAHSLIILYLSDTIIRQMRSEDTVIKLWTKLESTYQVASISSKIF
        ST+FEV +F+G  DF LW+ K+ AIL Q KV   +L+ + +PD+I   + ++++++A+S I+LYLSD ++R +    T  +LW KLES Y   S+ +KI+
Subjt:  STKFEVERFDGKEDFGLWKIKMLAILRQQKVDYALLEAKDIPDSIITDKLKEINKVAHSLIILYLSDTIIRQMRSEDTVIKLWTKLESTYQVASISSKIF

Query:  LKSNFFSFKMDSSKSMKDNLDEFHRVVFEMENIGEIFSDENHAVVLINSLPESYNDVKSAMKFGRDTLSTEIVLNAIRVK
        +K  FF +KMD SKS+++NLDEF ++V ++ NIGE  SDEN AV+L+NSLPE+Y +VK+A+K+GRD+L+  IVL+A++ +
Subjt:  LKSNFFSFKMDSSKSMKDNLDEFHRVVFEMENIGEIFSDENHAVVLINSLPESYNDVKSAMKFGRDTLSTEIVLNAIRVK

PON64464.1 hypothetical protein TorRG33x02_273130 [Trema orientale] (e value: 1.0e-40; identity: 47.19%)
Query:  STKFEVERFDGKEDFGLWKIKMLAILRQQKVDYALLEAKDIPDSIITDKLKEINKVAHSLIILYLSDTIIRQMRSEDTVIKLWTKLESTYQVASISSKIF
        ++K E+E+FDGK DF +WK KM A+L QQK    L +A  +P+++   + +E+ + A+SL+IL L+D ++RQ+  +DT  K+W+KL+S Y   ++S+KI+
Subjt:  STKFEVERFDGKEDFGLWKIKMLAILRQQKVDYALLEAKDIPDSIITDKLKEINKVAHSLIILYLSDTIIRQMRSEDTVIKLWTKLESTYQVASISSKIF

Query:  LKSNFFSFKMDSSKSMKDNLDEFHRVVFEMENIGEIFSDENHAVVLINSLPESYNDVKSAMKFGRDTLSTEIVLNAIR
        LK   F FKMDS+KS++DNLD+F R+   + NI E  +DEN A++++NSLPESY D+KS +K+GR++LS + VL A+R
Subjt:  LKSNFFSFKMDSSKSMKDNLDEFHRVVFEMENIGEIFSDENHAVVLINSLPESYNDVKSAMKFGRDTLSTEIVLNAIR

TXG48538.1 hypothetical protein EZV62_024413 [Acer yangbiense] (e value: 2.0e-41; identity: 50.54%)
Query:  MSTKFEVERFDGKEDFGLWKIKMLAILRQQKVDYALLEAKDIPDSIITDKLKEINKVAHSLIILYLSDTIIRQMRSEDTVIKLWTKLESTYQVASISSKI
        +STKF++E+F+GK DFG+W+ KM AIL QQK   AL E KD+P S+  ++ +++ + A+S IIL L+D ++RQ+  EDT  K+W KLES Y   S+S KI
Subjt:  MSTKFEVERFDGKEDFGLWKIKMLAILRQQKVDYALLEAKDIPDSIITDKLKEINKVAHSLIILYLSDTIIRQMRSEDTVIKLWTKLESTYQVASISSKI

Query:  FLKSNFFSFKMDSSKSMKDNLDEFHRVVFEMENIG--EIFSDENHAVVLINSLPESYNDVKSAMKFGRDTLSTEIVLNAIRVKE
        +LK   F FKMD SKS++DNLD+F ++  E+ N    E  SDEN A+++ NSL  SY D+K+A+K+GRD+LS E VL A+R +E
Subjt:  FLKSNFFSFKMDSSKSMKDNLDEFHRVVFEMENIG--EIFSDENHAVVLINSLPESYNDVKSAMKFGRDTLSTEIVLNAIRVKE

TYK25306.1 putative gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 2.9e-40; identity: 47.78%)
Query:  STKFEVERFDGKEDFGLWKIKMLAILRQQKVDYALLEAKDIPDSIITDKLKEINKVAHSLIILYLSDTIIRQMRSEDTVIKLWTKLESTYQVASISSKIF
        ST+FEV +F+G  DF LW+ K+ AIL Q KV   +L+ + +PD+I   + ++++++A+S I+LYLSD ++R +    T  +LW KLES Y   S+ +KI+
Subjt:  STKFEVERFDGKEDFGLWKIKMLAILRQQKVDYALLEAKDIPDSIITDKLKEINKVAHSLIILYLSDTIIRQMRSEDTVIKLWTKLESTYQVASISSKIF

Query:  LKSNFFSFKMDSSKSMKDNLDEFHRVVFEMENIGEIFSDENHAVVLINSLPESYNDVKSAMKFGRDTLSTEIVLNAIRVK
        +K  FF +KMD SKS+++NLDEF ++V ++ NIGE  SDEN AV+L+NSLPE+Y +VK+A+K+GRD+L+  IVL+A++ +
Subjt:  LKSNFFSFKMDSSKSMKDNLDEFHRVVFEMENIGEIFSDENHAVVLINSLPESYNDVKSAMKFGRDTLSTEIVLNAIRVK

TrEMBL top hits (e value, % identity, alignment)
A0A2P5CTT8 Uncharacterized protein (e value: 4.8e-41; identity: 47.19%)
Query:  STKFEVERFDGKEDFGLWKIKMLAILRQQKVDYALLEAKDIPDSIITDKLKEINKVAHSLIILYLSDTIIRQMRSEDTVIKLWTKLESTYQVASISSKIF
        ++K E+E+FDGK DF +WK KM A+L QQK    L +A  +P+++   + +E+ + A+SL+IL L+D ++RQ+  +DT  K+W+KL+S Y   ++S+KI+
Subjt:  STKFEVERFDGKEDFGLWKIKMLAILRQQKVDYALLEAKDIPDSIITDKLKEINKVAHSLIILYLSDTIIRQMRSEDTVIKLWTKLESTYQVASISSKIF

Query:  LKSNFFSFKMDSSKSMKDNLDEFHRVVFEMENIGEIFSDENHAVVLINSLPESYNDVKSAMKFGRDTLSTEIVLNAIR
        LK   F FKMDS+KS++DNLD+F R+   + NI E  +DEN A++++NSLPESY D+KS +K+GR++LS + VL A+R
Subjt:  LKSNFFSFKMDSSKSMKDNLDEFHRVVFEMENIGEIFSDENHAVVLINSLPESYNDVKSAMKFGRDTLSTEIVLNAIR

A0A5A7U459 Putative polyprotein (e value: 2.0e-39; identity: 45.56%)
Query:  STKFEVERFDGKEDFGLWKIKMLAILRQQKVDYALLEAKDIPDSIITDKLKEINKVAHSLIILYLSDTIIRQMRSEDTVIKLWTKLESTYQVASISSKIF
        ST+FEV +F+G  DF LW+ K+ AIL Q KV   +L+ + +P++I   + ++++++ +S I+LYLSD ++R +    T+ +LW KLES Y   S+S+KI+
Subjt:  STKFEVERFDGKEDFGLWKIKMLAILRQQKVDYALLEAKDIPDSIITDKLKEINKVAHSLIILYLSDTIIRQMRSEDTVIKLWTKLESTYQVASISSKIF

Query:  LKSNFFSFKMDSSKSMKDNLDEFHRVVFEMENIGEIFSDENHAVVLINSLPESYNDVKSAMKFGRDTLSTEIVLNAIRVK
        +K  FF +KMD SK +++NLDEF +++ ++ NIGE  SDEN AV+L+NSLPE Y +VK+A+K+GRD+L+  I+L+A++ +
Subjt:  LKSNFFSFKMDSSKSMKDNLDEFHRVVFEMENIGEIFSDENHAVVLINSLPESYNDVKSAMKFGRDTLSTEIVLNAIRVK

A0A5A7UB25 Putative gag-pol polyprotein (e value: 1.8e-40; identity: 47.78%)
Query:  STKFEVERFDGKEDFGLWKIKMLAILRQQKVDYALLEAKDIPDSIITDKLKEINKVAHSLIILYLSDTIIRQMRSEDTVIKLWTKLESTYQVASISSKIF
        ST+FEV +F+G  DF LW+ K+ AIL Q KV   +L+ + +PD+I   + ++++++A+S I+LYLSD ++R +    T  +LW KLES Y   S+ +KI+
Subjt:  STKFEVERFDGKEDFGLWKIKMLAILRQQKVDYALLEAKDIPDSIITDKLKEINKVAHSLIILYLSDTIIRQMRSEDTVIKLWTKLESTYQVASISSKIF

Query:  LKSNFFSFKMDSSKSMKDNLDEFHRVVFEMENIGEIFSDENHAVVLINSLPESYNDVKSAMKFGRDTLSTEIVLNAIRVK
        +K  FF +KMD SKS+++NLDEF ++V ++ NIGE  SDEN AV+L+NSLPE+Y +VK+A+K+GRD+L+  IVL+A++ +
Subjt:  LKSNFFSFKMDSSKSMKDNLDEFHRVVFEMENIGEIFSDENHAVVLINSLPESYNDVKSAMKFGRDTLSTEIVLNAIRVK

A0A5C7GV29 Uncharacterized protein (e value: 9.8e-42; identity: 50.54%)
Query:  MSTKFEVERFDGKEDFGLWKIKMLAILRQQKVDYALLEAKDIPDSIITDKLKEINKVAHSLIILYLSDTIIRQMRSEDTVIKLWTKLESTYQVASISSKI
        +STKF++E+F+GK DFG+W+ KM AIL QQK   AL E KD+P S+  ++ +++ + A+S IIL L+D ++RQ+  EDT  K+W KLES Y   S+S KI
Subjt:  MSTKFEVERFDGKEDFGLWKIKMLAILRQQKVDYALLEAKDIPDSIITDKLKEINKVAHSLIILYLSDTIIRQMRSEDTVIKLWTKLESTYQVASISSKI

Query:  FLKSNFFSFKMDSSKSMKDNLDEFHRVVFEMENIG--EIFSDENHAVVLINSLPESYNDVKSAMKFGRDTLSTEIVLNAIRVKE
        +LK   F FKMD SKS++DNLD+F ++  E+ N    E  SDEN A+++ NSL  SY D+K+A+K+GRD+LS E VL A+R +E
Subjt:  FLKSNFFSFKMDSSKSMKDNLDEFHRVVFEMENIG--EIFSDENHAVVLINSLPESYNDVKSAMKFGRDTLSTEIVLNAIRVKE

A0A5D3DNU1 Putative gag-pol polyprotein (e value: 1.4e-40; identity: 47.78%)
Query:  STKFEVERFDGKEDFGLWKIKMLAILRQQKVDYALLEAKDIPDSIITDKLKEINKVAHSLIILYLSDTIIRQMRSEDTVIKLWTKLESTYQVASISSKIF
        ST+FEV +F+G  DF LW+ K+ AIL Q KV   +L+ + +PD+I   + ++++++A+S I+LYLSD ++R +    T  +LW KLES Y   S+ +KI+
Subjt:  STKFEVERFDGKEDFGLWKIKMLAILRQQKVDYALLEAKDIPDSIITDKLKEINKVAHSLIILYLSDTIIRQMRSEDTVIKLWTKLESTYQVASISSKIF

Query:  LKSNFFSFKMDSSKSMKDNLDEFHRVVFEMENIGEIFSDENHAVVLINSLPESYNDVKSAMKFGRDTLSTEIVLNAIRVK
        +K  FF +KMD SKS+++NLDEF ++V ++ NIGE  SDEN AV+L+NSLPE+Y +VK+A+K+GRD+L+  IVL+A++ +
Subjt:  LKSNFFSFKMDSSKSMKDNLDEFHRVVFEMENIGEIFSDENHAVVLINSLPESYNDVKSAMKFGRDTLSTEIVLNAIRVK

SwissProt top hits (e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.2e-20; identity: 29.05%)
Query:  KFEVERFDGKEDFGLWKIKMLAILRQQKVDYALLEAKDIPDSIITDKLKEINKVAHSLIILYLSDTIIRQMRSEDTVIKLWTKLESTYQVASISSKIFLK
        K+EV +F+G   F  W+ +M  +L QQ +   L      PD++  +   ++++ A S I L+LSD ++  +  EDT   +WT+LES Y   ++++K++LK
Subjt:  KFEVERFDGKEDFGLWKIKMLAILRQQKVDYALLEAKDIPDSIITDKLKEINKVAHSLIILYLSDTIIRQMRSEDTVIKLWTKLESTYQVASISSKIFLK

Query:  SNFFSFKMDSSKSMKDNLDEFHRVVFEMENIGEIFSDENHAVVLINSLPESYNDVKSAMKFGRDTLSTEIVLNAIRVKE
           ++  M    +   +L+ F+ ++ ++ N+G    +E+ A++L+NSLP SY+++ + +  G+ T+  + V +A+ + E
Subjt:  SNFFSFKMDSSKSMKDNLDEFHRVVFEMENIGEIFSDENHAVVLINSLPESYNDVKSAMKFGRDTLSTEIVLNAIRVKE

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAGTACCAAGTTCGAGGTGGAGCGGTTTGATGGGAAAGAAGATTTTGGTCTATGGAAGATCAAAATGTTGGCCATCCTTCGTCAACAAAAGGTCGATTATGCACTTTT
GGAGGCTAAGGATATTCCAGACAGCATAATAACGGACAAACTGAAAGAGATAAATAAGGTAGCTCACAGCCTTATAATCCTTTATTTGTCTGATACAATTATAAGACAAA
TGCGTAGTGAAGATACTGTTATTAAACTTTGGACAAAATTAGAATCTACCTACCAAGTCGCATCAATTTCAAGCAAAATATTTTTAAAATCTAATTTCTTTAGTTTTAAA
ATGGACTCTAGTAAAAGCATGAAGGATAACCTAGATGAATTCCATAGAGTAGTTTTTGAAATGGAAAATATTGGTGAAATCTTTTCTGATGAAAATCATGCTGTAGTTCT
GATAAATTCTTTGCCTGAATCTTATAATGATGTGAAATCTGCTATGAAGTTTGGTAGGGATACTTTGTCAACTGAAATTGTCTTAAATGCCATTAGAGTAAAAGAATAA
mRNA sequence
AGTTGCTCTACCATGGGTCTAATTGCAGACTGTAAACGTTGAAGGTTGGGCGTTTTTCAGTGGATCGTGATTTTTCCTTTCGAGAAGCAAGGTTGTATTTCTCCTCATCT
TAATTTCTTTCTAGTTAGTTTAGATTCAAGATTTAATCCCAAAGATTGTTAGACTCCCAATCTTGAATAAAGTTTGGTTGTAGTCTCTCTTTGGGCATAAATTTTGGAAT
CTTGTTAAATCCTAAATGTTAAATGTATCTTTAGATATCCTGCTGTAAATTCTCTGTATTCTCCCGTATATCTCTAATAAACTGTCTAGCAAAGGCTTAAAACACCTTTG
CAAGTGGAGTAGGTCGTTTTATGACCGAACCATTATAAATATTGGTGCTCTTTGATTGTTCTTTACTGTTTCTGCACAAGTTTAAATTCTGTCCAACCCTTAAATTGTTT
AAAAATCGCTTATAAACTTGTTGTAAACACCTCAAACTTGTCTAAAATTGGTTAGACATAGGATTTGGATCTTGACATATAATAACATTTTGGTATCAGAGCTTTTAATC
CTTGTTTTTAGGGCATTCTTTCTTGAATTAAAATTTAGTTTGTTGAATAAAATCTTTGGGTTTAAAGAATCTTTAAGTTGTTAGTATTCTTTAATTCTGATTTTTTTTAT
TTCAAGAAAAGTTTGCTGTCCAATTTTGTTTCTCATTTTTCAGGATGAGTACCAAGTTCGAGGTGGAGCGGTTTGATGGGAAAGAAGATTTTGGTCTATGGAAGATCAAA
ATGTTGGCCATCCTTCGTCAACAAAAGGTCGATTATGCACTTTTGGAGGCTAAGGATATTCCAGACAGCATAATAACGGACAAACTGAAAGAGATAAATAAGGTAGCTCA
CAGCCTTATAATCCTTTATTTGTCTGATACAATTATAAGACAAATGCGTAGTGAAGATACTGTTATTAAACTTTGGACAAAATTAGAATCTACCTACCAAGTCGCATCAA
TTTCAAGCAAAATATTTTTAAAATCTAATTTCTTTAGTTTTAAAATGGACTCTAGTAAAAGCATGAAGGATAACCTAGATGAATTCCATAGAGTAGTTTTTGAAATGGAA
AATATTGGTGAAATCTTTTCTGATGAAAATCATGCTGTAGTTCTGATAAATTCTTTGCCTGAATCTTATAATGATGTGAAATCTGCTATGAAGTTTGGTAGGGATACTTT
GTCAACTGAAATTGTCTTAAATGCCATTAGAGTAAAAGAATAAGAACTAAAAGAAACCAAGAAGTCTAATAGTGAGACTTTGTACGTTAGAGGGAGATCAGAGAATAAGA
AAAAGTATAGAAGCAAGAGTAGGGGTAGATCAAAGTCTAAGGGGGCTAGTAATAGGAGGTGTTATCATTGTAAAAAAGAAGGCCACATTAGAAGGTTTTGTCCTGACCTT
AGGAATAAAAAGTCGTCGGGTAAAGGGAAAGAGGAGTCTAGTGTAGTAAACCTTAGTGAAGAGTATAACCACTCTCTTATGGTTAGTAACACTAGAGAGGATGACACTTG
GATCCTAGATTCAGGTTGTTCTTTCCATATGACCCCTAATAGAGAATGGATAGAAGACTTCCAAACCGGTGTGGGAGGTAAAGTCATGTTAGGGAACCGACACTTTTGTA
GTGTAG
Protein sequence
MSTKFEVERFDGKEDFGLWKIKMLAILRQQKVDYALLEAKDIPDSIITDKLKEINKVAHSLIILYLSDTIIRQMRSEDTVIKLWTKLESTYQVASISSKIFLKSNFFSFK
MDSSKSMKDNLDEFHRVVFEMENIGEIFSDENHAVVLINSLPESYNDVKSAMKFGRDTLSTEIVLNAIRVKE
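
The protein sequence above can be cross-checked against the CDS sequence. The following is a minimal sketch, assuming the standard nuclear genetic code (the page does not state which translation table was used). The helper name `translate` is hypothetical, and only the first 36 bases of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 36 bases of the CDS sequence listed above.
cds_start = "ATGAGTACCAAGTTCGAGGTGGAGCGGTTTGATGGG"
print(translate(cds_start))  # MSTKFEVERFDG
```

The result matches the first 12 residues of the protein sequence. Passing the full CDS, joined into a single string, should reproduce the whole protein if the assumption about the genetic code holds.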