CuGenDBv2

Tan0008807 (gene) of Snake gourd v1 genome

Gene ID: Tan0008807
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Retrotrans_gag domain-containing protein
Genome location: LG04:31382185..31385426
RNA-Seq Expression: Tan0008807
Synteny: Tan0008807
Gene Ontology terms:
GO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0016787 - hydrolase activity (molecular function)
InterPro domains: NA
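
The genome location above is written as `seqid:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the locus can be parsed and its span computed as follows. `parse_location` and `span_length` are hypothetical helper names used for illustration only, not part of any CuGenDB tooling.

```python
import re


def parse_location(loc):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG04:31382185..31385426")
print(seqid, start, end, span_length(start, end))
# prints: LG04 31382185 31385426 3242
```

Under the inclusive-coordinate assumption, the locus on LG04 spans 3242 bp.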


Homology
GenBank top hits (e value, % identity, alignment)
KAA0060199.1 uncharacterized protein E6C27_scaffold386G00120 [Cucumis melo var. makuwa]; e value: 6.3e-10; % identity: 62.9
Query:  LSTLSVLRDNNAVVEIELPVPDNLPYQQV----------LLRGNNVCWLHAVFRAKLAAGSG
        LSTLSVLRDN+AVVEIELPVPD LP              ++RG++VCWLHAVFRAK A G G
Subjt:  LSTLSVLRDNNAVVEIELPVPDNLPYQQV----------LLRGNNVCWLHAVFRAKLAAGSG

TYK00845.1 zf-CCHC domain-containing protein [Cucumis melo var. makuwa]; e value: 2.7e-08; % identity: 56.45
Query:  LSTLSVLRDNNAVVEIELPVPDNLPYQQV----------LLRGNNVCWLHAVFRAKLAAGSG
        LSTLSVLRDN+AV EIELPVPD L               ++RG++VCWLHA+FRAK+  G G
Subjt:  LSTLSVLRDNNAVVEIELPVPDNLPYQQV----------LLRGNNVCWLHAVFRAKLAAGSG

TYK00848.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 1.2e-08; % identity: 56.92
Query:  LSTLSVLRDNNAVVEIELPVPDNLPYQQV----------LLRGNNVCWLHAVFRAKLAAGSGEEL
        LSTLSVL DN+AVVEIEL VPD LP              ++RG++VCWLHA+FRAK A G G +L
Subjt:  LSTLSVLRDNNAVVEIELPVPDNLPYQQV----------LLRGNNVCWLHAVFRAKLAAGSGEEL

TYK06355.1 uncharacterized protein E5676_scaffold163G00340 [Cucumis melo var. makuwa]; e value: 1.6e-08; % identity: 58.73
Query:  LSTLSVLRDNNAVVEIELPVPDNLPYQQV-----------LLRGNNVCWLHAVFRAKLAAGSG
        LSTLSVLRDN+ VVEIELPVPD LP  +V           ++R +NVCWL AVFRAK   G G
Subjt:  LSTLSVLRDNNAVVEIELPVPDNLPYQQV-----------LLRGNNVCWLHAVFRAKLAAGSG

TYK11693.1 gag protease polyprotein [Cucumis melo var. makuwa]; e value: 5.9e-08; % identity: 58.06
Query:  LLSTLSVLRDNNAVVEIELPVPDNLPYQQV----------LLRGNNVCWLHAVFRAKLAAGS
        LLSTLSVL DN+AVVEIELPVPD LP              ++RG++VCWLH VFRAK   G+
Subjt:  LLSTLSVLRDNNAVVEIELPVPDNLPYQQV----------LLRGNNVCWLHAVFRAKLAAGS

TrEMBL top hits (e value, % identity, alignment)
A0A5D3BAD1 Retrotrans_gag domain-containing protein; e value: 3.1e-10; % identity: 62.9
Query:  LSTLSVLRDNNAVVEIELPVPDNLPYQQV----------LLRGNNVCWLHAVFRAKLAAGSG
        LSTLSVLRDN+AVVEIELPVPD LP              ++RG++VCWLHAVFRAK A G G
Subjt:  LSTLSVLRDNNAVVEIELPVPDNLPYQQV----------LLRGNNVCWLHAVFRAKLAAGSG

A0A5D3BNX1 Zf-CCHC domain-containing protein; e value: 1.3e-08; % identity: 56.45
Query:  LSTLSVLRDNNAVVEIELPVPDNLPYQQV----------LLRGNNVCWLHAVFRAKLAAGSG
        LSTLSVLRDN+AV EIELPVPD L               ++RG++VCWLHA+FRAK+  G G
Subjt:  LSTLSVLRDNNAVVEIELPVPDNLPYQQV----------LLRGNNVCWLHAVFRAKLAAGSG

A0A5D3BP54 Gag/pol protein; e value: 5.8e-09; % identity: 56.92
Query:  LSTLSVLRDNNAVVEIELPVPDNLPYQQV----------LLRGNNVCWLHAVFRAKLAAGSGEEL
        LSTLSVL DN+AVVEIEL VPD LP              ++RG++VCWLHA+FRAK A G G +L
Subjt:  LSTLSVLRDNNAVVEIELPVPDNLPYQQV----------LLRGNNVCWLHAVFRAKLAAGSGEEL

A0A5D3C4P7 Retrotrans_gag domain-containing protein; e value: 7.5e-09; % identity: 58.73
Query:  LSTLSVLRDNNAVVEIELPVPDNLPYQQV-----------LLRGNNVCWLHAVFRAKLAAGSG
        LSTLSVLRDN+ VVEIELPVPD LP  +V           ++R +NVCWL AVFRAK   G G
Subjt:  LSTLSVLRDNNAVVEIELPVPDNLPYQQV-----------LLRGNNVCWLHAVFRAKLAAGSG

A0A5D3CIF4 Gag protease polyprotein; e value: 2.9e-08; % identity: 58.06
Query:  LLSTLSVLRDNNAVVEIELPVPDNLPYQQV----------LLRGNNVCWLHAVFRAKLAAGS
        LLSTLSVL DN+AVVEIELPVPD LP              ++RG++VCWLH VFRAK   G+
Subjt:  LLSTLSVLRDNNAVVEIELPVPDNLPYQQV----------LLRGNNVCWLHAVFRAKLAAGS

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGCTCGAGCCAATTCTAATCTTAAGTTACTGTCATATTCTGTTGTCGACGTTGAGTGTACTCCGTGACAACAATGCTGTCGTAGAGATCGAACTCCCGGTGCCTGATAA
CCTGCCATATCAGCAGGTATTGTTAAGAGGTAACAATGTCTGTTGGCTTCATGCCGTCTTCCGGGCTAAGTTAGCAGCTGGTTCGGGAGAGGAACTCCATGTGCAAATTA
TGGCAAGAATCATCGAGGGCAATGTCTTGTTGGTGTCGGTGTGTGTTACCAGTGTGGATAGCCAGGGCATTTCAAGAGGGATTGTCCATGGCTTAGAGCAGCCACACAGA
GGGACCAGGGAGTTGGGTCCCAAACAGTTGAGCAGCCGAGAGTTCCAGTAG
mRNA sequence
ATGCTCGAGCCAATTCTAATCTTAAGTTACTGTCATATTCTGTTGTCGACGTTGAGTGTACTCCGTGACAACAATGCTGTCGTAGAGATCGAACTCCCGGTGCCTGATAA
CCTGCCATATCAGCAGGTATTGTTAAGAGGTAACAATGTCTGTTGGCTTCATGCCGTCTTCCGGGCTAAGTTAGCAGCTGGTTCGGGAGAGGAACTCCATGTGCAAATTA
TGGCAAGAATCATCGAGGGCAATGTCTTGTTGGTGTCGGTGTGTGTTACCAGTGTGGATAGCCAGGGCATTTCAAGAGGGATTGTCCATGGCTTAGAGCAGCCACACAGA
GGGACCAGGGAGTTGGGTCCCAAACAGTTGAGCAGCCGAGAGTTCCAGTAG
Protein sequence
MLEPILILSYCHILLSTLSVLRDNNAVVEIELPVPDNLPYQQVLLRGNNVCWLHAVFRAKLAAGSGEELHVQIMARIIEGNVLLVSVCVTSVDSQGISRGIVHGLEQPHR
GTRELGPKQLSSREFQ
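
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this gene and that translation starts at the first base. `translate` is a hypothetical helper written for illustration; the two test strings are the first 30 bases and the last wrapped line of the CDS shown above.

```python
# Standard genetic code (NCBI translation table 1), built in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    # Walk the sequence codon by codon; ignore a trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS above.
print(translate("ATGCTCGAGCCAATTCTAATCTTAAGTTAC"))
# prints: MLEPILILSY

# Last wrapped line of the CDS above, ending in the TAG stop codon.
print(translate("GGGACCAGGGAGTTGGGTCCCAAACAGTTGAGCAGCCGAGAGTTCCAGTAG"))
# prints: GTRELGPKQLSSREFQ
```

Both fragments agree with the corresponding ends of the protein sequence listed above. To check the whole record, join the wrapped CDS lines into one string before calling `translate`.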