CuGenDBv2

Tan0006041 (gene) of Snake gourd v1 genome

Gene ID: Tan0006041
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Retrotran_gag_3 domain-containing protein
Genome location: LG05:33966016..33966387
RNA-Seq Expression: Tan0006041
Synteny: Tan0006041
Gene Ontology terms: GO:0015074 - DNA integration (biological process)
                     GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR029472 - Retrotransposon Copia-like, N-terminal
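
The genome location field can be checked against the sequence lengths listed further down. The sketch below is only an illustration: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location):
    """Split a 'SEQID:START..END' string and compute the span length.

    Assumes 1-based, inclusive coordinates (an assumption, not stated
    in the record itself).
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location format: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return seqid, start, end, end - start + 1


seqid, start, end, length = parse_location("LG05:33966016..33966387")
print(seqid, start, end, length)
```

Under that assumption the span is 372 bp, the same length as the CDS sequence listed in this record, which would be consistent with a single-exon gene.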


Homology
GenBank top hits (e value, % identity, alignment)
KAA0056771.1 putative mitochondrial protein [Cucumis melo var. makuwa] | 2.9e-36 | 70.34
Query:  SSSTSPGTTSPLFLLSNICNLVSLRLDSNNYVLWKFQVASILKAHSLFGHIDDSLPQPEQFMRSSTGTPTTEVSPDYIQWVSRDQALITLVNATLSPSAL
        SSS     TS LFLLSNICNLV +RLDS NYVLWK+QV+SILKAHSLFGHIDD+LP P + + SST    +E++P Y+QW+SRDQALITL+NATLS SAL
Subjt:  SSSTSPGTTSPLFLLSNICNLVSLRLDSNNYVLWKFQVASILKAHSLFGHIDDSLPQPEQFMRSSTGTPTTEVSPDYIQWVSRDQALITLVNATLSPSAL

Query:  AHVVGSESSKDLWFSLEK
         HVVGS +SK LW SLEK
Subjt:  AHVVGSESSKDLWFSLEK

KAA0061282.1 putative mitochondrial protein [Cucumis melo var. makuwa] | 5.0e-36 | 62.77
Query:  MVASSSSSTSPGTTSP--------------LFLLSNICNLVSLRLDSNNYVLWKFQVASILKAHSLFGHIDDSLPQPEQFMRSSTGTPTTEVSPDYIQWV
        M   SS S  PGT++P              LFLLSNICNLV +RLDS NYVLWK+QV+SILKAHSLFGHIDD+LP P + + SST    +E++P+Y+QW+
Subjt:  MVASSSSSTSPGTTSP--------------LFLLSNICNLVSLRLDSNNYVLWKFQVASILKAHSLFGHIDDSLPQPEQFMRSSTGTPTTEVSPDYIQWV

Query:  SRDQALITLVNATLSPSALAHVVGSESSKDLWFSLEK
        SRDQALITL+N TLS SALAHVV S SSK LW SLEK
Subjt:  SRDQALITLVNATLSPSALAHVVGSESSKDLWFSLEK

KAA0067173.1 retrotransposon protein [Cucumis melo var. makuwa] | 1.0e-36 | 70.34
Query:  SSSTSPGTTSPLFLLSNICNLVSLRLDSNNYVLWKFQVASILKAHSLFGHIDDSLPQPEQFMRSSTGTPTTEVSPDYIQWVSRDQALITLVNATLSPSAL
        SSS S   TS LFLLSNICNLV +RLDS NY+LWK+QV+SILKAHS FGHIDD+LP P + + SST    +E++P+Y+QW+SR QALITL+NATLS SAL
Subjt:  SSSTSPGTTSPLFLLSNICNLVSLRLDSNNYVLWKFQVASILKAHSLFGHIDDSLPQPEQFMRSSTGTPTTEVSPDYIQWVSRDQALITLVNATLSPSAL

Query:  AHVVGSESSKDLWFSLEK
        AHVVGS SSK LW SLEK
Subjt:  AHVVGSESSKDLWFSLEK

TYJ97594.1 putative mitochondrial protein [Cucumis melo var. makuwa] | 1.3e-36 | 70.34
Query:  SSSTSPGTTSPLFLLSNICNLVSLRLDSNNYVLWKFQVASILKAHSLFGHIDDSLPQPEQFMRSSTGTPTTEVSPDYIQWVSRDQALITLVNATLSPSAL
        SSS     TS LFLLSNICNLV +RLDS NYVLWK+QV+SILKAHSLFGHIDD+LP P + + SST    +E++P+Y+QW+SRDQALITL+NATLS SAL
Subjt:  SSSTSPGTTSPLFLLSNICNLVSLRLDSNNYVLWKFQVASILKAHSLFGHIDDSLPQPEQFMRSSTGTPTTEVSPDYIQWVSRDQALITLVNATLSPSAL

Query:  AHVVGSESSKDLWFSLEK
         HVVGS +SK LW SLEK
Subjt:  AHVVGSESSKDLWFSLEK

TYK11141.1 putative mitochondrial protein [Cucumis melo var. makuwa] | 1.6e-37 | 69.92
Query:  MVASSSSSTSPGTTSPLFLLSNICNLVSLRLDSNNYVLWKFQVASILKAHSLFGHIDDSLPQPEQFMRSSTGTPTTEVSPDYIQWVSRDQALITLVNATL
        ++ SSS+ + P  TS LFLLSNICNLV +RLDS NYVLWK+QV+SILKAHSLFGHIDD+LP P +   SST    +E++P+Y+QW+SRDQALITL+NATL
Subjt:  MVASSSSSTSPGTTSPLFLLSNICNLVSLRLDSNNYVLWKFQVASILKAHSLFGHIDDSLPQPEQFMRSSTGTPTTEVSPDYIQWVSRDQALITLVNATL

Query:  SPSALAHVVGSESSKDLWFSLEK
        S SALAHVVGS SSK LW SLEK
Subjt:  SPSALAHVVGSESSKDLWFSLEK

TrEMBL top hits (e value, % identity, alignment)
A0A5A7UTE5 Putative mitochondrial protein | 1.4e-36 | 70.34
Query:  SSSTSPGTTSPLFLLSNICNLVSLRLDSNNYVLWKFQVASILKAHSLFGHIDDSLPQPEQFMRSSTGTPTTEVSPDYIQWVSRDQALITLVNATLSPSAL
        SSS     TS LFLLSNICNLV +RLDS NYVLWK+QV+SILKAHSLFGHIDD+LP P + + SST    +E++P Y+QW+SRDQALITL+NATLS SAL
Subjt:  SSSTSPGTTSPLFLLSNICNLVSLRLDSNNYVLWKFQVASILKAHSLFGHIDDSLPQPEQFMRSSTGTPTTEVSPDYIQWVSRDQALITLVNATLSPSAL

Query:  AHVVGSESSKDLWFSLEK
         HVVGS +SK LW SLEK
Subjt:  AHVVGSESSKDLWFSLEK

A0A5A7UZE5 Putative mitochondrial protein | 2.4e-36 | 62.77
Query:  MVASSSSSTSPGTTSP--------------LFLLSNICNLVSLRLDSNNYVLWKFQVASILKAHSLFGHIDDSLPQPEQFMRSSTGTPTTEVSPDYIQWV
        M   SS S  PGT++P              LFLLSNICNLV +RLDS NYVLWK+QV+SILKAHSLFGHIDD+LP P + + SST    +E++P+Y+QW+
Subjt:  MVASSSSSTSPGTTSP--------------LFLLSNICNLVSLRLDSNNYVLWKFQVASILKAHSLFGHIDDSLPQPEQFMRSSTGTPTTEVSPDYIQWV

Query:  SRDQALITLVNATLSPSALAHVVGSESSKDLWFSLEK
        SRDQALITL+N TLS SALAHVV S SSK LW SLEK
Subjt:  SRDQALITLVNATLSPSALAHVVGSESSKDLWFSLEK

A0A5A7VGG0 Retrotransposon protein | 4.9e-37 | 70.34
Query:  SSSTSPGTTSPLFLLSNICNLVSLRLDSNNYVLWKFQVASILKAHSLFGHIDDSLPQPEQFMRSSTGTPTTEVSPDYIQWVSRDQALITLVNATLSPSAL
        SSS S   TS LFLLSNICNLV +RLDS NY+LWK+QV+SILKAHS FGHIDD+LP P + + SST    +E++P+Y+QW+SR QALITL+NATLS SAL
Subjt:  SSSTSPGTTSPLFLLSNICNLVSLRLDSNNYVLWKFQVASILKAHSLFGHIDDSLPQPEQFMRSSTGTPTTEVSPDYIQWVSRDQALITLVNATLSPSAL

Query:  AHVVGSESSKDLWFSLEK
        AHVVGS SSK LW SLEK
Subjt:  AHVVGSESSKDLWFSLEK

A0A5D3BEU0 Putative mitochondrial protein | 6.4e-37 | 70.34
Query:  SSSTSPGTTSPLFLLSNICNLVSLRLDSNNYVLWKFQVASILKAHSLFGHIDDSLPQPEQFMRSSTGTPTTEVSPDYIQWVSRDQALITLVNATLSPSAL
        SSS     TS LFLLSNICNLV +RLDS NYVLWK+QV+SILKAHSLFGHIDD+LP P + + SST    +E++P+Y+QW+SRDQALITL+NATLS SAL
Subjt:  SSSTSPGTTSPLFLLSNICNLVSLRLDSNNYVLWKFQVASILKAHSLFGHIDDSLPQPEQFMRSSTGTPTTEVSPDYIQWVSRDQALITLVNATLSPSAL

Query:  AHVVGSESSKDLWFSLEK
         HVVGS +SK LW SLEK
Subjt:  AHVVGSESSKDLWFSLEK

A0A5D3CH60 Putative mitochondrial protein | 7.5e-38 | 69.92
Query:  MVASSSSSTSPGTTSPLFLLSNICNLVSLRLDSNNYVLWKFQVASILKAHSLFGHIDDSLPQPEQFMRSSTGTPTTEVSPDYIQWVSRDQALITLVNATL
        ++ SSS+ + P  TS LFLLSNICNLV +RLDS NYVLWK+QV+SILKAHSLFGHIDD+LP P +   SST    +E++P+Y+QW+SRDQALITL+NATL
Subjt:  MVASSSSSTSPGTTSPLFLLSNICNLVSLRLDSNNYVLWKFQVASILKAHSLFGHIDDSLPQPEQFMRSSTGTPTTEVSPDYIQWVSRDQALITLVNATL

Query:  SPSALAHVVGSESSKDLWFSLEK
        S SALAHVVGS SSK LW SLEK
Subjt:  SPSALAHVVGSESSKDLWFSLEK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | 8.3e-05 | 26.36
Query:  VASSSSSTSPGT--TSPLFLLSNI-----CNLVSLRLDSNNYVLWKFQVASILKAHSLFGHIDDSLPQPEQFMRSSTGTPTTEVSPDYIQWVSRDQALIT
        +A +  S SP +   SP +L  +I      ++  L  D +NYV WK +  S L+    FG ID +LP+P+ F            SP Y  W   +  ++ 
Subjt:  VASSSSSTSPGT--TSPLFLLSNI-----CNLVSLRLDSNNYVLWKFQVASILKAHSLFGHIDDSLPQPEQFMRSSTGTPTTEVSPDYIQWVSRDQALIT

Query:  LVNATLSPSALAHVVGSESSKDLWFSLEK
         +  +++   L  V+ +E++  +W  L +
Subjt:  LVNATLSPSALAHVVGSESSKDLWFSLEK


Sequences
CDS sequence
ATGGTTGCATCTTCCTCCTCTTCAACTTCTCCTGGAACAACATCTCCTCTCTTTCTGTTATCCAACATATGCAACCTCGTGTCGCTGCGTTTGGACTCCAATAACTATGT
GCTTTGGAAGTTTCAAGTCGCCTCAATCTTGAAGGCTCACTCCCTCTTTGGACATATAGATGACTCTCTCCCACAACCGGAGCAATTTATGCGTTCATCAACAGGGACAC
CGACAACAGAGGTTAGTCCCGATTATATTCAATGGGTATCTCGTGATCAAGCTCTTATCACTTTAGTTAATGCCACTTTATCTCCATCTGCTCTTGCCCATGTCGTTGGA
TCTGAGTCTTCAAAAGATTTGTGGTTTTCTTTAGAAAAATGA
mRNA sequence
ATGGTTGCATCTTCCTCCTCTTCAACTTCTCCTGGAACAACATCTCCTCTCTTTCTGTTATCCAACATATGCAACCTCGTGTCGCTGCGTTTGGACTCCAATAACTATGT
GCTTTGGAAGTTTCAAGTCGCCTCAATCTTGAAGGCTCACTCCCTCTTTGGACATATAGATGACTCTCTCCCACAACCGGAGCAATTTATGCGTTCATCAACAGGGACAC
CGACAACAGAGGTTAGTCCCGATTATATTCAATGGGTATCTCGTGATCAAGCTCTTATCACTTTAGTTAATGCCACTTTATCTCCATCTGCTCTTGCCCATGTCGTTGGA
TCTGAGTCTTCAAAAGATTTGTGGTTTTCTTTAGAAAAATGA
Protein sequence
MVASSSSSTSPGTTSPLFLLSNICNLVSLRLDSNNYVLWKFQVASILKAHSLFGHIDDSLPQPEQFMRSSTGTPTTEVSPDYIQWVSRDQALITLVNATLSPSALAHVVG
SESSKDLWFSLEK
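
As a consistency check, the CDS above can be translated and compared with the listed protein sequence. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the helper name `translate` is hypothetical and is not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS from its first codon up to the first stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the translation
            break
        protein.append(aa)
    return "".join(protein)


# CDS and protein sequences copied from this record.
CDS = (
    "ATGGTTGCATCTTCCTCCTCTTCAACTTCTCCTGGAACAACATCTCCTCTCTTTCTGTTATCCAACATATGCAACCTCGTGTCGCTGCGTTTGGACTCCAATAACTATGT"
    "GCTTTGGAAGTTTCAAGTCGCCTCAATCTTGAAGGCTCACTCCCTCTTTGGACATATAGATGACTCTCTCCCACAACCGGAGCAATTTATGCGTTCATCAACAGGGACAC"
    "CGACAACAGAGGTTAGTCCCGATTATATTCAATGGGTATCTCGTGATCAAGCTCTTATCACTTTAGTTAATGCCACTTTATCTCCATCTGCTCTTGCCCATGTCGTTGGA"
    "TCTGAGTCTTCAAAAGATTTGTGGTTTTCTTTAGAAAAATGA"
)
PROTEIN = (
    "MVASSSSSTSPGTTSPLFLLSNICNLVSLRLDSNNYVLWKFQVASILKAHSLFGHIDDSLPQPEQFMRSSTGTPTTEVSPDYIQWVSRDQALITLVNATLSPSALAHVVG"
    "SESSKDLWFSLEK"
)

print(len(CDS), len(PROTEIN))
print(translate(CDS) == PROTEIN)
```

The CDS is 372 nt long, that is 123 codons plus a terminal TGA stop codon, which corresponds to the 123-residue protein listed above.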