; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039487 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039487
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr2:44747682..44748101
RNA-Seq ExpressionLag0039487
SyntenyLag0039487
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049700.1 T4.5 [Cucumis melo var. makuwa]5.9e-2564.52Show/hide
Query:  KDLHSPMFLLSNICNLISIRLDSSNYVLWKFQFTSMLRAHKLFGFVDGSHPAPPKTLPVFSTESSSTDSATSTVNPSYGDWLAKDQALMTLIN
        KD  SP+FLLSNICNLIS+RLDS+N+VLWKFQ T++L+AHKL+GF+DG++P PP+T       SSST +     NPSY DW+AKDQALMT+IN
Subjt:  KDLHSPMFLLSNICNLISIRLDSSNYVLWKFQFTSMLRAHKLFGFVDGSHPAPPKTLPVFSTESSSTDSATSTVNPSYGDWLAKDQALMTLIN

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]5.9e-2564.52Show/hide
Query:  KDLHSPMFLLSNICNLISIRLDSSNYVLWKFQFTSMLRAHKLFGFVDGSHPAPPKTLPVFSTESSSTDSATSTVNPSYGDWLAKDQALMTLIN
        KD  SP+FLLSNICNLIS+RLDS+N+VLWKFQ T++L+AHKL+GF+DG++P PP+T       SSST +     NPSY DW+AKDQALMT+IN
Subjt:  KDLHSPMFLLSNICNLISIRLDSSNYVLWKFQFTSMLRAHKLFGFVDGSHPAPPKTLPVFSTESSSTDSATSTVNPSYGDWLAKDQALMTLIN

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]5.9e-2564.52Show/hide
Query:  KDLHSPMFLLSNICNLISIRLDSSNYVLWKFQFTSMLRAHKLFGFVDGSHPAPPKTLPVFSTESSSTDSATSTVNPSYGDWLAKDQALMTLIN
        KD  SP+FLLSNICNLIS+RLDS+N+VLWKFQ T++L+AHKL+GF+DG++P PP+T       SSST +     NPSY DW+AKDQALMT+IN
Subjt:  KDLHSPMFLLSNICNLISIRLDSSNYVLWKFQFTSMLRAHKLFGFVDGSHPAPPKTLPVFSTESSSTDSATSTVNPSYGDWLAKDQALMTLIN

XP_016900446.1 PREDICTED: uncharacterized protein LOC103490319 isoform X1 [Cucumis melo]5.9e-2564.52Show/hide
Query:  KDLHSPMFLLSNICNLISIRLDSSNYVLWKFQFTSMLRAHKLFGFVDGSHPAPPKTLPVFSTESSSTDSATSTVNPSYGDWLAKDQALMTLIN
        KD  SP+FLLSNICNLIS+RLDS+N+VLWKFQ T++L+AHKL+GF+DG++P PP+T       SSST +     NPSY DW+AKDQALMT+IN
Subjt:  KDLHSPMFLLSNICNLISIRLDSSNYVLWKFQFTSMLRAHKLFGFVDGSHPAPPKTLPVFSTESSSTDSATSTVNPSYGDWLAKDQALMTLIN

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia]2.2e-2463.16Show/hide
Query:  KDLHSPMFLLSNICNLISIRLDSSNYVLWKFQFTSMLRAHKLFGFVDGSHPAPPKTLPVFSTESSSTDSATS--TVNPSYGDWLAKDQALMTLIN
        KDLHSP+FLLSNICNL+SIRLDS++++LWKFQ T++L+AHKLFGF+DGS  AP + L   S   S   + TS   +NP + DW+AKDQALMTLIN
Subjt:  KDLHSPMFLLSNICNLISIRLDSSNYVLWKFQFTSMLRAHKLFGFVDGSHPAPPKTLPVFSTESSSTDSATS--TVNPSYGDWLAKDQALMTLIN

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X22.8e-2564.52Show/hide
Query:  KDLHSPMFLLSNICNLISIRLDSSNYVLWKFQFTSMLRAHKLFGFVDGSHPAPPKTLPVFSTESSSTDSATSTVNPSYGDWLAKDQALMTLIN
        KD  SP+FLLSNICNLIS+RLDS+N+VLWKFQ T++L+AHKL+GF+DG++P PP+T       SSST +     NPSY DW+AKDQALMT+IN
Subjt:  KDLHSPMFLLSNICNLISIRLDSSNYVLWKFQFTSMLRAHKLFGFVDGSHPAPPKTLPVFSTESSSTDSATSTVNPSYGDWLAKDQALMTLIN

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X32.8e-2564.52Show/hide
Query:  KDLHSPMFLLSNICNLISIRLDSSNYVLWKFQFTSMLRAHKLFGFVDGSHPAPPKTLPVFSTESSSTDSATSTVNPSYGDWLAKDQALMTLIN
        KD  SP+FLLSNICNLIS+RLDS+N+VLWKFQ T++L+AHKL+GF+DG++P PP+T       SSST +     NPSY DW+AKDQALMT+IN
Subjt:  KDLHSPMFLLSNICNLISIRLDSSNYVLWKFQFTSMLRAHKLFGFVDGSHPAPPKTLPVFSTESSSTDSATSTVNPSYGDWLAKDQALMTLIN

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X12.8e-2564.52Show/hide
Query:  KDLHSPMFLLSNICNLISIRLDSSNYVLWKFQFTSMLRAHKLFGFVDGSHPAPPKTLPVFSTESSSTDSATSTVNPSYGDWLAKDQALMTLIN
        KD  SP+FLLSNICNLIS+RLDS+N+VLWKFQ T++L+AHKL+GF+DG++P PP+T       SSST +     NPSY DW+AKDQALMT+IN
Subjt:  KDLHSPMFLLSNICNLISIRLDSSNYVLWKFQFTSMLRAHKLFGFVDGSHPAPPKTLPVFSTESSSTDSATSTVNPSYGDWLAKDQALMTLIN

A0A5D3CLI6 T4.52.8e-2564.52Show/hide
Query:  KDLHSPMFLLSNICNLISIRLDSSNYVLWKFQFTSMLRAHKLFGFVDGSHPAPPKTLPVFSTESSSTDSATSTVNPSYGDWLAKDQALMTLIN
        KD  SP+FLLSNICNLIS+RLDS+N+VLWKFQ T++L+AHKL+GF+DG++P PP+T       SSST +     NPSY DW+AKDQALMT+IN
Subjt:  KDLHSPMFLLSNICNLISIRLDSSNYVLWKFQFTSMLRAHKLFGFVDGSHPAPPKTLPVFSTESSSTDSATSTVNPSYGDWLAKDQALMTLIN

A0A6J1D9L6 uncharacterized protein LOC1110188921.1e-2463.16Show/hide
Query:  KDLHSPMFLLSNICNLISIRLDSSNYVLWKFQFTSMLRAHKLFGFVDGSHPAPPKTLPVFSTESSSTDSATS--TVNPSYGDWLAKDQALMTLIN
        KDLHSP+FLLSNICNL+SIRLDS++++LWKFQ T++L+AHKLFGF+DGS  AP + L   S   S   + TS   +NP + DW+AKDQALMTLIN
Subjt:  KDLHSPMFLLSNICNLISIRLDSSNYVLWKFQFTSMLRAHKLFGFVDGSHPAPPKTLPVFSTESSSTDSATS--TVNPSYGDWLAKDQALMTLIN

SwissProt top hitse value%identityAlignment
Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.4e-0535.62Show/hide
Query:  RLDSSNYVLWKFQFTSMLRAHKLFGFVDGSHPAPPKTLPVFSTESSSTDSATSTVNPSYGDWLAKDQALMTLI
        +L S+NY++W  Q  ++   ++L GF+DGS P PP T+         TD A   VNP Y  W  +D+ + + I
Subjt:  RLDSSNYVLWKFQFTSMLRAHKLFGFVDGSHPAPPKTLPVFSTESSSTDSATSTVNPSYGDWLAKDQALMTLI

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).2.5e-0537.29Show/hide
Query:  DLHSPMFLLSNICNLISIRLDSSNYVLWKFQFTSMLRAHKLFGFVDGSHPAPPKTLPVF
        D+H P     +  ++  +  D  NYV WK +F S LR  K FGF+DG+ P P    P++
Subjt:  DLHSPMFLLSNICNLISIRLDSSNYVLWKFQFTSMLRAHKLFGFVDGSHPAPPKTLPVF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTTTGAGGTTTCTCTCAAAGATCTTCATTCTCCGATGTTTCTTCTGTCAAATATTTGCAATTTGATCTCGATACGGCTGGATTCTTCAAACTATGTCTTGTG
GAAGTTTCAATTCACTTCTATGTTGCGTGCACATAAACTTTTTGGTTTTGTTGATGGATCGCATCCTGCTCCTCCAAAAACTCTTCCGGTTTTTTCTACTGAGAGTTCTT
CAACCGATTCTGCTACATCTACTGTTAATCCATCGTATGGAGACTGGTTGGCCAAAGATCAAGCGCTTATGACCTTAATTAATCAATGCTACACTGTCTCCAGAAGCCTT
GGCATATATCGTTGGCTGCAATTCTTCTCAAGAGGAATGGGAAGCCCTAGAGAAACATTACTCTTCTTCTTCGAGGACAAATATTGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTTTTGAGGTTTCTCTCAAAGATCTTCATTCTCCGATGTTTCTTCTGTCAAATATTTGCAATTTGATCTCGATACGGCTGGATTCTTCAAACTATGTCTTGTG
GAAGTTTCAATTCACTTCTATGTTGCGTGCACATAAACTTTTTGGTTTTGTTGATGGATCGCATCCTGCTCCTCCAAAAACTCTTCCGGTTTTTTCTACTGAGAGTTCTT
CAACCGATTCTGCTACATCTACTGTTAATCCATCGTATGGAGACTGGTTGGCCAAAGATCAAGCGCTTATGACCTTAATTAATCAATGCTACACTGTCTCCAGAAGCCTT
GGCATATATCGTTGGCTGCAATTCTTCTCAAGAGGAATGGGAAGCCCTAGAGAAACATTACTCTTCTTCTTCGAGGACAAATATTGTTAA
Protein sequenceShow/hide protein sequence
MASFEVSLKDLHSPMFLLSNICNLISIRLDSSNYVLWKFQFTSMLRAHKLFGFVDGSHPAPPKTLPVFSTESSSTDSATSTVNPSYGDWLAKDQALMTLINQCYTVSRSL
GIYRWLQFFSRGMGSPRETLLFFFEDKYC