; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032115 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032115
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationchr11:25100306..25100949
RNA-Seq ExpressionLag0032115
SyntenyLag0032115
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR041373 - Reverse transcriptase, RNase H-like domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034981.1 Transposon Tf2-9 polyprotein [Cucumis melo var. makuwa]2.9e-4256.41Show/hide
Query:  LGAVLSQNQKPIAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLGRKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGLENKAAD
        +GAVL Q+++PIAY++H+L++  ++K +YERELM VV+AVQRW+ YLLGRKFLV+T+QR+LK LLEQR+IQP+YQKW++KLLGY+FE+ Y+ GLENKAAD
Subjt:  LGAVLSQNQKPIAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLGRKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGLENKAAD

Query:  ALFKLPPATHLAHMVVPTIIDVAVIQKEVLNDTHLRSIIQKL-QEDPNYTTRFSLQ
        AL + PP  HL  +  PT+ID+  I++EV  D  L+ I+ +L +E+ +   +FS+Q
Subjt:  ALFKLPPATHLAHMVVPTIIDVAVIQKEVLNDTHLRSIIQKL-QEDPNYTTRFSLQ

KAA0037388.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]1.4e-4159.03Show/hide
Query:  LGAVLSQNQKPIAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLGRKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGLENKAAD
        +GAVL QN++PIA+++HTL++  ++K +YER+LM VV+AVQRWR YLLGR+F+V+T+QR+LK LLEQRVIQP+YQKW++KLLGY+FEI Y+ GLENKAAD
Subjt:  LGAVLSQNQKPIAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLGRKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGLENKAAD

Query:  ALFKLPPATHLAHMVVPTIIDVAVIQKEVLNDTHLRSIIQKLQE
        ALF++P    L  ++VP +ID+ +I++EV  D +L+ II+++ E
Subjt:  ALFKLPPATHLAHMVVPTIIDVAVIQKEVLNDTHLRSIIQKLQE

KAA0064689.1 putative retroelement pol polyprotein [Cucumis melo var. makuwa]3.2e-4157.69Show/hide
Query:  LGAVLSQNQKPIAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLGRKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGLENKAAD
        +GAVL Q+++PIA+Y+HTLS+  +++ +YERELM VV++VQRWR YLLG KF+V+T+Q++LK LLEQRVIQP+YQKW+SKLLGY+FE+ Y+ GLENKAAD
Subjt:  LGAVLSQNQKPIAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLGRKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGLENKAAD

Query:  ALFKLPPATHLAHMVVPTIIDVAVIQKEVLNDTHLRSIIQKL-QEDPNYTTRFSLQ
        AL + PP  HL  + VP  +D+  I+KEV  D  L+ I+  L +ED N T++F+L+
Subjt:  ALFKLPPATHLAHMVVPTIIDVAVIQKEVLNDTHLRSIIQKL-QEDPNYTTRFSLQ

TYJ97524.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]1.4e-4159.03Show/hide
Query:  LGAVLSQNQKPIAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLGRKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGLENKAAD
        +GAVL QN++PIA+++HTL++  ++K +YER+LM VV+AVQRWR YLLGR+F+V+T+QR+LK LLEQRVIQP+YQKW++KLLGY+FEI Y+ GLENKAAD
Subjt:  LGAVLSQNQKPIAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLGRKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGLENKAAD

Query:  ALFKLPPATHLAHMVVPTIIDVAVIQKEVLNDTHLRSIIQKLQE
        ALF++P    L  ++VP +ID+ +I++EV  D +L+ II+++ E
Subjt:  ALFKLPPATHLAHMVVPTIIDVAVIQKEVLNDTHLRSIIQKLQE

TYK28538.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]1.8e-4158.55Show/hide
Query:  LGAVLSQNQKPIAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLGRKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGLENKAAD
        +GAVL  N++PIA+++HTL++   +K +YERELM VV+AVQ WR YLLGRKFLV+T+QR LK LLEQR++QP+YQKW++KLLGY FE+ Y+ GLENKAAD
Subjt:  LGAVLSQNQKPIAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLGRKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGLENKAAD

Query:  ALFKLPPATHLAHMVVPTIIDVAVIQKEVLNDTHLRSIIQKLQ---EDPNYT
         L ++PP  HL ++  P +ID+ VI +EV ND  L+ II KLQ   E  NY+
Subjt:  ALFKLPPATHLAHMVVPTIIDVAVIQKEVLNDTHLRSIIQKLQ---EDPNYT

TrEMBL top hitse value%identityAlignment
A0A5A7T3M0 Transposon Tf2-1 polyprotein isoform X16.9e-4259.03Show/hide
Query:  LGAVLSQNQKPIAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLGRKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGLENKAAD
        +GAVL QN++PIA+++HTL++  ++K +YER+LM VV+AVQRWR YLLGR+F+V+T+QR+LK LLEQRVIQP+YQKW++KLLGY+FEI Y+ GLENKAAD
Subjt:  LGAVLSQNQKPIAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLGRKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGLENKAAD

Query:  ALFKLPPATHLAHMVVPTIIDVAVIQKEVLNDTHLRSIIQKLQE
        ALF++P    L  ++VP +ID+ +I++EV  D +L+ II+++ E
Subjt:  ALFKLPPATHLAHMVVPTIIDVAVIQKEVLNDTHLRSIIQKLQE

A0A5A7VGU4 Putative retroelement pol polyprotein1.5e-4157.69Show/hide
Query:  LGAVLSQNQKPIAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLGRKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGLENKAAD
        +GAVL Q+++PIA+Y+HTLS+  +++ +YERELM VV++VQRWR YLLG KF+V+T+Q++LK LLEQRVIQP+YQKW+SKLLGY+FE+ Y+ GLENKAAD
Subjt:  LGAVLSQNQKPIAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLGRKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGLENKAAD

Query:  ALFKLPPATHLAHMVVPTIIDVAVIQKEVLNDTHLRSIIQKL-QEDPNYTTRFSLQ
        AL + PP  HL  + VP  +D+  I+KEV  D  L+ I+  L +ED N T++F+L+
Subjt:  ALFKLPPATHLAHMVVPTIIDVAVIQKEVLNDTHLRSIIQKL-QEDPNYTTRFSLQ

A0A5D3BEN4 Transposon Tf2-1 polyprotein isoform X16.9e-4259.03Show/hide
Query:  LGAVLSQNQKPIAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLGRKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGLENKAAD
        +GAVL QN++PIA+++HTL++  ++K +YER+LM VV+AVQRWR YLLGR+F+V+T+QR+LK LLEQRVIQP+YQKW++KLLGY+FEI Y+ GLENKAAD
Subjt:  LGAVLSQNQKPIAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLGRKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGLENKAAD

Query:  ALFKLPPATHLAHMVVPTIIDVAVIQKEVLNDTHLRSIIQKLQE
        ALF++P    L  ++VP +ID+ +I++EV  D +L+ II+++ E
Subjt:  ALFKLPPATHLAHMVVPTIIDVAVIQKEVLNDTHLRSIIQKLQE

A0A5D3BRN6 Transposon Tf2-9 polyprotein1.4e-4256.41Show/hide
Query:  LGAVLSQNQKPIAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLGRKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGLENKAAD
        +GAVL Q+++PIAY++H+L++  ++K +YERELM VV+AVQRW+ YLLGRKFLV+T+QR+LK LLEQR+IQP+YQKW++KLLGY+FE+ Y+ GLENKAAD
Subjt:  LGAVLSQNQKPIAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLGRKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGLENKAAD

Query:  ALFKLPPATHLAHMVVPTIIDVAVIQKEVLNDTHLRSIIQKL-QEDPNYTTRFSLQ
        AL + PP  HL  +  PT+ID+  I++EV  D  L+ I+ +L +E+ +   +FS+Q
Subjt:  ALFKLPPATHLAHMVVPTIIDVAVIQKEVLNDTHLRSIIQKL-QEDPNYTTRFSLQ

A0A5D3DYL5 Transposon Tf2-1 polyprotein isoform X19.0e-4258.55Show/hide
Query:  LGAVLSQNQKPIAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLGRKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGLENKAAD
        +GAVL  N++PIA+++HTL++   +K +YERELM VV+AVQ WR YLLGRKFLV+T+QR LK LLEQR++QP+YQKW++KLLGY FE+ Y+ GLENKAAD
Subjt:  LGAVLSQNQKPIAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLGRKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGLENKAAD

Query:  ALFKLPPATHLAHMVVPTIIDVAVIQKEVLNDTHLRSIIQKLQ---EDPNYT
         L ++PP  HL ++  P +ID+ VI +EV ND  L+ II KLQ   E  NY+
Subjt:  ALFKLPPATHLAHMVVPTIIDVAVIQKEVLNDTHLRSIIQKLQ---EDPNYT

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.3e-1032.28Show/hide
Query:  SLGAVLSQNQKPIAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLGRKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGLENKAA
        +LGAVLSQ+  P++Y + TL+    + S  E+EL+ +V A + +R YLLGR F + ++ + L  L   +    +  +W  KL  + F+I+Y  G EN  A
Subjt:  SLGAVLSQNQKPIAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLGRKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGLENKAA

Query:  DALFKLP-PATHLA----HMVVPTIIDVAVIQKEVLNDTHLRSIIQKLQEDPNYTTRF
        DAL ++    T+L+    H       D+  I +  LN  + + I  K   D   T  F
Subjt:  DALFKLP-PATHLA----HMVVPTIIDVAVIQKEVLNDTHLRSIIQKLQEDPNYTTRF

P10394 Retrovirus-related Pol polyprotein from transposon 4125.3e-0733.33Show/hide
Query:  GAVLSQN----QKPIAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLGRKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGLENK
        GAVL+QN    Q P+AY +   +    +KS  E+EL  +  A+  +R Y+ G+ F V+T+ R L +L        +  +   +L  Y F +EY  G +N 
Subjt:  GAVLSQN----QKPIAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLGRKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGLENK

Query:  AADALFKL
         ADAL ++
Subjt:  AADALFKL

P10401 Retrovirus-related Pol polyprotein from transposon gypsy7.6e-0631.13Show/hide
Query:  STSLGAVLSQNQKPIAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLG-RKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGLEN
        ++ +GAVLSQ  +PI   + TL    Q+ +  EREL+ +V A+ + + +L G R+  + T+ + L   +  R    + ++W S +  +  ++ Y+ G EN
Subjt:  STSLGAVLSQNQKPIAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLG-RKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGLEN

Query:  KAADAL
          ADAL
Subjt:  KAADAL

P20825 Retrovirus-related Pol polyprotein from transposon 2971.1e-1238.68Show/hide
Query:  SLGAVLSQNQKPIAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLGRKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGLENKAA
        +LGAVLSQN  PI++ + TL+    + S  E+EL+ +V A + +R YLLGR+FL+ ++ + L+ L   +    + ++W  +L  Y F+I+Y  G EN  A
Subjt:  SLGAVLSQNQKPIAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLGRKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGLENKAA

Query:  DALFKL
        DAL ++
Subjt:  DALFKL

Q99315 Transposon Ty3-G Gag-Pol polyprotein2.5e-0427.78Show/hide
Query:  LGAVLSQNQKP------IAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLGRKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGL
        +GAVL +          + Y++ +L    ++    E EL+ ++ A+  +R  L G+ F +RT+  +L  L  +       Q+W+  L  Y F +EY +G 
Subjt:  LGAVLSQNQKP------IAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLGRKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGL

Query:  ENKAADAL
        +N  ADA+
Subjt:  ENKAADAL

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGCATTGCTGCACCACTCCATCAGTACTAGTTTGGGAGCAGTATTATCGCAGAACCAGAAACCAATTGCTTATTACAATCATACCCTCTCAGTAACAGCTCAGTC
CAAATCTATTTACGAAAGAGAGCTCATGGTTGTAGTCATGGCAGTCCAAAGATGGAGGTTGTACCTGTTGGGTAGGAAGTTCCTTGTTCGAACGAATCAACGTGCTTTGA
AACATCTATTAGAGCAAAGAGTTATACAACCTGAATATCAGAAGTGGGTGTCTAAATTGTTGGGTTATGCCTTTGAGATTGAATACAGGTCGGGTTTAGAGAATAAAGCA
GCCGATGCCTTGTTCAAACTCCCACCAGCAACCCACCTTGCTCACATGGTTGTGCCGACTATAATTGATGTAGCGGTCATACAGAAAGAAGTACTGAATGATACTCACTT
GCGTTCGATAATTCAGAAACTCCAAGAAGACCCGAATTACACTACCAGATTCTCTTTACAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAGCATTGCTGCACCACTCCATCAGTACTAGTTTGGGAGCAGTATTATCGCAGAACCAGAAACCAATTGCTTATTACAATCATACCCTCTCAGTAACAGCTCAGTC
CAAATCTATTTACGAAAGAGAGCTCATGGTTGTAGTCATGGCAGTCCAAAGATGGAGGTTGTACCTGTTGGGTAGGAAGTTCCTTGTTCGAACGAATCAACGTGCTTTGA
AACATCTATTAGAGCAAAGAGTTATACAACCTGAATATCAGAAGTGGGTGTCTAAATTGTTGGGTTATGCCTTTGAGATTGAATACAGGTCGGGTTTAGAGAATAAAGCA
GCCGATGCCTTGTTCAAACTCCCACCAGCAACCCACCTTGCTCACATGGTTGTGCCGACTATAATTGATGTAGCGGTCATACAGAAAGAAGTACTGAATGATACTCACTT
GCGTTCGATAATTCAGAAACTCCAAGAAGACCCGAATTACACTACCAGATTCTCTTTACAGTAG
Protein sequenceShow/hide protein sequence
MGALLHHSISTSLGAVLSQNQKPIAYYNHTLSVTAQSKSIYERELMVVVMAVQRWRLYLLGRKFLVRTNQRALKHLLEQRVIQPEYQKWVSKLLGYAFEIEYRSGLENKA
ADALFKLPPATHLAHMVVPTIIDVAVIQKEVLNDTHLRSIIQKLQEDPNYTTRFSLQ