; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000905 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000905
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
Genome locationchr4:19298358..19301121
RNA-Seq ExpressionLag0000905
SyntenyLag0000905
Gene Ontology termsGO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0090304 - nucleic acid metabolic process (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN66445.1 hypothetical protein VITISV_003574 [Vitis vinifera]8.4e-4150.85Show/hide
Query:  GVMLPEEEDIEREIISFFSTLYAPPKSPKPFVDGIAWSPISERDKKDLDAPFSLEEIRKAVFDCDRDKAPGPDGFSMAFFQDNWNEIKGDLERVFKEFFE
        G++L   E I +EI+ +F  LYA P      V+G+ WSPISE     LD+PF+ EEI KA+F  DRDKAPGPDGF++A FQD W+ IK DL RVF E   
Subjt:  GVMLPEEEDIEREIISFFSTLYAPPKSPKPFVDGIAWSPISERDKKDLDAPFSLEEIRKAVFDCDRDKAPGPDGFSMAFFQDNWNEIKGDLERVFKEFFE

Query:  RGILDSSLSETFVCIIPKKDRTNK-----------GAFLEGRQILDQALIANEAIEDYRGSGREGVIFKIDFEKAYD
         GI++ S + +F+ ++PKK  T K            AF++GRQILD   IANE +++ R SG EGV+FKIDFEKAYD
Subjt:  RGILDSSLSETFVCIIPKKDRTNK-----------GAFLEGRQILDQALIANEAIEDYRGSGREGVIFKIDFEKAYD

RVW12452.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]9.2e-4050Show/hide
Query:  GVMLPEEEDIEREIISFFSTLYAPPKSPKPFVDGIAWSPISERDKKDLDAPFSLEEIRKAVFDCDRDKAPGPDGFSMAFFQDNWNEIKGDLERVFKEFFE
        G++L   E I  EI+ +F  LY+ P      V+GI WSPISE     LD PF+  EI  A+F  DRDKAPGPDGF++A FQD W+ IK DL RVF EF  
Subjt:  GVMLPEEEDIEREIISFFSTLYAPPKSPKPFVDGIAWSPISERDKKDLDAPFSLEEIRKAVFDCDRDKAPGPDGFSMAFFQDNWNEIKGDLERVFKEFFE

Query:  RGILDSSLSETFVCIIPKKDR----------TNKGAFLEGRQILDQALIANEAIEDYRGSGREGVIFKIDFEKAYD
         GI++ + + +F+ ++PKK R          + +GAF++GRQILD  LIANE +++ + SG EGV+FKIDFEKAYD
Subjt:  RGILDSSLSETFVCIIPKKDR----------TNKGAFLEGRQILDQALIANEAIEDYRGSGREGVIFKIDFEKAYD

RVW15097.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]1.6e-3944.83Show/hide
Query:  GVMLPEEEDIEREIISFFSTLYAPPKSPKPFVDGIAWSPISERDKKDLDAPFSLEEIRKAVFDCDRDKAPGPDGFSMAFFQDNWNEIKGDLERVFKEFFE
        G++L   E I  EI+ +F  LYA P      ++G+ WSPISE     LDAPF+ EEI KA+F  DRDKAPGPDGF++A FQD W+ IK DL RVF EF  
Subjt:  GVMLPEEEDIEREIISFFSTLYAPPKSPKPFVDGIAWSPISERDKKDLDAPFSLEEIRKAVFDCDRDKAPGPDGFSMAFFQDNWNEIKGDLERVFKEFFE

Query:  RGILDSSLSETFVCIIPKKDRTNK-------------------------------------GAFLEGRQILDQALIANEAIEDYRGSGREGVIFKIDFEK
         G+++ S + +F+ ++PKK  T K                                     GAF++GRQI+D  LIANE +++ R SG EGV+FKIDFEK
Subjt:  RGILDSSLSETFVCIIPKKDRTNK-------------------------------------GAFLEGRQILDQALIANEAIEDYRGSGREGVIFKIDFEK

Query:  AYD
        AYD
Subjt:  AYD

RVW28753.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]1.4e-4050Show/hide
Query:  GVMLPEEEDIEREIISFFSTLYAPPKSPKPFVDGIAWSPISERDKKDLDAPFSLEEIRKAVFDCDRDKAPGPDGFSMAFFQDNWNEIKGDLERVFKEFFE
        G++L     I  EI+ +F  LYA P      ++G+ WSPISE     L APF+ EEI KA+F  DRDKAPGPDGF++A FQD W+ IK DL RVF EF  
Subjt:  GVMLPEEEDIEREIISFFSTLYAPPKSPKPFVDGIAWSPISERDKKDLDAPFSLEEIRKAVFDCDRDKAPGPDGFSMAFFQDNWNEIKGDLERVFKEFFE

Query:  RGILDSSLSETFVCIIPKKDR----------TNKGAFLEGRQILDQALIANEAIEDYRGSGREGVIFKIDFEKAYD
         G+++ S + +F+ ++PKK R          + +GAF++GRQI+D  LIANE +++ R SG EGV+FKIDFEKAYD
Subjt:  RGILDSSLSETFVCIIPKKDR----------TNKGAFLEGRQILDQALIANEAIEDYRGSGREGVIFKIDFEKAYD

RVW96282.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]5.4e-4049.43Show/hide
Query:  GVMLPEEEDIEREIISFFSTLYAPPKSPKPFVDGIAWSPISERDKKDLDAPFSLEEIRKAVFDCDRDKAPGPDGFSMAFFQDNWNEIKGDLERVFKEFFE
        G++L   E I  EI+ +F  LY+ P      V+GI WSPISE     LD+PF+  EI  A+F  DRDKAPGPDGF++A FQD W+ IK DL RVF EF  
Subjt:  GVMLPEEEDIEREIISFFSTLYAPPKSPKPFVDGIAWSPISERDKKDLDAPFSLEEIRKAVFDCDRDKAPGPDGFSMAFFQDNWNEIKGDLERVFKEFFE

Query:  RGILDSSLSETFVCIIPKKDR----------TNKGAFLEGRQILDQALIANEAIEDYRGSGREGVIFKIDFEKAYD
         GI++ + + +F+ ++PKK R          + +GAF++GRQILD  LI NE +++ + SG EGV+FKIDFEKAYD
Subjt:  RGILDSSLSETFVCIIPKKDR----------TNKGAFLEGRQILDQALIANEAIEDYRGSGREGVIFKIDFEKAYD

TrEMBL top hitse value%identityAlignment
A0A438BND3 LINE-1 retrotransposable element ORF2 protein4.5e-4050Show/hide
Query:  GVMLPEEEDIEREIISFFSTLYAPPKSPKPFVDGIAWSPISERDKKDLDAPFSLEEIRKAVFDCDRDKAPGPDGFSMAFFQDNWNEIKGDLERVFKEFFE
        G++L   E I  EI+ +F  LY+ P      V+GI WSPISE     LD PF+  EI  A+F  DRDKAPGPDGF++A FQD W+ IK DL RVF EF  
Subjt:  GVMLPEEEDIEREIISFFSTLYAPPKSPKPFVDGIAWSPISERDKKDLDAPFSLEEIRKAVFDCDRDKAPGPDGFSMAFFQDNWNEIKGDLERVFKEFFE

Query:  RGILDSSLSETFVCIIPKKDR----------TNKGAFLEGRQILDQALIANEAIEDYRGSGREGVIFKIDFEKAYD
         GI++ + + +F+ ++PKK R          + +GAF++GRQILD  LIANE +++ + SG EGV+FKIDFEKAYD
Subjt:  RGILDSSLSETFVCIIPKKDR----------TNKGAFLEGRQILDQALIANEAIEDYRGSGREGVIFKIDFEKAYD

A0A438BW25 Transposon TX1 uncharacterized 149 kDa protein7.6e-4044.83Show/hide
Query:  GVMLPEEEDIEREIISFFSTLYAPPKSPKPFVDGIAWSPISERDKKDLDAPFSLEEIRKAVFDCDRDKAPGPDGFSMAFFQDNWNEIKGDLERVFKEFFE
        G++L   E I  EI+ +F  LYA P      ++G+ WSPISE     LDAPF+ EEI KA+F  DRDKAPGPDGF++A FQD W+ IK DL RVF EF  
Subjt:  GVMLPEEEDIEREIISFFSTLYAPPKSPKPFVDGIAWSPISERDKKDLDAPFSLEEIRKAVFDCDRDKAPGPDGFSMAFFQDNWNEIKGDLERVFKEFFE

Query:  RGILDSSLSETFVCIIPKKDRTNK-------------------------------------GAFLEGRQILDQALIANEAIEDYRGSGREGVIFKIDFEK
         G+++ S + +F+ ++PKK  T K                                     GAF++GRQI+D  LIANE +++ R SG EGV+FKIDFEK
Subjt:  RGILDSSLSETFVCIIPKKDRTNK-------------------------------------GAFLEGRQILDQALIANEAIEDYRGSGREGVIFKIDFEK

Query:  AYD
        AYD
Subjt:  AYD

A0A438CZW6 Transposon TX1 uncharacterized 149 kDa protein6.9e-4150Show/hide
Query:  GVMLPEEEDIEREIISFFSTLYAPPKSPKPFVDGIAWSPISERDKKDLDAPFSLEEIRKAVFDCDRDKAPGPDGFSMAFFQDNWNEIKGDLERVFKEFFE
        G++L     I  EI+ +F  LYA P      ++G+ WSPISE     L APF+ EEI KA+F  DRDKAPGPDGF++A FQD W+ IK DL RVF EF  
Subjt:  GVMLPEEEDIEREIISFFSTLYAPPKSPKPFVDGIAWSPISERDKKDLDAPFSLEEIRKAVFDCDRDKAPGPDGFSMAFFQDNWNEIKGDLERVFKEFFE

Query:  RGILDSSLSETFVCIIPKKDR----------TNKGAFLEGRQILDQALIANEAIEDYRGSGREGVIFKIDFEKAYD
         G+++ S + +F+ ++PKK R          + +GAF++GRQI+D  LIANE +++ R SG EGV+FKIDFEKAYD
Subjt:  RGILDSSLSETFVCIIPKKDR----------TNKGAFLEGRQILDQALIANEAIEDYRGSGREGVIFKIDFEKAYD

A0A438IHV3 LINE-1 retrotransposable element ORF2 protein2.6e-4049.43Show/hide
Query:  GVMLPEEEDIEREIISFFSTLYAPPKSPKPFVDGIAWSPISERDKKDLDAPFSLEEIRKAVFDCDRDKAPGPDGFSMAFFQDNWNEIKGDLERVFKEFFE
        G++L   E I  EI+ +F  LY+ P      V+GI WSPISE     LD+PF+  EI  A+F  DRDKAPGPDGF++A FQD W+ IK DL RVF EF  
Subjt:  GVMLPEEEDIEREIISFFSTLYAPPKSPKPFVDGIAWSPISERDKKDLDAPFSLEEIRKAVFDCDRDKAPGPDGFSMAFFQDNWNEIKGDLERVFKEFFE

Query:  RGILDSSLSETFVCIIPKKDR----------TNKGAFLEGRQILDQALIANEAIEDYRGSGREGVIFKIDFEKAYD
         GI++ + + +F+ ++PKK R          + +GAF++GRQILD  LI NE +++ + SG EGV+FKIDFEKAYD
Subjt:  RGILDSSLSETFVCIIPKKDR----------TNKGAFLEGRQILDQALIANEAIEDYRGSGREGVIFKIDFEKAYD

A5C7H0 Uncharacterized protein4.1e-4150.85Show/hide
Query:  GVMLPEEEDIEREIISFFSTLYAPPKSPKPFVDGIAWSPISERDKKDLDAPFSLEEIRKAVFDCDRDKAPGPDGFSMAFFQDNWNEIKGDLERVFKEFFE
        G++L   E I +EI+ +F  LYA P      V+G+ WSPISE     LD+PF+ EEI KA+F  DRDKAPGPDGF++A FQD W+ IK DL RVF E   
Subjt:  GVMLPEEEDIEREIISFFSTLYAPPKSPKPFVDGIAWSPISERDKKDLDAPFSLEEIRKAVFDCDRDKAPGPDGFSMAFFQDNWNEIKGDLERVFKEFFE

Query:  RGILDSSLSETFVCIIPKKDRTNK-----------GAFLEGRQILDQALIANEAIEDYRGSGREGVIFKIDFEKAYD
         GI++ S + +F+ ++PKK  T K            AF++GRQILD   IANE +++ R SG EGV+FKIDFEKAYD
Subjt:  RGILDSSLSETFVCIIPKKDRTNK-----------GAFLEGRQILDQALIANEAIEDYRGSGREGVIFKIDFEKAYD

SwissProt top hitse value%identityAlignment
P10394 Retrovirus-related Pol polyprotein from transposon 4125.9e-1340.57Show/hide
Query:  DGEPL---SLKSP-TEYEILHDQVQELLDKGHIQPSLSPCAVPAPLTPTKDG------SWRMCVDSRAINKITVKYRFPIPRINDLLDQLGSATIFSKID
        D EP+   + +SP ++ E +  QVQ+L+    ++PS+S    P  L P K         WR+ +D R INK  +  +FP+PRI+D+LDQLG A  FS +D
Subjt:  DGEPL---SLKSP-TEYEILHDQVQELLDKGHIQPSLSPCAVPAPLTPTKDG------SWRMCVDSRAINKITVKYRFPIPRINDLLDQLGSATIFSKID

Query:  LKSGYH
        L SG+H
Subjt:  LKSGYH

P14381 Transposon TX1 uncharacterized 149 kDa protein1.2e-1032.52Show/hide
Query:  GVMLPEEEDIEREIISFFSTLYAP----PKSPKPFVDGIAWSPISERDKKDLDAPFSLEEIRKAVFDCDRDKAPGPDGFSMAFFQDNWNEIKGDLERVFK
        G  L + E I     SF+  L++P    P + +   DG+    +SER K+ L+ P +L+E+ +A+     +K+PG DG ++ FFQ  W+ +  D  RV  
Subjt:  GVMLPEEEDIEREIISFFSTLYAP----PKSPKPFVDGIAWSPISERDKKDLDAPFSLEEIRKAVFDCDRDKAPGPDGFSMAFFQDNWNEIKGDLERVFK

Query:  EFFERGILDSSLSETFVCIIPKK
        E F++G L  S     + ++PKK
Subjt:  EFFERGILDSSLSETFVCIIPKK

P31843 RNA-directed DNA polymerase homolog1.6e-1057.45Show/hide
Query:  SWRMCVDSRAINKITVKYRFPIPRINDLLDQLGSATIFSKIDLKSGY
        S RMC+D RA+ K+T+K ++PIPR++DL D+L  AT F+K+DL+SGY
Subjt:  SWRMCVDSRAINKITVKYRFPIPRINDLLDQLGSATIFSKIDLKSGY

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein4.2e-1948.94Show/hide
Query:  EPLSLKSPTEYEILHDQVQELLDKGHIQPSLSPCAVPAPLTPTKDGSWRMCVDSRAINKITVKYRFPIPRINDLLDQLGSATIFSKIDLKSGYH
        +P  +    E EI +  VQ+LLD   I PS SPC+ P  L P KDG++R+CVD R +NK T+   FP+PRI++LL ++G+A IF+ +DL SGYH
Subjt:  EPLSLKSPTEYEILHDQVQELLDKGHIQPSLSPCAVPAPLTPTKDGSWRMCVDSRAINKITVKYRFPIPRINDLLDQLGSATIFSKIDLKSGYH

Q99315 Transposon Ty3-G Gag-Pol polyprotein4.2e-1948.94Show/hide
Query:  EPLSLKSPTEYEILHDQVQELLDKGHIQPSLSPCAVPAPLTPTKDGSWRMCVDSRAINKITVKYRFPIPRINDLLDQLGSATIFSKIDLKSGYH
        +P  +    E EI +  VQ+LLD   I PS SPC+ P  L P KDG++R+CVD R +NK T+   FP+PRI++LL ++G+A IF+ +DL SGYH
Subjt:  EPLSLKSPTEYEILHDQVQELLDKGHIQPSLSPCAVPAPLTPTKDGSWRMCVDSRAINKITVKYRFPIPRINDLLDQLGSATIFSKIDLKSGYH

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein5.8e-0842.25Show/hide
Query:  LDAPFSLEEIRKAVFDCDRDKAPGPDGFSMAFFQDNWNEIKGDLERVFKEFFERGILDSSLSETFVCIIPK
        L A  S +EI  AVF   R+KAPGPD F+  FF ++W  +K       KEFF  G L    + T + +IPK
Subjt:  LDAPFSLEEIRKAVFDCDRDKAPGPDGFSMAFFQDNWNEIKGDLERVFKEFFERGILDSSLSETFVCIIPK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGTGATGCTCCCTGAGGAGGAAGATATAGAGAGGGAGATCATTTCTTTCTTCTCAACTCTTTATGCCCCTCCGAAGTCCCCCAAGCCCTTTGTTGACGGGATCGC
CTGGAGCCCTATTTCTGAGAGGGATAAAAAGGATTTGGATGCCCCCTTCTCTTTGGAGGAAATTAGGAAGGCGGTTTTTGACTGTGATAGAGATAAAGCCCCTGGGCCGG
ATGGCTTTTCTATGGCTTTTTTTCAAGATAACTGGAATGAGATTAAGGGGGATCTAGAGCGCGTTTTTAAAGAATTCTTTGAGCGTGGCATTTTGGATAGTTCGTTATCT
GAGACCTTTGTTTGCATCATCCCCAAGAAAGACAGAACTAATAAGGGTGCCTTTCTTGAAGGTCGTCAAATTTTGGATCAAGCCCTTATTGCGAATGAGGCTATTGAGGA
CTATCGCGGGAGTGGGCGTGAGGGAGTGATTTTTAAGATTGACTTTGAGAAGGCCTATGATCCGAAAATTCATACAAATCCTGAGAAGAACCATCAAGATCATCCACAGG
AGGACTTTAGAACAAGGCAGTGGCAAGAAACAAGTTGTGGAGCAAAAAATGACCCAAGGAATAAATCCGGTACCAAGATTCTATCAAGATCATCGCACGGGCAAGGTAAT
AGATCGATCATGGAGTACACAGAAGAATTCCATCGGCTAGGAGCAAGAACAAATCTTGGAGAGAGCCAACAATATTTAGTAGCAAGATACAAAGGAGGCCTTCGAAATGA
TATTAAAGAACAACTTTCTTTACAACCAATTGAATATTTAAATGAAGCCATCTCCGCTGCAGCAGTAATTGAAGAACAAATCGCCACTCGCTTCAAAAGGACATATTCAA
GAAGGACTACGGGTGAACATACCAACAACCTCAACAAAACTGCAATTGGTGATAAAACCTTGCAACAGAACTCGATAGCAGCACTTAAAGACAATGTAGAATATCAAGAT
GATGAAGACGAAGCCGACACTGATGACAATATAGACTTTATTGAATCGGATGACGGTGAACCACTATCTTTGAAGAGTCCAACAGAGTACGAAATTCTCCATGACCAAGT
GCAAGAGCTACTCGATAAAGGACACATCCAACCAAGCTTGAGTCCTTGTGCAGTTCCAGCCCCGTTAACACCCACAAAAGATGGAAGTTGGCGCATGTGTGTGGACAGTA
GGGCAATAAACAAAATCACAGTGAAGTATAGGTTCCCAATTCCTCGGATTAATGACCTCTTGGACCAACTTGGAAGTGCCACAATTTTCTCAAAGATAGATTTGAAGAGT
GGTTATCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGTGATGCTCCCTGAGGAGGAAGATATAGAGAGGGAGATCATTTCTTTCTTCTCAACTCTTTATGCCCCTCCGAAGTCCCCCAAGCCCTTTGTTGACGGGATCGC
CTGGAGCCCTATTTCTGAGAGGGATAAAAAGGATTTGGATGCCCCCTTCTCTTTGGAGGAAATTAGGAAGGCGGTTTTTGACTGTGATAGAGATAAAGCCCCTGGGCCGG
ATGGCTTTTCTATGGCTTTTTTTCAAGATAACTGGAATGAGATTAAGGGGGATCTAGAGCGCGTTTTTAAAGAATTCTTTGAGCGTGGCATTTTGGATAGTTCGTTATCT
GAGACCTTTGTTTGCATCATCCCCAAGAAAGACAGAACTAATAAGGGTGCCTTTCTTGAAGGTCGTCAAATTTTGGATCAAGCCCTTATTGCGAATGAGGCTATTGAGGA
CTATCGCGGGAGTGGGCGTGAGGGAGTGATTTTTAAGATTGACTTTGAGAAGGCCTATGATCCGAAAATTCATACAAATCCTGAGAAGAACCATCAAGATCATCCACAGG
AGGACTTTAGAACAAGGCAGTGGCAAGAAACAAGTTGTGGAGCAAAAAATGACCCAAGGAATAAATCCGGTACCAAGATTCTATCAAGATCATCGCACGGGCAAGGTAAT
AGATCGATCATGGAGTACACAGAAGAATTCCATCGGCTAGGAGCAAGAACAAATCTTGGAGAGAGCCAACAATATTTAGTAGCAAGATACAAAGGAGGCCTTCGAAATGA
TATTAAAGAACAACTTTCTTTACAACCAATTGAATATTTAAATGAAGCCATCTCCGCTGCAGCAGTAATTGAAGAACAAATCGCCACTCGCTTCAAAAGGACATATTCAA
GAAGGACTACGGGTGAACATACCAACAACCTCAACAAAACTGCAATTGGTGATAAAACCTTGCAACAGAACTCGATAGCAGCACTTAAAGACAATGTAGAATATCAAGAT
GATGAAGACGAAGCCGACACTGATGACAATATAGACTTTATTGAATCGGATGACGGTGAACCACTATCTTTGAAGAGTCCAACAGAGTACGAAATTCTCCATGACCAAGT
GCAAGAGCTACTCGATAAAGGACACATCCAACCAAGCTTGAGTCCTTGTGCAGTTCCAGCCCCGTTAACACCCACAAAAGATGGAAGTTGGCGCATGTGTGTGGACAGTA
GGGCAATAAACAAAATCACAGTGAAGTATAGGTTCCCAATTCCTCGGATTAATGACCTCTTGGACCAACTTGGAAGTGCCACAATTTTCTCAAAGATAGATTTGAAGAGT
GGTTATCATTAA
Protein sequenceShow/hide protein sequence
MGVMLPEEEDIEREIISFFSTLYAPPKSPKPFVDGIAWSPISERDKKDLDAPFSLEEIRKAVFDCDRDKAPGPDGFSMAFFQDNWNEIKGDLERVFKEFFERGILDSSLS
ETFVCIIPKKDRTNKGAFLEGRQILDQALIANEAIEDYRGSGREGVIFKIDFEKAYDPKIHTNPEKNHQDHPQEDFRTRQWQETSCGAKNDPRNKSGTKILSRSSHGQGN
RSIMEYTEEFHRLGARTNLGESQQYLVARYKGGLRNDIKEQLSLQPIEYLNEAISAAAVIEEQIATRFKRTYSRRTTGEHTNNLNKTAIGDKTLQQNSIAALKDNVEYQD
DEDEADTDDNIDFIESDDGEPLSLKSPTEYEILHDQVQELLDKGHIQPSLSPCAVPAPLTPTKDGSWRMCVDSRAINKITVKYRFPIPRINDLLDQLGSATIFSKIDLKS
GYH