; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0024797 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0024797
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr10:5809478..5810125
RNA-Seq ExpressionLag0024797
SyntenyLag0024797
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0056890.1 aminoacyl-tRNA ligase [Cucumis melo var. makuwa]6.5e-4759.62Show/hide
Query:  KFPPFNGTDSPSWISKMEWYFEFHHIDDFAEMMDIIALCLTGRALNWFRCVLNRKNPPGSWDEFRHALFKRFENGDTMYDRFIALQQEGSVRNYCSQFEL
        + P F+GTD+  WI KME YFE HHIDD A MMD I LC++G+AL WFRC  N   PP SWDEFR +L+ RF +   +  +F+ L+QEGSV  YCS+FE 
Subjt:  KFPPFNGTDSPSWISKMEWYFEFHHIDDFAEMMDIIALCLTGRALNWFRCVLNRKNPPGSWDEFRHALFKRFENGDTMYDRFIALQQEGSVRNYCSQFEL

Query:  LGALLPDIPDCILEAKFMNGLKAEIRAEVRMLQPKGIEEIMRKARLVDDMNNVALN
        LGALLP++   +LEAKFMNGLK EIR +VRML PK I +IM +ARL +  NNVALN
Subjt:  LGALLPDIPDCILEAKFMNGLKAEIRAEVRMLQPKGIEEIMRKARLVDDMNNVALN

KAE8652678.1 hypothetical protein Csa_013756 [Cucumis sativus]6.9e-4961.54Show/hide
Query:  KFPPFNGTDSPSWISKMEWYFEFHHIDDFAEMMDIIALCLTGRALNWFRCVLNRKNPPGSWDEFRHALFKRFENGDTMYDRFIALQQEGSVRNYCSQFEL
        + P F+GTD+  WI KME YFE HHIDD A+MM+ I LC++G+AL WFRC  N  NPP SW EFR +L+KRF +G  +  RFI LQQEGSV  YCS+FE 
Subjt:  KFPPFNGTDSPSWISKMEWYFEFHHIDDFAEMMDIIALCLTGRALNWFRCVLNRKNPPGSWDEFRHALFKRFENGDTMYDRFIALQQEGSVRNYCSQFEL

Query:  LGALLPDIPDCILEAKFMNGLKAEIRAEVRMLQPKGIEEIMRKARLVDDMNNVALN
        LGALLP++  C++EAKFMNGLK EIR EVRML  +GI +IM +ARL +  NNVA N
Subjt:  LGALLPDIPDCILEAKFMNGLKAEIRAEVRMLQPKGIEEIMRKARLVDDMNNVALN

XP_016900762.1 PREDICTED: uncharacterized protein LOC107991016 [Cucumis melo]6.5e-4759.62Show/hide
Query:  KFPPFNGTDSPSWISKMEWYFEFHHIDDFAEMMDIIALCLTGRALNWFRCVLNRKNPPGSWDEFRHALFKRFENGDTMYDRFIALQQEGSVRNYCSQFEL
        + P F+GTD+  WI KME YFE HHIDD A MMD I LC++G+AL WFRC  N   PP SWDEFR +L+ RF +   +  +F+ L+QEGSV  YCS+FE 
Subjt:  KFPPFNGTDSPSWISKMEWYFEFHHIDDFAEMMDIIALCLTGRALNWFRCVLNRKNPPGSWDEFRHALFKRFENGDTMYDRFIALQQEGSVRNYCSQFEL

Query:  LGALLPDIPDCILEAKFMNGLKAEIRAEVRMLQPKGIEEIMRKARLVDDMNNVALN
        LGALLP++   +LEAKFMNGLK EIR +VRML PK I +IM +ARL +  NNVALN
Subjt:  LGALLPDIPDCILEAKFMNGLKAEIRAEVRMLQPKGIEEIMRKARLVDDMNNVALN

XP_028554071.1 uncharacterized protein LOC114580489 [Dendrobium catenatum]3.8e-2340.91Show/hide
Query:  RMQKFPPFNGTDSPSWISKMEWYFEFHHIDDFAEMMDIIALCLTGRALNWFRCVLNRKNPPGSWDEFRHALFKRF--ENGDTMYDRFIALQQEGSVRNYC
        R  K P F G D+  WI ++E YF  + + +  ++M  + +CL G++L WF+ +  R++   SW+EF+  L  RF   +  T Y++F+AL QEG+V  Y 
Subjt:  RMQKFPPFNGTDSPSWISKMEWYFEFHHIDDFAEMMDIIALCLTGRALNWFRCVLNRKNPPGSWDEFRHALFKRF--ENGDTMYDRFIALQQEGSVRNYC

Query:  SQFELLGALLPDIPDCILEAKFMNGLKAEIRAEVRMLQPKGIEEIMRKARLVDD
          FELL   L  IPD +LE  FM GLK  IRA +R++ P G+ +IM  A+LV+D
Subjt:  SQFELLGALLPDIPDCILEAKFMNGLKAEIRAEVRMLQPKGIEEIMRKARLVDD

XP_042752301.1 uncharacterized protein LOC111913995 isoform X1 [Lactuca sativa]2.9e-2340.91Show/hide
Query:  RMQKFPPFNGTDSPSWISKMEWYFEFHHIDDFAEMMDIIALCLTGRALNWFRCVLNRKNPPGSWDEFRHALFKRFE--NGDTMYDRFIALQQEGSVRNYC
        R  K P F G D+  WI K+E +FE   I    + +   A+CL G+AL WFR     ++P  SW+E +  L +RF+      +Y +F+A++QEGSVR+Y 
Subjt:  RMQKFPPFNGTDSPSWISKMEWYFEFHHIDDFAEMMDIIALCLTGRALNWFRCVLNRKNPPGSWDEFRHALFKRFE--NGDTMYDRFIALQQEGSVRNYC

Query:  SQFELLGALLPDIPDCILEAKFMNGLKAEIRAEVRMLQPKGIEEIMRKARLVDD
        S FE L   L DIP+ +LE  F+NGLK + R+ VR+LQP  +   M  A ++D+
Subjt:  SQFELLGALLPDIPDCILEAKFMNGLKAEIRAEVRMLQPKGIEEIMRKARLVDD

TrEMBL top hitse value%identityAlignment
A0A0A0LUB3 Retrotrans_gag domain-containing protein3.3e-4961.54Show/hide
Query:  KFPPFNGTDSPSWISKMEWYFEFHHIDDFAEMMDIIALCLTGRALNWFRCVLNRKNPPGSWDEFRHALFKRFENGDTMYDRFIALQQEGSVRNYCSQFEL
        + P F+GTD+  WI KME YFE HHIDD A+MM+ I LC++G+AL WFRC  N  NPP SW EFR +L+KRF +G  +  RFI LQQEGSV  YCS+FE 
Subjt:  KFPPFNGTDSPSWISKMEWYFEFHHIDDFAEMMDIIALCLTGRALNWFRCVLNRKNPPGSWDEFRHALFKRFENGDTMYDRFIALQQEGSVRNYCSQFEL

Query:  LGALLPDIPDCILEAKFMNGLKAEIRAEVRMLQPKGIEEIMRKARLVDDMNNVALN
        LGALLP++  C++EAKFMNGLK EIR EVRML  +GI +IM +ARL +  NNVA N
Subjt:  LGALLPDIPDCILEAKFMNGLKAEIRAEVRMLQPKGIEEIMRKARLVDDMNNVALN

A0A1S4DXQ7 uncharacterized protein LOC1079910163.1e-4759.62Show/hide
Query:  KFPPFNGTDSPSWISKMEWYFEFHHIDDFAEMMDIIALCLTGRALNWFRCVLNRKNPPGSWDEFRHALFKRFENGDTMYDRFIALQQEGSVRNYCSQFEL
        + P F+GTD+  WI KME YFE HHIDD A MMD I LC++G+AL WFRC  N   PP SWDEFR +L+ RF +   +  +F+ L+QEGSV  YCS+FE 
Subjt:  KFPPFNGTDSPSWISKMEWYFEFHHIDDFAEMMDIIALCLTGRALNWFRCVLNRKNPPGSWDEFRHALFKRFENGDTMYDRFIALQQEGSVRNYCSQFEL

Query:  LGALLPDIPDCILEAKFMNGLKAEIRAEVRMLQPKGIEEIMRKARLVDDMNNVALN
        LGALLP++   +LEAKFMNGLK EIR +VRML PK I +IM +ARL +  NNVALN
Subjt:  LGALLPDIPDCILEAKFMNGLKAEIRAEVRMLQPKGIEEIMRKARLVDDMNNVALN

A0A1S8ADV7 Aminoacyl-tRNA ligase1.1e-2338.61Show/hide
Query:  RMQKFPPFNGTDSPSWISKMEWYFEFHHIDDFAEMMDIIALCLTGRALNWFRCVLNRKNPPGSWDEFRHALFKRFE--NGDTMYDRFIALQQEGSVRNYC
        R  K P F G D+  W+ + E YF  + + +  ++M   ALCL G+AL WF+    ++ P  SW EF+  L +RF+      ++++F AL QE +V  Y 
Subjt:  RMQKFPPFNGTDSPSWISKMEWYFEFHHIDDFAEMMDIIALCLTGRALNWFRCVLNRKNPPGSWDEFRHALFKRFE--NGDTMYDRFIALQQEGSVRNYC

Query:  SQFELLGALLPDIPDCILEAKFMNGLKAEIRAEVRMLQPKGIEEIMRKARLVDDMNNV
         +FELL   L  +P+ +LE  FM GLK EIRA +R+L+P+G+ E M  A++++D N +
Subjt:  SQFELLGALLPDIPDCILEAKFMNGLKAEIRAEVRMLQPKGIEEIMRKARLVDDMNNV

A0A2U1PZV9 Ty3/gypsy retrotransposon protein2.4e-2336.27Show/hide
Query:  LIQKMQEKKEPQSGKAKR---RVKDYQRLPESIA---------------RRMQKFPPFNGTDSPSWISKMEWYFEFHHIDDFAEMMDIIALCLTGRALNW
        L  K + +   + GK  R   R +DY R  ES A                R  K P F+G DS  WI K+E YFE   I+   E +    LC+ G+AL W
Subjt:  LIQKMQEKKEPQSGKAKR---RVKDYQRLPESIA---------------RRMQKFPPFNGTDSPSWISKMEWYFEFHHIDDFAEMMDIIALCLTGRALNW

Query:  FRCVLNRKNPPGSWDEFRHALFKRFENGD--TMYDRFIALQQEGSVRNYCSQFELLGALLPDIPDCILEAKFMNGLKAEIRAEVRMLQPKGIEEIMRKAR
        FR     ++P  +W+  +  L +RF++    T+Y++F+A+ QEGS R Y S FE L   L  I + ++E  F+ GLK E+RA VR++QP+G+   M+ A 
Subjt:  FRCVLNRKNPPGSWDEFRHALFKRFENGD--TMYDRFIALQQEGSVRNYCSQFELLGALLPDIPDCILEAKFMNGLKAEIRAEVRMLQPKGIEEIMRKAR

Query:  LVDD
        ++DD
Subjt:  LVDD

A0A5D3BJD9 Aminoacyl-tRNA ligase3.1e-4759.62Show/hide
Query:  KFPPFNGTDSPSWISKMEWYFEFHHIDDFAEMMDIIALCLTGRALNWFRCVLNRKNPPGSWDEFRHALFKRFENGDTMYDRFIALQQEGSVRNYCSQFEL
        + P F+GTD+  WI KME YFE HHIDD A MMD I LC++G+AL WFRC  N   PP SWDEFR +L+ RF +   +  +F+ L+QEGSV  YCS+FE 
Subjt:  KFPPFNGTDSPSWISKMEWYFEFHHIDDFAEMMDIIALCLTGRALNWFRCVLNRKNPPGSWDEFRHALFKRFENGDTMYDRFIALQQEGSVRNYCSQFEL

Query:  LGALLPDIPDCILEAKFMNGLKAEIRAEVRMLQPKGIEEIMRKARLVDDMNNVALN
        LGALLP++   +LEAKFMNGLK EIR +VRML PK I +IM +ARL +  NNVALN
Subjt:  LGALLPDIPDCILEAKFMNGLKAEIRAEVRMLQPKGIEEIMRKARLVDDMNNVALN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G29750.1 Eukaryotic aspartyl protease family protein1.9e-0437.31Show/hide
Query:  FIALQQEGSVRNYCSQFELLGALLPDIPDCILEAKFMNGLKAEIRAEVRMLQPKGIEEIM-RKARLV
        +  +QQEGSVR+Y  +FE L      +P    E  F+ GL+  ++  VR L+P GI     R+A L+
Subjt:  FIALQQEGSVRNYCSQFELLGALLPDIPDCILEAKFMNGLKAEIRAEVRMLQPKGIEEIM-RKARLV

AT3G42723.1 aminoacyl-tRNA ligases;ATP binding;nucleotide binding5.5e-1233.33Show/hide
Query:  ISKMEWYFEFHHIDDFAEMMDIIALCLTGRALNWFRCVLNRKNPPGSWDEFRHALFKRFENGDTM----YDRFIALQQEGSVRNYCSQFELLGALLPDIP
        +S  E YF  ++I +  E + I+   L G    W +  L +KN P SW EF+  + +  E   TM       +  +QQEGSVR Y  +FE L      +P
Subjt:  ISKMEWYFEFHHIDDFAEMMDIIALCLTGRALNWFRCVLNRKNPPGSWDEFRHALFKRFENGDTM----YDRFIALQQEGSVRNYCSQFELLGALLPDIP

Query:  DCILEAKFMNGLKAEIRAEVRMLQPKGIEEIMRKARLVDDMNNV
           LEA F+ GL+  ++  VR L+P GI ++M  A+ +++ N++
Subjt:  DCILEAKFMNGLKAEIRAEVRMLQPKGIEEIMRKARLVDDMNNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAAAAACATCACGGCTCTGTTTGAAGAAATGGCCCAGCTTCGACTGCGCCAGGAAGCCACTGAGACATTGATCCAGAAGATGCAAGAGAAGAAGGAACCACAGTC
CGGTAAAGCGAAACGAAGAGTAAAAGATTACCAACGCCTACCTGAATCCATAGCCCGCCGAATGCAGAAGTTTCCTCCGTTCAATGGAACCGATTCGCCCTCGTGGATCT
CGAAAATGGAATGGTACTTTGAGTTTCACCACATAGACGACTTTGCCGAGATGATGGACATTATCGCACTCTGTTTGACTGGCCGAGCCCTAAACTGGTTCCGATGCGTT
CTAAATAGGAAAAACCCGCCGGGATCGTGGGATGAGTTTCGGCATGCTTTGTTTAAGCGATTTGAAAACGGCGACACCATGTATGACAGGTTCATTGCCTTACAGCAAGA
GGGGAGCGTGAGGAACTATTGCAGCCAGTTCGAGTTACTAGGGGCGCTCCTTCCGGACATTCCTGACTGCATTCTTGAAGCAAAGTTTATGAACGGCTTAAAGGCGGAGA
TTCGAGCGGAGGTTCGGATGTTACAACCAAAAGGTATAGAGGAAATCATGAGAAAGGCGAGGTTGGTGGACGACATGAACAACGTTGCGCTGAACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGAAAAACATCACGGCTCTGTTTGAAGAAATGGCCCAGCTTCGACTGCGCCAGGAAGCCACTGAGACATTGATCCAGAAGATGCAAGAGAAGAAGGAACCACAGTC
CGGTAAAGCGAAACGAAGAGTAAAAGATTACCAACGCCTACCTGAATCCATAGCCCGCCGAATGCAGAAGTTTCCTCCGTTCAATGGAACCGATTCGCCCTCGTGGATCT
CGAAAATGGAATGGTACTTTGAGTTTCACCACATAGACGACTTTGCCGAGATGATGGACATTATCGCACTCTGTTTGACTGGCCGAGCCCTAAACTGGTTCCGATGCGTT
CTAAATAGGAAAAACCCGCCGGGATCGTGGGATGAGTTTCGGCATGCTTTGTTTAAGCGATTTGAAAACGGCGACACCATGTATGACAGGTTCATTGCCTTACAGCAAGA
GGGGAGCGTGAGGAACTATTGCAGCCAGTTCGAGTTACTAGGGGCGCTCCTTCCGGACATTCCTGACTGCATTCTTGAAGCAAAGTTTATGAACGGCTTAAAGGCGGAGA
TTCGAGCGGAGGTTCGGATGTTACAACCAAAAGGTATAGAGGAAATCATGAGAAAGGCGAGGTTGGTGGACGACATGAACAACGTTGCGCTGAACTAG
Protein sequenceShow/hide protein sequence
MEKNITALFEEMAQLRLRQEATETLIQKMQEKKEPQSGKAKRRVKDYQRLPESIARRMQKFPPFNGTDSPSWISKMEWYFEFHHIDDFAEMMDIIALCLTGRALNWFRCV
LNRKNPPGSWDEFRHALFKRFENGDTMYDRFIALQQEGSVRNYCSQFELLGALLPDIPDCILEAKFMNGLKAEIRAEVRMLQPKGIEEIMRKARLVDDMNNVALN