CuGenDBv2

Lag0041929 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0041929
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Gag-protease polyprotein
Genome location: chr13:31717010..31718268
RNA-Seq Expression: Lag0041929
Synteny: Lag0041929
Gene Ontology terms:
    GO:0090304 - nucleic acid metabolic process (biological process)
    GO:0016740 - transferase activity (molecular function)
    GO:0016787 - hydrolase activity (molecular function)
InterPro domains: IPR005162 - Retrotransposon gag domain
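
The genome location above is written as `seqid:start..end`. The following is a minimal Python sketch for splitting such a string and computing the span it covers. It assumes the coordinates are 1-based and inclusive, which this record does not state; the function name `parse_location` is hypothetical and not part of any database API.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return seqid, start, end


seqid, start, end = parse_location("chr13:31717010..31718268")
# Span length under the ASSUMED 1-based, inclusive convention.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 1259 bp on chr13.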


Homology
GenBank top hits    e value    % identity    Alignment
KAA0032474.1 gag-protease polyprotein [Cucumis melo var. makuwa]    5.5e-39    45.66
Query:  ESLSREAGCLRDFRKWYHHPFDGASKDPTLALLWLSSIETVFRHMNYSEDKKVYCVAFLLQDNALIWWQSIERTIDVSNGSVTWLQFRKTFLRKYYPTNA
        + LS EA  LRDFRK+    FDG+ +DPT A +WLSS+ET+FR+M Y ED+KV C  F+L D    WW++ ER +      +TW QF+++F  K++ T+ 
Subjt:  ESLSREAGCLRDFRKWYHHPFDGASKDPTLALLWLSSIETVFRHMNYSEDKKVYCVAFLLQDNALIWWQSIERTIDVSNGSVTWLQFRKTFLRKYYPTNA

Query:  RFKKQAGFVALNQGSRTVEEYETEFARLSRFAPTMVAIEADKVERFITGFRDNIQGSMAAHQPPDYATTVRVA
        R  K+  F+ L QG  TVE+Y+ EF  LSRFAP M+A EA + ++F+ G R +IQG + A +P  +A  +R+A
Subjt:  RFKKQAGFVALNQGSRTVEEYETEFARLSRFAPTMVAIEADKVERFITGFRDNIQGSMAAHQPPDYATTVRVA

KAA0035574.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]    5.5e-39    44.04
Query:  VQATVAGVIAGQQAQAPHNNESLSREAGCLRDFRKWYHHPFDGASKDPTLALLWLSSIETVFRHMNYSEDKKVYCVAFLLQDNALIWWQSIERTIDVSNG
        VQ  V  V       AP   + LS EA  LRDFRK+    FDG+ +DPT A LWLSS+ET+FR+M   ED+KV C  F+L D    WW++IER +    G
Subjt:  VQATVAGVIAGQQAQAPHNNESLSREAGCLRDFRKWYHHPFDGASKDPTLALLWLSSIETVFRHMNYSEDKKVYCVAFLLQDNALIWWQSIERTIDVSNG

Query:  SVTWLQFRKTFLRKYYPTNARFKKQAGFVALNQGSRTVEEYETEFARLSRFAPTMVAIEADKVERFITGFRDNIQGSMAAHQPPDYATTVRVA
         +TW QF+++F  K++  + R  K+  F+ L Q   TVE+Y+ EF  LSRFAP M+A EA + ++F+ G R +IQG + A +P  +A  +R+A
Subjt:  SVTWLQFRKTFLRKYYPTNARFKKQAGFVALNQGSRTVEEYETEFARLSRFAPTMVAIEADKVERFITGFRDNIQGSMAAHQPPDYATTVRVA

TYK30962.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]    5.5e-39    44.04
Query:  VQATVAGVIAGQQAQAPHNNESLSREAGCLRDFRKWYHHPFDGASKDPTLALLWLSSIETVFRHMNYSEDKKVYCVAFLLQDNALIWWQSIERTIDVSNG
        VQ  V  V       AP   + LS EA  LRDFRK+    FDG+ +DPT A LWLSS+ET+FR+M   ED+KV C  F+L D    WW++IER +    G
Subjt:  VQATVAGVIAGQQAQAPHNNESLSREAGCLRDFRKWYHHPFDGASKDPTLALLWLSSIETVFRHMNYSEDKKVYCVAFLLQDNALIWWQSIERTIDVSNG

Query:  SVTWLQFRKTFLRKYYPTNARFKKQAGFVALNQGSRTVEEYETEFARLSRFAPTMVAIEADKVERFITGFRDNIQGSMAAHQPPDYATTVRVA
         +TW QF+++F  K++  + R  K+  F+ L Q   TVE+Y+ EF  LSRFAP M+A EA + ++F+ G R +IQG + A +P  +A  +R+A
Subjt:  SVTWLQFRKTFLRKYYPTNARFKKQAGFVALNQGSRTVEEYETEFARLSRFAPTMVAIEADKVERFITGFRDNIQGSMAAHQPPDYATTVRVA

XP_022156662.1 uncharacterized protein LOC111023512 [Momordica charantia]    1.0e-48    49.5
Query:  VQATVAGVIAGQQAQAPHNNESLSREAGCLRDFRKWYHHPFDGASKDPTLALLWLSSIETVFRHMNYSEDKKVYCVAFLLQDNALIWWQSIERTIDVSNG
        +Q  V   ++ Q  Q   N  S+S EA  LRDF+K+    FDG S DP LA  WLS +ET+FR+M   E++KV C  F+L+D+A +WW+S ER IDVS G
Subjt:  VQATVAGVIAGQQAQAPHNNESLSREAGCLRDFRKWYHHPFDGASKDPTLALLWLSSIETVFRHMNYSEDKKVYCVAFLLQDNALIWWQSIERTIDVSNG

Query:  SVTWLQFRKTFLRKYYPTNARFKKQAGFVALNQGSRTVEEYETEFARLSRFAPTMVAIEADKVERFITGFRDNIQGSMAAHQPPDYATTVRVAELIDRRP
         VTWLQF++ F ++YYP    ++KQ  F+ L Q +R+VEEY+ EF +LSRFAP +V  EA K ERFI   +D  +G +A   PPDYAT +R A LID R 
Subjt:  SVTWLQFRKTFLRKYYPTNARFKKQAGFVALNQGSRTVEEYETEFARLSRFAPTMVAIEADKVERFITGFRDNIQGSMAAHQPPDYATTVRVAELIDRRP

Query:  AT
        A+
Subjt:  AT

XP_038891712.1 uncharacterized protein LOC120081110 [Benincasa hispida]    1.0e-40    47.34
Query:  QAQAPHNNESLSR---EAGCLRDFRKWYHHPFDGASKDPTLALLWLSSIETVFRHMNYSEDKKVYCVAFLLQDNALIWWQSIERTIDVSNGSVTWLQFRK
        Q Q    N+S+S    EA  LRDF+K+    F+G+ KDPT A LW+S IET+FR+M   ED+KV C  F+L D A IWWQ  ER + V    VTW QF++
Subjt:  QAQAPHNNESLSR---EAGCLRDFRKWYHHPFDGASKDPTLALLWLSSIETVFRHMNYSEDKKVYCVAFLLQDNALIWWQSIERTIDVSNGSVTWLQFRK

Query:  TFLRKYYPTNARFKKQAGFVALNQGSRTVEEYETEFARLSRFAPTMVAIEADKVERFITGFRDNIQGSMAAHQPPDYATTVRVAELID
         F  KY+  N R+ KQ  F+ L QG R+VEEY+ EF  LSRFAP +VA EA + ERFI G +++I+G + A +P  +   +R+A  +D
Subjt:  TFLRKYYPTNARFKKQAGFVALNQGSRTVEEYETEFARLSRFAPTMVAIEADKVERFITGFRDNIQGSMAAHQPPDYATTVRVAELID
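
In the alignments above, the unlabeled line between Query and Subjt is the match line: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below shows one way to count identities and positives from a query line and its match line. The function name `alignment_stats` is hypothetical, and the example uses only the first ten columns of the first GenBank alignment, so its percentage is not the hit's reported % identity.

```python
def alignment_stats(query, midline):
    """Return (identities, positives, length, percent_identity) for one alignment."""
    length = len(query)  # alignment columns, including any '-' gap columns
    # Trailing spaces of the match line are often trimmed; pad them back.
    midline = midline.ljust(length)[:length]
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    percent_identity = 100.0 * identities / length if length else 0.0
    return identities, positives, length, percent_identity


# First ten alignment columns of the first GenBank hit listed above.
print(alignment_stats("ESLSREAGCL", "+ LS EA  L"))
```

For a hit whose alignment is wrapped over several blocks, concatenate the Query lines and the match lines of all blocks before calling the function.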

TrEMBL top hits    e value    % identity    Alignment
A0A5A7SMS7 Gag-protease polyprotein    2.7e-39    45.66
Query:  ESLSREAGCLRDFRKWYHHPFDGASKDPTLALLWLSSIETVFRHMNYSEDKKVYCVAFLLQDNALIWWQSIERTIDVSNGSVTWLQFRKTFLRKYYPTNA
        + LS EA  LRDFRK+    FDG+ +DPT A +WLSS+ET+FR+M Y ED+KV C  F+L D    WW++ ER +      +TW QF+++F  K++ T+ 
Subjt:  ESLSREAGCLRDFRKWYHHPFDGASKDPTLALLWLSSIETVFRHMNYSEDKKVYCVAFLLQDNALIWWQSIERTIDVSNGSVTWLQFRKTFLRKYYPTNA

Query:  RFKKQAGFVALNQGSRTVEEYETEFARLSRFAPTMVAIEADKVERFITGFRDNIQGSMAAHQPPDYATTVRVA
        R  K+  F+ L QG  TVE+Y+ EF  LSRFAP M+A EA + ++F+ G R +IQG + A +P  +A  +R+A
Subjt:  RFKKQAGFVALNQGSRTVEEYETEFARLSRFAPTMVAIEADKVERFITGFRDNIQGSMAAHQPPDYATTVRVA

A0A5A7SW90 Reverse transcriptase    2.7e-39    44.04
Query:  VQATVAGVIAGQQAQAPHNNESLSREAGCLRDFRKWYHHPFDGASKDPTLALLWLSSIETVFRHMNYSEDKKVYCVAFLLQDNALIWWQSIERTIDVSNG
        VQ  V  V       AP   + LS EA  LRDFRK+    FDG+ +DPT A LWLSS+ET+FR+M   ED+KV C  F+L D    WW++IER +    G
Subjt:  VQATVAGVIAGQQAQAPHNNESLSREAGCLRDFRKWYHHPFDGASKDPTLALLWLSSIETVFRHMNYSEDKKVYCVAFLLQDNALIWWQSIERTIDVSNG

Query:  SVTWLQFRKTFLRKYYPTNARFKKQAGFVALNQGSRTVEEYETEFARLSRFAPTMVAIEADKVERFITGFRDNIQGSMAAHQPPDYATTVRVA
         +TW QF+++F  K++  + R  K+  F+ L Q   TVE+Y+ EF  LSRFAP M+A EA + ++F+ G R +IQG + A +P  +A  +R+A
Subjt:  SVTWLQFRKTFLRKYYPTNARFKKQAGFVALNQGSRTVEEYETEFARLSRFAPTMVAIEADKVERFITGFRDNIQGSMAAHQPPDYATTVRVA

A0A5A7VDM7 Gag protease polyprotein    3.5e-39    46.24
Query:  ESLSREAGCLRDFRKWYHHPFDGASKDPTLALLWLSSIETVFRHMNYSEDKKVYCVAFLLQDNALIWWQSIERTIDVSNGSVTWLQFRKTFLRKYYPTNA
        + LS EA  LRDFRK+    FDG+ +DPT A LWLSS+ET+FR+M Y ED+KV C  F+L D    WW++ ER +    G +TW QF+++F  K++  + 
Subjt:  ESLSREAGCLRDFRKWYHHPFDGASKDPTLALLWLSSIETVFRHMNYSEDKKVYCVAFLLQDNALIWWQSIERTIDVSNGSVTWLQFRKTFLRKYYPTNA

Query:  RFKKQAGFVALNQGSRTVEEYETEFARLSRFAPTMVAIEADKVERFITGFRDNIQGSMAAHQPPDYATTVRVA
        R  KQ  F+ L QG  TVE+Y+ EF  LSRFAP M+A EA +  +F+ G R +IQ  + A +P  +A  +R+A
Subjt:  RFKKQAGFVALNQGSRTVEEYETEFARLSRFAPTMVAIEADKVERFITGFRDNIQGSMAAHQPPDYATTVRVA

A0A5D3E4V0 Reverse transcriptase    2.7e-39    44.04
Query:  VQATVAGVIAGQQAQAPHNNESLSREAGCLRDFRKWYHHPFDGASKDPTLALLWLSSIETVFRHMNYSEDKKVYCVAFLLQDNALIWWQSIERTIDVSNG
        VQ  V  V       AP   + LS EA  LRDFRK+    FDG+ +DPT A LWLSS+ET+FR+M   ED+KV C  F+L D    WW++IER +    G
Subjt:  VQATVAGVIAGQQAQAPHNNESLSREAGCLRDFRKWYHHPFDGASKDPTLALLWLSSIETVFRHMNYSEDKKVYCVAFLLQDNALIWWQSIERTIDVSNG

Query:  SVTWLQFRKTFLRKYYPTNARFKKQAGFVALNQGSRTVEEYETEFARLSRFAPTMVAIEADKVERFITGFRDNIQGSMAAHQPPDYATTVRVA
         +TW QF+++F  K++  + R  K+  F+ L Q   TVE+Y+ EF  LSRFAP M+A EA + ++F+ G R +IQG + A +P  +A  +R+A
Subjt:  SVTWLQFRKTFLRKYYPTNARFKKQAGFVALNQGSRTVEEYETEFARLSRFAPTMVAIEADKVERFITGFRDNIQGSMAAHQPPDYATTVRVA

A0A6J1DSJ6 uncharacterized protein LOC111023512    4.8e-49    49.5
Query:  VQATVAGVIAGQQAQAPHNNESLSREAGCLRDFRKWYHHPFDGASKDPTLALLWLSSIETVFRHMNYSEDKKVYCVAFLLQDNALIWWQSIERTIDVSNG
        +Q  V   ++ Q  Q   N  S+S EA  LRDF+K+    FDG S DP LA  WLS +ET+FR+M   E++KV C  F+L+D+A +WW+S ER IDVS G
Subjt:  VQATVAGVIAGQQAQAPHNNESLSREAGCLRDFRKWYHHPFDGASKDPTLALLWLSSIETVFRHMNYSEDKKVYCVAFLLQDNALIWWQSIERTIDVSNG

Query:  SVTWLQFRKTFLRKYYPTNARFKKQAGFVALNQGSRTVEEYETEFARLSRFAPTMVAIEADKVERFITGFRDNIQGSMAAHQPPDYATTVRVAELIDRRP
         VTWLQF++ F ++YYP    ++KQ  F+ L Q +R+VEEY+ EF +LSRFAP +V  EA K ERFI   +D  +G +A   PPDYAT +R A LID R 
Subjt:  SVTWLQFRKTFLRKYYPTNARFKKQAGFVALNQGSRTVEEYETEFARLSRFAPTMVAIEADKVERFITGFRDNIQGSMAAHQPPDYATTVRVAELIDRRP

Query:  AT
        A+
Subjt:  AT

SwissProt top hits    e value    % identity    Alignment
No hits found

Arabidopsis top hits    e value    % identity    Alignment
No hits found

Sequences
CDS sequence
ATGTTCGTATTGGTCGTTTACCACACCTTTGGCAGCCCTCCCTTGGCGCTTCTTGATGATACCGACAATTTTGTTGGTCTTCCCTCGATGCTACTTGCTGATATTGGATT
GTACTTATATCATCATAGAAGAGTTCGTGGAAGGGGTCGTGAAAGGGGTCGCGCGGCCCTTGAGGCAGTTGTGCCGCTGGTTGGACAAGAAAACAATCCGGCAGGGGACC
CACGAGTAGAGCAACCGGCACTGACAGCGGAACGTATCACGGTGGATTCTATTCAGGCAATTTTGCAGTCAACGGTGGTTGGGGCAGTGCAGTCTGCGGTGCAAGCGACA
GTTGCAGGCGTGATTGCGGGGCAGCAGGCCCAAGCGCCTCATAATAATGAATCACTATCGCGAGAGGCAGGGTGTTTAAGGGACTTTAGGAAGTGGTACCATCATCCATT
CGATGGAGCATCAAAGGACCCCACATTGGCGTTGTTGTGGCTCTCTTCCATTGAAACCGTCTTTCGTCACATGAATTATTCGGAAGACAAAAAGGTTTATTGTGTCGCTT
TCCTGTTGCAAGATAATGCTTTGATTTGGTGGCAGTCGATCGAAAGGACTATAGACGTCAGTAATGGATCTGTGACATGGCTCCAGTTCAGGAAAACGTTCTTAAGGAAA
TATTACCCTACAAATGCACGTTTTAAGAAGCAAGCGGGGTTCGTAGCTCTCAATCAGGGAAGCCGAACGGTGGAAGAATATGAGACAGAGTTTGCCAGACTATCTCGGTT
TGCCCCTACCATGGTTGCTATAGAGGCCGACAAAGTAGAACGATTTATCACGGGTTTTAGGGATAACATACAAGGTAGCATGGCTGCCCATCAACCACCAGACTACGCCA
CGACAGTCAGAGTGGCAGAGTTAATAGATCGTCGTCCAGCAACTACGCCTTGA
mRNA sequence
ATGTTCGTATTGGTCGTTTACCACACCTTTGGCAGCCCTCCCTTGGCGCTTCTTGATGATACCGACAATTTTGTTGGTCTTCCCTCGATGCTACTTGCTGATATTGGATT
GTACTTATATCATCATAGAAGAGTTCGTGGAAGGGGTCGTGAAAGGGGTCGCGCGGCCCTTGAGGCAGTTGTGCCGCTGGTTGGACAAGAAAACAATCCGGCAGGGGACC
CACGAGTAGAGCAACCGGCACTGACAGCGGAACGTATCACGGTGGATTCTATTCAGGCAATTTTGCAGTCAACGGTGGTTGGGGCAGTGCAGTCTGCGGTGCAAGCGACA
GTTGCAGGCGTGATTGCGGGGCAGCAGGCCCAAGCGCCTCATAATAATGAATCACTATCGCGAGAGGCAGGGTGTTTAAGGGACTTTAGGAAGTGGTACCATCATCCATT
CGATGGAGCATCAAAGGACCCCACATTGGCGTTGTTGTGGCTCTCTTCCATTGAAACCGTCTTTCGTCACATGAATTATTCGGAAGACAAAAAGGTTTATTGTGTCGCTT
TCCTGTTGCAAGATAATGCTTTGATTTGGTGGCAGTCGATCGAAAGGACTATAGACGTCAGTAATGGATCTGTGACATGGCTCCAGTTCAGGAAAACGTTCTTAAGGAAA
TATTACCCTACAAATGCACGTTTTAAGAAGCAAGCGGGGTTCGTAGCTCTCAATCAGGGAAGCCGAACGGTGGAAGAATATGAGACAGAGTTTGCCAGACTATCTCGGTT
TGCCCCTACCATGGTTGCTATAGAGGCCGACAAAGTAGAACGATTTATCACGGGTTTTAGGGATAACATACAAGGTAGCATGGCTGCCCATCAACCACCAGACTACGCCA
CGACAGTCAGAGTGGCAGAGTTAATAGATCGTCGTCCAGCAACTACGCCTTGA
Protein sequence
MFVLVVYHTFGSPPLALLDDTDNFVGLPSMLLADIGLYLYHHRRVRGRGRERGRAALEAVVPLVGQENNPAGDPRVEQPALTAERITVDSIQAILQSTVVGAVQSAVQAT
VAGVIAGQQAQAPHNNESLSREAGCLRDFRKWYHHPFDGASKDPTLALLWLSSIETVFRHMNYSEDKKVYCVAFLLQDNALIWWQSIERTIDVSNGSVTWLQFRKTFLRK
YYPTNARFKKQAGFVALNQGSRTVEEYETEFARLSRFAPTMVAIEADKVERFITGFRDNIQGSMAAHQPPDYATTVRVAELIDRRPATTP
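
The protein sequence above corresponds to the CDS read codon by codon up to the stop codon. The sketch below illustrates that relationship in Python, assuming the standard nuclear genetic code (NCBI translation table 1), which this record does not state explicitly. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G at each position. Assumed, not stated by the record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS listed above.
print(translate("ATGTTCGTATTGGTCGTTTACCACACC"))  # prints MFVLVVYHT
```

Passing the full CDS text, line breaks included, to `translate` gives a string that can be compared directly with the protein sequence once its line breaks are removed.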