CuGenDBv2

Lag0015655 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0015655
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Gag protease polyprotein
Genome location: chr12:18771574..18772086
RNA-Seq Expression: Lag0015655
Synteny: Lag0015655
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
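
The genome location can be turned into a feature length with a few lines of Python. This is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the page does not state explicitly.

```python
import re


def parse_location(location: str) -> tuple:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(location: str) -> int:
    """Length in bp, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


print(span_length("chr12:18771574..18772086"))  # 513
```

Under that assumption the span is 513 bp, which equals the length of the CDS listed for this gene (170 codons plus a stop codon).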


Homology
GenBank top hits (e value, % identity, alignment)
XP_022151688.1 uncharacterized protein LOC111019603 [Momordica charantia]; e value 1.4e-23; identity 38.12%
Query:  PPSFDGHSEGSEAVEEWTAIFEALFQLLGVDAQQRVQGATFMIKGYARTWWKVVGQTENRLENPISWSRVKGLVRDHFGCRLADVELEAKFVSLVQGTLS
        PP+F+G SE +  VEEW    EAL+  LG   Q +V+GA FM++G A  WW VV   E+    PI+W+  K L+ D++  +    E E +F+ L Q TL 
Subjt:  PPSFDGHSEGSEAVEEWTAIFEALFQLLGVDAQQRVQGATFMIKGYARTWWKVVGQTENRLENPISWSRVKGLVRDHFGCRLADVELEAKFVSLVQGTLS

Query:  MEQYVRRFEKLSYRVQEMVAIEESRINRFVNRLRAEIRGLVRLSRPATFATALASARMLD
        + QY ++F + S    +++  E  +I RFV  L   I+G + L RP T+A A+  A ++D
Subjt:  MEQYVRRFEKLSYRVQEMVAIEESRINRFVNRLRAEIRGLVRLSRPATFATALASARMLD

XP_022155000.1 uncharacterized protein LOC111022144 [Momordica charantia]; e value 5.5e-25; identity 41.25%
Query:  PPSFDGHSEGSEAVEEWTAIFEALFQLLGVDAQQRVQGATFMIKGYARTWWKVVGQTENRLENPISWSRVKGLVRDHFGCRLADVELEAKFVSLVQGTLS
        PP+FDG SE + A EEW    EA +  LG + Q +V+GA FM++G A  WW  +   E+     I W+R K L+ D++         EA+F+ LVQGTLS
Subjt:  PPSFDGHSEGSEAVEEWTAIFEALFQLLGVDAQQRVQGATFMIKGYARTWWKVVGQTENRLENPISWSRVKGLVRDHFGCRLADVELEAKFVSLVQGTLS

Query:  MEQYVRRFEKLSYRVQEMVAIEESRINRFVNRLRAEIRGLVRLSRPATFATALASARMLD
        + QY R+F +LS    E++     +I RFV  L   IRG V L RPA++A A+  A ++D
Subjt:  MEQYVRRFEKLSYRVQEMVAIEESRINRFVNRLRAEIRGLVRLSRPATFATALASARMLD

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]; e value 4.1e-28; identity 45%
Query:  PPSFDGHSEGSEAVEEWTAIFEALFQLLGVDAQQRVQGATFMIKGYARTWWKVVGQTENRLENPISWSRVKGLVRDHFGCRLADVELEAKFVSLVQGTLS
        PP+FDG SE + AVEEW    EAL+  LG + Q +V+GA FM++G A  WW  V   E+    PI W+R K L+ D++         EA+F+ LVQGTLS
Subjt:  PPSFDGHSEGSEAVEEWTAIFEALFQLLGVDAQQRVQGATFMIKGYARTWWKVVGQTENRLENPISWSRVKGLVRDHFGCRLADVELEAKFVSLVQGTLS

Query:  MEQYVRRFEKLSYRVQEMVAIEESRINRFVNRLRAEIRGLVRLSRPATFATALASARMLD
        + QY R+F +LS    E++  E  +I RFV  LR  IRG V L RP T+A A+  A ++D
Subjt:  MEQYVRRFEKLSYRVQEMVAIEESRINRFVNRLRAEIRGLVRLSRPATFATALASARMLD

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]; e value 1.4e-23; identity 38.12%
Query:  PPSFDGHSEGSEAVEEWTAIFEALFQLLGVDAQQRVQGATFMIKGYARTWWKVVGQTENRLENPISWSRVKGLVRDHFGCRLADVELEAKFVSLVQGTLS
        PP F+G SE   A EEW    EAL+  LG     +V+GA FM++G A  WW+ V   E+    P++W+R K L+ +++   +A  E   +F+ L QG+L+
Subjt:  PPSFDGHSEGSEAVEEWTAIFEALFQLLGVDAQQRVQGATFMIKGYARTWWKVVGQTENRLENPISWSRVKGLVRDHFGCRLADVELEAKFVSLVQGTLS

Query:  MEQYVRRFEKLSYRVQEMVAIEESRINRFVNRLRAEIRGLVRLSRPATFATALASARMLD
        + QY R+F +LS    + V  E+ +I++F++ LR EI+GL+ L  P T+A A+  A ++D
Subjt:  MEQYVRRFEKLSYRVQEMVAIEESRINRFVNRLRAEIRGLVRLSRPATFATALASARMLD

XP_022156546.1 uncharacterized protein LOC111023424 [Momordica charantia]; e value 1.4e-25; identity 40.62%
Query:  PPSFDGHSEGSEAVEEWTAIFEALFQLLGVDAQQRVQGATFMIKGYARTWWKVVGQTENRLENPISWSRVKGLVRDHFGCRLADVELEAKFVSLVQGTLS
        PP+F G SE +   EEW    EAL+  LG + Q +V+GA FM++  A  WW  V  TE+    P+ W+R K L+ DH+     +   E +F+ LVQGTL+
Subjt:  PPSFDGHSEGSEAVEEWTAIFEALFQLLGVDAQQRVQGATFMIKGYARTWWKVVGQTENRLENPISWSRVKGLVRDHFGCRLADVELEAKFVSLVQGTLS

Query:  MEQYVRRFEKLSYRVQEMVAIEESRINRFVNRLRAEIRGLVRLSRPATFATALASARMLD
        + QY R+F +LS    E++  E  +I RFV  L   IRG V L RP T+A A+    ++D
Subjt:  MEQYVRRFEKLSYRVQEMVAIEESRINRFVNRLRAEIRGLVRLSRPATFATALASARMLD

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DCW8 uncharacterized protein LOC111019603; e value 6.6e-24; identity 38.12%
Query:  PPSFDGHSEGSEAVEEWTAIFEALFQLLGVDAQQRVQGATFMIKGYARTWWKVVGQTENRLENPISWSRVKGLVRDHFGCRLADVELEAKFVSLVQGTLS
        PP+F+G SE +  VEEW    EAL+  LG   Q +V+GA FM++G A  WW VV   E+    PI+W+  K L+ D++  +    E E +F+ L Q TL 
Subjt:  PPSFDGHSEGSEAVEEWTAIFEALFQLLGVDAQQRVQGATFMIKGYARTWWKVVGQTENRLENPISWSRVKGLVRDHFGCRLADVELEAKFVSLVQGTLS

Query:  MEQYVRRFEKLSYRVQEMVAIEESRINRFVNRLRAEIRGLVRLSRPATFATALASARMLD
        + QY ++F + S    +++  E  +I RFV  L   I+G + L RP T+A A+  A ++D
Subjt:  MEQYVRRFEKLSYRVQEMVAIEESRINRFVNRLRAEIRGLVRLSRPATFATALASARMLD

A0A6J1DL73 uncharacterized protein LOC111022144; e value 2.7e-25; identity 41.25%
Query:  PPSFDGHSEGSEAVEEWTAIFEALFQLLGVDAQQRVQGATFMIKGYARTWWKVVGQTENRLENPISWSRVKGLVRDHFGCRLADVELEAKFVSLVQGTLS
        PP+FDG SE + A EEW    EA +  LG + Q +V+GA FM++G A  WW  +   E+     I W+R K L+ D++         EA+F+ LVQGTLS
Subjt:  PPSFDGHSEGSEAVEEWTAIFEALFQLLGVDAQQRVQGATFMIKGYARTWWKVVGQTENRLENPISWSRVKGLVRDHFGCRLADVELEAKFVSLVQGTLS

Query:  MEQYVRRFEKLSYRVQEMVAIEESRINRFVNRLRAEIRGLVRLSRPATFATALASARMLD
        + QY R+F +LS    E++     +I RFV  L   IRG V L RPA++A A+  A ++D
Subjt:  MEQYVRRFEKLSYRVQEMVAIEESRINRFVNRLRAEIRGLVRLSRPATFATALASARMLD

A0A6J1DQB9 Reverse transcriptase; e value 6.6e-24; identity 38.12%
Query:  PPSFDGHSEGSEAVEEWTAIFEALFQLLGVDAQQRVQGATFMIKGYARTWWKVVGQTENRLENPISWSRVKGLVRDHFGCRLADVELEAKFVSLVQGTLS
        PP F+G SE   A EEW    EAL+  LG     +V+GA FM++G A  WW+ V   E+    P++W+R K L+ +++   +A  E   +F+ L QG+L+
Subjt:  PPSFDGHSEGSEAVEEWTAIFEALFQLLGVDAQQRVQGATFMIKGYARTWWKVVGQTENRLENPISWSRVKGLVRDHFGCRLADVELEAKFVSLVQGTLS

Query:  MEQYVRRFEKLSYRVQEMVAIEESRINRFVNRLRAEIRGLVRLSRPATFATALASARMLD
        + QY R+F +LS    + V  E+ +I++F++ LR EI+GL+ L  P T+A A+  A ++D
Subjt:  MEQYVRRFEKLSYRVQEMVAIEESRINRFVNRLRAEIRGLVRLSRPATFATALASARMLD

A0A6J1DUM2 uncharacterized protein LOC111023247; e value 2.0e-28; identity 45%
Query:  PPSFDGHSEGSEAVEEWTAIFEALFQLLGVDAQQRVQGATFMIKGYARTWWKVVGQTENRLENPISWSRVKGLVRDHFGCRLADVELEAKFVSLVQGTLS
        PP+FDG SE + AVEEW    EAL+  LG + Q +V+GA FM++G A  WW  V   E+    PI W+R K L+ D++         EA+F+ LVQGTLS
Subjt:  PPSFDGHSEGSEAVEEWTAIFEALFQLLGVDAQQRVQGATFMIKGYARTWWKVVGQTENRLENPISWSRVKGLVRDHFGCRLADVELEAKFVSLVQGTLS

Query:  MEQYVRRFEKLSYRVQEMVAIEESRINRFVNRLRAEIRGLVRLSRPATFATALASARMLD
        + QY R+F +LS    E++  E  +I RFV  LR  IRG V L RP T+A A+  A ++D
Subjt:  MEQYVRRFEKLSYRVQEMVAIEESRINRFVNRLRAEIRGLVRLSRPATFATALASARMLD

A0A6J1DVA0 uncharacterized protein LOC111023424; e value 7.0e-26; identity 40.62%
Query:  PPSFDGHSEGSEAVEEWTAIFEALFQLLGVDAQQRVQGATFMIKGYARTWWKVVGQTENRLENPISWSRVKGLVRDHFGCRLADVELEAKFVSLVQGTLS
        PP+F G SE +   EEW    EAL+  LG + Q +V+GA FM++  A  WW  V  TE+    P+ W+R K L+ DH+     +   E +F+ LVQGTL+
Subjt:  PPSFDGHSEGSEAVEEWTAIFEALFQLLGVDAQQRVQGATFMIKGYARTWWKVVGQTENRLENPISWSRVKGLVRDHFGCRLADVELEAKFVSLVQGTLS

Query:  MEQYVRRFEKLSYRVQEMVAIEESRINRFVNRLRAEIRGLVRLSRPATFATALASARMLD
        + QY R+F +LS    E++  E  +I RFV  L   IRG V L RP T+A A+    ++D
Subjt:  MEQYVRRFEKLSYRVQEMVAIEESRINRFVNRLRAEIRGLVRLSRPATFATALASARMLD

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
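
In the alignments above, the middle line between each Query and Subjt row follows the usual BLAST-style convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch. The sketch below shows one way a percent identity could be derived from such a line. The function name `percent_identity` is hypothetical, and how the database itself computes its %identity column is not stated on the page; the listed values do appear consistent with identities counted over 160 aligned columns (for example, 72/160 = 45%).

```python
from typing import Optional


def percent_identity(midline: str, alignment_length: Optional[int] = None) -> float:
    """Percentage of alignment columns whose midline character is a letter.

    Pass alignment_length explicitly if trailing spaces of the midline
    may have been stripped, since those spaces are mismatch columns.
    """
    length = len(midline) if alignment_length is None else alignment_length
    if length <= 0:
        raise ValueError("alignment length must be positive")
    identities = sum(ch.isalpha() for ch in midline)
    return 100.0 * identities / length


# First ten columns of the first GenBank alignment (query PPSFDGHSEG).
example_midline = "PP+F+G SE "
print(percent_identity(example_midline))  # 60.0
```

The explicit `alignment_length` argument matters for copied text, because a midline that ends in mismatches ends in spaces that are easily lost.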

Sequences
CDS sequence
ATGAAGACGAAGCCTCCTTCATTCGATGGACACTCTGAAGGTTCTGAAGCAGTAGAAGAATGGACTGCAATATTTGAAGCATTGTTCCAACTTCTTGGAGTCGATGCCCA
ACAACGAGTCCAAGGAGCTACCTTTATGATCAAAGGCTACGCTCGCACTTGGTGGAAGGTAGTGGGTCAAACCGAGAACCGCCTAGAGAATCCCATTTCATGGTCAAGGG
TTAAAGGTCTTGTGCGAGACCATTTTGGCTGTCGTTTGGCTGATGTTGAGCTCGAAGCGAAGTTCGTCTCACTTGTTCAAGGGACCTTGTCCATGGAGCAGTACGTCAGA
AGGTTTGAAAAGTTGTCCTACCGTGTCCAGGAGATGGTTGCCATTGAAGAAAGTAGGATCAACCGATTTGTTAATCGTCTCCGTGCAGAAATCCGAGGTTTGGTCAGGCT
TAGTCGACCAGCCACATTCGCGACAGCTCTAGCGAGCGCTCGGATGTTGGATATGTTGGGTTTTATGTCCTAA
mRNA sequence
ATGAAGACGAAGCCTCCTTCATTCGATGGACACTCTGAAGGTTCTGAAGCAGTAGAAGAATGGACTGCAATATTTGAAGCATTGTTCCAACTTCTTGGAGTCGATGCCCA
ACAACGAGTCCAAGGAGCTACCTTTATGATCAAAGGCTACGCTCGCACTTGGTGGAAGGTAGTGGGTCAAACCGAGAACCGCCTAGAGAATCCCATTTCATGGTCAAGGG
TTAAAGGTCTTGTGCGAGACCATTTTGGCTGTCGTTTGGCTGATGTTGAGCTCGAAGCGAAGTTCGTCTCACTTGTTCAAGGGACCTTGTCCATGGAGCAGTACGTCAGA
AGGTTTGAAAAGTTGTCCTACCGTGTCCAGGAGATGGTTGCCATTGAAGAAAGTAGGATCAACCGATTTGTTAATCGTCTCCGTGCAGAAATCCGAGGTTTGGTCAGGCT
TAGTCGACCAGCCACATTCGCGACAGCTCTAGCGAGCGCTCGGATGTTGGATATGTTGGGTTTTATGTCCTAA
Protein sequence
MKTKPPSFDGHSEGSEAVEEWTAIFEALFQLLGVDAQQRVQGATFMIKGYARTWWKVVGQTENRLENPISWSRVKGLVRDHFGCRLADVELEAKFVSLVQGTLSMEQYVR
RFEKLSYRVQEMVAIEESRINRFVNRLRAEIRGLVRLSRPATFATALASARMLDMLGFMS
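
The relationship between the CDS and protein sequences above can be checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code and a CDS starting in frame; the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, dropping one terminal stop codon if present.

    Whitespace and line breaks are removed first, so a sequence pasted
    across several lines can be passed in directly. Codons containing
    characters other than A, C, G, T raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 30 and last 21 nucleotides of the CDS listed above.
print(translate("ATGAAGACGAAGCCTCCTTCATTCGATGGA"))  # MKTKPPSFDG
print(translate("ATGTTGGGTTTTATGTCCTAA"))           # MLGFMS
```

Both fragments reproduce the corresponding ends of the listed protein sequence (`MKTKPPSFDG...` and `...MLGFMS`).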