; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0007675 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0007675
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr9:2967281..2967805
RNA-Seq ExpressionLag0007675
SyntenyLag0007675
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022156330.1 uncharacterized protein LOC111023250 [Momordica charantia]1.1e-3363.55Show/hide
Query:  ESQFIRDFKRYGPPSFDGQSENPLAAERWVAGLEALFDLMNCNDPQKIRGAIFMLKDDARMWWQSVAAAEDHANQLMSWERFKDLLYDYYFPETVKDDKE
        E+QFI+DFKRYGPP+FDG S+    AE WV  LEAL+  + C D  K++GA+FML+  A  WW SVAAAEDHAN  ++W RFKDLLYDYY+PETVKD KE
Subjt:  ESQFIRDFKRYGPPSFDGQSENPLAAERWVAGLEALFDLMNCNDPQKIRGAIFMLKDDARMWWQSVAAAEDHANQLMSWERFKDLLYDYYFPETVKDDKE

Query:  AKFLHLT
        A+FLH +
Subjt:  AKFLHLT

XP_022158637.1 uncharacterized protein LOC111025088 [Momordica charantia]8.7e-3457.6Show/hide
Query:  GQRNVRPPRNPNWAPGNADESQFIRDFKRYGPPSFDGQSENPLAAERWVAGLEALFDLMNCNDPQKIRGAIFMLKDDARMWWQSVAAAEDHANQLMSWER
        G     PPR+ +       E+QFI+DFKRYGPP+FDG SE   AAE WV  LEAL+  + C D  K++G +FML+ +A  WW S+A AEDHAN  + W R
Subjt:  GQRNVRPPRNPNWAPGNADESQFIRDFKRYGPPSFDGQSENPLAAERWVAGLEALFDLMNCNDPQKIRGAIFMLKDDARMWWQSVAAAEDHANQLMSWER

Query:  FKDLLYDYYFPETVKDDKEAKFLHL
        FKDLLYDYY+PETVKD KEA+FLHL
Subjt:  FKDLLYDYYFPETVKDDKEAKFLHL

XP_022158645.1 uncharacterized protein LOC111025102 [Momordica charantia]1.5e-3356.06Show/hide
Query:  DHVRQHGQRNVRPPRNPNWAPGNADESQFIRDFKRYGPPSFDGQSENPLAAERWVAGLEALFDLMNCNDPQKIRGAIFMLKDDARMWWQSVAAAEDHANQ
        D+  + G   V+ PR    A    +E QFIRDFKR+GPP F+G SE P AAE WV  LEAL+  + C+D  K+RGA+FML  +A  WW+SVAAAEDHAN 
Subjt:  DHVRQHGQRNVRPPRNPNWAPGNADESQFIRDFKRYGPPSFDGQSENPLAAERWVAGLEALFDLMNCNDPQKIRGAIFMLKDDARMWWQSVAAAEDHANQ

Query:  LMSWERFKDLLYDYYFPETVKDDKEAKFLHLT
         ++W RFKDLLY+YYFP TV+++K A+FL LT
Subjt:  LMSWERFKDLLYDYYFPETVKDDKEAKFLHLT

XP_022158749.1 uncharacterized protein LOC111025213 [Momordica charantia]1.1e-3355.15Show/hide
Query:  PGSPPDHVRQHGQRNVRPPRNPNWAPGNADESQFIRDFKRYGPPSFDGQSENPLAAERWVAGLEALFDLMNCNDPQKIRGAIFMLKDDARMWWQSVAAAE
        P  PP   R+     V PP  P        E++FI+DFKRYGPP+FDG+SE   AAE W+  LEAL+  + C D  K++GA+FML+ +A  WW SVAAAE
Subjt:  PGSPPDHVRQHGQRNVRPPRNPNWAPGNADESQFIRDFKRYGPPSFDGQSENPLAAERWVAGLEALFDLMNCNDPQKIRGAIFMLKDDARMWWQSVAAAE

Query:  DHANQLMSWERFKDLLYDYYFPETVKDDKEAKFLHL
        DHAN  + W RFK+LLYDYY+PE VKD KEA+FLHL
Subjt:  DHANQLMSWERFKDLLYDYYFPETVKDDKEAKFLHL

XP_022159307.1 uncharacterized protein LOC111025716, partial [Momordica charantia]3.9e-3458.73Show/hide
Query:  GQRNVRPPRNPNWAPGNADESQFIRDFKRYGPPSFDGQSENPLAAERWVAGLEALFDLMNCNDPQKIRGAIFMLKDDARMWWQSVAAAEDHANQLMSWER
        G    +PPR+ +       E+QFI+DFKRYGPP+FDG SE   AAE WV  LEAL+  + C D  K +GA+FML+ +A  WW SV AAEDHAN   +W R
Subjt:  GQRNVRPPRNPNWAPGNADESQFIRDFKRYGPPSFDGQSENPLAAERWVAGLEALFDLMNCNDPQKIRGAIFMLKDDARMWWQSVAAAEDHANQLMSWER

Query:  FKDLLYDYYFPETVKDDKEAKFLHLT
        FKDLLYDYY+PETVKD KEA+FLHL+
Subjt:  FKDLLYDYYFPETVKDDKEAKFLHLT

TrEMBL top hitse value%identityAlignment
A0A6J1DQ01 uncharacterized protein LOC1110232505.5e-3463.55Show/hide
Query:  ESQFIRDFKRYGPPSFDGQSENPLAAERWVAGLEALFDLMNCNDPQKIRGAIFMLKDDARMWWQSVAAAEDHANQLMSWERFKDLLYDYYFPETVKDDKE
        E+QFI+DFKRYGPP+FDG S+    AE WV  LEAL+  + C D  K++GA+FML+  A  WW SVAAAEDHAN  ++W RFKDLLYDYY+PETVKD KE
Subjt:  ESQFIRDFKRYGPPSFDGQSENPLAAERWVAGLEALFDLMNCNDPQKIRGAIFMLKDDARMWWQSVAAAEDHANQLMSWERFKDLLYDYYFPETVKDDKE

Query:  AKFLHLT
        A+FLH +
Subjt:  AKFLHLT

A0A6J1DWE6 uncharacterized protein LOC1110251027.2e-3456.06Show/hide
Query:  DHVRQHGQRNVRPPRNPNWAPGNADESQFIRDFKRYGPPSFDGQSENPLAAERWVAGLEALFDLMNCNDPQKIRGAIFMLKDDARMWWQSVAAAEDHANQ
        D+  + G   V+ PR    A    +E QFIRDFKR+GPP F+G SE P AAE WV  LEAL+  + C+D  K+RGA+FML  +A  WW+SVAAAEDHAN 
Subjt:  DHVRQHGQRNVRPPRNPNWAPGNADESQFIRDFKRYGPPSFDGQSENPLAAERWVAGLEALFDLMNCNDPQKIRGAIFMLKDDARMWWQSVAAAEDHANQ

Query:  LMSWERFKDLLYDYYFPETVKDDKEAKFLHLT
         ++W RFKDLLY+YYFP TV+++K A+FL LT
Subjt:  LMSWERFKDLLYDYYFPETVKDDKEAKFLHLT

A0A6J1DXQ7 uncharacterized protein LOC1110250884.2e-3457.6Show/hide
Query:  GQRNVRPPRNPNWAPGNADESQFIRDFKRYGPPSFDGQSENPLAAERWVAGLEALFDLMNCNDPQKIRGAIFMLKDDARMWWQSVAAAEDHANQLMSWER
        G     PPR+ +       E+QFI+DFKRYGPP+FDG SE   AAE WV  LEAL+  + C D  K++G +FML+ +A  WW S+A AEDHAN  + W R
Subjt:  GQRNVRPPRNPNWAPGNADESQFIRDFKRYGPPSFDGQSENPLAAERWVAGLEALFDLMNCNDPQKIRGAIFMLKDDARMWWQSVAAAEDHANQLMSWER

Query:  FKDLLYDYYFPETVKDDKEAKFLHL
        FKDLLYDYY+PETVKD KEA+FLHL
Subjt:  FKDLLYDYYFPETVKDDKEAKFLHL

A0A6J1DZH0 uncharacterized protein LOC1110257161.9e-3458.73Show/hide
Query:  GQRNVRPPRNPNWAPGNADESQFIRDFKRYGPPSFDGQSENPLAAERWVAGLEALFDLMNCNDPQKIRGAIFMLKDDARMWWQSVAAAEDHANQLMSWER
        G    +PPR+ +       E+QFI+DFKRYGPP+FDG SE   AAE WV  LEAL+  + C D  K +GA+FML+ +A  WW SV AAEDHAN   +W R
Subjt:  GQRNVRPPRNPNWAPGNADESQFIRDFKRYGPPSFDGQSENPLAAERWVAGLEALFDLMNCNDPQKIRGAIFMLKDDARMWWQSVAAAEDHANQLMSWER

Query:  FKDLLYDYYFPETVKDDKEAKFLHLT
        FKDLLYDYY+PETVKD KEA+FLHL+
Subjt:  FKDLLYDYYFPETVKDDKEAKFLHLT

A0A6J1E0B4 uncharacterized protein LOC1110252135.5e-3455.15Show/hide
Query:  PGSPPDHVRQHGQRNVRPPRNPNWAPGNADESQFIRDFKRYGPPSFDGQSENPLAAERWVAGLEALFDLMNCNDPQKIRGAIFMLKDDARMWWQSVAAAE
        P  PP   R+     V PP  P        E++FI+DFKRYGPP+FDG+SE   AAE W+  LEAL+  + C D  K++GA+FML+ +A  WW SVAAAE
Subjt:  PGSPPDHVRQHGQRNVRPPRNPNWAPGNADESQFIRDFKRYGPPSFDGQSENPLAAERWVAGLEALFDLMNCNDPQKIRGAIFMLKDDARMWWQSVAAAE

Query:  DHANQLMSWERFKDLLYDYYFPETVKDDKEAKFLHL
        DHAN  + W RFK+LLYDYY+PE VKD KEA+FLHL
Subjt:  DHANQLMSWERFKDLLYDYYFPETVKDDKEAKFLHL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAACGCTTAGCAGGAAGTTAATCCTCCGATCCCCCTGTTCAGCACGGAGACGACCTTCCGCCTCCCCCTCCTCCTCCGGTCCCTCCTGCGGCTCCTATGCTGATCA
GCCTGGAAGCCCTCCAGACCATGTTCGACAACATGGCCAGAGAAATGTTAGGCCACCGCGGAATCCTAATTGGGCACCTGGGAACGCAGATGAATCCCAGTTCATTAGAG
ACTTCAAGCGCTACGGGCCTCCCTCGTTTGACGGGCAGTCCGAGAATCCCTTGGCAGCAGAGCGTTGGGTCGCTGGATTAGAGGCACTGTTTGATCTCATGAACTGCAAT
GATCCCCAGAAGATTAGAGGAGCAATCTTCATGCTAAAGGATGACGCTCGCATGTGGTGGCAGTCTGTGGCAGCTGCTGAAGATCATGCCAATCAGCTGATGTCATGGGA
AAGGTTCAAAGACCTGTTGTACGATTATTACTTCCCGGAGACCGTCAAAGATGATAAAGAAGCTAAGTTCCTCCATCTGACCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCAACGCTTAGCAGGAAGTTAATCCTCCGATCCCCCTGTTCAGCACGGAGACGACCTTCCGCCTCCCCCTCCTCCTCCGGTCCCTCCTGCGGCTCCTATGCTGATCA
GCCTGGAAGCCCTCCAGACCATGTTCGACAACATGGCCAGAGAAATGTTAGGCCACCGCGGAATCCTAATTGGGCACCTGGGAACGCAGATGAATCCCAGTTCATTAGAG
ACTTCAAGCGCTACGGGCCTCCCTCGTTTGACGGGCAGTCCGAGAATCCCTTGGCAGCAGAGCGTTGGGTCGCTGGATTAGAGGCACTGTTTGATCTCATGAACTGCAAT
GATCCCCAGAAGATTAGAGGAGCAATCTTCATGCTAAAGGATGACGCTCGCATGTGGTGGCAGTCTGTGGCAGCTGCTGAAGATCATGCCAATCAGCTGATGTCATGGGA
AAGGTTCAAAGACCTGTTGTACGATTATTACTTCCCGGAGACCGTCAAAGATGATAAAGAAGCTAAGTTCCTCCATCTGACCTAG
Protein sequenceShow/hide protein sequence
MSTLSRKLILRSPCSARRRPSASPSSSGPSCGSYADQPGSPPDHVRQHGQRNVRPPRNPNWAPGNADESQFIRDFKRYGPPSFDGQSENPLAAERWVAGLEALFDLMNCN
DPQKIRGAIFMLKDDARMWWQSVAAAEDHANQLMSWERFKDLLYDYYFPETVKDDKEAKFLHLT