CuGenDBv2

Lag0008194 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0008194
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr9:14367404..14368155
RNA-Seq Expression: Lag0008194
Synteny: Lag0008194
Gene Ontology terms: GO:0005488 - binding (molecular function)
InterPro domains: IPR005162 - Retrotransposon gag domain
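
The Genome location field packs a chromosome name and two coordinates into one string. As a minimal sketch, the helper below splits such a string and reports the span length. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int, int]:
    """Split 'chrom:start..end' into (chrom, start, end, span_length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if end < start:
        raise ValueError("end coordinate is smaller than start coordinate")
    return chrom, start, end, end - start + 1


if __name__ == "__main__":
    print(parse_location("chr9:14367404..14368155"))
```

Under that assumption the locus for Lag0008194 spans 752 bp.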


Homology
GenBank top hits (e value, % identity, alignment)
XP_022155000.1 uncharacterized protein LOC111022144 [Momordica charantia] (e value: 1.3e-50; % identity: 64.05)
Query:  ESQFLRDFKRYGPPTFDGRSENPVAAERWIADLEAMYDYMNCDDRLMVRGAIFMLRDDARMWWKSIAEAEDHANNPVSWERFKDLLYDHYFLETVKDDKE
        E+ F++DFKRYGPPTFDG SE   AAE WI +LEA Y Y+ C+D+  V+GA+FMLR +A  WW SIA AEDHAN  + W RFKDLLYD+Y+LETVKD KE
Subjt:  ESQFLRDFKRYGPPTFDGRSENPVAAERWIADLEAMYDYMNCDDRLMVRGAIFMLRDDARMWWKSIAEAEDHANNPVSWERFKDLLYDHYFLETVKDDKE

Query:  TEFLHLTQGNLTVVQYERKFTELSRFAQDLVSTPERKIKRFVKGLRRKSEVPL
         EFLHL QG L+V QYERKFTELSRFA +L+     KIKRFVKGL +    P+
Subjt:  TEFLHLTQGNLTVVQYERKFTELSRFAQDLVSTPERKIKRFVKGLRRKSEVPL

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia] (e value: 2.2e-50; % identity: 57.78)
Query:  PVPRRRRGVVPPAPPPRYPNRAAEDPMESQFLRDFKRYGPPTFDGRSENPVAAERWIADLEAMYDYMNCDDRLMVRGAIFMLRDDARMWWKSIAEAEDHA
        P P    GV   APPP++ +       E++F++DFKRYGPPTFDG SE   A E WI +LEA+Y Y+ C+D+  V+GA+FMLR +A  WW S+A AED+A
Subjt:  PVPRRRRGVVPPAPPPRYPNRAAEDPMESQFLRDFKRYGPPTFDGRSENPVAAERWIADLEAMYDYMNCDDRLMVRGAIFMLRDDARMWWKSIAEAEDHA

Query:  NNPVSWERFKDLLYDHYFLETVKDDKETEFLHLTQGNLTVVQYERKFTELSRFAQDLVSTPERKIKRFVKGLRRKSEVPL
        N P+ W RFK+LLYD+Y+ ETVKD KE EFLHL QG L+V QYERKFTELSRFA +L+ T   KIKRFVKGLR+    P+
Subjt:  NNPVSWERFKDLLYDHYFLETVKDDKETEFLHLTQGNLTVVQYERKFTELSRFAQDLVSTPERKIKRFVKGLRRKSEVPL

XP_022156330.1 uncharacterized protein LOC111023250 [Momordica charantia] (e value: 4.0e-52; % identity: 61.59)
Query:  YPNRAAEDPM-ESQFLRDFKRYGPPTFDGRSENPVAAERWIADLEAMYDYMNCDDRLMVRGAIFMLRDDARMWWKSIAEAEDHANNPVSWERFKDLLYDH
        +P R    P  E+QF++DFKRYGPPTFDG S+    AE W+ +LEA+Y Y+ C+D+  V+GA+FMLR  A  WW S+A AEDHAN PV+W RFKDLLYD+
Subjt:  YPNRAAEDPM-ESQFLRDFKRYGPPTFDGRSENPVAAERWIADLEAMYDYMNCDDRLMVRGAIFMLRDDARMWWKSIAEAEDHANNPVSWERFKDLLYDH

Query:  YFLETVKDDKETEFLHLTQGNLTVVQYERKFTELSRFAQDLVSTPERKIKRFVKGLRRKSEVPL
        Y+ ETVKD KE EFLH +QG LTV QYERKFTELSRFA++L+ T   KIKRFVKGLR+    P+
Subjt:  YFLETVKDDKETEFLHLTQGNLTVVQYERKFTELSRFAQDLVSTPERKIKRFVKGLRRKSEVPL

XP_022156546.1 uncharacterized protein LOC111023424 [Momordica charantia] (e value: 7.1e-49; % identity: 53.77)
Query:  DPPVPPVHQE--VHPPVP----------RRRRGVVPPAPPPRYPNRAAEDPMESQFLRDFKRYGPPTFDGRSENPVAAERWIADLEAMYDYMNCDDRLMV
        DPP PP+  +  V PP P              GV      P  P        E+QF++DFKRYGPPTF G SE    AE W+ +LEA+Y Y+ C+D+  V
Subjt:  DPPVPPVHQE--VHPPVP----------RRRRGVVPPAPPPRYPNRAAEDPMESQFLRDFKRYGPPTFDGRSENPVAAERWIADLEAMYDYMNCDDRLMV

Query:  RGAIFMLRDDARMWWKSIAEAEDHANNPVSWERFKDLLYDHYFLETVKDDKETEFLHLTQGNLTVVQYERKFTELSRFAQDLVSTPERKIKRFVKGLRR
        +GA+FMLR +A  WW S+A  EDHAN PV W RFK+LLYDHY+ ETV+D KE EFLHL QG LTV QYERKFTELS FA +L+ T   KIKRFVKGL +
Subjt:  RGAIFMLRDDARMWWKSIAEAEDHANNPVSWERFKDLLYDHYFLETVKDDKETEFLHLTQGNLTVVQYERKFTELSRFAQDLVSTPERKIKRFVKGLRR

XP_022158637.1 uncharacterized protein LOC111025088 [Momordica charantia] (e value: 1.5e-51; % identity: 61.9)
Query:  APPPRYPNRAAEDPMESQFLRDFKRYGPPTFDGRSENPVAAERWIADLEAMYDYMNCDDRLMVRGAIFMLRDDARMWWKSIAEAEDHANNPVSWERFKDL
        A PPR+ +       E+QF++DFKRYGPPTFDG SE   AAE W+ +LEA+Y Y+ C+D+  V+G +FMLR +A  WW SIA AEDHAN PV W RFKDL
Subjt:  APPPRYPNRAAEDPMESQFLRDFKRYGPPTFDGRSENPVAAERWIADLEAMYDYMNCDDRLMVRGAIFMLRDDARMWWKSIAEAEDHANNPVSWERFKDL

Query:  LYDHYFLETVKDDKETEFLHLTQGNLTVVQYERKFTELSRFAQDLVSTPERKIKRFVKGLRRKSEVPL
        LYD+Y+ ETVKD KE EFLHL QG LTV QYERKFTELSRFA + + T   KIKRFVKGLR+    P+
Subjt:  LYDHYFLETVKDDKETEFLHLTQGNLTVVQYERKFTELSRFAQDLVSTPERKIKRFVKGLRRKSEVPL

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DL73 uncharacterized protein LOC111022144 (e value: 6.3e-51; % identity: 64.05)
Query:  ESQFLRDFKRYGPPTFDGRSENPVAAERWIADLEAMYDYMNCDDRLMVRGAIFMLRDDARMWWKSIAEAEDHANNPVSWERFKDLLYDHYFLETVKDDKE
        E+ F++DFKRYGPPTFDG SE   AAE WI +LEA Y Y+ C+D+  V+GA+FMLR +A  WW SIA AEDHAN  + W RFKDLLYD+Y+LETVKD KE
Subjt:  ESQFLRDFKRYGPPTFDGRSENPVAAERWIADLEAMYDYMNCDDRLMVRGAIFMLRDDARMWWKSIAEAEDHANNPVSWERFKDLLYDHYFLETVKDDKE

Query:  TEFLHLTQGNLTVVQYERKFTELSRFAQDLVSTPERKIKRFVKGLRRKSEVPL
         EFLHL QG L+V QYERKFTELSRFA +L+     KIKRFVKGL +    P+
Subjt:  TEFLHLTQGNLTVVQYERKFTELSRFAQDLVSTPERKIKRFVKGLRRKSEVPL

A0A6J1DQ01 uncharacterized protein LOC111023250 (e value: 2.0e-52; % identity: 61.59)
Query:  YPNRAAEDPM-ESQFLRDFKRYGPPTFDGRSENPVAAERWIADLEAMYDYMNCDDRLMVRGAIFMLRDDARMWWKSIAEAEDHANNPVSWERFKDLLYDH
        +P R    P  E+QF++DFKRYGPPTFDG S+    AE W+ +LEA+Y Y+ C+D+  V+GA+FMLR  A  WW S+A AEDHAN PV+W RFKDLLYD+
Subjt:  YPNRAAEDPM-ESQFLRDFKRYGPPTFDGRSENPVAAERWIADLEAMYDYMNCDDRLMVRGAIFMLRDDARMWWKSIAEAEDHANNPVSWERFKDLLYDH

Query:  YFLETVKDDKETEFLHLTQGNLTVVQYERKFTELSRFAQDLVSTPERKIKRFVKGLRRKSEVPL
        Y+ ETVKD KE EFLH +QG LTV QYERKFTELSRFA++L+ T   KIKRFVKGLR+    P+
Subjt:  YFLETVKDDKETEFLHLTQGNLTVVQYERKFTELSRFAQDLVSTPERKIKRFVKGLRRKSEVPL

A0A6J1DUM2 uncharacterized protein LOC111023247 (e value: 1.1e-50; % identity: 57.78)
Query:  PVPRRRRGVVPPAPPPRYPNRAAEDPMESQFLRDFKRYGPPTFDGRSENPVAAERWIADLEAMYDYMNCDDRLMVRGAIFMLRDDARMWWKSIAEAEDHA
        P P    GV   APPP++ +       E++F++DFKRYGPPTFDG SE   A E WI +LEA+Y Y+ C+D+  V+GA+FMLR +A  WW S+A AED+A
Subjt:  PVPRRRRGVVPPAPPPRYPNRAAEDPMESQFLRDFKRYGPPTFDGRSENPVAAERWIADLEAMYDYMNCDDRLMVRGAIFMLRDDARMWWKSIAEAEDHA

Query:  NNPVSWERFKDLLYDHYFLETVKDDKETEFLHLTQGNLTVVQYERKFTELSRFAQDLVSTPERKIKRFVKGLRRKSEVPL
        N P+ W RFK+LLYD+Y+ ETVKD KE EFLHL QG L+V QYERKFTELSRFA +L+ T   KIKRFVKGLR+    P+
Subjt:  NNPVSWERFKDLLYDHYFLETVKDDKETEFLHLTQGNLTVVQYERKFTELSRFAQDLVSTPERKIKRFVKGLRRKSEVPL

A0A6J1DVA0 uncharacterized protein LOC111023424 (e value: 3.5e-49; % identity: 53.77)
Query:  DPPVPPVHQE--VHPPVP----------RRRRGVVPPAPPPRYPNRAAEDPMESQFLRDFKRYGPPTFDGRSENPVAAERWIADLEAMYDYMNCDDRLMV
        DPP PP+  +  V PP P              GV      P  P        E+QF++DFKRYGPPTF G SE    AE W+ +LEA+Y Y+ C+D+  V
Subjt:  DPPVPPVHQE--VHPPVP----------RRRRGVVPPAPPPRYPNRAAEDPMESQFLRDFKRYGPPTFDGRSENPVAAERWIADLEAMYDYMNCDDRLMV

Query:  RGAIFMLRDDARMWWKSIAEAEDHANNPVSWERFKDLLYDHYFLETVKDDKETEFLHLTQGNLTVVQYERKFTELSRFAQDLVSTPERKIKRFVKGLRR
        +GA+FMLR +A  WW S+A  EDHAN PV W RFK+LLYDHY+ ETV+D KE EFLHL QG LTV QYERKFTELS FA +L+ T   KIKRFVKGL +
Subjt:  RGAIFMLRDDARMWWKSIAEAEDHANNPVSWERFKDLLYDHYFLETVKDDKETEFLHLTQGNLTVVQYERKFTELSRFAQDLVSTPERKIKRFVKGLRR

A0A6J1DXQ7 uncharacterized protein LOC111025088 (e value: 7.4e-52; % identity: 61.9)
Query:  APPPRYPNRAAEDPMESQFLRDFKRYGPPTFDGRSENPVAAERWIADLEAMYDYMNCDDRLMVRGAIFMLRDDARMWWKSIAEAEDHANNPVSWERFKDL
        A PPR+ +       E+QF++DFKRYGPPTFDG SE   AAE W+ +LEA+Y Y+ C+D+  V+G +FMLR +A  WW SIA AEDHAN PV W RFKDL
Subjt:  APPPRYPNRAAEDPMESQFLRDFKRYGPPTFDGRSENPVAAERWIADLEAMYDYMNCDDRLMVRGAIFMLRDDARMWWKSIAEAEDHANNPVSWERFKDL

Query:  LYDHYFLETVKDDKETEFLHLTQGNLTVVQYERKFTELSRFAQDLVSTPERKIKRFVKGLRRKSEVPL
        LYD+Y+ ETVKD KE EFLHL QG LTV QYERKFTELSRFA + + T   KIKRFVKGLR+    P+
Subjt:  LYDHYFLETVKDDKETEFLHLTQGNLTVVQYERKFTELSRFAQDLVSTPERKIKRFVKGLRRKSEVPL

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGCTCGTGGTCGTGGTCGAGGTAGACCTAGGAACCCAGCTGTTAGGCAGGAGCACCAGGTAGAGGGTGATGATGTTCAGCAGGACCCTCCAGTCCCTCCCGTTCACCA
AGAGGTTCACCCTCCGGTTCCTCGCCGTCGTCGAGGAGTTGTACCTCCGGCACCTCCTCCGAGGTACCCCAATCGGGCAGCTGAAGACCCAATGGAGTCTCAATTCCTTC
GGGATTTTAAGCGTTATGGCCCTCCTACCTTCGATGGACGGTCAGAGAATCCTGTGGCGGCCGAGCGTTGGATCGCCGATTTGGAAGCCATGTACGATTACATGAATTGT
GATGACCGTCTGATGGTCAGAGGGGCGATATTTATGCTGAGGGATGATGCCCGCATGTGGTGGAAATCCATAGCTGAAGCGGAAGATCATGCTAACAATCCAGTCTCGTG
GGAGAGGTTCAAGGACCTTCTGTACGATCATTATTTTCTTGAGACTGTCAAGGATGATAAGGAGACAGAATTTTTGCACCTAACTCAAGGAAATTTGACAGTGGTGCAAT
ATGAAAGGAAATTCACTGAGCTCTCCCGCTTTGCTCAGGATCTGGTTAGCACGCCAGAACGAAAAATCAAGAGGTTCGTCAAAGGCCTCCGGAGGAAATCAGAGGTACCG
TTGCCTTGA
mRNA sequence
ATGGCTCGTGGTCGTGGTCGAGGTAGACCTAGGAACCCAGCTGTTAGGCAGGAGCACCAGGTAGAGGGTGATGATGTTCAGCAGGACCCTCCAGTCCCTCCCGTTCACCA
AGAGGTTCACCCTCCGGTTCCTCGCCGTCGTCGAGGAGTTGTACCTCCGGCACCTCCTCCGAGGTACCCCAATCGGGCAGCTGAAGACCCAATGGAGTCTCAATTCCTTC
GGGATTTTAAGCGTTATGGCCCTCCTACCTTCGATGGACGGTCAGAGAATCCTGTGGCGGCCGAGCGTTGGATCGCCGATTTGGAAGCCATGTACGATTACATGAATTGT
GATGACCGTCTGATGGTCAGAGGGGCGATATTTATGCTGAGGGATGATGCCCGCATGTGGTGGAAATCCATAGCTGAAGCGGAAGATCATGCTAACAATCCAGTCTCGTG
GGAGAGGTTCAAGGACCTTCTGTACGATCATTATTTTCTTGAGACTGTCAAGGATGATAAGGAGACAGAATTTTTGCACCTAACTCAAGGAAATTTGACAGTGGTGCAAT
ATGAAAGGAAATTCACTGAGCTCTCCCGCTTTGCTCAGGATCTGGTTAGCACGCCAGAACGAAAAATCAAGAGGTTCGTCAAAGGCCTCCGGAGGAAATCAGAGGTACCG
TTGCCTTGA
Protein sequence
MARGRGRGRPRNPAVRQEHQVEGDDVQQDPPVPPVHQEVHPPVPRRRRGVVPPAPPPRYPNRAAEDPMESQFLRDFKRYGPPTFDGRSENPVAAERWIADLEAMYDYMNC
DDRLMVRGAIFMLRDDARMWWKSIAEAEDHANNPVSWERFKDLLYDHYFLETVKDDKETEFLHLTQGNLTVVQYERKFTELSRFAQDLVSTPERKIKRFVKGLRRKSEVP
LP
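
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below is one way to do that, assuming the standard genetic code (NCBI translation table 1); the record does not say which table was used, and the names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
# Assumption: the record uses this table; the source does not say so.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped
    sequence lines can be pasted in directly. Codons containing
    anything other than A, C, G or T are reported as 'X'.
    """
    sequence = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(sequence) - 2, 3):
        aa = CODON_TABLE.get(sequence[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 nt and last 18 nt of the CDS listed above.
    print(translate("ATGGCTCGTGGTCGTGGTCGAGGTAGACCT"))
    print(translate("GAGGTACCGTTGCCTTGA"))
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence (after joining its wrapped lines) is a quick consistency check on the record.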