; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035803 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035803
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr3:30607511..30608269
RNA-Seq ExpressionLag0035803
SyntenyLag0035803
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151688.1 uncharacterized protein LOC111019603 [Momordica charantia]7.1e-2935.65Show/hide
Query:  HLEANPPVPPPAPLVVLKAEALLALLSNAVQNNLQHA-SANQAPTRGKDVQFFRSFMKAKPLSFDGQLGSSHVVEEWTSELEALFQYLGVDAQQCVQGAT
        H  A+P        V L   AL AL+ N++      A    QA     + QF R F +  P +F+G+   + VVEEW  ELEAL+ YLG   Q  V+GA 
Subjt:  HLEANPPVPPPAPLVVLKAEALLALLSNAVQNNLQHA-SANQAPTRGKDVQFFRSFMKAKPLSFDGQLGSSHVVEEWTSELEALFQYLGVDAQQCVQGAT

Query:  SILRGHARNWWNVMGQFENRPDNSLSWLGFKGLVRDQFGRHFLGADVDLEVEFVSLVQGTMTVAQYIRRFEELSCRAPELVATEASRINHFFNGLCFEIR
         +LRG A NWW+V+   E+  +  ++W   K L+ D +    +    + E+EF+ L Q T+ VAQY ++F E S  A +L+ TEA +I  F  GL   I+
Subjt:  SILRGHARNWWNVMGQFENRPDNSLSWLGFKGLVRDQFGRHFLGADVDLEVEFVSLVQGTMTVAQYIRRFEELSCRAPELVATEASRINHFFNGLCFEIR

Query:  GLVKLGRPDTFTATLASVRMLDDDIPRMAQ
        G + L RP T+   +    ++D D+   AQ
Subjt:  GLVKLGRPDTFTATLASVRMLDDDIPRMAQ

XP_022155000.1 uncharacterized protein LOC111022144 [Momordica charantia]6.7e-2738.59Show/hide
Query:  DVQFFRSFMKAKPLSFDGQLGSSHVVEEWTSELEALFQYLGVDAQQCVQGATSILRGHARNWWNVMGQFENRPDNSLSWLGFKGLVRDQFGRHFLGADVD
        +  F + F +  P +FDG+   +   EEW  ELEA + YLG + Q  V+GA  +LRG A NWW+ +   E+  + ++ W  FK L+ D +   +L    D
Subjt:  DVQFFRSFMKAKPLSFDGQLGSSHVVEEWTSELEALFQYLGVDAQQCVQGATSILRGHARNWWNVMGQFENRPDNSLSWLGFKGLVRDQFGRHFLGADVD

Query:  L-EVEFVSLVQGTMTVAQYIRRFEELSCRAPELVATEASRINHFFNGLCFEIRGLVKLGRPDTFTATLASVRMLDDDIPRMAQS
        + E EF+ LVQGT++VAQY R+F ELS  A EL+   A +I  F  GL   IRG V L RP ++   +    ++D D+   A S
Subjt:  L-EVEFVSLVQGTMTVAQYIRRFEELSCRAPELVATEASRINHFFNGLCFEIRGLVKLGRPDTFTATLASVRMLDDDIPRMAQS

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]4.6e-2840.54Show/hide
Query:  TRGKDVQFFRSFMKAKPLSFDGQLGSSHVVEEWTSELEALFQYLGVDAQQCVQGATSILRGHARNWWNVMGQFENRPDNSLSWLGFKGLVRDQFGRHFLG
        T   + +F + F +  P +FDG+   +  VEEW  ELEAL+ YLG + Q  V+GA  +LRG A NWW+ +   E+  +  + W  FK L+ D +    + 
Subjt:  TRGKDVQFFRSFMKAKPLSFDGQLGSSHVVEEWTSELEALFQYLGVDAQQCVQGATSILRGHARNWWNVMGQFENRPDNSLSWLGFKGLVRDQFGRHFLG

Query:  ADVDLEVEFVSLVQGTMTVAQYIRRFEELSCRAPELVATEASRINHFFNGLCFEIRGLVKLGRPDTFTATLASVRMLDDDIPRMA
         D+  E EF+ LVQGT++VAQY R+F ELS  A EL+ TEA +I  F  GL   IRG V L RP T+   +    ++D D+   A
Subjt:  ADVDLEVEFVSLVQGTMTVAQYIRRFEELSCRAPELVATEASRINHFFNGLCFEIRGLVKLGRPDTFTATLASVRMLDDDIPRMA

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]1.5e-2637.33Show/hide
Query:  PPVPPPAPLVVLKAEALLALLSNAVQNNLQHASAN-----QAPTRGK----DVQFFRSFMKAKPLSFDGQLGSSHVVEEWTSELEALFQYLGVDAQQCVQ
        PPVP  AP  V +    +ALL+ A+Q  L +A+       Q P R +    +VQF R F    P  F+G        EEW  ELEAL+ YLG      V+
Subjt:  PPVPPPAPLVVLKAEALLALLSNAVQNNLQHASAN-----QAPTRGK----DVQFFRSFMKAKPLSFDGQLGSSHVVEEWTSELEALFQYLGVDAQQCVQ

Query:  GATSILRGHARNWWNVMGQFENRPDNSLSWLGFKGLVRDQFGRHFLGADVDLEVEFVSLVQGTMTVAQYIRRFEELSCRAPELVATEASRINHFFNGLCF
        GA  +LRG A NWW  +   E+  +  ++W  FK L+ + +    + A  +  VEF+ L QG++TVAQY R+F ELS    + V TE  +I+ F +GL  
Subjt:  GATSILRGHARNWWNVMGQFENRPDNSLSWLGFKGLVRDQFGRHFLGADVDLEVEFVSLVQGTMTVAQYIRRFEELSCRAPELVATEASRINHFFNGLCF

Query:  EIRGLVKLGRPDTFTATLASVRMLD
        EI+GL+ L  P T+ A +    ++D
Subjt:  EIRGLVKLGRPDTFTATLASVRMLD

XP_022156546.1 uncharacterized protein LOC111023424 [Momordica charantia]1.9e-2937.5Show/hide
Query:  PPVPPPAPLVVLKAEALLALLSNAVQNNLQHASANQAPTRGKDVQFFRSFMKAKPLSFDGQLGSSHVVEEWTSELEALFQYLGVDAQQCVQGATSILRGH
        PP PP A      A+ L  + + A     Q        T   + QF + F +  P +F G    + + EEW  ELEAL+ YLG + Q  V+GA  +LR  
Subjt:  PPVPPPAPLVVLKAEALLALLSNAVQNNLQHASANQAPTRGKDVQFFRSFMKAKPLSFDGQLGSSHVVEEWTSELEALFQYLGVDAQQCVQGATSILRGH

Query:  ARNWWNVMGQFENRPDNSLSWLGFKGLVRDQFGRHFLGADVDLEVEFVSLVQGTMTVAQYIRRFEELSCRAPELVATEASRINHFFNGLCFEIRGLVKLG
        A NWW+ +   E+  +  + W  FK L+ D + R  +  +   EVEF+ LVQGT+TVAQY R+F ELS  A EL+ TEA +I  F  GL   IRG V L 
Subjt:  ARNWWNVMGQFENRPDNSLSWLGFKGLVRDQFGRHFLGADVDLEVEFVSLVQGTMTVAQYIRRFEELSCRAPELVATEASRINHFFNGLCFEIRGLVKLG

Query:  RPDTFTATLASVRMLDDDIPRMAQ
        RP T+   +    ++D D+    Q
Subjt:  RPDTFTATLASVRMLDDDIPRMAQ

TrEMBL top hitse value%identityAlignment
A0A6J1DCW8 uncharacterized protein LOC1110196033.5e-2935.65Show/hide
Query:  HLEANPPVPPPAPLVVLKAEALLALLSNAVQNNLQHA-SANQAPTRGKDVQFFRSFMKAKPLSFDGQLGSSHVVEEWTSELEALFQYLGVDAQQCVQGAT
        H  A+P        V L   AL AL+ N++      A    QA     + QF R F +  P +F+G+   + VVEEW  ELEAL+ YLG   Q  V+GA 
Subjt:  HLEANPPVPPPAPLVVLKAEALLALLSNAVQNNLQHA-SANQAPTRGKDVQFFRSFMKAKPLSFDGQLGSSHVVEEWTSELEALFQYLGVDAQQCVQGAT

Query:  SILRGHARNWWNVMGQFENRPDNSLSWLGFKGLVRDQFGRHFLGADVDLEVEFVSLVQGTMTVAQYIRRFEELSCRAPELVATEASRINHFFNGLCFEIR
         +LRG A NWW+V+   E+  +  ++W   K L+ D +    +    + E+EF+ L Q T+ VAQY ++F E S  A +L+ TEA +I  F  GL   I+
Subjt:  SILRGHARNWWNVMGQFENRPDNSLSWLGFKGLVRDQFGRHFLGADVDLEVEFVSLVQGTMTVAQYIRRFEELSCRAPELVATEASRINHFFNGLCFEIR

Query:  GLVKLGRPDTFTATLASVRMLDDDIPRMAQ
        G + L RP T+   +    ++D D+   AQ
Subjt:  GLVKLGRPDTFTATLASVRMLDDDIPRMAQ

A0A6J1DL73 uncharacterized protein LOC1110221443.2e-2738.59Show/hide
Query:  DVQFFRSFMKAKPLSFDGQLGSSHVVEEWTSELEALFQYLGVDAQQCVQGATSILRGHARNWWNVMGQFENRPDNSLSWLGFKGLVRDQFGRHFLGADVD
        +  F + F +  P +FDG+   +   EEW  ELEA + YLG + Q  V+GA  +LRG A NWW+ +   E+  + ++ W  FK L+ D +   +L    D
Subjt:  DVQFFRSFMKAKPLSFDGQLGSSHVVEEWTSELEALFQYLGVDAQQCVQGATSILRGHARNWWNVMGQFENRPDNSLSWLGFKGLVRDQFGRHFLGADVD

Query:  L-EVEFVSLVQGTMTVAQYIRRFEELSCRAPELVATEASRINHFFNGLCFEIRGLVKLGRPDTFTATLASVRMLDDDIPRMAQS
        + E EF+ LVQGT++VAQY R+F ELS  A EL+   A +I  F  GL   IRG V L RP ++   +    ++D D+   A S
Subjt:  L-EVEFVSLVQGTMTVAQYIRRFEELSCRAPELVATEASRINHFFNGLCFEIRGLVKLGRPDTFTATLASVRMLDDDIPRMAQS

A0A6J1DQB9 Reverse transcriptase7.2e-2737.33Show/hide
Query:  PPVPPPAPLVVLKAEALLALLSNAVQNNLQHASAN-----QAPTRGK----DVQFFRSFMKAKPLSFDGQLGSSHVVEEWTSELEALFQYLGVDAQQCVQ
        PPVP  AP  V +    +ALL+ A+Q  L +A+       Q P R +    +VQF R F    P  F+G        EEW  ELEAL+ YLG      V+
Subjt:  PPVPPPAPLVVLKAEALLALLSNAVQNNLQHASAN-----QAPTRGK----DVQFFRSFMKAKPLSFDGQLGSSHVVEEWTSELEALFQYLGVDAQQCVQ

Query:  GATSILRGHARNWWNVMGQFENRPDNSLSWLGFKGLVRDQFGRHFLGADVDLEVEFVSLVQGTMTVAQYIRRFEELSCRAPELVATEASRINHFFNGLCF
        GA  +LRG A NWW  +   E+  +  ++W  FK L+ + +    + A  +  VEF+ L QG++TVAQY R+F ELS    + V TE  +I+ F +GL  
Subjt:  GATSILRGHARNWWNVMGQFENRPDNSLSWLGFKGLVRDQFGRHFLGADVDLEVEFVSLVQGTMTVAQYIRRFEELSCRAPELVATEASRINHFFNGLCF

Query:  EIRGLVKLGRPDTFTATLASVRMLD
        EI+GL+ L  P T+ A +    ++D
Subjt:  EIRGLVKLGRPDTFTATLASVRMLD

A0A6J1DUM2 uncharacterized protein LOC1110232472.2e-2840.54Show/hide
Query:  TRGKDVQFFRSFMKAKPLSFDGQLGSSHVVEEWTSELEALFQYLGVDAQQCVQGATSILRGHARNWWNVMGQFENRPDNSLSWLGFKGLVRDQFGRHFLG
        T   + +F + F +  P +FDG+   +  VEEW  ELEAL+ YLG + Q  V+GA  +LRG A NWW+ +   E+  +  + W  FK L+ D +    + 
Subjt:  TRGKDVQFFRSFMKAKPLSFDGQLGSSHVVEEWTSELEALFQYLGVDAQQCVQGATSILRGHARNWWNVMGQFENRPDNSLSWLGFKGLVRDQFGRHFLG

Query:  ADVDLEVEFVSLVQGTMTVAQYIRRFEELSCRAPELVATEASRINHFFNGLCFEIRGLVKLGRPDTFTATLASVRMLDDDIPRMA
         D+  E EF+ LVQGT++VAQY R+F ELS  A EL+ TEA +I  F  GL   IRG V L RP T+   +    ++D D+   A
Subjt:  ADVDLEVEFVSLVQGTMTVAQYIRRFEELSCRAPELVATEASRINHFFNGLCFEIRGLVKLGRPDTFTATLASVRMLDDDIPRMA

A0A6J1DVA0 uncharacterized protein LOC1110234249.1e-3037.5Show/hide
Query:  PPVPPPAPLVVLKAEALLALLSNAVQNNLQHASANQAPTRGKDVQFFRSFMKAKPLSFDGQLGSSHVVEEWTSELEALFQYLGVDAQQCVQGATSILRGH
        PP PP A      A+ L  + + A     Q        T   + QF + F +  P +F G    + + EEW  ELEAL+ YLG + Q  V+GA  +LR  
Subjt:  PPVPPPAPLVVLKAEALLALLSNAVQNNLQHASANQAPTRGKDVQFFRSFMKAKPLSFDGQLGSSHVVEEWTSELEALFQYLGVDAQQCVQGATSILRGH

Query:  ARNWWNVMGQFENRPDNSLSWLGFKGLVRDQFGRHFLGADVDLEVEFVSLVQGTMTVAQYIRRFEELSCRAPELVATEASRINHFFNGLCFEIRGLVKLG
        A NWW+ +   E+  +  + W  FK L+ D + R  +  +   EVEF+ LVQGT+TVAQY R+F ELS  A EL+ TEA +I  F  GL   IRG V L 
Subjt:  ARNWWNVMGQFENRPDNSLSWLGFKGLVRDQFGRHFLGADVDLEVEFVSLVQGTMTVAQYIRRFEELSCRAPELVATEASRINHFFNGLCFEIRGLVKLG

Query:  RPDTFTATLASVRMLDDDIPRMAQ
        RP T+   +    ++D D+    Q
Subjt:  RPDTFTATLASVRMLDDDIPRMAQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTATAGCGAAGGACAAGGAAGTGGACGTTCTGGCCCTCTAGACGTTCTTGAAATCAATTCGCATCTCGAGGCGAATCCCCCTGTTCCTCCCCCAGCGCCTCTTGT
GGTGCTGAAAGCGGAGGCATTGCTAGCTTTGCTTAGTAATGCAGTCCAAAATAATTTGCAGCACGCCAGTGCGAATCAAGCCCCTACTCGTGGCAAGGATGTGCAGTTTT
TTCGGAGCTTCATGAAGGCAAAGCCTCTTTCGTTTGATGGTCAGCTTGGAAGTTCACACGTTGTAGAAGAGTGGACTTCGGAGTTGGAAGCCCTATTTCAATATCTTGGA
GTTGACGCCCAGCAGTGTGTCCAAGGAGCTACCTCTATACTTAGAGGCCATGCACGCAATTGGTGGAATGTAATGGGTCAGTTTGAGAATCGCCCGGACAATTCTCTGTC
GTGGTTAGGGTTTAAGGGTCTTGTGCGAGACCAATTTGGCCGACATTTCCTCGGCGCTGATGTTGATCTTGAAGTAGAGTTCGTCTCGCTCGTGCAAGGGACCATGACCG
TGGCGCAGTATATTAGAAGGTTCGAGGAGTTGTCTTGTCGTGCTCCTGAGTTGGTCGCCACCGAGGCAAGCAGGATCAACCATTTCTTCAATGGTTTGTGTTTTGAAATT
AGAGGGTTGGTCAAGCTAGGACGACCAGACACTTTCACAGCAACTCTTGCGAGCGTTCGGATGTTGGATGATGACATCCCTAGGATGGCGCAGTCATAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTTATAGCGAAGGACAAGGAAGTGGACGTTCTGGCCCTCTAGACGTTCTTGAAATCAATTCGCATCTCGAGGCGAATCCCCCTGTTCCTCCCCCAGCGCCTCTTGT
GGTGCTGAAAGCGGAGGCATTGCTAGCTTTGCTTAGTAATGCAGTCCAAAATAATTTGCAGCACGCCAGTGCGAATCAAGCCCCTACTCGTGGCAAGGATGTGCAGTTTT
TTCGGAGCTTCATGAAGGCAAAGCCTCTTTCGTTTGATGGTCAGCTTGGAAGTTCACACGTTGTAGAAGAGTGGACTTCGGAGTTGGAAGCCCTATTTCAATATCTTGGA
GTTGACGCCCAGCAGTGTGTCCAAGGAGCTACCTCTATACTTAGAGGCCATGCACGCAATTGGTGGAATGTAATGGGTCAGTTTGAGAATCGCCCGGACAATTCTCTGTC
GTGGTTAGGGTTTAAGGGTCTTGTGCGAGACCAATTTGGCCGACATTTCCTCGGCGCTGATGTTGATCTTGAAGTAGAGTTCGTCTCGCTCGTGCAAGGGACCATGACCG
TGGCGCAGTATATTAGAAGGTTCGAGGAGTTGTCTTGTCGTGCTCCTGAGTTGGTCGCCACCGAGGCAAGCAGGATCAACCATTTCTTCAATGGTTTGTGTTTTGAAATT
AGAGGGTTGGTCAAGCTAGGACGACCAGACACTTTCACAGCAACTCTTGCGAGCGTTCGGATGTTGGATGATGACATCCCTAGGATGGCGCAGTCATAG
Protein sequenceShow/hide protein sequence
MSYSEGQGSGRSGPLDVLEINSHLEANPPVPPPAPLVVLKAEALLALLSNAVQNNLQHASANQAPTRGKDVQFFRSFMKAKPLSFDGQLGSSHVVEEWTSELEALFQYLG
VDAQQCVQGATSILRGHARNWWNVMGQFENRPDNSLSWLGFKGLVRDQFGRHFLGADVDLEVEFVSLVQGTMTVAQYIRRFEELSCRAPELVATEASRINHFFNGLCFEI
RGLVKLGRPDTFTATLASVRMLDDDIPRMAQS