; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018264 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018264
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr5:20798283..20799503
RNA-Seq ExpressionLag0018264
SyntenyLag0018264
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151688.1 uncharacterized protein LOC111019603 [Momordica charantia]2.3e-4541.31Show/hide
Query:  IPPDQRRVDPPPAPPTAPMLITLETFQTMFDNMAQRNGLQALR---------------------APSFDGQSKNPLAAERWIVDLEALFELMNCNDPLKI
        I  +    DP     TA + +     Q + DN       QAL+                      P+F+G+S+     E WI +LEAL+  + C+D LK+
Subjt:  IPPDQRRVDPPPAPPTAPMLITLETFQTMFDNMAQRNGLQALR---------------------APSFDGQSKNPLAAERWIVDLEALFELMNCNDPLKI

Query:  RGAVFMLKDDARMWWKPMAAVEDHANQPISWKRFKDLLYDYYFPETVKDEKEVEFLHLAQ-----------------------GTPERKIKRFIKGLHEE
        +GAVFML+ +A  WW  +A VEDH N+PI+W   KDLLYDYYFP+T+KDEKE+EFLHL Q                        T  RKIKRF++GL + 
Subjt:  RGAVFMLKDDARMWWKPMAAVEDHANQPISWKRFKDLLYDYYFPETVKDEKEVEFLHLAQ-----------------------GTPERKIKRFIKGLHEE

Query:  IRGSIALSMPTTFAEALTGALIMDKNVPKKAQPHLEKGSTSGIKRKLSPLRNPPIESTQ
        I+G I L  PTT+AEA+ GAL+MDK+V +KAQP  + G +SG+KRK+     PPI S+Q
Subjt:  IRGSIALSMPTTFAEALTGALIMDKNVPKKAQPHLEKGSTSGIKRKLSPLRNPPIESTQ

XP_022155000.1 uncharacterized protein LOC111022144 [Momordica charantia]9.0e-5048.4Show/hide
Query:  PSFDGQSKNPLAAERWIVDLEALFELMNCNDPLKIRGAVFMLKDDARMWWKPMAAVEDHANQPISWKRFKDLLYDYYFPETVKDEKEVEFLHLAQGT---
        P+FDG+S+   AAE WI +LEA +  + C D  K++GAVFML+ +A  WW  +AA EDHAN  I W RFKDLLYDYY+ ETVKD KE EFLHL QGT   
Subjt:  PSFDGQSKNPLAAERWIVDLEALFELMNCNDPLKIRGAVFMLKDDARMWWKPMAAVEDHANQPISWKRFKDLLYDYYFPETVKDEKEVEFLHLAQGT---

Query:  --------------------PERKIKRFIKGLHEEIRGSIALSMPTTFAEALTGALIMDKNVPKKAQPHLEKGSTSGIKRKLSP-LRNPPIESTQQQVKE
                               KIKRF+KGL + IRG + L  P ++AEA+ GALIMDK+V  KA    E GS+SG+KRK  P   +P + + Q Q + 
Subjt:  --------------------PERKIKRFIKGLHEEIRGSIALSMPTTFAEALTGALIMDKNVPKKAQPHLEKGSTSGIKRKLSP-LRNPPIESTQQQVKE

Query:  YVPYPPCPFCHKLHKGECW
            P CP C K H G+CW
Subjt:  YVPYPPCPFCHKLHKGECW

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]3.7e-5148.86Show/hide
Query:  PSFDGQSKNPLAAERWIVDLEALFELMNCNDPLKIRGAVFMLKDDARMWWKPMAAVEDHANQPISWKRFKDLLYDYYFPETVKDEKEVEFLHLAQG----
        P+FDG+S+   A E WI +LEAL+  + C D  K++GAVFML+ +A  WW  +AA ED+AN PI W RFK+LLYDYY+PETVKD KE EFLHL QG    
Subjt:  PSFDGQSKNPLAAERWIVDLEALFELMNCNDPLKIRGAVFMLKDDARMWWKPMAAVEDHANQPISWKRFKDLLYDYYFPETVKDEKEVEFLHLAQG----

Query:  -------------------TPERKIKRFIKGLHEEIRGSIALSMPTTFAEALTGALIMDKNVPKKAQPHLEKGSTSGIKRKL-SPLRNPPIESTQQQVKE
                           T   KIKRF+KGL + IRG + L  PTT+AEA+ GAL+MDK+V  KA P  E GS+SG+KRK  S   +  + + Q+Q + 
Subjt:  -------------------TPERKIKRFIKGLHEEIRGSIALSMPTTFAEALTGALIMDKNVPKKAQPHLEKGSTSGIKRKL-SPLRNPPIESTQQQVKE

Query:  YVPYPPCPFCHKLHKGECW
            P CP C K H G+CW
Subjt:  YVPYPPCPFCHKLHKGECW

XP_022156546.1 uncharacterized protein LOC111023424 [Momordica charantia]4.8e-4340.73Show/hide
Query:  VDPLAPLFQEV-NPLIPP--DQRRVDPPPAPPTAPMLITLETFQTMFDNMAQ--------RNGLQALR------APSFDGQSKNPLAAERWIVDLEALFE
        VDP AP+ + V +P  PP  DQ  V PP  P  A  L  +     +     Q        ++  Q ++       P+F G S+    AE W+ +LEAL+ 
Subjt:  VDPLAPLFQEV-NPLIPP--DQRRVDPPPAPPTAPMLITLETFQTMFDNMAQ--------RNGLQALR------APSFDGQSKNPLAAERWIVDLEALFE

Query:  LMNCNDPLKIRGAVFMLKDDARMWWKPMAAVEDHANQPISWKRFKDLLYDYYFPETVKDEKEVEFLHLAQG-----------------------TPERKI
         + C D  K++GAVFML+ +A  WW  +AA EDHAN P+ W RFK+LLYD+Y+ ETV+D KEVEFLHL QG                       T   KI
Subjt:  LMNCNDPLKIRGAVFMLKDDARMWWKPMAAVEDHANQPISWKRFKDLLYDYYFPETVKDEKEVEFLHLAQG-----------------------TPERKI

Query:  KRFIKGLHEEIRGSIALSMPTTFAEALTGALIMDKNVPKKAQPHLEKGSTSGIKRKLSP-LRNPPIESTQQQVKE
        KRF+KGLH+ IRGS+ L  P T+AEA+ G LIMDK+V  + QP +E GS+ G+KRK+ P   + P  + Q+  ++
Subjt:  KRFIKGLHEEIRGSIALSMPTTFAEALTGALIMDKNVPKKAQPHLEKGSTSGIKRKLSP-LRNPPIESTQQQVKE

XP_022158750.1 uncharacterized protein LOC111025215 [Momordica charantia]2.1e-4338.79Show/hide
Query:  NPLIPPDQRRVDPPPAPPTA---------PMLITLETFQTMFDNMAQRNGLQALRAPSF-----DGQSKNPLAAERWIVDLEALFELMNCNDPLKIRGAV
        +P +PP       PP P  A          + +  E  Q + DN     G Q ++ P +     +  S+ P AAE W+ +LEAL+  + C+D  K+RGAV
Subjt:  NPLIPPDQRRVDPPPAPPTA---------PMLITLETFQTMFDNMAQRNGLQALRAPSF-----DGQSKNPLAAERWIVDLEALFELMNCNDPLKIRGAV

Query:  FMLKDDARMWWKPMAAVEDHANQPISWKRFKDLLYDYYFPETVKDEKEVEFLHLAQG-----------------------TPERKIKRFIKGLHEEIRGS
        FML+ +A  WW+ +AA EDHAN P++W RFKDLLY+YYFP TV++EK VEFL L QG                       T + KI +FI GL  EI+G 
Subjt:  FMLKDDARMWWKPMAAVEDHANQPISWKRFKDLLYDYYFPETVKDEKEVEFLHLAQG-----------------------TPERKIKRFIKGLHEEIRGS

Query:  IALSMPTTFAEALTGALIMDKNVPKKAQPHLEKGSTSGIKRKLSPL-RNPPIESTQQQVKEYVPYPPCPFCHKLHKGECWL
        + L  PTT+A A+  AL+MDK + ++ Q     GS+SG+KRK +    + P    Q  V+     P CP C K H G CW+
Subjt:  IALSMPTTFAEALTGALIMDKNVPKKAQPHLEKGSTSGIKRKLSPL-RNPPIESTQQQVKEYVPYPPCPFCHKLHKGECWL

TrEMBL top hitse value%identityAlignment
A0A6J1DCW8 uncharacterized protein LOC1110196031.1e-4541.31Show/hide
Query:  IPPDQRRVDPPPAPPTAPMLITLETFQTMFDNMAQRNGLQALR---------------------APSFDGQSKNPLAAERWIVDLEALFELMNCNDPLKI
        I  +    DP     TA + +     Q + DN       QAL+                      P+F+G+S+     E WI +LEAL+  + C+D LK+
Subjt:  IPPDQRRVDPPPAPPTAPMLITLETFQTMFDNMAQRNGLQALR---------------------APSFDGQSKNPLAAERWIVDLEALFELMNCNDPLKI

Query:  RGAVFMLKDDARMWWKPMAAVEDHANQPISWKRFKDLLYDYYFPETVKDEKEVEFLHLAQ-----------------------GTPERKIKRFIKGLHEE
        +GAVFML+ +A  WW  +A VEDH N+PI+W   KDLLYDYYFP+T+KDEKE+EFLHL Q                        T  RKIKRF++GL + 
Subjt:  RGAVFMLKDDARMWWKPMAAVEDHANQPISWKRFKDLLYDYYFPETVKDEKEVEFLHLAQ-----------------------GTPERKIKRFIKGLHEE

Query:  IRGSIALSMPTTFAEALTGALIMDKNVPKKAQPHLEKGSTSGIKRKLSPLRNPPIESTQ
        I+G I L  PTT+AEA+ GAL+MDK+V +KAQP  + G +SG+KRK+     PPI S+Q
Subjt:  IRGSIALSMPTTFAEALTGALIMDKNVPKKAQPHLEKGSTSGIKRKLSPLRNPPIESTQ

A0A6J1DL73 uncharacterized protein LOC1110221444.4e-5048.4Show/hide
Query:  PSFDGQSKNPLAAERWIVDLEALFELMNCNDPLKIRGAVFMLKDDARMWWKPMAAVEDHANQPISWKRFKDLLYDYYFPETVKDEKEVEFLHLAQGT---
        P+FDG+S+   AAE WI +LEA +  + C D  K++GAVFML+ +A  WW  +AA EDHAN  I W RFKDLLYDYY+ ETVKD KE EFLHL QGT   
Subjt:  PSFDGQSKNPLAAERWIVDLEALFELMNCNDPLKIRGAVFMLKDDARMWWKPMAAVEDHANQPISWKRFKDLLYDYYFPETVKDEKEVEFLHLAQGT---

Query:  --------------------PERKIKRFIKGLHEEIRGSIALSMPTTFAEALTGALIMDKNVPKKAQPHLEKGSTSGIKRKLSP-LRNPPIESTQQQVKE
                               KIKRF+KGL + IRG + L  P ++AEA+ GALIMDK+V  KA    E GS+SG+KRK  P   +P + + Q Q + 
Subjt:  --------------------PERKIKRFIKGLHEEIRGSIALSMPTTFAEALTGALIMDKNVPKKAQPHLEKGSTSGIKRKLSP-LRNPPIESTQQQVKE

Query:  YVPYPPCPFCHKLHKGECW
            P CP C K H G+CW
Subjt:  YVPYPPCPFCHKLHKGECW

A0A6J1DUM2 uncharacterized protein LOC1110232471.8e-5148.86Show/hide
Query:  PSFDGQSKNPLAAERWIVDLEALFELMNCNDPLKIRGAVFMLKDDARMWWKPMAAVEDHANQPISWKRFKDLLYDYYFPETVKDEKEVEFLHLAQG----
        P+FDG+S+   A E WI +LEAL+  + C D  K++GAVFML+ +A  WW  +AA ED+AN PI W RFK+LLYDYY+PETVKD KE EFLHL QG    
Subjt:  PSFDGQSKNPLAAERWIVDLEALFELMNCNDPLKIRGAVFMLKDDARMWWKPMAAVEDHANQPISWKRFKDLLYDYYFPETVKDEKEVEFLHLAQG----

Query:  -------------------TPERKIKRFIKGLHEEIRGSIALSMPTTFAEALTGALIMDKNVPKKAQPHLEKGSTSGIKRKL-SPLRNPPIESTQQQVKE
                           T   KIKRF+KGL + IRG + L  PTT+AEA+ GAL+MDK+V  KA P  E GS+SG+KRK  S   +  + + Q+Q + 
Subjt:  -------------------TPERKIKRFIKGLHEEIRGSIALSMPTTFAEALTGALIMDKNVPKKAQPHLEKGSTSGIKRKL-SPLRNPPIESTQQQVKE

Query:  YVPYPPCPFCHKLHKGECW
            P CP C K H G+CW
Subjt:  YVPYPPCPFCHKLHKGECW

A0A6J1DVA0 uncharacterized protein LOC1110234242.3e-4340.73Show/hide
Query:  VDPLAPLFQEV-NPLIPP--DQRRVDPPPAPPTAPMLITLETFQTMFDNMAQ--------RNGLQALR------APSFDGQSKNPLAAERWIVDLEALFE
        VDP AP+ + V +P  PP  DQ  V PP  P  A  L  +     +     Q        ++  Q ++       P+F G S+    AE W+ +LEAL+ 
Subjt:  VDPLAPLFQEV-NPLIPP--DQRRVDPPPAPPTAPMLITLETFQTMFDNMAQ--------RNGLQALR------APSFDGQSKNPLAAERWIVDLEALFE

Query:  LMNCNDPLKIRGAVFMLKDDARMWWKPMAAVEDHANQPISWKRFKDLLYDYYFPETVKDEKEVEFLHLAQG-----------------------TPERKI
         + C D  K++GAVFML+ +A  WW  +AA EDHAN P+ W RFK+LLYD+Y+ ETV+D KEVEFLHL QG                       T   KI
Subjt:  LMNCNDPLKIRGAVFMLKDDARMWWKPMAAVEDHANQPISWKRFKDLLYDYYFPETVKDEKEVEFLHLAQG-----------------------TPERKI

Query:  KRFIKGLHEEIRGSIALSMPTTFAEALTGALIMDKNVPKKAQPHLEKGSTSGIKRKLSP-LRNPPIESTQQQVKE
        KRF+KGLH+ IRGS+ L  P T+AEA+ G LIMDK+V  + QP +E GS+ G+KRK+ P   + P  + Q+  ++
Subjt:  KRFIKGLHEEIRGSIALSMPTTFAEALTGALIMDKNVPKKAQPHLEKGSTSGIKRKLSP-LRNPPIESTQQQVKE

A0A6J1DWP4 uncharacterized protein LOC1110252151.0e-4338.79Show/hide
Query:  NPLIPPDQRRVDPPPAPPTA---------PMLITLETFQTMFDNMAQRNGLQALRAPSF-----DGQSKNPLAAERWIVDLEALFELMNCNDPLKIRGAV
        +P +PP       PP P  A          + +  E  Q + DN     G Q ++ P +     +  S+ P AAE W+ +LEAL+  + C+D  K+RGAV
Subjt:  NPLIPPDQRRVDPPPAPPTA---------PMLITLETFQTMFDNMAQRNGLQALRAPSF-----DGQSKNPLAAERWIVDLEALFELMNCNDPLKIRGAV

Query:  FMLKDDARMWWKPMAAVEDHANQPISWKRFKDLLYDYYFPETVKDEKEVEFLHLAQG-----------------------TPERKIKRFIKGLHEEIRGS
        FML+ +A  WW+ +AA EDHAN P++W RFKDLLY+YYFP TV++EK VEFL L QG                       T + KI +FI GL  EI+G 
Subjt:  FMLKDDARMWWKPMAAVEDHANQPISWKRFKDLLYDYYFPETVKDEKEVEFLHLAQG-----------------------TPERKIKRFIKGLHEEIRGS

Query:  IALSMPTTFAEALTGALIMDKNVPKKAQPHLEKGSTSGIKRKLSPL-RNPPIESTQQQVKEYVPYPPCPFCHKLHKGECWL
        + L  PTT+A A+  AL+MDK + ++ Q     GS+SG+KRK +    + P    Q  V+     P CP C K H G CW+
Subjt:  IALSMPTTFAEALTGALIMDKNVPKKAQPHLEKGSTSGIKRKLSPL-RNPPIESTQQQVKEYVPYPPCPFCHKLHKGECWL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTGTGGTCATGATCCTGAAGTTCCAATTGTCAGGCAGGATGACCAAGCAAAGGAAGTTACTACTCAGCAAGGGGTCGATCCTCTGGCTCCCCTTTTTCAGGAGGT
TAATCCCCTGATTCCTCCCGATCAGCGTAGAGTTGATCCTCCTCCAGCCCCTCCTACAGCTCCTATGCTGATCACTCTGGAAACCTTTCAGACCATGTTCGATAACATGG
CCCAGAGAAATGGACTTCAAGCGCTACGGGCTCCCTCTTTTGATGGGCAATCCAAAAATCCGTTGGCAGCAGAACGATGGATTGTTGATTTAGAGGCACTGTTTGAGCTC
ATGAACTGTAATGATCCCCTGAAGATCAGAGGAGCGGTCTTCATGCTCAAGGACGACGCTCGCATGTGGTGGAAGCCTATGGCAGCTGTCGAAGATCATGCCAATCAACC
GATTTCTTGGAAAAGGTTTAAAGACCTGTTGTACGATTATTACTTCCCGGAGACAGTCAAGGATGAAAAAGAAGTGGAATTCCTTCATTTGGCCCAGGGAACACCAGAGC
GGAAGATCAAGAGGTTCATTAAAGGTCTCCATGAGGAAATTCGTGGCTCTATAGCCCTGAGCATGCCCACGACCTTCGCTGAAGCACTCACGGGTGCATTGATCATGGAT
AAGAATGTTCCCAAGAAGGCACAACCTCATCTTGAAAAGGGATCAACTTCTGGAATTAAAAGAAAGTTGTCTCCCTTGAGGAACCCACCTATTGAGTCTACTCAACAGCA
GGTGAAAGAGTACGTTCCATATCCTCCTTGCCCTTTTTGTCACAAGCTTCACAAAGGAGAGTGTTGGTTAGTGCACCTTCCTCGAGCCTATCTATCGCTCTTGAATTTCC
TTTCTTCTTGCAATGCCTTGTCATCTGAATCTATTTGTGTTGCGAGCTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTTGTGGTCATGATCCTGAAGTTCCAATTGTCAGGCAGGATGACCAAGCAAAGGAAGTTACTACTCAGCAAGGGGTCGATCCTCTGGCTCCCCTTTTTCAGGAGGT
TAATCCCCTGATTCCTCCCGATCAGCGTAGAGTTGATCCTCCTCCAGCCCCTCCTACAGCTCCTATGCTGATCACTCTGGAAACCTTTCAGACCATGTTCGATAACATGG
CCCAGAGAAATGGACTTCAAGCGCTACGGGCTCCCTCTTTTGATGGGCAATCCAAAAATCCGTTGGCAGCAGAACGATGGATTGTTGATTTAGAGGCACTGTTTGAGCTC
ATGAACTGTAATGATCCCCTGAAGATCAGAGGAGCGGTCTTCATGCTCAAGGACGACGCTCGCATGTGGTGGAAGCCTATGGCAGCTGTCGAAGATCATGCCAATCAACC
GATTTCTTGGAAAAGGTTTAAAGACCTGTTGTACGATTATTACTTCCCGGAGACAGTCAAGGATGAAAAAGAAGTGGAATTCCTTCATTTGGCCCAGGGAACACCAGAGC
GGAAGATCAAGAGGTTCATTAAAGGTCTCCATGAGGAAATTCGTGGCTCTATAGCCCTGAGCATGCCCACGACCTTCGCTGAAGCACTCACGGGTGCATTGATCATGGAT
AAGAATGTTCCCAAGAAGGCACAACCTCATCTTGAAAAGGGATCAACTTCTGGAATTAAAAGAAAGTTGTCTCCCTTGAGGAACCCACCTATTGAGTCTACTCAACAGCA
GGTGAAAGAGTACGTTCCATATCCTCCTTGCCCTTTTTGTCACAAGCTTCACAAAGGAGAGTGTTGGTTAGTGCACCTTCCTCGAGCCTATCTATCGCTCTTGAATTTCC
TTTCTTCTTGCAATGCCTTGTCATCTGAATCTATTTGTGTTGCGAGCTCTTAG
Protein sequenceShow/hide protein sequence
MSCGHDPEVPIVRQDDQAKEVTTQQGVDPLAPLFQEVNPLIPPDQRRVDPPPAPPTAPMLITLETFQTMFDNMAQRNGLQALRAPSFDGQSKNPLAAERWIVDLEALFEL
MNCNDPLKIRGAVFMLKDDARMWWKPMAAVEDHANQPISWKRFKDLLYDYYFPETVKDEKEVEFLHLAQGTPERKIKRFIKGLHEEIRGSIALSMPTTFAEALTGALIMD
KNVPKKAQPHLEKGSTSGIKRKLSPLRNPPIESTQQQVKEYVPYPPCPFCHKLHKGECWLVHLPRAYLSLLNFLSSCNALSSESICVASS