; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026607 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026607
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr10:39625118..39625909
RNA-Seq ExpressionLag0026607
SyntenyLag0026607
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0016020 - membrane (cellular component)
GO:0016779 - nucleotidyltransferase activity (molecular function)
GO:0140640 - catalytic activity, acting on a nucleic acid (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ONI01138.1 hypothetical protein PRUPE_6G123900 [Prunus persica]2.9e-5056.79Show/hide
Query:  GKEILIKAVLQVIPTYSMSCFQLPKSLVRECNRIMARFWWGV---RRKVHWASWKRMCVPKFSGGLGFRDLELFNKALLTKQGWRLIENPNSLLCRVLKG
        GKEILIKAVLQ IPTYSMSCF++PK L +E N IMARFWW     +R +HW  W+ +C  KF+GGLGFRDLE FN+ALL KQ WR++  P SL+ R+ + 
Subjt:  GKEILIKAVLQVIPTYSMSCFQLPKSLVRECNRIMARFWWGV---RRKVHWASWKRMCVPKFSGGLGFRDLELFNKALLTKQGWRLIENPNSLLCRVLKG

Query:  KYFRECSFLEAKHRGNGSFVWRSLMWGRTLLEEGIRWRVGDGRSINILRDKWVPRSSTLKVI
        +Y     FLEA+   N SF+WRSL WG+ LL +G+RWRVG G SI +  DKW+P  S  K++
Subjt:  KYFRECSFLEAKHRGNGSFVWRSLMWGRTLLEEGIRWRVGDGRSINILRDKWVPRSSTLKVI

ONI09819.1 hypothetical protein PRUPE_4G011200 [Prunus persica]4.9e-5056.79Show/hide
Query:  GKEILIKAVLQVIPTYSMSCFQLPKSLVRECNRIMARFWWGV---RRKVHWASWKRMCVPKFSGGLGFRDLELFNKALLTKQGWRLIENPNSLLCRVLKG
        GKEILIKAVLQ IPTYSMSCFQ+PK L +E N IMARFWW     +R +HW  W+ +C  KF+GGLGFRDLE FN+ALL KQ WR++  P SL+ R+ + 
Subjt:  GKEILIKAVLQVIPTYSMSCFQLPKSLVRECNRIMARFWWGV---RRKVHWASWKRMCVPKFSGGLGFRDLELFNKALLTKQGWRLIENPNSLLCRVLKG

Query:  KYFRECSFLEAKHRGNGSFVWRSLMWGRTLLEEGIRWRVGDGRSINILRDKWVPRSSTLKVI
        +Y     FLEA+   N SF+W SL WG+ LL +G+RWRVG G SI +  DKW+P  S  K++
Subjt:  KYFRECSFLEAKHRGNGSFVWRSLMWGRTLLEEGIRWRVGDGRSINILRDKWVPRSSTLKVI

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]6.4e-5056.17Show/hide
Query:  GKEILIKAVLQVIPTYSMSCFQLPKSLVRECNRIMARFWWGV---RRKVHWASWKRMCVPKFSGGLGFRDLELFNKALLTKQGWRLIENPNSLLCRVLKG
        GKEIL+KAVLQ IPTYSMSCF++PK L +E N IMARFWW     +R +HW  W+ +C  KF+GGLGFRDLE FN+ALL KQ WR++  P SL+ R+ + 
Subjt:  GKEILIKAVLQVIPTYSMSCFQLPKSLVRECNRIMARFWWGV---RRKVHWASWKRMCVPKFSGGLGFRDLELFNKALLTKQGWRLIENPNSLLCRVLKG

Query:  KYFRECSFLEAKHRGNGSFVWRSLMWGRTLLEEGIRWRVGDGRSINILRDKWVPRSSTLKVI
        +Y     FLEA+   N SF+WRSL WG+ LL +G+RWRVG+G SI +  DKW+P  S  K++
Subjt:  KYFRECSFLEAKHRGNGSFVWRSLMWGRTLLEEGIRWRVGDGRSINILRDKWVPRSSTLKVI

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]2.4e-5759.76Show/hide
Query:  VGGKEILIKAVLQVIPTYSMSCFQLPKSLVRECNRIMARFWWGVR---RKVHWASWKRMCVPKFSGGLGFRDLELFNKALLTKQGWRLIENPNSLLCRVL
        +GGKE+LIKAV Q IP Y+MSCF+LPK L+RE + I ARFWWG     +K+HW +W  + +PK  GG+GFRDLELFNKALL KQ WR++ +PNS+L RVL
Subjt:  VGGKEILIKAVLQVIPTYSMSCFQLPKSLVRECNRIMARFWWGVR---RKVHWASWKRMCVPKFSGGLGFRDLELFNKALLTKQGWRLIENPNSLLCRVL

Query:  KGKYFRECSFLEAKHRGNGSFVWRSLMWGRTLLEEGIRWRVGDGRSINILRDKWVPRSSTLKVI
        KG+YF++CSF+EAK  GN S++WRS++WGR LL++G+RWR+G+G S+ I  D WVP   TLK++
Subjt:  KGKYFRECSFLEAKHRGNGSFVWRSLMWGRTLLEEGIRWRVGDGRSINILRDKWVPRSSTLKVI

XP_030495126.1 uncharacterized protein LOC115710915 [Cannabis sativa]1.8e-5255.28Show/hide
Query:  GKEILIKAVLQVIPTYSMSCFQLPKSLVRECNRIMARFWWG---VRRKVHWASWKRMCVPKFSGGLGFRDLELFNKALLTKQGWRLIENPNSLLCRVLKG
        G+EIL+KA++Q IPTY MSCF+LPK L+++ + +MARFWWG    ++K+HW +W ++C PK  GG+GF++LELFN++LL KQGW++I NP+S+L RVLK 
Subjt:  GKEILIKAVLQVIPTYSMSCFQLPKSLVRECNRIMARFWWG---VRRKVHWASWKRMCVPKFSGGLGFRDLELFNKALLTKQGWRLIENPNSLLCRVLKG

Query:  KYFRECSFLEAKHRGNGSFVWRSLMWGRTLLEEGIRWRVGDGRSINILRDKWVPRSSTLKV
         Y+   +FLEAK  G GS++WRS++WGR ++++GIRWRV  GR I+I  DKW+PR ST  +
Subjt:  KYFRECSFLEAKHRGNGSFVWRSLMWGRTLLEEGIRWRVGDGRSINILRDKWVPRSSTLKV

TrEMBL top hitse value%identityAlignment
A0A6J1DAR4 uncharacterized protein LOC1110189541.2e-5759.76Show/hide
Query:  VGGKEILIKAVLQVIPTYSMSCFQLPKSLVRECNRIMARFWWGVR---RKVHWASWKRMCVPKFSGGLGFRDLELFNKALLTKQGWRLIENPNSLLCRVL
        +GGKE+LIKAV Q IP Y+MSCF+LPK L+RE + I ARFWWG     +K+HW +W  + +PK  GG+GFRDLELFNKALL KQ WR++ +PNS+L RVL
Subjt:  VGGKEILIKAVLQVIPTYSMSCFQLPKSLVRECNRIMARFWWGVR---RKVHWASWKRMCVPKFSGGLGFRDLELFNKALLTKQGWRLIENPNSLLCRVL

Query:  KGKYFRECSFLEAKHRGNGSFVWRSLMWGRTLLEEGIRWRVGDGRSINILRDKWVPRSSTLKVI
        KG+YF++CSF+EAK  GN S++WRS++WGR LL++G+RWR+G+G S+ I  D WVP   TLK++
Subjt:  KGKYFRECSFLEAKHRGNGSFVWRSLMWGRTLLEEGIRWRVGDGRSINILRDKWVPRSSTLKVI

A0A803NGK5 Uncharacterized protein1.6e-5455.62Show/hide
Query:  GKEILIKAVLQVIPTYSMSCFQLPKSLVRECNRIMARFWWG---VRRKVHWASWKRMCVPKFSGGLGFRDLELFNKALLTKQGWRLIENPNSLLCRVLKG
        G+EIL+ A++Q IPTY MSCF+LPK L+ + +++MARFWWG    + K+HW  WK++C PK  GG+GF+DLE FN+ALL KQGW++I NP+S+L RVLK 
Subjt:  GKEILIKAVLQVIPTYSMSCFQLPKSLVRECNRIMARFWWG---VRRKVHWASWKRMCVPKFSGGLGFRDLELFNKALLTKQGWRLIENPNSLLCRVLKG

Query:  KYFRECSFLEAKHRGNGSFVWRSLMWGRTLLEEGIRWRVGDGRSINILRDKWVPRSSTLKVIHNEGIDP
         Y+   SFLEAK  G GSF+WRS++WGR ++E+G+RWRV  GR I+I  DKW+PRSST  +     + P
Subjt:  KYFRECSFLEAKHRGNGSFVWRSLMWGRTLLEEGIRWRVGDGRSINILRDKWVPRSSTLKVIHNEGIDP

A0A803NWY3 Uncharacterized protein5.6e-5254.66Show/hide
Query:  GKEILIKAVLQVIPTYSMSCFQLPKSLVRECNRIMARFWWG---VRRKVHWASWKRMCVPKFSGGLGFRDLELFNKALLTKQGWRLIENPNSLLCRVLKG
        G+E+L+KAV+Q IPTY MSCF+LPK L+++ + +MARFWWG    + K+HW  W ++C PK  GG+GF++LE FN++LL KQGW++I NP+SLL RVLK 
Subjt:  GKEILIKAVLQVIPTYSMSCFQLPKSLVRECNRIMARFWWG---VRRKVHWASWKRMCVPKFSGGLGFRDLELFNKALLTKQGWRLIENPNSLLCRVLKG

Query:  KYFRECSFLEAKHRGNGSFVWRSLMWGRTLLEEGIRWRVGDGRSINILRDKWVPRSSTLKV
         Y+   +FLEAK  G GSF+WRS++WGR ++++GIRWRV  GR + I  DKW+PR +T  +
Subjt:  KYFRECSFLEAKHRGNGSFVWRSLMWGRTLLEEGIRWRVGDGRSINILRDKWVPRSSTLKV

A0A803Q9W0 Uncharacterized protein6.7e-5355.28Show/hide
Query:  GKEILIKAVLQVIPTYSMSCFQLPKSLVRECNRIMARFWWG---VRRKVHWASWKRMCVPKFSGGLGFRDLELFNKALLTKQGWRLIENPNSLLCRVLKG
        G+EIL+KA++Q IPTY MSCF+LPK L+++ + +MARFWWG    ++K HW +WK++C PK  GG+GF++LELFN++LL KQGW++I NP+S+L RVLK 
Subjt:  GKEILIKAVLQVIPTYSMSCFQLPKSLVRECNRIMARFWWG---VRRKVHWASWKRMCVPKFSGGLGFRDLELFNKALLTKQGWRLIENPNSLLCRVLKG

Query:  KYFRECSFLEAKHRGNGSFVWRSLMWGRTLLEEGIRWRVGDGRSINILRDKWVPRSSTLKV
         Y+   +FLEAK  G GS++WRS++WGR ++++GIRWRV  GR + I  DKW+PR ST  +
Subjt:  KYFRECSFLEAKHRGNGSFVWRSLMWGRTLLEEGIRWRVGDGRSINILRDKWVPRSSTLKV

A0A803QCV6 Uncharacterized protein3.5e-5449.75Show/hide
Query:  MPGGLFKIDRAGDQFWQVWALLKPKRRGENKTGLCLGCGGEVGGKEILIKAVLQVIPTYSMSCFQLPKSLVRECNRIMARFWWG---VRRKVHWASWKRM
        MP  + K  R  + F ++  +++ K +G  K  L         G+E+L+KA++Q IPTY MSCF+LPK L+++ + +MARFWWG    +RK+HW  WK++
Subjt:  MPGGLFKIDRAGDQFWQVWALLKPKRRGENKTGLCLGCGGEVGGKEILIKAVLQVIPTYSMSCFQLPKSLVRECNRIMARFWWG---VRRKVHWASWKRM

Query:  CVPKFSGGLGFRDLELFNKALLTKQGWRLIENPNSLLCRVLKGKYFRECSFLEAKHRGNGSFVWRSLMWGRTLLEEGIRWRVGDGRSINILRDKWVPRSS
        C PK  G +GF+DLE FN+ALL KQGW++I NP S+L RVLK  Y+   SFLEAK  G GSF+WRS++WGR ++E+G+RWR+  GR I I  DKW+PR S
Subjt:  CVPKFSGGLGFRDLELFNKALLTKQGWRLIENPNSLLCRVLKGKYFRECSFLEAKHRGNGSFVWRSLMWGRTLLEEGIRWRVGDGRSINILRDKWVPRSS

Query:  T
        T
Subjt:  T

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657501.9e-2537.06Show/hide
Query:  GKEILIKAVLQVIPTYSMSCFQLPKSLVRECNRIMARFWWG---VRRKVHWASWKRMCVPKFSGGLGFRDLELFNKALLTKQGWRLIENPNSLLCRVLKG
        G+  L KAVL  +P +SMS   LP+S++   +++   F WG    ++K H   W ++C PK  GGLG R  +  N+AL++K GWRL++  NSL   VL+ 
Subjt:  GKEILIKAVLQVIPTYSMSCFQLPKSLVRECNRIMARFWWG---VRRKVHWASWKRMCVPKFSGGLGFRDLELFNKALLTKQGWRLIENPNSLLCRVLKG

Query:  KY----FRECSFLEAKHRGNGSFVWRSLMWG-RTLLEEGIRWRVGDGRSINILRDKWVPRSSTLKVIHNE
        KY     R+  +L  K  G+ S  WRS+  G R ++  G+ W  GDG+ I    D+WV     L++ + E
Subjt:  KY----FRECSFLEAKHRGNGSFVWRSLMWG-RTLLEEGIRWRVGDGRSINILRDKWVPRSSTLKVIHNE

P93295 Uncharacterized mitochondrial protein AtMg003107.6e-3041.55Show/hide
Query:  IPTYSMSCFQLPKSLVRECNRIMARFWWGV---RRKVHWASWKRMCVPK-FSGGLGFRDLELFNKALLTKQGWRLIENPNSLLCRVLKGKYFRECSFLEA
        +P Y+MSCF+L K L ++    M  FWW     +RK+ W +W+++C  K   GGLGFRDL  FN+ALL KQ +R+I  P++LL R+L+ +YF   S +E 
Subjt:  IPTYSMSCFQLPKSLVRECNRIMARFWWGV---RRKVHWASWKRMCVPK-FSGGLGFRDLELFNKALLTKQGWRLIENPNSLLCRVLKGKYFRECSFLEA

Query:  KHRGNGSFVWRSLMWGRTLLEEGIRWRVGDGRSINILRDKWV
              S+ WRS++ GR LL  G+   +GDG    +  D+W+
Subjt:  KHRGNGSFVWRSLMWGRTLLEEGIRWRVGDGRSINILRDKWV

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein2.7e-0642.86Show/hide
Query:  LKGKYFRECSFLEAKHRGNGSFVWRSLMWGRTLLEEGIRWRVGDGRSINILRDKWV
        +K +YF++ S L+AK R   S+ W SL+ G  LL++G R  +GDG++I I  D  V
Subjt:  LKGKYFRECSFLEAKHRGNGSFVWRSLMWGRTLLEEGIRWRVGDGRSINILRDKWV

AT4G29090.1 Ribonuclease H-like superfamily protein4.2e-3139.47Show/hide
Query:  IPTYSMSCFQLPKSLVRECNRIMARFWWGVRRK---VHWASWKRMCVPKFSGGLGFRDLELFNKALLTKQGWRLIENPNSLLCRVLKGKYFRECSFLEAK
        +PTY+M+CF LPK++ ++   ++A FWW  +++   +HW +W  +   K  GG+GF+D+E FN ALL KQ WR++  P SL+ +V K +YF +   L A 
Subjt:  IPTYSMSCFQLPKSLVRECNRIMARFWWGVRRK---VHWASWKRMCVPKFSGGLGFRDLELFNKALLTKQGWRLIENPNSLLCRVLKGKYFRECSFLEAK

Query:  HRGNGSFVWRSLMWGRTLLEEGIRWRVGDGRSINILRDKWV---PRSSTLKV
             SFVW+S+   + +L +G R  VG+G  I I R KW+   P S+ L++
Subjt:  HRGNGSFVWRSLMWGRTLLEEGIRWRVGDGRSINILRDKWV---PRSSTLKV

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein5.4e-3141.55Show/hide
Query:  IPTYSMSCFQLPKSLVRECNRIMARFWWGV---RRKVHWASWKRMCVPK-FSGGLGFRDLELFNKALLTKQGWRLIENPNSLLCRVLKGKYFRECSFLEA
        +P Y+MSCF+L K L ++    M  FWW     +RK+ W +W+++C  K   GGLGFRDL  FN+ALL KQ +R+I  P++LL R+L+ +YF   S +E 
Subjt:  IPTYSMSCFQLPKSLVRECNRIMARFWWGV---RRKVHWASWKRMCVPK-FSGGLGFRDLELFNKALLTKQGWRLIENPNSLLCRVLKGKYFRECSFLEA

Query:  KHRGNGSFVWRSLMWGRTLLEEGIRWRVGDGRSINILRDKWV
              S+ WRS++ GR LL  G+   +GDG    +  D+W+
Subjt:  KHRGNGSFVWRSLMWGRTLLEEGIRWRVGDGRSINILRDKWV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTGGGGGCCTATTCAAGATTGACAGGGCGGGAGATCAATTTTGGCAAGTCTGGGCTTTGCTTAAGCCCAAACGTAGAGGAGAAAACAAGACAGGCCTTTGCCTCGG
TTGTGGGGGTGAAGTTGGGGGAAAGGAGATCCTCATAAAGGCTGTGTTGCAAGTCATCCCTACCTATTCTATGTCATGCTTTCAGTTGCCGAAGAGCTTAGTTCGAGAGT
GCAACCGGATAATGGCAAGGTTTTGGTGGGGGGTGAGGAGGAAGGTTCATTGGGCCTCGTGGAAGAGAATGTGTGTGCCCAAGTTCAGTGGAGGCTTGGGTTTTCGGGAC
TTAGAGTTGTTTAACAAGGCCCTCTTGACGAAACAAGGGTGGAGACTGATAGAGAATCCTAACTCTTTGTTGTGTAGAGTTCTTAAAGGAAAGTATTTCCGAGAATGCTC
TTTTTTGGAGGCAAAGCATAGAGGAAATGGATCGTTTGTGTGGCGTAGTTTGATGTGGGGGAGAACGTTGTTAGAGGAAGGAATTCGTTGGAGGGTAGGGGATGGGAGAT
CAATTAATATCCTAAGGGATAAATGGGTTCCTAGGAGTTCAACGTTGAAAGTTATTCACAATGAGGGGATAGATCCTGATGCTAGGAGTGGATGGGTTGTTGAACTCAGA
TAG
mRNA sequenceShow/hide mRNA sequence
ATGCCTGGGGGCCTATTCAAGATTGACAGGGCGGGAGATCAATTTTGGCAAGTCTGGGCTTTGCTTAAGCCCAAACGTAGAGGAGAAAACAAGACAGGCCTTTGCCTCGG
TTGTGGGGGTGAAGTTGGGGGAAAGGAGATCCTCATAAAGGCTGTGTTGCAAGTCATCCCTACCTATTCTATGTCATGCTTTCAGTTGCCGAAGAGCTTAGTTCGAGAGT
GCAACCGGATAATGGCAAGGTTTTGGTGGGGGGTGAGGAGGAAGGTTCATTGGGCCTCGTGGAAGAGAATGTGTGTGCCCAAGTTCAGTGGAGGCTTGGGTTTTCGGGAC
TTAGAGTTGTTTAACAAGGCCCTCTTGACGAAACAAGGGTGGAGACTGATAGAGAATCCTAACTCTTTGTTGTGTAGAGTTCTTAAAGGAAAGTATTTCCGAGAATGCTC
TTTTTTGGAGGCAAAGCATAGAGGAAATGGATCGTTTGTGTGGCGTAGTTTGATGTGGGGGAGAACGTTGTTAGAGGAAGGAATTCGTTGGAGGGTAGGGGATGGGAGAT
CAATTAATATCCTAAGGGATAAATGGGTTCCTAGGAGTTCAACGTTGAAAGTTATTCACAATGAGGGGATAGATCCTGATGCTAGGAGTGGATGGGTTGTTGAACTCAGA
TAG
Protein sequenceShow/hide protein sequence
MPGGLFKIDRAGDQFWQVWALLKPKRRGENKTGLCLGCGGEVGGKEILIKAVLQVIPTYSMSCFQLPKSLVRECNRIMARFWWGVRRKVHWASWKRMCVPKFSGGLGFRD
LELFNKALLTKQGWRLIENPNSLLCRVLKGKYFRECSFLEAKHRGNGSFVWRSLMWGRTLLEEGIRWRVGDGRSINILRDKWVPRSSTLKVIHNEGIDPDARSGWVVELR