CuGenDBv2

Lag0034969 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0034969
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr3:13070479..13071063
RNA-Seq Expression: Lag0034969
Synteny: Lag0034969
Gene Ontology terms: NA
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain


Homology
GenBank top hits (e value, % identity, alignment)
ONI01138.1 hypothetical protein PRUPE_6G123900 [Prunus persica]  e value: 2.1e-36  % identity: 41.92
Query:  WGKELLRKGIRWRVGNGEKIRVYGSNWIPDDACMRVQSPITLPLDACVADLITAEGQWNEELLRHHLSPHEVNIILTIPLRHVWSED-----REKSGVYS
        WGKELL KG+RWRVG+G  I+VY   W+P  +C ++ SP  LPL   V DL T+ GQWN  LL+      EV+ IL IPL  +   D      E++G+YS
Subjt:  WGKELLRKGIRWRVGNGEKIRVYGSNWIPDDACMRVQSPITLPLDACVADLITAEGQWNEELLRHHLSPHEVNIILTIPLRHVWSED-----REKSGVYS

Query:  VKSGYRLGQRSLLDLGPSSSFNESLLS-WWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCKFLQRILMGS
        VKSGYRL       +    S    L S +WK  W ++IPNK+K FLWR   D LP    L  R +    +C  C R  ES +H  W C+  + +   S
Subjt:  VKSGYRLGQRSLLDLGPSSSFNESLLS-WWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCKFLQRILMGS

ONI09819.1 hypothetical protein PRUPE_4G011200 [Prunus persica]  e value: 2.1e-36  % identity: 43.39
Query:  WGKELLRKGIRWRVGNGEKIRVYGSNWIPDDACMRVQSPITLPLDACVADLITAEGQWNEELLRHHLSPHEVNIILTIPLRHVWSED-----REKSGVYS
        WGKELL KG+RWRVG+G  I+VY   W+P  +C ++ SP  LPL   V DL T+ GQWN  LL+      EV+ IL IPL  +   D      E++G+YS
Subjt:  WGKELLRKGIRWRVGNGEKIRVYGSNWIPDDACMRVQSPITLPLDACVADLITAEGQWNEELLRHHLSPHEVNIILTIPLRHVWSED-----REKSGVYS

Query:  VKSGYRLGQRSLLDLGPSSSFNESLLS-WWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCK
        VKSGYRL +     +    S    L S +WK  W ++IPNK+K FLWR   D LP    L  R +    +C  C R  ES +H  W C+
Subjt:  VKSGYRLGQRSLLDLGPSSSFNESLLS-WWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCK

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]  e value: 5.1e-35  % identity: 41.41
Query:  WGKELLRKGIRWRVGNGEKIRVYGSNWIPDDACMRVQSPITLPLDACVADLITAEGQWNEELLRHHLSPHEVNIILTIPLRHVWSED-----REKSGVYS
        WGKELL KG+RWRVGNG  I+VY   W+P  +  ++ SP  LPL   V DL T+ GQWN  LL+      EV+  L IPL  +   D      E++G+YS
Subjt:  WGKELLRKGIRWRVGNGEKIRVYGSNWIPDDACMRVQSPITLPLDACVADLITAEGQWNEELLRHHLSPHEVNIILTIPLRHVWSED-----REKSGVYS

Query:  VKSGYRLGQRSLLDLGPSSSFNESLLS-WWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCKFLQRILMGS
        VKSGYRL       +    S    L S +WK  W ++IPNK+K FLWR   D LP    L  R +    +C  C R  ES +H  W C+  + +   S
Subjt:  VKSGYRLGQRSLLDLGPSSSFNESLLS-WWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCKFLQRILMGS

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]  e value: 1.1e-50  % identity: 47.06
Query:  MWGKELLRKGIRWRVGNGEKIRVYGSNWIPDDACMRVQSPITLPLDACVADLIT-AEGQWNEELLRHHLSPHEVNIILTIPLRHVWSEDR-----EKSGV
        +WG++LL+KG+RWR+GNG+ + +YG NW+P+   +++ S   LPL + V+ L+   EG W  +++R   +P E   IL+IP+     EDR     EK+GV
Subjt:  MWGKELLRKGIRWRVGNGEKIRVYGSNWIPDDACMRVQSPITLPLDACVADLIT-AEGQWNEELLRHHLSPHEVNIILTIPLRHVWSEDR-----EKSGV

Query:  YSVKSGYRLGQRSLLD----LGPSSSFNESLLSWWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCKFLQRIL
        YSV+SGY++   +LL+      PSSS +E +  WW G WKM IPNK+K+FLWRLCLDRLPT   L+ RGV++ N C  CGR+GE  IHLFW CKF + + 
Subjt:  YSVKSGYRLGQRSLLD----LGPSSSFNESLLSWWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCKFLQRIL

Query:  MGSE
        + S+
Subjt:  MGSE

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]  e value: 6.3e-33  % identity: 39.18
Query:  MWGKELLRKGIRWRVGNGEKIRVYGSNWIPDDACMRVQSPITLPLDACVADLITAEGQWNEELLRHHLSPHEVNIILTIPLRHVWSEDR-----EKSGVY
        +WG+++L KG RWR+GNG+ + VYG+NWIP     +  S  ++  D  VA+LI  + QW E+L+  H  P +   I+ IPL     ED+     +K G Y
Subjt:  MWGKELLRKGIRWRVGNGEKIRVYGSNWIPDDACMRVQSPITLPLDACVADLITAEGQWNEELLRHHLSPHEVNIILTIPLRHVWSEDR-----EKSGVY

Query:  SVKSGYRLGQRSLLDLGPSSSFNESLLSWWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCKFLQRI
        SVKSGY++  R      PS S ++  L  W+  WK+ IP KVKIFLWR   D LPT + L  + V    +C  C  H E+  H   +C   ++I
Subjt:  SVKSGYRLGQRSLLDLGPSSSFNESLLSWWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCKFLQRI

TrEMBL top hits (e value, % identity, alignment)
A0A251NPF0 Reverse transcriptase domain-containing protein  e value: 1.0e-36  % identity: 41.92
Query:  WGKELLRKGIRWRVGNGEKIRVYGSNWIPDDACMRVQSPITLPLDACVADLITAEGQWNEELLRHHLSPHEVNIILTIPLRHVWSED-----REKSGVYS
        WGKELL KG+RWRVG+G  I+VY   W+P  +C ++ SP  LPL   V DL T+ GQWN  LL+      EV+ IL IPL  +   D      E++G+YS
Subjt:  WGKELLRKGIRWRVGNGEKIRVYGSNWIPDDACMRVQSPITLPLDACVADLITAEGQWNEELLRHHLSPHEVNIILTIPLRHVWSED-----REKSGVYS

Query:  VKSGYRLGQRSLLDLGPSSSFNESLLS-WWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCKFLQRILMGS
        VKSGYRL       +    S    L S +WK  W ++IPNK+K FLWR   D LP    L  R +    +C  C R  ES +H  W C+  + +   S
Subjt:  VKSGYRLGQRSLLDLGPSSSFNESLLS-WWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCKFLQRILMGS

A0A5E4FZN9 PREDICTED: retrotransposon  e value: 2.5e-35  % identity: 41.41
Query:  WGKELLRKGIRWRVGNGEKIRVYGSNWIPDDACMRVQSPITLPLDACVADLITAEGQWNEELLRHHLSPHEVNIILTIPLRHVWSED-----REKSGVYS
        WGKELL KG+RWRVGNG  I+VY   W+P  +  ++ SP  LPL   V DL T+ GQWN  LL+      EV+  L IPL  +   D      E++G+YS
Subjt:  WGKELLRKGIRWRVGNGEKIRVYGSNWIPDDACMRVQSPITLPLDACVADLITAEGQWNEELLRHHLSPHEVNIILTIPLRHVWSED-----REKSGVYS

Query:  VKSGYRLGQRSLLDLGPSSSFNESLLS-WWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCKFLQRILMGS
        VKSGYRL       +    S    L S +WK  W ++IPNK+K FLWR   D LP    L  R +    +C  C R  ES +H  W C+  + +   S
Subjt:  VKSGYRLGQRSLLDLGPSSSFNESLLS-WWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCKFLQRILMGS

A0A6J1DAR4 uncharacterized protein LOC111018954  e value: 5.5e-51  % identity: 47.06
Query:  MWGKELLRKGIRWRVGNGEKIRVYGSNWIPDDACMRVQSPITLPLDACVADLIT-AEGQWNEELLRHHLSPHEVNIILTIPLRHVWSEDR-----EKSGV
        +WG++LL+KG+RWR+GNG+ + +YG NW+P+   +++ S   LPL + V+ L+   EG W  +++R   +P E   IL+IP+     EDR     EK+GV
Subjt:  MWGKELLRKGIRWRVGNGEKIRVYGSNWIPDDACMRVQSPITLPLDACVADLIT-AEGQWNEELLRHHLSPHEVNIILTIPLRHVWSEDR-----EKSGV

Query:  YSVKSGYRLGQRSLLD----LGPSSSFNESLLSWWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCKFLQRIL
        YSV+SGY++   +LL+      PSSS +E +  WW G WKM IPNK+K+FLWRLCLDRLPT   L+ RGV++ N C  CGR+GE  IHLFW CKF + + 
Subjt:  YSVKSGYRLGQRSLLD----LGPSSSFNESLLSWWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCKFLQRIL

Query:  MGSE
        + S+
Subjt:  MGSE

M5W5F3 Reverse transcriptase domain-containing protein (Fragment)  e value: 1.0e-36  % identity: 41.92
Query:  WGKELLRKGIRWRVGNGEKIRVYGSNWIPDDACMRVQSPITLPLDACVADLITAEGQWNEELLRHHLSPHEVNIILTIPLRHVWSED-----REKSGVYS
        WGKELL KG+RWRVG+G  I+VY   W+P  +C ++ SP  LPL   V DL T+ GQWN  LL+      EV+ IL IPL  +   D      E++G+YS
Subjt:  WGKELLRKGIRWRVGNGEKIRVYGSNWIPDDACMRVQSPITLPLDACVADLITAEGQWNEELLRHHLSPHEVNIILTIPLRHVWSED-----REKSGVYS

Query:  VKSGYRLGQRSLLDLGPSSSFNESLLS-WWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCKFLQRILMGS
        VKSGYRL       +    S    L S +WK  W ++IPNK+K FLWR   D LP    L  R +    +C  C R  ES +H  W C+  + +   S
Subjt:  VKSGYRLGQRSLLDLGPSSSFNESLLS-WWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCKFLQRILMGS

M5WJW2 Reverse transcriptase domain-containing protein  e value: 1.0e-36  % identity: 43.39
Query:  WGKELLRKGIRWRVGNGEKIRVYGSNWIPDDACMRVQSPITLPLDACVADLITAEGQWNEELLRHHLSPHEVNIILTIPLRHVWSED-----REKSGVYS
        WGKELL KG+RWRVG+G  I+VY   W+P  +C ++ SP  LPL   V DL T+ GQWN  LL+      EV+ IL IPL  +   D      E++G+YS
Subjt:  WGKELLRKGIRWRVGNGEKIRVYGSNWIPDDACMRVQSPITLPLDACVADLITAEGQWNEELLRHHLSPHEVNIILTIPLRHVWSED-----REKSGVYS

Query:  VKSGYRLGQRSLLDLGPSSSFNESLLS-WWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCK
        VKSGYRL +     +    S    L S +WK  W ++IPNK+K FLWR   D LP    L  R +    +C  C R  ES +H  W C+
Subjt:  VKSGYRLGQRSLLDLGPSSSFNESLLS-WWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCK

SwissProt top hits (e value, % identity, alignment)
P0C2F6 Putative ribonuclease H protein At1g65750  e value: 9.5e-08  % identity: 22.87
Query:  KELLRKGIRWRVGNGEKIRVYGSNWIPDDACMRVQS-PITLPLDACVADLITAEGQ-WNEELL------RHHLSPHEVNIILTIPLRHVWSEDREKSGVY
        ++++  G+ W  G+G++IR +   W+     + + +       D  VA  +   G+ W+   +         L    V + L    R   S    + G +
Subjt:  KELLRKGIRWRVGNGEKIRVYGSNWIPDDACMRVQS-PITLPLDACVADLITAEGQ-WNEELL------RHHLSPHEVNIILTIPLRHVWSEDREKSGVY

Query:  SVKSGYRLGQRSLLDLGPSSSFNESLLSWWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQC
        SV+S Y +       L        ++ S++   WK+R+P +VK FLW +    + T +    R +   NVC +C    ES +H+   C
Subjt:  SVKSGYRLGQRSLLDLGPSSSFNESLLSWWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQC

Arabidopsis top hits (e value, % identity, alignment)
AT1G43730.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein  e value: 1.8e-06  % identity: 35.38
Query:  LLSWWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCKF
        ++ W+K  W      K     W +  +RL T D L   G+ +  VC+LC  H ES  HLF++C F
Subjt:  LLSWWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCKF

AT3G25270.1 Ribonuclease H-like superfamily protein  e value: 3.5e-05  % identity: 33.85
Query:  WKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCKFLQRILMGS
        WK++   K+K FLW+L    L T D L  R +     C  C +  E+  HLF+ C + Q++   S
Subjt:  WKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCKFLQRILMGS

AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein  e value: 1.7e-07  % identity: 24.86
Query:  VGNGEKIRVYGSNWIPDDACMRVQSP-----ITLPLDACVADLITAEGQW-NEELLRHHLSPHEVNIIL----TIPLRH----VWSEDREKSGVYSVKSG
        VG+G   + +  NWI     + V  P     + LP+DA V D +     W      R+ +     N++      +  +H    +W  D          S 
Subjt:  VGNGEKIRVYGSNWIPDDACMRVQSP-----ITLPLDACVADLITAEGQW-NEELLRHHLSPHEVNIIL----TIPLRH----VWSEDREKSGVYSVKSG

Query:  YRLGQRSLLDLGPSSSFNESLLSWWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCKF
             R+   L P S      + W K  W      K     W +  +RL T D L   G+ +   C+LC  H +S  HLF++C+F
Subjt:  YRLGQRSLLDLGPSSSFNESLLSWWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCKF

AT4G29090.1 Ribonuclease H-like superfamily protein  e value: 5.3e-14  % identity: 28.43
Query:  KELLRKGIRWRVGNGEKIRVYGSNWI---PDDACMRV-----QSPITLPLDACVADLITAEG-QWNEELLRHHLSPHEVNIILTI-----PLRHVWSEDR
        +E+LR+G R  VGNGE I ++   W+   P  A +R+     Q   ++     V+DLI   G +W ++++       E  +I  +      +   ++ D 
Subjt:  KELLRKGIRWRVGNGEKIRVYGSNWI---PDDACMRV-----QSPITLPLDACVADLITAEG-QWNEELLRHHLSPHEVNIILTI-----PLRHVWSEDR

Query:  EKSGVYSVKSGY-RLGQRSLLDLGPSSSFNESLLSWWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCKF
          SG Y+VKSGY  L Q       P      SL   ++  WK +   K++ FLW+   + LP    LA R +   + C+ C    E+  HL ++C F
Subjt:  EKSGVYSVKSGY-RLGQRSLLDLGPSSSFNESLLSWWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCKF

AT5G18880.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein  e value: 3.1e-06  % identity: 23.76
Query:  IRWRVGNGEKIRVYGSNWIPDDACMRV-----QSPITLPLDACVADLITAEGQWNEELLRHHLSPHEVNIILTIPLRHVWSEDR-EKSGVYSVKSGYRLG
        +R  +GNGE    +   W      +          + +  DA V +  +  G W     R   S   +  +   P+ H   E R + S ++   +G  L 
Subjt:  IRWRVGNGEKIRVYGSNWIPDDACMRV-----QSPITLPLDACVADLITAEGQWNEELLRHHLSPHEVNIILTIPLRHVWSEDR-EKSGVYSVKSGYRLG

Query:  QRSLLDLGPSSSFNESLLSWWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCKF
          S  D       +   + W K  W      +  +  W   L+RLPT D L   G+++ +  VLC    E+  HLF++C F
Subjt:  QRSLLDLGPSSSFNESLLSWWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCKF


Sequences
CDS sequence
ATGTGGGGGAAAGAGCTTCTGAGAAAAGGTATCCGCTGGAGAGTGGGAAATGGGGAGAAGATCAGGGTGTATGGGTCTAACTGGATCCCAGATGATGCCTGCATGAGGGT
GCAGTCCCCAATCACCTTACCGCTTGATGCTTGTGTTGCTGACCTTATCACAGCCGAGGGGCAGTGGAACGAAGAGTTACTTCGGCACCATCTTAGCCCCCACGAGGTAA
ATATCATCCTCACTATCCCTCTTCGACATGTTTGGTCTGAGGATAGAGAGAAAAGTGGTGTCTACTCCGTTAAGAGTGGGTACCGGTTAGGCCAAAGGAGCTTGCTTGAC
CTGGGTCCATCCTCGTCTTTTAACGAGTCTTTACTTAGTTGGTGGAAGGGGTGTTGGAAGATGAGGATCCCTAACAAAGTGAAGATCTTTTTGTGGAGACTTTGCCTTGA
TCGCCTTCCTACAGTGGATGGTTTGGCAGTTAGAGGCGTTGATGTCTTGAATGTGTGTGTTCTTTGTGGCCGACATGGGGAATCTTGCATCCATCTCTTCTGGCAATGCA
AGTTTCTGCAGAGAATTTTGATGGGCTCTGAATAG
mRNA sequence
ATGTGGGGGAAAGAGCTTCTGAGAAAAGGTATCCGCTGGAGAGTGGGAAATGGGGAGAAGATCAGGGTGTATGGGTCTAACTGGATCCCAGATGATGCCTGCATGAGGGT
GCAGTCCCCAATCACCTTACCGCTTGATGCTTGTGTTGCTGACCTTATCACAGCCGAGGGGCAGTGGAACGAAGAGTTACTTCGGCACCATCTTAGCCCCCACGAGGTAA
ATATCATCCTCACTATCCCTCTTCGACATGTTTGGTCTGAGGATAGAGAGAAAAGTGGTGTCTACTCCGTTAAGAGTGGGTACCGGTTAGGCCAAAGGAGCTTGCTTGAC
CTGGGTCCATCCTCGTCTTTTAACGAGTCTTTACTTAGTTGGTGGAAGGGGTGTTGGAAGATGAGGATCCCTAACAAAGTGAAGATCTTTTTGTGGAGACTTTGCCTTGA
TCGCCTTCCTACAGTGGATGGTTTGGCAGTTAGAGGCGTTGATGTCTTGAATGTGTGTGTTCTTTGTGGCCGACATGGGGAATCTTGCATCCATCTCTTCTGGCAATGCA
AGTTTCTGCAGAGAATTTTGATGGGCTCTGAATAG
Protein sequence
MWGKELLRKGIRWRVGNGEKIRVYGSNWIPDDACMRVQSPITLPLDACVADLITAEGQWNEELLRHHLSPHEVNIILTIPLRHVWSEDREKSGVYSVKSGYRLGQRSLLD
LGPSSSFNESLLSWWKGCWKMRIPNKVKIFLWRLCLDRLPTVDGLAVRGVDVLNVCVLCGRHGESCIHLFWQCKFLQRILMGSE
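
Consistency check (illustrative)
The genome location, CDS and protein sequence above can be cross-checked against one another. The following is a minimal Python sketch, not part of the database record: the function names span_length and translate are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the standard nuclear genetic code is assumed. Under those assumptions the locus spans 585 bp, i.e. 195 codons, which agrees with the 194-residue protein plus a stop codon. Only the first 30 and last 15 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def span_length(location: str) -> int:
    """Length of 'chrom:start..end', assuming 1-based inclusive coordinates."""
    _, coords = location.split(":")
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 nucleotides of the CDS listed above (both in frame).
cds_start = "ATGTGGGGGAAAGAGCTTCTGAGAAAAGGT"
cds_end = "ATGGGCTCTGAATAG"

print(span_length("chr3:13070479..13071063"))  # 585
print(translate(cds_start))                    # MWGKELLRKG
print(translate(cds_end))                      # MGSE
```

The two translated fragments correspond to the first ten and last four residues of the protein sequence listed above. Passing the full CDS to translate should reproduce the whole protein if the record is internally consistent.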