CuGenDBv2

Lag0021298 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0021298
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr7:6226664..6227454
RNA-Seq Expression: Lag0021298
Synteny: Lag0021298
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
                     GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains: IPR002156 - Ribonuclease H domain
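
The genome location field above can be split into a chromosome name and two coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page); the function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr7:6226664..6227454")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 791 positions on chr7; with a 0-based half-open convention the same string would give 790.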


Homology
GenBank top hits (e value, % identity, alignment)
PRQ55763.1 putative RNA-directed DNA polymerase [Rosa chinensis] (e value: 2.6e-14; identity: 28.83%)
Query:  LFVVFWWTIWNLRNSLLWDGRCDDC-DLFQWSRNYLEVFQQASVRNVLSLVPFFQSSGRVAVVAWVPPVENELKLNVDAAVRPKTGEASGGFVLRKDKG-
        LF V+ W IW+ RN+L+W+G   +  +  QW+  +L  +QQ          P  Q         W  P    LK+N+D + RP++G+   G V+R DKG 
Subjt:  LFVVFWWTIWNLRNSLLWDGRCDDC-DLFQWSRNYLEVFQQASVRNVLSLVPFFQSSGRVAVVAWVPPVENELKLNVDAAVRPKTGEASGGFVLRKDKG-

Query:  ------------------------------------EVLLTTDSLRLSKVLTDDVDDISEVGAIMDVIRGLLRQDSSCRVLFTSRQGNMVAHTMASLA-F
                                            E +L +D   +   L  +++D SEVG I+D  +  L    S ++    R+ N VA+ +A LA  
Subjt:  ------------------------------------EVLLTTDSLRLSKVLTDDVDDISEVGAIMDVIRGLLRQDSSCRVLFTSRQGNMVAHTMASLA-F

Query:  VYANFVWLEDWPSEASDVLYRD
         + N VWLE+ P    DVLY D
Subjt:  VYANFVWLEDWPSEASDVLYRD

XP_022150944.1 uncharacterized protein LOC111018973 [Momordica charantia] (e value: 8.7e-18; identity: 31.73%)
Query:  IKSVWCCSEFVSLYQSF-----SELNFESLLWALKEKVSRLDFELFVVFWWTIWNLRNSLLWDGRCDD--CDLFQWSRNYLEVFQQASVRNVLSLVPFFQ
        ++S+W CS+F            S++     +W   + VS       VV  W IWN RN             DL  WS NYL+V+Q A   +  +LV    
Subjt:  IKSVWCCSEFVSLYQSF-----SELNFESLLWALKEKVSRLDFELFVVFWWTIWNLRNSLLWDGRCDD--CDLFQWSRNYLEVFQQASVRNVLSLVPFFQ

Query:  SSGRVAVVAWVPPVENELKLNVDAAVRPKTGEASGGFVLRKDKGEVLLT-------------------------------------TDSLRLSKVLTDDV
           RVA+  W PP    LK+NVDAA R ++  A  G ++R   G V LT                                     TDSLR+  +LT D 
Subjt:  SSGRVAVVAWVPPVENELKLNVDAAVRPKTGEASGGFVLRKDKGEVLLT-------------------------------------TDSLRLSKVLTDDV

Query:  DDISEVGAIMDVIRGLLRQDSS-CRVLFTSRQGNMVAHTMASLAFVYANF-VWLEDWPSEASDVLYRDFSS
         D SEVG +  VI+  L   +      FT R GN  AH +A LA    +  +W+E+WP E S VL  D  S
Subjt:  DDISEVGAIMDVIRGLLRQDSS-CRVLFTSRQGNMVAHTMASLAFVYANF-VWLEDWPSEASDVLYRDFSS

XP_023884925.1 uncharacterized protein LOC111997106 [Quercus suber] (e value: 5.8e-14; identity: 27.97%)
Query:  LKEKVSRLDFELFVVFWWTIWNLRNSLLWDGRCD-DCDLFQWSRNYLEVFQQASVRNVLSLVPFFQSSGRVAVVAWVPPVENELKLNVDAAVRPKTGEAS
        L E++S+ + ELF    W +WN RN LL  G+      L + +  Y+  F++A  R  +      Q + + A   W PP   E K+N DAA+  + G+  
Subjt:  LKEKVSRLDFELFVVFWWTIWNLRNSLLWDGRCD-DCDLFQWSRNYLEVFQQASVRNVLSLVPFFQSSGRVAVVAWVPPVENELKLNVDAAVRPKTGEAS

Query:  GGFVLRKDKGEV-------------------------------------LLTTDSLRLSKVLTDDVDDISEVGAIMDVIRGLLRQDSSCRVLFTSRQGNM
         G ++R + GEV                                     ++  D++ + + ++  V++IS +G ++D IR LLR      +    R GN 
Subjt:  GGFVLRKDKGEV-------------------------------------LLTTDSLRLSKVLTDDVDDISEVGAIMDVIRGLLRQDSSCRVLFTSRQGNM

Query:  VAHTMASLA--FVYANFVWLEDWPSEASDVLYRDFS
        VAH +A  A   +  +  WLED P  A++ LYRD S
Subjt:  VAHTMASLA--FVYANFVWLEDWPSEASDVLYRDFS

XP_023928118.1 uncharacterized protein LOC112039474 [Quercus suber] (e value: 6.4e-13; identity: 28.03%)
Query:  LLWALKEKVSRLDFELFVVFWWTIWNLRNSLLWDGRCDDCD-LFQWSRNYLEVFQQASVRNVLSLVPFFQSSGRVAVVAWVPPVENELKLNVDAAVRPKT
        L   L  K+S  +FELFV+  W IWN RN++++ G+  D   L +W++ +L+ F QA  +  + +     S        W PP ++  KLN DA V  K 
Subjt:  LLWALKEKVSRLDFELFVVFWWTIWNLRNSLLWDGRCDDCD-LFQWSRNYLEVFQQASVRNVLSLVPFFQSSGRVAVVAWVPPVENELKLNVDAAVRPKT

Query:  GEASGGFVLRKDKGEVLLTT-------------------------------------DSLRLSKVLTDDVDDISEVGAIMDVIRGLLRQDSSCRVLFTSR
          +  G ++R + GEV+ T                                      D+L + K LT    D+S +G I+  I+ L R        +  R
Subjt:  GEASGGFVLRKDKGEVLLTT-------------------------------------DSLRLSKVLTDDVDDISEVGAIMDVIRGLLRQDSSCRVLFTSR

Query:  QGNMVAHTMASLA-FVYANFVWLEDWPSEASDVLYRDFS
          N VA+ +A  A  ++ +  W+ED P    + LY DFS
Subjt:  QGNMVAHTMASLA-FVYANFVWLEDWPSEASDVLYRDFS

XP_030941688.1 uncharacterized protein LOC115966628 [Quercus lobata] (e value: 5.4e-12; identity: 27.92%)
Query:  LLWALKEKVSRLDFELFVVFWWTIWNLRNSLLWDGRC-DDCDLFQWSRNYLEVFQQASVRNVLSLVPFFQSSGRVAVVAWVPPVENELKLNVDAAVRPKT
        L   L  ++SR +FELF+   W IWN RN+++  G+  D   L + +  YL  ++QA  ++ LS+      S + +V  W PP  +  KLN DAA+  + 
Subjt:  LLWALKEKVSRLDFELFVVFWWTIWNLRNSLLWDGRC-DDCDLFQWSRNYLEVFQQASVRNVLSLVPFFQSSGRVAVVAWVPPVENELKLNVDAAVRPKT

Query:  GEASGGFVLRKDKGEV-------------------------------------LLTTDSLRLSKVLTDDVDDISEVGAIMDVIRGLLRQDSSCRVLFTSR
        G +  G ++R ++GEV                                     ++  D++ + K +  +  D S +G +   I+ L+       V    R
Subjt:  GEASGGFVLRKDKGEV-------------------------------------LLTTDSLRLSKVLTDDVDDISEVGAIMDVIRGLLRQDSSCRVLFTSR

Query:  QGNMVAHTMASLA-FVYANFVWLEDWPSEASDVLYRDFSS
          N VAH++A  A +V    +WLE+ P  ASD LY+D  S
Subjt:  QGNMVAHTMASLA-FVYANFVWLEDWPSEASDVLYRDFSS

TrEMBL top hits (e value, % identity, alignment)
A0A2P6SAP1 Putative RNA-directed DNA polymerase (e value: 1.3e-14; identity: 28.83%)
Query:  LFVVFWWTIWNLRNSLLWDGRCDDC-DLFQWSRNYLEVFQQASVRNVLSLVPFFQSSGRVAVVAWVPPVENELKLNVDAAVRPKTGEASGGFVLRKDKG-
        LF V+ W IW+ RN+L+W+G   +  +  QW+  +L  +QQ          P  Q         W  P    LK+N+D + RP++G+   G V+R DKG 
Subjt:  LFVVFWWTIWNLRNSLLWDGRCDDC-DLFQWSRNYLEVFQQASVRNVLSLVPFFQSSGRVAVVAWVPPVENELKLNVDAAVRPKTGEASGGFVLRKDKG-

Query:  ------------------------------------EVLLTTDSLRLSKVLTDDVDDISEVGAIMDVIRGLLRQDSSCRVLFTSRQGNMVAHTMASLA-F
                                            E +L +D   +   L  +++D SEVG I+D  +  L    S ++    R+ N VA+ +A LA  
Subjt:  ------------------------------------EVLLTTDSLRLSKVLTDDVDDISEVGAIMDVIRGLLRQDSSCRVLFTSRQGNMVAHTMASLA-F

Query:  VYANFVWLEDWPSEASDVLYRD
         + N VWLE+ P    DVLY D
Subjt:  VYANFVWLEDWPSEASDVLYRD

A0A6J1C467 uncharacterized protein LOC111007775 (e value: 1.7e-11; identity: 27.97%)
Query:  EKVSRLDFELFVVFWWTIWNLRNSLLWDGRCDD--CDLFQWSRNYLEVFQQASVRNVLSLVPFFQSSGRVAVVAWVPPVENELKLNVDAAVRPKTGEASG
        +++     E   VF W IWN RN  ++         ++  W  +YL+V+Q A     L L+      GRV+  AW PP+    K+NVDAA +     A  
Subjt:  EKVSRLDFELFVVFWWTIWNLRNSLLWDGRCDD--CDLFQWSRNYLEVFQQASVRNVLSLVPFFQSSGRVAVVAWVPPVENELKLNVDAAVRPKTGEASG

Query:  GFVLRKDKGEVLLT-------------------------------------TDSLRLSKVLTDDVDDISEVGAIMDVIRGLLRQDSSCRV----LFTSRQ
          ++R     VLL+                                     TDS ++  +L  D +D SE+G +   IR ++   SS  +     F +R+
Subjt:  GFVLRKDKGEVLLT-------------------------------------TDSLRLSKVLTDDVDDISEVGAIMDVIRGLLRQDSSCRV----LFTSRQ

Query:  GNMVAHTMASLAFVYANF-VWLEDWPSEASDVLYRD
        GN  AHT+A +  V  +F VW+E+W S+ S+V+  D
Subjt:  GNMVAHTMASLAFVYANF-VWLEDWPSEASDVLYRD

A0A6J1DBJ7 uncharacterized protein LOC111018973 (e value: 4.2e-18; identity: 31.73%)
Query:  IKSVWCCSEFVSLYQSF-----SELNFESLLWALKEKVSRLDFELFVVFWWTIWNLRNSLLWDGRCDD--CDLFQWSRNYLEVFQQASVRNVLSLVPFFQ
        ++S+W CS+F            S++     +W   + VS       VV  W IWN RN             DL  WS NYL+V+Q A   +  +LV    
Subjt:  IKSVWCCSEFVSLYQSF-----SELNFESLLWALKEKVSRLDFELFVVFWWTIWNLRNSLLWDGRCDD--CDLFQWSRNYLEVFQQASVRNVLSLVPFFQ

Query:  SSGRVAVVAWVPPVENELKLNVDAAVRPKTGEASGGFVLRKDKGEVLLT-------------------------------------TDSLRLSKVLTDDV
           RVA+  W PP    LK+NVDAA R ++  A  G ++R   G V LT                                     TDSLR+  +LT D 
Subjt:  SSGRVAVVAWVPPVENELKLNVDAAVRPKTGEASGGFVLRKDKGEVLLT-------------------------------------TDSLRLSKVLTDDV

Query:  DDISEVGAIMDVIRGLLRQDSS-CRVLFTSRQGNMVAHTMASLAFVYANF-VWLEDWPSEASDVLYRDFSS
         D SEVG +  VI+  L   +      FT R GN  AH +A LA    +  +W+E+WP E S VL  D  S
Subjt:  DDISEVGAIMDVIRGLLRQDSS-CRVLFTSRQGNMVAHTMASLAFVYANF-VWLEDWPSEASDVLYRDFSS

A0A803PI72 Uncharacterized protein (e value: 4.2e-10; identity: 25.45%)
Query:  ELFVVFWWTIWNLRNSLLWDGRCDDCD-LFQWSRNYLEVFQQASVRNVLSLVPFFQSSGRVAVVAWVPPVENELKLNVDAA-------------------
        EL V   W+IWN RN  +W  +  + D +   ++ +L  ++ A  ++ +    F  S    A V W PP+ N +K+NVDAA                   
Subjt:  ELFVVFWWTIWNLRNSLLWDGRCDDCD-LFQWSRNYLEVFQQASVRNVLSLVPFFQSSGRVAVVAWVPPVENELKLNVDAA-------------------

Query:  -------------VRPKTGEASG-----GFVLRKDKGEVLLTTDSLRLSKVLTDDVDDISEVGAIMDVIRGLLRQDSSCRVLFTSRQGNMVAHTMASLAF
                     V P T EA G      ++ +K    V + TD L++ + L   ++  S  G +++V + +L+      + F  R  NMVAH+ A  + 
Subjt:  -------------VRPKTGEASG-----GFVLRKDKGEVLLTTDSLRLSKVLTDDVDDISEVGAIMDVIRGLLRQDSSCRVLFTSRQGNMVAHTMASLAF

Query:  VYANFVW-LEDWPSEASDVL
         Y +  + LE  P++   +L
Subjt:  VYANFVW-LEDWPSEASDVL

M5XSK0 Reverse transcriptase domain-containing protein (e value: 2.2e-11; identity: 27%)
Query:  EKVSRLDFELFVVFWWTIWNLRNSLLWDGRCDDCDLFQWSRNYLEVFQQASVR--NVLSLVPFFQSSGRVAVV--AWVPPVENELKLNVDAAVRPKTGEA
        E++S  DF  F++  W IW  RN LLW+ +            + +V   AS+R  + L +     S  R   +   W PP EN LK+NVD A +P T E 
Subjt:  EKVSRLDFELFVVFWWTIWNLRNSLLWDGRCDDCDLFQWSRNYLEVFQQASVR--NVLSLVPFFQSSGRVAVV--AWVPPVENELKLNVDAAVRPKTGEA

Query:  SGGFVLRKDKGE-------------------------------------VLLTTDSLRLSKVLTDDVDDISEVGAIMDVIRGLLRQDSSCRVLFTSRQGN
          G V+R   G+                                     V+  +D+L++   L +   D S +G +++  + LL Q +        R  N
Subjt:  SGGFVLRKDKGE-------------------------------------VLLTTDSLRLSKVLTDDVDDISEVGAIMDVIRGLLRQDSSCRVLFTSRQGN

Query:  MVAHTMASLAF-VYANFVWLEDWPSEASDVLYRDFSS
         VAH +A  A  +  +  W E+ P   SD+LY D +S
Subjt:  MVAHTMASLAF-VYANFVWLEDWPSEASDVLYRDFSS

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 2.6e-04; identity: 24.17%)
Query:  WCCSEFVSLYQSFSELNFESLLWALKEKVSRLDFELFVVFW--WTIWNLRNSLLWDGRCDDCDLFQWSRNYLEVFQQASVRNVLSLVPFFQSSGRVAVVA
        W  S +V+LY  F+ L   +  W   EK S+L      V W  W +W  RN L++ GR  + +  +  R   +  ++  +R             R +   
Subjt:  WCCSEFVSLYQSFSELNFESLLWALKEKVSRLDFELFVVFW--WTIWNLRNSLLWDGRCDDCDLFQWSRNYLEVFQQASVRNVLSLVPFFQSSGRVAVVA

Query:  WVPPVENELKLNVDAAVRPKTGEASGGFVLRKDKGE-------------------------------------VLLTTDSLRLSKVLTDDVDDISEVGAI
        W PP    +K N DA           G+VLR +KGE                                     V+  +DS  L ++L +D    S    I
Subjt:  WVPPVENELKLNVDAAVRPKTGEASGGFVLRKDKGE-------------------------------------VLLTTDSLRLSKVLTDDVDDISEVGAI

Query:  MDVIRGLLRQDSSCRVLFTSRQGNMVAHTMASLAFVYANF
         D+ R LL Q +  + +F  R+GN +A  +A  +  + N+
Subjt:  MDVIRGLLRQDSSCRVLFTSRQGNMVAHTMASLAFVYANF
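
In the alignment blocks above, the unlabelled middle line marks identical residues with the residue letter and conservative substitutions with `+`. The sketch below counts both for one segment, assuming that BLAST-style convention holds here; `count_matches` is a hypothetical helper, and the two strings are the final Query and midline segment of the PRQ55763.1 hit with the 8-character `Query:  ` prefix removed. Because it covers only one short segment, its ratio is not expected to equal the whole-alignment % identity reported for the hit.

```python
def count_matches(query: str, midline: str) -> tuple[int, int]:
    """Return (identities, positives) for one aligned segment.

    ASSUMPTION: BLAST-style midline, where a letter marks an identical
    residue and '+' marks a conservative substitution. Positives include
    identities, as in BLAST summaries.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identities = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    similar = midline.count("+")
    return identities, identities + similar


query = "VYANFVWLEDWPSEASDVLYRD"
midline = " + N VWLE+ P    DVLY D"  # leading space is part of the alignment

identities, positives = count_matches(query, midline)
print(identities, positives, len(query))
```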


Sequences
CDS sequence
ATGATTAAATCCGTGTGGTGTTGCTCTGAGTTTGTATCTTTATACCAATCCTTTTCTGAATTGAACTTTGAATCGTTGTTGTGGGCTTTAAAGGAGAAAGTGAGTAGGCT
GGATTTTGAGCTTTTTGTTGTTTTCTGGTGGACTATATGGAACTTACGCAATTCACTGTTATGGGATGGAAGGTGTGATGACTGTGATCTGTTTCAATGGTCGAGGAATT
ACCTTGAGGTGTTCCAGCAAGCGTCGGTGCGGAATGTGTTGTCGCTGGTGCCGTTTTTTCAGTCTTCGGGGAGAGTTGCGGTAGTGGCTTGGGTCCCTCCAGTCGAGAAT
GAACTGAAGCTGAACGTTGACGCAGCAGTTAGGCCCAAGACGGGAGAGGCTAGTGGTGGCTTTGTCCTGCGAAAGGATAAAGGGGAAGTGTTGTTGACAACTGATTCATT
GAGATTATCTAAAGTTCTGACTGATGATGTGGACGATATTTCGGAAGTAGGAGCAATTATGGATGTGATTCGAGGATTGCTTCGGCAGGATTCTTCGTGTAGGGTGTTGT
TCACATCCAGGCAAGGCAACATGGTAGCACATACCATGGCTTCTTTAGCTTTTGTTTATGCTAATTTTGTTTGGCTTGAGGATTGGCCCTCTGAGGCCTCAGATGTACTT
TATCGTGATTTTTCCTCCTAA
mRNA sequence
ATGATTAAATCCGTGTGGTGTTGCTCTGAGTTTGTATCTTTATACCAATCCTTTTCTGAATTGAACTTTGAATCGTTGTTGTGGGCTTTAAAGGAGAAAGTGAGTAGGCT
GGATTTTGAGCTTTTTGTTGTTTTCTGGTGGACTATATGGAACTTACGCAATTCACTGTTATGGGATGGAAGGTGTGATGACTGTGATCTGTTTCAATGGTCGAGGAATT
ACCTTGAGGTGTTCCAGCAAGCGTCGGTGCGGAATGTGTTGTCGCTGGTGCCGTTTTTTCAGTCTTCGGGGAGAGTTGCGGTAGTGGCTTGGGTCCCTCCAGTCGAGAAT
GAACTGAAGCTGAACGTTGACGCAGCAGTTAGGCCCAAGACGGGAGAGGCTAGTGGTGGCTTTGTCCTGCGAAAGGATAAAGGGGAAGTGTTGTTGACAACTGATTCATT
GAGATTATCTAAAGTTCTGACTGATGATGTGGACGATATTTCGGAAGTAGGAGCAATTATGGATGTGATTCGAGGATTGCTTCGGCAGGATTCTTCGTGTAGGGTGTTGT
TCACATCCAGGCAAGGCAACATGGTAGCACATACCATGGCTTCTTTAGCTTTTGTTTATGCTAATTTTGTTTGGCTTGAGGATTGGCCCTCTGAGGCCTCAGATGTACTT
TATCGTGATTTTTCCTCCTAA
Protein sequence
MIKSVWCCSEFVSLYQSFSELNFESLLWALKEKVSRLDFELFVVFWWTIWNLRNSLLWDGRCDDCDLFQWSRNYLEVFQQASVRNVLSLVPFFQSSGRVAVVAWVPPVEN
ELKLNVDAAVRPKTGEASGGFVLRKDKGEVLLTTDSLRLSKVLTDDVDDISEVGAIMDVIRGLLRQDSSCRVLFTSRQGNMVAHTMASLAFVYANFVWLEDWPSEASDVL
YRDFSS
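
The protein sequence above corresponds to a codon-by-codon translation of the CDS. The following is a minimal sketch of that translation, assuming the standard genetic code (the page does not state which translation table was used); `translate` is a hypothetical helper, and the two input strings are the first 24 and last 21 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGATTAAATCCGTGTGGTGTTGC"))  # start of the CDS
print(translate("TATCGTGATTTTTCCTCCTAA"))     # end of the CDS, including the stop codon
```

Passing the full CDS (joined across its lines) to `translate` should give the listed protein if the standard-code assumption holds.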