; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039657 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039657
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr2:47863364..47865600
RNA-Seq ExpressionLag0039657
SyntenyLag0039657
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_023872411.1 uncharacterized protein LOC111985024 [Quercus suber]3.6e-2344.29Show/hide
Query:  MLKIGFHQTWMNIIMDWVRTATFSVLINGSPMGQIVPQHGLRQSDPLSPYLFLLYSEALSSLITGAMRNG----------------------SLIFCKAK
        M K+GFH  W +++M  + + T++V ING P G I+P  GLRQ DPLSPYLFLL +E LS+LI  A  +G                      SLIFCKA 
Subjt:  MLKIGFHQTWMNIIMDWVRTATFSVLINGSPMGQIVPQHGLRQSDPLSPYLFLLYSEALSSLITGAMRNG----------------------SLIFCKAK

Query:  IEEVWAFKKVLERYEKASGQKVNYNKSSMFVSPNMSNEFR
         +E    +K+L  YEKASGQ++N +K+S+F SPN + E +
Subjt:  IEEVWAFKKVLERYEKASGQKVNYNKSSMFVSPNMSNEFR

XP_023901742.1 uncharacterized protein LOC112013579 [Quercus suber]1.6e-2346.26Show/hide
Query:  MLKIGFHQTWMNIIMDWVRTATFSVLINGSPMGQIVPQHGLRQSDPLSPYLFLLYSEALSSLITGAMRN----------------------GSLIFCKAK
        M K+GF + W+++IM+ V + ++SVLING   G I P  G+RQ DPLSP LFLL +E LS+LI  A RN                       SL+FCKAK
Subjt:  MLKIGFHQTWMNIIMDWVRTATFSVLINGSPMGQIVPQHGLRQSDPLSPYLFLLYSEALSSLITGAMRN----------------------GSLIFCKAK

Query:  IEEVWAFKKVLERYEKASGQKVNYNKSSMFVSPNMSNEFRCVLADIL
         +E  A   +L RYE+ASGQK+N +KSS+F SPN S E R  + +IL
Subjt:  IEEVWAFKKVLERYEKASGQKVNYNKSSMFVSPNMSNEFRCVLADIL

XP_023920261.1 LOW QUALITY PROTEIN: intron-binding protein aquarius-like [Quercus suber]1.0e-2238.27Show/hide
Query:  MLKIGFHQTWMNIIMDWVRTATFSVLINGSPMGQIVPQHGLRQSDPLSPYLFLLYSEALSSLITGAMRNG----------------------SLIFCKAK
        ML++GFH+ W+ ++M  V+T ++S+L+NG P G I P  G+RQ DP SPYLFLL SE L+ L+  A+  G                      SLIFC+AK
Subjt:  MLKIGFHQTWMNIIMDWVRTATFSVLINGSPMGQIVPQHGLRQSDPLSPYLFLLYSEALSSLITGAMRNG----------------------SLIFCKAK

Query:  IEEVWAFKKVLERYEKASGQKVNYNKSSMFVSPNMSNEFRCVLADILKVSVVDNLDRILRLV
        + ++ A + +L  YE+ASGQKVN  K+++F   ++S   +  L D L+V  +   ++ L L+
Subjt:  IEEVWAFKKVLERYEKASGQKVNYNKSSMFVSPNMSNEFRCVLADILKVSVVDNLDRILRLV

XP_030497588.1 uncharacterized protein LOC115713245 [Cannabis sativa]9.5e-2428.65Show/hide
Query:  MLKIGFHQTWMNIIMDWVRTATFSVLINGSPMGQIVPQHGLRQSDPLSPYLFLLYSEALSSLITGAMRNG----------------------SLIFCKAK
        M K+GF+  W+++IM  + T +FS  +NGS +G I PQ GLRQ DPLSPYLFL+ SE LS L+    R G                      SL+FC+A 
Subjt:  MLKIGFHQTWMNIIMDWVRTATFSVLINGSPMGQIVPQHGLRQSDPLSPYLFLLYSEALSSLITGAMRNG----------------------SLIFCKAK

Query:  IEEVWAFKKVLERYEKASGQKVNYNKSSMFVSPNMSNEFRCVLADILKVSVVDNLDRILRL---VDHLSTEVFEKTCITLWAIWNDRNSFLF-----EQP
             + K+ L  Y +ASGQ++N +KS M  SPN     +    +IL + + D  ++ L L    +    ++F      +W + N  N  LF     E+ 
Subjt:  IEEVWAFKKVLERYEKASGQKVNYNKSSMFVSPNMSNEFRCVLADILKVSVVDNLDRILRL---VDHLSTEVFEKTCITLWAIWNDRNSFLF-----EQP

Query:  LME------SSARCGWINKYWEETCQRSTPLPTQLEES--------WANHVSLAARTMVFTDAPVRHYKVGVGYEVVITNEEDTLQSAMHMFEHKSLSPL
        L        ++   G    Y+    Q ST  P  +++S        W        +  V     V   K+GVG  ++I N    + +A          P 
Subjt:  LME------SSARCGWINKYWEETCQRSTPLPTQLEES--------WANHVSLAARTMVFTDAPVRHYKVGVGYEVVITNEEDTLQSAMHMFEHKSLSPL

Query:  ATEVQEILQAVTLLKRMNISEAVLHSDSLNAIKMINEEQEPDSEVHFWILQIQELSKSFSSLAYVHVGRWRNGRADTLAKHVLQ
          E + ++  +      N+S  +L SDSL  +  IN      S     +L I+      SS+   HV +  N  A  LAK  L+
Subjt:  ATEVQEILQAVTLLKRMNISEAVLHSDSLNAIKMINEEQEPDSEVHFWILQIQELSKSFSSLAYVHVGRWRNGRADTLAKHVLQ

XP_030942103.1 uncharacterized protein LOC115967179 [Quercus lobata]4.3e-2441.61Show/hide
Query:  MLKIGFHQTWMNIIMDWVRTATFSVLINGSPMGQIVPQHGLRQSDPLSPYLFLLYSEALSSLITGAMRNG----------------------SLIFCKAK
        M K+GFH  W +++M  + + T++V +NG P G I+P  GLRQ DPLSPYLFL+Y+E LS+LI  A  +G                      SLIFCKA 
Subjt:  MLKIGFHQTWMNIIMDWVRTATFSVLINGSPMGQIVPQHGLRQSDPLSPYLFLLYSEALSSLITGAMRNG----------------------SLIFCKAK

Query:  IEEVWAFKKVLERYEKASGQKVNYNKSSMFVSPNMSNEFRCVLADILKVSVVDNLDRILRL
        I+E  A +KVL  YE+ASGQ++N  K+S+F SPN + E +  +       V+   +R L L
Subjt:  IEEVWAFKKVLERYEKASGQKVNYNKSSMFVSPNMSNEFRCVLADILKVSVVDNLDRILRL

TrEMBL top hitse value%identityAlignment
A0A2N9EK17 Reverse transcriptase domain-containing protein6.6e-2340.88Show/hide
Query:  MLKIGFHQTWMNIIMDWVRTATFSVLINGSPMGQIVPQHGLRQSDPLSPYLFLLYSEALSSLITGAMRNGSLIFCKAKIEEVWAFKKVLERYEKASGQKV
        MLK+GF   W+N+IM+ + T  +SVLING P G I P  GLRQ DPLSPYLFL+ +E LS+L   A R   L  C  + + + A K++L  YE ASGQ++
Subjt:  MLKIGFHQTWMNIIMDWVRTATFSVLINGSPMGQIVPQHGLRQSDPLSPYLFLLYSEALSSLITGAMRNGSLIFCKAKIEEVWAFKKVLERYEKASGQKV

Query:  NYNKSSMFVSPNMSNEFRCVLADILKVSVVDNLDRIL---RLVDHLSTEVFEKTCITLW
        N+ KS+ F S N   + +  ++ IL  S+  N  + L    L+     + FE+    +W
Subjt:  NYNKSSMFVSPNMSNEFRCVLADILKVSVVDNLDRIL---RLVDHLSTEVFEKTCITLW

A0A2N9FT59 Reverse transcriptase domain-containing protein5.1e-2339.13Show/hide
Query:  MLKIGFHQTWMNIIMDWVRTATFSVLINGSPMGQIVPQHGLRQSDPLSPYLFLLYSEALSSLITGAMR----------------------NGSLIFCKAK
        M+K+GFH  W+ ++M+ VR+AT+S+L+NG P G I PQ GLRQ DPLSPYLFLL +E LS+++  A R                      + S++FC+A 
Subjt:  MLKIGFHQTWMNIIMDWVRTATFSVLINGSPMGQIVPQHGLRQSDPLSPYLFLLYSEALSSLITGAMR----------------------NGSLIFCKAK

Query:  IEEVWAFKKVLERYEKASGQKVNYNKSSMFVSPNMSNEFRCVLADILKVSVVDNLDRILRL
          +  A + +L  Y  ASGQ VN +K+++F SPNMS + R  +      S     ++ L L
Subjt:  IEEVWAFKKVLERYEKASGQKVNYNKSSMFVSPNMSNEFRCVLADILKVSVVDNLDRILRL

A0A2N9GAY0 Uncharacterized protein2.3e-2345.32Show/hide
Query:  MLKIGFHQTWMNIIMDWVRTATFSVLINGSPMGQIVPQHGLRQSDPLSPYLFLLYSEALSSLITGAMRNGSLIFCKAKIEEVWAFKKVLERYEKASGQKV
        +LK+GFH+ W+ ++M  V +ATF+V++NG P G I P  GLRQ DPLSPYLFLL +E LS+LI  A  + S+IFC+A  ++      +L  YE+ASGQK+
Subjt:  MLKIGFHQTWMNIIMDWVRTATFSVLINGSPMGQIVPQHGLRQSDPLSPYLFLLYSEALSSLITGAMRNGSLIFCKAKIEEVWAFKKVLERYEKASGQKV

Query:  NYNKSSMFVSPNMSNEFRCVLADILKVSVVDNLDRILRL
        N  K+++F S N   E R VL  +   S     ++ L L
Subjt:  NYNKSSMFVSPNMSNEFRCVLADILKVSVVDNLDRILRL

A0A7N2LPF9 Uncharacterized protein3.0e-2345Show/hide
Query:  MLKIGFHQTWMNIIMDWVRTATFSVLINGSPMGQIVPQHGLRQSDPLSPYLFLLYSEALSSLITGAMRNG----------------------SLIFCKAK
        M K+GF+  W N++M  + + T++V ING P G I P  GLRQ DPLSPYLFLL +E+LS+LI  A  +G                      SLIFCKA 
Subjt:  MLKIGFHQTWMNIIMDWVRTATFSVLINGSPMGQIVPQHGLRQSDPLSPYLFLLYSEALSSLITGAMRNG----------------------SLIFCKAK

Query:  IEEVWAFKKVLERYEKASGQKVNYNKSSMFVSPNMSNEFR
        +EE    ++VL  YEKASGQ++N +K+S+F SPN + E +
Subjt:  IEEVWAFKKVLERYEKASGQKVNYNKSSMFVSPNMSNEFR

A0A803P4A1 Uncharacterized protein3.0e-2325.6Show/hide
Query:  MLKIGFHQTWMNIIMDWVRTATFSVLINGSPMGQIVPQHGLRQSDPLSPYLFLLYSEALSSLI-----TGAMR-----------------NGSLIFCKAK
        M K+G +  W+N+IM++++T   S +INGS  G + PQ GLRQ DPLSP+LFL+YSE LS L+      G ++                 + SL+FC+A 
Subjt:  MLKIGFHQTWMNIIMDWVRTATFSVLINGSPMGQIVPQHGLRQSDPLSPYLFLLYSEALSSLI-----TGAMR-----------------NGSLIFCKAK

Query:  IEEVWAFKKVLERYEKASGQKVNYNKSSMFVSPNMSNEFRCVLADILKVSVVDNLDRILRLVDHL---STEVFEKTCITLWAI---WNDR-------NSF
             A K+ L+ Y KA GQ++N  KS M  SPN ++E +    +IL +S+ D  +  L L  +      ++F      +W +   W D+        + 
Subjt:  IEEVWAFKKVLERYEKASGQKVNYNKSSMFVSPNMSNEFRCVLADILKVSVVDNLDRILRLVDHL---STEVFEKTCITLWAI---WNDR-------NSF

Query:  LFEQPLMESSARCGWINKYWEETCQRSTPLPTQLEESWANHVSLAARTMVFT-DAPVRHYKVGVGYEVVITNEEDTLQSAMHMFEHKSLSPLATEVQEIL
            P +  +  C    +  ++    +   P     SW      +   +    DA        +G+  +I +    +++AM    H    P   E + + 
Subjt:  LFEQPLMESSARCGWINKYWEETCQRSTPLPTQLEESWANHVSLAARTMVFT-DAPVRHYKVGVGYEVVITNEEDTLQSAMHMFEHKSLSPLATEVQEIL

Query:  QAVTLLKRMNISEAVLHSDSLNAIKMINEEQEPDSEVHFWILQIQELSKSFSSLAYVHVGRWRNGRADTLAKHVL
         ++   ++ N    ++ +DSL     + +     S     +  ++       S+   HV R  N  A  LAK VL
Subjt:  QAVTLLKRMNISEAVLHSDSLNAIKMINEEQEPDSEVHFWILQIQELSKSFSSLAYVHVGRWRNGRADTLAKHVL

SwissProt top hitse value%identityAlignment
P92555 Uncharacterized mitochondrial protein AtMg012502.2e-0755.32Show/hide
Query:  LINGSPMGQIVPQHGLRQSDPLSPYLFLLYSEALSSLITGAMRNGSL
        +ING+P G + P  GLRQ DPLSPYLF+L +E LS L   A   G L
Subjt:  LINGSPMGQIVPQHGLRQSDPLSPYLFLLYSEALSSLITGAMRNGSL

Arabidopsis top hitse value%identityAlignment
ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.6e-0855.32Show/hide
Query:  LINGSPMGQIVPQHGLRQSDPLSPYLFLLYSEALSSLITGAMRNGSL
        +ING+P G + P  GLRQ DPLSPYLF+L +E LS L   A   G L
Subjt:  LINGSPMGQIVPQHGLRQSDPLSPYLFLLYSEALSSLITGAMRNGSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCAAGATTGGATTTCATCAGACATGGATGAATATCATAATGGATTGGGTAAGAACTGCAACTTTTTCTGTTCTAATTAATGGTAGTCCTATGGGACAGATTGTTCC
ACAACATGGCTTAAGACAGAGTGATCCTCTATCTCCCTATTTATTTTTGCTTTATTCAGAAGCATTATCTTCATTGATCACAGGGGCTATGAGGAATGGAAGCCTTATCT
TTTGTAAGGCAAAGATTGAGGAGGTTTGGGCTTTCAAAAAGGTCCTAGAGAGGTATGAAAAAGCATCAGGACAAAAGGTGAATTATAATAAGTCTTCGATGTTTGTTTCT
CCCAACATGTCCAACGAGTTTCGATGTGTTCTGGCTGATATTTTAAAAGTATCTGTGGTGGATAATTTGGATAGAATTTTAAGATTGGTGGATCATCTTAGCACGGAGGT
TTTTGAGAAAACATGTATTACTCTTTGGGCAATATGGAACGATCGGAATAGTTTCCTTTTCGAACAGCCTCTTATGGAGTCGTCTGCTCGTTGTGGTTGGATTAATAAGT
ATTGGGAGGAAACTTGCCAACGCTCTACTCCACTACCAACCCAGCTCGAGGAGTCATGGGCCAACCACGTTTCATTGGCTGCTCGGACGATGGTGTTTACAGATGCGCCA
GTTCGACATTATAAGGTAGGCGTCGGATATGAGGTGGTCATTACCAACGAGGAAGATACGTTGCAAAGTGCCATGCATATGTTTGAGCATAAGTCTTTATCACCATTGGC
GACAGAAGTACAAGAAATCCTTCAAGCGGTGACATTGTTGAAGCGTATGAATATTTCGGAGGCTGTGTTACATTCAGATTCGTTGAATGCGATTAAAATGATTAATGAGG
AGCAAGAGCCAGATTCTGAGGTACATTTCTGGATTTTGCAAATTCAGGAATTGAGCAAATCATTCAGTTCACTGGCTTATGTCCATGTGGGAAGATGGCGAAATGGAAGG
GCGGACACTTTAGCAAAACATGTCCTCCAAAGCACACAATCTATGTTATGA
mRNA sequenceShow/hide mRNA sequence
ATGCTCAAGATTGGATTTCATCAGACATGGATGAATATCATAATGGATTGGGTAAGAACTGCAACTTTTTCTGTTCTAATTAATGGTAGTCCTATGGGACAGATTGTTCC
ACAACATGGCTTAAGACAGAGTGATCCTCTATCTCCCTATTTATTTTTGCTTTATTCAGAAGCATTATCTTCATTGATCACAGGGGCTATGAGGAATGGAAGCCTTATCT
TTTGTAAGGCAAAGATTGAGGAGGTTTGGGCTTTCAAAAAGGTCCTAGAGAGGTATGAAAAAGCATCAGGACAAAAGGTGAATTATAATAAGTCTTCGATGTTTGTTTCT
CCCAACATGTCCAACGAGTTTCGATGTGTTCTGGCTGATATTTTAAAAGTATCTGTGGTGGATAATTTGGATAGAATTTTAAGATTGGTGGATCATCTTAGCACGGAGGT
TTTTGAGAAAACATGTATTACTCTTTGGGCAATATGGAACGATCGGAATAGTTTCCTTTTCGAACAGCCTCTTATGGAGTCGTCTGCTCGTTGTGGTTGGATTAATAAGT
ATTGGGAGGAAACTTGCCAACGCTCTACTCCACTACCAACCCAGCTCGAGGAGTCATGGGCCAACCACGTTTCATTGGCTGCTCGGACGATGGTGTTTACAGATGCGCCA
GTTCGACATTATAAGGTAGGCGTCGGATATGAGGTGGTCATTACCAACGAGGAAGATACGTTGCAAAGTGCCATGCATATGTTTGAGCATAAGTCTTTATCACCATTGGC
GACAGAAGTACAAGAAATCCTTCAAGCGGTGACATTGTTGAAGCGTATGAATATTTCGGAGGCTGTGTTACATTCAGATTCGTTGAATGCGATTAAAATGATTAATGAGG
AGCAAGAGCCAGATTCTGAGGTACATTTCTGGATTTTGCAAATTCAGGAATTGAGCAAATCATTCAGTTCACTGGCTTATGTCCATGTGGGAAGATGGCGAAATGGAAGG
GCGGACACTTTAGCAAAACATGTCCTCCAAAGCACACAATCTATGTTATGA
Protein sequenceShow/hide protein sequence
MLKIGFHQTWMNIIMDWVRTATFSVLINGSPMGQIVPQHGLRQSDPLSPYLFLLYSEALSSLITGAMRNGSLIFCKAKIEEVWAFKKVLERYEKASGQKVNYNKSSMFVS
PNMSNEFRCVLADILKVSVVDNLDRILRLVDHLSTEVFEKTCITLWAIWNDRNSFLFEQPLMESSARCGWINKYWEETCQRSTPLPTQLEESWANHVSLAARTMVFTDAP
VRHYKVGVGYEVVITNEEDTLQSAMHMFEHKSLSPLATEVQEILQAVTLLKRMNISEAVLHSDSLNAIKMINEEQEPDSEVHFWILQIQELSKSFSSLAYVHVGRWRNGR
ADTLAKHVLQSTQSML