; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018532 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018532
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr5:29048293..29049657
RNA-Seq ExpressionLag0018532
SyntenyLag0018532
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ABA98898.1 retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group]3.5e-1930.18Show/hide
Query:  QDPWINFKSARIPIKVKDDFKNKRVNVLIEEDGSWKESLIKEVFTSMEAEEIINMPRCKSKAKDEIIWNEDPKGQFTVKSAYHLATNLEARNEASSSDNQ
        +DPW+    +R PI  K + + K V  L++++GSW   +I ++F  ++ E I ++     + +D I W  D  GQF+++SAY LA  L    E+SSS   
Subjt:  QDPWINFKSARIPIKVKDDFKNKRVNVLIEEDGSWKESLIKEVFTSMEAEEIINMPRCKSKAKDEIIWNEDPKGQFTVKSAYHLATNLEARNEASSSDNQ

Query:  ASTTIWNSIWKIKCIPRAKIAAWKIINDIIPSNYNVKRKGINSNDRCVFYREHEETTEHLIWMCKFSKK
         +  +WN +WK     + K+ +W+ I++ +P+  N K++    +D C F     E   H +W C  +++
Subjt:  ASTTIWNSIWKIKCIPRAKIAAWKIINDIIPSNYNVKRKGINSNDRCVFYREHEETTEHLIWMCKFSKK

EEC72938.1 hypothetical protein OsI_06800 [Oryza sativa Indica Group]2.0e-1933.53Show/hide
Query:  QDPWINFKSARIPIKVKDDFKNKRVNVLIEEDGSWKESLIKEVFTSMEAEEIINMPRCKSKAKDEII-WNEDPKGQFTVKSAYHLATNLEARNEASSSDN
        +DPWI    +R PI  K + + + V+ LI++  SW    I + F  M+AE I+N+ R  S+ +D+ I W+ D  G+F+V+SAYHLA  L   +E SSS  
Subjt:  QDPWINFKSARIPIKVKDDFKNKRVNVLIEEDGSWKESLIKEVFTSMEAEEIINMPRCKSKAKDEII-WNEDPKGQFTVKSAYHLATNLEARNEASSSDN

Query:  QASTTIWNSIWKIKCIPRAKIAAWKIINDIIPSNYNVKRKGINSNDRCVFYREHEETTEHLIWMCKFSKK
          ++  WN++WK     + KI AW+ I + +P+  N K++ +  +D C       E   H ++ C  +++
Subjt:  QASTTIWNSIWKIKCIPRAKIAAWKIINDIIPSNYNVKRKGINSNDRCVFYREHEETTEHLIWMCKFSKK

EEC84753.1 hypothetical protein OsI_31756 [Oryza sativa Indica Group]2.7e-1931.18Show/hide
Query:  QDPWINFKSARIPIKVKDDFKNKRVNVLIEEDGSWKESLIKEVFTSMEAEEIINMPRCKSKAKDEIIWNEDPKGQFTVKSAYHLATNLEARNEASSSDNQ
        +DPWI    +R PI  K ++K K V  L+ + G W  + ++++F  ++AE I+ +        D + W+ D  G+F+V+SAY LA +L + N  S+S   
Subjt:  QDPWINFKSARIPIKVKDDFKNKRVNVLIEEDGSWKESLIKEVFTSMEAEEIINMPRCKSKAKDEIIWNEDPKGQFTVKSAYHLATNLEARNEASSSDNQ

Query:  ASTTIWNSIWKIKCIPRAKIAAWKIINDIIPSNYNVKRKGINSNDRCVFYREHEETTEHLIWMCKFSKKM
          +  WN +WK     + KI AWK+  D + +  N KR+ + + D C      EE + H +  C  +K +
Subjt:  ASTTIWNSIWKIKCIPRAKIAAWKIINDIIPSNYNVKRKGINSNDRCVFYREHEETTEHLIWMCKFSKKM

XP_023875263.1 uncharacterized protein LOC111987752 [Quercus suber]1.2e-1932.95Show/hide
Query:  QDPWINFKSARIPIKVK---DDFKNKRVNVLIEEDGS-WKESLIKEVFTSMEAEEIINMPRCKSKAKDEIIWNEDPKGQFTVKSAYHLATNL-EARNEAS
        +D W+   +   PI      DDF    V+ LI+ D   WK  L+K +F   EA+ I+++P C S   D++IW  + KG FTV+SAY++A +L E  NE  
Subjt:  QDPWINFKSARIPIKVK---DDFKNKRVNVLIEEDGS-WKESLIKEVFTSMEAEEIINMPRCKSKAKDEIIWNEDPKGQFTVKSAYHLATNL-EARNEAS

Query:  SSDNQASTTIWNSIWKIKCIPRAKIAAWKIINDIIPSNYNVKRKGINSNDRCVFYREHEETTEHLIWMCKFSK
        SS   + + +W  +W +  + + +I AW+   + +P+  N++ +G+N++D C    +  E T H ++ C  SK
Subjt:  SSDNQASTTIWNSIWKIKCIPRAKIAAWKIINDIIPSNYNVKRKGINSNDRCVFYREHEETTEHLIWMCKFSK

XP_040986468.1 uncharacterized protein LOC121234565 [Juglans microcarpa x Juglans regia]1.4e-2036.55Show/hide
Query:  KNKRVNVLIE-EDGSWKESLIKEVFTSMEAEEIINMPRCKSKAKDEIIWNEDPKGQFTVKSAYHLATNLEARNEASSSDNQASTTIWNSIWKIKCIPRAK
        K+ +V  LI+ + G W E++I+ +F+  E E+I+ MP  + +AKD+I W    KG FTV+SAY L   L+ RN+  SS  +    IW SIW ++     K
Subjt:  KNKRVNVLIE-EDGSWKESLIKEVFTSMEAEEIINMPRCKSKAKDEIIWNEDPKGQFTVKSAYHLATNLEARNEASSSDNQASTTIWNSIWKIKCIPRAK

Query:  IAAWKIINDIIPSNYNVKRKGINSNDRCVFYREHEETTEHLIWMC
        +  WK  N ++P+  N+ R+ I+ + RC      EE+  H++W C
Subjt:  IAAWKIINDIIPSNYNVKRKGINSNDRCVFYREHEETTEHLIWMC

TrEMBL top hitse value%identityAlignment
A0A2N9H3I8 Uncharacterized protein2.1e-2234.41Show/hide
Query:  QDPWI-NFKSARI--PIKVKDDFKNKRVNVLIEEDG-SWKESLIKEVFTSMEAEEIINMPRCKSKAKDEIIWNEDPKGQFTVKSAYHLATNLEARNEASS
        +D W+ +  S R+  P+ V D+F    V+ LI +D  +WK  L+ E+F   + + II +P    +  D +IW    +G F+VKSAYHL  +L    EA++
Subjt:  QDPWI-NFKSARI--PIKVKDDFKNKRVNVLIEEDG-SWKESLIKEVFTSMEAEEIINMPRCKSKAKDEIIWNEDPKGQFTVKSAYHLATNLEARNEASS

Query:  SDNQASTTIWNSIWKIKCIPRAKIAAWKIINDIIPSNYNVKRKGINSNDRCVFYREHEETTEHLIWMCKFSKKMLKKRRHQSRSAY
        S   +  +IW+SIW +K  P+ K+  WK  +DI+P+   +  KGI+ +  C++  E  ET +H++W C+F+ K+ K       S Y
Subjt:  SDNQASTTIWNSIWKIKCIPRAKIAAWKIINDIIPSNYNVKRKGINSNDRCVFYREHEETTEHLIWMCKFSKKMLKKRRHQSRSAY

A0A803PAX6 Uncharacterized protein1.1e-2135.75Show/hide
Query:  LDQDPWINFKSA---RIPIKVKDDFKNKRVNVLIEEDGSWKESLIKEVFTSMEAEEIINMPRCKSKAKDEIIWNEDPKGQFTVKSAYHLATNLEARNEAS
        +++D W+   S    R P KV    K   ++ L EE+G WK  LIKE F + +   I+ M  CK  + D++IW+    G +TV+S Y +A   E  N  +
Subjt:  LDQDPWINFKSA---RIPIKVKDDFKNKRVNVLIEEDGSWKESLIKEVFTSMEAEEIINMPRCKSKAKDEIIWNEDPKGQFTVKSAYHLATNLEARNEAS

Query:  SSDNQAST-TIWNSIWKIKCIPRAKIAAWKIINDIIPSNYNVKRKGINSNDRCVFYREHEETTEHLIWMCKFSKKMLKK
           ++A+T   W SIWKI+  P+ +   W++ N  IP N  ++R+G+N N  C +  + EET EH +W C +SK++ KK
Subjt:  SSDNQAST-TIWNSIWKIKCIPRAKIAAWKIINDIIPSNYNVKRKGINSNDRCVFYREHEETTEHLIWMCKFSKKMLKK

M5VU98 Reverse transcriptase domain-containing protein5.2e-2126.16Show/hide
Query:  DDFKNKRVNVLIEEDGS--WKESLIKEVFTSMEAEEIINMPRCKSKAKDEIIWNEDPKGQFTVKSAYHLATNLEARNE-ASSSDNQASTTIWNSIWKIKC
        D  +N +V+ LI  +GS  W    +  +F  ++  +I+ +P       D I+WN D  G FTVKSAY +A  + + +E  SSS N  +  +W  IW    
Subjt:  DDFKNKRVNVLIEEDGS--WKESLIKEVFTSMEAEEIINMPRCKSKAKDEIIWNEDPKGQFTVKSAYHLATNLEARNE-ASSSDNQASTTIWNSIWKIKC

Query:  IPRAKIAAWKIINDIIPSNYNVKRKGINSNDRCVFYREHEETTEHLIWMCKFSK-----KMLKKRRHQSRSAYYVELMEYEKHMLSQQSEPKFSEHQKIN
          + KI AW++ +DI+P+  N+ +KG++  D C+F  +  E+  H++ MC F+       +L +  HQ      V+   +E    +QQ   +F       
Subjt:  IPRAKIAAWKIINDIIPSNYNVKRKGINSNDRCVFYREHEETTEHLIWMCKFSK-----KMLKKRRHQSRSAYYVELMEYEKHMLSQQSEPKFSEHQKIN

Query:  SKLYRRRRRKVDRAPDSDGRPVSDSDGEPAESPPLIAWRWPIKVLE-------AKAISEGLKTYIKATEMEDLEIRPEMEV-----------EADSLEVI
        SK+  R R  V  A    GR   + DG  A  P   + R  + V+          A+++ +   + A   E L  R  + +           E DS  V+
Subjt:  SKLYRRRRRKVDRAPDSDGRPVSDSDGEPAESPPLIAWRWPIKVLE-------AKAISEGLKTYIKATEMEDLEIRPEMEV-----------EADSLEVI

Query:  QTINKKSVDLSELSLVIDEIEEMVPGARVASFVKCSRSSNDIAHKIARAGVIHGDFCYFFGHPSNSV
          I +   D S +  ++++++ +      + F    R +N +AH++AR G+ + D   +F  P + +
Subjt:  QTINKKSVDLSELSLVIDEIEEMVPGARVASFVKCSRSSNDIAHKIARAGVIHGDFCYFFGHPSNSV

M5XHI9 Reverse transcriptase domain-containing protein8.9e-2125.61Show/hide
Query:  DDFKNKRVNVLIEEDGS--WKESLIKEVFTSMEAEEIINMPRCKSKAKDEIIWNEDPKGQFTVKSAYHLATNLEARNE-ASSSDNQASTTIWNSIWKIKC
        D  +N +V+ LI  +GS  W    +  +F  ++  +I+ +P       D I+WN D  G FTVKSAY +A  + + +E  SSS N  +  +W  IW    
Subjt:  DDFKNKRVNVLIEEDGS--WKESLIKEVFTSMEAEEIINMPRCKSKAKDEIIWNEDPKGQFTVKSAYHLATNLEARNE-ASSSDNQASTTIWNSIWKIKC

Query:  IPRAKIAAWKIINDIIPSNYNVKRKGINSNDRCVFYREHEETTEHLIWMCKFSK-----KMLKKRRHQSRSAYYVELMEYEKHMLSQQSEPKFSEHQKIN
          + KI AW++ +DI+P+  N+ +KG++  D C+F  +  E+  H++ MC F+       +L +  HQ      V+   +E    +QQ   +F       
Subjt:  IPRAKIAAWKIINDIIPSNYNVKRKGINSNDRCVFYREHEETTEHLIWMCKFSK-----KMLKKRRHQSRSAYYVELMEYEKHMLSQQSEPKFSEHQKIN

Query:  SKLYRRRRRKVDRAPDSDGRPVSDSDG--EPAESPPLIAWRWPIKVLEA-----KAISEGLKTYIKATEMEDLEIRPEMEV-----------EADSLEVI
        SK+  R R  V  A    GR   + DG  +P      +     +   +A      A+++ +   + A   E L  R  + +           E DS  V+
Subjt:  SKLYRRRRRKVDRAPDSDGRPVSDSDG--EPAESPPLIAWRWPIKVLEA-----KAISEGLKTYIKATEMEDLEIRPEMEV-----------EADSLEVI

Query:  QTINKKSVDLSELSLVIDEIEEMVPGARVASFVKCSRSSNDIAHKIARAGVIHGDFCYFFGHPSNSV
          I +   D S +  ++++++ +      + F    R +N +AH++AR G+ + D   +F  P + +
Subjt:  QTINKKSVDLSELSLVIDEIEEMVPGARVASFVKCSRSSNDIAHKIARAGVIHGDFCYFFGHPSNSV

M5XK32 Reverse transcriptase domain-containing protein (Fragment)1.8e-2125.26Show/hide
Query:  DPWINFKSARIPI-KVKDDFKNKRVNVLIEEDGS--WKESLIKEVFTSMEAEEIINMPRCKSKAKDEIIWNEDPKGQFTVKSAYHLATNLEARNE-ASSS
        D W+   +A + I    D  +N +V+ LI  +GS  W    +  +F  ++  + + +P       D I+WN D  G FTVKSAY +A  + + +E  SSS
Subjt:  DPWINFKSARIPI-KVKDDFKNKRVNVLIEEDGS--WKESLIKEVFTSMEAEEIINMPRCKSKAKDEIIWNEDPKGQFTVKSAYHLATNLEARNE-ASSS

Query:  DNQASTTIWNSIWKIKCIPRAKIAAWKIINDIIPSNYNVKRKGINSNDRCVFYREHEETTEHLIWMCKFSK-----KMLKKRRHQSRSAYYVELMEYEKH
         N  ++ +W  IW      + KI AW++ +DI+P+  N+ +KG++  D C+F  +  E+  H++ MC F+       +L +  HQ      V+   ++  
Subjt:  DNQASTTIWNSIWKIKCIPRAKIAAWKIINDIIPSNYNVKRKGINSNDRCVFYREHEETTEHLIWMCKFSK-----KMLKKRRHQSRSAYYVELMEYEKH

Query:  MLSQQSEPKFSEHQKINSKLYRRRRRKVDRAPDSDGRPVSDSDGEPAESPPLIAWRWPIKVLE-------AKAISEGLKTYIKATEMEDLEIRPEMEV--
          +QQ   +F       SK+  R R  V  A  S GR   + DG  A  P   + R  + V+          A+++ +   + A   E L  R  + +  
Subjt:  MLSQQSEPKFSEHQKINSKLYRRRRRKVDRAPDSDGRPVSDSDGEPAESPPLIAWRWPIKVLE-------AKAISEGLKTYIKATEMEDLEIRPEMEV--

Query:  ---------EADSLEVIQTINKKSVDLSELSLVIDEIEEMVPGARVASFVKCSRSSNDIAHKIARAGVIHGDFCYFFGHPSNSV
                 E DS  V+  + +   D S +  ++++++ +      + F    R +N + H++AR G+ + D   +F  P + +
Subjt:  ---------EADSLEVIQTINKKSVDLSELSLVIDEIEEMVPGARVASFVKCSRSSNDIAHKIARAGVIHGDFCYFFGHPSNSV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein6.7e-1324.68Show/hide
Query:  PIKVKDDFKNKRVNVLIEEDGS---WKESLIKEVFTSMEAEEIINMPRCKSKAKDEIIWNEDPKGQFTVKSAYHLATNLEARNEASSSDNQASTTIWNSI
        P+  ++ +K   +N L E  GS   W +S I +     +   I  +   KSK  D+IIWN +  G++TV+S Y L T+  + N  + +    S  +   I
Subjt:  PIKVKDDFKNKRVNVLIEEDGS---WKESLIKEVFTSMEAEEIINMPRCKSKAKDEIIWNEDPKGQFTVKSAYHLATNLEARNEASSSDNQASTTIWNSI

Query:  WKIKCIPRAKIAAWKIINDIIPSNYNVKRKGINSNDRCVFYREHEETTEHLIWMCKFS
        W +  +P+ K   W+ ++  + +   +  +G+  +  C       E+  H ++ C F+
Subjt:  WKIKCIPRAKIAAWKIINDIIPSNYNVKRKGINSNDRCVFYREHEETTEHLIWMCKFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAGATCAAGACCCCTGGATTAATTTCAAAAGTGCTAGAATCCCCATCAAAGTGAAGGATGATTTTAAGAACAAGAGAGTTAACGTGCTGATTGAGGAAGATGGAAG
CTGGAAAGAGTCTTTAATCAAAGAGGTGTTCACATCCATGGAAGCTGAAGAAATCATAAATATGCCAAGATGCAAGTCCAAAGCTAAGGATGAAATTATATGGAATGAGG
ATCCTAAAGGTCAATTCACGGTGAAGAGCGCTTACCATCTAGCTACTAATTTAGAAGCTAGAAATGAAGCTTCTTCATCTGATAATCAAGCTTCAACAACCATTTGGAAC
TCGATCTGGAAGATTAAATGCATTCCGAGAGCAAAAATTGCAGCTTGGAAGATTATCAATGACATTATTCCTTCTAATTACAATGTTAAAAGGAAAGGAATTAATTCTAA
TGATCGATGTGTTTTTTACAGGGAGCATGAGGAAACTACAGAACATTTGATATGGATGTGTAAATTCTCTAAGAAGATGCTTAAAAAGAGAAGACACCAAAGTCGCAGTG
CTTATTATGTGGAGCTTATGGAATACGAGAAACATATGCTTTCACAACAATCAGAGCCCAAATTTTCAGAGCACCAGAAGATCAATTCAAAACTTTATAGAAGAAGAAGA
AGAAAGGTTGATCGAGCACCTGATTCCGATGGAAGACCCGTCTCCGACAGCGATGGAGAACCAGCTGAGTCACCCCCATTGATTGCATGGAGATGGCCAATTAAAGTTCT
AGAAGCAAAAGCTATTTCTGAGGGTTTGAAGACTTACATTAAAGCGACGGAGATGGAAGATCTCGAGATCAGGCCAGAGATGGAAGTGGAGGCGGATTCTCTTGAGGTGA
TTCAAACCATCAACAAGAAGTCTGTGGACCTTTCGGAGTTGTCGCTGGTGATAGACGAAATCGAAGAGATGGTTCCTGGTGCAAGAGTCGCCTCGTTTGTGAAATGCTCG
AGATCGAGCAATGATATTGCACATAAGATTGCGCGCGCTGGAGTCATTCATGGAGATTTCTGTTATTTTTTTGGTCACCCCTCCAATTCTGTTTTGAGGGAATTTCAGGG
TGCAAGGATAAGAGTATCCCTGTGTGGCTTAAAAGATGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTAGATCAAGACCCCTGGATTAATTTCAAAAGTGCTAGAATCCCCATCAAAGTGAAGGATGATTTTAAGAACAAGAGAGTTAACGTGCTGATTGAGGAAGATGGAAG
CTGGAAAGAGTCTTTAATCAAAGAGGTGTTCACATCCATGGAAGCTGAAGAAATCATAAATATGCCAAGATGCAAGTCCAAAGCTAAGGATGAAATTATATGGAATGAGG
ATCCTAAAGGTCAATTCACGGTGAAGAGCGCTTACCATCTAGCTACTAATTTAGAAGCTAGAAATGAAGCTTCTTCATCTGATAATCAAGCTTCAACAACCATTTGGAAC
TCGATCTGGAAGATTAAATGCATTCCGAGAGCAAAAATTGCAGCTTGGAAGATTATCAATGACATTATTCCTTCTAATTACAATGTTAAAAGGAAAGGAATTAATTCTAA
TGATCGATGTGTTTTTTACAGGGAGCATGAGGAAACTACAGAACATTTGATATGGATGTGTAAATTCTCTAAGAAGATGCTTAAAAAGAGAAGACACCAAAGTCGCAGTG
CTTATTATGTGGAGCTTATGGAATACGAGAAACATATGCTTTCACAACAATCAGAGCCCAAATTTTCAGAGCACCAGAAGATCAATTCAAAACTTTATAGAAGAAGAAGA
AGAAAGGTTGATCGAGCACCTGATTCCGATGGAAGACCCGTCTCCGACAGCGATGGAGAACCAGCTGAGTCACCCCCATTGATTGCATGGAGATGGCCAATTAAAGTTCT
AGAAGCAAAAGCTATTTCTGAGGGTTTGAAGACTTACATTAAAGCGACGGAGATGGAAGATCTCGAGATCAGGCCAGAGATGGAAGTGGAGGCGGATTCTCTTGAGGTGA
TTCAAACCATCAACAAGAAGTCTGTGGACCTTTCGGAGTTGTCGCTGGTGATAGACGAAATCGAAGAGATGGTTCCTGGTGCAAGAGTCGCCTCGTTTGTGAAATGCTCG
AGATCGAGCAATGATATTGCACATAAGATTGCGCGCGCTGGAGTCATTCATGGAGATTTCTGTTATTTTTTTGGTCACCCCTCCAATTCTGTTTTGAGGGAATTTCAGGG
TGCAAGGATAAGAGTATCCCTGTGTGGCTTAAAAGATGTTTAG
Protein sequenceShow/hide protein sequence
MLDQDPWINFKSARIPIKVKDDFKNKRVNVLIEEDGSWKESLIKEVFTSMEAEEIINMPRCKSKAKDEIIWNEDPKGQFTVKSAYHLATNLEARNEASSSDNQASTTIWN
SIWKIKCIPRAKIAAWKIINDIIPSNYNVKRKGINSNDRCVFYREHEETTEHLIWMCKFSKKMLKKRRHQSRSAYYVELMEYEKHMLSQQSEPKFSEHQKINSKLYRRRR
RKVDRAPDSDGRPVSDSDGEPAESPPLIAWRWPIKVLEAKAISEGLKTYIKATEMEDLEIRPEMEVEADSLEVIQTINKKSVDLSELSLVIDEIEEMVPGARVASFVKCS
RSSNDIAHKIARAGVIHGDFCYFFGHPSNSVLREFQGARIRVSLCGLKDV