; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025513 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025513
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr10:14150416..14151150
RNA-Seq ExpressionLag0025513
SyntenyLag0025513
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_015382877.1 uncharacterized protein LOC107175700 [Citrus sinensis]1.7e-1729.11Show/hide
Query:  MWSLNVPSKVKQFIWRAYNNCIPTMMNLYRHHVVNG-LCPICKKGSKSTDHALFRCGRAKRFWKLIHNKVKIRQFSYGSIQDRFLDFYQSDQKDMLDWIC
        +W+ ++P KVK FIWRA  N +PT+ NL+R  VV    C  C    +   HAL  C  AKR WKL      ++Q ++  +           +K+ L+ + 
Subjt:  MWSLNVPSKVKQFIWRAYNNCIPTMMNLYRHHVVNG-LCPICKKGSKSTDHALFRCGRAKRFWKLIHNKVKIRQFSYGSIQDRFLDFYQSDQKDMLDWIC

Query:  MGAWALWNGRNAVV-HGK---------------------PIPTEN--------EQCEW-----------------LLRDKYGIGVIVRPLDGSTQAAMHG
           WA+W  RN +V  GK                      +P E          Q EW                 +   K G+GV++R   G   AA   
Subjt:  MGAWALWNGRNAVV-HGK---------------------PIPTEN--------EQCEW-----------------LLRDKYGIGVIVRPLDGSTQAAMHG

Query:  QFPYLLNPLCAEAKAVLEGLQLVQRLEITEVDIYSDS
          PY  +  C EA+AVL G+Q  Q+  +  + I SDS
Subjt:  QFPYLLNPLCAEAKAVLEGLQLVQRLEITEVDIYSDS

XP_024046693.1 uncharacterized protein LOC112101029 [Citrus clementina]1.5e-1829.96Show/hide
Query:  MWSLNVPSKVKQFIWRAYNNCIPTMMNLYRHHVVNG-LCPICKKGSKSTDHALFRCGRAKRFWKLIHNKVKIRQFSYGSIQDRFLDFYQSDQKDMLDWIC
        +W+ ++P KVK FIWRA  N +PT+ NL+R  VV    C IC    +   HAL  C  AKR WKL      + Q +   +     D     +K+ L+ + 
Subjt:  MWSLNVPSKVKQFIWRAYNNCIPTMMNLYRHHVVNG-LCPICKKGSKSTDHALFRCGRAKRFWKLIHNKVKIRQFSYGSIQDRFLDFYQSDQKDMLDWIC

Query:  MGAWALWNGRNAVV-HGK---------------------PIPTEN--------EQCEW-----------------LLRDKYGIGVIVRPLDGSTQAAMHG
           WA+W  RN +V  GK                      +P E          Q EW                 +   K G+GV++R   G   AA   
Subjt:  MGAWALWNGRNAVV-HGK---------------------PIPTEN--------EQCEW-----------------LLRDKYGIGVIVRPLDGSTQAAMHG

Query:  QFPYLLNPLCAEAKAVLEGLQLVQRLEITEVDIYSDS
          PY  +  C EA+AVL G+Q  Q+  +  + I SDS
Subjt:  QFPYLLNPLCAEAKAVLEGLQLVQRLEITEVDIYSDS

XP_030479177.1 uncharacterized protein LOC115696415 [Cannabis sativa]7.7e-1826.87Show/hide
Query:  MWSLNVPSKVKQFIWRAYNNCIPTMMNLYRHHVVNG-LCPICKKGSKSTDHALFRCGRAKRFWKLIHNKVKIRQFSYGSIQDRFLDFYQSDQKDMLDWIC
        +WS   P KVK FIWR  +N IP   +L++  V++  LCPICK  +++  HAL  C RA++ W+          F    I + FL  + +  K+ +  + 
Subjt:  MWSLNVPSKVKQFIWRAYNNCIPTMMNLYRHHVVNG-LCPICKKGSKSTDHALFRCGRAKRFWKLIHNKVKIRQFSYGSIQDRFLDFYQSDQKDMLDWIC

Query:  MGAWALWNGRNAVVHGKPIPTENEQCEWLL----------------------------------------------RDKYGIGVIVRPLDGSTQAAMHGQ
           WALWN RN +V+ +      E  +W L                                                K+ IGV+V   D + +A     
Subjt:  MGAWALWNGRNAVVHGKPIPTENEQCEWLL----------------------------------------------RDKYGIGVIVRPLDGSTQAAMHGQ

Query:  FPYLLNPLCAEAKAVLEGLQLVQRLEI
        F  L+ P  AEAKA+ + +Q  Q + +
Subjt:  FPYLLNPLCAEAKAVLEGLQLVQRLEI

XP_042962663.1 uncharacterized protein LOC122296935 [Carya illinoinensis]3.1e-1928.77Show/hide
Query:  MWSLNVPSKVKQFIWRAYNNCIPTMMNL-YRHHVVNGLCPICKKGSKSTDHALFRCGRAKRFWKLIHNKVKIRQFSYGSIQDRFLDFYQSDQKDMLDWIC
        MW LN+P  VK F+W+A NNC PT  NL  R  V +  CPICK   ++  HAL+ CG A   W    + V+    +   +QD + D+ +  QK+ L+ + 
Subjt:  MWSLNVPSKVKQFIWRAYNNCIPTMMNL-YRHHVVNGLCPICKKGSKSTDHALFRCGRAKRFWKLIHNKVKIRQFSYGSIQDRFLDFYQSDQKDMLDWIC

Query:  MGAWALWNGRNAVVHGKPIPTENEQCE-----------------------W-----LLRDKYGIGVIVRPLDGSTQAAMHGQFPYLLNPLCAEAKAVLEG
        +    +W+  +++   +    +N +C+                       W     L   + G+GV++R  +G    +M  Q   + +PL AE +A+   
Subjt:  MGAWALWNGRNAVVHGKPIPTENEQCE-----------------------W-----LLRDKYGIGVIVRPLDGSTQAAMHGQFPYLLNPLCAEAKAVLEG

Query:  LQLVQRLEITEVDIYSDSL
        LQ+   L +TEV    D+L
Subjt:  LQLVQRLEITEVDIYSDSL

XP_042974848.1 uncharacterized protein LOC122306483 [Carya illinoinensis]5.9e-1830.15Show/hide
Query:  MWSLNVPSKVKQFIWRAYNNCIPTMMNLYRHHVVNG-LCPICKKGSKSTDHALFRCGRAKRFWKLIHNKVKIRQFSYGSIQDRFLDFYQSDQKDMLDWIC
        MW +  P   K  +WRA    + T MNL++  VV   LCPIC +  +S  HAL+ C  A+  W     +V+      GS +   + F+    K  L  + 
Subjt:  MWSLNVPSKVKQFIWRAYNNCIPTMMNLYRHHVVNG-LCPICKKGSKSTDHALFRCGRAKRFWKLIHNKVKIRQFSYGSIQDRFLDFYQSDQKDMLDWIC

Query:  MGAWALWNGRNAVVHGKPIPTENEQCEWL--------LRDKYGIGVIVRPLDGSTQAAMHGQFPYLLNPLCAEAKAVLEGLQLVQRLEITEVDIYSDSL
        + A A+W+ RN     K   + ++  + +         R K GIGVIVR  +G   A++        NP+  EA A L    L   L +T++ I  D+L
Subjt:  MGAWALWNGRNAVVHGKPIPTENEQCEWL--------LRDKYGIGVIVRPLDGSTQAAMHGQFPYLLNPLCAEAKAVLEGLQLVQRLEITEVDIYSDSL

TrEMBL top hitse value%identityAlignment
A0A2N9G3W9 Uncharacterized protein4.4e-1931.16Show/hide
Query:  MWSLNVPSKVKQFIWRAYNNCIPTMMNLYRHHVV-NGLCPICKKGSKSTDHALFRCGRAKRFWKLIHNKVKIRQFSYGSIQDRFLDFYQSDQKDMLDWIC
        +WS+ VP KV+QFIWRA    +PTM+N+ R ++V    CP C+   +   HAL+RC      W       K+ +  + +  D  LD +     + +  + 
Subjt:  MWSLNVPSKVKQFIWRAYNNCIPTMMNLYRHHVV-NGLCPICKKGSKSTDHALFRCGRAKRFWKLIHNKVKIRQFSYGSIQDRFLDFYQSDQKDMLDWIC

Query:  MGAWALWNGRN-AVVHGKPIP----TENEQCEW---------------LLRDKY--GIGVIVRPLDGSTQAAMHGQFPYLLNPLCAEAKAVLEGLQLVQR
           W LWN RN A+   + +P    T   + +W               L  D+   G+GVI+R   G   AA+  +FPYL +   AEA A  E  Q    
Subjt:  MGAWALWNGRN-AVVHGKPIP----TENEQCEW---------------LLRDKY--GIGVIVRPLDGSTQAAMHGQFPYLLNPLCAEAKAVLEGLQLVQR

Query:  LEI--TEVDIYSDSL
        + I  T+V+   DSL
Subjt:  LEI--TEVDIYSDSL

A0A5B6WZ13 Reverse transcriptase2.0e-1630.14Show/hide
Query:  MWSLNVPSKVKQFIWRAYNNCIPTMMNLYRHH-VVNGLCPICKKGSKSTDHALFRCGRAKRFWKLIHNKVKIRQFSYGSIQDRFLDFYQSDQKDMLDWIC
        +W L+VP K+K  IWR +NN +P   NL R   V+N +CP+C+   K ++H ++ CG  K  W  +H +V   + S+ S Q  F   + +  +++   + 
Subjt:  MWSLNVPSKVKQFIWRAYNNCIPTMMNLYRHH-VVNGLCPICKKGSKSTDHALFRCGRAKRFWKLIHNKVKIRQFSYGSIQDRFLDFYQSDQKDMLDWIC

Query:  MGAWALWNGRNAVVHGKP----------IPTE-----NEQCEWLLRDKYGI-GVIVRPLDGSTQAAMHGQFPYLLNPLCAEAKAVLEGLQLVQRLEITEV
        +  W+LW  RN ++H             I  E     N    +    K  I  VIVR L G+ + A    F  + +P  AEA+A    L L   ++   V
Subjt:  MGAWALWNGRNAVVHGKP----------IPTE-----NEQCEWLLRDKYGI-GVIVRPLDGSTQAAMHGQFPYLLNPLCAEAKAVLEGLQLVQRLEITEV

Query:  DIYSDSLWV
         +  DSL V
Subjt:  DIYSDSLWV

A0A6J1DX30 uncharacterized protein LOC1110248741.1e-1727.87Show/hide
Query:  MWSLNVPSKVKQFIWRAYNNCIPTMMNLYRHHVVN-GLCPICKKGSKSTDHALFRCGRAKRFWKLIHNKVKIRQFSYGSIQDR--FLDFYQS-----DQK
        +W L VP+K+K FIWR+ +  IPT  NL    +     C IC    +S  HA F C RA++ W+ +   +        S +D   FL+ + S     + K
Subjt:  MWSLNVPSKVKQFIWRAYNNCIPTMMNLYRHHVVN-GLCPICKKGSKSTDHALFRCGRAKRFWKLIHNKVKIRQFSYGSIQDR--FLDFYQS-----DQK

Query:  DMLDWICMGAWALWNGRNAVVHGKPIPTENEQCEWL--LRDKY--------------------------------------------GIGVIVRPLDGST
        D L+   +  W +WN RN+++HGK +     +CEWL    D +                                              G I+R    S 
Subjt:  DMLDWICMGAWALWNGRNAVVHGKPIPTENEQCEWL--LRDKY--------------------------------------------GIGVIVRPLDGST

Query:  QAAMHGQFPYLLNPLCAEAKAVLEGLQLVQRLEITEVDIYSDSL
         AA   + P+ L+PL AE + +LEGL+       T +++ SDSL
Subjt:  QAAMHGQFPYLLNPLCAEAKAVLEGLQLVQRLEITEVDIYSDSL

A0A803PZ76 Uncharacterized protein8.3e-1826.99Show/hide
Query:  MWSLNVPSKVKQFIWRAYNNCIPTMMNLYRHHVVNG-LCPICKKGSKSTDHALFRCGRAKRFWKLIHNKVKIRQFSYGSIQDRFLDFYQSDQKDMLDWIC
        +WSL  P KVK F+WR  +N IP   +L++  V+N  LCP+CK  +++  HAL  C R+++ W+          F +  I + FL   QS  K+ +  + 
Subjt:  MWSLNVPSKVKQFIWRAYNNCIPTMMNLYRHHVVNG-LCPICKKGSKSTDHALFRCGRAKRFWKLIHNKVKIRQFSYGSIQDRFLDFYQSDQKDMLDWIC

Query:  MGAWALWNGRNAVVHGKPIPTENEQCEWLL---------------------------------------------RDKYGIGVIVRPLDGSTQAAMHGQF
           WALWN RN  V  +      E  +W L                                               ++ IGV+V   D + +A     F
Subjt:  MGAWALWNGRNAVVHGKPIPTENEQCEWLL---------------------------------------------RDKYGIGVIVRPLDGSTQAAMHGQF

Query:  PYLLNPLCAEAKAVLEGLQLVQRLEI
          L+ P  AEAKA+ + +Q  Q + +
Subjt:  PYLLNPLCAEAKAVLEGLQLVQRLEI

Q2QNX8 Retrotransposon protein, putative, unclassified1.1e-1736.89Show/hide
Query:  MWSLNVPSKVKQFIWRAYNNCIPTMMN-LYRHHVVNGLCPICKKGSKSTDHALFRCGRAKRFWKLIHNKVKIRQFSYGSIQ--DRFLDFYQSDQKDMLDW
        +W  N+P KVK F WRA +NC+PTM+N   R+  ++ +C  C    +   HAL+RC  A+R+W L+ +++++++  YGS +  D  LDF +S  +D    
Subjt:  MWSLNVPSKVKQFIWRAYNNCIPTMMN-LYRHHVVNGLCPICKKGSKSTDHALFRCGRAKRFWKLIHNKVKIRQFSYGSIQ--DRFLDFYQSDQKDMLDW

Query:  ICMGAWALWNGRNAVVHGKPIP
          M  W +W  RN + H K  P
Subjt:  ICMGAWALWNGRNAVVHGKPIP

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657502.9e-0431.75Show/hide
Query:  MWSLNVPSKVKQFIWRAYNNCIPTMMNLYRHHV-VNGLCPICKKGSKSTDHALFRCGRAKRFW
        +W + VP +VK F+W   N  + T    +R H+  + +C +CK G +S  H L  C      W
Subjt:  MWSLNVPSKVKQFIWRAYNNCIPTMMNLYRHHV-VNGLCPICKKGSKSTDHALFRCGRAKRFW

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein6.3e-1031.2Show/hide
Query:  MWSLNVPSKVKQFIWRAYNNCIPTMMNL-YRHHVVNGLCPICKKGSKSTDHALFRCGRAKRFWK-----LIHNKVKIRQF--SYGSIQDRFLDFYQSDQK
        +W+L +  K+K F+WRA +  + T   L  R   ++  CP C + ++S +HALF C  A   W+     LI N++    F  +  +I +   D   SD  
Subjt:  MWSLNVPSKVKQFIWRAYNNCIPTMMNL-YRHHVVNGLCPICKKGSKSTDHALFRCGRAKRFWK-----LIHNKVKIRQF--SYGSIQDRFLDFYQSDQK

Query:  DMLD-WICMGAWALWNGRNAVVHGK
         +L  W+    W +W  RN VV  K
Subjt:  DMLD-WICMGAWALWNGRNAVVHGK

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.9e-0434.43Show/hide
Query:  MWSLNVPSKVKQFIWRAYNNCIPTMMNLYRHHV-VNGLCPICKKGSKSTDHALFRCGRAKR
        +WSL +  K+K  IW+A NN +P    L   ++ +   C  C+     T H LF C  A+R
Subjt:  MWSLNVPSKVKQFIWRAYNNCIPTMMNLYRHHV-VNGLCPICKKGSKSTDHALFRCGRAKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGTCCTTAAACGTTCCATCTAAGGTTAAACAGTTCATTTGGAGGGCTTATAACAACTGCATACCAACAATGATGAATCTGTATCGGCATCACGTGGTTAATGGCCT
TTGCCCAATTTGTAAAAAGGGATCAAAATCTACTGATCATGCCTTATTTAGGTGTGGTAGAGCCAAGAGGTTTTGGAAACTAATCCATAACAAAGTTAAAATTCGTCAGT
TCAGTTATGGATCAATTCAAGATAGGTTCTTAGATTTTTACCAATCGGACCAAAAGGACATGTTGGATTGGATTTGCATGGGTGCATGGGCACTATGGAATGGTCGAAAT
GCTGTGGTGCACGGGAAACCTATACCAACTGAAAATGAGCAGTGTGAATGGTTGCTCCGGGATAAGTATGGTATTGGAGTTATAGTAAGACCACTAGATGGGAGTACTCA
AGCAGCCATGCATGGTCAATTTCCATATTTGCTAAATCCTCTATGTGCAGAAGCTAAAGCGGTGCTGGAAGGACTTCAGTTAGTGCAACGTTTGGAGATCACGGAGGTAG
ATATCTACTCGGATTCACTTTGGGTTGATCTCCGTGATCAATGGATATGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTGGTCCTTAAACGTTCCATCTAAGGTTAAACAGTTCATTTGGAGGGCTTATAACAACTGCATACCAACAATGATGAATCTGTATCGGCATCACGTGGTTAATGGCCT
TTGCCCAATTTGTAAAAAGGGATCAAAATCTACTGATCATGCCTTATTTAGGTGTGGTAGAGCCAAGAGGTTTTGGAAACTAATCCATAACAAAGTTAAAATTCGTCAGT
TCAGTTATGGATCAATTCAAGATAGGTTCTTAGATTTTTACCAATCGGACCAAAAGGACATGTTGGATTGGATTTGCATGGGTGCATGGGCACTATGGAATGGTCGAAAT
GCTGTGGTGCACGGGAAACCTATACCAACTGAAAATGAGCAGTGTGAATGGTTGCTCCGGGATAAGTATGGTATTGGAGTTATAGTAAGACCACTAGATGGGAGTACTCA
AGCAGCCATGCATGGTCAATTTCCATATTTGCTAAATCCTCTATGTGCAGAAGCTAAAGCGGTGCTGGAAGGACTTCAGTTAGTGCAACGTTTGGAGATCACGGAGGTAG
ATATCTACTCGGATTCACTTTGGGTTGATCTCCGTGATCAATGGATATGTTGA
Protein sequenceShow/hide protein sequence
MWSLNVPSKVKQFIWRAYNNCIPTMMNLYRHHVVNGLCPICKKGSKSTDHALFRCGRAKRFWKLIHNKVKIRQFSYGSIQDRFLDFYQSDQKDMLDWICMGAWALWNGRN
AVVHGKPIPTENEQCEWLLRDKYGIGVIVRPLDGSTQAAMHGQFPYLLNPLCAEAKAVLEGLQLVQRLEITEVDIYSDSLWVDLRDQWIC