; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041335 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041335
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionzf-RVT domain-containing protein
Genome locationchr13:15855918..15856745
RNA-Seq ExpressionLag0041335
SyntenyLag0041335
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]2.4e-2428.36Show/hide
Query:  MKCQEASLSNVEKETTWWNRVWKMRVPSKVKIFFWKTVHNSIPTMVNLRNHHVSVNGNCPVCQEEMETSDHALFQCTRAREVWDLLYPPM-MRNLWVHMD
        +KC   S S      T WN +WK+ VP+K+KIF W++ H  IPT  NL    +     C +C +  E+  HA F C RAR++W  L+P +   +   ++ 
Subjt:  MKCQEASLSNVEKETTWWNRVWKMRVPSKVKIFFWKTVHNSIPTMVNLRNHHVSVNGNCPVCQEEMETSDHALFQCTRAREVWDLLYPPM-MRNLWVHMD

Query:  IKDYWLSLADN-PMEVLERICVGTWSIWNDRNNSLHQCQIPNPKLRCEWINDYLSEFLKA--------------------NPKGGAFVQTEKDIV-----
          + W SL +    + L    +  W IWNDRN+ +H  Q+   + +CEW+  +L    +A                     P     ++   D       
Subjt:  IKDYWLSLADN-PMEVLERICVGTWSIWNDRNNSLHQCQIPNPKLRCEWINDYLSEFLKA--------------------NPKGGAFVQTEKDIV-----

Query:  NLISDVLRDKQGHLKLVRNLSSQAGNSSLEAEAIAVLEGMRLARSLNVQQFTVLSDSLTLINLINEKI
             ++RD    L    ++      S L AE   +LEG++ A + N     V SDSL  I LI  +I
Subjt:  NLISDVLRDKQGHLKLVRNLSSQAGNSSLEAEAIAVLEGMRLARSLNVQQFTVLSDSLTLINLINEKI

XP_023901742.1 uncharacterized protein LOC112013579 [Quercus suber]4.2e-2131.35Show/hide
Query:  WNRVWKMRVPSKVKIFFWKTVHNSIPTMVNLRNHHVSVNGNCPVCQEEMETSDHALFQCTRAREVWDLLYPPMMRNLWVHMDIKDYWLS-LADNPMEVLE
        W R+W+++VP K+KIF W+T  N +PTM NL +  V  +  CP+C + +ET  HAL  C  A+  W   +     +L    D+ +  L  +A   +  LE
Subjt:  WNRVWKMRVPSKVKIFFWKTVHNSIPTMVNLRNHHVSVNGNCPVCQEEMETSDHALFQCTRAREVWDLLYPPMMRNLWVHMDIKDYWLS-LADNPMEVLE

Query:  RICVGTWSIWNDRNNSLHQCQIPNPKLRCEWINDYLSEF-----------------LKANPKGGAFVQT-----EKDIVNLISDVLRDKQGHLKLVRNLS
              WSIW +RN ++H+     P    E     L+EF                  KA P G   + T     + +  + I  V+RD +G   +V   S
Subjt:  RICVGTWSIWNDRNNSLHQCQIPNPKLRCEWINDYLSEF-----------------LKANPKGGAFVQT-----EKDIVNLISDVLRDKQGHLKLVRNLS

Query:  SQAGNSSLEA---EAIAVLEGMRLARSLNVQQFTVLSDSLTLINLINEKIQG
        S+  ++S  A   EA+A+ EG+ LA  + V      SDSL++I  I+EK+ G
Subjt:  SQAGNSSLEA---EAIAVLEGMRLARSLNVQQFTVLSDSLTLINLINEKIQG

XP_030479077.1 uncharacterized protein LOC115696311 [Cannabis sativa]1.0e-2229.08Show/hide
Query:  KCQEASLSNVEKETTWWNRVWKMRVPSKVKIFFWKTVHNSIPTMVNLRNHHVSVNGNCPVCQEEMETSDHALFQCTRAREVWDLLYPPMMRNLWVHMDIK
        K  E+S S++     WWNR+W +++P KVKIF W+ +++++PT VNL +  +SV  +C +C    E+S HALF CTRA++VW L +      L   M+  
Subjt:  KCQEASLSNVEKETTWWNRVWKMRVPSKVKIFFWKTVHNSIPTMVNLRNHHVSVNGNCPVCQEEMETSDHALFQCTRAREVWDLLYPPMMRNLWVHMDIK

Query:  DYWLSLAD-NPMEVLERICVGTWSIWNDRNNSLHQCQIPNPKLRCEWINDYLSEFLKAN--------------------PKGG--------AFVQTEKDI
        + + S+A     +VL +I    W IW +RN  +H  +     + C +++ Y+ +FL A+                    P+G         A   T    
Subjt:  DYWLSLAD-NPMEVLERICVGTWSIWNDRNNSLHQCQIPNPKLRCEWINDYLSEFLKAN--------------------PKGG--------AFVQTEKDI

Query:  VNL------------ISDVLRDKQGHLKLVRNLSSQAGNSSLEAEAIAVLEGMRLARSLNVQQFTVLSDSLTLINLINEKIQ
        +N+            I DV+RD  G +    +       S  E EA A+L  ++ A  L +Q   + ++SL   N IN+  Q
Subjt:  VNL------------ISDVLRDKQGHLKLVRNLSSQAGNSSLEAEAIAVLEGMRLARSLNVQQFTVLSDSLTLINLINEKIQ

XP_030943029.1 uncharacterized protein LOC115967981 [Quercus lobata]3.8e-2228.95Show/hide
Query:  QEASLSNVEKETTWWNRVWKMRVPSKVKIFFWKTVHNSIPTMVNLRNHHVSVNGNCPVCQEEMETSDHALFQCTRAREVWDLLYPPMMRNLWVHMDIKDY
        +E   ++    T  W R+W+ +VP K+KIF W+T  N +PTM NL +  V  +  CP+C + +ET+ HAL  C  A+  W   Y   +     + D  D 
Subjt:  QEASLSNVEKETTWWNRVWKMRVPSKVKIFFWKTVHNSIPTMVNLRNHHVSVNGNCPVCQEEMETSDHALFQCTRAREVWDLLYPPMMRNLWVHMDIKDY

Query:  WLS-LADNPMEVLERICVGTWSIWNDRNNSLHQCQIPNPKLRCEWINDYLSE-------------------------FLKANPKGGAFVQTEKDIVNLIS
         L  +A      LE     TWSIW +RN ++H+     P    E     L+E                         F+K N    AF    K  + +  
Subjt:  WLS-LADNPMEVLERICVGTWSIWNDRNNSLHQCQIPNPKLRCEWINDYLSE-------------------------FLKANPKGGAFVQTEKDIVNLIS

Query:  DVLRDKQGHLKLVRNLSSQAGNSSLEAEAIAVLEGMRLARSLNVQQFTVLSDSLTLINLINEKIQG
         V+RD  G +    N       S   +EA+A+ +G+ LA  + V      SD+L++I  IN  I G
Subjt:  DVLRDKQGHLKLVRNLSSQAGNSSLEAEAIAVLEGMRLARSLNVQQFTVLSDSLTLINLINEKIQG

XP_030970300.1 uncharacterized protein LOC115990627 [Quercus lobata]3.2e-2129.05Show/hide
Query:  QEASLSNVEKETTWWNRVWKMRVPSKVKIFFWKTVHNSIPTMVNLRNHHVSVNGNCPVCQEEMETSDHALFQCTRAREVWDLLYPPMMRNLWVHMDIKDY
        ++   ++    T  W  +W+ +VP K+KIF W+T  N +PTM NL +  V  +  CP+C + +ET+ HAL  C  A+  W   Y   +       D+ D 
Subjt:  QEASLSNVEKETTWWNRVWKMRVPSKVKIFFWKTVHNSIPTMVNLRNHHVSVNGNCPVCQEEMETSDHALFQCTRAREVWDLLYPPMMRNLWVHMDIKDY

Query:  WLS-LADNPMEVLERICVGTWSIWNDRNNSLHQCQIPNPKLRCEWINDYLSEFLKANPKGGAFVQTEKDIVNLISDVLRDKQGHLKLVRNLSSQAGNSSL
         L  +A      LE      WSIW +RN ++H+     P    E     L+EF  A     AF       + +   V+RD  G +            S+ 
Subjt:  WLS-LADNPMEVLERICVGTWSIWNDRNNSLHQCQIPNPKLRCEWINDYLSEFLKANPKGGAFVQTEKDIVNLISDVLRDKQGHLKLVRNLSSQAGNSSL

Query:  EAEAIAVLEGMRLARSLNVQQFTVLSDSLTLINLINEKIQG
         +EA+A+ +G+ LA  + V      S++L++I  IN+ I G
Subjt:  EAEAIAVLEGMRLARSLNVQQFTVLSDSLTLINLINEKIQG

TrEMBL top hitse value%identityAlignment
A0A6J1DX30 uncharacterized protein LOC1110248741.1e-2428.36Show/hide
Query:  MKCQEASLSNVEKETTWWNRVWKMRVPSKVKIFFWKTVHNSIPTMVNLRNHHVSVNGNCPVCQEEMETSDHALFQCTRAREVWDLLYPPM-MRNLWVHMD
        +KC   S S      T WN +WK+ VP+K+KIF W++ H  IPT  NL    +     C +C +  E+  HA F C RAR++W  L+P +   +   ++ 
Subjt:  MKCQEASLSNVEKETTWWNRVWKMRVPSKVKIFFWKTVHNSIPTMVNLRNHHVSVNGNCPVCQEEMETSDHALFQCTRAREVWDLLYPPM-MRNLWVHMD

Query:  IKDYWLSLADN-PMEVLERICVGTWSIWNDRNNSLHQCQIPNPKLRCEWINDYLSEFLKA--------------------NPKGGAFVQTEKDIV-----
          + W SL +    + L    +  W IWNDRN+ +H  Q+   + +CEW+  +L    +A                     P     ++   D       
Subjt:  IKDYWLSLADN-PMEVLERICVGTWSIWNDRNNSLHQCQIPNPKLRCEWINDYLSEFLKA--------------------NPKGGAFVQTEKDIV-----

Query:  NLISDVLRDKQGHLKLVRNLSSQAGNSSLEAEAIAVLEGMRLARSLNVQQFTVLSDSLTLINLINEKI
             ++RD    L    ++      S L AE   +LEG++ A + N     V SDSL  I LI  +I
Subjt:  NLISDVLRDKQGHLKLVRNLSSQAGNSSLEAEAIAVLEGMRLARSLNVQQFTVLSDSLTLINLINEKI

A0A803NL40 Uncharacterized protein2.0e-2137.27Show/hide
Query:  KCQEASLSNVEKETTWWNRVWKMRVPSKVKIFFWKTVHNSIPTMVNLRNHHVSVNGNCPVCQEEMETSDHALFQCTRAREVWDLL----YPPMMRNLWVH
        K  ++S S+  ++  WWNR+W + +P KVKIF W+ ++N++PT  NL+   +S    C +C    E+  HALF C+R R+VW        P  ++NL   
Subjt:  KCQEASLSNVEKETTWWNRVWKMRVPSKVKIFFWKTVHNSIPTMVNLRNHHVSVNGNCPVCQEEMETSDHALFQCTRAREVWDLL----YPPMMRNLWVH

Query:  MDIKDYWLSLADNPMEVLERICVGTWSIWNDRNNSLHQCQIPNPKLRCEWINDYLSEFLKA
         DI  Y L+   N +E LE+I    W IW +RN  LHQ +    K+ C +   YL  F KA
Subjt:  MDIKDYWLSLADNPMEVLERICVGTWSIWNDRNNSLHQCQIPNPKLRCEWINDYLSEFLKA

A0A803NQ77 Uncharacterized protein5.3e-2236.08Show/hide
Query:  EASLSNVEKETTWWNRVWKMRVPSKVKIFFWKTVHNSIPTMVNLRNHHVSVNGNCPVCQEEMETSDHALFQCTRAREVWD----LLYPPMMRNLWVHMDI
        E S S+  ++  WWNR+W +R+P K+KIF W+ ++ ++PT VNL +  +S +  C +CQ    TS HA+F+C RA+ VW      +Y P M N   + DI
Subjt:  EASLSNVEKETTWWNRVWKMRVPSKVKIFFWKTVHNSIPTMVNLRNHHVSVNGNCPVCQEEMETSDHALFQCTRAREVWD----LLYPPMMRNLWVHMDI

Query:  KDYWLSLADNPMEVLERICVGTWSIWNDRNNSLHQCQIPNPKLRCEWINDYLSEFLKA
          Y ++ A N +E+ + +C+  W+IW++RN   H  +     + C +   YL +F KA
Subjt:  KDYWLSLADNPMEVLERICVGTWSIWNDRNNSLHQCQIPNPKLRCEWINDYLSEFLKA

A0A803PKZ1 Uncharacterized protein4.1e-2236.36Show/hide
Query:  EASLSNVEKETTWWNRVWKMRVPSKVKIFFWKTVHNSIPTMVNLRNHHVSVNGNCPVCQEEMETSDHALFQCTRAREVWD----LLYPPMMRNLWVHMDI
        E S S+  ++  WWNR+W +R+P K+KIF W+ ++ ++PT VNL +   S +  C +CQ   +TS HA+F+C RA+ VW      ++ P M N     DI
Subjt:  EASLSNVEKETTWWNRVWKMRVPSKVKIFFWKTVHNSIPTMVNLRNHHVSVNGNCPVCQEEMETSDHALFQCTRAREVWD----LLYPPMMRNLWVHMDI

Query:  KDYWLSLADNPMEVLERICVGTWSIWNDRNNSLHQCQIPNPKLRCEWINDYLSEFLKA-NPKGGA
          Y ++ A N  E+ + +C+  WSIW++RN  +H  +     + C +   YL++F KA  PK  A
Subjt:  KDYWLSLADNPMEVLERICVGTWSIWNDRNNSLHQCQIPNPKLRCEWINDYLSEFLKA-NPKGGA

A0A803QG02 Uncharacterized protein4.8e-2329.08Show/hide
Query:  KCQEASLSNVEKETTWWNRVWKMRVPSKVKIFFWKTVHNSIPTMVNLRNHHVSVNGNCPVCQEEMETSDHALFQCTRAREVWDLLYPPMMRNLWVHMDIK
        K  E+S S++     WWNR+W +++P KVKIF W+ +++++PT VNL +  +SV  +C +C    E+S HALF CTRA++VW L +      L   M+  
Subjt:  KCQEASLSNVEKETTWWNRVWKMRVPSKVKIFFWKTVHNSIPTMVNLRNHHVSVNGNCPVCQEEMETSDHALFQCTRAREVWDLLYPPMMRNLWVHMDIK

Query:  DYWLSLAD-NPMEVLERICVGTWSIWNDRNNSLHQCQIPNPKLRCEWINDYLSEFLKAN--------------------PKGG--------AFVQTEKDI
        + + S+A     +VL +I    W IW +RN  +H  +     + C +++ Y+ +FL A+                    P+G         A   T    
Subjt:  DYWLSLAD-NPMEVLERICVGTWSIWNDRNNSLHQCQIPNPKLRCEWINDYLSEFLKAN--------------------PKGG--------AFVQTEKDI

Query:  VNL------------ISDVLRDKQGHLKLVRNLSSQAGNSSLEAEAIAVLEGMRLARSLNVQQFTVLSDSLTLINLINEKIQ
        +N+            I DV+RD  G +    +       S  E EA A+L  ++ A  L +Q   + ++SL   N IN+  Q
Subjt:  VNL------------ISDVLRDKQGHLKLVRNLSSQAGNSSLEAEAIAVLEGMRLARSLNVQQFTVLSDSLTLINLINEKIQ

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657502.0e-0530.14Show/hide
Query:  TWWNRVWKMRVPSKVKIFFWKTVHNSIPTMVNLRNHHVSVNGNCPVCQEEMETSDHALFQCTRAREVWDLLYP
        +++N +WK+RVP +VK F W   + ++ T       H+S +  C VC+  +E+  H L  C     +W  + P
Subjt:  TWWNRVWKMRVPSKVKIFFWKTVHNSIPTMVNLRNHHVSVNGNCPVCQEEMETSDHALFQCTRAREVWDLLYP

Arabidopsis top hitse value%identityAlignment
AT2G02650.1 Ribonuclease H-like superfamily protein3.8e-1223.6Show/hide
Query:  VWKMRVPSKVKIFFWKTVHNSIPTMVNLRNHHVSVNGNCPVCQEEMETSDHALFQCTRAREVWDLLYPPMMRNLWVHMDIKDYW---LSLADNPMEVLER
        +WK+ V  K+K F W+ V  ++ T   LR+ ++  +  C  C  E ET  H +F C   + VW             ++ I + W    S  DN   +++ 
Subjt:  VWKMRVPSKVKIFFWKTVHNSIPTMVNLRNHHVSVNGNCPVCQEEMETSDHALFQCTRAREVWDLLYPPMMRNLWVHMDIKDYW---LSLADNPMEVLER

Query:  ICVGT-------------WSIWNDRNNSLHQ--CQIPNPKLR------CEWIN------------------DYLSEFLKANPKGGAFVQ-------TEKD
            T             W +W  RN  L Q  CQ P+ + R       EW+N                      +  + NP    +V+       T+  
Subjt:  ICVGT-------------WSIWNDRNNSLHQ--CQIPNPKLR------CEWIN------------------DYLSEFLKANPKGGAFVQ-------TEKD

Query:  IVNLISDVLRDKQGHLKLVRNLSSQAGNSSLEAEAIAVLEGMRLARSLNVQQFTVLSDSLTLINLIN
                +R+  GH+ L  N   Q+   SL AEA+  L  +++  +  ++     SDS +L+ LIN
Subjt:  IVNLISDVLRDKQGHLKLVRNLSSQAGNSSLEAEAIAVLEGMRLARSLNVQQFTVLSDSLTLINLIN

AT3G25270.1 Ribonuclease H-like superfamily protein5.5e-1128.93Show/hide
Query:  RVWKMRVPSKVKIFFWKTVHNSIPTMVNLRNHHVSVNGNCPVCQEEMETSDHALFQCTRAREVWDLLYPP--MMRNLWVHMDIKDYWL---SLADNPMEV
        ++WK++   K+K F WK +  ++ T  NL+  H+  +  C  C +E ETS H  F C  A++VW     P   +R   + M+ K   L    LA+   ++
Subjt:  RVWKMRVPSKVKIFFWKTVHNSIPTMVNLRNHHVSVNGNCPVCQEEMETSDHALFQCTRAREVWDLLYPP--MMRNLWVHMDIKDYWL---SLADNPMEV

Query:  LERICVGTWSIWNDRNNSLHQ
                W +W  RN  + Q
Subjt:  LERICVGTWSIWNDRNNSLHQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGAAGTGTCAAGAGGCCTCATTGTCAAATGTGGAAAAGGAGACTACTTGGTGGAATAGGGTGTGGAAGATGAGAGTGCCTAGCAAAGTGAAAATTTTCTTCTGGAA
AACTGTTCACAACTCTATCCCAACTATGGTAAACCTACGGAATCATCATGTATCGGTTAACGGGAATTGCCCGGTTTGCCAGGAGGAGATGGAGACTTCAGACCATGCCC
TCTTTCAGTGTACGAGGGCTCGTGAGGTATGGGACCTTCTTTATCCACCAATGATGAGGAATTTATGGGTTCATATGGATATTAAAGACTACTGGTTGAGTTTGGCTGAC
AATCCAATGGAGGTCTTAGAGCGTATTTGTGTGGGGACCTGGTCAATTTGGAACGACAGGAATAACTCGTTACATCAGTGTCAAATCCCTAATCCAAAGCTTAGATGTGA
ATGGATTAATGACTATCTGTCTGAGTTCTTGAAGGCCAACCCGAAAGGCGGTGCCTTTGTTCAAACGGAGAAAGATATTGTTAATCTCATTTCAGATGTTTTGCGTGATA
AACAGGGGCATCTCAAGCTGGTGAGAAATCTATCTTCTCAGGCTGGTAACTCTTCGTTGGAAGCGGAAGCAATAGCGGTGCTTGAAGGGATGCGTCTGGCTAGATCTTTG
AATGTGCAACAATTCACTGTGTTGTCTGATTCTTTGACTTTGATAAATTTGATCAATGAGAAGATTCAGGGGGAGGCCTATGTTGCTGCGACTCTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGATGAAGTGTCAAGAGGCCTCATTGTCAAATGTGGAAAAGGAGACTACTTGGTGGAATAGGGTGTGGAAGATGAGAGTGCCTAGCAAAGTGAAAATTTTCTTCTGGAA
AACTGTTCACAACTCTATCCCAACTATGGTAAACCTACGGAATCATCATGTATCGGTTAACGGGAATTGCCCGGTTTGCCAGGAGGAGATGGAGACTTCAGACCATGCCC
TCTTTCAGTGTACGAGGGCTCGTGAGGTATGGGACCTTCTTTATCCACCAATGATGAGGAATTTATGGGTTCATATGGATATTAAAGACTACTGGTTGAGTTTGGCTGAC
AATCCAATGGAGGTCTTAGAGCGTATTTGTGTGGGGACCTGGTCAATTTGGAACGACAGGAATAACTCGTTACATCAGTGTCAAATCCCTAATCCAAAGCTTAGATGTGA
ATGGATTAATGACTATCTGTCTGAGTTCTTGAAGGCCAACCCGAAAGGCGGTGCCTTTGTTCAAACGGAGAAAGATATTGTTAATCTCATTTCAGATGTTTTGCGTGATA
AACAGGGGCATCTCAAGCTGGTGAGAAATCTATCTTCTCAGGCTGGTAACTCTTCGTTGGAAGCGGAAGCAATAGCGGTGCTTGAAGGGATGCGTCTGGCTAGATCTTTG
AATGTGCAACAATTCACTGTGTTGTCTGATTCTTTGACTTTGATAAATTTGATCAATGAGAAGATTCAGGGGGAGGCCTATGTTGCTGCGACTCTCTAG
Protein sequenceShow/hide protein sequence
MMKCQEASLSNVEKETTWWNRVWKMRVPSKVKIFFWKTVHNSIPTMVNLRNHHVSVNGNCPVCQEEMETSDHALFQCTRAREVWDLLYPPMMRNLWVHMDIKDYWLSLAD
NPMEVLERICVGTWSIWNDRNNSLHQCQIPNPKLRCEWINDYLSEFLKANPKGGAFVQTEKDIVNLISDVLRDKQGHLKLVRNLSSQAGNSSLEAEAIAVLEGMRLARSL
NVQQFTVLSDSLTLINLINEKIQGEAYVAATL