CuGenDBv2

Lag0021549 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0021549
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr7:9038723..9043181
RNA-Seq Expression: Lag0021549
Synteny: Lag0021549
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR044730 - Ribonuclease H-like domain, plant type
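
The genome location above uses a `sequence:start..end` notation. As a minimal sketch, it could be split into its parts as shown below. The function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import Tuple


def parse_location(loc: str) -> Tuple[str, int, int]:
    """Split a 'name:start..end' string into (name, start, end)."""
    m = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


chrom, start, end = parse_location("chr7:9038723..9043181")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 4459 bp on chr7.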


Homology

GenBank top hits (e value, % identity, alignment)
CAH66131.1 OSIGBa0135L04.5 [Oryza sativa] | e value: 7.4e-15 | % identity: 27.57
Query:  HESDIPPPNIRSRWILHYLVEFDRAEERRKISLQAGSKLMKVSRSQS-------------WASPPVNTWKINVDAAWDGLS--TGIGAICRNSNGEILGA
        H    PP  +  R++  Y+       + +  +L  G  +++ SR                W  P     K+NVD ++D  S   GIGA+ RNS GE++ +
Subjt:  HESDIPPPNIRSRWILHYLVEFDRAEERRKISLQAGSKLMKVSRSQS-------------WASPPVNTWKINVDAAWDGLS--TGIGAICRNSNGEILGA

Query:  CSKFLDFSLPPPMAELLAIKEGVDLALSMEGAKLIIEIDCLQAFKLVCGEIEIWNEVGILLEGILRRSFAGEEIVFSFVPRECNVLADALAKRAKRDKCN
           FLD    P   ELLA KEG+++AL      +++E+DC +A KL+    +  +EV  ++  I +  F   EI+   +    N  +  LA + + +   
Subjt:  CSKFLDFSLPPPMAELLAIKEGVDLALSMEGAKLIIEIDCLQAFKLVCGEIEIWNEVGILLEGILRRSFAGEEIVFSFVPRECNVLADALAKRAKRDKCN

Query:  LMWRDNLPQWLVAL
          W D+   +L AL
Subjt:  LMWRDNLPQWLVAL

PRQ16761.1 putative ribonuclease H-like domain-containing protein [Rosa chinensis] | e value: 3.9e-16 | % identity: 32.2
Query:  AEERRKISLQAGSKLM--KVSRSQSWASPPVNTWKINVDAA--WDGLSTGIGAICRNSNGEILGACSKFLDFSLPPPMAELLAIKEGVDLALSMEGAKLI
        A + R +    GS L+  ++ R   W +PP +++K+N DA+   +G   G+GA+ R  +G+++ A    +   L P  AEL+A+K G+  A+      L+
Subjt:  AEERRKISLQAGSKLM--KVSRSQSWASPPVNTWKINVDAA--WDGLSTGIGAICRNSNGEILGACSKFLDFSLPPPMAELLAIKEGVDLALSMEGAKLI

Query:  IEIDCLQAFKLVCGEIEIWNEVGILLEGILRRSFAGEEIVFSFVPRECNVLADALAKRAKRDKCNLMWRDNLPQWLV
        +E DCL+A +LV    E   EVG+L+E +             FVPRE N +A ++AK   R+     W +  P WL+
Subjt:  IEIDCLQAFKLVCGEIEIWNEVGILLEGILRRSFAGEEIVFSFVPRECNVLADALAKRAKRDKCNLMWRDNLPQWLV

XP_022158489.1 uncharacterized protein LOC111024968 [Momordica charantia] | e value: 1.5e-15 | % identity: 34.3
Query:  DRNKKIHESDIPPPNIRSRWILHYLVEFDRAEERRKISLQAGSKLMKVSRSQSWASPPVNTWKINVDAAWDGLSTGIGAICRNSNGEILGACSKFLDFSL
        DR+  I++  IP   I+  WIL Y      AEE R +    G K  ++ R   W  P     K+N DAA     +G+G + R  N EI+GA    +DF +
Subjt:  DRNKKIHESDIPPPNIRSRWILHYLVEFDRAEERRKISLQAGSKLMKVSRSQSWASPPVNTWKINVDAAWDGLSTGIGAICRNSNGEILGACSKFLDFSL

Query:  --PPPMAELLAIKEGVDLALSMEGAKLIIEIDCLQAFKLVCGEIEIW-NEVGILLEGILRRSFAG--EEIVFSFVPRECNVLADALAKRAKRDKCNLMWR
           P +A++LAI+EG+ LA  +   ++++E D L+A  L+ G+   W  E    +E I  R+FA   +EI F  V RE N +A  L +     +C  +WR
Subjt:  --PPPMAELLAIKEGVDLALSMEGAKLIIEIDCLQAFKLVCGEIEIW-NEVGILLEGILRRSFAG--EEIVFSFVPRECNVLADALAKRAKRDKCNLMWR

Query:  DNLPQWL
         + P WL
Subjt:  DNLPQWL

XP_023893347.1 uncharacterized protein LOC112005329 [Quercus suber] | e value: 7.4e-15 | % identity: 32.69
Query:  GDRNKKIHESDIPPPNIRSRWILHYLVEFDRAEERRKISLQAGSKLMKVSRSQSWASPPVNTWKINVDAAWD-GLS-TGIGAICRNSNGEILGACSKFLD
        G+RN+  H     PP+         L +F+ A +     L   S L        WA+PP    KINVD A D G+  +GIG I R+S+G ++GA SK L 
Subjt:  GDRNKKIHESDIPPPNIRSRWILHYLVEFDRAEERRKISLQAGSKLMKVSRSQSWASPPVNTWKINVDAAWD-GLS-TGIGAICRNSNGEILGACSKFLD

Query:  FSLPPPMAELLAIKEGVDLALSMEGAKLIIEIDCLQAFKLVCGEIEIWNEVGILLEGILRRSFAGEEIVFSFVPRECNVLADALAKRAKRDKCNLMWRDN
         SL   + E  A+  GV  AL ++ ++ I E D L    L     E   E+G +LE I+  S +     F  + R+ N  A +LA+ AK      +W+  
Subjt:  FSLPPPMAELLAIKEGVDLALSMEGAKLIIEIDCLQAFKLVCGEIEIWNEVGILLEGILRRSFAGEEIVFSFVPRECNVLADALAKRAKRDKCNLMWRDN

Query:  LPQWLVAL
         PQ L+ +
Subjt:  LPQWLVAL

XP_038902513.1 uncharacterized protein LOC120089172 [Benincasa hispida] | e value: 5.1e-16 | % identity: 30.89
Query:  PPNIRSRWILHYL----VEFDRAEERRKISLQAGSKLMKVSRSQSWASPPVNTWKINVDAAW--DGLSTGIGAICRNSNGEILGACSKFLDFSLPPPMAE
        P  + S W+   +    + FDRA   R  + +  S     +  ++W++PP +  K+NVDAAW     S+G  AI R++ G +       +D   PPP+AE
Subjt:  PPNIRSRWILHYL----VEFDRAEERRKISLQAGSKLMKVSRSQSWASPPVNTWKINVDAAW--DGLSTGIGAICRNSNGEILGACSKFLDFSLPPPMAE

Query:  LLAIKEGVDLALSMEGAKLIIEIDCLQAFKLVCGEIEIWNEVGILLEGILRRSFAGEEIVFSFVPRECNVLADALAKRAKRDKCNLMWRDN
           + +G+ L   M   K+I++ DC  A  L    +   + V + LE I   S     I F+++PR  N LAD +AKR +    N +W D+
Subjt:  LLAIKEGVDLALSMEGAKLIIEIDCLQAFKLVCGEIEIWNEVGILLEGILRRSFAGEEIVFSFVPRECNVLADALAKRAKRDKCNLMWRDN

TrEMBL top hits (e value, % identity, alignment)
A0A2P6P4A4 Putative ribonuclease H-like domain-containing protein | e value: 1.9e-16 | % identity: 32.2
Query:  AEERRKISLQAGSKLM--KVSRSQSWASPPVNTWKINVDAA--WDGLSTGIGAICRNSNGEILGACSKFLDFSLPPPMAELLAIKEGVDLALSMEGAKLI
        A + R +    GS L+  ++ R   W +PP +++K+N DA+   +G   G+GA+ R  +G+++ A    +   L P  AEL+A+K G+  A+      L+
Subjt:  AEERRKISLQAGSKLM--KVSRSQSWASPPVNTWKINVDAA--WDGLSTGIGAICRNSNGEILGACSKFLDFSLPPPMAELLAIKEGVDLALSMEGAKLI

Query:  IEIDCLQAFKLVCGEIEIWNEVGILLEGILRRSFAGEEIVFSFVPRECNVLADALAKRAKRDKCNLMWRDNLPQWLV
        +E DCL+A +LV    E   EVG+L+E +             FVPRE N +A ++AK   R+     W +  P WL+
Subjt:  IEIDCLQAFKLVCGEIEIWNEVGILLEGILRRSFAGEEIVFSFVPRECNVLADALAKRAKRDKCNLMWRDNLPQWLV

A0A6J1BQ49 uncharacterized protein LOC111004786 | e value: 8.0e-15 | % identity: 37.32
Query:  DRNKKIHESDIPPPNIRSRWILHYLVEFDRAEERRKISLQAGSKLMKVSRSQSWASPPVNTWKINVDAAWDGLSTGIGAICRNSNGEILGACSKFLDFSL
        DRN   H S +  P +R  WI  Y   + +A+E ++IS Q  S      R   W  P  +  K+N DAA    STG+G I R+  G +L A S FL   L
Subjt:  DRNKKIHESDIPPPNIRSRWILHYLVEFDRAEERRKISLQAGSKLMKVSRSQSWASPPVNTWKINVDAAWDGLSTGIGAICRNSNGEILGACSKFLDFSL

Query:  PPPMAELLAIKEGVDLALSMEGAKLIIEIDCLQAFKLVCGEI
         P  AE+  I E + LA S    +L++E DC +A +LV G++
Subjt:  PPPMAELLAIKEGVDLALSMEGAKLIIEIDCLQAFKLVCGEI

A0A6J1DZK3 uncharacterized protein LOC111024968 | e value: 7.2e-16 | % identity: 34.3
Query:  DRNKKIHESDIPPPNIRSRWILHYLVEFDRAEERRKISLQAGSKLMKVSRSQSWASPPVNTWKINVDAAWDGLSTGIGAICRNSNGEILGACSKFLDFSL
        DR+  I++  IP   I+  WIL Y      AEE R +    G K  ++ R   W  P     K+N DAA     +G+G + R  N EI+GA    +DF +
Subjt:  DRNKKIHESDIPPPNIRSRWILHYLVEFDRAEERRKISLQAGSKLMKVSRSQSWASPPVNTWKINVDAAWDGLSTGIGAICRNSNGEILGACSKFLDFSL

Query:  --PPPMAELLAIKEGVDLALSMEGAKLIIEIDCLQAFKLVCGEIEIW-NEVGILLEGILRRSFAG--EEIVFSFVPRECNVLADALAKRAKRDKCNLMWR
           P +A++LAI+EG+ LA  +   ++++E D L+A  L+ G+   W  E    +E I  R+FA   +EI F  V RE N +A  L +     +C  +WR
Subjt:  --PPPMAELLAIKEGVDLALSMEGAKLIIEIDCLQAFKLVCGEIEIW-NEVGILLEGILRRSFAG--EEIVFSFVPRECNVLADALAKRAKRDKCNLMWR

Query:  DNLPQWL
         + P WL
Subjt:  DNLPQWL

A0A803QCV6 Uncharacterized protein | e value: 1.8e-14 | % identity: 29.95
Query:  MLGYLGDRNKKIHESDIPPPNIRSRWILHYLVEFDRAEERRKISLQAGSKLMKVSRSQSWASPPVNTWKINVDAAWDGL--STGIGAICRNSNGEILGAC
        M G L  RNKK     +PP  I   WI    +E D       I L+  S   +      W  PP N++ IN DA+ +    S G+GA+ RN++G+++   
Subjt:  MLGYLGDRNKKIHESDIPPPNIRSRWILHYLVEFDRAEERRKISLQAGSKLMKVSRSQSWASPPVNTWKINVDAAWDGL--STGIGAICRNSNGEILGAC

Query:  SKFLDFSLPPPMAELLAIKEGVDLALSMEGAKLIIEIDCLQAFKLVCGEIEIWNEVGILLEGILRRSFAGEEIVFSFVPRECNVLADALAKRAKRDKCNL
         +    S    +AE+LA++ G+ LA+       II+ DC++A   + G  ++  +   LLE I         +    + R+ N  A +LAK A   KCN 
Subjt:  SKFLDFSLPPPMAELLAIKEGVDLALSMEGAKLIIEIDCLQAFKLVCGEIEIWNEVGILLEGILRRSFAGEEIVFSFVPRECNVLADALAKRAKRDKCNL

Query:  MWRDNLP
        +W  + P
Subjt:  MWRDNLP

Q01M84 OSIGBa0135L04.5 protein | e value: 3.6e-15 | % identity: 27.57
Query:  HESDIPPPNIRSRWILHYLVEFDRAEERRKISLQAGSKLMKVSRSQS-------------WASPPVNTWKINVDAAWDGLS--TGIGAICRNSNGEILGA
        H    PP  +  R++  Y+       + +  +L  G  +++ SR                W  P     K+NVD ++D  S   GIGA+ RNS GE++ +
Subjt:  HESDIPPPNIRSRWILHYLVEFDRAEERRKISLQAGSKLMKVSRSQS-------------WASPPVNTWKINVDAAWDGLS--TGIGAICRNSNGEILGA

Query:  CSKFLDFSLPPPMAELLAIKEGVDLALSMEGAKLIIEIDCLQAFKLVCGEIEIWNEVGILLEGILRRSFAGEEIVFSFVPRECNVLADALAKRAKRDKCN
           FLD    P   ELLA KEG+++AL      +++E+DC +A KL+    +  +EV  ++  I +  F   EI+   +    N  +  LA + + +   
Subjt:  CSKFLDFSLPPPMAELLAIKEGVDLALSMEGAKLIIEIDCLQAFKLVCGEIEIWNEVGILLEGILRRSFAGEEIVFSFVPRECNVLADALAKRAKRDKCN

Query:  LMWRDNLPQWLVAL
          W D+   +L AL
Subjt:  LMWRDNLPQWLVAL

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT4G09490.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | e value: 6.3e-04 | % identity: 32.31
Query:  PPVNTWKINVDAAW--DGLSTGIGAICRN-SNGEILGACSKFLDFSLPPPMAELLAIKEGVDLALSMEGAKLIIEIDCLQAFKLVCGEIEIWNEVGILLE
        P ++   I  DAAW  +    G G + RN      L   S   +  L P MAE +A+   +  A S+   KL +  D  Q    +  E       GI+ +
Subjt:  PPVNTWKINVDAAW--DGLSTGIGAICRN-SNGEILGACSKFLDFSLPPPMAELLAIKEGVDLALSMEGAKLIIEIDCLQAFKLVCGEIEIWNEVGILLE

Query:  GILRRSFAGEEIVFSFVPRECNVLADALAK
         IL  S    ++ FSFVPR  N +AD LAK
Subjt:  GILRRSFAGEEIVFSFVPRECNVLADALAK

AT4G29090.1 Ribonuclease H-like superfamily protein | e value: 1.2e-07 | % identity: 29.45
Query:  EFDRAEERRKISLQAGSKLMKVSRSQ--SWASPPVNTWKINVDAAW--DGLSTGIGAICRNSNGEILGACSKFLDFSLPPPMAELLAIKEGVDLALSMEG
        E D  E R +   ++     +V+RS    W  PP    K N DA W  D    GIG + RN  GE+    ++ L        AEL A++  V L+LS   
Subjt:  EFDRAEERRKISLQAGSKLMKVSRSQ--SWASPPVNTWKINVDAAW--DGLSTGIGAICRNSNGEILGACSKFLDFSLPPPMAELLAIKEGVDLALSMEG

Query:  AKLIIEIDCLQAFKLVCGEIEIWNEVGILLEGILRRSFAGEEIVFSFVPRECNVLADALAKRA
           +I     Q    +    EIW  +   ++ + R      E+ F F+PRE N LA+ +A+ +
Subjt:  AKLIIEIDCLQAFKLVCGEIEIWNEVGILLEGILRRSFAGEEIVFSFVPRECNVLADALAKRA


Sequences

CDS sequence
ATGTTGGGCTATCTGGGAGATCGAAACAAGAAAATTCATGAATCTGATATTCCGCCTCCGAACATTCGTAGCAGATGGATTTTACATTACCTGGTGGAGTTTGAT
CGTGCTGAAGAAAGAAGAAAGATCAGCCTCCAAGCTGGTAGCAAGTTGATGAAAGTGTCTAGATCTCAGTCTTGGGCTTCTCCCCCTGTTAATACATGGAAAATC
AACGTCGATGCTGCTTGGGATGGATTATCTACTGGTATTGGAGCTATCTGTAGAAACAGCAATGGAGAAATTTTGGGTGCTTGTAGCAAATTTCTGGATTTTTCT
CTTCCGCCTCCCATGGCTGAGCTTCTGGCCATCAAGGAAGGTGTTGATCTGGCTCTCTCTATGGAGGGAGCTAAATTGATTATCGAAATCGACTGCCTTCAAGCT
TTCAAATTAGTGTGCGGAGAGATTGAAATCTGGAATGAGGTTGGGATTTTGTTGGAAGGAATTTTGCGAAGATCCTTTGCTGGTGAGGAGATTGTTTTTTCTTTC
GTCCCTCGTGAGTGTAATGTTTTAGCAGATGCTTTAGCTAAAAGAGCCAAAAGAGATAAATGTAATCTGATGTGGAGGGATAATCTTCCACAGTGGCTTGTTGCC
TTGGAAATGGTTGATTGGAGCTATAACGCAATATCAGAGTTAATCGGGTGCTCGGGGCGTGAAAAGATGCAAAGGAATGAAAAGAGTAAAAGTGGAGAAAAGTCA
AATCTCGGTCAACAGCAGGCTAGCGTCGAGACGCTAGCTCTTGAGCGTCTCGACGCTCACATTCCATATCAGATTAGGCGCGTAAAGCTTACAGCGTCGAGACGC
TATGATAGGAAGCGTCCCGACGCTACCGTTTTTCCTTATTCAGAACGCGCGTTTAAGAGGCAGCGTCGCGACGCTGTCTTGACAGCGTCTCGACGCTTCGACGAA
AAATCAGAATAA
mRNA sequence
ATGTTGGGCTATCTGGGAGATCGAAACAAGAAAATTCATGAATCTGATATTCCGCCTCCGAACATTCGTAGCAGATGGATTTTACATTACCTGGTGGAGTTTGAT
CGTGCTGAAGAAAGAAGAAAGATCAGCCTCCAAGCTGGTAGCAAGTTGATGAAAGTGTCTAGATCTCAGTCTTGGGCTTCTCCCCCTGTTAATACATGGAAAATC
AACGTCGATGCTGCTTGGGATGGATTATCTACTGGTATTGGAGCTATCTGTAGAAACAGCAATGGAGAAATTTTGGGTGCTTGTAGCAAATTTCTGGATTTTTCT
CTTCCGCCTCCCATGGCTGAGCTTCTGGCCATCAAGGAAGGTGTTGATCTGGCTCTCTCTATGGAGGGAGCTAAATTGATTATCGAAATCGACTGCCTTCAAGCT
TTCAAATTAGTGTGCGGAGAGATTGAAATCTGGAATGAGGTTGGGATTTTGTTGGAAGGAATTTTGCGAAGATCCTTTGCTGGTGAGGAGATTGTTTTTTCTTTC
GTCCCTCGTGAGTGTAATGTTTTAGCAGATGCTTTAGCTAAAAGAGCCAAAAGAGATAAATGTAATCTGATGTGGAGGGATAATCTTCCACAGTGGCTTGTTGCC
TTGGAAATGGTTGATTGGAGCTATAACGCAATATCAGAGTTAATCGGGTGCTCGGGGCGTGAAAAGATGCAAAGGAATGAAAAGAGTAAAAGTGGAGAAAAGTCA
AATCTCGGTCAACAGCAGGCTAGCGTCGAGACGCTAGCTCTTGAGCGTCTCGACGCTCACATTCCATATCAGATTAGGCGCGTAAAGCTTACAGCGTCGAGACGC
TATGATAGGAAGCGTCCCGACGCTACCGTTTTTCCTTATTCAGAACGCGCGTTTAAGAGGCAGCGTCGCGACGCTGTCTTGACAGCGTCTCGACGCTTCGACGAA
AAATCAGAATAA
Protein sequence
MLGYLGDRNKKIHESDIPPPNIRSRWILHYLVEFDRAEERRKISLQAGSKLMKVSRSQSWASPPVNTWKINVDAAWDGLSTGIGAICRNSNGEILGACSKFLDFS
LPPPMAELLAIKEGVDLALSMEGAKLIIEIDCLQAFKLVCGEIEIWNEVGILLEGILRRSFAGEEIVFSFVPRECNVLADALAKRAKRDKCNLMWRDNLPQWLVA
LEMVDWSYNAISELIGCSGREKMQRNEKSKSGEKSNLGQQQASVETLALERLDAHIPYQIRRVKLTASRRYDRKRPDATVFPYSERAFKRQRRDAVLTASRRFDE
KSE