; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0029185 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0029185
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr8:36192010..36197155
RNA-Seq ExpressionLag0029185
SyntenyLag0029185
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ONI01138.1 hypothetical protein PRUPE_6G123900 [Prunus persica]2.6e-2234.05Show/hide
Query:  ESKRIHWVSWDSLCCPKCLGGLGFRDMKLFNQALLTKQCWRLFRDPSSFMI---------SVHFV-----------------------------------
        + + IHWV W+ LC  K  GGLGFRD++ FNQALL KQCWR+ R P S +          SV F+                                   
Subjt:  ESKRIHWVSWDSLCCPKCLGGLGFRDMKLFNQALLTKQCWRLFRDPSSFMI---------SVHFV-----------------------------------

Query:  --------------------PSLPLTSCARDLFSDTGQWDGPKIWSHFTIADSEAILRRPLGNMLMEDCLIWHFEKNGLFSVKSG
                            P LPL++   DLF+ +GQW+ P +   F   + +AIL+ PL ++   DCLIWH+E+NG++SVKSG
Subjt:  --------------------PSLPLTSCARDLFSDTGQWDGPKIWSHFTIADSEAILRRPLGNMLMEDCLIWHFEKNGLFSVKSG

ONI09819.1 hypothetical protein PRUPE_4G011200 [Prunus persica]2.6e-2234.05Show/hide
Query:  ESKRIHWVSWDSLCCPKCLGGLGFRDMKLFNQALLTKQCWRLFRDPSSFMI---------SVHFV-----------------------------------
        + + IHWV W+ LC  K  GGLGFRD++ FNQALL KQCWR+ R P S +          SV F+                                   
Subjt:  ESKRIHWVSWDSLCCPKCLGGLGFRDMKLFNQALLTKQCWRLFRDPSSFMI---------SVHFV-----------------------------------

Query:  --------------------PSLPLTSCARDLFSDTGQWDGPKIWSHFTIADSEAILRRPLGNMLMEDCLIWHFEKNGLFSVKSG
                            P LPL++   DLF+ +GQW+ P +   F   + +AIL+ PL ++   DCLIWH+E+NG++SVKSG
Subjt:  --------------------PSLPLTSCARDLFSDTGQWDGPKIWSHFTIADSEAILRRPLGNMLMEDCLIWHFEKNGLFSVKSG

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]9.8e-2233.51Show/hide
Query:  ESKRIHWVSWDSLCCPKCLGGLGFRDMKLFNQALLTKQCWRLFRDPSSFMI---------SVHFV-----------------------------------
        + + IHWV W+ LC  K  GGLGFRD++ FNQALL KQCWR+ R P S +          SV F+                                   
Subjt:  ESKRIHWVSWDSLCCPKCLGGLGFRDMKLFNQALLTKQCWRLFRDPSSFMI---------SVHFV-----------------------------------

Query:  --------------------PSLPLTSCARDLFSDTGQWDGPKIWSHFTIADSEAILRRPLGNMLMEDCLIWHFEKNGLFSVKSG
                            P LPL++   DLF+ +GQW+ P +   F   + +A L+ PL ++   DCLIWH+E+NG++SVKSG
Subjt:  --------------------PSLPLTSCARDLFSDTGQWDGPKIWSHFTIADSEAILRRPLGNMLMEDCLIWHFEKNGLFSVKSG

XP_024033483.1 uncharacterized protein LOC112095606 [Citrus clementina]9.1e-2033.96Show/hide
Query:  ESKRIHWVSWDSLCCPKCLGGLGFRDMKLFNQALLTKQCWRLFRDPSS---------FMISVHFV-----------------------------PSLPLT
        + + IHW  W+ L   KC G +GFRD   FNQAL+ KQ WR+ + P S         F + VH +                             PSLP  
Subjt:  ESKRIHWVSWDSLCCPKCLGGLGFRDMKLFNQALLTKQCWRLFRDPSS---------FMISVHFV-----------------------------PSLPLT

Query:  SCARDLFSDTGQWDGPKIWSHFTIADSEAILRRPLGNMLMEDCLIWHFEKNGLFSVKSG
        +   +L ++  QWD  +I+ HF   D++ I+  PL     ED +IWH++K GL+SVKSG
Subjt:  SCARDLFSDTGQWDGPKIWSHFTIADSEAILRRPLGNMLMEDCLIWHFEKNGLFSVKSG

XP_028954978.1 uncharacterized protein LOC108172663 [Malus domestica]1.8e-2033.04Show/hide
Query:  ESKRIHWVSWDSLCCPKCLGGLGFRDMKLFNQALLTKQCWRLFRDPSSFMISVHFVPSLPLTSCARDLFSD-------TGQWDGPKIWSHFTIADSEAI-
        +S++IHW SWD LC  K  GG+GFR++  FN A+L KQ WR+ +DP S   S+     + L   +R L  D         QW   +IWS F +     + 
Subjt:  ESKRIHWVSWDSLCCPKCLGGLGFRDMKLFNQALLTKQCWRLFRDPSSFMISVHFVPSLPLTSCARDLFSD-------TGQWDGPKIWSHFTIADSEAI-

Query:  LRRPLGNM-LMED--------CLIWHFEKNGLFSVKSGCVGEG-----W--LWYA---------GGGSIMRNGRGEVLLSAGFVLPNCWNVDLAEAWAML
        LR  L ++ L+ED         + W + K    S  S  +G       W  +W A         G G ++R+G+G  L +A   LP   +   AE  A+ 
Subjt:  LRRPLGNM-LMED--------CLIWHFEKNGLFSVKSGCVGEG-----W--LWYA---------GGGSIMRNGRGEVLLSAGFVLPNCWNVDLAEAWAML

Query:  RGIEIARQMGFSRFHMETDSLRSISSI
        RG+E+A Q+GF +  + +DS ++IS I
Subjt:  RGIEIARQMGFSRFHMETDSLRSISSI

TrEMBL top hitse value%identityAlignment
A0A251NPF0 Reverse transcriptase domain-containing protein1.2e-2234.05Show/hide
Query:  ESKRIHWVSWDSLCCPKCLGGLGFRDMKLFNQALLTKQCWRLFRDPSSFMI---------SVHFV-----------------------------------
        + + IHWV W+ LC  K  GGLGFRD++ FNQALL KQCWR+ R P S +          SV F+                                   
Subjt:  ESKRIHWVSWDSLCCPKCLGGLGFRDMKLFNQALLTKQCWRLFRDPSSFMI---------SVHFV-----------------------------------

Query:  --------------------PSLPLTSCARDLFSDTGQWDGPKIWSHFTIADSEAILRRPLGNMLMEDCLIWHFEKNGLFSVKSG
                            P LPL++   DLF+ +GQW+ P +   F   + +AIL+ PL ++   DCLIWH+E+NG++SVKSG
Subjt:  --------------------PSLPLTSCARDLFSDTGQWDGPKIWSHFTIADSEAILRRPLGNMLMEDCLIWHFEKNGLFSVKSG

A0A5E4FZN9 PREDICTED: retrotransposon4.7e-2233.51Show/hide
Query:  ESKRIHWVSWDSLCCPKCLGGLGFRDMKLFNQALLTKQCWRLFRDPSSFMI---------SVHFV-----------------------------------
        + + IHWV W+ LC  K  GGLGFRD++ FNQALL KQCWR+ R P S +          SV F+                                   
Subjt:  ESKRIHWVSWDSLCCPKCLGGLGFRDMKLFNQALLTKQCWRLFRDPSSFMI---------SVHFV-----------------------------------

Query:  --------------------PSLPLTSCARDLFSDTGQWDGPKIWSHFTIADSEAILRRPLGNMLMEDCLIWHFEKNGLFSVKSG
                            P LPL++   DLF+ +GQW+ P +   F   + +A L+ PL ++   DCLIWH+E+NG++SVKSG
Subjt:  --------------------PSLPLTSCARDLFSDTGQWDGPKIWSHFTIADSEAILRRPLGNMLMEDCLIWHFEKNGLFSVKSG

A0A803QK36 Uncharacterized protein3.4e-2028.57Show/hide
Query:  IHWVSWDSLCCPKCLGGLGFRDMKLFNQALLTKQCWRLFRDPSSFMISV--------------------------------HFVPSLPLTS----CARDL
        IHW +W  LC  K  GG+GFR+   FNQALL KQ WR+  DPSS +  V                                +F P L L+S       D 
Subjt:  IHWVSWDSLCCPKCLGGLGFRDMKLFNQALLTKQCWRLFRDPSSFMISV--------------------------------HFVPSLPLTS----CARDL

Query:  FSDTGQWDGPKIWSHFTIADSEAILRRPLGNMLMEDCLIWHFEKNGLFSVKSG--------------------CVGEGWLWYAGGGSIMRNGRGEVLLSA
         + + QWD PK+   FT  D + IL  PL     ED L+WH+   G ++VKSG                    C+        G  +I+RN  G++L + 
Subjt:  FSDTGQWDGPKIWSHFTIADSEAILRRPLGNMLMEDCLIWHFEKNGLFSVKSG--------------------CVGEGWLWYAGGGSIMRNGRGEVLLSA

Query:  GFVLPNCWNVDLAEAWAMLRGIEIARQMGFSRFHMETDSLRSISSIVFDFTDFIDFILI
           +  C   +  EA A+  G+++   +  +   +E+DSL  ++ +   F     F +I
Subjt:  GFVLPNCWNVDLAEAWAMLRGIEIARQMGFSRFHMETDSLRSISSIVFDFTDFIDFILI

M5W5F3 Reverse transcriptase domain-containing protein (Fragment)1.2e-2234.05Show/hide
Query:  ESKRIHWVSWDSLCCPKCLGGLGFRDMKLFNQALLTKQCWRLFRDPSSFMI---------SVHFV-----------------------------------
        + + IHWV W+ LC  K  GGLGFRD++ FNQALL KQCWR+ R P S +          SV F+                                   
Subjt:  ESKRIHWVSWDSLCCPKCLGGLGFRDMKLFNQALLTKQCWRLFRDPSSFMI---------SVHFV-----------------------------------

Query:  --------------------PSLPLTSCARDLFSDTGQWDGPKIWSHFTIADSEAILRRPLGNMLMEDCLIWHFEKNGLFSVKSG
                            P LPL++   DLF+ +GQW+ P +   F   + +AIL+ PL ++   DCLIWH+E+NG++SVKSG
Subjt:  --------------------PSLPLTSCARDLFSDTGQWDGPKIWSHFTIADSEAILRRPLGNMLMEDCLIWHFEKNGLFSVKSG

M5WJW2 Reverse transcriptase domain-containing protein1.2e-2234.05Show/hide
Query:  ESKRIHWVSWDSLCCPKCLGGLGFRDMKLFNQALLTKQCWRLFRDPSSFMI---------SVHFV-----------------------------------
        + + IHWV W+ LC  K  GGLGFRD++ FNQALL KQCWR+ R P S +          SV F+                                   
Subjt:  ESKRIHWVSWDSLCCPKCLGGLGFRDMKLFNQALLTKQCWRLFRDPSSFMI---------SVHFV-----------------------------------

Query:  --------------------PSLPLTSCARDLFSDTGQWDGPKIWSHFTIADSEAILRRPLGNMLMEDCLIWHFEKNGLFSVKSG
                            P LPL++   DLF+ +GQW+ P +   F   + +AIL+ PL ++   DCLIWH+E+NG++SVKSG
Subjt:  --------------------PSLPLTSCARDLFSDTGQWDGPKIWSHFTIADSEAILRRPLGNMLMEDCLIWHFEKNGLFSVKSG

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657501.0e-0545.83Show/hide
Query:  ESKRIHWVSWDSLCCPKCLGGLGFRDMKLFNQALLTKQCWRLFRDPSS
        E K+ H V W  +C PK  GGLG R  K  N+AL++K  WRL ++ +S
Subjt:  ESKRIHWVSWDSLCCPKCLGGLGFRDMKLFNQALLTKQCWRLFRDPSS

P93295 Uncharacterized mitochondrial protein AtMg003102.1e-0640.3Show/hide
Query:  KRIHWVSWDSLCCPK-CLGGLGFRDMKLFNQALLTKQCWRLFRDPS---SFMISVHFVPSLPLTSCA
        ++I WV+W  LC  K   GGLGFRD+  FNQALL KQ +R+   P    S ++   + P   +  C+
Subjt:  KRIHWVSWDSLCCPK-CLGGLGFRDMKLFNQALLTKQCWRLFRDPS---SFMISVHFVPSLPLTSCA

Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein2.2e-1146.27Show/hide
Query:  ESKRIHWVSWDSLCCPKCLGGLGFRDMKLFNQALLTKQCWRLFRDPSSFMISV----HFVPSLPLTS
        E+K +HW +WD L C K  GG+GF+D++ FN ALL KQ WR+   P S M  V    +F  S PL +
Subjt:  ESKRIHWVSWDSLCCPKCLGGLGFRDMKLFNQALLTKQCWRLFRDPSSFMISV----HFVPSLPLTS

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.5e-0740.3Show/hide
Query:  KRIHWVSWDSLCCPK-CLGGLGFRDMKLFNQALLTKQCWRLFRDPS---SFMISVHFVPSLPLTSCA
        ++I WV+W  LC  K   GGLGFRD+  FNQALL KQ +R+   P    S ++   + P   +  C+
Subjt:  KRIHWVSWDSLCCPK-CLGGLGFRDMKLFNQALLTKQCWRLFRDPS---SFMISVHFVPSLPLTSCA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGGAGTCTAAACGGATCCACTGGGTGAGCTGGGACTCTCTCTGTTGTCCTAAATGCCTTGGTGGACTTGGTTTCAGAGATATGAAGCTTTTTAACCAGGCTCTTTT
AACTAAGCAGTGTTGGCGCTTGTTCAGAGACCCATCCTCCTTCATGATCTCGGTTCACTTCGTTCCCTCCTTGCCGCTGACTAGTTGTGCTCGGGATCTGTTCTCTGATA
CAGGGCAATGGGATGGGCCGAAAATTTGGAGCCACTTTACTATTGCAGATAGTGAAGCCATTCTGAGAAGACCACTTGGGAATATGTTAATGGAGGACTGTCTGATTTGG
CATTTCGAGAAAAATGGGCTCTTTTCTGTCAAAAGTGGATGCGTCGGTGAGGGCTGGTTGTGGTACGCAGGGGGCGGTTCTATTATGAGAAATGGGAGGGGCGAGGTGCT
GCTGTCAGCAGGCTTTGTTCTACCGAACTGTTGGAATGTTGATCTAGCTGAAGCGTGGGCAATGTTGAGGGGCATCGAAATTGCTCGTCAAATGGGCTTCTCTCGGTTCC
ATATGGAGACCGACTCCCTAAGATCGATCAGTTCGATTGTTTTCGATTTCACGGATTTCATAGATTTTATTTTGATCGATTGTTTTCTGATCGAGTATCGACGCCTCTTC
CGGTCGCCGGATTTTAGAGATCTCGCTGGAAAGAAAATTCGAAAGTTACAGTCGGTTTGGATTTCATCGAAAGGATCTTTTGGATCTCAGATTTTACCGAGAAGTGATAG
AAAATTTCTATCACTGATAGAAATTTGGGTGGGCTCAAAATTGCATCCCTCCGATGAAATCGAAATCCTCCGCCATCGCTGCCCGATCATCCAAATCCCTAATCCCTCAA
AAAATCAATATCACGTAGCTAATCCCAAACTCCGCTCTCTATTTAACCCCCCCGACTCTGTTGACCTTGGGATTGCCCGTCGGAAACTCGATCTCGAGGCGGAAGATCCA
AGCGTGGCTGCTTCACCATCGGGAAGAAGAGATGGCAATAAGTCTGGAGGAATCGAAATTCGAGCGACGATAAAGATCAGGAAGAAGAGGAAGGAGAAGTTGACTGAAAA
ACTTTCTATCATTTCTATCACTGATAGAAAAACTTTCTATCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGGAGTCTAAACGGATCCACTGGGTGAGCTGGGACTCTCTCTGTTGTCCTAAATGCCTTGGTGGACTTGGTTTCAGAGATATGAAGCTTTTTAACCAGGCTCTTTT
AACTAAGCAGTGTTGGCGCTTGTTCAGAGACCCATCCTCCTTCATGATCTCGGTTCACTTCGTTCCCTCCTTGCCGCTGACTAGTTGTGCTCGGGATCTGTTCTCTGATA
CAGGGCAATGGGATGGGCCGAAAATTTGGAGCCACTTTACTATTGCAGATAGTGAAGCCATTCTGAGAAGACCACTTGGGAATATGTTAATGGAGGACTGTCTGATTTGG
CATTTCGAGAAAAATGGGCTCTTTTCTGTCAAAAGTGGATGCGTCGGTGAGGGCTGGTTGTGGTACGCAGGGGGCGGTTCTATTATGAGAAATGGGAGGGGCGAGGTGCT
GCTGTCAGCAGGCTTTGTTCTACCGAACTGTTGGAATGTTGATCTAGCTGAAGCGTGGGCAATGTTGAGGGGCATCGAAATTGCTCGTCAAATGGGCTTCTCTCGGTTCC
ATATGGAGACCGACTCCCTAAGATCGATCAGTTCGATTGTTTTCGATTTCACGGATTTCATAGATTTTATTTTGATCGATTGTTTTCTGATCGAGTATCGACGCCTCTTC
CGGTCGCCGGATTTTAGAGATCTCGCTGGAAAGAAAATTCGAAAGTTACAGTCGGTTTGGATTTCATCGAAAGGATCTTTTGGATCTCAGATTTTACCGAGAAGTGATAG
AAAATTTCTATCACTGATAGAAATTTGGGTGGGCTCAAAATTGCATCCCTCCGATGAAATCGAAATCCTCCGCCATCGCTGCCCGATCATCCAAATCCCTAATCCCTCAA
AAAATCAATATCACGTAGCTAATCCCAAACTCCGCTCTCTATTTAACCCCCCCGACTCTGTTGACCTTGGGATTGCCCGTCGGAAACTCGATCTCGAGGCGGAAGATCCA
AGCGTGGCTGCTTCACCATCGGGAAGAAGAGATGGCAATAAGTCTGGAGGAATCGAAATTCGAGCGACGATAAAGATCAGGAAGAAGAGGAAGGAGAAGTTGACTGAAAA
ACTTTCTATCATTTCTATCACTGATAGAAAAACTTTCTATCAATGA
Protein sequenceShow/hide protein sequence
MGESKRIHWVSWDSLCCPKCLGGLGFRDMKLFNQALLTKQCWRLFRDPSSFMISVHFVPSLPLTSCARDLFSDTGQWDGPKIWSHFTIADSEAILRRPLGNMLMEDCLIW
HFEKNGLFSVKSGCVGEGWLWYAGGGSIMRNGRGEVLLSAGFVLPNCWNVDLAEAWAMLRGIEIARQMGFSRFHMETDSLRSISSIVFDFTDFIDFILIDCFLIEYRRLF
RSPDFRDLAGKKIRKLQSVWISSKGSFGSQILPRSDRKFLSLIEIWVGSKLHPSDEIEILRHRCPIIQIPNPSKNQYHVANPKLRSLFNPPDSVDLGIARRKLDLEAEDP
SVAASPSGRRDGNKSGGIEIRATIKIRKKRKEKLTEKLSIISITDRKTFYQ