CuGenDBv2

Lcy10g005120 (gene) of Sponge gourd (P93075) v1 genome

Gene ID: Lcy10g005120
Organism: Luffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: Chr10:14165297..14168668
RNA-Seq Expression: Lcy10g005120
Synteny: Lcy10g005120
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
  IPR044730 - Ribonuclease H-like domain, plant type
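
The genome location field above follows a `chromosome:start..end` pattern. As a minimal sketch, it could be split into its parts as shown here. The names `GenomeLocation` and `parse_location` are hypothetical helpers introduced for illustration, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'chrom:start..end' location string."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'Chr10:14165297..14168668'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.groups()
    return GenomeLocation(chrom, int(start), int(end))


loc = parse_location("Chr10:14165297..14168668")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span reported for this gene is `end - start + 1` bases long.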


Homology
GenBank top hits | e value | % identity | Alignment
KAF4347300.1 hypothetical protein G4B88_031301 [Cannabis sativa] | 1.0e-27 | 34.67
Query:  IQNFRDTLDSCGLLDLNSKGGIYTWCNRRQAGDQVSLRLDRFVANSAFLDCFTECIVNNLDWAKSDHKPIELRLEGAHIRPNFGK-----GFRFEECWSK
        +++F++ +D CGL+D  S     TWCN  Q+ +Q+  RLDR + N  +L CF    +  LDW +SDH+P+ + L  A      G+      F FEE W  
Subjt:  IQNFRDTLDSCGLLDLNSKGGIYTWCNRRQAGDQVSLRLDRFVANSAFLDCFTECIVNNLDWAKSDHKPIELRLEGAHIRPNFGK-----GFRFEECWSK

Query:  DEECLHIISRVGGWSDLGFRSQPLHLKLQKCAKALKGWGFRKNKTRWDGIRKQIGDGNSVCLLEDPWILRPYTFKVLGVKSYATGLKAADCILPSGQWDI
        D+EC  I+  + GW D  F       K     K L+G+       RW     ++GDGNS+ +LEDPW+ RP TFK+         L+ AD  L  G WD 
Subjt:  DEECLHIISRVGGWSDLGFRSQPLHLKLQKCAKALKGWGFRKNKTRWDGIRKQIGDGNSVCLLEDPWILRPYTFKVLGVKSYATGLKAADCILPSGQWDI

Query:  PKLSQFLLEKDDQELIRLPINLSIS
           ++F+     Q LI + I ++I+
Subjt:  PKLSQFLLEKDDQELIRLPINLSIS

KAF4349753.1 hypothetical protein G4B88_029501, partial [Cannabis sativa] | 2.3e-27 | 35.85
Query:  FRDTLDSCGLLDLNSKGGIYTWCNRRQAGDQVSLRLDRFVANSAFLDCFTECIVNNLDWAKSDHKPIELRLEGA--HIRPNFGKGFRFEECWSKDEECLH
        F+  LD C L+D+   G  +TW N+RQ    V  RLDRF  N  + + F    V   D+  SDH+P+   LE      + +  +GFRFE  W KDEEC  
Subjt:  FRDTLDSCGLLDLNSKGGIYTWCNRRQAGDQVSLRLDRFVANSAFLDCFTECIVNNLDWAKSDHKPIELRLEGA--HIRPNFGKGFRFEECWSKDEECLH

Query:  IISRVGGWSDLGFRSQPLHLKL-QKCAKALKGWGFRKNKTRW---------DGIRKQIGDGNSVCLLEDPWILRPYTFKVLGVKSYATGLKAADCILPSG
        I+ +     D    SQ   + +  +CA  L  W    NK ++          G+RK +GDG SV    DPW+ RP +F+ +   +   GL  +D I   G
Subjt:  IISRVGGWSDLGFRSQPLHLKL-QKCAKALKGWGFRKNKTRW---------DGIRKQIGDGNSVCLLEDPWILRPYTFKVLGVKSYATGLKAADCILPSG

Query:  QWDIPKLSQFLL
         WDIP LSQ+ L
Subjt:  QWDIPKLSQFLL

KAF4363712.1 hypothetical protein G4B88_030211, partial [Cannabis sativa] | 2.5e-26 | 34.09
Query:  IQNFRDTLDSCGLLDLNSKGGIYTWCNRRQAGDQVSLRLDRFVANSAFLDCFTECIVNNLDWAKSDHKPIELRLEGA--HIRPNFGKGFRFEECWSKDEE
        +  F+  LD C L+D+   G  +TW N+RQ    V  RLDRF  N  + D F    V   D+  SDH+P+   LE      + +  +GFRFE  W KDEE
Subjt:  IQNFRDTLDSCGLLDLNSKGGIYTWCNRRQAGDQVSLRLDRFVANSAFLDCFTECIVNNLDWAKSDHKPIELRLEGA--HIRPNFGKGFRFEECWSKDEE

Query:  CLHIISRVGGWSDLGFRSQPLHLKL-QKCAKALKGWGFRKNKTRWDGIRKQI-----------GDGNSVCLLEDPWILRPYTFKVLGVKSYATGLKAADC
        C  I+ +     D    SQ   + +  +CA  L  W    NK+++  I K +            DG SV    DPW+ RP +F+ +   +   G+  +D 
Subjt:  CLHIISRVGGWSDLGFRSQPLHLKL-QKCAKALKGWGFRKNKTRWDGIRKQI-----------GDGNSVCLLEDPWILRPYTFKVLGVKSYATGLKAADC

Query:  ILPSGQWDIPKLSQFLLEKD
        I   G WDIP LSQ+ L  D
Subjt:  ILPSGQWDIPKLSQFLLEKD

KAF4386840.1 hypothetical protein F8388_006795 [Cannabis sativa] | 6.8e-32 | 32.79
Query:  LIQNFRDTLDSCGLLDLNSKGGIYTWCNRRQAGDQVSLRLDRFVANSAFLDCFTECIVNNLDWAKSDHKPI----ELRLEGAHIRPNFGK-GFRFEECWS
        ++  FR+ +D C L+D +S     TWCN   +  ++  RLDR +    +L  F    ++ LDW +SDH+ +     +R++GA    +  K  F FEE W 
Subjt:  LIQNFRDTLDSCGLLDLNSKGGIYTWCNRRQAGDQVSLRLDRFVANSAFLDCFTECIVNNLDWAKSDHKPI----ELRLEGAHIRPNFGK-GFRFEECWS

Query:  KDEECLHIISRVGGWSDLGFRSQP--LHLKLQKCAKALKGWGFRK-----------NKTRWDGIRKQIGDGNSVCLLEDPWILRPYTFKVLGVKSYATGL
        ++EEC  I+ RV  W D   R +P  +  K+ KC KA  GW  +K            K   +G R +IG+G+SV +LEDPW+ RP TFKV         L
Subjt:  KDEECLHIISRVGGWSDLGFRSQP--LHLKLQKCAKALKGWGFRK-----------NKTRWDGIRKQIGDGNSVCLLEDPWILRPYTFKVLGVKSYATGL

Query:  KAADCILPSGQWDIPKLSQFLLEKDDQELIRLPIN-LSISDRWIWHF
           D   P+G WD   +       D   ++++P +   I D+ +WH+
Subjt:  KAADCILPSGQWDIPKLSQFLLEKDDQELIRLPIN-LSISDRWIWHF

KAF4392222.1 hypothetical protein F8388_012678 [Cannabis sativa] | 1.4e-24 | 28.92
Query:  IQNFRDTLDSCGLLDLNSKGGIYTWCNRRQAGDQVSLRLDRFVANSAFLDCFTECIVNNLDWAKSDHKPIELRLEGAHIRPNFGKG-----FRFEECWSK
        ++ F+  LD C L+D +     +TWCN  ++ + V  RLDR + N  + D F    V  LDW +SDH+P+ + +         GK      F FEE W +
Subjt:  IQNFRDTLDSCGLLDLNSKGGIYTWCNRRQAGDQVSLRLDRFVANSAFLDCFTECIVNNLDWAKSDHKPIELRLEGAHIRPNFGKG-----FRFEECWSK

Query:  DEECLHIISRVGGWSDLGFRSQP------LHLKLQKCAKALKGWGFRK--------NKTRWD----GIRKQIGDGNSVCLLEDPWILRPYTFKVLGVKSY
        ++EC  I+   G W       +P       H K  +  K L GW  ++         K + D    G R ++G+G  V ++EDPW+  P +FK+      
Subjt:  DEECLHIISRVGGWSDLGFRSQP------LHLKLQKCAKALKGWGFRK--------NKTRWD----GIRKQIGDGNSVCLLEDPWILRPYTFKVLGVKSY

Query:  ATGLKAADCILPSGQWDIPKLSQFLLEKDDQELIRLP-INLSISDRWIW
           L   D  LP+G+WD   +     ++D + +I LP +   + D+ +W
Subjt:  ATGLKAADCILPSGQWDIPKLSQFLLEKDDQELIRLP-INLSISDRWIW

TrEMBL top hits | e value | % identity | Alignment
A0A7J6DN93 CCHC-type domain-containing protein | 4.9e-28 | 34.67
Query:  IQNFRDTLDSCGLLDLNSKGGIYTWCNRRQAGDQVSLRLDRFVANSAFLDCFTECIVNNLDWAKSDHKPIELRLEGAHIRPNFGK-----GFRFEECWSK
        +++F++ +D CGL+D  S     TWCN  Q+ +Q+  RLDR + N  +L CF    +  LDW +SDH+P+ + L  A      G+      F FEE W  
Subjt:  IQNFRDTLDSCGLLDLNSKGGIYTWCNRRQAGDQVSLRLDRFVANSAFLDCFTECIVNNLDWAKSDHKPIELRLEGAHIRPNFGK-----GFRFEECWSK

Query:  DEECLHIISRVGGWSDLGFRSQPLHLKLQKCAKALKGWGFRKNKTRWDGIRKQIGDGNSVCLLEDPWILRPYTFKVLGVKSYATGLKAADCILPSGQWDI
        D+EC  I+  + GW D  F       K     K L+G+       RW     ++GDGNS+ +LEDPW+ RP TFK+         L+ AD  L  G WD 
Subjt:  DEECLHIISRVGGWSDLGFRSQPLHLKLQKCAKALKGWGFRKNKTRWDGIRKQIGDGNSVCLLEDPWILRPYTFKVLGVKSYATGLKAADCILPSGQWDI

Query:  PKLSQFLLEKDDQELIRLPINLSIS
           ++F+     Q LI + I ++I+
Subjt:  PKLSQFLLEKDDQELIRLPINLSIS

A0A7J6DUI7 Uncharacterized protein (Fragment) | 1.1e-27 | 35.85
Query:  FRDTLDSCGLLDLNSKGGIYTWCNRRQAGDQVSLRLDRFVANSAFLDCFTECIVNNLDWAKSDHKPIELRLEGA--HIRPNFGKGFRFEECWSKDEECLH
        F+  LD C L+D+   G  +TW N+RQ    V  RLDRF  N  + + F    V   D+  SDH+P+   LE      + +  +GFRFE  W KDEEC  
Subjt:  FRDTLDSCGLLDLNSKGGIYTWCNRRQAGDQVSLRLDRFVANSAFLDCFTECIVNNLDWAKSDHKPIELRLEGA--HIRPNFGKGFRFEECWSKDEECLH

Query:  IISRVGGWSDLGFRSQPLHLKL-QKCAKALKGWGFRKNKTRW---------DGIRKQIGDGNSVCLLEDPWILRPYTFKVLGVKSYATGLKAADCILPSG
        I+ +     D    SQ   + +  +CA  L  W    NK ++          G+RK +GDG SV    DPW+ RP +F+ +   +   GL  +D I   G
Subjt:  IISRVGGWSDLGFRSQPLHLKL-QKCAKALKGWGFRKNKTRW---------DGIRKQIGDGNSVCLLEDPWILRPYTFKVLGVKSYATGLKAADCILPSG

Query:  QWDIPKLSQFLL
         WDIP LSQ+ L
Subjt:  QWDIPKLSQFLL

A0A7J6EZ57 Uncharacterized protein | 1.2e-26 | 34.09
Query:  IQNFRDTLDSCGLLDLNSKGGIYTWCNRRQAGDQVSLRLDRFVANSAFLDCFTECIVNNLDWAKSDHKPIELRLEGA--HIRPNFGKGFRFEECWSKDEE
        +  F+  LD C L+D+   G  +TW N+RQ    V  RLDRF  N  + D F    V   D+  SDH+P+   LE      + +  +GFRFE  W KDEE
Subjt:  IQNFRDTLDSCGLLDLNSKGGIYTWCNRRQAGDQVSLRLDRFVANSAFLDCFTECIVNNLDWAKSDHKPIELRLEGA--HIRPNFGKGFRFEECWSKDEE

Query:  CLHIISRVGGWSDLGFRSQPLHLKL-QKCAKALKGWGFRKNKTRWDGIRKQI-----------GDGNSVCLLEDPWILRPYTFKVLGVKSYATGLKAADC
        C  I+ +     D    SQ   + +  +CA  L  W    NK+++  I K +            DG SV    DPW+ RP +F+ +   +   G+  +D 
Subjt:  CLHIISRVGGWSDLGFRSQPLHLKL-QKCAKALKGWGFRKNKTRWDGIRKQI-----------GDGNSVCLLEDPWILRPYTFKVLGVKSYATGLKAADC

Query:  ILPSGQWDIPKLSQFLLEKD
        I   G WDIP LSQ+ L  D
Subjt:  ILPSGQWDIPKLSQFLLEKD

A0A7J6GWY1 Uncharacterized protein | 3.3e-32 | 32.79
Query:  LIQNFRDTLDSCGLLDLNSKGGIYTWCNRRQAGDQVSLRLDRFVANSAFLDCFTECIVNNLDWAKSDHKPI----ELRLEGAHIRPNFGK-GFRFEECWS
        ++  FR+ +D C L+D +S     TWCN   +  ++  RLDR +    +L  F    ++ LDW +SDH+ +     +R++GA    +  K  F FEE W 
Subjt:  LIQNFRDTLDSCGLLDLNSKGGIYTWCNRRQAGDQVSLRLDRFVANSAFLDCFTECIVNNLDWAKSDHKPI----ELRLEGAHIRPNFGK-GFRFEECWS

Query:  KDEECLHIISRVGGWSDLGFRSQP--LHLKLQKCAKALKGWGFRK-----------NKTRWDGIRKQIGDGNSVCLLEDPWILRPYTFKVLGVKSYATGL
        ++EEC  I+ RV  W D   R +P  +  K+ KC KA  GW  +K            K   +G R +IG+G+SV +LEDPW+ RP TFKV         L
Subjt:  KDEECLHIISRVGGWSDLGFRSQP--LHLKLQKCAKALKGWGFRK-----------NKTRWDGIRKQIGDGNSVCLLEDPWILRPYTFKVLGVKSYATGL

Query:  KAADCILPSGQWDIPKLSQFLLEKDDQELIRLPIN-LSISDRWIWHF
           D   P+G WD   +       D   ++++P +   I D+ +WH+
Subjt:  KAADCILPSGQWDIPKLSQFLLEKDDQELIRLPIN-LSISDRWIWHF

A0A7J6HAF6 Uncharacterized protein | 6.6e-25 | 28.92
Query:  IQNFRDTLDSCGLLDLNSKGGIYTWCNRRQAGDQVSLRLDRFVANSAFLDCFTECIVNNLDWAKSDHKPIELRLEGAHIRPNFGKG-----FRFEECWSK
        ++ F+  LD C L+D +     +TWCN  ++ + V  RLDR + N  + D F    V  LDW +SDH+P+ + +         GK      F FEE W +
Subjt:  IQNFRDTLDSCGLLDLNSKGGIYTWCNRRQAGDQVSLRLDRFVANSAFLDCFTECIVNNLDWAKSDHKPIELRLEGAHIRPNFGKG-----FRFEECWSK

Query:  DEECLHIISRVGGWSDLGFRSQP------LHLKLQKCAKALKGWGFRK--------NKTRWD----GIRKQIGDGNSVCLLEDPWILRPYTFKVLGVKSY
        ++EC  I+   G W       +P       H K  +  K L GW  ++         K + D    G R ++G+G  V ++EDPW+  P +FK+      
Subjt:  DEECLHIISRVGGWSDLGFRSQP------LHLKLQKCAKALKGWGFRK--------NKTRWD----GIRKQIGDGNSVCLLEDPWILRPYTFKVLGVKSY

Query:  ATGLKAADCILPSGQWDIPKLSQFLLEKDDQELIRLP-INLSISDRWIW
           L   D  LP+G+WD   +     ++D + +I LP +   + D+ +W
Subjt:  ATGLKAADCILPSGQWDIPKLSQFLLEKDDQELIRLP-INLSISDRWIW

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT1G43760.1 DNAse I-like superfamily protein | 1.0e-06 | 30.2
Query:  IQNFRDTLDSCGLLDLNSKGGIYTWCNRRQAGDQVSLRLDRFVANSAFLDCFTECIVNNLDWAKSDHKPIELRLEGAHIRPNFGKGFRFEECWSKDEECL
        ++ F++ L    L+D+ S+G  YTW N  Q  + +  +LDR +AN  +   F   I        SDH P  + LE    R    K FR+    S     L
Subjt:  IQNFRDTLDSCGLLDLNSKGGIYTWCNRRQAGDQVSLRLDRFVANSAFLDCFTECIVNNLDWAKSDHKPIELRLEGAHIRPNFGKGFRFEECWSKDEECL

Query:  HIISRVGGWSD---LGFRSQPL--HLK-LQKCAKAL--KGWGFRKNKTR
          +S    W +   +G     L  HLK  +KC K L  +G+G  ++KT+
Subjt:  HIISRVGGWSD---LGFRSQPL--HLK-LQKCAKAL--KGWGFRKNKTR


Sequences
CDS sequence
ATGTGCCTAATTCAAAATTTTCGTGATACTCTTGATTCTTGTGGCCTTTTGGATTTGAATTCCAAGGGGGGGATTTACACTTGGTGCAACCGGCGACAAGCAGGGGATCA
AGTTAGTCTTCGCCTTGATCGCTTCGTGGCTAATTCAGCTTTCTTGGATTGTTTTACAGAGTGTATTGTCAATAATCTTGATTGGGCCAAGTCAGACCATAAACCGATTG
AGCTTCGCCTGGAGGGAGCTCATATCCGGCCGAATTTCGGAAAGGGGTTTAGATTTGAAGAGTGTTGGTCCAAAGATGAAGAATGTCTCCACATTATTTCTCGGGTTGGG
GGATGGTCAGATTTGGGATTTAGATCCCAACCTCTCCATTTGAAACTGCAAAAATGTGCAAAAGCGCTCAAGGGTTGGGGTTTCAGGAAGAACAAGACAAGGTGGGATGG
AATTCGTAAACAAATTGGTGATGGTAATTCGGTCTGTCTGTTGGAGGACCCATGGATTCTTCGTCCTTATACTTTTAAAGTGCTTGGTGTTAAGAGCTACGCAACAGGAT
TGAAGGCAGCTGACTGTATCCTGCCATCTGGTCAGTGGGACATTCCAAAGCTTTCTCAATTCCTTTTGGAGAAGGATGATCAGGAGCTTATTCGACTGCCTATTAATTTG
TCGATTTCGGATAGATGGATTTGGCATTTTGATAAATATGAGGATACTTCTCATGCTCTTTTTATGTGTTCTAGAGAGTTAGAAGACTGGACAACATTGGGTTATCGGGA
GATGGTAAGAATAGACATCAGTATGGATTTTAAGGATCGGTGGCTTGACATTAGCAATAATGTTTCAGGAATGGTCCTTGAGCGGATTTGTGTGGCTGTCTGGACTATCT
GGAATGATCGGAATAGTGTGTTTCATAAGTGTCCAATCCCCTCGGTGGGGGTTCAATGTGATTGGATTTTGGAGTGTCTGTCTGAATACCAAACAGTTCATAATTCCGGT
GGTCGTAGAATCCAATCAAGGGATTCGATTTCTGAAATGATCTCAGGAGGGGAGGATATTATTTTGCATGTTGATGCAGCATGGTTGAAGCACAACAGATGTGGTGGTGT
GGGGGCAGTTCTGCACACAAAGTCCGGTAAGTTGGTGGCTATGATGCAGAAAAGGATTGCGTTTCCCCCATCTCCATTATGTGCTGAGGCGTTAGCAGTTCTTGAGGGTC
TTAAAATGACTTCTCTAAGGAATATTCGGAAAATTACGGTGTGCTCGGACTCCTTTTAG
mRNA sequence
ATGTGCCTAATTCAAAATTTTCGTGATACTCTTGATTCTTGTGGCCTTTTGGATTTGAATTCCAAGGGGGGGATTTACACTTGGTGCAACCGGCGACAAGCAGGGGATCA
AGTTAGTCTTCGCCTTGATCGCTTCGTGGCTAATTCAGCTTTCTTGGATTGTTTTACAGAGTGTATTGTCAATAATCTTGATTGGGCCAAGTCAGACCATAAACCGATTG
AGCTTCGCCTGGAGGGAGCTCATATCCGGCCGAATTTCGGAAAGGGGTTTAGATTTGAAGAGTGTTGGTCCAAAGATGAAGAATGTCTCCACATTATTTCTCGGGTTGGG
GGATGGTCAGATTTGGGATTTAGATCCCAACCTCTCCATTTGAAACTGCAAAAATGTGCAAAAGCGCTCAAGGGTTGGGGTTTCAGGAAGAACAAGACAAGGTGGGATGG
AATTCGTAAACAAATTGGTGATGGTAATTCGGTCTGTCTGTTGGAGGACCCATGGATTCTTCGTCCTTATACTTTTAAAGTGCTTGGTGTTAAGAGCTACGCAACAGGAT
TGAAGGCAGCTGACTGTATCCTGCCATCTGGTCAGTGGGACATTCCAAAGCTTTCTCAATTCCTTTTGGAGAAGGATGATCAGGAGCTTATTCGACTGCCTATTAATTTG
TCGATTTCGGATAGATGGATTTGGCATTTTGATAAATATGAGGATACTTCTCATGCTCTTTTTATGTGTTCTAGAGAGTTAGAAGACTGGACAACATTGGGTTATCGGGA
GATGGTAAGAATAGACATCAGTATGGATTTTAAGGATCGGTGGCTTGACATTAGCAATAATGTTTCAGGAATGGTCCTTGAGCGGATTTGTGTGGCTGTCTGGACTATCT
GGAATGATCGGAATAGTGTGTTTCATAAGTGTCCAATCCCCTCGGTGGGGGTTCAATGTGATTGGATTTTGGAGTGTCTGTCTGAATACCAAACAGTTCATAATTCCGGT
GGTCGTAGAATCCAATCAAGGGATTCGATTTCTGAAATGATCTCAGGAGGGGAGGATATTATTTTGCATGTTGATGCAGCATGGTTGAAGCACAACAGATGTGGTGGTGT
GGGGGCAGTTCTGCACACAAAGTCCGGTAAGTTGGTGGCTATGATGCAGAAAAGGATTGCGTTTCCCCCATCTCCATTATGTGCTGAGGCGTTAGCAGTTCTTGAGGGTC
TTAAAATGACTTCTCTAAGGAATATTCGGAAAATTACGGTGTGCTCGGACTCCTTTTAG
Protein sequence
MCLIQNFRDTLDSCGLLDLNSKGGIYTWCNRRQAGDQVSLRLDRFVANSAFLDCFTECIVNNLDWAKSDHKPIELRLEGAHIRPNFGKGFRFEECWSKDEECLHIISRVG
GWSDLGFRSQPLHLKLQKCAKALKGWGFRKNKTRWDGIRKQIGDGNSVCLLEDPWILRPYTFKVLGVKSYATGLKAADCILPSGQWDIPKLSQFLLEKDDQELIRLPINL
SISDRWIWHFDKYEDTSHALFMCSRELEDWTTLGYREMVRIDISMDFKDRWLDISNNVSGMVLERICVAVWTIWNDRNSVFHKCPIPSVGVQCDWILECLSEYQTVHNSG
GRRIQSRDSISEMISGGEDIILHVDAAWLKHNRCGGVGAVLHTKSGKLVAMMQKRIAFPPSPLCAEALAVLEGLKMTSLRNIRKITVCSDSF
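
The protein sequence above should correspond to a translation of the CDS sequence. As a rough sketch of how that could be checked, the snippet below translates a coding sequence with the standard genetic code. It assumes the standard nuclear codon table applies to this gene. The function name `translate` and the `CODON_TABLE` constant are hypothetical names chosen for illustration and are not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' marks unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS listed in this record.
print(translate("ATGTGCCTAATTCAAAATTTTCGTGAT"))  # MCLIQNFRD
```

Pasting the full CDS into `translate` and comparing the result with the listed protein sequence would be one way to confirm that the two entries agree.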