; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0034219 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0034219
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr3:5338452..5339054
RNA-Seq ExpressionLag0034219
SyntenyLag0034219
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_013694754.1 uncharacterized protein LOC106398790 [Brassica napus]1.0e-0931.11Show/hide
Query:  PPNSPRWKFNSDASWNAEKKIGGLGWVVRDFGGSLVCAGLQTSREEWSIDVLEAKAMWEGLKSVLDLVGEAKEIEVEADALDLICCLNNQEENLTEIKCI
        PP   + K N+DA+W  E ++GG+GWV+RD  G ++ AG +   +  S+   E +A+   ++++       + +  E+D+  L+  LNN+EE    +K +
Subjt:  PPNSPRWKFNSDASWNAEKKIGGLGWVVRDFGGSLVCAGLQTSREEWSIDVLEAKAMWEGLKSVLDLVGEAKEIEVEADALDLICCLNNQEENLTEIKCI

Query:  VEAINNTLA--PKLGVLFFKHCPRSQNGVAHSIAR
        ++AI+++ +   K+ V ++   PRS N +AH IA+
Subjt:  VEAINNTLA--PKLGVLFFKHCPRSQNGVAHSIAR

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia]2.0e-1334.59Show/hide
Query:  PPNSPRWKFNSDASWNAEKKIGGLGWVVRDFGGSLVCAGLQTSREEWSIDVLEAKAMWEGLKSVLDLVGEAKEIEVEADALDLICCLNNQEENLTEIKCI
        PP S  WK N++A+W A+   GG+GW++RD  G ++ A  +  R E +I  LE  A+ EGL+++       + I +E+D+L+ I  L+ Q ++ TEI  +
Subjt:  PPNSPRWKFNSDASWNAEKKIGGLGWVVRDFGGSLVCAGLQTSREEWSIDVLEAKAMWEGLKSVLDLVGEAKEIEVEADALDLICCLNNQEENLTEIKCI

Query:  VEAINNTLAPKLGVLFFKHCPRSQNGVAHSIAR
        +E I   +   + ++  +H  R  N VAH +AR
Subjt:  VEAINNTLAPKLGVLFFKHCPRSQNGVAHSIAR

XP_022148737.1 uncharacterized protein LOC111017329 [Momordica charantia]1.8e-1133.08Show/hide
Query:  WKFNSDASWNAEKKIGGLGWVVRDFGGSLVCAGLQTSREEWSIDVLEAKAMWEGLKSVLDLVGEAKEIEVEADALDLICCLNNQEENLTEIKCIVEAINN
        WK N+DA+W+A + +GGLGW++R+    +  AG QT  +   I  LE  A+W G+++V+     +  + +E+++L+ I  +   ++N+TEI  +V+ I N
Subjt:  WKFNSDASWNAEKKIGGLGWVVRDFGGSLVCAGLQTSREEWSIDVLEAKAMWEGLKSVLDLVGEAKEIEVEADALDLICCLNNQEENLTEIKCIVEAINN

Query:  TLAPKLGVLFFKHCPRSQNGVAHSIARAGA
            +  +  F+H  R  N VA  IA   A
Subjt:  TLAPKLGVLFFKHCPRSQNGVAHSIARAGA

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]3.3e-1335.34Show/hide
Query:  PPNSPRWKFNSDASWNAEKKIGGLGWVVRDFGGSLVCAGLQTSREEWSIDVLEAKAMWEGLKSVLDLVGEAKEIEVEADALDLICCLNNQEENLTEIKCI
        PP    W  N+DASW+     GG+GW++R + G +V AG +      ++ +LEA A+ EGL+++ +L G  + + +E D+ ++   LN + E+LT+   +
Subjt:  PPNSPRWKFNSDASWNAEKKIGGLGWVVRDFGGSLVCAGLQTSREEWSIDVLEAKAMWEGLKSVLDLVGEAKEIEVEADALDLICCLNNQEENLTEIKCI

Query:  VEAINNTLAPKLGVLFFKHCPRSQNGVAHSIAR
        VE I N L     +L F    R  NG AHS+A+
Subjt:  VEAINNTLAPKLGVLFFKHCPRSQNGVAHSIAR

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia]7.4e-1333.81Show/hide
Query:  PPNSPRWKFNSDASWNAEKKIGGLGWVVRDFGGSLVCAGLQTSREEWSIDVLEAKAMWEGLKSVLD------LVGEAKEIEVEADALDLICCLNNQEENL
        PP S  WK N+DA+W A+   GG+GW++RD  G ++ A  +  R E +I  LE  A+ EGL+++             + I +E+D+L+ I  L+ Q ++ 
Subjt:  PPNSPRWKFNSDASWNAEKKIGGLGWVVRDFGGSLVCAGLQTSREEWSIDVLEAKAMWEGLKSVLD------LVGEAKEIEVEADALDLICCLNNQEENL

Query:  TEIKCIVEAINNTLAPKLGVLFFKHCPRSQNGVAHSIAR
        TEI  ++E I   +   + ++  +H  R  N VAH +AR
Subjt:  TEIKCIVEAINNTLAPKLGVLFFKHCPRSQNGVAHSIAR

TrEMBL top hitse value%identityAlignment
A0A3P6ESY2 RNase H domain-containing protein (Fragment)4.1e-0932.41Show/hide
Query:  PPNSPRWKFNSDASWNAEKKIGGLGWVVRDFGGSLVCAGLQTSREEWSIDVLEAKAMWEGLKSVLDLVGEAKEIEVEADALDLICCLNNQEENLTEIKCI
        PP     K N+DA+W  E++ GG GW++RD  G L+ AG +   E  S    EA+A+W  L++V     +   +++E D+L L+  +N +EE    ++ I
Subjt:  PPNSPRWKFNSDASWNAEKKIGGLGWVVRDFGGSLVCAGLQTSREEWSIDVLEAKAMWEGLKSVLDLVGEAKEIEVEADALDLICCLNNQEENLTEIKCI

Query:  VEAINNTLA--PKLGVLFFKHCPRSQNGVAHSIAR-AGAFHSCAP
        ++ I   ++   ++ V+++   PRS N  A  IAR    F S  P
Subjt:  VEAINNTLA--PKLGVLFFKHCPRSQNGVAHSIAR-AGAFHSCAP

A0A6J1CP26 uncharacterized protein LOC1110134129.5e-1434.59Show/hide
Query:  PPNSPRWKFNSDASWNAEKKIGGLGWVVRDFGGSLVCAGLQTSREEWSIDVLEAKAMWEGLKSVLDLVGEAKEIEVEADALDLICCLNNQEENLTEIKCI
        PP S  WK N++A+W A+   GG+GW++RD  G ++ A  +  R E +I  LE  A+ EGL+++       + I +E+D+L+ I  L+ Q ++ TEI  +
Subjt:  PPNSPRWKFNSDASWNAEKKIGGLGWVVRDFGGSLVCAGLQTSREEWSIDVLEAKAMWEGLKSVLDLVGEAKEIEVEADALDLICCLNNQEENLTEIKCI

Query:  VEAINNTLAPKLGVLFFKHCPRSQNGVAHSIAR
        +E I   +   + ++  +H  R  N VAH +AR
Subjt:  VEAINNTLAPKLGVLFFKHCPRSQNGVAHSIAR

A0A6J1D5W1 uncharacterized protein LOC1110173298.9e-1233.08Show/hide
Query:  WKFNSDASWNAEKKIGGLGWVVRDFGGSLVCAGLQTSREEWSIDVLEAKAMWEGLKSVLDLVGEAKEIEVEADALDLICCLNNQEENLTEIKCIVEAINN
        WK N+DA+W+A + +GGLGW++R+    +  AG QT  +   I  LE  A+W G+++V+     +  + +E+++L+ I  +   ++N+TEI  +V+ I N
Subjt:  WKFNSDASWNAEKKIGGLGWVVRDFGGSLVCAGLQTSREEWSIDVLEAKAMWEGLKSVLDLVGEAKEIEVEADALDLICCLNNQEENLTEIKCIVEAINN

Query:  TLAPKLGVLFFKHCPRSQNGVAHSIARAGA
            +  +  F+H  R  N VA  IA   A
Subjt:  TLAPKLGVLFFKHCPRSQNGVAHSIARAGA

A0A6J1DNV9 uncharacterized protein LOC1110224031.6e-1335.34Show/hide
Query:  PPNSPRWKFNSDASWNAEKKIGGLGWVVRDFGGSLVCAGLQTSREEWSIDVLEAKAMWEGLKSVLDLVGEAKEIEVEADALDLICCLNNQEENLTEIKCI
        PP    W  N+DASW+     GG+GW++R + G +V AG +      ++ +LEA A+ EGL+++ +L G  + + +E D+ ++   LN + E+LT+   +
Subjt:  PPNSPRWKFNSDASWNAEKKIGGLGWVVRDFGGSLVCAGLQTSREEWSIDVLEAKAMWEGLKSVLDLVGEAKEIEVEADALDLICCLNNQEENLTEIKCI

Query:  VEAINNTLAPKLGVLFFKHCPRSQNGVAHSIAR
        VE I N L     +L F    R  NG AHS+A+
Subjt:  VEAINNTLAPKLGVLFFKHCPRSQNGVAHSIAR

A0A6J1DSV1 uncharacterized protein LOC1110236083.6e-1333.81Show/hide
Query:  PPNSPRWKFNSDASWNAEKKIGGLGWVVRDFGGSLVCAGLQTSREEWSIDVLEAKAMWEGLKSVLD------LVGEAKEIEVEADALDLICCLNNQEENL
        PP S  WK N+DA+W A+   GG+GW++RD  G ++ A  +  R E +I  LE  A+ EGL+++             + I +E+D+L+ I  L+ Q ++ 
Subjt:  PPNSPRWKFNSDASWNAEKKIGGLGWVVRDFGGSLVCAGLQTSREEWSIDVLEAKAMWEGLKSVLD------LVGEAKEIEVEADALDLICCLNNQEENL

Query:  TEIKCIVEAINNTLAPKLGVLFFKHCPRSQNGVAHSIAR
        TEI  ++E I   +   + ++  +H  R  N VAH +AR
Subjt:  TEIKCIVEAINNTLAPKLGVLFFKHCPRSQNGVAHSIAR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein6.1e-0530.37Show/hide
Query:  PPNSPRWKFNSDASWNAEKKIGGLGWVVRDFGGSLVCAGLQTSREEWSIDVLEAKAM-WEGLKSVLDLVG-EAKEIEVEADALDLICCLNNQEENLTEIK
        PP     K N+DA+WN + +  G+GWV+R+  G +   G +   +  S+   E +AM W    +VL L   +   +  E+D+  LI  LNN +E    +K
Subjt:  PPNSPRWKFNSDASWNAEKKIGGLGWVVRDFGGSLVCAGLQTSREEWSIDVLEAKAM-WEGLKSVLDLVG-EAKEIEVEADALDLICCLNNQEENLTEIK

Query:  CIVEAINNTLAPKLGVLFFKHCPRSQNGVAHSIAR
          ++ +   L+    V F    PR  N +A  +AR
Subjt:  CIVEAINNTLAPKLGVLFFKHCPRSQNGVAHSIAR

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.4e-0423.49Show/hide
Query:  PPNSPRWKFNSDASWNAEKKIGGLGWVVRDFGGSLVCAGLQTSREEWSIDVLEAKAMWEGLKSVLDLVGEAKEIEVEADALDLICCLNNQEENLTEIKCI
        PP   + K N DAS +    + GLGW++R+  G+++  G+   +   + +  E   +   +++        K++  E D   +   +N +  N   ++  
Subjt:  PPNSPRWKFNSDASWNAEKKIGGLGWVVRDFGGSLVCAGLQTSREEWSIDVLEAKAMWEGLKSVLDLVGEAKEIEVEADALDLICCLNNQEENLTEIKCI

Query:  VEAINNTLAPKLGVLF-FKHCPRSQNGVAHSIARA--------GAFHSC
        ++ I + +     + F FKH  R QNG A  +A+           FHSC
Subjt:  VEAINNTLAPKLGVLF-FKHCPRSQNGVAHSIARA--------GAFHSC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATATATATATATATACTACTTTGCTAATACCCCCTAACTCCCCTCGATGGAAGTTCAATTCAGACGCGTCCTGGAATGCGGAGAAGAAGATCGGAGGGTTAGGGTG
GGTAGTTCGTGATTTCGGTGGTTCTTTGGTCTGTGCGGGCTTGCAGACATCTAGAGAGGAGTGGTCGATCGATGTTCTTGAGGCCAAAGCCATGTGGGAAGGGCTTAAGT
CTGTGCTTGACTTGGTCGGGGAGGCGAAAGAAATTGAAGTAGAAGCGGATGCTCTTGATCTCATCTGCTGTTTGAACAATCAAGAGGAAAATTTGACAGAGATCAAGTGT
ATTGTTGAAGCCATTAACAACACTTTGGCTCCAAAGCTTGGAGTGCTCTTCTTCAAGCATTGCCCTAGAAGTCAGAATGGGGTGGCTCACTCCATCGCTCGTGCAGGCGC
TTTTCATTCTTGTGCTCCTGGTTTTTGTAATTCTGATTCTTTTGCTGGGGATCAGAGGAATCCTTCCACGCTGGAAGGCAGTTATTGTTTTTGGTCCTCTGATCCTCTAG
AGTGGTTATCCTCCTTGATTTCTAAGGAGTTGGTTGTACTTGACTCTTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATATATATATATATACTACTTTGCTAATACCCCCTAACTCCCCTCGATGGAAGTTCAATTCAGACGCGTCCTGGAATGCGGAGAAGAAGATCGGAGGGTTAGGGTG
GGTAGTTCGTGATTTCGGTGGTTCTTTGGTCTGTGCGGGCTTGCAGACATCTAGAGAGGAGTGGTCGATCGATGTTCTTGAGGCCAAAGCCATGTGGGAAGGGCTTAAGT
CTGTGCTTGACTTGGTCGGGGAGGCGAAAGAAATTGAAGTAGAAGCGGATGCTCTTGATCTCATCTGCTGTTTGAACAATCAAGAGGAAAATTTGACAGAGATCAAGTGT
ATTGTTGAAGCCATTAACAACACTTTGGCTCCAAAGCTTGGAGTGCTCTTCTTCAAGCATTGCCCTAGAAGTCAGAATGGGGTGGCTCACTCCATCGCTCGTGCAGGCGC
TTTTCATTCTTGTGCTCCTGGTTTTTGTAATTCTGATTCTTTTGCTGGGGATCAGAGGAATCCTTCCACGCTGGAAGGCAGTTATTGTTTTTGGTCCTCTGATCCTCTAG
AGTGGTTATCCTCCTTGATTTCTAAGGAGTTGGTTGTACTTGACTCTTCTTAA
Protein sequenceShow/hide protein sequence
MNIYIYTTLLIPPNSPRWKFNSDASWNAEKKIGGLGWVVRDFGGSLVCAGLQTSREEWSIDVLEAKAMWEGLKSVLDLVGEAKEIEVEADALDLICCLNNQEENLTEIKC
IVEAINNTLAPKLGVLFFKHCPRSQNGVAHSIARAGAFHSCAPGFCNSDSFAGDQRNPSTLEGSYCFWSSDPLEWLSSLISKELVVLDSS