; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022160 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022160
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRNase H domain-containing protein
Genome locationchr7:19876081..19876797
RNA-Seq ExpressionLag0022160
SyntenyLag0022160
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143317.1 uncharacterized protein LOC111013216 [Momordica charantia]2.2e-1140.38Show/hide
Query:  LQWEPPEPDYWKLNVDASWSEREESGGLGWAIRDSNGSLIGAGCKKISNMWNIKCLEEEAIIEGLKAYEDNGDFEGSRCKPPLVVESDSVEVVEAMNRAS
        ++W  P  + WKLN DASWSE  E GG+GW + D  G ++ AG  KI     I  LE   II GL+       F   + + P+ +ESDSVEV+  M +  
Subjt:  LQWEPPEPDYWKLNVDASWSEREESGGLGWAIRDSNGSLIGAGCKKISNMWNIKCLEEEAIIEGLKAYEDNGDFEGSRCKPPLVVESDSVEVVEAMNRAS

Query:  EDAT
         D T
Subjt:  EDAT

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia]1.0e-1636.55Show/hide
Query:  QWEPPEPDYWKLNVDASWSEREESGGLGWAIRDSNGSLIGAGCKKISNMWNIKCLEEEAIIEGLKAYEDNGDFEGSRCKPPLVVESDSVEVVEAMNRASE
        QW+PP  + WKLN +A+W     +GG+GW +RD  G +I A C+ I    NI  LE  AI EGL+A           C+ P+ +ESDS+E +  ++R  +
Subjt:  QWEPPEPDYWKLNVDASWSEREESGGLGWAIRDSNGSLIGAGCKKISNMWNIKCLEEEAIIEGLKAYEDNGDFEGSRCKPPLVVESDSVEVVEAMNRASE

Query:  DATELCLFVDEIDRFRGPDQAKFFSKCSRQSNSLAHELERAASRN
        D TE+   ++EI +     +       SR++N +AH L R A  N
Subjt:  DATELCLFVDEIDRFRGPDQAKFFSKCSRQSNSLAHELERAASRN

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]3.7e-1935.96Show/hide
Query:  FNKARKQIAISIEESPKIQDKTQSQPKLESLSSHLQWEPPEPDYWKLNVDASWSEREESGGLGWAIRDSNGSLIGAGCKKISNMWNIKCLEEEAIIEGLK
        F+   +Q+   + ES   Q +T      ++L++ L+WEPP    W LN DASWS+    GG+GW IR  +G ++ AG + +    N+K LE  AI+EGL+
Subjt:  FNKARKQIAISIEESPKIQDKTQSQPKLESLSSHLQWEPPEPDYWKLNVDASWSEREESGGLGWAIRDSNGSLIGAGCKKISNMWNIKCLEEEAIIEGLK

Query:  AYEDNGDFEGSRCKPPLVVESDSVEVVEAMNRASEDATELCLFVDEIDRFRGPDQAKFFSKCSRQSNSLAHELERAAS
           + G         PL +E+DS EV   +NR  ED T+    V+EI   R   +   F+K  R++N  AH L + AS
Subjt:  AYEDNGDFEGSRCKPPLVVESDSVEVVEAMNRASEDATELCLFVDEIDRFRGPDQAKFFSKCSRQSNSLAHELERAAS

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia]1.6e-1736.73Show/hide
Query:  QWEPPEPDYWKLNVDASWSEREESGGLGWAIRDSNGSLIGAGCKKISNMWNIKCLEEEAIIEGLKA--YEDNGDFEGSRCKPPLVVESDSVEVVEAMNRA
        +W+PP  + WKLN DA+W     +GG+GW +RD  G +I A C+ I    NI  LE  AI EGL+A   E     +   C+ P+ +ESDS+E +  ++R 
Subjt:  QWEPPEPDYWKLNVDASWSEREESGGLGWAIRDSNGSLIGAGCKKISNMWNIKCLEEEAIIEGLKA--YEDNGDFEGSRCKPPLVVESDSVEVVEAMNRA

Query:  SEDATELCLFVDEIDRFRGPDQAKFFSKCSRQSNSLAHELERAASRN
         +D TE+   ++EI +     +       SR++N +AH+L R A  N
Subjt:  SEDATELCLFVDEIDRFRGPDQAKFFSKCSRQSNSLAHELERAASRN

XP_037439691.1 uncharacterized protein LOC119307722 isoform X1 [Triticum dicoccoides]2.2e-1132.18Show/hide
Query:  KLESLSSHLQWEPPEPDYWKLNVDASWSEREESGGLGWAIRDSNGSLIGAGCKKISNMWNIKCLEEEAIIEGLKAYEDNGDFEGSRCKPPLVVESDSVEV
        K +  +  ++W  P     KLNVDAS+   E +G  G  +RDS+G+ I A C       ++  +E  A++EGL+  E  G +        LVVESDS E+
Subjt:  KLESLSSHLQWEPPEPDYWKLNVDASWSEREESGGLGWAIRDSNGSLIGAGCKKISNMWNIKCLEEEAIIEGLKAYEDNGDFEGSRCKPPLVVESDSVEV

Query:  VEAMNRASEDATELCLFVDEIDRFRGPDQAKFFSKCSRQSNSLAHELERAASR---NSDCNFFVNLSLLYGEDD
        V+AM   SE      +  D+  +            C+R+SN +AH L R + R   N   +  V   LL G+ D
Subjt:  VEAMNRASEDATELCLFVDEIDRFRGPDQAKFFSKCSRQSNSLAHELERAASR---NSDCNFFVNLSLLYGEDD

TrEMBL top hitse value%identityAlignment
A0A6J1CP26 uncharacterized protein LOC1110134124.9e-1736.55Show/hide
Query:  QWEPPEPDYWKLNVDASWSEREESGGLGWAIRDSNGSLIGAGCKKISNMWNIKCLEEEAIIEGLKAYEDNGDFEGSRCKPPLVVESDSVEVVEAMNRASE
        QW+PP  + WKLN +A+W     +GG+GW +RD  G +I A C+ I    NI  LE  AI EGL+A           C+ P+ +ESDS+E +  ++R  +
Subjt:  QWEPPEPDYWKLNVDASWSEREESGGLGWAIRDSNGSLIGAGCKKISNMWNIKCLEEEAIIEGLKAYEDNGDFEGSRCKPPLVVESDSVEVVEAMNRASE

Query:  DATELCLFVDEIDRFRGPDQAKFFSKCSRQSNSLAHELERAASRN
        D TE+   ++EI +     +       SR++N +AH L R A  N
Subjt:  DATELCLFVDEIDRFRGPDQAKFFSKCSRQSNSLAHELERAASRN

A0A6J1CQG0 uncharacterized protein LOC1110132161.1e-1140.38Show/hide
Query:  LQWEPPEPDYWKLNVDASWSEREESGGLGWAIRDSNGSLIGAGCKKISNMWNIKCLEEEAIIEGLKAYEDNGDFEGSRCKPPLVVESDSVEVVEAMNRAS
        ++W  P  + WKLN DASWSE  E GG+GW + D  G ++ AG  KI     I  LE   II GL+       F   + + P+ +ESDSVEV+  M +  
Subjt:  LQWEPPEPDYWKLNVDASWSEREESGGLGWAIRDSNGSLIGAGCKKISNMWNIKCLEEEAIIEGLKAYEDNGDFEGSRCKPPLVVESDSVEVVEAMNRAS

Query:  EDAT
         D T
Subjt:  EDAT

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X11.5e-1037.62Show/hide
Query:  SSHLQWEPPEPDYWKLNVDASWSEREESGGLGWAIRDSNGSLIGAGCKKISNMWNIKCLEEEAIIEGLKAYEDNGDFEGSRCKPPLVVESDSVEVVEAMN
        ++  +W+PP  + WKLN DA+W     + G+GW +RD  G +I  GC+ I    NI  LE  AI EGL+A           C+ P+ +ESDS+E +  ++
Subjt:  SSHLQWEPPEPDYWKLNVDASWSEREESGGLGWAIRDSNGSLIGAGCKKISNMWNIKCLEEEAIIEGLKAYEDNGDFEGSRCKPPLVVESDSVEVVEAMN

Query:  R
        R
Subjt:  R

A0A6J1DNV9 uncharacterized protein LOC1110224031.8e-1935.96Show/hide
Query:  FNKARKQIAISIEESPKIQDKTQSQPKLESLSSHLQWEPPEPDYWKLNVDASWSEREESGGLGWAIRDSNGSLIGAGCKKISNMWNIKCLEEEAIIEGLK
        F+   +Q+   + ES   Q +T      ++L++ L+WEPP    W LN DASWS+    GG+GW IR  +G ++ AG + +    N+K LE  AI+EGL+
Subjt:  FNKARKQIAISIEESPKIQDKTQSQPKLESLSSHLQWEPPEPDYWKLNVDASWSEREESGGLGWAIRDSNGSLIGAGCKKISNMWNIKCLEEEAIIEGLK

Query:  AYEDNGDFEGSRCKPPLVVESDSVEVVEAMNRASEDATELCLFVDEIDRFRGPDQAKFFSKCSRQSNSLAHELERAAS
           + G         PL +E+DS EV   +NR  ED T+    V+EI   R   +   F+K  R++N  AH L + AS
Subjt:  AYEDNGDFEGSRCKPPLVVESDSVEVVEAMNRASEDATELCLFVDEIDRFRGPDQAKFFSKCSRQSNSLAHELERAAS

A0A6J1DSV1 uncharacterized protein LOC1110236087.5e-1836.73Show/hide
Query:  QWEPPEPDYWKLNVDASWSEREESGGLGWAIRDSNGSLIGAGCKKISNMWNIKCLEEEAIIEGLKA--YEDNGDFEGSRCKPPLVVESDSVEVVEAMNRA
        +W+PP  + WKLN DA+W     +GG+GW +RD  G +I A C+ I    NI  LE  AI EGL+A   E     +   C+ P+ +ESDS+E +  ++R 
Subjt:  QWEPPEPDYWKLNVDASWSEREESGGLGWAIRDSNGSLIGAGCKKISNMWNIKCLEEEAIIEGLKA--YEDNGDFEGSRCKPPLVVESDSVEVVEAMNRA

Query:  SEDATELCLFVDEIDRFRGPDQAKFFSKCSRQSNSLAHELERAASRN
         +D TE+   ++EI +     +       SR++N +AH+L R A  N
Subjt:  SEDATELCLFVDEIDRFRGPDQAKFFSKCSRQSNSLAHELERAASRN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G23320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein5.5e-0532.91Show/hide
Query:  QWEPPEPDYWKLNVDASWSEREESGGLGWAIRDSNGSLIGAGCKKISNMWNIKCLEEEAIIEGLKAYEDNG----DFEG
        +W+ P  ++ K N D S     +  GL W IR+S G+ +  GC K      IK  E  A+I  ++   D G    +FEG
Subjt:  QWEPPEPDYWKLNVDASWSEREESGGLGWAIRDSNGSLIGAGCKKISNMWNIKCLEEEAIIEGLKAYEDNG----DFEG

AT4G09775.1 BEST Arabidopsis thaliana protein match is: Ribonuclease H-like superfamily protein (TAIR:AT2G02650.1)5.5e-0532.39Show/hide
Query:  QPKLESLSSHLQWEPPEPDYWKLNVDASWSEREESGGLGWAIRDSNGSLIGAGCKKISNMWNIKCLEEEAI
        QP         +W PP   Y K N D+ + +  +     W IRDSNG +I +GC K+   ++   L+ EA+
Subjt:  QPKLESLSSHLQWEPPEPDYWKLNVDASWSEREESGGLGWAIRDSNGSLIGAGCKKISNMWNIKCLEEEAI

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein4.5e-0727.33Show/hide
Query:  SSHLQWEPPEPDYWKLNVDASWSEREESGGLGWAIRDSNGSLIGAGCKKISNMWNIKCLEEEAIIEGLKAYEDNGDFEGSRCKPPLVVESDSVEVVEAMN
        S + +W PP  D  K N DAS  ER    GLGW +R+S G++I  G  K       +  E   +I  ++A    G          ++ E D+  +   +N
Subjt:  SSHLQWEPPEPDYWKLNVDASWSEREESGGLGWAIRDSNGSLIGAGCKKISNMWNIKCLEEEAIIEGLKAYEDNGDFEGSRCKPPLVVESDSVEVVEAMN

Query:  RASEDATELCLFVDEIDRFRGPDQAKFFSKCSRQSNSLAHELERAASRNS
          S +   L  F+D I  +    ++  FS   R+ N  A  L + A + +
Subjt:  RASEDATELCLFVDEIDRFRGPDQAKFFSKCSRQSNSLAHELERAASRNS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACACAAGGAATGCAAGCAACACCAACAACCAACATTACCAGATTTCAACAAAGCCAGAAAACAAATTGCTATCAGTATCGAAGAAAGCCCTAAGATTCAAGACAA
GACCCAAAGCCAGCCAAAGTTAGAGAGCCTTTCGAGTCACTTACAATGGGAGCCTCCGGAGCCCGACTACTGGAAGCTTAATGTTGATGCCTCTTGGTCCGAAAGAGAGG
AGAGCGGCGGTCTTGGCTGGGCAATTCGTGACTCTAACGGATCTTTAATCGGAGCGGGCTGCAAGAAGATTTCGAATATGTGGAACATTAAATGCCTTGAAGAAGAAGCG
ATTATTGAAGGTTTAAAAGCTTACGAGGACAACGGTGATTTCGAAGGAAGCAGATGTAAGCCACCGCTGGTAGTTGAGTCTGACTCTGTCGAAGTTGTGGAAGCTATGAA
TCGAGCCTCTGAAGACGCCACCGAGCTCTGCTTGTTCGTGGATGAAATCGACAGATTCAGAGGTCCGGACCAAGCGAAATTCTTCTCCAAATGCTCCAGACAGAGCAATT
CTCTGGCGCACGAGCTTGAGCGAGCAGCGTCGAGAAACAGCGACTGTAATTTTTTTGTAAATCTCTCTCTCCTCTATGGAGAAGATGATCAGTTTTGGAGGGAAGTTCCT
TTTCTTGGGTGGTTCACTTCTGTTATTAATGCCCTAGTGGGTGCAACTGTTCTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAACACAAGGAATGCAAGCAACACCAACAACCAACATTACCAGATTTCAACAAAGCCAGAAAACAAATTGCTATCAGTATCGAAGAAAGCCCTAAGATTCAAGACAA
GACCCAAAGCCAGCCAAAGTTAGAGAGCCTTTCGAGTCACTTACAATGGGAGCCTCCGGAGCCCGACTACTGGAAGCTTAATGTTGATGCCTCTTGGTCCGAAAGAGAGG
AGAGCGGCGGTCTTGGCTGGGCAATTCGTGACTCTAACGGATCTTTAATCGGAGCGGGCTGCAAGAAGATTTCGAATATGTGGAACATTAAATGCCTTGAAGAAGAAGCG
ATTATTGAAGGTTTAAAAGCTTACGAGGACAACGGTGATTTCGAAGGAAGCAGATGTAAGCCACCGCTGGTAGTTGAGTCTGACTCTGTCGAAGTTGTGGAAGCTATGAA
TCGAGCCTCTGAAGACGCCACCGAGCTCTGCTTGTTCGTGGATGAAATCGACAGATTCAGAGGTCCGGACCAAGCGAAATTCTTCTCCAAATGCTCCAGACAGAGCAATT
CTCTGGCGCACGAGCTTGAGCGAGCAGCGTCGAGAAACAGCGACTGTAATTTTTTTGTAAATCTCTCTCTCCTCTATGGAGAAGATGATCAGTTTTGGAGGGAAGTTCCT
TTTCTTGGGTGGTTCACTTCTGTTATTAATGCCCTAGTGGGTGCAACTGTTCTTTAG
Protein sequenceShow/hide protein sequence
MEHKECKQHQQPTLPDFNKARKQIAISIEESPKIQDKTQSQPKLESLSSHLQWEPPEPDYWKLNVDASWSEREESGGLGWAIRDSNGSLIGAGCKKISNMWNIKCLEEEA
IIEGLKAYEDNGDFEGSRCKPPLVVESDSVEVVEAMNRASEDATELCLFVDEIDRFRGPDQAKFFSKCSRQSNSLAHELERAASRNSDCNFFVNLSLLYGEDDQFWREVP
FLGWFTSVINALVGATVL