; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018625 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018625
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRNase H domain-containing protein
Genome locationchr5:31120697..31121476
RNA-Seq ExpressionLag0018625
SyntenyLag0018625
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4321714.1 unnamed protein product [Prunus armeniaca]3.1e-0944.59Show/hide
Query:  GCCSVCGKEEETVNHALLECDPARAFWFGSPLQLDAGRFRQLHFAESWERVDQWLRERDGTGEAQCLFAFGLWR
        G C  CG ++ET  H   ECD ARAFWF SPLQLD  +     F  +W+ +   L   +   EA   F FGLWR
Subjt:  GCCSVCGKEEETVNHALLECDPARAFWFGSPLQLDAGRFRQLHFAESWERVDQWLRERDGTGEAQCLFAFGLWR

XP_010495306.1 PREDICTED: uncharacterized protein LOC104772378 [Camelina sativa]1.8e-0927.98Show/hide
Query:  CSVCGKEEETVNHALLECDPARAFWFGSPLQLDAGRFRQLHF-AES-WERVDQWLRERDGTGEAQCLFAFGLW-----RGER------------LTLGVD
        C+ CG E ET+NHA+  C PAR  W  + + + +     LHF +ES +  VD +L   +   + Q +F + +W     R  R            + +  D
Subjt:  CSVCGKEEETVNHALLECDPARAFWFGSPLQLDAGRFRQLHF-AES-WERVDQWLRERDGTGEAQCLFAFGLW-----RGER------------LTLGVD

Query:  AAWDRASLATRYGWAVEGLDSVRQREEACRWRGSLNNLHVEGRAMRWGLECMRSMGVKSLYVKSDSLCLVNIVNHKEKCPADFVPIYEMIYQLYEYFDLC
         +W         GW       V     A  +R SL+ LH E  A  W + CM     + +   +DS  LV +V+     PA F    E I    E F   
Subjt:  AAWDRASLATRYGWAVEGLDSVRQREEACRWRGSLNNLHVEGRAMRWGLECMRSMGVKSLYVKSDSLCLVNIVNHKEKCPADFVPIYEMIYQLYEYFDLC

Query:  DFLYVKRDLNGNAHRLAK
           Y+ R+ N  A  LA+
Subjt:  DFLYVKRDLNGNAHRLAK

XP_019094536.1 PREDICTED: uncharacterized protein LOC109129945 [Camelina sativa]7.4e-1130.45Show/hide
Query:  MGTDGCCSVCGKEEETVNHALLECDPARAFWFGSPL-QLDAGRFRQLHFAE----SW------ERVDQWLRERDGTGEAQCLFAFGLWRGERLTLG----
        +G D  CS C  +EET+NHAL EC PAR          LD+     L  AE    +W      + V   + +  G   A  L   G     R  L     
Subjt:  MGTDGCCSVCGKEEETVNHALLECDPARAFWFGSPL-QLDAGRFRQLHFAE----SW------ERVDQWLRERDGTGEAQCLFAFGLWRGERLTLG----

Query:  VDAAWDRASLATRYGWAVEGLDSVRQREEACRWRGSLNNLHVEGRAMRWGLECMRSMGVKSLYVKSDSLCLVNIVNHKEKCPADFVPIYEMIYQLYEYFD
        VD +W         GW             A   R SL+ LH E  A+ W + CM     +S+   +D   LV +V+   + PA F P  E IY   E F 
Subjt:  VDAAWDRASLATRYGWAVEGLDSVRQREEACRWRGSLNNLHVEGRAMRWGLECMRSMGVKSLYVKSDSLCLVNIVNHKEKCPADFVPIYEMIYQLYEYFD

Query:  LCDFLYVKRDLNGNAHRLAK
            + V R  NG A  LA+
Subjt:  LCDFLYVKRDLNGNAHRLAK

XP_021763973.1 uncharacterized protein LOC110728637 [Chenopodium quinoa]1.2e-1327.59Show/hide
Query:  DGCCSVCGKEEETVNHALLECDPARAFWFGSPLQLDAGRFRQLHFAESWERVDQWLRERDGTGEAQCLFAFGLWRGERLTLGVDAAWDRASLATRYGWAV
        D  CS+CG  EET+NHAL EC+ A+  W     +          F E W  +   L+E D    A  L+A   WR   + +  ++  +   +A  Y   V
Subjt:  DGCCSVCGKEEETVNHALLECDPARAFWFGSPLQLDAGRFRQLHFAESWERVDQWLRERDGTGEAQCLFAFGLWRGERLTLGVDAAWDRASLATRYGWAV

Query:  EGLDSVRQREEACRW---------RGS-----------LNNLHVEGRAMRWGLECMRSMGVKSLYVKSDSLCLVNIVNHKEKCPADFVPIYEMIYQLYEY
        E      +R   CR           GS           L     E  A+ + +  +   G  S +++SD+L +V+ +  +    A    I E I  LY  
Subjt:  EGLDSVRQREEACRW---------RGS-----------LNNLHVEGRAMRWGLECMRSMGVKSLYVKSDSLCLVNIVNHKEKCPADFVPIYEMIYQLYEY

Query:  FDLCDFLYVKRDLNGNAHRLAK--LGLCNAMI
        F++C F +VKR+ N  AH +A+  +G C  +I
Subjt:  FDLCDFLYVKRDLNGNAHRLAK--LGLCNAMI

XP_022544206.1 uncharacterized protein LOC111199041 [Brassica napus]1.1e-0927.98Show/hide
Query:  MGTDGCCSVCGKEEETVNHALLECDPA------RAFWFGSPLQLDAGRFRQLHFAESWERVDQWLRERDGTGEAQ------CLFAFGLWRGERLTLGVDA
        MGTD  C  C    E++NH L EC PA      + F       +D  +   L  AE W + +    E++ T E Q         A  L   +  T  +DA
Subjt:  MGTDGCCSVCGKEEETVNHALLECDPA------RAFWFGSPLQLDAGRFRQLHFAESWERVDQWLRERDGTGEAQ------CLFAFGLWRGERLTLGVDA

Query:  AWDRASLATRYGWAV-EGLDSVRQREEACRWRGSLNNLHVEGRAMRWGLECMRSMGVKSLYVKSDSLCLVNIVNHKEKCPADFVPIYEMIYQLYEYFDLC
        +W      +  GW++ + +D       AC  + SL+ LH E   + W   CMR M + S+  ++D   LV ++ +  + PA F    E+  ++ E  +  
Subjt:  AWDRASLATRYGWAV-EGLDSVRQREEACRWRGSLNNLHVEGRAMRWGLECMRSMGVKSLYVKSDSLCLVNIVNHKEKCPADFVPIYEMIYQLYEYFDLC

Query:  DFLYVKRDLNGNAHRLAK
           ++ R  NG A  LAK
Subjt:  DFLYVKRDLNGNAHRLAK

TrEMBL top hitse value%identityAlignment
A0A6J5UE59 Reverse transcriptase domain-containing protein1.5e-0944.59Show/hide
Query:  GCCSVCGKEEETVNHALLECDPARAFWFGSPLQLDAGRFRQLHFAESWERVDQWLRERDGTGEAQCLFAFGLWR
        G C  CG ++ET  H   ECD ARAFWF SPLQLD  +     F  +W+ +   L   +   EA   F FGLWR
Subjt:  GCCSVCGKEEETVNHALLECDPARAFWFGSPLQLDAGRFRQLHFAESWERVDQWLRERDGTGEAQCLFAFGLWR

A0A6J5UF50 Uncharacterized protein1.5e-0944.59Show/hide
Query:  GCCSVCGKEEETVNHALLECDPARAFWFGSPLQLDAGRFRQLHFAESWERVDQWLRERDGTGEAQCLFAFGLWR
        G C  CG ++ET  H   ECD ARAFWF SPLQLD  +     F  +W+ +   L   +   EA   F FGLWR
Subjt:  GCCSVCGKEEETVNHALLECDPARAFWFGSPLQLDAGRFRQLHFAESWERVDQWLRERDGTGEAQCLFAFGLWR

A0A6J5VSP2 Uncharacterized protein1.5e-0944.59Show/hide
Query:  GCCSVCGKEEETVNHALLECDPARAFWFGSPLQLDAGRFRQLHFAESWERVDQWLRERDGTGEAQCLFAFGLWR
        G C  CG ++ET  H   ECD ARAFWF SPLQLD  +     F  +W+ +   L   +   EA   F FGLWR
Subjt:  GCCSVCGKEEETVNHALLECDPARAFWFGSPLQLDAGRFRQLHFAESWERVDQWLRERDGTGEAQCLFAFGLWR

A0A6J5WPU6 Reverse transcriptase domain-containing protein1.5e-0944.59Show/hide
Query:  GCCSVCGKEEETVNHALLECDPARAFWFGSPLQLDAGRFRQLHFAESWERVDQWLRERDGTGEAQCLFAFGLWR
        G C  CG ++ET  H   ECD ARAFWF SPLQLD  +     F  +W+ +   L   +   EA   F FGLWR
Subjt:  GCCSVCGKEEETVNHALLECDPARAFWFGSPLQLDAGRFRQLHFAESWERVDQWLRERDGTGEAQCLFAFGLWR

A0A6J5YDN0 Uncharacterized protein1.5e-0944.59Show/hide
Query:  GCCSVCGKEEETVNHALLECDPARAFWFGSPLQLDAGRFRQLHFAESWERVDQWLRERDGTGEAQCLFAFGLWR
        G C  CG ++ET  H   ECD ARAFWF SPLQLD  +     F  +W+ +   L   +   EA   F FGLWR
Subjt:  GCCSVCGKEEETVNHALLECDPARAFWFGSPLQLDAGRFRQLHFAESWERVDQWLRERDGTGEAQCLFAFGLWR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAACTGATGGGTGCTGCAGTGTCTGCGGCAAGGAGGAAGAAACGGTTAATCATGCTTTGCTAGAATGTGATCCGGCCAGAGCCTTTTGGTTTGGATCACCACTTCA
ACTGGACGCTGGGAGGTTCCGACAGTTGCATTTCGCTGAAAGTTGGGAGAGAGTGGACCAATGGTTGAGGGAGAGGGATGGGACTGGAGAAGCGCAATGCCTCTTTGCGT
TTGGGCTATGGAGGGGGGAGCGTCTGACGCTGGGGGTGGATGCTGCGTGGGATAGAGCCTCGTTGGCAACAAGGTATGGGTGGGCTGTAGAGGGTTTGGACAGTGTGAGG
CAGAGGGAGGAGGCCTGTAGATGGAGGGGGAGTTTGAACAACCTGCATGTTGAGGGTCGGGCTATGAGGTGGGGGTTGGAATGTATGAGGAGCATGGGGGTGAAAAGCCT
ATACGTCAAATCGGATTCCCTGTGCCTGGTTAATATTGTCAATCATAAAGAGAAGTGCCCAGCTGATTTCGTGCCAATATATGAGATGATTTACCAACTATATGAGTACT
TTGATCTGTGTGATTTCCTCTACGTGAAGAGGGATTTGAATGGGAATGCCCATCGCTTGGCCAAATTAGGCCTATGTAATGCTATGATCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAACTGATGGGTGCTGCAGTGTCTGCGGCAAGGAGGAAGAAACGGTTAATCATGCTTTGCTAGAATGTGATCCGGCCAGAGCCTTTTGGTTTGGATCACCACTTCA
ACTGGACGCTGGGAGGTTCCGACAGTTGCATTTCGCTGAAAGTTGGGAGAGAGTGGACCAATGGTTGAGGGAGAGGGATGGGACTGGAGAAGCGCAATGCCTCTTTGCGT
TTGGGCTATGGAGGGGGGAGCGTCTGACGCTGGGGGTGGATGCTGCGTGGGATAGAGCCTCGTTGGCAACAAGGTATGGGTGGGCTGTAGAGGGTTTGGACAGTGTGAGG
CAGAGGGAGGAGGCCTGTAGATGGAGGGGGAGTTTGAACAACCTGCATGTTGAGGGTCGGGCTATGAGGTGGGGGTTGGAATGTATGAGGAGCATGGGGGTGAAAAGCCT
ATACGTCAAATCGGATTCCCTGTGCCTGGTTAATATTGTCAATCATAAAGAGAAGTGCCCAGCTGATTTCGTGCCAATATATGAGATGATTTACCAACTATATGAGTACT
TTGATCTGTGTGATTTCCTCTACGTGAAGAGGGATTTGAATGGGAATGCCCATCGCTTGGCCAAATTAGGCCTATGTAATGCTATGATCTAA
Protein sequenceShow/hide protein sequence
MGTDGCCSVCGKEEETVNHALLECDPARAFWFGSPLQLDAGRFRQLHFAESWERVDQWLRERDGTGEAQCLFAFGLWRGERLTLGVDAAWDRASLATRYGWAVEGLDSVR
QREEACRWRGSLNNLHVEGRAMRWGLECMRSMGVKSLYVKSDSLCLVNIVNHKEKCPADFVPIYEMIYQLYEYFDLCDFLYVKRDLNGNAHRLAKLGLCNAMI