CuGenDBv2

Lag0019686 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0019686
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: RNase H domain-containing protein
Genome location: chr5:44456086..44457312
RNA-Seq Expression: Lag0019686
Synteny: Lag0019686
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR002156 - Ribonuclease H domain
IPR044730 - Ribonuclease H-like domain, plant type
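
The genome location is given in `chrom:start..end` notation. As a minimal sketch, the string can be parsed and the span computed as follows. The names `Location` and `parse_location` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string such as 'chr5:44456086..44457312'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("chr5:44456086..44457312")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 1,227 bp on chr5.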


Homology
GenBank top hits (e value, %identity, alignment)
CAB4273082.1 unnamed protein product [Prunus armeniaca]  e value: 2.4e-07  %identity: 33.57
Query:  MPKNPASQAQGA----KPPPNVWKLNSDATWFEKEGRGGLEWLVRDSEGSIICFRMKQVNKKWAIKNLKACAM-LEGIKEVTDTCNRLGI-RLEVETDAI
        +P+ P  QA+      KPP  V K+N DA W  +   GG+ W+VRDS+G ++     Q   K  ++   A AM  E I+E    C + G+ +LEVE+D++
Subjt:  MPKNPASQAQGA----KPPPNVWKLNSDATWFEKEGRGGLEWLVRDSEGSIICFRMKQVNKKWAIKNLKACAM-LEGIKEVTDTCNRLGI-RLEVETDAI

Query:  EVVKAINGESEDLSDLKIFTDEIKALATQAFSMSFSYCNRLLN
        +V++ I GE      +     +IK L  Q     F Y  R  N
Subjt:  EVVKAINGESEDLSDLKIFTDEIKALATQAFSMSFSYCNRLLN

KAG6599977.1 hypothetical protein SDJN03_05210, partial [Cucurbita argyrosperma subsp. sororia]  e value: 1.3e-08  %identity: 37.39
Query:  KLNSDATWFEKEGRGGLEWLVRDSEGSIICFRMKQVNKKWAIKNLKACAMLEGIKEVTDTCNRLGIRLEVETD-----AIEVVKAINGESEDLSDLKIFT
        KLNS A+W E+ G+GG+ W++ DS GS IC   K++ +KW +K L+  AM+EG+K        L IR  V         +E+ + +N    DL +L    
Subjt:  KLNSDATWFEKEGRGGLEWLVRDSEGSIICFRMKQVNKKWAIKNLKACAMLEGIKEVTDTCNRLGIRLEVETD-----AIEVVKAINGESEDLSDLKIFT

Query:  DEIKALATQAFSMSF
        DEI  L   A  +SF
Subjt:  DEIKALATQAFSMSF

XP_021763631.1 uncharacterized protein LOC110728264 [Chenopodium quinoa]  e value: 6.4e-08  %identity: 32.84
Query:  ASQAQGAKPPP--NVWKLNSDATWFEKEGRGGLEWLVRDSEGSIICFRMKQVNKKWAIKNLKACAMLEGIKEVTDTCNRLGIRLEVETDAIEVVKAINGE
        +S+  G   PP  + +KLN+DA+   K+ RGGL  +VRD+ G ++    K +N    I  ++A A+L G++ V D   R   +LEV +D ++V++ +NG 
Subjt:  ASQAQGAKPPP--NVWKLNSDATWFEKEGRGGLEWLVRDSEGSIICFRMKQVNKKWAIKNLKACAMLEGIKEVTDTCNRLGIRLEVETDAIEVVKAINGE

Query:  SEDLSDLKIFTDEIKALATQAFSMSFSYCNRLLN
          + S  ++   +I + A     + FS+C RL N
Subjt:  SEDLSDLKIFTDEIKALATQAFSMSFSYCNRLLN

XP_022148549.1 uncharacterized protein LOC111017181 [Momordica charantia]  e value: 2.0e-06  %identity: 34.29
Query:  PPPNVWKLNSDATWFEKEGRGGLEWLVRDSEGSIICFRMKQVNKKWAIKNLKACAMLEGIKEVTDTCN-RLGIRLEVETDAIEVVKAINGESEDLSDLKI
        P  ++WKLN DATW +    GGL W+VRDSEG  I                    M E +K ++ +     GI++E+E+D +EVV  IN  S  L+++ +
Subjt:  PPPNVWKLNSDATWFEKEGRGGLEWLVRDSEGSIICFRMKQVNKKWAIKNLKACAMLEGIKEVTDTCN-RLGIRLEVETDAIEVVKAINGESEDLSDLKI

Query:  FTDEI
          ++I
Subjt:  FTDEI

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]  e value: 5.2e-10  %identity: 29.63
Query:  DRSIVIMPKNPASQAQGAKPPPNVWKLNSDATWFEKEGRGGLEWLVRDSEGSIICFRMKQVNKKWAIKNLKACAMLEGIKEVTDTCNRLGI--RLEVETD
        + S+ ++ K   ++ +   PP ++W LN+DA+W +   RGG+ W++R  +G I+    + V     +K L+A A+LEG++ +T+    LG+   L +ETD
Subjt:  DRSIVIMPKNPASQAQGAKPPPNVWKLNSDATWFEKEGRGGLEWLVRDSEGSIICFRMKQVNKKWAIKNLKACAMLEGIKEVTDTCNRLGI--RLEVETD

Query:  AIEVVKAINGESEDLSDLKIFTDEIKALATQAFSMSFSY-------CNRLLNTTVTVLRETL
        + EV   +N + EDL+      +EI  L      ++F+        C   L    +VLRE++
Subjt:  AIEVVKAINGESEDLSDLKIFTDEIKALATQAFSMSFSY-------CNRLLNTTVTVLRETL

TrEMBL top hits (e value, %identity, alignment)
A0A3P6ESY2 RNase H domain-containing protein (Fragment)  e value: 2.9e-06  %identity: 27.65
Query:  KDYWGWMAANLGKEEIDRSIVIMPKNPASQAQGAKPPPNVWKLNSDATWFEKEGRGGLEWLVRDSEGSIICFRMKQVNKKWAIKNLKACAMLEGIKEVTD
        +D + W +    + E+    V  P  P    +   P     K N+DA W ++E  GG  W++RD  G +I    K++ +  +    +A A+   ++ VT 
Subjt:  KDYWGWMAANLGKEEIDRSIVIMPKNPASQAQGAKPPPNVWKLNSDATWFEKEGRGGLEWLVRDSEGSIICFRMKQVNKKWAIKNLKACAMLEGIKEVTD

Query:  TCNRLGI-RLEVETDAIEVVKAINGESEDLSDLKIFTDEIKALATQAFSMSFSYCNRLLNTTV-TVLRET
             G  R+++ETD++++++ INGE E    L+    EI AL +    ++  Y  R  N T   + RET
Subjt:  TCNRLGI-RLEVETDAIEVVKAINGESEDLSDLKIFTDEIKALATQAFSMSFSYCNRLLNTTV-TVLRET

A0A5B7A5C8 Uncharacterized protein  e value: 1.7e-06  %identity: 30.87
Query:  EEIDRSIVIMPKNPASQAQGAKPPP--NVWKLNSDATWFEKEGRGGLEWLVRDSEGSIICFRMKQVNKKWAIKNLKACAMLEGIKEVTDTCNRLGI-RLE
        ++    + ++P  P      +  PP   V+KLN D +W   E  GG+  ++RD +G +I    K + +  +  + KA A+L GI    D    +GI +L 
Subjt:  EEIDRSIVIMPKNPASQAQGAKPPP--NVWKLNSDATWFEKEGRGGLEWLVRDSEGSIICFRMKQVNKKWAIKNLKACAMLEGIKEVTDTCNRLGI-RLE

Query:  VETDAIEVVKAINGESEDLSDLKIFTDEIKALATQAFSMSFSYCNRLLN
        VE D + V+ AI   S+DLSDL    D+IK       S   ++  R  N
Subjt:  VETDAIEVVKAINGESEDLSDLKIFTDEIKALATQAFSMSFSYCNRLLN

A0A6J1D4B6 uncharacterized protein LOC111017181  e value: 9.9e-07  %identity: 34.29
Query:  PPPNVWKLNSDATWFEKEGRGGLEWLVRDSEGSIICFRMKQVNKKWAIKNLKACAMLEGIKEVTDTCN-RLGIRLEVETDAIEVVKAINGESEDLSDLKI
        P  ++WKLN DATW +    GGL W+VRDSEG  I                    M E +K ++ +     GI++E+E+D +EVV  IN  S  L+++ +
Subjt:  PPPNVWKLNSDATWFEKEGRGGLEWLVRDSEGSIICFRMKQVNKKWAIKNLKACAMLEGIKEVTDTCN-RLGIRLEVETDAIEVVKAINGESEDLSDLKI

Query:  FTDEI
          ++I
Subjt:  FTDEI

A0A6J1DNV9 uncharacterized protein LOC111022403  e value: 2.5e-10  %identity: 29.63
Query:  DRSIVIMPKNPASQAQGAKPPPNVWKLNSDATWFEKEGRGGLEWLVRDSEGSIICFRMKQVNKKWAIKNLKACAMLEGIKEVTDTCNRLGI--RLEVETD
        + S+ ++ K   ++ +   PP ++W LN+DA+W +   RGG+ W++R  +G I+    + V     +K L+A A+LEG++ +T+    LG+   L +ETD
Subjt:  DRSIVIMPKNPASQAQGAKPPPNVWKLNSDATWFEKEGRGGLEWLVRDSEGSIICFRMKQVNKKWAIKNLKACAMLEGIKEVTDTCNRLGI--RLEVETD

Query:  AIEVVKAINGESEDLSDLKIFTDEIKALATQAFSMSFSY-------CNRLLNTTVTVLRETL
        + EV   +N + EDL+      +EI  L      ++F+        C   L    +VLRE++
Subjt:  AIEVVKAINGESEDLSDLKIFTDEIKALATQAFSMSFSY-------CNRLLNTTVTVLRETL

A0A6J5UAY2 Reverse transcriptase domain-containing protein  e value: 1.2e-07  %identity: 33.57
Query:  MPKNPASQAQGA----KPPPNVWKLNSDATWFEKEGRGGLEWLVRDSEGSIICFRMKQVNKKWAIKNLKACAM-LEGIKEVTDTCNRLGI-RLEVETDAI
        +P+ P  QA+      KPP  V K+N DA W  +   GG+ W+VRDS+G ++     Q   K  ++   A AM  E I+E    C + G+ +LEVE+D++
Subjt:  MPKNPASQAQGA----KPPPNVWKLNSDATWFEKEGRGGLEWLVRDSEGSIICFRMKQVNKKWAIKNLKACAM-LEGIKEVTDTCNRLGI-RLEVETDAI

Query:  EVVKAINGESEDLSDLKIFTDEIKALATQAFSMSFSYCNRLLN
        +V++ I GE      +     +IK L  Q     F Y  R  N
Subjt:  EVVKAINGESEDLSDLKIFTDEIKALATQAFSMSFSYCNRLLN

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
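
The alignment blocks above follow the BLAST pairwise layout: the middle line repeats the residue at identical positions and shows `+` for conservative substitutions. The following rough sketch counts identities and positives for a single block, using the last block of the KAG6599977.1 alignment as input. `count_identities` is a hypothetical helper, not part of any BLAST tool, and it assumes the middle line has already been trimmed to start at the same column as the query. One block alone will not generally reproduce the whole-alignment %identity listed for a hit.

```python
def count_identities(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one alignment block.

    Assumes `midline` starts at the same column as `query`.
    """
    # Trailing spaces on the middle line are often stripped; pad them back.
    midline = midline.ljust(len(query))
    # An identity is a column where the middle line repeats the query residue.
    identities = sum(1 for q, m in zip(query, midline) if m == q and q != "-")
    # BLAST "positives" include identities plus '+' (conservative) columns.
    positives = identities + midline.count("+")
    return identities, positives, len(query)


query = "DEIKALATQAFSMSF"
midline = "DEI  L   A  +SF"
ident, pos, length = count_identities(query, midline)
print(f"{ident}/{length} identical ({100 * ident / length:.1f}%), "
      f"{pos}/{length} positive")
```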

Sequences
CDS sequence
ATGAAGCTTCCCAATCCAACCCCTCAACGGAAGCAACTTTCTGGGATAAGTTATGGAAGGCCAATGTTCTCCCAAGATCCAAAGGAGAATTGGCAACCAAAGGATTACTG
GGGATGGATGGCTGCTAACCTTGGCAAAGAAGAGATTGACAGAAGCATTGTTATCATGCCAAAGAATCCCGCGAGTCAAGCTCAAGGGGCGAAGCCGCCGCCGAACGTAT
GGAAACTTAATTCAGACGCAACCTGGTTCGAGAAAGAAGGTCGCGGAGGTCTCGAATGGCTCGTGCGTGACTCGGAGGGTTCCATAATCTGTTTCAGAATGAAGCAAGTT
AATAAAAAATGGGCTATCAAGAATTTGAAAGCTTGCGCTATGCTTGAAGGTATCAAGGAAGTTACAGATACCTGTAATCGTCTCGGTATTCGCCTGGAAGTCGAGACGGA
TGCTATTGAGGTCGTTAAGGCCATCAACGGCGAGTCGGAAGATTTGTCCGACCTGAAGATCTTCACTGATGAAATTAAGGCTCTTGCTACCCAAGCCTTTTCCATGAGCT
TTAGCTATTGTAATCGTCTTTTGAACACAACGGTCACTGTGTTGCGAGAAACGCTGCCGGCAGGTCCTCGTCTTCTCCGGCGAAGCCTCCTAGCCAGTCGTCTTCTTCGC
GGGATAGGGAGCATAACGTTGCCAACAAGAGCATAG
mRNA sequence
ATGAAGCTTCCCAATCCAACCCCTCAACGGAAGCAACTTTCTGGGATAAGTTATGGAAGGCCAATGTTCTCCCAAGATCCAAAGGAGAATTGGCAACCAAAGGATTACTG
GGGATGGATGGCTGCTAACCTTGGCAAAGAAGAGATTGACAGAAGCATTGTTATCATGCCAAAGAATCCCGCGAGTCAAGCTCAAGGGGCGAAGCCGCCGCCGAACGTAT
GGAAACTTAATTCAGACGCAACCTGGTTCGAGAAAGAAGGTCGCGGAGGTCTCGAATGGCTCGTGCGTGACTCGGAGGGTTCCATAATCTGTTTCAGAATGAAGCAAGTT
AATAAAAAATGGGCTATCAAGAATTTGAAAGCTTGCGCTATGCTTGAAGGTATCAAGGAAGTTACAGATACCTGTAATCGTCTCGGTATTCGCCTGGAAGTCGAGACGGA
TGCTATTGAGGTCGTTAAGGCCATCAACGGCGAGTCGGAAGATTTGTCCGACCTGAAGATCTTCACTGATGAAATTAAGGCTCTTGCTACCCAAGCCTTTTCCATGAGCT
TTAGCTATTGTAATCGTCTTTTGAACACAACGGTCACTGTGTTGCGAGAAACGCTGCCGGCAGGTCCTCGTCTTCTCCGGCGAAGCCTCCTAGCCAGTCGTCTTCTTCGC
GGGATAGGGAGCATAACGTTGCCAACAAGAGCATAG
Protein sequence
MKLPNPTPQRKQLSGISYGRPMFSQDPKENWQPKDYWGWMAANLGKEEIDRSIVIMPKNPASQAQGAKPPPNVWKLNSDATWFEKEGRGGLEWLVRDSEGSIICFRMKQV
NKKWAIKNLKACAMLEGIKEVTDTCNRLGIRLEVETDAIEVVKAINGESEDLSDLKIFTDEIKALATQAFSMSFSYCNRLLNTTVTVLRETLPAGPRLLRRSLLASRLLR
GIGSITLPTRA
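
The protein sequence should correspond to a translation of the CDS. A minimal sketch of that check follows, assuming the standard genetic code (NCBI translation table 1); the record does not say which table was used. `translate` is a hypothetical helper, and only the first 30 and last 36 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed), listed in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGAAGCTTCCCAATCCAACCCCTCAACGG"        # first 30 nt of the CDS
cds_end = "GGGATAGGGAGCATAACGTTGCCAACAAGAGCATAG"    # last 36 nt of the CDS
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first ten and last eleven residues of the protein sequence listed above. Passing the full CDS, with its line breaks, to `translate` works the same way.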