CuGenDBv2

Lag0032000 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0032000
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr11:22205292..22208339
RNA-Seq Expression: Lag0032000
Synteny: Lag0032000
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains: IPR002156 - Ribonuclease H domain


Homology
GenBank top hits | e value | % identity | Alignment
EEF49216.1 conserved hypothetical protein [Ricinus communis] | 4.4e-06 | 56.9
Query:  HEARHILSIPLRQVACDDTVIWHFEKSGLYSVKSGYRLGQGPLVVQGPSSSSSEWCIG
        H+A  ILSIP  Q +C D V WHF+K G YSVKS Y LG  P V Q  SSSS   C G
Subjt:  HEARHILSIPLRQVACDDTVIWHFEKSGLYSVKSGYRLGQGPLVVQGPSSSSSEWCIG

KAG8363091.1 hypothetical protein BUALT_BualtUnG0005200 [Buddleja alternifolia] | 6.1e-08 | 27.5
Query:  PHEARHILSIPLRQVACDDTVIWHFEKSGLYSVKSGYRLGQGPLVVQGP---SSSSSEWCIGGGMTVGEIMWSAGFGAIVAKCEAVDIKFLLRDVNDELE
        P +A  ILSIPL + +C D++IWH+ K+GL+SVKS Y + +  +   GP   SSSS++W           +W A                          
Subjt:  PHEARHILSIPLRQVACDDTVIWHFEKSGLYSVKSGYRLGQGPLVVQGP---SSSSSEWCIGGGMTVGEIMWSAGFGAIVAKCEAVDIKFLLRDVNDELE

Query:  WKRFEEWVEGDGSVGGKTRSGGLQVGAAGFDGF---------KVNVDTTFCLESGAAGVGVICRDSLGQVSFTTSVLQENVRDADFAEGLAASIGLSLAV
                     +  K R    +V AA    F         K+N D     +    GVGVI R+S G      +    N+ ++++AE L A   L LA+
Subjt:  WKRFEEWVEGDGSVGGKTRSGGLQVGAAGFDGF---------KVNVDTTFCLESGAAGVGVICRDSLGQVSFTTSVLQENVRDADFAEGLAASIGLSLAV

KAG8380966.1 hypothetical protein BUALT_Bualt06G0071400 [Buddleja alternifolia] | 1.5e-06 | 47.46
Query:  PHEARHILSIPLRQVACDDTVIWHFEKSGLYSVKSGYRLGQGPLVVQGP---SSSSSEW
        P +A  ILSIPL + +C D++IWH+ K+GL+SVKS Y + +  +   GP   SSSS++W
Subjt:  PHEARHILSIPLRQVACDDTVIWHFEKSGLYSVKSGYRLGQGPLVVQGP---SSSSSEW

XP_021761446.1 uncharacterized protein LOC110726302 [Chenopodium quinoa] | 8.8e-07 | 23.78
Query:  MMTRFWWCGVEEDRKIHWPT-----------------------------SYLARVLKGLPVSE-------------LSEEGRKSSFLYLEEPYVGGRSCW
        MM RFWW   ++ RKIHW                                 L RV  G  ++              ++ E R+     ++E +     CW
Subjt:  MMTRFWWCGVEEDRKIHWPT-----------------------------SYLARVLKGLPVSE-------------LSEEGRKSSFLYLEEPYVGGRSCW

Query:  EGGFVGGLGMETDPHEARHILSIPLRQVACDDTVIWHFEKSGLYSVKSGYRLGQGPLV-------VQGPSSSSSEWCIGGGMTVGEIMWSAGFGAIVAKC
        +   V  +  E D    R IL+IPL +    D++ W + K G YSVK+ Y LG+   +       V+   +  +      G+ V  + +        +  
Subjt:  EGGFVGGLGMETDPHEARHILSIPLRQVACDDTVIWHFEKSGLYSVKSGYRLGQGPLV-------VQGPSSSSSEWCIGGGMTVGEIMWSAGFGAIVAKC

Query:  EAVDIKFLLRDVNDELEWKRFEEWVEGDGSVGGKTRSGGLQVGAAGFDGFKVNVDTTFCLESGAAGVGVICRDSLGQVSFTTSVLQENVRDADFAEGLAA
        ++ D+  LLR      ++  + E + G+ +  G+     + V     D  KVNVD T   E+G  G+G I RD  G++ F+         +   AE  AA
Subjt:  EAVDIKFLLRDVNDELEWKRFEEWVEGDGSVGGKTRSGGLQVGAAGFDGFKVNVDTTFCLESGAAGVGVICRDSLGQVSFTTSVLQENVRDADFAEGLAA

Query:  SIGLSLA
          G+  A
Subjt:  SIGLSLA

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia] | 8.8e-07 | 45.1
Query:  LKGLPVSELSEEGRKSSFLYLEEPYVGGRSCWEGGFVGGLGMETDPHEARHILSIPLRQVACDDTVIWHFEKSGLYSVKSGYRLG--QGPLVVQGPSSSS
        LK L    L    R SS +  EE   GG   W+G  V     E  P EA+ ILSIP+ + A +D +IW++EK+G+YSV+SGY++     P  VQ PSSSS
Subjt:  LKGLPVSELSEEGRKSSFLYLEEPYVGGRSCWEGGFVGGLGMETDPHEARHILSIPLRQVACDDTVIWHFEKSGLYSVKSGYRLG--QGPLVVQGPSSSS

Query:  SE
        SE
Subjt:  SE

TrEMBL top hits | e value | % identity | Alignment
A0A392M5F5 Ribonuclease H protein (Fragment) | 8.0e-06 | 29.01
Query:  MMTRFWWCGVEEDRKIHWPTSYLARVLK---GLPVSELSEEGRKSSFLYLEEPYVGGRSC-WEGGFVGGLGMETDPHEARHILSIPLRQVACDDTVIWHF
        M+  FWW G   ++ I W +       K   GL    +  +G  S+   +++P++      W G        +    EA+ IL++PL +   +D +IWH 
Subjt:  MMTRFWWCGVEEDRKIHWPTSYLARVLK---GLPVSELSEEGRKSSFLYLEEPYVGGRSC-WEGGFVGGLGMETDPHEARHILSIPLRQVACDDTVIWHF

Query:  EKSGLYSVKSGYRLGQGPLVVQGPSSSSSEW
        EK GLY V+SGYR      + +G   +S  W
Subjt:  EKSGLYSVKSGYRLGQGPLVVQGPSSSSSEW

A0A6J1DAR4 uncharacterized protein LOC111018954 | 4.3e-07 | 45.1
Query:  LKGLPVSELSEEGRKSSFLYLEEPYVGGRSCWEGGFVGGLGMETDPHEARHILSIPLRQVACDDTVIWHFEKSGLYSVKSGYRLG--QGPLVVQGPSSSS
        LK L    L    R SS +  EE   GG   W+G  V     E  P EA+ ILSIP+ + A +D +IW++EK+G+YSV+SGY++     P  VQ PSSSS
Subjt:  LKGLPVSELSEEGRKSSFLYLEEPYVGGRSCWEGGFVGGLGMETDPHEARHILSIPLRQVACDDTVIWHFEKSGLYSVKSGYRLG--QGPLVVQGPSSSS

Query:  SE
        SE
Subjt:  SE

B8AA89 CCHC-type domain-containing protein | 8.0e-06 | 24.78
Query:  SCWEGGFVGGLGMETDPHEARHILSIPLRQVACDDTVIWHFEKSGLYSVKSGYR---LGQGPLVVQGPSSSSSEWCI--------GGGMTVGEIMWSA--
        +CW+   +  +     P +A  +L I L Q   DD + WHFEK G+Y+V+SGYR   + Q        SS++S+W +             +  ++WS+  
Subjt:  SCWEGGFVGGLGMETDPHEARHILSIPLRQVACDDTVIWHFEKSGLYSVKSGYR---LGQGPLVVQGPSSSSSEWCI--------GGGMTVGEIMWSA--

Query:  -------GFGAIVAKCEAVDIKFLLRDVN-------DELEWKRFEEWVEGDGSVGGKTRSGGLQVGAAGFDGFKVNVDTTFCLESGAAGVGVICRDSLGQ
               G G+       + ++ L   ++        +++ K  +   +   +V   + S  ++  A      K+NVD  F  + G AG G++ RD +G 
Subjt:  -------GFGAIVAKCEAVDIKFLLRDVN-------DELEWKRFEEWVEGDGSVGGKTRSGGLQVGAAGFDGFKVNVDTTFCLESGAAGVGVICRDSLGQ

Query:  VSFTTSVLQENVRDADFAEGLAASIGLSLA
        V  T+     + +  + AE  A   GL LA
Subjt:  VSFTTSVLQENVRDADFAEGLAASIGLSLA

B9EXJ8 CCHC-type domain-containing protein | 8.0e-06 | 24.78
Query:  SCWEGGFVGGLGMETDPHEARHILSIPLRQVACDDTVIWHFEKSGLYSVKSGYR---LGQGPLVVQGPSSSSSEWCI--------GGGMTVGEIMWSA--
        +CW+   +  +     P +A  +L I L Q   DD + WHFEK G+Y+V+SGYR   + Q        SS++S+W +             +  ++WS+  
Subjt:  SCWEGGFVGGLGMETDPHEARHILSIPLRQVACDDTVIWHFEKSGLYSVKSGYR---LGQGPLVVQGPSSSSSEWCI--------GGGMTVGEIMWSA--

Query:  -------GFGAIVAKCEAVDIKFLLRDVN-------DELEWKRFEEWVEGDGSVGGKTRSGGLQVGAAGFDGFKVNVDTTFCLESGAAGVGVICRDSLGQ
               G G+       + ++ L   ++        +++ K  +   +   +V   + S  ++  A      K+NVD  F  + G AG G++ RD +G 
Subjt:  -------GFGAIVAKCEAVDIKFLLRDVN-------DELEWKRFEEWVEGDGSVGGKTRSGGLQVGAAGFDGFKVNVDTTFCLESGAAGVGVICRDSLGQ

Query:  VSFTTSVLQENVRDADFAEGLAASIGLSLA
        V  T+     + +  + AE  A   GL LA
Subjt:  VSFTTSVLQENVRDADFAEGLAASIGLSLA

B9RHK6 zf-RVT domain-containing protein | 2.1e-06 | 56.9
Query:  HEARHILSIPLRQVACDDTVIWHFEKSGLYSVKSGYRLGQGPLVVQGPSSSSSEWCIG
        H+A  ILSIP  Q +C D V WHF+K G YSVKS Y LG  P V Q  SSSS   C G
Subjt:  HEARHILSIPLRQVACDDTVIWHFEKSGLYSVKSGYRLGQGPLVVQGPSSSSSEWCIG

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGATGACAAGGTTTTGGTGGTGTGGGGTGGAGGAGGATAGGAAGATTCACTGGCCGACTTCTTACCTGGCCCGTGTCCTGAAAGGGCTTCCCGTTTCGGAGCTTTCGGA
GGAGGGTAGGAAATCGTCCTTCCTTTATCTGGAGGAGCCTTATGTCGGGGGAAGGAGTTGCTGGGAAGGGGGATTCGTTGGAGGATTGGGAATGGAGACAGACCCGCATG
AGGCAAGACACATCCTTTCTATCCCTTTGCGCCAAGTTGCTTGTGATGATACAGTTATTTGGCACTTTGAGAAGTCTGGCCTCTATTCAGTGAAGAGTGGGTACCGTTTA
GGGCAGGGTCCTCTGGTTGTCCAAGGTCCTTCGTCCTCGTCGTCCGAGTGGTGCATAGGTGGTGGAATGACTGTTGGAGAGATCATGTGGTCTGCAGGTTTTGGGGCTAT
TGTGGCCAAGTGCGAGGCAGTCGACATTAAGTTCCTCCTCCGAGATGTGAATGATGAGTTGGAGTGGAAACGGTTTGAGGAGTGGGTGGAGGGGGATGGATCTGTTGGTG
GGAAGACCAGATCGGGTGGTCTGCAGGTGGGCGCCGCCGGATTTGATGGGTTTAAGGTGAATGTGGATACGACGTTTTGCCTGGAGAGCGGAGCGGCGGGTGTAGGCGTG
ATATGTCGCGATTCCTTGGGGCAAGTGAGTTTCACAACTTCTGTTTTGCAGGAGAATGTTAGAGATGCAGATTTTGCAGAAGGGTTGGCTGCTTCCATCGGCTTGAGCCT
TGCGGTGGGACGGCTACCAGACCTTTTGTGCTGGAGACAGATTCTTTGCGGGTGTCCAAGCTGCTGCAGCGTGAGGTGGAGGAAGTGTTGGAGTTGGGGATGCTGGTGGA
TGATGCAACGAGGGGCACTCCTGCGGGGTGGTTTCTCGGAGGTAGCTTTACGTTCAGGGAAGGCCCTAGGTTTTGTGCACCCTTTTCTGCGCCTTTCAGCTTCTTTGCGA
GGCCTTAGTGCCAGTGTAGGGGGAAGATGGGTGGAGGAGTTTATAAGCCTTGCTTGGGCTGCTCTATGCTTGATTGAGATTTATCAGGTTGCAGTATGGGGTGTCCAGGC
AGGCCTGGCGAGTGGTGATGGGGATGACTTGAGGTGA
mRNA sequence
ATGATGACAAGGTTTTGGTGGTGTGGGGTGGAGGAGGATAGGAAGATTCACTGGCCGACTTCTTACCTGGCCCGTGTCCTGAAAGGGCTTCCCGTTTCGGAGCTTTCGGA
GGAGGGTAGGAAATCGTCCTTCCTTTATCTGGAGGAGCCTTATGTCGGGGGAAGGAGTTGCTGGGAAGGGGGATTCGTTGGAGGATTGGGAATGGAGACAGACCCGCATG
AGGCAAGACACATCCTTTCTATCCCTTTGCGCCAAGTTGCTTGTGATGATACAGTTATTTGGCACTTTGAGAAGTCTGGCCTCTATTCAGTGAAGAGTGGGTACCGTTTA
GGGCAGGGTCCTCTGGTTGTCCAAGGTCCTTCGTCCTCGTCGTCCGAGTGGTGCATAGGTGGTGGAATGACTGTTGGAGAGATCATGTGGTCTGCAGGTTTTGGGGCTAT
TGTGGCCAAGTGCGAGGCAGTCGACATTAAGTTCCTCCTCCGAGATGTGAATGATGAGTTGGAGTGGAAACGGTTTGAGGAGTGGGTGGAGGGGGATGGATCTGTTGGTG
GGAAGACCAGATCGGGTGGTCTGCAGGTGGGCGCCGCCGGATTTGATGGGTTTAAGGTGAATGTGGATACGACGTTTTGCCTGGAGAGCGGAGCGGCGGGTGTAGGCGTG
ATATGTCGCGATTCCTTGGGGCAAGTGAGTTTCACAACTTCTGTTTTGCAGGAGAATGTTAGAGATGCAGATTTTGCAGAAGGGTTGGCTGCTTCCATCGGCTTGAGCCT
TGCGGTGGGACGGCTACCAGACCTTTTGTGCTGGAGACAGATTCTTTGCGGGTGTCCAAGCTGCTGCAGCGTGAGGTGGAGGAAGTGTTGGAGTTGGGGATGCTGGTGGA
TGATGCAACGAGGGGCACTCCTGCGGGGTGGTTTCTCGGAGGTAGCTTTACGTTCAGGGAAGGCCCTAGGTTTTGTGCACCCTTTTCTGCGCCTTTCAGCTTCTTTGCGA
GGCCTTAGTGCCAGTGTAGGGGGAAGATGGGTGGAGGAGTTTATAAGCCTTGCTTGGGCTGCTCTATGCTTGATTGAGATTTATCAGGTTGCAGTATGGGGTGTCCAGGC
AGGCCTGGCGAGTGGTGATGGGGATGACTTGAGGTGA
Protein sequence
MMTRFWWCGVEEDRKIHWPTSYLARVLKGLPVSELSEEGRKSSFLYLEEPYVGGRSCWEGGFVGGLGMETDPHEARHILSIPLRQVACDDTVIWHFEKSGLYSVKSGYRL
GQGPLVVQGPSSSSSEWCIGGGMTVGEIMWSAGFGAIVAKCEAVDIKFLLRDVNDELEWKRFEEWVEGDGSVGGKTRSGGLQVGAAGFDGFKVNVDTTFCLESGAAGVGV
ICRDSLGQVSFTTSVLQENVRDADFAEGLAASIGLSLAVGRLPDLLCWRQILCGCPSCCSVRWRKCWSWGCWWMMQRGALLRGGFSEVALRSGKALGFVHPFLRLSASLR
GLSASVGGRWVEEFISLAWAALCLIEIYQVAVWGVQAGLASGDGDDLR