; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021110 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021110
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionzf-RVT domain-containing protein
Genome locationchr7:4739588..4740271
RNA-Seq ExpressionLag0021110
SyntenyLag0021110
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]1.5e-2238.13Show/hide
Query:  NLTILSDSLGPIDVDVIQGLSICG-STPDKWIWHYDRKGEYSVKNGYKLSMLKSQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLW
        ++T +S S    D D+I  + I   +  D W+WHYD++G YSV++GYKL M        +        W  +WK+ +P+K+K+FIW+S H  IPT  NL 
Subjt:  NLTILSDSLGPIDVDVIQGLSICG-STPDKWIWHYDRKGEYSVKNGYKLSMLKSQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLW

Query:  NHHVPVMGTCHVCHDEMETTDHALFQCSRAREIWALLHP
           +  +  C +C D  E+  HA F C RAR+IW  L P
Subjt:  NHHVPVMGTCHVCHDEMETTDHALFQCSRAREIWALLHP

XP_030479077.1 uncharacterized protein LOC115696311 [Cannabis sativa]8.1e-2438.41Show/hide
Query:  NLTILSDSLGPIDVDVIQGLSIC-GSTPDKWIWHYDRKGEYSVKNGYKLSMLKSQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLW
        +L +L    G ID+D I  LSI   S  D+ IWH+   G Y+VK+GY L++   +    S  + N+ WW R+W +++P KVK+F W+  ++++PT VNL 
Subjt:  NLTILSDSLGPIDVDVIQGLSIC-GSTPDKWIWHYDRKGEYSVKNGYKLSMLKSQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLW

Query:  NHHVPVMGTCHVCHDEMETTDHALFQCSRAREIWALLH
        +  + V+ +C +C+   E++ HALF C+RA+++W+L H
Subjt:  NHHVPVMGTCHVCHDEMETTDHALFQCSRAREIWALLH

XP_030479133.1 uncharacterized protein LOC115696372 [Cannabis sativa]1.4e-2037.31Show/hide
Query:  NLTILSDSLGPIDVDVIQGLSICG-STPDKWIWHYDRKGEYSVKNGYKLSMLKSQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLW
        N+  L       DVD I  + +      D+WIWHY+  G+YSV +GY L+    +E   S       WWK  WK+ +PSKVK+F WK   +SIP   +L+
Subjt:  NLTILSDSLGPIDVDVIQGLSICG-STPDKWIWHYDRKGEYSVKNGYKLSMLKSQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLW

Query:  NHHVPVMGTCHVCHDEMETTDHALFQCSRAREIW
        +  +    TC +C    E+  HALF C  A+E+W
Subjt:  NHHVPVMGTCHVCHDEMETTDHALFQCSRAREIW

XP_030483769.1 uncharacterized protein LOC115700339 [Cannabis sativa]1.1e-2036.03Show/hide
Query:  NLTILSDSLGPIDVDVIQGLSIC-GSTPDKWIWHYDRKGEYSVKNGYKLSMLKSQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLW
        N+ +L     P DVD I  + +   +  D+WIWH+D  G+YSV  GY  +    +    +     N WWK  W   +P+KVK+F W+   +SIP   +L+
Subjt:  NLTILSDSLGPIDVDVIQGLSIC-GSTPDKWIWHYDRKGEYSVKNGYKLSMLKSQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLW

Query:  NHHVPVMGTCHVCHDEMETTDHALFQCSRAREIWAL
        +  V    TC +C    ET  HALF C  A+E+W L
Subjt:  NHHVPVMGTCHVCHDEMETTDHALFQCSRAREIWAL

XP_030505962.1 uncharacterized protein LOC115720894 [Cannabis sativa]2.9e-2134.33Show/hide
Query:  NLTILSDSLGPIDVDVIQGLSICGS-TPDKWIWHYDRKGEYSVKNGYKLSMLKSQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLW
        NL +L+     +D+D I  + +  S   D+WIWH+    EY+V+ GY L+    +    +     + WWK  W +++PSKVK+F+WK+FH +IPT  +L 
Subjt:  NLTILSDSLGPIDVDVIQGLSICGS-TPDKWIWHYDRKGEYSVKNGYKLSMLKSQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLW

Query:  NHHVPVMGTCHVCHDEMETTDHALFQCSRAREIW
        N  +     C +C +  E+  HAL  C  A+++W
Subjt:  NHHVPVMGTCHVCHDEMETTDHALFQCSRAREIW

TrEMBL top hitse value%identityAlignment
A0A6J1DX30 uncharacterized protein LOC1110248747.4e-2338.13Show/hide
Query:  NLTILSDSLGPIDVDVIQGLSICG-STPDKWIWHYDRKGEYSVKNGYKLSMLKSQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLW
        ++T +S S    D D+I  + I   +  D W+WHYD++G YSV++GYKL M        +        W  +WK+ +P+K+K+FIW+S H  IPT  NL 
Subjt:  NLTILSDSLGPIDVDVIQGLSICG-STPDKWIWHYDRKGEYSVKNGYKLSMLKSQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLW

Query:  NHHVPVMGTCHVCHDEMETTDHALFQCSRAREIWALLHP
           +  +  C +C D  E+  HA F C RAR+IW  L P
Subjt:  NHHVPVMGTCHVCHDEMETTDHALFQCSRAREIWALLHP

A0A803NI87 Uncharacterized protein1.4e-2134.33Show/hide
Query:  NLTILSDSLGPIDVDVIQGLSICGS-TPDKWIWHYDRKGEYSVKNGYKLSMLKSQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLW
        NL +L+     +D+D I  + +  S   D+WIWH+    EY+V+ GY L+    +    +     + WWK  W +++PSKVK+F+WK+FH +IPT  +L 
Subjt:  NLTILSDSLGPIDVDVIQGLSICGS-TPDKWIWHYDRKGEYSVKNGYKLSMLKSQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLW

Query:  NHHVPVMGTCHVCHDEMETTDHALFQCSRAREIW
        N  +     C +C +  E+  HAL  C  A+++W
Subjt:  NHHVPVMGTCHVCHDEMETTDHALFQCSRAREIW

A0A803P8W5 Uncharacterized protein1.1e-2138.69Show/hide
Query:  NLTILSDSLGPIDVDVIQGLSIC-GSTPDKWIWHYDRKGEYSVKNGYKLSMLKSQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLW
        +L++L    G ID+D I  +SI      D+ IWH+   G Y+VK+GY L+    +    S    N  WWKR+W +++P KVK F+W+  ++++PT VNL 
Subjt:  NLTILSDSLGPIDVDVIQGLSIC-GSTPDKWIWHYDRKGEYSVKNGYKLSMLKSQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLW

Query:  NHHVPVMGTCHVCHDEMETTDHALFQCSRAREIWALL
        +  V     C +C+   ET  HALF C+RA  +WA L
Subjt:  NHHVPVMGTCHVCHDEMETTDHALFQCSRAREIWALL

A0A803Q105 Uncharacterized protein5.3e-2136.03Show/hide
Query:  NLTILSDSLGPIDVDVIQGLSIC-GSTPDKWIWHYDRKGEYSVKNGYKLSMLKSQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLW
        N+ +L     P DVD I  + +   +  D+WIWH+D  G+YSV  GY  +    +    +     N WWK  W   +P+KVK+F W+   +SIP   +L+
Subjt:  NLTILSDSLGPIDVDVIQGLSIC-GSTPDKWIWHYDRKGEYSVKNGYKLSMLKSQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLW

Query:  NHHVPVMGTCHVCHDEMETTDHALFQCSRAREIWAL
        +  V    TC +C    ET  HALF C  A+E+W L
Subjt:  NHHVPVMGTCHVCHDEMETTDHALFQCSRAREIWAL

A0A803QG02 Uncharacterized protein3.9e-2438.41Show/hide
Query:  NLTILSDSLGPIDVDVIQGLSIC-GSTPDKWIWHYDRKGEYSVKNGYKLSMLKSQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLW
        +L +L    G ID+D I  LSI   S  D+ IWH+   G Y+VK+GY L++   +    S  + N+ WW R+W +++P KVK+F W+  ++++PT VNL 
Subjt:  NLTILSDSLGPIDVDVIQGLSIC-GSTPDKWIWHYDRKGEYSVKNGYKLSMLKSQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLW

Query:  NHHVPVMGTCHVCHDEMETTDHALFQCSRAREIWALLH
        +  + V+ +C +C+   E++ HALF C+RA+++W+L H
Subjt:  NHHVPVMGTCHVCHDEMETTDHALFQCSRAREIWALLH

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657502.9e-0823.58Show/hide
Query:  LSICGSTPDKWIWHYDRKGEYSVKNGYKLSMLKSQEVFLSDVEKNNF--WWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNHHVPVMGTCHVCHDEME
        L +     D+  W + + G++SV++ Y++       + + +V + N   ++  +WK+R+P +VK F+W   + ++ T       H+     C VC   +E
Subjt:  LSICGSTPDKWIWHYDRKGEYSVKNGYKLSMLKSQEVFLSDVEKNNF--WWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNHHVPVMGTCHVCHDEME

Query:  TTDHALFQCSRAREIWALLHPPK
        +  H L  C     IW  + P +
Subjt:  TTDHALFQCSRAREIWALLHPPK

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein9.9e-1233.33Show/hide
Query:  PDKWIWHYDRKGEYSVKNGY-KLSMLKSQEVFLSDVEKNNFWWK-RVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNHHVPVMGTCHVCHDEMETTDHALF
        PDK IW+Y+  GEY+V++GY  L+   S  +   +    +   K R+W + I  K+K F+W++   ++ T   L    + +  +C  CH E E+ +HALF
Subjt:  PDKWIWHYDRKGEYSVKNGY-KLSMLKSQEVFLSDVEKNNFWWK-RVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNHHVPVMGTCHVCHDEMETTDHALF

Query:  QCSRAREIWAL
         C  A   W L
Subjt:  QCSRAREIWAL

AT3G25270.1 Ribonuclease H-like superfamily protein3.0e-0834.38Show/hide
Query:  RVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNHHVPVMGTCHVCHDEMETTDHALFQCSRAREIW
        ++WK++   K+K F+WK    ++ T  NL   H+     CH C  E ET+ H  F C  A+++W
Subjt:  RVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNHHVPVMGTCHVCHDEMETTDHALFQCSRAREIW

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.3e-0633.33Show/hide
Query:  NFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNHHVPVMGTCHVCHDEMETTDHALFQCSRAR
        N W   +W ++I  K+K+ IWK+ +N++P    L + ++ +   C  C D  ET  H LF C  A+
Subjt:  NFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNHHVPVMGTCHVCHDEMETTDHALFQCSRAR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACACGGATGCATCTATTATGGGGAATCAGCAAACATCTGGTATTGGAGTGGTTTTGCGAGATAAAGGGGGCTTGCTTAAGGCGGTGCAGAATTCATCATCTTTGGT
GACCAACTCTCCTTTGGAAGCGGAAGCAGTGGCCGTTCTTGAAGGCTTGCGACTGGCTAGGGAATTGGATGTGTGTAACCTAACTATTTTGTCCGACTCCTTGGGTCCAA
TCGATGTGGACGTTATTCAAGGCTTATCGATCTGTGGTTCGACGCCTGATAAATGGATATGGCATTATGATAGAAAAGGGGAGTATTCTGTTAAAAATGGATACAAGCTC
TCGATGCTAAAGAGCCAAGAGGTATTTCTATCAGACGTGGAAAAAAATAATTTCTGGTGGAAGAGAGTGTGGAAGATGAGAATTCCTAGTAAAGTTAAAGTATTCATTTG
GAAATCATTCCACAACTCAATCCCAACCATGGTAAACCTATGGAACCATCATGTGCCAGTTATGGGAACCTGTCATGTATGCCATGATGAGATGGAGACTACAGATCATG
CTTTGTTTCAGTGTTCGAGGGCTCGAGAGATATGGGCCCTTCTTCACCCGCCCAAGATGAGAGTGCTTTGGGATCCTATGGACATTAAAGACCGATGGCATGGCGTCTCT
GAAGAACCCATGCAGATTTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCACACGGATGCATCTATTATGGGGAATCAGCAAACATCTGGTATTGGAGTGGTTTTGCGAGATAAAGGGGGCTTGCTTAAGGCGGTGCAGAATTCATCATCTTTGGT
GACCAACTCTCCTTTGGAAGCGGAAGCAGTGGCCGTTCTTGAAGGCTTGCGACTGGCTAGGGAATTGGATGTGTGTAACCTAACTATTTTGTCCGACTCCTTGGGTCCAA
TCGATGTGGACGTTATTCAAGGCTTATCGATCTGTGGTTCGACGCCTGATAAATGGATATGGCATTATGATAGAAAAGGGGAGTATTCTGTTAAAAATGGATACAAGCTC
TCGATGCTAAAGAGCCAAGAGGTATTTCTATCAGACGTGGAAAAAAATAATTTCTGGTGGAAGAGAGTGTGGAAGATGAGAATTCCTAGTAAAGTTAAAGTATTCATTTG
GAAATCATTCCACAACTCAATCCCAACCATGGTAAACCTATGGAACCATCATGTGCCAGTTATGGGAACCTGTCATGTATGCCATGATGAGATGGAGACTACAGATCATG
CTTTGTTTCAGTGTTCGAGGGCTCGAGAGATATGGGCCCTTCTTCACCCGCCCAAGATGAGAGTGCTTTGGGATCCTATGGACATTAAAGACCGATGGCATGGCGTCTCT
GAAGAACCCATGCAGATTTTTTAG
Protein sequenceShow/hide protein sequence
MHTDASIMGNQQTSGIGVVLRDKGGLLKAVQNSSSLVTNSPLEAEAVAVLEGLRLARELDVCNLTILSDSLGPIDVDVIQGLSICGSTPDKWIWHYDRKGEYSVKNGYKL
SMLKSQEVFLSDVEKNNFWWKRVWKMRIPSKVKVFIWKSFHNSIPTMVNLWNHHVPVMGTCHVCHDEMETTDHALFQCSRAREIWALLHPPKMRVLWDPMDIKDRWHGVS
EEPMQIF