CuGenDBv2

Lag0017757 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0017757
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: zf-RVT domain-containing protein
Genome location: chr5:8321872..8322653
RNA-Seq Expression: Lag0017757
Synteny: Lag0017757
Gene Ontology terms: NA
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain
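The genome location above is a compact `chromosome:start..end` string. A minimal sketch of how it could be split into its parts follows. The function names are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1

chrom, start, end = parse_location("chr5:8321872..8322653")
print(chrom, start, end, span_length(start, end))
```

Under that coordinate assumption the gene spans 782 bp on chr5.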


Homology
GenBank top hits (columns: e value, % identity, alignment)
PKA49974.1 Putative ribonuclease H protein [Apostasia shenzhenica]  e value: 4.7e-33  % identity: 34.74
Query:  VLCGKYFPASSVLNVSYTPSSSFFWKGFVWGMELLKQGIRKNLGNGQSILVFKDPWLPRPFTFKVISPTVPCLENIKVAEFITPS-----------VW--
        VL  KYF     L    +P+SS+ W+  +WG +LLK G+R  +GNG S+ VF DPWLPRP +F+ I+   P L   ++ E +T             +W  
Subjt:  VLCGKYFPASSVLNVSYTPSSSFFWKGFVWGMELLKQGIRKNLGNGQSILVFKDPWLPRPFTFKVISPTVPCLENIKVAEFITPS-----------VW--

Query:  -------------------IWHYDSRGEYSVKSGYKLGMLCPLEASMSTSGMEARWWKKVWKLRIPNKVKIFVWKSFHNSIPSMFNLKSHHVPVNMFCPV
                           IWH+D +G YSVKSGY          + S++G    WWKK+W  +IPNK+K   W++FH+++ +  +L    +  N  CP 
Subjt:  -------------------IWHYDSRGEYSVKSGYKLGMLCPLEASMSTSGMEARWWKKVWKLRIPNKVKIFVWKSFHNSIPSMFNLKSHHVPVNMFCPV

Query:  CHEEIKTTDHALF
        C E+I+   HAL+
Subjt:  CHEEIKTTDHALF

XP_017250619.1 PREDICTED: uncharacterized protein LOC108221234 [Daucus carota subsp. sativus]  e value: 3.7e-30  % identity: 34.29
Query:  VLCGKYFPASSVLNVSYTPSSSFFWKGFVWGMELLKQGIRKNLGNGQSILVFKDPWLPRPFTF-----------KVI------------------SPTVP
        VL  KYF +SS L+       S  W+  VWG  LL +G+R+ +GNGQS   FKDPWL RP +F           KV+                  SP + 
Subjt:  VLCGKYFPASSVLNVSYTPSSSFFWKGFVWGMELLKQGIRKNLGNGQSILVFKDPWLPRPFTF-----------KVI------------------SPTVP

Query:  CLENIKVAEFITPSVWIWHYDSRGEYSVKSGYKLGMLCPLEASMSTSGMEARWWKKVWKLRIPNKVKIFVWKSFHNSIPSMFNLKSHHVPVNMFCPVCHE
         +  I ++ F     W WHY+S+G Y+VKSGYKL      + S S+  +  +WWK  W  +IP K+ IF W+ +H  +P+   L    + ++  CP+C  
Subjt:  CLENIKVAEFITPSVWIWHYDSRGEYSVKSGYKLGMLCPLEASMSTSGMEARWWKKVWKLRIPNKVKIFVWKSFHNSIPSMFNLKSHHVPVNMFCPVCHE

Query:  EIKTTDHALF
           +  HA+F
Subjt:  EIKTTDHALF

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]  e value: 8.6e-35  % identity: 37.56
Query:  VLCGKYFPASSVLNVSYTPSSSFFWKGFVWGMELLKQGIRKNLGNGQSILVFKDPWLPRPFTFKVISPTVPCLENIKVAEFIT-----------------
        VL  KYF  +S+L  S    SS+FWKGF+WG +LL +G+R  +GNG +I  F DPWLPRP TFK +      L+   VA FIT                 
Subjt:  VLCGKYFPASSVLNVSYTPSSSFFWKGFVWGMELLKQGIRKNLGNGQSILVFKDPWLPRPFTFKVISPTVPCLENIKVAEFIT-----------------

Query:  ---------------PSVWIWHYDSRGEYSVKSGYKLGMLCPLEASMSTSGMEARWWKKVWKLRIPNKVKIFVWKSFHNSIPSMFNLKSHHVPVNMFCPV
                          W+WHYD RG YSV+SGYKL M     A+ +++      W  +WKL +P K+KIF+W+S H  IP+  NL    +     C +
Subjt:  ---------------PSVWIWHYDSRGEYSVKSGYKLGMLCPLEASMSTSGMEARWWKKVWKLRIPNKVKIFVWKSFHNSIPSMFNLKSHHVPVNMFCPV

Query:  CHEEIKTTDHALF
        C +  ++  HA F
Subjt:  CHEEIKTTDHALF

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]  e value: 4.1e-29  % identity: 35.21
Query:  VLCGKYFPASSVLNVSYTPSSSFFWKGFVWGMELLKQGIRKNLGNGQSILVFKDPWLPRPFTFKVISPTVPCL-ENIKVAEFI-TPSVW-----------
        VL  +YF  +  +N       SF W+  VWG ++L +G R  +GNGQ++LV+ + W+PRP TFK IS   P +  +  VAE I     W           
Subjt:  VLCGKYFPASSVLNVSYTPSSSFFWKGFVWGMELLKQGIRKNLGNGQSILVFKDPWLPRPFTFKVISPTVPCL-ENIKVAEFI-TPSVW-----------

Query:  --------------------IWHYDSRGEYSVKSGYKLGMLCPLEASMSTSGMEARWWKKVWKLRIPNKVKIFVWKSFHNSIPSMFNLKSHHVPVNMFCP
                            IWHYD +G YSVKSGY++ M        S S  +   W+ +WKL IP KVKIF+W++ H+ +P+  NL    V     C 
Subjt:  --------------------IWHYDSRGEYSVKSGYKLGMLCPLEASMSTSGMEARWWKKVWKLRIPNKVKIFVWKSFHNSIPSMFNLKSHHVPVNMFCP

Query:  VCHEEIKTTDHAL
         CH  ++T  HAL
Subjt:  VCHEEIKTTDHAL

XP_030505522.1 uncharacterized protein LOC115720515 [Cannabis sativa]  e value: 9.2e-29  % identity: 35.55
Query:  VLCGKYFPASSVLNVSYTPSSSFFWKGFVWGMELLKQGIRKNLGNGQSILVFKDPWLPRPFTFKVISPTVPCLE------------NIKVAE--FITPSV
        VL  +YFP  S L  + +   S  W+G VWG ELL +G+R+ +GNG +  VFKDPW+PRP +F  I+  V  L             NI +    F TP  
Subjt:  VLCGKYFPASSVLNVSYTPSSSFFWKGFVWGMELLKQGIRKNLGNGQSILVFKDPWLPRPFTFKVISPTVPCLE------------NIKVAE--FITPSV

Query:  ----------------WIWHYDSRGEYSVKSGYKLGMLCPLEASMSTSGMEARWWKKVWKLRIPNKVKIFVWKSFHNSIPSMFNLKSHHVPVNMFCPVCH
                        W+WH+ S G YSVKSGY L +        S++ + A WW   W ++IP K+  F WK FH  +PS   L       +  CP+C 
Subjt:  ----------------WIWHYDSRGEYSVKSGYKLGMLCPLEASMSTSGMEARWWKKVWKLRIPNKVKIFVWKSFHNSIPSMFNLKSHHVPVNMFCPVCH

Query:  EEIKTTDHALF
            T  HA+F
Subjt:  EEIKTTDHALF

TrEMBL top hits (columns: e value, % identity, alignment)
A0A2I0A354 Ribonuclease H protein  e value: 2.3e-33  % identity: 34.74
Query:  VLCGKYFPASSVLNVSYTPSSSFFWKGFVWGMELLKQGIRKNLGNGQSILVFKDPWLPRPFTFKVISPTVPCLENIKVAEFITPS-----------VW--
        VL  KYF     L    +P+SS+ W+  +WG +LLK G+R  +GNG S+ VF DPWLPRP +F+ I+   P L   ++ E +T             +W  
Subjt:  VLCGKYFPASSVLNVSYTPSSSFFWKGFVWGMELLKQGIRKNLGNGQSILVFKDPWLPRPFTFKVISPTVPCLENIKVAEFITPS-----------VW--

Query:  -------------------IWHYDSRGEYSVKSGYKLGMLCPLEASMSTSGMEARWWKKVWKLRIPNKVKIFVWKSFHNSIPSMFNLKSHHVPVNMFCPV
                           IWH+D +G YSVKSGY          + S++G    WWKK+W  +IPNK+K   W++FH+++ +  +L    +  N  CP 
Subjt:  -------------------IWHYDSRGEYSVKSGYKLGMLCPLEASMSTSGMEARWWKKVWKLRIPNKVKIFVWKSFHNSIPSMFNLKSHHVPVNMFCPV

Query:  CHEEIKTTDHALF
        C E+I+   HAL+
Subjt:  CHEEIKTTDHALF

A0A6J1DX30 uncharacterized protein LOC111024874  e value: 4.1e-35  % identity: 37.56
Query:  VLCGKYFPASSVLNVSYTPSSSFFWKGFVWGMELLKQGIRKNLGNGQSILVFKDPWLPRPFTFKVISPTVPCLENIKVAEFIT-----------------
        VL  KYF  +S+L  S    SS+FWKGF+WG +LL +G+R  +GNG +I  F DPWLPRP TFK +      L+   VA FIT                 
Subjt:  VLCGKYFPASSVLNVSYTPSSSFFWKGFVWGMELLKQGIRKNLGNGQSILVFKDPWLPRPFTFKVISPTVPCLENIKVAEFIT-----------------

Query:  ---------------PSVWIWHYDSRGEYSVKSGYKLGMLCPLEASMSTSGMEARWWKKVWKLRIPNKVKIFVWKSFHNSIPSMFNLKSHHVPVNMFCPV
                          W+WHYD RG YSV+SGYKL M     A+ +++      W  +WKL +P K+KIF+W+S H  IP+  NL    +     C +
Subjt:  ---------------PSVWIWHYDSRGEYSVKSGYKLGMLCPLEASMSTSGMEARWWKKVWKLRIPNKVKIFVWKSFHNSIPSMFNLKSHHVPVNMFCPV

Query:  CHEEIKTTDHALF
        C +  ++  HA F
Subjt:  CHEEIKTTDHALF

A0A803PEK8 Uncharacterized protein  e value: 4.0e-30  % identity: 35.51
Query:  VLCGKYFPASSVLNVSYTPSSSFFWKGFVWGMELLKQGIRKNLGNGQSILVFKDPWLPRPFTFKVISPTVPCLENIKVAE--------------------
        VL   YFP  S+L       +SF W+  +WG +++  G R  +GNG+++ V +DPWL RP TFK+     P  EN+ VA+                    
Subjt:  VLCGKYFPASSVLNVSYTPSSSFFWKGFVWGMELLKQGIRKNLGNGQSILVFKDPWLPRPFTFKVISPTVPCLENIKVAE--------------------

Query:  -----FITPSV-W------IWHYDSRGEYSVKSGYKLGMLCPLEASMSTSGMEARWWKKVWKLRIPNKVKIFVWKSFHNSIPSMFNLKSHHVPVNMFCPV
               TPS  W      +WHY   GEY+VKSGYK+      E   S   +   WWK +W L+IP K+K FVWK  +N IP+  NL    V ++  C  
Subjt:  -----FITPSV-W------IWHYDSRGEYSVKSGYKLGMLCPLEASMSTSGMEARWWKKVWKLRIPNKVKIFVWKSFHNSIPSMFNLKSHHVPVNMFCPV

Query:  CH-EEIKTTDHALF
        C    ++TT HAL+
Subjt:  CH-EEIKTTDHALF

A0A803Q185 Uncharacterized protein  e value: 2.4e-30  % identity: 35.35
Query:  NFVLCGKYFPASSVLNVSYTPSSSFFWKGFVWGMELLKQGIRKNLGNGQSILVFKDPWLPRPFTFKVI----------------------SPTVPCLENI
        N VL   YFP +SVL       +SF W+  +WG +L+ +G R  +G+G +I V +DPWLPRP TFK+                        P +  + N 
Subjt:  NFVLCGKYFPASSVLNVSYTPSSSFFWKGFVWGMELLKQGIRKNLGNGQSILVFKDPWLPRPFTFKVI----------------------SPTVPCLENI

Query:  KVAEFITP---SVW------IWHYDSRGEYSVKSGYKLGMLCPLEASMSTSGMEARWWKKVWKLRIPNKVKIFVWKSFHNSIPSMFNLKSHHVPVNMFCP
          AE I     S W      +WHY   GEYSVKSGY++      E   S   +  +WW+K+W+L+IP K+K+FVWK  H  +P+   L   HV     C 
Subjt:  KVAEFITP---SVW------IWHYDSRGEYSVKSGYKLGMLCPLEASMSTSGMEARWWKKVWKLRIPNKVKIFVWKSFHNSIPSMFNLKSHHVPVNMFCP

Query:  VCH-EEIKTTDHALF
         C     +T  HAL+
Subjt:  VCH-EEIKTTDHALF

A0A803QQT2 Uncharacterized protein  e value: 5.6e-32  % identity: 36.62
Query:  VLCGKYFPASSVLNVSYTPSSSFFWKGFVWGMELLKQGIRKNLGNGQSILVFKDPWLPRPFTFKVI-SPTVPC---LENIKVAE----------------
        VL   YFP   VL      ++SF W+  VWG +L+ +G R  +GNG+S+ V +DPWLPRP TFKV   P++P    + ++K+A+                
Subjt:  VLCGKYFPASSVLNVSYTPSSSFFWKGFVWGMELLKQGIRKNLGNGQSILVFKDPWLPRPFTFKVI-SPTVPC---LENIKVAE----------------

Query:  -----FITPSVW------IWHYDSRGEYSVKSGYKLGMLCPLEASMSTSGMEARWWKKVWKLRIPNKVKIFVWKSFHNSIPSMFNLKSHHVPVNMFCPVC
              I  S W      +WHY   GEYSVKSGY++      E   S      +WWKK+W+L+IP KVK FVWK  HN +P+  NL    +  ++ C  C
Subjt:  -----FITPSVW------IWHYDSRGEYSVKSGYKLGMLCPLEASMSTSGMEARWWKKVWKLRIPNKVKIFVWKSFHNSIPSMFNLKSHHVPVNMFCPVC

Query:  HEEI-KTTDHALF
           + ++  HAL+
Subjt:  HEEI-KTTDHALF

SwissProt top hits (columns: e value, % identity, alignment)
P93295 Uncharacterized mitochondrial protein AtMg00310  e value: 2.8e-04  % identity: 33.33
Query:  VLCGKYFPASSVLNVSYTPSSSFFWKGFVWGMELLKQGIRKNLGNGQSILVFKDPWL
        +L  +YFP SS++  S     S+ W+  + G ELL +G+ + +G+G    V+ D W+
Subjt:  VLCGKYFPASSVLNVSYTPSSSFFWKGFVWGMELLKQGIRKNLGNGQSILVFKDPWL

Arabidopsis top hits (columns: e value, % identity, alignment)
AT3G09510.1 Ribonuclease H-like superfamily protein  e value: 2.8e-15  % identity: 25.47
Query:  KYFPASSVLNVSYTPSSSFFWKGFVWGMELLKQGIRKNLGNGQSILV----FKDPWLPRPF----TFKVIS--------PTVPCLENIKVAEFI------
        +YF   S+L+       S+ W   + G+ LLK+G R  +G+GQ+I +      D   PRP     T+K ++         +    ++ K+++F+      
Subjt:  KYFPASSVLNVSYTPSSSFFWKGFVWGMELLKQGIRKNLGNGQSILV----FKDPWLPRPF----TFKVIS--------PTVPCLENIKVAEFI------

Query:  -----------TPSVWIWHYDSRGEYSVKSGYKLGMLCPLE--ASMSTSGMEARWWKKVWKLRIPNKVKIFVWKSFHNSIPSMFNLKSHHVPVNMFCPVC
                    P   IW+Y++ GEY+V+SGY L    P     +++          ++W L I  K+K F+W++   ++ +   L +  + ++  CP C
Subjt:  -----------TPSVWIWHYDSRGEYSVKSGYKLGMLCPLE--ASMSTSGMEARWWKKVWKLRIPNKVKIFVWKSFHNSIPSMFNLKSHHVPVNMFCPVC

Query:  HEEIKTTDHALF
        H E ++ +HALF
Subjt:  HEEIKTTDHALF

AT3G25270.1 Ribonuclease H-like superfamily protein  e value: 1.3e-04  % identity: 34.55
Query:  KVWKLRIPNKVKIFVWKSFHNSIPSMFNLKSHHVPVNMFCPVCHEEIKTTDHALF
        K+WKL+   K+K F+WK    ++ +  NLK  H+  +  C  C +E +T+ H  F
Subjt:  KVWKLRIPNKVKIFVWKSFHNSIPSMFNLKSHHVPVNMFCPVCHEEIKTTDHALF

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein  e value: 6.8e-06  % identity: 32.26
Query:  MEARWWKKVWKLRIPNKVKIFVWKSFHNSIPSMFNLKSHHVPVNMFCPVCHEEIKTTDHALF
        M   W   +W L+I  K+K+ +WK+ +N++P    L S ++ +  FC  C  + +T  H LF
Subjt:  MEARWWKKVWKLRIPNKVKIFVWKSFHNSIPSMFNLKSHHVPVNMFCPVCHEEIKTTDHALF

AT4G29090.1 Ribonuclease H-like superfamily protein  e value: 1.6e-15  % identity: 26.22
Query:  VLCGKYFPASSVLNVSYTPSSSFFWKGFVWGMELLKQGIRKNLGNGQSILVFKDPWL-PRPFTFKVISPTVPCLENIKVA--------------------
        V   +YF  S  LN       SF WK      E+L+QG R  +GNG+ I++++  WL  +P +  +    VP  E   V+                    
Subjt:  VLCGKYFPASSVLNVSYTPSSSFFWKGFVWGMELLKQGIRKNLGNGQSILVFKDPWL-PRPFTFKVISPTVPCLENIKVA--------------------

Query:  -EFITPSV------------------WIWHYDSRGEYSVKSGYKLGMLCPLEASMSTSGMEARW---WKKVWKLRIPNKVKIFVWKSFHNSIPSMFNLKS
         E + P V                  + W Y S G+Y+VKSGY +      + S      E      ++K+WK +   K++ F+WK   NS+P    L  
Subjt:  -EFITPSV------------------WIWHYDSRGEYSVKSGYKLGMLCPLEASMSTSGMEARW---WKKVWKLRIPNKVKIFVWKSFHNSIPSMFNLKS

Query:  HHVPVNMFCPVCHEEIKTTDHALFR
         H+     C  C    +T +H LF+
Subjt:  HHVPVNMFCPVCHEEIKTTDHALFR

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein  e value: 2.0e-05  % identity: 33.33
Query:  VLCGKYFPASSVLNVSYTPSSSFFWKGFVWGMELLKQGIRKNLGNGQSILVFKDPWL
        +L  +YFP SS++  S     S+ W+  + G ELL +G+ + +G+G    V+ D W+
Subjt:  VLCGKYFPASSVLNVSYTPSSSFFWKGFVWGMELLKQGIRKNLGNGQSILVFKDPWL


Sequences
CDS sequence
ATGGGAGGAAATGGGAAGAACTTTGTGTTATGTGGTAAGTACTTCCCTGCTTCTTCAGTGTTAAATGTATCATATACGCCTTCGTCTTCCTTTTTTTGGAAGGGTTTTGT
ATGGGGAATGGAACTCTTAAAACAAGGAATCCGGAAAAACTTGGGAAATGGACAATCTATCCTTGTGTTTAAAGATCCATGGCTTCCTCGCCCCTTTACTTTTAAGGTGA
TTTCTCCGACTGTTCCATGTTTGGAAAATATCAAGGTCGCGGAGTTTATTACTCCAAGCGTGTGGATTTGGCATTATGATAGTAGGGGTGAATATTCCGTGAAGAGTGGC
TACAAGCTTGGTATGCTGTGCCCGCTAGAAGCATCAATGTCTACAAGTGGAATGGAAGCTAGGTGGTGGAAGAAGGTTTGGAAGTTACGAATTCCCAACAAAGTCAAAAT
TTTTGTGTGGAAATCGTTCCATAATTCTATTCCATCTATGTTCAATTTGAAGAGTCATCATGTTCCAGTCAATATGTTTTGCCCTGTATGTCATGAGGAGATTAAAACAA
CGGATCACGCTCTTTTCCGGTGA
mRNA sequence
ATGGGAGGAAATGGGAAGAACTTTGTGTTATGTGGTAAGTACTTCCCTGCTTCTTCAGTGTTAAATGTATCATATACGCCTTCGTCTTCCTTTTTTTGGAAGGGTTTTGT
ATGGGGAATGGAACTCTTAAAACAAGGAATCCGGAAAAACTTGGGAAATGGACAATCTATCCTTGTGTTTAAAGATCCATGGCTTCCTCGCCCCTTTACTTTTAAGGTGA
TTTCTCCGACTGTTCCATGTTTGGAAAATATCAAGGTCGCGGAGTTTATTACTCCAAGCGTGTGGATTTGGCATTATGATAGTAGGGGTGAATATTCCGTGAAGAGTGGC
TACAAGCTTGGTATGCTGTGCCCGCTAGAAGCATCAATGTCTACAAGTGGAATGGAAGCTAGGTGGTGGAAGAAGGTTTGGAAGTTACGAATTCCCAACAAAGTCAAAAT
TTTTGTGTGGAAATCGTTCCATAATTCTATTCCATCTATGTTCAATTTGAAGAGTCATCATGTTCCAGTCAATATGTTTTGCCCTGTATGTCATGAGGAGATTAAAACAA
CGGATCACGCTCTTTTCCGGTGA
Protein sequence
MGGNGKNFVLCGKYFPASSVLNVSYTPSSSFFWKGFVWGMELLKQGIRKNLGNGQSILVFKDPWLPRPFTFKVISPTVPCLENIKVAEFITPSVWIWHYDSRGEYSVKSG
YKLGMLCPLEASMSTSGMEARWWKKVWKLRIPNKVKIFVWKSFHNSIPSMFNLKSHHVPVNMFCPVCHEEIKTTDHALFR
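
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below is only an illustration: it assumes the standard nuclear genetic code (NCBI translation table 1), and `translate` is a hypothetical helper, not part of CuGenDB. To check the whole record, the wrapped CDS lines would first be joined into a single string.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Codons containing ambiguous bases are reported as 'X'.
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3))

# First seven codons of the CDS above, and its last seven (including the stop).
print(translate("ATGGGAGGAAATGGGAAGAAC"))
print(translate("GATCACGCTCTTTTCCGGTGA"))
```

The two fragments translate to the start (`MGGNGKN`) and end (`DHALFR` followed by a stop) of the protein sequence shown above.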