; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036167 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036167
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionzf-RVT domain-containing protein
Genome locationchr3:40882729..40883578
RNA-Seq ExpressionLag0036167
SyntenyLag0036167
Gene Ontology termsNA
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_023896927.1 uncharacterized protein LOC112008817 [Quercus suber]4.0e-2427.2Show/hide
Query:  GKRVAELI-QEDVNWKEDLIRNSFIRSDAEIILNMPKRNLSKEDEIIWGKDSKGLFSVKSAYRTTILMDNRNSASGSDFSSSKKLWRFIWRLDCWPKAKI
        G  V ++I Q +  WK  LI   F+  +A II  +P  + +  D+ +W   S G+++ +SAY+   L +     + S   SS  +WR IW L    K + 
Subjt:  GKRVAELI-QEDVNWKEDLIRNSFIRSDAEIILNMPKRNLSKEDEIIWGKDSKGLFSVKSAYRTTILMDNRNSASGSDFSSSKKLWRFIWRLDCWPKAKI

Query:  NLWNAMNDALPTTENIKKRRIDSNDLCFLCRKKPEDVEHLFWNC--------------------------------KILDLQEARTACNIIWAIWNRRNQ
         +W A N+ALPT  N+++RR+     C  CR   ED  H  W C                                 + D  +    C I+W IWNRRN 
Subjt:  NLWNAMNDALPTTENIKKRRIDSNDLCFLCRKKPEDVEHLFWNC--------------------------------KILDLQEARTACNIIWAIWNRRNQ

Query:  LKNSNSRADQVQFSVDIICSVIRIQGEKTYQSSSPEIQLSHGVWKPPYSLLWKMNVDAAWF
         +  +S  +           ++  Q  +  +     +  S G W+PP   L+K+N D A F
Subjt:  LKNSNSRADQVQFSVDIICSVIRIQGEKTYQSSSPEIQLSHGVWKPPYSLLWKMNVDAAWF

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]6.8e-2430.77Show/hide
Query:  WKPIRVNENLKGKRVAELIQEDVNWKEDLIRNSFIRSDAEIILNMPKRNLSKEDEIIWGKDSKGLFSVKSAYRTTILMDNRNSASGSDFSSSKKLWRFIW
        +KPI          VAELI E   W+EDLI   F   DAE I+ +P     KED++IW  D KG +SVKS Y+  + +      S S  +  + LWRFIW
Subjt:  WKPIRVNENLKGKRVAELIQEDVNWKEDLIRNSFIRSDAEIILNMPKRNLSKEDEIIWGKDSKGLFSVKSAYRTTILMDNRNSASGSDFSSSKKLWRFIW

Query:  RLDCWPKAKINLWNAMNDALPTTENIKKRRIDSNDLCFLCRKKPEDVEH----------------------------LFWNCKILDLQEART----ACNI
        +L    K KI LW A +D LPT EN+ K+++    +C  C    E V H                            + W  +    Q A+        +
Subjt:  RLDCWPKAKINLWNAMNDALPTTENIKKRRIDSNDLCFLCRKKPEDVEH----------------------------LFWNCKILDLQEART----ACNI

Query:  IWAIWNRRNQLKNSNSRADQVQFSVD---IICSVIRI-QGEKTYQSSSPEIQLSHGVWKPPYSLLWKMNVDAA
        +WAIW  RN+      + + ++   +   I+ S  +I Q E  Y++     +     W PP +   K+NVDAA
Subjt:  IWAIWNRRNQLKNSNSRADQVQFSVD---IICSVIRI-QGEKTYQSSSPEIQLSHGVWKPPYSLLWKMNVDAA

XP_030923017.1 uncharacterized protein LOC115949892 [Quercus lobata]2.3e-2427.59Show/hide
Query:  GKRVAELIQEDVN-WKEDLIRNSFIRSDAEIILNMPKRNLSKEDEIIWGKDSKGLFSVKSAYRTTILMDNRNSASGSDFSSSKKLWRFIWRLDCWPKAKI
        G  V +LI +D + WK DLI   F+  +A+II  +P  + +  D+ +W   + G+++ +SAY+   L D  N  + S    S   W+ IW L    K + 
Subjt:  GKRVAELIQEDVN-WKEDLIRNSFIRSDAEIILNMPKRNLSKEDEIIWGKDSKGLFSVKSAYRTTILMDNRNSASGSDFSSSKKLWRFIWRLDCWPKAKI

Query:  NLWNAMNDALPTTENIKKRRIDSNDLCFLCRKKPEDVEHLFWNCKIL--------------------------------DLQEARTACNIIWAIWNRRNQ
         +W A N++LPT  N++KRR+     C  C+   ED  H  W C  L                                D  + +  C I+W +WNRRN 
Subjt:  NLWNAMNDALPTTENIKKRRIDSNDLCFLCRKKPEDVEHLFWNCKIL--------------------------------DLQEARTACNIIWAIWNRRNQ

Query:  LKNSNSRADQVQFSVDIICSVIRIQGEKTYQSSSPEIQLSHGVWKPPYSLLWKMNVDAAWF
        ++  +S  +           ++  Q  +      P    S   W+PP S L+K+N D A F
Subjt:  LKNSNSRADQVQFSVDIICSVIRIQGEKTYQSSSPEIQLSHGVWKPPYSLLWKMNVDAAWF

XP_030924745.1 uncharacterized protein LOC115951731 [Quercus lobata]6.8e-2427.21Show/hide
Query:  DGNWKPIRVNENL--------KGKRVAELIQEDV-NWKEDLIRNSFIRSDAEIILNMPKRNLSKEDEIIWGKDSKGLFSVKSAYRTTILMDNRNSASGSD
        +  W P++ N ++         G +V+ LI  D+  W   L+   F+   A +I  MP       D IIW     G+F+ KSAY+  +  +  N A  S 
Subjt:  DGNWKPIRVNENL--------KGKRVAELIQEDV-NWKEDLIRNSFIRSDAEIILNMPKRNLSKEDEIIWGKDSKGLFSVKSAYRTTILMDNRNSASGSD

Query:  FSSSKKLWRFIWRLDCWPKAKINLWNAMNDALPTTENIKKRRIDSNDLCFLCRKKPEDVEHLFWNC-----------------------------KILDL
          + ++ WR +W+L    K K   W A NDALPT  N+ +R I +N++C  C+ +PED  H  W+C                             K + +
Subjt:  FSSSKKLWRFIWRLDCWPKAKINLWNAMNDALPTTENIKKRRIDSNDLCFLCRKKPEDVEHLFWNC-----------------------------KILDL

Query:  QE---ARTACNIIWAIWNRRNQLKNSNSRADQVQFSVDIICSVIRIQGEKTYQSSSPEIQLSHGVWKPPYSLLWKMNVDAAWF
        +E   +     I W +WNRRN L+  N     +     +  S ++   +   Q        S   W+PP    +K N DAA F
Subjt:  QE---ARTACNIIWAIWNRRNQLKNSNSRADQVQFSVDIICSVIRIQGEKTYQSSSPEIQLSHGVWKPPYSLLWKMNVDAAWF

XP_042958109.1 uncharacterized protein LOC122293655 [Carya illinoinensis]3.0e-2428.63Show/hide
Query:  VAELIQ-EDVNWKEDLIRNSFIRSDAEIILNMPKRNLSKEDEIIWGKDSKGLFSVKSAYRTTILMDNRNSASGSDFSSSKKLWRFIWRLDCWPKAKINLW
        V+ELI+ E   WK++L++  F + +A+ I ++P      +D++IWG   KG F+VKSAY   + + N+N    S    SK LW  +WRL    K KI LW
Subjt:  VAELIQ-EDVNWKEDLIRNSFIRSDAEIILNMPKRNLSKEDEIIWGKDSKGLFSVKSAYRTTILMDNRNSASGSDFSSSKKLWRFIWRLDCWPKAKINLW

Query:  NAMNDALPTTENIKKRRIDSNDLCFLCRKKPEDVEHLFWNCKI--------------------------------LDLQEARTACNIIWAIWNRRN----
         A+N  LPT   + KR++  N  C +C ++ E   H+ W+C                                  L+  E      + + +W RRN    
Subjt:  NAMNDALPTTENIKKRRIDSNDLCFLCRKKPEDVEHLFWNCKI--------------------------------LDLQEARTACNIIWAIWNRRN----

Query:  --QLKNSNSRADQVQFSVDIICSVIRIQGEKTYQSSSPEIQLSHGVWKPPYSLLWKMNVDAA
          + K       Q     + + S      EK  +  +  ++    VWKPP     K+NVDAA
Subjt:  --QLKNSNSRADQVQFSVDIICSVIRIQGEKTYQSSSPEIQLSHGVWKPPYSLLWKMNVDAA

TrEMBL top hitse value%identityAlignment
A0A1S8ACU2 Ribonuclease H-like superfamily protein1.9e-2432.56Show/hide
Query:  VAELIQEDVNWKEDLIRNSFIRSDAEIILNMPKRNLSKEDEIIWGKDSKGLFSVKSAYRTTILMDNRNSASGSDFSSSKKLWRFIWRLDCWPKAKINLWN
        VAELI E+  WKE LI+  F   DAE+I  +      K D+I+W  D KG +SVKS Y+  I M  ++ A  S  SS+   W  IW L+   K KI +W 
Subjt:  VAELIQEDVNWKEDLIRNSFIRSDAEIILNMPKRNLSKEDEIIWGKDSKGLFSVKSAYRTTILMDNRNSASGSDFSSSKKLWRFIWRLDCWPKAKINLWN

Query:  AMNDALPTTENIKKRRIDSNDLCFLCRKKPEDVEHLFWNCK----------------ILDLQ----------------EARTACNIIWAIWNRRNQLKNS
        A  + LPT+EN+ +R++    +C  C+ K ED+ H    CK                +LD Q                E R    + W  W+ RN     
Subjt:  AMNDALPTTENIKKRRIDSNDLCFLCRKKPEDVEHLFWNCK----------------ILDLQ----------------EARTACNIIWAIWNRRNQLKNS

Query:  NSRAD---QVQFSVDIICSVIRIQGEKTYQSSSPEIQLSHGVWKPPYSLLWKMNVDAA
        N R D    V  +  ++ S  R++  K  Q+++  I      W PP    +K NVDAA
Subjt:  NSRAD---QVQFSVDIICSVIRIQGEKTYQSSSPEIQLSHGVWKPPYSLLWKMNVDAA

A0A2N9ED81 Uncharacterized protein1.2e-2326.64Show/hide
Query:  DGNWKPIRVNENLKGKR--------VAELIQEDVN-WKEDLIRNSFIRSDAEIILNMPKRNLSKEDEIIWGKDSKGLFSVKSAYRTTILMDNRNSASGSD
        + +W PIR + ++   R        V+ LI+E  N WKE +I++SF+ ++AE IL +P  ++  +D  +W     G++SV+S Y   +   +R+    SD
Subjt:  DGNWKPIRVNENLKGKR--------VAELIQEDVN-WKEDLIRNSFIRSDAEIILNMPKRNLSKEDEIIWGKDSKGLFSVKSAYRTTILMDNRNSASGSD

Query:  FSSSKKLWRFIWRLDCWPKAKINLWNAMNDALPTTENIKKRRIDSNDLCFLCRKKPEDVEHLFWNCK--------------------------------I
         S   ++W  IW L    K +  LW A ++ALPT  N+  R +  +  C  C +  E V H  W CK                                 
Subjt:  FSSSKKLWRFIWRLDCWPKAKINLWNAMNDALPTTENIKKRRIDSNDLCFLCRKKPEDVEHLFWNCK--------------------------------I

Query:  LDLQEARTACNIIWAIWNRRNQ--LKNSNSRADQVQFSVDIICS---VIRIQGEKTYQSSSPEIQLSHGVWKPPYSLLWKMNVDAAWFD
        L   E +      W IW++RN+  L       +Q+        S     ++    T    +P I++   VWKPP    +K N D A+F+
Subjt:  LDLQEARTACNIIWAIWNRRNQ--LKNSNSRADQVQFSVDIICS---VIRIQGEKTYQSSSPEIQLSHGVWKPPYSLLWKMNVDAAWFD

A0A2N9FJ03 Reverse transcriptase domain-containing protein2.8e-2329.96Show/hide
Query:  LIQEDVNWKEDLIRNSFIRSDAEIILNMPKRNLSKEDEIIWGKDSKGLFSVKSAYRTTILMDNRNSASGSDFSSSKKLWRFIWRLDCWPKAKINLWNAMN
        ++Q  + W   LI   F+  DAE I N+P  + +  D+  W   + G +SVKS Y+  I ++N+     S  +    +W+ +W L    K +   W A  
Subjt:  LIQEDVNWKEDLIRNSFIRSDAEIILNMPKRNLSKEDEIIWGKDSKGLFSVKSAYRTTILMDNRNSASGSDFSSSKKLWRFIWRLDCWPKAKINLWNAMN

Query:  DALPTTENIKKRRIDSNDLCFLCRKKPEDVEHLFWNC-----------------------------KILDL---QEARTACNIIWAIWNRRNQLK-NSNS
        DALPT  N++KR I  + +C  CR  PEDV H  WNC                             K+L+    Q +     I W +WNRRN+L+ N   
Subjt:  DALPTTENIKKRRIDSNDLCFLCRKKPEDVEHLFWNC-----------------------------KILDL---QEARTACNIIWAIWNRRNQLK-NSNS

Query:  RA-DQVQFSVDIICSVIRIQGEK-TYQSSSPEIQLSHGVWKPPYSLLWKMNVDAAWF
         A D V               EK   + S+PE+ +    W+PP  L +K N D A F
Subjt:  RA-DQVQFSVDIICSVIRIQGEK-TYQSSSPEIQLSHGVWKPPYSLLWKMNVDAAWF

A0A2N9IJM4 RNase H domain-containing protein1.2e-2326.64Show/hide
Query:  DGNWKPIRVNENLKGKR--------VAELIQEDVN-WKEDLIRNSFIRSDAEIILNMPKRNLSKEDEIIWGKDSKGLFSVKSAYRTTILMDNRNSASGSD
        + +W PIR + ++   R        V+ LI+E  N WKE +I++SF+ ++AE IL +P  ++  +D  +W     G++SV+S Y   +   +R+    SD
Subjt:  DGNWKPIRVNENLKGKR--------VAELIQEDVN-WKEDLIRNSFIRSDAEIILNMPKRNLSKEDEIIWGKDSKGLFSVKSAYRTTILMDNRNSASGSD

Query:  FSSSKKLWRFIWRLDCWPKAKINLWNAMNDALPTTENIKKRRIDSNDLCFLCRKKPEDVEHLFWNCK--------------------------------I
         S   ++W  IW L    K +  LW A ++ALPT  N+  R +  +  C  C +  E V H  W CK                                 
Subjt:  FSSSKKLWRFIWRLDCWPKAKINLWNAMNDALPTTENIKKRRIDSNDLCFLCRKKPEDVEHLFWNCK--------------------------------I

Query:  LDLQEARTACNIIWAIWNRRNQ--LKNSNSRADQVQFSVDIICS---VIRIQGEKTYQSSSPEIQLSHGVWKPPYSLLWKMNVDAAWFD
        L   E +      W IW++RN+  L       +Q+        S     ++    T    +P I++   VWKPP    +K N D A+F+
Subjt:  LDLQEARTACNIIWAIWNRRNQ--LKNSNSRADQVQFSVDIICS---VIRIQGEKTYQSSSPEIQLSHGVWKPPYSLLWKMNVDAAWFD

A0A7N2LEC9 zf-RVT domain-containing protein3.3e-2428.96Show/hide
Query:  RVAELIQEDVN-WKEDLIRNSFIRSDAEIILNMPKRNLSKEDEIIWGKDSKGLFSVKSAYRTTILMDNRNSASGSDFSSSKKLWRFIWRLDCWPKAKINL
        RV  LI  ++  WK D++   F+  +A +IL +P       D+I WG    G+FS KSAY+  + +DN      S      K W+ +W L    K K   
Subjt:  RVAELIQEDVN-WKEDLIRNSFIRSDAEIILNMPKRNLSKEDEIIWGKDSKGLFSVKSAYRTTILMDNRNSASGSDFSSSKKLWRFIWRLDCWPKAKINL

Query:  WNAMNDALPTTENIKKRRIDSNDLCFLCRKKPEDVEHLFWNCKILD------------LQEARTACN--------------------IIWAIWNRRNQLK
        W A N+ALPT  N+++R I ++DLC  C  +PED  H  W C  L+             Q + ++ N                    I W IWNRRN LK
Subjt:  WNAMNDALPTTENIKKRRIDSNDLCFLCRKKPEDVEHLFWNCKILD------------LQEARTACN--------------------IIWAIWNRRNQLK

Query:  NSNSRADQVQFSVDIICSVIRIQGEKTYQSSSPEIQLSHGVWKPPYSLLWKMNVDAAWF
                          +      +   +  P + +S   W PP    +K N D A F
Subjt:  NSNSRADQVQFSVDIICSVIRIQGEKTYQSSSPEIQLSHGVWKPPYSLLWKMNVDAAWF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein4.1e-1124.86Show/hide
Query:  KPIRVNENLKGKRVAELIQEDVN---WKEDLIRNSFIRSDAEIILNMPKRNLSKEDEIIWGKDSKGLFSVKSAYRTTILMDNRNSASGSDFSSSKKLWRF
        +P+   E  K   +  L +   +   W +  I     +SD   I  +      K D+IIW  ++ G ++V+S Y       + N  + +    S  L   
Subjt:  KPIRVNENLKGKRVAELIQEDVN---WKEDLIRNSFIRSDAEIILNMPKRNLSKEDEIIWGKDSKGLFSVKSAYRTTILMDNRNSASGSDFSSSKKLWRF

Query:  IWRLDCWPKAKINLWNAMNDALPTTENIKKRRIDSNDLCFLCRKKPEDVEHLFWNCKILDLQEARTACNIIWAIWNRRNQLKNSN
        IW L   PK K  LW A++ AL TTE +  R +  +  C  C ++ E + H  + C    +    +  ++I      RNQL +++
Subjt:  IWRLDCWPKAKINLWNAMNDALPTTENIKKRRIDSNDLCFLCRKKPEDVEHLFWNCKILDLQEARTACNIIWAIWNRRNQLKNSN

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein6.8e-0633.77Show/hide
Query:  IWRLDCWPKAKINLWNAMNDALPTTENIKKRRIDSNDLCFLCRKKPEDVEHLFWNC----------KILDLQEARTA
        IW L   PK K+ +W A+N+ALP    +  R I     C  CR   E + H+ +NC           I+D +E +TA
Subjt:  IWRLDCWPKAKINLWNAMNDALPTTENIKKRRIDSNDLCFLCRKKPEDVEHLFWNC----------KILDLQEARTA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGACGATGGGAATTGGAAACCTATTAGAGTCAATGAGAACCTTAAAGGTAAAAGAGTGGCGGAGCTCATTCAAGAGGACGTGAATTGGAAAGAAGATCTGATAAG
GAACTCTTTCATTCGGAGTGATGCGGAGATTATCCTTAACATGCCCAAGAGAAACCTTTCTAAAGAAGACGAGATTATATGGGGAAAGGATTCTAAAGGCTTATTTTCGG
TTAAAAGTGCTTATCGCACGACAATCCTCATGGATAATCGAAATTCAGCCTCGGGATCAGATTTTTCTTCTTCTAAAAAGCTTTGGAGGTTTATTTGGCGGCTGGATTGC
TGGCCAAAAGCCAAAATTAACCTCTGGAATGCGATGAATGATGCTTTACCCACGACTGAGAATATTAAAAAAAGAAGAATTGATTCTAACGATCTGTGTTTCTTGTGCAG
GAAAAAGCCTGAGGATGTGGAGCATTTGTTCTGGAATTGCAAGATCCTAGACCTTCAGGAAGCGAGGACGGCGTGTAACATTATTTGGGCAATTTGGAACAGAAGGAATC
AATTAAAGAATTCCAACAGCAGAGCAGATCAAGTGCAATTCAGTGTAGATATTATATGTTCGGTTATTAGGATTCAAGGGGAGAAGACGTACCAGTCAAGTTCGCCGGAG
ATCCAACTGAGTCACGGTGTCTGGAAGCCTCCCTACAGTCTTCTGTGGAAGATGAATGTGGACGCCGCCTGGTTTGACGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGATGGACGATGGGAATTGGAAACCTATTAGAGTCAATGAGAACCTTAAAGGTAAAAGAGTGGCGGAGCTCATTCAAGAGGACGTGAATTGGAAAGAAGATCTGATAAG
GAACTCTTTCATTCGGAGTGATGCGGAGATTATCCTTAACATGCCCAAGAGAAACCTTTCTAAAGAAGACGAGATTATATGGGGAAAGGATTCTAAAGGCTTATTTTCGG
TTAAAAGTGCTTATCGCACGACAATCCTCATGGATAATCGAAATTCAGCCTCGGGATCAGATTTTTCTTCTTCTAAAAAGCTTTGGAGGTTTATTTGGCGGCTGGATTGC
TGGCCAAAAGCCAAAATTAACCTCTGGAATGCGATGAATGATGCTTTACCCACGACTGAGAATATTAAAAAAAGAAGAATTGATTCTAACGATCTGTGTTTCTTGTGCAG
GAAAAAGCCTGAGGATGTGGAGCATTTGTTCTGGAATTGCAAGATCCTAGACCTTCAGGAAGCGAGGACGGCGTGTAACATTATTTGGGCAATTTGGAACAGAAGGAATC
AATTAAAGAATTCCAACAGCAGAGCAGATCAAGTGCAATTCAGTGTAGATATTATATGTTCGGTTATTAGGATTCAAGGGGAGAAGACGTACCAGTCAAGTTCGCCGGAG
ATCCAACTGAGTCACGGTGTCTGGAAGCCTCCCTACAGTCTTCTGTGGAAGATGAATGTGGACGCCGCCTGGTTTGACGTTTAA
Protein sequenceShow/hide protein sequence
MMDDGNWKPIRVNENLKGKRVAELIQEDVNWKEDLIRNSFIRSDAEIILNMPKRNLSKEDEIIWGKDSKGLFSVKSAYRTTILMDNRNSASGSDFSSSKKLWRFIWRLDC
WPKAKINLWNAMNDALPTTENIKKRRIDSNDLCFLCRKKPEDVEHLFWNCKILDLQEARTACNIIWAIWNRRNQLKNSNSRADQVQFSVDIICSVIRIQGEKTYQSSSPE
IQLSHGVWKPPYSLLWKMNVDAAWFDV