; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035026 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035026
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr3:13774459..13775457
RNA-Seq ExpressionLag0035026
SyntenyLag0035026
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4309574.1 unnamed protein product [Prunus armeniaca]4.5e-0927.31Show/hide
Query:  MWNLCIPSKIKYFIWKCFHENLPTMLALPRKHIATTNMCNLCYKKIETVDHALFHC-----IFVNDDAAAV------SKRVGGIVGATKLPIVWCCVTAR
        +W L IP+KIK+F+W+C  + LP    L ++ I  T++C+ C++K+E+V HA++ C     ++ N    AV      +K       A  LP +       
Subjt:  MWNLCIPSKIKYFIWKCFHENLPTMLALPRKHIATTNMCNLCYKKIETVDHALFHC-----IFVNDDAAAV------SKRVGGIVGATKLPIVWCCVTAR

Query:  QEFASQT----------IFVDAAVQEGDEIFDFGAAVMDLRGGLQAALQGFKSVASSVLIVEAQAIIEGLRLAQRLNVQRVIIKSDSLSCVHMI------
            SQ           I+VD A++ GD +   G  V + +G   AA   +   +      E  A IEGLR A  +     I++ D+   ++ I      
Subjt:  QEFASQT----------IFVDAAVQEGDEIFDFGAAVMDLRGGLQAALQGFKSVASSVLIVEAQAIIEGLRLAQRLNVQRVIIKSDSLSCVHMI------

Query:  NGDH-PIAIEVKTIIQ
        NG + P+  EV ++++
Subjt:  NGDH-PIAIEVKTIIQ

KAG6693248.1 hypothetical protein I3842_10G159900, partial [Carya illinoinensis]2.4e-1024.53Show/hide
Query:  MWNLCIPSKIKYFIWKCFHENLPTMLALPRKHIATTNMCNLCYKKIETVDHALFHCIFVNDDAAAVSKRVGGIVG--------------------ATKLP
        +W+L +P KIK F W+  HE LPT   L  KH+   ++C  C   +E   HALF+C  V          +G IV                      T L 
Subjt:  MWNLCIPSKIKYFIWKCFHENLPTMLALPRKHIATTNMCNLCYKKIETVDHALFHCIFVNDDAAAVSKRVGGIVG--------------------ATKLP

Query:  IVW-----------------------CCVTARQEFASQTIF------------------------VDAAVQEGDEIFDFGAAVMDLRGGLQAALQGFKSV
        + W                         +  +QE+    +F                        VD A          G  + D  G +  A    +  
Subjt:  IVW-----------------------CCVTARQEFASQTIF------------------------VDAAVQEGDEIFDFGAAVMDLRGGLQAALQGFKSV

Query:  ASSVLIVEAQAIIEGLRLAQRLNVQRVIIKSDSLSCVHMINGDHPIAIEVKTIIQDILALNDTFQ
         SS + +EA A++ GL+L  +  V ++I++SDSL  V+ +N D     E   I+QDI  L   F+
Subjt:  ASSVLIVEAQAIIEGLRLAQRLNVQRVIIKSDSLSCVHMINGDHPIAIEVKTIIQDILALNDTFQ

KAG6693249.1 hypothetical protein I3842_10G159900 [Carya illinoinensis]2.4e-1024.53Show/hide
Query:  MWNLCIPSKIKYFIWKCFHENLPTMLALPRKHIATTNMCNLCYKKIETVDHALFHCIFVNDDAAAVSKRVGGIVG--------------------ATKLP
        +W+L +P KIK F W+  HE LPT   L  KH+   ++C  C   +E   HALF+C  V          +G IV                      T L 
Subjt:  MWNLCIPSKIKYFIWKCFHENLPTMLALPRKHIATTNMCNLCYKKIETVDHALFHCIFVNDDAAAVSKRVGGIVG--------------------ATKLP

Query:  IVW-----------------------CCVTARQEFASQTIF------------------------VDAAVQEGDEIFDFGAAVMDLRGGLQAALQGFKSV
        + W                         +  +QE+    +F                        VD A          G  + D  G +  A    +  
Subjt:  IVW-----------------------CCVTARQEFASQTIF------------------------VDAAVQEGDEIFDFGAAVMDLRGGLQAALQGFKSV

Query:  ASSVLIVEAQAIIEGLRLAQRLNVQRVIIKSDSLSCVHMINGDHPIAIEVKTIIQDILALNDTFQ
         SS + +EA A++ GL+L  +  V ++I++SDSL  V+ +N D     E   I+QDI  L   F+
Subjt:  ASSVLIVEAQAIIEGLRLAQRLNVQRVIIKSDSLSCVHMINGDHPIAIEVKTIIQDILALNDTFQ

XP_030502854.1 uncharacterized protein LOC115718027 [Cannabis sativa]2.4e-1027.19Show/hide
Query:  MWNLCIPSKIKYFIWKCFHENLPTMLALPRKHIATTNMCNLCYKKIETVDHALFHCIFVNDDAAAVSKRVG-------GIVGATKLPIVWCCVTARQ---
        +W LCIP K+K F+W+    NLPT + L  +H+    +C +C  + ET  HAL  C    + A A  ++VG       G    + L +++   TA Q   
Subjt:  MWNLCIPSKIKYFIWKCFHENLPTMLALPRKHIATTNMCNLCYKKIETVDHALFHCIFVNDDAAAVSKRVG-------GIVGATKLPIVWCCVTARQ---

Query:  -----------------------------EFASQTIFVDAAVQEGDEIFDFGAAVMDLRGGLQAALQGFKSVASSVLIVEAQAIIEGLRLAQRLNVQRVI
                                     E  +  + VDAA+ E +E + FG    D  G    A  G      SV ++EA  I E L   +  N   V+
Subjt:  -----------------------------EFASQTIFVDAAVQEGDEIFDFGAAVMDLRGGLQAALQGFKSVASSVLIVEAQAIIEGLRLAQRLNVQRVI

Query:  IKSDSLSCVHMINGDHPIAIEVKTIIQD
        +++DSL  V  I     +      +IQD
Subjt:  IKSDSLSCVHMINGDHPIAIEVKTIIQD

XP_042950031.1 uncharacterized protein LOC122282138 [Carya illinoinensis]1.4e-1327.75Show/hide
Query:  MWNLCIPSKIKYFIWKCFHENLPTMLALPRKHIATTNMCNLCYKKIETVDHALFHCIFVNDDAAAVSKRVGGIVG--------------------ATKLP
        +W+L IP +IK F W+  HE LPT   L  KH+   ++C +C   +E   HALF+C  V         ++G IV                     AT L 
Subjt:  MWNLCIPSKIKYFIWKCFHENLPTMLALPRKHIATTNMCNLCYKKIETVDHALFHCIFVNDDAAAVSKRVGGIVG--------------------ATKLP

Query:  IVWCCVTARQEFASQ------TIFVDAAVQ-EGDEIFDFGAAVM--DLRGGLQAALQGFKSVASSVLIVEAQAIIEGLRLAQRLNVQRVIIKSDSLSCVH
        + W     R +F  +      T  +++A+  + DE F  G  V+  D  G +   +   +   SS   +E  A++ GL+L  +  V ++I+KS+SL  V+
Subjt:  IVWCCVTARQEFASQ------TIFVDAAVQ-EGDEIFDFGAAVM--DLRGGLQAALQGFKSVASSVLIVEAQAIIEGLRLAQRLNVQRVIIKSDSLSCVH

Query:  MINGDHPIAIEVKTIIQDILALNDTFQ
         +N       +   I+QD   L + F+
Subjt:  MINGDHPIAIEVKTIIQDILALNDTFQ

TrEMBL top hitse value%identityAlignment
A0A2H5Q8Y3 Uncharacterized protein4.9e-0925.96Show/hide
Query:  MMWNLCIPSKIKYFIWKCFHENLPTMLALPRKHIATTNMCNLCYKKIETVDHALFHC----------IFVNDDAAAVSKRVGGIVGATK-----------
        ++W+L +P KI+ F+W+     LP+   L ++ I     C LC   IE V HAL  C          +F ND  AA  + +  ++   K           
Subjt:  MMWNLCIPSKIKYFIWKCFHENLPTMLALPRKHIATTNMCNLCYKKIETVDHALFHC----------IFVNDDAAAVSKRVGGIVGATK-----------

Query:  LPIVWCCVTARQEFA-------SQTIFVDA-AVQE--------GDEIFDFGAAVMDLRGGLQAALQGFKSVASSVLIVEAQAIIEGLRLAQRLNVQRVII
          ++W    AR ++         Q++   A AV E         D +   GA + D  G + A          SV   EA+A+  GL++A+  +V+ VI+
Subjt:  LPIVWCCVTARQEFA-------SQTIFVDA-AVQE--------GDEIFDFGAAVMDLRGGLQAALQGFKSVASSVLIVEAQAIIEGLRLAQRLNVQRVII

Query:  KSDSLSCVHMINGDHPIAIEVKTIIQDILALNDTF
        +SDS   V ++N       E+  ++ +I  L ++F
Subjt:  KSDSLSCVHMINGDHPIAIEVKTIIQDILALNDTF

A0A2N9FSI8 Reverse transcriptase domain-containing protein6.4e-0944.59Show/hide
Query:  WNLCIPSKIKYFIWKCFHENLPTMLALPRKHIATTNMCNLCYKKIETVDHALFHCIFVNDDAAAVSKRVGGIVG
        W L IP KIK+FIW+ FHE+LPT   L R+ I  T  C +C +++ET  HAL+ C    +  A V  R  G +G
Subjt:  WNLCIPSKIKYFIWKCFHENLPTMLALPRKHIATTNMCNLCYKKIETVDHALFHCIFVNDDAAAVSKRVGGIVG

A0A2N9HY54 Reverse transcriptase2.6e-1027.23Show/hide
Query:  MWNLCIPSKIKYFIWKCFHENLPTMLALPRKHIATTNMCNLCYKKIETVDHALFHCIFVND--DAAAVSKRVGGIVGATKLPIVWCCVTARQEFASQTIF
        +W+L +P K+K F+W+  HE LP    L R+ I     C+ C    ET  HAL+ C F  +   A      +     ++ + + W C      F +    
Subjt:  MWNLCIPSKIKYFIWKCFHENLPTMLALPRKHIATTNMCNLCYKKIETVDHALFHCIFVND--DAAAVSKRVGGIVGATKLPIVWCCVTARQEFASQTIF

Query:  VDAAVQEGDEIFDFGAAVMDLRGGLQAALQGFKSVASSVLIVEAQAIIEGLRLAQRLNVQRVIIKSDSLSCVHMINGDHPIAIEVKTIIQD
         D A+         G  + + +G   A +    ++ +SV +VEA+A  E ++LA  L ++RVI + DS   +H +    P      T I+D
Subjt:  VDAAVQEGDEIFDFGAAVMDLRGGLQAALQGFKSVASSVLIVEAQAIIEGLRLAQRLNVQRVIIKSDSLSCVHMINGDHPIAIEVKTIIQD

A0A6J5XES2 Uncharacterized protein2.2e-0927.31Show/hide
Query:  MWNLCIPSKIKYFIWKCFHENLPTMLALPRKHIATTNMCNLCYKKIETVDHALFHC-----IFVNDDAAAV------SKRVGGIVGATKLPIVWCCVTAR
        +W L IP+KIK+F+W+C  + LP    L ++ I  T++C+ C++K+E+V HA++ C     ++ N    AV      +K       A  LP +       
Subjt:  MWNLCIPSKIKYFIWKCFHENLPTMLALPRKHIATTNMCNLCYKKIETVDHALFHC-----IFVNDDAAAV------SKRVGGIVGATKLPIVWCCVTAR

Query:  QEFASQT----------IFVDAAVQEGDEIFDFGAAVMDLRGGLQAALQGFKSVASSVLIVEAQAIIEGLRLAQRLNVQRVIIKSDSLSCVHMI------
            SQ           I+VD A++ GD +   G  V + +G   AA   +   +      E  A IEGLR A  +     I++ D+   ++ I      
Subjt:  QEFASQT----------IFVDAAVQEGDEIFDFGAAVMDLRGGLQAALQGFKSVASSVLIVEAQAIIEGLRLAQRLNVQRVIIKSDSLSCVHMI------

Query:  NGDH-PIAIEVKTIIQ
        NG + P+  EV ++++
Subjt:  NGDH-PIAIEVKTIIQ

A0A803PAK3 Uncharacterized protein3.7e-0926.47Show/hide
Query:  WNLCIPSKIKYFIWKCFHENLPTMLALPRKHIATTNMCNLCYKKIETVDHALFHC--------IFVNDDAAAVSKRVGGIVGATKLPIVWCCVTARQEFA
        W L +PSKI+ F+W+  H+ LP    L  +HIA ++ C LC +  ET+ HALF+C        +     A + +   G     TK    W          
Subjt:  WNLCIPSKIKYFIWKCFHENLPTMLALPRKHIATTNMCNLCYKKIETVDHALFHC--------IFVNDDAAAVSKRVGGIVGATKLPIVWCCVTARQEFA

Query:  SQTIFVDAAVQEGDEIFDFGAAVMDLRGGLQAALQGFKSVASSVLIVEAQAIIEGLRLAQRLNVQRVIIKSDSLSCVHMINGDHPIAIEVKTIIQDILAL
           +  DAA+   +    FGA + D  G + AA+        +  I EA A++  L+  + L +    I+++SLS V  ++    +  +   ++ +I  L
Subjt:  SQTIFVDAAVQEGDEIFDFGAAVMDLRGGLQAALQGFKSVASSVLIVEAQAIIEGLRLAQRLNVQRVIIKSDSLSCVHMINGDHPIAIEVKTIIQDILAL

Query:  NDTF
           F
Subjt:  NDTF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein1.4e-0534.48Show/hide
Query:  MWNLCIPSKIKYFIWKCFHENLPTMLALPRKHIATTNMCNLCYKKIETVDHALFHCIF
        +WNL I  K+K+F+W+   + L T   L  + +     C  C+++ E+++HALF C F
Subjt:  MWNLCIPSKIKYFIWKCFHENLPTMLALPRKHIATTNMCNLCYKKIETVDHALFHCIF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGTGGAACTTGTGTATACCTTCTAAGATTAAGTACTTTATTTGGAAATGCTTCCATGAAAACTTGCCTACCATGCTTGCTTTGCCTAGGAAACATATTGCAACCAC
AAATATGTGTAATCTTTGTTATAAAAAGATTGAAACGGTGGACCATGCCTTATTCCATTGTATATTCGTTAACGATGATGCAGCTGCTGTTTCAAAGCGGGTTGGAGGAA
TTGTAGGTGCTACGAAACTTCCAATAGTCTGGTGTTGTGTGACGGCGAGGCAAGAATTTGCAAGCCAAACCATTTTTGTCGATGCGGCTGTGCAGGAAGGAGATGAAATT
TTCGACTTTGGTGCAGCAGTGATGGATTTAAGGGGCGGGCTTCAAGCGGCTCTTCAAGGGTTTAAATCAGTGGCTTCTTCAGTTCTCATTGTCGAAGCCCAAGCAATCAT
AGAAGGTCTACGACTGGCCCAACGGTTGAATGTCCAACGGGTTATTATTAAATCAGATTCACTGTCATGTGTCCACATGATAAATGGGGATCATCCAATTGCCATTGAAG
TCAAAACAATTATTCAGGATATTTTGGCGTTAAATGATACATTTCAAATCAAGGATTTGATAGAGAGGGAAAATCTATTTAGTGAAGAAAATCAGAGAGAATGTTCTCAT
TACAGAAACCATGACAACAGCGAAAGAAATAGACATTACACAGAACACGTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGATGTGGAACTTGTGTATACCTTCTAAGATTAAGTACTTTATTTGGAAATGCTTCCATGAAAACTTGCCTACCATGCTTGCTTTGCCTAGGAAACATATTGCAACCAC
AAATATGTGTAATCTTTGTTATAAAAAGATTGAAACGGTGGACCATGCCTTATTCCATTGTATATTCGTTAACGATGATGCAGCTGCTGTTTCAAAGCGGGTTGGAGGAA
TTGTAGGTGCTACGAAACTTCCAATAGTCTGGTGTTGTGTGACGGCGAGGCAAGAATTTGCAAGCCAAACCATTTTTGTCGATGCGGCTGTGCAGGAAGGAGATGAAATT
TTCGACTTTGGTGCAGCAGTGATGGATTTAAGGGGCGGGCTTCAAGCGGCTCTTCAAGGGTTTAAATCAGTGGCTTCTTCAGTTCTCATTGTCGAAGCCCAAGCAATCAT
AGAAGGTCTACGACTGGCCCAACGGTTGAATGTCCAACGGGTTATTATTAAATCAGATTCACTGTCATGTGTCCACATGATAAATGGGGATCATCCAATTGCCATTGAAG
TCAAAACAATTATTCAGGATATTTTGGCGTTAAATGATACATTTCAAATCAAGGATTTGATAGAGAGGGAAAATCTATTTAGTGAAGAAAATCAGAGAGAATGTTCTCAT
TACAGAAACCATGACAACAGCGAAAGAAATAGACATTACACAGAACACGTGTAA
Protein sequenceShow/hide protein sequence
MMWNLCIPSKIKYFIWKCFHENLPTMLALPRKHIATTNMCNLCYKKIETVDHALFHCIFVNDDAAAVSKRVGGIVGATKLPIVWCCVTARQEFASQTIFVDAAVQEGDEI
FDFGAAVMDLRGGLQAALQGFKSVASSVLIVEAQAIIEGLRLAQRLNVQRVIIKSDSLSCVHMINGDHPIAIEVKTIIQDILALNDTFQIKDLIERENLFSEENQRECSH
YRNHDNSERNRHYTEHV