; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000486 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000486
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr4:8526765..8528899
RNA-Seq ExpressionLag0000486
SyntenyLag0000486
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]2.8e-3128.5Show/hide
Query:  GFHDNWISLIVDCISTFSFSIVMNGESIGHISPSRGLRQGDPLSPYLFLLCSEGFSSLLSTTVNSRLLSGLSLSRHSAKVSHLLFIDDSPVFLKSNAYES
        GF++ WI  I+ CIST  FSI +NG   G   PSRG+RQGDPLSPYLFLLC+EG S+L++   NS  L+G+    ++  ++HLLF DDS +FL+S   E 
Subjt:  GFHDNWISLIVDCISTFSFSIVMNGESIGHISPSRGLRQGDPLSPYLFLLCSEGFSSLLSTTVNSRLLSGLSLSRHSAKVSHLLFIDDSPVFLKSNAYES

Query:  GLFKSLVLDYERASGQCINFS-------------------------------------------------------------------------------
           + L+  Y RASGQCINFS                                                                               
Subjt:  GLFKSLVLDYERASGQCINFS-------------------------------------------------------------------------------

Query:  ---------------------------------------------FVWGLELLHKGLRKNLGNGKEIRMFKDPRLPLSSTFKVVSSN-------------
                                                     F+WG +LL KGLR  +GNG  I+ F DP LP  +TFK +  N             
Subjt:  ---------------------------------------------FVWGLELLHKGLRKNLGNGKEIRMFKDPRLPLSSTFKVVSSN-------------

Query:  ------------------ADFIKNFPMSNFT-PDIWVWHSGRFGKYFVRSGYKVFMLSKVEGAPSSLA------NGLLKESSPTKL
                           D I + P+S++   D W+WH  + G Y VRSGYK++M  K     +S        N + K + PTK+
Subjt:  ------------------ADFIKNFPMSNFT-PDIWVWHSGRFGKYFVRSGYKVFMLSKVEGAPSSLA------NGLLKESSPTKL

XP_023880426.1 uncharacterized protein LOC111992797 [Quercus suber]8.3e-2852.94Show/hide
Query:  GFHDNWISLIVDCISTFSFSIVMNGESIGHISPSRGLRQGDPLSPYLFLLCSEGFSSLLSTTVNSRLLSGLSLSRHSAKVSHLLFIDDSPVFLKSNAYES
        GF++NWI L++ CIS+ S+S+++NG + G+I+P+RGLRQGDPLS YLFLLC+EGFS+L++   ++  L+G+S+ R + KVSHL F DDS +F K+N  E 
Subjt:  GFHDNWISLIVDCISTFSFSIVMNGESIGHISPSRGLRQGDPLSPYLFLLCSEGFSSLLSTTVNSRLLSGLSLSRHSAKVSHLLFIDDSPVFLKSNAYES

Query:  GLFKSLVLDYERASGQCIN
           K ++  YE ASGQ IN
Subjt:  GLFKSLVLDYERASGQCIN

XP_030478262.1 uncharacterized protein LOC115695328 [Cannabis sativa]2.2e-2852.1Show/hide
Query:  GFHDNWISLIVDCISTFSFSIVMNGESIGHISPSRGLRQGDPLSPYLFLLCSEGFSSLLSTTVNSRLLSGLSLSRHSAKVSHLLFIDDSPVFLKSNAYES
        GFHD W+SLI++C+ + SFS ++NGE +GH+ PSRGLRQGDPLSPYLFL+CSEG S LL +  +S  L GL L+RH+  +SHLLF DD+ +F ++    +
Subjt:  GFHDNWISLIVDCISTFSFSIVMNGESIGHISPSRGLRQGDPLSPYLFLLCSEGFSSLLSTTVNSRLLSGLSLSRHSAKVSHLLFIDDSPVFLKSNAYES

Query:  GLFKSLVLDYERASGQCIN
             ++  Y +ASGQ +N
Subjt:  GLFKSLVLDYERASGQCIN

XP_030958760.1 uncharacterized protein LOC115980671 [Quercus lobata]1.5e-2955.46Show/hide
Query:  GFHDNWISLIVDCISTFSFSIVMNGESIGHISPSRGLRQGDPLSPYLFLLCSEGFSSLLSTTVNSRLLSGLSLSRHSAKVSHLLFIDDSPVFLKSNAYES
        GFHD+W  LI+ CI++ S+S+++NG + G+I PSRGLRQGDPLSPYLFLLC++GFSSL+S  V +++LSGLS+ R   K+SHL F DDS +F K+N++E 
Subjt:  GFHDNWISLIVDCISTFSFSIVMNGESIGHISPSRGLRQGDPLSPYLFLLCSEGFSSLLSTTVNSRLLSGLSLSRHSAKVSHLLFIDDSPVFLKSNAYES

Query:  GLFKSLVLDYERASGQCIN
             ++  YE  SGQ IN
Subjt:  GLFKSLVLDYERASGQCIN

XP_030970961.1 uncharacterized protein LOC115991405 [Quercus lobata]6.8e-3056.3Show/hide
Query:  GFHDNWISLIVDCISTFSFSIVMNGESIGHISPSRGLRQGDPLSPYLFLLCSEGFSSLLSTTVNSRLLSGLSLSRHSAKVSHLLFIDDSPVFLKSNAYES
        GFHD+W  LI+ CI++ S+S+++NG + G+I PSRGLRQGDPLSPYLFLLC++GFSSL+S  V +++LSGLS+ R   K+SHL F DDS +F K+N++E 
Subjt:  GFHDNWISLIVDCISTFSFSIVMNGESIGHISPSRGLRQGDPLSPYLFLLCSEGFSSLLSTTVNSRLLSGLSLSRHSAKVSHLLFIDDSPVFLKSNAYES

Query:  GLFKSLVLDYERASGQCIN
             ++  YE ASGQ IN
Subjt:  GLFKSLVLDYERASGQCIN

TrEMBL top hitse value%identityAlignment
A0A2N9F9Z6 Reverse transcriptase domain-containing protein4.3e-3033.2Show/hide
Query:  GFHDNWISLIVDCISTFSFSIVMNGESIGHISPSRGLRQGDPLSPYLFLLCSEGFSSLLSTTVNSRLLSGLSLSRHSAKVSHLLFIDDSPVFLKSNAYES
        GFH  W+S++++C+ T S+S+++NG+      PSRGLRQGDP+SPYLFLLC+EG  +LL++  +S  + GLS+S     ++HL F DDS +F ++  +  
Subjt:  GFHDNWISLIVDCISTFSFSIVMNGESIGHISPSRGLRQGDPLSPYLFLLCSEGFSSLLSTTVNSRLLSGLSLSRHSAKVSHLLFIDDSPVFLKSNAYES

Query:  GLFKSLVLDYERASGQCIN-------FSFVWGLEL---LHKGLR-KNLGNGKEIRMFKDPRLPLSSTFKVVS----------------------------
           + ++  YER SGQ IN       FS    LE    +   L    +GNG  I+++ D  LP  S    V+                            
Subjt:  GLFKSLVLDYERASGQCIN-------FSFVWGLEL---LHKGLR-KNLGNGKEIRMFKDPRLPLSSTFKVVS----------------------------

Query:  -----SNADFIKNFPMSNFTP-DIWVWHSGRFGKYFVRSGYKVFMLSKVEGAPSSLANG
               A  IK+  +S   P D+ VW   R G+Y VRS Y++ + ++ +G P  L  G
Subjt:  -----SNADFIKNFPMSNFTP-DIWVWHSGRFGKYFVRSGYKVFMLSKVEGAPSSLANG

A0A2N9FR55 Reverse transcriptase domain-containing protein2.8e-2930.27Show/hide
Query:  GFHDNWISLIVDCISTFSFSIVMNGESIGHISPSRGLRQGDPLSPYLFLLCSEGFSSLLSTTVNSRLLSGLSLSRHSAKVSHLLFIDDSPVFLKSNAYES
        GF   W+ LI++C+ST S+S+++NG    +I P RGLRQGDPLSPYLFLL +EG +SLL    + R + G+++S+   ++SH+LF+DDS +F K+   E 
Subjt:  GFHDNWISLIVDCISTFSFSIVMNGESIGHISPSRGLRQGDPLSPYLFLLCSEGFSSLLSTTVNSRLLSGLSLSRHSAKVSHLLFIDDSPVFLKSNAYES

Query:  GLFKSLVLDYERASGQCINFSFVWGLELLHKGLRKNLGNGKEIRMFKDPRLPLSSTF------KVVSSNADF---------------------------I
           K ++  YE A G C+                   G    IR+++D  +P   TF      K++S NA                             I
Subjt:  GLFKSLVLDYERASGQCINFSFVWGLELLHKGLRKNLGNGKEIRMFKDPRLPLSSTF------KVVSSNADF---------------------------I

Query:  KNFPMSNF-TPDIWVWHSGRFGKYFVRSGYKVFM---LSKVEGAPSSLANGLLKESSPTKLVAPFI-----QDRNGVVIHVDVPCNHHHNITGF
        K+ P+S + +PD+ +W   + G + V+S Y + M    ++  G  S++++G L      K    +      +D    ++HV V C     + GF
Subjt:  KNFPMSNF-TPDIWVWHSGRFGKYFVRSGYKVFM---LSKVEGAPSSLANGLLKESSPTKLVAPFI-----QDRNGVVIHVDVPCNHHHNITGF

A0A6J1DX30 uncharacterized protein LOC1110248741.3e-3128.5Show/hide
Query:  GFHDNWISLIVDCISTFSFSIVMNGESIGHISPSRGLRQGDPLSPYLFLLCSEGFSSLLSTTVNSRLLSGLSLSRHSAKVSHLLFIDDSPVFLKSNAYES
        GF++ WI  I+ CIST  FSI +NG   G   PSRG+RQGDPLSPYLFLLC+EG S+L++   NS  L+G+    ++  ++HLLF DDS +FL+S   E 
Subjt:  GFHDNWISLIVDCISTFSFSIVMNGESIGHISPSRGLRQGDPLSPYLFLLCSEGFSSLLSTTVNSRLLSGLSLSRHSAKVSHLLFIDDSPVFLKSNAYES

Query:  GLFKSLVLDYERASGQCINFS-------------------------------------------------------------------------------
           + L+  Y RASGQCINFS                                                                               
Subjt:  GLFKSLVLDYERASGQCINFS-------------------------------------------------------------------------------

Query:  ---------------------------------------------FVWGLELLHKGLRKNLGNGKEIRMFKDPRLPLSSTFKVVSSN-------------
                                                     F+WG +LL KGLR  +GNG  I+ F DP LP  +TFK +  N             
Subjt:  ---------------------------------------------FVWGLELLHKGLRKNLGNGKEIRMFKDPRLPLSSTFKVVSSN-------------

Query:  ------------------ADFIKNFPMSNFT-PDIWVWHSGRFGKYFVRSGYKVFMLSKVEGAPSSLA------NGLLKESSPTKL
                           D I + P+S++   D W+WH  + G Y VRSGYK++M  K     +S        N + K + PTK+
Subjt:  ------------------ADFIKNFPMSNFT-PDIWVWHSGRFGKYFVRSGYKVFMLSKVEGAPSSLA------NGLLKESSPTKL

A0A803NML1 Uncharacterized protein3.8e-3446.6Show/hide
Query:  GFHDNWISLIVDCISTFSFSIVMNGESIGHISPSRGLRQGDPLSPYLFLLCSEGFSSLLSTTVNSRLLSGLSLSRHSAKVSHLLFIDDSPVFLKSNAYES
        GF   WISLI+ CIST SFS  +NG+ +GH++P RGLRQGDPLSPYLFL+CSEG S  L     S +L GL L+R++  VSHLLF DDS +F +S    +
Subjt:  GFHDNWISLIVDCISTFSFSIVMNGESIGHISPSRGLRQGDPLSPYLFLLCSEGFSSLLSTTVNSRLLSGLSLSRHSAKVSHLLFIDDSPVFLKSNAYES

Query:  GLFKSLVLDYERASGQCINFSFVWGLELLHKGLRKNLGNGKEIRMFKDPRLPLSSTFKVVSSNADFIKNFPMSNFTPDIWVWHSGRFGKYF
           K  +  Y RASGQ          ELL KGLR  +G+G  +   KDP +P  + FK VS       + P+S+F  D  VW+      YF
Subjt:  GLFKSLVLDYERASGQCINFSFVWGLELLHKGLRKNLGNGKEIRMFKDPRLPLSSTFKVVSSNADFIKNFPMSNFTPDIWVWHSGRFGKYF

A0A803P5H2 Uncharacterized protein2.0e-3540.95Show/hide
Query:  GFHDNWISLIVDCISTFSFSIVMNGESIGHISPSRGLRQGDPLSPYLFLLCSEGFSSLLSTTVNSRLLSGLSLSRHSAKVSHLLFIDDSPVFLKSNAYES
        GF   WISLI+ C+ T SFS ++NGE  G + P RGLRQGDPLSPYLFL+CSEG S LL        L GL++SRHS  ++HLLF DDS +F ++N    
Subjt:  GFHDNWISLIVDCISTFSFSIVMNGESIGHISPSRGLRQGDPLSPYLFLLCSEGFSSLLSTTVNSRLLSGLSLSRHSAKVSHLLFIDDSPVFLKSNAYES

Query:  GLFKSLVLDYERASGQCIN---FSFVWGLELLHKGLRKNLGNGKEIRMFKDPRLPLSSTFKVV------------------------------SSNADFI
        G  K  +  Y RASGQ +N      VWG ELL KGL   +G+G  +   +D  +P +  FK +                               ++ D I
Subjt:  GLFKSLVLDYERASGQCIN---FSFVWGLELLHKGLRKNLGNGKEIRMFKDPRLPLSSTFKVV------------------------------SSNADFI

Query:  KNFPMS-NFTPDIWVWHSGRFGKYFVRSGYKV
           P+S N T D W WH    G Y V+SGY +
Subjt:  KNFPMS-NFTPDIWVWHSGRFGKYFVRSGYKV

SwissProt top hitse value%identityAlignment
P92555 Uncharacterized mitochondrial protein AtMg012503.9e-1247.95Show/hide
Query:  FSFSIVMNGESIGHISPSRGLRQGDPLSPYLFLLCSEGFSSLLSTTVNSRLLSGLSLSRHSAKVSHLLFIDDS
        F    ++NG   G ++PSRGLRQGDPLSPYLF+LC+E  S L         L G+ +S +S +++HLLF DD+
Subjt:  FSFSIVMNGESIGHISPSRGLRQGDPLSPYLFLLCSEGFSSLLSTTVNSRLLSGLSLSRHSAKVSHLLFIDDS

Arabidopsis top hitse value%identityAlignment
ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)2.8e-1347.95Show/hide
Query:  FSFSIVMNGESIGHISPSRGLRQGDPLSPYLFLLCSEGFSSLLSTTVNSRLLSGLSLSRHSAKVSHLLFIDDS
        F    ++NG   G ++PSRGLRQGDPLSPYLF+LC+E  S L         L G+ +S +S +++HLLF DD+
Subjt:  FSFSIVMNGESIGHISPSRGLRQGDPLSPYLFLLCSEGFSSLLSTTVNSRLLSGLSLSRHSAKVSHLLFIDDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATAACAAATCCGCTCTATCAGGCTTTCATGATAATTGGATCTCACTGATTGTGGATTGTATATCTACTTTTTCCTTCTCTATTGTCATGAATGGTGAGTCTATTGG
ACATATTTCTCCGTCTCGAGGACTCCGACAAGGTGACCCTTTATCTCCATATCTATTCCTCCTTTGTTCTGAGGGGTTTTCATCATTGCTTTCAACTACAGTCAATTCTC
GTTTGCTATCAGGCTTATCTTTATCCAGACATTCCGCAAAAGTGTCTCATTTACTTTTTATTGATGATAGCCCGGTGTTTCTGAAATCTAATGCGTATGAGTCCGGTTTG
TTTAAGTCGCTTGTACTAGATTATGAACGGGCGTCGGGACAATGTATAAATTTCAGTTTTGTTTGGGGTTTGGAGTTACTACACAAGGGGTTAAGGAAGAATTTAGGAAA
TGGTAAGGAAATCAGGATGTTCAAAGATCCAAGGCTTCCACTGTCGTCAACGTTTAAGGTTGTTTCTTCGAATGCAGACTTTATAAAGAATTTTCCCATGAGTAACTTTA
CTCCTGATATTTGGGTTTGGCATTCCGGTCGTTTTGGAAAGTATTTTGTGAGGAGTGGATACAAGGTGTTTATGCTTTCTAAAGTCGAAGGGGCACCCTCTAGTTTGGCT
AATGGGTTGCTAAAGGAATCTAGCCCAACGAAGTTGGTTGCTCCTTTTATTCAAGATAGGAATGGAGTTGTTATCCACGTTGATGTCCCTTGTAATCATCACCATAATAT
TACTGGATTTTGGGCGATTATTCGAGATGGAGGGGGCTCGGTGTTGGCTGTTATGGTGGCTGTTCATTCTGGTTTGCTTTCGATAAGTGCAGAAACTAGAGCTATAAAGG
AAGCTCTTAGTTTGGCTCGAAGGATGGATTTCGAACATATTACATTGTTCTCTGATTCGCTTAATGTTATTAATATTTTGAATGAGGATTTAGATTGTTTGTGTAATGCC
TCACGTTTTGGAATCGACCCATTTTGGGATCTGTCGAGCGTTCTTATTCTAATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTATAACAAATCCGCTCTATCAGGCTTTCATGATAATTGGATCTCACTGATTGTGGATTGTATATCTACTTTTTCCTTCTCTATTGTCATGAATGGTGAGTCTATTGG
ACATATTTCTCCGTCTCGAGGACTCCGACAAGGTGACCCTTTATCTCCATATCTATTCCTCCTTTGTTCTGAGGGGTTTTCATCATTGCTTTCAACTACAGTCAATTCTC
GTTTGCTATCAGGCTTATCTTTATCCAGACATTCCGCAAAAGTGTCTCATTTACTTTTTATTGATGATAGCCCGGTGTTTCTGAAATCTAATGCGTATGAGTCCGGTTTG
TTTAAGTCGCTTGTACTAGATTATGAACGGGCGTCGGGACAATGTATAAATTTCAGTTTTGTTTGGGGTTTGGAGTTACTACACAAGGGGTTAAGGAAGAATTTAGGAAA
TGGTAAGGAAATCAGGATGTTCAAAGATCCAAGGCTTCCACTGTCGTCAACGTTTAAGGTTGTTTCTTCGAATGCAGACTTTATAAAGAATTTTCCCATGAGTAACTTTA
CTCCTGATATTTGGGTTTGGCATTCCGGTCGTTTTGGAAAGTATTTTGTGAGGAGTGGATACAAGGTGTTTATGCTTTCTAAAGTCGAAGGGGCACCCTCTAGTTTGGCT
AATGGGTTGCTAAAGGAATCTAGCCCAACGAAGTTGGTTGCTCCTTTTATTCAAGATAGGAATGGAGTTGTTATCCACGTTGATGTCCCTTGTAATCATCACCATAATAT
TACTGGATTTTGGGCGATTATTCGAGATGGAGGGGGCTCGGTGTTGGCTGTTATGGTGGCTGTTCATTCTGGTTTGCTTTCGATAAGTGCAGAAACTAGAGCTATAAAGG
AAGCTCTTAGTTTGGCTCGAAGGATGGATTTCGAACATATTACATTGTTCTCTGATTCGCTTAATGTTATTAATATTTTGAATGAGGATTTAGATTGTTTGTGTAATGCC
TCACGTTTTGGAATCGACCCATTTTGGGATCTGTCGAGCGTTCTTATTCTAATTTGA
Protein sequenceShow/hide protein sequence
MYNKSALSGFHDNWISLIVDCISTFSFSIVMNGESIGHISPSRGLRQGDPLSPYLFLLCSEGFSSLLSTTVNSRLLSGLSLSRHSAKVSHLLFIDDSPVFLKSNAYESGL
FKSLVLDYERASGQCINFSFVWGLELLHKGLRKNLGNGKEIRMFKDPRLPLSSTFKVVSSNADFIKNFPMSNFTPDIWVWHSGRFGKYFVRSGYKVFMLSKVEGAPSSLA
NGLLKESSPTKLVAPFIQDRNGVVIHVDVPCNHHHNITGFWAIIRDGGGSVLAVMVAVHSGLLSISAETRAIKEALSLARRMDFEHITLFSDSLNVINILNEDLDCLCNA
SRFGIDPFWDLSSVLILI