CuGenDBv2

Lag0000515 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0000515
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr4:9378110..9379248
RNA-Seq Expression: Lag0000515
Synteny: Lag0000515
Gene Ontology terms: NA
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
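
The genome location of this gene is written as `chromosome:start..end`. A minimal sketch of reading that notation follows. It assumes the coordinates are 1-based and inclusive, which the record does not state, and the helper names `parse_location` and `span_length` are hypothetical.

```python
import re


def parse_location(text):
    """Split a 'chr4:9378110..9379248' style string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive (not stated in the record).
    return end - start + 1


chrom, start, end = parse_location("chr4:9378110..9379248")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the locus spans 1139 bp on chr4.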


Homology
GenBank top hits (e value, % identity, alignment)
CAB4308802.1 unnamed protein product [Prunus armeniaca], e value 2.7e-21, identity 26.51%
Query:  MSSAKRLLGFDNCFCVDCHGRSGGLALLW----------------------------RLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAILC
        M   +R LG   CF V   GR+GGLALLW                            R TGFYG P   +   SW LL R+ G  + PW++GGDFN IL 
Subjt:  MSSAKRLLGFDNCFCVDCHGRSGGLALLW----------------------------RLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAILC

Query:  QDEKDGGRDKPLSELAAFQSVVDLCGCRTWVLWETALLGVTDDQEVRRFMSGWIGPLATRLGGPLSELCDLRQLVGSSWAAGPGESSDITPL------AI
          E  G   +P+S++ AF+  ++ CG  T  +    +        +   +S  + PL   L          R L    W    G    I+         I
Subjt:  QDEKDGGRDKPLSELAAFQSVVDLCGCRTWVLWETALLGVTDDQEVRRFMSGWIGPLATRLGGPLSELCDLRQLVGSSWAAGPGESSDITPL------AI

Query:  ANKAKRCMHSMVGWGRSKSGNFPRRIRSANQRVQ-------SAIADLSTSD---SRDVLVQAE--------------------------AQLEEKLNRIG
          K K C  S+  W +   G  P ++RS  +++        S +A+    +     DVL++ E                          A++  + N I 
Subjt:  ANKAKRCMHSMVGWGRSKSGNFPRRIRSANQRVQ-------SAIADLSTSD---SRDVLVQAE--------------------------AQLEEKLNRIG

Query:  GLEDEQGVWQQEKTAVIQVMTDYFQHLFSSSG
         + D+QGVW     ++     DYFQ LF+S+G
Subjt:  GLEDEQGVWQQEKTAVIQVMTDYFQHLFSSSG

XP_030509295.1 uncharacterized protein LOC115723978 [Cannabis sativa], e value 1.9e-19, identity 27.24%
Query:  MSSAKRLLGFDNCFCVDCHGRSGGLALLW----------------------------RLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAILC
        + S  R L F+  F V+ HG SGG+ALLW                            R TG YG P+  +  Q+W+L+  L   +  PW I GDFN +L 
Subjt:  MSSAKRLLGFDNCFCVDCHGRSGGLALLW----------------------------RLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAILC

Query:  QDEKDGGRDKPLSELAAFQSVVDLCGCRTWVLWETALLGVTDDQEVRRFMSGWI----------------GPLATRLGGPLSELCDLRQLVGSSW-AAGP
        Q EK GG   P   +  FQ V+  CG     L +  + G     E  R    WI                 P   R         D   ++ SSW AAG 
Subjt:  QDEKDGGRDKPLSELAAFQSVVDLCGCRTWVLWETALLGVTDDQEVRRFMSGWI----------------GPLATRLGGPLSELCDLRQLVGSSW-AAGP

Query:  GESSDITPLAIANKAKRCMHSMVGWGRSKSGNFPRRIRSANQRVQSAIADLSTSDSRDVL-VQAEAQLEEKLNRIGGLEDEQGVWQQEKTAVIQVMTDYF
                + I  K   C  ++  WG+  SG+F  +I+  N+R++           RDV   +   Q ++ L +I  L+D  G  +  +  + +VM +YF
Subjt:  GESSDITPLAIANKAKRCMHSMVGWGRSKSGNFPRRIRSANQRVQSAIADLSTSDSRDVL-VQAEAQLEEKLNRIGGLEDEQGVWQQEKTAVIQVMTDYF

Query:  QHLFSSSGPSAR
          +F +   + R
Subjt:  QHLFSSSGPSAR

XP_030970624.1 uncharacterized protein LOC115991006 [Quercus lobata], e value 1.0e-20, identity 25.6%
Query:  MSSAKRLLGFDNCFCVDCHGRSGGLALLW----------------------------RLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAILC
        M S K  L FD+ F V   GR GGLALLW                            RLTGFYG P  +   + W++L  L    + PW+  GDFN +L 
Subjt:  MSSAKRLLGFDNCFCVDCHGRSGGLALLW----------------------------RLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAILC

Query:  QDEKDGGRDKPLSELAAFQSVVDLCG----------------CRTWVLWETALLGVTDDQEVRRFMSGWIG------------PLATRLGGPLSELCDLR
         +EK GG  +  +++  F+ V+D  G                 R  ++WE    GV +   + +F +  I              LA    G         
Subjt:  QDEKDGGRDKPLSELAAFQSVVDLCG----------------CRTWVLWETALLGVTDDQEVRRFMSGWIG------------PLATRLGGPLSELCDLR

Query:  QLVGSSWAAGPGESSDITP-----------LAIANKAKRCMHSMVGWGRSKSGNFPRRIRSANQRVQSAIADLSTSDS--RDVLVQAEAQLEEKLNRIGG
            + W   PG    +T             A   K K+C   +  W R   G+    I+   +++  A  +   S S      +++E  + ++ N I G
Subjt:  QLVGSSWAAGPGESSDITP-----------LAIANKAKRCMHSMVGWGRSKSGNFPRRIRSANQRVQSAIADLSTSDS--RDVLVQAEAQLEEKLNRIGG

Query:  LEDEQGVWQQEKTAVIQVMTDYFQHLFSSSGP
        L+D++GVW +E+  V  ++ +Y+  LFS+S P
Subjt:  LEDEQGVWQQEKTAVIQVMTDYFQHLFSSSGP

XP_030974624.1 uncharacterized protein LOC115994558 [Quercus lobata], e value 8.6e-20, identity 27.94%
Query:  MSSAKRLLGFDNCFCVDCHGRSGGLAL-----------------------------LWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAIL
        M   K  +GF N F V   GRSGG+AL                             LWR+TGFYG P       SW LL+ L    + PWL  GDFN IL
Subjt:  MSSAKRLLGFDNCFCVDCHGRSGGLAL-----------------------------LWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAIL

Query:  CQDEKDGGRDKPLSELAAFQSVVDLCG------CRTWVLWETALLGVTDDQ-------EVRRFMSGWIGPLATRLGGPLSELCDL---------------
          DEK GG  +   ++  F++VVD CG      C T   W     G  + Q           ++  + G     L    S+ C L               
Subjt:  CQDEKDGGRDKPLSELAAFQSVVDLCG------CRTWVLWETALLGVTDDQ-------EVRRFMSGWIGPLATRLGGPLSELCDL---------------

Query:  -------------RQLVGSSWAAGPGESSDITPLAIANKAKRCMHSMVGWGRSKSGNFPRRIRSANQRVQSAIADLSTSDSRDVLVQAEAQLEEKLNRIG
                     R+++ S W  G   S   TP+ I    K C   +  W  S  GN P++I+S     ++A+++L+  D    L      L  +LN + 
Subjt:  -------------RQLVGSSWAAGPGESSDITPLAIANKAKRCMHSMVGWGRSKSGNFPRRIRSANQRVQSAIADLSTSDSRDVLVQAEAQLEEKLNRIG

Query:  GLEDEQGVWQQEKTA
         L+DE+  W Q   A
Subjt:  GLEDEQGVWQQEKTA

XP_042972966.1 uncharacterized protein LOC122304767 [Carya illinoinensis], e value 1.3e-23, identity 25.29%
Query:  MSSAKRLLGFDNCFCVDCHGRSGGLALLWR-----------------------------LTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAIL
        M S +R +G++ CF V C GRSGGLA+LW+                              TG YG P  ++  ++WS++  L+G    PWL+ GDFN +L
Subjt:  MSSAKRLLGFDNCFCVDCHGRSGGLALLWR-----------------------------LTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAIL

Query:  CQDEKDGGRDKPLSELAAFQSVVDLCGCRTWVLWETALLGVTDDQEVR----RFMSGWIGPLATRLGGPLSELCDLRQLVGSSWAAGPGESSDITPL-AI
        C  EK GG+ +P  ++ AF+ ++D C          +L+   + +  R    RF + W            +E  D  ++V  SW +  G++    P+   
Subjt:  CQDEKDGGRDKPLSELAAFQSVVDLCGCRTWVLWETALLGVTDDQEVR----RFMSGWIGPLATRLGGPLSELCDLRQLVGSSWAAGPGESSDITPL-AI

Query:  ANKAKRCMHSMVGWGRSKSGNFPRRIRSANQRVQSAIAD-------------------------------------LSTSDSRDVLVQAEAQLEEKLNRI
          + + C  S+  W + K G   ++I+ A   +Q  + D                                     L   D        +A L+     I
Subjt:  ANKAKRCMHSMVGWGRSKSGNFPRRIRSANQRVQSAIAD-------------------------------------LSTSDSRDVLVQAEAQLEEKLNRI

Query:  GGLEDEQGVWQ--QEKTAVIQVMTDYFQHLFSSSGPSARI
         GL+DE G WQ  Q + AVI    +YF++LFS++  +  +
Subjt:  GGLEDEQGVWQ--QEKTAVIQVMTDYFQHLFSSSGPSARI
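
The % identity column summarizes each alignment. In BLAST-style output the middle row of an alignment shows a letter where the two residues are identical and `+` where the substitution is scored as similar. A minimal sketch of deriving identity from such rows follows, assuming identity is counted over the full alignment length including gap columns. The function name `alignment_stats` is hypothetical, and the three rows are toy strings rather than data from this record.

```python
def alignment_stats(query, midline, subject):
    """Count identities, positives and gap columns in one aligned block."""
    if not (len(query) == len(midline) == len(subject)):
        raise ValueError("alignment rows must have equal length")
    length = len(query)
    # A letter in the midline marks an identical residue pair.
    identical = sum(1 for ch in midline if ch.isalpha())
    # '+' marks a conservative (positive-scoring) substitution.
    positives = identical + midline.count("+")
    # A '-' in either row marks a gap column.
    gaps = sum(1 for q, s in zip(query, subject) if q == "-" or s == "-")
    return {
        "length": length,
        "identical": identical,
        "positives": positives,
        "gaps": gaps,
        "identity_pct": 100.0 * identical / length,
    }


# Toy rows (illustrative only, not taken from the record).
stats = alignment_stats("MSSAKR-LG", "M   +R LG", "MTPVRRALG")
print(stats)
```

For a multi-block alignment, the rows of each block would be concatenated before counting, so that the percentage reflects the whole alignment rather than a single block.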

TrEMBL top hits (e value, % identity, alignment)
A0A2N9EFD5 Reverse transcriptase domain-containing protein, e value 5.8e-22, identity 25.43%
Query:  LGFDNCFCVDCHGRSGGLALL----------------------------WRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAILCQDEKDGG
        LG  NCF VD HG  GGLALL                            W++TGFYG+P       SWSLL +L   +  PW++ GDFN I+  DEK G 
Subjt:  LGFDNCFCVDCHGRSGGLALL----------------------------WRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAILCQDEKDGG

Query:  RDKPLSELAAFQSVVDLCGCRTWVLWETALLGVTDDQEVRRFMSGWIGPLATRLGGPLSELCDL---------------------------RQLVGSSWA
         D+ L+++AAF+ V++ CG +        L GV  +   + F   ++  L       +  L DL                            +++  +W+
Subjt:  RDKPLSELAAFQSVVDLCGCRTWVLWETALLGVTDDQEVRRFMSGWIGPLATRLGGPLSELCDL---------------------------RQLVGSSWA

Query:  AGPGESSDITPLAIANKAKRCMHSMVGWGRSKSGNFPRRIRSANQ------------------------------------RVQSAIADLSTSDSRDVLV
        A   E        +  K K+C   ++ W  S+    P+ + S                                       R +S ++ L   D      
Subjt:  AGPGESSDITPLAIANKAKRCMHSMVGWGRSKSGNFPRRIRSANQ------------------------------------RVQSAIADLSTSDSRDVLV

Query:  QAEAQLEEKLNRIGGLEDEQGVWQQEKTAVIQVMTDYFQHLFSSSG
         A A   ++ N I GL +EQGVWQ     +  +  DYF +LF +SG
Subjt:  QAEAQLEEKLNRIGGLEDEQGVWQQEKTAVIQVMTDYFQHLFSSSG

A0A2N9FB16 CCHC-type domain-containing protein, e value 6.9e-23, identity 26.65%
Query:  LGFDNCFCVDCHGRSGGLALL----------------------------WRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAILCQDEKDGG
        LG   C  V+ HG+ GGLALL                            WRLTGFYG+P A +  +SWSLL  LR  ++ PW+I GDFN I   +EK G 
Subjt:  LGFDNCFCVDCHGRSGGLALL----------------------------WRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAILCQDEKDGG

Query:  RDKPLSELAAFQSVVDLCGCRTWVLWETA--LLGVTDDQ----------EVRRFMSGWIGPLATRLGGPLSELCDLRQLVGSSWAAGPGESSDITPLAIA
         D+  +++AAF+  +  C  +     +    LL    DQ           + RF   W+            +     +++  +W   P          +A
Subjt:  RDKPLSELAAFQSVVDLCGCRTWVLWETA--LLGVTDDQ----------EVRRFMSGWIGPLATRLGGPLSELCDLRQLVGSSWAAGPGESSDITPLAIA

Query:  NKAKRCMHSMVGWGRSKSGNFPRRIRSANQRVQ------------------------------------SAIADLSTSDSRDVLVQAEAQLEEKLNRIGG
         K K+C   ++ W +S     P+ I S  +++Q                                    S I  L+  D         A   +K+N I G
Subjt:  NKAKRCMHSMVGWGRSKSGNFPRRIRSANQRVQ------------------------------------SAIADLSTSDSRDVLVQAEAQLEEKLNRIGG

Query:  LEDEQGVWQQEKTAVIQVMTDYFQHLFSSSGPSA
        L D+Q  W+ E   V Q+  DYF  LF+SS P A
Subjt:  LEDEQGVWQQEKTAVIQVMTDYFQHLFSSSGPSA

A0A2N9IPS8 Reverse transcriptase domain-containing protein, e value 5.6e-25, identity 25.41%
Query:  FDNCFCVDCHGRSGGLALLW-----------------------------RLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAILCQDEKDGGR
        FD  FCV   G  GGLA+LW                             RLTGFYG P      +SW+LL  L   + +PWL  GDFN IL  +E+ G  
Subjt:  FDNCFCVDCHGRSGGLALLW-----------------------------RLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAILCQDEKDGGR

Query:  DKPLSELAAFQSVVDLCGCR---------TWVLWETALLGVTDDQEVRRFMSGWI----GPLATRLGGPLSELCDL------------------------
         +P  ++  F+  V  CG           TW         VT   +       W+    G + + L    S+ C +                        
Subjt:  DKPLSELAAFQSVVDLCGCR---------TWVLWETALLGVTDDQEVRRFMSGWI----GPLATRLGGPLSELCDL------------------------

Query:  -----RQLVGSSWAAGPGESSDITPLAIANKAKRCMHSMVGWGRSKSGNFPRRIRSANQRVQ-----------------------------------SAI
             R+++  +W  G  E S +    +  K K C  S++GW R + G+    I+   +++Q                                   S +
Subjt:  -----RQLVGSSWAAGPGESSDITPLAIANKAKRCMHSMVGWGRSKSGNFPRRIRSANQRVQ-----------------------------------SAI

Query:  ADLSTSDSRDVLVQAEAQLEEKLNRIGGLEDEQGVWQQEKTAVIQVMTDYFQHLFSSSGPSA
        A +S  D       A+     + N I GL D  GVWQ EKT + ++  DYFQ +F+SS PSA
Subjt:  ADLSTSDSRDVLVQAEAQLEEKLNRIGGLEDEQGVWQQEKTAVIQVMTDYFQHLFSSSGPSA

A0A6J5X9X9 Uncharacterized protein, e value 1.3e-21, identity 26.51%
Query:  MSSAKRLLGFDNCFCVDCHGRSGGLALLW----------------------------RLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAILC
        M   +R LG   CF V   GR+GGLALLW                            R TGFYG P   +   SW LL R+ G  + PW++GGDFN IL 
Subjt:  MSSAKRLLGFDNCFCVDCHGRSGGLALLW----------------------------RLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAILC

Query:  QDEKDGGRDKPLSELAAFQSVVDLCGCRTWVLWETALLGVTDDQEVRRFMSGWIGPLATRLGGPLSELCDLRQLVGSSWAAGPGESSDITPL------AI
          E  G   +P+S++ AF+  ++ CG  T  +    +        +   +S  + PL   L          R L    W    G    I+         I
Subjt:  QDEKDGGRDKPLSELAAFQSVVDLCGCRTWVLWETALLGVTDDQEVRRFMSGWIGPLATRLGGPLSELCDLRQLVGSSWAAGPGESSDITPL------AI

Query:  ANKAKRCMHSMVGWGRSKSGNFPRRIRSANQRVQ-------SAIADLSTSD---SRDVLVQAE--------------------------AQLEEKLNRIG
          K K C  S+  W +   G  P ++RS  +++        S +A+    +     DVL++ E                          A++  + N I 
Subjt:  ANKAKRCMHSMVGWGRSKSGNFPRRIRSANQRVQ-------SAIADLSTSD---SRDVLVQAE--------------------------AQLEEKLNRIG

Query:  GLEDEQGVWQQEKTAVIQVMTDYFQHLFSSSG
         + D+QGVW     ++     DYFQ LF+S+G
Subjt:  GLEDEQGVWQQEKTAVIQVMTDYFQHLFSSSG

A0A7N2LT95 Reverse transcriptase domain-containing protein, e value 2.2e-21, identity 25.6%
Query:  MSSAKRLLGFDNCFCVDCHGRSGGLALLW----------------------------RLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAILC
        M S K  L FD+ F V   GR GGLALLW                            RLTGFYG P  +   + W++L  L    + PW+  GDFN +L 
Subjt:  MSSAKRLLGFDNCFCVDCHGRSGGLALLW----------------------------RLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAILC

Query:  QDEKDGGRDKPLSELAAFQSVVDLCG----------------CRTWVLWETALLGVTDDQEVRRFMSGWIG------------PLATRLGGPLSELCDLR
         +EK GG  +  +++  F+ V+D  G                 R  ++WE    GV +   + +F +  I              LA    G         
Subjt:  QDEKDGGRDKPLSELAAFQSVVDLCG----------------CRTWVLWETALLGVTDDQEVRRFMSGWIG------------PLATRLGGPLSELCDLR

Query:  QLVGSSWAAGPGESSDITP-----------LAIANKAKRCMHSMVGWGRSKSGNFPRRIRSANQRVQSAIAD--LSTSDSRDVLVQAEAQLEEKLNRIGG
            + W   PG    +T             A   K K+C   +  W R   G+    I+   +++  A  +  L + D         A   ++ N I G
Subjt:  QLVGSSWAAGPGESSDITP-----------LAIANKAKRCMHSMVGWGRSKSGNFPRRIRSANQRVQSAIAD--LSTSDSRDVLVQAEAQLEEKLNRIGG

Query:  LEDEQGVWQQEKTAVIQVMTDYFQHLFSSSGP
        L+D++GVW +E+  V  ++ +Y+  LFS+S P
Subjt:  LEDEQGVWQQEKTAVIQVMTDYFQHLFSSSGP

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTCTTCGGCAAAGCGTTTGTTGGGGTTTGATAACTGCTTTTGTGTTGACTGTCATGGACGGAGTGGTGGGTTGGCTCTCTTGTGGCGGCTCACTGGTTTCTATGGTTT
CCCTTCAGCAGATATGCACCCTCAGTCATGGTCTCTTCTGTCTAGACTGAGGGGCTGTGCTGAGACACCGTGGCTGATTGGTGGGGATTTTAACGCCATCCTATGTCAGG
ATGAGAAGGATGGGGGCAGGGATAAGCCGTTGTCTGAACTGGCTGCCTTTCAGAGTGTGGTTGATTTGTGTGGCTGCAGGACTTGGGTTTTGTGGGAGACTGCTTTACTT
GGTGTAACAGACGACCAGGAGGTGAGACGATTTATGAGCGGTTGGATCGGGCCTTTGGCAACTCGGCTTGGCGGACCTCTATCCGAACTATGTGATTTGCGGCAGTTGGT
TGGTAGTTCGTGGGCCGCAGGGCCTGGAGAGTCTAGTGATATAACGCCTTTGGCTATTGCCAATAAGGCTAAGAGATGTATGCATTCGATGGTTGGTTGGGGTCGATCAA
AGTCCGGGAACTTCCCGAGGCGCATCCGTAGTGCGAATCAGAGGGTTCAATCGGCCATCGCTGATCTTAGTACATCGGACTCTCGTGACGTGCTTGTTCAAGCTGAGGCT
CAGTTGGAGGAGAAGCTTAATCGCATTGGAGGTTTGGAGGATGAGCAGGGAGTGTGGCAGCAGGAGAAGACTGCAGTTATTCAGGTGATGACTGACTATTTCCAGCATCT
GTTTTCCTCATCGGGTCCGAGTGCCAGGATTTTGAGGTGGCGCTGCGAGATTTGA
mRNA sequence
ATGTCTTCGGCAAAGCGTTTGTTGGGGTTTGATAACTGCTTTTGTGTTGACTGTCATGGACGGAGTGGTGGGTTGGCTCTCTTGTGGCGGCTCACTGGTTTCTATGGTTT
CCCTTCAGCAGATATGCACCCTCAGTCATGGTCTCTTCTGTCTAGACTGAGGGGCTGTGCTGAGACACCGTGGCTGATTGGTGGGGATTTTAACGCCATCCTATGTCAGG
ATGAGAAGGATGGGGGCAGGGATAAGCCGTTGTCTGAACTGGCTGCCTTTCAGAGTGTGGTTGATTTGTGTGGCTGCAGGACTTGGGTTTTGTGGGAGACTGCTTTACTT
GGTGTAACAGACGACCAGGAGGTGAGACGATTTATGAGCGGTTGGATCGGGCCTTTGGCAACTCGGCTTGGCGGACCTCTATCCGAACTATGTGATTTGCGGCAGTTGGT
TGGTAGTTCGTGGGCCGCAGGGCCTGGAGAGTCTAGTGATATAACGCCTTTGGCTATTGCCAATAAGGCTAAGAGATGTATGCATTCGATGGTTGGTTGGGGTCGATCAA
AGTCCGGGAACTTCCCGAGGCGCATCCGTAGTGCGAATCAGAGGGTTCAATCGGCCATCGCTGATCTTAGTACATCGGACTCTCGTGACGTGCTTGTTCAAGCTGAGGCT
CAGTTGGAGGAGAAGCTTAATCGCATTGGAGGTTTGGAGGATGAGCAGGGAGTGTGGCAGCAGGAGAAGACTGCAGTTATTCAGGTGATGACTGACTATTTCCAGCATCT
GTTTTCCTCATCGGGTCCGAGTGCCAGGATTTTGAGGTGGCGCTGCGAGATTTGA
Protein sequence
MSSAKRLLGFDNCFCVDCHGRSGGLALLWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAILCQDEKDGGRDKPLSELAAFQSVVDLCGCRTWVLWETALL
GVTDDQEVRRFMSGWIGPLATRLGGPLSELCDLRQLVGSSWAAGPGESSDITPLAIANKAKRCMHSMVGWGRSKSGNFPRRIRSANQRVQSAIADLSTSDSRDVLVQAEA
QLEEKLNRIGGLEDEQGVWQQEKTAVIQVMTDYFQHLFSSSGPSARILRWRCEI
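
The protein sequence corresponds to the CDS read three bases at a time. A minimal sketch of that translation follows, assuming the standard nuclear genetic code (NCBI translation table 1). The helper name `translate` is hypothetical, and the two short inputs are the first 18 and last 12 bases of the CDS listed in this record.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        # Codons with ambiguous bases (e.g. N) are reported as X.
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTCTTCGGCAAAGCGT"))  # start of the CDS
print(translate("TGCGAGATTTGA"))        # end of the CDS, including the stop codon
```

The two fragments translate to MSSAKR and CEI, which match the first six and last three residues of the protein sequence listed above.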