; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022600 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022600
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr7:34212488..34213081
RNA-Seq ExpressionLag0022600
SyntenyLag0022600
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR005135 - Endonuclease/exonuclease/phosphatase
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA3482596.1 reverse transcriptase [Gossypium australe]2.1e-2036.92Show/hide
Query:  MNILFWNVRGLGSPRAFRRLYKLVQQHRPHMVFLSETMVHSSRFDSLKVSWGM---LTVFCE-----LCWQEWRLSIAVELGGPFQSPIILSQPHRWVGG
        M I+ WNVRGLG+PRA RRL  L++QH P MVF  ET V+S R + ++   G    + V  E     LC   WR    V L    ++ I +      V G
Subjt:  MNILFWNVRGLGSPRAFRRLYKLVQQHRPHMVFLSETMVHSSRFDSLKVSWGM---LTVFCE-----LCWQEWRLSIAVELGGPFQSPIILSQPHRWVGG

Query:  RRGESVAVFWDLWFPPSGVEGEDMGAT----EALAWEPGTPWLLGGDFNAILSHQEKDGGRAKSGAELAGFQGAIDDCELIDLGFRGPAFTWCNG
                 W        +   D  A+     AL  E   PW++ GDFN IL   EK GG+ +    +A F+  +DDC+L+DLGF+G  FTW  G
Subjt:  RRGESVAVFWDLWFPPSGVEGEDMGAT----EALAWEPGTPWLLGGDFNAILSHQEKDGGRAKSGAELAGFQGAIDDCELIDLGFRGPAFTWCNG

KAF5449841.1 hypothetical protein F2P56_030246 [Juglans regia]2.3e-2234.45Show/hide
Query:  MNILFWNVRGLGSPRAFRRLYKLVQQHRPHMVFLSETMVHSSRFDSLKVSWGMLTVFCELCWQE-------WRLSIAVELGGPFQSPIILSQPHRWVGGR
        M IL WN  GLG+P+  R L  L+ +  P +VFL ET + +    + K  +G +  F   C          W+  I+V +          S+ H  +   
Subjt:  MNILFWNVRGLGSPRAFRRLYKLVQQHRPHMVFLSETMVHSSRFDSLKVSWGMLTVFCELCWQE-------WRLSIAVELGGPFQSPIILSQPHRWVGGR

Query:  RGESVAVFWDLWFPPSGVEGEDMGATEALAWE------PGT--PWLLGGDFNAILSHQEKDGGRAKSGAELAGFQGAIDDCELIDLGFRGPAFTWCNGRP
          ES    W      +GV G    +   L W       P T  PWL+GGDFN +L   EK GGR    A+L  F+  ++DC L DLGF+GP FTWCNGR 
Subjt:  RGESVAVFWDLWFPPSGVEGEDMGATEALAWE------PGT--PWLLGGDFNAILSHQEKDGGRAKSGAELAGFQGAIDDCELIDLGFRGPAFTWCNGRP

Query:  GGEQFGRGL
        G ++    L
Subjt:  GGEQFGRGL

XP_028075737.1 uncharacterized protein LOC114277953 [Camellia sinensis]3.6e-2036.1Show/hide
Query:  MNILFWNVRGLGSPRAFRRLYKLVQQHRPHMVFLSETMVHSSRFDSLKVSWGMLTVF----------CELCW-QEWRLSIAVELGGPFQSPIILSQPHRW
        M IL WN RGLG+PR  R L  L+++  P MVFL ET + +   + ++   G+   F            L W  E +L I     G   S IILS     
Subjt:  MNILFWNVRGLGSPRAFRRLYKLVQQHRPHMVFLSETMVHSSRFDSLKVSWGMLTVF----------CELCW-QEWRLSIAVELGGPFQSPIILSQPHRW

Query:  VGGRRGESVAVFWDLWFPPSGVEGEDMGATEALAWE--------PGTPWLLGGDFNAILSHQEKDGGRAKSGAELAGFQGAIDDCELIDLGFRGPAFTWC
              ES    W      +G  G  + +  + +WE           PWL  GDFN IL   EK G  A+S  ++ GF+  + DC+L DLGF G AFTWC
Subjt:  VGGRRGESVAVFWDLWFPPSGVEGEDMGATEALAWE--------PGTPWLLGGDFNAILSHQEKDGGRAKSGAELAGFQGAIDDCELIDLGFRGPAFTWC

Query:  NGRPG
        NGR G
Subjt:  NGRPG

XP_042939411.1 uncharacterized protein LOC122274439 [Carya illinoinensis]1.2e-2033.33Show/hide
Query:  MNILFWNVRGLGSPRAFRRLYKLVQQHRPHMVFLSETMVHSSRFDSLKVS---WGMLTVFCE-------LCWQEWRLSIAVELGGPFQSPIILSQPHR--
        M+ + WN RGLG+P   + L  LV++  P  +FL ET +HS   + LK     +G L V CE       L W+     + V+    F     +++     
Subjt:  MNILFWNVRGLGSPRAFRRLYKLVQQHRPHMVFLSETMVHSSRFDSLKVS---WGMLTVFCE-------LCWQEWRLSIAVELGGPFQSPIILSQPHR--

Query:  ---WVGGRRGE-SVAVFWDLWFPPSGVEGEDMGATEALAWEPGTPWLLGGDFNAILSHQEKDGGRAKSGAELAGFQGAIDDCELIDLGFRGPAFTWCNGR
           W+ G  G+  V+  ++ W         D+  +  +  + G  WLL GDFN I+S+ EK GGR +S +++  F+  I++C L DLGF G  FTWCN R
Subjt:  ---WVGGRRGE-SVAVFWDLWFPPSGVEGEDMGATEALAWEPGTPWLLGGDFNAILSHQEKDGGRAKSGAELAGFQGAIDDCELIDLGFRGPAFTWCNGR

Query:  PGGE
         GG+
Subjt:  PGGE

XP_042944747.1 uncharacterized protein LOC122278633 [Carya illinoinensis]5.6e-2131.6Show/hide
Query:  MNILFWNVRGLGSPRAFRRLYKLVQQHRPHMVFLSETMVHSSRFDSLKVS---WGMLTVFCE-------LCWQEWRLSIAVELGGPFQSPIILSQPHRWV
        M+ + WN RGLG+P   + L  LV++  P  +FL ET +HS   + LK     +G L V CE       L W+    ++ ++    F     +++     
Subjt:  MNILFWNVRGLGSPRAFRRLYKLVQQHRPHMVFLSETMVHSSRFDSLKVS---WGMLTVFCE-------LCWQEWRLSIAVELGGPFQSPIILSQPHRWV

Query:  GGRRGESVAVFWDLWFPPSGVEGEDMGATEALAWE--------PGTPWLLGGDFNAILSHQEKDGGRAKSGAELAGFQGAIDDCELIDLGFRGPAFTWCN
            GE+   +W      +G+ G+   +    +W+            WLL GDFN I+S+ EK GGR++S +++  F+  I++C L DLGF G  FTWCN
Subjt:  GGRRGESVAVFWDLWFPPSGVEGEDMGATEALAWE--------PGTPWLLGGDFNAILSHQEKDGGRAKSGAELAGFQGAIDDCELIDLGFRGPAFTWCN

Query:  GRPGGEQFGRGL
         R GG+     L
Subjt:  GRPGGEQFGRGL

TrEMBL top hitse value%identityAlignment
A0A2I0A4A6 Uncharacterized protein6.7e-2034.48Show/hide
Query:  MNILFWNVRGLGSPRAFRRLYKLVQQHRPHMVFLSETMVHSSRFDSLKVSWGMLTVFCELCWQE-------WRLSIAVELGGPFQSPIILSQPHRWVGGR
        MNI   N RGLG PRA +RL   ++  +P +VFLSET + +S FD +K++    + F   C          W     V L     S I +S  H      
Subjt:  MNILFWNVRGLGSPRAFRRLYKLVQQHRPHMVFLSETMVHSSRFDSLKVSWGMLTVFCELCWQE-------WRLSIAVELGGPFQSPIILSQPHRWVGGR

Query:  RGESVAVFWDLWFPPSGVEGEDMGATEALAWE--------PGTPWLLGGDFNAILSHQEKDGGRAKSGAELAGFQGAIDDCELIDLGFRGPAFTWCNGRP
           S AV W      +G  GE + + + L+W+           PWL  GDFN +L   EK GG  +   +   F+  ++ C+LID+GF G  FTW NGR 
Subjt:  RGESVAVFWDLWFPPSGVEGEDMGATEALAWE--------PGTPWLLGGDFNAILSHQEKDGGRAKSGAELAGFQGAIDDCELIDLGFRGPAFTWCNGRP

Query:  GGE
        G E
Subjt:  GGE

A0A2N9I239 RNase H domain-containing protein1.1e-1931.58Show/hide
Query:  MNILFWNVRGLGSPRAFRRLYKLVQQHRPHMVFLSETMVHSSRFDSLKVSWGM--------------LTVFCELCWQEWRLSIAVELGGPFQSPIIL---
        M+ LFWN RGLG+P   + L  +V+Q  P  +F+SET + + + + L+  WG               L +F       WR  ++V +    Q  I     
Subjt:  MNILFWNVRGLGSPRAFRRLYKLVQQHRPHMVFLSETMVHSSRFDSLKVSWGM--------------LTVFCELCWQEWRLSIAVELGGPFQSPIIL---

Query:  -SQPHRW-----VGGRRGESVAVFWDLWFPPSGVEGEDMGATEALAWEPGTPWLLGGDFNAILSHQEKDGGRAKSGAELAGFQGAIDDCELIDLGFRGPA
          +P  W      G        V WD+                 L+     PWL GGDFN +L  +EK G  A+S A++A F+  +DDC  +DLGF GP 
Subjt:  -SQPHRW-----VGGRRGESVAVFWDLWFPPSGVEGEDMGATEALAWEPGTPWLLGGDFNAILSHQEKDGGRAKSGAELAGFQGAIDDCELIDLGFRGPA

Query:  FTWCNGRPG
        +TW N R G
Subjt:  FTWCNGRPG

A0A2N9IFR8 RNase H domain-containing protein1.9e-1931.25Show/hide
Query:  MNILFWNVRGLGSPRAFRRLYKLVQQHRPHMVFLSETMVHSSRFDSLKVSWGM--------------LTVFCELCWQEWRLSIAVELGGPFQSPIILSQP
        M+ LFWN RGLG+P   + L  +V+Q  P  +F+SET +   + + L+  WG               L +F       WR  ++V          + S  
Subjt:  MNILFWNVRGLGSPRAFRRLYKLVQQHRPHMVFLSETMVHSSRFDSLKVSWGM--------------LTVFCELCWQEWRLSIAVELGGPFQSPIILSQP

Query:  HRWVGGRRGESVAVFWDLWFPPSGVEGEDMGATEALAWE--------PGTPWLLGGDFNAILSHQEKDGGRAKSGAELAGFQGAIDDCELIDLGFRGPAF
        H  +     +     W +    +G  G    A + +AW+           PWL GGDFN +L  +EK G  A+S A++A F+  +DDC  +DLGF GP +
Subjt:  HRWVGGRRGESVAVFWDLWFPPSGVEGEDMGATEALAWE--------PGTPWLLGGDFNAILSHQEKDGGRAKSGAELAGFQGAIDDCELIDLGFRGPAF

Query:  TWCNGRPG
        TW N R G
Subjt:  TWCNGRPG

A0A5B6WN25 Reverse transcriptase1.0e-2036.92Show/hide
Query:  MNILFWNVRGLGSPRAFRRLYKLVQQHRPHMVFLSETMVHSSRFDSLKVSWGM---LTVFCE-----LCWQEWRLSIAVELGGPFQSPIILSQPHRWVGG
        M I+ WNVRGLG+PRA RRL  L++QH P MVF  ET V+S R + ++   G    + V  E     LC   WR    V L    ++ I +      V G
Subjt:  MNILFWNVRGLGSPRAFRRLYKLVQQHRPHMVFLSETMVHSSRFDSLKVSWGM---LTVFCE-----LCWQEWRLSIAVELGGPFQSPIILSQPHRWVGG

Query:  RRGESVAVFWDLWFPPSGVEGEDMGAT----EALAWEPGTPWLLGGDFNAILSHQEKDGGRAKSGAELAGFQGAIDDCELIDLGFRGPAFTWCNG
                 W        +   D  A+     AL  E   PW++ GDFN IL   EK GG+ +    +A F+  +DDC+L+DLGF+G  FTW  G
Subjt:  RRGESVAVFWDLWFPPSGVEGEDMGAT----EALAWEPGTPWLLGGDFNAILSHQEKDGGRAKSGAELAGFQGAIDDCELIDLGFRGPAFTWCNG

F4NCI4 Reverse transcriptase domain-containing protein7.4e-1935.53Show/hide
Query:  NILFWNVRGLGSPRAFRRLYKLVQQHRPHMVFLSETMVHSSRFDSL--KVSW-GMLTVFCE-LCWQE-------WRLSIAVELGGPFQSPIILSQPHRWV
        +IL WN RG+GSP A   L +L+    P +VFLSET + S   +S+  K+ W  M+ V CE  C +        WR  I V++     + I +      V
Subjt:  NILFWNVRGLGSPRAFRRLYKLVQQHRPHMVFLSETMVHSSRFDSL--KVSW-GMLTVFCE-LCWQE-------WRLSIAVELGGPFQSPIILSQPHRWV

Query:  GGRRGESVAVFWDLWFPPSGVEGEDMGA-TEALAWEPGTPWLLGGDFNAILSHQEKDGGRAKSGAELAGFQGAIDDCELIDLGFRGPAFTWCNGRPG
         G   +    F  ++  P     +  GA   ALA     PWL GGDFN +L   EK GG   +  E   F+ A+++C  +DLGF G  FTW N R G
Subjt:  GGRRGESVAVFWDLWFPPSGVEGEDMGA-TEALAWEPGTPWLLGGDFNAILSHQEKDGGRAKSGAELAGFQGAIDDCELIDLGFRGPAFTWCNGRPG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATATCCTATTTTGGAATGTACGGGGTTTGGGGTCACCGCGTGCATTCCGGCGGTTGTACAAGCTGGTACAACAACATAGACCTCACATGGTGTTTCTGTCAGAGAC
AATGGTCCATTCGTCACGTTTTGATTCTCTTAAGGTAAGTTGGGGTATGCTAACTGTTTTCTGTGAGCTGTGTTGGCAGGAGTGGCGGCTTAGCATTGCTGTGGAGCTCG
GAGGTCCGTTTCAGTCTCCTATCATACTCTCCCAACCACATAGATGGGTGGGTGGACGGAGGGGTGAGTCCGTGGCGGTTTTCTGGGATTTATGGTTTCCCCCAAGCGGA
GTTGAAGGTGAGGACATGGGCGCTACTGAGGCACTTGCATGGGAACCAGGGACGCCGTGGTTGCTGGGTGGAGACTTTAATGCCATTTTGTCTCATCAGGAGAAGGATGG
TGGTAGGGCTAAGTCTGGGGCTGAACTGGCAGGCTTCCAGGGTGCAATTGATGATTGTGAACTAATTGATTTGGGTTTCAGGGGTCCTGCTTTTACTTGGTGTAATGGGA
GGCCGGGAGGGGAACAGTTTGGGAGAGGATTGACAGATGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATATCCTATTTTGGAATGTACGGGGTTTGGGGTCACCGCGTGCATTCCGGCGGTTGTACAAGCTGGTACAACAACATAGACCTCACATGGTGTTTCTGTCAGAGAC
AATGGTCCATTCGTCACGTTTTGATTCTCTTAAGGTAAGTTGGGGTATGCTAACTGTTTTCTGTGAGCTGTGTTGGCAGGAGTGGCGGCTTAGCATTGCTGTGGAGCTCG
GAGGTCCGTTTCAGTCTCCTATCATACTCTCCCAACCACATAGATGGGTGGGTGGACGGAGGGGTGAGTCCGTGGCGGTTTTCTGGGATTTATGGTTTCCCCCAAGCGGA
GTTGAAGGTGAGGACATGGGCGCTACTGAGGCACTTGCATGGGAACCAGGGACGCCGTGGTTGCTGGGTGGAGACTTTAATGCCATTTTGTCTCATCAGGAGAAGGATGG
TGGTAGGGCTAAGTCTGGGGCTGAACTGGCAGGCTTCCAGGGTGCAATTGATGATTGTGAACTAATTGATTTGGGTTTCAGGGGTCCTGCTTTTACTTGGTGTAATGGGA
GGCCGGGAGGGGAACAGTTTGGGAGAGGATTGACAGATGTTTGA
Protein sequenceShow/hide protein sequence
MNILFWNVRGLGSPRAFRRLYKLVQQHRPHMVFLSETMVHSSRFDSLKVSWGMLTVFCELCWQEWRLSIAVELGGPFQSPIILSQPHRWVGGRRGESVAVFWDLWFPPSG
VEGEDMGATEALAWEPGTPWLLGGDFNAILSHQEKDGGRAKSGAELAGFQGAIDDCELIDLGFRGPAFTWCNGRPGGEQFGRGLTDV