CuGenDBv2

Lag0016223 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0016223
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr12:34924375..34925242
RNA-Seq Expression: Lag0016223
Synteny: Lag0016223
Gene Ontology terms: NA
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAF5443558.1 hypothetical protein F2P56_036105, partial [Juglans regia]; e value: 8.2e-18; % identity: 42.98
Query:  EVNVSIRNFSIHPIDANI-SWNGYTLRFTRLYGQPDSSFRYETWELLKQLNNHDASAWVVGRDFNEILWNSKKVGGPERDQRLIQNFRNALDDCSFRDLV
        ++ V +++FS+H IDA I   +G   RFT +YG P+   RY TW LL++LN+     W+VG DFNE+L  ++K GG  R +  ++ FRN + DCS RDL 
Subjt:  EVNVSIRNFSIHPIDANI-SWNGYTLRFTRLYGQPDSSFRYETWELLKQLNNHDASAWVVGRDFNEILWNSKKVGGPERDQRLIQNFRNALDDCSFRDLV

Query:  CHGGLFTWCKEEDG
          G  +TWC    G
Subjt:  CHGGLFTWCKEEDG

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]; e value: 5.5e-22; % identity: 34.43
Query:  GYTLRFTRLYGQPDSSFRYETWELLKQLNNHDASAWVVGRDFNEILWNSKKVGGPERDQRLIQNFRNALDDCSFRDLVCHGGLFTWCKE-----------
        G++ RFT  YG P +  R+ TWELL++++N DAS W++G D N ILWN +       D   I+ FRN +D CS  D+   GG+FTWC             
Subjt:  GYTLRFTRLYGQPDSSFRYETWELLKQLNNHDASAWVVGRDFNEILWNSKKVGGPERDQRLIQNFRNALDDCSFRDLVCHGGLFTWCKE-----------

Query:  -----EDGVIKLVPDWIVSLQIQNYTLHLHHTLARCVGALRKWGKRQNSDLRSRIKILRDQIKAEYAKPLLLDFSVIHELESN
              D    + PD         +      ++     ALR WG+    DL  +IK  +  I   Y +PL LDF++IH LE++
Subjt:  -----EDGVIKLVPDWIVSLQIQNYTLHLHHTLARCVGALRKWGKRQNSDLRSRIKILRDQIKAEYAKPLLLDFSVIHELESN

XP_030958760.1 uncharacterized protein LOC115980671 [Quercus lobata]; e value: 1.4e-17; % identity: 43.1
Query:  RYEVNVSIRNFSIHPIDANISWN-GYTLRFTRLYGQPDSSFRYETWELLKQLNNHDASAWVVGRDFNEILWNSKKVGGPERDQRLIQNFRNALDDCSFRD
        R E+NV +++FS   IDA ++ + G+  R T  YG P+   R E+WELLK L+      W+   DFNEI+  S+K+GG  R QR + +FR A+D C F D
Subjt:  RYEVNVSIRNFSIHPIDANISWN-GYTLRFTRLYGQPDSSFRYETWELLKQLNNHDASAWVVGRDFNEILWNSKKVGGPERDQRLIQNFRNALDDCSFRD

Query:  LVCHGGLFTWCKEEDG
        L   G  FTWC  ++G
Subjt:  LVCHGGLFTWCKEEDG

XP_030970961.1 uncharacterized protein LOC115991405 [Quercus lobata]; e value: 2.4e-17; % identity: 43.1
Query:  RYEVNVSIRNFSIHPIDANISWN-GYTLRFTRLYGQPDSSFRYETWELLKQLNNHDASAWVVGRDFNEILWNSKKVGGPERDQRLIQNFRNALDDCSFRD
        R E+NV +++FS   IDA ++ + G+  R T  YG P+   R E+WELLK L+      W+   DFNEI+  S+K+GG  R QR + +FR A+D C F D
Subjt:  RYEVNVSIRNFSIHPIDANISWN-GYTLRFTRLYGQPDSSFRYETWELLKQLNNHDASAWVVGRDFNEILWNSKKVGGPERDQRLIQNFRNALDDCSFRD

Query:  LVCHGGLFTWCKEEDG
        L   G  FTWC  ++G
Subjt:  LVCHGGLFTWCKEEDG

XP_042972796.1 uncharacterized protein LOC122304603 [Carya illinoinensis]; e value: 6.3e-18; % identity: 43.8
Query:  RYEVNVSIRNFSIHPIDANISW--NGYTLRFTRLYGQPDSSFRYETWELLKQLNNHDASAWVVGRDFNEILWNSKKVGGPERDQRLIQNFRNALDDCSFR
        R E+ +SI++FS+  IDA IS    G   +FT LYG  +   R ETW LL+ L       W+V  DFNE+L   +K+GG  R + L+Q FR+ LDDC+  
Subjt:  RYEVNVSIRNFSIHPIDANISW--NGYTLRFTRLYGQPDSSFRYETWELLKQLNNHDASAWVVGRDFNEILWNSKKVGGPERDQRLIQNFRNALDDCSFR

Query:  DLVCHGGLFTWC--KEEDGVI
        DL   G  +TWC  + EDGV+
Subjt:  DLVCHGGLFTWC--KEEDGVI

TrEMBL top hits (e value, % identity, alignment)
A0A2N9EFF7 Uncharacterized protein; e value: 8.9e-18; % identity: 34.83
Query:  EVNVSIRNFSIHPIDANISWN-GYTLRFTRLYGQPDSSFRYETWELLKQLNNHDASAWVVGRDFNEILWNSKKVGGPERDQRLIQNFRNALDDCSFRDLV
        EV+V I+++S   IDA I  N G + RFT  YG PD   + E+W+LL++L +  +  W++  DFNEI+ N +K+G   R QR ++ FR AL DC   DL 
Subjt:  EVNVSIRNFSIHPIDANISWN-GYTLRFTRLYGQPDSSFRYETWELLKQLNNHDASAWVVGRDFNEILWNSKKVGGPERDQRLIQNFRNALDDCSFRDLV

Query:  CHGGLFTWC--KEEDGVIKLVPDWIVSLQIQNYTLHLHHTLARCV--------GALRKWGKRQNSDLRSRIKILRDQI
          G  FTWC  +  + V+      +  L  Q +    H T   C+         AL +W K+    L   IK L+  +
Subjt:  CHGGLFTWC--KEEDGVIKLVPDWIVSLQIQNYTLHLHHTLARCV--------GALRKWGKRQNSDLRSRIKILRDQI

A0A2N9IXK4 RNase H domain-containing protein; e value: 3.1e-18; % identity: 42.98
Query:  RYEVNVSIRNFSIHPIDANI------SWNGYTLRFTRLYGQPDSSFRYETWELLKQLNNHDASAWVVGRDFNEILWNSKKVGGPERDQRLIQNFRNALDD
        + E +VSI++FS H IDA I      SW     RFT  YG P++  R+E+W LL+ L++  +  W    DFNE+L   +K GGP R  R +Q+FR+A+D 
Subjt:  RYEVNVSIRNFSIHPIDANI------SWNGYTLRFTRLYGQPDSSFRYETWELLKQLNNHDASAWVVGRDFNEILWNSKKVGGPERDQRLIQNFRNALDD

Query:  CSFRDLVCHGGLFTWCKEEDG
        C F DL  +G  FTWC    G
Subjt:  CSFRDLVCHGGLFTWCKEEDG

A0A6J1DX30 uncharacterized protein LOC111024874; e value: 2.7e-22; % identity: 34.43
Query:  GYTLRFTRLYGQPDSSFRYETWELLKQLNNHDASAWVVGRDFNEILWNSKKVGGPERDQRLIQNFRNALDDCSFRDLVCHGGLFTWCKE-----------
        G++ RFT  YG P +  R+ TWELL++++N DAS W++G D N ILWN +       D   I+ FRN +D CS  D+   GG+FTWC             
Subjt:  GYTLRFTRLYGQPDSSFRYETWELLKQLNNHDASAWVVGRDFNEILWNSKKVGGPERDQRLIQNFRNALDDCSFRDLVCHGGLFTWCKE-----------

Query:  -----EDGVIKLVPDWIVSLQIQNYTLHLHHTLARCVGALRKWGKRQNSDLRSRIKILRDQIKAEYAKPLLLDFSVIHELESN
              D    + PD         +      ++     ALR WG+    DL  +IK  +  I   Y +PL LDF++IH LE++
Subjt:  -----EDGVIKLVPDWIVSLQIQNYTLHLHHTLARCVGALRKWGKRQNSDLRSRIKILRDQIKAEYAKPLLLDFSVIHELESN

A0A803P4U9 Uncharacterized protein; e value: 1.5e-20; % identity: 30.38
Query:  VNVSIRNFSIHPIDANISWN-GYTLRFTRLYGQPDSSFRYETWELLKQLNNHDASAWVVGRDFNEILWNSKKVGGPERDQRLIQNFRNALDDCSFRDLVC
        V V +++F++  IDA +  + G+T RFT  YG PD   R E+W+LLK+L      AWV G DFNEI  N++K GG ++   L+ NFR  + +C  R++  
Subjt:  VNVSIRNFSIHPIDANISWN-GYTLRFTRLYGQPDSSFRYETWELLKQLNNHDASAWVVGRDFNEILWNSKKVGGPERDQRLIQNFRNALDDCSFRDLVC

Query:  HGGLFTWCK--------EEDGVIKLVPDWIVSLQIQN---------------YTLHLHH-----------------------------------------
         GG+FTWC         E+   I    DW  + ++ +                T HLH+                                         
Subjt:  HGGLFTWCK--------EEDGVIKLVPDWIVSLQIQN---------------YTLHLHH-----------------------------------------

Query:  ------TLARCVGALRKWGKRQNSDLRSRIKILRDQI
               L  C   L KW KRQ SDL  RIK L+D+I
Subjt:  ------TLARCVGALRKWGKRQNSDLRSRIKILRDQI

A0A803PY54 Uncharacterized protein; e value: 6.8e-18; % identity: 42.2
Query:  EVNVSIRNFSIHPIDANISWN-GYTLRFTRLYGQPDSSFRYETWELLKQLNNHDASAWVVGRDFNEILWNSKKVGGPERDQRLIQNFRNALDDCSFRDLV
        + +V I++F+   IDA +  + G+T RFT  YG PD   R E+W+LLK++      AW+ G DFNEI+ N +K GG  + + L++NFR A+ DC  +++ 
Subjt:  EVNVSIRNFSIHPIDANISWN-GYTLRFTRLYGQPDSSFRYETWELLKQLNNHDASAWVVGRDFNEILWNSKKVGGPERDQRLIQNFRNALDDCSFRDLV

Query:  CHGGLFTWC
          GG FTWC
Subjt:  CHGGLFTWC

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGCCTTTTGTGGAAAGATATGAGGTGAATGTCTCAATTCGTAATTTTTCAATTCACCCTATAGATGCTAATATAAGTTGGAATGGGTATACGTTGAGGTTCACCAGGTT
GTACGGACAACCTGATTCCAGTTTTCGGTATGAGACTTGGGAACTTCTAAAACAATTAAACAATCATGATGCTTCTGCATGGGTTGTGGGAAGGGATTTCAATGAGATAT
TGTGGAACTCGAAAAAAGTTGGAGGCCCTGAACGTGATCAACGATTGATTCAAAATTTCAGAAATGCTTTGGATGATTGCTCCTTCCGAGATCTAGTTTGTCATGGAGGA
CTATTCACATGGTGTAAAGAAGAGGATGGGGTGATCAAGTTAGTACCCGACTGGATCGTTTCCTTGCAAATCCAAAATTATACACTGCACTTGCATCACACATTGGCCAG
GTGTGTGGGGGCTCTCAGAAAATGGGGGAAAAGACAAAACTCGGATCTTAGGAGCCGGATCAAAATTCTAAGGGACCAAATTAAAGCAGAATATGCAAAGCCACTCCTTT
TGGATTTCTCTGTTATTCATGAGTTAGAATCCAATCCAGACTCCTATCTTCATGAGGAGGGATGTACTGGCGCCAATGATCTTGGGAGAACTGGCTGA
mRNA sequence
ATGCCTTTTGTGGAAAGATATGAGGTGAATGTCTCAATTCGTAATTTTTCAATTCACCCTATAGATGCTAATATAAGTTGGAATGGGTATACGTTGAGGTTCACCAGGTT
GTACGGACAACCTGATTCCAGTTTTCGGTATGAGACTTGGGAACTTCTAAAACAATTAAACAATCATGATGCTTCTGCATGGGTTGTGGGAAGGGATTTCAATGAGATAT
TGTGGAACTCGAAAAAAGTTGGAGGCCCTGAACGTGATCAACGATTGATTCAAAATTTCAGAAATGCTTTGGATGATTGCTCCTTCCGAGATCTAGTTTGTCATGGAGGA
CTATTCACATGGTGTAAAGAAGAGGATGGGGTGATCAAGTTAGTACCCGACTGGATCGTTTCCTTGCAAATCCAAAATTATACACTGCACTTGCATCACACATTGGCCAG
GTGTGTGGGGGCTCTCAGAAAATGGGGGAAAAGACAAAACTCGGATCTTAGGAGCCGGATCAAAATTCTAAGGGACCAAATTAAAGCAGAATATGCAAAGCCACTCCTTT
TGGATTTCTCTGTTATTCATGAGTTAGAATCCAATCCAGACTCCTATCTTCATGAGGAGGGATGTACTGGCGCCAATGATCTTGGGAGAACTGGCTGA
Protein sequence
MPFVERYEVNVSIRNFSIHPIDANISWNGYTLRFTRLYGQPDSSFRYETWELLKQLNNHDASAWVVGRDFNEILWNSKKVGGPERDQRLIQNFRNALDDCSFRDLVCHGG
LFTWCKEEDGVIKLVPDWIVSLQIQNYTLHLHHTLARCVGALRKWGKRQNSDLRSRIKILRDQIKAEYAKPLLLDFSVIHELESNPDSYLHEEGCTGANDLGRTG