; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025104 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025104
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr10:8583212..8583709
RNA-Seq ExpressionLag0025104
SyntenyLag0025104
Gene Ontology termsNA
InterPro domainsIPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_023905101.1 uncharacterized protein LOC112016863 [Quercus suber]3.0e-2036.71Show/hide
Query:  MQEFRDAIDDCSLSDMMFRGHPFTWYKLRKGVVAMQERLDRGLCNDPFWMMFPFSSVHHLKFGLSDHCPIRIDVDGSGVPVGGGNRPLFRFEEVWTQYDE
        MQ+FR+ ID+C   DM F G  FTW +  +   ++ ERLDRGL  + F++ FP + +HHL+   SDH P+ I++ G   P     +  FRFEE+W   ++
Subjt:  MQEFRDAIDDCSLSDMMFRGHPFTWYKLRKGVVAMQERLDRGLCNDPFWMMFPFSSVHHLKFGLSDHCPIRIDVDGSGVPVGGGNRPLFRFEEVWTQYDE

Query:  FREMVERGW--GMDSRVNQSLMASWKERLNKCAKHLSSWGKTKKGNYGRRIQEARGRL
          EMVE  W  G   +   SL  S  ++++KC+K L  W +   GN  R +++ +  L
Subjt:  FREMVERGW--GMDSRVNQSLMASWKERLNKCAKHLSSWGKTKKGNYGRRIQEARGRL

XP_030930710.1 uncharacterized protein LOC115956488 [Quercus lobata]2.3e-2036.59Show/hide
Query:  MQEFRDAIDDCSLSDMMFRGHPFTWYKLRKGVVAMQERLDRGLCNDPFWMMFPFSSVHHLKFGLSDHCPIRIDVDGSGVPVGGGNRPL--FRFEEVWTQY
        M +FR+A+D C+L DM F G+PFTW   R G+   +ERLDR + N  + + FP S++ HL    SDH P+ +     G   G  NR +  F+FEE W  +
Subjt:  MQEFRDAIDDCSLSDMMFRGHPFTWYKLRKGVVAMQERLDRGLCNDPFWMMFPFSSVHHLKFGLSDHCPIRIDVDGSGVPVGGGNRPL--FRFEEVWTQY

Query:  DEFREMVERGWGMDSRVNQSLMASWKERLNKCAKHLSSWGKTKKGNYGRRIQEARGRLQMEIER
        +E  ++V   W  D R   SL A+ KE++  C   L +WG ++     + I+    RLQ  +E+
Subjt:  DEFREMVERGWGMDSRVNQSLMASWKERLNKCAKHLSSWGKTKKGNYGRRIQEARGRLQMEIER

XP_042942839.1 uncharacterized protein LOC122277021 [Carya illinoinensis]1.4e-2039.31Show/hide
Query:  MQEFRDAIDDCSLSDMMFRGHPFTWYKLRKGVVAMQERLDRGLCNDPFWMMFPFSSVHHLKFGLSDHCPIRIDVDGSGVPVGGGNRPLFRFEEVWTQYDE
        ++ FR A + CSL DM + G+ FTW   R G   ++ER+DR LCN  +  +FPFS V++L    SDHCPI + V+   +     +RP FR+E  W   ++
Subjt:  MQEFRDAIDDCSLSDMMFRGHPFTWYKLRKGVVAMQERLDRGLCNDPFWMMFPFSSVHHLKFGLSDHCPIRIDVDGSGVPVGGGNRPLFRFEEVWTQYDE

Query:  FREMVERGWGMDSRVNQSLMASWKERLNKCAKHLSSWGKTKKGNY
        F E++E+ W    RV+  L     E LN+C   L  W K   GNY
Subjt:  FREMVERGWGMDSRVNQSLMASWKERLNKCAKHLSSWGKTKKGNY

XP_042983251.1 uncharacterized protein LOC122312656 [Carya illinoinensis]2.3e-2035.03Show/hide
Query:  MQEFRDAIDDCSLSDMMFRGHPFTWYKLRKGVVAMQERLDRGLCNDPFWMMFPFSSVHHLKFGLSDHCPIRIDVDGSGVPVGGGNRPLFRFEEVWTQYDE
        M+ FR  IDDC+L D+ + G  FTW+  R+GV A+ ERLDR L N+ +   FP++ V H     SDH PI +  +G  V    G +P FRFE +WT+ +E
Subjt:  MQEFRDAIDDCSLSDMMFRGHPFTWYKLRKGVVAMQERLDRGLCNDPFWMMFPFSSVHHLKFGLSDHCPIRIDVDGSGVPVGGGNRPLFRFEEVWTQYDE

Query:  FREMVERGWGMDSRVNQSLMASWKERLNKCAKHLSSWGKTKKGNYGRRIQEARGRLQ
          +++E+ W +        +     ++++C+  L  W K   G   +R++ AR +L+
Subjt:  FREMVERGWGMDSRVNQSLMASWKERLNKCAKHLSSWGKTKKGNYGRRIQEARGRLQ

XP_042988712.1 uncharacterized protein LOC122316247 [Carya illinoinensis]1.0e-2038.89Show/hide
Query:  MQEFRDAIDDCSLSDMMFRGHPFTWYKLRKGVVAMQERLDRGLCNDPFWMMFPFSSVHHLKFGLSDHCPIRIDVDGSGVPVGGGNRPLFRFEEVWTQYDE
        M+ FR  +DDCS  D+ F G PFTW   R     + ERLDR L N  +   FP   V H     SDH PI +    S          LFRFE +WT+  E
Subjt:  MQEFRDAIDDCSLSDMMFRGHPFTWYKLRKGVVAMQERLDRGLCNDPFWMMFPFSSVHHLKFGLSDHCPIRIDVDGSGVPVGGGNRPLFRFEEVWTQYDE

Query:  FREMVERGWGMDSRVNQSLMASWKERLNKCAKHLSSWGKTKKGNYGRRIQEARGRLQMEIER
        +  +VE  W  + R + + M + K R+  C++ L+ W KTK GN  +RIQ+ R  LQ   ER
Subjt:  FREMVERGWGMDSRVNQSLMASWKERLNKCAKHLSSWGKTKKGNYGRRIQEARGRLQMEIER

TrEMBL top hitse value%identityAlignment
A0A2N9EK24 CCHC-type domain-containing protein5.6e-2035.67Show/hide
Query:  MQEFRDAIDDCSLSDMMFRGHPFTWYKLRKGVVAMQERLDRGLCNDPFWMMFPFSSVHHLKFGLSDHCPIRIDVDGSGVPVGGGNRPLFRFEEVWTQYDE
        MQ FRDA+DDC+L D+ F G  FTW   R+G   +++RLDR LC + +  MFP   VHHL    SDH  + +             RP+ RFEE WT++ E
Subjt:  MQEFRDAIDDCSLSDMMFRGHPFTWYKLRKGVVAMQERLDRGLCNDPFWMMFPFSSVHHLKFGLSDHCPIRIDVDGSGVPVGGGNRPLFRFEEVWTQYDE

Query:  FREMVERGWGMDSRVNQSLMASWKERLNKCAKHLSSWGKTKKGNYGRRIQEARGRLQ
          +++E  W   + +  + M    E++  C K L+SW +T        I++ +  LQ
Subjt:  FREMVERGWGMDSRVNQSLMASWKERLNKCAKHLSSWGKTKKGNYGRRIQEARGRLQ

A0A2N9HG19 Uncharacterized protein5.6e-2036.94Show/hide
Query:  MQEFRDAIDDCSLSDMMFRGHPFTWYKLRKGVVAMQERLDRGLCNDPFWMMFPFSSVHHLKFGLSDHCPIRIDVDGSGVPVGGGNRPLFRFEEVWTQYDE
        MQ FRD +DDC   D+ F G PFTW   R G +   ERLDR +    +   FP + VHHL +  SDH PI +  +   +P    +R LFRFEEVWT    
Subjt:  MQEFRDAIDDCSLSDMMFRGHPFTWYKLRKGVVAMQERLDRGLCNDPFWMMFPFSSVHHLKFGLSDHCPIRIDVDGSGVPVGGGNRPLFRFEEVWTQYDE

Query:  FREMVERGWGMDSRVNQSLMASWKERLNKCAKHLSSWGKTKKGNYGRRIQEARGRLQ
          + +E  W  DS+    +  +W E+++ C + L  W +   GN   +I E   +L+
Subjt:  FREMVERGWGMDSRVNQSLMASWKERLNKCAKHLSSWGKTKKGNYGRRIQEARGRLQ

A0A2N9I0P4 Reverse transcriptase domain-containing protein4.0e-2640.48Show/hide
Query:  MQEFRDAIDDCSLSDMMFRGHPFTWYKLRKGVVAMQERLDRGLCNDPFWMMFPFSSVHHLKFGLSDHCPIRIDVDGSGVPVGGGNRPLFRFEEVWTQYDE
        MQ+FRDAID C   D+ F G PFTW   R G   + ERLDRGL    +  MFP +S+HH+  G SDHCP+ +++  +  P     R  FRFEE+W     
Subjt:  MQEFRDAIDDCSLSDMMFRGHPFTWYKLRKGVVAMQERLDRGLCNDPFWMMFPFSSVHHLKFGLSDHCPIRIDVDGSGVPVGGGNRPLFRFEEVWTQYDE

Query:  FREMVERGWGMDSRVNQSLMASWKERLNKCAKHLSSWGKTKKGNYGRRIQEARGRLQMEIE----RGG
         R++V + W  D RV+   MA   +++  C KHL SW K    N  R + + +  L+ E+E    RGG
Subjt:  FREMVERGWGMDSRVNQSLMASWKERLNKCAKHLSSWGKTKKGNYGRRIQEARGRLQMEIE----RGG

A0A2N9IMU2 Reverse transcriptase domain-containing protein4.0e-2640.48Show/hide
Query:  MQEFRDAIDDCSLSDMMFRGHPFTWYKLRKGVVAMQERLDRGLCNDPFWMMFPFSSVHHLKFGLSDHCPIRIDVDGSGVPVGGGNRPLFRFEEVWTQYDE
        MQ+FRDAID C   D+ F G PFTW   R G   + ERLDRGL    +  MFP +S+HH+  G SDHCP+ +++  +  P     R  FRFEE+W     
Subjt:  MQEFRDAIDDCSLSDMMFRGHPFTWYKLRKGVVAMQERLDRGLCNDPFWMMFPFSSVHHLKFGLSDHCPIRIDVDGSGVPVGGGNRPLFRFEEVWTQYDE

Query:  FREMVERGWGMDSRVNQSLMASWKERLNKCAKHLSSWGKTKKGNYGRRIQEARGRLQMEIE----RGG
         R++V + W  D RV+   MA   +++  C KHL SW K    N  R + + +  L+ E+E    RGG
Subjt:  FREMVERGWGMDSRVNQSLMASWKERLNKCAKHLSSWGKTKKGNYGRRIQEARGRLQMEIE----RGG

A0A6P9EM92 uncharacterized protein LOC1089797765.6e-2037.58Show/hide
Query:  MQEFRDAIDDCSLSDMMFRGHPFTWYKLRKGVVAMQERLDRGLCNDPFWMMFPFSSVHHLKFGLSDHCPIRIDVDGSGVPVGGGNRPLFRFEEVWTQYDE
        M  FR+ + DCSL DM F G  FTW   R+G  A+ ERLDR L N+ F+ +FP   V H     SDH PI  D +G G+  G  +R  FRFE +W    +
Subjt:  MQEFRDAIDDCSLSDMMFRGHPFTWYKLRKGVVAMQERLDRGLCNDPFWMMFPFSSVHHLKFGLSDHCPIRIDVDGSGVPVGGGNRPLFRFEEVWTQYDE

Query:  FREMVERGWGMDSRVNQSLMASWKERLNKCAKHLSSWGKTKKGNYGRRIQEARGRLQ
          +++E  W   + +  + M    +R+  C   L+SW K   G+  ++  EAR RL+
Subjt:  FREMVERGWGMDSRVNQSLMASWKERLNKCAKHLSSWGKTKKGNYGRRIQEARGRLQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein8.5e-0525.62Show/hide
Query:  MQEFRDAIDDCSLSDMMFRGHPFTWYKLRKGVVAMQERLDRGLCNDPFWMMFPFSSVHHLKFGLSDHCPIRIDVDGSGVPVGGGNRPLFRFEEVWTQYDE
        ++EF++ + D  L D+  RG  +TW   +     ++ +LDR + N  ++  FP +       G+SDH P  I ++     +   ++  FR+    + +  
Subjt:  MQEFRDAIDDCSLSDMMFRGHPFTWYKLRKGVVAMQERLDRGLCNDPFWMMFPFSSVHHLKFGLSDHCPIRIDVDGSGVPVGGGNRPLFRFEEVWTQYDE

Query:  FREMVERGWGMDSRVNQSLMASWKERL---NKCAKHLSSWGKTKKGNYGRRIQEARGRLQ
        F   +   W     V  S M S  E L    KC K L+  G    GN   + +EA   L+
Subjt:  FREMVERGWGMDSRVNQSLMASWKERL---NKCAKHLSSWGKTKKGNYGRRIQEARGRLQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGAGTTTAGGGATGCTATTGATGATTGCAGTTTGTCTGACATGATGTTTAGAGGGCACCCTTTCACGTGGTATAAGTTGAGGAAGGGGGTTGTGGCTATGCAGGA
GCGGCTGGATAGGGGCCTCTGCAATGATCCCTTTTGGATGATGTTCCCATTCTCTAGCGTGCATCACTTAAAGTTTGGGTTGTCTGACCATTGCCCGATCCGGATTGATG
TGGATGGGAGTGGAGTCCCAGTGGGTGGAGGGAATAGGCCACTTTTTCGCTTTGAGGAGGTTTGGACTCAGTATGATGAGTTCCGAGAGATGGTGGAGAGAGGGTGGGGG
ATGGATTCTCGAGTTAACCAATCTCTAATGGCGAGTTGGAAGGAGAGGTTGAATAAGTGTGCAAAACACTTATCTAGTTGGGGCAAAACAAAGAAAGGGAATTATGGTAG
AAGAATCCAGGAGGCACGGGGCAGGCTCCAAATGGAGATTGAGAGAGGGGGAATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAAGAGTTTAGGGATGCTATTGATGATTGCAGTTTGTCTGACATGATGTTTAGAGGGCACCCTTTCACGTGGTATAAGTTGAGGAAGGGGGTTGTGGCTATGCAGGA
GCGGCTGGATAGGGGCCTCTGCAATGATCCCTTTTGGATGATGTTCCCATTCTCTAGCGTGCATCACTTAAAGTTTGGGTTGTCTGACCATTGCCCGATCCGGATTGATG
TGGATGGGAGTGGAGTCCCAGTGGGTGGAGGGAATAGGCCACTTTTTCGCTTTGAGGAGGTTTGGACTCAGTATGATGAGTTCCGAGAGATGGTGGAGAGAGGGTGGGGG
ATGGATTCTCGAGTTAACCAATCTCTAATGGCGAGTTGGAAGGAGAGGTTGAATAAGTGTGCAAAACACTTATCTAGTTGGGGCAAAACAAAGAAAGGGAATTATGGTAG
AAGAATCCAGGAGGCACGGGGCAGGCTCCAAATGGAGATTGAGAGAGGGGGAATGTGA
Protein sequenceShow/hide protein sequence
MQEFRDAIDDCSLSDMMFRGHPFTWYKLRKGVVAMQERLDRGLCNDPFWMMFPFSSVHHLKFGLSDHCPIRIDVDGSGVPVGGGNRPLFRFEEVWTQYDEFREMVERGWG
MDSRVNQSLMASWKERLNKCAKHLSSWGKTKKGNYGRRIQEARGRLQMEIERGGM