; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039740 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039740
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr2:49128126..49128650
RNA-Seq ExpressionLag0039740
SyntenyLag0039740
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
GO:0140640 - catalytic activity, acting on a nucleic acid (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA3452465.1 reverse transcriptase [Gossypium australe]6.2e-4052.63Show/hide
Query:  DLEKPYTRAEIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKTQNPERLEEFRPISLCNVVYKIIAKALA
        +L+  +T+ EI  ALK+M P KAPGEDG  A F+Q  W +IG+DV   CL+ LN G+DV  +NKT +VLIPK  NP  + +FRPISLCNV+YK+IAKA+A
Subjt:  DLEKPYTRAEIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKTQNPERLEEFRPISLCNVVYKIIAKALA

Query:  NRMKKVLDDIISPTQSAFIPGRIITDNIILGFECIHTINCRKKRKKRCLSVQ
        NR++ VL   I   QSAF+PGR+ITDN++L +E +HT+  +K  K+  ++V+
Subjt:  NRMKKVLDDIISPTQSAFIPGRIITDNIILGFECIHTINCRKKRKKRCLSVQ

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]3.6e-4051.66Show/hide
Query:  LEKPYTRAEIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKTQNPERLEEFRPISLCNVVYKIIAKALAN
        LE+P+T  +I  AL +M P KAPG DG  A FFQ +W ++G+ +T+ CL +LN    +  LN T + LIPK + P ++ EFRPISLCNVVY+I+AKA+AN
Subjt:  LEKPYTRAEIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKTQNPERLEEFRPISLCNVVYKIIAKALAN

Query:  RMKKVLDDIISPTQSAFIPGRIITDNIILGFECIHTINCRKKRKKRCLSVQ
        R+K +L+ IISP QSAFIP R+ITDN+I+G+EC+H I   K R+   ++++
Subjt:  RMKKVLDDIISPTQSAFIPGRIITDNIILGFECIHTINCRKKRKKRCLSVQ

XP_023913142.1 uncharacterized protein LOC112024740 [Quercus suber]5.6e-4152.63Show/hide
Query:  DLEKPYTRAEIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKTQNPERLEEFRPISLCNVVYKIIAKALA
        +L KPYT  E++ AL  M P KAPG DG +A F+Q +W ++G+DV+   L  LN+G  +  +N T +VLIPK ++PE++ +FRPISLCNV+YKII+K LA
Subjt:  DLEKPYTRAEIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKTQNPERLEEFRPISLCNVVYKIIAKALA

Query:  NRMKKVLDDIISPTQSAFIPGRIITDNIILGFECIHTINCRKKRKKRCLSVQ
        NR+K +L  +ISPTQSAF+PGR+ITDN++L +E +H ++ RKK K R L+++
Subjt:  NRMKKVLDDIISPTQSAFIPGRIITDNIILGFECIHTINCRKKRKKRCLSVQ

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]1.2e-4056.55Show/hide
Query:  LEKPYTRAEIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKTQNPERLEEFRPISLCNVVYKIIAKALAN
        L   +T  E++ AL  M P KAPG DG +A F+Q +W ++GD V    L  LNNG  +  +N T +VLIPK QNPER+ EFRPISLCNV+YKII+K LAN
Subjt:  LEKPYTRAEIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKTQNPERLEEFRPISLCNVVYKIIAKALAN

Query:  RMKKVLDDIISPTQSAFIPGRIITDNIILGFECIHTINCRKKRKK
        R+K+VL  IIS TQSAF+PGR+ITDN+++ +E +HT++ RKK KK
Subjt:  RMKKVLDDIISPTQSAFIPGRIITDNIILGFECIHTINCRKKRKK

XP_030939512.1 uncharacterized protein LOC115964316 [Quercus lobata]3.6e-4052.32Show/hide
Query:  LEKPYTRAEIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKTQNPERLEEFRPISLCNVVYKIIAKALAN
        L   ++  E++ AL  M P KAPG DG +A F+Q +W ++GD V    L  LNNG  +  +N T +VLIPK +NPE++ +FRPISLCNV+YKII+K LAN
Subjt:  LEKPYTRAEIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKTQNPERLEEFRPISLCNVVYKIIAKALAN

Query:  RMKKVLDDIISPTQSAFIPGRIITDNIILGFECIHTINCRKKRKKRCLSVQ
        R+K+VL  IIS TQSAF+PGR+ITDN+++ +E +HT++CRKK KK  ++++
Subjt:  RMKKVLDDIISPTQSAFIPGRIITDNIILGFECIHTINCRKKRKKRCLSVQ

TrEMBL top hitse value%identityAlignment
A0A2P6S9N7 Putative RNA-directed DNA polymerase3.9e-4057.14Show/hide
Query:  LEKPYTRAEIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKTQNPERLEEFRPISLCNVVYKIIAKALAN
        L +P+T+ EIE ALK M P+KAPGEDG  A F+Q YW +IGDD++R CL+ LN G D+   + TLL LIPK   P+ L +FRPISLCNV+YK+++KA+ N
Subjt:  LEKPYTRAEIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKTQNPERLEEFRPISLCNVVYKIIAKALAN

Query:  RMKKVLDDIISPTQSAFIPGRIITDNIILGFECIHTINCR
        RMK  L ++ISP QSAF+PGR I DNII  FE +H+I  +
Subjt:  RMKKVLDDIISPTQSAFIPGRIITDNIILGFECIHTINCR

A0A5B6UFE1 Reverse transcriptase3.0e-4052.63Show/hide
Query:  DLEKPYTRAEIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKTQNPERLEEFRPISLCNVVYKIIAKALA
        +L+  +T+ EI  ALK+M P KAPGEDG  A F+Q  W +IG+DV   CL+ LN G+DV  +NKT +VLIPK  NP  + +FRPISLCNV+YK+IAKA+A
Subjt:  DLEKPYTRAEIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKTQNPERLEEFRPISLCNVVYKIIAKALA

Query:  NRMKKVLDDIISPTQSAFIPGRIITDNIILGFECIHTINCRKKRKKRCLSVQ
        NR++ VL   I   QSAF+PGR+ITDN++L +E +HT+  +K  K+  ++V+
Subjt:  NRMKKVLDDIISPTQSAFIPGRIITDNIILGFECIHTINCRKKRKKRCLSVQ

A0A5B6VKR7 Reverse transcriptase1.1e-3952.32Show/hide
Query:  LEKPYTRAEIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKTQNPERLEEFRPISLCNVVYKIIAKALAN
        L+  +T  EI  AL +M P KAPGEDG  A F+Q  W +IG DV+  CL+ LNNG++V  +NKT +VLIPK  NP  + +FRPISLCNV+YK+IAK +AN
Subjt:  LEKPYTRAEIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKTQNPERLEEFRPISLCNVVYKIIAKALAN

Query:  RMKKVLDDIISPTQSAFIPGRIITDNIILGFECIHTINCRKKRKKRCLSVQ
        R+  V+   I P QSAF+PGR+ITDN++L +E +HT+  +K  KK  ++V+
Subjt:  RMKKVLDDIISPTQSAFIPGRIITDNIILGFECIHTINCRKKRKKRCLSVQ

A0A5B6WEP5 Reverse transcriptase8.7e-4052.32Show/hide
Query:  LEKPYTRAEIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKTQNPERLEEFRPISLCNVVYKIIAKALAN
        L+  +T  EI  AL +M P KAPGEDG  A F+Q  W +IG DV+  CL+ LNNG++V  +NKT +VLIPK  NP  + +FRPISLCNV+YK+IAK +AN
Subjt:  LEKPYTRAEIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKTQNPERLEEFRPISLCNVVYKIIAKALAN

Query:  RMKKVLDDIISPTQSAFIPGRIITDNIILGFECIHTINCRKKRKKRCLSVQ
        R++ V+   I P QSAF+PGR+ITDN++L +E +HT+  +K  KK  ++V+
Subjt:  RMKKVLDDIISPTQSAFIPGRIITDNIILGFECIHTINCRKKRKKRCLSVQ

A0A7N2LIH6 Uncharacterized protein1.3e-4051.97Show/hide
Query:  DLEKPYTRAEIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKTQNPERLEEFRPISLCNVVYKIIAKALA
        +L+K +   E+  AL+ M P KAPG DG    F+Q YWD++G  VT   L+ LN+G+    +NKT + LIPKT+NP+++ EFRPISLCNV+YKII+K LA
Subjt:  DLEKPYTRAEIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKTQNPERLEEFRPISLCNVVYKIIAKALA

Query:  NRMKKVLDDIISPTQSAFIPGRIITDNIILGFECIHTINCRKKRKKRCLSVQ
        NR+KKVL  +I   QSAF+PGR+ITDN+I+ FE +H+IN R+K K+  ++++
Subjt:  NRMKKVLDDIISPTQSAFIPGRIITDNIILGFECIHTINCRKKRKKRCLSVQ

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein2.3e-1331.41Show/hide
Query:  MRDLEKPYTRAEIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKT-QNPERLEEFRPISLCNVVYKIIAK
        +  L +P T +EI   +  +   K+PG DG  A F+Q Y + +   + ++   +   GI      +  ++LIPK  ++  + E FRPISL N+  KI+ K
Subjt:  MRDLEKPYTRAEIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKT-QNPERLEEFRPISLCNVVYKIIAK

Query:  ALANRMKKVLDDIISPTQSAFIPGRIITDNIILGFECIHTINCRKKRKKRCLSVQA
         LANR+++ +  +I   Q  FIPG     NI      I  IN  K +    +S+ A
Subjt:  ALANRMKKVLDDIISPTQSAFIPGRIITDNIILGFECIHTINCRKKRKKRCLSVQA

P08548 LINE-1 reverse transcriptase homolog5.1e-1331.37Show/hide
Query:  LEKPYTRAEIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKT-QNPERLEEFRPISLCNVVYKIIAKALA
        L +P + +EI   ++++   K+PG DG  + F+Q + + +   +  +   +   GI      +  + LIPK  ++P R E +RPISL N+  KI+ K L 
Subjt:  LEKPYTRAEIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKT-QNPERLEEFRPISLCNVVYKIIAKALA

Query:  NRMKKVLDDIISPTQSAFIPGRIITDNIILGFECIHTINCRKKRKKRCLSVQA
        NR+++ +  II   Q  FIPG     NI      I  IN  K +    LS+ A
Subjt:  NRMKKVLDDIISPTQSAFIPGRIITDNIILGFECIHTINCRKKRKKRCLSVQA

P11369 LINE-1 retrotransposable element ORF2 protein1.9e-1533.99Show/hide
Query:  LEKPYTRAEIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKTQ-NPERLEEFRPISLCNVVYKIIAKALA
        L  P +  EIE  +  +   K+PG DG  A F+Q + + +   + ++  ++   G       +  + LIPK Q +P ++E FRPISL N+  KI+ K LA
Subjt:  LEKPYTRAEIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKTQ-NPERLEEFRPISLCNVVYKIIAKALA

Query:  NRMKKVLDDIISPTQSAFIPGRIITDNIILGFECIHTINCRKKRKKRCLSVQA
        NR+++ +  II P Q  FIPG     NI      IH IN  K +    +S+ A
Subjt:  NRMKKVLDDIISPTQSAFIPGRIITDNIILGFECIHTINCRKKRKKRCLSVQA

P14381 Transposon TX1 uncharacterized 149 kDa protein3.0e-2140Show/hide
Query:  LEKPYTRAEIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKTQNPERLEEFRPISLCNVVYKIIAKALAN
        LE P T  E+  AL+ M  NK+PG DG    FFQ +WD +G D  RV       G       + +L L+PK  +   ++ +RP+SL +  YKI+AKA++ 
Subjt:  LEKPYTRAEIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKTQNPERLEEFRPISLCNVVYKIIAKALAN

Query:  RMKKVLDDIISPTQSAFIPGRIITDNIILGFECIH
        R+K VL ++I P QS  +PGR I DN+ L  + +H
Subjt:  RMKKVLDDIISPTQSAFIPGRIITDNIILGFECIH

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein3.2e-1039.53Show/hide
Query:  EIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKTQNPERLEEFRPISLCNVVYKII
        EI  A+  M  NKAPG D   A FF   W ++ D            G  +   N T + LIPK    ++L  FRP+S C VVYKII
Subjt:  EIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKTQNPERLEEFRPISLCNVVYKII

AT4G20520.1 RNA binding;RNA-directed DNA polymerases6.9e-0538.64Show/hide
Query:  LANRMKKVLDDIISPTQSAFIPGRIITDNIILGFECIHTINCRK
        +  R+K ++ ++I P Q++FIPGR+ TDNI+   E +H++  +K
Subjt:  LANRMKKVLDDIISPTQSAFIPGRIITDNIILGFECIHTINCRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGACCTGGAAAAGCCTTACACTAGAGCTGAGATCGAGTTTGCATTAAAGGATATGAGCCCCAACAAAGCCCCAGGCGAAGATGGCACTCATGCCACCTTTTTTCA
GAACTACTGGGATTTGATTGGGGATGATGTGACTAGAGTGTGCTTAAGAGTCCTGAACAATGGTATAGATGTGGGCCCCCTGAACAAGACCCTCCTAGTTCTCATTCCAA
AGACTCAAAACCCCGAAAGGTTGGAGGAGTTCAGGCCCATCAGCCTCTGCAATGTGGTCTACAAAATTATTGCTAAAGCGCTTGCGAATAGAATGAAAAAAGTCCTCGAC
GACATCATATCCCCCACGCAGTCTGCTTTTATCCCAGGGAGAATCATAACGGACAACATCATCCTAGGTTTTGAATGTATACACACTATTAACTGCAGGAAAAAAAGGAA
AAAACGGTGTCTCAGCGTTCAAGCTCGATATGAGCAAAACGTACGACAGAGTGGAGTGGAACTTCCTAAAGAAGGTGATGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGAGAGACCTGGAAAAGCCTTACACTAGAGCTGAGATCGAGTTTGCATTAAAGGATATGAGCCCCAACAAAGCCCCAGGCGAAGATGGCACTCATGCCACCTTTTTTCA
GAACTACTGGGATTTGATTGGGGATGATGTGACTAGAGTGTGCTTAAGAGTCCTGAACAATGGTATAGATGTGGGCCCCCTGAACAAGACCCTCCTAGTTCTCATTCCAA
AGACTCAAAACCCCGAAAGGTTGGAGGAGTTCAGGCCCATCAGCCTCTGCAATGTGGTCTACAAAATTATTGCTAAAGCGCTTGCGAATAGAATGAAAAAAGTCCTCGAC
GACATCATATCCCCCACGCAGTCTGCTTTTATCCCAGGGAGAATCATAACGGACAACATCATCCTAGGTTTTGAATGTATACACACTATTAACTGCAGGAAAAAAAGGAA
AAAACGGTGTCTCAGCGTTCAAGCTCGATATGAGCAAAACGTACGACAGAGTGGAGTGGAACTTCCTAAAGAAGGTGATGGATAG
Protein sequenceShow/hide protein sequence
MRDLEKPYTRAEIEFALKDMSPNKAPGEDGTHATFFQNYWDLIGDDVTRVCLRVLNNGIDVGPLNKTLLVLIPKTQNPERLEEFRPISLCNVVYKIIAKALANRMKKVLD
DIISPTQSAFIPGRIITDNIILGFECIHTINCRKKRKKRCLSVQARYEQNVRQSGVELPKEGDG