CuGenDBv2

Lag0018316 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0018316
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr5:22818242..22818817
RNA-Seq Expression: Lag0018316
Synteny: Lag0018316
Gene Ontology terms: NA
InterPro domains: IPR000477 - Reverse transcriptase domain
                  IPR043502 - DNA/RNA polymerase superfamily
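The genome location above can be turned into a span length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention); `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tooling.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into its three parts."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr5:22818242..22818817")
length = span_length(start, end)
print(chrom, start, end, length)
```

Under that assumption the span is 576 bp, which is a multiple of three (192 codons).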


Homology
GenBank top hits (e value, % identity, alignment)
KAA3462911.1 reverse transcriptase [Gossypium australe]; e value 7.3e-42; identity 50.55%
Query:  QGALD--QVLGHVPTKVTEEMNELLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVGDTTIANCLAILNSEAPIHSWNHTNFVLIPKICQPR
        +G +D  +VL  +   +T+E+N+ L  P+T +EV++A+K     KAPGPDGFPA+FFQ++W IVG   +  CL ILN    I S N T+ VLIPK+ QP 
Subjt:  QGALD--QVLGHVPTKVTEEMNELLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVGDTTIANCLAILNSEAPIHSWNHTNFVLIPKICQPR

Query:  LVSDYRQISLCNFSYKIITKVIANRLKIVLNEIIDECQFAFILGRSITDNMILGHESLHFIHSKRKGKTRYAALKLDMSKAY
         + ++R ISLC+  YK++TK IANRL+ V+   ID+ Q AFI GR I DN+IL +E LH    KR GK  Y A+KLDMSKAY
Subjt:  LVSDYRQISLCNFSYKIITKVIANRLKIVLNEIIDECQFAFILGRSITDNMILGHESLHFIHSKRKGKTRYAALKLDMSKAY

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]; e value 1.8e-45; identity 51.98%
Query:  LDQVLGHVPTKVTEEMNELLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVGDTTIANCLAILNSEAPIHSWNHTNFVLIPKICQPRLVSDY
        ++ ++  +PT++T E+NE L+ PYT+EE+  A++    TKA GPDGFPA+F+Q YW +VG  T+  CL  LN+   I  WN T   LIPKI QPR +SD+
Subjt:  LDQVLGHVPTKVTEEMNELLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVGDTTIANCLAILNSEAPIHSWNHTNFVLIPKICQPRLVSDY

Query:  RQISLCNFSYKIITKVIANRLKIVLNEIIDECQFAFILGRSITDNMILGHESLHFIHSKRKGKTRYAALKLDMSKAY
        R ISLCN SYKII+K I NRLK V+  +I + Q AF+  R+I+DN+I+GHE LH I+S + G    AALKLD+SKA+
Subjt:  RQISLCNFSYKIITKVIANRLKIVLNEIIDECQFAFILGRSITDNMILGHESLHFIHSKRKGKTRYAALKLDMSKAY

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]; e value 5.4e-45; identity 51.98%
Query:  LDQVLGHVPTKVTEEMNELLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVGDTTIANCLAILNSEAPIHSWNHTNFVLIPKICQPRLVSDY
        + +VL  +PT VTEEMN  L+  +TREE+ +A+   H TKAPGPDG  A+FFQ+YW IVG+  +   L +LNS   +   N TN  L+PKI  P  +SD+
Subjt:  LDQVLGHVPTKVTEEMNELLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVGDTTIANCLAILNSEAPIHSWNHTNFVLIPKICQPRLVSDY

Query:  RQISLCNFSYKIITKVIANRLKIVLNEIIDECQFAFILGRSITDNMILGHESLHFIHSKRKGKTRYAALKLDMSKAY
        R ISLCN  YK+I+KV+ANRLK +L +II E Q AF+ GR ITDN+++  E +H++  K++GK  +AA+KLDMSKAY
Subjt:  RQISLCNFSYKIITKVIANRLKIVLNEIIDECQFAFILGRSITDNMILGHESLHFIHSKRKGKTRYAALKLDMSKAY

XP_042950087.1 uncharacterized protein LOC122282202 [Carya illinoinensis]; e value 3.3e-42; identity 51.1%
Query:  TDQGALDQVLGHVPTKVTEEMNELLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVGDTTIANCLAILNSEAPIHSWNHTNFVLIPKICQPR
        TD+  ++ VL  V  +VT EMNE L+ PY  EEV  A+K  H ++APGPDG P +FFQ+YW ++G++     L+ LNS       NHT   LIPK   P 
Subjt:  TDQGALDQVLGHVPTKVTEEMNELLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVGDTTIANCLAILNSEAPIHSWNHTNFVLIPKICQPR

Query:  LVSDYRQISLCNFSYKIITKVIANRLKIVLNEIIDECQFAFILGRSITDNMILGHESLHFIHSKRKGKTRYAALKLDMSKAY
         V+D+R ISLCN  YKI++KVIANRLK VL  II   Q AF+ GR I+DN+++ +E LHF+ +KRKG+  + +LKLDMSKAY
Subjt:  LVSDYRQISLCNFSYKIITKVIANRLKIVLNEIIDECQFAFILGRSITDNMILGHESLHFIHSKRKGKTRYAALKLDMSKAY

XP_042962672.1 uncharacterized protein LOC122296942 [Carya illinoinensis]; e value 1.1e-42; identity 51.38%
Query:  DQGALDQVLGHVPTKVTEEMNELLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVGDTTIANCLAILNSEAPIHSWNHTNFVLIPKICQPRL
        D+  ++ VL  V  +VT EMNE L+ PY  EEV  A+K  H +KAPGPDG P +FFQ+YW ++G++     L+ LNS       NHT   LIPK   P  
Subjt:  DQGALDQVLGHVPTKVTEEMNELLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVGDTTIANCLAILNSEAPIHSWNHTNFVLIPKICQPRL

Query:  VSDYRQISLCNFSYKIITKVIANRLKIVLNEIIDECQFAFILGRSITDNMILGHESLHFIHSKRKGKTRYAALKLDMSKAY
        V+D+R ISLCN  YKI++KVIANRLK VL +II   Q AF+ GR I+DN+++ +E LHF+ +KRKG+  + +LKLDMSKAY
Subjt:  VSDYRQISLCNFSYKIITKVIANRLKIVLNEIIDECQFAFILGRSITDNMILGHESLHFIHSKRKGKTRYAALKLDMSKAY

TrEMBL top hits (e value, % identity, alignment)
A0A2N9FNH6 Reverse transcriptase domain-containing protein; e value 5.4e-43; identity 49.47%
Query:  TDQGALDQVLGHVPTKVTEEMNELLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVGDTTIANCLAILNSEAPIHSWNHTNFVLIPKICQPR
        ++  A+ QV+  V   VT+ MN++L+ P+T EE+ SA+   H TKAPGPDG  A+F+Q++W IVGD      L  L+S   + S N T+  LIPKI  P 
Subjt:  TDQGALDQVLGHVPTKVTEEMNELLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVGDTTIANCLAILNSEAPIHSWNHTNFVLIPKICQPR

Query:  LVSDYRQISLCNFSYKIITKVIANRLKIVLNEIIDECQFAFILGRSITDNMILGHESLHFIHSKRKGKTRYAALKLDMSKAYMK-GWN
        L++ +R ISLCN  YKII+KV+ANRLK VL+ II + Q AF+ GR ITDN+++  E+LH++ +KRKG++ + A+KLDMSKAY +  WN
Subjt:  LVSDYRQISLCNFSYKIITKVIANRLKIVLNEIIDECQFAFILGRSITDNMILGHESLHFIHSKRKGKTRYAALKLDMSKAYMK-GWN

A0A2N9GM07 Reverse transcriptase domain-containing protein; e value 1.2e-42; identity 52.35%
Query:  VPTKVTEEMNELLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVGDTTIANCLAILNSEAPIHSWNHTNFVLIPKICQPRLVSDYRQISLCN
        +  KVT  MN+ L   +T EEV  A++  H TKAPGPDG  AVFFQ+YW IVG       L +LN+ A   ++N TN  LIPK   P+ ++++R ISLCN
Subjt:  VPTKVTEEMNELLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVGDTTIANCLAILNSEAPIHSWNHTNFVLIPKICQPRLVSDYRQISLCN

Query:  FSYKIITKVIANRLKIVLNEIIDECQFAFILGRSITDNMILGHESLHFIHSKRKGKTRYAALKLDMSKAY
         +YK+I+KVIANRLK VL+++I E Q AF+ GR+ITDN ++  E +H+   KR GK  Y ALKLDMSKAY
Subjt:  FSYKIITKVIANRLKIVLNEIIDECQFAFILGRSITDNMILGHESLHFIHSKRKGKTRYAALKLDMSKAY

A0A2N9J109 Uncharacterized protein; e value 5.8e-45; identity 51.65%
Query:  TDQGALDQVLGHVPTKVTEEMNELLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVGDTTIANCLAILNSEAPIHSWNHTNFVLIPKICQPR
        ++  ++D VL  VPT VT+ MNE L  PYT  EV  A++      APGPDG P VF+Q +W ++G+  I   L+ LNS   + S NHTN  LIPK+  P 
Subjt:  TDQGALDQVLGHVPTKVTEEMNELLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVGDTTIANCLAILNSEAPIHSWNHTNFVLIPKICQPR

Query:  LVSDYRQISLCNFSYKIITKVIANRLKIVLNEIIDECQFAFILGRSITDNMILGHESLHFIHSKRKGKTRYAALKLDMSKAY
         VS++R ISLCN  YKII+KV+ANRLK++L  +I E Q AF+LGR ITDN+++  E+LH + S R+GK  + ALKLDMSKAY
Subjt:  LVSDYRQISLCNFSYKIITKVIANRLKIVLNEIIDECQFAFILGRSITDNMILGHESLHFIHSKRKGKTRYAALKLDMSKAY

A0A2N9J3U0 Reverse transcriptase domain-containing protein; e value 5.4e-43; identity 49.47%
Query:  TDQGALDQVLGHVPTKVTEEMNELLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVGDTTIANCLAILNSEAPIHSWNHTNFVLIPKICQPR
        ++  A+ QV+  V   VT+ MN++L+ P+T EE+ SA+   H TKAPGPDG  A+F+Q++W IVGD      L  L+S   + S N T+  LIPKI  P 
Subjt:  TDQGALDQVLGHVPTKVTEEMNELLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVGDTTIANCLAILNSEAPIHSWNHTNFVLIPKICQPR

Query:  LVSDYRQISLCNFSYKIITKVIANRLKIVLNEIIDECQFAFILGRSITDNMILGHESLHFIHSKRKGKTRYAALKLDMSKAYMK-GWN
        L++ +R ISLCN  YKII+KV+ANRLK VL+ II + Q AF+ GR ITDN+++  E+LH++ +KRKG++ + A+KLDMSKAY +  WN
Subjt:  LVSDYRQISLCNFSYKIITKVIANRLKIVLNEIIDECQFAFILGRSITDNMILGHESLHFIHSKRKGKTRYAALKLDMSKAYMK-GWN

A0A6J1DX30 uncharacterized protein LOC111024874; e value 8.9e-46; identity 51.98%
Query:  LDQVLGHVPTKVTEEMNELLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVGDTTIANCLAILNSEAPIHSWNHTNFVLIPKICQPRLVSDY
        ++ ++  +PT++T E+NE L+ PYT+EE+  A++    TKA GPDGFPA+F+Q YW +VG  T+  CL  LN+   I  WN T   LIPKI QPR +SD+
Subjt:  LDQVLGHVPTKVTEEMNELLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVGDTTIANCLAILNSEAPIHSWNHTNFVLIPKICQPRLVSDY

Query:  RQISLCNFSYKIITKVIANRLKIVLNEIIDECQFAFILGRSITDNMILGHESLHFIHSKRKGKTRYAALKLDMSKAY
        R ISLCN SYKII+K I NRLK V+  +I + Q AF+  R+I+DN+I+GHE LH I+S + G    AALKLD+SKA+
Subjt:  RQISLCNFSYKIITKVIANRLKIVLNEIIDECQFAFILGRSITDNMILGHESLHFIHSKRKGKTRYAALKLDMSKAY

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein; e value 3.1e-11; identity 29.24%
Query:  KVTEEMNELLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVGDTTIANCLAILNSEAPIHSWNHTNFVLIPKICQPRLVSD-YRQISLCNFS
        ++ +E  E L  P T  E+ + + S  + K+PGPDGF A F+QRY   +    +    +I       +S+   + +LIPK  +     + +R ISL N  
Subjt:  KVTEEMNELLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVGDTTIANCLAILNSEAPIHSWNHTNFVLIPKICQPRLVSD-YRQISLCNFS

Query:  YKIITKVIANRLKIVLNEIIDECQFAFILGRSITDNMILGHESLHFI-HSKRKGKTRYAALKLDMSKAYMK
         KI+ K++ANR++  + ++I   Q  FI G     N+    +S++ I H  R     +  + +D  KA+ K
Subjt:  YKIITKVIANRLKIVLNEIIDECQFAFILGRSITDNMILGHESLHFI-HSKRKGKTRYAALKLDMSKAYMK

P08548 LINE-1 reverse transcriptase homolog; e value 2.0e-10; identity 29.12%
Query:  LDQVLG--HVPTKVTEEMNELLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVGDTTIANCLAILNSEAPI-HSWNHTNFVLIPKICQ-PRL
        +DQ L   H+P +++++  E+L  P +  E+ S +++    K+PGPDGF + F+Q +   +    + N    +  E  + +++   N  LIPK  + P  
Subjt:  LDQVLG--HVPTKVTEEMNELLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVGDTTIANCLAILNSEAPI-HSWNHTNFVLIPKICQ-PRL

Query:  VSDYRQISLCNFSYKIITKVIANRLKIVLNEIIDECQFAFILGRSITDNMILGHESLHFIHSKRKGKTR-YAALKLDMSKAY
          +YR ISL N   KI+ K++ NR++  + +II   Q  FI G     N+    +S++ I    K K + +  L +D  KA+
Subjt:  VSDYRQISLCNFSYKIITKVIANRLKIVLNEIIDECQFAFILGRSITDNMILGHESLHFIHSKRKGKTR-YAALKLDMSKAY

P11369 LINE-1 retrotransposable element ORF2 protein; e value 1.0e-09; identity 29.24%
Query:  KVTEEMNELLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVGDTTIANCLAILNSEAPI-HSWNHTNFVLIPKICQ-PRLVSDYRQISLCNF
        K+ ++  + L  P + +E+ + + S  + K+PGPDGF A F+Q +   +    +      +  E  + +S+      LIPK  + P  + ++R ISL N 
Subjt:  KVTEEMNELLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVGDTTIANCLAILNSEAPI-HSWNHTNFVLIPKICQ-PRLVSDYRQISLCNF

Query:  SYKIITKVIANRLKIVLNEIIDECQFAFILGRSITDNMILGHESLHFIHSKRKGKTRYAALKLDMSKAYMK
          KI+ K++ANR++  +  II   Q  FI G     N+      +H+I +K K K  +  + LD  KA+ K
Subjt:  SYKIITKVIANRLKIVLNEIIDECQFAFILGRSITDNMILGHESLHFIHSKRKGKTRYAALKLDMSKAYMK

P14381 Transposon TX1 uncharacterized 149 kDa protein; e value 1.9e-16; identity 36.53%
Query:  VTEEMNELLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVG-DTTIANCLAILNSEAPIHSWNHTNFVLIPKICQPRLVSDYRQISLCNFSY
        V+E   E L  P T +E+  A++     K+PG DG    FFQ +W  +G D       A    E P+ S       L+PK    RL+ ++R +SL +  Y
Subjt:  VTEEMNELLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVG-DTTIANCLAILNSEAPIHSWNHTNFVLIPKICQPRLVSDYRQISLCNFSY

Query:  KIITKVIANRLKIVLNEIIDECQFAFILGRSITDNMILGHESLHFIHSKRKGKTRYAALKLDMSKAY
        KI+ K I+ RLK VL E+I   Q   + GR+I DN+ L  + LHF  ++R G +  A L LD  KA+
Subjt:  KIITKVIANRLKIVLNEIIDECQFAFILGRSITDNMILGHESLHFIHSKRKGKTRYAALKLDMSKAY

Q95SX7 Probable RNA-directed DNA polymerase from transposon BS; e value 4.8e-04; identity 28.4%
Query:  LLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWV--IVGDTTIANCLAILNSEAPIHSWNHTNFVLIPKICQP-RLVSDYRQISLCNFSYKIITK
        L + P   EE+  A+KS    K+PG D       +   V  I+    I N  AIL  +     W     ++I K  +P      YR ISL +   K+  +
Subjt:  LLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWV--IVGDTTIANCLAILNSEAPIHSWNHTNFVLIPKICQP-RLVSDYRQISLCNFSYKIITK

Query:  VIANRLKIVLNE--IIDECQFAFILGRSITDNMILGHESLHFIHSKRKGKTRYAALKLDMSKAYMKGWN
        +IANRL  ++ E  I+ + QF F  G S  + +   H     I      K    A+ +DM +A+ + W+
Subjt:  VIANRLKIVLNE--IIDECQFAFILGRSITDNMILGHESLHFIHSKRKGKTRYAALKLDMSKAYMKGWN

Arabidopsis top hits (e value, % identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein; e value 7.1e-11; identity 38.64%
Query:  EEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVGDTTIANCLAILNSEAPIHSWNHTNFVLIPKICQPRLVSDYRQISLCNFSYKIIT
        +E+ +AV +    KAPGPD F A FF   W +V D+TIA       +   +  +N T   LIPK+     +S +R +S C   YKIIT
Subjt:  EEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVGDTTIANCLAILNSEAPIHSWNHTNFVLIPKICQPRLVSDYRQISLCNFSYKIIT


Sequences
CDS sequence
ATGACGGATCAAGGAGCGCTTGATCAGGTGCTTGGTCATGTACCCACGAAAGTCACTGAGGAGATGAACGAGTTGCTAGTCGTTCCTTATACCCGAGAGGAAGTGTTTTC
AGCGGTTAAAAGTTTTCACTCGACTAAGGCCCCTGGGCCAGACGGTTTTCCTGCTGTATTTTTTCAGAGGTACTGGGTGATTGTAGGCGATACTACGATAGCAAATTGCC
TTGCGATTCTGAATTCGGAAGCACCAATACATTCATGGAACCATACTAATTTTGTTCTCATACCAAAAATTTGCCAACCGAGGTTAGTCTCTGATTATCGCCAAATTAGT
CTGTGTAATTTCTCGTATAAGATTATTACTAAGGTCATAGCCAACAGACTTAAGATTGTACTAAATGAGATTATTGATGAATGTCAATTTGCTTTTATTCTGGGACGATC
TATTACGGATAATATGATATTGGGTCATGAGTCGTTACATTTCATTCATAGCAAGAGGAAAGGTAAAACAAGATATGCGGCTCTGAAACTTGATATGAGCAAGGCATATA
TGAAGGGGTGGAATGGTCGTATTTGA
mRNA sequence
ATGACGGATCAAGGAGCGCTTGATCAGGTGCTTGGTCATGTACCCACGAAAGTCACTGAGGAGATGAACGAGTTGCTAGTCGTTCCTTATACCCGAGAGGAAGTGTTTTC
AGCGGTTAAAAGTTTTCACTCGACTAAGGCCCCTGGGCCAGACGGTTTTCCTGCTGTATTTTTTCAGAGGTACTGGGTGATTGTAGGCGATACTACGATAGCAAATTGCC
TTGCGATTCTGAATTCGGAAGCACCAATACATTCATGGAACCATACTAATTTTGTTCTCATACCAAAAATTTGCCAACCGAGGTTAGTCTCTGATTATCGCCAAATTAGT
CTGTGTAATTTCTCGTATAAGATTATTACTAAGGTCATAGCCAACAGACTTAAGATTGTACTAAATGAGATTATTGATGAATGTCAATTTGCTTTTATTCTGGGACGATC
TATTACGGATAATATGATATTGGGTCATGAGTCGTTACATTTCATTCATAGCAAGAGGAAAGGTAAAACAAGATATGCGGCTCTGAAACTTGATATGAGCAAGGCATATA
TGAAGGGGTGGAATGGTCGTATTTGA
Protein sequence
MTDQGALDQVLGHVPTKVTEEMNELLVVPYTREEVFSAVKSFHSTKAPGPDGFPAVFFQRYWVIVGDTTIANCLAILNSEAPIHSWNHTNFVLIPKICQPRLVSDYRQIS
LCNFSYKIITKVIANRLKIVLNEIIDECQFAFILGRSITDNMILGHESLHFIHSKRKGKTRYAALKLDMSKAYMKGWNGRI