CuGenDBv2

Lag0032845 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0032845
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr11:38164367..38164855
RNA-Seq Expression: Lag0032845
Synteny: Lag0032845
Gene Ontology terms:
  GO:0090304 - nucleic acid metabolic process (biological process)
  GO:0140640 - catalytic activity, acting on a nucleic acid (molecular function)
InterPro domains:
  IPR000477 - Reverse transcriptase domain
  IPR043502 - DNA/RNA polymerase superfamily
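
The genome location field above uses a `seqid:start..end` notation. As a minimal sketch, it can be parsed and the span length computed as follows. The helper names (`Location`, `parse_location`) are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this page does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval such as chr11:38164367..38164855."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches "<seqid>:<start>..<end>" with integer coordinates.
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match["seqid"], start, end)


if __name__ == "__main__":
    loc = parse_location("chr11:38164367..38164855")
    print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the span is 489 bp, which is consistent with a 162-residue protein plus a stop codon (163 × 3 = 489).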


Homology
GenBank top hits (e value, % identity, alignment)
CCA66036.1 hypothetical protein [Beta vulgaris subsp. vulgaris] (e value: 3.9e-44; % identity: 57.41)
Query:  MDRPFTKEEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTLIALIPKTKDPKRMAEFRPISLCNVIYKAVAKAIAN
        +D PF +EE+ AA+  M+P KA  PDG +A  YQ  WDT+GED     L +LNN DN+  +N T I LIPK K  +   +FRPISLCNV+YK VAK +AN
Subjt:  MDRPFTKEEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTLIALIPKTKDPKRMAEFRPISLCNVIYKAVAKAIAN

Query:  RMKTALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRVE
        RMK  L  +I  SQS F+ GR ITDNV+VA+EC H L KK+ GKKG L +KLDMSK YDRVE
Subjt:  RMKTALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRVE

KAF5471784.1 hypothetical protein F2P56_008554 [Juglans regia] (e value: 5.6e-43; % identity: 53.46)
Query:  PFTKEEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTLIALIPKTKDPKRMAEFRPISLCNVIYKAVAKAIANRMK
        PFT+EEI AA+  MNP  +  PDG  A+ YQK W+ +GE      L  LN+  ++ ++N+T I+LIPK + PK++A++RPISLCNV+YK V+K IANR+K
Subjt:  PFTKEEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTLIALIPKTKDPKRMAEFRPISLCNVIYKAVAKAIANRMK

Query:  TALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRVE
          L  II+P+QSAF++GR I+DN +VA+E +H++N + +GKKG +A+KLDMSK YDRVE
Subjt:  TALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRVE

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber] (e value: 9.5e-43; % identity: 56.96)
Query:  FTKEEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTLIALIPKTKDPKRMAEFRPISLCNVIYKAVAKAIANRMKT
        FT+EEIE A+  M+PTKA  PDG  A  +QK W+ +G D + + L +LN+  +M  IN T I L+PK K+P +M++FRPISLCNV+YK ++K +ANR+K 
Subjt:  FTKEEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTLIALIPKTKDPKRMAEFRPISLCNVIYKAVAKAIANRMKT

Query:  ALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRVE
         L  IIS +QSAF+SGR ITDNV+VAFE +H L  K+ GK+G  AIKLDMSK YDRVE
Subjt:  ALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRVE

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata] (e value: 4.3e-43; % identity: 53.7)
Query:  MDRPFTKEEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTLIALIPKTKDPKRMAEFRPISLCNVIYKAVAKAIAN
        +   FT EE++AA+  M PTKA  PDG +A  YQK W  +G+  +   L  LNN + +  IN+T I LIPK ++P+RM+EFRPISLCNVIYK ++K +AN
Subjt:  MDRPFTKEEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTLIALIPKTKDPKRMAEFRPISLCNVIYKAVAKAIAN

Query:  RMKTALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRVE
        R+K  L  IIS +QSAF+ GR ITDNV+VA+E +H ++ +++GKKG++A+KLD+SK YDRVE
Subjt:  RMKTALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRVE

XP_042962379.1 uncharacterized protein LOC122296642 [Carya illinoinensis] (e value: 2.5e-43; % identity: 51.85)
Query:  MDRPFTKEEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTLIALIPKTKDPKRMAEFRPISLCNVIYKAVAKAIAN
        + +PFT+EE+ +A+  MNP  +  PDG  A  YQK W+ +GE+     L +LN   +++++N+T I+LIPK K+PKR+AEFRPISLCNV+YK V+K +AN
Subjt:  MDRPFTKEEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTLIALIPKTKDPKRMAEFRPISLCNVIYKAVAKAIAN

Query:  RMKTALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRVE
        RMK  L  +IS +QSAF+ GR I+DN++VA+E +H++N + +GK+G +AIK+DMSK YDR+E
Subjt:  RMKTALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRVE

TrEMBL top hits (e value, % identity, alignment)
A0A2N9HRH8 Reverse transcriptase domain-containing protein (e value: 6.0e-43; % identity: 56.25)
Query:  RPFTKEEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTLIALIPKTKDPKRMAEFRPISLCNVIYKAVAKAIANRM
        +PFT +E+  A+  M+P+KA  PDG  +  +QK W+ +G D +   L +LN+   +  IN T I+LIPK K+P+RM+E+RPISLCNV+YK ++K +ANR+
Subjt:  RPFTKEEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTLIALIPKTKDPKRMAEFRPISLCNVIYKAVAKAIANRM

Query:  KTALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRVE
        K  L  IIS SQSAF+ GR ITDNV VAFE IH +  KRRGKKG +AIKLDMSK YDRVE
Subjt:  KTALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRVE

A0A2N9HU09 Reverse transcriptase domain-containing protein (e value: 6.0e-43; % identity: 56.25)
Query:  RPFTKEEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTLIALIPKTKDPKRMAEFRPISLCNVIYKAVAKAIANRM
        +PFT +E+  A+  M+P+KA  PDG  +  +QK W+ +G D +   L +LN+   +  IN T I+LIPK K+P+RM+E+RPISLCNV+YK ++K +ANR+
Subjt:  RPFTKEEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTLIALIPKTKDPKRMAEFRPISLCNVIYKAVAKAIANRM

Query:  KTALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRVE
        K  L  IIS SQSAF+ GR ITDNV VAFE IH +  KRRGKKG +AIKLDMSK YDRVE
Subjt:  KTALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRVE

A0A2N9I475 Reverse transcriptase domain-containing protein (e value: 6.0e-43; % identity: 56.25)
Query:  RPFTKEEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTLIALIPKTKDPKRMAEFRPISLCNVIYKAVAKAIANRM
        +PFT +E+  A+  M+P+KA  PDG  +  +QK W+ +G D +   L +LN+   +  IN T I+LIPK K+P+RM+E+RPISLCNV+YK ++K +ANR+
Subjt:  RPFTKEEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTLIALIPKTKDPKRMAEFRPISLCNVIYKAVAKAIANRM

Query:  KTALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRVE
        K  L  IIS SQSAF+ GR ITDNV VAFE IH +  KRRGKKG +AIKLDMSK YDRVE
Subjt:  KTALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRVE

A0A2N9IT57 Reverse transcriptase domain-containing protein (e value: 6.0e-43; % identity: 56.25)
Query:  RPFTKEEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTLIALIPKTKDPKRMAEFRPISLCNVIYKAVAKAIANRM
        +PFT +E+  A+  M+P+KA  PDG  +  +QK W+ +G D +   L +LN+   +  IN T I+LIPK K+P+RM+E+RPISLCNV+YK ++K +ANR+
Subjt:  RPFTKEEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTLIALIPKTKDPKRMAEFRPISLCNVIYKAVAKAIANRM

Query:  KTALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRVE
        K  L  IIS SQSAF+ GR ITDNV VAFE IH +  KRRGKKG +AIKLDMSK YDRVE
Subjt:  KTALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRVE

F4NCI4 Reverse transcriptase domain-containing protein (e value: 1.9e-44; % identity: 57.41)
Query:  MDRPFTKEEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTLIALIPKTKDPKRMAEFRPISLCNVIYKAVAKAIAN
        +D PF +EE+ AA+  M+P KA  PDG +A  YQ  WDT+GED     L +LNN DN+  +N T I LIPK K  +   +FRPISLCNV+YK VAK +AN
Subjt:  MDRPFTKEEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTLIALIPKTKDPKRMAEFRPISLCNVIYKAVAKAIAN

Query:  RMKTALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRVE
        RMK  L  +I  SQS F+ GR ITDNV+VA+EC H L KK+ GKKG L +KLDMSK YDRVE
Subjt:  RMKTALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRVE

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein (e value: 4.0e-12; % identity: 30.06)
Query:  MDRPFTKEEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTLIALIPKT-KDPKRMAEFRPISLCNVIYKAVAKAIA
        ++RP T  EI A I  +   K+  PDG  A+ YQ+  + L    +K+   I        +     I LIPK  +D  +   FRPISL N+  K + K +A
Subjt:  MDRPFTKEEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTLIALIPKT-KDPKRMAEFRPISLCNVIYKAVAKAIA

Query:  NRMKTALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRVE
        NR++  +  +I   Q  FI G Q   N+  +   I  +N+ +   K ++ I +D  K +D+++
Subjt:  NRMKTALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRVE

P08548 LINE-1 reverse transcriptase homolog (e value: 3.3e-14; % identity: 30.54)
Query:  MDRPFTKEEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTL----IALIPKT-KDPKRMAEFRPISLCNVIYKAVA
        ++RP +  EI + I+ +   K+  PDG  ++ YQ    T  E+ + + L +  N +    + NT     I LIPK  KDP R   +RPISL N+  K + 
Subjt:  MDRPFTKEEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTL----IALIPKT-KDPKRMAEFRPISLCNVIYKAVA

Query:  KAIANRMKTALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRVE
        K + NR++  +  II   Q  FI G Q   N+  +   I  +NK +   K ++ + +D  K +D ++
Subjt:  KAIANRMKTALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRVE

P11369 LINE-1 retrotransposable element ORF2 protein (e value: 7.1e-17; % identity: 34.13)
Query:  MDRPFTKEEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTL----IALIPK-TKDPKRMAEFRPISLCNVIYKAVA
        ++ P + +EIEA I  +   K+  PDG  A+ YQ    T  ED I +   + +  +    + N+     I LIPK  KDP ++  FRPISL N+  K + 
Subjt:  MDRPFTKEEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTL----IALIPK-TKDPKRMAEFRPISLCNVIYKAVA

Query:  KAIANRMKTALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRVE
        K +ANR++  +  II P Q  FI G Q   N+  +   IH +NK +   K ++ I LD  K +D+++
Subjt:  KAIANRMKTALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRVE

P14381 Transposon TX1 uncharacterized 149 kDa protein (e value: 2.5e-17; % identity: 32.72)
Query:  MDRPFTKEEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTLIALIPKTKDPKRMAEFRPISLCNVIYKAVAKAIAN
        ++ P T +E+  A++ M   K+   DG   + +Q  WDTLG D  +V        +   +    +++L+PK  D + +  +RP+SL +  YK VAKAI+ 
Subjt:  MDRPFTKEEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTLIALIPKTKDPKRMAEFRPISLCNVIYKAVAKAIAN

Query:  RMKTALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRVE
        R+K+ L  +I P QS  + GR I DNV +  + +H     RR       + LD  K +DRV+
Subjt:  RMKTALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRVE

Arabidopsis top hits (e value, % identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein (e value: 4.8e-08; % identity: 32.18)
Query:  EEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTLIALIPKTKDPKRMAEFRPISLCNVIYKAV
        +EI AA+  M   KA  PD   A+ + + W  + + TI            ++  N T I LIPK     +++ FRP+S C V+YK +
Subjt:  EEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTLIALIPKTKDPKRMAEFRPISLCNVIYKAV

AT4G20520.1 RNA binding; RNA-directed DNA polymerases (e value: 5.6e-09; % identity: 39.06)
Query:  IANRMKTALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRV
        +  R+K  +  +I P+Q++FI GR  TDN++   E +H++ +K +G KG + +KLD+ K YDR+
Subjt:  IANRMKTALDTIISPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRV


Sequences
CDS sequence
ATGGACAGACCTTTCACCAAAGAGGAGATTGAAGCTGCTATCAAAGGAATGAACCCTACAAAGGCTTCGAGTCCTGACGGAGCCCACGCCAAGCTGTATCAGAAAGGTTG
GGACACCCTAGGGGAAGACACCATTAAGGTCTGCTTAGGAATTCTAAACAATAGAGACAACATGGAAAACATCAACAACACATTGATAGCTCTCATCCCTAAAACTAAGG
ACCCGAAAAGGATGGCGGAATTCAGACCTATTAGCCTATGCAACGTGATCTACAAGGCAGTGGCCAAAGCAATAGCTAATAGAATGAAGACGGCCCTAGATACTATCATC
TCCCCAAGTCAATCAGCATTCATCTCCGGCAGACAAATAACGGATAATGTGATAGTGGCGTTTGAATGCATCCACGCCTTGAACAAAAAAAGAAGGGGCAAAAAAGGGAA
CCTTGCCATAAAGCTTGACATGAGCAAAACTTATGACAGGGTGGAATGA
mRNA sequence
ATGGACAGACCTTTCACCAAAGAGGAGATTGAAGCTGCTATCAAAGGAATGAACCCTACAAAGGCTTCGAGTCCTGACGGAGCCCACGCCAAGCTGTATCAGAAAGGTTG
GGACACCCTAGGGGAAGACACCATTAAGGTCTGCTTAGGAATTCTAAACAATAGAGACAACATGGAAAACATCAACAACACATTGATAGCTCTCATCCCTAAAACTAAGG
ACCCGAAAAGGATGGCGGAATTCAGACCTATTAGCCTATGCAACGTGATCTACAAGGCAGTGGCCAAAGCAATAGCTAATAGAATGAAGACGGCCCTAGATACTATCATC
TCCCCAAGTCAATCAGCATTCATCTCCGGCAGACAAATAACGGATAATGTGATAGTGGCGTTTGAATGCATCCACGCCTTGAACAAAAAAAGAAGGGGCAAAAAAGGGAA
CCTTGCCATAAAGCTTGACATGAGCAAAACTTATGACAGGGTGGAATGA
Protein sequence
MDRPFTKEEIEAAIKGMNPTKASSPDGAHAKLYQKGWDTLGEDTIKVCLGILNNRDNMENINNTLIALIPKTKDPKRMAEFRPISLCNVIYKAVAKAIANRMKTALDTII
SPSQSAFISGRQITDNVIVAFECIHALNKKRRGKKGNLAIKLDMSKTYDRVE