; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027829 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027829
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr8:5657829..5658395
RNA-Seq ExpressionLag0027829
SyntenyLag0027829
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBN67876.1 transposable element gene, partial [Prunus dulcis]2.1e-5456.35Show/hide
Query:  IISPNQAAFVPKRLITDNVLVGFECLHSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKVMDCIETVEYSVLINGEPQEAFYLRR
        +IS +Q+AFVP RLITDN +V FE  H L  +R+G++G +A+KLDMSKAYDRVEWEF++K ML MGF   WV+ VMDC+ TV YS L+NGEP    Y  R
Subjt:  IISPNQAAFVPKRLITDNVLVGFECLHSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKVMDCIETVEYSVLINGEPQEAFYLRR

Query:  GLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDSLILCRAEDSDCRTIKRVLHTYEEASG
        GLRQGDPLSPY+FL+CAEG +TLL++ E      +G+ I +  P++SHLFFADDS +  +A D++C  +K +   YE ASG
Subjt:  GLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDSLILCRAEDSDCRTIKRVLHTYEEASG

VVA38592.1 PREDICTED: reverse mRNAase, partial [Prunus dulcis]1.5e-5555.32Show/hide
Query:  MKHTLDYIISPNQAAFVPKRLITDNVLVGFECLHSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKVMDCIETVEYSVLINGEPQ
        MK  +  +IS +Q+AFVP RLITDN +V FE  H L  +R+G++G +A+KLDMSKAYDRVEWEF++K ML MGF   WV+ VMDC+ TV YS L+NGEP 
Subjt:  MKHTLDYIISPNQAAFVPKRLITDNVLVGFECLHSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKVMDCIETVEYSVLINGEPQ

Query:  EAFYLRRGLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDSLILCRAEDSDCRTIKRVLHTYEEASG
           Y  RGLRQGDPLSPY+FL+CAEG +TLL++ E      +G+ I +  P++SHLFFADDS +  +A D++C  +K +   YE ASG
Subjt:  EAFYLRRGLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDSLILCRAEDSDCRTIKRVLHTYEEASG

XP_022157437.1 uncharacterized protein LOC111024135 [Momordica charantia]6.4e-5957.22Show/hide
Query:  MKHTLDYIISPNQAAFVPKRLITDNVLVGFECLHSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKVMDCIETVEYSVLINGEPQ
        +K  +D +ISP Q+AFVP R ITDN ++GFEC++++ NKR GK G +AMKLDMSKAYDRVEW++++  M+KMGF   WV  +M+C+E+V ++VLING P 
Subjt:  MKHTLDYIISPNQAAFVPKRLITDNVLVGFECLHSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKVMDCIETVEYSVLINGEPQ

Query:  EAFYLRRGLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDSLILCRAEDSDCRTIKRVLHTYEEAS
        + F   RGLRQGDPLSPY+F++CAEGLS L+N EE    N   LKIN+RCP +SHLF+ADD L+  +A   +CR+IK +L +YE+AS
Subjt:  EAFYLRRGLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDSLILCRAEDSDCRTIKRVLHTYEEAS

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]9.6e-5555.85Show/hide
Query:  MKHTLDYIISPNQAAFVPKRLITDNVLVGFECLHSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKVMDCIETVEYSVLINGEPQ
        +K+ L  IIS NQ+AF+  RLITDNVLV FE +H L +K++GKEG+ A+KLDMSKAYDRVEW FIK+ M KMGF E W++ VM CI +V YS+L+NG   
Subjt:  MKHTLDYIISPNQAAFVPKRLITDNVLVGFECLHSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKVMDCIETVEYSVLINGEPQ

Query:  EAFYLRRGLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDSLILCRAEDSDCRTIKRVLHTYEEASG
         +    RGLRQGDP+SPYIFL+CA+G S+LLN      +   G+ I + CP ++HLFFADDSL+ C+A   +C+T+  +L  YE+ASG
Subjt:  EAFYLRRGLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDSLILCRAEDSDCRTIKRVLHTYEEASG

XP_042944540.1 uncharacterized protein LOC122278417 [Carya illinoinensis]1.6e-5456.91Show/hide
Query:  MKHTLDYIISPNQAAFVPKRLITDNVLVGFECLHSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKVMDCIETVEYSVLINGEPQ
        +K  L  II+P Q+AFVP R+I DNV+V FE LHS++ K KG++GY+A+KLDMSKAYDRVEWEF++K M+KMGF   WV  VM CI TV YS+L+NG PQ
Subjt:  MKHTLDYIISPNQAAFVPKRLITDNVLVGFECLHSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKVMDCIETVEYSVLINGEPQ

Query:  EAFYLRRGLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDSLILCRAEDSDCRTIKRVLHTYEEASG
         +F   RG+RQGDPLSPY+F++ +E LS+LLN  E+  K   G +I++   S++HLFFADDSL+ CRA   +  +I  +L+ YEEASG
Subjt:  EAFYLRRGLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDSLILCRAEDSDCRTIKRVLHTYEEASG

TrEMBL top hitse value%identityAlignment
A0A2N9EX83 Reverse transcriptase domain-containing protein1.3e-5756.38Show/hide
Query:  MKHTLDYIISPNQAAFVPKRLITDNVLVGFECLHSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKVMDCIETVEYSVLINGEPQ
        +K  L +IIS +Q+AFVP RLI+DN+L+ FE LH + + +  K+GY+A+KLDMSKAYDRVEW F+++ ML MGF E WV  +M+C+ TV YSVLINGEP+
Subjt:  MKHTLDYIISPNQAAFVPKRLITDNVLVGFECLHSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKVMDCIETVEYSVLINGEPQ

Query:  EAFYLRRGLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDSLILCRAEDSDCRTIKRVLHTYEEASG
          F   RGLRQGDP+SPY+FL+CAEGL+ LL +  +  K  +G+ I++  P LSHLFFADDS++ CRA   +C  I+ +L TYE ASG
Subjt:  EAFYLRRGLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDSLILCRAEDSDCRTIKRVLHTYEEASG

A0A2N9F5W1 Reverse transcriptase domain-containing protein2.9e-5756.68Show/hide
Query:  MKHTLDYIISPNQAAFVPKRLITDNVLVGFECLHSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKVMDCIETVEYSVLINGEPQ
        +K  L +IIS +Q+AFVP RLI+DN+L+ FE LH + + +  K+GY+A+KLDMSKAYDRVEW F+++ MLKMGF E WV  +M+C+ TV YSVLINGEP+
Subjt:  MKHTLDYIISPNQAAFVPKRLITDNVLVGFECLHSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKVMDCIETVEYSVLINGEPQ

Query:  EAFYLRRGLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDSLILCRAEDSDCRTIKRVLHTYEEAS
          F   RGLRQGDP+SPY+FL+CAEGL+ LL +  +  K  +G+ I++  P LSHLFFADDS++ CRA   +C  I+ +L TYE AS
Subjt:  EAFYLRRGLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDSLILCRAEDSDCRTIKRVLHTYEEAS

A0A2N9GLU2 Reverse transcriptase domain-containing protein1.1e-5655.32Show/hide
Query:  MKHTLDYIISPNQAAFVPKRLITDNVLVGFECLHSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKVMDCIETVEYSVLINGEPQ
        +K  L ++IS  Q+AFVP RLI+DN+L+ FE LH ++  R+G++GY+A+KLDMSKAYDRVEW F++K M  MGF + WV  +M+C+ +V YSVLINGEP+
Subjt:  MKHTLDYIISPNQAAFVPKRLITDNVLVGFECLHSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKVMDCIETVEYSVLINGEPQ

Query:  EAFYLRRGLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDSLILCRAEDSDCRTIKRVLHTYEEASG
          F+  RGLRQGDP+SPY+FL+CAEGL  LL+R  +  +  +GL I++  P L+HLFFADDS++ CRA   +C TI  +L  YE ASG
Subjt:  EAFYLRRGLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDSLILCRAEDSDCRTIKRVLHTYEEASG

A0A2N9IP69 Reverse transcriptase domain-containing protein7.7e-5859.36Show/hide
Query:  KHTLDYIISPNQAAFVPKRLITDNVLVGFECLHSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKVMDCIETVEYSVLINGEPQE
        K  L YIIS  Q+AFVP RLITDN+L+ FE LH +NN+R GK G +A+KLDMSKAYDRVEW F+K+ MLKMGF   WV  +M+CI TV YS+LINGEP  
Subjt:  KHTLDYIISPNQAAFVPKRLITDNVLVGFECLHSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKVMDCIETVEYSVLINGEPQE

Query:  AFYLRRGLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDSLILCRAEDSDCRTIKRVLHTYEEASG
             RGLRQGDP+SPY+FL+CAEGL+ LLN+      +  G+ I +R P L+HLFFADDSL+ CRA  ++C  I+ VL  YE  SG
Subjt:  AFYLRRGLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDSLILCRAEDSDCRTIKRVLHTYEEASG

A0A6J1DUG8 uncharacterized protein LOC1110241353.1e-5957.22Show/hide
Query:  MKHTLDYIISPNQAAFVPKRLITDNVLVGFECLHSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKVMDCIETVEYSVLINGEPQ
        +K  +D +ISP Q+AFVP R ITDN ++GFEC++++ NKR GK G +AMKLDMSKAYDRVEW++++  M+KMGF   WV  +M+C+E+V ++VLING P 
Subjt:  MKHTLDYIISPNQAAFVPKRLITDNVLVGFECLHSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKVMDCIETVEYSVLINGEPQ

Query:  EAFYLRRGLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDSLILCRAEDSDCRTIKRVLHTYEEAS
        + F   RGLRQGDPLSPY+F++CAEGLS L+N EE    N   LKIN+RCP +SHLF+ADD L+  +A   +CR+IK +L +YE+AS
Subjt:  EAFYLRRGLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDSLILCRAEDSDCRTIKRVLHTYEEAS

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein2.0e-1525.97Show/hide
Query:  IISPNQAAFVPKRLITDNVLVGFECLHSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKVMDCIETVEYSVLINGEPQEAFYLRR
        +I  +Q  F+P      N+      +  +N  R   + ++ + +D  KA+D+++  F+ KT+ K+G    +++ +    +    ++++NG+  EAF L+ 
Subjt:  IISPNQAAFVPKRLITDNVLVGFECLHSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKVMDCIETVEYSVLINGEPQEAFYLRR

Query:  GLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDSLILCRAEDSDCRTIKRVLHTYEEASG
        G RQG PLSP +F I  E L+  + +E    K  KG+++ K    LS   FADD ++         + + +++  + + SG
Subjt:  GLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDSLILCRAEDSDCRTIKRVLHTYEEASG

P08548 LINE-1 reverse transcriptase homolog2.5e-1327.62Show/hide
Query:  IISPNQAAFVPKRLITDNVLVGFECLHSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKVMDCIETVEYSVLINGEPQEAFYLRR
        II  +Q  F+P      N+      +  + NK K K+ ++ + +D  KA+D ++  F+ +T+ K+G +  +++ +         ++++NG   ++F LR 
Subjt:  IISPNQAAFVPKRLITDNVLVGFECLHSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKVMDCIETVEYSVLINGEPQEAFYLRR

Query:  GLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDSLILCRAEDSDCRTIKRVLHTYEEASG
        G RQG PLSP +F I  E L+  +  E    K  KG+ I      LS   FADD ++           +  V+  Y   SG
Subjt:  GLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDSLILCRAEDSDCRTIKRVLHTYEEASG

P11369 LINE-1 retrotransposable element ORF2 protein7.0e-1628.73Show/hide
Query:  IISPNQAAFVPKRLITDNVLVGFECLHSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKVMDCIETVEYSVLINGEPQEAFYLRR
        II P+Q  F+P      N+      +H + NK K K  ++ + LD  KA+D+++  F+ K + + G +  ++  +         ++ +NGE  EA  L+ 
Subjt:  IISPNQAAFVPKRLITDNVLVGFECLHSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKVMDCIETVEYSVLINGEPQEAFYLRR

Query:  GLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDSLILCRAEDSDCRTIKRVLHTYEEASG
        G RQG PLSPY+F I  E L+  + ++    K  KG++I K    +S L  ADD ++      +  R +  +++++ E  G
Subjt:  GLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDSLILCRAEDSDCRTIKRVLHTYEEASG

P92555 Uncharacterized mitochondrial protein AtMg012501.8e-1146.38Show/hide
Query:  LINGEPQEAFYLRRGLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDS
        +ING PQ      RGLRQGDPLSPY+F++C E LS L  R +   +   G++++   P ++HL FADD+
Subjt:  LINGEPQEAFYLRRGLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDS

Q05118 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)1.8e-1131.65Show/hide
Query:  HSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKVMDCIETVEYSVLINGEPQEAFYLRRGLRQGDPLSPYIFLICAEGLSTLLNR
        H +  +R   + Y  + LD+ KA+D V    I + M   G  +     +M  I     ++++ G      Y+R G++QGDPLSP +F I  + L T LN 
Subjt:  HSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKVMDCIETVEYSVLINGEPQEAFYLRRGLRQGDPLSPYIFLICAEGLSTLLNR

Query:  EEMHFKNFKGLKINKRCPSLSHLFFADDSLILCRAEDSD
        E+       G  +   C  ++ L FADD L+L   ED D
Subjt:  EEMHFKNFKGLKINKRCPSLSHLFFADDSLILCRAEDSD

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases7.9e-1539.02Show/hide
Query:  MKHTLDYIISPNQAAFVPKRLITDNVLVGFECLHSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKV
        +K  +  +I P QA+F+P R+ TDN++   E +HS+  K KG +G++ +KLD+ KAYDR+ W++++ T++  GF E W+ ++
Subjt:  MKHTLDYIISPNQAAFVPKRLITDNVLVGFECLHSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKV

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.3e-1246.38Show/hide
Query:  LINGEPQEAFYLRRGLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDS
        +ING PQ      RGLRQGDPLSPY+F++C E LS L  R +   +   G++++   P ++HL FADD+
Subjt:  LINGEPQEAFYLRRGLRQGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCACACCCTAGACTACATTATATCCCCCAATCAAGCAGCTTTTGTTCCAAAAAGGCTCATCACGGATAATGTGCTTGTGGGTTTTGAATGCCTTCACTCTCTAAA
CAACAAAAGAAAAGGCAAGGAAGGGTACATAGCCATGAAGCTGGACATGAGCAAGGCTTACGATCGTGTGGAATGGGAGTTCATAAAGAAAACAATGCTGAAGATGGGCT
TTAAAGAAGACTGGGTTCAAAAAGTAATGGATTGCATCGAAACAGTGGAGTACTCAGTGTTGATAAATGGGGAACCCCAAGAAGCTTTCTATCTCAGGAGAGGACTACGA
CAGGGAGATCCTTTATCGCCATACATATTTCTGATTTGTGCTGAAGGCCTATCTACACTTCTAAACAGGGAAGAAATGCACTTTAAAAATTTTAAAGGCTTAAAGATCAA
CAAACGGTGCCCCTCACTTTCTCATTTGTTTTTTGCAGATGATAGTCTAATTCTGTGCAGAGCGGAGGACTCGGATTGCAGGACTATAAAGAGAGTTCTACACACCTATG
AAGAAGCCTCGGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGCACACCCTAGACTACATTATATCCCCCAATCAAGCAGCTTTTGTTCCAAAAAGGCTCATCACGGATAATGTGCTTGTGGGTTTTGAATGCCTTCACTCTCTAAA
CAACAAAAGAAAAGGCAAGGAAGGGTACATAGCCATGAAGCTGGACATGAGCAAGGCTTACGATCGTGTGGAATGGGAGTTCATAAAGAAAACAATGCTGAAGATGGGCT
TTAAAGAAGACTGGGTTCAAAAAGTAATGGATTGCATCGAAACAGTGGAGTACTCAGTGTTGATAAATGGGGAACCCCAAGAAGCTTTCTATCTCAGGAGAGGACTACGA
CAGGGAGATCCTTTATCGCCATACATATTTCTGATTTGTGCTGAAGGCCTATCTACACTTCTAAACAGGGAAGAAATGCACTTTAAAAATTTTAAAGGCTTAAAGATCAA
CAAACGGTGCCCCTCACTTTCTCATTTGTTTTTTGCAGATGATAGTCTAATTCTGTGCAGAGCGGAGGACTCGGATTGCAGGACTATAAAGAGAGTTCTACACACCTATG
AAGAAGCCTCGGGATAG
Protein sequenceShow/hide protein sequence
MKHTLDYIISPNQAAFVPKRLITDNVLVGFECLHSLNNKRKGKEGYIAMKLDMSKAYDRVEWEFIKKTMLKMGFKEDWVQKVMDCIETVEYSVLINGEPQEAFYLRRGLR
QGDPLSPYIFLICAEGLSTLLNREEMHFKNFKGLKINKRCPSLSHLFFADDSLILCRAEDSDCRTIKRVLHTYEEASG