; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001221 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001221
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr4:26983002..26983553
RNA-Seq ExpressionLag0001221
SyntenyLag0001221
Gene Ontology termsGO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF5447348.1 hypothetical protein F2P56_032905 [Juglans regia]5.0e-4859.21Show/hide
Query:  MSNFKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAE
        +S+++PISLCNVIYK+V+KA+ANR+K +L DIIS TQ+AFVP RLI DN+I+ FE +H ++++  GK GL+A+KLDM+KAYDR+EW FL+++MLK+ FA 
Subjt:  MSNFKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAE

Query:  GWTRKVINCIESVQYVVLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLS
         W + V+NC+E+V Y +++NG  QE   P RG+RQGDPLSPYLF+LC E L+
Subjt:  GWTRKVINCIESVQYVVLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLS

XP_035539661.1 uncharacterized protein LOC118344031 [Juglans regia]2.6e-4963.76Show/hide
Query:  FKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAEGWT
        F+PISLCNVIYKI++KA+ANR+K+VL  IISP+Q+AFVP RLI DNVI+ FE  H +N K  GK G +A+KLDM+KAYDRVEW FLR ++  L F+  W 
Subjt:  FKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAEGWT

Query:  RKVINCIESVQYVVLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLS
          V+ CIE+V Y +++NGV Q E YP RGLRQGDPLSPYLF+LC E LS
Subjt:  RKVINCIESVQYVVLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLS

XP_039834390.1 uncharacterized protein LOC120695147 [Panicum virgatum]5.9e-4960.39Show/hide
Query:  MSNFKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAE
        + + +PISLCNV+YK+++K LANR+K VL +IISP+Q+AFVP RLI DNV+L +E  H LN +R GK G+ A+KLDM+KAYDRVEW FL KMMLKL FA 
Subjt:  MSNFKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAE

Query:  GWTRKVINCIESVQYVVLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLSVI
         W  +V+ C+ +V Y + +NG    +IYP+RGLRQG+PLSPYLF+LC EGLS +
Subjt:  GWTRKVINCIESVQYVVLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLSVI

XP_042952220.1 uncharacterized protein LOC122289300 [Carya illinoinensis]1.9e-4753.85Show/hide
Query:  MSNFKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAE
        ++ F PISLCNV YK+++K +ANR+K+VL  IISP+Q+AFVP RLI DN+I+ FE +H +  K  GK G +A+KLDM+KAYDRVEW FLR ++ KL F++
Subjt:  MSNFKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAE

Query:  GWTRKVINCIESVQYVVLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLSVISLYLSFMRGRLVY----TSNQERISKQF
         W   V+NCIE+V Y +L+NG  Q    P RG+RQGDPLSPYLF+LC E LS  +L      GRL++      +Q RIS  F
Subjt:  GWTRKVINCIESVQYVVLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLSVISLYLSFMRGRLVY----TSNQERISKQF

XP_042972818.1 uncharacterized protein LOC122304624 [Carya illinoinensis]5.0e-4857.24Show/hide
Query:  MSNFKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAE
        ++ ++PISLCNV+YKIV+KA++NR+K++L  IISPTQ+AF+P RLI DN+++ FE +H++  +++GK G +A+KLDM+KAYDRVEW FL  +M KL F E
Subjt:  MSNFKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAE

Query:  GWTRKVINCIESVQYVVLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLS
         W + ++ C++SV Y VL+NG+  ++ +PKRGLRQGDPLSPYLF++C EGLS
Subjt:  GWTRKVINCIESVQYVVLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLS

TrEMBL top hitse value%identityAlignment
A0A2N9E8U7 Reverse transcriptase domain-containing protein5.4e-4863.64Show/hide
Query:  MSNFKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAE
        MS+++PISLCNVIYKI++K LANR+K VL  IIS  Q+AFVP RLI DNV + FE IH L  KRKGK G +A+KLDM+KAYDRVEW FL  +M KL FA+
Subjt:  MSNFKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAE

Query:  GWTRKVINCIESVQYVVLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLSVI
         W   +++CI SVQY VL++GV    I P RG+RQGDPLSPYLFL+C EGLS +
Subjt:  GWTRKVINCIESVQYVVLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLSVI

A0A2N9FP20 Reverse transcriptase domain-containing protein9.2e-4860.61Show/hide
Query:  MSNFKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAE
        MS+++PISLCNVIYKI++K +ANR+K VL  IIS +Q+AFVP RLI DNV + FE +H +  KRKGK G +A+KLDM+KAYDRVEW F+  MM KL FAE
Subjt:  MSNFKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAE

Query:  GWTRKVINCIESVQYVVLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLSVISLYLSFMRGRL
         W   ++ CI +VQY VL++GV +  + P RGLRQGDPLSPYLFLLC EGLS +      M GRL
Subjt:  GWTRKVINCIESVQYVVLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLSVISLYLSFMRGRL

A0A2N9G933 Reverse transcriptase domain-containing protein9.2e-4860.61Show/hide
Query:  MSNFKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAE
        MS+++PISLCNVIYKI++K +ANR+K VL  IIS +Q+AFVP RLI DNV + FE +H +  KRKGK G +A+KLDM+KAYDRVEW F+  MM KL FAE
Subjt:  MSNFKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAE

Query:  GWTRKVINCIESVQYVVLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLSVISLYLSFMRGRL
         W   ++ CI +VQY VL++GV +  + P RGLRQGDPLSPYLFLLC EGLS +      M GRL
Subjt:  GWTRKVINCIESVQYVVLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLSVISLYLSFMRGRL

A0A2N9GQE6 Reverse transcriptase domain-containing protein9.2e-4860Show/hide
Query:  MSNFKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAE
        MS+++PISLCNV+YKI++K LANR+K VL  IIS +Q+AFVP RLI DNV + FE IH L  KRKG+ G +A+KLDM+KAYDRVEW FL  +M KL FA+
Subjt:  MSNFKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAE

Query:  GWTRKVINCIESVQYVVLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLSVISLYLSFMRGRL
         W   +++C+ SVQY VL++GV +  I P RG+RQGDPLSPYLFL+C EGLS +    S + GRL
Subjt:  GWTRKVINCIESVQYVVLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLSVISLYLSFMRGRL

A0A6P9DWY7 uncharacterized protein LOC1183440311.3e-4963.76Show/hide
Query:  FKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAEGWT
        F+PISLCNVIYKI++KA+ANR+K+VL  IISP+Q+AFVP RLI DNVI+ FE  H +N K  GK G +A+KLDM+KAYDRVEW FLR ++  L F+  W 
Subjt:  FKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAEGWT

Query:  RKVINCIESVQYVVLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLS
          V+ CIE+V Y +++NGV Q E YP RGLRQGDPLSPYLF+LC E LS
Subjt:  RKVINCIESVQYVVLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLS

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein3.2e-1334.44Show/hide
Query:  NFKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAEGW
        NF+PISL N+  KI+ K LANR+++ +  +I   Q  F+P      N+      I  + N+ K K  +I + +D  KA+D+++  F+ K + KL   +G 
Subjt:  NFKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAEGW

Query:  TRKVINCI-ESVQYVVLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLS
          K+I  I +     +++NG   E    K G RQG PLSP LF + +E L+
Subjt:  TRKVINCI-ESVQYVVLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLS

P08548 LINE-1 reverse transcriptase homolog5.4e-1333.55Show/hide
Query:  NFKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAEGW
        N++PISL N+  KI+ K L NR+++ +  II   Q  F+P      N+      I  + NK K K  +I + +D  KA+D ++  F+ + + K+   EG 
Subjt:  NFKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAEGW

Query:  TRKVINCIESVQYV-VLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLSV
          K+I  I S     +++NGV  +    + G RQG PLSP LF + +E L++
Subjt:  TRKVINCIESVQYV-VLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLSV

P11369 LINE-1 retrotransposable element ORF2 protein1.5e-1537.91Show/hide
Query:  MSNFKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAE
        + NF+PISL N+  KI+ K LANR++  +  II P Q  F+P      N+      IH + NK K K  +I + LD  KA+D+++  F+ K +L+ +  +
Subjt:  MSNFKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAE

Query:  GWTRKVINCIESVQYV-VLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLS
        G    +I  I S     + +NG   E I  K G RQG PLSPYLF + +E L+
Subjt:  GWTRKVINCIESVQYV-VLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLS

P14381 Transposon TX1 uncharacterized 149 kDa protein2.6e-1534.55Show/hide
Query:  MSNFKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAE
        + N++P+SL +  YKIVAKA++ R+K VL ++I P Q+  VP R I DNV L  + +H     R+    L  + LD  KA+DRV+  +L   +   +F  
Subjt:  MSNFKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAE

Query:  GWTRKVINCIESVQYVVLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLSVISLYLSFMRGRL
         +   +     S + +V IN      +   RG+RQG PLS  L+ L +E       +L  +R RL
Subjt:  GWTRKVINCIESVQYVVLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLSVISLYLSFMRGRL

P16423 Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM8.9e-0828.86Show/hide
Query:  NFKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAEGW
        +F+PIS+ +V+ + +   LA R+   ++    P Q  F+P     DN  +       L +  K         LD++KA+D +    +   +      +G+
Subjt:  NFKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAEGW

Query:  TRKVINCIESVQYVVLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGL
           V N  E     +  +G S EE  P RG++QGDPLSP LF L ++ L
Subjt:  TRKVINCIESVQYVVLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGL

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases7.2e-1334.88Show/hide
Query:  LANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAEGWTRKV
        +  R+K ++ ++I P QA+F+P R+  DN++   E +H++  K KG  G + +KLD+ KAYDR+ W +L   ++   F E W  ++
Subjt:  LANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAEGWTRKV

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.2e-0768.57Show/hide
Query:  LINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLS
        +ING  Q  + P RGLRQGDPLSPYLF+LC E LS
Subjt:  LINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGAACTTCAAGCCTATTAGTCTTTGCAACGTAATTTACAAAATAGTAGCTAAGGCCCTGGCGAATAGGATGAAGAGGGTGCTTGACGACATTATATCTCCAACTCA
AGCGGCTTTTGTTCCTAAAAGACTCATATTAGATAATGTGATCTTGGGTTTTGAATGTATTCATGCGCTTAATAACAAAAGAAAAGGAAAAGGAGGCCTCATTGCAATGA
AACTGGATATGACCAAGGCTTACGACAGAGTTGAATGGGTGTTTTTAAGGAAGATGATGTTGAAGCTGAATTTTGCTGAGGGTTGGACCAGAAAAGTGATTAATTGCATT
GAGTCGGTGCAATATGTTGTTCTCATCAATGGAGTTTCCCAGGAAGAGATCTATCCAAAGCGAGGCTTGAGACAGGGAGACCCGTTATCTCCCTATCTCTTTCTTTTATG
CGTGGAAGGCTTGTCTGTTATCTCCTTATATCTTTCTTTTATGCGTGGAAGGCTTGTGTACACTTCTAATCAGGAAAGAATTTCTAAACAATTCCAAAGGCTTAAAGATT
AG
mRNA sequenceShow/hide mRNA sequence
ATGTCGAACTTCAAGCCTATTAGTCTTTGCAACGTAATTTACAAAATAGTAGCTAAGGCCCTGGCGAATAGGATGAAGAGGGTGCTTGACGACATTATATCTCCAACTCA
AGCGGCTTTTGTTCCTAAAAGACTCATATTAGATAATGTGATCTTGGGTTTTGAATGTATTCATGCGCTTAATAACAAAAGAAAAGGAAAAGGAGGCCTCATTGCAATGA
AACTGGATATGACCAAGGCTTACGACAGAGTTGAATGGGTGTTTTTAAGGAAGATGATGTTGAAGCTGAATTTTGCTGAGGGTTGGACCAGAAAAGTGATTAATTGCATT
GAGTCGGTGCAATATGTTGTTCTCATCAATGGAGTTTCCCAGGAAGAGATCTATCCAAAGCGAGGCTTGAGACAGGGAGACCCGTTATCTCCCTATCTCTTTCTTTTATG
CGTGGAAGGCTTGTCTGTTATCTCCTTATATCTTTCTTTTATGCGTGGAAGGCTTGTGTACACTTCTAATCAGGAAAGAATTTCTAAACAATTCCAAAGGCTTAAAGATT
AG
Protein sequenceShow/hide protein sequence
MSNFKPISLCNVIYKIVAKALANRMKRVLDDIISPTQAAFVPKRLILDNVILGFECIHALNNKRKGKGGLIAMKLDMTKAYDRVEWVFLRKMMLKLNFAEGWTRKVINCI
ESVQYVVLINGVSQEEIYPKRGLRQGDPLSPYLFLLCVEGLSVISLYLSFMRGRLVYTSNQERISKQFQRLKD