; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0040702 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0040702
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr13:7475520..7477296
RNA-Seq ExpressionLag0040702
SyntenyLag0040702
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050719.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]9.1e-3176.6Show/hide
Query:  MWLKRIVGELSSQEFIPVICCDSLSAIRLAKNPSHHERSKHNDVKLFHFIRDMISYNEVKLVKVHTNQNLPDMLTKVLTAHRFKYLLNELNVKS
        +WLKRIVGEL SQEFIP+I CDS SAI LAKNPSHHERSKH DVK FH+IR++I+  +V+LVKVHT +NL DMLTK L+AHRFKYLL+ELNVKS
Subjt:  MWLKRIVGELSSQEFIPVICCDSLSAIRLAKNPSHHERSKHNDVKLFHFIRDMISYNEVKLVKVHTNQNLPDMLTKVLTAHRFKYLLNELNVKS

KAA0065713.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.2e-2474.39Show/hide
Query:  MWLKRIVGELSSQEFIPVICCDSLSAIRLAKNPSHHERSKHNDVKLFHFIRDMISYNEVKLVKVHTNQNLPDMLTKVLTAHR
        +WLKRIVGEL SQEFIP+I CDS SAI LAKNPSHHERSKH DVK FH+IR++I+  +V+LVKVHT +NL DMLTK L+AHR
Subjt:  MWLKRIVGELSSQEFIPVICCDSLSAIRLAKNPSHHERSKHNDVKLFHFIRDMISYNEVKLVKVHTNQNLPDMLTKVLTAHR

TYK10339.1 hypothetical protein E5676_scaffold367G00140 [Cucumis melo var. makuwa]4.2e-2870.21Show/hide
Query:  MWLKRIVGELSSQEFIPVICCDSLSAIRLAKNPSHHERSKHNDVKLFHFIRDMISYNEVKLVKVHTNQNLPDMLTKVLTAHRFKYLLNELNVKS
        +WLKRIVGEL SQ+FIP+I C+S SAI LAKNPSHHERSKH D+K F++IR++I+  +V++VKVH  +NL DMLTK L+AHRFKYLL+ELNVKS
Subjt:  MWLKRIVGELSSQEFIPVICCDSLSAIRLAKNPSHHERSKHNDVKLFHFIRDMISYNEVKLVKVHTNQNLPDMLTKVLTAHRFKYLLNELNVKS

TYK13826.1 putative polyprotein [Cucumis melo var. makuwa]9.1e-3176.6Show/hide
Query:  MWLKRIVGELSSQEFIPVICCDSLSAIRLAKNPSHHERSKHNDVKLFHFIRDMISYNEVKLVKVHTNQNLPDMLTKVLTAHRFKYLLNELNVKS
        +WLKRIVGEL SQEFIP+I CDS SAI LAKNPSHHERSKH DVK FH+IR++I+  +V+LVKVHT +NL DMLTK L+AHRFKYLL+ELNVKS
Subjt:  MWLKRIVGELSSQEFIPVICCDSLSAIRLAKNPSHHERSKHNDVKLFHFIRDMISYNEVKLVKVHTNQNLPDMLTKVLTAHRFKYLLNELNVKS

TYK25306.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]1.7e-2455.38Show/hide
Query:  MWLKRIVGELSSQEFIPVICCDSLSAIRLAKNPSHHERSKHNDVKLFHFIRDMISYNEVKLVKVHTNQNLPDMLTKVLTAHR-FKYLLNELNVKSRHIVI
        +WLKRIVGEL SQEFIP+I CDS SAI LAKNPSHHERSKH DVK FH+IR++I+  +V+LVKVHT +NL DMLTK L+AHR  +    E + K R   +
Subjt:  MWLKRIVGELSSQEFIPVICCDSLSAIRLAKNPSHHERSKHNDVKLFHFIRDMISYNEVKLVKVHTNQNLPDMLTKVLTAHR-FKYLLNELNVKSRHIVI

Query:  GNFGLPDASLEIGSKDGPWRRNEPRGRAKA
          FG      E G ++    R    GR  A
Subjt:  GNFGLPDASLEIGSKDGPWRRNEPRGRAKA

TrEMBL top hitse value%identityAlignment
A0A5A7U2U7 Retrotransposon protein, putative, Ty1-copia sub-class1.0e-2474.39Show/hide
Query:  MWLKRIVGELSSQEFIPVICCDSLSAIRLAKNPSHHERSKHNDVKLFHFIRDMISYNEVKLVKVHTNQNLPDMLTKVLTAHR
        +WLKRIVGEL SQEFIP+I CDS SAI LAKNPSHHERSKH DVK FH+IR++I+  +V+LVKVHT +NL DMLTK L+AHR
Subjt:  MWLKRIVGELSSQEFIPVICCDSLSAIRLAKNPSHHERSKHNDVKLFHFIRDMISYNEVKLVKVHTNQNLPDMLTKVLTAHR

A0A5A7UB25 Putative gag-pol polyprotein4.4e-3176.6Show/hide
Query:  MWLKRIVGELSSQEFIPVICCDSLSAIRLAKNPSHHERSKHNDVKLFHFIRDMISYNEVKLVKVHTNQNLPDMLTKVLTAHRFKYLLNELNVKS
        +WLKRIVGEL SQEFIP+I CDS SAI LAKNPSHHERSKH DVK FH+IR++I+  +V+LVKVHT +NL DMLTK L+AHRFKYLL+ELNVKS
Subjt:  MWLKRIVGELSSQEFIPVICCDSLSAIRLAKNPSHHERSKHNDVKLFHFIRDMISYNEVKLVKVHTNQNLPDMLTKVLTAHRFKYLLNELNVKS

A0A5D3CJG1 Integrase catalytic domain-containing protein2.0e-2870.21Show/hide
Query:  MWLKRIVGELSSQEFIPVICCDSLSAIRLAKNPSHHERSKHNDVKLFHFIRDMISYNEVKLVKVHTNQNLPDMLTKVLTAHRFKYLLNELNVKS
        +WLKRIVGEL SQ+FIP+I C+S SAI LAKNPSHHERSKH D+K F++IR++I+  +V++VKVH  +NL DMLTK L+AHRFKYLL+ELNVKS
Subjt:  MWLKRIVGELSSQEFIPVICCDSLSAIRLAKNPSHHERSKHNDVKLFHFIRDMISYNEVKLVKVHTNQNLPDMLTKVLTAHRFKYLLNELNVKS

A0A5D3CTV2 Putative polyprotein4.4e-3176.6Show/hide
Query:  MWLKRIVGELSSQEFIPVICCDSLSAIRLAKNPSHHERSKHNDVKLFHFIRDMISYNEVKLVKVHTNQNLPDMLTKVLTAHRFKYLLNELNVKS
        +WLKRIVGEL SQEFIP+I CDS SAI LAKNPSHHERSKH DVK FH+IR++I+  +V+LVKVHT +NL DMLTK L+AHRFKYLL+ELNVKS
Subjt:  MWLKRIVGELSSQEFIPVICCDSLSAIRLAKNPSHHERSKHNDVKLFHFIRDMISYNEVKLVKVHTNQNLPDMLTKVLTAHRFKYLLNELNVKS

A0A5D3DNU1 Putative gag-pol polyprotein8.0e-2555.38Show/hide
Query:  MWLKRIVGELSSQEFIPVICCDSLSAIRLAKNPSHHERSKHNDVKLFHFIRDMISYNEVKLVKVHTNQNLPDMLTKVLTAHR-FKYLLNELNVKSRHIVI
        +WLKRIVGEL SQEFIP+I CDS SAI LAKNPSHHERSKH DVK FH+IR++I+  +V+LVKVHT +NL DMLTK L+AHR  +    E + K R   +
Subjt:  MWLKRIVGELSSQEFIPVICCDSLSAIRLAKNPSHHERSKHNDVKLFHFIRDMISYNEVKLVKVHTNQNLPDMLTKVLTAHR-FKYLLNELNVKSRHIVI

Query:  GNFGLPDASLEIGSKDGPWRRNEPRGRAKA
          FG      E G ++    R    GR  A
Subjt:  GNFGLPDASLEIGSKDGPWRRNEPRGRAKA

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.9e-0735.48Show/hide
Query:  MWLKRIVGELSSQEFIPV-ICCDSLSAIRLAKNPSHHERSKHNDVKLFHFIRDMISYNEVKLVKVHTNQNLPDMLTKVLTAHRFKYLLNELNV
        +WLK ++  ++ +   P+ I  D+   I +A NPS H+R+KH D+K +HF R+ +  N + L  + T   L D+ TK L A RF  L ++L +
Subjt:  MWLKRIVGELSSQEFIPV-ICCDSLSAIRLAKNPSHHERSKHNDVKLFHFIRDMISYNEVKLVKVHTNQNLPDMLTKVLTAHRFKYLLNELNV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-948.0e-1445.24Show/hide
Query:  MWLKRIVGELSSQEFIPVICCDSLSAIRLAKNPSHHERSKHNDVKLFHFIRDMISYNEVKLVKVHTNQNLPDMLTKVLTAHRFK
        +WLKR + EL   +   V+ CDS SAI L+KN  +H R+KH DV+ +H+IR+M+    +K++K+ TN+N  DMLTKV+  ++F+
Subjt:  MWLKRIVGELSSQEFIPVICCDSLSAIRLAKNPSHHERSKHNDVKLFHFIRDMISYNEVKLVKVHTNQNLPDMLTKVLTAHRFK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.8e-0636.84Show/hide
Query:  PVICCDSLSAIRLAKNPSHHERSKHNDVKLFHFIRDMISYNEVKLVKVHTNQNLPDMLTKVLTAHRFKYLLNELNV
        PVI CD++ A  L  NP  H R KH  +  +HFIR+ +    +++V V T+  L D LTK L+   F+   +++ V
Subjt:  PVICCDSLSAIRLAKNPSHHERSKHNDVKLFHFIRDMISYNEVKLVKVHTNQNLPDMLTKVLTAHRFKYLLNELNV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE29.5e-0734.78Show/hide
Query:  WLKRIVGELSSQ-EFIPVICCDSLSAIRLAKNPSHHERSKHNDVKLFHFIRDMISYNEVKLVKVHTNQNLPDMLTKVLTAHRFKYLLNELNV
        W+  ++ EL  Q    PVI CD++ A  L  NP  H R KH  +  +HFIR+ +    +++V V T+  L D LTK L+   F+    ++ V
Subjt:  WLKRIVGELSSQ-EFIPVICCDSLSAIRLAKNPSHHERSKHNDVKLFHFIRDMISYNEVKLVKVHTNQNLPDMLTKVLTAHRFKYLLNELNV

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGTTGAAGAGAATTGTGGGTGAGTTGTCGTCTCAAGAGTTTATTCCTGTCATCTGTTGTGATAGCCTGAGTGCTATTCGTCTTGCGAAGAATCCATCTCACCATGA
ACGGTCTAAACATAATGATGTTAAGTTGTTTCACTTTATAAGGGATATGATTTCTTATAATGAAGTTAAACTGGTGAAAGTTCATACAAATCAAAACTTGCCAGATATGC
TTACCAAAGTTCTCACAGCTCATAGGTTTAAATACCTGTTAAATGAGTTGAATGTAAAGTCAAGACATATTGTTATAGGTAATTTTGGACTACCCGACGCGAGCCTAGAA
ATAGGATCGAAGGACGGACCCTGGAGAAGAAACGAGCCAAGGGGTCGAGCCAAGGCCAAAGAGATCAGGCTTTTGGTCCGACCCCATGGTCGGCCTCGGCCCGCTTGCGT
GGGCCGAGTTCTTCCAACTCCGTTCAGTCCTTGTTGCCTCTGGCCGCCCCGGTTCTGCCTGGTTCGTCCTGCAACGCCTCCAAATTCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTGGTTGAAGAGAATTGTGGGTGAGTTGTCGTCTCAAGAGTTTATTCCTGTCATCTGTTGTGATAGCCTGAGTGCTATTCGTCTTGCGAAGAATCCATCTCACCATGA
ACGGTCTAAACATAATGATGTTAAGTTGTTTCACTTTATAAGGGATATGATTTCTTATAATGAAGTTAAACTGGTGAAAGTTCATACAAATCAAAACTTGCCAGATATGC
TTACCAAAGTTCTCACAGCTCATAGGTTTAAATACCTGTTAAATGAGTTGAATGTAAAGTCAAGACATATTGTTATAGGTAATTTTGGACTACCCGACGCGAGCCTAGAA
ATAGGATCGAAGGACGGACCCTGGAGAAGAAACGAGCCAAGGGGTCGAGCCAAGGCCAAAGAGATCAGGCTTTTGGTCCGACCCCATGGTCGGCCTCGGCCCGCTTGCGT
GGGCCGAGTTCTTCCAACTCCGTTCAGTCCTTGTTGCCTCTGGCCGCCCCGGTTCTGCCTGGTTCGTCCTGCAACGCCTCCAAATTCCTAA
Protein sequenceShow/hide protein sequence
MWLKRIVGELSSQEFIPVICCDSLSAIRLAKNPSHHERSKHNDVKLFHFIRDMISYNEVKLVKVHTNQNLPDMLTKVLTAHRFKYLLNELNVKSRHIVIGNFGLPDASLE
IGSKDGPWRRNEPRGRAKAKEIRLLVRPHGRPRPACVGRVLPTPFSPCCLWPPRFCLVRPATPPNS