; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011731 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011731
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr1:31771426..31778532
RNA-Seq ExpressionLag0011731
SyntenyLag0011731
Gene Ontology termsGO:0006310 - DNA recombination (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0032508 - DNA duplex unwinding (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0015074 - DNA integration (biological process)
GO:0006508 - proteolysis (biological process)
GO:0000723 - telomere maintenance (biological process)
GO:0006281 - DNA repair (biological process)
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0005524 - ATP binding (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0016301 - kinase activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0003678 - DNA helicase activity (molecular function)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAW22873.1 putative polyprotein [Solanum lycopersicum]7.0e-2876.74Show/hide
Query:  MLWLKRFLQELGLKQHEYVVCCDSQSAMDLSKNSMYHARTKYIDIRYHWLRYEIEEKRMKLKKVHIDKNSADVLTKVVLRSKIEPC
        MLWLKRFLQELGLKQ EYVV CDS+SAMDLSKN MYHARTK+ID+RYHWLR  I+E+ MKLKK+H +KN AD+LTKVV  SK+E C
Subjt:  MLWLKRFLQELGLKQHEYVVCCDSQSAMDLSKNSMYHARTKYIDIRYHWLRYEIEEKRMKLKKVHIDKNSADVLTKVVLRSKIEPC

CAA2995030.1 Retrovirus-related Pol poly from transposon TNT 1-94 [Olea europaea subsp. europaea]3.2e-2565.52Show/hide
Query:  MLWLKRFLQELGLKQHEYVVCCDSQSAMDLSKNSMYHARTKYIDIRYHWLRYEIEEKRMKLKKVHIDKNSADVLTKVVLRSKIEPCR
        MLWLKRFLQELG+KQ +Y V CD+QSA+DLSKNSMYH+RTK+IDIRYHW+R  ++++ ++L K+H ++N AD+LTKVV+R K+E CR
Subjt:  MLWLKRFLQELGLKQHEYVVCCDSQSAMDLSKNSMYHARTKYIDIRYHWLRYEIEEKRMKLKKVHIDKNSADVLTKVVLRSKIEPCR

KYP66486.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]5.0e-2666.28Show/hide
Query:  MLWLKRFLQELGLKQHEYVVCCDSQSAMDLSKNSMYHARTKYIDIRYHWLRYEIEEKRMKLKKVHIDKNSADVLTKVVLRSKIEPC
        MLW+K+FLQELGLKQ EY+V CDSQSA+DLSKN+MYH+RTK+ID+RYHW+R  IE++  +LKK+H D+N+AD+LTK V R K+  C
Subjt:  MLWLKRFLQELGLKQHEYVVCCDSQSAMDLSKNSMYHARTKYIDIRYHWLRYEIEEKRMKLKKVHIDKNSADVLTKVVLRSKIEPC

KZV32632.1 putative LRR receptor-like serine/threonine-protein kinase [Dorcoceras hygrometricum]5.5e-2561.22Show/hide
Query:  MLWLKRFLQELGLKQHEYVVCCDSQSAMDLSKNSMYHARTKYIDIRYHWLRYEIEEKRMKLKKVHIDKNSADVLTKVVLRSKIEPCRASIFGVLQQDL
        MLWLKR+LQE G+KQ +Y V CDSQSA+DLSKNSMYH+RTK+IDIRYHW+R  ++ + ++L K+H  +N AD+LTKVV R K+E CR  I G+L  D+
Subjt:  MLWLKRFLQELGLKQHEYVVCCDSQSAMDLSKNSMYHARTKYIDIRYHWLRYEIEEKRMKLKKVHIDKNSADVLTKVVLRSKIEPCRASIFGVLQQDL

OMO65301.1 DNA helicase PIF1, ATP-dependent [Corchorus capsularis]2.9e-2670.93Show/hide
Query:  MLWLKRFLQELGLKQHEYVVCCDSQSAMDLSKNSMYHARTKYIDIRYHWLRYEIEEKRMKLKKVHIDKNSADVLTKVVLRSKIEPC
        MLW+KRFL ELGL Q EYVV C+SQSA+DLSKN+MY ARTK+ID+RYHWLR   E+K+++LKK+HIDKN AD+LTKVV R K+E C
Subjt:  MLWLKRFLQELGLKQHEYVVCCDSQSAMDLSKNSMYHARTKYIDIRYHWLRYEIEEKRMKLKKVHIDKNSADVLTKVVLRSKIEPC

TrEMBL top hitse value%identityAlignment
A0A2N9EH16 Integrase catalytic domain-containing protein4.9e-2768.54Show/hide
Query:  MLWLKRFLQELGLKQHEYVVCCDSQSAMDLSKNSMYHARTKYIDIRYHWLRYEIEEKRMKLKKVHIDKNSADVLTKVVLRSKIEPCRAS
        MLW+KRFLQ+LGLKQ EYVV CDSQSA+DLSKNS YH+RTK+ID+RYHWLR  I+++ M+L+K+H DKN AD+LTKVV + K+E C  S
Subjt:  MLWLKRFLQELGLKQHEYVVCCDSQSAMDLSKNSMYHARTKYIDIRYHWLRYEIEEKRMKLKKVHIDKNSADVLTKVVLRSKIEPCRAS

A0A2N9ETM3 Uncharacterized protein4.9e-2767.42Show/hide
Query:  MLWLKRFLQELGLKQHEYVVCCDSQSAMDLSKNSMYHARTKYIDIRYHWLRYEIEEKRMKLKKVHIDKNSADVLTKVVLRSKIEPCRAS
        MLW+KRFLQ+LGLKQ EYVV CDSQSA+DLSKNS YH+RTK+ID+RYHWLR  ++++ M+L+K+H DKN AD+LTKVV + K+E C  S
Subjt:  MLWLKRFLQELGLKQHEYVVCCDSQSAMDLSKNSMYHARTKYIDIRYHWLRYEIEEKRMKLKKVHIDKNSADVLTKVVLRSKIEPCRAS

A0A2N9FZK6 Uncharacterized protein6.4e-2767.42Show/hide
Query:  MLWLKRFLQELGLKQHEYVVCCDSQSAMDLSKNSMYHARTKYIDIRYHWLRYEIEEKRMKLKKVHIDKNSADVLTKVVLRSKIEPCRAS
        MLW+KRFLQ+LGLKQ EYVV CDSQSA+DLSKNS YH+RTK+ID+RYHWLR  ++++ M+L+K+H DKN AD+LTKVV + K+E C  S
Subjt:  MLWLKRFLQELGLKQHEYVVCCDSQSAMDLSKNSMYHARTKYIDIRYHWLRYEIEEKRMKLKKVHIDKNSADVLTKVVLRSKIEPCRAS

A0A2N9IPN0 Integrase catalytic domain-containing protein6.4e-2767.42Show/hide
Query:  MLWLKRFLQELGLKQHEYVVCCDSQSAMDLSKNSMYHARTKYIDIRYHWLRYEIEEKRMKLKKVHIDKNSADVLTKVVLRSKIEPCRAS
        MLW+KRFLQ+LGLKQ EYVV CDSQSA+DLSKNS YH+RTK+ID+RYHWLR  ++++ M+L+K+H DKN AD+LTKVV + K+E C  S
Subjt:  MLWLKRFLQELGLKQHEYVVCCDSQSAMDLSKNSMYHARTKYIDIRYHWLRYEIEEKRMKLKKVHIDKNSADVLTKVVLRSKIEPCRAS

Q5GA69 Putative polyprotein3.4e-2876.74Show/hide
Query:  MLWLKRFLQELGLKQHEYVVCCDSQSAMDLSKNSMYHARTKYIDIRYHWLRYEIEEKRMKLKKVHIDKNSADVLTKVVLRSKIEPC
        MLWLKRFLQELGLKQ EYVV CDS+SAMDLSKN MYHARTK+ID+RYHWLR  I+E+ MKLKK+H +KN AD+LTKVV  SK+E C
Subjt:  MLWLKRFLQELGLKQHEYVVCCDSQSAMDLSKNSMYHARTKYIDIRYHWLRYEIEEKRMKLKKVHIDKNSADVLTKVVLRSKIEPC

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.0e-0528.87Show/hide
Query:  LWLKRFLQELGLK-QHEYVVCCDSQSAMDLSKNSMYHARTKYIDIRYHWLRYEIEEKRMKLKKVHIDKNSADVLTKVVLRSKIEPCRASIFGVLQQD
        LWLK  L  + +K ++   +  D+Q  + ++ N   H R K+IDI+YH+ R +++   + L+ +  +   AD+ TK +  ++    R  + G+LQ D
Subjt:  LWLKRFLQELGLK-QHEYVVCCDSQSAMDLSKNSMYHARTKYIDIRYHWLRYEIEEKRMKLKKVHIDKNSADVLTKVVLRSKIEPCRASIFGVLQQD

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.8e-2765.52Show/hide
Query:  MLWLKRFLQELGLKQHEYVVCCDSQSAMDLSKNSMYHARTKYIDIRYHWLRYEIEEKRMKLKKVHIDKNSADVLTKVVLRSKIEPCR
        M+WLKRFLQELGL Q EYVV CDSQSA+DLSKNSMYHARTK+ID+RYHW+R  ++++ +K+ K+  ++N AD+LTKVV R+K E C+
Subjt:  MLWLKRFLQELGLKQHEYVVCCDSQSAMDLSKNSMYHARTKYIDIRYHWLRYEIEEKRMKLKKVHIDKNSADVLTKVVLRSKIEPCR

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.6e-0531.76Show/hide
Query:  MLWLKRFLQELGLK-QHEYVVCCDSQSAMDLSKNSMYHARTKYIDIRYHWLRYEIEEKRMKLKKVHIDKNSADVLTKVVLRSKIE
        M W+   L ELG++     V+ CD+  A  L  N ++H+R K+I I YH++R +++   +++  V      AD LTK + R+  +
Subjt:  MLWLKRFLQELGLK-QHEYVVCCDSQSAMDLSKNSMYHARTKYIDIRYHWLRYEIEEKRMKLKKVHIDKNSADVLTKVVLRSKIE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.9e-0532.91Show/hide
Query:  WLKRFLQELGLK-QHEYVVCCDSQSAMDLSKNSMYHARTKYIDIRYHWLRYEIEEKRMKLKKVHIDKNSADVLTKVVLR
        W+   L ELG++  H  V+ CD+  A  L  N ++H+R K+I + YH++R +++   +++  V      AD LTK + R
Subjt:  WLKRFLQELGLK-QHEYVVCCDSQSAMDLSKNSMYHARTKYIDIRYHWLRYEIEEKRMKLKKVHIDKNSADVLTKVVLR

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTATGGCTTAAAAGGTTCCTTCAAGAGTTGGGTTTGAAGCAACATGAGTATGTAGTATGTTGTGATAGTCAGAGTGCTATGGATTTGAGCAAGAATTCAATGTATCA
TGCTCGCACCAAATACATTGACATTAGATATCATTGGTTGCGATATGAGATTGAAGAGAAGAGAATGAAACTGAAGAAAGTTCACATTGACAAGAATAGTGCAGATGTGT
TGACTAAAGTAGTTCTTAGAAGCAAGATTGAGCCTTGTAGAGCTTCGATCTTCGGAGTCTTGCAGCAGGATCTGGACTTCAATCTTCAAGATTTTTGCAGGAGAAATGAA
GCTTCAATATTCAAGTCTTGTAGAAGGAATAAAGCTTCAATCGTCAAGTCTTCCAGCACCCCGAAACGCGCTGCCGCCGTCCCTGCTCGAGCGCCGCCCTCCCCCTCGTG
CGCGATCTCCCTTCCCCACGCCGTCTTCGTGGGTTTTTTCGTCGAGAAGTGGAAAAATACTCGTGTGGGGTTTAGACTCCGTTTTGGATCGTTATGGCGTCGCTTAGCGA
TTTCGGTGAGTTTAAGCGCTTACCCATTCTTGCGCTGTCTAAGTGATCGATTGGGTTCGAATCACTATAAGCTCGAATACCCACTACCCAAGGATCGTTCTAGTGCGTTG
TTCGAGCGCCGTAAAAGTGTTCGATTGAGTTCGAATCACTTAAAACTTGAATACCCATTGTCCAAGGAGAATTCTAACACGTTGTTCGAGAGCGTGACTCGCAAATCACG
TGTTAAGAGTGCGTTGCATGGCGCGGAATTCATGTGTATTGATGATGCAGGTGTTGGCGAATATTATGAGCCTGGTGGTGCGGACGAGGAGGCACACGAGGCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTATGGCTTAAAAGGTTCCTTCAAGAGTTGGGTTTGAAGCAACATGAGTATGTAGTATGTTGTGATAGTCAGAGTGCTATGGATTTGAGCAAGAATTCAATGTATCA
TGCTCGCACCAAATACATTGACATTAGATATCATTGGTTGCGATATGAGATTGAAGAGAAGAGAATGAAACTGAAGAAAGTTCACATTGACAAGAATAGTGCAGATGTGT
TGACTAAAGTAGTTCTTAGAAGCAAGATTGAGCCTTGTAGAGCTTCGATCTTCGGAGTCTTGCAGCAGGATCTGGACTTCAATCTTCAAGATTTTTGCAGGAGAAATGAA
GCTTCAATATTCAAGTCTTGTAGAAGGAATAAAGCTTCAATCGTCAAGTCTTCCAGCACCCCGAAACGCGCTGCCGCCGTCCCTGCTCGAGCGCCGCCCTCCCCCTCGTG
CGCGATCTCCCTTCCCCACGCCGTCTTCGTGGGTTTTTTCGTCGAGAAGTGGAAAAATACTCGTGTGGGGTTTAGACTCCGTTTTGGATCGTTATGGCGTCGCTTAGCGA
TTTCGGTGAGTTTAAGCGCTTACCCATTCTTGCGCTGTCTAAGTGATCGATTGGGTTCGAATCACTATAAGCTCGAATACCCACTACCCAAGGATCGTTCTAGTGCGTTG
TTCGAGCGCCGTAAAAGTGTTCGATTGAGTTCGAATCACTTAAAACTTGAATACCCATTGTCCAAGGAGAATTCTAACACGTTGTTCGAGAGCGTGACTCGCAAATCACG
TGTTAAGAGTGCGTTGCATGGCGCGGAATTCATGTGTATTGATGATGCAGGTGTTGGCGAATATTATGAGCCTGGTGGTGCGGACGAGGAGGCACACGAGGCGTGA
Protein sequenceShow/hide protein sequence
MLWLKRFLQELGLKQHEYVVCCDSQSAMDLSKNSMYHARTKYIDIRYHWLRYEIEEKRMKLKKVHIDKNSADVLTKVVLRSKIEPCRASIFGVLQQDLDFNLQDFCRRNE
ASIFKSCRRNKASIVKSSSTPKRAAAVPARAPPSPSCAISLPHAVFVGFFVEKWKNTRVGFRLRFGSLWRRLAISVSLSAYPFLRCLSDRLGSNHYKLEYPLPKDRSSAL
FERRKSVRLSSNHLKLEYPLSKENSNTLFESVTRKSRVKSALHGAEFMCIDDAGVGEYYEPGGADEEAHEA