; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011807 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011807
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr1:33194297..33197432
RNA-Seq ExpressionLag0011807
SyntenyLag0011807
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041588 - Integrase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIM97577.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]4.6e-4557.79Show/hide
Query:  ILEACHSAPYRGHFAGLKTAAKVLQSGYFWPSLFKGAHLFTKNCEQCQRTGDLTARSEMPLNFILEVEIFDVWGINFMGSFTPSFGNIYILLAVDYVPKW
        ILE CH++PY GHF G +TAAK+LQSG+FWP+LFK AH F  NC++CQRTG+++ R EMPLN ILEVE+FDVWGI+FMG F PSFGN+YIL+AVDYV KW
Subjt:  ILEACHSAPYRGHFAGLKTAAKVLQSGYFWPSLFKGAHLFTKNCEQCQRTGDLTARSEMPLNFILEVEIFDVWGINFMGSFTPSFGNIYILLAVDYVPKW

Query:  IEV------------------ITTRYETPRFIICDEGSHFLNKVIANLFSKYNI
        +E                   I TR+ TPR II D G+HF N+    L SKY +
Subjt:  IEV------------------ITTRYETPRFIICDEGSHFLNKVIANLFSKYNI

PIN17864.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]2.3e-4459.56Show/hide
Query:  ILEACHSAPYRGHFAGLKTAAKVLQSGYFWPSLFKGAHLFTKNCEQCQRTGDLTARSEMPLNFILEVEIFDVWGINFMGSFTPSFGNIYILLAVDYVPKW
        ILE CH++PY GHF G +TAAK+LQSG+FWP+LFK AH F  NC++CQRTG+++ R EMPLN ILEVE+FDVW I+FMG F PSFGN+YIL+AVDY+ KW
Subjt:  ILEACHSAPYRGHFAGLKTAAKVLQSGYFWPSLFKGAHLFTKNCEQCQRTGDLTARSEMPLNFILEVEIFDVWGINFMGSFTPSFGNIYILLAVDYVPKW

Query:  IEVITTRYETPRFIICDEGSHFLNKVIANLFSKYNI
        +E +       + II D G+HF N+    L SKY +
Subjt:  IEVITTRYETPRFIICDEGSHFLNKVIANLFSKYNI

WP_217833161.1 DDE-type integrase/transposase/recombinase, partial [Synechococcus sp. PCC 7002]6.0e-4556.49Show/hide
Query:  ILEACHSAPYRGHFAGLKTAAKVLQSGYFWPSLFKGAHLFTKNCEQCQRTGDLTARSEMPLNFILEVEIFDVWGINFMGSFTPSFGNIYILLAVDYVPKW
        IL++CH APY GHF G +TAAKVLQSGYFWP+LFK A  +   C++CQRTG+++ R+EMPLN +LEVE+FDVWGI+FMG F PS GN YIL+AVDYV KW
Subjt:  ILEACHSAPYRGHFAGLKTAAKVLQSGYFWPSLFKGAHLFTKNCEQCQRTGDLTARSEMPLNFILEVEIFDVWGINFMGSFTPSFGNIYILLAVDYVPKW

Query:  IEV------------------ITTRYETPRFIICDEGSHFLNKVIANLFSKYNI
        +E                   I +R+ TPR II DEG+HF+N++I NL +K+N+
Subjt:  IEV------------------ITTRYETPRFIICDEGSHFLNKVIANLFSKYNI

XP_022150300.1 uncharacterized protein K02A2.6-like [Momordica charantia]3.9e-4459.86Show/hide
Query:  APYRGHFAGLKTAAKVLQSGYFWPSLFKGAHLFTKNCEQCQRTGDLTARSEMPLNFILEVEIFDVWGINFMGSFTPSFGNIYILLAVDYVPKWIEVIT--
        +PY GHFAG KTAAKVLQSG+FWP+LF+ AH F + C++CQRTG+++  +EMPLN ILEVEIFDVWGI+FMG F PS+G  YILLAVDYV KWIEV+   
Subjt:  APYRGHFAGLKTAAKVLQSGYFWPSLFKGAHLFTKNCEQCQRTGDLTARSEMPLNFILEVEIFDVWGINFMGSFTPSFGNIYILLAVDYVPKWIEVIT--

Query:  ----------------TRYETPRFIICDEGSHFLNKVIANLFSKYNI
                        TRY TP+++I DEG+HFLN+ I +L SKYN+
Subjt:  ----------------TRYETPRFIICDEGSHFLNKVIANLFSKYNI

XP_022152369.1 uncharacterized protein K02A2.6-like [Momordica charantia]2.2e-4761.04Show/hide
Query:  ILEACHSAPYRGHFAGLKTAAKVLQSGYFWPSLFKGAHLFTKNCEQCQRTGDLTARSEMPLNFILEVEIFDVWGINFMGSFTPSFGNIYILLAVDYVPKW
        ILEACH  PY GHFAG KTAAKVLQSG+FWPS+F+ AH+FT+ C  CQR+G+LT + EMPLN ILEVE+FDVWGINFMG F PSFG  YILLAVDYV KW
Subjt:  ILEACHSAPYRGHFAGLKTAAKVLQSGYFWPSLFKGAHLFTKNCEQCQRTGDLTARSEMPLNFILEVEIFDVWGINFMGSFTPSFGNIYILLAVDYVPKW

Query:  IEV------------------ITTRYETPRFIICDEGSHFLNKVIANLFSKYNI
        +E                   I TRY T + +I DEG+HFLN+V++ L   YNI
Subjt:  IEV------------------ITTRYETPRFIICDEGSHFLNKVIANLFSKYNI

TrEMBL top hitse value%identityAlignment
A0A2G9FWY3 Reverse transcriptase2.2e-4557.79Show/hide
Query:  ILEACHSAPYRGHFAGLKTAAKVLQSGYFWPSLFKGAHLFTKNCEQCQRTGDLTARSEMPLNFILEVEIFDVWGINFMGSFTPSFGNIYILLAVDYVPKW
        ILE CH++PY GHF G +TAAK+LQSG+FWP+LFK AH F  NC++CQRTG+++ R EMPLN ILEVE+FDVWGI+FMG F PSFGN+YIL+AVDYV KW
Subjt:  ILEACHSAPYRGHFAGLKTAAKVLQSGYFWPSLFKGAHLFTKNCEQCQRTGDLTARSEMPLNFILEVEIFDVWGINFMGSFTPSFGNIYILLAVDYVPKW

Query:  IEV------------------ITTRYETPRFIICDEGSHFLNKVIANLFSKYNI
        +E                   I TR+ TPR II D G+HF N+    L SKY +
Subjt:  IEV------------------ITTRYETPRFIICDEGSHFLNKVIANLFSKYNI

A0A2G9HK33 Reverse transcriptase1.1e-4459.56Show/hide
Query:  ILEACHSAPYRGHFAGLKTAAKVLQSGYFWPSLFKGAHLFTKNCEQCQRTGDLTARSEMPLNFILEVEIFDVWGINFMGSFTPSFGNIYILLAVDYVPKW
        ILE CH++PY GHF G +TAAK+LQSG+FWP+LFK AH F  NC++CQRTG+++ R EMPLN ILEVE+FDVW I+FMG F PSFGN+YIL+AVDY+ KW
Subjt:  ILEACHSAPYRGHFAGLKTAAKVLQSGYFWPSLFKGAHLFTKNCEQCQRTGDLTARSEMPLNFILEVEIFDVWGINFMGSFTPSFGNIYILLAVDYVPKW

Query:  IEVITTRYETPRFIICDEGSHFLNKVIANLFSKYNI
        +E +       + II D G+HF N+    L SKY +
Subjt:  IEVITTRYETPRFIICDEGSHFLNKVIANLFSKYNI

A0A6J1D844 uncharacterized protein K02A2.6-like1.9e-4459.86Show/hide
Query:  APYRGHFAGLKTAAKVLQSGYFWPSLFKGAHLFTKNCEQCQRTGDLTARSEMPLNFILEVEIFDVWGINFMGSFTPSFGNIYILLAVDYVPKWIEVIT--
        +PY GHFAG KTAAKVLQSG+FWP+LF+ AH F + C++CQRTG+++  +EMPLN ILEVEIFDVWGI+FMG F PS+G  YILLAVDYV KWIEV+   
Subjt:  APYRGHFAGLKTAAKVLQSGYFWPSLFKGAHLFTKNCEQCQRTGDLTARSEMPLNFILEVEIFDVWGINFMGSFTPSFGNIYILLAVDYVPKWIEVIT--

Query:  ----------------TRYETPRFIICDEGSHFLNKVIANLFSKYNI
                        TRY TP+++I DEG+HFLN+ I +L SKYN+
Subjt:  ----------------TRYETPRFIICDEGSHFLNKVIANLFSKYNI

A0A6J1DFT9 uncharacterized protein K02A2.6-like1.1e-4761.04Show/hide
Query:  ILEACHSAPYRGHFAGLKTAAKVLQSGYFWPSLFKGAHLFTKNCEQCQRTGDLTARSEMPLNFILEVEIFDVWGINFMGSFTPSFGNIYILLAVDYVPKW
        ILEACH  PY GHFAG KTAAKVLQSG+FWPS+F+ AH+FT+ C  CQR+G+LT + EMPLN ILEVE+FDVWGINFMG F PSFG  YILLAVDYV KW
Subjt:  ILEACHSAPYRGHFAGLKTAAKVLQSGYFWPSLFKGAHLFTKNCEQCQRTGDLTARSEMPLNFILEVEIFDVWGINFMGSFTPSFGNIYILLAVDYVPKW

Query:  IEV------------------ITTRYETPRFIICDEGSHFLNKVIANLFSKYNI
        +E                   I TRY T + +I DEG+HFLN+V++ L   YNI
Subjt:  IEV------------------ITTRYETPRFIICDEGSHFLNKVIANLFSKYNI

A0A6J1E3L7 uncharacterized protein LOC1110257543.2e-4457.79Show/hide
Query:  ILEACHSAPYRGHFAGLKTAAKVLQSGYFWPSLFKGAHLFTKNCEQCQRTGDLTARSEMPLNFILEVEIFDVWGINFMGSFTPSFGNIYILLAVDYVPKW
        IL+ACH +PY GHFAG KTAAKVLQSG+FWP+LFK A+ F + C++CQRTG+++ R EMP   ILE+EIFDVWGI+FMGSFT S G +YILLAVDYV KW
Subjt:  ILEACHSAPYRGHFAGLKTAAKVLQSGYFWPSLFKGAHLFTKNCEQCQRTGDLTARSEMPLNFILEVEIFDVWGINFMGSFTPSFGNIYILLAVDYVPKW

Query:  IEVIT------------------TRYETPRFIICDEGSHFLNKVIANLFSKYNI
        IE I                   TRY TPR +I D G+HF+N+ +  L +KYN+
Subjt:  IEVIT------------------TRYETPRFIICDEGSHFLNKVIANLFSKYNI

SwissProt top hitse value%identityAlignment
P14350 Pro-Pol polyprotein9.1e-0424.54Show/hide
Query:  LARPTSTKLY--GSTKVPFILEACHSAPYRGHFAGLKTAAKVLQSGYFWPSLFKGAHLFTKNCEQCQRTGDLTARSEMPLNFILEVEIFDVWGINFMGSF
        ++RP   K+    S +   +L+A H+  + G  A L   A +    Y+WP++ K        C+QC  T      S   L      + FD + I+++G  
Subjt:  LARPTSTKLY--GSTKVPFILEACHSAPYRGHFAGLKTAAKVLQSGYFWPSLFKGAHLFTKNCEQCQRTGDLTARSEMPLNFILEVEIFDVWGINFMGSF

Query:  TPSFGNIYILLAVDYVP--KWIEV--------------ITTRYETPRFIICDEGSHFLNKVIA
         PS G +Y+L+ VD +    W+                + T    P+ I  D+G+ F +   A
Subjt:  TPSFGNIYILLAVDYVP--KWIEV--------------ITTRYETPRFIICDEGSHFLNKVIA

P92516 Uncharacterized mitochondrial protein AtMg007504.7e-1662.5Show/hide
Query:  VLQSGYFWPSLFKGAHLFTKNCEQCQRTGDLTARSEMPLNFILEVEIFDVWGINFM
        VLQ+G++WP+ FK AH F  +C+ CQR G+ T R+EMP +FILEVE+FDVWGI FM
Subjt:  VLQSGYFWPSLFKGAHLFTKNCEQCQRTGDLTARSEMPLNFILEVEIFDVWGINFM

Arabidopsis top hitse value%identityAlignment
ATMG00750.1 GAG/POL/ENV polyprotein3.3e-1762.5Show/hide
Query:  VLQSGYFWPSLFKGAHLFTKNCEQCQRTGDLTARSEMPLNFILEVEIFDVWGINFM
        VLQ+G++WP+ FK AH F  +C+ CQR G+ T R+EMP +FILEVE+FDVWGI FM
Subjt:  VLQSGYFWPSLFKGAHLFTKNCEQCQRTGDLTARSEMPLNFILEVEIFDVWGINFM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATGGGCATCTGGATTACTTCGTTTGTTTTGCGGTAATGTTTTTAGGAATTTGGAGGCGTTTCGGGACAAACCAAGCGGAACCGGGGCGGCCAGAGGCGGTAGGGA
CCAAACGGAGCTAGACGAGCTCGACCGAGACCGAGCACGGGGTCCGGCCAAAAGCCCGACCCCTTCGGTTTTGGCTCGTCCTACTTCTACAAAATTATATGGAAGTACCA
AAGTACCATTCATTTTGGAAGCCTGTCACTCGGCTCCATATAGAGGGCACTTCGCCGGCCTCAAAACTGCAGCCAAAGTCCTTCAATCAGGGTACTTTTGGCCGAGTCTT
TTCAAGGGTGCACATCTCTTTACTAAGAACTGCGAACAGTGCCAGAGAACTGGGGATTTAACAGCTAGAAGTGAAATGCCACTAAACTTCATACTTGAAGTGGAAATCTT
TGATGTATGGGGAATAAATTTCATGGGCTCATTCACGCCGTCCTTTGGAAATATCTATATTCTGCTTGCAGTTGATTATGTACCAAAGTGGATCGAGGTCATAACCACAA
GATATGAGACTCCTCGATTCATCATATGCGATGAAGGATCACATTTTCTGAACAAGGTGATAGCCAACCTGTTTTCCAAATATAACATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAATGGGCATCTGGATTACTTCGTTTGTTTTGCGGTAATGTTTTTAGGAATTTGGAGGCGTTTCGGGACAAACCAAGCGGAACCGGGGCGGCCAGAGGCGGTAGGGA
CCAAACGGAGCTAGACGAGCTCGACCGAGACCGAGCACGGGGTCCGGCCAAAAGCCCGACCCCTTCGGTTTTGGCTCGTCCTACTTCTACAAAATTATATGGAAGTACCA
AAGTACCATTCATTTTGGAAGCCTGTCACTCGGCTCCATATAGAGGGCACTTCGCCGGCCTCAAAACTGCAGCCAAAGTCCTTCAATCAGGGTACTTTTGGCCGAGTCTT
TTCAAGGGTGCACATCTCTTTACTAAGAACTGCGAACAGTGCCAGAGAACTGGGGATTTAACAGCTAGAAGTGAAATGCCACTAAACTTCATACTTGAAGTGGAAATCTT
TGATGTATGGGGAATAAATTTCATGGGCTCATTCACGCCGTCCTTTGGAAATATCTATATTCTGCTTGCAGTTGATTATGTACCAAAGTGGATCGAGGTCATAACCACAA
GATATGAGACTCCTCGATTCATCATATGCGATGAAGGATCACATTTTCTGAACAAGGTGATAGCCAACCTGTTTTCCAAATATAACATTTGA
Protein sequenceShow/hide protein sequence
MEWASGLLRLFCGNVFRNLEAFRDKPSGTGAARGGRDQTELDELDRDRARGPAKSPTPSVLARPTSTKLYGSTKVPFILEACHSAPYRGHFAGLKTAAKVLQSGYFWPSL
FKGAHLFTKNCEQCQRTGDLTARSEMPLNFILEVEIFDVWGINFMGSFTPSFGNIYILLAVDYVPKWIEVITTRYETPRFIICDEGSHFLNKVIANLFSKYNI