CuGenDBv2

Lag0011678 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0011678
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Integrase
Genome location: chr1:30602721..30606695
RNA-Seq Expression: Lag0011678
Synteny: Lag0011678
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology
GenBank top hits (e value, % identity, alignment)
KAA0038926.1 integrase [Cucumis melo var. makuwa] | e value: 2.7e-34 | % identity: 67.23
Query:  MASSSSSSPRPTSVEETSPRKTRSIQEIYDASRRMTEDDHVDFALFADVDPIHFEEAIQDEKWKAVMDQEIDAIRRNETWELVQLPESRKTLGVKWVSRT
        M SSSSS+   TS +E SPR+ RSIQEIY+ + R+ +D   +FALFA VDP+ F+EAIQDEKWK  MDQEIDAIRRNETWEL++LP +++ LGVKWV RT
Subjt:  MASSSSSSPRPTSVEETSPRKTRSIQEIYDASRRMTEDDHVDFALFADVDPIHFEEAIQDEKWKAVMDQEIDAIRRNETWELVQLPESRKTLGVKWVSRT

Query:  KLKQNGEVEKYKARLIVKG
        KLK +G VEKYKARL+VKG
Subjt:  KLKQNGEVEKYKARLIVKG

KAA0039947.1 integrase [Cucumis melo var. makuwa] | e value: 2.7e-34 | % identity: 67.23
Query:  MASSSSSSPRPTSVEETSPRKTRSIQEIYDASRRMTEDDHVDFALFADVDPIHFEEAIQDEKWKAVMDQEIDAIRRNETWELVQLPESRKTLGVKWVSRT
        M SSSSS+   TS +E SPR+ RSIQEIY+ + R+ +D   +FALFA VDP+ F+EAIQDEKWK  MDQEIDAIRRNETWEL++LP +++ LGVKWV RT
Subjt:  MASSSSSSPRPTSVEETSPRKTRSIQEIYDASRRMTEDDHVDFALFADVDPIHFEEAIQDEKWKAVMDQEIDAIRRNETWELVQLPESRKTLGVKWVSRT

Query:  KLKQNGEVEKYKARLIVKG
        KLK +G VEKYKARL+VKG
Subjt:  KLKQNGEVEKYKARLIVKG

KAA0051300.1 integrase [Cucumis melo var. makuwa] | e value: 9.4e-35 | % identity: 68.38
Query:  SSSSSSPRPTSVEETSPRKTRSIQEIYDASRRMTEDDHVDFALFADVDPIHFEEAIQDEKWKAVMDQEIDAIRRNETWELVQLPESRKTLGVKWVSRTKL
        SSSSS+  PTS +E SPR+ RSIQEIY+ + R+ +D   +FALFA VDP+ F+EAIQDEKWK  MDQEIDAIRRNETWEL++LP +++ LGVKWV RTKL
Subjt:  SSSSSSPRPTSVEETSPRKTRSIQEIYDASRRMTEDDHVDFALFADVDPIHFEEAIQDEKWKAVMDQEIDAIRRNETWELVQLPESRKTLGVKWVSRTKL

Query:  KQNGEVEKYKARLIVKG
        K +G VEKYKARL+VKG
Subjt:  KQNGEVEKYKARLIVKG

KAA0060377.1 integrase [Cucumis melo var. makuwa] | e value: 2.7e-34 | % identity: 67.23
Query:  MASSSSSSPRPTSVEETSPRKTRSIQEIYDASRRMTEDDHVDFALFADVDPIHFEEAIQDEKWKAVMDQEIDAIRRNETWELVQLPESRKTLGVKWVSRT
        M SSSSS+   TS +E SPR+ RSIQEIY+ + R+ +D   +FALFA VDP+ F+EAIQDEKWK  MDQEIDAIRRNETWEL++LP +++ LGVKWV RT
Subjt:  MASSSSSSPRPTSVEETSPRKTRSIQEIYDASRRMTEDDHVDFALFADVDPIHFEEAIQDEKWKAVMDQEIDAIRRNETWELVQLPESRKTLGVKWVSRT

Query:  KLKQNGEVEKYKARLIVKG
        KLK +G VEKYKARL+VKG
Subjt:  KLKQNGEVEKYKARLIVKG

KAA0060708.1 integrase [Cucumis melo var. makuwa] | e value: 1.2e-34 | % identity: 67.23
Query:  MASSSSSSPRPTSVEETSPRKTRSIQEIYDASRRMTEDDHVDFALFADVDPIHFEEAIQDEKWKAVMDQEIDAIRRNETWELVQLPESRKTLGVKWVSRT
        M SSSSS+P  TS +E SPR+ RSIQEIY+ + R+ +D   +FALFA VDP+ F+EAIQDEKW  VMDQEIDA+RRNETWEL++LP +++ LGVKWV RT
Subjt:  MASSSSSSPRPTSVEETSPRKTRSIQEIYDASRRMTEDDHVDFALFADVDPIHFEEAIQDEKWKAVMDQEIDAIRRNETWELVQLPESRKTLGVKWVSRT

Query:  KLKQNGEVEKYKARLIVKG
        KLK +G VEKYKARL+VKG
Subjt:  KLKQNGEVEKYKARLIVKG

TrEMBL top hits (e value, % identity, alignment)
A0A5A7U800 Integrase | e value: 4.6e-35 | % identity: 68.38
Query:  SSSSSSPRPTSVEETSPRKTRSIQEIYDASRRMTEDDHVDFALFADVDPIHFEEAIQDEKWKAVMDQEIDAIRRNETWELVQLPESRKTLGVKWVSRTKL
        SSSSS+  PTS +E SPR+ RSIQEIY+ + R+ +D   +FALFA VDP+ F+EAIQDEKWK  MDQEIDAIRRNETWEL++LP +++ LGVKWV RTKL
Subjt:  SSSSSSPRPTSVEETSPRKTRSIQEIYDASRRMTEDDHVDFALFADVDPIHFEEAIQDEKWKAVMDQEIDAIRRNETWELVQLPESRKTLGVKWVSRTKL

Query:  KQNGEVEKYKARLIVKG
        K +G VEKYKARL+VKG
Subjt:  KQNGEVEKYKARLIVKG

A0A5A7UZJ8 Integrase | e value: 1.3e-34 | % identity: 67.23
Query:  MASSSSSSPRPTSVEETSPRKTRSIQEIYDASRRMTEDDHVDFALFADVDPIHFEEAIQDEKWKAVMDQEIDAIRRNETWELVQLPESRKTLGVKWVSRT
        M SSSSS+   TS +E SPR+ RSIQEIY+ + R+ +D   +FALFA VDP+ F+EAIQDEKWK  MDQEIDAIRRNETWEL++LP +++ LGVKWV RT
Subjt:  MASSSSSSPRPTSVEETSPRKTRSIQEIYDASRRMTEDDHVDFALFADVDPIHFEEAIQDEKWKAVMDQEIDAIRRNETWELVQLPESRKTLGVKWVSRT

Query:  KLKQNGEVEKYKARLIVKG
        KLK +G VEKYKARL+VKG
Subjt:  KLKQNGEVEKYKARLIVKG

A0A5A7UZM3 Integrase | e value: 6.0e-35 | % identity: 67.23
Query:  MASSSSSSPRPTSVEETSPRKTRSIQEIYDASRRMTEDDHVDFALFADVDPIHFEEAIQDEKWKAVMDQEIDAIRRNETWELVQLPESRKTLGVKWVSRT
        M SSSSS+P  TS +E SPR+ RSIQEIY+ + R+ +D   +FALFA VDP+ F+EAIQDEKW  VMDQEIDA+RRNETWEL++LP +++ LGVKWV RT
Subjt:  MASSSSSSPRPTSVEETSPRKTRSIQEIYDASRRMTEDDHVDFALFADVDPIHFEEAIQDEKWKAVMDQEIDAIRRNETWELVQLPESRKTLGVKWVSRT

Query:  KLKQNGEVEKYKARLIVKG
        KLK +G VEKYKARL+VKG
Subjt:  KLKQNGEVEKYKARLIVKG

A0A5D3BQ81 Integrase | e value: 1.3e-34 | % identity: 67.23
Query:  MASSSSSSPRPTSVEETSPRKTRSIQEIYDASRRMTEDDHVDFALFADVDPIHFEEAIQDEKWKAVMDQEIDAIRRNETWELVQLPESRKTLGVKWVSRT
        M SSSSS+   TS +E SPR+ RSIQEIY+ + R+ +D   +FALFA VDP+ F+EAIQDEKWK  MDQEIDAIRRNETWEL++LP +++ LGVKWV RT
Subjt:  MASSSSSSPRPTSVEETSPRKTRSIQEIYDASRRMTEDDHVDFALFADVDPIHFEEAIQDEKWKAVMDQEIDAIRRNETWELVQLPESRKTLGVKWVSRT

Query:  KLKQNGEVEKYKARLIVKG
        KLK +G VEKYKARL+VKG
Subjt:  KLKQNGEVEKYKARLIVKG

A0A5D3E3T2 Integrase | e value: 1.3e-34 | % identity: 67.23
Query:  MASSSSSSPRPTSVEETSPRKTRSIQEIYDASRRMTEDDHVDFALFADVDPIHFEEAIQDEKWKAVMDQEIDAIRRNETWELVQLPESRKTLGVKWVSRT
        M SSSSS+   TS +E SPR+ RSIQEIY+ + R+ +D   +FALFA VDP+ F+EAIQDEKWK  MDQEIDAIRRNETWEL++LP +++ LGVKWV RT
Subjt:  MASSSSSSPRPTSVEETSPRKTRSIQEIYDASRRMTEDDHVDFALFADVDPIHFEEAIQDEKWKAVMDQEIDAIRRNETWELVQLPESRKTLGVKWVSRT

Query:  KLKQNGEVEKYKARLIVKG
        KLK +G VEKYKARL+VKG
Subjt:  KLKQNGEVEKYKARLIVKG

SwissProt top hits (e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 2.4e-04 | % identity: 30.17
Query:  PTSVEETSPRKTRSIQEIYDASRRMTEDDHVDFALFADVDPIHFEEAI---QDEKWKAVMDQEIDAIRRNETWELVQLPESRKTLGVKWVSRTKLKQNGE
        PT  EE      RS +   + SRR    ++V   +  D +P   +E +   +  +    M +E++++++N T++LV+LP+ ++ L  KWV + K   + +
Subjt:  PTSVEETSPRKTRSIQEIYDASRRMTEDDHVDFALFADVDPIHFEEAI---QDEKWKAVMDQEIDAIRRNETWELVQLPESRKTLGVKWVSRTKLKQNGE

Query:  VEKYKARLIVKGVFKK
        + +YKARL+VKG  +K
Subjt:  VEKYKARLIVKGVFKK

P92520 Uncharacterized mitochondrial protein AtMg00820 | e value: 2.4e-09 | % identity: 46.03
Query:  AIQDEKWKAVMDQEIDAIRRNETWELVQLPESRKTLGVKWVSRTKLKQNGEVEKYKARLIVKG
        A++D  W   M +E+DA+ RN+TW LV  P ++  LG KWV +TKL  +G +++ KARL+ KG
Subjt:  AIQDEKWKAVMDQEIDAIRRNETWELVQLPESRKTLGVKWVSRTKLKQNGEVEKYKARLIVKG

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 1.5e-06 | % identity: 32.09
Query:  ASSSSSSPRPTSV--------------EETSPRKTRSIQEIYDASRRMTEDDH-VDFALFADVDPIHFEEAIQDEKWKAVMDQEIDAIRRNETWELVQLP
        ASSSS+SP P S+                 +P  T S+     A        + +  +L A+ +P    +A++DE+W+  M  EI+A   N TW+LV  P
Subjt:  ASSSSSSPRPTSV--------------EETSPRKTRSIQEIYDASRRMTEDDH-VDFALFADVDPIHFEEAIQDEKWKAVMDQEIDAIRRNETWELVQLP

Query:  ESRKTL-GVKWVSRTKLKQNGEVEKYKARLIVKG
         S  T+ G +W+   K   +G + +YKARL+ KG
Subjt:  ESRKTL-GVKWVSRTKLKQNGEVEKYKARLIVKG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 1.2e-05 | % identity: 30.47
Query:  ASSSSSSPRP--------TSVEETSPRKTRSI-QEIYDASRRMTEDDHVDFALFADVDPIHFEEAIQDEKWKAVMDQEIDAIRRNETWELVQLPESRKTL
        +SS+S+ P P          V   +P  T S+     D  R+  +      +L A+ +P    +A++D++W+  M  EI+A   N TW+LV  P    T+
Subjt:  ASSSSSSPRP--------TSVEETSPRKTRSI-QEIYDASRRMTEDDHVDFALFADVDPIHFEEAIQDEKWKAVMDQEIDAIRRNETWELVQLPESRKTL

Query:  -GVKWVSRTKLKQNGEVEKYKARLIVKG
         G +W+   K   +G + +YKARL+ KG
Subjt:  -GVKWVSRTKLKQNGEVEKYKARLIVKG

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | e value: 2.5e-09 | % identity: 31.88
Query:  MASSSSSSPRPTSVEETSPRKTRS---IQEIYDAS-RRMTEDDHVDFALFADVDPIH---------------FEEAIQDEKWKAVMDQEIDAIRRNETWE
        M S++  +  P     TS R+TR    +Q+ Y  S   +T  D   F  +  V P++               + EA +   W   MD EI A+    TWE
Subjt:  MASSSSSSPRPTSVEETSPRKTRS---IQEIYDAS-RRMTEDDHVDFALFADVDPIH---------------FEEAIQDEKWKAVMDQEIDAIRRNETWE

Query:  LVQLPESRKTLGVKWVSRTKLKQNGEVEKYKARLIVKG
        +  LP ++K +G KWV + K   +G +E+YKARL+ KG
Subjt:  LVQLPESRKTLGVKWVSRTKLKQNGEVEKYKARLIVKG

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | e value: 1.7e-10 | % identity: 46.03
Query:  AIQDEKWKAVMDQEIDAIRRNETWELVQLPESRKTLGVKWVSRTKLKQNGEVEKYKARLIVKG
        A++D  W   M +E+DA+ RN+TW LV  P ++  LG KWV +TKL  +G +++ KARL+ KG
Subjt:  AIQDEKWKAVMDQEIDAIRRNETWELVQLPESRKTLGVKWVSRTKLKQNGEVEKYKARLIVKG
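The % identity values reported for these hits can be cross-checked against the alignment midlines. The following is a minimal sketch, assuming that % identity is the number of identical positions (shown as letters in the midline; `+` marks a conservative substitution and a space a mismatch or gap) divided by the alignment length. The function name `percent_identity` is hypothetical, and the midline string is copied from the ATMG00820.1 alignment above.

```python
def percent_identity(midline: str) -> float:
    """Estimate % identity from a BLAST-style midline.

    Assumption: letters mark identical positions; '+' and spaces
    are non-identical positions that still count toward the length.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(midline), 2)


# Midline of the ATMG00820.1 alignment (63 aligned positions).
midline = "A++D  W   M +E+DA+ RN+TW LV  P ++  LG KWV +TKL  +G +++ KARL+ KG"
print(percent_identity(midline))
```

Under that assumption the midline has 29 identical positions out of 63, which agrees with the 46.03 listed for this hit.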


Sequences
CDS sequence
ATGGCTTCATCTTCCTCTTCATCTCCACGTCCCACAAGTGTTGAAGAAACTTCGCCTAGGAAAACAAGAAGTATTCAAGAGATTTATGATGCTTCAAGAAGGATGACAGA
AGATGATCATGTTGATTTTGCTTTATTTGCAGACGTGGATCCTATACATTTTGAAGAGGCGATTCAAGATGAAAAATGGAAAGCTGTAATGGATCAAGAGATCGATGCAA
TTAGAAGAAATGAAACATGGGAGTTAGTACAACTACCAGAGAGCAGAAAAACTCTTGGAGTAAAATGGGTGTCTAGAACAAAATTGAAGCAAAATGGTGAAGTTGAGAAG
TACAAGGCAAGACTCATTGTCAAAGGAGTCTTCAAGAAGCTTTGGTCTAGAAGGTTTGTTCTTCAGAAAGATTTGGATTTTTCTGGATGGATCGAGGTAAACTTCAGTCT
TCAGGTTTGCAGGAGGTTCAGAGCTTCAGTCTTCGACTTGGTAGGAGGACTTGAAGAGGATCTGAACTTCGATCTTCAGTTCTGCATTTTGCGGCAGGATTTGAGGGTGA
TCGGAACCTTGTTCTTCAGTTTTGTAGCTGAATCTGGCGAGGATCTGAGCTTCGATCTTCGTTCTGCAGTGGGTTTTGAAGAGGATCTGAACTTCGATCTTCGGTTTTGC
AGAAGGAGCTTCAATCTTCGCAAGTCTACGACACCTACTGACGGCCACCGTTTCAGCAACCAGCCGATCCGTAAGGCACGAGAGGTAGCAACGCTTTGCCGTACGACGGG
CTCGTCTTCGTGCTGTGGCGGCGCCGGCGAGAGGGTGAGGGAGGCGAGCGGGAATAATTATATTCAACCGCTCGTATTGCACCTAACACTTGCTTGCTACCTAGAAGGCA
GTGATCAACCCCTGATTCAGGCGGAATCCAGGATGGTCACTCGCCCTTCACGTCCTGCCTCAGGCAGTGATCAACCCCTGATTCAGGCGGAATTCAGGATGGTCACTCGC
CCTTCACGTCTTGCCTCAGGCAGTGATCAACCCCTGATTCAGGCGGAATTCAAGCCGATCACTTTATATTTTGTGATTCAATAA
mRNA sequence
ATGGCTTCATCTTCCTCTTCATCTCCACGTCCCACAAGTGTTGAAGAAACTTCGCCTAGGAAAACAAGAAGTATTCAAGAGATTTATGATGCTTCAAGAAGGATGACAGA
AGATGATCATGTTGATTTTGCTTTATTTGCAGACGTGGATCCTATACATTTTGAAGAGGCGATTCAAGATGAAAAATGGAAAGCTGTAATGGATCAAGAGATCGATGCAA
TTAGAAGAAATGAAACATGGGAGTTAGTACAACTACCAGAGAGCAGAAAAACTCTTGGAGTAAAATGGGTGTCTAGAACAAAATTGAAGCAAAATGGTGAAGTTGAGAAG
TACAAGGCAAGACTCATTGTCAAAGGAGTCTTCAAGAAGCTTTGGTCTAGAAGGTTTGTTCTTCAGAAAGATTTGGATTTTTCTGGATGGATCGAGGTAAACTTCAGTCT
TCAGGTTTGCAGGAGGTTCAGAGCTTCAGTCTTCGACTTGGTAGGAGGACTTGAAGAGGATCTGAACTTCGATCTTCAGTTCTGCATTTTGCGGCAGGATTTGAGGGTGA
TCGGAACCTTGTTCTTCAGTTTTGTAGCTGAATCTGGCGAGGATCTGAGCTTCGATCTTCGTTCTGCAGTGGGTTTTGAAGAGGATCTGAACTTCGATCTTCGGTTTTGC
AGAAGGAGCTTCAATCTTCGCAAGTCTACGACACCTACTGACGGCCACCGTTTCAGCAACCAGCCGATCCGTAAGGCACGAGAGGTAGCAACGCTTTGCCGTACGACGGG
CTCGTCTTCGTGCTGTGGCGGCGCCGGCGAGAGGGTGAGGGAGGCGAGCGGGAATAATTATATTCAACCGCTCGTATTGCACCTAACACTTGCTTGCTACCTAGAAGGCA
GTGATCAACCCCTGATTCAGGCGGAATCCAGGATGGTCACTCGCCCTTCACGTCCTGCCTCAGGCAGTGATCAACCCCTGATTCAGGCGGAATTCAGGATGGTCACTCGC
CCTTCACGTCTTGCCTCAGGCAGTGATCAACCCCTGATTCAGGCGGAATTCAAGCCGATCACTTTATATTTTGTGATTCAATAA
Protein sequence
MASSSSSSPRPTSVEETSPRKTRSIQEIYDASRRMTEDDHVDFALFADVDPIHFEEAIQDEKWKAVMDQEIDAIRRNETWELVQLPESRKTLGVKWVSRTKLKQNGEVEK
YKARLIVKGVFKKLWSRRFVLQKDLDFSGWIEVNFSLQVCRRFRASVFDLVGGLEEDLNFDLQFCILRQDLRVIGTLFFSFVAESGEDLSFDLRSAVGFEEDLNFDLRFC
RRSFNLRKSTTPTDGHRFSNQPIRKAREVATLCRTTGSSSCCGGAGERVREASGNNYIQPLVLHLTLACYLEGSDQPLIQAESRMVTRPSRPASGSDQPLIQAEFRMVTR
PSRLASGSDQPLIQAEFKPITLYFVIQ
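
The relationship between the CDS and protein sequences above can be checked by translating the CDS. This is a minimal sketch, assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are hypothetical, and only the first 51 bases of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 51 bases of the CDS sequence listed above.
cds_start = "ATGGCTTCATCTTCCTCTTCATCTCCACGTCCCACAAGTGTTGAAGAAACT"
print(translate(cds_start))
```

The translated fragment should match the first 17 residues of the protein sequence (MASSSSSSPRPTSVEET). Passing the full CDS, joined into a single string without line breaks, would extend the same check to the whole protein.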