CuGenDBv2

Lag0001640 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0001640
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Integrase catalytic domain-containing protein
Genome location: chr4:33937568..33940290
RNA-Seq Expression: Lag0001640
Synteny: Lag0001640
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits | e value | % identity | Alignment
KAG8482924.1 hypothetical protein CXB51_024275 [Gossypium anomalum] | 8.9e-33 | 43
Query:  MYLMVCTRPDLAHGLSVVSRYMANPGKEHCK-----LEILSANSCVALS------------------------TMEAEYMAVTEAFKEGLWLRELVEEFG
        MY MVC+RPDL++ +S VSRYMANPGKEH K     L  L   + V L                         ++ AEYMA+TEA KE +WL+ L  E  
Subjt:  MYLMVCTRPDLAHGLSVVSRYMANPGKEHCK-----LEILSANSCVALS------------------------TMEAEYMAVTEAFKEGLWLRELVEEFG

Query:  EDLKEEHEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRDIILEGKIKLKKIPSNENPTDGLTKTLPLAKLESCI
        EDL+                         +  V CDSQSA++L+K+Q FHERTKHIDVR+HF+RDII  G I + KI ++ENP D +TK+LP+ K E C+
Subjt:  EDLKEEHEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRDIILEGKIKLKKIPSNENPTDGLTKTLPLAKLESCI

KAG8485521.1 hypothetical protein CXB51_019057 [Gossypium anomalum] | 5.2e-33 | 42.03
Query:  MYLMVCTRPDLAHGLSVVSRYMANPGKEHCK-----LEILSANSCVALS------------------------TMEAEYMAVTEAFKEGLWLRELVEEFG
        MY MVC+RPDL++ +S VSRYMANPGKEH K     L  L   + V L                         ++ AEYMA+TEA KE +WL+ L  E  
Subjt:  MYLMVCTRPDLAHGLSVVSRYMANPGKEHCK-----LEILSANSCVALS------------------------TMEAEYMAVTEAFKEGLWLRELVEEFG

Query:  EDLKEEHEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRDIILEGKIKLKKIPSNENPTDGLTKTLPLAKLESCI
        EDL+                         +  V CDSQSA++L+K+Q FHERTKHIDVR+HF+RDII  G I + KI ++ENP D +TK+LP+ K E C+
Subjt:  EDLKEEHEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRDIILEGKIKLKKIPSNENPTDGLTKTLPLAKLESCI

Query:  KALKIAR
          + + R
Subjt:  KALKIAR

KAG8489970.1 hypothetical protein CXB51_015404 [Gossypium anomalum] | 5.2e-33 | 37.94
Query:  MYLMVCTRPDLAHGLSVVSRYMANPGKEHCKL---------------------------------------------EILSANSC-----------VALS
        MY MVC+RPDL++ +S VSRYMANPGKEH K                                               + +   C           VALS
Subjt:  MYLMVCTRPDLAHGLSVVSRYMANPGKEHCKL---------------------------------------------EILSANSC-----------VALS

Query:  TMEAEYMAVTEAFKEGLWLRELVEEFGEDLKEEHEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRDIILEGKIK
        T EAEYMA+TEA KE +WL+ L  E  EDL+                         +  V CDSQSA++L+K+Q FHERTKHIDVR+HF+RDII  G I 
Subjt:  TMEAEYMAVTEAFKEGLWLRELVEEFGEDLKEEHEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRDIILEGKIK

Query:  LKKIPSNENPTDGLTKTLPLAKLESCIKALKIARWRIVKIFAILRQMIKSHLY
        + KI ++ENP D +TK+LP+ K E C+    + R  +V++F +   M  SH Y
Subjt:  LKKIPSNENPTDGLTKTLPLAKLESCIKALKIARWRIVKIFAILRQMIKSHLY

KAG8495869.1 hypothetical protein CXB51_007704 [Gossypium anomalum] | 2.8e-34 | 42.79
Query:  MYLMVCTRPDLAHGLSVVSRYMANPGKEHCKL-------------------EILSANSC-----------VALSTMEAEYMAVTEAFKEGLWLRELVEEF
        MY MVC+RPDL++ +S VSRYMANPGKEH K                     + +   C           VALST EAEYMA+TEA KE +WL+ L  E 
Subjt:  MYLMVCTRPDLAHGLSVVSRYMANPGKEHCKL-------------------EILSANSC-----------VALSTMEAEYMAVTEAFKEGLWLRELVEEF

Query:  GEDLKEEHEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRDIILEGKIKLKKIPSNENPTDGLTKTLPLAKLESC
         E L+                         +  V CDSQSA++L+K+Q FHERTKHIDVR+HF+RDII  G I + KI ++ENP D +TK+LP+ K E C
Subjt:  GEDLKEEHEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRDIILEGKIKLKKIPSNENPTDGLTKTLPLAKLESC

Query:  IKALKIAR
        +  ++  R
Subjt:  IKALKIAR

KAG8496414.1 hypothetical protein CXB51_007522 [Gossypium anomalum] | 5.2e-33 | 40.83
Query:  MYLMVCTRPDLAHGLSVVSRYMANPGKEHCKL------------------------------------EILSANSC-----------VALSTMEAEYMAV
        MY MVC+RPDL++ +S VSRYMANPGKEH K                                      + +   C           VALST EAEYMA+
Subjt:  MYLMVCTRPDLAHGLSVVSRYMANPGKEHCKL------------------------------------EILSANSC-----------VALSTMEAEYMAV

Query:  TEAFKEGLWLRELVEEFGEDLKEEHEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRDIILEGKIKLKKIPSNEN
        TEA KE +WL+ L  E  EDL+                         +  V CDSQSA++L+K+Q FHERTKHIDVR+HF+RDII  G I + KI ++EN
Subjt:  TEAFKEGLWLRELVEEFGEDLKEEHEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRDIILEGKIKLKKIPSNEN

Query:  PTDGLTKTLPLAKLESCI
        P D +TK+LP+ K E C+
Subjt:  PTDGLTKTLPLAKLESCI

TrEMBL top hits | e value | % identity | Alignment
A0A2N9EYI8 Uncharacterized protein | 5.7e-33 | 43.9
Query:  MYLMVCTRPDLAHGLSVVSRYMANPGKEH--------------CKLEIL--------------SANSCVALSTMEAEYMAVTEAFKEGLWLRELVEEFGE
        MY MVCTRPDLAH +S VSRYMANPG+EH               + EIL               A+  VA+ST EAEYMAV EA KE LWL+ LV+E G 
Subjt:  MYLMVCTRPDLAHGLSVVSRYMANPGKEH--------------CKLEIL--------------SANSCVALSTMEAEYMAVTEAFKEGLWLRELVEEFGE

Query:  DLKEEHEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRDIILEGKIKLKKIPSNENPTDGLTKTLPLAKLESCIK
                          G           ++HCDSQSA+YL+KNQ +H RTKHIDVRFH IR++I+ G I L+K+ ++EN  D LTK +  AK + C+ 
Subjt:  DLKEEHEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRDIILEGKIKLKKIPSNENPTDGLTKTLPLAKLESCIK

Query:  ALKIA
         + ++
Subjt:  ALKIA

A0A2N9FZX5 Integrase catalytic domain-containing protein | 3.7e-32 | 41.95
Query:  MYLMVCTRPDLAHGLSVVSRYMANPGKEHCK----------------------------LEILSANSCVALSTMEAEYMAVTEAFKEGLWLRELVEEFGE
        MY MVCTRPDLAH +S VSRYMANPG+EH                              +  + A+  VA+ST EAEYMAV EA KE LWL+ LV+E G 
Subjt:  MYLMVCTRPDLAHGLSVVSRYMANPGKEHCK----------------------------LEILSANSCVALSTMEAEYMAVTEAFKEGLWLRELVEEFGE

Query:  DLKEEHEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRDIILEGKIKLKKIPSNENPTDGLTKTLPLAKLESCIK
                          G           ++HCDSQSA+YL+KNQ +H RTKHI+VRFH IR++I+ G I L+K+ +++N TD LTK +  AK + C+ 
Subjt:  DLKEEHEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRDIILEGKIKLKKIPSNENPTDGLTKTLPLAKLESCIK

Query:  ALKIA
         + ++
Subjt:  ALKIA

A0A2N9IN51 Integrase catalytic domain-containing protein | 1.3e-32 | 42.44
Query:  MYLMVCTRPDLAHGLSVVSRYMANPGKEHCK----------------------------LEILSANSCVALSTMEAEYMAVTEAFKEGLWLRELVEEFGE
        MY MVCTRPDLAH +S VSRYMANPG+EH                              +  + A+  VA+ST EAEYMAV EA KE LWL+ LV+E G 
Subjt:  MYLMVCTRPDLAHGLSVVSRYMANPGKEHCK----------------------------LEILSANSCVALSTMEAEYMAVTEAFKEGLWLRELVEEFGE

Query:  DLKEEHEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRDIILEGKIKLKKIPSNENPTDGLTKTLPLAKLESCIK
                          G           ++HCDSQSA+YL+KNQ +H RTKHIDVRFH IR++I+ G I L+K+ ++EN  D LTK +  AK + C+ 
Subjt:  DLKEEHEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRDIILEGKIKLKKIPSNENPTDGLTKTLPLAKLESCIK

Query:  ALKIA
         + ++
Subjt:  ALKIA

A0A438IBT7 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 2.4e-31 | 43.01
Query:  MYLMVCTRPDLAHGLSVVSRYMANPGKEHCK-----LEILSANS------------------CVALSTMEAEYMAVTEAFKEGLWLRELVEEFGEDLKEE
        MY MVCTRPD+AH + VVS++M+NPGKEH       L  L   S                  CVALST+EAE++A+TEA KE LWL++ ++E G  LK+E
Subjt:  MYLMVCTRPDLAHGLSVVSRYMANPGKEHCK-----LEILSANS------------------CVALSTMEAEYMAVTEAFKEGLWLRELVEEFGEDLKEE

Query:  HEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRDIILEGKIKLKKIPSNENPTDGLTKTLPLAKLESC
          V                       +HCDSQSA++LSKN +FH R+KHIDVR+H+IRD++ +  ++L+K+ +++N +D LTK L   K E C
Subjt:  HEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRDIILEGKIKLKKIPSNENPTDGLTKTLPLAKLESC

A0A6A3BFR2 Integrase catalytic domain-containing protein | 6.3e-32 | 42.57
Query:  MYLMVCTRPDLAHGLSVVSRYMANPGKEHCK-----LEIL--------------------------SANSCVALSTMEAEYMAVTEAFKEGLWLRELVEE
        MY MVC RPDLA+ +SV+SR+MANPG+ H +     L  L                          +  S VALST EAEY+AVTEA KE +WL+ +VEE
Subjt:  MYLMVCTRPDLAHGLSVVSRYMANPGKEHCK-----LEIL--------------------------SANSCVALSTMEAEYMAVTEAFKEGLWLRELVEE

Query:  FGEDLKEEHEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRDIILEGKIKLKKIPSNENPTDGLTKTLPLAKLES
         G                        ++ K +  V CD+QS ++L KNQ FHER+KHIDV+ HF+RD+I +G I +KKIP+ ENP D L K LP+AK   
Subjt:  FGEDLKEEHEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRDIILEGKIKLKKIPSNENPTDGLTKTLPLAKLES

Query:  CI
        C+
Subjt:  CI

SwissProt top hits | e value | % identity | Alignment
P04146 Copia protein | 1.7e-10 | 32.54
Query:  VALSTMEAEYMAVTEAFKEGLWLRELVEEFGEDLKEEHEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRDIILE
        VA S+ EAEYMA+ EA +E LWL+ L+     ++K E+ +K                      ++ D+Q  + ++ N + H+R KHID+++HF R+ +  
Subjt:  VALSTMEAEYMAVTEAFKEGLWLRELVEEFGEDLKEEHEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRDIILE

Query:  GKIKLKKIPSNENPTDGLTKTLPLAK
          I L+ IP+     D  TK LP A+
Subjt:  GKIKLKKIPSNENPTDGLTKTLPLAK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 3.7e-29 | 36.28
Query:  MYLMVCTRPDLAHGLSVVSRYMANPGKEHCK-----LEILSANS---------------------------------------------------CVALS
        MY MVCTRPD+AH + VVSR++ NPGKEH +     L  L   +                                                   CVALS
Subjt:  MYLMVCTRPDLAHGLSVVSRYMANPGKEHCK-----LEILSANS---------------------------------------------------CVALS

Query:  TMEAEYMAVTEAFKEGLWLRELVEEFGEDLKEEHEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRDIILEGKIK
        T EAEY+A TE  KE +WL+  ++E G   KE                         + V+CDSQSA+ LSKN  +H RTKHIDVR+H+IR+++ +  +K
Subjt:  TMEAEYMAVTEAFKEGLWLRELVEEFGEDLKEEHEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRDIILEGKIK

Query:  LKKIPSNENPTDGLTKTLPLAKLESC
        + KI +NENP D LTK +P  K E C
Subjt:  LKKIPSNENPTDGLTKTLPLAKLESC

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 1.8e-07 | 26.09
Query:  VALSTMEAEYMAVTEAFKEGLWLRELVEEFGEDLKEEHEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRDIILE
        V  S+ EAEY +V     E  W+  L+ E G                        +R      ++CD+  A YL  N  FH R KHI + +HFIR+ +  
Subjt:  VALSTMEAEYMAVTEAFKEGLWLRELVEEFGEDLKEEHEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRDIILE

Query:  GKIKLKKIPSNENPTDGLTKTLPLAKLESCIKALKIAR
        G +++  + +++   D LTK L     ++    + + R
Subjt:  GKIKLKKIPSNENPTDGLTKTLPLAKLESCIKALKIAR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 5.1e-07 | 29.51
Query:  VALSTMEAEYMAVTEAFKEGLWLRELVEEFGEDLKEEHEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRDIILE
        V  S+ EAEY +V     E  W+  L+ E G  L                       H  +  ++CD+  A YL  N  FH R KHI + +HFIR+ +  
Subjt:  VALSTMEAEYMAVTEAFKEGLWLRELVEEFGEDLKEEHEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRDIILE

Query:  GKIKLKKIPSNENPTDGLTKTL
        G +++  + +++   D LTK L
Subjt:  GKIKLKKIPSNENPTDGLTKTL

Arabidopsis top hits | e value | % identity | Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | 3.2e-04 | 30.21
Query:  VALSTMEAEYMAVTEAFKEGLWLRELVEEFGEDLKEEHEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRD
        V+ S+ EAEY A++ A  E +WL +   E    L +                       TL  + CD+ +A++++ N  FHERTKHI+   H +R+
Subjt:  VALSTMEAEYMAVTEAFKEGLWLRELVEEFGEDLKEEHEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQSAVYLSKNQTFHERTKHIDVRFHFIRD


Sequences
CDS sequence
ATGTACCTCATGGTGTGCACCAGACCTGATTTAGCTCATGGACTTAGTGTTGTGAGCAGATACATGGCTAATCCAGGGAAAGAACATTGTAAGCTGGAGATCCTGTCTGC
AAACAGTTGTGTAGCATTATCAACTATGGAGGCAGAGTATATGGCAGTCACTGAGGCTTTTAAAGAGGGCTTGTGGCTGAGAGAGTTAGTTGAAGAATTTGGTGAAGACC
TTAAAGAAGAGCATGAAGTGAAGGACTGGAATTCCCAATTCCGTTACAGTGGAAGCAATTGGACCGTTCGACATAAAACCCTACATGAAGTGCACTGTGACAGTCAAAGT
GCAGTATATTTGTCAAAGAACCAAACCTTCCACGAGAGGACAAAGCATATTGACGTAAGGTTCCACTTCATTAGAGACATCATTCTTGAAGGAAAGATCAAGCTGAAGAA
AATTCCTTCAAATGAGAATCCTACTGATGGTTTAACAAAGACTCTTCCTCTTGCTAAGCTTGAGAGCTGCATAAAAGCTCTCAAGATAGCAAGGTGGAGAATTGTTAAAA
TCTTTGCTATTCTAAGACAAATGATTAAGAGCCACTTGTATTATTTAATCCTAGTCAACCTTGTAACAGTGTCTCAGCTAGGGAAAGGAATTTTGGACATGGCCGTACAA
GTTGGTTATGGCCAGGCAAAATCCGTTATTCAGGCAAGGCTCCTTCTAGTAGTCAGTGTGGGCTCCATCTGCAATTCAGAGGTTGATATATTTCGTCCCTTGCAATGGGA
AAATTTTGAGAGAACAATCGTGAGTGTTGAGAGGCTGAGATTGTTGAGAGAGGCGAAGGATTGA
mRNA sequence
ATGTACCTCATGGTGTGCACCAGACCTGATTTAGCTCATGGACTTAGTGTTGTGAGCAGATACATGGCTAATCCAGGGAAAGAACATTGTAAGCTGGAGATCCTGTCTGC
AAACAGTTGTGTAGCATTATCAACTATGGAGGCAGAGTATATGGCAGTCACTGAGGCTTTTAAAGAGGGCTTGTGGCTGAGAGAGTTAGTTGAAGAATTTGGTGAAGACC
TTAAAGAAGAGCATGAAGTGAAGGACTGGAATTCCCAATTCCGTTACAGTGGAAGCAATTGGACCGTTCGACATAAAACCCTACATGAAGTGCACTGTGACAGTCAAAGT
GCAGTATATTTGTCAAAGAACCAAACCTTCCACGAGAGGACAAAGCATATTGACGTAAGGTTCCACTTCATTAGAGACATCATTCTTGAAGGAAAGATCAAGCTGAAGAA
AATTCCTTCAAATGAGAATCCTACTGATGGTTTAACAAAGACTCTTCCTCTTGCTAAGCTTGAGAGCTGCATAAAAGCTCTCAAGATAGCAAGGTGGAGAATTGTTAAAA
TCTTTGCTATTCTAAGACAAATGATTAAGAGCCACTTGTATTATTTAATCCTAGTCAACCTTGTAACAGTGTCTCAGCTAGGGAAAGGAATTTTGGACATGGCCGTACAA
GTTGGTTATGGCCAGGCAAAATCCGTTATTCAGGCAAGGCTCCTTCTAGTAGTCAGTGTGGGCTCCATCTGCAATTCAGAGGTTGATATATTTCGTCCCTTGCAATGGGA
AAATTTTGAGAGAACAATCGTGAGTGTTGAGAGGCTGAGATTGTTGAGAGAGGCGAAGGATTGA
Protein sequence
MYLMVCTRPDLAHGLSVVSRYMANPGKEHCKLEILSANSCVALSTMEAEYMAVTEAFKEGLWLRELVEEFGEDLKEEHEVKDWNSQFRYSGSNWTVRHKTLHEVHCDSQS
AVYLSKNQTFHERTKHIDVRFHFIRDIILEGKIKLKKIPSNENPTDGLTKTLPLAKLESCIKALKIARWRIVKIFAILRQMIKSHLYYLILVNLVTVSQLGKGILDMAVQ
VGYGQAKSVIQARLLLVVSVGSICNSEVDIFRPLQWENFERTIVSVERLRLLREAKD