CuGenDBv2

Lag0035549 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0035549
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Integrase catalytic domain-containing protein
Genome location: chr3:23873836..23876484
RNA-Seq Expression: Lag0035549
Synteny: Lag0035549
Gene Ontology terms:
  GO:0043167 - ion binding (molecular function)
  GO:0097159 - organic cyclic compound binding (molecular function)
  GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domains: NA
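
The genome location string above can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); the function name parse_location is hypothetical and not part of any CuGenDB tooling.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location format: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("chr3:23873836..23876484")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 2649 bp on chr3; with 0-based half-open coordinates the span would instead be end - start.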


Homology
GenBank top hits (e value, % identity, alignment)
KAF3616341.1 hypothetical protein FXO38_34599 [Capsicum annuum]; e value: 5.9e-44; % identity: 42.19
Query:  EKPEKFNGSDFKRWQQKMLFYLTTLSLARFLKEDAPVLDEGETNKEKLLALEAWKHTEYLCRNYILNGLENTLYNVYS----------------------
        EKP+KF   +FKRWQQKM FYLTTL L  F  +D P + E  ++KE+ + +E WKH+++LCRN IL+GL++ LYNVYS                      
Subjt:  EKPEKFNGSDFKRWQQKMLFYLTTLSLARFLKEDAPVLDEGETNKEKLLALEAWKHTEYLCRNYILNGLENTLYNVYS----------------------

Query:  --------------------------------GVDSAK-----------------------------------ARSTMESEFIALDKAGEEVEWLRNFLE
                                         +DS                                     ARS ME EFI LDKAGEE EWL NFLE
Subjt:  --------------------------------GVDSAK-----------------------------------ARSTMESEFIALDKAGEEVEWLRNFLE

Query:  DIPNWTKPVPPICIHCDSQAAIGRAQNLMYNGKSRHIRRRHNTIRQLLSNGIISID
        DI  W KPV P+CIHCDSQAAI +A ++MYNGK RHIRRRHNTI++LLS+GII++D
Subjt:  DIPNWTKPVPPICIHCDSQAAIGRAQNLMYNGKSRHIRRRHNTIRQLLSNGIISID

KAF5931719.1 hypothetical protein HYC85_027890 [Camellia sinensis]; e value: 7.2e-42; % identity: 35.15
Query:  TPSPIPQGGGNHGEKPEKFNGSDFKRWQQKMLFYLTTLSLARFLKEDAPVLDEGETNKEKLLALEAWKHTEYLCRNYILNGLENTLYNVYSGVDSAK---
        TPSP+     NHGEKPEKF+G+DFKRWQQKMLFYLTTL+LARFL+EDAP+L E +T+++ + A++AWKH ++LCRNY+LN L+N LYNVYS + +AK   
Subjt:  TPSPIPQGGGNHGEKPEKFNGSDFKRWQQKMLFYLTTLSLARFLKEDAPVLDEGETNKEKLLALEAWKHTEYLCRNYILNGLENTLYNVYSGVDSAK---

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------ARSTMESEFIALDKAGEEVEWLRNFLEDIP
                                                                              ARSTMESEFIALDKAGEE EWLR+FLEDIP
Subjt:  ----------------------------------------------------------------------ARSTMESEFIALDKAGEEVEWLRNFLEDIP

Query:  NWTKPVPPICIHCDSQAAIGRAQNLMYNGK
         W KPVP ICIHCDSQ+AIGRAQ+ MYNGK
Subjt:  NWTKPVPPICIHCDSQAAIGRAQNLMYNGK

PHT32382.1 hypothetical protein CQW23_28719 [Capsicum baccatum]; e value: 2.7e-41; % identity: 53.29
Query:  EKPEKFNGSDFKRWQQKMLFYLTTLSLARFLKEDAPVLDEGETNKEKLLALEAWKHTEYLCRNYILNGLENTLYNVYSGVDSAKARSTMESEFIALDKAG
        EKP+KF G DFKRWQQK+ FYLTT  L RF+ E AP + EG + +EK + +EAWKH++ L RN IL+                             DK G
Subjt:  EKPEKFNGSDFKRWQQKMLFYLTTLSLARFLKEDAPVLDEGETNKEKLLALEAWKHTEYLCRNYILNGLENTLYNVYSGVDSAKARSTMESEFIALDKAG

Query:  EEVEWLRNFLEDIPNWTKPVPPICIHCDSQAAIGRAQNLMYNGKSRHIRRRHNTIRQLLSNGIISID
        EE +WL+NFLEDIP W K V P+CIHCDSQAAIGRA ++MYN KS HIRRR+NT+R++LS+GII +D
Subjt:  EEVEWLRNFLEDIPNWTKPVPPICIHCDSQAAIGRAQNLMYNGKSRHIRRRHNTIRQLLSNGIISID

PHT70243.1 hypothetical protein T459_25347 [Capsicum annuum]; e value: 8.2e-54; % identity: 53.62
Query:  EKPEKFNGSDFKRWQQKMLFYLTTLSLARFLKEDAPVLDEGETNKEKLLALEAWKHTEYLCRNYILNGLENTLYNVYSGVDSAKA---------------
        EK EKF G DFKRWQQKM FYLTTL L RF  EDAP + EG ++KE  + +EAWKH+++LCRNYIL+GL++ LY+ YSG  ++K                
Subjt:  EKPEKFNGSDFKRWQQKMLFYLTTLSLARFLKEDAPVLDEGETNKEKLLALEAWKHTEYLCRNYILNGLENTLYNVYSGVDSAKA---------------

Query:  -------------------------RSTMESEFIALDKAGEEVEWLRNFLEDIPNWTKPVPPICIHCDSQAAIGRAQNLMYNGKSRHIRRRHNTIRQLLS
                                 R   E EFIALDKAGEE EW +NFLEDIP WTKPV P+CI+CDSQAAIGRA ++MYNGKSR++R RHNTIR++LS
Subjt:  -------------------------RSTMESEFIALDKAGEEVEWLRNFLEDIPNWTKPVPPICIHCDSQAAIGRAQNLMYNGKSRHIRRRHNTIRQLLS

Query:  NGIISID
        + II++D
Subjt:  NGIISID

PHU26485.1 hypothetical protein BC332_04817 [Capsicum chinense]; e value: 1.5e-36; % identity: 46.98
Query:  EKPEKFNGSDFKRWQQKMLFYLTTLSLARFLKEDAPV-LDEGET-----NKEKL----------------LALEAWKHTEYLCRNYILNGLE--NTLYNV
        EKPEKF+G DFKRWQQK+ FYLTTL L RF  EDAP  +D   T     N+E                   A    + T  +C        E  N+  N+
Subjt:  EKPEKFNGSDFKRWQQKMLFYLTTLSLARFLKEDAPV-LDEGET-----NKEKL----------------LALEAWKHTEYLCRNYILNGLE--NTLYNV

Query:  YSGVDSAK------------------------ARSTMESEFIALDKAGEEVEWLRNFLEDIPNWTKPVPPICIHCDSQAAIGRAQNLMYNGKSRHIRRRH
          G + AK                        A STMESEF ALDKA EE EWLRNFLEDI  W  PV P+CIHCDSQA IGRA ++MY+GKS HIR+RH
Subjt:  YSGVDSAK------------------------ARSTMESEFIALDKAGEEVEWLRNFLEDIPNWTKPVPPICIHCDSQAAIGRAQNLMYNGKSRHIRRRH

Query:  NTIRQLLSNGIISID
        NT+R+LL N II++D
Subjt:  NTIRQLLSNGIISID

TrEMBL top hits (e value, % identity, alignment)
A0A2G2VHD3 Uncharacterized protein; e value: 1.3e-41; % identity: 53.29
Query:  EKPEKFNGSDFKRWQQKMLFYLTTLSLARFLKEDAPVLDEGETNKEKLLALEAWKHTEYLCRNYILNGLENTLYNVYSGVDSAKARSTMESEFIALDKAG
        EKP+KF G DFKRWQQK+ FYLTT  L RF+ E AP + EG + +EK + +EAWKH++ L RN IL+                             DK G
Subjt:  EKPEKFNGSDFKRWQQKMLFYLTTLSLARFLKEDAPVLDEGETNKEKLLALEAWKHTEYLCRNYILNGLENTLYNVYSGVDSAKARSTMESEFIALDKAG

Query:  EEVEWLRNFLEDIPNWTKPVPPICIHCDSQAAIGRAQNLMYNGKSRHIRRRHNTIRQLLSNGIISID
        EE +WL+NFLEDIP W K V P+CIHCDSQAAIGRA ++MYN KS HIRRR+NT+R++LS+GII +D
Subjt:  EEVEWLRNFLEDIPNWTKPVPPICIHCDSQAAIGRAQNLMYNGKSRHIRRRHNTIRQLLSNGIISID

A0A2G2YKH8 Uncharacterized protein; e value: 4.0e-54; % identity: 53.62
Query:  EKPEKFNGSDFKRWQQKMLFYLTTLSLARFLKEDAPVLDEGETNKEKLLALEAWKHTEYLCRNYILNGLENTLYNVYSGVDSAKA---------------
        EK EKF G DFKRWQQKM FYLTTL L RF  EDAP + EG ++KE  + +EAWKH+++LCRNYIL+GL++ LY+ YSG  ++K                
Subjt:  EKPEKFNGSDFKRWQQKMLFYLTTLSLARFLKEDAPVLDEGETNKEKLLALEAWKHTEYLCRNYILNGLENTLYNVYSGVDSAKA---------------

Query:  -------------------------RSTMESEFIALDKAGEEVEWLRNFLEDIPNWTKPVPPICIHCDSQAAIGRAQNLMYNGKSRHIRRRHNTIRQLLS
                                 R   E EFIALDKAGEE EW +NFLEDIP WTKPV P+CI+CDSQAAIGRA ++MYNGKSR++R RHNTIR++LS
Subjt:  -------------------------RSTMESEFIALDKAGEEVEWLRNFLEDIPNWTKPVPPICIHCDSQAAIGRAQNLMYNGKSRHIRRRHNTIRQLLS

Query:  NGIISID
        + II++D
Subjt:  NGIISID

A0A2N9HX08 Integrase catalytic domain-containing protein; e value: 2.0e-37; % identity: 72.17
Query:  ARSTMESEFIALDKAGEEVEWLRNFLEDIPNWTKPVPPICIHCDSQAAIGRAQNLMYNGKSRHIRRRHNTIRQLLSNGIISIDCL-GWSAVTAPETARLD
        ARSTMESEFIALDKAGEE EWLR+FLED+P WTKPVPPICIHCDSQ+AIGRAQ+ MYNGKSRHIRRRHNT+RQLLSNGIISID +     +  P T  L 
Subjt:  ARSTMESEFIALDKAGEEVEWLRNFLEDIPNWTKPVPPICIHCDSQAAIGRAQNLMYNGKSRHIRRRHNTIRQLLSNGIISIDCL-GWSAVTAPETARLD

Query:  FRRLKFSSQLKFTKP
          R+  SS+    KP
Subjt:  FRRLKFSSQLKFTKP

A0A2N9ILQ6 Integrase catalytic domain-containing protein; e value: 8.9e-38; % identity: 65.65
Query:  LENTLYNVYSGVDSAKARSTMESEFIALDKAGEEVEWLRNFLEDIPNWTKPVPPICIHCDSQAAIGRAQNLMYNGKSRHIRRRHNTIRQLLSNGIISIDC
        LE    +  S   +  ARSTMESEFIALDKAGEE EWLR+FLED+P WTKPVPPICIHCDSQ+AIGRAQ+ MYNGKSRHIRRRHNT+RQLLSNGIISID 
Subjt:  LENTLYNVYSGVDSAKARSTMESEFIALDKAGEEVEWLRNFLEDIPNWTKPVPPICIHCDSQAAIGRAQNLMYNGKSRHIRRRHNTIRQLLSNGIISIDC

Query:  L-GWSAVTAPETARLDFRRLKFSSQLKFTKP
        +     +  P T  L   R+  SS+    KP
Subjt:  L-GWSAVTAPETARLDFRRLKFSSQLKFTKP

A0A7J7FTW0 (S)-2-hydroxy-acid oxidase; e value: 3.5e-42; % identity: 35.15
Query:  TPSPIPQGGGNHGEKPEKFNGSDFKRWQQKMLFYLTTLSLARFLKEDAPVLDEGETNKEKLLALEAWKHTEYLCRNYILNGLENTLYNVYSGVDSAK---
        TPSP+     NHGEKPEKF+G+DFKRWQQKMLFYLTTL+LARFL+EDAP+L E +T+++ + A++AWKH ++LCRNY+LN L+N LYNVYS + +AK   
Subjt:  TPSPIPQGGGNHGEKPEKFNGSDFKRWQQKMLFYLTTLSLARFLKEDAPVLDEGETNKEKLLALEAWKHTEYLCRNYILNGLENTLYNVYSGVDSAK---

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------ARSTMESEFIALDKAGEEVEWLRNFLEDIP
                                                                              ARSTMESEFIALDKAGEE EWLR+FLEDIP
Subjt:  ----------------------------------------------------------------------ARSTMESEFIALDKAGEEVEWLRNFLEDIP

Query:  NWTKPVPPICIHCDSQAAIGRAQNLMYNGK
         W KPVP ICIHCDSQ+AIGRAQ+ MYNGK
Subjt:  NWTKPVPPICIHCDSQAAIGRAQNLMYNGK

SwissProt top hits (e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 3.2e-08; % identity: 32.58
Query:  SGVDSAKARSTMESEFIALDKAGEEVEWLRNFLEDIPNWTKPVPPICIHCDSQAAIGRAQNLMYNGKSRHIRRRHNTIRQLLSNGIISI
        S +    A ST E+E+IA  + G+E+ WL+ FL+++    K      ++CDSQ+AI  ++N MY+ +++HI  R++ IR+++ +  + +
Subjt:  SGVDSAKARSTMESEFIALDKAGEEVEWLRNFLEDIPNWTKPVPPICIHCDSQAAIGRAQNLMYNGKSRHIRRRHNTIRQLLSNGIISI

Arabidopsis top hits (e value, % identity, alignment)
AT4G00980.1 zinc knuckle (CCHC-type) family protein; e value: 7.5e-05; % identity: 29.76
Query:  KPEKFNGSDFKRWQQKMLFYLTTLSLARFLKEDAPVLDEG---ETNKEKLLALEA----WKHTEYLCRNYILNGLENTLYNVYS
        K  +F+G  +  W  +M  +L  L L   L E  P +      ETN  ++   +A    W   +YLC  +++N L + LY  YS
Subjt:  KPEKFNGSDFKRWQQKMLFYLTTLSLARFLKEDAPVLDEG---ETNKEKLLALEA----WKHTEYLCRNYILNGLENTLYNVYS

AT5G35970.1 P-loop containing nucleoside triphosphate hydrolases superfamily protein; e value: 8.0e-07; % identity: 61.11
Query:  KRELGKSVVRWIGKAMRAVASDFASTEDQGDFSKLR
        +R+LG++VV+WI +AM+A+ASDFA+ E QG+FS+LR
Subjt:  KRELGKSVVRWIGKAMRAVASDFASTEDQGDFSKLR


Sequences
CDS sequence
ATGTCTTCAATAATACCGAAGACACCGAGTCCCATACCACAAGGTGGTGGAAATCATGGGGAAAAGCCTGAGAAGTTTAATGGCAGTGACTTCAAACGTTGGCAACAAAA
GATGTTGTTCTATCTCACAACATTGAGTCTGGCAAGATTTCTCAAAGAAGATGCTCCTGTTCTGGATGAAGGAGAGACCAACAAAGAAAAGTTGCTGGCACTTGAGGCAT
GGAAACATACAGAATATTTGTGCAGGAACTATATCTTGAATGGACTGGAAAACACACTATATAATGTTTATAGTGGGGTTGATTCGGCAAAAGCAAGGTCTACTATGGAA
TCTGAATTCATAGCTTTGGACAAGGCAGGGGAAGAAGTCGAATGGTTGCGTAACTTTTTGGAAGATATTCCTAATTGGACTAAACCTGTACCCCCAATATGTATACATTG
TGATAGTCAGGCTGCTATTGGAAGAGCGCAAAACTTGATGTATAATGGTAAGTCTCGACATATACGTCGGAGACATAATACCATAAGACAATTGCTCTCGAATGGAATTA
TATCTATTGATTGCCTGGGGTGGTCGGCGGTGACGGCGCCTGAGACAGCCCGTTTGGATTTTCGTCGCCTCAAATTCAGTTCGCAGCTCAAATTCACAAAACCCTATTTC
CCCTCCGCCATTAGAAGATTTTTCCACCGCTTCAACGAAATGCAACGTGGTGCGCAAAGGAGGATTCAGGTTGTCAAAACCAAGATCAAGAATGTGAAGAAACCTAATAT
TCTTGAGGTTTCTTCTCCTTCTACTGCTAGTCTCTCTGCTAGTGCTAGAATCGGAGTCAGTATCCGTGGTTCAATCGGCTCTGAGACGAAGAGAGAGCTAGGGAAGAGTG
TGGTTCGGTGGATTGGGAAGGCCATGCGAGCTGTGGCCTCTGATTTTGCTTCTACGGAGGATCAGGGCGATTTCTCTAAGCTCCGGTAG
mRNA sequence
ATGTCTTCAATAATACCGAAGACACCGAGTCCCATACCACAAGGTGGTGGAAATCATGGGGAAAAGCCTGAGAAGTTTAATGGCAGTGACTTCAAACGTTGGCAACAAAA
GATGTTGTTCTATCTCACAACATTGAGTCTGGCAAGATTTCTCAAAGAAGATGCTCCTGTTCTGGATGAAGGAGAGACCAACAAAGAAAAGTTGCTGGCACTTGAGGCAT
GGAAACATACAGAATATTTGTGCAGGAACTATATCTTGAATGGACTGGAAAACACACTATATAATGTTTATAGTGGGGTTGATTCGGCAAAAGCAAGGTCTACTATGGAA
TCTGAATTCATAGCTTTGGACAAGGCAGGGGAAGAAGTCGAATGGTTGCGTAACTTTTTGGAAGATATTCCTAATTGGACTAAACCTGTACCCCCAATATGTATACATTG
TGATAGTCAGGCTGCTATTGGAAGAGCGCAAAACTTGATGTATAATGGTAAGTCTCGACATATACGTCGGAGACATAATACCATAAGACAATTGCTCTCGAATGGAATTA
TATCTATTGATTGCCTGGGGTGGTCGGCGGTGACGGCGCCTGAGACAGCCCGTTTGGATTTTCGTCGCCTCAAATTCAGTTCGCAGCTCAAATTCACAAAACCCTATTTC
CCCTCCGCCATTAGAAGATTTTTCCACCGCTTCAACGAAATGCAACGTGGTGCGCAAAGGAGGATTCAGGTTGTCAAAACCAAGATCAAGAATGTGAAGAAACCTAATAT
TCTTGAGGTTTCTTCTCCTTCTACTGCTAGTCTCTCTGCTAGTGCTAGAATCGGAGTCAGTATCCGTGGTTCAATCGGCTCTGAGACGAAGAGAGAGCTAGGGAAGAGTG
TGGTTCGGTGGATTGGGAAGGCCATGCGAGCTGTGGCCTCTGATTTTGCTTCTACGGAGGATCAGGGCGATTTCTCTAAGCTCCGGTAG
Protein sequence
MSSIIPKTPSPIPQGGGNHGEKPEKFNGSDFKRWQQKMLFYLTTLSLARFLKEDAPVLDEGETNKEKLLALEAWKHTEYLCRNYILNGLENTLYNVYSGVDSAKARSTME
SEFIALDKAGEEVEWLRNFLEDIPNWTKPVPPICIHCDSQAAIGRAQNLMYNGKSRHIRRRHNTIRQLLSNGIISIDCLGWSAVTAPETARLDFRRLKFSSQLKFTKPYF
PSAIRRFFHRFNEMQRGAQRRIQVVKTKIKNVKKPNILEVSSPSTASLSASARIGVSIRGSIGSETKRELGKSVVRWIGKAMRAVASDFASTEDQGDFSKLR
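
The relationship between the CDS and protein sequences listed above can be checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies; the function name translate is hypothetical, and only the first and last few codons of the CDS are used here for brevity.

```python
# Standard genetic code, with codons ordered by base in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)

# First 24 nt and last 9 nt of the CDS given in this record.
cds_start = "ATGTCTTCAATAATACCGAAGACA"
cds_end = "CTCCGGTAG"

print(translate(cds_start))  # MSSIIPKT
print(translate(cds_end))    # LR
```

Both fragments agree with the start (MSSIIPKT) and end (LR) of the protein sequence; running translate on the full concatenated CDS lines would be the complete check.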