CuGenDBv2

Lag0008229 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0008229
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Integrase catalytic domain-containing protein
Genome location: chr9:15303313..15304502
RNA-Seq Expression: Lag0008229
Synteny: Lag0008229
Gene Ontology terms: NA
InterPro domains: NA
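The genome location field uses a `chromosome:start..end` layout. As a minimal sketch, the helper below (`parse_location` is a hypothetical name, not part of CuGenDB) splits that string and reports the span length, assuming the coordinates are 1-based and inclusive; the record itself does not state the convention.

```python
import re

def parse_location(loc):
    """Split 'chr:start..end' into (chrom, start, end, span_length).

    Assumes 1-based inclusive coordinates, which is an assumption
    and is not stated by the record itself.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = m.group(1)
    start = int(m.group(2))
    end = int(m.group(3))
    return chrom, start, end, end - start + 1

print(parse_location("chr9:15303313..15304502"))
```

Under that assumption the gene spans 1190 bp on chr9.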


Homology
GenBank top hits | e value | % identity | Alignment
KAA0026100.1 uncharacterized protein E6C27_scaffold19G00360 [Cucumis melo var. makuwa] | 2.2e-59 | 52.1
Query:  GPSCPPLNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEEAEAGVT------------SSEAGTSEAAD-----SSSPTME-VNPLYESWLAIDQLLL
        G S PPLNQ+LNQ+ ++KLDR N+LLWK LALPIL+ YKLE    G T            S+   T E AD     SSS T   VN L+E W+  D LLL
Subjt:  GPSCPPLNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEEAEAGVT------------SSEAGTSEAAD-----SSSPTME-VNPLYESWLAIDQLLL

Query:  GWLYNSMAPKVATQVMGFENAKELWEAIQEMFGVQSRAEEDYLRQVFQQCRKGSLKMIDYLRVMKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVA
        GWLYNSM P VA Q+MGF N ++LW+A Q+ FGVQSRAEED+LRQ+ Q  RKG+ KM +YL VMK + DNLGQ GSPV  R+LISQVLLGLDE +N V+ 
Subjt:  GWLYNSMAPKVATQVMGFENAKELWEAIQEMFGVQSRAEEDYLRQVFQQCRKGSLKMIDYLRVMKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVA

Query:  MIQGRSGISWSEMQAKLLVFEKRLALQNSLKTV----SLSQGTSVNMASSKESGNQRNFNGN---NNNRQPSYG-RGNQRGGESEQ
        +IQG+  ISW +MQ+KLL+FEK L  QN+ K      +++Q  ++NMA       QRN +       NRQ   G RGN   G + Q
Subjt:  MIQGRSGISWSEMQAKLLVFEKRLALQNSLKTV----SLSQGTSVNMASSKESGNQRNFNGN---NNNRQPSYG-RGNQRGGESEQ

XP_022148963.1 uncharacterized protein LOC111017501 [Momordica charantia] | 2.0e-55 | 71.25
Query:  LEEAEAGVTSSEAGTSEAADSSSPT-MEVNPLYESWLAIDQLLLGWLYNSMAPKVATQVMGFENAKELWEAIQEMFGVQSRAEEDYLRQVFQQCRKGSLK
        ++++   + +S+   S  + SS  T   +NPLYESW+  DQLLLGWLYNSM P+VATQVMG+ENA +LW AIQE+FGVQS+AEEDYLRQVFQQ RKGSLK
Subjt:  LEEAEAGVTSSEAGTSEAADSSSPT-MEVNPLYESWLAIDQLLLGWLYNSMAPKVATQVMGFENAKELWEAIQEMFGVQSRAEEDYLRQVFQQCRKGSLK

Query:  MIDYLRVMKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGRSGISWSEMQAK
        M D+LRVMK+HADNLGQAGSPV TRSLISQVLLGLDEE+NPVVA IQG+ GISW EMQA+
Subjt:  MIDYLRVMKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGRSGISWSEMQAK

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia] | 4.3e-71 | 56.47
Query:  PPLNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEEAEAG---------VTSSEAGTSEAADSSSPTMEVNPLYESWLAIDQLLLGWLYNSMAPKVAT
        PPLNQLLNQ+TSIK+DRGNFLLW+NLALPILRSYKL +   G         V +      E + SS  +  +NP YE+W+ +D+LLLGWLYNSMA  VA 
Subjt:  PPLNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEEAEAG---------VTSSEAGTSEAADSSSPTMEVNPLYESWLAIDQLLLGWLYNSMAPKVAT

Query:  QVMGFENAKELWEAIQEMFGVQSRAEEDYLRQVFQQCRKGSLKMIDYLRVMKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGRSGISWSEM
        QVMGF  ++ELW A+QE+FGVQSRAE DYL+QVFQQ  KGSL+MI+YL++MK+HADNL  AGS VS R L+SQVL GLDEE+NP+V  +QG+  +SWSEM
Subjt:  QVMGFENAKELWEAIQEMFGVQSRAEEDYLRQVFQQCRKGSLKMIDYLRVMKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGRSGISWSEM

Query:  QAKLLVFEKRLALQNSLKT---VSLSQGTSVNMASSKE-SGNQRNFNGNNNNRQPSYGRGNQRGGESEQGGLRGCRRR
         A+LL +EKRL  QNSLK+   ++ +Q  SVN    +    NQR  NGNN     S+G    RGG  ++G   G R R
Subjt:  QAKLLVFEKRLALQNSLKT---VSLSQGTSVNMASSKE-SGNQRNFNGNNNNRQPSYGRGNQRGGESEQGGLRGCRRR

XP_038902487.1 uncharacterized protein LOC120089143 [Benincasa hispida] | 7.2e-58 | 51.89
Query:  TSIKLDRGNFLLWKNLALPILRSYKLE--------------------------EAEAGVTSSEAGTSE--------AADSSSPTMEVNPLYESWLAIDQL
        T+IKLD+ N+LLW+NLALPILRSY+LE                            EAG+    +G +          A +SSP ++VNP YES   +DQL
Subjt:  TSIKLDRGNFLLWKNLALPILRSYKLE--------------------------EAEAGVTSSEAGTSE--------AADSSSPTMEVNPLYESWLAIDQL

Query:  LLGWLYNSMAPKVATQVMGFENAKELWEAIQEMFGVQSRAEEDYLRQVFQQCRKGSLKMIDYLRVMKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPV
        LLGWLYN M  +VA QVMG+EN K LW AIQE+FG+QSRA EDYLRQVFQQ  KG++KM +YLRVMK H+DNLG  GSPV TR+L+SQVLLGLDEEFNP 
Subjt:  LLGWLYNSMAPKVATQVMGFENAKELWEAIQEMFGVQSRAEEDYLRQVFQQCRKGSLKMIDYLRVMKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPV

Query:  VAMIQGRSGISWSEMQAKLLVFEKRLALQNSLKTVSLSQGTSVNMASSKESGNQRNFNGNNNNR
        VA IQGRS ISW+ MQ +LL FEKR    N+ +              S  +  +R +N N+ NR
Subjt:  VAMIQGRSGISWSEMQAKLLVFEKRLALQNSLKTVSLSQGTSVNMASSKESGNQRNFNGNNNNR

XP_038904321.1 uncharacterized protein LOC120090675 [Benincasa hispida] | 1.5e-52 | 64.24
Query:  SSEAGTSEAADSSSPTMEVNPLYESWLAIDQLLLGWLYNSMAPKVATQVMGFENAKELWEAIQEMFGVQSRAEEDYLRQVFQQCRKGSLKMIDYLRVMKN
        S+E  +   A SSS T+EVNP Y +W+A+DQLLLGWLYNSM PK+A QVMGFE A++LW  IQ++FG+QSRAEEDYLR VFQ  RKG+LKM DYLR MK 
Subjt:  SSEAGTSEAADSSSPTMEVNPLYESWLAIDQLLLGWLYNSMAPKVATQVMGFENAKELWEAIQEMFGVQSRAEEDYLRQVFQQCRKGSLKMIDYLRVMKN

Query:  HADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGRSGISWSEMQAKLLVFEKRLALQNSLK
        + DNL QAGSPV  R+L+ QVLLGLDEE+N +VA IQGR+ +SW +MQ+KLL++E+RL  Q++ K
Subjt:  HADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGRSGISWSEMQAKLLVFEKRLALQNSLK

TrEMBL top hits | e value | % identity | Alignment
A0A5A7SIT7 Uncharacterized protein | 1.1e-59 | 52.1
Query:  GPSCPPLNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEEAEAGVT------------SSEAGTSEAAD-----SSSPTME-VNPLYESWLAIDQLLL
        G S PPLNQ+LNQ+ ++KLDR N+LLWK LALPIL+ YKLE    G T            S+   T E AD     SSS T   VN L+E W+  D LLL
Subjt:  GPSCPPLNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEEAEAGVT------------SSEAGTSEAAD-----SSSPTME-VNPLYESWLAIDQLLL

Query:  GWLYNSMAPKVATQVMGFENAKELWEAIQEMFGVQSRAEEDYLRQVFQQCRKGSLKMIDYLRVMKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVA
        GWLYNSM P VA Q+MGF N ++LW+A Q+ FGVQSRAEED+LRQ+ Q  RKG+ KM +YL VMK + DNLGQ GSPV  R+LISQVLLGLDE +N V+ 
Subjt:  GWLYNSMAPKVATQVMGFENAKELWEAIQEMFGVQSRAEEDYLRQVFQQCRKGSLKMIDYLRVMKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVA

Query:  MIQGRSGISWSEMQAKLLVFEKRLALQNSLKTV----SLSQGTSVNMASSKESGNQRNFNGN---NNNRQPSYG-RGNQRGGESEQ
        +IQG+  ISW +MQ+KLL+FEK L  QN+ K      +++Q  ++NMA       QRN +       NRQ   G RGN   G + Q
Subjt:  MIQGRSGISWSEMQAKLLVFEKRLALQNSLKTV----SLSQGTSVNMASSKESGNQRNFNGN---NNNRQPSYG-RGNQRGGESEQ

A0A5D3BCH9 Uncharacterized protein | 4.4e-45 | 56.99
Query:  GPSCPPLNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEEAEAGVT------------SSEAGTSEAAD-----SSSPTME-VNPLYESWLAIDQLLL
        G S PPLNQ+LNQ+ ++KLDR N+LLWK LALPIL+ YKLE    G T            S+   T E AD     SSS T   VN L+E W+  D LLL
Subjt:  GPSCPPLNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEEAEAGVT------------SSEAGTSEAAD-----SSSPTME-VNPLYESWLAIDQLLL

Query:  GWLYNSMAPKVATQVMGFENAKELWEAIQEMFGVQSRAEEDYLRQVFQQCRKGSLKMIDYLRVMKNHADNLGQAGSPVSTRSLISQ
        GWLYNSM P VA Q+MGF N ++LW+A Q+ FGVQSRAEED+LRQ+ Q  RKG+ KM +YL VMK + DNLGQ GSPV  R+LISQ
Subjt:  GWLYNSMAPKVATQVMGFENAKELWEAIQEMFGVQSRAEEDYLRQVFQQCRKGSLKMIDYLRVMKNHADNLGQAGSPVSTRSLISQ

A0A5D3E3L7 Uncharacterized protein | 4.9e-44 | 50.45
Query:  LALPILRSYKLEEAE-AGVTSSEAGTSEAADSSSPTME---VNPLYESWLAIDQLLLGWLYNSMAPKVATQVMGFENAKELWEAIQEMFGVQSRAEEDYL
        L L  L S   E  E  GV +S   +     SSS +M    VNP YE W+  D LLLG +YNSM P VA Q+MGF  AK+LWEAIQ +FG++SRAEE +L
Subjt:  LALPILRSYKLEEAE-AGVTSSEAGTSEAADSSSPTME---VNPLYESWLAIDQLLLGWLYNSMAPKVATQVMGFENAKELWEAIQEMFGVQSRAEEDYL

Query:  RQVFQQCRKGSLKMIDYLRVMKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGRSGISWSEMQAKLLVFEKRLALQNSLKTVSLSQGTSVNM
        R  FQ  R+G+ KM DYLR+MK +ADNLGQAGSPV  R LISQVLLGLDE +NPV A+IQG+  ISW +MQ++LL+FE  + +      +   +  ++ M
Subjt:  RQVFQQCRKGSLKMIDYLRVMKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGRSGISWSEMQAKLLVFEKRLALQNSLKTVSLSQGTSVNM

Query:  ASSKESGNQRNFNGNNNNRQ
        A+       R FN N N +Q
Subjt:  ASSKESGNQRNFNGNNNNRQ

A0A6J1D5J0 uncharacterized protein LOC111017501 | 9.5e-56 | 71.25
Query:  LEEAEAGVTSSEAGTSEAADSSSPT-MEVNPLYESWLAIDQLLLGWLYNSMAPKVATQVMGFENAKELWEAIQEMFGVQSRAEEDYLRQVFQQCRKGSLK
        ++++   + +S+   S  + SS  T   +NPLYESW+  DQLLLGWLYNSM P+VATQVMG+ENA +LW AIQE+FGVQS+AEEDYLRQVFQQ RKGSLK
Subjt:  LEEAEAGVTSSEAGTSEAADSSSPT-MEVNPLYESWLAIDQLLLGWLYNSMAPKVATQVMGFENAKELWEAIQEMFGVQSRAEEDYLRQVFQQCRKGSLK

Query:  MIDYLRVMKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGRSGISWSEMQAK
        M D+LRVMK+HADNLGQAGSPV TRSLISQVLLGLDEE+NPVVA IQG+ GISW EMQA+
Subjt:  MIDYLRVMKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGRSGISWSEMQAK

A0A6J1DCW4 uncharacterized protein LOC111019598 | 2.1e-71 | 56.47
Query:  PPLNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEEAEAG---------VTSSEAGTSEAADSSSPTMEVNPLYESWLAIDQLLLGWLYNSMAPKVAT
        PPLNQLLNQ+TSIK+DRGNFLLW+NLALPILRSYKL +   G         V +      E + SS  +  +NP YE+W+ +D+LLLGWLYNSMA  VA 
Subjt:  PPLNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEEAEAG---------VTSSEAGTSEAADSSSPTMEVNPLYESWLAIDQLLLGWLYNSMAPKVAT

Query:  QVMGFENAKELWEAIQEMFGVQSRAEEDYLRQVFQQCRKGSLKMIDYLRVMKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGRSGISWSEM
        QVMGF  ++ELW A+QE+FGVQSRAE DYL+QVFQQ  KGSL+MI+YL++MK+HADNL  AGS VS R L+SQVL GLDEE+NP+V  +QG+  +SWSEM
Subjt:  QVMGFENAKELWEAIQEMFGVQSRAEEDYLRQVFQQCRKGSLKMIDYLRVMKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGRSGISWSEM

Query:  QAKLLVFEKRLALQNSLKT---VSLSQGTSVNMASSKE-SGNQRNFNGNNNNRQPSYGRGNQRGGESEQGGLRGCRRR
         A+LL +EKRL  QNSLK+   ++ +Q  SVN    +    NQR  NGNN     S+G    RGG  ++G   G R R
Subjt:  QAKLLVFEKRLALQNSLKT---VSLSQGTSVNMASSKE-SGNQRNFNGNNNNRQPSYGRGNQRGGESEQGGLRGCRRR

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | 4.0e-06 | 22.62
Query:  SIKLDRGNFLLWKNLALPILRSYKLEEAEAGVTSSEAGTSEAADSSSPTMEVNPLYES-WLAIDQLLLGWLYNSMAPKVATQVMGFE-NAKELWEAIQEM
        ++ L++ N+ +W+ L   +  S+       GV     G      SS+PT    P+ E  W   D L+  W+Y ++   +   ++     A++LW +++ +
Subjt:  SIKLDRGNFLLWKNLALPILRSYKLEEAEAGVTSSEAGTSEAADSSSPTMEVNPLYES-WLAIDQLLLGWLYNSMAPKVATQVMGFE-NAKELWEAIQEM

Query:  FGVQSRAEEDYLRQVFQQCRKGSLKMIDYLRVMKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGRSGI-SWSEMQAKLLVFEKRLALQNSL
        F     A         +      L + +Y + +K+ +D L    SP+S R L+  +L GL E+++ ++ +I+ +S   S++E ++ LL+ E RL+ ++  
Subjt:  FGVQSRAEEDYLRQVFQQCRKGSLKMIDYLRVMKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGRSGI-SWSEMQAKLLVFEKRLALQNSL

Query:  KTVSLSQGTSVNMASSKESGNQR---NFNGNNNNRQPSYGRGNQRGGESEQG
             +  +  N+  +     +R    ++ NN+N      +   RGG S  G
Subjt:  KTVSLSQGTSVNMASSKESGNQR---NFNGNNNNRQPSYGRGNQRGGESEQG


Sequences
CDS sequence
ATGCCGCCTTTGACGTCTCGGCATTGGCGATGGTCTCAAACGTGTCTTCATCCACCGATTGGTACAAGATATAGAGGGCCTTCTTGTCCTCCTCTTAATCAACTTTTGAA
TCAGGTGACATCCATAAAGCTAGATCGAGGAAACTTTCTACTATGGAAGAATCTAGCCCTTCCAATCCTTCGGAGTTATAAACTCGAAGAAGCAGAAGCTGGTGTCACAA
GTTCTGAAGCAGGTACCAGTGAAGCAGCTGACTCTTCCTCCCCTACCATGGAAGTAAATCCATTGTATGAGTCCTGGCTTGCAATTGATCAACTGTTATTAGGTTGGTTA
TACAATTCAATGGCTCCTAAGGTTGCAACTCAGGTGATGGGATTTGAAAATGCCAAGGAACTGTGGGAGGCTATTCAGGAAATGTTTGGAGTCCAATCAAGAGCGGAAGA
AGACTATCTTCGCCAGGTATTTCAGCAGTGTAGGAAAGGTTCGTTAAAAATGATTGATTATCTAAGGGTTATGAAAAACCATGCTGACAATTTGGGGCAAGCTGGTAGCC
CAGTTAGTACTCGATCGTTGATTTCGCAGGTACTTTTGGGTCTTGATGAGGAGTTCAATCCAGTAGTTGCCATGATTCAAGGGAGGTCTGGCATCTCGTGGTCAGAAATG
CAAGCGAAATTGCTTGTATTCGAGAAGCGGTTGGCGTTGCAAAACAGCTTGAAAACTGTATCTCTGAGTCAAGGAACGTCTGTGAATATGGCAAGCAGTAAAGAAAGTGG
CAATCAGAGAAACTTCAATGGCAACAATAACAATCGACAACCAAGTTATGGCAGAGGAAATCAAAGAGGAGGAGAATCAGAACAGGGCGGACTCAGGGGTTGTCGGAGGC
GGATGTTGATGGTGGCGAACAAACGGACGACGGCGAAGAGGCGGCCGACGGCGAAGAGTTGGTCAGAACAACAACGGTGGAACTTCGCTTGA
mRNA sequence
ATGCCGCCTTTGACGTCTCGGCATTGGCGATGGTCTCAAACGTGTCTTCATCCACCGATTGGTACAAGATATAGAGGGCCTTCTTGTCCTCCTCTTAATCAACTTTTGAA
TCAGGTGACATCCATAAAGCTAGATCGAGGAAACTTTCTACTATGGAAGAATCTAGCCCTTCCAATCCTTCGGAGTTATAAACTCGAAGAAGCAGAAGCTGGTGTCACAA
GTTCTGAAGCAGGTACCAGTGAAGCAGCTGACTCTTCCTCCCCTACCATGGAAGTAAATCCATTGTATGAGTCCTGGCTTGCAATTGATCAACTGTTATTAGGTTGGTTA
TACAATTCAATGGCTCCTAAGGTTGCAACTCAGGTGATGGGATTTGAAAATGCCAAGGAACTGTGGGAGGCTATTCAGGAAATGTTTGGAGTCCAATCAAGAGCGGAAGA
AGACTATCTTCGCCAGGTATTTCAGCAGTGTAGGAAAGGTTCGTTAAAAATGATTGATTATCTAAGGGTTATGAAAAACCATGCTGACAATTTGGGGCAAGCTGGTAGCC
CAGTTAGTACTCGATCGTTGATTTCGCAGGTACTTTTGGGTCTTGATGAGGAGTTCAATCCAGTAGTTGCCATGATTCAAGGGAGGTCTGGCATCTCGTGGTCAGAAATG
CAAGCGAAATTGCTTGTATTCGAGAAGCGGTTGGCGTTGCAAAACAGCTTGAAAACTGTATCTCTGAGTCAAGGAACGTCTGTGAATATGGCAAGCAGTAAAGAAAGTGG
CAATCAGAGAAACTTCAATGGCAACAATAACAATCGACAACCAAGTTATGGCAGAGGAAATCAAAGAGGAGGAGAATCAGAACAGGGCGGACTCAGGGGTTGTCGGAGGC
GGATGTTGATGGTGGCGAACAAACGGACGACGGCGAAGAGGCGGCCGACGGCGAAGAGTTGGTCAGAACAACAACGGTGGAACTTCGCTTGA
Protein sequence
MPPLTSRHWRWSQTCLHPPIGTRYRGPSCPPLNQLLNQVTSIKLDRGNFLLWKNLALPILRSYKLEEAEAGVTSSEAGTSEAADSSSPTMEVNPLYESWLAIDQLLLGWL
YNSMAPKVATQVMGFENAKELWEAIQEMFGVQSRAEEDYLRQVFQQCRKGSLKMIDYLRVMKNHADNLGQAGSPVSTRSLISQVLLGLDEEFNPVVAMIQGRSGISWSEM
QAKLLVFEKRLALQNSLKTVSLSQGTSVNMASSKESGNQRNFNGNNNNRQPSYGRGNQRGGESEQGGLRGCRRRMLMVANKRTTAKRRPTAKSWSEQQRWNFA
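
The CDS and protein records above can be cross-checked by translating the nucleotide sequence. The sketch below is an illustration only: `translate` is a hypothetical helper, and it assumes the standard nuclear genetic code, frame 0, and that translation ends at the first stop codon. It is applied to the first 36 bases and the last 12 bases of the CDS as listed in this record.

```python
# Standard genetic code, codons ordered with bases T, C, A, G
# at the first, second and third positions (assumed here; the
# record does not say which translation table was used).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 0 up to the first stop."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":      # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)

cds_start = "ATGCCGCCTTTGACGTCTCGGCATTGGCGATGGTCT"  # first 12 codons
cds_end = "AACTTCGCTTGA"                             # last 4 codons

print(translate(cds_start))  # MPPLTSRHWRWS
print(translate(cds_end))    # NFA
```

Both fragments agree with the protein record, which begins with MPPLTSRHWRWS and ends with NFA. Passing the full CDS, with line breaks included, to `translate` would extend the same check to the whole sequence.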