CuGenDBv2

Lag0028444 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0028444
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Integrase, catalytic core
Genome location: chr8:21874710..21875447
RNA-Seq Expression: Lag0028444
Synteny: Lag0028444
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits    e value    %identity    Alignment
KAA0026100.1 uncharacterized protein E6C27_scaffold19G00360 [Cucumis melo var. makuwa]    2.1e-65    56.73
Query:  MAHATFSSSQTITSSANHVFSSPLLNQLLNQITSIKLDRGNYLLWKNLALPILRSYRLEGHLSGEKACPPKFLSTATAAVTNVVDSSSEALDFETIDGGS
        MA+A  +++    SSA   FS+P LNQ+LNQ+ ++KLDR NYLLWK LALPIL+ Y+LEGHL+GE  CP  F+ +A+++ T V +  ++A    TI G S
Subjt:  MAHATFSSSQTITSSANHVFSSPLLNQLLNQITSIKLDRGNYLLWKNLALPILRSYRLEGHLSGEKACPPKFLSTATAAVTNVVDSSSEALDFETIDGGS

Query:  SSSSTMISTVNPKYESWLVIDQLLLGWLYNSMTPEIATQVMGYENSKDLWAAIQELFGVQSRAEEDYLRQVFQQTRKCSLKMVDYLSVMKTHADNLGQAG
        SS +  I  VN  +E W+  D LLLGWLYNSMTP++A Q+MG+ N +DLW A Q+ FGVQSRAEED+LRQ+ Q TRK + KM +YL VMKT+ DNLGQ G
Subjt:  SSSSTMISTVNPKYESWLVIDQLLLGWLYNSMTPEIATQVMGYENSKDLWAAIQELFGVQSRAEEDYLRQVFQQTRKCSLKMVDYLSVMKTHADNLGQAG

Query:  SPVSNRSLISQVLLGLDEEYNPVVAMIQGRVNISWSEIQAELLVF
        SPV  R+LISQVLLGLDE YN V+ +IQG+ +ISW ++Q++LL+F
Subjt:  SPVSNRSLISQVLLGLDEEYNPVVAMIQGRVNISWSEIQAELLVF

KAA0067279.1 uncharacterized protein E6C27_scaffold418G001000 [Cucumis melo var. makuwa]    2.4e-53    54
Query:  TITSSANHVFSSPLLNQLLNQITSIKLDRGNYLLWKNLALPILRSYRLEGHLSGEKACPPKFLSTATAAVTNVVDSSSEALDFETIDGGSSSSSTMISTV
        ++T++    F++PLLNQ+LNQ+T+IKLDRGNYLLWK LALPIL+SY+L  HL GE  C PK +   T    ++V+++ E          +SSSST + TV
Subjt:  TITSSANHVFSSPLLNQLLNQITSIKLDRGNYLLWKNLALPILRSYRLEGHLSGEKACPPKFLSTATAAVTNVVDSSSEALDFETIDGGSSSSSTMISTV

Query:  NPKYESWLVIDQLLLGWLYNSMTPEIATQVMGYENSKDLWAAIQELFGVQSRAEEDYLRQVFQQTRKCSLKMVDYLSVMKTHADNLGQAGSPVSNRSLIS
        NPKYE W+  D LLLGWLYNSMTPE+  Q+MG+ N+KDLW A Q+LFG+QSRA+ED+L Q FQ T+K +L M +YL  MK + +NLGQA S V + +++S
Subjt:  NPKYESWLVIDQLLLGWLYNSMTPEIATQVMGYENSKDLWAAIQELFGVQSRAEEDYLRQVFQQTRKCSLKMVDYLSVMKTHADNLGQAGSPVSNRSLIS

XP_022148963.1 uncharacterized protein LOC111017501 [Momordica charantia]    1.7e-56    78.77
Query:  IDGGSSSSSTMISTVNPKYESWLVIDQLLLGWLYNSMTPEIATQVMGYENSKDLWAAIQELFGVQSRAEEDYLRQVFQQTRKCSLKMVDYLSVMKTHADN
        I   SSSS    + +NP YESW+  DQLLLGWLYNSMTPE+ATQVMGYEN+ DLWAAIQELFGVQS+AEEDYLRQVFQQTRK SLKM D+L VMK+HADN
Subjt:  IDGGSSSSSTMISTVNPKYESWLVIDQLLLGWLYNSMTPEIATQVMGYENSKDLWAAIQELFGVQSRAEEDYLRQVFQQTRKCSLKMVDYLSVMKTHADN

Query:  LGQAGSPVSNRSLISQVLLGLDEEYNPVVAMIQGRVNISWSEIQAE
        LGQAGSPV  RSLISQVLLGLDEEYNPVVA IQG+  ISW E+QAE
Subjt:  LGQAGSPVSNRSLISQVLLGLDEEYNPVVAMIQGRVNISWSEIQAE

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia]    5.6e-71    62.11
Query:  VFSSPLLNQLLNQITSIKLDRGNYLLWKNLALPILRSYRLEGHLSGEKACPPKFLSTATAAVTNVVDSSSEALDFETIDGGSSSSSTMISTVNPKYESWL
        VF+SP LNQLLNQITSIK+DRGN+LLW+NLALPILRSY+L  +L+G+K CPP  L   T   TN             I+G +SS S+   T+NP YE+W+
Subjt:  VFSSPLLNQLLNQITSIKLDRGNYLLWKNLALPILRSYRLEGHLSGEKACPPKFLSTATAAVTNVVDSSSEALDFETIDGGSSSSSTMISTVNPKYESWL

Query:  VIDQLLLGWLYNSMTPEIATQVMGYENSKDLWAAIQELFGVQSRAEEDYLRQVFQQTRKCSLKMVDYLSVMKTHADNLGQAGSPVSNRSLISQVLLGLDE
        V+D+LLLGWLYNSM  ++A QVMG+  S++LW A+QELFGVQSRAE DYL+QVFQQT K SL+M++YL +MK+HADNL  AGS VS R L+SQVL GLDE
Subjt:  VIDQLLLGWLYNSMTPEIATQVMGYENSKDLWAAIQELFGVQSRAEEDYLRQVFQQTRKCSLKMVDYLSVMKTHADNLGQAGSPVSNRSLISQVLLGLDE

Query:  EYNPVVAMIQGRVNISWSEIQAELLVF
        EYNP+V  +QG+VN+SWSE+ AELL +
Subjt:  EYNPVVAMIQGRVNISWSEIQAELLVF

XP_038902487.1 uncharacterized protein LOC120089143 [Benincasa hispida]    2.4e-66    64.13
Query:  TSIKLDRGNYLLWKNLALPILRSYRLEGHLSGEKACPPKFLSTATAAVTNVVDSSSEA--------LDFETIDGG--SSSSSTMISTVNPKYESWLVIDQ
        T+IKLD+ NYLLW+NLALPILRSYRLEGHL+GE  CPP+F S AT   T  V    EA        +   T   G  ++S+S+ +  VNP YES  V+DQ
Subjt:  TSIKLDRGNYLLWKNLALPILRSYRLEGHLSGEKACPPKFLSTATAAVTNVVDSSSEA--------LDFETIDGG--SSSSSTMISTVNPKYESWLVIDQ

Query:  LLLGWLYNSMTPEIATQVMGYENSKDLWAAIQELFGVQSRAEEDYLRQVFQQTRKCSLKMVDYLSVMKTHADNLGQAGSPVSNRSLISQVLLGLDEEYNP
        LLLGWLYN MT E+A QVMGYEN K LWAAIQELFG+QSRA EDYLRQVFQQT K ++KM +YL VMKTH+DNLG  GSPV  R+L+SQVLLGLDEE+NP
Subjt:  LLLGWLYNSMTPEIATQVMGYENSKDLWAAIQELFGVQSRAEEDYLRQVFQQTRKCSLKMVDYLSVMKTHADNLGQAGSPVSNRSLISQVLLGLDEEYNP

Query:  VVAMIQGRVNISWSEIQAELLVF
         VA IQGR  ISW+ +Q ELL F
Subjt:  VVAMIQGRVNISWSEIQAELLVF

TrEMBL top hits    e value    %identity    Alignment
A0A5A7SIT7 Uncharacterized protein    1.0e-65    56.73
Query:  MAHATFSSSQTITSSANHVFSSPLLNQLLNQITSIKLDRGNYLLWKNLALPILRSYRLEGHLSGEKACPPKFLSTATAAVTNVVDSSSEALDFETIDGGS
        MA+A  +++    SSA   FS+P LNQ+LNQ+ ++KLDR NYLLWK LALPIL+ Y+LEGHL+GE  CP  F+ +A+++ T V +  ++A    TI G S
Subjt:  MAHATFSSSQTITSSANHVFSSPLLNQLLNQITSIKLDRGNYLLWKNLALPILRSYRLEGHLSGEKACPPKFLSTATAAVTNVVDSSSEALDFETIDGGS

Query:  SSSSTMISTVNPKYESWLVIDQLLLGWLYNSMTPEIATQVMGYENSKDLWAAIQELFGVQSRAEEDYLRQVFQQTRKCSLKMVDYLSVMKTHADNLGQAG
        SS +  I  VN  +E W+  D LLLGWLYNSMTP++A Q+MG+ N +DLW A Q+ FGVQSRAEED+LRQ+ Q TRK + KM +YL VMKT+ DNLGQ G
Subjt:  SSSSTMISTVNPKYESWLVIDQLLLGWLYNSMTPEIATQVMGYENSKDLWAAIQELFGVQSRAEEDYLRQVFQQTRKCSLKMVDYLSVMKTHADNLGQAG

Query:  SPVSNRSLISQVLLGLDEEYNPVVAMIQGRVNISWSEIQAELLVF
        SPV  R+LISQVLLGLDE YN V+ +IQG+ +ISW ++Q++LL+F
Subjt:  SPVSNRSLISQVLLGLDEEYNPVVAMIQGRVNISWSEIQAELLVF

A0A5A7VPY0 Uncharacterized protein    1.1e-53    54
Query:  TITSSANHVFSSPLLNQLLNQITSIKLDRGNYLLWKNLALPILRSYRLEGHLSGEKACPPKFLSTATAAVTNVVDSSSEALDFETIDGGSSSSSTMISTV
        ++T++    F++PLLNQ+LNQ+T+IKLDRGNYLLWK LALPIL+SY+L  HL GE  C PK +   T    ++V+++ E          +SSSST + TV
Subjt:  TITSSANHVFSSPLLNQLLNQITSIKLDRGNYLLWKNLALPILRSYRLEGHLSGEKACPPKFLSTATAAVTNVVDSSSEALDFETIDGGSSSSSTMISTV

Query:  NPKYESWLVIDQLLLGWLYNSMTPEIATQVMGYENSKDLWAAIQELFGVQSRAEEDYLRQVFQQTRKCSLKMVDYLSVMKTHADNLGQAGSPVSNRSLIS
        NPKYE W+  D LLLGWLYNSMTPE+  Q+MG+ N+KDLW A Q+LFG+QSRA+ED+L Q FQ T+K +L M +YL  MK + +NLGQA S V + +++S
Subjt:  NPKYESWLVIDQLLLGWLYNSMTPEIATQVMGYENSKDLWAAIQELFGVQSRAEEDYLRQVFQQTRKCSLKMVDYLSVMKTHADNLGQAGSPVSNRSLIS

A0A5D3BCH9 Uncharacterized protein    3.3e-53    56.4
Query:  MAHATFSSSQTITSSANHVFSSPLLNQLLNQITSIKLDRGNYLLWKNLALPILRSYRLEGHLSGEKACPPKFLSTATAAVTNVVDSSSEALDFETIDGGS
        MA+A  +++    SSA   FS+P LNQ+LNQ+ ++KLDR NYLLWK LALPIL+ Y+LEGHL+GE  CP  F+ +A+++ T V +  ++A    TI G S
Subjt:  MAHATFSSSQTITSSANHVFSSPLLNQLLNQITSIKLDRGNYLLWKNLALPILRSYRLEGHLSGEKACPPKFLSTATAAVTNVVDSSSEALDFETIDGGS

Query:  SSSSTMISTVNPKYESWLVIDQLLLGWLYNSMTPEIATQVMGYENSKDLWAAIQELFGVQSRAEEDYLRQVFQQTRKCSLKMVDYLSVMKTHADNLGQAG
        SS +  I  VN  +E W+  D LLLGWLYNSMTP++A Q+MG+ N +DLW A Q+ FGVQSRAEED+LRQ+ Q TRK + KM +YL VMKT+ DNLGQ G
Subjt:  SSSSTMISTVNPKYESWLVIDQLLLGWLYNSMTPEIATQVMGYENSKDLWAAIQELFGVQSRAEEDYLRQVFQQTRKCSLKMVDYLSVMKTHADNLGQAG

Query:  SPVSNRSLISQ
        SPV  R+LISQ
Subjt:  SPVSNRSLISQ

A0A6J1D5J0 uncharacterized protein LOC111017501    8.5e-57    78.77
Query:  IDGGSSSSSTMISTVNPKYESWLVIDQLLLGWLYNSMTPEIATQVMGYENSKDLWAAIQELFGVQSRAEEDYLRQVFQQTRKCSLKMVDYLSVMKTHADN
        I   SSSS    + +NP YESW+  DQLLLGWLYNSMTPE+ATQVMGYEN+ DLWAAIQELFGVQS+AEEDYLRQVFQQTRK SLKM D+L VMK+HADN
Subjt:  IDGGSSSSSTMISTVNPKYESWLVIDQLLLGWLYNSMTPEIATQVMGYENSKDLWAAIQELFGVQSRAEEDYLRQVFQQTRKCSLKMVDYLSVMKTHADN

Query:  LGQAGSPVSNRSLISQVLLGLDEEYNPVVAMIQGRVNISWSEIQAE
        LGQAGSPV  RSLISQVLLGLDEEYNPVVA IQG+  ISW E+QAE
Subjt:  LGQAGSPVSNRSLISQVLLGLDEEYNPVVAMIQGRVNISWSEIQAE

A0A6J1DCW4 uncharacterized protein LOC111019598    2.7e-71    62.11
Query:  VFSSPLLNQLLNQITSIKLDRGNYLLWKNLALPILRSYRLEGHLSGEKACPPKFLSTATAAVTNVVDSSSEALDFETIDGGSSSSSTMISTVNPKYESWL
        VF+SP LNQLLNQITSIK+DRGN+LLW+NLALPILRSY+L  +L+G+K CPP  L   T   TN             I+G +SS S+   T+NP YE+W+
Subjt:  VFSSPLLNQLLNQITSIKLDRGNYLLWKNLALPILRSYRLEGHLSGEKACPPKFLSTATAAVTNVVDSSSEALDFETIDGGSSSSSTMISTVNPKYESWL

Query:  VIDQLLLGWLYNSMTPEIATQVMGYENSKDLWAAIQELFGVQSRAEEDYLRQVFQQTRKCSLKMVDYLSVMKTHADNLGQAGSPVSNRSLISQVLLGLDE
        V+D+LLLGWLYNSM  ++A QVMG+  S++LW A+QELFGVQSRAE DYL+QVFQQT K SL+M++YL +MK+HADNL  AGS VS R L+SQVL GLDE
Subjt:  VIDQLLLGWLYNSMTPEIATQVMGYENSKDLWAAIQELFGVQSRAEEDYLRQVFQQTRKCSLKMVDYLSVMKTHADNLGQAGSPVSNRSLISQVLLGLDE

Query:  EYNPVVAMIQGRVNISWSEIQAELLVF
        EYNP+V  +QG+VN+SWSE+ AELL +
Subjt:  EYNPVVAMIQGRVNISWSEIQAELLVF

SwissProt top hits    e value    %identity    Alignment
No hits found

Arabidopsis top hits    e value    %identity    Alignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)    1.5e-05    25.98
Query:  TMISTVNPKYESWLVIDQLLLGWLYNSMTP-EIATQVMGYENSKDLWAAIQELFGVQSRAEEDYLRQVFQQTRKCSLKMVDYLSVMKTHADNLGQAGSPV
        T++ T N    +W   D ++   LY ++TP +     +    S+D+W  I+  F     A    L    +      +++ DY   MK  AD+L     PV
Subjt:  TMISTVNPKYESWLVIDQLLLGWLYNSMTP-EIATQVMGYENSKDLWAAIQELFGVQSRAEEDYLRQVFQQTRKCSLKMVDYLSVMKTHADNLGQAGSPV

Query:  SNRSLISQVLLGLDEEYNPVVAMIQGR
        ++R+L+  VL GL+ +++ ++ +I+ R
Subjt:  SNRSLISQVLLGLDEEYNPVVAMIQGR

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)    9.4e-08    26.43
Query:  STVNPKYES-WLVIDQLLLGWLYNSMTPEIATQVMGYE-NSKDLWAAIQELFGVQSRAEEDYLRQVFQQTRKCSLKMVDYLSVMKTHADNLGQAGSPVSN
        ST  P  E  W   D L+  W+Y ++T  +   ++     ++DLW +++ LF     A         + T    L + +Y   +K+ +D L    SP+S+
Subjt:  STVNPKYES-WLVIDQLLLGWLYNSMTPEIATQVMGYE-NSKDLWAAIQELFGVQSRAEEDYLRQVFQQTRKCSLKMVDYLSVMKTHADNLGQAGSPVSN

Query:  RSLISQVLLGLDEEYNPVVAMIQGRVNI-SWSEIQAELLV
        R L+  +L GL E+Y+ ++ +I+ +    S++E ++ LL+
Subjt:  RSLISQVLLGLDEEYNPVVAMIQGRVNI-SWSEIQAELLV


Sequences
CDS sequence
ATGGCTCACGCTACTTTTTCTTCAAGCCAAACTATAACCTCATCTGCAAACCATGTGTTTAGTAGTCCACTGCTAAATCAACTTTTAAATCAAATCACTTCTATTAAGCT
TGACAGAGGAAATTATTTGCTTTGGAAAAATCTTGCTCTCCCCATTCTTAGAAGTTATCGCTTGGAGGGTCATCTGTCCGGTGAGAAGGCCTGTCCACCCAAGTTTCTAT
CAACGGCAACCGCTGCAGTCACAAATGTTGTCGATTCAAGCTCCGAAGCATTGGATTTCGAGACTATTGATGGCGGCTCTAGCTCTTCGTCTACCATGATCTCTACTGTT
AATCCCAAGTATGAAAGTTGGCTTGTTATAGATCAACTTCTTCTTGGTTGGTTGTACAATTCTATGACTCCTGAAATTGCAACTCAGGTAATGGGATATGAAAATTCAAA
GGATCTTTGGGCTGCTATTCAAGAACTTTTTGGCGTTCAGTCTCGGGCAGAGGAAGATTACCTTCGACAGGTTTTTCAGCAAACTCGTAAGTGTTCTCTTAAAATGGTTG
ATTATTTGAGTGTCATGAAAACTCATGCAGATAACTTGGGGCAAGCTGGAAGTCCGGTTTCAAACCGTTCATTGATTTCCCAAGTCTTGTTGGGGTTAGATGAGGAATAC
AATCCAGTTGTGGCTATGATCCAGGGACGAGTGAACATTTCATGGTCTGAAATACAGGCAGAGTTGCTAGTTTTTTAG
mRNA sequence
ATGGCTCACGCTACTTTTTCTTCAAGCCAAACTATAACCTCATCTGCAAACCATGTGTTTAGTAGTCCACTGCTAAATCAACTTTTAAATCAAATCACTTCTATTAAGCT
TGACAGAGGAAATTATTTGCTTTGGAAAAATCTTGCTCTCCCCATTCTTAGAAGTTATCGCTTGGAGGGTCATCTGTCCGGTGAGAAGGCCTGTCCACCCAAGTTTCTAT
CAACGGCAACCGCTGCAGTCACAAATGTTGTCGATTCAAGCTCCGAAGCATTGGATTTCGAGACTATTGATGGCGGCTCTAGCTCTTCGTCTACCATGATCTCTACTGTT
AATCCCAAGTATGAAAGTTGGCTTGTTATAGATCAACTTCTTCTTGGTTGGTTGTACAATTCTATGACTCCTGAAATTGCAACTCAGGTAATGGGATATGAAAATTCAAA
GGATCTTTGGGCTGCTATTCAAGAACTTTTTGGCGTTCAGTCTCGGGCAGAGGAAGATTACCTTCGACAGGTTTTTCAGCAAACTCGTAAGTGTTCTCTTAAAATGGTTG
ATTATTTGAGTGTCATGAAAACTCATGCAGATAACTTGGGGCAAGCTGGAAGTCCGGTTTCAAACCGTTCATTGATTTCCCAAGTCTTGTTGGGGTTAGATGAGGAATAC
AATCCAGTTGTGGCTATGATCCAGGGACGAGTGAACATTTCATGGTCTGAAATACAGGCAGAGTTGCTAGTTTTTTAG
Protein sequence
MAHATFSSSQTITSSANHVFSSPLLNQLLNQITSIKLDRGNYLLWKNLALPILRSYRLEGHLSGEKACPPKFLSTATAAVTNVVDSSSEALDFETIDGGSSSSSTMISTV
NPKYESWLVIDQLLLGWLYNSMTPEIATQVMGYENSKDLWAAIQELFGVQSRAEEDYLRQVFQQTRKCSLKMVDYLSVMKTHADNLGQAGSPVSNRSLISQVLLGLDEEY
NPVVAMIQGRVNISWSEIQAELLVF