; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039624 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039624
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase, catalytic core
Genome locationchr2:47510943..47511725
RNA-Seq ExpressionLag0039624
SyntenyLag0039624
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026100.1 uncharacterized protein E6C27_scaffold19G00360 [Cucumis melo var. makuwa]4.1e-7260.96Show/hide
Query:  MANAAQNTSINSSANPTFSSPPLNQLLNQITSIKLDRSNYMLWKNLALPILRSYKLEGHLTGKTPCPPKFTQDVVEEFTGVTSSETTVGATLEASSSTTT
        MANA    +  S ++  FS+PPLNQ+LNQ+ ++KLDR NY+LWK LALPIL+ YKLEGHLTG+TPCP  F        T VT  E    AT+ ASSS T 
Subjt:  MANAAQNTSINSSANPTFSSPPLNQLLNQITSIKLDRSNYMLWKNLALPILRSYKLEGHLTGKTPCPPKFTQDVVEEFTGVTSSETTVGATLEASSSTTT

Query:  TIKTSNPQYELWVVTDQLLLGWLYNSMTAEVATQVMGHENAKDLWKAIQEIFGVQSRAEEDFLRQVFQQTRKGSLKMAEYLRTMKTHADNLGQAGSPVSA
         I   N  +E WV TD LLLGWLYNSMT +VA Q+MG  N +DLW A Q+ FGVQSRAEEDFLRQ+ Q TRKG+ KM EYL  MKT+ DNLGQ GSPV  
Subjt:  TIKTSNPQYELWVVTDQLLLGWLYNSMTAEVATQVMGHENAKDLWKAIQEIFGVQSRAEEDFLRQVFQQTRKGSLKMAEYLRTMKTHADNLGQAGSPVSA

Query:  RSLISQVLLGLDEEYNPVVAMIQGKTEISWAEMQGELLAFEKRLELQNAHK
        R+LISQVLLGLDE YN V+ +IQGK +ISW +MQ +LL FEK L+ QN  K
Subjt:  RSLISQVLLGLDEEYNPVVAMIQGKTEISWAEMQGELLAFEKRLELQNAHK

TYJ96311.1 uncharacterized protein E5676_scaffold1970G00140 [Cucumis melo var. makuwa]3.2e-5660.68Show/hide
Query:  MANAAQNTSINSSANPTFSSPPLNQLLNQITSIKLDRSNYMLWKNLALPILRSYKLEGHLTGKTPCPPKFTQDVVEEFTGVTSSETTVGATLEASSSTTT
        MANA    +  S ++  FS+PPLNQ+LNQ+ ++KLDR NY+LWK LALPIL+ YKLEGHLTG+TPCP  F        T VT  E    AT+ ASSS T 
Subjt:  MANAAQNTSINSSANPTFSSPPLNQLLNQITSIKLDRSNYMLWKNLALPILRSYKLEGHLTGKTPCPPKFTQDVVEEFTGVTSSETTVGATLEASSSTTT

Query:  TIKTSNPQYELWVVTDQLLLGWLYNSMTAEVATQVMGHENAKDLWKAIQEIFGVQSRAEEDFLRQVFQQTRKGSLKMAEYLRTMKTHADNLGQAGSPVSA
         I   N  +E WV TD LLLGWLYNSMT +VA Q+MG  N +DLW A Q+ FGVQSRAEEDFLRQ+ Q TRKG+ KM EYL  MKT+ DNLGQ GSPV  
Subjt:  TIKTSNPQYELWVVTDQLLLGWLYNSMTAEVATQVMGHENAKDLWKAIQEIFGVQSRAEEDFLRQVFQQTRKGSLKMAEYLRTMKTHADNLGQAGSPVSA

Query:  RSLISQ
        R+LISQ
Subjt:  RSLISQ

XP_016902203.1 PREDICTED: uncharacterized protein LOC107991581 isoform X3 [Cucumis melo]8.6e-5451.39Show/hide
Query:  MANAAQNTSINSSANPTFSSPPLNQLLNQITSIKLDRSNYMLWKNLALPILRSYKLEGHLTGKTPCPPKFTQDVVEEFTGVTSSETTVGATLEASSSTTT
        MANA    +  S ++  FS+PPLNQ+LNQ+T++KLDR NY+LWK LALPIL+ YKLEGHLT +TPCP  F        T VT  E    AT+ ASSS T 
Subjt:  MANAAQNTSINSSANPTFSSPPLNQLLNQITSIKLDRSNYMLWKNLALPILRSYKLEGHLTGKTPCPPKFTQDVVEEFTGVTSSETTVGATLEASSSTTT

Query:  TIKTSNPQYELWVVTDQLLLGWLYNSMTAEVATQVMGHENAKDLWKAIQEIFGVQSRAEEDFLRQVFQQTRKGSLKMAEYLRTMKTHADNLGQAGSPVSA
         I   NP +E WV TD LLLGWLYNSMT +VA Q+MG  N +DLW A Q+ FGVQSRAEEDFLRQ+ Q TRK                            
Subjt:  TIKTSNPQYELWVVTDQLLLGWLYNSMTAEVATQVMGHENAKDLWKAIQEIFGVQSRAEEDFLRQVFQQTRKGSLKMAEYLRTMKTHADNLGQAGSPVSA

Query:  RSLISQVLLGLDEEYNPVVAMIQGKTEISWAEMQGELLAFEKRLELQNAHK
                 GLDE YN V+ +IQGK +ISW +MQ +LL FEKRL+ QN  K
Subjt:  RSLISQVLLGLDEEYNPVVAMIQGKTEISWAEMQGELLAFEKRLELQNAHK

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia]2.0e-7160.85Show/hide
Query:  FSSPPLNQLLNQITSIKLDRSNYMLWKNLALPILRSYKLEGHLTGKTPCPPKFTQDVVEEFTGVTSSETTVGATLEASSSTTTTIKTSNPQYELWVVTDQ
        F+SPPLNQLLNQITSIK+DR N++LW+NLALPILRSYKL  +LTG  PCPP          T +  ++T        SS ++ T+   NP YE W+V D+
Subjt:  FSSPPLNQLLNQITSIKLDRSNYMLWKNLALPILRSYKLEGHLTGKTPCPPKFTQDVVEEFTGVTSSETTVGATLEASSSTTTTIKTSNPQYELWVVTDQ

Query:  LLLGWLYNSMTAEVATQVMGHENAKDLWKAIQEIFGVQSRAEEDFLRQVFQQTRKGSLKMAEYLRTMKTHADNLGQAGSPVSARSLISQVLLGLDEEYNP
        LLLGWLYNSM A+VA QVMG   +++LW A+QE+FGVQSRAE D+L+QVFQQT KGSL+M EYL+ MK+HADNL  AGS VS R L+SQVL GLDEEYNP
Subjt:  LLLGWLYNSMTAEVATQVMGHENAKDLWKAIQEIFGVQSRAEEDFLRQVFQQTRKGSLKMAEYLRTMKTHADNLGQAGSPVSARSLISQVLLGLDEEYNP

Query:  VVAMIQGKTEISWAEMQGELLAFEKRLELQNAHKT
        +V  +QGK  +SW+EM  ELL +EKRLE QN+ K+
Subjt:  VVAMIQGKTEISWAEMQGELLAFEKRLELQNAHKT

XP_038902487.1 uncharacterized protein LOC120089143 [Benincasa hispida]3.1e-6761.4Show/hide
Query:  TSIKLDRSNYMLWKNLALPILRSYKLEGHLTGKTPCPPKFTQDVVE---------------EFTGVTSSETTVGATLEASSSTTTTIKTSNPQYELWVVT
        T+IKLD+ NY+LW+NLALPILRSY+LEGHLTG+ PCPP+F+    +               +++G+ S     G T  ++SS    +   NP YE   V 
Subjt:  TSIKLDRSNYMLWKNLALPILRSYKLEGHLTGKTPCPPKFTQDVVE---------------EFTGVTSSETTVGATLEASSSTTTTIKTSNPQYELWVVT

Query:  DQLLLGWLYNSMTAEVATQVMGHENAKDLWKAIQEIFGVQSRAEEDFLRQVFQQTRKGSLKMAEYLRTMKTHADNLGQAGSPVSARSLISQVLLGLDEEY
        DQLLLGWLYN MTAEVA QVMG+EN K LW AIQE+FG+QSRA ED+LRQVFQQT KG++KM EYLR MKTH+DNLG  GSPV  R+L+SQVLLGLDEE+
Subjt:  DQLLLGWLYNSMTAEVATQVMGHENAKDLWKAIQEIFGVQSRAEEDFLRQVFQQTRKGSLKMAEYLRTMKTHADNLGQAGSPVSARSLISQVLLGLDEEY

Query:  NPVVAMIQGKTEISWAEMQGELLAFEKR
        NP VA IQG++EISW  MQ ELLAFEKR
Subjt:  NPVVAMIQGKTEISWAEMQGELLAFEKR

TrEMBL top hitse value%identityAlignment
A0A1S4E1U9 uncharacterized protein LOC107991581 isoform X44.2e-5451.39Show/hide
Query:  MANAAQNTSINSSANPTFSSPPLNQLLNQITSIKLDRSNYMLWKNLALPILRSYKLEGHLTGKTPCPPKFTQDVVEEFTGVTSSETTVGATLEASSSTTT
        MANA    +  S ++  FS+PPLNQ+LNQ+T++KLDR NY+LWK LALPIL+ YKLEGHLT +TPCP  F        T VT  E    AT+ ASSS T 
Subjt:  MANAAQNTSINSSANPTFSSPPLNQLLNQITSIKLDRSNYMLWKNLALPILRSYKLEGHLTGKTPCPPKFTQDVVEEFTGVTSSETTVGATLEASSSTTT

Query:  TIKTSNPQYELWVVTDQLLLGWLYNSMTAEVATQVMGHENAKDLWKAIQEIFGVQSRAEEDFLRQVFQQTRKGSLKMAEYLRTMKTHADNLGQAGSPVSA
         I   NP +E WV TD LLLGWLYNSMT +VA Q+MG  N +DLW A Q+ FGVQSRAEEDFLRQ+ Q TRK                            
Subjt:  TIKTSNPQYELWVVTDQLLLGWLYNSMTAEVATQVMGHENAKDLWKAIQEIFGVQSRAEEDFLRQVFQQTRKGSLKMAEYLRTMKTHADNLGQAGSPVSA

Query:  RSLISQVLLGLDEEYNPVVAMIQGKTEISWAEMQGELLAFEKRLELQNAHK
                 GLDE YN V+ +IQGK +ISW +MQ +LL FEKRL+ QN  K
Subjt:  RSLISQVLLGLDEEYNPVVAMIQGKTEISWAEMQGELLAFEKRLELQNAHK

A0A5A7SIT7 Uncharacterized protein2.0e-7260.96Show/hide
Query:  MANAAQNTSINSSANPTFSSPPLNQLLNQITSIKLDRSNYMLWKNLALPILRSYKLEGHLTGKTPCPPKFTQDVVEEFTGVTSSETTVGATLEASSSTTT
        MANA    +  S ++  FS+PPLNQ+LNQ+ ++KLDR NY+LWK LALPIL+ YKLEGHLTG+TPCP  F        T VT  E    AT+ ASSS T 
Subjt:  MANAAQNTSINSSANPTFSSPPLNQLLNQITSIKLDRSNYMLWKNLALPILRSYKLEGHLTGKTPCPPKFTQDVVEEFTGVTSSETTVGATLEASSSTTT

Query:  TIKTSNPQYELWVVTDQLLLGWLYNSMTAEVATQVMGHENAKDLWKAIQEIFGVQSRAEEDFLRQVFQQTRKGSLKMAEYLRTMKTHADNLGQAGSPVSA
         I   N  +E WV TD LLLGWLYNSMT +VA Q+MG  N +DLW A Q+ FGVQSRAEEDFLRQ+ Q TRKG+ KM EYL  MKT+ DNLGQ GSPV  
Subjt:  TIKTSNPQYELWVVTDQLLLGWLYNSMTAEVATQVMGHENAKDLWKAIQEIFGVQSRAEEDFLRQVFQQTRKGSLKMAEYLRTMKTHADNLGQAGSPVSA

Query:  RSLISQVLLGLDEEYNPVVAMIQGKTEISWAEMQGELLAFEKRLELQNAHK
        R+LISQVLLGLDE YN V+ +IQGK +ISW +MQ +LL FEK L+ QN  K
Subjt:  RSLISQVLLGLDEEYNPVVAMIQGKTEISWAEMQGELLAFEKRLELQNAHK

A0A5D3BCH9 Uncharacterized protein1.5e-5660.68Show/hide
Query:  MANAAQNTSINSSANPTFSSPPLNQLLNQITSIKLDRSNYMLWKNLALPILRSYKLEGHLTGKTPCPPKFTQDVVEEFTGVTSSETTVGATLEASSSTTT
        MANA    +  S ++  FS+PPLNQ+LNQ+ ++KLDR NY+LWK LALPIL+ YKLEGHLTG+TPCP  F        T VT  E    AT+ ASSS T 
Subjt:  MANAAQNTSINSSANPTFSSPPLNQLLNQITSIKLDRSNYMLWKNLALPILRSYKLEGHLTGKTPCPPKFTQDVVEEFTGVTSSETTVGATLEASSSTTT

Query:  TIKTSNPQYELWVVTDQLLLGWLYNSMTAEVATQVMGHENAKDLWKAIQEIFGVQSRAEEDFLRQVFQQTRKGSLKMAEYLRTMKTHADNLGQAGSPVSA
         I   N  +E WV TD LLLGWLYNSMT +VA Q+MG  N +DLW A Q+ FGVQSRAEEDFLRQ+ Q TRKG+ KM EYL  MKT+ DNLGQ GSPV  
Subjt:  TIKTSNPQYELWVVTDQLLLGWLYNSMTAEVATQVMGHENAKDLWKAIQEIFGVQSRAEEDFLRQVFQQTRKGSLKMAEYLRTMKTHADNLGQAGSPVSA

Query:  RSLISQ
        R+LISQ
Subjt:  RSLISQ

A0A6J1D5J0 uncharacterized protein LOC1110175014.2e-5468Show/hide
Query:  VVEEFTGVTSSETTVGATLEASSSTTTTIKTSNPQYELWVVTDQLLLGWLYNSMTAEVATQVMGHENAKDLWKAIQEIFGVQSRAEEDFLRQVFQQTRKG
        V +    + +S+T + A    SSS+  T    NP YE WV TDQLLLGWLYNSMT EVATQVMG+ENA DLW AIQE+FGVQS+AEED+LRQVFQQTRKG
Subjt:  VVEEFTGVTSSETTVGATLEASSSTTTTIKTSNPQYELWVVTDQLLLGWLYNSMTAEVATQVMGHENAKDLWKAIQEIFGVQSRAEEDFLRQVFQQTRKG

Query:  SLKMAEYLRTMKTHADNLGQAGSPVSARSLISQVLLGLDEEYNPVVAMIQGKTEISWAEMQGELLAFEKRLELQN
        SLKM ++LR MK+HADNLGQAGSPV  RSLISQVLLGLDEEYNPVVA IQGK  ISW EMQ E  +     + QN
Subjt:  SLKMAEYLRTMKTHADNLGQAGSPVSARSLISQVLLGLDEEYNPVVAMIQGKTEISWAEMQGELLAFEKRLELQN

A0A6J1DCW4 uncharacterized protein LOC1110195989.9e-7260.85Show/hide
Query:  FSSPPLNQLLNQITSIKLDRSNYMLWKNLALPILRSYKLEGHLTGKTPCPPKFTQDVVEEFTGVTSSETTVGATLEASSSTTTTIKTSNPQYELWVVTDQ
        F+SPPLNQLLNQITSIK+DR N++LW+NLALPILRSYKL  +LTG  PCPP          T +  ++T        SS ++ T+   NP YE W+V D+
Subjt:  FSSPPLNQLLNQITSIKLDRSNYMLWKNLALPILRSYKLEGHLTGKTPCPPKFTQDVVEEFTGVTSSETTVGATLEASSSTTTTIKTSNPQYELWVVTDQ

Query:  LLLGWLYNSMTAEVATQVMGHENAKDLWKAIQEIFGVQSRAEEDFLRQVFQQTRKGSLKMAEYLRTMKTHADNLGQAGSPVSARSLISQVLLGLDEEYNP
        LLLGWLYNSM A+VA QVMG   +++LW A+QE+FGVQSRAE D+L+QVFQQT KGSL+M EYL+ MK+HADNL  AGS VS R L+SQVL GLDEEYNP
Subjt:  LLLGWLYNSMTAEVATQVMGHENAKDLWKAIQEIFGVQSRAEEDFLRQVFQQTRKGSLKMAEYLRTMKTHADNLGQAGSPVSARSLISQVLLGLDEEYNP

Query:  VVAMIQGKTEISWAEMQGELLAFEKRLELQNAHKT
        +V  +QGK  +SW+EM  ELL +EKRLE QN+ K+
Subjt:  VVAMIQGKTEISWAEMQGELLAFEKRLELQNAHKT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.6e-0521.35Show/hide
Query:  IKLDRSNYMLWKNLALPILRSYKLEGHLTGKTPCPPKFTQDVVEEFTGVTSSETTVGATLEASSSTTTTIKTSNPQYELWVVTDQLLLGWLYNSMT-AEV
        + ++ SNY  W+ L L    S+ + GH+ G                                      T+  +N     W   D ++   LY ++T  + 
Subjt:  IKLDRSNYMLWKNLALPILRSYKLEGHLTGKTPCPPKFTQDVVEEFTGVTSSETTVGATLEASSSTTTTIKTSNPQYELWVVTDQLLLGWLYNSMT-AEV

Query:  ATQVMGHENAKDLWKAIQEIFGVQSRAEEDFLRQVFQQTRKGSLKMAEYLRTMKTHADNLGQAGSPVSARSLISQVLLGLDEEYNPVVAMIQ
            +    ++D+W  I+  F     A    L    +    G +++A+Y R MK  AD+L     PV+ R+L+  VL GL+ +++ ++ +I+
Subjt:  ATQVMGHENAKDLWKAIQEIFGVQSRAEEDFLRQVFQQTRKGSLKMAEYLRTMKTHADNLGQAGSPVSARSLISQVLLGLDEEYNPVVAMIQ

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.5e-0824.19Show/hide
Query:  SIKLDRSNYMLWKNLALPILRSYKLEGHLTGKTPCPPKFTQDVVEEFTGVTSSETTVGATLEASSSTTTTIKTSNPQYELWVVTDQLLLGWLYNSMTAEV
        ++ L++ NY +W+ L   +  S+ + GH+ G                                 SST T +         W   D L+  W+Y ++T  +
Subjt:  SIKLDRSNYMLWKNLALPILRSYKLEGHLTGKTPCPPKFTQDVVEEFTGVTSSETTVGATLEASSSTTTTIKTSNPQYELWVVTDQLLLGWLYNSMTAEV

Query:  A-TQVMGHENAKDLWKAIQEIFGVQSRAEEDFLRQVFQQTRKGSLKMAEYLRTMKTHADNLGQAGSPVSARSLISQVLLGLDEEYNPVVAMIQGKTEI-S
          T +     A+DLW +++ +F     A         + T    L + EY + +K+ +D L    SP+S R L+  +L GL E+Y+ ++ +I+ K+   S
Subjt:  A-TQVMGHENAKDLWKAIQEIFGVQSRAEEDFLRQVFQQTRKGSLKMAEYLRTMKTHADNLGQAGSPVSARSLISQVLLGLDEEYNPVVAMIQGKTEI-S

Query:  WAEMQGELLAFEKRL
        + E +  LL  E RL
Subjt:  WAEMQGELLAFEKRL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAACGCCGCTCAAAACACCTCCATCAATTCCTCTGCAAATCCTACATTCAGCAGTCCTCCTCTTAATCAACTGCTGAATCAAATTACCTCCATCAAGCTAGATAG
GAGTAATTACATGTTGTGGAAGAATCTGGCACTTCCCATTTTGAGAAGCTACAAACTAGAAGGTCATTTGACCGGAAAAACGCCTTGCCCACCTAAATTCACTCAAGATG
TTGTTGAAGAATTTACTGGAGTTACAAGTTCAGAAACTACAGTGGGTGCTACCTTAGAAGCTTCAAGCTCAACAACAACCACAATTAAAACAAGCAATCCTCAGTATGAA
TTGTGGGTTGTAACTGACCAACTTCTTCTTGGTTGGTTATATAATTCAATGACGGCAGAAGTAGCTACACAAGTAATGGGACACGAAAATGCGAAGGATCTGTGGAAAGC
CATTCAAGAAATTTTTGGAGTTCAGTCACGGGCAGAAGAGGATTTTCTTCGACAAGTGTTTCAACAAACCCGTAAAGGGTCACTTAAGATGGCAGAGTACTTGCGCACAA
TGAAGACCCACGCTGATAATCTTGGACAAGCCGGAAGTCCTGTTTCAGCTCGATCTCTTATTTCCCAAGTCTTGTTGGGATTAGATGAGGAGTACAATCCTGTTGTGGCT
ATGATTCAAGGTAAGACTGAGATTTCGTGGGCTGAAATGCAAGGAGAACTTCTTGCATTCGAGAAGCGTTTGGAACTACAAAACGCTCACAAAACTACCTCTTTAACCAA
AGTACATCAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCAACGCCGCTCAAAACACCTCCATCAATTCCTCTGCAAATCCTACATTCAGCAGTCCTCCTCTTAATCAACTGCTGAATCAAATTACCTCCATCAAGCTAGATAG
GAGTAATTACATGTTGTGGAAGAATCTGGCACTTCCCATTTTGAGAAGCTACAAACTAGAAGGTCATTTGACCGGAAAAACGCCTTGCCCACCTAAATTCACTCAAGATG
TTGTTGAAGAATTTACTGGAGTTACAAGTTCAGAAACTACAGTGGGTGCTACCTTAGAAGCTTCAAGCTCAACAACAACCACAATTAAAACAAGCAATCCTCAGTATGAA
TTGTGGGTTGTAACTGACCAACTTCTTCTTGGTTGGTTATATAATTCAATGACGGCAGAAGTAGCTACACAAGTAATGGGACACGAAAATGCGAAGGATCTGTGGAAAGC
CATTCAAGAAATTTTTGGAGTTCAGTCACGGGCAGAAGAGGATTTTCTTCGACAAGTGTTTCAACAAACCCGTAAAGGGTCACTTAAGATGGCAGAGTACTTGCGCACAA
TGAAGACCCACGCTGATAATCTTGGACAAGCCGGAAGTCCTGTTTCAGCTCGATCTCTTATTTCCCAAGTCTTGTTGGGATTAGATGAGGAGTACAATCCTGTTGTGGCT
ATGATTCAAGGTAAGACTGAGATTTCGTGGGCTGAAATGCAAGGAGAACTTCTTGCATTCGAGAAGCGTTTGGAACTACAAAACGCTCACAAAACTACCTCTTTAACCAA
AGTACATCAGTAA
Protein sequenceShow/hide protein sequence
MANAAQNTSINSSANPTFSSPPLNQLLNQITSIKLDRSNYMLWKNLALPILRSYKLEGHLTGKTPCPPKFTQDVVEEFTGVTSSETTVGATLEASSSTTTTIKTSNPQYE
LWVVTDQLLLGWLYNSMTAEVATQVMGHENAKDLWKAIQEIFGVQSRAEEDFLRQVFQQTRKGSLKMAEYLRTMKTHADNLGQAGSPVSARSLISQVLLGLDEEYNPVVA
MIQGKTEISWAEMQGELLAFEKRLELQNAHKTTSLTKVHQ