; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001100 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001100
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr4:24475158..24478642
RNA-Seq ExpressionLag0001100
SyntenyLag0001100
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008233 - peptidase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040811.1 retrotransposon gag protein [Cucumis melo var. makuwa]8.8e-0467.5Show/hide
Query:  MKNLELKLFDEVNSDEKLHSIIPSRMKRKFSVLINTEGSL
        MK+L++K F E N D+K+HS +PSRMKRK SV INTEGSL
Subjt:  MKNLELKLFDEVNSDEKLHSIIPSRMKRKFSVLINTEGSL

KAA0041559.1 hypothetical protein E6C27_scaffold93G00150 [Cucumis melo var. makuwa]1.4e-0437.76Show/hide
Query:  MKNLELKLFDEVNSDEKLHSIIPSRMKRKFSVLINTEGSLKFEGSSLYPAALFLLQVRGFSVVRLLRCSSSKCEGSYVVRCCIVPSSLKFEGSHAALL
        MK+L+ K F E N D+K+HS +PSRMKRK SV INTEG+      ++    L L   RG S      C S +    +++ C        F  SH   L
Subjt:  MKNLELKLFDEVNSDEKLHSIIPSRMKRKFSVLINTEGSLKFEGSSLYPAALFLLQVRGFSVVRLLRCSSSKCEGSYVVRCCIVPSSLKFEGSHAALL

PAA71337.1 hypothetical protein BOX15_Mlig005876g1 [Macrostomum lignano]2.3e-0429.11Show/hide
Query:  RSEITASEAGDDRCTKLTHSSKKETNCNNICAKKEMEAKSSIVQSKKINKSADRSSKKINKPTGRSSKKIN-----KSADRSSNRSSEKINKSVDRSSKK
        RS  ++ +    +  K+     +++   N    ++   K    +S+K+N+   R S+K+N+   R S+K+N     KS+ +  +R S K+N+   R S+K
Subjt:  RSEITASEAGDDRCTKLTHSSKKETNCNNICAKKEMEAKSSIVQSKKINKSADRSSKKINKPTGRSSKKIN-----KSADRSSNRSSEKINKSVDRSSKK

Query:  INKPTGRSSKKINKLADRSS-----NRSSEKINKSADRSSKKINKATGRSSKKINKSQ
        +N+  GR S+K+N+   R S     +R S K+N+   R S+K+N+  GR S+K+N+ +
Subjt:  INKPTGRSSKKINKLADRSS-----NRSSEKINKSADRSSKKINKATGRSSKKINKSQ

PAA71360.1 hypothetical protein BOX15_Mlig020368g1 [Macrostomum lignano]6.7e-0433.58Show/hide
Query:  KEMEAKSSIVQSKKINKSADRSSKKINKPTGRSSKKIN-----KSADRSSNRSSEKINKSVDRSSKKINKPTGRSSKKINKLADRSS-----NRSSEKIN
        ++   K    +S+K+N+   R S+K+N+  GR S+K+N     KSA +  +R S K+N+   R S+K+N+  GR S+K+N+   R S     +R S K+N
Subjt:  KEMEAKSSIVQSKKINKSADRSSKKINKPTGRSSKKIN-----KSADRSSNRSSEKINKSVDRSSKKINKPTGRSSKKINKLADRSS-----NRSSEKIN

Query:  ---------KSADRSSKKINKATGRSSKKINKSQ
                 K   R S+K+N+  GR S+K+N+ +
Subjt:  ---------KSADRSSKKINKATGRSSKKINKSQ

QAX24809.1 adhesion protein 1 [Macrostomum lignano]6.7e-0433.58Show/hide
Query:  KEMEAKSSIVQSKKINKSADRSSKKINKPTGRSSKKIN-----KSADRSSNRSSEKINKSVDRSSKKINKPTGRSSKKINKLADRSS-----NRSSEKIN
        ++   K    +S+K+N+   R S+K+N+  GR S+K+N     KSA +  +R S K+N+   R S+K+N+  GR S+K+N+   R S     +R S K+N
Subjt:  KEMEAKSSIVQSKKINKSADRSSKKINKPTGRSSKKIN-----KSADRSSNRSSEKINKSVDRSSKKINKPTGRSSKKINKLADRSS-----NRSSEKIN

Query:  ---------KSADRSSKKINKATGRSSKKINKSQ
                 K   R S+K+N+  GR S+K+N+ +
Subjt:  ---------KSADRSSKKINKATGRSSKKINKSQ

TrEMBL top hitse value%identityAlignment
A0A267FC56 Uncharacterized protein3.3e-0433.58Show/hide
Query:  KEMEAKSSIVQSKKINKSADRSSKKINKPTGRSSKKIN-----KSADRSSNRSSEKINKSVDRSSKKINKPTGRSSKKINKLADRSS-----NRSSEKIN
        ++   K    +S+K+N+   R S+K+N+  GR S+K+N     KSA +  +R S K+N+   R S+K+N+  GR S+K+N+   R S     +R S K+N
Subjt:  KEMEAKSSIVQSKKINKSADRSSKKINKPTGRSSKKIN-----KSADRSSNRSSEKINKSVDRSSKKINKPTGRSSKKINKLADRSS-----NRSSEKIN

Query:  ---------KSADRSSKKINKATGRSSKKINKSQ
                 K   R S+K+N+  GR S+K+N+ +
Subjt:  ---------KSADRSSKKINKATGRSSKKINKSQ

A0A267FDU5 Uncharacterized protein1.1e-0429.11Show/hide
Query:  RSEITASEAGDDRCTKLTHSSKKETNCNNICAKKEMEAKSSIVQSKKINKSADRSSKKINKPTGRSSKKIN-----KSADRSSNRSSEKINKSVDRSSKK
        RS  ++ +    +  K+     +++   N    ++   K    +S+K+N+   R S+K+N+   R S+K+N     KS+ +  +R S K+N+   R S+K
Subjt:  RSEITASEAGDDRCTKLTHSSKKETNCNNICAKKEMEAKSSIVQSKKINKSADRSSKKINKPTGRSSKKIN-----KSADRSSNRSSEKINKSVDRSSKK

Query:  INKPTGRSSKKINKLADRSS-----NRSSEKINKSADRSSKKINKATGRSSKKINKSQ
        +N+  GR S+K+N+   R S     +R S K+N+   R S+K+N+  GR S+K+N+ +
Subjt:  INKPTGRSSKKINKLADRSS-----NRSSEKINKSADRSSKKINKATGRSSKKINKSQ

A0A411ACX2 Adhesion protein 13.3e-0433.58Show/hide
Query:  KEMEAKSSIVQSKKINKSADRSSKKINKPTGRSSKKIN-----KSADRSSNRSSEKINKSVDRSSKKINKPTGRSSKKINKLADRSS-----NRSSEKIN
        ++   K    +S+K+N+   R S+K+N+  GR S+K+N     KSA +  +R S K+N+   R S+K+N+  GR S+K+N+   R S     +R S K+N
Subjt:  KEMEAKSSIVQSKKINKSADRSSKKINKPTGRSSKKIN-----KSADRSSNRSSEKINKSVDRSSKKINKPTGRSSKKINKLADRSS-----NRSSEKIN

Query:  ---------KSADRSSKKINKATGRSSKKINKSQ
                 K   R S+K+N+  GR S+K+N+ +
Subjt:  ---------KSADRSSKKINKATGRSSKKINKSQ

A0A5A7TJM9 Uncharacterized protein6.5e-0537.76Show/hide
Query:  MKNLELKLFDEVNSDEKLHSIIPSRMKRKFSVLINTEGSLKFEGSSLYPAALFLLQVRGFSVVRLLRCSSSKCEGSYVVRCCIVPSSLKFEGSHAALL
        MK+L+ K F E N D+K+HS +PSRMKRK SV INTEG+      ++    L L   RG S      C S +    +++ C        F  SH   L
Subjt:  MKNLELKLFDEVNSDEKLHSIIPSRMKRKFSVLINTEGSLKFEGSSLYPAALFLLQVRGFSVVRLLRCSSSKCEGSYVVRCCIVPSSLKFEGSHAALL

A0A5D3CRV2 Uncharacterized protein3.3e-0451.72Show/hide
Query:  MKNLELKLFDEVNSDEKLHSIIPSRMKRKFSVLINTEGSLKFEGSSLYPAALFLLQVR
        MK+L+ K F E N D+K+HS +PSRMKRK SV INTE  ++    S + A L  LQ+R
Subjt:  MKNLELKLFDEVNSDEKLHSIIPSRMKRKFSVLINTEGSLKFEGSSLYPAALFLLQVR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACGAGAAGCTTCATAGTATCATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATACGGA
AGGTTCCTTGAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACGACTGCTACGTTGTTCCTCCTCCAAGTGCG
AAGGATCTTATGTGGTGCGTTGTTGCATTGTTCCCTCTTCTCTCAAGTTCGAGGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCG
CTCCGTTGCAGTTCCTTCTTTCCAAGGTCGAAGGTTCTCACTCGCTGCGTTGTAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGCAGTTCCTTCTCCCAA
ATTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTTCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCAAAATTCGAAGTTCCTTCCTCCAAGT
CTGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTTGCTGCAGTTCCTTCCTCACAGTTCGAAGGTTCTCACGCGCCTCGCT
GCAGTTCCTTCCCCCAAGTTCGAAGATTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTTCGCGCTGCGCTTCGTTGCAGT
TCCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTCGCTACAGTTTCTTCTCCCTAACTTCGAAGGTTCTCACGCGCTTCGCTGCAATTCCTTCCTCCCTAAGTTTGAAGT
TTCTCACGCATGAAGGCGAATCTGGTGACTACCCCTGCAGGTTACTCAGATCACCCAATGAAATAGGGGACTGGTCTAGCAGGAGTGAGATCACTGCAAGTGAAGCTGGT
GACGACCGTTGCACGAAGCTTACACATTCAAGCAAGAAAGAAACAAATTGCAATAATATATGTGCAAAGAAAGAAATGGAAGCCAAAAGCTCTATAGTACAATCAAAGAA
GATCAATAAGTCAGCAGACCGATCATCCAAGAAGATCAACAAGCCAACAGGCCGATCATCCAAGAAGATCAACAAATCAGCAGACCGATCATCAAACCGATCATCCGAGA
AGATCAATAAGTCAGTAGACCGATCATCCAAGAAGATCAACAAGCCAACAGGCCGATCATCCAAGAAGATAAACAAGTTAGCAGATCGATCATCAAACCGATCATCCGAG
AAGATCAACAAGTCAGCAGACCGATCATCCAAGAAGATCAACAAGGCAACAGGCCGATCATCCAAGAAGATCAACAAGTCACAACAGGCTGATCCAAGAGATCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACGAGAAGCTTCATAGTATCATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATACGGA
AGGTTCCTTGAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACGACTGCTACGTTGTTCCTCCTCCAAGTGCG
AAGGATCTTATGTGGTGCGTTGTTGCATTGTTCCCTCTTCTCTCAAGTTCGAGGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCG
CTCCGTTGCAGTTCCTTCTTTCCAAGGTCGAAGGTTCTCACTCGCTGCGTTGTAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGCAGTTCCTTCTCCCAA
ATTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTTCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCAAAATTCGAAGTTCCTTCCTCCAAGT
CTGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTTGCTGCAGTTCCTTCCTCACAGTTCGAAGGTTCTCACGCGCCTCGCT
GCAGTTCCTTCCCCCAAGTTCGAAGATTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTTCGCGCTGCGCTTCGTTGCAGT
TCCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTCGCTACAGTTTCTTCTCCCTAACTTCGAAGGTTCTCACGCGCTTCGCTGCAATTCCTTCCTCCCTAAGTTTGAAGT
TTCTCACGCATGAAGGCGAATCTGGTGACTACCCCTGCAGGTTACTCAGATCACCCAATGAAATAGGGGACTGGTCTAGCAGGAGTGAGATCACTGCAAGTGAAGCTGGT
GACGACCGTTGCACGAAGCTTACACATTCAAGCAAGAAAGAAACAAATTGCAATAATATATGTGCAAAGAAAGAAATGGAAGCCAAAAGCTCTATAGTACAATCAAAGAA
GATCAATAAGTCAGCAGACCGATCATCCAAGAAGATCAACAAGCCAACAGGCCGATCATCCAAGAAGATCAACAAATCAGCAGACCGATCATCAAACCGATCATCCGAGA
AGATCAATAAGTCAGTAGACCGATCATCCAAGAAGATCAACAAGCCAACAGGCCGATCATCCAAGAAGATAAACAAGTTAGCAGATCGATCATCAAACCGATCATCCGAG
AAGATCAACAAGTCAGCAGACCGATCATCCAAGAAGATCAACAAGGCAACAGGCCGATCATCCAAGAAGATCAACAAGTCACAACAGGCTGATCCAAGAGATCATTAA
Protein sequenceShow/hide protein sequence
MKNLELKLFDEVNSDEKLHSIIPSRMKRKFSVLINTEGSLKFEGSSLYPAALFLLQVRGFSVVRLLRCSSSKCEGSYVVRCCIVPSSLKFEGSHAALLEFLLPKFEGSHA
LRCSSFFPRSKVLTRCVVVLSPQVRRFTHFAAVPSPKFEGSHALRSAISSPKFEGSHALRAVPSSKFEVPSSKSEGSHALRCSSFPQVRRFSRALLQFLPHSSKVLTRLA
AVPSPKFEDSHVASLQFLPPSLKVLTSLRFALRFVAVPSSKFEGSHTLRYSFFSLTSKVLTRFAAIPSSLSLKFLTHEGESGDYPCRLLRSPNEIGDWSSRSEITASEAG
DDRCTKLTHSSKKETNCNNICAKKEMEAKSSIVQSKKINKSADRSSKKINKPTGRSSKKINKSADRSSNRSSEKINKSVDRSSKKINKPTGRSSKKINKLADRSSNRSSE
KINKSADRSSKKINKATGRSSKKINKSQQADPRDH