; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018390 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018390
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr5:25581851..25587450
RNA-Seq ExpressionLag0018390
SyntenyLag0018390
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0016020 - membrane (cellular component)
GO:0008233 - peptidase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032941.1 hypothetical protein E6C27_scaffold269G00150 [Cucumis melo var. makuwa]2.0e-0762.9Show/hide
Query:  STSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSV
        ST TR SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M  L+ K F E N D K+HS V
Subjt:  STSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSV

KAA0036008.1 Retrotransposon gag protein [Cucumis melo var. makuwa]2.0e-0760.32Show/hide
Query:  MSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSV
        MST  R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M + + K F E N D K+HS V
Subjt:  MSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSV

KAA0050345.1 gag protease polyprotein [Cucumis melo var. makuwa]8.9e-0862.9Show/hide
Query:  STSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSV
        ST TR SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N D K+HS V
Subjt:  STSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSV

KAA0056374.1 retrotransposon gag protein [Cucumis melo var. makuwa]8.9e-0862.71Show/hide
Query:  TRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSV
        TR SAF+RLS+STSKK +PSTS FDRLK+ +DQ +R+M +L+ KLF E N D K+HS V
Subjt:  TRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSV

KAA0065418.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.2e-0764.41Show/hide
Query:  TRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSV
        TR SAF+RLS+STSKK +PSTSVFDRLK+T+DQ +R+M +L+ K F E N D K+HS V
Subjt:  TRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSV

TrEMBL top hitse value%identityAlignment
A0A5A7STU1 Uncharacterized protein9.6e-0862.9Show/hide
Query:  STSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSV
        ST TR SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M  L+ K F E N D K+HS V
Subjt:  STSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSV

A0A5A7SZJ7 Retrotransposon gag protein9.6e-0860.32Show/hide
Query:  MSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSV
        MST  R SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M + + K F E N D K+HS V
Subjt:  MSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSV

A0A5A7U9R6 Gag protease polyprotein4.3e-0862.9Show/hide
Query:  STSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSV
        ST TR SAF+RLS+STSKK +PSTS FDRLK+T+DQ +R+M +L+ K F E N D K+HS V
Subjt:  STSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSV

A0A5A7VCR0 Retrotransposon gag protein5.6e-0864.41Show/hide
Query:  TRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSV
        TR SAF+RLS+STSKK +PSTSVFDRLK+T+DQ +R+M +L+ K F E N D K+HS V
Subjt:  TRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSV

A0A5D3DZF3 Retrotransposon gag protein4.3e-0862.71Show/hide
Query:  TRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSV
        TR SAF+RLS+STSKK +PSTS FDRLK+ +DQ +R+M +L+ KLF E N D K+HS V
Subjt:  TRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCAACCTTCGACATCTGTTTTTGATCGCCTCAAAGTAACAAGCGATCA
ACCTAAAAGAAAAATGGACAACTTGGAGGTGAAACTTTTCGATGAAGTAAACCGCGACAAGAAGCTTCATAGTAGCGTCCGTCACTTCGAAGGTTCTTCGTTGTATCCTG
CTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACGACTGCTACGTTGTTCCTCCTCCAAGTGTGAAGGATCTTATGTGGTGCGTTGTTGCATTGTTCCCTCT
TCTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCGCTCCGTTGCAGTTCCTTCTTTTCAAGGTCGAAGGTTCT
CACTCGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTT
CCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGTAGTTCCTTCCCCCTAAGTTTGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGTTCCTTCC
CCCAAGTTCGAAGGTTCTTACGCGCTTTGCTGCAGTTCCTTCCTCACAGTTCAAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCAAGGTTCTCACGTCG
CTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCATATCGCTTCGCTTCGCGCTACGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACTCGCTTCGC
TGCAGTTCATTCCTCCAAATTCGAAGGTTTCGAAGGTTCTCAAGCGCTGTGATTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTTGCGCTTCGTTGCTCCTTCC
TCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCCTCTCTCC
ACTGTTCCTTCCTCCAAGTTCGAAGGTTCTCAGGCGCTTCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGG
TTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCTACTGCTCATTCTCCAAGTTCG
AAGGTGCTTCTCTCCACCCCTCTTTTTGAAGGTTCGCCACTGAGGTTCTCCTTCTCCAAGTTCGAAGGTTCACCGTTGCTCCTTTTCAAATGTTTGGCGGCGGTTGACGT
CCTCGTTCCGCTTCATCTTCAAATGTTGGCAGTTGACGGCGTGGTGAAATCACTGCAAGTGAAAGCTGATGACACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATC
AAAGTGACTGGTCTAGAAAGACAGGTGGTGAAATCACTGCAAGTGAAAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCT
AGACAGGTGGTGAAATCACTGCAAGTGAAAGCTGATGACGACCGTGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACT
GCAAGTGAAAGCTGATGACGACCGTGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGACAGGTGGTGAAATCACTGCAAGTGAAAAGCTGATGACGACCGTGGTGA
CCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGACAGGTGGTGAAATCACTGCAAGTGAAAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGA
AACTACAGTCATCAAAGTGACTGGTCTAGACAGACAGGTGGTGAAATCACTGCAAGTGAAAAGCTGATGACGACCATGGTGACCACCCCTGCAGGAAACTACAGTCATCA
AAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGATCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGATTGGTCTAGACAGA
GTCAGAGAACTCAGAGTCCAGAGCATTCTGCCAAGAGTCCAGAGTCGGCAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGCCGATCCAACAGATCATCAAGCCAA
CAGGCCGATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCA
ACAAGCTGATCCAAGAGATCAAGAAGCCAACCGACCGATTAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATC
AACAAGTCAGCAGACCGATCATCCAGGAAGATCAACAAGCCAACAAGCCGATCCAAGAGATCATCACGCCAACAGGTCGATCATCCAAGAAGATCAACAAGCCCAATAGG
TCGATCCAAGAGATCATCAACCTAACAGGCCGATCATCCAAGAAGATTAACAAGTCCAATAGGTCGATCCAGGAGATCATCAACCTAACAGACCGATCATCCAAGAAGAT
CAACAAGCCAATAAGCCGATCCAAGAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAACCTAGCAAGTTGATCATCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCAACCTTCGACATCTGTTTTTGATCGCCTCAAAGTAACAAGCGATCA
ACCTAAAAGAAAAATGGACAACTTGGAGGTGAAACTTTTCGATGAAGTAAACCGCGACAAGAAGCTTCATAGTAGCGTCCGTCACTTCGAAGGTTCTTCGTTGTATCCTG
CTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACGACTGCTACGTTGTTCCTCCTCCAAGTGTGAAGGATCTTATGTGGTGCGTTGTTGCATTGTTCCCTCT
TCTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCGCTCCGTTGCAGTTCCTTCTTTTCAAGGTCGAAGGTTCT
CACTCGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTT
CCCCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGTAGTTCCTTCCCCCTAAGTTTGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGTTCCTTCC
CCCAAGTTCGAAGGTTCTTACGCGCTTTGCTGCAGTTCCTTCCTCACAGTTCAAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCAAGGTTCTCACGTCG
CTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCATATCGCTTCGCTTCGCGCTACGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACTCGCTTCGC
TGCAGTTCATTCCTCCAAATTCGAAGGTTTCGAAGGTTCTCAAGCGCTGTGATTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTTGCGCTTCGTTGCTCCTTCC
TCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCCTCTCTCC
ACTGTTCCTTCCTCCAAGTTCGAAGGTTCTCAGGCGCTTCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGG
TTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCTACTGCTCATTCTCCAAGTTCG
AAGGTGCTTCTCTCCACCCCTCTTTTTGAAGGTTCGCCACTGAGGTTCTCCTTCTCCAAGTTCGAAGGTTCACCGTTGCTCCTTTTCAAATGTTTGGCGGCGGTTGACGT
CCTCGTTCCGCTTCATCTTCAAATGTTGGCAGTTGACGGCGTGGTGAAATCACTGCAAGTGAAAGCTGATGACACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATC
AAAGTGACTGGTCTAGAAAGACAGGTGGTGAAATCACTGCAAGTGAAAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCT
AGACAGGTGGTGAAATCACTGCAAGTGAAAGCTGATGACGACCGTGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACT
GCAAGTGAAAGCTGATGACGACCGTGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGACAGGTGGTGAAATCACTGCAAGTGAAAAGCTGATGACGACCGTGGTGA
CCACCCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGACAGGTGGTGAAATCACTGCAAGTGAAAAGCTGATGACGACCGTGGTGACCACCCCTGCAGGA
AACTACAGTCATCAAAGTGACTGGTCTAGACAGACAGGTGGTGAAATCACTGCAAGTGAAAAGCTGATGACGACCATGGTGACCACCCCTGCAGGAAACTACAGTCATCA
AAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGATCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGATTGGTCTAGACAGA
GTCAGAGAACTCAGAGTCCAGAGCATTCTGCCAAGAGTCCAGAGTCGGCAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGCCGATCCAACAGATCATCAAGCCAA
CAGGCCGATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCA
ACAAGCTGATCCAAGAGATCAAGAAGCCAACCGACCGATTAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATC
AACAAGTCAGCAGACCGATCATCCAGGAAGATCAACAAGCCAACAAGCCGATCCAAGAGATCATCACGCCAACAGGTCGATCATCCAAGAAGATCAACAAGCCCAATAGG
TCGATCCAAGAGATCATCAACCTAACAGGCCGATCATCCAAGAAGATTAACAAGTCCAATAGGTCGATCCAGGAGATCATCAACCTAACAGACCGATCATCCAAGAAGAT
CAACAAGCCAATAAGCCGATCCAAGAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAACCTAGCAAGTTGATCATCTAA
Protein sequenceShow/hide protein sequence
MSTSTRPSAFQRLSVSTSKKSQPSTSVFDRLKVTSDQPKRKMDNLEVKLFDEVNRDKKLHSSVRHFEGSSLYPAALFLLQVRGFSVVRLLRCSSSKCEGSYVVRCCIVPS
SLKFDGSHAALLEFLLPKFEGSHALRCSSFFSRSKVLTRCVAVLSPQVRRFTHFAAVPSPKFEGSHALRSAIPSPKFEGSHALRCSSFPLSLKVLTRFAAVPSSKFKVPS
PKFEGSYALCCSSFLTVQRFSRASLQFLPPSSRFSRRFAAVPSSKFEGSHIASLRATLRCSSFLQVRRFSLASLQFIPPNSKVSKVLKRCDSLQFLPPSSKVLLRFVAPS
SKFEGSLTRCCSSFLQVRRFPHALRSLLLQVRRRLSPLFLPPSSKVLRRFVATFLQVRRFSHALLQLLPPSSKVPSRASLAPSPSSKALLSTAPSPSSKALLSTAHSPSS
KVLLSTPLFEGSPLRFSFSKFEGSPLLLFKCLAAVDVLVPLHLQMLAVDGVVKSLQVKADDTVVTTPAGNYSHQSDWSRKTGGEITASEKLMTTVVTTPAGNYSHQSDWS
RQVVKSLQVKADDDRVTTPAGNYSHQSDWSRQVVKSLQVKADDDRGNYSHQSDWSRQTGGEITASEKLMTTVVTTPAGNYSHQSDWSRQTGGEITASEKLMTTVVTTPAG
NYSHQSDWSRQTGGEITASEKLMTTMVTTPAGNYSHQSDWSRQVVKSLQVKLMTIVVTTPAGNYSHQSDWSRQSQRTQSPEHSAKSPESAGRSSKRINKLTSRSNRSSSQ
QADPRDQQANRPIKKINKSAGRSSKRSTSQPTDQEDQQADPRDQEANRPIKKINKSAGRSSKRSTSQPTDQEDQQVSRPIIQEDQQANKPIQEIITPTGRSSKKINKPNR
SIQEIINLTGRSSKKINKSNRSIQEIINLTDRSSKKINKPISRSKRSSSQQADPRDHQPSKLII