CuGenDBv2

Lag0021605 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0021605
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotransposon gag protein
Genome location: chr7:9737383..9739914
RNA-Seq Expression: Lag0021605
Synteny: Lag0021605
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0008233 - peptidase activity (molecular function)
InterPro domains: NA
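
The genome location string can be split into its parts programmatically. The following is a minimal sketch, assuming the `chr:start..end` layout shown in this record and assuming the coordinates are 1-based and inclusive (the record does not state the convention). The names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr7:9737383..9739914")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 2532 bp on chr7.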


Homology
GenBank top hits (e value, % identity, alignment)
KAA0044978.1 retrotransposon gag protein [Cucumis melo var. makuwa] | e value: 2.1e-09 | % identity: 66.67
Query:  TSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL
        TS FDRLK+TNDQ +R+M +L+ K F E N D K+HS +PSRMKRK SV INTEGSL
Subjt:  TSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL

KAA0051997.1 hypothetical protein E6C27_scaffold60G004810 [Cucumis melo var. makuwa] | e value: 3.5e-12 | % identity: 48.89
Query:  MFDVHLHSTFSFPKAKCLHIEEKSIFTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKDLIKTM
        M +++LH  FSF K K    ++    TS FDRLK+T DQ +R+M +L++K F E N D K+HS +PSRMKRK SV INTEG+   L + +
Subjt:  MFDVHLHSTFSFPKAKCLHIEEKSIFTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKDLIKTM

KAA0056218.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | e value: 6.6e-11 | % identity: 44.26
Query:  TSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKSEWKLLPP
        T VFDRLK+TNDQ +R+M  L+ K F E N D K+H+ +PSRMKRK SV IN E                        EPKLH APSP ELKS   +  P
Subjt:  TSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKSEWKLLPP

Query:  SSKVPTRFAAVPSPKFEGSHAL
          +   +FA+  SP  EG+ +L
Subjt:  SSKVPTRFAAVPSPKFEGSHAL

KAA0065966.1 hypothetical protein E6C27_scaffold62G00430 [Cucumis melo var. makuwa] | e value: 9.5e-10 | % identity: 43.08
Query:  TSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKSEWKLLPP
        TS FDR K+TN+Q +R++ +L+ KLF E N D K+HS +PSRMKRK SV INTE                        +PKLH APSP ELKS     PP
Subjt:  TSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKSEWKLLPP

Query:  SSKVPTRFAAVPSPKFEGSHALRCSSFPPS
         +     F+   SPK      L+  S PPS
Subjt:  SSKVPTRFAAVPSPKFEGSHALRCSSFPPS

TYK30948.1 hypothetical protein E5676_scaffold455G001730 [Cucumis melo var. makuwa] | e value: 1.9e-10 | % identity: 55.13
Query:  MFDVHLHSTFSFPKAKCLHIEEKSIFTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLIN
        M +V+ HS FSF KAK LHIEE    TS FD LK+TNDQ +R+M  L+ K F E N+D K++S + S MKRK  V IN
Subjt:  MFDVHLHSTFSFPKAKCLHIEEKSIFTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLIN

TrEMBL top hits (e value, % identity, alignment)
A0A5A7UC34 Uncharacterized protein | e value: 1.7e-12 | % identity: 48.89
Query:  MFDVHLHSTFSFPKAKCLHIEEKSIFTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKDLIKTM
        M +++LH  FSF K K    ++    TS FDRLK+T DQ +R+M +L++K F E N D K+HS +PSRMKRK SV INTEG+   L + +
Subjt:  MFDVHLHSTFSFPKAKCLHIEEKSIFTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKDLIKTM

A0A5A7UM99 Ty3-gypsy retrotransposon protein | e value: 3.2e-11 | % identity: 44.26
Query:  TSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKSEWKLLPP
        T VFDRLK+TNDQ +R+M  L+ K F E N D K+H+ +PSRMKRK SV IN E                        EPKLH APSP ELKS   +  P
Subjt:  TSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKSEWKLLPP

Query:  SSKVPTRFAAVPSPKFEGSHAL
          +   +FA+  SP  EG+ +L
Subjt:  SSKVPTRFAAVPSPKFEGSHAL

A0A5A7VHY3 Uncharacterized protein | e value: 4.6e-10 | % identity: 43.08
Query:  TSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKSEWKLLPP
        TS FDR K+TN+Q +R++ +L+ KLF E N D K+HS +PSRMKRK SV INTE                        +PKLH APSP ELKS     PP
Subjt:  TSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKDLIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKSEWKLLPP

Query:  SSKVPTRFAAVPSPKFEGSHALRCSSFPPS
         +     F+   SPK      L+  S PPS
Subjt:  SSKVPTRFAAVPSPKFEGSHALRCSSFPPS

A0A5A7VRG7 Retrotransposon gag protein | e value: 1.0e-09 | % identity: 66.67
Query:  TSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL
        TS FDRLK+TNDQ +R+M +L+ K F E N D K+HS +PSRMKRK SV INTEGSL
Subjt:  TSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL

A0A5D3E580 Uncharacterized protein | e value: 9.2e-11 | % identity: 55.13
Query:  MFDVHLHSTFSFPKAKCLHIEEKSIFTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLIN
        M +V+ HS FSF KAK LHIEE    TS FD LK+TNDQ +R+M  L+ K F E N+D K++S + S MKRK  V IN
Subjt:  MFDVHLHSTFSFPKAKCLHIEEKSIFTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLIN

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
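
The % identity values in the hit tables can be reproduced from the alignment midlines. In BLAST-style pairwise output, a letter on the midline marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch. The following is a minimal sketch using the first GenBank hit (KAA0044978.1). The function `percent_identity` is a hypothetical helper, and dividing by the query segment length assumes the alignment contains no gaps, which holds for this hit.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percentage of aligned positions whose midline character is a letter.

    ASSUMPTION: the alignment is ungapped, so the query segment length
    equals the alignment length.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Query segment and midline copied from the KAA0044978.1 alignment above.
query = "TSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL"
midline = "TS FDRLK+TNDQ +R+M +L+ K F E N D K+HS +PSRMKRK SV INTEGSL"

print(round(percent_identity(query, midline), 2))
```

For this hit, 38 of 57 aligned positions are identical, which rounds to the 66.67 listed in the table.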

Sequences
CDS sequence
ATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTCACGTCTGTCTTTGATCGCCTCAAAGTAACAAA
CGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGCATCCCGTCACGTATGAAGAGGAAGTTTT
CTGTTCTCATAAATACGGAAGGTTCCTTGAAGGATCTGATCAAGACCATGACAAAGATAAGAGCTTTTAAATGTAAAAGCTCCTTGTCGCAAGAGCCTAAACTGCATGAT
GCTCCTAGCCCACACGAGCTTAAAAGCGAATGGAAGTTGCTTCCTCCAAGTTCGAAGGTTCCCACGCGCTTCGCTGCAGTTCCTTCCCCCAAATTCGAAGGTTCTCACGC
GCTTCGCTGCAGTTCCTTCCCTCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGTTTCACTGCAGTTCTTTCCTC
ACAGTTCGAAGGTTCTCACGCGCTTCACTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCG
CTTCGCTTCGCTCACGCGCTTCGCTGCGTTCCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCG
TGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGTTTCGCTGCATTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCAAAGTTCG
AAGGTTCTCACGCGCTGCGCTGCAGTTCCTTCCTCAAAGTTCAAAGGTTCTCACGCGCTTCGTTGCAGTTCTTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCA
GTTCCTTCCCCCCAAGTTCGAAGTTCCTTCCTCACAATTCGAAGGTTCTCACGCGCTTCGCTGTTGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCG
CTCATGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTC
CTTCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGTCGCTTCGCTGCGCTCATGCGCTTCGCTGCAGTTCCTTCC
TCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGTTCGAAGGTTCTCATGC
GCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCA
AGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTTCTTCCCCAAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTT
CGTTGCAGTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAGTTCGAAGATTCTCACGCACTTCGCTGCAGTTCCTTCCTCCAAG
TTCAAAGGTTCTCACGCGCTTCGCTGCATTCCAGCGCTACTTCCTAAAGTCCAAAGACGTCAATTGTCCTCACGCTACGCTGCTTCCTTCTCCAAGTTCGAGGGTCCTCA
TGCTACGCTCGGCTACATTGCTGCGCTACTTCCTAAAGTCCAAAGACGTCAATTGTCCCTGCACTCATGCTGAAAAGGGCATGGCGGCGACACAAGTCCAAGGACATGTC
CCAAAGCGAGGAACATGTCCCTGTACTCGTGCTGAAAGGCGCGACGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGG
CGTGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGACCGTGCACTCGTGCTGA
mRNA sequence
ATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTCACGTCTGTCTTTGATCGCCTCAAAGTAACAAA
CGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGCATCCCGTCACGTATGAAGAGGAAGTTTT
CTGTTCTCATAAATACGGAAGGTTCCTTGAAGGATCTGATCAAGACCATGACAAAGATAAGAGCTTTTAAATGTAAAAGCTCCTTGTCGCAAGAGCCTAAACTGCATGAT
GCTCCTAGCCCACACGAGCTTAAAAGCGAATGGAAGTTGCTTCCTCCAAGTTCGAAGGTTCCCACGCGCTTCGCTGCAGTTCCTTCCCCCAAATTCGAAGGTTCTCACGC
GCTTCGCTGCAGTTCCTTCCCTCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGTTTCACTGCAGTTCTTTCCTC
ACAGTTCGAAGGTTCTCACGCGCTTCACTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCG
CTTCGCTTCGCTCACGCGCTTCGCTGCGTTCCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCG
TGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGTTTCGCTGCATTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCAAAGTTCG
AAGGTTCTCACGCGCTGCGCTGCAGTTCCTTCCTCAAAGTTCAAAGGTTCTCACGCGCTTCGTTGCAGTTCTTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCA
GTTCCTTCCCCCCAAGTTCGAAGTTCCTTCCTCACAATTCGAAGGTTCTCACGCGCTTCGCTGTTGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCG
CTCATGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTC
CTTCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGTCGCTTCGCTGCGCTCATGCGCTTCGCTGCAGTTCCTTCC
TCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGTTCGAAGGTTCTCATGC
GCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGTTGCATTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCA
AGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTTCTTCCCCAAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTT
CGTTGCAGTTCCTTCCCCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAGTTCGAAGATTCTCACGCACTTCGCTGCAGTTCCTTCCTCCAAG
TTCAAAGGTTCTCACGCGCTTCGCTGCATTCCAGCGCTACTTCCTAAAGTCCAAAGACGTCAATTGTCCTCACGCTACGCTGCTTCCTTCTCCAAGTTCGAGGGTCCTCA
TGCTACGCTCGGCTACATTGCTGCGCTACTTCCTAAAGTCCAAAGACGTCAATTGTCCCTGCACTCATGCTGAAAAGGGCATGGCGGCGACACAAGTCCAAGGACATGTC
CCAAAGCGAGGAACATGTCCCTGTACTCGTGCTGAAAGGCGCGACGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGG
CGTGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGACCGTGCACTCGTGCTGA
Protein sequence
MFDVHLHSTFSFPKAKCLHIEEKSIFTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKDLIKTMTKIRAFKCKSSLSQEPKLHD
APSPHELKSEWKLLPPSSKVPTRFAAVPSPKFEGSHALRCSSFPPSSKVLTSLRCSSFLQVRRFSRVSLQFFPHSSKVLTRFTAVPSPQVRRFSRRFAEFLPPSLKVLTS
LRFAHALRCVPSSKFEGSHTLRSAIPSPKFEGSHALRAVPSSKFEGSHAFRCISFPPNSKVLTRFAAVPSSKFEGSHALRCSSFLKVQRFSRASLQFFPPSSKVLTRFAA
VPSPQVRSSFLTIRRFSRASLLFLPPKFEGSHVASLRSCASLQFLPPSLKVLTSLRCDPSSKFEGSHALRSAIPSPSSKVLTRFVQFLPPNSKVLTRRFAALMRFAAVPS
SKFEGSHIASLRSFLQVRRFSRASLCNSFPKFEGSHALRAVPSSKFEGSHALRCISFPPNSKVLTRFAAVPSSKFEGSHALRSAISSPKFEGSHALRAVPSSKFEGSHAL
RCSSFPPNSKVLTRFAAVPSPQVRRFSRTSLQFLPPSSKVLTRFAAFQRYFLKSKDVNCPHATLLPSPSSRVLMLRSATLLRYFLKSKDVNCPCTHAEKGMAATQVQGHV
PKRGTCPCTRAERRDGGTSPRNMSQLKEHVRALVLKGVAAAQVQGTCPNSRNMTVHSC
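
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the record itself does not state which table was used. The helper `translate` is hypothetical and uses only the Python standard library.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Codons with unexpected characters are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nucleotides of the CDS sequence listed above.
cds_start = "ATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATC"
print(translate(cds_start))
```

The translated prefix matches the first 20 residues of the protein sequence (`MFDVHLHSTFSFPKAKCLHI`). Passing the full CDS to the same function allows a comparison against the whole protein.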