; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001219 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001219
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr4:26955951..26958697
RNA-Seq ExpressionLag0001219
SyntenyLag0001219
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008233 - peptidase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035697.1 retrotransposon gag protein [Cucumis melo var. makuwa]2.2e-0861.67Show/hide
Query:  SASVFDRLQVTNDQPKRKMDNLELKLFNEVNNDEKLHSSILSRVKRKFSVLINTEGSLKV
        S S FDRL++TNDQ +++M  L+ K F E NND+K+HS +LSR+KRK SV INTEGSL V
Subjt:  SASVFDRLQVTNDQPKRKMDNLELKLFNEVNNDEKLHSSILSRVKRKFSVLINTEGSLKV

KAA0044978.1 retrotransposon gag protein [Cucumis melo var. makuwa]6.5e-0860Show/hide
Query:  SASVFDRLQVTNDQPKRKMDNLELKLFNEVNNDEKLHSSILSRVKRKFSVLINTEGSLKV
        S S FDRL++TNDQ +R+M +L+ K F+E N+D+K+HS + SR+KRK SV INTEGSL V
Subjt:  SASVFDRLQVTNDQPKRKMDNLELKLFNEVNNDEKLHSSILSRVKRKFSVLINTEGSLKV

KAA0051997.1 hypothetical protein E6C27_scaffold60G004810 [Cucumis melo var. makuwa]5.3e-1047.67Show/hide
Query:  MFGVHLHSTFSFPKAKSLHVEEKSISASVFDRLQVTNDQPKRKMDNLELKLFNEVNNDEKLHSSILSRVKRKFSVLINTEGSLKVL
        M  ++LH  FSF K K    ++   S S FDRL++T DQ +R+M +L++K F+E N+D+K+HS + SR+KRK SV INTEG+   L
Subjt:  MFGVHLHSTFSFPKAKSLHVEEKSISASVFDRLQVTNDQPKRKMDNLELKLFNEVNNDEKLHSSILSRVKRKFSVLINTEGSLKVL

KAA0068201.1 retrotransposon gag protein [Cucumis melo var. makuwa]6.5e-0860Show/hide
Query:  SASVFDRLQVTNDQPKRKMDNLELKLFNEVNNDEKLHSSILSRVKRKFSVLINTEGSLKV
        S S FDRL++TNDQ +R+M +L+ K F+E N+D+K+HS + SR+KRK SV INTEGSL V
Subjt:  SASVFDRLQVTNDQPKRKMDNLELKLFNEVNNDEKLHSSILSRVKRKFSVLINTEGSLKV

TYK30948.1 hypothetical protein E5676_scaffold455G001730 [Cucumis melo var. makuwa]2.7e-0944.76Show/hide
Query:  MFGVHLHSTFSFPKAKSLHVEEKSISASVFDRLQVTNDQPKRKMDNLELKLFNEVNNDEKLHSSILSRVKRKFSVLINTEGSLKVLTLRCFLLQVQRFSR
        M  V+ HS FSF KAK LH+EE   S S FD L++TNDQ +R+M  L+ K F+E NND+K++S + S +KRK  V IN +  L   + R F++     S 
Subjt:  MFGVHLHSTFSFPKAKSLHVEEKSISASVFDRLQVTNDQPKRKMDNLELKLFNEVNNDEKLHSSILSRVKRKFSVLINTEGSLKVLTLRCFLLQVQRFSR

Query:  ISLQF
         +L F
Subjt:  ISLQF

TrEMBL top hitse value%identityAlignment
A0A5A7TQ06 Retrotransposon gag protein3.2e-0860Show/hide
Query:  SASVFDRLQVTNDQPKRKMDNLELKLFNEVNNDEKLHSSILSRVKRKFSVLINTEGSLKV
        S S FDRL++TNDQ +R+M +L+ K F+E N+D+K+HS + SR+KRK SV INTEGSL V
Subjt:  SASVFDRLQVTNDQPKRKMDNLELKLFNEVNNDEKLHSSILSRVKRKFSVLINTEGSLKV

A0A5A7UC34 Uncharacterized protein2.6e-1047.67Show/hide
Query:  MFGVHLHSTFSFPKAKSLHVEEKSISASVFDRLQVTNDQPKRKMDNLELKLFNEVNNDEKLHSSILSRVKRKFSVLINTEGSLKVL
        M  ++LH  FSF K K    ++   S S FDRL++T DQ +R+M +L++K F+E N+D+K+HS + SR+KRK SV INTEG+   L
Subjt:  MFGVHLHSTFSFPKAKSLHVEEKSISASVFDRLQVTNDQPKRKMDNLELKLFNEVNNDEKLHSSILSRVKRKFSVLINTEGSLKVL

A0A5A7VRG7 Retrotransposon gag protein3.2e-0860Show/hide
Query:  SASVFDRLQVTNDQPKRKMDNLELKLFNEVNNDEKLHSSILSRVKRKFSVLINTEGSLKV
        S S FDRL++TNDQ +R+M +L+ K F+E N+D+K+HS + SR+KRK SV INTEGSL V
Subjt:  SASVFDRLQVTNDQPKRKMDNLELKLFNEVNNDEKLHSSILSRVKRKFSVLINTEGSLKV

A0A5D3E4T1 Retrotransposon gag protein1.1e-0861.67Show/hide
Query:  SASVFDRLQVTNDQPKRKMDNLELKLFNEVNNDEKLHSSILSRVKRKFSVLINTEGSLKV
        S S FDRL++TNDQ +++M  L+ K F E NND+K+HS +LSR+KRK SV INTEGSL V
Subjt:  SASVFDRLQVTNDQPKRKMDNLELKLFNEVNNDEKLHSSILSRVKRKFSVLINTEGSLKV

A0A5D3E580 Uncharacterized protein1.3e-0944.76Show/hide
Query:  MFGVHLHSTFSFPKAKSLHVEEKSISASVFDRLQVTNDQPKRKMDNLELKLFNEVNNDEKLHSSILSRVKRKFSVLINTEGSLKVLTLRCFLLQVQRFSR
        M  V+ HS FSF KAK LH+EE   S S FD L++TNDQ +R+M  L+ K F+E NND+K++S + S +KRK  V IN +  L   + R F++     S 
Subjt:  MFGVHLHSTFSFPKAKSLHVEEKSISASVFDRLQVTNDQPKRKMDNLELKLFNEVNNDEKLHSSILSRVKRKFSVLINTEGSLKVLTLRCFLLQVQRFSR

Query:  ISLQF
         +L F
Subjt:  ISLQF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTTATATGAACACCTTCACCAAGTACTCGTCCTTCGGTATTCCAAAGAATGAGTATGGCCACGACAGAAGAAGAAAATCAATGTTTGGTGTCCACCTCCACTCAAC
CTTCAGCTTTCCAAAGGCTAAGAGTCTCCACGTCGAAGAAAAGTCAATTTCAGCATCTGTCTTCGATCGTCTCCAAGTAACAAACGATCAACCTAAAAGAAAGATGGACA
ACTTGGAGTTGAAACTTTTCAATGAAGTAAACAACGACGAGAAACTTCATAGTAGCATCCTGTCACGTGTGAAGAGGAAGTTTTCTGTTCTCATAAATACGGAAGGTTCC
TTGAAGGTCCTCACGCTGCGTTGCTTCCTTCTCCAAGTTCAAAGGTTCTCACGCATTTCGCTGCAGTTCCTTCTCACCAAGTTTGAAGTTCCCTCTCTCCAAGTTCGAAG
GTTCTCACGCGCTTCGCTGCAGTTCCTTCTCCCCAAGTTCAAGGGTCCATCGTTGTACGCTGCTGCATTGTTCCTTCTCCAAGTTCGAAGGTTCTCAGTTGTACAACTGC
TACGTTGTTCCTCCTCCAAGTGTGAAGGATCTTATGTTGTGCGCTGCTGCATTGTTCCCTCTTCTCTCAAGTTCGAAGGTTCTCACGCTGCTTCGCTGGAGTTCCTTCTC
CCCAAGTTTGAAGGTTCTCACGCGCTTCGCTGCAGTTCATTCTCTCCAAGTTCGAAGGTTCTCACGCATTGCGTTGCAGTTCCTTCTCCCCATGTTCGAAGGCTCACGCA
CTTCGTTGCAGTTCCTTCTCTCAAGTTTGAAGGTTCTCACGCTGCTTCGCTGCAGTTCCTTCTCTCCAAGTTCGAAGATTCTCATGTGCTTCGTTGCAGATCCTTCTCTT
CAAGTTCGAAGGTTCACACGTTGCGCTGCTTCTTCACCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAATTCCTTCTCTCCAAGTTTGAAGGTTCTCACGCGCTTCGTT
GCAGTTCCTTCTCCCCAAGTTCAAAAGTTCACCCGTTGTATGCTGCTATTTCTTTTCAAGTTCGAAGGTTCACACGTTGCGCTGTTCCTTCACCAAGTTCGAAGGTTCTC
ATGCTGCATTGCTTCCTTCTCCAAGTTTGAAGGTTCTCATGCTACGTTAGATTGCGCTGCTGTGCTACTTCCTTCTTTAAGTTTGAAGGTTTCCACATTGCACTGTTGTG
CTACTTCCTTCTCCAAGTTTGAAGGTTCTCACGCTGCTCTGCTTCCTTCACCAAGTTCGAAGGTTCTTCGTTGTATTGTTGCACGATGTTCCCTTCTCTCTCCAAGTTTG
AAAGTTCTGACGCTGCACTGCTTCTTTCACCAAGTTCGAAGGTTCTCACGCTGTTCCTTCTCCAAGTTCGAAGGTCCTCACGCTGTGTTGCTTCATTCTCCAAGTTCGAA
GGTTTTGACGTTGCGCTGCTTCCTTCACCAAGTTCGAAGGTTCTCACGCTGCACTGTTTCACTGTTCCTTCTCCAAGTTCGAAGGTTTTCATGCTGCTTCGTTGTTCCTT
CTTCAAGTTCGAAGGTTCTCACGCTGGGCTGTTTCTCTGTTCCTTCTCCAAGTTCAAAGGTTCTCATGATGTTTCGTTGTTCCTTCTTCAAGTTCGAAGGTTCTCACGCT
GCGCTGTTGCGCTTCTTCCTTCTCCACGTTCGAAAGTTCTCGTGCTACACTGCTTCCTTCTCCAAGTTTGAAGGTTCTGACGTTGCGCTGCTTCCTTCACCAAGTTCGAA
GGTTCTCAAGTTGCGCTGTTTTGTTGTTCCTTCTCCAAGTTTGAAGGTTCTCATGTTGCTTCGTTGTTCCTTCTTCAAGTTCGAAGGTTCTCATGCTGCGCTGTTTCGCT
GTTCCTTCTCCAAGTTCGAAGGTTCTCATGTTGCTTCGTTGTTTCTTCTTCAAGTTCTAAGGTTCTCACGCTGCGCTGTTACGCTGCTTCATTCTCCAAGTTCAAAAATC
CGCACGCTGTGTTTCTGCGCGCTGCTTCATTCTCTCTCAAAGTTCTCACGTTGCTGCGCTGCCAGTCCATTCTCTCTCCAAGTTCGAAGGTTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTGTTATATGAACACCTTCACCAAGTACTCGTCCTTCGGTATTCCAAAGAATGAGTATGGCCACGACAGAAGAAGAAAATCAATGTTTGGTGTCCACCTCCACTCAAC
CTTCAGCTTTCCAAAGGCTAAGAGTCTCCACGTCGAAGAAAAGTCAATTTCAGCATCTGTCTTCGATCGTCTCCAAGTAACAAACGATCAACCTAAAAGAAAGATGGACA
ACTTGGAGTTGAAACTTTTCAATGAAGTAAACAACGACGAGAAACTTCATAGTAGCATCCTGTCACGTGTGAAGAGGAAGTTTTCTGTTCTCATAAATACGGAAGGTTCC
TTGAAGGTCCTCACGCTGCGTTGCTTCCTTCTCCAAGTTCAAAGGTTCTCACGCATTTCGCTGCAGTTCCTTCTCACCAAGTTTGAAGTTCCCTCTCTCCAAGTTCGAAG
GTTCTCACGCGCTTCGCTGCAGTTCCTTCTCCCCAAGTTCAAGGGTCCATCGTTGTACGCTGCTGCATTGTTCCTTCTCCAAGTTCGAAGGTTCTCAGTTGTACAACTGC
TACGTTGTTCCTCCTCCAAGTGTGAAGGATCTTATGTTGTGCGCTGCTGCATTGTTCCCTCTTCTCTCAAGTTCGAAGGTTCTCACGCTGCTTCGCTGGAGTTCCTTCTC
CCCAAGTTTGAAGGTTCTCACGCGCTTCGCTGCAGTTCATTCTCTCCAAGTTCGAAGGTTCTCACGCATTGCGTTGCAGTTCCTTCTCCCCATGTTCGAAGGCTCACGCA
CTTCGTTGCAGTTCCTTCTCTCAAGTTTGAAGGTTCTCACGCTGCTTCGCTGCAGTTCCTTCTCTCCAAGTTCGAAGATTCTCATGTGCTTCGTTGCAGATCCTTCTCTT
CAAGTTCGAAGGTTCACACGTTGCGCTGCTTCTTCACCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAATTCCTTCTCTCCAAGTTTGAAGGTTCTCACGCGCTTCGTT
GCAGTTCCTTCTCCCCAAGTTCAAAAGTTCACCCGTTGTATGCTGCTATTTCTTTTCAAGTTCGAAGGTTCACACGTTGCGCTGTTCCTTCACCAAGTTCGAAGGTTCTC
ATGCTGCATTGCTTCCTTCTCCAAGTTTGAAGGTTCTCATGCTACGTTAGATTGCGCTGCTGTGCTACTTCCTTCTTTAAGTTTGAAGGTTTCCACATTGCACTGTTGTG
CTACTTCCTTCTCCAAGTTTGAAGGTTCTCACGCTGCTCTGCTTCCTTCACCAAGTTCGAAGGTTCTTCGTTGTATTGTTGCACGATGTTCCCTTCTCTCTCCAAGTTTG
AAAGTTCTGACGCTGCACTGCTTCTTTCACCAAGTTCGAAGGTTCTCACGCTGTTCCTTCTCCAAGTTCGAAGGTCCTCACGCTGTGTTGCTTCATTCTCCAAGTTCGAA
GGTTTTGACGTTGCGCTGCTTCCTTCACCAAGTTCGAAGGTTCTCACGCTGCACTGTTTCACTGTTCCTTCTCCAAGTTCGAAGGTTTTCATGCTGCTTCGTTGTTCCTT
CTTCAAGTTCGAAGGTTCTCACGCTGGGCTGTTTCTCTGTTCCTTCTCCAAGTTCAAAGGTTCTCATGATGTTTCGTTGTTCCTTCTTCAAGTTCGAAGGTTCTCACGCT
GCGCTGTTGCGCTTCTTCCTTCTCCACGTTCGAAAGTTCTCGTGCTACACTGCTTCCTTCTCCAAGTTTGAAGGTTCTGACGTTGCGCTGCTTCCTTCACCAAGTTCGAA
GGTTCTCAAGTTGCGCTGTTTTGTTGTTCCTTCTCCAAGTTTGAAGGTTCTCATGTTGCTTCGTTGTTCCTTCTTCAAGTTCGAAGGTTCTCATGCTGCGCTGTTTCGCT
GTTCCTTCTCCAAGTTCGAAGGTTCTCATGTTGCTTCGTTGTTTCTTCTTCAAGTTCTAAGGTTCTCACGCTGCGCTGTTACGCTGCTTCATTCTCCAAGTTCAAAAATC
CGCACGCTGTGTTTCTGCGCGCTGCTTCATTCTCTCTCAAAGTTCTCACGTTGCTGCGCTGCCAGTCCATTCTCTCTCCAAGTTCGAAGGTTCTAA
Protein sequenceShow/hide protein sequence
MCYMNTFTKYSSFGIPKNEYGHDRRRKSMFGVHLHSTFSFPKAKSLHVEEKSISASVFDRLQVTNDQPKRKMDNLELKLFNEVNNDEKLHSSILSRVKRKFSVLINTEGS
LKVLTLRCFLLQVQRFSRISLQFLLTKFEVPSLQVRRFSRASLQFLLPKFKGPSLYAAALFLLQVRRFSVVQLLRCSSSKCEGSYVVRCCIVPSSLKFEGSHAASLEFLL
PKFEGSHALRCSSFSPSSKVLTHCVAVPSPHVRRLTHFVAVPSLKFEGSHAASLQFLLSKFEDSHVLRCRSFSSSSKVHTLRCFFTKFEGSHALRCNSFSPSLKVLTRFV
AVPSPQVQKFTRCMLLFLFKFEGSHVALFLHQVRRFSCCIASFSKFEGSHATLDCAAVLLPSLSLKVSTLHCCATSFSKFEGSHAALLPSPSSKVLRCIVARCSLLSPSL
KVLTLHCFFHQVRRFSRCSFSKFEGPHAVLLHSPSSKVLTLRCFLHQVRRFSRCTVSLFLLQVRRFSCCFVVPSSSSKVLTLGCFSVPSPSSKVLMMFRCSFFKFEGSHA
ALLRFFLLHVRKFSCYTASFSKFEGSDVALLPSPSSKVLKLRCFVVPSPSLKVLMLLRCSFFKFEGSHAALFRCSFSKFEGSHVASLFLLQVLRFSRCAVTLLHSPSSKI
RTLCFCALLHSLSKFSRCCAASPFSLQVRRF