; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000774 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000774
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr4:15685472..15690298
RNA-Seq ExpressionLag0000774
SyntenyLag0000774
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008233 - peptidase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036008.1 Retrotransposon gag protein [Cucumis melo var. makuwa]5.5e-1666.67Show/hide
Query:  MSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL
        MST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M + + K F E N D K+HS +PSRMKRK SV INTEGSL
Subjt:  MSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL

KAA0041559.1 hypothetical protein E6C27_scaffold93G00150 [Cucumis melo var. makuwa]2.5e-1641.46Show/hide
Query:  TRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKFEGSSLYPAALFLLQVRGFSVV
        TR S F+RLS+STSKK+R STS FDRLK+ NDQ +R+M +L+ K F E N D K+HS +PSRMKRK SV INTEG+      ++    L L   RG S  
Subjt:  TRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKFEGSSLYPAALFLLQVRGFSVV

Query:  RLLRCSSSKCEGSYVVRCCIVPSSLKFDGSHAALL--------------EFLLPNSKVLTCFAA
            C S +    +++ C        F  SH   L              E+L  +  +LTC AA
Subjt:  RLLRCSSSKCEGSYVVRCCIVPSSLKFDGSHAALL--------------EFLLPNSKVLTCFAA

KAA0044978.1 retrotransposon gag protein [Cucumis melo var. makuwa]7.2e-1667.5Show/hide
Query:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL
        ST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K+HS +PSRMKRK SV INTEGSL
Subjt:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL

KAA0050734.1 gag protease polyprotein [Cucumis melo var. makuwa]7.2e-1667.5Show/hide
Query:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL
        ST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K+HS +PSRMKRK SV INTEGSL
Subjt:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL

TYK08944.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.6e-1566.25Show/hide
Query:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL
        ST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K+H+ +PSRMKRK SV INTEGSL
Subjt:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL

TrEMBL top hitse value%identityAlignment
A0A5A7SZJ7 Retrotransposon gag protein2.7e-1666.67Show/hide
Query:  MSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL
        MST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M + + K F E N D K+HS +PSRMKRK SV INTEGSL
Subjt:  MSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL

A0A5A7TGM1 Retrotransposon gag protein7.8e-1666.25Show/hide
Query:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL
        ST TR SAF+ LS+STSKK R STS FDRLK+ NDQ +R+M +L++K F E N D K+HS +PSRMKRK SV INTEGSL
Subjt:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL

A0A5A7TJM9 Uncharacterized protein1.2e-1641.46Show/hide
Query:  TRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKFEGSSLYPAALFLLQVRGFSVV
        TR S F+RLS+STSKK+R STS FDRLK+ NDQ +R+M +L+ K F E N D K+HS +PSRMKRK SV INTEG+      ++    L L   RG S  
Subjt:  TRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKFEGSSLYPAALFLLQVRGFSVV

Query:  RLLRCSSSKCEGSYVVRCCIVPSSLKFDGSHAALL--------------EFLLPNSKVLTCFAA
            C S +    +++ C        F  SH   L              E+L  +  +LTC AA
Subjt:  RLLRCSSSKCEGSYVVRCCIVPSSLKFDGSHAALL--------------EFLLPNSKVLTCFAA

A0A5A7TQ06 Retrotransposon gag protein3.5e-1667.5Show/hide
Query:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL
        ST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K+HS +PSRMKRK SV INTEGSL
Subjt:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL

A0A5D3BBF9 Gag protease polyprotein3.5e-1667.5Show/hide
Query:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL
        ST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K+HS +PSRMKRK SV INTEGSL
Subjt:  STSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTCAACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATCA
ACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGATAAGAAGCTTCATAGTAGCATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTC
TCATAAATACGGAAGGTTCCTTGAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACGACTGCTACGTTGTTCC
TCCTCCAAGTGCGAAGGATCTTATGTGGTGCGTTGTTGCATTGTTCCATCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCAAATTCGAA
GGTTCTCACATGCTTCGCTGCAGTTCCTTCTTCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTGCTTCGCTGCAG
TTCCTTCTTTCCAAGGTCGAAGGTTCTCACGTGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTT
CTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGTTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGTTCCTTCCCCCAAGTTCGAAGGTTCTCA
CGCGCTTCGCTGCAGTTTCTTCTCCCTAAGTTCGAAGGTTCTCACGCGCTTCGATGCAGTTCCTTCCTCCCTAAGTTCGAAGTTCCTTCCTCCAAGTTCGAAGGTTCTCA
AGCGCTTCGCTGCACAGTTCCTTCCTCCAAGTTTGAAGGTTCTCATGCACTTCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCT
CCAAGTTCGAAGGTTCCCTCACGCGCTTCACTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTATCTCCGTTGCTAC
CTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTC
TCTCCACTGCTCCTTCTCCAAGTTCGAAGGTGCTTCTCTCCACCCCTCTTTTTGAAGGTTCGCCACTGAGGTTCTCCTTCTCCAAGTTCGAAGGTTCACCGTTGCTCCTT
TTCAAATGTTTGGCGGCAGTTGACGTCCTCGTTCTGCTTCATCTTCAAATGTTGACAGGTGGTGAAGTCACTGCAATTGAATCTGATGACGACCGTTGTAGGCGAGTCGA
GTCTGGTGACCACCCCTGCAGGTTACTCAGATCACCCAATAAAATGGGACTGGGTCTAGCAGGAGTGCATGAAGGCGAATCTGGTGACTACCCCTGCAGGTTACTCAGAT
CACCCAATAAAATGGGGACTGGTCTAGCAGGAGTGCATGAAGGCGAATCTGGTGACTACCCCTGCAGGTTACTCAGATCACCCAATGAAATAGGGGACTGGTCTAGCAGG
AGTGAGATCACTGCAAGTGAAGCTGGTGGCGACCGTTGCACGAAGCTTACACATTCAAGCAAGAAAGAAACAAATTGCAATAATATATGTGCAAAGAAAAGAAATGGAAG
CCAAAAGCTCTATACCGATCCAACAAATCAACAAGCCAACAGGCCGATCAAAGAGATCATCAAGTCAACAGACCGATCATCCAGGAGGATCAACAAGCTAACAACCGATC
CAACAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAAGTCAGCAGGCCGATCATCCAAGAGGATCAACAAGCTAACAACCGATCCAACAGATCATCAAGCCAAC
AGGCCGATCCAAGAGATCATCAAGTCAGCAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGTCGATCCAACAGATCATCAAGCCAACAGGCTGATCCAAGAGATCA
ACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGCCGATCCAACAGATCATCAAGCCAACAGGCTGATC
CAAGAGATCAACAAGACAATCGACTGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCGAAGAGATCAACAAGCCAATCGACCGATCAAGAAGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTCAACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATCA
ACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGATAAGAAGCTTCATAGTAGCATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTC
TCATAAATACGGAAGGTTCCTTGAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACGACTGCTACGTTGTTCC
TCCTCCAAGTGCGAAGGATCTTATGTGGTGCGTTGTTGCATTGTTCCATCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCAAATTCGAA
GGTTCTCACATGCTTCGCTGCAGTTCCTTCTTCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTGCTTCGCTGCAG
TTCCTTCTTTCCAAGGTCGAAGGTTCTCACGTGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTT
CTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCGTTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGTTCCTTCCCCCAAGTTCGAAGGTTCTCA
CGCGCTTCGCTGCAGTTTCTTCTCCCTAAGTTCGAAGGTTCTCACGCGCTTCGATGCAGTTCCTTCCTCCCTAAGTTCGAAGTTCCTTCCTCCAAGTTCGAAGGTTCTCA
AGCGCTTCGCTGCACAGTTCCTTCCTCCAAGTTTGAAGGTTCTCATGCACTTCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCT
CCAAGTTCGAAGGTTCCCTCACGCGCTTCACTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTATCTCCGTTGCTAC
CTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTC
TCTCCACTGCTCCTTCTCCAAGTTCGAAGGTGCTTCTCTCCACCCCTCTTTTTGAAGGTTCGCCACTGAGGTTCTCCTTCTCCAAGTTCGAAGGTTCACCGTTGCTCCTT
TTCAAATGTTTGGCGGCAGTTGACGTCCTCGTTCTGCTTCATCTTCAAATGTTGACAGGTGGTGAAGTCACTGCAATTGAATCTGATGACGACCGTTGTAGGCGAGTCGA
GTCTGGTGACCACCCCTGCAGGTTACTCAGATCACCCAATAAAATGGGACTGGGTCTAGCAGGAGTGCATGAAGGCGAATCTGGTGACTACCCCTGCAGGTTACTCAGAT
CACCCAATAAAATGGGGACTGGTCTAGCAGGAGTGCATGAAGGCGAATCTGGTGACTACCCCTGCAGGTTACTCAGATCACCCAATGAAATAGGGGACTGGTCTAGCAGG
AGTGAGATCACTGCAAGTGAAGCTGGTGGCGACCGTTGCACGAAGCTTACACATTCAAGCAAGAAAGAAACAAATTGCAATAATATATGTGCAAAGAAAAGAAATGGAAG
CCAAAAGCTCTATACCGATCCAACAAATCAACAAGCCAACAGGCCGATCAAAGAGATCATCAAGTCAACAGACCGATCATCCAGGAGGATCAACAAGCTAACAACCGATC
CAACAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAAGTCAGCAGGCCGATCATCCAAGAGGATCAACAAGCTAACAACCGATCCAACAGATCATCAAGCCAAC
AGGCCGATCCAAGAGATCATCAAGTCAGCAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGTCGATCCAACAGATCATCAAGCCAACAGGCTGATCCAAGAGATCA
ACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGCCGATCCAACAGATCATCAAGCCAACAGGCTGATC
CAAGAGATCAACAAGACAATCGACTGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCGAAGAGATCAACAAGCCAATCGACCGATCAAGAAGATTAA
Protein sequenceShow/hide protein sequence
MSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKFEGSSLYPAALFLLQVRGFSVVRLLRCS
SSKCEGSYVVRCCIVPSSLKFDGSHAALLEFLLPNSKVLTCFAAVPSSKFEGSHVASLQFLPPSSKVLTCFAAVPSFQGRRFSRAALQFFLPKFEGSRTSLQFLLPNSKV
LTRFVQFLPPNSKVLTRFAAVPSSKFEVPSPKFEGSHALRCSFFSLSSKVLTRFDAVPSSLSSKFLPPSSKVLKRFAAQFLPPSLKVLMHFVATFLQVRRFSHALLQLLP
PSSKVPSRASLAPSPSSKALLSTAPSPSSKALISVATFLQVRRFSHALLQFLPPSSKVPSRASLAPSPSSKALLSTAPSPSSKVLLSTPLFEGSPLRFSFSKFEGSPLLL
FKCLAAVDVLVLLHLQMLTGGEVTAIESDDDRCRRVESGDHPCRLLRSPNKMGLGLAGVHEGESGDYPCRLLRSPNKMGTGLAGVHEGESGDYPCRLLRSPNEIGDWSSR
SEITASEAGGDRCTKLTHSSKKETNCNNICAKKRNGSQKLYTDPTNQQANRPIKEIIKSTDRSSRRINKLTTDPTDHQANRPIQEIIKSAGRSSKRINKLTTDPTDHQAN
RPIQEIIKSAGRSSKRINKLTSRSNRSSSQQADPRDQQANRPIKKINKSAGRSSKRINKLTSRSNRSSSQQADPRDQQDNRLIKKINKSAGRSSKRSTSQSTDQED