; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0040887 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0040887
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr13:9281052..9285993
RNA-Seq ExpressionLag0040887
SyntenyLag0040887
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008233 - peptidase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036008.1 Retrotransposon gag protein [Cucumis melo var. makuwa]4.8e-1666.67Show/hide
Query:  MSTSTRPSAFQRLSVSTSKKSQSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL
        MST  R SAF+RLS+STSKK + STS FDRLK+TNDQ +R+M + + K F E N D K+HSR+PSRMKRK SV INTEGSL
Subjt:  MSTSTRPSAFQRLSVSTSKKSQSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL

KAA0044978.1 retrotransposon gag protein [Cucumis melo var. makuwa]6.3e-1667.5Show/hide
Query:  STSTRPSAFQRLSVSTSKKSQSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL
        ST  R SAF+RLS+STSKK + STS FDRLK+TNDQ +R+M +L+ K F E N D K+HSR+PSRMKRK SV INTEGSL
Subjt:  STSTRPSAFQRLSVSTSKKSQSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL

KAA0056218.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]7.4e-1752.07Show/hide
Query:  STSTRPSAFQRLSVSTSKKSQSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKDLIKTMTKIRAFKCKSSLS
        ST TR SAF+RLS+STSKK + ST VFDRLK+TNDQ +R+M  L+ K F E N D K+H+R+PSRMKRK SV IN E                       
Subjt:  STSTRPSAFQRLSVSTSKKSQSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKDLIKTMTKIRAFKCKSSLS

Query:  QEPKLHDAPSPHELKRFSAAA
         EPKLH APSP ELK  + A+
Subjt:  QEPKLHDAPSPHELKRFSAAA

KAA0056374.1 retrotransposon gag protein [Cucumis melo var. makuwa]6.3e-1654.9Show/hide
Query:  TRPSAFQRLSVSTSKKSQSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKDLIKT----------MTKIRAF
        TR SAF+RLS+STSKK + STS FDRLK+ NDQ +R+M +L+ KLF E N D K+HSR+PS MKRK SV INTEG +  L++           M +IR F
Subjt:  TRPSAFQRLSVSTSKKSQSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKDLIKT----------MTKIRAF

Query:  KC
         C
Subjt:  KC

KAA0065966.1 hypothetical protein E6C27_scaffold62G00430 [Cucumis melo var. makuwa]3.3e-1752.99Show/hide
Query:  STSTRPSAFQRLSVSTSKKSQSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKDLIKTMTKIRAFKCKSSLS
        ST TR SAF+RLS+STSKK + STS FDR K+TN+Q +R++ +L+ KLF E N D K+HSR+PSRMKRK SV INTE                       
Subjt:  STSTRPSAFQRLSVSTSKKSQSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKDLIKTMTKIRAFKCKSSLS

Query:  QEPKLHDAPSPHELKRF
         +PKLH APSP ELK F
Subjt:  QEPKLHDAPSPHELKRF

TrEMBL top hitse value%identityAlignment
A0A5A7SZJ7 Retrotransposon gag protein2.3e-1666.67Show/hide
Query:  MSTSTRPSAFQRLSVSTSKKSQSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL
        MST  R SAF+RLS+STSKK + STS FDRLK+TNDQ +R+M + + K F E N D K+HSR+PSRMKRK SV INTEGSL
Subjt:  MSTSTRPSAFQRLSVSTSKKSQSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL

A0A5A7UM99 Ty3-gypsy retrotransposon protein3.6e-1752.07Show/hide
Query:  STSTRPSAFQRLSVSTSKKSQSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKDLIKTMTKIRAFKCKSSLS
        ST TR SAF+RLS+STSKK + ST VFDRLK+TNDQ +R+M  L+ K F E N D K+H+R+PSRMKRK SV IN E                       
Subjt:  STSTRPSAFQRLSVSTSKKSQSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKDLIKTMTKIRAFKCKSSLS

Query:  QEPKLHDAPSPHELKRFSAAA
         EPKLH APSP ELK  + A+
Subjt:  QEPKLHDAPSPHELKRFSAAA

A0A5A7VHY3 Uncharacterized protein1.6e-1752.99Show/hide
Query:  STSTRPSAFQRLSVSTSKKSQSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKDLIKTMTKIRAFKCKSSLS
        ST TR SAF+RLS+STSKK + STS FDR K+TN+Q +R++ +L+ KLF E N D K+HSR+PSRMKRK SV INTE                       
Subjt:  STSTRPSAFQRLSVSTSKKSQSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKDLIKTMTKIRAFKCKSSLS

Query:  QEPKLHDAPSPHELKRF
         +PKLH APSP ELK F
Subjt:  QEPKLHDAPSPHELKRF

A0A5D3BBF9 Gag protease polyprotein3.0e-1667.5Show/hide
Query:  STSTRPSAFQRLSVSTSKKSQSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL
        ST  R SAF+RLS+STSKK + STS FDRLK+TNDQ +R+M +L+ K F E N D K+HSR+PSRMKRK SV INTEGSL
Subjt:  STSTRPSAFQRLSVSTSKKSQSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSL

A0A5D3DZF3 Retrotransposon gag protein3.0e-1654.9Show/hide
Query:  TRPSAFQRLSVSTSKKSQSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKDLIKT----------MTKIRAF
        TR SAF+RLS+STSKK + STS FDRLK+ NDQ +R+M +L+ KLF E N D K+HSR+PS MKRK SV INTEG +  L++           M +IR F
Subjt:  TRPSAFQRLSVSTSKKSQSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKDLIKT----------MTKIRAF

Query:  KC
         C
Subjt:  KC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTTAGTGTCTCCACATCGAAGAAAAGTCAATCTTCAACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATCA
ACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGAATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTC
TCATAAATACGGAAGGTTCCTTGAAGGATCTGATCAAGACCATGACAAAGATAAGAGCTTTTAAATGTAAAAGCTCCTTATCGCAAGAGCCTAAACTGCATGATGCTCCT
AGCCCACACGAGCTTAAAAGGTTCTCCGCTGCTGCAGTTCATTCTTCCAAGTTCAAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCTAAGTTCGAAGGTTCTCAC
ATGCTTCGCTGTAGTTCCTTCTCTCCAAGTTCGAAGGCGCTTCTCTTCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCATGCGCTGCTGCAGTTCCTTCCTCCAAG
TTCGAAGGTTCTCTCATGCGCTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCACTTCTCTCCACTGCT
CCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCATGCGCTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCATG
CGCTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCATGCGCTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAG
TTCGAAGGCGCTTCTCTCCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCATGCGCTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCATGCGCTGCTGCAG
TTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCACTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTC
CGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCATGCGCTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCATGCGCTGCTGCAGTTCCTTCCTCCAAGTTCG
AAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCACTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCACGCGCTTCGATGCAGTTCC
TTCCTCCCTAAGTTCGAAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGCTGCACAGTTCTTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGTTGCTACCT
TCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCATTCTCCAAGTTCGAAGGCGCTTCTC
TCCACTGCTCCTTCTCCAAGTTCGAGGGCGCTTCTCTCCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGTTCCTTCCTCCAAGTTTGAAGGT
TCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCCCCAAGTTCTC
TCCACTGCTCCTTCTCCAAGTTCGAAGGTGCTTCTCTCCACCCCTCTTTTTGAAGTTGACGGCGTCCGCTTCGCTTCATCTTCAAAAATTGACTGTTGATAACTTCACTT
CATATTCAAAAGTTGACGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAGTCACTGCAATTGAATCTGATGACGACCGTTGAAGGCGAGTCGGGTCTG
GTGACCACCCCTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCATCGCTGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGC
AGGAGTGCATGAAGGCGAATCTGGTGACTACCCCTGCAGGTTACTCAGATCACCCAATGAAATAGGGGACTGGTCTAGCAGGAGTGATATCACTGCAAGCGAATTTGATC
ATCAAGCCAACAGGCCGATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATC
AAGAAGATCAACAAGTTAGCAGGCCGATCATCCAGGAAGATCAACAAGCTAACAAGCCGATCCAACAGATCATCAAGCCAACAGGCTGATCCAAGAGATCAACAAGCCAA
CCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAGGAAG
ATCAACAAGCCAATAAGCCGATCCAAGAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAACCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTTAGTGTCTCCACATCGAAGAAAAGTCAATCTTCAACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATCA
ACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGAATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTC
TCATAAATACGGAAGGTTCCTTGAAGGATCTGATCAAGACCATGACAAAGATAAGAGCTTTTAAATGTAAAAGCTCCTTATCGCAAGAGCCTAAACTGCATGATGCTCCT
AGCCCACACGAGCTTAAAAGGTTCTCCGCTGCTGCAGTTCATTCTTCCAAGTTCAAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCTAAGTTCGAAGGTTCTCAC
ATGCTTCGCTGTAGTTCCTTCTCTCCAAGTTCGAAGGCGCTTCTCTTCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCATGCGCTGCTGCAGTTCCTTCCTCCAAG
TTCGAAGGTTCTCTCATGCGCTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCACTTCTCTCCACTGCT
CCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCATGCGCTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCATG
CGCTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCATGCGCTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAG
TTCGAAGGCGCTTCTCTCCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCATGCGCTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCATGCGCTGCTGCAG
TTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCACTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTC
CGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCATGCGCTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCATGCGCTGCTGCAGTTCCTTCCTCCAAGTTCG
AAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCACTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCACGCGCTTCGATGCAGTTCC
TTCCTCCCTAAGTTCGAAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGCTGCACAGTTCTTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGTTGCTACCT
TCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCATTCTCCAAGTTCGAAGGCGCTTCTC
TCCACTGCTCCTTCTCCAAGTTCGAGGGCGCTTCTCTCCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGTTCCTTCCTCCAAGTTTGAAGGT
TCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCCCCAAGTTCTC
TCCACTGCTCCTTCTCCAAGTTCGAAGGTGCTTCTCTCCACCCCTCTTTTTGAAGTTGACGGCGTCCGCTTCGCTTCATCTTCAAAAATTGACTGTTGATAACTTCACTT
CATATTCAAAAGTTGACGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAGTCACTGCAATTGAATCTGATGACGACCGTTGAAGGCGAGTCGGGTCTG
GTGACCACCCCTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCATCGCTGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGC
AGGAGTGCATGAAGGCGAATCTGGTGACTACCCCTGCAGGTTACTCAGATCACCCAATGAAATAGGGGACTGGTCTAGCAGGAGTGATATCACTGCAAGCGAATTTGATC
ATCAAGCCAACAGGCCGATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATC
AAGAAGATCAACAAGTTAGCAGGCCGATCATCCAGGAAGATCAACAAGCTAACAAGCCGATCCAACAGATCATCAAGCCAACAGGCTGATCCAAGAGATCAACAAGCCAA
CCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAGGAAG
ATCAACAAGCCAATAAGCCGATCCAAGAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAACCTAG
Protein sequenceShow/hide protein sequence
MSTSTRPSAFQRLSVSTSKKSQSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKDLIKTMTKIRAFKCKSSLSQEPKLHDAP
SPHELKRFSAAAVHSSKFKGSHVASLQFLPLSSKVLTCFAVVPSLQVRRRFSSLLPSSKFEGSLMRCCSSFLQVRRFSHALLQFLPPSSKVPSRASLAPSPSSKALLSTA
PSPSSKALLSVATFLQVRRFSHALLQFLPPSSKVLSCAAAVPSSKFEGSLMRCCSSFLQVRRFPHALRSLLLQVRRRFSPLLPSSKFEGSLMRCCSSFLQVRRFSHALLQ
FLPPSSKVPSRASLAPSPSSKALLSTAPSPSSKALLSVATFLQVRRFSHALLQFLPPSSKVLSCAAAVPSSKFEGSLTRFARSFSKFEGTSLHCSFSKFEGASHALRCSS
FLPKFEVPSSKFEGSHALRCTVLSSKFEGSHALRCYLPPSSKVLSRAAAAPSSKFEGSLTRFARSFSKFEGASLHCSFSKFEGASLRCYLPPSSKVLSRAAAVPSSKFEG
SLTRFARSFSKFEGASLRCYFSKFEGASLHCSFPKFSPLLLLQVRRCFSPPLFLKLTASASLHLQKLTVDNFTSYSKVDGNYSHQSDWSRQVVKSLQLNLMTTVEGESGL
VTTPAGYSDHPIKWGLGLAGVHRCYSDHPIKWGLGLAGVHEGESGDYPCRLLRSPNEIGDWSSRSDITASEFDHQANRPIQEINKPTDRSRRSTSQQADHPRDQQANRPI
KKINKLAGRSSRKINKLTSRSNRSSSQQADPRDQQANRPIKKINKSAGRSSKRSTSQPTDQEDQQVSRPIIQEDQQANKPIQEIIKPTGRSKRSST