; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0025128 (gene) of Chayote v1 genome

Gene IDSed0025128
OrganismSechium edule (Chayote v1)
DescriptionGag protease polyprotein
Genome locationLG04:15935425..15936104
RNA-Seq ExpressionSed0025128
SyntenySed0025128
Gene Ontology termsGO:0016787 - hydrolase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025469.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.8e-1134.19Show/hide
Query:  EAAMPQGAALGGDVSAMRDMMMGLMAKQRAETQSSSEPKAAPVAPVAQPEGQGQQDQADSLDSAEGAAEFGRWLKDFTKLKPKEFDAS-ENPLVAVRWLA
        +A  P       D++AM      L+ + R + Q +S   A   AP   P     Q   D L SAE      + L+DF K  P  FD S ++P  A  WL+
Subjt:  EAAMPQGAALGGDVSAMRDMMMGLMAKQRAETQSSSEPKAAPVAPVAQPEGQGQQDQADSLDSAEGAAEFGRWLKDFTKLKPKEFDAS-ENPLVAVRWLA

Query:  RMEYIFRVMGCPDVQRPRCGAHVLTDIAEWWWESTDRARPAGSSPATWEFFRTKF
         +E IFR M CP+ Q+ +C   +LTD    WWE+T+R      S  TW+ F+  F
Subjt:  RMEYIFRVMGCPDVQRPRCGAHVLTDIAEWWWESTDRARPAGSSPATWEFFRTKF

KAA0039053.1 gag protease polyprotein [Cucumis melo var. makuwa]1.8e-1133.53Show/hide
Query:  RPLVAPGANLPREGVEAAMPQGAALGGDVSAMRDMMMGLMAKQRAETQSSSEPKAAPVAPVAQPEGQGQQDQADSLDSAEGAAEFGRWLKDFTKLKPKEF
        +P V P A  P  G        AA+       RD++M +  +Q+  + + +   A   AP   P     Q   D L SAE      + L+DF K  P  F
Subjt:  RPLVAPGANLPREGVEAAMPQGAALGGDVSAMRDMMMGLMAKQRAETQSSSEPKAAPVAPVAQPEGQGQQDQADSLDSAEGAAEFGRWLKDFTKLKPKEF

Query:  DAS-ENPLVAVRWLARMEYIFRVMGCPDVQRPRCGAHVLTDIAEWWWESTDRARPAGSSPATWEFFRTKF
        D S E+P  A  WL+ +E IFR M CP+ Q+ +C   +LTD    WWE+T+R      S  TW+ F+  F
Subjt:  DAS-ENPLVAVRWLARMEYIFRVMGCPDVQRPRCGAHVLTDIAEWWWESTDRARPAGSSPATWEFFRTKF

KAA0040699.1 pol protein [Cucumis melo var. makuwa]1.1e-1134.17Show/hide
Query:  RSGAGPSRTANRDVVEPAAMVYQARRPLVAPGANLPREGVEAAMPQGAALGGDVSAMRDMMMGLMAKQR-AETQSSSEPKAAPV---APVAQPEGQGQQD
        R GAG +   + D    +   Y  R P V P A  P     AA    A L       RD++M +  +Q+ A    +  P  AP    AP   P     Q 
Subjt:  RSGAGPSRTANRDVVEPAAMVYQARRPLVAPGANLPREGVEAAMPQGAALGGDVSAMRDMMMGLMAKQR-AETQSSSEPKAAPV---APVAQPEGQGQQD

Query:  QADSLDSAEGAAEFGRWLKDFTKLKPKEFDAS-ENPLVAVRWLARMEYIFRVMGCPDVQRPRCGAHVLTDIAEWWWESTDRARPAGSSPATWEFFRTKF
          D L SAE      + L+DF K  P  FD S E+P  A  WL+ +E IFR M CP+ Q+ +C   +LTD    WWE+T+R      S  TW+ F+  F
Subjt:  QADSLDSAEGAAEFGRWLKDFTKLKPKEFDAS-ENPLVAVRWLARMEYIFRVMGCPDVQRPRCGAHVLTDIAEWWWESTDRARPAGSSPATWEFFRTKF

KAA0060256.1 gag protease polyprotein [Cucumis melo var. makuwa]1.4e-1136.23Show/hide
Query:  RDMMMGLMAKQRAETQSSSEPKAAPVAPVAQPEGQGQQDQADSLDSAEGAAEFGRWLKDFTKLKPKEFDAS-ENPLVAVRWLARMEYIFRVMGCPDVQRP
        RDM+M +  +Q+  + + +   A   AP   P     Q  +D L SAE      + L+DF K  P  FD S E+P  A  WL+ +E IFR M CP+ Q+ 
Subjt:  RDMMMGLMAKQRAETQSSSEPKAAPVAPVAQPEGQGQQDQADSLDSAEGAAEFGRWLKDFTKLKPKEFDAS-ENPLVAVRWLARMEYIFRVMGCPDVQRP

Query:  RCGAHVLTDIAEWWWESTDRARPAGSSPATWEFFRTKF
        +C   +LTD    WWE+T+R      S  TW+ F+  F
Subjt:  RCGAHVLTDIAEWWWESTDRARPAGSSPATWEFFRTKF

KAA0062785.1 gag protease polyprotein [Cucumis melo var. makuwa]2.4e-1134.71Show/hide
Query:  RPLVAPGANLPREGVEAAMPQGAALGGDVSAMRDMMMGLMAKQRAETQSSSEPKAAPVAPVAQPEGQGQQDQADSLDSAEGAAEFGRWLKDFTKLKPKEF
        +P V P A  P     AA    A L       RD++M +  +Q+  + + +   A   APV  P     Q   D L SAE      + L+DF K  P  F
Subjt:  RPLVAPGANLPREGVEAAMPQGAALGGDVSAMRDMMMGLMAKQRAETQSSSEPKAAPVAPVAQPEGQGQQDQADSLDSAEGAAEFGRWLKDFTKLKPKEF

Query:  DAS-ENPLVAVRWLARMEYIFRVMGCPDVQRPRCGAHVLTDIAEWWWESTDRARPAGSSPATWEFFRTKF
        D S E+P  A  WL+ +E IFR M CP+ Q+ +C   +LTD    WWE+T+R      S  TW+ F+  F
Subjt:  DAS-ENPLVAVRWLARMEYIFRVMGCPDVQRPRCGAHVLTDIAEWWWESTDRARPAGSSPATWEFFRTKF

TrEMBL top hitse value%identityAlignment
A0A5A7SJH3 Reverse transcriptase8.8e-1234.19Show/hide
Query:  EAAMPQGAALGGDVSAMRDMMMGLMAKQRAETQSSSEPKAAPVAPVAQPEGQGQQDQADSLDSAEGAAEFGRWLKDFTKLKPKEFDAS-ENPLVAVRWLA
        +A  P       D++AM      L+ + R + Q +S   A   AP   P     Q   D L SAE      + L+DF K  P  FD S ++P  A  WL+
Subjt:  EAAMPQGAALGGDVSAMRDMMMGLMAKQRAETQSSSEPKAAPVAPVAQPEGQGQQDQADSLDSAEGAAEFGRWLKDFTKLKPKEFDAS-ENPLVAVRWLA

Query:  RMEYIFRVMGCPDVQRPRCGAHVLTDIAEWWWESTDRARPAGSSPATWEFFRTKF
         +E IFR M CP+ Q+ +C   +LTD    WWE+T+R      S  TW+ F+  F
Subjt:  RMEYIFRVMGCPDVQRPRCGAHVLTDIAEWWWESTDRARPAGSSPATWEFFRTKF

A0A5A7T7V8 Gag protease polyprotein8.8e-1233.53Show/hide
Query:  RPLVAPGANLPREGVEAAMPQGAALGGDVSAMRDMMMGLMAKQRAETQSSSEPKAAPVAPVAQPEGQGQQDQADSLDSAEGAAEFGRWLKDFTKLKPKEF
        +P V P A  P  G        AA+       RD++M +  +Q+  + + +   A   AP   P     Q   D L SAE      + L+DF K  P  F
Subjt:  RPLVAPGANLPREGVEAAMPQGAALGGDVSAMRDMMMGLMAKQRAETQSSSEPKAAPVAPVAQPEGQGQQDQADSLDSAEGAAEFGRWLKDFTKLKPKEF

Query:  DAS-ENPLVAVRWLARMEYIFRVMGCPDVQRPRCGAHVLTDIAEWWWESTDRARPAGSSPATWEFFRTKF
        D S E+P  A  WL+ +E IFR M CP+ Q+ +C   +LTD    WWE+T+R      S  TW+ F+  F
Subjt:  DAS-ENPLVAVRWLARMEYIFRVMGCPDVQRPRCGAHVLTDIAEWWWESTDRARPAGSSPATWEFFRTKF

A0A5A7THF3 Reverse transcriptase5.1e-1234.17Show/hide
Query:  RSGAGPSRTANRDVVEPAAMVYQARRPLVAPGANLPREGVEAAMPQGAALGGDVSAMRDMMMGLMAKQR-AETQSSSEPKAAPV---APVAQPEGQGQQD
        R GAG +   + D    +   Y  R P V P A  P     AA    A L       RD++M +  +Q+ A    +  P  AP    AP   P     Q 
Subjt:  RSGAGPSRTANRDVVEPAAMVYQARRPLVAPGANLPREGVEAAMPQGAALGGDVSAMRDMMMGLMAKQR-AETQSSSEPKAAPV---APVAQPEGQGQQD

Query:  QADSLDSAEGAAEFGRWLKDFTKLKPKEFDAS-ENPLVAVRWLARMEYIFRVMGCPDVQRPRCGAHVLTDIAEWWWESTDRARPAGSSPATWEFFRTKF
          D L SAE      + L+DF K  P  FD S E+P  A  WL+ +E IFR M CP+ Q+ +C   +LTD    WWE+T+R      S  TW+ F+  F
Subjt:  QADSLDSAEGAAEFGRWLKDFTKLKPKEFDAS-ENPLVAVRWLARMEYIFRVMGCPDVQRPRCGAHVLTDIAEWWWESTDRARPAGSSPATWEFFRTKF

A0A5A7V1A3 Gag protease polyprotein6.7e-1236.23Show/hide
Query:  RDMMMGLMAKQRAETQSSSEPKAAPVAPVAQPEGQGQQDQADSLDSAEGAAEFGRWLKDFTKLKPKEFDAS-ENPLVAVRWLARMEYIFRVMGCPDVQRP
        RDM+M +  +Q+  + + +   A   AP   P     Q  +D L SAE      + L+DF K  P  FD S E+P  A  WL+ +E IFR M CP+ Q+ 
Subjt:  RDMMMGLMAKQRAETQSSSEPKAAPVAPVAQPEGQGQQDQADSLDSAEGAAEFGRWLKDFTKLKPKEFDAS-ENPLVAVRWLARMEYIFRVMGCPDVQRP

Query:  RCGAHVLTDIAEWWWESTDRARPAGSSPATWEFFRTKF
        +C   +LTD    WWE+T+R      S  TW+ F+  F
Subjt:  RCGAHVLTDIAEWWWESTDRARPAGSSPATWEFFRTKF

A0A5A7VA59 Gag protease polyprotein1.1e-1134.71Show/hide
Query:  RPLVAPGANLPREGVEAAMPQGAALGGDVSAMRDMMMGLMAKQRAETQSSSEPKAAPVAPVAQPEGQGQQDQADSLDSAEGAAEFGRWLKDFTKLKPKEF
        +P V P A  P     AA    A L       RD++M +  +Q+  + + +   A   APV  P     Q   D L SAE      + L+DF K  P  F
Subjt:  RPLVAPGANLPREGVEAAMPQGAALGGDVSAMRDMMMGLMAKQRAETQSSSEPKAAPVAPVAQPEGQGQQDQADSLDSAEGAAEFGRWLKDFTKLKPKEF

Query:  DAS-ENPLVAVRWLARMEYIFRVMGCPDVQRPRCGAHVLTDIAEWWWESTDRARPAGSSPATWEFFRTKF
        D S E+P  A  WL+ +E IFR M CP+ Q+ +C   +LTD    WWE+T+R      S  TW+ F+  F
Subjt:  DAS-ENPLVAVRWLARMEYIFRVMGCPDVQRPRCGAHVLTDIAEWWWESTDRARPAGSSPATWEFFRTKF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGGTTACACGCAGTGGAGCAGGTCCGTCCAGAACTGCAAATAGAGATGTTGTAGAGCCAGCAGCCATGGTTTATCAAGCTCGTAGGCCGCTCGTAGCTCCAGGTGC
TAACTTGCCTAGAGAAGGGGTGGAGGCTGCTATGCCTCAAGGTGCAGCACTAGGGGGAGATGTTTCTGCTATGAGAGATATGATGATGGGCCTAATGGCCAAGCAGAGAG
CTGAGACTCAGAGCTCCAGTGAGCCCAAAGCAGCTCCAGTAGCTCCGGTAGCTCAACCGGAGGGGCAGGGTCAGCAGGACCAGGCTGATTCACTTGATAGTGCAGAGGGA
GCCGCTGAGTTTGGCAGGTGGCTGAAAGATTTCACCAAGTTGAAGCCGAAGGAATTTGACGCTTCAGAGAACCCACTTGTGGCAGTCAGGTGGTTGGCAAGGATGGAGTA
CATTTTTAGAGTTATGGGATGCCCTGATGTACAGAGGCCGAGATGTGGGGCCCATGTTCTCACTGACATAGCTGAGTGGTGGTGGGAGTCCACAGATAGAGCTAGACCAG
CTGGGTCTTCTCCAGCCACCTGGGAGTTCTTTAGAACGAAGTTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCGGTTACACGCAGTGGAGCAGGTCCGTCCAGAACTGCAAATAGAGATGTTGTAGAGCCAGCAGCCATGGTTTATCAAGCTCGTAGGCCGCTCGTAGCTCCAGGTGC
TAACTTGCCTAGAGAAGGGGTGGAGGCTGCTATGCCTCAAGGTGCAGCACTAGGGGGAGATGTTTCTGCTATGAGAGATATGATGATGGGCCTAATGGCCAAGCAGAGAG
CTGAGACTCAGAGCTCCAGTGAGCCCAAAGCAGCTCCAGTAGCTCCGGTAGCTCAACCGGAGGGGCAGGGTCAGCAGGACCAGGCTGATTCACTTGATAGTGCAGAGGGA
GCCGCTGAGTTTGGCAGGTGGCTGAAAGATTTCACCAAGTTGAAGCCGAAGGAATTTGACGCTTCAGAGAACCCACTTGTGGCAGTCAGGTGGTTGGCAAGGATGGAGTA
CATTTTTAGAGTTATGGGATGCCCTGATGTACAGAGGCCGAGATGTGGGGCCCATGTTCTCACTGACATAGCTGAGTGGTGGTGGGAGTCCACAGATAGAGCTAGACCAG
CTGGGTCTTCTCCAGCCACCTGGGAGTTCTTTAGAACGAAGTTCTAA
Protein sequenceShow/hide protein sequence
MPVTRSGAGPSRTANRDVVEPAAMVYQARRPLVAPGANLPREGVEAAMPQGAALGGDVSAMRDMMMGLMAKQRAETQSSSEPKAAPVAPVAQPEGQGQQDQADSLDSAEG
AAEFGRWLKDFTKLKPKEFDASENPLVAVRWLARMEYIFRVMGCPDVQRPRCGAHVLTDIAEWWWESTDRARPAGSSPATWEFFRTKF