CuGenDBv2

Sed0000392 (gene) of Chayote v1 genome

Gene ID: Sed0000392
Organism: Sechium edule (Chayote v1)
Description: Gag-protease polyprotein
Genome location: LG05:11118332..11119752
RNA-Seq Expression: Sed0000392
Synteny: Sed0000392
Gene Ontology terms: GO:0090304 - nucleic acid metabolic process (biological process)
                     GO:0016787 - hydrolase activity (molecular function)
InterPro domains: IPR005162 - Retrotransposon gag domain
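
The genome location field above can be split into a sequence ID and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); `Location` and `parse_location` are hypothetical helper names, not part of any CuGenDB tooling.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so add 1 to the difference.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'SEQID:START..END' string such as 'LG05:11118332..11119752'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start exceeds end in {text!r}")
    return Location(seqid, start, end)


loc = parse_location("LG05:11118332..11119752")
print(loc.seqid, loc.length)  # LG05 1421
```

Under that assumption the gene spans 1421 bp on LG05.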


Homology
GenBank top hits (e value, % identity, alignment)
XP_022156662.1 uncharacterized protein LOC111023512 [Momordica charantia] (e value: 6.3e-09; % identity: 45.45)
Query:  RKQMEFLHLQQKEKTVEDYEIAFTELGRYYPELIDSDVKKIERCILGLRVEIQRTMEALAPSDYTAAVRIASLMDAR
        RKQ+EFL+L+Q  ++VE+Y+  FT+L R+ PEL+D++  K ER I+ L+ E +  +  L+P DY  A+R A+L+D R
Subjt:  RKQMEFLHLQQKEKTVEDYEIAFTELGRYYPELIDSDVKKIERCILGLRVEIQRTMEALAPSDYTAAVRIASLMDAR

XP_022931733.1 uncharacterized protein LOC111437895 [Cucurbita moschata] (e value: 6.3e-09; % identity: 44.16)
Query:  RNRKQMEFLHLQQKEKTVEDYEIAFTELGRYYPELIDSDVKKIERCILGLRVEIQRTMEALAPSDYTAAVRIASLMD
        R++ Q  FL L+Q +KTVEDY++ F +L R+ PE + ++  KI+R I GLR+E+Q  +     SDY  A+R+A++MD
Subjt:  RNRKQMEFLHLQQKEKTVEDYEIAFTELGRYYPELIDSDVKKIERCILGLRVEIQRTMEALAPSDYTAAVRIASLMD

XP_022931734.1 uncharacterized protein LOC111437896 [Cucurbita moschata] (e value: 6.3e-09; % identity: 44.16)
Query:  RNRKQMEFLHLQQKEKTVEDYEIAFTELGRYYPELIDSDVKKIERCILGLRVEIQRTMEALAPSDYTAAVRIASLMD
        R++ Q  FL L+Q +KTVEDY++ F +L R+ PE + ++  KI+R I GLR+E+Q  +     SDY  A+R+A++MD
Subjt:  RNRKQMEFLHLQQKEKTVEDYEIAFTELGRYYPELIDSDVKKIERCILGLRVEIQRTMEALAPSDYTAAVRIASLMD

XP_023522342.1 uncharacterized protein LOC111786265, partial [Cucurbita pepo subsp. pepo] (e value: 1.1e-08; % identity: 46.67)
Query:  RKQMEFLHLQQKEKTVEDYEIAFTELGRYYPELIDSDVKKIERCILGLRVEIQRTMEALAPSDYTAAVRIASLMD
        RKQ EF  L Q+ +TV  Y   F++L R+ PEL+++D K   R +LGL  +I+ T+EA+AP+ YTAA+R A  M+
Subjt:  RKQMEFLHLQQKEKTVEDYEIAFTELGRYYPELIDSDVKKIERCILGLRVEIQRTMEALAPSDYTAAVRIASLMD

XP_038891712.1 uncharacterized protein LOC120081110 [Benincasa hispida] (e value: 5.3e-08; % identity: 33.64)
Query:  RNRKQMEFLHLQQKEKTVEDYEIAFTELGRYYPELIDSDVKKIERCILGLRVEIQRTMEALAPSDYTAAVRIASLMDARYHLTGMTAVGHNPDLNRKQKV
        R  KQ EFL L+Q  ++VE+Y+  F  L R+ PEL+ ++  + ER I GL+  I+  ++A  P+ +  A+R+A+ +D +         G  P   +K+K 
Subjt:  RNRKQMEFLHLQQKEKTVEDYEIAFTELGRYYPELIDSDVKKIERCILGLRVEIQRTMEALAPSDYTAAVRIASLMDARYHLTGMTAVGHNPDLNRKQKV

Query:  EQENYKP
        +Q+++KP
Subjt:  EQENYKP

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TFN8 Gag-protease polyprotein (e value: 4.4e-08; % identity: 34.86)
Query:  RNRKQMEFLHLQQKEKTVEDYEIAFTELGRYYPELIDSDVKKIERCILGLRVEIQRTMEALAPSDYTAAVRIASLMDARYHLTGMTAVGHNPDLNRKQKV
        ++ K  EFL+L+Q + TVE Y+  F  L R+ P ++  +  + E+ + GLR+++Q  + AL P+ +  A+RIA  +     +    A G  P L +K+KV
Subjt:  RNRKQMEFLHLQQKEKTVEDYEIAFTELGRYYPELIDSDVKKIERCILGLRVEIQRTMEALAPSDYTAAVRIASLMDARYHLTGMTAVGHNPDLNRKQKV

Query:  E-QENYKPQ
        E Q N  PQ
Subjt:  E-QENYKPQ

A0A6J1DSJ6 uncharacterized protein LOC111023512 (e value: 3.1e-09; % identity: 45.45)
Query:  RKQMEFLHLQQKEKTVEDYEIAFTELGRYYPELIDSDVKKIERCILGLRVEIQRTMEALAPSDYTAAVRIASLMDAR
        RKQ+EFL+L+Q  ++VE+Y+  FT+L R+ PEL+D++  K ER I+ L+ E +  +  L+P DY  A+R A+L+D R
Subjt:  RKQMEFLHLQQKEKTVEDYEIAFTELGRYYPELIDSDVKKIERCILGLRVEIQRTMEALAPSDYTAAVRIASLMDAR

A0A6J1EV26 Reverse transcriptase (e value: 3.1e-09; % identity: 44.16)
Query:  RNRKQMEFLHLQQKEKTVEDYEIAFTELGRYYPELIDSDVKKIERCILGLRVEIQRTMEALAPSDYTAAVRIASLMD
        R++ Q  FL L+Q +KTVEDY++ F +L R+ PE + ++  KI+R I GLR+E+Q  +     SDY  A+R+A++MD
Subjt:  RNRKQMEFLHLQQKEKTVEDYEIAFTELGRYYPELIDSDVKKIERCILGLRVEIQRTMEALAPSDYTAAVRIASLMD

A0A6J1EZJ9 uncharacterized protein LOC111437895 (e value: 3.1e-09; % identity: 44.16)
Query:  RNRKQMEFLHLQQKEKTVEDYEIAFTELGRYYPELIDSDVKKIERCILGLRVEIQRTMEALAPSDYTAAVRIASLMD
        R++ Q  FL L+Q +KTVEDY++ F +L R+ PE + ++  KI+R I GLR+E+Q  +     SDY  A+R+A++MD
Subjt:  RNRKQMEFLHLQQKEKTVEDYEIAFTELGRYYPELIDSDVKKIERCILGLRVEIQRTMEALAPSDYTAAVRIASLMD

A0A6J1FB78 uncharacterized protein LOC111443845 (e value: 5.8e-08; % identity: 34.31)
Query:  RNRKQMEFLHLQQKEKTVEDYEIAFTELGRYYPELIDSDVKKIERCILGLRVEIQRTMEALAPSDYTAAVRIASLMDARYHLTGMTAVGHNPDLNRKQKV
        R +KQ EFL ++Q  ++VE+YE  FT L R+ P ++  +  K+E  ++GLR +I+  +    P DY  A+++A  +D + + T  T    N  +++K+K 
Subjt:  RNRKQMEFLHLQQKEKTVEDYEIAFTELGRYYPELIDSDVKKIERCILGLRVEIQRTMEALAPSDYTAAVRIASLMDARYHLTGMTAVGHNPDLNRKQKV

Query:  EQ
        EQ
Subjt:  EQ

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found
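
The % identity values in the hit tables appear to follow the usual BLAST convention: in the middle line of each alignment a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch. The sketch below, which assumes identity is the number of letters in the middle line divided by the aligned length, reproduces the 45.45 reported for XP_022156662.1; `percent_identity` is a hypothetical helper name.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity from a BLAST-style alignment.

    Assumes letters in the midline mark identical residues and that the
    aligned length equals the length of the query line (gaps included).
    """
    identities = sum(ch.isalpha() for ch in midline)
    return round(100.0 * identities / len(query), 2)


# Query and middle line of the XP_022156662.1 alignment from this record.
query = (
    "RKQMEFLHLQQKEKTVEDYEIAFTELGRYYPELIDSDVKKIERCILGLRVEIQRTMEALAPSDYTAAVRIASLMDAR"
)
midline = (
    "RKQ+EFL+L+Q  ++VE+Y+  FT+L R+ PEL+D++  K ER I+ L+ E +  +  L+P DY  A+R A+L+D R"
)

print(percent_identity(query, midline))  # 45.45
```

Only the letters in the middle line are counted, so the result does not depend on how whitespace in that line was preserved when the page was copied.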

Sequences
CDS sequence
ATGATTGACCGCATGCATACAGGAACTACGCACAAGGTGTCACAAGTGGTATCAGAGCCCAGGAATAGGAAGCAAATGGAGTTCTTACATCTTCAACAAAAGGAAAAGAC
AGTGGAGGATTATGAGATAGCGTTTACTGAATTAGGCAGGTATTACCCAGAATTGATAGACTCAGATGTCAAAAAAATTGAAAGGTGTATATTAGGATTGAGGGTTGAAA
TTCAGAGGACAATGGAAGCTTTGGCACCTTCAGATTATACTGCGGCAGTTAGAATAGCTTCTTTAATGGATGCACGGTATCATCTGACTGGAATGACAGCAGTAGGTCAT
AACCCGGACTTAAACCGGAAACAAAAAGTTGAGCAAGAAAACTACAAGCCACAGATGTTGGGACGAGGTCAGCAAAAAGGGTCGTTGACAAGCATAACAAAAGAAAAGTG
GGTTGTAATATATGTGGAAAAACACATGGAGGACGATGTATGGCTAGCTCTTGATCATGTTTTAGATGTGGAAGAAATGGCCACCTCAGTCGAGAGCGTACCAATCCCAG
AGTTTGTTATAAATGCGGGAAAAATGGGTATATAA
mRNA sequence
ATGATTGACCGCATGCATACAGGAACTACGCACAAGGTGTCACAAGTGGTATCAGAGCCCAGGAATAGGAAGCAAATGGAGTTCTTACATCTTCAACAAAAGGAAAAGAC
AGTGGAGGATTATGAGATAGCGTTTACTGAATTAGGCAGGTATTACCCAGAATTGATAGACTCAGATGTCAAAAAAATTGAAAGGTGTATATTAGGATTGAGGGTTGAAA
TTCAGAGGACAATGGAAGCTTTGGCACCTTCAGATTATACTGCGGCAGTTAGAATAGCTTCTTTAATGGATGCACGGTATCATCTGACTGGAATGACAGCAGTAGGTCAT
AACCCGGACTTAAACCGGAAACAAAAAGTTGAGCAAGAAAACTACAAGCCACAGATGTTGGGACGAGGTCAGCAAAAAGGGTCGTTGACAAGCATAACAAAAGAAAAGTG
GGTTGTAATATATGTGGAAAAACACATGGAGGACGATGTATGGCTAGCTCTTGATCATGTTTTAGATGTGGAAGAAATGGCCACCTCAGTCGAGAGCGTACCAATCCCAG
AGTTTGTTATAAATGCGGGAAAAATGGGTATATAA
Protein sequence
MIDRMHTGTTHKVSQVVSEPRNRKQMEFLHLQQKEKTVEDYEIAFTELGRYYPELIDSDVKKIERCILGLRVEIQRTMEALAPSDYTAAVRIASLMDARYHLTGMTAVGH
NPDLNRKQKVEQENYKPQMLGRGQQKGSLTSITKEKWVVIYVEKHMEDDVWLALDHVLDVEEMATSVESVPIPEFVINAGKMGI
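
The protein sequence above corresponds to a translation of the CDS. As a quick consistency check, the following minimal sketch translates the first 36 nucleotides of the CDS using the standard genetic code (assumed here; the record does not name a translation table). `translate` is a hypothetical helper, not a CuGenDB function.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 36 nucleotides of the CDS sequence listed in this record.
cds_start = "ATGATTGACCGCATGCATACAGGAACTACGCACAAG"
print(translate(cds_start))  # MIDRMHTGTTHK
```

The output matches the first twelve residues of the protein sequence; passing the full CDS with its line breaks to `translate` should extend the same check to the whole record.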