; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Chy2G041280 (gene) of Cucumber (hystrix) v1 genome

Gene IDChy2G041280
OrganismCucumis hystrix (Cucumber (hystrix) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchrH02:21052542..21055433
RNA-Seq ExpressionChy2G041280
SyntenyChy2G041280
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0016740 - transferase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035961.1 uncharacterized protein E6C27_scaffold56G001660 [Cucumis melo var. makuwa]3.92e-3650.89Show/hide
Query:  MDYRSKEKRTHENEGEVSESRKEHVHKERESHKDSERHEKTTEEKKGTFLKTPRPCFLCNGPHWARDCPQKKMLSALVAKHHEGEEEAHVPKEAHLGGMQ
        +D  SKEK+   +EGE  E +KE       + K   ++ K     K T    P  CFLC GPHW RDCP+K  LSA+VA   E EEE  +   A LGG++
Subjt:  MDYRSKEKRTHENEGEVSESRKEHVHKERESHKDSERHEKTTEEKKGTFLKTPRPCFLCNGPHWARDCPQKKMLSALVAKHHEGEEEAHVPKEAHLGGMQ

Query:  FLNAMENDSSSKQPKGRGWMYVEARLNNKAALPMLDTGATSNIIDLEEAKRLGLKITRRGDTELKIISA
        +L+AM+++SS K+P+  G +YV+A  NNKAAL MLD GAT N++D EEAKRLGL+ TRR DTE+KIISA
Subjt:  FLNAMENDSSSKQPKGRGWMYVEARLNNKAALPMLDTGATSNIIDLEEAKRLGLKITRRGDTELKIISA

KAA0064169.1 uncharacterized protein E6C27_scaffold548G00940 [Cucumis melo var. makuwa]7.84e-3951.19Show/hide
Query:  KEKRTHENEGEVSESRKEHVHKERESHKDSERHEKTTEEKKGTFLKTPRPCFLCNGPHWARDCPQKKMLSALVAKHHEGEEEAHVPKEAHLGGMQFLNAM
        KEK+T  +EG+ SE++K+       + K   ++ K     KGT   TP PCFLCNGPHWARDCP+K  LSA+VA  ++ EEEA +   A L  +++L+ M
Subjt:  KEKRTHENEGEVSESRKEHVHKERESHKDSERHEKTTEEKKGTFLKTPRPCFLCNGPHWARDCPQKKMLSALVAKHHEGEEEAHVPKEAHLGGMQFLNAM

Query:  ENDSSSKQPKGRGWMYVEARLNNKAALPMLDTGATSNIIDLEEAKRLGLKITRRGDTELKIISAMPTR
        END S K+P+  GW+YV+  LN+K AL +LDT    N++D EEAK+LGL+ITRR DTE+KIISA P R
Subjt:  ENDSSSKQPKGRGWMYVEARLNNKAALPMLDTGATSNIIDLEEAKRLGLKITRRGDTELKIISAMPTR

KAE8637433.1 hypothetical protein CSA_004611, partial [Cucumis sativus]4.66e-9057.56Show/hide
Query:  DYRSKEKRTHENEGEVSESRKEHVHKERESHKDSERHEKTTEEKKGTFLKTPRPCFLCNGPHWARDCPQKKMLSALVAK-HHEGEEEAHVPKEAHLGGMQ
        DYR KEKRT+E+ GEVSE+RKEHVHK R+SH+DS RH K TE KKGT  KTP PCFLCNGPHWARDCPQKK+LSAL+AK H   EEE  +P+EAHLGG+Q
Subjt:  DYRSKEKRTHENEGEVSESRKEHVHKERESHKDSERHEKTTEEKKGTFLKTPRPCFLCNGPHWARDCPQKKMLSALVAK-HHEGEEEAHVPKEAHLGGMQ

Query:  FLNAMENDSSSKQPKGRGWMYVEARLNNKAALPMLDTGATSNIIDLEEAKRLGLKITRRGDTELKIISAMPTR---------------------------
        +L AM+   SSK+P G+G +YV+A LNN+AAL +LDTGATSNIIDLEEAKRLGLKITRR D ELKII+A PTR                           
Subjt:  FLNAMENDSSSKQPKGRGWMYVEARLNNKAALPMLDTGATSNIIDLEEAKRLGLKITRRGDTELKIISAMPTR---------------------------

Query:  -------LDFLDKAHAVINTLKKTLEFPDIRREVPLKRMRVLSQERLSAMRIARHHLGKGSSGNVPTRTLQ
               LDF+ +AHA ++  ++TL FP+ +REVPLK + +  +  + AM+  RH  GKGS+G+V TR L+
Subjt:  -------LDFLDKAHAVINTLKKTLEFPDIRREVPLKRMRVLSQERLSAMRIARHHLGKGSSGNVPTRTLQ

KAE8646814.1 hypothetical protein Csa_005402 [Cucumis sativus]1.95e-9157.72Show/hide
Query:  DYRSKEKRTHENEGEVSESRKEHVHKERESHKDSERHEKTTEEKKGTFLKTPRPCFLCNGPHWARDCPQKKMLSALVAK-HHEGEEEAHVPKEAHLGGMQ
        DYR KEKRT+E+ GEVSE+RKEHVHK R+SH+DS RH K TE KKGT  KTP PCFLCNGPHWARDCPQKK+LSAL+AK H   EEE  +P+EAHLGG+Q
Subjt:  DYRSKEKRTHENEGEVSESRKEHVHKERESHKDSERHEKTTEEKKGTFLKTPRPCFLCNGPHWARDCPQKKMLSALVAK-HHEGEEEAHVPKEAHLGGMQ

Query:  FLNAMENDSSSKQPKGRGWMYVEARLNNKAALPMLDTGATSNIIDLEEAKRLGLKITRRGDTELKIISAMPTR---------------------------
        +L AM+   SSK+P G+G +YV+A LNN+AAL +LDTGATSNIIDLEEAKRLGLKITRR D ELKII+A PTR                           
Subjt:  FLNAMENDSSSKQPKGRGWMYVEARLNNKAALPMLDTGATSNIIDLEEAKRLGLKITRRGDTELKIISAMPTR---------------------------

Query:  -------LDFLDKAHAVINTLKKTLEFPDIRREVPLKRMRVLSQERLSAMRIARHHLGKGSSGNVPTRTLQR
               LDF+ +AHA ++  ++TL FP+ +REVPLK + +  +  + AM+  RH  GKGS+G+V TR L+R
Subjt:  -------LDFLDKAHAVINTLKKTLEFPDIRREVPLKRMRVLSQERLSAMRIARHHLGKGSSGNVPTRTLQR

KAE8649392.1 hypothetical protein Csa_011844 [Cucumis sativus]5.62e-3578.65Show/hide
Query:  YRSKEKRTHENEGEVSESRKEHVHKERESHKDSERHEKTTEEKKGTFLKTPRPCFLCNGPHWARDCPQKKMLSALVAKHHEGEEEAHVP
        YRSKEKR +EN GEVSESR+EHVHK R+SHK S RH K TE +KGT  KTP  CFLCNGPHWA DCPQKKMLSALVAK HEGEEEA VP
Subjt:  YRSKEKRTHENEGEVSESRKEHVHKERESHKDSERHEKTTEEKKGTFLKTPRPCFLCNGPHWARDCPQKKMLSALVAKHHEGEEEAHVP

TrEMBL top hitse value%identityAlignment
A0A5A7SY30 Retrotrans_gag domain-containing protein3.2e-2046.98Show/hide
Query:  MDYRSKEKRTHENEGEVSESRKEHVHKERESHKDSERHEKTTEEKKGTFLKTPRPCFLCNGPHWARDCPQKKMLSALVAKHHEGEEEAHVPKEAHLGGMQ
        +D   KEK+T  +EGE  E       KE  S +D E++ KT    + T   TP  CFLC GPH  R+CP++  LSA+VA   E EEE  +   A LG ++
Subjt:  MDYRSKEKRTHENEGEVSESRKEHVHKERESHKDSERHEKTTEEKKGTFLKTPRPCFLCNGPHWARDCPQKKMLSALVAKHHEGEEEAHVPKEAHLGGMQ

Query:  FLNAMENDSSSKQPKGRGWMYVEARLNNKAALPMLDTGATSNIIDLEEA
        +L+AM++D SSK+P+  GW+YV+A  NNKAAL MLD GAT N+++ EEA
Subjt:  FLNAMENDSSSKQPKGRGWMYVEARLNNKAALPMLDTGATSNIIDLEEA

A0A5A7T2W8 Retrotrans_gag domain-containing protein1.5e-3050.89Show/hide
Query:  MDYRSKEKRTHENEGEVSESRKEHVHKERESHKDSERHEKTTEEKKGTFLKTPRPCFLCNGPHWARDCPQKKMLSALVAKHHEGEEEAHVPKEAHLGGMQ
        +D  SKEK+   +EGE  E +KE       + K   ++ K     K T    P  CFLC GPHW RDCP+K  LSA+VA   E EEE  +   A LGG++
Subjt:  MDYRSKEKRTHENEGEVSESRKEHVHKERESHKDSERHEKTTEEKKGTFLKTPRPCFLCNGPHWARDCPQKKMLSALVAKHHEGEEEAHVPKEAHLGGMQ

Query:  FLNAMENDSSSKQPKGRGWMYVEARLNNKAALPMLDTGATSNIIDLEEAKRLGLKITRRGDTELKIISA
        +L+AM+++ SSK+P+  G +YV+A  NNKAAL MLD GAT N++D EEAKRLGL+ TRR DTE+KIISA
Subjt:  FLNAMENDSSSKQPKGRGWMYVEARLNNKAALPMLDTGATSNIIDLEEAKRLGLKITRRGDTELKIISA

A0A5C7ISI1 Uncharacterized protein4.6e-1929.37Show/hide
Query:  DYRSKEKRTHENE-GEVSESRKEHVHKERESHKDSERHEKTTEE-KKGTFLKTPRPCFLCNGPHWARDCPQKKMLSALVAKHHEGEEEAHVPKEAHLGGM
        +Y   EK+ ++ + G     ++    KE    K S   ++      K    K   PCF+C+GPHW RDCP++K LSA++++H E ++  +   EA +G +
Subjt:  DYRSKEKRTHENE-GEVSESRKEHVHKERESHKDSERHEKTTEE-KKGTFLKTPRPCFLCNGPHWARDCPQKKMLSALVAKHHEGEEEAHVPKEAHLGGM

Query:  QFLNAMENDSSSKQPKGRGWMYVEARLNNKAALPMLDTGATSNIIDLEEAKRLGLKITRRGDTELKIIS----------AMPTRL---------------
        + L A++++++      +G M+V A +N KA   MLDTGAT N + ++EAK LGL+ T RG T   + S          A+P  L               
Subjt:  QFLNAMENDSSSKQPKGRGWMYVEARLNNKAALPMLDTGATSNIIDLEEAKRLGLKITRRGDTELKIIS----------AMPTRL---------------

Query:  --------DFLDKAHAVINTLKKTLEFPDIRRE--VPLKRMRVLSQERLSAMRIARHHLGKGSSGNVPTRTL-QRDPSIIPRNPYPPEDEI---HYESSM
                +F D+ HA       +L   D  +   VP +RM  +    LSA++  R  L K  S     R L   + S++P++P P + +     Y+  M
Subjt:  --------DFLDKAHAVINTLKKTLEFPDIRRE--VPLKRMRVLSQERLSAMRIARHHLGKGSSGNVPTRTL-QRDPSIIPRNPYPPEDEI---HYESSM

Query:  PHE
        P E
Subjt:  PHE

A0A5C7IW54 Uncharacterized protein6.0e-1933.68Show/hide
Query:  RSKEKRTHENEGEVSESRKEHVHKERESHKDSERHEKTTEEK--KGTFLKTPRPCFLCNGPHWARDCPQKKMLSALVAKHHEGEEEAHVPKEAHLGGMQF
        +SKEK++++ +    +S +    ++ E  +     +K  +    K    K   PCF+C+GPHW RDCP++K LSA++++H E +E  +   EA +G ++ 
Subjt:  RSKEKRTHENEGEVSESRKEHVHKERESHKDSERHEKTTEEK--KGTFLKTPRPCFLCNGPHWARDCPQKKMLSALVAKHHEGEEEAHVPKEAHLGGMQF

Query:  LNAMENDSSSKQPKGRGWMYVEARLNNKAALPMLDTGATSNIIDLEEAKRLGLKITRRGDTELKIISAMPTRLDFLDKAHAV-INTLKKTLEF
        L A++ +++      +G M+V A +N KA   MLDTGAT N + L++AK+LGLK T  G T +K +++    +  + KA  V + T    L+F
Subjt:  LNAMENDSSSKQPKGRGWMYVEARLNNKAALPMLDTGATSNIIDLEEAKRLGLKITRRGDTELKIISAMPTRLDFLDKAHAV-INTLKKTLEF

A0A5D3BUR0 Retrotrans_gag domain-containing protein1.6e-3251.19Show/hide
Query:  KEKRTHENEGEVSESRKEHVHKERESHKDSERHEKTTEEKKGTFLKTPRPCFLCNGPHWARDCPQKKMLSALVAKHHEGEEEAHVPKEAHLGGMQFLNAM
        KEK+T  +EG+ SE++K+       + K   ++ K     KGT   TP PCFLCNGPHWARDCP+K  LSA+VA  ++ EEEA +   A L  +++L+ M
Subjt:  KEKRTHENEGEVSESRKEHVHKERESHKDSERHEKTTEEKKGTFLKTPRPCFLCNGPHWARDCPQKKMLSALVAKHHEGEEEAHVPKEAHLGGMQFLNAM

Query:  ENDSSSKQPKGRGWMYVEARLNNKAALPMLDTGATSNIIDLEEAKRLGLKITRRGDTELKIISAMPTR
        END  SK+P+  GW+YV+  LN+K AL +LDT    N++D EEAK+LGL+ITRR DTE+KIISA P R
Subjt:  ENDSSSKQPKGRGWMYVEARLNNKAALPMLDTGATSNIIDLEEAKRLGLKITRRGDTELKIISAMPTR

SwissProt top hitse value%identityAlignment
Q8VYS8 Probable protein S-acyltransferase 95.3e-0448.78Show/hide
Query:  TLQRDPSIIPRNPYPPEDEIHYESSMPHEHGGRQTPSLQFP
        T  RDP I+PRN +PPE+E+ Y++++  +  GRQTP++Q P
Subjt:  TLQRDPSIIPRNPYPPEDEIHYESSMPHEHGGRQTPSLQFP

Q9SB58 Protein S-acyltransferase 81.4e-0451.22Show/hide
Query:  TLQRDPSIIPRNPYPPEDEIHYESSMPHEHGGRQTPSLQFP
        T  RDP I+PRN +PPE+++ YE+++  +  GRQTPS+Q P
Subjt:  TLQRDPSIIPRNPYPPEDEIHYESSMPHEHGGRQTPSLQFP

Arabidopsis top hitse value%identityAlignment
AT4G24630.1 DHHC-type zinc finger family protein9.9e-0651.22Show/hide
Query:  TLQRDPSIIPRNPYPPEDEIHYESSMPHEHGGRQTPSLQFP
        T  RDP I+PRN +PPE+++ YE+++  +  GRQTPS+Q P
Subjt:  TLQRDPSIIPRNPYPPEDEIHYESSMPHEHGGRQTPSLQFP

AT5G50020.1 DHHC-type zinc finger family protein3.8e-0548.78Show/hide
Query:  TLQRDPSIIPRNPYPPEDEIHYESSMPHEHGGRQTPSLQFP
        T  RDP I+PRN +PPE+E+ Y++++  +  GRQTP++Q P
Subjt:  TLQRDPSIIPRNPYPPEDEIHYESSMPHEHGGRQTPSLQFP

AT5G50020.2 DHHC-type zinc finger family protein3.8e-0548.78Show/hide
Query:  TLQRDPSIIPRNPYPPEDEIHYESSMPHEHGGRQTPSLQFP
        T  RDP I+PRN +PPE+E+ Y++++  +  GRQTP++Q P
Subjt:  TLQRDPSIIPRNPYPPEDEIHYESSMPHEHGGRQTPSLQFP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTACCGATCCAAAGAGAAGAGAACCCATGAAAATGAAGGAGAAGTATCGGAGTCAAGAAAGGAACATGTTCATAAAGAGAGGGAAAGCCACAAGGATAGTGAAAG
GCACGAGAAGACCACGGAAGAGAAGAAAGGTACATTTCTCAAAACTCCTAGACCTTGTTTTCTCTGCAATGGTCCACATTGGGCAAGAGATTGTCCACAAAAGAAAATGT
TGAGTGCATTGGTGGCAAAACACCACGAAGGAGAAGAGGAAGCACATGTACCGAAGGAGGCACACTTAGGTGGAATGCAATTCCTAAATGCCATGGAGAACGATTCTTCC
TCTAAACAACCCAAGGGTAGGGGTTGGATGTACGTGGAGGCCAGACTCAACAACAAGGCTGCCTTACCTATGCTGGATACTGGCGCAACCAGCAACATCATTGATCTAGA
AGAGGCCAAGCGGCTAGGTCTCAAGATCACAAGGAGAGGCGACACTGAGTTGAAGATTATCAGTGCCATGCCCACGCGGTTGGACTTCTTAGACAAAGCCCATGCCGTAA
TCAACACACTCAAGAAGACCCTGGAGTTCCCTGACATACGGAGGGAGGTGCCCCTAAAACGCATGAGGGTGTTGAGCCAGGAGAGGCTTAGTGCCATGCGGATTGCAAGA
CACCACTTGGGAAAAGGGTCGAGTGGTAATGTCCCAACGAGGACGTTGCAAAGGGACCCAAGTATAATTCCACGGAATCCATATCCCCCAGAAGATGAAATTCATTATGA
GTCTTCTATGCCACATGAGCATGGGGGAAGACAAACACCAAGCCTCCAGTTTCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATTACCGATCCAAAGAGAAGAGAACCCATGAAAATGAAGGAGAAGTATCGGAGTCAAGAAAGGAACATGTTCATAAAGAGAGGGAAAGCCACAAGGATAGTGAAAG
GCACGAGAAGACCACGGAAGAGAAGAAAGGTACATTTCTCAAAACTCCTAGACCTTGTTTTCTCTGCAATGGTCCACATTGGGCAAGAGATTGTCCACAAAAGAAAATGT
TGAGTGCATTGGTGGCAAAACACCACGAAGGAGAAGAGGAAGCACATGTACCGAAGGAGGCACACTTAGGTGGAATGCAATTCCTAAATGCCATGGAGAACGATTCTTCC
TCTAAACAACCCAAGGGTAGGGGTTGGATGTACGTGGAGGCCAGACTCAACAACAAGGCTGCCTTACCTATGCTGGATACTGGCGCAACCAGCAACATCATTGATCTAGA
AGAGGCCAAGCGGCTAGGTCTCAAGATCACAAGGAGAGGCGACACTGAGTTGAAGATTATCAGTGCCATGCCCACGCGGTTGGACTTCTTAGACAAAGCCCATGCCGTAA
TCAACACACTCAAGAAGACCCTGGAGTTCCCTGACATACGGAGGGAGGTGCCCCTAAAACGCATGAGGGTGTTGAGCCAGGAGAGGCTTAGTGCCATGCGGATTGCAAGA
CACCACTTGGGAAAAGGGTCGAGTGGTAATGTCCCAACGAGGACGTTGCAAAGGGACCCAAGTATAATTCCACGGAATCCATATCCCCCAGAAGATGAAATTCATTATGA
GTCTTCTATGCCACATGAGCATGGGGGAAGACAAACACCAAGCCTCCAGTTTCCTTGA
Protein sequenceShow/hide protein sequence
MDYRSKEKRTHENEGEVSESRKEHVHKERESHKDSERHEKTTEEKKGTFLKTPRPCFLCNGPHWARDCPQKKMLSALVAKHHEGEEEAHVPKEAHLGGMQFLNAMENDSS
SKQPKGRGWMYVEARLNNKAALPMLDTGATSNIIDLEEAKRLGLKITRRGDTELKIISAMPTRLDFLDKAHAVINTLKKTLEFPDIRREVPLKRMRVLSQERLSAMRIAR
HHLGKGSSGNVPTRTLQRDPSIIPRNPYPPEDEIHYESSMPHEHGGRQTPSLQFP