; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G12060 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G12060
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationChr4:10383877..10384469
RNA-Seq ExpressionCSPI04G12060
SyntenyCSPI04G12060
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040209.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.2e-7168.95Show/hide
Query:  MELTQINVEETTEVELRSITGFTSKGTMKLKGNIKGKEVVILIDSGATNNFIHESVVEEQGLNIEPGTQFGVTIGDGTRCKGKGVCRRVELRLKEITIVA
        +EL Q+++ E  E++ ++IT FTSKGTMKLKG++KGKEV++LID GAT+NFIH+++VEE+ + IE G+ FGVTIGDGTRCKGKG+CR+VELRL ++TIV 
Subjt:  MELTQINVEETTEVELRSITGFTSKGTMKLKGNIKGKEVVILIDSGATNNFIHESVVEEQGLNIEPGTQFGVTIGDGTRCKGKGVCRRVELRLKEITIVA

Query:  DFLAVELGKVDVVLGMQWLDTTGTMKVHWPSLIMTFWVKGRQIVLKGDPSLIKAVCSPRTLEKTWDAEDHGFLLEFQNYKVEIENEYDTE
        DFLAVELGKVDVVLGMQWLDTTGTMKVHWPSL M FWV  +QIVLKGDPS I+A CS + +EKTW  ED GFLLE+QNY +E E++Y  E
Subjt:  DFLAVELGKVDVVLGMQWLDTTGTMKVHWPSLIMTFWVKGRQIVLKGDPSLIKAVCSPRTLEKTWDAEDHGFLLEFQNYKVEIENEYDTE

TYK07871.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]4.4e-7168.42Show/hide
Query:  MELTQINVEETTEVELRSITGFTSKGTMKLKGNIKGKEVVILIDSGATNNFIHESVVEEQGLNIEPGTQFGVTIGDGTRCKGKGVCRRVELRLKEITIVA
        +EL Q+++ E  E++ + IT FTSKGTMKLKG++KGKEV++LID GAT+NFIH+++VEE+ + IE G+ FGVTIGDGTRCKGKG+CR+VELRL ++TIV 
Subjt:  MELTQINVEETTEVELRSITGFTSKGTMKLKGNIKGKEVVILIDSGATNNFIHESVVEEQGLNIEPGTQFGVTIGDGTRCKGKGVCRRVELRLKEITIVA

Query:  DFLAVELGKVDVVLGMQWLDTTGTMKVHWPSLIMTFWVKGRQIVLKGDPSLIKAVCSPRTLEKTWDAEDHGFLLEFQNYKVEIENEYDTE
        DFLAVELGKVDV+LGMQWLDTTGTMKVHWPSL M FWV  +QIVLKGDPS I+A CS + +EKTW  ED GFLLE+QNY +E E++Y  E
Subjt:  DFLAVELGKVDVVLGMQWLDTTGTMKVHWPSLIMTFWVKGRQIVLKGDPSLIKAVCSPRTLEKTWDAEDHGFLLEFQNYKVEIENEYDTE

TYK14439.1 uncharacterized protein E5676_scaffold186G00980 [Cucumis melo var. makuwa]4.9e-7067.89Show/hide
Query:  MELTQINVEETTEVELRSITGFTSKGTMKLKGNIKGKEVVILIDSGATNNFIHESVVEEQGLNIEPGTQFGVTIGDGTRCKGKGVCRRVELRLKEITIVA
        MEL  + + E TE+EL ++T  TSKGTMKLKG ++ KE+V+LIDSGAT+NFIH+++ EE  + +E  TQFG TIG+GTRCKGKGVCRRVEL+LKEITI+A
Subjt:  MELTQINVEETTEVELRSITGFTSKGTMKLKGNIKGKEVVILIDSGATNNFIHESVVEEQGLNIEPGTQFGVTIGDGTRCKGKGVCRRVELRLKEITIVA

Query:  DFLAVELGKVDVVLGMQWLDTTGTMKVHWPSLIMTFWVKGRQIVLKGDPSLIKAVCSPRTLEKTWDAEDHGFLLEFQNYKVEIENEYDTE
        DFLAVELG VD VLGMQWLDTTGTM++HWPSL M FW +GRQIVLKGDPSLIKA CS +TLEKTW  +D GFLLE+ N ++  E +Y+T+
Subjt:  DFLAVELGKVDVVLGMQWLDTTGTMKVHWPSLIMTFWVKGRQIVLKGDPSLIKAVCSPRTLEKTWDAEDHGFLLEFQNYKVEIENEYDTE

XP_031737605.1 uncharacterized protein LOC116402475 [Cucumis sativus]1.6e-7371.05Show/hide
Query:  MELTQINVEETTEVELRSITGFTSKGTMKLKGNIKGKEVVILIDSGATNNFIHESVVEEQGLNIEPGTQFGVTIGDGTRCKGKGVCRRVELRLKEITIVA
        +EL Q+ + E T +ELR ITG TSKGTMKLKG++ GKEVVILIDSGATNNFI + +V+E  L+I+PGT+FGV IG+GTRC+G+G+C+RV+++LKE+TIVA
Subjt:  MELTQINVEETTEVELRSITGFTSKGTMKLKGNIKGKEVVILIDSGATNNFIHESVVEEQGLNIEPGTQFGVTIGDGTRCKGKGVCRRVELRLKEITIVA

Query:  DFLAVELGKVDVVLGMQWLDTTGTMKVHWPSLIMTFWVKGRQIVLKGDPSLIKAVCSPRTLEKTWDAEDHGFLLEFQNYKVEIENEYDTE
        DFLAVELGKVD+VLGMQWLD+TGTMKVHWPSL MTFW KGR+I+LKGD SL K+ CS RTLEKTW + D GFLLEFQNY+V+ E E +TE
Subjt:  DFLAVELGKVDVVLGMQWLDTTGTMKVHWPSLIMTFWVKGRQIVLKGDPSLIKAVCSPRTLEKTWDAEDHGFLLEFQNYKVEIENEYDTE

XP_031745528.1 uncharacterized protein LOC116405915 [Cucumis sativus]9.8e-7169.15Show/hide
Query:  MELTQINVEETTEVELRSITGFTSKGTMKLKGNIKGKEVVILIDSGATNNFIHESVVEEQGLNIEPGTQFGVTIGDGTRCKGKGVCRRVELRLKEITIVA
        +EL Q+ +EE TE+EL++I G TSKGTMK+KG IKGKEV+ILIDSGAT+NFIH  +VEE GL +E  T FGVTIGDGTRC+G+GVC R+EL+LKEITIVA
Subjt:  MELTQINVEETTEVELRSITGFTSKGTMKLKGNIKGKEVVILIDSGATNNFIHESVVEEQGLNIEPGTQFGVTIGDGTRCKGKGVCRRVELRLKEITIVA

Query:  DFLAVELGKVDVVLGMQWLDTTGTMKVHWPSLIMTFWVKGRQIVLKGDPSLIKAVCSPRTLEKTWDAEDHGFLLEFQNYKVEIENEYD
        DFLA+ELG VDV+LGMQWL+TTGTMK+HWPSL MTF +  +Q +LKGDPSLI+A CS +T+EKTW+ +D GFLLE QNY+ E + E D
Subjt:  DFLAVELGKVDVVLGMQWLDTTGTMKVHWPSLIMTFWVKGRQIVLKGDPSLIKAVCSPRTLEKTWDAEDHGFLLEFQNYKVEIENEYD

TrEMBL top hitse value%identityAlignment
A0A5A7TG20 Ty3-gypsy retrotransposon protein5.6e-7268.95Show/hide
Query:  MELTQINVEETTEVELRSITGFTSKGTMKLKGNIKGKEVVILIDSGATNNFIHESVVEEQGLNIEPGTQFGVTIGDGTRCKGKGVCRRVELRLKEITIVA
        +EL Q+++ E  E++ ++IT FTSKGTMKLKG++KGKEV++LID GAT+NFIH+++VEE+ + IE G+ FGVTIGDGTRCKGKG+CR+VELRL ++TIV 
Subjt:  MELTQINVEETTEVELRSITGFTSKGTMKLKGNIKGKEVVILIDSGATNNFIHESVVEEQGLNIEPGTQFGVTIGDGTRCKGKGVCRRVELRLKEITIVA

Query:  DFLAVELGKVDVVLGMQWLDTTGTMKVHWPSLIMTFWVKGRQIVLKGDPSLIKAVCSPRTLEKTWDAEDHGFLLEFQNYKVEIENEYDTE
        DFLAVELGKVDVVLGMQWLDTTGTMKVHWPSL M FWV  +QIVLKGDPS I+A CS + +EKTW  ED GFLLE+QNY +E E++Y  E
Subjt:  DFLAVELGKVDVVLGMQWLDTTGTMKVHWPSLIMTFWVKGRQIVLKGDPSLIKAVCSPRTLEKTWDAEDHGFLLEFQNYKVEIENEYDTE

A0A5A7TH07 Uncharacterized protein1.4e-6766.84Show/hide
Query:  MELTQINVEETTEVELRSITGFTSKGTMKLKGNIKGKEVVILIDSGATNNFIHESVVEEQGLNIEPGTQFGVTIGDGTRCKGKGVCRRVELRLKEITIVA
        +EL  +   E   +EL+++T F+SKGTMKLKG I+ KE+VILIDSGAT+NFIH+S+  +  L +E  TQFG TIG GTRCKGKG+CRRVE++L+EITI+A
Subjt:  MELTQINVEETTEVELRSITGFTSKGTMKLKGNIKGKEVVILIDSGATNNFIHESVVEEQGLNIEPGTQFGVTIGDGTRCKGKGVCRRVELRLKEITIVA

Query:  DFLAVELGKVDVVLGMQWLDTTGTMKVHWPSLIMTFWVKGRQIVLKGDPSLIKAVCSPRTLEKTWDAEDHGFLLEFQNYKVEIENEYDTE
        DFLAVELG VD VLGMQW+DTTGTMK+HWPSL M+FW +GRQI+LKGDPSLIKA CS RTLEKTW  +D GFLLE+ N +VE E+ Y T+
Subjt:  DFLAVELGKVDVVLGMQWLDTTGTMKVHWPSLIMTFWVKGRQIVLKGDPSLIKAVCSPRTLEKTWDAEDHGFLLEFQNYKVEIENEYDTE

A0A5A7TIU7 Transposon Ty3-G Gag-Pol polyprotein8.4e-6866.32Show/hide
Query:  MELTQINVEETTEVELRSITGFTSKGTMKLKGNIKGKEVVILIDSGATNNFIHESVVEEQGLNIEPGTQFGVTIGDGTRCKGKGVCRRVELRLKEITIVA
        +EL  + + E   +EL+++T F+SKGTMKLKG I+ KE+VILIDSGAT+NFIH+S+  +  L +E  TQFG TIG+GT CKGKG+CRRVE++L+EITI+A
Subjt:  MELTQINVEETTEVELRSITGFTSKGTMKLKGNIKGKEVVILIDSGATNNFIHESVVEEQGLNIEPGTQFGVTIGDGTRCKGKGVCRRVELRLKEITIVA

Query:  DFLAVELGKVDVVLGMQWLDTTGTMKVHWPSLIMTFWVKGRQIVLKGDPSLIKAVCSPRTLEKTWDAEDHGFLLEFQNYKVEIENEYDTE
        DFLAVELG VD VLGMQWLDTTGTMK+HWPSL M+FW +GRQI+LKGDPSL+KA CS RTLEKTW  +D GFLLE+ N +VE E  Y T+
Subjt:  DFLAVELGKVDVVLGMQWLDTTGTMKVHWPSLIMTFWVKGRQIVLKGDPSLIKAVCSPRTLEKTWDAEDHGFLLEFQNYKVEIENEYDTE

A0A5D3C7M9 Ty3-gypsy retrotransposon protein2.1e-7168.42Show/hide
Query:  MELTQINVEETTEVELRSITGFTSKGTMKLKGNIKGKEVVILIDSGATNNFIHESVVEEQGLNIEPGTQFGVTIGDGTRCKGKGVCRRVELRLKEITIVA
        +EL Q+++ E  E++ + IT FTSKGTMKLKG++KGKEV++LID GAT+NFIH+++VEE+ + IE G+ FGVTIGDGTRCKGKG+CR+VELRL ++TIV 
Subjt:  MELTQINVEETTEVELRSITGFTSKGTMKLKGNIKGKEVVILIDSGATNNFIHESVVEEQGLNIEPGTQFGVTIGDGTRCKGKGVCRRVELRLKEITIVA

Query:  DFLAVELGKVDVVLGMQWLDTTGTMKVHWPSLIMTFWVKGRQIVLKGDPSLIKAVCSPRTLEKTWDAEDHGFLLEFQNYKVEIENEYDTE
        DFLAVELGKVDV+LGMQWLDTTGTMKVHWPSL M FWV  +QIVLKGDPS I+A CS + +EKTW  ED GFLLE+QNY +E E++Y  E
Subjt:  DFLAVELGKVDVVLGMQWLDTTGTMKVHWPSLIMTFWVKGRQIVLKGDPSLIKAVCSPRTLEKTWDAEDHGFLLEFQNYKVEIENEYDTE

A0A5D3CW02 Uncharacterized protein2.4e-7067.89Show/hide
Query:  MELTQINVEETTEVELRSITGFTSKGTMKLKGNIKGKEVVILIDSGATNNFIHESVVEEQGLNIEPGTQFGVTIGDGTRCKGKGVCRRVELRLKEITIVA
        MEL  + + E TE+EL ++T  TSKGTMKLKG ++ KE+V+LIDSGAT+NFIH+++ EE  + +E  TQFG TIG+GTRCKGKGVCRRVEL+LKEITI+A
Subjt:  MELTQINVEETTEVELRSITGFTSKGTMKLKGNIKGKEVVILIDSGATNNFIHESVVEEQGLNIEPGTQFGVTIGDGTRCKGKGVCRRVELRLKEITIVA

Query:  DFLAVELGKVDVVLGMQWLDTTGTMKVHWPSLIMTFWVKGRQIVLKGDPSLIKAVCSPRTLEKTWDAEDHGFLLEFQNYKVEIENEYDTE
        DFLAVELG VD VLGMQWLDTTGTM++HWPSL M FW +GRQIVLKGDPSLIKA CS +TLEKTW  +D GFLLE+ N ++  E +Y+T+
Subjt:  DFLAVELGKVDVVLGMQWLDTTGTMKVHWPSLIMTFWVKGRQIVLKGDPSLIKAVCSPRTLEKTWDAEDHGFLLEFQNYKVEIENEYDTE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G53705.1 aminoacyl-tRNA ligases;nucleotide binding;ATP binding8.4e-0438.46Show/hide
Query:  IGDGTRCKGKGVCRRVELRLKEITIVADFLAVELGK--VDVVLGMQWLDTTG
        +G G   + KG C  + L ++E  IV D+L ++L K   DV+LG +WL   G
Subjt:  IGDGTRCKGKGVCRRVELRLKEITIVADFLAVELGK--VDVVLGMQWLDTTG

AT1G53705.2 aminoacyl-tRNA ligases;nucleotide binding;ATP binding8.4e-0438.46Show/hide
Query:  IGDGTRCKGKGVCRRVELRLKEITIVADFLAVELGK--VDVVLGMQWLDTTG
        +G G   + KG C  + L ++E  IV D+L ++L K   DV+LG +WL   G
Subjt:  IGDGTRCKGKGVCRRVELRLKEITIVADFLAVELGK--VDVVLGMQWLDTTG

AT3G29750.1 Eukaryotic aspartyl protease family protein1.4e-1128.76Show/hide
Query:  ITGFTSKGTMKLKGNIKGKEVVILIDSGATNNFIHESVVEEQGLNIEPGTQFGVTIGDGTRCKGKGVCRRVELRLKEITIVADFLAVELGK--VDVVLGM
        +   T    M+  G I   +VV+ IDSGAT+NFI   +     L      Q  V +G     +  G C  + L ++E+ I  +FL ++L K  VDV+LG 
Subjt:  ITGFTSKGTMKLKGNIKGKEVVILIDSGATNNFIHESVVEEQGLNIEPGTQFGVTIGDGTRCKGKGVCRRVELRLKEITIVADFLAVELGK--VDVVLGM

Query:  QWLDTTGTMKVHWPSLIMTFWVKGRQIVLKGDPSLIKAVCSPRTLEKTWDAED
        +WL   G   V+W +   +F    + I L  +   ++ V +   ++   + ED
Subjt:  QWLDTTGTMKVHWPSLIMTFWVKGRQIVLKGDPSLIKAVCSPRTLEKTWDAED

AT3G30770.1 Eukaryotic aspartyl protease family protein6.0e-1027.27Show/hide
Query:  EVELRSITGFTSKGTMKLKGNIKGKEVVILIDSGATNNFIHESVVEEQGLNIEPGTQFGVTIGDGTRCKGKGVCRRVELRLKEITIVADFLAVELGK--V
        +V+ +S T FT    M+  G I   +VV++IDSGATNNFI + +     L      Q  V +G     +  G C  + L ++E+ I  +FL ++L K  V
Subjt:  EVELRSITGFTSKGTMKLKGNIKGKEVVILIDSGATNNFIHESVVEEQGLNIEPGTQFGVTIGDGTRCKGKGVCRRVELRLKEITIVADFLAVELGK--V

Query:  DVVLGMQWLDTTGTMKVHWPSLIMTFWVKGRQIVLKGDPSLIKAVCSPRTLEKTWDAEDHGFLLE
        DV+LG           + W +   +F+   + + L      ++ V +   ++  ++ E     LE
Subjt:  DVVLGMQWLDTTGTMKVHWPSLIMTFWVKGRQIVLKGDPSLIKAVCSPRTLEKTWDAEDHGFLLE

AT3G42723.1 aminoacyl-tRNA ligases;ATP binding;nucleotide binding2.9e-0435.71Show/hide
Query:  KGVCRRVELRLKEITIVADFLAVELGK--VDVVLGMQWLDTTGTMKVHWPSLIMTF
        K  C+ + LR+ +I IV D+   +L +  VDV+LG +WL   G  +V+W +   +F
Subjt:  KGVCRRVELRLKEITIVADFLAVELGK--VDVVLGMQWLDTTGTMKVHWPSLIMTF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTTAACTCAAATAAATGTAGAGGAAACGACGGAAGTTGAGTTGAGAAGCATCACGGGGTTCACCTCAAAAGGGACGATGAAGTTGAAGGGAAACATAAAAGGAAA
AGAAGTAGTGATCCTCATTGATAGTGGAGCCACAAATAACTTCATACATGAGTCAGTGGTGGAGGAACAAGGGCTAAACATTGAACCGGGGACACAGTTTGGAGTGACCA
TTGGAGATGGAACTCGTTGTAAAGGCAAGGGAGTCTGTAGAAGAGTGGAACTGAGATTGAAGGAAATAACAATTGTAGCAGACTTCTTAGCTGTGGAATTGGGAAAGGTT
GATGTAGTATTGGGAATGCAGTGGTTAGATACCACCGGAACAATGAAGGTTCATTGGCCATCCCTAATCATGACTTTCTGGGTTAAAGGCAGACAGATTGTATTGAAAGG
AGATCCCTCTCTGATTAAGGCGGTATGTTCACCAAGAACATTGGAGAAAACGTGGGATGCTGAAGATCATGGGTTCTTGTTGGAATTCCAGAATTATAAGGTGGAAATTG
AGAATGAGTATGACACTGAAACATAG
mRNA sequenceShow/hide mRNA sequence
GGGAGGAAGAAGAAGTGATGGAGTTAACTCAAATAAATGTAGAGGAAACGACGGAAGTTGAGTTGAGAAGCATCACGGGGTTCACCTCAAAAGGGACGATGAAGTTGAAG
GGAAACATAAAAGGAAAAGAAGTAGTGATCCTCATTGATAGTGGAGCCACAAATAACTTCATACATGAGTCAGTGGTGGAGGAACAAGGGCTAAACATTGAACCGGGGAC
ACAGTTTGGAGTGACCATTGGAGATGGAACTCGTTGTAAAGGCAAGGGAGTCTGTAGAAGAGTGGAACTGAGATTGAAGGAAATAACAATTGTAGCAGACTTCTTAGCTG
TGGAATTGGGAAAGGTTGATGTAGTATTGGGAATGCAGTGGTTAGATACCACCGGAACAATGAAGGTTCATTGGCCATCCCTAATCATGACTTTCTGGGTTAAAGGCAGA
CAGATTGTATTGAAAGGAGATCCCTCTCTGATTAAGGCGGTATGTTCACCAAGAACATTGGAGAAAACGTGGGATGCTGAAGATCATGGGTTCTTGTTGGAATTCCAGAA
TTATAAGGTGGAAATTGAGAATGAGTATGACACTGAAACATAG
Protein sequenceShow/hide protein sequence
MELTQINVEETTEVELRSITGFTSKGTMKLKGNIKGKEVVILIDSGATNNFIHESVVEEQGLNIEPGTQFGVTIGDGTRCKGKGVCRRVELRLKEITIVADFLAVELGKV
DVVLGMQWLDTTGTMKVHWPSLIMTFWVKGRQIVLKGDPSLIKAVCSPRTLEKTWDAEDHGFLLEFQNYKVEIENEYDTET