; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G30600 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G30600
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr3:28001308..28002585
RNA-Seq ExpressionCSPI03G30600
SyntenyCSPI03G30600
Gene Ontology termsGO:0006488 - dolichol-linked oligosaccharide biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047238.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.1e-6669.43Show/hide
Query:  MRMITLKGVTTEEKRKEGPTKRLFDVGFQAWREKRLCFRCEEKYHAGHICKVKEQKELRMLVVRENGEELEII-EEFFDAETEAQTIEIGKVENLNIELS
        MR ITL+ V T + R+EGPTK+L D  FQA REK LCFRC EKY AGH CK KE KELRMLVV+E GEELEI+ EEFFDAETE + +E+  VENLNIELS
Subjt:  MRMITLKGVTTEEKRKEGPTKRLFDVGFQAWREKRLCFRCEEKYHAGHICKVKEQKELRMLVVRENGEELEII-EEFFDAETEAQTIEIGKVENLNIELS

Query:  INSIVGLSNLGTMKVKWKIKETEVVVLIDCGATHNFIEENLVTTLSLLVTETSNYGVILGSGAAVKGKGICDNVEVRVGEWNVVDSFLPLEIG
        INS+VGL+N GTMKVK ++ E EVV+LIDCGATHNF+ E+LVT L L + ET NYGVILGSG AVKGKG+C NVEV++  W V DSFLPL++G
Subjt:  INSIVGLSNLGTMKVKWKIKETEVVVLIDCGATHNFIEENLVTTLSLLVTETSNYGVILGSGAAVKGKGICDNVEVRVGEWNVVDSFLPLEIG

KAA0048037.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.7e-6670.47Show/hide
Query:  MRMITLKGVTTEEKRKEGPTKRLFDVGFQAWREKRLCFRCEEKYHAGHICKVKEQKELRMLVVRENGEELEII-EEFFDAETEAQTIEIGKVENLNIELS
        MR ITL+ V T + R+EGPTKRL D  FQA REK LCFRC EKY AGH CK KE KELRMLVV+E GEELEI+ EEFFDAETE + +E+  VENLNIELS
Subjt:  MRMITLKGVTTEEKRKEGPTKRLFDVGFQAWREKRLCFRCEEKYHAGHICKVKEQKELRMLVVRENGEELEII-EEFFDAETEAQTIEIGKVENLNIELS

Query:  INSIVGLSNLGTMKVKWKIKETEVVVLIDCGATHNFIEENLVTTLSLLVTETSNYGVILGSGAAVKGKGICDNVEVRVGEWNVVDSFLPLEIG
        INS+VGL+N GTMKVK K+ E EVV+LIDCGATHNFI E+LVT L L + ET NYGVILGSG AVKGKG+C +VEV++  W V DSFLPL++G
Subjt:  INSIVGLSNLGTMKVKWKIKETEVVVLIDCGATHNFIEENLVTTLSLLVTETSNYGVILGSGAAVKGKGICDNVEVRVGEWNVVDSFLPLEIG

KAA0062226.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.6e-6668.91Show/hide
Query:  MRMITLKGVTTEEKRKEGPTKRLFDVGFQAWREKRLCFRCEEKYHAGHICKVKEQKELRMLVVRENGEELEIIEEFFDAE-TEAQTIEIGKVENLNIELS
        MR +TL+G TT + R+EGP+KRL D  FQA REK LCF+C EKYHAGH CK KE KELRMLV+ ENGEE EIIEE  + E  +   IE+G V+NLNIELS
Subjt:  MRMITLKGVTTEEKRKEGPTKRLFDVGFQAWREKRLCFRCEEKYHAGHICKVKEQKELRMLVVRENGEELEIIEEFFDAE-TEAQTIEIGKVENLNIELS

Query:  INSIVGLSNLGTMKVKWKIKETEVVVLIDCGATHNFIEENLVTTLSLLVTETSNYGVILGSGAAVKGKGICDNVEVRVGEWNVVDSFLPLEIG
        INS+VGL+N GTMKVK K+K+ +VVVLIDCGATHNFI E LVT L+L +  T+NYGVILGSGAA+KGKGIC  VEV +G+W VVDSFLPLE+G
Subjt:  INSIVGLSNLGTMKVKWKIKETEVVVLIDCGATHNFIEENLVTTLSLLVTETSNYGVILGSGAAVKGKGICDNVEVRVGEWNVVDSFLPLEIG

TYJ96499.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.7e-6670.47Show/hide
Query:  MRMITLKGVTTEEKRKEGPTKRLFDVGFQAWREKRLCFRCEEKYHAGHICKVKEQKELRMLVVRENGEELEII-EEFFDAETEAQTIEIGKVENLNIELS
        MR ITL+ V T + R+EGPTKRL D  FQA REK LCFRC EKY AGH CK KE KELRMLVV+E GEELEI+ EEFFDAETE + +E+  VENLNIELS
Subjt:  MRMITLKGVTTEEKRKEGPTKRLFDVGFQAWREKRLCFRCEEKYHAGHICKVKEQKELRMLVVRENGEELEII-EEFFDAETEAQTIEIGKVENLNIELS

Query:  INSIVGLSNLGTMKVKWKIKETEVVVLIDCGATHNFIEENLVTTLSLLVTETSNYGVILGSGAAVKGKGICDNVEVRVGEWNVVDSFLPLEIG
        INS+VGL+N GTMKVK K+ E EVV+LIDCGATHNFI E+LVT L L + ET NYGVILGSG AVKGKG+C +VEV++  W V DSFLPL++G
Subjt:  INSIVGLSNLGTMKVKWKIKETEVVVLIDCGATHNFIEENLVTTLSLLVTETSNYGVILGSGAAVKGKGICDNVEVRVGEWNVVDSFLPLEIG

TYK02491.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.6e-6669.27Show/hide
Query:  MRMITLKGVTTEEKRKEGPTKRLFDVGFQAWREKRLCFRCEEKYHAGHICKVKEQKELRMLVVRENGEELEIIEEFFDAETEAQTIEIGKVENLNIELSI
        MR ITL+ V T + R+EGPTKRL D  FQA REK LCFRC EKY AGH CK KE KELRMLVV+E GEELEI+EEFFDAETE + +E+  VENLNIELS+
Subjt:  MRMITLKGVTTEEKRKEGPTKRLFDVGFQAWREKRLCFRCEEKYHAGHICKVKEQKELRMLVVRENGEELEIIEEFFDAETEAQTIEIGKVENLNIELSI

Query:  NSIVGLSNLGTMKVKWKIKETEVVVLIDCGATHNFIEENLVTTLSLLVTETSNYGVILGSGAAVKGKGICDNVEVRVGEWNVVDSFLPLEIG
        NS+VGL+N GTMKVK ++ E EVV+LIDCGATHNFI ENLVT L L + ET  YGVILGS  AVKGKG+C +VEV++  W V DSFLPL++G
Subjt:  NSIVGLSNLGTMKVKWKIKETEVVVLIDCGATHNFIEENLVTTLSLLVTETSNYGVILGSGAAVKGKGICDNVEVRVGEWNVVDSFLPLEIG

TrEMBL top hitse value%identityAlignment
A0A5A7TWF5 Ty3/gypsy retrotransposon protein1.0e-6669.43Show/hide
Query:  MRMITLKGVTTEEKRKEGPTKRLFDVGFQAWREKRLCFRCEEKYHAGHICKVKEQKELRMLVVRENGEELEII-EEFFDAETEAQTIEIGKVENLNIELS
        MR ITL+ V T + R+EGPTK+L D  FQA REK LCFRC EKY AGH CK KE KELRMLVV+E GEELEI+ EEFFDAETE + +E+  VENLNIELS
Subjt:  MRMITLKGVTTEEKRKEGPTKRLFDVGFQAWREKRLCFRCEEKYHAGHICKVKEQKELRMLVVRENGEELEII-EEFFDAETEAQTIEIGKVENLNIELS

Query:  INSIVGLSNLGTMKVKWKIKETEVVVLIDCGATHNFIEENLVTTLSLLVTETSNYGVILGSGAAVKGKGICDNVEVRVGEWNVVDSFLPLEIG
        INS+VGL+N GTMKVK ++ E EVV+LIDCGATHNF+ E+LVT L L + ET NYGVILGSG AVKGKG+C NVEV++  W V DSFLPL++G
Subjt:  INSIVGLSNLGTMKVKWKIKETEVVVLIDCGATHNFIEENLVTTLSLLVTETSNYGVILGSGAAVKGKGICDNVEVRVGEWNVVDSFLPLEIG

A0A5A7U1D6 Ty3/gypsy retrotransposon protein1.3e-6670.47Show/hide
Query:  MRMITLKGVTTEEKRKEGPTKRLFDVGFQAWREKRLCFRCEEKYHAGHICKVKEQKELRMLVVRENGEELEII-EEFFDAETEAQTIEIGKVENLNIELS
        MR ITL+ V T + R+EGPTKRL D  FQA REK LCFRC EKY AGH CK KE KELRMLVV+E GEELEI+ EEFFDAETE + +E+  VENLNIELS
Subjt:  MRMITLKGVTTEEKRKEGPTKRLFDVGFQAWREKRLCFRCEEKYHAGHICKVKEQKELRMLVVRENGEELEII-EEFFDAETEAQTIEIGKVENLNIELS

Query:  INSIVGLSNLGTMKVKWKIKETEVVVLIDCGATHNFIEENLVTTLSLLVTETSNYGVILGSGAAVKGKGICDNVEVRVGEWNVVDSFLPLEIG
        INS+VGL+N GTMKVK K+ E EVV+LIDCGATHNFI E+LVT L L + ET NYGVILGSG AVKGKG+C +VEV++  W V DSFLPL++G
Subjt:  INSIVGLSNLGTMKVKWKIKETEVVVLIDCGATHNFIEENLVTTLSLLVTETSNYGVILGSGAAVKGKGICDNVEVRVGEWNVVDSFLPLEIG

A0A5A7V723 Ty3/gypsy retrotransposon protein2.2e-6668.91Show/hide
Query:  MRMITLKGVTTEEKRKEGPTKRLFDVGFQAWREKRLCFRCEEKYHAGHICKVKEQKELRMLVVRENGEELEIIEEFFDAE-TEAQTIEIGKVENLNIELS
        MR +TL+G TT + R+EGP+KRL D  FQA REK LCF+C EKYHAGH CK KE KELRMLV+ ENGEE EIIEE  + E  +   IE+G V+NLNIELS
Subjt:  MRMITLKGVTTEEKRKEGPTKRLFDVGFQAWREKRLCFRCEEKYHAGHICKVKEQKELRMLVVRENGEELEIIEEFFDAE-TEAQTIEIGKVENLNIELS

Query:  INSIVGLSNLGTMKVKWKIKETEVVVLIDCGATHNFIEENLVTTLSLLVTETSNYGVILGSGAAVKGKGICDNVEVRVGEWNVVDSFLPLEIG
        INS+VGL+N GTMKVK K+K+ +VVVLIDCGATHNFI E LVT L+L +  T+NYGVILGSGAA+KGKGIC  VEV +G+W VVDSFLPLE+G
Subjt:  INSIVGLSNLGTMKVKWKIKETEVVVLIDCGATHNFIEENLVTTLSLLVTETSNYGVILGSGAAVKGKGICDNVEVRVGEWNVVDSFLPLEIG

A0A5D3BBV5 Ty3/gypsy retrotransposon protein1.3e-6670.47Show/hide
Query:  MRMITLKGVTTEEKRKEGPTKRLFDVGFQAWREKRLCFRCEEKYHAGHICKVKEQKELRMLVVRENGEELEII-EEFFDAETEAQTIEIGKVENLNIELS
        MR ITL+ V T + R+EGPTKRL D  FQA REK LCFRC EKY AGH CK KE KELRMLVV+E GEELEI+ EEFFDAETE + +E+  VENLNIELS
Subjt:  MRMITLKGVTTEEKRKEGPTKRLFDVGFQAWREKRLCFRCEEKYHAGHICKVKEQKELRMLVVRENGEELEII-EEFFDAETEAQTIEIGKVENLNIELS

Query:  INSIVGLSNLGTMKVKWKIKETEVVVLIDCGATHNFIEENLVTTLSLLVTETSNYGVILGSGAAVKGKGICDNVEVRVGEWNVVDSFLPLEIG
        INS+VGL+N GTMKVK K+ E EVV+LIDCGATHNFI E+LVT L L + ET NYGVILGSG AVKGKG+C +VEV++  W V DSFLPL++G
Subjt:  INSIVGLSNLGTMKVKWKIKETEVVVLIDCGATHNFIEENLVTTLSLLVTETSNYGVILGSGAAVKGKGICDNVEVRVGEWNVVDSFLPLEIG

A0A5D3BRX1 Ty3/gypsy retrotransposon protein2.2e-6669.27Show/hide
Query:  MRMITLKGVTTEEKRKEGPTKRLFDVGFQAWREKRLCFRCEEKYHAGHICKVKEQKELRMLVVRENGEELEIIEEFFDAETEAQTIEIGKVENLNIELSI
        MR ITL+ V T + R+EGPTKRL D  FQA REK LCFRC EKY AGH CK KE KELRMLVV+E GEELEI+EEFFDAETE + +E+  VENLNIELS+
Subjt:  MRMITLKGVTTEEKRKEGPTKRLFDVGFQAWREKRLCFRCEEKYHAGHICKVKEQKELRMLVVRENGEELEIIEEFFDAETEAQTIEIGKVENLNIELSI

Query:  NSIVGLSNLGTMKVKWKIKETEVVVLIDCGATHNFIEENLVTTLSLLVTETSNYGVILGSGAAVKGKGICDNVEVRVGEWNVVDSFLPLEIG
        NS+VGL+N GTMKVK ++ E EVV+LIDCGATHNFI ENLVT L L + ET  YGVILGS  AVKGKG+C +VEV++  W V DSFLPL++G
Subjt:  NSIVGLSNLGTMKVKWKIKETEVVVLIDCGATHNFIEENLVTTLSLLVTETSNYGVILGSGAAVKGKGICDNVEVRVGEWNVVDSFLPLEIG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G29750.1 Eukaryotic aspartyl protease family protein2.0e-0629.85Show/hide
Query:  IIEEFFDAETEAQTIEIGKVENLNIELSINSIVGLSNLGTMKVKWKIKETEVVVLIDCGATHNFIEENLVTTLSLLVTETSNYGVILGSGAAVKGKGICD
        +I E  + E ++ T+  G +E L I+L+ N          M+    I + +VVV ID GAT NFI   L  +L L  + T+   V+LG    ++  G C 
Subjt:  IIEEFFDAETEAQTIEIGKVENLNIELSINSIVGLSNLGTMKVKWKIKETEVVVLIDCGATHNFIEENLVTTLSLLVTETSNYGVILGSGAAVKGKGICD

Query:  NVEVRVGEWNVVDSFLPLEIGE------LMYYWV
         + + V E  + ++FL L++ +      L Y W+
Subjt:  NVEVRVGEWNVVDSFLPLEIGE------LMYYWV

AT3G30770.1 Eukaryotic aspartyl protease family protein5.7e-0635.71Show/hide
Query:  EVVVLIDCGATHNFIEENLVTTLSLLVTETSNYGVILGSGAAVKGKGICDNVEVRVGEWNVVDSFLPLEI
        +VVV+ID GAT+NFI + L   L L  + T+   V+LG    ++  G C  + + V E  + ++FL L++
Subjt:  EVVVLIDCGATHNFIEENLVTTLSLLVTETSNYGVILGSGAAVKGKGICDNVEVRVGEWNVVDSFLPLEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAATGATTACGTTAAAAGGGGTGACAACAGAGGAAAAGAGGAAGGAAGGTCCGACCAAACGTCTGTTCGACGTCGGATTTCAAGCATGGAGGGAAAAGAGATTGTG
TTTTCGATGTGAGGAGAAGTATCACGCGGGACACATATGTAAGGTGAAGGAACAAAAGGAATTGCGGATGCTAGTGGTGAGGGAAAATGGAGAAGAACTGGAAATTATTG
AGGAATTCTTTGACGCGGAAACAGAGGCCCAGACCATCGAAATAGGAAAAGTGGAGAATCTGAATATAGAGCTGTCCATTAATTCAATTGTGGGGCTATCCAATCTAGGA
ACAATGAAAGTAAAATGGAAGATCAAAGAGACCGAAGTGGTAGTACTCATTGACTGTGGAGCTACCCACAATTTTATTGAAGAGAATTTGGTAACAACCCTTAGCCTACT
AGTGACAGAAACATCCAACTATGGGGTGATTTTGGGATCAGGGGCTGCGGTCAAAGGAAAGGGAATATGCGACAATGTGGAAGTAAGGGTAGGAGAATGGAATGTTGTAG
ATAGCTTTTTACCCTTGGAAATAGGGGAGTTGATGTACTATTGGGTATGCAGTGGTTGCACTCATTGGGAGTGA
mRNA sequenceShow/hide mRNA sequence
CCCAGATGATGAAATTAGCCCTGAAGATTGAAAATAGAGAGATGGTTCGACGGGAATGTGAGCTAAGTAGTTTGTTTGGAGGGGGAATGTGGGCTAAGTTTGTTCGGAGG
GAATCAACAATTCAAACAAAATTATGTGAAGCCAACCTTGATTGTAACAAACAATAAGAAACAAACGGAAGGAAGCTGGCCGATGAGAATGATTACGTTAAAAGGGGTGA
CAACAGAGGAAAAGAGGAAGGAAGGTCCGACCAAACGTCTGTTCGACGTCGGATTTCAAGCATGGAGGGAAAAGAGATTGTGTTTTCGATGTGAGGAGAAGTATCACGCG
GGACACATATGTAAGGTGAAGGAACAAAAGGAATTGCGGATGCTAGTGGTGAGGGAAAATGGAGAAGAACTGGAAATTATTGAGGAATTCTTTGACGCGGAAACAGAGGC
CCAGACCATCGAAATAGGAAAAGTGGAGAATCTGAATATAGAGCTGTCCATTAATTCAATTGTGGGGCTATCCAATCTAGGAACAATGAAAGTAAAATGGAAGATCAAAG
AGACCGAAGTGGTAGTACTCATTGACTGTGGAGCTACCCACAATTTTATTGAAGAGAATTTGGTAACAACCCTTAGCCTACTAGTGACAGAAACATCCAACTATGGGGTG
ATTTTGGGATCAGGGGCTGCGGTCAAAGGAAAGGGAATATGCGACAATGTGGAAGTAAGGGTAGGAGAATGGAATGTTGTAGATAGCTTTTTACCCTTGGAAATAGGGGA
GTTGATGTACTATTGGGTATGCAGTGGTTGCACTCATTGGGAGTGACTGGAAAAATTTAGTGATGAGATTCCAACATGGTGGGAGGAAGATAGTGATAAAGGGGGATCCT
AGCTCACCGAAACTAGAGTGAGCTTAAAGTCTATGATGAAGACATGGGGAGTTGGTGATCATGGATATTTAGTGGAATGTCGGGCGATAGAAGGAAGGGTAGCCGTGGAA
GATTTAGATGATGAAGACGTGCTAGCCATTGTTATGATCTCCCCTCTGTTGAACAAATTTAGTGATGTGTCTGATTGACCGGAAGAACTACCTTCAAAAAGGGACATTGA
ACACCATATATACCTTAAAAAGGGAGTGGATCTCGTTAATGCCAGACCTTATCGTTACGCTCACCACCAAAAGGAAGAAATAGAAAAACTAGTTGATTAAATGCTAAAAA
CAGACATCATAAGGCCAAGTACCATTCCGTATTCTAGCCCAGTACTTCTCGTGAAAAAGAAAGACGAC
Protein sequenceShow/hide protein sequence
MRMITLKGVTTEEKRKEGPTKRLFDVGFQAWREKRLCFRCEEKYHAGHICKVKEQKELRMLVVRENGEELEIIEEFFDAETEAQTIEIGKVENLNIELSINSIVGLSNLG
TMKVKWKIKETEVVVLIDCGATHNFIEENLVTTLSLLVTETSNYGVILGSGAAVKGKGICDNVEVRVGEWNVVDSFLPLEIGELMYYWVCSGCTHWE