; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G09800 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G09800
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationChr4:7773905..7774360
RNA-Seq ExpressionCSPI04G09800
SyntenyCSPI04G09800
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0051392.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]6.4e-4160.28Show/hide
Query:  MFVVQANEEEWVIIDEMEDTNKELNMVEVMDTEQAVIELSINSVVGLSNPSTMNVKGSIKEKEVIILIDCGATHNFISTRVVEELQLATKNTSHYGVILG
        MFVV   +EE+ I++E E   K L M+EV D  +A +ELSINSVVGL++P TM V+G +++ E+II+IDCGATHNFIS ++V+ LQ+ TK T+HYGVILG
Subjt:  MFVVQANEEEWVIIDEMEDTNKELNMVEVMDTEQAVIELSINSVVGLSNPSTMNVKGSIKEKEVIILIDCGATHNFISTRVVEELQLATKNTSHYGVILG

Query:  SGTAVKGKGVCETVEVKLGGWKLTANFLPLELGGVDVVLGM
        SGTA++GKGVC+ VE++L  W L  +FLPLELGG DV+LGM
Subjt:  SGTAVKGKGVCETVEVKLGGWKLTANFLPLELGGVDVVLGM

KAA0052232.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]1.4e-4060.28Show/hide
Query:  MFVVQANEEEWVIIDEMEDTNKELNMVEVMDTEQAVIELSINSVVGLSNPSTMNVKGSIKEKEVIILIDCGATHNFISTRVVEELQLATKNTSHYGVILG
        MFVV+ N EE  I++E E  N EL + EV       +ELSINSVVGL++P TM VKGS++ KEV+ILIDCGATHNF+S +++  LQL  K T+HYGVILG
Subjt:  MFVVQANEEEWVIIDEMEDTNKELNMVEVMDTEQAVIELSINSVVGLSNPSTMNVKGSIKEKEVIILIDCGATHNFISTRVVEELQLATKNTSHYGVILG

Query:  SGTAVKGKGVCETVEVKLGGWKLTANFLPLELGGVDVVLGM
        SGTA++GKG+CE+VE+++  W +  +FLPLELGGVDV+LGM
Subjt:  SGTAVKGKGVCETVEVKLGGWKLTANFLPLELGGVDVVLGM

KAA0054961.1 Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa]8.4e-4159.18Show/hide
Query:  MFVVQANEEEWVIIDEMEDTNKELNMVEVMDTEQAVIELSINSVVGLSNPSTMNVKGSIKEKEVIILIDCGATHNFISTRVVEELQLATKNTSHYGVILG
        MFVV +N EE  I++E+E  +KEL M EV D   A +ELSINSVVGL++P TM V+G ++++EV+ILID GATHNF+S ++V+ L+L  K T+HYGVILG
Subjt:  MFVVQANEEEWVIIDEMEDTNKELNMVEVMDTEQAVIELSINSVVGLSNPSTMNVKGSIKEKEVIILIDCGATHNFISTRVVEELQLATKNTSHYGVILG

Query:  SGTAVKGKGVCETVEVKLGGWKLTANFLPLELGGVDVVLGMHGSTLL
        S TA++GKGVCE++EVK+  WK+  +FLPLELGGVD++LGM    LL
Subjt:  SGTAVKGKGVCETVEVKLGGWKLTANFLPLELGGVDVVLGMHGSTLL

KAA0058383.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]1.9e-4060.99Show/hide
Query:  MFVVQANEEEWVIIDEMEDTNKELNMVEVMDTEQAVIELSINSVVGLSNPSTMNVKGSIKEKEVIILIDCGATHNFISTRVVEELQLATKNTSHYGVILG
        MFVV    EE+ I++E E   KEL  +EV++  Q   ELSINSVVGL++P TM V+G + +KE+I++IDCGATHNFIS ++V+ L+LATK T+HYGVILG
Subjt:  MFVVQANEEEWVIIDEMEDTNKELNMVEVMDTEQAVIELSINSVVGLSNPSTMNVKGSIKEKEVIILIDCGATHNFISTRVVEELQLATKNTSHYGVILG

Query:  SGTAVKGKGVCETVEVKLGGWKLTANFLPLELGGVDVVLGM
        SGT ++GKGVCE VE++L   K+T  FLPLELGGVDV+LGM
Subjt:  SGTAVKGKGVCETVEVKLGGWKLTANFLPLELGGVDVVLGM

XP_031745972.1 uncharacterized protein LOC116406393 [Cucumis sativus]1.5e-4266.67Show/hide
Query:  MFVVQANEEEWVIIDEMEDTNKELNMVEVMDTEQAVIELSINSVVGLSNPSTMNVKGSIKEKEVIILIDCGATHNFISTRVVEELQLATKNTSHYGVILG
        M+VV+  +EE+ I++E E    ELN VE+   +QA++ELSINSVVGL+NP TM V+G IK++EVIILIDCGATHNFIS +VV+EL L TK TSHYGVILG
Subjt:  MFVVQANEEEWVIIDEMEDTNKELNMVEVMDTEQAVIELSINSVVGLSNPSTMNVKGSIKEKEVIILIDCGATHNFISTRVVEELQLATKNTSHYGVILG

Query:  SGTAVKGKGVCETVEVKLGGWKLTANFLPLELGGVDVVLGM
        S  AVKGKG+CE +E++L GWK+ ANFLPLELGGVD VL M
Subjt:  SGTAVKGKGVCETVEVKLGGWKLTANFLPLELGGVDVVLGM

TrEMBL top hitse value%identityAlignment
A0A5A7U6F1 Ty3-gypsy retrotransposon protein3.1e-4160.28Show/hide
Query:  MFVVQANEEEWVIIDEMEDTNKELNMVEVMDTEQAVIELSINSVVGLSNPSTMNVKGSIKEKEVIILIDCGATHNFISTRVVEELQLATKNTSHYGVILG
        MFVV   +EE+ I++E E   K L M+EV D  +A +ELSINSVVGL++P TM V+G +++ E+II+IDCGATHNFIS ++V+ LQ+ TK T+HYGVILG
Subjt:  MFVVQANEEEWVIIDEMEDTNKELNMVEVMDTEQAVIELSINSVVGLSNPSTMNVKGSIKEKEVIILIDCGATHNFISTRVVEELQLATKNTSHYGVILG

Query:  SGTAVKGKGVCETVEVKLGGWKLTANFLPLELGGVDVVLGM
        SGTA++GKGVC+ VE++L  W L  +FLPLELGG DV+LGM
Subjt:  SGTAVKGKGVCETVEVKLGGWKLTANFLPLELGGVDVVLGM

A0A5A7U908 Transposon Tf2-1 polyprotein isoform X16.9e-4160.28Show/hide
Query:  MFVVQANEEEWVIIDEMEDTNKELNMVEVMDTEQAVIELSINSVVGLSNPSTMNVKGSIKEKEVIILIDCGATHNFISTRVVEELQLATKNTSHYGVILG
        MFVV+ N EE  I++E E  N EL + EV       +ELSINSVVGL++P TM VKGS++ KEV+ILIDCGATHNF+S +++  LQL  K T+HYGVILG
Subjt:  MFVVQANEEEWVIIDEMEDTNKELNMVEVMDTEQAVIELSINSVVGLSNPSTMNVKGSIKEKEVIILIDCGATHNFISTRVVEELQLATKNTSHYGVILG

Query:  SGTAVKGKGVCETVEVKLGGWKLTANFLPLELGGVDVVLGM
        SGTA++GKG+CE+VE+++  W +  +FLPLELGGVDV+LGM
Subjt:  SGTAVKGKGVCETVEVKLGGWKLTANFLPLELGGVDVVLGM

A0A5A7UN12 Transposon Ty3-I Gag-Pol polyprotein4.0e-4159.18Show/hide
Query:  MFVVQANEEEWVIIDEMEDTNKELNMVEVMDTEQAVIELSINSVVGLSNPSTMNVKGSIKEKEVIILIDCGATHNFISTRVVEELQLATKNTSHYGVILG
        MFVV +N EE  I++E+E  +KEL M EV D   A +ELSINSVVGL++P TM V+G ++++EV+ILID GATHNF+S ++V+ L+L  K T+HYGVILG
Subjt:  MFVVQANEEEWVIIDEMEDTNKELNMVEVMDTEQAVIELSINSVVGLSNPSTMNVKGSIKEKEVIILIDCGATHNFISTRVVEELQLATKNTSHYGVILG

Query:  SGTAVKGKGVCETVEVKLGGWKLTANFLPLELGGVDVVLGMHGSTLL
        S TA++GKGVCE++EVK+  WK+  +FLPLELGGVD++LGM    LL
Subjt:  SGTAVKGKGVCETVEVKLGGWKLTANFLPLELGGVDVVLGMHGSTLL

A0A5D3DFC8 Ty3-gypsy retrotransposon protein1.2e-4059.57Show/hide
Query:  MFVVQANEEEWVIIDEMEDTNKELNMVEVMDTEQAVIELSINSVVGLSNPSTMNVKGSIKEKEVIILIDCGATHNFISTRVVEELQLATKNTSHYGVILG
        MFVV   +EE+ I++E E   K L M+EV D  +A +ELSINSVVGL++P TM V+G +++ E+II+IDCGATHNFIS ++V+ LQ+ TK T+HYGV LG
Subjt:  MFVVQANEEEWVIIDEMEDTNKELNMVEVMDTEQAVIELSINSVVGLSNPSTMNVKGSIKEKEVIILIDCGATHNFISTRVVEELQLATKNTSHYGVILG

Query:  SGTAVKGKGVCETVEVKLGGWKLTANFLPLELGGVDVVLGM
        SGTA++GKGVC+ VE++L  W L  +FLPLELGG DV+LGM
Subjt:  SGTAVKGKGVCETVEVKLGGWKLTANFLPLELGGVDVVLGM

A0A5D3DTK7 Transposon Tf2-1 polyprotein isoform X19.0e-4160.99Show/hide
Query:  MFVVQANEEEWVIIDEMEDTNKELNMVEVMDTEQAVIELSINSVVGLSNPSTMNVKGSIKEKEVIILIDCGATHNFISTRVVEELQLATKNTSHYGVILG
        MFVV    EE+ I++E E   KEL  +EV++  Q   ELSINSVVGL++P TM V+G + +KE+I++IDCGATHNFIS ++V+ L+LATK T+HYGVILG
Subjt:  MFVVQANEEEWVIIDEMEDTNKELNMVEVMDTEQAVIELSINSVVGLSNPSTMNVKGSIKEKEVIILIDCGATHNFISTRVVEELQLATKNTSHYGVILG

Query:  SGTAVKGKGVCETVEVKLGGWKLTANFLPLELGGVDVVLGM
        SGT ++GKGVCE VE++L   K+T  FLPLELGGVDV+LGM
Subjt:  SGTAVKGKGVCETVEVKLGGWKLTANFLPLELGGVDVVLGM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G29750.1 Eukaryotic aspartyl protease family protein2.0e-0831.54Show/hide
Query:  IIDEMEDTNKELNMVEVMDTEQAVIELSINSVVGLSNPSTMNVKGSIKEKEVIILIDCGATHNFISTRVVEELQLATKNTSHYGVILGSGTAVKGKGVCE
        +I+E+E+  ++   +     EQ VI+L+ N          M   G I + +V++ ID GAT NFI   +   L+L T  T+   V+LG    ++  G C 
Subjt:  IIDEMEDTNKELNMVEVMDTEQAVIELSINSVVGLSNPSTMNVKGSIKEKEVIILIDCGATHNFISTRVVEELQLATKNTSHYGVILGSGTAVKGKGVCE

Query:  TVEVKLGGWKLTANFLPLELG--GVDVVLG
         + + +   ++T NFL L+L    VDV+LG
Subjt:  TVEVKLGGWKLTANFLPLELG--GVDVVLG

AT3G30770.1 Eukaryotic aspartyl protease family protein2.0e-0829.66Show/hide
Query:  VMDTEQAVIELSINSVVGLSNPSTMNVKGSIKEKEVIILIDCGATHNFISTRVVEELQLATKNTSHYGVILGSGTAVKGKGVCETVEVKLGGWKLTANFL
        +++  + + ++   S    +    M   G I   +V+++ID GAT+NFIS  +   L+L T  T+   V+LG    ++  G C  + + +   ++  NFL
Subjt:  VMDTEQAVIELSINSVVGLSNPSTMNVKGSIKEKEVIILIDCGATHNFISTRVVEELQLATKNTSHYGVILGSGTAVKGKGVCETVEVKLGGWKLTANFL

Query:  PLEL--GGVDVVLGMHGS
         L+L    VDV+LG  GS
Subjt:  PLEL--GGVDVVLGMHGS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTGTTGTTCAAGCTAACGAGGAAGAATGGGTAATCATAGATGAGATGGAAGACACCAATAAGGAACTAAACATGGTGGAAGTGATGGATACAGAGCAAGCTGTTAT
AGAGTTGTCTATCAACTCAGTTGTAGGGTTGTCTAACCCAAGTACTATGAACGTCAAAGGAAGCATCAAAGAAAAGGAAGTAATAATTCTGATCGATTGTGGAGCTACCC
ACAATTTCATCTCTACACGAGTGGTCGAAGAACTACAGCTAGCAACGAAAAATACCTCCCATTATGGAGTTATTTTGGGATCCGGCACTGCAGTAAAAGGAAAGGGAGTC
TGTGAAACAGTAGAAGTGAAGCTGGGCGGCTGGAAATTAACGGCTAATTTCTTACCGTTGGAATTAGGAGGAGTAGACGTCGTGTTGGGAATGCATGGCTCTACTCTCTT
GGCATCACTGAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTTGTTGTTCAAGCTAACGAGGAAGAATGGGTAATCATAGATGAGATGGAAGACACCAATAAGGAACTAAACATGGTGGAAGTGATGGATACAGAGCAAGCTGTTAT
AGAGTTGTCTATCAACTCAGTTGTAGGGTTGTCTAACCCAAGTACTATGAACGTCAAAGGAAGCATCAAAGAAAAGGAAGTAATAATTCTGATCGATTGTGGAGCTACCC
ACAATTTCATCTCTACACGAGTGGTCGAAGAACTACAGCTAGCAACGAAAAATACCTCCCATTATGGAGTTATTTTGGGATCCGGCACTGCAGTAAAAGGAAAGGGAGTC
TGTGAAACAGTAGAAGTGAAGCTGGGCGGCTGGAAATTAACGGCTAATTTCTTACCGTTGGAATTAGGAGGAGTAGACGTCGTGTTGGGAATGCATGGCTCTACTCTCTT
GGCATCACTGAAGTAG
Protein sequenceShow/hide protein sequence
MFVVQANEEEWVIIDEMEDTNKELNMVEVMDTEQAVIELSINSVVGLSNPSTMNVKGSIKEKEVIILIDCGATHNFISTRVVEELQLATKNTSHYGVILGSGTAVKGKGV
CETVEVKLGGWKLTANFLPLELGGVDVVLGMHGSTLLASLK