; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G16377 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G16377
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionTy3/gypsy retrotransposon protein
Genome locationctg2279:111973..123992
RNA-Seq ExpressionCucsat.G16377
SyntenyCucsat.G16377
Gene Ontology termsGO:0005975 - carbohydrate metabolic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
GO:0030246 - carbohydrate binding (molecular function)
GO:0047938 - glucose-6-phosphate 1-epimerase activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057388.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]4.75e-19362.53Show/hide
Query:  MPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPALNWYRSQEERDKFTCWLNLKERLLIRFRSSREGSLYGRFLRIQQKSSVEEYRNLFDKW
        MPVF G+DP+SWLFRAERYFQIHKLT+SEK+ V+TI F GPALNWYRSQEER+KF  W NLKERLL+RF+S+REG++ GRFLRIQQ+++VEEYRN FDK 
Subjt:  MPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPALNWYRSQEERDKFTCWLNLKERLLIRFRSSREGSLYGRFLRIQQKSSVEEYRNLFDKW

Query:  VAPLSDIPEKIVEETFMGGLLPWIKVETEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGTKFSSSSYPTTKTYGGVKEQGNKENTVFPIRTITLRA
        VAPLSD+ +++VEETFM GL PWI+ E   C P GLAE M  AQ+VE REILR  ANL  Y G K S+ +    K     + + +K N  FPIRTITL++
Subjt:  VAPLSDIPEKIVEETFMGGLLPWIKVETEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGTKFSSSSYPTTKTYGGVKEQGNKENTVFPIRTITLRA

Query:  SPAKEVRKEGPSRRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAREIRELRMFVVRADDVEEEIIEEDEYNLEDLKAMELQHEPGEVVELCINSVVGLTN
            E+RKEG S+R+ DAEFQ ++EKGLCFKC+EKY + HKCK +E RELRMFVV+ D+ E EI+EE E    +++  E+Q      VEL INSVVGL +
Subjt:  SPAKEVRKEGPSRRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAREIRELRMFVVRADDVEEEIIEEDEYNLEDLKAMELQHEPGEVVELCINSVVGLTN

Query:  PGTMKIRGTIQSMEVVVLVDCGATHNFISDRLVKTLKITTKDTANYGVILGSGTAIKGKGVCEKVELNLSGWTVVENFLPLELGGVDLILGMQWLHSLGV
        PGTMK++G++Q  EVV+L+DCGATHNF+S+++V +L++  K+TA+YGVILGSGTAI+GKG+CE VE+ +  WTV E+FLPLELGGVD+ILGMQWL+SLGV
Subjt:  PGTMKIRGTIQSMEVVVLVDCGATHNFISDRLVKTLKITTKDTANYGVILGSGTAIKGKGVCEKVELNLSGWTVVENFLPLELGGVDLILGMQWLHSLGV

Query:  TEMDWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLK
        T  DW+NLT++F+ N K++ +KGDPSLTK +VSLK
Subjt:  TEMDWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLK

TYJ96875.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.84e-19265.06Show/hide
Query:  MPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPALNWYRSQEERDKFTCWLNLKERLLIRFRSSREGSLYGRFLRIQQKSSVEEYRNLFDKW
        MPVF G+DP+SWLFRAERYFQIHKLT+SEK+ V+TI F+GPALNWYR+QEER+KF  W NLKERLLIRF+S+REG+ +GRFLRIQQ+++VEEYRNLFDK 
Subjt:  MPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPALNWYRSQEERDKFTCWLNLKERLLIRFRSSREGSLYGRFLRIQQKSSVEEYRNLFDKW

Query:  VAPLSDIPEKIVEETFMGGLLPWIKVETEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGTKFSSSSYPTTKTYGGVKEQGNKENTVFPIRTITLRA
        VAPLSD+ +++VEETFM GL PWI+ E   C P GLAEMMR AQ+VE RE+LR  ANL GY G K S+ +   TK Y   + + NK N  FPIRTITL++
Subjt:  VAPLSDIPEKIVEETFMGGLLPWIKVETEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGTKFSSSSYPTTKTYGGVKEQGNKENTVFPIRTITLRA

Query:  SPAKEVRKEGPSRRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAREIRELRMFVVRADDVEEEIIEEDEYNLEDLKAMELQHEPGEVVELCINSVVGLTN
          + E RKEG S+RL DAEFQ +REKGLCFKC+EKY + HKCK RE RELRMFVV+ ++ E EI+EE E +  +L+ +E++ +    VEL INSVVGL +
Subjt:  SPAKEVRKEGPSRRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAREIRELRMFVVRADDVEEEIIEEDEYNLEDLKAMELQHEPGEVVELCINSVVGLTN

Query:  PGTMKIRGTIQSMEVVVLVDCGATHNFISDRLVKTLKITTKDTANYGVILGSGTAIKGKGVCEKVELNLSGWTVVENFLPLELGGVDLILGMQWLHSLGV
        PGTMK+RGT+Q  EVV+L+DCGATHNF+S++LV TL++  K+TA+YGVILGSGTAI+GKG+CE +E+ +  WTV E+FLPLELGGVD+ILGMQWL+SLGV
Subjt:  PGTMKIRGTIQSMEVVVLVDCGATHNFISDRLVKTLKITTKDTANYGVILGSGTAIKGKGVCEKVELNLSGWTVVENFLPLELGGVDLILGMQWLHSLGV

Query:  TEMDWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLK
        T  DW+NLT++F+ + KK+ +KGDPSLTK +VSLK
Subjt:  TEMDWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLK

TYK21115.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]3.98e-20265.06Show/hide
Query:  MPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPALNWYRSQEERDKFTCWLNLKERLLIRFRSSREGSLYGRFLRIQQKSSVEEYRNLFDKW
        MPVF G+DP+SWLFRAERYFQIHKLT+SEK+ V+TI F+GPALNWYR+QEER+KF  W NLKERLLIRF+S+REG+ +GRFLRIQQ+++VEEYRNLFDK 
Subjt:  MPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPALNWYRSQEERDKFTCWLNLKERLLIRFRSSREGSLYGRFLRIQQKSSVEEYRNLFDKW

Query:  VAPLSDIPEKIVEETFMGGLLPWIKVETEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGTKFSSSSYPTTKTYGGVKEQGNKENTVFPIRTITLRA
        VAPLSD+ +++VEETFM GL PWI+ E   C P GLAEMMR AQ+VE RE+LR  ANL GY G K S+ +   TK Y   + + NK N  FPIRTITL++
Subjt:  VAPLSDIPEKIVEETFMGGLLPWIKVETEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGTKFSSSSYPTTKTYGGVKEQGNKENTVFPIRTITLRA

Query:  SPAKEVRKEGPSRRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAREIRELRMFVVRADDVEEEIIEEDEYNLEDLKAMELQHEPGEVVELCINSVVGLTN
          + E RKEG S+RL DAEFQ +REKGLCFKC+EKY + HKCK RE RELRMFVV+ ++ E EI+EE E +  +L+ +E++ +    VEL INSVVGL +
Subjt:  SPAKEVRKEGPSRRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAREIRELRMFVVRADDVEEEIIEEDEYNLEDLKAMELQHEPGEVVELCINSVVGLTN

Query:  PGTMKIRGTIQSMEVVVLVDCGATHNFISDRLVKTLKITTKDTANYGVILGSGTAIKGKGVCEKVELNLSGWTVVENFLPLELGGVDLILGMQWLHSLGV
        PGTMK+RGT+Q  EVV+L+DCGATHNF+S++LV TL++  K+TA+YGVILGSGTAI+GKG+CE +E+ +  WTV E+FLPLELGGVD+ILGMQWL+SLGV
Subjt:  PGTMKIRGTIQSMEVVVLVDCGATHNFISDRLVKTLKITTKDTANYGVILGSGTAIKGKGVCEKVELNLSGWTVVENFLPLELGGVDLILGMQWLHSLGV

Query:  TEMDWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLK
        T  DW+NLT++F+ + KK+ +KGDPSLTK +VSLK
Subjt:  TEMDWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLK

TYK28503.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.82e-19361.84Show/hide
Query:  MPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPALNWYRSQEERDKFTCWLNLKERLLIRFRSSREGSLYGRFLRIQQKSSVEEYRNLFDKW
        MPVF G+DPDSWLFRAERYFQIHKLTDSEK+ V+T+SF+GPALNW+RSQEERD+FT W N+KERLL+RFRS+++G++ G+FLR++Q+S+VEEY NLFDK 
Subjt:  MPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPALNWYRSQEERDKFTCWLNLKERLLIRFRSSREGSLYGRFLRIQQKSSVEEYRNLFDKW

Query:  VAPLSDIPEKIVEETFMGGLLPWIKVETEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGTKFSSSSYPTTKTYGGVKEQGNKENTVFPIRTITLRA
        VAP++D+PE+++++TFM GLL W++ E  F  P  LAEMM  AQMVE REI R EA + GYS  K +  +    KT  G     NK NTVFPIRTITLR+
Subjt:  VAPLSDIPEKIVEETFMGGLLPWIKVETEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGTKFSSSSYPTTKTYGGVKEQGNKENTVFPIRTITLRA

Query:  SPAKEVRKEGPSRRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAREIRELRMFVVRADDVEEEIIEEDEYNLEDLKAMELQHEPGEVVELCINSVVGLTN
        S   E R+EG  +RLSDAEFQA++EKGLCF+C+E+Y + HKC+ RE RELRMFVV A+  E EI+EE++   E L  +E+  +   VVEL INSVVGL +
Subjt:  SPAKEVRKEGPSRRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAREIRELRMFVVRADDVEEEIIEEDEYNLEDLKAMELQHEPGEVVELCINSVVGLTN

Query:  PGTMKIRGTIQSMEVVVLVDCGATHNFISDRLVKTLKITTKDTANYGVILGSGTAIKGKGVCEKVELNLSGWTVVENFLPLELGGVDLILGMQWLHSLGV
        PGTMK+RG +   EV++L+DCGATHNF+S++LVK L +  K+T++YGVILGSG A++GKG+CEK+E+ LS W +VE+FLPLELGGVD+ILGMQWL+SLGV
Subjt:  PGTMKIRGTIQSMEVVVLVDCGATHNFISDRLVKTLKITTKDTANYGVILGSGTAIKGKGVCEKVELNLSGWTVVENFLPLELGGVDLILGMQWLHSLGV

Query:  TEMDWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLK
        T +DW+NL+++F  + K+V +KGDPSLTK ++SLK
Subjt:  TEMDWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLK

XP_031745972.1 uncharacterized protein LOC116406393 [Cucumis sativus]2.48e-20165.06Show/hide
Query:  MPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPALNWYRSQEERDKFTCWLNLKERLLIRFRSSREGSLYGRFLRIQQKSSVEEYRNLFDKW
        MPVF+G+DPDSWLFRA+RYFQIHKL+D+EK+ VATISFEGPALNWYR+QEERDKFT W NLKERLL+RFRSSREGS+ G+FLRI+Q+++VEEYRN FD+ 
Subjt:  MPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPALNWYRSQEERDKFTCWLNLKERLLIRFRSSREGSLYGRFLRIQQKSSVEEYRNLFDKW

Query:  VAPLSDIPEKIVEETFMGGLLPWIKVETEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGTKFSSSSYPTTKTYGGVKEQGNKENTVFPIRTITLRA
        +APL+D+ +++VEETFM GL PWIK E  FC PVGLAEMM  AQ+VE REI+R+EANL GY+  K+   +    ++   +    +K NT+FPIRT+TLR 
Subjt:  VAPLSDIPEKIVEETFMGGLLPWIKVETEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGTKFSSSSYPTTKTYGGVKEQGNKENTVFPIRTITLRA

Query:  SPAKEVRKEGPSRRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAREIRELRMFVVRADDVEEEIIEEDEYNLEDLKAMELQHEPGEVVELCINSVVGLTN
        +   EV+KEGP++RL DAEFQA++EKGLCF+C+EKY+ GH+CK RE RELRM+VV+ D+ E EI+EE E++  +L  +E+  E   +VEL INSVVGLTN
Subjt:  SPAKEVRKEGPSRRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAREIRELRMFVVRADDVEEEIIEEDEYNLEDLKAMELQHEPGEVVELCINSVVGLTN

Query:  PGTMKIRGTIQSMEVVVLVDCGATHNFISDRLVKTLKITTKDTANYGVILGSGTAIKGKGVCEKVELNLSGWTVVENFLPLELGGVDLILGMQWLHSLGV
        PGTMK+RG I+  EV++L+DCGATHNFISD++V+ L + TK T++YGVILGS  A+KGKG+CE +EL L GW V  NFLPLELGGVD +L MQWL+SLGV
Subjt:  PGTMKIRGTIQSMEVVVLVDCGATHNFISDRLVKTLKITTKDTANYGVILGSGTAIKGKGVCEKVELNLSGWTVVENFLPLELGGVDLILGMQWLHSLGV

Query:  TEMDWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLK
        TE+DW+NLTM+F HN KKV +KGDPSLTK  V LK
Subjt:  TEMDWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLK

TrEMBL top hitse value%identityAlignment
A0A5A7UQI8 Transposon Tf2-1 polyprotein isoform X12.30e-19362.53Show/hide
Query:  MPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPALNWYRSQEERDKFTCWLNLKERLLIRFRSSREGSLYGRFLRIQQKSSVEEYRNLFDKW
        MPVF G+DP+SWLFRAERYFQIHKLT+SEK+ V+TI F GPALNWYRSQEER+KF  W NLKERLL+RF+S+REG++ GRFLRIQQ+++VEEYRN FDK 
Subjt:  MPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPALNWYRSQEERDKFTCWLNLKERLLIRFRSSREGSLYGRFLRIQQKSSVEEYRNLFDKW

Query:  VAPLSDIPEKIVEETFMGGLLPWIKVETEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGTKFSSSSYPTTKTYGGVKEQGNKENTVFPIRTITLRA
        VAPLSD+ +++VEETFM GL PWI+ E   C P GLAE M  AQ+VE REILR  ANL  Y G K S+ +    K     + + +K N  FPIRTITL++
Subjt:  VAPLSDIPEKIVEETFMGGLLPWIKVETEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGTKFSSSSYPTTKTYGGVKEQGNKENTVFPIRTITLRA

Query:  SPAKEVRKEGPSRRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAREIRELRMFVVRADDVEEEIIEEDEYNLEDLKAMELQHEPGEVVELCINSVVGLTN
            E+RKEG S+R+ DAEFQ ++EKGLCFKC+EKY + HKCK +E RELRMFVV+ D+ E EI+EE E    +++  E+Q      VEL INSVVGL +
Subjt:  SPAKEVRKEGPSRRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAREIRELRMFVVRADDVEEEIIEEDEYNLEDLKAMELQHEPGEVVELCINSVVGLTN

Query:  PGTMKIRGTIQSMEVVVLVDCGATHNFISDRLVKTLKITTKDTANYGVILGSGTAIKGKGVCEKVELNLSGWTVVENFLPLELGGVDLILGMQWLHSLGV
        PGTMK++G++Q  EVV+L+DCGATHNF+S+++V +L++  K+TA+YGVILGSGTAI+GKG+CE VE+ +  WTV E+FLPLELGGVD+ILGMQWL+SLGV
Subjt:  PGTMKIRGTIQSMEVVVLVDCGATHNFISDRLVKTLKITTKDTANYGVILGSGTAIKGKGVCEKVELNLSGWTVVENFLPLELGGVDLILGMQWLHSLGV

Query:  TEMDWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLK
        T  DW+NLT++F+ N K++ +KGDPSLTK +VSLK
Subjt:  TEMDWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLK

A0A5A7VJA0 Ty3/gypsy retrotransposon protein2.71e-19265.06Show/hide
Query:  MPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPALNWYRSQEERDKFTCWLNLKERLLIRFRSSREGSLYGRFLRIQQKSSVEEYRNLFDKW
        MPVF G+DP+SWLFRAERYFQIHKLT+SEK+ V+TI F+GPALNWYR+QEER+KF  W NLKERLLIRF+S+REG+ +GRFLRIQQ+++VEEYRNLFDK 
Subjt:  MPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPALNWYRSQEERDKFTCWLNLKERLLIRFRSSREGSLYGRFLRIQQKSSVEEYRNLFDKW

Query:  VAPLSDIPEKIVEETFMGGLLPWIKVETEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGTKFSSSSYPTTKTYGGVKEQGNKENTVFPIRTITLRA
        VAPLSD+ +++VEETFM GL PWI+ E   C P GLAEMMR AQ+VE RE+LR  ANL GY G K S+ +   TK Y   + + NK N  FPIRTITL++
Subjt:  VAPLSDIPEKIVEETFMGGLLPWIKVETEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGTKFSSSSYPTTKTYGGVKEQGNKENTVFPIRTITLRA

Query:  SPAKEVRKEGPSRRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAREIRELRMFVVRADDVEEEIIEEDEYNLEDLKAMELQHEPGEVVELCINSVVGLTN
          + E RKEG S+RL DAEFQ +REKGLCFKC+EKY + HKCK RE RELRMFVV+ ++ E EI+EE E +  +L+ +E++ +    VEL INSVVGL +
Subjt:  SPAKEVRKEGPSRRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAREIRELRMFVVRADDVEEEIIEEDEYNLEDLKAMELQHEPGEVVELCINSVVGLTN

Query:  PGTMKIRGTIQSMEVVVLVDCGATHNFISDRLVKTLKITTKDTANYGVILGSGTAIKGKGVCEKVELNLSGWTVVENFLPLELGGVDLILGMQWLHSLGV
        PGTMK+RGT+Q  EVV+L+DCGATHNF+S++LV TL++  K+TA+YGVILGSGTAI+GKG+CE +E+ +  WTV E+FLPLELGGVD+ILGMQWL+SLGV
Subjt:  PGTMKIRGTIQSMEVVVLVDCGATHNFISDRLVKTLKITTKDTANYGVILGSGTAIKGKGVCEKVELNLSGWTVVENFLPLELGGVDLILGMQWLHSLGV

Query:  TEMDWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLK
        T  DW+NLT++F+ + KK+ +KGDPSLTK +VSLK
Subjt:  TEMDWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLK

A0A5D3BEL2 Ty3/gypsy retrotransposon protein1.38e-19265.06Show/hide
Query:  MPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPALNWYRSQEERDKFTCWLNLKERLLIRFRSSREGSLYGRFLRIQQKSSVEEYRNLFDKW
        MPVF G+DP+SWLFRAERYFQIHKLT+SEK+ V+TI F+GPALNWYR+QEER+KF  W NLKERLLIRF+S+REG+ +GRFLRIQQ+++VEEYRNLFDK 
Subjt:  MPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPALNWYRSQEERDKFTCWLNLKERLLIRFRSSREGSLYGRFLRIQQKSSVEEYRNLFDKW

Query:  VAPLSDIPEKIVEETFMGGLLPWIKVETEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGTKFSSSSYPTTKTYGGVKEQGNKENTVFPIRTITLRA
        VAPLSD+ +++VEETFM GL PWI+ E   C P GLAEMMR AQ+VE RE+LR  ANL GY G K S+ +   TK Y   + + NK N  FPIRTITL++
Subjt:  VAPLSDIPEKIVEETFMGGLLPWIKVETEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGTKFSSSSYPTTKTYGGVKEQGNKENTVFPIRTITLRA

Query:  SPAKEVRKEGPSRRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAREIRELRMFVVRADDVEEEIIEEDEYNLEDLKAMELQHEPGEVVELCINSVVGLTN
          + E RKEG S+RL DAEFQ +REKGLCFKC+EKY + HKCK RE RELRMFVV+ ++ E EI+EE E +  +L+ +E++ +    VEL INSVVGL +
Subjt:  SPAKEVRKEGPSRRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAREIRELRMFVVRADDVEEEIIEEDEYNLEDLKAMELQHEPGEVVELCINSVVGLTN

Query:  PGTMKIRGTIQSMEVVVLVDCGATHNFISDRLVKTLKITTKDTANYGVILGSGTAIKGKGVCEKVELNLSGWTVVENFLPLELGGVDLILGMQWLHSLGV
        PGTMK+RGT+Q  EVV+L+DCGATHNF+S++LV TL++  K+TA+YGVILGSGTAI+GKG+CE +E+ +  WTV E+FLPLELGGVD+ILGMQWL+SLGV
Subjt:  PGTMKIRGTIQSMEVVVLVDCGATHNFISDRLVKTLKITTKDTANYGVILGSGTAIKGKGVCEKVELNLSGWTVVENFLPLELGGVDLILGMQWLHSLGV

Query:  TEMDWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLK
        T  DW+NLT++F+ + KK+ +KGDPSLTK +VSLK
Subjt:  TEMDWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLK

A0A5D3DC20 Transposon Tf2-1 polyprotein isoform X11.93e-20265.06Show/hide
Query:  MPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPALNWYRSQEERDKFTCWLNLKERLLIRFRSSREGSLYGRFLRIQQKSSVEEYRNLFDKW
        MPVF G+DP+SWLFRAERYFQIHKLT+SEK+ V+TI F+GPALNWYR+QEER+KF  W NLKERLLIRF+S+REG+ +GRFLRIQQ+++VEEYRNLFDK 
Subjt:  MPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPALNWYRSQEERDKFTCWLNLKERLLIRFRSSREGSLYGRFLRIQQKSSVEEYRNLFDKW

Query:  VAPLSDIPEKIVEETFMGGLLPWIKVETEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGTKFSSSSYPTTKTYGGVKEQGNKENTVFPIRTITLRA
        VAPLSD+ +++VEETFM GL PWI+ E   C P GLAEMMR AQ+VE RE+LR  ANL GY G K S+ +   TK Y   + + NK N  FPIRTITL++
Subjt:  VAPLSDIPEKIVEETFMGGLLPWIKVETEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGTKFSSSSYPTTKTYGGVKEQGNKENTVFPIRTITLRA

Query:  SPAKEVRKEGPSRRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAREIRELRMFVVRADDVEEEIIEEDEYNLEDLKAMELQHEPGEVVELCINSVVGLTN
          + E RKEG S+RL DAEFQ +REKGLCFKC+EKY + HKCK RE RELRMFVV+ ++ E EI+EE E +  +L+ +E++ +    VEL INSVVGL +
Subjt:  SPAKEVRKEGPSRRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAREIRELRMFVVRADDVEEEIIEEDEYNLEDLKAMELQHEPGEVVELCINSVVGLTN

Query:  PGTMKIRGTIQSMEVVVLVDCGATHNFISDRLVKTLKITTKDTANYGVILGSGTAIKGKGVCEKVELNLSGWTVVENFLPLELGGVDLILGMQWLHSLGV
        PGTMK+RGT+Q  EVV+L+DCGATHNF+S++LV TL++  K+TA+YGVILGSGTAI+GKG+CE +E+ +  WTV E+FLPLELGGVD+ILGMQWL+SLGV
Subjt:  PGTMKIRGTIQSMEVVVLVDCGATHNFISDRLVKTLKITTKDTANYGVILGSGTAIKGKGVCEKVELNLSGWTVVENFLPLELGGVDLILGMQWLHSLGV

Query:  TEMDWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLK
        T  DW+NLT++F+ + KK+ +KGDPSLTK +VSLK
Subjt:  TEMDWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLK

A0A5D3DXQ1 Ty3/gypsy retrotransposon protein1.36e-19361.84Show/hide
Query:  MPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPALNWYRSQEERDKFTCWLNLKERLLIRFRSSREGSLYGRFLRIQQKSSVEEYRNLFDKW
        MPVF G+DPDSWLFRAERYFQIHKLTDSEK+ V+T+SF+GPALNW+RSQEERD+FT W N+KERLL+RFRS+++G++ G+FLR++Q+S+VEEY NLFDK 
Subjt:  MPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPALNWYRSQEERDKFTCWLNLKERLLIRFRSSREGSLYGRFLRIQQKSSVEEYRNLFDKW

Query:  VAPLSDIPEKIVEETFMGGLLPWIKVETEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGTKFSSSSYPTTKTYGGVKEQGNKENTVFPIRTITLRA
        VAP++D+PE+++++TFM GLL W++ E  F  P  LAEMM  AQMVE REI R EA + GYS  K +  +    KT  G     NK NTVFPIRTITLR+
Subjt:  VAPLSDIPEKIVEETFMGGLLPWIKVETEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGTKFSSSSYPTTKTYGGVKEQGNKENTVFPIRTITLRA

Query:  SPAKEVRKEGPSRRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAREIRELRMFVVRADDVEEEIIEEDEYNLEDLKAMELQHEPGEVVELCINSVVGLTN
        S   E R+EG  +RLSDAEFQA++EKGLCF+C+E+Y + HKC+ RE RELRMFVV A+  E EI+EE++   E L  +E+  +   VVEL INSVVGL +
Subjt:  SPAKEVRKEGPSRRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAREIRELRMFVVRADDVEEEIIEEDEYNLEDLKAMELQHEPGEVVELCINSVVGLTN

Query:  PGTMKIRGTIQSMEVVVLVDCGATHNFISDRLVKTLKITTKDTANYGVILGSGTAIKGKGVCEKVELNLSGWTVVENFLPLELGGVDLILGMQWLHSLGV
        PGTMK+RG +   EV++L+DCGATHNF+S++LVK L +  K+T++YGVILGSG A++GKG+CEK+E+ LS W +VE+FLPLELGGVD+ILGMQWL+SLGV
Subjt:  PGTMKIRGTIQSMEVVVLVDCGATHNFISDRLVKTLKITTKDTANYGVILGSGTAIKGKGVCEKVELNLSGWTVVENFLPLELGGVDLILGMQWLHSLGV

Query:  TEMDWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLK
        T +DW+NL+++F  + K+V +KGDPSLTK ++SLK
Subjt:  TEMDWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G67020.1 unknown protein2.8e-1039.13Show/hide
Query:  MPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPALNWYRSQEERDKFTCWLNLKERLLIRF
        MPVFDG     W  + ER+F++ +  DS+KL +  +S EG AL W+  +    +F  W + ++RLL RF
Subjt:  MPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPALNWYRSQEERDKFTCWLNLKERLLIRF

AT3G29750.1 Eukaryotic aspartyl protease family protein8.9e-1728.85Show/hide
Query:  YGGVKEQGNKEN-----TVFPIRTITLRASPAKEVRKEGPSRRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAREIRELRMFVVRADDV--------EEE
        Y G++++G+  +         +R++TL     +E+  +G    L  A  + K   G+         + ++ +  E+  L +   + D V        E E
Subjt:  YGGVKEQGNKEN-----TVFPIRTITLRASPAKEVRKEGPSRRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAREIRELRMFVVRADDV--------EEE

Query:  IIEEDEYNLEDLKAMELQHEPGEVVELCINSVVGLTNPGTMKIRGTIQSMEVVVLVDCGATHNFISDRLVKTLKITTKDTANYGVILGSGTAIKGKGVCE
         +E+D Y L   + ME               V+ LT    M+  G I   +VVV +D GAT NFI   L  +LK+ T  T    V+LG    I+  G C 
Subjt:  IIEEDEYNLEDLKAMELQHEPGEVVELCINSVVGLTNPGTMKIRGTIQSMEVVVLVDCGATHNFISDRLVKTLKITTKDTANYGVILGSGTAIKGKGVCE

Query:  KVELNLSGWTVVENFLPLELG--GVDLILGMQWLHSLGVTEMDWRNLTMSFFHNSKKVVL
         + L +    + ENFL L+L    VD+ILG +WL  LG T ++W+N   SF HN + + L
Subjt:  KVELNLSGWTVVENFLPLELG--GVDLILGMQWLHSLGVTEMDWRNLTMSFFHNSKKVVL

AT3G30770.1 Eukaryotic aspartyl protease family protein4.1e-1438.46Show/hide
Query:  SVVGLTNPGTMKIRGTIQSMEVVVLVDCGATHNFISDRLVKTLKITTKDTANYGVILGSGTAIKGKGVCEKVELNLSGWTVVENFLPLEL--GGVDLILG
        S    T    M+  G I   +VVV++D GAT+NFISD L   LK+ T  T    V+LG    I+  G C  + L +    + ENFL L+L    VD+ILG
Subjt:  SVVGLTNPGTMKIRGTIQSMEVVVLVDCGATHNFISDRLVKTLKITTKDTANYGVILGSGTAIKGKGVCEKVELNLSGWTVVENFLPLEL--GGVDLILG

Query:  MQWLHSLGVTEMDWRNLTMSFFHNSKKVVL
             +L    + W N   SFFHN + V L
Subjt:  MQWLHSLGVTEMDWRNLTMSFFHNSKKVVL

AT3G42723.1 aminoacyl-tRNA ligases;ATP binding;nucleotide binding3.3e-1120.59Show/hide
Query:  ERYFQIHKLTDSEKLTVATISFEGPALNWYRSQEERDKFTCWLNLKERLLIRFRSSREGSLYGRFLRIQQKSSVEEYRNLFDKWVAPLSDIPEKIVEETF
        E YF  + + + E+L +   + EG    W +   +++  T W   K  +    +++ + +    +  IQQ+ SV EYR  F+        +P + +E  F
Subjt:  ERYFQIHKLTDSEKLTVATISFEGPALNWYRSQEERDKFTCWLNLKERLLIRFRSSREGSLYGRFLRIQQKSSVEEYRNLFDKWVAPLSDIPEKIVEETF

Query:  MGGLLPWIKVETEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGTKFSSSSYPTTKTYGGVKEQGNKENTVFPIRTITLRASPAKEVRKEGPSRRLS
        + GL P ++       P G+ +MM  AQ +E+   L                        YG                                      
Subjt:  MGGLLPWIKVETEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGTKFSSSSYPTTKTYGGVKEQGNKENTVFPIRTITLRASPAKEVRKEGPSRRLS

Query:  DAEFQAKREKGLCFKCDEKYYSGHKCKAREIRELRMFVVRADDVEEEIIEEDEYNLEDLKAMELQHE-PGEVVELCINSVVGLTNPGTMKIRGTIQSMEV
                  GL  + + K Y   + + R        +V    + E++ +      ED   ++ +HE PG  V  C            M+  G I   EV
Subjt:  DAEFQAKREKGLCFKCDEKYYSGHKCKAREIRELRMFVVRADDVEEEIIEEDEYNLEDLKAMELQHE-PGEVVELCINSVVGLTNPGTMKIRGTIQSMEV

Query:  VVLVDCGATHNFISDRLVKTLKITTKDTANYGVILGSGTAIKGKGVCEKVELNLSGWTVVENFL--PLELGGVDLILGMQWLHSLGVTEMDWRNLTMSFF
                     S RL   +K +                      C+++ L ++   +VE++    L+   VD+ILG +WL  LG TE++W+N + SF 
Subjt:  VVLVDCGATHNFISDRLVKTLKITTKDTANYGVILGSGTAIKGKGVCEKVELNLSGWTVVENFL--PLELGGVDLILGMQWLHSLGVTEMDWRNLTMSFF

Query:  HNSKKVVL
        HN   V L
Subjt:  HNSKKVVL

AT3G44713.1 unknown protein1.7e-0429.33Show/hide
Query:  PVFDG--DDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPALNWYRSQEERDKFTCWLNLKERLLIRFRSSRE
        P F+G   +  SW+   E +F     TD EK+ +A    EG A  W+  +++   F  W +L++ L++RF   ++
Subjt:  PVFDG--DDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPALNWYRSQEERDKFTCWLNLKERLLIRFRSSRE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGGTATTCGATGGAGATGACCCCGACTCGTGGCTGTTCCGCGCAGAGAGGTACTTTCAAATACACAAACTAACGGATTCTGAAAAACTGACGGTCGCCACTATCAG
CTTTGAAGGACCCGCACTCAACTGGTACCGATCACAGGAGGAGAGAGACAAGTTCACCTGCTGGTTAAACTTAAAGGAGCGATTATTGATCCGATTCCGATCATCCCGCG
AAGGTTCCTTGTATGGCCGGTTCTTGCGTATCCAACAGAAGTCAAGCGTAGAGGAGTATAGGAATCTCTTCGATAAATGGGTGGCCCCATTATCGGATATTCCGGAGAAG
ATCGTGGAAGAAACATTCATGGGAGGGTTGTTACCGTGGATTAAGGTAGAAACGGAATTCTGCAATCCCGTGGGATTAGCCGAGATGATGAGATACGCGCAAATGGTAGA
ACAACGGGAGATCCTGAGGAGGGAAGCAAACTTACCAGGATATTCTGGAACGAAATTCTCGAGTAGCAGCTATCCTACAACCAAAACATACGGTGGGGTAAAGGAACAAG
GGAATAAAGAGAACACAGTCTTCCCAATCAGAACGATTACACTGAGAGCTTCGCCGGCGAAGGAGGTTAGGAAAGAAGGACCCTCGAGACGACTATCGGACGCGGAATTC
CAGGCCAAGAGAGAGAAGGGACTCTGTTTTAAATGTGATGAGAAGTATTACTCCGGGCACAAGTGCAAGGCAAGGGAGATTCGCGAGTTACGAATGTTTGTGGTCAGGGC
TGACGATGTAGAAGAAGAAATCATTGAGGAAGATGAATACAATTTAGAAGACTTGAAGGCCATGGAGTTGCAGCACGAACCGGGAGAAGTAGTCGAGTTATGTATCAACT
CAGTAGTGGGATTGACAAATCCGGGAACCATGAAGATAAGAGGCACAATCCAAAGCATGGAAGTTGTCGTGCTGGTCGACTGTGGAGCAACCCACAACTTCATATCTGAT
CGGCTAGTCAAGACGTTGAAGATAACCACAAAAGACACTGCCAATTATGGAGTAATACTGGGGTCCGGAACAGCCATCAAAGGCAAGGGAGTGTGTGAAAAAGTGGAGCT
GAACCTCAGTGGGTGGACAGTAGTAGAAAATTTCCTACCACTGGAACTCGGAGGAGTAGATTTGATCTTGGGAATGCAATGGTTACATTCCTTGGGAGTGACAGAAATGG
ATTGGAGGAACCTAACCATGTCCTTTTTTCATAACAGTAAAAAGGTGGTGCTAAAAGGAGATCCGAGTTTAACAAAAACTCAAGTGAGTCTAAAAAAACCTCACTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCGGTATTCGATGGAGATGACCCCGACTCGTGGCTGTTCCGCGCAGAGAGGTACTTTCAAATACACAAACTAACGGATTCTGAAAAACTGACGGTCGCCACTATCAG
CTTTGAAGGACCCGCACTCAACTGGTACCGATCACAGGAGGAGAGAGACAAGTTCACCTGCTGGTTAAACTTAAAGGAGCGATTATTGATCCGATTCCGATCATCCCGCG
AAGGTTCCTTGTATGGCCGGTTCTTGCGTATCCAACAGAAGTCAAGCGTAGAGGAGTATAGGAATCTCTTCGATAAATGGGTGGCCCCATTATCGGATATTCCGGAGAAG
ATCGTGGAAGAAACATTCATGGGAGGGTTGTTACCGTGGATTAAGGTAGAAACGGAATTCTGCAATCCCGTGGGATTAGCCGAGATGATGAGATACGCGCAAATGGTAGA
ACAACGGGAGATCCTGAGGAGGGAAGCAAACTTACCAGGATATTCTGGAACGAAATTCTCGAGTAGCAGCTATCCTACAACCAAAACATACGGTGGGGTAAAGGAACAAG
GGAATAAAGAGAACACAGTCTTCCCAATCAGAACGATTACACTGAGAGCTTCGCCGGCGAAGGAGGTTAGGAAAGAAGGACCCTCGAGACGACTATCGGACGCGGAATTC
CAGGCCAAGAGAGAGAAGGGACTCTGTTTTAAATGTGATGAGAAGTATTACTCCGGGCACAAGTGCAAGGCAAGGGAGATTCGCGAGTTACGAATGTTTGTGGTCAGGGC
TGACGATGTAGAAGAAGAAATCATTGAGGAAGATGAATACAATTTAGAAGACTTGAAGGCCATGGAGTTGCAGCACGAACCGGGAGAAGTAGTCGAGTTATGTATCAACT
CAGTAGTGGGATTGACAAATCCGGGAACCATGAAGATAAGAGGCACAATCCAAAGCATGGAAGTTGTCGTGCTGGTCGACTGTGGAGCAACCCACAACTTCATATCTGAT
CGGCTAGTCAAGACGTTGAAGATAACCACAAAAGACACTGCCAATTATGGAGTAATACTGGGGTCCGGAACAGCCATCAAAGGCAAGGGAGTGTGTGAAAAAGTGGAGCT
GAACCTCAGTGGGTGGACAGTAGTAGAAAATTTCCTACCACTGGAACTCGGAGGAGTAGATTTGATCTTGGGAATGCAATGGTTACATTCCTTGGGAGTGACAGAAATGG
ATTGGAGGAACCTAACCATGTCCTTTTTTCATAACAGTAAAAAGGTGGTGCTAAAAGGAGATCCGAGTTTAACAAAAACTCAAGTGAGTCTAAAAAAACCTCACTAA
Protein sequenceShow/hide protein sequence
MPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPALNWYRSQEERDKFTCWLNLKERLLIRFRSSREGSLYGRFLRIQQKSSVEEYRNLFDKWVAPLSDIPEK
IVEETFMGGLLPWIKVETEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGTKFSSSSYPTTKTYGGVKEQGNKENTVFPIRTITLRASPAKEVRKEGPSRRLSDAEF
QAKREKGLCFKCDEKYYSGHKCKAREIRELRMFVVRADDVEEEIIEEDEYNLEDLKAMELQHEPGEVVELCINSVVGLTNPGTMKIRGTIQSMEVVVLVDCGATHNFISD
RLVKTLKITTKDTANYGVILGSGTAIKGKGVCEKVELNLSGWTVVENFLPLELGGVDLILGMQWLHSLGVTEMDWRNLTMSFFHNSKKVVLKGDPSLTKTQVSLKKPH