CuGenDBv2

CSPI01G20970 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI01G20970
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Ty3/gypsy retrotransposon protein
Genome location: Chr1:16519535..16521095
RNA-Seq Expression: CSPI01G20970
Synteny: CSPI01G20970
Gene Ontology terms: GO:0044237 - cellular metabolic process (biological process)
InterPro domains: IPR021109 - Aspartic peptidase domain superfamily
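
The genome location in this record is written as `chromosome:start..end`. A minimal sketch of parsing it follows, assuming the coordinates are 1-based and inclusive (the page does not state the convention). The function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split a string such as 'Chr1:16519535..16521095' into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("Chr1:16519535..16521095")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 1561 bp on Chr1.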


Homology
GenBank top hits (e value, % identity, alignment)
KAA0062868.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] (e value: 4.6e-133; % identity: 52.17)
Query:  MVQTRSEERRDTHEQ-------ELNKIPVMEEKLTVMSQNMENLQAQVEKTHQMVMIFMERMAKERVLASGKQIDSSAQETWTGKSAEGESSASKETKND
        MVQTR+EER ++ EQ       EL K+PV+E  L  +++NME ++ Q EK  Q ++ +ME  AKER +A  +  +S  Q + T KS   ++S+S++ + +
Subjt:  MVQTRSEERRDTHEQ-------ELNKIPVMEEKLTVMSQNMENLQAQVEKTHQMVMIFMERMAKERVLASGKQIDSSAQETWTGKSAEGESSASKETKND

Query:  TMEKKGDGDGDNNDRNKFNKVEMSVFNGDDPDSWLFRADRYFQIHKLTNSE--------------------------------NSRSLQSVLKAPHSTGI
           KK + D ++NDR+KF KVEM VF G+DP+SWLFRA+RYFQIHKLT SE                                  R L         T  
Subjt:  TMEKKGDGDGDNNDRNKFNKVEMSVFNGDDPDSWLFRADRYFQIHKLTNSE--------------------------------NSRSLQSVLKAPHSTGI

Query:  GRRRRETDLPQESSVEEYRNLFDKWVAPLSDIPKKIVEETFMGGLLPWIKVEMEFCIPVGLAEMMRYAQMVEHREILRREANLPGYSGAKVPNYPYNTAK
        GR  R   + QE++VEEYRNLFDK VAPL D+  ++VEETFM GL PWI+ E+  C P GLAEMMR AQ+VE REILR  ANL GY G K         K
Subjt:  GRRRRETDLPQESSVEEYRNLFDKWVAPLSDIPKKIVEETFMGGLLPWIKVEMEFCIPVGLAEMMRYAQMVEHREILRREANLPGYSGAKVPNYPYNTAK

Query:  TNSIIKEQGNKENTVFSIRTITLKGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSRHKCRVKEIRELCMFVVRADDVEEEIIEEDEYDLKEL
             + + NK N  F IRTITLK   + E +KEG SKRL DAEFQ +REKGLCFKC+EKY + HKC+++E REL MFVV+ ++ E EI+EE E D  EL
Subjt:  TNSIIKEQGNKENTVFSIRTITLKGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSRHKCRVKEIRELCMFVVRADDVEEEIIEEDEYDLKEL

Query:  KTIELQNDLGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGAIHNFISDRLVMTLKLPTKDTSNYGVILGSGTAIKGKGVCEKVKLDLNGWTVL
        +T+E+Q      VEL INSVVGL +PGTMK+RGT+Q KEVV+L+DCGA HNF+S++LV TL+LP K+T++YGVILGSGTAI+GKG+CE +++ +  WTV 
Subjt:  KTIELQNDLGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGAIHNFISDRLVMTLKLPTKDTSNYGVILGSGTAIKGKGVCEKVKLDLNGWTVL

Query:  ENFLPLELGGVDVILGMQWLHSLGVTEMD
        E+FLPLELGGVDVILGMQWL+SLGVT  D
Subjt:  ENFLPLELGGVDVILGMQWLHSLGVTEMD

KAA0068193.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] (e value: 1.0e-132; % identity: 51.98)
Query:  MVQTRSEERRDTHEQ-------ELNKIPVMEEKLTVMSQNMENLQAQVEKTHQMVMIFMERMAKERVLASGKQIDSSAQETWTGKSAEGESSASKETKND
        MVQTR EER ++ EQ       EL K+PV+E  L  +++NME ++ Q EK  Q ++ +ME  AKER +A  +  +S  Q + T KS   ++S+S++ + +
Subjt:  MVQTRSEERRDTHEQ-------ELNKIPVMEEKLTVMSQNMENLQAQVEKTHQMVMIFMERMAKERVLASGKQIDSSAQETWTGKSAEGESSASKETKND

Query:  TMEKKGDGDGDNNDRNKFNKVEMSVFNGDDPDSWLFRADRYFQIHKLTNSE--------------------------------NSRSLQSVLKAPHSTGI
           KK + D ++NDR+KF KVEM VF G+DP+SWLFRA+RYFQIHKLT SE                                  R L         T  
Subjt:  TMEKKGDGDGDNNDRNKFNKVEMSVFNGDDPDSWLFRADRYFQIHKLTNSE--------------------------------NSRSLQSVLKAPHSTGI

Query:  GRRRRETDLPQESSVEEYRNLFDKWVAPLSDIPKKIVEETFMGGLLPWIKVEMEFCIPVGLAEMMRYAQMVEHREILRREANLPGYSGAKVPNYPYNTAK
        GR  R   + QE++VEEYRNLFDK VAPLSD+  ++VEETFM GL PWI+ E+  C P GLAEMMR AQ+VE RE+LR  ANL GY G K         K
Subjt:  GRRRRETDLPQESSVEEYRNLFDKWVAPLSDIPKKIVEETFMGGLLPWIKVEMEFCIPVGLAEMMRYAQMVEHREILRREANLPGYSGAKVPNYPYNTAK

Query:  TNSIIKEQGNKENTVFSIRTITLKGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSRHKCRVKEIRELCMFVVRADDVEEEIIEEDEYDLKEL
             + + NK N  F IRTITLK   + E +KEG SKRL DAEFQ +REKGLCFKC+EKY + HKC+++E REL MFVV+ ++ E EI+EE E D  EL
Subjt:  TNSIIKEQGNKENTVFSIRTITLKGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSRHKCRVKEIRELCMFVVRADDVEEEIIEEDEYDLKEL

Query:  KTIELQNDLGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGAIHNFISDRLVMTLKLPTKDTSNYGVILGSGTAIKGKGVCEKVKLDLNGWTVL
        +T+E++      VEL INSVVGL +PGTMK+RGT+Q KEVV+L+DCGA HNF+S++LV TL+LP K+T++YGVILGSGTAI+GKG+CE +++ +  WTV 
Subjt:  KTIELQNDLGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGAIHNFISDRLVMTLKLPTKDTSNYGVILGSGTAIKGKGVCEKVKLDLNGWTVL

Query:  ENFLPLELGGVDVILGMQWLHSLGVTEMD
        E+FLPLELGGVDVILGMQWL+SLGVT  D
Subjt:  ENFLPLELGGVDVILGMQWLHSLGVTEMD

TYJ96875.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] (e value: 4.6e-133; % identity: 51.98)
Query:  MVQTRSEERRDTHEQ-------ELNKIPVMEEKLTVMSQNMENLQAQVEKTHQMVMIFMERMAKERVLASGKQIDSSAQETWTGKSAEGESSASKETKND
        MVQTR EER ++ EQ       EL K+PV+E  L  +++NME ++ Q EK  Q ++ +ME  AKER +A  +  +S  Q + T KS   ++S+S++ + +
Subjt:  MVQTRSEERRDTHEQ-------ELNKIPVMEEKLTVMSQNMENLQAQVEKTHQMVMIFMERMAKERVLASGKQIDSSAQETWTGKSAEGESSASKETKND

Query:  TMEKKGDGDGDNNDRNKFNKVEMSVFNGDDPDSWLFRADRYFQIHKLTNSE--------------------------------NSRSLQSVLKAPHSTGI
           KK + D ++NDR+KF KVEM VF G+DP+SWLFRA+RYFQIHKLT SE                                  R L         T  
Subjt:  TMEKKGDGDGDNNDRNKFNKVEMSVFNGDDPDSWLFRADRYFQIHKLTNSE--------------------------------NSRSLQSVLKAPHSTGI

Query:  GRRRRETDLPQESSVEEYRNLFDKWVAPLSDIPKKIVEETFMGGLLPWIKVEMEFCIPVGLAEMMRYAQMVEHREILRREANLPGYSGAKVPNYPYNTAK
        GR  R   + QE++VEEYRNLFDK VAPLSD+  ++VEETFM GL PWI+ E+  C P GLAEMMR AQ+VE RE+LR  ANL GY G K         K
Subjt:  GRRRRETDLPQESSVEEYRNLFDKWVAPLSDIPKKIVEETFMGGLLPWIKVEMEFCIPVGLAEMMRYAQMVEHREILRREANLPGYSGAKVPNYPYNTAK

Query:  TNSIIKEQGNKENTVFSIRTITLKGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSRHKCRVKEIRELCMFVVRADDVEEEIIEEDEYDLKEL
             + + NK N  F IRTITLK   + E +KEG SKRL DAEFQ +REKGLCFKC+EKY + HKC+++E REL MFVV+ ++ E EI+EE E D  EL
Subjt:  TNSIIKEQGNKENTVFSIRTITLKGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSRHKCRVKEIRELCMFVVRADDVEEEIIEEDEYDLKEL

Query:  KTIELQNDLGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGAIHNFISDRLVMTLKLPTKDTSNYGVILGSGTAIKGKGVCEKVKLDLNGWTVL
        +T+E++      VEL INSVVGL +PGTMK+RGT+Q KEVV+L+DCGA HNF+S++LV TL+LP K+T++YGVILGSGTAI+GKG+CE +++ +  WTV 
Subjt:  KTIELQNDLGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGAIHNFISDRLVMTLKLPTKDTSNYGVILGSGTAIKGKGVCEKVKLDLNGWTVL

Query:  ENFLPLELGGVDVILGMQWLHSLGVTEMD
        E+FLPLELGGVDVILGMQWL+SLGVT  D
Subjt:  ENFLPLELGGVDVILGMQWLHSLGVTEMD

TYK21115.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa] (e value: 4.6e-133; % identity: 51.98)
Query:  MVQTRSEERRDTHEQ-------ELNKIPVMEEKLTVMSQNMENLQAQVEKTHQMVMIFMERMAKERVLASGKQIDSSAQETWTGKSAEGESSASKETKND
        MVQTR EER ++ EQ       EL K+PV+E  L  +++NME ++ Q EK  Q ++ +ME  AKER +A  +  +S  Q + T KS   ++S+S++ + +
Subjt:  MVQTRSEERRDTHEQ-------ELNKIPVMEEKLTVMSQNMENLQAQVEKTHQMVMIFMERMAKERVLASGKQIDSSAQETWTGKSAEGESSASKETKND

Query:  TMEKKGDGDGDNNDRNKFNKVEMSVFNGDDPDSWLFRADRYFQIHKLTNSE--------------------------------NSRSLQSVLKAPHSTGI
           KK + D ++NDR+KF KVEM VF G+DP+SWLFRA+RYFQIHKLT SE                                  R L         T  
Subjt:  TMEKKGDGDGDNNDRNKFNKVEMSVFNGDDPDSWLFRADRYFQIHKLTNSE--------------------------------NSRSLQSVLKAPHSTGI

Query:  GRRRRETDLPQESSVEEYRNLFDKWVAPLSDIPKKIVEETFMGGLLPWIKVEMEFCIPVGLAEMMRYAQMVEHREILRREANLPGYSGAKVPNYPYNTAK
        GR  R   + QE++VEEYRNLFDK VAPLSD+  ++VEETFM GL PWI+ E+  C P GLAEMMR AQ+VE RE+LR  ANL GY G K         K
Subjt:  GRRRRETDLPQESSVEEYRNLFDKWVAPLSDIPKKIVEETFMGGLLPWIKVEMEFCIPVGLAEMMRYAQMVEHREILRREANLPGYSGAKVPNYPYNTAK

Query:  TNSIIKEQGNKENTVFSIRTITLKGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSRHKCRVKEIRELCMFVVRADDVEEEIIEEDEYDLKEL
             + + NK N  F IRTITLK   + E +KEG SKRL DAEFQ +REKGLCFKC+EKY + HKC+++E REL MFVV+ ++ E EI+EE E D  EL
Subjt:  TNSIIKEQGNKENTVFSIRTITLKGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSRHKCRVKEIRELCMFVVRADDVEEEIIEEDEYDLKEL

Query:  KTIELQNDLGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGAIHNFISDRLVMTLKLPTKDTSNYGVILGSGTAIKGKGVCEKVKLDLNGWTVL
        +T+E++      VEL INSVVGL +PGTMK+RGT+Q KEVV+L+DCGA HNF+S++LV TL+LP K+T++YGVILGSGTAI+GKG+CE +++ +  WTV 
Subjt:  KTIELQNDLGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGAIHNFISDRLVMTLKLPTKDTSNYGVILGSGTAIKGKGVCEKVKLDLNGWTVL

Query:  ENFLPLELGGVDVILGMQWLHSLGVTEMD
        E+FLPLELGGVDVILGMQWL+SLGVT  D
Subjt:  ENFLPLELGGVDVILGMQWLHSLGVTEMD

XP_031745972.1 uncharacterized protein LOC116406393 [Cucumis sativus] (e value: 1.5e-136; % identity: 51.99)
Query:  MVQTRSEERRDTHEQEL-------NKIPVMEEKLTVMSQNMENLQAQVEKTHQMVMIFMERMAKERVLASGKQIDSSAQETWTGKSAEGESSASKETKND
        MV TRSEER + ++QE+       +KIP +EE LT +S        Q EKTHQM+MI M+ +AKER   S K  D  AQET   KS EGESS S+  +N+
Subjt:  MVQTRSEERRDTHEQEL-------NKIPVMEEKLTVMSQNMENLQAQVEKTHQMVMIFMERMAKERVLASGKQIDSSAQETWTGKSAEGESSASKETKND

Query:  TMEKKGDGDGDNNDRNKFNKVEMSVFNGDDPDSWLFRADRYFQIHKLTNSENSRSLQSVLKAPHSTGIGRRRRETD------------------------
        T E++ + +   N+R+KF KVEM VFNG+DPDSWLFRADRYFQIHKL+++E         + P +    R + E D                        
Subjt:  TMEKKGDGDGDNNDRNKFNKVEMSVFNGDDPDSWLFRADRYFQIHKLTNSENSRSLQSVLKAPHSTGIGRRRRETD------------------------

Query:  ------LPQESSVEEYRNLFDKWVAPLSDIPKKIVEETFMGGLLPWIKVEMEFCIPVGLAEMMRYAQMVEHREILRREANLPGYSGAKVPNYPYNTAKTN
              + Q+++VEEYRN FD+ +APL+D+  ++VEETFM GL PWIK E+ FC PVGLAEMM  AQ+VE+REI+R+EANL GY+  K P    +  +++
Subjt:  ------LPQESSVEEYRNLFDKWVAPLSDIPKKIVEETFMGGLLPWIKVEMEFCIPVGLAEMMRYAQMVEHREILRREANLPGYSGAKVPNYPYNTAKTN

Query:  SIIKEQGNKENTVFSIRTITLKGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSRHKCRVKEIRELCMFVVRADDVEEEIIEEDEYDLKELKT
        + +    +K NT+F IRT+TL+ +   E+KKEGP+KRL DAEFQA++EKGLCF+C+EKY+  H+C+ +E REL M+VV+ D+ E EI+EE E+D  EL  
Subjt:  SIIKEQGNKENTVFSIRTITLKGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSRHKCRVKEIRELCMFVVRADDVEEEIIEEDEYDLKELKT

Query:  IELQNDLGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGAIHNFISDRLVMTLKLPTKDTSNYGVILGSGTAIKGKGVCEKVKLDLNGWTVLEN
        +E+  +   +VEL INSVVGLTNPGTMK+RG I+ +EV++L+DCGA HNFISD++V  L LPTK TS+YGVILGS  A+KGKG+CE ++L+L GW V  N
Subjt:  IELQNDLGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGAIHNFISDRLVMTLKLPTKDTSNYGVILGSGTAIKGKGVCEKVKLDLNGWTVLEN

Query:  FLPLELGGVDVILGMQWLHSLGVTEMD
        FLPLELGGVD +L MQWL+SLGVTE+D
Subjt:  FLPLELGGVDVILGMQWLHSLGVTEMD

TrEMBL top hits (e value, % identity, alignment)
A0A5A7V5H5 Ty3/gypsy retrotransposon protein (e value: 2.2e-133; % identity: 52.17)
Query:  MVQTRSEERRDTHEQ-------ELNKIPVMEEKLTVMSQNMENLQAQVEKTHQMVMIFMERMAKERVLASGKQIDSSAQETWTGKSAEGESSASKETKND
        MVQTR+EER ++ EQ       EL K+PV+E  L  +++NME ++ Q EK  Q ++ +ME  AKER +A  +  +S  Q + T KS   ++S+S++ + +
Subjt:  MVQTRSEERRDTHEQ-------ELNKIPVMEEKLTVMSQNMENLQAQVEKTHQMVMIFMERMAKERVLASGKQIDSSAQETWTGKSAEGESSASKETKND

Query:  TMEKKGDGDGDNNDRNKFNKVEMSVFNGDDPDSWLFRADRYFQIHKLTNSE--------------------------------NSRSLQSVLKAPHSTGI
           KK + D ++NDR+KF KVEM VF G+DP+SWLFRA+RYFQIHKLT SE                                  R L         T  
Subjt:  TMEKKGDGDGDNNDRNKFNKVEMSVFNGDDPDSWLFRADRYFQIHKLTNSE--------------------------------NSRSLQSVLKAPHSTGI

Query:  GRRRRETDLPQESSVEEYRNLFDKWVAPLSDIPKKIVEETFMGGLLPWIKVEMEFCIPVGLAEMMRYAQMVEHREILRREANLPGYSGAKVPNYPYNTAK
        GR  R   + QE++VEEYRNLFDK VAPL D+  ++VEETFM GL PWI+ E+  C P GLAEMMR AQ+VE REILR  ANL GY G K         K
Subjt:  GRRRRETDLPQESSVEEYRNLFDKWVAPLSDIPKKIVEETFMGGLLPWIKVEMEFCIPVGLAEMMRYAQMVEHREILRREANLPGYSGAKVPNYPYNTAK

Query:  TNSIIKEQGNKENTVFSIRTITLKGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSRHKCRVKEIRELCMFVVRADDVEEEIIEEDEYDLKEL
             + + NK N  F IRTITLK   + E +KEG SKRL DAEFQ +REKGLCFKC+EKY + HKC+++E REL MFVV+ ++ E EI+EE E D  EL
Subjt:  TNSIIKEQGNKENTVFSIRTITLKGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSRHKCRVKEIRELCMFVVRADDVEEEIIEEDEYDLKEL

Query:  KTIELQNDLGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGAIHNFISDRLVMTLKLPTKDTSNYGVILGSGTAIKGKGVCEKVKLDLNGWTVL
        +T+E+Q      VEL INSVVGL +PGTMK+RGT+Q KEVV+L+DCGA HNF+S++LV TL+LP K+T++YGVILGSGTAI+GKG+CE +++ +  WTV 
Subjt:  KTIELQNDLGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGAIHNFISDRLVMTLKLPTKDTSNYGVILGSGTAIKGKGVCEKVKLDLNGWTVL

Query:  ENFLPLELGGVDVILGMQWLHSLGVTEMD
        E+FLPLELGGVDVILGMQWL+SLGVT  D
Subjt:  ENFLPLELGGVDVILGMQWLHSLGVTEMD

A0A5A7VAR4 Ty3/gypsy retrotransposon protein (e value: 9.7e-129; % identity: 51.14)
Query:  MVQTRSEERRDTHEQ-------ELNKIPVMEEKLTVMSQNMENLQAQVEKTHQMVMIFMERMAKERVLASGKQIDSSAQETWTGKSAEGESSASKETKND
        MVQTR EER +  EQ       EL K+P +E  L  +++NME ++ Q EK  Q ++ +ME  AKER +   +  +S  Q + T KS  G++S+S +    
Subjt:  MVQTRSEERRDTHEQ-------ELNKIPVMEEKLTVMSQNMENLQAQVEKTHQMVMIFMERMAKERVLASGKQIDSSAQETWTGKSAEGESSASKETKND

Query:  TMEKKGDGDGDNNDRNKFNKVEMSVFNGDDPDSWLFRADRYFQIHKLTNSENS---------------RSLQSVLKAPHSTGIGRR--------------
        + EKK D D + NDR+KF KVEM VF G+DP+SWLFRA+RYFQIHKLT SE                 RS +   K    T +  R              
Subjt:  TMEKKGDGDGDNNDRNKFNKVEMSVFNGDDPDSWLFRADRYFQIHKLTNSENS---------------RSLQSVLKAPHSTGIGRR--------------

Query:  RRETDLPQESSVEEYRNLFDKWVAPLSDIPKKIVEETFMGGLLPWIKVEMEFCIPVGLAEMMRYAQMVEHREILRREANLPGYSGAKVPNYPYNTAKTNS
         R   + QE++VEEYRN FDK VAPLSD+  ++VEETFM GL PWI+ E+  C P GLAE M  AQ+VE REILR  ANL  Y G K         K + 
Subjt:  RRETDLPQESSVEEYRNLFDKWVAPLSDIPKKIVEETFMGGLLPWIKVEMEFCIPVGLAEMMRYAQMVEHREILRREANLPGYSGAKVPNYPYNTAKTNS

Query:  IIKEQGNKENTVFSIRTITLKGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSRHKCRVKEIRELCMFVVRADDVEEEIIEEDEYDLKELKTI
          + + +K N  F IRTITLK     EI+KEG SKRL DAEFQ ++EKGLCFKC+EKY + HKC++KE REL MFVV+ D+ E EI+EE E +  E++  
Subjt:  IIKEQGNKENTVFSIRTITLKGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSRHKCRVKEIRELCMFVVRADDVEEEIIEEDEYDLKELKTI

Query:  ELQNDLGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGAIHNFISDRLVMTLKLPTKDTSNYGVILGSGTAIKGKGVCEKVKLDLNGWTVLENF
        E+Q      VEL INSVVGL +PGTMK++G++Q KEVV+L+DCGA HNF+S+++V +L+LP K+T++YGVILGSGTAI+GKG+CE V++ +  WTV E+F
Subjt:  ELQNDLGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGAIHNFISDRLVMTLKLPTKDTSNYGVILGSGTAIKGKGVCEKVKLDLNGWTVLENF

Query:  LPLELGGVDVILGMQWLHSLGVTEMD
        LPLELGGVDVILGMQWL+SLGVT  D
Subjt:  LPLELGGVDVILGMQWLHSLGVTEMD

A0A5A7VJA0 Ty3/gypsy retrotransposon protein (e value: 5.0e-133; % identity: 51.98)
Query:  MVQTRSEERRDTHEQ-------ELNKIPVMEEKLTVMSQNMENLQAQVEKTHQMVMIFMERMAKERVLASGKQIDSSAQETWTGKSAEGESSASKETKND
        MVQTR EER ++ EQ       EL K+PV+E  L  +++NME ++ Q EK  Q ++ +ME  AKER +A  +  +S  Q + T KS   ++S+S++ + +
Subjt:  MVQTRSEERRDTHEQ-------ELNKIPVMEEKLTVMSQNMENLQAQVEKTHQMVMIFMERMAKERVLASGKQIDSSAQETWTGKSAEGESSASKETKND

Query:  TMEKKGDGDGDNNDRNKFNKVEMSVFNGDDPDSWLFRADRYFQIHKLTNSE--------------------------------NSRSLQSVLKAPHSTGI
           KK + D ++NDR+KF KVEM VF G+DP+SWLFRA+RYFQIHKLT SE                                  R L         T  
Subjt:  TMEKKGDGDGDNNDRNKFNKVEMSVFNGDDPDSWLFRADRYFQIHKLTNSE--------------------------------NSRSLQSVLKAPHSTGI

Query:  GRRRRETDLPQESSVEEYRNLFDKWVAPLSDIPKKIVEETFMGGLLPWIKVEMEFCIPVGLAEMMRYAQMVEHREILRREANLPGYSGAKVPNYPYNTAK
        GR  R   + QE++VEEYRNLFDK VAPLSD+  ++VEETFM GL PWI+ E+  C P GLAEMMR AQ+VE RE+LR  ANL GY G K         K
Subjt:  GRRRRETDLPQESSVEEYRNLFDKWVAPLSDIPKKIVEETFMGGLLPWIKVEMEFCIPVGLAEMMRYAQMVEHREILRREANLPGYSGAKVPNYPYNTAK

Query:  TNSIIKEQGNKENTVFSIRTITLKGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSRHKCRVKEIRELCMFVVRADDVEEEIIEEDEYDLKEL
             + + NK N  F IRTITLK   + E +KEG SKRL DAEFQ +REKGLCFKC+EKY + HKC+++E REL MFVV+ ++ E EI+EE E D  EL
Subjt:  TNSIIKEQGNKENTVFSIRTITLKGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSRHKCRVKEIRELCMFVVRADDVEEEIIEEDEYDLKEL

Query:  KTIELQNDLGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGAIHNFISDRLVMTLKLPTKDTSNYGVILGSGTAIKGKGVCEKVKLDLNGWTVL
        +T+E++      VEL INSVVGL +PGTMK+RGT+Q KEVV+L+DCGA HNF+S++LV TL+LP K+T++YGVILGSGTAI+GKG+CE +++ +  WTV 
Subjt:  KTIELQNDLGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGAIHNFISDRLVMTLKLPTKDTSNYGVILGSGTAIKGKGVCEKVKLDLNGWTVL

Query:  ENFLPLELGGVDVILGMQWLHSLGVTEMD
        E+FLPLELGGVDVILGMQWL+SLGVT  D
Subjt:  ENFLPLELGGVDVILGMQWLHSLGVTEMD

A0A5D3BEL2 Ty3/gypsy retrotransposon protein (e value: 2.2e-133; % identity: 51.98)
Query:  MVQTRSEERRDTHEQ-------ELNKIPVMEEKLTVMSQNMENLQAQVEKTHQMVMIFMERMAKERVLASGKQIDSSAQETWTGKSAEGESSASKETKND
        MVQTR EER ++ EQ       EL K+PV+E  L  +++NME ++ Q EK  Q ++ +ME  AKER +A  +  +S  Q + T KS   ++S+S++ + +
Subjt:  MVQTRSEERRDTHEQ-------ELNKIPVMEEKLTVMSQNMENLQAQVEKTHQMVMIFMERMAKERVLASGKQIDSSAQETWTGKSAEGESSASKETKND

Query:  TMEKKGDGDGDNNDRNKFNKVEMSVFNGDDPDSWLFRADRYFQIHKLTNSE--------------------------------NSRSLQSVLKAPHSTGI
           KK + D ++NDR+KF KVEM VF G+DP+SWLFRA+RYFQIHKLT SE                                  R L         T  
Subjt:  TMEKKGDGDGDNNDRNKFNKVEMSVFNGDDPDSWLFRADRYFQIHKLTNSE--------------------------------NSRSLQSVLKAPHSTGI

Query:  GRRRRETDLPQESSVEEYRNLFDKWVAPLSDIPKKIVEETFMGGLLPWIKVEMEFCIPVGLAEMMRYAQMVEHREILRREANLPGYSGAKVPNYPYNTAK
        GR  R   + QE++VEEYRNLFDK VAPLSD+  ++VEETFM GL PWI+ E+  C P GLAEMMR AQ+VE RE+LR  ANL GY G K         K
Subjt:  GRRRRETDLPQESSVEEYRNLFDKWVAPLSDIPKKIVEETFMGGLLPWIKVEMEFCIPVGLAEMMRYAQMVEHREILRREANLPGYSGAKVPNYPYNTAK

Query:  TNSIIKEQGNKENTVFSIRTITLKGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSRHKCRVKEIRELCMFVVRADDVEEEIIEEDEYDLKEL
             + + NK N  F IRTITLK   + E +KEG SKRL DAEFQ +REKGLCFKC+EKY + HKC+++E REL MFVV+ ++ E EI+EE E D  EL
Subjt:  TNSIIKEQGNKENTVFSIRTITLKGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSRHKCRVKEIRELCMFVVRADDVEEEIIEEDEYDLKEL

Query:  KTIELQNDLGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGAIHNFISDRLVMTLKLPTKDTSNYGVILGSGTAIKGKGVCEKVKLDLNGWTVL
        +T+E++      VEL INSVVGL +PGTMK+RGT+Q KEVV+L+DCGA HNF+S++LV TL+LP K+T++YGVILGSGTAI+GKG+CE +++ +  WTV 
Subjt:  KTIELQNDLGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGAIHNFISDRLVMTLKLPTKDTSNYGVILGSGTAIKGKGVCEKVKLDLNGWTVL

Query:  ENFLPLELGGVDVILGMQWLHSLGVTEMD
        E+FLPLELGGVDVILGMQWL+SLGVT  D
Subjt:  ENFLPLELGGVDVILGMQWLHSLGVTEMD

A0A5D3DC20 Transposon Tf2-1 polyprotein isoform X1 (e value: 2.2e-133; % identity: 51.98)
Query:  MVQTRSEERRDTHEQ-------ELNKIPVMEEKLTVMSQNMENLQAQVEKTHQMVMIFMERMAKERVLASGKQIDSSAQETWTGKSAEGESSASKETKND
        MVQTR EER ++ EQ       EL K+PV+E  L  +++NME ++ Q EK  Q ++ +ME  AKER +A  +  +S  Q + T KS   ++S+S++ + +
Subjt:  MVQTRSEERRDTHEQ-------ELNKIPVMEEKLTVMSQNMENLQAQVEKTHQMVMIFMERMAKERVLASGKQIDSSAQETWTGKSAEGESSASKETKND

Query:  TMEKKGDGDGDNNDRNKFNKVEMSVFNGDDPDSWLFRADRYFQIHKLTNSE--------------------------------NSRSLQSVLKAPHSTGI
           KK + D ++NDR+KF KVEM VF G+DP+SWLFRA+RYFQIHKLT SE                                  R L         T  
Subjt:  TMEKKGDGDGDNNDRNKFNKVEMSVFNGDDPDSWLFRADRYFQIHKLTNSE--------------------------------NSRSLQSVLKAPHSTGI

Query:  GRRRRETDLPQESSVEEYRNLFDKWVAPLSDIPKKIVEETFMGGLLPWIKVEMEFCIPVGLAEMMRYAQMVEHREILRREANLPGYSGAKVPNYPYNTAK
        GR  R   + QE++VEEYRNLFDK VAPLSD+  ++VEETFM GL PWI+ E+  C P GLAEMMR AQ+VE RE+LR  ANL GY G K         K
Subjt:  GRRRRETDLPQESSVEEYRNLFDKWVAPLSDIPKKIVEETFMGGLLPWIKVEMEFCIPVGLAEMMRYAQMVEHREILRREANLPGYSGAKVPNYPYNTAK

Query:  TNSIIKEQGNKENTVFSIRTITLKGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSRHKCRVKEIRELCMFVVRADDVEEEIIEEDEYDLKEL
             + + NK N  F IRTITLK   + E +KEG SKRL DAEFQ +REKGLCFKC+EKY + HKC+++E REL MFVV+ ++ E EI+EE E D  EL
Subjt:  TNSIIKEQGNKENTVFSIRTITLKGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSRHKCRVKEIRELCMFVVRADDVEEEIIEEDEYDLKEL

Query:  KTIELQNDLGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGAIHNFISDRLVMTLKLPTKDTSNYGVILGSGTAIKGKGVCEKVKLDLNGWTVL
        +T+E++      VEL INSVVGL +PGTMK+RGT+Q KEVV+L+DCGA HNF+S++LV TL+LP K+T++YGVILGSGTAI+GKG+CE +++ +  WTV 
Subjt:  KTIELQNDLGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGAIHNFISDRLVMTLKLPTKDTSNYGVILGSGTAIKGKGVCEKVKLDLNGWTVL

Query:  ENFLPLELGGVDVILGMQWLHSLGVTEMD
        E+FLPLELGGVDVILGMQWL+SLGVT  D
Subjt:  ENFLPLELGGVDVILGMQWLHSLGVTEMD

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G29750.1 Eukaryotic aspartyl protease family protein (e value: 1.2e-14; % identity: 30.81)
Query:  IRTITLKGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSRHKCRVKEIRELCMFVVRADDVEEEIIEEDEYDLKELKTIELQNDLGEVVELCI
        +R++TL G   +E+  +G    L  A  + K   G+         + ++ R  E+  L +   + D     ++++ +  + EL+  EL+ D   + +   
Subjt:  IRTITLKGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSRHKCRVKEIRELCMFVVRADDVEEEIIEEDEYDLKELKTIELQNDLGEVVELCI

Query:  NSVVGLTNPGTMKIRGTIQSKEVVVLVDCGAIHNFISDRLVMTLKLPTKDTSNYGVILGSGTAIKGKGVCEKVKLDLNGWTVLENFLPLELG--GVDVIL
          V+ LT    M+  G I   +VVV +D GA  NFI   L  +LKLPT  T+   V+LG    I+  G C  ++L +    + ENFL L+L    VDVIL
Subjt:  NSVVGLTNPGTMKIRGTIQSKEVVVLVDCGAIHNFISDRLVMTLKLPTKDTSNYGVILGSGTAIKGKGVCEKVKLDLNGWTVLENFLPLELG--GVDVIL

Query:  GMQWLHSLGVT
        G +WL  LG T
Subjt:  GMQWLHSLGVT

AT3G30770.1 Eukaryotic aspartyl protease family protein (e value: 5.7e-12; % identity: 38.6)
Query:  LQNDLGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGAIHNFISDRLVMTLKLPTKDTSNYGVILGSGTAIKGKGVCEKVKLDLNGWTVLENFL
        L  D   + ++   S    T    M+  G I   +VVV++D GA +NFISD L + LKLPT  T+   V+LG    I+  G C  + L +    + ENFL
Subjt:  LQNDLGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGAIHNFISDRLVMTLKLPTKDTSNYGVILGSGTAIKGKGVCEKVKLDLNGWTVLENFL

Query:  PLEL--GGVDVILG
         L+L    VDVILG
Subjt:  PLEL--GGVDVILG
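
In the alignments above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. The sketch below counts those marks for one alignment segment. It assumes this standard BLAST match-line convention applies here, and the helper name `midline_stats` is hypothetical. The reported % identity values cover the full alignment, so a single segment will generally give a different figure.

```python
def midline_stats(query, midline):
    """Return (identical, positives, columns) for one BLAST-style alignment segment."""
    if len(query) != len(midline):
        raise ValueError("query and match line must have the same length")
    # Letters on the match line mark identical residues.
    identical = sum(1 for mark in midline if mark.isalpha())
    # '+' marks a conservative substitution; positives include identities.
    positives = identical + midline.count("+")
    return identical, positives, len(query)

# Final segment of the first GenBank hit (KAA0062868.1), copied from this record.
query = "ENFLPLELGGVDVILGMQWLHSLGVTEMD"
midline = "E+FLPLELGGVDVILGMQWL+SLGVT  D"
identical, positives, columns = midline_stats(query, midline)
print(f"{identical}/{columns} identical, {positives}/{columns} positive")
```

For this 29-column segment the count is 25 identical and 27 positive columns.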


Sequences
CDS sequence
ATGGTTCAAACCAGAAGCGAAGAGAGAAGGGACACACACGAACAAGAACTCAACAAAATTCCGGTGATGGAAGAGAAGCTTACGGTGATGTCACAAAACATGGAAAACCT
TCAGGCCCAAGTGGAGAAAACACACCAGATGGTGATGATATTCATGGAGAGGATGGCCAAGGAACGAGTATTAGCGAGCGGTAAACAAATCGATTCGTCGGCACAAGAAA
CATGGACGGGAAAATCGGCGGAGGGAGAGAGTTCGGCAAGTAAGGAAACTAAAAATGACACGATGGAGAAGAAGGGTGATGGCGATGGGGATAACAACGATCGAAACAAA
TTCAACAAAGTTGAAATGTCGGTATTCAATGGAGATGACCCAGATTCATGGCTATTCCGTGCAGATAGGTATTTTCAAATACACAAACTGACGAATTCTGAAAACTCACG
GTCGCTACAATCAGTTTTGAAGGCCCCGCACTCAACTGGTATCGGTCGCAGGAGAAGAGAGACAGATTTACCTCAGGAATCAAGTGTAGAGGAATACCGGAATCTATTCG
ATAAGTGGGTGGCACCGTTATCGGACATTCCGAAAAAGATTGTGGAAGAGACGTTCATGGGAGGGTTGTTACCGTGGATTAAGGTGGAGATGGAATTCTGCATTCCCGTG
GGATTAGCCGAGATGATGAGATACGCGCAGATGGTGGAACATCGGGAGATCCTGAGGAGAGAAGCAAATTTACCCGGTTATTCTGGAGCGAAAGTTCCAAATTACCCCTA
TAATACGGCCAAAACAAATTCAATTATAAAAGAACAGGGGAATAAGGAGAACACGGTATTTTCGATACGAACAATCACACTGAAGGGATCACCGGCAAAGGAGATTAAGA
AAGAAGGACCATCCAAACGGCTTTCCGACGCAGAATTCCAGGCCAAGAGGGAGAAAGGACTCTGTTTCAAATGTGATGAGAAGTATTACTCCAGGCACAAATGCAGGGTG
AAGGAAATACGTGAGTTATGTATGTTCGTGGTAAGAGCAGACGACGTGGAGGAAGAAATTATTGAGGAAGACGAGTATGACTTGAAGGAATTGAAAACTATTGAGTTGCA
GAATGACCTTGGGGAAGTAGTGGAGTTATGTATTAACTCGGTAGTGGGATTGACGAATCCGGGTACCATGAAGATAAGGGGAACAATTCAAAGTAAGGAGGTTGTCGTGC
TAGTGGATTGTGGAGCCATCCACAATTTCATATCCGACCGACTAGTGATGACACTGAAATTACCCACAAAGGATACTTCTAACTATGGGGTAATACTTGGGTCAGGAACA
GCCATCAAAGGCAAGGGAGTGTGTGAAAAAGTAAAGTTGGATCTCAATGGGTGGACAGTCCTTGAAAACTTCCTACCACTGGAACTGGGAGGGGTAGACGTGATACTTGG
GATGCAATGGTTACACTCATTGGGAGTGACTGAGATGGACTGA
mRNA sequence
ATGGTTCAAACCAGAAGCGAAGAGAGAAGGGACACACACGAACAAGAACTCAACAAAATTCCGGTGATGGAAGAGAAGCTTACGGTGATGTCACAAAACATGGAAAACCT
TCAGGCCCAAGTGGAGAAAACACACCAGATGGTGATGATATTCATGGAGAGGATGGCCAAGGAACGAGTATTAGCGAGCGGTAAACAAATCGATTCGTCGGCACAAGAAA
CATGGACGGGAAAATCGGCGGAGGGAGAGAGTTCGGCAAGTAAGGAAACTAAAAATGACACGATGGAGAAGAAGGGTGATGGCGATGGGGATAACAACGATCGAAACAAA
TTCAACAAAGTTGAAATGTCGGTATTCAATGGAGATGACCCAGATTCATGGCTATTCCGTGCAGATAGGTATTTTCAAATACACAAACTGACGAATTCTGAAAACTCACG
GTCGCTACAATCAGTTTTGAAGGCCCCGCACTCAACTGGTATCGGTCGCAGGAGAAGAGAGACAGATTTACCTCAGGAATCAAGTGTAGAGGAATACCGGAATCTATTCG
ATAAGTGGGTGGCACCGTTATCGGACATTCCGAAAAAGATTGTGGAAGAGACGTTCATGGGAGGGTTGTTACCGTGGATTAAGGTGGAGATGGAATTCTGCATTCCCGTG
GGATTAGCCGAGATGATGAGATACGCGCAGATGGTGGAACATCGGGAGATCCTGAGGAGAGAAGCAAATTTACCCGGTTATTCTGGAGCGAAAGTTCCAAATTACCCCTA
TAATACGGCCAAAACAAATTCAATTATAAAAGAACAGGGGAATAAGGAGAACACGGTATTTTCGATACGAACAATCACACTGAAGGGATCACCGGCAAAGGAGATTAAGA
AAGAAGGACCATCCAAACGGCTTTCCGACGCAGAATTCCAGGCCAAGAGGGAGAAAGGACTCTGTTTCAAATGTGATGAGAAGTATTACTCCAGGCACAAATGCAGGGTG
AAGGAAATACGTGAGTTATGTATGTTCGTGGTAAGAGCAGACGACGTGGAGGAAGAAATTATTGAGGAAGACGAGTATGACTTGAAGGAATTGAAAACTATTGAGTTGCA
GAATGACCTTGGGGAAGTAGTGGAGTTATGTATTAACTCGGTAGTGGGATTGACGAATCCGGGTACCATGAAGATAAGGGGAACAATTCAAAGTAAGGAGGTTGTCGTGC
TAGTGGATTGTGGAGCCATCCACAATTTCATATCCGACCGACTAGTGATGACACTGAAATTACCCACAAAGGATACTTCTAACTATGGGGTAATACTTGGGTCAGGAACA
GCCATCAAAGGCAAGGGAGTGTGTGAAAAAGTAAAGTTGGATCTCAATGGGTGGACAGTCCTTGAAAACTTCCTACCACTGGAACTGGGAGGGGTAGACGTGATACTTGG
GATGCAATGGTTACACTCATTGGGAGTGACTGAGATGGACTGA
Protein sequence
MVQTRSEERRDTHEQELNKIPVMEEKLTVMSQNMENLQAQVEKTHQMVMIFMERMAKERVLASGKQIDSSAQETWTGKSAEGESSASKETKNDTMEKKGDGDGDNNDRNK
FNKVEMSVFNGDDPDSWLFRADRYFQIHKLTNSENSRSLQSVLKAPHSTGIGRRRRETDLPQESSVEEYRNLFDKWVAPLSDIPKKIVEETFMGGLLPWIKVEMEFCIPV
GLAEMMRYAQMVEHREILRREANLPGYSGAKVPNYPYNTAKTNSIIKEQGNKENTVFSIRTITLKGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSRHKCRV
KEIRELCMFVVRADDVEEEIIEEDEYDLKELKTIELQNDLGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGAIHNFISDRLVMTLKLPTKDTSNYGVILGSGT
AIKGKGVCEKVKLDLNGWTVLENFLPLELGGVDVILGMQWLHSLGVTEMD
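
The protein sequence corresponds to the CDS read codon by codon. A minimal sketch of that translation follows, assuming the standard nuclear genetic code. The names `CODON_TABLE` and `translate` are hypothetical, and the two short inputs are the first 18 and last 15 nucleotides of the CDS listed above.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3].upper()]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)

print(translate("ATGGTTCAAACCAGAAGC"))  # first six codons of the CDS
print(translate("ACTGAGATGGACTGA"))     # last four codons plus the stop codon
```

To translate the whole record, join the CDS lines into one string before calling `translate`; the sequence is wrapped at a fixed width that does not fall on codon boundaries.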