; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G10250 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G10250
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr7:8379062..8383451
RNA-Seq ExpressionCSPI07G10250
SyntenyCSPI07G10250
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0062868.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0062.42Show/hide
Query:  MVQTRSEERRDTHEQ-------ELNKISVMEEKLTVMSQNMENLQARVEKTHQMVMIFMETMAKEQVLASGKGIDSSVQETWTGKTAEGESSASKETKNG
        MVQTR+EER ++ EQ       EL K+ V+E  L  +++NME ++ + EK  Q ++ +ME  AKE+ +A  +  +S  Q + T K+   ++S+S++ +  
Subjt:  MVQTRSEERRDTHEQ-------ELNKISVMEEKLTVMSQNMENLQARVEKTHQMVMIFMETMAKEQVLASGKGIDSSVQETWTGKTAEGESSASKETKNG

Query:  TTEKKEEGDGDSNDRNKFKKVEMPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPVLNW------------------------------VVY
         T KK E D +SNDR+KFKKVEMPVF G+DP+SWLFRAERYFQIHKLT+SEK+ V+TI F+GP LNW                                +
Subjt:  TTEKKEEGDGDSNDRNKFKKVEMPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPVLNW------------------------------VVY

Query:  GRFLRIQQETSVEEYRNLFDKWVAPLSDIPEKIVEETFMGGLLPWIKVEMEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGAKVPNGSYSTAKTYS
        GRFLRIQQET+VEEYRNLFDK VAPL D+ +++VEETFM GL PWI+ E+  C P GLAEMMR AQ+VE REILR  ANL GY G K    + +  K Y 
Subjt:  GRFLRIQQETSVEEYRNLFDKWVAPLSDIPEKIVEETFMGGLLPWIKVEMEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGAKVPNGSYSTAKTYS

Query:  GLKEQGNKENTVFPIRTITLRGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAKEIRELRMFVVRDDDVEEEIIEEDEYDLKELKAM
          + + NK N  FPIRTITL+   + E +KEG SKRL DAEFQ +REKGLCFKC+EKY + HKCK +E RELRMFVV+D++ E EI+EE E D  EL+ +
Subjt:  GLKEQGNKENTVFPIRTITLRGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAKEIRELRMFVVRDDDVEEEIIEEDEYDLKELKAM

Query:  ELQNDPGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGATHNFISDRLVRTLKIPTKDTANYGVILGSGTAIKGKGVCEKVELNLMGWTVMENF
        E+Q      VEL INSVVGL +PGTMK+RGT+Q KEVV+L+DCGATHNF+S++LV TL++P K+TA+YGVILGSGTAI+GKG+CE +E+ +  WTV E+F
Subjt:  ELQNDPGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGATHNFISDRLVRTLKIPTKDTANYGVILGSGTAIKGKGVCEKVELNLMGWTVMENF

Query:  LPLELGGVDLILGMQWLHSLGVTEMDWRNLTMSFFHDSKKVVLKGDPSLTKTQVSLKNFTKSWIETDTGYLIECRTLE---ACQAKIKTEENENEPESVL
        LPLELGGVD+ILGMQWL+SLGVT  DW+NLT++F+ D KK+ +KGDPSLTK +VSLKN  K+W E D GYLIECR++    A    +  EE     E +L
Subjt:  LPLELGGVDLILGMQWLHSLGVTEMDWRNLTMSFFHDSKKVVLKGDPSLTKTQVSLKNFTKSWIETDTGYLIECRTLE---ACQAKIKTEENENEPESVL

Query:  TVLTQYKDVFDGPKELPPSRDIEHHIHIREGADPVNVRPYRYAFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNITIPD
         +L Q+KD+F+ P++LPP R IEH IH++EG +PVNVRPYRYA+ QKEE+E+LV+EM+ASGIIRPS SPYSSPVLLVKKKDGSWRFCVDYRALNN+T+PD
Subjt:  TVLTQYKDVFDGPKELPPSRDIEHHIHIREGADPVNVRPYRYAFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNITIPD

Query:  KFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNTPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEH
        KFPIPVVEELFDEL GA+LF+KIDLKAGYHQ+RM   DIEKTAFRTHEGHYEFLVMPFGLTN PATFQSLMNSIFR YLR+FVLVFFDDIL+YSRNLE+H
Subjt:  KFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNTPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEH

Query:  CQHIELVLEVLREHKLFANRKKCCFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIPTNVREVRGFLGLTALW----IHSGT----LTQLLKLGAFKWN
         +HIE V  VLR+H+LFANRKKC F  ++VEYLGH++S +GVEVDPEKI+A+  WP PT+VRE RGFLGLT  +     H GT    LTQLLK G F WN
Subjt:  CQHIELVLEVLREHKLFANRKKCCFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIPTNVREVRGFLGLTALW----IHSGT----LTQLLKLGAFKWN

Query:  EEAHEAFEKLQRAMMTLPILALLDFNAPFEVETDASGYGIGAVLMQNKRPIAFYSHTLALRDRSKP
         EA +AFEKL++AM+ LP+LAL  F+ PFE+ETDASGYG+GAVL+QNKRPIAFYSHTLA+RDR +P
Subjt:  EEAHEAFEKLQRAMMTLPILALLDFNAPFEVETDASGYGIGAVLMQNKRPIAFYSHTLALRDRSKP

KAA0068193.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0062.32Show/hide
Query:  MVQTRSEERRDTHEQ-------ELNKISVMEEKLTVMSQNMENLQARVEKTHQMVMIFMETMAKEQVLASGKGIDSSVQETWTGKTAEGESSASKETKNG
        MVQTR EER ++ EQ       EL K+ V+E  L  +++NME ++ + EK  Q ++ +ME  AKE+ +A  +  +S  Q + T K+   ++S+S++ +  
Subjt:  MVQTRSEERRDTHEQ-------ELNKISVMEEKLTVMSQNMENLQARVEKTHQMVMIFMETMAKEQVLASGKGIDSSVQETWTGKTAEGESSASKETKNG

Query:  TTEKKEEGDGDSNDRNKFKKVEMPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPVLNW------------------------------VVY
         T KK E D +SNDR+KFKKVEMPVF G+DP+SWLFRAERYFQIHKLT+SEK+ V+TI F+GP LNW                                +
Subjt:  TTEKKEEGDGDSNDRNKFKKVEMPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPVLNW------------------------------VVY

Query:  GRFLRIQQETSVEEYRNLFDKWVAPLSDIPEKIVEETFMGGLLPWIKVEMEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGAKVPNGSYSTAKTYS
        GRFLRIQQET+VEEYRNLFDK VAPLSD+ +++VEETFM GL PWI+ E+  C P GLAEMMR AQ+VE RE+LR  ANL GY G K    + +  K Y 
Subjt:  GRFLRIQQETSVEEYRNLFDKWVAPLSDIPEKIVEETFMGGLLPWIKVEMEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGAKVPNGSYSTAKTYS

Query:  GLKEQGNKENTVFPIRTITLRGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAKEIRELRMFVVRDDDVEEEIIEEDEYDLKELKAM
          + + NK N  FPIRTITL+   + E +KEG SKRL DAEFQ +REKGLCFKC+EKY + HKCK +E RELRMFVV+D++ E EI+EE E D  EL+ +
Subjt:  GLKEQGNKENTVFPIRTITLRGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAKEIRELRMFVVRDDDVEEEIIEEDEYDLKELKAM

Query:  ELQNDPGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGATHNFISDRLVRTLKIPTKDTANYGVILGSGTAIKGKGVCEKVELNLMGWTVMENF
        E++      VEL INSVVGL +PGTMK+RGT+Q KEVV+L+DCGATHNF+S++LV TL++P K+TA+YGVILGSGTAI+GKG+CE +E+ +  WTV E+F
Subjt:  ELQNDPGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGATHNFISDRLVRTLKIPTKDTANYGVILGSGTAIKGKGVCEKVELNLMGWTVMENF

Query:  LPLELGGVDLILGMQWLHSLGVTEMDWRNLTMSFFHDSKKVVLKGDPSLTKTQVSLKNFTKSWIETDTGYLIECRTLE---ACQAKIKTEENENEPESVL
        LPLELGGVD+ILGMQWL+SLGVT  DW+NLT++F+ D KK+ +KGDPSLTK +VSLKN  K+W E D GYLIECR++    A    +  EE     E +L
Subjt:  LPLELGGVDLILGMQWLHSLGVTEMDWRNLTMSFFHDSKKVVLKGDPSLTKTQVSLKNFTKSWIETDTGYLIECRTLE---ACQAKIKTEENENEPESVL

Query:  TVLTQYKDVFDGPKELPPSRDIEHHIHIREGADPVNVRPYRYAFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNITIPD
         +L Q+KD+F+ P++LPP R IEH IH++EG +PVNVRPYRYA+ QKEE+E+LV+EM+ASGIIRPS SPYSSPVLLVKKKDGSWRFCVDYRALNN+T+PD
Subjt:  TVLTQYKDVFDGPKELPPSRDIEHHIHIREGADPVNVRPYRYAFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNITIPD

Query:  KFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNTPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEH
        KFPIPVVEELFDEL GA+LF+KIDLKAGYHQ+RM   DIEKTAFRTHEGHYEFLVMPFGLTN PATFQSLMNSIFR YLR+FVLVFFDDIL+YSRNLE+H
Subjt:  KFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNTPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEH

Query:  CQHIELVLEVLREHKLFANRKKCCFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIPTNVREVRGFLGLTALW----IHSGT----LTQLLKLGAFKWN
         +HIE V  VLR+H+LFANRKKC F  ++VEYLGH++S +GVEVDPEKI+A+  WP PT+VRE RGFLGLT  +     H GT    LTQLLK G F WN
Subjt:  CQHIELVLEVLREHKLFANRKKCCFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIPTNVREVRGFLGLTALW----IHSGT----LTQLLKLGAFKWN

Query:  EEAHEAFEKLQRAMMTLPILALLDFNAPFEVETDASGYGIGAVLMQNKRPIAFYSHTLALRDRSKP
         EA +AFEKL++AM+ LP+LAL  F+ PFE+ETDASGYG+GAVL+QNKRPIAFYSHTLA+RDR +P
Subjt:  EEAHEAFEKLQRAMMTLPILALLDFNAPFEVETDASGYGIGAVLMQNKRPIAFYSHTLALRDRSKP

TYJ96875.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0062.32Show/hide
Query:  MVQTRSEERRDTHEQ-------ELNKISVMEEKLTVMSQNMENLQARVEKTHQMVMIFMETMAKEQVLASGKGIDSSVQETWTGKTAEGESSASKETKNG
        MVQTR EER ++ EQ       EL K+ V+E  L  +++NME ++ + EK  Q ++ +ME  AKE+ +A  +  +S  Q + T K+   ++S+S++ +  
Subjt:  MVQTRSEERRDTHEQ-------ELNKISVMEEKLTVMSQNMENLQARVEKTHQMVMIFMETMAKEQVLASGKGIDSSVQETWTGKTAEGESSASKETKNG

Query:  TTEKKEEGDGDSNDRNKFKKVEMPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPVLNW------------------------------VVY
         T KK E D +SNDR+KFKKVEMPVF G+DP+SWLFRAERYFQIHKLT+SEK+ V+TI F+GP LNW                                +
Subjt:  TTEKKEEGDGDSNDRNKFKKVEMPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPVLNW------------------------------VVY

Query:  GRFLRIQQETSVEEYRNLFDKWVAPLSDIPEKIVEETFMGGLLPWIKVEMEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGAKVPNGSYSTAKTYS
        GRFLRIQQET+VEEYRNLFDK VAPLSD+ +++VEETFM GL PWI+ E+  C P GLAEMMR AQ+VE RE+LR  ANL GY G K    + +  K Y 
Subjt:  GRFLRIQQETSVEEYRNLFDKWVAPLSDIPEKIVEETFMGGLLPWIKVEMEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGAKVPNGSYSTAKTYS

Query:  GLKEQGNKENTVFPIRTITLRGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAKEIRELRMFVVRDDDVEEEIIEEDEYDLKELKAM
          + + NK N  FPIRTITL+   + E +KEG SKRL DAEFQ +REKGLCFKC+EKY + HKCK +E RELRMFVV+D++ E EI+EE E D  EL+ +
Subjt:  GLKEQGNKENTVFPIRTITLRGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAKEIRELRMFVVRDDDVEEEIIEEDEYDLKELKAM

Query:  ELQNDPGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGATHNFISDRLVRTLKIPTKDTANYGVILGSGTAIKGKGVCEKVELNLMGWTVMENF
        E++      VEL INSVVGL +PGTMK+RGT+Q KEVV+L+DCGATHNF+S++LV TL++P K+TA+YGVILGSGTAI+GKG+CE +E+ +  WTV E+F
Subjt:  ELQNDPGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGATHNFISDRLVRTLKIPTKDTANYGVILGSGTAIKGKGVCEKVELNLMGWTVMENF

Query:  LPLELGGVDLILGMQWLHSLGVTEMDWRNLTMSFFHDSKKVVLKGDPSLTKTQVSLKNFTKSWIETDTGYLIECRTLE---ACQAKIKTEENENEPESVL
        LPLELGGVD+ILGMQWL+SLGVT  DW+NLT++F+ D KK+ +KGDPSLTK +VSLKN  K+W E D GYLIECR++    A    +  EE     E +L
Subjt:  LPLELGGVDLILGMQWLHSLGVTEMDWRNLTMSFFHDSKKVVLKGDPSLTKTQVSLKNFTKSWIETDTGYLIECRTLE---ACQAKIKTEENENEPESVL

Query:  TVLTQYKDVFDGPKELPPSRDIEHHIHIREGADPVNVRPYRYAFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNITIPD
         +L Q+KD+F+ P++LPP R IEH IH++EG +PVNVRPYRYA+ QKEE+E+LV+EM+ASGIIRPS SPYSSPVLLVKKKDGSWRFCVDYRALNN+T+PD
Subjt:  TVLTQYKDVFDGPKELPPSRDIEHHIHIREGADPVNVRPYRYAFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNITIPD

Query:  KFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNTPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEH
        KFPIPVVEELFDEL GA+LF+KIDLKAGYHQ+RM   DIEKTAFRTHEGHYEFLVMPFGLTN PATFQSLMNSIFR YLR+FVLVFFDDIL+YSRNLE+H
Subjt:  KFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNTPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEH

Query:  CQHIELVLEVLREHKLFANRKKCCFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIPTNVREVRGFLGLTALW----IHSGT----LTQLLKLGAFKWN
         +HIE V  VLR+H+LFANRKKC F  ++VEYLGH++S +GVEVDPEKI+A+  WP PT+VRE RGFLGLT  +     H GT    LTQLLK G F WN
Subjt:  CQHIELVLEVLREHKLFANRKKCCFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIPTNVREVRGFLGLTALW----IHSGT----LTQLLKLGAFKWN

Query:  EEAHEAFEKLQRAMMTLPILALLDFNAPFEVETDASGYGIGAVLMQNKRPIAFYSHTLALRDRSKP
         EA +AFEKL++AM+ LP+LAL  F+ PFE+ETDASGYG+GAVL+QNKRPIAFYSHTLA+RDR +P
Subjt:  EEAHEAFEKLQRAMMTLPILALLDFNAPFEVETDASGYGIGAVLMQNKRPIAFYSHTLALRDRSKP

TYK22240.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0061.65Show/hide
Query:  MVQTRSEERRDTHEQ-------ELNKISVMEEKLTVMSQNMENLQARVEKTHQMVMIFMETMAKEQVLASGKGIDSSVQETWTGKTAEGESSASKETKNG
        MVQTR EER +  EQ       EL K+  +E  L  +++NME ++ + EK  Q ++ +ME  AKE+ +A  +  +S +Q +   K+  G++S+S +    
Subjt:  MVQTRSEERRDTHEQ-------ELNKISVMEEKLTVMSQNMENLQARVEKTHQMVMIFMETMAKEQVLASGKGIDSSVQETWTGKTAEGESSASKETKNG

Query:  TTEKKEEGDGDSNDRNKFKKVEMPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPVLNW------------------------------VVY
        + E+K + D ++NDR+KFKKVEMPVF G+DP+SWLFRAERYFQIHKLT+SEK+ V+TI F+GP LNW                               V 
Subjt:  TTEKKEEGDGDSNDRNKFKKVEMPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPVLNW------------------------------VVY

Query:  GRFLRIQQETSVEEYRNLFDKWVAPLSDIPEKIVEETFMGGLLPWIKVEMEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGAKVPNGSYSTAKTYS
        GRFLRIQQET+VEEYRN FDK VAPLSD+ +++VEETFM GL PWI+ E+  C P GLAE M  AQ+VE REILR  ANL  Y G K       +A T +
Subjt:  GRFLRIQQETSVEEYRNLFDKWVAPLSDIPEKIVEETFMGGLLPWIKVEMEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGAKVPNGSYSTAKTYS

Query:  GLKE---QGNKE---NTVFPIRTITLRGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAKEIRELRMFVVRDDDVEEEIIEEDEYDL
        G+K    Q NKE   N  FPIRTITL+     EI+KEG SKRL DAEFQ ++EKGLCFKC+EKY + HKCK KE RELRMFVV++D+ E EI+EE E + 
Subjt:  GLKE---QGNKE---NTVFPIRTITLRGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAKEIRELRMFVVRDDDVEEEIIEEDEYDL

Query:  KELKAMELQNDPGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGATHNFISDRLVRTLKIPTKDTANYGVILGSGTAIKGKGVCEKVELNLMGW
         E++  E+Q      VEL INSVVGL +PGTMK++G++Q KEVV+L+DCGATHNF+S+++V +L++P K+TA+YGVILGSGTAI+GKG+CE VE+ +  W
Subjt:  KELKAMELQNDPGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGATHNFISDRLVRTLKIPTKDTANYGVILGSGTAIKGKGVCEKVELNLMGW

Query:  TVMENFLPLELGGVDLILGMQWLHSLGVTEMDWRNLTMSFFHDSKKVVLKGDPSLTKTQVSLKNFTKSWIETDTGYLIECRTLEACQAKIK-TEENENEP
        TV E+FLPLELGGVD+ILGMQWL+SLGVT  DW+NLT++F+ + K++ +KGDPSLTK +VSLKN  K+W E D GYLIECR++E  + K    EE E   
Subjt:  TVMENFLPLELGGVDLILGMQWLHSLGVTEMDWRNLTMSFFHDSKKVVLKGDPSLTKTQVSLKNFTKSWIETDTGYLIECRTLEACQAKIK-TEENENEP

Query:  ESVLTVLTQYKDVFDGPKELPPSRDIEHHIHIREGADPVNVRPYRYAFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNI
        + ++ +L Q+ DVF+ P++LPP R IEH IH++EG +PVNVRPYRYA+ QKEE+EKLV+EM+ SGIIRPS SPYSSPVLLVKKKDGSWRFCVDYRALNN+
Subjt:  ESVLTVLTQYKDVFDGPKELPPSRDIEHHIHIREGADPVNVRPYRYAFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNI

Query:  TIPDKFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNTPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRN
        T+PDKFPIPVVEELFDEL GA+LF+KIDLKAGYHQ+RM   DIEKTAFRTHEGHYEFLVMPFGLTN PATFQSLMNSIFR YLRKFVLVFFDDIL+YSRN
Subjt:  TIPDKFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNTPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRN

Query:  LEEHCQHIELVLEVLREHKLFANRKKCCFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIPTNVREVRGFLGLTALW----IHSGT----LTQLLKLGA
         E+H +H+E+V  VLR+H+LFANRKKC F  ++VEYLGH++S +GVEVDPEKI+A+  WP PTNVRE RGFLGLT  +     H GT    LTQLLK G 
Subjt:  LEEHCQHIELVLEVLREHKLFANRKKCCFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIPTNVREVRGFLGLTALW----IHSGT----LTQLLKLGA

Query:  FKWNEEAHEAFEKLQRAMMTLPILALLDFNAPFEVETDASGYGIGAVLMQNKRPIAFYSHTLALRDRSKP
        FKWN EA +AFEKL+ AM+ LPILAL  F+ PFE+ETDASGYGIGAVL+QNKRPIAFYSHTLA RDR +P
Subjt:  FKWNEEAHEAFEKLQRAMMTLPILALLDFNAPFEVETDASGYGIGAVLMQNKRPIAFYSHTLALRDRSKP

TYK30083.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0061.65Show/hide
Query:  MVQTRSEERRDTHEQ-------ELNKISVMEEKLTVMSQNMENLQARVEKTHQMVMIFMETMAKEQVLASGKGIDSSVQETWTGKTAEGESSASKETKNG
        MVQTR EER +  EQ       EL K+  +E  L  +++NME ++ + EK  Q ++ +ME  AKE+ +A  +  +S +Q +   K+  G++S+S +    
Subjt:  MVQTRSEERRDTHEQ-------ELNKISVMEEKLTVMSQNMENLQARVEKTHQMVMIFMETMAKEQVLASGKGIDSSVQETWTGKTAEGESSASKETKNG

Query:  TTEKKEEGDGDSNDRNKFKKVEMPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPVLNW------------------------------VVY
        + E+K + D ++NDR+KFKKVEMPVF G+DP+SWLFRAERYFQIHKLT+SEK+ V+TI F+GP LNW                               V 
Subjt:  TTEKKEEGDGDSNDRNKFKKVEMPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPVLNW------------------------------VVY

Query:  GRFLRIQQETSVEEYRNLFDKWVAPLSDIPEKIVEETFMGGLLPWIKVEMEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGAKVPNGSYSTAKTYS
        GRFLRIQQET+VEEYRN FDK VAPLSD+ +++VEETFM GL PWI+ E+  C P GLAE M  AQ+VE REILR  ANL  Y G K       +A T +
Subjt:  GRFLRIQQETSVEEYRNLFDKWVAPLSDIPEKIVEETFMGGLLPWIKVEMEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGAKVPNGSYSTAKTYS

Query:  GLKE---QGNKE---NTVFPIRTITLRGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAKEIRELRMFVVRDDDVEEEIIEEDEYDL
        G+K    Q NKE   N  FPIRTITL+     EI+KEG SKRL DAEFQ ++EKGLCFKC+EKY + HKCK KE RELRMFVV++D+ E EI+EE E + 
Subjt:  GLKE---QGNKE---NTVFPIRTITLRGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAKEIRELRMFVVRDDDVEEEIIEEDEYDL

Query:  KELKAMELQNDPGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGATHNFISDRLVRTLKIPTKDTANYGVILGSGTAIKGKGVCEKVELNLMGW
         E++  E+Q      VEL INSVVGL +PGTMK++G++Q KEVV+L+DCGATHNF+S+++V +L++P K+TA+YGVILGSGTAI+GKG+CE VE+ +  W
Subjt:  KELKAMELQNDPGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGATHNFISDRLVRTLKIPTKDTANYGVILGSGTAIKGKGVCEKVELNLMGW

Query:  TVMENFLPLELGGVDLILGMQWLHSLGVTEMDWRNLTMSFFHDSKKVVLKGDPSLTKTQVSLKNFTKSWIETDTGYLIECRTLEACQAKIK-TEENENEP
        TV E+FLPLELGGVD+ILGMQWL+SLGVT  DW+NLT++F+ + K++ +KGDPSLTK +VSLKN  K+W E D GYLIECR++E  + K    EE E   
Subjt:  TVMENFLPLELGGVDLILGMQWLHSLGVTEMDWRNLTMSFFHDSKKVVLKGDPSLTKTQVSLKNFTKSWIETDTGYLIECRTLEACQAKIK-TEENENEP

Query:  ESVLTVLTQYKDVFDGPKELPPSRDIEHHIHIREGADPVNVRPYRYAFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNI
        + ++ +L Q+ DVF+ P++LPP R IEH IH++EG +PVNVRPYRYA+ QKEE+EKLV+EM+ SGIIRPS SPYSSPVLLVKKKDGSWRFCVDYRALNN+
Subjt:  ESVLTVLTQYKDVFDGPKELPPSRDIEHHIHIREGADPVNVRPYRYAFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNI

Query:  TIPDKFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNTPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRN
        T+PDKFPIPVVEELFDEL GA+LF+KIDLKAGYHQ+RM   DIEKTAFRTHEGHYEFLVMPFGLTN PATFQSLMNSIFR YLRKFVLVFFDDIL+YSRN
Subjt:  TIPDKFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNTPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRN

Query:  LEEHCQHIELVLEVLREHKLFANRKKCCFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIPTNVREVRGFLGLTALW----IHSGT----LTQLLKLGA
         E+H +H+E+V  VLR+H+LFANRKKC F  ++VEYLGH++S +GVEVDPEKI+A+  WP PTNVRE RGFLGLT  +     H GT    LTQLLK G 
Subjt:  LEEHCQHIELVLEVLREHKLFANRKKCCFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIPTNVREVRGFLGLTALW----IHSGT----LTQLLKLGA

Query:  FKWNEEAHEAFEKLQRAMMTLPILALLDFNAPFEVETDASGYGIGAVLMQNKRPIAFYSHTLALRDRSKP
        FKWN EA +AFEKL+ AM+ LPILAL  F+ PFE+ETDASGYGIGAVL+QNKRPIAFYSHTLA RDR +P
Subjt:  FKWNEEAHEAFEKLQRAMMTLPILALLDFNAPFEVETDASGYGIGAVLMQNKRPIAFYSHTLALRDRSKP

TrEMBL top hitse value%identityAlignment
A0A5A7V5H5 Ty3/gypsy retrotransposon protein0.0e+0062.42Show/hide
Query:  MVQTRSEERRDTHEQ-------ELNKISVMEEKLTVMSQNMENLQARVEKTHQMVMIFMETMAKEQVLASGKGIDSSVQETWTGKTAEGESSASKETKNG
        MVQTR+EER ++ EQ       EL K+ V+E  L  +++NME ++ + EK  Q ++ +ME  AKE+ +A  +  +S  Q + T K+   ++S+S++ +  
Subjt:  MVQTRSEERRDTHEQ-------ELNKISVMEEKLTVMSQNMENLQARVEKTHQMVMIFMETMAKEQVLASGKGIDSSVQETWTGKTAEGESSASKETKNG

Query:  TTEKKEEGDGDSNDRNKFKKVEMPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPVLNW------------------------------VVY
         T KK E D +SNDR+KFKKVEMPVF G+DP+SWLFRAERYFQIHKLT+SEK+ V+TI F+GP LNW                                +
Subjt:  TTEKKEEGDGDSNDRNKFKKVEMPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPVLNW------------------------------VVY

Query:  GRFLRIQQETSVEEYRNLFDKWVAPLSDIPEKIVEETFMGGLLPWIKVEMEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGAKVPNGSYSTAKTYS
        GRFLRIQQET+VEEYRNLFDK VAPL D+ +++VEETFM GL PWI+ E+  C P GLAEMMR AQ+VE REILR  ANL GY G K    + +  K Y 
Subjt:  GRFLRIQQETSVEEYRNLFDKWVAPLSDIPEKIVEETFMGGLLPWIKVEMEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGAKVPNGSYSTAKTYS

Query:  GLKEQGNKENTVFPIRTITLRGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAKEIRELRMFVVRDDDVEEEIIEEDEYDLKELKAM
          + + NK N  FPIRTITL+   + E +KEG SKRL DAEFQ +REKGLCFKC+EKY + HKCK +E RELRMFVV+D++ E EI+EE E D  EL+ +
Subjt:  GLKEQGNKENTVFPIRTITLRGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAKEIRELRMFVVRDDDVEEEIIEEDEYDLKELKAM

Query:  ELQNDPGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGATHNFISDRLVRTLKIPTKDTANYGVILGSGTAIKGKGVCEKVELNLMGWTVMENF
        E+Q      VEL INSVVGL +PGTMK+RGT+Q KEVV+L+DCGATHNF+S++LV TL++P K+TA+YGVILGSGTAI+GKG+CE +E+ +  WTV E+F
Subjt:  ELQNDPGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGATHNFISDRLVRTLKIPTKDTANYGVILGSGTAIKGKGVCEKVELNLMGWTVMENF

Query:  LPLELGGVDLILGMQWLHSLGVTEMDWRNLTMSFFHDSKKVVLKGDPSLTKTQVSLKNFTKSWIETDTGYLIECRTLE---ACQAKIKTEENENEPESVL
        LPLELGGVD+ILGMQWL+SLGVT  DW+NLT++F+ D KK+ +KGDPSLTK +VSLKN  K+W E D GYLIECR++    A    +  EE     E +L
Subjt:  LPLELGGVDLILGMQWLHSLGVTEMDWRNLTMSFFHDSKKVVLKGDPSLTKTQVSLKNFTKSWIETDTGYLIECRTLE---ACQAKIKTEENENEPESVL

Query:  TVLTQYKDVFDGPKELPPSRDIEHHIHIREGADPVNVRPYRYAFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNITIPD
         +L Q+KD+F+ P++LPP R IEH IH++EG +PVNVRPYRYA+ QKEE+E+LV+EM+ASGIIRPS SPYSSPVLLVKKKDGSWRFCVDYRALNN+T+PD
Subjt:  TVLTQYKDVFDGPKELPPSRDIEHHIHIREGADPVNVRPYRYAFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNITIPD

Query:  KFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNTPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEH
        KFPIPVVEELFDEL GA+LF+KIDLKAGYHQ+RM   DIEKTAFRTHEGHYEFLVMPFGLTN PATFQSLMNSIFR YLR+FVLVFFDDIL+YSRNLE+H
Subjt:  KFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNTPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEH

Query:  CQHIELVLEVLREHKLFANRKKCCFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIPTNVREVRGFLGLTALW----IHSGT----LTQLLKLGAFKWN
         +HIE V  VLR+H+LFANRKKC F  ++VEYLGH++S +GVEVDPEKI+A+  WP PT+VRE RGFLGLT  +     H GT    LTQLLK G F WN
Subjt:  CQHIELVLEVLREHKLFANRKKCCFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIPTNVREVRGFLGLTALW----IHSGT----LTQLLKLGAFKWN

Query:  EEAHEAFEKLQRAMMTLPILALLDFNAPFEVETDASGYGIGAVLMQNKRPIAFYSHTLALRDRSKP
         EA +AFEKL++AM+ LP+LAL  F+ PFE+ETDASGYG+GAVL+QNKRPIAFYSHTLA+RDR +P
Subjt:  EEAHEAFEKLQRAMMTLPILALLDFNAPFEVETDASGYGIGAVLMQNKRPIAFYSHTLALRDRSKP

A0A5A7VJA0 Ty3/gypsy retrotransposon protein0.0e+0062.32Show/hide
Query:  MVQTRSEERRDTHEQ-------ELNKISVMEEKLTVMSQNMENLQARVEKTHQMVMIFMETMAKEQVLASGKGIDSSVQETWTGKTAEGESSASKETKNG
        MVQTR EER ++ EQ       EL K+ V+E  L  +++NME ++ + EK  Q ++ +ME  AKE+ +A  +  +S  Q + T K+   ++S+S++ +  
Subjt:  MVQTRSEERRDTHEQ-------ELNKISVMEEKLTVMSQNMENLQARVEKTHQMVMIFMETMAKEQVLASGKGIDSSVQETWTGKTAEGESSASKETKNG

Query:  TTEKKEEGDGDSNDRNKFKKVEMPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPVLNW------------------------------VVY
         T KK E D +SNDR+KFKKVEMPVF G+DP+SWLFRAERYFQIHKLT+SEK+ V+TI F+GP LNW                                +
Subjt:  TTEKKEEGDGDSNDRNKFKKVEMPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPVLNW------------------------------VVY

Query:  GRFLRIQQETSVEEYRNLFDKWVAPLSDIPEKIVEETFMGGLLPWIKVEMEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGAKVPNGSYSTAKTYS
        GRFLRIQQET+VEEYRNLFDK VAPLSD+ +++VEETFM GL PWI+ E+  C P GLAEMMR AQ+VE RE+LR  ANL GY G K    + +  K Y 
Subjt:  GRFLRIQQETSVEEYRNLFDKWVAPLSDIPEKIVEETFMGGLLPWIKVEMEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGAKVPNGSYSTAKTYS

Query:  GLKEQGNKENTVFPIRTITLRGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAKEIRELRMFVVRDDDVEEEIIEEDEYDLKELKAM
          + + NK N  FPIRTITL+   + E +KEG SKRL DAEFQ +REKGLCFKC+EKY + HKCK +E RELRMFVV+D++ E EI+EE E D  EL+ +
Subjt:  GLKEQGNKENTVFPIRTITLRGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAKEIRELRMFVVRDDDVEEEIIEEDEYDLKELKAM

Query:  ELQNDPGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGATHNFISDRLVRTLKIPTKDTANYGVILGSGTAIKGKGVCEKVELNLMGWTVMENF
        E++      VEL INSVVGL +PGTMK+RGT+Q KEVV+L+DCGATHNF+S++LV TL++P K+TA+YGVILGSGTAI+GKG+CE +E+ +  WTV E+F
Subjt:  ELQNDPGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGATHNFISDRLVRTLKIPTKDTANYGVILGSGTAIKGKGVCEKVELNLMGWTVMENF

Query:  LPLELGGVDLILGMQWLHSLGVTEMDWRNLTMSFFHDSKKVVLKGDPSLTKTQVSLKNFTKSWIETDTGYLIECRTLE---ACQAKIKTEENENEPESVL
        LPLELGGVD+ILGMQWL+SLGVT  DW+NLT++F+ D KK+ +KGDPSLTK +VSLKN  K+W E D GYLIECR++    A    +  EE     E +L
Subjt:  LPLELGGVDLILGMQWLHSLGVTEMDWRNLTMSFFHDSKKVVLKGDPSLTKTQVSLKNFTKSWIETDTGYLIECRTLE---ACQAKIKTEENENEPESVL

Query:  TVLTQYKDVFDGPKELPPSRDIEHHIHIREGADPVNVRPYRYAFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNITIPD
         +L Q+KD+F+ P++LPP R IEH IH++EG +PVNVRPYRYA+ QKEE+E+LV+EM+ASGIIRPS SPYSSPVLLVKKKDGSWRFCVDYRALNN+T+PD
Subjt:  TVLTQYKDVFDGPKELPPSRDIEHHIHIREGADPVNVRPYRYAFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNITIPD

Query:  KFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNTPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEH
        KFPIPVVEELFDEL GA+LF+KIDLKAGYHQ+RM   DIEKTAFRTHEGHYEFLVMPFGLTN PATFQSLMNSIFR YLR+FVLVFFDDIL+YSRNLE+H
Subjt:  KFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNTPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEH

Query:  CQHIELVLEVLREHKLFANRKKCCFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIPTNVREVRGFLGLTALW----IHSGT----LTQLLKLGAFKWN
         +HIE V  VLR+H+LFANRKKC F  ++VEYLGH++S +GVEVDPEKI+A+  WP PT+VRE RGFLGLT  +     H GT    LTQLLK G F WN
Subjt:  CQHIELVLEVLREHKLFANRKKCCFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIPTNVREVRGFLGLTALW----IHSGT----LTQLLKLGAFKWN

Query:  EEAHEAFEKLQRAMMTLPILALLDFNAPFEVETDASGYGIGAVLMQNKRPIAFYSHTLALRDRSKP
         EA +AFEKL++AM+ LP+LAL  F+ PFE+ETDASGYG+GAVL+QNKRPIAFYSHTLA+RDR +P
Subjt:  EEAHEAFEKLQRAMMTLPILALLDFNAPFEVETDASGYGIGAVLMQNKRPIAFYSHTLALRDRSKP

A0A5D3BEL2 Ty3/gypsy retrotransposon protein0.0e+0062.32Show/hide
Query:  MVQTRSEERRDTHEQ-------ELNKISVMEEKLTVMSQNMENLQARVEKTHQMVMIFMETMAKEQVLASGKGIDSSVQETWTGKTAEGESSASKETKNG
        MVQTR EER ++ EQ       EL K+ V+E  L  +++NME ++ + EK  Q ++ +ME  AKE+ +A  +  +S  Q + T K+   ++S+S++ +  
Subjt:  MVQTRSEERRDTHEQ-------ELNKISVMEEKLTVMSQNMENLQARVEKTHQMVMIFMETMAKEQVLASGKGIDSSVQETWTGKTAEGESSASKETKNG

Query:  TTEKKEEGDGDSNDRNKFKKVEMPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPVLNW------------------------------VVY
         T KK E D +SNDR+KFKKVEMPVF G+DP+SWLFRAERYFQIHKLT+SEK+ V+TI F+GP LNW                                +
Subjt:  TTEKKEEGDGDSNDRNKFKKVEMPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPVLNW------------------------------VVY

Query:  GRFLRIQQETSVEEYRNLFDKWVAPLSDIPEKIVEETFMGGLLPWIKVEMEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGAKVPNGSYSTAKTYS
        GRFLRIQQET+VEEYRNLFDK VAPLSD+ +++VEETFM GL PWI+ E+  C P GLAEMMR AQ+VE RE+LR  ANL GY G K    + +  K Y 
Subjt:  GRFLRIQQETSVEEYRNLFDKWVAPLSDIPEKIVEETFMGGLLPWIKVEMEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGAKVPNGSYSTAKTYS

Query:  GLKEQGNKENTVFPIRTITLRGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAKEIRELRMFVVRDDDVEEEIIEEDEYDLKELKAM
          + + NK N  FPIRTITL+   + E +KEG SKRL DAEFQ +REKGLCFKC+EKY + HKCK +E RELRMFVV+D++ E EI+EE E D  EL+ +
Subjt:  GLKEQGNKENTVFPIRTITLRGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAKEIRELRMFVVRDDDVEEEIIEEDEYDLKELKAM

Query:  ELQNDPGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGATHNFISDRLVRTLKIPTKDTANYGVILGSGTAIKGKGVCEKVELNLMGWTVMENF
        E++      VEL INSVVGL +PGTMK+RGT+Q KEVV+L+DCGATHNF+S++LV TL++P K+TA+YGVILGSGTAI+GKG+CE +E+ +  WTV E+F
Subjt:  ELQNDPGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGATHNFISDRLVRTLKIPTKDTANYGVILGSGTAIKGKGVCEKVELNLMGWTVMENF

Query:  LPLELGGVDLILGMQWLHSLGVTEMDWRNLTMSFFHDSKKVVLKGDPSLTKTQVSLKNFTKSWIETDTGYLIECRTLE---ACQAKIKTEENENEPESVL
        LPLELGGVD+ILGMQWL+SLGVT  DW+NLT++F+ D KK+ +KGDPSLTK +VSLKN  K+W E D GYLIECR++    A    +  EE     E +L
Subjt:  LPLELGGVDLILGMQWLHSLGVTEMDWRNLTMSFFHDSKKVVLKGDPSLTKTQVSLKNFTKSWIETDTGYLIECRTLE---ACQAKIKTEENENEPESVL

Query:  TVLTQYKDVFDGPKELPPSRDIEHHIHIREGADPVNVRPYRYAFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNITIPD
         +L Q+KD+F+ P++LPP R IEH IH++EG +PVNVRPYRYA+ QKEE+E+LV+EM+ASGIIRPS SPYSSPVLLVKKKDGSWRFCVDYRALNN+T+PD
Subjt:  TVLTQYKDVFDGPKELPPSRDIEHHIHIREGADPVNVRPYRYAFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNITIPD

Query:  KFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNTPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEH
        KFPIPVVEELFDEL GA+LF+KIDLKAGYHQ+RM   DIEKTAFRTHEGHYEFLVMPFGLTN PATFQSLMNSIFR YLR+FVLVFFDDIL+YSRNLE+H
Subjt:  KFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNTPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEH

Query:  CQHIELVLEVLREHKLFANRKKCCFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIPTNVREVRGFLGLTALW----IHSGT----LTQLLKLGAFKWN
         +HIE V  VLR+H+LFANRKKC F  ++VEYLGH++S +GVEVDPEKI+A+  WP PT+VRE RGFLGLT  +     H GT    LTQLLK G F WN
Subjt:  CQHIELVLEVLREHKLFANRKKCCFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIPTNVREVRGFLGLTALW----IHSGT----LTQLLKLGAFKWN

Query:  EEAHEAFEKLQRAMMTLPILALLDFNAPFEVETDASGYGIGAVLMQNKRPIAFYSHTLALRDRSKP
         EA +AFEKL++AM+ LP+LAL  F+ PFE+ETDASGYG+GAVL+QNKRPIAFYSHTLA+RDR +P
Subjt:  EEAHEAFEKLQRAMMTLPILALLDFNAPFEVETDASGYGIGAVLMQNKRPIAFYSHTLALRDRSKP

A0A5D3DFT1 Ty3/gypsy retrotransposon protein0.0e+0061.65Show/hide
Query:  MVQTRSEERRDTHEQ-------ELNKISVMEEKLTVMSQNMENLQARVEKTHQMVMIFMETMAKEQVLASGKGIDSSVQETWTGKTAEGESSASKETKNG
        MVQTR EER +  EQ       EL K+  +E  L  +++NME ++ + EK  Q ++ +ME  AKE+ +A  +  +S +Q +   K+  G++S+S +    
Subjt:  MVQTRSEERRDTHEQ-------ELNKISVMEEKLTVMSQNMENLQARVEKTHQMVMIFMETMAKEQVLASGKGIDSSVQETWTGKTAEGESSASKETKNG

Query:  TTEKKEEGDGDSNDRNKFKKVEMPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPVLNW------------------------------VVY
        + E+K + D ++NDR+KFKKVEMPVF G+DP+SWLFRAERYFQIHKLT+SEK+ V+TI F+GP LNW                               V 
Subjt:  TTEKKEEGDGDSNDRNKFKKVEMPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPVLNW------------------------------VVY

Query:  GRFLRIQQETSVEEYRNLFDKWVAPLSDIPEKIVEETFMGGLLPWIKVEMEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGAKVPNGSYSTAKTYS
        GRFLRIQQET+VEEYRN FDK VAPLSD+ +++VEETFM GL PWI+ E+  C P GLAE M  AQ+VE REILR  ANL  Y G K       +A T +
Subjt:  GRFLRIQQETSVEEYRNLFDKWVAPLSDIPEKIVEETFMGGLLPWIKVEMEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGAKVPNGSYSTAKTYS

Query:  GLKE---QGNKE---NTVFPIRTITLRGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAKEIRELRMFVVRDDDVEEEIIEEDEYDL
        G+K    Q NKE   N  FPIRTITL+     EI+KEG SKRL DAEFQ ++EKGLCFKC+EKY + HKCK KE RELRMFVV++D+ E EI+EE E + 
Subjt:  GLKE---QGNKE---NTVFPIRTITLRGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAKEIRELRMFVVRDDDVEEEIIEEDEYDL

Query:  KELKAMELQNDPGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGATHNFISDRLVRTLKIPTKDTANYGVILGSGTAIKGKGVCEKVELNLMGW
         E++  E+Q      VEL INSVVGL +PGTMK++G++Q KEVV+L+DCGATHNF+S+++V +L++P K+TA+YGVILGSGTAI+GKG+CE VE+ +  W
Subjt:  KELKAMELQNDPGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGATHNFISDRLVRTLKIPTKDTANYGVILGSGTAIKGKGVCEKVELNLMGW

Query:  TVMENFLPLELGGVDLILGMQWLHSLGVTEMDWRNLTMSFFHDSKKVVLKGDPSLTKTQVSLKNFTKSWIETDTGYLIECRTLEACQAKIK-TEENENEP
        TV E+FLPLELGGVD+ILGMQWL+SLGVT  DW+NLT++F+ + K++ +KGDPSLTK +VSLKN  K+W E D GYLIECR++E  + K    EE E   
Subjt:  TVMENFLPLELGGVDLILGMQWLHSLGVTEMDWRNLTMSFFHDSKKVVLKGDPSLTKTQVSLKNFTKSWIETDTGYLIECRTLEACQAKIK-TEENENEP

Query:  ESVLTVLTQYKDVFDGPKELPPSRDIEHHIHIREGADPVNVRPYRYAFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNI
        + ++ +L Q+ DVF+ P++LPP R IEH IH++EG +PVNVRPYRYA+ QKEE+EKLV+EM+ SGIIRPS SPYSSPVLLVKKKDGSWRFCVDYRALNN+
Subjt:  ESVLTVLTQYKDVFDGPKELPPSRDIEHHIHIREGADPVNVRPYRYAFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNI

Query:  TIPDKFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNTPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRN
        T+PDKFPIPVVEELFDEL GA+LF+KIDLKAGYHQ+RM   DIEKTAFRTHEGHYEFLVMPFGLTN PATFQSLMNSIFR YLRKFVLVFFDDIL+YSRN
Subjt:  TIPDKFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNTPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRN

Query:  LEEHCQHIELVLEVLREHKLFANRKKCCFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIPTNVREVRGFLGLTALW----IHSGT----LTQLLKLGA
         E+H +H+E+V  VLR+H+LFANRKKC F  ++VEYLGH++S +GVEVDPEKI+A+  WP PTNVRE RGFLGLT  +     H GT    LTQLLK G 
Subjt:  LEEHCQHIELVLEVLREHKLFANRKKCCFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIPTNVREVRGFLGLTALW----IHSGT----LTQLLKLGA

Query:  FKWNEEAHEAFEKLQRAMMTLPILALLDFNAPFEVETDASGYGIGAVLMQNKRPIAFYSHTLALRDRSKP
        FKWN EA +AFEKL+ AM+ LPILAL  F+ PFE+ETDASGYGIGAVL+QNKRPIAFYSHTLA RDR +P
Subjt:  FKWNEEAHEAFEKLQRAMMTLPILALLDFNAPFEVETDASGYGIGAVLMQNKRPIAFYSHTLALRDRSKP

A0A5D3E1V9 Ty3/gypsy retrotransposon protein0.0e+0061.65Show/hide
Query:  MVQTRSEERRDTHEQ-------ELNKISVMEEKLTVMSQNMENLQARVEKTHQMVMIFMETMAKEQVLASGKGIDSSVQETWTGKTAEGESSASKETKNG
        MVQTR EER +  EQ       EL K+  +E  L  +++NME ++ + EK  Q ++ +ME  AKE+ +A  +  +S +Q +   K+  G++S+S +    
Subjt:  MVQTRSEERRDTHEQ-------ELNKISVMEEKLTVMSQNMENLQARVEKTHQMVMIFMETMAKEQVLASGKGIDSSVQETWTGKTAEGESSASKETKNG

Query:  TTEKKEEGDGDSNDRNKFKKVEMPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPVLNW------------------------------VVY
        + E+K + D ++NDR+KFKKVEMPVF G+DP+SWLFRAERYFQIHKLT+SEK+ V+TI F+GP LNW                               V 
Subjt:  TTEKKEEGDGDSNDRNKFKKVEMPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPVLNW------------------------------VVY

Query:  GRFLRIQQETSVEEYRNLFDKWVAPLSDIPEKIVEETFMGGLLPWIKVEMEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGAKVPNGSYSTAKTYS
        GRFLRIQQET+VEEYRN FDK VAPLSD+ +++VEETFM GL PWI+ E+  C P GLAE M  AQ+VE REILR  ANL  Y G K       +A T +
Subjt:  GRFLRIQQETSVEEYRNLFDKWVAPLSDIPEKIVEETFMGGLLPWIKVEMEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGAKVPNGSYSTAKTYS

Query:  GLKE---QGNKE---NTVFPIRTITLRGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAKEIRELRMFVVRDDDVEEEIIEEDEYDL
        G+K    Q NKE   N  FPIRTITL+     EI+KEG SKRL DAEFQ ++EKGLCFKC+EKY + HKCK KE RELRMFVV++D+ E EI+EE E + 
Subjt:  GLKE---QGNKE---NTVFPIRTITLRGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAKEIRELRMFVVRDDDVEEEIIEEDEYDL

Query:  KELKAMELQNDPGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGATHNFISDRLVRTLKIPTKDTANYGVILGSGTAIKGKGVCEKVELNLMGW
         E++  E+Q      VEL INSVVGL +PGTMK++G++Q KEVV+L+DCGATHNF+S+++V +L++P K+TA+YGVILGSGTAI+GKG+CE VE+ +  W
Subjt:  KELKAMELQNDPGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGATHNFISDRLVRTLKIPTKDTANYGVILGSGTAIKGKGVCEKVELNLMGW

Query:  TVMENFLPLELGGVDLILGMQWLHSLGVTEMDWRNLTMSFFHDSKKVVLKGDPSLTKTQVSLKNFTKSWIETDTGYLIECRTLEACQAKIK-TEENENEP
        TV E+FLPLELGGVD+ILGMQWL+SLGVT  DW+NLT++F+ + K++ +KGDPSLTK +VSLKN  K+W E D GYLIECR++E  + K    EE E   
Subjt:  TVMENFLPLELGGVDLILGMQWLHSLGVTEMDWRNLTMSFFHDSKKVVLKGDPSLTKTQVSLKNFTKSWIETDTGYLIECRTLEACQAKIK-TEENENEP

Query:  ESVLTVLTQYKDVFDGPKELPPSRDIEHHIHIREGADPVNVRPYRYAFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNI
        + ++ +L Q+ DVF+ P++LPP R IEH IH++EG +PVNVRPYRYA+ QKEE+EKLV+EM+ SGIIRPS SPYSSPVLLVKKKDGSWRFCVDYRALNN+
Subjt:  ESVLTVLTQYKDVFDGPKELPPSRDIEHHIHIREGADPVNVRPYRYAFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNI

Query:  TIPDKFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNTPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRN
        T+PDKFPIPVVEELFDEL GA+LF+KIDLKAGYHQ+RM   DIEKTAFRTHEGHYEFLVMPFGLTN PATFQSLMNSIFR YLRKFVLVFFDDIL+YSRN
Subjt:  TIPDKFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNTPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRN

Query:  LEEHCQHIELVLEVLREHKLFANRKKCCFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIPTNVREVRGFLGLTALW----IHSGT----LTQLLKLGA
         E+H +H+E+V  VLR+H+LFANRKKC F  ++VEYLGH++S +GVEVDPEKI+A+  WP PTNVRE RGFLGLT  +     H GT    LTQLLK G 
Subjt:  LEEHCQHIELVLEVLREHKLFANRKKCCFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIPTNVREVRGFLGLTALW----IHSGT----LTQLLKLGA

Query:  FKWNEEAHEAFEKLQRAMMTLPILALLDFNAPFEVETDASGYGIGAVLMQNKRPIAFYSHTLALRDRSKP
        FKWN EA +AFEKL+ AM+ LPILAL  F+ PFE+ETDASGYGIGAVL+QNKRPIAFYSHTLA RDR +P
Subjt:  FKWNEEAHEAFEKLQRAMMTLPILALLDFNAPFEVETDASGYGIGAVLMQNKRPIAFYSHTLALRDRSKP

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.65.0e-7133.33Show/hide
Query:  TIQSKE--VVVLVDCGATHNFISDRLVRTLKIPTKDTANYGVILGSGTAIKGKGVCEKVELNLMGWTVMENFLPLELGGVDLILGMQWLHSLGVTEMDWR
        TI+ KE  +  L+D G+T N  S  +     +P ++T+ + +   +G  I  K +   +   ++  T  E  L       DL+LG + L     T + +R
Subjt:  TIQSKE--VVVLVDCGATHNFISDRLVRTLKIPTKDTANYGVILGSGTAIKGKGVCEKVELNLMGWTVMENFLPLELGGVDLILGMQWLHSLGVTEMDWR

Query:  NLTMSFFHDSKKVVLKGDPSLTKTQVSLKNFTKSWIETDTGYLIECRTLEACQAKIKTEE--NENEPESVLTVLTQYKDV-FDGPKELPPSRDIEHHIHI
        +  ++ +++  K++   +   T  Q   +N     +  DT      +     ++ +   E  N  E + +  +L +Y D+ +    +L  +   +H I+ 
Subjt:  NLTMSFFHDSKKVVLKGDPSLTKTQVSLKNFTKSWIETDTGYLIECRTLEACQAKIKTEE--NENEPESVLTVLTQYKDV-FDGPKELPPSRDIEHHIHI

Query:  REGADPVNVRPYRYAFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLV-KKKDGS----WRFCVDYRALNNITIPDKFPIPVVEELFDELNGANLFSKI
        +      +   Y  A++Q  E+E  + +M+  GIIR S SPY+SP+ +V KK+D S    +R  +DYR LN IT+ D+ PIP ++E+  +L   N F+ I
Subjt:  REGADPVNVRPYRYAFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLV-KKKDGS----WRFCVDYRALNNITIPDKFPIPVVEELFDELNGANLFSKI

Query:  DLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNTPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQHIELVLEVLREHKLFANRKKC
        DL  G+HQ+ M  + + KTAF T  GHYE+L MPFGL N PATFQ  MN I R  L K  LV+ DDI+V+S +L+EH Q + LV E L +  L     KC
Subjt:  DLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNTPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQHIELVLEVLREHKLFANRKKC

Query:  CFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIPTNVREVRGFLGLTALWIH--------SGTLTQLLK--LGAFKWNEEAHEAFEKLQRAMMTLPILA
         F      +LGHVL+  G++ +PEKI A++++PIPT  +E++ FLGLT  +          +  +T+ LK  +     N E   AF+KL+  +   PIL 
Subjt:  CFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIPTNVREVRGFLGLTALWIH--------SGTLTQLLK--LGAFKWNEEAHEAFEKLQRAMMTLPILA

Query:  LLDFNAPFEVETDASGYGIGAVLMQNKRPIAFYSHTL
        + DF   F + TDAS   +GAVL Q+  P+++ S TL
Subjt:  LLDFNAPFEVETDASGYGIGAVLMQNKRPIAFYSHTL

P20825 Retrovirus-related Pol polyprotein from transposon 2971.0e-6834.57Show/hide
Query:  DLILGMQWLHSLGVTEMDWRNLTMSFFHDSKKVVLKGDPSLTKTQVSLKNFTKSWIETDTGYLIECRTLEACQAKIKTEENENEPESVLTVLTQYKDV-F
        D+++G + L +   + ++++N T++ F  + K++     S     + ++   +S   +D   +   + L+  Q ++    N+ E   +  +L +++++ +
Subjt:  DLILGMQWLHSLGVTEMDWRNLTMSFFHDSKKVVLKGDPSLTKTQVSLKNFTKSWIETDTGYLIECRTLEACQAKIKTEENENEPESVLTVLTQYKDV-F

Query:  DGPKELPPSRDIEHHIHIREGADPVNVRPYRYAFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKD-----GSWRFCVDYRALNNITIPDKFPIP
           ++L  +  I+H ++    + P+  + Y  A   + E+E  V EM+  G+IR S SPY+SP  +V KK        +R  +DYR LN ITIPD++PIP
Subjt:  DGPKELPPSRDIEHHIHIREGADPVNVRPYRYAFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKD-----GSWRFCVDYRALNNITIPDKFPIP

Query:  VVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNTPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQHIE
         ++E+  +L     F+ IDL  G+HQ+ M  + I KTAF T  GHYE+L MPFGL N PATFQ  MN+I R  L K  LV+ DDI+++S +L EH   I+
Subjt:  VVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNTPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQHIE

Query:  LVLEVLREHKLFANRKKCCFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIPTNVREVRGFLGLTALWIH------------SGTLTQLLKLGAFKWNE
        LV   L +  L     KC F      +LGH+++  G++ +P K++A+  +PIPT  +E+R FLGLT  +              +  L +  K+   K   
Subjt:  LVLEVLREHKLFANRKKCCFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIPTNVREVRGFLGLTALWIH------------SGTLTQLLKLGAFKWNE

Query:  EAHEAFEKLQRAMMTLPILALLDFNAPFEVETDASGYGIGAVLMQNKRPIAFYSHTL
        E  EAFEKL+  ++  PIL L DF   F + TDAS   +GAVL QN  PI+F S TL
Subjt:  EAHEAFEKLQRAMMTLPILALLDFNAPFEVETDASGYGIGAVLMQNKRPIAFYSHTL

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein2.4e-6534.99Show/hide
Query:  SWIETDTGYLIECRTLEACQAKIKTEENENEPESVLTVLTQYKDVFDGPKELPP------SRDIEHHIHIREGADPVNVRPYRYAFQQKEELEKLVDEMM
        S +E D  Y     T+++ +    T+ +  +    L V  Q K       +LPP      +  ++H I I+ GA    ++PY    + ++E+ K+V +++
Subjt:  SWIETDTGYLIECRTLEACQAKIKTEENENEPESVLTVLTQYKDVFDGPKELPP------SRDIEHHIHIREGADPVNVRPYRYAFQQKEELEKLVDEMM

Query:  ASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNITIPDKFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPF
         +  I PS SP SSPV+LV KKDG++R CVDYR LN  TI D FP+P ++ L   +  A +F+ +DL +GYHQ+ M  +D  KTAF T  G YE+ VMPF
Subjt:  ASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNITIPDKFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPF

Query:  GLTNTPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQHIELVLEVLREHKLFANRKKCCFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIP
        GL N P+TF   M   FR    +FV V+ DDIL++S + EEH +H++ VLE L+   L   +KKC FA    E+LG+ +  + +     K  A++ +P P
Subjt:  GLTNTPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQHIELVLEVLREHKLFANRKKCCFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIP

Query:  TNVREVRGFLGLTALW----IHSGTLTQLLKL---GAFKWNEEAHEAFEKLQRAMMTLPILALLDFNAPFEVETDASGYGIGAVL--MQNKRP----IAF
          V++ + FLG+   +     +   + Q ++L      +W E+  +A EKL+ A+   P+L   +  A + + TDAS  GIGAVL  + NK      + +
Subjt:  TNVREVRGFLGLTALW----IHSGTLTQLLKL---GAFKWNEEAHEAFEKLQRAMMTLPILALLDFNAPFEVETDASGYGIGAVL--MQNKRP----IAF

Query:  YSHTLALRDRSKPSNAGDPIYLG
        +S +L    ++ P  AG+   LG
Subjt:  YSHTLALRDRSKPSNAGDPIYLG

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.6e-6434.26Show/hide
Query:  IKTEENENEPESVLTVLTQYKDVFDGPKELPPSRDIEHHIHIREGA-DPVNVRPYRYAFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKK-----
        +  E  +   E + ++L ++  +F+ P     S +      IR    DP+  + Y Y    + E+E+ +DE++  GIIRPS SPY+SP+ +V KK     
Subjt:  IKTEENENEPESVLTVLTQYKDVFDGPKELPPSRDIEHHIHIREGA-DPVNVRPYRYAFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKK-----

Query:  DGSWRFCVDYRALNNITIPDKFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNTPATFQSLMNSIFRSYLR
        +  +R  VD++ LN +TIPD +PIP +      L  A  F+ +DL +G+HQ+ M   DI KTAF T  G YEFL +PFGL N PA FQ +++ I R ++ 
Subjt:  DGSWRFCVDYRALNNITIPDKFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNTPATFQSLMNSIFRSYLR

Query:  KFVLVFFDDILVYSRNLEEHCQHIELVLEVLREHKLFANRKKCCFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIPTNVREVRGFLGLTALW------
        K   V+ DDI+V+S + + H +++ LVL  L +  L  N +K  F  ++VE+LG++++  G++ DP+K+RA+ + P PT+V+E++ FLG+T+ +      
Subjt:  KFVLVFFDDILVYSRNLEEHCQHIELVLEVLREHKLFANRKKCCFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIPTNVREVRGFLGLTALW------

Query:  --------------IHSGTLTQLLKLGAFKWNEEAHEAFEKLQRAMMTLPILALLDFNAPFEVETDASGYGIGAVLMQN----KRPIAFYSHTL
                      +++   +          +E A ++F  L+  + +  ILA   F  PF + TDAS + IGAVL Q+     RPIA+ S +L
Subjt:  --------------IHSGTLTQLLKLGAFKWNEEAHEAFEKLQRAMMTLPILALLDFNAPFEVETDASGYGIGAVLMQN----KRPIAFYSHTL

Q99315 Transposon Ty3-G Gag-Pol polyprotein7.0e-6534.75Show/hide
Query:  SWIETDTGYLIECRTLEACQAKIKTEENENEPESVLTVLTQYKDVFDGPKELPP------SRDIEHHIHIREGADPVNVRPYRYAFQQKEELEKLVDEMM
        S +E D  Y     T+++ +    T+ +  +    L V  Q K       +LPP      +  ++H I I+ GA    ++PY    + ++E+ K+V +++
Subjt:  SWIETDTGYLIECRTLEACQAKIKTEENENEPESVLTVLTQYKDVFDGPKELPP------SRDIEHHIHIREGADPVNVRPYRYAFQQKEELEKLVDEMM

Query:  ASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNITIPDKFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPF
         +  I PS SP SSPV+LV KKDG++R CVDYR LN  TI D FP+P ++ L   +  A +F+ +DL +GYHQ+ M  +D  KTAF T  G YE+ VMPF
Subjt:  ASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYRALNNITIPDKFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPF

Query:  GLTNTPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQHIELVLEVLREHKLFANRKKCCFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIP
        GL N P+TF   M   FR    +FV V+ DDIL++S + EEH +H++ VLE L+   L   +KKC FA    E+LG+ +  + +     K  A++ +P P
Subjt:  GLTNTPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEHCQHIELVLEVLREHKLFANRKKCCFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIP

Query:  TNVREVRGFLGLTALW----IHSGTLTQLLKL---GAFKWNEEAHEAFEKLQRAMMTLPILALLDFNAPFEVETDASGYGIGAVL--MQNKRP----IAF
          V++ + FLG+   +     +   + Q ++L      +W E+  +A +KL+ A+   P+L   +  A + + TDAS  GIGAVL  + NK      + +
Subjt:  TNVREVRGFLGLTALW----IHSGTLTQLLKL---GAFKWNEEAHEAFEKLQRAMMTLPILALLDFNAPFEVETDASGYGIGAVL--MQNKRP----IAF

Query:  YSHTLALRDRSKPSNAGDPIYLG
        +S +L    ++ P  AG+   LG
Subjt:  YSHTLALRDRSKPSNAGDPIYLG

Arabidopsis top hitse value%identityAlignment
AT1G67020.1 unknown protein2.6e-0631.48Show/hide
Query:  KTAEGESSASKETKNGTTEKKEEGDGDS-----NDRNKFKKVEMPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPVLNWVVYGRFLRIQQE
        +  +GE S      + ++   + G G+S     N  +  +++EMPVFDG     W  + ER+F++ +  DS+KL +  +S EG  L W     FLR   E
Subjt:  KTAEGESSASKETKNGTTEKKEEGDGDS-----NDRNKFKKVEMPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPVLNWVVYGRFLRIQQE

Query:  TSVEEYRN
         S  E+R+
Subjt:  TSVEEYRN

AT3G29750.1 Eukaryotic aspartyl protease family protein1.7e-1828.97Show/hide
Query:  YSGLKEQGNKEN-----TVFPIRTITLRGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAKEIRELRMFVVRDDDVEEEIIEEDEYD
        YSG++++G+  +         +R++TL G   +E+  +G    L  A  + K   G+         + ++ +  E+  L +   + D     ++++ +  
Subjt:  YSGLKEQGNKEN-----TVFPIRTITLRGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKYYSGHKCKAKEIRELRMFVVRDDDVEEEIIEEDEYD

Query:  LKELKAMELQNDPGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGATHNFISDRLVRTLKIPTKDTANYGVILGSGTAIKGKGVCEKVELNLMG
        + EL+  EL+ D   + +     V+ LT    M+  G I   +VVV +D GAT NFI   L  +LK+PT  T    V+LG    I+  G C  + L +  
Subjt:  LKELKAMELQNDPGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGATHNFISDRLVRTLKIPTKDTANYGVILGSGTAIKGKGVCEKVELNLMG

Query:  WTVMENFLPLELG--GVDLILGMQWLHSLGVTEMDWRNLTMSFFHDSKKVVL
          + ENFL L+L    VD+ILG +WL  LG T ++W+N   SF H+ + + L
Subjt:  WTVMENFLPLELG--GVDLILGMQWLHSLGVTEMDWRNLTMSFFHDSKKVVL

AT3G30770.1 Eukaryotic aspartyl protease family protein8.9e-1538.46Show/hide
Query:  SVVGLTNPGTMKIRGTIQSKEVVVLVDCGATHNFISDRLVRTLKIPTKDTANYGVILGSGTAIKGKGVCEKVELNLMGWTVMENFLPLEL--GGVDLILG
        S    T    M+  G I   +VVV++D GAT+NFISD L   LK+PT  T    V+LG    I+  G C  + L +    + ENFL L+L    VD+ILG
Subjt:  SVVGLTNPGTMKIRGTIQSKEVVVLVDCGATHNFISDRLVRTLKIPTKDTANYGVILGSGTAIKGKGVCEKVELNLMGWTVMENFLPLEL--GGVDLILG

Query:  MQWLHSLGVTEMDWRNLTMSFFHDSKKVVL
             +L    + W N   SFFH+ + V L
Subjt:  MQWLHSLGVTEMDWRNLTMSFFHDSKKVVL

ATMG00850.1 DNA/RNA polymerases superfamily protein9.9e-0656.41Show/hide
Query:  QKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSW
        ++  L+  + EM+ + II+PS SPYSSPVLLV+KKDG W
Subjt:  QKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSW

ATMG00860.1 DNA/RNA polymerases superfamily protein4.3e-2544.19Show/hide
Query:  HIELVLEVLREHKLFANRKKCCFAGSRVEYLG--HVLSGRGVEVDPEKIRAVKQWPIPTNVREVRGFLGLTALWIH--------SGTLTQLLKLGAFKWN
        H+ +VL++  +H+ +ANRKKC F   ++ YLG  H++SG GV  DP K+ A+  WP P N  E+RGFLGLT  +             LT+LLK  + KW 
Subjt:  HIELVLEVLREHKLFANRKKCCFAGSRVEYLG--HVLSGRGVEVDPEKIRAVKQWPIPTNVREVRGFLGLTALWIH--------SGTLTQLLKLGAFKWN

Query:  EEAHEAFEKLQRAMMTLPILALLDFNAPF
        E A  AF+ L+ A+ TLP+LAL D   PF
Subjt:  EEAHEAFEKLQRAMMTLPILALLDFNAPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGTGGTGGCTGTGGGATTCTTCTCTTCGGTTTAACATTTCGACACTTTCCATCAATCTATCAAATTTGTTGTGGAGATCTCCCAAAGATTCCTCAACAGCCAGCAA
GCGAACTGTTGATGTCTTTGGAGAGAGGACGGTGATTTCCTCCGCCGCTTCTTGCACACGGTTTTCCCCCGTGGCAGGGTTGTTAACTCTTCAGCCAGCCATTCCGTCCA
AGAATCGGGGCTGCTCTGCTACCAATTTGATGATAGCAGGAGAGGGAGAGGAATCCATCGGAGAGGCTCGAAGCACTTCGCTGGAATCCCATCACCTTGGTATCAGAGCT
CGTCGATCCGAAAAGAAGGGGGAAGCCATGGTTCAAACCAGAAGCGAAGAGAGAAGGGACACACACGAACAAGAACTCAACAAGATTTCGGTGATGGAGGAGAAACTTAC
GGTAATGTCGCAGAACATGGAAAACTTGCAGGCCCGAGTGGAGAAAACACACCAGATGGTAATGATATTCATGGAGACGATGGCCAAGGAACAGGTATTAGCTAGCGGAA
AAGGAATCGATTCGTCAGTACAAGAAACATGGACGGGAAAAACGGCGGAGGGAGAGAGTTCGGCAAGTAAAGAAACGAAAAATGGGACGACGGAAAAGAAGGAAGAGGGC
GACGGGGATAGCAACGATCGAAACAAATTCAAAAAAGTTGAAATGCCGGTATTCGATGGAGATGACCCAGATTCATGGCTGTTCCGCGCAGAGAGATACTTTCAAATACA
CAAACTCACGGATTCTGAAAAACTTACGGTTGCCACAATCAGCTTTGAAGGGCCCGTACTCAACTGGGTCGTTTATGGCCGGTTCTTACGAATTCAGCAGGAAACAAGCG
TGGAGGAATATAGGAATCTCTTCGATAAGTGGGTGGCTCCATTATCGGATATTCCTGAAAAGATTGTGGAAGAGACATTCATGGGAGGGCTGTTACCGTGGATTAAGGTG
GAGATGGAATTCTGCAATCCCGTGGGATTAGCTGAGATGATGAGATATGCGCAAATGGTGGAACAGCGAGAGATCCTGAGGAGGGAAGCAAATTTACCCGGATATTCTGG
AGCGAAGGTCCCAAATGGCAGCTATTCTACAGCCAAAACGTATTCCGGTCTAAAGGAACAAGGGAATAAGGAGAACACAGTATTTCCAATACGAACGATTACACTGAGAG
GATCGCCGGCGAAAGAGATTAAGAAAGAAGGACCCTCTAAACGGCTCTCAGATGCGGAATTCCAGGCCAAGAGAGAGAAGGGACTCTGTTTCAAATGCGATGAGAAGTAT
TACTCCGGGCACAAATGCAAGGCGAAGGAAATACGTGAGTTACGGATGTTTGTGGTCAGAGATGACGACGTAGAGGAAGAAATCATTGAGGAAGATGAGTATGATCTAAA
AGAATTGAAAGCCATGGAGCTGCAGAACGATCCGGGAGAAGTAGTAGAGTTATGTATTAACTCAGTAGTGGGATTGACGAATCCGGGGACCATGAAGATAAGAGGCACAA
TCCAAAGCAAAGAAGTTGTCGTGCTAGTAGACTGCGGAGCCACCCACAACTTCATATCCGACCGGCTAGTAAGGACACTGAAAATACCCACAAAAGACACTGCCAACTAT
GGAGTAATACTGGGGTCAGGAACAGCCATCAAAGGCAAAGGGGTGTGTGAAAAAGTAGAGTTGAATCTCATGGGGTGGACAGTCATGGAGAATTTCCTACCACTGGAACT
GGGAGGGGTAGACTTGATACTGGGAATGCAATGGTTACATTCCTTGGGAGTGACGGAGATGGACTGGAGGAATTTAACCATGTCCTTTTTCCATGACAGCAAAAAAGTGG
TGCTAAAAGGAGATCCAAGCTTAACTAAAACTCAAGTGAGCCTAAAAAATTTCACCAAATCATGGATTGAAACGGACACGGGATATTTGATTGAGTGCAGAACACTGGAA
GCATGCCAAGCTAAGATAAAAACTGAAGAGAACGAGAACGAACCTGAGAGCGTCTTGACAGTCCTGACACAGTATAAGGATGTTTTCGATGGGCCTAAGGAATTGCCTCC
CAGCAGGGATATCGAACATCATATACACATAAGGGAAGGAGCAGACCCGGTGAACGTCCGGCCCTATCGGTACGCATTCCAGCAGAAAGAAGAACTGGAAAAACTGGTGG
ACGAAATGATGGCTTCAGGAATCATCCGCCCCAGCACAAGCCCCTACTCAAGTCCCGTTCTCTTGGTCAAGAAAAAGGATGGGAGTTGGCGATTCTGCGTGGATTATAGG
GCGCTCAATAACATAACGATTCCAGATAAGTTCCCTATCCCAGTTGTGGAAGAACTGTTTGATGAGTTAAATGGTGCCAACTTATTCTCCAAAATAGACCTGAAAGCCGG
TTATCATCAACTGAGAATGTGTAGTCAAGATATAGAGAAAACGGCCTTTAGAACTCATGAGGGACATTACGAGTTCTTAGTGATGCCGTTTGGACTCACAAACACACCAG
CAACCTTCCAATCACTAATGAACTCGATTTTTAGGTCGTATCTGAGGAAGTTTGTCTTAGTTTTCTTTGATGATATACTGGTTTATAGCAGGAACCTAGAGGAACATTGT
CAACATATTGAACTAGTGCTGGAAGTATTAAGGGAGCATAAGTTATTTGCTAACCGAAAGAAATGCTGCTTTGCGGGTTCAAGGGTGGAGTATCTGGGACATGTATTGTC
GGGAAGAGGAGTAGAAGTCGACCCTGAAAAAATTAGAGCAGTCAAGCAATGGCCAATACCAACAAATGTTCGGGAGGTTAGAGGGTTTTTGGGGCTGACGGCATTATGGA
TCCATAGCGGCACCCTGACTCAGCTGCTGAAGCTCGGAGCATTTAAATGGAATGAAGAAGCACATGAGGCGTTTGAGAAGCTTCAACGAGCAATGATGACCCTACCCATA
TTAGCTCTGCTCGATTTCAATGCGCCCTTCGAAGTAGAGACAGACGCGTCAGGCTATGGGATTGGGGCGGTACTAATGCAGAACAAAAGACCCATAGCGTTTTATAGCCA
CACACTAGCGCTGCGCGACCGATCCAAACCATCCAACGCTGGCGACCCTATCTACTTGGGAGGACGTTCATAG
mRNA sequenceShow/hide mRNA sequence
ATGTGGTGGTGGCTGTGGGATTCTTCTCTTCGGTTTAACATTTCGACACTTTCCATCAATCTATCAAATTTGTTGTGGAGATCTCCCAAAGATTCCTCAACAGCCAGCAA
GCGAACTGTTGATGTCTTTGGAGAGAGGACGGTGATTTCCTCCGCCGCTTCTTGCACACGGTTTTCCCCCGTGGCAGGGTTGTTAACTCTTCAGCCAGCCATTCCGTCCA
AGAATCGGGGCTGCTCTGCTACCAATTTGATGATAGCAGGAGAGGGAGAGGAATCCATCGGAGAGGCTCGAAGCACTTCGCTGGAATCCCATCACCTTGGTATCAGAGCT
CGTCGATCCGAAAAGAAGGGGGAAGCCATGGTTCAAACCAGAAGCGAAGAGAGAAGGGACACACACGAACAAGAACTCAACAAGATTTCGGTGATGGAGGAGAAACTTAC
GGTAATGTCGCAGAACATGGAAAACTTGCAGGCCCGAGTGGAGAAAACACACCAGATGGTAATGATATTCATGGAGACGATGGCCAAGGAACAGGTATTAGCTAGCGGAA
AAGGAATCGATTCGTCAGTACAAGAAACATGGACGGGAAAAACGGCGGAGGGAGAGAGTTCGGCAAGTAAAGAAACGAAAAATGGGACGACGGAAAAGAAGGAAGAGGGC
GACGGGGATAGCAACGATCGAAACAAATTCAAAAAAGTTGAAATGCCGGTATTCGATGGAGATGACCCAGATTCATGGCTGTTCCGCGCAGAGAGATACTTTCAAATACA
CAAACTCACGGATTCTGAAAAACTTACGGTTGCCACAATCAGCTTTGAAGGGCCCGTACTCAACTGGGTCGTTTATGGCCGGTTCTTACGAATTCAGCAGGAAACAAGCG
TGGAGGAATATAGGAATCTCTTCGATAAGTGGGTGGCTCCATTATCGGATATTCCTGAAAAGATTGTGGAAGAGACATTCATGGGAGGGCTGTTACCGTGGATTAAGGTG
GAGATGGAATTCTGCAATCCCGTGGGATTAGCTGAGATGATGAGATATGCGCAAATGGTGGAACAGCGAGAGATCCTGAGGAGGGAAGCAAATTTACCCGGATATTCTGG
AGCGAAGGTCCCAAATGGCAGCTATTCTACAGCCAAAACGTATTCCGGTCTAAAGGAACAAGGGAATAAGGAGAACACAGTATTTCCAATACGAACGATTACACTGAGAG
GATCGCCGGCGAAAGAGATTAAGAAAGAAGGACCCTCTAAACGGCTCTCAGATGCGGAATTCCAGGCCAAGAGAGAGAAGGGACTCTGTTTCAAATGCGATGAGAAGTAT
TACTCCGGGCACAAATGCAAGGCGAAGGAAATACGTGAGTTACGGATGTTTGTGGTCAGAGATGACGACGTAGAGGAAGAAATCATTGAGGAAGATGAGTATGATCTAAA
AGAATTGAAAGCCATGGAGCTGCAGAACGATCCGGGAGAAGTAGTAGAGTTATGTATTAACTCAGTAGTGGGATTGACGAATCCGGGGACCATGAAGATAAGAGGCACAA
TCCAAAGCAAAGAAGTTGTCGTGCTAGTAGACTGCGGAGCCACCCACAACTTCATATCCGACCGGCTAGTAAGGACACTGAAAATACCCACAAAAGACACTGCCAACTAT
GGAGTAATACTGGGGTCAGGAACAGCCATCAAAGGCAAAGGGGTGTGTGAAAAAGTAGAGTTGAATCTCATGGGGTGGACAGTCATGGAGAATTTCCTACCACTGGAACT
GGGAGGGGTAGACTTGATACTGGGAATGCAATGGTTACATTCCTTGGGAGTGACGGAGATGGACTGGAGGAATTTAACCATGTCCTTTTTCCATGACAGCAAAAAAGTGG
TGCTAAAAGGAGATCCAAGCTTAACTAAAACTCAAGTGAGCCTAAAAAATTTCACCAAATCATGGATTGAAACGGACACGGGATATTTGATTGAGTGCAGAACACTGGAA
GCATGCCAAGCTAAGATAAAAACTGAAGAGAACGAGAACGAACCTGAGAGCGTCTTGACAGTCCTGACACAGTATAAGGATGTTTTCGATGGGCCTAAGGAATTGCCTCC
CAGCAGGGATATCGAACATCATATACACATAAGGGAAGGAGCAGACCCGGTGAACGTCCGGCCCTATCGGTACGCATTCCAGCAGAAAGAAGAACTGGAAAAACTGGTGG
ACGAAATGATGGCTTCAGGAATCATCCGCCCCAGCACAAGCCCCTACTCAAGTCCCGTTCTCTTGGTCAAGAAAAAGGATGGGAGTTGGCGATTCTGCGTGGATTATAGG
GCGCTCAATAACATAACGATTCCAGATAAGTTCCCTATCCCAGTTGTGGAAGAACTGTTTGATGAGTTAAATGGTGCCAACTTATTCTCCAAAATAGACCTGAAAGCCGG
TTATCATCAACTGAGAATGTGTAGTCAAGATATAGAGAAAACGGCCTTTAGAACTCATGAGGGACATTACGAGTTCTTAGTGATGCCGTTTGGACTCACAAACACACCAG
CAACCTTCCAATCACTAATGAACTCGATTTTTAGGTCGTATCTGAGGAAGTTTGTCTTAGTTTTCTTTGATGATATACTGGTTTATAGCAGGAACCTAGAGGAACATTGT
CAACATATTGAACTAGTGCTGGAAGTATTAAGGGAGCATAAGTTATTTGCTAACCGAAAGAAATGCTGCTTTGCGGGTTCAAGGGTGGAGTATCTGGGACATGTATTGTC
GGGAAGAGGAGTAGAAGTCGACCCTGAAAAAATTAGAGCAGTCAAGCAATGGCCAATACCAACAAATGTTCGGGAGGTTAGAGGGTTTTTGGGGCTGACGGCATTATGGA
TCCATAGCGGCACCCTGACTCAGCTGCTGAAGCTCGGAGCATTTAAATGGAATGAAGAAGCACATGAGGCGTTTGAGAAGCTTCAACGAGCAATGATGACCCTACCCATA
TTAGCTCTGCTCGATTTCAATGCGCCCTTCGAAGTAGAGACAGACGCGTCAGGCTATGGGATTGGGGCGGTACTAATGCAGAACAAAAGACCCATAGCGTTTTATAGCCA
CACACTAGCGCTGCGCGACCGATCCAAACCATCCAACGCTGGCGACCCTATCTACTTGGGAGGACGTTCATAG
Protein sequenceShow/hide protein sequence
MWWWLWDSSLRFNISTLSINLSNLLWRSPKDSSTASKRTVDVFGERTVISSAASCTRFSPVAGLLTLQPAIPSKNRGCSATNLMIAGEGEESIGEARSTSLESHHLGIRA
RRSEKKGEAMVQTRSEERRDTHEQELNKISVMEEKLTVMSQNMENLQARVEKTHQMVMIFMETMAKEQVLASGKGIDSSVQETWTGKTAEGESSASKETKNGTTEKKEEG
DGDSNDRNKFKKVEMPVFDGDDPDSWLFRAERYFQIHKLTDSEKLTVATISFEGPVLNWVVYGRFLRIQQETSVEEYRNLFDKWVAPLSDIPEKIVEETFMGGLLPWIKV
EMEFCNPVGLAEMMRYAQMVEQREILRREANLPGYSGAKVPNGSYSTAKTYSGLKEQGNKENTVFPIRTITLRGSPAKEIKKEGPSKRLSDAEFQAKREKGLCFKCDEKY
YSGHKCKAKEIRELRMFVVRDDDVEEEIIEEDEYDLKELKAMELQNDPGEVVELCINSVVGLTNPGTMKIRGTIQSKEVVVLVDCGATHNFISDRLVRTLKIPTKDTANY
GVILGSGTAIKGKGVCEKVELNLMGWTVMENFLPLELGGVDLILGMQWLHSLGVTEMDWRNLTMSFFHDSKKVVLKGDPSLTKTQVSLKNFTKSWIETDTGYLIECRTLE
ACQAKIKTEENENEPESVLTVLTQYKDVFDGPKELPPSRDIEHHIHIREGADPVNVRPYRYAFQQKEELEKLVDEMMASGIIRPSTSPYSSPVLLVKKKDGSWRFCVDYR
ALNNITIPDKFPIPVVEELFDELNGANLFSKIDLKAGYHQLRMCSQDIEKTAFRTHEGHYEFLVMPFGLTNTPATFQSLMNSIFRSYLRKFVLVFFDDILVYSRNLEEHC
QHIELVLEVLREHKLFANRKKCCFAGSRVEYLGHVLSGRGVEVDPEKIRAVKQWPIPTNVREVRGFLGLTALWIHSGTLTQLLKLGAFKWNEEAHEAFEKLQRAMMTLPI
LALLDFNAPFEVETDASGYGIGAVLMQNKRPIAFYSHTLALRDRSKPSNAGDPIYLGGRS