; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G33050 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G33050
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr1:27931912..27937393
RNA-Seq ExpressionCSPI01G33050
SyntenyCSPI01G33050
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0062868.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0056.08Show/hide
Query:  MYFKNLGMERQNQMMIRFMESAAQERSTMNEKIIELSMRSTLMKSNSEGEGSSSRESIPKNEEKKTNEGNNGDRNKFKKVEMPIFNGDDPDSWLFRVE--
        M    L  E+Q Q ++ +ME  A+ERS   E++ E   +++    +   + SSSR+    N +K   + N+ DR+KFKKVEMP+F G+DP+SWLFR E  
Subjt:  MYFKNLGMERQNQMMIRFMESAAQERSTMNEKIIELSMRSTLMKSNSEGEGSSSRESIPKNEEKKTNEGNNGDRNKFKKVEMPIFNGDDPDSWLFRVE--

Query:  -----------------------------------------------------REGSLYGRFLRIQQTTTVEEYRNLFDKLVAPLSDLPE----------
                                                             REG+ +GRFLRIQQ TTVEEYRNLFDKLVAPL D+ +          
Subjt:  -----------------------------------------------------REGSLYGRFLRIQQTTTVEEYRNLFDKLVAPLSDLPE----------

Query:  --------------KGLSQMMRIAQKVENKEDIRREANLPGYSGGKLSS-SGSNTKVNTTVNSGDNKGSMNWPMRTITLRGTSNEEVRKEGPTKQLSDAE
                      KGL++MMR AQ VE++E +R  ANL GY GGK S+ + + TK      + +NK +  +P+RTITL+  ++ E RKEG +K+L DAE
Subjt:  --------------KGLSQMMRIAQKVENKEDIRREANLPGYSGGKLSS-SGSNTKVNTTVNSGDNKGSMNWPMRTITLRGTSNEEVRKEGPTKQLSDAE

Query:  FQSRREKGLCFRCNEKYSHDHKCKTKEHRELCMLVVMGENVEYEIIEEDSTEQKELNTIEVTGEEQTMVELSINSVVGLSNPGRMKVKGKLQGREVIVLI
        FQ RREKGLCF+CNEKYS DHKCK +E REL M VV   N E EI+EE  T+  EL T+EV  +    VELSINSVVGL++PG MKV+G LQG+EV++LI
Subjt:  FQSRREKGLCFRCNEKYSHDHKCKTKEHRELCMLVVMGENVEYEIIEEDSTEQKELNTIEVTGEEQTMVELSINSVVGLSNPGRMKVKGKLQGREVIVLI

Query:  DCGATHNFILEGLVKELQINTKITSNYGVILGSGTTIKGKEVCEAIEIMLGDWRVVDEFLPLELGGVDALLGMQWLYSLGITEVDWKNLNLTFTHQGKKV
        DCGATHNF+ E LV  LQ+  K T++YGVILGSGT I+GK +CE+IE+ + DW V ++FLPLELGGVD +LGMQWLYSLG+T  DWKNL LTF    KK+
Subjt:  DCGATHNFILEGLVKELQINTKITSNYGVILGSGTTIKGKEVCEAIEIMLGDWRVVDEFLPLELGGVDALLGMQWLYSLGITEVDWKNLNLTFTHQGKKV

Query:  VIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRALERRESLEEEDSFDEVLTIEESVEVVFKRFEDVFEWPETLSPRRLIEHHIDLKKGTDPVNVRPYR
         I+GDPSLTKARVSLKNL+K+W E D G+L+ECR++    +       +E   IEE +  +  +F+D+FEWPE L PRR IEH I LK+GT+PVNVRPYR
Subjt:  VIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRALERRESLEEEDSFDEVLTIEESVEVVFKRFEDVFEWPETLSPRRLIEHHIDLKKGTDPVNVRPYR

Query:  YAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKDGSWRFCVDYMALNNVTVLDKFPIPVIEELFDELHGATMFTKIDLKSGYHQIRMCADDIEK
        YAY QK EMERLV EMLASG+I+PS SPYSSPVL+V+KKDGSWRFCVDY ALNNVTV DKFPIPV+EELFDEL GA++FTKIDLK+GYHQIRM   DIEK
Subjt:  YAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKDGSWRFCVDYMALNNVTVLDKFPIPVIEELFDELHGATMFTKIDLKSGYHQIRMCADDIEK

Query:  TAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRPYLQ-------------NKGLEDHLNHMKSVLEVLRKNELYAN-KKCSFAQFRVDYLGHIISGDG
        TAFRTHEGHYEF+VMPFGLTNAP+TFQSLMN+IFRPYL+             ++ LEDHL H+++V  VLRK+EL+AN KKCSF   +V+YLGH+IS  G
Subjt:  TAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRPYLQ-------------NKGLEDHLNHMKSVLEVLRKNELYAN-KKCSFAQFRVDYLGHIISGDG

Query:  VEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLKIGGFKWTEETQGAFNRLQQAMMSLPVLALLDFSVPFEIETDASGYGLG
        VEVD EKI+AI +WP P +VRE RGFLGLTGYYRKFV +YGT+AAPLTQLLK GGF W  E + AF +L++AM++LPVLAL  F  PFEIETDASGYG+G
Subjt:  VEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLKIGGFKWTEETQGAFNRLQQAMMSLPVLALLDFSVPFEIETDASGYGLG

Query:  AVLVQNKRPIAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQ-----------------------------
        AVL+QNKRPIA+Y+HTLAIR R +PVYERELMAVV+ VQRWRPYLLG +F+V+TDQ+SLKFLLEQRV+QPQ                             
Subjt:  AVLVQNKRPIAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQ-----------------------------

Query:  --VPATVHLNQLTAPKVIDIEVIKAEVDQDEKLKNI
          +  TV +  +T P  +D+++IK EV++D KL  I
Subjt:  --VPATVHLNQLTAPKVIDIEVIKAEVDQDEKLKNI

KAA0068193.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0056.18Show/hide
Query:  MYFKNLGMERQNQMMIRFMESAAQERSTMNEKIIELSMRSTLMKSNSEGEGSSSRESIPKNEEKKTNEGNNGDRNKFKKVEMPIFNGDDPDSWLFRVE--
        M    L  E+Q Q ++ +ME  A+ERS   E++ E   +++    +   + SSSR+    N +K   + N+ DR+KFKKVEMP+F G+DP+SWLFR E  
Subjt:  MYFKNLGMERQNQMMIRFMESAAQERSTMNEKIIELSMRSTLMKSNSEGEGSSSRESIPKNEEKKTNEGNNGDRNKFKKVEMPIFNGDDPDSWLFRVE--

Query:  -----------------------------------------------------REGSLYGRFLRIQQTTTVEEYRNLFDKLVAPLSDLPE----------
                                                             REG+ +GRFLRIQQ TTVEEYRNLFDKLVAPLSD+ +          
Subjt:  -----------------------------------------------------REGSLYGRFLRIQQTTTVEEYRNLFDKLVAPLSDLPE----------

Query:  --------------KGLSQMMRIAQKVENKEDIRREANLPGYSGGKLSS-SGSNTKVNTTVNSGDNKGSMNWPMRTITLRGTSNEEVRKEGPTKQLSDAE
                      KGL++MMR AQ VE++E +R  ANL GY GGK S+ + + TK      + +NK +  +P+RTITL+  ++ E RKEG +K+L DAE
Subjt:  --------------KGLSQMMRIAQKVENKEDIRREANLPGYSGGKLSS-SGSNTKVNTTVNSGDNKGSMNWPMRTITLRGTSNEEVRKEGPTKQLSDAE

Query:  FQSRREKGLCFRCNEKYSHDHKCKTKEHRELCMLVVMGENVEYEIIEEDSTEQKELNTIEVTGEEQTMVELSINSVVGLSNPGRMKVKGKLQGREVIVLI
        FQ RREKGLCF+CNEKYS DHKCK +E REL M VV   N E EI+EE  T+  EL T+EV  +    VELSINSVVGL++PG MKV+G LQG+EV++LI
Subjt:  FQSRREKGLCFRCNEKYSHDHKCKTKEHRELCMLVVMGENVEYEIIEEDSTEQKELNTIEVTGEEQTMVELSINSVVGLSNPGRMKVKGKLQGREVIVLI

Query:  DCGATHNFILEGLVKELQINTKITSNYGVILGSGTTIKGKEVCEAIEIMLGDWRVVDEFLPLELGGVDALLGMQWLYSLGITEVDWKNLNLTFTHQGKKV
        DCGATHNF+ E LV  LQ+  K T++YGVILGSGT I+GK +CE+IE+ + DW V ++FLPLELGGVD +LGMQWLYSLG+T  DWKNL LTF    KK+
Subjt:  DCGATHNFILEGLVKELQINTKITSNYGVILGSGTTIKGKEVCEAIEIMLGDWRVVDEFLPLELGGVDALLGMQWLYSLGITEVDWKNLNLTFTHQGKKV

Query:  VIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRALERRESLEEEDSFDEVLTIEESVEVVFKRFEDVFEWPETLSPRRLIEHHIDLKKGTDPVNVRPYR
         I+GDPSLTKARVSLKNL+K+W E D G+L+ECR++    +       +E   IEE +  +  +F+D+FEWPE L PRR IEH I LK+GT+PVNVRPYR
Subjt:  VIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRALERRESLEEEDSFDEVLTIEESVEVVFKRFEDVFEWPETLSPRRLIEHHIDLKKGTDPVNVRPYR

Query:  YAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKDGSWRFCVDYMALNNVTVLDKFPIPVIEELFDELHGATMFTKIDLKSGYHQIRMCADDIEK
        YAY QK EMERLV EMLASG+I+PS SPYSSPVL+V+KKDGSWRFCVDY ALNNVTV DKFPIPV+EELFDEL GA++FTKIDLK+GYHQIRM   DIEK
Subjt:  YAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKDGSWRFCVDYMALNNVTVLDKFPIPVIEELFDELHGATMFTKIDLKSGYHQIRMCADDIEK

Query:  TAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRPYLQ-------------NKGLEDHLNHMKSVLEVLRKNELYAN-KKCSFAQFRVDYLGHIISGDG
        TAFRTHEGHYEF+VMPFGLTNAP+TFQSLMN+IFRPYL+             ++ LEDHL H+++V  VLRK+EL+AN KKCSF   +V+YLGH+IS  G
Subjt:  TAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRPYLQ-------------NKGLEDHLNHMKSVLEVLRKNELYAN-KKCSFAQFRVDYLGHIISGDG

Query:  VEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLKIGGFKWTEETQGAFNRLQQAMMSLPVLALLDFSVPFEIETDASGYGLG
        VEVD EKI+AI +WP P +VRE RGFLGLTGYYRKFV +YGT+AAPLTQLLK GGF W  E + AF +L++AM++LPVLAL  F  PFEIETDASGYG+G
Subjt:  VEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLKIGGFKWTEETQGAFNRLQQAMMSLPVLALLDFSVPFEIETDASGYGLG

Query:  AVLVQNKRPIAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQ-----------------------------
        AVL+QNKRPIA+Y+HTLAIR R +PVYERELMAVV+ VQRWRPYLLG +F+V+TDQ+SLKFLLEQRV+QPQ                             
Subjt:  AVLVQNKRPIAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQ-----------------------------

Query:  --VPATVHLNQLTAPKVIDIEVIKAEVDQDEKLKNI
          +  TV +  +T P  +D+++IK EV++D KL  I
Subjt:  --VPATVHLNQLTAPKVIDIEVIKAEVDQDEKLKNI

TYJ96875.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0056.18Show/hide
Query:  MYFKNLGMERQNQMMIRFMESAAQERSTMNEKIIELSMRSTLMKSNSEGEGSSSRESIPKNEEKKTNEGNNGDRNKFKKVEMPIFNGDDPDSWLFRVE--
        M    L  E+Q Q ++ +ME  A+ERS   E++ E   +++    +   + SSSR+    N +K   + N+ DR+KFKKVEMP+F G+DP+SWLFR E  
Subjt:  MYFKNLGMERQNQMMIRFMESAAQERSTMNEKIIELSMRSTLMKSNSEGEGSSSRESIPKNEEKKTNEGNNGDRNKFKKVEMPIFNGDDPDSWLFRVE--

Query:  -----------------------------------------------------REGSLYGRFLRIQQTTTVEEYRNLFDKLVAPLSDLPE----------
                                                             REG+ +GRFLRIQQ TTVEEYRNLFDKLVAPLSD+ +          
Subjt:  -----------------------------------------------------REGSLYGRFLRIQQTTTVEEYRNLFDKLVAPLSDLPE----------

Query:  --------------KGLSQMMRIAQKVENKEDIRREANLPGYSGGKLSS-SGSNTKVNTTVNSGDNKGSMNWPMRTITLRGTSNEEVRKEGPTKQLSDAE
                      KGL++MMR AQ VE++E +R  ANL GY GGK S+ + + TK      + +NK +  +P+RTITL+  ++ E RKEG +K+L DAE
Subjt:  --------------KGLSQMMRIAQKVENKEDIRREANLPGYSGGKLSS-SGSNTKVNTTVNSGDNKGSMNWPMRTITLRGTSNEEVRKEGPTKQLSDAE

Query:  FQSRREKGLCFRCNEKYSHDHKCKTKEHRELCMLVVMGENVEYEIIEEDSTEQKELNTIEVTGEEQTMVELSINSVVGLSNPGRMKVKGKLQGREVIVLI
        FQ RREKGLCF+CNEKYS DHKCK +E REL M VV   N E EI+EE  T+  EL T+EV  +    VELSINSVVGL++PG MKV+G LQG+EV++LI
Subjt:  FQSRREKGLCFRCNEKYSHDHKCKTKEHRELCMLVVMGENVEYEIIEEDSTEQKELNTIEVTGEEQTMVELSINSVVGLSNPGRMKVKGKLQGREVIVLI

Query:  DCGATHNFILEGLVKELQINTKITSNYGVILGSGTTIKGKEVCEAIEIMLGDWRVVDEFLPLELGGVDALLGMQWLYSLGITEVDWKNLNLTFTHQGKKV
        DCGATHNF+ E LV  LQ+  K T++YGVILGSGT I+GK +CE+IE+ + DW V ++FLPLELGGVD +LGMQWLYSLG+T  DWKNL LTF    KK+
Subjt:  DCGATHNFILEGLVKELQINTKITSNYGVILGSGTTIKGKEVCEAIEIMLGDWRVVDEFLPLELGGVDALLGMQWLYSLGITEVDWKNLNLTFTHQGKKV

Query:  VIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRALERRESLEEEDSFDEVLTIEESVEVVFKRFEDVFEWPETLSPRRLIEHHIDLKKGTDPVNVRPYR
         I+GDPSLTKARVSLKNL+K+W E D G+L+ECR++    +       +E   IEE +  +  +F+D+FEWPE L PRR IEH I LK+GT+PVNVRPYR
Subjt:  VIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRALERRESLEEEDSFDEVLTIEESVEVVFKRFEDVFEWPETLSPRRLIEHHIDLKKGTDPVNVRPYR

Query:  YAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKDGSWRFCVDYMALNNVTVLDKFPIPVIEELFDELHGATMFTKIDLKSGYHQIRMCADDIEK
        YAY QK EMERLV EMLASG+I+PS SPYSSPVL+V+KKDGSWRFCVDY ALNNVTV DKFPIPV+EELFDEL GA++FTKIDLK+GYHQIRM   DIEK
Subjt:  YAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKDGSWRFCVDYMALNNVTVLDKFPIPVIEELFDELHGATMFTKIDLKSGYHQIRMCADDIEK

Query:  TAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRPYLQ-------------NKGLEDHLNHMKSVLEVLRKNELYAN-KKCSFAQFRVDYLGHIISGDG
        TAFRTHEGHYEF+VMPFGLTNAP+TFQSLMN+IFRPYL+             ++ LEDHL H+++V  VLRK+EL+AN KKCSF   +V+YLGH+IS  G
Subjt:  TAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRPYLQ-------------NKGLEDHLNHMKSVLEVLRKNELYAN-KKCSFAQFRVDYLGHIISGDG

Query:  VEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLKIGGFKWTEETQGAFNRLQQAMMSLPVLALLDFSVPFEIETDASGYGLG
        VEVD EKI+AI +WP P +VRE RGFLGLTGYYRKFV +YGT+AAPLTQLLK GGF W  E + AF +L++AM++LPVLAL  F  PFEIETDASGYG+G
Subjt:  VEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLKIGGFKWTEETQGAFNRLQQAMMSLPVLALLDFSVPFEIETDASGYGLG

Query:  AVLVQNKRPIAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQ-----------------------------
        AVL+QNKRPIA+Y+HTLAIR R +PVYERELMAVV+ VQRWRPYLLG +F+V+TDQ+SLKFLLEQRV+QPQ                             
Subjt:  AVLVQNKRPIAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQ-----------------------------

Query:  --VPATVHLNQLTAPKVIDIEVIKAEVDQDEKLKNI
          +  TV +  +T P  +D+++IK EV++D KL  I
Subjt:  --VPATVHLNQLTAPKVIDIEVIKAEVDQDEKLKNI

TYK02195.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0055.78Show/hide
Query:  ERQNQMMIRFMESAAQERSTMNEKIIELSMRSTLMKSNSEGEGSSSRESI-------PKNEEKKTNEGNNGDRNKFKKVEMPIFNGDDPDSWLFRVE---
        ++Q QMM+  ME+ A++R+T +E+  + +   ++     E E +SS+ ++        +NE K   E    DRNKFKKVEMP+F G+DPDSWLFR E   
Subjt:  ERQNQMMIRFMESAAQERSTMNEKIIELSMRSTLMKSNSEGEGSSSRESI-------PKNEEKKTNEGNNGDRNKFKKVEMPIFNGDDPDSWLFRVE---

Query:  ----------------------------------------------------REGSLYGRFLRIQQTTTVEEYRNLFDKLVAPLSDLPE-----------
                                                            R+G++ G+FLR++Q +TV++YRNLFDKLVAPLSD+P+           
Subjt:  ----------------------------------------------------REGSLYGRFLRIQQTTTVEEYRNLFDKLVAPLSDLPE-----------

Query:  -------------KGLSQMMRIAQKVENKEDIRREANLPGYSGGKL---SSSGSNTKVNTTVNSGDNKGSMNWPMRTITLRGTSNEEVRKEGPTKQLSDA
                     KGL++ M  AQ VEN+E  R EANL  ++GGK    ++  S T VNT   + D+K + N+PMRTITLR ++N E+RK+G +++L DA
Subjt:  -------------KGLSQMMRIAQKVENKEDIRREANLPGYSGGKL---SSSGSNTKVNTTVNSGDNKGSMNWPMRTITLRGTSNEEVRKEGPTKQLSDA

Query:  EFQSRREKGLCFRCNEKYSHDHKCKTKEHRELCMLVVMGENVEYEIIEEDSTEQKELNTIEVTGEEQTMVELSINSVVGLSNPGRMKVKGKLQGREVIVL
        EFQ+R+EKGLCFRCNEKYS DH+CK KE REL M VV  E  EYEIIEE++ E+K L   +   +++T  ELS+NSVVGL++PG MKVKGK+Q REVI+L
Subjt:  EFQSRREKGLCFRCNEKYSHDHKCKTKEHRELCMLVVMGENVEYEIIEEDSTEQKELNTIEVTGEEQTMVELSINSVVGLSNPGRMKVKGKLQGREVIVL

Query:  IDCGATHNFILEGLVKELQINTKITSNYGVILGSGTTIKGKEVCEAIEIMLGDWRVVDEFLPLELGGVDALLGMQWLYSLGITEVDWKNLNLTFTHQGKK
        IDCGATHNFI E LVK LQ+  K T +YGVILGSGT ++GK +CE +EI L +W+V +EFLPLELGGVD +LGMQWL+SLGIT VDWKNL LTF+ +GK+
Subjt:  IDCGATHNFILEGLVKELQINTKITSNYGVILGSGTTIKGKEVCEAIEIMLGDWRVVDEFLPLELGGVDALLGMQWLYSLGITEVDWKNLNLTFTHQGKK

Query:  VVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRALERRESLEEEDSFDEVLTI-EESVEVVFKRFEDVFEWPETLSPRRLIEHHIDLKKGTDPVNVRP
        + I+GDPSLTK+R+SLK+++K+W E+D+GFL+ECRA++     E E S   V TI +E ++ V K+FEDVF+WPE L PRR IEH I LK+GT+P+NVRP
Subjt:  VVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRALERRESLEEEDSFDEVLTI-EESVEVVFKRFEDVFEWPETLSPRRLIEHHIDLKKGTDPVNVRP

Query:  YRYAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKDGSWRFCVDYMALNNVTVLDKFPIPVIEELFDELHGATMFTKIDLKSGYHQIRMCADDI
        YRY +QQK EMERLVEEMLASG+I+PS SP+SSPVL+V+KKDGSWRFCVDY A+NN T+ DKFPIPV EELFDEL+GAT+F+KIDLKSGYHQIRM   DI
Subjt:  YRYAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKDGSWRFCVDYMALNNVTVLDKFPIPVIEELFDELHGATMFTKIDLKSGYHQIRMCADDI

Query:  EKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRPYLQ-------------NKGLEDHLNHMKSVLEVLRKNELYAN-KKCSFAQFRVDYLGHIISG
         KTAFRTHEGHYEF+VMPFGLTNAP+TFQ+LMN+IFRP+L+             +K  +DH+ H++ V   LRK+ L+AN KKC+F Q +V+YLGHIISG
Subjt:  EKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRPYLQ-------------NKGLEDHLNHMKSVLEVLRKNELYAN-KKCSFAQFRVDYLGHIISG

Query:  DGVEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLKIGGFKWTEETQGAFNRLQQAMMSLPVLALLDFSVPFEIETDASGYG
        +GVEVD EKI+A+ +WP P N+REVRGFLGLTGYYR+FVQ+YG+IAAPLTQLLK GGFKW E+ + +F +L+ AMMSLP LAL +F++PFEIETDASG+G
Subjt:  DGVEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLKIGGFKWTEETQGAFNRLQQAMMSLPVLALLDFSVPFEIETDASGYG

Query:  LGAVLVQNKRPIAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQV--------------------------
        +GAVL+Q+KRPIA+Y+HTL++R R +PVYERELMAVV++VQRWRPYLLG KF+VKTDQ+SLKFLLEQRVIQPQ                           
Subjt:  LGAVLVQNKRPIAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQV--------------------------

Query:  -----PATVHLNQLTAPKVIDIEVIKAEVDQDEKLKNI
             P  + LN ++ P  +D+E IK EV++DEKLK I
Subjt:  -----PATVHLNQLTAPKVIDIEVIKAEVDQDEKLKNI

TYK03866.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0055.05Show/hide
Query:  LGMERQNQMMIRFMESAAQERSTMNEKIIELSMRSTLMKSNSEGEGSSSR----ESIPKNE--EKKTNEGNN-GDRNKFKKVEMPIFNGDDPDSWLFRVE
        L  E+Q Q++   +E++++ERSTM+ +  E +++        + + SSSR    +   +NE  E++T+  +   DRNKFKK+EMP+F G+DPDSWLFR E
Subjt:  LGMERQNQMMIRFMESAAQERSTMNEKIIELSMRSTLMKSNSEGEGSSSR----ESIPKNE--EKKTNEGNN-GDRNKFKKVEMPIFNGDDPDSWLFRVE

Query:  R-------------------------------------------------------EGSLYGRFLRIQQTTTVEEYRNLFDKLVAPLSDLPE--------
        R                                                       +G++ G+FLRI+Q  +VEEY NLFDK+VAP++DLPE        
Subjt:  R-------------------------------------------------------EGSLYGRFLRIQQTTTVEEYRNLFDKLVAPLSDLPE--------

Query:  ----------------KGLSQMMRIAQKVENKEDIRREANLPGYSGGKLSS-SGSNTKVNTTVNSGDNKGSMNWPMRTITLRGTSNEEVRKEGPTKQLSD
                        KGL++MM  AQ VEN+E +R E  + GYSGGKL+S + SN ++N    +GD KG+ ++  RTITLR  ++ E R+EG  K+L D
Subjt:  ----------------KGLSQMMRIAQKVENKEDIRREANLPGYSGGKLSS-SGSNTKVNTTVNSGDNKGSMNWPMRTITLRGTSNEEVRKEGPTKQLSD

Query:  AEFQSRREKGLCFRCNEKYSHDHKCKTKEHRELCMLVVMGENVEYEIIEEDSTEQKELNTIEVTGEEQTMVELSINSVVGLSNPGRMKVKGKLQGREVIV
        AEFQ+R+EKGLCFRCNEKYS DHKC+ KE REL M V+     EYEI+EE+  E+K+L  +EV GE  T+VELSINSVVGL++PG MKV+GKL G EV+V
Subjt:  AEFQSRREKGLCFRCNEKYSHDHKCKTKEHRELCMLVVMGENVEYEIIEEDSTEQKELNTIEVTGEEQTMVELSINSVVGLSNPGRMKVKGKLQGREVIV

Query:  LIDCGATHNFILEGLVKELQINTKITSNYGVILGSGTTIKGKEVCEAIEIMLGDWRVVDEFLPLELGGVDALLGMQWLYSLGITEVDWKNLNLTFTHQGK
        LIDCGATHNF+ E LVK+L +  K TS+YGVILGSG  ++GK +CE +E+ L  WR+V++FLPLELGGVD +LGMQWLYSLG+T VDWKNL+L+F  +GK
Subjt:  LIDCGATHNFILEGLVKELQINTKITSNYGVILGSGTTIKGKEVCEAIEIMLGDWRVVDEFLPLELGGVDALLGMQWLYSLGITEVDWKNLNLTFTHQGK

Query:  KVVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRALERRESLEEEDSFDEVLTIEES-VEVVFKRFEDVFEWPETLSPRRLIEHHIDLKKGTDPVNVR
        +V I+GDPSLTKAR+SLKN+MK W E+D GFL+ECR+L+ R   +EE        + +  +  V K+++DVF+WPE L PRR IEHHI LK GTDP+NVR
Subjt:  KVVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRALERRESLEEEDSFDEVLTIEES-VEVVFKRFEDVFEWPETLSPRRLIEHHIDLKKGTDPVNVR

Query:  PYRYAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKDGSWRFCVDYMALNNVTVLDKFPIPVIEELFDELHGATMFTKIDLKSGYHQIRMCADD
        PYRY +QQK EME+LV+EML SGVI+PS SPYSSPVL+V+KKDGSWRFCVDY A+NN T+ DKFPIPV+EELFDEL GA +F+KI+LKSGYHQIRM  DD
Subjt:  PYRYAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKDGSWRFCVDYMALNNVTVLDKFPIPVIEELFDELHGATMFTKIDLKSGYHQIRMCADD

Query:  IEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRPYLQ-------------NKGLEDHLNHMKSVLEVLRKNELYAN-KKCSFAQFRVDYLGHIIS
        IEKTAFRTHEGHYEF+VMPFGLTNAP+TFQ+LMNT+F+P+L+             ++  ++H  H++ VL+VLR++ELYAN KKC FAQ +++YLGH+IS
Subjt:  IEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRPYLQ-------------NKGLEDHLNHMKSVLEVLRKNELYAN-KKCSFAQFRVDYLGHIIS

Query:  GDGVEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLKIGGFKWTEETQGAFNRLQQAMMSLPVLALLDFSVPFEIETDASGY
        G+GV VD EKI+AI +WP P NV+E RGFLGLTGYYR+FV+NYGTIAAPLTQLLK GGF WTEE   AF+RL+ AM+SLPVLAL DF+  FEIE DASGY
Subjt:  GDGVEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLKIGGFKWTEETQGAFNRLQQAMMSLPVLALLDFSVPFEIETDASGY

Query:  GLGAVLVQNKRPIAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQV-------------------------
        G+GAVL+Q++RP+AY++HTLA+R R +PVYERELMA+V+ VQRWRPYLL  KF VKTDQ++LKFLL+QR+IQPQ                          
Subjt:  GLGAVLVQNKRPIAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQV-------------------------

Query:  ------PATVHLNQLTAPKVIDIEVIKAEVDQDEKLKNI
              P  V L  L+ P  +D++VIK EV QD K K I
Subjt:  ------PATVHLNQLTAPKVIDIEVIKAEVDQDEKLKNI

TrEMBL top hitse value%identityAlignment
A0A5A7V5H5 Ty3/gypsy retrotransposon protein0.0e+0056.08Show/hide
Query:  MYFKNLGMERQNQMMIRFMESAAQERSTMNEKIIELSMRSTLMKSNSEGEGSSSRESIPKNEEKKTNEGNNGDRNKFKKVEMPIFNGDDPDSWLFRVE--
        M    L  E+Q Q ++ +ME  A+ERS   E++ E   +++    +   + SSSR+    N +K   + N+ DR+KFKKVEMP+F G+DP+SWLFR E  
Subjt:  MYFKNLGMERQNQMMIRFMESAAQERSTMNEKIIELSMRSTLMKSNSEGEGSSSRESIPKNEEKKTNEGNNGDRNKFKKVEMPIFNGDDPDSWLFRVE--

Query:  -----------------------------------------------------REGSLYGRFLRIQQTTTVEEYRNLFDKLVAPLSDLPE----------
                                                             REG+ +GRFLRIQQ TTVEEYRNLFDKLVAPL D+ +          
Subjt:  -----------------------------------------------------REGSLYGRFLRIQQTTTVEEYRNLFDKLVAPLSDLPE----------

Query:  --------------KGLSQMMRIAQKVENKEDIRREANLPGYSGGKLSS-SGSNTKVNTTVNSGDNKGSMNWPMRTITLRGTSNEEVRKEGPTKQLSDAE
                      KGL++MMR AQ VE++E +R  ANL GY GGK S+ + + TK      + +NK +  +P+RTITL+  ++ E RKEG +K+L DAE
Subjt:  --------------KGLSQMMRIAQKVENKEDIRREANLPGYSGGKLSS-SGSNTKVNTTVNSGDNKGSMNWPMRTITLRGTSNEEVRKEGPTKQLSDAE

Query:  FQSRREKGLCFRCNEKYSHDHKCKTKEHRELCMLVVMGENVEYEIIEEDSTEQKELNTIEVTGEEQTMVELSINSVVGLSNPGRMKVKGKLQGREVIVLI
        FQ RREKGLCF+CNEKYS DHKCK +E REL M VV   N E EI+EE  T+  EL T+EV  +    VELSINSVVGL++PG MKV+G LQG+EV++LI
Subjt:  FQSRREKGLCFRCNEKYSHDHKCKTKEHRELCMLVVMGENVEYEIIEEDSTEQKELNTIEVTGEEQTMVELSINSVVGLSNPGRMKVKGKLQGREVIVLI

Query:  DCGATHNFILEGLVKELQINTKITSNYGVILGSGTTIKGKEVCEAIEIMLGDWRVVDEFLPLELGGVDALLGMQWLYSLGITEVDWKNLNLTFTHQGKKV
        DCGATHNF+ E LV  LQ+  K T++YGVILGSGT I+GK +CE+IE+ + DW V ++FLPLELGGVD +LGMQWLYSLG+T  DWKNL LTF    KK+
Subjt:  DCGATHNFILEGLVKELQINTKITSNYGVILGSGTTIKGKEVCEAIEIMLGDWRVVDEFLPLELGGVDALLGMQWLYSLGITEVDWKNLNLTFTHQGKKV

Query:  VIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRALERRESLEEEDSFDEVLTIEESVEVVFKRFEDVFEWPETLSPRRLIEHHIDLKKGTDPVNVRPYR
         I+GDPSLTKARVSLKNL+K+W E D G+L+ECR++    +       +E   IEE +  +  +F+D+FEWPE L PRR IEH I LK+GT+PVNVRPYR
Subjt:  VIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRALERRESLEEEDSFDEVLTIEESVEVVFKRFEDVFEWPETLSPRRLIEHHIDLKKGTDPVNVRPYR

Query:  YAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKDGSWRFCVDYMALNNVTVLDKFPIPVIEELFDELHGATMFTKIDLKSGYHQIRMCADDIEK
        YAY QK EMERLV EMLASG+I+PS SPYSSPVL+V+KKDGSWRFCVDY ALNNVTV DKFPIPV+EELFDEL GA++FTKIDLK+GYHQIRM   DIEK
Subjt:  YAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKDGSWRFCVDYMALNNVTVLDKFPIPVIEELFDELHGATMFTKIDLKSGYHQIRMCADDIEK

Query:  TAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRPYLQ-------------NKGLEDHLNHMKSVLEVLRKNELYAN-KKCSFAQFRVDYLGHIISGDG
        TAFRTHEGHYEF+VMPFGLTNAP+TFQSLMN+IFRPYL+             ++ LEDHL H+++V  VLRK+EL+AN KKCSF   +V+YLGH+IS  G
Subjt:  TAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRPYLQ-------------NKGLEDHLNHMKSVLEVLRKNELYAN-KKCSFAQFRVDYLGHIISGDG

Query:  VEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLKIGGFKWTEETQGAFNRLQQAMMSLPVLALLDFSVPFEIETDASGYGLG
        VEVD EKI+AI +WP P +VRE RGFLGLTGYYRKFV +YGT+AAPLTQLLK GGF W  E + AF +L++AM++LPVLAL  F  PFEIETDASGYG+G
Subjt:  VEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLKIGGFKWTEETQGAFNRLQQAMMSLPVLALLDFSVPFEIETDASGYGLG

Query:  AVLVQNKRPIAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQ-----------------------------
        AVL+QNKRPIA+Y+HTLAIR R +PVYERELMAVV+ VQRWRPYLLG +F+V+TDQ+SLKFLLEQRV+QPQ                             
Subjt:  AVLVQNKRPIAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQ-----------------------------

Query:  --VPATVHLNQLTAPKVIDIEVIKAEVDQDEKLKNI
          +  TV +  +T P  +D+++IK EV++D KL  I
Subjt:  --VPATVHLNQLTAPKVIDIEVIKAEVDQDEKLKNI

A0A5A7VJA0 Ty3/gypsy retrotransposon protein0.0e+0056.18Show/hide
Query:  MYFKNLGMERQNQMMIRFMESAAQERSTMNEKIIELSMRSTLMKSNSEGEGSSSRESIPKNEEKKTNEGNNGDRNKFKKVEMPIFNGDDPDSWLFRVE--
        M    L  E+Q Q ++ +ME  A+ERS   E++ E   +++    +   + SSSR+    N +K   + N+ DR+KFKKVEMP+F G+DP+SWLFR E  
Subjt:  MYFKNLGMERQNQMMIRFMESAAQERSTMNEKIIELSMRSTLMKSNSEGEGSSSRESIPKNEEKKTNEGNNGDRNKFKKVEMPIFNGDDPDSWLFRVE--

Query:  -----------------------------------------------------REGSLYGRFLRIQQTTTVEEYRNLFDKLVAPLSDLPE----------
                                                             REG+ +GRFLRIQQ TTVEEYRNLFDKLVAPLSD+ +          
Subjt:  -----------------------------------------------------REGSLYGRFLRIQQTTTVEEYRNLFDKLVAPLSDLPE----------

Query:  --------------KGLSQMMRIAQKVENKEDIRREANLPGYSGGKLSS-SGSNTKVNTTVNSGDNKGSMNWPMRTITLRGTSNEEVRKEGPTKQLSDAE
                      KGL++MMR AQ VE++E +R  ANL GY GGK S+ + + TK      + +NK +  +P+RTITL+  ++ E RKEG +K+L DAE
Subjt:  --------------KGLSQMMRIAQKVENKEDIRREANLPGYSGGKLSS-SGSNTKVNTTVNSGDNKGSMNWPMRTITLRGTSNEEVRKEGPTKQLSDAE

Query:  FQSRREKGLCFRCNEKYSHDHKCKTKEHRELCMLVVMGENVEYEIIEEDSTEQKELNTIEVTGEEQTMVELSINSVVGLSNPGRMKVKGKLQGREVIVLI
        FQ RREKGLCF+CNEKYS DHKCK +E REL M VV   N E EI+EE  T+  EL T+EV  +    VELSINSVVGL++PG MKV+G LQG+EV++LI
Subjt:  FQSRREKGLCFRCNEKYSHDHKCKTKEHRELCMLVVMGENVEYEIIEEDSTEQKELNTIEVTGEEQTMVELSINSVVGLSNPGRMKVKGKLQGREVIVLI

Query:  DCGATHNFILEGLVKELQINTKITSNYGVILGSGTTIKGKEVCEAIEIMLGDWRVVDEFLPLELGGVDALLGMQWLYSLGITEVDWKNLNLTFTHQGKKV
        DCGATHNF+ E LV  LQ+  K T++YGVILGSGT I+GK +CE+IE+ + DW V ++FLPLELGGVD +LGMQWLYSLG+T  DWKNL LTF    KK+
Subjt:  DCGATHNFILEGLVKELQINTKITSNYGVILGSGTTIKGKEVCEAIEIMLGDWRVVDEFLPLELGGVDALLGMQWLYSLGITEVDWKNLNLTFTHQGKKV

Query:  VIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRALERRESLEEEDSFDEVLTIEESVEVVFKRFEDVFEWPETLSPRRLIEHHIDLKKGTDPVNVRPYR
         I+GDPSLTKARVSLKNL+K+W E D G+L+ECR++    +       +E   IEE +  +  +F+D+FEWPE L PRR IEH I LK+GT+PVNVRPYR
Subjt:  VIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRALERRESLEEEDSFDEVLTIEESVEVVFKRFEDVFEWPETLSPRRLIEHHIDLKKGTDPVNVRPYR

Query:  YAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKDGSWRFCVDYMALNNVTVLDKFPIPVIEELFDELHGATMFTKIDLKSGYHQIRMCADDIEK
        YAY QK EMERLV EMLASG+I+PS SPYSSPVL+V+KKDGSWRFCVDY ALNNVTV DKFPIPV+EELFDEL GA++FTKIDLK+GYHQIRM   DIEK
Subjt:  YAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKDGSWRFCVDYMALNNVTVLDKFPIPVIEELFDELHGATMFTKIDLKSGYHQIRMCADDIEK

Query:  TAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRPYLQ-------------NKGLEDHLNHMKSVLEVLRKNELYAN-KKCSFAQFRVDYLGHIISGDG
        TAFRTHEGHYEF+VMPFGLTNAP+TFQSLMN+IFRPYL+             ++ LEDHL H+++V  VLRK+EL+AN KKCSF   +V+YLGH+IS  G
Subjt:  TAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRPYLQ-------------NKGLEDHLNHMKSVLEVLRKNELYAN-KKCSFAQFRVDYLGHIISGDG

Query:  VEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLKIGGFKWTEETQGAFNRLQQAMMSLPVLALLDFSVPFEIETDASGYGLG
        VEVD EKI+AI +WP P +VRE RGFLGLTGYYRKFV +YGT+AAPLTQLLK GGF W  E + AF +L++AM++LPVLAL  F  PFEIETDASGYG+G
Subjt:  VEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLKIGGFKWTEETQGAFNRLQQAMMSLPVLALLDFSVPFEIETDASGYGLG

Query:  AVLVQNKRPIAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQ-----------------------------
        AVL+QNKRPIA+Y+HTLAIR R +PVYERELMAVV+ VQRWRPYLLG +F+V+TDQ+SLKFLLEQRV+QPQ                             
Subjt:  AVLVQNKRPIAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQ-----------------------------

Query:  --VPATVHLNQLTAPKVIDIEVIKAEVDQDEKLKNI
          +  TV +  +T P  +D+++IK EV++D KL  I
Subjt:  --VPATVHLNQLTAPKVIDIEVIKAEVDQDEKLKNI

A0A5D3BEL2 Ty3/gypsy retrotransposon protein0.0e+0056.18Show/hide
Query:  MYFKNLGMERQNQMMIRFMESAAQERSTMNEKIIELSMRSTLMKSNSEGEGSSSRESIPKNEEKKTNEGNNGDRNKFKKVEMPIFNGDDPDSWLFRVE--
        M    L  E+Q Q ++ +ME  A+ERS   E++ E   +++    +   + SSSR+    N +K   + N+ DR+KFKKVEMP+F G+DP+SWLFR E  
Subjt:  MYFKNLGMERQNQMMIRFMESAAQERSTMNEKIIELSMRSTLMKSNSEGEGSSSRESIPKNEEKKTNEGNNGDRNKFKKVEMPIFNGDDPDSWLFRVE--

Query:  -----------------------------------------------------REGSLYGRFLRIQQTTTVEEYRNLFDKLVAPLSDLPE----------
                                                             REG+ +GRFLRIQQ TTVEEYRNLFDKLVAPLSD+ +          
Subjt:  -----------------------------------------------------REGSLYGRFLRIQQTTTVEEYRNLFDKLVAPLSDLPE----------

Query:  --------------KGLSQMMRIAQKVENKEDIRREANLPGYSGGKLSS-SGSNTKVNTTVNSGDNKGSMNWPMRTITLRGTSNEEVRKEGPTKQLSDAE
                      KGL++MMR AQ VE++E +R  ANL GY GGK S+ + + TK      + +NK +  +P+RTITL+  ++ E RKEG +K+L DAE
Subjt:  --------------KGLSQMMRIAQKVENKEDIRREANLPGYSGGKLSS-SGSNTKVNTTVNSGDNKGSMNWPMRTITLRGTSNEEVRKEGPTKQLSDAE

Query:  FQSRREKGLCFRCNEKYSHDHKCKTKEHRELCMLVVMGENVEYEIIEEDSTEQKELNTIEVTGEEQTMVELSINSVVGLSNPGRMKVKGKLQGREVIVLI
        FQ RREKGLCF+CNEKYS DHKCK +E REL M VV   N E EI+EE  T+  EL T+EV  +    VELSINSVVGL++PG MKV+G LQG+EV++LI
Subjt:  FQSRREKGLCFRCNEKYSHDHKCKTKEHRELCMLVVMGENVEYEIIEEDSTEQKELNTIEVTGEEQTMVELSINSVVGLSNPGRMKVKGKLQGREVIVLI

Query:  DCGATHNFILEGLVKELQINTKITSNYGVILGSGTTIKGKEVCEAIEIMLGDWRVVDEFLPLELGGVDALLGMQWLYSLGITEVDWKNLNLTFTHQGKKV
        DCGATHNF+ E LV  LQ+  K T++YGVILGSGT I+GK +CE+IE+ + DW V ++FLPLELGGVD +LGMQWLYSLG+T  DWKNL LTF    KK+
Subjt:  DCGATHNFILEGLVKELQINTKITSNYGVILGSGTTIKGKEVCEAIEIMLGDWRVVDEFLPLELGGVDALLGMQWLYSLGITEVDWKNLNLTFTHQGKKV

Query:  VIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRALERRESLEEEDSFDEVLTIEESVEVVFKRFEDVFEWPETLSPRRLIEHHIDLKKGTDPVNVRPYR
         I+GDPSLTKARVSLKNL+K+W E D G+L+ECR++    +       +E   IEE +  +  +F+D+FEWPE L PRR IEH I LK+GT+PVNVRPYR
Subjt:  VIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRALERRESLEEEDSFDEVLTIEESVEVVFKRFEDVFEWPETLSPRRLIEHHIDLKKGTDPVNVRPYR

Query:  YAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKDGSWRFCVDYMALNNVTVLDKFPIPVIEELFDELHGATMFTKIDLKSGYHQIRMCADDIEK
        YAY QK EMERLV EMLASG+I+PS SPYSSPVL+V+KKDGSWRFCVDY ALNNVTV DKFPIPV+EELFDEL GA++FTKIDLK+GYHQIRM   DIEK
Subjt:  YAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKDGSWRFCVDYMALNNVTVLDKFPIPVIEELFDELHGATMFTKIDLKSGYHQIRMCADDIEK

Query:  TAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRPYLQ-------------NKGLEDHLNHMKSVLEVLRKNELYAN-KKCSFAQFRVDYLGHIISGDG
        TAFRTHEGHYEF+VMPFGLTNAP+TFQSLMN+IFRPYL+             ++ LEDHL H+++V  VLRK+EL+AN KKCSF   +V+YLGH+IS  G
Subjt:  TAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRPYLQ-------------NKGLEDHLNHMKSVLEVLRKNELYAN-KKCSFAQFRVDYLGHIISGDG

Query:  VEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLKIGGFKWTEETQGAFNRLQQAMMSLPVLALLDFSVPFEIETDASGYGLG
        VEVD EKI+AI +WP P +VRE RGFLGLTGYYRKFV +YGT+AAPLTQLLK GGF W  E + AF +L++AM++LPVLAL  F  PFEIETDASGYG+G
Subjt:  VEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLKIGGFKWTEETQGAFNRLQQAMMSLPVLALLDFSVPFEIETDASGYGLG

Query:  AVLVQNKRPIAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQ-----------------------------
        AVL+QNKRPIA+Y+HTLAIR R +PVYERELMAVV+ VQRWRPYLLG +F+V+TDQ+SLKFLLEQRV+QPQ                             
Subjt:  AVLVQNKRPIAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQ-----------------------------

Query:  --VPATVHLNQLTAPKVIDIEVIKAEVDQDEKLKNI
          +  TV +  +T P  +D+++IK EV++D KL  I
Subjt:  --VPATVHLNQLTAPKVIDIEVIKAEVDQDEKLKNI

A0A5D3BSP2 Ty3/gypsy retrotransposon protein0.0e+0055.78Show/hide
Query:  ERQNQMMIRFMESAAQERSTMNEKIIELSMRSTLMKSNSEGEGSSSRESI-------PKNEEKKTNEGNNGDRNKFKKVEMPIFNGDDPDSWLFRVE---
        ++Q QMM+  ME+ A++R+T +E+  + +   ++     E E +SS+ ++        +NE K   E    DRNKFKKVEMP+F G+DPDSWLFR E   
Subjt:  ERQNQMMIRFMESAAQERSTMNEKIIELSMRSTLMKSNSEGEGSSSRESI-------PKNEEKKTNEGNNGDRNKFKKVEMPIFNGDDPDSWLFRVE---

Query:  ----------------------------------------------------REGSLYGRFLRIQQTTTVEEYRNLFDKLVAPLSDLPE-----------
                                                            R+G++ G+FLR++Q +TV++YRNLFDKLVAPLSD+P+           
Subjt:  ----------------------------------------------------REGSLYGRFLRIQQTTTVEEYRNLFDKLVAPLSDLPE-----------

Query:  -------------KGLSQMMRIAQKVENKEDIRREANLPGYSGGKL---SSSGSNTKVNTTVNSGDNKGSMNWPMRTITLRGTSNEEVRKEGPTKQLSDA
                     KGL++ M  AQ VEN+E  R EANL  ++GGK    ++  S T VNT   + D+K + N+PMRTITLR ++N E+RK+G +++L DA
Subjt:  -------------KGLSQMMRIAQKVENKEDIRREANLPGYSGGKL---SSSGSNTKVNTTVNSGDNKGSMNWPMRTITLRGTSNEEVRKEGPTKQLSDA

Query:  EFQSRREKGLCFRCNEKYSHDHKCKTKEHRELCMLVVMGENVEYEIIEEDSTEQKELNTIEVTGEEQTMVELSINSVVGLSNPGRMKVKGKLQGREVIVL
        EFQ+R+EKGLCFRCNEKYS DH+CK KE REL M VV  E  EYEIIEE++ E+K L   +   +++T  ELS+NSVVGL++PG MKVKGK+Q REVI+L
Subjt:  EFQSRREKGLCFRCNEKYSHDHKCKTKEHRELCMLVVMGENVEYEIIEEDSTEQKELNTIEVTGEEQTMVELSINSVVGLSNPGRMKVKGKLQGREVIVL

Query:  IDCGATHNFILEGLVKELQINTKITSNYGVILGSGTTIKGKEVCEAIEIMLGDWRVVDEFLPLELGGVDALLGMQWLYSLGITEVDWKNLNLTFTHQGKK
        IDCGATHNFI E LVK LQ+  K T +YGVILGSGT ++GK +CE +EI L +W+V +EFLPLELGGVD +LGMQWL+SLGIT VDWKNL LTF+ +GK+
Subjt:  IDCGATHNFILEGLVKELQINTKITSNYGVILGSGTTIKGKEVCEAIEIMLGDWRVVDEFLPLELGGVDALLGMQWLYSLGITEVDWKNLNLTFTHQGKK

Query:  VVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRALERRESLEEEDSFDEVLTI-EESVEVVFKRFEDVFEWPETLSPRRLIEHHIDLKKGTDPVNVRP
        + I+GDPSLTK+R+SLK+++K+W E+D+GFL+ECRA++     E E S   V TI +E ++ V K+FEDVF+WPE L PRR IEH I LK+GT+P+NVRP
Subjt:  VVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRALERRESLEEEDSFDEVLTI-EESVEVVFKRFEDVFEWPETLSPRRLIEHHIDLKKGTDPVNVRP

Query:  YRYAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKDGSWRFCVDYMALNNVTVLDKFPIPVIEELFDELHGATMFTKIDLKSGYHQIRMCADDI
        YRY +QQK EMERLVEEMLASG+I+PS SP+SSPVL+V+KKDGSWRFCVDY A+NN T+ DKFPIPV EELFDEL+GAT+F+KIDLKSGYHQIRM   DI
Subjt:  YRYAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKDGSWRFCVDYMALNNVTVLDKFPIPVIEELFDELHGATMFTKIDLKSGYHQIRMCADDI

Query:  EKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRPYLQ-------------NKGLEDHLNHMKSVLEVLRKNELYAN-KKCSFAQFRVDYLGHIISG
         KTAFRTHEGHYEF+VMPFGLTNAP+TFQ+LMN+IFRP+L+             +K  +DH+ H++ V   LRK+ L+AN KKC+F Q +V+YLGHIISG
Subjt:  EKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRPYLQ-------------NKGLEDHLNHMKSVLEVLRKNELYAN-KKCSFAQFRVDYLGHIISG

Query:  DGVEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLKIGGFKWTEETQGAFNRLQQAMMSLPVLALLDFSVPFEIETDASGYG
        +GVEVD EKI+A+ +WP P N+REVRGFLGLTGYYR+FVQ+YG+IAAPLTQLLK GGFKW E+ + +F +L+ AMMSLP LAL +F++PFEIETDASG+G
Subjt:  DGVEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLKIGGFKWTEETQGAFNRLQQAMMSLPVLALLDFSVPFEIETDASGYG

Query:  LGAVLVQNKRPIAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQV--------------------------
        +GAVL+Q+KRPIA+Y+HTL++R R +PVYERELMAVV++VQRWRPYLLG KF+VKTDQ+SLKFLLEQRVIQPQ                           
Subjt:  LGAVLVQNKRPIAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQV--------------------------

Query:  -----PATVHLNQLTAPKVIDIEVIKAEVDQDEKLKNI
             P  + LN ++ P  +D+E IK EV++DEKLK I
Subjt:  -----PATVHLNQLTAPKVIDIEVIKAEVDQDEKLKNI

A0A5D3C091 Ty3/gypsy retrotransposon protein0.0e+0055.05Show/hide
Query:  LGMERQNQMMIRFMESAAQERSTMNEKIIELSMRSTLMKSNSEGEGSSSR----ESIPKNE--EKKTNEGNN-GDRNKFKKVEMPIFNGDDPDSWLFRVE
        L  E+Q Q++   +E++++ERSTM+ +  E +++        + + SSSR    +   +NE  E++T+  +   DRNKFKK+EMP+F G+DPDSWLFR E
Subjt:  LGMERQNQMMIRFMESAAQERSTMNEKIIELSMRSTLMKSNSEGEGSSSR----ESIPKNE--EKKTNEGNN-GDRNKFKKVEMPIFNGDDPDSWLFRVE

Query:  R-------------------------------------------------------EGSLYGRFLRIQQTTTVEEYRNLFDKLVAPLSDLPE--------
        R                                                       +G++ G+FLRI+Q  +VEEY NLFDK+VAP++DLPE        
Subjt:  R-------------------------------------------------------EGSLYGRFLRIQQTTTVEEYRNLFDKLVAPLSDLPE--------

Query:  ----------------KGLSQMMRIAQKVENKEDIRREANLPGYSGGKLSS-SGSNTKVNTTVNSGDNKGSMNWPMRTITLRGTSNEEVRKEGPTKQLSD
                        KGL++MM  AQ VEN+E +R E  + GYSGGKL+S + SN ++N    +GD KG+ ++  RTITLR  ++ E R+EG  K+L D
Subjt:  ----------------KGLSQMMRIAQKVENKEDIRREANLPGYSGGKLSS-SGSNTKVNTTVNSGDNKGSMNWPMRTITLRGTSNEEVRKEGPTKQLSD

Query:  AEFQSRREKGLCFRCNEKYSHDHKCKTKEHRELCMLVVMGENVEYEIIEEDSTEQKELNTIEVTGEEQTMVELSINSVVGLSNPGRMKVKGKLQGREVIV
        AEFQ+R+EKGLCFRCNEKYS DHKC+ KE REL M V+     EYEI+EE+  E+K+L  +EV GE  T+VELSINSVVGL++PG MKV+GKL G EV+V
Subjt:  AEFQSRREKGLCFRCNEKYSHDHKCKTKEHRELCMLVVMGENVEYEIIEEDSTEQKELNTIEVTGEEQTMVELSINSVVGLSNPGRMKVKGKLQGREVIV

Query:  LIDCGATHNFILEGLVKELQINTKITSNYGVILGSGTTIKGKEVCEAIEIMLGDWRVVDEFLPLELGGVDALLGMQWLYSLGITEVDWKNLNLTFTHQGK
        LIDCGATHNF+ E LVK+L +  K TS+YGVILGSG  ++GK +CE +E+ L  WR+V++FLPLELGGVD +LGMQWLYSLG+T VDWKNL+L+F  +GK
Subjt:  LIDCGATHNFILEGLVKELQINTKITSNYGVILGSGTTIKGKEVCEAIEIMLGDWRVVDEFLPLELGGVDALLGMQWLYSLGITEVDWKNLNLTFTHQGK

Query:  KVVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRALERRESLEEEDSFDEVLTIEES-VEVVFKRFEDVFEWPETLSPRRLIEHHIDLKKGTDPVNVR
        +V I+GDPSLTKAR+SLKN+MK W E+D GFL+ECR+L+ R   +EE        + +  +  V K+++DVF+WPE L PRR IEHHI LK GTDP+NVR
Subjt:  KVVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRALERRESLEEEDSFDEVLTIEES-VEVVFKRFEDVFEWPETLSPRRLIEHHIDLKKGTDPVNVR

Query:  PYRYAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKDGSWRFCVDYMALNNVTVLDKFPIPVIEELFDELHGATMFTKIDLKSGYHQIRMCADD
        PYRY +QQK EME+LV+EML SGVI+PS SPYSSPVL+V+KKDGSWRFCVDY A+NN T+ DKFPIPV+EELFDEL GA +F+KI+LKSGYHQIRM  DD
Subjt:  PYRYAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKDGSWRFCVDYMALNNVTVLDKFPIPVIEELFDELHGATMFTKIDLKSGYHQIRMCADD

Query:  IEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRPYLQ-------------NKGLEDHLNHMKSVLEVLRKNELYAN-KKCSFAQFRVDYLGHIIS
        IEKTAFRTHEGHYEF+VMPFGLTNAP+TFQ+LMNT+F+P+L+             ++  ++H  H++ VL+VLR++ELYAN KKC FAQ +++YLGH+IS
Subjt:  IEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRPYLQ-------------NKGLEDHLNHMKSVLEVLRKNELYAN-KKCSFAQFRVDYLGHIIS

Query:  GDGVEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLKIGGFKWTEETQGAFNRLQQAMMSLPVLALLDFSVPFEIETDASGY
        G+GV VD EKI+AI +WP P NV+E RGFLGLTGYYR+FV+NYGTIAAPLTQLLK GGF WTEE   AF+RL+ AM+SLPVLAL DF+  FEIE DASGY
Subjt:  GDGVEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLKIGGFKWTEETQGAFNRLQQAMMSLPVLALLDFSVPFEIETDASGY

Query:  GLGAVLVQNKRPIAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQV-------------------------
        G+GAVL+Q++RP+AY++HTLA+R R +PVYERELMA+V+ VQRWRPYLL  KF VKTDQ++LKFLL+QR+IQPQ                          
Subjt:  GLGAVLVQNKRPIAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQV-------------------------

Query:  ------PATVHLNQLTAPKVIDIEVIKAEVDQDEKLKNI
              P  V L  L+ P  +D++VIK EV QD K K I
Subjt:  ------PATVHLNQLTAPKVIDIEVIKAEVDQDEKLKNI

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.65.8e-7338.78Show/hide
Query:  EHHIDLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKDGS-----WRFCVDYMALNNVTVLDKFPIPVIEELFDELHGA
        +H I+ K      +   Y  AY+Q  E+E  +++ML  G+I+ S SPY+SP+ VV KK  +     +R  +DY  LN +TV D+ PIP ++E+  +L   
Subjt:  EHHIDLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKDGS-----WRFCVDYMALNNVTVLDKFPIPVIEELFDELHGA

Query:  TMFTKIDLKSGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRPYLQ-------------NKGLEDHLNHMKSVLEVLRKNELY
          FT IDL  G+HQI M  + + KTAF T  GHYE++ MPFGL NAP+TFQ  MN I RP L              +  L++HL  +  V E L K  L 
Subjt:  TMFTKIDLKSGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRPYLQ-------------NKGLEDHLNHMKSVLEVLRKNELY

Query:  AN-KKCSFAQFRVDYLGHIISGDGVEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLK--IGGFKWTEETQGAFNRLQQAMM
            KC F +    +LGH+++ DG++ + EKI AI+++PIP   +E++ FLGLTGYYRKF+ N+  IA P+T+ LK  +       E   AF +L+  + 
Subjt:  AN-KKCSFAQFRVDYLGHIISGDGVEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLK--IGGFKWTEETQGAFNRLQQAMM

Query:  SLPVLALLDFSVPFEIETDASGYGLGAVLVQNKRPIAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFLVKTDQRSLKFLLEQR
          P+L + DF+  F + TDAS   LGAVL Q+  P++Y + TL          E+EL+A+V   + +R YLLG+ F + +D + L +L   +
Subjt:  SLPVLALLDFSVPFEIETDASGYGLGAVLVQNKRPIAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFLVKTDQRSLKFLLEQR

P20825 Retrovirus-related Pol polyprotein from transposon 2973.3e-7632.48Show/hide
Query:  VKGKLQGREVIVLIDCGATHNFILEGLVKELQINTK---ITSNYGVILGSGTTIKGKEVCEAIEIMLGDWRVVDEFLPLELGGVDALLGMQWLYSLGITE
        +K   +GR    L+D G+T N I E +      N++   +TSN  + L     +    + +  E           ++       D L+G + L +   + 
Subjt:  VKGKLQGREVIVLIDCGATHNFILEGLVKELQINTK---ITSNYGVILGSGTTIKGKEVCEAIEIMLGDWRVVDEFLPLELGGVDALLGMQWLYSLGITE

Query:  VDWKNLNLTFTHQGKKVVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRALE-RRESLEEEDSFDEVLTIEESVEVVFKRFEDV-FEWPETLSPRRLI
        +++KN  +T   Q  K++     S     + ++   +S    DQ  + +    + R + L +E++F         ++ +  +F ++ ++  E L+    I
Subjt:  VDWKNLNLTFTHQGKKVVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRALE-RRESLEEEDSFDEVLTIEESVEVVFKRFEDV-FEWPETLSPRRLI

Query:  EHHIDLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKD-----GSWRFCVDYMALNNVTVLDKFPIPVIEELFDELHGA
        +H ++    + P+  + Y  A   + E+E  V+EML  G+I+ S SPY+SP  VV KK        +R  +DY  LN +T+ D++PIP ++E+  +L   
Subjt:  EHHIDLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKD-----GSWRFCVDYMALNNVTVLDKFPIPVIEELFDELHGA

Query:  TMFTKIDLKSGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRPYLQ-------------NKGLEDHLNHMKSVLEVLRKNELY
          FT IDL  G+HQI M  + I KTAF T  GHYE++ MPFGL NAP+TFQ  MN I RP L              +  L +HLN ++ V   L    L 
Subjt:  TMFTKIDLKSGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRPYLQ-------------NKGLEDHLNHMKSVLEVLRKNELY

Query:  AN-KKCSFAQFRVDYLGHIISGDGVEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLKIGGFKWTEETQ--GAFNRLQQAMM
            KC F +   ++LGHI++ DG++ +  K++AI  +PIP   +E+R FLGLTGYYRKF+ NY  IA P+T  LK      T++ +   AF +L+  ++
Subjt:  AN-KKCSFAQFRVDYLGHIISGDGVEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLKIGGFKWTEETQ--GAFNRLQQAMM

Query:  SLPVLALLDFSVPFEIETDASGYGLGAVLVQNKRPIAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFLVKTDQRSLKFL
          P+L L DF   F + TDAS   LGAVL QN  PI++ + TL          E+EL+A+V   + +R YLLG++FL+ +D + L++L
Subjt:  SLPVLALLDFSVPFEIETDASGYGLGAVLVQNKRPIAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFLVKTDQRSLKFL

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.3e-6738.12Show/hide
Query:  IEHHIDLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKDGSWRFCVDYMALNNVTVLDKFPIPVIEELFDELHGATMFT
        ++H I++K G     ++PY    + + E+ ++V+++L +  I PS SP SSPV++V KKDG++R CVDY  LN  T+ D FP+P I+ L   +  A +FT
Subjt:  IEHHIDLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKDGSWRFCVDYMALNNVTVLDKFPIPVIEELFDELHGATMFT

Query:  KIDLKSGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRP------YLQ-----NKGLEDHLNHMKSVLEVLR-KNELYANKKC
         +DL SGYHQI M   D  KTAF T  G YE+ VMPFGL NAPSTF   M   FR       YL      ++  E+H  H+ +VLE L+ +N +   KKC
Subjt:  KIDLKSGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRP------YLQ-----NKGLEDHLNHMKSVLEVLR-KNELYANKKC

Query:  SFAQFRVDYLGHIISGDGVEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLKIGGFKWTEETQGAFNRLQQAMMSLPVLALL
         FA    ++LG+ I    +     K  AI+++P P  V++ + FLG+  YYR+F+ N   IA P+ QL      +WTE+   A  +L+ A+ + PVL   
Subjt:  SFAQFRVDYLGHIISGDGVEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLKIGGFKWTEETQGAFNRLQQAMMSLPVLALL

Query:  DFSVPFEIETDASGYGLGAVL--VQNKRP----IAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFLVKTDQRSL
        +    + + TDAS  G+GAVL  V NK      + Y++ +L    +  P  E EL+ ++  +  +R  L GK F ++TD  SL
Subjt:  DFSVPFEIETDASGYGLGAVL--VQNKRP----IAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFLVKTDQRSL

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus3.3e-6837.03Show/hide
Query:  DPVNVRPYRYAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKK-----DGSWRFCVDYMALNNVTVLDKFPIPVIEELFDELHGATMFTKIDLKS
        DP+  + Y Y    + E+ER ++E+L  G+I+PS SPY+SP+ +V KK     +  +R  VD+  LN VT+ D +PIP I      L  A  FT +DL S
Subjt:  DPVNVRPYRYAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKK-----DGSWRFCVDYMALNNVTVLDKFPIPVIEELFDELHGATMFTKIDLKS

Query:  GYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRPYLQ-------------NKGLEDHLNHMKSVLEVLRKNELYAN-KKCSFAQ
        G+HQI M   DI KTAF T  G YEF+ +PFGL NAP+ FQ +++ I R ++              ++  + H  +++ VL  L K  L  N +K  F  
Subjt:  GYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRPYLQ-------------NKGLEDHLNHMKSVLEVLRKNELYAN-KKCSFAQ

Query:  FRVDYLGHIISGDGVEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLK--IGGFKWTEETQ----------GAFNRLQQAMM
         +V++LG+I++ DG++ D +K+RAI E P P +V+E++ FLG+T YYRKF+Q+Y  +A PLT L +      K ++ ++           +FN L+  + 
Subjt:  FRVDYLGHIISGDGVEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLK--IGGFKWTEETQ----------GAFNRLQQAMM

Query:  SLPVLALLDFSVPFEIETDASGYGLGAVLVQN----KRPIAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFL-VKTDQRSLKFLLEQR
        S  +LA   F+ PF + TDAS + +GAVL Q+     RPIAY + +L          E+E++A++ ++   R YL G   + V TD + L F L  R
Subjt:  SLPVLALLDFSVPFEIETDASGYGLGAVLVQN----KRPIAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFL-VKTDQRSLKFLLEQR

Q99315 Transposon Ty3-G Gag-Pol polyprotein2.8e-6736.21Show/hide
Query:  IEHHIDLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKDGSWRFCVDYMALNNVTVLDKFPIPVIEELFDELHGATMFT
        ++H I++K G     ++PY    + + E+ ++V+++L +  I PS SP SSPV++V KKDG++R CVDY  LN  T+ D FP+P I+ L   +  A +FT
Subjt:  IEHHIDLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKDGSWRFCVDYMALNNVTVLDKFPIPVIEELFDELHGATMFT

Query:  KIDLKSGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRP------YLQ-----NKGLEDHLNHMKSVLEVLR-KNELYANKKC
         +DL SGYHQI M   D  KTAF T  G YE+ VMPFGL NAPSTF   M   FR       YL      ++  E+H  H+ +VLE L+ +N +   KKC
Subjt:  KIDLKSGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRP------YLQ-----NKGLEDHLNHMKSVLEVLR-KNELYANKKC

Query:  SFAQFRVDYLGHIISGDGVEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLKIGGFKWTEETQGAFNRLQQAMMSLPVLALL
         FA    ++LG+ I    +     K  AI+++P P  V++ + FLG+  YYR+F+ N   IA P+ QL      +WTE+   A ++L+ A+ + PVL   
Subjt:  SFAQFRVDYLGHIISGDGVEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLKIGGFKWTEETQGAFNRLQQAMMSLPVLALL

Query:  DFSVPFEIETDASGYGLGAVL--VQNKRP----IAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFLVKTDQRSLKFLLE--------QRVI
        +    + + TDAS  G+GAVL  V NK      + Y++ +L    +  P  E EL+ ++  +  +R  L GK F ++TD  SL  L          QR +
Subjt:  DFSVPFEIETDASGYGLGAVL--VQNKRP----IAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFLVKTDQRSLKFLLE--------QRVI

Query:  QPQVPATVHLNQLTAPKVIDIEVIKAEV
                 L  L  PK +  + I   V
Subjt:  QPQVPATVHLNQLTAPKVIDIEVIKAEV

Arabidopsis top hitse value%identityAlignment
AT3G29750.1 Eukaryotic aspartyl protease family protein5.1e-1631.02Show/hide
Query:  EYEIIEEDSTEQKELNTIEVTGEEQTMVELSINSVVGLSNPGRMKVKGKLQGREVIVLIDCGATHNFILEGLVKELQINTKITSNYGVILGSGTTIKGKE
        E E +E+DS   ++       G EQ +++L+ N          M+  G +   +V+V ID GAT NFIL  L   L++ T IT+   V+LG    I+   
Subjt:  EYEIIEEDSTEQKELNTIEVTGEEQTMVELSINSVVGLSNPGRMKVKGKLQGREVIVLIDCGATHNFILEGLVKELQINTKITSNYGVILGSGTTIKGKE

Query:  VCEAIEIMLGDWRVVDEFLPLELG--GVDALLGMQWLYSLGITEVDWKNLNLTFTHQGKKVVIRGDPSLTKARVSLKNLMKSWGEED
         C  I + + +  + + FL L+L    VD +LG +WL  LG T V+W+N + +F+H  + + +  +    + +V+ K  MKS  E++
Subjt:  VCEAIEIMLGDWRVVDEFLPLELG--GVDALLGMQWLYSLGITEVDWKNLNLTFTHQGKKVVIRGDPSLTKARVSLKNLMKSWGEED

AT3G30770.1 Eukaryotic aspartyl protease family protein1.8e-0827.27Show/hide
Query:  QTMVELSINSVVGLSNPGRMKVKGKLQGREVIVLIDCGATHNFILEGLVKELQINTKITSNYGVILGSGTTIKGKEVCEAIEIMLGDWRVVDEFLPLEL-
        +T+ ++   S    +    M+  G +   +V+V+ID GAT+NFI + L   L++ T  T+   V+LG    I+    C  I +++ +  + + FL L+L 
Subjt:  QTMVELSINSVVGLSNPGRMKVKGKLQGREVIVLIDCGATHNFILEGLVKELQINTKITSNYGVILGSGTTIKGKEVCEAIEIMLGDWRVVDEFLPLEL-

Query:  -GGVDALLGMQWLYSLGITEVDWKNLNLTFTHQGKKVVI-RGDPSLTKARVSLKNLMKSWGEEDQ
           VD +LG     +L    + W N + +F H  + V +   D  L +    +K  MKS  E+++
Subjt:  -GGVDALLGMQWLYSLGITEVDWKNLNLTFTHQGKKVVI-RGDPSLTKARVSLKNLMKSWGEEDQ

AT3G42723.1 aminoacyl-tRNA ligases;ATP binding;nucleotide binding6.9e-0536.21Show/hide
Query:  KEVCEAIEIMLGDWRVVDEFL--PLELGGVDALLGMQWLYSLGITEVDWKNLNLTFTH
        K  C+ I + + D  +V+++    L+   VD +LG +WL  LG TEV+W+N + +F H
Subjt:  KEVCEAIEIMLGDWRVVDEFL--PLELGGVDALLGMQWLYSLGITEVDWKNLNLTFTH

ATMG00850.1 DNA/RNA polymerases superfamily protein8.2e-0653.85Show/hide
Query:  QKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKDGSW
        ++T ++  + EML + +I+PS+SPYSSPVL+V+KKDG W
Subjt:  QKTEMERLVEEMLASGVIKPSMSPYSSPVLVVRKKDGSW

ATMG00860.1 DNA/RNA polymerases superfamily protein9.3e-3453.44Show/hide
Query:  LNHMKSVLEVLRKNELYAN-KKCSFAQFRVDYLG--HIISGDGVEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLKIGGFK
        +NH+  VL++  +++ YAN KKC+F Q ++ YLG  HIISG+GV  D  K+ A+  WP P N  E+RGFLGLTGYYR+FV+NYG I  PLT+LLK    K
Subjt:  LNHMKSVLEVLRKNELYAN-KKCSFAQFRVDYLG--HIISGDGVEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLKIGGFK

Query:  WTEETQGAFNRLQQAMMSLPVLALLDFSVPF
        WTE    AF  L+ A+ +LPVLAL D  +PF
Subjt:  WTEETQGAFNRLQQAMMSLPVLALLDFSVPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGGAAGTTAATGGAGCTCCTTTCCGGCGTTGGGTGTTGTTAGGTTATTGTATCGGTGCCGGCAGGCTACAGGAAAAGTATTCGGACGGAGGTGAAGATGTTGGTGG
AGCAGAGATACTTGGACCGCAGAAGGTACTTCCGCCCTTTTTGCTACCTCTTCCTCTGTTTTACCACCAACCTCTTCAAATTCCGAAGTATCCTCTTTGCCCATTGTCTC
ATCTTGGTCTATTACCTTATTTCCTCATAAACTCTATTTCTGATTATCAAACCCTTTTTCGTGTTCTTGAAGCTTTTGGACTCTTCCCCTTGAATTCCAAAACTATTATT
GAAACGCATGCACCTATTATTCCAAAACTAAGCGTTCTAAGAGTTGATGTTTGTTCAAGAAAGTGGATTTGCGGGTTGAGAGTTATTGGATTGACGATTGAGGTTGGTAG
TTTCTACATCTTATCAACAATACGTTTCTGCTATTCTGTTCAATTGTTAACTGACATTTCTGCTAGTGTTACCGACAGTACTGCCTCTTTTCACATGCCGCTGCATTCTG
ATCCTATAAAGCAGCCGCACATCCATGGTACCAGGCTTGATCAACTCTACGATAAATCAAGGTTTGAAGTTGGTGAATCTTCGTCACAATCCAAACCAACCGACTGGATG
ATGTATTTCAAGAACTTGGGCATGGAAAGACAGAATCAAATGATGATACGATTTATGGAATCGGCGGCACAAGAGAGATCGACGATGAACGAGAAGATTATCGAATTGTC
GATGCGGAGTACTTTGATGAAGAGTAATAGCGAAGGCGAAGGATCATCAAGTCGCGAAAGTATACCGAAGAACGAAGAGAAGAAAACGAATGAGGGTAACAATGGCGATC
GAAATAAGTTCAAGAAAGTGGAGATGCCAATCTTCAATGGGGACGATCCCGATTCGTGGCTATTTCGCGTTGAAAGGGAGGGCTCATTGTATGGTCGATTCTTGCGTATA
CAACAGACGACGACCGTGGAGGAGTATCGAAATCTGTTTGATAAATTGGTAGCACCTTTATCCGACTTACCTGAGAAGGGATTATCACAAATGATGAGGATTGCTCAGAA
GGTAGAAAACAAAGAGGATATTAGAAGAGAAGCAAACCTTCCTGGATATTCAGGAGGAAAATTGAGCAGTTCTGGAAGCAATACCAAAGTAAATACGACTGTGAATTCTG
GAGATAATAAGGGGAGTATGAATTGGCCGATGAGGACAATTACATTGAGAGGGACTTCAAATGAAGAAGTACGAAAAGAAGGTCCCACAAAACAACTTTCTGATGCCGAG
TTTCAATCACGAAGAGAGAAGGGTCTATGTTTCCGATGCAATGAGAAATATTCTCATGACCATAAGTGTAAAACTAAGGAGCATAGGGAATTATGTATGCTGGTAGTAAT
GGGAGAGAATGTTGAATATGAAATCATTGAAGAAGATAGCACTGAACAGAAGGAGTTGAACACAATTGAGGTTACAGGAGAAGAACAAACAATGGTAGAATTGTCGATAA
ATTCAGTGGTGGGGCTATCCAATCCCGGAAGGATGAAGGTTAAGGGAAAGTTACAAGGAAGAGAAGTTATTGTGTTGATAGATTGTGGAGCTACACACAATTTTATCTTA
GAAGGTTTAGTTAAGGAGTTACAGATTAATACCAAGATTACCTCAAATTATGGGGTCATTTTGGGTTCGGGAACAACTATTAAAGGAAAAGAAGTTTGTGAAGCCATTGA
AATAATGTTAGGAGACTGGAGGGTGGTTGATGAATTCTTACCTTTGGAGTTAGGAGGGGTGGATGCACTATTGGGAATGCAATGGTTGTATTCTCTAGGTATAACCGAGG
TGGATTGGAAGAATTTGAACTTGACTTTTACACATCAAGGGAAGAAGGTGGTTATAAGGGGGGATCCAAGCCTAACAAAGGCAAGGGTCAGTTTAAAAAATCTCATGAAA
TCCTGGGGAGAAGAGGATCAGGGATTTTTAGTAGAGTGTCGTGCATTGGAAAGGAGAGAGTCATTGGAAGAGGAAGATTCGTTTGATGAAGTGTTGACTATAGAAGAATC
CGTGGAAGTGGTGTTCAAAAGATTTGAGGACGTCTTTGAATGGCCCGAGACATTATCTCCACGAAGATTGATAGAACATCATATCGATCTTAAAAAGGGAACTGATCCAG
TAAATGTTCGTCCTTATCGGTATGCATATCAACAAAAGACAGAGATGGAAAGATTAGTGGAAGAGATGCTAGCATCAGGGGTAATAAAGCCGAGTATGAGCCCTTATTCC
AGTCCAGTATTGGTGGTGAGAAAGAAGGATGGGAGCTGGCGCTTTTGTGTAGATTACATGGCTTTAAATAATGTAACTGTATTAGATAAGTTTCCAATACCAGTGATTGA
GGAGTTGTTTGATGAGTTACATGGAGCTACTATGTTCACTAAGATAGATCTTAAGTCAGGATACCATCAGATTAGGATGTGTGCAGATGATATTGAAAAGACAGCATTCC
GGACCCATGAGGGTCACTATGAATTTATGGTGATGCCATTTGGATTGACAAACGCACCATCCACGTTTCAGTCACTGATGAATACTATATTCAGACCATACCTTCAGAAC
AAAGGATTGGAAGATCATTTGAACCATATGAAATCAGTGTTAGAAGTATTGAGGAAGAATGAATTATACGCGAATAAGAAGTGTAGTTTTGCTCAGTTTCGAGTGGACTA
CCTGGGGCATATTATTTCGGGAGATGGAGTTGAAGTGGATCATGAGAAGATCAGGGCTATAAAGGAGTGGCCAATTCCAGTGAATGTAAGAGAGGTTCGAGGATTTCTTG
GGTTGACTGGATATTATCGCAAATTTGTTCAAAATTATGGAACAATTGCAGCTCCTCTAACACAGTTGCTGAAGATAGGAGGGTTCAAATGGACAGAGGAAACACAAGGG
GCTTTCAACAGGCTACAACAAGCGATGATGTCTCTTCCTGTATTGGCTCTACTTGATTTCAGTGTGCCATTTGAAATCGAAACCGATGCTTCAGGGTATGGATTAGGGGC
TGTATTGGTACAAAACAAGCGGCCAATTGCTTATTATAATCATACTTTGGCAATAAGAGGCCGGGTTAAACCTGTATACGAAAGGGAGCTGATGGCAGTAGTTATGACTG
TACAAAGGTGGCGTCCATACTTATTGGGAAAGAAGTTCCTAGTCAAAACTGATCAACGGTCTCTAAAGTTTTTGCTGGAGCAAAGGGTGATTCAACCTCAAGTACCAGCA
ACTGTGCATTTGAATCAATTAACAGCTCCAAAGGTGATTGATATAGAGGTGATTAAAGCAGAGGTGGATCAAGATGAGAAGTTGAAGAATATTAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGGGAAGTTAATGGAGCTCCTTTCCGGCGTTGGGTGTTGTTAGGTTATTGTATCGGTGCCGGCAGGCTACAGGAAAAGTATTCGGACGGAGGTGAAGATGTTGGTGG
AGCAGAGATACTTGGACCGCAGAAGGTACTTCCGCCCTTTTTGCTACCTCTTCCTCTGTTTTACCACCAACCTCTTCAAATTCCGAAGTATCCTCTTTGCCCATTGTCTC
ATCTTGGTCTATTACCTTATTTCCTCATAAACTCTATTTCTGATTATCAAACCCTTTTTCGTGTTCTTGAAGCTTTTGGACTCTTCCCCTTGAATTCCAAAACTATTATT
GAAACGCATGCACCTATTATTCCAAAACTAAGCGTTCTAAGAGTTGATGTTTGTTCAAGAAAGTGGATTTGCGGGTTGAGAGTTATTGGATTGACGATTGAGGTTGGTAG
TTTCTACATCTTATCAACAATACGTTTCTGCTATTCTGTTCAATTGTTAACTGACATTTCTGCTAGTGTTACCGACAGTACTGCCTCTTTTCACATGCCGCTGCATTCTG
ATCCTATAAAGCAGCCGCACATCCATGGTACCAGGCTTGATCAACTCTACGATAAATCAAGGTTTGAAGTTGGTGAATCTTCGTCACAATCCAAACCAACCGACTGGATG
ATGTATTTCAAGAACTTGGGCATGGAAAGACAGAATCAAATGATGATACGATTTATGGAATCGGCGGCACAAGAGAGATCGACGATGAACGAGAAGATTATCGAATTGTC
GATGCGGAGTACTTTGATGAAGAGTAATAGCGAAGGCGAAGGATCATCAAGTCGCGAAAGTATACCGAAGAACGAAGAGAAGAAAACGAATGAGGGTAACAATGGCGATC
GAAATAAGTTCAAGAAAGTGGAGATGCCAATCTTCAATGGGGACGATCCCGATTCGTGGCTATTTCGCGTTGAAAGGGAGGGCTCATTGTATGGTCGATTCTTGCGTATA
CAACAGACGACGACCGTGGAGGAGTATCGAAATCTGTTTGATAAATTGGTAGCACCTTTATCCGACTTACCTGAGAAGGGATTATCACAAATGATGAGGATTGCTCAGAA
GGTAGAAAACAAAGAGGATATTAGAAGAGAAGCAAACCTTCCTGGATATTCAGGAGGAAAATTGAGCAGTTCTGGAAGCAATACCAAAGTAAATACGACTGTGAATTCTG
GAGATAATAAGGGGAGTATGAATTGGCCGATGAGGACAATTACATTGAGAGGGACTTCAAATGAAGAAGTACGAAAAGAAGGTCCCACAAAACAACTTTCTGATGCCGAG
TTTCAATCACGAAGAGAGAAGGGTCTATGTTTCCGATGCAATGAGAAATATTCTCATGACCATAAGTGTAAAACTAAGGAGCATAGGGAATTATGTATGCTGGTAGTAAT
GGGAGAGAATGTTGAATATGAAATCATTGAAGAAGATAGCACTGAACAGAAGGAGTTGAACACAATTGAGGTTACAGGAGAAGAACAAACAATGGTAGAATTGTCGATAA
ATTCAGTGGTGGGGCTATCCAATCCCGGAAGGATGAAGGTTAAGGGAAAGTTACAAGGAAGAGAAGTTATTGTGTTGATAGATTGTGGAGCTACACACAATTTTATCTTA
GAAGGTTTAGTTAAGGAGTTACAGATTAATACCAAGATTACCTCAAATTATGGGGTCATTTTGGGTTCGGGAACAACTATTAAAGGAAAAGAAGTTTGTGAAGCCATTGA
AATAATGTTAGGAGACTGGAGGGTGGTTGATGAATTCTTACCTTTGGAGTTAGGAGGGGTGGATGCACTATTGGGAATGCAATGGTTGTATTCTCTAGGTATAACCGAGG
TGGATTGGAAGAATTTGAACTTGACTTTTACACATCAAGGGAAGAAGGTGGTTATAAGGGGGGATCCAAGCCTAACAAAGGCAAGGGTCAGTTTAAAAAATCTCATGAAA
TCCTGGGGAGAAGAGGATCAGGGATTTTTAGTAGAGTGTCGTGCATTGGAAAGGAGAGAGTCATTGGAAGAGGAAGATTCGTTTGATGAAGTGTTGACTATAGAAGAATC
CGTGGAAGTGGTGTTCAAAAGATTTGAGGACGTCTTTGAATGGCCCGAGACATTATCTCCACGAAGATTGATAGAACATCATATCGATCTTAAAAAGGGAACTGATCCAG
TAAATGTTCGTCCTTATCGGTATGCATATCAACAAAAGACAGAGATGGAAAGATTAGTGGAAGAGATGCTAGCATCAGGGGTAATAAAGCCGAGTATGAGCCCTTATTCC
AGTCCAGTATTGGTGGTGAGAAAGAAGGATGGGAGCTGGCGCTTTTGTGTAGATTACATGGCTTTAAATAATGTAACTGTATTAGATAAGTTTCCAATACCAGTGATTGA
GGAGTTGTTTGATGAGTTACATGGAGCTACTATGTTCACTAAGATAGATCTTAAGTCAGGATACCATCAGATTAGGATGTGTGCAGATGATATTGAAAAGACAGCATTCC
GGACCCATGAGGGTCACTATGAATTTATGGTGATGCCATTTGGATTGACAAACGCACCATCCACGTTTCAGTCACTGATGAATACTATATTCAGACCATACCTTCAGAAC
AAAGGATTGGAAGATCATTTGAACCATATGAAATCAGTGTTAGAAGTATTGAGGAAGAATGAATTATACGCGAATAAGAAGTGTAGTTTTGCTCAGTTTCGAGTGGACTA
CCTGGGGCATATTATTTCGGGAGATGGAGTTGAAGTGGATCATGAGAAGATCAGGGCTATAAAGGAGTGGCCAATTCCAGTGAATGTAAGAGAGGTTCGAGGATTTCTTG
GGTTGACTGGATATTATCGCAAATTTGTTCAAAATTATGGAACAATTGCAGCTCCTCTAACACAGTTGCTGAAGATAGGAGGGTTCAAATGGACAGAGGAAACACAAGGG
GCTTTCAACAGGCTACAACAAGCGATGATGTCTCTTCCTGTATTGGCTCTACTTGATTTCAGTGTGCCATTTGAAATCGAAACCGATGCTTCAGGGTATGGATTAGGGGC
TGTATTGGTACAAAACAAGCGGCCAATTGCTTATTATAATCATACTTTGGCAATAAGAGGCCGGGTTAAACCTGTATACGAAAGGGAGCTGATGGCAGTAGTTATGACTG
TACAAAGGTGGCGTCCATACTTATTGGGAAAGAAGTTCCTAGTCAAAACTGATCAACGGTCTCTAAAGTTTTTGCTGGAGCAAAGGGTGATTCAACCTCAAGTACCAGCA
ACTGTGCATTTGAATCAATTAACAGCTCCAAAGGTGATTGATATAGAGGTGATTAAAGCAGAGGTGGATCAAGATGAGAAGTTGAAGAATATTAAGTAG
Protein sequenceShow/hide protein sequence
MREVNGAPFRRWVLLGYCIGAGRLQEKYSDGGEDVGGAEILGPQKVLPPFLLPLPLFYHQPLQIPKYPLCPLSHLGLLPYFLINSISDYQTLFRVLEAFGLFPLNSKTII
ETHAPIIPKLSVLRVDVCSRKWICGLRVIGLTIEVGSFYILSTIRFCYSVQLLTDISASVTDSTASFHMPLHSDPIKQPHIHGTRLDQLYDKSRFEVGESSSQSKPTDWM
MYFKNLGMERQNQMMIRFMESAAQERSTMNEKIIELSMRSTLMKSNSEGEGSSSRESIPKNEEKKTNEGNNGDRNKFKKVEMPIFNGDDPDSWLFRVEREGSLYGRFLRI
QQTTTVEEYRNLFDKLVAPLSDLPEKGLSQMMRIAQKVENKEDIRREANLPGYSGGKLSSSGSNTKVNTTVNSGDNKGSMNWPMRTITLRGTSNEEVRKEGPTKQLSDAE
FQSRREKGLCFRCNEKYSHDHKCKTKEHRELCMLVVMGENVEYEIIEEDSTEQKELNTIEVTGEEQTMVELSINSVVGLSNPGRMKVKGKLQGREVIVLIDCGATHNFIL
EGLVKELQINTKITSNYGVILGSGTTIKGKEVCEAIEIMLGDWRVVDEFLPLELGGVDALLGMQWLYSLGITEVDWKNLNLTFTHQGKKVVIRGDPSLTKARVSLKNLMK
SWGEEDQGFLVECRALERRESLEEEDSFDEVLTIEESVEVVFKRFEDVFEWPETLSPRRLIEHHIDLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASGVIKPSMSPYS
SPVLVVRKKDGSWRFCVDYMALNNVTVLDKFPIPVIEELFDELHGATMFTKIDLKSGYHQIRMCADDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNTIFRPYLQN
KGLEDHLNHMKSVLEVLRKNELYANKKCSFAQFRVDYLGHIISGDGVEVDHEKIRAIKEWPIPVNVREVRGFLGLTGYYRKFVQNYGTIAAPLTQLLKIGGFKWTEETQG
AFNRLQQAMMSLPVLALLDFSVPFEIETDASGYGLGAVLVQNKRPIAYYNHTLAIRGRVKPVYERELMAVVMTVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQVPA
TVHLNQLTAPKVIDIEVIKAEVDQDEKLKNIK