; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G21530 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G21530
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr1:17101263..17103834
RNA-Seq ExpressionCSPI01G21530
SyntenyCSPI01G21530
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057179.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.6e-28159.88Show/hide
Query:  TNPQNSTKANTNSNTGENKGGTNWPMRTITLRGTSKEEVPKEGPSKRLSDAEFQARKEKVLCFRCNEQYSHDHKCKTKELRELRMLVVNGDDEEYEIIEE
        T     TK        ENK  T +P+RTITL+  +  E  KEG SKRL DAEFQ R+EK LCF+CNE+YS DHKCK +E REL+M VVN ++EE EI+EE
Subjt:  TNPQNSTKANTNSNTGENKGGTNWPMRTITLRGTSKEEVPKEGPSKRLSDAEFQARKEKVLCFRCNEQYSHDHKCKTKELRELRMLVVNGDDEEYEIIEE

Query:  GEETMKELKTIEIMEESQAIVELSINSVVGLSNPRTMKVKGRIQERDVVVLIDCGATHNFVSEILVKELQLETKETSNYGAILGSGTAVKGKGICKDVEL
         E    EL+T+E+  ++ A VELSINSVVGL++P TMKV+G +Q+++VV+LIDCGATHNFVSE LV  LQL  KET++YG ILGSGT ++GKGIC+ VE+
Subjt:  GEETMKELKTIEIMEESQAIVELSINSVVGLSNPRTMKVKGRIQERDVVVLIDCGATHNFVSEILVKELQLETKETSNYGAILGSGTAVKGKGICKDVEL

Query:  MIGKWKVIDEFLPLELEGVDAILGMK--------------------------------ALERCEIS--------EEED------------SIEEVLT---
         +  W V ++FLPLEL GVD ILGM+                                +L +  +S        EE D             I E +T   
Subjt:  MIGKWKVIDEFLPLELEGVDAILGMK--------------------------------ALERCEIS--------EEED------------SIEEVLT---

Query:  -----EEESVAVVLKRFDDVFDWPETLPPRRVIEHYIHLKEGVNPVNVRPYRYAYQQKTEMEKLVEEMLSSGIIRPNTSPYSSPVLLVRKKDDSWRFCVD
              EE +  +L +F+D+F+WPE LPPRR IEH IHLKEG NPVNVRPYRYAY QK EME+LV EML+SGIIRP+ SPYSSPVLLV+KKD SWRFCVD
Subjt:  -----EEESVAVVLKRFDDVFDWPETLPPRRVIEHYIHLKEGVNPVNVRPYRYAYQQKTEMEKLVEEMLSSGIIRPNTSPYSSPVLLVRKKDDSWRFCVD

Query:  YRALNNVTVPDKFPIPVIEELFDELNGAVIFTKLDLKTGYHQIRMNFDDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNVVFKPFLRKFVLVFFDD
        YRALNNVTVPDKFPIPV+EELFDEL GA +FTK+DLK GYHQIRM   DIEKTAFRTHEGHYEF+VMPFGLTNAP+TFQSLMN +F+P+LR+FVLVFFDD
Subjt:  YRALNNVTVPDKFPIPVIEELFDELNGAVIFTKLDLKTGYHQIRMNFDDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNVVFKPFLRKFVLVFFDD

Query:  ILIYSRDLKAHLDHLKAVLEILRKNELYVNKKKCSFAKTRVEYLGHIISGRGIEVDPEKIRAIKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLT
        ILIYSR+L+ +L H++ V  +LRK+EL+ N+KKCSF   +VEYLGH+IS +G+EVDPEKI+AI ++P P +VRE RGFLGLTGYYRKFV HYGT+AAPLT
Subjt:  ILIYSRDLKAHLDHLKAVLEILRKNELYVNKKKCSFAKTRVEYLGHIISGRGIEVDPEKIRAIKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLT

Query:  QLLKKGGFRWTEEAQEAFVRLQRTMMTLLVLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHILAVRDRVKPVYERELMAVVMAVQRWRPYLLGK
        QLLKKGGF W  EA++AF +L++ M+ L VLA+P F  PFEIETDASGYG+GAVLIQN+RPIA+YSH LA+RDR +PVYERELMAVV+AVQRWRPYLLG 
Subjt:  QLLKKGGFRWTEEAQEAFVRLQRTMMTLLVLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHILAVRDRVKPVYERELMAVVMAVQRWRPYLLGK

Query:  RFKVKTDQRSLKFLLEQRVIQP-------------------------------RVPPTVHLNHLIAPNIIDVAVIKEEVNQDEKFKKIREELAEKGEEQS
        RF V+TDQ+SLKFLLEQRV+QP                               R+ PTV +  +  P  +D+  IKEEV +D K  KI  E+      Q 
Subjt:  RFKVKTDQRSLKFLLEQRVIQP-------------------------------RVPPTVHLNHLIAPNIIDVAVIKEEVNQDEKFKKIREELAEKGEEQS

Query:  SKYSMKQGMLMYKNRMVISKTSKLIPMILHTLHDSVFGGHSGFLRTYKRLTGELY
        SK+ M  GML YK+R+VIS+TSKLIP ILH+ HDS  GGHSGFLRTYKR++GELY
Subjt:  SKYSMKQGMLMYKNRMVISKTSKLIPMILHTLHDSVFGGHSGFLRTYKRLTGELY

KAA0062868.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.0e-28059.77Show/hide
Query:  TNPQNSTKANTNSNTGENKGGTNWPMRTITLRGTSKEEVPKEGPSKRLSDAEFQARKEKVLCFRCNEQYSHDHKCKTKELRELRMLVVNGDDEEYEIIEE
        T     TK   +    ENK    +P+RTITL+  +  E  KEG SKRL DAEFQ R+EK LCF+CNE+YS DHKCK +E RELRM VV  ++EE EI+EE
Subjt:  TNPQNSTKANTNSNTGENKGGTNWPMRTITLRGTSKEEVPKEGPSKRLSDAEFQARKEKVLCFRCNEQYSHDHKCKTKELRELRMLVVNGDDEEYEIIEE

Query:  GEETMKELKTIEIMEESQAIVELSINSVVGLSNPRTMKVKGRIQERDVVVLIDCGATHNFVSEILVKELQLETKETSNYGAILGSGTAVKGKGICKDVEL
         E    EL+T+E+  ++ A VELSINSVVGL++P TMKV+G +Q ++VV+LIDCGATHNFVSE LV  LQL  KET++YG ILGSGTA++GKGIC+ +E+
Subjt:  GEETMKELKTIEIMEESQAIVELSINSVVGLSNPRTMKVKGRIQERDVVVLIDCGATHNFVSEILVKELQLETKETSNYGAILGSGTAVKGKGICKDVEL

Query:  MIGKWKVIDEFLPLELEGVDAILGMK--------------------------------ALERCEIS--------EEED------------SIEEVLT---
         +  W V ++FLPLEL GVD ILGM+                                +L +  +S        EE D             I E +T   
Subjt:  MIGKWKVIDEFLPLELEGVDAILGMK--------------------------------ALERCEIS--------EEED------------SIEEVLT---

Query:  -----EEESVAVVLKRFDDVFDWPETLPPRRVIEHYIHLKEGVNPVNVRPYRYAYQQKTEMEKLVEEMLSSGIIRPNTSPYSSPVLLVRKKDDSWRFCVD
              EE +  +L +F D+F+WPE LPPRR IEH IHLKEG NPVNVRPYRYAY QK EME+LV EML+SGIIRP+ SPYSSPVLLV+KKD SWRFCVD
Subjt:  -----EEESVAVVLKRFDDVFDWPETLPPRRVIEHYIHLKEGVNPVNVRPYRYAYQQKTEMEKLVEEMLSSGIIRPNTSPYSSPVLLVRKKDDSWRFCVD

Query:  YRALNNVTVPDKFPIPVIEELFDELNGAVIFTKLDLKTGYHQIRMNFDDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNVVFKPFLRKFVLVFFDD
        YRALNNVTVPDKFPIPV+EELFDEL GA +FTK+DLK GYHQIRM   DIEKTAFRTHEGHYEF+VMPFGLTNAP+TFQSLMN +F+P+LR+FVLVFFDD
Subjt:  YRALNNVTVPDKFPIPVIEELFDELNGAVIFTKLDLKTGYHQIRMNFDDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNVVFKPFLRKFVLVFFDD

Query:  ILIYSRDLKAHLDHLKAVLEILRKNELYVNKKKCSFAKTRVEYLGHIISGRGIEVDPEKIRAIKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLT
        ILIYSR+L+ HL H++ V  +LRK+EL+ N+KKCSF   +VEYLGH+IS +G+EVDPEKI+AI ++P P +VRE RGFLGLTGYYRKFV HYGT+AAPLT
Subjt:  ILIYSRDLKAHLDHLKAVLEILRKNELYVNKKKCSFAKTRVEYLGHIISGRGIEVDPEKIRAIKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLT

Query:  QLLKKGGFRWTEEAQEAFVRLQRTMMTLLVLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHILAVRDRVKPVYERELMAVVMAVQRWRPYLLGK
        QLLKKGGF W  EA++AF +L++ M+ L VLA+P F  PFEIETDASGYG+GAVLIQN+RPIA+YSH LA+RDR +PVYERELMAVV+AVQRWRPYLLG 
Subjt:  QLLKKGGFRWTEEAQEAFVRLQRTMMTLLVLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHILAVRDRVKPVYERELMAVVMAVQRWRPYLLGK

Query:  RFKVKTDQRSLKFLLEQRVIQP-------------------------------RVPPTVHLNHLIAPNIIDVAVIKEEVNQDEKFKKIREELAEKGEEQS
        RF V+TDQ+SLKFLLEQRV+QP                               R+ PTV +  +  P  +D+ +IKEEV +D K  KI  E+      Q 
Subjt:  RFKVKTDQRSLKFLLEQRVIQP-------------------------------RVPPTVHLNHLIAPNIIDVAVIKEEVNQDEKFKKIREELAEKGEEQS

Query:  SKYSMKQGMLMYKNRMVISKTSKLIPMILHTLHDSVFGGHSGFLRTYKRLTGELY
        SK+ +  GML YK+R+VIS+TSKLIP ILH+ HDS  GGHSGFLRTYKR++GELY
Subjt:  SKYSMKQGMLMYKNRMVISKTSKLIPMILHTLHDSVFGGHSGFLRTYKRLTGELY

TYJ96875.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.3e-28059.77Show/hide
Query:  TNPQNSTKANTNSNTGENKGGTNWPMRTITLRGTSKEEVPKEGPSKRLSDAEFQARKEKVLCFRCNEQYSHDHKCKTKELRELRMLVVNGDDEEYEIIEE
        T     TK   +    ENK    +P+RTITL+  +  E  KEG SKRL DAEFQ R+EK LCF+CNE+YS DHKCK +E RELRM VV  ++EE EI+EE
Subjt:  TNPQNSTKANTNSNTGENKGGTNWPMRTITLRGTSKEEVPKEGPSKRLSDAEFQARKEKVLCFRCNEQYSHDHKCKTKELRELRMLVVNGDDEEYEIIEE

Query:  GEETMKELKTIEIMEESQAIVELSINSVVGLSNPRTMKVKGRIQERDVVVLIDCGATHNFVSEILVKELQLETKETSNYGAILGSGTAVKGKGICKDVEL
         E    EL+T+E+  ++ A VELSINSVVGL++P TMKV+G +Q ++VV+LIDCGATHNFVSE LV  LQL  KET++YG ILGSGTA++GKGIC+ +E+
Subjt:  GEETMKELKTIEIMEESQAIVELSINSVVGLSNPRTMKVKGRIQERDVVVLIDCGATHNFVSEILVKELQLETKETSNYGAILGSGTAVKGKGICKDVEL

Query:  MIGKWKVIDEFLPLELEGVDAILGMK--------------------------------ALERCEIS--------EEED------------SIEEVLT---
         +  W V ++FLPLEL GVD ILGM+                                +L +  +S        EE D             I E +T   
Subjt:  MIGKWKVIDEFLPLELEGVDAILGMK--------------------------------ALERCEIS--------EEED------------SIEEVLT---

Query:  -----EEESVAVVLKRFDDVFDWPETLPPRRVIEHYIHLKEGVNPVNVRPYRYAYQQKTEMEKLVEEMLSSGIIRPNTSPYSSPVLLVRKKDDSWRFCVD
              EE +  +L +F D+F+WPE LPPRR IEH IHLKEG NPVNVRPYRYAY QK EME+LV EML+SGIIRP+ SPYSSPVLLV+KKD SWRFCVD
Subjt:  -----EEESVAVVLKRFDDVFDWPETLPPRRVIEHYIHLKEGVNPVNVRPYRYAYQQKTEMEKLVEEMLSSGIIRPNTSPYSSPVLLVRKKDDSWRFCVD

Query:  YRALNNVTVPDKFPIPVIEELFDELNGAVIFTKLDLKTGYHQIRMNFDDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNVVFKPFLRKFVLVFFDD
        YRALNNVTVPDKFPIPV+EELFDEL GA +FTK+DLK GYHQIRM   DIEKTAFRTHEGHYEF+VMPFGLTNAP+TFQSLMN +F+P+LR+FVLVFFDD
Subjt:  YRALNNVTVPDKFPIPVIEELFDELNGAVIFTKLDLKTGYHQIRMNFDDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNVVFKPFLRKFVLVFFDD

Query:  ILIYSRDLKAHLDHLKAVLEILRKNELYVNKKKCSFAKTRVEYLGHIISGRGIEVDPEKIRAIKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLT
        ILIYSR+L+ HL H++ V  +LRK+EL+ N+KKCSF   +VEYLGH+IS +G+EVDPEKI+AI ++P P +VRE RGFLGLTGYYRKFV HYGT+AAPLT
Subjt:  ILIYSRDLKAHLDHLKAVLEILRKNELYVNKKKCSFAKTRVEYLGHIISGRGIEVDPEKIRAIKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLT

Query:  QLLKKGGFRWTEEAQEAFVRLQRTMMTLLVLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHILAVRDRVKPVYERELMAVVMAVQRWRPYLLGK
        QLLKKGGF W  EA++AF +L++ M+ L VLA+P F  PFEIETDASGYG+GAVLIQN+RPIA+YSH LA+RDR +PVYERELMAVV+AVQRWRPYLLG 
Subjt:  QLLKKGGFRWTEEAQEAFVRLQRTMMTLLVLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHILAVRDRVKPVYERELMAVVMAVQRWRPYLLGK

Query:  RFKVKTDQRSLKFLLEQRVIQP-------------------------------RVPPTVHLNHLIAPNIIDVAVIKEEVNQDEKFKKIREELAEKGEEQS
        RF V+TDQ+SLKFLLEQRV+QP                               R+ PTV +  +  P  +D+ +IKEEV +D K  KI  E+      Q 
Subjt:  RFKVKTDQRSLKFLLEQRVIQP-------------------------------RVPPTVHLNHLIAPNIIDVAVIKEEVNQDEKFKKIREELAEKGEEQS

Query:  SKYSMKQGMLMYKNRMVISKTSKLIPMILHTLHDSVFGGHSGFLRTYKRLTGELY
        SK+ +  GML YK+R+VIS+TSKLIP ILH+ HDS  GGHSGFLRTYKR++GELY
Subjt:  SKYSMKQGMLMYKNRMVISKTSKLIPMILHTLHDSVFGGHSGFLRTYKRLTGELY

TYK02195.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.4e-28560.33Show/hide
Query:  QNSTKANTNSNT-GENKGGTNWPMRTITLRGTSKEEVPKEGPSKRLSDAEFQARKEKVLCFRCNEQYSHDHKCKTKELRELRMLVVNGDDEEYEIIEEGE
        QN+  + T  NT  ++K  TN+PMRTITLR ++  E+ K+G S+RL DAEFQARKEK LCFRCNE+YS DH+CK KELREL+M VV  + EEYEIIEE  
Subjt:  QNSTKANTNSNT-GENKGGTNWPMRTITLRGTSKEEVPKEGPSKRLSDAEFQARKEKVLCFRCNEQYSHDHKCKTKELRELRMLVVNGDDEEYEIIEEGE

Query:  ETMKELKTIEIMEESQAIVELSINSVVGLSNPRTMKVKGRIQERDVVVLIDCGATHNFVSEILVKELQLETKETSNYGAILGSGTAVKGKGICKDVELMI
           K L+  +  E+ +   ELS+NSVVGL++P TMKVKG+IQER+V++LIDCGATHNF+SE LVK LQL  KET +YG ILGSGTAV+GKGIC++VE+ +
Subjt:  ETMKELKTIEIMEESQAIVELSINSVVGLSNPRTMKVKGRIQERDVVVLIDCGATHNFVSEILVKELQLETKETSNYGAILGSGTAVKGKGICKDVELMI

Query:  GKWKVIDEFLPLELEGVDAILGMKALERCEI----------------------------------------------------------SEEEDSIEEVL
          WKV +EFLPLEL GVD +LGM+ L    I                                                          +E E S   V 
Subjt:  GKWKVIDEFLPLELEGVDAILGMKALERCEI----------------------------------------------------------SEEEDSIEEVL

Query:  T-EEESVAVVLKRFDDVFDWPETLPPRRVIEHYIHLKEGVNPVNVRPYRYAYQQKTEMEKLVEEMLSSGIIRPNTSPYSSPVLLVRKKDDSWRFCVDYRA
        T  +E +  VLK+F+DVFDWPE LPPRR IEH IHLKEG NP+NVRPYRY +QQK EME+LVEEML+SGIIRP+ SP+SSPVLLV+KKD SWRFCVDYRA
Subjt:  T-EEESVAVVLKRFDDVFDWPETLPPRRVIEHYIHLKEGVNPVNVRPYRYAYQQKTEMEKLVEEMLSSGIIRPNTSPYSSPVLLVRKKDDSWRFCVDYRA

Query:  LNNVTVPDKFPIPVIEELFDELNGAVIFTKLDLKTGYHQIRMNFDDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNVVFKPFLRKFVLVFFDDILI
        +NN T+PDKFPIPV EELFDELNGA +F+K+DLK+GYHQIRM   DI KTAFRTHEGHYEF+VMPFGLTNAP+TFQ+LMN +F+PFLRKFVLVFFDDILI
Subjt:  LNNVTVPDKFPIPVIEELFDELNGAVIFTKLDLKTGYHQIRMNFDDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNVVFKPFLRKFVLVFFDDILI

Query:  YSRDLKAHLDHLKAVLEILRKNELYVNKKKCSFAKTRVEYLGHIISGRGIEVDPEKIRAIKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLTQLL
        YS++ K H++H++ V   LRK+ L+ NKKKC+F + +VEYLGHIISG G+EVD EKI+A+ ++P P N+REVRGFLGLTGYYR+FV+HYG+IAAPLTQLL
Subjt:  YSRDLKAHLDHLKAVLEILRKNELYVNKKKCSFAKTRVEYLGHIISGRGIEVDPEKIRAIKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLTQLL

Query:  KKGGFRWTEEAQEAFVRLQRTMMTLLVLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHILAVRDRVKPVYERELMAVVMAVQRWRPYLLGKRFK
        KKGGF+W E+A+E+F +L+  MM+L  LA+P+F++PFEIETDASG+G+GAVLIQ++RPIA+YSH L++RDR +PVYERELMAVV++VQRWRPYLLG +F 
Subjt:  KKGGFRWTEEAQEAFVRLQRTMMTLLVLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHILAVRDRVKPVYERELMAVVMAVQRWRPYLLGKRFK

Query:  VKTDQRSLKFLLEQRVIQP-------------------------------RVPPTVHLNHLIAPNIIDVAVIKEEVNQDEKFKKIREELAEKGEEQSSKY
        VKTDQ+SLKFLLEQRVIQP                               R PP + LN +  P  +D+  IKEEV +DEK KKI   L E+ E+Q+SK+
Subjt:  VKTDQRSLKFLLEQRVIQP-------------------------------RVPPTVHLNHLIAPNIIDVAVIKEEVNQDEKFKKIREELAEKGEEQSSKY

Query:  SMKQGMLMYKNRMVISKTSKLIPMILHTLHDSVFGGHSGFLRTYKRLTGELY
        ++K  +L YKNR+VISK S LIP IL+T HDSV  GHSGFLRTYKRL  ELY
Subjt:  SMKQGMLMYKNRMVISKTSKLIPMILHTLHDSVFGGHSGFLRTYKRLTGELY

TYK23779.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.5e-28963.18Show/hide
Query:  QNSTKANTNSNT-GENKGGTNWPMRTITLRGTSKEEVPKEGPSKRLSDAEFQARKEKVLCFRCNEQYSHDHKCKTKELRELRMLVVNGDDEEYEIIEEGE
        QN+  + T +NT  ++K  TN+PMRTITLR ++  E+ K+G S+RL DAEFQA+KEK LCFRCNE+YS DH+CK KELREL+M VV  + EEYEIIEE  
Subjt:  QNSTKANTNSNT-GENKGGTNWPMRTITLRGTSKEEVPKEGPSKRLSDAEFQARKEKVLCFRCNEQYSHDHKCKTKELRELRMLVVNGDDEEYEIIEEGE

Query:  ETMKELKTIEIMEESQAIVELSINSVVGLSNPRTMKVKGRIQERDVVVLIDCGATHNFVSEILVKELQLETKETSNYGAILGSGTAVKGKGICKDVELMI
           K L++ +  E+ +   ELS+NSVVGL++P TMKVK +IQER+V++LIDCGATHNF+SE LV+ LQL  KET +YG ILGSGTAV+GKGIC++VE+ +
Subjt:  ETMKELKTIEIMEESQAIVELSINSVVGLSNPRTMKVKGRIQERDVVVLIDCGATHNFVSEILVKELQLETKETSNYGAILGSGTAVKGKGICKDVELMI

Query:  GKWKVIDEFLPLELEGVDAILGMKALERCEISEEEDSIEEVLTEEESVAVVLKRFDDVFDWPETLPPRRVIEHYIHLKEGVNPVNVRPYRYAYQQKTEME
          WKV +EF+PLEL GVD +LG        I +   + ++    +E +  VLK+F+DVFDWPE LPPRR IEH IHLKEG +P+NVRPYRY +QQK EME
Subjt:  GKWKVIDEFLPLELEGVDAILGMKALERCEISEEEDSIEEVLTEEESVAVVLKRFDDVFDWPETLPPRRVIEHYIHLKEGVNPVNVRPYRYAYQQKTEME

Query:  KLVEEMLSSGIIRPNTSPYSSPVLLVRKKDDSWRFCVDYRALNNVTVPDKFPIPVIEELFDELNGAVIFTKLDLKTGYHQIRMNFDDIEKTAFRTHEGHY
        +LVEEML+SGIIRP+ SP+SSPVLLV+KKD SWRFCVDYR +NN T+PDKFPIPV EELFDELNGA IF+K+DLK+GYHQIRM   DI KTAFRTHEGHY
Subjt:  KLVEEMLSSGIIRPNTSPYSSPVLLVRKKDDSWRFCVDYRALNNVTVPDKFPIPVIEELFDELNGAVIFTKLDLKTGYHQIRMNFDDIEKTAFRTHEGHY

Query:  EFMVMPFGLTNAPSTFQSLMNVVFKPFLRKFVLVFFDDILIYSRDLKAHLDHLKAVLEILRKNELYVNKKKCSFAKTRVEYLGHIISGRGIEVDPEKIRA
        EF+VMPFGLTNAP+TFQ+LMN +F+PFLRKFVLVFFDDILIYS++ K H++H++ V   LRK+ L+ NKKKC+F + +VEYLGHIISG G+EVD EKI+ 
Subjt:  EFMVMPFGLTNAPSTFQSLMNVVFKPFLRKFVLVFFDDILIYSRDLKAHLDHLKAVLEILRKNELYVNKKKCSFAKTRVEYLGHIISGRGIEVDPEKIRA

Query:  IKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLTQLLKKGGFRWTEEAQEAFVRLQRTMMTLLVLAMPDFSIPFEIETDASGYGLGAVLIQNQRPI
        + ++P P N+REVRGFLGLTGYYR+FV+HYG+IAAPLTQLLKKGGF+W E+A+E+F +L+  MM+L  LA+P+F++PFEIETDASG+G+GAVLIQ++RPI
Subjt:  IKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLTQLLKKGGFRWTEEAQEAFVRLQRTMMTLLVLAMPDFSIPFEIETDASGYGLGAVLIQNQRPI

Query:  AYYSHILAVRDRVKPVYERELMAVVMAVQRWRPYLLGKRFKVKTDQRSLKFLLEQRVIQP-------------------------------RVPPTVHLN
        A+YSH L++RDR +PVYERELMAVV++VQRWRPYLLG +F VKTDQ+SLKFLLEQRVIQP                               R PP + LN
Subjt:  AYYSHILAVRDRVKPVYERELMAVVMAVQRWRPYLLGKRFKVKTDQRSLKFLLEQRVIQP-------------------------------RVPPTVHLN

Query:  HLIAPNIIDVAVIKEEVNQDEKFKKIREELAEKGEEQSSKYSMKQGMLMYKNRMVISKTSKLIPMILHTLHDSVFGGHSGFLRTYKRLTGELY
         +  P  +D+  IKEEV +DEK KKI   L E+ E+Q+SK+++K  +L YKNR+VISK S LIP I++T HDSV GGHSGFLRTYKRL  ELY
Subjt:  HLIAPNIIDVAVIKEEVNQDEKFKKIREELAEKGEEQSSKYSMKQGMLMYKNRMVISKTSKLIPMILHTLHDSVFGGHSGFLRTYKRLTGELY

TrEMBL top hitse value%identityAlignment
A0A5A7V5H5 Ty3/gypsy retrotransposon protein5.0e-28159.77Show/hide
Query:  TNPQNSTKANTNSNTGENKGGTNWPMRTITLRGTSKEEVPKEGPSKRLSDAEFQARKEKVLCFRCNEQYSHDHKCKTKELRELRMLVVNGDDEEYEIIEE
        T     TK   +    ENK    +P+RTITL+  +  E  KEG SKRL DAEFQ R+EK LCF+CNE+YS DHKCK +E RELRM VV  ++EE EI+EE
Subjt:  TNPQNSTKANTNSNTGENKGGTNWPMRTITLRGTSKEEVPKEGPSKRLSDAEFQARKEKVLCFRCNEQYSHDHKCKTKELRELRMLVVNGDDEEYEIIEE

Query:  GEETMKELKTIEIMEESQAIVELSINSVVGLSNPRTMKVKGRIQERDVVVLIDCGATHNFVSEILVKELQLETKETSNYGAILGSGTAVKGKGICKDVEL
         E    EL+T+E+  ++ A VELSINSVVGL++P TMKV+G +Q ++VV+LIDCGATHNFVSE LV  LQL  KET++YG ILGSGTA++GKGIC+ +E+
Subjt:  GEETMKELKTIEIMEESQAIVELSINSVVGLSNPRTMKVKGRIQERDVVVLIDCGATHNFVSEILVKELQLETKETSNYGAILGSGTAVKGKGICKDVEL

Query:  MIGKWKVIDEFLPLELEGVDAILGMK--------------------------------ALERCEIS--------EEED------------SIEEVLT---
         +  W V ++FLPLEL GVD ILGM+                                +L +  +S        EE D             I E +T   
Subjt:  MIGKWKVIDEFLPLELEGVDAILGMK--------------------------------ALERCEIS--------EEED------------SIEEVLT---

Query:  -----EEESVAVVLKRFDDVFDWPETLPPRRVIEHYIHLKEGVNPVNVRPYRYAYQQKTEMEKLVEEMLSSGIIRPNTSPYSSPVLLVRKKDDSWRFCVD
              EE +  +L +F D+F+WPE LPPRR IEH IHLKEG NPVNVRPYRYAY QK EME+LV EML+SGIIRP+ SPYSSPVLLV+KKD SWRFCVD
Subjt:  -----EEESVAVVLKRFDDVFDWPETLPPRRVIEHYIHLKEGVNPVNVRPYRYAYQQKTEMEKLVEEMLSSGIIRPNTSPYSSPVLLVRKKDDSWRFCVD

Query:  YRALNNVTVPDKFPIPVIEELFDELNGAVIFTKLDLKTGYHQIRMNFDDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNVVFKPFLRKFVLVFFDD
        YRALNNVTVPDKFPIPV+EELFDEL GA +FTK+DLK GYHQIRM   DIEKTAFRTHEGHYEF+VMPFGLTNAP+TFQSLMN +F+P+LR+FVLVFFDD
Subjt:  YRALNNVTVPDKFPIPVIEELFDELNGAVIFTKLDLKTGYHQIRMNFDDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNVVFKPFLRKFVLVFFDD

Query:  ILIYSRDLKAHLDHLKAVLEILRKNELYVNKKKCSFAKTRVEYLGHIISGRGIEVDPEKIRAIKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLT
        ILIYSR+L+ HL H++ V  +LRK+EL+ N+KKCSF   +VEYLGH+IS +G+EVDPEKI+AI ++P P +VRE RGFLGLTGYYRKFV HYGT+AAPLT
Subjt:  ILIYSRDLKAHLDHLKAVLEILRKNELYVNKKKCSFAKTRVEYLGHIISGRGIEVDPEKIRAIKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLT

Query:  QLLKKGGFRWTEEAQEAFVRLQRTMMTLLVLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHILAVRDRVKPVYERELMAVVMAVQRWRPYLLGK
        QLLKKGGF W  EA++AF +L++ M+ L VLA+P F  PFEIETDASGYG+GAVLIQN+RPIA+YSH LA+RDR +PVYERELMAVV+AVQRWRPYLLG 
Subjt:  QLLKKGGFRWTEEAQEAFVRLQRTMMTLLVLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHILAVRDRVKPVYERELMAVVMAVQRWRPYLLGK

Query:  RFKVKTDQRSLKFLLEQRVIQP-------------------------------RVPPTVHLNHLIAPNIIDVAVIKEEVNQDEKFKKIREELAEKGEEQS
        RF V+TDQ+SLKFLLEQRV+QP                               R+ PTV +  +  P  +D+ +IKEEV +D K  KI  E+      Q 
Subjt:  RFKVKTDQRSLKFLLEQRVIQP-------------------------------RVPPTVHLNHLIAPNIIDVAVIKEEVNQDEKFKKIREELAEKGEEQS

Query:  SKYSMKQGMLMYKNRMVISKTSKLIPMILHTLHDSVFGGHSGFLRTYKRLTGELY
        SK+ +  GML YK+R+VIS+TSKLIP ILH+ HDS  GGHSGFLRTYKR++GELY
Subjt:  SKYSMKQGMLMYKNRMVISKTSKLIPMILHTLHDSVFGGHSGFLRTYKRLTGELY

A0A5D3BEL2 Ty3/gypsy retrotransposon protein6.5e-28159.77Show/hide
Query:  TNPQNSTKANTNSNTGENKGGTNWPMRTITLRGTSKEEVPKEGPSKRLSDAEFQARKEKVLCFRCNEQYSHDHKCKTKELRELRMLVVNGDDEEYEIIEE
        T     TK   +    ENK    +P+RTITL+  +  E  KEG SKRL DAEFQ R+EK LCF+CNE+YS DHKCK +E RELRM VV  ++EE EI+EE
Subjt:  TNPQNSTKANTNSNTGENKGGTNWPMRTITLRGTSKEEVPKEGPSKRLSDAEFQARKEKVLCFRCNEQYSHDHKCKTKELRELRMLVVNGDDEEYEIIEE

Query:  GEETMKELKTIEIMEESQAIVELSINSVVGLSNPRTMKVKGRIQERDVVVLIDCGATHNFVSEILVKELQLETKETSNYGAILGSGTAVKGKGICKDVEL
         E    EL+T+E+  ++ A VELSINSVVGL++P TMKV+G +Q ++VV+LIDCGATHNFVSE LV  LQL  KET++YG ILGSGTA++GKGIC+ +E+
Subjt:  GEETMKELKTIEIMEESQAIVELSINSVVGLSNPRTMKVKGRIQERDVVVLIDCGATHNFVSEILVKELQLETKETSNYGAILGSGTAVKGKGICKDVEL

Query:  MIGKWKVIDEFLPLELEGVDAILGMK--------------------------------ALERCEIS--------EEED------------SIEEVLT---
         +  W V ++FLPLEL GVD ILGM+                                +L +  +S        EE D             I E +T   
Subjt:  MIGKWKVIDEFLPLELEGVDAILGMK--------------------------------ALERCEIS--------EEED------------SIEEVLT---

Query:  -----EEESVAVVLKRFDDVFDWPETLPPRRVIEHYIHLKEGVNPVNVRPYRYAYQQKTEMEKLVEEMLSSGIIRPNTSPYSSPVLLVRKKDDSWRFCVD
              EE +  +L +F D+F+WPE LPPRR IEH IHLKEG NPVNVRPYRYAY QK EME+LV EML+SGIIRP+ SPYSSPVLLV+KKD SWRFCVD
Subjt:  -----EEESVAVVLKRFDDVFDWPETLPPRRVIEHYIHLKEGVNPVNVRPYRYAYQQKTEMEKLVEEMLSSGIIRPNTSPYSSPVLLVRKKDDSWRFCVD

Query:  YRALNNVTVPDKFPIPVIEELFDELNGAVIFTKLDLKTGYHQIRMNFDDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNVVFKPFLRKFVLVFFDD
        YRALNNVTVPDKFPIPV+EELFDEL GA +FTK+DLK GYHQIRM   DIEKTAFRTHEGHYEF+VMPFGLTNAP+TFQSLMN +F+P+LR+FVLVFFDD
Subjt:  YRALNNVTVPDKFPIPVIEELFDELNGAVIFTKLDLKTGYHQIRMNFDDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNVVFKPFLRKFVLVFFDD

Query:  ILIYSRDLKAHLDHLKAVLEILRKNELYVNKKKCSFAKTRVEYLGHIISGRGIEVDPEKIRAIKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLT
        ILIYSR+L+ HL H++ V  +LRK+EL+ N+KKCSF   +VEYLGH+IS +G+EVDPEKI+AI ++P P +VRE RGFLGLTGYYRKFV HYGT+AAPLT
Subjt:  ILIYSRDLKAHLDHLKAVLEILRKNELYVNKKKCSFAKTRVEYLGHIISGRGIEVDPEKIRAIKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLT

Query:  QLLKKGGFRWTEEAQEAFVRLQRTMMTLLVLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHILAVRDRVKPVYERELMAVVMAVQRWRPYLLGK
        QLLKKGGF W  EA++AF +L++ M+ L VLA+P F  PFEIETDASGYG+GAVLIQN+RPIA+YSH LA+RDR +PVYERELMAVV+AVQRWRPYLLG 
Subjt:  QLLKKGGFRWTEEAQEAFVRLQRTMMTLLVLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHILAVRDRVKPVYERELMAVVMAVQRWRPYLLGK

Query:  RFKVKTDQRSLKFLLEQRVIQP-------------------------------RVPPTVHLNHLIAPNIIDVAVIKEEVNQDEKFKKIREELAEKGEEQS
        RF V+TDQ+SLKFLLEQRV+QP                               R+ PTV +  +  P  +D+ +IKEEV +D K  KI  E+      Q 
Subjt:  RFKVKTDQRSLKFLLEQRVIQP-------------------------------RVPPTVHLNHLIAPNIIDVAVIKEEVNQDEKFKKIREELAEKGEEQS

Query:  SKYSMKQGMLMYKNRMVISKTSKLIPMILHTLHDSVFGGHSGFLRTYKRLTGELY
        SK+ +  GML YK+R+VIS+TSKLIP ILH+ HDS  GGHSGFLRTYKR++GELY
Subjt:  SKYSMKQGMLMYKNRMVISKTSKLIPMILHTLHDSVFGGHSGFLRTYKRLTGELY

A0A5D3BSP2 Ty3/gypsy retrotransposon protein6.7e-28660.33Show/hide
Query:  QNSTKANTNSNT-GENKGGTNWPMRTITLRGTSKEEVPKEGPSKRLSDAEFQARKEKVLCFRCNEQYSHDHKCKTKELRELRMLVVNGDDEEYEIIEEGE
        QN+  + T  NT  ++K  TN+PMRTITLR ++  E+ K+G S+RL DAEFQARKEK LCFRCNE+YS DH+CK KELREL+M VV  + EEYEIIEE  
Subjt:  QNSTKANTNSNT-GENKGGTNWPMRTITLRGTSKEEVPKEGPSKRLSDAEFQARKEKVLCFRCNEQYSHDHKCKTKELRELRMLVVNGDDEEYEIIEEGE

Query:  ETMKELKTIEIMEESQAIVELSINSVVGLSNPRTMKVKGRIQERDVVVLIDCGATHNFVSEILVKELQLETKETSNYGAILGSGTAVKGKGICKDVELMI
           K L+  +  E+ +   ELS+NSVVGL++P TMKVKG+IQER+V++LIDCGATHNF+SE LVK LQL  KET +YG ILGSGTAV+GKGIC++VE+ +
Subjt:  ETMKELKTIEIMEESQAIVELSINSVVGLSNPRTMKVKGRIQERDVVVLIDCGATHNFVSEILVKELQLETKETSNYGAILGSGTAVKGKGICKDVELMI

Query:  GKWKVIDEFLPLELEGVDAILGMKALERCEI----------------------------------------------------------SEEEDSIEEVL
          WKV +EFLPLEL GVD +LGM+ L    I                                                          +E E S   V 
Subjt:  GKWKVIDEFLPLELEGVDAILGMKALERCEI----------------------------------------------------------SEEEDSIEEVL

Query:  T-EEESVAVVLKRFDDVFDWPETLPPRRVIEHYIHLKEGVNPVNVRPYRYAYQQKTEMEKLVEEMLSSGIIRPNTSPYSSPVLLVRKKDDSWRFCVDYRA
        T  +E +  VLK+F+DVFDWPE LPPRR IEH IHLKEG NP+NVRPYRY +QQK EME+LVEEML+SGIIRP+ SP+SSPVLLV+KKD SWRFCVDYRA
Subjt:  T-EEESVAVVLKRFDDVFDWPETLPPRRVIEHYIHLKEGVNPVNVRPYRYAYQQKTEMEKLVEEMLSSGIIRPNTSPYSSPVLLVRKKDDSWRFCVDYRA

Query:  LNNVTVPDKFPIPVIEELFDELNGAVIFTKLDLKTGYHQIRMNFDDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNVVFKPFLRKFVLVFFDDILI
        +NN T+PDKFPIPV EELFDELNGA +F+K+DLK+GYHQIRM   DI KTAFRTHEGHYEF+VMPFGLTNAP+TFQ+LMN +F+PFLRKFVLVFFDDILI
Subjt:  LNNVTVPDKFPIPVIEELFDELNGAVIFTKLDLKTGYHQIRMNFDDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNVVFKPFLRKFVLVFFDDILI

Query:  YSRDLKAHLDHLKAVLEILRKNELYVNKKKCSFAKTRVEYLGHIISGRGIEVDPEKIRAIKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLTQLL
        YS++ K H++H++ V   LRK+ L+ NKKKC+F + +VEYLGHIISG G+EVD EKI+A+ ++P P N+REVRGFLGLTGYYR+FV+HYG+IAAPLTQLL
Subjt:  YSRDLKAHLDHLKAVLEILRKNELYVNKKKCSFAKTRVEYLGHIISGRGIEVDPEKIRAIKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLTQLL

Query:  KKGGFRWTEEAQEAFVRLQRTMMTLLVLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHILAVRDRVKPVYERELMAVVMAVQRWRPYLLGKRFK
        KKGGF+W E+A+E+F +L+  MM+L  LA+P+F++PFEIETDASG+G+GAVLIQ++RPIA+YSH L++RDR +PVYERELMAVV++VQRWRPYLLG +F 
Subjt:  KKGGFRWTEEAQEAFVRLQRTMMTLLVLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHILAVRDRVKPVYERELMAVVMAVQRWRPYLLGKRFK

Query:  VKTDQRSLKFLLEQRVIQP-------------------------------RVPPTVHLNHLIAPNIIDVAVIKEEVNQDEKFKKIREELAEKGEEQSSKY
        VKTDQ+SLKFLLEQRVIQP                               R PP + LN +  P  +D+  IKEEV +DEK KKI   L E+ E+Q+SK+
Subjt:  VKTDQRSLKFLLEQRVIQP-------------------------------RVPPTVHLNHLIAPNIIDVAVIKEEVNQDEKFKKIREELAEKGEEQSSKY

Query:  SMKQGMLMYKNRMVISKTSKLIPMILHTLHDSVFGGHSGFLRTYKRLTGELY
        ++K  +L YKNR+VISK S LIP IL+T HDSV  GHSGFLRTYKRL  ELY
Subjt:  SMKQGMLMYKNRMVISKTSKLIPMILHTLHDSVFGGHSGFLRTYKRLTGELY

A0A5D3D5G0 Ty3/gypsy retrotransposon protein1.7e-28159.88Show/hide
Query:  TNPQNSTKANTNSNTGENKGGTNWPMRTITLRGTSKEEVPKEGPSKRLSDAEFQARKEKVLCFRCNEQYSHDHKCKTKELRELRMLVVNGDDEEYEIIEE
        T     TK        ENK  T +P+RTITL+  +  E  KEG SKRL DAEFQ R+EK LCF+CNE+YS DHKCK +E REL+M VVN ++EE EI+EE
Subjt:  TNPQNSTKANTNSNTGENKGGTNWPMRTITLRGTSKEEVPKEGPSKRLSDAEFQARKEKVLCFRCNEQYSHDHKCKTKELRELRMLVVNGDDEEYEIIEE

Query:  GEETMKELKTIEIMEESQAIVELSINSVVGLSNPRTMKVKGRIQERDVVVLIDCGATHNFVSEILVKELQLETKETSNYGAILGSGTAVKGKGICKDVEL
         E    EL+T+E+  ++ A VELSINSVVGL++P TMKV+G +Q+++VV+LIDCGATHNFVSE LV  LQL  KET++YG ILGSGT ++GKGIC+ VE+
Subjt:  GEETMKELKTIEIMEESQAIVELSINSVVGLSNPRTMKVKGRIQERDVVVLIDCGATHNFVSEILVKELQLETKETSNYGAILGSGTAVKGKGICKDVEL

Query:  MIGKWKVIDEFLPLELEGVDAILGMK--------------------------------ALERCEIS--------EEED------------SIEEVLT---
         +  W V ++FLPLEL GVD ILGM+                                +L +  +S        EE D             I E +T   
Subjt:  MIGKWKVIDEFLPLELEGVDAILGMK--------------------------------ALERCEIS--------EEED------------SIEEVLT---

Query:  -----EEESVAVVLKRFDDVFDWPETLPPRRVIEHYIHLKEGVNPVNVRPYRYAYQQKTEMEKLVEEMLSSGIIRPNTSPYSSPVLLVRKKDDSWRFCVD
              EE +  +L +F+D+F+WPE LPPRR IEH IHLKEG NPVNVRPYRYAY QK EME+LV EML+SGIIRP+ SPYSSPVLLV+KKD SWRFCVD
Subjt:  -----EEESVAVVLKRFDDVFDWPETLPPRRVIEHYIHLKEGVNPVNVRPYRYAYQQKTEMEKLVEEMLSSGIIRPNTSPYSSPVLLVRKKDDSWRFCVD

Query:  YRALNNVTVPDKFPIPVIEELFDELNGAVIFTKLDLKTGYHQIRMNFDDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNVVFKPFLRKFVLVFFDD
        YRALNNVTVPDKFPIPV+EELFDEL GA +FTK+DLK GYHQIRM   DIEKTAFRTHEGHYEF+VMPFGLTNAP+TFQSLMN +F+P+LR+FVLVFFDD
Subjt:  YRALNNVTVPDKFPIPVIEELFDELNGAVIFTKLDLKTGYHQIRMNFDDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNVVFKPFLRKFVLVFFDD

Query:  ILIYSRDLKAHLDHLKAVLEILRKNELYVNKKKCSFAKTRVEYLGHIISGRGIEVDPEKIRAIKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLT
        ILIYSR+L+ +L H++ V  +LRK+EL+ N+KKCSF   +VEYLGH+IS +G+EVDPEKI+AI ++P P +VRE RGFLGLTGYYRKFV HYGT+AAPLT
Subjt:  ILIYSRDLKAHLDHLKAVLEILRKNELYVNKKKCSFAKTRVEYLGHIISGRGIEVDPEKIRAIKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLT

Query:  QLLKKGGFRWTEEAQEAFVRLQRTMMTLLVLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHILAVRDRVKPVYERELMAVVMAVQRWRPYLLGK
        QLLKKGGF W  EA++AF +L++ M+ L VLA+P F  PFEIETDASGYG+GAVLIQN+RPIA+YSH LA+RDR +PVYERELMAVV+AVQRWRPYLLG 
Subjt:  QLLKKGGFRWTEEAQEAFVRLQRTMMTLLVLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHILAVRDRVKPVYERELMAVVMAVQRWRPYLLGK

Query:  RFKVKTDQRSLKFLLEQRVIQP-------------------------------RVPPTVHLNHLIAPNIIDVAVIKEEVNQDEKFKKIREELAEKGEEQS
        RF V+TDQ+SLKFLLEQRV+QP                               R+ PTV +  +  P  +D+  IKEEV +D K  KI  E+      Q 
Subjt:  RFKVKTDQRSLKFLLEQRVIQP-------------------------------RVPPTVHLNHLIAPNIIDVAVIKEEVNQDEKFKKIREELAEKGEEQS

Query:  SKYSMKQGMLMYKNRMVISKTSKLIPMILHTLHDSVFGGHSGFLRTYKRLTGELY
        SK+ M  GML YK+R+VIS+TSKLIP ILH+ HDS  GGHSGFLRTYKR++GELY
Subjt:  SKYSMKQGMLMYKNRMVISKTSKLIPMILHTLHDSVFGGHSGFLRTYKRLTGELY

A0A5D3DKH5 Ty3/gypsy retrotransposon protein1.7e-28963.18Show/hide
Query:  QNSTKANTNSNT-GENKGGTNWPMRTITLRGTSKEEVPKEGPSKRLSDAEFQARKEKVLCFRCNEQYSHDHKCKTKELRELRMLVVNGDDEEYEIIEEGE
        QN+  + T +NT  ++K  TN+PMRTITLR ++  E+ K+G S+RL DAEFQA+KEK LCFRCNE+YS DH+CK KELREL+M VV  + EEYEIIEE  
Subjt:  QNSTKANTNSNT-GENKGGTNWPMRTITLRGTSKEEVPKEGPSKRLSDAEFQARKEKVLCFRCNEQYSHDHKCKTKELRELRMLVVNGDDEEYEIIEEGE

Query:  ETMKELKTIEIMEESQAIVELSINSVVGLSNPRTMKVKGRIQERDVVVLIDCGATHNFVSEILVKELQLETKETSNYGAILGSGTAVKGKGICKDVELMI
           K L++ +  E+ +   ELS+NSVVGL++P TMKVK +IQER+V++LIDCGATHNF+SE LV+ LQL  KET +YG ILGSGTAV+GKGIC++VE+ +
Subjt:  ETMKELKTIEIMEESQAIVELSINSVVGLSNPRTMKVKGRIQERDVVVLIDCGATHNFVSEILVKELQLETKETSNYGAILGSGTAVKGKGICKDVELMI

Query:  GKWKVIDEFLPLELEGVDAILGMKALERCEISEEEDSIEEVLTEEESVAVVLKRFDDVFDWPETLPPRRVIEHYIHLKEGVNPVNVRPYRYAYQQKTEME
          WKV +EF+PLEL GVD +LG        I +   + ++    +E +  VLK+F+DVFDWPE LPPRR IEH IHLKEG +P+NVRPYRY +QQK EME
Subjt:  GKWKVIDEFLPLELEGVDAILGMKALERCEISEEEDSIEEVLTEEESVAVVLKRFDDVFDWPETLPPRRVIEHYIHLKEGVNPVNVRPYRYAYQQKTEME

Query:  KLVEEMLSSGIIRPNTSPYSSPVLLVRKKDDSWRFCVDYRALNNVTVPDKFPIPVIEELFDELNGAVIFTKLDLKTGYHQIRMNFDDIEKTAFRTHEGHY
        +LVEEML+SGIIRP+ SP+SSPVLLV+KKD SWRFCVDYR +NN T+PDKFPIPV EELFDELNGA IF+K+DLK+GYHQIRM   DI KTAFRTHEGHY
Subjt:  KLVEEMLSSGIIRPNTSPYSSPVLLVRKKDDSWRFCVDYRALNNVTVPDKFPIPVIEELFDELNGAVIFTKLDLKTGYHQIRMNFDDIEKTAFRTHEGHY

Query:  EFMVMPFGLTNAPSTFQSLMNVVFKPFLRKFVLVFFDDILIYSRDLKAHLDHLKAVLEILRKNELYVNKKKCSFAKTRVEYLGHIISGRGIEVDPEKIRA
        EF+VMPFGLTNAP+TFQ+LMN +F+PFLRKFVLVFFDDILIYS++ K H++H++ V   LRK+ L+ NKKKC+F + +VEYLGHIISG G+EVD EKI+ 
Subjt:  EFMVMPFGLTNAPSTFQSLMNVVFKPFLRKFVLVFFDDILIYSRDLKAHLDHLKAVLEILRKNELYVNKKKCSFAKTRVEYLGHIISGRGIEVDPEKIRA

Query:  IKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLTQLLKKGGFRWTEEAQEAFVRLQRTMMTLLVLAMPDFSIPFEIETDASGYGLGAVLIQNQRPI
        + ++P P N+REVRGFLGLTGYYR+FV+HYG+IAAPLTQLLKKGGF+W E+A+E+F +L+  MM+L  LA+P+F++PFEIETDASG+G+GAVLIQ++RPI
Subjt:  IKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLTQLLKKGGFRWTEEAQEAFVRLQRTMMTLLVLAMPDFSIPFEIETDASGYGLGAVLIQNQRPI

Query:  AYYSHILAVRDRVKPVYERELMAVVMAVQRWRPYLLGKRFKVKTDQRSLKFLLEQRVIQP-------------------------------RVPPTVHLN
        A+YSH L++RDR +PVYERELMAVV++VQRWRPYLLG +F VKTDQ+SLKFLLEQRVIQP                               R PP + LN
Subjt:  AYYSHILAVRDRVKPVYERELMAVVMAVQRWRPYLLGKRFKVKTDQRSLKFLLEQRVIQP-------------------------------RVPPTVHLN

Query:  HLIAPNIIDVAVIKEEVNQDEKFKKIREELAEKGEEQSSKYSMKQGMLMYKNRMVISKTSKLIPMILHTLHDSVFGGHSGFLRTYKRLTGELY
         +  P  +D+  IKEEV +DEK KKI   L E+ E+Q+SK+++K  +L YKNR+VISK S LIP I++T HDSV GGHSGFLRTYKRL  ELY
Subjt:  HLIAPNIIDVAVIKEEVNQDEKFKKIREELAEKGEEQSSKYSMKQGMLMYKNRMVISKTSKLIPMILHTLHDSVFGGHSGFLRTYKRLTGELY

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.2e-8436.78Show/hide
Query:  TSNYGAILGSGTAVKGKGIC----KDVELMIGKWKVIDEFLPLE---LEGVDAILGMKALERCEISEEEDS----IEEVLTEE-ESVAVVLKRFDDV-FD
        + NY  +LG     + K       ++V L   K+K+I+     E    + V+ I      +  +IS   +S    +E +  EE + +  +L+++ D+ + 
Subjt:  TSNYGAILGSGTAVKGKGIC----KDVELMIGKWKVIDEFLPLE---LEGVDAILGMKALERCEISEEEDS----IEEVLTEE-ESVAVVLKRFDDV-FD

Query:  WPETLPPRRVIEHYIHLKEGVNPVNVRPYRYAYQQKTEMEKLVEEMLSSGIIRPNTSPYSSPVLLVRKKDDS-----WRFCVDYRALNNVTVPDKFPIPV
          + L      +H I+ K  +   +   Y  AY+Q  E+E  +++ML+ GIIR + SPY+SP+ +V KK D+     +R  +DYR LN +TV D+ PIP 
Subjt:  WPETLPPRRVIEHYIHLKEGVNPVNVRPYRYAYQQKTEMEKLVEEMLSSGIIRPNTSPYSSPVLLVRKKDDS-----WRFCVDYRALNNVTVPDKFPIPV

Query:  IEELFDELNGAVIFTKLDLKTGYHQIRMNFDDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNVVFKPFLRKFVLVFFDDILIYSRDLKAHLDHLKA
        ++E+  +L     FT +DL  G+HQI M+ + + KTAF T  GHYE++ MPFGL NAP+TFQ  MN + +P L K  LV+ DDI+++S  L  HL  L  
Subjt:  IEELFDELNGAVIFTKLDLKTGYHQIRMNFDDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNVVFKPFLRKFVLVFFDDILIYSRDLKAHLDHLKA

Query:  VLEILRKNELYVNKKKCSFAKTRVEYLGHIISGRGIEVDPEKIRAIKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLTQLLKKGGFRWT--EEAQ
        V E L K  L +   KC F K    +LGH+++  GI+ +PEKI AI+++PIP   +E++ FLGLTGYYRKF+ ++  IA P+T+ LKK     T   E  
Subjt:  VLEILRKNELYVNKKKCSFAKTRVEYLGHIISGRGIEVDPEKIRAIKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLTQLLKKGGFRWT--EEAQ

Query:  EAFVRLQRTMMTLLVLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHILAVRDRVKPVYERELMAVVMAVQRWRPYLLGKRFKVKTDQRSLKFLL
         AF +L+  +    +L +PDF+  F + TDAS   LGAVL Q+  P++Y S  L   +      E+EL+A+V A + +R YLLG+ F++ +D + L +L 
Subjt:  EAFVRLQRTMMTLLVLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHILAVRDRVKPVYERELMAVVMAVQRWRPYLLGKRFKVKTDQRSLKFLL

Query:  EQR
          +
Subjt:  EQR

P20825 Retrovirus-related Pol polyprotein from transposon 2971.7e-8439.32Show/hide
Query:  SEEEDSIEEV---------LTEEESVAV--VLKRFDDV-FDWPETLPPRRVIEHYIHLKEGVNPVNVRPYRYAYQQKTEMEKLVEEMLSSGIIRPNTSPY
        S +++SI+++         L +EE+  +  +L +F ++ +   E L     I+H ++     +P+  + Y  A   + E+E  V+EML+ G+IR + SPY
Subjt:  SEEEDSIEEV---------LTEEESVAV--VLKRFDDV-FDWPETLPPRRVIEHYIHLKEGVNPVNVRPYRYAYQQKTEMEKLVEEMLSSGIIRPNTSPY

Query:  SSPVLLVRKKDDS-----WRFCVDYRALNNVTVPDKFPIPVIEELFDELNGAVIFTKLDLKTGYHQIRMNFDDIEKTAFRTHEGHYEFMVMPFGLTNAPS
        +SP  +V KK D+     +R  +DYR LN +T+PD++PIP ++E+  +L     FT +DL  G+HQI M+ + I KTAF T  GHYE++ MPFGL NAP+
Subjt:  SSPVLLVRKKDDS-----WRFCVDYRALNNVTVPDKFPIPVIEELFDELNGAVIFTKLDLKTGYHQIRMNFDDIEKTAFRTHEGHYEFMVMPFGLTNAPS

Query:  TFQSLMNVVFKPFLRKFVLVFFDDILIYSRDLKAHLDHLKAVLEILRKNELYVNKKKCSFAKTRVEYLGHIISGRGIEVDPEKIRAIKEFPIPVNVREVR
        TFQ  MN + +P L K  LV+ DDI+I+S  L  HL+ ++ V   L    L +   KC F K    +LGHI++  GI+ +P K++AI  +PIP   +E+R
Subjt:  TFQSLMNVVFKPFLRKFVLVFFDDILIYSRDLKAHLDHLKAVLEILRKNELYVNKKKCSFAKTRVEYLGHIISGRGIEVDPEKIRAIKEFPIPVNVREVR

Query:  GFLGLTGYYRKFVRHYGTIAAPLTQLLKKGGFRWTEEAQ--EAFVRLQRTMMTLLVLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHILAVRDR
         FLGLTGYYRKF+ +Y  IA P+T  LKK     T++ +  EAF +L+  ++   +L +PDF   F + TDAS   LGAVL QN  PI++ S  L   + 
Subjt:  GFLGLTGYYRKFVRHYGTIAAPLTQLLKKGGFRWTEEAQ--EAFVRLQRTMMTLLVLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHILAVRDR

Query:  VKPVYERELMAVVMAVQRWRPYLLGKRFKVKTDQRSLKFL
             E+EL+A+V A + +R YLLG++F + +D + L++L
Subjt:  VKPVYERELMAVVMAVQRWRPYLLGKRFKVKTDQRSLKFL

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein3.9e-7340.3Show/hide
Query:  LPPRRV------IEHYIHLKEGVNPVNVRPYRYAYQQKTEMEKLVEEMLSSGIIRPNTSPYSSPVLLVRKKDDSWRFCVDYRALNNVTVPDKFPIPVIEE
        LPPR        ++H I +K G     ++PY    + + E+ K+V+++L +  I P+ SP SSPV+LV KKD ++R CVDYR LN  T+ D FP+P I+ 
Subjt:  LPPRRV------IEHYIHLKEGVNPVNVRPYRYAYQQKTEMEKLVEEMLSSGIIRPNTSPYSSPVLLVRKKDDSWRFCVDYRALNNVTVPDKFPIPVIEE

Query:  LFDELNGAVIFTKLDLKTGYHQIRMNFDDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNVVFKPFLRKFVLVFFDDILIYSRDLKAHLDHLKAVLE
        L   +  A IFT LDL +GYHQI M   D  KTAF T  G YE+ VMPFGL NAPSTF   M   F+    +FV V+ DDILI+S   + H  HL  VLE
Subjt:  LFDELNGAVIFTKLDLKTGYHQIRMNFDDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNVVFKPFLRKFVLVFFDDILIYSRDLKAHLDHLKAVLE

Query:  ILRKNELYVNKKKCSFAKTRVEYLGHIISGRGIEVDPEKIRAIKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLTQLLKKGGFRWTEEAQEAFVR
         L+   L V KKKC FA    E+LG+ I  + I     K  AI++FP P  V++ + FLG+  YYR+F+ +   IA P+ QL      +WTE+  +A  +
Subjt:  ILRKNELYVNKKKCSFAKTRVEYLGHIISGRGIEVDPEKIRAIKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLTQLLKKGGFRWTEEAQEAFVR

Query:  LQRTMMTLLVLAMPDFSIPFEIETDASGYGLGAVL--IQNQRP----IAYYSHILAVRDRVKPVYERELMAVVMAVQRWRPYLLGKRFKVKTDQRSL
        L+  +    VL   +    + + TDAS  G+GAVL  + N+      + Y+S  L    +  P  E EL+ ++ A+  +R  L GK F ++TD  SL
Subjt:  LQRTMMTLLVLAMPDFSIPFEIETDASGYGLGAVL--IQNQRP----IAYYSHILAVRDRVKPVYERELMAVVMAVQRWRPYLLGKRFKVKTDQRSL

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.1e-7840.05Show/hide
Query:  NPVNVRPYRYAYQQKTEMEKLVEEMLSSGIIRPNTSPYSSPVLLVRKK-----DDSWRFCVDYRALNNVTVPDKFPIPVIEELFDELNGAVIFTKLDLKT
        +P+  + Y Y    + E+E+ ++E+L  GIIRP+ SPY+SP+ +V KK     +  +R  VD++ LN VT+PD +PIP I      L  A  FT LDL +
Subjt:  NPVNVRPYRYAYQQKTEMEKLVEEMLSSGIIRPNTSPYSSPVLLVRKK-----DDSWRFCVDYRALNNVTVPDKFPIPVIEELFDELNGAVIFTKLDLKT

Query:  GYHQIRMNFDDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNVVFKPFLRKFVLVFFDDILIYSRDLKAHLDHLKAVLEILRKNELYVNKKKCSFAK
        G+HQI M   DI KTAF T  G YEF+ +PFGL NAP+ FQ +++ + +  + K   V+ DDI+++S D   H  +L+ VL  L K  L VN +K  F  
Subjt:  GYHQIRMNFDDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNVVFKPFLRKFVLVFFDDILIYSRDLKAHLDHLKAVLEILRKNELYVNKKKCSFAK

Query:  TRVEYLGHIISGRGIEVDPEKIRAIKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLTQLLK------------KGGFRWTEEAQEAFVRLQRTMM
        T+VE+LG+I++  GI+ DP+K+RAI E P P +V+E++ FLG+T YYRKF++ Y  +A PLT L +            K      E A ++F  L+  + 
Subjt:  TRVEYLGHIISGRGIEVDPEKIRAIKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLTQLLK------------KGGFRWTEEAQEAFVRLQRTMM

Query:  TLLVLAMPDFSIPFEIETDASGYGLGAVLIQN----QRPIAYYSHILAVRDRVKPVYERELMAVVMAVQRWRPYLLGK-RFKVKTDQRSLKFLLEQR
        +  +LA P F+ PF + TDAS + +GAVL Q+     RPIAY S  L   +      E+E++A++ ++   R YL G    KV TD + L F L  R
Subjt:  TLLVLAMPDFSIPFEIETDASGYGLGAVLIQN----QRPIAYYSHILAVRDRVKPVYERELMAVVMAVQRWRPYLLGK-RFKVKTDQRSLKFLLEQR

Q99315 Transposon Ty3-G Gag-Pol polyprotein6.7e-7340.3Show/hide
Query:  LPPRRV------IEHYIHLKEGVNPVNVRPYRYAYQQKTEMEKLVEEMLSSGIIRPNTSPYSSPVLLVRKKDDSWRFCVDYRALNNVTVPDKFPIPVIEE
        LPPR        ++H I +K G     ++PY    + + E+ K+V+++L +  I P+ SP SSPV+LV KKD ++R CVDYR LN  T+ D FP+P I+ 
Subjt:  LPPRRV------IEHYIHLKEGVNPVNVRPYRYAYQQKTEMEKLVEEMLSSGIIRPNTSPYSSPVLLVRKKDDSWRFCVDYRALNNVTVPDKFPIPVIEE

Query:  LFDELNGAVIFTKLDLKTGYHQIRMNFDDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNVVFKPFLRKFVLVFFDDILIYSRDLKAHLDHLKAVLE
        L   +  A IFT LDL +GYHQI M   D  KTAF T  G YE+ VMPFGL NAPSTF   M   F+    +FV V+ DDILI+S   + H  HL  VLE
Subjt:  LFDELNGAVIFTKLDLKTGYHQIRMNFDDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNVVFKPFLRKFVLVFFDDILIYSRDLKAHLDHLKAVLE

Query:  ILRKNELYVNKKKCSFAKTRVEYLGHIISGRGIEVDPEKIRAIKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLTQLLKKGGFRWTEEAQEAFVR
         L+   L V KKKC FA    E+LG+ I  + I     K  AI++FP P  V++ + FLG+  YYR+F+ +   IA P+ QL      +WTE+  +A  +
Subjt:  ILRKNELYVNKKKCSFAKTRVEYLGHIISGRGIEVDPEKIRAIKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLTQLLKKGGFRWTEEAQEAFVR

Query:  LQRTMMTLLVLAMPDFSIPFEIETDASGYGLGAVL--IQNQRP----IAYYSHILAVRDRVKPVYERELMAVVMAVQRWRPYLLGKRFKVKTDQRSL
        L+  +    VL   +    + + TDAS  G+GAVL  + N+      + Y+S  L    +  P  E EL+ ++ A+  +R  L GK F ++TD  SL
Subjt:  LQRTMMTLLVLAMPDFSIPFEIETDASGYGLGAVL--IQNQRP----IAYYSHILAVRDRVKPVYERELMAVVMAVQRWRPYLLGKRFKVKTDQRSL

Arabidopsis top hitse value%identityAlignment
AT3G29750.1 Eukaryotic aspartyl protease family protein1.2e-0824.64Show/hide
Query:  MRTITLRGTSKEEVPKEGPSKRLSDAEFQARKEKVLCFRCNEQYSHDHKCKTKELRELRMLVVNGDDEEYEIIEEGEETMKELKTIEIMEESQAIVELSI
        +R++TL G   EE+  +G    L  A  + +   +          + ++ +  EL  L ++       + +++++ +  + EL+ +E  ++S  + +   
Subjt:  MRTITLRGTSKEEVPKEGPSKRLSDAEFQARKEKVLCFRCNEQYSHDHKCKTKELRELRMLVVNGDDEEYEIIEEGEETMKELKTIEIMEESQAIVELSI

Query:  NSVVGLSNPRTMKVKGRIQERDVVVLIDCGATHNFVSEILVKELQLETKETSNYGAILGSGTAVKGKGICKDVELMIGKWKVIDEFLPLEL--EGVDAIL
          V+ L+  + M+  G I +  VVV ID GAT NF+   L   L+L T  T+    +LG    ++  G C  + L + + ++ + FL L+L    VD IL
Subjt:  NSVVGLSNPRTMKVKGRIQERDVVVLIDCGATHNFVSEILVKELQLETKETSNYGAILGSGTAVKGKGICKDVELMIGKWKVIDEFLPLEL--EGVDAIL

Query:  GMKALER
        G + L +
Subjt:  GMKALER

AT3G30770.1 Eukaryotic aspartyl protease family protein4.1e-0930.33Show/hide
Query:  IMEESQAIVELSINSVVGLSNPRTMKVKGRIQERDVVVLIDCGATHNFVSEILVKELQLETKETSNYGAILGSGTAVKGKGICKDVELMIGKWKVIDEFL
        ++E+ + I ++   S    +  + M+  G I    VVV+ID GAT+NF+S+ L   L+L T  T+    +LG    ++  G C  + L++ + ++ + FL
Subjt:  IMEESQAIVELSINSVVGLSNPRTMKVKGRIQERDVVVLIDCGATHNFVSEILVKELQLETKETSNYGAILGSGTAVKGKGICKDVELMIGKWKVIDEFL

Query:  PLEL--EGVDAILGMKALERCE
         L+L    VD ILG    +  E
Subjt:  PLEL--EGVDAILGMKALERCE

ATMG00850.1 DNA/RNA polymerases superfamily protein2.1e-0553.85Show/hide
Query:  QKTEMEKLVEEMLSSGIIRPNTSPYSSPVLLVRKKDDSW
        ++T ++  + EML + II+P+ SPYSSPVLLV+KKD  W
Subjt:  QKTEMEKLVEEMLSSGIIRPNTSPYSSPVLLVRKKDDSW

ATMG00860.1 DNA/RNA polymerases superfamily protein6.3e-3451.15Show/hide
Query:  LDHLKAVLEILRKNELYVNKKKCSFAKTRVEYLG--HIISGRGIEVDPEKIRAIKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLTQLLKKGGFR
        ++HL  VL+I  +++ Y N+KKC+F + ++ YLG  HIISG G+  DP K+ A+  +P P N  E+RGFLGLTGYYR+FV++YG I  PLT+LLKK   +
Subjt:  LDHLKAVLEILRKNELYVNKKKCSFAKTRVEYLG--HIISGRGIEVDPEKIRAIKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLTQLLKKGGFR

Query:  WTEEAQEAFVRLQRTMMTLLVLAMPDFSIPF
        WTE A  AF  L+  + TL VLA+PD  +PF
Subjt:  WTEEAQEAFVRLQRTMMTLLVLAMPDFSIPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTAATCCACAGAATAGTACTAAGGCTAACACGAATTCGAACACTGGAGAAAATAAGGGGGGTACGAATTGGCCAATGAGAACCATTACTCTACGTGGGACATCAAA
GGAGGAAGTTCCAAAAGAAGGACCATCAAAACGATTATCTGATGCGGAATTCCAAGCTCGGAAGGAGAAAGTGTTGTGTTTTCGGTGTAATGAGCAATATTCTCATGACC
ATAAATGCAAAACTAAGGAGTTGAGGGAATTACGAATGCTAGTAGTTAATGGGGATGATGAAGAATACGAAATTATCGAGGAAGGAGAGGAGACAATGAAGGAGTTGAAG
ACTATTGAAATAATGGAAGAAAGCCAGGCGATAGTAGAGTTATCTATCAACTCTGTGGTTGGATTATCTAATCCAAGGACGATGAAGGTGAAGGGAAGGATACAAGAAAG
GGATGTGGTGGTGTTGATTGATTGTGGAGCAACACACAATTTTGTTTCTGAAATACTTGTGAAGGAATTACAACTTGAGACAAAAGAAACCTCAAATTATGGGGCAATTT
TGGGTTCAGGTACTGCTGTAAAAGGAAAAGGGATATGTAAAGATGTGGAACTGATGATAGGAAAATGGAAAGTGATTGATGAATTCTTACCATTGGAATTAGAGGGAGTA
GATGCCATTCTAGGGATGAAAGCCTTGGAAAGATGTGAAATATCAGAGGAAGAAGATTCGATTGAGGAAGTATTGACTGAGGAAGAGTCGGTCGCTGTTGTGTTGAAAAG
ATTTGACGATGTTTTTGATTGGCCAGAAACTTTACCTCCCCGACGGGTAATAGAGCATTACATTCATCTGAAGGAGGGTGTAAATCCAGTAAATGTGCGACCATATCGAT
ATGCTTACCAACAAAAAACTGAGATGGAGAAGTTAGTTGAAGAAATGTTATCGTCAGGGATAATACGCCCCAACACGAGTCCTTACTCGAGCCCTGTATTATTGGTGAGA
AAAAAAGATGATAGTTGGCGGTTTTGCGTAGATTACAGAGCCTTGAACAATGTGACTGTGCCTGATAAATTTCCAATACCAGTAATTGAAGAGTTATTTGATGAGTTAAA
TGGAGCTGTGATATTCACTAAGCTGGATCTGAAGACAGGATATCATCAGATTAGAATGAATTTCGATGATATTGAAAAGACTGCATTCAGGACGCATGAAGGTCATTATG
AGTTCATGGTGATGCCTTTCGGACTAACAAATGCACCTTCAACTTTCCAATCCTTAATGAATGTTGTATTTAAGCCGTTTCTCAGAAAGTTTGTGCTAGTATTCTTTGAC
GATATATTGATTTATAGCAGAGATTTGAAAGCTCATTTGGATCACCTCAAAGCAGTGTTAGAAATACTGCGAAAGAATGAACTATATGTTAACAAGAAAAAGTGCAGTTT
TGCCAAGACAAGGGTAGAATATCTAGGGCATATTATATCTGGAAGGGGAATTGAGGTGGATCCGGAGAAAATTAGAGCCATCAAAGAATTTCCTATTCCAGTTAACGTGA
GAGAAGTCCGAGGATTTTTGGGTTTAACTGGATATTATCGTAAGTTTGTAAGGCATTATGGTACGATTGCAGCACCACTGACTCAATTGTTGAAGAAAGGGGGATTTAGA
TGGACTGAAGAGGCTCAGGAAGCTTTCGTCAGACTGCAGAGAACCATGATGACACTACTGGTATTGGCAATGCCAGATTTCAGCATTCCCTTTGAAATTGAAACGGATGC
TTCGGGCTATGGTTTAGGTGCTGTGTTGATACAAAACCAAAGACCTATTGCTTATTATAGCCACATTCTAGCAGTTAGAGATAGAGTTAAACCTGTTTATGAACGAGAAT
TGATGGCTGTAGTAATGGCAGTACAGAGGTGGCGACCCTATTTGCTGGGGAAGAGATTCAAAGTAAAAACTGATCAACGGTCATTAAAATTCCTGCTGGAACAACGGGTG
ATACAACCCCGAGTACCACCTACTGTCCATCTGAATCATTTGATAGCTCCAAATATAATTGATGTGGCAGTGATTAAAGAAGAGGTTAATCAAGATGAAAAGTTTAAGAA
GATCAGAGAAGAATTAGCAGAAAAAGGTGAGGAGCAGTCCAGCAAATACTCTATGAAGCAGGGGATGTTGATGTATAAGAATCGAATGGTAATTTCTAAAACTTCTAAAC
TGATTCCTATGATCTTACATACGCTTCATGATTCGGTATTTGGAGGACATTCGGGTTTCTTACGAACGTACAAGAGATTGACCGGAGAGTTATACTAG
mRNA sequenceShow/hide mRNA sequence
ATGACTAATCCACAGAATAGTACTAAGGCTAACACGAATTCGAACACTGGAGAAAATAAGGGGGGTACGAATTGGCCAATGAGAACCATTACTCTACGTGGGACATCAAA
GGAGGAAGTTCCAAAAGAAGGACCATCAAAACGATTATCTGATGCGGAATTCCAAGCTCGGAAGGAGAAAGTGTTGTGTTTTCGGTGTAATGAGCAATATTCTCATGACC
ATAAATGCAAAACTAAGGAGTTGAGGGAATTACGAATGCTAGTAGTTAATGGGGATGATGAAGAATACGAAATTATCGAGGAAGGAGAGGAGACAATGAAGGAGTTGAAG
ACTATTGAAATAATGGAAGAAAGCCAGGCGATAGTAGAGTTATCTATCAACTCTGTGGTTGGATTATCTAATCCAAGGACGATGAAGGTGAAGGGAAGGATACAAGAAAG
GGATGTGGTGGTGTTGATTGATTGTGGAGCAACACACAATTTTGTTTCTGAAATACTTGTGAAGGAATTACAACTTGAGACAAAAGAAACCTCAAATTATGGGGCAATTT
TGGGTTCAGGTACTGCTGTAAAAGGAAAAGGGATATGTAAAGATGTGGAACTGATGATAGGAAAATGGAAAGTGATTGATGAATTCTTACCATTGGAATTAGAGGGAGTA
GATGCCATTCTAGGGATGAAAGCCTTGGAAAGATGTGAAATATCAGAGGAAGAAGATTCGATTGAGGAAGTATTGACTGAGGAAGAGTCGGTCGCTGTTGTGTTGAAAAG
ATTTGACGATGTTTTTGATTGGCCAGAAACTTTACCTCCCCGACGGGTAATAGAGCATTACATTCATCTGAAGGAGGGTGTAAATCCAGTAAATGTGCGACCATATCGAT
ATGCTTACCAACAAAAAACTGAGATGGAGAAGTTAGTTGAAGAAATGTTATCGTCAGGGATAATACGCCCCAACACGAGTCCTTACTCGAGCCCTGTATTATTGGTGAGA
AAAAAAGATGATAGTTGGCGGTTTTGCGTAGATTACAGAGCCTTGAACAATGTGACTGTGCCTGATAAATTTCCAATACCAGTAATTGAAGAGTTATTTGATGAGTTAAA
TGGAGCTGTGATATTCACTAAGCTGGATCTGAAGACAGGATATCATCAGATTAGAATGAATTTCGATGATATTGAAAAGACTGCATTCAGGACGCATGAAGGTCATTATG
AGTTCATGGTGATGCCTTTCGGACTAACAAATGCACCTTCAACTTTCCAATCCTTAATGAATGTTGTATTTAAGCCGTTTCTCAGAAAGTTTGTGCTAGTATTCTTTGAC
GATATATTGATTTATAGCAGAGATTTGAAAGCTCATTTGGATCACCTCAAAGCAGTGTTAGAAATACTGCGAAAGAATGAACTATATGTTAACAAGAAAAAGTGCAGTTT
TGCCAAGACAAGGGTAGAATATCTAGGGCATATTATATCTGGAAGGGGAATTGAGGTGGATCCGGAGAAAATTAGAGCCATCAAAGAATTTCCTATTCCAGTTAACGTGA
GAGAAGTCCGAGGATTTTTGGGTTTAACTGGATATTATCGTAAGTTTGTAAGGCATTATGGTACGATTGCAGCACCACTGACTCAATTGTTGAAGAAAGGGGGATTTAGA
TGGACTGAAGAGGCTCAGGAAGCTTTCGTCAGACTGCAGAGAACCATGATGACACTACTGGTATTGGCAATGCCAGATTTCAGCATTCCCTTTGAAATTGAAACGGATGC
TTCGGGCTATGGTTTAGGTGCTGTGTTGATACAAAACCAAAGACCTATTGCTTATTATAGCCACATTCTAGCAGTTAGAGATAGAGTTAAACCTGTTTATGAACGAGAAT
TGATGGCTGTAGTAATGGCAGTACAGAGGTGGCGACCCTATTTGCTGGGGAAGAGATTCAAAGTAAAAACTGATCAACGGTCATTAAAATTCCTGCTGGAACAACGGGTG
ATACAACCCCGAGTACCACCTACTGTCCATCTGAATCATTTGATAGCTCCAAATATAATTGATGTGGCAGTGATTAAAGAAGAGGTTAATCAAGATGAAAAGTTTAAGAA
GATCAGAGAAGAATTAGCAGAAAAAGGTGAGGAGCAGTCCAGCAAATACTCTATGAAGCAGGGGATGTTGATGTATAAGAATCGAATGGTAATTTCTAAAACTTCTAAAC
TGATTCCTATGATCTTACATACGCTTCATGATTCGGTATTTGGAGGACATTCGGGTTTCTTACGAACGTACAAGAGATTGACCGGAGAGTTATACTAG
Protein sequenceShow/hide protein sequence
MTNPQNSTKANTNSNTGENKGGTNWPMRTITLRGTSKEEVPKEGPSKRLSDAEFQARKEKVLCFRCNEQYSHDHKCKTKELRELRMLVVNGDDEEYEIIEEGEETMKELK
TIEIMEESQAIVELSINSVVGLSNPRTMKVKGRIQERDVVVLIDCGATHNFVSEILVKELQLETKETSNYGAILGSGTAVKGKGICKDVELMIGKWKVIDEFLPLELEGV
DAILGMKALERCEISEEEDSIEEVLTEEESVAVVLKRFDDVFDWPETLPPRRVIEHYIHLKEGVNPVNVRPYRYAYQQKTEMEKLVEEMLSSGIIRPNTSPYSSPVLLVR
KKDDSWRFCVDYRALNNVTVPDKFPIPVIEELFDELNGAVIFTKLDLKTGYHQIRMNFDDIEKTAFRTHEGHYEFMVMPFGLTNAPSTFQSLMNVVFKPFLRKFVLVFFD
DILIYSRDLKAHLDHLKAVLEILRKNELYVNKKKCSFAKTRVEYLGHIISGRGIEVDPEKIRAIKEFPIPVNVREVRGFLGLTGYYRKFVRHYGTIAAPLTQLLKKGGFR
WTEEAQEAFVRLQRTMMTLLVLAMPDFSIPFEIETDASGYGLGAVLIQNQRPIAYYSHILAVRDRVKPVYERELMAVVMAVQRWRPYLLGKRFKVKTDQRSLKFLLEQRV
IQPRVPPTVHLNHLIAPNIIDVAVIKEEVNQDEKFKKIREELAEKGEEQSSKYSMKQGMLMYKNRMVISKTSKLIPMILHTLHDSVFGGHSGFLRTYKRLTGELY