; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G09980 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G09980
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr7:8034466..8038412
RNA-Seq ExpressionCSPI07G09980
SyntenyCSPI07G09980
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR012337 - Ribonuclease H-like superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025132.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.6e-30547.47Show/hide
Query:  EKITELSMRSTLMKNNDEGEGSFVSENEPKNEEKKAIEDANCDRNKFKKVEMPIFNGDDSDSWLFRAERYFQIHKLTDFEKMIVATISFEGPALNWYRSQ
        +++ E S   T M+     E +  + +   N E K  E    DR+KFKKVEMPIF G D DSWLFRA+RYF+IH L+D EK+ VA ISF+GPAL+WYRSQ
Subjt:  EKITELSMRSTLMKNNDEGEGSFVSENEPKNEEKKAIEDANCDRNKFKKVEMPIFNGDDSDSWLFRAERYFQIHKLTDFEKMIVATISFEGPALNWYRSQ

Query:  EERDKFIDWANLKERVLVWFRSTREGSLYGRFLRIQQITTMEEYRNLFDKLVVPLSDLPEKVVEETFMSGLKPWIHAEMDS--------------KVENI
        EER+ F  W +LK+++LV FRS R+G+L GRFL I+Q TT+EEYRN FDK + P++ L   V+EETFM+GL PW+ +E+++              K+EN 
Subjt:  EERDKFIDWANLKERVLVWFRSTREGSLYGRFLRIQQITTMEEYRNLFDKLVVPLSDLPEKVVEETFMSGLKPWIHAEMDS--------------KVENI

Query:  EDIRRESNLPGYSEGKLSNARSGNKASTIGNSGENKGNWNWPMRTITLRGTTGEEVRKTGPTKRLSDAEFQSRKEKG-----------------------
        E +R+E  L    + K +   +  +        EN     WPMRTITLR     E R+ GPTKRL+DAEFQ+R+EKG                       
Subjt:  EDIRRESNLPGYSEGKLSNARSGNKASTIGNSGENKGNWNWPMRTITLRGTTGEEVRKTGPTKRLSDAEFQSRKEKG-----------------------

Query:  --LLMGEN-EEYEIIEEE------EAKQKELNAVVVVQEEQTVVELSTNSVVGLSNSGTMKVKGKLLGREIIVLIDCGATHNFISESLVEELKINTKSTT
          LL+ EN EE EIIEEE      E K  E  +V     E   +ELS NSVVGL+N GTMKVKG+L   E++VLIDCGATHNFI+E LVE+L++  K T 
Subjt:  --LLMGEN-EEYEIIEEE------EAKQKELNAVVVVQEEQTVVELSTNSVVGLSNSGTMKVKGKLLGREIIVLIDCGATHNFISESLVEELKINTKSTT

Query:  NYGVILGSETAIKGKGVCEAIEMVLGEWRVVDDFLPLELGGMNAILGMQWLYSLGITE------------------------------------------
        NYGVILGS T+IKGKGVC+ +E+ LG  +V D FLPLELGG++AILGMQWL+S+G TE                                          
Subjt:  NYGVILGSETAIKGKGVCEAIEMVLGEWRVVDDFLPLELGGMNAILGMQWLYSLGITE------------------------------------------

Query:  --GFLAECRALERRESLEEEDSFDEVLTVE-ELVTVVLKSFEDVFDWPETLPPRR--------------------------KIEMERLVEEMLASGVIRS
          G+L ECR LE +E +E+ +   ++   E   +T +L  F DVF+ P  LPP+R                          K EMERLV+EML+SG+IR 
Subjt:  --GFLAECRALERRESLEEEDSFDEVLTVE-ELVTVVLKSFEDVFDWPETLPPRR--------------------------KIEMERLVEEMLASGVIRS

Query:  STSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQSG-------------------------------------
        STSPYS                 +Y ALNNVTVPDKFPIPV+EELFDELNG  +F+KIDL++G                                     
Subjt:  STSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQSG-------------------------------------

Query:  -----------------------------KGLKDHLNHLRAILEVLRKNELYANKKKCSFAQSRVDYLGHIISGDGVEVDPEKIRAIKEWPIPANVREVK
                                     KG+++H  HL  +LE+LR++ELYAN  KC FA++R+ YLGH IS  G+EVDPEKIRA+KEWP PANVRE++
Subjt:  -----------------------------KGLKDHLNHLRAILEVLRKNELYANKKKCSFAQSRVDYLGHIISGDGVEVDPEKIRAIKEWPIPANVREVK

Query:  GFLGLTGYYRKFVHNYGTIAAPLTQLLKIGGFKWTEEVQEAFNRLKQAMMTLPVLALPDFNVSFEIEIDASGYGLGA------------GANGSSNGRAK
        GFLGLTGYYR+FV NYG+I+APLTQLLK G +KWTEE + AF +LK+AMMTLPVLA+PDFN+ FEIE DASG+G+GA                S+  RA+
Subjt:  GFLGLTGYYRKFVHNYGTIAAPLTQLLKIGGFKWTEEVQEAFNRLKQAMMTLPVLALPDFNVSFEIEIDASGYGLGA------------GANGSSNGRAK

Query:  -------MASI---------LVG----------------EERVIQPQYQKWISKLLGYSFEVVYKSGLENKAANALSRVPATVHLNQLTAPKMIDIEIIK
               MA +         L+G                E+RVIQPQYQ+WI+KLLGYSFEV+YK GLENKAA+ALSR+  T HLNQLTAP ++D+E+I+
Subjt:  -------MASI---------LVG----------------EERVIQPQYQKWISKLLGYSFEVVYKSGLENKAANALSRVPATVHLNQLTAPKMIDIEIIK

Query:  AEVERDEKLKKVMQKLSETEELEEGKYLIKQGILMYKGRMELSKTSRLIPTILHTYHDSVFGGHSGFLRTYKRMAAELHWEGMKKEVKKYCEECMICQRN
         EV +D  L++++  + E + +E   Y   QGIL +KGR+ LSK S LIPTI+HTYHDSVFGGHSGFLRTYKRMA EL+W+GMKK+V+KYCEECMICQ+N
Subjt:  AEVERDEKLKKVMQKLSETEELEEGKYLIKQGILMYKGRMELSKTSRLIPTILHTYHDSVFGGHSGFLRTYKRMAAELHWEGMKKEVKKYCEECMICQRN

Query:  KTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYGHFIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIENPLSQI---------
        K+ ALSPA LL PLE+P+ +W DISMDF+EGLPK+ G+EV+ VVVDR SKY HF+ +KHPYT K V+EVFVKEV+RLHG+PKSI +   +I         
Subjt:  KTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYGHFIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIENPLSQI---------

Query:  ------ETRRSC---------VAFVGKSQNNGLNGFLGR----------------------NTGVTPFQAVYGRLPPPLVYYEERDTTNSTLDEQQKERD
              +  RS             V KS    L  F G                       + G+TPFQAVYGRLPPPL+ Y E +T NSTLD+Q ++RD
Subjt:  ------ETRRSC---------VAFVGKSQNNGLNGFLGR----------------------NTGVTPFQAVYGRLPPPLVYYEERDTTNSTLDEQQKERD

Query:  VVLGAL
        VVLGAL
Subjt:  VVLGAL

KAA0039975.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.7e-30746.4Show/hide
Query:  MESAAQERSTMNEKITELSMRSTLMKNNDEGEGSFVSENE------PKNEEKKAIEDANC-DRNKFKKVEMPIFNGDDSDSWLFRAERYFQIHKLTDFEK
        +E+   ERST +  +TE + +        EG+ S     E      P   E+++  D +  DRNKFKK+EMP+F G+D DSWLFRAERYFQIHKLTD EK
Subjt:  MESAAQERSTMNEKITELSMRSTLMKNNDEGEGSFVSENE------PKNEEKKAIEDANC-DRNKFKKVEMPIFNGDDSDSWLFRAERYFQIHKLTDFEK

Query:  MIVATISFEGPALNWYRSQEERDKFIDWANLKERVLVWFRSTREGSLYGRFLRIQQITTMEEYRNLFDKLVVPLSDLPEKVVEETFMSGLKPWIHA----
        M+V+T+SF+GPALNW+RSQEERDKF  WAN+KER+LV FRS ++G++ G+FLRI+Q  T+EEY NLFDK+V P++DLPE+V+ +TFM+GL PW+ +    
Subjt:  MIVATISFEGPALNWYRSQEERDKFIDWANLKERVLVWFRSTREGSLYGRFLRIQQITTMEEYRNLFDKLVVPLSDLPEKVVEETFMSGLKPWIHA----

Query:  ----------EMDSKVENIEDIRRESNLPGYSEGKLSNARS-GNKASTIGNSGENKGNWNWPMRTITLRGTTGEEVRKTGPTKRLSDAEFQSRKEKGLLM
                  E+   VEN E  R E+ + GYS GK++   S   K ST G +G+NK N  +P+RTITLR +   E R+ G  KRL DAEFQ+RKEKGL  
Subjt:  ----------EMDSKVENIEDIRRESNLPGYSEGKLSNARS-GNKASTIGNSGENKGNWNWPMRTITLRGTTGEEVRKTGPTKRLSDAEFQSRKEKGLLM

Query:  GEN--------------------------EEYEIIEEEEAKQKELNAVVVVQEEQTVVELSTNSVVGLSNSGTMKVKGKLLGREIIVLIDCGATHNFISE
          N                          EEYEI+E+E+ ++KEL  + + +   TVVELS NSVVGL++ GTMKV+GKLLG E+++LIDCGATHNF+SE
Subjt:  GEN--------------------------EEYEIIEEEEAKQKELNAVVVVQEEQTVVELSTNSVVGLSNSGTMKVKGKLLGREIIVLIDCGATHNFISE

Query:  SLVEELKINTKSTTNYGVILGSETAIKGKGVCEAIEMVLGEWRVVDDFLPLELGGMNAILGMQWLYSLGIT-----------------------------
         LV +L +  K T++YGVILGS  A++GKGVCE +E+ L +W++V+DFLPLELGG++ ILGMQWLYSLG+T                             
Subjt:  SLVEELKINTKSTTNYGVILGSETAIKGKGVCEAIEMVLGEWRVVDDFLPLELGGMNAILGMQWLYSLGIT-----------------------------

Query:  ---------------EGFLAECRALERRESLEEEDSFDEVLTVEE--LVTVVLKSFEDVFDWPETLPPRRKI--------------------------EM
                        GFL ECR+L+ R ++E+++ +     VE    +  V+K F DVF+WPE LPPRR+I                          EM
Subjt:  ---------------EGFLAECRALERRESLEEEDSFDEVLTVEE--LVTVVLKSFEDVFDWPETLPPRRKI--------------------------EM

Query:  ERLVEEMLASGVIRSSTSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQSG----------------------
        E+LV+EML SGVIR S SPYS                 +Y A+NN T+PDKFPIPV+EELFDEL G T+F+KIDL+SG                      
Subjt:  ERLVEEMLASGVIRSSTSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQSG----------------------

Query:  --------------------------------------------KGLKDHLNHLRAILEVLRKNELYANKKKCSFAQSRVDYLGHIISGDGVEVDPEKIR
                                                    +  ++H  H+R +L VLR +EL+AN+KKC FAQ +++YLGH+ISG+GV VDPEKI+
Subjt:  --------------------------------------------KGLKDHLNHLRAILEVLRKNELYANKKKCSFAQSRVDYLGHIISGDGVEVDPEKIR

Query:  AIKEWPIPANVREVKGFLGLTGYYRKFVHNYGTIAAPLTQLLKIGGFKWTEEVQEAFNRLKQAMMTLPVLALPDFNVSFEIEIDASGYGLGA--------
        AI +WP P NV+E +GFLGLTGYYR+FV +YGTIAAPLTQLLK GGF WTEE   AFNRLK AMM+LPVLALPDF+  FEIE DASGYG+GA        
Subjt:  AIKEWPIPANVREVKGFLGLTGYYRKFVHNYGTIAAPLTQLLKIGGFKWTEEVQEAFNRLKQAMMTLPVLALPDFNVSFEIEIDASGYGLGA--------

Query:  -----------GANGSSNGRAKMASI----------LVG---------------EERVIQPQYQKWISKLLGYSFEVVYKSGLENKAANALSRVPATVHL
                     +     R  MA +          L+G               ++R+IQPQYQKWI+KLLGYSFEVVYK G+EN+AA+ALSR P  V L
Subjt:  -----------GANGSSNGRAKMASI----------LVG---------------EERVIQPQYQKWISKLLGYSFEVVYKSGLENKAANALSRVPATVHL

Query:  NQLTAPKMIDIEIIKAEVERDEKLKKVMQKLSETEELEEGKYLIKQGILMYKGRMELSKTSRLIPTILHTYHDSVFGGHSGFLRTYKRMAAELHWEGMKK
          L+ P  +D+EI+K EV +D K +K+++++ + EELE   Y +K+G+LM+K R+ + K S LIP IL T+H+S  GGHSGFLRTYKR+AAEL+W GMK 
Subjt:  NQLTAPKMIDIEIIKAEVERDEKLKKVMQKLSETEELEEGKYLIKQGILMYKGRMELSKTSRLIPTILHTYHDSVFGGHSGFLRTYKRMAAELHWEGMKK

Query:  EVKKYCEECMICQRNKTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYGHFIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIE
        ++KK+CEEC++CQR+KTLALSPA LL PLE+P  +W DISMDFVEGLPK++GYEV+ VVVDR SKYGHF+P+KHPYT K+V+E+FVKE++RLHG+P SI 
Subjt:  EVKKYCEECMICQRNKTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYGHFIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIE

Query:  NP-----LSQIETR---------RSCVAFVGKSQ------NNGLNGFL--------------------------GRNTGVTPFQAVYGRLPPPLVYYEER
        +      LSQ  T              A+  +S       N G+  +L                           R+ G+TPFQ VYGR PP ++ Y   
Subjt:  NP-----LSQIETR---------RSCVAFVGKSQ------NNGLNGFL--------------------------GRNTGVTPFQAVYGRLPPPLVYYEER

Query:  DTTNSTLDEQQKERDVVLGAL
         + NST++E   ERD+VLG+L
Subjt:  DTTNSTLDEQQKERDVVLGAL

TYK02195.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0046.93Show/hide
Query:  MESAAQERSTMNEKITELSMRSTLMKNNDEGEG-------SFVSENEPKNEEKKAIEDANCDRNKFKKVEMPIFNGDDSDSWLFRAERYFQIHKLTDFEK
        ME+ A++R+T +E+  + +   ++     E E        S  S ++ +NE K   E+A  DRNKFKKVEMP+F G+D DSWLFRAERYFQIHKL+D EK
Subjt:  MESAAQERSTMNEKITELSMRSTLMKNNDEGEG-------SFVSENEPKNEEKKAIEDANCDRNKFKKVEMPIFNGDDSDSWLFRAERYFQIHKLTDFEK

Query:  MIVATISFEGPALNWYRSQEERDKFIDWANLKERVLVWFRSTREGSLYGRFLRIQQITTMEEYRNLFDKLVVPLSDLPEKVVEETFMSGLKPWIHAEM--
        M+V+TISF+GPALNWYR QEER+KF  WANLKER+LV FRS+R+G++ G+FLR++Q +T+++YRNLFDKLV PLSD+P+ VVE+TFM+GL PWI AE+  
Subjt:  MIVATISFEGPALNWYRSQEERDKFIDWANLKERVLVWFRSTREGSLYGRFLRIQQITTMEEYRNLFDKLVVPLSDLPEKVVEETFMSGLKPWIHAEM--

Query:  ------------DSKVENIEDIRRESNLPGYSEGKLSNARSGNKASTIGNSGENKGNWNWPMRTITLRGTTGEEVRKTGPTKRLSDAEFQSRKEKGL---
                       VEN E  R E+NL  ++ GK     + N  + +  + ++K N N+PMRTITLR +   E+RK G ++RL DAEFQ+RKEKGL   
Subjt:  ------------DSKVENIEDIRRESNLPGYSEGKLSNARSGNKASTIGNSGENKGNWNWPMRTITLRGTTGEEVRKTGPTKRLSDAEFQSRKEKGL---

Query:  -----------------------LMGENEEYEIIEEEEAKQKELNAVVVVQEEQTVVELSTNSVVGLSNSGTMKVKGKLLGREIIVLIDCGATHNFISES
                               +  E EEYEIIEE  A++K L      ++++T  ELS NSVVGL++ GTMKVKGK+  RE+I+LIDCGATHNFISE 
Subjt:  -----------------------LMGENEEYEIIEEEEAKQKELNAVVVVQEEQTVVELSTNSVVGLSNSGTMKVKGKLLGREIIVLIDCGATHNFISES

Query:  LVEELKINTKSTTNYGVILGSETAIKGKGVCEAIEMVLGEWRVVDDFLPLELGGMNAILGMQWLYSLGIT------------------------------
        LV+ L++  K T +YGVILGS TA++GKG+CE +E+ L  W+V ++FLPLELGG++ +LGMQWL+SLGIT                              
Subjt:  LVEELKINTKSTTNYGVILGSETAIKGKGVCEAIEMVLGEWRVVDDFLPLELGGMNAILGMQWLYSLGIT------------------------------

Query:  --------------EGFLAECRALERRESLEEEDSFDEVLTV-EELVTVVLKSFEDVFDWPETLPPRR--------------------------KIEMER
                      EGFL ECRA++     E E S   V T+ +E +  VLK FEDVFDWPE LPPRR                          K EMER
Subjt:  --------------EGFLAECRALERRESLEEEDSFDEVLTV-EELVTVVLKSFEDVFDWPETLPPRR--------------------------KIEMER

Query:  LVEEMLASGVIRSSTSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQSG------------------------
        LVEEMLASG+IR S SP+S                 +Y A+NN T+PDKFPIPV EELFDELNG T+F+KIDL+SG                        
Subjt:  LVEEMLASGVIRSSTSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQSG------------------------

Query:  ------------------------------------------KGLKDHLNHLRAILEVLRKNELYANKKKCSFAQSRVDYLGHIISGDGVEVDPEKIRAI
                                                  K  KDH+ H+  +   LRK+ L+ANKKKC+F Q +V+YLGHIISG+GVEVD EKI+A+
Subjt:  ------------------------------------------KGLKDHLNHLRAILEVLRKNELYANKKKCSFAQSRVDYLGHIISGDGVEVDPEKIRAI

Query:  KEWPIPANVREVKGFLGLTGYYRKFVHNYGTIAAPLTQLLKIGGFKWTEEVQEAFNRLKQAMMTLPVLALPDFNVSFEIEIDASGYGLGA----------
         +WP P N+REV+GFLGLTGYYR+FV +YG+IAAPLTQLLK GGFKW E+ +E+F +LK AMM+LP LALP+F + FEIE DASG+G+GA          
Subjt:  KEWPIPANVREVKGFLGLTGYYRKFVHNYGTIAAPLTQLLKIGGFKWTEEVQEAFNRLKQAMMTLPVLALPDFNVSFEIEIDASGYGLGA----------

Query:  --GANGSSNGRAK-------MASIL-------------------------VGEERVIQPQYQKWISKLLGYSFEVVYKSGLENKAANALSRVPATVHLNQ
              S   RA+       MA +L                         + E+RVIQPQYQKW+SKLLGYSFEVVYK GLENKAA+ALSR P  + LN 
Subjt:  --GANGSSNGRAK-------MASIL-------------------------VGEERVIQPQYQKWISKLLGYSFEVVYKSGLENKAANALSRVPATVHLNQ

Query:  LTAPKMIDIEIIKAEVERDEKLKKVMQKLSETEELEEGKYLIKQGILMYKGRMELSKTSRLIPTILHTYHDSVFGGHSGFLRTYKRMAAELHWEGMKKEV
        ++ P  +D+E IK EVE+DEKLKK++  L+E +E +  K+ IK  +L YK R+ +SK S LIP IL+T+HDSV  GHSGFLRTYKR+A+EL+WEGMK +V
Subjt:  LTAPKMIDIEIIKAEVERDEKLKKVMQKLSETEELEEGKYLIKQGILMYKGRMELSKTSRLIPTILHTYHDSVFGGHSGFLRTYKRMAAELHWEGMKKEV

Query:  KKYCEECMICQRNKTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYGHFIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIENP
        KK+CE C+ CQRNK+LALSPA LL PL++P+++W DISMDF++GLPKA G++V+ VVVDR SKY HF+ +KHPYT K V+E FVKE++RLHG+P SI + 
Subjt:  KKYCEECMICQRNKTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYGHFIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIENP

Query:  LSQI-------ETRRSCVAFVGKSQ-------------NNGLNGFL--------------------------GRNTGVTPFQAVYGRLPPPLVYYEERDT
          ++       E  +     + KS              N GL  +L                           +  G +PFQ VYG  PP L+ Y ER T
Subjt:  LSQI-------ETRRSCVAFVGKSQ-------------NNGLNGFL--------------------------GRNTGVTPFQAVYGRLPPPLVYYEERDT

Query:  TNSTLDEQQKERDVVLGAL
        +NS++DEQ KERDV L AL
Subjt:  TNSTLDEQQKERDVVLGAL

TYK06640.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.3e-30746.4Show/hide
Query:  MESAAQERSTMNEKITELSMRSTLMKNNDEGEGSFVSENE------PKNEEKKAIEDANC-DRNKFKKVEMPIFNGDDSDSWLFRAERYFQIHKLTDFEK
        +E+   ERST +  +TE + +        EG+ S     E      P   E+++  D +  DRNKFKK+EMP+F G+D DSWLFRAERYFQIHKLTD EK
Subjt:  MESAAQERSTMNEKITELSMRSTLMKNNDEGEGSFVSENE------PKNEEKKAIEDANC-DRNKFKKVEMPIFNGDDSDSWLFRAERYFQIHKLTDFEK

Query:  MIVATISFEGPALNWYRSQEERDKFIDWANLKERVLVWFRSTREGSLYGRFLRIQQITTMEEYRNLFDKLVVPLSDLPEKVVEETFMSGLKPWIHA----
        M+V+T+SF+GPALNW+RSQEERDKF  WAN+KER+LV FRS ++G++ G+FLRI+Q  T+EEY NLFDK+V P++DLPE+V+ +TFM+GL PW+ +    
Subjt:  MIVATISFEGPALNWYRSQEERDKFIDWANLKERVLVWFRSTREGSLYGRFLRIQQITTMEEYRNLFDKLVVPLSDLPEKVVEETFMSGLKPWIHA----

Query:  ----------EMDSKVENIEDIRRESNLPGYSEGKLSNARS-GNKASTIGNSGENKGNWNWPMRTITLRGTTGEEVRKTGPTKRLSDAEFQSRKEKGLLM
                  E+   VEN E  R E+ + GYS GK++   S   K ST G +G+NK N  +P+RTITLR +   E R+ G  KRL DAEFQ+RKEKGL  
Subjt:  ----------EMDSKVENIEDIRRESNLPGYSEGKLSNARS-GNKASTIGNSGENKGNWNWPMRTITLRGTTGEEVRKTGPTKRLSDAEFQSRKEKGLLM

Query:  GEN--------------------------EEYEIIEEEEAKQKELNAVVVVQEEQTVVELSTNSVVGLSNSGTMKVKGKLLGREIIVLIDCGATHNFISE
          N                          EEYEI+E+E+ ++KEL  + + +   TVVELS NSVVGL++ GTMKV+GKLLG E+++LIDCGATHNF+SE
Subjt:  GEN--------------------------EEYEIIEEEEAKQKELNAVVVVQEEQTVVELSTNSVVGLSNSGTMKVKGKLLGREIIVLIDCGATHNFISE

Query:  SLVEELKINTKSTTNYGVILGSETAIKGKGVCEAIEMVLGEWRVVDDFLPLELGGMNAILGMQWLYSLGIT-----------------------------
         LV +L +  K T++YGVILGS  A++GKGVCE +E+ L +W++V+DFLPLELGG++ ILGMQWLYSLG+T                             
Subjt:  SLVEELKINTKSTTNYGVILGSETAIKGKGVCEAIEMVLGEWRVVDDFLPLELGGMNAILGMQWLYSLGIT-----------------------------

Query:  ---------------EGFLAECRALERRESLEEEDSFDEVLTVEE--LVTVVLKSFEDVFDWPETLPPRRKI--------------------------EM
                        GFL ECR+L+ R ++E+++ +     VE    +  V+K F DVF+WPE LPPRR+I                          EM
Subjt:  ---------------EGFLAECRALERRESLEEEDSFDEVLTVEE--LVTVVLKSFEDVFDWPETLPPRRKI--------------------------EM

Query:  ERLVEEMLASGVIRSSTSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQSG----------------------
        E+LV+EML SGVIR S SPYS                 +Y A+NN T+PDKFPIPV+EELFDEL G T+F+KIDL+SG                      
Subjt:  ERLVEEMLASGVIRSSTSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQSG----------------------

Query:  --------------------------------------------KGLKDHLNHLRAILEVLRKNELYANKKKCSFAQSRVDYLGHIISGDGVEVDPEKIR
                                                    +  ++H  H+R +L VLR +EL+AN+KKC FAQ +++YLGH+ISG+GV VDPEKI+
Subjt:  --------------------------------------------KGLKDHLNHLRAILEVLRKNELYANKKKCSFAQSRVDYLGHIISGDGVEVDPEKIR

Query:  AIKEWPIPANVREVKGFLGLTGYYRKFVHNYGTIAAPLTQLLKIGGFKWTEEVQEAFNRLKQAMMTLPVLALPDFNVSFEIEIDASGYGLGA--------
        AI +WP P NV+E +GFLGLTGYYR+FV +YGTIAAPLTQLLK GGF WTEE   AFNRLK AMM+LPVLALPDF+  FEIE DASGYG+GA        
Subjt:  AIKEWPIPANVREVKGFLGLTGYYRKFVHNYGTIAAPLTQLLKIGGFKWTEEVQEAFNRLKQAMMTLPVLALPDFNVSFEIEIDASGYGLGA--------

Query:  -----------GANGSSNGRAKMASI----------LVG---------------EERVIQPQYQKWISKLLGYSFEVVYKSGLENKAANALSRVPATVHL
                     +     R  MA +          L+G               ++R+IQPQYQKWI+KLLGYSFEVVYK G+EN+AA+ALSR P  V L
Subjt:  -----------GANGSSNGRAKMASI----------LVG---------------EERVIQPQYQKWISKLLGYSFEVVYKSGLENKAANALSRVPATVHL

Query:  NQLTAPKMIDIEIIKAEVERDEKLKKVMQKLSETEELEEGKYLIKQGILMYKGRMELSKTSRLIPTILHTYHDSVFGGHSGFLRTYKRMAAELHWEGMKK
          L+ P  +D+EI+K EV +D K +K+++++ + EELE   Y +K+G+LM+K R+ + K S LIP IL T+H+S  GGHSGFLRTYKR+AAEL+W GMK 
Subjt:  NQLTAPKMIDIEIIKAEVERDEKLKKVMQKLSETEELEEGKYLIKQGILMYKGRMELSKTSRLIPTILHTYHDSVFGGHSGFLRTYKRMAAELHWEGMKK

Query:  EVKKYCEECMICQRNKTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYGHFIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIE
        ++KK+CEEC++CQR+KTLALSPA LL PLE+P  +W DISMDFVEGLPK++GYEV+ VVVDR SKYGHF+P+KHPYT K+V+E+FVKE++RLHG+P SI 
Subjt:  EVKKYCEECMICQRNKTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYGHFIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIE

Query:  NP-----LSQIETR---------RSCVAFVGKSQ------NNGLNGFL--------------------------GRNTGVTPFQAVYGRLPPPLVYYEER
        +      LSQ  T              A+  +S       N G+  +L                           R+ G+TPFQ VYGR PP ++ Y   
Subjt:  NP-----LSQIETR---------RSCVAFVGKSQ------NNGLNGFL--------------------------GRNTGVTPFQAVYGRLPPPLVYYEER

Query:  DTTNSTLDEQQKERDVVLGAL
         + NST++E   ERD+VLG+L
Subjt:  DTTNSTLDEQQKERDVVLGAL

TYK18846.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.0e-30946.63Show/hide
Query:  MESAAQERSTMNEKITELSMRSTLMKNNDEGEGSFVSENEPK-------NEEKKAIEDANCDRNKFKKVEMPIFNGDDSDSWLFRAERYFQIHKLTDFEK
        +E+   ERST +  +TE + +        EG+ S     E         NE +   +++  DRNKFKK+EMP+F G+D DSWLFRAERYFQIHKLTD EK
Subjt:  MESAAQERSTMNEKITELSMRSTLMKNNDEGEGSFVSENEPK-------NEEKKAIEDANCDRNKFKKVEMPIFNGDDSDSWLFRAERYFQIHKLTDFEK

Query:  MIVATISFEGPALNWYRSQEERDKFIDWANLKERVLVWFRSTREGSLYGRFLRIQQITTMEEYRNLFDKLVVPLSDLPEKVVEETFMSGLKPWIHA----
        M+V+T+SF+GPALNW+RSQEERDKF  WAN+KER+LV FRS ++G++ G+FLRI+Q  T+EEY NLFDK+V P++DLPE+V+ +TFM+GL PW+ +    
Subjt:  MIVATISFEGPALNWYRSQEERDKFIDWANLKERVLVWFRSTREGSLYGRFLRIQQITTMEEYRNLFDKLVVPLSDLPEKVVEETFMSGLKPWIHA----

Query:  ----------EMDSKVENIEDIRRESNLPGYSEGKLSNARS-GNKASTIGNSGENKGNWNWPMRTITLRGTTGEEVRKTGPTKRLSDAEFQSRKEKGLLM
                  E+   VEN E  R E+ + GYS GK++   S   K ST G +G+NK N  +P+RTITLR +   E R+ G  KRL DAEFQ+RKEKGL  
Subjt:  ----------EMDSKVENIEDIRRESNLPGYSEGKLSNARS-GNKASTIGNSGENKGNWNWPMRTITLRGTTGEEVRKTGPTKRLSDAEFQSRKEKGLLM

Query:  GEN--------------------------EEYEIIEEEEAKQKELNAVVVVQEEQTVVELSTNSVVGLSNSGTMKVKGKLLGREIIVLIDCGATHNFISE
          N                          EEYEI+E+E+ ++KEL  + + +   TVVELS NSVVGL++ GTMKV+GKLLG E+++LIDCGATHNF+SE
Subjt:  GEN--------------------------EEYEIIEEEEAKQKELNAVVVVQEEQTVVELSTNSVVGLSNSGTMKVKGKLLGREIIVLIDCGATHNFISE

Query:  SLVEELKINTKSTTNYGVILGSETAIKGKGVCEAIEMVLGEWRVVDDFLPLELGGMNAILGMQWLYSLGIT-----------------------------
         LV +L +  K T++YGVILGS  A++GKGVCE +E+ L +W++V+DFLPLELGG++ ILGMQWLYSLG+T                             
Subjt:  SLVEELKINTKSTTNYGVILGSETAIKGKGVCEAIEMVLGEWRVVDDFLPLELGGMNAILGMQWLYSLGIT-----------------------------

Query:  ---------------EGFLAECRALERRESLEEEDSFDEVLTVEE--LVTVVLKSFEDVFDWPETLPPRRKI--------------------------EM
                        GFL ECR+L+ R ++E+++ +     VE    +  V+K F DVF+WPE LPPRR+I                          EM
Subjt:  ---------------EGFLAECRALERRESLEEEDSFDEVLTVEE--LVTVVLKSFEDVFDWPETLPPRRKI--------------------------EM

Query:  ERLVEEMLASGVIRSSTSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQSG----------------------
        E+LV+EML SGVIR S SPYS                 +Y A+NN T+PDKFPIPV+EELFDEL G T+F+KIDL+SG                      
Subjt:  ERLVEEMLASGVIRSSTSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQSG----------------------

Query:  --------------------------------------------KGLKDHLNHLRAILEVLRKNELYANKKKCSFAQSRVDYLGHIISGDGVEVDPEKIR
                                                    +  ++H  H+R +L VLR +EL+AN+KKC FAQ +++YLGH+ISG+GV VDPEKI+
Subjt:  --------------------------------------------KGLKDHLNHLRAILEVLRKNELYANKKKCSFAQSRVDYLGHIISGDGVEVDPEKIR

Query:  AIKEWPIPANVREVKGFLGLTGYYRKFVHNYGTIAAPLTQLLKIGGFKWTEEVQEAFNRLKQAMMTLPVLALPDFNVSFEIEIDASGYGLGA--------
        AI +WP P NV+E++GFLGLTGYYR+FV NYGTIAAPLTQLLK GGF WTEE   AFNRLK AMM+LPVLALPDF+  FEIE DASGYG+GA        
Subjt:  AIKEWPIPANVREVKGFLGLTGYYRKFVHNYGTIAAPLTQLLKIGGFKWTEEVQEAFNRLKQAMMTLPVLALPDFNVSFEIEIDASGYGLGA--------

Query:  -----------GANGSSNGRAKMASI----------LVG---------------EERVIQPQYQKWISKLLGYSFEVVYKSGLENKAANALSRVPATVHL
                     +     R  MA +          L+G               ++R+IQPQYQKWI+KLLGYSFEVVYK G+EN+AA+ALSR P  V L
Subjt:  -----------GANGSSNGRAKMASI----------LVG---------------EERVIQPQYQKWISKLLGYSFEVVYKSGLENKAANALSRVPATVHL

Query:  NQLTAPKMIDIEIIKAEVERDEKLKKVMQKLSETEELEEGKYLIKQGILMYKGRMELSKTSRLIPTILHTYHDSVFGGHSGFLRTYKRMAAELHWEGMKK
          L+ P  +D+EI+K EV +D K +K+++++ + EELE   Y +K+G+LMYK R+ + K S LIP IL T+H+S  GGHSGFLRTYKR+AAEL+W GMK 
Subjt:  NQLTAPKMIDIEIIKAEVERDEKLKKVMQKLSETEELEEGKYLIKQGILMYKGRMELSKTSRLIPTILHTYHDSVFGGHSGFLRTYKRMAAELHWEGMKK

Query:  EVKKYCEECMICQRNKTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYGHFIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIE
        ++KKYCEEC++CQR+KTLALSPA LL PLE+P  +W DISMDFVEGLPK++G+EV+ VVVDR SKYGHF+P+KHPYT K+V+E+FVKE++RLHG+P SI 
Subjt:  EVKKYCEECMICQRNKTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYGHFIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIE

Query:  NP-----LSQ--IETRRSCVAFVGKSQ-------------NNGLNGFL--------------------------GRNTGVTPFQAVYGRLPPPLVYYEER
        +      LSQ  IE  R     + KS              N G+  +L                           R+ G+TPFQ VYGR PP ++ Y   
Subjt:  NP-----LSQ--IETRRSCVAFVGKSQ-------------NNGLNGFL--------------------------GRNTGVTPFQAVYGRLPPPLVYYEER

Query:  DTTNSTLDEQQKERDVVLGAL
         + NST++E   ERD+VLG+L
Subjt:  DTTNSTLDEQQKERDVVLGAL

TrEMBL top hitse value%identityAlignment
A0A5A7SIV7 Ty3/gypsy retrotransposon protein7.9e-30647.47Show/hide
Query:  EKITELSMRSTLMKNNDEGEGSFVSENEPKNEEKKAIEDANCDRNKFKKVEMPIFNGDDSDSWLFRAERYFQIHKLTDFEKMIVATISFEGPALNWYRSQ
        +++ E S   T M+     E +  + +   N E K  E    DR+KFKKVEMPIF G D DSWLFRA+RYF+IH L+D EK+ VA ISF+GPAL+WYRSQ
Subjt:  EKITELSMRSTLMKNNDEGEGSFVSENEPKNEEKKAIEDANCDRNKFKKVEMPIFNGDDSDSWLFRAERYFQIHKLTDFEKMIVATISFEGPALNWYRSQ

Query:  EERDKFIDWANLKERVLVWFRSTREGSLYGRFLRIQQITTMEEYRNLFDKLVVPLSDLPEKVVEETFMSGLKPWIHAEMDS--------------KVENI
        EER+ F  W +LK+++LV FRS R+G+L GRFL I+Q TT+EEYRN FDK + P++ L   V+EETFM+GL PW+ +E+++              K+EN 
Subjt:  EERDKFIDWANLKERVLVWFRSTREGSLYGRFLRIQQITTMEEYRNLFDKLVVPLSDLPEKVVEETFMSGLKPWIHAEMDS--------------KVENI

Query:  EDIRRESNLPGYSEGKLSNARSGNKASTIGNSGENKGNWNWPMRTITLRGTTGEEVRKTGPTKRLSDAEFQSRKEKG-----------------------
        E +R+E  L    + K +   +  +        EN     WPMRTITLR     E R+ GPTKRL+DAEFQ+R+EKG                       
Subjt:  EDIRRESNLPGYSEGKLSNARSGNKASTIGNSGENKGNWNWPMRTITLRGTTGEEVRKTGPTKRLSDAEFQSRKEKG-----------------------

Query:  --LLMGEN-EEYEIIEEE------EAKQKELNAVVVVQEEQTVVELSTNSVVGLSNSGTMKVKGKLLGREIIVLIDCGATHNFISESLVEELKINTKSTT
          LL+ EN EE EIIEEE      E K  E  +V     E   +ELS NSVVGL+N GTMKVKG+L   E++VLIDCGATHNFI+E LVE+L++  K T 
Subjt:  --LLMGEN-EEYEIIEEE------EAKQKELNAVVVVQEEQTVVELSTNSVVGLSNSGTMKVKGKLLGREIIVLIDCGATHNFISESLVEELKINTKSTT

Query:  NYGVILGSETAIKGKGVCEAIEMVLGEWRVVDDFLPLELGGMNAILGMQWLYSLGITE------------------------------------------
        NYGVILGS T+IKGKGVC+ +E+ LG  +V D FLPLELGG++AILGMQWL+S+G TE                                          
Subjt:  NYGVILGSETAIKGKGVCEAIEMVLGEWRVVDDFLPLELGGMNAILGMQWLYSLGITE------------------------------------------

Query:  --GFLAECRALERRESLEEEDSFDEVLTVE-ELVTVVLKSFEDVFDWPETLPPRR--------------------------KIEMERLVEEMLASGVIRS
          G+L ECR LE +E +E+ +   ++   E   +T +L  F DVF+ P  LPP+R                          K EMERLV+EML+SG+IR 
Subjt:  --GFLAECRALERRESLEEEDSFDEVLTVE-ELVTVVLKSFEDVFDWPETLPPRR--------------------------KIEMERLVEEMLASGVIRS

Query:  STSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQSG-------------------------------------
        STSPYS                 +Y ALNNVTVPDKFPIPV+EELFDELNG  +F+KIDL++G                                     
Subjt:  STSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQSG-------------------------------------

Query:  -----------------------------KGLKDHLNHLRAILEVLRKNELYANKKKCSFAQSRVDYLGHIISGDGVEVDPEKIRAIKEWPIPANVREVK
                                     KG+++H  HL  +LE+LR++ELYAN  KC FA++R+ YLGH IS  G+EVDPEKIRA+KEWP PANVRE++
Subjt:  -----------------------------KGLKDHLNHLRAILEVLRKNELYANKKKCSFAQSRVDYLGHIISGDGVEVDPEKIRAIKEWPIPANVREVK

Query:  GFLGLTGYYRKFVHNYGTIAAPLTQLLKIGGFKWTEEVQEAFNRLKQAMMTLPVLALPDFNVSFEIEIDASGYGLGA------------GANGSSNGRAK
        GFLGLTGYYR+FV NYG+I+APLTQLLK G +KWTEE + AF +LK+AMMTLPVLA+PDFN+ FEIE DASG+G+GA                S+  RA+
Subjt:  GFLGLTGYYRKFVHNYGTIAAPLTQLLKIGGFKWTEEVQEAFNRLKQAMMTLPVLALPDFNVSFEIEIDASGYGLGA------------GANGSSNGRAK

Query:  -------MASI---------LVG----------------EERVIQPQYQKWISKLLGYSFEVVYKSGLENKAANALSRVPATVHLNQLTAPKMIDIEIIK
               MA +         L+G                E+RVIQPQYQ+WI+KLLGYSFEV+YK GLENKAA+ALSR+  T HLNQLTAP ++D+E+I+
Subjt:  -------MASI---------LVG----------------EERVIQPQYQKWISKLLGYSFEVVYKSGLENKAANALSRVPATVHLNQLTAPKMIDIEIIK

Query:  AEVERDEKLKKVMQKLSETEELEEGKYLIKQGILMYKGRMELSKTSRLIPTILHTYHDSVFGGHSGFLRTYKRMAAELHWEGMKKEVKKYCEECMICQRN
         EV +D  L++++  + E + +E   Y   QGIL +KGR+ LSK S LIPTI+HTYHDSVFGGHSGFLRTYKRMA EL+W+GMKK+V+KYCEECMICQ+N
Subjt:  AEVERDEKLKKVMQKLSETEELEEGKYLIKQGILMYKGRMELSKTSRLIPTILHTYHDSVFGGHSGFLRTYKRMAAELHWEGMKKEVKKYCEECMICQRN

Query:  KTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYGHFIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIENPLSQI---------
        K+ ALSPA LL PLE+P+ +W DISMDF+EGLPK+ G+EV+ VVVDR SKY HF+ +KHPYT K V+EVFVKEV+RLHG+PKSI +   +I         
Subjt:  KTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYGHFIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIENPLSQI---------

Query:  ------ETRRSC---------VAFVGKSQNNGLNGFLGR----------------------NTGVTPFQAVYGRLPPPLVYYEERDTTNSTLDEQQKERD
              +  RS             V KS    L  F G                       + G+TPFQAVYGRLPPPL+ Y E +T NSTLD+Q ++RD
Subjt:  ------ETRRSC---------VAFVGKSQNNGLNGFLGR----------------------NTGVTPFQAVYGRLPPPLVYYEERDTTNSTLDEQQKERD

Query:  VVLGAL
        VVLGAL
Subjt:  VVLGAL

A0A5A7TR82 Transposon Ty3-G Gag-Pol polyprotein5.1e-30547.77Show/hide
Query:  MESAAQERSTMNEKITELSMRSTLMKNNDEGEGSFVSENEPK-------NEEKKAIEDANCDRNKFKKVEMPIFNGDDSDSWLFRAERYFQIHKLTDFEK
        +E++++ERS M+ + TE + +        E + S    NE +        E +   +D   DRNKFKK+EMP+F G+D DSWLFRAERYFQIH+LT+ EK
Subjt:  MESAAQERSTMNEKITELSMRSTLMKNNDEGEGSFVSENEPK-------NEEKKAIEDANCDRNKFKKVEMPIFNGDDSDSWLFRAERYFQIHKLTDFEK

Query:  MIVATISFEGPALNWYRSQEERDKFIDWANLKERVLVWFRSTREGSLYGRFLRIQQITTMEEYRNLFDKLVVPLSDLPEKVVEETFMSGLKPWIHAEM--
        M+V+TISF+GPALNWYRSQEER +F  W+N+KER+LV FRS ++G++ G+FLRI+Q   +EEY NLFDK+V P++DLPE V+E+TFM+GL PW+ +E+  
Subjt:  MIVATISFEGPALNWYRSQEERDKFIDWANLKERVLVWFRSTREGSLYGRFLRIQQITTMEEYRNLFDKLVVPLSDLPEKVVEETFMSGLKPWIHAEM--

Query:  ------------DSKVENIEDIRRESNLPGYSEGKLSNARSGN-KASTIGNSGENKGNWNWPMRTITLRGTTGEEVRKTGPTKRLSDAEFQSRKEKGLLM
                       VEN E +R E+ + GYS GK+S     N K ++ G +G+ KGN ++P+RTITLR     E R+ G  KRL DAEFQ++KEKGL  
Subjt:  ------------DSKVENIEDIRRESNLPGYSEGKLSNARSGN-KASTIGNSGENKGNWNWPMRTITLRGTTGEEVRKTGPTKRLSDAEFQSRKEKGLLM

Query:  GENE--------------------------EYEIIEEEEAKQKELNAVVVVQEEQTVVELSTNSVVGLSNSGTMKVKGKLLGREIIVLIDCGATHNFISE
          NE                          EYEI+EEE+ ++K+L  + V ++  TVVELS NSVVGL++ GTMKV+GKLLG E+IVLIDCGATHNF+SE
Subjt:  GENE--------------------------EYEIIEEEEAKQKELNAVVVVQEEQTVVELSTNSVVGLSNSGTMKVKGKLLGREIIVLIDCGATHNFISE

Query:  SLVEELKINTKSTTNYGVILGSETAIKGKGVCEAIEMVLGEWRVVDDFLPLELGGMNAILGMQWLYSLGIT-----------------------------
         LV++L +  K T++YGVILGS  A++GKG+CE +E+ L  W+VV+DFLPLELGG++ ILGMQWLYSLG+T                             
Subjt:  SLVEELKINTKSTTNYGVILGSETAIKGKGVCEAIEMVLGEWRVVDDFLPLELGGMNAILGMQWLYSLGIT-----------------------------

Query:  ---------------EGFLAECRALERRESLEEEDSFDEVLTVEE-LVTVVLKSFEDVFDWPETLPPRRKI--------------------------EME
                        GFL ECR+L+ R    EE        V + L++ V+K ++DVF+WPE LPPRR+I                          EME
Subjt:  ---------------EGFLAECRALERRESLEEEDSFDEVLTVEE-LVTVVLKSFEDVFDWPETLPPRRKI--------------------------EME

Query:  RLVEEMLASGVIRSSTSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQSGKGLKDHLNHLRAILEVLRKNELY
        +LV+EML SGVIR STSPYS                 +Y A+NN T+PDKFPIPV+EELFDELN                ++H+ HL+ +L+VLR++ELY
Subjt:  RLVEEMLASGVIRSSTSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQSGKGLKDHLNHLRAILEVLRKNELY

Query:  ANKKKCSFAQSRVDYLGHIISGDGVEVDPEKIRAIKEWPIPANVREVKGFLGLTGYYRKFVHNYGTIAAPLTQLLKIGGFKWTEEVQEAFNRLKQAMMTL
        AN+KKC FAQ +++YLGH+ISG+GV VDPEKI+AI +WP P NV+E +GFLGLTGYYR+FV NYGTIAAPLTQLLK GGFKWTEE  +AF+RLK AM++L
Subjt:  ANKKKCSFAQSRVDYLGHIISGDGVEVDPEKIRAIKEWPIPANVREVKGFLGLTGYYRKFVHNYGTIAAPLTQLLKIGGFKWTEEVQEAFNRLKQAMMTL

Query:  PVLALPDFNVSFEIEIDASGYGLGA-------------------GANGSSNGRAKMASI----------LVG---------------EERVIQPQYQKWI
        PVLALPDF   FEIE DASGYG+GA                           R  MA +          L+G               ++R+IQ QYQKWI
Subjt:  PVLALPDFNVSFEIEIDASGYGLGA-------------------GANGSSNGRAKMASI----------LVG---------------EERVIQPQYQKWI

Query:  SKLLGYSFEVVYKSGLENKAANALSRVPATVHLNQLTAPKMIDIEIIKAEVERDEKLKKVMQKLSETEELEEGKYLIKQGILMYKGRMELSKTSRLIPTI
        +KLLGYSFEVVYK G+EN+AA+ALSR P  V L  L+ P  +D+E+IK EV +D K  +++++L + EEL+   Y +++G+LMYK R+ + + S LIP I
Subjt:  SKLLGYSFEVVYKSGLENKAANALSRVPATVHLNQLTAPKMIDIEIIKAEVERDEKLKKVMQKLSETEELEEGKYLIKQGILMYKGRMELSKTSRLIPTI

Query:  LHTYHDSVFGGHSGFLRTYKRMAAELHWEGMKKEVKKYCEECMICQRNKTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYG
        L T+H+S  GGHSGFLRTYKR+AAEL+W+GMK E+KK+CEEC+ CQRNKT+ALSPA LL P E+P  +W +ISMDFVEGLPK++GYEV+ VVVDR SKYG
Subjt:  LHTYHDSVFGGHSGFLRTYKRMAAELHWEGMKKEVKKYCEECMICQRNKTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYG

Query:  HFIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIENP-----LSQIETR---------RSCVAFVGKSQ------NNGLNGFL-------------------
        HF+P+KHP+T K+V E+FVKEV+RLHG+P SI +      LSQ  T+             A+  +S       N G+  +L                   
Subjt:  HFIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIENP-----LSQIETR---------RSCVAFVGKSQ------NNGLNGFL-------------------

Query:  -------GRNTGVTPFQAVYGRLPPPLVYYEERDTTNSTLDEQQKERDVVLGAL
                R+ G+TPFQ VYGR PP +V Y    + NST++E  +ERD++L +L
Subjt:  -------GRNTGVTPFQAVYGRLPPPLVYYEERDTTNSTLDEQQKERDVVLGAL

A0A5D3CXB1 Ty3/gypsy retrotransposon protein1.0e-30547.47Show/hide
Query:  EKITELSMRSTLMKNNDEGEGSFVSENEPKNEEKKAIEDANCDRNKFKKVEMPIFNGDDSDSWLFRAERYFQIHKLTDFEKMIVATISFEGPALNWYRSQ
        +++ E S   T M+     E +  + +   N E K  E    DR+KFKKVEMPIF G D DSWLFRA+RYF+IH L+D EK+ VA ISF+GPAL+WYRSQ
Subjt:  EKITELSMRSTLMKNNDEGEGSFVSENEPKNEEKKAIEDANCDRNKFKKVEMPIFNGDDSDSWLFRAERYFQIHKLTDFEKMIVATISFEGPALNWYRSQ

Query:  EERDKFIDWANLKERVLVWFRSTREGSLYGRFLRIQQITTMEEYRNLFDKLVVPLSDLPEKVVEETFMSGLKPWIHAEMDS--------------KVENI
        EER+ F  W +LK+++LV FRS R+G+L GRFL I+Q TT+EEYRN FDK + P++ L   V+EETFM+GL PW+ +E+++              K+EN 
Subjt:  EERDKFIDWANLKERVLVWFRSTREGSLYGRFLRIQQITTMEEYRNLFDKLVVPLSDLPEKVVEETFMSGLKPWIHAEMDS--------------KVENI

Query:  EDIRRESNLPGYSEGKLSNARSGNKASTIGNSGENKGNWNWPMRTITLRGTTGEEVRKTGPTKRLSDAEFQSRKEKG-----------------------
        E +R+E  L    + K +   +  +        EN     WPMRTITLR     E R+ GPTKRL+DAEFQ+R+EKG                       
Subjt:  EDIRRESNLPGYSEGKLSNARSGNKASTIGNSGENKGNWNWPMRTITLRGTTGEEVRKTGPTKRLSDAEFQSRKEKG-----------------------

Query:  --LLMGEN-EEYEIIEEE------EAKQKELNAVVVVQEEQTVVELSTNSVVGLSNSGTMKVKGKLLGREIIVLIDCGATHNFISESLVEELKINTKSTT
          LL+ EN EE EIIEEE      E K  E  +V     E   +ELS NSVVGL+N GTMKVKG+L   E++VLIDCGATHNFI+E LVE+L++  K T 
Subjt:  --LLMGEN-EEYEIIEEE------EAKQKELNAVVVVQEEQTVVELSTNSVVGLSNSGTMKVKGKLLGREIIVLIDCGATHNFISESLVEELKINTKSTT

Query:  NYGVILGSETAIKGKGVCEAIEMVLGEWRVVDDFLPLELGGMNAILGMQWLYSLGITE------------------------------------------
        NYGVILGS T+IKGKGVC+ +E+ LG  +V D FLPLELGG++AILGMQWL+S+G TE                                          
Subjt:  NYGVILGSETAIKGKGVCEAIEMVLGEWRVVDDFLPLELGGMNAILGMQWLYSLGITE------------------------------------------

Query:  --GFLAECRALERRESLEEEDSFDEVLTVE-ELVTVVLKSFEDVFDWPETLPPRR--------------------------KIEMERLVEEMLASGVIRS
          G+L ECR LE +E +E+ +   ++   E   +T +L  F DVF+ P  LPP+R                          K EMERLV+EML+SG+IR 
Subjt:  --GFLAECRALERRESLEEEDSFDEVLTVE-ELVTVVLKSFEDVFDWPETLPPRR--------------------------KIEMERLVEEMLASGVIRS

Query:  STSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQSG-------------------------------------
        STSPYS                 +Y ALNNVTVPDKFPIPV+EELFDELNG  +F+KIDL++G                                     
Subjt:  STSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQSG-------------------------------------

Query:  -----------------------------KGLKDHLNHLRAILEVLRKNELYANKKKCSFAQSRVDYLGHIISGDGVEVDPEKIRAIKEWPIPANVREVK
                                     KG+++H  HL  +LE+LR++ELYAN  KC FA++R+ YLGH IS  G+EVDPEKIRA+KEWP PANVRE++
Subjt:  -----------------------------KGLKDHLNHLRAILEVLRKNELYANKKKCSFAQSRVDYLGHIISGDGVEVDPEKIRAIKEWPIPANVREVK

Query:  GFLGLTGYYRKFVHNYGTIAAPLTQLLKIGGFKWTEEVQEAFNRLKQAMMTLPVLALPDFNVSFEIEIDASGYGLGA------------GANGSSNGRAK
        GFLGLTGYYR+FV NYG+I+APLTQLLK G +KWTEE + AF +LK+AMMTLPVLA+PDFN+ FEIE DASG+G+GA                S+  RA+
Subjt:  GFLGLTGYYRKFVHNYGTIAAPLTQLLKIGGFKWTEEVQEAFNRLKQAMMTLPVLALPDFNVSFEIEIDASGYGLGA------------GANGSSNGRAK

Query:  -------MASI---------LVG----------------EERVIQPQYQKWISKLLGYSFEVVYKSGLENKAANALSRVPATVHLNQLTAPKMIDIEIIK
               MA +         L+G                E+RVIQPQYQ+WI+KLLGYSFEV+YK GLENKAA+ALSR+  T HLNQLTAP ++D+E+I+
Subjt:  -------MASI---------LVG----------------EERVIQPQYQKWISKLLGYSFEVVYKSGLENKAANALSRVPATVHLNQLTAPKMIDIEIIK

Query:  AEVERDEKLKKVMQKLSETEELEEGKYLIKQGILMYKGRMELSKTSRLIPTILHTYHDSVFGGHSGFLRTYKRMAAELHWEGMKKEVKKYCEECMICQRN
         EV +D  L++++  + E + +E   Y   QGIL +KGR+ LSK S LIPTI+HTYHDSVFGGHSGFLRTYKRMA EL+W+GMKK+V+KYCEECMICQ+N
Subjt:  AEVERDEKLKKVMQKLSETEELEEGKYLIKQGILMYKGRMELSKTSRLIPTILHTYHDSVFGGHSGFLRTYKRMAAELHWEGMKKEVKKYCEECMICQRN

Query:  KTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYGHFIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIENPLSQI---------
        K+ ALSPA LL PLE+P+ +W DISMDF+EGLPK+ G+EV+ VVVDR SKY HF+ +KHPYT K V+EVFVKEV+RLHG+PKSI +   +I         
Subjt:  KTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYGHFIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIENPLSQI---------

Query:  ------ETRRSC---------VAFVGKSQNNGLNGFLGR----------------------NTGVTPFQAVYGRLPPPLVYYEERDTTNSTLDEQQKERD
              +  RS             V KS    L  F G                       + G+TPFQAVYGRLPPPL+ Y E +T NSTLD+Q ++RD
Subjt:  ------ETRRSC---------VAFVGKSQNNGLNGFLGR----------------------NTGVTPFQAVYGRLPPPLVYYEERDTTNSTLDEQQKERD

Query:  VVLGAL
        VVLGAL
Subjt:  VVLGAL

A0A5D3DI73 Ty3/gypsy retrotransposon protein1.0e-30547.47Show/hide
Query:  EKITELSMRSTLMKNNDEGEGSFVSENEPKNEEKKAIEDANCDRNKFKKVEMPIFNGDDSDSWLFRAERYFQIHKLTDFEKMIVATISFEGPALNWYRSQ
        +++ E S   T M+     E +  + +   N E K  E    DR+KFKKVEMPIF G D DSWLFRA+RYF+IH L+D EK+ VA ISF+GPAL+WYRSQ
Subjt:  EKITELSMRSTLMKNNDEGEGSFVSENEPKNEEKKAIEDANCDRNKFKKVEMPIFNGDDSDSWLFRAERYFQIHKLTDFEKMIVATISFEGPALNWYRSQ

Query:  EERDKFIDWANLKERVLVWFRSTREGSLYGRFLRIQQITTMEEYRNLFDKLVVPLSDLPEKVVEETFMSGLKPWIHAEMDS--------------KVENI
        EER+ F  W +LK+++LV FRS R+G+L GRFL I+Q TT+EEYRN FDK + P++ L   V+EETFM+GL PW+ +E+++              K+EN 
Subjt:  EERDKFIDWANLKERVLVWFRSTREGSLYGRFLRIQQITTMEEYRNLFDKLVVPLSDLPEKVVEETFMSGLKPWIHAEMDS--------------KVENI

Query:  EDIRRESNLPGYSEGKLSNARSGNKASTIGNSGENKGNWNWPMRTITLRGTTGEEVRKTGPTKRLSDAEFQSRKEKG-----------------------
        E +R+E  L    + K +   +  +        EN     WPMRTITLR     E R+ GPTKRL+DAEFQ+R+EKG                       
Subjt:  EDIRRESNLPGYSEGKLSNARSGNKASTIGNSGENKGNWNWPMRTITLRGTTGEEVRKTGPTKRLSDAEFQSRKEKG-----------------------

Query:  --LLMGEN-EEYEIIEEE------EAKQKELNAVVVVQEEQTVVELSTNSVVGLSNSGTMKVKGKLLGREIIVLIDCGATHNFISESLVEELKINTKSTT
          LL+ EN EE EIIEEE      E K  E  +V     E   +ELS NSVVGL+N GTMKVKG+L   E++VLIDCGATHNFI+E LVE+L++  K T 
Subjt:  --LLMGEN-EEYEIIEEE------EAKQKELNAVVVVQEEQTVVELSTNSVVGLSNSGTMKVKGKLLGREIIVLIDCGATHNFISESLVEELKINTKSTT

Query:  NYGVILGSETAIKGKGVCEAIEMVLGEWRVVDDFLPLELGGMNAILGMQWLYSLGITE------------------------------------------
        NYGVILGS T+IKGKGVC+ +E+ LG  +V D FLPLELGG++AILGMQWL+S+G TE                                          
Subjt:  NYGVILGSETAIKGKGVCEAIEMVLGEWRVVDDFLPLELGGMNAILGMQWLYSLGITE------------------------------------------

Query:  --GFLAECRALERRESLEEEDSFDEVLTVE-ELVTVVLKSFEDVFDWPETLPPRR--------------------------KIEMERLVEEMLASGVIRS
          G+L ECR LE +E +E+ +   ++   E   +T +L  F DVF+ P  LPP+R                          K EMERLV+EML+SG+IR 
Subjt:  --GFLAECRALERRESLEEEDSFDEVLTVE-ELVTVVLKSFEDVFDWPETLPPRR--------------------------KIEMERLVEEMLASGVIRS

Query:  STSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQSG-------------------------------------
        STSPYS                 +Y ALNNVTVPDKFPIPV+EELFDELNG  +F+KIDL++G                                     
Subjt:  STSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQSG-------------------------------------

Query:  -----------------------------KGLKDHLNHLRAILEVLRKNELYANKKKCSFAQSRVDYLGHIISGDGVEVDPEKIRAIKEWPIPANVREVK
                                     KG+++H  HL  +LE+LR++ELYAN  KC FA++R+ YLGH IS  G+EVDPEKIRA+KEWP PANVRE++
Subjt:  -----------------------------KGLKDHLNHLRAILEVLRKNELYANKKKCSFAQSRVDYLGHIISGDGVEVDPEKIRAIKEWPIPANVREVK

Query:  GFLGLTGYYRKFVHNYGTIAAPLTQLLKIGGFKWTEEVQEAFNRLKQAMMTLPVLALPDFNVSFEIEIDASGYGLGA------------GANGSSNGRAK
        GFLGLTGYYR+FV NYG+I+APLTQLLK G +KWTEE + AF +LK+AMMTLPVLA+PDFN+ FEIE DASG+G+GA                S+  RA+
Subjt:  GFLGLTGYYRKFVHNYGTIAAPLTQLLKIGGFKWTEEVQEAFNRLKQAMMTLPVLALPDFNVSFEIEIDASGYGLGA------------GANGSSNGRAK

Query:  -------MASI---------LVG----------------EERVIQPQYQKWISKLLGYSFEVVYKSGLENKAANALSRVPATVHLNQLTAPKMIDIEIIK
               MA +         L+G                E+RVIQPQYQ+WI+KLLGYSFEV+YK GLENKAA+ALSR+  T HLNQLTAP ++D+E+I+
Subjt:  -------MASI---------LVG----------------EERVIQPQYQKWISKLLGYSFEVVYKSGLENKAANALSRVPATVHLNQLTAPKMIDIEIIK

Query:  AEVERDEKLKKVMQKLSETEELEEGKYLIKQGILMYKGRMELSKTSRLIPTILHTYHDSVFGGHSGFLRTYKRMAAELHWEGMKKEVKKYCEECMICQRN
         EV +D  L++++  + E + +E   Y   QGIL +KGR+ LSK S LIPTI+HTYHDSVFGGHSGFLRTYKRMA EL+W+GMKK+V+KYCEECMICQ+N
Subjt:  AEVERDEKLKKVMQKLSETEELEEGKYLIKQGILMYKGRMELSKTSRLIPTILHTYHDSVFGGHSGFLRTYKRMAAELHWEGMKKEVKKYCEECMICQRN

Query:  KTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYGHFIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIENPLSQI---------
        K+ ALSPA LL PLE+P+ +W DISMDF+EGLPK+ G+EV+ VVVDR SKY HF+ +KHPYT K V+EVFVKEV+RLHG+PKSI +   +I         
Subjt:  KTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYGHFIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIENPLSQI---------

Query:  ------ETRRSC---------VAFVGKSQNNGLNGFLGR----------------------NTGVTPFQAVYGRLPPPLVYYEERDTTNSTLDEQQKERD
              +  RS             V KS    L  F G                       + G+TPFQAVYGRLPPPL+ Y E +T NSTLD+Q ++RD
Subjt:  ------ETRRSC---------VAFVGKSQNNGLNGFLGR----------------------NTGVTPFQAVYGRLPPPLVYYEERDTTNSTLDEQQKERD

Query:  VVLGAL
        VVLGAL
Subjt:  VVLGAL

A0A5D3DLL9 Ty3/gypsy retrotransposon protein6.4e-30846.4Show/hide
Query:  MESAAQERSTMNEKITELSMRSTLMKNNDEGEGSFVSENE------PKNEEKKAIEDANC-DRNKFKKVEMPIFNGDDSDSWLFRAERYFQIHKLTDFEK
        +E+   ERST +  +TE + +        EG+ S     E      P   E+++  D +  DRNKFKK+EMP+F G+D DSWLFRAERYFQIHKLTD EK
Subjt:  MESAAQERSTMNEKITELSMRSTLMKNNDEGEGSFVSENE------PKNEEKKAIEDANC-DRNKFKKVEMPIFNGDDSDSWLFRAERYFQIHKLTDFEK

Query:  MIVATISFEGPALNWYRSQEERDKFIDWANLKERVLVWFRSTREGSLYGRFLRIQQITTMEEYRNLFDKLVVPLSDLPEKVVEETFMSGLKPWIHA----
        M+V+T+SF+GPALNW+RSQEERDKF  WAN+KER+LV FRS ++G++ G+FLRI+Q  T+EEY NLFDK+V P++DLPE+V+ +TFM+GL PW+ +    
Subjt:  MIVATISFEGPALNWYRSQEERDKFIDWANLKERVLVWFRSTREGSLYGRFLRIQQITTMEEYRNLFDKLVVPLSDLPEKVVEETFMSGLKPWIHA----

Query:  ----------EMDSKVENIEDIRRESNLPGYSEGKLSNARS-GNKASTIGNSGENKGNWNWPMRTITLRGTTGEEVRKTGPTKRLSDAEFQSRKEKGLLM
                  E+   VEN E  R E+ + GYS GK++   S   K ST G +G+NK N  +P+RTITLR +   E R+ G  KRL DAEFQ+RKEKGL  
Subjt:  ----------EMDSKVENIEDIRRESNLPGYSEGKLSNARS-GNKASTIGNSGENKGNWNWPMRTITLRGTTGEEVRKTGPTKRLSDAEFQSRKEKGLLM

Query:  GEN--------------------------EEYEIIEEEEAKQKELNAVVVVQEEQTVVELSTNSVVGLSNSGTMKVKGKLLGREIIVLIDCGATHNFISE
          N                          EEYEI+E+E+ ++KEL  + + +   TVVELS NSVVGL++ GTMKV+GKLLG E+++LIDCGATHNF+SE
Subjt:  GEN--------------------------EEYEIIEEEEAKQKELNAVVVVQEEQTVVELSTNSVVGLSNSGTMKVKGKLLGREIIVLIDCGATHNFISE

Query:  SLVEELKINTKSTTNYGVILGSETAIKGKGVCEAIEMVLGEWRVVDDFLPLELGGMNAILGMQWLYSLGIT-----------------------------
         LV +L +  K T++YGVILGS  A++GKGVCE +E+ L +W++V+DFLPLELGG++ ILGMQWLYSLG+T                             
Subjt:  SLVEELKINTKSTTNYGVILGSETAIKGKGVCEAIEMVLGEWRVVDDFLPLELGGMNAILGMQWLYSLGIT-----------------------------

Query:  ---------------EGFLAECRALERRESLEEEDSFDEVLTVEE--LVTVVLKSFEDVFDWPETLPPRRKI--------------------------EM
                        GFL ECR+L+ R ++E+++ +     VE    +  V+K F DVF+WPE LPPRR+I                          EM
Subjt:  ---------------EGFLAECRALERRESLEEEDSFDEVLTVEE--LVTVVLKSFEDVFDWPETLPPRRKI--------------------------EM

Query:  ERLVEEMLASGVIRSSTSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQSG----------------------
        E+LV+EML SGVIR S SPYS                 +Y A+NN T+PDKFPIPV+EELFDEL G T+F+KIDL+SG                      
Subjt:  ERLVEEMLASGVIRSSTSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQSG----------------------

Query:  --------------------------------------------KGLKDHLNHLRAILEVLRKNELYANKKKCSFAQSRVDYLGHIISGDGVEVDPEKIR
                                                    +  ++H  H+R +L VLR +EL+AN+KKC FAQ +++YLGH+ISG+GV VDPEKI+
Subjt:  --------------------------------------------KGLKDHLNHLRAILEVLRKNELYANKKKCSFAQSRVDYLGHIISGDGVEVDPEKIR

Query:  AIKEWPIPANVREVKGFLGLTGYYRKFVHNYGTIAAPLTQLLKIGGFKWTEEVQEAFNRLKQAMMTLPVLALPDFNVSFEIEIDASGYGLGA--------
        AI +WP P NV+E +GFLGLTGYYR+FV +YGTIAAPLTQLLK GGF WTEE   AFNRLK AMM+LPVLALPDF+  FEIE DASGYG+GA        
Subjt:  AIKEWPIPANVREVKGFLGLTGYYRKFVHNYGTIAAPLTQLLKIGGFKWTEEVQEAFNRLKQAMMTLPVLALPDFNVSFEIEIDASGYGLGA--------

Query:  -----------GANGSSNGRAKMASI----------LVG---------------EERVIQPQYQKWISKLLGYSFEVVYKSGLENKAANALSRVPATVHL
                     +     R  MA +          L+G               ++R+IQPQYQKWI+KLLGYSFEVVYK G+EN+AA+ALSR P  V L
Subjt:  -----------GANGSSNGRAKMASI----------LVG---------------EERVIQPQYQKWISKLLGYSFEVVYKSGLENKAANALSRVPATVHL

Query:  NQLTAPKMIDIEIIKAEVERDEKLKKVMQKLSETEELEEGKYLIKQGILMYKGRMELSKTSRLIPTILHTYHDSVFGGHSGFLRTYKRMAAELHWEGMKK
          L+ P  +D+EI+K EV +D K +K+++++ + EELE   Y +K+G+LM+K R+ + K S LIP IL T+H+S  GGHSGFLRTYKR+AAEL+W GMK 
Subjt:  NQLTAPKMIDIEIIKAEVERDEKLKKVMQKLSETEELEEGKYLIKQGILMYKGRMELSKTSRLIPTILHTYHDSVFGGHSGFLRTYKRMAAELHWEGMKK

Query:  EVKKYCEECMICQRNKTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYGHFIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIE
        ++KK+CEEC++CQR+KTLALSPA LL PLE+P  +W DISMDFVEGLPK++GYEV+ VVVDR SKYGHF+P+KHPYT K+V+E+FVKE++RLHG+P SI 
Subjt:  EVKKYCEECMICQRNKTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYGHFIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIE

Query:  NP-----LSQIETR---------RSCVAFVGKSQ------NNGLNGFL--------------------------GRNTGVTPFQAVYGRLPPPLVYYEER
        +      LSQ  T              A+  +S       N G+  +L                           R+ G+TPFQ VYGR PP ++ Y   
Subjt:  NP-----LSQIETR---------RSCVAFVGKSQ------NNGLNGFL--------------------------GRNTGVTPFQAVYGRLPPPLVYYEER

Query:  DTTNSTLDEQQKERDVVLGAL
         + NST++E   ERD+VLG+L
Subjt:  DTTNSTLDEQQKERDVVLGAL

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein8.2e-5025.54Show/hide
Query:  LPPRRKIEMERLVEEMLASGVIRSSTSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQS--------------
        LPP +   M   + + L SG+IR S +  +                 +Y  LN    P+ +P+P+IE+L  ++ G+TIFTK+DL+S              
Subjt:  LPPRRKIEMERLVEEMLASGVIRSSTSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQS--------------

Query:  ----------------------------------------------------GKGLKDHLNHLRAILEVLRKNELYANKKKCSFAQSRVDYLGHIISGDG
                                                             K   +H+ H++ +L+ L+   L  N+ KC F QS+V ++G+ IS  G
Subjt:  ----------------------------------------------------GKGLKDHLNHLRAILEVLRKNELYANKKKCSFAQSRVDYLGHIISGDG

Query:  VEVDPEKIRAIKEWPIPANVREVKGFLGLTGYYRKFVHNYGTIAAPLTQLLKIG-GFKWTEEVQEAFNRLKQAMMTLPVLALPDFNVSFEIEIDASGYGL
             E I  + +W  P N +E++ FLG   Y RKF+     +  PL  LLK    +KWT    +A   +KQ +++ PVL   DF+    +E DAS   +
Subjt:  VEVDPEKIRAIKEWPIPANVREVKGFLGLTGYYRKFVHNYGTIAAPLTQLLKIG-GFKWTEEVQEAFNRLKQAMMTLPVLALPDFNVSFEIEIDASGYGL

Query:  G------------------------AGANGSSNGRAKMASI-----------------------------LVGEERVIQPQYQKWISKLLGYSFEVVYKS
        G                        A  N S + +  +A I                             +  E      +  +W   L  ++FE+ Y+ 
Subjt:  G------------------------AGANGSSNGRAKMASI-----------------------------LVGEERVIQPQYQKWISKLLGYSFEVVYKS

Query:  GLENKAANALSR-------VPATVHLNQLTAPKMIDI-----EIIKAEVERDEKLKKVMQKLSETEELEEGKYLIKQGILM-YKGRMELSKTSRLIPTIL
        G  N  A+ALSR       +P     N +     I I       +  E   D KL  ++   +E + +EE   L K G+L+  K ++ L   ++L  TI+
Subjt:  GLENKAANALSR-------VPATVHLNQLTAPKMIDI-----EIIKAEVERDEKLKKVMQKLSETEELEEGKYLIKQGILM-YKGRMELSKTSRLIPTIL

Query:  HTYHDSVFGGHSGFLRTYKRMAAELHWEGMKKEVKKYCEECMICQRNKTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYGH
          YH+     H G       +     W+G++K++++Y + C  CQ NK+    P   L P+    R WE +SMDF+  LP+++GY  LFVVVDRFSK   
Subjt:  HTYHDSVFGGHSGFLRTYKRMAAELHWEGMKKEVKKYCEECMICQRNKTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYGH

Query:  FIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIENPLSQIETRRSCVAFVGK
         +P     T +  + +F + V+   G PK I      I T ++   F  K
Subjt:  FIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIENPLSQIETRRSCVAFVGK

P0CT35 Transposon Tf2-2 polyprotein8.2e-5025.54Show/hide
Query:  LPPRRKIEMERLVEEMLASGVIRSSTSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQS--------------
        LPP +   M   + + L SG+IR S +  +                 +Y  LN    P+ +P+P+IE+L  ++ G+TIFTK+DL+S              
Subjt:  LPPRRKIEMERLVEEMLASGVIRSSTSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQS--------------

Query:  ----------------------------------------------------GKGLKDHLNHLRAILEVLRKNELYANKKKCSFAQSRVDYLGHIISGDG
                                                             K   +H+ H++ +L+ L+   L  N+ KC F QS+V ++G+ IS  G
Subjt:  ----------------------------------------------------GKGLKDHLNHLRAILEVLRKNELYANKKKCSFAQSRVDYLGHIISGDG

Query:  VEVDPEKIRAIKEWPIPANVREVKGFLGLTGYYRKFVHNYGTIAAPLTQLLKIG-GFKWTEEVQEAFNRLKQAMMTLPVLALPDFNVSFEIEIDASGYGL
             E I  + +W  P N +E++ FLG   Y RKF+     +  PL  LLK    +KWT    +A   +KQ +++ PVL   DF+    +E DAS   +
Subjt:  VEVDPEKIRAIKEWPIPANVREVKGFLGLTGYYRKFVHNYGTIAAPLTQLLKIG-GFKWTEEVQEAFNRLKQAMMTLPVLALPDFNVSFEIEIDASGYGL

Query:  G------------------------AGANGSSNGRAKMASI-----------------------------LVGEERVIQPQYQKWISKLLGYSFEVVYKS
        G                        A  N S + +  +A I                             +  E      +  +W   L  ++FE+ Y+ 
Subjt:  G------------------------AGANGSSNGRAKMASI-----------------------------LVGEERVIQPQYQKWISKLLGYSFEVVYKS

Query:  GLENKAANALSR-------VPATVHLNQLTAPKMIDI-----EIIKAEVERDEKLKKVMQKLSETEELEEGKYLIKQGILM-YKGRMELSKTSRLIPTIL
        G  N  A+ALSR       +P     N +     I I       +  E   D KL  ++   +E + +EE   L K G+L+  K ++ L   ++L  TI+
Subjt:  GLENKAANALSR-------VPATVHLNQLTAPKMIDI-----EIIKAEVERDEKLKKVMQKLSETEELEEGKYLIKQGILM-YKGRMELSKTSRLIPTIL

Query:  HTYHDSVFGGHSGFLRTYKRMAAELHWEGMKKEVKKYCEECMICQRNKTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYGH
          YH+     H G       +     W+G++K++++Y + C  CQ NK+    P   L P+    R WE +SMDF+  LP+++GY  LFVVVDRFSK   
Subjt:  HTYHDSVFGGHSGFLRTYKRMAAELHWEGMKKEVKKYCEECMICQRNKTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYGH

Query:  FIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIENPLSQIETRRSCVAFVGK
         +P     T +  + +F + V+   G PK I      I T ++   F  K
Subjt:  FIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIENPLSQIETRRSCVAFVGK

P0CT36 Transposon Tf2-3 polyprotein8.2e-5025.54Show/hide
Query:  LPPRRKIEMERLVEEMLASGVIRSSTSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQS--------------
        LPP +   M   + + L SG+IR S +  +                 +Y  LN    P+ +P+P+IE+L  ++ G+TIFTK+DL+S              
Subjt:  LPPRRKIEMERLVEEMLASGVIRSSTSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQS--------------

Query:  ----------------------------------------------------GKGLKDHLNHLRAILEVLRKNELYANKKKCSFAQSRVDYLGHIISGDG
                                                             K   +H+ H++ +L+ L+   L  N+ KC F QS+V ++G+ IS  G
Subjt:  ----------------------------------------------------GKGLKDHLNHLRAILEVLRKNELYANKKKCSFAQSRVDYLGHIISGDG

Query:  VEVDPEKIRAIKEWPIPANVREVKGFLGLTGYYRKFVHNYGTIAAPLTQLLKIG-GFKWTEEVQEAFNRLKQAMMTLPVLALPDFNVSFEIEIDASGYGL
             E I  + +W  P N +E++ FLG   Y RKF+     +  PL  LLK    +KWT    +A   +KQ +++ PVL   DF+    +E DAS   +
Subjt:  VEVDPEKIRAIKEWPIPANVREVKGFLGLTGYYRKFVHNYGTIAAPLTQLLKIG-GFKWTEEVQEAFNRLKQAMMTLPVLALPDFNVSFEIEIDASGYGL

Query:  G------------------------AGANGSSNGRAKMASI-----------------------------LVGEERVIQPQYQKWISKLLGYSFEVVYKS
        G                        A  N S + +  +A I                             +  E      +  +W   L  ++FE+ Y+ 
Subjt:  G------------------------AGANGSSNGRAKMASI-----------------------------LVGEERVIQPQYQKWISKLLGYSFEVVYKS

Query:  GLENKAANALSR-------VPATVHLNQLTAPKMIDI-----EIIKAEVERDEKLKKVMQKLSETEELEEGKYLIKQGILM-YKGRMELSKTSRLIPTIL
        G  N  A+ALSR       +P     N +     I I       +  E   D KL  ++   +E + +EE   L K G+L+  K ++ L   ++L  TI+
Subjt:  GLENKAANALSR-------VPATVHLNQLTAPKMIDI-----EIIKAEVERDEKLKKVMQKLSETEELEEGKYLIKQGILM-YKGRMELSKTSRLIPTIL

Query:  HTYHDSVFGGHSGFLRTYKRMAAELHWEGMKKEVKKYCEECMICQRNKTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYGH
          YH+     H G       +     W+G++K++++Y + C  CQ NK+    P   L P+    R WE +SMDF+  LP+++GY  LFVVVDRFSK   
Subjt:  HTYHDSVFGGHSGFLRTYKRMAAELHWEGMKKEVKKYCEECMICQRNKTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYGH

Query:  FIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIENPLSQIETRRSCVAFVGK
         +P     T +  + +F + V+   G PK I      I T ++   F  K
Subjt:  FIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIENPLSQIETRRSCVAFVGK

P0CT41 Transposon Tf2-12 polyprotein8.2e-5025.54Show/hide
Query:  LPPRRKIEMERLVEEMLASGVIRSSTSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQS--------------
        LPP +   M   + + L SG+IR S +  +                 +Y  LN    P+ +P+P+IE+L  ++ G+TIFTK+DL+S              
Subjt:  LPPRRKIEMERLVEEMLASGVIRSSTSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQS--------------

Query:  ----------------------------------------------------GKGLKDHLNHLRAILEVLRKNELYANKKKCSFAQSRVDYLGHIISGDG
                                                             K   +H+ H++ +L+ L+   L  N+ KC F QS+V ++G+ IS  G
Subjt:  ----------------------------------------------------GKGLKDHLNHLRAILEVLRKNELYANKKKCSFAQSRVDYLGHIISGDG

Query:  VEVDPEKIRAIKEWPIPANVREVKGFLGLTGYYRKFVHNYGTIAAPLTQLLKIG-GFKWTEEVQEAFNRLKQAMMTLPVLALPDFNVSFEIEIDASGYGL
             E I  + +W  P N +E++ FLG   Y RKF+     +  PL  LLK    +KWT    +A   +KQ +++ PVL   DF+    +E DAS   +
Subjt:  VEVDPEKIRAIKEWPIPANVREVKGFLGLTGYYRKFVHNYGTIAAPLTQLLKIG-GFKWTEEVQEAFNRLKQAMMTLPVLALPDFNVSFEIEIDASGYGL

Query:  G------------------------AGANGSSNGRAKMASI-----------------------------LVGEERVIQPQYQKWISKLLGYSFEVVYKS
        G                        A  N S + +  +A I                             +  E      +  +W   L  ++FE+ Y+ 
Subjt:  G------------------------AGANGSSNGRAKMASI-----------------------------LVGEERVIQPQYQKWISKLLGYSFEVVYKS

Query:  GLENKAANALSR-------VPATVHLNQLTAPKMIDI-----EIIKAEVERDEKLKKVMQKLSETEELEEGKYLIKQGILM-YKGRMELSKTSRLIPTIL
        G  N  A+ALSR       +P     N +     I I       +  E   D KL  ++   +E + +EE   L K G+L+  K ++ L   ++L  TI+
Subjt:  GLENKAANALSR-------VPATVHLNQLTAPKMIDI-----EIIKAEVERDEKLKKVMQKLSETEELEEGKYLIKQGILM-YKGRMELSKTSRLIPTIL

Query:  HTYHDSVFGGHSGFLRTYKRMAAELHWEGMKKEVKKYCEECMICQRNKTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYGH
          YH+     H G       +     W+G++K++++Y + C  CQ NK+    P   L P+    R WE +SMDF+  LP+++GY  LFVVVDRFSK   
Subjt:  HTYHDSVFGGHSGFLRTYKRMAAELHWEGMKKEVKKYCEECMICQRNKTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYGH

Query:  FIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIENPLSQIETRRSCVAFVGK
         +P     T +  + +F + V+   G PK I      I T ++   F  K
Subjt:  FIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIENPLSQIETRRSCVAFVGK

Q9UR07 Transposon Tf2-11 polyprotein8.2e-5025.54Show/hide
Query:  LPPRRKIEMERLVEEMLASGVIRSSTSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQS--------------
        LPP +   M   + + L SG+IR S +  +                 +Y  LN    P+ +P+P+IE+L  ++ G+TIFTK+DL+S              
Subjt:  LPPRRKIEMERLVEEMLASGVIRSSTSPYS-----------------NYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQS--------------

Query:  ----------------------------------------------------GKGLKDHLNHLRAILEVLRKNELYANKKKCSFAQSRVDYLGHIISGDG
                                                             K   +H+ H++ +L+ L+   L  N+ KC F QS+V ++G+ IS  G
Subjt:  ----------------------------------------------------GKGLKDHLNHLRAILEVLRKNELYANKKKCSFAQSRVDYLGHIISGDG

Query:  VEVDPEKIRAIKEWPIPANVREVKGFLGLTGYYRKFVHNYGTIAAPLTQLLKIG-GFKWTEEVQEAFNRLKQAMMTLPVLALPDFNVSFEIEIDASGYGL
             E I  + +W  P N +E++ FLG   Y RKF+     +  PL  LLK    +KWT    +A   +KQ +++ PVL   DF+    +E DAS   +
Subjt:  VEVDPEKIRAIKEWPIPANVREVKGFLGLTGYYRKFVHNYGTIAAPLTQLLKIG-GFKWTEEVQEAFNRLKQAMMTLPVLALPDFNVSFEIEIDASGYGL

Query:  G------------------------AGANGSSNGRAKMASI-----------------------------LVGEERVIQPQYQKWISKLLGYSFEVVYKS
        G                        A  N S + +  +A I                             +  E      +  +W   L  ++FE+ Y+ 
Subjt:  G------------------------AGANGSSNGRAKMASI-----------------------------LVGEERVIQPQYQKWISKLLGYSFEVVYKS

Query:  GLENKAANALSR-------VPATVHLNQLTAPKMIDI-----EIIKAEVERDEKLKKVMQKLSETEELEEGKYLIKQGILM-YKGRMELSKTSRLIPTIL
        G  N  A+ALSR       +P     N +     I I       +  E   D KL  ++   +E + +EE   L K G+L+  K ++ L   ++L  TI+
Subjt:  GLENKAANALSR-------VPATVHLNQLTAPKMIDI-----EIIKAEVERDEKLKKVMQKLSETEELEEGKYLIKQGILM-YKGRMELSKTSRLIPTIL

Query:  HTYHDSVFGGHSGFLRTYKRMAAELHWEGMKKEVKKYCEECMICQRNKTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYGH
          YH+     H G       +     W+G++K++++Y + C  CQ NK+    P   L P+    R WE +SMDF+  LP+++GY  LFVVVDRFSK   
Subjt:  HTYHDSVFGGHSGFLRTYKRMAAELHWEGMKKEVKKYCEECMICQRNKTLALSPARLLTPLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYGH

Query:  FIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIENPLSQIETRRSCVAFVGK
         +P     T +  + +F + V+   G PK I      I T ++   F  K
Subjt:  FIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIENPLSQIETRRSCVAFVGK

Arabidopsis top hitse value%identityAlignment
AT3G29750.1 Eukaryotic aspartyl protease family protein2.9e-1029.08Show/hide
Query:  MRTITLRGTTGEEVRKTGPTKRLSDA----------EFQSRK-EKGLLMGENEEYEIIEEEEAKQKELNAVVVVQEEQTVVELSTNSVVGLSNSGTMKVK
        +R++TL G   EE+   G    L  A           +QSR+ E   L     + +++++++    EL    + Q+  T+ +     V+ L+ +  M+  
Subjt:  MRTITLRGTTGEEVRKTGPTKRLSDA----------EFQSRK-EKGLLMGENEEYEIIEEEEAKQKELNAVVVVQEEQTVVELSTNSVVGLSNSGTMKVK

Query:  GKLLGREIIVLIDCGATHNFISESLVEELKINTKSTTNYGVILGSETAIKGKGVCEAIEMVLGEWRVVDDFLPLELG--GMNAILGMQWLYSLGIT
        G +L  +++V ID GAT NFI   L   LK+ T  T    V+LG    I+  G C  I + + E  + ++FL L+L    ++ ILG +WL  LG T
Subjt:  GKLLGREIIVLIDCGATHNFISESLVEELKINTKSTTNYGVILGSETAIKGKGVCEAIEMVLGEWRVVDDFLPLELG--GMNAILGMQWLYSLGIT

AT3G30770.1 Eukaryotic aspartyl protease family protein3.6e-0829.81Show/hide
Query:  VVQEEQTVVELSTNSVVGLSNSGTMKVKGKLLGREIIVLIDCGATHNFISESLVEELKINTKSTTNYGVILGSETAIKGKGVCEAIEMVLGEWRVVDDFL
        ++++ +T+ ++   S    +    M+  G +   +++V+ID GAT+NFIS+ L   LK+ T +T    V+LG    I+  G C  I +++ E  + ++FL
Subjt:  VVQEEQTVVELSTNSVVGLSNSGTMKVKGKLLGREIIVLIDCGATHNFISESLVEELKINTKSTTNYGVILGSETAIKGKGVCEAIEMVLGEWRVVDDFL

Query:  PLEL
         L+L
Subjt:  PLEL

AT3G42723.1 aminoacyl-tRNA ligases;ATP binding;nucleotide binding3.1e-0421.74Show/hide
Query:  IEDANC-DRNKFKKVEMPIFNGDDSDSWLFRAERYFQIHKLTDFEKMIVATISFEGPALNWYRSQEERDKFIDWANLKERVLVWFRSTREGSLYGRFLRI
        ++D++C + ++  +V  P+   ++    L   E YF  + + + E++ +   + EG    W +   +++    W   K  +    ++T + +    +  I
Subjt:  IEDANC-DRNKFKKVEMPIFNGDDSDSWLFRAERYFQIHKLTDFEKMIVATISFEGPALNWYRSQEERDKFIDWANLKERVLVWFRSTREGSLYGRFLRI

Query:  QQITTMEEYRNLFDKLVVPLSDLPEKVVEETFMSGLKP
        QQ  ++ EYR  F+ L +    LP + +E  F+ GL+P
Subjt:  QQITTMEEYRNLFDKLVVPLSDLPEKVVEETFMSGLKP

ATMG00860.1 DNA/RNA polymerases superfamily protein7.4e-3854.96Show/hide
Query:  LNHLRAILEVLRKNELYANKKKCSFAQSRVDYLG--HIISGDGVEVDPEKIRAIKEWPIPANVREVKGFLGLTGYYRKFVHNYGTIAAPLTQLLKIGGFK
        +NHL  +L++  +++ YAN+KKC+F Q ++ YLG  HIISG+GV  DP K+ A+  WP P N  E++GFLGLTGYYR+FV NYG I  PLT+LLK    K
Subjt:  LNHLRAILEVLRKNELYANKKKCSFAQSRVDYLG--HIISGDGVEVDPEKIRAIKEWPIPANVREVKGFLGLTGYYRKFVHNYGTIAAPLTQLLKIGGFK

Query:  WTEEVQEAFNRLKQAMMTLPVLALPDFNVSF
        WTE    AF  LK A+ TLPVLALPD  + F
Subjt:  WTEEVQEAFNRLKQAMMTLPVLALPDFNVSF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTCGGCCGCCCAAGAGAGATCAACGATGAATGAAAAGATTACTGAGTTATCGATGCGGAGTACTTTGATGAAGAATAACGACGAAGGCGAAGGATCTTTCGTTAG
CGAAAATGAACCGAAGAATGAGGAGAAGAAAGCAATTGAAGATGCTAACTGTGATCGCAACAAGTTCAAGAAAGTGGAGATGCCAATCTTCAATGGAGACGATTCAGATT
CGTGGTTATTTCGCGCCGAAAGGTATTTTCAAATCCATAAGCTTACTGACTTCGAAAAAATGATTGTAGCAACGATTAGCTTCGAAGGACCCGCGCTGAATTGGTATCGG
TCTCAAGAAGAGCGAGATAAATTCATTGATTGGGCAAACTTAAAGGAAAGAGTGTTAGTCTGGTTCCGATCAACGAGGGAAGGCTCTTTATACGGCCGATTCTTGCGCAT
ACAACAGATAACCACAATGGAGGAATACCGAAATTTGTTTGATAAATTAGTAGTGCCTCTGTCGGATCTGCCCGAGAAAGTGGTAGAAGAGACTTTCATGTCCGGATTGA
AGCCATGGATACATGCTGAGATGGATTCTAAAGTGGAAAACATAGAAGATATTAGAAGAGAATCAAATCTTCCTGGTTATTCGGAAGGGAAATTGAGTAATGCGAGAAGT
GGAAACAAAGCTAGTACGATTGGAAATTCTGGAGAAAATAAGGGAAATTGGAACTGGCCGATGAGAACTATCACGTTGAGAGGAACAACTGGTGAAGAAGTACGAAAAAC
AGGTCCAACAAAAAGGTTGTCTGATGCCGAATTTCAATCGCGAAAAGAGAAGGGACTATTGATGGGGGAGAATGAGGAATATGAAATTATTGAAGAAGAAGAAGCAAAAC
AGAAAGAATTGAATGCGGTTGTAGTTGTCCAAGAAGAACAGACTGTGGTGGAGTTGTCGACAAATTCAGTAGTAGGATTATCGAATTCGGGAACTATGAAGGTGAAGGGA
AAATTACTAGGAAGGGAGATCATTGTTCTAATAGATTGCGGAGCCACACATAACTTCATTTCAGAAAGCTTAGTTGAGGAGTTGAAGATAAACACGAAGAGTACTACAAA
TTATGGAGTCATATTAGGCTCTGAAACTGCTATAAAAGGGAAGGGAGTTTGTGAAGCCATTGAGATGGTGTTGGGGGAATGGAGAGTGGTTGATGACTTCTTGCCATTGG
AGTTAGGAGGGATGAATGCAATATTGGGAATGCAATGGTTATATTCTCTTGGTATAACTGAGGGATTTTTAGCAGAATGTAGAGCGTTGGAAAGGAGGGAGTCTTTAGAA
GAGGAGGATTCGTTTGATGAAGTTTTGACTGTAGAAGAGTTAGTGACAGTGGTATTGAAAAGTTTCGAGGATGTCTTTGATTGGCCAGAAACATTGCCTCCACGAAGAAA
GATAGAAATGGAGAGATTGGTGGAAGAAATGCTGGCATCTGGAGTAATAAGGTCGAGTACGAGCCCTTATTCCAATTACATGGCTCTAAATAATGTAACCGTACCTGATA
AGTTTCCAATCCCAGTGATTGAGGAATTGTTTGATGAGTTGAATGGAACTACAATTTTCACGAAGATTGATCTTCAATCAGGTAAGGGGTTGAAAGATCACTTGAATCAT
TTGAGAGCAATATTGGAAGTACTGAGGAAGAATGAACTATATGCAAATAAGAAGAAATGTAGCTTTGCTCAGTCTCGAGTAGATTACTTGGGACATATTATTTCGGGAGA
TGGAGTTGAGGTGGATCCTGAAAAGATCAGAGCTATAAAGGAGTGGCCAATTCCTGCTAACGTGAGGGAAGTAAAAGGATTTCTTGGGCTGACTGGATACTATCGCAAGT
TTGTCCATAATTATGGGACAATTGCAGCTCCTTTGACGCAGTTATTGAAGATAGGAGGGTTTAAGTGGACAGAAGAGGTTCAAGAAGCGTTCAATAGGCTGAAACAAGCC
ATGATGACGCTTCCGGTATTGGCTCTGCCTGATTTCAATGTGTCTTTTGAAATAGAAATTGATGCTTCTGGATACGGATTGGGGGCTGGAGCTAATGGCAGTAGTAATGG
CAGGGCAAAGATGGCGTCCATACTTGTTGGGGAAGAGAGAGTGATACAACCTCAGTATCAAAAATGGATATCCAAACTTCTTGGATACTCCTTCGAGGTGGTGTATAAAT
CGGGATTGGAAAATAAGGCCGCAAATGCATTGTCTAGAGTACCAGCAACAGTGCATTTGAATCAGTTAACAGCTCCCAAGATGATCGATATCGAAATAATCAAAGCAGAA
GTAGAGCGAGATGAGAAGTTGAAGAAGGTTATGCAGAAATTAAGTGAGACAGAAGAGTTGGAGGAGGGAAAATATTTGATTAAACAAGGTATTCTGATGTATAAAGGTCG
AATGGAGTTATCTAAAACATCTAGATTAATTCCAACCATCTTACACACTTATCATGATTCGGTTTTCGGAGGTCACTCGGGGTTCTTGCGAACATATAAGAGGATGGCTG
CAGAGTTACATTGGGAAGGAATGAAGAAGGAAGTGAAGAAATATTGCGAGGAGTGCATGATTTGTCAAAGAAATAAGACTTTGGCACTTTCACCAGCTAGACTGTTAACA
CCTTTAGAGGTGCCGAATCGAGTGTGGGAGGATATTTCAATGGATTTTGTGGAAGGGTTACCTAAAGCAACAGGATACGAAGTATTATTTGTAGTGGTTGATCGATTCAG
TAAATATGGACATTTCATCCCTATGAAACATCCATATACAACAAAGATGGTATCTGAAGTATTTGTGAAGGAGGTCCTTCGGTTACATGGATATCCAAAATCCATTGAAA
ATCCATTGTCTCAAATAGAGACAAGACGTTCTTGCGTTGCTTTTGTGGGGAAAAGCCAAAATAATGGGTTAAATGGATTCCTTGGGCGGAATACTGGAGTCACACCGTTT
CAGGCTGTATATGGGCGTTTGCCACCTCCTTTGGTATACTATGAGGAGAGGGACACTACTAACTCCACATTAGATGAACAACAGAAGGAAAGAGATGTGGTGTTAGGAGC
TCTCGGGAACATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTCGGCCGCCCAAGAGAGATCAACGATGAATGAAAAGATTACTGAGTTATCGATGCGGAGTACTTTGATGAAGAATAACGACGAAGGCGAAGGATCTTTCGTTAG
CGAAAATGAACCGAAGAATGAGGAGAAGAAAGCAATTGAAGATGCTAACTGTGATCGCAACAAGTTCAAGAAAGTGGAGATGCCAATCTTCAATGGAGACGATTCAGATT
CGTGGTTATTTCGCGCCGAAAGGTATTTTCAAATCCATAAGCTTACTGACTTCGAAAAAATGATTGTAGCAACGATTAGCTTCGAAGGACCCGCGCTGAATTGGTATCGG
TCTCAAGAAGAGCGAGATAAATTCATTGATTGGGCAAACTTAAAGGAAAGAGTGTTAGTCTGGTTCCGATCAACGAGGGAAGGCTCTTTATACGGCCGATTCTTGCGCAT
ACAACAGATAACCACAATGGAGGAATACCGAAATTTGTTTGATAAATTAGTAGTGCCTCTGTCGGATCTGCCCGAGAAAGTGGTAGAAGAGACTTTCATGTCCGGATTGA
AGCCATGGATACATGCTGAGATGGATTCTAAAGTGGAAAACATAGAAGATATTAGAAGAGAATCAAATCTTCCTGGTTATTCGGAAGGGAAATTGAGTAATGCGAGAAGT
GGAAACAAAGCTAGTACGATTGGAAATTCTGGAGAAAATAAGGGAAATTGGAACTGGCCGATGAGAACTATCACGTTGAGAGGAACAACTGGTGAAGAAGTACGAAAAAC
AGGTCCAACAAAAAGGTTGTCTGATGCCGAATTTCAATCGCGAAAAGAGAAGGGACTATTGATGGGGGAGAATGAGGAATATGAAATTATTGAAGAAGAAGAAGCAAAAC
AGAAAGAATTGAATGCGGTTGTAGTTGTCCAAGAAGAACAGACTGTGGTGGAGTTGTCGACAAATTCAGTAGTAGGATTATCGAATTCGGGAACTATGAAGGTGAAGGGA
AAATTACTAGGAAGGGAGATCATTGTTCTAATAGATTGCGGAGCCACACATAACTTCATTTCAGAAAGCTTAGTTGAGGAGTTGAAGATAAACACGAAGAGTACTACAAA
TTATGGAGTCATATTAGGCTCTGAAACTGCTATAAAAGGGAAGGGAGTTTGTGAAGCCATTGAGATGGTGTTGGGGGAATGGAGAGTGGTTGATGACTTCTTGCCATTGG
AGTTAGGAGGGATGAATGCAATATTGGGAATGCAATGGTTATATTCTCTTGGTATAACTGAGGGATTTTTAGCAGAATGTAGAGCGTTGGAAAGGAGGGAGTCTTTAGAA
GAGGAGGATTCGTTTGATGAAGTTTTGACTGTAGAAGAGTTAGTGACAGTGGTATTGAAAAGTTTCGAGGATGTCTTTGATTGGCCAGAAACATTGCCTCCACGAAGAAA
GATAGAAATGGAGAGATTGGTGGAAGAAATGCTGGCATCTGGAGTAATAAGGTCGAGTACGAGCCCTTATTCCAATTACATGGCTCTAAATAATGTAACCGTACCTGATA
AGTTTCCAATCCCAGTGATTGAGGAATTGTTTGATGAGTTGAATGGAACTACAATTTTCACGAAGATTGATCTTCAATCAGGTAAGGGGTTGAAAGATCACTTGAATCAT
TTGAGAGCAATATTGGAAGTACTGAGGAAGAATGAACTATATGCAAATAAGAAGAAATGTAGCTTTGCTCAGTCTCGAGTAGATTACTTGGGACATATTATTTCGGGAGA
TGGAGTTGAGGTGGATCCTGAAAAGATCAGAGCTATAAAGGAGTGGCCAATTCCTGCTAACGTGAGGGAAGTAAAAGGATTTCTTGGGCTGACTGGATACTATCGCAAGT
TTGTCCATAATTATGGGACAATTGCAGCTCCTTTGACGCAGTTATTGAAGATAGGAGGGTTTAAGTGGACAGAAGAGGTTCAAGAAGCGTTCAATAGGCTGAAACAAGCC
ATGATGACGCTTCCGGTATTGGCTCTGCCTGATTTCAATGTGTCTTTTGAAATAGAAATTGATGCTTCTGGATACGGATTGGGGGCTGGAGCTAATGGCAGTAGTAATGG
CAGGGCAAAGATGGCGTCCATACTTGTTGGGGAAGAGAGAGTGATACAACCTCAGTATCAAAAATGGATATCCAAACTTCTTGGATACTCCTTCGAGGTGGTGTATAAAT
CGGGATTGGAAAATAAGGCCGCAAATGCATTGTCTAGAGTACCAGCAACAGTGCATTTGAATCAGTTAACAGCTCCCAAGATGATCGATATCGAAATAATCAAAGCAGAA
GTAGAGCGAGATGAGAAGTTGAAGAAGGTTATGCAGAAATTAAGTGAGACAGAAGAGTTGGAGGAGGGAAAATATTTGATTAAACAAGGTATTCTGATGTATAAAGGTCG
AATGGAGTTATCTAAAACATCTAGATTAATTCCAACCATCTTACACACTTATCATGATTCGGTTTTCGGAGGTCACTCGGGGTTCTTGCGAACATATAAGAGGATGGCTG
CAGAGTTACATTGGGAAGGAATGAAGAAGGAAGTGAAGAAATATTGCGAGGAGTGCATGATTTGTCAAAGAAATAAGACTTTGGCACTTTCACCAGCTAGACTGTTAACA
CCTTTAGAGGTGCCGAATCGAGTGTGGGAGGATATTTCAATGGATTTTGTGGAAGGGTTACCTAAAGCAACAGGATACGAAGTATTATTTGTAGTGGTTGATCGATTCAG
TAAATATGGACATTTCATCCCTATGAAACATCCATATACAACAAAGATGGTATCTGAAGTATTTGTGAAGGAGGTCCTTCGGTTACATGGATATCCAAAATCCATTGAAA
ATCCATTGTCTCAAATAGAGACAAGACGTTCTTGCGTTGCTTTTGTGGGGAAAAGCCAAAATAATGGGTTAAATGGATTCCTTGGGCGGAATACTGGAGTCACACCGTTT
CAGGCTGTATATGGGCGTTTGCCACCTCCTTTGGTATACTATGAGGAGAGGGACACTACTAACTCCACATTAGATGAACAACAGAAGGAAAGAGATGTGGTGTTAGGAGC
TCTCGGGAACATTTGA
Protein sequenceShow/hide protein sequence
MESAAQERSTMNEKITELSMRSTLMKNNDEGEGSFVSENEPKNEEKKAIEDANCDRNKFKKVEMPIFNGDDSDSWLFRAERYFQIHKLTDFEKMIVATISFEGPALNWYR
SQEERDKFIDWANLKERVLVWFRSTREGSLYGRFLRIQQITTMEEYRNLFDKLVVPLSDLPEKVVEETFMSGLKPWIHAEMDSKVENIEDIRRESNLPGYSEGKLSNARS
GNKASTIGNSGENKGNWNWPMRTITLRGTTGEEVRKTGPTKRLSDAEFQSRKEKGLLMGENEEYEIIEEEEAKQKELNAVVVVQEEQTVVELSTNSVVGLSNSGTMKVKG
KLLGREIIVLIDCGATHNFISESLVEELKINTKSTTNYGVILGSETAIKGKGVCEAIEMVLGEWRVVDDFLPLELGGMNAILGMQWLYSLGITEGFLAECRALERRESLE
EEDSFDEVLTVEELVTVVLKSFEDVFDWPETLPPRRKIEMERLVEEMLASGVIRSSTSPYSNYMALNNVTVPDKFPIPVIEELFDELNGTTIFTKIDLQSGKGLKDHLNH
LRAILEVLRKNELYANKKKCSFAQSRVDYLGHIISGDGVEVDPEKIRAIKEWPIPANVREVKGFLGLTGYYRKFVHNYGTIAAPLTQLLKIGGFKWTEEVQEAFNRLKQA
MMTLPVLALPDFNVSFEIEIDASGYGLGAGANGSSNGRAKMASILVGEERVIQPQYQKWISKLLGYSFEVVYKSGLENKAANALSRVPATVHLNQLTAPKMIDIEIIKAE
VERDEKLKKVMQKLSETEELEEGKYLIKQGILMYKGRMELSKTSRLIPTILHTYHDSVFGGHSGFLRTYKRMAAELHWEGMKKEVKKYCEECMICQRNKTLALSPARLLT
PLEVPNRVWEDISMDFVEGLPKATGYEVLFVVVDRFSKYGHFIPMKHPYTTKMVSEVFVKEVLRLHGYPKSIENPLSQIETRRSCVAFVGKSQNNGLNGFLGRNTGVTPF
QAVYGRLPPPLVYYEERDTTNSTLDEQQKERDVVLGALGNI