; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh10G002590 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh10G002590
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationCmo_Chr10:1145071..1149204
RNA-Seq ExpressionCmoCh10G002590
SyntenyCmoCh10G002590
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR029472 - Retrotransposon Copia-like, N-terminal
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAT07563.1 putative polyprotein [Oryza sativa Japonica Group]0.0e+0049.2Show/hide
Query:  MSSPSISQVISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKIII-NPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGIT
        +S+P     IS KLT++N+ LW+ QIL  LR   L   +  +  AP+  I  E  ++    KI+I NPE+  W+ QDQ VL  I SS++ EVL  + G  
Subjt:  MSSPSISQVISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKIII-NPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGIT

Query:  TAREAWITLERQFA------------STIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTSI--TTRTDVYTVSDVYAHMLSYEMRHL
        TA +AW  ++  F+             ++++Y  K++ L D +AA GK +++EEL+AY++ GL  ++D  V  +  T R    ++S VY+ +LSYE R +
Subjt:  TAREAWITLERQFA------------STIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTSI--TTRTDVYTVSDVYAHMLSYEMRHL

Query:  RKGTFEQLSSANNVNRISIRGGANGGRGSRGRSRQ--LNSGHGQSRRTVNNPGRQPSKTQSSSGIVCQICGKPNHDALQCWHRFDQAYQAENNLKQAALA
        R       +SAN  NR   RGG     G+RG  R      GHG+   +  + GR      + +  VCQ+C K  H A  CWHR+D +Y  +  L  A  A
Subjt:  RKGTFEQLSSANNVNRISIRGGANGGRGSRGRSRQ--LNSGHGQSRRTVNNPGRQPSKTQSSSGIVCQICGKPNHDALQCWHRFDQAYQAENNLKQAALA

Query:  TSGYTSDTNWYVDTGATDHITNDLERLTTRERYTGTDQIQVANGAGLSISHIGNSLISGSS--LVLKHILYVPKINKHLISVQRLASDNNAVVEFHPNYF
        T  Y  DTNWYVDTGATDHIT  L++LTT+ERY GTDQI  A+G G SI H+G++++   S  L LK++L+VP+  K+L+SV +L +DN A +E H  YF
Subjt:  TSGYTSDTNWYVDTGATDHITNDLERLTTRERYTGTDQIQVANGAGLSISHIGNSLISGSS--LVLKHILYVPKINKHLISVQRLASDNNAVVEFHPNYF

Query:  LVKDRVTKKLLLHGRCKNGLYVLPHNFS-QALLTAKLSKEQWHRRLGHPASPITIRILQDNNLAIDTNIPSSSICNACQLGKAHQLPFGSSQHVSTAPLQ
        L+KD+ T++ +L G C+ GLY LP   S +    A  S  +WH RLGHP+ PI +RIL  N L   +N  + S+C+ACQ  K HQLPF  S  VS  PL+
Subjt:  LVKDRVTKKLLLHGRCKNGLYVLPHNFS-QALLTAKLSKEQWHRRLGHPASPITIRILQDNNLAIDTNIPSSSICNACQLGKAHQLPFGSSQHVSTAPLQ

Query:  LIHTDVWGPSIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDWGGEYHRLHNYFKSTGIEHHISCPHTHQQNGLV
        LIH+DVWGP+  SV   KYYVSF+DD+S++VWIYFL+ KS+V   F +FQK VE   + KI S+Q+DWGGEY +LH +F   GI HH+SCPHTHQQNG V
Subjt:  LIHTDVWGPSIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDWGGEYHRLHNYFKSTGIEHHISCPHTHQQNGLV

Query:  ERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKG
        ERKHRHIVE GL+LLA A+MPL +WDEAF  A +LINR+PS+ I+ DTPL +LF ++PDY  LR FGCACWPNLRPYN  KL FR+ +C+FLG+S+ HKG
Subjt:  ERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKG

Query:  YKCLNRSTGRIYISRDVVFDENIFPFEESKPPNKTTNPHHPVLLPALA---------------KLANFYTENALTDIEPVVSNSHMND----------GQ
        +KCL+ STGR+Y+SRDV FDE +FPF    P           LLP                   L N    NA TDI    S  ++N           G 
Subjt:  YKCLNRSTGRIYISRDVVFDENIFPFEESKPPNKTTNPHHPVLLPALA---------------KLANFYTENALTDIEPVVSNSHMND----------GQ

Query:  TDNIASDNL---------------------------SGVSLSSADNTRSSEEIA---EYEAESSSINAQNQTHEHVSDQPTEAASQHPM---RTRLRNNI
          ++A ++                            SG+  + A +  +S   A      A +SS     Q+H+  S  P+E+A    +   +TRL++ I
Subjt:  TDNIASDNL---------------------------SGVSLSSADNTRSSEEIA---EYEAESSSINAQNQTHEHVSDQPTEAASQHPM---RTRLRNNI

Query:  VQAKQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKAR
         + K +TDGT+RYS                  ++ EP+ L EA+    W+ AM+ E  AL +N TW LVP K G N+ID KWVYKVKRKADGS++R KAR
Subjt:  VQAKQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKAR

Query:  LVAKGFKQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPKAWHSRLTG
        +VAKGFKQR+G+DY DTF+PV+K +TIR ILS+A+++GW +RQ+D+QNAFLHG+L+E+V+MRQPPG++   +   Y+CKL KALYGLKQAP+AW+SRL+ 
Subjt:  LVAKGFKQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPKAWHSRLTG

Query:  KLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCK
        KL ELGFK+S +D+SLF     ++ ++ML+YVDDII+ SSS  AT  L++ L  +FA+KDLG L YFLGIEVK+  +GI+L+Q +YA+D+LKRVNM  CK
Subjt:  KLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCK

Query:  PMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKS-TMMLSGFSDADW
         ++TP+  +EKL   +G P   E+  +YRS VGALQYLT+TRPDL+F+VNKVCQYLH PT  HW A KRILRY+K T+ LG+KI KS ++++S FSDADW
Subjt:  PMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKS-TMMLSGFSDADW

Query:  AGCPDDRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHIEVDFHF
        AGC DDR ST GFAVF+G NL+SWS+RKQATVSRSSTEAEYKA+AN+TAE++WI++LL ELG+   K  ++WCDN+GA Y+T+NPVFHARTKHIEVD+HF
Subjt:  AGCPDDRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHIEVDFHF

Query:  VREQVARKAMEVRFISSSDQVADILTKPLSKTPFTTHCNNLNM
        VRE+VARK ++V +IS+ DQVAD  TK LS        NNLN+
Subjt:  VREQVARKAMEVRFISSSDQVADILTKPLSKTPFTTHCNNLNM

KAG8087752.1 hypothetical protein GUJ93_ZPchr0010g8288 [Zizania palustris]0.0e+0051.33Show/hide
Query:  ILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKIIINPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGITTAREAWITLERQFAS-----------
        ++P L S NL+G+ DGS  AP + I+V P+ E    +++ NP F  W  QDQ+V+S I SS++E+VL  ++   T+ + W  LER FAS           
Subjt:  ILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKIIINPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGITTAREAWITLERQFAS-----------

Query:  ----------TIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTSITTRTDVYTVSDVYAHMLSYEM---RHLRKGTFEQ-LSSANNVN
                   + DYF K+K++ DTL+A G  + +E++++Y+L GLGP Y+ LV S+   ++  ++ D+Y++++S+E    ++  +G  +  +SSAN V+
Subjt:  ----------TIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTSITTRTDVYTVSDVYAHMLSYEM---RHLRKGTFEQ-LSSANNVN

Query:  R-ISIRGGANGGRG----SRGRSRQLNSGHGQSRRTVN-NPGRQPSKTQSSSGI---------VCQICGKPNHDALQCWHRFDQAYQAENNLKQAALATS
        R     GG  GGRG     RGR R    G  Q     N N G+Q  +    +            CQIC K  HDAL CW+R+ + Y  E++   A     
Subjt:  R-ISIRGGANGGRG----SRGRSRQLNSGHGQSRRTVN-NPGRQPSKTQSSSGI---------VCQICGKPNHDALQCWHRFDQAYQAENNLKQAALATS

Query:  GYTSDTNWYVDTGATDHITNDLERLTTRERYTGTDQIQVANGAGLSISHIGNSLISGSS--LVLKHILYVPKINKHLISVQRLASDNNAVVEFHPNYFLV
         Y  DTNWY+D+GATDHIT+D++RL  R  Y G +Q+QVANGAGL I H G++ ISGSS  L L +IL+VP I+KHL+SV +LA DNNA +EFHP  F V
Subjt:  GYTSDTNWYVDTGATDHITNDLERLTTRERYTGTDQIQVANGAGLSISHIGNSLISGSS--LVLKHILYVPKINKHLISVQRLASDNNAVVEFHPNYFLV

Query:  KDRVTKKLLLHGRCKNGLYVLPHNFS-----QALLTAKLSKEQWHRRLGHPASPITIRILQDNNLAIDTNIPSSSICNACQLGKAHQLPFGSSQHVSTAP
        KD+ T+++L HG CK GLY L  + S       L  +K+ +  WHRRLGHP+S I   +L+  +L +      S +C+ACQ  K+H+LPF SS  VST+P
Subjt:  KDRVTKKLLLHGRCKNGLYVLPHNFS-----QALLTAKLSKEQWHRRLGHPASPITIRILQDNNLAIDTNIPSSSICNACQLGKAHQLPFGSSQHVSTAP

Query:  LQLIHTDVWGPSIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDWGGEYHRLHNYFKSTGIEHHISCPHTHQQNG
        L+L+H+DVWGP+I+SV+  KYYV F+DDFSR+ W+Y L+ KSDVE  F+QF+   E  LN KI++VQSD GGEY +LH+ F S GI H +SCPHT QQNG
Subjt:  LQLIHTDVWGPSIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDWGGEYHRLHNYFKSTGIEHHISCPHTHQQNG

Query:  LVERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSSSH
        + ERKHRH+VETGLALLAQ+++PL +WDEAF +AC+LINRMPSR     +P+ +LF K  DYS+LRVFGCACWP LRPY ++K+ FR+TRC+FLGYS  H
Subjt:  LVERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSSSH

Query:  KGYKCLNRSTGRIYISRDVVFDENIFPFEESKPPNKTTNPHHPVLLPALAKLANFYTENALTDIEPVVSNSHMNDGQTDNIASDNLSGVSLSSADNTRSS
        KGYKCL+  TGRIYISRDV FDE IFPF ++   + T + +      +  +L++     + + +EPV+   H+   Q  N    + + +   + D+T ++
Subjt:  KGYKCLNRSTGRIYISRDVVFDENIFPFEESKPPNKTTNPHHPVLLPALAKLANFYTENALTDIEPVVSNSHMNDGQTDNIASDNLSGVSLSSADNTRSS

Query:  EEIAEYEAESSSINAQNQTHEHVSDQPTEAASQHPMR--TRLRNNIVQAKQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMN
        +  +    E+ S    +     +S +   A  Q   R  TRL + I + K+FTDGT+RY    R F +++++         EP +  EA + P WR AM 
Subjt:  EEIAEYEAESSSINAQNQTHEHVSDQPTEAASQHPMR--TRLRNNIVQAKQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMN

Query:  DELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGI
         E  AL RN TW L+P +   N++  +W++KVK KADG+++R KARLVAKGF QR G+DY DTFSPV+KP+T+R++LSLAV++GW++RQ+DIQNAFLHG 
Subjt:  DELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGI

Query:  LKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKI
        L EEVYM+QPPGF++S  P+ YICKL KALYGLKQAP+AW+ +L+ KL  LGF AS +DSSLFIL    ++IYML+YVDDIII SS+   +++L+Q+L  
Subjt:  LKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKI

Query:  DFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQ
        +FAVKDLG L YFLGIE     DG++L+Q +Y  DLL+R NM+ CKP  TPM S EKL RE G  L  +E F YRSTVGALQYLT+TRPD++FAVNKVCQ
Subjt:  DFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQ

Query:  YLHTPTDAHWGAVKRILRYVKGTLALGVKIQKS-TMMLSGFSDADWAGCPDDRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWI
        +LH PTDAHW AVKRILRY+KGT  +G+KI++S +  LS FSDADWAGCPDDRRST GFA+F G NL+SWSSRKQ+T+SRSSTEAEYKA+AN TAE+IW+
Subjt:  YLHTPTDAHWGAVKRILRYVKGTLALGVKIQKS-TMMLSGFSDADWAGCPDDRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWI

Query:  KSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHIEVDFHFVREQVARKAMEVRFISSSDQVADILTKPLSKTPFTTHCNNLNM
        +SLL+EL V     P+LWCDNLGATYLT+NPVFHARTKHIE+D HFVRE+V R  +EV+FISS+DQVADI TKPL +  F     +LN+
Subjt:  KSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHIEVDFHFVREQVARKAMEVRFISSSDQVADILTKPLSKTPFTTHCNNLNM

pir|T02087| gag/pol polyprotein - maize retrotransposon Hopscotch [Zea mays]0.0e+0050.1Show/hide
Query:  ISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKIIINPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGITTAREAWITLE
        +S KLT+ NYLLW  Q+LP +R+  L   + G    P +TI    S+ +     + NP +  W  +DQ VL  + SS++ EVLS++V  +T+   W TL 
Subjt:  ISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKIIINPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGITTAREAWITLE

Query:  RQF---------------------ASTIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTSITTRTDVYTVSDVYAHMLSYEMR-HLRK
          +                     AS++A+YF K++   D L A GK ++DEE ++++L GL  D++PLVT++  R+D  T  D+Y  +LSYE R HL+ 
Subjt:  RQF---------------------ASTIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTSITTRTDVYTVSDVYAHMLSYEMR-HLRK

Query:  GTFEQLSSANNVNRISIRGGANGGRGSRGRSRQLNSGHGQSRRTVNNPGR-----QPSKTQSSSGIVCQICGKPNHDALQCWHRFDQAYQAENNLKQAAL
        G+   + S+ N  R   RG + G  G RG SR    G G SR    + GR       +   +SS   CQ+C +  H AL CW+RFD+ Y  +     +A 
Subjt:  GTFEQLSSANNVNRISIRGGANGGRGSRGRSRQLNSGHGQSRRTVNNPGR-----QPSKTQSSSGIVCQICGKPNHDALQCWHRFDQAYQAENNLKQAAL

Query:  ATSGYTSDTNWYVDTGATDHITNDLERLTTRERYTGTDQIQVANGAGLSISHIGNSLI--SGSSLVLKHILYVPKINKHLISVQRLASDNNAVVEFHPNY
          +G  S+  WY DTGATDHIT DL+RLT  ++YTGTDQI  ANG G++IS+IGN+++  S  SL L+ +L+VP  +K+LISV RL +DN+  +EFH ++
Subjt:  ATSGYTSDTNWYVDTGATDHITNDLERLTTRERYTGTDQIQVANGAGLSISHIGNSLI--SGSSLVLKHILYVPKINKHLISVQRLASDNNAVVEFHPNY

Query:  FLVKDRVTKKLLLHGRCKNGLYVLP--------HNFSQALLTAKLSKEQWHRRLGHPASPITIRILQDNNLAIDTNIPSSSICNACQLGKAHQLPFGSSQ
        FL+KDR TK +LLHG+C++GLY LP        HNFS    + ++  E WH+RLGHP+  I  R++ +NNL   +N  ++S+C+AC   KAHQLP+  S 
Subjt:  FLVKDRVTKKLLLHGRCKNGLYVLP--------HNFSQALLTAKLSKEQWHRRLGHPASPITIRILQDNNLAIDTNIPSSSICNACQLGKAHQLPFGSSQ

Query:  HVSTAPLQLIHTDVWGPSIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDWGGEYHRLHNYFKSTGIEHHISCPH
          S+APL LI +DV+GP+I S    KYYVSF+DD+S++ WIY LR KSDV   F +FQ  VE M   KI + QSDWGGEY +L+ +FK+ GI H +SCPH
Subjt:  HVSTAPLQLIHTDVWGPSIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDWGGEYHRLHNYFKSTGIEHHISCPH

Query:  THQQNGLVERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFGCACWPNLRPYNNKKLSFRTTRCIFL
        THQQNG  ERKHRHIVE GLALLAQ++MPL YWD AF  A +LINR PS+TI  DTPLHKL G +PDYS LR+FGCACWPNLRPYN  KL FR+TRC+FL
Subjt:  THQQNGLVERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFGCACWPNLRPYNNKKLSFRTTRCIFL

Query:  GYSSSHKGYKCLNRSTGRIYISRDVVFDENIFPFEESKPPNKTTNPHHPVLLPALAKLANFYTENA-----LTDIEPVVSNSHMNDGQTDNIASDNL---
        GYS+ HKG+KCL+ STGRIYISRDVVFDE++FPF               +LLP  +   N  T++A      +   P ++  H   G ++   S+N    
Subjt:  GYSSSHKGYKCLNRSTGRIYISRDVVFDENIFPFEESKPPNKTTNPHHPVLLPALAKLANFYTENA-----LTDIEPVVSNSHMNDGQTDNIASDNL---

Query:  ----------------------------SGVSLSS-----ADNTRSSEEIAE------------YEAESSSINAQNQTHEHVSDQPTEAASQHPMRTRLR
                                    +GVS ++     AD+  S   +A              +A  SS+      H+     P  AA+    RTRL+
Subjt:  ----------------------------SGVSLSS-----ADNTRSSEEIAE------------YEAESSSINAQNQTHEHVSDQPTEAASQHPMRTRLR

Query:  NNIVQAKQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERL
        + I + KQFTDGT+RY   + +               TEP ++ EA+  P+WR AM  E  AL++N TW LVPP    NLID KWV+KVK  ADGS++RL
Subjt:  NNIVQAKQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERL

Query:  KARLVAKGFKQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPKAWHSR
        KARLVAKGFKQ++G+DY DTFSPV+K STIR++LSLAV++ W++RQ+D+QNAFLHGIL+E VYM+QPPGF D+  P NY C L+K+LYGLKQ P+AW+SR
Subjt:  KARLVAKGFKQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPKAWHSR

Query:  LTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNME
        L+ KL  LGF  S AD SLFI       IY+L+YVDDIII  SS  A + ++ KLK DFA+KDLG L YFLGIEV +  DG++L Q +YA DLLKRV ME
Subjt:  LTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNME

Query:  KCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKS-TMMLSGFSD
         CKP+ TP+ ++EKL    G  LS EE  KYRS VGALQYLT+TRPDL++A+N+VCQ+LH PTD HW AVKRILR ++ T+ LG+ I+ S ++MLS FSD
Subjt:  KCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKS-TMMLSGFSD

Query:  ADWAGCPDDRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHIEVD
        ADWAGCPDDR+ST G+A+FLG NLISW+S+KQ+TVSRSSTEAEYKA+AN TAE+IW++SLL ELG+  +  PRLWCDNLGATYL+S P+F+ARTKHIEVD
Subjt:  ADWAGCPDDRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHIEVD

Query:  FHFVREQVARKAMEVRFISSSDQVADILTKPLS
        FHFVR++V  K +++R IS++DQVAD  TK L+
Subjt:  FHFVREQVARKAMEVRFISSSDQVADILTKPLS

QCC26836.1 Hopscotch gagpol polyprotein [Zea mays]0.0e+0050.1Show/hide
Query:  ISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKIIINPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGITTAREAWITLE
        +S KLT+ NYLLW  Q+LP +R+  L   + G    P +TI    S+ +     + NP +  W  +DQ VL  + SS++ EVLS++V  +T+   W TL 
Subjt:  ISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKIIINPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGITTAREAWITLE

Query:  RQF---------------------ASTIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTSITTRTDVYTVSDVYAHMLSYEMR-HLRK
          +                     AS++A+YF K++   D L A GK ++DEE ++++L GL  D++PLVT++  R+D  T  D+Y  +LSYE R HL+ 
Subjt:  RQF---------------------ASTIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTSITTRTDVYTVSDVYAHMLSYEMR-HLRK

Query:  GTFEQL-SSANNVNR-ISIRGGANGGRG-SRGRSRQLNSGHGQSRRTVNNPGR-----QPSKTQSSSGIVCQICGKPNHDALQCWHRFDQAYQAENNLKQ
        G+   + SSAN  +R   +  G +GGRG SRGR R    G G SR    + GR       +   +SS   CQ+C +  H AL CW+RFD+ Y  +     
Subjt:  GTFEQL-SSANNVNR-ISIRGGANGGRG-SRGRSRQLNSGHGQSRRTVNNPGR-----QPSKTQSSSGIVCQICGKPNHDALQCWHRFDQAYQAENNLKQ

Query:  AALATSGYTSDTNWYVDTGATDHITNDLERLTTRERYTGTDQIQVANGAGLSISHIGNSLI--SGSSLVLKHILYVPKINKHLISVQRLASDNNAVVEFH
        +A   +G  S+  WY DTGATDHIT DL+RLT  ++YTGTDQI  ANG G++IS+IGN+++  S  SL L+ +L+VP  +K+LISV RL +DN+  +EFH
Subjt:  AALATSGYTSDTNWYVDTGATDHITNDLERLTTRERYTGTDQIQVANGAGLSISHIGNSLI--SGSSLVLKHILYVPKINKHLISVQRLASDNNAVVEFH

Query:  PNYFLVKDRVTKKLLLHGRCKNGLYVLP--------HNFSQALLTAKLSKEQWHRRLGHPASPITIRILQDNNLAIDTNIPSSSICNACQLGKAHQLPFG
         ++FL+KDR TK +LLHG+C++GLY LP        HNFS    + ++  E WH+RLGHP+  I  R++ +NNL   +N  ++S+C+AC   KAHQLP+ 
Subjt:  PNYFLVKDRVTKKLLLHGRCKNGLYVLP--------HNFSQALLTAKLSKEQWHRRLGHPASPITIRILQDNNLAIDTNIPSSSICNACQLGKAHQLPFG

Query:  SSQHVSTAPLQLIHTDVWGPSIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDWGGEYHRLHNYFKSTGIEHHIS
         S   S+APL LI +DV+GP+I S    KYYVSF+DD+S++ WIY LR KSDV   F +FQ  VE M   KI + QSDWGGEY +L+ +FK+ GI H +S
Subjt:  SSQHVSTAPLQLIHTDVWGPSIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDWGGEYHRLHNYFKSTGIEHHIS

Query:  CPHTHQQNGLVERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFGCACWPNLRPYNNKKLSFRTTRC
        CPHTHQQNG  ERKHRHIVE GLALLAQ++MPL YWD AF  A +LINR PS+TI  DTPLHKL G +PDYS LR+FGCACWPNLRPYN  KL FR+TRC
Subjt:  CPHTHQQNGLVERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFGCACWPNLRPYNNKKLSFRTTRC

Query:  IFLGYSSSHKGYKCLNRSTGRIYISRDVVFDENIFPFEESKPPNKTTNPHHPVLLPALAKLANFYTENA-----LTDIEPVVSNSHMNDGQTDNIASDNL
        +FLGYS+ HKG+KCL+ STGRIYISRDVVFDE++FPF               +LLP  +   N  T++A      +   P ++  H   G ++   S+N 
Subjt:  IFLGYSSSHKGYKCLNRSTGRIYISRDVVFDENIFPFEESKPPNKTTNPHHPVLLPALAKLANFYTENA-----LTDIEPVVSNSHMNDGQTDNIASDNL

Query:  -------------------------------SGVSLSS-----ADNTRSSEEIAE------------YEAESSSINAQNQTHEHVSDQPTEAASQHPMRT
                                       +GVS ++     AD+  S   +A              +A  SS+      H+     P  AA+    RT
Subjt:  -------------------------------SGVSLSS-----ADNTRSSEEIAE------------YEAESSSINAQNQTHEHVSDQPTEAASQHPMRT

Query:  RLRNNIVQAKQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSV
        RL++ I + KQFTDGT+RY   + +               TEP ++ EA+  P+WR AM  E  AL++N TW LVPP    NLID KWV+KVK  ADGS+
Subjt:  RLRNNIVQAKQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSV

Query:  ERLKARLVAKGFKQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPKAW
        +RLKARLVAKGFKQ++G+DY DTFSPV+K STIR++LSLAV++ W++RQ+D+QNAFLHGIL+E VYM+QPPGF D+  P NY C L+K+LYGLKQAP+AW
Subjt:  ERLKARLVAKGFKQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPKAW

Query:  HSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRV
        +SRL+ KL  LGF  S AD SLFI       IY+L+YVDDIII  SS  A + ++ KLK DFA+KDLG L YFLGIEV +  DG++L Q +YA DLLKRV
Subjt:  HSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRV

Query:  NMEKCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKS-TMMLSG
         ME CKP+ TP+ ++EKL    G  LS EE  KYRS VGALQYLT+TRPDL++A+N+VCQ+LH PTD HW AVKRILR ++ T+ LG+ I+ S ++MLS 
Subjt:  NMEKCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKS-TMMLSG

Query:  FSDADWAGCPDDRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHI
        FSDADWAGCPDDR+ST G+A+FLG NLISW+S+KQ+TVSRSSTEAEYKA+AN TAE+IW++SLL ELG+  +  PRLWCDNLGATYL+S P+F+ARTKHI
Subjt:  FSDADWAGCPDDRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHI

Query:  EVDFHFVREQVARKAMEVRFISSSDQVADILTKPLSKTPFTTHCNNLNM
        EVDFHFVR++V  K +++R IS++DQVAD  TK L+         NLN+
Subjt:  EVDFHFVREQVARKAMEVRFISSSDQVADILTKPLSKTPFTTHCNNLNM

XP_035817309.1 uncharacterized protein LOC100279596 isoform X1 [Zea mays]0.0e+0047.96Show/hide
Query:  MSIPNNTM------SSPSIS-----QVISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKIIINPEFTVWYPQDQLVLSL
        M++ N+T       SSPS+S       I+V+LT+EN+ LW  Q  P LR+  L G+VDGS+ APS+ I      E    + + NP + +WY QDQLVLS 
Subjt:  MSIPNNTM------SSPSIS-----QVISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKIIINPEFTVWYPQDQLVLSL

Query:  INSSVTEEVLSTMVGITTAREAWITLERQFAS----------------------TIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTS
        + SS++E++L  M    TA + W  L    +S                      T + YF+++K   DT+A++G  + DEE++ YML GLG +++PLV +
Subjt:  INSSVTEEVLSTMVGITTAREAWITLERQFAS----------------------TIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTS

Query:  ITTRTDVYTVSDVYAHMLSYEMRHLRKG-TFEQLSSANNVNRISIRGGANGGRGSRGRSRQLNSGHGQSRRTVNNPGRQPSKTQSSSGIVCQICGKPNHD
        IT R D  +++  ++ +LS E+R  R   T E LSSAN   R     G  G RG   R R    G G   R  ++ G +P+         CQ+CGK  HD
Subjt:  ITTRTDVYTVSDVYAHMLSYEMRHLRKG-TFEQLSSANNVNRISIRGGANGGRGSRGRSRQLNSGHGQSRRTVNNPGRQPSKTQSSSGIVCQICGKPNHD

Query:  ALQCWHRFDQAYQAENNLKQAALA--TSGYTSDTNWYVDTGATDHITNDLERLTTRERYTGTDQIQVANGAGLSISHIGNSLISGSS--LVLKHILYVPK
        AL+C+ RF+ A+Q E++  ++A +  T  Y  DT+W +D+GA DH+T+DL+RLTT ER++G D +QVANG+GLSISH G SL+ GSS  L L+++L+VP 
Subjt:  ALQCWHRFDQAYQAENNLKQAALA--TSGYTSDTNWYVDTGATDHITNDLERLTTRERYTGTDQIQVANGAGLSISHIGNSLISGSS--LVLKHILYVPK

Query:  INKHLISVQRLASDNNAVVEFHPNYFLVKDRVTKKLLLHGRCKNGLYVL---PHNFSQ----ALLTAKLSKEQWHRRLGHPASPITIRILQDNNLAIDTN
        ++ HL+S  RLASDNN  +E HPN+F VKDRVT+K LL G+  NGLY +   PH+ S      L     S + WH+RLGHP+  I   +L+ N LA   +
Subjt:  INKHLISVQRLASDNNAVVEFHPNYFLVKDRVTKKLLLHGRCKNGLYVL---PHNFSQ----ALLTAKLSKEQWHRRLGHPASPITIRILQDNNLAIDTN

Query:  IPSSSICNACQLGKAHQLPFGSSQHVSTAPLQLIHTDVWGPSIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDW
          SSS+C++CQ  K HQLPF  S HVS++PL+L+H+DVWGP+I SV   KYYVSF+DD+SRY WIY L+ KSDVE  F  FQKHVE +LN KIR  QSDW
Subjt:  IPSSSICNACQLGKAHQLPFGSSQHVSTAPLQLIHTDVWGPSIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDW

Query:  GGEYHRLHNYFKSTGIEHHISCPHTHQQNGLVERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFGC
        GGEY RL  +  STGI+H ++CPHT QQNG+ ERK+RHIVETGLALLA +++P+ +WDEAF TAC+LINRMP+RT+   TPL  +F + P+YS+LRVFGC
Subjt:  GGEYHRLHNYFKSTGIEHHISCPHTHQQNGLVERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFGC

Query:  ACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFDENIFPFEESKP---PNKTTNPHHPVLLPALAKLANFYTENALTDIEPV
        ACWPNLRPYNN KLSFR+ +C+FLGYSS HKGYKCL+RS GRIYISRDVVFDE++FPF  S P   P+   + H     P L   +N   E +   +   
Subjt:  ACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFDENIFPFEESKP---PNKTTNPHHPVLLPALAKLANFYTENALTDIEPV

Query:  VSNSHMNDGQTDNIASDNLSGVSLS--SADNTRSSEEIAEYEAESSSI-NAQNQTHEH---------VSDQPTEAASQHP--------------------
         S+   N     +   D +     +   +D   +S   A     SS +  A   +H H          S  P +A S  P                    
Subjt:  VSNSHMNDGQTDNIASDNLSGVSLS--SADNTRSSEEIAEYEAESSSI-NAQNQTHEH---------VSDQPTEAASQHP--------------------

Query:  -------------------------------------------------------------MRTRLRNNIVQAKQFTDGTIRYSETSRKFASAVTITTPI
                                                                     + TRLRNNI +    TDGTIRY+ +SR F  +       
Subjt:  -------------------------------------------------------------MRTRLRNNIVQAKQFTDGTIRYSETSRKFASAVTITTPI

Query:  IETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDTFSPVIKPSTIRVI
              P N   A+ + +WR AM+ E +AL +N TW LVP   G N+I  KWV+++K  ADGSV++ KARLVA+GF Q+ G+DY +TFSPV+K ST+R++
Subjt:  IETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDTFSPVIKPSTIRVI

Query:  LSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLI
        LS+A+++ W++RQ+DI NAFLHG+L E+VYM QPPGFQD +KP  ++CKL KA+YGLKQ+P+AW+SRL+ +L +LGF  SVAD+SLF      + +Y+L+
Subjt:  LSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLI

Query:  YVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRS
        YVDDIIIVSSS   T+ L+Q+L + F VKDLG L +FLGIEV     G+ L+Q++YA D+L+R +ME CK + TP+   +KL R  G  L  ++ F YRS
Subjt:  YVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRS

Query:  TVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKS-TMMLSGFSDADWAGCPDDRRSTSGFAVFLGANLISWSSRKQA
         VGALQYLT+TRPDL+FAVNKVCQ+L  P D HW AVKRILR+VKGTL  G+ ++K+ + +LS F+DADWAGC DDRRST GFA+F G+NLISWS+RKQ 
Subjt:  TVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKS-TMMLSGFSDADWAGCPDDRRSTSGFAVFLGANLISWSSRKQA

Query:  TVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHIEVDFHFVREQVARKAMEVRFISSSDQVADILTKPLS
        TVSRSSTEAEYKA+AN TAE IWI+SLLKEL + Q + P LWCDNLGATYL++NPVFH R KH+EVDFHFVRE+VA  A++VR ISS DQ+ADI TKP +
Subjt:  TVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHIEVDFHFVREQVARKAMEVRFISSSDQVADILTKPLS

Query:  KTPFTTHCNNLNMYKT
        K        NLN+  T
Subjt:  KTPFTTHCNNLNMYKT

TrEMBL top hitse value%identityAlignment
A0A3L6Q0W7 Putative polyprotein0.0e+0048.02Show/hide
Query:  MSIPNNTMSSPSISQVISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKIIINPEFTVWYPQDQLVLSLINSSVTEEVLS
        M+  ++T ++P     ++ KL++ N+ LW  Q+L  +R     G ++G   AP   I  + ++  G      NP F  WY +DQ +L  + SS  ++V +
Subjt:  MSIPNNTMSSPSISQVISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKIIINPEFTVWYPQDQLVLSLINSSVTEEVLS

Query:  TMVGITTAREAWITLERQFAS---------------------TIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTSITTRTDVYTVSD
         +    TA +AW  +E+ F++                     +I DY  K+K   D LAA+GK ++DEEL+A++  GL  DY+P+VTS+TTR D  ++ D
Subjt:  TMVGITTAREAWITLERQFAS---------------------TIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTSITTRTDVYTVSD

Query:  VYAHMLSYEMRHLRKGTFEQLSSANNVNRISIRG-GANGGRGSRGRSRQLNSGHGQSRRTVNNPGRQP---SKTQSSSGIVCQICGKPNHDALQCWHRFD
        +YA +LS+E R   +        AN       RG G   GRG RGR +    G+G  ++   N GRQP     + S    +CQIC K  H A +CWHRFD
Subjt:  VYAHMLSYEMRHLRKGTFEQLSSANNVNRISIRG-GANGGRGSRGRSRQLNSGHGQSRRTVNNPGRQP---SKTQSSSGIVCQICGKPNHDALQCWHRFD

Query:  QAYQAENNLKQAALATSGYTSDTNWYVDTGATDHITNDLERLTTRERYTGTDQIQVANGAGLSISHIGNSLI--SGSSLVLKHILYVPKINKHLISVQRL
        + Y  E     AA+A + Y  DTNWY DTGATDHIT+DL +LT RE+Y G DQI  A+G+G+ I ++G++ +     SL L ++L+VPK  K+L+SV RL
Subjt:  QAYQAENNLKQAALATSGYTSDTNWYVDTGATDHITNDLERLTTRERYTGTDQIQVANGAGLSISHIGNSLI--SGSSLVLKHILYVPKINKHLISVQRL

Query:  ASDNNAVVEFHPNYFLVKDRVTKKLLLHGRCKNGLYVLPHNFS--QALLTAKLSKEQWHRRLGHPASPITIRILQDNNLAIDTNIPSSSICNACQLGKAH
        A DN+A +EFHP++FL+KD+ T+  +L G C+ GLY LP + S  QA    K +  +WH RLGHP+ PI  +++  NNL+  +     S+C+ACQ  K+H
Subjt:  ASDNNAVVEFHPNYFLVKDRVTKKLLLHGRCKNGLYVLPHNFS--QALLTAKLSKEQWHRRLGHPASPITIRILQDNNLAIDTNIPSSSICNACQLGKAH

Query:  QLPFGSSQHVSTAPLQLIHTDVWGPSIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDWGGEYHRLHNYFKSTGI
        QLP+  S   ++ PL+L+H+DVWGP++ SV   +YYVSFVDDFSR+ WIYFL+ KS+V   F +FQK VE   + KI ++Q+DWGGEY +L+++F   GI
Subjt:  QLPFGSSQHVSTAPLQLIHTDVWGPSIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDWGGEYHRLHNYFKSTGI

Query:  EHHISCPHTHQQNGLVERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFGCACWPNLRPYNNKKLSF
         HH+SCPH HQQNG++ERKHRHIVE GL+LLA A+MPL +WDEAF  A FLINR+PS+ I   TP  +L  + P+Y  LR FGCACWPNLRPYN  KL  
Subjt:  EHHISCPHTHQQNGLVERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFGCACWPNLRPYNNKKLSF

Query:  RTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFDENIFPFEESK--------------PPN---------------KTTNPHHPVLLPALAKLANFY-
        R+ +C FLGYS+ HKGYKCL+  +GR+YISRDV+FDE +FPF                  PP                 +TNP +     A     +   
Subjt:  RTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFDENIFPFEESK--------------PPN---------------KTTNPHHPVLLPALAKLANFY-

Query:  --TENALTDIEPVVSNSHMNDGQTDNIASDNLSGVSLSS--ADNTRSSEEIAEYEAESSSINAQNQTHEHVSDQPTEAASQHP--MRTRLRNNIVQAKQF
          TE       PV   S +     D +      G + +S  A  T  ++      A +             +  P+  A++ P   RTRLR+ I + K +
Subjt:  --TENALTDIEPVVSNSHMNDGQTDNIASDNLSGVSLSS--ADNTRSSEEIAEYEAESSSINAQNQTHEHVSDQPTEAASQHP--MRTRLRNNIVQAKQF

Query:  TDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGF
        TDGT+RY                    + EP++L EA+    W+ AM+ E  AL  N TW LVPPK GIN+ID KWVYKVKRK+DGS++R KARLVAKGF
Subjt:  TDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGF

Query:  KQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPKAWHSRLTGKLIELG
        +QR+G+DY DTFSPV+KP+TIR IL +AV++GW++R++D+QNAFLHG L+E+VYM+QPPG+QD +K   YICKL KALYGLK+AP+AW+SRL+ KL +LG
Subjt:  KQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPKAWHSRLTGKLIELG

Query:  FKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCKPMSTPM
        FKAS AD+SLF      + IY+LIYVDDII+ SS+ +AT  L+Q L+ +FA+KDLG L +FLGIEVKK  +GI+L+Q +YA D+L+R +M +CKP+S+P+
Subjt:  FKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCKPMSTPM

Query:  GSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKS-TMMLSGFSDADWAGCPDD
         ++EKL   +G PL  ++   YRS VG LQYL +TRPD++FAVNKVCQYLH PT  HW  VKRILRY+K T+ +G+KI KS ++++S FSDADWAG  DD
Subjt:  GSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKS-TMMLSGFSDADWAGCPDD

Query:  RRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHIEVDFHFVREQVA
        RRST GFAVFLG+NLISWS+RKQ+TVSRSSTEAEYKA+AN TAE++WI++LL ELG+   K  +LWCDN+GA YL++NPVFHARTKHIEVD+HFVRE+VA
Subjt:  RRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHIEVDFHFVREQVA

Query:  RKAMEVRFISSSDQVADILTKPLSKTPFTTHCNNLNM
        R+ +++ +IS+ DQ+A+  TKPL+        NNLN+
Subjt:  RKAMEVRFISSSDQVADILTKPLSKTPFTTHCNNLNM

A0A4D6GKR5 Hopscotch gagpol polyprotein0.0e+0050.1Show/hide
Query:  ISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKIIINPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGITTAREAWITLE
        +S KLT+ NYLLW  Q+LP +R+  L   + G    P +TI    S+ +     + NP +  W  +DQ VL  + SS++ EVLS++V  +T+   W TL 
Subjt:  ISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKIIINPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGITTAREAWITLE

Query:  RQF---------------------ASTIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTSITTRTDVYTVSDVYAHMLSYEMR-HLRK
          +                     AS++A+YF K++   D L A GK ++DEE ++++L GL  D++PLVT++  R+D  T  D+Y  +LSYE R HL+ 
Subjt:  RQF---------------------ASTIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTSITTRTDVYTVSDVYAHMLSYEMR-HLRK

Query:  GTFEQL-SSANNVNR-ISIRGGANGGRG-SRGRSRQLNSGHGQSRRTVNNPGR-----QPSKTQSSSGIVCQICGKPNHDALQCWHRFDQAYQAENNLKQ
        G+   + SSAN  +R   +  G +GGRG SRGR R    G G SR    + GR       +   +SS   CQ+C +  H AL CW+RFD+ Y  +     
Subjt:  GTFEQL-SSANNVNR-ISIRGGANGGRG-SRGRSRQLNSGHGQSRRTVNNPGR-----QPSKTQSSSGIVCQICGKPNHDALQCWHRFDQAYQAENNLKQ

Query:  AALATSGYTSDTNWYVDTGATDHITNDLERLTTRERYTGTDQIQVANGAGLSISHIGNSLI--SGSSLVLKHILYVPKINKHLISVQRLASDNNAVVEFH
        +A   +G  S+  WY DTGATDHIT DL+RLT  ++YTGTDQI  ANG G++IS+IGN+++  S  SL L+ +L+VP  +K+LISV RL +DN+  +EFH
Subjt:  AALATSGYTSDTNWYVDTGATDHITNDLERLTTRERYTGTDQIQVANGAGLSISHIGNSLI--SGSSLVLKHILYVPKINKHLISVQRLASDNNAVVEFH

Query:  PNYFLVKDRVTKKLLLHGRCKNGLYVLP--------HNFSQALLTAKLSKEQWHRRLGHPASPITIRILQDNNLAIDTNIPSSSICNACQLGKAHQLPFG
         ++FL+KDR TK +LLHG+C++GLY LP        HNFS    + ++  E WH+RLGHP+  I  R++ +NNL   +N  ++S+C+AC   KAHQLP+ 
Subjt:  PNYFLVKDRVTKKLLLHGRCKNGLYVLP--------HNFSQALLTAKLSKEQWHRRLGHPASPITIRILQDNNLAIDTNIPSSSICNACQLGKAHQLPFG

Query:  SSQHVSTAPLQLIHTDVWGPSIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDWGGEYHRLHNYFKSTGIEHHIS
         S   S+APL LI +DV+GP+I S    KYYVSF+DD+S++ WIY LR KSDV   F +FQ  VE M   KI + QSDWGGEY +L+ +FK+ GI H +S
Subjt:  SSQHVSTAPLQLIHTDVWGPSIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDWGGEYHRLHNYFKSTGIEHHIS

Query:  CPHTHQQNGLVERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFGCACWPNLRPYNNKKLSFRTTRC
        CPHTHQQNG  ERKHRHIVE GLALLAQ++MPL YWD AF  A +LINR PS+TI  DTPLHKL G +PDYS LR+FGCACWPNLRPYN  KL FR+TRC
Subjt:  CPHTHQQNGLVERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFGCACWPNLRPYNNKKLSFRTTRC

Query:  IFLGYSSSHKGYKCLNRSTGRIYISRDVVFDENIFPFEESKPPNKTTNPHHPVLLPALAKLANFYTENA-----LTDIEPVVSNSHMNDGQTDNIASDNL
        +FLGYS+ HKG+KCL+ STGRIYISRDVVFDE++FPF               +LLP  +   N  T++A      +   P ++  H   G ++   S+N 
Subjt:  IFLGYSSSHKGYKCLNRSTGRIYISRDVVFDENIFPFEESKPPNKTTNPHHPVLLPALAKLANFYTENA-----LTDIEPVVSNSHMNDGQTDNIASDNL

Query:  -------------------------------SGVSLSS-----ADNTRSSEEIAE------------YEAESSSINAQNQTHEHVSDQPTEAASQHPMRT
                                       +GVS ++     AD+  S   +A              +A  SS+      H+     P  AA+    RT
Subjt:  -------------------------------SGVSLSS-----ADNTRSSEEIAE------------YEAESSSINAQNQTHEHVSDQPTEAASQHPMRT

Query:  RLRNNIVQAKQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSV
        RL++ I + KQFTDGT+RY   + +               TEP ++ EA+  P+WR AM  E  AL++N TW LVPP    NLID KWV+KVK  ADGS+
Subjt:  RLRNNIVQAKQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSV

Query:  ERLKARLVAKGFKQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPKAW
        +RLKARLVAKGFKQ++G+DY DTFSPV+K STIR++LSLAV++ W++RQ+D+QNAFLHGIL+E VYM+QPPGF D+  P NY C L+K+LYGLKQAP+AW
Subjt:  ERLKARLVAKGFKQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPKAW

Query:  HSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRV
        +SRL+ KL  LGF  S AD SLFI       IY+L+YVDDIII  SS  A + ++ KLK DFA+KDLG L YFLGIEV +  DG++L Q +YA DLLKRV
Subjt:  HSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRV

Query:  NMEKCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKS-TMMLSG
         ME CKP+ TP+ ++EKL    G  LS EE  KYRS VGALQYLT+TRPDL++A+N+VCQ+LH PTD HW AVKRILR ++ T+ LG+ I+ S ++MLS 
Subjt:  NMEKCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKS-TMMLSG

Query:  FSDADWAGCPDDRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHI
        FSDADWAGCPDDR+ST G+A+FLG NLISW+S+KQ+TVSRSSTEAEYKA+AN TAE+IW++SLL ELG+  +  PRLWCDNLGATYL+S P+F+ARTKHI
Subjt:  FSDADWAGCPDDRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHI

Query:  EVDFHFVREQVARKAMEVRFISSSDQVADILTKPLSKTPFTTHCNNLNM
        EVDFHFVR++V  K +++R IS++DQVAD  TK L+         NLN+
Subjt:  EVDFHFVREQVARKAMEVRFISSSDQVADILTKPLSKTPFTTHCNNLNM

Q75G45 Putative polyprotein0.0e+0049.2Show/hide
Query:  MSSPSISQVISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKIII-NPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGIT
        +S+P     IS KLT++N+ LW+ QIL  LR   L   +  +  AP+  I  E  ++    KI+I NPE+  W+ QDQ VL  I SS++ EVL  + G  
Subjt:  MSSPSISQVISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKIII-NPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGIT

Query:  TAREAWITLERQFA------------STIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTSI--TTRTDVYTVSDVYAHMLSYEMRHL
        TA +AW  ++  F+             ++++Y  K++ L D +AA GK +++EEL+AY++ GL  ++D  V  +  T R    ++S VY+ +LSYE R +
Subjt:  TAREAWITLERQFA------------STIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTSI--TTRTDVYTVSDVYAHMLSYEMRHL

Query:  RKGTFEQLSSANNVNRISIRGGANGGRGSRGRSRQ--LNSGHGQSRRTVNNPGRQPSKTQSSSGIVCQICGKPNHDALQCWHRFDQAYQAENNLKQAALA
        R       +SAN  NR   RGG     G+RG  R      GHG+   +  + GR      + +  VCQ+C K  H A  CWHR+D +Y  +  L  A  A
Subjt:  RKGTFEQLSSANNVNRISIRGGANGGRGSRGRSRQ--LNSGHGQSRRTVNNPGRQPSKTQSSSGIVCQICGKPNHDALQCWHRFDQAYQAENNLKQAALA

Query:  TSGYTSDTNWYVDTGATDHITNDLERLTTRERYTGTDQIQVANGAGLSISHIGNSLISGSS--LVLKHILYVPKINKHLISVQRLASDNNAVVEFHPNYF
        T  Y  DTNWYVDTGATDHIT  L++LTT+ERY GTDQI  A+G G SI H+G++++   S  L LK++L+VP+  K+L+SV +L +DN A +E H  YF
Subjt:  TSGYTSDTNWYVDTGATDHITNDLERLTTRERYTGTDQIQVANGAGLSISHIGNSLISGSS--LVLKHILYVPKINKHLISVQRLASDNNAVVEFHPNYF

Query:  LVKDRVTKKLLLHGRCKNGLYVLPHNFS-QALLTAKLSKEQWHRRLGHPASPITIRILQDNNLAIDTNIPSSSICNACQLGKAHQLPFGSSQHVSTAPLQ
        L+KD+ T++ +L G C+ GLY LP   S +    A  S  +WH RLGHP+ PI +RIL  N L   +N  + S+C+ACQ  K HQLPF  S  VS  PL+
Subjt:  LVKDRVTKKLLLHGRCKNGLYVLPHNFS-QALLTAKLSKEQWHRRLGHPASPITIRILQDNNLAIDTNIPSSSICNACQLGKAHQLPFGSSQHVSTAPLQ

Query:  LIHTDVWGPSIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDWGGEYHRLHNYFKSTGIEHHISCPHTHQQNGLV
        LIH+DVWGP+  SV   KYYVSF+DD+S++VWIYFL+ KS+V   F +FQK VE   + KI S+Q+DWGGEY +LH +F   GI HH+SCPHTHQQNG V
Subjt:  LIHTDVWGPSIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDWGGEYHRLHNYFKSTGIEHHISCPHTHQQNGLV

Query:  ERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKG
        ERKHRHIVE GL+LLA A+MPL +WDEAF  A +LINR+PS+ I+ DTPL +LF ++PDY  LR FGCACWPNLRPYN  KL FR+ +C+FLG+S+ HKG
Subjt:  ERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKG

Query:  YKCLNRSTGRIYISRDVVFDENIFPFEESKPPNKTTNPHHPVLLPALA---------------KLANFYTENALTDIEPVVSNSHMND----------GQ
        +KCL+ STGR+Y+SRDV FDE +FPF    P           LLP                   L N    NA TDI    S  ++N           G 
Subjt:  YKCLNRSTGRIYISRDVVFDENIFPFEESKPPNKTTNPHHPVLLPALA---------------KLANFYTENALTDIEPVVSNSHMND----------GQ

Query:  TDNIASDNL---------------------------SGVSLSSADNTRSSEEIA---EYEAESSSINAQNQTHEHVSDQPTEAASQHPM---RTRLRNNI
          ++A ++                            SG+  + A +  +S   A      A +SS     Q+H+  S  P+E+A    +   +TRL++ I
Subjt:  TDNIASDNL---------------------------SGVSLSSADNTRSSEEIA---EYEAESSSINAQNQTHEHVSDQPTEAASQHPM---RTRLRNNI

Query:  VQAKQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKAR
         + K +TDGT+RYS                  ++ EP+ L EA+    W+ AM+ E  AL +N TW LVP K G N+ID KWVYKVKRKADGS++R KAR
Subjt:  VQAKQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKAR

Query:  LVAKGFKQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPKAWHSRLTG
        +VAKGFKQR+G+DY DTF+PV+K +TIR ILS+A+++GW +RQ+D+QNAFLHG+L+E+V+MRQPPG++   +   Y+CKL KALYGLKQAP+AW+SRL+ 
Subjt:  LVAKGFKQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPKAWHSRLTG

Query:  KLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCK
        KL ELGFK+S +D+SLF     ++ ++ML+YVDDII+ SSS  AT  L++ L  +FA+KDLG L YFLGIEVK+  +GI+L+Q +YA+D+LKRVNM  CK
Subjt:  KLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCK

Query:  PMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKS-TMMLSGFSDADW
         ++TP+  +EKL   +G P   E+  +YRS VGALQYLT+TRPDL+F+VNKVCQYLH PT  HW A KRILRY+K T+ LG+KI KS ++++S FSDADW
Subjt:  PMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKS-TMMLSGFSDADW

Query:  AGCPDDRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHIEVDFHF
        AGC DDR ST GFAVF+G NL+SWS+RKQATVSRSSTEAEYKA+AN+TAE++WI++LL ELG+   K  ++WCDN+GA Y+T+NPVFHARTKHIEVD+HF
Subjt:  AGCPDDRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHIEVDFHF

Query:  VREQVARKAMEVRFISSSDQVADILTKPLSKTPFTTHCNNLNM
        VRE+VARK ++V +IS+ DQVAD  TK LS        NNLN+
Subjt:  VREQVARKAMEVRFISSSDQVADILTKPLSKTPFTTHCNNLNM

Q75HT9 Putative polyprotein0.0e+0049.2Show/hide
Query:  MSSPSISQVISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKIII-NPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGIT
        +S+P     IS KLT++N+ LW+ QIL  LR   L   +  +  AP+  I  E  ++    KI+I NPE+  W+ QDQ VL  I SS++ EVL  + G  
Subjt:  MSSPSISQVISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKIII-NPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGIT

Query:  TAREAWITLERQFA------------STIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTSI--TTRTDVYTVSDVYAHMLSYEMRHL
        TA +AW  ++  F+             ++++Y  K++ L D +AA GK +++EEL+AY++ GL  ++D  V  +  T R    ++S VY+ +LSYE R +
Subjt:  TAREAWITLERQFA------------STIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTSI--TTRTDVYTVSDVYAHMLSYEMRHL

Query:  RKGTFEQLSSANNVNRISIRGGANGGRGSRGRSRQ--LNSGHGQSRRTVNNPGRQPSKTQSSSGIVCQICGKPNHDALQCWHRFDQAYQAENNLKQAALA
        R       +SAN  NR   RGG     G+RG  R      GHG+   +  + GR      + +  VCQ+C K  H A  CWHR+D +Y  +  L  A  A
Subjt:  RKGTFEQLSSANNVNRISIRGGANGGRGSRGRSRQ--LNSGHGQSRRTVNNPGRQPSKTQSSSGIVCQICGKPNHDALQCWHRFDQAYQAENNLKQAALA

Query:  TSGYTSDTNWYVDTGATDHITNDLERLTTRERYTGTDQIQVANGAGLSISHIGNSLISGSS--LVLKHILYVPKINKHLISVQRLASDNNAVVEFHPNYF
        T  Y  DTNWYVDTGATDHIT  L++LTT+ERY GTDQI  A+G G SI H+G++++   S  L LK++L+VP+  K+L+SV +L +DN A +E H  YF
Subjt:  TSGYTSDTNWYVDTGATDHITNDLERLTTRERYTGTDQIQVANGAGLSISHIGNSLISGSS--LVLKHILYVPKINKHLISVQRLASDNNAVVEFHPNYF

Query:  LVKDRVTKKLLLHGRCKNGLYVLPHNFS-QALLTAKLSKEQWHRRLGHPASPITIRILQDNNLAIDTNIPSSSICNACQLGKAHQLPFGSSQHVSTAPLQ
        L+KD+ T++ +L G C+ GLY LP   S +    A  S  +WH RLGHP+ PI +RIL  N L   +N  + S+C+ACQ  K HQLPF  S  VS  PL+
Subjt:  LVKDRVTKKLLLHGRCKNGLYVLPHNFS-QALLTAKLSKEQWHRRLGHPASPITIRILQDNNLAIDTNIPSSSICNACQLGKAHQLPFGSSQHVSTAPLQ

Query:  LIHTDVWGPSIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDWGGEYHRLHNYFKSTGIEHHISCPHTHQQNGLV
        LIH+DVWGP+  SV   KYYVSF+DD+S++VWIYFL+ KS+V   F +FQK VE   + KI S+Q+DWGGEY +LH +F   GI HH+SCPHTHQQNG V
Subjt:  LIHTDVWGPSIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDWGGEYHRLHNYFKSTGIEHHISCPHTHQQNGLV

Query:  ERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKG
        ERKHRHIVE GL+LLA A+MPL +WDEAF  A +LINR+PS+ I+ DTPL +LF ++PDY  LR FGCACWPNLRPYN  KL FR+ +C+FLG+S+ HKG
Subjt:  ERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKG

Query:  YKCLNRSTGRIYISRDVVFDENIFPFEESKPPNKTTNPHHPVLLPALA---------------KLANFYTENALTDIEPVVSNSHMND----------GQ
        +KCL+ STGR+Y+SRDV FDE +FPF    P           LLP                   L N    NA TDI    S  ++N           G 
Subjt:  YKCLNRSTGRIYISRDVVFDENIFPFEESKPPNKTTNPHHPVLLPALA---------------KLANFYTENALTDIEPVVSNSHMND----------GQ

Query:  TDNIASDNL---------------------------SGVSLSSADNTRSSEEIA---EYEAESSSINAQNQTHEHVSDQPTEAASQHPM---RTRLRNNI
          ++A ++                            SG+  + A +  +S   A      A +SS     Q+H+  S  P+E+A    +   +TRL++ I
Subjt:  TDNIASDNL---------------------------SGVSLSSADNTRSSEEIA---EYEAESSSINAQNQTHEHVSDQPTEAASQHPM---RTRLRNNI

Query:  VQAKQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKAR
         + K +TDGT+RYS                  ++ EP+ L EA+    W+ AM+ E  AL +N TW LVP K G N+ID KWVYKVKRKADGS++R KAR
Subjt:  VQAKQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKAR

Query:  LVAKGFKQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPKAWHSRLTG
        +VAKGFKQR+G+DY DTF+PV+K +TIR ILS+A+++GW +RQ+D+QNAFLHG+L+E+V+MRQPPG++   +   Y+CKL KALYGLKQAP+AW+SRL+ 
Subjt:  LVAKGFKQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPKAWHSRLTG

Query:  KLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCK
        KL ELGFK+S +D+SLF     ++ ++ML+YVDDII+ SSS  AT  L++ L  +FA+KDLG L YFLGIEVK+  +GI+L+Q +YA+D+LKRVNM  CK
Subjt:  KLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCK

Query:  PMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKS-TMMLSGFSDADW
         ++TP+  +EKL   +G P   E+  +YRS VGALQYLT+TRPDL+F+VNKVCQYLH PT  HW A KRILRY+K T+ LG+KI KS ++++S FSDADW
Subjt:  PMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKS-TMMLSGFSDADW

Query:  AGCPDDRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHIEVDFHF
        AGC DDR ST GFAVF+G NL+SWS+RKQATVSRSSTEAEYKA+AN+TAE++WI++LL ELG+   K  ++WCDN+GA Y+T+NPVFHARTKHIEVD+HF
Subjt:  AGCPDDRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHIEVDFHF

Query:  VREQVARKAMEVRFISSSDQVADILTKPLSKTPFTTHCNNLNM
        VRE+VARK ++V +IS+ DQVAD  TK LS        NNLN+
Subjt:  VREQVARKAMEVRFISSSDQVADILTKPLSKTPFTTHCNNLNM

V9GZT4 Copia-like retrotransposon Hopscotch polyprotein0.0e+0050.1Show/hide
Query:  ISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKIIINPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGITTAREAWITLE
        +S KLT+ NYLLW  Q+LP +R+  L   + G    P +TI    S+ +     + NP +  W  +DQ VL  + SS++ EVLS++V  +T+   W TL 
Subjt:  ISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKIIINPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGITTAREAWITLE

Query:  RQF---------------------ASTIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTSITTRTDVYTVSDVYAHMLSYEMR-HLRK
          +                     AS++A+YF K++   D L A GK ++DEE ++++L GL  D++PLVT++  R+D  T  D+Y  +LSYE R HL+ 
Subjt:  RQF---------------------ASTIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTSITTRTDVYTVSDVYAHMLSYEMR-HLRK

Query:  GTFEQLSSANNVNRISIRGGANGGRGSRGRSRQLNSGHGQSRRTVNNPGR-----QPSKTQSSSGIVCQICGKPNHDALQCWHRFDQAYQAENNLKQAAL
        G+   + S+ N  R   RG + G  G RG SR    G G SR    + GR       +   +SS   CQ+C +  H AL CW+RFD+ Y  +     +A 
Subjt:  GTFEQLSSANNVNRISIRGGANGGRGSRGRSRQLNSGHGQSRRTVNNPGR-----QPSKTQSSSGIVCQICGKPNHDALQCWHRFDQAYQAENNLKQAAL

Query:  ATSGYTSDTNWYVDTGATDHITNDLERLTTRERYTGTDQIQVANGAGLSISHIGNSLI--SGSSLVLKHILYVPKINKHLISVQRLASDNNAVVEFHPNY
          +G  S+  WY DTGATDHIT DL+RLT  ++YTGTDQI  ANG G++IS+IGN+++  S  SL L+ +L+VP  +K+LISV RL +DN+  +EFH ++
Subjt:  ATSGYTSDTNWYVDTGATDHITNDLERLTTRERYTGTDQIQVANGAGLSISHIGNSLI--SGSSLVLKHILYVPKINKHLISVQRLASDNNAVVEFHPNY

Query:  FLVKDRVTKKLLLHGRCKNGLYVLP--------HNFSQALLTAKLSKEQWHRRLGHPASPITIRILQDNNLAIDTNIPSSSICNACQLGKAHQLPFGSSQ
        FL+KDR TK +LLHG+C++GLY LP        HNFS    + ++  E WH+RLGHP+  I  R++ +NNL   +N  ++S+C+AC   KAHQLP+  S 
Subjt:  FLVKDRVTKKLLLHGRCKNGLYVLP--------HNFSQALLTAKLSKEQWHRRLGHPASPITIRILQDNNLAIDTNIPSSSICNACQLGKAHQLPFGSSQ

Query:  HVSTAPLQLIHTDVWGPSIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDWGGEYHRLHNYFKSTGIEHHISCPH
          S+APL LI +DV+GP+I S    KYYVSF+DD+S++ WIY LR KSDV   F +FQ  VE M   KI + QSDWGGEY +L+ +FK+ GI H +SCPH
Subjt:  HVSTAPLQLIHTDVWGPSIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDWGGEYHRLHNYFKSTGIEHHISCPH

Query:  THQQNGLVERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFGCACWPNLRPYNNKKLSFRTTRCIFL
        THQQNG  ERKHRHIVE GLALLAQ++MPL YWD AF  A +LINR PS+TI  DTPLHKL G +PDYS LR+FGCACWPNLRPYN  KL FR+TRC+FL
Subjt:  THQQNGLVERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFGCACWPNLRPYNNKKLSFRTTRCIFL

Query:  GYSSSHKGYKCLNRSTGRIYISRDVVFDENIFPFEESKPPNKTTNPHHPVLLPALAKLANFYTENA-----LTDIEPVVSNSHMNDGQTDNIASDNL---
        GYS+ HKG+KCL+ STGRIYISRDVVFDE++FPF               +LLP  +   N  T++A      +   P ++  H   G ++   S+N    
Subjt:  GYSSSHKGYKCLNRSTGRIYISRDVVFDENIFPFEESKPPNKTTNPHHPVLLPALAKLANFYTENA-----LTDIEPVVSNSHMNDGQTDNIASDNL---

Query:  ----------------------------SGVSLSS-----ADNTRSSEEIAE------------YEAESSSINAQNQTHEHVSDQPTEAASQHPMRTRLR
                                    +GVS ++     AD+  S   +A              +A  SS+      H+     P  AA+    RTRL+
Subjt:  ----------------------------SGVSLSS-----ADNTRSSEEIAE------------YEAESSSINAQNQTHEHVSDQPTEAASQHPMRTRLR

Query:  NNIVQAKQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERL
        + I + KQFTDGT+RY   + +               TEP ++ EA+  P+WR AM  E  AL++N TW LVPP    NLID KWV+KVK  ADGS++RL
Subjt:  NNIVQAKQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERL

Query:  KARLVAKGFKQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPKAWHSR
        KARLVAKGFKQ++G+DY DTFSPV+K STIR++LSLAV++ W++RQ+D+QNAFLHGIL+E VYM+QPPGF D+  P NY C L+K+LYGLKQ P+AW+SR
Subjt:  KARLVAKGFKQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPKAWHSR

Query:  LTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNME
        L+ KL  LGF  S AD SLFI       IY+L+YVDDIII  SS  A + ++ KLK DFA+KDLG L YFLGIEV +  DG++L Q +YA DLLKRV ME
Subjt:  LTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNME

Query:  KCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKS-TMMLSGFSD
         CKP+ TP+ ++EKL    G  LS EE  KYRS VGALQYLT+TRPDL++A+N+VCQ+LH PTD HW AVKRILR ++ T+ LG+ I+ S ++MLS FSD
Subjt:  KCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKS-TMMLSGFSD

Query:  ADWAGCPDDRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHIEVD
        ADWAGCPDDR+ST G+A+FLG NLISW+S+KQ+TVSRSSTEAEYKA+AN TAE+IW++SLL ELG+  +  PRLWCDNLGATYL+S P+F+ARTKHIEVD
Subjt:  ADWAGCPDDRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHIEVD

Query:  FHFVREQVARKAMEVRFISSSDQVADILTKPLS
        FHFVR++V  K +++R IS++DQVAD  TK L+
Subjt:  FHFVREQVARKAMEVRFISSSDQVADILTKPLS

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.4e-13728.42Show/hide
Query:  ENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKII--INPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGITTAREAWITLERQFAS
        E Y +W  +I   L  Q+++  VDG MP      + + +E      II  ++  F  +   D     ++ +        ++      R+  ++L+     
Subjt:  ENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKII--INPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGITTAREAWITLERQFAS

Query:  TIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTSITTRTDV-YTVSDVYAHMLSYEMRHLRKGTFEQLSSANNVNRISIRGGANGGRG
        ++  +F     L   L A G +IE+ + I+++L  L   YD ++T+I T ++   T++ V   +L  E+    K   +   ++  V    +    N  + 
Subjt:  TIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTSITTRTDV-YTVSDVYAHMLSYEMRHLRKGTFEQLSSANNVNRISIRGGANGGRG

Query:  SRGRSRQLNSGHGQSRRTVNNPGRQPSKTQSSSGIVCQICGKPNHDALQCWHRFDQAYQAEN--NLKQAALATS-------------GYTSDTNWYVDTG
        +  ++R            V  P ++  K  S   + C  CG+  H    C+H + +    +N  N KQ   ATS                 +  + +D+G
Subjt:  SRGRSRQLNSGHGQSRRTVNNPGRQPSKTQSSSGIVCQICGKPNHDALQCWHRFDQAYQAEN--NLKQAALATS-------------GYTSDTNWYVDTG

Query:  ATDHITNDLERLTTRERYTGTDQIQVANGAGLSISHIGN----------SLISGSSLVLKHILYVPKINKHLISVQRLASDNNAVVEFHPNYFLVKDRVT
        A+DH+ ND E L        TD ++V     ++++  G            L +   + L+ +L+  +   +L+SV+RL  +    +EF       K  VT
Subjt:  ATDHITNDLERLTTRERYTGTDQIQVANGAGLSISHIGN----------SLISGSSLVLKHILYVPKINKHLISVQRLASDNNAVVEFHPNYFLVKDRVT

Query:  KKLLLHGRCKNGLYVLPH----------NFSQALLTAKLSK--EQWHRRLGHPASPITIRILQDN---NLAIDTNIP-SSSICNACQLGKAHQLPF---G
                 KNGL V+ +          NF    + AK       WH R GH +    + I + N   + ++  N+  S  IC  C  GK  +LPF    
Subjt:  KKLLLHGRCKNGLYVLPH----------NFSQALLTAKLSK--EQWHRRLGHPASPITIRILQDN---NLAIDTNIP-SSSICNACQLGKAHQLPF---G

Query:  SSQHVSTAPLQLIHTDVWGP-SIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDWGGEY--HRLHNYFKSTGIEH
           H+   PL ++H+DV GP +  ++++  Y+V FVD F+ Y   Y ++ KSDV S+F  F    E   N K+  +  D G EY  + +  +    GI +
Subjt:  SSQHVSTAPLQLIHTDVWGP-SIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDWGGEY--HRLHNYFKSTGIEH

Query:  HISCPHTHQQNGLVERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTI--QQDTPLHKLFGKSPDYSMLRVFGCACWPNLRPYNNKKLSF
        H++ PHT Q NG+ ER  R I E    +++ A +  S+W EA  TA +LINR+PSR +     TP      K P    LRVFG   + +++   NK+  F
Subjt:  HISCPHTHQQNGLVERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTI--QQDTPLHKLFGKSPDYSMLRVFGCACWPNLRPYNNKKLSF

Query:  --RTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFD------------ENIFPFEESKPPNKT-TNPHHPVL-------------LPALAKLANFYTE
          ++ + IF+GY  +  G+K  +    +  ++RDVV D            E +F  +  +  NK   N    ++             +  L        +
Subjt:  --RTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFD------------ENIFPFEESKPPNKT-TNPHHPVL-------------LPALAKLANFYTE

Query:  NALTDIEPVVSNSHMNDG-QTDNIA----SDNLSGVSLSSADNTRSSEEIAEYEAESS-SINAQNQTHEHVS----DQPTEAASQHPMRTRLRNNIVQAK
        N   D   ++     N+  + DNI     S   +   L+ +   +  + + E +   + + + +++T EH+     D PT+      +  R        +
Subjt:  NALTDIEPVVSNSHMNDG-QTDNIA----SDNLSGVSLSSADNTRSSEEIAEYEAESS-SINAQNQTHEHVS----DQPTEAASQHPMRTRLRNNIVQAK

Query:  QFTDGTIRYSETSRKFASAVTITTPII-ETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVA
          T   I Y+E        V     I  +       +Q       W  A+N EL+A K N TW +       N++DS+WV+ VK    G+  R KARLVA
Subjt:  QFTDGTIRYSETSRKFASAVTITTPII-ETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVA

Query:  KGFKQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPKAWHSRLTGKLI
        +GF Q++ +DY +TF+PV + S+ R ILSL +     + Q+D++ AFL+G LKEE+YMR P G   ++   + +CKL KA+YGLKQA + W       L 
Subjt:  KGFKQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPKAWHSRLTGKLI

Query:  ELGFKASVADSSLFILKNREI--TIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCKP
        E  F  S  D  ++IL    I   IY+L+YVDD++I +          + L   F + DL  +++F+GI ++   D I LSQ  Y   +L + NME C  
Subjt:  ELGFKASVADSSLFILKNREI--TIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCKP

Query:  MSTPMGSAEKLFREQGIPLSAEEQFK--YRSTVGALQYLTM-TRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKSTMM---LSGFS
        +STP+ S  K+  E    L+++E      RS +G L Y+ + TRPDL  AVN + +Y        W  +KR+LRY+KGT+ + +  +K+      + G+ 
Subjt:  MSTPMGSAEKLFREQGIPLSAEEQFK--YRSTVGALQYLTM-TRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKSTMM---LSGFS

Query:  DADWAGCPDDRRSTSGFAV-FLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHIE
        D+DWAG   DR+ST+G+       NLI W++++Q +V+ SSTEAEY A+     E +W+K LL  + +      +++ DN G   + +NP  H R KHI+
Subjt:  DADWAGCPDDRRSTSGFAV-FLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHIE

Query:  VDFHFVREQVARKAMEVRFISSSDQVADILTKPLSKTPF
        + +HF REQV    + + +I + +Q+ADI TKPL    F
Subjt:  VDFHFVREQVARKAMEVRFISSSDQVADILTKPLSKTPF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.3e-16431.73Show/hide
Query:  WYPQDQLVLSLINSSVTEEVLSTMVGITTAREAWITLERQFAS---TIADYFRK-------------VKHLG------DTLAAIGKRIEDEELIAYMLQG
        W   D+   S I   ++++V++ ++   TAR  W  LE  + S   T   Y +K             + HL         LA +G +IE+E+    +L  
Subjt:  WYPQDQLVLSLINSSVTEEVLSTMVGITTAREAWITLERQFAS---TIADYFRK-------------VKHLG------DTLAAIGKRIEDEELIAYMLQG

Query:  LGPDYDPLVTSITTRTDVYTVSDVYAHMLSYEMRHLRKGTFEQLSSANNVNRISIRGGANGGR-GSRGRSRQLNSGHGQSRRTVNNPGRQPSKTQSSSGI
        L   YD L T+I        + DV + +L  E    +     Q        R   R   N GR G+RG+S+  +    ++    N PG       +    
Subjt:  LGPDYDPLVTSITTRTDVYTVSDVYAHMLSYEMRHLRKGTFEQLSSANNVNRISIRGGANGGR-GSRGRSRQLNSGHGQSRRTVNNPGRQPSKTQSSSGI

Query:  VCQICGKPNHDALQCWHRFDQAYQAENNLKQAALATSGYTSDTNWYVDTGATDHITNDLERLTTRERYTGTD--QIQVANGAGLSISHIGNSLIS---GS
          +  G+ N D      + +       N ++  +  SG   ++ W VDT A+ H T   +      RY   D   +++ N +   I+ IG+  I    G 
Subjt:  VCQICGKPNHDALQCWHRFDQAYQAENNLKQAALATSGYTSDTNWYVDTGATDHITNDLERLTTRERYTGTD--QIQVANGAGLSISHIGNSLIS---GS

Query:  SLVLKHILYVPKINKHLISVQRLASDNNAVVEFHPNYFLVKDRVTKKLLL--HGRCKNGLYVLPHNFSQALLTA---KLSKEQWHRRLGHPASPITIRIL
        +LVLK + +VP +  +LIS   +A D +    +  N    K R+TK  L+   G  +  LY       Q  L A   ++S + WH+R+GH  S   ++IL
Subjt:  SLVLKHILYVPKINKHLISVQRLASDNNAVVEFHPNYFLVKDRVTKKLLL--HGRCKNGLYVLPHNFSQALLTA---KLSKEQWHRRLGHPASPITIRIL

Query:  QDNNLAIDTNIPSSSICNACQLGKAHQLPFGSSQHVSTAPLQLIHTDVWGP-SIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETML
           +L       +   C+ C  GK H++ F +S       L L+++DV GP  I S+  +KY+V+F+DD SR +W+Y L+ K  V  VF +F   VE   
Subjt:  QDNNLAIDTNIPSSSICNACQLGKAHQLPFGSSQHVSTAPLQLIHTDVWGP-SIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETML

Query:  NTKIRSVQSDWGGEY--HRLHNYFKSTGIEHHISCPHTHQQNGLVERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFG
          K++ ++SD GGEY       Y  S GI H  + P T Q NG+ ER +R IVE   ++L  A +P S+W EA  TAC+LINR PS  +  + P      
Subjt:  NTKIRSVQSDWGGEY--HRLHNYFKSTGIEHHISCPHTHQQNGLVERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFG

Query:  KSPDYSMLRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFDENIFPFEESKPPNKTTNPHHPVLLPALAKLANFYT
        K   YS L+VFGC  + ++      KL  ++  CIF+GY     GY+  +    ++  SRDVVF E+                                 
Subjt:  KSPDYSMLRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFDENIFPFEESKPPNKTTNPHHPVLLPALAKLANFYT

Query:  ENALTDIEPVVSNSHMNDGQTDNIASDNLSGVSLSSADNTRSSEEIAEYEAESSSINAQNQTHEHVSDQPTEAASQH-PMRTRLRNNIVQAKQFTDGTIR
             D+   V N  + +  T    S+N +    ++ + +   E+  E   +   ++   +  EH    PT+   QH P+R   R  +            
Subjt:  ENALTDIEPVVSNSHMNDGQTDNIASDNLSGVSLSSADNTRSSEEIAEYEAESSSINAQNQTHEHVSDQPTEAASQH-PMRTRLRNNIVQAKQFTDGTIR

Query:  YSETSRKFASAVTITTPIIETATEPRNLQEAMQHP---RWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQR
            SR++ S   +   +I    EP +L+E + HP   +   AM +E+ +L++N T+ LV    G   +  KWV+K+K+  D  + R KARLV KGF+Q+
Subjt:  YSETSRKFASAVTITTPIIETATEPRNLQEAMQHP---RWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQR

Query:  FGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPKAWHSRLTGKLIELGFKA
         G+D+ + FSPV+K ++IR ILSLA +    + Q+D++ AFLHG L+EE+YM QP GF+ + K K+ +CKL K+LYGLKQAP+ W+ +    +    +  
Subjt:  FGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGLKQAPKAWHSRLTGKLIELGFKA

Query:  SVADSSLFILKNREIT-IYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEV--KKTRDGIILSQRRYALDLLKRVNMEKCKPMSTPM
        + +D  ++  +  E   I +L+YVDD++IV        +L   L   F +KDLG  +  LG+++  ++T   + LSQ +Y   +L+R NM+  KP+STP+
Subjt:  SVADSSLFILKNREIT-IYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEV--KKTRDGIILSQRRYALDLLKRVNMEKCKPMSTPM

Query:  GSAEKLFREQGIPLSAEE-----QFKYRSTVGALQY-LTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKSTMMLSGFSDADWA
            KL ++   P + EE     +  Y S VG+L Y +  TRPD+A AV  V ++L  P   HW AVK ILRY++GT    +    S  +L G++DAD A
Subjt:  GSAEKLFREQGIPLSAEE-----QFKYRSTVGALQY-LTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQKSTMMLSGFSDADWA

Query:  GCPDDRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHIEVDFHFV
        G  D+R+S++G+        ISW S+ Q  V+ S+TEAEY A      EMIW+K  L+ELG++Q K   ++CD+  A  L+ N ++HARTKHI+V +H++
Subjt:  GCPDDRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHIEVDFHFV

Query:  REQVARKAMEVRFISSSDQVADILTKPLSKTPF
        RE V  ++++V  IS+++  AD+LTK + +  F
Subjt:  REQVARKAMEVRFISSSDQVADILTKPLSKTPF

P92519 Uncharacterized mitochondrial protein AtMg008105.1e-5549.12Show/hide
Query:  IYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCKPMSTPMGSAEKLFREQGIPLSAEEQ
        +Y+L+YVDDI++  SS+     LI +L   F++KDLG + YFLGI++K    G+ LSQ +YA  +L    M  CKPMSTP+                 + 
Subjt:  IYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCKPMSTPMGSAEKLFREQGIPLSAEEQ

Query:  FKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQK-STMMLSGFSDADWAGCPDDRRSTSGFAVFLGANLISWS
          +RS VGALQYLT+TRPD+++AVN VCQ +H PT A +  +KR+LRYVKGT+  G+ I K S + +  F D+DWAGC   RRST+GF  FLG N+ISWS
Subjt:  FKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQK-STMMLSGFSDADWAGCPDDRRSTSGFAVFLGANLISWS

Query:  SRKQATVSRSSTEAEYKAIANLTAEMIW
        +++Q TVSRSSTE EY+A+A   AE+ W
Subjt:  SRKQATVSRSSTEAEYKAIANLTAEMIW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.6e-28038.94Show/hide
Query:  NTMSSPSISQVISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKIIINPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGI
        N  S  +++     KLT  NYL+WS Q+        L GF+DGS   P  TI  + +         +NP++T W  QD+L+ S +  +++  V   +   
Subjt:  NTMSSPSISQVISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKIIINPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGI

Query:  TTAREAWITLERQFAS---------------------TIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTSITTRTDVYTVSDVYAHM
        TTA + W TL + +A+                     TI DY + +    D LA +GK ++ +E +  +L+ L  +Y P++  I  +    T+++++  +
Subjt:  TTAREAWITLERQFAS---------------------TIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTSITTRTDVYTVSDVYAHM

Query:  LSYEMRHLRKGTFEQL----SSANNVNRISIRGGANGGRGSR--GRSRQLNSGHGQSRRTVNNPGRQPSKTQSSSGIVCQICGKPNHDALQCWHRFDQAY
        L++E + L   +   +    ++ ++ N  +     NG R +R   R+   NS   Q   T  +P    SK        CQICG   H A +C     Q +
Subjt:  LSYEMRHLRKGTFEQL----SSANNVNRISIRGGANGGRGSR--GRSRQLNSGHGQSRRTVNNPGRQPSKTQSSSGIVCQICGKPNHDALQCWHRFDQAY

Query:  QAENNLKQ-----------AALATSGYTSDTNWYVDTGATDHITNDLERLTTRERYTGTDQIQVANGAGLSISHIGNSLISGSS--LVLKHILYVPKINK
         +  N +Q           A LA     S  NW +D+GAT HIT+D   L+  + YTG D + VA+G+ + ISH G++ +S  S  L L +ILYVP I+K
Subjt:  QAENNLKQ-----------AALATSGYTSDTNWYVDTGATDHITNDLERLTTRERYTGTDQIQVANGAGLSISHIGNSLISGSS--LVLKHILYVPKINK

Query:  HLISVQRLASDNNAVVEFHPNYFLVKDRVTKKLLLHGRCKNGLYVLPHNFSQ-----ALLTAKLSKEQWHRRLGHPASPITIRILQDNNLAIDTNIPSSS
        +LISV RL + N   VEF P  F VKD  T   LL G+ K+ LY  P   SQ     A  ++K +   WH RLGHPA  I   ++ + +L++        
Subjt:  HLISVQRLASDNNAVVEFHPNYFLVKDRVTKKLLLHGRCKNGLYVLPHNFSQ-----ALLTAKLSKEQWHRRLGHPASPITIRILQDNNLAIDTNIPSSS

Query:  ICNACQLGKAHQLPFGSSQHVSTAPLQLIHTDVWGPSIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDWGGEYH
         C+ C + K++++PF  S   ST PL+ I++DVW   I S +N +YYV FVD F+RY W+Y L+ KS V+  F+ F+  +E    T+I +  SD GGE+ 
Subjt:  ICNACQLGKAHQLPFGSSQHVSTAPLQLIHTDVWGPSIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDWGGEYH

Query:  RLHNYFKSTGIEHHISCPHTHQQNGLVERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFGCACWPN
         L  YF   GI H  S PHT + NGL ERKHRHIVETGL LL+ A++P +YW  AF  A +LINR+P+  +Q ++P  KLFG SP+Y  LRVFGCAC+P 
Subjt:  RLHNYFKSTGIEHHISCPHTHQQNGLVERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFGCACWPN

Query:  LRPYNNKKLSFRTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFDENIFPF--------------EESK----------------PPNKTTNPHHPVL
        LRPYN  KL  ++ +C+FLGYS +   Y CL+  T R+YISR V FDEN FPF               ES                 P    ++PHH   
Subjt:  LRPYNNKKLSFRTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFDENIFPF--------------EESK----------------PPNKTTNPHHPVL

Query:  LP------------ALAKLANFYTENALTDIEPVVSNSHMNDGQTDNIASDNLSGVSLSSADNTRSSEEIAEYEAESSSINAQNQTHEHVSDQPTEAASQ
         P            + + L + ++ +  +  EP     +     T    +   +  S +++ N  ++E  ++  A+S S  AQ+ +    S  PT +AS 
Subjt:  LP------------ALAKLANFYTENALTDIEPVVSNSHMNDGQTDNIASDNLSGVSLSSADNTRSSEEIAEYEAESSSINAQNQTHEHVSDQPTEAASQ

Query:  H--------------PMRTRLRNNIVQA-----KQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVP
                       P   ++ NN  QA        T       + + K++ AV++        +EPR   +A++  RWR AM  E++A   N TWDLVP
Subjt:  H--------------PMRTRLRNNIVQA-----KQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVP

Query:  PKPG-INLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQD
        P P  + ++  +W++  K  +DGS+ R KARLVAKG+ QR G+DY +TFSPVIK ++IR++L +AV + W +RQ+D+ NAFL G L ++VYM QPPGF D
Subjt:  PKPG-INLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQD

Query:  SAKPKNYICKLKKALYGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLG
          +P NY+CKL+KALYGLKQAP+AW+  L   L+ +GF  SV+D+SLF+L+  +  +YML+YVDDI+I  +        +  L   F+VKD   L YFLG
Subjt:  SAKPKNYICKLKKALYGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLG

Query:  IEVKKTRDGIILSQRRYALDLLKRVNMEKCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKR
        IE K+   G+ LSQRRY LDLL R NM   KP++TPM  + KL    G  L+  +  +YR  VG+LQYL  TRPD+++AVN++ Q++H PT+ H  A+KR
Subjt:  IEVKKTRDGIILSQRRYALDLLKRVNMEKCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKR

Query:  ILRYVKGTLALGVKIQK-STMMLSGFSDADWAGCPDDRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAP
        ILRY+ GT   G+ ++K +T+ L  +SDADWAG  DD  ST+G+ V+LG + ISWSS+KQ  V RSSTEAEY+++AN ++EM WI SLL ELG+  ++ P
Subjt:  ILRYVKGTLALGVKIQK-STMMLSGFSDADWAGCPDDRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAP

Query:  RLWCDNLGATYLTSNPVFHARTKHIEVDFHFVREQVARKAMEVRFISSSDQVADILTKPLSKTPFTTHCNNLNM
         ++CDN+GATYL +NPVFH+R KHI +D+HF+R QV   A+ V  +S+ DQ+AD LTKPLS+T F    + + +
Subjt:  RLWCDNLGATYLTSNPVFHARTKHIEVDFHFVREQVARKAMEVRFISSSDQVADILTKPLSKTPFTTHCNNLNM

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE28.1e-27938.68Show/hide
Query:  NNTMSSPSISQVISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKIIINPEFTVWYPQDQLVLSLINSSVTEEVLSTMVG
        N  + + ++S V   KLT  NYL+WS Q+        L GF+DGS P P  TI  +           +NP++T W  QD+L+ S I  +++  V   +  
Subjt:  NNTMSSPSISQVISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKIIINPEFTVWYPQDQLVLSLINSSVTEEVLSTMVG

Query:  ITTAREAWITLERQFASTIADYFRKVKHLG--DTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTSITTRTDVYTVSDVYAHMLSYEMRHLRKGTFEQLSS
         TTA + W TL + +A+    +  +++ +   D LA +GK ++ +E +  +L+ L  DY P++  I  +    ++++++  +++ E + L   + E +  
Subjt:  ITTAREAWITLERQFASTIADYFRKVKHLG--DTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTSITTRTDVYTVSDVYAHMLSYEMRHLRKGTFEQLSS

Query:  ANNVNRISIRG-GANGGRGSRGRSRQLNSGHGQSRRTVNNPGRQPSKTQSSSGIV--CQICGKPNHDALQC--WHRFDQAYQAENNLK-------QAALA
          NV  ++ R    N  + +RG +R  N+ + +S     +     S  +     +  CQIC    H A +C   H+F      + +         +A LA
Subjt:  ANNVNRISIRG-GANGGRGSRGRSRQLNSGHGQSRRTVNNPGRQPSKTQSSSGIV--CQICGKPNHDALQC--WHRFDQAYQAENNLK-------QAALA

Query:  TSGYTSDTNWYVDTGATDHITNDLERLTTRERYTGTDQIQVANGAGLSISHIGNSLI--SGSSLVLKHILYVPKINKHLISVQRLASDNNAVVEFHPNYF
         +   +  NW +D+GAT HIT+D   L+  + YTG D + +A+G+ + I+H G++ +  S  SL L  +LYVP I+K+LISV RL + N   VEF P  F
Subjt:  TSGYTSDTNWYVDTGATDHITNDLERLTTRERYTGTDQIQVANGAGLSISHIGNSLI--SGSSLVLKHILYVPKINKHLISVQRLASDNNAVVEFHPNYF

Query:  LVKDRVTKKLLLHGRCKNGLYVLPHNFSQALL-----TAKLSKEQWHRRLGHPASPITIRILQDNNLAIDTNIPSSSICNACQLGKAHQLPFGSSQHVST
         VKD  T   LL G+ K+ LY  P   SQA+       +K +   WH RLGHP+  I   ++ +++L +         C+ C + K+H++PF +S   S+
Subjt:  LVKDRVTKKLLLHGRCKNGLYVLPHNFSQALL-----TAKLSKEQWHRRLGHPASPITIRILQDNNLAIDTNIPSSSICNACQLGKAHQLPFGSSQHVST

Query:  APLQLIHTDVWGPSIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDWGGEYHRLHNYFKSTGIEHHISCPHTHQQ
         PL+ I++DVW   I S++N +YYV FVD F+RY W+Y L+ KS V+  F+ F+  VE    T+I ++ SD GGE+  L +Y    GI H  S PHT + 
Subjt:  APLQLIHTDVWGPSIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNTKIRSVQSDWGGEYHRLHNYFKSTGIEHHISCPHTHQQ

Query:  NGLVERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSS
        NGL ERKHRHIVE GL LL+ A++P +YW  AF+ A +LINR+P+  +Q  +P  KLFG+ P+Y  L+VFGCAC+P LRPYN  KL  ++ +C F+GYS 
Subjt:  NGLVERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFGCACWPNLRPYNNKKLSFRTTRCIFLGYSS

Query:  SHKGYKCLNRSTGRIYISRDVVFDENIFPF--------------EESKP--PNKTTNPHHPVLLPALAKLANFYTENALTDIEPVVSNSHMNDGQTDNIA
        +   Y CL+  TGR+Y SR V FDE  FPF               +S P  P+ TT P  P++LPA   L      +  T   P  S S +    T  ++
Subjt:  SHKGYKCLNRSTGRIYISRDVVFDENIFPF--------------EESKP--PNKTTNPHHPVLLPALAKLANFYTENALTDIEPVVSNSHMNDGQTDNIA

Query:  SDNLSGVSLSSADNTRSSEEIAEYEAESSSINAQNQTHEHVSDQP--------TEAASQHPMRTRLRNNIVQAKQFTDGTIRYSETSRKFASA-------
        S NL   S+SS     SSE  A            +QT    S+ P        + + +     + L  + + +      +   SE +   +S+       
Subjt:  SDNLSGVSLSSADNTRSSEEIAEYEAESSSINAQNQTHEHVSDQP--------TEAASQHPMRTRLRNNIVQAKQFTDGTIRYSETSRKFASA-------

Query:  -VTITTPIIE----------------------------------TATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLV-PPKPGINLIDSKWVYKVK
         V    PII+                                    +EPR   +AM+  RWR AM  E++A   N TWDLV PP P + ++  +W++  K
Subjt:  -VTITTPIIE----------------------------------TATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLV-PPKPGINLIDSKWVYKVK

Query:  RKADGSVERLKARLVAKGFKQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGL
          +DGS+ R KARLVAKG+ QR G+DY +TFSPVIK ++IR++L +AV + W +RQ+D+ NAFL G L +EVYM QPPGF D  +P +Y+C+L+KA+YGL
Subjt:  RKADGSVERLKARLVAKGFKQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGFQDSAKPKNYICKLKKALYGL

Query:  KQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYA
        KQAP+AW+  L   L+ +GF  S++D+SLF+L+     IYML+YVDDI+I  +     +  +  L   F+VK+   L YFLGIE K+   G+ LSQRRY 
Subjt:  KQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYA

Query:  LDLLKRVNMEKCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQK-
        LDLL R NM   KP++TPM ++ KL    G  L   +  +YR  VG+LQYL  TRPDL++AVN++ QY+H PTD HW A+KR+LRY+ GT   G+ ++K 
Subjt:  LDLLKRVNMEKCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQK-

Query:  STMMLSGFSDADWAGCPDDRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVF
        +T+ L  +SDADWAG  DD  ST+G+ V+LG + ISWSS+KQ  V RSSTEAEY+++AN ++E+ WI SLL ELG+  S  P ++CDN+GATYL +NPVF
Subjt:  STMMLSGFSDADWAGCPDDRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVF

Query:  HARTKHIEVDFHFVREQVARKAMEVRFISSSDQVADILTKPLSKTPFTTHCNNLNMYK
        H+R KHI +D+HF+R QV   A+ V  +S+ DQ+AD LTKPLS+  F      + + K
Subjt:  HARTKHIEVDFHFVREQVARKAMEVRFISSSDQVADILTKPLSKTPFTTHCNNLNMYK

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)3.7e-0822.22Show/hide
Query:  ISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKIIINPEFTVWYPQDQLV-LSLINSSVTEEVLSTMVGITTAREAWITL
        + + + + NY  W    L +  S +++G +DG++                   +  N     W  +D +V LSL  +   ++   + V  +T+R+ W+ +
Subjt:  ISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKIIINPEFTVWYPQDQLV-LSLINSSVTEEVLSTMVGITTAREAWITL

Query:  ERQFAST---------------------IADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTSITTRTDVYTVSDVYAHMLSYEMR----
        + QF +                      +ADY+RK+K L D+L  +   + D  L+ Y+L GL P +D ++  I  R    +  D    +   E R    
Subjt:  ERQFAST---------------------IADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTSITTRTDVYTVSDVYAHMLSYEMR----

Query:  ---------HLRKGTFEQLSSANNVNRISIRGGANGGRGSRGRSRQLNSGHGQSRRTVNNP
                 H    T    S A  V      GG   G   RGR   +  G G      N P
Subjt:  ---------HLRKGTFEQLSSANNVNRISIRGGANGGRGSRGRSRQLNSGHGQSRRTVNNP

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.5e-11343.98Show/hide
Query:  IETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDTFSPVIKPSTIRVI
        I  A EP    EA +   W GAM+DE+ A++   TW++    P    I  KWVYK+K  +DG++ER KARLVAKG+ Q+ G+D+ +TFSPV K +++++I
Subjt:  IETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDTFSPVIKPSTIRVI

Query:  LSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGF---QDSAKPKNYICKLKKALYGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIY
        L+++    + + Q+DI NAFL+G L EE+YM+ PPG+   Q  + P N +C LKK++YGLKQA + W  + +  LI  GF  S +D + F+     + + 
Subjt:  LSLAVTKGWNMRQVDIQNAFLHGILKEEVYMRQPPGF---QDSAKPKNYICKLKKALYGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIY

Query:  MLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCKPMSTPMGSAEKLFREQGIPLSAEEQFK
        +L+YVDDIII S++D A + L  +LK  F ++DLG L+YFLG+E+ ++  GI + QR+YALDLL    +  CKP S PM  +       G      +   
Subjt:  MLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCKPMSTPMGSAEKLFREQGIPLSAEEQFK

Query:  YRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGV-KIQKSTMMLSGFSDADWAGCPDDRRSTSGFAVFLGANLISWSSR
        YR  +G L YL +TR D++FAVNK+ Q+   P  AH  AV +IL Y+KGT+  G+    ++ M L  FSDA +  C D RRST+G+ +FLG +LISW S+
Subjt:  YRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGV-KIQKSTMMLSGFSDADWAGCPDDRRSTSGFAVFLGANLISWSSR

Query:  KQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHIEVDFHFVREQVARKA
        KQ  VS+SS EAEY+A++  T EM+W+    +EL +  SK   L+CDN  A ++ +N VFH RTKHIE D H VRE+   +A
Subjt:  KQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTKHIEVDFHFVREQVARKA

ATMG00240.1 Gag-Pol-related retrotransposon family protein1.2e-1447.73Show/hide
Query:  YLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQ-KSTMMLSGFSDADWAGCPDDRRSTSGFAV-----FLGA
        YLT+TRPDL FAVN++ Q+      A   AV ++L YVKGT+  G+     S + L  F+D+DWA CPD RRS +GF       FLGA
Subjt:  YLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQ-KSTMMLSGFSDADWAGCPDDRRSTSGFAV-----FLGA

ATMG00810.1 DNA/RNA polymerases superfamily protein3.6e-5649.12Show/hide
Query:  IYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCKPMSTPMGSAEKLFREQGIPLSAEEQ
        +Y+L+YVDDI++  SS+     LI +L   F++KDLG + YFLGI++K    G+ LSQ +YA  +L    M  CKPMSTP+                 + 
Subjt:  IYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIEVKKTRDGIILSQRRYALDLLKRVNMEKCKPMSTPMGSAEKLFREQGIPLSAEEQ

Query:  FKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQK-STMMLSGFSDADWAGCPDDRRSTSGFAVFLGANLISWS
          +RS VGALQYLT+TRPD+++AVN VCQ +H PT A +  +KR+LRYVKGT+  G+ I K S + +  F D+DWAGC   RRST+GF  FLG N+ISWS
Subjt:  FKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALGVKIQK-STMMLSGFSDADWAGCPDDRRSTSGFAVFLGANLISWS

Query:  SRKQATVSRSSTEAEYKAIANLTAEMIW
        +++Q TVSRSSTE EY+A+A   AE+ W
Subjt:  SRKQATVSRSSTEAEYKAIANLTAEMIW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)6.2e-2449.11Show/hide
Query:  AVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDTFSPVI
        ++TITT I     EP+++  A++ P W  AM +EL AL RN TW LVPP    N++  KWV+K K  +DG+++RLKARLVAKGF Q  G+ + +T+SPV+
Subjt:  AVTITTPIIETATEPRNLQEAMQHPRWRGAMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDTFSPVI

Query:  KPSTIRVILSLA
        + +TIR IL++A
Subjt:  KPSTIRVILSLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAATCCCAAACAACACAATGTCTTCTCCGTCGATCAGTCAGGTGATTAGTGTTAAGCTTACACAAGAAAATTATCTACTGTGGTCTACCCAAATCCTTCCCTACTT
GCGTAGCCAAAACCTTGTTGGTTTTGTGGATGGATCCATGCCTGCACCAAGCCAGACGATCGCCGTTGAACCAAGTGAAGAAACAGGGAATCGCAAAATTATCATCAACC
CTGAGTTCACAGTCTGGTACCCCCAGGACCAGCTGGTACTCAGCCTCATCAACTCATCAGTCACTGAGGAGGTTCTCAGCACGATGGTTGGAATCACCACTGCACGAGAA
GCCTGGATTACGCTGGAGCGACAATTTGCTTCGACAATTGCTGACTACTTTCGTAAAGTAAAACATCTTGGTGATACACTTGCTGCCATTGGCAAGCGAATAGAGGATGA
AGAACTCATCGCCTACATGCTGCAAGGACTTGGTCCAGATTATGATCCTCTAGTCACAAGCATTACAACCAGAACAGATGTATACACTGTCAGCGACGTGTATGCTCACA
TGCTGAGCTATGAGATGCGGCACTTGCGTAAGGGTACATTTGAGCAACTTTCATCTGCTAACAATGTCAATAGGATATCCATTCGTGGAGGTGCCAATGGAGGTCGAGGT
AGTCGCGGTCGTAGTCGTCAGTTAAATAGTGGTCATGGACAATCAAGGCGTACTGTGAACAATCCTGGACGTCAACCATCAAAGACACAAAGCAGCTCAGGCATTGTCTG
TCAGATTTGTGGTAAGCCCAATCACGATGCTTTGCAATGCTGGCACAGATTTGATCAGGCATATCAAGCCGAAAATAATCTCAAACAAGCAGCTTTGGCAACAAGTGGAT
ACACTAGTGACACAAACTGGTATGTTGACACTGGAGCCACAGATCATATCACCAATGACCTAGAGAGGCTTACCACCAGAGAACGCTACACTGGCACCGACCAAATTCAG
GTTGCAAATGGCGCAGGTTTGTCTATCTCTCATATTGGGAATTCATTAATTTCTGGTTCATCTCTTGTTCTGAAACATATCCTATATGTTCCTAAAATCAATAAGCACCT
AATTTCAGTACAAAGACTAGCATCTGATAATAATGCTGTTGTAGAATTTCACCCAAACTATTTTTTGGTTAAGGACCGAGTCACGAAGAAACTCCTGCTCCACGGTAGAT
GTAAGAATGGCCTATACGTTCTACCGCATAATTTCAGTCAAGCCTTGCTGACAGCCAAACTTTCGAAAGAACAATGGCACAGAAGGCTAGGGCACCCTGCATCTCCAATT
ACCATTAGAATTCTACAAGATAATAATTTAGCTATAGATACTAATATTCCCTCTTCCTCAATTTGTAATGCTTGTCAATTAGGGAAAGCACATCAATTGCCATTTGGTTC
TTCTCAGCATGTATCTACAGCACCCCTTCAATTAATTCACACTGATGTATGGGGTCCATCCATTGCGTCAGTAAATAATTCCAAATATTATGTTTCCTTTGTTGATGATT
TTAGTCGTTATGTTTGGATTTACTTTCTGAGATGCAAATCTGATGTTGAGTCTGTGTTCCTTCAATTTCAAAAACATGTTGAAACTATGCTAAATACCAAAATTCGCTCC
GTCCAATCAGATTGGGGGGGTGAATACCATCGGTTACACAATTATTTCAAATCCACAGGCATTGAACATCATATCTCCTGTCCTCACACACACCAGCAGAATGGGTTAGT
CGAAAGAAAACACAGACACATTGTAGAAACTGGCCTTGCTTTACTCGCTCAAGCCAACATGCCTCTATCCTACTGGGATGAAGCTTTCAACACAGCTTGCTTTCTTATAA
ATAGAATGCCCAGCCGAACCATACAACAAGACACACCACTTCATAAATTGTTTGGTAAAAGTCCAGACTACTCCATGCTTAGGGTGTTTGGCTGTGCTTGCTGGCCTAAT
TTAAGGCCTTACAACAACAAGAAACTGAGTTTCAGAACTACTAGATGTATATTCTTGGGTTATAGTTCTTCTCATAAGGGATATAAATGCTTAAATAGAAGTACAGGACG
TATTTACATCTCTAGAGACGTGGTTTTCGATGAAAATATTTTTCCTTTTGAAGAATCTAAGCCACCAAACAAAACCACAAATCCACATCATCCTGTTCTACTTCCAGCCT
TAGCCAAACTTGCTAATTTTTACACTGAAAATGCTCTTACAGATATTGAACCAGTTGTTAGTAATTCCCATATGAATGATGGTCAAACTGATAATATTGCTAGTGACAAC
TTGTCTGGTGTCAGCTTATCTTCTGCAGATAATACAAGAAGTTCAGAGGAAATTGCAGAATATGAAGCTGAGAGCAGTTCGATCAATGCTCAAAACCAAACTCATGAACA
TGTGTCTGATCAACCAACTGAAGCAGCTAGTCAACATCCAATGCGAACAAGGTTGAGAAATAACATTGTACAAGCTAAACAATTCACTGATGGAACTATCAGATATTCAG
AAACCTCAAGAAAATTCGCAAGCGCTGTAACTATCACAACTCCGATCATAGAGACTGCTACTGAACCTCGAAACCTGCAGGAAGCCATGCAACATCCAAGATGGAGAGGA
GCAATGAATGATGAGCTCTCAGCGCTAAAACGAAATGCCACTTGGGATCTAGTTCCACCCAAACCTGGAATAAATCTCATTGATAGTAAATGGGTGTATAAAGTGAAAAG
AAAAGCAGATGGGTCAGTTGAAAGATTAAAAGCAAGATTAGTTGCCAAAGGATTCAAGCAAAGATTTGGTGTTGATTACACTGATACTTTTAGCCCTGTGATCAAACCGT
CAACAATCAGGGTCATTCTTTCGCTAGCAGTAACCAAGGGCTGGAATATGAGACAAGTTGATATCCAAAATGCATTTTTGCATGGAATTCTGAAAGAGGAAGTGTACATG
CGACAACCACCAGGATTTCAAGACTCAGCCAAACCAAAGAATTACATATGCAAGCTCAAGAAAGCCCTTTATGGCCTGAAACAAGCCCCAAAAGCTTGGCATTCAAGGTT
GACTGGAAAACTTATTGAGTTAGGCTTCAAGGCTTCAGTAGCTGATTCATCTCTTTTTATTCTCAAAAACAGAGAGATAACTATCTATATGCTCATCTATGTTGATGATA
TAATTATTGTGAGCTCCTCTGATCAAGCAACCGAAAGGTTGATTCAGAAATTGAAAATAGATTTTGCAGTAAAAGATTTGGGTGGTCTTGAGTATTTTCTGGGTATTGAA
GTCAAGAAAACACGAGATGGTATCATACTGTCACAGAGACGATATGCCTTAGATTTGTTGAAAAGAGTAAACATGGAAAAATGCAAACCTATGTCTACACCAATGGGTTC
TGCTGAAAAATTATTCAGAGAACAAGGAATACCCTTATCAGCTGAAGAACAATTCAAATACAGAAGTACAGTGGGAGCACTACAATATTTGACAATGACTAGGCCTGATT
TGGCATTTGCTGTCAATAAAGTGTGTCAATATCTTCATACACCTACTGATGCTCATTGGGGTGCTGTGAAGAGAATTCTTCGTTATGTTAAAGGCACACTAGCATTAGGA
GTGAAAATTCAGAAATCAACCATGATGTTGTCGGGGTTTTCTGATGCTGATTGGGCTGGTTGTCCCGATGATCGACGTTCAACTAGCGGCTTTGCTGTATTTCTTGGAGC
AAATCTAATCTCATGGAGTTCCAGAAAACAGGCTACAGTGTCAAGATCAAGCACCGAAGCAGAATACAAGGCCATTGCGAATCTTACTGCAGAAATGATTTGGATCAAGT
CATTACTGAAGGAACTGGGCGTGTATCAATCAAAGGCTCCTCGCCTCTGGTGTGACAACCTCGGAGCTACATATTTAACTTCAAATCCAGTATTTCATGCTAGAACGAAA
CATATTGAAGTTGATTTTCATTTTGTTCGAGAACAAGTAGCACGTAAAGCAATGGAAGTTCGGTTCATTTCATCAAGTGATCAAGTAGCTGATATCCTGACAAAACCACT
GTCTAAAACTCCTTTTACTACACATTGTAACAATCTCAACATGTACAAGACTTGTTGGGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCAATCCCAAACAACACAATGTCTTCTCCGTCGATCAGTCAGGTGATTAGTGTTAAGCTTACACAAGAAAATTATCTACTGTGGTCTACCCAAATCCTTCCCTACTT
GCGTAGCCAAAACCTTGTTGGTTTTGTGGATGGATCCATGCCTGCACCAAGCCAGACGATCGCCGTTGAACCAAGTGAAGAAACAGGGAATCGCAAAATTATCATCAACC
CTGAGTTCACAGTCTGGTACCCCCAGGACCAGCTGGTACTCAGCCTCATCAACTCATCAGTCACTGAGGAGGTTCTCAGCACGATGGTTGGAATCACCACTGCACGAGAA
GCCTGGATTACGCTGGAGCGACAATTTGCTTCGACAATTGCTGACTACTTTCGTAAAGTAAAACATCTTGGTGATACACTTGCTGCCATTGGCAAGCGAATAGAGGATGA
AGAACTCATCGCCTACATGCTGCAAGGACTTGGTCCAGATTATGATCCTCTAGTCACAAGCATTACAACCAGAACAGATGTATACACTGTCAGCGACGTGTATGCTCACA
TGCTGAGCTATGAGATGCGGCACTTGCGTAAGGGTACATTTGAGCAACTTTCATCTGCTAACAATGTCAATAGGATATCCATTCGTGGAGGTGCCAATGGAGGTCGAGGT
AGTCGCGGTCGTAGTCGTCAGTTAAATAGTGGTCATGGACAATCAAGGCGTACTGTGAACAATCCTGGACGTCAACCATCAAAGACACAAAGCAGCTCAGGCATTGTCTG
TCAGATTTGTGGTAAGCCCAATCACGATGCTTTGCAATGCTGGCACAGATTTGATCAGGCATATCAAGCCGAAAATAATCTCAAACAAGCAGCTTTGGCAACAAGTGGAT
ACACTAGTGACACAAACTGGTATGTTGACACTGGAGCCACAGATCATATCACCAATGACCTAGAGAGGCTTACCACCAGAGAACGCTACACTGGCACCGACCAAATTCAG
GTTGCAAATGGCGCAGGTTTGTCTATCTCTCATATTGGGAATTCATTAATTTCTGGTTCATCTCTTGTTCTGAAACATATCCTATATGTTCCTAAAATCAATAAGCACCT
AATTTCAGTACAAAGACTAGCATCTGATAATAATGCTGTTGTAGAATTTCACCCAAACTATTTTTTGGTTAAGGACCGAGTCACGAAGAAACTCCTGCTCCACGGTAGAT
GTAAGAATGGCCTATACGTTCTACCGCATAATTTCAGTCAAGCCTTGCTGACAGCCAAACTTTCGAAAGAACAATGGCACAGAAGGCTAGGGCACCCTGCATCTCCAATT
ACCATTAGAATTCTACAAGATAATAATTTAGCTATAGATACTAATATTCCCTCTTCCTCAATTTGTAATGCTTGTCAATTAGGGAAAGCACATCAATTGCCATTTGGTTC
TTCTCAGCATGTATCTACAGCACCCCTTCAATTAATTCACACTGATGTATGGGGTCCATCCATTGCGTCAGTAAATAATTCCAAATATTATGTTTCCTTTGTTGATGATT
TTAGTCGTTATGTTTGGATTTACTTTCTGAGATGCAAATCTGATGTTGAGTCTGTGTTCCTTCAATTTCAAAAACATGTTGAAACTATGCTAAATACCAAAATTCGCTCC
GTCCAATCAGATTGGGGGGGTGAATACCATCGGTTACACAATTATTTCAAATCCACAGGCATTGAACATCATATCTCCTGTCCTCACACACACCAGCAGAATGGGTTAGT
CGAAAGAAAACACAGACACATTGTAGAAACTGGCCTTGCTTTACTCGCTCAAGCCAACATGCCTCTATCCTACTGGGATGAAGCTTTCAACACAGCTTGCTTTCTTATAA
ATAGAATGCCCAGCCGAACCATACAACAAGACACACCACTTCATAAATTGTTTGGTAAAAGTCCAGACTACTCCATGCTTAGGGTGTTTGGCTGTGCTTGCTGGCCTAAT
TTAAGGCCTTACAACAACAAGAAACTGAGTTTCAGAACTACTAGATGTATATTCTTGGGTTATAGTTCTTCTCATAAGGGATATAAATGCTTAAATAGAAGTACAGGACG
TATTTACATCTCTAGAGACGTGGTTTTCGATGAAAATATTTTTCCTTTTGAAGAATCTAAGCCACCAAACAAAACCACAAATCCACATCATCCTGTTCTACTTCCAGCCT
TAGCCAAACTTGCTAATTTTTACACTGAAAATGCTCTTACAGATATTGAACCAGTTGTTAGTAATTCCCATATGAATGATGGTCAAACTGATAATATTGCTAGTGACAAC
TTGTCTGGTGTCAGCTTATCTTCTGCAGATAATACAAGAAGTTCAGAGGAAATTGCAGAATATGAAGCTGAGAGCAGTTCGATCAATGCTCAAAACCAAACTCATGAACA
TGTGTCTGATCAACCAACTGAAGCAGCTAGTCAACATCCAATGCGAACAAGGTTGAGAAATAACATTGTACAAGCTAAACAATTCACTGATGGAACTATCAGATATTCAG
AAACCTCAAGAAAATTCGCAAGCGCTGTAACTATCACAACTCCGATCATAGAGACTGCTACTGAACCTCGAAACCTGCAGGAAGCCATGCAACATCCAAGATGGAGAGGA
GCAATGAATGATGAGCTCTCAGCGCTAAAACGAAATGCCACTTGGGATCTAGTTCCACCCAAACCTGGAATAAATCTCATTGATAGTAAATGGGTGTATAAAGTGAAAAG
AAAAGCAGATGGGTCAGTTGAAAGATTAAAAGCAAGATTAGTTGCCAAAGGATTCAAGCAAAGATTTGGTGTTGATTACACTGATACTTTTAGCCCTGTGATCAAACCGT
CAACAATCAGGGTCATTCTTTCGCTAGCAGTAACCAAGGGCTGGAATATGAGACAAGTTGATATCCAAAATGCATTTTTGCATGGAATTCTGAAAGAGGAAGTGTACATG
CGACAACCACCAGGATTTCAAGACTCAGCCAAACCAAAGAATTACATATGCAAGCTCAAGAAAGCCCTTTATGGCCTGAAACAAGCCCCAAAAGCTTGGCATTCAAGGTT
GACTGGAAAACTTATTGAGTTAGGCTTCAAGGCTTCAGTAGCTGATTCATCTCTTTTTATTCTCAAAAACAGAGAGATAACTATCTATATGCTCATCTATGTTGATGATA
TAATTATTGTGAGCTCCTCTGATCAAGCAACCGAAAGGTTGATTCAGAAATTGAAAATAGATTTTGCAGTAAAAGATTTGGGTGGTCTTGAGTATTTTCTGGGTATTGAA
GTCAAGAAAACACGAGATGGTATCATACTGTCACAGAGACGATATGCCTTAGATTTGTTGAAAAGAGTAAACATGGAAAAATGCAAACCTATGTCTACACCAATGGGTTC
TGCTGAAAAATTATTCAGAGAACAAGGAATACCCTTATCAGCTGAAGAACAATTCAAATACAGAAGTACAGTGGGAGCACTACAATATTTGACAATGACTAGGCCTGATT
TGGCATTTGCTGTCAATAAAGTGTGTCAATATCTTCATACACCTACTGATGCTCATTGGGGTGCTGTGAAGAGAATTCTTCGTTATGTTAAAGGCACACTAGCATTAGGA
GTGAAAATTCAGAAATCAACCATGATGTTGTCGGGGTTTTCTGATGCTGATTGGGCTGGTTGTCCCGATGATCGACGTTCAACTAGCGGCTTTGCTGTATTTCTTGGAGC
AAATCTAATCTCATGGAGTTCCAGAAAACAGGCTACAGTGTCAAGATCAAGCACCGAAGCAGAATACAAGGCCATTGCGAATCTTACTGCAGAAATGATTTGGATCAAGT
CATTACTGAAGGAACTGGGCGTGTATCAATCAAAGGCTCCTCGCCTCTGGTGTGACAACCTCGGAGCTACATATTTAACTTCAAATCCAGTATTTCATGCTAGAACGAAA
CATATTGAAGTTGATTTTCATTTTGTTCGAGAACAAGTAGCACGTAAAGCAATGGAAGTTCGGTTCATTTCATCAAGTGATCAAGTAGCTGATATCCTGACAAAACCACT
GTCTAAAACTCCTTTTACTACACATTGTAACAATCTCAACATGTACAAGACTTGTTGGGATTGA
Protein sequenceShow/hide protein sequence
MSIPNNTMSSPSISQVISVKLTQENYLLWSTQILPYLRSQNLVGFVDGSMPAPSQTIAVEPSEETGNRKIIINPEFTVWYPQDQLVLSLINSSVTEEVLSTMVGITTARE
AWITLERQFASTIADYFRKVKHLGDTLAAIGKRIEDEELIAYMLQGLGPDYDPLVTSITTRTDVYTVSDVYAHMLSYEMRHLRKGTFEQLSSANNVNRISIRGGANGGRG
SRGRSRQLNSGHGQSRRTVNNPGRQPSKTQSSSGIVCQICGKPNHDALQCWHRFDQAYQAENNLKQAALATSGYTSDTNWYVDTGATDHITNDLERLTTRERYTGTDQIQ
VANGAGLSISHIGNSLISGSSLVLKHILYVPKINKHLISVQRLASDNNAVVEFHPNYFLVKDRVTKKLLLHGRCKNGLYVLPHNFSQALLTAKLSKEQWHRRLGHPASPI
TIRILQDNNLAIDTNIPSSSICNACQLGKAHQLPFGSSQHVSTAPLQLIHTDVWGPSIASVNNSKYYVSFVDDFSRYVWIYFLRCKSDVESVFLQFQKHVETMLNTKIRS
VQSDWGGEYHRLHNYFKSTGIEHHISCPHTHQQNGLVERKHRHIVETGLALLAQANMPLSYWDEAFNTACFLINRMPSRTIQQDTPLHKLFGKSPDYSMLRVFGCACWPN
LRPYNNKKLSFRTTRCIFLGYSSSHKGYKCLNRSTGRIYISRDVVFDENIFPFEESKPPNKTTNPHHPVLLPALAKLANFYTENALTDIEPVVSNSHMNDGQTDNIASDN
LSGVSLSSADNTRSSEEIAEYEAESSSINAQNQTHEHVSDQPTEAASQHPMRTRLRNNIVQAKQFTDGTIRYSETSRKFASAVTITTPIIETATEPRNLQEAMQHPRWRG
AMNDELSALKRNATWDLVPPKPGINLIDSKWVYKVKRKADGSVERLKARLVAKGFKQRFGVDYTDTFSPVIKPSTIRVILSLAVTKGWNMRQVDIQNAFLHGILKEEVYM
RQPPGFQDSAKPKNYICKLKKALYGLKQAPKAWHSRLTGKLIELGFKASVADSSLFILKNREITIYMLIYVDDIIIVSSSDQATERLIQKLKIDFAVKDLGGLEYFLGIE
VKKTRDGIILSQRRYALDLLKRVNMEKCKPMSTPMGSAEKLFREQGIPLSAEEQFKYRSTVGALQYLTMTRPDLAFAVNKVCQYLHTPTDAHWGAVKRILRYVKGTLALG
VKIQKSTMMLSGFSDADWAGCPDDRRSTSGFAVFLGANLISWSSRKQATVSRSSTEAEYKAIANLTAEMIWIKSLLKELGVYQSKAPRLWCDNLGATYLTSNPVFHARTK
HIEVDFHFVREQVARKAMEVRFISSSDQVADILTKPLSKTPFTTHCNNLNMYKTCWD