; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025847 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025847
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr10:22022263..22033172
RNA-Seq ExpressionLag0025847
SyntenyLag0025847
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBG97282.1 hypothetical protein Prudu_006352 [Prunus dulcis]4.0e-20440.22Show/hide
Query:  NYSFQGRHPPTKLSAMAATASPVTSSVTRSPNALPQDSQVWLSDTCCNAHLTSDLANLNISNTYHGDENITVGNGHSLPISHYDSSQISLYDRTFTLSNL
        NY++QG  PP+ L+AMAA AS        SP+A      VW++D+  + H+   +  ++        EN+ VGNG  L I+H   S I     + TLS +
Subjt:  NYSFQGRHPPTKLSAMAATASPVTSSVTRSPNALPQDSQVWLSDTCCNAHLTSDLANLNISNTYHGDENITVGNGHSLPISHYDSSQISLYDRTFTLSNL

Query:  LRVPHISTNLLSVHQFCIDNNCCFIFYASSFTIQDKQTGQTLFHGPGVNDLYPVSALPISKAQVG------------------LTANVGTKASFSVWHDR
        L VP ++ NLLSV+Q C DNNC  IF  S F IQDK T + L  G   + LYPV   P S + +G                   +A +G K   S+WH R
Subjt:  LRVPHISTNLLSVHQFCIDNNCCFIFYASSFTIQDKQTGQTLFHGPGVNDLYPVSALPISKAQVG------------------LTANVGTKASFSVWHDR

Query:  LGHPSTPILRSLLNSSDISAANFS-SHICRHSLDGKMSKLPFSLSTTSTTVPLEILQSDVWGLAPELSINNKKYYVSFIDDYSCYTWLFPLTCKSEVYTT
        LGHP+  +++ +L ++ I   + S S +C   LDGKM +LPFS      + P   L SDVWG +P  SI+  +Y VSFID+Y+ + WL+PL  KSEV+T 
Subjt:  LGHPSTPILRSLLNSSDISAANFS-SHICRHSLDGKMSKLPFSLSTTSTTVPLEILQSDVWGLAPELSINNKKYYVSFIDDYSCYTWLFPLTCKSEVYTT

Query:  FINFLPFAENLLSSKLKTFRSDGGEEFLSRNLQTFLSNRGILHQKSYPYTREQNGVGERKHRHLVDMAISLMSKSSVLMKFWPFAFATVVFLINRLPSPT
        F   + F     ++ +K  +SDGG E++S     FL++ GI+HQ S P T +QNG+ ERK+RHL++ +I+L+ ++S+  +FW  A A   +LINR+PS  
Subjt:  FINFLPFAENLLSSKLKTFRSDGGEEFLSRNLQTFLSNRGILHQKSYPYTREQNGVGERKHRHLVDMAISLMSKSSVLMKFWPFAFATVVFLINRLPSPT

Query:  LGNKSP-CELFHKPPEYSSLRTFGYTCYPLLRPYNCTKLQPRTTQCVFLGYPLGYKGYMCYDMSTTKFYISRHVYFDELNFP----FSVPSPPSQPSP-L
        L N+SP   L H+ P+   LR FG   YP LR  N TKLQPRT  CVF+GY LGYKG +CY+ ST+KF +SRHV  DE  FP    FS+PS P  PS  L
Subjt:  LGNKSP-CELFHKPPEYSSLRTFGYTCYPLLRPYNCTKLQPRTTQCVFLGYPLGYKGYMCYDMSTTKFYISRHVYFDELNFP----FSVPSPPSQPSP-L

Query:  SSMP------SFSPDLALSFATCDLPSFYVPP--------------THPCSSSSLLPQPSRQASNISSSPLNQTQLSSSLPVPEQQDPSS--SVHTSVLP
        S +P      + SP +  S ++   PS   P               T       LL  P    S      L++ QL   LP  +    SS  SV  SV  
Subjt:  SSMP------SFSPDLALSFATCDLPSFYVPP--------------THPCSSSSLLPQPSRQASNISSSPLNQTQLSSSLPVPEQQDPSS--SVHTSVLP

Query:  ASSNTQPS------SLSSVQPQQSPSD------LAPPTTDTQPSP----------------------------IAPMSDHTTVTN---LHPMQTRSKS--
         S+   PS      S + V  ++  SD      +   T+  +PS                             + P   +T +     ++ ++  S    
Subjt:  ASSNTQPS------SLSSVQPQQSPSD------LAPPTTDTQPSP----------------------------IAPMSDHTTVTN---LHPMQTRSKS--

Query:  ARYKARLVAKGFHQEEGVDYFETFSPVVKKPTVRIVLSIATTFGWNLRQLDVKNAFLHGDLEEEVYMYQPQGFVNLTYPSYVCKLNNSLYGLKQAPRAWF
        +RYKARLVA+GF QE G DY ETFSPVV+  TVRI+LS+A +  W+LRQLDVKNAFLHG+LEEEVYM QPQGF +  +P YVCKL  SLYGLKQAPRAW 
Subjt:  ARYKARLVAKGFHQEEGVDYFETFSPVVKKPTVRIVLSIATTFGWNLRQLDVKNAFLHGDLEEEVYMYQPQGFVNLTYPSYVCKLNNSLYGLKQAPRAWF

Query:  HCFTSQLSTYGFVASQADPSLFVRNVRGSITYLLLYVDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLTYFLGLEVKYQSTG-ILVSQTKYASDLLTRF
          FT  L   GF  S +DPSLFV+    +I  LLLYVDDIILTG++   +  ++ +L  VF++ D+G LTYFLGL++  +S G I VSQ KYA+DLL + 
Subjt:  HCFTSQLSTYGFVASQADPSLFVRNVRGSITYLLLYVDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLTYFLGLEVKYQSTG-ILVSQTKYASDLLTRF

Query:  GMTIAKTCATPCSSGPLDSPSLICSADDAKVYRSMVGALHYLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKFIRSS-LSLSAFS
        GM+  K C TP         +      D K YRS+VGAL YLTFTRPDI++ ++ V QFM+ PT  H  +VKRILRY++G+LS G  +  S  L LSA+S
Subjt:  GMTIAKTCATPCSSGPLDSPSLICSADDAKVYRSMVGALHYLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKFIRSS-LSLSAFS

Query:  DSDWTGN----------------------------------RLDRRALASATAELSWIQQVLKDLCIFVSTAPSLFCDNQSALQLARNPVFHGRTKHVEV
        DSDW G+                                    + RALA+A+A++ WI+Q+L DL +++S  P L+CDN SAL L+ NP++H R KH+++
Subjt:  DSDWTGN----------------------------------RLDRRALASATAELSWIQQVLKDLCIFVSTAPSLFCDNQSALQLARNPVFHGRTKHVEV

Query:  DFHFVREKATPKETRSGYFFNSQENKDFVST---NAKFSEEVHIKNHKPQSKVGLSKLTNE
        DFHF+RE+   ++    Y    ++  D ++    +  FS+        P SK+    L N+
Subjt:  DFHFVREKATPKETRSGYFFNSQENKDFVST---NAKFSEEVHIKNHKPQSKVGLSKLTNE

RVW73890.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]6.7e-21241.6Show/hide
Query:  MNYSFQGRHPPTKLSAMAATASPVTSSVTRSPNALPQDSQVWLSDTCCNAHLTSDLANLNISNTYHGDENITVGNGHSLPISHYDSSQISLYDRTFTLSN
        M+Y++QGRHPPT+L AM A ++              Q+ + W +D+  N H+T++L +L +   Y G EN+ VGNG  L I+H  ++     +    L  
Subjt:  MNYSFQGRHPPTKLSAMAATASPVTSSVTRSPNALPQDSQVWLSDTCCNAHLTSDLANLNISNTYHGDENITVGNGHSLPISHYDSSQISLYDRTFTLSN

Query:  LLRVPHISTNLLSVHQFCIDNNCCFIFYASSFTIQDKQTGQTLFHGPGVNDLYPVS--ALPISKAQVGLTANVGTKASFSVWHDRLGHPSTPILRSLLN-
        +L  P    NLLS++QFC+DNNC FI   + + ++D QTG TL  G     LYP+   ++ I+K+   L+A VG KAS SVWH RLGH S PI+  LLN 
Subjt:  LLRVPHISTNLLSVHQFCIDNNCCFIFYASSFTIQDKQTGQTLFHGPGVNDLYPVS--ALPISKAQVGLTANVGTKASFSVWHDRLGHPSTPILRSLLN-

Query:  -SSDISAANFSSHICRHSLDGKMSKLPFSLSTTSTTVPLEILQSDVWGLAPELSINNKKYYVSFIDDYSCYTWLFPLTCKSEVYTTFINFLPFAENLLSS
         S  +  +    H C     GK  +LPF  ST  ++ PL+++ +DVW  + + S++  KYYV FIDD+S ++WL+PL  K +VY  F+ F    EN  S 
Subjt:  -SSDISAANFSSHICRHSLDGKMSKLPFSLSTTSTTVPLEILQSDVWGLAPELSINNKKYYVSFIDDYSCYTWLFPLTCKSEVYTTFINFLPFAENLLSS

Query:  KLKTFRSDGGEEFLSRNLQTFLSNRGILHQKSYPYTREQNGVGERKHRHLVDMAISLMSKSSVLMKFWPFAFATVVFLINRLPSPTLGNKSP-CELFHKP
        K+K  +SDGG E++S   Q FLS+ GI H ++ P+T +QNGV ERKHRH+V+M +SL+++S +  KFW  AF T +FLINRLP+ TL N++P  +LF K 
Subjt:  KLKTFRSDGGEEFLSRNLQTFLSNRGILHQKSYPYTREQNGVGERKHRHLVDMAISLMSKSSVLMKFWPFAFATVVFLINRLPSPTLGNKSP-CELFHKP

Query:  PEYSSLRTFGYTCYPLLRPYNCTKLQPRTTQCVFLGYPLGYKGYMCYDMSTTKFYISRHVYFDELNFPFS--------VPSPPSQPSPLSSMPSFSPDLA
        P+Y+SL TFG  C+PLLRPY   KL  R+ +C+FLGY     GY C D  T K  ISRHV F+E  FP          +P   SQ +PL S PSFS  L+
Subjt:  PEYSSLRTFGYTCYPLLRPYNCTKLQPRTTQCVFLGYPLGYKGYMCYDMSTTKFYISRHVYFDELNFPFS--------VPSPPSQPSPLSSMPSFSPDLA

Query:  LSFATCDLPSFYVPPTHPCSSSSLLPQPSRQASNISSSPLNQTQLSSSLPVPEQQDPSSSVHTSVLPAS--SNTQPSSLSSVQ------PQQSPSDLAPP
         S      P     P  P  S   LP P+  + +IS+ P   T  +  L +PE    S+  H  +   S   N +P + S  +      P Q+ S    P
Subjt:  LSFATCDLPSFYVPPTHPCSSSSLLPQPSRQASNISSSPLNQTQLSSSLPVPEQQDPSSSVHTSVLPAS--SNTQPSSLSSVQ------PQQSPSDLAPP

Query:  TTDTQ------------------------------PSPIAPMSDHTTVTNLHPMQTRSKSA----RYKARLVAKGFHQEEGVDYFETFSPVVKKPTVRIV
           T                               P P+    DH  V N    + +  S     R+KARLVA GF QEEG+D+ ETFSPV+K  TVR++
Subjt:  TTDTQ------------------------------PSPIAPMSDHTTVTNLHPMQTRSKSA----RYKARLVAKGFHQEEGVDYFETFSPVVKKPTVRIV

Query:  LSIATTFGWNLRQLDVKNAFLHGDLEEEVYMYQPQGFVNLTYPSYVCKLNNSLYGLKQAPRAWFHCFTSQLSTYGFVASQADPSLFVRNVRGSITYLLLY
        L+++  F W +RQLDV NAFLHG L E+VYM QP+GFVN  +P YVCKLN SLYGLKQAPRAWF   +  L  +GF++S  D SLFV +      ++L+Y
Subjt:  LSIATTFGWNLRQLDVKNAFLHGDLEEEVYMYQPQGFVNLTYPSYVCKLNNSLYGLKQAPRAWFHCFTSQLSTYGFVASQADPSLFVRNVRGSITYLLLY

Query:  VDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLTYFLGLEVKYQSTGILVSQTKYASDLLTRFGMTIAKTCATPCSSGPLDSPSL--ICSADDAKVYRSM
        VDDI++TG D   + SL+ +L+  F M DLG L YFLG++    S+G+ + Q+KY  DLL R  M  AK  ++PC++G   S  +    +A     YR  
Subjt:  VDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLTYFLGLEVKYQSTGILVSQTKYASDLLTRFGMTIAKTCATPCSSGPLDSPSL--ICSADDAKVYRSM

Query:  VGALHYLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKFIRSSLSLSAFSDSDWTGNRLDR-------------------------
        VGAL Y T TRPDI+F ++++ Q MH P  +H    KR+LRY++G++ +GL + +   +L AF DSDW GN  DR                         
Subjt:  VGALHYLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKFIRSSLSLSAFSDSDWTGNRLDR-------------------------

Query:  ---------RALASATAELSWIQQVLKDLCIFVSTAPSLFCDNQSALQLARNPVFHGRTKHVEVDFHFVREKATPKE
                 RALA  TAEL WI+ +LK+L I + TAP+++CDN  AL LA NPVFH RTKH+EVDFHF+REK   ++
Subjt:  ---------RALASATAELSWIQQVLKDLCIFVSTAPSLFCDNQSALQLARNPVFHGRTKHVEVDFHFVREKATPKE

TQD88914.1 hypothetical protein C1H46_025506 [Malus baccata]7.7e-20842.65Show/hide
Query:  LTSDLANLNISNTYHGDENITVGNGHSLPISHYDSSQISLYDRTFTLSNLLRVPHISTNLLSVHQFCIDNNCCFIFYASSFTIQDKQTGQTLFHGPGVND
        +TSDL+NL  +  Y   E +T  NG  L ISH   + +S +  T  L+++L VP +S +LLS++Q C DNNC  I    S  IQDK T + L+ G     
Subjt:  LTSDLANLNISNTYHGDENITVGNGHSLPISHYDSSQISLYDRTFTLSNLLRVPHISTNLLSVHQFCIDNNCCFIFYASSFTIQDKQTGQTLFHGPGVND

Query:  LYPVSALPISKAQVGLTANVGTKASFSVWHDRLGHPSTPILRSLLNSSDI-SAANFSSHICRHSLDGKMSKLPFSLSTTSTTVPLEILQSDVWGLAPELS
        +YP+  L   K    +   +G K + S+WH RLGHP+  +L+  L+ S I S+ N  S  C   L GK +KLPF +  + + VPLE++ +DVWG +P  S
Subjt:  LYPVSALPISKAQVGLTANVGTKASFSVWHDRLGHPSTPILRSLLNSSDI-SAANFSSHICRHSLDGKMSKLPFSLSTTSTTVPLEILQSDVWGLAPELS

Query:  INNKKYYVSFIDDYSCYTWLFPLTCKSEVYTTFINFLPFAENLLSSKLKTFRSDGGEEFLSRNLQTFLSNRGILHQKSYPYTREQNGVGERKHRHLVDMA
        I    YYVSFID+ + YTW+FP+  K+ V+  F+ F  + +N  S+ +K  +SDGG E++S   Q FL+ +GI+HQKS PYT EQNG+ ERK+RHLV+ A
Subjt:  INNKKYYVSFIDDYSCYTWLFPLTCKSEVYTTFINFLPFAENLLSSKLKTFRSDGGEEFLSRNLQTFLSNRGILHQKSYPYTREQNGVGERKHRHLVDMA

Query:  ISLMSKSSVLMKFWPFAFATVVFLINRLPSPTLGNKSPCELFHK-PPEYSSLRTFGYTCYPLLRPYNCTKLQPRTTQCVFLGYPLGYKGYMCYDMSTTKF
        ++L+ K+S+  KFW  A AT  +L+NRLP+  L   SP E+ +K PP    LR FG  CYP L+PY   KL P+TT C+FLGY   YKGY+CY +S  K 
Subjt:  ISLMSKSSVLMKFWPFAFATVVFLINRLPSPTLGNKSPCELFHK-PPEYSSLRTFGYTCYPLLRPYNCTKLQPRTTQCVFLGYPLGYKGYMCYDMSTTKF

Query:  YISRHVYFDELNFPFSVPSPPSQPSPLSSMPSFSPDLALSFATCDLPSFYVPPTHPCSSSSLLPQPSRQ--------ASNISSSPLNQTQLSSSLPVPEQ
         +SRHV FDE  FP +     S  SP  S  S    LA        P   +P  H  SS   + Q   Q          N +S   +   LS S P    
Subjt:  YISRHVYFDELNFPFSVPSPPSQPSPLSSMPSFSPDLALSFATCDLPSFYVPPTHPCSSSSLLPQPSRQ--------ASNISSSPLNQTQLSSSLPVPEQ

Query:  QDPSSSVHTSVLPASSNTQPSSLSS-----------------VQPQQSPSDLAPPTTDTQPS-------------PIAPMSDHTTVTNLHPMQTRSKS--
               H SV+  SSNT P    S                 V+  QS S  A  +   Q                + P+  H  +     +    ++  
Subjt:  QDPSSSVHTSVLPASSNTQPSSLSS-----------------VQPQQSPSDLAPPTTDTQPS-------------PIAPMSDHTTVTNLHPMQTRSKS--

Query:  ---ARYKARLVAKGFHQEEGVDYFETFSPVVKKPTVRIVLSIATTFGWNLRQLDVKNAFLHGDLEEEVYMYQPQGFVNLTYPSYVCKLNNSLYGLKQAPR
           ARYKARLVAKGF QE G+DY+ETFSPVVK  TVR++LS+A T GW L+QLDVKNAFLHG L+EEVYM QPQGF++  +P +VCKL  SLYGLKQAPR
Subjt:  ---ARYKARLVAKGFHQEEGVDYFETFSPVVKKPTVRIVLSIATTFGWNLRQLDVKNAFLHGDLEEEVYMYQPQGFVNLTYPSYVCKLNNSLYGLKQAPR

Query:  AWFHCFTSQLSTYGFVASQADPSLFVRNVRGSITYLLLYVDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLTYFLGLEVKYQSTGILVSQTKYASDLLT
        AW   F+S L + GF +S  DPSL+V+    S+  LLLYVDDIIL+G+D  +V S+++QL   FDM DLG L YFLGL+++YQ   + V Q+KY  DLL 
Subjt:  AWFHCFTSQLSTYGFVASQADPSLFVRNVRGSITYLLLYVDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLTYFLGLEVKYQSTGILVSQTKYASDLLT

Query:  RFGMTIAKTCATPC---------SSGPLDSPSLICSADDAKVYRSMVGALHYLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKFI
        +  M  +K C TPC          S P   P L         YRS++GAL YLTFTRPDI++ +++V QFMH+P   H   VKRILRY++GSL +GL F 
Subjt:  RFGMTIAKTCATPC---------SSGPLDSPSLICSADDAKVYRSMVGALHYLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKFI

Query:  RSSLSLSAFSDSDWTGNRLDR----------------------------------RALASATAELSWIQQVLKDLCIFVSTAPSLFCDNQSALQLARNPV
          +L + A++D+DW G+  DR                                  RA+A+ TAE+ WIQQ+L+DL +  ST P L CDN SA+ LA NP+
Subjt:  RSSLSLSAFSDSDWTGNRLDR----------------------------------RALASATAELSWIQQVLKDLCIFVSTAPSLFCDNQSALQLARNPV

Query:  FHGRTKHVEVDFHFVREKATPKETRSGYFFNSQE
         H + KH+E+D HFVRE+   + T    F NS E
Subjt:  FHGRTKHVEVDFHFVREKATPKETRSGYFFNSQE

TQD93593.1 hypothetical protein C1H46_020801 [Malus baccata]1.9e-20641.06Show/hide
Query:  HPPTKLSAM--AATASPV--TSSVTRSPNALPQDSQVWLSDTCCNAHLTSDLANLNISNTYHGDENITVGNGHSLPISHYDSSQISLYDRTFTLSNLLRV
        HPP    AM  +   SPV  T+  T SP+A PQ  + WL D     H+TSDL+N++++  Y   + +T  NG  L I+H   S + L   T  L ++L V
Subjt:  HPPTKLSAM--AATASPV--TSSVTRSPNALPQDSQVWLSDTCCNAHLTSDLANLNISNTYHGDENITVGNGHSLPISHYDSSQISLYDRTFTLSNLLRV

Query:  PHISTNLLSVHQFCIDNNCCFIFYASSFTIQDKQTGQTLFHGPGVNDLYPVSALPISKAQ-VGLTANVGTKASFSVWHDRLGHPSTPILRSLLNSSDISA
        P +S +LLS+HQ C DNNC  I   SS  IQDK T + L+ G   N +YP   LP+ K+  V   A +  + + ++WH RLGHP++ ++++ L+ +DI  
Subjt:  PHISTNLLSVHQFCIDNNCCFIFYASSFTIQDKQTGQTLFHGPGVNDLYPVSALPISKAQ-VGLTANVGTKASFSVWHDRLGHPSTPILRSLLNSSDISA

Query:  ANFSS-HICRHSLDGKMSKLPFSLSTTSTTVPLEILQSDVWGLAPELSINNKKYYVSFIDDYSCYTWLFPLTCKSEVYTTFINFLPFAENLLSSKLKTFR
            S + C+  L GK + LPF    + + +P E++ +DVWG +P LSI   +YYVSFID+ + YTW+FP+  K+ V+  F+ F  F  N  +  ++  +
Subjt:  ANFSS-HICRHSLDGKMSKLPFSLSTTSTTVPLEILQSDVWGLAPELSINNKKYYVSFIDDYSCYTWLFPLTCKSEVYTTFINFLPFAENLLSSKLKTFR

Query:  SDGGEEFLSRNLQTFLSNRGILHQKSYPYTREQNGVGERKHRHLVDMAISLMSKSSVLMKFWPFAFATVVFLINRLPSPTLGNKSPCE-LFHKPPEYSSL
        SDGG E++  + Q FL ++GILH KS PYT +QNG+ ERK+RH+ + AI+L+ ++ +  +FW  A AT V+LINR+P+  L  KSP E L+H  P+   L
Subjt:  SDGGEEFLSRNLQTFLSNRGILHQKSYPYTREQNGVGERKHRHLVDMAISLMSKSSVLMKFWPFAFATVVFLINRLPSPTLGNKSPCE-LFHKPPEYSSL

Query:  RTFGYTCYPLLRPYNCTKLQPRTTQCVFLGYPLGYKGYMCYDMSTTKFYISRHVYFDELNFPFSVPSPP-SQPSPLSSMPSFSPDLALSFATCDLPSFYV
        + FG  CYP L+PY   KL P+T++C+FLGY   YKG++C++    K  +SRHV FDE +FP  + +      S ++SM S  P            +F+ 
Subjt:  RTFGYTCYPLLRPYNCTKLQPRTTQCVFLGYPLGYKGYMCYDMSTTKFYISRHVYFDELNFPFSVPSPP-SQPSPLSSMPSFSPDLALSFATCDLPSFYV

Query:  PPTHPCSSSSLLPQP-SRQASNISSSPLNQTQLSSSLPVPEQQDPSSSVHT---SVLPASSNTQPSSLSSVQPQQSPSDLAPPTTDTQPSPIAPMSDHTT
           HP  +S  +PQ  SR +S+  S P+     SS+        PS  V++   S +P   +  P S S+    Q+ S+L P T  +Q S +  +S    
Subjt:  PPTHPCSSSSLLPQP-SRQASNISSSPLNQTQLSSSLPVPEQQDPSSSVHT---SVLPASSNTQPSSLSSVQPQQSPSDLAPPTTDTQPSPIAPMSDHTT

Query:  VTNLHPMQTRSKS-----------------------------------------------------------------------------ARYKARLVAK
          N HPMQTRSKS                                                                             ARYKARLVAK
Subjt:  VTNLHPMQTRSKS-----------------------------------------------------------------------------ARYKARLVAK

Query:  GFHQEEGVDYFETFSPVVKKPTVRIVLSIATTFGWNLRQLDVKNAFLHGDLEEEVYMYQPQGFVNLTYPSYVCKLNNSLYGLKQAPRAWFHCFTSQLSTY
        GF QE G+DY+ETFSPVVK  TVR++LS+A + GW L QLDVKNAFLHG L+EEVYM QPQGFV+  +P++VCKL  SLYGLKQAPRAW   FT  L T 
Subjt:  GFHQEEGVDYFETFSPVVKKPTVRIVLSIATTFGWNLRQLDVKNAFLHGDLEEEVYMYQPQGFVNLTYPSYVCKLNNSLYGLKQAPRAWFHCFTSQLSTY

Query:  GFVASQADPSLFVRNVRGSITYLLLYVDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLTYFLGLEVKYQSTGILVSQTKYASDLLTRFGMTIAKTCATP
        GF +S ADPSLFV+    SI  LLLYVDDIILTGN  A V S++ QL   FDM +LG L YFLGL+++Y+S+G+ V Q+KY +DLL +  M   K C TP
Subjt:  GFVASQADPSLFVRNVRGSITYLLLYVDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLTYFLGLEVKYQSTGILVSQTKYASDLLTRFGMTIAKTCATP

Query:  CSSGPLDSPSLICSADDAKVYRSMVGALHYLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKFIRSSLSLSAFSDSDWTGNRLDR-
        C            S  D   YRS+VGAL YLTFTRPDI++ +++V QFMHAP   H   VKRILRY+RG++ +G+ F   SL + A++D+DW G+  DR 
Subjt:  CSSGPLDSPSLICSADDAKVYRSMVGALHYLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKFIRSSLSLSAFSDSDWTGNRLDR-

Query:  ---------------------------------RALASATAELSWIQQVLKDLCIFVSTAPSLFCDNQSALQLARNPVFHGRTKHVEVDFHFVREK
                                         RA+A+ TAE+ W+QQ+L+DL I   + P L CDN  A+ LA NP+ H + KH+E+D HFVRE+
Subjt:  ---------------------------------RALASATAELSWIQQVLKDLCIFVSTAPSLFCDNQSALQLARNPVFHGRTKHVEVDFHFVREK

TQE01264.1 hypothetical protein C1H46_013171 [Malus baccata]2.1e-21340.48Show/hide
Query:  NYSFQGRHPPTKLSAMAATASPVTSSVTRSPNALPQDSQVWLSDTCCNAHLTSDLANLNISNTYHGDENITVGNGHSLPISHYDSSQISLYDRTFTLSNL
        N+++QGR PP+ LSAM  T SP          + PQ+ Q W++DT   +H+TSDL+NLN++  + G + +T  +G  LPISH  S+ +      F L N+
Subjt:  NYSFQGRHPPTKLSAMAATASPVTSSVTRSPNALPQDSQVWLSDTCCNAHLTSDLANLNISNTYHGDENITVGNGHSLPISHYDSSQISLYDRTFTLSNL

Query:  LRVPHISTNLLSVHQFCIDNNCCFIFYASSFTIQDKQTGQTLFHGPGVNDLYP----------VSALPISKAQVGLTANVGTKASFSVWHDRLGHPSTPI
        L VP +S +LLS++Q C DN C FI     F IQDK TG+ +  G     LYP          + A   + +    T  +G++   ++WH RLGHPS  +
Subjt:  LRVPHISTNLLSVHQFCIDNNCCFIFYASSFTIQDKQTGQTLFHGPGVNDLYP----------VSALPISKAQVGLTANVGTKASFSVWHDRLGHPSTPI

Query:  LRSLLNSSDI-SAANFSSHICRHSLDGKMSKLPFSLSTTSTTVPLEILQSDVWGLAPELSINNKKYYVSFIDDYSCYTWLFPLTCKSEVYTTFINFLPFA
          ++L  S I  + +  S IC   L+GK +KLPF      T  PLE++ SDVWG +  +SI   K+YVSF+D+ + +TW+FPL  KSEV+  F++F  F 
Subjt:  LRSLLNSSDI-SAANFSSHICRHSLDGKMSKLPFSLSTTSTTVPLEILQSDVWGLAPELSINNKKYYVSFIDDYSCYTWLFPLTCKSEVYTTFINFLPFA

Query:  ENLLSSKLKTFRSDGGEEFLSRNLQTFLSNRGILHQKSYPYTREQNGVGERKHRHLVDMAISLMSKSSVLMKFWPFAFATVVFLINRLPSPTLGNKSPCE
            S+ +K F+SDGG E+ S   + +L  +GILHQKS PYT +QNG+ ERKHRH+++ AI+L+  +S+  K W  A A  V+LINR+   TL   SP +
Subjt:  ENLLSSKLKTFRSDGGEEFLSRNLQTFLSNRGILHQKSYPYTREQNGVGERKHRHLVDMAISLMSKSSVLMKFWPFAFATVVFLINRLPSPTLGNKSPCE

Query:  -LFHKPPEYSSLRTFGYTCYPLLRPYNCTKLQPRTTQCVFLGYPLGYKGYMCYDMSTTKFYISRHVYFDELNFPFSVPSPPSQPSPLSSMPSFSPDLALS
         LF   P  S L+ FG  C+PLL+  N +KLQP+T+QC+F+GY   YKGY+C +  T K Y+SRHV FDE  FP+S           S + S S    +S
Subjt:  -LFHKPPEYSSLRTFGYTCYPLLRPYNCTKLQPRTTQCVFLGYPLGYKGYMCYDMSTTKFYISRHVYFDELNFPFSVPSPPSQPSPLSSMPSFSPDLALS

Query:  FATCDLPSFYVPPTHPCSSSSLLPQPSRQASNISSSPLNQTQLSSSLPVPEQQ---DPSSSVHTSVLPASS---------NTQPSSLSSVQP--------
          +  +    +P      ++ + P P + +    S+P N ++ S S P+P  +    P+S    + LPA S         + QP  LS V P        
Subjt:  FATCDLPSFYVPPTHPCSSSSLLPQPSRQASNISSSPLNQTQLSSSLPVPEQQ---DPSSSVHTSVLPASS---------NTQPSSLSSVQP--------

Query:  ------------------QQSPSDLAPPTTDTQ----PSPIAPMSDHTTVT------NLHPMQT-----------RSKS------ARYKARLVAKGFHQE
                           QS   ++ P T       P  +A M D  T        +L P+ +           R K+      ARYKARLVAKG+ QE
Subjt:  ------------------QQSPSDLAPPTTDTQ----PSPIAPMSDHTTVT------NLHPMQT-----------RSKS------ARYKARLVAKGFHQE

Query:  EGVDYFETFSPVVKKPTVRIVLSIATTFGWNLRQLDVKNAFLHGDLEEEVYMYQPQGFVNLTYPS-YVCKLNNSLYGLKQAPRAWFHCFTSQLSTYGFVA
        EGVDY ETFSPVVK  TVR++L++A  F W+LRQLDVKNAFLHGDL EEVYM QPQGF +  +PS YVC+L+ SLYGLKQAPRAW   FTS L   GF A
Subjt:  EGVDYFETFSPVVKKPTVRIVLSIATTFGWNLRQLDVKNAFLHGDLEEEVYMYQPQGFVNLTYPS-YVCKLNNSLYGLKQAPRAWFHCFTSQLSTYGFVA

Query:  SQADPSLFVRNVRGSITYLLLYVDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLTYFLGLEVKYQSTGILVSQTKYASDLLTRFGMTIAKTCATPCSSG
        S ADPSLFV++       LLLYVDDIILTG+    +D ++  L   FD+ DLG L YFLGL++ YQ  G+ VSQTKY  DLL +  +  +K C TPC   
Subjt:  SQADPSLFVRNVRGSITYLLLYVDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLTYFLGLEVKYQSTGILVSQTKYASDLLTRFGMTIAKTCATPCSSG

Query:  PLDSPSLICSADDAKVYRSMVGALHYLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKFIRSSLSLSAFSDSDWTGNRLDR-----
           S +          YRS+VGAL YLTFTRPDI+F +++  QFMH P   H+  VK ILRY+ G+L  GL+F    L L A+SD+DW G+  DR     
Subjt:  PLDSPSLICSADDAKVYRSMVGALHYLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKFIRSSLSLSAFSDSDWTGNRLDR-----

Query:  -----------------------------RALASATAELSWIQQVLKDLCIFVSTAPSLFCDNQSALQLARNPVFHGRTKHVEVDFHFVREKATPKETRS
                                     RALA A AEL+W++QVL DLC+ + TAP L+CDN S + L+ NPVFH R KH+E+D+HFVRE+    +   
Subjt:  -----------------------------RALASATAELSWIQQVLKDLCIFVSTAPSLFCDNQSALQLARNPVFHGRTKHVEVDFHFVREKATPKETRS

Query:  GYFFNSQENKDFVSTNAKFSEEVHIKNHKPQSKVGLSKLTNEATVTSTKVVDEIGC
         +  +    + F     K       ++H     +G SK   E      +++D   C
Subjt:  GYFFNSQENKDFVSTNAKFSEEVHIKNHKPQSKVGLSKLTNEATVTSTKVVDEIGC

TrEMBL top hitse value%identityAlignment
A0A2N9FMC6 Integrase catalytic domain-containing protein1.7e-24043.7Show/hide
Query:  MNYSFQGRHPPTKLSAMAATASPVTSSVTRSPNALPQDSQ--------------------------------VWLSDTCCNAHLTSDLANLNISNTYHGD
        M+YS+QGRHPP KL A+A++ + + S+  ++ +  P  +Q                                 W+SDT    H T DL NLN    Y G 
Subjt:  MNYSFQGRHPPTKLSAMAATASPVTSSVTRSPNALPQDSQ--------------------------------VWLSDTCCNAHLTSDLANLNISNTYHGD

Query:  ENITVGNGHSLPISHYDSSQISLYDRTFTLSNLLRVPHISTNLLSVHQFCIDNNCCFIFYASSFTIQDKQTGQTLFHGPGVNDLYPVSAL--------PI
        + +++GNG  LPI+H   SQ+      F L  +LRVP + TNLLSV++FC DN C F F A+ F+IQD  +G+TL+ G   + LYP+  L        P 
Subjt:  ENITVGNGHSLPISHYDSSQISLYDRTFTLSNLLRVPHISTNLLSVHQFCIDNNCCFIFYASSFTIQDKQTGQTLFHGPGVNDLYPVSAL--------PI

Query:  SKAQVGLTANVGTKASFSVWHDRLGHPSTPILRSLLNSS---DISAANFSSHICRHSLDGKMSKLPFSLSTTSTTVPLEILQSDVWGLAPELSINNKKYY
          +    +A +GTK + SVWH RLGHP   +L S+LN      ++ A FSS  C H + GK+ + PF  S+ + T PLE++ SDVWG AP  SIN  ++Y
Subjt:  SKAQVGLTANVGTKASFSVWHDRLGHPSTPILRSLLNSS---DISAANFSSHICRHSLDGKMSKLPFSLSTTSTTVPLEILQSDVWGLAPELSINNKKYY

Query:  VSFIDDYSCYTWLFPLTCKSEVYTTFINFLPFAENLLSSKLKTFRSDGGEEFLSRNLQTFLSNRGILHQKSYPYTREQNGVGERKHRHLVDMAISLMSKS
        VSF+D ++ +TWLFP+  KS+V  TF +F    EN+L++++K  R+D G E+ +   ++F S RGILHQ S P+T +QNGV ERKHRH+V+ A++L+S+S
Subjt:  VSFIDDYSCYTWLFPLTCKSEVYTTFINFLPFAENLLSSKLKTFRSDGGEEFLSRNLQTFLSNRGILHQKSYPYTREQNGVGERKHRHLVDMAISLMSKS

Query:  SVLMKFWPFAFATVVFLINRLPSPTLGNKSPCE-LFHKPPEYSSLRTFGYTCYPLLRPYNCTKLQPRTTQCVFLGYPLGYKGYMCYDMSTTKFYISRHVY
        S+ +++WP+AF+T ++LINR+P+P L   SP + LFH  P+YS L+TFG  C+PLLRPYN  KL+PR++ CVFLGY L  KGY+C ++ T K  ISRHV 
Subjt:  SVLMKFWPFAFATVVFLINRLPSPTLGNKSPCE-LFHKPPEYSSLRTFGYTCYPLLRPYNCTKLQPRTTQCVFLGYPLGYKGYMCYDMSTTKFYISRHVY

Query:  FDELNFPFSVPSPPSQPSPLSSMPSFSPDLALSFATCDLPSFYVPPTH--PCSSSSLLPQ--PSRQASNISSSPLNQTQLSS--SLPVPEQQDPSSSV--
        F E +FPF   + PS  S  S+    S    L F  C  PS   PP    P S S+ L    P   A     SPL  T  S   S PVP    PS  +  
Subjt:  FDELNFPFSVPSPPSQPSPLSSMPSFSPDLALSFATCDLPSFYVPPTH--PCSSSSLLPQ--PSRQASNISSSPLNQTQLSS--SLPVPEQQDPSSSV--

Query:  ----------------------HTSVLPASSNTQPS-SLSSVQPQQSPSDLAPPTTDTQP---SPIAPMSDHTTVTNLHPMQTRSKS----ARYKARLVA
                              HT  L       PS  ++S  P+   + L   T   +    S + P S+H  V      + + K     ARYKARLVA
Subjt:  ----------------------HTSVLPASSNTQPS-SLSSVQPQQSPSDLAPPTTDTQP---SPIAPMSDHTTVTNLHPMQTRSKS----ARYKARLVA

Query:  KGFHQEEGVDYFETFSPVVKKPTVRIVLSIATTFGWNLRQLDVKNAFLHGDLEEEVYMYQPQGFVNLTYPSYVCKLNNSLYGLKQAPRAWFHCFTSQLST
        KG+HQ+ G+DY ETFSPVVK  TVR++LSIA  F W+LRQLDV NAFLH  L+E+VYM QPQGFV+ + P +VCKL  SLYGLKQAPRAWF  FTSQL  
Subjt:  KGFHQEEGVDYFETFSPVVKKPTVRIVLSIATTFGWNLRQLDVKNAFLHGDLEEEVYMYQPQGFVNLTYPSYVCKLNNSLYGLKQAPRAWFHCFTSQLST

Query:  YGFVASQADPSLFVRNVRGSITYLLLYVDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLTYFLGLEVKYQSTGILVSQTKYASDLLTRFGMTIAKTCAT
         GF AS ADPSLF+     ++ +LL+YVDDII+TGN  + + SLV QL   F++ DLG LTYFLGLEV Y +TG  V Q KYASDLL ++ M   K C+T
Subjt:  YGFVASQADPSLFVRNVRGSITYLLLYVDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLTYFLGLEVKYQSTGILVSQTKYASDLLTRFGMTIAKTCAT

Query:  PCSSGPLDSPSLICSADDAKVYRSMVGALHYLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKFIRSSLSLSAFSDSDWTGN----
        PC +    +  +     DA  +RS+VGAL YLTFTRPD+++ ++ + QFMHAPT +HL+  KR+LRY+RGSL +GL F   SL L A+SD+DW G+    
Subjt:  PCSSGPLDSPSLICSADDAKVYRSMVGALHYLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKFIRSSLSLSAFSDSDWTGN----

Query:  ------------------------------RLDRRALASATAELSWIQQVLKDLCIFVSTAPSLFCDNQSALQLARNPVFHGRTKHVEVDFHFVREKATP
                                        + RALAS  AE+SW++ VL DL I + + P+L+CDN SAL LA NPVFH RTKH+EVD+HF+R++   
Subjt:  ------------------------------RLDRRALASATAELSWIQQVLKDLCIFVSTAPSLFCDNQSALQLARNPVFHGRTKHVEVDFHFVREKATP

Query:  KETRSGYFFNSQENKDFVS
        K+ +  +  +  +  D ++
Subjt:  KETRSGYFFNSQENKDFVS

A0A2N9GRJ0 Uncharacterized protein2.6e-24143.79Show/hide
Query:  MNYSFQGRHPPTKLSAMAATASPVTSSVTRSPNALPQDSQ--------------------------------VWLSDTCCNAHLTSDLANLNISNTYHGD
        M+YS+QGRHPP KL+A+A+  + + S+  ++ +  P  +Q                                 W+SDT    H T DL NLN    Y G 
Subjt:  MNYSFQGRHPPTKLSAMAATASPVTSSVTRSPNALPQDSQ--------------------------------VWLSDTCCNAHLTSDLANLNISNTYHGD

Query:  ENITVGNGHSLPISHYDSSQISLYDRTFTLSNLLRVPHISTNLLSVHQFCIDNNCCFIFYASSFTIQDKQTGQTLFHGPGVNDLYPVSAL--------PI
        + +++GNG  LPI+H   SQ+      F L  +LRVP + TNLLSV++FC DN C F F A+ F+IQD  +G+TL+ G   + LYP+  L        P 
Subjt:  ENITVGNGHSLPISHYDSSQISLYDRTFTLSNLLRVPHISTNLLSVHQFCIDNNCCFIFYASSFTIQDKQTGQTLFHGPGVNDLYPVSAL--------PI

Query:  SKAQVGLTANVGTKASFSVWHDRLGHPSTPILRSLLNSS---DISAANFSSHICRHSLDGKMSKLPFSLSTTSTTVPLEILQSDVWGLAPELSINNKKYY
          +    +A +GTK + SVWH RLGHP   +L S+LN      ++ A FSS  C H + GK+ + PF  S+ + T PLE++ SDVWG AP  SIN  ++Y
Subjt:  SKAQVGLTANVGTKASFSVWHDRLGHPSTPILRSLLNSS---DISAANFSSHICRHSLDGKMSKLPFSLSTTSTTVPLEILQSDVWGLAPELSINNKKYY

Query:  VSFIDDYSCYTWLFPLTCKSEVYTTFINFLPFAENLLSSKLKTFRSDGGEEFLSRNLQTFLSNRGILHQKSYPYTREQNGVGERKHRHLVDMAISLMSKS
        VSF+D ++ +TWLFP+  KS+V  TF +F    EN+L++++K  R+D G E+ +   ++F S RGILHQ S P+T +QNGV ERKHRH+V+ A++L+S+S
Subjt:  VSFIDDYSCYTWLFPLTCKSEVYTTFINFLPFAENLLSSKLKTFRSDGGEEFLSRNLQTFLSNRGILHQKSYPYTREQNGVGERKHRHLVDMAISLMSKS

Query:  SVLMKFWPFAFATVVFLINRLPSPTLGNKSPCE-LFHKPPEYSSLRTFGYTCYPLLRPYNCTKLQPRTTQCVFLGYPLGYKGYMCYDMSTTKFYISRHVY
        S+ +++WP+AF+T ++LINR+P+P L   SP + LFH  P+YS L+TFG  C+PLLRPYN  KL+PR++ CVFLGY L  KGY+C ++ T K  ISRHV 
Subjt:  SVLMKFWPFAFATVVFLINRLPSPTLGNKSPCE-LFHKPPEYSSLRTFGYTCYPLLRPYNCTKLQPRTTQCVFLGYPLGYKGYMCYDMSTTKFYISRHVY

Query:  FDELNFPFSVPSPPSQPSPLSSMPSFSPDLALSFATCDLPSFYVPPTH--PCSSSSLLPQ--PSRQASNISSSPLNQTQLSS--SLPVPEQQDPSSSV--
        F E +FPF   + PS  S  S+    S    L F  C  PS   PP    P S S+ L    P   A     SPL  T  S   S PVP    PS  +  
Subjt:  FDELNFPFSVPSPPSQPSPLSSMPSFSPDLALSFATCDLPSFYVPPTH--PCSSSSLLPQ--PSRQASNISSSPLNQTQLSS--SLPVPEQQDPSSSV--

Query:  ----------------------HTSVLPASSNTQPS-SLSSVQPQQSPSDLAPPTTDTQP---SPIAPMSDHTTVTNLHPMQTRSKS----ARYKARLVA
                              HT  L       PS  ++S  P+   + L   T   +    S + P S+H  V      + + K     ARYKARLVA
Subjt:  ----------------------HTSVLPASSNTQPS-SLSSVQPQQSPSDLAPPTTDTQP---SPIAPMSDHTTVTNLHPMQTRSKS----ARYKARLVA

Query:  KGFHQEEGVDYFETFSPVVKKPTVRIVLSIATTFGWNLRQLDVKNAFLHGDLEEEVYMYQPQGFVNLTYPSYVCKLNNSLYGLKQAPRAWFHCFTSQLST
        KG+HQ+ G+DY ETFSPVVK  TVR++LSIA  F W+LRQLDV NAFLHG L+E+VYM QPQGFV+ + P +VCKL  SLYGLKQAPRAWF  FTSQL  
Subjt:  KGFHQEEGVDYFETFSPVVKKPTVRIVLSIATTFGWNLRQLDVKNAFLHGDLEEEVYMYQPQGFVNLTYPSYVCKLNNSLYGLKQAPRAWFHCFTSQLST

Query:  YGFVASQADPSLFVRNVRGSITYLLLYVDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLTYFLGLEVKYQSTGILVSQTKYASDLLTRFGMTIAKTCAT
         GF AS ADPSLF+     ++ +LL+YVDDII+TGN  + + SLV QL   F++ DLG LTYFLGLEV Y +TG  V Q KYASDLL ++ M   K C+T
Subjt:  YGFVASQADPSLFVRNVRGSITYLLLYVDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLTYFLGLEVKYQSTGILVSQTKYASDLLTRFGMTIAKTCAT

Query:  PCSSGPLDSPSLICSADDAKVYRSMVGALHYLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKFIRSSLSLSAFSDSDWTGN----
        PC +    +  +     DA  +RS+VGAL YLTFTRPD+++ ++ + QFMHAPT +HL+  KR+LRY+RGSL +GL F   SL L A+SD+DW G+    
Subjt:  PCSSGPLDSPSLICSADDAKVYRSMVGALHYLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKFIRSSLSLSAFSDSDWTGN----

Query:  ------------------------------RLDRRALASATAELSWIQQVLKDLCIFVSTAPSLFCDNQSALQLARNPVFHGRTKHVEVDFHFVREKATP
                                        + RALAS  AE+SW++ VL DL I + + P+L+CDN SAL LA NPVFH RTKH+EVD+HF+R++   
Subjt:  ------------------------------RLDRRALASATAELSWIQQVLKDLCIFVSTAPSLFCDNQSALQLARNPVFHGRTKHVEVDFHFVREKATP

Query:  KETRSGYFFNSQENKDFVS
        K+ +  +  +  +  D ++
Subjt:  KETRSGYFFNSQENKDFVS

A0A2N9HUP1 Uncharacterized protein3.1e-23144.66Show/hide
Query:  MNYSFQGRHPPTKLSAMAATASPVTSSVTRSPNALPQDSQVWLSDTCCNAHLTSDLANLNISNTYHGDENITVGNGHSLPISHYDSSQISLYDRTFTLSN
        MN+++QGRHPP KL+A+A+T      +++ + +A       W+SDT    H T D+ ++   + Y G++++TVGNG SLPI+H  +SQ+      F L  
Subjt:  MNYSFQGRHPPTKLSAMAATASPVTSSVTRSPNALPQDSQVWLSDTCCNAHLTSDLANLNISNTYHGDENITVGNGHSLPISHYDSSQISLYDRTFTLSN

Query:  LLRVPHISTNLLSVHQFCIDNNCCFIFYASSFTIQDKQTGQTLFHGPGVNDLYPV--SALPISKAQVGLTANVGTKASFSVWHDRLGHPSTPILRSLLNS
        +L VP +S++LLSV++FC DN+  F F AS F I+D  +G+ L+ G     LYPV  + LP S +    +    +  S  +WH RLGHP + +   +LN 
Subjt:  LLRVPHISTNLLSVHQFCIDNNCCFIFYASSFTIQDKQTGQTLFHGPGVNDLYPV--SALPISKAQVGLTANVGTKASFSVWHDRLGHPSTPILRSLLNS

Query:  -SDISAANFSSHICRHSLDGKMSKLPFSLSTTSTTVPLEILQSDVWGLAPELSINNKKYYVSFIDDYSCYTWLFPLTCKSEVYTTFINFLPFAENLLSSK
           +++ +     C H ++GK  +LPF+ S + TT PLE++ +DVWG AP  S N  +YYVSFIDD++ +TW FPL  KS+V  +F +F    EN+L  K
Subjt:  -SDISAANFSSHICRHSLDGKMSKLPFSLSTTSTTVPLEILQSDVWGLAPELSINNKKYYVSFIDDYSCYTWLFPLTCKSEVYTTFINFLPFAENLLSSK

Query:  LKTFRSDGGEEFLSRNLQTFLSNRGILHQKSYPYTREQNGVGERKHRHLVDMAISLMSKSSVLMKFWPFAFATVVFLINRLPSPTLGNKSPCE-LFHKPP
        +K  RSD G E+     Q+F S+ GILHQ S P+T +QNGV ERKHRH+VDMA++L+S+SS+ +  WP+AF+T VFLINRLPS +    SP E LF   P
Subjt:  LKTFRSDGGEEFLSRNLQTFLSNRGILHQKSYPYTREQNGVGERKHRHLVDMAISLMSKSSVLMKFWPFAFATVVFLINRLPSPTLGNKSPCE-LFHKPP

Query:  EYSSLRTFGYTCYPLLRPYNCTKLQPRTTQCVFLGYPLGYKGYMCYDMSTTKFYISRHVYFDELNFPF-SVPSPPSQPSPLSSMPSFSPDLALS----FA
        +Y S R FG TCYPLLR Y+  KLQPR+  CVFLGY    KG++CYD S  +FY+SRHV FDE +FP+ ++PS PS  S  SS+ S +  + LS    F 
Subjt:  EYSSLRTFGYTCYPLLRPYNCTKLQPRTTQCVFLGYPLGYKGYMCYDMSTTKFYISRHVYFDELNFPF-SVPSPPSQPSPLSSMPSFSPDLALS----FA

Query:  TCDLPSFYVPP----THPCSSSSLLPQPSRQA---SNISS------SPLNQTQ---LSSSLP--VPEQQDPSSSVHTSVLPASS----------------
         C +PS   PP    + P  SSS++P  S  +   S++SS      +P+  TQ   L+  +P  VP     S+++H     A S                
Subjt:  TCDLPSFYVPP----THPCSSSSLLPQPSRQA---SNISS------SPLNQTQ---LSSSLP--VPEQQDPSSSVHTSVLPASS----------------

Query:  -------NTQPSSLS-SVQPQQSPSDLAPPTTDTQPS---PIAPMSDHTTVTNLH-----PMQTRSKSARYKARLVAKGFHQEEGVDYFETFSPVVKKPT
               NT+P++ + + +  Q  +D+A      Q      + P S    V            T    AR+KARLVAKG HQ+ G+D+ ETFSPVVK  T
Subjt:  -------NTQPSSLS-SVQPQQSPSDLAPPTTDTQPS---PIAPMSDHTTVTNLH-----PMQTRSKSARYKARLVAKGFHQEEGVDYFETFSPVVKKPT

Query:  VRIVLSIATTFGWNLRQLDVKNAFLHGDLEEEVYMYQPQGFVNLTYPSYVCKLNNSLYGLKQAPRAWFHCFTSQLSTYGFVASQADPSLFVRNVRGSITY
        VR+VLS+A  + W+LRQLDV NAFLHG L+E V+M QP GFV+  +PS+VC L  S+YGL+QAPRAWF  F++ L T GF AS ADPSLFV     ++ Y
Subjt:  VRIVLSIATTFGWNLRQLDVKNAFLHGDLEEEVYMYQPQGFVNLTYPSYVCKLNNSLYGLKQAPRAWFHCFTSQLSTYGFVASQADPSLFVRNVRGSITY

Query:  LLLYVDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLTYFLGLEVKYQSTGILVSQTKYASDLLTRFGMTIAKTCATPC---SSGPLDSPSLICSADDAK
        LLLYVDDIILTG+  A +  L+  L   F++ DLG L YFLGL+V+Y ++G+LV QTKYA+DLL +  M+  K C+TP    S+  L   SL+    D  
Subjt:  LLLYVDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLTYFLGLEVKYQSTGILVSQTKYASDLLTRFGMTIAKTCATPC---SSGPLDSPSLICSADDAK

Query:  VYRSMVGALHYLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKFIRSSLSLSAFSDSDWTGNRLDR--------------------
         YRS+VGAL YLTFTRPD+SF ++ + Q MH PT  HL   KR+LRY+RG++S G+ F    L L+AF+DSDW GN +DR                    
Subjt:  VYRSMVGALHYLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKFIRSSLSLSAFSDSDWTGNRLDR--------------------

Query:  --------------RALASATAELSWIQQVLKDLCIFVSTAPSLFCDNQSALQLARNPVFHGRTKHVEVDFHFVREK
                      RALA   AEL+WI+ +L+DL IF+S AP L+CDN SA+ LA NPVFH RTKHVE+D+HFVREK
Subjt:  --------------RALASATAELSWIQQVLKDLCIFVSTAPSLFCDNQSALQLARNPVFHGRTKHVEVDFHFVREK

A0A2N9IE26 Uncharacterized protein2.8e-23244.66Show/hide
Query:  MNYSFQGRHPPTKLSAMAATASPVTSSVTRSPNALPQDSQVWLSDTCCNAHLTSDLANLNISNTYHGDENITVGNGHSLPISHYDSSQISLYDRTFTLSN
        MN+++QGRHPP KL+A+A+T      +++ + +A       W+SDT    H T D+ ++   + Y G++ +TVGNG SLPI+H  +SQ+      F L  
Subjt:  MNYSFQGRHPPTKLSAMAATASPVTSSVTRSPNALPQDSQVWLSDTCCNAHLTSDLANLNISNTYHGDENITVGNGHSLPISHYDSSQISLYDRTFTLSN

Query:  LLRVPHISTNLLSVHQFCIDNNCCFIFYASSFTIQDKQTGQTLFHGPGVNDLYPV--SALPISKAQVGLTANVGTKASFSVWHDRLGHPSTPILRSLLNS
        +L VP +S++LLSV++FC DN+  F F AS F I+  ++G+ L+ G     LYPV  + LP S +    +    +  S  +WH RLGHP + +   +LN 
Subjt:  LLRVPHISTNLLSVHQFCIDNNCCFIFYASSFTIQDKQTGQTLFHGPGVNDLYPV--SALPISKAQVGLTANVGTKASFSVWHDRLGHPSTPILRSLLNS

Query:  -SDISAANFSSHICRHSLDGKMSKLPFSLSTTSTTVPLEILQSDVWGLAPELSINNKKYYVSFIDDYSCYTWLFPLTCKSEVYTTFINFLPFAENLLSSK
           +++ +     C H ++GK  +LPF+ S + TT PLE++ +DVWG AP  S N  +YYVSFIDD++ +TW FPL  KS+V  +F +F    EN+L  K
Subjt:  -SDISAANFSSHICRHSLDGKMSKLPFSLSTTSTTVPLEILQSDVWGLAPELSINNKKYYVSFIDDYSCYTWLFPLTCKSEVYTTFINFLPFAENLLSSK

Query:  LKTFRSDGGEEFLSRNLQTFLSNRGILHQKSYPYTREQNGVGERKHRHLVDMAISLMSKSSVLMKFWPFAFATVVFLINRLPSPTLGNKSPCE-LFHKPP
        +K  RSD G E+     Q+F S+ GILHQ S P+T +QNGV ERKHRH+VDMA++L+S+SS+ +  WP+AF+T VFLINRLPS +    SP E LF   P
Subjt:  LKTFRSDGGEEFLSRNLQTFLSNRGILHQKSYPYTREQNGVGERKHRHLVDMAISLMSKSSVLMKFWPFAFATVVFLINRLPSPTLGNKSPCE-LFHKPP

Query:  EYSSLRTFGYTCYPLLRPYNCTKLQPRTTQCVFLGYPLGYKGYMCYDMSTTKFYISRHVYFDELNFPF-SVPSPPSQPSPLSSMPSFSPDLALS----FA
        +Y S R FG TCYPLLR Y+  KLQPR+  CVFLGY    KG++CYD S  +FY+SRHV FDE +FP+ ++PSPPS  S  SS+ S +  + LS    F 
Subjt:  EYSSLRTFGYTCYPLLRPYNCTKLQPRTTQCVFLGYPLGYKGYMCYDMSTTKFYISRHVYFDELNFPF-SVPSPPSQPSPLSSMPSFSPDLALS----FA

Query:  TCDLPSFYVPP----THPCSSSSLLPQPSRQASNISSS---------PLNQTQLSSSLP-VPEQQDPSSSVHTSVLP-----------------ASSNTQ
         C +PS   PP    + P  SSS++P  S  +   SSS         P++ TQ +   P VP          T++ P                 A+  + 
Subjt:  TCDLPSFYVPP----THPCSSSSLLPQPSRQASNISSS---------PLNQTQLSSSLP-VPEQQDPSSSVHTSVLP-----------------ASSNTQ

Query:  PSSLSSVQPQQSPSDLA---PPTTDTQPSPIAPMSDHTTVT------NLHPMQTR----------SKSARYKARLVAKGFHQEEGVDYFETFSPVVKKPT
        P SL  +  + +   +A   P       S  A +    T T      + H +  R             AR+KARLVAKG HQ+ G+D+ ETFSPVVK  T
Subjt:  PSSLSSVQPQQSPSDLA---PPTTDTQPSPIAPMSDHTTVT------NLHPMQTR----------SKSARYKARLVAKGFHQEEGVDYFETFSPVVKKPT

Query:  VRIVLSIATTFGWNLRQLDVKNAFLHGDLEEEVYMYQPQGFVNLTYPSYVCKLNNSLYGLKQAPRAWFHCFTSQLSTYGFVASQADPSLFVRNVRGSITY
        VR+VLS+A  +GW+LRQLDV NAFLHG L+E V+M QP GFV+  +PS+VC L  S+YGL+QAPRAWF  F+S L T GF AS ADPSLFV     ++ Y
Subjt:  VRIVLSIATTFGWNLRQLDVKNAFLHGDLEEEVYMYQPQGFVNLTYPSYVCKLNNSLYGLKQAPRAWFHCFTSQLSTYGFVASQADPSLFVRNVRGSITY

Query:  LLLYVDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLTYFLGLEVKYQSTGILVSQTKYASDLLTRFGMTIAKTCATPC---SSGPLDSPSLICSADDAK
        LLLYVDDIILTG+  A +  L+  L   F++ DLG L YFLGL+V+Y  +G+LV QTKYA+DLL +  M+  K C+TP    S+  L   SL+    D  
Subjt:  LLLYVDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLTYFLGLEVKYQSTGILVSQTKYASDLLTRFGMTIAKTCATPC---SSGPLDSPSLICSADDAK

Query:  VYRSMVGALHYLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKFIRSSLSLSAFSDSDWTGNRLDR--------------------
         YRS+VGAL YLTFTRPD+SF ++ + Q MH PT  HL   KR+LRY+RG++S G+ F    L L+AF+DSDW GN +DR                    
Subjt:  VYRSMVGALHYLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKFIRSSLSLSAFSDSDWTGNRLDR--------------------

Query:  --------------RALASATAELSWIQQVLKDLCIFVSTAPSLFCDNQSALQLARNPVFHGRTKHVEVDFHFVREK
                      RALA   AEL+WI+ +L+DL IF+S AP L+CDN SA+ LA NPVFH RTKHVE+D+HFVREK
Subjt:  --------------RALASATAELSWIQQVLKDLCIFVSTAPSLFCDNQSALQLARNPVFHGRTKHVEVDFHFVREK

A0A2N9IEP2 Uncharacterized protein9.4e-23643.81Show/hide
Query:  MNYSFQGRHPPTKLSAMAATASPVTSSVTRSPNALPQDSQVWLSDTCCNAHLTSDLANLNISNTYHGDENITVGNGHSLPISHYDSSQISLYDRTFTLSN
        M+YS+QG+ PP+KL+AMAAT+           N+   D   W+SDT    H T DL+ +     Y G +  TVGNG ++PI+H  +SQ+      F L  
Subjt:  MNYSFQGRHPPTKLSAMAATASPVTSSVTRSPNALPQDSQVWLSDTCCNAHLTSDLANLNISNTYHGDENITVGNGHSLPISHYDSSQISLYDRTFTLSN

Query:  LLRVPHISTNLLSVHQFCIDNNCCFIFYASSFTIQDKQTGQTLFHGPGVNDLYPVS--ALPISKAQVGLTANVGTKA-SFSVWHDRLGHPSTPILRSLLN
        +LRVP +++NLLSV++FC DNNCCF+F A+ F I+D  TG+ L+ GP  N LYP+   +LP        ++   TK+ S  VWHDRLGHP++ + + + +
Subjt:  LLRVPHISTNLLSVHQFCIDNNCCFIFYASSFTIQDKQTGQTLFHGPGVNDLYPVS--ALPISKAQVGLTANVGTKA-SFSVWHDRLGHPSTPILRSLLN

Query:  SSDI--SAANFSSHICRHSLDGKMSKLPFSLSTTSTTVPLEILQSDVWGLAPELSINNKKYYVSFIDDYSCYTWLFPLTCKSEVYTTFINFLPFAENLLS
        +S +  S++N +   C H + GKM+ LPF  S +    PLEI+ SDVWG +P  S    ++YV F+D+++ +TW +P+  KS+V + F++F    +NLL+
Subjt:  SSDI--SAANFSSHICRHSLDGKMSKLPFSLSTTSTTVPLEILQSDVWGLAPELSINNKKYYVSFIDDYSCYTWLFPLTCKSEVYTTFINFLPFAENLLS

Query:  SKLKTFRSDGGEEFLSRNLQTFLSNRGILHQKSYPYTREQNGVGERKHRHLVDMAISLMSKSSVLMKFWPFAFATVVFLINRLPSPTLGNKSPCE-LFHK
         K+K  R+D G E+ S    +F  + GI HQ + P+T +QNG+ ERKHRH+VD+A++L+S+SS+ + FWP+AF+T V+LINR+P       SP E LFH+
Subjt:  SKLKTFRSDGGEEFLSRNLQTFLSNRGILHQKSYPYTREQNGVGERKHRHLVDMAISLMSKSSVLMKFWPFAFATVVFLINRLPSPTLGNKSPCE-LFHK

Query:  PPEYSSLRTFGYTCYPLLRPYNCTKLQPRTTQCVFLGYPLGYKGYMCYDMSTTKFYISRHVYFDELNFPFSVPSPPSQPSPLSSMPSFSPDLALSFATCD
         P Y+SLRTFG  CYPL+RPYN  KLQPR+ +CVFLGY    KGY+CY++ T K+Y SRHV F +  FPF   S    P P + +     +  LSF TC 
Subjt:  PPEYSSLRTFGYTCYPLLRPYNCTKLQPRTTQCVFLGYPLGYKGYMCYDMSTTKFYISRHVYFDELNFPFSVPSPPSQPSPLSSMPSFSPDLALSFATCD

Query:  L-------PSFYVPPT----HP-CSSSSLLPQPSRQASNISSSP-----LNQTQLSSSLPVPEQQDPSSSV------HTSVLPASSNTQPSS--LSSVQP
        L       P    PP+    HP  SS  +L  PS  +  +SS+P      N      S P P Q    S +      H +++    NT+P +  ++S  P
Subjt:  L-------PSFYVPPT----HP-CSSSSLLPQPSRQASNISSSP-----LNQTQLSSSLPVPEQQDPSSSV------HTSVLPASSNTQPSS--LSSVQP

Query:  Q-QSPSDLAPPTTDTQPS-PIAPMSDHTTVTN---LHPMQTRSKS--ARYKARLVAKGFHQEEGVDYFETFSPVVKKPTVRIVLSIATTFGWNLRQLDVK
        Q Q            Q +  + P S +  +     ++ ++  S    ARYKARLVAKG+HQ++G+DY ETFSPVVK  TVR++LSIA    W+L+QLDV 
Subjt:  Q-QSPSDLAPPTTDTQPS-PIAPMSDHTTVTN---LHPMQTRSKS--ARYKARLVAKGFHQEEGVDYFETFSPVVKKPTVRIVLSIATTFGWNLRQLDVK

Query:  NAFLHGDLEEEVYMYQPQGFVNLTYPSYVCKLNNSLYGLKQAPRAWFHCFTSQLSTYGFVASQADPSLFVRNVRGSITYLLLYVDDIILTGNDCAYVDSL
        NAFLHG L+E VYM QP GF++  YP +VC+L  +LYGLKQAPRAWF  FTS L T GF  S ADPSLF+     ++ YLLLYVDDII+TGN    + SL
Subjt:  NAFLHGDLEEEVYMYQPQGFVNLTYPSYVCKLNNSLYGLKQAPRAWFHCFTSQLSTYGFVASQADPSLFVRNVRGSITYLLLYVDDIILTGNDCAYVDSL

Query:  VTQLRVVFDMTDLGTLTYFLGLEVKYQSTGILVSQTKYASDLLTRFGMTIAKTCATPCSSGPLDSPSLICSADDAKVYRSMVGALHYLTFTRPDISFVLS
        +T+L   FD+ DLG L +FLGL++ Y+S+G  V Q KYA+DLL +F M+  K C+TP  S             D   +RSMVG L YLTFTRPD+S+ ++
Subjt:  VTQLRVVFDMTDLGTLTYFLGLEVKYQSTGILVSQTKYASDLLTRFGMTIAKTCATPCSSGPLDSPSLICSADDAKVYRSMVGALHYLTFTRPDISFVLS

Query:  RVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKFIRSSLSLSAFSDSDWTGNRLDR----------------------------------RALASATAEL
         + QFMH PT  HL   KRILRYV+G+L  GL F    LSL+AF+DSDW G+ +DR                                  RALA+  A+L
Subjt:  RVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKFIRSSLSLSAFSDSDWTGNRLDR----------------------------------RALASATAEL

Query:  SWIQQVLKDLCIFVSTAPSLFCDNQSALQLARNPVFHGRTKHVEVDFHFVREKATPKETRSGYFFNSQENKDFVS
        +W++ VLKDL IF+   P+++CDN SAL LA NPVFH RTKHVEVD+HF+REK T ++ +  +     +  D ++
Subjt:  SWIQQVLKDLCIFVSTAPSLFCDNQSALQLARNPVFHGRTKHVEVDFHFVREKATPKETRSGYFFNSQENKDFVS

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.3e-8227.3Show/hide
Query:  DRTFTLSNLLRVPHISTNLLSVHQFCIDNNCCFIFYASSFTIQDKQTGQTLFHGPGVNDLYPVSALPISKAQVGLTANVGTKASFSVWHDRLGHPSTPIL
        D   TL ++L     + NL+SV +   +      F  S  TI   + G  +    G+     ++ +P+   Q   + N   K +F +WH+R GH S   L
Subjt:  DRTFTLSNLLRVPHISTNLLSVHQFCIDNNCCFIFYASSFTIQDKQTGQTLFHGPGVNDLYPVSALPISKAQVGLTANVGTKASFSVWHDRLGHPSTPIL

Query:  RSLLNS---SDISAAN---FSSHICRHSLDGKMSKLPFSLSTTSTTV--PLEILQSDVWGLAPELSINNKKYYVSFIDDYSCYTWLFPLTCKSEVYTTFI
          +      SD S  N    S  IC   L+GK ++LPF      T +  PL ++ SDV G    +++++K Y+V F+D ++ Y   + +  KS+V++ F 
Subjt:  RSLLNS---SDISAAN---FSSHICRHSLDGKMSKLPFSLSTTSTTV--PLEILQSDVWGLAPELSINNKKYYVSFIDDYSCYTWLFPLTCKSEVYTTFI

Query:  NFLPFAENLLSSKLKTFRSDGGEEFLSRNLQTFLSNRGILHQKSYPYTREQNGVGERKHRHLVDMAISLMSKSSVLMKFWPFAFATVVFLINRLPSPTL-
        +F+  +E   + K+     D G E+LS  ++ F   +GI +  + P+T + NGV ER  R + + A +++S + +   FW  A  T  +LINR+PS  L 
Subjt:  NFLPFAENLLSSKLKTFRSDGGEEFLSRNLQTFLSNRGILHQKSYPYTREQNGVGERKHRHLVDMAISLMSKSSVLMKFWPFAFATVVFLINRLPSPTL-

Query:  -GNKSPCELFH-KPPEYSSLRTFGYTCYPLLRPYNCTKLQPRTTQCVFLGY-PLGYKGYMCYDMSTTKFYISRHVYFDELNF----------PFSVPSPP
          +K+P E++H K P    LR FG T Y  ++     K   ++ + +F+GY P G+K    +D    KF ++R V  DE N            F   S  
Subjt:  -GNKSPCELFH-KPPEYSSLRTFGYTCYPLLRPYNCTKLQPRTTQCVFLGY-PLGYKGYMCYDMSTTKFYISRHVYFDELNF----------PFSVPSPP

Query:  SQPS--PLSSMPSFSPDLALSFATCDLPSFYVP------PTHPCSSSSLL----PQPSRQASNISSSPLNQTQLSSSLPVPEQQDPSSSVHTSVLPASSN
        S+    P  S      +       CD   F            P  S  ++    P  S++  NI    L  ++ S+   + E +      H +    S N
Subjt:  SQPS--PLSSMPSFSPDLALSFATCDLPSFYVP------PTHPCSSSSLL----PQPSRQASNISSSPLNQTQLSSSLPVPEQQDPSSSVHTSVLPASSN

Query:  TQPSSLSSVQPQQSPSDLAPPTTD--------------TQPSPIAPMSDHT---TVTNLH-----------PMQTRSKSA--------------------
           S  S          +  PT +              T+P       D++    V N H            +Q R   +                    
Subjt:  TQPSSLSSVQPQQSPSDLAPPTTD--------------TQPSPIAPMSDHT---TVTNLH-----------PMQTRSKSA--------------------

Query:  ----------------------------RYKARLVAKGFHQEEGVDYFETFSPVVKKPTVRIVLSIATTFGWNLRQLDVKNAFLHGDLEEEVYMYQPQGF
                                    RYKARLVA+GF Q+  +DY ETF+PV +  + R +LS+   +   + Q+DVK AFL+G L+EE+YM  PQG 
Subjt:  ----------------------------RYKARLVAKGFHQEEGVDYFETFSPVVKKPTVRIVLSIATTFGWNLRQLDVKNAFLHGDLEEEVYMYQPQGF

Query:  VNLTYPSYVCKLNNSLYGLKQAPRAWFHCFTSQLSTYGFVASQADPSLFVRNVRGSIT---YLLLYVDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLT
                VCKLN ++YGLKQA R WF  F   L    FV S  D  +++ + +G+I    Y+LLYVDD+++   D   +++    L   F MTDL  + 
Subjt:  VNLTYPSYVCKLNNSLYGLKQAPRAWFHCFTSQLSTYGFVASQADPSLFVRNVRGSIT---YLLLYVDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLT

Query:  YFLGLEVKYQSTGILVSQTKYASDLLTRFGMTIAKTCATPCSSGPLDSPSLICSADDAKVYRSMVGALHYLTF-TRPDISF---VLSRVSQFMHAPTYLH
        +F+G+ ++ Q   I +SQ+ Y   +L++F M      +TP  S  ++   L    D     RS++G L Y+   TRPD++    +LSR S   ++  + +
Subjt:  YFLGLEVKYQSTGILVSQTKYASDLLTRFGMTIAKTCATPCSSGPLDSPSLICSADDAKVYRSMVGALHYLTF-TRPDISF---VLSRVSQFMHAPTYLH

Query:  LAVVKRILRYVRGSLSVGLKFIRS---SLSLSAFSDSDWTGNRLDRR-----------------------------------ALASATAELSWIQQVLKD
        L   KR+LRY++G++ + L F ++      +  + DSDW G+ +DR+                                   AL  A  E  W++ +L  
Subjt:  LAVVKRILRYVRGSLSVGLKFIRS---SLSLSAFSDSDWTGNRLDRR-----------------------------------ALASATAELSWIQQVLKD

Query:  LCIFVSTAPSLFCDNQSALQLARNPVFHGRTKHVEVDFHFVREK
        + I +     ++ DNQ  + +A NP  H R KH+++ +HF RE+
Subjt:  LCIFVSTAPSLFCDNQSALQLARNPVFHGRTKHVEVDFHFVREK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.5e-10031.34Show/hide
Query:  TFTLSNLLRVPHISTNLLSVHQFCIDNNCCFIFYASSFTIQD-KQTGQTLFHGPGV--NDLYPVSALPISKAQVGLTANVGTKASFSVWHDRLGHPSTPI
        T  L ++  VP +  NL+S     +D +     Y S F  Q  + T  +L    GV    LY  +A  I + ++    +   + S  +WH R+GH S   
Subjt:  TFTLSNLLRVPHISTNLLSVHQFCIDNNCCFIFYASSFTIQD-KQTGQTLFHGPGV--NDLYPVSALPISKAQVGLTANVGTKASFSVWHDRLGHPSTPI

Query:  LRSLLNSSDISAANFSS-HICRHSLDGKMSKLPFSLSTTSTTVPLEILQSDVWGLAPELSINNKKYYVSFIDDYSCYTWLFPLTCKSEVYTTFINFLPFA
        L+ L   S IS A  ++   C + L GK  ++ F  S+      L+++ SDV G     S+   KY+V+FIDD S   W++ L  K +V+  F  F    
Subjt:  LRSLLNSSDISAANFSS-HICRHSLDGKMSKLPFSLSTTSTTVPLEILQSDVWGLAPELSINNKKYYVSFIDDYSCYTWLFPLTCKSEVYTTFINFLPFA

Query:  ENLLSSKLKTFRSDGGEEFLSRNLQTFLSNRGILHQKSYPYTREQNGVGERKHRHLVDMAISLMSKSSVLMKFWPFAFATVVFLINRLPSPTLGNKSPCE
        E     KLK  RSD G E+ SR  + + S+ GI H+K+ P T + NGV ER +R +V+   S++  + +   FW  A  T  +LINR PS  L  + P  
Subjt:  ENLLSSKLKTFRSDGGEEFLSRNLQTFLSNRGILHQKSYPYTREQNGVGERKHRHLVDMAISLMSKSSVLMKFWPFAFATVVFLINRLPSPTLGNKSPCE

Query:  LF-HKPPEYSSLRTFGYTCYPLLRPYNCTKLQPRTTQCVFLGYPLGYKGYMCYDMSTTKFYISRHVYFDELNFPFSVPSPPSQPSPLSSMPSFSPDLALS
        ++ +K   YS L+ FG   +  +     TKL  ++  C+F+GY     GY  +D    K   SR V F E                  S    + D++  
Subjt:  LF-HKPPEYSSLRTFGYTCYPLLRPYNCTKLQPRTTQCVFLGYPLGYKGYMCYDMSTTKFYISRHVYFDELNFPFSVPSPPSQPSPLSSMPSFSPDLALS

Query:  FATCDLPSFYVPPT---HPCSSSSLLPQPSRQASNISSSPLNQTQLSSSLPVPEQQDPSSSVH------------------TSVLPASSNTQPSSLSSVQ
             +P+F   P+   +P S+ S   + S Q            QL   +   E        H                  T  +  S + +P SL  V 
Subjt:  FATCDLPSFYVPPT---HPCSSSSLLPQPSRQASNISSSPLNQTQLSSSLPVPEQQDPSSSVH------------------TSVLPASSNTQPSSLSSVQ

Query:  PQQSPSDLAPPTTDTQPSPIAPMSDHTTV---TNLHPMQTR----------SKSARYKARLVAKGFHQEEGVDYFETFSPVVKKPTVRIVLSIATTFGWN
             + L     +   S +     +  V       P++ +           K  RYKARLV KGF Q++G+D+ E FSPVVK  ++R +LS+A +    
Subjt:  PQQSPSDLAPPTTDTQPSPIAPMSDHTTV---TNLHPMQTR----------SKSARYKARLVAKGFHQEEGVDYFETFSPVVKKPTVRIVLSIATTFGWN

Query:  LRQLDVKNAFLHGDLEEEVYMYQPQGFVNLTYPSYVCKLNNSLYGLKQAPRAWFHCFTSQLSTYGFVASQADPSL-FVRNVRGSITYLLLYVDDIILTGN
        + QLDVK AFLHGDLEEE+YM QP+GF        VCKLN SLYGLKQAPR W+  F S + +  ++ + +DP + F R    +   LLLYVDD+++ G 
Subjt:  LRQLDVKNAFLHGDLEEEVYMYQPQGFVNLTYPSYVCKLNNSLYGLKQAPRAWFHCFTSQLSTYGFVASQADPSL-FVRNVRGSITYLLLYVDDIILTGN

Query:  DCAYVDSLVTQLRVVFDMTDLGTLTYFLGLEVKYQSTG--ILVSQTKYASDLLTRFGMTIAKTCATPCSSGPLDSPSLIC------SADDAKV-YRSMVG
        D   +  L   L   FDM DLG     LG+++  + T   + +SQ KY   +L RF M  AK  +TP  +G L     +C        + AKV Y S VG
Subjt:  DCAYVDSLVTQLRVVFDMTDLGTLTYFLGLEVKYQSTG--ILVSQTKYASDLLTRFGMTIAKTCATPCSSGPLDSPSLIC------SADDAKV-YRSMVG

Query:  ALHY-LTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKFIRSSLSLSAFSDSDWTGNRLDRR------------------------A
        +L Y +  TRPDI+  +  VS+F+  P   H   VK ILRY+RG+    L F  S   L  ++D+D  G+  +R+                        A
Subjt:  ALHY-LTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKFIRSSLSLSAFSDSDWTGNRLDRR------------------------A

Query:  LASATA----------ELSWIQQVLKDLCIFVSTAPSLFCDNQSALQLARNPVFHGRTKHVEVDFHFVRE
        L++  A          E+ W+++ L++L +       ++CD+QSA+ L++N ++H RTKH++V +H++RE
Subjt:  LASATA----------ELSWIQQVLKDLCIFVSTAPSLFCDNQSALQLARNPVFHGRTKHVEVDFHFVRE

P92519 Uncharacterized mitochondrial protein AtMg008105.8e-3345.9Show/hide
Query:  YLLLYVDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLTYFLGLEVKYQSTGILVSQTKYASDLLTRFGMTIAKTCATPCSSGPLDSPSLICSA--DDAK
        YLLLYVDDI+LTG+    ++ L+ QL   F M DLG + YFLG+++K   +G+ +SQTKYA  +L   GM   K  +TP    PL   S + +A   D  
Subjt:  YLLLYVDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLTYFLGLEVKYQSTGILVSQTKYASDLLTRFGMTIAKTCATPCSSGPLDSPSLICSA--DDAK

Query:  VYRSMVGALHYLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKFIRSS-LSLSAFSDSDWTGNRLDRRA
         +RS+VGAL YLT TRPDIS+ ++ V Q MH PT     ++KR+LRYV+G++  GL   ++S L++ AF DSDW G    RR+
Subjt:  VYRSMVGALHYLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKFIRSS-LSLSAFSDSDWTGNRLDRRA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.5e-18538.09Show/hide
Query:  SPVTSSVTRSPNAL--PQDSQVWLSDTCCNAHLTSDLANLNISNTYHGDENITVGNGHSLPISHYDSSQISLYDRTFTLSNLLRVPHISTNLLSVHQFCI
        SP T    R+  AL  P  S  WL D+    H+TSD  NL++   Y G +++ V +G ++PISH  S+ +S   R   L N+L VP+I  NL+SV++ C 
Subjt:  SPVTSSVTRSPNAL--PQDSQVWLSDTCCNAHLTSDLANLNISNTYHGDENITVGNGHSLPISHYDSSQISLYDRTFTLSNLLRVPHISTNLLSVHQFCI

Query:  DNNCCFIFYASSFTIQDKQTGQTLFHGPGVNDLYPVSALPISKAQ-VGLTANVGTKASFSVWHDRLGHPSTPILRSLLNSSDISAANFSSHI--CRHSLD
         N     F+ +SF ++D  TG  L  G   ++LY     PI+ +Q V L A+  +KA+ S WH RLGHP+  IL S++++  +S  N S     C   L 
Subjt:  DNNCCFIFYASSFTIQDKQTGQTLFHGPGVNDLYPVSALPISKAQ-VGLTANVGTKASFSVWHDRLGHPSTPILRSLLNSSDISAANFSSHI--CRHSLD

Query:  GKMSKLPFSLSTTSTTVPLEILQSDVWGLAPELSINNKKYYVSFIDDYSCYTWLFPLTCKSEVYTTFINFLPFAENLLSSKLKTFRSDGGEEFLSRNLQT
         K +K+PFS ST ++T PLE + SDVW  +P LS +N +YYV F+D ++ YTWL+PL  KS+V  TFI F    EN   +++ TF SD G EF++  L  
Subjt:  GKMSKLPFSLSTTSTTVPLEILQSDVWGLAPELSINNKKYYVSFIDDYSCYTWLFPLTCKSEVYTTFINFLPFAENLLSSKLKTFRSDGGEEFLSRNLQT

Query:  FLSNRGILHQKSYPYTREQNGVGERKHRHLVDMAISLMSKSSVLMKFWPFAFATVVFLINRLPSPTLGNKSPCE-LFHKPPEYSSLRTFGYTCYPLLRPY
        + S  GI H  S P+T E NG+ ERKHRH+V+  ++L+S +S+   +WP+AFA  V+LINRLP+P L  +SP + LF   P Y  LR FG  CYP LRPY
Subjt:  FLSNRGILHQKSYPYTREQNGVGERKHRHLVDMAISLMSKSSVLMKFWPFAFATVVFLINRLPSPTLGNKSPCE-LFHKPPEYSSLRTFGYTCYPLLRPY

Query:  NCTKLQPRTTQCVFLGYPLGYKGYMCYDMSTTKFYISRHVYFDELNFPFS---VPSPPSQPSPLSSMPSFSPDLALSFATCDLPSFYVPPTHPCSSSSLL
        N  KL  ++ QCVFLGY L    Y+C  + T++ YISRHV FDE  FPFS       P Q     S   +SP   L   T  LP+      H  ++    
Subjt:  NCTKLQPRTTQCVFLGYPLGYKGYMCYDMSTTKFYISRHVYFDELNFPFS---VPSPPSQPSPLSSMPSFSPDLALSFATCDLPSFYVPPTHPCSSSSLL

Query:  PQPSRQASNISSSPLNQTQLSSSLP------VPEQQDP---------SSSVHTSVLPAS---SNTQPSSL--SSVQPQQSPSDLAPPTTDTQ--------
        P    + S +SSS L+ +  SSS P       P Q  P          +  H+S   +    +N  PS L  S   P QS S    PTT           
Subjt:  PQPSRQASNISSSPLNQTQLSSSLP------VPEQQDP---------SSSVHTSVLPAS---SNTQPSSL--SSVQPQQSPSDLAPPTTDTQ--------

Query:  -------PSPIAPM--SDHTTVTNLHPMQTRSKSA-----------------------------------------------------------------
               P P+A +  +++    N H M TR+K+                                                                  
Subjt:  -------PSPIAPM--SDHTTVTNLHPMQTRSKSA-----------------------------------------------------------------

Query:  --------------RYKARLVAKGFHQEEGVDYFETFSPVVKKPTVRIVLSIATTFGWNLRQLDVKNAFLHGDLEEEVYMYQPQGFVNLTYPSYVCKLNN
                      RYKARLVAKG++Q  G+DY ETFSPV+K  ++RIVL +A    W +RQLDV NAFL G L ++VYM QP GF++   P+YVCKL  
Subjt:  --------------RYKARLVAKGFHQEEGVDYFETFSPVVKKPTVRIVLSIATTFGWNLRQLDVKNAFLHGDLEEEVYMYQPQGFVNLTYPSYVCKLNN

Query:  SLYGLKQAPRAWFHCFTSQLSTYGFVASQADPSLFVRNVRGSITYLLLYVDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLTYFLGLEVKYQSTGILVS
        +LYGLKQAPRAW+    + L T GFV S +D SLFV     SI Y+L+YVDDI++TGND   + + +  L   F + D   L YFLG+E K   TG+ +S
Subjt:  SLYGLKQAPRAWFHCFTSQLSTYGFVASQADPSLFVRNVRGSITYLLLYVDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLTYFLGLEVKYQSTGILVS

Query:  QTKYASDLLTRFGMTIAKTCATPCSSGPLDSPSLICSADDAKVYRSMVGALHYLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKF
        Q +Y  DLL R  M  AK   TP +  P  S        D   YR +VG+L YL FTRPDIS+ ++R+SQFMH PT  HL  +KRILRY+ G+ + G+ F
Subjt:  QTKYASDLLTRFGMTIAKTCATPCSSGPLDSPSLICSADDAKVYRSMVGALHYLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKF

Query:  IR--SSLSLSAFSDSDWTGNRLD----------------------------------RRALASATAELSWIQQVLKDLCIFVSTAPSLFCDNQSALQLAR
        ++  ++LSL A+SD+DW G++ D                                   R++A+ ++E+ WI  +L +L I ++  P ++CDN  A  L  
Subjt:  IR--SSLSLSAFSDSDWTGNRLD----------------------------------RRALASATAELSWIQQVLKDLCIFVSTAPSLFCDNQSALQLAR

Query:  NPVFHGRTKHVEVDFHFVREK
        NPVFH R KH+ +D+HF+R +
Subjt:  NPVFHGRTKHVEVDFHFVREK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE28.3e-18136.24Show/hide
Query:  TASPVTSSVTRSPNAL--PQDSQVWLSDTCCNAHLTSDLANLNISNTYHGDENITVGNGHSLPISHYDSSQISLYDRTFTLSNLLRVPHISTNLLSVHQF
        + SP T    R+  A+  P ++  WL D+    H+TSD  NL+    Y G +++ + +G ++PI+H  S+ +    R+  L+ +L VP+I  NL+SV++ 
Subjt:  TASPVTSSVTRSPNAL--PQDSQVWLSDTCCNAHLTSDLANLNISNTYHGDENITVGNGHSLPISHYDSSQISLYDRTFTLSNLLRVPHISTNLLSVHQF

Query:  CIDNNCCFIFYASSFTIQDKQTGQTLFHGPGVNDLYPVSALPISKAQ-VGLTANVGTKASFSVWHDRLGHPSTPILRSLLNSSDISAANFSSHI--CRHS
        C  N     F+ +SF ++D  TG  L  G   ++LY     PI+ +Q V + A+  +KA+ S WH RLGHPS  IL S++++  +   N S  +  C   
Subjt:  CIDNNCCFIFYASSFTIQDKQTGQTLFHGPGVNDLYPVSALPISKAQ-VGLTANVGTKASFSVWHDRLGHPSTPILRSLLNSSDISAANFSSHI--CRHS

Query:  LDGKMSKLPFSLSTTSTTVPLEILQSDVWGLAPELSINNKKYYVSFIDDYSCYTWLFPLTCKSEVYTTFINFLPFAENLLSSKLKTFRSDGGEEFLSRNL
           K  K+PFS ST +++ PLE + SDVW  +P LSI+N +YYV F+D ++ YTWL+PL  KS+V  TFI F    EN   +++ T  SD G EF+   L
Subjt:  LDGKMSKLPFSLSTTSTTVPLEILQSDVWGLAPELSINNKKYYVSFIDDYSCYTWLFPLTCKSEVYTTFINFLPFAENLLSSKLKTFRSDGGEEFLSRNL

Query:  QTFLSNRGILHQKSYPYTREQNGVGERKHRHLVDMAISLMSKSSVLMKFWPFAFATVVFLINRLPSPTLGNKSPCE-LFHKPPEYSSLRTFGYTCYPLLR
        + +LS  GI H  S P+T E NG+ ERKHRH+V+M ++L+S +SV   +WP+AF+  V+LINRLP+P L  +SP + LF +PP Y  L+ FG  CYP LR
Subjt:  QTFLSNRGILHQKSYPYTREQNGVGERKHRHLVDMAISLMSKSSVLMKFWPFAFATVVFLINRLPSPTLGNKSPCE-LFHKPPEYSSLRTFGYTCYPLLR

Query:  PYNCTKLQPRTTQCVFLGYPLGYKGYMCYDMSTTKFYISRHVYFDELNFPFSV---------------------------------------------PS
        PYN  KL+ ++ QC F+GY L    Y+C  + T + Y SRHV FDE  FPFS                                              P 
Subjt:  PYNCTKLQPRTTQCVFLGYPLGYKGYMCYDMSTTKFYISRHVYFDELNFPFSV---------------------------------------------PS

Query:  PPSQPSPLSSMPSFSPDLALSFATCDLPSFYVPPTHPCSSSSLLPQPSRQASNISSSPLNQTQLSSSLPVPEQQDPSSSVHTSVLPASSNTQP------S
        PPS PSPL +    S +L  S  +    S    PT P  +    PQP+ Q     +S  N   L++  P P    P+S    S LP S  + P      +
Subjt:  PPSQPSPLSSMPSFSPDLALSFATCDLPSFYVPPTHPCSSSSLLPQPSRQASNISSSPLNQTQLSSSLPVPEQQDPSSSVHTSVLPASSNTQP------S

Query:  SLSSVQPQQSPSDLAPPTTDTQPSPIAPMSDHTTVTNLHPMQTRSKSA----------------------------------------------------
        S+S      S S   PP     P+P     +     N H M TR+K                                                      
Subjt:  SLSSVQPQQSPSDLAPPTTDTQPSPIAPMSDHTTVTNLHPMQTRSKSA----------------------------------------------------

Query:  ---------------------------RYKARLVAKGFHQEEGVDYFETFSPVVKKPTVRIVLSIATTFGWNLRQLDVKNAFLHGDLEEEVYMYQPQGFV
                                   RYKARLVAKG++Q  G+DY ETFSPV+K  ++RIVL +A    W +RQLDV NAFL G L +EVYM QP GFV
Subjt:  ---------------------------RYKARLVAKGFHQEEGVDYFETFSPVVKKPTVRIVLSIATTFGWNLRQLDVKNAFLHGDLEEEVYMYQPQGFV

Query:  NLTYPSYVCKLNNSLYGLKQAPRAWFHCFTSQLSTYGFVASQADPSLFVRNVRGSITYLLLYVDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLTYFLG
        +   P YVC+L  ++YGLKQAPRAW+    + L T GFV S +D SLFV     SI Y+L+YVDDI++TGND   +   +  L   F + +   L YFLG
Subjt:  NLTYPSYVCKLNNSLYGLKQAPRAWFHCFTSQLSTYGFVASQADPSLFVRNVRGSITYLLLYVDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLTYFLG

Query:  LEVKYQSTGILVSQTKYASDLLTRFGMTIAKTCATPCSSGPLDSPSLICSADDAKVYRSMVGALHYLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRIL
        +E K    G+ +SQ +Y  DLL R  M  AK  ATP ++ P  +        D   YR +VG+L YL FTRPD+S+ ++R+SQ+MH PT  H   +KR+L
Subjt:  LEVKYQSTGILVSQTKYASDLLTRFGMTIAKTCATPCSSGPLDSPSLICSADDAKVYRSMVGALHYLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRIL

Query:  RYVRGSLSVGLKFIR--SSLSLSAFSDSDWTGN----------------------------------RLDRRALASATAELSWIQQVLKDLCIFVSTAPS
        RY+ G+   G+ F++  ++LSL A+SD+DW G+                                    + R++A+ ++EL WI  +L +L I +S  P 
Subjt:  RYVRGSLSVGLKFIR--SSLSLSAFSDSDWTGN----------------------------------RLDRRALASATAELSWIQQVLKDLCIFVSTAPS

Query:  LFCDNQSALQLARNPVFHGRTKHVEVDFHFVREK
        ++CDN  A  L  NPVFH R KH+ +D+HF+R +
Subjt:  LFCDNQSALQLARNPVFHGRTKHVEVDFHFVREK

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.2e-7940.46Show/hide
Query:  RYKARLVAKGFHQEEGVDYFETFSPVVKKPTVRIVLSIATTFGWNLRQLDVKNAFLHGDLEEEVYMYQPQGFV----NLTYPSYVCKLNNSLYGLKQAPR
        RYKARLVAKG+ Q+EG+D+ ETFSPV K  +V+++L+I+  + + L QLD+ NAFL+GDL+EE+YM  P G+     +   P+ VC L  S+YGLKQA R
Subjt:  RYKARLVAKGFHQEEGVDYFETFSPVVKKPTVRIVLSIATTFGWNLRQLDVKNAFLHGDLEEEVYMYQPQGFV----NLTYPSYVCKLNNSLYGLKQAPR

Query:  AWFHCFTSQLSTYGFVASQADPSLFVRNVRGSITYLLLYVDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLTYFLGLEVKYQSTGILVSQTKYASDLLT
         WF  F+  L  +GFV S +D + F++        +L+YVDDII+  N+ A VD L +QL+  F + DLG L YFLGLE+   + GI + Q KYA DLL 
Subjt:  AWFHCFTSQLSTYGFVASQADPSLFVRNVRGSITYLLLYVDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLTYFLGLEVKYQSTGILVSQTKYASDLLT

Query:  RFGMTIAKTCATPCSSGPLDSPSLICSAD------DAKVYRSMVGALHYLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKF-IRS
          G+   K      SS P+D PS+  SA       DAK YR ++G L YL  TR DISF ++++SQF  AP   H   V +IL Y++G++  GL +  ++
Subjt:  RFGMTIAKTCATPCSSGPLDSPSLICSAD------DAKVYRSMVGALHYLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKF-IRS

Query:  SLSLSAFSDS------------------------DWTGNR----------LDRRALASATAELSWIQQVLKDLCIFVSTAPSLFCDNQSALQLARNPVFH
         + L  FSD+                         W   +           + RAL+ AT E+ W+ Q  ++L + +S    LFCDN +A+ +A N VFH
Subjt:  SLSLSAFSDS------------------------DWTGNR----------LDRRALASATAELSWIQQVLKDLCIFVSTAPSLFCDNQSALQLARNPVFH

Query:  GRTKHVEVDFHFVREKATPKETRSGYFFNSQENKD
         RTKH+E D H VRE++  + T S Y F + + +D
Subjt:  GRTKHVEVDFHFVREKATPKETRSGYFFNSQENKD

ATMG00240.1 Gag-Pol-related retrotransposon family protein3.9e-0841.89Show/hide
Query:  YLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKF-IRSSLSLSAFSDSDWTGNRLDRRAL
        YLT TRPD++F ++R+SQF  A     +  V ++L YV+G++  GL +   S L L AF+DSDW      RR++
Subjt:  YLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKF-IRSSLSLSAFSDSDWTGNRLDRRAL

ATMG00810.1 DNA/RNA polymerases superfamily protein4.1e-3445.9Show/hide
Query:  YLLLYVDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLTYFLGLEVKYQSTGILVSQTKYASDLLTRFGMTIAKTCATPCSSGPLDSPSLICSA--DDAK
        YLLLYVDDI+LTG+    ++ L+ QL   F M DLG + YFLG+++K   +G+ +SQTKYA  +L   GM   K  +TP    PL   S + +A   D  
Subjt:  YLLLYVDDIILTGNDCAYVDSLVTQLRVVFDMTDLGTLTYFLGLEVKYQSTGILVSQTKYASDLLTRFGMTIAKTCATPCSSGPLDSPSLICSA--DDAK

Query:  VYRSMVGALHYLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKFIRSS-LSLSAFSDSDWTGNRLDRRA
         +RS+VGAL YLT TRPDIS+ ++ V Q MH PT     ++KR+LRYV+G++  GL   ++S L++ AF DSDW G    RR+
Subjt:  VYRSMVGALHYLTFTRPDISFVLSRVSQFMHAPTYLHLAVVKRILRYVRGSLSVGLKFIRSS-LSLSAFSDSDWTGNRLDRRA

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.1e-0764.1Show/hide
Query:  RYKARLVAKGFHQEEGVDYFETFSPVVKKPTVRIVLSIA
        R KARLVAKGFHQEEG+ + ET+SPVV+  T+R +L++A
Subjt:  RYKARLVAKGFHQEEGVDYFETFSPVVKKPTVRIVLSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACTACTCGTTTCAAGGAAGGCATCCTCCTACAAAGTTGTCTGCCATGGCCGCTACTGCCTCTCCTGTTACTTCCAGTGTAACTAGATCACCTAATGCTCTTCCTCA
GGATTCGCAGGTTTGGTTGTCTGACACTTGTTGCAACGCACATTTGACGAGTGATCTAGCTAATCTCAATATCTCAAACACTTACCATGGAGACGAGAATATCACAGTTG
GTAATGGCCACTCTCTTCCCATTTCTCACTACGACTCTAGCCAAATTTCTCTTTATGATCGTACATTTACTCTTTCTAATCTTCTTCGTGTTCCGCATATCTCCACCAAT
CTTCTTTCAGTTCATCAGTTCTGCATCGATAATAACTGTTGTTTTATTTTTTATGCTTCATCCTTCACCATTCAGGACAAACAAACAGGCCAAACCTTGTTCCACGGACC
TGGCGTGAACGACCTTTACCCTGTCTCAGCTCTGCCCATATCAAAAGCACAAGTAGGCCTTACTGCTAACGTAGGTACCAAGGCTTCATTCAGTGTGTGGCACGATAGGT
TAGGCCACCCCTCTACCCCTATTCTTCGATCTTTACTTAATTCTTCTGATATCTCTGCTGCCAATTTTAGTTCTCATATATGTAGACATTCTCTTGATGGAAAAATGTCT
AAACTACCTTTTTCCTTATCCACTACTTCTACTACTGTACCACTGGAAATCTTACAAAGTGATGTATGGGGCCTTGCCCCTGAATTGTCAATAAATAATAAAAAATATTA
TGTGTCTTTCATTGACGATTACTCTTGCTATACCTGGTTGTTTCCCTTGACATGTAAGTCTGAAGTTTATACTACATTCATTAATTTTCTACCTTTTGCTGAGAACTTAT
TGTCTTCCAAATTAAAAACATTTCGTAGTGATGGGGGTGAGGAGTTCCTTAGCCGTAATCTACAGACCTTTCTTTCAAACCGTGGCATTCTTCACCAAAAATCCTACCCA
TATACACGGGAACAAAATGGGGTAGGCGAGCGCAAGCATCGTCACCTTGTTGATATGGCTATTTCCCTTATGTCCAAGTCTTCTGTTCTAATGAAATTTTGGCCTTTTGC
CTTTGCTACTGTTGTGTTTCTTATAAACCGGCTTCCTTCACCTACTCTTGGCAATAAATCGCCCTGTGAACTCTTTCACAAGCCTCCTGAATATTCTTCCCTTCGCACCT
TTGGTTACACTTGTTATCCACTTCTTCGTCCATATAATTGTACCAAACTCCAACCTCGTACTACTCAATGTGTGTTCCTTGGTTATCCTCTAGGTTATAAGGGCTACATG
TGCTATGACATGTCTACCACCAAATTCTACATATCTCGCCATGTCTACTTTGATGAACTCAACTTTCCCTTTTCTGTTCCTTCACCTCCATCTCAACCAAGTCCGCTCAG
TTCCATGCCTTCCTTTTCACCAGATTTAGCTCTATCCTTTGCAACCTGTGACTTGCCTTCCTTTTATGTCCCTCCTACTCATCCTTGCTCTTCCTCATCTCTTTTGCCTC
AACCTAGTAGGCAAGCCTCCAATATTTCCTCATCACCACTTAACCAAACACAGCTTAGTTCTTCACTCCCTGTTCCAGAACAGCAAGATCCTTCTTCTTCTGTTCATACA
TCAGTTCTACCAGCATCTTCCAATACACAACCTTCTTCCTTATCTTCTGTCCAACCACAGCAAAGTCCTTCTGACCTTGCTCCACCTACGACTGATACTCAACCATCTCC
TATTGCCCCAATGAGTGATCATACCACTGTGACTAATCTCCATCCAATGCAAACTCGTTCTAAATCAGCTCGTTACAAGGCTAGATTGGTTGCCAAAGGCTTCCATCAAG
AAGAAGGGGTTGATTATTTTGAAACTTTTAGCCCAGTTGTAAAGAAACCAACAGTTCGCATAGTTTTATCTATTGCTACAACTTTTGGTTGGAATCTTAGACAATTGGAT
GTCAAGAATGCATTCCTTCATGGTGATTTAGAAGAGGAAGTTTACATGTATCAACCACAAGGTTTTGTCAATTTAACTTATCCTTCATATGTGTGTAAGTTGAACAACTC
TCTTTACGGCCTAAAGCAAGCACCTCGTGCTTGGTTTCATTGTTTTACTTCCCAACTCTCTACCTATGGTTTCGTTGCTTCACAAGCCGATCCATCCCTTTTTGTTCGAA
ATGTGCGTGGCTCTATCACATATCTCCTTTTGTATGTTGATGATATTATCTTAACTGGGAATGATTGTGCTTATGTTGATTCTCTTGTTACTCAATTGCGAGTTGTCTTT
GATATGACTGACTTGGGGACATTAACCTATTTTCTTGGCCTTGAGGTCAAGTATCAGTCCACTGGTATTTTAGTCTCCCAAACCAAGTATGCATCAGATTTACTTACAAG
ATTTGGAATGACTATTGCTAAGACTTGTGCTACACCGTGCTCTTCGGGTCCCCTTGATTCTCCTTCTCTGATTTGTAGTGCTGATGATGCCAAGGTTTATCGCAGCATGG
TTGGTGCACTCCATTATTTGACCTTCACACGTCCCGACATATCATTTGTATTGAGTCGAGTTTCTCAGTTCATGCATGCTCCAACATACCTTCATCTTGCTGTTGTTAAA
CGCATACTGAGATATGTTCGTGGTTCCTTATCTGTTGGCCTCAAGTTTATACGCAGCTCTCTGTCTCTGTCAGCCTTCTCTGATTCTGATTGGACAGGAAACCGTCTGGA
TCGCAGGGCATTGGCTTCTGCTACAGCTGAGTTGTCATGGATCCAACAAGTTCTAAAGGATTTGTGTATTTTTGTCTCTACTGCTCCTTCATTGTTCTGTGACAACCAGT
CAGCCTTACAACTTGCTAGAAATCCAGTCTTTCATGGTCGTACCAAGCACGTGGAGGTAGACTTCCATTTTGTCCGGGAGAAGGCAACCCCAAAGGAAACGAGAAGTGGA
TACTTCTTCAATTCACAAGAGAATAAAGACTTTGTATCGACAAATGCTAAATTCTCAGAAGAAGTCCACATCAAAAATCATAAACCACAAAGTAAAGTCGGTTTAAGTAA
GTTAACTAATGAAGCTACAGTAACATCCACAAAAGTTGTTGATGAAATTGGCTGTTCAACAAGAGTTGTTGGTGGACCAGTGCATCTGGTTATCTTACTGACATTAAAAG
AGGCTAGTAGCTCATTTCCAAATGAAAATTTGGGTGTAGCTGAGCTGGCCTTGTCTCAAGTGTCTTATATCTACAAGATGTTAGTTAGATATAAGATGCATAATTCCAAG
AAGGGTTCACTACCTTTTAGGCATGAAATTCTTTTGTCTAAAGAACTATGTCTTGAGACACCTCAAGATGTTGAGGACTTGAGACATAAGCCCTATGCAGTAGGGATTGT
CAGTAAGTATCAATCAAACCCAAGAATTGATCTCTGGAGAACCGTTAAGTACATCCTCCCGTATCTTAGGAGAACGGGGGTCTATAGGCTCGTGTATGGCACGGAGGATT
GGATCCTTACACGATACACTGACATAGATTTTCAAACTGATAAAGATTTTAGGAAATCTACATCAGTATCTGTGTTCACTTTTAACAGAGGAGATATAGCTATCCTCCGA
CAACAAAAAGTTGATTATGCTCTCATGGAAGCCAAGGACATTCTGGAAAGCATAAAGGAAGATCAACTTAAAGAAATGAATAAGGTAGCACATAGCCTTATCATTTTGCA
TTTGTCTGACACGATTCTTAGACAAATGAGAGCGAGGATACAACAGGAAAATCATGCTATTGTTCTTGTAAATTCTTTGCCTGATGCATATAATGATGTGAAATCCGCTA
TGAAGTTTGGTAGCGATAGTTTGACTACTGAGATTGTGATTGTTGCGATTAGAGTGAAAGAACAAGAATTAAAAGAAAATAAAATACCTAGTGAGTCATTGTTTGCTAGA
GGGAGATCTGAGAGTCGAAATTCTAAAAATAGGGGAAGAAGTAGAGAAGGTCAAGGGACAATTAGGATTAGACTTTATGATGGGAATATTAGGCTTCTTACAGGAGTTAG
GTACGTACCTGAAGTTAAGAGAAACCTAATATCGCTTAGCATGCTAGATAAGTTAGGTTACACCATTAAGTTAGAGGGTGGTGTGCAACAAGACCTACTGACCCTTTACA
AATACCTCAGCGCGATTTTGATCCATCTTGAAGTCCATGGGATGTCGAGGCGGTCAAAAGACCTTAGTGGAGTGCTCAGGATTCGAAATCAGCACCAAACAGCTCAAGTT
TTGAAGTGTCCAATTCTTAAGGACCCGAGAGATGGCGTCGAGGCGCCTCGAATGCAGAAGCGCGAAAATATCAGCGTCAGCGTCTCGACGCCGGTGCGTTTATTTAAACA
GCTACTGCTGTATTAG
mRNA sequenceShow/hide mRNA sequence
ATGAACTACTCGTTTCAAGGAAGGCATCCTCCTACAAAGTTGTCTGCCATGGCCGCTACTGCCTCTCCTGTTACTTCCAGTGTAACTAGATCACCTAATGCTCTTCCTCA
GGATTCGCAGGTTTGGTTGTCTGACACTTGTTGCAACGCACATTTGACGAGTGATCTAGCTAATCTCAATATCTCAAACACTTACCATGGAGACGAGAATATCACAGTTG
GTAATGGCCACTCTCTTCCCATTTCTCACTACGACTCTAGCCAAATTTCTCTTTATGATCGTACATTTACTCTTTCTAATCTTCTTCGTGTTCCGCATATCTCCACCAAT
CTTCTTTCAGTTCATCAGTTCTGCATCGATAATAACTGTTGTTTTATTTTTTATGCTTCATCCTTCACCATTCAGGACAAACAAACAGGCCAAACCTTGTTCCACGGACC
TGGCGTGAACGACCTTTACCCTGTCTCAGCTCTGCCCATATCAAAAGCACAAGTAGGCCTTACTGCTAACGTAGGTACCAAGGCTTCATTCAGTGTGTGGCACGATAGGT
TAGGCCACCCCTCTACCCCTATTCTTCGATCTTTACTTAATTCTTCTGATATCTCTGCTGCCAATTTTAGTTCTCATATATGTAGACATTCTCTTGATGGAAAAATGTCT
AAACTACCTTTTTCCTTATCCACTACTTCTACTACTGTACCACTGGAAATCTTACAAAGTGATGTATGGGGCCTTGCCCCTGAATTGTCAATAAATAATAAAAAATATTA
TGTGTCTTTCATTGACGATTACTCTTGCTATACCTGGTTGTTTCCCTTGACATGTAAGTCTGAAGTTTATACTACATTCATTAATTTTCTACCTTTTGCTGAGAACTTAT
TGTCTTCCAAATTAAAAACATTTCGTAGTGATGGGGGTGAGGAGTTCCTTAGCCGTAATCTACAGACCTTTCTTTCAAACCGTGGCATTCTTCACCAAAAATCCTACCCA
TATACACGGGAACAAAATGGGGTAGGCGAGCGCAAGCATCGTCACCTTGTTGATATGGCTATTTCCCTTATGTCCAAGTCTTCTGTTCTAATGAAATTTTGGCCTTTTGC
CTTTGCTACTGTTGTGTTTCTTATAAACCGGCTTCCTTCACCTACTCTTGGCAATAAATCGCCCTGTGAACTCTTTCACAAGCCTCCTGAATATTCTTCCCTTCGCACCT
TTGGTTACACTTGTTATCCACTTCTTCGTCCATATAATTGTACCAAACTCCAACCTCGTACTACTCAATGTGTGTTCCTTGGTTATCCTCTAGGTTATAAGGGCTACATG
TGCTATGACATGTCTACCACCAAATTCTACATATCTCGCCATGTCTACTTTGATGAACTCAACTTTCCCTTTTCTGTTCCTTCACCTCCATCTCAACCAAGTCCGCTCAG
TTCCATGCCTTCCTTTTCACCAGATTTAGCTCTATCCTTTGCAACCTGTGACTTGCCTTCCTTTTATGTCCCTCCTACTCATCCTTGCTCTTCCTCATCTCTTTTGCCTC
AACCTAGTAGGCAAGCCTCCAATATTTCCTCATCACCACTTAACCAAACACAGCTTAGTTCTTCACTCCCTGTTCCAGAACAGCAAGATCCTTCTTCTTCTGTTCATACA
TCAGTTCTACCAGCATCTTCCAATACACAACCTTCTTCCTTATCTTCTGTCCAACCACAGCAAAGTCCTTCTGACCTTGCTCCACCTACGACTGATACTCAACCATCTCC
TATTGCCCCAATGAGTGATCATACCACTGTGACTAATCTCCATCCAATGCAAACTCGTTCTAAATCAGCTCGTTACAAGGCTAGATTGGTTGCCAAAGGCTTCCATCAAG
AAGAAGGGGTTGATTATTTTGAAACTTTTAGCCCAGTTGTAAAGAAACCAACAGTTCGCATAGTTTTATCTATTGCTACAACTTTTGGTTGGAATCTTAGACAATTGGAT
GTCAAGAATGCATTCCTTCATGGTGATTTAGAAGAGGAAGTTTACATGTATCAACCACAAGGTTTTGTCAATTTAACTTATCCTTCATATGTGTGTAAGTTGAACAACTC
TCTTTACGGCCTAAAGCAAGCACCTCGTGCTTGGTTTCATTGTTTTACTTCCCAACTCTCTACCTATGGTTTCGTTGCTTCACAAGCCGATCCATCCCTTTTTGTTCGAA
ATGTGCGTGGCTCTATCACATATCTCCTTTTGTATGTTGATGATATTATCTTAACTGGGAATGATTGTGCTTATGTTGATTCTCTTGTTACTCAATTGCGAGTTGTCTTT
GATATGACTGACTTGGGGACATTAACCTATTTTCTTGGCCTTGAGGTCAAGTATCAGTCCACTGGTATTTTAGTCTCCCAAACCAAGTATGCATCAGATTTACTTACAAG
ATTTGGAATGACTATTGCTAAGACTTGTGCTACACCGTGCTCTTCGGGTCCCCTTGATTCTCCTTCTCTGATTTGTAGTGCTGATGATGCCAAGGTTTATCGCAGCATGG
TTGGTGCACTCCATTATTTGACCTTCACACGTCCCGACATATCATTTGTATTGAGTCGAGTTTCTCAGTTCATGCATGCTCCAACATACCTTCATCTTGCTGTTGTTAAA
CGCATACTGAGATATGTTCGTGGTTCCTTATCTGTTGGCCTCAAGTTTATACGCAGCTCTCTGTCTCTGTCAGCCTTCTCTGATTCTGATTGGACAGGAAACCGTCTGGA
TCGCAGGGCATTGGCTTCTGCTACAGCTGAGTTGTCATGGATCCAACAAGTTCTAAAGGATTTGTGTATTTTTGTCTCTACTGCTCCTTCATTGTTCTGTGACAACCAGT
CAGCCTTACAACTTGCTAGAAATCCAGTCTTTCATGGTCGTACCAAGCACGTGGAGGTAGACTTCCATTTTGTCCGGGAGAAGGCAACCCCAAAGGAAACGAGAAGTGGA
TACTTCTTCAATTCACAAGAGAATAAAGACTTTGTATCGACAAATGCTAAATTCTCAGAAGAAGTCCACATCAAAAATCATAAACCACAAAGTAAAGTCGGTTTAAGTAA
GTTAACTAATGAAGCTACAGTAACATCCACAAAAGTTGTTGATGAAATTGGCTGTTCAACAAGAGTTGTTGGTGGACCAGTGCATCTGGTTATCTTACTGACATTAAAAG
AGGCTAGTAGCTCATTTCCAAATGAAAATTTGGGTGTAGCTGAGCTGGCCTTGTCTCAAGTGTCTTATATCTACAAGATGTTAGTTAGATATAAGATGCATAATTCCAAG
AAGGGTTCACTACCTTTTAGGCATGAAATTCTTTTGTCTAAAGAACTATGTCTTGAGACACCTCAAGATGTTGAGGACTTGAGACATAAGCCCTATGCAGTAGGGATTGT
CAGTAAGTATCAATCAAACCCAAGAATTGATCTCTGGAGAACCGTTAAGTACATCCTCCCGTATCTTAGGAGAACGGGGGTCTATAGGCTCGTGTATGGCACGGAGGATT
GGATCCTTACACGATACACTGACATAGATTTTCAAACTGATAAAGATTTTAGGAAATCTACATCAGTATCTGTGTTCACTTTTAACAGAGGAGATATAGCTATCCTCCGA
CAACAAAAAGTTGATTATGCTCTCATGGAAGCCAAGGACATTCTGGAAAGCATAAAGGAAGATCAACTTAAAGAAATGAATAAGGTAGCACATAGCCTTATCATTTTGCA
TTTGTCTGACACGATTCTTAGACAAATGAGAGCGAGGATACAACAGGAAAATCATGCTATTGTTCTTGTAAATTCTTTGCCTGATGCATATAATGATGTGAAATCCGCTA
TGAAGTTTGGTAGCGATAGTTTGACTACTGAGATTGTGATTGTTGCGATTAGAGTGAAAGAACAAGAATTAAAAGAAAATAAAATACCTAGTGAGTCATTGTTTGCTAGA
GGGAGATCTGAGAGTCGAAATTCTAAAAATAGGGGAAGAAGTAGAGAAGGTCAAGGGACAATTAGGATTAGACTTTATGATGGGAATATTAGGCTTCTTACAGGAGTTAG
GTACGTACCTGAAGTTAAGAGAAACCTAATATCGCTTAGCATGCTAGATAAGTTAGGTTACACCATTAAGTTAGAGGGTGGTGTGCAACAAGACCTACTGACCCTTTACA
AATACCTCAGCGCGATTTTGATCCATCTTGAAGTCCATGGGATGTCGAGGCGGTCAAAAGACCTTAGTGGAGTGCTCAGGATTCGAAATCAGCACCAAACAGCTCAAGTT
TTGAAGTGTCCAATTCTTAAGGACCCGAGAGATGGCGTCGAGGCGCCTCGAATGCAGAAGCGCGAAAATATCAGCGTCAGCGTCTCGACGCCGGTGCGTTTATTTAAACA
GCTACTGCTGTATTAG
Protein sequenceShow/hide protein sequence
MNYSFQGRHPPTKLSAMAATASPVTSSVTRSPNALPQDSQVWLSDTCCNAHLTSDLANLNISNTYHGDENITVGNGHSLPISHYDSSQISLYDRTFTLSNLLRVPHISTN
LLSVHQFCIDNNCCFIFYASSFTIQDKQTGQTLFHGPGVNDLYPVSALPISKAQVGLTANVGTKASFSVWHDRLGHPSTPILRSLLNSSDISAANFSSHICRHSLDGKMS
KLPFSLSTTSTTVPLEILQSDVWGLAPELSINNKKYYVSFIDDYSCYTWLFPLTCKSEVYTTFINFLPFAENLLSSKLKTFRSDGGEEFLSRNLQTFLSNRGILHQKSYP
YTREQNGVGERKHRHLVDMAISLMSKSSVLMKFWPFAFATVVFLINRLPSPTLGNKSPCELFHKPPEYSSLRTFGYTCYPLLRPYNCTKLQPRTTQCVFLGYPLGYKGYM
CYDMSTTKFYISRHVYFDELNFPFSVPSPPSQPSPLSSMPSFSPDLALSFATCDLPSFYVPPTHPCSSSSLLPQPSRQASNISSSPLNQTQLSSSLPVPEQQDPSSSVHT
SVLPASSNTQPSSLSSVQPQQSPSDLAPPTTDTQPSPIAPMSDHTTVTNLHPMQTRSKSARYKARLVAKGFHQEEGVDYFETFSPVVKKPTVRIVLSIATTFGWNLRQLD
VKNAFLHGDLEEEVYMYQPQGFVNLTYPSYVCKLNNSLYGLKQAPRAWFHCFTSQLSTYGFVASQADPSLFVRNVRGSITYLLLYVDDIILTGNDCAYVDSLVTQLRVVF
DMTDLGTLTYFLGLEVKYQSTGILVSQTKYASDLLTRFGMTIAKTCATPCSSGPLDSPSLICSADDAKVYRSMVGALHYLTFTRPDISFVLSRVSQFMHAPTYLHLAVVK
RILRYVRGSLSVGLKFIRSSLSLSAFSDSDWTGNRLDRRALASATAELSWIQQVLKDLCIFVSTAPSLFCDNQSALQLARNPVFHGRTKHVEVDFHFVREKATPKETRSG
YFFNSQENKDFVSTNAKFSEEVHIKNHKPQSKVGLSKLTNEATVTSTKVVDEIGCSTRVVGGPVHLVILLTLKEASSSFPNENLGVAELALSQVSYIYKMLVRYKMHNSK
KGSLPFRHEILLSKELCLETPQDVEDLRHKPYAVGIVSKYQSNPRIDLWRTVKYILPYLRRTGVYRLVYGTEDWILTRYTDIDFQTDKDFRKSTSVSVFTFNRGDIAILR
QQKVDYALMEAKDILESIKEDQLKEMNKVAHSLIILHLSDTILRQMRARIQQENHAIVLVNSLPDAYNDVKSAMKFGSDSLTTEIVIVAIRVKEQELKENKIPSESLFAR
GRSESRNSKNRGRSREGQGTIRIRLYDGNIRLLTGVRYVPEVKRNLISLSMLDKLGYTIKLEGGVQQDLLTLYKYLSAILIHLEVHGMSRRSKDLSGVLRIRNQHQTAQV
LKCPILKDPRDGVEAPRMQKRENISVSVSTPVRLFKQLLLY