; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0030971 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0030971
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr11:3380437..3388386
RNA-Seq ExpressionLag0030971
SyntenyLag0030971
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBG97282.1 hypothetical protein Prudu_006352 [Prunus dulcis]8.2e-26339.81Show/hide
Query:  SSIFLLSNICNLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEF-LLDADGRKTTEVNSAHTQWIAQDSALITLINATLSKTAYSYVIGCKS
        +S   L  +  ++ +RL   N+L W++Q++S+L+ H LFG  DGS+  PP++ +LD +G  T+ + +A+  W+  D AL++L+ ATLS  A  YVIG K+
Subjt:  SSIFLLSNICNLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEF-LLDADGRKTTEVNSAHTQWIAQDSALITLINATLSKTAYSYVIGCKS

Query:  SKEVWLSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGTLKLQEL--HAL
        + E W++L  ++++++R+ V+ LK+ L T  K A +SI+++LLR+K + D+LA   + + D+DL+L  LNGL SEY+  +T +  R+ +L  ++   H L
Subjt:  SKEVWLSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGTLKLQEL--HAL

Query:  LKSEAKILEQQNKSITTTPLVPTAMFSSVS-GQFNSNRG---RGRGRNSNPSYWSGRGGGRSNQGYYGNSNQ---GRGGFNSSNP-SPGFPQGHSAPNQG
           +A         +  +P+V     SS S G  +S       G G    PS  S    G  +   +G S+     RG F+ S P   GFP     P + 
Subjt:  LKSEAKILEQQNKSITTTPLVPTAMFSSVS-GQFNSNRG---RGRGRNSNPSYWSGRGGGRSNQGYYGNSNQ---GRGGFNSSNP-SPGFPQGHSAPNQG

Query:  -----------RGANSG------SNSSSGRGVI-CQICNRSGHGALDCYNRLNLSFQGRHPPSKLAAMASIYDPQSSNNSGNNNCTWLADSGCNSHVTPD
                   RG  +       S SS G  VI CQIC + GHGALDCY+R N ++QG  PPS L AMA       +  S + +  W+ADSG + H+ P 
Subjt:  -----------RGANSG------SNSSSGRGVI-CQICNRSGHGALDCYNRLNLSFQGRHPPSKLAAMASIYDPQSSNNSGNNNCTWLADSGCNSHVTPD

Query:  LSTLALNSNYNGEDAITVANGQGVPITQTGSGTLSTSHSDLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKDGLYPI
        ++T+   +     + + V NG+G+ I   G   + T  S L LSK+L VP L+ANLLSV Q C DNNC  +FD   F IQDK + + L +GKS+ GLYP+
Subjt:  LSTLALNSNYNGEDAITVANGQGVPITQTGSGTLSTSHSDLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKDGLYPI

Query:  -SSISSAGSSLRSNSTVAGLHTSAPLSPICASVFVSHVPSTVLWHLRLGHPSFPVLQKLLSANSITCDTKYTCRDCVSCLKGKASKLPFASSTSITTRPL
         +S+SS G+   S+S    + TS+  S    S F+     + LWH RLGHP+  V+Q +L+A  I   +    + C  CL GK  +LPF+S  +  + P 
Subjt:  -SSISSAGSSLRSNSTVAGLHTSAPLSPICASVFVSHVPSTVLWHLRLGHPSFPVLQKLLSANSITCDTKYTCRDCVSCLKGKASKLPFASSTSITTRPL

Query:  ALLHSDVWGPSPLISVSGFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPHTPEQ
          LHSDVWGPSP  S+SG+RY V+F+D+++ F W++PL  KS+V ++  + V F+    + S++  +SDGGGE+++    EF +S G++HQ SCP TP+Q
Subjt:  ALLHSDVWGPSPLISVSGFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPHTPEQ

Query:  NGAAEQKHRSIVDTAIALMNHASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFLGYTL
        NG AE+K+R +++T+I L+  AS+P +FW+HA A   +L+NR+PS  +   +P+ +L    PD+ HLRVFG A YP L+  NT KLQP+T   VF+GY L
Subjt:  NGAAEQKHRSIVDTAIALMNHASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFLGYTL

Query:  EYKGYLCYNMETKKMLVSRHVVFHEDVFPFAIRPVTRSHSHSHTLPSQP-------------QPNPAVVTSLLSSQSS----------------LVVRVL
         YKG LCYN  T K LVSRHV+  E +FPF          H  +LPS P              P  A   S+L+S SS                +V    
Subjt:  EYKGYLCYNMETKKMLVSRHVVFHEDVFPFAIRPVTRSHSHSHTLPSQP-------------QPNPAVVTSLLSSQSS----------------LVVRVL

Query:  PNNSSVG--DVLLSP------INTPSTSDVQPSPVLPTTDVLPSSASHS---SVVVASANSSPSQSAAISANNQSDPPAAVTINSHPMITRSKAGISKRK
           SSVG  D+L SP         P  S+ Q   +LP+ D + SS+  S   SV V SA   P                     SH M+TRSK G+ ++K
Subjt:  PNNSSVG--DVLLSP------INTPSTSDVQPSPVLPTTDVLPSSASHS---SVVVASANSSPSQSAAISANNQSDPPAAVTINSHPMITRSKAGISKRK

Query:  AFSAVI-------PKSIPEPTSFTAASKIPEWKRAMLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDET
         FS  +         S+ EP+S+  AS   +W +AM +E +AL  Q TW LVP   + NIVG KW+++ K + DGS++RYKARLVA+G++Q  G D++ET
Subjt:  AFSAVI-------PKSIPEPTSFTAASKIPEWKRAMLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDET

Query:  FSPVVKKTTVRVILALAAHYGWSLHQLDVKNAFLHGYLKEDVYMVQPPGFRDSNYPNHT-----------------------------------------
        FSPVV+  TVR+IL+LAA   WSL QLDVKNAFLHG L+E+VYM QP GF D ++P++                                          
Subjt:  FSPVVKKTTVRVILALAAHYGWSLHQLDVKNAFLHGYLKEDVYMVQPPGFRDSNYPNHT-----------------------------------------

Query:  --------------------------------TLGREFQLTDLGSLRYFIGLEITRFSDG-FHVTQLKYLTDLLEKTGMADSKTCSTPMSTTCEL---HG
                                         LG  F+L D+G L YF+GL+I+  S+G   V+Q KY TDLL K+GM+  K C TP+    ++    G
Subjt:  --------------------------------TLGREFQLTDLGSLRYFIGLEITRFSDG-FHVTQLKYLTDLLEKTGMADSKTCSTPMSTTCEL---HG

Query:  VSPFFSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQFMHKPTEIHYSAVKRILRYLNGTRDCGILF-SKGKLELTAYSDADWAGDSMDRRSTSGYVVFF
        +     D   YR IVG+LQYLTFTRPDI Y V+ V QFM+ PTE H+  VKRILRYL GT   G  + S G L L+AYSD+DWAGD   RRST+G+VV+ 
Subjt:  VSPFFSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQFMHKPTEIHYSAVKRILRYLNGTRDCGILF-SKGKLELTAYSDADWAGDSMDRRSTSGYVVFF

Query:  CGIPVSWSAKKQSTVSRSSTEAEYRSLAQTAAELYWIRQLLCDLCVFLPTVPLLFCD
           PVSW +KKQ +VSRSSTEAEYR+LA  +A++ WIRQLL DL V+L   P+L+CD
Subjt:  CGIPVSWSAKKQSTVSRSSTEAEYRSLAQTAAELYWIRQLLCDLCVFLPTVPLLFCD

CCH50966.1 T4.5 [Malus x robusta]5.3e-25438.38Show/hide
Query:  SSSSSVSDNSSGCSTQPNSSIFL-----------------LSNICNLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEVN
        S+++S SD+ S     PNS   L                 ++N+  +VP +L+  N++ W+     +LK   L G+V+G   CPP F+ D  G  T   N
Subjt:  SSSSSVSDNSSGCSTQPNSSIFL-----------------LSNICNLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEVN

Query:  SAHTQWIAQDSALITLINATLSKTAYSYVIGCKSSKEVWLSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLL
        ++   W  +D  L+  IN+TLSK      IG + S+ +W SLE++FS  +R+HVH L+S + TI K    S+ ++L  +KE+ +KLA    P+ + DL+ 
Subjt:  SAHTQWIAQDSALITLINATLSKTAYSYVIGCKSSKEVWLSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLL

Query:  YTLNGLSSEYNSFRTSIRTREGTLKLQELHALLKSEAKILEQQNKSITTTPLVP----TAMFSSVSGQFNSNRGRGRGRNSNPSYWSGRGGGRSNQGYYG
        Y L+GL  EY SF  SI TR  ++   ELH LL S+   L+++    +++   P     A  S+  G FN    RGR  N N    +   GG     ++ 
Subjt:  YTLNGLSSEYNSFRTSIRTREGTLKLQELHALLKSEAKILEQQNKSITTTPLVP----TAMFSSVSGQFNSNRGRGRGRNSNPSYWSGRGGGRSNQGYYG

Query:  NSNQGRGGFNSSNPSPGFPQGHSAPNQGRGANSGSNSSSGRGVICQICNRSGHGALDCYNRLNLSFQGRHPPSKLAAMASIYDPQSSNNSGNNNCTWLAD
        N++   GG   + PS                 +G +SSSG  V CQ+C + GH A  C NRL+  F     P+ ++AM S   P            WL D
Subjt:  NSNQGRGGFNSSNPSPGFPQGHSAPNQGRGANSGSNSSSGRGVICQICNRSGHGALDCYNRLNLSFQGRHPPSKLAAMASIYDPQSSNNSGNNNCTWLAD

Query:  SGCNSHVTPDLSTLALNSNYNGEDAITVANGQGVPITQTGSGTLSTSHSDLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWFSIQDKTSGRILYK
        SG + HVTPD S L     Y+G D + V +G+G+ I+ TGS  + T H+   L+ +L VP  S NLLSV +   DN C   FD   F ++D ++G++L++
Subjt:  SGCNSHVTPDLSTLALNSNYNGEDAITVANGQGVPITQTGSGTLSTSHSDLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWFSIQDKTSGRILYK

Query:  GKSKDGLYPISSISSAGSSLRSNSTVAGLHTSAPLSPICASVFVSHVPSTVLWHLRLGHPSFPVLQKLLSANSI-TCDTKYTCRDCVSCLKGKASKLPFA
        G S+ GLYP    +S G        V+G+     +SP    +  + + +   WH RLGHPS   L  ++  N +           C +C  GK+ +L F+
Subjt:  GKSKDGLYPISSISSAGSSLRSNSTVAGLHTSAPLSPICASVFVSHVPSTVLWHLRLGHPSFPVLQKLLSANSI-TCDTKYTCRDCVSCLKGKASKLPFA

Query:  SSTSITTRPLALLHSDVWGPSPLISVSGFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVNHYVYEFFSSKGVLH
        +    ++RPL LLH+DVWGPSP  S +G+R+Y+  VDDF+K++W++PL  KSDV S +K F+  ++  L   +Q  RSD GGEF+N  +  FF+ +G+ H
Subjt:  SSTSITTRPLALLHSDVWGPSPLISVSGFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVNHYVYEFFSSKGVLH

Query:  QRSCPHTPEQNGAAEQKHRSIVDTAIALMNHASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYPLLKPYNTHKLQPKT
        Q SC HT EQNG AE+KHR +V+    L++ + +P +FW  AF TVV+L+NRLP  +   ++P++ L+   P    L+ FGCACYP L+PY+  KL  K+
Subjt:  QRSCPHTPEQNGAAEQKHRSIVDTAIALMNHASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYPLLKPYNTHKLQPKT

Query:  AQHVFLGYTLEYKGYLCYNMETKKMLVSRHVVFHEDVFPFAIRPVTRSHSHSHTLPSQPQP----NPAVVTSLLSSQSSLVVRVLPNNSSVGDVLLSPIN
         Q VFLGY+L + GY C++  + ++ +SRHVVF E +FP+       SH HS  + S   P    +  +  S L  QSS    +   N+S   +  +  +
Subjt:  AQHVFLGYTLEYKGYLCYNMETKKMLVSRHVVFHEDVFPFAIRPVTRSHSHSHTLPSQPQP----NPAVVTSLLSSQSSLVVRVLPNNSSVGDVLLSPIN

Query:  TPSTSDVQPSPVLPTTDVLPSSASHSSVVVASANSSPSQSAAISANNQSDPPAAVTINSHPMITRSKAGISKRKAFSAV---IPKSI-------PEPTSF
        T                 +PSSA   S+     +SSP++   +        P  + +N+H MITR+KAGI K K F+A    +P ++       P P++F
Subjt:  TPSTSDVQPSPVLPTTDVLPSSASHSSVVVASANSSPSQSAAISANNQSDPPAAVTINSHPMITRSKAGISKRKAFSAV---IPKSI-------PEPTSF

Query:  TAASKIPEWKRAMLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRVILALAAHYGWS
          ASK   W  AM  E+ AL +  TW LVP+    NIVGCKWVF+ K  PDG++ RYKARLVAKG++Q+EG+DF ETFSPV K TT+R++L++A  Y W 
Subjt:  TAASKIPEWKRAMLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRVILALAAHYGWS

Query:  LHQLDVKNAFLHGYLKEDVYMVQPPGFRDSNYPNH-----------------------------------------------------------------
        +HQLDV NAFLHG+LKEDVYMVQPPGF D + P+H                                                                 
Subjt:  LHQLDVKNAFLHGYLKEDVYMVQPPGFRDSNYPNH-----------------------------------------------------------------

Query:  -------TTLGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPMSTTCELHGVSPFFSDALLYRQIVGSLQYLTFTRPDI
               + L   F + DLG + YF+G+E+ +   G  + Q KY  DLL+KT M  +K C+TP+ST+ +L       SD   YR  VG+LQYLT+TRPD+
Subjt:  -------TTLGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPMSTTCELHGVSPFFSDALLYRQIVGSLQYLTFTRPDI

Query:  TYVVSKVSQFMHKPTEIHYSAVKRILRYLNGTRDCGILFSKGKLELTAYSDADWAGDSMDRRSTSGYVVFFCGIPVSWSAKKQSTVSRSSTEAEYRSLAQ
         + V++V Q+MH P  IH  AVKRILRYL GT D G+ F+KG   LTA+SDADWAG  +DRRSTSGY VF     +SWSAKKQ+TV+RSSTEAEYRSLA 
Subjt:  TYVVSKVSQFMHKPTEIHYSAVKRILRYLNGTRDCGILFSKGKLELTAYSDADWAGDSMDRRSTSGYVVFFCGIPVSWSAKKQSTVSRSSTEAEYRSLAQ

Query:  TAAELYWIRQLLCDLCVFLPTVPLLFCD
        TAAE+ W+ ++L D+   L   P+++CD
Subjt:  TAAELYWIRQLLCDLCVFLPTVPLLFCD

TQD93593.1 hypothetical protein C1H46_020801 [Malus baccata]2.4e-27038.94Show/hide
Query:  LLSNICNLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEVNSAHTQ-----WIAQDSALITLINATLSKTAYSYVIGCKS
        L+ ++ N V V+LD SN++ W FQ+  +L+ + +FG VDGS+ CP ++  D+D  + T  NS H       W   D AL+TLI ATLS  A S VIGC+S
Subjt:  LLSNICNLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEVNSAHTQ-----WIAQDSALITLINATLSKTAYSYVIGCKS

Query:  SKEVWLSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGTLKLQELHALLK
        S+++W +L+++FS++TR+ + ++K  L  I K  +ESID YL R+K+  D+LA V V I DED+++  L GL  E+N+ +  IR RE  + L+EL + LK
Subjt:  SKEVWLSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGTLKLQELHALLK

Query:  SEAKILEQQNKS--------------------------------------ITTTPL-----VPTAMFSSVSG-QFNSNRGRGRGRNSNPSYWSGRGGGRS
        +E   L++  K                                       I++TP+      P  MF   S   F S  G G   N   + +  +G G+ 
Subjt:  SEAKILEQQNKS--------------------------------------ITTTPL-----VPTAMFSSVSG-QFNSNRGRGRGRNSNPSYWSGRGGGRS

Query:  NQGYYGNSNQGRGGFNSSNPSPGFPQGHSAPNQGRGANSGSN----------SSSGRGVI------CQICNRSGHGALDCYNRLNLSFQGRHPPSKLAAM
           ++    Q    F   + S  F QG S  +      +                G G +      CQICNR GH A  C++R N  F    PP + ++ 
Subjt:  NQGYYGNSNQGRGGFNSSNPSPGFPQGHSAPNQGRGANSGSN----------SSSGRGVI------CQICNRSGHGALDCYNRLNLSFQGRHPPSKLAAM

Query:  ASIYDPQ---------------------------SSNNSGNNNCT-----------------WLADSGCNSHVTPDLSTLALNSNYNGEDAITVANGQGV
        +  + PQ                           +  NS N++                   WL D G   H+T DLS + + + Y+  D +T ANG+G+
Subjt:  ASIYDPQ---------------------------SSNNSGNNNCT-----------------WLADSGCNSHVTPDLSTLALNSNYNGEDAITVANGQGV

Query:  PITQTGSGTLSTSHSDLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKDGLYPISSISSAGSSLRSNSTVAGLHTSAP
         I   G   L      L L  +L VP LS +LLS+ Q C DNNC  + D     IQDK +  ILY+G S + +YP+  + S                   
Subjt:  PITQTGSGTLSTSHSDLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKDGLYPISSISSAGSSLRSNSTVAGLHTSAP

Query:  LSPICASVFVSHVPSTVLWHLRLGHPSFPVLQKLLSANSI---TCDTKYTCRDCVSCLKGKASKLPFASSTSITTRPLALLHSDVWGPSPLISVSGFRYY
         SP+  + ++    ++ LWH RLGHP+  V++  LS   I     D+ YTC+   +CL+GK + LPF S  S +  P  ++H+DVWGPSP +S+ G+RYY
Subjt:  LSPICASVFVSHVPSTVLWHLRLGHPSFPVLQKLLSANSI---TCDTKYTCRDCVSCLKGKASKLPFASSTSITTRPLALLHSDVWGPSPLISVSGFRYY

Query:  VNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIALMNHA
        V+F+D+ +++TWIFP++ K+ V  +  QF  F+ N  +  +++ +SDGGGE++  +   F   KG+LH +SCP+TP+QNG  E+K+R I +TAI L+  A
Subjt:  VNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIALMNHA

Query:  SVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKMLVSRHVV
         +P +FWYHA AT V+L+NR+P+  +   +PF+ LY   P L HL++FGCACYP LKPY +HKL PKT++ +FLGY  +YKG++C+N +  K++VSRHV+
Subjt:  SVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKMLVSRHVV

Query:  FHEDVFP---FAIRPVTRSHSHS----------HTLPSQPQPNPAVVTSLLSSQ-SSLVVRVLPN--------NSSVGDVLLSPINTPSTSDVQPSPVLP
        F E  FP    A R V+ S   S          H  P    P P V +   SSQ S  V    P+        +S  GD+ + P+++ + SD   S + P
Subjt:  FHEDVFP---FAIRPVTRSHSHS----------HTLPSQPQPNPAVVTSLLSSQ-SSLVVRVLPN--------NSSVGDVLLSPINTPSTSDVQPSPVLP

Query:  TTDVLPSSASHSSVVVASANSSPSQSAAISANNQSDPPAAVTINSHPMITRSKAGISKRK-AFSAVIPKSIPEPTSFTAASKIPEWKRAMLDEYTALTNQ
         T        H S V      S  Q  +++A           +NSHPM TRSK+GI K+K AFS  +     EP+S++AA K+ EW+ AM DE  AL  Q
Subjt:  TTDVLPSSASHSSVVVASANSSPSQSAAISANNQSDPPAAVTINSHPMITRSKAGISKRK-AFSAVIPKSIPEPTSFTAASKIPEWKRAMLDEYTALTNQ

Query:  NTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRVILALAAHYGWSLHQLDVKNAFLHGYLKEDVYMVQ
         TWTLVP   D N+VGCKW+++ K +PDG+VARYKARLVAKG++Q  G+D+ ETFSPVVK TTVR++L+LAA  GW L+QLDVKNAFLHG+L E+VYM Q
Subjt:  NTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRVILALAAHYGWSLHQLDVKNAFLHGYLKEDVYMVQ

Query:  PPGFRDSNYPNHT--------------------------TLG-----------------------------------------------REFQLTDLGSL
        P GF D  +P H                           TLG                                                EF + +LG L
Subjt:  PPGFRDSNYPNHT--------------------------TLG-----------------------------------------------REFQLTDLGSL

Query:  RYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPMSTTCEL--HGVSPFFSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQFMHKPTEIHYS
         YF+GL+I   S G  V Q KY+TDLL K  M   K C TP     +L  HG SP FSD   YR IVG+LQYLTFTRPDI Y V++V QFMH P E H+ 
Subjt:  RYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPMSTTCEL--HGVSPFFSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQFMHKPTEIHYS

Query:  AVKRILRYLNGTRDCGILFSKGKLELTAYSDADWAGDSMDRRSTSGYVVFFCGIPVSWSAKKQSTVSRSSTEAEYRSLAQTAAELYWIRQLLCDLCVFLP
        AVKRILRYL GT   GI F+ G L++ AY+DADWAGD  DRRST+G+VVF    P+SWS+KKQ TVSRSSTEAEYR++A T AE+ W++QLL DL +  P
Subjt:  AVKRILRYLNGTRDCGILFSKGKLELTAYSDADWAGDSMDRRSTSGYVVFFCGIPVSWSAKKQSTVSRSSTEAEYRSLAQTAAELYWIRQLLCDLCVFLP

Query:  TVPLLFCDIFYGRKL
        +VPLL CD  +   L
Subjt:  TVPLLFCDIFYGRKL

TQE01264.1 hypothetical protein C1H46_013171 [Malus baccata]3.4e-27743.15Show/hide
Query:  DSALITLINATLSKTAYSYVIGCKSSKEVWLSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSE
        D A++ LI ATLS TA S VIGC SS ++W+SL+ +FS++T++ + +LK+ L  I K   +S+  YL R+K++ D L+   V  +D+D+++  L GL SE
Subjt:  DSALITLINATLSKTAYSYVIGCKSSKEVWLSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSE

Query:  YNSFRTSIRTREGTLKLQELHALLKSEAKILEQQNKSITTTPLVPTAMFSSVSG-------QFNSNRGRGRGR----NSNPSYWSG--------------
        YN+F+T IR RE  + L+E  + L +E   +E  N SI+ +  V   +  + SG       Q  S+ G  + +     S PS +SG              
Subjt:  YNSFRTSIRTREGTLKLQELHALLKSEAKILEQQNKSITTTPLVPTAMFSSVSG-------QFNSNRGRGRGR----NSNPSYWSG--------------

Query:  RGGGRSNQGYYGNSNQGRGGFNSSNP--SPGFPQGHSAPNQG-----------RG-------ANSGSNSSSGRGVICQICNRSGHGALDCYNRLNLSFQG
        RG GR    ++ NS    G   SS     P  P   + P  G           RG           S S+    V CQIC + GH A+ CY+R N ++QG
Subjt:  RGGGRSNQGYYGNSNQGRGGFNSSNP--SPGFPQGHSAPNQG-----------RG-------ANSGSNSSSGRGVICQICNRSGHGALDCYNRLNLSFQG

Query:  RHPPSKLAAMASIYDPQSSNNSGNNNCTWLADSGCNSHVTPDLSTLALNSNYNGEDAITVANGQGVPITQTGSGTLSTSHSDLNLSKILCVPDLSANLLS
        R PPS L+AM + + P +          W+AD+G  SH+T DLS L L + ++G D +T A+G G+PI+  GS TL T      L  IL VP LS +LLS
Subjt:  RHPPSKLAAMASIYDPQSSNNSGNNNCTWLADSGCNSHVTPDLSTLALNSNYNGEDAITVANGQGVPITQTGSGTLSTSHSDLNLSKILCVPDLSANLLS

Query:  VSQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKDGLYPISSISSAGSSLRSNSTVAGLHTSAPLSPICASVFVSHVPSTVLWHLRLGHPSFPVLQKL
        + Q C DN C F+ D   F IQDK +GRI+ +G  ++GLYPI         +++       H ++     C   ++       LWH RLGHPS  V   +
Subjt:  VSQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKDGLYPISSISSAGSSLRSNSTVAGLHTSAPLSPICASVFVSHVPSTVLWHLRLGHPSFPVLQKL

Query:  LSANSITCDTKYTCRDCVSCLKGKASKLPFASSTSITTRPLALLHSDVWGPSPLISVSGFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQL
        L  + I     +    C SCL+GK +KLPF    + T  PL ++HSDVWGPS  +S+ G+++YV+FVD+ ++FTWIFPL+ KS+V  V   F  F+  Q 
Subjt:  LSANSITCDTKYTCRDCVSCLKGKASKLPFASSTSITTRPLALLHSDVWGPSPLISVSGFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQL

Query:  SCSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIALMNHASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYG
        S +++VF+SDGGGE+ +H   ++   KG+LHQ+SCP+TP+QNG AE+KHR I++TAI L+  AS+P + W+HA A  V+L+NR+    +   +PFQ L+G
Subjt:  SCSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIALMNHASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYG

Query:  CVPDLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKMLVSRHVVFHEDVFPFAIRPVTRSHSHSHTLPSQPQPNPAVVTSL
          P +SHL+VFGCAC+PLLK  N+ KLQPKT+Q +F+GY  +YKGYLC N  T K+ VSRHV+F E  FP++   +T + + SH         P V+ SL
Subjt:  CVPDLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKMLVSRHVVFHEDVFPFAIRPVTRSHSHSHTLPSQPQPNPAVVTSL

Query:  LSSQSSLVVRVLPNNSSVGDVLLSPINTPSTSDVQPSPVLPTTDVLPSSASHSSVVVASANSS---PSQSAAISANNQSDPPAAVTINSHPMITRSKAGI
        ++S ++ VV  +P  +S      SP   P+ S+   S  LP +  + S  S  S     A+S+   P         + S       ++ HPM TRSK+GI
Subjt:  LSSQSSLVVRVLPNNSSVGDVLLSPINTPSTSDVQPSPVLPTTDVLPSSASHSSVVVASANSS---PSQSAAISANNQSDPPAAVTINSHPMITRSKAGI

Query:  SKRKAFSAVIPKS--IPEPTSFTAASKIPEWKRAMLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETF
        SK+K FSA +  S  + EP +F +A KIPEW  AM DE TAL +QNTW+LVP     N+VGCKWV+R K NPDGSVARYKARLVAKGY+Q EGVD+ ETF
Subjt:  SKRKAFSAVIPKS--IPEPTSFTAASKIPEWKRAMLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETF

Query:  SPVVKKTTVRVILALAAHYGWSLHQLDVKNAFLHGYLKEDVYMVQPPGFRDSNYPN--------------------------------------------
        SPVVK TTVR+ILALAA + WSL QLDVKNAFLHG L E+VYM QP GF  S +P+                                            
Subjt:  SPVVKKTTVRVILALAAHYGWSLHQLDVKNAFLHGYLKEDVYMVQPPGFRDSNYPN--------------------------------------------

Query:  -HTTLG-----------------------------REFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPMSTTCELHGVS--
         HT+LG                             +EF L DLG L YF+GL+IT    G  V+Q KY+ DLLEK  + DSK C+TP      L      
Subjt:  -HTTLG-----------------------------REFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPMSTTCELHGVS--

Query:  PFFSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQFMHKPTEIHYSAVKRILRYLNGTRDCGILFSKGKLELTAYSDADWAGDSMDRRSTSGYVVFFCGI
        P+ S    YR IVG+LQYLTFTRPDI + V++  QFMH P + H  AVK ILRYL+GT   G+ F  GKL+L AYSDADWAGD  DRRSTSG++V+    
Subjt:  PFFSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQFMHKPTEIHYSAVKRILRYLNGTRDCGILFSKGKLELTAYSDADWAGDSMDRRSTSGYVVFFCGI

Query:  PVSWSAKKQSTVSRSSTEAEYRSLAQTAAELYWIRQLLCDLCVFLPTVPLLFCDIFYG
        P+SW++KKQ TVSRSSTEAEYR+LA  AAEL W+RQ+LCDLCV L T P+L+CD   G
Subjt:  PVSWSAKKQSTVSRSSTEAEYRSLAQTAAELYWIRQLLCDLCVFLPTVPLLFCDIFYG

WP_081894301.1 DDE-type integrase/transposase/recombinase [Acetobacter malorum]3.6e-25840.57Show/hide
Query:  LLSNICNLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEVNSAHTQWIAQDSALITLINATLSKTAYSYVIGCKSSKEVW
        L+ N+   V V+LD +N+L W +Q++ +L++H + G VDGS  CP  F+ + D       N  +  W   D AL+ LI  TLS TA S +IGC S+ E+W
Subjt:  LLSNICNLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEVNSAHTQWIAQDSALITLINATLSKTAYSYVIGCKSSKEVW

Query:  LSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGTLKLQELHALLKSEAKI
        ++L  +FS++T++ + ++K  L  I K  +ESI +Y  R+K++ D L+   V  DD+D+++  L GL SEYN+FRT IR RE  + L++  A L +E   
Subjt:  LSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGTLKLQELHALLKSEAKI

Query:  LEQQNKSITTTPLVPTAMFSSVSGQFNSNRGRGRGRNSNPSYWSGRGGGRSNQGYYGNSNQG--RGGFNSSN---PSPGFPQGHSAPNQGRGAN------
        +E    S + T    TAM +    Q N ++G+G       S+  G     S   +  +SNQG   G +NS+    PS GF   H+   + RG N      
Subjt:  LEQQNKSITTTPLVPTAMFSSVSGQFNSNRGRGRGRNSNPSYWSGRGGGRSNQGYYGNSNQG--RGGFNSSN---PSPGFPQGHSAPNQGRGAN------

Query:  --SGSN---------------SSSGRGV-ICQICNRS-----------------------------GHGALDCYNRLNLSFQGRHPPSKLAAMASIYDPQ
          SG+N               S  G GV  CQICN+                              GH AL CY+R N S+QGR PPS L  M + Y P 
Subjt:  --SGSN---------------SSSGRGV-ICQICNRS-----------------------------GHGALDCYNRLNLSFQGRHPPSKLAAMASIYDPQ

Query:  SSNNSGNNNCTWLADSGCNSHVTPDLSTLALNSNYNGEDAITVANGQGVPITQTGSGTLSTSHSDLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDAD
        +  +       W+AD+G  SH+T DL+ L   + + G D IT A+G G+PI+ TGS  L        L  IL VP +S +LLS+ + C DNNC F+ D  
Subjt:  SSNNSGNNNCTWLADSGCNSHVTPDLSTLALNSNYNGEDAITVANGQGVPITQTGSGTLSTSHSDLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDAD

Query:  WFSIQDKTSGRILYKGKSKDGLYPISSISSAGSSLRSNSTVAGLHTSAPLSPICASVFVSHVPSTVLWHLRLGHPSFPVLQKLLSANSITCDTKYTCRDC
         F IQDK +G IL +G  +DGLYPI         +  +      HTS  L+    + F+ H  +T LWH RLGHPS  V+  +L+ + I+     +   C
Subjt:  WFSIQDKTSGRILYKGKSKDGLYPISSISSAGSSLRSNSTVAGLHTSAPLSPICASVFVSHVPSTVLWHLRLGHPSFPVLQKLLSANSITCDTKYTCRDC

Query:  VSCLKGKASKLPFASSTSITTRPLALLHSDVWGPSPLISVSGFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVN
        +SCL+GK +KLPF+     + +P  +LHSDVWGPSP +SV G+++YV F+D+ ++FTWIFPL  KS+V  V   F  FI  Q S S++ F+SDGGGE+ +
Subjt:  VSCLKGKASKLPFASSTSITTRPLALLHSDVWGPSPLISVSGFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVN

Query:  HYVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIALMNHASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYP
            +F   KG++H +SCPHTPEQNG AE+KH  IV+TA+ L++ A +P +FW+HA A  V+L+NR+P S +   +P+  L+     L+HL+VFG +CYP
Subjt:  HYVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIALMNHASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYP

Query:  LLKPYNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKMLVSRHVVFHEDVFPFAIRPVTRSHSHSHTLPSQPQP---NPAVVTSLLSSQSSLV------
        LLKPYNT+KLQPKT Q +FLGY  +YKGY+C+N  + +  VSRHVVF+E  FP+          H    PSQ  P    P  +T L++ Q+ +V      
Subjt:  LLKPYNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKMLVSRHVVFHEDVFPFAIRPVTRSHSHSHTLPSQPQP---NPAVVTSLLSSQSSLV------

Query:  -------------VRVLPNNSSVGDVLLSPINTPSTSDVQPSPVLPTTDVLPSSASHSSVVVASANSSPSQSAAISANNQSDPPAAV------TINSHPM
                        L   +S  + L SPI  P  S    SP LP   +  +S S +S+      SSPS  + + A+    P          +++ HPM
Subjt:  -------------VRVLPNNSSVGDVLLSPINTPSTSDVQPSPVLPTTDVLPSSASHSSVVVASANSSPSQSAAISANNQSDPPAAV------TINSHPM

Query:  ITRSKAGISKRKAFSAVIP---KSIPEPTSFTAASKIPEWKRAMLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQR
         TRSK+GI K+KAF A I    +S  EP++F AASKI EW+ AM DE  AL  Q+TW LVP     N+VGCKWV+R K NPDGS+ARYKARLVAKGYNQ 
Subjt:  ITRSKAGISKRKAFSAVIP---KSIPEPTSFTAASKIPEWKRAMLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQR

Query:  EGVDFDETFSPVVKKTTVRVILALAAHYGWSLHQLDVKNAFLHGYLKEDVYMVQPPGFRDSNYPN-----------------------------------
        EG+D+ ETFSPVVK TTVR+ILALAA + WSL QLDVKNAFLHG L E++YM QPPGF   ++P+                                   
Subjt:  EGVDFDETFSPVVKKTTVRVILALAAHYGWSLHQLDVKNAFLHGYLKEDVYMVQPPGFRDSNYPN-----------------------------------

Query:  ----------HTTLG-----------------------------REFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPMSTT
                  HT+LG                             + F + DLG L YF+GL+I+  S G  V+Q KY+ +LL +  M++SK C+TP   +
Subjt:  ----------HTTLG-----------------------------REFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPMSTT

Query:  CEL--HGVSPFFSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQFMHKPTEIHYSAVKRILRYLNGTRDCGILFSKGKLELTAYSDADWAGDSMD
          L      P+ S    YR IVG+LQYLTFTRPDI + V++  QFMH+P   H  AVKRILRYL+GT   GI F  G   L AYSDADWAGD  D
Subjt:  CEL--HGVSPFFSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQFMHKPTEIHYSAVKRILRYLNGTRDCGILFSKGKLELTAYSDADWAGDSMD

TrEMBL top hitse value%identityAlignment
A0A2N9GRJ0 Uncharacterized protein1.1e-30843.87Show/hide
Query:  STQPNSSIFLLSNICNLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEVNSAHTQWIAQDSALITLINATLSKTAYSYVI
        +TQ ++ I LLSNI NLV V+LD++N++ WK+QV S+L+A+SL   +DGS PCP +FL D  G  +  VNS +T+W+++D  L+T++NATLS +  S V+
Subjt:  STQPNSSIFLLSNICNLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEVNSAHTQWIAQDSALITLINATLSKTAYSYVI

Query:  GCKSSKEVWLSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGTLKLQELH
        G KS++ VW +LEK+F+S+ RS++  LK  LH + K+  + +D +L RVKE  DKL  V V I DE++L   L GL +E++S R++IRTR   +   EL 
Subjt:  GCKSSKEVWLSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGTLKLQELH

Query:  ALLKSEAKILEQQNKSITTTPLVPTAMFSSVS--------GQFNSNRGRGRGRNSNPSYWSGRG-GGRSNQGYYGNSNQGRGGFNSSNPSPGFPQGHSAP
         LL +E   L+    +    PL+  AM S+ +         QFN++  RGRGRN+N     GRG GGR+N         GRGGF + N S          
Subjt:  ALLKSEAKILEQQNKSITTTPLVPTAMFSSVS--------GQFNSNRGRGRGRNSNPSYWSGRG-GGRSNQGYYGNSNQGRGGFNSSNPSPGFPQGHSAP

Query:  NQGRGANSGSNSSSGRGVICQICNRSGHGALDCYNRLNLSFQGRHPPSKLAAMAS----------------------------------IYDPQS----S
                G++ +  +   CQIC + GH ALDCY+R++ S+QGRHPP+KLAA+AS                                    +P S    S
Subjt:  NQGRGANSGSNSSSGRGVICQICNRSGHGALDCYNRLNLSFQGRHPPSKLAAMAS----------------------------------IYDPQS----S

Query:  NNSGNNNCTWLADSGCNSHVTPDLSTLALNSNYNGEDAITVANGQGVPITQTGSGTLSTSHSDLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWF
         N   +  TW++D+G   H TPDL+ L    +Y G D +++ NG G+PIT  G   L  S    NL KIL VP +  NLLSV++ C DN C F FDA+ F
Subjt:  NNSGNNNCTWLADSGCNSHVTPDLSTLALNSNYNGEDAITVANGQGVPITQTGSGTLSTSHSDLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWF

Query:  SIQDKTSGRILYKGKSKDGLYPISSISSAGSSLRSNSTVAGLHTSAPLSPICASVFVSHVPSTVLWHLRLGHPSFPVLQKLLSANS-ITCDT-KYTCRDC
        SIQD  SGR LYKG SKDGLYPI  +SS+    + +ST    H+S P +    S F+    +  +WH RLGHP   VL  +L+    ++ +T K++   C
Subjt:  SIQDKTSGRILYKGKSKDGLYPISSISSAGSSLRSNSTVAGLHTSAPLSPICASVFVSHVPSTVLWHLRLGHPSFPVLQKLLSANS-ITCDT-KYTCRDC

Query:  VSCLKGKASKLPFASSTSITTRPLALLHSDVWGPSPLISVSGFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVN
          C++GK  + PF SS+   T PL L+HSDVWGP+P+ S++G R+YV+FVD F++FTW+FP+  KS V +  + F   +EN L+  ++V R+D GGE+ N
Subjt:  VSCLKGKASKLPFASSTSITTRPLALLHSDVWGPSPLISVSGFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVN

Query:  HYVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIALMNHASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYP
             F S++G+LHQ SCPHTP+QNG AE+KHR IV+TA+ L++ +S+PL++W +AF+T ++L+NR+P+  + F +P+Q L+   PD S L+ FGC C+P
Subjt:  HYVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIALMNHASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYP

Query:  LLKPYNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKMLVSRHVVFHEDVFPFAIR--PVTRSHSHSHTLPSQPQPNPAVVTSLLSSQSSL--VVRVLP
        LL+PYN HKL+P+++  VFLGY L  KGYLC N++T K+L+SRHV FHE+ FPF  +  P + S   +  L S    +P    S+L    SL  +    P
Subjt:  LLKPYNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKMLVSRHVVFHEDVFPFAIR--PVTRSHSHSHTLPSQPQPNPAVVTSLLSSQSSL--VVRVLP

Query:  NNSSVGDVLLSPINTPSTSDVQPSPVLPTTDVLPSSASHSSVVVASANSSPSQSAAISANNQSDPPAAVTINSHPMITRSKAGISKRKAF--SAVIPKSI
         +SS+ DV        S     PSP+L TT                  SSP  S  + + +    P    INSHPM TR K+GISKRK    +  +    
Subjt:  NNSSVGDVLLSPINTPSTSDVQPSPVLPTTDVLPSSASHSSVVVASANSSPSQSAAISANNQSDPPAAVTINSHPMITRSKAGISKRKAF--SAVIPKSI

Query:  PEPTSFTAASKIPEWKRAMLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRVILALA
         EP S+  ASK PEW+ AMLDEYTAL  Q TW+LVP   + NIVGCKWV++ K  PDGSVARYKARLVAKGY+Q+ G+D+DETFSPVVK  TVR+IL++A
Subjt:  PEPTSFTAASKIPEWKRAMLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRVILALA

Query:  AHYGWSLHQLDVKNAFLHGYLKEDVYMVQPPGFRDSNYPNHT----------------------------------------------------------
        A + WSL QLDV NAFLHG LKEDVYMVQP GF DS+ P+H                                                           
Subjt:  AHYGWSLHQLDVKNAFLHGYLKEDVYMVQPPGFRDSNYPNHT----------------------------------------------------------

Query:  ---------------TLGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPMSTTCEL-HGVSPFFSDALLYRQIVGSLQY
                        L   F+L DLG L YF+GLE+   + GF V Q KY +DLL+K  M D K CSTP  T+ +L   +     DA  +R +VG+LQY
Subjt:  ---------------TLGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPMSTTCEL-HGVSPFFSDALLYRQIVGSLQY

Query:  LTFTRPDITYVVSKVSQFMHKPTEIHYSAVKRILRYLNGTRDCGILFSKGKLELTAYSDADWAGDSMDRRSTSGYVVFFCGIPVSWSAKKQSTVSRSSTE
        LTFTRPD+ Y V+ + QFMH PT+IH SA KR+LRY+ G+   G+ FS G L+L AYSDADWAGD   RRST+GY+VF    P++W +KKQSTVSRSSTE
Subjt:  LTFTRPDITYVVSKVSQFMHKPTEIHYSAVKRILRYLNGTRDCGILFSKGKLELTAYSDADWAGDSMDRRSTSGYVVFFCGIPVSWSAKKQSTVSRSSTE

Query:  AEYRSLAQTAAELYWIRQLLCDLCVFLPTVPLLFCD
        AEYR+LA  AAE+ W+R +L DL + L + P L+CD
Subjt:  AEYRSLAQTAAELYWIRQLLCDLCVFLPTVPLLFCD

A0A2N9HKM9 Uncharacterized protein7.7e-29143.72Show/hide
Query:  NLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEVNSAHTQWIAQDSALITLINATLSKTAYSYVIGCKSSKEVWLSLEKQ
        +LV V+LD++N++ WK+Q+ S+ + +SL  ++DG+VP P ++L D +G  +   N  + QW A+D AL TLINATLS +A + VIG  +++ VW  LE++
Subjt:  NLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEVNSAHTQWIAQDSALITLINATLSKTAYSYVIGCKSSKEVWLSLEKQ

Query:  FSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGTLKLQELHALLKSEAKILEQQNK
        ++SL+R+H+  LK+ L  + KS TE+I  YL RVKE+ DKL +V V +DDEDLL   L GL +EY+ F +++RTR+  +  +ELH LL SE +  +    
Subjt:  FSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGTLKLQELHALLKSEAKILEQQNK

Query:  SITTTPLVPTAMFSSVSGQFNSNRGRGRGRNSNPSYWSGRGGGRSNQGYYGNSNQGRGGFNSSNPSPGFPQGHSAPNQGRGANSGSNSSSGRGVICQICN
             P +  AM ++ S  F           S P  W+   GGR N    G  N  RGGF+S++      QG ++   G   N  S+ +S R   CQIC 
Subjt:  SITTTPLVPTAMFSSVSGQFNSNRGRGRGRNSNPSYWSGRGGGRSNQGYYGNSNQGRGGFNSSNPSPGFPQGHSAPNQGRGANSGSNSSSGRGVICQICN

Query:  RSGHGALDCYNRLNLSFQGRHPPSKLAAMASIYDPQSSNNSGNNNCTWLADSGCNSHVTPDLSTLALNSNYNGEDAITVANGQGVPITQTGSGTLSTSHS
        ++GH ALDC++R+N ++QGR PP+KLAA+AS     + N   +N  +W++D+G   H TPD+S +    +Y G D +TV NGQ +PIT TG+  L  S  
Subjt:  RSGHGALDCYNRLNLSFQGRHPPSKLAAMASIYDPQSSNNSGNNNCTWLADSGCNSHVTPDLSTLALNSNYNGEDAITVANGQGVPITQTGSGTLSTSHS

Query:  DLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKDGLYPISSISSAGSSLRSNSTVAGLHTSAPLSPICASVFVSHVPS
           L KIL VP +S+NLLSV + C DNN  F FDA  F I+D +SGR+LY G S+ GLYPI       SS +        HTSA  S            S
Subjt:  DLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKDGLYPISSISSAGSSLRSNSTVAGLHTSAPLSPICASVFVSHVPS

Query:  TVLWHLRLGHPSFPVLQKLLSANSITCDTKYTCRDCVSCLKGKASKLPFASSTSITTRPLALLHSDVWGPSPLISVSGFRYYVNFVDDFSKFTWIFPLVR
        + LWH RLGHP   V++ +L  N +      T   C+ CL+GK  KLPF +S SIT+ PL ++HSDVWGP+P+ S +  RYYV FVDDF++FTW FPL  
Subjt:  TVLWHLRLGHPSFPVLQKLLSANSITCDTKYTCRDCVSCLKGKASKLPFASSTSITTRPLALLHSDVWGPSPLISVSGFRYYVNFVDDFSKFTWIFPLVR

Query:  KSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIALMNHASVPLEFWYHAFATVVFLL
        KS V S    F   +EN LSC L++ R+D GGE+  H    F SS GV HQ +CPHT +QNG AE+KHR IVD  + LM+ AS+PL FW +AF+T VFL+
Subjt:  KSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIALMNHASVPLEFWYHAFATVVFLL

Query:  NRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKMLVSRHVVFHEDVFPFAIRPVTRSHS
        NRLPS   G ++P++ L+G  P  S  R FGCACYPLL+PY+ HKL P++ Q +FLGY    KG+LC++  + +  VSRHV F E VFPF     T S S
Subjt:  NRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKMLVSRHVVFHEDVFPFAIRPVTRSHS

Query:  HSHTLPSQPQ-PNPAVVTSLLSSQSSLVVRVL-----PNNSSVGDVLLSPINTPSTSDVQPSPVLPTTDVLPSSASHSSVVVASANSSPSQSAA-ISANN
        H   LP+  Q  NPA +++LL   S  +  +L     P  +S  ++ + P+   S + +   PV+P++    S++S + V  +S    PS S A ++A++
Subjt:  HSHTLPSQPQ-PNPAVVTSLLSSQSSLVVRVL-----PNNSSVGDVLLSPINTPSTSDVQPSPVLPTTDVLPSSASHSSVVVASANSSPSQSAA-ISANN

Query:  QSD--PPAAVTINSHPMITRSKAGISKRKAFSAVIPKSIP-----EPTSFTAASKIPEWKRAMLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNP
         +   P + +  N+HPM TR K+GI+K+K    ++ KS P     EP SF+ A  IP+W  AM  E+ ALT Q+TW+LVP   D +I+GC WVF+ K N 
Subjt:  QSD--PPAAVTINSHPMITRSKAGISKRKAFSAVIPKSIP-----EPTSFTAASKIPEWKRAMLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNP

Query:  DGSVARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRVILALAAHYGWSLHQLDVKNAFLHGYLKEDVYMVQPPGFRDSNYPNH---------------
        DGSVARYKARLVAKG +Q  G+DF ETFSPVVK  TVR+IL++AA   WSL QLDV NAFLHG LKE V+M QPPGF DS  P+H               
Subjt:  DGSVARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRVILALAAHYGWSLHQLDVKNAFLHGYLKEDVYMVQPPGFRDSNYPNH---------------

Query:  ----------------------------------------------------------TTLGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLL
                                                                  T L   F+L DLG L++F+GL+I   + GF V Q KY  D+L
Subjt:  ----------------------------------------------------------TTLGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLL

Query:  EKTGMADSKTCSTPMSTTCELHG-VSPFFSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQFMHKPTEIHYSAVKRILRYLNGTRDCGILFSKGKLELTA
         +  M   K C++P  +  +L   V  F  D   YR +VG+LQYLTFTRPD+++ V+ + Q M  PT  H  A KR+LRY+ GT   GILF  G + LT 
Subjt:  EKTGMADSKTCSTPMSTTCELHG-VSPFFSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQFMHKPTEIHYSAVKRILRYLNGTRDCGILFSKGKLELTA

Query:  YSDADWAGDSMDRRSTSGYVVFFCGIPVSWSAKKQSTVSRSSTEAEYRSLAQTAAELYWIRQLLCDLCVF
        ++DADWAG+ +DRRST+G++VF     ++W++KKQ TVSRSSTEAEYRSLA  AAE+ WIR LLCDL +F
Subjt:  YSDADWAGDSMDRRSTSGYVVFFCGIPVSWSAKKQSTVSRSSTEAEYRSLAQTAAELYWIRQLLCDLCVF

A0A2N9HUP1 Uncharacterized protein8.8e-28742.32Show/hide
Query:  MASSSSSV-------SDNSSGCSTQPNSSIFLLSNICNLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEVNSAHTQWIA
        MA +SSS        ++ SS  S    S I LLSNI NL+  +LDS+N+  WK+Q+ S+ +++SL   +DGS   P  +L D  G  TT+ +  + QW  
Subjt:  MASSSSSV-------SDNSSGCSTQPNSSIFLLSNICNLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEVNSAHTQWIA

Query:  QDSALITLINATLSKTAYSYVIGCKSSKEVWLSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSS
        +D AL TL+NATLS +A S+VI   +++ VW  LE++++SL+R+HV  LK  L  I K   ES+  +L RVKEL DKL+ V V +DDE+LL   L GL S
Subjt:  QDSALITLINATLSKTAYSYVIGCKSSKEVWLSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSS

Query:  EYNSFRTSIRTREGTLKLQELHALLKSEAKILEQQNKSITTTPLVPTAMFSSVSGQFNSNRGRGRGRNSNPSYWSGRGGGRSNQGYYGNSNQGRGGFNSS
        EY++F +++RTR+ ++  +ELH LL SE    E+  K+               S   +S+  RGRG  S     + RG GR N   YGNS   RGGF   
Subjt:  EYNSFRTSIRTREGTLKLQELHALLKSEAKILEQQNKSITTTPLVPTAMFSSVSGQFNSNRGRGRGRNSNPSYWSGRGGGRSNQGYYGNSNQGRGGFNSS

Query:  NPSPGFPQGHSAPNQGRGANSGSNSSSGRGVICQICNRSGHGALDCYNRLNLSFQGRHPPSKLAAMASIYDPQSSNNSGNNNCTWLADSGCNSHVTPDLS
             FPQ     +Q    NS ++ +S     CQIC + GH ALDC++R+N ++QGRHPP+KLAA+AS     + +   +    W++D+G   H TPD++
Subjt:  NPSPGFPQGHSAPNQGRGANSGSNSSSGRGVICQICNRSGHGALDCYNRLNLSFQGRHPPSKLAAMASIYDPQSSNNSGNNNCTWLADSGCNSHVTPDLS

Query:  TLALNSNYNGEDAITVANGQGVPITQTGSGTLSTSHSDLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKDGLYPISS
         +     Y G D++TV NGQ +PIT TG+  L  S    NL K+L VP +S++LLSV + C DN+  F FDA  F I+D  SG++LY G S+ GLYP+  
Subjt:  TLALNSNYNGEDAITVANGQGVPITQTGSGTLSTSHSDLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKDGLYPISS

Query:  ISSAGSSLRSNSTVAGLHTSAPLSPICASVFVSHVPSTVLWHLRLGHPS----FPVLQKLLSANSITCDTKYTCRDCVSCLKGKASKLPFASSTSITTRP
            G+ L  +S+              +S   S   S  LWH RLGHP       VL K L  NS++    +    C  C++GK  +LPF  S SITTRP
Subjt:  ISSAGSSLRSNSTVAGLHTSAPLSPICASVFVSHVPSTVLWHLRLGHPS----FPVLQKLLSANSITCDTKYTCRDCVSCLKGKASKLPFASSTSITTRP

Query:  LALLHSDVWGPSPLISVSGFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPHTPE
        L L+H+DVWGP+P+ S +G RYYV+F+DDF++FTW FPL  KS V    K F   +EN L C +++ RSD GGE+       F SS G+LHQ SCPHT +
Subjt:  LALLHSDVWGPSPLISVSGFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPHTPE

Query:  QNGAAEQKHRSIVDTAIALMNHASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFLGYT
        QNG AE+KHR IVD A+ L++ +S+PL  W +AF+T VFL+NRLPS +    +P++ L+G  PD    RVFGC CYPLL+ Y+ HKLQP++   VFLGY 
Subjt:  QNGAAEQKHRSIVDTAIALMNHASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFLGYT

Query:  LEYKGYLCYNMETKKMLVSRHVVFHEDVFPFAIRPVTRSHSHSHTLPSQPQP----------NPAVVTSLLSSQSSLVVRVLPNNSSVGDVLLSPINTPS
           KG+LCY+    +  VSRHV F E  FP+   P   S+S S +  +              +P  V S+L    S     L ++S V  V L      S
Subjt:  LEYKGYLCYNMETKKMLVSRHVVFHEDVFPFAIRPVTRSHSHSHTLPSQPQP----------NPAVVTSLLSSQSSLVVRVLPNNSSVGDVLLSPINTPS

Query:  TSDVQPSPVLPTTDVLPSSASHSSVVVASANSSPSQSAAISANNQSDPPAAVTINSHPMITRSKAGISKRKAFSAVIPKSIP--------EPTSFTAASK
         S +   P  P  D  P            A  +P   A + A      P   + N HPM TR+K+GI+KRK        S+P        EPT++T A K
Subjt:  TSDVQPSPVLPTTDVLPSSASHSSVVVASANSSPSQSAAISANNQSDPPAAVTINSHPMITRSKAGISKRKAFSAVIPKSIP--------EPTSFTAASK

Query:  IPEWKRAMLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRVILALAAHYGWSLHQLD
        IP+W   M  E+ AL  Q TWTLVPS    +++GC+WVF+ K N DGSVAR+KARLVAKG +Q+ G+DFDETFSPVVK  TVR++L+LAA Y WSL QLD
Subjt:  IPEWKRAMLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRVILALAAHYGWSLHQLD

Query:  VKNAFLHGYLKEDVYMVQPPGFRDSNYPNHT---------------------------------------------------------------------
        V NAFLHG LKE V+M QPPGF D N+P+H                                                                      
Subjt:  VKNAFLHGYLKEDVYMVQPPGFRDSNYPNHT---------------------------------------------------------------------

Query:  ----TLGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTP-MSTTCELHGVSPFFSDALLYRQIVGSLQYLTFTRPDITYV
             L + F+L DLG L+YF+GL++   + G  V Q KY TDLL+K  M+  K CSTP +  +  +   S   SD   YR +VG+LQYLTFTRPD+++ 
Subjt:  ----TLGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTP-MSTTCELHGVSPFFSDALLYRQIVGSLQYLTFTRPDITYV

Query:  VSKVSQFMHKPTEIHYSAVKRILRYLNGTRDCGILFSKGKLELTAYSDADWAGDSMDRRSTSGYVVFFCGIPVSWSAKKQSTVSRSSTEAEYRSLAQTAA
        ++ + Q MH+PT  H  A KR+LRY+ GT   GILF  G L LTA++D+DWAG+ +DRRST+G+++F     ++W++KKQ TVSRSSTEAEYR+LA  AA
Subjt:  VSKVSQFMHKPTEIHYSAVKRILRYLNGTRDCGILFSKGKLELTAYSDADWAGDSMDRRSTSGYVVFFCGIPVSWSAKKQSTVSRSSTEAEYRSLAQTAA

Query:  ELYWIRQLLCDLCVFLPTVPLLFCD
        EL WIR LL DL +F+   P+L+CD
Subjt:  ELYWIRQLLCDLCVFLPTVPLLFCD

A0A2N9IE26 Uncharacterized protein1.4e-28942.39Show/hide
Query:  SDNSSGCSTQPNSSIFLLSNICNLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEVNSAHTQWIAQDSALITLINATLSK
        ++ SS  S    S I LLSNI NL+  +LDS+N+  WK+Q+ S+ +++SL   +DGS   P  +L D  G  TT+ +  + QW  +D AL TL+NATLS 
Subjt:  SDNSSGCSTQPNSSIFLLSNICNLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEVNSAHTQWIAQDSALITLINATLSK

Query:  TAYSYVIGCKSSKEVWLSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGT
        +A S V+   +++ VW  LE++++SL+R+H+  LK  L  I K   ES+  +L RVKEL DKL+ V V +DDE+LL   L GL SEY++F +++RTR+ +
Subjt:  TAYSYVIGCKSSKEVWLSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGT

Query:  LKLQELHALLKSEAKILEQQNKSITTTPLVPTAMFSSVSGQ--------FNS--NRGRGRGRNSNPSYWSGRGGGRSNQGYYGNSNQGRGGFNSSNPSPG
        +  +ELH LL SE +   ++N    ++ +   AM ++ S          F+S  NRGRG GR+ N   + GRG     +G YGNS   RGGF        
Subjt:  LKLQELHALLKSEAKILEQQNKSITTTPLVPTAMFSSVSGQ--------FNS--NRGRGRGRNSNPSYWSGRGGGRSNQGYYGNSNQGRGGFNSSNPSPG

Query:  FPQGHSAPNQGRGANSGSNSSSGRGVICQICNRSGHGALDCYNRLNLSFQGRHPPSKLAAMASIYDPQSSNNSGNNNCTWLADSGCNSHVTPDLSTLALN
        FPQ +  PN    + + S S + R   CQIC + GH ALDC++R+N ++QGRHPP+KLAA+AS     + +   +    W++D+G   H TPD++ +   
Subjt:  FPQGHSAPNQGRGANSGSNSSSGRGVICQICNRSGHGALDCYNRLNLSFQGRHPPSKLAAMASIYDPQSSNNSGNNNCTWLADSGCNSHVTPDLSTLALN

Query:  SNYNGEDAITVANGQGVPITQTGSGTLSTSHSDLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKDGLYPISSISSAG
          Y G D +TV NGQ +PIT T +  L  S    NL K+L VP +S++LLSV + C DN+  F FDA  F I+   SG++LY G S+ GLYP+      G
Subjt:  SNYNGEDAITVANGQGVPITQTGSGTLSTSHSDLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKDGLYPISSISSAG

Query:  SSLRSNSTVAGLHTSAPLSPICASVFVSHVPSTVLWHLRLGHPS----FPVLQKLLSANSITCDTKYTCRDCVSCLKGKASKLPFASSTSITTRPLALLH
        + L ++S+              +S   S   S  LWH RLGHP       VL K    NS++    +    C  C++GK  +LPF  S SITTRPL L+H
Subjt:  SSLRSNSTVAGLHTSAPLSPICASVFVSHVPSTVLWHLRLGHPS----FPVLQKLLSANSITCDTKYTCRDCVSCLKGKASKLPFASSTSITTRPLALLH

Query:  SDVWGPSPLISVSGFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPHTPEQNGAA
        +DVWGP+P+ S +G RYYV+F+DDF++FTW FPL  KS V    K F   +EN L C +++ RSD GGE+       F SS G+LHQ SCPHT +QNG A
Subjt:  SDVWGPSPLISVSGFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPHTPEQNGAA

Query:  EQKHRSIVDTAIALMNHASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFLGYTLEYKG
        E+KHR IVD A+ L++ +S+PL  W +AF+T VFL+NRLPS +  F +P++ L+G  PD    RVFGC CYPLL+ Y+ HKLQP++   VFLGY    KG
Subjt:  EQKHRSIVDTAIALMNHASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFLGYTLEYKG

Query:  YLCYNMETKKMLVSRHVVFHEDVFPFAIRPVTRSHSHSHTLPSQPQP----------NPAVVTSLLSSQSSLVVRVLPNNSSVGDVLLSPINTPSTSDVQ
        +LCY+    +  VSRHV F E  FP+   P   S+S S +  +              +P  V S+L    S     L ++S V  V L      S+S + 
Subjt:  YLCYNMETKKMLVSRHVVFHEDVFPFAIRPVTRSHSHSHTLPSQPQP----------NPAVVTSLLSSQSSLVVRVLPNNSSVGDVLLSPINTPSTSDVQ

Query:  PSPVLPTTDVLPSSASHSSVVVASANSSPSQSAAISANNQSDPPAAVTINSHPMITRSKAGISKRKAFSAVIPKSIP--------EPTSFTAASKIPEWK
          P  P +D  P            A  +P   A +SA      P   + N HPM TR+K+GI+KRK        S+P        EPT++T A KIP+W 
Subjt:  PSPVLPTTDVLPSSASHSSVVVASANSSPSQSAAISANNQSDPPAAVTINSHPMITRSKAGISKRKAFSAVIPKSIP--------EPTSFTAASKIPEWK

Query:  RAMLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRVILALAAHYGWSLHQLDVKNAF
         AM  E+ AL  Q TWTLVPS    +++GC+WVF+ K N DGSVAR+KARLVAKG +Q+ G+DFDETFSPVVK  TVR++L+LAA YGWSL QLDV NAF
Subjt:  RAMLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRVILALAAHYGWSLHQLDVKNAF

Query:  LHGYLKEDVYMVQPPGFRDSNYPNHT-------------------------------------------------------------------------T
        LHG LKE V+M QPPGF D N+P+H                                                                           
Subjt:  LHGYLKEDVYMVQPPGFRDSNYPNHT-------------------------------------------------------------------------T

Query:  LGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTP-MSTTCELHGVSPFFSDALLYRQIVGSLQYLTFTRPDITYVVSKVS
        L + F+L DLG L+YF+GL++     G  V Q KY TDLL+K  M+  K CSTP +  +  +   S   SD   YR +VG+LQYLTFTRPD+++ ++ + 
Subjt:  LGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTP-MSTTCELHGVSPFFSDALLYRQIVGSLQYLTFTRPDITYVVSKVS

Query:  QFMHKPTEIHYSAVKRILRYLNGTRDCGILFSKGKLELTAYSDADWAGDSMDRRSTSGYVVFFCGIPVSWSAKKQSTVSRSSTEAEYRSLAQTAAELYWI
        Q MH+PT  H  A KR+LRY+ GT   GILF  G L LTA++D+DWAG+ +DRRST+G+++F     ++W++KKQ TVSRSSTEAEYR+LA  AAEL WI
Subjt:  QFMHKPTEIHYSAVKRILRYLNGTRDCGILFSKGKLELTAYSDADWAGDSMDRRSTSGYVVFFCGIPVSWSAKKQSTVSRSSTEAEYRSLAQTAAELYWI

Query:  RQLLCDLCVFLPTVPLLFCD
        R LL DL +F+   P+L+CD
Subjt:  RQLLCDLCVFLPTVPLLFCD

A0A2N9IEP2 Uncharacterized protein2.4e-28440.96Show/hide
Query:  IFLLSNICNLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEVNSAHTQWIAQDSALITLINATLSKTAYSYVIGCKSSKE
        + LLSNI NLV V+LD SN++ WK+Q+ S+LKA+S+   VDG+  CPPE+L +++G  + + NS + QWI++D  L+TLIN+TLS TA S V+G  ++  
Subjt:  IFLLSNICNLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEVNSAHTQWIAQDSALITLINATLSKTAYSYVIGCKSSKE

Query:  VWLSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGTLKLQELHALLKSEA
        VW  LEK+++S +RS++  LK  LH+I K +T+SI+ +L ++K+  D+L  V V ID+E++L   L GL  EY++F    R                   
Subjt:  VWLSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGTLKLQELHALLKSEA

Query:  KILEQQNKSITTTPLVPTAMFSSVSGQFNSNRGRGRGRNSNPSYWSGRGGGRSNQGYYGNSNQGR-GGFNSSNPSPGFPQGHSAPNQGRGANSGSNSSSG
                                 G+ + NRGRGR  ++N    SGR GG +N G   + N G  GGFN+                    NS    S  
Subjt:  KILEQQNKSITTTPLVPTAMFSSVSGQFNSNRGRGRGRNSNPSYWSGRGGGRSNQGYYGNSNQGR-GGFNSSNPSPGFPQGHSAPNQGRGANSGSNSSSG

Query:  RGVICQICNRSGHGALDCYNRLNLSFQGRHPPSKLAAMASIYDPQSSNNSGNNNCTWLADSGCNSHVTPDLSTLALNSNYNGEDAITVANGQGVPITQTG
        +   CQIC ++GH ALDCY+R++ S+QG+ PPSKLAAMA+     +SN+  ++   W++D+G   H TPDLST+  +  Y G D  TV NGQ +PIT  G
Subjt:  RGVICQICNRSGHGALDCYNRLNLSFQGRHPPSKLAAMASIYDPQSSNNSGNNNCTWLADSGCNSHVTPDLSTLALNSNYNGEDAITVANGQGVPITQTG

Query:  SGTLSTSHSDLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKDGLYPISSISSAGSSLRSNSTVAGLHTSAPLSPICA
        +  L  S    +L K+L VP +++NLLSV++ C DNNC F+FDA+ F I+D  +G++LY+G SK+GLYPI  +S                    L P C 
Subjt:  SGTLSTSHSDLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKDGLYPISSISSAGSSLRSNSTVAGLHTSAPLSPICA

Query:  SVFVSHVPST-----VLWHLRLGHPSFPVLQKLLSANSI-TCDTKYTCRDCVSCLKGKASKLPFASSTSITTRPLALLHSDVWGPSPLISVSGFRYYVNF
        +   S + ST      +WH RLGHP+  V Q++ S + +    +  T   C  C++GK + LPF  S S   +PL ++HSDVWGPSP+ S  G R+YV F
Subjt:  SVFVSHVPST-----VLWHLRLGHPSFPVLQKLLSANSI-TCDTKYTCRDCVSCLKGKASKLPFASSTSITTRPLALLHSDVWGPSPLISVSGFRYYVNF

Query:  VDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIALMNHASVP
        VD+F++FTW +P+  KS V S    F   ++N L+  +++ R+D GGE+ ++  + F  S G+ HQ +CPHT +QNG AE+KHR IVD A+ L++ +S+P
Subjt:  VDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIALMNHASVP

Query:  LEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKMLVSRHVVFHE
        L FW +AF+T V+L+NR+P S     +P++ L+   P+ + LR FGC CYPL++PYN+HKLQP++ + VFLGY    KGYLCYN+ T+K   SRHV+F +
Subjt:  LEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKMLVSRHVVFHE

Query:  DVFPFAIRPVTRSHSHSHTLPSQPQPNPAVVTSLLSSQSSLVVRVLPNNSSVGDVLLSPINTPSTSDVQPS----PVL-PTTDVLPSSASHSSVVVASAN
         VFPF          H  +    P P P  + + LS  +  +  +L +    G V+ SP   PS     PS    P+L P +D+ P+ +S          
Subjt:  DVFPFAIRPVTRSHSHSHTLPSQPQPNPAVVTSLLSSQSSLVVRVLPNNSSVGDVLLSPINTPSTSDVQPS----PVL-PTTDVLPSSASHSSVVVASAN

Query:  SSPSQSAAISANNQSDPPAAVTINSHPMITRSKAGISKRKAFSAVIPKSI--PEPTSFTAASKIPEWKRAMLDEYTALTNQNTWTLVPSQEDMNIVGCKW
        + P    A S  N    P       HPM TR+K+GI K K F   +       EP ++  AS++P+W+ AM  E+ AL  QNTWTLVPS  + N+VGC+W
Subjt:  SSPSQSAAISANNQSDPPAAVTINSHPMITRSKAGISKRKAFSAVIPKSI--PEPTSFTAASKIPEWKRAMLDEYTALTNQNTWTLVPSQEDMNIVGCKW

Query:  VFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRVILALAAHYGWSLHQLDVKNAFLHGYLKEDVYMVQPPGFRDSNYPNH-------
        V++ K N DGS+ARYKARLVAKGY+Q++G+D+DETFSPVVK  TVR+IL++AA   WSL QLDV NAFLHG LKE+VYM QPPGF D  YP H       
Subjt:  VFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRVILALAAHYGWSLHQLDVKNAFLHGYLKEDVYMVQPPGFRDSNYPNH-------

Query:  ------------------------------------------------------------------TTLGREFQLTDLGSLRYFIGLEITRFSDGFHVTQ
                                                                          T L +EF + DLG L++F+GL+I   S GF V Q
Subjt:  ------------------------------------------------------------------TTLGREFQLTDLGSLRYFIGLEITRFSDGFHVTQ

Query:  LKYLTDLLEKTGMADSKTCSTPMSTTCELHGVSPF-FSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQFMHKPTEIHYSAVKRILRYLNGTRDCGILFS
         KY TDLL K  M+  K CSTP  +   +         D   +R +VG LQYLTFTRPD++Y V+ + QFMH+PT+ H  A KRILRY+ GT   G+ F 
Subjt:  LKYLTDLLEKTGMADSKTCSTPMSTTCELHGVSPF-FSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQFMHKPTEIHYSAVKRILRYLNGTRDCGILFS

Query:  KGKLELTAYSDADWAGDSMDRRSTSGYVVFFCGIPVSWSAKKQSTVSRSSTEAEYRSLAQTAAELYWIRQLLCDLCVFLPTVPLLFCDIFYGRKLFEAIL
         G L LTA++D+DWAGD MDRRST+G +VF    P++W +KKQ TV+RSSTEAEYR+LA  AA+L W+R +L DL +FL   P ++CD      L    +
Subjt:  KGKLELTAYSDADWAGDSMDRRSTSGYVVFFCGIPVSWSAKKQSTVSRSSTEAEYRSLAQTAAELYWIRQLLCDLCVFLPTVPLLFCDIFYGRKLFEAIL

Query:  DHPDTQGADED
         H  T+  + D
Subjt:  DHPDTQGADED

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.9e-9324.44Show/hide
Query:  DSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEVNSAHTQWIAQDSALITLINATLSKTAYSYVIGCKSSKEVWLSLEKQFSSLTRS
        D   +  WKF+++++L    +  +VDG +P                 N     W   +    + I   LS +  ++     +++++  +L+  +   + +
Subjt:  DSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEVNSAHTQWIAQDSALITLINATLSKTAYSYVIGCKSSKEVWLSLEKQFSSLTRS

Query:  HVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRT-REGTLKLQEL-HALLKSEAKILEQQNKSITTT
            L+  L ++  S+  S+  +     EL+ +L      I++ D + + L  L S Y+   T+I T  E  L L  + + LL  E KI    N    T+
Subjt:  HVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRT-REGTLKLQEL-HALLKSEAKILEQQNKSITTT

Query:  PLVPTAMFSSVSGQFNSNRGRGRGRNSNPSYWSGRGGGRSNQGYYGNSNQGRGGFNSSNPSPGFPQGHSAPNQGRGANSGSNSSSGRGVICQICNRSGHG
          V  A+  + +  + +N  + R               +  + + GNS                                        V C  C R GH 
Subjt:  PLVPTAMFSSVSGQFNSNRGRGRGRNSNPSYWSGRGGGRSNQGYYGNSNQGRGGFNSSNPSPGFPQGHSAPNQGRGANSGSNSSSGRGVICQICNRSGHG

Query:  ALDCYNRLNLSFQGRHPPSKLAAMASIYD-----PQSSNNSGNNNCTWLADSGCNSHVTPDLSTLALNSNYNGEDAITVA-NGQGVPITQTGSGTLSTSH
          DC++   +         K    A+ +       + +N S  +NC ++ DSG + H+  D S    +        I VA  G+ +  T+ G   L   H
Subjt:  ALDCYNRLNLSFQGRHPPSKLAAMASIYD-----PQSSNNSGNNNCTWLADSGCNSHVTPDLSTLALNSNYNGEDAITVA-NGQGVPITQTGSGTLSTSH

Query:  SDLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKDGLYPISSISSAGSSLRSNSTVAGLHTSAPLSPICA-SVFVSHV
         ++ L  +L   + + NL+SV +                SI+   SG  +    SK+GL  + +              +G+  + P+    A S+   H 
Subjt:  SDLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKDGLYPISSISSAGSSLRSNSTVAGLHTSAPLSPICA-SVFVSHV

Query:  PSTVLWHLRLGHPSFPVL-----QKLLSANSITCDTKYTCRDCVSCLKGKASKLPFASSTSIT--TRPLALLHSDVWGPSPLISVSGFRYYVNFVDDFSK
         +  LWH R GH S   L     + + S  S+  + + +C  C  CL GK ++LPF      T   RPL ++HSDV GP   +++    Y+V FVD F+ 
Subjt:  PSTVLWHLRLGHPSFPVL-----QKLLSANSITCDTKYTCRDCVSCLKGKASKLPFASSTSIT--TRPLALLHSDVWGPSPLISVSGFRYYVNFVDDFSK

Query:  FTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIALMNHASVPLEFWYH
        +   + +  KSDV S+ + FV   E   +  +     D G E++++ + +F   KG+ +  + PHTP+ NG +E+  R+I + A  +++ A +   FW  
Subjt:  FTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIALMNHASVPLEFWYH

Query:  AFATVVFLLNRLPSSAI--GFMTPFQKLYGCVPDLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKMLVSRHVVFHED--V
        A  T  +L+NR+PS A+     TP++  +   P L HLRVFG   Y  +K     K   K+ + +F+GY  E  G+  ++   +K +V+R VV  E   V
Subjt:  AFATVVFLLNRLPSSAI--GFMTPFQKLYGCVPDLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKMLVSRHVVFHED--V

Query:  FPFAIRPVT-----RSHSHSHTLPSQPQ-------PNPAV----VTSLLSSQSS-----------LVVRVLPNNSSVGDVL--LSPINTPSTSDVQPSPV
           A++  T        S +   P+  +       PN +     +  L  S+ S           ++    PN S   D +  L      +   +  S  
Subjt:  FPFAIRPVT-----RSHSHSHTLPSQPQ-------PNPAV----VTSLLSSQSS-----------LVVRVLPNNSSVGDVL--LSPINTPSTSDVQPSPV

Query:  LPTTDVLPSSASHSSVVVASANSSPSQSAAISANNQSDPPAAVTINSHPMITRSKAGISKRKAFSAV-------------IPKSIPEPTSFTAASKIPEW
            D L  S    +   +  + +      I  +N +       IN      ++K  IS  +  +++             +P S  E            W
Subjt:  LPTTDVLPSSASHSSVVVASANSSPSQSAAISANNQSDPPAAVTINSHPMITRSKAGISKRKAFSAV-------------IPKSIPEPTSFTAASKIPEW

Query:  KRAMLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRVILALAAHYGWSLHQLDVKNA
        + A+  E  A    NTWT+    E+ NIV  +WVF  K+N  G+  RYKARLVA+G+ Q+  +D++ETF+PV + ++ R IL+L   Y   +HQ+DVK A
Subjt:  KRAMLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRVILALAAHYGWSLHQLDVKNA

Query:  FLHGYLKEDVYMVQPPG--------------------------------FRDSNYPNHTT----------------------------------------
        FL+G LKE++YM  P G                                 ++  + N +                                         
Subjt:  FLHGYLKEDVYMVQPPG--------------------------------FRDSNYPNHTT----------------------------------------

Query:  -LGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPMSTTCELHGVSPFFSDALLYRQIVGSLQYLTF-TRPDITYVVSKV
         L  +F++TDL  +++FIG+ I    D  +++Q  Y+  +L K  M +    STP+ +      ++         R ++G L Y+   TRPD+T  V+ +
Subjt:  -LGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPMSTTCELHGVSPFFSDALLYRQIVGSLQYLTF-TRPDITYVVSKV

Query:  SQFMHKPTEIHYSAVKRILRYLNGTRDCGILFSKG---KLELTAYSDADWAGDSMDRRSTSGYVV-FFCGIPVSWSAKKQSTVSRSSTEAEYRSLAQTAA
        S++  K     +  +KR+LRYL GT D  ++F K    + ++  Y D+DWAG  +DR+ST+GY+   F    + W+ K+Q++V+ SSTEAEY +L +   
Subjt:  SQFMHKPTEIHYSAVKRILRYLNGTRDCGILFSKG---KLELTAYSDADWAGDSMDRRSTSGYVV-FFCGIPVSWSAKKQSTVSRSSTEAEYRSLAQTAA

Query:  ELYWIRQLLCDLCVFL
        E  W++ LL  + + L
Subjt:  ELYWIRQLLCDLCVFL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.1e-10827.13Show/hide
Query:  LLDADGRKTTEVNSAHTQWIAQDSALITLINATLSKTAYSYVIGCKSSKEVWLSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLA
        +LD D +K   + +    W   D    + I   LS    + +I   +++ +W  LE  + S T ++   LK  L+ +  S   +   +L     L+ +LA
Subjt:  LLDADGRKTTEVNSAHTQWIAQDSALITLINATLSKTAYSYVIGCKSSKEVWLSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLA

Query:  TVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGTLKLQEL-HALLKSE--AKILEQQNKSITTTPLVPTAMFSSVSGQFNSNRGRGRGRNSNPSYWSG
         + V I++ED  +  LN L S Y++  T+I   + T++L+++  ALL +E   K  E Q +++ T                   RGR   R+SN    SG
Subjt:  TVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGTLKLQEL-HALLKSE--AKILEQQNKSITTTPLVPTAMFSSVSGQFNSNRGRGRGRNSNPSYWSG

Query:  RGGGRSNQGYYGNSNQGRGGFNSSNPSPGFPQGHSAPNQGRGANSGSNSSSGRGVICQICNRSGHGALDCYNRLNLSFQGRHPPSKLAAMASIYDPQSSN
          G   N+    + ++ R  +N + P   F +    P +G+G  SG  +      + Q             N  N+             +  I + +   
Subjt:  RGGGRSNQGYYGNSNQGRGGFNSSNPSPGFPQGHSAPNQGRGANSGSNSSSGRGVICQICNRSGHGALDCYNRLNLSFQGRHPPSKLAAMASIYDPQSSN

Query:  NSGNNNCTWLADSGCNSHVTPDLSTLALNSNYNGEDAITVANGQGVPITQTGSGTLSTSHS---DLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDAD
        +       W+ D+  + H TP      L   Y   D  TV  G        G G +    +    L L  +  VPDL  NL+S      D    +  +  
Subjt:  NSGNNNCTWLADSGCNSHVTPDLSTLALNSNYNGEDAITVANGQGVPITQTGSGTLSTSHS---DLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDAD

Query:  WFSIQDKTSGRILYKGKSKDGLYPISSISSAGSSLRSNSTVAGLHTSAPLSPICASVFVSHVPSTVLWHLRLGHPSFPVLQKLLSANSITCDTKYTCRDC
        W   +      ++ KG ++  LY            R+N+ +     +A    I          S  LWH R+GH S   LQ L   + I+     T + C
Subjt:  WFSIQDKTSGRILYKGKSKDGLYPISSISSAGSSLRSNSTVAGLHTSAPLSPICASVFVSHVPSTVLWHLRLGHPSFPVLQKLLSANSITCDTKYTCRDC

Query:  VSCLKGKASKLPFASSTSITTRPLALLHSDVWGPSPLISVSGFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVN
          CL GK  ++ F +S+      L L++SDV GP  + S+ G +Y+V F+DD S+  W++ L  K  V  V ++F   +E +    L+  RSD GGE+ +
Subjt:  VSCLKGKASKLPFASSTSITTRPLALLHSDVWGPSPLISVSGFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVN

Query:  HYVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIALMNHASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYP
            E+ SS G+ H+++ P TP+ NG AE+ +R+IV+   +++  A +P  FW  A  T  +L+NR PS  + F  P +         SHL+VFGC  + 
Subjt:  HYVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIALMNHASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYP

Query:  LLKPYNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKMLVSRHVVFHEDVFPFAIRPVTRSHSHSHTLPSQPQPNPAVVTSLLSSQSSLVVRVLPNNSS
         +      KL  K+   +F+GY  E  GY  ++   KK++ SR VVF E                            + V +       +   ++PN   
Subjt:  LLKPYNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKMLVSRHVVFHEDVFPFAIRPVTRSHSHSHTLPSQPQPNPAVVTSLLSSQSSLVVRVLPNNSS

Query:  VGDVLLSPINTPSTSDVQPSPVLPTTDVLPSSASHSSVVVASANSSPSQSAAISANNQSDPPAAVTINSHPMITRSKAGISKRKAFSA---VIPKSIPEP
                +  PSTS+  P+    TTD +         V+            +    Q +         H  + RS+    + + + +   V+     EP
Subjt:  VGDVLLSPINTPSTSDVQPSPVLPTTDVLPSSASHSSVVVASANSSPSQSAAISANNQSDPPAAVTINSHPMITRSKAGISKRKAFSA---VIPKSIPEP

Query:  TSFTAASKIPE---WKRAMLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRVILALA
         S       PE     +AM +E  +L    T+ LV   +    + CKWVF+ K + D  + RYKARLV KG+ Q++G+DFDE FSPVVK T++R IL+LA
Subjt:  TSFTAASKIPE---WKRAMLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRVILALA

Query:  AHYGWSLHQLDVKNAFLHGYLKEDVYMVQPPGFR-----------------------------DSNYPNHT-----------------------------
        A     + QLDVK AFLHG L+E++YM QP GF                              DS   + T                             
Subjt:  AHYGWSLHQLDVKNAFLHGYLKEDVYMVQPPGFR-----------------------------DSNYPNHT-----------------------------

Query:  ----------------TLGREFQLTDLGSLRYFIGLEIT--RFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPMS-----------TTCELHGVSPFFSD
                         L + F + DLG  +  +G++I   R S    ++Q KY+  +LE+  M ++K  STP++           TT E  G       
Subjt:  ----------------TLGREFQLTDLGSLRYFIGLEIT--RFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPMS-----------TTCELHGVSPFFSD

Query:  ALLYRQIVGSLQY-LTFTRPDITYVVSKVSQFMHKPTEIHYSAVKRILRYLNGTRDCGILFSKGKLELTAYSDADWAGDSMDRRSTSGYVVFFCGIPVSW
         + Y   VGSL Y +  TRPDI + V  VS+F+  P + H+ AVK ILRYL GT    + F      L  Y+DAD AGD  +R+S++GY+  F G  +SW
Subjt:  ALLYRQIVGSLQY-LTFTRPDITYVVSKVSQFMHKPTEIHYSAVKRILRYLNGTRDCGILFSKGKLELTAYSDADWAGDSMDRRSTSGYVVFFCGIPVSW

Query:  SAKKQSTVSRSSTEAEYRSLAQTAAELYWIRQLLCDLCVFLPTVPLLFCDIFYGRKLFEAILDHPDTQGAD
         +K Q  V+ S+TEAEY +  +T  E+ W+++ L +L +      +++CD      L +  + H  T+  D
Subjt:  SAKKQSTVSRSSTEAEYRSLAQTAAELYWIRQLLCDLCVFLPTVPLLFCDIFYGRKLFEAILDHPDTQGAD

P92519 Uncharacterized mitochondrial protein AtMg008101.5e-4147Show/hide
Query:  LGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPMSTTCELHGVSPFFSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQ
        L   F + DLG + YF+G++I     G  ++Q KY   +L   GM D K  STP+         +  + D   +R IVG+LQYLT TRPDI+Y V+ V Q
Subjt:  LGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPMSTTCELHGVSPFFSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQ

Query:  FMHKPTEIHYSAVKRILRYLNGTRDCGILFSK-GKLELTAYSDADWAGDSMDRRSTSGYVVFF-CGIPVSWSAKKQSTVSRSSTEAEYRSLAQTAAELYW
         MH+PT   +  +KR+LRY+ GT   G+   K  KL + A+ D+DWAG +  RRST+G+  F  C I +SWSAK+Q TVSRSSTE EYR+LA TAAEL W
Subjt:  FMHKPTEIHYSAVKRILRYLNGTRDCGILFSK-GKLELTAYSDADWAGDSMDRRSTSGYVVFF-CGIPVSWSAKKQSTVSRSSTEAEYRSLAQTAAELYW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.7e-22135.99Show/hide
Query:  NSSIFLLSNICNLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFL-LDADGRKTTEVNSAHTQWIAQDSALITLINATLSKTAYSYVIGCK
        N++  L  N+ N+   +L S+N+L W  QV ++   + L G +DGS   PP  +  DA  R    VN  +T+W  QD  + + +   +S +    V    
Subjt:  NSSIFLLSNICNLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFL-LDADGRKTTEVNSAHTQWIAQDSALITLINATLSKTAYSYVIGCK

Query:  SSKEVWLSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGTLKLQELH-AL
        ++ ++W +L K +++ +  HV +L++ L   +K  T++ID+Y+  +    D+LA +  P+D ++ +   L  L  EY      I  ++    L E+H  L
Subjt:  SSKEVWLSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGTLKLQELH-AL

Query:  LKSEAKILEQQNKSITTTPLVPTAMFSSVSGQFNSNRGRGRGRNSNPSYWSGRGGGRSNQGYYGNSNQGRGGFNSSNPSPGFPQGHSAPNQGRGANSGSN
        L  E+KIL     S T  P+   A+         S+R      N+N        G R+N+  Y N N      N++N         S P Q    N   N
Subjt:  LKSEAKILEQQNKSITTTPLVPTAMFSSVSGQFNSNRGRGRGRNSNPSYWSGRGGGRSNQGYYGNSNQGRGGFNSSNPSPGFPQGHSAPNQGRGANSGSN

Query:  SSSGRGVI--CQICNRSGHGALDCYNRLNL--SFQGRHPPSKLA-----AMASIYDPQSSNNSGNNNCTWLADSGCNSHVTPDLSTLALNSNYNGEDAIT
        ++  +  +  CQIC   GH A  C    +   S   + PPS        A  ++  P SSNN       WL DSG   H+T D + L+L+  Y G D + 
Subjt:  SSSGRGVI--CQICNRSGHGALDCYNRLNL--SFQGRHPPSKLA-----AMASIYDPQSSNNSGNNNCTWLADSGCNSHVTPDLSTLALNSNYNGEDAIT

Query:  VANGQGVPITQTGSGTLSTSHSDLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKDGLYPISSISSAGSSLRSNSTVA
        VA+G  +PI+ TGS +LST    LNL  IL VP++  NL+SV + C  N     F    F ++D  +G  L +GK+KD LY     SS   SL ++ +  
Subjt:  VANGQGVPITQTGSGTLSTSHSDLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKDGLYPISSISSAGSSLRSNSTVA

Query:  GLHTSAPLSPICASVFVSHVPSTVLWHLRLGHPSFPVLQKLLSANSIT-CDTKYTCRDCVSCLKGKASKLPFASSTSITTRPLALLHSDVWGPSPLISVS
          H+S                    WH RLGHP+  +L  ++S  S++  +  +    C  CL  K++K+PF+ ST  +TRPL  ++SDVW  SP++S  
Subjt:  GLHTSAPLSPICASVFVSHVPSTVLWHLRLGHPSFPVLQKLLSANSIT-CDTKYTCRDCVSCLKGKASKLPFASSTSITTRPLALLHSDVWGPSPLISVS

Query:  GFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIA
         +RYYV FVD F+++TW++PL +KS V      F   +EN+    +  F SD GGEFV   ++E+FS  G+ H  S PHTPE NG +E+KHR IV+T + 
Subjt:  GFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIA

Query:  LMNHASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKMLV
        L++HAS+P  +W +AFA  V+L+NRLP+  +   +PFQKL+G  P+   LRVFGCACYP L+PYN HKL  K+ Q VFLGY+L    YLC +++T ++ +
Subjt:  LMNHASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKMLV

Query:  SRHVVFHEDVFPFA-----IRPVTRSHSHS------HTL-------------------------PSQPQPNPAVVTSLLSSQSSLVVRVLPNNSSVGDVL
        SRHV F E+ FPF+     + PV      S      HT                          PS P  N  V +S L S  S      P  ++     
Subjt:  SRHVVFHEDVFPFA-----IRPVTRSHSHS------HTL-------------------------PSQPQPNPAVVTSLLSSQSSLVVRVLPNNSSVGDVL

Query:  LSPINTPSTSDVQPSPVLPTTDVLPSSASHS------SVVVASANSSPSQSAAISANNQSDPPAAVTI------------------NSHPMITRSKAGIS
          P   P+ +  Q      T+   P++ S S      S    S++SSPS + + S+++ S  P ++ I                  N+H M TR+KAGI 
Subjt:  LSPINTPSTSDVQPSPVLPTTDVLPSSASHS------SVVVASANSSPSQSAAISANNQSDPPAAVTI------------------NSHPMITRSKAGIS

Query:  K--RKAFSAVIPKSIPEPTSFTAASKIPEWKRAMLDEYTALTNQNTWTLV-PSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETF
        K   K   AV   +  EP +   A K   W+ AM  E  A    +TW LV P    + IVGC+W+F  K+N DGS+ RYKARLVAKGYNQR G+D+ ETF
Subjt:  K--RKAFSAVIPKSIPEPTSFTAASKIPEWKRAMLDEYTALTNQNTWTLV-PSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETF

Query:  SPVVKKTTVRVILALAAHYGWSLHQLDVKNAFLHGYLKEDVYMVQPPGFRDSNYPNHT------------------------------------------
        SPV+K T++R++L +A    W + QLDV NAFL G L +DVYM QPPGF D + PN+                                           
Subjt:  SPVVKKTTVRVILALAAHYGWSLHQLDVKNAFLHGYLKEDVYMVQPPGFRDSNYPNHT------------------------------------------

Query:  -------------------------------TLGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPMSTTCELHGVS-PF
                                        L + F + D   L YF+G+E  R   G H++Q +Y+ DLL +T M  +K  +TPM+ + +L   S   
Subjt:  -------------------------------TLGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPMSTTCELHGVS-PF

Query:  FSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQFMHKPTEIHYSAVKRILRYLNGTRDCGILFSKGK-LELTAYSDADWAGDSMDRRSTSGYVVFFCGIP
         +D   YR IVGSLQYL FTRPDI+Y V+++SQFMH PTE H  A+KRILRYL GT + GI   KG  L L AYSDADWAGD  D  ST+GY+V+    P
Subjt:  FSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQFMHKPTEIHYSAVKRILRYLNGTRDCGILFSKGK-LELTAYSDADWAGDSMDRRSTSGYVVFFCGIP

Query:  VSWSAKKQSTVSRSSTEAEYRSLAQTAAELYWIRQLLCDLCVFLPTVPLLFCD
        +SWS+KKQ  V RSSTEAEYRS+A T++E+ WI  LL +L + L   P+++CD
Subjt:  VSWSAKKQSTVSRSSTEAEYRSLAQTAAELYWIRQLLCDLCVFLPTVPLLFCD

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.0e-20734.5Show/hide
Query:  STQPNSSIFLLSNICNL---VPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFL-LDADGRKTTEVNSAHTQWIAQDSALITLINATLSKTAY
        +T     + + +NI N+      +L S+N+L W  QV ++   + L G +DGS P PP  +  DA  R    VN  +T+W  QD  + + I   +S +  
Subjt:  STQPNSSIFLLSNICNL---VPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFL-LDADGRKTTEVNSAHTQWIAQDSALITLINATLSKTAY

Query:  SYVIGCKSSKEVWLSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGTLKL
          V    ++ ++W +L K +++ +  HV +L+                ++ R     D+LA +  P+D ++ +   L  L  +Y      I  ++    L
Subjt:  SYVIGCKSSKEVWLSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGTLKL

Query:  QELH-ALLKSEAKILEQQNKSITTTPLVPTAMFSSVSGQFNSNRGRGRGRNSNPSYWSGRGGGRSNQGYYGNSNQGRGGFNSSNPSPGFPQGHSAPNQGR
         E+H  L+  E+K+L     ++ +  +VP      ++    ++R     RN N      RG    N+ Y  N+N+     NS  PS          +  R
Subjt:  QELH-ALLKSEAKILEQQNKSITTTPLVPTAMFSSVSGQFNSNRGRGRGRNSNPSYWSGRGGGRSNQGYYGNSNQGRGGFNSSNPSPGFPQGHSAPNQGR

Query:  GANSGSNSSSGRGVICQICNRSGHGALDCYNRLNLSFQGRHPPSKLAAMASIYDPQS--SNNSGNNNCTWLADSGCNSHVTPDLSTLALNSNYNGEDAIT
          N       GR   CQIC+  GH A  C  +L+  FQ      +  +  + + P++  + NS  N   WL DSG   H+T D + L+ +  Y G D + 
Subjt:  GANSGSNSSSGRGVICQICNRSGHGALDCYNRLNLSFQGRHPPSKLAAMASIYDPQS--SNNSGNNNCTWLADSGCNSHVTPDLSTLALNSNYNGEDAIT

Query:  VANGQGVPITQTGSGTLSTSHSDLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKDGLYPISSISSAGSSLRSNSTVA
        +A+G  +PIT TGS +L TS   L+L+K+L VP++  NL+SV + C  N     F    F ++D  +G  L +GK+KD LY     SS   S+ ++    
Subjt:  VANGQGVPITQTGSGTLSTSHSDLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKDGLYPISSISSAGSSLRSNSTVA

Query:  GLHTSAPLSPICASVFVSHVPSTVLWHLRLGHPSFPVLQKLLSANSI-TCDTKYTCRDCVSCLKGKASKLPFASSTSITTRPLALLHSDVWGPSPLISVS
          H+S                    WH RLGHPS  +L  ++S +S+   +  +    C  C   K+ K+PF++ST  +++PL  ++SDVW  SP++S+ 
Subjt:  GLHTSAPLSPICASVFVSHVPSTVLWHLRLGHPSFPVLQKLLSANSI-TCDTKYTCRDCVSCLKGKASKLPFASSTSITTRPLALLHSDVWGPSPLISVS

Query:  GFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIA
         +RYYV FVD F+++TW++PL +KS V      F   +EN+    +    SD GGEFV   + ++ S  G+ H  S PHTPE NG +E+KHR IV+  + 
Subjt:  GFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIA

Query:  LMNHASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKMLV
        L++HASVP  +W +AF+  V+L+NRLP+  +   +PFQKL+G  P+   L+VFGCACYP L+PYN HKL+ K+ Q  F+GY+L    YLC ++ T ++  
Subjt:  LMNHASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKMLV

Query:  SRHVVFHEDVFPFAIRPVTRSHSH-----------SHT--------LPSQPQPNPAVVTSLL--SSQSSLVVRVLPNNSSVGDVLLSPINT----PSTSD
        SRHV F E  FPF+      S S            SHT        LP+ P   P + TS    SS S L    + +++     + SP ++    PS + 
Subjt:  SRHVVFHEDVFPFAIRPVTRSHSH-----------SHT--------LPSQPQPNPAVVTSLL--SSQSSLVVRVLPNNSSVGDVLLSPINT----PSTSD

Query:  VQP-------------SPVLPT---TDVLPSSASHSSVVVASANSSP---------SQSAAISANNQSDPP--------------AAVTINSHPMITRSK
         QP             SP+L         P+S + +S +  S  SSP         S+  + S+++ S PP              A   +N+H M TR+K
Subjt:  VQP-------------SPVLPT---TDVLPSSASHSSVVVASANSSP---------SQSAAISANNQSDPP--------------AAVTINSHPMITRSK

Query:  AGISK--RKAFSAVIPKSIPEPTSFTAASKIPEWKRAMLDEYTALTNQNTWTLV-PSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDF
         GI K  +K   A    +  EP +   A K   W++AM  E  A    +TW LV P    + IVGC+W+F  KFN DGS+ RYKARLVAKGYNQR G+D+
Subjt:  AGISK--RKAFSAVIPKSIPEPTSFTAASKIPEWKRAMLDEYTALTNQNTWTLV-PSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDF

Query:  DETFSPVVKKTTVRVILALAAHYGWSLHQLDVKNAFLHGYLKEDVYMVQPPGFRDSNYPN----------------------------------------
         ETFSPV+K T++R++L +A    W + QLDV NAFL G L ++VYM QPPGF D + P+                                        
Subjt:  DETFSPVVKKTTVRVILALAAHYGWSLHQLDVKNAFLHGYLKEDVYMVQPPGFRDSNYPN----------------------------------------

Query:  -------------------------------HT--TLGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPMSTT--CELH
                                       HT   L + F + +   L YF+G+E  R   G H++Q +Y  DLL +T M  +K  +TPM+T+    LH
Subjt:  -------------------------------HT--TLGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPMSTT--CELH

Query:  GVSPFFSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQFMHKPTEIHYSAVKRILRYLNGTRDCGILFSKGK-LELTAYSDADWAGDSMDRRSTSGYVVF
          +    D   YR IVGSLQYL FTRPD++Y V+++SQ+MH PT+ H++A+KR+LRYL GT D GI   KG  L L AYSDADWAGD+ D  ST+GY+V+
Subjt:  GVSPFFSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQFMHKPTEIHYSAVKRILRYLNGTRDCGILFSKGK-LELTAYSDADWAGDSMDRRSTSGYVVF

Query:  FCGIPVSWSAKKQSTVSRSSTEAEYRSLAQTAAELYWIRQLLCDLCVFLPTVPLLFCD
            P+SWS+KKQ  V RSSTEAEYRS+A T++EL WI  LL +L + L   P+++CD
Subjt:  FCGIPVSWSAKKQSTVSRSSTEAEYRSLAQTAAELYWIRQLLCDLCVFLPTVPLLFCD

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)3.5e-0925.16Show/hide
Query:  IFLLSNICNLVPVRLD--SSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEVNSAHTQWIAQDSALITLINATLS-KTAYSYVIGCKS
        I+ +SNI + +PV LD   SN+  W+    +   +  + G +DG++                  N+    W  +D  +   +  TL+ K      +   +
Subjt:  IFLLSNICNLVPVRLD--SSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEVNSAHTQWIAQDSALITLINATLS-KTAYSYVIGCKS

Query:  SKEVWLSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGTLKLQELHALLK
        S+++WL ++ QF +   +    L S L T        + +Y  ++K+L D L  V VP+ D +L++Y LNGL+ ++++    I+ R+      +   +L+
Subjt:  SKEVWLSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGTLKLQELHALLK

Query:  SEAKILEQQNK--------SITTTPLV-----PTAMFSSVSGQFNSNRGRGRGRNSNPSYWSGRGGGRSNQGYYGNSNQGRGGFNSSNPSPGFPQGHSAP
         E   L++  K        S ++T L      P   F    G     RGRGRG N     + GRGG  S   YY         FNS N  P +   +   
Subjt:  SEAKILEQQNK--------SITTTPLV-----PTAMFSSVSGQFNSNRGRGRGRNSNPSYWSGRGGGRSNQGYYGNSNQGRGGFNSSNPSPGFPQGHSAP

Query:  NQGRGANSGSNSSSGRGV
        N   G     N++ G G+
Subjt:  NQGRGANSGSNSSSGRGV

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.4e-7335.11Show/hide
Query:  SDVQPSPVLPTTDVLPSSASHSSVVVASANSS--PSQSAAISANNQSDPPAAVTINSHPMITRSKAGISKRKAFSAVIPKSIPEPTSFTAASKIPEWKRA
        SD   S    + D++PS+   + V   S ++S   ++  A   +      A++TI+        +       +F   I K+  EP+++  A +   W  A
Subjt:  SDVQPSPVLPTTDVLPSSASHSSVVVASANSS--PSQSAAISANNQSDPPAAVTINSHPMITRSKAGISKRKAFSAVIPKSIPEPTSFTAASKIPEWKRA

Query:  MLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRVILALAAHYGWSLHQLDVKNAFLH
        M DE  A+   +TW +     +   +GCKWV++ K+N DG++ RYKARLVAKGY Q+EG+DF ETFSPV K T+V++ILA++A Y ++LHQLD+ NAFL+
Subjt:  MLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRVILALAAHYGWSLHQLDVKNAFLH

Query:  GYLKEDVYMVQPPGFR----DSNYPN--------------------------------------HT----------------------------------
        G L E++YM  PPG+     DS  PN                                      HT                                  
Subjt:  GYLKEDVYMVQPPGFR----DSNYPN--------------------------------------HT----------------------------------

Query:  -TLGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPM--STTCELHGVSPFFSDALLYRQIVGSLQYLTFTRPDITYVVS
          L   F+L DLG L+YF+GLEI R + G ++ Q KY  DLL++TG+   K  S PM  S T   H    F  DA  YR+++G L YL  TR DI++ V+
Subjt:  -TLGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPM--STTCELHGVSPFFSDALLYRQIVGSLQYLTFTRPDITYVVS

Query:  KVSQFMHKPTEIHYSAVKRILRYLNGTRDCGILF-SKGKLELTAYSDADWAGDSMDRRSTSGYVVFFCGIPVSWSAKKQSTVSRSSTEAEYRSLAQTAAE
        K+SQF   P   H  AV +IL Y+ GT   G+ + S+ +++L  +SDA +      RRST+GY +F     +SW +KKQ  VS+SS EAEYR+L+    E
Subjt:  KVSQFMHKPTEIHYSAVKRILRYLNGTRDCGILF-SKGKLELTAYSDADWAGDSMDRRSTSGYVVFFCGIPVSWSAKKQSTVSRSSTEAEYRSLAQTAAE

Query:  LYWIRQLLCDLCVFLPTVPLLFCD
        + W+ Q   +L + L    LLFCD
Subjt:  LYWIRQLLCDLCVFLPTVPLLFCD

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.8e-1026.02Show/hide
Query:  VPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEVNSAHTQWIAQDSALITLINATLSKTAYSYVI--GCKSSKEVWLSLEKQ
        V + L+  N+  W+   +++  +  + G +DGS               +T       +W  +D  +   I  T++ +    +I  GC +++++WLSLE  
Subjt:  VPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEVNSAHTQWIAQDSALITLINATLSKTAYSYVI--GCKSSKEVWLSLEKQ

Query:  FSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGTLKLQELHALLKSEAKILEQQNK
        F     +   + ++ L T +     S+ EY  ++K L D L  V  PI D  L+++ LNGL+ +Y+     I+ +       E  ++L  E   L  ++K
Subjt:  FSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGTLKLQELHALLKSEAKILEQQNK

Query:  SI---TTTPLVPTAMFSSVSGQ---------FNSNRGRGRGRNSNPSYWSGRGGGRSNQGYYGNSNQGR
        S    T  P +   +F+    Q          NSN GRGR +  N      RGGG S+ G Y N+N  R
Subjt:  SI---TTTPLVPTAMFSSVSGQ---------FNSNRGRGRGRNSNPSYWSGRGGGRSNQGYYGNSNQGR

ATMG00810.1 DNA/RNA polymerases superfamily protein1.1e-4247Show/hide
Query:  LGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPMSTTCELHGVSPFFSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQ
        L   F + DLG + YF+G++I     G  ++Q KY   +L   GM D K  STP+         +  + D   +R IVG+LQYLT TRPDI+Y V+ V Q
Subjt:  LGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPMSTTCELHGVSPFFSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQ

Query:  FMHKPTEIHYSAVKRILRYLNGTRDCGILFSK-GKLELTAYSDADWAGDSMDRRSTSGYVVFF-CGIPVSWSAKKQSTVSRSSTEAEYRSLAQTAAELYW
         MH+PT   +  +KR+LRY+ GT   G+   K  KL + A+ D+DWAG +  RRST+G+  F  C I +SWSAK+Q TVSRSSTE EYR+LA TAAEL W
Subjt:  FMHKPTEIHYSAVKRILRYLNGTRDCGILFSK-GKLELTAYSDADWAGDSMDRRSTSGYVVFF-CGIPVSWSAKKQSTVSRSSTEAEYRSLAQTAAELYW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.8e-2752Show/hide
Query:  MITRSKAGISK-RKAFSAVIPKSI-PEPTSFTAASKIPEWKRAMLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQR
        M+TRSKAGI+K    +S  I  +I  EP S   A K P W +AM +E  AL+   TW LVP   + NI+GCKWVF+TK + DG++ R KARLVAKG++Q 
Subjt:  MITRSKAGISK-RKAFSAVIPKSI-PEPTSFTAASKIPEWKRAMLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQR

Query:  EGVDFDETFSPVVKKTTVRVILALA
        EG+ F ET+SPVV+  T+R IL +A
Subjt:  EGVDFDETFSPVVKKTTVRVILALA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTCTTCATCTAGCGTCTCTGATAACTCTTCTGGCTGTAGTACTCAGCCAAATTCGTCAATTTTTCTTCTATCAAATATATGCAATCTCGTCCCTGTTCGCCT
CGATTCCTCAAATTTCTTGTTCTGGAAGTTTCAAGTTCAGTCAATGCTCAAGGCTCATTCCCTGTTTGGAATCGTTGATGGATCTGTGCCGTGTCCGCCTGAGTTTTTGC
TTGATGCTGACGGAAGGAAGACTACTGAAGTTAACTCTGCACATACTCAGTGGATTGCTCAAGATAGCGCTCTAATCACGTTGATTAACGCTACCCTATCGAAGACTGCT
TATTCATATGTCATTGGATGTAAATCCTCCAAGGAGGTATGGCTTTCTCTTGAAAAACAATTTTCTTCTTTAACGAGGTCTCATGTTCATGAACTCAAGTCATCGCTTCA
TACAATCTCTAAATCTGCCACTGAATCTATTGATGAATACCTGCTTCGAGTGAAAGAATTAGTAGATAAGCTTGCCACTGTCTCAGTACCAATTGATGATGAAGATCTGT
TGTTGTACACCTTAAATGGTTTGTCGTCGGAGTACAATTCATTTCGTACCTCTATTCGAACTCGAGAAGGAACTTTAAAATTACAAGAACTTCATGCCTTGTTGAAGTCA
GAGGCCAAGATTCTCGAGCAGCAAAACAAGTCCATCACTACCACTCCTCTTGTTCCGACTGCGATGTTTTCGTCTGTTTCAGGCCAGTTCAATTCCAATCGAGGACGCGG
TAGAGGCAGAAATTCAAATCCGAGTTACTGGTCAGGTCGCGGAGGTGGACGATCGAATCAGGGATACTATGGGAATTCAAATCAAGGTCGTGGTGGTTTCAATTCCTCTA
ATCCATCCCCTGGTTTTCCTCAGGGACATTCTGCTCCAAATCAGGGACGTGGTGCTAATTCTGGATCTAATTCGAGTTCTGGACGTGGAGTTATATGTCAAATTTGCAAT
CGGTCTGGCCATGGAGCCTTAGATTGTTACAATCGTCTGAATCTTTCGTTCCAGGGAAGACATCCTCCATCAAAGCTTGCTGCAATGGCCTCTATCTATGATCCTCAGTC
GTCTAATAATAGTGGCAACAACAATTGTACTTGGTTGGCTGATAGTGGTTGTAACTCTCATGTTACACCTGATCTGTCTACTCTTGCTCTGAATTCAAACTATAATGGTG
AGGATGCTATCACGGTTGCAAATGGGCAAGGTGTCCCTATTACTCAAACAGGTTCTGGCACACTCTCCACCTCCCACAGTGACCTTAATCTGTCGAAAATTCTTTGTGTT
CCTGATCTCTCAGCCAATTTGTTATCTGTTTCACAATGTTGCCTTGATAACAACTGCATCTTTGTTTTCGATGCTGATTGGTTTTCCATACAGGACAAAACCTCGGGACG
AATTTTATACAAGGGCAAGAGTAAGGATGGGTTATATCCCATATCTTCCATATCTTCGGCTGGTTCTTCTTTAAGAAGTAACTCTACAGTGGCTGGTTTGCATACGTCTG
CACCTTTATCTCCTATATGTGCATCTGTTTTTGTTTCTCATGTTCCTAGTACTGTGTTGTGGCATTTACGTCTTGGACATCCCTCTTTTCCTGTTTTGCAAAAGTTGTTG
TCTGCAAATTCTATTACTTGTGATACTAAGTATACTTGTCGAGACTGTGTGAGTTGTCTCAAAGGGAAAGCTTCTAAGCTTCCTTTTGCATCCTCTACTTCCATTACTAC
TAGGCCTCTTGCCCTGTTGCATAGTGATGTGTGGGGACCATCACCTTTAATCTCTGTTTCGGGCTTTCGATACTATGTCAATTTTGTTGATGACTTCAGCAAGTTTACTT
GGATATTTCCTTTAGTACGAAAATCTGATGTTTCTTCTGTTATAAAGCAGTTTGTTCCTTTTATTGAAAATCAGTTATCTTGCTCTTTACAAGTATTTCGATCAGATGGA
GGGGGTGAGTTCGTTAACCATTATGTGTATGAGTTCTTCTCATCCAAAGGTGTTCTACATCAGAGGAGTTGCCCTCATACACCTGAACAGAATGGGGCTGCTGAGCAAAA
GCATCGATCTATTGTTGACACTGCTATTGCTCTTATGAATCATGCATCCGTTCCTTTAGAATTTTGGTATCATGCTTTTGCTACTGTTGTGTTTTTGTTGAATCGTCTGC
CTTCTAGTGCTATTGGCTTTATGACTCCGTTTCAGAAGTTATATGGTTGTGTGCCTGATTTATCTCATTTGCGTGTGTTTGGTTGTGCCTGTTATCCACTGTTAAAGCCT
TACAACACTCATAAGTTACAACCAAAAACTGCTCAGCATGTTTTCTTGGGCTATACCTTGGAATACAAAGGGTATCTATGTTATAATATGGAAACCAAGAAGATGTTGGT
CTCTCGGCATGTTGTATTTCATGAAGATGTGTTCCCATTTGCGATTAGACCTGTCACACGGTCTCACAGTCACAGCCACACCCTCCCTTCACAGCCACAACCAAATCCTG
CTGTTGTAACTTCGTTGTTGTCTTCACAATCAAGTCTTGTTGTTCGTGTACTACCAAATAATTCTAGTGTTGGTGATGTTTTATTATCACCAATTAACACACCATCTACA
AGTGATGTTCAACCATCACCTGTCTTGCCTACTACTGATGTTTTACCATCTTCAGCTTCTCATTCTTCTGTTGTTGTGGCATCAGCAAACAGTAGTCCTAGTCAGTCTGC
TGCTATATCAGCAAACAATCAGTCTGATCCACCTGCAGCTGTTACAATCAATAGTCACCCTATGATTACTCGCTCAAAAGCCGGTATTTCCAAACGGAAGGCTTTTAGTG
CTGTTATTCCAAAGTCTATACCTGAACCAACTTCGTTCACAGCTGCTTCTAAAATCCCTGAGTGGAAACGTGCTATGTTGGATGAGTATACTGCCCTAACAAATCAGAAT
ACTTGGACTCTTGTTCCATCTCAAGAAGACATGAATATTGTTGGTTGTAAGTGGGTCTTTCGCACTAAATTTAATCCAGATGGGTCAGTTGCTCGTTACAAAGCCCGTTT
GGTTGCTAAAGGCTACAATCAACGTGAGGGTGTTGACTTCGATGAAACCTTTAGTCCTGTAGTCAAGAAAACTACTGTGCGAGTTATACTTGCTTTGGCAGCTCATTATG
GGTGGTCTTTACATCAGTTAGATGTTAAGAACGCCTTTCTCCATGGCTATCTTAAGGAAGATGTATACATGGTTCAGCCCCCCGGCTTTAGGGATTCAAATTATCCTAAT
CACACAACTCTTGGTCGGGAGTTTCAACTTACTGATCTCGGTTCTCTTCGTTATTTTATTGGTCTCGAAATTACTCGATTTTCTGATGGTTTTCATGTAACTCAGTTGAA
ATATCTTACTGATTTACTAGAAAAGACTGGTATGGCAGACTCAAAGACATGTTCAACTCCCATGTCTACTACGTGTGAGCTTCATGGTGTTTCTCCATTTTTTTCTGATG
CTTTGCTTTATAGACAGATTGTTGGTTCCTTACAATACTTGACCTTTACTAGACCCGACATTACCTATGTTGTAAGTAAAGTGAGTCAGTTTATGCACAAACCCACTGAA
ATACATTATTCTGCTGTGAAGCGTATTCTTCGCTATCTTAATGGCACTCGAGATTGTGGAATTCTGTTTTCCAAAGGGAAACTTGAACTCACTGCCTATAGTGATGCTGA
TTGGGCAGGAGATTCCATGGATCGACGGTCTACTTCTGGATATGTGGTCTTTTTTTGTGGTATTCCGGTTTCTTGGTCTGCTAAAAAGCAAAGCACGGTGTCTCGGTCTT
CAACTGAGGCTGAATATCGCTCTCTTGCACAAACGGCTGCTGAACTATATTGGATAAGACAATTACTGTGTGATCTTTGTGTTTTTCTTCCTACAGTTCCTTTATTGTTT
TGTGATATTTTCTACGGGAGAAAGTTGTTCGAGGCAATTTTGGACCACCCCGATACACAAGGAGCTGACGAGGACAACCGGGGAGAAATCGGGCTGAAAGATGGACCAAG
GAGGCAAAACCGGCAAATGGGACGGGCCAAGACCGAAGGGGTCGGGTTTTCGGCCCGACCCCCTGCTCGGCCATGGCCGAGGCCGACCCTCGGCCCGCTCGTGCGGGCCG
AGTCCGTTTGGTCCCGTCTGGTCCCCACCGCCTCTGGATGCCCCGGTTTCGCCTGGTTTGACCTAAAACGCCTCCGAAACCCTAAAAAGGCTAGGAGGATGAACAGGTCA
GGTGAGATCCTCTGGCCAAAATCGACCATCAACAGTTGGCGCCGTCTGTGGGGAAGAAAGCCTGTTAATCTGCACATCGGTTATTCCATGAGTAAGGGTATGGAAAAGAA
AGACCAAGACGTAAACATAGAGAATTCGGATGGTGACCGCCACCAGCGGAGGCCGAGGCCGAGCCGAGGCCGAGCAGAGGATGCCGACACCAAGATTGCCGCCCTTGAGG
ATGAGGTGAAGGGAATGAATCGGAGTTTATCCAAAATACTCCAGATCCTGGATAAACCCGGCCCTAGCACCAAAGTCCATGAGGGGAGCTTGATTAGAGACCCGAGGAAG
GGGAAGGAGCCTATGGAGCACACTGCAGAGTCAGGGACGAGGTCAAGAGGAAAGAAGACTGACAGCATGACCAGCAAGGTCAGGGGGCTCAAACCCACTGATCGCACGAT
TTTGAGGAGCCCAGAGTCAAGCACACTTAAGGGGCGTCACTACACAGTTTCTACCCCAAGCTTCGGTCATACTAAGACAGACCTGAGGAATCTGATCGTTGAGAAGCGCA
GAAGTGCCAAAACTGTCGAGTCCGAGGCCAAAGCCGCCGAAGCTGAAGCCCGGGCTGCCGAGGCCGAGGCCAGAGCAGCCGAGGCCGAGGCTAGGTTGGCCGAGGCCGAG
GCCAAGAAAGACGACCTCCCTTGGAAGACCGAGCTTCTTAATGCACTAAAGGAGCTCGGAAATCCTCAGGGAGACCAGCAGAGGTCAAAGAACTTTGGAGATCAAAACTT
GGAGGAACTAGCCGACCAAGTCGATCCGCCCTTCACAGAAGAAGTCATGAAAGCCGAGGTGCCTCAAAAGTTTAAGGTACCCACGTTCAAACAGTACGATGGCAAGAAAG
ACCCCGTGCAACATCTAAATGCATACAGAAGTTGGATGGACTTCCACGGCGTCTCAGATGCAATCAGAGTTAGCCCAAGCATTCCTTGCACAGTTCATGGGGCTAGAGAG
CAGCGCAAGCCTCACATCAACCTCTTGACGGTCAAACAACAACCAGGTGAGAGCTTGCGTGATTACATAACTCGTTTTAATGATGAGGCACTACAGGTTGAGGGGTACAG
CGAGGGAGCAGCCCTGGTAGCCATAACAGCCGGTCTGGAAGACGAAAGACTGCTCAATTCAATAGGTAAGAGCCAGCCTCGAACCTATGCGGAGTTCGTCTCCCGGGCAC
AGAAGTATATGAGCGCAGAGGAGTTACTGAAGTCAAAAAGGTCAGAACGAGAATATAAGAGGTTTTCTTCATCTAGCTATGACAGTAAAAAGGACAAAAGGCAGCGGACC
GACGAAGGAGGCCGGGGCCGAGCAGACCATGGCCGAGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTTCTTCATCTAGCGTCTCTGATAACTCTTCTGGCTGTAGTACTCAGCCAAATTCGTCAATTTTTCTTCTATCAAATATATGCAATCTCGTCCCTGTTCGCCT
CGATTCCTCAAATTTCTTGTTCTGGAAGTTTCAAGTTCAGTCAATGCTCAAGGCTCATTCCCTGTTTGGAATCGTTGATGGATCTGTGCCGTGTCCGCCTGAGTTTTTGC
TTGATGCTGACGGAAGGAAGACTACTGAAGTTAACTCTGCACATACTCAGTGGATTGCTCAAGATAGCGCTCTAATCACGTTGATTAACGCTACCCTATCGAAGACTGCT
TATTCATATGTCATTGGATGTAAATCCTCCAAGGAGGTATGGCTTTCTCTTGAAAAACAATTTTCTTCTTTAACGAGGTCTCATGTTCATGAACTCAAGTCATCGCTTCA
TACAATCTCTAAATCTGCCACTGAATCTATTGATGAATACCTGCTTCGAGTGAAAGAATTAGTAGATAAGCTTGCCACTGTCTCAGTACCAATTGATGATGAAGATCTGT
TGTTGTACACCTTAAATGGTTTGTCGTCGGAGTACAATTCATTTCGTACCTCTATTCGAACTCGAGAAGGAACTTTAAAATTACAAGAACTTCATGCCTTGTTGAAGTCA
GAGGCCAAGATTCTCGAGCAGCAAAACAAGTCCATCACTACCACTCCTCTTGTTCCGACTGCGATGTTTTCGTCTGTTTCAGGCCAGTTCAATTCCAATCGAGGACGCGG
TAGAGGCAGAAATTCAAATCCGAGTTACTGGTCAGGTCGCGGAGGTGGACGATCGAATCAGGGATACTATGGGAATTCAAATCAAGGTCGTGGTGGTTTCAATTCCTCTA
ATCCATCCCCTGGTTTTCCTCAGGGACATTCTGCTCCAAATCAGGGACGTGGTGCTAATTCTGGATCTAATTCGAGTTCTGGACGTGGAGTTATATGTCAAATTTGCAAT
CGGTCTGGCCATGGAGCCTTAGATTGTTACAATCGTCTGAATCTTTCGTTCCAGGGAAGACATCCTCCATCAAAGCTTGCTGCAATGGCCTCTATCTATGATCCTCAGTC
GTCTAATAATAGTGGCAACAACAATTGTACTTGGTTGGCTGATAGTGGTTGTAACTCTCATGTTACACCTGATCTGTCTACTCTTGCTCTGAATTCAAACTATAATGGTG
AGGATGCTATCACGGTTGCAAATGGGCAAGGTGTCCCTATTACTCAAACAGGTTCTGGCACACTCTCCACCTCCCACAGTGACCTTAATCTGTCGAAAATTCTTTGTGTT
CCTGATCTCTCAGCCAATTTGTTATCTGTTTCACAATGTTGCCTTGATAACAACTGCATCTTTGTTTTCGATGCTGATTGGTTTTCCATACAGGACAAAACCTCGGGACG
AATTTTATACAAGGGCAAGAGTAAGGATGGGTTATATCCCATATCTTCCATATCTTCGGCTGGTTCTTCTTTAAGAAGTAACTCTACAGTGGCTGGTTTGCATACGTCTG
CACCTTTATCTCCTATATGTGCATCTGTTTTTGTTTCTCATGTTCCTAGTACTGTGTTGTGGCATTTACGTCTTGGACATCCCTCTTTTCCTGTTTTGCAAAAGTTGTTG
TCTGCAAATTCTATTACTTGTGATACTAAGTATACTTGTCGAGACTGTGTGAGTTGTCTCAAAGGGAAAGCTTCTAAGCTTCCTTTTGCATCCTCTACTTCCATTACTAC
TAGGCCTCTTGCCCTGTTGCATAGTGATGTGTGGGGACCATCACCTTTAATCTCTGTTTCGGGCTTTCGATACTATGTCAATTTTGTTGATGACTTCAGCAAGTTTACTT
GGATATTTCCTTTAGTACGAAAATCTGATGTTTCTTCTGTTATAAAGCAGTTTGTTCCTTTTATTGAAAATCAGTTATCTTGCTCTTTACAAGTATTTCGATCAGATGGA
GGGGGTGAGTTCGTTAACCATTATGTGTATGAGTTCTTCTCATCCAAAGGTGTTCTACATCAGAGGAGTTGCCCTCATACACCTGAACAGAATGGGGCTGCTGAGCAAAA
GCATCGATCTATTGTTGACACTGCTATTGCTCTTATGAATCATGCATCCGTTCCTTTAGAATTTTGGTATCATGCTTTTGCTACTGTTGTGTTTTTGTTGAATCGTCTGC
CTTCTAGTGCTATTGGCTTTATGACTCCGTTTCAGAAGTTATATGGTTGTGTGCCTGATTTATCTCATTTGCGTGTGTTTGGTTGTGCCTGTTATCCACTGTTAAAGCCT
TACAACACTCATAAGTTACAACCAAAAACTGCTCAGCATGTTTTCTTGGGCTATACCTTGGAATACAAAGGGTATCTATGTTATAATATGGAAACCAAGAAGATGTTGGT
CTCTCGGCATGTTGTATTTCATGAAGATGTGTTCCCATTTGCGATTAGACCTGTCACACGGTCTCACAGTCACAGCCACACCCTCCCTTCACAGCCACAACCAAATCCTG
CTGTTGTAACTTCGTTGTTGTCTTCACAATCAAGTCTTGTTGTTCGTGTACTACCAAATAATTCTAGTGTTGGTGATGTTTTATTATCACCAATTAACACACCATCTACA
AGTGATGTTCAACCATCACCTGTCTTGCCTACTACTGATGTTTTACCATCTTCAGCTTCTCATTCTTCTGTTGTTGTGGCATCAGCAAACAGTAGTCCTAGTCAGTCTGC
TGCTATATCAGCAAACAATCAGTCTGATCCACCTGCAGCTGTTACAATCAATAGTCACCCTATGATTACTCGCTCAAAAGCCGGTATTTCCAAACGGAAGGCTTTTAGTG
CTGTTATTCCAAAGTCTATACCTGAACCAACTTCGTTCACAGCTGCTTCTAAAATCCCTGAGTGGAAACGTGCTATGTTGGATGAGTATACTGCCCTAACAAATCAGAAT
ACTTGGACTCTTGTTCCATCTCAAGAAGACATGAATATTGTTGGTTGTAAGTGGGTCTTTCGCACTAAATTTAATCCAGATGGGTCAGTTGCTCGTTACAAAGCCCGTTT
GGTTGCTAAAGGCTACAATCAACGTGAGGGTGTTGACTTCGATGAAACCTTTAGTCCTGTAGTCAAGAAAACTACTGTGCGAGTTATACTTGCTTTGGCAGCTCATTATG
GGTGGTCTTTACATCAGTTAGATGTTAAGAACGCCTTTCTCCATGGCTATCTTAAGGAAGATGTATACATGGTTCAGCCCCCCGGCTTTAGGGATTCAAATTATCCTAAT
CACACAACTCTTGGTCGGGAGTTTCAACTTACTGATCTCGGTTCTCTTCGTTATTTTATTGGTCTCGAAATTACTCGATTTTCTGATGGTTTTCATGTAACTCAGTTGAA
ATATCTTACTGATTTACTAGAAAAGACTGGTATGGCAGACTCAAAGACATGTTCAACTCCCATGTCTACTACGTGTGAGCTTCATGGTGTTTCTCCATTTTTTTCTGATG
CTTTGCTTTATAGACAGATTGTTGGTTCCTTACAATACTTGACCTTTACTAGACCCGACATTACCTATGTTGTAAGTAAAGTGAGTCAGTTTATGCACAAACCCACTGAA
ATACATTATTCTGCTGTGAAGCGTATTCTTCGCTATCTTAATGGCACTCGAGATTGTGGAATTCTGTTTTCCAAAGGGAAACTTGAACTCACTGCCTATAGTGATGCTGA
TTGGGCAGGAGATTCCATGGATCGACGGTCTACTTCTGGATATGTGGTCTTTTTTTGTGGTATTCCGGTTTCTTGGTCTGCTAAAAAGCAAAGCACGGTGTCTCGGTCTT
CAACTGAGGCTGAATATCGCTCTCTTGCACAAACGGCTGCTGAACTATATTGGATAAGACAATTACTGTGTGATCTTTGTGTTTTTCTTCCTACAGTTCCTTTATTGTTT
TGTGATATTTTCTACGGGAGAAAGTTGTTCGAGGCAATTTTGGACCACCCCGATACACAAGGAGCTGACGAGGACAACCGGGGAGAAATCGGGCTGAAAGATGGACCAAG
GAGGCAAAACCGGCAAATGGGACGGGCCAAGACCGAAGGGGTCGGGTTTTCGGCCCGACCCCCTGCTCGGCCATGGCCGAGGCCGACCCTCGGCCCGCTCGTGCGGGCCG
AGTCCGTTTGGTCCCGTCTGGTCCCCACCGCCTCTGGATGCCCCGGTTTCGCCTGGTTTGACCTAAAACGCCTCCGAAACCCTAAAAAGGCTAGGAGGATGAACAGGTCA
GGTGAGATCCTCTGGCCAAAATCGACCATCAACAGTTGGCGCCGTCTGTGGGGAAGAAAGCCTGTTAATCTGCACATCGGTTATTCCATGAGTAAGGGTATGGAAAAGAA
AGACCAAGACGTAAACATAGAGAATTCGGATGGTGACCGCCACCAGCGGAGGCCGAGGCCGAGCCGAGGCCGAGCAGAGGATGCCGACACCAAGATTGCCGCCCTTGAGG
ATGAGGTGAAGGGAATGAATCGGAGTTTATCCAAAATACTCCAGATCCTGGATAAACCCGGCCCTAGCACCAAAGTCCATGAGGGGAGCTTGATTAGAGACCCGAGGAAG
GGGAAGGAGCCTATGGAGCACACTGCAGAGTCAGGGACGAGGTCAAGAGGAAAGAAGACTGACAGCATGACCAGCAAGGTCAGGGGGCTCAAACCCACTGATCGCACGAT
TTTGAGGAGCCCAGAGTCAAGCACACTTAAGGGGCGTCACTACACAGTTTCTACCCCAAGCTTCGGTCATACTAAGACAGACCTGAGGAATCTGATCGTTGAGAAGCGCA
GAAGTGCCAAAACTGTCGAGTCCGAGGCCAAAGCCGCCGAAGCTGAAGCCCGGGCTGCCGAGGCCGAGGCCAGAGCAGCCGAGGCCGAGGCTAGGTTGGCCGAGGCCGAG
GCCAAGAAAGACGACCTCCCTTGGAAGACCGAGCTTCTTAATGCACTAAAGGAGCTCGGAAATCCTCAGGGAGACCAGCAGAGGTCAAAGAACTTTGGAGATCAAAACTT
GGAGGAACTAGCCGACCAAGTCGATCCGCCCTTCACAGAAGAAGTCATGAAAGCCGAGGTGCCTCAAAAGTTTAAGGTACCCACGTTCAAACAGTACGATGGCAAGAAAG
ACCCCGTGCAACATCTAAATGCATACAGAAGTTGGATGGACTTCCACGGCGTCTCAGATGCAATCAGAGTTAGCCCAAGCATTCCTTGCACAGTTCATGGGGCTAGAGAG
CAGCGCAAGCCTCACATCAACCTCTTGACGGTCAAACAACAACCAGGTGAGAGCTTGCGTGATTACATAACTCGTTTTAATGATGAGGCACTACAGGTTGAGGGGTACAG
CGAGGGAGCAGCCCTGGTAGCCATAACAGCCGGTCTGGAAGACGAAAGACTGCTCAATTCAATAGGTAAGAGCCAGCCTCGAACCTATGCGGAGTTCGTCTCCCGGGCAC
AGAAGTATATGAGCGCAGAGGAGTTACTGAAGTCAAAAAGGTCAGAACGAGAATATAAGAGGTTTTCTTCATCTAGCTATGACAGTAAAAAGGACAAAAGGCAGCGGACC
GACGAAGGAGGCCGGGGCCGAGCAGACCATGGCCGAGGCTGA
Protein sequenceShow/hide protein sequence
MASSSSSVSDNSSGCSTQPNSSIFLLSNICNLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEVNSAHTQWIAQDSALITLINATLSKTA
YSYVIGCKSSKEVWLSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGTLKLQELHALLKS
EAKILEQQNKSITTTPLVPTAMFSSVSGQFNSNRGRGRGRNSNPSYWSGRGGGRSNQGYYGNSNQGRGGFNSSNPSPGFPQGHSAPNQGRGANSGSNSSSGRGVICQICN
RSGHGALDCYNRLNLSFQGRHPPSKLAAMASIYDPQSSNNSGNNNCTWLADSGCNSHVTPDLSTLALNSNYNGEDAITVANGQGVPITQTGSGTLSTSHSDLNLSKILCV
PDLSANLLSVSQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKDGLYPISSISSAGSSLRSNSTVAGLHTSAPLSPICASVFVSHVPSTVLWHLRLGHPSFPVLQKLL
SANSITCDTKYTCRDCVSCLKGKASKLPFASSTSITTRPLALLHSDVWGPSPLISVSGFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDG
GGEFVNHYVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIALMNHASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYPLLKP
YNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKMLVSRHVVFHEDVFPFAIRPVTRSHSHSHTLPSQPQPNPAVVTSLLSSQSSLVVRVLPNNSSVGDVLLSPINTPST
SDVQPSPVLPTTDVLPSSASHSSVVVASANSSPSQSAAISANNQSDPPAAVTINSHPMITRSKAGISKRKAFSAVIPKSIPEPTSFTAASKIPEWKRAMLDEYTALTNQN
TWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRVILALAAHYGWSLHQLDVKNAFLHGYLKEDVYMVQPPGFRDSNYPN
HTTLGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPMSTTCELHGVSPFFSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQFMHKPTE
IHYSAVKRILRYLNGTRDCGILFSKGKLELTAYSDADWAGDSMDRRSTSGYVVFFCGIPVSWSAKKQSTVSRSSTEAEYRSLAQTAAELYWIRQLLCDLCVFLPTVPLLF
CDIFYGRKLFEAILDHPDTQGADEDNRGEIGLKDGPRRQNRQMGRAKTEGVGFSARPPARPWPRPTLGPLVRAESVWSRLVPTASGCPGFAWFDLKRLRNPKKARRMNRS
GEILWPKSTINSWRRLWGRKPVNLHIGYSMSKGMEKKDQDVNIENSDGDRHQRRPRPSRGRAEDADTKIAALEDEVKGMNRSLSKILQILDKPGPSTKVHEGSLIRDPRK
GKEPMEHTAESGTRSRGKKTDSMTSKVRGLKPTDRTILRSPESSTLKGRHYTVSTPSFGHTKTDLRNLIVEKRRSAKTVESEAKAAEAEARAAEAEARAAEAEARLAEAE
AKKDDLPWKTELLNALKELGNPQGDQQRSKNFGDQNLEELADQVDPPFTEEVMKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRVSPSIPCTVHGARE
QRKPHINLLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRSEREYKRFSSSSYDSKKDKRQRT
DEGGRGRADHGRG