; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0012170 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0012170
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr1:38221449..38232832
RNA-Seq ExpressionLag0012170
SyntenyLag0012170
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016747 - transferase activity, transferring acyl groups other than amino-acyl groups (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR003480 - Transferase
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR023213 - Chloramphenicol acetyltransferase-like domain superfamily
IPR025724 - GAG-pre-integrase domain
IPR029472 - Retrotransposon Copia-like, N-terminal
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAA7015607.1 unnamed protein product [Microthlaspi erraticum]8.2e-21434.27Show/hide
Query:  DSVKPTN------PTVIDQYVNPYYLHHSDGTNLVLVSE-LLTESNYSSWYQAMLIGLTVKNKL------------------------------------
        DS  PT+      PT +DQY NPY+LH +D   L+LVS+ L T S++ SW +++ + L  +NKL                                    
Subjt:  DSVKPTN------PTVIDQYVNPYYLHHSDGTNLVLVSE-LLTESNYSSWYQAMLIGLTVKNKL------------------------------------

Query:  -----------------------------------RREISNLMQEQLSVTAYFAKLKALWNELVSY--RPSCTCDRCSCGGVQGLVQYFQTEHVVAFLMG
                                            + +S + Q  L V+AY+ +L  LW E  +Y   P CTC +C C       +  Q   V  FLMG
Subjt:  -----------------------------------RREISNLMQEQLSVTAYFAKLKALWNELVSY--RPSCTCDRCSCGGVQGLVQYFQTEHVVAFLMG

Query:  LNESFSQIRTQLLLMEPEPTIVKAFSLVAQEVEQCASTVVPIAAPSPTIDATALLVKNTQP------------NSAAKNTQQIKKKERSHCTHCNVLGHT
        LNESF   R  +L+++P PTI +AF++V Q+  Q       +  P   +DA A     + P            N A       +  ++  CTHC  +GHT
Subjt:  LNESFSQIRTQLLLMEPEPTIVKAFSLVAQEVEQCASTVVPIAAPSPTIDATALLVKNTQP------------NSAAKNTQQIKKKERSHCTHCNVLGHT

Query:  IGRCYKLHGYPPGYR-NQKAPSTA----------KPESNP-----------------------------------------------LPNLTPEQCQSIL
        + +C+K+HG+PPGY+ N  AP++A          +P+  P                                               LP  +P+Q Q ++
Subjt:  IGRCYKLHGYPPGYR-NQKAPSTA----------KPESNP-----------------------------------------------LPNLTPEQCQSIL

Query:  AMLQSHLSSVKTT---------------PESSSSSSIGNAHVAGTC------------SSL--LPPSYSWIVDSGASAHICFSKDLFSSFKSVSGFSVTL
        +  QSH+S  ++                 + S+S +I     + TC            SSL  L P  +WI+DSGAS+H+C    LFS    VS F+VTL
Subjt:  AMLQSHLSSVKTT---------------PESSSSSSIGNAHVAGTC------------SSL--LPPSYSWIVDSGASAHICFSKDLFSSFKSVSGFSVTL

Query:  PNQARLVVDYVGDVRLSSGLLLQNVLFIPTFQFNLISVSAITAQSSVSLSFADGYCLIQDKSSLKTIGKARFWQGLYLLDHHALDTVPLPVYLNSTKHCS
        PN AR+ + +   V +SS L+L NVL +P F FNLISVS +      S  F    CLIQ+ S    IG+A+    LY+LD +  D  P     + +  C 
Subjt:  PNQARLVVDYVGDVRLSSGLLLQNVLFIPTFQFNLISVSAITAQSSVSLSFADGYCLIQDKSSLKTIGKARFWQGLYLLDHHALDTVPLPVYLNSTKHCS

Query:  LVSNCNTDLWHDRLGHPSHKNLNALKPLL-SFKESPS--HPCFICPLAKQRRLSFSAHNRLVDKPFDLFHCDTWGPFRSLTHLGHRYFLTLVDDATRYTW
         V   +  +WH RLGHPS   L  LK +L SFK+  S    C +CPLAKQRRL++ +HN L  +PFDL H D WGPF   +  G+RYFLT+VDD TR TW
Subjt:  LVSNCNTDLWHDRLGHPSHKNLNALKPLL-SFKESPS--HPCFICPLAKQRRLSFSAHNRLVDKPFDLFHCDTWGPFRSLTHLGHRYFLTLVDDATRYTW

Query:  IFFMKKKSDALTIVPSFFKLVDTQFQVAIKCFRSDNAPELSFTDFFRSSGVVHQFSCVECP----------------SKTLSWKGNIST---SSMLLVLY
        ++ ++ KSD  T+ P+F  L+ TQ+   +K  RSDNAPEL+FT   +  G+VHQ SC   P                ++ L ++ N+     S  +    
Subjt:  IFFMKKKSDALTIVPSFFKLVDTQFQVAIKCFRSDNAPELSFTDFFRSSGVVHQFSCVECP----------------SKTLSWKGNIST---SSMLLVLY

Query:  FFSQGCP---YNFGTPYFRLHSKNVDYSQLRVFGSLCFASTLKAGRSKFSPRVIPAVFLGYPQGMKAFKLYDIENKHVFISRDVVFHESIFPFH-SVVAQ
        F     P    N  +P+  L  K  DYS L+ FG LC+ASTL   R+KFSPR  P VFLGYP G K +K+ D+E++ V ISR+VVF E++FPF  S    
Subjt:  FFSQGCP---YNFGTPYFRLHSKNVDYSQLRVFGSLCFASTLKAGRSKFSPRVIPAVFLGYPQGMKAFKLYDIENKHVFISRDVVFHESIFPFH-SVVAQ

Query:  KDTSVFLNDLV-LPKSFNYAGTPDVSHGHELQLSVGERPVLPNEDI---VSNDVFVQPSIDCSLGQSESQMLPQSLASQEPNVPLSDSLGCSLPSITRHR
        K   +F N ++ +P   ++  T        + L+  E    P+ +I   V +     P    S     S   P ++ S     P+S        +I R +
Subjt:  KDTSVFLNDLV-LPKSFNYAGTPDVSHGHELQLSVGERPVLPNEDI---VSNDVFVQPSIDCSLGQSESQMLPQSLASQEPNVPLSDSLGCSLPSITRHR

Query:  ACPQRKSNRQIKQPSYLKDYHCSLLHANAFPPT---QTKFPLNKYLSYNSLSDSYRTFVLNISTTFEPQFYHQAASSSHWRDAMSVELKAMESNSTWSV-
          P        K PSYL +YHC+ L   ++ P     T +P++ ++SY+ LS  +++++   +   EP+ + QA +S  W  A++VEL A+E   TW + 
Subjt:  ACPQRKSNRQIKQPSYLKDYHCSLLHANAFPPT---QTKFPLNKYLSYNSLSDSYRTFVLNISTTFEPQFYHQAASSSHWRDAMSVELKAMESNSTWSV-

Query:  -----------------------------ARLIAKGYTQQEGLDYIETFSPVAKLVTIKVLLTVATSLGWPLIQLDVNNAFLHGDLFEEVYMDLPLGYV-
                                     ARL+A+G+TQ+EG+DY +TFSPVAKL ++K LL +A++ GW L Q+DV+NAFLHGDL EE+YM LP GY  
Subjt:  -----------------------------ARLIAKGYTQQEGLDYIETFSPVAKLVTIKVLLTVATSLGWPLIQLDVNNAFLHGDLFEEVYMDLPLGYV-

Query:  -HGKVCPQGQRLVCQLHKSIYGLKQASRQWFDKFSCALLSFGFKQSRSDYSLFTKGCGNTFVALLVYVDDI-----------------------------
          G V P     VC+L KS+YGLKQASRQW+ + S  L+   + QS +D +LF +    +FV +LVYVDD+                             
Subjt:  -HGKVCPQGQRLVCQLHKSIYGLKQASRQWFDKFSCALLSFGFKQSRSDYSLFTKGCGNTFVALLVYVDDI-----------------------------

Query:  ---SSPEPLSSSKGIFLSQRKYALQLVEDEGLLASKPSALPMDPNSKLSSDQ----------ASFSSSPSCSVIASVHQRLAY------AGDFLPASKSF
              E   SS GI + Q KYAL L+ED GLL  KPS++PMDP+  L+  +          + F S+P+ + + + H+ L Y       G    A    
Subjt:  ---SSPEPLSSSKGIFLSQRKYALQLVEDEGLLASKPSALPMDPNSKLSSDQ----------ASFSSSPSCSVIASVHQRLAY------AGDFLPASKSF

Query:  QLKAFSDADWASCPDSRKSTTGYCVFLGDALVSWKAKKQSTVSRSSAEAEYRALAVTTAELVWLRQLLKELLIPSEVPSLLFCDNRQS
         L  F+DADW +C ++R+S TG+CV+LG +L+SWK+KKQS VSR+S EAEYR+LA  T E++WL+QLL++L I    P+ LFCDN+ +
Subjt:  QLKAFSDADWASCPDSRKSTTGYCVFLGDALVSWKAKKQSTVSRSSAEAEYRALAVTTAELVWLRQLLKELLIPSEVPSLLFCDNRQS

KAG7588551.1 Ribonuclease H domain [Arabidopsis suecica]8.5e-21134.52Show/hide
Query:  IDQYVNPYYLHHSDGTNLVLVSE-LLTESNYSSWYQAMLIGLTVKNKL----------------------------------------------------
        +DQY NPY+LH SD   LVLVS+ L T +++ SW +++ + L V+NKL                                                    
Subjt:  IDQYVNPYYLHHSDGTNLVLVSE-LLTESNYSSWYQAMLIGLTVKNKL----------------------------------------------------

Query:  -------------------RREISNLMQEQLSVTAYFAKLKALWNELVSY--RPSCTCDRCSCGGVQGLVQYFQTEHVVAFLMGLNESFSQIRTQLLLME
                            + +S++ Q  + V+AY+ +L  +W E  +Y   P CTC +C C       +  Q   V  FLMGLNES+   R  +L+++
Subjt:  -------------------RREISNLMQEQLSVTAYFAKLKALWNELVSY--RPSCTCDRCSCGGVQGLVQYFQTEHVVAFLMGLNESFSQIRTQLLLME

Query:  PEPTIVKAFSLVAQEVEQCA-----------STVVPIAAPSPTIDA-TALLVKNTQPNSA-AKNTQQIKKKERSHCTHCNVLGHTIGRCYKLHGYPPGY-
        P P+I   F++V Q+  Q +               P A  +P +        +  Q N+A A       ++ R  CTHC   GH I +C+KLHGYPPGY 
Subjt:  PEPTIVKAFSLVAQEVEQCA-----------STVVPIAAPSPTIDA-TALLVKNTQPNSA-AKNTQQIKKKERSHCTHCNVLGHTIGRCYKLHGYPPGY-

Query:  -------------------------------RNQKAPSTAKPESNP-------------LPNLTPEQCQSILAMLQSHLSSVKTTPES------------
                                        NQ+A S A     P                ++P+Q QS+L  L +H+   +TT  S            
Subjt:  -------------------------------RNQKAPSTAKPESNP-------------LPNLTPEQCQSILAMLQSHLSSVKTTPES------------

Query:  SSSSSIGNAHVAGT-------------------------CSSLL---PPSYSWIVDSGASAHICFSKDLFSSFKSVSGFSVTLPNQARLVVDYVGDVRLS
        ++ SS GN H   +                         C S L    P  SWI+DSGA++H+C    LFS    VSG +V+LPN  R+ + + G + +S
Subjt:  SSSSSIGNAHVAGT-------------------------CSSLL---PPSYSWIVDSGASAHICFSKDLFSSFKSVSGFSVTLPNQARLVVDYVGDVRLS

Query:  SGLLLQNVLFIPTFQFNLISVSAITAQSSVSLSFADGYCLIQDKSSLKTIGKARFWQGLYLLDHHALDTVPLPVYLNSTKHCSLVSNCNTDLWHDRLGHP
          L+L +VL +P+F+FNLISVS++   S  S  F    C IQ+ +   TIGK      LY+L   +  ++  P   + T H S     + DLWH RLGHP
Subjt:  SGLLLQNVLFIPTFQFNLISVSAITAQSSVSLSFADGYCLIQDKSSLKTIGKARFWQGLYLLDHHALDTVPLPVYLNSTKHCSLVSNCNTDLWHDRLGHP

Query:  SHKNLNALKPLLSFKESPSH---PCFICPLAKQRRLSFSAHNRLVDKPFDLFHCDTWGPFRSLTHLGHRYFLTLVDDATRYTWIFFMKKKSDALTIVPSF
        S   L AL   LS  +S      PC +C LAKQ+RLSF +HN L   PFDL H D WGPF + +  G+RYFLT+VDD TR TWI+ M+ KS+      +F
Subjt:  SHKNLNALKPLLSFKESPSH---PCFICPLAKQRRLSFSAHNRLVDKPFDLFHCDTWGPFRSLTHLGHRYFLTLVDDATRYTWIFFMKKKSDALTIVPSF

Query:  FKLVDTQFQVAIKCFRSDNAPELSFTDFFRSSGVVHQFSCVECP----------------SKTLSWKGNIST---SSMLLVLYFFSQGCP---YNFGTPY
         +LV TQ++  IK  R+DNAPEL+FT+    +G++HQFSC   P                ++ L ++ N+     S  +    F     P       +PY
Subjt:  FKLVDTQFQVAIKCFRSDNAPELSFTDFFRSSGVVHQFSCVECP----------------SKTLSWKGNIST---SSMLLVLYFFSQGCP---YNFGTPY

Query:  FRLHSKNVDYSQLRVFGSLCFASTLKAGRSKFSPRVIPAVFLGYPQGMKAFKLYDIENKHVFISRDVVFHESIFPFHSVVAQKDTSVFLNDLVLPKSFNY
          L  K  DYS LR FG LC+ASTL   R KFSPR    VF+GY  G K +KL  ++   V +SR+VVF+E IFPFH +          N L LP S   
Subjt:  FRLHSKNVDYSQLRVFGSLCFASTLKAGRSKFSPRVIPAVFLGYPQGMKAFKLYDIENKHVFISRDVVFHESIFPFHSVVAQKDTSVFLNDLVLPKSFNY

Query:  AGTPDVSHGHELQLSVGERPVLPNEDIVSNDVFVQPSIDCSLGQSESQMLPQSLASQEPNVPLSDSLGCSLPSITRHRACPQRKSNRQIKQPSYLKDYHC
             V H  +  +        PN+   S+              S +     S+ S    VP   S+  SLP++         +  R  K PSYL DYHC
Subjt:  AGTPDVSHGHELQLSVGERPVLPNEDIVSNDVFVQPSIDCSLGQSESQMLPQSLASQEPNVPLSDSLGCSLPSITRHRACPQRKSNRQIKQPSYLKDYHC

Query:  SLLHANAFP--------PTQTKFPLNKYLSYNSLSDSYRTFVLNISTTFEPQFYHQAASSSHWRDAMSVELKAMESNSTWSV------------------
        SL+  ++ P           T +PL+  LSY+ L + Y++ VL+ S   EP  + QA +S  W  AM VEL+AME N TWS+                  
Subjt:  SLLHANAFP--------PTQTKFPLNKYLSYNSLSDSYRTFVLNISTTFEPQFYHQAASSSHWRDAMSVELKAMESNSTWSV------------------

Query:  ------------ARLIAKGYTQQEGLDYIETFSPVAKLVTIKVLLTVATSLGWPLIQLDVNNAFLHGDLFEEVYMDLPLGY--VHGKVCPQGQRLVCQLH
                    ARL+AKG+TQQEG+D+ +TFSPVAKL ++K++L +A +  W L Q+DV+NAFLH +L EE+YM LP GY    G+  P     VC+LH
Subjt:  ------------ARLIAKGYTQQEGLDYIETFSPVAKLVTIKVLLTVATSLGWPLIQLDVNNAFLHGDLFEEVYMDLPLGY--VHGKVCPQGQRLVCQLH

Query:  KSIYGLKQASRQWFDKFSCALLSFGFKQSRSDYSLFTKGCGNTFVALLVYVDDI----SSPEPLSSSK----------------------------GIFL
        KSIYGLKQASRQW+  FS  LL   F QS SD +LF K  GN+F+ LLVYVDDI    +S E +SS K                            GI +
Subjt:  KSIYGLKQASRQWFDKFSCALLSFGFKQSRSDYSLFTKGCGNTFVALLVYVDDI----SSPEPLSSSK----------------------------GIFL

Query:  SQRKYALQLVEDEGLLASKPSALPMDPNSKLSSDQASFSS-----------------------------SPSCSVIASVHQRLAY-----------AGDF
        SQRKY L L++D G L  KP  +PMDP   L  D  +  +                             S   S    VH + AY            G F
Subjt:  SQRKYALQLVEDEGLLASKPSALPMDPNSKLSSDQASFSS-----------------------------SPSCSVIASVHQRLAY-----------AGDF

Query:  LPASKSFQLKAFSDADWASCPDSRKSTTGYCVFLGDALVSWKAKKQSTVSRSSAEAEYRALAVTTAELVWLRQLLKELLIPSEVPSLLFCDNRQS
           +    L  FSDADW +C D+R+STTG CVFLG +L++ K+KKQ   S SS EAEYRA+A+T+ ELVWL QLLK+L +P+  P+ L+CD++ +
Subjt:  LPASKSFQLKAFSDADWASCPDSRKSTTGYCVFLGDALVSWKAKKQSTVSRSSAEAEYRALAVTTAELVWLRQLLKELLIPSEVPSLLFCDNRQS

KZV25004.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Dorcoceras hygrometricum]4.4e-22336.06Show/hide
Query:  PTNPTVIDQYVNPYYLHHSDGTNLVLVSELLTESNYSSWYQAMLIGLTVKNKL-----------------------------------------------
        P   T ++   +PYYLH+ D   L LVS  L  SNY++W +AM++ LT KNKL                                               
Subjt:  PTNPTVIDQYVNPYYLHHSDGTNLVLVSELLTESNYSSWYQAMLIGLTVKNKL-----------------------------------------------

Query:  ------------------------RREISNLMQEQLSVTAYFAKLKALWNELVSYRPSCTCDRCSCGGVQGLVQYFQTEHVVAFLMGLNESFSQIRTQLL
                                ++ +S L Q  + V++Y+ KL+ LW+EL  Y+P+     C+CG ++    Y   E V+ FLMGLN+S++Q+R Q+L
Subjt:  ------------------------RREISNLMQEQLSVTAYFAKLKALWNELVSYRPSCTCDRCSCGGVQGLVQYFQTEHVVAFLMGLNESFSQIRTQLL

Query:  LMEPEPTIVKAFSLVAQEVEQCASTVVPIAAPSPTIDATALLVK-NTQPNSAA--KNTQQIK--KKERSHCTHCNVLGHTIGRCYKLHGYPPGYRNQK--
        ++EP PTI K F+LV QE  Q +   +        +D + +L   N+  N+A   + +Q  K  + +R  C+HC+   HT+ +CYKLHGYPPG+   K  
Subjt:  LMEPEPTIVKAFSLVAQEVEQCASTVVPIAAPSPTIDATALLVK-NTQPNSAA--KNTQQIK--KKERSHCTHCNVLGHTIGRCYKLHGYPPGYRNQK--

Query:  ------------APSTAKPESNPL---PNLTPEQCQSILAMLQSHLSS-----VKTTPESSSSSSIGNAHVAGTCS--SLLP--PSYSWIVDSGASAHIC
                    + S    E+  +    +LT  QC+ ++  L S L +     ++  PE++ S       + G CS  S +P      WI+D+GA+ HIC
Subjt:  ------------APSTAKPESNPL---PNLTPEQCQSILAMLQSHLSS-----VKTTPESSSSSSIGNAHVAGTCS--SLLP--PSYSWIVDSGASAHIC

Query:  FSKDLFSSFKSVSGFSVTLPNQARLVVDYVGDVRLSSGLLLQNVLFIPTFQFNLISVSAITAQSSVSLSFADGYCLIQDKSSLKTIGKARFWQGLYLLDH
         S  +F S +++    V LPN   + V   G V ++S L+LQNVL++P FQFNL+SVS++T   + S+SF    C IQD S ++ IG  +    LY+L  
Subjt:  FSKDLFSSFKSVSGFSVTLPNQARLVVDYVGDVRLSSGLLLQNVLFIPTFQFNLISVSAITAQSSVSLSFADGYCLIQDKSSLKTIGKARFWQGLYLLDH

Query:  HALDTVPLPVYLNSTKHCSLVSNCNTDLWHDRLGHPSHKNLNALKPLLSFKESP-SHPCFICPLAKQRRLSFSAHNRLVDKPFDLFHCDTWGPFRSLTHL
               LP Y+ +T     VS  N++LWH R+GHPS   L++LK +L+ + +   + C  C L+KQRRL  ++ N +  + F+L H DTWGPF   +  
Subjt:  HALDTVPLPVYLNSTKHCSLVSNCNTDLWHDRLGHPSHKNLNALKPLLSFKESP-SHPCFICPLAKQRRLSFSAHNRLVDKPFDLFHCDTWGPFRSLTHL

Query:  GHRYFLTLVDDATRYTWIFFMKKKSDALTIVPSFFKLVDTQFQVAIKCFRSDNAPELSFTDFFRSSGVVHQFSCVECPSKTLSWKGNISTSSMLLVLYFF
        G R+F T+VDD +RYTW++ +K KSD L+I P F ++V TQF V +K  RSDNAPEL F DFF  +G+ H  SCVE P +    +        +     F
Subjt:  GHRYFLTLVDDATRYTWIFFMKKKSDALTIVPSFFKLVDTQFQVAIKCFRSDNAPELSFTDFFRSSGVVHQFSCVECPSKTLSWKGNISTSSMLLVLYFF

Query:  SQGCPYNF----------------------GTPYFRLHSKNVDYSQLRVFGSLCFASTLKAGRSKFSPRVIPAVFLGYPQGMKAFKLYDIENKHVFISRD
            P ++                       TP+  LH K   YS L+VFG LC+ASTL + R KFSPR I  VF+GYP G K +KL ++E   +FISRD
Subjt:  SQGCPYNF----------------------GTPYFRLHSKNVDYSQLRVFGSLCFASTLKAGRSKFSPRVIPAVFLGYPQGMKAFKLYDIENKHVFISRD

Query:  VVFHESIFPFHSVVAQKDTSVFLNDLVLPKSFNYAGTPDVSHGHELQLSVGERPVLPNEDIVSNDVFVQPSIDCSLGQSESQMLPQSLASQEPNVPLSDS
        V+FHE+ FP+     Q  + + L+D+                                         V PS         SQ+ P               
Subjt:  VVFHESIFPFHSVVAQKDTSVFLNDLVLPKSFNYAGTPDVSHGHELQLSVGERPVLPNEDIVSNDVFVQPSIDCSLGQSESQMLPQSLASQEPNVPLSDS

Query:  LGCSLPSITRHRACPQRKSNRQIKQPSYLKDYHCSLLHANAFPPTQTKFPLNKYLSYNSLSDSYRTFVLNISTTFEPQFYHQAASSSHWRDAMSVELKAM
           S+P+  +  +    +++R    PS+L+DYHC  +       T T  P++  ++Y+ LS S+R FV NIS+  EP  + QA S   WR AM  ELKA+
Subjt:  LGCSLPSITRHRACPQRKSNRQIKQPSYLKDYHCSLLHANAFPPTQTKFPLNKYLSYNSLSDSYRTFVLNISTTFEPQFYHQAASSSHWRDAMSVELKAM

Query:  ESNSTWSV------------------------------ARLIAKGYTQQEGLDYIETFSPVAKLVTIKVLLTVATSLGWPLIQLDVNNAFLHGDLFEEVY
        E N TWS+                              ARL+AKGYTQQEGLDY+ETFSPVAKLVT++ LL +A   GW LIQLDVNNAFLHGDL EEVY
Subjt:  ESNSTWSV------------------------------ARLIAKGYTQQEGLDYIETFSPVAKLVTIKVLLTVATSLGWPLIQLDVNNAFLHGDLFEEVY

Query:  MDLPLGYVHGKVCPQGQ---RLVCQLHKSIYGLKQASRQWFDKFSCALLSFGFKQSRSDYSLFTKGCGNTFVALLVYVDDI-------------------
        M LP G+     C +G+   R VC+LHKSIYGLKQASRQWF KFS  LLS GF QS +D SLF +   N F+AL+VYVDDI                   
Subjt:  MDLPLGYVHGKVCPQGQ---RLVCQLHKSIYGLKQASRQWFDKFSCALLSFGFKQSRSDYSLFTKGCGNTFVALLVYVDDI-------------------

Query:  -------------SSPEPLSSSKGIFLSQRKYALQLVEDEGLLASKPSALPMDPNSKLSSDQASFSSSPSC----------------SVIASVHQRLAYA
                        E   S++G+ + QR YA+ L+ + GLL  KP   PM+ N+KL+ D     S P+                  ++ +V++   Y 
Subjt:  -------------SSPEPLSSSKGIFLSQRKYALQLVEDEGLLASKPSALPMDPNSKLSSDQASFSSSPSC----------------SVIASVHQRLAYA

Query:  ------------------------GDFLPASKSFQLKAFSDADWASCPDSRKSTTGYCVFLGDALVSWKAKKQSTVSRSSAEAEYRALAVTTAELVWLRQ
                                G F  +S   +L+AFSDADW +C D+R+S TGYCVFLG++L+SW+AKKQ TVSRSSAEAEYR+LA +T E++W+ Q
Subjt:  ------------------------GDFLPASKSFQLKAFSDADWASCPDSRKSTTGYCVFLGDALVSWKAKKQSTVSRSSAEAEYRALAVTTAELVWLRQ

Query:  LLKELLIPSEVPSLLFCDNR
        LL +L +    P++LFCD++
Subjt:  LLKELLIPSEVPSLLFCDNR

RVW81690.1 Copia protein [Vitis vinifera]4.8e-21435.64Show/hide
Query:  PTNPTVIDQYVNPYYLHHSDGTNLVLVSELLTESNYSSWYQAMLIGLTVKNK------------------------------------------------
        P NP+  D   +PYYLH SD    +LVSE+    NY +W ++++I LTVKNK                                                
Subjt:  PTNPTVIDQYVNPYYLHHSDGTNLVLVSELLTESNYSSWYQAMLIGLTVKNK------------------------------------------------

Query:  ------------------------LRREISNLMQEQLSVTAYFAKLKALWNELVSYR--PSCTCDR---CSCGGVQGLVQYFQTEHVVAFLMGLNESFSQ
                                L + +S + Q  LSVT YF+  K  W+E +SYR  P+CTC +   C+C     L    Q+++V+ FL+GLN+S++ 
Subjt:  ------------------------LRREISNLMQEQLSVTAYFAKLKALWNELVSYR--PSCTCDR---CSCGGVQGLVQYFQTEHVVAFLMGLNESFSQ

Query:  IRTQLLLMEPEPTIVKAFSLVAQEVEQCASTVVPIAAPSPTIDAT-ALLVKNTQPNSAAKNTQQIK---KKERSHCTHCNVLGHTIGRCYKLHGYPPGYR
        IR+QLLLM P P + K FSL+ QE  Q   T       S T + T AL+ K  Q N  +   Q  K   KK   HCTHC   GHT+ +C++LHGYPPG+ 
Subjt:  IRTQLLLMEPEPTIVKAFSLVAQEVEQCASTVVPIAAPSPTIDAT-ALLVKNTQPNSAAKNTQQIK---KKERSHCTHCNVLGHTIGRCYKLHGYPPGYR

Query:  NQK-----APSTAKPESNPLPN----------LTPEQCQSILAMLQSHLSSVKTTPESSSSSSIGNA-------HVAGTCSSLLPPSYS-----------
          K     A + A   +  +PN           TPE+   ++ +  S        P SS+ +++  A       H +G     L P +S           
Subjt:  NQK-----APSTAKPESNPLPN----------LTPEQCQSILAMLQSHLSSVKTTPESSSSSSIGNA-------HVAGTCSSLLPPSYS-----------

Query:  --WIVDSGASAHICFSKDLFSSFKSVSGFSVTLPNQARLVVDYVGDVRLSSGLLLQNVLFIPTFQFNLISVSAITAQSSVSLSFADGYCLIQDKSSLKTI
          W++D+GA+ H+  S  L+ S       S+ LPN   +   ++G V LS  L L NVL +P F FNL+SVS +T QS++SL+F   +CL+QD+S  K I
Subjt:  --WIVDSGASAHICFSKDLFSSFKSVSGFSVTLPNQARLVVDYVGDVRLSSGLLLQNVLFIPTFQFNLISVSAITAQSSVSLSFADGYCLIQDKSSLKTI

Query:  GKARFWQGLY-LLDHHALDTVPLPVYLNSTKHCSLVSNCNTDLWHDRLGHPSHKNLNALKPLLS-FKESPSHPCFICPLAKQRRLSFSAHNRLVDKPFDL
        G A   +GLY L+   +          NST       + + D+WH RLGH S    ++LK + S      ++ C ICPLAKQRRL FS      +K FDL
Subjt:  GKARFWQGLY-LLDHHALDTVPLPVYLNSTKHCSLVSNCNTDLWHDRLGHPSHKNLNALKPLLS-FKESPSHPCFICPLAKQRRLSFSAHNRLVDKPFDL

Query:  FHCDTWGPFRSLTHLGHRYFLTLVDDATRYTWIFFMKKKSDALTIVPSFFKLVDTQFQVAIKCFRSDNAPELSFTDFFRSSGVVHQFSCVECP-------
         HCD WGPF + ++ G RYFLT+VDD TR TW++ +K KS+  +++ +F K+V  QF  ++K  RSDN PE   T F+   G++HQ SCVE P       
Subjt:  FHCDTWGPFRSLTHLGHRYFLTLVDDATRYTWIFFMKKKSDALTIVPSFFKLVDTQFQVAIKCFRSDNAPELSFTDFFRSSGVVHQFSCVECP-------

Query:  ---------SKTLSWKGN---ISTSSMLLVLYFFSQGCP---YNFGTPYFRLHSKNVDYSQLRVFGSLCFASTLKAGRSKFSPRVIPAVFLGYPQGMKAF
                 ++ L ++ N   I  S  +L         P       TP+  L +K  + S LRVFG LCFASTL + R KF  R    +FLGYP  +K +
Subjt:  ---------SKTLSWKGN---ISTSSMLLVLYFFSQGCP---YNFGTPYFRLHSKNVDYSQLRVFGSLCFASTLKAGRSKFSPRVIPAVFLGYPQGMKAF

Query:  KLYDIENKHVFISRDVVFHESIFPFHSVVAQKDTSVFLNDLVLPKSFNYAGTPDVSHGHELQLSVGERPVLPNEDIVSNDVFVQPSIDCSLGQSESQMLP
        KL D+    + +SR+V+FHE+IFPF  +    +T  F+     P S +++ +P  +       +VG      N   +S                     P
Subjt:  KLYDIENKHVFISRDVVFHESIFPFHSVVAQKDTSVFLNDLVLPKSFNYAGTPDVSHGHELQLSVGERPVLPNEDIVSNDVFVQPSIDCSLGQSESQMLP

Query:  QSLASQEPNVPLSDSLGCSLPSITRHRACPQRKSNRQIKQPSYLKDYHCSLLHANA----------FPPTQTKF-PLNKYLSYNSLSDSYRTFVLNISTT
          L S  P  P    +  + P + +      RKS R  ++PSYL+DY+C  + ++             PT+  F  L+ +LS + LS S++ F+ +I+ +
Subjt:  QSLASQEPNVPLSDSLGCSLPSITRHRACPQRKSNRQIKQPSYLKDYHCSLLHANA----------FPPTQTKF-PLNKYLSYNSLSDSYRTFVLNISTT

Query:  FEPQFYHQAASSSHWRDAMSVELKAMESNSTWSV------------------------------ARLIAKGYTQQEGLDYIETFSPVAKLVTIKVLLTVA
         EP+ Y QA+    W+ AM  EL A+E N TW +                              ARL+AKGYTQQEGLD+ +TFSPVAK+ +I++LL VA
Subjt:  FEPQFYHQAASSSHWRDAMSVELKAMESNSTWSV------------------------------ARLIAKGYTQQEGLDYIETFSPVAKLVTIKVLLTVA

Query:  TSLGWPLIQLDVNNAFLHGDLFEEVYMDLPLGYVHGKVCPQGQRLVCQLHKSIYGLKQASRQWFDKFSCALLSFGFKQSRSDYSLFTKGCGNTFVALLVY
            W L QLDVNNAFLHGDL EEVYM+LP G     +  QG++ VC+L KS+YGLKQASRQW+ K S ALLS+GF Q  SD SLF K   ++F+ALLVY
Subjt:  TSLGWPLIQLDVNNAFLHGDLFEEVYMDLPLGYVHGKVCPQGQRLVCQLHKSIYGLKQASRQWFDKFSCALLSFGFKQSRSDYSLFTKGCGNTFVALLVY

Query:  VDDI--------------------------------SSPEPLSSSKGIFLSQRKYALQLVEDEGLLASKPSALPMDPNSKLSSDQASFSSSPS-------
        VDD+                                   E   S  GI L QRKY L ++ED GL  SKP+A PM+   KLS++  +F   PS       
Subjt:  VDDI--------------------------------SSPEPLSSSKGIFLSQRKYALQLVEDEGLLASKPSALPMDPNSKLSSDQASFSSSPS-------

Query:  -CSVIASVHQRLAYA--------------------------------GDFLPASKSFQLKAFSDADWASCPDSRKSTTGYCVFLGDALVSWKAKKQSTVS
            +      LAY+                                G F P+S  FQLK FSD+DWA C D+R+S TG+ +FLG++L+SWK+KKQ TVS
Subjt:  -CSVIASVHQRLAYA--------------------------------GDFLPASKSFQLKAFSDADWASCPDSRKSTTGYCVFLGDALVSWKAKKQSTVS

Query:  RSSAEAEYRALAVTTAELVWLRQLLKELLIPSEVPSLLFCDNRQS
        RSSAEAEYRALA TT E+ WL   L++L I     +LL+ D++ +
Subjt:  RSSAEAEYRALAVTTAELVWLRQLLKELLIPSEVPSLLFCDNRQS

RVW82526.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]2.6e-21536.62Show/hide
Query:  IDQYVNPYYLHHSDGTNLVLVSELL--TESNYSSWYQAMLIGLTVKNK----------------------------------------------------
        ++ + +PY+LH+ D  +L LVS  L  + SNY SW ++M+  L  KNK                                                    
Subjt:  IDQYVNPYYLHHSDGTNLVLVSELL--TESNYSSWYQAMLIGLTVKNK----------------------------------------------------

Query:  -------------------LRREISNLMQEQLSVTAYFAKLKALWNELVSYRPSCTCDRCSCGGVQGLVQYFQTEHVVAFLMGLNESFSQIRTQLLLMEP
                           L+++I    Q    V  Y+ +LK+LW+EL  ++       C+CGG++  ++  Q E V+ FL+GLNESF+ I+ Q+LLMEP
Subjt:  -------------------LRREISNLMQEQLSVTAYFAKLKALWNELVSYRPSCTCDRCSCGGVQGLVQYFQTEHVVAFLMGLNESFSQIRTQLLLMEP

Query:  EPTIVKAFSLVAQEVEQCASTVVPIAAPSPTIDATALLVKNTQPNSAAKNTQQIKKKERSHCTHCNVLGHTIGRCYKLHGYPPGYRNQK--APSTAKP--
         P + K FSLV QE  Q + T     +P+ T   ++     ++ +S   +++   +K+R  CTHCN+LGHT+ RCYK+HGY PG+RN+    P+ ++P  
Subjt:  EPTIVKAFSLVAQEVEQCASTVVPIAAPSPTIDATALLVKNTQPNSAAKNTQQIKKKERSHCTHCNVLGHTIGRCYKLHGYPPGYRNQK--APSTAKP--

Query:  ------ESNPL------------PNLTPEQCQSILAMLQSHLSSVK------TTPESSSSSSIGNAHVAGTCSSLLPPSYSWIVDSGASAHICFSKDLFS
               +N L            P LT +Q   +LA+L  H SS        + P   S S+          SS L PS  WI+DSGA+ H+C +  +F 
Subjt:  ------ESNPL------------PNLTPEQCQSILAMLQSHLSSVK------TTPESSSSSSIGNAHVAGTCSSLLPPSYSWIVDSGASAHICFSKDLFS

Query:  SFKSVSGFSVTLPNQARLVVDYVGDVRLSSGLLLQNVLFIPTFQFNLISVSAITAQSSVSLSFADGYCLIQDKSSLKTIGKARFWQGLYLLDHHALDTVP
        S  S S  +VTLP   ++ +  +G + LS  L+L++VL+IPTFQFNLIS+SA+T  +  S  F   +C IQD S  K IG  R    LYLLD     ++ 
Subjt:  SFKSVSGFSVTLPNQARLVVDYVGDVRLSSGLLLQNVLFIPTFQFNLISVSAITAQSSVSLSFADGYCLIQDKSSLKTIGKARFWQGLYLLDHHALDTVP

Query:  --LPVYLNSTKHCSLVSNCNTDLWHDRLGHPSHKNLNALKPLLSFKE--SPSHPCFICPLAKQRRLSFSAHNRLVDKPFDLFHCDTWGPFRSLTHLGHRY
            V  N++ H +        LWH RL HPS+  L+ LKP L  +   + +  C ICPLAKQ+RL F  HN L   PFDL HCD WGPF   TH G RY
Subjt:  --LPVYLNSTKHCSLVSNCNTDLWHDRLGHPSHKNLNALKPLLSFKE--SPSHPCFICPLAKQRRLSFSAHNRLVDKPFDLFHCDTWGPFRSLTHLGHRY

Query:  FLTLVDDATRYTWIFFMKKKSDALTIVPSFFKLVDTQFQVAIKCFRSDNAPELSFTDFFRSSGVVHQFSCVECP----------------SKTLSWKGNI
        FLT+VDD TR TW+  ++ KSD  TI P FF +V T+F + IK  RSDNAPEL+ ++ F    V+H FSCVE P                ++ L ++ NI
Subjt:  FLTLVDDATRYTWIFFMKKKSDALTIVPSFFKLVDTQFQVAIKCFRSDNAPELSFTDFFRSSGVVHQFSCVECP----------------SKTLSWKGNI

Query:  ST---SSMLLVLYFFSQGCP---YNFGTPYFRLHSKNVDYSQLRVFGSLCFASTLKAGRSKFSPRVIPAVFLGYPQGMKAFKLYDIENKHVFISRDVVFH
                +L   +     P    N  TP+  LH K+  YS L+ FG LC++STL + R KFSPR +P VFLGYP G K +K+ D+E   + +SR+V F 
Subjt:  ST---SSMLLVLYFFSQGCP---YNFGTPYFRLHSKNVDYSQLRVFGSLCFASTLKAGRSKFSPRVIPAVFLGYPQGMKAFKLYDIENKHVFISRDVVFH

Query:  ESIFPFHSVVAQKDTSV---FLNDLVLPKSFNYAGTPDVSHGHELQLSVGERPVLPNEDIVSNDVFVQPSIDCSLGQSESQMLPQSLASQEPNVPLSDSL
        ES+FPF   ++Q + SV   F +  VL                         PV+P    VS      PS D S              +  PN P   S 
Subjt:  ESIFPFHSVVAQKDTSV---FLNDLVLPKSFNYAGTPDVSHGHELQLSVGERPVLPNEDIVSNDVFVQPSIDCSLGQSESQMLPQSLASQEPNVPLSDSL

Query:  GCSLPSITRHRACPQRKSNRQIKQPSYLKDYHCSLLHANAFP----PTQTKFPLNKYLSYNSLSDSYRTFVLNISTTFEPQFYHQAASSSHWRDAMSVEL
          + P  T H      +S+R  + P YL DYHC L  A++ P       T +PL+  +SYN LS S+R F ++IST  EP  Y +A     W+ AM  EL
Subjt:  GCSLPSITRHRACPQRKSNRQIKQPSYLKDYHCSLLHANAFP----PTQTKFPLNKYLSYNSLSDSYRTFVLNISTTFEPQFYHQAASSSHWRDAMSVEL

Query:  KAMESNSTWSV------------------------------ARLIAKGYTQQEGLDYIETFSPVAKLVTIKVLLTVATSLGWPLIQLDVNNAFLHGDLFE
        +A+ESN+TWS+                              ARL+AKG+TQQEG+D+   F    +                                 +
Subjt:  KAMESNSTWSV------------------------------ARLIAKGYTQQEGLDYIETFSPVAKLVTIKVLLTVATSLGWPLIQLDVNNAFLHGDLFE

Query:  EVYMDLPLGYVHGKVCPQGQRLVCQLHKSIYGLKQASRQWFDKFSCALLSFGFKQSRSDYSLFTKGCGNTFVALLVYVDDI-------------------
        EV+M LP GY   +       +VC+LHKSIYGL+QASRQWF KFS  L+S GF+QS SDYSLF K  GN F+ALLVYVDDI                   
Subjt:  EVYMDLPLGYVHGKVCPQGQRLVCQLHKSIYGLKQASRQWFDKFSCALLSFGFKQSRSDYSLFTKGCGNTFVALLVYVDDI-------------------

Query:  -------------SSPEPLSSSKGIFLSQRKYALQLVEDEGLLASKPSALPMDPNSKLSSDQ----------------------------------ASFS
                        E   S+KGI ++QRKYAL+L+ + G L  KP+  PM PN +LS D                                   + F 
Subjt:  -------------SSPEPLSSSKGIFLSQRKYALQLVEDEGLLASKPSALPMDPNSKLSSDQ----------------------------------ASFS

Query:  SSPSCSVIASVHQRLAY------AGDFLPASKSFQLKAFSDADWASCPDSRKSTTGYCVFLGDALVSWKAKKQSTVSRSSAEAEYRALAVTTAELVWLRQ
        S P    + +V++ L Y       G F  AS S QLKAFSD+DWA+CPDSRKS TG+C+FL D+L+SWK+KKQ TVSRSSAEAEYRA+A  T EL WL  
Subjt:  SSPSCSVIASVHQRLAY------AGDFLPASKSFQLKAFSDADWASCPDSRKSTTGYCVFLGDALVSWKAKKQSTVSRSSAEAEYRALAVTTAELVWLRQ

Query:  LLKELLIPSEVPSLLFCDNR
        LLK+L IP   P+LL+CDN+
Subjt:  LLKELLIPSEVPSLLFCDNR

TrEMBL top hitse value%identityAlignment
A0A2N9EHN7 Integrase catalytic domain-containing protein1.1e-23239.04Show/hide
Query:  LRREISNLMQEQLSVTAYFAKLKALWNELVSYR--PSCTCD-RCSCGGVQGLVQYFQTEHVVAFLMGLNESFSQIRTQLLLMEPEPTIVKAFSLVAQEVE
        L++ I++L Q+ + V+ YF +LK LW+E ++YR  P CTC  +C CG  + L+ Y   ++V +FLMGLN+SF+ +R Q+LLMEP P I K FSL+  + +
Subjt:  LRREISNLMQEQLSVTAYFAKLKALWNELVSYR--PSCTCD-RCSCGGVQGLVQYFQTEHVVAFLMGLNESFSQIRTQLLLMEPEPTIVKAFSLVAQEVE

Query:  QCASTVVPIAAPSPTIDATALLVK------------NTQPNSAAKNTQQIKK--------KERSHCTHCNVLGHTIGRCYKLHGYPPGYRNQ--------
        Q  + ++P+    PT+ +TALL +            NT PN+    T   K+        K    C+HC   GHT  +CYKLHGYPPG+R++        
Subjt:  QCASTVVPIAAPSPTIDATALLVK------------NTQPNSAAKNTQQIKK--------KERSHCTHCNVLGHTIGRCYKLHGYPPGYRNQ--------

Query:  KAPSTAKPES------NPLPNLT--PEQCQSILAMLQSHLSSVKTTPES------SSSSSIG----NAHVAG--TCSS----------------LLPPSY
        +  S+A P S        +PNL     QCQ +L ML +      +  +S      +S SSI     ++++AG  TC S                 + P +
Subjt:  KAPSTAKPES------NPLPNLT--PEQCQSILAMLQSHLSSVKTTPES------SSSSSIG----NAHVAG--TCSS----------------LLPPSY

Query:  S---WIVDSGASAHICFSKDLFSSFKSVSGFSVTLPNQARLVVDYVGDVRLSSGLLLQNVLFIPTFQFNLISVSAITAQSSVSLSFADGYCLIQDKSSLK
        S   W++D+GA+ H+  +   +++   V   SV LPN   ++V ++G V+++  LLL +VL +P+F FNLISVS +T+     + F   YC IQD    +
Subjt:  S---WIVDSGASAHICFSKDLFSSFKSVSGFSVTLPNQARLVVDYVGDVRLSSGLLLQNVLFIPTFQFNLISVSAITAQSSVSLSFADGYCLIQDKSSLK

Query:  TIGKARFWQGLYLLDHHALDTVPLPVYLNST----KH---CSLVSNCNTDL--WHDRLGHPSHKN---LNALKPLLSFKESPSHPCFICPLAKQRRLSFS
         IG  +   GLYLLD  +  T      L+S     KH    S + N N D+  WH R GHPS      L+++ P +S     +  C +CPLAKQ+RL F 
Subjt:  TIGKARFWQGLYLLDHHALDTVPLPVYLNST----KH---CSLVSNCNTDL--WHDRLGHPSHKN---LNALKPLLSFKESPSHPCFICPLAKQRRLSFS

Query:  AHNRLVDKPFDLFHCDTWGPFRSLTHLGHRYFLTLVDDATRYTWIFFMKKKSDALTIVPSFFKLVDTQFQVAIKCFRSDNAPELSFTDFFRSSGVVHQFS
          N L    FDL H D WGP+   T  G+RYFLTLVDD TR TWI+ M+ KSD   ++ SF  ++ TQF   IK  RSDN  E    +F+ S G++HQ S
Subjt:  AHNRLVDKPFDLFHCDTWGPFRSLTHLGHRYFLTLVDDATRYTWIFFMKKKSDALTIVPSFFKLVDTQFQVAIKCFRSDNAPELSFTDFFRSSGVVHQFS

Query:  CVECPSKT----------LSWKGNISTSSMLLVLYF-----------FSQGCP-YNFGTPYFRLHSKNVDYSQLRVFGSLCFASTLKAGRSKFSPRVIPA
        CVE P +           L+   ++   S L + Y+               CP  +  +P+  L  K   Y+ L+VFG LCFASTL + R+KF PR    
Subjt:  CVECPSKT----------LSWKGNISTSSMLLVLYF-----------FSQGCP-YNFGTPYFRLHSKNVDYSQLRVFGSLCFASTLKAGRSKFSPRVIPA

Query:  VFLGYPQGMKAFKLYDIENKHVFISRDVVFHESIFPFHSVVAQKDTSVFLNDLVLPKSFNYAGTPDVSHGHELQLSVGERPVLPNEDIVSNDVFVQPSID
        VFLGYP G+K +KL D+    VFISRDVVFHE+IFPF +     D + FLN    P S      P  S   +  L     P+ P+  + S      P  D
Subjt:  VFLGYPQGMKAFKLYDIENKHVFISRDVVFHESIFPFHSVVAQKDTSVFLNDLVLPKSFNYAGTPDVSHGHELQLSVGERPVLPNEDIVSNDVFVQPSID

Query:  CSLGQSESQMLPQSLASQEPNVPLSDSLGCSLPSITRHRACPQRKSNRQIKQPSYLKDYHCSLLH---ANAFPP---TQTKFPLNKYLSYNSLSDSYRTF
         S     +     SL   E N P     G S+ S       P R+S R  K P+YL+DYHC L H   + + PP   + T +PL+  LSY+ LS ++R F
Subjt:  CSLGQSESQMLPQSLASQEPNVPLSDSLGCSLPSITRHRACPQRKSNRQIKQPSYLKDYHCSLLH---ANAFPP---TQTKFPLNKYLSYNSLSDSYRTF

Query:  VLNISTTFEPQFYHQAASSSHWRDAMSVELKAMESNSTWSV------------------------------ARLIAKGYTQQEGLDYIETFSPVAKLVTI
         L+++   EP  +HQA  + HW++AM  EL A+E+N+TW++                              ARL+AKGYTQQEGLDY ETFSPVAK  T+
Subjt:  VLNISTTFEPQFYHQAASSSHWRDAMSVELKAMESNSTWSV------------------------------ARLIAKGYTQQEGLDYIETFSPVAKLVTI

Query:  KVLLTVATSLGWPLIQLDVNNAFLHGDLFEEVYMDLPLGYVHGKVCPQGQRLVCQLHKSIYGLKQASRQWFDKFSCALLSFGFKQSRSDYSLFTKGCGNT
        + LL VA++  W L QLDVNNAFLHGDL EEVYM LPLG+       +   LVC+L+KS+YGLKQASRQWF KFS  ++  GF QS SDYSLFT+  G  
Subjt:  KVLLTVATSLGWPLIQLDVNNAFLHGDLFEEVYMDLPLGYVHGKVCPQGQRLVCQLHKSIYGLKQASRQWFDKFSCALLSFGFKQSRSDYSLFTKGCGNT

Query:  FVALLVYVDDI--------------------------------SSPEPLSSSKGIFLSQRKYALQLVEDEGLLASKPSALPMDPNSKLSSDQASFSSSPS
        F+ALLVYVDDI                                   E   S+KGI L QRKYAL ++ D G+L SKP A PM+ N K+S       + PS
Subjt:  FVALLVYVDDI--------------------------------SSPEPLSSSKGIFLSQRKYALQLVEDEGLLASKPSALPMDPNSKLSSDQASFSSSPS

Query:  -----------------------------CSVIASVHQRLAY-----------AGDFLPASKSFQLKAFSDADWASCPDSRKSTTGYCVFLGDALVSWKA
                                      S    +H   AY            G F P+    QLKAFSD+DWA CPD+R+S TGYCV++G +L+SWK+
Subjt:  -----------------------------CSVIASVHQRLAY-----------AGDFLPASKSFQLKAFSDADWASCPDSRKSTTGYCVFLGDALVSWKA

Query:  KKQSTVSRSSAEAEYRALAVTTAELVWLRQLLKELLIPSEVPSLLFCDNR
        KKQ TVSRSSAEAEYRA+A    EL+WL  LL EL  P    +LLFCD++
Subjt:  KKQSTVSRSSAEAEYRALAVTTAELVWLRQLLKELLIPSEVPSLLFCDNR

A0A2N9EN17 Uncharacterized protein2.2e-23336.67Show/hide
Query:  RSFLSSIQFKILGGLRNIRILEVGCTDS-VKPTNPTVIDQYV--NPYYLHHSDGTNLVLVSELLTESN----------YSSWYQAMLIGLT-----VKNK
        ++FLS  +F++   + +    E+  T S    +  + +D  V  +PYYLH  D ++L+LV+E L              Y+ W +   + +T     V  K
Subjt:  RSFLSSIQFKILGGLRNIRILEVGCTDS-VKPTNPTVIDQYV--NPYYLHHSDGTNLVLVSELLTESN----------YSSWYQAMLIGLT-----VKNK

Query:  --------------------------------LRREISNLMQEQLSVTAYFAKLKALWNELVSYRPS---CTCDRCSCGGVQGLVQYFQTEHVVAFLMGL
                                        L +EI +L Q Q SV+ Y+  L+ LW EL++Y P+   C    C CG +   ++ ++   ++ FLMGL
Subjt:  --------------------------------LRREISNLMQEQLSVTAYFAKLKALWNELVSYRPS---CTCDRCSCGGVQGLVQYFQTEHVVAFLMGL

Query:  NESFSQIRTQLLLMEPEPTIVKAFSLVAQEVEQCASTVVPIAAPSPTIDATALLVKNTQPNSAAKNTQQIKKKERSHCTHCNVLGHTIGRCYKLHGYPPG
        NESF  +R Q+LLM+P P I K FSL+ QE  Q +   +  +  +P +++TAL VK+  P   A      +KKER  CTHC +LGHT+ +CYKLHGYPPG
Subjt:  NESFSQIRTQLLLMEPEPTIVKAFSLVAQEVEQCASTVVPIAAPSPTIDATALLVKNTQPNSAAKNTQQIKKKERSHCTHCNVLGHTIGRCYKLHGYPPG

Query:  YRNQ-KAPSTAKPES---------------NPLP-NLTPEQCQSILAMLQSH-----LSSVKTTPESSSSSSIGNAHVAGT---CSSLL--------PPS
        YR + KAP+ A   S               +PL  +    QC+  LA + S      + +  T+P   ++++  ++  + T   CS+           PS
Subjt:  YRNQ-KAPSTAKPES---------------NPLP-NLTPEQCQSILAMLQSH-----LSSVKTTPESSSSSSIGNAHVAGT---CSSLL--------PPS

Query:  Y----------------SWIVDSGASAHICFSKDLFSSFKSVSGFSVTLPNQARLVVDYVGDVRLSSGLLLQNVLFIPTFQFNLISVSAITAQSSVSLSF
        +                SWI+D+GA+ H+  S    +S  S+   +V LPN   + V ++G V+LSS L+L +VL +P+F FNLISVS +   S+  L F
Subjt:  Y----------------SWIVDSGASAHICFSKDLFSSFKSVSGFSVTLPNQARLVVDYVGDVRLSSGLLLQNVLFIPTFQFNLISVSAITAQSSVSLSF

Query:  ADGYCLIQDKSSLKTIGKARFWQGLYLLDHHALDTVPLPVYLNSTKHCS--------LVSN-CNTDLWHDRLGHPSHKNLNALKPLLSFKESPSHP---C
           YC IQ  +  + IG  +   GLYLL+   L +      + S+   S         VSN   T LWH RLGHPSH  +  L  L+    S +     C
Subjt:  ADGYCLIQDKSSLKTIGKARFWQGLYLLDHHALDTVPLPVYLNSTKHCS--------LVSN-CNTDLWHDRLGHPSHKNLNALKPLLSFKESPSHP---C

Query:  FICPLAKQRRLSFSAHNRLVDKPFDLFHCDTWGPFRSLTHLGHRYFLTLVDDATRYTWIFFMKKKSDALTIVPSFFKLVDTQFQVAIKCFRSDNAPELSF
         +CPLAK +RL F          FDL HCD WGP+   TH G +YFLT+VDD +R TWI+ M  K+D   ++ SFF +++TQF   IK  RSDN  E   
Subjt:  FICPLAKQRRLSFSAHNRLVDKPFDLFHCDTWGPFRSLTHLGHRYFLTLVDDATRYTWIFFMKKKSDALTIVPSFFKLVDTQFQVAIKCFRSDNAPELSF

Query:  TDFFRSSGVVHQFSCVECP----------------SKTLSWKGNISTS---SMLLVLYFFSQGCP---YNFGTPYFRLHSKNVDYSQLRVFGSLCFASTL
        +DFF S GV+HQ SCV+ P                ++ L ++ N+  +    ++L   +     P       TP+  L      YS L+VFG L +AS L
Subjt:  TDFFRSSGVVHQFSCVECP----------------SKTLSWKGNISTS---SMLLVLYFFSQGCP---YNFGTPYFRLHSKNVDYSQLRVFGSLCFASTL

Query:  KAGRSKFSPRVIPAVFLGYPQGMKAFKLYDIENKHVFISRDVVFHESIFPFH---------------SVVAQKDTSVFLNDLV--------------LPK
           ++KF  R IP VFLGYP G+K +KL+D+  K   +SRDVVFHESIFPF+               SV A  D++ +L   +              LP 
Subjt:  KAGRSKFSPRVIPAVFLGYPQGMKAFKLYDIENKHVFISRDVVFHESIFPFH---------------SVVAQKDTSVFLNDLV--------------LPK

Query:  SFNYAGTPDVSH------GHELQLSVGERPVLPNEDIVSNDVFVQPSIDCSLGQSESQMLPQSLASQEPNVPLSDSLGCSLPSITRHRACPQRKSNRQIK
               P + H       H LQ  +   P  P +    + +   P             L  S    +P+ PL+      + + +   +   RKS+R +K
Subjt:  SFNYAGTPDVSH------GHELQLSVGERPVLPNEDIVSNDVFVQPSIDCSLGQSESQMLPQSLASQEPNVPLSDSLGCSLPSITRHRACPQRKSNRQIK

Query:  QPSYLKDYHCSLLHA--NAFP-PTQTKFPLNKYLSYNSLSDSYRTFVLNISTTFEPQFYHQAASSSHWRDAMSVELKAMESNSTWSV-------------
         PSYL+DYHC L  +   +FP       P+   LSY+ LSDS++ F L +ST  EP FYH+A  S  W +AMS EL A+E+N TW +             
Subjt:  QPSYLKDYHCSLLHA--NAFP-PTQTKFPLNKYLSYNSLSDSYRTFVLNISTTFEPQFYHQAASSSHWRDAMSVELKAMESNSTWSV-------------

Query:  -----------------ARLIAKGYTQQEGLDYIETFSPVAKLVTIKVLLTVATSLGWPLIQLDVNNAFLHGDLFEEVYMDLPLGYVHGKVCPQGQRLVC
                         ARL+AKGY QQEGLDY ETFSPVAKLVT++  + +A + GW L QLDVNNAFLHG+L EEVYM LPLGY      P     VC
Subjt:  -----------------ARLIAKGYTQQEGLDYIETFSPVAKLVTIKVLLTVATSLGWPLIQLDVNNAFLHGDLFEEVYMDLPLGYVHGKVCPQGQRLVC

Query:  QLHKSIYGLKQASRQWFDKFSCALLSFGFKQSRSDYSLFTKGCGNTFVALLVYVDDI----SSP----------------------------EPLSSSKG
        +L KS+YGLKQASRQWF KFS  LL  GF QS+ DYSLFTK  G+TF+ALLVYVDDI    ++P                            E   S +G
Subjt:  QLHKSIYGLKQASRQWFDKFSCALLSFGFKQSRSDYSLFTKGCGNTFVALLVYVDDI----SSP----------------------------EPLSSSKG

Query:  IFLSQRKYALQLVEDEGLLASKPSALPMDPNSKLSSDQASFSSSP--------------------SCSV--------------IASVHQRLAY------A
        I L QRKY L ++ED G LASKP   PM+ + KLS D  S  S P                    S SV              + + H+ L Y       
Subjt:  IFLSQRKYALQLVEDEGLLASKPSALPMDPNSKLSSDQASFSSSP--------------------SCSV--------------IASVHQRLAY------A

Query:  GDFLPASKSFQLKAFSDADWASCPDSRKSTTGYCVFLGDALVSWKAKKQSTVSRSSAEAEYRALAVTTAELVWLRQLLKELLIPSEVPSLLFCDNRQSTL
        G F P + S QLKAF D+DWA C D+R+S TG+C+FLGD+L+SW++KKQS VSRSS EAEYRA+A+TT E+ WL  LL++  I   + +LLFCDN Q+TL
Subjt:  GDFLPASKSFQLKAFSDADWASCPDSRKSTTGYCVFLGDALVSWKAKKQSTVSRSSAEAEYRALAVTTAELVWLRQLLKELLIPSEVPSLLFCDNRQSTL

A0A2N9G1L6 Integrase catalytic domain-containing protein7.5e-23736.68Show/hide
Query:  NPYYLHHSDGTNLVLVSELLTESNYSSWYQAMLIGLTVKNK-----------------------------------------------------------
        +PYYLH SD ++L+LV+E LT  N+ SW+++M + LT+KNK                                                           
Subjt:  NPYYLHHSDGTNLVLVSELLTESNYSSWYQAMLIGLTVKNK-----------------------------------------------------------

Query:  --------------LRREISNLMQEQLSVTAYFAKLKALWNELVSYRPSCTCD---RCSCGGVQGLVQYFQTEHVVAFLMGLNESFSQIRTQLLLMEPEP
                      L ++I +L Q Q SV+ Y+  L+ LW EL++Y P+  C+    CSCG +   ++ ++   V+ FLMGLNESF+ +R Q+LLM+P P
Subjt:  --------------LRREISNLMQEQLSVTAYFAKLKALWNELVSYRPSCTCD---RCSCGGVQGLVQYFQTEHVVAFLMGLNESFSQIRTQLLLMEPEP

Query:  TIVKAFSLVAQEVEQCASTVVPIAAPSPTIDATALLVKNTQPNSAAKNTQQIKKKERSHCTHCNVLGHTIGRCYKLHGYPPGYRNQ-KAPSTAKPES---
         I K FSL+ QE  Q +   +  +  +P +++TAL+ K   P           KKER  CTHC +LGHT+ +CYKLHG+PPGY+ + KAP+ A   S   
Subjt:  TIVKAFSLVAQEVEQCASTVVPIAAPSPTIDATALLVKNTQPNSAAKNTQQIKKKERSHCTHCNVLGHTIGRCYKLHGYPPGYRNQ-KAPSTAKPES---

Query:  ------NPLPNLTPEQCQSILAMLQSHLSSVKTTPESSSSSSIGNAHVAGTC----SSLLPPS-------------------------------------
                   ++P Q   + A  +  L+ +     ++S S++ N H   T     SS  P S                                     
Subjt:  ------NPLPNLTPEQCQSILAMLQSHLSSVKTTPESSSSSSIGNAHVAGTC----SSLLPPS-------------------------------------

Query:  -YSWIVDSGASAHICFSKDLFSSFKSVSGFSVTLPNQARLVVDYVGDVRLSSGLLLQNVLFIPTFQFNLISVSAITAQSSVSLSFADGYCLIQDKSSLKT
          SWI+D+GA+ H+  S   F++  S+   +V LPN   + V ++G V+LSS L+L +VL +P+F FNLISVS +   S+  L F   YC IQ  +  + 
Subjt:  -YSWIVDSGASAHICFSKDLFSSFKSVSGFSVTLPNQARLVVDYVGDVRLSSGLLLQNVLFIPTFQFNLISVSAITAQSSVSLSFADGYCLIQDKSSLKT

Query:  IGKARFWQGLYLLDHHAL-----DTVPLPVYLNSTKHCSLVSN-CNTDLWHDRLGHPSHKNLNALK---PLLSFKESPSHPCFICPLAKQRRLSFSAHNR
        IG  +   GLY+LD   L      +     + +     + VSN   T LWH RLGHPS   +  L    P L F    S+ C +CPLAK +RL F     
Subjt:  IGKARFWQGLYLLDHHAL-----DTVPLPVYLNSTKHCSLVSN-CNTDLWHDRLGHPSHKNLNALK---PLLSFKESPSHPCFICPLAKQRRLSFSAHNR

Query:  LVDKPFDLFHCDTWGPFRSLTHLGHRYFLTLVDDATRYTWIFFMKKKSDALTIVPSFFKLVDTQFQVAIKCFRSDNAPELSFTDFFRSSGVVHQFSCVEC
             FDL HCD WGP+   TH G +YFLT+VDD +R TW++ M  K    +++ SFF +++TQF   IK  RSDN  E   +DFF S GV+HQ SCV+ 
Subjt:  LVDKPFDLFHCDTWGPFRSLTHLGHRYFLTLVDDATRYTWIFFMKKKSDALTIVPSFFKLVDTQFQVAIKCFRSDNAPELSFTDFFRSSGVVHQFSCVEC

Query:  P----------------SKTLSWKGNISTS---SMLLVLYFFSQGCP---YNFGTPYFRLHSKNVDYSQLRVFGSLCFASTLKAGRSKFSPRVIPAVFLG
        P                ++ + ++ N+  S     +L   +     P    +  TPY  L  K   Y+ L+VFG L +AS L + ++KF  +  P VFLG
Subjt:  P----------------SKTLSWKGNISTS---SMLLVLYFFSQGCP---YNFGTPYFRLHSKNVDYSQLRVFGSLCFASTLKAGRSKFSPRVIPAVFLG

Query:  YPQGMKAFKLYDIENKHVFISRDVVFHESIFPFHSVVAQKDTSVFLNDLVLPK--SFNYAGTPDVSHGHELQLSVGERPVLPNEDIVSNDVFVQPSIDCS
        YP G K +KL D+     F+SRDVVFHESIFPFH+  +  +    L+ +       F++A  P++     L       P  P+           PS +  
Subjt:  YPQGMKAFKLYDIENKHVFISRDVVFHESIFPFHSVVAQKDTSVFLNDLVLPK--SFNYAGTPDVSHGHELQLSVGERPVLPNEDIVSNDVFVQPSIDCS

Query:  LGQSESQMLPQSLASQEPNVPLSDSLGCSLPSITRHRACPQRKSNRQIKQPSYLKDYHCSL---LHANAFPPTQTKFPLNKYLSYNSLSDSYRTFVLNIS
           S S  LP + AS         S+  S   IT H   P R+S+R +K PSYL+DYHCSL   L ++      T +P+   LSY+ LS  ++ F L IS
Subjt:  LGQSESQMLPQSLASQEPNVPLSDSLGCSLPSITRHRACPQRKSNRQIKQPSYLKDYHCSL---LHANAFPPTQTKFPLNKYLSYNSLSDSYRTFVLNIS

Query:  TTFEPQFYHQAASSSHWRDAMSVELKAMESNSTWSV------------------------------ARLIAKGYTQQEGLDYIETFSPVAKLVTIKVLLT
        T  EPQFYH+A  S HW DAMS EL+A+E+N TW +                              ARL+AKGY Q+EG+DY ETFSPVAKLVT++  + 
Subjt:  TTFEPQFYHQAASSSHWRDAMSVELKAMESNSTWSV------------------------------ARLIAKGYTQQEGLDYIETFSPVAKLVTIKVLLT

Query:  VATSLGWPLIQLDVNNAFLHGDLFEEVYMDLPLGYVHGKVCPQGQRLVCQLHKSIYGLKQASRQWFDKFSCALLSFGFKQSRSDYSLFTKGCGNTFVALL
        +A + GWP+ QLDVNNAFLHGDL EEV+M LP G+ + +  P     VC+L KS+YGLKQASRQWF KFS  LL+ GF QS+ DYSLFTK  G+ F+ALL
Subjt:  VATSLGWPLIQLDVNNAFLHGDLFEEVYMDLPLGYVHGKVCPQGQRLVCQLHKSIYGLKQASRQWFDKFSCALLSFGFKQSRSDYSLFTKGCGNTFVALL

Query:  VYVDD--ISSPEPLS------------------------------SSKGIFLSQRKYALQLVEDEGLLASKPSALPMDPNSKLSSDQ-------------
        VYVDD  I+S +  S                              ++KGI L QRKY L +++D G L SKP   PM+ + KLS D+             
Subjt:  VYVDD--ISSPEPLS------------------------------SSKGIFLSQRKYALQLVEDEGLLASKPSALPMDPNSKLSSDQ-------------

Query:  ---------------------ASFSSSPSCSVIASVHQRLAY------AGDFLPASKSFQLKAFSDADWASCPDSRKSTTGYCVFLGDALVSWKAKKQST
                             + F   P    + + H+ L Y       G F P+  S QLKAF D+DWA C D+R+S TG+C+FLGD+L+SW++KKQS 
Subjt:  ---------------------ASFSSSPSCSVIASVHQRLAY------AGDFLPASKSFQLKAFSDADWASCPDSRKSTTGYCVFLGDALVSWKAKKQST

Query:  VSRSSAEAEYRALAVTTAELVWLRQLLKELLIPSEVPSLLFCDNR
        VSRSSAEAEYRA+AVTT E+ WL  LL +  I   + ++LFCDN+
Subjt:  VSRSSAEAEYRALAVTTAELVWLRQLLKELLIPSEVPSLLFCDNR

A0A2N9H2Y3 Integrase catalytic domain-containing protein2.2e-23339.02Show/hide
Query:  LRREISNLMQEQLSVTAYFAKLKALWNELVSYR--PSCTCD-RCSCGGVQGLVQYFQTEHVVAFLMGLNESFSQIRTQLLLMEPEPTIVKAFSLVAQEVE
        L++ I++L Q+ + V+ YF +LK LW+E ++YR  P CTC  +C CG  + L+ Y   ++V +FLMGLN+SF+ +R Q+LLMEP P I K FSL+  + +
Subjt:  LRREISNLMQEQLSVTAYFAKLKALWNELVSYR--PSCTCD-RCSCGGVQGLVQYFQTEHVVAFLMGLNESFSQIRTQLLLMEPEPTIVKAFSLVAQEVE

Query:  QCASTVVPIAAPSPTIDATALLVK-NTQPNSA-----------------AKNTQQIKKKERSH--CTHCNVLGHTIGRCYKLHGYPPGYRNQ--------
        Q  + ++P+    PT+D+TALL +    PN+A                  K   Q  +K++    C+HC   GHT  +CYKLHGYPPG+R++        
Subjt:  QCASTVVPIAAPSPTIDATALLVK-NTQPNSA-----------------AKNTQQIKKKERSH--CTHCNVLGHTIGRCYKLHGYPPGYRNQ--------

Query:  KAPSTAKPES------NPLPNLT--PEQCQSILAML-------------QSHLSSVKTTPESSSSSSIGNAHVAGTCSS------------LLPPSYS--
        +  S+A P S        +PNLT    QCQ +L ML             Q+H ++   +   S S+  G      T S+             + P++S  
Subjt:  KAPSTAKPES------NPLPNLT--PEQCQSILAML-------------QSHLSSVKTTPESSSSSSIGNAHVAGTCSS------------LLPPSYS--

Query:  -WIVDSGASAHICFSKDLFSSFKSVSGFSVTLPNQARLVVDYVGDVRLSSGLLLQNVLFIPTFQFNLISVSAITAQSSVSLSFADGYCLIQDKSSLKTIG
         W++D+GA  H+  +   +++   V   SV LPN   ++V ++G V+L+  LLL NVL +P+F FNLISVS +T+     + F   YC IQD    + IG
Subjt:  -WIVDSGASAHICFSKDLFSSFKSVSGFSVTLPNQARLVVDYVGDVRLSSGLLLQNVLFIPTFQFNLISVSAITAQSSVSLSFADGYCLIQDKSSLKTIG

Query:  KARFWQGLYLLDHHALDTV---------PLPVYLNSTKHCSLVSNCNTDL--WHDRLGHPSHKN---LNALKPLLSFKESPSHPCFICPLAKQRRLSFSA
          R   GLYLLD  +  T           LP +L S    S + N N D+  WH RLGHPS      L+++ P  S+  + +  C +CPLAKQR+L F  
Subjt:  KARFWQGLYLLDHHALDTV---------PLPVYLNSTKHCSLVSNCNTDL--WHDRLGHPSHKN---LNALKPLLSFKESPSHPCFICPLAKQRRLSFSA

Query:  HNRLVDKPFDLFHCDTWGPFRSLTHLGHRYFLTLVDDATRYTWIFFMKKKSDALTIVPSFFKLVDTQFQVAIKCFRSDNAPELSFTDFFRSSGVVHQFSC
        +N L  K FDL H D WGP+   T  G+RYFLTLVDD TR TWI+ M+ KSD  T++ SF  ++ TQF   IK  RSDN  E    DF+ S G++HQ SC
Subjt:  HNRLVDKPFDLFHCDTWGPFRSLTHLGHRYFLTLVDDATRYTWIFFMKKKSDALTIVPSFFKLVDTQFQVAIKCFRSDNAPELSFTDFFRSSGVVHQFSC

Query:  VECP----------------SKTLSWKGNISTS----SMLLVLYFFSQ-GCP-YNFGTPYFRLHSKNVDYSQLRVFGSLCFASTLKAGRSKFSPRVIPAV
        VE P                ++ L ++ ++        +   +Y  ++  CP  +  +P+  L  K   Y+ L+VFG LCFASTL   R+KF PR     
Subjt:  VECP----------------SKTLSWKGNISTS----SMLLVLYFFSQ-GCP-YNFGTPYFRLHSKNVDYSQLRVFGSLCFASTLKAGRSKFSPRVIPAV

Query:  FLGYPQGMKAFKLYDIENKHVFISRDVVFHESIFPFHSVVAQKDTSVFLNDLVLPKSFNYAGTPDVSHGHELQLSVGERPVLPNEDIVSNDVFVQPSIDC
        FLGYP G+K +KL ++    V ISRDVVFHE+IFPF +     D S FL+    P S      P  SH      S    P  P            P +  
Subjt:  FLGYPQGMKAFKLYDIENKHVFISRDVVFHESIFPFHSVVAQKDTSVFLNDLVLPKSFNYAGTPDVSHGHELQLSVGERPVLPNEDIVSNDVFVQPSIDC

Query:  SLGQSESQMLPQSLASQEPNVP--LSDSLGCSLPSITRHRACPQRKSNRQIKQPSYLKDYHCSLLH---ANAFPPTQTK---FPLNKYLSYNSLSDSYRT
        SL   ++  L    +S  P++    +DS G S+ S       P R+S R  K P+YL+DYHC L H   + + PP  +    +PL+  LSY+ LS ++R 
Subjt:  SLGQSESQMLPQSLASQEPNVP--LSDSLGCSLPSITRHRACPQRKSNRQIKQPSYLKDYHCSLLH---ANAFPPTQTK---FPLNKYLSYNSLSDSYRT

Query:  FVLNISTTFEPQFYHQAASSSHWRDAMSVELKAMESNSTWSV------------------------------ARLIAKGYTQQEGLDYIETFSPVAKLVT
        F L+++   EP F+HQA  S HW++AM  EL A+E+N+TW++                              ARL+AKGYTQQEGLDY ETFSPVAK  T
Subjt:  FVLNISTTFEPQFYHQAASSSHWRDAMSVELKAMESNSTWSV------------------------------ARLIAKGYTQQEGLDYIETFSPVAKLVT

Query:  IKVLLTVATSLGWPLIQLDVNNAFLHGDLFEEVYMDLPLGYVHGKVCPQGQRLVCQLHKSIYGLKQASRQWFDKFSCALLSFGFKQSRSDYSLFTKGCGN
        ++ LL VA+   W L QLDVNNAFLHGDL EEVYM LP G+      P    LVC+L+KS+YGLKQASRQWF KFS  ++  GF QS+SDYSLFT+  G 
Subjt:  IKVLLTVATSLGWPLIQLDVNNAFLHGDLFEEVYMDLPLGYVHGKVCPQGQRLVCQLHKSIYGLKQASRQWFDKFSCALLSFGFKQSRSDYSLFTKGCGN

Query:  TFVALLVYVDDI-----------------------------SSPEPLSSSKGIFLSQRKYALQLVEDEGLLASKPSALPMDPNSKLSSDQ----------
         F+ALLVYVDDI                                E   S+KGI L QRKYAL ++ D G+L SKP   PM+ N K+S             
Subjt:  TFVALLVYVDDI-----------------------------SSPEPLSSSKGIFLSQRKYALQLVEDEGLLASKPSALPMDPNSKLSSDQ----------

Query:  ------------------------ASFSSSPSCSVIASVHQRLAY------AGDFLPASKSFQLKAFSDADWASCPDSRKSTTGYCVFLGDALVSWKAKK
                                + F S P+   +++ ++ L Y       G F P+    QLKAFSD+DWA C D+R+S TGYCV++GD+L+SWK+KK
Subjt:  ------------------------ASFSSSPSCSVIASVHQRLAY------AGDFLPASKSFQLKAFSDADWASCPDSRKSTTGYCVFLGDALVSWKAKK

Query:  QSTVSRSSAEAEYRALAVTTAELVWLRQLLKELLIPSEVPSLLFCDNR
        Q TVSRSSAEAEYRA+A    EL+WL  LL EL  P    +LLFCD++
Subjt:  QSTVSRSSAEAEYRALAVTTAELVWLRQLLKELLIPSEVPSLLFCDNR

A0A2N9HYD2 Integrase catalytic domain-containing protein1.6e-23436.89Show/hide
Query:  DQYVNPYYLHHSDGTNLVLVSELLTESNYSSWYQAMLIGLTVKNK-------------------------------------------------------
        D+  N ++LHH D    +LVS+ L+  NY +W ++M++ LT KNK                                                       
Subjt:  DQYVNPYYLHHSDGTNLVLVSELLTESNYSSWYQAMLIGLTVKNK-------------------------------------------------------

Query:  -----------------LRREISNLMQEQLSVTAYFAKLKALWNELVSYRPSCTCDRCSCGGVQGLVQYFQTEHVVAFLMGLNESFSQIRTQLLLMEPEP
                         +++ IS+L Q+Q +V+AYF KLK+LW+EL +YR   +   CSCG ++ L+   Q E+V+ FLMGLN+SF+ +R Q+L+MEP P
Subjt:  -----------------LRREISNLMQEQLSVTAYFAKLKALWNELVSYRPSCTCDRCSCGGVQGLVQYFQTEHVVAFLMGLNESFSQIRTQLLLMEPEP

Query:  TIVKAFSLVAQEVEQCASTVVPIAAPSPTIDATALLVKNTQPNSAAKNTQQIKKKERSHCTHCNVLGHTIGRCYKLHGYPPGYRNQKAPSTAK-----PE
         I KAFSLV QE  Q +   + + A   + D+ AL  ++  P +         KKER  C+HC + GH + +CYKLHG+PPG++ + A   A       E
Subjt:  TIVKAFSLVAQEVEQCASTVVPIAAPSPTIDATALLVKNTQPNSAAKNTQQIKKKERSHCTHCNVLGHTIGRCYKLHGYPPGYRNQKAPSTAK-----PE

Query:  SNPLPNLTPEQCQSILAML--QSHLSSVKT--------------------TPESSSSSSIGNAH-VAGTCSSLLP-----------PSYS----------
        S+P   +T  QCQ +LAML  Q+ LSS  +                    +  +S +SS  N H VA   S  +            P +S          
Subjt:  SNPLPNLTPEQCQSILAML--QSHLSSVKT--------------------TPESSSSSSIGNAH-VAGTCSSLLP-----------PSYS----------

Query:  ------WIVDSGASAHICFSKDLFSSFKSVSGFSVTLPNQARLVVDYVGDVRLSSGLLLQNVLFIPTFQFNLISVSAITAQSSVSLSFADGYCLIQDKSS
              WIVD+GA+ H+ +S   F+S  S     + LPN  +++  ++G V++S  L L NVL +P F FNLIS++ +T      + F+  +C IQD  S
Subjt:  ------WIVDSGASAHICFSKDLFSSFKSVSGFSVTLPNQARLVVDYVGDVRLSSGLLLQNVLFIPTFQFNLISVSAITAQSSVSLSFADGYCLIQDKSS

Query:  LKTIGKARFWQGLYLLDHHA--LDTVPLP-VYLNSTKHCSLVSNCNTDLWHDRLGHPSHKNLNALKPLLS--FKESPSHPCFICPLAKQRRLSFSAHNRL
         K IG A+   GLY+L   A   D++P P   L S K  S V+    ++WH RLGHPS   LN L  ++S     S S  C +C L+K RRL F     +
Subjt:  LKTIGKARFWQGLYLLDHHA--LDTVPLP-VYLNSTKHCSLVSNCNTDLWHDRLGHPSHKNLNALKPLLS--FKESPSHPCFICPLAKQRRLSFSAHNRL

Query:  VDKPFDLFHCDTWGPFRSLTHLGHRYFLTLVDDATRYTWIFFMKKKSDALTIVPSFFKLVDTQFQVAIKCFRSDNAPELSFTDFFRSSGVVHQFSCVECP
           PFDL HCD WGPF   T    +YFLT+VDD TR TWIF MK KS+   ++ SFF LV TQF  +IK  RSDN  E S T+F+   G +HQ SC+  P
Subjt:  VDKPFDLFHCDTWGPFRSLTHLGHRYFLTLVDDATRYTWIFFMKKKSDALTIVPSFFKLVDTQFQVAIKCFRSDNAPELSFTDFFRSSGVVHQFSCVECP

Query:  ----------------SKTLSWKGNISTS----SMLLVLYFFSQ--GCPYNFGTPYFRLHSKNVDYSQLRVFGSLCFASTLKAGRSKFSPRVIPAVFLGY
                        +++L ++ NI        +L   Y  ++      N  +PY  L      YS LRVFGSLC+A+TL   R KF+PR    + LGY
Subjt:  ----------------SKTLSWKGNISTS----SMLLVLYFFSQ--GCPYNFGTPYFRLHSKNVDYSQLRVFGSLCFASTLKAGRSKFSPRVIPAVFLGY

Query:  PQGMKAFKLYDIENKHVFISRDVVFHESIFPFHSVVAQKDTSVFLNDLVLPKSFNYA----------GTPDVSHGHELQLSVGERP----VLPNEDIVSN
        P G K ++L D++   VF+SRDV+FHE+IFPF            LN  + P S N +           + D +  H   L   E P      P+      
Subjt:  PQGMKAFKLYDIENKHVFISRDVVFHESIFPFHSVVAQKDTSVFLNDLVLPKSFNYA----------GTPDVSHGHELQLSVGERP----VLPNEDIVSN

Query:  DVFVQPSIDCSLGQSESQMLPQSLASQEPNVPLSDSLGCSLPSITRHRACPQ----RKSNRQIKQPSYLKDYHCSLLHANAFP-------PTQTKFPLNK
        D   +   D S  ++ +   P    S  P+ P         P + +    P     R+S R  K P+YL++YHCS   ++  P        + TKFPL++
Subjt:  DVFVQPSIDCSLGQSESQMLPQSLASQEPNVPLSDSLGCSLPSITRHRACPQ----RKSNRQIKQPSYLKDYHCSLLHANAFP-------PTQTKFPLNK

Query:  YLSYNSLSDSYRTFVLNISTTFEPQFYHQAASSSHWRDAMSVELKAMESNSTWSV------------------------------ARLIAKGYTQQEGLD
         LSY+ LS +Y++FVLN ST  EP  Y++A+ S HW +AM  E+ A+E+N TWS+                              ARL+AKGY QQEG D
Subjt:  YLSYNSLSDSYRTFVLNISTTFEPQFYHQAASSSHWRDAMSVELKAMESNSTWSV------------------------------ARLIAKGYTQQEGLD

Query:  YIETFSPVAKLVTIKVLLTVATSLGWPLIQLDVNNAFLHGDLFEEVYMDLPLGYVHGKVCPQGQRLVCQLHKSIYGLKQASRQWFDKFSCALLSFGFKQS
        Y ETFSPVAK VT++ LL VA   GW L QLDVNNAFLHG L EEVYM LP G+            VC+L KSIYGLKQASRQWF KFS  LL+ GF QS
Subjt:  YIETFSPVAKLVTIKVLLTVATSLGWPLIQLDVNNAFLHGDLFEEVYMDLPLGYVHGKVCPQGQRLVCQLHKSIYGLKQASRQWFDKFSCALLSFGFKQS

Query:  RSDYSLFTKGCGNTFVALLVYVDDI--------------------------------SSPEPLSSSKGIFLSQRKYALQLVEDEGLLASKPSALPMDPNS
        ++DYSLFTK  G+ F+ALLVYVDDI                                   E   SSKGI +SQRKYAL+++ED G+L  KP+  PMD N 
Subjt:  RSDYSLFTKGCGNTFVALLVYVDDI--------------------------------SSPEPLSSSKGIFLSQRKYALQLVEDEGLLASKPSALPMDPNS

Query:  KLSSDQASFSSSPSC----------------SVIASVHQRLAY------------------------AGDFLPASKSFQLKAFSDADWASCPDSRKSTTG
        KLS  +      PS                  ++ SVH+   +                         G    ++    +K FSD+DWA CPD+R+STTG
Subjt:  KLSSDQASFSSSPSC----------------SVIASVHQRLAY------------------------AGDFLPASKSFQLKAFSDADWASCPDSRKSTTG

Query:  YCVFLGDALVSWKAKKQSTVSRSSAEAEYRALAVTTAELVWLRQLLKELLIPSEVPSLLFCDNR
        YCVFLG +LVSW++KKQ+TVSRSSAEAEYRA+A T  E++W++ LL +L I     +LLF D++
Subjt:  YCVFLGDALVSWKAKKQSTVSRSSAEAEYRALAVTTAELVWLRQLLKELLIPSEVPSLLFCDNR

SwissProt top hitse value%identityAlignment
O64470 Spermidine hydroxycinnamoyl transferase3.0e-6538.56Show/hide
Query:  PRGRLELHCCTAGAHFIEAEAAAKLDDYDDFKSCDATVTNQLVPSVDYTVPLDKQPVFLVQVTRFACGGIVIGTALSHFVIDGIGASTFVNSWASITRGE
        PRGR EL+C   G  FIEAE+  KL D+ DF          L+P V+Y  P++  P+FL QVT+F CGGI +   +SH ++DG  A   ++ W  + RGE
Subjt:  PRGRLELHCCTAGAHFIEAEAAAKLDDYDDFKSCDATVTNQLVPSVDYTVPLDKQPVFLVQVTRFACGGIVIGTALSHFVIDGIGASTFVNSWASITRGE

Query:  STADSVVPKPYHERDVLRV-QPR----KPSRFHHQEYEKPPLLVGHSSPEEERNRETIVASLKLTTCQVEKLKNRANHYHLEPNSNSNP-KPYTRFEVVA
             +   P+ +R +L   +P      P +F H+E+++PP L+G +   EER ++TIV  L L+T Q++KL+++AN      + +S+P K +TR+E V 
Subjt:  STADSVVPKPYHERDVLRV-QPR----KPSRFHHQEYEKPPLLVGHSSPEEERNRETIVASLKLTTCQVEKLKNRANHYHLEPNSNSNP-KPYTRFEVVA

Query:  GHIWVCACKARKTCVTEQPTVALVTADIRRRTRPPLPEGFTGNATLLTVTRICKFGDLMNNPLNYAAEKIREGTWKLTDEYIESAIDFLAS-KDNEWIKE
        GH+W CACKAR     EQPT   +  D R R  PPLP G+ GNAT L V      G+L++N L +AA  I +    +T+EY+   I++L + KD +  ++
Subjt:  GHIWVCACKARKTCVTEQPTVALVTADIRRRTRPPLPEGFTGNATLLTVTRICKFGDLMNNPLNYAAEKIREGTWKLTDEYIESAIDFLAS-KDNEWIKE

Query:  LYEKGICSEGAPFWGNPNMEFGCWTSLSLYDTDFGWGRPRYVGLASLEDLEDGESLIMSGPEKDGCLIVLATVTVS
        L+  G  S   PF+GNPN+    W +L +Y  DFGWG+  Y G  + +   DG+SLI+    +DG +I+   + V+
Subjt:  LYEKGICSEGAPFWGNPNMEFGCWTSLSLYDTDFGWGRPRYVGLASLEDLEDGESLIMSGPEKDGCLIVLATVTVS

P04146 Copia protein5.1e-5725.07Show/hide
Query:  WIVDSGASAHICFSKDLFS-SFKSVSGFSVTLPNQARLV-VDYVGDVRLSSG--LLLQNVLFIPTFQFNLISVSAITAQSSVSLSFADGYCLIQDKSSLK
        +++DSGAS H+   + L++ S + V    + +  Q   +     G VRL +   + L++VLF      NL+SV  +  ++ +S+ F        DKS + 
Subjt:  WIVDSGASAHICFSKDLFS-SFKSVSGFSVTLPNQARLV-VDYVGDVRLSSG--LLLQNVLFIPTFQFNLISVSAITAQSSVSLSFADGYCLIQDKSSLK

Query:  TIGKARFWQGLYLLDHHA-LDTVPL---PVYLNSTKHCSLVSNCNTDLWHDRLGHPS--------HKNLNALKPLLSFKESPSHPCFICPLAKQRRLSFS
        TI K     GL ++ +   L+ VP+     Y  + KH +     N  LWH+R GH S         KN+ + + LL+  E     C  C   KQ RL F 
Subjt:  TIGKARFWQGLYLLDHHA-LDTVPL---PVYLNSTKHCSLVSNCNTDLWHDRLGHPS--------HKNLNALKPLLSFKESPSHPCFICPLAKQRRLSFS

Query:  --AHNRLVDKPFDLFHCDTWGPFRSLTHLGHRYFLTLVDDATRYTWIFFMKKKSDALTIVPSFFKLVDTQFQVAIKCFRSDNAPEL---SFTDFFRSSGV
               + +P  + H D  GP   +T     YF+  VD  T Y   + +K KSD  ++   F    +  F + +     DN  E        F    G+
Subjt:  --AHNRLVDKPFDLFHCDTWGPFRSLTHLGHRYFLTLVDDATRYTWIFFMKKKSDALTIVPSFFKLVDTQFQVAIKCFRSDNAPEL---SFTDFFRSSGV

Query:  VHQFSCVECPS---------KTLSWKGNISTSSMLLVLYFF---------------SQGCPYNFGTPYFRLHSKNVDYSQLRVFGSLCFASTLKAGRSKF
         +  +    P          +T++ K     S   L   F+               S+    +  TPY   H+K      LRVFG+  +   +K  + KF
Subjt:  VHQFSCVECPS---------KTLSWKGNISTSSMLLVLYFF---------------SQGCPYNFGTPYFRLHSKNVDYSQLRVFGSLCFASTLKAGRSKF

Query:  SPRVIPAVFLGY-PQGMKAFKLYDIENKHVFISRDVVFHESIFPFHSVVAQKDTSVFLNDLVLPKSFNYAGT---------PDVSHGHE-----LQLSVG
          +   ++F+GY P G   FKL+D  N+   ++RDVV  E+       V  K  +VFL D    ++ N+            P+ S   +           
Subjt:  SPRVIPAVFLGY-PQGMKAFKLYDIENKHVFISRDVVFHESIFPFHSVVAQKDTSVFLNDLVLPKSFNYAGT---------PDVSHGHE-----LQLSVG

Query:  ERPVLPNED--IVSNDVFVQPSIDC-------SLGQSESQMLPQSLASQEPNVPLSDSLGCSLPSITRHRACPQRKSNRQIKQPSYLKDYHCSLLHANAF
        E    PN+   I+  + F   S +C          +S    L +S   ++ +  L++S G   P+ +R     +      I  P+  K+    +++  + 
Subjt:  ERPVLPNED--IVSNDVFVQPSIDC-------SLGQSESQMLPQSLASQEPNVPLSDSLGCSLPSITRHRACPQRKSNRQIKQPSYLKDYHCSLLHANAF

Query:  PPTQTKFPLNKYLSYNSLSDSYRTFVLNISTTFE--PQFYHQAA---SSSHWRDAMSVELKAMESNSTWSV-----------------------------
             +      +SYN   +S    VLN  T F   P  + +       S W +A++ EL A + N+TW++                             
Subjt:  PPTQTKFPLNKYLSYNSLSDSYRTFVLNISTTFE--PQFYHQAA---SSSHWRDAMSVELKAMESNSTWSV-----------------------------

Query:  -ARLIAKGYTQQEGLDYIETFSPVAKLVTIKVLLTVATSLGWPLIQLDVNNAFLHGDLFEEVYMDLPLGYVHGKVCPQGQRLVCQLHKSIYGLKQASRQW
         ARL+A+G+TQ+  +DY ETF+PVA++ + + +L++       + Q+DV  AFL+G L EE+YM LP     G  C      VC+L+K+IYGLKQA+R W
Subjt:  -ARLIAKGYTQQEGLDYIETFSPVAKLVTIKVLLTVATSLGWPLIQLDVNNAFLHGDLFEEVYMDLPLGYVHGKVCPQGQRLVCQLHKSIYGLKQASRQW

Query:  FDKFSCALLSFGFKQSRSDYSLF--TKGCGNTFVALLVYVDDI----SSPEPLSSSK----------------------------GIFLSQRKYALQLVE
        F+ F  AL    F  S  D  ++   KG  N  + +L+YVDD+         +++ K                             I+LSQ  Y  +++ 
Subjt:  FDKFSCALLSFGFKQSRSDYSLF--TKGCGNTFVALLVYVDDI----SSPEPLSSSK----------------------------GIFLSQRKYALQLVE

Query:  DEGLLASKPSALPMDPN---SKLSSDQ-------------------------------ASFSSSPSCSVIASVHQRLAYAGDFLPASKSF--------QL
           +      + P+        L+SD+                               + +SS  +  +  ++ + L Y    +     F        ++
Subjt:  DEGLLASKPSALPMDPN---SKLSSDQ-------------------------------ASFSSSPSCSVIASVHQRLAYAGDFLPASKSF--------QL

Query:  KAFSDADWASCPDSRKSTTGYCVFLGD-ALVSWKAKKQSTVSRSSAEAEYRALAVTTAELVWLRQLLKELLIPSEVPSLLFCDNR
          + D+DWA     RKSTTGY   + D  L+ W  K+Q++V+ SS EAEY AL     E +WL+ LL  + I  E P  ++ DN+
Subjt:  KAFSDADWASCPDSRKSTTGYCVFLGD-ALVSWKAKKQSTVSRSSAEAEYRALAVTTAELVWLRQLLKELLIPSEVPSLLFCDNR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-7625.08Show/hide
Query:  NTQPNSAAKNTQQIKKKER-SHCTHCNVLGHTIGRCYKLHGYPPGYRNQKAPSTAKPESNPLPNLTPEQCQSILAMLQSHLSSVKTTPESSSSSSIGNAH
        N    S A+   + + K R  +C +CN  GH    C       P  R  K  ++ +   +           +  AM+Q++ + V    E           
Subjt:  NTQPNSAAKNTQQIKKKER-SHCTHCNVLGHTIGRCYKLHGYPPGYRNQKAPSTAKPESNPLPNLTPEQCQSILAMLQSHLSSVKTTPESSSSSSIGNAH

Query:  VAGTCSSLLPPSYSWIVDSGASAHICFSKDLFSSFKSVSGFSVTLPNQARLVVDYVGDVRLSSG----LLLQNVLFIPTFQFNLISVSAITAQSSVSLSF
            C  L  P   W+VD+ AS H    +DLF  + +    +V + N +   +  +GD+ + +     L+L++V  +P  + NLIS  A+          
Subjt:  VAGTCSSLLPPSYSWIVDSGASAHICFSKDLFSSFKSVSGFSVTLPNQARLVVDYVGDVRLSSG----LLLQNVLFIPTFQFNLISVSAITAQSSVSLSF

Query:  ADGYCLIQDKSSLKTIGKARFWQGLYLLDHHALDTVPLPVYLNSTKHCSLVSNC-----NTDLWHDRLGHPSHKNLNAL--KPLLSF-KESPSHPCFICP
         DGY       S     K R  +G  ++   A       +Y  + + C    N      + DLWH R+GH S K L  L  K L+S+ K +   PC  C 
Subjt:  ADGYCLIQDKSSLKTIGKARFWQGLYLLDHHALDTVPLPVYLNSTKHCSLVSNC-----NTDLWHDRLGHPSHKNLNAL--KPLLSF-KESPSHPCFICP

Query:  LAKQRRLSFSAHNRLVDKPFDLFHCDTWGPFRSLTHLGHRYFLTLVDDATRYTWIFFMKKKSDALTIVPSFFKLVDTQFQVAIKCFRSDNAPELS---FT
          KQ R+SF   +       DL + D  GP    +  G++YF+T +DDA+R  W++ +K K     +   F  LV+ +    +K  RSDN  E +   F 
Subjt:  LAKQRRLSFSAHNRLVDKPFDLFHCDTWGPFRSLTHLGHRYFLTLVDDATRYTWIFFMKKKSDALTIVPSFFKLVDTQFQVAIKCFRSDNAPELS---FT

Query:  DFFRSSGVVHQFSCVECP---------SKTL----------------SWKGNISTSSMLLVLYFFSQGCPYNFGTPYFRLHSKNVDYSQLRVFGSLCFAS
        ++  S G+ H+ +    P         ++T+                 W   + T+  L+     S   P  F  P     +K V YS L+VFG   FA 
Subjt:  DFFRSSGVVHQFSCVECP---------SKTL----------------SWKGNISTSSMLLVLYFFSQGCPYNFGTPYFRLHSKNVDYSQLRVFGSLCFAS

Query:  TLKAGRSKFSPRVIPAVFLGYPQGMKAFKLYDIENKHVFISRDVVFHESIFPFHSVVAQKDTSVFLNDLVLPKSFNYAGTPDVSHGHELQLSVGERPVLP
          K  R+K   + IP +F+GY      ++L+D   K V  SRDVVF ES      V    D S  + + ++P   N+   P  S+               
Subjt:  TLKAGRSKFSPRVIPAVFLGYPQGMKAFKLYDIENKHVFISRDVVFHESIFPFHSVVAQKDTSVFLNDLVLPKSFNYAGTPDVSHGHELQLSVGERPVLP

Query:  NEDIVSNDVFVQPSIDCSLGQSESQMLPQSLASQEPNVPLSDSLGCSLPSITRHRACPQRKSNRQIKQPSYLKDYHCSLLHANAFPPTQTKFPLNKYLSY
        + +  +++V  Q       G+   +++ Q     E                             +++ P+  ++ H  L  +        ++P  +Y+  
Subjt:  NEDIVSNDVFVQPSIDCSLGQSESQMLPQSLASQEPNVPLSDSLGCSLPSITRHRACPQRKSNRQIKQPSYLKDYHCSLLHANAFPPTQTKFPLNKYLSY

Query:  NSLSDSYRTFVLNISTTFEPQFYHQAAS---SSHWRDAMSVELKAMESNSTWSV------------------------------ARLIAKGYTQQEGLDY
                     IS   EP+   +  S    +    AM  E+++++ N T+ +                              ARL+ KG+ Q++G+D+
Subjt:  NSLSDSYRTFVLNISTTFEPQFYHQAAS---SSHWRDAMSVELKAMESNSTWSV------------------------------ARLIAKGYTQQEGLDY

Query:  IETFSPVAKLVTIKVLLTVATSLGWPLIQLDVNNAFLHGDLFEEVYMDLPLGY-VHGKVCPQGQRLVCQLHKSIYGLKQASRQWFDKFSCALLSFGFKQS
         E FSPV K+ +I+ +L++A SL   + QLDV  AFLHGDL EE+YM+ P G+ V GK     + +VC+L+KS+YGLKQA RQW+ KF   + S  + ++
Subjt:  IETFSPVAKLVTIKVLLTVATSLGWPLIQLDVNNAFLHGDLFEEVYMDLPLGY-VHGKVCPQGQRLVCQLHKSIYGLKQASRQWFDKFSCALLSFGFKQS

Query:  RSDYSLFTKGCG-NTFVALLVYVDD---ISSPEPL-------------------------------SSSKGIFLSQRKYALQLVEDEGLLASKPSALPMD
         SD  ++ K    N F+ LL+YVDD   +   + L                                +S+ ++LSQ KY  +++E   +  +KP + P+ 
Subjt:  RSDYSLFTKGCG-NTFVALLVYVDD---ISSPEPL-------------------------------SSSKGIFLSQRKYALQLVEDEGLLASKPSALPMD

Query:  PNSKLS--------SDQASFSSSPSCSVIAS-----------VHQRLAYAGDFL--PASKSFQ-------------------------LKAFSDADWASC
         + KLS         ++ + +  P  S + S           +   +     FL  P  + ++                         LK ++DAD A  
Subjt:  PNSKLS--------SDQASFSSSPSCSVIAS-----------VHQRLAYAGDFL--PASKSFQ-------------------------LKAFSDADWASC

Query:  PDSRKSTTGYCVFLGDALVSWKAKKQSTVSRSSAEAEYRALAVTTAELVWLRQLLKELLIPSEVPSLLFCDNRQSTLQPTQCSM
         D+RKS+TGY        +SW++K Q  V+ S+ EAEY A   T  E++WL++ L+EL +  +   +++CD+ QS +  ++ SM
Subjt:  PDSRKSTTGYCVFLGDALVSWKAKKQSTVSRSSAEAEYRALAVTTAELVWLRQLLKELLIPSEVPSLLFCDNRQSTLQPTQCSM

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.7e-9226.1Show/hide
Query:  EHVVAFLMGLNESFSQIRTQLLLMEPEPTIVKAFSLVAQEVEQ----CASTVVPIAAPSPTIDATALL-----------VKNTQPNSAAKNTQQI-----
        E V   L  L E +  +  Q+   +  PT+ +    +     +     ++TV+PI A + +   T                N   N+ +K  QQ      
Subjt:  EHVVAFLMGLNESFSQIRTQLLLMEPEPTIVKAFSLVAQEVEQ----CASTVVPIAAPSPTIDATALL-----------VKNTQPNSAAKNTQQI-----

Query:  -----KKKERSHCTHCNVLGHTIGRCYKLHGYPPGYRNQKAPSTAKPESNPLPNLTPEQCQSILAMLQSHLSSVKTTPESSSSSSIGNAHVAGTCSSLLP
              K     C  C V GH+  RC +L  +     +Q+ PS            TP Q ++ LA+          +P SS+                  
Subjt:  -----KKKERSHCTHCNVLGHTIGRCYKLHGYPPGYRNQKAPSTAKPESNPLPNLTPEQCQSILAMLQSHLSSVKTTPESSSSSSIGNAHVAGTCSSLLP

Query:  PSYSWIVDSGASAHICFSKDLFSSFKS-VSGFSVTLPNQARLVVDYVGDVRLSS---GLLLQNVLFIPTFQFNLISVSAITAQSSVSLSFADGYCLIQDK
           +W++DSGA+ HI    +  S  +    G  V + + + + + + G   LS+    L L N+L++P    NLISV  +   + VS+ F      ++D 
Subjt:  PSYSWIVDSGASAHICFSKDLFSSFKS-VSGFSVTLPNQARLVVDYVGDVRLSS---GLLLQNVLFIPTFQFNLISVSAITAQSSVSLSFADGYCLIQDK

Query:  SSLKTIGKARFWQGLYLLDHHALDTVPLPVYLNSTKHCSLVSNCNTDLWHDRLGHPSHKNLNALKPLLSFKE-SPSH---PCFICPLAKQRRLSFSAHNR
        ++   + + +    LY  +     + P+ ++ + +      S      WH RLGHP+   LN++    S    +PSH    C  C + K  ++ FS    
Subjt:  SSLKTIGKARFWQGLYLLDHHALDTVPLPVYLNSTKHCSLVSNCNTDLWHDRLGHPSHKNLNALKPLLSFKE-SPSH---PCFICPLAKQRRLSFSAHNR

Query:  LVDKPFDLFHCDTWGPFRSLTHLGHRYFLTLVDDATRYTWIFFMKKKSDALTIVPSFFKLVDTQFQVAIKCFRSDNAPE-LSFTDFFRSSGVVHQFSCVE
           +P +  + D W     L+H  +RY++  VD  TRYTW++ +K+KS       +F  L++ +FQ  I  F SDN  E ++  ++F   G+ H  S   
Subjt:  LVDKPFDLFHCDTWGPFRSLTHLGHRYFLTLVDDATRYTWIFFMKKKSDALTIVPSFFKLVDTQFQVAIKCFRSDNAPE-LSFTDFFRSSGVVHQFSCVE

Query:  CPSKT-LSWKGN---ISTSSMLLVLYFFSQG-CPYNFG-----------------TPYFRLHSKNVDYSQLRVFGSLCFASTLKAGRSKFSPRVIPAVFL
         P    LS + +   + T   LL      +   PY F                  +P+ +L   + +Y +LRVFG  C+       + K   +    VFL
Subjt:  CPSKT-LSWKGN---ISTSSMLLVLYFFSQG-CPYNFG-----------------TPYFRLHSKNVDYSQLRVFGSLCFASTLKAGRSKFSPRVIPAVFL

Query:  GYPQGMKAFKLYDIENKHVFISRDVVFHESIFPFHSVVA---------QKDTSVFLNDLVLPKSFNYAGTPDVSHGHELQLSVGERPVLPNEDIVSNDVF
        GY     A+    ++   ++ISR V F E+ FPF + +A         ++ + V+     LP        P  S  H         P  P+    ++ V 
Subjt:  GYPQGMKAFKLYDIENKHVFISRDVVFHESIFPFHSVVA---------QKDTSVFLNDLVLPKSFNYAGTPDVSHGHELQLSVGERPVLPNEDIVSNDVF

Query:  VQPSIDCSLGQSESQMLPQSLASQEPNVPLSDS-LGCSLPSITR---HRACPQRKSNRQIKQPSYLKDYHCSLLHANAFPPTQTKF--------------
               +L  S S   P   +S EP  P  +     + P+ T+   H +    ++N   + PS L     +   +++  P+ T                
Subjt:  VQPSIDCSLGQSESQMLPQSLASQEPNVPLSDS-LGCSLPSITR---HRACPQRKSNRQIKQPSYLKDYHCSLLHANAFPPTQTKF--------------

Query:  -----PLNKYLSYNSLS------------------DSYRTFVLNISTTFEPQFYHQAASSSHWRDAMSVELKAMESNSTWSV------------------
             PL + ++ N+ +                  +   +  ++++   EP+   QA     WR+AM  E+ A   N TW +                  
Subjt:  -----PLNKYLSYNSLS------------------DSYRTFVLNISTTFEPQFYHQAASSSHWRDAMSVELKAMESNSTWSV------------------

Query:  -------------ARLIAKGYTQQEGLDYIETFSPVAKLVTIKVLLTVATSLGWPLIQLDVNNAFLHGDLFEEVYMDLPLGYVHGKVCPQGQRLVCQLHK
                     ARL+AKGY Q+ GLDY ETFSPV K  +I+++L VA    WP+ QLDVNNAFL G L ++VYM  P G++           VC+L K
Subjt:  -------------ARLIAKGYTQQEGLDYIETFSPVAKLVTIKVLLTVATSLGWPLIQLDVNNAFLHGDLFEEVYMDLPLGYVHGKVCPQGQRLVCQLHK

Query:  SIYGLKQASRQWFDKFSCALLSFGFKQSRSDYSLFTKGCGNTFVALLVYVDDI----SSPEPLSSS----------------------------KGIFLS
        ++YGLKQA R W+ +    LL+ GF  S SD SLF    G + V +LVYVDDI    + P  L ++                             G+ LS
Subjt:  SIYGLKQASRQWFDKFSCALLSFGFKQSRSDYSLFTKGCGNTFVALLVYVDDI----SSPEPLSSS----------------------------KGIFLS

Query:  QRKYALQLVEDEGLLASKPSALPMDPNSKLSSDQASFSSSP--------SCSVIASVHQRLAYA--------------------------------GDFL
        QR+Y L L+    ++ +KP   PM P+ KLS    +  + P        S   +A     ++YA                                G FL
Subjt:  QRKYALQLVEDEGLLASKPSALPMDPNSKLSSDQASFSSSP--------SCSVIASVHQRLAYA--------------------------------GDFL

Query:  PASKSFQLKAFSDADWASCPDSRKSTTGYCVFLGDALVSWKAKKQSTVSRSSAEAEYRALAVTTAELVWLRQLLKELLIPSEVPSLLFCDNRQST
            +  L A+SDADWA   D   ST GY V+LG   +SW +KKQ  V RSS EAEYR++A T++E+ W+  LL EL I    P +++CDN  +T
Subjt:  PASKSFQLKAFSDADWASCPDSRKSTTGYCVFLGDALVSWKAKKQSTVSRSSAEAEYRALAVTTAELVWLRQLLKELLIPSEVPSLLFCDNRQST

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.5e-9027.72Show/hide
Query:  QPNSA-AKNTQQIKKKERSHCTHCNVLGHTIGRCYKLHGYPPGYRNQKAPSTAKPESNPLPNLTPEQCQSILAMLQSHLSSVKTTPESSSSSSIGNAHVA
        QP+S+ +++  +  K     C  C+V GH+  RC +LH +          ST   + +  P  TP Q ++ LA+                 +S  NA+  
Subjt:  QPNSA-AKNTQQIKKKERSHCTHCNVLGHTIGRCYKLHGYPPGYRNQKAPSTAKPESNPLPNLTPEQCQSILAMLQSHLSSVKTTPESSSSSSIGNAHVA

Query:  GTCSSLLPPSYSWIVDSGASAHICFSKDLFSSFKS-VSGFSVTLPNQARLVVDYVGDVRL---SSGLLLQNVLFIPTFQFNLISVSAITAQSSVSLSFAD
                   +W++DSGA+ HI    +  S  +    G  V + + + + + + G   L   S  L L  VL++P    NLISV  +   + VS+ F  
Subjt:  GTCSSLLPPSYSWIVDSGASAHICFSKDLFSSFKS-VSGFSVTLPNQARLVVDYVGDVRL---SSGLLLQNVLFIPTFQFNLISVSAITAQSSVSLSFAD

Query:  GYCLIQDKSSLKTIGKARFWQGLYLLDHHALDTV---PLPVYLNSTKHCSLVSNCNTDLWHDRLGHPSHKNLNALKPLLSFKE-SPSH---PCFICPLAK
            ++D ++           G+ LL     D +   P+      +   S  S      WH RLGHPS   LN++    S    +PSH    C  C + K
Subjt:  GYCLIQDKSSLKTIGKARFWQGLYLLDHHALDTV---PLPVYLNSTKHCSLVSNCNTDLWHDRLGHPSHKNLNALKPLLSFKE-SPSH---PCFICPLAK

Query:  QRRLSFSAHNRLVDKPFDLFHCDTW-GPFRSLTHLGHRYFLTLVDDATRYTWIFFMKKKSDALTIVPSFFKLVDTQFQVAIKCFRSDNAPE-LSFTDFFR
          ++ FS       KP +  + D W  P  S+ +  +RY++  VD  TRYTW++ +K+KS        F  LV+ +FQ  I    SDN  E +   D+  
Subjt:  QRRLSFSAHNRLVDKPFDLFHCDTW-GPFRSLTHLGHRYFLTLVDDATRYTWIFFMKKKSDALTIVPSFFKLVDTQFQVAIKCFRSDNAPE-LSFTDFFR

Query:  SSGVVHQFSCVECPSKT-LSWKGNISTSSMLLVLYFFSQ----GCPYNFG-----------------TPYFRLHSKNVDYSQLRVFGSLCFASTLKAGRS
          G+ H  S    P    LS + +     M L L   +       PY F                  +P+ +L  +  +Y +L+VFG  C+       R 
Subjt:  SSGVVHQFSCVECPSKT-LSWKGNISTSSMLLVLYFFSQ----GCPYNFG-----------------TPYFRLHSKNVDYSQLRVFGSLCFASTLKAGRS

Query:  KFSPRVIPAVFLGYPQGMKAFKLYDIENKHVFISRDVVFHESIFPFHSVVAQKDTSVFLNDLVLPKSFNYAGTPDVSHGHELQLSVGERPVLPNEDIVSN
        K   +     F+GY     A+    I    ++ SR V F E  FPF +      TS        P   ++   P         L +   P L      S 
Subjt:  KFSPRVIPAVFLGYPQGMKAFKLYDIENKHVFISRDVVFHESIFPFHSVVAQKDTSVFLNDLVLPKSFNYAGTPDVSHGHELQLSVGERPVLPNEDIVSN

Query:  DVFVQPSIDCSLGQSESQMLPQSLASQEPNVPLSDSLGCSLPSITRHRA-------------CPQRKSNRQIKQPSYLKDYHCSLLH-------------
             PS  C+   S S +   S++S   + P + S     P+   H+               P   S     Q S L     S  H             
Subjt:  DVFVQPSIDCSLGQSESQMLPQSLASQEPNVPLSDSLGCSLPSITRHRA-------------CPQRKSNRQIKQPSYLKDYHCSLLH-------------

Query:  ----ANAFPP------------TQTKFPLNKYLSYNSLSDSYR------TFVLNISTTFEPQFYHQAASSSHWRDAMSVELKAMESNSTWSV--------
            + + PP               + P+N +       D  R      ++  +++   EP+   QA     WR AM  E+ A   N TW +        
Subjt:  ----ANAFPP------------TQTKFPLNKYLSYNSLSDSYR------TFVLNISTTFEPQFYHQAASSSHWRDAMSVELKAMESNSTWSV--------

Query:  -----------------------ARLIAKGYTQQEGLDYIETFSPVAKLVTIKVLLTVATSLGWPLIQLDVNNAFLHGDLFEEVYMDLPLGYVHGKVCPQ
                               ARL+AKGY Q+ GLDY ETFSPV K  +I+++L VA    WP+ QLDVNNAFL G L +EVYM  P G+V  K  P 
Subjt:  -----------------------ARLIAKGYTQQEGLDYIETFSPVAKLVTIKVLLTVATSLGWPLIQLDVNNAFLHGDLFEEVYMDLPLGYVHGKVCPQ

Query:  GQRLVCQLHKSIYGLKQASRQWFDKFSCALLSFGFKQSRSDYSLFTKGCGNTFVALLVYVDDI-----------SSPEPLSS------------------
            VC+L K+IYGLKQA R W+ +    LL+ GF  S SD SLF    G + + +LVYVDDI            + + LS                   
Subjt:  GQRLVCQLHKSIYGLKQASRQWFDKFSCALLSFGFKQSRSDYSLFTKGCGNTFVALLVYVDDI-----------SSPEPLSS------------------

Query:  ---SKGIFLSQRKYALQLVEDEGLLASKPSALPMDPNSKLSSDQASFSSSP--------SCSVIASVHQRLAYA--------------------------
            +G+ LSQR+Y L L+    +L +KP A PM  + KL+    +    P        S   +A     L+YA                          
Subjt:  ---SKGIFLSQRKYALQLVEDEGLLASKPSALPMDPNSKLSSDQASFSSSP--------SCSVIASVHQRLAYA--------------------------

Query:  ------GDFLPASKSFQLKAFSDADWASCPDSRKSTTGYCVFLGDALVSWKAKKQSTVSRSSAEAEYRALAVTTAELVWLRQLLKELLIPSEVPSLLFCD
              G FL    +  L A+SDADWA   D   ST GY V+LG   +SW +KKQ  V RSS EAEYR++A T++EL W+  LL EL I    P +++CD
Subjt:  ------GDFLPASKSFQLKAFSDADWASCPDSRKSTTGYCVFLGDALVSWKAKKQSTVSRSSAEAEYRALAVTTAELVWLRQLLKELLIPSEVPSLLFCD

Query:  NRQST
        N  +T
Subjt:  NRQST

Arabidopsis top hitse value%identityAlignment
AT2G19070.1 spermidine hydroxycinnamoyl transferase2.1e-6638.56Show/hide
Query:  PRGRLELHCCTAGAHFIEAEAAAKLDDYDDFKSCDATVTNQLVPSVDYTVPLDKQPVFLVQVTRFACGGIVIGTALSHFVIDGIGASTFVNSWASITRGE
        PRGR EL+C   G  FIEAE+  KL D+ DF          L+P V+Y  P++  P+FL QVT+F CGGI +   +SH ++DG  A   ++ W  + RGE
Subjt:  PRGRLELHCCTAGAHFIEAEAAAKLDDYDDFKSCDATVTNQLVPSVDYTVPLDKQPVFLVQVTRFACGGIVIGTALSHFVIDGIGASTFVNSWASITRGE

Query:  STADSVVPKPYHERDVLRV-QPR----KPSRFHHQEYEKPPLLVGHSSPEEERNRETIVASLKLTTCQVEKLKNRANHYHLEPNSNSNP-KPYTRFEVVA
             +   P+ +R +L   +P      P +F H+E+++PP L+G +   EER ++TIV  L L+T Q++KL+++AN      + +S+P K +TR+E V 
Subjt:  STADSVVPKPYHERDVLRV-QPR----KPSRFHHQEYEKPPLLVGHSSPEEERNRETIVASLKLTTCQVEKLKNRANHYHLEPNSNSNP-KPYTRFEVVA

Query:  GHIWVCACKARKTCVTEQPTVALVTADIRRRTRPPLPEGFTGNATLLTVTRICKFGDLMNNPLNYAAEKIREGTWKLTDEYIESAIDFLAS-KDNEWIKE
        GH+W CACKAR     EQPT   +  D R R  PPLP G+ GNAT L V      G+L++N L +AA  I +    +T+EY+   I++L + KD +  ++
Subjt:  GHIWVCACKARKTCVTEQPTVALVTADIRRRTRPPLPEGFTGNATLLTVTRICKFGDLMNNPLNYAAEKIREGTWKLTDEYIESAIDFLAS-KDNEWIKE

Query:  LYEKGICSEGAPFWGNPNMEFGCWTSLSLYDTDFGWGRPRYVGLASLEDLEDGESLIMSGPEKDGCLIVLATVTVS
        L+  G  S   PF+GNPN+    W +L +Y  DFGWG+  Y G  + +   DG+SLI+    +DG +I+   + V+
Subjt:  LYEKGICSEGAPFWGNPNMEFGCWTSLSLYDTDFGWGRPRYVGLASLEDLEDGESLIMSGPEKDGCLIVLATVTVS

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 84.5e-8538.96Show/hide
Query:  SNRQIKQPSYLKDYHCSLLHANAFPPTQTKFPLNKYLSYNSLSDSYRTFVLNISTTFEPQFYHQAASSSHWRDAMSVELKAMESNSTWSV----------
        S+R+ ++P+YL+DY+C   H+ A   + T   ++++LSY  +S  Y +F++ I+   EP  Y++A     W  AM  E+ AME+  TW +          
Subjt:  SNRQIKQPSYLKDYHCSLLHANAFPPTQTKFPLNKYLSYNSLSDSYRTFVLNISTTFEPQFYHQAASSSHWRDAMSVELKAMESNSTWSV----------

Query:  --------------------ARLIAKGYTQQEGLDYIETFSPVAKLVTIKVLLTVATSLGWPLIQLDVNNAFLHGDLFEEVYMDLPLGYVHGKVCPQGQR
                            ARL+AKGYTQQEG+D+IETFSPV KL ++K++L ++    + L QLD++NAFL+GDL EE+YM LP GY   +       
Subjt:  --------------------ARLIAKGYTQQEGLDYIETFSPVAKLVTIKVLLTVATSLGWPLIQLDVNNAFLHGDLFEEVYMDLPLGYVHGKVCPQGQR

Query:  LVCQLHKSIYGLKQASRQWFDKFSCALLSFGFKQSRSDYSLFTKGCGNTFVALLVYVDDI--------------------------------SSPEPLSS
         VC L KSIYGLKQASRQWF KFS  L+ FGF QS SD++ F K     F+ +LVYVDDI                                   E   S
Subjt:  LVCQLHKSIYGLKQASRQWFDKFSCALLSFGFKQSRSDYSLFTKGCGNTFVALLVYVDDI--------------------------------SSPEPLSS

Query:  SKGIFLSQRKYALQLVEDEGLLASKPSALPMDPNSKLSSDQ----------------------------------ASFSSSPSCSVIASVHQRLAY----
        + GI + QRKYAL L+++ GLL  KPS++PMDP+   S+                                    + FS +P  +   +V + L Y    
Subjt:  SKGIFLSQRKYALQLVEDEGLLASKPSALPMDPNSKLSSDQ----------------------------------ASFSSSPSCSVIASVHQRLAY----

Query:  --AGDFLPASKSFQLKAFSDADWASCPDSRKSTTGYCVFLGDALVSWKAKKQSTVSRSSAEAEYRALAVTTAELVWLRQLLKELLIPSEVPSLLFCDN
           G F  +    QL+ FSDA + SC D+R+ST GYC+FLG +L+SWK+KKQ  VS+SSAEAEYRAL+  T E++WL Q  +EL +P   P+LLFCDN
Subjt:  --AGDFLPASKSFQLKAFSDADWASCPDSRKSTTGYCVFLGDALVSWKAKKQSTVSRSSAEAEYRALAVTTAELVWLRQLLKELLIPSEVPSLLFCDN

AT5G41040.1 HXXXD-type acyl-transferase family protein3.9e-3630.26Show/hide
Query:  LDPRGRLELHCCTAGAHFIEAEAAAKLDDYDDFKSCDATVTNQLVPSVDYTVPLDKQPVFLVQVTRFACGGIVIGTALSHFVIDGIGASTFVNSWASITR
        + P G+L + C   G  F+EAEA  K+D+  D    D     +LV  V     + + P    QVT+F CGG V+G  ++H + DGIGA  FVNSW  + R
Subjt:  LDPRGRLELHCCTAGAHFIEAEAAAKLDDYDDFKSCDATVTNQLVPSVDYTVPLDKQPVFLVQVTRFACGGIVIGTALSHFVIDGIGASTFVNSWASITR

Query:  GESTADSVVPKPYHERDVLRVQPRKPSRFHHQEYEKPPLLVGHSSPEEERNRE-TIVASLKLTTCQVEKLKNRANHYHLEPNSNSNPKPYTRFEVVAGHI
        G      +   P+ +R +L  +        HQE+E+   +   S+      +E T+  S      +++KLK +A         NS     T FE ++  +
Subjt:  GESTADSVVPKPYHERDVLRVQPRKPSRFHHQEYEKPPLLVGHSSPEEERNRE-TIVASLKLTTCQVEKLKNRANHYHLEPNSNSNPKPYTRFEVVAGHI

Query:  WVCACKARKTCVTEQPTVALVTADIRRRTRPPLPEGFTGNATLLTVTRICKFGDLMNNPLNYAAEKIREGTWKLTDEYIESAIDFLASKDNEWIKELYEK
        W    K+ K  +++Q T  L   D R +  P LP+G+ GN  +LT   IC+ G+L+  PL++A   +RE    +TD Y+ SAID+               
Subjt:  WVCACKARKTCVTEQPTVALVTADIRRRTRPPLPEGFTGNATLLTVTRICKFGDLMNNPLNYAAEKIREGTWKLTDEYIESAIDFLASKDNEWIKELYEK

Query:  GICSEGAPFWGNPNMEFGCWTSLSLYDTDFGWGRPRYVGLASLEDLE
           +   P   +  +    W+ L  + TDFGWG P   G  +L + E
Subjt:  GICSEGAPFWGNPNMEFGCWTSLSLYDTDFGWGRPRYVGLASLEDLE

AT5G41040.2 HXXXD-type acyl-transferase family protein3.9e-3630.26Show/hide
Query:  LDPRGRLELHCCTAGAHFIEAEAAAKLDDYDDFKSCDATVTNQLVPSVDYTVPLDKQPVFLVQVTRFACGGIVIGTALSHFVIDGIGASTFVNSWASITR
        + P G+L + C   G  F+EAEA  K+D+  D    D     +LV  V     + + P    QVT+F CGG V+G  ++H + DGIGA  FVNSW  + R
Subjt:  LDPRGRLELHCCTAGAHFIEAEAAAKLDDYDDFKSCDATVTNQLVPSVDYTVPLDKQPVFLVQVTRFACGGIVIGTALSHFVIDGIGASTFVNSWASITR

Query:  GESTADSVVPKPYHERDVLRVQPRKPSRFHHQEYEKPPLLVGHSSPEEERNRE-TIVASLKLTTCQVEKLKNRANHYHLEPNSNSNPKPYTRFEVVAGHI
        G      +   P+ +R +L  +        HQE+E+   +   S+      +E T+  S      +++KLK +A         NS     T FE ++  +
Subjt:  GESTADSVVPKPYHERDVLRVQPRKPSRFHHQEYEKPPLLVGHSSPEEERNRE-TIVASLKLTTCQVEKLKNRANHYHLEPNSNSNPKPYTRFEVVAGHI

Query:  WVCACKARKTCVTEQPTVALVTADIRRRTRPPLPEGFTGNATLLTVTRICKFGDLMNNPLNYAAEKIREGTWKLTDEYIESAIDFLASKDNEWIKELYEK
        W    K+ K  +++Q T  L   D R +  P LP+G+ GN  +LT   IC+ G+L+  PL++A   +RE    +TD Y+ SAID+               
Subjt:  WVCACKARKTCVTEQPTVALVTADIRRRTRPPLPEGFTGNATLLTVTRICKFGDLMNNPLNYAAEKIREGTWKLTDEYIESAIDFLASKDNEWIKELYEK

Query:  GICSEGAPFWGNPNMEFGCWTSLSLYDTDFGWGRPRYVGLASLEDLE
           +   P   +  +    W+ L  + TDFGWG P   G  +L + E
Subjt:  GICSEGAPFWGNPNMEFGCWTSLSLYDTDFGWGRPRYVGLASLEDLE

AT5G48930.1 hydroxycinnamoyl-CoA shikimate/quinate hydroxycinnamoyl transferase2.3e-4932.23Show/hide
Query:  DPRGRLELHCCTAGAHFIEAEAAAKLDDYDDFKSCDATVTNQLVPSVDYTVPLDKQPVFLVQVTRFACGGIVIGTALSHFVIDGIGASTFVNSWASITRG
        D  GR+E+ C  AG  F+ A+  + +DD+ DF         QL+P VD++  +   P+ ++QVT F CGG  +G  + H   DG     F+N+W+ + RG
Subjt:  DPRGRLELHCCTAGAHFIEAEAAAKLDDYDDFKSCDATVTNQLVPSVDYTVPLDKQPVFLVQVTRFACGGIVIGTALSHFVIDGIGASTFVNSWASITRG

Query:  ESTADSVVPKPYHERDVLRVQPRKPSRFHHQEYEKPPLLVGHSSPEEERNRETIVASLKLTTCQVEKLKNRANHYHLEPNSNSNPKPYTRFEVVAGHIWV
            D  +P P+ +R +LR +      FHH EY+  P +     P +     T V+  KLT  Q+  LK ++         + N   Y+ +E++AGH+W 
Subjt:  ESTADSVVPKPYHERDVLRVQPRKPSRFHHQEYEKPPLLVGHSSPEEERNRETIVASLKLTTCQVEKLKNRANHYHLEPNSNSNPKPYTRFEVVAGHIWV

Query:  CACKARKTCVTEQPTVALVTADIRRRTRPPLPEGFTGNATLLTVTRICKFGDLMNNPLNYAAEKIREGTWKLTDEYIESAIDFLASKDNEWIKELYEKGI
           KAR     +Q T   +  D R R RP LP G+ GN  + T T +   GDL++ P  YAA +I +   ++ D Y+ SA+D+L  +         +   
Subjt:  CACKARKTCVTEQPTVALVTADIRRRTRPPLPEGFTGNATLLTVTRICKFGDLMNNPLNYAAEKIREGTWKLTDEYIESAIDFLASKDNEWIKELYEKGI

Query:  CSEGAPFWGNPNMEFGCWTSLSLYDTDFGWGRPRYVGLASLEDLEDGESLIMSGPEKDGCLIV
           GA  +  PN+    W  L +YD DFGWGRP ++G   +    +G S ++  P  DG L V
Subjt:  CSEGAPFWGNPNMEFGCWTSLSLYDTDFGWGRPRYVGLASLEDLEDGESLIMSGPEKDGCLIV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAACCTCTAAGTCTAAGCATTCACTCTTCCTCTACGATCCTTCCAAGCAACCCAACTCCTCATCGCACACTTCCCCTTTCTGAATTCCTTGAGAACCTCACTCAGCC
AAATCTTGGTTCCCTACTACCCTCTCGTAGGCCGCCTCCACTGGATCCCCGCGGCCGCCTCGAGCTTCATTGCTGCACCGCGGGAGCCCATTTCATCGAGGCCGAAGCCG
CGGCCAAGCTCGATGATTACGACGACTTCAAATCATGCGATGCCACCGTAACGAACCAGCTTGTTCCCTCGGTTGATTACACCGTCCCCCTCGACAAGCAGCCGGTATTC
CTCGTTCAAGTGACGAGGTTCGCCTGCGGTGGCATCGTGATCGGCACTGCACTCTCTCACTTTGTGATCGATGGGATTGGCGCCAGCACATTCGTCAACTCGTGGGCGAG
CATTACGCGTGGGGAGAGCACGGCGGATAGCGTTGTGCCGAAGCCGTATCATGAGAGGGATGTTCTTCGGGTTCAGCCGCGAAAGCCGTCGAGATTTCATCATCAAGAGT
ATGAGAAACCTCCCCTTTTGGTTGGGCATTCCAGTCCTGAGGAAGAGCGCAACAGGGAAACCATTGTTGCGTCGCTTAAACTCACAACGTGCCAAGTTGAAAAGCTTAAA
AACAGAGCCAATCATTATCATTTAGAACCAAACTCAAACTCAAATCCAAAGCCATATACCAGATTTGAGGTGGTGGCTGGGCACATATGGGTATGTGCATGCAAGGCTCG
CAAAACATGCGTAACCGAGCAACCAACAGTGGCCCTCGTCACAGCCGACATACGACGACGAACAAGGCCACCTCTCCCAGAAGGATTCACCGGAAATGCAACATTGTTGA
CGGTGACACGGATATGCAAGTTTGGAGATTTGATGAATAATCCTCTAAACTATGCAGCCGAGAAGATAAGAGAAGGAACATGGAAGCTTACGGACGAATACATCGAGTCG
GCGATCGATTTCTTAGCAAGCAAAGACAATGAATGGATAAAGGAGTTGTATGAGAAAGGAATTTGCAGTGAAGGAGCTCCCTTTTGGGGGAATCCAAACATGGAGTTCGG
GTGTTGGACGTCGCTATCGCTTTACGATACGGATTTTGGGTGGGGGAGGCCTCGTTATGTTGGGTTGGCTTCGTTGGAGGATCTCGAGGATGGAGAGTCGTTGATCATGT
CAGGTCCAGAGAAGGATGGATGTCTCATTGTGCTAGCTACCGTCACCGTCAGTCTAGAGGAGAAACCAGGTAGCAACATCAGTCTCCAGCTACAGTTGCAATCCAGTCTT
GTCGGTGTTTCCTCGTCAGTCAGAAGCTTCCTTTCCTCGATTCAGTTCAAGATATTAGGAGGTTTAAGAAATATTAGAATATTAGAAGTTGGATGTACTGATTCTGTTAA
ACCTACCAATCCTACTGTCATCGATCAATATGTTAATCCCTACTACCTTCATCATTCGGATGGGACCAATTTGGTTCTTGTCTCTGAATTGCTTACGGAATCCAATTATT
CCTCCTGGTACCAGGCGATGCTCATCGGACTCACGGTGAAGAACAAGTTGCGACGCGAGATTTCAAACCTGATGCAAGAACAGTTGTCTGTCACTGCATACTTTGCAAAA
TTAAAGGCTCTGTGGAATGAACTTGTTTCTTACAGACCTTCTTGCACTTGCGACCGTTGTTCTTGTGGAGGAGTTCAAGGGTTAGTTCAATATTTCCAAACTGAACACGT
CGTGGCGTTTCTGATGGGGCTAAATGAGTCTTTTAGTCAAATTCGTACTCAGTTGTTACTGATGGAGCCAGAACCGACGATCGTTAAAGCTTTCTCCTTAGTTGCTCAGG
AAGTCGAACAATGTGCTTCTACTGTTGTTCCTATCGCTGCTCCTTCTCCTACGATTGATGCTACTGCACTTCTTGTCAAGAACACTCAGCCGAATTCAGCCGCCAAAAAT
ACCCAACAGATCAAGAAGAAGGAACGCTCTCACTGTACACATTGCAACGTTCTTGGGCACACAATTGGCCGATGTTATAAGCTCCACGGATATCCCCCTGGATATCGCAA
TCAGAAGGCTCCCTCAACTGCTAAACCAGAATCGAATCCTCTGCCAAATCTCACTCCTGAGCAATGTCAGAGTATTCTTGCCATGCTTCAGTCTCATTTGAGCTCAGTTA
AAACAACACCGGAGTCTTCTTCATCCTCATCCATTGGAAATGCTCATGTGGCAGGTACATGTTCTTCACTATTACCTCCGTCATACAGTTGGATTGTTGATTCTGGTGCA
TCTGCTCATATTTGTTTTTCGAAGGATTTGTTTTCCTCGTTTAAGTCAGTTTCGGGGTTCTCTGTCACTTTGCCTAATCAGGCTCGACTTGTTGTTGATTATGTTGGTGA
TGTGAGGTTATCTTCTGGTCTTCTGCTTCAGAATGTCTTGTTTATTCCGACATTTCAGTTCAATCTCATATCTGTCAGTGCAATAACTGCTCAATCCTCTGTTTCGTTGT
CCTTTGCTGATGGTTATTGTCTAATTCAGGACAAGTCTTCTTTGAAGACGATTGGGAAGGCTAGGTTTTGGCAAGGGCTCTATTTGTTGGATCATCATGCTCTAGACACT
GTTCCTTTGCCTGTTTACTTGAATTCTACTAAGCATTGTAGTCTTGTTTCGAATTGTAATACAGACTTATGGCATGACCGTCTTGGTCATCCTTCTCACAAGAATTTAAA
TGCATTGAAACCTTTGTTGTCGTTTAAAGAGAGCCCATCCCATCCTTGTTTCATTTGTCCCTTAGCCAAGCAGCGTAGACTTTCATTTTCTGCCCATAATCGTCTTGTAG
ATAAACCTTTTGATCTATTCCATTGTGATACATGGGGACCTTTTCGTTCACTTACACACTTAGGCCATCGGTATTTTCTTACTTTGGTCGATGATGCCACTCGCTATACA
TGGATTTTTTTTATGAAGAAGAAGTCTGACGCTCTTACTATAGTGCCAAGTTTTTTTAAGCTTGTTGATACACAGTTCCAAGTTGCAATCAAGTGTTTCCGCTCCGATAA
TGCTCCAGAACTCTCTTTCACTGATTTTTTTCGATCAAGCGGTGTTGTCCATCAGTTTTCATGTGTCGAGTGCCCGAGCAAAACTCTGTCGTGGAAAGGGAACATCAGCA
CCTCCTCAATGTTGCTCGTGCTCTATTTTTTCAGTCAAGGGTGCCCATACAATTTTGGGACTCCTTATTTTCGGTTGCATAGCAAGAATGTTGATTACTCCCAGCTTCGA
GTGTTTGGCTCGTTGTGTTTTGCCTCTACTCTAAAAGCTGGTAGATCCAAATTCTCTCCAAGGGTGATACCGGCTGTTTTCCTTGGCTATCCTCAAGGCATGAAGGCTTT
CAAGCTTTATGACATAGAGAACAAACATGTCTTCATCTCTCGTGATGTTGTGTTCCACGAGTCCATTTTTCCTTTTCATTCAGTTGTTGCTCAGAAGGATACCTCTGTTT
TTTTAAATGACTTAGTCTTACCCAAGTCATTCAACTATGCTGGCACTCCTGATGTTTCTCACGGTCACGAGCTTCAACTATCTGTTGGTGAACGACCTGTTCTGCCAAAT
GAAGACATTGTCTCTAATGATGTTTTTGTGCAACCTTCCATTGACTGTTCTTTGGGTCAATCTGAGTCTCAAATGCTGCCACAATCCCTTGCCTCTCAAGAGCCAAATGT
GCCTTTGTCTGATTCTCTTGGTTGTTCTTTGCCTAGTATCACTCGACATCGTGCTTGTCCTCAACGAAAGTCGAATAGGCAAATAAAGCAGCCTTCATACTTGAAGGACT
ACCATTGCAGCCTTCTCCACGCCAATGCTTTTCCGCCTACTCAGACAAAGTTTCCTTTGAACAAGTATCTGTCGTATAATAGTCTTTCTGATTCATATAGGACATTTGTT
CTCAATATATCTACAACCTTTGAGCCCCAGTTCTACCATCAGGCCGCGTCTTCTAGTCATTGGAGGGATGCCATGTCTGTAGAACTCAAAGCCATGGAGTCCAATTCTAC
TTGGTCTGTGGCTAGATTGATTGCCAAAGGATACACCCAACAAGAGGGACTTGATTACATTGAGACGTTCTCTCCCGTTGCCAAACTTGTCACCATTAAAGTGTTGTTAA
CTGTTGCTACTTCTCTTGGCTGGCCGCTCATACAACTCGATGTCAACAATGCTTTTTTGCATGGAGACCTTTTTGAAGAGGTCTACATGGACTTACCCTTGGGATATGTA
CATGGTAAGGTTTGTCCTCAAGGTCAACGTTTGGTATGTCAGCTTCACAAGTCCATCTATGGACTCAAACAAGCTTCAAGGCAATGGTTCGATAAGTTCTCATGTGCCCT
GTTGTCTTTTGGTTTTAAGCAGTCAAGATCAGACTATTCCCTCTTTACTAAAGGTTGTGGAAATACTTTTGTGGCCCTTCTCGTCTATGTTGACGATATATCATCACCAG
AGCCTTTGTCGTCCTCCAAAGGCATATTTCTTTCTCAAAGAAAGTATGCATTACAGCTTGTGGAAGATGAGGGTCTTCTTGCATCCAAACCTTCTGCTCTTCCTATGGAT
CCTAACTCAAAGTTGTCTTCTGACCAAGCCTCATTCTCCTCATCTCCAAGCTGCTCAGTCATTGCTTCGGTACATCAAAGGCTCGCCTACGCAGGGGATTTTCTTCCAGC
TTCGAAGTCATTTCAACTCAAGGCTTTCTCAGATGCTGATTGGGCCTCCTGCCCTGATTCTCGAAAGTCTACAACTGGATATTGTGTTTTTCTAGGTGATGCTTTAGTTT
CATGGAAAGCAAAGAAGCAGTCCACCGTTAGCCGTTCATCTGCAGAGGCCGAATACAGGGCCTTGGCTGTCACAACTGCTGAATTAGTCTGGTTGCGTCAGCTCTTGAAG
GAGCTCCTTATTCCTTCTGAGGTTCCTTCTCTGTTGTTCTGTGATAATCGACAATCCACATTGCAACCAACCCAATGTTCCATGAAAGGACCAAACACATTGAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAACCTCTAAGTCTAAGCATTCACTCTTCCTCTACGATCCTTCCAAGCAACCCAACTCCTCATCGCACACTTCCCCTTTCTGAATTCCTTGAGAACCTCACTCAGCC
AAATCTTGGTTCCCTACTACCCTCTCGTAGGCCGCCTCCACTGGATCCCCGCGGCCGCCTCGAGCTTCATTGCTGCACCGCGGGAGCCCATTTCATCGAGGCCGAAGCCG
CGGCCAAGCTCGATGATTACGACGACTTCAAATCATGCGATGCCACCGTAACGAACCAGCTTGTTCCCTCGGTTGATTACACCGTCCCCCTCGACAAGCAGCCGGTATTC
CTCGTTCAAGTGACGAGGTTCGCCTGCGGTGGCATCGTGATCGGCACTGCACTCTCTCACTTTGTGATCGATGGGATTGGCGCCAGCACATTCGTCAACTCGTGGGCGAG
CATTACGCGTGGGGAGAGCACGGCGGATAGCGTTGTGCCGAAGCCGTATCATGAGAGGGATGTTCTTCGGGTTCAGCCGCGAAAGCCGTCGAGATTTCATCATCAAGAGT
ATGAGAAACCTCCCCTTTTGGTTGGGCATTCCAGTCCTGAGGAAGAGCGCAACAGGGAAACCATTGTTGCGTCGCTTAAACTCACAACGTGCCAAGTTGAAAAGCTTAAA
AACAGAGCCAATCATTATCATTTAGAACCAAACTCAAACTCAAATCCAAAGCCATATACCAGATTTGAGGTGGTGGCTGGGCACATATGGGTATGTGCATGCAAGGCTCG
CAAAACATGCGTAACCGAGCAACCAACAGTGGCCCTCGTCACAGCCGACATACGACGACGAACAAGGCCACCTCTCCCAGAAGGATTCACCGGAAATGCAACATTGTTGA
CGGTGACACGGATATGCAAGTTTGGAGATTTGATGAATAATCCTCTAAACTATGCAGCCGAGAAGATAAGAGAAGGAACATGGAAGCTTACGGACGAATACATCGAGTCG
GCGATCGATTTCTTAGCAAGCAAAGACAATGAATGGATAAAGGAGTTGTATGAGAAAGGAATTTGCAGTGAAGGAGCTCCCTTTTGGGGGAATCCAAACATGGAGTTCGG
GTGTTGGACGTCGCTATCGCTTTACGATACGGATTTTGGGTGGGGGAGGCCTCGTTATGTTGGGTTGGCTTCGTTGGAGGATCTCGAGGATGGAGAGTCGTTGATCATGT
CAGGTCCAGAGAAGGATGGATGTCTCATTGTGCTAGCTACCGTCACCGTCAGTCTAGAGGAGAAACCAGGTAGCAACATCAGTCTCCAGCTACAGTTGCAATCCAGTCTT
GTCGGTGTTTCCTCGTCAGTCAGAAGCTTCCTTTCCTCGATTCAGTTCAAGATATTAGGAGGTTTAAGAAATATTAGAATATTAGAAGTTGGATGTACTGATTCTGTTAA
ACCTACCAATCCTACTGTCATCGATCAATATGTTAATCCCTACTACCTTCATCATTCGGATGGGACCAATTTGGTTCTTGTCTCTGAATTGCTTACGGAATCCAATTATT
CCTCCTGGTACCAGGCGATGCTCATCGGACTCACGGTGAAGAACAAGTTGCGACGCGAGATTTCAAACCTGATGCAAGAACAGTTGTCTGTCACTGCATACTTTGCAAAA
TTAAAGGCTCTGTGGAATGAACTTGTTTCTTACAGACCTTCTTGCACTTGCGACCGTTGTTCTTGTGGAGGAGTTCAAGGGTTAGTTCAATATTTCCAAACTGAACACGT
CGTGGCGTTTCTGATGGGGCTAAATGAGTCTTTTAGTCAAATTCGTACTCAGTTGTTACTGATGGAGCCAGAACCGACGATCGTTAAAGCTTTCTCCTTAGTTGCTCAGG
AAGTCGAACAATGTGCTTCTACTGTTGTTCCTATCGCTGCTCCTTCTCCTACGATTGATGCTACTGCACTTCTTGTCAAGAACACTCAGCCGAATTCAGCCGCCAAAAAT
ACCCAACAGATCAAGAAGAAGGAACGCTCTCACTGTACACATTGCAACGTTCTTGGGCACACAATTGGCCGATGTTATAAGCTCCACGGATATCCCCCTGGATATCGCAA
TCAGAAGGCTCCCTCAACTGCTAAACCAGAATCGAATCCTCTGCCAAATCTCACTCCTGAGCAATGTCAGAGTATTCTTGCCATGCTTCAGTCTCATTTGAGCTCAGTTA
AAACAACACCGGAGTCTTCTTCATCCTCATCCATTGGAAATGCTCATGTGGCAGGTACATGTTCTTCACTATTACCTCCGTCATACAGTTGGATTGTTGATTCTGGTGCA
TCTGCTCATATTTGTTTTTCGAAGGATTTGTTTTCCTCGTTTAAGTCAGTTTCGGGGTTCTCTGTCACTTTGCCTAATCAGGCTCGACTTGTTGTTGATTATGTTGGTGA
TGTGAGGTTATCTTCTGGTCTTCTGCTTCAGAATGTCTTGTTTATTCCGACATTTCAGTTCAATCTCATATCTGTCAGTGCAATAACTGCTCAATCCTCTGTTTCGTTGT
CCTTTGCTGATGGTTATTGTCTAATTCAGGACAAGTCTTCTTTGAAGACGATTGGGAAGGCTAGGTTTTGGCAAGGGCTCTATTTGTTGGATCATCATGCTCTAGACACT
GTTCCTTTGCCTGTTTACTTGAATTCTACTAAGCATTGTAGTCTTGTTTCGAATTGTAATACAGACTTATGGCATGACCGTCTTGGTCATCCTTCTCACAAGAATTTAAA
TGCATTGAAACCTTTGTTGTCGTTTAAAGAGAGCCCATCCCATCCTTGTTTCATTTGTCCCTTAGCCAAGCAGCGTAGACTTTCATTTTCTGCCCATAATCGTCTTGTAG
ATAAACCTTTTGATCTATTCCATTGTGATACATGGGGACCTTTTCGTTCACTTACACACTTAGGCCATCGGTATTTTCTTACTTTGGTCGATGATGCCACTCGCTATACA
TGGATTTTTTTTATGAAGAAGAAGTCTGACGCTCTTACTATAGTGCCAAGTTTTTTTAAGCTTGTTGATACACAGTTCCAAGTTGCAATCAAGTGTTTCCGCTCCGATAA
TGCTCCAGAACTCTCTTTCACTGATTTTTTTCGATCAAGCGGTGTTGTCCATCAGTTTTCATGTGTCGAGTGCCCGAGCAAAACTCTGTCGTGGAAAGGGAACATCAGCA
CCTCCTCAATGTTGCTCGTGCTCTATTTTTTCAGTCAAGGGTGCCCATACAATTTTGGGACTCCTTATTTTCGGTTGCATAGCAAGAATGTTGATTACTCCCAGCTTCGA
GTGTTTGGCTCGTTGTGTTTTGCCTCTACTCTAAAAGCTGGTAGATCCAAATTCTCTCCAAGGGTGATACCGGCTGTTTTCCTTGGCTATCCTCAAGGCATGAAGGCTTT
CAAGCTTTATGACATAGAGAACAAACATGTCTTCATCTCTCGTGATGTTGTGTTCCACGAGTCCATTTTTCCTTTTCATTCAGTTGTTGCTCAGAAGGATACCTCTGTTT
TTTTAAATGACTTAGTCTTACCCAAGTCATTCAACTATGCTGGCACTCCTGATGTTTCTCACGGTCACGAGCTTCAACTATCTGTTGGTGAACGACCTGTTCTGCCAAAT
GAAGACATTGTCTCTAATGATGTTTTTGTGCAACCTTCCATTGACTGTTCTTTGGGTCAATCTGAGTCTCAAATGCTGCCACAATCCCTTGCCTCTCAAGAGCCAAATGT
GCCTTTGTCTGATTCTCTTGGTTGTTCTTTGCCTAGTATCACTCGACATCGTGCTTGTCCTCAACGAAAGTCGAATAGGCAAATAAAGCAGCCTTCATACTTGAAGGACT
ACCATTGCAGCCTTCTCCACGCCAATGCTTTTCCGCCTACTCAGACAAAGTTTCCTTTGAACAAGTATCTGTCGTATAATAGTCTTTCTGATTCATATAGGACATTTGTT
CTCAATATATCTACAACCTTTGAGCCCCAGTTCTACCATCAGGCCGCGTCTTCTAGTCATTGGAGGGATGCCATGTCTGTAGAACTCAAAGCCATGGAGTCCAATTCTAC
TTGGTCTGTGGCTAGATTGATTGCCAAAGGATACACCCAACAAGAGGGACTTGATTACATTGAGACGTTCTCTCCCGTTGCCAAACTTGTCACCATTAAAGTGTTGTTAA
CTGTTGCTACTTCTCTTGGCTGGCCGCTCATACAACTCGATGTCAACAATGCTTTTTTGCATGGAGACCTTTTTGAAGAGGTCTACATGGACTTACCCTTGGGATATGTA
CATGGTAAGGTTTGTCCTCAAGGTCAACGTTTGGTATGTCAGCTTCACAAGTCCATCTATGGACTCAAACAAGCTTCAAGGCAATGGTTCGATAAGTTCTCATGTGCCCT
GTTGTCTTTTGGTTTTAAGCAGTCAAGATCAGACTATTCCCTCTTTACTAAAGGTTGTGGAAATACTTTTGTGGCCCTTCTCGTCTATGTTGACGATATATCATCACCAG
AGCCTTTGTCGTCCTCCAAAGGCATATTTCTTTCTCAAAGAAAGTATGCATTACAGCTTGTGGAAGATGAGGGTCTTCTTGCATCCAAACCTTCTGCTCTTCCTATGGAT
CCTAACTCAAAGTTGTCTTCTGACCAAGCCTCATTCTCCTCATCTCCAAGCTGCTCAGTCATTGCTTCGGTACATCAAAGGCTCGCCTACGCAGGGGATTTTCTTCCAGC
TTCGAAGTCATTTCAACTCAAGGCTTTCTCAGATGCTGATTGGGCCTCCTGCCCTGATTCTCGAAAGTCTACAACTGGATATTGTGTTTTTCTAGGTGATGCTTTAGTTT
CATGGAAAGCAAAGAAGCAGTCCACCGTTAGCCGTTCATCTGCAGAGGCCGAATACAGGGCCTTGGCTGTCACAACTGCTGAATTAGTCTGGTTGCGTCAGCTCTTGAAG
GAGCTCCTTATTCCTTCTGAGGTTCCTTCTCTGTTGTTCTGTGATAATCGACAATCCACATTGCAACCAACCCAATGTTCCATGAAAGGACCAAACACATTGAAATAG
Protein sequenceShow/hide protein sequence
MKPLSLSIHSSSTILPSNPTPHRTLPLSEFLENLTQPNLGSLLPSRRPPPLDPRGRLELHCCTAGAHFIEAEAAAKLDDYDDFKSCDATVTNQLVPSVDYTVPLDKQPVF
LVQVTRFACGGIVIGTALSHFVIDGIGASTFVNSWASITRGESTADSVVPKPYHERDVLRVQPRKPSRFHHQEYEKPPLLVGHSSPEEERNRETIVASLKLTTCQVEKLK
NRANHYHLEPNSNSNPKPYTRFEVVAGHIWVCACKARKTCVTEQPTVALVTADIRRRTRPPLPEGFTGNATLLTVTRICKFGDLMNNPLNYAAEKIREGTWKLTDEYIES
AIDFLASKDNEWIKELYEKGICSEGAPFWGNPNMEFGCWTSLSLYDTDFGWGRPRYVGLASLEDLEDGESLIMSGPEKDGCLIVLATVTVSLEEKPGSNISLQLQLQSSL
VGVSSSVRSFLSSIQFKILGGLRNIRILEVGCTDSVKPTNPTVIDQYVNPYYLHHSDGTNLVLVSELLTESNYSSWYQAMLIGLTVKNKLRREISNLMQEQLSVTAYFAK
LKALWNELVSYRPSCTCDRCSCGGVQGLVQYFQTEHVVAFLMGLNESFSQIRTQLLLMEPEPTIVKAFSLVAQEVEQCASTVVPIAAPSPTIDATALLVKNTQPNSAAKN
TQQIKKKERSHCTHCNVLGHTIGRCYKLHGYPPGYRNQKAPSTAKPESNPLPNLTPEQCQSILAMLQSHLSSVKTTPESSSSSSIGNAHVAGTCSSLLPPSYSWIVDSGA
SAHICFSKDLFSSFKSVSGFSVTLPNQARLVVDYVGDVRLSSGLLLQNVLFIPTFQFNLISVSAITAQSSVSLSFADGYCLIQDKSSLKTIGKARFWQGLYLLDHHALDT
VPLPVYLNSTKHCSLVSNCNTDLWHDRLGHPSHKNLNALKPLLSFKESPSHPCFICPLAKQRRLSFSAHNRLVDKPFDLFHCDTWGPFRSLTHLGHRYFLTLVDDATRYT
WIFFMKKKSDALTIVPSFFKLVDTQFQVAIKCFRSDNAPELSFTDFFRSSGVVHQFSCVECPSKTLSWKGNISTSSMLLVLYFFSQGCPYNFGTPYFRLHSKNVDYSQLR
VFGSLCFASTLKAGRSKFSPRVIPAVFLGYPQGMKAFKLYDIENKHVFISRDVVFHESIFPFHSVVAQKDTSVFLNDLVLPKSFNYAGTPDVSHGHELQLSVGERPVLPN
EDIVSNDVFVQPSIDCSLGQSESQMLPQSLASQEPNVPLSDSLGCSLPSITRHRACPQRKSNRQIKQPSYLKDYHCSLLHANAFPPTQTKFPLNKYLSYNSLSDSYRTFV
LNISTTFEPQFYHQAASSSHWRDAMSVELKAMESNSTWSVARLIAKGYTQQEGLDYIETFSPVAKLVTIKVLLTVATSLGWPLIQLDVNNAFLHGDLFEEVYMDLPLGYV
HGKVCPQGQRLVCQLHKSIYGLKQASRQWFDKFSCALLSFGFKQSRSDYSLFTKGCGNTFVALLVYVDDISSPEPLSSSKGIFLSQRKYALQLVEDEGLLASKPSALPMD
PNSKLSSDQASFSSSPSCSVIASVHQRLAYAGDFLPASKSFQLKAFSDADWASCPDSRKSTTGYCVFLGDALVSWKAKKQSTVSRSSAEAEYRALAVTTAELVWLRQLLK
ELLIPSEVPSLLFCDNRQSTLQPTQCSMKGPNTLK