; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039118 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039118
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr2:36341509..36350934
RNA-Seq ExpressionLag0039118
SyntenyLag0039118
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR029472 - Retrotransposon Copia-like, N-terminal
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN78598.1 hypothetical protein VITISV_001332 [Vitis vinifera]6.0e-28641.71Show/hide
Query:  GGESSEVVADSPPLPISINGSSSGPS---SDHHSVAITAFKLNGHNFIPWSRSIQMLIRGKGKIGFIDGSLVQPSTSDPSFSSWDMNNSLVMSWLIHSMD
        G  SS+V + + P      G S  P+   +D   + IT  KLNGHN++ WS+S+ + I GKGK  ++ G  V P T++P F  W + NS++MSWLI+SM+
Subjt:  GGESSEVVADSPPLPISINGSSSGPS---SDHHSVAITAFKLNGHNFIPWSRSIQMLIRGKGKIGFIDGSLVQPSTSDPSFSSWDMNNSLVMSWLIHSMD

Query:  SCIGELYLLYSTAKAIWDAVNLAYSDLEDSSQMFSLRTRIRNLRQETGTVTNYFHQLTRLWQELDLFQQQPWKDPLDGTLYRQMLAKERIYDFLAGLNSS
        + IGE +LL+ TAK IWDA    YS  E++S++F + + + + RQ   +VT Y++ LTR WQ+LDLF+   WK   D   YRQ++ ++R++ F  GLN  
Subjt:  SCIGELYLLYSTAKAIWDAVNLAYSDLEDSSQMFSLRTRIRNLRQETGTVTNYFHQLTRLWQELDLFQQQPWKDPLDGTLYRQMLAKERIYDFLAGLNSS

Query:  LDEVRGRILGSKPLPSIDEIFAEVRREEHRKTIMLCS--TPSGSSDASALVA----MHPADRKKRAHTWYDFCNKPYHTREKCWKLHGKPSNGASNGPRA
        LD+VRGRI+G KPLPS+ E F+EVRREE RK +M+ S   P+ + DASAL A        DR+KR   W D+C KP H +E CWKLHGKP++     PR 
Subjt:  LDEVRGRILGSKPLPSIDEIFAEVRREEHRKTIMLCS--TPSGSSDASALVA----MHPADRKKRAHTWYDFCNKPYHTREKCWKLHGKPSNGASNGPRA

Query:  PVDSSHAAAAAKSSLCSPSAAPFNQAQLDHLTKMM----SSSTLTTSLMAHSGLATISDLHCEPWIVDSGASDHITGRIHLLSNFSPCSLPRTVEIANGS
          D     AA   S   P  +PFN+ Q++ L K++    S ST   +  A+ G          PWIVD+GASDH+TG   +L N+ P +   +V IA+GS
Subjt:  PVDSSHAAAAAKSSLCSPSAAPFNQAQLDHLTKMM----SSSTLTTSLMAHSGLATISDLHCEPWIVDSGASDHITGRIHLLSNFSPCSLPRTVEIANGS

Query:  VLRVVGTDTIMFNPDFSLHNVLYVPQFEYNLMSISKLTASQHLTAIFKSNGCLFQDQITGRMIGHAKEHSGLYY------------------------FS
          ++ GT +I    D  L +VL+VP  + NL+SISKL         F  N C+FQD  +G+MIG A+  SGLY                         F+
Subjt:  VLRVVGTDTIMFNPDFSLHNVLYVPQFEYNLMSISKLTASQHLTAIFKSNGCLFQDQITGRMIGHAKEHSGLYY------------------------FS

Query:  GTSSSLIHS------------------TAALFPKLF--RNNESFHCTVCQVSKHQRIPFVSQSYCPSHPFALVHSDVWGPSRISTLRN------------
          S+S ++                    A LFP+LF  +N  S+HC +CQ +KH R  +    Y PS  F+LVHSDVWGPSRI  +              
Subjt:  GTSSSLIHS------------------TAALFPKLF--RNNESFHCTVCQVSKHQRIPFVSQSYCPSHPFALVHSDVWGPSRISTLRN------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------KRQPSKILKFNTPLSVLRSHFPSSR-VFSELELRVFGCKAFVHNLIPARWGISASEE---FFGKLPTIADFELFWEPTPSSNPTNIS---
                   R PS++L F +P  +    FP +    S+L L+VFGC AFVH     R   +       F G  PT   ++ +  PT     T +    
Subjt:  ----------KRQPSKILKFNTPLSVLRSHFPSSR-VFSELELRVFGCKAFVHNLIPARWGISASEE---FFGKLPTIADFELFWEPTPSSNPTNIS---

Query:  --------NELAPTNISNNH---DSPSPNSNPPISRNQSFSPNSSTPENLLAHQNQSNAQNSSPKSMSSNLENQIPVPTSNSVPVIPSGNIP--------
                         N H   +  S     P   ++S +P+   P  L      S  Q +   ++ S +  Q P+P     P + + N+         
Subjt:  --------NELAPTNISNNH---DSPSPNSNPPISRNQSFSPNSSTPENLLAHQNQSNAQNSSPKSMSSNLENQIPVPTSNSVPVIPSGNIP--------

Query:  -----GGKSAIQPYQFTYSRRDGKIEEPEDP-HEHEISPVSETELNDIDLPIAIRKGTRSCTQHSISNFLGYAHLSTPMQALVTQLSKESVPATIQEAWQ
             G +S    Y       D     PE+   E     V    ++D  LPIA+RKG R CT H I N++ Y  LS   +A  T L    VP TIQEA +
Subjt:  -----GGKSAIQPYQFTYSRRDGKIEEPEDP-HEHEISPVSETELNDIDLPIAIRKGTRSCTQHSISNFLGYAHLSTPMQALVTQLSKESVPATIQEAWQ

Query:  DPKWKKAVMEEMTALEKNGTWQIVTKSKDKKSVGCRWVFTIKYKSDGTIERYKARLVAKGYSQTHGVDYQETFAPVAKINSIRVLLSLAANHDWPLFQLD
          +WKKAV +E+ ALEKNGTW I      K+ VGC+W+FTIKYK+DG++ER+KARLVA+G++Q++G+DYQETFAPVAK+N+IR+LLSLA N DW L QLD
Subjt:  DPKWKKAVMEEMTALEKNGTWQIVTKSKDKKSVGCRWVFTIKYKSDGTIERYKARLVAKGYSQTHGVDYQETFAPVAKINSIRVLLSLAANHDWPLFQLD

Query:  VKNAFLNGELEEEVYMDLPPGF-NSNGDNKVCLLKKSLYRLKQSPRAWFGRFTKAILNFGYKQAHSDHTLFFKRQHTG-LTILIVYVDDIVLTGDDKTEI
        +KNAFLNG+LEEEVYM++PPGF  S   N+VC L+KSLY LKQSPRAWF RFTKA+L  GYKQ  +DHTLF K+ H G L ILIVYVDDI+L+G+D  E+
Subjt:  VKNAFLNGELEEEVYMDLPPGF-NSNGDNKVCLLKKSLYRLKQSPRAWFGRFTKAILNFGYKQAHSDHTLFFKRQHTG-LTILIVYVDDIVLTGDDKTEI

Query:  QSVKNKLAAEFELKDLGN------LRPVDTPMDPSLKLETKAGDPPVDKGRYQRLVGKLIYLSYTRPDIAFAVSCVSQFMHSPSEIHMQAIHRVLKYLKG
        Q++K  L+ EFE+KDLGN       +P+DTPMD   KL  +    PVD+GRYQ LVG+LIYLS+TRPDI FAVS VSQFMHSP+E HM+A++R+L+YLK 
Subjt:  QSVKNKLAAEFELKDLGN------LRPVDTPMDPSLKLETKAGDPPVDKGRYQRLVGKLIYLSYTRPDIAFAVSCVSQFMHSPSEIHMQAIHRVLKYLKG

Query:  TPGLGLLFKKNTHRGVELFTNADWAGSAEDRRSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAELRSLALGICEGLWLKMFLEELDMKTDGPITNYCDNR
        TPG GL F+K  +R  E++++ADWAG+  DRRSTSGYC+FVWGNLVTWRSKKQSVVARSSAEAE R+LA GICEG+W+K  L EL   +  PI   CDN+
Subjt:  TPGLGLLFKKNTHRGVELFTNADWAGSAEDRRSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAELRSLALGICEGLWLKMFLEELDMKTDGPITNYCDNR

Query:  AAISISHNPVHHDRVKHVELDRHFIKEKVDEGILKI
        AAISI+ NPVHHDR KHVE+DRHFI EKV    +K+
Subjt:  AAISISHNPVHHDRVKHVELDRHFIKEKVDEGILKI

EXC35359.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Morus notabilis]1.3e-28541.41Show/hide
Query:  SINGSSSGPSSDHHSVAITAFKLNGHNFIPWSRSIQMLIRGKGKIGFIDGSLVQPSTSDPSFSSWDMNNSLVMSWLIHSMDSCIGELYLLYSTAKAIWDA
        ++ GS S P      + IT  KLNGHN++ WS+S+ + I GKGK  ++ G    P T+D  F  W + NS++MSWLI+SM++ IGE +LL+ TAK IWDA
Subjt:  SINGSSSGPSSDHHSVAITAFKLNGHNFIPWSRSIQMLIRGKGKIGFIDGSLVQPSTSDPSFSSWDMNNSLVMSWLIHSMDSCIGELYLLYSTAKAIWDA

Query:  VNLAYSDLEDSSQMFSLRTRIRNLRQETGTVTNYFHQLTRLWQELDLFQQQPWKDPLDGTLYRQMLAKERIYDFLAGLNSSLDEVRGRILGSKPLPSIDE
            YS  E++S++F + + + + RQ   +VT Y+  L R WQ+LDLF+   WK P D   YRQ++ ++R++ F  GLN  LD+VRGRI+G+KPLPS+ E
Subjt:  VNLAYSDLEDSSQMFSLRTRIRNLRQETGTVTNYFHQLTRLWQELDLFQQQPWKDPLDGTLYRQMLAKERIYDFLAGLNSSLDEVRGRILGSKPLPSIDE

Query:  IFAEVRREEHRKTIMLCSTPSGSS--DASALVA----MHPADRKKRAHTWYDFCNKPYHTREKCWKLHGKPSNGASNGPRAPVDSSHAAAAAKSSLCSPS
         F+EVRREE RK +M+ S    +S  DASAL       +  D +KR   W D+C K  H +E CWKLHGKP++     PR   DS    A+   S   P 
Subjt:  IFAEVRREEHRKTIMLCSTPSGSS--DASALVA----MHPADRKKRAHTWYDFCNKPYHTREKCWKLHGKPSNGASNGPRAPVDSSHAAAAAKSSLCSPS

Query:  AAPFNQAQLDHLTKMMSSSTLTTSLMAHSGLATISDLH---------CEPWIVDSGASDHITGRIHLLSNFSPCSLPRTVEIANGSVLRVVGTDTIMFNP
         +PFN+ Q++ L K+ S   + +  +  +GL   +D H           PWIVD+GASDH+TG   LL N+ P S   +V IA+GS  ++ GT +I    
Subjt:  AAPFNQAQLDHLTKMMSSSTLTTSLMAHSGLATISDLH---------CEPWIVDSGASDHITGRIHLLSNFSPCSLPRTVEIANGSVLRVVGTDTIMFNP

Query:  DFSLHNVLYVPQFEYNLMSISKLTASQHLTAIFKSNGCLFQDQITGRMIGHAKEHSGLYYFSGTSSS---------------------------------
        +  L +VL+VP  + NL+SISKL         F  N C+FQD  +G+MIG A+  SGLY  S   SS                                 
Subjt:  DFSLHNVLYVPQFEYNLMSISKLTASQHLTAIFKSNGCLFQDQITGRMIGHAKEHSGLYYFSGTSSS---------------------------------

Query:  LIH---------STAALFPKLF--RNNESFHCTVCQVSKHQRIPFVSQSYCPSHPFALVHSDVWGPSRI-------------------------------
        L+H           A LFPKLF  +N  SFHC +CQ++KH R  +    Y PS  F+L+HSDVWGPSRI                               
Subjt:  LIH---------STAALFPKLF--RNNESFHCTVCQVSKHQRIPFVSQSYCPSHPFALVHSDVWGPSRI-------------------------------

Query:  -------------------------------------------------------------------------------------------STLRNKRQP
                                                                                                   +T    R P
Subjt:  -------------------------------------------------------------------------------------------STLRNKRQP

Query:  SKILKFNTPLSVLRSHFPSSR-VFSELELRVFGCKAFVH-------------------NLIPARWGISA----SEEFFGKLPTIADFELFWEPT-----P
        S++L F +P  +L  +FP +R V S+L  +VFGC AFVH                      P + G       S+ F+  +       +F+ P       
Subjt:  SKILKFNTPLSVLRSHFPSSR-VFSELELRVFGCKAFVH-------------------NLIPARWGISA----SEEFFGKLPTIADFELFWEPT-----P

Query:  SSNPTNISNELAPTNISNNHDSPSPNSNPPISRNQSFSPNSSTPENLLAHQNQSNAQNSSPKSMSSNLENQIPVPTSNSVPVIPSGNIPG--GKSAIQPY
        S N   I   +  +  S++ +SP P+   PI        +SSTP  L          +  P ++SS      PVP  +  P + + N+     +   Q  
Subjt:  SSNPTNISNELAPTNISNNHDSPSPNSNPPISRNQSFSPNSSTPENLLAHQNQSNAQNSSPKSMSSNLENQIPVPTSNSVPVIPSGNIPG--GKSAIQPY

Query:  QFTYSRRDGK-IEEPEDPHEHEI-----SPVSETELNDIDLPIAIRKGTRSCTQHSISNFLGYAHLSTPMQALVTQLSKESVPATIQEAWQDPKWKKAVM
        +       G+ I+    P E  +       VS   ++D  LPIA+RKG R CT H I N++ Y  LS   +A  T L    +P+TI EA Q+ +WKKAV 
Subjt:  QFTYSRRDGK-IEEPEDPHEHEI-----SPVSETELNDIDLPIAIRKGTRSCTQHSISNFLGYAHLSTPMQALVTQLSKESVPATIQEAWQDPKWKKAVM

Query:  EEMTALEKNGTWQIVTKSKDKKSVGCRWVFTIKYKSDGTIERYKARLVAKGYSQTHGVDYQETFAPVAKINSIRVLLSLAANHDWPLFQLDVKNAFLNGE
        +E+ ALEKNGTW I      K+ VGC+W+FTIKYK+DG++ER+KARLVA+G++Q++G+DYQETFAPVAK+N+IR+LLSLA N DW L QLD+KNAFLNG+
Subjt:  EEMTALEKNGTWQIVTKSKDKKSVGCRWVFTIKYKSDGTIERYKARLVAKGYSQTHGVDYQETFAPVAKINSIRVLLSLAANHDWPLFQLDVKNAFLNGE

Query:  LEEEVYMDLPPGF-NSNGDNKVCLLKKSLYRLKQSPRAWFGRFTKAILNFGYKQAHSDHTLFFKRQHT-GLTILIVYVDDIVLTGDDKTEIQSVKNKLAA
        LEEEVYM++PPGF  S   N+VC L+KSLY LKQSPRAWF RFTKA+L  GY Q  SDHTLF K+ H   + ILIVYVDDI+L+G+D  E+Q +K  L+ 
Subjt:  LEEEVYMDLPPGF-NSNGDNKVCLLKKSLYRLKQSPRAWFGRFTKAILNFGYKQAHSDHTLFFKRQHT-GLTILIVYVDDIVLTGDDKTEIQSVKNKLAA

Query:  EFELKDLGNL-----------------------------------RPVDTPMDPSLKLETKAGDPPVDKGRYQRLVGKLIYLSYTRPDIAFAVSCVSQFM
        EFE+KDLGNL                                   +PVDTPMD   KL T+    PVD+GRYQRLVG+LIYLS+TRPDI FAVS VSQFM
Subjt:  EFELKDLGNL-----------------------------------RPVDTPMDPSLKLETKAGDPPVDKGRYQRLVGKLIYLSYTRPDIAFAVSCVSQFM

Query:  HSPSEIHMQAIHRVLKYLKGTPGLGLLFKKNTHRGVELFTNADWAGSAEDRRSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAELRSLALGICEGLWLKM
        HSP+E HM+A++RVL+YLK TPG GL F K  +R  E++++ADWAG+  DRRSTSGYC+FVWGNLVTWRSKKQSVVARSSAEAE R+LA GICEG+W++ 
Subjt:  HSPSEIHMQAIHRVLKYLKGTPGLGLLFKKNTHRGVELFTNADWAGSAEDRRSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAELRSLALGICEGLWLKM

Query:  FLEELDMKTDGPITNYCDNRAAISISHNPVHHDRVKHVELDRHFIKEKV
         L EL   +  PI   CDN+AAISI+ NPVHHDR KHVE+DRHFI EKV
Subjt:  FLEELDMKTDGPITNYCDNRAAISISHNPVHHDRVKHVELDRHFIKEKV

RVW57183.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]5.4e-28742.43Show/hide
Query:  GGESSEVVADSPPLPISINGSSSGPS---SDHHSVAITAFKLNGHNFIPWSRSIQMLIRGKGKIGFIDGSLVQPSTSDPSFSSWDMNNSLVMSWLIHSMD
        G  SS+V + + P      G S  P+   +D   + IT  KLNGHN++ WS+S+ + I GKGK  ++ G  V P T++P F  W + NS++MSWLI+SM+
Subjt:  GGESSEVVADSPPLPISINGSSSGPS---SDHHSVAITAFKLNGHNFIPWSRSIQMLIRGKGKIGFIDGSLVQPSTSDPSFSSWDMNNSLVMSWLIHSMD

Query:  SCIGELYLLYSTAKAIWDAVNLAYSDLEDSSQMFSLRTRIRNLRQETGTVTNYFHQLTRLWQELDLFQQQPWKDPLDGTLYRQMLAKERIYDFLAGLNSS
        + IGE +LL+ TAK IWDA    YS  E++S++F + + + N  Q   +VT Y++ LTR WQ+LDLF+   WK   D   YRQ++ ++R++ F  GLN  
Subjt:  SCIGELYLLYSTAKAIWDAVNLAYSDLEDSSQMFSLRTRIRNLRQETGTVTNYFHQLTRLWQELDLFQQQPWKDPLDGTLYRQMLAKERIYDFLAGLNSS

Query:  LDEVRGRILGSKPLPSIDEIFAEVRREEHRKTIMLCS--TPSGSSDASALVA----MHPADRKKRAHTWYDFCNKPYHTREKCWKLHGKPSNGASNGPRA
        LD+VRGRI+G KPLPS+ E F+EVRREE RK +M+ S   P+ + DASAL A        DR+KR   W D+C KP H +E CWKLHGKP++     PR 
Subjt:  LDEVRGRILGSKPLPSIDEIFAEVRREEHRKTIMLCS--TPSGSSDASALVA----MHPADRKKRAHTWYDFCNKPYHTREKCWKLHGKPSNGASNGPRA

Query:  PVDSSHAAAAAKSSLCSPSAAPFNQAQLDHLTKMM----SSSTLTTSLMAHSGLATISDLHCEPWIVDSGASDHITGRIHLLSNFSPCSLPRTVEIANGS
          D     AA   S   P  +PFN+ Q++ L K++    S ST   +  A+ G          PWIVD+GASDH+TG   +L N+ P +   +V IA+GS
Subjt:  PVDSSHAAAAAKSSLCSPSAAPFNQAQLDHLTKMM----SSSTLTTSLMAHSGLATISDLHCEPWIVDSGASDHITGRIHLLSNFSPCSLPRTVEIANGS

Query:  VLRVVGTDTIMFNPDFSLHNVLYVPQFEYNLMSISKLTASQHLTAIFKSNGCLFQDQITGRMIGHAKEHSGLYY------------------------FS
          ++ GT +I    D  L +VL+VP  + NL+SISKL         F  N C+FQD  +G+MIG A+  SGLY                         F+
Subjt:  VLRVVGTDTIMFNPDFSLHNVLYVPQFEYNLMSISKLTASQHLTAIFKSNGCLFQDQITGRMIGHAKEHSGLYY------------------------FS

Query:  GTSSSLIHS------------------TAALFPKLF--RNNESFHCTVCQVSKHQRIPFVSQSYCPSHPFALVHSDVWGPSRISTLRNKR----------
          S+S ++                    A LFPKLF  +N  S+HC +CQ +KH R  +    Y PS  F+LVHSDVWGPSRI  +   R          
Subjt:  GTSSSLIHS------------------TAALFPKLF--RNNESFHCTVCQVSKHQRIPFVSQSYCPSHPFALVHSDVWGPSRISTLRNKR----------

Query:  ---------QPSKI------------LKFNTPLSVLRSHFPSSRVFSELE-------------------------------LRVFGCKAFVHNLIPARWG
                 + S++             +FN+ + VL+S        S L                                L V  C  F  N+    WG
Subjt:  ---------QPSKI------------LKFNTPLSVLRSHFPSSRVFSELE-------------------------------LRVFGCKAFVHNLIPARWG

Query:  --ISASEEFFGKLPT-IADF----ELFWEPTP------SSNPTNISNELAPTNISNNHDSPSPNSNPPISRNQSFSPNSSTPENLLAHQNQSNAQNSSPK
          I  +     ++P+ +  F    +LF +  P      S  P  +    A  ++   + SP+P+   P   +    P               + Q +   
Subjt:  --ISASEEFFGKLPT-IADF----ELFWEPTP------SSNPTNISNELAPTNISNNHDSPSPNSNPPISRNQSFSPNSSTPENLLAHQNQSNAQNSSPK

Query:  SMSSNLENQIPVPTSNSVPVIPSGNIP-------------GGKSAIQPYQFTYSRRDGKIEEPEDP-HEHEISPVSETELNDIDLPIAIRKGTRSCTQHS
        ++ S +  Q P+P     P + + N+              G +S    Y       D     PE+   E     V    ++D  LPIA+RKG R CT H 
Subjt:  SMSSNLENQIPVPTSNSVPVIPSGNIP-------------GGKSAIQPYQFTYSRRDGKIEEPEDP-HEHEISPVSETELNDIDLPIAIRKGTRSCTQHS

Query:  ISNFLGYAHLSTPMQALVTQLSKESVPATIQEAWQDPKWKKAVMEEMTALEKNGTWQIVTKSKDKKSVGCRWVFTIKYKSDGTIERYKARLVAKGYSQTH
        I N++ Y  LS   +A  T L    VP TIQEA +  +WKKAV +E+ ALEKN TW I      K+ VGC+W+FTIKYK+DG++ER+KARLVA+G++Q++
Subjt:  ISNFLGYAHLSTPMQALVTQLSKESVPATIQEAWQDPKWKKAVMEEMTALEKNGTWQIVTKSKDKKSVGCRWVFTIKYKSDGTIERYKARLVAKGYSQTH

Query:  GVDYQETFAPVAKINSIRVLLSLAANHDWPLFQLDVKNAFLNGELEEEVYMDLPPGF-NSNGDNKVCLLKKSLYRLKQSPRAWFGRFTKAILNFGYKQAH
        G+DYQETFAPVAK+N+IR+LLSLA N DW L QLD+KNAFLNG+LEEEVYM++PPGF  S   N+VC L+KSLY LKQSPRAWF RFTKA+L  GYKQ  
Subjt:  GVDYQETFAPVAKINSIRVLLSLAANHDWPLFQLDVKNAFLNGELEEEVYMDLPPGF-NSNGDNKVCLLKKSLYRLKQSPRAWFGRFTKAILNFGYKQAH

Query:  SDHTLFFKRQHTG-LTILIVYVDDIVLTGDDKTEIQSVKNKLAAEFELKDLGNL-----------------------------------RPVDTPMDPSL
        +DHTLF K+ H G L ILIVYVDDI+L+G+D  E+Q++K  L+ EFE+KDLGNL                                   +P+DTPMD   
Subjt:  SDHTLFFKRQHTG-LTILIVYVDDIVLTGDDKTEIQSVKNKLAAEFELKDLGNL-----------------------------------RPVDTPMDPSL

Query:  KLETKAGDPPVDKGRYQRLVGKLIYLSYTRPDIAFAVSCVSQFMHSPSEIHMQAIHRVLKYLKGTPGLGLLFKKNTHRGVELFTNADWAGSAEDRRSTSG
        KL  +    PVD+GRYQRLVG+LIYLS+TRPDI FAVS VSQFMHSP+E HM+A++R+L+YLK TPG GL F+K  +R  +++++ADWAG+  DRRSTSG
Subjt:  KLETKAGDPPVDKGRYQRLVGKLIYLSYTRPDIAFAVSCVSQFMHSPSEIHMQAIHRVLKYLKGTPGLGLLFKKNTHRGVELFTNADWAGSAEDRRSTSG

Query:  YCTFVWGNLVTWRSKKQSVVARSSAEAELRSLALGICEGLWLKMFLEELDMKTDGPITNYCDNRAAISISHNPVHHDRVKHVELDRHFIKEKVDEGILKI
        YC+FVWGNLVTWRSKKQSVVARSSAEAE R+LA GICEG+W+K  L EL   +  PI   CDN+AAISI+ NPVHHDR KHVE+DRHFI EKV    +K+
Subjt:  YCTFVWGNLVTWRSKKQSVVARSSAEAELRSLALGICEGLWLKMFLEELDMKTDGPITNYCDNRAAISISHNPVHHDRVKHVELDRHFIKEKVDEGILKI

XP_024044151.1 uncharacterized protein LOC18046468 isoform X1 [Citrus clementina]3.4e-28941.61Show/hide
Query:  ITAFKLNGHNFIPWSRSIQMLIRGKGKIGFIDGSLVQPSTSDPSFSSWDMNNSLVMSWLIHSMDSCIGELYLLYSTAKAIWDAVNLAYSDLEDSSQMFSL
        IT  KLNG NF+ WS+S++M IRGKGKIG++ GS+ +P+  DP F +WD +NS++MSWL++SM+  IG+ YL   TAK +WDAV   YSDL +S+Q++ L
Subjt:  ITAFKLNGHNFIPWSRSIQMLIRGKGKIGFIDGSLVQPSTSDPSFSSWDMNNSLVMSWLIHSMDSCIGELYLLYSTAKAIWDAVNLAYSDLEDSSQMFSL

Query:  RTRIRNLRQETGTVTNYFHQLTRLWQELDLFQQQPWKDPLDGTLYRQMLAKERIYDFLAGLNSSLDEVRGRILGSKPLPSIDEIFAEVRREEHRKTIMLC
        +TRIR  +Q +  VT Y++ L  LWQELD +    W+  +D   Y++ML KER+++FLAGL+S LDEVRGR+LG +PLPS  E+F+ VRREE RK +M+ 
Subjt:  RTRIRNLRQETGTVTNYFHQLTRLWQELDLFQQQPWKDPLDGTLYRQMLAKERIYDFLAGLNSSLDEVRGRILGSKPLPSIDEIFAEVRREEHRKTIMLC

Query:  STPSGSSDASALVAMHP-------------ADRKKRAHTWYDFCNKPYHTREKCWKLHGKPSNGASN---GPRAPVDSSHAAAAAKSSLCSPSAAPFNQA
             S++ SAL+++ P             +D K R   W D+C+KP HTR+ CWKLHGKP N  +N   G  +            ++     +  F + 
Subjt:  STPSGSSDASALVAMHP-------------ADRKKRAHTWYDFCNKPYHTREKCWKLHGKPSNGASN---GPRAPVDSSHAAAAAKSSLCSPSAAPFNQA

Query:  QLDHLTKMMSSSTLTTSLMAHSGLATISDLHC---------EPWIVDSGASDHITGRIHLLSNFSPCSLPRTVEIANGSVLRVVGTDTIMFNPDFSLHNV
        QL+ L + ++ S    +  + S LA   +            +PWI+DSGA+DH+T    L S++ PCS  + ++IA+GS+  V G  +I  + +  L +V
Subjt:  QLDHLTKMMSSSTLTTSLMAHSGLATISDLHC---------EPWIVDSGASDHITGRIHLLSNFSPCSLPRTVEIANGSVLRVVGTDTIMFNPDFSLHNV

Query:  LYVPQFEYNLMSISKLTASQHLTAIFKSNGCLFQDQITGRMIGHAKEHSGLYYFSGTSSSLIHSTAA-----------------------------LFPK
        L+VP    NL+S+SK+T   H  A F  + C FQD  +G+ IG A+E  GLYYF    S    + AA                             LFP 
Subjt:  LYVPQFEYNLMSISKLTASQHLTAIFKSNGCLFQDQITGRMIGHAKEHSGLYYFSGTSSSLIHSTAA-----------------------------LFPK

Query:  LFRNNES--FHCTVCQVSKHQRIPFVSQSYCPSHPFALVHSDVWGPSRISTLRN----------------------------------------------
        LF+N     F C +C +SKH R  F SQ Y  S PF+L+HSD+WGPSR++ +                                                
Subjt:  LFRNNES--FHCTVCQVSKHQRIPFVSQSYCPSHPFALVHSDVWGPSRISTLRN----------------------------------------------

Query:  ----------------------------------------------------------------------------KRQPSKILKFNTPLSVLRSHFPSS
                                                                                     R P++I  F +PL+V    +P +
Subjt:  ----------------------------------------------------------------------------KRQPSKILKFNTPLSVLRSHFPSS

Query:  RVFSELELRVFGCKAFVHNLIPARWGISASE---EFFGKLPTIADFELFWEPTPSSNPTNISNELAPTNISNNHDSPSPNSNPPISRNQSFSPNSSTPEN
        +VF+ L  ++FGC AFVH     R  +        F G  PT   ++ +    P SN   ++ ++                      N+SF P +S    
Subjt:  RVFSELELRVFGCKAFVHNLIPARWGISASE---EFFGKLPTIADFELFWEPTPSSNPTNISNELAPTNISNNHDSPSPNSNPPISRNQSFSPNSSTPEN

Query:  LLAHQNQSNAQNSSPKSMSSNLENQIPVPTSNSVPVIPS------GNIPGGKSAIQPYQFTYSRRDGKIE--------EPEDPHEHEI--------SPVS
            Q + + Q      +S      +P+P  + VP +PS       N    +   +P    Y+RR+ K          +  DP+   +        +P+S
Subjt:  LLAHQNQSNAQNSSPKSMSSNLENQIPVPTSNSVPVIPS------GNIPGGKSAIQPYQFTYSRRDGKIE--------EPEDPHEHEI--------SPVS

Query:  ETEL---NDIDLPIAIRKGTRSCTQHSISNFLGYAHLSTPMQALVTQLSKESVPATIQEAWQDPKWKKAVMEEMTALEKNGTWQIVTKSKDKKSVGCRWV
        E+ L   ND+D+PIA RKGTRSCT H IS ++ Y  LS   +A    LS  ++P ++Q+A   P+W+ AV  EM ALEKN TW++V   ++KK VGC+W+
Subjt:  ETEL---NDIDLPIAIRKGTRSCTQHSISNFLGYAHLSTPMQALVTQLSKESVPATIQEAWQDPKWKKAVMEEMTALEKNGTWQIVTKSKDKKSVGCRWV

Query:  FTIKYKSDGTIERYKARLVAKGYSQTHGVDYQETFAPVAKINSIRVLLSLAANHDWPLFQLDVKNAFLNGELEEEVYMDLPPGF-NSNGDNKVCLLKKSL
        FT+KY++DG++ERYKARLVAKG++QT+G+DYQETFAPVAK+NSIRVLLSLAA+  W L QLDVKNAFLNGELEEEVYMDLPPGF N  G  KVC LK+SL
Subjt:  FTIKYKSDGTIERYKARLVAKGYSQTHGVDYQETFAPVAKINSIRVLLSLAANHDWPLFQLDVKNAFLNGELEEEVYMDLPPGF-NSNGDNKVCLLKKSL

Query:  YRLKQSPRAWFGRFTKAILNFGYKQAHSDHTLFFKRQHTG-LTILIVYVDDIVLTGDDKTEIQSVKNKLAAEFELKDLGNLR------------------
        Y LKQSPRAWF RFTK++ +FGY Q+ +DHT+FFK    G + I+IVYVDDI+LTG +  E++++K  LA EFE+KDLG LR                  
Subjt:  YRLKQSPRAWFGRFTKAILNFGYKQAHSDHTLFFKRQHTG-LTILIVYVDDIVLTGDDKTEIQSVKNKLAAEFELKDLGNLR------------------

Query:  -----------------PVDTPMDPSLKLETKAGDPPVDKGRYQRLVGKLIYLSYTRPDIAFAVSCVSQFMHSPSEIHMQAIHRVLKYLKGTPGLGLLFK
                         P DTP+DP  KL        VD+  YQRLVGKLIYLS+TRPDIAFAVS +SQFMHSP + H++A+++VLKYLK TPG GLLFK
Subjt:  -----------------PVDTPMDPSLKLETKAGDPPVDKGRYQRLVGKLIYLSYTRPDIAFAVSCVSQFMHSPSEIHMQAIHRVLKYLKGTPGLGLLFK

Query:  KNTHRGVELFTNADWAGSAEDRRSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAELRSLALGICEGLWLKMFLEELDMKTDGPITNYCDNRAAISISHNP
        KN +  VE++T+ADWAGS  DRRSTSGYCTFV GNLVTWRSKKQSVVARSSAEAE R++A GICE LWLK  LEEL +  D P+  YCDN+AAI+I+HNP
Subjt:  KNTHRGVELFTNADWAGSAEDRRSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAELRSLALGICEGLWLKMFLEELDMKTDGPITNYCDNRAAISISHNP

Query:  VHHDRVKHVELDRHFIKEKVDEGIL
        VHHDR KHVE+DRHFIKEK++ G++
Subjt:  VHHDRVKHVELDRHFIKEKVDEGIL

XP_024044152.1 uncharacterized protein LOC18046468 isoform X2 [Citrus clementina]3.4e-28941.61Show/hide
Query:  ITAFKLNGHNFIPWSRSIQMLIRGKGKIGFIDGSLVQPSTSDPSFSSWDMNNSLVMSWLIHSMDSCIGELYLLYSTAKAIWDAVNLAYSDLEDSSQMFSL
        IT  KLNG NF+ WS+S++M IRGKGKIG++ GS+ +P+  DP F +WD +NS++MSWL++SM+  IG+ YL   TAK +WDAV   YSDL +S+Q++ L
Subjt:  ITAFKLNGHNFIPWSRSIQMLIRGKGKIGFIDGSLVQPSTSDPSFSSWDMNNSLVMSWLIHSMDSCIGELYLLYSTAKAIWDAVNLAYSDLEDSSQMFSL

Query:  RTRIRNLRQETGTVTNYFHQLTRLWQELDLFQQQPWKDPLDGTLYRQMLAKERIYDFLAGLNSSLDEVRGRILGSKPLPSIDEIFAEVRREEHRKTIMLC
        +TRIR  +Q +  VT Y++ L  LWQELD +    W+  +D   Y++ML KER+++FLAGL+S LDEVRGR+LG +PLPS  E+F+ VRREE RK +M+ 
Subjt:  RTRIRNLRQETGTVTNYFHQLTRLWQELDLFQQQPWKDPLDGTLYRQMLAKERIYDFLAGLNSSLDEVRGRILGSKPLPSIDEIFAEVRREEHRKTIMLC

Query:  STPSGSSDASALVAMHP-------------ADRKKRAHTWYDFCNKPYHTREKCWKLHGKPSNGASN---GPRAPVDSSHAAAAAKSSLCSPSAAPFNQA
             S++ SAL+++ P             +D K R   W D+C+KP HTR+ CWKLHGKP N  +N   G  +            ++     +  F + 
Subjt:  STPSGSSDASALVAMHP-------------ADRKKRAHTWYDFCNKPYHTREKCWKLHGKPSNGASN---GPRAPVDSSHAAAAAKSSLCSPSAAPFNQA

Query:  QLDHLTKMMSSSTLTTSLMAHSGLATISDLHC---------EPWIVDSGASDHITGRIHLLSNFSPCSLPRTVEIANGSVLRVVGTDTIMFNPDFSLHNV
        QL+ L + ++ S    +  + S LA   +            +PWI+DSGA+DH+T    L S++ PCS  + ++IA+GS+  V G  +I  + +  L +V
Subjt:  QLDHLTKMMSSSTLTTSLMAHSGLATISDLHC---------EPWIVDSGASDHITGRIHLLSNFSPCSLPRTVEIANGSVLRVVGTDTIMFNPDFSLHNV

Query:  LYVPQFEYNLMSISKLTASQHLTAIFKSNGCLFQDQITGRMIGHAKEHSGLYYFSGTSSSLIHSTAA-----------------------------LFPK
        L+VP    NL+S+SK+T   H  A F  + C FQD  +G+ IG A+E  GLYYF    S    + AA                             LFP 
Subjt:  LYVPQFEYNLMSISKLTASQHLTAIFKSNGCLFQDQITGRMIGHAKEHSGLYYFSGTSSSLIHSTAA-----------------------------LFPK

Query:  LFRNNES--FHCTVCQVSKHQRIPFVSQSYCPSHPFALVHSDVWGPSRISTLRN----------------------------------------------
        LF+N     F C +C +SKH R  F SQ Y  S PF+L+HSD+WGPSR++ +                                                
Subjt:  LFRNNES--FHCTVCQVSKHQRIPFVSQSYCPSHPFALVHSDVWGPSRISTLRN----------------------------------------------

Query:  ----------------------------------------------------------------------------KRQPSKILKFNTPLSVLRSHFPSS
                                                                                     R P++I  F +PL+V    +P +
Subjt:  ----------------------------------------------------------------------------KRQPSKILKFNTPLSVLRSHFPSS

Query:  RVFSELELRVFGCKAFVHNLIPARWGISASE---EFFGKLPTIADFELFWEPTPSSNPTNISNELAPTNISNNHDSPSPNSNPPISRNQSFSPNSSTPEN
        +VF+ L  ++FGC AFVH     R  +        F G  PT   ++ +    P SN   ++ ++                      N+SF P +S    
Subjt:  RVFSELELRVFGCKAFVHNLIPARWGISASE---EFFGKLPTIADFELFWEPTPSSNPTNISNELAPTNISNNHDSPSPNSNPPISRNQSFSPNSSTPEN

Query:  LLAHQNQSNAQNSSPKSMSSNLENQIPVPTSNSVPVIPS------GNIPGGKSAIQPYQFTYSRRDGKIE--------EPEDPHEHEI--------SPVS
            Q + + Q      +S      +P+P  + VP +PS       N    +   +P    Y+RR+ K          +  DP+   +        +P+S
Subjt:  LLAHQNQSNAQNSSPKSMSSNLENQIPVPTSNSVPVIPS------GNIPGGKSAIQPYQFTYSRRDGKIE--------EPEDPHEHEI--------SPVS

Query:  ETEL---NDIDLPIAIRKGTRSCTQHSISNFLGYAHLSTPMQALVTQLSKESVPATIQEAWQDPKWKKAVMEEMTALEKNGTWQIVTKSKDKKSVGCRWV
        E+ L   ND+D+PIA RKGTRSCT H IS ++ Y  LS   +A    LS  ++P ++Q+A   P+W+ AV  EM ALEKN TW++V   ++KK VGC+W+
Subjt:  ETEL---NDIDLPIAIRKGTRSCTQHSISNFLGYAHLSTPMQALVTQLSKESVPATIQEAWQDPKWKKAVMEEMTALEKNGTWQIVTKSKDKKSVGCRWV

Query:  FTIKYKSDGTIERYKARLVAKGYSQTHGVDYQETFAPVAKINSIRVLLSLAANHDWPLFQLDVKNAFLNGELEEEVYMDLPPGF-NSNGDNKVCLLKKSL
        FT+KY++DG++ERYKARLVAKG++QT+G+DYQETFAPVAK+NSIRVLLSLAA+  W L QLDVKNAFLNGELEEEVYMDLPPGF N  G  KVC LK+SL
Subjt:  FTIKYKSDGTIERYKARLVAKGYSQTHGVDYQETFAPVAKINSIRVLLSLAANHDWPLFQLDVKNAFLNGELEEEVYMDLPPGF-NSNGDNKVCLLKKSL

Query:  YRLKQSPRAWFGRFTKAILNFGYKQAHSDHTLFFKRQHTG-LTILIVYVDDIVLTGDDKTEIQSVKNKLAAEFELKDLGNLR------------------
        Y LKQSPRAWF RFTK++ +FGY Q+ +DHT+FFK    G + I+IVYVDDI+LTG +  E++++K  LA EFE+KDLG LR                  
Subjt:  YRLKQSPRAWFGRFTKAILNFGYKQAHSDHTLFFKRQHTG-LTILIVYVDDIVLTGDDKTEIQSVKNKLAAEFELKDLGNLR------------------

Query:  -----------------PVDTPMDPSLKLETKAGDPPVDKGRYQRLVGKLIYLSYTRPDIAFAVSCVSQFMHSPSEIHMQAIHRVLKYLKGTPGLGLLFK
                         P DTP+DP  KL        VD+  YQRLVGKLIYLS+TRPDIAFAVS +SQFMHSP + H++A+++VLKYLK TPG GLLFK
Subjt:  -----------------PVDTPMDPSLKLETKAGDPPVDKGRYQRLVGKLIYLSYTRPDIAFAVSCVSQFMHSPSEIHMQAIHRVLKYLKGTPGLGLLFK

Query:  KNTHRGVELFTNADWAGSAEDRRSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAELRSLALGICEGLWLKMFLEELDMKTDGPITNYCDNRAAISISHNP
        KN +  VE++T+ADWAGS  DRRSTSGYCTFV GNLVTWRSKKQSVVARSSAEAE R++A GICE LWLK  LEEL +  D P+  YCDN+AAI+I+HNP
Subjt:  KNTHRGVELFTNADWAGSAEDRRSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAELRSLALGICEGLWLKMFLEELDMKTDGPITNYCDNRAAISISHNP

Query:  VHHDRVKHVELDRHFIKEKVDEGIL
        VHHDR KHVE+DRHFIKEK++ G++
Subjt:  VHHDRVKHVELDRHFIKEKVDEGIL

TrEMBL top hitse value%identityAlignment
A0A438FB27 Retrovirus-related Pol polyprotein from transposon TNT 1-942.6e-28742.43Show/hide
Query:  GGESSEVVADSPPLPISINGSSSGPS---SDHHSVAITAFKLNGHNFIPWSRSIQMLIRGKGKIGFIDGSLVQPSTSDPSFSSWDMNNSLVMSWLIHSMD
        G  SS+V + + P      G S  P+   +D   + IT  KLNGHN++ WS+S+ + I GKGK  ++ G  V P T++P F  W + NS++MSWLI+SM+
Subjt:  GGESSEVVADSPPLPISINGSSSGPS---SDHHSVAITAFKLNGHNFIPWSRSIQMLIRGKGKIGFIDGSLVQPSTSDPSFSSWDMNNSLVMSWLIHSMD

Query:  SCIGELYLLYSTAKAIWDAVNLAYSDLEDSSQMFSLRTRIRNLRQETGTVTNYFHQLTRLWQELDLFQQQPWKDPLDGTLYRQMLAKERIYDFLAGLNSS
        + IGE +LL+ TAK IWDA    YS  E++S++F + + + N  Q   +VT Y++ LTR WQ+LDLF+   WK   D   YRQ++ ++R++ F  GLN  
Subjt:  SCIGELYLLYSTAKAIWDAVNLAYSDLEDSSQMFSLRTRIRNLRQETGTVTNYFHQLTRLWQELDLFQQQPWKDPLDGTLYRQMLAKERIYDFLAGLNSS

Query:  LDEVRGRILGSKPLPSIDEIFAEVRREEHRKTIMLCS--TPSGSSDASALVA----MHPADRKKRAHTWYDFCNKPYHTREKCWKLHGKPSNGASNGPRA
        LD+VRGRI+G KPLPS+ E F+EVRREE RK +M+ S   P+ + DASAL A        DR+KR   W D+C KP H +E CWKLHGKP++     PR 
Subjt:  LDEVRGRILGSKPLPSIDEIFAEVRREEHRKTIMLCS--TPSGSSDASALVA----MHPADRKKRAHTWYDFCNKPYHTREKCWKLHGKPSNGASNGPRA

Query:  PVDSSHAAAAAKSSLCSPSAAPFNQAQLDHLTKMM----SSSTLTTSLMAHSGLATISDLHCEPWIVDSGASDHITGRIHLLSNFSPCSLPRTVEIANGS
          D     AA   S   P  +PFN+ Q++ L K++    S ST   +  A+ G          PWIVD+GASDH+TG   +L N+ P +   +V IA+GS
Subjt:  PVDSSHAAAAAKSSLCSPSAAPFNQAQLDHLTKMM----SSSTLTTSLMAHSGLATISDLHCEPWIVDSGASDHITGRIHLLSNFSPCSLPRTVEIANGS

Query:  VLRVVGTDTIMFNPDFSLHNVLYVPQFEYNLMSISKLTASQHLTAIFKSNGCLFQDQITGRMIGHAKEHSGLYY------------------------FS
          ++ GT +I    D  L +VL+VP  + NL+SISKL         F  N C+FQD  +G+MIG A+  SGLY                         F+
Subjt:  VLRVVGTDTIMFNPDFSLHNVLYVPQFEYNLMSISKLTASQHLTAIFKSNGCLFQDQITGRMIGHAKEHSGLYY------------------------FS

Query:  GTSSSLIHS------------------TAALFPKLF--RNNESFHCTVCQVSKHQRIPFVSQSYCPSHPFALVHSDVWGPSRISTLRNKR----------
          S+S ++                    A LFPKLF  +N  S+HC +CQ +KH R  +    Y PS  F+LVHSDVWGPSRI  +   R          
Subjt:  GTSSSLIHS------------------TAALFPKLF--RNNESFHCTVCQVSKHQRIPFVSQSYCPSHPFALVHSDVWGPSRISTLRNKR----------

Query:  ---------QPSKI------------LKFNTPLSVLRSHFPSSRVFSELE-------------------------------LRVFGCKAFVHNLIPARWG
                 + S++             +FN+ + VL+S        S L                                L V  C  F  N+    WG
Subjt:  ---------QPSKI------------LKFNTPLSVLRSHFPSSRVFSELE-------------------------------LRVFGCKAFVHNLIPARWG

Query:  --ISASEEFFGKLPT-IADF----ELFWEPTP------SSNPTNISNELAPTNISNNHDSPSPNSNPPISRNQSFSPNSSTPENLLAHQNQSNAQNSSPK
          I  +     ++P+ +  F    +LF +  P      S  P  +    A  ++   + SP+P+   P   +    P               + Q +   
Subjt:  --ISASEEFFGKLPT-IADF----ELFWEPTP------SSNPTNISNELAPTNISNNHDSPSPNSNPPISRNQSFSPNSSTPENLLAHQNQSNAQNSSPK

Query:  SMSSNLENQIPVPTSNSVPVIPSGNIP-------------GGKSAIQPYQFTYSRRDGKIEEPEDP-HEHEISPVSETELNDIDLPIAIRKGTRSCTQHS
        ++ S +  Q P+P     P + + N+              G +S    Y       D     PE+   E     V    ++D  LPIA+RKG R CT H 
Subjt:  SMSSNLENQIPVPTSNSVPVIPSGNIP-------------GGKSAIQPYQFTYSRRDGKIEEPEDP-HEHEISPVSETELNDIDLPIAIRKGTRSCTQHS

Query:  ISNFLGYAHLSTPMQALVTQLSKESVPATIQEAWQDPKWKKAVMEEMTALEKNGTWQIVTKSKDKKSVGCRWVFTIKYKSDGTIERYKARLVAKGYSQTH
        I N++ Y  LS   +A  T L    VP TIQEA +  +WKKAV +E+ ALEKN TW I      K+ VGC+W+FTIKYK+DG++ER+KARLVA+G++Q++
Subjt:  ISNFLGYAHLSTPMQALVTQLSKESVPATIQEAWQDPKWKKAVMEEMTALEKNGTWQIVTKSKDKKSVGCRWVFTIKYKSDGTIERYKARLVAKGYSQTH

Query:  GVDYQETFAPVAKINSIRVLLSLAANHDWPLFQLDVKNAFLNGELEEEVYMDLPPGF-NSNGDNKVCLLKKSLYRLKQSPRAWFGRFTKAILNFGYKQAH
        G+DYQETFAPVAK+N+IR+LLSLA N DW L QLD+KNAFLNG+LEEEVYM++PPGF  S   N+VC L+KSLY LKQSPRAWF RFTKA+L  GYKQ  
Subjt:  GVDYQETFAPVAKINSIRVLLSLAANHDWPLFQLDVKNAFLNGELEEEVYMDLPPGF-NSNGDNKVCLLKKSLYRLKQSPRAWFGRFTKAILNFGYKQAH

Query:  SDHTLFFKRQHTG-LTILIVYVDDIVLTGDDKTEIQSVKNKLAAEFELKDLGNL-----------------------------------RPVDTPMDPSL
        +DHTLF K+ H G L ILIVYVDDI+L+G+D  E+Q++K  L+ EFE+KDLGNL                                   +P+DTPMD   
Subjt:  SDHTLFFKRQHTG-LTILIVYVDDIVLTGDDKTEIQSVKNKLAAEFELKDLGNL-----------------------------------RPVDTPMDPSL

Query:  KLETKAGDPPVDKGRYQRLVGKLIYLSYTRPDIAFAVSCVSQFMHSPSEIHMQAIHRVLKYLKGTPGLGLLFKKNTHRGVELFTNADWAGSAEDRRSTSG
        KL  +    PVD+GRYQRLVG+LIYLS+TRPDI FAVS VSQFMHSP+E HM+A++R+L+YLK TPG GL F+K  +R  +++++ADWAG+  DRRSTSG
Subjt:  KLETKAGDPPVDKGRYQRLVGKLIYLSYTRPDIAFAVSCVSQFMHSPSEIHMQAIHRVLKYLKGTPGLGLLFKKNTHRGVELFTNADWAGSAEDRRSTSG

Query:  YCTFVWGNLVTWRSKKQSVVARSSAEAELRSLALGICEGLWLKMFLEELDMKTDGPITNYCDNRAAISISHNPVHHDRVKHVELDRHFIKEKVDEGILKI
        YC+FVWGNLVTWRSKKQSVVARSSAEAE R+LA GICEG+W+K  L EL   +  PI   CDN+AAISI+ NPVHHDR KHVE+DRHFI EKV    +K+
Subjt:  YCTFVWGNLVTWRSKKQSVVARSSAEAELRSLALGICEGLWLKMFLEELDMKTDGPITNYCDNRAAISISHNPVHHDRVKHVELDRHFIKEKVDEGILKI

A0A438IZK5 Retrovirus-related Pol polyprotein from transposon TNT 1-941.4e-28541.11Show/hide
Query:  GGESSEVVADSPPLPISINGSSSGPS---SDHHSVAITAFKLNGHNFIPWSRSIQMLIRGKGKIGFIDGSLVQPSTSDPSFSSWDMNNSLVMSWLIHSMD
        G  SS+V + + P      G S  P+   +D   + IT  KLNGHN++ WS+S+ + I GKGK  ++ G  V P T++P F  W + NS++MSWLI+SM+
Subjt:  GGESSEVVADSPPLPISINGSSSGPS---SDHHSVAITAFKLNGHNFIPWSRSIQMLIRGKGKIGFIDGSLVQPSTSDPSFSSWDMNNSLVMSWLIHSMD

Query:  SCIGELYLLYSTAKAIWDAVNLAYSDLEDSSQMFSLRTRIRNLRQETGTVTNYFHQLTRLWQELDLFQQQPWKDPLDGTLYRQMLAKERIYDFLAGLNSS
        + IGE +LL+ TAK IWDA    YS  E++S++F + + + + RQ   +VT Y++ LTR WQ+LDLF+   WK   D   YRQ++ ++R++ F  GLN  
Subjt:  SCIGELYLLYSTAKAIWDAVNLAYSDLEDSSQMFSLRTRIRNLRQETGTVTNYFHQLTRLWQELDLFQQQPWKDPLDGTLYRQMLAKERIYDFLAGLNSS

Query:  LDEVRGRILGSKPLPSIDEIFAEVRREEHRKTIMLCS--TPSGSSDASALVA----MHPADRKKRAHTWYDFCNKPYHTREKCWKLHGKPSNGASNGPRA
        LD+VRGRI+G KPLPS+ E F+EVRREE RK +M+ S   P+ + DASAL A        DR+KR   W D+C KP H +E CWKLHGKP++     PR 
Subjt:  LDEVRGRILGSKPLPSIDEIFAEVRREEHRKTIMLCS--TPSGSSDASALVA----MHPADRKKRAHTWYDFCNKPYHTREKCWKLHGKPSNGASNGPRA

Query:  PVDSSHAAAAAKSSLCSPSAAPFNQAQLDHLTKMM----SSSTLTTSLMAHSGLATISDLHCEPWIVDSGASDHITGRIHLLSNFSPCSLPRTVEIANGS
          D     AA   S   P  +PFN+ Q++ L K++    S ST   +  A+ G          PWIVD+GASDH+TG   +L N+ P +   +V IA+GS
Subjt:  PVDSSHAAAAAKSSLCSPSAAPFNQAQLDHLTKMM----SSSTLTTSLMAHSGLATISDLHCEPWIVDSGASDHITGRIHLLSNFSPCSLPRTVEIANGS

Query:  VLRVVGTDTIMFNPDFSLHNVLYVPQFEYNLMSISKLTASQHLTAIFKSNGCLFQDQITGRMIGHAKEHSGLYY------------------------FS
          ++ GT +I    D  L +VL+VP  + NL+SISKL         F  N C+FQD  +G+MIG A+  SGLY                         F+
Subjt:  VLRVVGTDTIMFNPDFSLHNVLYVPQFEYNLMSISKLTASQHLTAIFKSNGCLFQDQITGRMIGHAKEHSGLYY------------------------FS

Query:  GTSSSLIHS------------------TAALFPKLF--RNNESFHCTVCQVSKHQRIPFVSQSYCPSHPFALVHSDVWGPSRISTLRN------------
          S+S ++                    A LFP+LF  +N  S+HC +CQ +KH R  +    Y PS  F+LVHSDVWGPSRI  +              
Subjt:  GTSSSLIHS------------------TAALFPKLF--RNNESFHCTVCQVSKHQRIPFVSQSYCPSHPFALVHSDVWGPSRISTLRN------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------KRQPSKILKFNTPLSVLRSHFPSSR-VFSELELRVFGCKAFVHNLIPARWGISASEE---FFGKLPTIADFELFWEPTPSSNPTNI----
                   R PS++L F +P  +    FP +    S+L L+VFGC AFVH     R   +       F G  PT   ++ +  PT     T +    
Subjt:  ----------KRQPSKILKFNTPLSVLRSHFPSSR-VFSELELRVFGCKAFVHNLIPARWGISASEE---FFGKLPTIADFELFWEPTPSSNPTNI----

Query:  --------SNELAPTNISNNHDSPSPNSNPPISRNQSFSPNSSTPENLLAHQNQSNAQNSSPKSMSSNLENQIPVPTSNSVPVIPSGNIP------GGKS
                 + +   +++ +    S     P   ++S +P+   P  L      S  Q +   ++ S +  Q P+P     P + + N+         + 
Subjt:  --------SNELAPTNISNNHDSPSPNSNPPISRNQSFSPNSSTPENLLAHQNQSNAQNSSPKSMSSNLENQIPVPTSNSVPVIPSGNIP------GGKS

Query:  AIQPYQFTYSRR-DGKIEEPEDP-HEHEISPVSETELNDIDLPIAIRKGTRSCTQHSISNFLGYAHLSTPMQALVTQLSKESVPATIQEAWQDPKWKKAV
             Q TY +  D     PE+   E     V    ++D  LPIA+RKG R CT H I N++ Y  LS   +A  T L    VP TIQEA +  +WKKAV
Subjt:  AIQPYQFTYSRR-DGKIEEPEDP-HEHEISPVSETELNDIDLPIAIRKGTRSCTQHSISNFLGYAHLSTPMQALVTQLSKESVPATIQEAWQDPKWKKAV

Query:  MEEMTALEKNGTWQIVTKSKDKKSVGCRWVFTIKYKSDGTIERYKARLVAKGYSQTHGVDYQETFAPVAKINSIRVLLSLAANHDWPLFQLDVKNAFLNG
         +E+ ALEKNGTW I      K+ VGC+W+FTIKYK+DG++ER+KARLVA+G++Q++G+DYQETFAPVAK+N+IR+LLSLA N DW L QLD+KNAFLNG
Subjt:  MEEMTALEKNGTWQIVTKSKDKKSVGCRWVFTIKYKSDGTIERYKARLVAKGYSQTHGVDYQETFAPVAKINSIRVLLSLAANHDWPLFQLDVKNAFLNG

Query:  ELEEEVYMDLPPGF-NSNGDNKVCLLKKSLYRLKQSPRAWFGRFTKAILNFGYKQAHSDHTLFFKRQHTG-LTILIVYVDDIVLTGDDKTEIQSVKNKLA
        +LEEEVYM++PPGF  S   N+VC L+KSLY LKQSPRAWF RFTKA+L  GYKQ  +DHTLF K+ H G L ILIVYVDDI+L+G+D  E+Q++K  L+
Subjt:  ELEEEVYMDLPPGF-NSNGDNKVCLLKKSLYRLKQSPRAWFGRFTKAILNFGYKQAHSDHTLFFKRQHTG-LTILIVYVDDIVLTGDDKTEIQSVKNKLA

Query:  AEFELKDLGNL-----------------------------------RPVDTPMDPSLKLETKAGDPPVDKGRYQRLVGKLIYLSYTRPDIAFAVSCVSQF
         EFE+KDLGNL                                   +P+DTPMD   KL  +    PVD+GRYQRLVG+LIYLS+TRPDI FAVS VSQF
Subjt:  AEFELKDLGNL-----------------------------------RPVDTPMDPSLKLETKAGDPPVDKGRYQRLVGKLIYLSYTRPDIAFAVSCVSQF

Query:  MHSPSEIHMQAIHRVLKYLKGTPGLGLLFKKNTHRGVELFTNADWAGSAEDRRSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAELRSLALGICEGLWLK
        MHSP+E HM+A++R+L+YLK TPG GL F+K  +R  E++++ADWAG+  DRRSTSGYC+FVWGNLVTWRSKKQSVVARSSAEAE R+LA GICEG+W+K
Subjt:  MHSPSEIHMQAIHRVLKYLKGTPGLGLLFKKNTHRGVELFTNADWAGSAEDRRSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAELRSLALGICEGLWLK

Query:  MFLEELDMKTDGPITNYCDNRAAISISHNPVHHDRVKHVELDRHFIKEKVDEGILKI
          L EL   +  PI   CDN+AAISI+ NPVHHDR KHVE+DRHFI EKV    +K+
Subjt:  MFLEELDMKTDGPITNYCDNRAAISISHNPVHHDRVKHVELDRHFIKEKVDEGILKI

A0A438K9S0 Retrovirus-related Pol polyprotein from transposon TNT 1-944.2e-28543.12Show/hide
Query:  GGESSEVVADSPPLPISINGSSSGPS---SDHHSVAITAFKLNGHNFIPWSRSIQMLIRGKGKIGFIDGSLVQPSTSDPSFSSWDMNNSLVMSWLIHSMD
        G  SS+V + + P      G S  P+   +D   + IT  KLNGHN++ WS+S+ + I GKGK  ++ G  V P T++P F  W + NS++MSWLI+SM+
Subjt:  GGESSEVVADSPPLPISINGSSSGPS---SDHHSVAITAFKLNGHNFIPWSRSIQMLIRGKGKIGFIDGSLVQPSTSDPSFSSWDMNNSLVMSWLIHSMD

Query:  SCIGELYLLYSTAKAIWDAVNLAYSDLEDSSQMFSLRTRIRNLRQETGTVTNYFHQLTRLWQELDLFQQQPWKDPLDGTLYRQMLAKERIYDFLAGLNSS
        + IGE +LL+ TAK IWDA    YS  E++S++F + + + + RQ   +VT Y++ LTR WQ+LDLF+   WK   D   YRQ++ ++R++ F  GLN  
Subjt:  SCIGELYLLYSTAKAIWDAVNLAYSDLEDSSQMFSLRTRIRNLRQETGTVTNYFHQLTRLWQELDLFQQQPWKDPLDGTLYRQMLAKERIYDFLAGLNSS

Query:  LDEVRGRILGSKPLPSIDEIFAEVRREEHRKTIMLCS--TPSGSSDASALVA----MHPADRKKRAHTWYDFCNKPYHTREKCWKLHGKPSNGASNGPRA
        LD+VRGRI+G KPLPS+ E F+EVRREE RK +M+ S   P+ + DASAL A        DR+KR   W D+C KP H +E CWKLHGKP++     PR 
Subjt:  LDEVRGRILGSKPLPSIDEIFAEVRREEHRKTIMLCS--TPSGSSDASALVA----MHPADRKKRAHTWYDFCNKPYHTREKCWKLHGKPSNGASNGPRA

Query:  PVDSSHAAAAAKSSLCSPSAAPFNQAQLDHLTKMM----SSSTLTTSLMAHSGLATISDLHCEPWIVDSGASDHITGRIHLLSNFSPCSLPRTVEIANGS
          D     AA   S   P  +PFN+ Q++ L K++    S ST   +  A+ G          PWIVD+GASDH+TG   +L N+ P +   +V IA+GS
Subjt:  PVDSSHAAAAAKSSLCSPSAAPFNQAQLDHLTKMM----SSSTLTTSLMAHSGLATISDLHCEPWIVDSGASDHITGRIHLLSNFSPCSLPRTVEIANGS

Query:  VLRVVGTDTIMFNPDFSLHNVLYVPQFEYNLMSISKLTASQHLTAIFKSNGCLFQDQITGRMIGHAKEHSGLYY------------------------FS
          ++ GT +I    D  L +VL+VP  + NL+SISKL         F  N C+FQD  +G+MIG A+  SGLY                         F+
Subjt:  VLRVVGTDTIMFNPDFSLHNVLYVPQFEYNLMSISKLTASQHLTAIFKSNGCLFQDQITGRMIGHAKEHSGLYY------------------------FS

Query:  GTSSSLIHS------------------TAALFPKLF--RNNESFHCTVCQVSKHQRIPFVSQSYCPSHPFALVHSDVWGPSRISTLRNKR----------
          S+S ++                    A LFPKLF  +N  S+HC +CQ +KH R  +    Y PS  F+LVHSDVWGPSRI  +   R          
Subjt:  GTSSSLIHS------------------TAALFPKLF--RNNESFHCTVCQVSKHQRIPFVSQSYCPSHPFALVHSDVWGPSRISTLRNKR----------

Query:  ---------QPSKI------------LKFNTPLSVLRSHFPSSRVFSELE-------------------------------LRVFGCKAFVHNLIPARWG
                 + S++             +FN+ + VL+S        S L                                L V  C  F  N+    WG
Subjt:  ---------QPSKI------------LKFNTPLSVLRSHFPSSRVFSELE-------------------------------LRVFGCKAFVHNLIPARWG

Query:  --ISASEEFFGKLPT-IADF----ELFWEPTPSSNPTNISNELAPTNISNNHDSPSPNSNPPISRNQSFSPNSSTPENLLAHQNQSNAQNSSPKSMSSNL
          I  +     ++P+ +  F    +LF +  P ++  +    L   +++ +    S     P   ++S +P+   P  L      S  Q +   ++ S +
Subjt:  --ISASEEFFGKLPT-IADF----ELFWEPTPSSNPTNISNELAPTNISNNHDSPSPNSNPPISRNQSFSPNSSTPENLLAHQNQSNAQNSSPKSMSSNL

Query:  ENQIPVPTSNSVPVIPSGNIP-------------GGKSAIQPYQFTYSRRDGKIEEPEDP-HEHEISPVSETELNDIDLPIAIRKGTRSCTQHSISNFLG
          Q P+P     P + + N+              G +S    Y       D     PE+   E     V    ++D  LPIA+RKG R CT H I N++ 
Subjt:  ENQIPVPTSNSVPVIPSGNIP-------------GGKSAIQPYQFTYSRRDGKIEEPEDP-HEHEISPVSETELNDIDLPIAIRKGTRSCTQHSISNFLG

Query:  YAHLSTPMQALVTQLSKESVPATIQEAWQDPKWKKAVMEEMTALEKNGTWQIVTKSKDKKSVGCRWVFTIKYKSDGTIERYKARLVAKGYSQTHGVDYQE
        Y  LS   +A  T L    VP TIQEA +  +WKKAV +E+ ALEKNGTW I      K+ VGC+W+FTIKYK+DG++ER+KARLVA+G++Q++G+DYQE
Subjt:  YAHLSTPMQALVTQLSKESVPATIQEAWQDPKWKKAVMEEMTALEKNGTWQIVTKSKDKKSVGCRWVFTIKYKSDGTIERYKARLVAKGYSQTHGVDYQE

Query:  TFAPVAKINSIRVLLSLAANHDWPLFQLDVKNAFLNGELEEEVYMDLPPGF-NSNGDNKVCLLKKSLYRLKQSPRAWFGRFTKAILNFGYKQAHSDHTLF
        TFAPVAK+N+IR+LLSLA N DW L QLD+KNAFLNG+LEEEVYM++PPGF  S   N+VC L+KSLY LKQSPRAWF RFTKA+L  GYKQ  +DHTLF
Subjt:  TFAPVAKINSIRVLLSLAANHDWPLFQLDVKNAFLNGELEEEVYMDLPPGF-NSNGDNKVCLLKKSLYRLKQSPRAWFGRFTKAILNFGYKQAHSDHTLF

Query:  FKRQHTG-LTILIVYVDDIVLTGDDKTEIQSVKNKLAAEFE-----LKDLGNL--RPVDTPMDPSLKLETKAGDPPVDKGRYQRLVGKLIYLSYTRPDIA
         K+ H G L ILIVYVDDI+L+G+D    +S K  + ++ +     LK+ G L  +P+DTPMD   KL  +    PVD+GRYQRLVG+LIYLS+TRPDI 
Subjt:  FKRQHTG-LTILIVYVDDIVLTGDDKTEIQSVKNKLAAEFE-----LKDLGNL--RPVDTPMDPSLKLETKAGDPPVDKGRYQRLVGKLIYLSYTRPDIA

Query:  FAVSCVSQFMHSPSEIHMQAIHRVLKYLKGTPGLGLLFKKNTHRGVELFTNADWAGSAEDRRSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAELRSLAL
        FAVS VSQFMHSP+E HM+A++R+L+YLK TPG GL F+K  +R  E++++ADWAG+  DRRSTSGYC+FVWGNLVTWRSKKQSVVARSSAEAE R+LA 
Subjt:  FAVSCVSQFMHSPSEIHMQAIHRVLKYLKGTPGLGLLFKKNTHRGVELFTNADWAGSAEDRRSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAELRSLAL

Query:  GICEGLWLKMFLEELDMKTDGPITNYCDNRAAISISHNPVHHDRVKHVELDRHFIKEKVDEGILKI
        GICEG+W+K  L EL   +  PI   CDN+AAISI+ NPVHHDR KHVE+DRHFI EKV    +K+
Subjt:  GICEGLWLKMFLEELDMKTDGPITNYCDNRAAISISHNPVHHDRVKHVELDRHFIKEKVDEGILKI

A5B2S5 Integrase catalytic domain-containing protein2.9e-28641.71Show/hide
Query:  GGESSEVVADSPPLPISINGSSSGPS---SDHHSVAITAFKLNGHNFIPWSRSIQMLIRGKGKIGFIDGSLVQPSTSDPSFSSWDMNNSLVMSWLIHSMD
        G  SS+V + + P      G S  P+   +D   + IT  KLNGHN++ WS+S+ + I GKGK  ++ G  V P T++P F  W + NS++MSWLI+SM+
Subjt:  GGESSEVVADSPPLPISINGSSSGPS---SDHHSVAITAFKLNGHNFIPWSRSIQMLIRGKGKIGFIDGSLVQPSTSDPSFSSWDMNNSLVMSWLIHSMD

Query:  SCIGELYLLYSTAKAIWDAVNLAYSDLEDSSQMFSLRTRIRNLRQETGTVTNYFHQLTRLWQELDLFQQQPWKDPLDGTLYRQMLAKERIYDFLAGLNSS
        + IGE +LL+ TAK IWDA    YS  E++S++F + + + + RQ   +VT Y++ LTR WQ+LDLF+   WK   D   YRQ++ ++R++ F  GLN  
Subjt:  SCIGELYLLYSTAKAIWDAVNLAYSDLEDSSQMFSLRTRIRNLRQETGTVTNYFHQLTRLWQELDLFQQQPWKDPLDGTLYRQMLAKERIYDFLAGLNSS

Query:  LDEVRGRILGSKPLPSIDEIFAEVRREEHRKTIMLCS--TPSGSSDASALVA----MHPADRKKRAHTWYDFCNKPYHTREKCWKLHGKPSNGASNGPRA
        LD+VRGRI+G KPLPS+ E F+EVRREE RK +M+ S   P+ + DASAL A        DR+KR   W D+C KP H +E CWKLHGKP++     PR 
Subjt:  LDEVRGRILGSKPLPSIDEIFAEVRREEHRKTIMLCS--TPSGSSDASALVA----MHPADRKKRAHTWYDFCNKPYHTREKCWKLHGKPSNGASNGPRA

Query:  PVDSSHAAAAAKSSLCSPSAAPFNQAQLDHLTKMM----SSSTLTTSLMAHSGLATISDLHCEPWIVDSGASDHITGRIHLLSNFSPCSLPRTVEIANGS
          D     AA   S   P  +PFN+ Q++ L K++    S ST   +  A+ G          PWIVD+GASDH+TG   +L N+ P +   +V IA+GS
Subjt:  PVDSSHAAAAAKSSLCSPSAAPFNQAQLDHLTKMM----SSSTLTTSLMAHSGLATISDLHCEPWIVDSGASDHITGRIHLLSNFSPCSLPRTVEIANGS

Query:  VLRVVGTDTIMFNPDFSLHNVLYVPQFEYNLMSISKLTASQHLTAIFKSNGCLFQDQITGRMIGHAKEHSGLYY------------------------FS
          ++ GT +I    D  L +VL+VP  + NL+SISKL         F  N C+FQD  +G+MIG A+  SGLY                         F+
Subjt:  VLRVVGTDTIMFNPDFSLHNVLYVPQFEYNLMSISKLTASQHLTAIFKSNGCLFQDQITGRMIGHAKEHSGLYY------------------------FS

Query:  GTSSSLIHS------------------TAALFPKLF--RNNESFHCTVCQVSKHQRIPFVSQSYCPSHPFALVHSDVWGPSRISTLRN------------
          S+S ++                    A LFP+LF  +N  S+HC +CQ +KH R  +    Y PS  F+LVHSDVWGPSRI  +              
Subjt:  GTSSSLIHS------------------TAALFPKLF--RNNESFHCTVCQVSKHQRIPFVSQSYCPSHPFALVHSDVWGPSRISTLRN------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------KRQPSKILKFNTPLSVLRSHFPSSR-VFSELELRVFGCKAFVHNLIPARWGISASEE---FFGKLPTIADFELFWEPTPSSNPTNIS---
                   R PS++L F +P  +    FP +    S+L L+VFGC AFVH     R   +       F G  PT   ++ +  PT     T +    
Subjt:  ----------KRQPSKILKFNTPLSVLRSHFPSSR-VFSELELRVFGCKAFVHNLIPARWGISASEE---FFGKLPTIADFELFWEPTPSSNPTNIS---

Query:  --------NELAPTNISNNH---DSPSPNSNPPISRNQSFSPNSSTPENLLAHQNQSNAQNSSPKSMSSNLENQIPVPTSNSVPVIPSGNIP--------
                         N H   +  S     P   ++S +P+   P  L      S  Q +   ++ S +  Q P+P     P + + N+         
Subjt:  --------NELAPTNISNNH---DSPSPNSNPPISRNQSFSPNSSTPENLLAHQNQSNAQNSSPKSMSSNLENQIPVPTSNSVPVIPSGNIP--------

Query:  -----GGKSAIQPYQFTYSRRDGKIEEPEDP-HEHEISPVSETELNDIDLPIAIRKGTRSCTQHSISNFLGYAHLSTPMQALVTQLSKESVPATIQEAWQ
             G +S    Y       D     PE+   E     V    ++D  LPIA+RKG R CT H I N++ Y  LS   +A  T L    VP TIQEA +
Subjt:  -----GGKSAIQPYQFTYSRRDGKIEEPEDP-HEHEISPVSETELNDIDLPIAIRKGTRSCTQHSISNFLGYAHLSTPMQALVTQLSKESVPATIQEAWQ

Query:  DPKWKKAVMEEMTALEKNGTWQIVTKSKDKKSVGCRWVFTIKYKSDGTIERYKARLVAKGYSQTHGVDYQETFAPVAKINSIRVLLSLAANHDWPLFQLD
          +WKKAV +E+ ALEKNGTW I      K+ VGC+W+FTIKYK+DG++ER+KARLVA+G++Q++G+DYQETFAPVAK+N+IR+LLSLA N DW L QLD
Subjt:  DPKWKKAVMEEMTALEKNGTWQIVTKSKDKKSVGCRWVFTIKYKSDGTIERYKARLVAKGYSQTHGVDYQETFAPVAKINSIRVLLSLAANHDWPLFQLD

Query:  VKNAFLNGELEEEVYMDLPPGF-NSNGDNKVCLLKKSLYRLKQSPRAWFGRFTKAILNFGYKQAHSDHTLFFKRQHTG-LTILIVYVDDIVLTGDDKTEI
        +KNAFLNG+LEEEVYM++PPGF  S   N+VC L+KSLY LKQSPRAWF RFTKA+L  GYKQ  +DHTLF K+ H G L ILIVYVDDI+L+G+D  E+
Subjt:  VKNAFLNGELEEEVYMDLPPGF-NSNGDNKVCLLKKSLYRLKQSPRAWFGRFTKAILNFGYKQAHSDHTLFFKRQHTG-LTILIVYVDDIVLTGDDKTEI

Query:  QSVKNKLAAEFELKDLGN------LRPVDTPMDPSLKLETKAGDPPVDKGRYQRLVGKLIYLSYTRPDIAFAVSCVSQFMHSPSEIHMQAIHRVLKYLKG
        Q++K  L+ EFE+KDLGN       +P+DTPMD   KL  +    PVD+GRYQ LVG+LIYLS+TRPDI FAVS VSQFMHSP+E HM+A++R+L+YLK 
Subjt:  QSVKNKLAAEFELKDLGN------LRPVDTPMDPSLKLETKAGDPPVDKGRYQRLVGKLIYLSYTRPDIAFAVSCVSQFMHSPSEIHMQAIHRVLKYLKG

Query:  TPGLGLLFKKNTHRGVELFTNADWAGSAEDRRSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAELRSLALGICEGLWLKMFLEELDMKTDGPITNYCDNR
        TPG GL F+K  +R  E++++ADWAG+  DRRSTSGYC+FVWGNLVTWRSKKQSVVARSSAEAE R+LA GICEG+W+K  L EL   +  PI   CDN+
Subjt:  TPGLGLLFKKNTHRGVELFTNADWAGSAEDRRSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAELRSLALGICEGLWLKMFLEELDMKTDGPITNYCDNR

Query:  AAISISHNPVHHDRVKHVELDRHFIKEKVDEGILKI
        AAISI+ NPVHHDR KHVE+DRHFI EKV    +K+
Subjt:  AAISISHNPVHHDRVKHVELDRHFIKEKVDEGILKI

W9SCZ3 Non-specific serine/threonine protein kinase6.4e-28641.41Show/hide
Query:  SINGSSSGPSSDHHSVAITAFKLNGHNFIPWSRSIQMLIRGKGKIGFIDGSLVQPSTSDPSFSSWDMNNSLVMSWLIHSMDSCIGELYLLYSTAKAIWDA
        ++ GS S P      + IT  KLNGHN++ WS+S+ + I GKGK  ++ G    P T+D  F  W + NS++MSWLI+SM++ IGE +LL+ TAK IWDA
Subjt:  SINGSSSGPSSDHHSVAITAFKLNGHNFIPWSRSIQMLIRGKGKIGFIDGSLVQPSTSDPSFSSWDMNNSLVMSWLIHSMDSCIGELYLLYSTAKAIWDA

Query:  VNLAYSDLEDSSQMFSLRTRIRNLRQETGTVTNYFHQLTRLWQELDLFQQQPWKDPLDGTLYRQMLAKERIYDFLAGLNSSLDEVRGRILGSKPLPSIDE
            YS  E++S++F + + + + RQ   +VT Y+  L R WQ+LDLF+   WK P D   YRQ++ ++R++ F  GLN  LD+VRGRI+G+KPLPS+ E
Subjt:  VNLAYSDLEDSSQMFSLRTRIRNLRQETGTVTNYFHQLTRLWQELDLFQQQPWKDPLDGTLYRQMLAKERIYDFLAGLNSSLDEVRGRILGSKPLPSIDE

Query:  IFAEVRREEHRKTIMLCSTPSGSS--DASALVA----MHPADRKKRAHTWYDFCNKPYHTREKCWKLHGKPSNGASNGPRAPVDSSHAAAAAKSSLCSPS
         F+EVRREE RK +M+ S    +S  DASAL       +  D +KR   W D+C K  H +E CWKLHGKP++     PR   DS    A+   S   P 
Subjt:  IFAEVRREEHRKTIMLCSTPSGSS--DASALVA----MHPADRKKRAHTWYDFCNKPYHTREKCWKLHGKPSNGASNGPRAPVDSSHAAAAAKSSLCSPS

Query:  AAPFNQAQLDHLTKMMSSSTLTTSLMAHSGLATISDLH---------CEPWIVDSGASDHITGRIHLLSNFSPCSLPRTVEIANGSVLRVVGTDTIMFNP
         +PFN+ Q++ L K+ S   + +  +  +GL   +D H           PWIVD+GASDH+TG   LL N+ P S   +V IA+GS  ++ GT +I    
Subjt:  AAPFNQAQLDHLTKMMSSSTLTTSLMAHSGLATISDLH---------CEPWIVDSGASDHITGRIHLLSNFSPCSLPRTVEIANGSVLRVVGTDTIMFNP

Query:  DFSLHNVLYVPQFEYNLMSISKLTASQHLTAIFKSNGCLFQDQITGRMIGHAKEHSGLYYFSGTSSS---------------------------------
        +  L +VL+VP  + NL+SISKL         F  N C+FQD  +G+MIG A+  SGLY  S   SS                                 
Subjt:  DFSLHNVLYVPQFEYNLMSISKLTASQHLTAIFKSNGCLFQDQITGRMIGHAKEHSGLYYFSGTSSS---------------------------------

Query:  LIH---------STAALFPKLF--RNNESFHCTVCQVSKHQRIPFVSQSYCPSHPFALVHSDVWGPSRI-------------------------------
        L+H           A LFPKLF  +N  SFHC +CQ++KH R  +    Y PS  F+L+HSDVWGPSRI                               
Subjt:  LIH---------STAALFPKLF--RNNESFHCTVCQVSKHQRIPFVSQSYCPSHPFALVHSDVWGPSRI-------------------------------

Query:  -------------------------------------------------------------------------------------------STLRNKRQP
                                                                                                   +T    R P
Subjt:  -------------------------------------------------------------------------------------------STLRNKRQP

Query:  SKILKFNTPLSVLRSHFPSSR-VFSELELRVFGCKAFVH-------------------NLIPARWGISA----SEEFFGKLPTIADFELFWEPT-----P
        S++L F +P  +L  +FP +R V S+L  +VFGC AFVH                      P + G       S+ F+  +       +F+ P       
Subjt:  SKILKFNTPLSVLRSHFPSSR-VFSELELRVFGCKAFVH-------------------NLIPARWGISA----SEEFFGKLPTIADFELFWEPT-----P

Query:  SSNPTNISNELAPTNISNNHDSPSPNSNPPISRNQSFSPNSSTPENLLAHQNQSNAQNSSPKSMSSNLENQIPVPTSNSVPVIPSGNIPG--GKSAIQPY
        S N   I   +  +  S++ +SP P+   PI        +SSTP  L          +  P ++SS      PVP  +  P + + N+     +   Q  
Subjt:  SSNPTNISNELAPTNISNNHDSPSPNSNPPISRNQSFSPNSSTPENLLAHQNQSNAQNSSPKSMSSNLENQIPVPTSNSVPVIPSGNIPG--GKSAIQPY

Query:  QFTYSRRDGK-IEEPEDPHEHEI-----SPVSETELNDIDLPIAIRKGTRSCTQHSISNFLGYAHLSTPMQALVTQLSKESVPATIQEAWQDPKWKKAVM
        +       G+ I+    P E  +       VS   ++D  LPIA+RKG R CT H I N++ Y  LS   +A  T L    +P+TI EA Q+ +WKKAV 
Subjt:  QFTYSRRDGK-IEEPEDPHEHEI-----SPVSETELNDIDLPIAIRKGTRSCTQHSISNFLGYAHLSTPMQALVTQLSKESVPATIQEAWQDPKWKKAVM

Query:  EEMTALEKNGTWQIVTKSKDKKSVGCRWVFTIKYKSDGTIERYKARLVAKGYSQTHGVDYQETFAPVAKINSIRVLLSLAANHDWPLFQLDVKNAFLNGE
        +E+ ALEKNGTW I      K+ VGC+W+FTIKYK+DG++ER+KARLVA+G++Q++G+DYQETFAPVAK+N+IR+LLSLA N DW L QLD+KNAFLNG+
Subjt:  EEMTALEKNGTWQIVTKSKDKKSVGCRWVFTIKYKSDGTIERYKARLVAKGYSQTHGVDYQETFAPVAKINSIRVLLSLAANHDWPLFQLDVKNAFLNGE

Query:  LEEEVYMDLPPGF-NSNGDNKVCLLKKSLYRLKQSPRAWFGRFTKAILNFGYKQAHSDHTLFFKRQHT-GLTILIVYVDDIVLTGDDKTEIQSVKNKLAA
        LEEEVYM++PPGF  S   N+VC L+KSLY LKQSPRAWF RFTKA+L  GY Q  SDHTLF K+ H   + ILIVYVDDI+L+G+D  E+Q +K  L+ 
Subjt:  LEEEVYMDLPPGF-NSNGDNKVCLLKKSLYRLKQSPRAWFGRFTKAILNFGYKQAHSDHTLFFKRQHT-GLTILIVYVDDIVLTGDDKTEIQSVKNKLAA

Query:  EFELKDLGNL-----------------------------------RPVDTPMDPSLKLETKAGDPPVDKGRYQRLVGKLIYLSYTRPDIAFAVSCVSQFM
        EFE+KDLGNL                                   +PVDTPMD   KL T+    PVD+GRYQRLVG+LIYLS+TRPDI FAVS VSQFM
Subjt:  EFELKDLGNL-----------------------------------RPVDTPMDPSLKLETKAGDPPVDKGRYQRLVGKLIYLSYTRPDIAFAVSCVSQFM

Query:  HSPSEIHMQAIHRVLKYLKGTPGLGLLFKKNTHRGVELFTNADWAGSAEDRRSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAELRSLALGICEGLWLKM
        HSP+E HM+A++RVL+YLK TPG GL F K  +R  E++++ADWAG+  DRRSTSGYC+FVWGNLVTWRSKKQSVVARSSAEAE R+LA GICEG+W++ 
Subjt:  HSPSEIHMQAIHRVLKYLKGTPGLGLLFKKNTHRGVELFTNADWAGSAEDRRSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAELRSLALGICEGLWLKM

Query:  FLEELDMKTDGPITNYCDNRAAISISHNPVHHDRVKHVELDRHFIKEKV
         L EL   +  PI   CDN+AAISI+ NPVHHDR KHVE+DRHFI EKV
Subjt:  FLEELDMKTDGPITNYCDNRAAISISHNPVHHDRVKHVELDRHFIKEKV

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.0e-8036.02Show/hide
Query:  VPATIQE-AWQDPK--WKKAVMEEMTALEKNGTWQIVTKSKDKKSVGCRWVFTIKYKSDGTIERYKARLVAKGYSQTHGVDYQETFAPVAKINSIRVLLS
        VP +  E  ++D K  W++A+  E+ A + N TW I  + ++K  V  RWVF++KY   G   RYKARLVA+G++Q + +DY+ETFAPVA+I+S R +LS
Subjt:  VPATIQE-AWQDPK--WKKAVMEEMTALEKNGTWQIVTKSKDKKSVGCRWVFTIKYKSDGTIERYKARLVAKGYSQTHGVDYQETFAPVAKINSIRVLLS

Query:  LAANHDWPLFQLDVKNAFLNGELEEEVYMDLPPGFNSNGDNKVCLLKKSLYRLKQSPRAWFGRFTKAILNFGYKQAHSDHTLFF--KRQHTGLTILIVYV
        L   ++  + Q+DVK AFLNG L+EE+YM LP G + N DN VC L K++Y LKQ+ R WF  F +A+    +  +  D  ++   K        +++YV
Subjt:  LAANHDWPLFQLDVKNAFLNGELEEEVYMDLPPGFNSNGDNKVCLLKKSLYRLKQSPRAWFGRFTKAILNFGYKQAHSDHTLFF--KRQHTGLTILIVYV

Query:  DDIVLTGDDKTEIQSVKNKLAAEFELKDLG-----------------------------------NLRPVDTPMDPSLKLETKAGDPPVDKGRYQRLVGK
        DD+V+   D T + + K  L  +F + DL                                    N   V TP+   +  E    D   +    + L+G 
Subjt:  DDIVLTGDDKTEIQSVKNKLAAEFELKDLG-----------------------------------NLRPVDTPMDPSLKLETKAGDPPVDKGRYQRLVGK

Query:  LIYLSY-TRPDIAFAVSCVSQFMHSPSEIHMQAIHRVLKYLKGTPGLGLLFKKNT--HRGVELFTNADWAGSAEDRRSTSGYCTFVWG-NLVTWRSKKQS
        L+Y+   TRPD+  AV+ +S++    +    Q + RVL+YLKGT  + L+FKKN      +  + ++DWAGS  DR+ST+GY   ++  NL+ W +K+Q+
Subjt:  LIYLSY-TRPDIAFAVSCVSQFMHSPSEIHMQAIHRVLKYLKGTPGLGLLFKKNT--HRGVELFTNADWAGSAEDRRSTSGYCTFVWG-NLVTWRSKKQS

Query:  VVARSSAEAELRSLALGICEGLWLKMFLEELDMKTDGPITNYCDNRAAISISHNPVHHDRVKHVELDRHFIKEKVDEGILKIQ
         VA SS EAE  +L   + E LWLK  L  +++K + PI  Y DN+  ISI++NP  H R KH+++  HF +E+V   ++ ++
Subjt:  VVARSSAEAELRSLALGICEGLWLKMFLEELDMKTDGPITNYCDNRAAISISHNPVHHDRVKHVELDRHFIKEKVDEGILKIQ

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.5e-8738.41Show/hide
Query:  LSKESVPATIQEAWQDP---KWKKAVMEEMTALEKNGTWQIVTKSKDKKSVGCRWVFTIKYKSDGTIERYKARLVAKGYSQTHGVDYQETFAPVAKINSI
        +S +  P +++E    P   +  KA+ EEM +L+KNGT+++V   K K+ + C+WVF +K   D  + RYKARLV KG+ Q  G+D+ E F+PV K+ SI
Subjt:  LSKESVPATIQEAWQDP---KWKKAVMEEMTALEKNGTWQIVTKSKDKKSVGCRWVFTIKYKSDGTIERYKARLVAKGYSQTHGVDYQETFAPVAKINSI

Query:  RVLLSLAANHDWPLFQLDVKNAFLNGELEEEVYMDLPPGFNSNG-DNKVCLLKKSLYRLKQSPRAWFGRFTKAILNFGYKQAHSDHTLFFKR-QHTGLTI
        R +LSLAA+ D  + QLDVK AFL+G+LEEE+YM+ P GF   G  + VC L KSLY LKQ+PR W+ +F   + +  Y + +SD  ++FKR       I
Subjt:  RVLLSLAANHDWPLFQLDVKNAFLNGELEEEVYMDLPPGFNSNG-DNKVCLLKKSLYRLKQSPRAWFGRFTKAILNFGYKQAHSDHTLFFKR-QHTGLTI

Query:  LIVYVDDIVLTGDDKTEIQSVKNKLAAEFELKDLG-------------------------------------NLRPVDTPMDPSLKLETKAGDPPVD---
        L++YVDD+++ G DK  I  +K  L+  F++KDLG                                     N +PV TP+   LKL  K     V+   
Subjt:  LIVYVDDIVLTGDDKTEIQSVKNKLAAEFELKDLG-------------------------------------NLRPVDTPMDPSLKLETKAGDPPVD---

Query:  ---KGRYQRLVGKLIY-LSYTRPDIAFAVSCVSQFMHSPSEIHMQAIHRVLKYLKGTPGLGLLFKKNTHRGVELFTNADWAGSAEDRRSTSGYCTFVWGN
           K  Y   VG L+Y +  TRPDIA AV  VS+F+ +P + H +A+  +L+YL+GT G  L F   +   ++ +T+AD AG  ++R+S++GY     G 
Subjt:  ---KGRYQRLVGKLIY-LSYTRPDIAFAVSCVSQFMHSPSEIHMQAIHRVLKYLKGTPGLGLLFKKNTHRGVELFTNADWAGSAEDRRSTSGYCTFVWGN

Query:  LVTWRSKKQSVVARSSAEAELRSLALGICEGLWLKMFLEELDMKTDGPITNYCDNRAAISISHNPVHHDRVKHVELDRHFIKEKVDEGILKI
         ++W+SK Q  VA S+ EAE  +      E +WLK FL+EL +     +  YCD+++AI +S N ++H R KH+++  H+I+E VD+  LK+
Subjt:  LVTWRSKKQSVVARSSAEAELRSLALGICEGLWLKMFLEELDMKTDGPITNYCDNRAAISISHNPVHHDRVKHVELDRHFIKEKVDEGILKI

P92519 Uncharacterized mitochondrial protein AtMg008102.4e-3235.71Show/hide
Query:  LIVYVDDIVLTGDDKTEIQSVKNKLAAEFELKDLG-----------------------------------NLRPVDTPMDPSLKLETKAGDPPVDKGRYQ
        L++YVDDI+LTG   T +  +  +L++ F +KDLG                                   + +P+ TP+   L         P D   ++
Subjt:  LIVYVDDIVLTGDDKTEIQSVKNKLAAEFELKDLG-----------------------------------NLRPVDTPMDPSLKLETKAGDPPVDKGRYQ

Query:  RLVGKLIYLSYTRPDIAFAVSCVSQFMHSPSEIHMQAIHRVLKYLKGTPGLGLLFKKNTHRGVELFTNADWAGSAEDRRSTSGYCTFVWGNLVTWRSKKQ
         +VG L YL+ TRPDI++AV+ V Q MH P+      + RVL+Y+KGT   GL   KN+   V+ F ++DWAG    RRST+G+CTF+  N+++W +K+Q
Subjt:  RLVGKLIYLSYTRPDIAFAVSCVSQFMHSPSEIHMQAIHRVLKYLKGTPGLGLLFKKNTHRGVELFTNADWAGSAEDRRSTSGYCTFVWGNLVTWRSKKQ

Query:  SVVARSSAEAELRSLALGICEGLW
          V+RSS E E R+LAL   E  W
Subjt:  SVVARSSAEAELRSLALGICEGLW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.9e-11726.3Show/hide
Query:  KLNGHNFIPWSRSIQMLIRGKGKIGFIDGSLVQPSTS---------DPSFSSWDMNNSLVMSWLIHSMDSCIGELYLLYSTAKAIWDAVNLAYSDLEDSS
        KL   N++ WSR +  L  G    GF+DGS   P  +         +P ++ W   + L+ S ++ ++   +       +TA  IW+ +   Y++     
Subjt:  KLNGHNFIPWSRSIQMLIRGKGKIGFIDGSLVQPSTS---------DPSFSSWDMNNSLVMSWLIHSMDSCIGELYLLYSTAKAIWDAVNLAYSDLEDSS

Query:  QMFSLRTRIRNLRQETGTVTNYFHQLTRLWQELDLFQQQPWKDPLDGTLYRQMLAKERIYDFLAGLNSSLDEVRGRILGSKPLPSIDEIFAEVRREEHRK
         +  LRT+++   + T T+ +Y   L   + +L L             L + M   E++   L  L      V  +I      P++ EI   +   E  K
Subjt:  QMFSLRTRIRNLRQETGTVTNYFHQLTRLWQELDLFQQQPWKDPLDGTLYRQMLAKERIYDFLAGLNSSLDEVRGRILGSKPLPSIDEIFAEVRREEHRK

Query:  TIMLCSTPSGSSDASALVAMHPA----DRKKRAHTWYDFCNKPYHTREKCWKLHGKPSNGASNGPRAPVDSSHAAAAAKSSLCSPSAAPFNQ-AQLDHLT
         + + S       A+A+   +      +     +  YD  N+  +   K W+         S+    P ++       K  +C        + +QL H  
Subjt:  TIMLCSTPSGSSDASALVAMHPA----DRKKRAHTWYDFCNKPYHTREKCWKLHGKPSNGASNGPRAPVDSSHAAAAAKSSLCSPSAAPFNQ-AQLDHLT

Query:  KMMSSS---TLTTSLMAHSGLATISDLHCEPWIVDSGASDHITGRIHLLSNFSPCSLPRTVEIANGSVLRVVGTDTIMFNP---DFSLHNVLYVPQFEYN
          ++S    +  T     + LA  S      W++DSGA+ HIT   + LS   P +    V +A+GS + +  T +   +      +LHN+LYVP    N
Subjt:  KMMSSS---TLTTSLMAHSGLATISDLHCEPWIVDSGASDHITGRIHLLSNFSPCSLPRTVEIANGSVLRVVGTDTIMFNP---DFSLHNVLYVPQFEYN

Query:  LMSISKLTASQHLTAIFKSNGCLFQDQITGRMIGHAKEHSGLY-----------YFSGTSSSLIHST---------AALFPKLFRN---------NESFH
        L+S+ +L  +  ++  F       +D  TG  +   K    LY            F+  SS   HS+          ++   +  N         ++   
Subjt:  LMSISKLTASQHLTAIFKSNGCLFQDQITGRMIGHAKEHSGLY-----------YFSGTSSSLIHST---------AALFPKLFRN---------NESFH

Query:  CTVCQVSKHQRIPFVSQSYCPSHPFALVHSDVWGPSRIS---------------------TLRNKRQPSKIL---------KFNT---------------
        C+ C ++K  ++PF   +   + P   ++SDVW    +S                      L+ K Q  +           +F T               
Subjt:  CTVCQVSKHQRIPFVSQSYCPSHPFALVHSDVWGPSRIS---------------------TLRNKRQPSKIL---------KFNT---------------

Query:  -----------------------------------------------------------------PLSVLRSHFPSSRVFSEL----ELRVFGCKAF---
                                                                         P  +L+   P  ++F       +LRVFGC  +   
Subjt:  -----------------------------------------------------------------PLSVLRSHFPSSRVFSEL----ELRVFGCKAF---

Query:  ----------------------------VHNLIPARWGISASEE-------FFGKLPTIA-------DFELFWEPTPSSNPTNISNELAPTNISNNHDSP
                                      +L  +R  IS           F   L T++       +    W P  ++ PT      AP + S+ H + 
Subjt:  ----------------------------VHNLIPARWGISASEE-------FFGKLPTIA-------DFELFWEPTPSSNPTNISNELAPTNISNNHDSP

Query:  SPNSNP--PISRNQSFSPN---------SSTPENLLAHQN--QSNAQNSSPKSMSSNLENQIPVPTSNSVP--VIPSGNIPGGKSAIQPYQFTYSRRDGK
        +P S+P  P   +Q  S N          S+PE     QN  Q   Q +  ++ + + +N      +N  P  +  S + P   S+  P   T +     
Subjt:  SPNSNP--PISRNQSFSPN---------SSTPENLLAHQN--QSNAQNSSPKSMSSNLENQIPVPTSNSVP--VIPSGNIPGGKSAIQPYQFTYSRRDGK

Query:  IEEPEDPHEHEISPVSETELNDIDLPIAIRK-GTRSCTQHSISNFLGYAHLSTPMQALVTQLSKESVPATIQEAWQDPKWKKAVMEEMTALEKNGTWQIV
           P     H   P+++   N+   P+     GTR+       N         P  +L   L+ ES P T  +A +D +W+ A+  E+ A   N TW +V
Subjt:  IEEPEDPHEHEISPVSETELNDIDLPIAIRK-GTRSCTQHSISNFLGYAHLSTPMQALVTQLSKESVPATIQEAWQDPKWKKAVMEEMTALEKNGTWQIV

Query:  TKSKDKKS-VGCRWVFTIKYKSDGTIERYKARLVAKGYSQTHGVDYQETFAPVAKINSIRVLLSLAANHDWPLFQLDVKNAFLNGELEEEVYMDLPPGF-
               + VGCRW+FT KY SDG++ RYKARLVAKGY+Q  G+DY ETF+PV K  SIR++L +A +  WP+ QLDV NAFL G L ++VYM  PPGF 
Subjt:  TKSKDKKS-VGCRWVFTIKYKSDGTIERYKARLVAKGYSQTHGVDYQETFAPVAKINSIRVLLSLAANHDWPLFQLDVKNAFLNGELEEEVYMDLPPGF-

Query:  NSNGDNKVCLLKKSLYRLKQSPRAWFGRFTKAILNFGYKQAHSDHTLFFKRQHTGLTILIVYVDDIVLTGDDKTEIQSVKNKLAAEFELKDLGNL-----
        + +  N VC L+K+LY LKQ+PRAW+      +L  G+  + SD +LF  ++   +  ++VYVDDI++TG+D T + +  + L+  F +KD   L     
Subjt:  NSNGDNKVCLLKKSLYRLKQSPRAWFGRFTKAILNFGYKQAHSDHTLFFKRQHTGLTILIVYVDDIVLTGDDKTEIQSVKNKLAAEFELKDLGNL-----

Query:  ------------------------------RPVDTPMDPSLKLETKAGDPPVDKGRYQRLVGKLIYLSYTRPDIAFAVSCVSQFMHSPSEIHMQAIHRVL
                                      +PV TPM PS KL   +G    D   Y+ +VG L YL++TRPDI++AV+ +SQFMH P+E H+QA+ R+L
Subjt:  ------------------------------RPVDTPMDPSLKLETKAGDPPVDKGRYQRLVGKLIYLSYTRPDIAFAVSCVSQFMHSPSEIHMQAIHRVL

Query:  KYLKGTPGLGLLFKKNTHRGVELFTNADWAGSAEDRRSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAELRSLALGICEGLWLKMFLEELDMKTDGPITN
        +YL GTP  G+  KK     +  +++ADWAG  +D  ST+GY  ++  + ++W SKKQ  V RSS EAE RS+A    E  W+   L EL ++   P   
Subjt:  KYLKGTPGLGLLFKKNTHRGVELFTNADWAGSAEDRRSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAELRSLALGICEGLWLKMFLEELDMKTDGPITN

Query:  YCDNRAAISISHNPVHHDRVKHVELDRHFIKEKVDEGILKI
        YCDN  A  +  NPV H R+KH+ +D HFI+ +V  G L++
Subjt:  YCDNRAAISISHNPVHHDRVKHVELDRHFIKEKVDEGILKI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.2e-10926.24Show/hide
Query:  KLNGHNFIPWSRSIQMLIRGKGKIGFIDGSLVQPSTS---------DPSFSSWDMNNSLVMSWLIHSMDSCIGELYLLYSTAKAIWDAVNLAYSDLEDSS
        KL   N++ WSR +  L  G    GF+DGS   P  +         +P ++ W   + L+ S ++ ++   +       +TA  IW+ +   Y     ++
Subjt:  KLNGHNFIPWSRSIQMLIRGKGKIGFIDGSLVQPSTS---------DPSFSSWDMNNSLVMSWLIHSMDSCIGELYLLYSTAKAIWDAVNLAYSDLEDSS

Query:  QMFSLRTRIRNLRQETGTVTNYFHQLTRLWQELDLFQQQPWKDPLDGTLYRQMLAKERIYDFLAGLNSSLDEVRGRILGSKPLPSIDEIFAEVRREEHRK
          +   T++R + +        F QL  L + +D                      E++   L  L      V  +I      PS+ EI   +   E  K
Subjt:  QMFSLRTRIRNLRQETGTVTNYFHQLTRLWQELDLFQQQPWKDPLDGTLYRQMLAKERIYDFLAGLNSSLDEVRGRILGSKPLPSIDEIFAEVRREEHRK

Query:  TIMLCSTPSGSSDASALVAMHPADRKKRAHTWYDFCNKPYHTREKCWKLHGKPSNGASNGPRAPVDSSHAAAAAKSSLCSPSAAPFNQA-QLDHLTKMMS
         + L S       A+ +   +    + + +   +      + R   W+     S+G+ +  R P    +       S+   SA    Q  Q    T    
Subjt:  TIMLCSTPSGSSDASALVAMHPADRKKRAHTWYDFCNKPYHTREKCWKLHGKPSNGASNGPRAPVDSSHAAAAAKSSLCSPSAAPFNQA-QLDHLTKMMS

Query:  SSTLTTSLMAHSGLATISDLHCEPWIVDSGASDHITGRIHLLSNFSPCSLPRTVEIANGSVLRVVGTDTIMF---NPDFSLHNVLYVPQFEYNLMSISKL
        S++  T     + LA  S  +   W++DSGA+ HIT   + LS   P +    V IA+GS + +  T +      +    L+ VLYVP    NL+S+ +L
Subjt:  SSTLTTSLMAHSGLATISDLHCEPWIVDSGASDHITGRIHLLSNFSPCSLPRTVEIANGSVLRVVGTDTIMF---NPDFSLHNVLYVPQFEYNLMSISKL

Query:  TASQHLTAIFKSNGCLFQDQITGRMIGHAKEHSGLYYFSGTSSSLI--------------------HSTAAL---------FPKLFRNNESFHCTVCQVS
          +  ++  F       +D  TG  +   K    LY +   SS  +                    H + A+          P L  +++   C+ C ++
Subjt:  TASQHLTAIFKSNGCLFQDQITGRMIGHAKEHSGLYYFSGTSSSLI--------------------HSTAAL---------FPKLFRNNESFHCTVCQVS

Query:  KHQRIPFVSQSYCPSHPFALVHSDVWGP------------------SRISTLRNKRQPSKIL------------KFNTPLSVLRS---------------
        K  ++PF + +   S P   ++SDVW                    +R + L   +Q S++             +F T +  L S               
Subjt:  KHQRIPFVSQSYCPSHPFALVHSDVWGP------------------SRISTLRNKRQPSKIL------------KFNTPLSVLRS---------------

Query:  ----------HFPSSRVFSELELR---VFGCKAFVHNLIP-ARWGISAS--------------------EEFFGKLPTIADFELF------W-EPTPSSN
                  H P     SE + R     G     H  +P   W  + S                    ++ FG+ P     ++F      W  P     
Subjt:  ----------HFPSSRVFSELELR---VFGCKAFVHNLIP-ARWGISAS--------------------EEFFGKLPTIADFELF------W-EPTPSSN

Query:  PTNISNELA----------------PTN--ISNNHDS------PSPNSNPPISRNQSFSPNSS-----------------TPENLLAHQNQSNAQNSSPK
          + S + A                PT    ++ H        P   +N  +S +Q    +S+                  P  L  H + S    SSP 
Subjt:  PTNISNELA----------------PTN--ISNNHDS------PSPNSNPPISRNQSFSPNSS-----------------TPENLLAHQNQSNAQNSSPK

Query:  SM------SSNLENQIPVPTSNSVPVIPSGNIPGGKSAIQPYQFTYSRRDGKI--------EEPEDPHEHEISPVSETELNDIDLPIAIRKGTRSCTQHS
         +      SSNL +      S+S P  PS N  G +   QP+Q   S  +  I          P  P+++   P S      I  P        S +  S
Subjt:  SM------SSNLENQIPVPTSNSVPVIPSGNIPGGKSAIQPYQFTYSRRDGKI--------EEPEDPHEHEISPVSETELNDIDLPIAIRKGTRSCTQHS

Query:  IS--------------NFLGYAHLSTPMQA---------------LVTQLSKESVPATIQEAWQDPKWKKAVMEEMTALEKNGTWQIVTKSKDKKS-VGC
         S                   A ++T   A                 T L+  S P T  +A +D +W++A+  E+ A   N TW +V       + VGC
Subjt:  IS--------------NFLGYAHLSTPMQA---------------LVTQLSKESVPATIQEAWQDPKWKKAVMEEMTALEKNGTWQIVTKSKDKKS-VGC

Query:  RWVFTIKYKSDGTIERYKARLVAKGYSQTHGVDYQETFAPVAKINSIRVLLSLAANHDWPLFQLDVKNAFLNGELEEEVYMDLPPGF-NSNGDNKVCLLK
        RW+FT K+ SDG++ RYKARLVAKGY+Q  G+DY ETF+PV K  SIR++L +A +  WP+ QLDV NAFL G L +EVYM  PPGF + +  + VC L+
Subjt:  RWVFTIKYKSDGTIERYKARLVAKGYSQTHGVDYQETFAPVAKINSIRVLLSLAANHDWPLFQLDVKNAFLNGELEEEVYMDLPPGF-NSNGDNKVCLLK

Query:  KSLYRLKQSPRAWFGRFTKAILNFGYKQAHSDHTLFFKRQHTGLTILIVYVDDIVLTGDDKTEIQSVKNKLAAEFELKDLGNL-----------------
        K++Y LKQ+PRAW+      +L  G+  + SD +LF  ++   +  ++VYVDDI++TG+D   ++   + L+  F +K+  +L                 
Subjt:  KSLYRLKQSPRAWFGRFTKAILNFGYKQAHSDHTLFFKRQHTGLTILIVYVDDIVLTGDDKTEIQSVKNKLAAEFELKDLGNL-----------------

Query:  ------------------RPVDTPMDPSLKLETKAGDPPVDKGRYQRLVGKLIYLSYTRPDIAFAVSCVSQFMHSPSEIHMQAIHRVLKYLKGTPGLGLL
                          +PV TPM  S KL   +G    D   Y+ +VG L YL++TRPD+++AV+ +SQ+MH P++ H  A+ RVL+YL GTP  G+ 
Subjt:  ------------------RPVDTPMDPSLKLETKAGDPPVDKGRYQRLVGKLIYLSYTRPDIAFAVSCVSQFMHSPSEIHMQAIHRVLKYLKGTPGLGLL

Query:  FKKNTHRGVELFTNADWAGSAEDRRSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAELRSLALGICEGLWLKMFLEELDMKTDGPITNYCDNRAAISISH
         KK     +  +++ADWAG  +D  ST+GY  ++  + ++W SKKQ  V RSS EAE RS+A    E  W+   L EL ++   P   YCDN  A  +  
Subjt:  FKKNTHRGVELFTNADWAGSAEDRRSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAELRSLALGICEGLWLKMFLEELDMKTDGPITNYCDNRAAISISH

Query:  NPVHHDRVKHVELDRHFIKEKVDEGILKI
        NPV H R+KH+ LD HFI+ +V  G L++
Subjt:  NPVHHDRVKHVELDRHFIKEKVDEGILKI

Arabidopsis top hitse value%identityAlignment
AT1G20890.1 unknown protein8.1e-3946.94Show/hide
Query:  KVLNVGEELWRETLPLQMGSRLYHLQGLKPDTWYEVKISHPGSIPASFSLE-LKRDLSSLVEKQSRKLLDTEKLIFKTESMNLQGDQVDTHVLVTVKSEG
        K L  G+ELW+ETLPL+ GSR+Y LQGLK ++WYEVKIS+P SIPA FSL+ LK  +  L     R+LL+TEKLIFKTES+    ++   +VLVTV+ EG
Subjt:  KVLNVGEELWRETLPLQMGSRLYHLQGLKPDTWYEVKISHPGSIPASFSLE-LKRDLSSLVEKQSRKLLDTEKLIFKTESMNLQGDQVDTHVLVTVKSEG

Query:  VVAIPSVQERESIIFNIAYKTCYLLPLSYAGVILSFACLTATSNVSFISMIVSGFVTVCDELLISIPHKAWGVGVLVIICLILAFIIPSLLPSWLL
        +VAIP+V+ER SII+NI                                        VC+E L+ IPH +W V VLV++CL++A I+P  LPS+LL
Subjt:  VVAIPSVQERESIIFNIAYKTCYLLPLSYAGVILSFACLTATSNVSFISMIVSGFVTVCDELLISIPHKAWGVGVLVIICLILAFIIPSLLPSWLL

AT1G76480.2 unknown protein2.9e-3643.26Show/hide
Query:  DSKVLNVGEELWRETLPLQMGSRLYHLQGLKPDTWYEVKISHPGSIPASFSLE-LKRDLSSLVEKQSRKLLDTEKLIFKTESMNLQGDQVDTHVLVTVKS
        D K L VG+ELWRETLPLQ GSR+Y LQGLK ++WYEVKIS+P SIPA  SL+ LK     L   Q R+LL+TEKLIFK ES+    ++   +VLVT++ 
Subjt:  DSKVLNVGEELWRETLPLQMGSRLYHLQGLKPDTWYEVKISHPGSIPASFSLE-LKRDLSSLVEKQSRKLLDTEKLIFKTESMNLQGDQVDTHVLVTVKS

Query:  EGVVAIPSVQERESIIFNIAYKTCYLLPLSYAGVILSFACLTATSNVSFISMIVSGFVTVCDELLISIPHKAWGVGVLVIICLILAFIIPSLLPSWLLQT
        EG+VAIP+ +ER  II+NI                                        VC+E L+ IP+ +W V +LV +CL+++FI+P  LPS LL  
Subjt:  EGVVAIPSVQERESIIFNIAYKTCYLLPLSYAGVILSFACLTATSNVSFISMIVSGFVTVCDELLISIPHKAWGVGVLVIICLILAFIIPSLLPSWLLQT

Query:  NQTQGPGNHHSSKNS
        +Q     +    K+S
Subjt:  NQTQGPGNHHSSKNS

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.6e-11639.26Show/hide
Query:  TSNSVPVIPSGNIPGGKSAIQPYQFTYSRRDGKIEEPEDPHEHEISPVSETELNDIDLPIAIRKGTRSCTQHSISNFLGYAHLSTPMQALVTQLSKESVP
        +S+S+ ++PS NI       +P   T  RR  K    +D + H ++                     S T H IS FL Y  +S    + +  ++K   P
Subjt:  TSNSVPVIPSGNIPGGKSAIQPYQFTYSRRDGKIEEPEDPHEHEISPVSETELNDIDLPIAIRKGTRSCTQHSISNFLGYAHLSTPMQALVTQLSKESVP

Query:  ATIQEAWQDPKWKKAVMEEMTALEKNGTWQIVTKSKDKKSVGCRWVFTIKYKSDGTIERYKARLVAKGYSQTHGVDYQETFAPVAKINSIRVLLSLAANH
        +T  EA +   W  A+ +E+ A+E   TW+I T   +KK +GC+WV+ IKY SDGTIERYKARLVAKGY+Q  G+D+ ETF+PV K+ S++++L+++A +
Subjt:  ATIQEAWQDPKWKKAVMEEMTALEKNGTWQIVTKSKDKKSVGCRWVFTIKYKSDGTIERYKARLVAKGYSQTHGVDYQETFAPVAKINSIRVLLSLAANH

Query:  DWPLFQLDVKNAFLNGELEEEVYMDLPPGFNS-NGD----NKVCLLKKSLYRLKQSPRAWFGRFTKAILNFGYKQAHSDHTLFFKRQHTGLTILIVYVDD
        ++ L QLD+ NAFLNG+L+EE+YM LPPG+ +  GD    N VC LKKS+Y LKQ+ R WF +F+  ++ FG+ Q+HSDHT F K   T    ++VYVDD
Subjt:  DWPLFQLDVKNAFLNGELEEEVYMDLPPGFNS-NGD----NKVCLLKKSLYRLKQSPRAWFGRFTKAILNFGYKQAHSDHTLFFKRQHTGLTILIVYVDD

Query:  IVLTGDDKTEIQSVKNKLAAEFELKDLGNL-----------------------------------RPVDTPMDPSLKLETKAGDPPVDKGRYQRLVGKLI
        I++  ++   +  +K++L + F+L+DLG L                                   +P   PMDPS+     +G   VD   Y+RL+G+L+
Subjt:  IVLTGDDKTEIQSVKNKLAAEFELKDLGNL-----------------------------------RPVDTPMDPSLKLETKAGDPPVDKGRYQRLVGKLI

Query:  YLSYTRPDIAFAVSCVSQFMHSPSEIHMQAIHRVLKYLKGTPGLGLLFKKNTHRGVELFTNADWAGSAEDRRSTSGYCTFVWGNLVTWRSKKQSVVARSS
        YL  TR DI+FAV+ +SQF  +P   H QA+ ++L Y+KGT G GL +       +++F++A +    + RRST+GYC F+  +L++W+SKKQ VV++SS
Subjt:  YLSYTRPDIAFAVSCVSQFMHSPSEIHMQAIHRVLKYLKGTPGLGLLFKKNTHRGVELFTNADWAGSAEDRRSTSGYCTFVWGNLVTWRSKKQSVVARSS

Query:  AEAELRSLALGICEGLWLKMFLEELDMKTDGPITNYCDNRAAISISHNPVHHDRVKHVELDRHFIKEK
        AEAE R+L+    E +WL  F  EL +    P   +CDN AAI I+ N V H+R KH+E D H ++E+
Subjt:  AEAELRSLALGICEGLWLKMFLEELDMKTDGPITNYCDNRAAISISHNPVHHDRVKHVELDRHFIKEK

ATMG00810.1 DNA/RNA polymerases superfamily protein1.7e-3335.71Show/hide
Query:  LIVYVDDIVLTGDDKTEIQSVKNKLAAEFELKDLG-----------------------------------NLRPVDTPMDPSLKLETKAGDPPVDKGRYQ
        L++YVDDI+LTG   T +  +  +L++ F +KDLG                                   + +P+ TP+   L         P D   ++
Subjt:  LIVYVDDIVLTGDDKTEIQSVKNKLAAEFELKDLG-----------------------------------NLRPVDTPMDPSLKLETKAGDPPVDKGRYQ

Query:  RLVGKLIYLSYTRPDIAFAVSCVSQFMHSPSEIHMQAIHRVLKYLKGTPGLGLLFKKNTHRGVELFTNADWAGSAEDRRSTSGYCTFVWGNLVTWRSKKQ
         +VG L YL+ TRPDI++AV+ V Q MH P+      + RVL+Y+KGT   GL   KN+   V+ F ++DWAG    RRST+G+CTF+  N+++W +K+Q
Subjt:  RLVGKLIYLSYTRPDIAFAVSCVSQFMHSPSEIHMQAIHRVLKYLKGTPGLGLLFKKNTHRGVELFTNADWAGSAEDRRSTSGYCTFVWGNLVTWRSKKQ

Query:  SVVARSSAEAELRSLALGICEGLW
          V+RSS E E R+LAL   E  W
Subjt:  SVVARSSAEAELRSLALGICEGLW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.2e-1942.59Show/hide
Query:  LVTQLSKESVPATIQEAWQDPKWKKAVMEEMTALEKNGTWQIVTKSKDKKSVGCRWVFTIKYKSDGTIERYKARLVAKGYSQTHGVDYQETFAPVAKINS
        + T + KE  P ++  A +DP W +A+ EE+ AL +N TW +V    ++  +GC+WVF  K  SDGT++R KARLVAKG+ Q  G+ + ET++PV +  +
Subjt:  LVTQLSKESVPATIQEAWQDPKWKKAVMEEMTALEKNGTWQIVTKSKDKKSVGCRWVFTIKYKSDGTIERYKARLVAKGYSQTHGVDYQETFAPVAKINS

Query:  IRVLLSLA
        IR +L++A
Subjt:  IRVLLSLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGTCGGCTCTAATGGTGGTGAATCTTCTGAGGTGGTGGCTGACTCTCCTCCTCTTCCAATCTCCATTAATGGCTCTTCTTCGGGTCCTTCATCTGACCATCACTC
TGTGGCTATTACTGCTTTTAAGCTTAATGGCCATAATTTTATTCCCTGGTCTAGATCCATTCAAATGTTGATTCGTGGCAAGGGAAAAATTGGGTTTATTGATGGTTCTC
TTGTTCAGCCTTCAACCTCTGACCCCTCGTTTTCTTCATGGGACATGAACAACTCTCTTGTGATGTCTTGGTTGATTCATTCCATGGATTCTTGTATTGGTGAATTGTAT
CTTCTTTATTCAACAGCTAAAGCTATTTGGGATGCTGTTAATCTCGCCTATTCAGACCTTGAAGACTCATCACAGATGTTCTCGCTTCGCACTCGTATTCGCAACCTTCG
CCAAGAAACTGGTACGGTCACAAACTACTTTCACCAACTTACTCGATTGTGGCAAGAACTTGACCTCTTTCAGCAGCAACCATGGAAGGACCCACTTGATGGTACTTTAT
ACCGTCAGATGCTTGCTAAGGAGAGAATCTATGATTTCCTTGCTGGTCTTAACTCTTCTCTCGATGAAGTCCGTGGTCGTATACTCGGTTCGAAACCTCTTCCAAGTATT
GATGAGATATTTGCTGAAGTAAGACGAGAAGAACACCGAAAGACTATTATGCTTTGCTCTACACCTTCTGGTTCATCTGATGCTTCGGCTCTGGTTGCTATGCATCCTGC
TGATCGCAAAAAAAGAGCTCATACTTGGTATGATTTCTGCAATAAGCCTTATCATACACGAGAGAAATGTTGGAAACTTCATGGTAAGCCAAGTAATGGAGCATCTAATG
GCCCTCGAGCTCCTGTCGACTCTAGCCATGCTGCCGCCGCTGCCAAATCTAGTCTTTGTTCGCCTTCCGCTGCCCCTTTCAATCAGGCCCAGCTTGATCATCTCACCAAG
ATGATGTCGTCCTCTACTCTCACTACATCCCTTATGGCTCATTCAGGACTTGCTACAATTTCTGATTTACACTGCGAACCTTGGATTGTCGACTCAGGGGCTTCCGACCA
TATAACGGGTAGAATTCACTTACTCTCTAACTTCTCACCTTGTTCTCTGCCTCGAACAGTTGAGATTGCCAATGGCTCAGTTCTACGTGTGGTTGGTACCGACACTATTA
TGTTCAATCCGGATTTCTCACTACACAATGTTCTATATGTTCCTCAATTTGAGTATAATCTTATGTCTATTAGCAAACTTACTGCTTCACAACACCTTACTGCCATTTTT
AAGTCTAATGGTTGTTTATTTCAGGACCAGATTACGGGCCGGATGATTGGACATGCTAAGGAACACTCGGGCCTTTACTATTTTTCTGGCACTTCTTCCTCTTTGATTCA
CAGTACAGCGGCTTTATTTCCAAAATTATTTCGAAATAATGAATCCTTCCATTGTACTGTTTGTCAAGTTTCAAAGCATCAACGGATTCCGTTTGTGTCTCAATCTTATT
GCCCTTCTCATCCTTTTGCTTTAGTTCATAGTGATGTTTGGGGTCCTTCCCGAATTTCAACTCTTCGAAATAAGAGACAGCCATCAAAAATCCTCAAGTTCAATACACCT
TTATCTGTTCTTCGTTCCCATTTTCCATCTTCTCGAGTTTTCTCTGAGTTGGAATTACGTGTATTTGGCTGCAAAGCCTTTGTTCATAATTTAATTCCAGCACGATGGGG
GATCTCGGCATCAGAAGAATTTTTTGGCAAGTTGCCAACTATAGCTGATTTTGAGTTATTTTGGGAACCTACACCATCCTCAAATCCCACAAATATCTCCAACGAACTAG
CTCCCACAAATATCTCCAATAATCATGATTCTCCCTCTCCAAATTCAAATCCGCCTATTTCCCGGAATCAATCTTTCTCTCCAAATTCGTCCACTCCAGAAAATCTGCTA
GCTCACCAAAATCAATCAAATGCTCAAAATTCGTCTCCCAAGTCCATGTCTTCCAACCTGGAAAATCAAATCCCAGTTCCCACTTCAAATTCCGTTCCGGTCATACCCTC
AGGAAACATTCCAGGTGGTAAAAGTGCTATACAACCATACCAATTTACATATTCCAGGAGAGATGGAAAAATAGAGGAACCCGAGGATCCTCATGAACATGAAATTAGCC
CCGTGTCTGAAACTGAACTAAATGACATTGATTTACCTATAGCAATACGGAAAGGAACGAGGTCTTGTACTCAACACTCAATCTCCAATTTCTTGGGTTATGCACACTTA
TCCACTCCTATGCAAGCACTGGTAACTCAACTCTCCAAAGAAAGTGTACCAGCAACCATTCAAGAGGCGTGGCAAGATCCTAAGTGGAAAAAGGCAGTAATGGAAGAAAT
GACAGCGTTAGAAAAAAATGGAACTTGGCAGATTGTTACTAAATCCAAAGACAAGAAATCAGTGGGTTGCAGGTGGGTTTTCACAATTAAATACAAGTCTGATGGAACTA
TAGAACGCTATAAAGCTCGACTGGTAGCTAAGGGTTATTCTCAAACCCATGGAGTTGATTATCAGGAGACGTTCGCACCCGTTGCAAAGATCAACTCAATCAGAGTCTTA
CTTTCCTTAGCTGCTAATCATGATTGGCCACTGTTTCAACTTGATGTGAAGAATGCATTTTTGAATGGAGAACTAGAAGAGGAGGTGTACATGGATTTACCACCTGGCTT
TAATTCTAATGGAGATAATAAAGTCTGCCTATTGAAGAAATCCTTATACAGGTTAAAACAATCACCAAGGGCTTGGTTTGGTCGGTTCACGAAAGCTATTTTAAACTTTG
GTTACAAACAAGCTCACTCAGACCACACACTCTTCTTCAAGAGACAACATACTGGACTAACAATTTTAATTGTTTATGTTGATGATATTGTGCTTACAGGAGATGACAAA
ACCGAGATTCAATCTGTTAAGAACAAATTAGCAGCAGAGTTTGAACTCAAAGATTTGGGGAATCTACGGCCAGTTGATACTCCTATGGACCCGAGTTTAAAATTGGAAAC
AAAAGCTGGAGATCCTCCTGTTGATAAAGGAAGATATCAACGATTGGTTGGTAAACTAATATATCTCTCCTACACACGTCCAGACATAGCTTTTGCAGTAAGCTGTGTGA
GCCAATTTATGCACTCTCCATCCGAGATACACATGCAAGCAATACACAGAGTTCTCAAGTATTTAAAGGGAACACCGGGTTTGGGCTTACTCTTCAAGAAAAACACACAT
AGAGGAGTCGAACTATTCACAAATGCAGATTGGGCAGGCTCAGCTGAAGACCGAAGATCAACTTCTGGTTATTGTACCTTTGTATGGGGCAACTTGGTTACTTGGCGAAG
TAAGAAACAGTCCGTTGTAGCTAGAAGCAGTGCAGAGGCAGAACTGCGATCGTTAGCTCTTGGAATCTGTGAGGGTTTGTGGCTGAAGATGTTTCTTGAGGAACTTGACA
TGAAAACTGATGGACCAATCACTAACTACTGCGATAATAGAGCTGCAATATCTATTTCACATAATCCGGTTCACCATGATAGAGTGAAACATGTAGAACTTGATCGCCAT
TTCATTAAAGAAAAGGTGGATGAAGGAATATTGAAGATTCAGGGCGATGTACAGTTTGATTGTGAAAGTCATCAGGTGCCTTGGGTAATATGGCCAAGGGTCAGGGCGAT
GTACAGTTTCATTTTGAAGTCATCAGGTGCCTTGGGTAATATGGTCAAGGGTCGAACGCTGAGCGCCGTAGAGAAACGTGAGGCCCTGGGTAATATGGTCAGGGGTCGGC
GTGACTCGAAGGGGTGTGAGCCAGTTACTCAGTACTGTGGTTTTGTACTGACCCACCACCAGATTTTGCAGGTGCTGAAAGCATATTCGAGTTTGATGATGCAGAGAGAT
GGGAGAAGGACTATTAGAGAACTGGTTTCCCAAACCCTCTTTAGTGGGAATTTGAGGTGTCGTTTTTCGTTCTCCTCTCCCTCCAACTTCGCACTTCTCTCTCTCGGTTT
TCCTTCCACTGTCGCGAGGGTTAGCAGTCACTCTGAGGTAAAGTTTCATTCGTCCACTGATCTGAACCTTCATCTTCTACTGCTTTACTCTCCAGACGGTGATCGACAGT
GTTGTTCTAGCGATTTGTGTATGAGAAAGTCACAGCTGCTCCTTCTCTCGAGGGTTGACAGCAAAGTTTTAAATGTTGGAGAGGAGCTGTGGAGGGAAACATTACCATTA
CAAATGGGTTCTCGTCTGTATCATCTTCAAGGTCTCAAACCTGATACGTGGTATGAAGTGAAGATATCACATCCGGGCTCTATACCTGCCAGCTTTTCGTTAGAGCTAAA
GAGAGACTTGTCAAGCCTCGTCGAGAAGCAGAGCAGAAAACTACTTGACACTGAAAAGCTTATTTTTAAGACGGAAAGCATGAACTTGCAGGGAGATCAGGTTGATACAC
ATGTATTGGTCACTGTGAAGTCCGAGGGAGTTGTTGCTATACCATCTGTCCAGGAAAGGGAGTCGATCATCTTCAATATAGCATACAAGACCTGTTACCTCTTGCCCCTT
TCATATGCTGGAGTCATATTGTCGTTTGCTTGCTTGACAGCAACATCTAATGTCTCATTCATTAGTATGATAGTTTCTGGTTTCGTGACAGTTTGTGACGAACTACTTAT
CAGCATCCCGCACAAAGCTTGGGGGGTTGGAGTCTTGGTTATAATCTGCTTGATATTGGCATTCATCATCCCATCTCTTCTTCCGTCATGGCTGCTACAGACCAACCAAA
CCCAAGGACCAGGAAATCATCATTCTTCCAAGAATTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGTCGGCTCTAATGGTGGTGAATCTTCTGAGGTGGTGGCTGACTCTCCTCCTCTTCCAATCTCCATTAATGGCTCTTCTTCGGGTCCTTCATCTGACCATCACTC
TGTGGCTATTACTGCTTTTAAGCTTAATGGCCATAATTTTATTCCCTGGTCTAGATCCATTCAAATGTTGATTCGTGGCAAGGGAAAAATTGGGTTTATTGATGGTTCTC
TTGTTCAGCCTTCAACCTCTGACCCCTCGTTTTCTTCATGGGACATGAACAACTCTCTTGTGATGTCTTGGTTGATTCATTCCATGGATTCTTGTATTGGTGAATTGTAT
CTTCTTTATTCAACAGCTAAAGCTATTTGGGATGCTGTTAATCTCGCCTATTCAGACCTTGAAGACTCATCACAGATGTTCTCGCTTCGCACTCGTATTCGCAACCTTCG
CCAAGAAACTGGTACGGTCACAAACTACTTTCACCAACTTACTCGATTGTGGCAAGAACTTGACCTCTTTCAGCAGCAACCATGGAAGGACCCACTTGATGGTACTTTAT
ACCGTCAGATGCTTGCTAAGGAGAGAATCTATGATTTCCTTGCTGGTCTTAACTCTTCTCTCGATGAAGTCCGTGGTCGTATACTCGGTTCGAAACCTCTTCCAAGTATT
GATGAGATATTTGCTGAAGTAAGACGAGAAGAACACCGAAAGACTATTATGCTTTGCTCTACACCTTCTGGTTCATCTGATGCTTCGGCTCTGGTTGCTATGCATCCTGC
TGATCGCAAAAAAAGAGCTCATACTTGGTATGATTTCTGCAATAAGCCTTATCATACACGAGAGAAATGTTGGAAACTTCATGGTAAGCCAAGTAATGGAGCATCTAATG
GCCCTCGAGCTCCTGTCGACTCTAGCCATGCTGCCGCCGCTGCCAAATCTAGTCTTTGTTCGCCTTCCGCTGCCCCTTTCAATCAGGCCCAGCTTGATCATCTCACCAAG
ATGATGTCGTCCTCTACTCTCACTACATCCCTTATGGCTCATTCAGGACTTGCTACAATTTCTGATTTACACTGCGAACCTTGGATTGTCGACTCAGGGGCTTCCGACCA
TATAACGGGTAGAATTCACTTACTCTCTAACTTCTCACCTTGTTCTCTGCCTCGAACAGTTGAGATTGCCAATGGCTCAGTTCTACGTGTGGTTGGTACCGACACTATTA
TGTTCAATCCGGATTTCTCACTACACAATGTTCTATATGTTCCTCAATTTGAGTATAATCTTATGTCTATTAGCAAACTTACTGCTTCACAACACCTTACTGCCATTTTT
AAGTCTAATGGTTGTTTATTTCAGGACCAGATTACGGGCCGGATGATTGGACATGCTAAGGAACACTCGGGCCTTTACTATTTTTCTGGCACTTCTTCCTCTTTGATTCA
CAGTACAGCGGCTTTATTTCCAAAATTATTTCGAAATAATGAATCCTTCCATTGTACTGTTTGTCAAGTTTCAAAGCATCAACGGATTCCGTTTGTGTCTCAATCTTATT
GCCCTTCTCATCCTTTTGCTTTAGTTCATAGTGATGTTTGGGGTCCTTCCCGAATTTCAACTCTTCGAAATAAGAGACAGCCATCAAAAATCCTCAAGTTCAATACACCT
TTATCTGTTCTTCGTTCCCATTTTCCATCTTCTCGAGTTTTCTCTGAGTTGGAATTACGTGTATTTGGCTGCAAAGCCTTTGTTCATAATTTAATTCCAGCACGATGGGG
GATCTCGGCATCAGAAGAATTTTTTGGCAAGTTGCCAACTATAGCTGATTTTGAGTTATTTTGGGAACCTACACCATCCTCAAATCCCACAAATATCTCCAACGAACTAG
CTCCCACAAATATCTCCAATAATCATGATTCTCCCTCTCCAAATTCAAATCCGCCTATTTCCCGGAATCAATCTTTCTCTCCAAATTCGTCCACTCCAGAAAATCTGCTA
GCTCACCAAAATCAATCAAATGCTCAAAATTCGTCTCCCAAGTCCATGTCTTCCAACCTGGAAAATCAAATCCCAGTTCCCACTTCAAATTCCGTTCCGGTCATACCCTC
AGGAAACATTCCAGGTGGTAAAAGTGCTATACAACCATACCAATTTACATATTCCAGGAGAGATGGAAAAATAGAGGAACCCGAGGATCCTCATGAACATGAAATTAGCC
CCGTGTCTGAAACTGAACTAAATGACATTGATTTACCTATAGCAATACGGAAAGGAACGAGGTCTTGTACTCAACACTCAATCTCCAATTTCTTGGGTTATGCACACTTA
TCCACTCCTATGCAAGCACTGGTAACTCAACTCTCCAAAGAAAGTGTACCAGCAACCATTCAAGAGGCGTGGCAAGATCCTAAGTGGAAAAAGGCAGTAATGGAAGAAAT
GACAGCGTTAGAAAAAAATGGAACTTGGCAGATTGTTACTAAATCCAAAGACAAGAAATCAGTGGGTTGCAGGTGGGTTTTCACAATTAAATACAAGTCTGATGGAACTA
TAGAACGCTATAAAGCTCGACTGGTAGCTAAGGGTTATTCTCAAACCCATGGAGTTGATTATCAGGAGACGTTCGCACCCGTTGCAAAGATCAACTCAATCAGAGTCTTA
CTTTCCTTAGCTGCTAATCATGATTGGCCACTGTTTCAACTTGATGTGAAGAATGCATTTTTGAATGGAGAACTAGAAGAGGAGGTGTACATGGATTTACCACCTGGCTT
TAATTCTAATGGAGATAATAAAGTCTGCCTATTGAAGAAATCCTTATACAGGTTAAAACAATCACCAAGGGCTTGGTTTGGTCGGTTCACGAAAGCTATTTTAAACTTTG
GTTACAAACAAGCTCACTCAGACCACACACTCTTCTTCAAGAGACAACATACTGGACTAACAATTTTAATTGTTTATGTTGATGATATTGTGCTTACAGGAGATGACAAA
ACCGAGATTCAATCTGTTAAGAACAAATTAGCAGCAGAGTTTGAACTCAAAGATTTGGGGAATCTACGGCCAGTTGATACTCCTATGGACCCGAGTTTAAAATTGGAAAC
AAAAGCTGGAGATCCTCCTGTTGATAAAGGAAGATATCAACGATTGGTTGGTAAACTAATATATCTCTCCTACACACGTCCAGACATAGCTTTTGCAGTAAGCTGTGTGA
GCCAATTTATGCACTCTCCATCCGAGATACACATGCAAGCAATACACAGAGTTCTCAAGTATTTAAAGGGAACACCGGGTTTGGGCTTACTCTTCAAGAAAAACACACAT
AGAGGAGTCGAACTATTCACAAATGCAGATTGGGCAGGCTCAGCTGAAGACCGAAGATCAACTTCTGGTTATTGTACCTTTGTATGGGGCAACTTGGTTACTTGGCGAAG
TAAGAAACAGTCCGTTGTAGCTAGAAGCAGTGCAGAGGCAGAACTGCGATCGTTAGCTCTTGGAATCTGTGAGGGTTTGTGGCTGAAGATGTTTCTTGAGGAACTTGACA
TGAAAACTGATGGACCAATCACTAACTACTGCGATAATAGAGCTGCAATATCTATTTCACATAATCCGGTTCACCATGATAGAGTGAAACATGTAGAACTTGATCGCCAT
TTCATTAAAGAAAAGGTGGATGAAGGAATATTGAAGATTCAGGGCGATGTACAGTTTGATTGTGAAAGTCATCAGGTGCCTTGGGTAATATGGCCAAGGGTCAGGGCGAT
GTACAGTTTCATTTTGAAGTCATCAGGTGCCTTGGGTAATATGGTCAAGGGTCGAACGCTGAGCGCCGTAGAGAAACGTGAGGCCCTGGGTAATATGGTCAGGGGTCGGC
GTGACTCGAAGGGGTGTGAGCCAGTTACTCAGTACTGTGGTTTTGTACTGACCCACCACCAGATTTTGCAGGTGCTGAAAGCATATTCGAGTTTGATGATGCAGAGAGAT
GGGAGAAGGACTATTAGAGAACTGGTTTCCCAAACCCTCTTTAGTGGGAATTTGAGGTGTCGTTTTTCGTTCTCCTCTCCCTCCAACTTCGCACTTCTCTCTCTCGGTTT
TCCTTCCACTGTCGCGAGGGTTAGCAGTCACTCTGAGGTAAAGTTTCATTCGTCCACTGATCTGAACCTTCATCTTCTACTGCTTTACTCTCCAGACGGTGATCGACAGT
GTTGTTCTAGCGATTTGTGTATGAGAAAGTCACAGCTGCTCCTTCTCTCGAGGGTTGACAGCAAAGTTTTAAATGTTGGAGAGGAGCTGTGGAGGGAAACATTACCATTA
CAAATGGGTTCTCGTCTGTATCATCTTCAAGGTCTCAAACCTGATACGTGGTATGAAGTGAAGATATCACATCCGGGCTCTATACCTGCCAGCTTTTCGTTAGAGCTAAA
GAGAGACTTGTCAAGCCTCGTCGAGAAGCAGAGCAGAAAACTACTTGACACTGAAAAGCTTATTTTTAAGACGGAAAGCATGAACTTGCAGGGAGATCAGGTTGATACAC
ATGTATTGGTCACTGTGAAGTCCGAGGGAGTTGTTGCTATACCATCTGTCCAGGAAAGGGAGTCGATCATCTTCAATATAGCATACAAGACCTGTTACCTCTTGCCCCTT
TCATATGCTGGAGTCATATTGTCGTTTGCTTGCTTGACAGCAACATCTAATGTCTCATTCATTAGTATGATAGTTTCTGGTTTCGTGACAGTTTGTGACGAACTACTTAT
CAGCATCCCGCACAAAGCTTGGGGGGTTGGAGTCTTGGTTATAATCTGCTTGATATTGGCATTCATCATCCCATCTCTTCTTCCGTCATGGCTGCTACAGACCAACCAAA
CCCAAGGACCAGGAAATCATCATTCTTCCAAGAATTCTTAA
Protein sequenceShow/hide protein sequence
MAVGSNGGESSEVVADSPPLPISINGSSSGPSSDHHSVAITAFKLNGHNFIPWSRSIQMLIRGKGKIGFIDGSLVQPSTSDPSFSSWDMNNSLVMSWLIHSMDSCIGELY
LLYSTAKAIWDAVNLAYSDLEDSSQMFSLRTRIRNLRQETGTVTNYFHQLTRLWQELDLFQQQPWKDPLDGTLYRQMLAKERIYDFLAGLNSSLDEVRGRILGSKPLPSI
DEIFAEVRREEHRKTIMLCSTPSGSSDASALVAMHPADRKKRAHTWYDFCNKPYHTREKCWKLHGKPSNGASNGPRAPVDSSHAAAAAKSSLCSPSAAPFNQAQLDHLTK
MMSSSTLTTSLMAHSGLATISDLHCEPWIVDSGASDHITGRIHLLSNFSPCSLPRTVEIANGSVLRVVGTDTIMFNPDFSLHNVLYVPQFEYNLMSISKLTASQHLTAIF
KSNGCLFQDQITGRMIGHAKEHSGLYYFSGTSSSLIHSTAALFPKLFRNNESFHCTVCQVSKHQRIPFVSQSYCPSHPFALVHSDVWGPSRISTLRNKRQPSKILKFNTP
LSVLRSHFPSSRVFSELELRVFGCKAFVHNLIPARWGISASEEFFGKLPTIADFELFWEPTPSSNPTNISNELAPTNISNNHDSPSPNSNPPISRNQSFSPNSSTPENLL
AHQNQSNAQNSSPKSMSSNLENQIPVPTSNSVPVIPSGNIPGGKSAIQPYQFTYSRRDGKIEEPEDPHEHEISPVSETELNDIDLPIAIRKGTRSCTQHSISNFLGYAHL
STPMQALVTQLSKESVPATIQEAWQDPKWKKAVMEEMTALEKNGTWQIVTKSKDKKSVGCRWVFTIKYKSDGTIERYKARLVAKGYSQTHGVDYQETFAPVAKINSIRVL
LSLAANHDWPLFQLDVKNAFLNGELEEEVYMDLPPGFNSNGDNKVCLLKKSLYRLKQSPRAWFGRFTKAILNFGYKQAHSDHTLFFKRQHTGLTILIVYVDDIVLTGDDK
TEIQSVKNKLAAEFELKDLGNLRPVDTPMDPSLKLETKAGDPPVDKGRYQRLVGKLIYLSYTRPDIAFAVSCVSQFMHSPSEIHMQAIHRVLKYLKGTPGLGLLFKKNTH
RGVELFTNADWAGSAEDRRSTSGYCTFVWGNLVTWRSKKQSVVARSSAEAELRSLALGICEGLWLKMFLEELDMKTDGPITNYCDNRAAISISHNPVHHDRVKHVELDRH
FIKEKVDEGILKIQGDVQFDCESHQVPWVIWPRVRAMYSFILKSSGALGNMVKGRTLSAVEKREALGNMVRGRRDSKGCEPVTQYCGFVLTHHQILQVLKAYSSLMMQRD
GRRTIRELVSQTLFSGNLRCRFSFSSPSNFALLSLGFPSTVARVSSHSEVKFHSSTDLNLHLLLLYSPDGDRQCCSSDLCMRKSQLLLLSRVDSKVLNVGEELWRETLPL
QMGSRLYHLQGLKPDTWYEVKISHPGSIPASFSLELKRDLSSLVEKQSRKLLDTEKLIFKTESMNLQGDQVDTHVLVTVKSEGVVAIPSVQERESIIFNIAYKTCYLLPL
SYAGVILSFACLTATSNVSFISMIVSGFVTVCDELLISIPHKAWGVGVLVIICLILAFIIPSLLPSWLLQTNQTQGPGNHHSSKNS