; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036294 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036294
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr3:43574383..43582885
RNA-Seq ExpressionLag0036294
SyntenyLag0036294
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]2.9e-30040.28Show/hide
Query:  ANIVSSSFGHPLSTVLTVKIDEKNYLLWRGMVLAILRGQKVDGYVLGTKAKPSEMVEAVTDAGKELRPNPLFEEWVTVDQALSGWLFGSMTPTIAADVVS
        A+   S+  + L + ++VK+D  NY LW+ +VL ++RG K+DGY+LGT+  P E    +T +      N  F EW   DQ L GW+  SMT  IA  ++ 
Subjt:  ANIVSSSFGHPLSTVLTVKIDEKNYLLWRGMVLAILRGQKVDGYVLGTKAKPSEMVEAVTDAGKELRPNPLFEEWVTVDQALSGWLFGSMTPTIAADVVS

Query:  FKTSREVWTTLEEVYGATSKARINQLRGILQNTKKGFMKMIAFLAVMKQASENLQLAGNPVCLGDLISYVLASLSPEYILIVSSINDKDVKTWQELSSIL
         +TS+++W   + + GA ++++I  L+    + +KG MKM  +L  MK   + L+LAGNPV   DLI   L  L  EY  +V  ++D+   +W +L + L
Subjt:  FKTSREVWTTLEEVYGATSKARINQLRGILQNTKKGFMKMIAFLAVMKQASENLQLAGNPVCLGDLISYVLASLSPEYILIVSSINDKDVKTWQELSSIL

Query:  VTFEGTLAKYSNDVVSADLPDLSIQLALNRPNQFS-------------------------------------------ACYMRFEESFNDLHAGNSSGQI
        +TFE  + + +N      L +L++    N  N+                                              C+ RF+++++     N S   
Subjt:  VTFEGTLAKYSNDVVSADLPDLSIQLALNRPNQFS-------------------------------------------ACYMRFEESFNDLHAGNSSGQI

Query:  GSSGNSSAYIATLDILNDPKWLADSGATTHVTADVGNLGVKTDYNGKESLIVGDGSHLNISHVGYTTVPSNLPNTAPVYLKNMLHVPSIKRNLISIACLT
           G+ +A++A+ + + D  W  DSGA+ HVT         T+++GK SL+VG+G  L I   G + + S       + L ++L+VP+I +NL+S++ L 
Subjt:  GSSGNSSAYIATLDILNDPKWLADSGATTHVTADVGNLGVKTDYNGKESLIVGDGSHLNISHVGYTTVPSNLPNTAPVYLKNMLHVPSIKRNLISIACLT

Query:  ADNNVFVEFHSNACIVKDKVSRRVILQGTLKKRLYEIELPSIPKVNSGVQSPVVAPVFEYPNKSSVLKSSSVFQAESPKSCNVSRSTWHKRLGHASDRIV
        ADNN+ VEF  N C VKDK++ +VIL+G LK  LY++         SG +                          +P +    + +WH+RLGH +++++
Subjt:  ADNNVFVEFHSNACIVKDKVSRRVILQGTLKKRLYEIELPSIPKVNSGVQSPVVAPVFEYPNKSSVLKSSSVFQAESPKSCNVSRSTWHKRLGHASDRIV

Query:  SHVLQSCNLKVSLNEKSEFCEACQLGKSHCLPFSRSISRSTQPLALVHCDLWGPSPVVSTLGYRYYVSFVDDFSRFTHIYPLKSKGEALVSFQTYKSLVE
          VL+SC +KV  ++   FCEACQ GK H LPF  S S + +PL LVH D+WGP+P++++ G++YYV FVDDFSRFT IYPLK K E + +F  +K+L E
Subjt:  SHVLQSCNLKVSLNEKSEFCEACQLGKSHCLPFSRSISRSTQPLALVHCDLWGPSPVVSTLGYRYYVSFVDDFSRFTHIYPLKSKGEALVSFQTYKSLVE

Query:  NKFEKRIKVLQTDWG---------------------------------------EMGLTFLAQAAMPLSYWWDAFCSAVFVINQLPTPVLDYLSPWEKAF
        N+F KRIKV+Q D G                                       E GLT LAQA MPL YWW+AF +AV++IN+LP+ V    SP+    
Subjt:  NKFEKRIKVLQTDWG---------------------------------------EMGLTFLAQAAMPLSYWWDAFCSAVFVINQLPTPVLDYLSPWEKAF

Query:  KSSPDFEFLRTFGCAYFPCLWPYQSHKFQFHSSKCVFLGYSPSHK-----------------VLNSPPLSLPEPSTCVSPMPQPAIPAPSSEF-------
        +  PD++ L+TFGCA +PCL PY  HK Q+H+++CVFLGYS SHK                 + N       +         +  I  PS+ F       
Subjt:  KSSPDFEFLRTFGCAYFPCLWPYQSHKFQFHSSKCVFLGYSPSHK-----------------VLNSPPLSLPEPSTCVSPMPQPAIPAPSSEF-------

Query:  ---QATSPSLGTTSPIQHENEGSISICVDTA----SPSGSVSPIQPVVSLP-------------VGHHMQTRSKSGIFKPKDWGSFLAASSDDVVIEPVL
            A+ P L   +P +   E S  +  DT      PS   +  +  + +                H + TRSKSGI KPK     L  +  D  +EP  
Subjt:  ---QATSPSLGTTSPIQHENEGSISICVDTA----SPSGSVSPIQPVVSLP-------------VGHHMQTRSKSGIFKPKDWGSFLAASSDDVVIEPVL

Query:  VKDALQSAIWKSAIDQEIAALHRNKTWTLVPPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQIPGVDFHETFSPVVKAPTIQIILTVAVSKSW
         K+AL   +WK A+ +E  AL  NKTW LVP  +  N++ SKWVFK K K DGS ++ KARLVAKGF Q  G+D+ ETFSPV+KA T++IIL++AV  +W
Subjt:  VKDALQSAIWKSAIDQEIAALHRNKTWTLVPPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQIPGVDFHETFSPVVKAPTIQIILTVAVSKSW

Query:  PVRKLDVNNAILNGKLDEDVFMKQPIGFIDKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNSKADNYVFFFHSDQLTVLVFIYVDDILVTGD
         VR+LD+NNA LNG L E VFM QP GF+D   P+++CKL KA+YGL QAPRAW+D  K  L+ W F N+K+D+ +F          + IYVDDI+VTG 
Subjt:  PVRKLDVNNAILNGKLDEDVFMKQPIGFIDKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNSKADNYVFFFHSDQLTVLVFIYVDDILVTGD

Query:  SIGFIDQSVHRLNTVFALKDIGPLSYYLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFSTFEGTPMHDPTLYRSVLGALQYITHTR
        +  F+   + +LN  F+LKD+G L Y+LG+EV R  SGMYL QSKYI +L+++  + +  PC TP++TG++F T EG  + DPT++R  +G LQY+THT 
Subjt:  SIGFIDQSVHRLNTVFALKDIGPLSYYLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFSTFEGTPMHDPTLYRSVLGALQYITHTR

Query:  PDISFIVNKLSQFLQQPSDIHWQAAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHSLVSWSSKKQTVVARSSTESEYR
        PDI+F VNKLSQ++  PS  HWQ  KR+LRYL+GT++Y L +  S DL I  ++DADWA ++DDR+S++  C+F+G +L+SWSS+KQ VV+RSSTESEYR
Subjt:  PDISFIVNKLSQFLQQPSDIHWQAAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHSLVSWSSKKQTVVARSSTESEYR

Query:  SLAQTAADVTWVQSLLTELGLSSSTIPLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVLTKAVEIRYVPSSEQLADALTKPLSEGQFSSIRGKLG
        +LA  AA++ W++SLLTEL L     P++WCDN+SA +L  NPV H+RSKHIE+D+H++RD+VL   V + YVP+++Q+AD LTKPLS  +FS +R KLG
Subjt:  SLAQTAADVTWVQSLLTELGLSSSTIPLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVLTKAVEIRYVPSSEQLADALTKPLSEGQFSSIRGKLG

Query:  VLPSP
        V+ SP
Subjt:  VLPSP

GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum]8.6e-29240.74Show/hide
Query:  SAANIVSSSFGHPLSTVLTVKIDEKNYLLWRGMVLAILRGQKVDGYVLGTKAKPSEMVEAVTDAGKELRPNPLFEEWVTVDQALSGWLFGSMTPTIAADV
        SAAN   S   + L ++++VK+D  NY LW+ +VL+++RG K+DGY+LGT   P +    VT A K  + NP F +W+  DQAL GWL  SM   IA  +
Subjt:  SAANIVSSSFGHPLSTVLTVKIDEKNYLLWRGMVLAILRGQKVDGYVLGTKAKPSEMVEAVTDAGKELRPNPLFEEWVTVDQALSGWLFGSMTPTIAADV

Query:  VSFKTSREVWTTLEEVYGATSKARINQLRGILQNTKKGFMKMIAFLAVMKQASENLQLAGNPVCLGDLISYVLASLSPEYILIVSSINDKDVKTWQELSS
        +  +TS+++W   + + GA +K+RI  L+    NT+KG MKM  +L  MK  S+ L+LAG+P+   DL+   L  L  EY  +V  ++D+   +W ++ +
Subjt:  VSFKTSREVWTTLEEVYGATSKARINQLRGILQNTKKGFMKMIAFLAVMKQASENLQLAGNPVCLGDLISYVLASLSPEYILIVSSINDKDVKTWQELSS

Query:  ILVTFEGTLAKYSNDVVSADLPDLSIQLALN-------RPNQFSACYMRFEESFNDLHAGNSSGQIGSS-------------------------------
         L+ FE  L +++N         L++  + N       R N+F++       +F  +  G   G++ ++                               
Subjt:  ILVTFEGTLAKYSNDVVSADLPDLSIQLALN-------RPNQFSACYMRFEESFNDLHAGNSSGQIGSS-------------------------------

Query:  ----GNSSAYIATLDILNDPKWLADSGATTHVTADVGNLGVKTDYNGKESLIVGDGSHLNISHVGYTTVPSNLPNTAPVYLKNMLHVPSIKRNLISIACL
            G+ SA+IA+     D +W  DSGA  HVT          ++NGK SL+VG+G  L I   G T + +       + L ++L+VP I +NL+S++ L
Subjt:  ----GNSSAYIATLDILNDPKWLADSGATTHVTADVGNLGVKTDYNGKESLIVGDGSHLNISHVGYTTVPSNLPNTAPVYLKNMLHVPSIKRNLISIACL

Query:  TADNNVFVEFHSNACIVKDKVSRRVILQGTLKKRLYEIELPSIPKVNSGVQSPVVAPVFEYPNKSSVLKSSSVFQAESPKSCNVSRSTWHKRLGHASDRI
        TADNN+ VEF +N C VKDK++ + +L+G LK  LY++                        NK   +  S              + +WH++LGH ++++
Subjt:  TADNNVFVEFHSNACIVKDKVSRRVILQGTLKKRLYEIELPSIPKVNSGVQSPVVAPVFEYPNKSSVLKSSSVFQAESPKSCNVSRSTWHKRLGHASDRI

Query:  VSHVLQSCNLKVSLNEKSEFCEACQLGKSHCLPFSRSISRSTQPLALVHCDLWGPSPVVSTLGYRYYVSFVDDFSRFTHIYPLKSKGEALVSFQTYKSLV
        +  VL+ CN+K+S +++  FCEACQ GK H LPF  S S   +PLAL+H D+WGP+P++S  G++YYV F+DDFSRFT I+PLK K + + +F  +K+L 
Subjt:  VSHVLQSCNLKVSLNEKSEFCEACQLGKSHCLPFSRSISRSTQPLALVHCDLWGPSPVVSTLGYRYYVSFVDDFSRFTHIYPLKSKGEALVSFQTYKSLV

Query:  ENKFEKRIKVLQTDWG---------------------------------------EMGLTFLAQAAMPLSYWWDAFCSAVFVINQLPTPVLDYLSPWEKA
        EN+F K+IK++Q D G                                       E+GLT LAQA MPL YWW+AF +AV++IN+LP+ V    SP+   
Subjt:  ENKFEKRIKVLQTDWG---------------------------------------EMGLTFLAQAAMPLSYWWDAFCSAVFVINQLPTPVLDYLSPWEKA

Query:  FKSSPDFEFLRTFGCAYFPCLWPYQSHKFQFHSSKCVFLGYSPSH---KVLNSPP---------------------LSLPEP---STCVSPMPQPAIPAP
        FK  PD+  L+ FGCA +PCL PY  HK QFH+++CVF+GYS SH   K +NS                       L    P    T  S +  P   A 
Subjt:  FKSSPDFEFLRTFGCAYFPCLWPYQSHKFQFHSSKCVFLGYSPSH---KVLNSPP---------------------LSLPEP---STCVSPMPQPAIPAP

Query:  SSEFQATSPSLGTT---------SPIQHENE---GSISICVDTASPS-------GSVSPIQPVVSLPVG-------------HHMQTRSKSGIFKPKDWG
        ++   A  P   TT         S   +ENE    S    V+T + S        SV       S   G             H M+TRSK GI KPK   
Subjt:  SSEFQATSPSLGTT---------SPIQHENE---GSISICVDTASPS-------GSVSPIQPVVSLPVG-------------HHMQTRSKSGIFKPKDWG

Query:  SFLAASSDDVVIEPVLVKDALQSAIWKSAIDQEIAALHRNKTWTLVPPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQIPGVDFHETFSPVVK
            A +D    EP  VK+AL   +WK A+D+E  AL  N TWTLVP     N+I SKW+FK K KSDGS ++ KARLVAKGF Q  G+DF ETFSPVVK
Subjt:  SFLAASSDDVVIEPVLVKDALQSAIWKSAIDQEIAALHRNKTWTLVPPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQIPGVDFHETFSPVVK

Query:  APTIQIILTVAVSKSWPVRKLDVNNAILNGKLDEDVFMKQPIGFIDKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNSKADNYVFFFHSDQL
        + T++IILT+AV  +W VR+LD+NNA LNGKL E VFM QP G+ID   P+++CKL KA+YGL QAPRAWYD  ++TL+ W F N+K D  +FF      
Subjt:  APTIQIILTVAVSKSWPVRKLDVNNAILNGKLDEDVFMKQPIGFIDKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNSKADNYVFFFHSDQL

Query:  TVLVFIYVDDILVTGDSIGFIDQSVHRLNTVFALKDIGPLSYYLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFSTFEGTPMHDPT
        T  + IYVDDI+VTG +I F++   ++LNT ++LKD+GPL Y+LGVEVHR  SGMYL Q+KYI +++++  + +   C TP+VTG++F   EG  M +PT
Subjt:  TVLVFIYVDDILVTGDSIGFIDQSVHRLNTVFALKDIGPLSYYLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFSTFEGTPMHDPT

Query:  LYRSVLGALQYITHTRPDISFIVNKLSQFLQQPSDIHWQAAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHSLVSWSS
        LYR  +GALQY+T+TRPDI+F VNKLSQ++  P+  HWQ  KR+LRYL+GT ++ L +  S +L I  + DADWA + DDR+S    C+F+G +LVSW+S
Subjt:  LYRSVLGALQYITHTRPDISFIVNKLSQFLQQPSDIHWQAAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHSLVSWSS

Query:  KKQTVVARSSTESEYRSLAQTAADVTWVQ-------------------SLLTELGLSSSTIPLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVLT
        +KQ VV+RSSTESEYRSLA   A+V+                      +LL EL L     P++WCDN+SA +L  NPV H+RSKHIE+DMH++RD+VL 
Subjt:  KKQTVVARSSTESEYRSLAQTAADVTWVQ-------------------SLLTELGLSSSTIPLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVLT

Query:  KAVEIRYVPSSEQLADALTKPLSEGQFSSIRGKLGVLPSPS
          V I YVP+++Q+AD LTKPL   +F+ +R KLGV  SPS
Subjt:  KAVEIRYVPSSEQLADALTKPLSEGQFSSIRGKLGVLPSPS

RVW64314.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]4.9e-27940.53Show/hide
Query:  LSTVLTVKIDEKNYLLWRGMVLAILRGQKVDGYVLGTKAKPSEMVEAVTDAGKELRPNPLFEEWVTVDQALSGWLFGSMTPTIAADVVSFKTSREVWTTL
        L+  L +K+D  NY+LWR  +  ++     + ++ G K  P +     T +G+    NP F  W   D+ +  W++ S+TP I   +V +++S   W  L
Subjt:  LSTVLTVKIDEKNYLLWRGMVLAILRGQKVDGYVLGTKAKPSEMVEAVTDAGKELRPNPLFEEWVTVDQALSGWLFGSMTPTIAADVVSFKTSREVWTTL

Query:  EEVYGATSKARINQLRGILQNTKKGFMKMIAFLAVMKQASENLQLAGNPVCLGDLISYVLASLSPEYILIVSSINDK-DVKTWQELSSILVTFEGTLAKY
        E ++ A+S+AR+ QLR   Q T+KG + M+ ++  +K  ++NL   G PV   D I  +L  L  +Y  IV+S+  + D  +   + SIL+T E  L+  
Subjt:  EEVYGATSKARINQLRGILQNTKKGFMKMIAFLAVMKQASENLQLAGNPVCLGDLISYVLASLSPEYILIVSSINDK-DVKTWQELSSILVTFEGTLAKY

Query:  S----NDVVSADLPDLSIQ-------LALNRPNQFSA------------------------------CYMRFEESFNDLHAGNSSGQI---GSSGNSSAY
        +    ++V+SA+L     Q          NR + F+                               CY RF+ +F   +    + Q     +     A 
Subjt:  S----NDVVSADLPDLSIQ-------LALNRPNQFSA------------------------------CYMRFEESFNDLHAGNSSGQI---GSSGNSSAY

Query:  IATLDILNDPKWLADSGATTHVTADVGNLGVKTDYNGKESLIVGDGSHLNISHVGYTTVPSNLPNTAPVYLKNMLHVPSIKRNLISIACLTADNNVFVEF
        +A+   ++D  W  D+GAT H++  +  L     Y G + +IVG+G HL I H G T  PS   ++    L+ +LHVP I  NLIS++   ADNN F EF
Subjt:  IATLDILNDPKWLADSGATTHVTADVGNLGVKTDYNGKESLIVGDGSHLNISHVGYTTVPSNLPNTAPVYLKNMLHVPSIKRNLISIACLTADNNVFVEF

Query:  HSNACIVKDKVSRRVILQGTLKKRLYEIELPSIPKVNSGVQSPVVAPVFEYPNKSSVLKSSSVFQAESPKSCNVSRSTWHKRLGHASDRIVSHVLQSCNL
        H     VKD+V+++++LQG+L+  LY      +P   + V                    SS +   S  S   + + WH RLGH ++ I+ H+L SCN+
Subjt:  HSNACIVKDKVSRRVILQGTLKKRLYEIELPSIPKVNSGVQSPVVAPVFEYPNKSSVLKSSSVFQAESPKSCNVSRSTWHKRLGHASDRIVSHVLQSCNL

Query:  KVSLNEKSEFCEACQLGKSHCLPFSRSISRSTQPLALVHCDLWGPSPVVSTLGYRYYVSFVDDFSRFTHIYPLKSKGEALVSFQTYKSLVENKFEKRIKV
            + K+  C ACQ  KSH LPF+ S+SR++ PLAL+H DLWGP+ + ST G RY++ FVDDFSRF+ IYPL SK +AL  F  +KSLVEN+F  RI+ 
Subjt:  KVSLNEKSEFCEACQLGKSHCLPFSRSISRSTQPLALVHCDLWGPSPVVSTLGYRYYVSFVDDFSRFTHIYPLKSKGEALVSFQTYKSLVENKFEKRIKV

Query:  LQTDWG---------------------------------------EMGLTFLAQAAMPLSYWWDAFCSAVFVINQLPTPVLDYLSPWEKAFKSSPDFEFL
        L++D G                                       E GL  LA A++P  +W  AF +A+F+IN+LPT VL+Y SP++  F  SP++   
Subjt:  LQTDWG---------------------------------------EMGLTFLAQAAMPLSYWWDAFCSAVFVINQLPTPVLDYLSPWEKAFKSSPDFEFL

Query:  RTFGCAYFPCLWPYQSHKFQFHSSKCVFLGYSPSHK---VLNS------------------PPLSLPEPSTCVSPMPQPAI-----PAPSSEFQATSPSL
        + FGC  +P + PY  +K  + SS+CVFLGYS +HK    LN                   P  S P+ S+ V  +P PA      P  SS    T+PS 
Subjt:  RTFGCAYFPCLWPYQSHKFQFHSSKCVFLGYSPSHK---VLNS------------------PPLSLPEPSTCVSPMPQPAI-----PAPSSEFQATSPSL

Query:  GTTSPIQHENEGSISICVDTASPSGSVSPIQPVVSLPVGHH-MQTRSKSGIFKPKDWGSFLAASSDDVVIEPVLVKDALQSAIWKSAIDQEIAALHRNKT
         ++ P+ +    +IS+      P   +S  +P    P   H M TR+K+GI K K + S         + EP     A++ + W  A+++E +AL RN T
Subjt:  GTTSPIQHENEGSISICVDTASPSGSVSPIQPVVSLPVGHH-MQTRSKSGIFKPKDWGSFLAASSDDVVIEPVLVKDALQSAIWKSAIDQEIAALHRNKT

Query:  WTLVPPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQIPGVDFHETFSPVVKAPTIQIILTVAVSKSWPVRKLDVNNAILNGKLDEDVFMKQPI
        W LVPPPSN N+IG KWV+K+K K DG+ D+ KARLVA+GF Q  G+D+ ETFSPVVKA TI+IIL VA+S +W V +LDV NA L+G L+E VFM+QP 
Subjt:  WTLVPPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQIPGVDFHETFSPVVKAPTIQIILTVAVSKSWPVRKLDVNNAILNGKLDEDVFMKQPI

Query:  GFIDKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNSKADNYVFFFHSDQLTVLVFIYVDDILVTGDSIGFIDQSVHRLNTVFALKDIGPLSY
        GFI+  +PS+VCKL+KALYGL QAPRAWY++   +L+ W F  S+AD+ +F  HS    +++ IYVDDILVTG S   +   + RLN+ FAL+D+G ++Y
Subjt:  GFIDKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNSKADNYVFFFHSDQLTVLVFIYVDDILVTGDSIGFIDQSVHRLNTVFALKDIGPLSY

Query:  YLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFSTFEGTPMHDPTLYRSVLGALQYITHTRPDISFIVNKLSQFLQQPSDIHWQAAK
        +LG+EV R  +  +L Q KY  +L+ R  +   +P  TP + G+  S  +G P  D TLYRS +GALQY+T TRPDISF VNK  QF+  P+  HW A K
Subjt:  YLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFSTFEGTPMHDPTLYRSVLGALQYITHTRPDISFIVNKLSQFLQQPSDIHWQAAK

Query:  RVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHSLVSWSSKKQTVVARSSTESEYRSLAQTAADVTWVQSLLTELGLSSSTI
        R+LRYLKGTL YG+ + +S  L I  YTDADWA   DDRRS   + +F+G +LVSWSS KQ VV+RSS ESEYR+LA   +++ W+Q +L EL LSSS+ 
Subjt:  RVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHSLVSWSSKKQTVVARSSTESEYRSLAQTAADVTWVQSLLTELGLSSSTI

Query:  PLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVLTKAVEIRYVPSSEQLADALTKPLSEGQFSSIRGKLGVLPSPSRLRG
        PL+WCDN SA  L  NPVFH+R+KHIE+D+HF+RD VL K + I+Y+PS+EQ+AD  TK +S  QF S R KL V+PSP  LRG
Subjt:  PLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVLTKAVEIRYVPSSEQLADALTKPLSEGQFSSIRGKLGVLPSPSRLRG

RVW85836.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]2.4e-27038.8Show/hide
Query:  SSAAETANAAVNVFSAANIVSSSFGHP----LSTVLTVKIDEKNYLLWRGMVLAILRGQKVDGYVLGTKAKPSEMVEAVTDAGKELRP---NPLFEEWVT
        S+  ++++++ ++ S  N   +S  HP    L+  L VK+D  NY+LW+  +  ++     + ++ G+   P           KEL     NP F  W  
Subjt:  SSAAETANAAVNVFSAANIVSSSFGHP----LSTVLTVKIDEKNYLLWRGMVLAILRGQKVDGYVLGTKAKPSEMVEAVTDAGKELRP---NPLFEEWVT

Query:  VDQALSGWLFGSMTPTIAADVVSFKTSREVWTTLEEVYGATSKARINQLRGILQNTKKGFMKMIAFLAVMKQASENLQLAGNPVCLGDLISYVLASLSPE
         D+ +  WL+ S+TP I A ++   +S   W  LE+ + ++S+ARI QLR  LQ+TKKG + MI ++  +K A+ +L   G PV   D +  +L  L  +
Subjt:  VDQALSGWLFGSMTPTIAADVVSFKTSREVWTTLEEVYGATSKARINQLRGILQNTKKGFMKMIAFLAVMKQASENLQLAGNPVCLGDLISYVLASLSPE

Query:  YILIVSSINDKDVK-TWQELSSILVTFEGTLAKYSN----DVVSAD----------------------LPDLS---------------------------
        Y  +V++IN +D K + + + S+L+ FE  L + S+      +SA+                       P+ S                           
Subjt:  YILIVSSINDKDVK-TWQELSSILVTFEGTLAKYSN----DVVSAD----------------------LPDLS---------------------------

Query:  -IQLALNRPNQFSACYMRFEESFNDLHAGNSS-GQIGSSGNSSAYIATLDILNDPKWLADSGATTHVTADVGNLGVKTDYNGKESLIVGDGSHLNISHVG
          QL     +    CY RF+ S+    + N+S    G+  +  A +A+ + L D  W  DSGA+ H+T  V NL   + Y G + + +G+G HL+IS+ G
Subjt:  -IQLALNRPNQFSACYMRFEESFNDLHAGNSS-GQIGSSGNSSAYIATLDILNDPKWLADSGATTHVTADVGNLGVKTDYNGKESLIVGDGSHLNISHVG

Query:  YTTVPSNLPNTAPVYLKNMLHVPSIKRNLISIACLTADNNVFVEFHSNACIVKDKVSRRVILQGTLKKRLYEIELPSIPKVNSGVQSPVVAPVFEYPNKS
           +   L N+   +LK + HVP I  NLIS+A   +DNN  +EF SN+  VKD  +++V+ QG L+  LY       P +NS   + V A    Y + S
Subjt:  YTTVPSNLPNTAPVYLKNMLHVPSIKRNLISIACLTADNNVFVEFHSNACIVKDKVSRRVILQGTLKKRLYEIELPSIPKVNSGVQSPVVAPVFEYPNKS

Query:  SVLKSSSVFQAESPKSCNVSRSTWHKRLGHASDRIVSHVLQSCNLKVSLNEK---SEFCEACQLGKSHCLPFSRSISRSTQPLALVHCDLWGPSPVVSTL
        S+              C+   + WH RLGHAS  IV+ ++QSCN+    N+    S  C +CQL KSH LP   S+S +++PL LVH DLWGP+PV ST 
Subjt:  SVLKSSSVFQAESPKSCNVSRSTWHKRLGHASDRIVSHVLQSCNLKVSLNEK---SEFCEACQLGKSHCLPFSRSISRSTQPLALVHCDLWGPSPVVSTL

Query:  GYRYYVSFVDDFSRFTHIYPLKSKGEALVSFQTYKSLVENKFEKRIKVLQTDWG---------------------------------------EMGLTFL
        G RY++ F+DD+SR+T  YPL++K +AL  F+ +K  VEN+F+ +IK LQ+D G                                       E GL  L
Subjt:  GYRYYVSFVDDFSRFTHIYPLKSKGEALVSFQTYKSLVENKFEKRIKVLQTDWG---------------------------------------EMGLTFL

Query:  AQAAMPLSYWWDAFCSAVFVINQLPTPVLDYLSPWEKAFKSSPDFEFLRTFGCAYFPCLWPYQSHKFQFHSSKCVFLGYSPSHK-----------VLNSP
        A A++P+ +W  AF +A F+IN++P+ VL   SP+   F+  PD++ LR FGC  +P + PY SHK Q+ S + +FLGYS  +K           V  +P
Subjt:  AQAAMPLSYWWDAFCSAVFVINQLPTPVLDYLSPWEKAFKSSPDFEFLRTFGCAYFPCLWPYQSHKFQFHSSKCVFLGYSPSHK-----------VLNSP

Query:  ---------PLSLPEP--------STCVSPMPQPAIPAPSSEFQATSPSLGTTSPIQHENEGSISICVDTASPSGSVSP-----IQPVVSLPVGHHMQTR
                 PL+   P        +  ++P    + PAP+     +  S  ++SP    +E S S+   T +P  S  P      QP  S P    M TR
Subjt:  ---------PLSLPEP--------STCVSPMPQPAIPAPSSEFQATSPSLGTTSPIQHENEGSISICVDTASPSGSVSP-----IQPVVSLPVGHHMQTR

Query:  SKSGIFKPKDWGSFLAASSDDVVIEPVLVKDALQSAIWKSAIDQEIAALHRNKTWTLVPPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQIPG
           GI + K      A      + EP  +K AL+   W  A+D EIAALHRN+TW LV  P  VNLIG KWV+K+K K DGS ++ KARLVAKG+NQ  G
Subjt:  SKSGIFKPKDWGSFLAASSDDVVIEPVLVKDALQSAIWKSAIDQEIAALHRNKTWTLVPPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQIPG

Query:  VDFHETFSPVVKAPTIQIILTVAVSKSWPVRKLDVNNAILNGKLDEDVFMKQPIGFIDKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNSKA
        +D+ ETFSPVVKA TI+IILTVA+S  W +R+LDV+NA LNG+L+E V+M QP G++D   P+ VC+L KALYGL QAPRAW+ R  + LI W FSNS+ 
Subjt:  VDFHETFSPVVKAPTIQIILTVAVSKSWPVRKLDVNNAILNGKLDEDVFMKQPIGFIDKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNSKA

Query:  DNYVFFFHSDQLTVLVFIYVDDILVTGDSIGFIDQSVHRLNTVFALKDIGPLSYYLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRF
        D+ +F +  +  T++V +YVDDI++TG S   I   + +LN++FAL+D+G LSY+LG+EV      M L Q+KY+++L+ R G+   +P  TP   GK  
Subjt:  DNYVFFFHSDQLTVLVFIYVDDILVTGDSIGFIDQSVHRLNTVFALKDIGPLSYYLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRF

Query:  STFEGTPMHDPTLYRSVLGALQYITHTRPDISFIVNKLSQFLQQPSDIHWQAAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHC
        S F+G PM + T YRSV+GALQY+T TRPDI+F VNK  QF+QQP+  HW + KR+LRYLKGT+  GL L  S +L+I  ++DADW    DDRRS + + 
Subjt:  STFEGTPMHDPTLYRSVLGALQYITHTRPDISFIVNKLSQFLQQPSDIHWQAAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHC

Query:  LFIGHSLVSWSSKKQTVVARSSTESEYRSLAQTAADVTWVQSLLTELGLSSSTIPLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVLTKAVEIRY
        +++G +LVSWSS KQ VV+RSS ESEYR+LA   A++ W+Q+LL EL +    IPL+W DN+SA  + +NPVFH+R+KHIE+D+HF+RD+V+   +++ +
Subjt:  LFIGHSLVSWSSKKQTVVARSSTESEYRSLAQTAADVTWVQSLLTELGLSSSTIPLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVLTKAVEIRY

Query:  VPSSEQLADALTKPLSEGQFSSIRGKLGVLPSPSRLRG
        VP+ +Q AD LTK L+  +F S++ +L + P P  LRG
Subjt:  VPSSEQLADALTKPLSEGQFSSIRGKLGVLPSPSRLRG

RVX06084.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.7e-27639.78Show/hide
Query:  KVSNMTDNVSSTSSAAETANAAVNVFSAANIVSSSFGHPLSTVLTVKIDEKNYLLWRGMVLAILRGQKVDGYVLGTKAKPSEMVEAVTDAGKELRPNPLF
        +++++   +SST S+   A    +   A   +       L+  L +K+D  NY+LWR  +  ++     + ++ G K  P +     T +G+    NP F
Subjt:  KVSNMTDNVSSTSSAAETANAAVNVFSAANIVSSSFGHPLSTVLTVKIDEKNYLLWRGMVLAILRGQKVDGYVLGTKAKPSEMVEAVTDAGKELRPNPLF

Query:  EEWVTVDQALSGWLFGSMTPTIAADVVSFKTSREVWTTLEEVYGATSKARINQLRGILQNTKKGFMKMIAFLAVMKQASENLQLAGNPVCLGDLISYVLA
          W   D+ +  W++ S+TP I   +V +++S   W  LE ++ A+S+AR+ QLR   Q T+KG + M+ ++  +K  ++NL   G PV   D I  +L 
Subjt:  EEWVTVDQALSGWLFGSMTPTIAADVVSFKTSREVWTTLEEVYGATSKARINQLRGILQNTKKGFMKMIAFLAVMKQASENLQLAGNPVCLGDLISYVLA

Query:  SLSPEYILIVSSINDK-DVKTWQELSSILVTFEGTLAKYS----NDVVSADLPDLSIQ-------LALNRPNQFSA------------------------
         L  +Y  IV+S+  + D  +   + SIL+T E  L+  +    ++V+SA+L     Q          NR + F+                         
Subjt:  SLSPEYILIVSSINDK-DVKTWQELSSILVTFEGTLAKYS----NDVVSADLPDLSIQ-------LALNRPNQFSA------------------------

Query:  ------CYMRFEESFNDLHAGNSSGQI---GSSGNSSAYIATLDILNDPKWLADSGATTHVTADVGNLGVKTDYNGKESLIVGDGSHLNISHVGYTTVPS
              CY RF+ +F   +    + Q     +     A +A+   ++D  W  D+GAT H++  +  L     Y G + +IVG+G HL I H G T  PS
Subjt:  ------CYMRFEESFNDLHAGNSSGQI---GSSGNSSAYIATLDILNDPKWLADSGATTHVTADVGNLGVKTDYNGKESLIVGDGSHLNISHVGYTTVPS

Query:  NLPNTAPVYLKNMLHVPSIKRNLISIACLTADNNVFVEFHSNACIVKDKVSRRVILQGTLKKRLYEIELPSIPKVNSGVQSPVVAPVFEYPNKSSVLKSS
           ++    L+ +LHVP I  NLIS++   ADNN F EFH     VKD+V+++++LQG+L+  LY      +P   + V                    S
Subjt:  NLPNTAPVYLKNMLHVPSIKRNLISIACLTADNNVFVEFHSNACIVKDKVSRRVILQGTLKKRLYEIELPSIPKVNSGVQSPVVAPVFEYPNKSSVLKSS

Query:  SVFQAESPKSCNVSRSTWHKRLGHASDRIVSHVLQSCNLKVSLNEKSEFCEACQLGKSHCLPFSRSISRSTQPLALVHCDLWGPSPVVSTLGYRYYVSFV
        S +   S  S   + + WH RLGH +D I+ H+L SCN+    + K+  C ACQ  KSH LPF+ S+SR++ PLAL+H DLWGP+ + ST G RY++ FV
Subjt:  SVFQAESPKSCNVSRSTWHKRLGHASDRIVSHVLQSCNLKVSLNEKSEFCEACQLGKSHCLPFSRSISRSTQPLALVHCDLWGPSPVVSTLGYRYYVSFV

Query:  DDFSRFTHIYPLKSKGEALVSFQTYKSLVENKFEKRIKVLQTDWG---------------------------------------EMGLTFLAQAAMPLSY
        DDFSRF+ IYPL SK +AL  F  +KSLVEN+F  RI+ L++D G                                       E GL  LA A++P  +
Subjt:  DDFSRFTHIYPLKSKGEALVSFQTYKSLVENKFEKRIKVLQTDWG---------------------------------------EMGLTFLAQAAMPLSY

Query:  WWDAFCSAVFVINQLPTPVLDYLSPWEKAFKSSPDFEFLRTFGCAYFPCLWPYQSHKFQFHSSKCVFLGYSPSHK---VLNS------------------
        W  AF + +F+IN+LPT VL+Y SP++  F  SP++   + FGC  +P + PY  +K  + SS+CVFLGYS +HK    LN                   
Subjt:  WWDAFCSAVFVINQLPTPVLDYLSPWEKAFKSSPDFEFLRTFGCAYFPCLWPYQSHKFQFHSSKCVFLGYSPSHK---VLNS------------------

Query:  PPLSLPEPSTCVSPMPQPAI-----PAPSSEFQATSPSLGTTSPIQHENEGSISICVDTASPSGSVSPIQPVVSLPVGHH-MQTRSKSGIFKPKDWGSFL
        P  S P+ S+ V  +P PA      P  SS    T+PS  ++ P+ +    +IS+      P   +S  +P    P   H M TR+K+GI K K + S  
Subjt:  PPLSLPEPSTCVSPMPQPAI-----PAPSSEFQATSPSLGTTSPIQHENEGSISICVDTASPSGSVSPIQPVVSLPVGHH-MQTRSKSGIFKPKDWGSFL

Query:  AASSDDVVIEPVLVKDALQSAIWKSAIDQEIAALHRNKTWTLVPPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQIPGVDFHETFSPVVKAPT
               + EP     A++ + W  A+++E +AL RN TW LVPPPSN N+IG KWV+K+K K DG+ D+ KARLVA+GF Q  G+D+ ETFSPVVKA T
Subjt:  AASSDDVVIEPVLVKDALQSAIWKSAIDQEIAALHRNKTWTLVPPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQIPGVDFHETFSPVVKAPT

Query:  IQIILTVAVSKSWPVRKLDVNNAILNGKLDEDVFMKQPIGFIDKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNSKADNYVFFFHSDQLTVL
        I+IIL VA+S +W V +LDV NA L+G L+E VFM+QP GFI+  +PS+VCKL+KALYGL QAPRAWY++   +L+ W F  S+AD+ +F  HS    ++
Subjt:  IQIILTVAVSKSWPVRKLDVNNAILNGKLDEDVFMKQPIGFIDKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNSKADNYVFFFHSDQLTVL

Query:  VFIYVDDILVTGDSIGFIDQSVHRLNTVFALKDIGPLSYYLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFSTFEGTPMHDPTLYR
        + IYVDDILVTG S   +   + RLN  FAL+D+G ++Y+LG+EV    +  +L Q KY  +L+ R  +   +P  TP + G+  S  +G P  D TLYR
Subjt:  VFIYVDDILVTGDSIGFIDQSVHRLNTVFALKDIGPLSYYLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFSTFEGTPMHDPTLYR

Query:  SVLGALQYITHTRPDISFIVNKLSQFLQQPSDIHWQAAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHSLVSWSSKKQ
        S +GALQY+T TRPDISF VNK  QF+  P+  HW A KR+LRYLKGTL YG+ + +S  L I  YTDADWA   DDRRS   + +F+G +LVSWSS KQ
Subjt:  SVLGALQYITHTRPDISFIVNKLSQFLQQPSDIHWQAAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHSLVSWSSKKQ

Query:  TVVARSSTESEYRSLAQTAADVTWVQSLLTELGLSSSTIPLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVLTKAVEIRYVPSSEQLADALTKPL
         VV+RSS ESEYR+LA   +++ W+Q +L EL LSSS+ PL+WCDN SA  L  NPVFH+R+KHIE+D+HF+RD VL K + I+Y+PS+EQ+AD  TK +
Subjt:  TVVARSSTESEYRSLAQTAADVTWVQSLLTELGLSSSTIPLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVLTKAVEIRYVPSSEQLADALTKPL

Query:  SEGQFSSIRGKLGVLPSPSRLRG
        S  QF S R KL V+PSP  LRG
Subjt:  SEGQFSSIRGKLGVLPSPSRLRG

TrEMBL top hitse value%identityAlignment
A0A2Z6MBG6 Integrase catalytic domain-containing protein1.4e-30040.28Show/hide
Query:  ANIVSSSFGHPLSTVLTVKIDEKNYLLWRGMVLAILRGQKVDGYVLGTKAKPSEMVEAVTDAGKELRPNPLFEEWVTVDQALSGWLFGSMTPTIAADVVS
        A+   S+  + L + ++VK+D  NY LW+ +VL ++RG K+DGY+LGT+  P E    +T +      N  F EW   DQ L GW+  SMT  IA  ++ 
Subjt:  ANIVSSSFGHPLSTVLTVKIDEKNYLLWRGMVLAILRGQKVDGYVLGTKAKPSEMVEAVTDAGKELRPNPLFEEWVTVDQALSGWLFGSMTPTIAADVVS

Query:  FKTSREVWTTLEEVYGATSKARINQLRGILQNTKKGFMKMIAFLAVMKQASENLQLAGNPVCLGDLISYVLASLSPEYILIVSSINDKDVKTWQELSSIL
         +TS+++W   + + GA ++++I  L+    + +KG MKM  +L  MK   + L+LAGNPV   DLI   L  L  EY  +V  ++D+   +W +L + L
Subjt:  FKTSREVWTTLEEVYGATSKARINQLRGILQNTKKGFMKMIAFLAVMKQASENLQLAGNPVCLGDLISYVLASLSPEYILIVSSINDKDVKTWQELSSIL

Query:  VTFEGTLAKYSNDVVSADLPDLSIQLALNRPNQFS-------------------------------------------ACYMRFEESFNDLHAGNSSGQI
        +TFE  + + +N      L +L++    N  N+                                              C+ RF+++++     N S   
Subjt:  VTFEGTLAKYSNDVVSADLPDLSIQLALNRPNQFS-------------------------------------------ACYMRFEESFNDLHAGNSSGQI

Query:  GSSGNSSAYIATLDILNDPKWLADSGATTHVTADVGNLGVKTDYNGKESLIVGDGSHLNISHVGYTTVPSNLPNTAPVYLKNMLHVPSIKRNLISIACLT
           G+ +A++A+ + + D  W  DSGA+ HVT         T+++GK SL+VG+G  L I   G + + S       + L ++L+VP+I +NL+S++ L 
Subjt:  GSSGNSSAYIATLDILNDPKWLADSGATTHVTADVGNLGVKTDYNGKESLIVGDGSHLNISHVGYTTVPSNLPNTAPVYLKNMLHVPSIKRNLISIACLT

Query:  ADNNVFVEFHSNACIVKDKVSRRVILQGTLKKRLYEIELPSIPKVNSGVQSPVVAPVFEYPNKSSVLKSSSVFQAESPKSCNVSRSTWHKRLGHASDRIV
        ADNN+ VEF  N C VKDK++ +VIL+G LK  LY++         SG +                          +P +    + +WH+RLGH +++++
Subjt:  ADNNVFVEFHSNACIVKDKVSRRVILQGTLKKRLYEIELPSIPKVNSGVQSPVVAPVFEYPNKSSVLKSSSVFQAESPKSCNVSRSTWHKRLGHASDRIV

Query:  SHVLQSCNLKVSLNEKSEFCEACQLGKSHCLPFSRSISRSTQPLALVHCDLWGPSPVVSTLGYRYYVSFVDDFSRFTHIYPLKSKGEALVSFQTYKSLVE
          VL+SC +KV  ++   FCEACQ GK H LPF  S S + +PL LVH D+WGP+P++++ G++YYV FVDDFSRFT IYPLK K E + +F  +K+L E
Subjt:  SHVLQSCNLKVSLNEKSEFCEACQLGKSHCLPFSRSISRSTQPLALVHCDLWGPSPVVSTLGYRYYVSFVDDFSRFTHIYPLKSKGEALVSFQTYKSLVE

Query:  NKFEKRIKVLQTDWG---------------------------------------EMGLTFLAQAAMPLSYWWDAFCSAVFVINQLPTPVLDYLSPWEKAF
        N+F KRIKV+Q D G                                       E GLT LAQA MPL YWW+AF +AV++IN+LP+ V    SP+    
Subjt:  NKFEKRIKVLQTDWG---------------------------------------EMGLTFLAQAAMPLSYWWDAFCSAVFVINQLPTPVLDYLSPWEKAF

Query:  KSSPDFEFLRTFGCAYFPCLWPYQSHKFQFHSSKCVFLGYSPSHK-----------------VLNSPPLSLPEPSTCVSPMPQPAIPAPSSEF-------
        +  PD++ L+TFGCA +PCL PY  HK Q+H+++CVFLGYS SHK                 + N       +         +  I  PS+ F       
Subjt:  KSSPDFEFLRTFGCAYFPCLWPYQSHKFQFHSSKCVFLGYSPSHK-----------------VLNSPPLSLPEPSTCVSPMPQPAIPAPSSEF-------

Query:  ---QATSPSLGTTSPIQHENEGSISICVDTA----SPSGSVSPIQPVVSLP-------------VGHHMQTRSKSGIFKPKDWGSFLAASSDDVVIEPVL
            A+ P L   +P +   E S  +  DT      PS   +  +  + +                H + TRSKSGI KPK     L  +  D  +EP  
Subjt:  ---QATSPSLGTTSPIQHENEGSISICVDTA----SPSGSVSPIQPVVSLP-------------VGHHMQTRSKSGIFKPKDWGSFLAASSDDVVIEPVL

Query:  VKDALQSAIWKSAIDQEIAALHRNKTWTLVPPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQIPGVDFHETFSPVVKAPTIQIILTVAVSKSW
         K+AL   +WK A+ +E  AL  NKTW LVP  +  N++ SKWVFK K K DGS ++ KARLVAKGF Q  G+D+ ETFSPV+KA T++IIL++AV  +W
Subjt:  VKDALQSAIWKSAIDQEIAALHRNKTWTLVPPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQIPGVDFHETFSPVVKAPTIQIILTVAVSKSW

Query:  PVRKLDVNNAILNGKLDEDVFMKQPIGFIDKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNSKADNYVFFFHSDQLTVLVFIYVDDILVTGD
         VR+LD+NNA LNG L E VFM QP GF+D   P+++CKL KA+YGL QAPRAW+D  K  L+ W F N+K+D+ +F          + IYVDDI+VTG 
Subjt:  PVRKLDVNNAILNGKLDEDVFMKQPIGFIDKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNSKADNYVFFFHSDQLTVLVFIYVDDILVTGD

Query:  SIGFIDQSVHRLNTVFALKDIGPLSYYLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFSTFEGTPMHDPTLYRSVLGALQYITHTR
        +  F+   + +LN  F+LKD+G L Y+LG+EV R  SGMYL QSKYI +L+++  + +  PC TP++TG++F T EG  + DPT++R  +G LQY+THT 
Subjt:  SIGFIDQSVHRLNTVFALKDIGPLSYYLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFSTFEGTPMHDPTLYRSVLGALQYITHTR

Query:  PDISFIVNKLSQFLQQPSDIHWQAAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHSLVSWSSKKQTVVARSSTESEYR
        PDI+F VNKLSQ++  PS  HWQ  KR+LRYL+GT++Y L +  S DL I  ++DADWA ++DDR+S++  C+F+G +L+SWSS+KQ VV+RSSTESEYR
Subjt:  PDISFIVNKLSQFLQQPSDIHWQAAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHSLVSWSSKKQTVVARSSTESEYR

Query:  SLAQTAADVTWVQSLLTELGLSSSTIPLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVLTKAVEIRYVPSSEQLADALTKPLSEGQFSSIRGKLG
        +LA  AA++ W++SLLTEL L     P++WCDN+SA +L  NPV H+RSKHIE+D+H++RD+VL   V + YVP+++Q+AD LTKPLS  +FS +R KLG
Subjt:  SLAQTAADVTWVQSLLTELGLSSSTIPLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVLTKAVEIRYVPSSEQLADALTKPLSEGQFSSIRGKLG

Query:  VLPSP
        V+ SP
Subjt:  VLPSP

A0A2Z6P4D5 Integrase catalytic domain-containing protein4.2e-29240.74Show/hide
Query:  SAANIVSSSFGHPLSTVLTVKIDEKNYLLWRGMVLAILRGQKVDGYVLGTKAKPSEMVEAVTDAGKELRPNPLFEEWVTVDQALSGWLFGSMTPTIAADV
        SAAN   S   + L ++++VK+D  NY LW+ +VL+++RG K+DGY+LGT   P +    VT A K  + NP F +W+  DQAL GWL  SM   IA  +
Subjt:  SAANIVSSSFGHPLSTVLTVKIDEKNYLLWRGMVLAILRGQKVDGYVLGTKAKPSEMVEAVTDAGKELRPNPLFEEWVTVDQALSGWLFGSMTPTIAADV

Query:  VSFKTSREVWTTLEEVYGATSKARINQLRGILQNTKKGFMKMIAFLAVMKQASENLQLAGNPVCLGDLISYVLASLSPEYILIVSSINDKDVKTWQELSS
        +  +TS+++W   + + GA +K+RI  L+    NT+KG MKM  +L  MK  S+ L+LAG+P+   DL+   L  L  EY  +V  ++D+   +W ++ +
Subjt:  VSFKTSREVWTTLEEVYGATSKARINQLRGILQNTKKGFMKMIAFLAVMKQASENLQLAGNPVCLGDLISYVLASLSPEYILIVSSINDKDVKTWQELSS

Query:  ILVTFEGTLAKYSNDVVSADLPDLSIQLALN-------RPNQFSACYMRFEESFNDLHAGNSSGQIGSS-------------------------------
         L+ FE  L +++N         L++  + N       R N+F++       +F  +  G   G++ ++                               
Subjt:  ILVTFEGTLAKYSNDVVSADLPDLSIQLALN-------RPNQFSACYMRFEESFNDLHAGNSSGQIGSS-------------------------------

Query:  ----GNSSAYIATLDILNDPKWLADSGATTHVTADVGNLGVKTDYNGKESLIVGDGSHLNISHVGYTTVPSNLPNTAPVYLKNMLHVPSIKRNLISIACL
            G+ SA+IA+     D +W  DSGA  HVT          ++NGK SL+VG+G  L I   G T + +       + L ++L+VP I +NL+S++ L
Subjt:  ----GNSSAYIATLDILNDPKWLADSGATTHVTADVGNLGVKTDYNGKESLIVGDGSHLNISHVGYTTVPSNLPNTAPVYLKNMLHVPSIKRNLISIACL

Query:  TADNNVFVEFHSNACIVKDKVSRRVILQGTLKKRLYEIELPSIPKVNSGVQSPVVAPVFEYPNKSSVLKSSSVFQAESPKSCNVSRSTWHKRLGHASDRI
        TADNN+ VEF +N C VKDK++ + +L+G LK  LY++                        NK   +  S              + +WH++LGH ++++
Subjt:  TADNNVFVEFHSNACIVKDKVSRRVILQGTLKKRLYEIELPSIPKVNSGVQSPVVAPVFEYPNKSSVLKSSSVFQAESPKSCNVSRSTWHKRLGHASDRI

Query:  VSHVLQSCNLKVSLNEKSEFCEACQLGKSHCLPFSRSISRSTQPLALVHCDLWGPSPVVSTLGYRYYVSFVDDFSRFTHIYPLKSKGEALVSFQTYKSLV
        +  VL+ CN+K+S +++  FCEACQ GK H LPF  S S   +PLAL+H D+WGP+P++S  G++YYV F+DDFSRFT I+PLK K + + +F  +K+L 
Subjt:  VSHVLQSCNLKVSLNEKSEFCEACQLGKSHCLPFSRSISRSTQPLALVHCDLWGPSPVVSTLGYRYYVSFVDDFSRFTHIYPLKSKGEALVSFQTYKSLV

Query:  ENKFEKRIKVLQTDWG---------------------------------------EMGLTFLAQAAMPLSYWWDAFCSAVFVINQLPTPVLDYLSPWEKA
        EN+F K+IK++Q D G                                       E+GLT LAQA MPL YWW+AF +AV++IN+LP+ V    SP+   
Subjt:  ENKFEKRIKVLQTDWG---------------------------------------EMGLTFLAQAAMPLSYWWDAFCSAVFVINQLPTPVLDYLSPWEKA

Query:  FKSSPDFEFLRTFGCAYFPCLWPYQSHKFQFHSSKCVFLGYSPSH---KVLNSPP---------------------LSLPEP---STCVSPMPQPAIPAP
        FK  PD+  L+ FGCA +PCL PY  HK QFH+++CVF+GYS SH   K +NS                       L    P    T  S +  P   A 
Subjt:  FKSSPDFEFLRTFGCAYFPCLWPYQSHKFQFHSSKCVFLGYSPSH---KVLNSPP---------------------LSLPEP---STCVSPMPQPAIPAP

Query:  SSEFQATSPSLGTT---------SPIQHENE---GSISICVDTASPS-------GSVSPIQPVVSLPVG-------------HHMQTRSKSGIFKPKDWG
        ++   A  P   TT         S   +ENE    S    V+T + S        SV       S   G             H M+TRSK GI KPK   
Subjt:  SSEFQATSPSLGTT---------SPIQHENE---GSISICVDTASPS-------GSVSPIQPVVSLPVG-------------HHMQTRSKSGIFKPKDWG

Query:  SFLAASSDDVVIEPVLVKDALQSAIWKSAIDQEIAALHRNKTWTLVPPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQIPGVDFHETFSPVVK
            A +D    EP  VK+AL   +WK A+D+E  AL  N TWTLVP     N+I SKW+FK K KSDGS ++ KARLVAKGF Q  G+DF ETFSPVVK
Subjt:  SFLAASSDDVVIEPVLVKDALQSAIWKSAIDQEIAALHRNKTWTLVPPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQIPGVDFHETFSPVVK

Query:  APTIQIILTVAVSKSWPVRKLDVNNAILNGKLDEDVFMKQPIGFIDKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNSKADNYVFFFHSDQL
        + T++IILT+AV  +W VR+LD+NNA LNGKL E VFM QP G+ID   P+++CKL KA+YGL QAPRAWYD  ++TL+ W F N+K D  +FF      
Subjt:  APTIQIILTVAVSKSWPVRKLDVNNAILNGKLDEDVFMKQPIGFIDKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNSKADNYVFFFHSDQL

Query:  TVLVFIYVDDILVTGDSIGFIDQSVHRLNTVFALKDIGPLSYYLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFSTFEGTPMHDPT
        T  + IYVDDI+VTG +I F++   ++LNT ++LKD+GPL Y+LGVEVHR  SGMYL Q+KYI +++++  + +   C TP+VTG++F   EG  M +PT
Subjt:  TVLVFIYVDDILVTGDSIGFIDQSVHRLNTVFALKDIGPLSYYLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFSTFEGTPMHDPT

Query:  LYRSVLGALQYITHTRPDISFIVNKLSQFLQQPSDIHWQAAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHSLVSWSS
        LYR  +GALQY+T+TRPDI+F VNKLSQ++  P+  HWQ  KR+LRYL+GT ++ L +  S +L I  + DADWA + DDR+S    C+F+G +LVSW+S
Subjt:  LYRSVLGALQYITHTRPDISFIVNKLSQFLQQPSDIHWQAAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHSLVSWSS

Query:  KKQTVVARSSTESEYRSLAQTAADVTWVQ-------------------SLLTELGLSSSTIPLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVLT
        +KQ VV+RSSTESEYRSLA   A+V+                      +LL EL L     P++WCDN+SA +L  NPV H+RSKHIE+DMH++RD+VL 
Subjt:  KKQTVVARSSTESEYRSLAQTAADVTWVQ-------------------SLLTELGLSSSTIPLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVLT

Query:  KAVEIRYVPSSEQLADALTKPLSEGQFSSIRGKLGVLPSPS
          V I YVP+++Q+AD LTKPL   +F+ +R KLGV  SPS
Subjt:  KAVEIRYVPSSEQLADALTKPLSEGQFSSIRGKLGVLPSPS

A0A438FWJ3 Retrovirus-related Pol polyprotein from transposon TNT 1-942.4e-27940.53Show/hide
Query:  LSTVLTVKIDEKNYLLWRGMVLAILRGQKVDGYVLGTKAKPSEMVEAVTDAGKELRPNPLFEEWVTVDQALSGWLFGSMTPTIAADVVSFKTSREVWTTL
        L+  L +K+D  NY+LWR  +  ++     + ++ G K  P +     T +G+    NP F  W   D+ +  W++ S+TP I   +V +++S   W  L
Subjt:  LSTVLTVKIDEKNYLLWRGMVLAILRGQKVDGYVLGTKAKPSEMVEAVTDAGKELRPNPLFEEWVTVDQALSGWLFGSMTPTIAADVVSFKTSREVWTTL

Query:  EEVYGATSKARINQLRGILQNTKKGFMKMIAFLAVMKQASENLQLAGNPVCLGDLISYVLASLSPEYILIVSSINDK-DVKTWQELSSILVTFEGTLAKY
        E ++ A+S+AR+ QLR   Q T+KG + M+ ++  +K  ++NL   G PV   D I  +L  L  +Y  IV+S+  + D  +   + SIL+T E  L+  
Subjt:  EEVYGATSKARINQLRGILQNTKKGFMKMIAFLAVMKQASENLQLAGNPVCLGDLISYVLASLSPEYILIVSSINDK-DVKTWQELSSILVTFEGTLAKY

Query:  S----NDVVSADLPDLSIQ-------LALNRPNQFSA------------------------------CYMRFEESFNDLHAGNSSGQI---GSSGNSSAY
        +    ++V+SA+L     Q          NR + F+                               CY RF+ +F   +    + Q     +     A 
Subjt:  S----NDVVSADLPDLSIQ-------LALNRPNQFSA------------------------------CYMRFEESFNDLHAGNSSGQI---GSSGNSSAY

Query:  IATLDILNDPKWLADSGATTHVTADVGNLGVKTDYNGKESLIVGDGSHLNISHVGYTTVPSNLPNTAPVYLKNMLHVPSIKRNLISIACLTADNNVFVEF
        +A+   ++D  W  D+GAT H++  +  L     Y G + +IVG+G HL I H G T  PS   ++    L+ +LHVP I  NLIS++   ADNN F EF
Subjt:  IATLDILNDPKWLADSGATTHVTADVGNLGVKTDYNGKESLIVGDGSHLNISHVGYTTVPSNLPNTAPVYLKNMLHVPSIKRNLISIACLTADNNVFVEF

Query:  HSNACIVKDKVSRRVILQGTLKKRLYEIELPSIPKVNSGVQSPVVAPVFEYPNKSSVLKSSSVFQAESPKSCNVSRSTWHKRLGHASDRIVSHVLQSCNL
        H     VKD+V+++++LQG+L+  LY      +P   + V                    SS +   S  S   + + WH RLGH ++ I+ H+L SCN+
Subjt:  HSNACIVKDKVSRRVILQGTLKKRLYEIELPSIPKVNSGVQSPVVAPVFEYPNKSSVLKSSSVFQAESPKSCNVSRSTWHKRLGHASDRIVSHVLQSCNL

Query:  KVSLNEKSEFCEACQLGKSHCLPFSRSISRSTQPLALVHCDLWGPSPVVSTLGYRYYVSFVDDFSRFTHIYPLKSKGEALVSFQTYKSLVENKFEKRIKV
            + K+  C ACQ  KSH LPF+ S+SR++ PLAL+H DLWGP+ + ST G RY++ FVDDFSRF+ IYPL SK +AL  F  +KSLVEN+F  RI+ 
Subjt:  KVSLNEKSEFCEACQLGKSHCLPFSRSISRSTQPLALVHCDLWGPSPVVSTLGYRYYVSFVDDFSRFTHIYPLKSKGEALVSFQTYKSLVENKFEKRIKV

Query:  LQTDWG---------------------------------------EMGLTFLAQAAMPLSYWWDAFCSAVFVINQLPTPVLDYLSPWEKAFKSSPDFEFL
        L++D G                                       E GL  LA A++P  +W  AF +A+F+IN+LPT VL+Y SP++  F  SP++   
Subjt:  LQTDWG---------------------------------------EMGLTFLAQAAMPLSYWWDAFCSAVFVINQLPTPVLDYLSPWEKAFKSSPDFEFL

Query:  RTFGCAYFPCLWPYQSHKFQFHSSKCVFLGYSPSHK---VLNS------------------PPLSLPEPSTCVSPMPQPAI-----PAPSSEFQATSPSL
        + FGC  +P + PY  +K  + SS+CVFLGYS +HK    LN                   P  S P+ S+ V  +P PA      P  SS    T+PS 
Subjt:  RTFGCAYFPCLWPYQSHKFQFHSSKCVFLGYSPSHK---VLNS------------------PPLSLPEPSTCVSPMPQPAI-----PAPSSEFQATSPSL

Query:  GTTSPIQHENEGSISICVDTASPSGSVSPIQPVVSLPVGHH-MQTRSKSGIFKPKDWGSFLAASSDDVVIEPVLVKDALQSAIWKSAIDQEIAALHRNKT
         ++ P+ +    +IS+      P   +S  +P    P   H M TR+K+GI K K + S         + EP     A++ + W  A+++E +AL RN T
Subjt:  GTTSPIQHENEGSISICVDTASPSGSVSPIQPVVSLPVGHH-MQTRSKSGIFKPKDWGSFLAASSDDVVIEPVLVKDALQSAIWKSAIDQEIAALHRNKT

Query:  WTLVPPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQIPGVDFHETFSPVVKAPTIQIILTVAVSKSWPVRKLDVNNAILNGKLDEDVFMKQPI
        W LVPPPSN N+IG KWV+K+K K DG+ D+ KARLVA+GF Q  G+D+ ETFSPVVKA TI+IIL VA+S +W V +LDV NA L+G L+E VFM+QP 
Subjt:  WTLVPPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQIPGVDFHETFSPVVKAPTIQIILTVAVSKSWPVRKLDVNNAILNGKLDEDVFMKQPI

Query:  GFIDKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNSKADNYVFFFHSDQLTVLVFIYVDDILVTGDSIGFIDQSVHRLNTVFALKDIGPLSY
        GFI+  +PS+VCKL+KALYGL QAPRAWY++   +L+ W F  S+AD+ +F  HS    +++ IYVDDILVTG S   +   + RLN+ FAL+D+G ++Y
Subjt:  GFIDKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNSKADNYVFFFHSDQLTVLVFIYVDDILVTGDSIGFIDQSVHRLNTVFALKDIGPLSY

Query:  YLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFSTFEGTPMHDPTLYRSVLGALQYITHTRPDISFIVNKLSQFLQQPSDIHWQAAK
        +LG+EV R  +  +L Q KY  +L+ R  +   +P  TP + G+  S  +G P  D TLYRS +GALQY+T TRPDISF VNK  QF+  P+  HW A K
Subjt:  YLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFSTFEGTPMHDPTLYRSVLGALQYITHTRPDISFIVNKLSQFLQQPSDIHWQAAK

Query:  RVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHSLVSWSSKKQTVVARSSTESEYRSLAQTAADVTWVQSLLTELGLSSSTI
        R+LRYLKGTL YG+ + +S  L I  YTDADWA   DDRRS   + +F+G +LVSWSS KQ VV+RSS ESEYR+LA   +++ W+Q +L EL LSSS+ 
Subjt:  RVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHSLVSWSSKKQTVVARSSTESEYRSLAQTAADVTWVQSLLTELGLSSSTI

Query:  PLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVLTKAVEIRYVPSSEQLADALTKPLSEGQFSSIRGKLGVLPSPSRLRG
        PL+WCDN SA  L  NPVFH+R+KHIE+D+HF+RD VL K + I+Y+PS+EQ+AD  TK +S  QF S R KL V+PSP  LRG
Subjt:  PLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVLTKAVEIRYVPSSEQLADALTKPLSEGQFSSIRGKLGVLPSPSRLRG

A0A438JAU4 Retrovirus-related Pol polyprotein from transposon TNT 1-948.4e-27739.78Show/hide
Query:  KVSNMTDNVSSTSSAAETANAAVNVFSAANIVSSSFGHPLSTVLTVKIDEKNYLLWRGMVLAILRGQKVDGYVLGTKAKPSEMVEAVTDAGKELRPNPLF
        +++++   +SST S+   A    +   A   +       L+  L +K+D  NY+LWR  +  ++     + ++ G K  P +     T +G+    NP F
Subjt:  KVSNMTDNVSSTSSAAETANAAVNVFSAANIVSSSFGHPLSTVLTVKIDEKNYLLWRGMVLAILRGQKVDGYVLGTKAKPSEMVEAVTDAGKELRPNPLF

Query:  EEWVTVDQALSGWLFGSMTPTIAADVVSFKTSREVWTTLEEVYGATSKARINQLRGILQNTKKGFMKMIAFLAVMKQASENLQLAGNPVCLGDLISYVLA
          W   D+ +  W++ S+TP I   +V +++S   W  LE ++ A+S+AR+ QLR   Q T+KG + M+ ++  +K  ++NL   G PV   D I  +L 
Subjt:  EEWVTVDQALSGWLFGSMTPTIAADVVSFKTSREVWTTLEEVYGATSKARINQLRGILQNTKKGFMKMIAFLAVMKQASENLQLAGNPVCLGDLISYVLA

Query:  SLSPEYILIVSSINDK-DVKTWQELSSILVTFEGTLAKYS----NDVVSADLPDLSIQ-------LALNRPNQFSA------------------------
         L  +Y  IV+S+  + D  +   + SIL+T E  L+  +    ++V+SA+L     Q          NR + F+                         
Subjt:  SLSPEYILIVSSINDK-DVKTWQELSSILVTFEGTLAKYS----NDVVSADLPDLSIQ-------LALNRPNQFSA------------------------

Query:  ------CYMRFEESFNDLHAGNSSGQI---GSSGNSSAYIATLDILNDPKWLADSGATTHVTADVGNLGVKTDYNGKESLIVGDGSHLNISHVGYTTVPS
              CY RF+ +F   +    + Q     +     A +A+   ++D  W  D+GAT H++  +  L     Y G + +IVG+G HL I H G T  PS
Subjt:  ------CYMRFEESFNDLHAGNSSGQI---GSSGNSSAYIATLDILNDPKWLADSGATTHVTADVGNLGVKTDYNGKESLIVGDGSHLNISHVGYTTVPS

Query:  NLPNTAPVYLKNMLHVPSIKRNLISIACLTADNNVFVEFHSNACIVKDKVSRRVILQGTLKKRLYEIELPSIPKVNSGVQSPVVAPVFEYPNKSSVLKSS
           ++    L+ +LHVP I  NLIS++   ADNN F EFH     VKD+V+++++LQG+L+  LY      +P   + V                    S
Subjt:  NLPNTAPVYLKNMLHVPSIKRNLISIACLTADNNVFVEFHSNACIVKDKVSRRVILQGTLKKRLYEIELPSIPKVNSGVQSPVVAPVFEYPNKSSVLKSS

Query:  SVFQAESPKSCNVSRSTWHKRLGHASDRIVSHVLQSCNLKVSLNEKSEFCEACQLGKSHCLPFSRSISRSTQPLALVHCDLWGPSPVVSTLGYRYYVSFV
        S +   S  S   + + WH RLGH +D I+ H+L SCN+    + K+  C ACQ  KSH LPF+ S+SR++ PLAL+H DLWGP+ + ST G RY++ FV
Subjt:  SVFQAESPKSCNVSRSTWHKRLGHASDRIVSHVLQSCNLKVSLNEKSEFCEACQLGKSHCLPFSRSISRSTQPLALVHCDLWGPSPVVSTLGYRYYVSFV

Query:  DDFSRFTHIYPLKSKGEALVSFQTYKSLVENKFEKRIKVLQTDWG---------------------------------------EMGLTFLAQAAMPLSY
        DDFSRF+ IYPL SK +AL  F  +KSLVEN+F  RI+ L++D G                                       E GL  LA A++P  +
Subjt:  DDFSRFTHIYPLKSKGEALVSFQTYKSLVENKFEKRIKVLQTDWG---------------------------------------EMGLTFLAQAAMPLSY

Query:  WWDAFCSAVFVINQLPTPVLDYLSPWEKAFKSSPDFEFLRTFGCAYFPCLWPYQSHKFQFHSSKCVFLGYSPSHK---VLNS------------------
        W  AF + +F+IN+LPT VL+Y SP++  F  SP++   + FGC  +P + PY  +K  + SS+CVFLGYS +HK    LN                   
Subjt:  WWDAFCSAVFVINQLPTPVLDYLSPWEKAFKSSPDFEFLRTFGCAYFPCLWPYQSHKFQFHSSKCVFLGYSPSHK---VLNS------------------

Query:  PPLSLPEPSTCVSPMPQPAI-----PAPSSEFQATSPSLGTTSPIQHENEGSISICVDTASPSGSVSPIQPVVSLPVGHH-MQTRSKSGIFKPKDWGSFL
        P  S P+ S+ V  +P PA      P  SS    T+PS  ++ P+ +    +IS+      P   +S  +P    P   H M TR+K+GI K K + S  
Subjt:  PPLSLPEPSTCVSPMPQPAI-----PAPSSEFQATSPSLGTTSPIQHENEGSISICVDTASPSGSVSPIQPVVSLPVGHH-MQTRSKSGIFKPKDWGSFL

Query:  AASSDDVVIEPVLVKDALQSAIWKSAIDQEIAALHRNKTWTLVPPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQIPGVDFHETFSPVVKAPT
               + EP     A++ + W  A+++E +AL RN TW LVPPPSN N+IG KWV+K+K K DG+ D+ KARLVA+GF Q  G+D+ ETFSPVVKA T
Subjt:  AASSDDVVIEPVLVKDALQSAIWKSAIDQEIAALHRNKTWTLVPPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQIPGVDFHETFSPVVKAPT

Query:  IQIILTVAVSKSWPVRKLDVNNAILNGKLDEDVFMKQPIGFIDKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNSKADNYVFFFHSDQLTVL
        I+IIL VA+S +W V +LDV NA L+G L+E VFM+QP GFI+  +PS+VCKL+KALYGL QAPRAWY++   +L+ W F  S+AD+ +F  HS    ++
Subjt:  IQIILTVAVSKSWPVRKLDVNNAILNGKLDEDVFMKQPIGFIDKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNSKADNYVFFFHSDQLTVL

Query:  VFIYVDDILVTGDSIGFIDQSVHRLNTVFALKDIGPLSYYLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFSTFEGTPMHDPTLYR
        + IYVDDILVTG S   +   + RLN  FAL+D+G ++Y+LG+EV    +  +L Q KY  +L+ R  +   +P  TP + G+  S  +G P  D TLYR
Subjt:  VFIYVDDILVTGDSIGFIDQSVHRLNTVFALKDIGPLSYYLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFSTFEGTPMHDPTLYR

Query:  SVLGALQYITHTRPDISFIVNKLSQFLQQPSDIHWQAAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHSLVSWSSKKQ
        S +GALQY+T TRPDISF VNK  QF+  P+  HW A KR+LRYLKGTL YG+ + +S  L I  YTDADWA   DDRRS   + +F+G +LVSWSS KQ
Subjt:  SVLGALQYITHTRPDISFIVNKLSQFLQQPSDIHWQAAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHSLVSWSSKKQ

Query:  TVVARSSTESEYRSLAQTAADVTWVQSLLTELGLSSSTIPLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVLTKAVEIRYVPSSEQLADALTKPL
         VV+RSS ESEYR+LA   +++ W+Q +L EL LSSS+ PL+WCDN SA  L  NPVFH+R+KHIE+D+HF+RD VL K + I+Y+PS+EQ+AD  TK +
Subjt:  TVVARSSTESEYRSLAQTAADVTWVQSLLTELGLSSSTIPLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVLTKAVEIRYVPSSEQLADALTKPL

Query:  SEGQFSSIRGKLGVLPSPSRLRG
        S  QF S R KL V+PSP  LRG
Subjt:  SEGQFSSIRGKLGVLPSPSRLRG

A0A803PM38 Uncharacterized protein3.6e-28039.2Show/hide
Query:  AANIVSSSFGHPLSTVLTVKIDEKNYLLWRGMVLAILRGQKVDGYVLGTKAKPSEMVEAVTDAGKEL---RPNPLFEEWVTVDQALSGWLFGSMTPTIAA
        A NIV   FG  L+    +K+D  N+ LWR MV AI+RG ++DGY+ GT  KP E + +    G      + NP FE+W+  DQ L GWL+GSMT  IA 
Subjt:  AANIVSSSFGHPLSTVLTVKIDEKNYLLWRGMVLAILRGQKVDGYVLGTKAKPSEMVEAVTDAGKEL---RPNPLFEEWVTVDQALSGWLFGSMTPTIAA

Query:  DVVSFKTSREVWTTLEEVYGATSKARINQLRGILQNTKKGFMKMIAFLAVMKQASENLQLAGNPVCLGDLISYVLASLSPEYILIVSSINDKDVKTWQEL
        +V+   +S  +WT LEE++GA SKA++++ R  +Q  +KG + M  +L   +Q ++ L LAG P     L+S VL+ L  EY+ +V  I  +   TWQ+L
Subjt:  DVVSFKTSREVWTTLEEVYGATSKARINQLRGILQNTKKGFMKMIAFLAVMKQASENLQLAGNPVCLGDLISYVLASLSPEYILIVSSINDKDVKTWQEL

Query:  SSILVTFEGTLAKYSNDVVSADL------PDLSIQLALNRPNQFSACYMRFEESFNDLHAGNSSGQIGSSGNSSAYIATLDIL----NDPKWLADSGATT
          +L++ +  + +  +   S+ L      P  S+      P      +       +  + G+++   G  G +S    T  +     +      + GA+ 
Subjt:  SSILVTFEGTLAKYSNDVVSADL------PDLSIQLALNRPNQFSACYMRFEESFNDLHAGNSSGQIGSSGNSSAYIATLDIL----NDPKWLADSGATT

Query:  HVTADVGNLGVKTDYNGKESLIVGDGSHLNISHVGYTTVPSNLPNTAPVYLKNMLHVPSIKRNLISIACLTADNNVFVEFHSNACIVKDKVSRRVILQGT
        H+T+++  + +K +YNGKE + V +G+ L I H+G  ++ +   + +P+ LK +LHVPSI +NL+SI+ LT+DNNV VEF S+ C VKDK + +V+L+G 
Subjt:  HVTADVGNLGVKTDYNGKESLIVGDGSHLNISHVGYTTVPSNLPNTAPVYLKNMLHVPSIKRNLISIACLTADNNVFVEFHSNACIVKDKVSRRVILQGT

Query:  LKKRLYEIELPSIPKVNSGVQSPVVAPVFEYPNKSSVLKSSSVFQAESPKSCNVSRSTWHKRLGHASDRIVSHVLQSCNLKVSLNEKSEFCEACQLGKSH
        LK  LY+ + P+     S  +S +  P        S ++S+      +   C++ +  WH+RLGH S R++  VL   N+K ++N    FC+ACQLGKSH
Subjt:  LKKRLYEIELPSIPKVNSGVQSPVVAPVFEYPNKSSVLKSSSVFQAESPKSCNVSRSTWHKRLGHASDRIVSHVLQSCNLKVSLNEKSEFCEACQLGKSH

Query:  CLPFSRSISRSTQPLALVHCDLWGPSPVVSTLGYRYYVSFVDDFSRFTHIYPLKSKGEALVSFQTYKSLVENKFEKRIKVLQTDWG--------------
         LPF  +  R+T PL LVH D+WGPSP++S   +RYY+ F+DDFSR+T IYPLK+K EAL +F  +K LVEN+F  R+K +QTDWG              
Subjt:  CLPFSRSISRSTQPLALVHCDLWGPSPVVSTLGYRYYVSFVDDFSRFTHIYPLKSKGEALVSFQTYKSLVENKFEKRIKVLQTDWG--------------

Query:  -------------------------EMGLTFLAQAAMPLSYWWDAFCSAVFVINQLPTPVLDYLSPWEKAFKSSPDFEFLRTFGCAYFPCLWPYQSHKFQ
                                 EMGLT LAQA +P  YWWDAF +AV++IN+LPTPVL   +P+E  FK  PD++FL+ FG + FPCL  YQ+HKFQ
Subjt:  -------------------------EMGLTFLAQAAMPLSYWWDAFCSAVFVINQLPTPVLDYLSPWEKAFKSSPDFEFLRTFGCAYFPCLWPYQSHKFQ

Query:  FHSSKCVFLGYSPSHK------------------------VLNSPPLSLPEPSTCVSPM-----PQPAIPAPSSEFQATSPSLG------------TTSP
        FHS+KCV LGYS  HK                           S  L+  +P T VS +         + + SS     S S+G            TTS 
Subjt:  FHSSKCVFLGYSPSHK------------------------VLNSPPLSLPEPSTCVSPM-----PQPAIPAPSSEFQATSPSLG------------TTSP

Query:  I-------------------------------QHENEGSISICVDTASPSGSVSPIQPVVSLPVGHHMQTRSKSGIFKPKDW---GSFLAASSDDVVIEP
        +                               QH +  ++    D    S S   ++ VVS    H M TR+K+GIFKPK +     ++  SS     EP
Subjt:  I-------------------------------QHENEGSISICVDTASPSGSVSPIQPVVSLPVGHHMQTRSKSGIFKPKDW---GSFLAASSDDVVIEP

Query:  VLVKDALQSAIWKSAIDQEIAALHRNKTWTLVPPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQIPGVDFHETFSPVVKAPTIQIILTVAVSK
          +++ALQ   W +A+  E+ AL RN TW LVP   ++++I +KWV+K KR +DGSF + KARLVAKGF Q PGVDF ETFSPV+KA T++I+L++AV+K
Subjt:  VLVKDALQSAIWKSAIDQEIAALHRNKTWTLVPPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQIPGVDFHETFSPVVKAPTIQIILTVAVSK

Query:  SWPVRKLDVNNAILNGKLDEDVFMKQPIGFIDKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNSKADNYVFFFHSDQLTVLVFIYVDDILVT
         W VR+LD+NNA LNG + ED++MKQP+GF DK+ P++VCKL K++YGL QAPRAW+D+ KATL +W+F NSKAD+ +FF  +    +LV IYVDDI++T
Subjt:  SWPVRKLDVNNAILNGKLDEDVFMKQPIGFIDKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNSKADNYVFFFHSDQLTVLVFIYVDDILVT

Query:  GDSIGFIDQSVHRLNTVFALKDIGPLSYYLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFSTFEGTPMHDPTLYRSVLGALQYITH
        G++   +   +++LN  FALKD+G L Y+LG+EV+R  +GMYL Q KYI  L++++ + +++ C TP+ TGK  S  +G  + +PT YR           
Subjt:  GDSIGFIDQSVHRLNTVFALKDIGPLSYYLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFSTFEGTPMHDPTLYRSVLGALQYITH

Query:  TRPDISFIVNKLSQFLQQPSDIHWQAAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHSLVSWSSKKQTVVARSSTESE
                                                                        +DRRS+A  C+++G +L+SWSS+KQ VV+RSSTESE
Subjt:  TRPDISFIVNKLSQFLQQPSDIHWQAAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHSLVSWSSKKQTVVARSSTESE

Query:  YRSLAQTAADVTWVQSLLTELGLSSSTIPLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVLTKAVEIRYVPSSEQLADALTKPLSEGQFSSIRGK
        YR+LAQ AA++TWVQSLL EL       P+IWCDN+ A +L  NPV+H+R+KHIE+D+HFVRDK++ K +E+RY+PSSEQ+AD LTK L+ G    +  K
Subjt:  YRSLAQTAADVTWVQSLLTELGLSSSTIPLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVLTKAVEIRYVPSSEQLADALTKPLSEGQFSSIRGK

Query:  LGVLPSPSRLRGILPLGFSSPSGSKQPSGRG
        LGV+P P  LRG +    +  +   Q S  G
Subjt:  LGVLPSPSRLRGILPLGFSSPSGSKQPSGRG

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.6e-9424.62Show/hide
Query:  DEKNYLLWRGMVLAILRGQKVDGYVLGTKAKPSEMVEAVTDAGKELRPNPLFEEWVTVDQALSGWLFGSMTPTIAADVVSFKTSREVWTTLEEVYGATSK
        D + Y +W+  + A+L  Q V   V G                  L PN + + W   ++     +   ++ +      S  T+R++   L+ VY   S 
Subjt:  DEKNYLLWRGMVLAILRGQKVDGYVLGTKAKPSEMVEAVTDAGKELRPNPLFEEWVTVDQALSGWLFGSMTPTIAADVVSFKTSREVWTTLEEVYGATSK

Query:  ARINQLRGILQNTK-KGFMKMIAFLAVMKQASENLQLAGNPVCLGDLISYVLASLSPEYILIVSSINDKDVKTWQELSSILVTFEGTL----AKYSNDVV
        A    LR  L + K    M +++   +  +    L  AG  +   D IS++L +L   Y  I+++I     +T  E +  L   +  L     K  ND  
Subjt:  ARINQLRGILQNTK-KGFMKMIAFLAVMKQASENLQLAGNPVCLGDLISYVLASLSPEYILIVSSINDKDVKTWQELSSILVTFEGTL----AKYSNDVV

Query:  SADLPDLSIQLALNRPNQFSACYMRFEESFNDLHAGNSSGQI-----GSSGNSSA----YIATLDILNDPKWLADSGATTH----VTADVGNLGVKTDYN
              ++  +  N     +  +         +  GNS  ++     G  G+       Y   L+  N         AT+H    +  +V N  V    N
Subjt:  SADLPDLSIQLALNRPNQFSACYMRFEESFNDLHAGNSSGQI-----GSSGNSSA----YIATLDILNDPKWLADSGATTH----VTADVGNLGVKTDYN

Query:  GKESLIVGDGSHLNISHVGYTTVPSNLPNTAPVYLKNMLHVPSIKRNLISIACLTADNNVFVEFHSNACIVKDKVSRRVILQGTLKKRLYEIELPSIPKV
            L  G   HL      YT     +P       K    + + KR ++    L  D+ + +E   +    K+     + +     KRL E  + SI   
Subjt:  GKESLIVGDGSHLNISHVGYTTVPSNLPNTAPVYLKNMLHVPSIKRNLISIACLTADNNVFVEFHSNACIVKDKVSRRVILQGTLKKRLYEIELPSIPKV

Query:  NSGVQSPVVAPVFEYPNKSSVLKSSSV--FQAESPKSCNVSR-STWHKRLGHASDRIV-----SHVLQSCNLKVSLNEKSEFCEACQLGKSHCLPFSRSI
         SGV   +          S +L +  V  FQA S  + + +    WH+R GH SD  +      ++    +L  +L    E CE C  GK   LPF +  
Subjt:  NSGVQSPVVAPVFEYPNKSSVLKSSSV--FQAESPKSCNVSR-STWHKRLGHASDRIV-----SHVLQSCNLKVSLNEKSEFCEACQLGKSHCLPFSRSI

Query:  SRS--TQPLALVHCDLWGPSPVVSTLGYRYYVSFVDDFSRFTHIYPLKSKGEALVSFQTYKSLVENKFEKRIKVLQTDWG--------------------
         ++   +PL +VH D+ GP   V+     Y+V FVD F+ +   Y +K K +    FQ + +  E  F  ++  L  D G                    
Subjt:  SRS--TQPLALVHCDLWGPSPVVSTLGYRYYVSFVDDFSRFTHIYPLKSKGEALVSFQTYKSLVENKFEKRIKVLQTDWG--------------------

Query:  ---------------------EMGLTFLAQAAMPLSYWWDAFCSAVFVINQLPTPVL--DYLSPWEKAFKSSPDFEFLRTFGCAYFPCLWPYQSHKFQFH
                             E   T ++ A +  S+W +A  +A ++IN++P+  L     +P+E      P  + LR FG   +  +   Q  KF   
Subjt:  ---------------------EMGLTFLAQAAMPLSYWWDAFCSAVFVINQLPTPVL--DYLSPWEKAFKSSPDFEFLRTFGCAYFPCLWPYQSHKFQFH

Query:  SSKCVFLGYSPS----------------------HKVLNSPPL----------------SLPEPSTCV--SPMPQPAIPAPSSEFQATSPS---------
        S K +F+GY P+                        ++NS  +                + P  S  +  +  P  +    + +F   S           
Subjt:  SSKCVFLGYSPS----------------------HKVLNSPPL----------------SLPEPSTCV--SPMPQPAIPAPSSEFQATSPS---------

Query:  ----LGTTSPIQHENEGSISICVDTASPS--------------------GSVSPIQPVVSLPVGH----------------HMQTRSKSGIFKPKDWGSF
            + T  P + +   +I    D+   +                    GS +P +   S    H                 +  RS+    KP+     
Subjt:  ----LGTTSPIQHENEGSISICVDTASPS--------------------GSVSPIQPVVSLPVGH----------------HMQTRSKSGIFKPKDWGSF

Query:  LAASSDDVVIEPVLVK------------DALQ----SAIWKSAIDQEIAALHRNKTWTLVPPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQI
        ++ + +D  +  V++             D +Q     + W+ AI+ E+ A   N TWT+   P N N++ S+WVF VK    G+  + KARLVA+GF Q 
Subjt:  LAASSDDVVIEPVLVK------------DALQ----SAIWKSAIDQEIAALHRNKTWTLVPPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQI

Query:  PGVDFHETFSPVVKAPTIQIILTVAVSKSWPVRKLDVNNAILNGKLDEDVFMKQPIGFIDKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNS
          +D+ ETF+PV +  + + IL++ +  +  V ++DV  A LNG L E+++M+ P G     +   VCKL+KA+YGL QA R W++ ++  L    F NS
Subjt:  PGVDFHETFSPVVKAPTIQIILTVAVSKSWPVRKLDVNNAILNGKLDEDVFMKQPIGFIDKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNS

Query:  KADNYVFFFHSDQL--TVLVFIYVDDILVTGDSIGFIDQSVHRLNTVFALKDIGPLSYYLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVT
          D  ++      +   + V +YVDD+++    +  ++     L   F + D+  + +++G+ +      +YL QS Y+  ++ +  + +    +TP+ +
Subjt:  KADNYVFFFHSDQL--TVLVFIYVDDILVTGDSIGFIDQSVHRLNTVFALKDIGPLSYYLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVT

Query:  GKRFSTFEGTPMHDPTLYRSVLGALQYI-THTRPDISFIVNKLSQFLQQPSDIHWQAAKRVLRYLKGTLDYGLFLPK--SDDLSIIAYTDADWACNLDDR
           +         + T  RS++G L YI   TRPD++  VN LS++  + +   WQ  KRVLRYLKGT+D  L   K  + +  II Y D+DWA +  DR
Subjt:  GKRFSTFEGTPMHDPTLYRSVLGALQYI-THTRPDISFIVNKLSQFLQQPSDIHWQAAKRVLRYLKGTLDYGLFLPK--SDDLSIIAYTDADWACNLDDR

Query:  RSIAAHCL-FIGHSLVSWSSKKQTVVARSSTESEYRSLAQTAADVTWVQSLLTELGLSSSTIPLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVL
        +S   +       +L+ W++K+Q  VA SSTE+EY +L +   +  W++ LLT + +       I+ DN   IS+  NP  H R+KHI++  HF R++V 
Subjt:  RSIAAHCL-FIGHSLVSWSSKKQTVVARSSTESEYRSLAQTAADVTWVQSLLTELGLSSSTIPLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVL

Query:  TKAVEIRYVPSSEQLADALTKPLSEGQFSSIRGKLGVL
           + + Y+P+  QLAD  TKPL   +F  +R KLG+L
Subjt:  TKAVEIRYVPSSEQLADALTKPLSEGQFSSIRGKLGVL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.6e-11526.91Show/hide
Query:  EEWVTVDQALSGWLFGSMTPTIAADVVSFKTSREVWTTLEEVYGATSKARINQLRGILQ----------------NTKKGFMKMIAFLAVMKQASEN--L
        E+W  +D+  +  +   ++  +  +++   T+R +WT LE +Y   SK   N+L    Q                N   G +  +A L V  +  +   L
Subjt:  EEWVTVDQALSGWLFGSMTPTIAADVVSFKTSREVWTTLEEVYGATSKARINQLRGILQ----------------NTKKGFMKMIAFLAVMKQASEN--L

Query:  QLAGNPVCLGDLISYVLASLSPEYILIVSS---INDKDVKTWQELSSILVTFEGTLAKY---SNDVVSADLPDLSIQLALNRPNQFSACYM--RFEESFN
         L   P    +L + +L   +   +  V+S   +N+K  K  +     L+T EG    Y   SN+   +     S   + +R      C     F+    
Subjt:  QLAGNPVCLGDLISYVLASLSPEYILIVSS---INDKDVKTWQELSSILVTFEGTLAKY---SNDVVSADLPDLSIQLALNRPNQFSACYM--RFEESFN

Query:  DLH--AGNSSGQIGSSGNSSAYIATLD----ILN-----------DPKWLADSGATTHVTADVGNLGVKTDYNGKESLIVGDGSHLNISHVGYTTVPSNL
        +     G +SGQ  +  N++A +   D     +N           + +W+ D+ A+ H T  V +L  +       ++ +G+ S+  I+ +G   + +N+
Subjt:  DLH--AGNSSGQIGSSGNSSAYIATLD----ILN-----------DPKWLADSGATTHVTADVGNLGVKTDYNGKESLIVGDGSHLNISHVGYTTVPSNL

Query:  PNTAPVYLKNMLHVPSIKRNLISIACLTADNNVFVEFHSNACIVKDKVSRRVILQGTLKKRLYEIELPSIPKVNSGVQSPVVAPVFEYPNKSSVLKSSSV
          T  + LK++ HVP ++ NLIS   L  D + +  + +N      K S  VI +G  +  LY                                 ++ +
Subjt:  PNTAPVYLKNMLHVPSIKRNLISIACLTADNNVFVEFHSNACIVKDKVSRRVILQGTLKKRLYEIELPSIPKVNSGVQSPVVAPVFEYPNKSSVLKSSSV

Query:  FQAE-SPKSCNVSRSTWHKRLGHASDRIVSHVLQSCNLKVSLNEKSEFCEACQLGKSHCLPFSRSISRSTQPLALVHCDLWGPSPVVSTLGYRYYVSFVD
         Q E +     +S   WHKR+GH S++ +  + +   +  +     + C+ C  GK H + F  S  R    L LV+ D+ GP  + S  G +Y+V+F+D
Subjt:  FQAE-SPKSCNVSRSTWHKRLGHASDRIVSHVLQSCNLKVSLNEKSEFCEACQLGKSHCLPFSRSISRSTQPLALVHCDLWGPSPVVSTLGYRYYVSFVD

Query:  DFSRFTHIYPLKSKGEALVSFQTYKSLVENKFEKRIKVLQTDWG-----------------------------------------EMGLTFLAQAAMPLS
        D SR   +Y LK+K +    FQ + +LVE +  +++K L++D G                                         E   + L  A +P S
Subjt:  DFSRFTHIYPLKSKGEALVSFQTYKSLVENKFEKRIKVLQTDWG-----------------------------------------EMGLTFLAQAAMPLS

Query:  YWWDAFCSAVFVINQLPTPVLDYLSPWEKAFKSSPDFEFLRTFGCAYFPCLWPYQSHKFQFHSSKCVFLGYSPSH-----------KVLNSPPLSLPEPS
        +W +A  +A ++IN+ P+  L +  P          +  L+ FGC  F  +   Q  K    S  C+F+GY               KV+ S  +   E  
Subjt:  YWWDAFCSAVFVINQLPTPVLDYLSPWEKAFKSSPDFEFLRTFGCAYFPCLWPYQSHKFQFHSSKCVFLGYSPSH-----------KVLNSPPLSLPEPS

Query:  T-CVSPMPQPA--------IPAPSSEFQATSPSLGTTSPIQHENEGSISICVDTASPSGSVSPIQPVVSLPVGHHMQTRSKSGIFKPKDWGS--FLAASS
            + M +          +  PS+    TS    TT  +  + E    +          V  ++        H    RS+    + + + S  ++  S 
Subjt:  T-CVSPMPQPA--------IPAPSSEFQATSPSLGTTSPIQHENEGSISICVDTASPSGSVSPIQPVVSLPVGHHMQTRSKSGIFKPKDWGS--FLAASS

Query:  DDVVIEPVLVKDAL---QSAIWKSAIDQEIAALHRNKTWTLVPPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQIPGVDFHETFSPVVKAPTI
        D    EP  +K+ L   +      A+ +E+ +L +N T+ LV  P     +  KWVFK+K+  D    + KARLV KGF Q  G+DF E FSPVVK  +I
Subjt:  DDVVIEPVLVKDAL---QSAIWKSAIDQEIAALHRNKTWTLVPPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQIPGVDFHETFSPVVKAPTI

Query:  QIILTVAVSKSWPVRKLDVNNAILNGKLDEDVFMKQPIGFIDKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNSKADNYVFFFH-SDQLTVL
        + IL++A S    V +LDV  A L+G L+E+++M+QP GF        VCKL+K+LYGL QAPR WY ++ + + +  +  + +D  V+F   S+   ++
Subjt:  QIILTVAVSKSWPVRKLDVNNAILNGKLDEDVFMKQPIGFIDKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNSKADNYVFFFH-SDQLTVL

Query:  VFIYVDDILVTGDSIGFIDQSVHRLNTVFALKDIGPLSYYLGVEV--HRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFS------TFEGTP
        + +YVDD+L+ G   G I +    L+  F +KD+GP    LG+++   R    ++L Q KYI  ++ER  + + +P +TP+    + S      T E   
Subjt:  VFIYVDDILVTGDSIGFIDQSVHRLNTVFALKDIGPLSYYLGVEV--HRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFS------TFEGTP

Query:  MHDPTLYRSVLGALQY-ITHTRPDISFIVNKLSQFLQQPSDIHWQAAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHS
              Y S +G+L Y +  TRPDI+  V  +S+FL+ P   HW+A K +LRYL+GT    L    SD + +  YTDAD A ++D+R+S   +       
Subjt:  MHDPTLYRSVLGALQY-ITHTRPDISFIVNKLSQFLQQPSDIHWQAAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHS

Query:  LVSWSSKKQTVVARSSTESEYRSLAQTAADVTWVQSLLTELGLSSSTIPLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVLTKAVEIRYVPSSEQ
         +SW SK Q  VA S+TE+EY +  +T  ++ W++  L ELGL      +++CD+ SAI L++N ++H+R+KHI++  H++R+ V  +++++  + ++E 
Subjt:  LVSWSSKKQTVVARSSTESEYRSLAQTAADVTWVQSLLTELGLSSSTIPLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVLTKAVEIRYVPSSEQ

Query:  LADALTKPLSEGQFSSIRGKLGV
         AD LTK +   +F   +  +G+
Subjt:  LADALTKPLSEGQFSSIRGKLGV

P92519 Uncharacterized mitochondrial protein AtMg008106.2e-5142.86Show/hide
Query:  VFIYVDDILVTGDSIGFIDQSVHRLNTVFALKDIGPLSYYLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFSTFEGTPMHDPTLYR
        + +YVDDIL+TG S   ++  + +L++ F++KD+GP+ Y+LG+++   PSG++L Q+KY   ++   G+   +P +TP+   K  S+       DP+ +R
Subjt:  VFIYVDDILVTGDSIGFIDQSVHRLNTVFALKDIGPLSYYLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFSTFEGTPMHDPTLYR

Query:  SVLGALQYITHTRPDISFIVNKLSQFLQQPSDIHWQAAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHSLVSWSSKKQ
        S++GALQY+T TRPDIS+ VN + Q + +P+   +   KRVLRY+KGT+ +GL++ K+  L++ A+ D+DWA     RRS    C F+G +++SWS+K+Q
Subjt:  SVLGALQYITHTRPDISFIVNKLSQFLQQPSDIHWQAAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHSLVSWSSKKQ

Query:  TVVARSSTESEYRSLAQTAADVTW
          V+RSSTE+EYR+LA TAA++TW
Subjt:  TVVARSSTESEYRSLAQTAADVTW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.3e-23035.16Show/hide
Query:  KIDEKNYLLWRGMVLAILRGQKVDGYVLGTKAKPSEMVEAVTDAGKELRPNPLFEEWVTVDQALSGWLFGSMTPTIAADVVSFKTSREVWTTLEEVYGAT
        K+   NYL+W   V A+  G ++ G++ G+   P   +   TDA    R NP +  W   D+ +   + G+++ ++   V    T+ ++W TL ++Y   
Subjt:  KIDEKNYLLWRGMVLAILRGQKVDGYVLGTKAKPSEMVEAVTDAGKELRPNPLFEEWVTVDQALSGWLFGSMTPTIAADVVSFKTSREVWTTLEEVYGAT

Query:  SKARINQLRGILQNTKKGFMKMIAFLAVMKQASENLQLAGNPVCLGDLISYVLASLSPEYILIVSSINDKDV-KTWQELSSILVTFEGTLAKYS------
        S   + QLR  L+   KG   +  ++  +    + L L G P+   + +  VL +L  EY  ++  I  KD   T  E+   L+  E  +   S      
Subjt:  SKARINQLRGILQNTKKGFMKMIAFLAVMKQASENLQLAGNPVCLGDLISYVLASLSPEYILIVSSINDKDV-KTWQELSSILVTFEGTLAKYS------

Query:  ---NDVVSADLPDLSIQLALNRPNQF-----SACYMRFEESFNDLHAGNSSGQ--------IGSSGNSSAYIATLD------------------------
           N V   +    +     NR N++     +     +++S  + H  N+  +         G  G+S+   + L                         
Subjt:  ---NDVVSADLPDLSIQLALNRPNQF-----SACYMRFEESFNDLHAGNSSGQ--------IGSSGNSSAYIATLD------------------------

Query:  ILNDP----KWLADSGATTHVTADVGNLGVKTDYNGKESLIVGDGSHLNISHVGYTTVPSNLPNTAPVYLKNMLHVPSIKRNLISIACLTADNNVFVEFH
         L  P     WL DSGAT H+T+D  NL +   Y G + ++V DGS + ISH G T++ +    + P+ L N+L+VP+I +NLIS+  L   N V VEF 
Subjt:  ILNDP----KWLADSGATTHVTADVGNLGVKTDYNGKESLIVGDGSHLNISHVGYTTVPSNLPNTAPVYLKNMLHVPSIKRNLISIACLTADNNVFVEFH

Query:  SNACIVKDKVSRRVILQGTLKKRLYEIELPSIPKVNSGVQSPVVAPVFEYPNKSSVLKSSSVFQAESPKSCNVSRSTWHKRLGHASDRIVSHVLQSCNLK
          +  VKD  +   +LQG  K  LYE  + S          PV                 S+F + S K+   + S+WH RLGH +  I++ V+ + +L 
Subjt:  SNACIVKDKVSRRVILQGTLKKRLYEIELPSIPKVNSGVQSPVVAPVFEYPNKSSVLKSSSVFQAESPKSCNVSRSTWHKRLGHASDRIVSHVLQSCNLK

Query:  VSLNEKSEF--CEACQLGKSHCLPFSRSISRSTQPLALVHCDLWGPSPVVSTLGYRYYVSFVDDFSRFTHIYPLKSKGEALVSFQTYKSLVENKFEKRIK
        V LN   +F  C  C + KS+ +PFS+S   ST+PL  ++ D+W  SP++S   YRYYV FVD F+R+T +YPLK K +   +F T+K+L+EN+F+ RI 
Subjt:  VSLNEKSEF--CEACQLGKSHCLPFSRSISRSTQPLALVHCDLWGPSPVVSTLGYRYYVSFVDDFSRFTHIYPLKSKGEALVSFQTYKSLVENKFEKRIK

Query:  VLQTDWG---------------------------------------EMGLTFLAQAAMPLSYWWDAFCSAVFVINQLPTPVLDYLSPWEKAFKSSPDFEF
           +D G                                       E GLT L+ A++P +YW  AF  AV++IN+LPTP+L   SP++K F +SP+++ 
Subjt:  VLQTDWG---------------------------------------EMGLTFLAQAAMPLSYWWDAFCSAVFVINQLPTPVLDYLSPWEKAFKSSPDFEF

Query:  LRTFGCAYFPCLWPYQSHKFQFHSSKCVFLGYS-------------------------------PSHKVLNSPPLSLPEPSTCV------SPMPQPAIPA
        LR FGCA +P L PY  HK    S +CVFLGYS                                ++    SP       S+CV       P   P +PA
Subjt:  LRTFGCAYFPCLWPYQSHKFQFHSSKCVFLGYS-------------------------------PSHKVLNSPPLSLPEPSTCV------SPMPQPAIPA

Query:  PS-----------------------------SEFQATSPS------------LGTTSPIQHENE------------------------------GSISIC
        PS                             S F ++ PS              TT P Q + +                               S S  
Subjt:  PS-----------------------------SEFQATSPS------------LGTTSPIQHENE------------------------------GSISIC

Query:  VDTASPSGSVSPIQPVVSL----------------PVG-HHMQTRSKSGIFKPKDWGSF---LAASSDDVVIEPVLVKDALQSAIWKSAIDQEIAALHRN
          T++ S S SP  P + +                P+  H M TR+K+GI KP    S    LAA S     EP     AL+   W++A+  EI A   N
Subjt:  VDTASPSGSVSPIQPVVSL----------------PVG-HHMQTRSKSGIFKPKDWGSF---LAASSDDVVIEPVLVKDALQSAIWKSAIDQEIAALHRN

Query:  KTWTLV-PPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQIPGVDFHETFSPVVKAPTIQIILTVAVSKSWPVRKLDVNNAILNGKLDEDVFMK
         TW LV PPPS+V ++G +W+F  K  SDGS ++ KARLVAKG+NQ PG+D+ ETFSPV+K+ +I+I+L VAV +SWP+R+LDVNNA L G L +DV+M 
Subjt:  KTWTLV-PPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQIPGVDFHETFSPVVKAPTIQIILTVAVSKSWPVRKLDVNNAILNGKLDEDVFMK

Query:  QPIGFIDKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNSKADNYVFFFHSDQLTVLVFIYVDDILVTGDSIGFIDQSVHRLNTVFALKDIGP
        QP GFIDKD P+YVCKL KALYGL QAPRAWY   +  L+T  F NS +D  +F     +  V + +YVDDIL+TG+    +  ++  L+  F++KD   
Subjt:  QPIGFIDKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNSKADNYVFFFHSDQLTVLVFIYVDDILVTGDSIGFIDQSVHRLNTVFALKDIGP

Query:  LSYYLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFSTFEGTPMHDPTLYRSVLGALQYITHTRPDISFIVNKLSQFLQQPSDIHWQ
        L Y+LG+E  R+P+G++L Q +YI +L+ R  +   +P  TP+    + S + GT + DPT YR ++G+LQY+  TRPDIS+ VN+LSQF+  P++ H Q
Subjt:  LSYYLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFSTFEGTPMHDPTLYRSVLGALQYITHTRPDISFIVNKLSQFLQQPSDIHWQ

Query:  AAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHSLVSWSSKKQTVVARSSTESEYRSLAQTAADVTWVQSLLTELGLSS
        A KR+LRYL GT ++G+FL K + LS+ AY+DADWA + DD  S   + +++GH  +SWSSKKQ  V RSSTE+EYRS+A T++++ W+ SLLTELG+  
Subjt:  AAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHSLVSWSSKKQTVVARSSTESEYRSLAQTAADVTWVQSLLTELGLSS

Query:  STIPLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVLTKAVEIRYVPSSEQLADALTKPLSEGQFSSIRGKLGVLPSP
        +  P+I+CDNV A  L  NPVFHSR KHI +D HF+R++V + A+ + +V + +QLAD LTKPLS   F +   K+GV   P
Subjt:  STIPLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVLTKAVEIRYVPSSEQLADALTKPLSEGQFSSIRGKLGVLPSP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.0e-22634.38Show/hide
Query:  KIDEKNYLLWRGMVLAILRGQKVDGYVLGTKAKPSEMVEAVTDAGKELRPNPLFEEWVTVDQALSGWLFGSMTPTIAADVVSFKTSREVWTTLEEVYGAT
        K+   NYL+W   V A+  G ++ G++ G+   P   +   TDA    R NP +  W   D+ +   + G+++ ++   V    T+ ++W TL ++Y   
Subjt:  KIDEKNYLLWRGMVLAILRGQKVDGYVLGTKAKPSEMVEAVTDAGKELRPNPLFEEWVTVDQALSGWLFGSMTPTIAADVVSFKTSREVWTTLEEVYGAT

Query:  SKARINQLRGILQNTKKGFMKMIAFLAVMKQASENLQLAGNPVCLGDLISYVLASLSPEYILIVSSINDKDV-KTWQELSSILVTFEG-TLAKYSNDVVS
        S   + QLR I +                    + L L G P+   + +  VL +L  +Y  ++  I  KD   +  E+   L+  E   LA  S +VV 
Subjt:  SKARINQLRGILQNTKKGFMKMIAFLAVMKQASENLQLAGNPVCLGDLISYVLASLSPEYILIVSSINDKDV-KTWQELSSILVTFEG-TLAKYSNDVVS

Query:  ADLPDLSIQLALNRPNQFSACYMRFEESFNDLHAGNSSGQIGSSGNSS--------------------------------------------------AY
             ++ +      NQ +    R   ++N+ +  ++S Q  SSG+ S                                                  A 
Subjt:  ADLPDLSIQLALNRPNQFSACYMRFEESFNDLHAGNSSGQIGSSGNSS--------------------------------------------------AY

Query:  IATLDILNDPKWLADSGATTHVTADVGNLGVKTDYNGKESLIVGDGSHLNISHVGYTTVPSNLPNTAPVYLKNMLHVPSIKRNLISIACLTADNNVFVEF
        +A     N   WL DSGAT H+T+D  NL     Y G + +++ DGS + I+H G  ++P+   ++  + L  +L+VP+I +NLIS+  L   N V VEF
Subjt:  IATLDILNDPKWLADSGATTHVTADVGNLGVKTDYNGKESLIVGDGSHLNISHVGYTTVPSNLPNTAPVYLKNMLHVPSIKRNLISIACLTADNNVFVEF

Query:  HSNACIVKDKVSRRVILQGTLKKRLYEIELPSIPKVNSGVQSPVVAPVFEYPNKSSVLKSSSVFQAESPKSCNVSRSTWHKRLGHASDRIVSHVLQSCNL
           +  VKD  +   +LQG  K  LYE  + S                           S +V    SP S   + S+WH RLGH S  I++ V+ + +L
Subjt:  HSNACIVKDKVSRRVILQGTLKKRLYEIELPSIPKVNSGVQSPVVAPVFEYPNKSSVLKSSSVFQAESPKSCNVSRSTWHKRLGHASDRIVSHVLQSCNL

Query:  KV-SLNEKSEFCEACQLGKSHCLPFSRSISRSTQPLALVHCDLWGPSPVVSTLGYRYYVSFVDDFSRFTHIYPLKSKGEALVSFQTYKSLVENKFEKRIK
         V + + K   C  C + KSH +PFS S   S++PL  ++ D+W  SP++S   YRYYV FVD F+R+T +YPLK K +   +F  +KSLVEN+F+ RI 
Subjt:  KV-SLNEKSEFCEACQLGKSHCLPFSRSISRSTQPLALVHCDLWGPSPVVSTLGYRYYVSFVDDFSRFTHIYPLKSKGEALVSFQTYKSLVENKFEKRIK

Query:  VLQTDWG---------------------------------------EMGLTFLAQAAMPLSYWWDAFCSAVFVINQLPTPVLDYLSPWEKAFKSSPDFEF
         L +D G                                       EMGLT L+ A++P +YW  AF  AV++IN+LPTP+L   SP++K F   P++E 
Subjt:  VLQTDWG---------------------------------------EMGLTFLAQAAMPLSYWWDAFCSAVFVINQLPTPVLDYLSPWEKAFKSSPDFEF

Query:  LRTFGCAYFPCLWPYQSHKFQFHSSKCVFLGYS----------------------------------------------------PSHKVLNSPPLSLPE
        L+ FGCA +P L PY  HK +  S +C F+GYS                                                    PSH  L + PL LP 
Subjt:  LRTFGCAYFPCLWPYQSHKFQFHSSKCVFLGYS----------------------------------------------------PSHKVLNSPPLSLPE

Query:  PSTCVSP------------------------MPQPAIPAPSSEFQATSPSLG----TTSPIQHENEGSISICVDTASP----------------------
        P  C+ P                        +P  +I +PSS  + T+PS      T  P Q +N  S S  ++  +P                      
Subjt:  PSTCVSP------------------------MPQPAIPAPSSEFQATSPSLG----TTSPIQHENEGSISICVDTASP----------------------

Query:  ------------------SGSVSPIQPVVSLPV-----------GHHMQTRSKSGIFKPK---DWGSFLAASSDDVVIEPVLVKDALQSAIWKSAIDQEI
                          S S  P+ PV+  P             H M TR+K GI KP     + + LAA+S     EP     A++   W+ A+  EI
Subjt:  ------------------SGSVSPIQPVVSLPV-----------GHHMQTRSKSGIFKPK---DWGSFLAASSDDVVIEPVLVKDALQSAIWKSAIDQEI

Query:  AALHRNKTWTLV-PPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQIPGVDFHETFSPVVKAPTIQIILTVAVSKSWPVRKLDVNNAILNGKLD
         A   N TW LV PPP +V ++G +W+F  K  SDGS ++ KARLVAKG+NQ PG+D+ ETFSPV+K+ +I+I+L VAV +SWP+R+LDVNNA L G L 
Subjt:  AALHRNKTWTLV-PPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQIPGVDFHETFSPVVKAPTIQIILTVAVSKSWPVRKLDVNNAILNGKLD

Query:  EDVFMKQPIGFIDKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNSKADNYVFFFHSDQLTVLVFIYVDDILVTGDSIGFIDQSVHRLNTVFA
        ++V+M QP GF+DKD P YVC+L KA+YGL QAPRAWY   +  L+T  F NS +D  +F     +  + + +YVDDIL+TG+    +  ++  L+  F+
Subjt:  EDVFMKQPIGFIDKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNSKADNYVFFFHSDQLTVLVFIYVDDILVTGDSIGFIDQSVHRLNTVFA

Query:  LKDIGPLSYYLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFSTFEGTPMHDPTLYRSVLGALQYITHTRPDISFIVNKLSQFLQQP
        +K+   L Y+LG+E  R+P G++L Q +Y  +L+ R  +   +P ATP+ T  + +   GT + DPT YR ++G+LQY+  TRPD+S+ VN+LSQ++  P
Subjt:  LKDIGPLSYYLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFSTFEGTPMHDPTLYRSVLGALQYITHTRPDISFIVNKLSQFLQQP

Query:  SDIHWQAAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHSLVSWSSKKQTVVARSSTESEYRSLAQTAADVTWVQSLLT
        +D HW A KRVLRYL GT D+G+FL K + LS+ AY+DADWA + DD  S   + +++GH  +SWSSKKQ  V RSSTE+EYRS+A T++++ W+ SLLT
Subjt:  SDIHWQAAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHSLVSWSSKKQTVVARSSTESEYRSLAQTAADVTWVQSLLT

Query:  ELGLSSSTIPLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVLTKAVEIRYVPSSEQLADALTKPLSEGQFSSIRGKLGVLPSPSRLRGIL
        ELG+  S  P+I+CDNV A  L  NPVFHSR KHI LD HF+R++V + A+ + +V + +QLAD LTKPLS   F +   K+GV+  P    G+L
Subjt:  ELGLSSSTIPLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVLTKAVEIRYVPSSEQLADALTKPLSEGQFSSIRGKLGVLPSPSRLRGIL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.4e-10638.92Show/hide
Query:  EPVLVKDALQSAIWKSAIDQEIAALHRNKTWTLVPPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQIPGVDFHETFSPVVKAPTIQIILTVAV
        EP    +A +  +W  A+D EI A+    TW +   P N   IG KWV+K+K  SDG+ ++ KARLVAKG+ Q  G+DF ETFSPV K  ++++IL ++ 
Subjt:  EPVLVKDALQSAIWKSAIDQEIAALHRNKTWTLVPPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKGFNQIPGVDFHETFSPVVKAPTIQIILTVAV

Query:  SKSWPVRKLDVNNAILNGKLDEDVFMKQPIGFI----DKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNSKADNYVFFFHSDQLTVLVFIYV
          ++ + +LD++NA LNG LDE+++MK P G+     D   P+ VC L K++YGL QA R W+ ++  TLI + F  S +D+  F   +  L + V +YV
Subjt:  SKSWPVRKLDVNNAILNGKLDEDVFMKQPIGFI----DKDHPSYVCKLDKALYGLLQAPRAWYDRYKATLITWRFSNSKADNYVFFFHSDQLTVLVFIYV

Query:  DDILVTGDSIGFIDQSVHRLNTVFALKDIGPLSYYLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFSTFEGTPMHDPTLYRSVLGA
        DDI++  ++   +D+   +L + F L+D+GPL Y+LG+E+ R  +G+ + Q KY  +L++  GL   +P + P+     FS   G    D   YR ++G 
Subjt:  DDILVTGDSIGFIDQSVHRLNTVFALKDIGPLSYYLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFSTFEGTPMHDPTLYRSVLGA

Query:  LQYITHTRPDISFIVNKLSQFLQQPSDIHWQAAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHSLVSWSSKKQTVVAR
        L Y+  TR DISF VNKLSQF + P   H QA  ++L Y+KGT+  GLF     ++ +  ++DA +    D RRS   +C+F+G SL+SW SKKQ VV++
Subjt:  LQYITHTRPDISFIVNKLSQFLQQPSDIHWQAAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHSLVSWSSKKQTVVAR

Query:  SSTESEYRSLAQTAADVTWVQSLLTELGLSSSTIPLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVLTKAVEIRYVPSSEQLADALTKPLSEGQF
        SS E+EYR+L+    ++ W+     EL L  S   L++CDN +AI +  N VFH R+KHIE D H VR++ + +A  + Y   +    D  T+ LS    
Subjt:  SSTESEYRSLAQTAADVTWVQSLLTELGLSSSTIPLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVLTKAVEIRYVPSSEQLADALTKPLSEGQF

Query:  SSIRGKLGVLPSPSRLRGI
          +RG +  + S   L G+
Subjt:  SSIRGKLGVLPSPSRLRGI

ATMG00240.1 Gag-Pol-related retrotransposon family protein3.6e-1444.44Show/hide
Query:  YITHTRPDISFIVNKLSQFLQQPSDIHWQAAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFI
        Y+T TRPD++F VN+LSQF         QA  +VL Y+KGT+  GLF   + DL + A+ D+DWA   D RRS+   C  +
Subjt:  YITHTRPDISFIVNKLSQFLQQPSDIHWQAAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFI

ATMG00300.1 Gag-Pol-related retrotransposon family protein2.1e-0638.57Show/hide
Query:  WHKRLGHASDRIVSHVLQSCNLKVSLNEKSEFCEACQLGKSHCLPFSRSISRSTQPLALVHCDLWGPSPV
        WH RL H S R +  +++   L  S     +FCE C  GK+H + FS     +  PL  VH DLWG   V
Subjt:  WHKRLGHASDRIVSHVLQSCNLKVSLNEKSEFCEACQLGKSHCLPFSRSISRSTQPLALVHCDLWGPSPV

ATMG00810.1 DNA/RNA polymerases superfamily protein4.4e-5242.86Show/hide
Query:  VFIYVDDILVTGDSIGFIDQSVHRLNTVFALKDIGPLSYYLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFSTFEGTPMHDPTLYR
        + +YVDDIL+TG S   ++  + +L++ F++KD+GP+ Y+LG+++   PSG++L Q+KY   ++   G+   +P +TP+   K  S+       DP+ +R
Subjt:  VFIYVDDILVTGDSIGFIDQSVHRLNTVFALKDIGPLSYYLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPCATPVVTGKRFSTFEGTPMHDPTLYR

Query:  SVLGALQYITHTRPDISFIVNKLSQFLQQPSDIHWQAAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHSLVSWSSKKQ
        S++GALQY+T TRPDIS+ VN + Q + +P+   +   KRVLRY+KGT+ +GL++ K+  L++ A+ D+DWA     RRS    C F+G +++SWS+K+Q
Subjt:  SVLGALQYITHTRPDISFIVNKLSQFLQQPSDIHWQAAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHCLFIGHSLVSWSSKKQ

Query:  TVVARSSTESEYRSLAQTAADVTW
          V+RSSTE+EYR+LA TAA++TW
Subjt:  TVVARSSTESEYRSLAQTAADVTW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.1e-2651.94Show/hide
Query:  MQTRSKSGIFK--PKDWGSFLAASSDDVVIEPVLVKDALQSAIWKSAIDQEIAALHRNKTWTLVPPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKG
        M TRSK+GI K  PK    +    +  +  EP  V  AL+   W  A+ +E+ AL RNKTW LVPPP N N++G KWVFK K  SDG+ D+ KARLVAKG
Subjt:  MQTRSKSGIFK--PKDWGSFLAASSDDVVIEPVLVKDALQSAIWKSAIDQEIAALHRNKTWTLVPPPSNVNLIGSKWVFKVKRKSDGSFDQCKARLVAKG

Query:  FNQIPGVDFHETFSPVVKAPTIQIILTVA
        F+Q  G+ F ET+SPVV+  TI+ IL VA
Subjt:  FNQIPGVDFHETFSPVVKAPTIQIILTVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAAACACACAGCGTCGAGACGCTGTGACCTGGTTGCGCCCAATCAGATAAGGAAAGGCAATGTCGAGACGCTCAAGGAGCTGCGTCCCGACGCTGAGCTACCAAA
GGTTTCAAACATGACAGACAACGTCTCTTCTACCTCATCAGCTGCTGAGACTGCGAATGCTGCAGTCAATGTTTTCTCTGCTGCTAATATTGTGAGTTCCTCTTTTGGTC
ATCCTTTGAGCACCGTTTTGACGGTGAAGATAGATGAGAAGAATTATCTTCTCTGGCGTGGGATGGTTCTAGCCATTCTCAGAGGCCAAAAAGTTGATGGATATGTGCTG
GGTACTAAAGCAAAGCCATCAGAAATGGTTGAAGCAGTGACAGACGCGGGTAAGGAGCTTCGACCTAATCCGCTTTTTGAGGAATGGGTTACTGTGGATCAAGCTTTATC
AGGCTGGTTGTTTGGGTCAATGACCCCAACCATTGCAGCAGATGTTGTAAGTTTTAAAACCTCGAGGGAGGTCTGGACGACACTTGAAGAAGTTTATGGAGCTACTAGCA
AAGCTCGTATCAACCAGCTGAGAGGGATTCTTCAGAACACGAAGAAGGGGTTTATGAAGATGATCGCTTTTCTTGCTGTCATGAAACAAGCCTCGGAGAATCTCCAACTT
GCGGGTAATCCTGTGTGTCTTGGTGATTTAATATCTTATGTTTTAGCCAGTTTAAGCCCTGAATACATTTTGATAGTTTCCTCTATTAATGATAAAGATGTGAAAACTTG
GCAAGAATTGAGTTCTATTCTAGTTACGTTTGAAGGAACTTTGGCTAAGTACAGTAATGATGTGGTGTCTGCCGATTTACCTGATTTATCTATTCAACTTGCCCTTAATC
GACCAAATCAGTTTTCGGCTTGTTACATGCGCTTTGAAGAGAGTTTTAATGACCTTCACGCTGGAAACTCCTCTGGTCAGATTGGTTCTTCTGGAAATTCATCTGCCTAC
ATTGCCACCCTTGATATCCTCAATGACCCGAAATGGTTGGCAGATAGTGGGGCTACTACACATGTCACTGCTGATGTTGGGAACCTTGGGGTGAAGACCGATTACAATGG
TAAGGAGTCTCTTATTGTTGGTGATGGTTCTCATTTAAATATTTCTCATGTGGGTTATACTACTGTGCCATCTAATCTTCCGAATACAGCTCCTGTTTATTTGAAAAATA
TGTTACATGTTCCTAGCATCAAGCGTAATTTAATAAGCATTGCTTGTTTGACTGCCGATAATAATGTCTTTGTTGAATTTCATTCTAATGCTTGTATTGTGAAGGACAAG
GTTTCAAGGAGGGTGATTCTACAAGGAACTCTTAAGAAGAGACTTTATGAAATCGAGCTTCCTTCAATCCCAAAGGTCAATTCCGGTGTTCAGTCACCTGTTGTTGCTCC
TGTTTTTGAGTATCCAAATAAGTCCTCTGTCTTGAAGTCATCTTCTGTATTTCAGGCTGAAAGTCCTAAGTCTTGTAATGTGTCTCGTTCTACTTGGCACAAGCGTCTGG
GTCATGCGTCTGATAGGATAGTGAGTCATGTCTTGCAATCTTGTAATTTGAAAGTTTCACTAAATGAGAAATCTGAATTTTGTGAAGCTTGCCAATTAGGAAAATCTCAC
TGTTTACCTTTTTCTAGGTCTATCTCTCGCAGTACTCAACCACTTGCACTTGTTCATTGTGATCTTTGGGGTCCATCCCCGGTTGTTTCCACCTTGGGATATCGTTATTA
CGTTAGCTTTGTGGATGATTTTTCCAGATTTACACACATTTATCCTCTAAAATCTAAGGGTGAAGCTTTAGTTTCCTTTCAAACTTACAAATCTCTAGTTGAAAACAAAT
TTGAAAAGAGGATTAAGGTTCTTCAAACAGATTGGGGCGAAATGGGATTAACCTTTCTTGCTCAAGCTGCCATGCCTTTGTCTTATTGGTGGGATGCTTTCTGTTCTGCT
GTTTTTGTGATTAATCAACTCCCTACACCTGTTTTAGATTATCTCTCACCTTGGGAAAAGGCTTTTAAATCTTCTCCTGACTTTGAGTTCTTAAGAACCTTTGGGTGTGC
TTATTTCCCGTGTCTCTGGCCTTATCAATCCCATAAATTTCAGTTTCATAGCTCCAAATGTGTTTTCCTTGGTTATAGCCCTTCACACAAAGTGTTGAATTCTCCTCCTC
TTTCCCTTCCTGAGCCTTCTACTTGTGTATCTCCTATGCCTCAGCCTGCTATTCCAGCTCCTTCTTCTGAGTTTCAAGCGACATCACCTTCCTTGGGAACTACTTCTCCA
ATTCAGCATGAAAATGAAGGTTCAATCTCGATCTGCGTTGATACAGCCTCCCCATCCGGCTCTGTTTCTCCTATTCAGCCGGTTGTTTCCTTGCCTGTTGGTCATCATAT
GCAAACACGGTCTAAAAGTGGCATCTTTAAGCCAAAGGATTGGGGTTCTTTCTTGGCTGCCAGCTCTGATGATGTTGTGATTGAGCCTGTGTTGGTTAAGGATGCTTTGC
AATCAGCTATTTGGAAATCAGCAATAGATCAGGAAATTGCTGCTCTTCATCGGAACAAAACTTGGACTCTTGTTCCTCCTCCATCAAATGTCAACCTTATTGGAAGTAAG
TGGGTGTTTAAGGTTAAACGCAAATCTGATGGTTCGTTTGACCAATGTAAGGCCCGTCTCGTTGCAAAAGGTTTTAATCAAATTCCTGGGGTTGACTTTCATGAAACTTT
TAGTCCGGTGGTTAAGGCTCCAACTATTCAGATTATTCTTACTGTTGCTGTTTCGAAAAGCTGGCCGGTTAGAAAGTTGGACGTTAATAATGCTATCCTTAATGGAAAGC
TTGATGAGGATGTTTTTATGAAACAACCAATTGGTTTTATAGACAAAGATCATCCTTCTTATGTTTGTAAACTTGATAAGGCTCTCTATGGTCTTCTACAGGCTCCGAGG
GCTTGGTATGATCGGTACAAGGCAACTCTTATTACTTGGAGATTCTCAAATTCAAAAGCTGATAATTATGTTTTCTTCTTTCACAGTGATCAACTCACGGTTCTTGTGTT
CATATATGTTGACGATATCCTTGTCACTGGGGATTCCATCGGCTTTATTGATCAATCTGTGCATCGTTTGAACACTGTCTTTGCGTTGAAAGACATTGGTCCTCTCAGTT
ACTATCTCGGTGTGGAGGTCCATCGTATTCCATCTGGTATGTATTTATTGCAATCTAAATATATTACAAACTTGGTTGAGCGTGTTGGTCTCTCGCATATCCGGCCGTGT
GCTACTCCTGTAGTCACGGGTAAACGTTTCTCTACCTTTGAAGGAACCCCTATGCATGATCCTACATTGTATCGTAGTGTGCTAGGAGCACTTCAGTATATTACTCATAC
ACGCCCAGATATTTCTTTTATAGTCAACAAACTCAGTCAGTTCCTACAGCAACCTTCAGATATACATTGGCAGGCTGCTAAACGAGTTCTTCGATATCTCAAGGGTACTC
TGGATTATGGTTTGTTCCTACCTAAGTCTGATGATCTATCTATTATAGCTTATACAGATGCTGATTGGGCTTGCAACTTGGATGATCGCCGGTCTATTGCTGCGCACTGC
TTGTTTATTGGTCATTCTCTTGTTTCATGGTCGTCGAAAAAGCAGACTGTTGTAGCCCGCTCTAGTACTGAATCGGAATATCGCTCTTTAGCTCAGACTGCTGCAGATGT
TACTTGGGTGCAGTCCTTGCTTACTGAGCTTGGTTTATCATCCTCTACAATCCCACTTATCTGGTGTGACAATGTAAGTGCAATTTCTCTCACACAAAATCCAGTCTTTC
ATTCCCGTAGTAAGCACATTGAGTTGGATATGCATTTCGTTCGTGACAAAGTTCTCACCAAGGCAGTTGAGATTCGTTATGTTCCATCATCTGAACAATTAGCGGACGCC
CTCACCAAACCTTTATCTGAAGGACAGTTCTCATCTATCAGAGGCAAACTTGGGGTACTACCGTCACCCTCTCGTTTGAGGGGGATACTCCCCTTAGGGTTTTCGTCGCC
GTCGGGCTCCAAGCAACCATCGGGCCGAGGCCGACCTCATCCTCAGCCTCTCCCTTGGGCCAAGGCCGAGGATGACAGTTTCGGGCCGGAGGGAGCTCCGACCCTTTGCT
TCGTTGACTTGTCACTAGGTTCACGAAAAGAACTTTTCGGTCTGCATGGAACTATCCTTGGAACTTGGGGAAATGGGAATCTAGTTAGCACAAAGGACAAAAGGGAAGAG
GACAAAAACGGGGAAAGGGCCGAGGCAGTCAGCCTCGCCTTCCACGCCAGGCGCGTGCCTAGCGTGGCTGCGTGCCTGCTGCATAGCAGCAGTGCGCTCGGCCTGCCGTG
GCAGGCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAAAACACACAGCGTCGAGACGCTGTGACCTGGTTGCGCCCAATCAGATAAGGAAAGGCAATGTCGAGACGCTCAAGGAGCTGCGTCCCGACGCTGAGCTACCAAA
GGTTTCAAACATGACAGACAACGTCTCTTCTACCTCATCAGCTGCTGAGACTGCGAATGCTGCAGTCAATGTTTTCTCTGCTGCTAATATTGTGAGTTCCTCTTTTGGTC
ATCCTTTGAGCACCGTTTTGACGGTGAAGATAGATGAGAAGAATTATCTTCTCTGGCGTGGGATGGTTCTAGCCATTCTCAGAGGCCAAAAAGTTGATGGATATGTGCTG
GGTACTAAAGCAAAGCCATCAGAAATGGTTGAAGCAGTGACAGACGCGGGTAAGGAGCTTCGACCTAATCCGCTTTTTGAGGAATGGGTTACTGTGGATCAAGCTTTATC
AGGCTGGTTGTTTGGGTCAATGACCCCAACCATTGCAGCAGATGTTGTAAGTTTTAAAACCTCGAGGGAGGTCTGGACGACACTTGAAGAAGTTTATGGAGCTACTAGCA
AAGCTCGTATCAACCAGCTGAGAGGGATTCTTCAGAACACGAAGAAGGGGTTTATGAAGATGATCGCTTTTCTTGCTGTCATGAAACAAGCCTCGGAGAATCTCCAACTT
GCGGGTAATCCTGTGTGTCTTGGTGATTTAATATCTTATGTTTTAGCCAGTTTAAGCCCTGAATACATTTTGATAGTTTCCTCTATTAATGATAAAGATGTGAAAACTTG
GCAAGAATTGAGTTCTATTCTAGTTACGTTTGAAGGAACTTTGGCTAAGTACAGTAATGATGTGGTGTCTGCCGATTTACCTGATTTATCTATTCAACTTGCCCTTAATC
GACCAAATCAGTTTTCGGCTTGTTACATGCGCTTTGAAGAGAGTTTTAATGACCTTCACGCTGGAAACTCCTCTGGTCAGATTGGTTCTTCTGGAAATTCATCTGCCTAC
ATTGCCACCCTTGATATCCTCAATGACCCGAAATGGTTGGCAGATAGTGGGGCTACTACACATGTCACTGCTGATGTTGGGAACCTTGGGGTGAAGACCGATTACAATGG
TAAGGAGTCTCTTATTGTTGGTGATGGTTCTCATTTAAATATTTCTCATGTGGGTTATACTACTGTGCCATCTAATCTTCCGAATACAGCTCCTGTTTATTTGAAAAATA
TGTTACATGTTCCTAGCATCAAGCGTAATTTAATAAGCATTGCTTGTTTGACTGCCGATAATAATGTCTTTGTTGAATTTCATTCTAATGCTTGTATTGTGAAGGACAAG
GTTTCAAGGAGGGTGATTCTACAAGGAACTCTTAAGAAGAGACTTTATGAAATCGAGCTTCCTTCAATCCCAAAGGTCAATTCCGGTGTTCAGTCACCTGTTGTTGCTCC
TGTTTTTGAGTATCCAAATAAGTCCTCTGTCTTGAAGTCATCTTCTGTATTTCAGGCTGAAAGTCCTAAGTCTTGTAATGTGTCTCGTTCTACTTGGCACAAGCGTCTGG
GTCATGCGTCTGATAGGATAGTGAGTCATGTCTTGCAATCTTGTAATTTGAAAGTTTCACTAAATGAGAAATCTGAATTTTGTGAAGCTTGCCAATTAGGAAAATCTCAC
TGTTTACCTTTTTCTAGGTCTATCTCTCGCAGTACTCAACCACTTGCACTTGTTCATTGTGATCTTTGGGGTCCATCCCCGGTTGTTTCCACCTTGGGATATCGTTATTA
CGTTAGCTTTGTGGATGATTTTTCCAGATTTACACACATTTATCCTCTAAAATCTAAGGGTGAAGCTTTAGTTTCCTTTCAAACTTACAAATCTCTAGTTGAAAACAAAT
TTGAAAAGAGGATTAAGGTTCTTCAAACAGATTGGGGCGAAATGGGATTAACCTTTCTTGCTCAAGCTGCCATGCCTTTGTCTTATTGGTGGGATGCTTTCTGTTCTGCT
GTTTTTGTGATTAATCAACTCCCTACACCTGTTTTAGATTATCTCTCACCTTGGGAAAAGGCTTTTAAATCTTCTCCTGACTTTGAGTTCTTAAGAACCTTTGGGTGTGC
TTATTTCCCGTGTCTCTGGCCTTATCAATCCCATAAATTTCAGTTTCATAGCTCCAAATGTGTTTTCCTTGGTTATAGCCCTTCACACAAAGTGTTGAATTCTCCTCCTC
TTTCCCTTCCTGAGCCTTCTACTTGTGTATCTCCTATGCCTCAGCCTGCTATTCCAGCTCCTTCTTCTGAGTTTCAAGCGACATCACCTTCCTTGGGAACTACTTCTCCA
ATTCAGCATGAAAATGAAGGTTCAATCTCGATCTGCGTTGATACAGCCTCCCCATCCGGCTCTGTTTCTCCTATTCAGCCGGTTGTTTCCTTGCCTGTTGGTCATCATAT
GCAAACACGGTCTAAAAGTGGCATCTTTAAGCCAAAGGATTGGGGTTCTTTCTTGGCTGCCAGCTCTGATGATGTTGTGATTGAGCCTGTGTTGGTTAAGGATGCTTTGC
AATCAGCTATTTGGAAATCAGCAATAGATCAGGAAATTGCTGCTCTTCATCGGAACAAAACTTGGACTCTTGTTCCTCCTCCATCAAATGTCAACCTTATTGGAAGTAAG
TGGGTGTTTAAGGTTAAACGCAAATCTGATGGTTCGTTTGACCAATGTAAGGCCCGTCTCGTTGCAAAAGGTTTTAATCAAATTCCTGGGGTTGACTTTCATGAAACTTT
TAGTCCGGTGGTTAAGGCTCCAACTATTCAGATTATTCTTACTGTTGCTGTTTCGAAAAGCTGGCCGGTTAGAAAGTTGGACGTTAATAATGCTATCCTTAATGGAAAGC
TTGATGAGGATGTTTTTATGAAACAACCAATTGGTTTTATAGACAAAGATCATCCTTCTTATGTTTGTAAACTTGATAAGGCTCTCTATGGTCTTCTACAGGCTCCGAGG
GCTTGGTATGATCGGTACAAGGCAACTCTTATTACTTGGAGATTCTCAAATTCAAAAGCTGATAATTATGTTTTCTTCTTTCACAGTGATCAACTCACGGTTCTTGTGTT
CATATATGTTGACGATATCCTTGTCACTGGGGATTCCATCGGCTTTATTGATCAATCTGTGCATCGTTTGAACACTGTCTTTGCGTTGAAAGACATTGGTCCTCTCAGTT
ACTATCTCGGTGTGGAGGTCCATCGTATTCCATCTGGTATGTATTTATTGCAATCTAAATATATTACAAACTTGGTTGAGCGTGTTGGTCTCTCGCATATCCGGCCGTGT
GCTACTCCTGTAGTCACGGGTAAACGTTTCTCTACCTTTGAAGGAACCCCTATGCATGATCCTACATTGTATCGTAGTGTGCTAGGAGCACTTCAGTATATTACTCATAC
ACGCCCAGATATTTCTTTTATAGTCAACAAACTCAGTCAGTTCCTACAGCAACCTTCAGATATACATTGGCAGGCTGCTAAACGAGTTCTTCGATATCTCAAGGGTACTC
TGGATTATGGTTTGTTCCTACCTAAGTCTGATGATCTATCTATTATAGCTTATACAGATGCTGATTGGGCTTGCAACTTGGATGATCGCCGGTCTATTGCTGCGCACTGC
TTGTTTATTGGTCATTCTCTTGTTTCATGGTCGTCGAAAAAGCAGACTGTTGTAGCCCGCTCTAGTACTGAATCGGAATATCGCTCTTTAGCTCAGACTGCTGCAGATGT
TACTTGGGTGCAGTCCTTGCTTACTGAGCTTGGTTTATCATCCTCTACAATCCCACTTATCTGGTGTGACAATGTAAGTGCAATTTCTCTCACACAAAATCCAGTCTTTC
ATTCCCGTAGTAAGCACATTGAGTTGGATATGCATTTCGTTCGTGACAAAGTTCTCACCAAGGCAGTTGAGATTCGTTATGTTCCATCATCTGAACAATTAGCGGACGCC
CTCACCAAACCTTTATCTGAAGGACAGTTCTCATCTATCAGAGGCAAACTTGGGGTACTACCGTCACCCTCTCGTTTGAGGGGGATACTCCCCTTAGGGTTTTCGTCGCC
GTCGGGCTCCAAGCAACCATCGGGCCGAGGCCGACCTCATCCTCAGCCTCTCCCTTGGGCCAAGGCCGAGGATGACAGTTTCGGGCCGGAGGGAGCTCCGACCCTTTGCT
TCGTTGACTTGTCACTAGGTTCACGAAAAGAACTTTTCGGTCTGCATGGAACTATCCTTGGAACTTGGGGAAATGGGAATCTAGTTAGCACAAAGGACAAAAGGGAAGAG
GACAAAAACGGGGAAAGGGCCGAGGCAGTCAGCCTCGCCTTCCACGCCAGGCGCGTGCCTAGCGTGGCTGCGTGCCTGCTGCATAGCAGCAGTGCGCTCGGCCTGCCGTG
GCAGGCCTAG
Protein sequenceShow/hide protein sequence
MEKHTASRRCDLVAPNQIRKGNVETLKELRPDAELPKVSNMTDNVSSTSSAAETANAAVNVFSAANIVSSSFGHPLSTVLTVKIDEKNYLLWRGMVLAILRGQKVDGYVL
GTKAKPSEMVEAVTDAGKELRPNPLFEEWVTVDQALSGWLFGSMTPTIAADVVSFKTSREVWTTLEEVYGATSKARINQLRGILQNTKKGFMKMIAFLAVMKQASENLQL
AGNPVCLGDLISYVLASLSPEYILIVSSINDKDVKTWQELSSILVTFEGTLAKYSNDVVSADLPDLSIQLALNRPNQFSACYMRFEESFNDLHAGNSSGQIGSSGNSSAY
IATLDILNDPKWLADSGATTHVTADVGNLGVKTDYNGKESLIVGDGSHLNISHVGYTTVPSNLPNTAPVYLKNMLHVPSIKRNLISIACLTADNNVFVEFHSNACIVKDK
VSRRVILQGTLKKRLYEIELPSIPKVNSGVQSPVVAPVFEYPNKSSVLKSSSVFQAESPKSCNVSRSTWHKRLGHASDRIVSHVLQSCNLKVSLNEKSEFCEACQLGKSH
CLPFSRSISRSTQPLALVHCDLWGPSPVVSTLGYRYYVSFVDDFSRFTHIYPLKSKGEALVSFQTYKSLVENKFEKRIKVLQTDWGEMGLTFLAQAAMPLSYWWDAFCSA
VFVINQLPTPVLDYLSPWEKAFKSSPDFEFLRTFGCAYFPCLWPYQSHKFQFHSSKCVFLGYSPSHKVLNSPPLSLPEPSTCVSPMPQPAIPAPSSEFQATSPSLGTTSP
IQHENEGSISICVDTASPSGSVSPIQPVVSLPVGHHMQTRSKSGIFKPKDWGSFLAASSDDVVIEPVLVKDALQSAIWKSAIDQEIAALHRNKTWTLVPPPSNVNLIGSK
WVFKVKRKSDGSFDQCKARLVAKGFNQIPGVDFHETFSPVVKAPTIQIILTVAVSKSWPVRKLDVNNAILNGKLDEDVFMKQPIGFIDKDHPSYVCKLDKALYGLLQAPR
AWYDRYKATLITWRFSNSKADNYVFFFHSDQLTVLVFIYVDDILVTGDSIGFIDQSVHRLNTVFALKDIGPLSYYLGVEVHRIPSGMYLLQSKYITNLVERVGLSHIRPC
ATPVVTGKRFSTFEGTPMHDPTLYRSVLGALQYITHTRPDISFIVNKLSQFLQQPSDIHWQAAKRVLRYLKGTLDYGLFLPKSDDLSIIAYTDADWACNLDDRRSIAAHC
LFIGHSLVSWSSKKQTVVARSSTESEYRSLAQTAADVTWVQSLLTELGLSSSTIPLIWCDNVSAISLTQNPVFHSRSKHIELDMHFVRDKVLTKAVEIRYVPSSEQLADA
LTKPLSEGQFSSIRGKLGVLPSPSRLRGILPLGFSSPSGSKQPSGRGRPHPQPLPWAKAEDDSFGPEGAPTLCFVDLSLGSRKELFGLHGTILGTWGNGNLVSTKDKREE
DKNGERAEAVSLAFHARRVPSVAACLLHSSSALGLPWQA