; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G10160 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G10160
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRNA-directed DNA polymerase
Genome locationChr7:8228760..8234392
RNA-Seq ExpressionCSPI07G10160
SyntenyCSPI07G10160
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008233 - peptidase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR000571 - Zinc finger, CCCH-type
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAA7014963.1 unnamed protein product [Microthlaspi erraticum]1.3e-27152.15Show/hide
Query:  LLGLVVAGKPQEKECEITDP-RLESLFAEFPHLKNE--PQGLPPIRDIQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSCAVPA
        L  +VV G       E T P  +  +  +F  L  +  P  LPP+RDIQH IDLI G+SLPNL HYRMSP+E +IL + IEDLL+KG I+ S+S CAVP 
Subjt:  LLGLVVAGKPQEKECEITDP-RLESLFAEFPHLKNE--PQGLPPIRDIQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSCAVPA

Query:  LLTPKKDGSWRMCVDSRAINRITVRYRFPIPRIGDLLDQLGKAAIFSKIDLKSGYHQIRIRPGDEWKTTFKTNE----------GCSN------------
        LL PKK   WRMCVDSRAIN+IT++YRFPIPR+ D+LD+L  + IFSKIDL+SGYHQIRIRPGDEWKT FK+ +          G SN            
Subjt:  LLTPKKDGSWRMCVDSRAINRITVRYRFPIPRIGDLLDQLGKAAIFSKIDLKSGYHQIRIRPGDEWKTTFKTNE----------GCSN------------

Query:  ---------VVYFDDILVYNKNSEDHIQHLKKLFQVLTETELYINPKKCTFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFY
                 VVYFDDIL+Y+K  E+H+ HLK++ QVL E +LY+N KKCTF   ++VFLGF++ E  + ++ +K  AI+ WP   S+ E+++F GL +FY
Subjt:  ---------VVYFDDILVYNKNSEDHIQHLKKLFQVLTETELYINPKKCTFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFY

Query:  RRFIKNFSSIVAPLTDCLKKGNYKWDGNQQQSFEEIKRRLTSSPILQLPGFTSPFEVAVDACGTGIGAVF-----------EKLSTSRQSWSTYEQELYA
        RRF+++FS+I AP+T+CLKKG + W   Q +SF  IK +L ++P+L LP F   F+V  DA G GIGAV            EKLS +RQ WSTY+QE YA
Subjt:  RRFIKNFSSIVAPLTDCLKKGNYKWDGNQQQSFEEIKRRLTSSPILQLPGFTSPFEVAVDACGTGIGAVF-----------EKLSTSRQSWSTYEQELYA

Query:  LVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKSISRMHARWISFLQRFDFVIMHQSGKDNKLADALSRKSSLLTILSMEVEAFKHLPNLYKEDVDFSQM
        + RAL+QWEHYL+ +EF+L TDH +LK+L SQK I++MHARW+SFLQ+F F+I H+SG  NK+ADALSR++SLL  L+ E+  F+ L  LY+ D +F ++
Subjt:  LVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKSISRMHARWISFLQRFDFVIMHQSGKDNKLADALSRKSSLLTILSMEVEAFKHLPNLYKEDVDFSQM

Query:  WTKCNNFIKAEDFHIMEGYLFKGDQ--------------------------QDKTFEIISKRFYWPQLRRDCNSFVNRCPICQRAKGPSTNAGLYSPLPT
        W KCN    + DFHI +G+LFKGD+                          +DKT   + +R+YWP LRRD  + V RC ICQ +KG S N GLY PLP 
Subjt:  WTKCNNFIKAEDFHIMEGYLFKGDQ--------------------------QDKTFEIISKRFYWPQLRRDCNSFVNRCPICQRAKGPSTNAGLYSPLPT

Query:  PISIWGDLPIDFVLGLPKTQRQYDSVMVVVDRFSKMTHFVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWNKFDTTLKYSTTAH
        P  IW DL +DFVLGLP+TQR  DSV VVVDRFSKMTHF+ACKKT DA+ IA LFFRE+VRLHG+P TI+SDRD KFLSHFW TLW  F TTLK S+TAH
Subjt:  PISIWGDLPIDFVLGLPKTQRQYDSVMVVVDRFSKMTHFVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWNKFDTTLKYSTTAH

Query:  PQIDGQTEVTNRTLGNLIRCLSGSKPKQWDLALAQAEFAFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENIQKLHEEVHSHLKE
        PQ DGQTEVTNRTLGN+IR + G +PKQWDLAL Q EFA+N+  + +T K PF +V+T  P+   DL  LP     S  AE M E I    E V + L+ 
Subjt:  PQIDGQTEVTNRTLGNLIRCLSGSKPKQWDLALAQAEFAFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENIQKLHEEVHSHLKE

Query:  STQFYKEAADRKRRQATFTEGDLVMIHLRRNRFPTGTYNKLKDRQLGPFCVLKKIRDNAYKIELPPDLHIHPIFNVVDLKQYYAPDDFYARGRAKPTNSN
        + Q  K AAD++RR   F EGD VM+ LR+ RFP GTY KL+  + GPF VL+KI DNAY ++LP +++I   FNV D+ +Y+A    Y    ++ ++S 
Subjt:  STQFYKEAADRKRRQATFTEGDLVMIHLRRNRFPTGTYNKLKDRQLGPFCVLKKIRDNAYKIELPPDLHIHPIFNVVDLKQYYAPDDFYARGRAKPTNSN

Query:  NERSN
         E ++
Subjt:  NERSN

CAA7028195.1 unnamed protein product [Microthlaspi erraticum]1.3e-27152.15Show/hide
Query:  LLGLVVAGKPQEKECEITDP-RLESLFAEFPHLKNE--PQGLPPIRDIQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSCAVPA
        L  +VV G       E T P  +  +  +F  L  +  P  LPP+RDIQH IDLI G+SLPNL HYRMSP+E +IL + IEDLL+KG I+ S+S CAVP 
Subjt:  LLGLVVAGKPQEKECEITDP-RLESLFAEFPHLKNE--PQGLPPIRDIQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSCAVPA

Query:  LLTPKKDGSWRMCVDSRAINRITVRYRFPIPRIGDLLDQLGKAAIFSKIDLKSGYHQIRIRPGDEWKTTFKTNE----------GCSN------------
        LL PKK   WRMCVDSRAIN+IT++YRFPIPR+ D+LD+L  + IFSKIDL+SGYHQIRIRPGDEWKT FK+ +          G SN            
Subjt:  LLTPKKDGSWRMCVDSRAINRITVRYRFPIPRIGDLLDQLGKAAIFSKIDLKSGYHQIRIRPGDEWKTTFKTNE----------GCSN------------

Query:  ---------VVYFDDILVYNKNSEDHIQHLKKLFQVLTETELYINPKKCTFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFY
                 VVYFDDIL+Y+K  E+H+ HLK++ QVL E +LY+N KKCTF   ++VFLGF++ E  + ++ +K  AI+ WP   S+ E+++F GL +FY
Subjt:  ---------VVYFDDILVYNKNSEDHIQHLKKLFQVLTETELYINPKKCTFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFY

Query:  RRFIKNFSSIVAPLTDCLKKGNYKWDGNQQQSFEEIKRRLTSSPILQLPGFTSPFEVAVDACGTGIGAVF-----------EKLSTSRQSWSTYEQELYA
        RRF+++FS+I AP+T+CLKKG + W   Q +SF  IK +L ++P+L LP F   F+V  DA G GIGAV            EKLS +RQ WSTY+QE YA
Subjt:  RRFIKNFSSIVAPLTDCLKKGNYKWDGNQQQSFEEIKRRLTSSPILQLPGFTSPFEVAVDACGTGIGAVF-----------EKLSTSRQSWSTYEQELYA

Query:  LVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKSISRMHARWISFLQRFDFVIMHQSGKDNKLADALSRKSSLLTILSMEVEAFKHLPNLYKEDVDFSQM
        + RAL+QWEHYL+ +EF+L TDH +LK+L SQK I++MHARW+SFLQ+F F+I H+SG  NK+ADALSR++SLL  L+ E+  F+ L  LY+ D +F ++
Subjt:  LVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKSISRMHARWISFLQRFDFVIMHQSGKDNKLADALSRKSSLLTILSMEVEAFKHLPNLYKEDVDFSQM

Query:  WTKCNNFIKAEDFHIMEGYLFKGDQ--------------------------QDKTFEIISKRFYWPQLRRDCNSFVNRCPICQRAKGPSTNAGLYSPLPT
        W KCN    + DFHI +G+LFKGD+                          +DKT   + +R+YWP LRRD  + V RC ICQ +KG S N GLY PLP 
Subjt:  WTKCNNFIKAEDFHIMEGYLFKGDQ--------------------------QDKTFEIISKRFYWPQLRRDCNSFVNRCPICQRAKGPSTNAGLYSPLPT

Query:  PISIWGDLPIDFVLGLPKTQRQYDSVMVVVDRFSKMTHFVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWNKFDTTLKYSTTAH
        P  IW DL +DFVLGLP+TQR  DSV VVVDRFSKMTHF+ACKKT DA+ IA LFFRE+VRLHG+P TI+SDRD KFLSHFW TLW  F TTLK S+TAH
Subjt:  PISIWGDLPIDFVLGLPKTQRQYDSVMVVVDRFSKMTHFVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWNKFDTTLKYSTTAH

Query:  PQIDGQTEVTNRTLGNLIRCLSGSKPKQWDLALAQAEFAFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENIQKLHEEVHSHLKE
        PQ DGQTEVTNRTLGN+IR + G +PKQWDLAL Q EFA+N+  + +T K PF +V+T  P+   DL  LP     S  AE M E I    E V + L+ 
Subjt:  PQIDGQTEVTNRTLGNLIRCLSGSKPKQWDLALAQAEFAFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENIQKLHEEVHSHLKE

Query:  STQFYKEAADRKRRQATFTEGDLVMIHLRRNRFPTGTYNKLKDRQLGPFCVLKKIRDNAYKIELPPDLHIHPIFNVVDLKQYYAPDDFYARGRAKPTNSN
        + Q  K AAD++RR   F EGD VM+ LR+ RFP GTY KL+  + GPF VL+KI DNAY ++LP +++I   FNV D+ +Y+A    Y    ++ ++S 
Subjt:  STQFYKEAADRKRRQATFTEGDLVMIHLRRNRFPTGTYNKLKDRQLGPFCVLKKIRDNAYKIELPPDLHIHPIFNVVDLKQYYAPDDFYARGRAKPTNSN

Query:  NERSN
         E ++
Subjt:  NERSN

KAG7588770.1 Integrase catalytic core [Arabidopsis suecica]1.3e-26651.21Show/hide
Query:  LVVAGKPQEKECEITDPR-LESLFAEFPHLKNE--PQGLPPIRDIQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSCAVPALLT
        +VV G       E T PR +  +  ++  L  E  P  LPP+RDIQH IDLI G+SLPNL HYRMSP+E +I+   IEDLL+KG I+ S+S CAVP LL 
Subjt:  LVVAGKPQEKECEITDPR-LESLFAEFPHLKNE--PQGLPPIRDIQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSCAVPALLT

Query:  PKKDGSWRMCVDSRAINRITVRYRFPIPRIGDLLDQLGKAAIFSKIDLKSGYHQIRIRPGDEWKTTFKTNE----------GCSN---------------
        PKK   WRMCVDSRAIN+IT++YRFPIPR+ D+LD+L  + +FSKIDL+SGYHQIRIR GDEWKT FK+ +          G SN               
Subjt:  PKKDGSWRMCVDSRAINRITVRYRFPIPRIGDLLDQLGKAAIFSKIDLKSGYHQIRIRPGDEWKTTFKTNE----------GCSN---------------

Query:  ------VVYFDDILVYNKNSEDHIQHLKKLFQVLTETELYINPKKCTFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFYRRF
              VVYFDDIL+Y+K  +DH++H++++ QVL E +LY+N KKCTF   +++FLGF++ E  + ++  K  AI+ WPV  +  E+++F GLA+FYRRF
Subjt:  ------VVYFDDILVYNKNSEDHIQHLKKLFQVLTETELYINPKKCTFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFYRRF

Query:  IKNFSSIVAPLTDCLKKGNYKWDGNQQQSFEEIKRRLTSSPILQLPGFTSPFEVAVDACGTGIGAVF-----------EKLSTSRQSWSTYEQELYALVR
        +++FS+I AP+T+CLKKG + W   Q +SF  IK +L ++P+L LP F   F+V  DA G GIGAV            EKLS +RQ WSTY+QE YA+ R
Subjt:  IKNFSSIVAPLTDCLKKGNYKWDGNQQQSFEEIKRRLTSSPILQLPGFTSPFEVAVDACGTGIGAVF-----------EKLSTSRQSWSTYEQELYALVR

Query:  ALKQWEHYLLSKEFVLLTDHFSLKYLQSQKSISRMHARWISFLQRFDFVIMHQSGKDNKLADALSRKSSLLTILSMEVEAFKHLPNLYKEDVDFSQMWTK
        AL+QWEHYL+ +EF+L TDH +LK+L SQK I++MHARW+SFLQ+F F+I H+SG  NK+ADALSR++SLLT L+ E+  F+ L  LY+ D +F ++W K
Subjt:  ALKQWEHYLLSKEFVLLTDHFSLKYLQSQKSISRMHARWISFLQRFDFVIMHQSGKDNKLADALSRKSSLLTILSMEVEAFKHLPNLYKEDVDFSQMWTK

Query:  CNNFIKAEDFHIMEGYLFKGDQ--------------------------QDKTFEIISKRFYWPQLRRDCNSFVNRCPICQRAKGPSTNAGLYSPLPTPIS
        CN    + DFHI EGYLFKGD+                          +DKT   + +R+YWP LR+D  + V RC +CQ +KG S N GLY PL  P  
Subjt:  CNNFIKAEDFHIMEGYLFKGDQ--------------------------QDKTFEIISKRFYWPQLRRDCNSFVNRCPICQRAKGPSTNAGLYSPLPTPIS

Query:  IWGDLPIDFVLGLPKTQRQYDSVMVVVDRFSKMTHFVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWNKFDTTLKYSTTAHPQI
        IW DL +DFVLGLP+TQR  DSV VVVD+FSKMTHF+AC+KT DAT IA LFFRE+VRLHG+P +IVSDRD KFLSHFW TLW  F T+LK S+TAHPQ 
Subjt:  IWGDLPIDFVLGLPKTQRQYDSVMVVVDRFSKMTHFVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWNKFDTTLKYSTTAHPQI

Query:  DGQTEVTNRTLGNLIRCLSGSKPKQWDLALAQAEFAFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENIQKLHEEVHSHLKESTQ
        DGQTEVTNRTLGN+IR + G KPKQWDLAL Q EFA+N+  + +T K PF +V+T  P+   DL  LP     S+ A+ M ++I    E V + L+ + Q
Subjt:  DGQTEVTNRTLGNLIRCLSGSKPKQWDLALAQAEFAFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENIQKLHEEVHSHLKESTQ

Query:  FYKEAADRKRRQATFTEGDLVMIHLRRNRFPTGTYNKLKDRQLGPFCVLKKIRDNAYKIELPPDLHIHPIFNVVDLKQYYAPDDFYARGRAKPTNSNNER
          K AAD+K+R   F EGD VM+ L++ RFP GTY KL+ R+ GPF +L+K+ DNAY ++LP D+ I   FNV D+ +Y+A    Y   +   T+S    
Subjt:  FYKEAADRKRRQATFTEGDLVMIHLRRNRFPTGTYNKLKDRQLGPFCVLKKIRDNAYKIELPPDLHIHPIFNVVDLKQYYAPDDFYARGRAKPTNSNNER

Query:  SNDQASGSTT
          D   G  T
Subjt:  SNDQASGSTT

PWA81295.1 transposon Ty3-I Gag-Pol polyprotein [Artemisia annua]7.2e-27052.59Show/hide
Query:  PQGLPPIRDIQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSCAVPALLTPKKDGSWRMCVDSRAINRITVRYRFPIPRIGDLLD
        P  LPP+R+IQHQIDL+ GASLPNL HYRMSP+E  IL + +E+LLRKGHI+ S+S CAVPALLTPKKDGSWRMCVDSRAIN+ITVRYRFPIPR+ DLLD
Subjt:  PQGLPPIRDIQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSCAVPALLTPKKDGSWRMCVDSRAINRITVRYRFPIPRIGDLLD

Query:  QLGKAAIFSKIDLKSGYHQIRIRPGDEWKTTFKTNE----------GCSN---------------------VVYFDDILVYNKNSEDHIQHLKKLFQVLT
        QL  A +FSKIDL+SGYHQIRI+PGDEWKT FKT +          G SN                     VVYFDDILVY++  ++H+ HL+K+ + LT
Subjt:  QLGKAAIFSKIDLKSGYHQIRIRPGDEWKTTFKTNE----------GCSN---------------------VVYFDDILVYNKNSEDHIQHLKKLFQVLT

Query:  ETELYINPKKCTFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFYRRFIKNFSSIVAPLTDCLKKGNYKWDGNQQQSFEEIKR
        E EL++N KKCTFL  +++FLG+++    + ++  K +A++ WP   ++ E+++F GLA+FYRRF++NFSSIVAP+T+C+KKG +KW    ++SF+ IK 
Subjt:  ETELYINPKKCTFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFYRRFIKNFSSIVAPLTDCLKKGNYKWDGNQQQSFEEIKR

Query:  RLTSSPILQLPGFTSPFEVAVDACGTGIGAVF-----------EKLSTSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKSISRM
        RLT++P+L LP F + FE+  DACGTGIGAV            EKL+ +RQ WSTYEQELYA+V+A+K+WEHYL+ +EFV+ +DH +LKY Q+Q+ ++++
Subjt:  RLTSSPILQLPGFTSPFEVAVDACGTGIGAVF-----------EKLSTSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKSISRM

Query:  HARWISFLQRFDFVIMHQSGKDNKLADALSRKSSLLTILSMEVEAFKHLPNLYKEDVDFSQMWTKCNNFIKAEDFHIMEGYLFKGDQ-------------
        HARW SFL++F++VI H+SG  NK+ADALSRK++LL  +S +V  F+ +  LY+ D DF   W +        +F +++GYLFKG++             
Subjt:  HARWISFLQRFDFVIMHQSGKDNKLADALSRKSSLLTILSMEVEAFKHLPNLYKEDVDFSQMWTKCNNFIKAEDFHIMEGYLFKGDQ-------------

Query:  -------------QDKTFEIISKRFYWPQLRRDCNSFVNRCPICQRAKGPSTNAGLYSPLPTPISIWGDLPIDFVLGLPKTQRQYDSVMVVVDRFSKMTH
                     +DKT   +  RFYWPQL+RD  SFV RC +CQ  KG + N GLY PLP P S W D+ +DFVLGLP+TQR  DSV VVVDRFSKM H
Subjt:  -------------QDKTFEIISKRFYWPQLRRDCNSFVNRCPICQRAKGPSTNAGLYSPLPTPISIWGDLPIDFVLGLPKTQRQYDSVMVVVDRFSKMTH

Query:  FVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWNKFDTTLKYSTTAHPQIDGQTEVTNRTLGNLIRCLSGSKPKQWDLALAQAEF
        F+ CKKT+DA +IA LFF+E+VRLHG+P +I SDRD KFL+HFW TLW +  T+L +S+TAHPQ DGQTEV NRTLGN+IRCL G KPK WD++LAQAEF
Subjt:  FVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWNKFDTTLKYSTTAHPQIDGQTEVTNRTLGNLIRCLSGSKPKQWDLALAQAEF

Query:  AFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENIQKLHEEVHSHLKESTQFYKEAADRKRRQATFTEGDLVMIHLRRNRFPTGTY
        A+N+  + ST   PF+VV+   PR   DL  LP   +   +A KMVE +Q  HE V + + ES   YK AAD+ RR   F  GD VM+ LR+ RFP GTY
Subjt:  AFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENIQKLHEEVHSHLKESTQFYKEAADRKRRQATFTEGDLVMIHLRRNRFPTGTY

Query:  NKLKDRQLGPFCVLKKIRDNAYKIELPPDLHIHPIFNVVDLKQYYAPDDFYARGRAKPTNSNNERSNDQ
        +KL+ ++ GP+ +L+KI DNAY ++LP  + I   FNV D+ +++ P+D    G    T+S+    ND+
Subjt:  NKLKDRQLGPFCVLKKIRDNAYKIELPPDLHIHPIFNVVDLKQYYAPDDFYARGRAKPTNSNNERSNDQ

XP_025979678.1 uncharacterized protein LOC112997809 [Glycine max]9.8e-26752.68Show/hide
Query:  PQGLPPIRDIQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSCAVPALLTPKKDGSWRMCVDSRAINRITVRYRFPIPRIGDLLD
        P  LPP+RDIQHQIDLI G+SLPNL HYRMSP+E +IL + IEDLLRKG I+ S+S CAVP LL PKK   WRMCVDSRAIN+IT++YRFPIPR+ D+LD
Subjt:  PQGLPPIRDIQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSCAVPALLTPKKDGSWRMCVDSRAINRITVRYRFPIPRIGDLLD

Query:  QLGKAAIFSKIDLKSGYHQIRIRPGDEWKTTFKTNE----------GCSN---------------------VVYFDDILVYNKNSEDHIQHLKKLFQVLT
        +L  + +FSKIDL+SGYHQIRIRPGDEWKT FK+ +          G SN                     VVYFDDIL+Y+K  E+H++H++ + QVL 
Subjt:  QLGKAAIFSKIDLKSGYHQIRIRPGDEWKTTFKTNE----------GCSN---------------------VVYFDDILVYNKNSEDHIQHLKKLFQVLT

Query:  ETELYINPKKCTFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFYRRFIKNFSSIVAPLTDCLKKGNYKWDGNQQQSFEEIKR
        E +LYIN KKCTF   +++FLGF++ E  + ++ +K  AI+ WP  TS+ E+++F GLA+FYRRFI++FS+I AP+T+CLKKG Y W   Q+QSF  IK 
Subjt:  ETELYINPKKCTFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFYRRFIKNFSSIVAPLTDCLKKGNYKWDGNQQQSFEEIKR

Query:  RLTSSPILQLPGFTSPFEVAVDACGTGIGAVF-----------EKLSTSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKSISRM
        +L ++P+L LP F   F+V  DA G GIGAV            EKLS +R+ WSTY+QE YA+ RAL+QWEHYL+ +EF+L TDH +LK+L SQK I++M
Subjt:  RLTSSPILQLPGFTSPFEVAVDACGTGIGAVF-----------EKLSTSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKSISRM

Query:  HARWISFLQRFDFVIMHQSGKDNKLADALSRKSSLLTILSMEVEAFKHLPNLYKEDVDFSQMWTKCNNFIKAEDFHIMEGYLFKGDQ-------------
        HARW+SFLQ+F F+I H+SG  NK+ADALSR+ SLL  L+ EV  F+ L  LY+ D +F ++W KC      +DFH+ EG+LFKG++             
Subjt:  HARWISFLQRFDFVIMHQSGKDNKLADALSRKSSLLTILSMEVEAFKHLPNLYKEDVDFSQMWTKCNNFIKAEDFHIMEGYLFKGDQ-------------

Query:  -------------QDKTFEIISKRFYWPQLRRDCNSFVNRCPICQRAKGPSTNAGLYSPLPTPISIWGDLPIDFVLGLPKTQRQYDSVMVVVDRFSKMTH
                     +DKT   + +RFYWP LR+D  + V +C  CQ +KG S N GLY PLP P  IW DL +DFVLGLP+TQR  DSV VVVDRFSKM+H
Subjt:  -------------QDKTFEIISKRFYWPQLRRDCNSFVNRCPICQRAKGPSTNAGLYSPLPTPISIWGDLPIDFVLGLPKTQRQYDSVMVVVDRFSKMTH

Query:  FVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWNKFDTTLKYSTTAHPQIDGQTEVTNRTLGNLIRCLSGSKPKQWDLALAQAEF
        F+ACKKT DA+ IA LFFRE+V LHG+P +I SDRD KFLSHFW TLW  FDT+L  S+TAHPQ DGQTEVTNRTLGN+IRC+ G KPKQWDLAL Q EF
Subjt:  FVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWNKFDTTLKYSTTAHPQIDGQTEVTNRTLGNLIRCLSGSKPKQWDLALAQAEF

Query:  AFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENIQKLHEEVHSHLKESTQFYKEAADRKRRQATFTEGDLVMIHLRRNRFPTGTY
        A+N+  + +T K PF +V+T  PR   DL  LP     S  AE M E I  + + V S L+ +    K AAD+++R   F  GD VM+ LR+ RFP GTY
Subjt:  AFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENIQKLHEEVHSHLKESTQFYKEAADRKRRQATFTEGDLVMIHLRRNRFPTGTY

Query:  NKLKDRQLGPFCVLKKIRDNAYKIELPPDLHIHPIFNVVDLKQYYAPDDFYARGRAKPTNSNNERSNDQASGSTTFQ
        +KL+ R+ GPF V +KI DNAY + LP  ++I   FNV D+ +Y+A +  Y                D+ SGS++F+
Subjt:  NKLKDRQLGPFCVLKKIRDNAYKIELPPDLHIHPIFNVVDLKQYYAPDDFYARGRAKPTNSNNERSNDQASGSTTFQ

TrEMBL top hitse value%identityAlignment
A0A2U1P6A2 Transposon Ty3-I Gag-Pol polyprotein3.5e-27052.59Show/hide
Query:  PQGLPPIRDIQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSCAVPALLTPKKDGSWRMCVDSRAINRITVRYRFPIPRIGDLLD
        P  LPP+R+IQHQIDL+ GASLPNL HYRMSP+E  IL + +E+LLRKGHI+ S+S CAVPALLTPKKDGSWRMCVDSRAIN+ITVRYRFPIPR+ DLLD
Subjt:  PQGLPPIRDIQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSCAVPALLTPKKDGSWRMCVDSRAINRITVRYRFPIPRIGDLLD

Query:  QLGKAAIFSKIDLKSGYHQIRIRPGDEWKTTFKTNE----------GCSN---------------------VVYFDDILVYNKNSEDHIQHLKKLFQVLT
        QL  A +FSKIDL+SGYHQIRI+PGDEWKT FKT +          G SN                     VVYFDDILVY++  ++H+ HL+K+ + LT
Subjt:  QLGKAAIFSKIDLKSGYHQIRIRPGDEWKTTFKTNE----------GCSN---------------------VVYFDDILVYNKNSEDHIQHLKKLFQVLT

Query:  ETELYINPKKCTFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFYRRFIKNFSSIVAPLTDCLKKGNYKWDGNQQQSFEEIKR
        E EL++N KKCTFL  +++FLG+++    + ++  K +A++ WP   ++ E+++F GLA+FYRRF++NFSSIVAP+T+C+KKG +KW    ++SF+ IK 
Subjt:  ETELYINPKKCTFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFYRRFIKNFSSIVAPLTDCLKKGNYKWDGNQQQSFEEIKR

Query:  RLTSSPILQLPGFTSPFEVAVDACGTGIGAVF-----------EKLSTSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKSISRM
        RLT++P+L LP F + FE+  DACGTGIGAV            EKL+ +RQ WSTYEQELYA+V+A+K+WEHYL+ +EFV+ +DH +LKY Q+Q+ ++++
Subjt:  RLTSSPILQLPGFTSPFEVAVDACGTGIGAVF-----------EKLSTSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKSISRM

Query:  HARWISFLQRFDFVIMHQSGKDNKLADALSRKSSLLTILSMEVEAFKHLPNLYKEDVDFSQMWTKCNNFIKAEDFHIMEGYLFKGDQ-------------
        HARW SFL++F++VI H+SG  NK+ADALSRK++LL  +S +V  F+ +  LY+ D DF   W +        +F +++GYLFKG++             
Subjt:  HARWISFLQRFDFVIMHQSGKDNKLADALSRKSSLLTILSMEVEAFKHLPNLYKEDVDFSQMWTKCNNFIKAEDFHIMEGYLFKGDQ-------------

Query:  -------------QDKTFEIISKRFYWPQLRRDCNSFVNRCPICQRAKGPSTNAGLYSPLPTPISIWGDLPIDFVLGLPKTQRQYDSVMVVVDRFSKMTH
                     +DKT   +  RFYWPQL+RD  SFV RC +CQ  KG + N GLY PLP P S W D+ +DFVLGLP+TQR  DSV VVVDRFSKM H
Subjt:  -------------QDKTFEIISKRFYWPQLRRDCNSFVNRCPICQRAKGPSTNAGLYSPLPTPISIWGDLPIDFVLGLPKTQRQYDSVMVVVDRFSKMTH

Query:  FVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWNKFDTTLKYSTTAHPQIDGQTEVTNRTLGNLIRCLSGSKPKQWDLALAQAEF
        F+ CKKT+DA +IA LFF+E+VRLHG+P +I SDRD KFL+HFW TLW +  T+L +S+TAHPQ DGQTEV NRTLGN+IRCL G KPK WD++LAQAEF
Subjt:  FVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWNKFDTTLKYSTTAHPQIDGQTEVTNRTLGNLIRCLSGSKPKQWDLALAQAEF

Query:  AFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENIQKLHEEVHSHLKESTQFYKEAADRKRRQATFTEGDLVMIHLRRNRFPTGTY
        A+N+  + ST   PF+VV+   PR   DL  LP   +   +A KMVE +Q  HE V + + ES   YK AAD+ RR   F  GD VM+ LR+ RFP GTY
Subjt:  AFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENIQKLHEEVHSHLKESTQFYKEAADRKRRQATFTEGDLVMIHLRRNRFPTGTY

Query:  NKLKDRQLGPFCVLKKIRDNAYKIELPPDLHIHPIFNVVDLKQYYAPDDFYARGRAKPTNSNNERSNDQ
        +KL+ ++ GP+ +L+KI DNAY ++LP  + I   FNV D+ +++ P+D    G    T+S+    ND+
Subjt:  NKLKDRQLGPFCVLKKIRDNAYKIELPPDLHIHPIFNVVDLKQYYAPDDFYARGRAKPTNSNNERSNDQ

A0A5B7BER3 Uncharacterized protein6.4e-28854.51Show/hide
Query:  LLGLVVAGKPQEKECEITDPRLESLFAEFPHL--KNEPQGLPPIRDIQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSCAVPAL
        ++ ++V GK   +  ++ +  L+ L AEF  +     P  LPP+RDIQH IDL+ GASLPNL HYRMSP+E +IL   +EDL+ KG I+ S+S CAVPAL
Subjt:  LLGLVVAGKPQEKECEITDPRLESLFAEFPHL--KNEPQGLPPIRDIQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSCAVPAL

Query:  LTPKKDGSWRMCVDSRAINRITVRYRFPIPRIGDLLDQLGKAAIFSKIDLKSGYHQIRIRPGDEWKTTFKTNE----------GCSN-------------
        LTPKKDGSWRMCVDSRAIN+ITV+YRFPIPR+ D+LD L  + IFSKIDL+SGYHQIRIRPGDEWKT FKT E          G SN             
Subjt:  LTPKKDGSWRMCVDSRAINRITVRYRFPIPRIGDLLDQLGKAAIFSKIDLKSGYHQIRIRPGDEWKTTFKTNE----------GCSN-------------

Query:  --------VVYFDDILVYNKNSEDHIQHLKKLFQVLTETELYINPKKCTFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFYR
                VVYFDDIL+Y+K+  +H++H++++   L E++LYIN KKC FL   ++FLGF+I    + ++ +K  AI+ WP   ++ ++++F GLA+FYR
Subjt:  --------VVYFDDILVYNKNSEDHIQHLKKLFQVLTETELYINPKKCTFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFYR

Query:  RFIKNFSSIVAPLTDCLKKGNYKWDGNQQQSFEEIKRRLTSSPILQLPGFTSPFEVAVDACGTGIGAVF-----------EKLSTSRQSWSTYEQELYAL
        RFI+NFSSIVAP+TDC+KKG ++W+ +Q+ SF  IK +L+++P+L LP F   F+V  DA  TGIGAV            EKL+ +RQ W+TYE EL+A+
Subjt:  RFIKNFSSIVAPLTDCLKKGNYKWDGNQQQSFEEIKRRLTSSPILQLPGFTSPFEVAVDACGTGIGAVF-----------EKLSTSRQSWSTYEQELYAL

Query:  VRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKSISRMHARWISFLQRFDFVIMHQSGKDNKLADALSRKSSLLTILSMEVEAFKHLPNLYKEDVDFSQMW
        VRALK WEHYL+ +EFV+ +DH +LK++ +Q S+SRMH RWI+FLQRF FV+ H++G+ NK+ADALSR+++LL ++S E+ +F+ L  LY+ED DF Q W
Subjt:  VRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKSISRMHARWISFLQRFDFVIMHQSGKDNKLADALSRKSSLLTILSMEVEAFKHLPNLYKEDVDFSQMW

Query:  TKCNNFIKAEDFHIMEGYLFKGDQ--------------------------QDKTFEIISKRFYWPQLRRDCNSFVNRCPICQRAKGPSTNAGLYSPLPTP
         KC     + +FHI +GYLFKG+Q                          +DKT  ++ +R+YWPQL+RD   FV +CPICQ AKG + N GLY+PLP P
Subjt:  TKCNNFIKAEDFHIMEGYLFKGDQ--------------------------QDKTFEIISKRFYWPQLRRDCNSFVNRCPICQRAKGPSTNAGLYSPLPTP

Query:  ISIWGDLPIDFVLGLPKTQRQYDSVMVVVDRFSKMTHFVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWNKFDTTLKYSTTAHP
          IW DL +DF+LGLP+TQR  DSV VVVDRFSKM HF+ CKKT+DA+++ANLFFREIVRLHG+P +I SDRD KFLSHFW+TLW KFDT+L+YS+TAHP
Subjt:  ISIWGDLPIDFVLGLPKTQRQYDSVMVVVDRFSKMTHFVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWNKFDTTLKYSTTAHP

Query:  QIDGQTEVTNRTLGNLIRCLSGSKPKQWDLALAQAEFAFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENIQKLHEEVHSHLKES
        Q DGQTEVTNRTLGNLIRC SG +PKQWD+ L Q EFA+N M NRST K PFE+V+TK P+   DLA LP    +S  AE   +    + EEV  +L+++
Subjt:  QIDGQTEVTNRTLGNLIRCLSGSKPKQWDLALAQAEFAFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENIQKLHEEVHSHLKES

Query:  TQFYKEAADRKRRQATFTEGDLVMIHLRRNRFPTGTYNKLKDRQLGPFCVLKKIRDNAYKIELPPDLHIHPIFNVVDLKQYYAPDD
           YK AAD+ RR   FTEGDLVM+ LR+NRFP GTYNKLK+R+ GPF V +KI DNAY +ELP D+ I   FNV DL +Y+ PD+
Subjt:  TQFYKEAADRKRRQATFTEGDLVMIHLRRNRFPTGTYNKLKDRQLGPFCVLKKIRDNAYKIELPPDLHIHPIFNVVDLKQYYAPDD

A0A6D2HLB5 Reverse transcriptase6.4e-27252.15Show/hide
Query:  LLGLVVAGKPQEKECEITDP-RLESLFAEFPHLKNE--PQGLPPIRDIQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSCAVPA
        L  +VV G       E T P  +  +  +F  L  +  P  LPP+RDIQH IDLI G+SLPNL HYRMSP+E +IL + IEDLL+KG I+ S+S CAVP 
Subjt:  LLGLVVAGKPQEKECEITDP-RLESLFAEFPHLKNE--PQGLPPIRDIQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSCAVPA

Query:  LLTPKKDGSWRMCVDSRAINRITVRYRFPIPRIGDLLDQLGKAAIFSKIDLKSGYHQIRIRPGDEWKTTFKTNE----------GCSN------------
        LL PKK   WRMCVDSRAIN+IT++YRFPIPR+ D+LD+L  + IFSKIDL+SGYHQIRIRPGDEWKT FK+ +          G SN            
Subjt:  LLTPKKDGSWRMCVDSRAINRITVRYRFPIPRIGDLLDQLGKAAIFSKIDLKSGYHQIRIRPGDEWKTTFKTNE----------GCSN------------

Query:  ---------VVYFDDILVYNKNSEDHIQHLKKLFQVLTETELYINPKKCTFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFY
                 VVYFDDIL+Y+K  E+H+ HLK++ QVL E +LY+N KKCTF   ++VFLGF++ E  + ++ +K  AI+ WP   S+ E+++F GL +FY
Subjt:  ---------VVYFDDILVYNKNSEDHIQHLKKLFQVLTETELYINPKKCTFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFY

Query:  RRFIKNFSSIVAPLTDCLKKGNYKWDGNQQQSFEEIKRRLTSSPILQLPGFTSPFEVAVDACGTGIGAVF-----------EKLSTSRQSWSTYEQELYA
        RRF+++FS+I AP+T+CLKKG + W   Q +SF  IK +L ++P+L LP F   F+V  DA G GIGAV            EKLS +RQ WSTY+QE YA
Subjt:  RRFIKNFSSIVAPLTDCLKKGNYKWDGNQQQSFEEIKRRLTSSPILQLPGFTSPFEVAVDACGTGIGAVF-----------EKLSTSRQSWSTYEQELYA

Query:  LVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKSISRMHARWISFLQRFDFVIMHQSGKDNKLADALSRKSSLLTILSMEVEAFKHLPNLYKEDVDFSQM
        + RAL+QWEHYL+ +EF+L TDH +LK+L SQK I++MHARW+SFLQ+F F+I H+SG  NK+ADALSR++SLL  L+ E+  F+ L  LY+ D +F ++
Subjt:  LVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKSISRMHARWISFLQRFDFVIMHQSGKDNKLADALSRKSSLLTILSMEVEAFKHLPNLYKEDVDFSQM

Query:  WTKCNNFIKAEDFHIMEGYLFKGDQ--------------------------QDKTFEIISKRFYWPQLRRDCNSFVNRCPICQRAKGPSTNAGLYSPLPT
        W KCN    + DFHI +G+LFKGD+                          +DKT   + +R+YWP LRRD  + V RC ICQ +KG S N GLY PLP 
Subjt:  WTKCNNFIKAEDFHIMEGYLFKGDQ--------------------------QDKTFEIISKRFYWPQLRRDCNSFVNRCPICQRAKGPSTNAGLYSPLPT

Query:  PISIWGDLPIDFVLGLPKTQRQYDSVMVVVDRFSKMTHFVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWNKFDTTLKYSTTAH
        P  IW DL +DFVLGLP+TQR  DSV VVVDRFSKMTHF+ACKKT DA+ IA LFFRE+VRLHG+P TI+SDRD KFLSHFW TLW  F TTLK S+TAH
Subjt:  PISIWGDLPIDFVLGLPKTQRQYDSVMVVVDRFSKMTHFVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWNKFDTTLKYSTTAH

Query:  PQIDGQTEVTNRTLGNLIRCLSGSKPKQWDLALAQAEFAFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENIQKLHEEVHSHLKE
        PQ DGQTEVTNRTLGN+IR + G +PKQWDLAL Q EFA+N+  + +T K PF +V+T  P+   DL  LP     S  AE M E I    E V + L+ 
Subjt:  PQIDGQTEVTNRTLGNLIRCLSGSKPKQWDLALAQAEFAFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENIQKLHEEVHSHLKE

Query:  STQFYKEAADRKRRQATFTEGDLVMIHLRRNRFPTGTYNKLKDRQLGPFCVLKKIRDNAYKIELPPDLHIHPIFNVVDLKQYYAPDDFYARGRAKPTNSN
        + Q  K AAD++RR   F EGD VM+ LR+ RFP GTY KL+  + GPF VL+KI DNAY ++LP +++I   FNV D+ +Y+A    Y    ++ ++S 
Subjt:  STQFYKEAADRKRRQATFTEGDLVMIHLRRNRFPTGTYNKLKDRQLGPFCVLKKIRDNAYKIELPPDLHIHPIFNVVDLKQYYAPDDFYARGRAKPTNSN

Query:  NERSN
         E ++
Subjt:  NERSN

A0A6D2IKM3 Reverse transcriptase6.4e-27252.15Show/hide
Query:  LLGLVVAGKPQEKECEITDP-RLESLFAEFPHLKNE--PQGLPPIRDIQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSCAVPA
        L  +VV G       E T P  +  +  +F  L  +  P  LPP+RDIQH IDLI G+SLPNL HYRMSP+E +IL + IEDLL+KG I+ S+S CAVP 
Subjt:  LLGLVVAGKPQEKECEITDP-RLESLFAEFPHLKNE--PQGLPPIRDIQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSCAVPA

Query:  LLTPKKDGSWRMCVDSRAINRITVRYRFPIPRIGDLLDQLGKAAIFSKIDLKSGYHQIRIRPGDEWKTTFKTNE----------GCSN------------
        LL PKK   WRMCVDSRAIN+IT++YRFPIPR+ D+LD+L  + IFSKIDL+SGYHQIRIRPGDEWKT FK+ +          G SN            
Subjt:  LLTPKKDGSWRMCVDSRAINRITVRYRFPIPRIGDLLDQLGKAAIFSKIDLKSGYHQIRIRPGDEWKTTFKTNE----------GCSN------------

Query:  ---------VVYFDDILVYNKNSEDHIQHLKKLFQVLTETELYINPKKCTFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFY
                 VVYFDDIL+Y+K  E+H+ HLK++ QVL E +LY+N KKCTF   ++VFLGF++ E  + ++ +K  AI+ WP   S+ E+++F GL +FY
Subjt:  ---------VVYFDDILVYNKNSEDHIQHLKKLFQVLTETELYINPKKCTFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFY

Query:  RRFIKNFSSIVAPLTDCLKKGNYKWDGNQQQSFEEIKRRLTSSPILQLPGFTSPFEVAVDACGTGIGAVF-----------EKLSTSRQSWSTYEQELYA
        RRF+++FS+I AP+T+CLKKG + W   Q +SF  IK +L ++P+L LP F   F+V  DA G GIGAV            EKLS +RQ WSTY+QE YA
Subjt:  RRFIKNFSSIVAPLTDCLKKGNYKWDGNQQQSFEEIKRRLTSSPILQLPGFTSPFEVAVDACGTGIGAVF-----------EKLSTSRQSWSTYEQELYA

Query:  LVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKSISRMHARWISFLQRFDFVIMHQSGKDNKLADALSRKSSLLTILSMEVEAFKHLPNLYKEDVDFSQM
        + RAL+QWEHYL+ +EF+L TDH +LK+L SQK I++MHARW+SFLQ+F F+I H+SG  NK+ADALSR++SLL  L+ E+  F+ L  LY+ D +F ++
Subjt:  LVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKSISRMHARWISFLQRFDFVIMHQSGKDNKLADALSRKSSLLTILSMEVEAFKHLPNLYKEDVDFSQM

Query:  WTKCNNFIKAEDFHIMEGYLFKGDQ--------------------------QDKTFEIISKRFYWPQLRRDCNSFVNRCPICQRAKGPSTNAGLYSPLPT
        W KCN    + DFHI +G+LFKGD+                          +DKT   + +R+YWP LRRD  + V RC ICQ +KG S N GLY PLP 
Subjt:  WTKCNNFIKAEDFHIMEGYLFKGDQ--------------------------QDKTFEIISKRFYWPQLRRDCNSFVNRCPICQRAKGPSTNAGLYSPLPT

Query:  PISIWGDLPIDFVLGLPKTQRQYDSVMVVVDRFSKMTHFVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWNKFDTTLKYSTTAH
        P  IW DL +DFVLGLP+TQR  DSV VVVDRFSKMTHF+ACKKT DA+ IA LFFRE+VRLHG+P TI+SDRD KFLSHFW TLW  F TTLK S+TAH
Subjt:  PISIWGDLPIDFVLGLPKTQRQYDSVMVVVDRFSKMTHFVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWNKFDTTLKYSTTAH

Query:  PQIDGQTEVTNRTLGNLIRCLSGSKPKQWDLALAQAEFAFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENIQKLHEEVHSHLKE
        PQ DGQTEVTNRTLGN+IR + G +PKQWDLAL Q EFA+N+  + +T K PF +V+T  P+   DL  LP     S  AE M E I    E V + L+ 
Subjt:  PQIDGQTEVTNRTLGNLIRCLSGSKPKQWDLALAQAEFAFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENIQKLHEEVHSHLKE

Query:  STQFYKEAADRKRRQATFTEGDLVMIHLRRNRFPTGTYNKLKDRQLGPFCVLKKIRDNAYKIELPPDLHIHPIFNVVDLKQYYAPDDFYARGRAKPTNSN
        + Q  K AAD++RR   F EGD VM+ LR+ RFP GTY KL+  + GPF VL+KI DNAY ++LP +++I   FNV D+ +Y+A    Y    ++ ++S 
Subjt:  STQFYKEAADRKRRQATFTEGDLVMIHLRRNRFPTGTYNKLKDRQLGPFCVLKKIRDNAYKIELPPDLHIHPIFNVVDLKQYYAPDDFYARGRAKPTNSN

Query:  NERSN
         E ++
Subjt:  NERSN

A0A6N2LVR1 Uncharacterized protein1.8e-27450.98Show/hide
Query:  EREQDLLGLVVAGKPQEKECEITDPRLESLFAEFPHLKNE--PQGLPPIRDIQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSC
        +++ ++  +++ G+ +     I +  L++L AEF  +  E  P+GLPP+RDIQH IDLI GASLPN  HYRMSP E  IL   +E+L++KG ++ S+S C
Subjt:  EREQDLLGLVVAGKPQEKECEITDPRLESLFAEFPHLKNE--PQGLPPIRDIQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSC

Query:  AVPALLTPKKDGSWRMCVDSRAINRITVRYRFPIPRIGDLLDQLGKAAIFSKIDLKSGYHQIRIRPGDEWKTTFKTNE----------GCSN--------
        AVPALL PKKDGSWRMC+DSRAIN+IT++YRFPIPR+ D+LD L  + IFSKIDL+SGYHQIRIRPGDEWKT FKT E          G SN        
Subjt:  AVPALLTPKKDGSWRMCVDSRAINRITVRYRFPIPRIGDLLDQLGKAAIFSKIDLKSGYHQIRIRPGDEWKTTFKTNE----------GCSN--------

Query:  -------------VVYFDDILVYNKNSEDHIQHLKKLFQVLTETELYINPKKCTFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGL
                     VVYFDDIL+Y+++  DHI HL+++F VL   +L++N  KC F+   +VFLGF++    + ++ +K  AI+ WP   +I E+++F GL
Subjt:  -------------VVYFDDILVYNKNSEDHIQHLKKLFQVLTETELYINPKKCTFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGL

Query:  ASFYRRFIKNFSSIVAPLTDCLKKGNYKWDGNQQQSFEEIKRRLTSSPILQLPGFTSPFEVAVDACGTGIGAVF-----------EKLSTSRQSWSTYEQ
        A+FYRRF+++FS IVAP+T+C+KKG + W    + SF  IK +L S+P+L LP F   FEV  DA   GIGAV            EKLS +R+ WSTYE 
Subjt:  ASFYRRFIKNFSSIVAPLTDCLKKGNYKWDGNQQQSFEEIKRRLTSSPILQLPGFTSPFEVAVDACGTGIGAVF-----------EKLSTSRQSWSTYEQ

Query:  ELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKSISRMHARWISFLQRFDFVIMHQSGKDNKLADALSRKSSLLTILSMEVEAFKHLPNLYKEDVD
        ELYA+ RA+K WEHYL+ +EF+L +DH +LK++ +Q +++RMHARW++F+QRF+F + H+SG+ NK+ADALSRK SLLT L  EV  F+ + +LY  D D
Subjt:  ELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKSISRMHARWISFLQRFDFVIMHQSGKDNKLADALSRKSSLLTILSMEVEAFKHLPNLYKEDVD

Query:  FSQMWTKCNNFIKAEDFHIMEGYLFKGDQ--------------------------QDKTFEIISKRFYWPQLRRDCNSFVNRCPICQRAKGPSTNAGLYS
        F   W KC   +  E  H  +GYLF+G+Q                          +DKT  +  +R+YWPQL+RD  + V RCP CQ +KG + N GLY 
Subjt:  FSQMWTKCNNFIKAEDFHIMEGYLFKGDQ--------------------------QDKTFEIISKRFYWPQLRRDCNSFVNRCPICQRAKGPSTNAGLYS

Query:  PLPTPISIWGDLPIDFVLGLPKTQRQYDSVMVVVDRFSKMTHFVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWNKFDTTLKYS
        PLP P   W DL +DF+LGLP+TQR  DSV VVVDRFSKM HF+ACKKT+DA ++ANLFF+E+VRLHG+P +I SDRD KFLSHFW+TLW +FDTTL +S
Subjt:  PLPTPISIWGDLPIDFVLGLPKTQRQYDSVMVVVDRFSKMTHFVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWNKFDTTLKYS

Query:  TTAHPQIDGQTEVTNRTLGNLIRCLSGSKPKQWDLALAQAEFAFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENIQKLHEEVHS
        +T+HPQ DGQTEV NRTLGNLIRCLSG +PKQWDL LAQAEFA+N+M NRST K PF+VV+ + P+   DL  LP     +  AE M + ++ + EEV  
Subjt:  TTAHPQIDGQTEVTNRTLGNLIRCLSGSKPKQWDLALAQAEFAFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENIQKLHEEVHS

Query:  HLKESTQFYKEAADRKRRQATFTEGDLVMIHLRRNRFPTGTYNKLKDRQLGPFCVLKKIRDNAYKIELPPDLHIHPIFNVVDLKQYYAPDDFYARGRAKP
        +L+ S + YK AAD+KRR   F EGDLVM++LR+ R P GT +KL D++ GP+ +L+KI DNAY+++LP D+ I P FNV DL +Y+ PD+        P
Subjt:  HLKESTQFYKEAADRKRRQATFTEGDLVMIHLRRNRFPTGTYNKLKDRQLGPFCVLKKIRDNAYKIELPPDLHIHPIFNVVDLKQYYAPDDFYARGRAKP

Query:  TNSNNERSNDQASGST
        +N +N R++    G T
Subjt:  TNSNNERSNDQASGST

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein4.2e-10329.4Show/hide
Query:  ITDPRLESLFAEFPHLKNE--PQGLP-PIRDIQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSCAVPALLTPKKDGSWRMCVDS
        + +P L  ++ EF  +  E   + LP PI+ ++ +++L        + +Y + P + + ++D I   L+ G I+ S +  A P +  PKK+G+ RM VD 
Subjt:  ITDPRLESLFAEFPHLKNE--PQGLP-PIRDIQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSCAVPALLTPKKDGSWRMCVDS

Query:  RAINRITVRYRFPIPRIGDLLDQLGKAAIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGC------------------------------SNVV-YFDDI
        + +N+      +P+P I  LL ++  + IF+K+DLKS YH IR+R GDE K  F+   G                               S+VV Y DDI
Subjt:  RAINRITVRYRFPIPRIGDLLDQLGKAAIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGC------------------------------SNVV-YFDDI

Query:  LVYNKNSEDHIQHLKKLFQVLTETELYINPKKCTFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFYRRFIKNFSSIVAPLTD
        L+++K+  +H++H+K + Q L    L IN  KC F   ++ F+G+ I E       +  + +  W    + KEL+ FLG  ++ R+FI   S +  PL +
Subjt:  LVYNKNSEDHIQHLKKLFQVLTETELYINPKKCTFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFYRRFIKNFSSIVAPLTD

Query:  CLKKG-NYKWDGNQQQSFEEIKRRLTSSPILQLPGFTSPFEVAVDACGTGIGAVFE----------------KLSTSRQSWSTYEQELYALVRALKQWEH
         LKK   +KW   Q Q+ E IK+ L S P+L+   F+    +  DA    +GAV                  K+S ++ ++S  ++E+ A++++LK W H
Subjt:  CLKKG-NYKWDGNQQQSFEEIKRRLTSSPILQLPGFTSPFEVAVDACGTGIGAVFE----------------KLSTSRQSWSTYEQELYALVRALKQWEH

Query:  YLLS--KEFVLLTDHFSL--KYLQSQKSISRMHARWISFLQRFDFVIMHQSGKDNKLADALSR-------------KSSLLTILSMEV-EAFKHLPNLYK
        YL S  + F +LTDH +L  +     +  ++  ARW  FLQ F+F I ++ G  N +ADALSR              +S+  +  + + + FK+   +  
Subjt:  YLLS--KEFVLLTDHFSL--KYLQSQKSISRMHARWISFLQRFDFVIMHQSGKDNKLADALSR-------------KSSLLTILSMEV-EAFKHLPNLYK

Query:  EDVDFSQMWTKCNNFIK--AEDFHIMEGYLFKGDQQ----------------------------DKTFEIISKRFYWPQLRRDCNSFVNRCPICQRAKGP
        E  + +++    NN  K   E+  + +G L     Q                            +    II +RF W  +R+    +V  C  CQ  K  
Subjt:  EDVDFSQMWTKCNNFIK--AEDFHIMEGYLFKGDQQ----------------------------DKTFEIISKRFYWPQLRRDCNSFVNRCPICQRAKGP

Query:  STNA-GLYSPLPTPISIWGDLPIDFVLGLPKTQRQYDSVMVVVDRFSKMTHFVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWN
        +    G   P+P     W  L +DF+  LP++   Y+++ VVVDRFSKM   V C K+  A   A +F + ++   G P  I++D D  F S  WK   +
Subjt:  STNA-GLYSPLPTPISIWGDLPIDFVLGLPKTQRQYDSVMVVVDRFSKMTHFVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWN

Query:  KFDTTLKYSTTAHPQIDGQTEVTNRTLGNLIRCLSGSKPKQWDLALAQAEFAFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENI
        K++  +K+S    PQ DGQTE TN+T+  L+RC+  + P  W   ++  + ++NN  + +T   PFE+V    P L+     LP+  D + E  +  E I
Subjt:  KFDTTLKYSTTAHPQIDGQTEVTNRTLGNLIRCLSGSKPKQWDLALAQAEFAFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENI

Query:  QKLHEEVHSHLKESTQFYKEAADRKRRQ-ATFTEGDLVMIHLRRNRFPTGTYNKLKDRQLGPFCVLKKIRDNAYKIELPPDLH--IHPIFNVVDLKQY
        Q + + V  HL  +    K+  D K ++   F  GDLVM+   +  F     NKL     GPF VL+K   N Y+++LP  +       F+V  L++Y
Subjt:  QKLHEEVHSHLKESTQFYKEAADRKRRQ-ATFTEGDLVMIHLRRNRFPTGTYNKLKDRQLGPFCVLKKIRDNAYKIELPPDLH--IHPIFNVVDLKQY

P0CT35 Transposon Tf2-2 polyprotein4.2e-10329.4Show/hide
Query:  ITDPRLESLFAEFPHLKNE--PQGLP-PIRDIQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSCAVPALLTPKKDGSWRMCVDS
        + +P L  ++ EF  +  E   + LP PI+ ++ +++L        + +Y + P + + ++D I   L+ G I+ S +  A P +  PKK+G+ RM VD 
Subjt:  ITDPRLESLFAEFPHLKNE--PQGLP-PIRDIQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSCAVPALLTPKKDGSWRMCVDS

Query:  RAINRITVRYRFPIPRIGDLLDQLGKAAIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGC------------------------------SNVV-YFDDI
        + +N+      +P+P I  LL ++  + IF+K+DLKS YH IR+R GDE K  F+   G                               S+VV Y DDI
Subjt:  RAINRITVRYRFPIPRIGDLLDQLGKAAIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGC------------------------------SNVV-YFDDI

Query:  LVYNKNSEDHIQHLKKLFQVLTETELYINPKKCTFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFYRRFIKNFSSIVAPLTD
        L+++K+  +H++H+K + Q L    L IN  KC F   ++ F+G+ I E       +  + +  W    + KEL+ FLG  ++ R+FI   S +  PL +
Subjt:  LVYNKNSEDHIQHLKKLFQVLTETELYINPKKCTFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFYRRFIKNFSSIVAPLTD

Query:  CLKKG-NYKWDGNQQQSFEEIKRRLTSSPILQLPGFTSPFEVAVDACGTGIGAVFE----------------KLSTSRQSWSTYEQELYALVRALKQWEH
         LKK   +KW   Q Q+ E IK+ L S P+L+   F+    +  DA    +GAV                  K+S ++ ++S  ++E+ A++++LK W H
Subjt:  CLKKG-NYKWDGNQQQSFEEIKRRLTSSPILQLPGFTSPFEVAVDACGTGIGAVFE----------------KLSTSRQSWSTYEQELYALVRALKQWEH

Query:  YLLS--KEFVLLTDHFSL--KYLQSQKSISRMHARWISFLQRFDFVIMHQSGKDNKLADALSR-------------KSSLLTILSMEV-EAFKHLPNLYK
        YL S  + F +LTDH +L  +     +  ++  ARW  FLQ F+F I ++ G  N +ADALSR              +S+  +  + + + FK+   +  
Subjt:  YLLS--KEFVLLTDHFSL--KYLQSQKSISRMHARWISFLQRFDFVIMHQSGKDNKLADALSR-------------KSSLLTILSMEV-EAFKHLPNLYK

Query:  EDVDFSQMWTKCNNFIK--AEDFHIMEGYLFKGDQQ----------------------------DKTFEIISKRFYWPQLRRDCNSFVNRCPICQRAKGP
        E  + +++    NN  K   E+  + +G L     Q                            +    II +RF W  +R+    +V  C  CQ  K  
Subjt:  EDVDFSQMWTKCNNFIK--AEDFHIMEGYLFKGDQQ----------------------------DKTFEIISKRFYWPQLRRDCNSFVNRCPICQRAKGP

Query:  STNA-GLYSPLPTPISIWGDLPIDFVLGLPKTQRQYDSVMVVVDRFSKMTHFVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWN
        +    G   P+P     W  L +DF+  LP++   Y+++ VVVDRFSKM   V C K+  A   A +F + ++   G P  I++D D  F S  WK   +
Subjt:  STNA-GLYSPLPTPISIWGDLPIDFVLGLPKTQRQYDSVMVVVDRFSKMTHFVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWN

Query:  KFDTTLKYSTTAHPQIDGQTEVTNRTLGNLIRCLSGSKPKQWDLALAQAEFAFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENI
        K++  +K+S    PQ DGQTE TN+T+  L+RC+  + P  W   ++  + ++NN  + +T   PFE+V    P L+     LP+  D + E  +  E I
Subjt:  KFDTTLKYSTTAHPQIDGQTEVTNRTLGNLIRCLSGSKPKQWDLALAQAEFAFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENI

Query:  QKLHEEVHSHLKESTQFYKEAADRKRRQ-ATFTEGDLVMIHLRRNRFPTGTYNKLKDRQLGPFCVLKKIRDNAYKIELPPDLH--IHPIFNVVDLKQY
        Q + + V  HL  +    K+  D K ++   F  GDLVM+   +  F     NKL     GPF VL+K   N Y+++LP  +       F+V  L++Y
Subjt:  QKLHEEVHSHLKESTQFYKEAADRKRRQ-ATFTEGDLVMIHLRRNRFPTGTYNKLKDRQLGPFCVLKKIRDNAYKIELPPDLH--IHPIFNVVDLKQY

P0CT41 Transposon Tf2-12 polyprotein4.2e-10329.4Show/hide
Query:  ITDPRLESLFAEFPHLKNE--PQGLP-PIRDIQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSCAVPALLTPKKDGSWRMCVDS
        + +P L  ++ EF  +  E   + LP PI+ ++ +++L        + +Y + P + + ++D I   L+ G I+ S +  A P +  PKK+G+ RM VD 
Subjt:  ITDPRLESLFAEFPHLKNE--PQGLP-PIRDIQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSCAVPALLTPKKDGSWRMCVDS

Query:  RAINRITVRYRFPIPRIGDLLDQLGKAAIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGC------------------------------SNVV-YFDDI
        + +N+      +P+P I  LL ++  + IF+K+DLKS YH IR+R GDE K  F+   G                               S+VV Y DDI
Subjt:  RAINRITVRYRFPIPRIGDLLDQLGKAAIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGC------------------------------SNVV-YFDDI

Query:  LVYNKNSEDHIQHLKKLFQVLTETELYINPKKCTFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFYRRFIKNFSSIVAPLTD
        L+++K+  +H++H+K + Q L    L IN  KC F   ++ F+G+ I E       +  + +  W    + KEL+ FLG  ++ R+FI   S +  PL +
Subjt:  LVYNKNSEDHIQHLKKLFQVLTETELYINPKKCTFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFYRRFIKNFSSIVAPLTD

Query:  CLKKG-NYKWDGNQQQSFEEIKRRLTSSPILQLPGFTSPFEVAVDACGTGIGAVFE----------------KLSTSRQSWSTYEQELYALVRALKQWEH
         LKK   +KW   Q Q+ E IK+ L S P+L+   F+    +  DA    +GAV                  K+S ++ ++S  ++E+ A++++LK W H
Subjt:  CLKKG-NYKWDGNQQQSFEEIKRRLTSSPILQLPGFTSPFEVAVDACGTGIGAVFE----------------KLSTSRQSWSTYEQELYALVRALKQWEH

Query:  YLLS--KEFVLLTDHFSL--KYLQSQKSISRMHARWISFLQRFDFVIMHQSGKDNKLADALSR-------------KSSLLTILSMEV-EAFKHLPNLYK
        YL S  + F +LTDH +L  +     +  ++  ARW  FLQ F+F I ++ G  N +ADALSR              +S+  +  + + + FK+   +  
Subjt:  YLLS--KEFVLLTDHFSL--KYLQSQKSISRMHARWISFLQRFDFVIMHQSGKDNKLADALSR-------------KSSLLTILSMEV-EAFKHLPNLYK

Query:  EDVDFSQMWTKCNNFIK--AEDFHIMEGYLFKGDQQ----------------------------DKTFEIISKRFYWPQLRRDCNSFVNRCPICQRAKGP
        E  + +++    NN  K   E+  + +G L     Q                            +    II +RF W  +R+    +V  C  CQ  K  
Subjt:  EDVDFSQMWTKCNNFIK--AEDFHIMEGYLFKGDQQ----------------------------DKTFEIISKRFYWPQLRRDCNSFVNRCPICQRAKGP

Query:  STNA-GLYSPLPTPISIWGDLPIDFVLGLPKTQRQYDSVMVVVDRFSKMTHFVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWN
        +    G   P+P     W  L +DF+  LP++   Y+++ VVVDRFSKM   V C K+  A   A +F + ++   G P  I++D D  F S  WK   +
Subjt:  STNA-GLYSPLPTPISIWGDLPIDFVLGLPKTQRQYDSVMVVVDRFSKMTHFVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWN

Query:  KFDTTLKYSTTAHPQIDGQTEVTNRTLGNLIRCLSGSKPKQWDLALAQAEFAFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENI
        K++  +K+S    PQ DGQTE TN+T+  L+RC+  + P  W   ++  + ++NN  + +T   PFE+V    P L+     LP+  D + E  +  E I
Subjt:  KFDTTLKYSTTAHPQIDGQTEVTNRTLGNLIRCLSGSKPKQWDLALAQAEFAFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENI

Query:  QKLHEEVHSHLKESTQFYKEAADRKRRQ-ATFTEGDLVMIHLRRNRFPTGTYNKLKDRQLGPFCVLKKIRDNAYKIELPPDLH--IHPIFNVVDLKQY
        Q + + V  HL  +    K+  D K ++   F  GDLVM+   +  F     NKL     GPF VL+K   N Y+++LP  +       F+V  L++Y
Subjt:  QKLHEEVHSHLKESTQFYKEAADRKRRQ-ATFTEGDLVMIHLRRNRFPTGTYNKLKDRQLGPFCVLKKIRDNAYKIELPPDLH--IHPIFNVVDLKQY

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.7e-11731.4Show/hide
Query:  IQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSCAVPALLTPKKDGSWRMCVDSRAINRITVRYRFPIPRIGDLLDQLGKAAIFS
        ++H I++  GA LP L  Y ++ +  + ++  ++ LL    I PS S C+ P +L PKKDG++R+CVD R +N+ T+   FP+PRI +LL ++G A IF+
Subjt:  IQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSCAVPALLTPKKDGSWRMCVDSRAINRITVRYRFPIPRIGDLLDQLGKAAIFS

Query:  KIDLKSGYHQIRIRPGDEWKTTFKTNEGCSNV-----------------------------VYFDDILVYNKNSEDHIQHLKKLFQVLTETELYINPKKC
         +DL SGYHQI + P D +KT F T  G                                 VY DDIL+++++ E+H +HL  + + L    L +  KKC
Subjt:  KIDLKSGYHQIRIRPGDEWKTTFKTNEGCSNV-----------------------------VYFDDILVYNKNSEDHIQHLKKLFQVLTETELYINPKKC

Query:  TFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFYRRFIKNFSSIVAP--LTDCLKKGNYKWDGNQQQSFEEIKRRLTSSPILQ
         F   E  FLG+ I   K+     K  AI+ +P   ++K+ Q FLG+ ++YRRFI N S I  P  L  C K    +W   Q ++ E++K  L +SP+L 
Subjt:  TFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFYRRFIKNFSSIVAP--LTDCLKKGNYKWDGNQQQSFEEIKRRLTSSPILQ

Query:  LPGFTSPFEVAVDACGTGIGAVFEK-----------------LSTSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKSISRMHAR
             + + +  DA   GIGAV E+                 L ++++++   E EL  +++AL  + + L  K F L TDH SL  LQ++   +R   R
Subjt:  LPGFTSPFEVAVDACGTGIGAVFEK-----------------LSTSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKSISRMHAR

Query:  WISFLQRFDFVIMHQSGKDNKLADALSRKSSLL---TILSMEVEAFK--------------HLPNL---------------YKEDVDFSQMWTKCNNFIK
        W+  L  +DF + + +G  N +ADA+SR    +   T   ++ E++K              H+  L               Y++ ++ S+ + K N  ++
Subjt:  WISFLQRFDFVIMHQSGKDNKLADALSRKSSLL---TILSMEVEAFK--------------HLPNL---------------YKEDVDFSQMWTKCNNFIK

Query:  AEDFHIMEGYLFKGDQQDKTFEI-------------------ISKRFYWPQLRRDCNSFVNRCPICQRAKGPSTNA-GLYSPLPTPISIWGDLPIDFVLG
         E  +  +  +    QQ+    +                   IS  +YWP+L+     ++  C  CQ  K       GL  PLP     W D+ +DFV G
Subjt:  AEDFHIMEGYLFKGDQQDKTFEI-------------------ISKRFYWPQLRRDCNSFVNRCPICQRAKGPSTNA-GLYSPLPTPISIWGDLPIDFVLG

Query:  LPKTQRQYDSVMVVVDRFSKMTHFVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWNKFDTTLKYSTTAHPQIDGQTEVTNRTLG
        LP T    + ++VVVDRFSK  HF+A +KT DAT + +L FR I   HG P TI SDRD +  +  ++ L  +       S+  HPQ DGQ+E T +TL 
Subjt:  LPKTQRQYDSVMVVVDRFSKMTHFVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWNKFDTTLKYSTTAHPQIDGQTEVTNRTLG

Query:  NLIRCLSGSKPKQWDLALAQAEFAFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENIQKLHEEVHSHLKESTQFYKEAAD-----
         L+R    +  + W + L Q EF +N+   R+  K PFE+          DL  LP T    S+ E    +   +  E+  HLK  T   KE  +     
Subjt:  NLIRCLSGSKPKQWDLALAQAEFAFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENIQKLHEEVHSHLKESTQFYKEAAD-----

Query:  ------RKRRQATFTEGDLVMIHLRRNRFPTGTYNKLKDRQLGPFCVLKKIRDNAYKIELPPDLHIHPIFNVVDLKQYY
              ++R+      GD V++H R   F  G Y K++   +GPF V+KKI DNAY+++L      H + NV  LK  Y
Subjt:  ------RKRRQATFTEGDLVMIHLRRNRFPTGTYNKLKDRQLGPFCVLKKIRDNAYKIELPPDLHIHPIFNVVDLKQYY

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.3e-11730.71Show/hide
Query:  IQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSCAVPALLTPKKDGSWRMCVDSRAINRITVRYRFPIPRIGDLLDQLGKAAIFS
        ++H I++  GA LP L  Y ++ +  + ++  ++ LL    I PS S C+ P +L PKKDG++R+CVD R +N+ T+   FP+PRI +LL ++G A IF+
Subjt:  IQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSCAVPALLTPKKDGSWRMCVDSRAINRITVRYRFPIPRIGDLLDQLGKAAIFS

Query:  KIDLKSGYHQIRIRPGDEWKTTFKTNEGCSNV-----------------------------VYFDDILVYNKNSEDHIQHLKKLFQVLTETELYINPKKC
         +DL SGYHQI + P D +KT F T  G                                 VY DDIL+++++ E+H +HL  + + L    L +  KKC
Subjt:  KIDLKSGYHQIRIRPGDEWKTTFKTNEGCSNV-----------------------------VYFDDILVYNKNSEDHIQHLKKLFQVLTETELYINPKKC

Query:  TFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFYRRFIKNFSSIVAP--LTDCLKKGNYKWDGNQQQSFEEIKRRLTSSPILQ
         F   E  FLG+ I   K+     K  AI+ +P   ++K+ Q FLG+ ++YRRFI N S I  P  L  C K    +W   Q ++ +++K  L +SP+L 
Subjt:  TFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFYRRFIKNFSSIVAP--LTDCLKKGNYKWDGNQQQSFEEIKRRLTSSPILQ

Query:  LPGFTSPFEVAVDACGTGIGAVFEK-----------------LSTSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKSISRMHAR
             + + +  DA   GIGAV E+                 L ++++++   E EL  +++AL  + + L  K F L TDH SL  LQ++   +R   R
Subjt:  LPGFTSPFEVAVDACGTGIGAVFEK-----------------LSTSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKSISRMHAR

Query:  WISFLQRFDFVIMHQSGKDNKLADALSRKSSLL---TILSMEVEAFK--------------HLPNL---------------YKEDVDFSQMWTKCNNFIK
        W+  L  +DF + + +G  N +ADA+SR    +   T   ++ E++K              H+  L               Y++ ++ S+ + K N  ++
Subjt:  WISFLQRFDFVIMHQSGKDNKLADALSRKSSLL---TILSMEVEAFK--------------HLPNL---------------YKEDVDFSQMWTKCNNFIK

Query:  AEDFHIMEGYLFKGDQQDKTFEI-------------------ISKRFYWPQLRRDCNSFVNRCPICQRAKGPSTNA-GLYSPLPTPISIWGDLPIDFVLG
         E  +  +  +    QQ+    +                   IS  +YWP+L+     ++  C  CQ  K       GL  PLP     W D+ +DFV G
Subjt:  AEDFHIMEGYLFKGDQQDKTFEI-------------------ISKRFYWPQLRRDCNSFVNRCPICQRAKGPSTNA-GLYSPLPTPISIWGDLPIDFVLG

Query:  LPKTQRQYDSVMVVVDRFSKMTHFVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWNKFDTTLKYSTTAHPQIDGQTEVTNRTLG
        LP T    + ++VVVDRFSK  HF+A +KT DAT + +L FR I   HG P TI SDRD +  +  ++ L  +       S+  HPQ DGQ+E T +TL 
Subjt:  LPKTQRQYDSVMVVVDRFSKMTHFVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWNKFDTTLKYSTTAHPQIDGQTEVTNRTLG

Query:  NLIRCLSGSKPKQWDLALAQAEFAFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENIQKLHEEVHSHLKESTQFYKEAAD-----
         L+R  + +  + W + L Q EF +N+   R+  K PFE+          DL  LP T    S+ E    +   +  E+  HLK  T   KE  +     
Subjt:  NLIRCLSGSKPKQWDLALAQAEFAFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENIQKLHEEVHSHLKESTQFYKEAAD-----

Query:  ------RKRRQATFTEGDLVMIHLRRNRFPTGTYNKLKDRQLGPFCVLKKIRDNAYKIELPPDLHIHPIFNVVDLKQYYAPDDFYARGRAKPTNSNNERS
              ++R+      GD V++H R   F  G Y K++   +GPF V+KKI DNAY+++L      H + NV  LK++    D Y + +   +    +R+
Subjt:  ------RKRRQATFTEGDLVMIHLRRNRFPTGTYNKLKDRQLGPFCVLKKIRDNAYKIELPPDLHIHPIFNVVDLKQYYAPDDFYARGRAKPTNSNNERS

Query:  ND
        ++
Subjt:  ND

Arabidopsis top hitse value%identityAlignment
AT1G21580.1 Zinc finger C-x8-C-x5-C-x3-H type family protein7.6e-9257.09Show/hide
Query:  KKFYIPARLVIGNDEYVKIGKGNQLVRNPKRRARILASEKIRWSLHTARQRLAKKRMYCQFFTRFGKCNKDGGKCPYIHDTSKIAVCTKFLNGLCSNASC
        K+ +IP RLVIGN+EYV+ G GNQLVR+PK+R R+LA+EK+RWSLH AR RLAKK+ YCQFFTRFGKCNKD GKCPY+HD SKIAVCTKFLNGLC+NA+C
Subjt:  KKFYIPARLVIGNDEYVKIGKGNQLVRNPKRRARILASEKIRWSLHTARQRLAKKRMYCQFFTRFGKCNKDGGKCPYIHDTSKIAVCTKFLNGLCSNASC

Query:  KLTHKVIPERMPDCSYFLQGLCSSKNCAYRHVNVNSKVPTCEAFLRGYCALGNECRKKHSYVCPLLEATGTCPDRSTCKLHHPKRQTKGRKRKRLE--GR
        KLTHKVIPERMPDCSY+LQGLC+++ C YRHV+VN   P C+ FL+GYC+ G+ECRKKHSY CP+ EATG+C     CKLHHPK Q+KGRKRKR     +
Subjt:  KLTHKVIPERMPDCSYFLQGLCSSKNCAYRHVNVNSKVPTCEAFLRGYCALGNECRKKHSYVCPLLEATGTCPDRSTCKLHHPKRQTKGRKRKRLE--GR

Query:  NNDQGRYFGSTNQDVSRSRLVVSEKQLPVKSSDPFLEDLTDYISLDVGSDEDIEESRDSTSQTTSFSQGYLSELLLEDPDELIKPIRVM
         N + RYF S +  +S S  +V  ++     S+ F  +  D+I+L     E  +++  +T Q+ S     L  +       LI P+ +M
Subjt:  NNDQGRYFGSTNQDVSRSRLVVSEKQLPVKSSDPFLEDLTDYISLDVGSDEDIEESRDSTSQTTSFSQGYLSELLLEDPDELIKPIRVM

ATMG00860.1 DNA/RNA polymerases superfamily protein8.3e-2238.52Show/hide
Query:  IQHLKKLFQVLTETELYINPKKCTFLIREIVFLG--FLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFYRRFIKNFSSIVAPLTDCLKKGNYK
        + HL  + Q+  + + Y N KKC F   +I +LG   +I    V  +P K EA+  WP   +  EL+ FLGL  +YRRF+KN+  IV PLT+ LKK + K
Subjt:  IQHLKKLFQVLTETELYINPKKCTFLIREIVFLG--FLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFYRRFIKNFSSIVAPLTDCLKKGNYK

Query:  WDGNQQQSFEEIKRRLTSSPILQLPGFTSPFEVAV
        W      +F+ +K  +T+ P+L LP    PF   V
Subjt:  WDGNQQQSFEEIKRRLTSSPILQLPGFTSPFEVAV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGAGAGAAAGGGAGCAAGATCTATTGGGGCTTGTAGTGGCTGGCAAACCCCAGGAGAAAGAATGTGAGATTACCGACCCTAGATTGGAGAGCTTGTTCGCGGAGTT
CCCCCACCTGAAGAATGAGCCACAAGGTCTACCACCGATTCGTGACATCCAGCATCAGATTGACCTCATACTAGGAGCATCACTTCCTAACTTAGCACACTACAGGATGA
GCCCAGAAGAGTATAAAATCCTGCATGATCACATAGAAGACTTGCTGAGAAAGGGTCATATCAAGCCAAGCCTTAGCTCATGCGCCGTGCCTGCACTACTCACACCTAAG
AAGGATGGAAGCTGGAGAATGTGTGTAGACAGCAGAGCTATCAACCGAATTACTGTGAGGTATCGGTTCCCCATTCCTCGAATTGGAGACCTGCTGGATCAGCTAGGCAA
GGCTGCTATCTTTTCAAAGATTGATCTAAAAAGCGGCTATCACCAAATAAGGATCAGACCAGGGGACGAATGGAAGACAACCTTCAAGACAAATGAAGGCTGTTCGAATG
TGGTTTACTTTGACGATATCCTCGTATACAACAAAAACAGTGAGGACCATATTCAACATCTGAAGAAACTGTTTCAAGTCTTAACAGAAACAGAGTTGTATATCAATCCA
AAGAAATGCACATTCCTCATTAGGGAAATTGTCTTTCTCGGCTTCTTAATCAAAGAAGGGAAGGTAGGCATGGAACCAAAAAAGACAGAAGCCATACAGTCTTGGCCAGT
ACAAACTTCAATCAAAGAACTACAAGCTTTCCTTGGCTTGGCATCCTTCTACCGAAGATTCATCAAAAACTTTAGCTCAATAGTGGCCCCTTTAACTGACTGCCTAAAGA
AAGGAAACTATAAATGGGACGGAAATCAACAACAGAGCTTCGAAGAGATTAAAAGAAGACTAACTTCCAGCCCTATTCTACAATTGCCAGGTTTCACATCACCGTTTGAA
GTGGCTGTTGATGCTTGCGGAACTGGAATTGGAGCTGTATTTGAGAAACTAAGCACATCAAGACAGTCCTGGAGCACATATGAACAAGAGCTTTATGCCCTTGTTCGAGC
ACTCAAACAGTGGGAACATTACCTCCTAAGCAAAGAATTTGTACTTCTAACAGACCACTTTTCCTTAAAGTATCTACAGTCACAAAAGAGCATCAGTCGAATGCATGCTC
GATGGATCTCCTTCCTACAAAGATTTGATTTTGTCATCATGCACCAAAGTGGTAAAGATAACAAGTTGGCAGACGCCCTATCCAGAAAAAGCTCCTTGCTTACTATCTTA
TCAATGGAGGTCGAGGCATTTAAACATCTACCTAACCTATATAAGGAAGATGTTGATTTCTCCCAAATGTGGACTAAATGCAACAACTTTATCAAGGCCGAAGATTTTCA
CATAATGGAAGGTTATCTATTCAAAGGAGATCAGCAAGACAAAACCTTCGAAATAATCTCTAAGAGATTCTATTGGCCTCAACTAAGAAGAGACTGTAACAGTTTTGTCA
ACAGATGCCCTATATGTCAAAGAGCTAAGGGCCCAAGTACAAACGCAGGCCTATATTCACCACTACCTACCCCTATTTCCATTTGGGGAGATCTCCCAATTGATTTCGTG
CTCGGACTGCCCAAGACCCAAAGACAGTATGACTCAGTCATGGTTGTAGTTGACAGATTCAGCAAGATGACACACTTTGTAGCCTGCAAAAAGACAAATGACGCCACCTA
CATAGCTAATCTCTTCTTCCGAGAGATAGTACGGCTGCATGGAATACCAAATACCATTGTTTCTGATCGGGATGACAAATTCCTAAGCCATTTCTGGAAGACATTATGGA
ACAAATTTGACACAACCTTGAAATATAGCACAACAGCACACCCTCAAATAGATGGCCAAACAGAGGTTACAAACAGGACCCTTGGCAACTTAATACGCTGTCTCAGTGGA
TCCAAACCAAAGCAGTGGGACTTAGCTCTTGCTCAGGCAGAATTCGCCTTCAACAATATGAAGAATAGATCAACCAGCAAATGTCCATTTGAGGTTGTATTCACGAAACA
ACCAAGGCTAACCTTTGACCTAGCATCACTCCCCGCAACTATGGACACTAGCTCAGAAGCAGAAAAGATGGTAGAAAATATTCAGAAATTACACGAAGAGGTCCACAGTC
ACCTAAAAGAATCAACTCAGTTCTACAAAGAGGCAGCAGACAGAAAGAGAAGACAAGCTACCTTTACTGAGGGAGATTTAGTAATGATTCACCTTAGAAGAAACCGATTC
CCAACTGGAACATACAACAAACTGAAAGACAGACAATTAGGACCATTTTGTGTCCTAAAGAAGATCAGAGATAACGCTTACAAAATAGAACTACCACCAGACTTACACAT
TCATCCTATTTTTAATGTAGTAGACCTAAAACAGTACTATGCCCCAGATGATTTCTATGCTAGGGGGAGGGCCAAGCCCACTAATTCTAACAATGAGCGATCCAATGACC
AAGCATCCGGTTCTACTACTTTCCAATCAAGAACGAGTGCTAAGAAATTTTATATTCCAGCGAGACTAGTGATTGGCAATGATGAGTATGTTAAGATTGGAAAAGGCAAT
CAATTGGTCCGAAATCCAAAAAGAAGAGCACGCATACTGGCAAGTGAGAAAATTCGATGGAGTTTGCACACTGCAAGACAGCGGCTGGCTAAGAAGCGGATGTACTGTCA
ATTTTTCACAAGATTTGGTAAATGTAACAAAGATGGTGGCAAGTGCCCTTATATTCACGACACTTCCAAGATTGCAGTCTGCACAAAATTTCTCAACGGTTTATGTTCTA
ACGCAAGCTGCAAATTGACTCATAAGGTGATTCCAGAAAGGATGCCTGATTGTTCATACTTTTTACAGGGTTTATGCAGCAGCAAAAATTGTGCTTATAGACATGTAAAT
GTGAACTCAAAGGTTCCTACTTGCGAGGCTTTCCTTAGGGGCTATTGTGCTCTGGGAAACGAGTGCCGTAAGAAGCACAGTTATGTGTGCCCCTTGTTAGAAGCAACAGG
AACATGCCCGGATAGATCAACATGCAAACTTCACCATCCTAAACGACAAACTAAAGGAAGGAAAAGGAAGCGATTGGAAGGGAGGAATAATGATCAAGGACGCTACTTCG
GTTCTACGAATCAGGATGTTTCTAGATCTAGATTGGTGGTGAGTGAGAAGCAGCTTCCTGTTAAATCAAGTGACCCTTTTCTTGAAGATCTGACAGATTATATCAGCCTT
GATGTCGGCAGTGATGAAGATATTGAAGAAAGTCGTGACTCGACAAGCCAGACTACGTCCTTTAGTCAAGGTTACCTCTCTGAGTTATTGTTAGAAGATCCCGACGAGCT
AATCAAACCAATTCGGGTAATGAATGAGAATTTGACTGTGCAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTGAGAGAAAGGGAGCAAGATCTATTGGGGCTTGTAGTGGCTGGCAAACCCCAGGAGAAAGAATGTGAGATTACCGACCCTAGATTGGAGAGCTTGTTCGCGGAGTT
CCCCCACCTGAAGAATGAGCCACAAGGTCTACCACCGATTCGTGACATCCAGCATCAGATTGACCTCATACTAGGAGCATCACTTCCTAACTTAGCACACTACAGGATGA
GCCCAGAAGAGTATAAAATCCTGCATGATCACATAGAAGACTTGCTGAGAAAGGGTCATATCAAGCCAAGCCTTAGCTCATGCGCCGTGCCTGCACTACTCACACCTAAG
AAGGATGGAAGCTGGAGAATGTGTGTAGACAGCAGAGCTATCAACCGAATTACTGTGAGGTATCGGTTCCCCATTCCTCGAATTGGAGACCTGCTGGATCAGCTAGGCAA
GGCTGCTATCTTTTCAAAGATTGATCTAAAAAGCGGCTATCACCAAATAAGGATCAGACCAGGGGACGAATGGAAGACAACCTTCAAGACAAATGAAGGCTGTTCGAATG
TGGTTTACTTTGACGATATCCTCGTATACAACAAAAACAGTGAGGACCATATTCAACATCTGAAGAAACTGTTTCAAGTCTTAACAGAAACAGAGTTGTATATCAATCCA
AAGAAATGCACATTCCTCATTAGGGAAATTGTCTTTCTCGGCTTCTTAATCAAAGAAGGGAAGGTAGGCATGGAACCAAAAAAGACAGAAGCCATACAGTCTTGGCCAGT
ACAAACTTCAATCAAAGAACTACAAGCTTTCCTTGGCTTGGCATCCTTCTACCGAAGATTCATCAAAAACTTTAGCTCAATAGTGGCCCCTTTAACTGACTGCCTAAAGA
AAGGAAACTATAAATGGGACGGAAATCAACAACAGAGCTTCGAAGAGATTAAAAGAAGACTAACTTCCAGCCCTATTCTACAATTGCCAGGTTTCACATCACCGTTTGAA
GTGGCTGTTGATGCTTGCGGAACTGGAATTGGAGCTGTATTTGAGAAACTAAGCACATCAAGACAGTCCTGGAGCACATATGAACAAGAGCTTTATGCCCTTGTTCGAGC
ACTCAAACAGTGGGAACATTACCTCCTAAGCAAAGAATTTGTACTTCTAACAGACCACTTTTCCTTAAAGTATCTACAGTCACAAAAGAGCATCAGTCGAATGCATGCTC
GATGGATCTCCTTCCTACAAAGATTTGATTTTGTCATCATGCACCAAAGTGGTAAAGATAACAAGTTGGCAGACGCCCTATCCAGAAAAAGCTCCTTGCTTACTATCTTA
TCAATGGAGGTCGAGGCATTTAAACATCTACCTAACCTATATAAGGAAGATGTTGATTTCTCCCAAATGTGGACTAAATGCAACAACTTTATCAAGGCCGAAGATTTTCA
CATAATGGAAGGTTATCTATTCAAAGGAGATCAGCAAGACAAAACCTTCGAAATAATCTCTAAGAGATTCTATTGGCCTCAACTAAGAAGAGACTGTAACAGTTTTGTCA
ACAGATGCCCTATATGTCAAAGAGCTAAGGGCCCAAGTACAAACGCAGGCCTATATTCACCACTACCTACCCCTATTTCCATTTGGGGAGATCTCCCAATTGATTTCGTG
CTCGGACTGCCCAAGACCCAAAGACAGTATGACTCAGTCATGGTTGTAGTTGACAGATTCAGCAAGATGACACACTTTGTAGCCTGCAAAAAGACAAATGACGCCACCTA
CATAGCTAATCTCTTCTTCCGAGAGATAGTACGGCTGCATGGAATACCAAATACCATTGTTTCTGATCGGGATGACAAATTCCTAAGCCATTTCTGGAAGACATTATGGA
ACAAATTTGACACAACCTTGAAATATAGCACAACAGCACACCCTCAAATAGATGGCCAAACAGAGGTTACAAACAGGACCCTTGGCAACTTAATACGCTGTCTCAGTGGA
TCCAAACCAAAGCAGTGGGACTTAGCTCTTGCTCAGGCAGAATTCGCCTTCAACAATATGAAGAATAGATCAACCAGCAAATGTCCATTTGAGGTTGTATTCACGAAACA
ACCAAGGCTAACCTTTGACCTAGCATCACTCCCCGCAACTATGGACACTAGCTCAGAAGCAGAAAAGATGGTAGAAAATATTCAGAAATTACACGAAGAGGTCCACAGTC
ACCTAAAAGAATCAACTCAGTTCTACAAAGAGGCAGCAGACAGAAAGAGAAGACAAGCTACCTTTACTGAGGGAGATTTAGTAATGATTCACCTTAGAAGAAACCGATTC
CCAACTGGAACATACAACAAACTGAAAGACAGACAATTAGGACCATTTTGTGTCCTAAAGAAGATCAGAGATAACGCTTACAAAATAGAACTACCACCAGACTTACACAT
TCATCCTATTTTTAATGTAGTAGACCTAAAACAGTACTATGCCCCAGATGATTTCTATGCTAGGGGGAGGGCCAAGCCCACTAATTCTAACAATGAGCGATCCAATGACC
AAGCATCCGGTTCTACTACTTTCCAATCAAGAACGAGTGCTAAGAAATTTTATATTCCAGCGAGACTAGTGATTGGCAATGATGAGTATGTTAAGATTGGAAAAGGCAAT
CAATTGGTCCGAAATCCAAAAAGAAGAGCACGCATACTGGCAAGTGAGAAAATTCGATGGAGTTTGCACACTGCAAGACAGCGGCTGGCTAAGAAGCGGATGTACTGTCA
ATTTTTCACAAGATTTGGTAAATGTAACAAAGATGGTGGCAAGTGCCCTTATATTCACGACACTTCCAAGATTGCAGTCTGCACAAAATTTCTCAACGGTTTATGTTCTA
ACGCAAGCTGCAAATTGACTCATAAGGTGATTCCAGAAAGGATGCCTGATTGTTCATACTTTTTACAGGGTTTATGCAGCAGCAAAAATTGTGCTTATAGACATGTAAAT
GTGAACTCAAAGGTTCCTACTTGCGAGGCTTTCCTTAGGGGCTATTGTGCTCTGGGAAACGAGTGCCGTAAGAAGCACAGTTATGTGTGCCCCTTGTTAGAAGCAACAGG
AACATGCCCGGATAGATCAACATGCAAACTTCACCATCCTAAACGACAAACTAAAGGAAGGAAAAGGAAGCGATTGGAAGGGAGGAATAATGATCAAGGACGCTACTTCG
GTTCTACGAATCAGGATGTTTCTAGATCTAGATTGGTGGTGAGTGAGAAGCAGCTTCCTGTTAAATCAAGTGACCCTTTTCTTGAAGATCTGACAGATTATATCAGCCTT
GATGTCGGCAGTGATGAAGATATTGAAGAAAGTCGTGACTCGACAAGCCAGACTACGTCCTTTAGTCAAGGTTACCTCTCTGAGTTATTGTTAGAAGATCCCGACGAGCT
AATCAAACCAATTCGGGTAATGAATGAGAATTTGACTGTGCAGTAGTTGGCAAACTGAGTTGCTTGCTCCTTCCTCGTTAGGAGCAAGGTTTGTGTCCGTTCTCTCACCA
TTAAGGTTTAACAGAATCACAGTTTAGTTTAGTTTTAGTTTTAGTTTTAGTTTAGTTTTTTTTTCTTTCTTTGTTTAGTTTAGTTTCAGCTTCTTCTAGTTGCAGAAATC
TGGATAACTCCGTTTCTGCCATTACTGTAAATAGACATTTAAATTTGACAGGAATGAAATGATTTTGTA
Protein sequenceShow/hide protein sequence
MLREREQDLLGLVVAGKPQEKECEITDPRLESLFAEFPHLKNEPQGLPPIRDIQHQIDLILGASLPNLAHYRMSPEEYKILHDHIEDLLRKGHIKPSLSSCAVPALLTPK
KDGSWRMCVDSRAINRITVRYRFPIPRIGDLLDQLGKAAIFSKIDLKSGYHQIRIRPGDEWKTTFKTNEGCSNVVYFDDILVYNKNSEDHIQHLKKLFQVLTETELYINP
KKCTFLIREIVFLGFLIKEGKVGMEPKKTEAIQSWPVQTSIKELQAFLGLASFYRRFIKNFSSIVAPLTDCLKKGNYKWDGNQQQSFEEIKRRLTSSPILQLPGFTSPFE
VAVDACGTGIGAVFEKLSTSRQSWSTYEQELYALVRALKQWEHYLLSKEFVLLTDHFSLKYLQSQKSISRMHARWISFLQRFDFVIMHQSGKDNKLADALSRKSSLLTIL
SMEVEAFKHLPNLYKEDVDFSQMWTKCNNFIKAEDFHIMEGYLFKGDQQDKTFEIISKRFYWPQLRRDCNSFVNRCPICQRAKGPSTNAGLYSPLPTPISIWGDLPIDFV
LGLPKTQRQYDSVMVVVDRFSKMTHFVACKKTNDATYIANLFFREIVRLHGIPNTIVSDRDDKFLSHFWKTLWNKFDTTLKYSTTAHPQIDGQTEVTNRTLGNLIRCLSG
SKPKQWDLALAQAEFAFNNMKNRSTSKCPFEVVFTKQPRLTFDLASLPATMDTSSEAEKMVENIQKLHEEVHSHLKESTQFYKEAADRKRRQATFTEGDLVMIHLRRNRF
PTGTYNKLKDRQLGPFCVLKKIRDNAYKIELPPDLHIHPIFNVVDLKQYYAPDDFYARGRAKPTNSNNERSNDQASGSTTFQSRTSAKKFYIPARLVIGNDEYVKIGKGN
QLVRNPKRRARILASEKIRWSLHTARQRLAKKRMYCQFFTRFGKCNKDGGKCPYIHDTSKIAVCTKFLNGLCSNASCKLTHKVIPERMPDCSYFLQGLCSSKNCAYRHVN
VNSKVPTCEAFLRGYCALGNECRKKHSYVCPLLEATGTCPDRSTCKLHHPKRQTKGRKRKRLEGRNNDQGRYFGSTNQDVSRSRLVVSEKQLPVKSSDPFLEDLTDYISL
DVGSDEDIEESRDSTSQTTSFSQGYLSELLLEDPDELIKPIRVMNENLTVQ