; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G17380 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G17380
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationChr1:13073596..13077003
RNA-Seq ExpressionCSPI01G17380
SyntenyCSPI01G17380
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN76546.1 hypothetical protein VITISV_010420 [Vitis vinifera]9.5e-16736.31Show/hide
Query:  MTIGLSVKNKIGFVDGTIAKP-TVDLL-PTWIRNNNILKTIPCNLGAK--------------------PRY----HCYDGVREMTDFLQMEYLMDFLMGL
        M + L+ KNK+GFVDGTI++P + DL+   W R N+++ +   N  ++                     R+    HC  G+R  TD+   EY++ FLMGL
Subjt:  MTIGLSVKNKIGFVDGTIAKP-TVDLL-PTWIRNNNILKTIPCNLGAK--------------------PRY----HCYDGVREMTDFLQMEYLMDFLMGL

Query:  NENFSQARGQLLLMDPLPSTSRAFSIILQEEQQRSIGSLSSTAPTMVFVVSSNSSKNGTNNRQRRGKPTCTHCNTPGHIATQNSENISQDTTQNSQMVNN
        N++++Q RGQ+L+MDPLP+ ++ FS+++QEE  R++G                 S +G++N                 +   ++ N    + Q +  +  
Subjt:  NENFSQARGQLLLMDPLPSTSRAFSIILQEEQQRSIGSLSSTAPTMVFVVSSNSSKNGTNNRQRRGKPTCTHCNTPGHIATQNSENISQDTTQNSQMVNN

Query:  ISTAEAFI--QCQNLLNQLQSQINASNQIAT---------SHIAGTSYSFPL--WIIDSGASTHISCCKSYF-TSTQPCSTTISLPNKQVFEVKSAGTIK
        I      +  +CQ L+  L +Q+++++  +T         S+ AG         WIIDSGA+ H+    S F +S    +  ++LP      +   G++ 
Subjt:  ISTAEAFI--QCQNLLNQLQSQINASNQIAT---------SHIAGTSYSFPL--WIIDSGASTHISCCKSYF-TSTQPCSTTISLPNKQVFEVKSAGTIK

Query:  LSESIELKNVLYISEFSFNLIPVSALTKDLPVDVSFSTNNCVIQDKFTLKKIGSVELLYGLYVFKLRNASNPQSPIC---VVNSDNAFLWHQRLGHPCVD
        LS+ ++L NVL++  F +NL+ VSA T  L + + F+ + C+IQ+    K IG       LY     +    ++ +    +  S+   LWH RLGHP   
Subjt:  LSESIELKNVLYISEFSFNLIPVSALTKDLPVDVSFSTNNCVIQDKFTLKKIGSVELLYGLYVFKLRNASNPQSPIC---VVNSDNAFLWHQRLGHPCVD

Query:  VLKSLQSMLQL-KSFTSHTCTTCTLAKQRRLSFSANNHVSPNSFDLIHVDIWGPLSTATHVGHTYFLTIVDDATRFTWI------------IPNFFKLIE
         LK LQS+L    SF    C  C LAKQR L + + N    ++FDL+H+DIWGP S  +  G+ +FLTIVDD +R TW+            IP+FF  ++
Subjt:  VLKSLQSMLQL-KSFTSHTCTTCTLAKQRRLSFSANNHVSPNSFDLIHVDIWGPLSTATHVGHTYFLTIVDDATRFTWI------------IPNFFKLIE

Query:  TQHGKTIKHMRSDNAPELMFTEFFKEKGVLR---------------------------------IPLTFWGECILTAVYLINKTPSKMLNWKSPFQLLNN
         Q GK +K +RSDNAPEL  + F+   GV+                                  +P+ +W +CILTAVYLIN+TPS  LN K+PF++L++
Subjt:  TQHGKTIKHMRSDNAPELMFTEFFKEKGVLR---------------------------------IPLTFWGECILTAVYLINKTPSKMLNWKSPFQLLNN

Query:  NKPDYNNLKVFGSLCYASTLPHNRSKFQPRAIPVVFVGYPQGMKAYKLYNIDQQKFFISRDVIFREEVFPFKDSTVSQPMEYPLPGFSLPKVFHESSIDH
          PDY++L+VFG LCY STL  NR+KF PRA   VF+GYP G K YKL +I+ +   ISR+                                       
Subjt:  NKPDYNNLKVFGSLCYASTLPHNRSKFQPRAIPVVFVGYPQGMKAYKLYNIDQQKFFISRDVIFREEVFPFKDSTVSQPMEYPLPGFSLPKVFHESSIDH

Query:  MNPSTNMPSPSFSPNTNNDSTSQTIVQSTNNPTNDSTSSTQQPKQNISETSNNPTVRRSTRIGKQPSYLQAYHCNLLTSQPH-QPSSTKYPLNQYISYQN
         NP +   SP  S +  +D     I    +  ++       QP   ++ +S      R TR+ KQPSYL+ YHC+L+ S  H +  ST +P+  ++SY  
Subjt:  MNPSTNMPSPSFSPNTNNDSTSQTIVQSTNNPTNDSTSSTQQPKQNISETSNNPTVRRSTRIGKQPSYLQAYHCNLLTSQPH-QPSSTKYPLNQYISYQN

Query:  LSHSFKHSVLQASTLHEPNFYHEAVIHDDWRKAMENELKVMETNQTWSIVSLPASKNIVGCRWIYKIKHKADGSVERYKARFVAKGYTQQEGLDYFETFS
        LS S+K   L  S + EP+ + +A    +WR AM+ EL+ +E N+T SIVSLP  K+ VGC+W+YK KHK DG++ERYKAR VAKGYTQ+EG+DY +TFS
Subjt:  LSHSFKHSVLQASTLHEPNFYHEAVIHDDWRKAMENELKVMETNQTWSIVSLPASKNIVGCRWIYKIKHKADGSVERYKARFVAKGYTQQEGLDYFETFS

Query:  PVAKMVTVKTLLTIAVSKDWHLFQLDVNNAFLHGDLFEEVYMDLSLGYK------PNLTI---------------------------QGEKMSKADYSLF
        PVAK+VTVK LL IA  K WHL QLDVNNAFLHGDL EEVYM L  GY       P+  +                            G   S +D+SLF
Subjt:  PVAKMVTVKTLLTIAVSKDWHLFQLDVNNAFLHGDLFEEVYMDLSLGYK------PNLTI---------------------------QGEKMSKADYSLF

Query:  VKGQGTNFIALLVYVDDIIITGAN--------------FLLKDLGSLRFFLGFELAHTSTGILLSQRHYALCLLEEVGLLGAKPA
        +K     FIA+LVYVDD+II   N              F LKDLG +++FLG E+A +STGI +SQR Y L LL + G LG K A
Subjt:  VKGQGTNFIALLVYVDDIIITGAN--------------FLLKDLGSLRFFLGFELAHTSTGILLSQRHYALCLLEEVGLLGAKPA

KAA0068025.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]1.7e-20344.3Show/hide
Query:  GVREMTDFLQMEYLMDFLMGLNENFSQARGQLLLMDPLPSTSRAFSIILQEEQQRSIGSLSSTAPTMVFVVSSNSSKNGTNNRQRRGKPTCTHCNTPGHI
        G  E  +FLQ EYLMDFLMGLN++++Q R QLLLM+P+PS SRAFS++LQEEQQR+I S S                                 NTP   
Subjt:  GVREMTDFLQMEYLMDFLMGLNENFSQARGQLLLMDPLPSTSRAFSIILQEEQQRSIGSLSSTAPTMVFVVSSNSSKNGTNNRQRRGKPTCTHCNTPGHI

Query:  ATQNSENISQDTTQNSQMVNNISTAEAFIQCQNLLNQLQSQINASNQIATSHIAGTSYSFPLWIIDSGASTHISCCKSYFTSTQPCSTTISLPNKQVFEV
                                                                                                            
Subjt:  ATQNSENISQDTTQNSQMVNNISTAEAFIQCQNLLNQLQSQINASNQIATSHIAGTSYSFPLWIIDSGASTHISCCKSYFTSTQPCSTTISLPNKQVFEV

Query:  KSAGTIKLSESIELKNVLYISEFSFNLIPVSALTKDLPVDVSFSTNNCVIQDKFTLKKIGSVELLYGLYVFKLRNASNPQSPICVVNSDNAFLWHQRLGH
                               +F+L+                       DK+T KKIGS +L  GLY+    +       IC V ++ A LWHQR+GH
Subjt:  KSAGTIKLSESIELKNVLYISEFSFNLIPVSALTKDLPVDVSFSTNNCVIQDKFTLKKIGSVELLYGLYVFKLRNASNPQSPICVVNSDNAFLWHQRLGH

Query:  PCVDVLKSLQSMLQLKSFT----SHTCTTCTLAKQRRLSFSANNHVSPNSFDLIHVDIWGPLSTATHVGHTYFLTIVDDATRFTW------------IIP
        P  + L SLQ  L LK  T    SH CT C LAKQR+LSF++NNH+S N+FDLIHVDIWGP ST T+  ++YFLTIVDDATR+TW            IIP
Subjt:  PCVDVLKSLQSMLQLKSFT----SHTCTTCTLAKQRRLSFSANNHVSPNSFDLIHVDIWGPLSTATHVGHTYFLTIVDDATRFTW------------IIP

Query:  NFFKLIETQHGKTIKHMRSDNAPELMFTEFFKEKGVL---------------------------------RIPLTFWGECILTAVYLINKTPSKMLNWKS
        +FFKLIETQ+GK IK +RSDNA +L FT FF++KGV+                                 ++PL FWG+CI+TAVYLI++TPS++L WK 
Subjt:  NFFKLIETQHGKTIKHMRSDNAPELMFTEFFKEKGVL---------------------------------RIPLTFWGECILTAVYLINKTPSKMLNWKS

Query:  PFQLLNNNKPDYNNLKVFGSLCYASTLPHNRSKFQPRAIPVVFVGYPQGMKAYKLYNIDQQKFFISRDVIFREEVFPFKDSTVSQPMEYPLPGFSLPKVF
        PFQ LNN  PDYN+LKVFGSLCYAS+LPHN SKFQPRAIP VF+GYP+GMK YKLY+I+ +K FISRD        P  D  +++               
Subjt:  PFQLLNNNKPDYNNLKVFGSLCYASTLPHNRSKFQPRAIPVVFVGYPQGMKAYKLYNIDQQKFFISRDVIFREEVFPFKDSTVSQPMEYPLPGFSLPKVF

Query:  HESSIDHMNPSTNMPSPSFSPNTNNDSTSQTIVQSTNNPTNDSTSSTQQPKQNISETSNNPTVRRSTRIGKQPSYLQAYHCNLLTSQ-PHQPSSTKYPLN
                                            N+  N++T S Q  + N +E +   T+RRSTRI K PSYLQAYHC+LLT+Q P    STKYP+N
Subjt:  HESSIDHMNPSTNMPSPSFSPNTNNDSTSQTIVQSTNNPTNDSTSSTQQPKQNISETSNNPTVRRSTRIGKQPSYLQAYHCNLLTSQ-PHQPSSTKYPLN

Query:  QYISYQNLSHSFKHSVLQASTLHEPNFYHEAVIHDDWRKAMENELKVMETNQTWSIVSLPASKNIVGCRWIYKIKHKADGSVERYKARFVAKGYTQQEGL
        QY+SYQ L  ++K+S+LQ ST  E +FYHEAV+  +W +AM+ EL+ METNQTWSIV LP  KN +GCRW+YKIKHKADGS+ERYK R VAKGYTQQEGL
Subjt:  QYISYQNLSHSFKHSVLQASTLHEPNFYHEAVIHDDWRKAMENELKVMETNQTWSIVSLPASKNIVGCRWIYKIKHKADGSVERYKARFVAKGYTQQEGL

Query:  DYFETFSPVAKMVTVKTLLTIAVSKDWHLFQLDVNNAFLHGDLFEEVYMDLSLGYKPNLTIQGEKM---------------------------------S
        DYFETFS V KMVTVKTLLTIAVSK+W L QLDVNNAFLHG+LFEEVYMDL LGYKP   IQGEK+                                 S
Subjt:  DYFETFSPVAKMVTVKTLLTIAVSKDWHLFQLDVNNAFLHGDLFEEVYMDLSLGYKPNLTIQGEKM---------------------------------S

Query:  KADYSLFVKGQGTNFIALLVYVDDIIITGAN--------------FLLKDLGSLRFFLGFELAHTSTGILLSQRHYALCLLEEVGLLGAKP
        KA+YSLF++G+  +FIALLVYVDDIIITGAN              FLLKDLG L+FFLG ELA  S+G+ LSQ++Y L L+E+ GLLGAKP
Subjt:  KADYSLFVKGQGTNFIALLVYVDDIIITGAN--------------FLLKDLGSLRFFLGFELAHTSTGILLSQRHYALCLLEEVGLLGAKP

KZV25004.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Dorcoceras hygrometricum]1.1e-17037.13Show/hide
Query:  KPRYHCYDG-VREMTDFLQMEYLMDFLMGLNENFSQARGQLLLMDPLPSTSRAFSIILQEEQQRSI---------------GSLSSTAPTMVFVVSSNSS
        +P   C  G +RE  ++   E +M FLMGLN++++Q R Q+L+++PLP+ ++ F++++QEE+QRSI                +++S+A T   + +S +S
Subjt:  KPRYHCYDG-VREMTDFLQMEYLMDFLMGLNENFSQARGQLLLMDPLPSTSRAFSIILQEEQQRSI---------------GSLSSTAPTMVFVVSSNSS

Query:  KNGTNNRQRRGKPTCTHCN---------------TPGH----------IATQNSENISQDTTQNSQMVNNISTAEAFIQCQNLLNQLQSQINASNQI---
        K G  +R       C+HC+                PGH           A  +  + S +T Q +Q +++ S +    QC+ L+  L S++     +   
Subjt:  KNGTNNRQRRGKPTCTHCN---------------TPGH----------IATQNSENISQDTTQNSQMVNNISTAEAFIQCQNLLNQLQSQINASNQI---

Query:  ---------------ATSHIAGTSYSFPLWIIDSGASTHISCCKSYFTSTQPCSTTISLPNKQVFEVKSAGTIKLSESIELKNVLYISEFSFNLIPVSAL
                       ATSHI   +     WI+D+GA+ HI C  S F S++   + + LPN     V  AGT+ ++ ++ L+NVLY+  F FNL+ VS+L
Subjt:  ---------------ATSHIAGTSYSFPLWIIDSGASTHISCCKSYFTSTQPCSTTISLPNKQVFEVKSAGTIKLSESIELKNVLYISEFSFNLIPVSAL

Query:  TKDLPVDVSFSTNNCVIQDKFTLKKIGSVELLYGLYVFKLRNASNPQSPICVVNSDNAFLWHQRLGHPCVDVLKSLQSMLQLKSF-TSHTCTTCTLAKQR
        T +    VSF +++C IQD   ++ IG  + +  LYV +  +   P S IC     N+ LWH+R+GHP  + L SL+++L +++    + C +C L+KQR
Subjt:  TKDLPVDVSFSTNNCVIQDKFTLKKIGSVELLYGLYVFKLRNASNPQSPICVVNSDNAFLWHQRLGHPCVDVLKSLQSMLQLKSF-TSHTCTTCTLAKQR

Query:  RLSFSANNHVSPNSFDLIHVDIWGPLSTATHVGHTYFLTIVDDATRFTW------------IIPNFFKLIETQHGKTIKHMRSDNAPELMFTEFFKEKGV
        RL  ++ N++S   F+L+H+D WGP S  +  G  +F TIVDD +R+TW            I P+F +++ TQ G T+K +RSDNAPEL F +FF + G+
Subjt:  RLSFSANNHVSPNSFDLIHVDIWGPLSTATHVGHTYFLTIVDDATRFTW------------IIPNFFKLIETQHGKTIKHMRSDNAPELMFTEFFKEKGV

Query:  L---------------------------------RIPLTFWGECILTAVYLINKTPSKMLNWKSPFQLLNNNKPDYNNLKVFGSLCYASTLPHNRSKFQP
                                           IPL +W +CI T+VYLIN+TPS +L  K+PF+LL+   P Y++LKVFG LCYASTL  +R KF P
Subjt:  L---------------------------------RIPLTFWGECILTAVYLINKTPSKMLNWKSPFQLLNNNKPDYNNLKVFGSLCYASTLPHNRSKFQP

Query:  RAIPVVFVGYPQGMKAYKLYNIDQQKFFISRDVIFREEVFPFKDSTVSQPMEYPLPGFSLPKVFHESSIDHMNPSTNMPSPSFSPNTNNDSTSQTIVQST
        RAI  VF+GYP G K YKL N++  + FISRDVIF E  FP++++                                      SP + +D T +      
Subjt:  RAIPVVFVGYPQGMKAYKLYNIDQQKFFISRDVIFREEVFPFKDSTVSQPMEYPLPGFSLPKVFHESSIDHMNPSTNMPSPSFSPNTNNDSTSQTIVQST

Query:  NNPTNDSTSSTQQPKQNISETSNNPTVRRSTRIGKQPSYLQAYHCNLLTSQPHQPSSTKYPLNQYISYQNLSHSFKHSVLQASTLHEPNFYHEAVIHDDW
         +P++  T S     Q  S TS         R    PS+L+ YHC  +++     +ST +P++  ++Y  LS S +  V   S++ EP  + +AV   +W
Subjt:  NNPTNDSTSSTQQPKQNISETSNNPTVRRSTRIGKQPSYLQAYHCNLLTSQPHQPSSTKYPLNQYISYQNLSHSFKHSVLQASTLHEPNFYHEAVIHDDW

Query:  RKAMENELKVMETNQTWSIVSLPASKNIVGCRWIYKIKHKADGSVERYKARFVAKGYTQQEGLDYFETFSPVAKMVTVKTLLTIAVSKDWHLFQLDVNNA
        R+AM+ ELK +E N TWSIVSLP  K+ VGCRW+YK K  ADGS++RYKAR VAKGYTQQEGLDY ETFSPVAK+VTV+TLL +A  + W L QLDVNNA
Subjt:  RKAMENELKVMETNQTWSIVSLPASKNIVGCRWIYKIKHKADGSVERYKARFVAKGYTQQEGLDYFETFSPVAKMVTVKTLLTIAVSKDWHLFQLDVNNA

Query:  FLHGDLFEEVYMDLSLGY------------KPNLTIQGEK--------------------MSKADYSLFVKGQGTNFIALLVYVDDIIIT----------
        FLHGDL EEVYM L  G+            K + +I G K                     S AD SLF++     F+AL+VYVDDI+I           
Subjt:  FLHGDLFEEVYMDLSLGY------------KPNLTIQGEK--------------------MSKADYSLFVKGQGTNFIALLVYVDDIIIT----------

Query:  ----GANFLLKDLGSLRFFLGFELAHTSTGILLSQRHYALCLLEEVGLLGAKP
             + F LKDLG+L++FLG E+A ++ G+ + QR+YA+ LL E GLLG KP
Subjt:  ----GANFLLKDLGSLRFFLGFELAHTSTGILLSQRHYALCLLEEVGLLGAKP

TYK16758.1 Copia protein [Cucumis melo var. makuwa]6.5e-20044.33Show/hide
Query:  REMTDFLQMEYLMDFLMGLNENFSQARGQLLLMDPLPSTSRAFSIILQEEQQRSIGSLSS--TAPTMVFVVSSNSSKNGTNNRQRRGKPTCTHCNTPGHI
        +E+ +FLQ EYLMDFLMGLN++++Q R QLLLM+P+PS SRAFS++LQEEQQR+I S S     PT+    SSN+ KN + ++QR+ +P CTHCN PGH 
Subjt:  REMTDFLQMEYLMDFLMGLNENFSQARGQLLLMDPLPSTSRAFSIILQEEQQRSIGSLSS--TAPTMVFVVSSNSSKNGTNNRQRRGKPTCTHCNTPGHI

Query:  ATQNSENISQDTTQNSQMVNNISTAEAFIQCQNLLNQLQSQINASNQIATSHIAGTSYSFPLWIIDSGASTHISCCKSYFTSTQPCSTTISLPNKQVFEV
                                                                                                            
Subjt:  ATQNSENISQDTTQNSQMVNNISTAEAFIQCQNLLNQLQSQINASNQIATSHIAGTSYSFPLWIIDSGASTHISCCKSYFTSTQPCSTTISLPNKQVFEV

Query:  KSAGTIKLSESIELKNVLYISEFSFNLIPVSALTKDLPVDVSFSTNNCVIQDKFTLKKIGSVELLYGLYVFKLRNASNPQSPICVVNSDNAFLWHQRLGH
                                                   + + C  +DK+T KKIGS +L  GLY+    +       IC V ++ A LWHQR   
Subjt:  KSAGTIKLSESIELKNVLYISEFSFNLIPVSALTKDLPVDVSFSTNNCVIQDKFTLKKIGSVELLYGLYVFKLRNASNPQSPICVVNSDNAFLWHQRLGH

Query:  PCVDVLKSLQSMLQLKSFTSHTCTTCTLAKQRRLSFSANNHVSPNSFDLIHVDIWGPLSTATHVGHTYFLTIVDDATRFTW------------IIPNFFK
                                                           +DIWGP ST T+ G++YFLTIVDDATR+TW            IIP FFK
Subjt:  PCVDVLKSLQSMLQLKSFTSHTCTTCTLAKQRRLSFSANNHVSPNSFDLIHVDIWGPLSTATHVGHTYFLTIVDDATRFTW------------IIPNFFK

Query:  LIETQHGKTIKHMRSDNAPELMFTEFFKEKGVL---------------------------------RIPLTFWGECILTAVYLINKTPSKMLNWKSPFQL
        LIETQ+GK IK +RSDNAPEL FT FF++KGV+                                 ++PL FWG+CILTA+YLIN+TPSK+L WKS FQ 
Subjt:  LIETQHGKTIKHMRSDNAPELMFTEFFKEKGVL---------------------------------RIPLTFWGECILTAVYLINKTPSKMLNWKSPFQL

Query:  LNNNKPDYNNLKVFGSLCYASTLPHNRSKFQPRAIPVVFVGYPQGMKAYKLYNIDQQKFFISRDVIFREEVFPFKDSTVSQPMEYPLPGFSLPKVFHESS
        LNN  PDYN+LKVFGSLCYAS+LP+NRSKFQ RAIP VF+GYPQGMKAYKLY+I+ +K FISRDVIF E  FPF +   +Q    PLPGFSLPK FHE +
Subjt:  LNNNKPDYNNLKVFGSLCYASTLPHNRSKFQPRAIPVVFVGYPQGMKAYKLYNIDQQKFFISRDVIFREEVFPFKDSTVSQPMEYPLPGFSLPKVFHESS

Query:  IDHMNPSTNMPSPSFSPNT--------------NNDSTSQTIV----QSTNNPTNDSTSSTQQPKQNISETSNNPTVRRSTRIGKQPSYLQAYHCNLLTS
        +   + +T +P P+   NT              +ND  +Q  +     + N+  N++T S Q  + N +E +   T+R+STRI K PSYLQAYHC+LLT+
Subjt:  IDHMNPSTNMPSPSFSPNT--------------NNDSTSQTIV----QSTNNPTNDSTSSTQQPKQNISETSNNPTVRRSTRIGKQPSYLQAYHCNLLTS

Query:  Q-PHQPSSTKYPLNQYISYQNLSHSFKHSVLQASTLHEPNFYHEAVIHDDWRKAMENELKVMETNQTWSIVSLPASKNIVGCRWIYKIKHKADGSVERYK
        Q P    STKYP+NQY+SYQ LS ++K+S+LQ ST  E +FYHEAVI  +WR+AM+ EL+ METNQTWSIV LP  KN +GCRW+YKIKHK DGS+ERYK
Subjt:  Q-PHQPSSTKYPLNQYISYQNLSHSFKHSVLQASTLHEPNFYHEAVIHDDWRKAMENELKVMETNQTWSIVSLPASKNIVGCRWIYKIKHKADGSVERYK

Query:  ARFVAKGYTQQEGLDYFETFSPVAKMVTVKTLLTIAVSKDWHLFQLDVNNAFLHGDLFEEVYMDLSLGYKPNLTIQGEK---------------------
        AR VAKGYTQQEGLDYFETFSPVAKMVTVKTLLTIAVSK+W L QLDVNNAFLHG+LFEEVYMDL LGYKP   IQGEK                     
Subjt:  ARFVAKGYTQQEGLDYFETFSPVAKMVTVKTLLTIAVSKDWHLFQLDVNNAFLHGDLFEEVYMDLSLGYKPNLTIQGEK---------------------

Query:  ------------MSKADYSLFVKGQGTNFIALLVYVDDIIITGAN--------------FLLKDLGSLRFFLGFELAHT
                     SKADY LF++G+  +FIALLVYVDDIIITGAN              FLLKDLG L+FFL      T
Subjt:  ------------MSKADYSLFVKGQGTNFIALLVYVDDIIITGAN--------------FLLKDLGSLRFFLGFELAHT

TYK18103.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]1.7e-20344.3Show/hide
Query:  GVREMTDFLQMEYLMDFLMGLNENFSQARGQLLLMDPLPSTSRAFSIILQEEQQRSIGSLSSTAPTMVFVVSSNSSKNGTNNRQRRGKPTCTHCNTPGHI
        G  E  +FLQ EYLMDFLMGLN++++Q R QLLLM+P+PS SRAFS++LQEEQQR+I S S                                 NTP   
Subjt:  GVREMTDFLQMEYLMDFLMGLNENFSQARGQLLLMDPLPSTSRAFSIILQEEQQRSIGSLSSTAPTMVFVVSSNSSKNGTNNRQRRGKPTCTHCNTPGHI

Query:  ATQNSENISQDTTQNSQMVNNISTAEAFIQCQNLLNQLQSQINASNQIATSHIAGTSYSFPLWIIDSGASTHISCCKSYFTSTQPCSTTISLPNKQVFEV
                                                                                                            
Subjt:  ATQNSENISQDTTQNSQMVNNISTAEAFIQCQNLLNQLQSQINASNQIATSHIAGTSYSFPLWIIDSGASTHISCCKSYFTSTQPCSTTISLPNKQVFEV

Query:  KSAGTIKLSESIELKNVLYISEFSFNLIPVSALTKDLPVDVSFSTNNCVIQDKFTLKKIGSVELLYGLYVFKLRNASNPQSPICVVNSDNAFLWHQRLGH
                               +F+L+                       DK+T KKIGS +L  GLY+    +       IC V ++ A LWHQR+GH
Subjt:  KSAGTIKLSESIELKNVLYISEFSFNLIPVSALTKDLPVDVSFSTNNCVIQDKFTLKKIGSVELLYGLYVFKLRNASNPQSPICVVNSDNAFLWHQRLGH

Query:  PCVDVLKSLQSMLQLKSFT----SHTCTTCTLAKQRRLSFSANNHVSPNSFDLIHVDIWGPLSTATHVGHTYFLTIVDDATRFTW------------IIP
        P  + L SLQ  L LK  T    SH CT C LAKQR+LSF++NNH+S N+FDLIHVDIWGP ST T+  ++YFLTIVDDATR+TW            IIP
Subjt:  PCVDVLKSLQSMLQLKSFT----SHTCTTCTLAKQRRLSFSANNHVSPNSFDLIHVDIWGPLSTATHVGHTYFLTIVDDATRFTW------------IIP

Query:  NFFKLIETQHGKTIKHMRSDNAPELMFTEFFKEKGVL---------------------------------RIPLTFWGECILTAVYLINKTPSKMLNWKS
        +FFKLIETQ+GK IK +RSDNA +L FT FF++KGV+                                 ++PL FWG+CI+TAVYLI++TPS++L WK 
Subjt:  NFFKLIETQHGKTIKHMRSDNAPELMFTEFFKEKGVL---------------------------------RIPLTFWGECILTAVYLINKTPSKMLNWKS

Query:  PFQLLNNNKPDYNNLKVFGSLCYASTLPHNRSKFQPRAIPVVFVGYPQGMKAYKLYNIDQQKFFISRDVIFREEVFPFKDSTVSQPMEYPLPGFSLPKVF
        PFQ LNN  PDYN+LKVFGSLCYAS+LPHN SKFQPRAIP VF+GYP+GMK YKLY+I+ +K FISRD        P  D  +++               
Subjt:  PFQLLNNNKPDYNNLKVFGSLCYASTLPHNRSKFQPRAIPVVFVGYPQGMKAYKLYNIDQQKFFISRDVIFREEVFPFKDSTVSQPMEYPLPGFSLPKVF

Query:  HESSIDHMNPSTNMPSPSFSPNTNNDSTSQTIVQSTNNPTNDSTSSTQQPKQNISETSNNPTVRRSTRIGKQPSYLQAYHCNLLTSQ-PHQPSSTKYPLN
                                            N+  N++T S Q  + N +E +   T+RRSTRI K PSYLQAYHC+LLT+Q P    STKYP+N
Subjt:  HESSIDHMNPSTNMPSPSFSPNTNNDSTSQTIVQSTNNPTNDSTSSTQQPKQNISETSNNPTVRRSTRIGKQPSYLQAYHCNLLTSQ-PHQPSSTKYPLN

Query:  QYISYQNLSHSFKHSVLQASTLHEPNFYHEAVIHDDWRKAMENELKVMETNQTWSIVSLPASKNIVGCRWIYKIKHKADGSVERYKARFVAKGYTQQEGL
        QY+SYQ L  ++K+S+LQ ST  E +FYHEAV+  +W +AM+ EL+ METNQTWSIV LP  KN +GCRW+YKIKHKADGS+ERYK R VAKGYTQQEGL
Subjt:  QYISYQNLSHSFKHSVLQASTLHEPNFYHEAVIHDDWRKAMENELKVMETNQTWSIVSLPASKNIVGCRWIYKIKHKADGSVERYKARFVAKGYTQQEGL

Query:  DYFETFSPVAKMVTVKTLLTIAVSKDWHLFQLDVNNAFLHGDLFEEVYMDLSLGYKPNLTIQGEKM---------------------------------S
        DYFETFS V KMVTVKTLLTIAVSK+W L QLDVNNAFLHG+LFEEVYMDL LGYKP   IQGEK+                                 S
Subjt:  DYFETFSPVAKMVTVKTLLTIAVSKDWHLFQLDVNNAFLHGDLFEEVYMDLSLGYKPNLTIQGEKM---------------------------------S

Query:  KADYSLFVKGQGTNFIALLVYVDDIIITGAN--------------FLLKDLGSLRFFLGFELAHTSTGILLSQRHYALCLLEEVGLLGAKP
        KA+YSLF++G+  +FIALLVYVDDIIITGAN              FLLKDLG L+FFLG ELA  S+G+ LSQ++Y L L+E+ GLLGAKP
Subjt:  KADYSLFVKGQGTNFIALLVYVDDIIITGAN--------------FLLKDLGSLRFFLGFELAHTSTGILLSQRHYALCLLEEVGLLGAKP

TrEMBL top hitse value%identityAlignment
A0A2N9GZW3 Integrase catalytic domain-containing protein3.2e-17638.29Show/hide
Query:  WIRNNNILKTIPCNLGAKPRYHCYDGVREMTDFLQMEYLMDFLMGLNENFSQARGQLLLMDPLPSTSRAFSIILQEEQQRSIG--SLSSTAPTMVFVVSS
        W   +N      C+ GA         ++ + D  Q EY+M FLMGLN++FS  R Q+L+ DPLPS ++AF++++QEE+QR+I   SL+  A ++      
Subjt:  WIRNNNILKTIPCNLGAKPRYHCYDGVREMTDFLQMEYLMDFLMGLNENFSQARGQLLLMDPLPSTSRAFSIILQEEQQRSIG--SLSSTAPTMVFVVSS

Query:  NSSKN--GTNNRQRRGKPTCTHCNTPGHIATQ-----------NSENISQDTTQNSQMVNNISTAEAFIQCQNLLNQLQSQIN----------ASNQIAT
         ++++  G N   ++ +P C+HC   GH   +             +       Q+S +V +        QCQ LL+ L SQ +           +NQ+ +
Subjt:  NSSKN--GTNNRQRRGKPTCTHCNTPGHIATQ-----------NSENISQDTTQNSQMVNNISTAEAFIQCQNLLNQLQSQIN----------ASNQIAT

Query:  SHIAGTS---------------------------------------YSFPLWIIDSGASTHISCCKSYFTS-TQPCSTTISLPNKQVFEVKSAGTIKLSE
           AGTS                                       +S   WI+D+GA+ H+      FTS T   +T I LPN +       GT++++ 
Subjt:  SHIAGTS---------------------------------------YSFPLWIIDSGASTHISCCKSYFTS-TQPCSTTISLPNKQVFEVKSAGTIKLSE

Query:  SIELKNVLYISEFSFNLIPVSALTKDLPVDVSFSTNNCVIQDKFTLKKIGSVELLYGLYVFKLRNASNPQSPICVVNSDNAF-------LWHQRLGHPC-
        S+ L +VL +  FSFNLI +S LT      V F ++ C IQD  T K+IG      GLY  +    + P S   +V +  A        +WH RLGHP  
Subjt:  SIELKNVLYISEFSFNLIPVSALTKDLPVDVSFSTNNCVIQDKFTLKKIGSVELLYGLYVFKLRNASNPQSPICVVNSDNAF-------LWHQRLGHPC-

Query:  --VDVLKSLQSMLQLKSFTSHTCTTCTLAKQRRLSFSANNHVSPNSFDLIHVDIWGPLSTATHVGHTYFLTIVDDATRFTWI------------IPNFFK
          + +LK++ S L + S   H C  C ++KQ+RL F    H +   FDLIH DIWGP    T     YFLTIVDD TR TW+            I +FF 
Subjt:  --VDVLKSLQSMLQLKSFTSHTCTTCTLAKQRRLSFSANNHVSPNSFDLIHVDIWGPLSTATHVGHTYFLTIVDDATRFTWI------------IPNFFK

Query:  LIETQHGKTIKHMRSDNAPELMFTEFFKEKGVL---------------------------------RIPLTFWGECILTAVYLINKTPSKMLNWKSPFQL
        LI+TQ   +IK +RSDN PE     F+ + G L                                  +PL FWG C+LTA +LIN+ P+ +L  KSPF+L
Subjt:  LIETQHGKTIKHMRSDNAPELMFTEFFKEKGVL---------------------------------RIPLTFWGECILTAVYLINKTPSKMLNWKSPFQL

Query:  LNNNKPDYNNLKVFGSLCYASTLPHNRSKFQPRAIPVVFVGYPQGMKAYKLYNIDQQKFFISRDVIFREEVFPF---KDSTVSQPMEYPLPGFSLPKVFH
        L    P+Y+ L+VFG LCYA+TL HNR KF PR+   V +GYPQG+K Y+L ++D ++ F+SRDV+F E  FPF   + ST +  M  P P   LP    
Subjt:  LNNNKPDYNNLKVFGSLCYASTLPHNRSKFQPRAIPVVFVGYPQGMKAYKLYNIDQQKFFISRDVIFREEVFPF---KDSTVSQPMEYPLPGFSLPKVFH

Query:  ESSIDHMNPSTN---MPSPSFSP--NTNNDSTSQTIVQSTNNPTNDSTSSTQQPKQNISETSNNPTVRRSTRIGKQPSYLQAYHCNLLTSQP-HQPSS--
          + D  N ST+     SP  SP   +++ ++S   V ST     D  S    P  N   T    T+R+STRI K PSYLQA+HCN  +S P H PSS  
Subjt:  ESSIDHMNPSTN---MPSPSFSP--NTNNDSTSQTIVQSTNNPTNDSTSSTQQPKQNISETSNNPTVRRSTRIGKQPSYLQAYHCNLLTSQP-HQPSS--

Query:  -------TKYPLNQYISYQNLSHSFKHSVLQASTLHEPNFYHEAVIHDDWRKAMENELKVMETNQTWSIVSLPASKNIVGCRWIYKIKHKADGSVERYKA
               T +PL+ YISY  L+  +   VL AS + EP  +HEA    +W +AM+ EL  +E N TWS+  LP  K  +G +W++K+K ++DGS+ERYKA
Subjt:  -------TKYPLNQYISYQNLSHSFKHSVLQASTLHEPNFYHEAVIHDDWRKAMENELKVMETNQTWSIVSLPASKNIVGCRWIYKIKHKADGSVERYKA

Query:  RFVAKGYTQQEGLDYFETFSPVAKMVTVKTLLTIAVSKDWHLFQLDVNNAFLHGDLFEEVYMDLSLG-------------------------------YK
        R VAKGY QQEG DYFETFSPVAK VTV++LL IA  K W L+QLDVNNAFLHG+L EEVYM L  G                               + 
Subjt:  RFVAKGYTQQEGLDYFETFSPVAKMVTVKTLLTIAVSKDWHLFQLDVNNAFLHGDLFEEVYMDLSLG-------------------------------YK

Query:  PNLTIQGEKMSKADYSLFVKGQGTNFIALLVYVDDIIITG--------------ANFLLKDLGSLRFFLGFELAHTSTGILLSQRHYALCLLEEVGLLGA
          L   G   SKADYSLF +  G++FIALLVYVDDI+I                A F LKDLG +R+FLG E+A +S GI +SQR YAL +LE+ GLLG 
Subjt:  PNLTIQGEKMSKADYSLFVKGQGTNFIALLVYVDDIIITG--------------ANFLLKDLGSLRFFLGFELAHTSTGILLSQRHYALCLLEEVGLLGA

Query:  KP
        KP
Subjt:  KP

A0A2N9HYD2 Integrase catalytic domain-containing protein2.9e-17736.86Show/hide
Query:  WIRNNNILKTIPCNLGAKPRYHCYDGVREMTDFLQMEYLMDFLMGLNENFSQARGQLLLMDPLPSTSRAFSIILQEEQQRSIG--SLSSTAPTMVFVVSS
        W   NN      C+ GA         ++ + D  Q E +M FLMGLN++F+  R Q+L+M+PLP+ ++AFS+++QEE+QRSIG  +L ++  +M     S
Subjt:  WIRNNNILKTIPCNLGAKPRYHCYDGVREMTDFLQMEYLMDFLMGLNENFSQARGQLLLMDPLPSTSRAFSIILQEEQQRSIG--SLSSTAPTMVFVVSS

Query:  NSSKNGTNNRQRRGK---PTCTHCNTPGHIAT--------------QNSENISQDTTQNSQMVNNISTAEAFIQCQNLLNQLQSQI---NASNQI-----
           +N    R   GK   P C+HC   GHI                +N+ + +   +   +   ++   +A  QCQ LL  L SQ    +AS+Q+     
Subjt:  NSSKNGTNNRQRRGK---PTCTHCNTPGHIAT--------------QNSENISQDTTQNSQMVNNISTAEAFIQCQNLLNQLQSQI---NASNQI-----

Query:  --------------ATSHIAGTS--------------------YSFPL---------------------WIIDSGASTHISCCKSYFTS-TQPCSTTISL
                      A++  A +S                    YS P                      WI+D+GA+ H+    S FTS T    + I L
Subjt:  --------------ATSHIAGTS--------------------YSFPL---------------------WIIDSGASTHISCCKSYFTS-TQPCSTTISL

Query:  PNKQVFEVKSAGTIKLSESIELKNVLYISEFSFNLIPVSALTKDLPVDVSFSTNNCVIQDKFTLKKIGSVELLYGLYVFKLR-----NASNPQSPI----
        PN Q       G++++S ++ L NVL +  FSFNLI ++ LT  +P  V FS++ C IQD  + K+IG  +  +GLY+ ++      +  NP + +    
Subjt:  PNKQVFEVKSAGTIKLSESIELKNVLYISEFSFNLIPVSALTKDLPVDVSFSTNNCVIQDKFTLKKIGSVELLYGLYVFKLR-----NASNPQSPI----

Query:  CVVNSDNAFLWHQRLGHPC---VDVLKSLQSMLQLKSFTSHTCTTCTLAKQRRLSFSANNHVSPNSFDLIHVDIWGPLSTATHVGHTYFLTIVDDATRFT
          VN     +WH RLGHP    +++L  + S L + S + H C  C L+K RRL F  + H+S   FDLIH DIWGP    T     YFLTIVDD TR T
Subjt:  CVVNSDNAFLWHQRLGHPC---VDVLKSLQSMLQLKSFTSHTCTTCTLAKQRRLSFSANNHVSPNSFDLIHVDIWGPLSTATHVGHTYFLTIVDDATRFT

Query:  WI------------IPNFFKLIETQHGKTIKHMRSDNAPELMFTEFFKEKGVL---------------------------------RIPLTFWGECILTA
        WI            + +FF L++TQ   +IK +RSDN  E   TEF+ + G +                                  IPL +WG C+LTA
Subjt:  WI------------IPNFFKLIETQHGKTIKHMRSDNAPELMFTEFFKEKGVL---------------------------------RIPLTFWGECILTA

Query:  VYLINKTPSKMLNWKSPFQLLNNNKPDYNNLKVFGSLCYASTLPHNRSKFQPRAIPVVFVGYPQGMKAYKLYNIDQQKFFISRDVIFREEVFPFKDSTVS
         YLIN+ PS +LN KSP+++L    P Y++L+VFGSLCYA+TL HNR KF PR+   + +GYP G K Y+L ++   + F+SRDV+F E +FPF+  T +
Subjt:  VYLINKTPSKMLNWKSPFQLLNNNKPDYNNLKVFGSLCYASTLPHNRSKFQPRAIPVVFVGYPQGMKAYKLYNIDQQKFFISRDVIFREEVFPFKDSTVS

Query:  QPMEYPL------------PGFS---LPKVFHESSIDHMN--------------PSTNMPSPSFSPNTNNDSTSQT------IVQSTNNPTNDSTSSTQQ
         P  + L            P F    +P    +S++ H +              P T  P    S  T+ D++++T       + ++ NP + +   +  
Subjt:  QPMEYPL------------PGFS---LPKVFHESSIDHMN--------------PSTNMPSPSFSPNTNNDSTSQT------IVQSTNNPTNDSTSSTQQ

Query:  PKQNISETSNNPTV---RRSTRIGKQPSYLQAYHCNLLTSQP-------HQPSSTKYPLNQYISYQNLSHSFKHSVLQASTLHEPNFYHEAVIHDDWRKA
        P   + + S  P V   R+STR+ K P+YLQ YHC+  +S P          SSTK+PL+Q +SY +LS ++K  VL AST+ EP+ Y+EA     W +A
Subjt:  PKQNISETSNNPTV---RRSTRIGKQPSYLQAYHCNLLTSQP-------HQPSSTKYPLNQYISYQNLSHSFKHSVLQASTLHEPNFYHEAVIHDDWRKA

Query:  MENELKVMETNQTWSIVSLPASKNIVGCRWIYKIKHKADGSVERYKARFVAKGYTQQEGLDYFETFSPVAKMVTVKTLLTIAVSKDWHLFQLDVNNAFLH
        M+ E+  +ETNQTWS+ SLP  K  +GC+W+YK+K ++DG++ERYKAR VAKGY QQEG DYFETFSPVAK VTV+ LL +A  K W L+QLDVNNAFLH
Subjt:  MENELKVMETNQTWSIVSLPASKNIVGCRWIYKIKHKADGSVERYKARFVAKGYTQQEGLDYFETFSPVAKMVTVKTLLTIAVSKDWHLFQLDVNNAFLH

Query:  GDLFEEVYMDLSLGYK-------PNLTI---------------------------QGEKMSKADYSLFVKGQGTNFIALLVYVDDIIITGAN--------
        G L EEVYM L  G+        P+ T+                            G   SKADYSLF K QG+ FIALLVYVDDI+I   N        
Subjt:  GDLFEEVYMDLSLGYK-------PNLTI---------------------------QGEKMSKADYSLFVKGQGTNFIALLVYVDDIIITGAN--------

Query:  ------FLLKDLGSLRFFLGFELAHTSTGILLSQRHYALCLLEEVGLLGAKPA
              F LKDLG +R+FLG E+A +S GI +SQR YAL ++E+ G+LG KPA
Subjt:  ------FLLKDLGSLRFFLGFELAHTSTGILLSQRHYALCLLEEVGLLGAKPA

A0A5A7VQN7 Cysteine-rich RLK (Receptor-like protein kinase) 88.0e-20444.3Show/hide
Query:  GVREMTDFLQMEYLMDFLMGLNENFSQARGQLLLMDPLPSTSRAFSIILQEEQQRSIGSLSSTAPTMVFVVSSNSSKNGTNNRQRRGKPTCTHCNTPGHI
        G  E  +FLQ EYLMDFLMGLN++++Q R QLLLM+P+PS SRAFS++LQEEQQR+I S S                                 NTP   
Subjt:  GVREMTDFLQMEYLMDFLMGLNENFSQARGQLLLMDPLPSTSRAFSIILQEEQQRSIGSLSSTAPTMVFVVSSNSSKNGTNNRQRRGKPTCTHCNTPGHI

Query:  ATQNSENISQDTTQNSQMVNNISTAEAFIQCQNLLNQLQSQINASNQIATSHIAGTSYSFPLWIIDSGASTHISCCKSYFTSTQPCSTTISLPNKQVFEV
                                                                                                            
Subjt:  ATQNSENISQDTTQNSQMVNNISTAEAFIQCQNLLNQLQSQINASNQIATSHIAGTSYSFPLWIIDSGASTHISCCKSYFTSTQPCSTTISLPNKQVFEV

Query:  KSAGTIKLSESIELKNVLYISEFSFNLIPVSALTKDLPVDVSFSTNNCVIQDKFTLKKIGSVELLYGLYVFKLRNASNPQSPICVVNSDNAFLWHQRLGH
                               +F+L+                       DK+T KKIGS +L  GLY+    +       IC V ++ A LWHQR+GH
Subjt:  KSAGTIKLSESIELKNVLYISEFSFNLIPVSALTKDLPVDVSFSTNNCVIQDKFTLKKIGSVELLYGLYVFKLRNASNPQSPICVVNSDNAFLWHQRLGH

Query:  PCVDVLKSLQSMLQLKSFT----SHTCTTCTLAKQRRLSFSANNHVSPNSFDLIHVDIWGPLSTATHVGHTYFLTIVDDATRFTW------------IIP
        P  + L SLQ  L LK  T    SH CT C LAKQR+LSF++NNH+S N+FDLIHVDIWGP ST T+  ++YFLTIVDDATR+TW            IIP
Subjt:  PCVDVLKSLQSMLQLKSFT----SHTCTTCTLAKQRRLSFSANNHVSPNSFDLIHVDIWGPLSTATHVGHTYFLTIVDDATRFTW------------IIP

Query:  NFFKLIETQHGKTIKHMRSDNAPELMFTEFFKEKGVL---------------------------------RIPLTFWGECILTAVYLINKTPSKMLNWKS
        +FFKLIETQ+GK IK +RSDNA +L FT FF++KGV+                                 ++PL FWG+CI+TAVYLI++TPS++L WK 
Subjt:  NFFKLIETQHGKTIKHMRSDNAPELMFTEFFKEKGVL---------------------------------RIPLTFWGECILTAVYLINKTPSKMLNWKS

Query:  PFQLLNNNKPDYNNLKVFGSLCYASTLPHNRSKFQPRAIPVVFVGYPQGMKAYKLYNIDQQKFFISRDVIFREEVFPFKDSTVSQPMEYPLPGFSLPKVF
        PFQ LNN  PDYN+LKVFGSLCYAS+LPHN SKFQPRAIP VF+GYP+GMK YKLY+I+ +K FISRD        P  D  +++               
Subjt:  PFQLLNNNKPDYNNLKVFGSLCYASTLPHNRSKFQPRAIPVVFVGYPQGMKAYKLYNIDQQKFFISRDVIFREEVFPFKDSTVSQPMEYPLPGFSLPKVF

Query:  HESSIDHMNPSTNMPSPSFSPNTNNDSTSQTIVQSTNNPTNDSTSSTQQPKQNISETSNNPTVRRSTRIGKQPSYLQAYHCNLLTSQ-PHQPSSTKYPLN
                                            N+  N++T S Q  + N +E +   T+RRSTRI K PSYLQAYHC+LLT+Q P    STKYP+N
Subjt:  HESSIDHMNPSTNMPSPSFSPNTNNDSTSQTIVQSTNNPTNDSTSSTQQPKQNISETSNNPTVRRSTRIGKQPSYLQAYHCNLLTSQ-PHQPSSTKYPLN

Query:  QYISYQNLSHSFKHSVLQASTLHEPNFYHEAVIHDDWRKAMENELKVMETNQTWSIVSLPASKNIVGCRWIYKIKHKADGSVERYKARFVAKGYTQQEGL
        QY+SYQ L  ++K+S+LQ ST  E +FYHEAV+  +W +AM+ EL+ METNQTWSIV LP  KN +GCRW+YKIKHKADGS+ERYK R VAKGYTQQEGL
Subjt:  QYISYQNLSHSFKHSVLQASTLHEPNFYHEAVIHDDWRKAMENELKVMETNQTWSIVSLPASKNIVGCRWIYKIKHKADGSVERYKARFVAKGYTQQEGL

Query:  DYFETFSPVAKMVTVKTLLTIAVSKDWHLFQLDVNNAFLHGDLFEEVYMDLSLGYKPNLTIQGEKM---------------------------------S
        DYFETFS V KMVTVKTLLTIAVSK+W L QLDVNNAFLHG+LFEEVYMDL LGYKP   IQGEK+                                 S
Subjt:  DYFETFSPVAKMVTVKTLLTIAVSKDWHLFQLDVNNAFLHGDLFEEVYMDLSLGYKPNLTIQGEKM---------------------------------S

Query:  KADYSLFVKGQGTNFIALLVYVDDIIITGAN--------------FLLKDLGSLRFFLGFELAHTSTGILLSQRHYALCLLEEVGLLGAKP
        KA+YSLF++G+  +FIALLVYVDDIIITGAN              FLLKDLG L+FFLG ELA  S+G+ LSQ++Y L L+E+ GLLGAKP
Subjt:  KADYSLFVKGQGTNFIALLVYVDDIIITGAN--------------FLLKDLGSLRFFLGFELAHTSTGILLSQRHYALCLLEEVGLLGAKP

A0A5D3CZP1 Copia protein3.1e-20044.33Show/hide
Query:  REMTDFLQMEYLMDFLMGLNENFSQARGQLLLMDPLPSTSRAFSIILQEEQQRSIGSLSS--TAPTMVFVVSSNSSKNGTNNRQRRGKPTCTHCNTPGHI
        +E+ +FLQ EYLMDFLMGLN++++Q R QLLLM+P+PS SRAFS++LQEEQQR+I S S     PT+    SSN+ KN + ++QR+ +P CTHCN PGH 
Subjt:  REMTDFLQMEYLMDFLMGLNENFSQARGQLLLMDPLPSTSRAFSIILQEEQQRSIGSLSS--TAPTMVFVVSSNSSKNGTNNRQRRGKPTCTHCNTPGHI

Query:  ATQNSENISQDTTQNSQMVNNISTAEAFIQCQNLLNQLQSQINASNQIATSHIAGTSYSFPLWIIDSGASTHISCCKSYFTSTQPCSTTISLPNKQVFEV
                                                                                                            
Subjt:  ATQNSENISQDTTQNSQMVNNISTAEAFIQCQNLLNQLQSQINASNQIATSHIAGTSYSFPLWIIDSGASTHISCCKSYFTSTQPCSTTISLPNKQVFEV

Query:  KSAGTIKLSESIELKNVLYISEFSFNLIPVSALTKDLPVDVSFSTNNCVIQDKFTLKKIGSVELLYGLYVFKLRNASNPQSPICVVNSDNAFLWHQRLGH
                                                   + + C  +DK+T KKIGS +L  GLY+    +       IC V ++ A LWHQR   
Subjt:  KSAGTIKLSESIELKNVLYISEFSFNLIPVSALTKDLPVDVSFSTNNCVIQDKFTLKKIGSVELLYGLYVFKLRNASNPQSPICVVNSDNAFLWHQRLGH

Query:  PCVDVLKSLQSMLQLKSFTSHTCTTCTLAKQRRLSFSANNHVSPNSFDLIHVDIWGPLSTATHVGHTYFLTIVDDATRFTW------------IIPNFFK
                                                           +DIWGP ST T+ G++YFLTIVDDATR+TW            IIP FFK
Subjt:  PCVDVLKSLQSMLQLKSFTSHTCTTCTLAKQRRLSFSANNHVSPNSFDLIHVDIWGPLSTATHVGHTYFLTIVDDATRFTW------------IIPNFFK

Query:  LIETQHGKTIKHMRSDNAPELMFTEFFKEKGVL---------------------------------RIPLTFWGECILTAVYLINKTPSKMLNWKSPFQL
        LIETQ+GK IK +RSDNAPEL FT FF++KGV+                                 ++PL FWG+CILTA+YLIN+TPSK+L WKS FQ 
Subjt:  LIETQHGKTIKHMRSDNAPELMFTEFFKEKGVL---------------------------------RIPLTFWGECILTAVYLINKTPSKMLNWKSPFQL

Query:  LNNNKPDYNNLKVFGSLCYASTLPHNRSKFQPRAIPVVFVGYPQGMKAYKLYNIDQQKFFISRDVIFREEVFPFKDSTVSQPMEYPLPGFSLPKVFHESS
        LNN  PDYN+LKVFGSLCYAS+LP+NRSKFQ RAIP VF+GYPQGMKAYKLY+I+ +K FISRDVIF E  FPF +   +Q    PLPGFSLPK FHE +
Subjt:  LNNNKPDYNNLKVFGSLCYASTLPHNRSKFQPRAIPVVFVGYPQGMKAYKLYNIDQQKFFISRDVIFREEVFPFKDSTVSQPMEYPLPGFSLPKVFHESS

Query:  IDHMNPSTNMPSPSFSPNT--------------NNDSTSQTIV----QSTNNPTNDSTSSTQQPKQNISETSNNPTVRRSTRIGKQPSYLQAYHCNLLTS
        +   + +T +P P+   NT              +ND  +Q  +     + N+  N++T S Q  + N +E +   T+R+STRI K PSYLQAYHC+LLT+
Subjt:  IDHMNPSTNMPSPSFSPNT--------------NNDSTSQTIV----QSTNNPTNDSTSSTQQPKQNISETSNNPTVRRSTRIGKQPSYLQAYHCNLLTS

Query:  Q-PHQPSSTKYPLNQYISYQNLSHSFKHSVLQASTLHEPNFYHEAVIHDDWRKAMENELKVMETNQTWSIVSLPASKNIVGCRWIYKIKHKADGSVERYK
        Q P    STKYP+NQY+SYQ LS ++K+S+LQ ST  E +FYHEAVI  +WR+AM+ EL+ METNQTWSIV LP  KN +GCRW+YKIKHK DGS+ERYK
Subjt:  Q-PHQPSSTKYPLNQYISYQNLSHSFKHSVLQASTLHEPNFYHEAVIHDDWRKAMENELKVMETNQTWSIVSLPASKNIVGCRWIYKIKHKADGSVERYK

Query:  ARFVAKGYTQQEGLDYFETFSPVAKMVTVKTLLTIAVSKDWHLFQLDVNNAFLHGDLFEEVYMDLSLGYKPNLTIQGEK---------------------
        AR VAKGYTQQEGLDYFETFSPVAKMVTVKTLLTIAVSK+W L QLDVNNAFLHG+LFEEVYMDL LGYKP   IQGEK                     
Subjt:  ARFVAKGYTQQEGLDYFETFSPVAKMVTVKTLLTIAVSKDWHLFQLDVNNAFLHGDLFEEVYMDLSLGYKPNLTIQGEK---------------------

Query:  ------------MSKADYSLFVKGQGTNFIALLVYVDDIIITGAN--------------FLLKDLGSLRFFLGFELAHT
                     SKADY LF++G+  +FIALLVYVDDIIITGAN              FLLKDLG L+FFL      T
Subjt:  ------------MSKADYSLFVKGQGTNFIALLVYVDDIIITGAN--------------FLLKDLGSLRFFLGFELAHT

A0A5D3D1N3 Cysteine-rich RLK (Receptor-like protein kinase) 88.0e-20444.3Show/hide
Query:  GVREMTDFLQMEYLMDFLMGLNENFSQARGQLLLMDPLPSTSRAFSIILQEEQQRSIGSLSSTAPTMVFVVSSNSSKNGTNNRQRRGKPTCTHCNTPGHI
        G  E  +FLQ EYLMDFLMGLN++++Q R QLLLM+P+PS SRAFS++LQEEQQR+I S S                                 NTP   
Subjt:  GVREMTDFLQMEYLMDFLMGLNENFSQARGQLLLMDPLPSTSRAFSIILQEEQQRSIGSLSSTAPTMVFVVSSNSSKNGTNNRQRRGKPTCTHCNTPGHI

Query:  ATQNSENISQDTTQNSQMVNNISTAEAFIQCQNLLNQLQSQINASNQIATSHIAGTSYSFPLWIIDSGASTHISCCKSYFTSTQPCSTTISLPNKQVFEV
                                                                                                            
Subjt:  ATQNSENISQDTTQNSQMVNNISTAEAFIQCQNLLNQLQSQINASNQIATSHIAGTSYSFPLWIIDSGASTHISCCKSYFTSTQPCSTTISLPNKQVFEV

Query:  KSAGTIKLSESIELKNVLYISEFSFNLIPVSALTKDLPVDVSFSTNNCVIQDKFTLKKIGSVELLYGLYVFKLRNASNPQSPICVVNSDNAFLWHQRLGH
                               +F+L+                       DK+T KKIGS +L  GLY+    +       IC V ++ A LWHQR+GH
Subjt:  KSAGTIKLSESIELKNVLYISEFSFNLIPVSALTKDLPVDVSFSTNNCVIQDKFTLKKIGSVELLYGLYVFKLRNASNPQSPICVVNSDNAFLWHQRLGH

Query:  PCVDVLKSLQSMLQLKSFT----SHTCTTCTLAKQRRLSFSANNHVSPNSFDLIHVDIWGPLSTATHVGHTYFLTIVDDATRFTW------------IIP
        P  + L SLQ  L LK  T    SH CT C LAKQR+LSF++NNH+S N+FDLIHVDIWGP ST T+  ++YFLTIVDDATR+TW            IIP
Subjt:  PCVDVLKSLQSMLQLKSFT----SHTCTTCTLAKQRRLSFSANNHVSPNSFDLIHVDIWGPLSTATHVGHTYFLTIVDDATRFTW------------IIP

Query:  NFFKLIETQHGKTIKHMRSDNAPELMFTEFFKEKGVL---------------------------------RIPLTFWGECILTAVYLINKTPSKMLNWKS
        +FFKLIETQ+GK IK +RSDNA +L FT FF++KGV+                                 ++PL FWG+CI+TAVYLI++TPS++L WK 
Subjt:  NFFKLIETQHGKTIKHMRSDNAPELMFTEFFKEKGVL---------------------------------RIPLTFWGECILTAVYLINKTPSKMLNWKS

Query:  PFQLLNNNKPDYNNLKVFGSLCYASTLPHNRSKFQPRAIPVVFVGYPQGMKAYKLYNIDQQKFFISRDVIFREEVFPFKDSTVSQPMEYPLPGFSLPKVF
        PFQ LNN  PDYN+LKVFGSLCYAS+LPHN SKFQPRAIP VF+GYP+GMK YKLY+I+ +K FISRD        P  D  +++               
Subjt:  PFQLLNNNKPDYNNLKVFGSLCYASTLPHNRSKFQPRAIPVVFVGYPQGMKAYKLYNIDQQKFFISRDVIFREEVFPFKDSTVSQPMEYPLPGFSLPKVF

Query:  HESSIDHMNPSTNMPSPSFSPNTNNDSTSQTIVQSTNNPTNDSTSSTQQPKQNISETSNNPTVRRSTRIGKQPSYLQAYHCNLLTSQ-PHQPSSTKYPLN
                                            N+  N++T S Q  + N +E +   T+RRSTRI K PSYLQAYHC+LLT+Q P    STKYP+N
Subjt:  HESSIDHMNPSTNMPSPSFSPNTNNDSTSQTIVQSTNNPTNDSTSSTQQPKQNISETSNNPTVRRSTRIGKQPSYLQAYHCNLLTSQ-PHQPSSTKYPLN

Query:  QYISYQNLSHSFKHSVLQASTLHEPNFYHEAVIHDDWRKAMENELKVMETNQTWSIVSLPASKNIVGCRWIYKIKHKADGSVERYKARFVAKGYTQQEGL
        QY+SYQ L  ++K+S+LQ ST  E +FYHEAV+  +W +AM+ EL+ METNQTWSIV LP  KN +GCRW+YKIKHKADGS+ERYK R VAKGYTQQEGL
Subjt:  QYISYQNLSHSFKHSVLQASTLHEPNFYHEAVIHDDWRKAMENELKVMETNQTWSIVSLPASKNIVGCRWIYKIKHKADGSVERYKARFVAKGYTQQEGL

Query:  DYFETFSPVAKMVTVKTLLTIAVSKDWHLFQLDVNNAFLHGDLFEEVYMDLSLGYKPNLTIQGEKM---------------------------------S
        DYFETFS V KMVTVKTLLTIAVSK+W L QLDVNNAFLHG+LFEEVYMDL LGYKP   IQGEK+                                 S
Subjt:  DYFETFSPVAKMVTVKTLLTIAVSKDWHLFQLDVNNAFLHGDLFEEVYMDLSLGYKPNLTIQGEKM---------------------------------S

Query:  KADYSLFVKGQGTNFIALLVYVDDIIITGAN--------------FLLKDLGSLRFFLGFELAHTSTGILLSQRHYALCLLEEVGLLGAKP
        KA+YSLF++G+  +FIALLVYVDDIIITGAN              FLLKDLG L+FFLG ELA  S+G+ LSQ++Y L L+E+ GLLGAKP
Subjt:  KADYSLFVKGQGTNFIALLVYVDDIIITGAN--------------FLLKDLGSLRFFLGFELAHTSTGILLSQRHYALCLLEEVGLLGAKP

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.1e-5625.89Show/hide
Query:  KPTCTHCNTPGHIATQNSENISQDTTQNSQMVNNISTAEAFIQCQNLLNQLQSQINASNQIA--TSHIAGTSYSFPL-WIIDSGASTHISCCKSYFTST-
        K  C HC   GH        I +D     +++NN          +N  N+ Q Q   S+ IA     +  TS      +++DSGAS H+   +S +T + 
Subjt:  KPTCTHCNTPGHIATQNSENISQDTTQNSQMVNNISTAEAFIQCQNLLNQLQSQINASNQIA--TSHIAGTSYSFPL-WIIDSGASTHISCCKSYFTST-

Query:  ---QPCSTTISLPNKQVFEVKSAGTIKL--SESIELKNVLYISEFSFNLIPVSALTKDLPVDVSFSTNNCVIQDKFTLKKIGSVELLYGLYVFKLRNASN
            P    ++   + ++  K  G ++L     I L++VL+  E + NL+ V  L ++  + + F  +   I  K  L  + +  +L  + V   +  S 
Subjt:  ---QPCSTTISLPNKQVFEVKSAGTIKL--SESIELKNVLYISEFSFNLIPVSALTKDLPVDVSFSTNNCVIQDKFTLKKIGSVELLYGLYVFKLRNASN

Query:  PQSPICVVNSDNAFLWHQRLGHPCVDVLKSL--------QSMLQLKSFTSHTCTTCTLAKQRRLSFSA---NNHVSPNSFDLIHVDIWGPLSTATHVGHT
            I   + +N  LWH+R GH     L  +        QS+L     +   C  C   KQ RL F       H+    F ++H D+ GP++  T     
Subjt:  PQSPICVVNSDNAFLWHQRLGHPCVDVLKSL--------QSMLQLKSFTSHTCTTCTLAKQRRLSFSA---NNHVSPNSFDLIHVDIWGPLSTATHVGHT

Query:  YFLTIVDDATRF--TWII----------PNFFKLIETQHGKTIKHMRSDNAPELMFTE---FFKEKGV---LRIPLT-----------------------
        YF+  VD  T +  T++I           +F    E      + ++  DN  E +  E   F  +KG+   L +P T                       
Subjt:  YFLTIVDDATRF--TWII----------PNFFKLIETQHGKTIKHMRSDNAPELMFTE---FFKEKGV---LRIPLT-----------------------

Query:  -------FWGECILTAVYLINKTPSKML--NWKSPFQLLNNNKPDYNNLKVFGSLCYASTLPHNRSKFQPRAIPVVFVGY-PQGMKAYKLYNIDQQKFFI
               FWGE +LTA YLIN+ PS+ L  + K+P+++ +N KP   +L+VFG+  Y   + + + KF  ++   +FVGY P G   +KL++   +KF +
Subjt:  -------FWGECILTAVYLINKTPSKML--NWKSPFQLLNNNKPDYNNLKVFGSLCYASTLPHNRSKFQPRAIPVVFVGY-PQGMKAYKLYNIDQQKFFI

Query:  SRDVIFRE-----------EVFPFKDSTVSQPMEYPLPGFSLPKVFHESSIDHMNPSTNMPSPSFSPNTNNDSTSQTIVQ--------------------
        +RDV+  E           E    KDS  S+   +P     + +    +     +    +     S N N  + S+ I+Q                    
Subjt:  SRDVIFRE-----------EVFPFKDSTVSQPMEYPLPGFSLPKVFHESSIDHMNPSTNMPSPSFSPNTNNDSTSQTIVQ--------------------

Query:  -STNNPTNDSTSSTQQPKQNISETSNNPTVRRSTRIGKQPSYLQAYHCNLLTSQPH----QPSSTKYPLNQYISYQNLSHSFKHSVLQASTLHE--PNFY
         S     N+S    +    N S+ S NP   R +   +   +L+    +  T           S +      ISY    +S    VL A T+    PN +
Subjt:  -STNNPTNDSTSSTQQPKQNISETSNNPTVRRSTRIGKQPSYLQAYHCNLLTSQPH----QPSSTKYPLNQYISYQNLSHSFKHSVLQASTLHE--PNFY

Query:  HEAVIHDD---WRKAMENELKVMETNQTWSIVSLPASKNIVGCRWIYKIKHKADGSVERYKARFVAKGYTQQEGLDYFETFSPVAKMVTVKTLLTIAVSK
         E    DD   W +A+  EL   + N TW+I   P +KNIV  RW++ +K+   G+  RYKAR VA+G+TQ+  +DY ETF+PVA++ + + +L++ +  
Subjt:  HEAVIHDD---WRKAMENELKVMETNQTWSIVSLPASKNIVGCRWIYKIKHKADGSVERYKARFVAKGYTQQEGLDYFETFSPVAKMVTVKTLLTIAVSK

Query:  DWHLFQLDVNNAFLHGDLFEEVYMDLSLGY--------KPNLTIQGEKM--------------------SKADYSLFV--KGQGTNFIALLVYVDDIII-
        +  + Q+DV  AFL+G L EE+YM L  G         K N  I G K                     S  D  +++  KG     I +L+YVDD++I 
Subjt:  DWHLFQLDVNNAFLHGDLFEEVYMDLSLGY--------KPNLTIQGEKM--------------------SKADYSLFV--KGQGTNFIALLVYVDDIII-

Query:  TG-------------ANFLLKDLGSLRFFLGFELAHTSTGILLSQRHYALCLLEEVGL
        TG               F + DL  ++ F+G  +      I LSQ  Y   +L +  +
Subjt:  TG-------------ANFLLKDLGSLRFFLGFELAHTSTGILLSQRHYALCLLEEVGL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.4e-7426.6Show/hide
Query:  SSNSSKNG----TNNRQRRGKPTCTHCNTPGHIATQNSENISQDTTQNSQMVNNISTAEAFIQCQNLLNQLQSQINASNQIATSHIAGTSYSFPLWIIDS
        S+N  ++G    + NR +     C +CN PGH   ++  N  +   + S   N+ +TA A +Q     N     +  + +    H++G       W++D+
Subjt:  SSNSSKNG----TNNRQRRGKPTCTHCNTPGHIATQNSENISQDTTQNSQMVNNISTAEAFIQCQNLLNQLQSQINASNQIATSHIAGTSYSFPLWIIDS

Query:  GASTHISCCKSYFTS-TQPCSTTISLPNKQVFEVKSAGTIKLSESI----ELKNVLYISEFSFNLIPVSALTKDLPVDVSFSTNNCVIQDKFTLKKIGSV
         AS H +  +  F         T+ + N    ++   G I +  ++     LK+V ++ +   NLI   AL +D          +     K+ L K GS+
Subjt:  GASTHISCCKSYFTS-TQPCSTTISLPNKQVFEVKSAGTIKLSESI----ELKNVLYISEFSFNLIPVSALTKDLPVDVSFSTNNCVIQDKFTLKKIGSV

Query:  ELLYGLYVFKL--RNASNPQSPICVVNSD-NAFLWHQRLGHPC---VDVLKSLQSMLQLKSFTSHTCTTCTLAKQRRLSFSANNHVSPNSFDLIHVDIWG
         +  G+    L   NA   Q  +     + +  LWH+R+GH     + +L     +   K  T   C  C   KQ R+SF  ++    N  DL++ D+ G
Subjt:  ELLYGLYVFKL--RNASNPQSPICVVNSD-NAFLWHQRLGHPC---VDVLKSLQSMLQLKSFTSHTCTTCTLAKQRRLSFSANNHVSPNSFDLIHVDIWG

Query:  PLSTATHVGHTYFLTIVDDATRFTW------------IIPNFFKLIETQHGKTIKHMRSDNAPELM---FTEFFKEKGV---------------------
        P+   +  G+ YF+T +DDA+R  W            +   F  L+E + G+ +K +RSDN  E     F E+    G+                     
Subjt:  PLSTATHVGHTYFLTIVDDATRFTW------------IIPNFFKLIETQHGKTIKHMRSDNAPELM---FTEFFKEKGV---------------------

Query:  ------------LRIPLTFWGECILTAVYLINKTPSKMLNWKSPFQLLNNNKPDYNNLKVFGSLCYASTLPHNRSKFQPRAIPVVFVGYPQGMKAYKLYN
                     ++P +FWGE + TA YLIN++PS  L ++ P ++  N +  Y++LKVFG   +A      R+K   ++IP +F+GY      Y+L++
Subjt:  ------------LRIPLTFWGECILTAVYLINKTPSKMLNWKSPFQLLNNNKPDYNNLKVFGSLCYASTLPHNRSKFQPRAIPVVFVGYPQGMKAYKLYN

Query:  IDQQKFFISRDVIFREEVFPFKDSTVSQPMEYPLPGFSLPKVFHESSIDHMNPSTNMPSPSFSPNTNNDSTSQTIVQSTNNPTNDSTSSTQQPKQNISET
          ++K   SRDV+FRE           +     +P F                   +PS S +P T+ +ST+  + +    P  +     +Q  + + E 
Subjt:  IDQQKFFISRDVIFREEVFPFKDSTVSQPMEYPLPGFSLPKVFHESSIDHMNPSTNMPSPSFSPNTNNDSTSQTIVQSTNNPTNDSTSSTQQPKQNISET

Query:  SNNPTVRRSTRIGKQPSYLQAYHCNLLTSQPHQPSSTKYPLNQYISYQNLSHSFKHSVLQASTLHEPNFYHEAVIH---DDWRKAMENELKVMETNQTWS
                     + P+  +  H  L  S+  +  S +YP  +Y+                S   EP    E + H   +   KAM+ E++ ++ N T+ 
Subjt:  SNNPTVRRSTRIGKQPSYLQAYHCNLLTSQPHQPSSTKYPLNQYISYQNLSHSFKHSVLQASTLHEPNFYHEAVIH---DDWRKAMENELKVMETNQTWS

Query:  IVSLPASKNIVGCRWIYKIKHKADGSVERYKARFVAKGYTQQEGLDYFETFSPVAKMVTVKTLLTIAVSKDWHLFQLDVNNAFLHGDLFEEVYMDLSLGY
        +V LP  K  + C+W++K+K   D  + RYKAR V KG+ Q++G+D+ E FSPV KM +++T+L++A S D  + QLDV  AFLHGDL EE+YM+   G+
Subjt:  IVSLPASKNIVGCRWIYKIKHKADGSVERYKARFVAKGYTQQEGLDYFETFSPVAKMVTVKTLLTIAVSKDWHLFQLDVNNAFLHGDLFEEVYMDLSLGY

Query:  ----------KPNLTIQGEKMSKADYSL----FVKGQ-----------------GTNFIALLVYVDDIIITG--------------ANFLLKDLGSLRFF
                  K N ++ G K +   + +    F+K Q                   NFI LL+YVDD++I G               +F +KDLG  +  
Subjt:  ----------KPNLTIQGEKMSKADYSL----FVKGQ-----------------GTNFIALLVYVDDIIITG--------------ANFLLKDLGSLRFF

Query:  LGFELA--HTSTGILLSQRHYALCLLEEVGLLGAKP
        LG ++    TS  + LSQ  Y   +LE   +  AKP
Subjt:  LGFELA--HTSTGILLSQRHYALCLLEEVGLLGAKP

P92520 Uncharacterized mitochondrial protein AtMg008203.1e-1940.54Show/hide
Query:  KHSVLQASTL-HEPNFYHEAVIHDDWRKAMENELKVMETNQTWSIVSLPASKNIVGCRWIYKIKHKADGSVERYKARFVAKGYTQQEGLDYFETFSPVAK
        K+S+   +T+  EP     A+    W +AM+ EL  +  N+TW +V  P ++NI+GC+W++K K  +DG+++R KAR VAKG+ Q+EG+ + ET+SPV +
Subjt:  KHSVLQASTL-HEPNFYHEAVIHDDWRKAMENELKVMETNQTWSIVSLPASKNIVGCRWIYKIKHKADGSVERYKARFVAKGYTQQEGLDYFETFSPVAK

Query:  MVTVKTLLTIA
          T++T+L +A
Subjt:  MVTVKTLLTIA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.6e-8428.72Show/hide
Query:  LMGLNENFSQARGQLLLMDPLPSTSRAFSIILQEEQQRSIGSLSSTAPTMVFVVS--SNSSKNGTNNRQRRGKPTCTHCNTPGHIATQNSENISQDTTQN
        L  L E +     Q+   D  P+ +     +L  E +    S ++  P     VS  + ++ N  NN  R  +    + N       Q+S N   +  Q+
Subjt:  LMGLNENFSQARGQLLLMDPLPSTSRAFSIILQEEQQRSIGSLSSTAPTMVFVVS--SNSSKNGTNNRQRRGKPTCTHCNTPGHIATQNSENISQDTTQN

Query:  SQMVNNIS-------TAEAFIQCQNLLNQLQSQINASN----QIATSHIAGTSYSFPLWIIDSGASTHISCCKSYFTSTQPCS--TTISLPNKQVFEVKS
           +           +A+   Q Q+ L+ + SQ   S     Q   +   G+ YS   W++DSGA+ HI+   +  +  QP +    + + +     +  
Subjt:  SQMVNNIS-------TAEAFIQCQNLLNQLQSQINASN----QIATSHIAGTSYSFPLWIIDSGASTHISCCKSYFTSTQPCS--TTISLPNKQVFEVKS

Query:  AGTIKL---SESIELKNVLYISEFSFNLIPVSALTKDLPVDVSFSTNNCVIQDKFTLKKIGSVELLYG-----LYVFKLRNASNPQSPICVVNSDNAF-L
         G+  L   S  + L N+LY+     NLI V  L     V V F   +  ++D  T      V LL G     LY + +  +S P S     +S      
Subjt:  AGTIKL---SESIELKNVLYISEFSFNLIPVSALTKDLPVDVSFSTNNCVIQDKFTLKKIGSVELLYG-----LYVFKLRNASNPQSPICVVNSDNAF-L

Query:  WHQRLGHPCVDVLKSLQSMLQLKSFT-SH---TCTTCTLAKQRRLSFSANNHVSPNSFDLIHVDIWGPLSTATHVGHTYFLTIVDDATRFTWIIP-----
        WH RLGHP   +L S+ S   L     SH   +C+ C + K  ++ FS +   S    + I+ D+W      +H  + Y++  VD  TR+TW+ P     
Subjt:  WHQRLGHPCVDVLKSLQSMLQLKSFT-SH---TCTTCTLAKQRRLSFSANNHVSPNSFDLIHVDIWGPLSTATHVGHTYFLTIVDDATRFTWIIP-----

Query:  -------NFFKLIETQHGKTIKHMRSDNAPELM-FTEFFKEKGV---------------------------------LRIPLTFWGECILTAVYLINKTP
                F  L+E +    I    SDN  E +   E+F + G+                                   IP T+W      AVYLIN+ P
Subjt:  -------NFFKLIETQHGKTIKHMRSDNAPELM-FTEFFKEKGV---------------------------------LRIPLTFWGECILTAVYLINKTP

Query:  SKMLNWKSPFQLLNNNKPDYNNLKVFGSLCYASTLPHNRSKFQPRAIPVVFVGYPQGMKAYKLYNIDQQKFFISRDVIFREEVFPFKD--STVSQPME--
        + +L  +SPFQ L    P+Y+ L+VFG  CY    P+N+ K   ++   VF+GY     AY   ++   + +ISR V F E  FPF +  +T+S   E  
Subjt:  SKMLNWKSPFQLLNNNKPDYNNLKVFGSLCYASTLPHNRSKFQPRAIPVVFVGYPQGMKAYKLYNIDQQKFFISRDVIFREEVFPFKD--STVSQPME--

Query:  -------------------YPLPGFSLP-----------KVFHESSIDHMN-----------------PSTNMPSPSFSPNTNNDSTSQTIVQSTNNPTN
                            P P  S P             F  S +   N                 P  N P P+  P      T  +   S NNPTN
Subjt:  -------------------YPLPGFSLP-----------KVFHESSIDHMN-----------------PSTNMPSPSFSPNTNNDSTSQTIVQSTNNPTN

Query:  D--------------STSSTQQPKQNISETSNNPTVRRSTRIGKQPSYLQAYHCNLLTSQPHQPSSTKYPLNQYISYQNLSHSFKHSVLQASTLHEPNFY
        +              S+SS+  P  + S +S +PT   S  I   P   Q  + N     P    S        I   N  +S   S+   S   EP   
Subjt:  D--------------STSSTQQPKQNISETSNNPTVRRSTRIGKQPSYLQAYHCNLLTSQPHQPSSTKYPLNQYISYQNLSHSFKHSVLQASTLHEPNFY

Query:  HEAVIHDDWRKAMENELKVMETNQTWSIVSLPASK-NIVGCRWIYKIKHKADGSVERYKARFVAKGYTQQEGLDYFETFSPVAKMVTVKTLLTIAVSKDW
         +A+  + WR AM +E+     N TW +V  P S   IVGCRWI+  K+ +DGS+ RYKAR VAKGY Q+ GLDY ETFSPV K  +++ +L +AV + W
Subjt:  HEAVIHDDWRKAMENELKVMETNQTWSIVSLPASK-NIVGCRWIYKIKHKADGSVERYKARFVAKGYTQQEGLDYFETFSPVAKMVTVKTLLTIAVSKDW

Query:  HLFQLDVNNAFLHGDLFEEVYMDLSLGY----KPN---------------------------LTIQGEKMSKADYSLFVKGQGTNFIALLVYVDDIIITG
         + QLDVNNAFL G L ++VYM    G+    +PN                           LTI G   S +D SLFV  +G + + +LVYVDDI+ITG
Subjt:  HLFQLDVNNAFLHGDLFEEVYMDLSLGY----KPN---------------------------LTIQGEKMSKADYSLFVKGQGTNFIALLVYVDDIIITG

Query:  --------------ANFLLKDLGSLRFFLGFELAHTSTGILLSQRHYALCLLEEVGLLGAKP
                        F +KD   L +FLG E     TG+ LSQR Y L LL    ++ AKP
Subjt:  --------------ANFLLKDLGSLRFFLGFELAHTSTGILLSQRHYALCLLEEVGLLGAKP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.2e-7827.41Show/hide
Query:  AKPRYHCYDGVREMTDFLQMEYL---MD-------FLMGLNENFSQARGQLLLMDPLPSTSRAFSIILQEEQQRSIGSLSSTAPTMVFVVSSNSSKNGTN
        A P Y     +R +T F Q+  L   MD        L  L +++     Q+   D  PS +     ++  E +    + +   P    VV+  ++    N
Subjt:  AKPRYHCYDGVREMTDFLQMEYL---MD-------FLMGLNENFSQARGQLLLMDPLPSTSRAFSIILQEEQQRSIGSLSSTAPTMVFVVSSNSSKNGTN

Query:  NRQRRGKPTCTHCNTPGHIATQNSENISQDTTQN----------SQMVNNISTAEAFIQCQNLLNQLQSQINASNQIATSHIAGTS-YSFPLWIIDSGAS
           R       + N   +    +S     D  Q           S   ++        Q Q+  NQ QS    +     +++A  S Y+   W++DSGA+
Subjt:  NRQRRGKPTCTHCNTPGHIATQNSENISQDTTQN----------SQMVNNISTAEAFIQCQNLLNQLQSQINASNQIATSHIAGTS-YSFPLWIIDSGAS

Query:  THISCCKSYFTSTQPCS--TTISLPNKQVFEVKSAGTIKL---SESIELKNVLYISEFSFNLIPVSALTKDLPVDVSFSTNNCVIQDKFTLKKIGSVELL
         HI+   +  +  QP +    + + +     +   G+  L   S S++L  VLY+     NLI V  L     V V F   +  ++D  T      V LL
Subjt:  THISCCKSYFTSTQPCS--TTISLPNKQVFEVKSAGTIKL---SESIELKNVLYISEFSFNLIPVSALTKDLPVDVSFSTNNCVIQDKFTLKKIGSVELL

Query:  YG-----LYVFKLRNASNPQSPICVVNSDNAFLWHQRLGHPCVDVLKSLQSMLQLKSFT-SH---TCTTCTLAKQRRLSFSANNHVSPNSFDLIHVDIW-
         G     LY + + ++          +      WH RLGHP + +L S+ S   L     SH   +C+ C + K  ++ FS +   S    + I+ D+W 
Subjt:  YG-----LYVFKLRNASNPQSPICVVNSDNAFLWHQRLGHPCVDVLKSLQSMLQLKSFT-SH---TCTTCTLAKQRRLSFSANNHVSPNSFDLIHVDIW-

Query:  GPLSTATHVGHTYFLTIVDDATRFTWIIP------------NFFKLIETQHGKTIKHMRSDNAPE-LMFTEFFKEKGV----------------------
         P+ +  +  + Y++  VD  TR+TW+ P             F  L+E +    I  + SDN  E ++  ++  + G+                      
Subjt:  GPLSTATHVGHTYFLTIVDDATRFTWIIP------------NFFKLIETQHGKTIKHMRSDNAPE-LMFTEFFKEKGV----------------------

Query:  -----------LRIPLTFWGECILTAVYLINKTPSKMLNWKSPFQLLNNNKPDYNNLKVFGSLCYASTLPHNRSKFQPRAIPVVFVGYPQGMKAYKLYNI
                     +P T+W      AVYLIN+ P+ +L  +SPFQ L    P+Y  LKVFG  CY    P+NR K + ++    F+GY     AY   +I
Subjt:  -----------LRIPLTFWGECILTAVYLINKTPSKMLNWKSPFQLLNNNKPDYNNLKVFGSLCYASTLPHNRSKFQPRAIPVVFVGYPQGMKAYKLYNI

Query:  DQQKFFISRDVIFREEVFPF--------------KDSTVSQPMEYPLP-----------------------------------GFSLP--KVFHESSIDH
           + + SR V F E  FPF               DS  + P    LP                                     +LP   +   SS + 
Subjt:  DQQKFFISRDVIFREEVFPF--------------KDSTVSQPMEYPLP-----------------------------------GFSLP--KVFHESSIDH

Query:  MNPSTNMPSPSFSPNTNNDSTSQTIVQSTNNPTNDSTSSTQQ----------------PKQNISETSNNPTVRRSTRIGKQPSYLQAYHCNLLTSQ-PHQ
          PS N P P+  P+   +S S + + +  NP + S +S  Q                P  +ISE  N+P+   ST     P  L A     + +Q P  
Subjt:  MNPSTNMPSPSFSPNTNNDSTSQTIVQSTNNPTNDSTSSTQQ----------------PKQNISETSNNPTVRRSTRIGKQPSYLQAYHCNLLTSQ-PHQ

Query:  PSSTKYPLNQYISYQNLSHSFKHSVLQASTLHEPNFYHEAVIHDDWRKAMENELKVMETNQTWSIV-SLPASKNIVGCRWIYKIKHKADGSVERYKARFV
          S        I   N  +S+  S+   S   EP    +A+  D WR+AM +E+     N TW +V   P S  IVGCRWI+  K  +DGS+ RYKAR V
Subjt:  PSSTKYPLNQYISYQNLSHSFKHSVLQASTLHEPNFYHEAVIHDDWRKAMENELKVMETNQTWSIV-SLPASKNIVGCRWIYKIKHKADGSVERYKARFV

Query:  AKGYTQQEGLDYFETFSPVAKMVTVKTLLTIAVSKDWHLFQLDVNNAFLHGDLFEEVYMDLSLGY------------------------------KPNLT
        AKGY Q+ GLDY ETFSPV K  +++ +L +AV + W + QLDVNNAFL G L +EVYM    G+                              +  L 
Subjt:  AKGYTQQEGLDYFETFSPVAKMVTVKTLLTIAVSKDWHLFQLDVNNAFLHGDLFEEVYMDLSLGY------------------------------KPNLT

Query:  IQGEKMSKADYSLFVKGQGTNFIALLVYVDDIIITG--------------ANFLLKDLGSLRFFLGFELAHTSTGILLSQRHYALCLLEEVGLLGAKP
          G   S +D SLFV  +G + I +LVYVDDI+ITG                F +K+   L +FLG E      G+ LSQR Y L LL    +L AKP
Subjt:  IQGEKMSKADYSLFVKGQGTNFIALLVYVDDIIITG--------------ANFLLKDLGSLRFFLGFELAHTSTGILLSQRHYALCLLEEVGLLGAKP

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 88.8e-6239.94Show/hide
Query:  STNNPTNDSTSSTQQPKQNISETSNNPTVRRSTRIGKQPSYLQAYHCNLLTSQPHQPSSTKYPLNQYISYQNLSHSFKHSVLQASTLHEPNFYHEAVIHD
        S  + +  S+S    P  NI      P+V  S R  ++P+YLQ Y+C+ +       S T + ++Q++SY+ +S  +   ++  +   EP+ Y+EA    
Subjt:  STNNPTNDSTSSTQQPKQNISETSNNPTVRRSTRIGKQPSYLQAYHCNLLTSQPHQPSSTKYPLNQYISYQNLSHSFKHSVLQASTLHEPNFYHEAVIHD

Query:  DWRKAMENELKVMETNQTWSIVSLPASKNIVGCRWIYKIKHKADGSVERYKARFVAKGYTQQEGLDYFETFSPVAKMVTVKTLLTIAVSKDWHLFQLDVN
         W  AM++E+  MET  TW I +LP +K  +GC+W+YKIK+ +DG++ERYKAR VAKGYTQQEG+D+ ETFSPV K+ +VK +L I+   ++ L QLD++
Subjt:  DWRKAMENELKVMETNQTWSIVSLPASKNIVGCRWIYKIKHKADGSVERYKARFVAKGYTQQEGLDYFETFSPVAKMVTVKTLLTIAVSKDWHLFQLDVN

Query:  NAFLHGDLFEEVYMDLSLGY--------KPNL------TIQGEK--------------------MSKADYSLFVKGQGTNFIALLVYVDDIIITGAN---
        NAFL+GDL EE+YM L  GY         PN       +I G K                     S +D++ F+K   T F+ +LVYVDDIII   N   
Subjt:  NAFLHGDLFEEVYMDLSLGY--------KPNL------TIQGEK--------------------MSKADYSLFVKGQGTNFIALLVYVDDIIITGAN---

Query:  -----------FLLKDLGSLRFFLGFELAHTSTGILLSQRHYALCLLEEVGLLGAKPA
                   F L+DLG L++FLG E+A ++ GI + QR YAL LL+E GLLG KP+
Subjt:  -----------FLLKDLGSLRFFLGFELAHTSTGILLSQRHYALCLLEEVGLLGAKPA

ATMG00810.1 DNA/RNA polymerases superfamily protein1.4e-0637.84Show/hide
Query:  LLVYVDDIIITGAN--------------FLLKDLGSLRFFLGFELAHTSTGILLSQRHYALCLLEEVGLLGAKP
        LL+YVDDI++TG++              F +KDLG + +FLG ++    +G+ LSQ  YA  +L   G+L  KP
Subjt:  LLVYVDDIIITGAN--------------FLLKDLGSLRFFLGFELAHTSTGILLSQRHYALCLLEEVGLLGAKP

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.2e-2040.54Show/hide
Query:  KHSVLQASTL-HEPNFYHEAVIHDDWRKAMENELKVMETNQTWSIVSLPASKNIVGCRWIYKIKHKADGSVERYKARFVAKGYTQQEGLDYFETFSPVAK
        K+S+   +T+  EP     A+    W +AM+ EL  +  N+TW +V  P ++NI+GC+W++K K  +DG+++R KAR VAKG+ Q+EG+ + ET+SPV +
Subjt:  KHSVLQASTL-HEPNFYHEAVIHDDWRKAMENELKVMETNQTWSIVSLPASKNIVGCRWIYKIKHKADGSVERYKARFVAKGYTQQEGLDYFETFSPVAK

Query:  MVTVKTLLTIA
          T++T+L +A
Subjt:  MVTVKTLLTIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCATTGGTCTCTCAGTGAAGAACAAGATTGGATTTGTTGACGGAACTATTGCCAAACCAACCGTTGATCTTCTTCCAACTTGGATCAGAAATAATAACATCTTGAA
AACGATCCCTTGCAACCTTGGAGCAAAACCAAGATACCATTGTTATGATGGTGTACGAGAAATGACAGATTTCCTTCAAATGGAATATCTCATGGACTTTCTTATGGGAT
TAAATGAGAACTTCTCTCAAGCACGGGGTCAACTTCTTCTCATGGATCCTCTACCATCAACTAGTCGAGCCTTTTCTATTATTCTCCAAGAAGAACAACAAAGGTCAATT
GGATCTCTTTCTTCTACAGCACCAACAATGGTCTTTGTGGTATCCTCTAACTCATCCAAAAATGGAACCAACAATCGACAAAGGAGAGGAAAACCCACGTGCACCCACTG
CAACACTCCTGGACACATAGCTACTCAAAACAGTGAGAATATCTCTCAAGACACCACACAGAACAGTCAAATGGTGAATAATATCAGCACTGCAGAGGCATTCATCCAAT
GTCAAAACCTTCTCAACCAGCTTCAGTCCCAAATCAATGCCTCCAACCAAATAGCTACCTCACATATAGCAGGTACTTCTTATTCATTCCCTTTATGGATAATTGATTCT
GGAGCATCCACCCATATTTCTTGTTGCAAGTCCTATTTTACATCTACTCAACCATGCTCAACAACCATTAGTTTACCAAATAAACAAGTTTTTGAAGTTAAAAGTGCTGG
CACCATCAAACTATCTGAATCTATAGAGCTGAAGAATGTCTTATATATTTCTGAATTTTCATTCAACCTTATACCAGTCAGTGCCTTAACAAAAGATCTTCCTGTGGATG
TTAGTTTCTCTACTAATAATTGTGTAATTCAGGACAAGTTCACTTTGAAGAAGATTGGCAGTGTTGAACTTCTATATGGTCTATATGTCTTCAAATTGAGAAATGCTTCC
AATCCGCAATCTCCCATATGTGTTGTAAACAGTGATAATGCCTTTTTATGGCATCAAAGGCTTGGGCACCCCTGTGTTGATGTTTTAAAGTCTTTACAAAGCATGTTACA
ATTGAAATCTTTTACTTCTCATACTTGTACTACTTGTACATTAGCAAAGCAAAGAAGACTAAGCTTTTCTGCAAATAATCATGTATCTCCAAATTCTTTTGACCTTATAC
ATGTTGACATTTGGGGACCTTTATCTACTGCTACACATGTTGGACATACATACTTCCTCACCATTGTCGATGATGCCACGCGTTTTACATGGATTATACCTAATTTCTTT
AAGCTTATCGAAACCCAACATGGTAAAACGATTAAACATATGCGTTCTGACAATGCTCCTGAACTTATGTTCACTGAATTCTTTAAAGAAAAAGGAGTACTGCGTATTCC
TTTAACCTTTTGGGGAGAATGCATACTGACTGCTGTTTACTTGATCAACAAAACTCCATCAAAAATGTTGAACTGGAAATCTCCATTCCAACTGCTTAACAACAACAAAC
CTGATTACAATAACCTAAAGGTGTTCGGGTCCTTATGTTATGCTTCAACTCTGCCACACAATCGCTCTAAGTTTCAACCAAGAGCAATACCAGTTGTCTTTGTTGGCTAT
CCACAAGGCATGAAAGCATACAAACTATACAACATTGATCAACAAAAATTCTTTATCTCAAGGGATGTAATATTTCGAGAAGAAGTCTTTCCTTTCAAAGATTCGACCGT
CTCACAGCCCATGGAATACCCTTTACCAGGTTTTTCTTTACCTAAAGTCTTTCATGAATCCTCAATCGATCACATGAATCCCTCAACCAATATGCCCTCACCCTCCTTTT
CACCGAATACAAACAATGACAGCACCTCACAGACTATAGTCCAATCCACAAATAACCCAACCAATGATAGTACCTCATCAACCCAACAGCCTAAACAAAATATTTCTGAG
ACTTCTAACAATCCTACAGTTAGAAGATCCACTCGAATCGGAAAACAACCTTCTTATCTCCAAGCTTATCATTGCAATCTTCTTACATCACAACCTCACCAACCAAGCTC
CACAAAATACCCTTTAAACCAGTACATATCATACCAAAATCTATCTCACTCCTTTAAACATTCCGTACTCCAAGCCTCTACCTTACATGAACCAAACTTCTACCATGAGG
CTGTCATACATGATGATTGGAGAAAGGCCATGGAAAATGAACTGAAAGTCATGGAGACGAATCAAACATGGAGTATAGTTTCTCTACCAGCCAGCAAGAATATTGTAGGA
TGTCGATGGATATATAAAATTAAACACAAAGCTGATGGATCTGTTGAAAGGTATAAGGCAAGGTTCGTTGCAAAAGGATACACACAGCAAGAAGGTCTTGATTATTTTGA
AACCTTCTCACCGGTGGCCAAAATGGTGACTGTCAAAACCTTGCTAACTATAGCAGTATCGAAAGATTGGCATCTATTTCAATTAGATGTGAACAATGCATTTCTTCATG
GAGACTTATTTGAAGAAGTATACATGGACTTGTCATTGGGTTATAAACCGAACCTCACCATTCAGGGGGAGAAAATGTCCAAGGCTGACTATTCTCTTTTTGTAAAAGGC
CAAGGCACCAACTTTATAGCTCTACTTGTCTATGTTGATGATATTATCATCACTGGAGCAAATTTTTTGCTTAAAGACTTGGGATCTCTCAGATTCTTCCTTGGTTTTGA
ATTAGCTCATACATCAACTGGTATCCTATTGTCACAAAGACACTATGCACTATGCTTGCTAGAAGAAGTTGGTTTGCTTGGTGCTAAACCAGCATAA
mRNA sequenceShow/hide mRNA sequence
ATGACCATTGGTCTCTCAGTGAAGAACAAGATTGGATTTGTTGACGGAACTATTGCCAAACCAACCGTTGATCTTCTTCCAACTTGGATCAGAAATAATAACATCTTGAA
AACGATCCCTTGCAACCTTGGAGCAAAACCAAGATACCATTGTTATGATGGTGTACGAGAAATGACAGATTTCCTTCAAATGGAATATCTCATGGACTTTCTTATGGGAT
TAAATGAGAACTTCTCTCAAGCACGGGGTCAACTTCTTCTCATGGATCCTCTACCATCAACTAGTCGAGCCTTTTCTATTATTCTCCAAGAAGAACAACAAAGGTCAATT
GGATCTCTTTCTTCTACAGCACCAACAATGGTCTTTGTGGTATCCTCTAACTCATCCAAAAATGGAACCAACAATCGACAAAGGAGAGGAAAACCCACGTGCACCCACTG
CAACACTCCTGGACACATAGCTACTCAAAACAGTGAGAATATCTCTCAAGACACCACACAGAACAGTCAAATGGTGAATAATATCAGCACTGCAGAGGCATTCATCCAAT
GTCAAAACCTTCTCAACCAGCTTCAGTCCCAAATCAATGCCTCCAACCAAATAGCTACCTCACATATAGCAGGTACTTCTTATTCATTCCCTTTATGGATAATTGATTCT
GGAGCATCCACCCATATTTCTTGTTGCAAGTCCTATTTTACATCTACTCAACCATGCTCAACAACCATTAGTTTACCAAATAAACAAGTTTTTGAAGTTAAAAGTGCTGG
CACCATCAAACTATCTGAATCTATAGAGCTGAAGAATGTCTTATATATTTCTGAATTTTCATTCAACCTTATACCAGTCAGTGCCTTAACAAAAGATCTTCCTGTGGATG
TTAGTTTCTCTACTAATAATTGTGTAATTCAGGACAAGTTCACTTTGAAGAAGATTGGCAGTGTTGAACTTCTATATGGTCTATATGTCTTCAAATTGAGAAATGCTTCC
AATCCGCAATCTCCCATATGTGTTGTAAACAGTGATAATGCCTTTTTATGGCATCAAAGGCTTGGGCACCCCTGTGTTGATGTTTTAAAGTCTTTACAAAGCATGTTACA
ATTGAAATCTTTTACTTCTCATACTTGTACTACTTGTACATTAGCAAAGCAAAGAAGACTAAGCTTTTCTGCAAATAATCATGTATCTCCAAATTCTTTTGACCTTATAC
ATGTTGACATTTGGGGACCTTTATCTACTGCTACACATGTTGGACATACATACTTCCTCACCATTGTCGATGATGCCACGCGTTTTACATGGATTATACCTAATTTCTTT
AAGCTTATCGAAACCCAACATGGTAAAACGATTAAACATATGCGTTCTGACAATGCTCCTGAACTTATGTTCACTGAATTCTTTAAAGAAAAAGGAGTACTGCGTATTCC
TTTAACCTTTTGGGGAGAATGCATACTGACTGCTGTTTACTTGATCAACAAAACTCCATCAAAAATGTTGAACTGGAAATCTCCATTCCAACTGCTTAACAACAACAAAC
CTGATTACAATAACCTAAAGGTGTTCGGGTCCTTATGTTATGCTTCAACTCTGCCACACAATCGCTCTAAGTTTCAACCAAGAGCAATACCAGTTGTCTTTGTTGGCTAT
CCACAAGGCATGAAAGCATACAAACTATACAACATTGATCAACAAAAATTCTTTATCTCAAGGGATGTAATATTTCGAGAAGAAGTCTTTCCTTTCAAAGATTCGACCGT
CTCACAGCCCATGGAATACCCTTTACCAGGTTTTTCTTTACCTAAAGTCTTTCATGAATCCTCAATCGATCACATGAATCCCTCAACCAATATGCCCTCACCCTCCTTTT
CACCGAATACAAACAATGACAGCACCTCACAGACTATAGTCCAATCCACAAATAACCCAACCAATGATAGTACCTCATCAACCCAACAGCCTAAACAAAATATTTCTGAG
ACTTCTAACAATCCTACAGTTAGAAGATCCACTCGAATCGGAAAACAACCTTCTTATCTCCAAGCTTATCATTGCAATCTTCTTACATCACAACCTCACCAACCAAGCTC
CACAAAATACCCTTTAAACCAGTACATATCATACCAAAATCTATCTCACTCCTTTAAACATTCCGTACTCCAAGCCTCTACCTTACATGAACCAAACTTCTACCATGAGG
CTGTCATACATGATGATTGGAGAAAGGCCATGGAAAATGAACTGAAAGTCATGGAGACGAATCAAACATGGAGTATAGTTTCTCTACCAGCCAGCAAGAATATTGTAGGA
TGTCGATGGATATATAAAATTAAACACAAAGCTGATGGATCTGTTGAAAGGTATAAGGCAAGGTTCGTTGCAAAAGGATACACACAGCAAGAAGGTCTTGATTATTTTGA
AACCTTCTCACCGGTGGCCAAAATGGTGACTGTCAAAACCTTGCTAACTATAGCAGTATCGAAAGATTGGCATCTATTTCAATTAGATGTGAACAATGCATTTCTTCATG
GAGACTTATTTGAAGAAGTATACATGGACTTGTCATTGGGTTATAAACCGAACCTCACCATTCAGGGGGAGAAAATGTCCAAGGCTGACTATTCTCTTTTTGTAAAAGGC
CAAGGCACCAACTTTATAGCTCTACTTGTCTATGTTGATGATATTATCATCACTGGAGCAAATTTTTTGCTTAAAGACTTGGGATCTCTCAGATTCTTCCTTGGTTTTGA
ATTAGCTCATACATCAACTGGTATCCTATTGTCACAAAGACACTATGCACTATGCTTGCTAGAAGAAGTTGGTTTGCTTGGTGCTAAACCAGCATAA
Protein sequenceShow/hide protein sequence
MTIGLSVKNKIGFVDGTIAKPTVDLLPTWIRNNNILKTIPCNLGAKPRYHCYDGVREMTDFLQMEYLMDFLMGLNENFSQARGQLLLMDPLPSTSRAFSIILQEEQQRSI
GSLSSTAPTMVFVVSSNSSKNGTNNRQRRGKPTCTHCNTPGHIATQNSENISQDTTQNSQMVNNISTAEAFIQCQNLLNQLQSQINASNQIATSHIAGTSYSFPLWIIDS
GASTHISCCKSYFTSTQPCSTTISLPNKQVFEVKSAGTIKLSESIELKNVLYISEFSFNLIPVSALTKDLPVDVSFSTNNCVIQDKFTLKKIGSVELLYGLYVFKLRNAS
NPQSPICVVNSDNAFLWHQRLGHPCVDVLKSLQSMLQLKSFTSHTCTTCTLAKQRRLSFSANNHVSPNSFDLIHVDIWGPLSTATHVGHTYFLTIVDDATRFTWIIPNFF
KLIETQHGKTIKHMRSDNAPELMFTEFFKEKGVLRIPLTFWGECILTAVYLINKTPSKMLNWKSPFQLLNNNKPDYNNLKVFGSLCYASTLPHNRSKFQPRAIPVVFVGY
PQGMKAYKLYNIDQQKFFISRDVIFREEVFPFKDSTVSQPMEYPLPGFSLPKVFHESSIDHMNPSTNMPSPSFSPNTNNDSTSQTIVQSTNNPTNDSTSSTQQPKQNISE
TSNNPTVRRSTRIGKQPSYLQAYHCNLLTSQPHQPSSTKYPLNQYISYQNLSHSFKHSVLQASTLHEPNFYHEAVIHDDWRKAMENELKVMETNQTWSIVSLPASKNIVG
CRWIYKIKHKADGSVERYKARFVAKGYTQQEGLDYFETFSPVAKMVTVKTLLTIAVSKDWHLFQLDVNNAFLHGDLFEEVYMDLSLGYKPNLTIQGEKMSKADYSLFVKG
QGTNFIALLVYVDDIIITGANFLLKDLGSLRFFLGFELAHTSTGILLSQRHYALCLLEEVGLLGAKPA