; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G14380 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G14380
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationChr7:12896094..12897274
RNA-Seq ExpressionCSPI07G14380
SyntenyCSPI07G14380
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus]2.7e-11157.87Show/hide
Query:  MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESD--------------------------------KKSTYDHSVFYRRSDNGIVLLVVYVD
        MAAT+KW LHQLDI NAFLHG+LQEEVYMEQPPGFVAQGESD                                KKST DHSVFYRRS+ GIVLLVVYVD
Subjt:  MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESD--------------------------------KKSTYDHSVFYRRSDNGIVLLVVYVD

Query:  DIVITGNYASGILSLKNFLH--------------------------------------------GAKPSGTPMMPNQQLVK-GDLCKDPERYRRLVGKLN
        DIVITGN A GI SLK FL                                             GAKPSGTPMMPNQQLVK G+LCKDPERYRRLVGKLN
Subjt:  DIVITGNYASGILSLKNFLH--------------------------------------------GAKPSGTPMMPNQQLVK-GDLCKDPERYRRLVGKLN

Query:  YLIVTRLDIAYS----------------AAVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSWKT----------
        YL VTR DIAYS                AAVEQILCYLKAAPG GILYKDHGHTR+ECFSDADW GSREDRRSTSGYCVFVGGNLVSWK+          
Subjt:  YLIVTRLDIAYS----------------AAVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSWKT----------

Query:  -----------------------------------------ALHIASNPVFREQTKHIKVDCHFICEKIQDGLVSIGYVMTGEQLGDILTKALNGGMISY
                                                 ALHIASNPVF E+TKHI+VDCHFI EKIQDGLVS GYV TGEQLGDILTKALNG  ISY
Subjt:  -----------------------------------------ALHIASNPVFREQTKHIKVDCHFICEKIQDGLVSIGYVMTGEQLGDILTKALNGGMISY

Query:  LRNKLDVIDIFAP
        L NKL +IDIFAP
Subjt:  LRNKLDVIDIFAP

XP_031744754.1 uncharacterized protein LOC101212255 isoform X2 [Cucumis sativus]2.7e-11157.87Show/hide
Query:  MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESD--------------------------------KKSTYDHSVFYRRSDNGIVLLVVYVD
        MAAT+KW LHQLDI NAFLHG+LQEEVYMEQPPGFVAQGESD                                KKST DHSVFYRRS+ GIVLLVVYVD
Subjt:  MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESD--------------------------------KKSTYDHSVFYRRSDNGIVLLVVYVD

Query:  DIVITGNYASGILSLKNFLH--------------------------------------------GAKPSGTPMMPNQQLVK-GDLCKDPERYRRLVGKLN
        DIVITGN A GI SLK FL                                             GAKPSGTPMMPNQQLVK G+LCKDPERYRRLVGKLN
Subjt:  DIVITGNYASGILSLKNFLH--------------------------------------------GAKPSGTPMMPNQQLVK-GDLCKDPERYRRLVGKLN

Query:  YLIVTRLDIAYS----------------AAVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSWKT----------
        YL VTR DIAYS                AAVEQILCYLKAAPG GILYKDHGHTR+ECFSDADW GSREDRRSTSGYCVFVGGNLVSWK+          
Subjt:  YLIVTRLDIAYS----------------AAVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSWKT----------

Query:  -----------------------------------------ALHIASNPVFREQTKHIKVDCHFICEKIQDGLVSIGYVMTGEQLGDILTKALNGGMISY
                                                 ALHIASNPVF E+TKHI+VDCHFI EKIQDGLVS GYV TGEQLGDILTKALNG  ISY
Subjt:  -----------------------------------------ALHIASNPVFREQTKHIKVDCHFICEKIQDGLVSIGYVMTGEQLGDILTKALNGGMISY

Query:  LRNKLDVIDIFAP
        L NKL +IDIFAP
Subjt:  LRNKLDVIDIFAP

XP_031744755.1 uncharacterized protein LOC101212255 isoform X3 [Cucumis sativus]2.7e-11157.87Show/hide
Query:  MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESD--------------------------------KKSTYDHSVFYRRSDNGIVLLVVYVD
        MAAT+KW LHQLDI NAFLHG+LQEEVYMEQPPGFVAQGESD                                KKST DHSVFYRRS+ GIVLLVVYVD
Subjt:  MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESD--------------------------------KKSTYDHSVFYRRSDNGIVLLVVYVD

Query:  DIVITGNYASGILSLKNFLH--------------------------------------------GAKPSGTPMMPNQQLVK-GDLCKDPERYRRLVGKLN
        DIVITGN A GI SLK FL                                             GAKPSGTPMMPNQQLVK G+LCKDPERYRRLVGKLN
Subjt:  DIVITGNYASGILSLKNFLH--------------------------------------------GAKPSGTPMMPNQQLVK-GDLCKDPERYRRLVGKLN

Query:  YLIVTRLDIAYS----------------AAVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSWKT----------
        YL VTR DIAYS                AAVEQILCYLKAAPG GILYKDHGHTR+ECFSDADW GSREDRRSTSGYCVFVGGNLVSWK+          
Subjt:  YLIVTRLDIAYS----------------AAVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSWKT----------

Query:  -----------------------------------------ALHIASNPVFREQTKHIKVDCHFICEKIQDGLVSIGYVMTGEQLGDILTKALNGGMISY
                                                 ALHIASNPVF E+TKHI+VDCHFI EKIQDGLVS GYV TGEQLGDILTKALNG  ISY
Subjt:  -----------------------------------------ALHIASNPVFREQTKHIKVDCHFICEKIQDGLVSIGYVMTGEQLGDILTKALNGGMISY

Query:  LRNKLDVIDIFAP
        L NKL +IDIFAP
Subjt:  LRNKLDVIDIFAP

XP_031744756.1 uncharacterized protein LOC101212255 isoform X4 [Cucumis sativus]2.7e-11157.87Show/hide
Query:  MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESD--------------------------------KKSTYDHSVFYRRSDNGIVLLVVYVD
        MAAT+KW LHQLDI NAFLHG+LQEEVYMEQPPGFVAQGESD                                KKST DHSVFYRRS+ GIVLLVVYVD
Subjt:  MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESD--------------------------------KKSTYDHSVFYRRSDNGIVLLVVYVD

Query:  DIVITGNYASGILSLKNFLH--------------------------------------------GAKPSGTPMMPNQQLVK-GDLCKDPERYRRLVGKLN
        DIVITGN A GI SLK FL                                             GAKPSGTPMMPNQQLVK G+LCKDPERYRRLVGKLN
Subjt:  DIVITGNYASGILSLKNFLH--------------------------------------------GAKPSGTPMMPNQQLVK-GDLCKDPERYRRLVGKLN

Query:  YLIVTRLDIAYS----------------AAVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSWKT----------
        YL VTR DIAYS                AAVEQILCYLKAAPG GILYKDHGHTR+ECFSDADW GSREDRRSTSGYCVFVGGNLVSWK+          
Subjt:  YLIVTRLDIAYS----------------AAVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSWKT----------

Query:  -----------------------------------------ALHIASNPVFREQTKHIKVDCHFICEKIQDGLVSIGYVMTGEQLGDILTKALNGGMISY
                                                 ALHIASNPVF E+TKHI+VDCHFI EKIQDGLVS GYV TGEQLGDILTKALNG  ISY
Subjt:  -----------------------------------------ALHIASNPVFREQTKHIKVDCHFICEKIQDGLVSIGYVMTGEQLGDILTKALNGGMISY

Query:  LRNKLDVIDIFAP
        L NKL +IDIFAP
Subjt:  LRNKLDVIDIFAP

XP_031744758.1 uncharacterized protein LOC101212255 isoform X5 [Cucumis sativus]2.7e-11157.87Show/hide
Query:  MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESD--------------------------------KKSTYDHSVFYRRSDNGIVLLVVYVD
        MAAT+KW LHQLDI NAFLHG+LQEEVYMEQPPGFVAQGESD                                KKST DHSVFYRRS+ GIVLLVVYVD
Subjt:  MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESD--------------------------------KKSTYDHSVFYRRSDNGIVLLVVYVD

Query:  DIVITGNYASGILSLKNFLH--------------------------------------------GAKPSGTPMMPNQQLVK-GDLCKDPERYRRLVGKLN
        DIVITGN A GI SLK FL                                             GAKPSGTPMMPNQQLVK G+LCKDPERYRRLVGKLN
Subjt:  DIVITGNYASGILSLKNFLH--------------------------------------------GAKPSGTPMMPNQQLVK-GDLCKDPERYRRLVGKLN

Query:  YLIVTRLDIAYS----------------AAVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSWKT----------
        YL VTR DIAYS                AAVEQILCYLKAAPG GILYKDHGHTR+ECFSDADW GSREDRRSTSGYCVFVGGNLVSWK+          
Subjt:  YLIVTRLDIAYS----------------AAVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSWKT----------

Query:  -----------------------------------------ALHIASNPVFREQTKHIKVDCHFICEKIQDGLVSIGYVMTGEQLGDILTKALNGGMISY
                                                 ALHIASNPVF E+TKHI+VDCHFI EKIQDGLVS GYV TGEQLGDILTKALNG  ISY
Subjt:  -----------------------------------------ALHIASNPVFREQTKHIKVDCHFICEKIQDGLVSIGYVMTGEQLGDILTKALNGGMISY

Query:  LRNKLDVIDIFAP
        L NKL +IDIFAP
Subjt:  LRNKLDVIDIFAP

TrEMBL top hitse value%identityAlignment
A0A438CR71 Retrovirus-related Pol polyprotein from transposon RE12.6e-9155.56Show/hide
Query:  MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESD------KKSTYDHSVFYRRSDNGIVLLVVYVDDIVITGNYASGILSLKNFLHG-----
        +AA+ +W++HQLDI NAFLHG+L+EEVY+EQPPGFVAQG+ +       KS  DHSVFY++S  GI+LLVVYVDDIVITGN  +GI  LK F+H      
Subjt:  MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESD------KKSTYDHSVFYRRSDNGIVLLVVYVDDIVITGNYASGILSLKNFLHG-----

Query:  ------------AKPSGTPMMPNQQLVKGDLCKD---------PERYRRLVGKLNYLIVTRLDIAYS----------------AAVEQILCYLKAAPGCG
                       S   M  +Q+    DL K+         PERYRR+VGKLNYL VTR DIAY+                AA+EQILCYLK APG G
Subjt:  ------------AKPSGTPMMPNQQLVKGDLCKD---------PERYRRLVGKLNYLIVTRLDIAYS----------------AAVEQILCYLKAAPGCG

Query:  ILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSWK----------------TALHIASNPVFREQTKHIKVDCHFICEKIQDGLVSIGYVM
        ILY   GHTRIECFSDADW GS+ DRRST+GYCVF GGNLV+WK                 ALHIA+NPV+ E+TKHI+VDCHFI EKI++ LVS GYV 
Subjt:  ILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSWK----------------TALHIASNPVFREQTKHIKVDCHFICEKIQDGLVSIGYVM

Query:  TGEQLGDILTKALNGGMISYLRNKLDVIDIFAP
        TGEQLGDI TKALNG  + Y  NKL +I+I+AP
Subjt:  TGEQLGDILTKALNGGMISYLRNKLDVIDIFAP

A0A438IJ87 Retrovirus-related Pol polyprotein from transposon RE16.9e-9251.47Show/hide
Query:  MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESDK--------------------------------KSTYDHSVFYRRSDNGIVLLVVYVD
        +AA+ +W++HQLDI NAFLHG+L+EEVY+EQPPGFVAQGE  K                                KS  DHSVFY++S  GI+LLVVYVD
Subjt:  MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESDK--------------------------------KSTYDHSVFYRRSDNGIVLLVVYVD

Query:  DIVITGNYASGILSLKNFLHG--------------------------------------------AKPSGTPMMPNQQLV--KGDLCKDPERYRRLVGKL
        DIVITGN  +GI  LK F+H                                             AKP  TPM+PN QL+   GD   +PERYRR+VGKL
Subjt:  DIVITGNYASGILSLKNFLHG--------------------------------------------AKPSGTPMMPNQQLV--KGDLCKDPERYRRLVGKL

Query:  NYLIVTRLDIAYS----------------AAVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSWK----------
        NYL VTR DIAY+                AA+EQILCYLK APG GILY   GHTRIECFSDADW GS+ DRRST+GYCVF GGNLV+WK          
Subjt:  NYLIVTRLDIAYS----------------AAVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSWK----------

Query:  --TALHIASNPVFREQTKHIKVDCHFICEKIQDGLVSIGYVMTGEQLGDILTKALNGGMISYLRNKLDVIDIFAP
           ALHIA+NPV+ E+TKHI+VDCHFI EKI++ LVS GYV TGEQLGDI TKALNG  + Y  NKL +I+I+AP
Subjt:  --TALHIASNPVFREQTKHIKVDCHFICEKIQDGLVSIGYVMTGEQLGDILTKALNGGMISYLRNKLDVIDIFAP

A0A5D3C204 Putative mitochondrial protein1.2e-10959.69Show/hide
Query:  MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESDK--------------------------------KSTYDHSVFYRRSDNGIVLLVVYVD
        MAATH W LHQLDI NAFLH +LQEEVYMEQPPGFVAQ ESDK                                 STYDHSVFYRRSDNGIVLLVVYVD
Subjt:  MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESDK--------------------------------KSTYDHSVFYRRSDNGIVLLVVYVD

Query:  DIVITGNYASGILSLKNFLH--------------------------------------------GAKPSGTPMMPNQQLVK-GDLCKDPERYRRLVGKLN
        D VITGN A GI SLK FL                                             GAKPSGT MMPNQQLVK G LCKDPERYRRLVGKLN
Subjt:  DIVITGNYASGILSLKNFLH--------------------------------------------GAKPSGTPMMPNQQLVK-GDLCKDPERYRRLVGKLN

Query:  YLIVTRLDIAYS----------------AAVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSWK-----------
        YL +T+ DIAYS                AAVEQILCYLKAAPG GILYKDHGHTR+ECFSDADW GSRED  S+SGYCVFVGGNLVSWK           
Subjt:  YLIVTRLDIAYS----------------AAVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSWK-----------

Query:  --------------TALHIASNPVFREQTKHIKVDCHFICEKIQDGLVSIGYVMTGEQLGDILTKALNGGMISYLRNKLDVIDIFAP
                      TALHIASNPVF E+TKHI+VDCHFI  KIQDGLVS GYV TGEQLGDILTK +NG  ISYL NKLD IDIFAP
Subjt:  --------------TALHIASNPVFREQTKHIKVDCHFICEKIQDGLVSIGYVMTGEQLGDILTKALNGGMISYLRNKLDVIDIFAP

A0A5D3CGG6 Putative Polyprotein3.2e-10576.38Show/hide
Query:  NAFLHGNLQEEVYMEQPPGFVAQGESDKKSTYDHSVFYRRSDNGIVLLVVYVDDIVITGNYASGILSLKNFLHGAKPSGTPMMPNQQLVKGD-LCKDPER
        NAFLHG+LQEEVYMEQPP FVAQGESD KSTYDHS FYRRSDNGIVLLVVYVDDIVITGN ASGI SLK FL GAKPSG+PMMPNQQLVK + LCKD ER
Subjt:  NAFLHGNLQEEVYMEQPPGFVAQGESDKKSTYDHSVFYRRSDNGIVLLVVYVDDIVITGNYASGILSLKNFLHGAKPSGTPMMPNQQLVKGD-LCKDPER

Query:  YRRLVGKLNYLIVTRLDIAYS----------------AAVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSWKTA
        YRRLVGKLNYL VTR DIAYS                AAVEQILCYLK APG  ILYK+HGHTR+ECFSD DW  S EDRRST GYCVFV         A
Subjt:  YRRLVGKLNYLIVTRLDIAYS----------------AAVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSWKTA

Query:  LHIASNPVFREQTKHIKVDCHFICEKIQDGLVSIGYVMTGEQLGDILTKALNGGMISYLRNKLDVIDIFAP
        LHIASNPVF E+TKHI+VDCHFI EKIQDGLVS GYV T EQLGDILTKA+NG  ISYL NKL +I+IFAP
Subjt:  LHIASNPVFREQTKHIKVDCHFICEKIQDGLVSIGYVMTGEQLGDILTKALNGGMISYLRNKLDVIDIFAP

A0A5D3CID2 Putative mitochondrial protein7.1e-10565.74Show/hide
Query:  MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQG---ESDKKSTYDHSVFYRRSDNGIVLLVVYVDDIVITGNYASGILSLKNFLHG---AKPSG
        MAATH W LHQLDI NAFLHG+LQE+VYMEQPPG  +Q       +KSTYDHSVFYRRSDNGIVLLVVYVDDIVIT N  SGI S K F+ G   AKPSG
Subjt:  MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQG---ESDKKSTYDHSVFYRRSDNGIVLLVVYVDDIVITGNYASGILSLKNFLHG---AKPSG

Query:  TPMMPNQQLVK-GDLCKDPERYRRLVGKLNYLIVTRLDIAYS----------------AAVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSRED
        TPMMPNQQLVK G LCKDPERYRRLVGKLNYL VTR DIAYS                A VEQILCYLKAAPGCGIL KDHGHTR+ECFSDADW GSRED
Subjt:  TPMMPNQQLVK-GDLCKDPERYRRLVGKLNYLIVTRLDIAYS----------------AAVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSRED

Query:  RRSTSGYCVFVGGNLVSWKT---------------------------------------------------ALHIASNPVFREQTKHIKVDCHFICEKIQ
        RRST GYCVFVGGNLVSWK+                                                   ALHIASNPVF E+TKHI+VDCHFI EKIQ
Subjt:  RRSTSGYCVFVGGNLVSWKT---------------------------------------------------ALHIASNPVFREQTKHIKVDCHFICEKIQ

Query:  DGLVSIGYVMTGEQLGDILTKALN
        DGLVS GYV TGEQLGDILTKA+N
Subjt:  DGLVSIGYVMTGEQLGDILTKALN

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.5e-1422.91Show/hide
Query:  LHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESDKK------------------------------STYDHSVFY--RRSDNGIVLLVVYVDDIVI-TGN
        +HQ+D+  AFL+G L+EE+YM  P G     ++  K                              S+ D  ++   + + N  + +++YVDD+VI TG+
Subjt:  LHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESDKK------------------------------STYDHSVFY--RRSDNGIVLLVVYVDDIVI-TGN

Query:  --------------------------------------------YASGILSLKNFLH-GAKPSGTPMMPNQQLVKGDL-CKDPERYRRLVGKLNYLIV-T
                                                    Y   ILS  N  +  A  +  P   N +L+  D  C  P   R L+G L Y+++ T
Subjt:  --------------------------------------------YASGILSLKNFLH-GAKPSGTPMMPNQQLVKGDL-CKDPERYRRLVGKLNYLIV-T

Query:  RLDIA--------YSAA--------VEQILCYLKAAPGCGILYKDH--GHTRIECFSDADWVGSREDRRSTSGYCV-FVGGNLVSWKT------------
        R D+         YS+         ++++L YLK      +++K +     +I  + D+DW GS  DR+ST+GY       NL+ W T            
Subjt:  RLDIA--------YSAA--------VEQILCYLKAAPGCGILYKDH--GHTRIECFSDADWVGSREDRRSTSGYCV-FVGGNLVSWKT------------

Query:  ---------------------------------------ALHIASNPVFREQTKHIKVDCHFICEKIQDGLVSIGYVMTGEQLGDILTKALNGGMISYLR
                                                + IA+NP   ++ KHI +  HF  E++Q+ ++ + Y+ T  QL DI TK L       LR
Subjt:  ---------------------------------------ALHIASNPVFREQTKHIKVDCHFICEKIQDGLVSIGYVMTGEQLGDILTKALNGGMISYLR

Query:  NKLDVI
        +KL ++
Subjt:  NKLDVI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.8e-1823.7Show/hide
Query:  MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESD---------------------------KKSTY-----DHSVFYRR-SDNGIVLLVVYV
        +AA+    + QLD+  AFLHG+L+EE+YMEQP GF   G+                             K  TY     D  V+++R S+N  ++L++YV
Subjt:  MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESD---------------------------KKSTY-----DHSVFYRR-SDNGIVLLVVYV

Query:  DDIVITGNYASGILSLKNFL----------------------------------------------HGAKPSGTP----------MMPNQQLVKGDLCKD
        DD++I G     I  LK  L                                                AKP  TP          M P     KG++ K 
Subjt:  DDIVITGNYASGILSLKNFL----------------------------------------------HGAKPSGTP----------MMPNQQLVKGDLCKD

Query:  PERYRRLVGKLNY-LIVTRLDIAYSA----------------AVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVS
        P  Y   VG L Y ++ TR DIA++                 AV+ IL YL+   G  + +       ++ ++DAD  G  ++R+S++GY     G  +S
Subjt:  PERYRRLVGKLNY-LIVTRLDIAYSA----------------AVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVS

Query:  W--------------------------------------------------KTALHIASNPVFREQTKHIKVDCHFICEKIQDGLVSIGYVMTGEQLGDI
        W                                                  ++A+ ++ N ++  +TKHI V  H+I E + D  + +  + T E   D+
Subjt:  W--------------------------------------------------KTALHIASNPVFREQTKHIKVDCHFICEKIQDGLVSIGYVMTGEQLGDI

Query:  LTKAL
        LTK +
Subjt:  LTKAL

P92519 Uncharacterized mitochondrial protein AtMg008101.0e-1231.39Show/hide
Query:  YASGILSLKNFLHGAKPSGTPM-MPNQQLVKGDLCKDPERYRRLVGKLNYLIVTRLDIAYSAAV----------------EQILCYLKAAPGCGILYKDH
        YA  IL+    L   KP  TP+ +     V      DP  +R +VG L YL +TR DI+Y+  +                +++L Y+K     G+    +
Subjt:  YASGILSLKNFLHGAKPSGTPM-MPNQQLVKGDLCKDPERYRRLVGKLNYLIVTRLDIAYSAAV----------------EQILCYLKAAPGCGILYKDH

Query:  GHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSW
            ++ F D+DW G    RRST+G+C F+G N++SW
Subjt:  GHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.5e-3527.38Show/hide
Query:  MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESD-----KKSTY---------------------------DHSVFYRRSDNGIVLLVVYVD
        +A    W + QLD+NNAFL G L ++VYM QPPGF+ +   +     +K+ Y                           D S+F  +    IV ++VYVD
Subjt:  MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESD-----KKSTY---------------------------DHSVFYRRSDNGIVLLVVYVD

Query:  DIVITGN---------------------------------------------YASGILSLKNFLHGAKPSGTPMMPNQQ--LVKGDLCKDPERYRRLVGK
        DI+ITGN                                             Y   +L+  N +  AKP  TPM P+ +  L  G    DP  YR +VG 
Subjt:  DIVITGN---------------------------------------------YASGILSLKNFLHGAKPSGTPMMPNQQ--LVKGDLCKDPERYRRLVGK

Query:  LNYLIVTRLDIAYSA----------------AVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSWKT--------
        L YL  TR DI+Y+                 A+++IL YL   P  GI  K      +  +SDADW G ++D  ST+GY V++G + +SW +        
Subjt:  LNYLIVTRLDIAYSA----------------AVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSWKT--------

Query:  -------------------------------------------ALHIASNPVFREQTKHIKVDCHFICEKIQDGLVSIGYVMTGEQLGDILTKALNGGMI
                                                   A ++ +NPVF  + KHI +D HFI  ++Q G + + +V T +QL D LTK L+    
Subjt:  -------------------------------------------ALHIASNPVFREQTKHIKVDCHFICEKIQDGLVSIGYVMTGEQLGDILTKALNGGMI

Query:  SYLRNKLDV
            +K+ V
Subjt:  SYLRNKLDV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE28.8e-3627.67Show/hide
Query:  MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESD-----KKSTY---------------------------DHSVFYRRSDNGIVLLVVYVD
        +A    W + QLD+NNAFL G L +EVYM QPPGFV +   D     +K+ Y                           D S+F  +    I+ ++VYVD
Subjt:  MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESD-----KKSTY---------------------------DHSVFYRRSDNGIVLLVVYVD

Query:  DIVITGN---------------------------------------------YASGILSLKNFLHGAKPSGTPM--MPNQQLVKGDLCKDPERYRRLVGK
        DI+ITGN                                             Y   +L+  N L  AKP  TPM   P   L  G    DP  YR +VG 
Subjt:  DIVITGN---------------------------------------------YASGILSLKNFLHGAKPSGTPM--MPNQQLVKGDLCKDPERYRRLVGK

Query:  LNYLIVTRLDIAYSA----------------AVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSWKT--------
        L YL  TR D++Y+                 A++++L YL   P  GI  K      +  +SDADW G  +D  ST+GY V++G + +SW +        
Subjt:  LNYLIVTRLDIAYSA----------------AVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSWKT--------

Query:  -------------------------------------------ALHIASNPVFREQTKHIKVDCHFICEKIQDGLVSIGYVMTGEQLGDILTKALNGGMI
                                                   A ++ +NPVF  + KHI +D HFI  ++Q G + + +V T +QL D LTK L+    
Subjt:  -------------------------------------------ALHIASNPVFREQTKHIKVDCHFICEKIQDGLVSIGYVMTGEQLGDILTKALNGGMI

Query:  SYLRNKLDVIDI
             K+ VI +
Subjt:  SYLRNKLDVIDI

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.8e-3428.29Show/hide
Query:  MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVA-QGES--------DKKSTY---------------------------DHSVFYRRSDNGIVLLV
        ++A + + LHQLDI+NAFL+G+L EE+YM+ PPG+ A QG+S         KKS Y                           DH+ F + +    + ++
Subjt:  MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVA-QGES--------DKKSTY---------------------------DHSVFYRRSDNGIVLLV

Query:  VYVDDIVITGNYASGILSLKNFLH--------------------------------------------GAKPSGTPMMPNQQLV--KGDLCKDPERYRRL
        VYVDDI+I  N  + +  LK+ L                                             G KPS  PM P+       G    D + YRRL
Subjt:  VYVDDIVITGNYASGILSLKNFLH--------------------------------------------GAKPSGTPMMPNQQLV--KGDLCKDPERYRRL

Query:  VGKLNYLIVTRLDI----------------AYSAAVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSWKT-----
        +G+L YL +TRLDI                A+  AV +IL Y+K   G G+ Y      +++ FSDA +   ++ RRST+GYC+F+G +L+SWK+     
Subjt:  VGKLNYLIVTRLDI----------------AYSAAVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSWKT-----

Query:  ----------------------------------------------ALHIASNPVFREQTKHIKVDCHFICEK-IQDGLVSIGYVMTGEQLG--DILTKA
                                                      A+HIA+N VF E+TKHI+ DCH + E+ +    +S  +    EQ G  + L+  
Subjt:  ----------------------------------------------ALHIASNPVFREQTKHIKVDCHFICEK-IQDGLVSIGYVMTGEQLG--DILTKA

Query:  LNG
        L G
Subjt:  LNG

ATMG00810.1 DNA/RNA polymerases superfamily protein7.4e-1431.39Show/hide
Query:  YASGILSLKNFLHGAKPSGTPM-MPNQQLVKGDLCKDPERYRRLVGKLNYLIVTRLDIAYSAAV----------------EQILCYLKAAPGCGILYKDH
        YA  IL+    L   KP  TP+ +     V      DP  +R +VG L YL +TR DI+Y+  +                +++L Y+K     G+    +
Subjt:  YASGILSLKNFLHGAKPSGTPM-MPNQQLVKGDLCKDPERYRRLVGKLNYLIVTRLDIAYSAAV----------------EQILCYLKAAPGCGILYKDH

Query:  GHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSW
            ++ F D+DW G    RRST+G+C F+G N++SW
Subjt:  GHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCTACCCATAAATGGTTATTGCATCAACTTGACATTAACAATGCTTTTCTGCATGGTAATCTTCAAGAGGAAGTTTATATGGAGCAACCACCTGGGTTTGTTGC
TCAGGGTGAGAGTGATAAAAAGAGTACATATGATCATTCAGTTTTCTATCGCCGATCTGATAACGGTATAGTTCTACTTGTTGTATATGTTGATGATATTGTTATTACTG
GAAATTATGCATCGGGTATTTTGTCTCTCAAAAATTTCCTTCATGGAGCCAAACCAAGTGGCACTCCTATGATGCCAAATCAACAACTTGTTAAAGGAGATTTATGTAAA
GATCCTGAGAGATATAGGAGATTAGTTGGGAAGTTGAACTACTTAATAGTGACTCGACTAGACATTGCTTATTCTGCTGCAGTAGAGCAGATTTTGTGTTATCTGAAAGC
TGCTCCTGGATGTGGGATCTTATACAAAGATCATGGACATACGAGAATTGAATGTTTTTCTGATGCTGATTGGGTGGGATCTCGTGAGGATAGAAGATCAACTTCTGGAT
ATTGTGTCTTTGTAGGTGGAAACTTAGTCTCATGGAAGACTGCACTTCACATTGCATCTAATCCAGTATTTCGTGAACAAACTAAACATATTAAGGTGGATTGTCACTTC
ATTTGTGAGAAAATCCAAGATGGGTTGGTGTCCATAGGATATGTAATGACTGGAGAACAATTGGGAGATATTCTAACTAAAGCTTTAAATGGAGGAATGATAAGTTATCT
GCGCAACAAGCTGGACGTGATTGACATATTTGCTCCAACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGCTACCCATAAATGGTTATTGCATCAACTTGACATTAACAATGCTTTTCTGCATGGTAATCTTCAAGAGGAAGTTTATATGGAGCAACCACCTGGGTTTGTTGC
TCAGGGTGAGAGTGATAAAAAGAGTACATATGATCATTCAGTTTTCTATCGCCGATCTGATAACGGTATAGTTCTACTTGTTGTATATGTTGATGATATTGTTATTACTG
GAAATTATGCATCGGGTATTTTGTCTCTCAAAAATTTCCTTCATGGAGCCAAACCAAGTGGCACTCCTATGATGCCAAATCAACAACTTGTTAAAGGAGATTTATGTAAA
GATCCTGAGAGATATAGGAGATTAGTTGGGAAGTTGAACTACTTAATAGTGACTCGACTAGACATTGCTTATTCTGCTGCAGTAGAGCAGATTTTGTGTTATCTGAAAGC
TGCTCCTGGATGTGGGATCTTATACAAAGATCATGGACATACGAGAATTGAATGTTTTTCTGATGCTGATTGGGTGGGATCTCGTGAGGATAGAAGATCAACTTCTGGAT
ATTGTGTCTTTGTAGGTGGAAACTTAGTCTCATGGAAGACTGCACTTCACATTGCATCTAATCCAGTATTTCGTGAACAAACTAAACATATTAAGGTGGATTGTCACTTC
ATTTGTGAGAAAATCCAAGATGGGTTGGTGTCCATAGGATATGTAATGACTGGAGAACAATTGGGAGATATTCTAACTAAAGCTTTAAATGGAGGAATGATAAGTTATCT
GCGCAACAAGCTGGACGTGATTGACATATTTGCTCCAACTTGA
Protein sequenceShow/hide protein sequence
MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESDKKSTYDHSVFYRRSDNGIVLLVVYVDDIVITGNYASGILSLKNFLHGAKPSGTPMMPNQQLVKGDLCK
DPERYRRLVGKLNYLIVTRLDIAYSAAVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSWKTALHIASNPVFREQTKHIKVDCHF
ICEKIQDGLVSIGYVMTGEQLGDILTKALNGGMISYLRNKLDVIDIFAPT