; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G20380 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G20380
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr4:18500980..18504028
RNA-Seq ExpressionCSPI04G20380
SyntenyCSPI04G20380
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG8475264.1 hypothetical protein CXB51_032125 [Gossypium anomalum]6.5e-5241.38Show/hide
Query:  VKGFKMWHPTNKKFIISRDVHFRETEMF--MQGKGSIERNPNAT-------ETYTTRIEVENTRNNAQSIDKTTGTDQEQVKNIGGEQTGIIEEQPDLSQ
        VKG+K+W P N+K +ISRDV F ET M   +  K S  +  + T       + Y     V    N A+ ID     +QE       E T  +EE    ++
Subjt:  VKGFKMWHPTNKKFIISRDVHFRETEMF--MQGKGSIERNPNAT-------ETYTTRIEVENTRNNAQSIDKTTGTDQEQVKNIGGEQTGIIEEQPDLSQ

Query:  -----YSLASDKQRRIIVPPA------RLLLSLVAQNNLELDQLDVKTTFLHGYLDETIYMVQPKGFEVQGKEDLYYLLKKSIYGLKQSP-----RYD--
             YS         +  P       R LL +VA ++LEL+QLDVKT FLHG L+E IYM QP+GF V  KED   LLKKS+YGLKQSP     R+D  
Subjt:  -----YSLASDKQRRIIVPPA------RLLLSLVAQNNLELDQLDVKTTFLHGYLDETIYMVQPKGFEVQGKEDLYYLLKKSIYGLKQSP-----RYD--

Query:  ----------RSKEDLNNVKTLLSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRPVTLPITQNFKLFAYNSP-SETDIEHK
                  + K ++  VK  LS+EF+MKDLG ++KILG++I RDR  S L + Q  Y EK++ RFN+ + +PV+ P+  +F+L +  SP S+ +IE+ 
Subjt:  ----------RSKEDLNNVKTLLSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRPVTLPITQNFKLFAYNSP-SETDIEHK

Query:  LKMENVPYSQVVGSLMYLMISTRPDLFYSANLVGRYMTNLGKRHWKTL
          M +VPY   VGSL+Y M+ +RPDL Y+ + V RYM N GK HWK +
Subjt:  LKMENVPYSQVVGSLMYLMISTRPDLFYSANLVGRYMTNLGKRHWKTL

KAG8475264.1 hypothetical protein CXB51_032125 [Gossypium anomalum]2.2e-0753.23Show/hide
Query:  GLEALSKQGILP-QDICNKLSFCEHCVLGKARKQSFTKAQHKTKRILDYIYSDLGGPTSTPS
        G+  LSK+G+L  Q IC KL+FCE+CV GK ++  FT+  H TK  L+YI+SDL GP+  PS
Subjt:  GLEALSKQGILP-QDICNKLSFCEHCVLGKARKQSFTKAQHKTKRILDYIYSDLGGPTSTPS

KAG8475264.1 hypothetical protein CXB51_032125 [Gossypium anomalum]1.1e-5140.22Show/hide
Query:  VKGFKMWHPTNKKFIISRDVHFRETEMF----------MQGKGSIERNPNATETYTTRIEVENTRNNAQSID-KTTGTDQEQVKNI--------GGEQTG
        VKG+K+W P N+K +ISRDV F ET M            + K   +R     + Y     V    N A+ ID     ++  +V+ +          E T 
Subjt:  VKGFKMWHPTNKKFIISRDVHFRETEMF----------MQGKGSIERNPNATETYTTRIEVENTRNNAQSID-KTTGTDQEQVKNI--------GGEQTG

Query:  IIEEQPDLSQ-----YSLASDKQRRIIVPPA------RLLLSLVAQNNLELDQLDVKTTFLHGYLDETIYMVQPKGFEVQGKEDLYYLLKKSIYGLKQSP
         +EE    ++     YS         +  P       R LL +VA ++LEL+QLDVKT FLHG L+E IYM QP+GF V  KED   LLKKS+YGLKQSP
Subjt:  IIEEQPDLSQ-----YSLASDKQRRIIVPPA------RLLLSLVAQNNLELDQLDVKTTFLHGYLDETIYMVQPKGFEVQGKEDLYYLLKKSIYGLKQSP

Query:  -----RYD------------RSKEDLNNVKTLLSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRPVTLPITQNFKLFAYNS
             R+D            + K ++  VK  LS+EF+MKDLG ++KILG++I RDR  S L + Q  Y EK++ RFN+ + +PV+ P+  +F+L +  S
Subjt:  -----RYD------------RSKEDLNNVKTLLSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRPVTLPITQNFKLFAYNS

Query:  P-SETDIEHKLKMENVPYSQVVGSLMYLMISTRPDLFYSANLVGRYMTNLGKRHWKTL
        P S+ +IE+   M +VPYS  VGSLMY M+ +RPDL Y+ + V RYM N GK HWK +
Subjt:  P-SETDIEHKLKMENVPYSQVVGSLMYLMISTRPDLFYSANLVGRYMTNLGKRHWKTL

KAG8493469.1 hypothetical protein CXB51_010771 [Gossypium anomalum]1.7e-5534.44Show/hide
Query:  GLEALSKQGILP-QDICNKLSFCEHCVLGKARKQSFTKAQHKTKRILDYIYSDLGGPTSTPS-----------------------------LSAQEE---
        G+  L K G+L  Q IC KL+FCEHCV GK ++  FT+  H TK  L+YI+SDL G +  PS                             + A+ +   
Subjt:  GLEALSKQGILP-QDICNKLSFCEHCVLGKARKQSFTKAQHKTKRILDYIYSDLGGPTSTPS-----------------------------LSAQEE---

Query:  -------------------------VKGFKMWHPTNKKFIISRDVHFRETEMFMQGKGSIERNPNATETYTTRIEVENTRNNAQSIDKTTGTDQEQVKNI
                                 VKG+K+W P N+K +ISRDV F ET M           PN +       +  N  N  Q   K  GT    V+  
Subjt:  -------------------------VKGFKMWHPTNKKFIISRDVHFRETEMFMQGKGSIERNPNATETYTTRIEVENTRNNAQSIDKTTGTDQEQVKNI

Query:  GGEQTGIIEEQPDLSQYSLASDKQRRIIVPPARLLLSLVAQNNLELDQLDVKTTFLHGYLDETIYMVQPKGFEVQGKEDLYYLLKKSIYGLKQSPR----
          +   + +    + +    +     +     R LL +VA ++LEL+QLDVKT FLHG L+E IYM QP+GF +  KE+   LLKKS+YGLKQSPR    
Subjt:  GGEQTGIIEEQPDLSQYSLASDKQRRIIVPPARLLLSLVAQNNLELDQLDVKTTFLHGYLDETIYMVQPKGFEVQGKEDLYYLLKKSIYGLKQSPR----

Query:  ---------------YD---------------------------RSKEDLNNVKTLLSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRR
                       +D                           + K ++  VK  L++EF+MKDLG ++KILG++I RDR +S L + Q  Y EKV+ R
Subjt:  ---------------YD---------------------------RSKEDLNNVKTLLSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRR

Query:  FNLTTVRPVTLPITQNFKLFAYNSP-SETDIEHKLKMENVPYSQVVGSLMYLMISTRPDLFYSANLVGRYMTNLGKRHWKTL
        FN+ +V+PV+ P+  +F+L +  SP S+  IE+   M +VPYS  VGSLMY+M+ +R DL Y+ + V RYM N  K  WK +
Subjt:  FNLTTVRPVTLPITQNFKLFAYNSP-SETDIEHKLKMENVPYSQVVGSLMYLMISTRPDLFYSANLVGRYMTNLGKRHWKTL

KAG8499189.1 hypothetical protein CXB51_005621 [Gossypium anomalum]3.0e-5737.96Show/hide
Query:  GLEALSKQGILP-QDICNKLSFCEHCVLGKARKQSFTKAQHKTKRILDYIYSDLGGPTSTPS------LSAQEEVKGFKMWHPTNKKFIISRDVHFRETE
        G+  LSK+G+L  Q IC KL+FCEHCV GK ++  FT+  H TK  L+YI+SDL GP+  PS      L+  E++   +   P+N    IS      E  
Subjt:  GLEALSKQGILP-QDICNKLSFCEHCVLGKARKQSFTKAQHKTKRILDYIYSDLGGPTSTPS------LSAQEEVKGFKMWHPTNKKFIISRDVHFRETE

Query:  MFMQGKGSIERNPNATETYTTRIEVENTRNNAQSIDKTTGT-DQEQVKNIGGEQTGIIEEQPDLSQYSLASDKQRRIIVPPARLLLSLVAQNNLELDQLD
        MF   +     + N T       + + T        K  GT   E+ K           + P +    + S   +   +   R LL +VA ++LEL+QLD
Subjt:  MFMQGKGSIERNPNATETYTTRIEVENTRNNAQSIDKTTGT-DQEQVKNIGGEQTGIIEEQPDLSQYSLASDKQRRIIVPPARLLLSLVAQNNLELDQLD

Query:  VKTTFLHGYLDETIYMVQPKGFEVQGKEDLYYLLKKSIYGLKQSPR-------------------YD---------------------------RSKEDL
        VKT FLHG L+E IYM QP+GF V  KED   LLKKS+YGLKQSPR                   +D                           + K ++
Subjt:  VKTTFLHGYLDETIYMVQPKGFEVQGKEDLYYLLKKSIYGLKQSPR-------------------YD---------------------------RSKEDL

Query:  NNVKTLLSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRPVTLPITQNFKLFAYNSP-SETDIEHKLKMENVPYSQVVGSLM
          VK  LS+EF+MKDLG ++KILG++I RDR  S L + Q  Y EK++ RFN+ + +PV+ P+  +F+L +  SP S+ +IE+   M +VPYS  VGSLM
Subjt:  NNVKTLLSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRPVTLPITQNFKLFAYNSP-SETDIEHKLKMENVPYSQVVGSLM

Query:  YLMISTRPDLFYSANLVGRYMTNLGKRHWKTL
        Y M+ +RPDL Y+ + V RYM N GK HWK +
Subjt:  YLMISTRPDLFYSANLVGRYMTNLGKRHWKTL

RVW25647.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.4e-5133.62Show/hide
Query:  GIESVTMKLKDGTVKL-HR------NKGLEALSKQGILPQDICNKLSFCEHCVLGKARKQSFTKAQHKTKRILDYIYSDLGGPTSTPSLSAQEEVK----
        G  S  +K   GT KL H+      ++GL+ L KQG+L       L FCEHCV GKA +  F KA H+T+   D+I+SDL  P+  PS+     VK    
Subjt:  GIESVTMKLKDGTVKL-HR------NKGLEALSKQGILPQDICNKLSFCEHCVLGKARKQSFTKAQHKTKRILDYIYSDLGGPTSTPSLSAQEEVK----

Query:  ---------GFKMWH---------PTNKKFIISRDVHFRETEMFM-------------------QGKGSIERNPNATETYTTRIEVE----NTRNNAQSI
                  F++ H          T+ K +    VH R+ E                      +  G  E    A      R+++E        N+   
Subjt:  ---------GFKMWH---------PTNKKFIISRDVHFRETEMFM-------------------QGKGSIERNPNATETYTTRIEVE----NTRNNAQSI

Query:  DKTTGTDQEQVKNIGGEQTGIIEEQPDLSQYSLASDKQRRIIVP--PARLLLSLVAQNNLELDQLDVKTTFLHGYLDETIYMVQPKGFEVQGKEDLYYLL
        D+     Q+++ ++   +T  +  +P   +  +  ++   ++V     RLLL+ VA  +LELDQLDVKTTFLHG LDE IYM  P+GF    K+   +LL
Subjt:  DKTTGTDQEQVKNIGGEQTGIIEEQPDLSQYSLASDKQRRIIVP--PARLLLSLVAQNNLELDQLDVKTTFLHGYLDETIYMVQPKGFEVQGKEDLYYLL

Query:  KKSIYGLKQSPR--------------YDRSKED--------LNNVKTLLSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRP
        KKS+YGLKQSPR              ++RS  D         +N+  LL    DM  L  S++ILG++I RDR++ +L + Q++Y  KV+ RF +  V+ 
Subjt:  KKSIYGLKQSPR--------------YDRSKED--------LNNVKTLLSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRP

Query:  VTLPITQNFKLFAYNSPSETDIEHKLKMENVPYSQVVGSLMYLMISTRPDLFYSANLVGRYMTNLGKRHWKTL
        ++ P+ Q+F+L    +P ET  E K  ME +PY+ +VGS+MY M+ +RPDL Y+ +++ RYM+  GK HW+ +
Subjt:  VTLPITQNFKLFAYNSPSETDIEHKLKMENVPYSQVVGSLMYLMISTRPDLFYSANLVGRYMTNLGKRHWKTL

TrEMBL top hitse value%identityAlignment
A0A2N9IB36 CCHC-type domain-containing protein5.6e-4931.15Show/hide
Query:  NGEFVFMGNNNACNNAGIESVTMKLKDGTVK----------LHRN---------KGLEALSKQGIL--PQDICNKLSFCE---------HCVLG--KARK
        N + V MGN+  C   G+ ++ +K+ DG V+          + +N         KG    S+ GI+   +     ++  E         H  LG      
Subjt:  NGEFVFMGNNNACNNAGIESVTMKLKDGTVK----------LHRN---------KGLEALSKQGIL--PQDICNKLSFCE---------HCVLG--KARK

Query:  QSFTKAQHKTKRILDYIYSDLGGPTSTPSLSAQEEVKGFKMWHPTNKKFIISRDVHFRE---TEMFMQGKGSIERNPNATETYTTRIEV--------ENT
        +  +K   K+K+ +               L  ++ VKG+K+W P  +K +ISRDV F E   T+ F + K    ++ N     T ++E+        E  
Subjt:  QSFTKAQHKTKRILDYIYSDLGGPTSTPSLSAQEEVKGFKMWHPTNKKFIISRDVHFRE---TEMFMQGKGSIERNPNATETYTTRIEV--------ENT

Query:  RNNAQSIDKTTGTDQEQVKNIGGEQT-----------GIIEEQPDLSQ------YSLASDKQRRIIVPPARLLLSLVAQNNLELDQLDVKTTFLHGYLDE
         +N Q  D T     +  K   G++             ++EE   LS+        L   K+  +     R +L+LVA  +LEL+QLDVKT FLHG L+E
Subjt:  RNNAQSIDKTTGTDQEQVKNIGGEQT-----------GIIEEQPDLSQ------YSLASDKQRRIIVPPARLLLSLVAQNNLELDQLDVKTTFLHGYLDE

Query:  TIYMVQPKGFEVQGKEDLYYLLKKSIYGLKQSPR-------------------YD---------------------------RSKEDLNNVKTLLSKEFD
         I+MVQP+GF+  G E+L   LKKS+YGLKQSPR                   YD                           +S  ++N +K+LL KEF+
Subjt:  TIYMVQPKGFEVQGKEDLYYLLKKSIYGLKQSPR-------------------YD---------------------------RSKEDLNNVKTLLSKEFD

Query:  MKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRPVTLPITQNFKLFAYNSP-SETDIEHKLKMENVPYSQVVGSLMYLMISTRPDLFY
        MKDLG ++KILG++I RDR    L + Q  Y  KV+ +F++   +PV+ P+  +F+L +   P +E +IE+   M  VPY+ VVG LMY M+ TRPDL +
Subjt:  MKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRPVTLPITQNFKLFAYNSP-SETDIEHKLKMENVPYSQVVGSLMYLMISTRPDLFY

Query:  SANLVGRYMTNLGKRHWKTL
        + + V RYM N G+ HW  +
Subjt:  SANLVGRYMTNLGKRHWKTL

A0A2N9IN85 Integrase catalytic domain-containing protein6.6e-5029.87Show/hide
Query:  NGEFVFMGNN----NACNNAGIESVTMKLKDGTVKLHRNKGLEALSKQGILPQDICNKLSFCEHCVLGKARKQSFTKAQHKTKRILDYIYSDLGGPTSTP
        NG ++  G+      A ++  ++S   +L    +      G+  LSKQG+L      KL FCEHCVLGK  +  F   QHK+K  +DYI+SDL  P+   
Subjt:  NGEFVFMGNN----NACNNAGIESVTMKLKDGTVKLHRNKGLEALSKQGILPQDICNKLSFCEHCVLGKARKQSFTKAQHKTKRILDYIYSDLGGPTSTP

Query:  S----------------------LSAQEEV-KGFKMWHPTNKKFIISRDVHFR----------ETEMFMQGKGSIE----RNPNATETYTTRIEVEN---
        S                      L  +  V   F  W    KK    +   FR          E + F + +G +     R+P+      T  EV +   
Subjt:  S----------------------LSAQEEV-KGFKMWHPTNKKFIISRDVHFR----------ETEMFMQGKGSIE----RNPNATETYTTRIEVEN---

Query:  ------------TRNNAQSIDKTTGTDQEQVKNIGGEQTGIIEEQPDLSQYSLASDKQRRIIVPP-----------------------------------
                    T  +        GT  + + +   + T  ++      QYS+A+ +++R I PP                                   
Subjt:  ------------TRNNAQSIDKTTGTDQEQVKNIGGEQTGIIEEQPDLSQYSLASDKQRRIIVPP-----------------------------------

Query:  ------------------------------------ARLLLSLVAQNNLELDQLDVKTTFLHGYLDETIYMVQPKGFEVQGKEDLYYLLKKSIYGLKQSP
                                             R+LLS+VA  +LEL+QLDVKT FLH  L++ IYM QP+GF++QGKE    LLKKS+YGLKQSP
Subjt:  ------------------------------------ARLLLSLVAQNNLELDQLDVKTTFLHGYLDETIYMVQPKGFEVQGKEDLYYLLKKSIYGLKQSP

Query:  R-------------------YDRSKEDLNNVKTLLSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRPVTLPITQNFKLFAY
        R                     ++  ++N +KT LS EF+MKDLG ++KILG++I RDR    L +   +Y EKV+ RF++   +PV+ P+  +FKL A 
Subjt:  R-------------------YDRSKEDLNNVKTLLSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRPVTLPITQNFKLFAY

Query:  NSPSETDIEHKLKMENVPYSQVVGSLMYLMISTRPDLFYSANLVGRYMTNLGKRHWKTL
         SP   + EH   M  VPY+  V S+MY+M+ T PD+     +V RYMTN  K HW+ +
Subjt:  NSPSETDIEHKLKMENVPYSQVVGSLMYLMISTRPDLFYSANLVGRYMTNLGKRHWKTL

A0A438CPX9 Retrovirus-related Pol polyprotein from transposon TNT 1-942.1e-4828.41Show/hide
Query:  NGEFVFMGNNNACNNAGIESVTMKLKDGTVKL-HR------NKGLEALSKQGILPQDICNKLSFCEHCVLGKARKQSFTKAQHKTKRILDYIYSDLGGPT
        +G    M        +G  S  +K   GT KL H+      ++GL+ L KQG+L       L FCEHCV GKA +  F KA H+T+  LDYI+SDL GP+
Subjt:  NGEFVFMGNNNACNNAGIESVTMKLKDGTVKL-HR------NKGLEALSKQGILPQDICNKLSFCEHCVLGKARKQSFTKAQHKTKRILDYIYSDLGGPT

Query:  STPSL-------------SAQEE------------------------------------------VKGFKMWHPT--NKKFIISRDVHFRETEMFMQ---
          PS+             + QE+                                          VKG+K+W  T    K IISRDV F E +M  Q   
Subjt:  STPSL-------------SAQEE------------------------------------------VKGFKMWHPT--NKKFIISRDVHFRETEMFMQ---

Query:  ----GKGSIE----------RNPNATETYTTRIEVENTRN-------------------------------------------------------NAQSI
            G   ++               T + T + E+ + R                                                        N+  +
Subjt:  ----GKGSIE----------RNPNATETYTTRIEVENTRN-------------------------------------------------------NAQSI

Query:  DKTTGTDQEQVKNIGGEQTGIIEEQP------------DLSQYSLASDKQR-------------------RIIVP-----PARLLLSLVAQNNLELDQLD
        D+     QE++ ++   +T  +  +P               Q +L ++  R                    I  P       RLLL+ VA  +LELDQLD
Subjt:  DKTTGTDQEQVKNIGGEQTGIIEEQP------------DLSQYSLASDKQR-------------------RIIVP-----PARLLLSLVAQNNLELDQLD

Query:  VKTTFLHGYLDETIYMVQPKGFEVQGKEDLYYLLKKSIYGLKQSPR--------------YDRSKED-------------------------------LN
        VKTTFLHG LDE IYM  P+GF    K+    LLKKS+YGLKQSPR              ++RS  D                               L 
Subjt:  VKTTFLHGYLDETIYMVQPKGFEVQGKEDLYYLLKKSIYGLKQSPR--------------YDRSKED-------------------------------LN

Query:  NVKTLLSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRPVTLPITQNFKLFAYNSPSETDIEHKLKMENVPYSQVVGSLMYL
         VK +L  +F+MKDLG +++ILG++I RDR++ +L + Q++Y  KV+ RF +  V+ V+ P+ Q+F+L    +P     E K  ME +PY+ +VGS+MY 
Subjt:  NVKTLLSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRPVTLPITQNFKLFAYNSPSETDIEHKLKMENVPYSQVVGSLMYL

Query:  MISTRPDLFYSANLVGRYMTNLGKRHWKTL
        M+ +RPDL Y+ +++ RYM+   K HW+ +
Subjt:  MISTRPDLFYSANLVGRYMTNLGKRHWKTL

A0A438CR10 Retrovirus-related Pol polyprotein from transposon TNT 1-947.0e-5233.62Show/hide
Query:  GIESVTMKLKDGTVKL-HR------NKGLEALSKQGILPQDICNKLSFCEHCVLGKARKQSFTKAQHKTKRILDYIYSDLGGPTSTPSLSAQEEVK----
        G  S  +K   GT KL H+      ++GL+ L KQG+L       L FCEHCV GKA +  F KA H+T+   D+I+SDL  P+  PS+     VK    
Subjt:  GIESVTMKLKDGTVKL-HR------NKGLEALSKQGILPQDICNKLSFCEHCVLGKARKQSFTKAQHKTKRILDYIYSDLGGPTSTPSLSAQEEVK----

Query:  ---------GFKMWH---------PTNKKFIISRDVHFRETEMFM-------------------QGKGSIERNPNATETYTTRIEVE----NTRNNAQSI
                  F++ H          T+ K +    VH R+ E                      +  G  E    A      R+++E        N+   
Subjt:  ---------GFKMWH---------PTNKKFIISRDVHFRETEMFM-------------------QGKGSIERNPNATETYTTRIEVE----NTRNNAQSI

Query:  DKTTGTDQEQVKNIGGEQTGIIEEQPDLSQYSLASDKQRRIIVP--PARLLLSLVAQNNLELDQLDVKTTFLHGYLDETIYMVQPKGFEVQGKEDLYYLL
        D+     Q+++ ++   +T  +  +P   +  +  ++   ++V     RLLL+ VA  +LELDQLDVKTTFLHG LDE IYM  P+GF    K+   +LL
Subjt:  DKTTGTDQEQVKNIGGEQTGIIEEQPDLSQYSLASDKQRRIIVP--PARLLLSLVAQNNLELDQLDVKTTFLHGYLDETIYMVQPKGFEVQGKEDLYYLL

Query:  KKSIYGLKQSPR--------------YDRSKED--------LNNVKTLLSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRP
        KKS+YGLKQSPR              ++RS  D         +N+  LL    DM  L  S++ILG++I RDR++ +L + Q++Y  KV+ RF +  V+ 
Subjt:  KKSIYGLKQSPR--------------YDRSKED--------LNNVKTLLSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRP

Query:  VTLPITQNFKLFAYNSPSETDIEHKLKMENVPYSQVVGSLMYLMISTRPDLFYSANLVGRYMTNLGKRHWKTL
        ++ P+ Q+F+L    +P ET  E K  ME +PY+ +VGS+MY M+ +RPDL Y+ +++ RYM+  GK HW+ +
Subjt:  VTLPITQNFKLFAYNSPSETDIEHKLKMENVPYSQVVGSLMYLMISTRPDLFYSANLVGRYMTNLGKRHWKTL

Q6L4X8 Putative polyprotein1.9e-4928.6Show/hide
Query:  GLEALSKQGILPQDICNKLSFCEHCVLGKARKQSFTKAQHKTKRILDYIYSDLGGPTS------------------TP----------------------
        GL  LSK+G+L      KL FCEHC+ GK ++  F  + H T+ ILDY++SDL G  S                  TP                      
Subjt:  GLEALSKQGILPQDICNKLSFCEHCVLGKARKQSFTKAQHKTKRILDYIYSDLGGPTS------------------TP----------------------

Query:  ----------------SLSAQEEVKGFKMWHPTNKKFIISRDVHFRETEMFMQGKGSIERNPNATETYTTRIEVENTRNNAQSIDKTTGTDQEQVKNIGG
                         L     VK +K+W P  KK +ISR+V F E+ M +  K S   N          ++VE+  ++  + +K      +    I  
Subjt:  ----------------SLSAQEEVKGFKMWHPTNKKFIISRDVHFRETEMFMQGKGSIERNPNATETYTTRIEVENTRNNAQSIDKTTGTDQEQVKNIGG

Query:  EQTGIIEEQPDLSQYSLASDKQRRIIVPP-----------------------------------------------------------------------
          +  +++ P   + S+A DK +R I PP                                                                       
Subjt:  EQTGIIEEQPDLSQYSLASDKQRRIIVPP-----------------------------------------------------------------------

Query:  ------------------------------------------------ARLLLSLVAQNNLELDQLDVKTTFLHGYLDETIYMVQPKGFEVQGKEDLYYL
                                                         R LLS+VA ++ EL+Q+DVKT FLHG L+E IYM QP+GF V GKE+L Y 
Subjt:  ------------------------------------------------ARLLLSLVAQNNLELDQLDVKTTFLHGYLDETIYMVQPKGFEVQGKEDLYYL

Query:  LKKSIYGLKQSPR-------------------YD--------------------------RSKEDLNNVKTLLSKEFDMKDLGESRKILGIDITRDRNQS
        LKKS+YGLKQSPR                   YD                          + K ++  +K  LS EF+MKDLG ++KILG++ITR+R+  
Subjt:  LKKSIYGLKQSPR-------------------YD--------------------------RSKEDLNNVKTLLSKEFDMKDLGESRKILGIDITRDRNQS

Query:  ILSIGQSTYCEKVIRRFNLTTVRPVTLPITQNFKLFAYNSP-SETDIEHKLKMENVPYSQVVGSLMYLMISTRPDLFYSANLVGRYMTNLGKRHWKTL
         L + Q  Y EKV+RRFN+   +PV+ P+  +F+L +   P S+ DIE+   M  VPY  VVGSLMY M+ +R DL ++ ++V RYM N GK HWK +
Subjt:  ILSIGQSTYCEKVIRRFNLTTVRPVTLPITQNFKLFAYNSP-SETDIEHKLKMENVPYSQVVGSLMYLMISTRPDLFYSANLVGRYMTNLGKRHWKTL

SwissProt top hitse value%identityAlignment
P04146 Copia protein6.2e-2130.4Show/hide
Query:  RLLLSLVAQNNLELDQLDVKTTFLHGYLDETIYMVQPKGFEVQGKEDLYYLLKKSIYGLKQSPR--------------YDRSKED---------------
        R +LSLV Q NL++ Q+DVKT FL+G L E IYM  P+G  +    D    L K+IYGLKQ+ R              +  S  D               
Subjt:  RLLLSLVAQNNLELDQLDVKTTFLHGYLDETIYMVQPKGFEVQGKEDLYYLLKKSIYGLKQSPR--------------YDRSKED---------------

Query:  ------------------LNNVKTLLSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRPVTLPITQNFKLFAYNSPSETDIE
                          +NN K  L ++F M DL E +  +GI I    ++  LS  QS Y +K++ +FN+     V+ P+         NS  +    
Subjt:  ------------------LNNVKTLLSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRPVTLPITQNFKLFAYNSPSETDIE

Query:  HKLKMENVPYSQVVGSLMYLMISTRPDLFYSANLVGRYMTNLGKRHWKTL
              N P   ++G LMY+M+ TRPDL  + N++ RY +      W+ L
Subjt:  HKLKMENVPYSQVVGSLMYLMISTRPDLFYSANLVGRYMTNLGKRHWKTL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.1e-4139.76Show/hide
Query:  RLLLSLVAQNNLELDQLDVKTTFLHGYLDETIYMVQPKGFEVQGKEDLYYLLKKSIYGLKQSPR-----YDRSKEDLNNVKTL-----------------
        R +LSL A  +LE++QLDVKT FLHG L+E IYM QP+GFEV GK+ +   L KS+YGLKQ+PR     +D   +    +KT                  
Subjt:  RLLLSLVAQNNLELDQLDVKTTFLHGYLDETIYMVQPKGFEVQGKEDLYYLLKKSIYGLKQSPR-----YDRSKEDLNNVKTL-----------------

Query:  ------------------------LSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRPVTLPITQNFKLFAYNSPSETDIEH
                                LSK FDMKDLG +++ILG+ I R+R    L + Q  Y E+V+ RFN+   +PV+ P+  + KL     P  T +E 
Subjt:  ------------------------LSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRPVTLPITQNFKLFAYNSPSETDIEH

Query:  KLKMENVPYSQVVGSLMYLMISTRPDLFYSANLVGRYMTNLGKRHWKTL
        K  M  VPYS  VGSLMY M+ TRPD+ ++  +V R++ N GK HW+ +
Subjt:  KLKMENVPYSQVVGSLMYLMISTRPDLFYSANLVGRYMTNLGKRHWKTL

P25600 Putative transposon Ty5-1 protein YCL074W2.8e-1327.6Show/hide
Query:  LDVKTTFLHGYLDETIYMVQPKGFEVQGKEDLYYLLKKSIYGLKQSP------------------------RYDRSKED---------------------
        +DV T FL+  +DE IY+ QP GF  +   D  + L   +YGLKQ+P                         Y RS  D                     
Subjt:  LDVKTTFLHGYLDETIYMVQPKGFEVQGKEDLYYLLKKSIYGLKQSP------------------------RYDRSKED---------------------

Query:  LNNVKTLLSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRPVTLPITQNFKLFAYNSPSETDIEHKLKMENVPYSQVVGSLM
         + VK  L+K + MKDLG+  K LG++I +  N  I ++    Y  K      + T +    P+  +  LF   SP   DI         PY  +VG L+
Subjt:  LNNVKTLLSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRPVTLPITQNFKLFAYNSPSETDIEHKLKMENVPYSQVVGSLM

Query:  YLMISTRPDLFYSANLVGRYM
        +   + RPD+ Y  +L+ R++
Subjt:  YLMISTRPDLFYSANLVGRYM

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.7e-1528.23Show/hide
Query:  RLLLSLVAQNNLELDQLDVKTTFLHGYLDETIYMVQPKGFEVQGKEDLYYLLKKSIYGLKQSPR--------------YDRSKED---------------
        R++L +    +  + QLDV   FL G L + +YM QP GF  + + +    L+K++YGLKQ+PR              +  S  D               
Subjt:  RLLLSLVAQNNLELDQLDVKTTFLHGYLDETIYMVQPKGFEVQGKEDLYYLLKKSIYGLKQSPR--------------YDRSKED---------------

Query:  ----------------LNNVKTLLSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRPVTLPITQNFKLFAYNSPSETDIEHK
                        L+N    LS+ F +KD  E    LGI+    R  + L + Q  Y   ++ R N+ T +PVT P+  + KL  Y+    TD    
Subjt:  ----------------LNNVKTLLSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRPVTLPITQNFKLFAYNSPSETDIEHK

Query:  LKMENVPYSQVVGSLMYLMISTRPDLFYSANLVGRYMTNLGKRHWKTL
               Y  +VGSL YL   TRPD+ Y+ N + ++M    + H + L
Subjt:  LKMENVPYSQVVGSLMYLMISTRPDLFYSANLVGRYMTNLGKRHWKTL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE28.6e-1527.82Show/hide
Query:  RLLLSLVAQNNLELDQLDVKTTFLHGYLDETIYMVQPKGFEVQGKEDLYYLLKKSIYGLKQSPR--------------YDRSKED---------------
        R++L +    +  + QLDV   FL G L + +YM QP GF  + + D    L+K+IYGLKQ+PR              +  S  D               
Subjt:  RLLLSLVAQNNLELDQLDVKTTFLHGYLDETIYMVQPKGFEVQGKEDLYYLLKKSIYGLKQSPR--------------YDRSKED---------------

Query:  ----------------LNNVKTLLSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRPVTLPITQNFKLFAYNSPSETDIEHK
                        L +    LS+ F +K+  +    LGI+  R      L + Q  Y   ++ R N+ T +PV  P+  + KL  ++     D    
Subjt:  ----------------LNNVKTLLSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRPVTLPITQNFKLFAYNSPSETDIEHK

Query:  LKMENVPYSQVVGSLMYLMISTRPDLFYSANLVGRYMTNLGKRHWKTL
               Y  +VGSL YL   TRPDL Y+ N + +YM      HW  L
Subjt:  LKMENVPYSQVVGSLMYLMISTRPDLFYSANLVGRYMTNLGKRHWKTL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 89.1e-1225.83Show/hide
Query:  RLLLSLVAQNNLELDQLDVKTTFLHGYLDETIYMVQPKGFEVQGKEDL----YYLLKKSIYGLKQSPR--------------YDRSKED-----------
        +L+L++ A  N  L QLD+   FL+G LDE IYM  P G+  +  + L       LKKSIYGLKQ+ R              + +S  D           
Subjt:  RLLLSLVAQNNLELDQLDVKTTFLHGYLDETIYMVQPKGFEVQGKEDL----YYLLKKSIYGLKQSPR--------------YDRSKED-----------

Query:  --------------------LNNVKTLLSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRPVTLPITQNFKLFAYNSPSETD
                            ++ +K+ L   F ++DLG  +  LG++I   R+ + ++I Q  Y   ++    L   +P ++P+  +    A++     D
Subjt:  --------------------LNNVKTLLSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRPVTLPITQNFKLFAYNSPSETD

Query:  IEHKLKMENVPYSQVVGSLMYLMISTRPDLFYSANLVGRY
         +         Y +++G LMYL I TR D+ ++ N + ++
Subjt:  IEHKLKMENVPYSQVVGSLMYLMISTRPDLFYSANLVGRY

ATMG00810.1 DNA/RNA polymerases superfamily protein1.5e-0630.89Show/hide
Query:  LNNVKTLLSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRPVT--LPITQNFKLFAYNSPSETDIEHKLKMENVPYSQVVGS
        LN +   LS  F MKDLG     LGI I    + S L + Q+ Y E+++    +   +P++  LP+  N  +     P  +D           +  +VG+
Subjt:  LNNVKTLLSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRPVT--LPITQNFKLFAYNSPSETDIEHKLKMENVPYSQVVGS

Query:  LMYLMISTRPDLFYSANLVGRYM
        L YL + TRPD+ Y+ N+V + M
Subjt:  LMYLMISTRPDLFYSANLVGRYM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGCTATTATATCAGCCCTGAGAACCAGAGAATTGGAAACACAATCATCTCAAAATGAGCATCAAAGTGGAAATGGTTTATTTGTCAACGGAACATCCCAAAGCAA
TCAAGCTAAAAACAACAGCAATGGAGAGTTTGTGTTCATGGGGAATAATAATGCATGTAACAATGCTGGAATTGAATCAGTCACCATGAAATTAAAAGATGGGACTGTGA
AGCTCCATAGAAATAAGGGACTTGAGGCTCTATCTAAACAAGGCATTCTACCTCAAGATATATGCAATAAGTTGTCTTTCTGTGAACACTGTGTGCTAGGCAAAGCAAGG
AAACAAAGCTTCACCAAAGCACAACACAAAACTAAAAGAATTCTAGACTACATCTATTCAGATTTAGGGGGTCCAACCTCAACTCCAAGCCTAAGTGCTCAAGAAGAGGT
AAAGGGTTTTAAGATGTGGCACCCTACTAACAAGAAGTTTATAATTAGTAGGGATGTTCACTTCAGAGAAACCGAGATGTTTATGCAAGGAAAAGGAAGTATCGAAAGGA
ATCCTAATGCCACAGAAACCTATACAACTCGGATTGAGGTGGAGAATACTAGGAACAATGCTCAATCTATAGATAAAACTACTGGTACAGATCAAGAACAAGTAAAGAAC
ATAGGTGGAGAACAAACTGGAATAATAGAAGAACAGCCTGACTTAAGCCAATATTCCCTAGCAAGTGACAAACAAAGAAGGATAATTGTTCCTCCAGCCAGACTTCTCCT
ATCCCTAGTTGCTCAAAACAATCTTGAGTTGGACCAACTTGATGTAAAGACAACCTTCCTTCATGGCTATCTAGACGAAACAATTTACATGGTTCAACCCAAAGGCTTTG
AGGTTCAAGGTAAGGAAGACCTCTACTACTTACTAAAGAAGTCGATATATGGATTGAAGCAATCACCTAGGTACGACCGCTCTAAGGAAGATTTGAATAATGTCAAAACT
CTTTTGAGTAAAGAATTTGACATGAAGGATTTAGGTGAATCAAGAAAGATCCTAGGAATTGACATCACAAGAGACCGAAACCAATCAATACTAAGCATCGGTCAATCAAC
CTATTGTGAGAAGGTAATCAGAAGATTCAACCTCACTACTGTTAGACCAGTCACACTCCCTATTACACAAAATTTTAAACTATTTGCCTATAATTCCCCAAGTGAGACAG
ACATTGAACACAAACTAAAAATGGAGAATGTACCTTACAGCCAAGTAGTAGGAAGTTTAATGTACTTGATGATCTCTACTAGACCTGATCTATTCTATTCAGCAAACCTT
GTTGGCAGATATATGACTAATCTTGGAAAACGACACTGGAAAACTTTACTGGTGACAGTGACAAAAGAAGAAGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATGCTATTATATCAGCCCTGAGAACCAGAGAATTGGAAACACAATCATCTCAAAATGAGCATCAAAGTGGAAATGGTTTATTTGTCAACGGAACATCCCAAAGCAA
TCAAGCTAAAAACAACAGCAATGGAGAGTTTGTGTTCATGGGGAATAATAATGCATGTAACAATGCTGGAATTGAATCAGTCACCATGAAATTAAAAGATGGGACTGTGA
AGCTCCATAGAAATAAGGGACTTGAGGCTCTATCTAAACAAGGCATTCTACCTCAAGATATATGCAATAAGTTGTCTTTCTGTGAACACTGTGTGCTAGGCAAAGCAAGG
AAACAAAGCTTCACCAAAGCACAACACAAAACTAAAAGAATTCTAGACTACATCTATTCAGATTTAGGGGGTCCAACCTCAACTCCAAGCCTAAGTGCTCAAGAAGAGGT
AAAGGGTTTTAAGATGTGGCACCCTACTAACAAGAAGTTTATAATTAGTAGGGATGTTCACTTCAGAGAAACCGAGATGTTTATGCAAGGAAAAGGAAGTATCGAAAGGA
ATCCTAATGCCACAGAAACCTATACAACTCGGATTGAGGTGGAGAATACTAGGAACAATGCTCAATCTATAGATAAAACTACTGGTACAGATCAAGAACAAGTAAAGAAC
ATAGGTGGAGAACAAACTGGAATAATAGAAGAACAGCCTGACTTAAGCCAATATTCCCTAGCAAGTGACAAACAAAGAAGGATAATTGTTCCTCCAGCCAGACTTCTCCT
ATCCCTAGTTGCTCAAAACAATCTTGAGTTGGACCAACTTGATGTAAAGACAACCTTCCTTCATGGCTATCTAGACGAAACAATTTACATGGTTCAACCCAAAGGCTTTG
AGGTTCAAGGTAAGGAAGACCTCTACTACTTACTAAAGAAGTCGATATATGGATTGAAGCAATCACCTAGGTACGACCGCTCTAAGGAAGATTTGAATAATGTCAAAACT
CTTTTGAGTAAAGAATTTGACATGAAGGATTTAGGTGAATCAAGAAAGATCCTAGGAATTGACATCACAAGAGACCGAAACCAATCAATACTAAGCATCGGTCAATCAAC
CTATTGTGAGAAGGTAATCAGAAGATTCAACCTCACTACTGTTAGACCAGTCACACTCCCTATTACACAAAATTTTAAACTATTTGCCTATAATTCCCCAAGTGAGACAG
ACATTGAACACAAACTAAAAATGGAGAATGTACCTTACAGCCAAGTAGTAGGAAGTTTAATGTACTTGATGATCTCTACTAGACCTGATCTATTCTATTCAGCAAACCTT
GTTGGCAGATATATGACTAATCTTGGAAAACGACACTGGAAAACTTTACTGGTGACAGTGACAAAAGAAGAAGTTTAA
Protein sequenceShow/hide protein sequence
MDAIISALRTRELETQSSQNEHQSGNGLFVNGTSQSNQAKNNSNGEFVFMGNNNACNNAGIESVTMKLKDGTVKLHRNKGLEALSKQGILPQDICNKLSFCEHCVLGKAR
KQSFTKAQHKTKRILDYIYSDLGGPTSTPSLSAQEEVKGFKMWHPTNKKFIISRDVHFRETEMFMQGKGSIERNPNATETYTTRIEVENTRNNAQSIDKTTGTDQEQVKN
IGGEQTGIIEEQPDLSQYSLASDKQRRIIVPPARLLLSLVAQNNLELDQLDVKTTFLHGYLDETIYMVQPKGFEVQGKEDLYYLLKKSIYGLKQSPRYDRSKEDLNNVKT
LLSKEFDMKDLGESRKILGIDITRDRNQSILSIGQSTYCEKVIRRFNLTTVRPVTLPITQNFKLFAYNSPSETDIEHKLKMENVPYSQVVGSLMYLMISTRPDLFYSANL
VGRYMTNLGKRHWKTLLVTVTKEEV