; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G01350 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G01350
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionGag-pol polyprotein
Genome locationChr5:1849038..1853483
RNA-Seq ExpressionCSPI05G01350
SyntenyCSPI05G01350
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032034.1 F5J5.1 [Cucumis melo var. makuwa]4.8e-4628.86Show/hide
Query:  QLLTEEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIRTSHKSISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDY
        QL   +QT +WH+KLGH S+  +   +K++AI+ IP+LD+N + F +DCQ GK+ R++HKS+ ECYTNRVLE LHMDLMGPMQTKSL  K+YV + VDDY
Subjt:  QLLTEEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIRTSHKSISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDY

Query:  SWFTWIEII---------------------------IENEMSSQIKEFSWDILK-----------------------------IAVHTDIHAKYKYSSE-
        S +TW+  +                           I N+ + +     WD                                  V  D+++  K  ++ 
Subjt:  SWFTWIEII---------------------------IENEMSSQIKEFSWDILK-----------------------------IAVHTDIHAKYKYSSE-

Query:  ----PVQSVDSLSSTNEDDKNDNNNFDP---------------------------------LQR-----------TLETETETE----------------
            P  S    +ST E  K DN++ DP                                 +Q            TL ++TE                  
Subjt:  ----PVQSVDSLSSTNEDDKNDNNNFDP---------------------------------LQR-----------TLETETETE----------------

Query:  -------------------TPSKHVAPSSHAKKNHVYAKDQIISNGCQ---------KCLLKLQGYVGGGTDKTLFISWTNKNIIMAQLYVDDIVFGGFQ
                              K    S H K  HVY  ++ +    Q             + +GY  G  DKTLFI   +  I++AQ+YVDDI+FGGF 
Subjt:  -------------------TPSKHVAPSSHAKKNHVYAKDQIISNGCQ---------KCLLKLQGYVGGGTDKTLFISWTNKNIIMAQLYVDDIVFGGFQ

Query:  DDIVD-----------------------LSN---------------------------------------------SHLMAAKIIIKYVHGTYDFGWAGC
         D+V+                       L N                                             SHL A K I+KYVHGT DFG    
Subjt:  DDIVD-----------------------LSN---------------------------------------------SHLMAAKIIIKYVHGTYDFGWAGC

Query:  SDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLIWMKQMLCEYGISRDTMILYSDSISVIDISMNLVQHSRTKH
         +     +  C         W S   N     K++ EY+ +GS  +QLIWMK +L EYG  +DT+ LY D++S IDIS N VQHSRTKH
Subjt:  SDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLIWMKQMLCEYGISRDTMILYSDSISVIDISMNLVQHSRTKH

KAA0035673.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.6e-5227.88Show/hide
Query:  RKRDYDKPNGRYSKSFKYKRCGGHGYYQAECLTYLRRQKKSFGATLSNEES---DVSNEEAEYTNVVISITSKDES---------------LQRQWKENS
        R  D+ K      +SF+ + C    +YQAECLTYLRRQKK++ ATL +E+S   +V +    +T  +  I S+ ES               L+   KE+S
Subjt:  RKRDYDKPNGRYSKSFKYKRCGGHGYYQAECLTYLRRQKKSFGATLSNEES---DVSNEEAEYTNVVISITSKDES---------------LQRQWKENS

Query:  PARAIQKENIQKLLDENQQL--------------------------LTEEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIR
         ARAIQKE IQ L++EN++L                          L +  ++L  +KLGH SL ++   ++++A++ IP+L+IN + F  DCQ GK+ +
Subjt:  PARAIQKENIQKLLDENQQL--------------------------LTEEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIR

Query:  TSHKSISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDYSWFTWIEII---------IENEMSSQI-------------------------KEFS
        TSH+S+ ECYT RVLE LH+DLMGPMQ +SL  KKYV + VDDYS  TW++ +         I +E ++ I                         K   
Subjt:  TSHKSISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDYSWFTWIEII---------IENEMSSQI-------------------------KEFS

Query:  WDILKIAVHTDIHAKYKYSSEPVQSV----------------------DSLSSTNEDDKNDNNNF-----------DPLQRTLETETETETPSKHVAPSS
         +    A++T  H   K+  +  Q +                      ++++    D +++ N F           D     L+   + ++    + P+S
Subjt:  WDILKIAVHTDIHAKYKYSSEPVQSV----------------------DSLSSTNEDDKNDNNNF-----------DPLQRTLETETETETPSKHVAPSS

Query:  ------------HAKKNHVYA-----KDQIISNG-------------------------------CQKCLLKL-----------------QGYVGGGTDK
                      KK++  +     K ++++ G                                 K L  L                 +GY  G TDK
Subjt:  ------------HAKKNHVYA-----KDQIISNG-------------------------------CQKCLLKL-----------------QGYVGGGTDK

Query:  TLFISWTNKNIIMAQLYVDDIVFGGFQDDIV---------------------------------------------------------------------
         LF + T+ ++I+AQ+YVDDI+FGGF   ++                                                                     
Subjt:  TLFISWTNKNIIMAQLYVDDIVFGGFQDDIV---------------------------------------------------------------------

Query:  --------------DLSNSHLMAAKIIIKYVHGTYDFG------------------WAGCSDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIV
                      D   SHL   K IIKYVHGT DFG                  WAG +DDRK+T  GCFFLGNN++ WFSKKQN +SLS +E EYIV
Subjt:  --------------DLSNSHLMAAKIIIKYVHGTYDFG------------------WAGCSDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIV

Query:  VGS
        VGS
Subjt:  VGS

KAE8648228.1 hypothetical protein Csa_018353 [Cucumis sativus]1.6e-5294.44Show/hide
Query:  LLTEEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIRTSHKSISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDYS
        L  EEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIRTSHKSISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDYS
Subjt:  LLTEEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIRTSHKSISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDYS

Query:  WFTWIEII
        WFTW+  +
Subjt:  WFTWIEII

PNX93845.1 gag-protease polyprotein, partial [Trifolium pratense]1.2e-4730.82Show/hide
Query:  EEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIRTSHKSISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDYSWFT
        E++ + WH+KLGH +  ++   +  +AI  +PNL I       +CQ GK+ +  H  +    T RV+E LHMDLMGP+QT+SL  K+Y ++ VD +S +T
Subjt:  EEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIRTSHKSISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDYSWFT

Query:  WIEII-IENEMSSQIKEFSWDILKIAVHTDIHAKYKYSSEPVQSVDSLSSTNEDDKNDNNNFDPLQRTLETETETETPSKHVAPSSHAKK----------
        WI  I  ++E     K+    + +   + D   K++ S         L S+       ++   P Q  +  E +  T +K      HAKK          
Subjt:  WIEII-IENEMSSQIKEFSWDILKIAVHTDIHAKYKYSSEPVQSVDSLSSTNEDDKNDNNNFDPLQRTLETETETETPSKHVAPSSHAKK----------

Query:  -----NHVYAKDQIISNGCQ--------------KCLLK-----------------LQGYVGGGTDKTLFISWTNKNIIMAQLYVDDIVFGGFQDDIVDL
             NHV  +    S   +              KC +                    GY        ++ S T   +    + +DD+     +DD+ D 
Subjt:  -----NHVYAKDQIISNGCQ--------------KCLLK-----------------LQGYVGGGTDKTLFISWTNKNIIMAQLYVDDIVFGGFQDDIVDL

Query:  --------------------------------------------SNSHLMAAKIIIKYVHGTYDFG------------------WAGCSDDRKNTLEGCF
                                                      SHL   K I+KYV+GT D+G                  W GC+DDRK+T   CF
Subjt:  --------------------------------------------SNSHLMAAKIIIKYVHGTYDFG------------------WAGCSDDRKNTLEGCF

Query:  FLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLIWMKQMLCEYGISRDTMILYSDSISVIDISMNLVQHSRTKH
        FLGNNLISWFSKKQN +SLS +E EYI  GS+ SQLIWMK+ML E+ + +D M LY DS+S I IS N +QHSRT H
Subjt:  FLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLIWMKQMLCEYGISRDTMILYSDSISVIDISMNLVQHSRTKH

TYK16854.1 F5J5.1 [Cucumis melo var. makuwa]4.8e-4628.86Show/hide
Query:  QLLTEEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIRTSHKSISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDY
        QL   +QT +WH+KLGH S+  +   +K++AI+ IP+LD+N + F +DCQ GK+ R++HKS+ ECYTNRVLE LHMDLMGPMQTKSL  K+YV + VDDY
Subjt:  QLLTEEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIRTSHKSISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDY

Query:  SWFTWIEII---------------------------IENEMSSQIKEFSWDILK-----------------------------IAVHTDIHAKYKYSSE-
        S +TW+  +                           I N+ + +     WD                                  V  D+++  K  ++ 
Subjt:  SWFTWIEII---------------------------IENEMSSQIKEFSWDILK-----------------------------IAVHTDIHAKYKYSSE-

Query:  ----PVQSVDSLSSTNEDDKNDNNNFDP---------------------------------LQR-----------TLETETETE----------------
            P  S    +ST E  K DN++ DP                                 +Q            TL ++TE                  
Subjt:  ----PVQSVDSLSSTNEDDKNDNNNFDP---------------------------------LQR-----------TLETETETE----------------

Query:  -------------------TPSKHVAPSSHAKKNHVYAKDQIISNGCQ---------KCLLKLQGYVGGGTDKTLFISWTNKNIIMAQLYVDDIVFGGFQ
                              K    S H K  HVY  ++ +    Q             + +GY  G  DKTLFI   +  I++AQ+YVDDI+FGGF 
Subjt:  -------------------TPSKHVAPSSHAKKNHVYAKDQIISNGCQ---------KCLLKLQGYVGGGTDKTLFISWTNKNIIMAQLYVDDIVFGGFQ

Query:  DDIVD-----------------------LSN---------------------------------------------SHLMAAKIIIKYVHGTYDFGWAGC
         D+V+                       L N                                             SHL A K I+KYVHGT DFG    
Subjt:  DDIVD-----------------------LSN---------------------------------------------SHLMAAKIIIKYVHGTYDFGWAGC

Query:  SDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLIWMKQMLCEYGISRDTMILYSDSISVIDISMNLVQHSRTKH
         +     +  C         W S   N     K++ EY+ +GS  +QLIWMK +L EYG  +DT+ LY D++S IDIS N VQHSRTKH
Subjt:  SDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLIWMKQMLCEYGISRDTMILYSDSISVIDISMNLVQHSRTKH

TrEMBL top hitse value%identityAlignment
A0A2K3MSR1 Gag-protease polyprotein (Fragment)5.6e-4830.82Show/hide
Query:  EEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIRTSHKSISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDYSWFT
        E++ + WH+KLGH +  ++   +  +AI  +PNL I       +CQ GK+ +  H  +    T RV+E LHMDLMGP+QT+SL  K+Y ++ VD +S +T
Subjt:  EEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIRTSHKSISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDYSWFT

Query:  WIEII-IENEMSSQIKEFSWDILKIAVHTDIHAKYKYSSEPVQSVDSLSSTNEDDKNDNNNFDPLQRTLETETETETPSKHVAPSSHAKK----------
        WI  I  ++E     K+    + +   + D   K++ S         L S+       ++   P Q  +  E +  T +K      HAKK          
Subjt:  WIEII-IENEMSSQIKEFSWDILKIAVHTDIHAKYKYSSEPVQSVDSLSSTNEDDKNDNNNFDPLQRTLETETETETPSKHVAPSSHAKK----------

Query:  -----NHVYAKDQIISNGCQ--------------KCLLK-----------------LQGYVGGGTDKTLFISWTNKNIIMAQLYVDDIVFGGFQDDIVDL
             NHV  +    S   +              KC +                    GY        ++ S T   +    + +DD+     +DD+ D 
Subjt:  -----NHVYAKDQIISNGCQ--------------KCLLK-----------------LQGYVGGGTDKTLFISWTNKNIIMAQLYVDDIVFGGFQDDIVDL

Query:  --------------------------------------------SNSHLMAAKIIIKYVHGTYDFG------------------WAGCSDDRKNTLEGCF
                                                      SHL   K I+KYV+GT D+G                  W GC+DDRK+T   CF
Subjt:  --------------------------------------------SNSHLMAAKIIIKYVHGTYDFG------------------WAGCSDDRKNTLEGCF

Query:  FLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLIWMKQMLCEYGISRDTMILYSDSISVIDISMNLVQHSRTKH
        FLGNNLISWFSKKQN +SLS +E EYI  GS+ SQLIWMK+ML E+ + +D M LY DS+S I IS N +QHSRT H
Subjt:  FLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLIWMKQMLCEYGISRDTMILYSDSISVIDISMNLVQHSRTKH

A0A2Z6MGE8 Uncharacterized protein2.5e-4028.08Show/hide
Query:  EAEYTNVVISITSK-DESLQR--QWKENSPARAIQKE-NIQKLLDENQQLLTEEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRG
        +  +T     +TS+ DE L +  + K+N      Q+E N+   L     +  E++  LWH+KLGH +L ++   +  +AI  +P L I       +CQ G
Subjt:  EAEYTNVVISITSK-DESLQR--QWKENSPARAIQKE-NIQKLLDENQQLLTEEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRG

Query:  KKIRTSHKSISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDYSWFTWIEIIIENEMSSQIKEFSWDILKIAVHTDIHAKYKYSSEPVQSVDS-L
        K+ +  HK +    T RV E LHMDLMGPMQ +SL  KKY  + VDD+S +TWI  I E       K  ++DI K     D+  + +   + V  V +  
Subjt:  KKIRTSHKSISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDYSWFTWIEIIIENEMSSQIKEFSWDILKIAVHTDIHAKYKYSSEPVQSVDS-L

Query:  SSTNEDDKND----------------------NNNFDPLQR-----------------TLETETETETPSKHVAPSSHAKK----------NHVYAKDQI
           N+ D+N                       +  F P+ R                   + + ++   + ++      ++          NHVY K + 
Subjt:  SSTNEDDKND----------------------NNNFDPLQR-----------------TLETETETETPSKHVAPSSHAKK----------NHVYAKDQI

Query:  ISNGCQKC----------LLKLQGYVGGGTDKTLFIS---------------------------WTNKNIIMAQ---------------------LYVDD
           G ++            L  QGY  GG DK LF+                            W N + +  Q                     LY+  
Subjt:  ISNGCQKC----------LLKLQGYVGGGTDKTLFIS---------------------------WTNKNIIMAQ---------------------LYVDD

Query:  IVFG-----------GFQDDIVDLSN-----------------------------------------------------------SHLMAAKIIIKYVHG
           G           G +  I+D  N                                                           SHL+  K I+KYV+G
Subjt:  IVFG-----------GFQDDIVDLSN-----------------------------------------------------------SHLMAAKIIIKYVHG

Query:  TYDFG------------------WAGCSDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLIWMKQMLCEYGISRDTMILYSDSIS
        T D+G                  W G +DDRK+T   CFFLGNNLISWFSKKQNS+SLS +E EYI  GS+ SQL+WMKQML EY + +D M LY D++S
Subjt:  TYDFG------------------WAGCSDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLIWMKQMLCEYGISRDTMILYSDSIS

Query:  VIDISMNLVQHSRTKH
         I+IS N +QHSRTKH
Subjt:  VIDISMNLVQHSRTKH

A0A5A7SLH7 F5J5.12.3e-4628.86Show/hide
Query:  QLLTEEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIRTSHKSISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDY
        QL   +QT +WH+KLGH S+  +   +K++AI+ IP+LD+N + F +DCQ GK+ R++HKS+ ECYTNRVLE LHMDLMGPMQTKSL  K+YV + VDDY
Subjt:  QLLTEEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIRTSHKSISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDY

Query:  SWFTWIEII---------------------------IENEMSSQIKEFSWDILK-----------------------------IAVHTDIHAKYKYSSE-
        S +TW+  +                           I N+ + +     WD                                  V  D+++  K  ++ 
Subjt:  SWFTWIEII---------------------------IENEMSSQIKEFSWDILK-----------------------------IAVHTDIHAKYKYSSE-

Query:  ----PVQSVDSLSSTNEDDKNDNNNFDP---------------------------------LQR-----------TLETETETE----------------
            P  S    +ST E  K DN++ DP                                 +Q            TL ++TE                  
Subjt:  ----PVQSVDSLSSTNEDDKNDNNNFDP---------------------------------LQR-----------TLETETETE----------------

Query:  -------------------TPSKHVAPSSHAKKNHVYAKDQIISNGCQ---------KCLLKLQGYVGGGTDKTLFISWTNKNIIMAQLYVDDIVFGGFQ
                              K    S H K  HVY  ++ +    Q             + +GY  G  DKTLFI   +  I++AQ+YVDDI+FGGF 
Subjt:  -------------------TPSKHVAPSSHAKKNHVYAKDQIISNGCQ---------KCLLKLQGYVGGGTDKTLFISWTNKNIIMAQLYVDDIVFGGFQ

Query:  DDIVD-----------------------LSN---------------------------------------------SHLMAAKIIIKYVHGTYDFGWAGC
         D+V+                       L N                                             SHL A K I+KYVHGT DFG    
Subjt:  DDIVD-----------------------LSN---------------------------------------------SHLMAAKIIIKYVHGTYDFGWAGC

Query:  SDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLIWMKQMLCEYGISRDTMILYSDSISVIDISMNLVQHSRTKH
         +     +  C         W S   N     K++ EY+ +GS  +QLIWMK +L EYG  +DT+ LY D++S IDIS N VQHSRTKH
Subjt:  SDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLIWMKQMLCEYGISRDTMILYSDSISVIDISMNLVQHSRTKH

A0A5A7SYL2 Gag-pol polyprotein7.5e-5327.88Show/hide
Query:  RKRDYDKPNGRYSKSFKYKRCGGHGYYQAECLTYLRRQKKSFGATLSNEES---DVSNEEAEYTNVVISITSKDES---------------LQRQWKENS
        R  D+ K      +SF+ + C    +YQAECLTYLRRQKK++ ATL +E+S   +V +    +T  +  I S+ ES               L+   KE+S
Subjt:  RKRDYDKPNGRYSKSFKYKRCGGHGYYQAECLTYLRRQKKSFGATLSNEES---DVSNEEAEYTNVVISITSKDES---------------LQRQWKENS

Query:  PARAIQKENIQKLLDENQQL--------------------------LTEEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIR
         ARAIQKE IQ L++EN++L                          L +  ++L  +KLGH SL ++   ++++A++ IP+L+IN + F  DCQ GK+ +
Subjt:  PARAIQKENIQKLLDENQQL--------------------------LTEEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIR

Query:  TSHKSISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDYSWFTWIEII---------IENEMSSQI-------------------------KEFS
        TSH+S+ ECYT RVLE LH+DLMGPMQ +SL  KKYV + VDDYS  TW++ +         I +E ++ I                         K   
Subjt:  TSHKSISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDYSWFTWIEII---------IENEMSSQI-------------------------KEFS

Query:  WDILKIAVHTDIHAKYKYSSEPVQSV----------------------DSLSSTNEDDKNDNNNF-----------DPLQRTLETETETETPSKHVAPSS
         +    A++T  H   K+  +  Q +                      ++++    D +++ N F           D     L+   + ++    + P+S
Subjt:  WDILKIAVHTDIHAKYKYSSEPVQSV----------------------DSLSSTNEDDKNDNNNF-----------DPLQRTLETETETETPSKHVAPSS

Query:  ------------HAKKNHVYA-----KDQIISNG-------------------------------CQKCLLKL-----------------QGYVGGGTDK
                      KK++  +     K ++++ G                                 K L  L                 +GY  G TDK
Subjt:  ------------HAKKNHVYA-----KDQIISNG-------------------------------CQKCLLKL-----------------QGYVGGGTDK

Query:  TLFISWTNKNIIMAQLYVDDIVFGGFQDDIV---------------------------------------------------------------------
         LF + T+ ++I+AQ+YVDDI+FGGF   ++                                                                     
Subjt:  TLFISWTNKNIIMAQLYVDDIVFGGFQDDIV---------------------------------------------------------------------

Query:  --------------DLSNSHLMAAKIIIKYVHGTYDFG------------------WAGCSDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIV
                      D   SHL   K IIKYVHGT DFG                  WAG +DDRK+T  GCFFLGNN++ WFSKKQN +SLS +E EYIV
Subjt:  --------------DLSNSHLMAAKIIIKYVHGTYDFG------------------WAGCSDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIV

Query:  VGS
        VGS
Subjt:  VGS

A0A5D3D0U1 F5J5.12.3e-4628.86Show/hide
Query:  QLLTEEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIRTSHKSISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDY
        QL   +QT +WH+KLGH S+  +   +K++AI+ IP+LD+N + F +DCQ GK+ R++HKS+ ECYTNRVLE LHMDLMGPMQTKSL  K+YV + VDDY
Subjt:  QLLTEEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIRTSHKSISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDY

Query:  SWFTWIEII---------------------------IENEMSSQIKEFSWDILK-----------------------------IAVHTDIHAKYKYSSE-
        S +TW+  +                           I N+ + +     WD                                  V  D+++  K  ++ 
Subjt:  SWFTWIEII---------------------------IENEMSSQIKEFSWDILK-----------------------------IAVHTDIHAKYKYSSE-

Query:  ----PVQSVDSLSSTNEDDKNDNNNFDP---------------------------------LQR-----------TLETETETE----------------
            P  S    +ST E  K DN++ DP                                 +Q            TL ++TE                  
Subjt:  ----PVQSVDSLSSTNEDDKNDNNNFDP---------------------------------LQR-----------TLETETETE----------------

Query:  -------------------TPSKHVAPSSHAKKNHVYAKDQIISNGCQ---------KCLLKLQGYVGGGTDKTLFISWTNKNIIMAQLYVDDIVFGGFQ
                              K    S H K  HVY  ++ +    Q             + +GY  G  DKTLFI   +  I++AQ+YVDDI+FGGF 
Subjt:  -------------------TPSKHVAPSSHAKKNHVYAKDQIISNGCQ---------KCLLKLQGYVGGGTDKTLFISWTNKNIIMAQLYVDDIVFGGFQ

Query:  DDIVD-----------------------LSN---------------------------------------------SHLMAAKIIIKYVHGTYDFGWAGC
         D+V+                       L N                                             SHL A K I+KYVHGT DFG    
Subjt:  DDIVD-----------------------LSN---------------------------------------------SHLMAAKIIIKYVHGTYDFGWAGC

Query:  SDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLIWMKQMLCEYGISRDTMILYSDSISVIDISMNLVQHSRTKH
         +     +  C         W S   N     K++ EY+ +GS  +QLIWMK +L EYG  +DT+ LY D++S IDIS N VQHSRTKH
Subjt:  SDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLIWMKQMLCEYGISRDTMILYSDSISVIDISMNLVQHSRTKH

SwissProt top hitse value%identityAlignment
P04146 Copia protein6.4e-0936.63Show/hide
Query:  GTYDFGWAGCSDDRKNTLEGCFFLGN-NLISWFSKKQNSISLSKSEVEYIVVGSARSQLIWMKQMLCEYGISRDTMI-LYSDSISVIDISMNLVQHSRTK
        G  D  WAG   DRK+T    F + + NLI W +K+QNS++ S +E EY+ +  A  + +W+K +L    I  +  I +Y D+   I I+ N   H R K
Subjt:  GTYDFGWAGCSDDRKNTLEGCFFLGN-NLISWFSKKQNSISLSKSEVEYIVVGSARSQLIWMKQMLCEYGISRDTMI-LYSDSISVIDISMNLVQHSRTK

Query:  H
        H
Subjt:  H

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-1435.57Show/hide
Query:  HLMAAKIIIKYVHGT-----------------YDFGWAGCSDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLIWMKQMLCEYGI
        H  A K I++Y+ GT                  D   AG  D+RK++    F      ISW SK Q  ++LS +E EYI       ++IW+K+ L E G+
Subjt:  HLMAAKIIIKYVHGT-----------------YDFGWAGCSDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLIWMKQMLCEYGI

Query:  SRDTMILYSDSISVIDISMNLVQHSRTKH---RLHWTTKK-RKESLKRL
         +   ++Y DS S ID+S N + H+RTKH   R HW  +    ESLK L
Subjt:  SRDTMILYSDSISVIDISMNLVQHSRTKH---RLHWTTKK-RKESLKRL

P92519 Uncharacterized mitochondrial protein AtMg008107.9e-0742.11Show/hide
Query:  DFGWAGCSDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLIW
        D  WAGC+  R++T   C FLG N+ISW +K+Q ++S S +E EY  +    ++L W
Subjt:  DFGWAGCSDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLIW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.3e-1132.84Show/hide
Query:  SNSHLMAAKIIIKYVHGTYDFG------------------WAGCSDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLIWMKQMLC
        +  HL A K I++Y+ GT + G                  WAG  DD  +T     +LG++ ISW SKKQ  +  S +E EY  V +  S++ W+  +L 
Subjt:  SNSHLMAAKIIIKYVHGTYDFG------------------WAGCSDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLIWMKQMLC

Query:  EYGIS-RDTMILYSDSISVIDISMNLVQHSRTKH
        E GI      ++Y D++    +  N V HSR KH
Subjt:  EYGIS-RDTMILYSDSISVIDISMNLVQHSRTKH

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.1e-1132.84Show/hide
Query:  SNSHLMAAKIIIKYVHGTYDFG------------------WAGCSDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLIWMKQMLC
        ++ H  A K +++Y+ GT D G                  WAG +DD  +T     +LG++ ISW SKKQ  +  S +E EY  V +  S+L W+  +L 
Subjt:  SNSHLMAAKIIIKYVHGTYDFG------------------WAGCSDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLIWMKQMLC

Query:  EYGIS-RDTMILYSDSISVIDISMNLVQHSRTKH
        E GI      ++Y D++    +  N V HSR KH
Subjt:  EYGIS-RDTMILYSDSISVIDISMNLVQHSRTKH

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.6e-1333.83Show/hide
Query:  SHLMAAKIIIKYVHGTY------------------DFGWAGCSDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLIWMKQMLCE-
        +H  A   I+ Y+ GT                   D  +  C D R++T   C FLG +LISW SKKQ  +S S +E EY  +  A  +++W+ Q   E 
Subjt:  SHLMAAKIIIKYVHGTY------------------DFGWAGCSDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLIWMKQMLCE-

Query:  -YGISRDTMILYSDSISVIDISMNLVQHSRTKH
           +S+ T +L+ D+ + I I+ N V H RTKH
Subjt:  -YGISRDTMILYSDSISVIDISMNLVQHSRTKH

ATMG00810.1 DNA/RNA polymerases superfamily protein5.6e-0842.11Show/hide
Query:  DFGWAGCSDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLIW
        D  WAGC+  R++T   C FLG N+ISW +K+Q ++S S +E EY  +    ++L W
Subjt:  DFGWAGCSDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLIW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAGAGAACAAAACAATTATAGAAAGAGAGATTATGACAAACCAAATGGTAGATATAGCAAATCATTCAAATATAAGAGATGTGGTGGTCATGGCTATTATCAGGC
TGAATGTCTGACTTACCTGAGGAGACAGAAGAAAAGTTTTGGTGCTACCCTATCTAATGAAGAGTCTGATGTAAGTAACGAGGAAGCAGAATATACCAATGTTGTTATCA
GTATTACATCTAAAGATGAATCTTTGCAACGTCAATGGAAGGAGAATTCGCCAGCTCGAGCTATTCAGAAGGAAAATATACAAAAGTTGCTGGATGAAAATCAACAATTA
TTAACTGAAGAACAAACTCAGCTGTGGCATAGGAAACTTGGACATGCAAGTCTCAGTACAGTAAGCAACGATCTGAAACACGATGCTATCTTGAGAATACCAAATCTGGA
TATAAATAGTCAGTTATTCTATAAAGATTGTCAACGTGGCAAGAAAATCAGAACATCACATAAAAGTATTAGTGAATGCTATACTAATAGAGTTCTTGAATTTCTTCATA
TGGATCTAATGGGTCCAATGCAAACCAAGAGCCTTGACAGAAAGAAGTATGTATTTATTTGTGTTGATGATTATTCATGGTTTACGTGGATCGAGATTATCATAGAAAAT
GAGATGTCAAGTCAGATAAAGGAATTTTCTTGGGATATTCTCAAAATAGCCGTGCATACAGATATTCATGCAAAATACAAATATTCTTCTGAACCTGTGCAAAGTGTTGA
CAGTTTGTCTTCAACTAACGAGGATGATAAAAATGATAACAACAACTTTGATCCTCTTCAAAGAACCTTGGAAACTGAGACTGAAACAGAGACTCCCTCTAAACATGTTG
CTCCATCATCACATGCCAAAAAGAATCACGTGTATGCAAAAGATCAAATTATATCAAATGGATGTCAAAAGTGCCTTCTTAAATTGCAAGGATATGTTGGAGGAGGCACT
GATAAAACTCTATTTATTAGTTGGACAAACAAAAACATAATTATGGCTCAACTATATGTTGATGATATCGTCTTTGGTGGTTTTCAAGACGATATTGTTGATCTAAGTAA
CTCACATCTTATGGCTGCTAAAATAATTATCAAATATGTTCATGGAACCTATGACTTTGGTTGGGCTGGATGCTCTGATGATAGGAAAAACACTTTAGAAGGGTGTTTCT
TTCTAGGAAACAATTTAATATCTTGGTTCAGTAAGAAGCAAAATTCTATTTCTCTATCTAAATCTGAAGTTGAATACATTGTTGTAGGAAGTGCACGTTCTCAACTAATT
TGGATGAAACAAATGTTGTGTGAGTATGGTATTTCTCGAGATACCATGATTCTTTACAGTGATAGTATAAGTGTAATTGACATTTCGATGAATCTTGTTCAACACAGTAG
AACTAAACATAGATTGCATTGGACGACCAAGAAAAGGAAAGAATCTCTAAAGAGGTTATTTGGCATCTTGGAGATGGGTCTAGGCGTTTTGAGATGGCTTAACGAAACGA
CTTTAGAGGCGAGAAGAACTTCCAAGTGGGTGTCATTGGGTGAGAAAGGTGCTATGCAAGAAGACATAGAGGTTAGTAAGGGATTTTGGACGTATAAGGTTGGTAAGAAA
GAAGGATCATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGAGAGAACAAAACAATTATAGAAAGAGAGATTATGACAAACCAAATGGTAGATATAGCAAATCATTCAAATATAAGAGATGTGGTGGTCATGGCTATTATCAGGC
TGAATGTCTGACTTACCTGAGGAGACAGAAGAAAAGTTTTGGTGCTACCCTATCTAATGAAGAGTCTGATGTAAGTAACGAGGAAGCAGAATATACCAATGTTGTTATCA
GTATTACATCTAAAGATGAATCTTTGCAACGTCAATGGAAGGAGAATTCGCCAGCTCGAGCTATTCAGAAGGAAAATATACAAAAGTTGCTGGATGAAAATCAACAATTA
TTAACTGAAGAACAAACTCAGCTGTGGCATAGGAAACTTGGACATGCAAGTCTCAGTACAGTAAGCAACGATCTGAAACACGATGCTATCTTGAGAATACCAAATCTGGA
TATAAATAGTCAGTTATTCTATAAAGATTGTCAACGTGGCAAGAAAATCAGAACATCACATAAAAGTATTAGTGAATGCTATACTAATAGAGTTCTTGAATTTCTTCATA
TGGATCTAATGGGTCCAATGCAAACCAAGAGCCTTGACAGAAAGAAGTATGTATTTATTTGTGTTGATGATTATTCATGGTTTACGTGGATCGAGATTATCATAGAAAAT
GAGATGTCAAGTCAGATAAAGGAATTTTCTTGGGATATTCTCAAAATAGCCGTGCATACAGATATTCATGCAAAATACAAATATTCTTCTGAACCTGTGCAAAGTGTTGA
CAGTTTGTCTTCAACTAACGAGGATGATAAAAATGATAACAACAACTTTGATCCTCTTCAAAGAACCTTGGAAACTGAGACTGAAACAGAGACTCCCTCTAAACATGTTG
CTCCATCATCACATGCCAAAAAGAATCACGTGTATGCAAAAGATCAAATTATATCAAATGGATGTCAAAAGTGCCTTCTTAAATTGCAAGGATATGTTGGAGGAGGCACT
GATAAAACTCTATTTATTAGTTGGACAAACAAAAACATAATTATGGCTCAACTATATGTTGATGATATCGTCTTTGGTGGTTTTCAAGACGATATTGTTGATCTAAGTAA
CTCACATCTTATGGCTGCTAAAATAATTATCAAATATGTTCATGGAACCTATGACTTTGGTTGGGCTGGATGCTCTGATGATAGGAAAAACACTTTAGAAGGGTGTTTCT
TTCTAGGAAACAATTTAATATCTTGGTTCAGTAAGAAGCAAAATTCTATTTCTCTATCTAAATCTGAAGTTGAATACATTGTTGTAGGAAGTGCACGTTCTCAACTAATT
TGGATGAAACAAATGTTGTGTGAGTATGGTATTTCTCGAGATACCATGATTCTTTACAGTGATAGTATAAGTGTAATTGACATTTCGATGAATCTTGTTCAACACAGTAG
AACTAAACATAGATTGCATTGGACGACCAAGAAAAGGAAAGAATCTCTAAAGAGGTTATTTGGCATCTTGGAGATGGGTCTAGGCGTTTTGAGATGGCTTAACGAAACGA
CTTTAGAGGCGAGAAGAACTTCCAAGTGGGTGTCATTGGGTGAGAAAGGTGCTATGCAAGAAGACATAGAGGTTAGTAAGGGATTTTGGACGTATAAGGTTGGTAAGAAA
GAAGGATCATAG
Protein sequenceShow/hide protein sequence
MKREQNNYRKRDYDKPNGRYSKSFKYKRCGGHGYYQAECLTYLRRQKKSFGATLSNEESDVSNEEAEYTNVVISITSKDESLQRQWKENSPARAIQKENIQKLLDENQQL
LTEEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIRTSHKSISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDYSWFTWIEIIIEN
EMSSQIKEFSWDILKIAVHTDIHAKYKYSSEPVQSVDSLSSTNEDDKNDNNNFDPLQRTLETETETETPSKHVAPSSHAKKNHVYAKDQIISNGCQKCLLKLQGYVGGGT
DKTLFISWTNKNIIMAQLYVDDIVFGGFQDDIVDLSNSHLMAAKIIIKYVHGTYDFGWAGCSDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLI
WMKQMLCEYGISRDTMILYSDSISVIDISMNLVQHSRTKHRLHWTTKKRKESLKRLFGILEMGLGVLRWLNETTLEARRTSKWVSLGEKGAMQEDIEVSKGFWTYKVGKK
EGS