; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G13490 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G13490
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr5:13193200..13195135
RNA-Seq ExpressionCSPI05G13490
SyntenyCSPI05G13490
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR025724 - GAG-pre-integrase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047995.1 retrotransposon protein, putative, Ty1-copia sub-class [Cucumis melo var. makuwa]8.3e-14463.15Show/hide
Query:  MTSTRFEVAKFNGMGDFALWRKKIRAILVQQKVAKILDEENLSETITKSEKRDMDEMAYSMFFLYLSDDVVRLEDEATTTGELWKMLESLYLTKSLPNKL
        M STRFEV+KFNG GDFALWRKKIRAILVQ KVAKILDEE L + IT+SEKRDMDEMAY    LYLSD+V+RL DEATTTGELWK LESLYLTKSLPNK+
Subjt:  MTSTRFEVAKFNGMGDFALWRKKIRAILVQQKVAKILDEENLSETITKSEKRDMDEMAYSMFFLYLSDDVVRLEDEATTTGELWKMLESLYLTKSLPNKL

Query:  HLKEKFFGYSMNQSKGLKENLNKFLKIIVDLNNIDGKMLDENQTVILLNSLPKTYQEVLATIKYGRDSLAMNVVLDALNTRNLEIKKERKDGELLMAKGR
        ++KEKFFGY M+QSK L+ENL++F KI+VDLNNI  KM DENQ VILLNSLP+TY+EV A IKYG DSL M++VLDAL TRNLEIKKERKDGELLMA+GR
Subjt:  HLKEKFFGYSMNQSKGLKENLNKFLKIIVDLNNIDGKMLDENQTVILLNSLPKTYQEVLATIKYGRDSLAMNVVLDALNTRNLEIKKERKDGELLMAKGR

Query:  SEKRSWKGKEKSSKSKSRGK-----------------------------ANVTNRYDSTEVLMG--------------LIGILRMLGSWIQ---------
        SEK+SWKGKE+S +SKS+GK                             ANVT+ Y+S E+  G              ++    +  +WI          
Subjt:  SEKRSWKGKEKSSKSKSRGK-----------------------------ANVTNRYDSTEVLMG--------------LIGILRMLGSWIQ---------

Query:  ------------------------------GALQIATHDGIIRVLTNVSYVPELKCNLISLGDLERSIYTYKSKNGVLKVSKDSLVKSRGTLRNGLYVLE
                                      G++QIATHDG++R+LTNV YVP+LK NLISLG+L+RS  T KS+NGV+KV+K SLVK RGTLR+GLYVLE
Subjt:  ------------------------------GALQIATHDGIIRVLTNVSYVPELKCNLISLGDLERSIYTYKSKNGVLKVSKDSLVKSRGTLRNGLYVLE

Query:  GTAVSGSAAIASGKVTNMSMLWHKRLAHVSEIGLQALSQQGLLGGVKDVELPFCKHCIMEKSTR
        GT VSGSAAIASGKVTNMSMLWHKRLAHVSE GLQALSQQGLLGGVK+VELPFC+HCIM KSTR
Subjt:  GTAVSGSAAIASGKVTNMSMLWHKRLAHVSEIGLQALSQQGLLGGVKDVELPFCKHCIMEKSTR

KAA0050070.1 putative polyprotein [Cucumis melo var. makuwa]1.7e-12562.53Show/hide
Query:  MTSTRFEVAKFNGMGDFALWRKKIRAILVQQKVAKILDEENLSETITKSEKRDMDEMAYSMFFLYLSDDVVRLEDEATTTGELWKMLESLYLTKSLPNKL
        M STRFEV+KFNG GDFALWRKKIRAILVQ KVAKILDEE L E IT+SEKRDMDEM YS   LYLSD+V+RL DEATT GELWK LESLY+ KSL NK+
Subjt:  MTSTRFEVAKFNGMGDFALWRKKIRAILVQQKVAKILDEENLSETITKSEKRDMDEMAYSMFFLYLSDDVVRLEDEATTTGELWKMLESLYLTKSLPNKL

Query:  HLKEKFFGYSMNQSKGLKENLNKFLKIIVDLNNIDGKMLDENQTVILLNSLPKTYQEVLATIKYGRDSLAMNVVLDALNTRNLEIKKERKDGELLMAKGR
        ++KEKFFGY M+QSKGL+ENL++F KIIVDLNNI  KM DENQ VILLNSLP+ Y+EV A IKYGRDSL M+++LDAL TRNLEIKKERKDGELLMA+ R
Subjt:  HLKEKFFGYSMNQSKGLKENLNKFLKIIVDLNNIDGKMLDENQTVILLNSLPKTYQEVLATIKYGRDSLAMNVVLDALNTRNLEIKKERKDGELLMAKGR

Query:  SEKRSWKGKEKSSKSKSRGK--------------------------------------ANVTNRYDSTE-------VLMGLIGILRMLGSWIQGALQIAT
        SEK+SWKGKE+SS+SKS+ K                                      A +TN YDS E       VLM      RMLGSWIQ       
Subjt:  SEKRSWKGKEKSSKSKSRGK--------------------------------------ANVTNRYDSTE-------VLMGLIGILRMLGSWIQGALQIAT

Query:  HDGIIRVLTNVSYVPELKCNLISLGDLERSIYTYKSKNGVLKVSKDSLVKSRGTLRNGLYVLEGTAVSGSAAIASGKVTNMSMLWHKRLAHVSEIGLQAL
                                   +RS  T K +N V+KV+K SLVK RGTLR+GLYVLEGT VSGSAAIASGKV +MSMLWH+RLAHVSE GLQAL
Subjt:  HDGIIRVLTNVSYVPELKCNLISLGDLERSIYTYKSKNGVLKVSKDSLVKSRGTLRNGLYVLEGTAVSGSAAIASGKVTNMSMLWHKRLAHVSEIGLQAL

Query:  SQQGLLGGVKDVELPFCKHCIMEKSTR
        SQQGLL GVK+VELPFC+HCIM KSTR
Subjt:  SQQGLLGGVKDVELPFCKHCIMEKSTR

KAA0050719.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]6.4e-14462.93Show/hide
Query:  MTSTRFEVAKFNGMGDFALWRKKIRAILVQQKVAKILDEENLSETITKSEKRDMDEMAYSMFFLYLSDDVVRLEDEATTTGELWKMLESLYLTKSLPNKL
        M STRFEV+KFNG GDF+LWRKKIRAILVQ KVAKILDEE L + IT+SEKRDMDEMAYS   LYLSD+V+RL DEATTTGELWK LESLYLTKSL NK+
Subjt:  MTSTRFEVAKFNGMGDFALWRKKIRAILVQQKVAKILDEENLSETITKSEKRDMDEMAYSMFFLYLSDDVVRLEDEATTTGELWKMLESLYLTKSLPNKL

Query:  HLKEKFFGYSMNQSKGLKENLNKFLKIIVDLNNIDGKMLDENQTVILLNSLPKTYQEVLATIKYGRDSLAMNVVLDALNTRNLEIKKERKDGELLMAKGR
        ++KEKFFGY M+QSK L+ENL++F KI+VDLNNI  KM DENQ VILLNSLP+TY+EV A IKYGRDSL M++VLDAL TRNLEIKKERKDGELLMA+GR
Subjt:  HLKEKFFGYSMNQSKGLKENLNKFLKIIVDLNNIDGKMLDENQTVILLNSLPKTYQEVLATIKYGRDSLAMNVVLDALNTRNLEIKKERKDGELLMAKGR

Query:  SEKRSWKGKEKSSKSKSRGK-----------------------------ANVTNRYDSTEVLMG--------------LIGILRMLGSWIQ---------
        SEK+SWKGKE+S +SKS+GK                             ANVT+ Y+S E+  G              ++    +  +WI          
Subjt:  SEKRSWKGKEKSSKSKSRGK-----------------------------ANVTNRYDSTEVLMG--------------LIGILRMLGSWIQ---------

Query:  ------------------------------GALQIATHDGIIRVLTNVSYVPELKCNLISLGDLERSIYTYKSKNGVLKVSKDSLVKSRGTLRNGLYVLE
                                      G++QIATHDG++R+LTNV YVP+LK NLISLG+L+RS  T KS+NGV+KV+K SLVK RGTLR+GLYVLE
Subjt:  ------------------------------GALQIATHDGIIRVLTNVSYVPELKCNLISLGDLERSIYTYKSKNGVLKVSKDSLVKSRGTLRNGLYVLE

Query:  GTAVSGSAAIASGKVTNMSMLWHKRLAHVSEIGLQALSQQGLLGGVKDVELPFCKHCIMEKSTR
        GT VSGSAAIASGKVT+MSMLWHKRLAHVSE GLQALSQQGLLGGVK+VELPFC+HCIM KSTR
Subjt:  GTAVSGSAAIASGKVTNMSMLWHKRLAHVSEIGLQALSQQGLLGGVKDVELPFCKHCIMEKSTR

KAA0067607.1 retrotransposon protein, putative, Ty1-copia subclass [Cucumis melo var. makuwa]4.1e-13561.57Show/hide
Query:  MTSTRFEVAKFNGMGDFALWRKKIRAILVQQKVAKILDEENLSETITKSEKRDMDEMAYSMFFLYLSDDVVRLEDEATTTGELWKMLESLYLTKSLPNKL
        M STRFE++KFNG GDFA+WRKKIR ILVQ KVAKILDEE L E IT+SEKRDMDEM YS   LYLSD+V+RL DEATTTGELWK LESLYLTK LPN  
Subjt:  MTSTRFEVAKFNGMGDFALWRKKIRAILVQQKVAKILDEENLSETITKSEKRDMDEMAYSMFFLYLSDDVVRLEDEATTTGELWKMLESLYLTKSLPNKL

Query:  HLKEKFFGYSMNQSKGLKENLNKFLKIIVDLNNIDGKMLDENQTVILLNSLPKTYQEVLATIKYGRDSLAMNVVLDALNTRNLEIKKERKDGELLMAKGR
          KEKFF Y M+QSK L+ENL++F KIIVDL+NI  KM DENQ VILLNSLP+TY+EV A IKYGRDSL M++VLDAL TRNLEIKKERKDGELLMA+GR
Subjt:  HLKEKFFGYSMNQSKGLKENLNKFLKIIVDLNNIDGKMLDENQTVILLNSLPKTYQEVLATIKYGRDSLAMNVVLDALNTRNLEIKKERKDGELLMAKGR

Query:  SEKRSWKGKEKSSKSKSRGK-------------------------------------ANVTNRYDSTEVLMGLIGILRMLGSWIQ---------------
        SEK+SWKGKE+SS+SKS+GK                                     A +T+ Y+S EVLM  +    +  +WI                
Subjt:  SEKRSWKGKEKSSKSKSRGK-------------------------------------ANVTNRYDSTEVLMGLIGILRMLGSWIQ---------------

Query:  ------------------------GALQIATHDGIIRVLTNVSYVPELKCNLISLGDLERSIYTYKSKNGVLKVSKDSLVKSRGTLRNGLYVLEGTAVSG
                                G++QIATHD ++R+LTNV YVP+LK NLISLG+L+RS  T KS+NGV+KV+K SLVK RGTLR+GLYVLEGT VSG
Subjt:  ------------------------GALQIATHDGIIRVLTNVSYVPELKCNLISLGDLERSIYTYKSKNGVLKVSKDSLVKSRGTLRNGLYVLEGTAVSG

Query:  SAAIASGKVTNMSMLWHKRLAHVSEIGLQALSQQGLLGGVKDVELPFCKHCIMEKSTR
        S AIASGKVT M MLWH RLAHVSE GLQALSQQGLL GVK+VEL FC+HCIM K TR
Subjt:  SAAIASGKVTNMSMLWHKRLAHVSEIGLQALSQQGLLGGVKDVELPFCKHCIMEKSTR

TYK25306.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]2.0e-14563.36Show/hide
Query:  MTSTRFEVAKFNGMGDFALWRKKIRAILVQQKVAKILDEENLSETITKSEKRDMDEMAYSMFFLYLSDDVVRLEDEATTTGELWKMLESLYLTKSLPNKL
        M STRFEV+KFNG GDFALWRKKIRAILVQ KVAKILDEE L + IT+SEKRDMDEMAYS   LYLSD+V+RL DEATTTGELWK LESLYLTKSLPNK+
Subjt:  MTSTRFEVAKFNGMGDFALWRKKIRAILVQQKVAKILDEENLSETITKSEKRDMDEMAYSMFFLYLSDDVVRLEDEATTTGELWKMLESLYLTKSLPNKL

Query:  HLKEKFFGYSMNQSKGLKENLNKFLKIIVDLNNIDGKMLDENQTVILLNSLPKTYQEVLATIKYGRDSLAMNVVLDALNTRNLEIKKERKDGELLMAKGR
        ++KEKFFGY M+QSK L+ENL++F KI+VDLNNI  KM DENQ VILLNSLP+TY+EV A IKYGRDSL M++VLDAL TRNLEIKKERKDGELLMA+GR
Subjt:  HLKEKFFGYSMNQSKGLKENLNKFLKIIVDLNNIDGKMLDENQTVILLNSLPKTYQEVLATIKYGRDSLAMNVVLDALNTRNLEIKKERKDGELLMAKGR

Query:  SEKRSWKGKEKSSKSKSRGK-----------------------------ANVTNRYDSTEVLMG--------------LIGILRMLGSWIQ---------
        SEK+SWKGKE+S +SKS+GK                             ANVT+ Y+S E+  G              ++    +  +WI          
Subjt:  SEKRSWKGKEKSSKSKSRGK-----------------------------ANVTNRYDSTEVLMG--------------LIGILRMLGSWIQ---------

Query:  ------------------------------GALQIATHDGIIRVLTNVSYVPELKCNLISLGDLERSIYTYKSKNGVLKVSKDSLVKSRGTLRNGLYVLE
                                      G++QIATHDG++R+LTNV YVP+LK NLISLG+L+RS  T KS+NGV+KV+K SLVK RGTLR+GLYVLE
Subjt:  ------------------------------GALQIATHDGIIRVLTNVSYVPELKCNLISLGDLERSIYTYKSKNGVLKVSKDSLVKSRGTLRNGLYVLE

Query:  GTAVSGSAAIASGKVTNMSMLWHKRLAHVSEIGLQALSQQGLLGGVKDVELPFCKHCIMEKSTR
        GT VSGSAAIASGKVT+MSMLWHKRLAHVSE GLQALSQQGLLGGVK+VELPFC+HCIM KSTR
Subjt:  GTAVSGSAAIASGKVTNMSMLWHKRLAHVSEIGLQALSQQGLLGGVKDVELPFCKHCIMEKSTR

TrEMBL top hitse value%identityAlignment
A0A5A7U2U7 Retrotransposon protein, putative, Ty1-copia sub-class4.0e-14463.15Show/hide
Query:  MTSTRFEVAKFNGMGDFALWRKKIRAILVQQKVAKILDEENLSETITKSEKRDMDEMAYSMFFLYLSDDVVRLEDEATTTGELWKMLESLYLTKSLPNKL
        M STRFEV+KFNG GDFALWRKKIRAILVQ KVAKILDEE L + IT+SEKRDMDEMAY    LYLSD+V+RL DEATTTGELWK LESLYLTKSLPNK+
Subjt:  MTSTRFEVAKFNGMGDFALWRKKIRAILVQQKVAKILDEENLSETITKSEKRDMDEMAYSMFFLYLSDDVVRLEDEATTTGELWKMLESLYLTKSLPNKL

Query:  HLKEKFFGYSMNQSKGLKENLNKFLKIIVDLNNIDGKMLDENQTVILLNSLPKTYQEVLATIKYGRDSLAMNVVLDALNTRNLEIKKERKDGELLMAKGR
        ++KEKFFGY M+QSK L+ENL++F KI+VDLNNI  KM DENQ VILLNSLP+TY+EV A IKYG DSL M++VLDAL TRNLEIKKERKDGELLMA+GR
Subjt:  HLKEKFFGYSMNQSKGLKENLNKFLKIIVDLNNIDGKMLDENQTVILLNSLPKTYQEVLATIKYGRDSLAMNVVLDALNTRNLEIKKERKDGELLMAKGR

Query:  SEKRSWKGKEKSSKSKSRGK-----------------------------ANVTNRYDSTEVLMG--------------LIGILRMLGSWIQ---------
        SEK+SWKGKE+S +SKS+GK                             ANVT+ Y+S E+  G              ++    +  +WI          
Subjt:  SEKRSWKGKEKSSKSKSRGK-----------------------------ANVTNRYDSTEVLMG--------------LIGILRMLGSWIQ---------

Query:  ------------------------------GALQIATHDGIIRVLTNVSYVPELKCNLISLGDLERSIYTYKSKNGVLKVSKDSLVKSRGTLRNGLYVLE
                                      G++QIATHDG++R+LTNV YVP+LK NLISLG+L+RS  T KS+NGV+KV+K SLVK RGTLR+GLYVLE
Subjt:  ------------------------------GALQIATHDGIIRVLTNVSYVPELKCNLISLGDLERSIYTYKSKNGVLKVSKDSLVKSRGTLRNGLYVLE

Query:  GTAVSGSAAIASGKVTNMSMLWHKRLAHVSEIGLQALSQQGLLGGVKDVELPFCKHCIMEKSTR
        GT VSGSAAIASGKVTNMSMLWHKRLAHVSE GLQALSQQGLLGGVK+VELPFC+HCIM KSTR
Subjt:  GTAVSGSAAIASGKVTNMSMLWHKRLAHVSEIGLQALSQQGLLGGVKDVELPFCKHCIMEKSTR

A0A5A7U459 Putative polyprotein8.4e-12662.53Show/hide
Query:  MTSTRFEVAKFNGMGDFALWRKKIRAILVQQKVAKILDEENLSETITKSEKRDMDEMAYSMFFLYLSDDVVRLEDEATTTGELWKMLESLYLTKSLPNKL
        M STRFEV+KFNG GDFALWRKKIRAILVQ KVAKILDEE L E IT+SEKRDMDEM YS   LYLSD+V+RL DEATT GELWK LESLY+ KSL NK+
Subjt:  MTSTRFEVAKFNGMGDFALWRKKIRAILVQQKVAKILDEENLSETITKSEKRDMDEMAYSMFFLYLSDDVVRLEDEATTTGELWKMLESLYLTKSLPNKL

Query:  HLKEKFFGYSMNQSKGLKENLNKFLKIIVDLNNIDGKMLDENQTVILLNSLPKTYQEVLATIKYGRDSLAMNVVLDALNTRNLEIKKERKDGELLMAKGR
        ++KEKFFGY M+QSKGL+ENL++F KIIVDLNNI  KM DENQ VILLNSLP+ Y+EV A IKYGRDSL M+++LDAL TRNLEIKKERKDGELLMA+ R
Subjt:  HLKEKFFGYSMNQSKGLKENLNKFLKIIVDLNNIDGKMLDENQTVILLNSLPKTYQEVLATIKYGRDSLAMNVVLDALNTRNLEIKKERKDGELLMAKGR

Query:  SEKRSWKGKEKSSKSKSRGK--------------------------------------ANVTNRYDSTE-------VLMGLIGILRMLGSWIQGALQIAT
        SEK+SWKGKE+SS+SKS+ K                                      A +TN YDS E       VLM      RMLGSWIQ       
Subjt:  SEKRSWKGKEKSSKSKSRGK--------------------------------------ANVTNRYDSTE-------VLMGLIGILRMLGSWIQGALQIAT

Query:  HDGIIRVLTNVSYVPELKCNLISLGDLERSIYTYKSKNGVLKVSKDSLVKSRGTLRNGLYVLEGTAVSGSAAIASGKVTNMSMLWHKRLAHVSEIGLQAL
                                   +RS  T K +N V+KV+K SLVK RGTLR+GLYVLEGT VSGSAAIASGKV +MSMLWH+RLAHVSE GLQAL
Subjt:  HDGIIRVLTNVSYVPELKCNLISLGDLERSIYTYKSKNGVLKVSKDSLVKSRGTLRNGLYVLEGTAVSGSAAIASGKVTNMSMLWHKRLAHVSEIGLQAL

Query:  SQQGLLGGVKDVELPFCKHCIMEKSTR
        SQQGLL GVK+VELPFC+HCIM KSTR
Subjt:  SQQGLLGGVKDVELPFCKHCIMEKSTR

A0A5A7UB25 Putative gag-pol polyprotein3.1e-14462.93Show/hide
Query:  MTSTRFEVAKFNGMGDFALWRKKIRAILVQQKVAKILDEENLSETITKSEKRDMDEMAYSMFFLYLSDDVVRLEDEATTTGELWKMLESLYLTKSLPNKL
        M STRFEV+KFNG GDF+LWRKKIRAILVQ KVAKILDEE L + IT+SEKRDMDEMAYS   LYLSD+V+RL DEATTTGELWK LESLYLTKSL NK+
Subjt:  MTSTRFEVAKFNGMGDFALWRKKIRAILVQQKVAKILDEENLSETITKSEKRDMDEMAYSMFFLYLSDDVVRLEDEATTTGELWKMLESLYLTKSLPNKL

Query:  HLKEKFFGYSMNQSKGLKENLNKFLKIIVDLNNIDGKMLDENQTVILLNSLPKTYQEVLATIKYGRDSLAMNVVLDALNTRNLEIKKERKDGELLMAKGR
        ++KEKFFGY M+QSK L+ENL++F KI+VDLNNI  KM DENQ VILLNSLP+TY+EV A IKYGRDSL M++VLDAL TRNLEIKKERKDGELLMA+GR
Subjt:  HLKEKFFGYSMNQSKGLKENLNKFLKIIVDLNNIDGKMLDENQTVILLNSLPKTYQEVLATIKYGRDSLAMNVVLDALNTRNLEIKKERKDGELLMAKGR

Query:  SEKRSWKGKEKSSKSKSRGK-----------------------------ANVTNRYDSTEVLMG--------------LIGILRMLGSWIQ---------
        SEK+SWKGKE+S +SKS+GK                             ANVT+ Y+S E+  G              ++    +  +WI          
Subjt:  SEKRSWKGKEKSSKSKSRGK-----------------------------ANVTNRYDSTEVLMG--------------LIGILRMLGSWIQ---------

Query:  ------------------------------GALQIATHDGIIRVLTNVSYVPELKCNLISLGDLERSIYTYKSKNGVLKVSKDSLVKSRGTLRNGLYVLE
                                      G++QIATHDG++R+LTNV YVP+LK NLISLG+L+RS  T KS+NGV+KV+K SLVK RGTLR+GLYVLE
Subjt:  ------------------------------GALQIATHDGIIRVLTNVSYVPELKCNLISLGDLERSIYTYKSKNGVLKVSKDSLVKSRGTLRNGLYVLE

Query:  GTAVSGSAAIASGKVTNMSMLWHKRLAHVSEIGLQALSQQGLLGGVKDVELPFCKHCIMEKSTR
        GT VSGSAAIASGKVT+MSMLWHKRLAHVSE GLQALSQQGLLGGVK+VELPFC+HCIM KSTR
Subjt:  GTAVSGSAAIASGKVTNMSMLWHKRLAHVSEIGLQALSQQGLLGGVKDVELPFCKHCIMEKSTR

A0A5A7VKC2 Retrotransposon protein, putative, Ty1-copia subclass2.0e-13561.57Show/hide
Query:  MTSTRFEVAKFNGMGDFALWRKKIRAILVQQKVAKILDEENLSETITKSEKRDMDEMAYSMFFLYLSDDVVRLEDEATTTGELWKMLESLYLTKSLPNKL
        M STRFE++KFNG GDFA+WRKKIR ILVQ KVAKILDEE L E IT+SEKRDMDEM YS   LYLSD+V+RL DEATTTGELWK LESLYLTK LPN  
Subjt:  MTSTRFEVAKFNGMGDFALWRKKIRAILVQQKVAKILDEENLSETITKSEKRDMDEMAYSMFFLYLSDDVVRLEDEATTTGELWKMLESLYLTKSLPNKL

Query:  HLKEKFFGYSMNQSKGLKENLNKFLKIIVDLNNIDGKMLDENQTVILLNSLPKTYQEVLATIKYGRDSLAMNVVLDALNTRNLEIKKERKDGELLMAKGR
          KEKFF Y M+QSK L+ENL++F KIIVDL+NI  KM DENQ VILLNSLP+TY+EV A IKYGRDSL M++VLDAL TRNLEIKKERKDGELLMA+GR
Subjt:  HLKEKFFGYSMNQSKGLKENLNKFLKIIVDLNNIDGKMLDENQTVILLNSLPKTYQEVLATIKYGRDSLAMNVVLDALNTRNLEIKKERKDGELLMAKGR

Query:  SEKRSWKGKEKSSKSKSRGK-------------------------------------ANVTNRYDSTEVLMGLIGILRMLGSWIQ---------------
        SEK+SWKGKE+SS+SKS+GK                                     A +T+ Y+S EVLM  +    +  +WI                
Subjt:  SEKRSWKGKEKSSKSKSRGK-------------------------------------ANVTNRYDSTEVLMGLIGILRMLGSWIQ---------------

Query:  ------------------------GALQIATHDGIIRVLTNVSYVPELKCNLISLGDLERSIYTYKSKNGVLKVSKDSLVKSRGTLRNGLYVLEGTAVSG
                                G++QIATHD ++R+LTNV YVP+LK NLISLG+L+RS  T KS+NGV+KV+K SLVK RGTLR+GLYVLEGT VSG
Subjt:  ------------------------GALQIATHDGIIRVLTNVSYVPELKCNLISLGDLERSIYTYKSKNGVLKVSKDSLVKSRGTLRNGLYVLEGTAVSG

Query:  SAAIASGKVTNMSMLWHKRLAHVSEIGLQALSQQGLLGGVKDVELPFCKHCIMEKSTR
        S AIASGKVT M MLWH RLAHVSE GLQALSQQGLL GVK+VEL FC+HCIM K TR
Subjt:  SAAIASGKVTNMSMLWHKRLAHVSEIGLQALSQQGLLGGVKDVELPFCKHCIMEKSTR

A0A5D3DNU1 Putative gag-pol polyprotein9.6e-14663.36Show/hide
Query:  MTSTRFEVAKFNGMGDFALWRKKIRAILVQQKVAKILDEENLSETITKSEKRDMDEMAYSMFFLYLSDDVVRLEDEATTTGELWKMLESLYLTKSLPNKL
        M STRFEV+KFNG GDFALWRKKIRAILVQ KVAKILDEE L + IT+SEKRDMDEMAYS   LYLSD+V+RL DEATTTGELWK LESLYLTKSLPNK+
Subjt:  MTSTRFEVAKFNGMGDFALWRKKIRAILVQQKVAKILDEENLSETITKSEKRDMDEMAYSMFFLYLSDDVVRLEDEATTTGELWKMLESLYLTKSLPNKL

Query:  HLKEKFFGYSMNQSKGLKENLNKFLKIIVDLNNIDGKMLDENQTVILLNSLPKTYQEVLATIKYGRDSLAMNVVLDALNTRNLEIKKERKDGELLMAKGR
        ++KEKFFGY M+QSK L+ENL++F KI+VDLNNI  KM DENQ VILLNSLP+TY+EV A IKYGRDSL M++VLDAL TRNLEIKKERKDGELLMA+GR
Subjt:  HLKEKFFGYSMNQSKGLKENLNKFLKIIVDLNNIDGKMLDENQTVILLNSLPKTYQEVLATIKYGRDSLAMNVVLDALNTRNLEIKKERKDGELLMAKGR

Query:  SEKRSWKGKEKSSKSKSRGK-----------------------------ANVTNRYDSTEVLMG--------------LIGILRMLGSWIQ---------
        SEK+SWKGKE+S +SKS+GK                             ANVT+ Y+S E+  G              ++    +  +WI          
Subjt:  SEKRSWKGKEKSSKSKSRGK-----------------------------ANVTNRYDSTEVLMG--------------LIGILRMLGSWIQ---------

Query:  ------------------------------GALQIATHDGIIRVLTNVSYVPELKCNLISLGDLERSIYTYKSKNGVLKVSKDSLVKSRGTLRNGLYVLE
                                      G++QIATHDG++R+LTNV YVP+LK NLISLG+L+RS  T KS+NGV+KV+K SLVK RGTLR+GLYVLE
Subjt:  ------------------------------GALQIATHDGIIRVLTNVSYVPELKCNLISLGDLERSIYTYKSKNGVLKVSKDSLVKSRGTLRNGLYVLE

Query:  GTAVSGSAAIASGKVTNMSMLWHKRLAHVSEIGLQALSQQGLLGGVKDVELPFCKHCIMEKSTR
        GT VSGSAAIASGKVT+MSMLWHKRLAHVSE GLQALSQQGLLGGVK+VELPFC+HCIM KSTR
Subjt:  GTAVSGSAAIASGKVTNMSMLWHKRLAHVSEIGLQALSQQGLLGGVKDVELPFCKHCIMEKSTR

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.0e-0823.44Show/hide
Query:  MTSTRFEVAKFNGMGDFALWRKKIRAILVQQKVAKILDEENLSETITKSEKRDMDEMAYSMFFLYLSDDVVRLEDEATTTGELWKMLESLYLTKSLPNKL
        M   +  +  F+G   +A+W+ +IRA+L +Q V K++D   L         +  +  A S    YLSD  +       T  ++ + L+++Y  KSL ++L
Subjt:  MTSTRFEVAKFNGMGDFALWRKKIRAILVQQKVAKILDEENLSETITKSEKRDMDEMAYSMFFLYLSDDVVRLEDEATTTGELWKMLESLYLTKSLPNKL

Query:  HLKEKFFGYSMNQSKGLKENLNKFLKIIVDLNNIDGKMLDENQTVILLNSLPKTYQEVLATIK-YGRDSLAMNVVLDALNTRNLEIKKERKD
         L+++     ++    L  + + F ++I +L     K+ + ++   LL +LP  Y  ++  I+    ++L +  V + L  + ++IK +  D
Subjt:  HLKEKFFGYSMNQSKGLKENLNKFLKIIVDLNNIDGKMLDENQTVILLNSLPKTYQEVLATIK-YGRDSLAMNVVLDALNTRNLEIKKERKD

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.7e-4129.57Show/hide
Query:  MTSTRFEVAKFNGMGDFALWRKKIRAILVQQKVAKILDEENLSETITKSEK-RDMDEMAYSMFFLYLSDDVV-RLEDEATTTGELWKMLESLYLTKSLPN
        M+  ++EVAKFNG   F+ W++++R +L+QQ + K+LD ++      K+E   D+DE A S   L+LSDDVV  + DE T  G +W  LESLY++K+L N
Subjt:  MTSTRFEVAKFNGMGDFALWRKKIRAILVQQKVAKILDEENLSETITKSEK-RDMDEMAYSMFFLYLSDDVV-RLEDEATTTGELWKMLESLYLTKSLPN

Query:  KLHLKEKFFGYSMNQSKGLKENLNKFLKIIVDLNNIDGKMLDENQTVILLNSLPKTYQEVLATIKYGRDSLAMNVVLDALNTRNLEIKKERKDGELLM--
        KL+LK++ +   M++      +LN F  +I  L N+  K+ +E++ ++LLNSLP +Y  +  TI +G+ ++ +  V  AL       KK    G+ L+  
Subjt:  KLHLKEKFFGYSMNQSKGLKENLNKFLKIIVDLNNIDGKMLDENQTVILLNSLPKTYQEVLATIKYGRDSLAMNVVLDALNTRNLEIKKERKDGELLM--

Query:  AKGRSEKRSWKGKEKS---SKSKSRGKANVTNRY----------------------------DSTEVLM-------------------------------
         +GRS +RS     +S    KSK+R K+ V N Y                            D+T  ++                               
Subjt:  AKGRSEKRSWKGKEKS---SKSKSRGKANVTNRY----------------------------DSTEVLM-------------------------------

Query:  -----------------GLIGILRMLGSWIQ-----GALQIATHDGIIRVLTNVSYVPELKCNLISLGDLERSIYTYKSKNGVLKVSKDSLVKSRGTLRN
                         G  G ++M  +        G + I T+ G   VL +V +VP+L+ NLIS   L+R  Y     N   +++K SLV ++G  R 
Subjt:  -----------------GLIGILRMLGSWIQ-----GALQIATHDGIIRVLTNVSYVPELKCNLISLGDLERSIYTYKSKNGVLKVSKDSLVKSRGTLRN

Query:  GLYVLEGTAVSGSAAIASGKVTNMSMLWHKRLAHVSEIGLQALSQQGLLGGVKDVELPFCKHCIMEKSTR
         LY        G    A  +++    LWHKR+ H+SE GLQ L+++ L+   K   +  C +C+  K  R
Subjt:  GLYVLEGTAVSGSAAIASGKVTNMSMLWHKRLAHVSEIGLQALSQQGLLGGVKDVELPFCKHCIMEKSTR

P93293 Uncharacterized mitochondrial protein AtMg003006.2e-0937.08Show/hide
Query:  GVLKVSKDSLVKSRGTLRNGLYVLEGTAVSGSAAIASGKVTNMSMLWHKRLAHVSEIGLQALSQQGLLGGVKDVELPFCKHCIMEKSTR
        GVLKV K      +G   + LY+L+G+  +G + +A     + + LWH RLAH+S+ G++ L ++G L   K   L FC+ CI  K+ R
Subjt:  GVLKVSKDSLVKSRGTLRNGLYVLEGTAVSGSAAIASGKVTNMSMLWHKRLAHVSEIGLQALSQQGLLGGVKDVELPFCKHCIMEKSTR

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein4.4e-1037.08Show/hide
Query:  GVLKVSKDSLVKSRGTLRNGLYVLEGTAVSGSAAIASGKVTNMSMLWHKRLAHVSEIGLQALSQQGLLGGVKDVELPFCKHCIMEKSTR
        GVLKV K      +G   + LY+L+G+  +G + +A     + + LWH RLAH+S+ G++ L ++G L   K   L FC+ CI  K+ R
Subjt:  GVLKVSKDSLVKSRGTLRNGLYVLEGTAVSGSAAIASGKVTNMSMLWHKRLAHVSEIGLQALSQQGLLGGVKDVELPFCKHCIMEKSTR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCGACGGTGAGTTCCAAGTGAAGTTTGACGGAGCACGTTCATCTATGAGCTCACATATTCTTGGTTTAATCAGTTCTGTAAAGATGACTTCAACACGATTTGAGGT
GGCTAAGTTTAACGGGATGGGTGATTTTGCTCTTTGGAGGAAAAAGATTAGAGCTATTTTAGTTCAACAGAAAGTAGCTAAAATCTTAGATGAAGAGAACCTTTCGGAAA
CTATTACAAAAAGTGAGAAAAGGGATATGGATGAAATGGCTTATTCAATGTTCTTTTTGTATCTGTCAGATGATGTGGTTAGGCTTGAAGATGAGGCTACTACTACAGGG
GAGTTGTGGAAGATGCTAGAGAGTCTTTACTTGACAAAGTCATTGCCAAATAAATTACATCTAAAGGAGAAATTCTTTGGATATAGTATGAACCAAAGTAAAGGTTTAAA
GGAGAACTTGAATAAATTTCTTAAGATTATAGTTGATCTAAATAACATCGATGGGAAGATGTTGGATGAAAATCAAACGGTGATTCTTTTAAATTCATTACCTAAAACAT
ATCAAGAGGTTTTGGCAACTATTAAATATGGTCGGGACTCATTGGCCATGAATGTAGTGTTGGATGCCTTAAATACTAGAAATCTCGAAATTAAGAAAGAGCGCAAGGAT
GGAGAATTACTCATGGCCAAAGGAAGGAGTGAGAAAAGGAGTTGGAAAGGCAAAGAGAAGAGTTCCAAATCAAAATCAAGGGGAAAAGCGAATGTTACTAATAGGTATGA
TTCAACAGAGGTCTTGATGGGTCTCATAGGGATACTCAGGATGCTTGGATCATGGATTCAAGGTGCACTCCAAATTGCAACACATGACGGGATTATCAGAGTTCTCACTA
ATGTGAGTTATGTTCCAGAACTCAAATGTAATCTAATATCTCTTGGTGATTTAGAAAGATCAATTTATACTTATAAATCTAAGAATGGAGTTCTGAAAGTTAGCAAGGAT
TCCTTGGTTAAATCGAGGGGAACCTTGAGGAATGGTTTATATGTATTGGAAGGTACTGCAGTTTCAGGCAGTGCTGCTATTGCATCAGGTAAAGTTACAAATATGTCAAT
GTTATGGCACAAAAGGTTAGCTCATGTGAGTGAAATAGGTTTACAAGCACTTTCTCAACAAGGTTTACTAGGAGGAGTTAAAGATGTTGAACTACCATTTTGTAAGCATT
GTATAATGGAAAAGTCTACCAGA
mRNA sequenceShow/hide mRNA sequence
ATGTTCGACGGTGAGTTCCAAGTGAAGTTTGACGGAGCACGTTCATCTATGAGCTCACATATTCTTGGTTTAATCAGTTCTGTAAAGATGACTTCAACACGATTTGAGGT
GGCTAAGTTTAACGGGATGGGTGATTTTGCTCTTTGGAGGAAAAAGATTAGAGCTATTTTAGTTCAACAGAAAGTAGCTAAAATCTTAGATGAAGAGAACCTTTCGGAAA
CTATTACAAAAAGTGAGAAAAGGGATATGGATGAAATGGCTTATTCAATGTTCTTTTTGTATCTGTCAGATGATGTGGTTAGGCTTGAAGATGAGGCTACTACTACAGGG
GAGTTGTGGAAGATGCTAGAGAGTCTTTACTTGACAAAGTCATTGCCAAATAAATTACATCTAAAGGAGAAATTCTTTGGATATAGTATGAACCAAAGTAAAGGTTTAAA
GGAGAACTTGAATAAATTTCTTAAGATTATAGTTGATCTAAATAACATCGATGGGAAGATGTTGGATGAAAATCAAACGGTGATTCTTTTAAATTCATTACCTAAAACAT
ATCAAGAGGTTTTGGCAACTATTAAATATGGTCGGGACTCATTGGCCATGAATGTAGTGTTGGATGCCTTAAATACTAGAAATCTCGAAATTAAGAAAGAGCGCAAGGAT
GGAGAATTACTCATGGCCAAAGGAAGGAGTGAGAAAAGGAGTTGGAAAGGCAAAGAGAAGAGTTCCAAATCAAAATCAAGGGGAAAAGCGAATGTTACTAATAGGTATGA
TTCAACAGAGGTCTTGATGGGTCTCATAGGGATACTCAGGATGCTTGGATCATGGATTCAAGGTGCACTCCAAATTGCAACACATGACGGGATTATCAGAGTTCTCACTA
ATGTGAGTTATGTTCCAGAACTCAAATGTAATCTAATATCTCTTGGTGATTTAGAAAGATCAATTTATACTTATAAATCTAAGAATGGAGTTCTGAAAGTTAGCAAGGAT
TCCTTGGTTAAATCGAGGGGAACCTTGAGGAATGGTTTATATGTATTGGAAGGTACTGCAGTTTCAGGCAGTGCTGCTATTGCATCAGGTAAAGTTACAAATATGTCAAT
GTTATGGCACAAAAGGTTAGCTCATGTGAGTGAAATAGGTTTACAAGCACTTTCTCAACAAGGTTTACTAGGAGGAGTTAAAGATGTTGAACTACCATTTTGTAAGCATT
GTATAATGGAAAAGTCTACCAGA
Protein sequenceShow/hide protein sequence
MFDGEFQVKFDGARSSMSSHILGLISSVKMTSTRFEVAKFNGMGDFALWRKKIRAILVQQKVAKILDEENLSETITKSEKRDMDEMAYSMFFLYLSDDVVRLEDEATTTG
ELWKMLESLYLTKSLPNKLHLKEKFFGYSMNQSKGLKENLNKFLKIIVDLNNIDGKMLDENQTVILLNSLPKTYQEVLATIKYGRDSLAMNVVLDALNTRNLEIKKERKD
GELLMAKGRSEKRSWKGKEKSSKSKSRGKANVTNRYDSTEVLMGLIGILRMLGSWIQGALQIATHDGIIRVLTNVSYVPELKCNLISLGDLERSIYTYKSKNGVLKVSKD
SLVKSRGTLRNGLYVLEGTAVSGSAAIASGKVTNMSMLWHKRLAHVSEIGLQALSQQGLLGGVKDVELPFCKHCIMEKSTR