CuGenDBv2

Pay0007588 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0007588
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr06:27231096..27233094
RNA-Seq Expression: Pay0007588
Synteny: Pay0007588
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
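
The genome location listed above follows a `seqid:start..end` pattern. As a minimal sketch of how such a string could be split into its parts, the Python below parses it and computes the span length. The function name `parse_location` is hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end).

    Hypothetical helper for illustration; not part of any CuGenDB API.
    """
    match = re.fullmatch(r"(\S+?):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return seqid, start, end


seqid, start, end = parse_location("chr06:27231096..27233094")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1

print(seqid, start, end, span)  # chr06 27231096 27233094 1999
```

Under that assumption the gene spans 1999 positions on chr06. If the coordinates were 0-based or half-open, the span would differ by one.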


Homology
GenBank top hits (e value, % identity, alignment)
KAA0046865.1 putative gag-pol polyprotein, identical [Cucumis melo var. makuwa] (e value: 7.3e-162; % identity: 81.44)
Query:  MGSNGNAMGTTQPLIPIFKGEGYEFWSIPMKTLLRSQDLWDLVEQGYVDPDDKGKLRENKKKDSKALVIIQQAVHDSVFSRIAAFQGDSRVLVVKLKSLR
        MGSNGN MGTTQPLIPIFKGEGYEFWSIPMKTLLRSQDLWDLVEQGYVDPDDKGKLRENKKKDSKALVIIQQAVHDSVFSRI A             +  
Subjt:  MGSNGNAMGTTQPLIPIFKGEGYEFWSIPMKTLLRSQDLWDLVEQGYVDPDDKGKLRENKKKDSKALVIIQQAVHDSVFSRIAAFQGDSRVLVVKLKSLR

Query:  RDFETLMMKNGESIADFLSRAMTIISQMQTYGETIKDQTIVEKVLISLTPKFDHVVAAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKD
        RDFETLMMKNGESIADFLSRA  IISQMQTYGETIKDQTIVEKVLISLTPKFDHVV AIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQ  D
Subjt:  RDFETLMMKNGESIADFLSRAMTIISQMQTYGETIKDQTIVEKVLISLTPKFDHVVAAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKD

Query:  VVPKSNDSDCVMTRGRGRGGYRGRGRGTGKGINKANI-QCYHCKKFGHVKADYCGCSNHMTGLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHH
           K+   +                   GKG NK  +      +K G V     GCSNHMTGLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHH
Subjt:  VVPKSNDSDCVMTRGRGRGGYRGRGRGTGKGINKANI-QCYHCKKFGHVKADYCGCSNHMTGLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHH

Query:  GNRILTNVQYVPDIGYNFLSVGQLMESGFSILFDDESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKEDGIHRE
        GNRILTNVQYVPDIGYN LSVGQLMESG SILFDDESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKE GIH E
Subjt:  GNRILTNVQYVPDIGYNFLSVGQLMESGFSILFDDESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKEDGIHRE

KAA0055915.1 copia protein [Cucumis melo var. makuwa] (e value: 2.0e-183; % identity: 68.68)
Query:  MGSNGNAMGTTQPLIPIFKGEGYEFWSIPMKTLLRSQDLWDLVEQGYVDPDDKGKLRENKKKDSKALVIIQQAVHDSVFSRIA--------------AFQ
        M +NGN MGTTQPLIPIFKGEGYEFWSI MKTLL SQDLWDLVEQGY DPDD+GKL+EN++KD KALVI+QQAVHD+VFSRIA              AFQ
Subjt:  MGSNGNAMGTTQPLIPIFKGEGYEFWSIPMKTLLRSQDLWDLVEQGYVDPDDKGKLRENKKKDSKALVIIQQAVHDSVFSRIA--------------AFQ

Query:  GDSRVLVVKLKSLRRDFETLMMKNGESIADFLSRAMTIISQMQTYGETIKDQTIVEKVLISLTPKFDHVVAAIEESKNLSTFTFIELMGSLEAHESRINR
        GDSRVLVVKL+SL+RDFETLMMKNGESIADFLSRA TIISQMQTYGETI DQTIVEKVL SLTPKFDHVVAAIEESK+LSTFTFIELMGSL+AHESRIN 
Subjt:  GDSRVLVVKLKSLRRDFETLMMKNGESIADFLSRAMTIISQMQTYGETIKDQTIVEKVLISLTPKFDHVVAAIEESKNLSTFTFIELMGSLEAHESRINR

Query:  SMERNEEKAFQVKDVVPKSNDSDCVMTRGRGRGGYRGRGRGTGKGI--------------NKANIQCYHCKKFGHVKADY--------------------
        SME+N+EKAF+VKDVVPK NDSDCVMT+G+G GGYR RGRGTGKG               NKANIQCYHCKKFGHVKAD                     
Subjt:  SMERNEEKAFQVKDVVPKSNDSDCVMTRGRGRGGYRGRGRGTGKGI--------------NKANIQCYHCKKFGHVKADY--------------------

Query:  ------------------------CGCSNHMTGLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHHGNRILTNVQ-----------YVPDIGYNF
                                 G  NHMTGLKPVFKELNEGEKLKVEL N KELQVEGK  +GIETH+GNRILTNVQ            V ++    
Subjt:  ------------------------CGCSNHMTGLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHHGNRILTNVQ-----------YVPDIGYNF

Query:  LSVGQLMESGFSILFDDESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKEDGIHRELTTPYTPEQNGVAERKNRTVVEMARSMLQM
        L+      +  +      S+SETFEKFKHFKAKVEKQSGMFIKSLRSDRGG+FLSNNFNHFC+E GIHRELTTPYT EQNGVAERKNRTVVEMARSMLQM
Subjt:  LSVGQLMESGFSILFDDESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKEDGIHRELTTPYTPEQNGVAERKNRTVVEMARSMLQM

Query:  KGLLNDFWAEAVSASI------PTEHLTNK
        KGL NDFW EAVS SI      PT+ + NK
Subjt:  KGLLNDFWAEAVSASI------PTEHLTNK

TYK03281.1 putative gag-pol polyprotein, identical [Cucumis melo var. makuwa] (e value: 3.4e-159; % identity: 79)
Query:  MGSNGNAMGTTQPLIPIFKGEGYEFWSIPMKTLLRSQDLWDLVEQGYVDPDDKGKLRENKKKDSKALVIIQQAVHDSVFSRIAAFQGDSRVLVVKLKSLR
        MGSNGN MGTTQPLIPIFKGEGYEFWSIPMKTLLRSQDLWDLVEQGYVDPDDKGKLRENKKKDSKALVIIQQAVHDSVFSRI A             +  
Subjt:  MGSNGNAMGTTQPLIPIFKGEGYEFWSIPMKTLLRSQDLWDLVEQGYVDPDDKGKLRENKKKDSKALVIIQQAVHDSVFSRIAAFQGDSRVLVVKLKSLR

Query:  RDFETLMMKNGESIADFLSRAMTIISQMQTYGETIKDQTIVEKVLISLTPKFDHVVAAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKD
        RDFETLMMKNGESIADFLSRA  IISQMQTYGETIKDQTIVEKVLISLTPKFDHVV AIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQ  D
Subjt:  RDFETLMMKNGESIADFLSRAMTIISQMQTYGETIKDQTIVEKVLISLTPKFDHVVAAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKD

Query:  VVPKSNDSDCVMTRGRGRGGYRGRGRGTGKGINKANI-QCYHCKKFGHVKADYCGCSNHMTGLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHH
           K+   +                   GKG NK  +      +K G V     GCSNHMTGLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHH
Subjt:  VVPKSNDSDCVMTRGRGRGGYRGRGRGTGKGINKANI-QCYHCKKFGHVKADYCGCSNHMTGLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHH

Query:  GNRILTNVQYVPDIGYNFLSVGQLMESGFSILFDD------------ESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKEDGIHRE
        GNRILTNVQYVPDIGYN LSVGQLMESG SILFDD            ESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKE GIH E
Subjt:  GNRILTNVQYVPDIGYNFLSVGQLMESGFSILFDD------------ESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKEDGIHRE

TYK27735.1 putative gag-pol polyprotein, identical [Cucumis melo var. makuwa] (e value: 1.3e-179; % identity: 59.87)
Query:  MKTLLRSQDLWDLVEQGYVDPDDKGKLRENKKKDSKALVIIQQAVHDSVFSRIA--------------AFQGDSRVLVVKLKSLRRDFETLMMKNGESIA
        +KTLLRSQDLWDLVEQGYVDPDD+GKLREN+KKDSKALVIIQQAVHDSVFSRIA              AFQGDSRVL+VKL+SLRRDFETLMMKNGESIA
Subjt:  MKTLLRSQDLWDLVEQGYVDPDDKGKLRENKKKDSKALVIIQQAVHDSVFSRIA--------------AFQGDSRVLVVKLKSLRRDFETLMMKNGESIA

Query:  DFLSRAMTIISQMQTYGETIKDQTIVEKVLISLTPKFDHVVAAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKDVVPKSNDSDCVMTRG
        DFLSRA TIISQMQTYGETIKDQTIVEKVL SLTPKFDHVVAAIEESKNL TFTFIELMGSLEAHESRINRSMERNEEKAFQVKD VPK NDSD VMTRG
Subjt:  DFLSRAMTIISQMQTYGETIKDQTIVEKVLISLTPKFDHVVAAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKDVVPKSNDSDCVMTRG

Query:  RGRGGYRGRGRGTGKGI--------------NKANIQCYHCKKFGHVKADY--------------------------------------------CGCSN
        RGRGGYRGRG GT KG               NKANIQCYHCKKFGHVKAD                                               CSN
Subjt:  RGRGGYRGRGRGTGKGI--------------NKANIQCYHCKKFGHVKADY--------------------------------------------CGCSN

Query:  HMTGLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHHGNRILTNVQYVPDIGYNFLSVGQLMESGFSILFDD-----------------------
        HMTGLKPVFKELNEGEKLKV+L NGKELQVEGKGTV IETHHGNRILTNVQYVPDIGYN LSVGQLMESG+SILFDD                       
Subjt:  HMTGLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHHGNRILTNVQYVPDIGYNFLSVGQLMESGFSILFDD-----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------ESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKEDGIHRELTTPYTPEQNGVAERKNRTVVE
                                 ESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRG EFLSNNFNHFCKE GIHRELTTPYTPEQNGVAERKN+TVVE
Subjt:  -------------------------ESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKEDGIHRELTTPYTPEQNGVAERKNRTVVE

Query:  MARSMLQMKGLLNDFWAEAVSASI------PTEHLTNK
        MARSMLQMKGLLNDFWAEAVS SI      PT+ + NK
Subjt:  MARSMLQMKGLLNDFWAEAVSASI------PTEHLTNK

TYK28117.1 putative gag-pol polyprotein, identical [Cucumis melo var. makuwa] (e value: 1.2e-183; % identity: 63.16)
Query:  MGSNGNAMGTTQPLIPIFKGEGYEFWSIPMKTLLRSQDLWDLVEQGYVDPDDKGKLRENKKKDSKALVIIQQAVHDSVFSRIAAFQGDSRVLVVKLKSLR
        M +NGN MGT QPLIPIFKGEGYEFWSI MKTLL SQDLWDLVEQGY DPDD+GKL+EN++KDSKALVIIQQAVHD+VFSRIAA             +  
Subjt:  MGSNGNAMGTTQPLIPIFKGEGYEFWSIPMKTLLRSQDLWDLVEQGYVDPDDKGKLRENKKKDSKALVIIQQAVHDSVFSRIAAFQGDSRVLVVKLKSLR

Query:  RDFETLMMKNGESIADFLSRAMTIISQMQTYGETIKDQTIVEKVLISLTPKFDHVVAAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKD
        RDFETLMMKNGESIADFLSRA TIISQMQTYGETI DQTIVEKVL SLTPKFDHVV AIEESK+LSTFTFIELMGSL+AHESRIN SME+NEEKAF+VKD
Subjt:  RDFETLMMKNGESIADFLSRAMTIISQMQTYGETIKDQTIVEKVLISLTPKFDHVVAAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKD

Query:  VVPKSNDSDCVMTRGRGRGGYRGRGRGTGKGI--------------NKANIQCYHCKKFGHVKADY----------------------------------
        VVPK NDSDCVMT+G+G GGYR RGRGTGKG               NKANIQCYHCKKFGHVKAD                                   
Subjt:  VVPKSNDSDCVMTRGRGRGGYRGRGRGTGKGI--------------NKANIQCYHCKKFGHVKADY----------------------------------

Query:  ----------CGCSNHMTGLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHHGNRILTNVQYVPDIGYNFLSVGQLMESGFSILFDD--------
                   G  NHMT LKPVFKELNEGEKLKVEL NGKELQVEGK T+GIETH+GNRILTNVQYVPDIGYN LSVGQLMESG SILFDD        
Subjt:  ----------CGCSNHMTGLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHHGNRILTNVQYVPDIGYNFLSVGQLMESGFSILFDD--------

Query:  ----------------------------------------------------------------------------ESKSETFEKFKHFKAKVEKQSGMF
                                                                                    +S+SETFEKFKHFKAKVEKQSGMF
Subjt:  ----------------------------------------------------------------------------ESKSETFEKFKHFKAKVEKQSGMF

Query:  IKSLRSDRGGEFLSNNFNHFCKEDGIHRELTTPYTPEQNGVAERKNRTVVEMARSMLQMKGLLNDFWAEAVSASI------PTEHLTNK
        IKS RSDRGG+FLSNNFNHFC+E GIHRELTTPYT EQNGVAERKNRTVVEMARSMLQMKGL NDFW EA S SI      PT+ + NK
Subjt:  IKSLRSDRGGEFLSNNFNHFCKEDGIHRELTTPYTPEQNGVAERKNRTVVEMARSMLQMKGLLNDFWAEAVSASI------PTEHLTNK

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TZP7 Putative gag-pol polyprotein, identical (e value: 3.6e-162; % identity: 81.44)
Query:  MGSNGNAMGTTQPLIPIFKGEGYEFWSIPMKTLLRSQDLWDLVEQGYVDPDDKGKLRENKKKDSKALVIIQQAVHDSVFSRIAAFQGDSRVLVVKLKSLR
        MGSNGN MGTTQPLIPIFKGEGYEFWSIPMKTLLRSQDLWDLVEQGYVDPDDKGKLRENKKKDSKALVIIQQAVHDSVFSRI A             +  
Subjt:  MGSNGNAMGTTQPLIPIFKGEGYEFWSIPMKTLLRSQDLWDLVEQGYVDPDDKGKLRENKKKDSKALVIIQQAVHDSVFSRIAAFQGDSRVLVVKLKSLR

Query:  RDFETLMMKNGESIADFLSRAMTIISQMQTYGETIKDQTIVEKVLISLTPKFDHVVAAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKD
        RDFETLMMKNGESIADFLSRA  IISQMQTYGETIKDQTIVEKVLISLTPKFDHVV AIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQ  D
Subjt:  RDFETLMMKNGESIADFLSRAMTIISQMQTYGETIKDQTIVEKVLISLTPKFDHVVAAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKD

Query:  VVPKSNDSDCVMTRGRGRGGYRGRGRGTGKGINKANI-QCYHCKKFGHVKADYCGCSNHMTGLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHH
           K+   +                   GKG NK  +      +K G V     GCSNHMTGLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHH
Subjt:  VVPKSNDSDCVMTRGRGRGGYRGRGRGTGKGINKANI-QCYHCKKFGHVKADYCGCSNHMTGLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHH

Query:  GNRILTNVQYVPDIGYNFLSVGQLMESGFSILFDDESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKEDGIHRE
        GNRILTNVQYVPDIGYN LSVGQLMESG SILFDDESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKE GIH E
Subjt:  GNRILTNVQYVPDIGYNFLSVGQLMESGFSILFDDESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKEDGIHRE

A0A5A7UQM0 Copia protein (e value: 9.6e-184; % identity: 68.68)
Query:  MGSNGNAMGTTQPLIPIFKGEGYEFWSIPMKTLLRSQDLWDLVEQGYVDPDDKGKLRENKKKDSKALVIIQQAVHDSVFSRIA--------------AFQ
        M +NGN MGTTQPLIPIFKGEGYEFWSI MKTLL SQDLWDLVEQGY DPDD+GKL+EN++KD KALVI+QQAVHD+VFSRIA              AFQ
Subjt:  MGSNGNAMGTTQPLIPIFKGEGYEFWSIPMKTLLRSQDLWDLVEQGYVDPDDKGKLRENKKKDSKALVIIQQAVHDSVFSRIA--------------AFQ

Query:  GDSRVLVVKLKSLRRDFETLMMKNGESIADFLSRAMTIISQMQTYGETIKDQTIVEKVLISLTPKFDHVVAAIEESKNLSTFTFIELMGSLEAHESRINR
        GDSRVLVVKL+SL+RDFETLMMKNGESIADFLSRA TIISQMQTYGETI DQTIVEKVL SLTPKFDHVVAAIEESK+LSTFTFIELMGSL+AHESRIN 
Subjt:  GDSRVLVVKLKSLRRDFETLMMKNGESIADFLSRAMTIISQMQTYGETIKDQTIVEKVLISLTPKFDHVVAAIEESKNLSTFTFIELMGSLEAHESRINR

Query:  SMERNEEKAFQVKDVVPKSNDSDCVMTRGRGRGGYRGRGRGTGKGI--------------NKANIQCYHCKKFGHVKADY--------------------
        SME+N+EKAF+VKDVVPK NDSDCVMT+G+G GGYR RGRGTGKG               NKANIQCYHCKKFGHVKAD                     
Subjt:  SMERNEEKAFQVKDVVPKSNDSDCVMTRGRGRGGYRGRGRGTGKGI--------------NKANIQCYHCKKFGHVKADY--------------------

Query:  ------------------------CGCSNHMTGLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHHGNRILTNVQ-----------YVPDIGYNF
                                 G  NHMTGLKPVFKELNEGEKLKVEL N KELQVEGK  +GIETH+GNRILTNVQ            V ++    
Subjt:  ------------------------CGCSNHMTGLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHHGNRILTNVQ-----------YVPDIGYNF

Query:  LSVGQLMESGFSILFDDESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKEDGIHRELTTPYTPEQNGVAERKNRTVVEMARSMLQM
        L+      +  +      S+SETFEKFKHFKAKVEKQSGMFIKSLRSDRGG+FLSNNFNHFC+E GIHRELTTPYT EQNGVAERKNRTVVEMARSMLQM
Subjt:  LSVGQLMESGFSILFDDESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKEDGIHRELTTPYTPEQNGVAERKNRTVVEMARSMLQM

Query:  KGLLNDFWAEAVSASI------PTEHLTNK
        KGL NDFW EAVS SI      PT+ + NK
Subjt:  KGLLNDFWAEAVSASI------PTEHLTNK

A0A5D3BU79 Putative gag-pol polyprotein, identical (e value: 1.7e-159; % identity: 79)
Query:  MGSNGNAMGTTQPLIPIFKGEGYEFWSIPMKTLLRSQDLWDLVEQGYVDPDDKGKLRENKKKDSKALVIIQQAVHDSVFSRIAAFQGDSRVLVVKLKSLR
        MGSNGN MGTTQPLIPIFKGEGYEFWSIPMKTLLRSQDLWDLVEQGYVDPDDKGKLRENKKKDSKALVIIQQAVHDSVFSRI A             +  
Subjt:  MGSNGNAMGTTQPLIPIFKGEGYEFWSIPMKTLLRSQDLWDLVEQGYVDPDDKGKLRENKKKDSKALVIIQQAVHDSVFSRIAAFQGDSRVLVVKLKSLR

Query:  RDFETLMMKNGESIADFLSRAMTIISQMQTYGETIKDQTIVEKVLISLTPKFDHVVAAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKD
        RDFETLMMKNGESIADFLSRA  IISQMQTYGETIKDQTIVEKVLISLTPKFDHVV AIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQ  D
Subjt:  RDFETLMMKNGESIADFLSRAMTIISQMQTYGETIKDQTIVEKVLISLTPKFDHVVAAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKD

Query:  VVPKSNDSDCVMTRGRGRGGYRGRGRGTGKGINKANI-QCYHCKKFGHVKADYCGCSNHMTGLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHH
           K+   +                   GKG NK  +      +K G V     GCSNHMTGLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHH
Subjt:  VVPKSNDSDCVMTRGRGRGGYRGRGRGTGKGINKANI-QCYHCKKFGHVKADYCGCSNHMTGLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHH

Query:  GNRILTNVQYVPDIGYNFLSVGQLMESGFSILFDD------------ESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKEDGIHRE
        GNRILTNVQYVPDIGYN LSVGQLMESG SILFDD            ESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKE GIH E
Subjt:  GNRILTNVQYVPDIGYNFLSVGQLMESGFSILFDD------------ESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKEDGIHRE

A0A5D3DWC7 Putative gag-pol polyprotein, identical (e value: 5.6e-184; % identity: 63.16)
Query:  MGSNGNAMGTTQPLIPIFKGEGYEFWSIPMKTLLRSQDLWDLVEQGYVDPDDKGKLRENKKKDSKALVIIQQAVHDSVFSRIAAFQGDSRVLVVKLKSLR
        M +NGN MGT QPLIPIFKGEGYEFWSI MKTLL SQDLWDLVEQGY DPDD+GKL+EN++KDSKALVIIQQAVHD+VFSRIAA             +  
Subjt:  MGSNGNAMGTTQPLIPIFKGEGYEFWSIPMKTLLRSQDLWDLVEQGYVDPDDKGKLRENKKKDSKALVIIQQAVHDSVFSRIAAFQGDSRVLVVKLKSLR

Query:  RDFETLMMKNGESIADFLSRAMTIISQMQTYGETIKDQTIVEKVLISLTPKFDHVVAAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKD
        RDFETLMMKNGESIADFLSRA TIISQMQTYGETI DQTIVEKVL SLTPKFDHVV AIEESK+LSTFTFIELMGSL+AHESRIN SME+NEEKAF+VKD
Subjt:  RDFETLMMKNGESIADFLSRAMTIISQMQTYGETIKDQTIVEKVLISLTPKFDHVVAAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKD

Query:  VVPKSNDSDCVMTRGRGRGGYRGRGRGTGKGI--------------NKANIQCYHCKKFGHVKADY----------------------------------
        VVPK NDSDCVMT+G+G GGYR RGRGTGKG               NKANIQCYHCKKFGHVKAD                                   
Subjt:  VVPKSNDSDCVMTRGRGRGGYRGRGRGTGKGI--------------NKANIQCYHCKKFGHVKADY----------------------------------

Query:  ----------CGCSNHMTGLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHHGNRILTNVQYVPDIGYNFLSVGQLMESGFSILFDD--------
                   G  NHMT LKPVFKELNEGEKLKVEL NGKELQVEGK T+GIETH+GNRILTNVQYVPDIGYN LSVGQLMESG SILFDD        
Subjt:  ----------CGCSNHMTGLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHHGNRILTNVQYVPDIGYNFLSVGQLMESGFSILFDD--------

Query:  ----------------------------------------------------------------------------ESKSETFEKFKHFKAKVEKQSGMF
                                                                                    +S+SETFEKFKHFKAKVEKQSGMF
Subjt:  ----------------------------------------------------------------------------ESKSETFEKFKHFKAKVEKQSGMF

Query:  IKSLRSDRGGEFLSNNFNHFCKEDGIHRELTTPYTPEQNGVAERKNRTVVEMARSMLQMKGLLNDFWAEAVSASI------PTEHLTNK
        IKS RSDRGG+FLSNNFNHFC+E GIHRELTTPYT EQNGVAERKNRTVVEMARSMLQMKGL NDFW EA S SI      PT+ + NK
Subjt:  IKSLRSDRGGEFLSNNFNHFCKEDGIHRELTTPYTPEQNGVAERKNRTVVEMARSMLQMKGLLNDFWAEAVSASI------PTEHLTNK

A0A5D3DWP2 Putative gag-pol polyprotein, identical (e value: 6.4e-180; % identity: 59.87)
Query:  MKTLLRSQDLWDLVEQGYVDPDDKGKLRENKKKDSKALVIIQQAVHDSVFSRIA--------------AFQGDSRVLVVKLKSLRRDFETLMMKNGESIA
        +KTLLRSQDLWDLVEQGYVDPDD+GKLREN+KKDSKALVIIQQAVHDSVFSRIA              AFQGDSRVL+VKL+SLRRDFETLMMKNGESIA
Subjt:  MKTLLRSQDLWDLVEQGYVDPDDKGKLRENKKKDSKALVIIQQAVHDSVFSRIA--------------AFQGDSRVLVVKLKSLRRDFETLMMKNGESIA

Query:  DFLSRAMTIISQMQTYGETIKDQTIVEKVLISLTPKFDHVVAAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKDVVPKSNDSDCVMTRG
        DFLSRA TIISQMQTYGETIKDQTIVEKVL SLTPKFDHVVAAIEESKNL TFTFIELMGSLEAHESRINRSMERNEEKAFQVKD VPK NDSD VMTRG
Subjt:  DFLSRAMTIISQMQTYGETIKDQTIVEKVLISLTPKFDHVVAAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKDVVPKSNDSDCVMTRG

Query:  RGRGGYRGRGRGTGKGI--------------NKANIQCYHCKKFGHVKADY--------------------------------------------CGCSN
        RGRGGYRGRG GT KG               NKANIQCYHCKKFGHVKAD                                               CSN
Subjt:  RGRGGYRGRGRGTGKGI--------------NKANIQCYHCKKFGHVKADY--------------------------------------------CGCSN

Query:  HMTGLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHHGNRILTNVQYVPDIGYNFLSVGQLMESGFSILFDD-----------------------
        HMTGLKPVFKELNEGEKLKV+L NGKELQVEGKGTV IETHHGNRILTNVQYVPDIGYN LSVGQLMESG+SILFDD                       
Subjt:  HMTGLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHHGNRILTNVQYVPDIGYNFLSVGQLMESGFSILFDD-----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------ESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKEDGIHRELTTPYTPEQNGVAERKNRTVVE
                                 ESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRG EFLSNNFNHFCKE GIHRELTTPYTPEQNGVAERKN+TVVE
Subjt:  -------------------------ESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKEDGIHRELTTPYTPEQNGVAERKNRTVVE

Query:  MARSMLQMKGLLNDFWAEAVSASI------PTEHLTNK
        MARSMLQMKGLLNDFWAEAVS SI      PT+ + NK
Subjt:  MARSMLQMKGLLNDFWAEAVSASI------PTEHLTNK

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 4.5e-13; % identity: 44.09)
Query:  KSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKEDGIHRELTTPYTPEQNGVAERKNRTVVEMARSMLQMKGLLNDFWAEAV
        KS+ F  F+ F AK E    + +  L  D G E+LSN    FC + GI   LT P+TP+ NGV+ER  RT+ E AR+M+    L   FW EAV
Subjt:  KSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKEDGIHRELTTPYTPEQNGVAERKNRTVVEMARSMLQMKGLLNDFWAEAV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 8.5e-20; % identity: 48.98)
Query:  ESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKEDGIHRELTTPYTPEQNGVAERKNRTVVEMARSMLQMKGLLNDFWAEAVSAS
        ++K + F+ F+ F A VE+++G  +K LRSD GGE+ S  F  +C   GI  E T P TP+ NGVAER NRT+VE  RSML+M  L   FW EAV  +
Subjt:  ESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKEDGIHRELTTPYTPEQNGVAERKNRTVVEMARSMLQMKGLLNDFWAEAVSAS

Q07163 Transposon TyH3 Gag-Pol polyprotein (e value: 1.2e-05; % identity: 30.3)
Query:  DESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKEDGIHRELTTPYTPEQNGVAERKNRTVVEMARSMLQMKGLLNDFWAEAVSAS
        D  +    + F    A ++ Q    +  ++ DRG E+ +   + F +++GI    TT      +GVAER NRT+++  R+ LQ  GL N  W  A+  S
Subjt:  DESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKEDGIHRELTTPYTPEQNGVAERKNRTVVEMARSMLQMKGLLNDFWAEAVSAS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 2.4e-06; % identity: 30.19)
Query:  FSILFDDESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKEDGIHRELTTPYTPEQNGVAERKNRTVVEMARSMLQMKGLLNDFWAE
        ++ L+  + KS+  E F  FK  +E +    I +  SD GGEF++     +  + GI    + P+TPE NG++ERK+R +VE   ++L    +   +W  
Subjt:  FSILFDDESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKEDGIHRELTTPYTPEQNGVAERKNRTVVEMARSMLQMKGLLNDFWAE

Query:  AVSASI
        A + ++
Subjt:  AVSASI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 4.4e-08; % identity: 33.02)
Query:  FSILFDDESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKEDGIHRELTTPYTPEQNGVAERKNRTVVEMARSMLQMKGLLNDFWAE
        ++ L+  + KS+  + F  FK+ VE +    I +L SD GGEF+      +  + GI    + P+TPE NG++ERK+R +VEM  ++L    +   +W  
Subjt:  FSILFDDESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKEDGIHRELTTPYTPEQNGVAERKNRTVVEMARSMLQMKGLLNDFWAE

Query:  AVSASI
        A S ++
Subjt:  AVSASI

Arabidopsis top hits (e value, % identity, alignment)
AT1G48720.1 unknown protein (e value: 3.7e-10; % identity: 31.58)
Query:  IPIFKGEGYEFWSIPMKTLLRSQDLWDLVEQGYVDPDDKGK--------LRENKKKDSKALVIIQQAVHDSVFSRI
        +P+     Y+ WS+ MK +L + D+W++VE+G+++P+++G         LR+++K+D KAL +I Q + +  F ++
Subjt:  IPIFKGEGYEFWSIPMKTLLRSQDLWDLVEQGYVDPDDKGK--------LRENKKKDSKALVIIQQAVHDSVFSRI

AT3G21000.1 Gag-Pol-related retrotransposon family protein (e value: 1.3e-10; % identity: 25.36)
Query:  YEFWSIPMKTLLRSQDLWDLVEQGY-------------VDPDDKGKLRENKKKDSKALVIIQQAVHDSVFSRIAAF-----------QGDSRVLV-----
        YE W+   K+ L  Q LWD+V  G              + P++  K R+   KD+KAL I+Q ++ DSVF +  +            +G+ +  +     
Subjt:  YEFWSIPMKTLLRSQDLWDLVEQGY-------------VDPDDKGKLRENKKKDSKALVIIQQAVHDSVFSRIAAF-----------QGDSRVLV-----

Query:  VKLKSLRRDFETLMMKNGESIADFLSRAMTIISQMQTYGETIKDQTIVEKVLISLTPKFDHVVAAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEE
        V ++ L +  E L M + ES + +L +A+ I+ ++        D  I + V  +L+  FD + + +EE  ++   T   L   +E    R++ S    EE
Subjt:  VKLKSLRRDFETLMMKNGESIADFLSRAMTIISQMQTYGETIKDQTIVEKVLISLTPKFDHVVAAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEE

Query:  KAF-QVKDVVPKS-NDSDCVMTRGRGRGGYRGRGR-GTGKGINKANIQC-YHCKKFGHVKAD---------YCGCSNHMTGLKPVFKELNEGEKLKVELE
          F  +KD+  KS ++  C +           + R  T K   +  I   Y  +   ++ A          +     +MT     F  L+   K  V   
Subjt:  KAF-QVKDVVPKS-NDSDCVMTRGRGRGGYRGRGR-GTGKGINKANIQC-YHCKKFGHVKAD---------YCGCSNHMTGLKPVFKELNEGEKLKVELE

Query:  NGKELQVEGKGTVGIETHHG-NRILTNVQYVPDIGYNFLSVGQLMESGFSI
        +G  L VEGKG V I    G  + + NV +VP +  N LS G+++   +SI
Subjt:  NGKELQVEGKGTVGIETHHG-NRILTNVQYVPDIGYNFLSVGQLMESGFSI


Sequences
CDS sequence
ATGGGCAGCAATGGCAATGCTATGGGTACAACACAACCACTCATTCCAATCTTCAAAGGAGAAGGCTACGAGTTTTGGAGTATACCTATGAAGACTCTTCTCAGA
TCTCAAGACTTATGGGACTTAGTAGAACAAGGCTATGTAGATCCTGACGACAAAGGCAAGTTGCGGGAGAACAAGAAGAAAGACTCGAAGGCGTTGGTGATTATT
CAACAAGCTGTCCATGATAGTGTTTTTTCGCGGATTGCTGCATTTCAAGGAGATTCAAGAGTACTTGTGGTTAAATTGAAATCACTTAGACGAGACTTTGAGACC
TTGATGATGAAAAATGGAGAATCAATTGCTGATTTTTTGTCACGGGCAATGACAATTATTAGTCAGATGCAAACATATGGCGAGACGATTAAGGACCAGACTATA
GTGGAGAAAGTATTGATAAGTTTGACTCCAAAGTTTGATCATGTTGTGGCTGCAATAGAAGAATCAAAGAATTTGTCCACTTTCACATTTATTGAATTAATGGGA
TCTCTTGAAGCACACGAGTCGAGAATCAATAGATCGATGGAAAGAAACGAAGAAAAAGCGTTTCAGGTAAAGGATGTAGTTCCAAAGTCTAATGACAGTGATTGT
GTGATGACTCGAGGCCGAGGAAGAGGAGGATATCGTGGTCGAGGTCGTGGTACCGGAAAAGGGATCAACAAAGCTAATATTCAATGCTACCATTGCAAGAAGTTT
GGTCATGTAAAGGCAGACTATTGCGGTTGTTCGAATCACATGACAGGTTTGAAGCCTGTATTCAAAGAGCTTAACGAAGGAGAAAAGTTGAAGGTAGAGCTCGAA
AACGGCAAGGAGCTACAAGTAGAAGGCAAAGGAACGGTGGGAATTGAAACTCACCATGGAAATAGAATTCTCACAAATGTTCAGTATGTGCCTGATATTGGATAT
AATTTTCTGAGTGTTGGACAACTAATGGAGAGTGGGTTTTCTATCTTGTTCGATGATGAAAGCAAATCAGAAACATTTGAGAAGTTCAAGCATTTCAAGGCAAAG
GTAGAAAAGCAGAGTGGCATGTTCATCAAATCTCTTCGCAGTGATAGAGGTGGAGAATTTTTGTCCAACAACTTTAACCATTTTTGCAAGGAAGATGGCATCCAT
AGGGAGTTGACAACACCATATACTCCGGAGCAAAATGGAGTAGCTGAGAGGAAGAATCGAACTGTGGTGGAGATGGCAAGAAGCATGTTGCAAATGAAAGGCCTT
TTGAATGATTTTTGGGCTGAAGCAGTCTCAGCTTCCATACCTACTGAACATCTCACCAACAAAGGCTGTCATGAATAA
mRNA sequence
ATGGGCAGCAATGGCAATGCTATGGGTACAACACAACCACTCATTCCAATCTTCAAAGGAGAAGGCTACGAGTTTTGGAGTATACCTATGAAGACTCTTCTCAGA
TCTCAAGACTTATGGGACTTAGTAGAACAAGGCTATGTAGATCCTGACGACAAAGGCAAGTTGCGGGAGAACAAGAAGAAAGACTCGAAGGCGTTGGTGATTATT
CAACAAGCTGTCCATGATAGTGTTTTTTCGCGGATTGCTGCATTTCAAGGAGATTCAAGAGTACTTGTGGTTAAATTGAAATCACTTAGACGAGACTTTGAGACC
TTGATGATGAAAAATGGAGAATCAATTGCTGATTTTTTGTCACGGGCAATGACAATTATTAGTCAGATGCAAACATATGGCGAGACGATTAAGGACCAGACTATA
GTGGAGAAAGTATTGATAAGTTTGACTCCAAAGTTTGATCATGTTGTGGCTGCAATAGAAGAATCAAAGAATTTGTCCACTTTCACATTTATTGAATTAATGGGA
TCTCTTGAAGCACACGAGTCGAGAATCAATAGATCGATGGAAAGAAACGAAGAAAAAGCGTTTCAGGTAAAGGATGTAGTTCCAAAGTCTAATGACAGTGATTGT
GTGATGACTCGAGGCCGAGGAAGAGGAGGATATCGTGGTCGAGGTCGTGGTACCGGAAAAGGGATCAACAAAGCTAATATTCAATGCTACCATTGCAAGAAGTTT
GGTCATGTAAAGGCAGACTATTGCGGTTGTTCGAATCACATGACAGGTTTGAAGCCTGTATTCAAAGAGCTTAACGAAGGAGAAAAGTTGAAGGTAGAGCTCGAA
AACGGCAAGGAGCTACAAGTAGAAGGCAAAGGAACGGTGGGAATTGAAACTCACCATGGAAATAGAATTCTCACAAATGTTCAGTATGTGCCTGATATTGGATAT
AATTTTCTGAGTGTTGGACAACTAATGGAGAGTGGGTTTTCTATCTTGTTCGATGATGAAAGCAAATCAGAAACATTTGAGAAGTTCAAGCATTTCAAGGCAAAG
GTAGAAAAGCAGAGTGGCATGTTCATCAAATCTCTTCGCAGTGATAGAGGTGGAGAATTTTTGTCCAACAACTTTAACCATTTTTGCAAGGAAGATGGCATCCAT
AGGGAGTTGACAACACCATATACTCCGGAGCAAAATGGAGTAGCTGAGAGGAAGAATCGAACTGTGGTGGAGATGGCAAGAAGCATGTTGCAAATGAAAGGCCTT
TTGAATGATTTTTGGGCTGAAGCAGTCTCAGCTTCCATACCTACTGAACATCTCACCAACAAAGGCTGTCATGAATAA
Protein sequence
MGSNGNAMGTTQPLIPIFKGEGYEFWSIPMKTLLRSQDLWDLVEQGYVDPDDKGKLRENKKKDSKALVIIQQAVHDSVFSRIAAFQGDSRVLVVKLKSLRRDFET
LMMKNGESIADFLSRAMTIISQMQTYGETIKDQTIVEKVLISLTPKFDHVVAAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKDVVPKSNDSDC
VMTRGRGRGGYRGRGRGTGKGINKANIQCYHCKKFGHVKADYCGCSNHMTGLKPVFKELNEGEKLKVELENGKELQVEGKGTVGIETHHGNRILTNVQYVPDIGY
NFLSVGQLMESGFSILFDDESKSETFEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCKEDGIHRELTTPYTPEQNGVAERKNRTVVEMARSMLQMKGL
LNDFWAEAVSASIPTEHLTNKGCHE
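
The protein sequence above can be checked against the CDS by translating codon by codon. The Python below is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The names `CODON_TABLE`, `translate`, `CDS_HEAD` and `CDS_TAIL` are hypothetical and introduced only for illustration. To keep the block short it translates just the first 27 nucleotides and the final 78 nucleotides of the CDS rather than the whole sequence.

```python
# Standard genetic code, built in TCAG order (assumption: NCBI table 1).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First nine codons of the CDS sequence listed in this record.
CDS_HEAD = "ATGGGCAGCAATGGCAATGCTATGGGT"

# Final line of the CDS sequence listed in this record (ends in the TAA stop).
CDS_TAIL = (
    "TTGAATGATTTTTGGGCTGAAGCAGTCTCAGCTTCCATACCTACTGAACATCTCACCAAC"
    "AAAGGCTGTCATGAATAA"
)

print(translate(CDS_HEAD))  # MGSNGNAMG
print(translate(CDS_TAIL))  # LNDFWAEAVSASIPTEHLTNKGCHE
```

Both outputs match the start and the end of the listed protein sequence. Passing the full CDS, joined into one string, to the same function would translate the complete protein.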