; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0003401 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0003401
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr07:10610564..10612555
RNA-Seq ExpressionPay0003401
SyntenyPay0003401
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0055915.1 copia protein [Cucumis melo var. makuwa]1.5e-15661.7Show/hide
Query:  MSNNDNAMGTAQPLIPIFKGE----------TLLKSQDLWDLVEQGYADPDDEGKLRENMKKDSKALVITQQAVHDS---------------CILQKAFQ
        MSNN N MGT QPLIPIFKGE          TLL SQDLWDLVEQGY DPDDEGKL+EN +KD KALVI QQAVHD+                ILQKAFQ
Subjt:  MSNNDNAMGTAQPLIPIFKGE----------TLLKSQDLWDLVEQGYADPDDEGKLRENMKKDSKALVITQQAVHDS---------------CILQKAFQ

Query:  GDSRVLVVKLQSLRRDFETLMMKNGESI-------------MQTYGEAIKDQTIVEKVLRSLTPKFDHVVVAIEESKNLSTFTFIELMGSLEAHESRINR
        GDSRVLVVKLQSL+RDFETLMMKNGESI             MQTYGE I DQTIVEKVLRSLTPKFDHVV AIEESK+LSTFTFIELMGSL+AHESRIN 
Subjt:  GDSRVLVVKLQSLRRDFETLMMKNGESI-------------MQTYGEAIKDQTIVEKVLRSLTPKFDHVVVAIEESKNLSTFTFIELMGSLEAHESRINR

Query:  SMERNEEKAFQVKDIVPKYNDNDRVMTRGRGRGRYRGRGRGTGK-----------GLQSSNNANIQCYHCKKFGHVKADC--------------------
        SME+N+EKAF+VKD+VPKYND+D VMT+G+G G YR RGRGTGK           G+QSSN ANIQCYHCKKFGHVKADC                    
Subjt:  SMERNEEKAFQVKDIVPKYNDNDRVMTRGRGRGRYRGRGRGTGK-----------GLQSSNNANIQCYHCKKFGHVKADC--------------------

Query:  ---------------------------------LKPVFKELNEGEKLKVELGNSKELQVEGKGTVGIETHHGNRILTNVQYV-------PDIGYNLLSVG
                                         LKPVFKELNEGEKLKVELGNSKELQVEGK  +GIETH+GNRILTNVQ         P    N+ +  
Subjt:  ---------------------------------LKPVFKELNEGEKLKVELGNSKELQVEGKGTVGIETHHGNRILTNVQYV-------PDIGYNLLSVG

Query:  QLMESGYSILFDDGACLIKNKQTERKFKHFKTKVEKQSDMSIKSLRSDRSGEFLSNNFNHFCKEHGIHRKLTIPYTPKQNGVAERKNRTVVEMARIMLQM
            +  +   +     +   +T  KFKHFK KVEKQS M IKSLRSDR G+FLSNNFNHFC+EHGIHR+LT PYT +QNGVAERKNRTVVEMAR MLQM
Subjt:  QLMESGYSILFDDGACLIKNKQTERKFKHFKTKVEKQSDMSIKSLRSDRSGEFLSNNFNHFCKEHGIHRKLTIPYTPKQNGVAERKNRTVVEMARIMLQM

Query:  KGLSNDFWAEAVSTS------RPTEHLTNK
        KGLSNDFW EAVSTS       PT+ + NK
Subjt:  KGLSNDFWAEAVSTS------RPTEHLTNK

KAE8650579.1 hypothetical protein Csa_010963 [Cucumis sativus]6.5e-13671.88Show/hide
Query:  MSNNDNAMGTAQPLIPIFKGE----------TLLKSQDLWDLVEQGYADPDDEGKLRENMKKDSKALVITQQAVHDS---------------CILQKAFQ
        M NN NAMGT QPLI IFKGE          TLL+SQDLWDLVE  YADPDDEGKLRE  +KDSKALVI QQAVHDS                ILQKAF+
Subjt:  MSNNDNAMGTAQPLIPIFKGE----------TLLKSQDLWDLVEQGYADPDDEGKLRENMKKDSKALVITQQAVHDS---------------CILQKAFQ

Query:  GDSRVLVVKLQSLRRDFETLMMKNGESI-------------MQTYGEAIKDQTIVEKVLRSLTPKFDHVVVAIEESKNLSTFTFIELMGSLEAHESRINR
        GD RVLVVKLQSLR++FETLMMKN ESI             MQTYGE I DQTIVEKVLRSLT KFD VV AIEESK+LSTFTFIELMGSL+AHESRINR
Subjt:  GDSRVLVVKLQSLRRDFETLMMKNGESI-------------MQTYGEAIKDQTIVEKVLRSLTPKFDHVVVAIEESKNLSTFTFIELMGSLEAHESRINR

Query:  SMERNEEKAFQVKDIVPKYNDNDRVMTRGRGRGRYRGRGRGTGKG-----------LQSSNNANIQCYHCKKFGHVKADC--------------------
        SMERNEEKAFQVKD+VPKYN++DRVMTRGRGRG YRG+GRGT KG           +QSSN ANIQCYH KKFGHVKADC                    
Subjt:  SMERNEEKAFQVKDIVPKYNDNDRVMTRGRGRGRYRGRGRGTGKG-----------LQSSNNANIQCYHCKKFGHVKADC--------------------

Query:  -LKPVFKELNEGEKLKVELGNSKELQVEGKGTVGIETHHGNRILTNVQYVPDIGYNLLSVGQLMESGYSILFDDGACLIKNKQT
         LKP+F ELNEGEKLKVELGN+KELQVE KGTVGIETHHGNRILTNVQYVPDIGYNLLSVGQLMESG+SILFDDGACLIKNKQT
Subjt:  -LKPVFKELNEGEKLKVELGNSKELQVEGKGTVGIETHHGNRILTNVQYVPDIGYNLLSVGQLMESGYSILFDDGACLIKNKQT

TYK12002.1 UBN2 domain-containing protein [Cucumis melo var. makuwa]6.3e-12366.24Show/hide
Query:  DLVEQGYADPDDEGKLRENMKKDSKALVITQQAVHDS---------------CILQKAFQGDSRVLVVKLQSLRRDFETLMMKNGESI------------
        +LVEQGYADPDDEGKLR N KKDSK LVI QQAVHDS                ILQK FQGDSRVLVVKLQSLRRDFETLMMKNGESI            
Subjt:  DLVEQGYADPDDEGKLRENMKKDSKALVITQQAVHDS---------------CILQKAFQGDSRVLVVKLQSLRRDFETLMMKNGESI------------

Query:  -MQTYGEAIKDQTIVEKVLRSLTPKFDHVVVAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKDIVPKYNDNDRVMTRGRGRGRYRGRGR
         MQTY E IKD TIVEKVLRSLTPKFDHVV  IEESK+LSTFTFIELMGSL+AHESRINRSMERNEEKAFQVKD+V KYND+DRV TRGRGRG YRGRG 
Subjt:  -MQTYGEAIKDQTIVEKVLRSLTPKFDHVVVAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKDIVPKYNDNDRVMTRGRGRGRYRGRGR

Query:  GTGK-----------GLQSSNNANIQCYHCKKFGHVKADC----LKPVFKELNEGEKL--KVELGNSKELQVEGKGTVGIETHHGNRILTNVQYVPDIGY
        G  K           G+QSSN ANIQCYH KKFGHVKADC     +   ++  E +++  +VELGN KELQVEGKGTVGIETHHGNRILT          
Subjt:  GTGK-----------GLQSSNNANIQCYHCKKFGHVKADC----LKPVFKELNEGEKL--KVELGNSKELQVEGKGTVGIETHHGNRILTNVQYVPDIGY

Query:  NLLSVGQLMESGYSILFDDGACLIKNK-QTERKFKHFKTKVEKQSDMSIKSLRSDRSGEFLSNNFNHFCKEHGIHRKLTIPYTPKQNGVAERKN
                              + K+K +T  KFKHFK KVEKQS M IKSLRSDR GEFLSNNFNHFC+EHGIHR+LT PYTP+QNGV ERKN
Subjt:  NLLSVGQLMESGYSILFDDGACLIKNK-QTERKFKHFKTKVEKQSDMSIKSLRSDRSGEFLSNNFNHFCKEHGIHRKLTIPYTPKQNGVAERKN

TYK27735.1 putative gag-pol polyprotein, identical [Cucumis melo var. makuwa]4.1e-16255.89Show/hide
Query:  ETLLKSQDLWDLVEQGYADPDDEGKLRENMKKDSKALVITQQAVHDS---------------CILQKAFQGDSRVLVVKLQSLRRDFETLMMKNGESI--
        +TLL+SQDLWDLVEQGY DPDDEGKLREN KKDSKALVI QQAVHDS                ILQKAFQGDSRVL+VKLQSLRRDFETLMMKNGESI  
Subjt:  ETLLKSQDLWDLVEQGYADPDDEGKLRENMKKDSKALVITQQAVHDS---------------CILQKAFQGDSRVLVVKLQSLRRDFETLMMKNGESI--

Query:  -----------MQTYGEAIKDQTIVEKVLRSLTPKFDHVVVAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKDIVPKYNDNDRVMTRGR
                   MQTYGE IKDQTIVEKVLRSLTPKFDHVV AIEESKNL TFTFIELMGSLEAHESRINRSMERNEEKAFQVKD VPKYND+DRVMTRGR
Subjt:  -----------MQTYGEAIKDQTIVEKVLRSLTPKFDHVVVAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKDIVPKYNDNDRVMTRGR

Query:  GRGRYRGRGRGTGK-----------GLQSSNNANIQCYHCKKFGHVKADC--------------------------------------------------
        GRG YRGRG GT K           G+QSSN ANIQCYHCKKFGHVKADC                                                  
Subjt:  GRGRYRGRGRGTGK-----------GLQSSNNANIQCYHCKKFGHVKADC--------------------------------------------------

Query:  ---LKPVFKELNEGEKLKVELGNSKELQVEGKGTVGIETHHGNRILTNVQYVPDIGYNLLSVGQLMESGYSILFDDGACLIKNKQTER------------
           LKPVFKELNEGEKLKV+L N KELQVEGKGTV IETHHGNRILTNVQYVPDIGYNLLSVGQLMESGYSILFDDGACLIKNKQT R            
Subjt:  ---LKPVFKELNEGEKLKVELGNSKELQVEGKGTVGIETHHGNRILTNVQYVPDIGYNLLSVGQLMESGYSILFDDGACLIKNKQTER------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------KFKHFKTKVEKQSDMSIKSLRSDRSGEFLSNNFNHFCKEHGIHRKLTIPYTPKQNGVAERKNRTVVEM
                                        KFKHFK KVEKQS M IKSLRSDR  EFLSNNFNHFCKEHGIHR+LT PYTP+QNGVAERKN+TVVEM
Subjt:  --------------------------------KFKHFKTKVEKQSDMSIKSLRSDRSGEFLSNNFNHFCKEHGIHRKLTIPYTPKQNGVAERKNRTVVEM

Query:  ARIMLQMKGLSNDFWAEAVSTS------RPTEHLTNK
        AR MLQMKGL NDFWAEAVS S       PT+ + NK
Subjt:  ARIMLQMKGLSNDFWAEAVSTS------RPTEHLTNK

TYK28117.1 putative gag-pol polyprotein, identical [Cucumis melo var. makuwa]5.3e-16258.5Show/hide
Query:  MSNNDNAMGTAQPLIPIFKGE----------TLLKSQDLWDLVEQGYADPDDEGKLRENMKKDSKALVITQQAVHDSCILQKAFQGDSRVLVVKLQSLRR
        MSNN N MGTAQPLIPIFKGE          TLL SQDLWDLVEQGY DPDDEGKL+EN +KDSKALVI QQAVHD+          SR+      +  R
Subjt:  MSNNDNAMGTAQPLIPIFKGE----------TLLKSQDLWDLVEQGYADPDDEGKLRENMKKDSKALVITQQAVHDSCILQKAFQGDSRVLVVKLQSLRR

Query:  DFETLMMKNGESI-------------MQTYGEAIKDQTIVEKVLRSLTPKFDHVVVAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKDI
        DFETLMMKNGESI             MQTYGE I DQTIVEKVLRSLTPKFDHVVVAIEESK+LSTFTFIELMGSL+AHESRIN SME+NEEKAF+VKD+
Subjt:  DFETLMMKNGESI-------------MQTYGEAIKDQTIVEKVLRSLTPKFDHVVVAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKDI

Query:  VPKYNDNDRVMTRGRGRGRYRGRGRGTGK-----------GLQSSNNANIQCYHCKKFGHVKADC-----------------------------------
        VPKYND+D VMT+G+G G YR RGRGTGK           G+QSSN ANIQCYHCKKFGHVKADC                                   
Subjt:  VPKYNDNDRVMTRGRGRGRYRGRGRGTGK-----------GLQSSNNANIQCYHCKKFGHVKADC-----------------------------------

Query:  ------------------LKPVFKELNEGEKLKVELGNSKELQVEGKGTVGIETHHGNRILTNVQYVPDIGYNLLSVGQLMESGYSILFDDGACLIKNKQ
                          LKPVFKELNEGEKLKVELGN KELQVEGK T+GIETH+GNRILTNVQYVPDIGYNLLSVGQLMESG+SILFDDGACLIKNKQ
Subjt:  ------------------LKPVFKELNEGEKLKVELGNSKELQVEGKGTVGIETHHGNRILTNVQYVPDIGYNLLSVGQLMESGYSILFDDGACLIKNKQ

Query:  TER--------------------------------------------------------------------------------KFKHFKTKVEKQSDMSI
        T R                                                                                KFKHFK KVEKQS M I
Subjt:  TER--------------------------------------------------------------------------------KFKHFKTKVEKQSDMSI

Query:  KSLRSDRSGEFLSNNFNHFCKEHGIHRKLTIPYTPKQNGVAERKNRTVVEMARIMLQMKGLSNDFWAEAVSTS------RPTEHLTNK
        KS RSDR G+FLSNNFNHFC+EHGIHR+LT PYT +QNGVAERKNRTVVEMAR MLQMKGLSNDFW EA STS       PT+ + NK
Subjt:  KSLRSDRSGEFLSNNFNHFCKEHGIHRKLTIPYTPKQNGVAERKNRTVVEMARIMLQMKGLSNDFWAEAVSTS------RPTEHLTNK

TrEMBL top hitse value%identityAlignment
A0A5A7TZP7 Putative gag-pol polyprotein, identical5.4e-12067.26Show/hide
Query:  MSNNDNAMGTAQPLIPIFKGE----------TLLKSQDLWDLVEQGYADPDDEGKLRENMKKDSKALVITQQAVHDSCILQKAFQGDSRVLVVKLQSLRR
        M +N N MGT QPLIPIFKGE          TLL+SQDLWDLVEQGY DPDD+GKLREN KKDSKALVI QQAVHDS          SR++     +  R
Subjt:  MSNNDNAMGTAQPLIPIFKGE----------TLLKSQDLWDLVEQGYADPDDEGKLRENMKKDSKALVITQQAVHDSCILQKAFQGDSRVLVVKLQSLRR

Query:  DFETLMMKNGESI-------------MQTYGEAIKDQTIVEKVLRSLTPKFDHVVVAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKDI
        DFETLMMKNGESI             MQTYGE IKDQTIVEKVL SLTPKFDHVVVAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQ    
Subjt:  DFETLMMKNGESI-------------MQTYGEAIKDQTIVEKVLRSLTPKFDHVVVAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKDI

Query:  VPKYNDNDRVMTRGRGRGRYRGRGRGTGKGLQSSNNANIQCYHCKKFGHV---KADC------LKPVFKELNEGEKLKVELGNSKELQVEGKGTVGIETH
           +  N RV                 GKG       NI     +K G V    + C      LKPVFKELNEGEKLKVEL N KELQVEGKGTVGIETH
Subjt:  VPKYNDNDRVMTRGRGRGRYRGRGRGTGKGLQSSNNANIQCYHCKKFGHV---KADC------LKPVFKELNEGEKLKVELGNSKELQVEGKGTVGIETH

Query:  HGNRILTNVQYVPDIGYNLLSVGQLMESGYSILFDDGACLIKNKQTERKFKHFKTKVEKQSDMSIKSLRSDRSGEFLSNNFNHFCKEHGIH
        HGNRILTNVQYVPDIGYNLLSVGQLMESG SILFDD +      +T  KFKHFK KVEKQS M IKSLRSDR GEFLSNNFNHFCKEHGIH
Subjt:  HGNRILTNVQYVPDIGYNLLSVGQLMESGYSILFDDGACLIKNKQTERKFKHFKTKVEKQSDMSIKSLRSDRSGEFLSNNFNHFCKEHGIH

A0A5A7UQM0 Copia protein7.2e-15761.7Show/hide
Query:  MSNNDNAMGTAQPLIPIFKGE----------TLLKSQDLWDLVEQGYADPDDEGKLRENMKKDSKALVITQQAVHDS---------------CILQKAFQ
        MSNN N MGT QPLIPIFKGE          TLL SQDLWDLVEQGY DPDDEGKL+EN +KD KALVI QQAVHD+                ILQKAFQ
Subjt:  MSNNDNAMGTAQPLIPIFKGE----------TLLKSQDLWDLVEQGYADPDDEGKLRENMKKDSKALVITQQAVHDS---------------CILQKAFQ

Query:  GDSRVLVVKLQSLRRDFETLMMKNGESI-------------MQTYGEAIKDQTIVEKVLRSLTPKFDHVVVAIEESKNLSTFTFIELMGSLEAHESRINR
        GDSRVLVVKLQSL+RDFETLMMKNGESI             MQTYGE I DQTIVEKVLRSLTPKFDHVV AIEESK+LSTFTFIELMGSL+AHESRIN 
Subjt:  GDSRVLVVKLQSLRRDFETLMMKNGESI-------------MQTYGEAIKDQTIVEKVLRSLTPKFDHVVVAIEESKNLSTFTFIELMGSLEAHESRINR

Query:  SMERNEEKAFQVKDIVPKYNDNDRVMTRGRGRGRYRGRGRGTGK-----------GLQSSNNANIQCYHCKKFGHVKADC--------------------
        SME+N+EKAF+VKD+VPKYND+D VMT+G+G G YR RGRGTGK           G+QSSN ANIQCYHCKKFGHVKADC                    
Subjt:  SMERNEEKAFQVKDIVPKYNDNDRVMTRGRGRGRYRGRGRGTGK-----------GLQSSNNANIQCYHCKKFGHVKADC--------------------

Query:  ---------------------------------LKPVFKELNEGEKLKVELGNSKELQVEGKGTVGIETHHGNRILTNVQYV-------PDIGYNLLSVG
                                         LKPVFKELNEGEKLKVELGNSKELQVEGK  +GIETH+GNRILTNVQ         P    N+ +  
Subjt:  ---------------------------------LKPVFKELNEGEKLKVELGNSKELQVEGKGTVGIETHHGNRILTNVQYV-------PDIGYNLLSVG

Query:  QLMESGYSILFDDGACLIKNKQTERKFKHFKTKVEKQSDMSIKSLRSDRSGEFLSNNFNHFCKEHGIHRKLTIPYTPKQNGVAERKNRTVVEMARIMLQM
            +  +   +     +   +T  KFKHFK KVEKQS M IKSLRSDR G+FLSNNFNHFC+EHGIHR+LT PYT +QNGVAERKNRTVVEMAR MLQM
Subjt:  QLMESGYSILFDDGACLIKNKQTERKFKHFKTKVEKQSDMSIKSLRSDRSGEFLSNNFNHFCKEHGIHRKLTIPYTPKQNGVAERKNRTVVEMARIMLQM

Query:  KGLSNDFWAEAVSTS------RPTEHLTNK
        KGLSNDFW EAVSTS       PT+ + NK
Subjt:  KGLSNDFWAEAVSTS------RPTEHLTNK

A0A5D3CL10 UBN2 domain-containing protein3.1e-12366.24Show/hide
Query:  DLVEQGYADPDDEGKLRENMKKDSKALVITQQAVHDS---------------CILQKAFQGDSRVLVVKLQSLRRDFETLMMKNGESI------------
        +LVEQGYADPDDEGKLR N KKDSK LVI QQAVHDS                ILQK FQGDSRVLVVKLQSLRRDFETLMMKNGESI            
Subjt:  DLVEQGYADPDDEGKLRENMKKDSKALVITQQAVHDS---------------CILQKAFQGDSRVLVVKLQSLRRDFETLMMKNGESI------------

Query:  -MQTYGEAIKDQTIVEKVLRSLTPKFDHVVVAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKDIVPKYNDNDRVMTRGRGRGRYRGRGR
         MQTY E IKD TIVEKVLRSLTPKFDHVV  IEESK+LSTFTFIELMGSL+AHESRINRSMERNEEKAFQVKD+V KYND+DRV TRGRGRG YRGRG 
Subjt:  -MQTYGEAIKDQTIVEKVLRSLTPKFDHVVVAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKDIVPKYNDNDRVMTRGRGRGRYRGRGR

Query:  GTGK-----------GLQSSNNANIQCYHCKKFGHVKADC----LKPVFKELNEGEKL--KVELGNSKELQVEGKGTVGIETHHGNRILTNVQYVPDIGY
        G  K           G+QSSN ANIQCYH KKFGHVKADC     +   ++  E +++  +VELGN KELQVEGKGTVGIETHHGNRILT          
Subjt:  GTGK-----------GLQSSNNANIQCYHCKKFGHVKADC----LKPVFKELNEGEKL--KVELGNSKELQVEGKGTVGIETHHGNRILTNVQYVPDIGY

Query:  NLLSVGQLMESGYSILFDDGACLIKNK-QTERKFKHFKTKVEKQSDMSIKSLRSDRSGEFLSNNFNHFCKEHGIHRKLTIPYTPKQNGVAERKN
                              + K+K +T  KFKHFK KVEKQS M IKSLRSDR GEFLSNNFNHFC+EHGIHR+LT PYTP+QNGV ERKN
Subjt:  NLLSVGQLMESGYSILFDDGACLIKNK-QTERKFKHFKTKVEKQSDMSIKSLRSDRSGEFLSNNFNHFCKEHGIHRKLTIPYTPKQNGVAERKN

A0A5D3DWC7 Putative gag-pol polyprotein, identical2.6e-16258.5Show/hide
Query:  MSNNDNAMGTAQPLIPIFKGE----------TLLKSQDLWDLVEQGYADPDDEGKLRENMKKDSKALVITQQAVHDSCILQKAFQGDSRVLVVKLQSLRR
        MSNN N MGTAQPLIPIFKGE          TLL SQDLWDLVEQGY DPDDEGKL+EN +KDSKALVI QQAVHD+          SR+      +  R
Subjt:  MSNNDNAMGTAQPLIPIFKGE----------TLLKSQDLWDLVEQGYADPDDEGKLRENMKKDSKALVITQQAVHDSCILQKAFQGDSRVLVVKLQSLRR

Query:  DFETLMMKNGESI-------------MQTYGEAIKDQTIVEKVLRSLTPKFDHVVVAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKDI
        DFETLMMKNGESI             MQTYGE I DQTIVEKVLRSLTPKFDHVVVAIEESK+LSTFTFIELMGSL+AHESRIN SME+NEEKAF+VKD+
Subjt:  DFETLMMKNGESI-------------MQTYGEAIKDQTIVEKVLRSLTPKFDHVVVAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKDI

Query:  VPKYNDNDRVMTRGRGRGRYRGRGRGTGK-----------GLQSSNNANIQCYHCKKFGHVKADC-----------------------------------
        VPKYND+D VMT+G+G G YR RGRGTGK           G+QSSN ANIQCYHCKKFGHVKADC                                   
Subjt:  VPKYNDNDRVMTRGRGRGRYRGRGRGTGK-----------GLQSSNNANIQCYHCKKFGHVKADC-----------------------------------

Query:  ------------------LKPVFKELNEGEKLKVELGNSKELQVEGKGTVGIETHHGNRILTNVQYVPDIGYNLLSVGQLMESGYSILFDDGACLIKNKQ
                          LKPVFKELNEGEKLKVELGN KELQVEGK T+GIETH+GNRILTNVQYVPDIGYNLLSVGQLMESG+SILFDDGACLIKNKQ
Subjt:  ------------------LKPVFKELNEGEKLKVELGNSKELQVEGKGTVGIETHHGNRILTNVQYVPDIGYNLLSVGQLMESGYSILFDDGACLIKNKQ

Query:  TER--------------------------------------------------------------------------------KFKHFKTKVEKQSDMSI
        T R                                                                                KFKHFK KVEKQS M I
Subjt:  TER--------------------------------------------------------------------------------KFKHFKTKVEKQSDMSI

Query:  KSLRSDRSGEFLSNNFNHFCKEHGIHRKLTIPYTPKQNGVAERKNRTVVEMARIMLQMKGLSNDFWAEAVSTS------RPTEHLTNK
        KS RSDR G+FLSNNFNHFC+EHGIHR+LT PYT +QNGVAERKNRTVVEMAR MLQMKGLSNDFW EA STS       PT+ + NK
Subjt:  KSLRSDRSGEFLSNNFNHFCKEHGIHRKLTIPYTPKQNGVAERKNRTVVEMARIMLQMKGLSNDFWAEAVSTS------RPTEHLTNK

A0A5D3DWP2 Putative gag-pol polyprotein, identical2.0e-16255.89Show/hide
Query:  ETLLKSQDLWDLVEQGYADPDDEGKLRENMKKDSKALVITQQAVHDS---------------CILQKAFQGDSRVLVVKLQSLRRDFETLMMKNGESI--
        +TLL+SQDLWDLVEQGY DPDDEGKLREN KKDSKALVI QQAVHDS                ILQKAFQGDSRVL+VKLQSLRRDFETLMMKNGESI  
Subjt:  ETLLKSQDLWDLVEQGYADPDDEGKLRENMKKDSKALVITQQAVHDS---------------CILQKAFQGDSRVLVVKLQSLRRDFETLMMKNGESI--

Query:  -----------MQTYGEAIKDQTIVEKVLRSLTPKFDHVVVAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKDIVPKYNDNDRVMTRGR
                   MQTYGE IKDQTIVEKVLRSLTPKFDHVV AIEESKNL TFTFIELMGSLEAHESRINRSMERNEEKAFQVKD VPKYND+DRVMTRGR
Subjt:  -----------MQTYGEAIKDQTIVEKVLRSLTPKFDHVVVAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKDIVPKYNDNDRVMTRGR

Query:  GRGRYRGRGRGTGK-----------GLQSSNNANIQCYHCKKFGHVKADC--------------------------------------------------
        GRG YRGRG GT K           G+QSSN ANIQCYHCKKFGHVKADC                                                  
Subjt:  GRGRYRGRGRGTGK-----------GLQSSNNANIQCYHCKKFGHVKADC--------------------------------------------------

Query:  ---LKPVFKELNEGEKLKVELGNSKELQVEGKGTVGIETHHGNRILTNVQYVPDIGYNLLSVGQLMESGYSILFDDGACLIKNKQTER------------
           LKPVFKELNEGEKLKV+L N KELQVEGKGTV IETHHGNRILTNVQYVPDIGYNLLSVGQLMESGYSILFDDGACLIKNKQT R            
Subjt:  ---LKPVFKELNEGEKLKVELGNSKELQVEGKGTVGIETHHGNRILTNVQYVPDIGYNLLSVGQLMESGYSILFDDGACLIKNKQTER------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------KFKHFKTKVEKQSDMSIKSLRSDRSGEFLSNNFNHFCKEHGIHRKLTIPYTPKQNGVAERKNRTVVEM
                                        KFKHFK KVEKQS M IKSLRSDR  EFLSNNFNHFCKEHGIHR+LT PYTP+QNGVAERKN+TVVEM
Subjt:  --------------------------------KFKHFKTKVEKQSDMSIKSLRSDRSGEFLSNNFNHFCKEHGIHRKLTIPYTPKQNGVAERKNRTVVEM

Query:  ARIMLQMKGLSNDFWAEAVSTS------RPTEHLTNK
        AR MLQMKGL NDFWAEAVS S       PT+ + NK
Subjt:  ARIMLQMKGLSNDFWAEAVSTS------RPTEHLTNK

SwissProt top hitse value%identityAlignment
O13527 Truncated transposon Ty1-A Gag-Pol polyprotein3.3e-0531.71Show/hide
Query:  VEKQSDMSIKSLRSDRSGEFLSNNFNHFCKEHGIHRKLTIPYTPKQNGVAERKNRTVVEMARIMLQMKGLSNDFWAEAVSTS
        ++ Q   S+  ++ DR  E+ +   + F +++GI    T     + +GVAER NRT+++  R  LQ  GL N  W  A+  S
Subjt:  VEKQSDMSIKSLRSDRSGEFLSNNFNHFCKEHGIHRKLTIPYTPKQNGVAERKNRTVVEMARIMLQMKGLSNDFWAEAVSTS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.4e-1843.1Show/hide
Query:  YSILFDDGAC------LIKNK-QTERKFKHFKTKVEKQSDMSIKSLRSDRSGEFLSNNFNHFCKEHGIHRKLTIPYTPKQNGVAERKNRTVVEMARIMLQ
        Y + F D A       ++K K Q  + F+ F   VE+++   +K LRSD  GE+ S  F  +C  HGI  + T+P TP+ NGVAER NRT+VE  R ML+
Subjt:  YSILFDDGAC------LIKNK-QTERKFKHFKTKVEKQSDMSIKSLRSDRSGEFLSNNFNHFCKEHGIHRKLTIPYTPKQNGVAERKNRTVVEMARIMLQ

Query:  MKGLSNDFWAEAVSTS
        M  L   FW EAV T+
Subjt:  MKGLSNDFWAEAVSTS

Q07163 Transposon TyH3 Gag-Pol polyprotein3.3e-0531.71Show/hide
Query:  VEKQSDMSIKSLRSDRSGEFLSNNFNHFCKEHGIHRKLTIPYTPKQNGVAERKNRTVVEMARIMLQMKGLSNDFWAEAVSTS
        ++ Q   S+  ++ DR  E+ +   + F +++GI    T     + +GVAER NRT+++  R  LQ  GL N  W  A+  S
Subjt:  VEKQSDMSIKSLRSDRSGEFLSNNFNHFCKEHGIHRKLTIPYTPKQNGVAERKNRTVVEMARIMLQMKGLSNDFWAEAVSTS

Q12490 Transposon Ty1-BL Gag-Pol polyprotein3.3e-0531.71Show/hide
Query:  VEKQSDMSIKSLRSDRSGEFLSNNFNHFCKEHGIHRKLTIPYTPKQNGVAERKNRTVVEMARIMLQMKGLSNDFWAEAVSTS
        ++ Q   S+  ++ DR  E+ +   + F +++GI    T     + +GVAER NRT+++  R  LQ  GL N  W  A+  S
Subjt:  VEKQSDMSIKSLRSDRSGEFLSNNFNHFCKEHGIHRKLTIPYTPKQNGVAERKNRTVVEMARIMLQMKGLSNDFWAEAVSTS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.6e-0733.33Show/hide
Query:  LIKNKQTERKFKHFKTKVEKQSDMSIKSLRSDRSGEFLSNNFNHFCKEHGIHRKLTIPYTPKQNGVAERKNRTVVEMARIMLQMKGLSNDFWAEAVSTS
        L +  Q +  F  FK+ VE +    I +L SD  GEF+      +  +HGI    + P+TP+ NG++ERK+R +VEM   +L    +   +W  A S +
Subjt:  LIKNKQTERKFKHFKTKVEKQSDMSIKSLRSDRSGEFLSNNFNHFCKEHGIHRKLTIPYTPKQNGVAERKNRTVVEMARIMLQMKGLSNDFWAEAVSTS

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAACAATGACAATGCTATGGGTACAGCACAACCACTAATTCCAATCTTCAAAGGAGAAACTCTTCTCAAATCTCAAGACTTATGGGACTTAGTAGAACAAGGCTA
TGCAGATCCTGACGACGAAGGCAAGTTGCGGGAGAACATGAAGAAAGACTCGAAGGCATTGGTGATTACTCAACAAGCAGTCCATGATAGTTGTATTTTGCAAAAGGCAT
TTCAAGGAGATTCAAGAGTACTTGTGGTTAAATTGCAATCACTTAGACGAGACTTCGAGACCTTGATGATGAAAAATGGAGAATCAATTATGCAAACATACGGCGAGGCG
ATTAAGGATCAGACTATAGTGGAGAAAGTATTGAGAAGTTTGACTCCAAAGTTTGATCATGTTGTAGTTGCAATAGAAGAATCAAAGAATCTGTCCACTTTCACATTTAT
TGAATTAATGGGATCTCTTGAAGCACATGAGTCGAGAATCAATAGATCGATGGAAAGAAACGAAGAAAAAGCGTTTCAGGTAAAGGATATAGTTCCAAAGTATAATGACA
ATGATCGTGTGATGACTCGAGGCCGAGGAAGAGGAAGATATCGTGGTCGAGGTCGTGGTACCGGAAAAGGATTGCAATCAAGCAACAATGCTAATATTCAATGCTACCAT
TGCAAGAAGTTTGGTCATGTAAAGGCAGACTGTTTGAAGCCTGTATTCAAGGAGCTTAACGAAGGAGAAAAGTTGAAGGTAGAGCTCGGAAACAGTAAGGAGCTACAAGT
AGAAGGCAAAGGAACGGTGGGAATTGAAACTCACCATGGAAATAGAATTCTCACAAATGTTCAGTATGTGCCCGATATTGGATATAATTTGCTGAGTGTTGGACAACTAA
TGGAGAGTGGGTATTCTATCTTGTTTGATGATGGTGCGTGCTTGATAAAAAATAAGCAAACAGAACGAAAGTTCAAGCATTTCAAGACAAAGGTAGAAAAGCAGAGTGAC
ATGTCCATCAAATCTCTTCGCAGTGATAGAAGTGGAGAATTTTTGTCCAACAACTTTAACCATTTTTGCAAGGAACATGGCATCCATAGGAAGTTGACAATACCTTATAC
TCCGAAGCAAAATGGGGTAGCTGAGAGGAAGAATCGAACCGTGGTGGAGATGGCAAGAATCATGTTGCAAATGAAAGGCCTTTCGAATGATTTTTGGGCTGAAGCAGTCT
CGACTTCCAGACCTACTGAACATCTCACCAACAAAGACTGTCATGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGAGCAACAATGACAATGCTATGGGTACAGCACAACCACTAATTCCAATCTTCAAAGGAGAAACTCTTCTCAAATCTCAAGACTTATGGGACTTAGTAGAACAAGGCTA
TGCAGATCCTGACGACGAAGGCAAGTTGCGGGAGAACATGAAGAAAGACTCGAAGGCATTGGTGATTACTCAACAAGCAGTCCATGATAGTTGTATTTTGCAAAAGGCAT
TTCAAGGAGATTCAAGAGTACTTGTGGTTAAATTGCAATCACTTAGACGAGACTTCGAGACCTTGATGATGAAAAATGGAGAATCAATTATGCAAACATACGGCGAGGCG
ATTAAGGATCAGACTATAGTGGAGAAAGTATTGAGAAGTTTGACTCCAAAGTTTGATCATGTTGTAGTTGCAATAGAAGAATCAAAGAATCTGTCCACTTTCACATTTAT
TGAATTAATGGGATCTCTTGAAGCACATGAGTCGAGAATCAATAGATCGATGGAAAGAAACGAAGAAAAAGCGTTTCAGGTAAAGGATATAGTTCCAAAGTATAATGACA
ATGATCGTGTGATGACTCGAGGCCGAGGAAGAGGAAGATATCGTGGTCGAGGTCGTGGTACCGGAAAAGGATTGCAATCAAGCAACAATGCTAATATTCAATGCTACCAT
TGCAAGAAGTTTGGTCATGTAAAGGCAGACTGTTTGAAGCCTGTATTCAAGGAGCTTAACGAAGGAGAAAAGTTGAAGGTAGAGCTCGGAAACAGTAAGGAGCTACAAGT
AGAAGGCAAAGGAACGGTGGGAATTGAAACTCACCATGGAAATAGAATTCTCACAAATGTTCAGTATGTGCCCGATATTGGATATAATTTGCTGAGTGTTGGACAACTAA
TGGAGAGTGGGTATTCTATCTTGTTTGATGATGGTGCGTGCTTGATAAAAAATAAGCAAACAGAACGAAAGTTCAAGCATTTCAAGACAAAGGTAGAAAAGCAGAGTGAC
ATGTCCATCAAATCTCTTCGCAGTGATAGAAGTGGAGAATTTTTGTCCAACAACTTTAACCATTTTTGCAAGGAACATGGCATCCATAGGAAGTTGACAATACCTTATAC
TCCGAAGCAAAATGGGGTAGCTGAGAGGAAGAATCGAACCGTGGTGGAGATGGCAAGAATCATGTTGCAAATGAAAGGCCTTTCGAATGATTTTTGGGCTGAAGCAGTCT
CGACTTCCAGACCTACTGAACATCTCACCAACAAAGACTGTCATGAATAA
Protein sequenceShow/hide protein sequence
MSNNDNAMGTAQPLIPIFKGETLLKSQDLWDLVEQGYADPDDEGKLRENMKKDSKALVITQQAVHDSCILQKAFQGDSRVLVVKLQSLRRDFETLMMKNGESIMQTYGEA
IKDQTIVEKVLRSLTPKFDHVVVAIEESKNLSTFTFIELMGSLEAHESRINRSMERNEEKAFQVKDIVPKYNDNDRVMTRGRGRGRYRGRGRGTGKGLQSSNNANIQCYH
CKKFGHVKADCLKPVFKELNEGEKLKVELGNSKELQVEGKGTVGIETHHGNRILTNVQYVPDIGYNLLSVGQLMESGYSILFDDGACLIKNKQTERKFKHFKTKVEKQSD
MSIKSLRSDRSGEFLSNNFNHFCKEHGIHRKLTIPYTPKQNGVAERKNRTVVEMARIMLQMKGLSNDFWAEAVSTSRPTEHLTNKDCHE