; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0018472 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0018472
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr05:12453489..12455568
RNA-Seq ExpressionPay0018472
SyntenyPay0018472
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0055915.1 copia protein [Cucumis melo var. makuwa]4.3e-17065.34Show/hide
Query:  MGTTQPLIPIFKGEGYEFWSISMKTLLRSQNLWDLVEQGYANPDDKDKLRENRKKDSKALVIIQQAVHDSVFSRIAAATTSKQA-------------VLV
        MGTTQPLIPIFKGEGYEFWSI MKTLL SQ+LWDLVEQGY +PDD+ KL+ENR+KD KALVI+QQAVHD+VFSRIAAATTSKQA             VLV
Subjt:  MGTTQPLIPIFKGEGYEFWSISMKTLLRSQNLWDLVEQGYANPDDKDKLRENRKKDSKALVIIQQAVHDSVFSRIAAATTSKQA-------------VLV

Query:  VKLQSLRRDFETLMMKNGESIVDFLSRATTIISQMQTYGETITDQTIVEKVLRT--------------------------------HESRINRSMEKNEE
        VKLQSL+RDFETLMMKNGESI DFLSRATTIISQMQTYGETITDQTIVEKVLR+                                HESRIN SMEKN+E
Subjt:  VKLQSLRRDFETLMMKNGESIVDFLSRATTIISQMQTYGETITDQTIVEKVLRT--------------------------------HESRINRSMEKNEE

Query:  KAFQVKDVVPKYNDRDRVMTRGRGRGGYRGRGRGTEKRCYRNEEQRQFRMQSSNKANIQCYHYKKFGHVKADCC--------------------------
        KAF+VKDVVPKYND D VMT+G+G GGYR RGRGT K C +NEEQRQF +QSSNKANIQCYH KKFGHVKADC                           
Subjt:  KAFQVKDVVPKYNDRDRVMTRGRGRGGYRGRGRGTEKRCYRNEEQRQFRMQSSNKANIQCYHYKKFGHVKADCC--------------------------

Query:  --------------------------LKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRILTNVQYGLDIG-------YNLLSVGQLMESGH
                                  LKPVFKELNEGEKLKVELGN KELQVEGK  +GIETH+GNRILTNVQ              N+ +      + +
Subjt:  --------------------------LKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRILTNVQYGLDIG-------YNLLSVGQLMESGH

Query:  SIMFDDE----RKSEIFEKFKHFNAKVEKQSGMFVKSLRSDRGGEFLSNNFNHFCKERGIHRELITPYTPEQNGIAERKNRTVVEMARSMLQMKGLSNDF
        +   + E     +SE FEKFKHF AKVEKQSGMF+KSLRSDRGG+FLSNNFNHFC+E GIHREL TPYT EQNG+AERKNRTVVEMARSMLQMKGLSNDF
Subjt:  SIMFDDE----RKSEIFEKFKHFNAKVEKQSGMFVKSLRSDRGGEFLSNNFNHFCKERGIHRELITPYTPEQNGIAERKNRTVVEMARSMLQMKGLSNDF

Query:  WAEAVSTSIYLLNISPTKIVMNKTPFEA
        W EAVSTSIYLLNISPTK+VMNKTPFEA
Subjt:  WAEAVSTSIYLLNISPTKIVMNKTPFEA

KAA0061308.1 UBN2 domain-containing protein [Cucumis melo var. makuwa]6.7e-13163.56Show/hide
Query:  MGTTQPLIPIFKGEGYEFWSISMKTLLRSQNLWDLVEQGYANPDDKDKLRENRKKDSKALVIIQQAVHDSVFSRIAAATTSKQA-------------VLV
        +G  QPLIPIFKGEGYEFWSI MKTLLRSQ+LWDLVEQGY + DD+ KL ENRKKDSKALVIIQQAVHDSVFSRIAAATTSKQA             V V
Subjt:  MGTTQPLIPIFKGEGYEFWSISMKTLLRSQNLWDLVEQGYANPDDKDKLRENRKKDSKALVIIQQAVHDSVFSRIAAATTSKQA-------------VLV

Query:  VKLQSLRRDFETLMMKNGESIVDFLSRATTIISQMQTYGETITDQTIVEKVLRTHESRINRSMEKNEEKAFQVKDVVPKYNDRDRVMTRGRGRGGYRGRG
        VKLQSLRRDFETLMMKN ESI +FLSRATTII QMQTYGETI DQTIVEK  R + +  N + E N                                 G
Subjt:  VKLQSLRRDFETLMMKNGESIVDFLSRATTIISQMQTYGETITDQTIVEKVLRTHESRINRSMEKNEEKAFQVKDVVPKYNDRDRVMTRGRGRGGYRGRG

Query:  RGTEKRCYRNEEQRQFRMQSSNKANIQCYHYKKFGHVKADCCLKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRILTNVQYGLDIGYNLLS
        +G  K    N    Q        A +         H+    CLKPVFKELNEGEKLKVEL NGK+LQVEGKGTVGIET+ GNRILTNVQY  DIGYNLLS
Subjt:  RGTEKRCYRNEEQRQFRMQSSNKANIQCYHYKKFGHVKADCCLKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRILTNVQYGLDIGYNLLS

Query:  VGQLMESGHSIMFDDERKSEIF--------------------------EKFKHFNAKVEKQSGMFVKSLRSDRGGEFLSNNFNHFCKERGIHRELITPYT
        V QLMESG+SI+FDD   ++ F                          EKFKHF AKVEKQSGMF+KSLRSDRGGEFLSNNFN+FCKE GI REL TPYT
Subjt:  VGQLMESGHSIMFDDERKSEIF--------------------------EKFKHFNAKVEKQSGMFVKSLRSDRGGEFLSNNFNHFCKERGIHRELITPYT

Query:  PEQNGIAERKNRTVVEMARSMLQMKGLSNDFWAEAVSTSIYLLNISPTKI
        PEQNG+AERKNRTVVEMARSMLQMK L NDF AEAVSTSIYLLNISP ++
Subjt:  PEQNGIAERKNRTVVEMARSMLQMKGLSNDFWAEAVSTSIYLLNISPTKI

KAE8650579.1 hypothetical protein Csa_010963 [Cucumis sativus]7.4e-13071.39Show/hide
Query:  MGTTQPLIPIFKGEGYEFWSISMKTLLRSQNLWDLVEQGYANPDDKDKLRENRKKDSKALVIIQQAVHDSVFSRIAAATTSKQA-------------VLV
        MGTTQPLI IFKGEGYEFWS+ MKTLLRSQ+LWDLVE  YA+PDD+ KLRE R+KDSKALVIIQQAVHDS FSRI   TTSK+A             VLV
Subjt:  MGTTQPLIPIFKGEGYEFWSISMKTLLRSQNLWDLVEQGYANPDDKDKLRENRKKDSKALVIIQQAVHDSVFSRIAAATTSKQA-------------VLV

Query:  VKLQSLRRDFETLMMKNGESIVDFLSRATTIISQMQTYGETITDQTIVEKVLRT--------------------------------HESRINRSMEKNEE
        VKLQSLR++FETLMMKN ESI +FLSRATTIISQMQTYGETITDQTIVEKVLR+                                HESRINRSME+NEE
Subjt:  VKLQSLRRDFETLMMKNGESIVDFLSRATTIISQMQTYGETITDQTIVEKVLRT--------------------------------HESRINRSMEKNEE

Query:  KAFQVKDVVPKYNDRDRVMTRGRGRGGYRGRGRGTEKRCYRNEEQRQFRMQSSNKANIQCYHYKKFGHVKADCC--------------------LKPVFK
        KAFQVKDVVPKYN+ DRVMTRGRGRGGYRG+GRGTEK C +NEE+ QFR+QSSNKANIQCYH KKFGHVKADCC                    LKP+F 
Subjt:  KAFQVKDVVPKYNDRDRVMTRGRGRGGYRGRGRGTEKRCYRNEEQRQFRMQSSNKANIQCYHYKKFGHVKADCC--------------------LKPVFK

Query:  ELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRILTNVQYGLDIGYNLLSVGQLMESGHSIMFDD
        ELNEGEKLKVELGN KELQVE KGTVGIETHHGNRILTNVQY  DIGYNLLSVGQLMESGHSI+FDD
Subjt:  ELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRILTNVQYGLDIGYNLLSVGQLMESGHSIMFDD

TYK27735.1 putative gag-pol polyprotein, identical [Cucumis melo var. makuwa]9.6e-16255.43Show/hide
Query:  ISMKTLLRSQNLWDLVEQGYANPDDKDKLRENRKKDSKALVIIQQAVHDSVFSRIAAATTSKQA-------------VLVVKLQSLRRDFETLMMKNGES
        + +KTLLRSQ+LWDLVEQGY +PDD+ KLRENRKKDSKALVIIQQAVHDSVFSRIA ATTSKQA             VL+VKLQSLRRDFETLMMKNGES
Subjt:  ISMKTLLRSQNLWDLVEQGYANPDDKDKLRENRKKDSKALVIIQQAVHDSVFSRIAAATTSKQA-------------VLVVKLQSLRRDFETLMMKNGES

Query:  IVDFLSRATTIISQMQTYGETITDQTIVEKVLRT--------------------------------HESRINRSMEKNEEKAFQVKDVVPKYNDRDRVMT
        I DFLSRATTIISQMQTYGETI DQTIVEKVLR+                                HESRINRSME+NEEKAFQVKD VPKYND DRVMT
Subjt:  IVDFLSRATTIISQMQTYGETITDQTIVEKVLRT--------------------------------HESRINRSMEKNEEKAFQVKDVVPKYNDRDRVMT

Query:  RGRGRGGYRGRGRGTEKRCYRNEEQRQFRMQSSNKANIQCYHYKKFGHVKADC----------------------------------------------C
        RGRGRGGYRGRG GTEK C RNE QRQF +QSSNKANIQCYH KKFGHVKADC                                              C
Subjt:  RGRGRGGYRGRGRGTEKRCYRNEEQRQFRMQSSNKANIQCYHYKKFGHVKADC----------------------------------------------C

Query:  ------LKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRILTNVQYGLDIGYNLLSVGQLMESGHSIMFDD---------------------
              LKPVFKELNEGEKLKV+L NGKELQVEGKGTV IETHHGNRILTNVQY  DIGYNLLSVGQLMESG+SI+FDD                     
Subjt:  ------LKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRILTNVQYGLDIGYNLLSVGQLMESGHSIMFDD---------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------ERKSEIFEKFKHFNAKVEKQSGMFVKSLRSDRGGEFLSNNFNHFCKERGIHRELITPYTPEQNGIAERKNRTV
                                   E KSE FEKFKHF AKVEKQSGMF+KSLRSDRG EFLSNNFNHFCKE GIHREL TPYTPEQNG+AERKN+TV
Subjt:  ---------------------------ERKSEIFEKFKHFNAKVEKQSGMFVKSLRSDRGGEFLSNNFNHFCKERGIHRELITPYTPEQNGIAERKNRTV

Query:  VEMARSMLQMKGLSNDFWAEAVSTSIYLLNISPTKIVMNKTPFE
        VEMARSMLQMKGL NDFWAEAVS SIYLLNISPTK VMNKTPFE
Subjt:  VEMARSMLQMKGLSNDFWAEAVSTSIYLLNISPTKIVMNKTPFE

TYK28117.1 putative gag-pol polyprotein, identical [Cucumis melo var. makuwa]6.0e-17260.37Show/hide
Query:  MGTTQPLIPIFKGEGYEFWSISMKTLLRSQNLWDLVEQGYANPDDKDKLRENRKKDSKALVIIQQAVHDSVFSRIAAATTSKQAVLVVKLQSLRRDFETL
        MGT QPLIPIFKGEGYEFWSI MKTLL SQ+LWDLVEQGY +PDD+ KL+ENR+KDSKALVIIQQAVHD+VFSRIAAATT              RDFETL
Subjt:  MGTTQPLIPIFKGEGYEFWSISMKTLLRSQNLWDLVEQGYANPDDKDKLRENRKKDSKALVIIQQAVHDSVFSRIAAATTSKQAVLVVKLQSLRRDFETL

Query:  MMKNGESIVDFLSRATTIISQMQTYGETITDQTIVEKVLRT--------------------------------HESRINRSMEKNEEKAFQVKDVVPKYN
        MMKNGESI DFLSRATTIISQMQTYGETITDQTIVEKVLR+                                HESRIN SMEKNEEKAF+VKDVVPKYN
Subjt:  MMKNGESIVDFLSRATTIISQMQTYGETITDQTIVEKVLRT--------------------------------HESRINRSMEKNEEKAFQVKDVVPKYN

Query:  DRDRVMTRGRGRGGYRGRGRGTEKRCYRNEEQRQFRMQSSNKANIQCYHYKKFGHVKADCC---------------------------------------
        D D VMT+G+G GGYR RGRGT K C +NEEQRQF +QSSNKANIQCYH KKFGHVKADC                                        
Subjt:  DRDRVMTRGRGRGGYRGRGRGTEKRCYRNEEQRQFRMQSSNKANIQCYHYKKFGHVKADCC---------------------------------------

Query:  -------------LKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRILTNVQYGLDIGYNLLSVGQLMESGHSIMFDD--------------
                     LKPVFKELNEGEKLKVELGNGKELQVEGK T+GIETH+GNRILTNVQY  DIGYNLLSVGQLMESGHSI+FDD              
Subjt:  -------------LKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRILTNVQYGLDIGYNLLSVGQLMESGHSIMFDD--------------

Query:  ----------------------------------------------------------------------ERKSEIFEKFKHFNAKVEKQSGMFVKSLRS
                                                                              + +SE FEKFKHF AKVEKQSGMF+KS RS
Subjt:  ----------------------------------------------------------------------ERKSEIFEKFKHFNAKVEKQSGMFVKSLRS

Query:  DRGGEFLSNNFNHFCKERGIHRELITPYTPEQNGIAERKNRTVVEMARSMLQMKGLSNDFWAEAVSTSIYLLNISPTKIVMNKTPFEA
        DRGG+FLSNNFNHFC+E GIHREL TPYT EQNG+AERKNRTVVEMARSMLQMKGLSNDFW EA STSIYLLNISPTK+VMNKTPFEA
Subjt:  DRGGEFLSNNFNHFCKERGIHRELITPYTPEQNGIAERKNRTVVEMARSMLQMKGLSNDFWAEAVSTSIYLLNISPTKIVMNKTPFEA

TrEMBL top hitse value%identityAlignment
A0A0V0IV83 Putative ovule protein (Fragment)8.9e-13744.07Show/hide
Query:  MGTTQPLIPIFKGEGYEFWSISMKTLLRSQNLWDLVEQGYANPDDKDKLRENRKKDSKALVIIQQAVHDSVFSRIAAATTSKQA-------------VLV
        +   QPLIP+FKGE YEFWSI MKT+L+SQ+LWDLVE+GY +PD++++LR+N+KKD+KALV IQQAVHDS+FSRIA ATTSKQA             V+V
Subjt:  MGTTQPLIPIFKGEGYEFWSISMKTLLRSQNLWDLVEQGYANPDDKDKLRENRKKDSKALVIIQQAVHDSVFSRIAAATTSKQA-------------VLV

Query:  VKLQSLRRDFETLMMKNGESIVDFLSRATTIISQMQTYGETITDQTIVEKVLRT--------------------------------HESRINRSMEKNEE
        V+LQSLRRDFETLMMK+GESI  FLSRA TI+SQ+++YGE +TDQ IVEKVLR+                                HE+R NRS+EKNEE
Subjt:  VKLQSLRRDFETLMMKNGESIVDFLSRATTIISQMQTYGETITDQTIVEKVLRT--------------------------------HESRINRSMEKNEE

Query:  KAFQVKDVVPKYNDRDRVMTRGRGRGGYR-GRGRGTEKRCYRNEEQRQFRMQSSNKANIQCYHYKKFGHVKADC-----------------------CL-
        KAFQVKD   KY D +   +RGRGRGG+R GRGRG  +   RN   RQ   Q + K  +QC+H  ++GH+KADC                       C+ 
Subjt:  KAFQVKDVVPKYNDRDRVMTRGRGRGGYR-GRGRGTEKRCYRNEEQRQFRMQSSNKANIQCYHYKKFGHVKADC-----------------------CL-

Query:  ----------------------KPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGN-RILTNVQYGLDIGYNLLSVGQLMESGHSIMFDD-----
                              K +F++L+E +K KV+LGN KE+QVEGKG V ++T H   ++L +VQ+  D+G+NLLSVGQLM  G+S++FDD     
Subjt:  ----------------------KPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGN-RILTNVQYGLDIGYNLLSVGQLMESGHSIMFDD-----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------ERKSEIFEKFKHFNAKVEKQSGMFVKSLRSDRGGEFLSNNFNHFCKERGIHRELITPYTPEQ
                                              E KSE FEKF+ F A VE QS   +K LR DRGGEF+SN FN FC+  GIHREL TPYTPEQ
Subjt:  --------------------------------------ERKSEIFEKFKHFNAKVEKQSGMFVKSLRSDRGGEFLSNNFNHFCKERGIHRELITPYTPEQ

Query:  NGIAERKNRTVVEMARSMLQMKGLSNDFWAEAVSTSIYLLNISPTKIVMNKTPFEAWYGKKPNVSHLRVFGCISYALVPSEVR
        NG+AERKNRTVVEMARSMLQ K L+N FWAEAV+ SIYLLN+SPTK+VMNKTP+EAW+ +KPNVSHLRVFGC++YALV S+ R
Subjt:  NGIAERKNRTVVEMARSMLQMKGLSNDFWAEAVSTSIYLLNISPTKIVMNKTPFEAWYGKKPNVSHLRVFGCISYALVPSEVR

A0A5A7UQM0 Copia protein2.1e-17065.34Show/hide
Query:  MGTTQPLIPIFKGEGYEFWSISMKTLLRSQNLWDLVEQGYANPDDKDKLRENRKKDSKALVIIQQAVHDSVFSRIAAATTSKQA-------------VLV
        MGTTQPLIPIFKGEGYEFWSI MKTLL SQ+LWDLVEQGY +PDD+ KL+ENR+KD KALVI+QQAVHD+VFSRIAAATTSKQA             VLV
Subjt:  MGTTQPLIPIFKGEGYEFWSISMKTLLRSQNLWDLVEQGYANPDDKDKLRENRKKDSKALVIIQQAVHDSVFSRIAAATTSKQA-------------VLV

Query:  VKLQSLRRDFETLMMKNGESIVDFLSRATTIISQMQTYGETITDQTIVEKVLRT--------------------------------HESRINRSMEKNEE
        VKLQSL+RDFETLMMKNGESI DFLSRATTIISQMQTYGETITDQTIVEKVLR+                                HESRIN SMEKN+E
Subjt:  VKLQSLRRDFETLMMKNGESIVDFLSRATTIISQMQTYGETITDQTIVEKVLRT--------------------------------HESRINRSMEKNEE

Query:  KAFQVKDVVPKYNDRDRVMTRGRGRGGYRGRGRGTEKRCYRNEEQRQFRMQSSNKANIQCYHYKKFGHVKADCC--------------------------
        KAF+VKDVVPKYND D VMT+G+G GGYR RGRGT K C +NEEQRQF +QSSNKANIQCYH KKFGHVKADC                           
Subjt:  KAFQVKDVVPKYNDRDRVMTRGRGRGGYRGRGRGTEKRCYRNEEQRQFRMQSSNKANIQCYHYKKFGHVKADCC--------------------------

Query:  --------------------------LKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRILTNVQYGLDIG-------YNLLSVGQLMESGH
                                  LKPVFKELNEGEKLKVELGN KELQVEGK  +GIETH+GNRILTNVQ              N+ +      + +
Subjt:  --------------------------LKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRILTNVQYGLDIG-------YNLLSVGQLMESGH

Query:  SIMFDDE----RKSEIFEKFKHFNAKVEKQSGMFVKSLRSDRGGEFLSNNFNHFCKERGIHRELITPYTPEQNGIAERKNRTVVEMARSMLQMKGLSNDF
        +   + E     +SE FEKFKHF AKVEKQSGMF+KSLRSDRGG+FLSNNFNHFC+E GIHREL TPYT EQNG+AERKNRTVVEMARSMLQMKGLSNDF
Subjt:  SIMFDDE----RKSEIFEKFKHFNAKVEKQSGMFVKSLRSDRGGEFLSNNFNHFCKERGIHRELITPYTPEQNGIAERKNRTVVEMARSMLQMKGLSNDF

Query:  WAEAVSTSIYLLNISPTKIVMNKTPFEA
        W EAVSTSIYLLNISPTK+VMNKTPFEA
Subjt:  WAEAVSTSIYLLNISPTKIVMNKTPFEA

A0A5A7V170 UBN2 domain-containing protein3.3e-13163.56Show/hide
Query:  MGTTQPLIPIFKGEGYEFWSISMKTLLRSQNLWDLVEQGYANPDDKDKLRENRKKDSKALVIIQQAVHDSVFSRIAAATTSKQA-------------VLV
        +G  QPLIPIFKGEGYEFWSI MKTLLRSQ+LWDLVEQGY + DD+ KL ENRKKDSKALVIIQQAVHDSVFSRIAAATTSKQA             V V
Subjt:  MGTTQPLIPIFKGEGYEFWSISMKTLLRSQNLWDLVEQGYANPDDKDKLRENRKKDSKALVIIQQAVHDSVFSRIAAATTSKQA-------------VLV

Query:  VKLQSLRRDFETLMMKNGESIVDFLSRATTIISQMQTYGETITDQTIVEKVLRTHESRINRSMEKNEEKAFQVKDVVPKYNDRDRVMTRGRGRGGYRGRG
        VKLQSLRRDFETLMMKN ESI +FLSRATTII QMQTYGETI DQTIVEK  R + +  N + E N                                 G
Subjt:  VKLQSLRRDFETLMMKNGESIVDFLSRATTIISQMQTYGETITDQTIVEKVLRTHESRINRSMEKNEEKAFQVKDVVPKYNDRDRVMTRGRGRGGYRGRG

Query:  RGTEKRCYRNEEQRQFRMQSSNKANIQCYHYKKFGHVKADCCLKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRILTNVQYGLDIGYNLLS
        +G  K    N    Q        A +         H+    CLKPVFKELNEGEKLKVEL NGK+LQVEGKGTVGIET+ GNRILTNVQY  DIGYNLLS
Subjt:  RGTEKRCYRNEEQRQFRMQSSNKANIQCYHYKKFGHVKADCCLKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRILTNVQYGLDIGYNLLS

Query:  VGQLMESGHSIMFDDERKSEIF--------------------------EKFKHFNAKVEKQSGMFVKSLRSDRGGEFLSNNFNHFCKERGIHRELITPYT
        V QLMESG+SI+FDD   ++ F                          EKFKHF AKVEKQSGMF+KSLRSDRGGEFLSNNFN+FCKE GI REL TPYT
Subjt:  VGQLMESGHSIMFDDERKSEIF--------------------------EKFKHFNAKVEKQSGMFVKSLRSDRGGEFLSNNFNHFCKERGIHRELITPYT

Query:  PEQNGIAERKNRTVVEMARSMLQMKGLSNDFWAEAVSTSIYLLNISPTKI
        PEQNG+AERKNRTVVEMARSMLQMK L NDF AEAVSTSIYLLNISP ++
Subjt:  PEQNGIAERKNRTVVEMARSMLQMKGLSNDFWAEAVSTSIYLLNISPTKI

A0A5D3DWC7 Putative gag-pol polyprotein, identical2.9e-17260.37Show/hide
Query:  MGTTQPLIPIFKGEGYEFWSISMKTLLRSQNLWDLVEQGYANPDDKDKLRENRKKDSKALVIIQQAVHDSVFSRIAAATTSKQAVLVVKLQSLRRDFETL
        MGT QPLIPIFKGEGYEFWSI MKTLL SQ+LWDLVEQGY +PDD+ KL+ENR+KDSKALVIIQQAVHD+VFSRIAAATT              RDFETL
Subjt:  MGTTQPLIPIFKGEGYEFWSISMKTLLRSQNLWDLVEQGYANPDDKDKLRENRKKDSKALVIIQQAVHDSVFSRIAAATTSKQAVLVVKLQSLRRDFETL

Query:  MMKNGESIVDFLSRATTIISQMQTYGETITDQTIVEKVLRT--------------------------------HESRINRSMEKNEEKAFQVKDVVPKYN
        MMKNGESI DFLSRATTIISQMQTYGETITDQTIVEKVLR+                                HESRIN SMEKNEEKAF+VKDVVPKYN
Subjt:  MMKNGESIVDFLSRATTIISQMQTYGETITDQTIVEKVLRT--------------------------------HESRINRSMEKNEEKAFQVKDVVPKYN

Query:  DRDRVMTRGRGRGGYRGRGRGTEKRCYRNEEQRQFRMQSSNKANIQCYHYKKFGHVKADCC---------------------------------------
        D D VMT+G+G GGYR RGRGT K C +NEEQRQF +QSSNKANIQCYH KKFGHVKADC                                        
Subjt:  DRDRVMTRGRGRGGYRGRGRGTEKRCYRNEEQRQFRMQSSNKANIQCYHYKKFGHVKADCC---------------------------------------

Query:  -------------LKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRILTNVQYGLDIGYNLLSVGQLMESGHSIMFDD--------------
                     LKPVFKELNEGEKLKVELGNGKELQVEGK T+GIETH+GNRILTNVQY  DIGYNLLSVGQLMESGHSI+FDD              
Subjt:  -------------LKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRILTNVQYGLDIGYNLLSVGQLMESGHSIMFDD--------------

Query:  ----------------------------------------------------------------------ERKSEIFEKFKHFNAKVEKQSGMFVKSLRS
                                                                              + +SE FEKFKHF AKVEKQSGMF+KS RS
Subjt:  ----------------------------------------------------------------------ERKSEIFEKFKHFNAKVEKQSGMFVKSLRS

Query:  DRGGEFLSNNFNHFCKERGIHRELITPYTPEQNGIAERKNRTVVEMARSMLQMKGLSNDFWAEAVSTSIYLLNISPTKIVMNKTPFEA
        DRGG+FLSNNFNHFC+E GIHREL TPYT EQNG+AERKNRTVVEMARSMLQMKGLSNDFW EA STSIYLLNISPTK+VMNKTPFEA
Subjt:  DRGGEFLSNNFNHFCKERGIHRELITPYTPEQNGIAERKNRTVVEMARSMLQMKGLSNDFWAEAVSTSIYLLNISPTKIVMNKTPFEA

A0A5D3DWP2 Putative gag-pol polyprotein, identical4.7e-16255.43Show/hide
Query:  ISMKTLLRSQNLWDLVEQGYANPDDKDKLRENRKKDSKALVIIQQAVHDSVFSRIAAATTSKQA-------------VLVVKLQSLRRDFETLMMKNGES
        + +KTLLRSQ+LWDLVEQGY +PDD+ KLRENRKKDSKALVIIQQAVHDSVFSRIA ATTSKQA             VL+VKLQSLRRDFETLMMKNGES
Subjt:  ISMKTLLRSQNLWDLVEQGYANPDDKDKLRENRKKDSKALVIIQQAVHDSVFSRIAAATTSKQA-------------VLVVKLQSLRRDFETLMMKNGES

Query:  IVDFLSRATTIISQMQTYGETITDQTIVEKVLRT--------------------------------HESRINRSMEKNEEKAFQVKDVVPKYNDRDRVMT
        I DFLSRATTIISQMQTYGETI DQTIVEKVLR+                                HESRINRSME+NEEKAFQVKD VPKYND DRVMT
Subjt:  IVDFLSRATTIISQMQTYGETITDQTIVEKVLRT--------------------------------HESRINRSMEKNEEKAFQVKDVVPKYNDRDRVMT

Query:  RGRGRGGYRGRGRGTEKRCYRNEEQRQFRMQSSNKANIQCYHYKKFGHVKADC----------------------------------------------C
        RGRGRGGYRGRG GTEK C RNE QRQF +QSSNKANIQCYH KKFGHVKADC                                              C
Subjt:  RGRGRGGYRGRGRGTEKRCYRNEEQRQFRMQSSNKANIQCYHYKKFGHVKADC----------------------------------------------C

Query:  ------LKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRILTNVQYGLDIGYNLLSVGQLMESGHSIMFDD---------------------
              LKPVFKELNEGEKLKV+L NGKELQVEGKGTV IETHHGNRILTNVQY  DIGYNLLSVGQLMESG+SI+FDD                     
Subjt:  ------LKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRILTNVQYGLDIGYNLLSVGQLMESGHSIMFDD---------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------ERKSEIFEKFKHFNAKVEKQSGMFVKSLRSDRGGEFLSNNFNHFCKERGIHRELITPYTPEQNGIAERKNRTV
                                   E KSE FEKFKHF AKVEKQSGMF+KSLRSDRG EFLSNNFNHFCKE GIHREL TPYTPEQNG+AERKN+TV
Subjt:  ---------------------------ERKSEIFEKFKHFNAKVEKQSGMFVKSLRSDRGGEFLSNNFNHFCKERGIHRELITPYTPEQNGIAERKNRTV

Query:  VEMARSMLQMKGLSNDFWAEAVSTSIYLLNISPTKIVMNKTPFE
        VEMARSMLQMKGL NDFWAEAVS SIYLLNISPTK VMNKTPFE
Subjt:  VEMARSMLQMKGLSNDFWAEAVSTSIYLLNISPTKIVMNKTPFE

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.7e-2631.32Show/hide
Query:  FRMQSSNKANIQCYHYKKFGHVKADCCLKPVFKELNEGEKL--KVELG--------NGKELQVEGKGTVGIETHHGNRILTNVQYGL----------DIG
        + + + +K N + +H ++FGH+     L+   K +   + L   +EL         NGK+ ++  K     +  H  R L  V   +          D  
Subjt:  FRMQSSNKANIQCYHYKKFGHVKADCCLKPVFKELNEGEKL--KVELG--------NGKELQVEGKGTVGIETHHGNRILTNVQYGL----------DIG

Query:  YNLLSVGQLMESGHSIMFDDERKSEIFEKFKHFNAKVEKQSGMFVKSLRSDRGGEFLSNNFNHFCKERGIHRELITPYTPEQNGIAERKNRTVVEMARSM
        Y ++ V Q   + + + +  + KS++F  F+ F AK E    + V  L  D G E+LSN    FC ++GI   L  P+TP+ NG++ER  RT+ E AR+M
Subjt:  YNLLSVGQLMESGHSIMFDDERKSEIFEKFKHFNAKVEKQSGMFVKSLRSDRGGEFLSNNFNHFCKERGIHRELITPYTPEQNGIAERKNRTVVEMARSM

Query:  LQMKGLSNDFWAEAVSTSIYLLNISPTKIVM--NKTPFEAWYGKKPNVSHLRVFGCISYALVPSE
        +    L   FW EAV T+ YL+N  P++ ++  +KTP+E W+ KKP + HLRVFG   Y  + ++
Subjt:  LQMKGLSNDFWAEAVSTSIYLLNISPTKIVM--NKTPFEAWYGKKPNVSHLRVFGCISYALVPSE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.5e-3246.85Show/hide
Query:  KSEIFEKFKHFNAKVEKQSGMFVKSLRSDRGGEFLSNNFNHFCKERGIHRELITPYTPEQNGIAERKNRTVVEMARSMLQMKGLSNDFWAEAVSTSIYLL
        K ++F+ F+ F+A VE+++G  +K LRSD GGE+ S  F  +C   GI  E   P TP+ NG+AER NRT+VE  RSML+M  L   FW EAV T+ YL+
Subjt:  KSEIFEKFKHFNAKVEKQSGMFVKSLRSDRGGEFLSNNFNHFCKERGIHRELITPYTPEQNGIAERKNRTVVEMARSMLQMKGLSNDFWAEAVSTSIYLL

Query:  NISPTKIVMNKTPFEAWYGKKPNVSHLRVFGCISYALVPSEVR
        N SP+  +  + P   W  K+ + SHL+VFGC ++A VP E R
Subjt:  NISPTKIVMNKTPFEAWYGKKPNVSHLRVFGCISYALVPSEVR

P92512 Uncharacterized mitochondrial protein AtMg007102.0e-0839.71Show/hide
Query:  NRTVVEMARSMLQMKGLSNDFWAEAVSTSIYLLNISPTKIVMNKTPFEAWYGKKPNVSHLRVFGCISY
        NRT++E  RSML   GL   F A+A +T+++++N  P+  +    P E W+   P  S+LR FGC++Y
Subjt:  NRTVVEMARSMLQMKGLSNDFWAEAVSTSIYLLNISPTKIVMNKTPFEAWYGKKPNVSHLRVFGCISY

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.1e-1833.58Show/hide
Query:  ERKSEIFEKFKHFNAKVEKQSGMFVKSLRSDRGGEFLSNNFNHFCKERGIHRELITPYTPEQNGIAERKNRTVVEMARSMLQMKGLSNDFWAEAVSTSIY
        ++KS++ E F  F   +E +    + +  SD GGEF++     +  + GI      P+TPE NG++ERK+R +VE   ++L    +   +W  A + ++Y
Subjt:  ERKSEIFEKFKHFNAKVEKQSGMFVKSLRSDRGGEFLSNNFNHFCKERGIHRELITPYTPEQNGIAERKNRTVVEMARSMLQMKGLSNDFWAEAVSTSIY

Query:  LLNISPTKIVMNKTPFEAWYGKKPNVSHLRVFGCISY
        L+N  PT ++  ++PF+  +G  PN   LRVFGC  Y
Subjt:  LLNISPTKIVMNKTPFEAWYGKKPNVSHLRVFGCISY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.9e-1935.04Show/hide
Query:  ERKSEIFEKFKHFNAKVEKQSGMFVKSLRSDRGGEFLSNNFNHFCKERGIHRELITPYTPEQNGIAERKNRTVVEMARSMLQMKGLSNDFWAEAVSTSIY
        ++KS++ + F  F + VE +    + +L SD GGEF+      +  + GI      P+TPE NG++ERK+R +VEM  ++L    +   +W  A S ++Y
Subjt:  ERKSEIFEKFKHFNAKVEKQSGMFVKSLRSDRGGEFLSNNFNHFCKERGIHRELITPYTPEQNGIAERKNRTVVEMARSMLQMKGLSNDFWAEAVSTSIY

Query:  LLNISPTKIVMNKTPFEAWYGKKPNVSHLRVFGCISY
        L+N  PT ++  ++PF+  +G+ PN   L+VFGC  Y
Subjt:  LLNISPTKIVMNKTPFEAWYGKKPNVSHLRVFGCISY

Arabidopsis top hitse value%identityAlignment
AT1G48720.1 unknown protein4.4e-1132.56Show/hide
Query:  IPIFKGEGYEFWSISMKTLLRSQNLWDLVEQGYANPDD--------KDKLRENRKKDSKALVIIQQAVHDSVFSRIAAATTSKQAV
        +P+     Y+ WS+ MK +L + ++W++VE+G+  P++        KD LR++RK+D KAL +I Q + +  F ++  AT++K  +
Subjt:  IPIFKGEGYEFWSISMKTLLRSQNLWDLVEQGYANPDD--------KDKLRENRKKDSKALVIIQQAVHDSVFSRIAAATTSKQAV

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.4e-0939.71Show/hide
Query:  NRTVVEMARSMLQMKGLSNDFWAEAVSTSIYLLNISPTKIVMNKTPFEAWYGKKPNVSHLRVFGCISY
        NRT++E  RSML   GL   F A+A +T+++++N  P+  +    P E W+   P  S+LR FGC++Y
Subjt:  NRTVVEMARSMLQMKGLSNDFWAEAVSTSIYLLNISPTKIVMNKTPFEAWYGKKPNVSHLRVFGCISY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTACAACACAACCACTTATTCCAATCTTCAAAGGAGAAGGCTACGAATTTTGGAGTATTAGTATGAAGACTCTTCTCAGATCTCAAAATTTATGGGACTTAGTAGA
ACAAGGCTATGCAAATCCTGACGACAAAGACAAGTTGCGGGAGAACAGGAAGAAAGATTCAAAGGCGTTGGTGATTATTCAACAAGCAGTCCATGACAGTGTTTTTTCGC
GAATTGCTGCAGCAACAACGTCAAAACAAGCAGTACTTGTGGTTAAATTGCAATCACTTAGACGAGACTTTGAGACCTTGATGATGAAAAATGGAGAATCAATTGTTGAT
TTTTTATCACGGGCGACGACAATTATTAGTCAGATGCAAACATACGGCGAGACGATTACGGATCAAACTATAGTGGAGAAAGTATTGAGAACACATGAATCGAGAATCAA
TAGATCGATGGAAAAAAACGAAGAAAAAGCGTTTCAGGTAAAGGATGTAGTTCCAAAGTATAATGACCGTGATCGTGTGATGACTCGAGGCCGAGGAAGAGGAGGATATC
GTGGTCGAGGTCGTGGTACCGAAAAAAGATGTTATCGAAATGAAGAACAAAGGCAATTCAGAATGCAATCAAGCAACAAAGCTAATATTCAATGCTACCATTACAAGAAG
TTTGGTCATGTAAAGGCAGATTGTTGTTTGAAGCCTGTATTCAAGGAGCTTAACGAAGGAGAAAAGTTGAAGGTGGAGCTTGGAAACGGCAAGGAGCTACAAGTAGAAGG
CAAAGGAACGGTGGGAATTGAAACTCACCATGGAAATAGAATTCTCACAAATGTTCAGTATGGGCTCGATATTGGATATAATTTGTTGAGTGTTGGACAACTAATGGAGA
GTGGGCATTCTATCATGTTTGATGATGAAAGAAAATCAGAAATATTTGAGAAGTTCAAGCATTTCAACGCAAAGGTAGAAAAGCAGAGTGGCATGTTCGTTAAATCTCTT
CGTAGTGATAGAGGTGGAGAATTTTTGTCCAACAACTTCAACCATTTTTGCAAGGAACGTGGCATCCATAGGGAGTTGATAACACCTTATACTCCAGAGCAAAATGGGAT
AGCCGAGAGGAAGAATCGAACTGTGGTGGAGATGGCAAGAAGCATGTTGCAAATGAAAGGCCTTTCGAATGATTTTTGGGCTGAAGCAGTCTCGACTTCCATCTACCTAC
TGAACATCTCACCAACAAAGATTGTCATGAATAAAACTCCATTTGAAGCTTGGTATGGCAAAAAACCCAATGTAAGTCATTTAAGAGTTTTTGGTTGTATTTCTTATGCT
TTGGTACCTTCTGAAGTTCGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTACAACACAACCACTTATTCCAATCTTCAAAGGAGAAGGCTACGAATTTTGGAGTATTAGTATGAAGACTCTTCTCAGATCTCAAAATTTATGGGACTTAGTAGA
ACAAGGCTATGCAAATCCTGACGACAAAGACAAGTTGCGGGAGAACAGGAAGAAAGATTCAAAGGCGTTGGTGATTATTCAACAAGCAGTCCATGACAGTGTTTTTTCGC
GAATTGCTGCAGCAACAACGTCAAAACAAGCAGTACTTGTGGTTAAATTGCAATCACTTAGACGAGACTTTGAGACCTTGATGATGAAAAATGGAGAATCAATTGTTGAT
TTTTTATCACGGGCGACGACAATTATTAGTCAGATGCAAACATACGGCGAGACGATTACGGATCAAACTATAGTGGAGAAAGTATTGAGAACACATGAATCGAGAATCAA
TAGATCGATGGAAAAAAACGAAGAAAAAGCGTTTCAGGTAAAGGATGTAGTTCCAAAGTATAATGACCGTGATCGTGTGATGACTCGAGGCCGAGGAAGAGGAGGATATC
GTGGTCGAGGTCGTGGTACCGAAAAAAGATGTTATCGAAATGAAGAACAAAGGCAATTCAGAATGCAATCAAGCAACAAAGCTAATATTCAATGCTACCATTACAAGAAG
TTTGGTCATGTAAAGGCAGATTGTTGTTTGAAGCCTGTATTCAAGGAGCTTAACGAAGGAGAAAAGTTGAAGGTGGAGCTTGGAAACGGCAAGGAGCTACAAGTAGAAGG
CAAAGGAACGGTGGGAATTGAAACTCACCATGGAAATAGAATTCTCACAAATGTTCAGTATGGGCTCGATATTGGATATAATTTGTTGAGTGTTGGACAACTAATGGAGA
GTGGGCATTCTATCATGTTTGATGATGAAAGAAAATCAGAAATATTTGAGAAGTTCAAGCATTTCAACGCAAAGGTAGAAAAGCAGAGTGGCATGTTCGTTAAATCTCTT
CGTAGTGATAGAGGTGGAGAATTTTTGTCCAACAACTTCAACCATTTTTGCAAGGAACGTGGCATCCATAGGGAGTTGATAACACCTTATACTCCAGAGCAAAATGGGAT
AGCCGAGAGGAAGAATCGAACTGTGGTGGAGATGGCAAGAAGCATGTTGCAAATGAAAGGCCTTTCGAATGATTTTTGGGCTGAAGCAGTCTCGACTTCCATCTACCTAC
TGAACATCTCACCAACAAAGATTGTCATGAATAAAACTCCATTTGAAGCTTGGTATGGCAAAAAACCCAATGTAAGTCATTTAAGAGTTTTTGGTTGTATTTCTTATGCT
TTGGTACCTTCTGAAGTTCGTTAA
Protein sequenceShow/hide protein sequence
MGTTQPLIPIFKGEGYEFWSISMKTLLRSQNLWDLVEQGYANPDDKDKLRENRKKDSKALVIIQQAVHDSVFSRIAAATTSKQAVLVVKLQSLRRDFETLMMKNGESIVD
FLSRATTIISQMQTYGETITDQTIVEKVLRTHESRINRSMEKNEEKAFQVKDVVPKYNDRDRVMTRGRGRGGYRGRGRGTEKRCYRNEEQRQFRMQSSNKANIQCYHYKK
FGHVKADCCLKPVFKELNEGEKLKVELGNGKELQVEGKGTVGIETHHGNRILTNVQYGLDIGYNLLSVGQLMESGHSIMFDDERKSEIFEKFKHFNAKVEKQSGMFVKSL
RSDRGGEFLSNNFNHFCKERGIHRELITPYTPEQNGIAERKNRTVVEMARSMLQMKGLSNDFWAEAVSTSIYLLNISPTKIVMNKTPFEAWYGKKPNVSHLRVFGCISYA
LVPSEVR