; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc01g0014181 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc01g0014181
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-pol polyprotein
Genome locationCMiso1.1chr01:11080736..11082595
RNA-Seq ExpressionCmc01g0014181
SyntenyCmc01g0014181
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048721.1 gag-pol polyprotein [Cucumis melo var. makuwa]0.0e+0096.1Show/hide
Query:  MTGNADFFSELSECKAGSVVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWD
        MTGNADFFSELSECKAGSVVF DGGKGKIIGKGTINRPGLPFLLDVRLVQGLSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWD
Subjt:  MTGNADFFSELSECKAGSVVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWD

Query:  AEVTLCNLSKVEEAGLWHKRLGHLGGATISKVIKANAIIGLPPLSFSSLESCSECPAGKQVKSVYKPVNISSTSHILELLHIDLMGPMQTESLGRKQYAV
        AEVTLCNLSKVEEAGLWHKRLGHLGGATISKVIKANAIIGLPPLSFSSLESCSECPAGKQVKSV+KPVNIS TSHILELLHIDLM PMQTESLGRK+YAV
Subjt:  AEVTLCNLSKVEEAGLWHKRLGHLGGATISKVIKANAIIGLPPLSFSSLESCSECPAGKQVKSVYKPVNISSTSHILELLHIDLMGPMQTESLGRKQYAV

Query:  VCVDDFSRYTWIKFILDKLETFKTCQTLVTQLQREKNTGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHAKQ
        VCVDDFSRYTWIKFIL+KLETFKTCQTLVTQLQREKNTGIGRIRT+HG EFENKHFAEFCDNEGIFHEFSA LTPQENGVVEKRN+TLQEMARVMIHAKQ
Subjt:  VCVDDFSRYTWIKFILDKLETFKTCQTLVTQLQREKNTGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHAKQ

Query:  LPIQFWAEALNTACHIHNRVILRPRTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYSQRTKIVIESINVII
        LPIQFWAEALNTACHIHNRVILRPRTTTTSYELWKGRKPNVKYFHIFG TCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVY+QRTKIVIESINVII
Subjt:  LPIQFWAEALNTACHIHNRVILRPRTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYSQRTKIVIESINVII

Query:  DDLGKEPNKNLDDEDEVFWNSLSHKTAEGESESTAPTNGKTYLPSHFDSNKIDMSTPSTSANHSNTYESEAAVSASQHTPERTAGATDSPKHDPIPPMHI
        DDLGKEPN+NLDDEDEVFWNSLSHKTAEGESEST PTN  TYLPSHFDSNKIDMSTPSTS NHSNTYESEAAVSASQHTPERTAGATDSPK+D IPPMHI
Subjt:  DDLGKEPNKNLDDEDEVFWNSLSHKTAEGESESTAPTNGKTYLPSHFDSNKIDMSTPSTSANHSNTYESEAAVSASQHTPERTAGATDSPKHDPIPPMHI

Query:  AKNHPSSFIIGDVHSGIITQKKERKDYAKMVANVCYTFSLEPTTVSAVLTDEHWILALQEELLQFERNQVWELVPKSPYANIISTKWIFKNKTDEEGRVI
        AKNHP SFIIGDVHSGIITQKKERKDYAKMVANVCYTFSLEPTT+SAVLTDEH ILA+QEELLQFERNQVWELVPKSPYANII TKWIFKNKTDEEGRVI
Subjt:  AKNHPSSFIIGDVHSGIITQKKERKDYAKMVANVCYTFSLEPTTVSAVLTDEHWILALQEELLQFERNQVWELVPKSPYANIISTKWIFKNKTDEEGRVI

Query:  CNKARLVAQGYSQIEG
        CNKARLVAQGYSQIEG
Subjt:  CNKARLVAQGYSQIEG

KAA0059225.1 gag-pol polyprotein [Cucumis melo var. makuwa]9.5e-28581.1Show/hide
Query:  MTGNADFFSELSECKAGSVVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWD
        MTGNADFFSELSECK GSVVFGDGGKGKIIGKGTIN  GLPFLLDVRL+QGL+ANLIS SQLCDQGY+V+F+KDRCNVLD QNKVFLSGTRLSDNCYHWD
Subjt:  MTGNADFFSELSECKAGSVVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWD

Query:  AEVTLCNLSKVEEAGLWHKRLGHLGGATISKVIKANAIIGLPPLSFSSLESCSECPAGKQVKSVYKPVNISSTSHILELLHIDLMGPMQTESLGRKQYAV
        AEVTLCNLSKVEEA LWHKRLGHL GATISKV K +AIIGLPPL+F SLESCSEC AGKQVKSV+KPVNISSTSHILELLHIDLMGPMQTESLGRK YAV
Subjt:  AEVTLCNLSKVEEAGLWHKRLGHLGGATISKVIKANAIIGLPPLSFSSLESCSECPAGKQVKSVYKPVNISSTSHILELLHIDLMGPMQTESLGRKQYAV

Query:  VCVDDFSRYTWIKFILDKLETFKTCQTLVTQLQREKNTGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHAKQ
        VCVDDFSRYTWIKFILDK ETFKTCQTL TQLQREKNTGIG+I+TDHG EFEN+HFAEFCDNEGIFHEFSAPLT Q+NGV                    
Subjt:  VCVDDFSRYTWIKFILDKLETFKTCQTLVTQLQREKNTGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHAKQ

Query:  LPIQFWAEALNTACHIHNRVILRPRTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYSQRTKIVIESINVII
              AEALNTACHIHNRVILRP TTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGY AN+RAYRVY+Q +KIV+ESINVII
Subjt:  LPIQFWAEALNTACHIHNRVILRPRTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYSQRTKIVIESINVII

Query:  DDLGKEPNKNLDDEDEVFWNSLSHKTAEGESESTAPTNGKTYLPSHFDSNKIDMSTPSTSANHSNTYESEAAVSASQHTPERTAGATDSPKHDPIPPMHI
        DDL                        EGE ES A TN  TYLPSH   ++IDMSTPSTSA H NT+ESEA VSASQHTPE+TAGATDS K D IPP H 
Subjt:  DDLGKEPNKNLDDEDEVFWNSLSHKTAEGESESTAPTNGKTYLPSHFDSNKIDMSTPSTSANHSNTYESEAAVSASQHTPERTAGATDSPKHDPIPPMHI

Query:  AKNHPSSFIIGDVHSGIITQKKERKDYAKMVANVCYTFSLEPTTVSAVLTDEHWILALQEELLQFERNQVWELVPKSPYANIISTKWIFKNKTDEEGRVI
        AKNHPSSFII D+HSGIIT+KKERKDYAKMVANVCYT  LEPTTVSA L+DEHWIL +QEELLQFERNQVWELVPK PYANII TKWIFKNKTDEEGRVI
Subjt:  AKNHPSSFIIGDVHSGIITQKKERKDYAKMVANVCYTFSLEPTTVSAVLTDEHWILALQEELLQFERNQVWELVPKSPYANIISTKWIFKNKTDEEGRVI

Query:  CNKARLVAQGYSQIEGLDF
         NKARLVAQGYSQIEGLDF
Subjt:  CNKARLVAQGYSQIEGLDF

KAA0060049.1 F9C16.17 [Cucumis melo var. makuwa]3.0e-19862.18Show/hide
Query:  MTGNADFFSELSECKAGSVVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWD
        MTGN DFFSELSECK GSVVFG GGKGKIIGKGTINRPGLPFLLDVRLVQGLSANL S SQLCDQGY+                               D
Subjt:  MTGNADFFSELSECKAGSVVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWD

Query:  AEVTLCNLSKVEEAGLWHKRLGHLGGATISKVIKANAIIGLPPLSFSSLESCSECPAGKQVKSVYKPVNISSTSHILELLHIDLMGPMQTESLGRKQYAV
        AEV LCNLSKVEE GLWHKRLGHL G TISKV KA+AIIGLPPLSFSSL+SCSECPA KQ     +PV++SSTSH LELLHIDLMGPMQTESLGRK    
Subjt:  AEVTLCNLSKVEEAGLWHKRLGHLGGATISKVIKANAIIGLPPLSFSSLESCSECPAGKQVKSVYKPVNISSTSHILELLHIDLMGPMQTESLGRKQYAV

Query:  VCVDDFSRYTWIKFILDKLETFKTCQTLVTQLQREKNTGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHAKQ
                                                                                                            
Subjt:  VCVDDFSRYTWIKFILDKLETFKTCQTLVTQLQREKNTGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHAKQ

Query:  LPIQFWAEALNTACHIHNRVILRPRTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYSQRTKIVIESINVII
                          RVILR  TTTTSYELWKGRKPNVKYFHIF STCFILSDR+H RKWDSKSDRGIFLGYS N+RAYRVY+QRTKIV+E INVII
Subjt:  LPIQFWAEALNTACHIHNRVILRPRTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYSQRTKIVIESINVII

Query:  DDLGKEPNKNLDDEDEVFWNSLSHKTAEGESESTAPTNGKTYLPSHFDSNKIDMSTPSTSANHSNTYESEAAVSASQHTPERTAGATDSPKHDPIPPMHI
         DLGKEPN+NLDDEDE FW+SLSHK+ + ESEST+ T   TY P H DSN+IDMSTPSTS NH  T E EAAVSASQHTPERT G+TDSPKH  +P  +I
Subjt:  DDLGKEPNKNLDDEDEVFWNSLSHKTAEGESESTAPTNGKTYLPSHFDSNKIDMSTPSTSANHSNTYESEAAVSASQHTPERTAGATDSPKHDPIPPMHI

Query:  AKNHPSSFIIGDVHSGIITQKKERKDYAKMVANVCYTFSLEPTTVSAVLTDEHWILALQEELLQFERNQVWELVPKSPYANIISTKWIFKNKTDEEGRVI
        AK+HPSSFII DVHSGIIT+KKERKDYAKMV N+CYT SLEPTTVS  LT+EHWILA+Q+ELLQFERN+VWELVPK P+ANII TKWIFKNKTDE+GRVI
Subjt:  AKNHPSSFIIGDVHSGIITQKKERKDYAKMVANVCYTFSLEPTTVSAVLTDEHWILALQEELLQFERNQVWELVPKSPYANIISTKWIFKNKTDEEGRVI

Query:  CNKARLVAQGYSQIEG
         NKARLVAQGYSQIEG
Subjt:  CNKARLVAQGYSQIEG

TYK06509.1 gag-pol polyprotein [Cucumis melo var. makuwa]6.3e-23672.44Show/hide
Query:  LSECKAGSVVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWDAEVTLCNLSK
        + ECKAG VVF DGGKGKIIGKGTIN PGLPFLLDVRLVQGL+ANLIS SQLCDQGY+V+F+KDRCNVLDGQN++FLSGTRLSDNCYHWDAEV       
Subjt:  LSECKAGSVVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWDAEVTLCNLSK

Query:  VEEAGLWHKRLGHLGGATISKVIKANAIIGLPPLSFSSLESCSECPAGKQVKSVYKPVNISSTSHILELLHIDLMGPMQTESLGRKQYAVVCVDDFSRYT
                               KA+AIIGLPPLSF SLES SECP GKQVKSV+KP                                           
Subjt:  VEEAGLWHKRLGHLGGATISKVIKANAIIGLPPLSFSSLESCSECPAGKQVKSVYKPVNISSTSHILELLHIDLMGPMQTESLGRKQYAVVCVDDFSRYT

Query:  WIKFILDKLETFKTCQTLVTQLQREKNTGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHAKQLPIQFWAEAL
         IKFILDK E FKTCQTL TQLQREKNTGIGRIRTDH REFEN+HF++F DNEGIFHEFS PLTPQ+NGVVE+RN+TLQ+MA VMIHAK LPIQ WA+AL
Subjt:  WIKFILDKLETFKTCQTLVTQLQREKNTGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHAKQLPIQFWAEAL

Query:  NTACHIHNRVILRPRTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYSQRTKIVIESINVIIDDLGKEPNKN
        NTACHIHN VILRP TTTTSYELWKGRKPNVKYFHIF S CFILSDRDHR KWDSKSDRGIFLGYS N+RAYRVY+QR+K V+ESINVIIDDLGKEPN N
Subjt:  NTACHIHNRVILRPRTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYSQRTKIVIESINVIIDDLGKEPNKN

Query:  LDDEDEVFWNSLSHKTAEGESESTAPTNGKTYLPSHFDSNKIDMSTPSTSANHSNTYESEAAVSASQHTPERTAGATDSPKHDPIPPMHIAKNHPSSFII
        LDDEDEVFWNSLSHK  EGE +STA  N  TYLPSH  S + DMSTPSTS  H++T   EA VSA QHT E+TAGATDS K + IPP HIAKNHPSSFI 
Subjt:  LDDEDEVFWNSLSHKTAEGESESTAPTNGKTYLPSHFDSNKIDMSTPSTSANHSNTYESEAAVSASQHTPERTAGATDSPKHDPIPPMHIAKNHPSSFII

Query:  GDVHSGIITQKKERKDYAKMVANVCYTFSLEPTTVSAVLTDEHWILALQEELLQFERNQVWELVPKSPYANIISTKWIFKNKTDEEGRVICNKAR
        GD+HSGIIT+KKERKDYAKMVANVCYT SLEPTTVSAVL+DEHWILA+QEELLQFERNQVWELVPK PYANII TKWIFKNKTDEE    C   R
Subjt:  GDVHSGIITQKKERKDYAKMVANVCYTFSLEPTTVSAVLTDEHWILALQEELLQFERNQVWELVPKSPYANIISTKWIFKNKTDEEGRVICNKAR

TYK21888.1 hypothetical protein E5676_scaffold494G00240 [Cucumis melo var. makuwa]3.4e-25073.81Show/hide
Query:  SELSECKAGSVVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWDAEVTLCNL
        S+L ECK GSVVFG G KGK IGKGTINRP LPFLLDVRLVQGLSANLIS SQLCDQGY+                               DAEV LCNL
Subjt:  SELSECKAGSVVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWDAEVTLCNL

Query:  SKVEEAGLWHKRLGHLGGATISKVIKANAIIGLPPLSFSSLESCSECPAGKQVKSVYKPVNISSTSHILELLHIDLMGPMQTESLGRKQYAVVCVDDFSR
        SKVEE GLWHKRLGHL G TISKV KA+AIIGLPPLSFSSLESCSEC A KQ     +PV++SSTSH LELLHIDLMGPMQTESLGRK            
Subjt:  SKVEEAGLWHKRLGHLGGATISKVIKANAIIGLPPLSFSSLESCSECPAGKQVKSVYKPVNISSTSHILELLHIDLMGPMQTESLGRKQYAVVCVDDFSR

Query:  YTWIKFILDKLETFKTCQTLVTQLQREKNTGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHAKQLPIQFWAE
                                 REKNTGI RI+TDHGREFENK+F EFCDNEGIFHEFSA L PQ+NGVVE+RNRTLQEMA+VMIHAKQLPIQFW E
Subjt:  YTWIKFILDKLETFKTCQTLVTQLQREKNTGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHAKQLPIQFWAE

Query:  ALNTACHIHNRVILRPRTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYSQRTKIVIESINVIIDDLGKEPN
        ALNTACHIHNRVIL   TTTTSYELWKGRKPNVKYFHIF STCFILSDR+H RKWDSKSDRGIFLGYS N+RAYRVY+QRTK V+ES+NVII DLGKEPN
Subjt:  ALNTACHIHNRVILRPRTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYSQRTKIVIESINVIIDDLGKEPN

Query:  KNLDDEDEVFWNSLSHKTAEGESESTAPTNGKTYLPSHFDSNKIDMSTPSTSANHSNTYESEAAVSASQHTPERTAGATDSPKHDPIPPMHIAKNHPSSF
        +NLDDEDE FW+SLSHK+ + ESEST+ T   TY P H DSN+IDMSTPSTS NH  T E EAAVSASQHTPERTAG+TDSPKH  +P  +IAK+HPSSF
Subjt:  KNLDDEDEVFWNSLSHKTAEGESESTAPTNGKTYLPSHFDSNKIDMSTPSTSANHSNTYESEAAVSASQHTPERTAGATDSPKHDPIPPMHIAKNHPSSF

Query:  IIGDVHSGIITQKKERKDYAKMVANVCYTFSLEPTTVSAVLTDEHWILALQEELLQFERNQVWELVPKSPYANIISTKWIFKNKTDEEGRVICNKARLVA
        II DVHSGIIT+KKERKDYAKMV N+CYT SLEPTTVS  LT+EHWILA+QEELLQFERNQVWELVPK P+ANII TKWIFKNKTDE+GRVI NKARLVA
Subjt:  IIGDVHSGIITQKKERKDYAKMVANVCYTFSLEPTTVSAVLTDEHWILALQEELLQFERNQVWELVPKSPYANIISTKWIFKNKTDEEGRVICNKARLVA

Query:  QGYSQIEGLDF
        QGYSQIEGLDF
Subjt:  QGYSQIEGLDF

TrEMBL top hitse value%identityAlignment
A0A5A7V046 Gag-pol polyprotein4.6e-28581.1Show/hide
Query:  MTGNADFFSELSECKAGSVVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWD
        MTGNADFFSELSECK GSVVFGDGGKGKIIGKGTIN  GLPFLLDVRL+QGL+ANLIS SQLCDQGY+V+F+KDRCNVLD QNKVFLSGTRLSDNCYHWD
Subjt:  MTGNADFFSELSECKAGSVVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWD

Query:  AEVTLCNLSKVEEAGLWHKRLGHLGGATISKVIKANAIIGLPPLSFSSLESCSECPAGKQVKSVYKPVNISSTSHILELLHIDLMGPMQTESLGRKQYAV
        AEVTLCNLSKVEEA LWHKRLGHL GATISKV K +AIIGLPPL+F SLESCSEC AGKQVKSV+KPVNISSTSHILELLHIDLMGPMQTESLGRK YAV
Subjt:  AEVTLCNLSKVEEAGLWHKRLGHLGGATISKVIKANAIIGLPPLSFSSLESCSECPAGKQVKSVYKPVNISSTSHILELLHIDLMGPMQTESLGRKQYAV

Query:  VCVDDFSRYTWIKFILDKLETFKTCQTLVTQLQREKNTGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHAKQ
        VCVDDFSRYTWIKFILDK ETFKTCQTL TQLQREKNTGIG+I+TDHG EFEN+HFAEFCDNEGIFHEFSAPLT Q+NGV                    
Subjt:  VCVDDFSRYTWIKFILDKLETFKTCQTLVTQLQREKNTGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHAKQ

Query:  LPIQFWAEALNTACHIHNRVILRPRTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYSQRTKIVIESINVII
              AEALNTACHIHNRVILRP TTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGY AN+RAYRVY+Q +KIV+ESINVII
Subjt:  LPIQFWAEALNTACHIHNRVILRPRTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYSQRTKIVIESINVII

Query:  DDLGKEPNKNLDDEDEVFWNSLSHKTAEGESESTAPTNGKTYLPSHFDSNKIDMSTPSTSANHSNTYESEAAVSASQHTPERTAGATDSPKHDPIPPMHI
        DDL                        EGE ES A TN  TYLPSH   ++IDMSTPSTSA H NT+ESEA VSASQHTPE+TAGATDS K D IPP H 
Subjt:  DDLGKEPNKNLDDEDEVFWNSLSHKTAEGESESTAPTNGKTYLPSHFDSNKIDMSTPSTSANHSNTYESEAAVSASQHTPERTAGATDSPKHDPIPPMHI

Query:  AKNHPSSFIIGDVHSGIITQKKERKDYAKMVANVCYTFSLEPTTVSAVLTDEHWILALQEELLQFERNQVWELVPKSPYANIISTKWIFKNKTDEEGRVI
        AKNHPSSFII D+HSGIIT+KKERKDYAKMVANVCYT  LEPTTVSA L+DEHWIL +QEELLQFERNQVWELVPK PYANII TKWIFKNKTDEEGRVI
Subjt:  AKNHPSSFIIGDVHSGIITQKKERKDYAKMVANVCYTFSLEPTTVSAVLTDEHWILALQEELLQFERNQVWELVPKSPYANIISTKWIFKNKTDEEGRVI

Query:  CNKARLVAQGYSQIEGLDF
         NKARLVAQGYSQIEGLDF
Subjt:  CNKARLVAQGYSQIEGLDF

A0A5A7V2P3 F9C16.171.5e-19862.18Show/hide
Query:  MTGNADFFSELSECKAGSVVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWD
        MTGN DFFSELSECK GSVVFG GGKGKIIGKGTINRPGLPFLLDVRLVQGLSANL S SQLCDQGY+                               D
Subjt:  MTGNADFFSELSECKAGSVVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWD

Query:  AEVTLCNLSKVEEAGLWHKRLGHLGGATISKVIKANAIIGLPPLSFSSLESCSECPAGKQVKSVYKPVNISSTSHILELLHIDLMGPMQTESLGRKQYAV
        AEV LCNLSKVEE GLWHKRLGHL G TISKV KA+AIIGLPPLSFSSL+SCSECPA KQ     +PV++SSTSH LELLHIDLMGPMQTESLGRK    
Subjt:  AEVTLCNLSKVEEAGLWHKRLGHLGGATISKVIKANAIIGLPPLSFSSLESCSECPAGKQVKSVYKPVNISSTSHILELLHIDLMGPMQTESLGRKQYAV

Query:  VCVDDFSRYTWIKFILDKLETFKTCQTLVTQLQREKNTGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHAKQ
                                                                                                            
Subjt:  VCVDDFSRYTWIKFILDKLETFKTCQTLVTQLQREKNTGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHAKQ

Query:  LPIQFWAEALNTACHIHNRVILRPRTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYSQRTKIVIESINVII
                          RVILR  TTTTSYELWKGRKPNVKYFHIF STCFILSDR+H RKWDSKSDRGIFLGYS N+RAYRVY+QRTKIV+E INVII
Subjt:  LPIQFWAEALNTACHIHNRVILRPRTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYSQRTKIVIESINVII

Query:  DDLGKEPNKNLDDEDEVFWNSLSHKTAEGESESTAPTNGKTYLPSHFDSNKIDMSTPSTSANHSNTYESEAAVSASQHTPERTAGATDSPKHDPIPPMHI
         DLGKEPN+NLDDEDE FW+SLSHK+ + ESEST+ T   TY P H DSN+IDMSTPSTS NH  T E EAAVSASQHTPERT G+TDSPKH  +P  +I
Subjt:  DDLGKEPNKNLDDEDEVFWNSLSHKTAEGESESTAPTNGKTYLPSHFDSNKIDMSTPSTSANHSNTYESEAAVSASQHTPERTAGATDSPKHDPIPPMHI

Query:  AKNHPSSFIIGDVHSGIITQKKERKDYAKMVANVCYTFSLEPTTVSAVLTDEHWILALQEELLQFERNQVWELVPKSPYANIISTKWIFKNKTDEEGRVI
        AK+HPSSFII DVHSGIIT+KKERKDYAKMV N+CYT SLEPTTVS  LT+EHWILA+Q+ELLQFERN+VWELVPK P+ANII TKWIFKNKTDE+GRVI
Subjt:  AKNHPSSFIIGDVHSGIITQKKERKDYAKMVANVCYTFSLEPTTVSAVLTDEHWILALQEELLQFERNQVWELVPKSPYANIISTKWIFKNKTDEEGRVI

Query:  CNKARLVAQGYSQIEG
         NKARLVAQGYSQIEG
Subjt:  CNKARLVAQGYSQIEG

A0A5D3C826 Gag-pol polyprotein3.0e-23672.44Show/hide
Query:  LSECKAGSVVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWDAEVTLCNLSK
        + ECKAG VVF DGGKGKIIGKGTIN PGLPFLLDVRLVQGL+ANLIS SQLCDQGY+V+F+KDRCNVLDGQN++FLSGTRLSDNCYHWDAEV       
Subjt:  LSECKAGSVVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWDAEVTLCNLSK

Query:  VEEAGLWHKRLGHLGGATISKVIKANAIIGLPPLSFSSLESCSECPAGKQVKSVYKPVNISSTSHILELLHIDLMGPMQTESLGRKQYAVVCVDDFSRYT
                               KA+AIIGLPPLSF SLES SECP GKQVKSV+KP                                           
Subjt:  VEEAGLWHKRLGHLGGATISKVIKANAIIGLPPLSFSSLESCSECPAGKQVKSVYKPVNISSTSHILELLHIDLMGPMQTESLGRKQYAVVCVDDFSRYT

Query:  WIKFILDKLETFKTCQTLVTQLQREKNTGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHAKQLPIQFWAEAL
         IKFILDK E FKTCQTL TQLQREKNTGIGRIRTDH REFEN+HF++F DNEGIFHEFS PLTPQ+NGVVE+RN+TLQ+MA VMIHAK LPIQ WA+AL
Subjt:  WIKFILDKLETFKTCQTLVTQLQREKNTGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHAKQLPIQFWAEAL

Query:  NTACHIHNRVILRPRTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYSQRTKIVIESINVIIDDLGKEPNKN
        NTACHIHN VILRP TTTTSYELWKGRKPNVKYFHIF S CFILSDRDHR KWDSKSDRGIFLGYS N+RAYRVY+QR+K V+ESINVIIDDLGKEPN N
Subjt:  NTACHIHNRVILRPRTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYSQRTKIVIESINVIIDDLGKEPNKN

Query:  LDDEDEVFWNSLSHKTAEGESESTAPTNGKTYLPSHFDSNKIDMSTPSTSANHSNTYESEAAVSASQHTPERTAGATDSPKHDPIPPMHIAKNHPSSFII
        LDDEDEVFWNSLSHK  EGE +STA  N  TYLPSH  S + DMSTPSTS  H++T   EA VSA QHT E+TAGATDS K + IPP HIAKNHPSSFI 
Subjt:  LDDEDEVFWNSLSHKTAEGESESTAPTNGKTYLPSHFDSNKIDMSTPSTSANHSNTYESEAAVSASQHTPERTAGATDSPKHDPIPPMHIAKNHPSSFII

Query:  GDVHSGIITQKKERKDYAKMVANVCYTFSLEPTTVSAVLTDEHWILALQEELLQFERNQVWELVPKSPYANIISTKWIFKNKTDEEGRVICNKAR
        GD+HSGIIT+KKERKDYAKMVANVCYT SLEPTTVSAVL+DEHWILA+QEELLQFERNQVWELVPK PYANII TKWIFKNKTDEE    C   R
Subjt:  GDVHSGIITQKKERKDYAKMVANVCYTFSLEPTTVSAVLTDEHWILALQEELLQFERNQVWELVPKSPYANIISTKWIFKNKTDEEGRVICNKAR

A0A5D3C9Q6 Gag-pol polyprotein0.0e+0096.1Show/hide
Query:  MTGNADFFSELSECKAGSVVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWD
        MTGNADFFSELSECKAGSVVF DGGKGKIIGKGTINRPGLPFLLDVRLVQGLSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWD
Subjt:  MTGNADFFSELSECKAGSVVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWD

Query:  AEVTLCNLSKVEEAGLWHKRLGHLGGATISKVIKANAIIGLPPLSFSSLESCSECPAGKQVKSVYKPVNISSTSHILELLHIDLMGPMQTESLGRKQYAV
        AEVTLCNLSKVEEAGLWHKRLGHLGGATISKVIKANAIIGLPPLSFSSLESCSECPAGKQVKSV+KPVNIS TSHILELLHIDLM PMQTESLGRK+YAV
Subjt:  AEVTLCNLSKVEEAGLWHKRLGHLGGATISKVIKANAIIGLPPLSFSSLESCSECPAGKQVKSVYKPVNISSTSHILELLHIDLMGPMQTESLGRKQYAV

Query:  VCVDDFSRYTWIKFILDKLETFKTCQTLVTQLQREKNTGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHAKQ
        VCVDDFSRYTWIKFIL+KLETFKTCQTLVTQLQREKNTGIGRIRT+HG EFENKHFAEFCDNEGIFHEFSA LTPQENGVVEKRN+TLQEMARVMIHAKQ
Subjt:  VCVDDFSRYTWIKFILDKLETFKTCQTLVTQLQREKNTGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHAKQ

Query:  LPIQFWAEALNTACHIHNRVILRPRTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYSQRTKIVIESINVII
        LPIQFWAEALNTACHIHNRVILRPRTTTTSYELWKGRKPNVKYFHIFG TCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVY+QRTKIVIESINVII
Subjt:  LPIQFWAEALNTACHIHNRVILRPRTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYSQRTKIVIESINVII

Query:  DDLGKEPNKNLDDEDEVFWNSLSHKTAEGESESTAPTNGKTYLPSHFDSNKIDMSTPSTSANHSNTYESEAAVSASQHTPERTAGATDSPKHDPIPPMHI
        DDLGKEPN+NLDDEDEVFWNSLSHKTAEGESEST PTN  TYLPSHFDSNKIDMSTPSTS NHSNTYESEAAVSASQHTPERTAGATDSPK+D IPPMHI
Subjt:  DDLGKEPNKNLDDEDEVFWNSLSHKTAEGESESTAPTNGKTYLPSHFDSNKIDMSTPSTSANHSNTYESEAAVSASQHTPERTAGATDSPKHDPIPPMHI

Query:  AKNHPSSFIIGDVHSGIITQKKERKDYAKMVANVCYTFSLEPTTVSAVLTDEHWILALQEELLQFERNQVWELVPKSPYANIISTKWIFKNKTDEEGRVI
        AKNHP SFIIGDVHSGIITQKKERKDYAKMVANVCYTFSLEPTT+SAVLTDEH ILA+QEELLQFERNQVWELVPKSPYANII TKWIFKNKTDEEGRVI
Subjt:  AKNHPSSFIIGDVHSGIITQKKERKDYAKMVANVCYTFSLEPTTVSAVLTDEHWILALQEELLQFERNQVWELVPKSPYANIISTKWIFKNKTDEEGRVI

Query:  CNKARLVAQGYSQIEG
        CNKARLVAQGYSQIEG
Subjt:  CNKARLVAQGYSQIEG

A0A5D3DEL4 Integrase catalytic domain-containing protein1.7e-25073.81Show/hide
Query:  SELSECKAGSVVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWDAEVTLCNL
        S+L ECK GSVVFG G KGK IGKGTINRP LPFLLDVRLVQGLSANLIS SQLCDQGY+                               DAEV LCNL
Subjt:  SELSECKAGSVVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWDAEVTLCNL

Query:  SKVEEAGLWHKRLGHLGGATISKVIKANAIIGLPPLSFSSLESCSECPAGKQVKSVYKPVNISSTSHILELLHIDLMGPMQTESLGRKQYAVVCVDDFSR
        SKVEE GLWHKRLGHL G TISKV KA+AIIGLPPLSFSSLESCSEC A KQ     +PV++SSTSH LELLHIDLMGPMQTESLGRK            
Subjt:  SKVEEAGLWHKRLGHLGGATISKVIKANAIIGLPPLSFSSLESCSECPAGKQVKSVYKPVNISSTSHILELLHIDLMGPMQTESLGRKQYAVVCVDDFSR

Query:  YTWIKFILDKLETFKTCQTLVTQLQREKNTGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHAKQLPIQFWAE
                                 REKNTGI RI+TDHGREFENK+F EFCDNEGIFHEFSA L PQ+NGVVE+RNRTLQEMA+VMIHAKQLPIQFW E
Subjt:  YTWIKFILDKLETFKTCQTLVTQLQREKNTGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHAKQLPIQFWAE

Query:  ALNTACHIHNRVILRPRTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYSQRTKIVIESINVIIDDLGKEPN
        ALNTACHIHNRVIL   TTTTSYELWKGRKPNVKYFHIF STCFILSDR+H RKWDSKSDRGIFLGYS N+RAYRVY+QRTK V+ES+NVII DLGKEPN
Subjt:  ALNTACHIHNRVILRPRTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYSQRTKIVIESINVIIDDLGKEPN

Query:  KNLDDEDEVFWNSLSHKTAEGESESTAPTNGKTYLPSHFDSNKIDMSTPSTSANHSNTYESEAAVSASQHTPERTAGATDSPKHDPIPPMHIAKNHPSSF
        +NLDDEDE FW+SLSHK+ + ESEST+ T   TY P H DSN+IDMSTPSTS NH  T E EAAVSASQHTPERTAG+TDSPKH  +P  +IAK+HPSSF
Subjt:  KNLDDEDEVFWNSLSHKTAEGESESTAPTNGKTYLPSHFDSNKIDMSTPSTSANHSNTYESEAAVSASQHTPERTAGATDSPKHDPIPPMHIAKNHPSSF

Query:  IIGDVHSGIITQKKERKDYAKMVANVCYTFSLEPTTVSAVLTDEHWILALQEELLQFERNQVWELVPKSPYANIISTKWIFKNKTDEEGRVICNKARLVA
        II DVHSGIIT+KKERKDYAKMV N+CYT SLEPTTVS  LT+EHWILA+QEELLQFERNQVWELVPK P+ANII TKWIFKNKTDE+GRVI NKARLVA
Subjt:  IIGDVHSGIITQKKERKDYAKMVANVCYTFSLEPTTVSAVLTDEHWILALQEELLQFERNQVWELVPKSPYANIISTKWIFKNKTDEEGRVICNKARLVA

Query:  QGYSQIEGLDF
        QGYSQIEGLDF
Subjt:  QGYSQIEGLDF

SwissProt top hitse value%identityAlignment
P04146 Copia protein8.1e-4524.29Show/hide
Query:  LLDVRLVQGLSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWDAEVTLCNLSKVEEAGLWHKRLGHLGGATISKVIKANAIIGLP
        L DV   +  + NL+S  +L + G  + F K    +      + +  + + +N    + +    N        LWH+R GH+    + ++ + N      
Subjt:  LLDVRLVQGLSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWDAEVTLCNLSKVEEAGLWHKRLGHLGGATISKVIKANAIIGLP

Query:  PLSFSSL--ESCSECPAGKQVKSVYKPVNISSTSHI---LELLHIDLMGPMQTESLGRKQYAVVCVDDFSRYTWIKFILDKLETFKTCQTLVTQLQREKN
         L+   L  E C  C  GKQ +  +K   +   +HI   L ++H D+ GP+   +L  K Y V+ VD F+ Y     I  K + F   Q  V + +   N
Subjt:  PLSFSSL--ESCSECPAGKQVKSVYKPVNISSTSHI---LELLHIDLMGPMQTESLGRKQYAVVCVDDFSRYTWIKFILDKLETFKTCQTLVTQLQREKN

Query:  TGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHAKQLPIQFWAEALNTACHIHNRVILRP--RTTTTSYELWK
          +  +  D+GRE+ +    +FC  +GI +  + P TPQ NGV E+  RT+ E AR M+   +L   FW EA+ TA ++ NR+  R    ++ T YE+W 
Subjt:  TGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHAKQLPIQFWAEALNTACHIHNRVILRP--RTTTTSYELWK

Query:  GRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSAN-----NRAYRVYSQRTKIVIESIN-----------VIIDDLGKEPNKNL-DDEDEVF
         +KP +K+  +FG+T ++   ++ + K+D KS + IF+GY  N     +     +     +V++  N           V + D  +  NKN  +D  ++ 
Subjt:  GRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSAN-----NRAYRVYSQRTKIVIESIN-----------VIIDDLGKEPNKNL-DDEDEVF

Query:  WNSLSHKTAE----------GESES-TAPTNGKTYLPSHF-----------------DSNKIDMSTPSTSANHSNTYESEAAVSASQHTPERTA------
             +++ E           ESE+   P + +  + + F                 +SNK  ++         +  ES+ + + ++     TA      
Subjt:  WNSLSHKTAE----------GESES-TAPTNGKTYLPSHF-----------------DSNKIDMSTPSTSANHSNTYESEAAVSASQHTPERTA------

Query:  GATDSPKHDPIPPMHIAKNHPSSFIIGDVHSGIITQKKERKDYAKMVANVCYTFSLEPTTVSAVL---TDEHWILALQEELLQFERNQVWELVPKSPYAN
        G  +  K+D I  ++       +          I+  +E     K+V N    F+  P +   +        W  A+  EL   + N  W +  +    N
Subjt:  GATDSPKHDPIPPMHIAKNHPSSFIIGDVHSGIITQKKERKDYAKMVANVCYTFSLEPTTVSAVL---TDEHWILALQEELLQFERNQVWELVPKSPYAN

Query:  IISTKWIFKNKTDEEGRVICNKARLVAQGYSQIEGLDF
        I+ ++W+F  K +E G  I  KARLVA+G++Q   +D+
Subjt:  IISTKWIFKNKTDEEGRVICNKARLVAQGYSQIEGLDF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.6e-5627.8Show/hide
Query:  DFFSELSECKAGSVVFGDGGKGKIIGKGTI---NRPGLPFLL-DVRLVQGLSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWDA
        D F        G+V  G+    KI G G I      G   +L DVR V  L  NLIS   L   GY+  F+  +  +  G   +     R     Y  +A
Subjt:  DFFSELSECKAGSVVFGDGGKGKIIGKGTI---NRPGLPFLL-DVRLVQGLSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWDA

Query:  EVTLCNLSKVEE---AGLWHKRLGHLGGATISKVIKANAIIGLPPLSFSSLESCSECPAGKQVKSVYKPVNISSTSHILELLHIDLMGPMQTESLGRKQY
        E+    L+  ++     LWHKR+GH+    +  + K + I        ++++ C  C  GKQ +  ++  +     +IL+L++ D+ GPM+ ES+G  +Y
Subjt:  EVTLCNLSKVEE---AGLWHKRLGHLGGATISKVIKANAIIGLPPLSFSSLESCSECPAGKQVKSVYKPVNISSTSHILELLHIDLMGPMQTESLGRKQY

Query:  AVVCVDDFSRYTWIKFILDKLETFKTCQTLVTQLQREKNTGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHA
         V  +DD SR  W+  +  K + F+  Q     ++RE    + R+R+D+G E+ ++ F E+C + GI HE + P TPQ NGV E+ NRT+ E  R M+  
Subjt:  AVVCVDDFSRYTWIKFILDKLETFKTCQTLVTQLQREKNTGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHA

Query:  KQLPIQFWAEALNTACHIHNRVILRPRTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYSQRTKIVIESINV
         +LP  FW EA+ TAC++ NR    P        +W  ++ +  +  +FG   F    ++ R K D KS   IF+GY      YR++    K VI S +V
Subjt:  KQLPIQFWAEALNTACHIHNRVILRPRTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYSQRTKIVIESINV

Query:  IIDDLGKEPNKNLDDEDEVFWNSLSHKTAEGESESTAPTNGKTYLPSHFDSNKIDMSTPSTSANHSNTYESEAAVSASQHTPERTAGATDSPKHDPIPPM
        +              E EV       +TA   SE     NG   +P+        ++ PSTS N ++   +   VS     P       +          
Subjt:  IIDDLGKEPNKNLDDEDEVFWNSLSHKTAEGESESTAPTNGKTYLPSHFDSNKIDMSTPSTSANHSNTYESEAAVSASQHTPERTAGATDSPKHDPIPPM

Query:  HIAKNHPSSFIIGDVHSGIITQKKERKDYAKMVAN--VCYTFSLEPTTVSAVLT---DEHWILALQEELLQFERNQVWELVPKSPYANIISTKWIFKNKT
        H  +         + H  +   ++ R +  +  +   V  +   EP ++  VL+       + A+QEE+   ++N  ++LV        +  KW+FK K 
Subjt:  HIAKNHPSSFIIGDVHSGIITQKKERKDYAKMVAN--VCYTFSLEPTTVSAVLT---DEHWILALQEELLQFERNQVWELVPKSPYANIISTKWIFKNKT

Query:  DEEGRVICNKARLVAQGYSQIEGLDF
        D + +++  KARLV +G+ Q +G+DF
Subjt:  DEEGRVICNKARLVAQGYSQIEGLDF

Q03494 Transposon Ty2-DR2 Gag-Pol polyprotein1.7e-1821.53Show/hide
Query:  LSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHW----------DAEVTLCNLSKVEEAG-----LWHKRLGHLGGATISKVIKAN
        ++ +L+S S+L +Q     F+++     DG     +    +    ++W           +++T+ N++K +        L H+ LGH    +I K +K N
Subjt:  LSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHW----------DAEVTLCNLSKVEEAG-----LWHKRLGHLGGATISKVIKAN

Query:  AIIGLP----PLSFSSLESCSECPAGKQVKSVY---KPVNISSTSHILELLHIDLMGPMQTESLGRKQYAVVCVDDFSRYTWIKFILDKLE--TFKTCQT
        A+  L       S +S   C +C  GK  K  +     +    +    + LH D+ GP+         Y +   D+ +R+ W+  + D+ E        +
Subjt:  AIIGLP----PLSFSSLESCSECPAGKQVKSVY---KPVNISSTSHILELLHIDLMGPMQTESLGRKQYAVVCVDDFSRYTWIKFILDKLE--TFKTCQT

Query:  LVTQLQREKNTGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHAKQLPIQFWAEALNTACHIHNRVILRPRTT
        ++  ++ + N  +  I+ D G E+ NK   +F  N GI   ++     + +GV E+ NRTL    R ++H   LP   W  A+  +  I N ++  P+  
Subjt:  LVTQLQREKNTGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHAKQLPIQFWAEALNTACHIHNRVILRPRTT

Query:  TTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYSQRTKIVIESINVIIDDLGKEPNKNLDDEDEVFWNSLSHKTA
         ++ +       ++     FG    I+++ +   K   +   G  L  S N+  Y +Y    K  +++ N +I    +      D +   F + L+  TA
Subjt:  TTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYSQRTKIVIESINVIIDDLGKEPNKNLDDEDEVFWNSLSHKTA

Query:  EGES
          +S
Subjt:  EGES

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.1e-4125.91Show/hide
Query:  LLDVRLVQGLSANLISTSQLCD-QGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWDAEVT-----LCNLSKVEEAGLWHKRLGHLGGATISKVIKAN
        L ++  V  +  NLIS  +LC+  G  V F      V D    V L   +  D  Y W    +       + S       WH RLGH   + ++ VI   
Subjt:  LLDVRLVQGLSANLISTSQLCD-QGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWDAEVT-----LCNLSKVEEAGLWHKRLGHLGGATISKVIKAN

Query:  AIIGLPPLSFSSLESCSECPAGKQVKSVYKPVNISSTSHILELLHIDLMGPMQTESLGRKQYAVVCVDDFSRYTWIKFILDKLETFKTCQTLVTQLQREK
        ++  L P       SCS+C   K  K  +    I+ST   LE ++ D+       S    +Y V+ VD F+RYTW+  +  K +  +T  T    L+   
Subjt:  AIIGLPPLSFSSLESCSECPAGKQVKSVYKPVNISSTSHILELLHIDLMGPMQTESLGRKQYAVVCVDDFSRYTWIKFILDKLETFKTCQTLVTQLQREK

Query:  NTGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHAKQLPIQFWAEALNTACHIHNRVILRPRTTTTSYELWKG
         T IG   +D+G EF      E+    GI H  S P TP+ NG+ E+++R + E    ++    +P  +W  A   A ++ NR+        + ++   G
Subjt:  NTGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHAKQLPIQFWAEALNTACHIHNRVILRPRTTTTSYELWKG

Query:  RKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYSQRTKIVIESINVIIDD---------LGKEPNKNLDDEDEVFWN-------
          PN     +FG  C+      ++ K D KS + +FLGYS    AY     +T  +  S +V  D+             P +    E    W+       
Subjt:  RKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYSQRTKIVIESINVIIDD---------LGKEPNKNLDDEDEVFWN-------

Query:  ----------SLSHKTAEGESESTAPTNGKTYLPSHFDSNKIDM--STPSTSANHSN--------------------------TYESEAAVSASQHTPER
                  S  H  A   S  +AP        S+ DS+      S+P  +A   N                          T ES + ++ S  TP +
Subjt:  ----------SLSHKTAEGESESTAPTNGKTYLPSHFDSNKIDM--STPSTSANHSN--------------------------TYESEAAVSASQHTPER

Query:  TAGATDSPK-----------------HDPIPPMHIAKNHPSSFIIGDVHS-GIITQKKERKDYAKMVANVCYTFSLEPTTVSAVLTDEHWILALQEELLQ
        ++ ++ SP                  H P P   I  N+  + +  + HS G   +    K   K    V      EP T    L DE W  A+  E+  
Subjt:  TAGATDSPK-----------------HDPIPPMHIAKNHPSSFIIGDVHS-GIITQKKERKDYAKMVANVCYTFSLEPTTVSAVLTDEHWILALQEELLQ

Query:  FERNQVWELVPKSP-YANIISTKWIFKNKTDEEGRVICNKARLVAQGYSQIEGLDF
           N  W+LVP  P +  I+  +WIF  K + +G +   KARLVA+GY+Q  GLD+
Subjt:  FERNQVWELVPKSP-YANIISTKWIFKNKTDEEGRVICNKARLVAQGYSQIEGLDF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.1e-3725.62Show/hide
Query:  VVFGDGGKGKIIGKGTINRPGLPFLLD---VRLVQGLSANLISTSQLCDQG-YKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHW---DAEVTLCNLSKV
        V+  DG    I   G+ + P     LD   V  V  +  NLIS  +LC+     V F      V D    V L   +  D  Y W    ++      S  
Subjt:  VVFGDGGKGKIIGKGTINRPGLPFLLD---VRLVQGLSANLISTSQLCDQG-YKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHW---DAEVTLCNLSKV

Query:  EEA--GLWHKRLGHLGGATISKVIKANAIIGLPPLSFSSLESCSECPAGKQVKSVYKPVNISSTSHILELLHIDLMGPMQTESLGRKQYAVVCVDDFSRY
         +A    WH RLGH   A ++ VI  +++  L P     L SCS+C   K  K  +    I+S S  LE ++ D+       S+   +Y V+ VD F+RY
Subjt:  EEA--GLWHKRLGHLGGATISKVIKANAIIGLPPLSFSSLESCSECPAGKQVKSVYKPVNISSTSHILELLHIDLMGPMQTESLGRKQYAVVCVDDFSRY

Query:  TWIKFILDKLETFKTCQTLVTQLQREKNTGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHAKQLPIQFWAEA
        TW+  +  K +   T     + ++    T IG + +D+G EF      ++    GI H  S P TP+ NG+ E+++R + EM   ++    +P  +W  A
Subjt:  TWIKFILDKLETFKTCQTLVTQLQREKNTGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHAKQLPIQFWAEA

Query:  LNTACHIHNRVILRPRTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYSQRTKIVIESINVIIDDLGKEPNK
         + A ++ NR+        + ++   G+ PN +   +FG  C+      +R K + KS +  F+GYS    AY      T  +  S +V  D+    P  
Subjt:  LNTACHIHNRVILRPRTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYSQRTKIVIESINVIIDDLGKEPNK

Query:  NLDDEDEVFWNSLSHKTAEGESESTAPTN-----GKTYLPSHFDSNKIDMSTPS----TSANHSNTYESEAAV---------------------------
          +          S       S +T PT          L  H D++    S+PS    T  + SN   S  +                            
Subjt:  NLDDEDEVFWNSLSHKTAEGESESTAPTN-----GKTYLPSHFDSNKIDMSTPS----TSANHSNTYESEAAV---------------------------

Query:  ------------SASQHTPERTAGATDSP---KHDPIPPMHIAK-NHPSSFIIG----------------------DVHSGIITQKKE--RKDYAKMVAN
                    S S ++P + +    SP    H P P   I++ N PSS                          + HS + T+ K+  RK   K    
Subjt:  ------------SASQHTPERTAGATDSP---KHDPIPPMHIAK-NHPSSFIIG----------------------DVHSGIITQKKE--RKDYAKMVAN

Query:  VCYTFSLEPTTVSAVLTDEHWILALQEELLQFERNQVWELV-PKSPYANIISTKWIFKNKTDEEGRVICNKARLVAQGYSQIEGLDF
             + EP T    + D+ W  A+  E+     N  W+LV P  P   I+  +WIF  K + +G +   KARLVA+GY+Q  GLD+
Subjt:  VCYTFSLEPTTVSAVLTDEHWILALQEELLQFERNQVWELV-PKSPYANIISTKWIFKNKTDEEGRVICNKARLVAQGYSQIEGLDF

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.1e-0936.05Show/hide
Query:  VCYTFSLEPTTVSAVLTDEHWILALQEELLQFERNQVWELVPKSPYANIISTKWIFKNKTDEEGRVICNKARLVAQGYSQIEGLDF
        VC   + EP+T +       W  A+ +E+   E    WE+    P    I  KW++K K + +G +   KARLVA+GY+Q EG+DF
Subjt:  VCYTFSLEPTTVSAVLTDEHWILALQEELLQFERNQVWELVPKSPYANIISTKWIFKNKTDEEGRVICNKARLVAQGYSQIEGLDF

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.1e-1241.76Show/hide
Query:  KMVANVCYTFSLEPTTVSAVLTDEHWILALQEELLQFERNQVWELVPKSPYANIISTKWIFKNKTDEEGRVICNKARLVAQGYSQIEGLDF
        K    +  T   EP +V   L D  W  A+QEEL    RN+ W LVP     NI+  KW+FK K   +G +   KARLVA+G+ Q EG+ F
Subjt:  KMVANVCYTFSLEPTTVSAVLTDEHWILALQEELLQFERNQVWELVPKSPYANIISTKWIFKNKTDEEGRVICNKARLVAQGYSQIEGLDF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAGGTAATGCAGATTTCTTTTCTGAACTTAGTGAATGCAAAGCTGGGTCAGTAGTATTTGGAGATGGAGGGAAAGGAAAAATAATTGGCAAAGGAACGATT
AACCGTCCAGGTCTACCGTTTCTTCTTGATGTTCGACTAGTACAAGGATTGTCTGCAAATCTCATAAGCACCAGTCAATTATGTGACCAAGGCTATAAAGTCAAT
TTCAGTAAAGATAGGTGCAATGTGCTAGATGGTCAAAATAAAGTATTTCTCAGCGGAACAAGGTTGTCAGATAACTGCTATCACTGGGATGCAGAGGTTACCTTA
TGCAATCTATCAAAAGTGGAAGAAGCTGGACTCTGGCACAAACGACTTGGACATCTTGGTGGCGCTACTATTTCCAAGGTCATCAAAGCTAATGCCATTATTGGT
CTTCCCCCGCTATCATTCTCGTCACTAGAGAGCTGTTCGGAGTGTCCAGCTGGTAAGCAAGTCAAGTCCGTGTACAAGCCTGTAAATATCTCCTCGACGTCCCAT
ATTTTGGAACTTCTTCATATAGACCTAATGGGGCCCATGCAAACAGAAAGCTTGGGAAGGAAACAGTATGCAGTAGTTTGTGTAGACGATTTCTCTCGCTATACC
TGGATAAAGTTTATTCTTGACAAATTGGAAACCTTTAAGACGTGTCAGACCCTGGTCACTCAACTCCAAAGAGAGAAAAATACTGGTATTGGCCGAATACGAACT
GATCATGGGCGTGAATTTGAGAATAAGCACTTTGCTGAGTTCTGTGATAATGAAGGCATCTTTCATGAGTTCTCTGCCCCATTAACACCACAGGAAAATGGAGTT
GTAGAGAAAAGGAACCGAACTTTACAGGAGATGGCCCGAGTGATGATCCATGCAAAGCAACTGCCAATTCAATTCTGGGCTGAAGCTCTAAACACTGCATGCCAT
ATACACAATAGGGTCATTCTCCGTCCAAGGACCACTACTACCTCTTATGAGCTGTGGAAAGGAAGAAAACCCAATGTGAAGTATTTTCACATCTTTGGCAGCACA
TGCTTTATCTTGAGTGATAGGGATCATCGCAGAAAATGGGATTCAAAGTCAGATCGTGGAATATTTCTGGGATATTCTGCTAACAACCGAGCCTATAGGGTTTAC
AGTCAGCGTACAAAAATAGTAATCGAATCCATTAACGTCATTATTGATGACCTTGGTAAGGAACCCAACAAAAATCTTGATGATGAAGATGAGGTTTTCTGGAAT
TCCCTTTCTCATAAAACTGCAGAGGGAGAGTCAGAATCGACGGCCCCCACTAATGGAAAAACATACTTACCCTCTCATTTCGATTCAAACAAAATTGACATGTCA
ACACCTTCTACATCAGCCAATCATTCTAACACATATGAAAGTGAAGCAGCAGTATCTGCAAGTCAGCACACTCCAGAGCGGACTGCTGGTGCAACTGATTCTCCA
AAGCATGACCCCATACCTCCTATGCATATAGCCAAAAATCACCCCTCTAGCTTTATTATTGGAGATGTTCACAGTGGAATCATAACTCAGAAGAAGGAGAGGAAA
GATTATGCGAAAATGGTTGCCAATGTGTGCTACACATTTTCACTAGAACCTACCACGGTCTCTGCAGTACTTACCGATGAACACTGGATCTTGGCTTTGCAGGAA
GAGCTACTACAGTTTGAAAGAAACCAAGTATGGGAATTAGTACCAAAGTCACCTTATGCTAACATAATTAGTACAAAATGGATCTTTAAGAACAAAACAGATGAA
GAAGGAAGAGTTATCTGTAATAAAGCAAGACTGGTTGCTCAAGGGTATTCTCAAATAGAAGGTCTGGATTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACAGGTAATGCAGATTTCTTTTCTGAACTTAGTGAATGCAAAGCTGGGTCAGTAGTATTTGGAGATGGAGGGAAAGGAAAAATAATTGGCAAAGGAACGATT
AACCGTCCAGGTCTACCGTTTCTTCTTGATGTTCGACTAGTACAAGGATTGTCTGCAAATCTCATAAGCACCAGTCAATTATGTGACCAAGGCTATAAAGTCAAT
TTCAGTAAAGATAGGTGCAATGTGCTAGATGGTCAAAATAAAGTATTTCTCAGCGGAACAAGGTTGTCAGATAACTGCTATCACTGGGATGCAGAGGTTACCTTA
TGCAATCTATCAAAAGTGGAAGAAGCTGGACTCTGGCACAAACGACTTGGACATCTTGGTGGCGCTACTATTTCCAAGGTCATCAAAGCTAATGCCATTATTGGT
CTTCCCCCGCTATCATTCTCGTCACTAGAGAGCTGTTCGGAGTGTCCAGCTGGTAAGCAAGTCAAGTCCGTGTACAAGCCTGTAAATATCTCCTCGACGTCCCAT
ATTTTGGAACTTCTTCATATAGACCTAATGGGGCCCATGCAAACAGAAAGCTTGGGAAGGAAACAGTATGCAGTAGTTTGTGTAGACGATTTCTCTCGCTATACC
TGGATAAAGTTTATTCTTGACAAATTGGAAACCTTTAAGACGTGTCAGACCCTGGTCACTCAACTCCAAAGAGAGAAAAATACTGGTATTGGCCGAATACGAACT
GATCATGGGCGTGAATTTGAGAATAAGCACTTTGCTGAGTTCTGTGATAATGAAGGCATCTTTCATGAGTTCTCTGCCCCATTAACACCACAGGAAAATGGAGTT
GTAGAGAAAAGGAACCGAACTTTACAGGAGATGGCCCGAGTGATGATCCATGCAAAGCAACTGCCAATTCAATTCTGGGCTGAAGCTCTAAACACTGCATGCCAT
ATACACAATAGGGTCATTCTCCGTCCAAGGACCACTACTACCTCTTATGAGCTGTGGAAAGGAAGAAAACCCAATGTGAAGTATTTTCACATCTTTGGCAGCACA
TGCTTTATCTTGAGTGATAGGGATCATCGCAGAAAATGGGATTCAAAGTCAGATCGTGGAATATTTCTGGGATATTCTGCTAACAACCGAGCCTATAGGGTTTAC
AGTCAGCGTACAAAAATAGTAATCGAATCCATTAACGTCATTATTGATGACCTTGGTAAGGAACCCAACAAAAATCTTGATGATGAAGATGAGGTTTTCTGGAAT
TCCCTTTCTCATAAAACTGCAGAGGGAGAGTCAGAATCGACGGCCCCCACTAATGGAAAAACATACTTACCCTCTCATTTCGATTCAAACAAAATTGACATGTCA
ACACCTTCTACATCAGCCAATCATTCTAACACATATGAAAGTGAAGCAGCAGTATCTGCAAGTCAGCACACTCCAGAGCGGACTGCTGGTGCAACTGATTCTCCA
AAGCATGACCCCATACCTCCTATGCATATAGCCAAAAATCACCCCTCTAGCTTTATTATTGGAGATGTTCACAGTGGAATCATAACTCAGAAGAAGGAGAGGAAA
GATTATGCGAAAATGGTTGCCAATGTGTGCTACACATTTTCACTAGAACCTACCACGGTCTCTGCAGTACTTACCGATGAACACTGGATCTTGGCTTTGCAGGAA
GAGCTACTACAGTTTGAAAGAAACCAAGTATGGGAATTAGTACCAAAGTCACCTTATGCTAACATAATTAGTACAAAATGGATCTTTAAGAACAAAACAGATGAA
GAAGGAAGAGTTATCTGTAATAAAGCAAGACTGGTTGCTCAAGGGTATTCTCAAATAGAAGGTCTGGATTTTTGA
Protein sequenceShow/hide protein sequence
MTGNADFFSELSECKAGSVVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWDAEVTL
CNLSKVEEAGLWHKRLGHLGGATISKVIKANAIIGLPPLSFSSLESCSECPAGKQVKSVYKPVNISSTSHILELLHIDLMGPMQTESLGRKQYAVVCVDDFSRYT
WIKFILDKLETFKTCQTLVTQLQREKNTGIGRIRTDHGREFENKHFAEFCDNEGIFHEFSAPLTPQENGVVEKRNRTLQEMARVMIHAKQLPIQFWAEALNTACH
IHNRVILRPRTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYSQRTKIVIESINVIIDDLGKEPNKNLDDEDEVFWN
SLSHKTAEGESESTAPTNGKTYLPSHFDSNKIDMSTPSTSANHSNTYESEAAVSASQHTPERTAGATDSPKHDPIPPMHIAKNHPSSFIIGDVHSGIITQKKERK
DYAKMVANVCYTFSLEPTTVSAVLTDEHWILALQEELLQFERNQVWELVPKSPYANIISTKWIFKNKTDEEGRVICNKARLVAQGYSQIEGLDF