; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0016463 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0016463
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionGag-pol polyprotein
Genome locationchr01:4377199..4380075
RNA-Seq ExpressionPay0016463
SyntenyPay0016463
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAO73521.1 gag-pol polyprotein [Glycine max]7.8e-19351.04Show/hide
Query:  VALTSVKSPNSSDWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLDVRLIQGLAANLISISQLCDQGYQVSFNKDRCN
        V  TS+++    DWY DSGCSRHMTG  +F   +  C    V FGDG KGKIIG G + H GLP L  V L++GL ANLISISQLCD+G+ V+F K  C 
Subjt:  VALTSVKSPNSSDWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLDVRLIQGLAANLISISQLCDQGYQVSFNKDRCN

Query:  VLDVQNKVFLSGTRLSDNCYHWDAEVT----LCNLSKVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVHKPVNISST
        V + +++V + G+R  DNCY W  + T     C  SK +E R+WH+R GHL    + K+    A+ G+P L       C EC  GKQVK  H+ +   +T
Subjt:  VLDVQNKVFLSGTRLSDNCYHWDAEVT----LCNLSKVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVHKPVNISST

Query:  SHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIKFILDKSETFKTCQTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPL
        S +LELLH+DLMGPMQ ESLG K YA V VDDFSR+TW+KFI +KSETF+  + L  +LQREK+  I +I++DHG EFEN    EFC +EGI HEFSA +
Subjt:  SHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIKFILDKSETFKTCQTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPL

Query:  TLQQNGVVERRNRTLQEMAR-------------AEALNTACHIHNRVILRPGTTTTSYELWKGRKPNVK---------------DHRRKWDSKSDRGIFL
        T QQNG+VER+NRTLQE AR             AEA+NTAC+IHNRV LR GT TT YE+WKGRKP+VK               + RRK D KSD GIFL
Subjt:  TLQQNGVVERRNRTLQEMAR-------------AEALNTACHIHNRVILRPGTTTTSYELWKGRKPNVK---------------DHRRKWDSKSDRGIFL

Query:  GYLANSRAYRVYNQCSKIVMESINVIIDDLG---------------------------------------------RNLTEILMM---------------
        GY  NSRAYRV+N  ++ VMESINV++DDL                                              R+ T I  M               
Subjt:  GYLANSRAYRVYNQCSKIVMESINVIIDDLG---------------------------------------------RNLTEILMM---------------

Query:  -KLRFFGIL---------------------FLINQMKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGET
         + R   I+                     F IN M+E+L QF+RN+VWELVP+P   N+IGTKWIFKNKT+EEG + RNKARLVAQGY+QIEG+DF ET
Subjt:  -KLRFFGIL---------------------FLINQMKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGET

Query:  FAPVARLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYVAQPKGFVDPVHEDHVYKLRKALYRLKQAPRACYERLSTYLLQQGYQRGSADQTMFI
        FAPVARLE+IRLLL  AC  +FKL+QMDVKSAFLNGYL EEVYV QPKGF DP H DHVY+L+KALY LKQAPRA YERL+ +L QQGY++G  D+T+F+
Subjt:  FAPVARLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYVAQPKGFVDPVHEDHVYKLRKALYRLKQAPRACYERLSTYLLQQGYQRGSADQTMFI

Query:  YRQGTDFLIIQIYVDGIIFGDTS
         +   + +I QIYVD I+FG  S
Subjt:  YRQGTDFLIIQIYVDGIIFGDTS

AAO73527.1 gag-pol polyprotein [Glycine max]6.0e-19351.04Show/hide
Query:  VALTSVKSPNSSDWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLDVRLIQGLAANLISISQLCDQGYQVSFNKDRCN
        V  TS+++    DWY DSGCSRHMTG  +F   +  C    V FGDG KGKIIG G + H GLP L  V L++GL ANLISISQLCD+G+ V+F K  C 
Subjt:  VALTSVKSPNSSDWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLDVRLIQGLAANLISISQLCDQGYQVSFNKDRCN

Query:  VLDVQNKVFLSGTRLSDNCYHWDAEVT----LCNLSKVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVHKPVNISST
        V + +++V + G+R  DNCY W  + T     C  SK +E R+WH+R GHL    + K+    A+ G+P L       C EC  GKQVK  H+ +   +T
Subjt:  VLDVQNKVFLSGTRLSDNCYHWDAEVT----LCNLSKVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVHKPVNISST

Query:  SHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIKFILDKSETFKTCQTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPL
        S +LELLH+DLMGPMQ ESLG K YA V VDDFSR+TW+ FI +KSETF+  + L  +LQREK+  I +I++DHG EFEN  F EFC +EGI HEFSA +
Subjt:  SHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIKFILDKSETFKTCQTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPL

Query:  TLQQNGVVERRNRTLQEMAR-------------AEALNTACHIHNRVILRPGTTTTSYELWKGRKPNVK---------------DHRRKWDSKSDRGIFL
        T QQNG+VER+NRTLQE AR             AEA+NTAC+IHNRV LR GT TT YE+WKGRKP+VK               + RRK D KSD GIFL
Subjt:  TLQQNGVVERRNRTLQEMAR-------------AEALNTACHIHNRVILRPGTTTTSYELWKGRKPNVK---------------DHRRKWDSKSDRGIFL

Query:  GYLANSRAYRVYNQCSKIVMESINVIIDDLG---------------------------------------------RNLTEILMM---------------
        GY  NSRAYRV+N  ++ VMESINV++DDL                                              R+ T I  M               
Subjt:  GYLANSRAYRVYNQCSKIVMESINVIIDDLG---------------------------------------------RNLTEILMM---------------

Query:  -KLRFFGIL---------------------FLINQMKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGET
         + R   I+                     F IN M+E+L QF+RN+VWELVP+P   N+IGTKWIFKNKT+EEG + RNKARLVAQGY+QIEG+DF ET
Subjt:  -KLRFFGIL---------------------FLINQMKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGET

Query:  FAPVARLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYVAQPKGFVDPVHEDHVYKLRKALYRLKQAPRACYERLSTYLLQQGYQRGSADQTMFI
        FAPVARLE+IRLLL  AC  +FKL+QMDVKSAFLNGYL EEVYV QPKGF DP H DHVY+L+KALY LKQAPRA YERL+ +L QQGY++G  D+T+F+
Subjt:  FAPVARLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYVAQPKGFVDPVHEDHVYKLRKALYRLKQAPRACYERLSTYLLQQGYQRGSADQTMFI

Query:  YRQGTDFLIIQIYVDGIIFGDTS
         +   + +I QIYVD I+FG  S
Subjt:  YRQGTDFLIIQIYVDGIIFGDTS

KAA0040705.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.6e-19857.46Show/hide
Query:  MALISVCTMNDEE----NVQTHDQLES---KNLTNDTAN-RKIEDQEVILQQQERIQDLVEENQSFLSSIVTLKEELAETKHQFEELLKFARM-------
        MALIS+C MNDEE    N QTHD  ES   K LT+   + +K EDQE+ILQQQERIQDLVEENQSFLSSIVTLK EL ETKHQFEELLKFARM       
Subjt:  MALISVCTMNDEE----NVQTHDQLES---KNLTNDTAN-RKIEDQEVILQQQERIQDLVEENQSFLSSIVTLKEELAETKHQFEELLKFARM-------

Query:  ----------------------DTPVRKTVFIREGTLQNSPTNKEQGKGTEITSMPVKSPNKRTQNCKVALTSVKSPNSSDWYFDSGCSRHMTGNADFFS
                              D PVR T+FIREG       + +   G +I+           ++CKVA+TSVKSPNS DWYFDSGCSRHMTGNADFFS
Subjt:  ----------------------DTPVRKTVFIREGTLQNSPTNKEQGKGTEITSMPVKSPNKRTQNCKVALTSVKSPNSSDWYFDSGCSRHMTGNADFFS

Query:  ELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLDVRLIQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCYHWDAEVTLCNLS
        ELSECKVGSVVFGDGGKGKIIGKGTINH GLPFLLDV+L+QGL+ANL+SISQLCDQGYQVS +KDR NVLD QNKVF S TR+SDNCYHWDAEV LCNLS
Subjt:  ELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLDVRLIQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCYHWDAEVTLCNLS

Query:  KVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVHKPVNISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRY
        KV+EA LWHKRLGHL G TI KVTK DAIIGLPP SF SL+SC EC AGKQVKSVHKP                                          
Subjt:  KVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVHKPVNISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRY

Query:  TWIKFILDKSETFKTCQTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPLTLQQNGVVERRNRTLQEMARAEALNTACHIHNRVIL
                      TCQTLFTQLQREKNT IG+I+TDHG EFEN++F EFCDNE    E S  LT +           + +M+                 
Subjt:  TWIKFILDKSETFKTCQTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPLTLQQNGVVERRNRTLQEMARAEALNTACHIHNRVIL

Query:  RPGTTTTSYELWKGRK-PNVKDHRRKWDSKSDRGIFLGYLANSRAYRVYNQCSKIVMESINVIIDDLGRNLTEILMMKLRFFGILFLINQMKEKLLQFER
         P T+    E+ +G    +   H  +W   S          +S  +++              II D+   +  I   K R      + N   E+LLQFER
Subjt:  RPGTTTTSYELWKGRK-PNVKDHRRKWDSKSDRGIFLGYLANSRAYRVYNQCSKIVMESINVIIDDLGRNLTEILMMKLRFFGILFLINQMKEKLLQFER

Query:  NQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYVA
        NQVWELVPKPP+ANIIGTKWIFK KT E+GRVIRN+ARLVAQGYSQIEGLD  ETFA VARLEAIRLLLSYA F RFKLF MDVKSAFLNGYL EEVYVA
Subjt:  NQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYVA

Query:  QPKGFVDPVHEDHVYKLRKALYRLKQAPRAC
        +PKGFVD VH DHVYKL+KALY LKQA RAC
Subjt:  QPKGFVDPVHEDHVYKLRKALYRLKQAPRAC

KAA0048721.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.0e-23264.66Show/hide
Query:  LKFARMDTPVRKTVFIREGTLQNSPTNKEQGKGTEITSMPVKSPNKRTQNCKVALTSVKSPNSSDWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGG
        L+F   DTPVRKTVFIREGTLQNSPTN EQGK                +NCKVALTSVKSPNS DWYFDSGCSRHMTGNADFFSELSECK GSVVF DGG
Subjt:  LKFARMDTPVRKTVFIREGTLQNSPTNKEQGKGTEITSMPVKSPNKRTQNCKVALTSVKSPNSSDWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGG

Query:  KGKIIGKGTINHSGLPFLLDVRLIQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCYHWDAEVTLCNLSKVEEARLWHKRLGHLS
        KGKIIGKGTIN  GLPFLLDVRL+QGL+ANLIS SQLCDQGY+V+F+KDRCNVLD QNKVFLSGTRLSDNCYHWDAEVTLCNLSKVEEA LWHKRLGHL 
Subjt:  KGKIIGKGTINHSGLPFLLDVRLIQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCYHWDAEVTLCNLSKVEEARLWHKRLGHLS

Query:  GATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVHKPVNISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIKFILDKSETFKTC
        GATISKV K +AIIGLPPLSF SLESCSEC AGKQVKSVHKPVNIS TSHILELLHIDLM PMQTESLGRK YAVVCVDDFSRYTWIKFIL+K ETFKTC
Subjt:  GATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVHKPVNISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIKFILDKSETFKTC

Query:  QTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPLTLQQNGVVERRNRTLQEMAR-------------AEALNTACHIHNRVILRPG
        QTL TQLQREKNT IG+I+T+HG EFEN+HFAEFCDNEGIFHEFSA LT Q+NGVVE+RN+TLQEMAR             AEALNTACHIHNRVILRP 
Subjt:  QTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPLTLQQNGVVERRNRTLQEMAR-------------AEALNTACHIHNRVILRPG

Query:  TTTTSYELWKGRKPNVK---------------DHRRKWDSKSDRGIFLGYLANSRAYRVYNQCSKIVMESINVIIDDLG----RNL--------------
        TTTTSYELWKGRKPNVK               DHRRKWDSKSDRGIFLGY AN+RAYRVYNQ +KIV+ESINVIIDDLG    RNL              
Subjt:  TTTTSYELWKGRKPNVK---------------DHRRKWDSKSDRGIFLGYLANSRAYRVYNQCSKIVMESINVIIDDLG----RNL--------------

Query:  ------------TEILMMKLRF-------------------------------------------------------------------FGIL-------
                     E   +   F                                                                    GI+       
Subjt:  ------------TEILMMKLRF-------------------------------------------------------------------FGIL-------

Query:  ---------------------------FLINQMKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFA
                                    LI  M+E+LLQFERNQVWELVPK PYANIIGTKWIFKNKTDEEGRVI NKARLVAQGYSQIEG +    F+
Subjt:  ---------------------------FLINQMKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFA

KAA0059225.1 gag-pol polyprotein [Cucumis melo var. makuwa]0.0e+0077.03Show/hide
Query:  MALISVCTMNDEENVQTHDQLESKNLTNDTANRKIEDQEVILQQQERIQDLVEENQSFLSSIVTLKEELAETKHQFEELLKFARM---------------
        MALISVCTMNDEENVQTHDQLESKNLTNDTANRKIEDQEVILQQQERIQDLVEENQSFLSSIVTLKEELA+TKHQFEELLKFARM               
Subjt:  MALISVCTMNDEENVQTHDQLESKNLTNDTANRKIEDQEVILQQQERIQDLVEENQSFLSSIVTLKEELAETKHQFEELLKFARM---------------

Query:  --------------DTPVRKTVFIREGTLQNSPTNKEQGKGTEITSMPVK---SPNK------RTQNCKVALTSVKSPNSSDWYFDSGCSRHMTGNADFF
                      DTPVRKTVFIREGTLQNSPTN EQGKGTEITSMP K   SP          +NCKVALTSVKSPNSSDWYFDSGCSRHMTGNADFF
Subjt:  --------------DTPVRKTVFIREGTLQNSPTNKEQGKGTEITSMPVK---SPNK------RTQNCKVALTSVKSPNSSDWYFDSGCSRHMTGNADFF

Query:  SELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLDVRLIQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCYHWDAEVTLCNL
        SELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLDVRLIQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCYHWDAEVTLCNL
Subjt:  SELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLDVRLIQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCYHWDAEVTLCNL

Query:  SKVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVHKPVNISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSR
        SKVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPL+FLSLESCSECTAGKQVKSVHKPVNISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSR
Subjt:  SKVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVHKPVNISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSR

Query:  YTWIKFILDKSETFKTCQTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPLTLQQNGVVERRNRTLQEMARAEALNTACHIHNRVI
        YTWIKFILDK ETFKTCQTLFTQLQREKNT IGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPLTLQQNGV             AEALNTACHIHNRVI
Subjt:  YTWIKFILDKSETFKTCQTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPLTLQQNGVVERRNRTLQEMARAEALNTACHIHNRVI

Query:  LRPGTTTTSYELWKGRKPNVK---------------DHRRKWDSKSDRGIFLGYLANSRAYRVYNQCSKIVMESINVIIDDLGR---------NLTEILM
        LRPGTTTTSYELWKGRKPNVK               DHRRKWDSKSDRGIFLGYLANSRAYRVYNQCSKIVMESINVIIDDL           N T  L 
Subjt:  LRPGTTTTSYELWKGRKPNVK---------------DHRRKWDSKSDRGIFLGYLANSRAYRVYNQCSKIVMESINVIIDDLGR---------NLTEILM

Query:  MKLRFFGI------------------------------------------------------LFLINQ--------------------------------
          L    I                                                       F+I                                  
Subjt:  MKLRFFGI------------------------------------------------------LFLINQ--------------------------------

Query:  -------------MKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYACFWRF
                     ++E+LLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYACFWRF
Subjt:  -------------MKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYACFWRF

Query:  KLFQMDVKSAFLNGYLCEEVYVAQPKGFVDPVHEDHVYKLRKALYRLKQAPRACYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIIQIYVDGIIFGDT
        KLFQMDVKSAFLNGYLCEEVYVAQPKGFVDPVHEDHVYKLRKALYRLKQAPRA YERLSTYLLQQGYQRGSADQTMFIYRQGT+FLI+QIYVDGIIFGDT
Subjt:  KLFQMDVKSAFLNGYLCEEVYVAQPKGFVDPVHEDHVYKLRKALYRLKQAPRACYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIIQIYVDGIIFGDT

Query:  S
        S
Subjt:  S

TrEMBL top hitse value%identityAlignment
A0A5A7TGY4 Retrovirus-related Pol polyprotein from transposon TNT 1-947.9e-19957.46Show/hide
Query:  MALISVCTMNDEE----NVQTHDQLES---KNLTNDTAN-RKIEDQEVILQQQERIQDLVEENQSFLSSIVTLKEELAETKHQFEELLKFARM-------
        MALIS+C MNDEE    N QTHD  ES   K LT+   + +K EDQE+ILQQQERIQDLVEENQSFLSSIVTLK EL ETKHQFEELLKFARM       
Subjt:  MALISVCTMNDEE----NVQTHDQLES---KNLTNDTAN-RKIEDQEVILQQQERIQDLVEENQSFLSSIVTLKEELAETKHQFEELLKFARM-------

Query:  ----------------------DTPVRKTVFIREGTLQNSPTNKEQGKGTEITSMPVKSPNKRTQNCKVALTSVKSPNSSDWYFDSGCSRHMTGNADFFS
                              D PVR T+FIREG       + +   G +I+           ++CKVA+TSVKSPNS DWYFDSGCSRHMTGNADFFS
Subjt:  ----------------------DTPVRKTVFIREGTLQNSPTNKEQGKGTEITSMPVKSPNKRTQNCKVALTSVKSPNSSDWYFDSGCSRHMTGNADFFS

Query:  ELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLDVRLIQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCYHWDAEVTLCNLS
        ELSECKVGSVVFGDGGKGKIIGKGTINH GLPFLLDV+L+QGL+ANL+SISQLCDQGYQVS +KDR NVLD QNKVF S TR+SDNCYHWDAEV LCNLS
Subjt:  ELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLDVRLIQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCYHWDAEVTLCNLS

Query:  KVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVHKPVNISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRY
        KV+EA LWHKRLGHL G TI KVTK DAIIGLPP SF SL+SC EC AGKQVKSVHKP                                          
Subjt:  KVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVHKPVNISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRY

Query:  TWIKFILDKSETFKTCQTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPLTLQQNGVVERRNRTLQEMARAEALNTACHIHNRVIL
                      TCQTLFTQLQREKNT IG+I+TDHG EFEN++F EFCDNE    E S  LT +           + +M+                 
Subjt:  TWIKFILDKSETFKTCQTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPLTLQQNGVVERRNRTLQEMARAEALNTACHIHNRVIL

Query:  RPGTTTTSYELWKGRK-PNVKDHRRKWDSKSDRGIFLGYLANSRAYRVYNQCSKIVMESINVIIDDLGRNLTEILMMKLRFFGILFLINQMKEKLLQFER
         P T+    E+ +G    +   H  +W   S          +S  +++              II D+   +  I   K R      + N   E+LLQFER
Subjt:  RPGTTTTSYELWKGRK-PNVKDHRRKWDSKSDRGIFLGYLANSRAYRVYNQCSKIVMESINVIIDDLGRNLTEILMMKLRFFGILFLINQMKEKLLQFER

Query:  NQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYVA
        NQVWELVPKPP+ANIIGTKWIFK KT E+GRVIRN+ARLVAQGYSQIEGLD  ETFA VARLEAIRLLLSYA F RFKLF MDVKSAFLNGYL EEVYVA
Subjt:  NQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYVA

Query:  QPKGFVDPVHEDHVYKLRKALYRLKQAPRAC
        +PKGFVD VH DHVYKL+KALY LKQA RAC
Subjt:  QPKGFVDPVHEDHVYKLRKALYRLKQAPRAC

A0A5A7V046 Gag-pol polyprotein0.0e+0077.03Show/hide
Query:  MALISVCTMNDEENVQTHDQLESKNLTNDTANRKIEDQEVILQQQERIQDLVEENQSFLSSIVTLKEELAETKHQFEELLKFARM---------------
        MALISVCTMNDEENVQTHDQLESKNLTNDTANRKIEDQEVILQQQERIQDLVEENQSFLSSIVTLKEELA+TKHQFEELLKFARM               
Subjt:  MALISVCTMNDEENVQTHDQLESKNLTNDTANRKIEDQEVILQQQERIQDLVEENQSFLSSIVTLKEELAETKHQFEELLKFARM---------------

Query:  --------------DTPVRKTVFIREGTLQNSPTNKEQGKGTEITSMPVK---SPNK------RTQNCKVALTSVKSPNSSDWYFDSGCSRHMTGNADFF
                      DTPVRKTVFIREGTLQNSPTN EQGKGTEITSMP K   SP          +NCKVALTSVKSPNSSDWYFDSGCSRHMTGNADFF
Subjt:  --------------DTPVRKTVFIREGTLQNSPTNKEQGKGTEITSMPVK---SPNK------RTQNCKVALTSVKSPNSSDWYFDSGCSRHMTGNADFF

Query:  SELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLDVRLIQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCYHWDAEVTLCNL
        SELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLDVRLIQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCYHWDAEVTLCNL
Subjt:  SELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLDVRLIQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCYHWDAEVTLCNL

Query:  SKVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVHKPVNISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSR
        SKVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPL+FLSLESCSECTAGKQVKSVHKPVNISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSR
Subjt:  SKVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVHKPVNISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSR

Query:  YTWIKFILDKSETFKTCQTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPLTLQQNGVVERRNRTLQEMARAEALNTACHIHNRVI
        YTWIKFILDK ETFKTCQTLFTQLQREKNT IGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPLTLQQNGV             AEALNTACHIHNRVI
Subjt:  YTWIKFILDKSETFKTCQTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPLTLQQNGVVERRNRTLQEMARAEALNTACHIHNRVI

Query:  LRPGTTTTSYELWKGRKPNVK---------------DHRRKWDSKSDRGIFLGYLANSRAYRVYNQCSKIVMESINVIIDDLGR---------NLTEILM
        LRPGTTTTSYELWKGRKPNVK               DHRRKWDSKSDRGIFLGYLANSRAYRVYNQCSKIVMESINVIIDDL           N T  L 
Subjt:  LRPGTTTTSYELWKGRKPNVK---------------DHRRKWDSKSDRGIFLGYLANSRAYRVYNQCSKIVMESINVIIDDLGR---------NLTEILM

Query:  MKLRFFGI------------------------------------------------------LFLINQ--------------------------------
          L    I                                                       F+I                                  
Subjt:  MKLRFFGI------------------------------------------------------LFLINQ--------------------------------

Query:  -------------MKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYACFWRF
                     ++E+LLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYACFWRF
Subjt:  -------------MKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYACFWRF

Query:  KLFQMDVKSAFLNGYLCEEVYVAQPKGFVDPVHEDHVYKLRKALYRLKQAPRACYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIIQIYVDGIIFGDT
        KLFQMDVKSAFLNGYLCEEVYVAQPKGFVDPVHEDHVYKLRKALYRLKQAPRA YERLSTYLLQQGYQRGSADQTMFIYRQGT+FLI+QIYVDGIIFGDT
Subjt:  KLFQMDVKSAFLNGYLCEEVYVAQPKGFVDPVHEDHVYKLRKALYRLKQAPRACYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIIQIYVDGIIFGDT

Query:  S
        S
Subjt:  S

A0A5D3C9Q6 Gag-pol polyprotein4.9e-23364.66Show/hide
Query:  LKFARMDTPVRKTVFIREGTLQNSPTNKEQGKGTEITSMPVKSPNKRTQNCKVALTSVKSPNSSDWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGG
        L+F   DTPVRKTVFIREGTLQNSPTN EQGK                +NCKVALTSVKSPNS DWYFDSGCSRHMTGNADFFSELSECK GSVVF DGG
Subjt:  LKFARMDTPVRKTVFIREGTLQNSPTNKEQGKGTEITSMPVKSPNKRTQNCKVALTSVKSPNSSDWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGG

Query:  KGKIIGKGTINHSGLPFLLDVRLIQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCYHWDAEVTLCNLSKVEEARLWHKRLGHLS
        KGKIIGKGTIN  GLPFLLDVRL+QGL+ANLIS SQLCDQGY+V+F+KDRCNVLD QNKVFLSGTRLSDNCYHWDAEVTLCNLSKVEEA LWHKRLGHL 
Subjt:  KGKIIGKGTINHSGLPFLLDVRLIQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCYHWDAEVTLCNLSKVEEARLWHKRLGHLS

Query:  GATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVHKPVNISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIKFILDKSETFKTC
        GATISKV K +AIIGLPPLSF SLESCSEC AGKQVKSVHKPVNIS TSHILELLHIDLM PMQTESLGRK YAVVCVDDFSRYTWIKFIL+K ETFKTC
Subjt:  GATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVHKPVNISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIKFILDKSETFKTC

Query:  QTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPLTLQQNGVVERRNRTLQEMAR-------------AEALNTACHIHNRVILRPG
        QTL TQLQREKNT IG+I+T+HG EFEN+HFAEFCDNEGIFHEFSA LT Q+NGVVE+RN+TLQEMAR             AEALNTACHIHNRVILRP 
Subjt:  QTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPLTLQQNGVVERRNRTLQEMAR-------------AEALNTACHIHNRVILRPG

Query:  TTTTSYELWKGRKPNVK---------------DHRRKWDSKSDRGIFLGYLANSRAYRVYNQCSKIVMESINVIIDDLG----RNL--------------
        TTTTSYELWKGRKPNVK               DHRRKWDSKSDRGIFLGY AN+RAYRVYNQ +KIV+ESINVIIDDLG    RNL              
Subjt:  TTTTSYELWKGRKPNVK---------------DHRRKWDSKSDRGIFLGYLANSRAYRVYNQCSKIVMESINVIIDDLG----RNL--------------

Query:  ------------TEILMMKLRF-------------------------------------------------------------------FGIL-------
                     E   +   F                                                                    GI+       
Subjt:  ------------TEILMMKLRF-------------------------------------------------------------------FGIL-------

Query:  ---------------------------FLINQMKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFA
                                    LI  M+E+LLQFERNQVWELVPK PYANIIGTKWIFKNKTDEEGRVI NKARLVAQGYSQIEG +    F+
Subjt:  ---------------------------FLINQMKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFA

Q84VH8 Gag-pol polyprotein2.9e-19351.04Show/hide
Query:  VALTSVKSPNSSDWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLDVRLIQGLAANLISISQLCDQGYQVSFNKDRCN
        V  TS+++    DWY DSGCSRHMTG  +F   +  C    V FGDG KGKIIG G + H GLP L  V L++GL ANLISISQLCD+G+ V+F K  C 
Subjt:  VALTSVKSPNSSDWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLDVRLIQGLAANLISISQLCDQGYQVSFNKDRCN

Query:  VLDVQNKVFLSGTRLSDNCYHWDAEVT----LCNLSKVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVHKPVNISST
        V + +++V + G+R  DNCY W  + T     C  SK +E R+WH+R GHL    + K+    A+ G+P L       C EC  GKQVK  H+ +   +T
Subjt:  VLDVQNKVFLSGTRLSDNCYHWDAEVT----LCNLSKVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVHKPVNISST

Query:  SHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIKFILDKSETFKTCQTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPL
        S +LELLH+DLMGPMQ ESLG K YA V VDDFSR+TW+ FI +KSETF+  + L  +LQREK+  I +I++DHG EFEN  F EFC +EGI HEFSA +
Subjt:  SHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIKFILDKSETFKTCQTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPL

Query:  TLQQNGVVERRNRTLQEMAR-------------AEALNTACHIHNRVILRPGTTTTSYELWKGRKPNVK---------------DHRRKWDSKSDRGIFL
        T QQNG+VER+NRTLQE AR             AEA+NTAC+IHNRV LR GT TT YE+WKGRKP+VK               + RRK D KSD GIFL
Subjt:  TLQQNGVVERRNRTLQEMAR-------------AEALNTACHIHNRVILRPGTTTTSYELWKGRKPNVK---------------DHRRKWDSKSDRGIFL

Query:  GYLANSRAYRVYNQCSKIVMESINVIIDDLG---------------------------------------------RNLTEILMM---------------
        GY  NSRAYRV+N  ++ VMESINV++DDL                                              R+ T I  M               
Subjt:  GYLANSRAYRVYNQCSKIVMESINVIIDDLG---------------------------------------------RNLTEILMM---------------

Query:  -KLRFFGIL---------------------FLINQMKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGET
         + R   I+                     F IN M+E+L QF+RN+VWELVP+P   N+IGTKWIFKNKT+EEG + RNKARLVAQGY+QIEG+DF ET
Subjt:  -KLRFFGIL---------------------FLINQMKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGET

Query:  FAPVARLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYVAQPKGFVDPVHEDHVYKLRKALYRLKQAPRACYERLSTYLLQQGYQRGSADQTMFI
        FAPVARLE+IRLLL  AC  +FKL+QMDVKSAFLNGYL EEVYV QPKGF DP H DHVY+L+KALY LKQAPRA YERL+ +L QQGY++G  D+T+F+
Subjt:  FAPVARLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYVAQPKGFVDPVHEDHVYKLRKALYRLKQAPRACYERLSTYLLQQGYQRGSADQTMFI

Query:  YRQGTDFLIIQIYVDGIIFGDTS
         +   + +I QIYVD I+FG  S
Subjt:  YRQGTDFLIIQIYVDGIIFGDTS

Q84VI4 Gag-pol polyprotein3.8e-19351.04Show/hide
Query:  VALTSVKSPNSSDWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLDVRLIQGLAANLISISQLCDQGYQVSFNKDRCN
        V  TS+++    DWY DSGCSRHMTG  +F   +  C    V FGDG KGKIIG G + H GLP L  V L++GL ANLISISQLCD+G+ V+F K  C 
Subjt:  VALTSVKSPNSSDWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLDVRLIQGLAANLISISQLCDQGYQVSFNKDRCN

Query:  VLDVQNKVFLSGTRLSDNCYHWDAEVT----LCNLSKVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVHKPVNISST
        V + +++V + G+R  DNCY W  + T     C  SK +E R+WH+R GHL    + K+    A+ G+P L       C EC  GKQVK  H+ +   +T
Subjt:  VLDVQNKVFLSGTRLSDNCYHWDAEVT----LCNLSKVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVHKPVNISST

Query:  SHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIKFILDKSETFKTCQTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPL
        S +LELLH+DLMGPMQ ESLG K YA V VDDFSR+TW+KFI +KSETF+  + L  +LQREK+  I +I++DHG EFEN    EFC +EGI HEFSA +
Subjt:  SHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIKFILDKSETFKTCQTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPL

Query:  TLQQNGVVERRNRTLQEMAR-------------AEALNTACHIHNRVILRPGTTTTSYELWKGRKPNVK---------------DHRRKWDSKSDRGIFL
        T QQNG+VER+NRTLQE AR             AEA+NTAC+IHNRV LR GT TT YE+WKGRKP+VK               + RRK D KSD GIFL
Subjt:  TLQQNGVVERRNRTLQEMAR-------------AEALNTACHIHNRVILRPGTTTTSYELWKGRKPNVK---------------DHRRKWDSKSDRGIFL

Query:  GYLANSRAYRVYNQCSKIVMESINVIIDDLG---------------------------------------------RNLTEILMM---------------
        GY  NSRAYRV+N  ++ VMESINV++DDL                                              R+ T I  M               
Subjt:  GYLANSRAYRVYNQCSKIVMESINVIIDDLG---------------------------------------------RNLTEILMM---------------

Query:  -KLRFFGIL---------------------FLINQMKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGET
         + R   I+                     F IN M+E+L QF+RN+VWELVP+P   N+IGTKWIFKNKT+EEG + RNKARLVAQGY+QIEG+DF ET
Subjt:  -KLRFFGIL---------------------FLINQMKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGET

Query:  FAPVARLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYVAQPKGFVDPVHEDHVYKLRKALYRLKQAPRACYERLSTYLLQQGYQRGSADQTMFI
        FAPVARLE+IRLLL  AC  +FKL+QMDVKSAFLNGYL EEVYV QPKGF DP H DHVY+L+KALY LKQAPRA YERL+ +L QQGY++G  D+T+F+
Subjt:  FAPVARLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYVAQPKGFVDPVHEDHVYKLRKALYRLKQAPRACYERLSTYLLQQGYQRGSADQTMFI

Query:  YRQGTDFLIIQIYVDGIIFGDTS
         +   + +I QIYVD I+FG  S
Subjt:  YRQGTDFLIIQIYVDGIIFGDTS

SwissProt top hitse value%identityAlignment
P04146 Copia protein6.8e-3040Show/hide
Query:  NQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYVA
        N  W +  +P   NI+ ++W+F  K +E G  IR KARLVA+G++Q   +D+ ETFAPVAR+ + R +LS    +  K+ QMDVK+AFLNG L EE+Y+ 
Subjt:  NQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYVA

Query:  QPKGFVDPVHEDHVYKLRKALYRLKQAPRACYERLSTYLLQQGYQRGSADQTMFIYRQG--TDFLIIQIYVDGII
         P+G     + D+V KL KA+Y LKQA R  +E     L +  +   S D+ ++I  +G   + + + +YVD ++
Subjt:  QPKGFVDPVHEDHVYKLRKALYRLKQAPRACYERLSTYLLQQGYQRGSADQTMFIYRQG--TDFLIIQIYVDGII

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.4e-6426.65Show/hide
Query:  SDWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTI----NHSGLPFLLDVRLIQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNK
        S+W  D+  S H T   D F        G+V  G+    KI G G I    N      L DVR +  L  NLIS   L   GY+  F   +  +   +  
Subjt:  SDWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTI----NHSGLPFLLDVRLIQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNK

Query:  VFLSGTRLSDNCYHWDAEVTLCNLSKVEE---ARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVHKPVNISSTSHILELLH
        + ++        Y  +AE+    L+  ++     LWHKR+GH+S   +  + K   I         +++ C  C  GKQ + V    +     +IL+L++
Subjt:  VFLSGTRLSDNCYHWDAEVTLCNLSKVEE---ARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVHKPVNISSTSHILELLH

Query:  IDLMGPMQTESLGRKWYAVVCVDDFSRYTWIKFILDKSETFKTCQTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPLTLQQNGVV
         D+ GPM+ ES+G   Y V  +DD SR  W+  +  K + F+  Q     ++RE    + ++++D+G E+ ++ F E+C + GI HE + P T Q NGV 
Subjt:  IDLMGPMQTESLGRKWYAVVCVDDFSRYTWIKFILDKSETFKTCQTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPLTLQQNGVV

Query:  ERRNRTLQEMARA-------------EALNTACHIHNRVILRPGTTTTSYELWKGRKPNV---------------KDHRRKWDSKSDRGIFLGYLANSRA
        ER NRT+ E  R+             EA+ TAC++ NR    P        +W  ++ +                K+ R K D KS   IF+GY      
Subjt:  ERRNRTLQEMARA-------------EALNTACHIHNRVILRPGTTTTSYELWKGRKPNV---------------KDHRRKWDSKSDRGIFLGYLANSRA

Query:  YRVYNQCSKIVMESINVI-------------------------------------------IDDLGRNLTEIL---------------------------
        YR+++   K V+ S +V+                                           + + G    E++                           
Subjt:  YRVYNQCSKIVMESINVI-------------------------------------------IDDLGRNLTEIL---------------------------

Query:  ------MMKLRFFGILF-----------------------LINQMKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQ
              +   R+    +                       L+  M+E++   ++N  ++LV  P     +  KW+FK K D + +++R KARLV +G+ Q
Subjt:  ------MMKLRFFGILF-----------------------LINQMKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQ

Query:  IEGLDFGETFAPVARLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYVAQPKGFVDPVHEDHVYKLRKALYRLKQAPRACYERLSTYLLQQGYQR
         +G+DF E F+PV ++ +IR +LS A     ++ Q+DVK+AFL+G L EE+Y+ QP+GF     +  V KL K+LY LKQAPR  Y +  +++  Q Y +
Subjt:  IEGLDFGETFAPVARLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYVAQPKGFVDPVHEDHVYKLRKALYRLKQAPRACYERLSTYLLQQGYQR

Query:  GSADQTMFIYR-QGTDFLIIQIYVDGII
          +D  ++  R    +F+I+ +YVD ++
Subjt:  GSADQTMFIYR-QGTDFLIIQIYVDGII

P92520 Uncharacterized mitochondrial protein AtMg008208.0e-1540.68Show/hide
Query:  MKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYA--------CFWRFKL-FQ
        M+E+L    RN+ W LVP P   NI+G KW+FK K   +G + R KARLVA+G+ Q EG+ F ET++PV R   IR +L+ A          W FK+ F 
Subjt:  MKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYA--------CFWRFKL-FQ

Query:  MDVKSAFLNGYLCEEVYV
        M +   F   ++C  + V
Subjt:  MDVKSAFLNGYLCEEVYV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.0e-3845.4Show/hide
Query:  NQVWELVPKPP-YANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYV
        N  W+LVP PP +  I+G +WIF  K + +G + R KARLVA+GY+Q  GLD+ ETF+PV +  +IR++L  A    + + Q+DV +AFL G L ++VY+
Subjt:  NQVWELVPKPP-YANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYV

Query:  AQPKGFVDPVHEDHVYKLRKALYRLKQAPRACYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIIQIYVDGII
        +QP GF+D    ++V KLRKALY LKQAPRA Y  L  YLL  G+    +D ++F+ ++G   + + +YVD I+
Subjt:  AQPKGFVDPVHEDHVYKLRKALYRLKQAPRACYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIIQIYVDGII

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.7e-4123.91Show/hide
Query:  SVKSP-NSSDWYFDSGCSRHMTGNADFFSELSECKVG-SVVFGDGGKGKIIGKGTINHSGLPFLLD---VRLIQGLAANLISISQLCDQG-YQVSFNKDR
        +V SP N+++W  DSG + H+T + +  S       G  V+  DG    I   G+ +       LD   V  +  +  NLIS+ +LC+     V F    
Subjt:  SVKSP-NSSDWYFDSGCSRHMTGNADFFSELSECKVG-SVVFGDGGKGKIIGKGTINHSGLPFLLD---VRLIQGLAANLISISQLCDQG-YQVSFNKDR

Query:  CNVLDVQNKVFLSGTRLSDNCYHW---DAEVTLCNLSKVEEA--RLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVHKPVNI
          V D+   V L   +  D  Y W    ++      S   +A    WH RLGH S A ++ V    ++  L P     L SCS+C   K  K       I
Subjt:  CNVLDVQNKVFLSGTRLSDNCYHW---DAEVTLCNLSKVEEA--RLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVHKPVNI

Query:  SSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIKFILDKSETFKTCQTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFS
        +S S  LE ++ D+       S+    Y V+ VD F+RYTW+  +  KS+   T     + ++    T IG + +D+G EF      ++    GI H  S
Subjt:  SSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIKFILDKSETFKTCQTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFS

Query:  APLTLQQNGVVERRNRTLQEMARA-------------EALNTACHIHNRVILRPGTTTTSYELWKGRKPNVKD---------------HRRKWDSKSDRG
         P T + NG+ ER++R + EM                 A + A ++ NR+        + ++   G+ PN +                +R K + KS + 
Subjt:  APLTLQQNGVVERRNRTLQEMARA-------------EALNTACHIHNRVILRPGTTTTSYELWKGRKPNVKD---------------HRRKWDSKSDRG

Query:  IFLGYLANSRAYRVYNQCSKIVMESINVIIDD--------------------------------------------LGRNL---------------TEIL
         F+GY     AY   +  +  +  S +V  D+                                            LG +L               T++ 
Subjt:  IFLGYLANSRAYRVYNQCSKIVMESINVIIDD--------------------------------------------LGRNL---------------TEIL

Query:  MMKLRFFGI-------------------------------------------------------------------------------------------
           L    I                                                                                           
Subjt:  MMKLRFFGI-------------------------------------------------------------------------------------------

Query:  ----LFLIN---------------------------------------------------QMKEKLLQFERNQVWELV-PKPPYANIIGTKWIFKNKTDE
            +  +N                                                    M  ++     N  W+LV P PP   I+G +WIF  K + 
Subjt:  ----LFLIN---------------------------------------------------QMKEKLLQFERNQVWELV-PKPPYANIIGTKWIFKNKTDE

Query:  EGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYVAQPKGFVDPVHEDHVYKLRKALYRLKQAP
        +G + R KARLVA+GY+Q  GLD+ ETF+PV +  +IR++L  A    + + Q+DV +AFL G L +EVY++QP GFVD    D+V +LRKA+Y LKQAP
Subjt:  EGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYVAQPKGFVDPVHEDHVYKLRKALYRLKQAP

Query:  RACYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIIQIYVDGII
        RA Y  L TYLL  G+    +D ++F+ ++G   + + +YVD I+
Subjt:  RACYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIIQIYVDGII

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.7e-3439.57Show/hide
Query:  MKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYACFWRFKLFQMDVKSAFLN
        M +++   E    WE+   PP    IG KW++K K + +G + R KARLVA+GY+Q EG+DF ETF+PV +L +++L+L+ +  + F L Q+D+ +AFLN
Subjt:  MKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYACFWRFKLFQMDVKSAFLN

Query:  GYLCEEVYVAQPKGFV----DPVHEDHVYKLRKALYRLKQAPRACYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIIQIYVDGII
        G L EE+Y+  P G+     D +  + V  L+K++Y LKQA R  + + S  L+  G+ +  +D T F+    T FL + +YVD II
Subjt:  GYLCEEVYVAQPKGFV----DPVHEDHVYKLRKALYRLKQAPRACYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIIQIYVDGII

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)5.7e-1640.68Show/hide
Query:  MKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYA--------CFWRFKL-FQ
        M+E+L    RN+ W LVP P   NI+G KW+FK K   +G + R KARLVA+G+ Q EG+ F ET++PV R   IR +L+ A          W FK+ F 
Subjt:  MKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYA--------CFWRFKL-FQ

Query:  MDVKSAFLNGYLCEEVYV
        M +   F   ++C  + V
Subjt:  MDVKSAFLNGYLCEEVYV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCCTGATTAGTGTTTGCACCATGAATGACGAAGAAAATGTTCAAACACATGACCAGCTGGAATCAAAGAACTTAACTAACGACACAGCAAACAGAAAGATA
GAAGATCAAGAAGTTATCTTGCAACAACAAGAACGAATTCAAGATCTAGTGGAAGAAAACCAAAGCTTTCTATCCTCTATAGTAACTCTAAAAGAAGAACTAGCA
GAAACCAAGCATCAATTCGAAGAGCTCCTAAAATTTGCAAGGATGGACACACCTGTCAGAAAAACTGTCTTTATCCGAGAAGGTACCCTTCAGAACAGCCCTACA
AATAAAGAACAGGGAAAGGGTACTGAGATTACTAGCATGCCTGTGAAATCCCCAAACAAGCGAACACAAAACTGCAAGGTGGCTCTCACCTCTGTCAAAAGCCCC
AACTCTAGTGACTGGTACTTTGACAGTGGGTGTTCCAGACACATGACAGGTAATGCAGATTTCTTTTCTGAACTGAGTGAATGCAAAGTCGGATCAGTAGTGTTT
GGAGATGGAGGAAAAGGAAAAATAATTGGCAAAGGAACGATTAACCATTCAGGTCTACCGTTTCTTCTTGATGTTCGACTAATACAAGGACTGGCTGCAAATCTC
ATAAGCATCAGCCAATTATGTGACCAAGGCTATCAAGTCAGTTTCAATAAAGATAGATGTAATGTGTTAGATGTTCAAAATAAAGTATTTCTCAGCGGAACAAGG
CTGTCAGACAACTGCTATCACTGGGATGCAGAGGTAACCTTATGCAATCTATCAAAAGTGGAAGAAGCTAGACTCTGGCACAAACGACTTGGACACCTTAGTGGC
GCTACTATCTCCAAGGTCACCAAAGTTGATGCCATTATCGGTCTTCCCCCACTATCATTTTTGTCACTAGAAAGCTGTTCGGAGTGCACAGCTGGCAAGCAAGTC
AAGTCTGTACACAAGCCTGTAAATATCTCCTCGACGTCCCATATTCTGGAACTTCTTCATATAGACCTAATGGGGCCCATGCAAACAGAAAGCTTGGGTAGAAAA
TGGTATGCAGTAGTGTGTGTAGATGATTTCTCTCGCTACACCTGGATAAAATTTATCCTTGACAAATCGGAAACCTTTAAGACATGTCAGACCCTGTTCACTCAA
CTCCAAAGAGAGAAAAATACTAGCATTGGCCAAATACAAACTGATCATGGGCATGAATTTGAGAATCAGCACTTTGCTGAGTTCTGTGATAATGAAGGCATCTTT
CATGAGTTCTCTGCCCCATTAACACTACAGCAAAATGGAGTTGTAGAGAGAAGGAATCGAACCTTACAGGAGATGGCCCGAGCTGAGGCTCTAAACACTGCATGC
CATATACATAACAGAGTTATTCTCCGTCCAGGGACCACTACTACCTCGTATGAGCTGTGGAAAGGAAGAAAACCAAATGTGAAGGATCATCGCAGAAAGTGGGAC
TCAAAGTCAGATCGTGGAATATTTCTGGGATATTTAGCTAACAGCCGAGCCTACAGGGTCTACAACCAATGTTCCAAAATAGTAATGGAATCCATTAACGTGATT
ATTGATGACCTTGGTAGGAACCTAACAGAAATCTTGATGATGAAGTTGAGGTTTTTTGGAATTCTCTTTCTCATAAACCAGATGAAGGAGAAGCTACTGCAGTTT
GAAAGAAACCAAGTATGGGAATTAGTGCCAAAGCCACCTTATGCTAACATAATTGGTACCAAATGGATCTTTAAGAACAAAACGGATGAAGAAGGTAGAGTTATC
CGTAATAAAGCTAGACTGGTTGCTCAAGGGTATTCTCAAATAGAAGGGCTGGATTTTGGAGAAACATTTGCCCCAGTTGCCAGATTAGAAGCCATCCGACTACTG
CTAAGCTACGCATGTTTTTGGAGGTTCAAACTGTTCCAAATGGATGTAAAGAGTGCGTTCCTAAATGGGTACTTATGTGAGGAAGTGTATGTGGCCCAGCCAAAA
GGATTTGTTGATCCAGTGCATGAGGATCATGTTTACAAACTTCGAAAGGCACTCTATAGACTTAAACAAGCTCCTAGAGCTTGTTATGAGAGACTCTCCACTTAC
CTGTTACAACAAGGATATCAAAGGGGCAGTGCGGATCAAACTATGTTTATATATCGTCAAGGCACTGACTTTCTGATCATTCAGATCTATGTTGATGGAATTATA
TTTGGTGATACGTCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCCTGATTAGTGTTTGCACCATGAATGACGAAGAAAATGTTCAAACACATGACCAGCTGGAATCAAAGAACTTAACTAACGACACAGCAAACAGAAAGATA
GAAGATCAAGAAGTTATCTTGCAACAACAAGAACGAATTCAAGATCTAGTGGAAGAAAACCAAAGCTTTCTATCCTCTATAGTAACTCTAAAAGAAGAACTAGCA
GAAACCAAGCATCAATTCGAAGAGCTCCTAAAATTTGCAAGGATGGACACACCTGTCAGAAAAACTGTCTTTATCCGAGAAGGTACCCTTCAGAACAGCCCTACA
AATAAAGAACAGGGAAAGGGTACTGAGATTACTAGCATGCCTGTGAAATCCCCAAACAAGCGAACACAAAACTGCAAGGTGGCTCTCACCTCTGTCAAAAGCCCC
AACTCTAGTGACTGGTACTTTGACAGTGGGTGTTCCAGACACATGACAGGTAATGCAGATTTCTTTTCTGAACTGAGTGAATGCAAAGTCGGATCAGTAGTGTTT
GGAGATGGAGGAAAAGGAAAAATAATTGGCAAAGGAACGATTAACCATTCAGGTCTACCGTTTCTTCTTGATGTTCGACTAATACAAGGACTGGCTGCAAATCTC
ATAAGCATCAGCCAATTATGTGACCAAGGCTATCAAGTCAGTTTCAATAAAGATAGATGTAATGTGTTAGATGTTCAAAATAAAGTATTTCTCAGCGGAACAAGG
CTGTCAGACAACTGCTATCACTGGGATGCAGAGGTAACCTTATGCAATCTATCAAAAGTGGAAGAAGCTAGACTCTGGCACAAACGACTTGGACACCTTAGTGGC
GCTACTATCTCCAAGGTCACCAAAGTTGATGCCATTATCGGTCTTCCCCCACTATCATTTTTGTCACTAGAAAGCTGTTCGGAGTGCACAGCTGGCAAGCAAGTC
AAGTCTGTACACAAGCCTGTAAATATCTCCTCGACGTCCCATATTCTGGAACTTCTTCATATAGACCTAATGGGGCCCATGCAAACAGAAAGCTTGGGTAGAAAA
TGGTATGCAGTAGTGTGTGTAGATGATTTCTCTCGCTACACCTGGATAAAATTTATCCTTGACAAATCGGAAACCTTTAAGACATGTCAGACCCTGTTCACTCAA
CTCCAAAGAGAGAAAAATACTAGCATTGGCCAAATACAAACTGATCATGGGCATGAATTTGAGAATCAGCACTTTGCTGAGTTCTGTGATAATGAAGGCATCTTT
CATGAGTTCTCTGCCCCATTAACACTACAGCAAAATGGAGTTGTAGAGAGAAGGAATCGAACCTTACAGGAGATGGCCCGAGCTGAGGCTCTAAACACTGCATGC
CATATACATAACAGAGTTATTCTCCGTCCAGGGACCACTACTACCTCGTATGAGCTGTGGAAAGGAAGAAAACCAAATGTGAAGGATCATCGCAGAAAGTGGGAC
TCAAAGTCAGATCGTGGAATATTTCTGGGATATTTAGCTAACAGCCGAGCCTACAGGGTCTACAACCAATGTTCCAAAATAGTAATGGAATCCATTAACGTGATT
ATTGATGACCTTGGTAGGAACCTAACAGAAATCTTGATGATGAAGTTGAGGTTTTTTGGAATTCTCTTTCTCATAAACCAGATGAAGGAGAAGCTACTGCAGTTT
GAAAGAAACCAAGTATGGGAATTAGTGCCAAAGCCACCTTATGCTAACATAATTGGTACCAAATGGATCTTTAAGAACAAAACGGATGAAGAAGGTAGAGTTATC
CGTAATAAAGCTAGACTGGTTGCTCAAGGGTATTCTCAAATAGAAGGGCTGGATTTTGGAGAAACATTTGCCCCAGTTGCCAGATTAGAAGCCATCCGACTACTG
CTAAGCTACGCATGTTTTTGGAGGTTCAAACTGTTCCAAATGGATGTAAAGAGTGCGTTCCTAAATGGGTACTTATGTGAGGAAGTGTATGTGGCCCAGCCAAAA
GGATTTGTTGATCCAGTGCATGAGGATCATGTTTACAAACTTCGAAAGGCACTCTATAGACTTAAACAAGCTCCTAGAGCTTGTTATGAGAGACTCTCCACTTAC
CTGTTACAACAAGGATATCAAAGGGGCAGTGCGGATCAAACTATGTTTATATATCGTCAAGGCACTGACTTTCTGATCATTCAGATCTATGTTGATGGAATTATA
TTTGGTGATACGTCCTAA
Protein sequenceShow/hide protein sequence
MALISVCTMNDEENVQTHDQLESKNLTNDTANRKIEDQEVILQQQERIQDLVEENQSFLSSIVTLKEELAETKHQFEELLKFARMDTPVRKTVFIREGTLQNSPT
NKEQGKGTEITSMPVKSPNKRTQNCKVALTSVKSPNSSDWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLDVRLIQGLAANL
ISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCYHWDAEVTLCNLSKVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQV
KSVHKPVNISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIKFILDKSETFKTCQTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIF
HEFSAPLTLQQNGVVERRNRTLQEMARAEALNTACHIHNRVILRPGTTTTSYELWKGRKPNVKDHRRKWDSKSDRGIFLGYLANSRAYRVYNQCSKIVMESINVI
IDDLGRNLTEILMMKLRFFGILFLINQMKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLL
LSYACFWRFKLFQMDVKSAFLNGYLCEEVYVAQPKGFVDPVHEDHVYKLRKALYRLKQAPRACYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIIQIYVDGII
FGDTS