; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0009341 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0009341
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionGag-pol polyprotein
Genome locationchr03:3134077..3138415
RNA-Seq ExpressionPay0009341
SyntenyPay0009341
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAO73527.1 gag-pol polyprotein [Glycine max]5.4e-27247.79Show/hide
Query:  GNQRNNREDQSLRHRQKEDRRSRSYPATTRTNSRSSGREPKLPILYSLNSSDWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRP
        G+  +  +  + R +     + ++      T+ R+S +E            DWY DS CSRHMTG  +F  ++  C    + FGDG KGKIIG G +   
Subjt:  GNQRNNREDQSLRHRQKEDRRSRSYPATTRTNSRSSGREPKLPILYSLNSSDWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRP

Query:  GLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRC----NDAEVTL----------------------CNLLKVEEVGLWHKRLRHLSGSTISKVTK
        GLP L  V LV+GL ANLISI+QL D+G+ V+F    C      +EV +                      C   K +EV +WH+R  HL    + K+  
Subjt:  GLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRC----NDAEVTL----------------------CNLLKVEEVGLWHKRLRHLSGSTISKVTK

Query:  VDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQR
          A+ G+  L       C EC  GKQVK  H+ +   +TS +LELLH+DLMGPMQ ESLG KRYA V VDDF R+TW+ FI +KSETF+  + L  +LQR
Subjt:  VDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQR

Query:  EKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITSYELW
        EK+  + RIR+DHG EFEN  F EFC +EGI HEFSA +TPQQNG+VER+NRTL+E AR             AE +NTA +IHNRV L  GT  T YE+W
Subjt:  EKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITSYELW

Query:  KGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELEST
        KGRKP+VK+FHIFGS C+I++DR+ RRK D KSD GIFLGYS NSRAYRV+N R++TVMESINV++DDL   P R  D E++V                 
Subjt:  KGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELEST

Query:  APTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANV
                    LG + +D +    +A  +D+   E++I+                + D    T I K HP   IIGD + G+ TR +E     ++V+N 
Subjt:  APTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANV

Query:  CYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRL
        C+ S +EP  V  AL+D+ WI AMQEEL QF+RN+VWELVP+P   N+IGTKWIFKNKT+EEG + RNKARLVAQGY+QIEG+DF ETFAPVARLE+IRL
Subjt:  CYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRL

Query:  LLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLI---
        LL   C  +FKL+QMDVKSAFLNGYL EEVYV QPKGF +  H DHVY+L+KALYGLKQAPRAWYERL+ +L QQG +KG  D+T+F+ +   + +I   
Subjt:  LLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLI---

Query:  -----------------FVAQIKGEFEMNMVGELTFFLGTLYISLE---------WIRSDLKEL-------------THLKMTKDTNGKKVDTNLYRSII
                         FV Q++ EFEM++VGELT+FLG     +E         + ++ +K+              THLK++KD  G  VD +LYRS+I
Subjt:  -----------------FVAQIKGEFEMNMVGELTFFLGTLYISLE---------WIRSDLKEL-------------THLKMTKDTNGKKVDTNLYRSII

Query:  GSLLYLTASRPDIAFAGGVYARYQADPRTSHLHLAKRILKYISA----------------------------------------------AWFSKKQNSV
        GSLLYLTASRPDI +A GV ARYQA+P+ SHL   KRILKY++                                               +WFSKKQN V
Subjt:  GSLLYLTASRPDIAFAGGVYARYQADPRTSHLHLAKRILKYISA----------------------------------------------AWFSKKQNSV

Query:  SLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQSSMILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVAT
        SLSTAEAEYI AGSSCSQL+WMKQML EY + Q  M LYCDN+SAI+ISKN VQHSR +HIDI+HH+IR+LV+  +I L+HV +  Q+ADIF K LD   
Subjt:  SLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQSSMILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVAT

Query:  FEGLRANVGVC
        FE LR  +G+C
Subjt:  FEGLRANVGVC

AAO73529.1 gag-pol polyprotein [Glycine max]7.6e-27448.67Show/hide
Query:  TNSRSSGRE----PK-----LPILYSLNSS---DWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANL
        T   SSGR+    PK     L +  SL +S   DWY DS CSRHMTG  +F  ++  C    + FGDG KGKI G G +   GLP L  V LV+GL  NL
Subjt:  TNSRSSGRE----PK-----LPILYSLNSS---DWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANL

Query:  ISITQLYDQGYQVSFNNDRC--------------------------NDAEVTLCNLLKVEEVGLWHKRLRHLSGSTISKVTKVDAIIGLSLLSFSSLERC
        ISI+QL D+G+ V+F    C                            +  + C   K +EV +WH+R  HL    + K+    A+ G+  L       C
Subjt:  ISITQLYDQGYQVSFNNDRC--------------------------NDAEVTLCNLLKVEEVGLWHKRLRHLSGSTISKVTKVDAIIGLSLLSFSSLERC

Query:  LECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFE
         EC  GKQVK  H+ +   +TS +LELLH+DLMGPMQ ESLG KRYA V VDDF R+TW+ FI +KS+TF+  + L  +LQREK+  + RIR+DHG EFE
Subjt:  LECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFE

Query:  NQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITSYELWKGRKPNVKYFHIFGSTCF
        N  F EFC +EGI HEFSA +TPQQNG+VER+NRTL+E AR             AE +NTA +IHNRV L  GT  T YE+WKGRKP VK+FHIFGS C+
Subjt:  NQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITSYELWKGRKPNVKYFHIFGSTCF

Query:  IMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRS
        I++DR+ RRK D KSD GIFLGYS NSRAYRV+N R++TVMESINV++DDL   P R  D E++V  +                           G + +
Subjt:  IMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRS

Query:  DMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDK
        D +  + +A  +D+   E +I+                + D  P   I K HP   IIGD + G+ TR +E     ++V+N C+ S +EP  V  AL+D+
Subjt:  DMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDK

Query:  HWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVK
         WI AMQEEL QF+RN+VWELVP+P   N+IGTKWIFKNKT+EEG + RNKARLVAQGY+QIEG+DF ETFAPVARLE+IRLLL   C  +FKL+QMDVK
Subjt:  HWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVK

Query:  SAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLI--------------------F
        SAFLNGYL EE YV QPKGFV+  H DHVY+L+KALYGLKQAPRAWYERL+ +L QQG +KG  D+T+F+ +   + +I                    F
Subjt:  SAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLI--------------------F

Query:  VAQIKGEFEMNMVGELTFFLG--------TLYIS-LEWIRSDLKEL-------------THLKMTKDTNGKKVDTNLYRSIIGSLLYLTASRPDIAFAGG
        V Q++ EFEM++VGELT+FLG        ++++S  ++ ++ +K+              THLK++KD  G  VD +LYRS+IGSLLYLTASRPDI +A G
Subjt:  VAQIKGEFEMNMVGELTFFLG--------TLYIS-LEWIRSDLKEL-------------THLKMTKDTNGKKVDTNLYRSIIGSLLYLTASRPDIAFAGG

Query:  VYARYQADPRTSHLHLAKRILKYISA----------------------------------------------AWFSKKQNSVSLSTAEAEYIVAGSSCSQ
        V ARYQA+P+ SHL+  KRILKY++                                               +WFSKKQN VSLSTAEAEYI AGSSCSQ
Subjt:  VYARYQADPRTSHLHLAKRILKYISA----------------------------------------------AWFSKKQNSVSLSTAEAEYIVAGSSCSQ

Query:  LLWMKQMLDEYGITQSSMILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVC
        L+WMKQML EY + Q  M LYCDN+SAI+ISKN VQHSR +HIDI+HH+IRELV+  +I LEHV +  Q+ADIF K LD   FE LR  +G+C
Subjt:  LLWMKQMLDEYGITQSSMILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVC

KAA0059225.1 gag-pol polyprotein [Cucumis melo var. makuwa]0.0e+0081.65Show/hide
Query:  SLNSSDWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRCN-------
        S NSSDWYFDS CSRHMTGNADFFS+LSECK GS+VFGDGGKGKIIGKGTIN  GLPFLLDVRL+QGLAANLISI+QL DQGYQVSFN DRCN       
Subjt:  SLNSSDWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRCN-------

Query:  ---------------DAEVTLCNLLKVEEVGLWHKRLRHLSGSTISKVTKVDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDL
                       DAEVTLCNL KVEE  LWHKRL HLSG+TISKVTKVDAIIGL  L+F SLE C EC AGKQVKSVHKP+NISSTSHILELLHIDL
Subjt:  ---------------DAEVTLCNLLKVEEVGLWHKRLRHLSGSTISKVTKVDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDL

Query:  MGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERR
        MGPMQTESLGRK YAVVCVDDF RYTWIKFILDK ETFKTCQTLFTQLQREKNTG+G+I+TDHGHEFENQHFAEFCDNEGIFHEFSAPLT QQNGV    
Subjt:  MGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERR

Query:  NRTLEEMARAEGLNTAGHIHNRVILHSGTTITSYELWKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESIN
                 AE LNTA HIHNRVIL  GTT TSYELWKGRKPNVKYFHIFGSTCFI+SDRDHRRKWDSKSDRGIFLGY ANSRAYRVYNQ SK VMESIN
Subjt:  NRTLEEMARAEGLNTAGHIHNRVILHSGTTITSYELWKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESIN

Query:  VIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPP
        VIIDDL                        EGELES A TNETTYLPSHLG SR DMSTPSTSAI  +THESEA +SASQHT E+T GATDS KCDLIPP
Subjt:  VIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPP

Query:  THIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEG
        TH AKNHPSSFII DIHSGIITRKKERKDYAKMVANVCYTS LEPTTVSAALSD+HWIL +QEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEG
Subjt:  THIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEG

Query:  RVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRA
        RVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSY CF RFKLFQMDVKSAFLNGYL EEVYVAQPKGFV+ VH+DHVYKLRKALY LKQAPRA
Subjt:  RVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRA

Query:  WYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIFVAQIKG
        WYERLSTYLLQQG ++GSADQTMFIYRQG +FLI    + G
Subjt:  WYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIFVAQIKG

KAA0059225.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.1e-1150.44Show/hide
Query:  MKLDELFGSLRTFEIHLRHTASRRKLGLALTSVKEEPIEEHRVMQGNDALTESVVVLKKQVAKLKNQFHKYMGNQRNNREDQSLRHRQKEDRRSRSYPAT
        MKLDELFGSLR FEIHL HT SRRKLGLALTS                            VAKLKNQFHK+MG+QRNNREDQ+LR  +  D  S  +   
Subjt:  MKLDELFGSLRTFEIHLRHTASRRKLGLALTSVKEEPIEEHRVMQGNDALTESVVVLKKQVAKLKNQFHKYMGNQRNNREDQSLRHRQKEDRRSRSYPAT

Query:  TRTNSRSSGREPK
         R      G+E K
Subjt:  TRTNSRSSGREPK

KAA0059225.1 gag-pol polyprotein [Cucumis melo var. makuwa]0.0e+0064.07Show/hide
Query:  SLNSSDWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRCN-------
        S NS DWYFDS CSRHMTGNADFFS+LSECKAGS+VF DGGKGKIIGKGTINRPGLPFLLDVRLVQGL+ANLIS +QL DQGY+V+F+ DRCN       
Subjt:  SLNSSDWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRCN-------

Query:  ---------------DAEVTLCNLLKVEEVGLWHKRLRHLSGSTISKVTKVDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDL
                       DAEVTLCNL KVEE GLWHKRL HL G+TISKV K +AIIGL  LSFSSLE C ECPAGKQVKSVHKP+NIS TSHILELLHIDL
Subjt:  ---------------DAEVTLCNLLKVEEVGLWHKRLRHLSGSTISKVTKVDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDL

Query:  MGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERR
        M PMQTESLGRKRYAVVCVDDF RYTWIKFIL+K ETFKTCQTL TQLQREKNTG+GRIRT+HG EFEN+HFAEFCDNEGIFHEFSA LTPQ+NGVVE+R
Subjt:  MGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERR

Query:  NRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITSYELWKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRV
        N+TL+EMAR             AE LNTA HIHNRVIL   TT TSYELWKGRKPNVKYFHIFG TCFI+SDRDHRRKWDSKSDRGIFLGYSAN+RAYRV
Subjt:  NRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITSYELWKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRV

Query:  YNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTT
        YNQR+K V+ESINVIIDDLGKEPNRNLDDEDEVFWNSLS K  EGE EST PTNETTYLPSH  S++ DMSTPSTS   ++T+ESEA++SASQHT ERT 
Subjt:  YNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTT

Query:  GATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIG
        GATDS K DLIPP HIAKNHP SFIIGD+HSGIIT+KKERKDYAKMVANVCYT SLEPTT+SA L+D+H ILAMQEELLQFERNQVWELVPK PYANIIG
Subjt:  GATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIG

Query:  TKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKL
        TKWIFKNKTDEEGRVI NKARLVAQGYSQIEG                                                         N +        
Subjt:  TKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKL

Query:  RKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIFVAQIKGEFEMNMVGELTFFLGTLYISLEWIRSDLKELTHLKMTKDTNGKKVD
                                                                                                            
Subjt:  RKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIFVAQIKGEFEMNMVGELTFFLGTLYISLEWIRSDLKELTHLKMTKDTNGKKVD

Query:  TNLYRSIIGSLLYLTASRPDIAFAGGVYARYQADPRTSHLHLAKRILKYISAAWFSKKQNSVSLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQSSMIL
                                                           AAWFSKKQNSVSLSTAEAEYI  GSSCSQLLWMKQMLDEYGITQSSMIL
Subjt:  TNLYRSIIGSLLYLTASRPDIAFAGGVYARYQADPRTSHLHLAKRILKYISAAWFSKKQNSVSLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQSSMIL

Query:  YCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVCQRPT
        YCDNLSAISISKN VQHS+ +HI+IQHHFIRELVEANII LEHVQSAFQLADIF KPLDVATFEGLRA+VGVCQRPT
Subjt:  YCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVCQRPT

TYK21888.1 hypothetical protein E5676_scaffold494G00240 [Cucumis melo var. makuwa]0.0e+0064.9Show/hide
Query:  SDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRCNDAEVTLCNLLKVEEVGLWHKRLRHLSGSTIS
        S L ECK GS+VFG G KGK IGKGTINRP LPFLLDVRLVQGL+ANLISI+QL DQGYQ         DAEV LCNL KVEEVGLWHKRL HLSG+TIS
Subjt:  SDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRCNDAEVTLCNLLKVEEVGLWHKRLRHLSGSTIS

Query:  KVTKVDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFT
        KVTK DAIIGL  LSFSSLE C EC A KQ     +P+++SSTSH LELLHIDLMGPMQTESLGRK                                  
Subjt:  KVTKVDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFT

Query:  QLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITS
           REKNTG+ RI+TDHG EFEN++F EFCDNEGIFHEFSA L PQQNGVVERRNRTL+EMA+              E LNTA HIHNRVIL  GTT TS
Subjt:  QLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITS

Query:  YELWKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGE
        YELWKGRKPNVKYFHIF STCFI+SDR+H RKWDSKSDRGIFLGYS NSRAYRVYNQR+KTVMES+NVII DLGKEPNRNLDDEDE FW+SLS K V+ E
Subjt:  YELWKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGE

Query:  LESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKM
         EST+ T ETTY P H  S+R DMSTPSTS    +T E EA++SASQHT ERT G+TDS K  L+P T+IAK+HPSSFII D+HSGIITRKKERKDYAKM
Subjt:  LESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKM

Query:  VANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLE
        V N+CYTSSLEPTTVS AL+++HWILAMQEELLQFERNQVWELVPKPP+ANIIGTKWIFKNKTDE+GRVIRNKARLVAQGYSQIEGLDFGETF PVARLE
Subjt:  VANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLE

Query:  AIRLLLSYTCFR---------------RFKLFQMDVKSAFLNG--YLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKK
        AIRLLLSY CFR                F+   MD      +G  +L  ++YV            D ++  RK                  +  Q+ C K
Subjt:  AIRLLLSYTCFR---------------RFKLFQMDVKSAFLNG--YLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKK

Query:  GSADQTMFIYRQGIDFLIFVAQIKGEFEMNMVGELTFFLGTLYISLEWIRSDLKELTHLKMTKDTNGKKVDTNLYRSIIGSLLYLTASRPDIAFAGGVYA
                I + G+D     A+ K                                THLKMTKD   +KVDTNLYRSIIGSLLYLTASRPDIAF  GV A
Subjt:  GSADQTMFIYRQGIDFLIFVAQIKGEFEMNMVGELTFFLGTLYISLEWIRSDLKELTHLKMTKDTNGKKVDTNLYRSIIGSLLYLTASRPDIAFAGGVYA

Query:  RYQ----------------ADPRTSHLHLAKRILKYISAAWFSKKQNSVSLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQSSMILYCDNLSAISISKN
        RY                  D R S       +   I A WFSKKQNSVSLSTAEAEYI AGSSCSQLLWMKQMLDEYGITQSSM LYCDNLSAISISKN
Subjt:  RYQ----------------ADPRTSHLHLAKRILKYISAAWFSKKQNSVSLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQSSMILYCDNLSAISISKN

Query:  LVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVCQRP
         VQHSR +HIDI+HHFIRELVEANII LEHVQSAFQLADIF KPLDVATFEGLRA+VGVCQRP
Subjt:  LVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVCQRP

TrEMBL top hitse value%identityAlignment
A0A5A7V046 Gag-pol polyprotein0.0e+0081.65Show/hide
Query:  SLNSSDWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRCN-------
        S NSSDWYFDS CSRHMTGNADFFS+LSECK GS+VFGDGGKGKIIGKGTIN  GLPFLLDVRL+QGLAANLISI+QL DQGYQVSFN DRCN       
Subjt:  SLNSSDWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRCN-------

Query:  ---------------DAEVTLCNLLKVEEVGLWHKRLRHLSGSTISKVTKVDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDL
                       DAEVTLCNL KVEE  LWHKRL HLSG+TISKVTKVDAIIGL  L+F SLE C EC AGKQVKSVHKP+NISSTSHILELLHIDL
Subjt:  ---------------DAEVTLCNLLKVEEVGLWHKRLRHLSGSTISKVTKVDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDL

Query:  MGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERR
        MGPMQTESLGRK YAVVCVDDF RYTWIKFILDK ETFKTCQTLFTQLQREKNTG+G+I+TDHGHEFENQHFAEFCDNEGIFHEFSAPLT QQNGV    
Subjt:  MGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERR

Query:  NRTLEEMARAEGLNTAGHIHNRVILHSGTTITSYELWKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESIN
                 AE LNTA HIHNRVIL  GTT TSYELWKGRKPNVKYFHIFGSTCFI+SDRDHRRKWDSKSDRGIFLGY ANSRAYRVYNQ SK VMESIN
Subjt:  NRTLEEMARAEGLNTAGHIHNRVILHSGTTITSYELWKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESIN

Query:  VIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPP
        VIIDDL                        EGELES A TNETTYLPSHLG SR DMSTPSTSAI  +THESEA +SASQHT E+T GATDS KCDLIPP
Subjt:  VIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPP

Query:  THIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEG
        TH AKNHPSSFII DIHSGIITRKKERKDYAKMVANVCYTS LEPTTVSAALSD+HWIL +QEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEG
Subjt:  THIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEG

Query:  RVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRA
        RVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSY CF RFKLFQMDVKSAFLNGYL EEVYVAQPKGFV+ VH+DHVYKLRKALY LKQAPRA
Subjt:  RVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRA

Query:  WYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIFVAQIKG
        WYERLSTYLLQQG ++GSADQTMFIYRQG +FLI    + G
Subjt:  WYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIFVAQIKG

A0A5A7V046 Gag-pol polyprotein5.5e-1250.44Show/hide
Query:  MKLDELFGSLRTFEIHLRHTASRRKLGLALTSVKEEPIEEHRVMQGNDALTESVVVLKKQVAKLKNQFHKYMGNQRNNREDQSLRHRQKEDRRSRSYPAT
        MKLDELFGSLR FEIHL HT SRRKLGLALTS                            VAKLKNQFHK+MG+QRNNREDQ+LR  +  D  S  +   
Subjt:  MKLDELFGSLRTFEIHLRHTASRRKLGLALTSVKEEPIEEHRVMQGNDALTESVVVLKKQVAKLKNQFHKYMGNQRNNREDQSLRHRQKEDRRSRSYPAT

Query:  TRTNSRSSGREPK
         R      G+E K
Subjt:  TRTNSRSSGREPK

A0A5A7V046 Gag-pol polyprotein0.0e+0064.07Show/hide
Query:  SLNSSDWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRCN-------
        S NS DWYFDS CSRHMTGNADFFS+LSECKAGS+VF DGGKGKIIGKGTINRPGLPFLLDVRLVQGL+ANLIS +QL DQGY+V+F+ DRCN       
Subjt:  SLNSSDWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRCN-------

Query:  ---------------DAEVTLCNLLKVEEVGLWHKRLRHLSGSTISKVTKVDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDL
                       DAEVTLCNL KVEE GLWHKRL HL G+TISKV K +AIIGL  LSFSSLE C ECPAGKQVKSVHKP+NIS TSHILELLHIDL
Subjt:  ---------------DAEVTLCNLLKVEEVGLWHKRLRHLSGSTISKVTKVDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDL

Query:  MGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERR
        M PMQTESLGRKRYAVVCVDDF RYTWIKFIL+K ETFKTCQTL TQLQREKNTG+GRIRT+HG EFEN+HFAEFCDNEGIFHEFSA LTPQ+NGVVE+R
Subjt:  MGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERR

Query:  NRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITSYELWKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRV
        N+TL+EMAR             AE LNTA HIHNRVIL   TT TSYELWKGRKPNVKYFHIFG TCFI+SDRDHRRKWDSKSDRGIFLGYSAN+RAYRV
Subjt:  NRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITSYELWKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRV

Query:  YNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTT
        YNQR+K V+ESINVIIDDLGKEPNRNLDDEDEVFWNSLS K  EGE EST PTNETTYLPSH  S++ DMSTPSTS   ++T+ESEA++SASQHT ERT 
Subjt:  YNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTT

Query:  GATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIG
        GATDS K DLIPP HIAKNHP SFIIGD+HSGIIT+KKERKDYAKMVANVCYT SLEPTT+SA L+D+H ILAMQEELLQFERNQVWELVPK PYANIIG
Subjt:  GATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIG

Query:  TKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKL
        TKWIFKNKTDEEGRVI NKARLVAQGYSQIEG                                                         N +        
Subjt:  TKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKL

Query:  RKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIFVAQIKGEFEMNMVGELTFFLGTLYISLEWIRSDLKELTHLKMTKDTNGKKVD
                                                                                                            
Subjt:  RKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIFVAQIKGEFEMNMVGELTFFLGTLYISLEWIRSDLKELTHLKMTKDTNGKKVD

Query:  TNLYRSIIGSLLYLTASRPDIAFAGGVYARYQADPRTSHLHLAKRILKYISAAWFSKKQNSVSLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQSSMIL
                                                           AAWFSKKQNSVSLSTAEAEYI  GSSCSQLLWMKQMLDEYGITQSSMIL
Subjt:  TNLYRSIIGSLLYLTASRPDIAFAGGVYARYQADPRTSHLHLAKRILKYISAAWFSKKQNSVSLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQSSMIL

Query:  YCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVCQRPT
        YCDNLSAISISKN VQHS+ +HI+IQHHFIRELVEANII LEHVQSAFQLADIF KPLDVATFEGLRA+VGVCQRPT
Subjt:  YCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVCQRPT

A0A5D3DEL4 Integrase catalytic domain-containing protein0.0e+0064.9Show/hide
Query:  SDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRCNDAEVTLCNLLKVEEVGLWHKRLRHLSGSTIS
        S L ECK GS+VFG G KGK IGKGTINRP LPFLLDVRLVQGL+ANLISI+QL DQGYQ         DAEV LCNL KVEEVGLWHKRL HLSG+TIS
Subjt:  SDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRCNDAEVTLCNLLKVEEVGLWHKRLRHLSGSTIS

Query:  KVTKVDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFT
        KVTK DAIIGL  LSFSSLE C EC A KQ     +P+++SSTSH LELLHIDLMGPMQTESLGRK                                  
Subjt:  KVTKVDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFT

Query:  QLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITS
           REKNTG+ RI+TDHG EFEN++F EFCDNEGIFHEFSA L PQQNGVVERRNRTL+EMA+              E LNTA HIHNRVIL  GTT TS
Subjt:  QLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITS

Query:  YELWKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGE
        YELWKGRKPNVKYFHIF STCFI+SDR+H RKWDSKSDRGIFLGYS NSRAYRVYNQR+KTVMES+NVII DLGKEPNRNLDDEDE FW+SLS K V+ E
Subjt:  YELWKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGE

Query:  LESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKM
         EST+ T ETTY P H  S+R DMSTPSTS    +T E EA++SASQHT ERT G+TDS K  L+P T+IAK+HPSSFII D+HSGIITRKKERKDYAKM
Subjt:  LESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKM

Query:  VANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLE
        V N+CYTSSLEPTTVS AL+++HWILAMQEELLQFERNQVWELVPKPP+ANIIGTKWIFKNKTDE+GRVIRNKARLVAQGYSQIEGLDFGETF PVARLE
Subjt:  VANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLE

Query:  AIRLLLSYTCFR---------------RFKLFQMDVKSAFLNG--YLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKK
        AIRLLLSY CFR                F+   MD      +G  +L  ++YV            D ++  RK                  +  Q+ C K
Subjt:  AIRLLLSYTCFR---------------RFKLFQMDVKSAFLNG--YLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKK

Query:  GSADQTMFIYRQGIDFLIFVAQIKGEFEMNMVGELTFFLGTLYISLEWIRSDLKELTHLKMTKDTNGKKVDTNLYRSIIGSLLYLTASRPDIAFAGGVYA
                I + G+D     A+ K                                THLKMTKD   +KVDTNLYRSIIGSLLYLTASRPDIAF  GV A
Subjt:  GSADQTMFIYRQGIDFLIFVAQIKGEFEMNMVGELTFFLGTLYISLEWIRSDLKELTHLKMTKDTNGKKVDTNLYRSIIGSLLYLTASRPDIAFAGGVYA

Query:  RYQ----------------ADPRTSHLHLAKRILKYISAAWFSKKQNSVSLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQSSMILYCDNLSAISISKN
        RY                  D R S       +   I A WFSKKQNSVSLSTAEAEYI AGSSCSQLLWMKQMLDEYGITQSSM LYCDNLSAISISKN
Subjt:  RYQ----------------ADPRTSHLHLAKRILKYISAAWFSKKQNSVSLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQSSMILYCDNLSAISISKN

Query:  LVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVCQRP
         VQHSR +HIDI+HHFIRELVEANII LEHVQSAFQLADIF KPLDVATFEGLRA+VGVCQRP
Subjt:  LVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVCQRP

Q84VH6 Gag-pol polyprotein3.7e-27448.67Show/hide
Query:  TNSRSSGRE----PK-----LPILYSLNSS---DWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANL
        T   SSGR+    PK     L +  SL +S   DWY DS CSRHMTG  +F  ++  C    + FGDG KGKI G G +   GLP L  V LV+GL  NL
Subjt:  TNSRSSGRE----PK-----LPILYSLNSS---DWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANL

Query:  ISITQLYDQGYQVSFNNDRC--------------------------NDAEVTLCNLLKVEEVGLWHKRLRHLSGSTISKVTKVDAIIGLSLLSFSSLERC
        ISI+QL D+G+ V+F    C                            +  + C   K +EV +WH+R  HL    + K+    A+ G+  L       C
Subjt:  ISITQLYDQGYQVSFNNDRC--------------------------NDAEVTLCNLLKVEEVGLWHKRLRHLSGSTISKVTKVDAIIGLSLLSFSSLERC

Query:  LECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFE
         EC  GKQVK  H+ +   +TS +LELLH+DLMGPMQ ESLG KRYA V VDDF R+TW+ FI +KS+TF+  + L  +LQREK+  + RIR+DHG EFE
Subjt:  LECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFE

Query:  NQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITSYELWKGRKPNVKYFHIFGSTCF
        N  F EFC +EGI HEFSA +TPQQNG+VER+NRTL+E AR             AE +NTA +IHNRV L  GT  T YE+WKGRKP VK+FHIFGS C+
Subjt:  NQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITSYELWKGRKPNVKYFHIFGSTCF

Query:  IMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRS
        I++DR+ RRK D KSD GIFLGYS NSRAYRV+N R++TVMESINV++DDL   P R  D E++V  +                           G + +
Subjt:  IMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRS

Query:  DMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDK
        D +  + +A  +D+   E +I+                + D  P   I K HP   IIGD + G+ TR +E     ++V+N C+ S +EP  V  AL+D+
Subjt:  DMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDK

Query:  HWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVK
         WI AMQEEL QF+RN+VWELVP+P   N+IGTKWIFKNKT+EEG + RNKARLVAQGY+QIEG+DF ETFAPVARLE+IRLLL   C  +FKL+QMDVK
Subjt:  HWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVK

Query:  SAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLI--------------------F
        SAFLNGYL EE YV QPKGFV+  H DHVY+L+KALYGLKQAPRAWYERL+ +L QQG +KG  D+T+F+ +   + +I                    F
Subjt:  SAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLI--------------------F

Query:  VAQIKGEFEMNMVGELTFFLG--------TLYIS-LEWIRSDLKEL-------------THLKMTKDTNGKKVDTNLYRSIIGSLLYLTASRPDIAFAGG
        V Q++ EFEM++VGELT+FLG        ++++S  ++ ++ +K+              THLK++KD  G  VD +LYRS+IGSLLYLTASRPDI +A G
Subjt:  VAQIKGEFEMNMVGELTFFLG--------TLYIS-LEWIRSDLKEL-------------THLKMTKDTNGKKVDTNLYRSIIGSLLYLTASRPDIAFAGG

Query:  VYARYQADPRTSHLHLAKRILKYISA----------------------------------------------AWFSKKQNSVSLSTAEAEYIVAGSSCSQ
        V ARYQA+P+ SHL+  KRILKY++                                               +WFSKKQN VSLSTAEAEYI AGSSCSQ
Subjt:  VYARYQADPRTSHLHLAKRILKYISA----------------------------------------------AWFSKKQNSVSLSTAEAEYIVAGSSCSQ

Query:  LLWMKQMLDEYGITQSSMILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVC
        L+WMKQML EY + Q  M LYCDN+SAI+ISKN VQHSR +HIDI+HH+IRELV+  +I LEHV +  Q+ADIF K LD   FE LR  +G+C
Subjt:  LLWMKQMLDEYGITQSSMILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVC

Q84VH8 Gag-pol polyprotein2.6e-27247.79Show/hide
Query:  GNQRNNREDQSLRHRQKEDRRSRSYPATTRTNSRSSGREPKLPILYSLNSSDWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRP
        G+  +  +  + R +     + ++      T+ R+S +E            DWY DS CSRHMTG  +F  ++  C    + FGDG KGKIIG G +   
Subjt:  GNQRNNREDQSLRHRQKEDRRSRSYPATTRTNSRSSGREPKLPILYSLNSSDWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRP

Query:  GLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRC----NDAEVTL----------------------CNLLKVEEVGLWHKRLRHLSGSTISKVTK
        GLP L  V LV+GL ANLISI+QL D+G+ V+F    C      +EV +                      C   K +EV +WH+R  HL    + K+  
Subjt:  GLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRC----NDAEVTL----------------------CNLLKVEEVGLWHKRLRHLSGSTISKVTK

Query:  VDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQR
          A+ G+  L       C EC  GKQVK  H+ +   +TS +LELLH+DLMGPMQ ESLG KRYA V VDDF R+TW+ FI +KSETF+  + L  +LQR
Subjt:  VDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQR

Query:  EKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITSYELW
        EK+  + RIR+DHG EFEN  F EFC +EGI HEFSA +TPQQNG+VER+NRTL+E AR             AE +NTA +IHNRV L  GT  T YE+W
Subjt:  EKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMAR-------------AEGLNTAGHIHNRVILHSGTTITSYELW

Query:  KGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELEST
        KGRKP+VK+FHIFGS C+I++DR+ RRK D KSD GIFLGYS NSRAYRV+N R++TVMESINV++DDL   P R  D E++V                 
Subjt:  KGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELEST

Query:  APTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANV
                    LG + +D +    +A  +D+   E++I+                + D    T I K HP   IIGD + G+ TR +E     ++V+N 
Subjt:  APTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANV

Query:  CYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRL
        C+ S +EP  V  AL+D+ WI AMQEEL QF+RN+VWELVP+P   N+IGTKWIFKNKT+EEG + RNKARLVAQGY+QIEG+DF ETFAPVARLE+IRL
Subjt:  CYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRL

Query:  LLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLI---
        LL   C  +FKL+QMDVKSAFLNGYL EEVYV QPKGF +  H DHVY+L+KALYGLKQAPRAWYERL+ +L QQG +KG  D+T+F+ +   + +I   
Subjt:  LLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLI---

Query:  -----------------FVAQIKGEFEMNMVGELTFFLGTLYISLE---------WIRSDLKEL-------------THLKMTKDTNGKKVDTNLYRSII
                         FV Q++ EFEM++VGELT+FLG     +E         + ++ +K+              THLK++KD  G  VD +LYRS+I
Subjt:  -----------------FVAQIKGEFEMNMVGELTFFLGTLYISLE---------WIRSDLKEL-------------THLKMTKDTNGKKVDTNLYRSII

Query:  GSLLYLTASRPDIAFAGGVYARYQADPRTSHLHLAKRILKYISA----------------------------------------------AWFSKKQNSV
        GSLLYLTASRPDI +A GV ARYQA+P+ SHL   KRILKY++                                               +WFSKKQN V
Subjt:  GSLLYLTASRPDIAFAGGVYARYQADPRTSHLHLAKRILKYISA----------------------------------------------AWFSKKQNSV

Query:  SLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQSSMILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVAT
        SLSTAEAEYI AGSSCSQL+WMKQML EY + Q  M LYCDN+SAI+ISKN VQHSR +HIDI+HH+IR+LV+  +I L+HV +  Q+ADIF K LD   
Subjt:  SLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQSSMILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVAT

Query:  FEGLRANVGVC
        FE LR  +G+C
Subjt:  FEGLRANVGVC

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.3e-8626.69Show/hide
Query:  LLDVRLVQGLAANLISITQLYDQGYQVSF---------------------NNDRCNDAEVTLCNLLKVEEVGLWHKRLRHLSGSTISKVTKVDAIIGLSL
        L DV   +  A NL+S+ +L + G  + F                     NN    + +    N        LWH+R  H+S   + ++ + +     SL
Subjt:  LLDVRLVQGLAANLISITQLYDQGYQVSF---------------------NNDRCNDAEVTLCNLLKVEEVGLWHKRLRHLSGSTISKVTKVDAIIGLSL

Query:  LSFSSL--ERCLECPAGKQVKSVHKPINISSTSHI---LELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNT
        L+   L  E C  C  GKQ +   K   +   +HI   L ++H D+ GP+   +L  K Y V+ VD F  Y     I  KS+ F   Q    + +   N 
Subjt:  LSFSSL--ERCLECPAGKQVKSVHKPINISSTSHI---LELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNT

Query:  GVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMAR-------------AEGLNTAGHIHNRVILHS--GTTITSYELWKG
         V  +  D+G E+ +    +FC  +GI +  + P TPQ NGV ER  RT+ E AR              E + TA ++ NR+   +   ++ T YE+W  
Subjt:  GVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMAR-------------AEGLNTAGHIHNRVILHS--GTTITSYELWKG

Query:  RKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNR-----------NLDDEDEVFWNSLSPK
        +KP +K+  +FG+T ++   ++ + K+D KS + IF+GY  N   +++++  ++  + + +V++D+     +R           + + E++ F N  S K
Subjt:  RKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNR-----------NLDDEDEVFWNSLSPK

Query:  PVEGEL-ESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQT----------------------------------DTHESEASISASQHTLERTTGATDSL
         ++ E    +   +   +L     S   +    S   IQT                                  D H +E+  S + +    +  A    
Subjt:  PVEGEL-ESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQT----------------------------------DTHESEASISASQHTLERTTGATDSL

Query:  KCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAAL---------------SDK-HWILAMQEELLQFERNQVWELV
        +  +  PT   KN             II R+ ER      ++     +SL    ++A                  DK  W  A+  EL   + N  W + 
Subjt:  KCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAAL---------------SDK-HWILAMQEELLQFERNQVWELV

Query:  PKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVN
         +P   NI+ ++W+F  K +E G  IR KARLVA+G++Q   +D+ ETFAPVAR+ + R +LS       K+ QMDVK+AFLNG L EE+Y+  P+G   
Subjt:  PKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVN

Query:  RVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQG-----------IDFLI-----------FVAQIKGEFEMNMVGELTFFL
          + D+V KL KA+YGLKQA R W+E     L +      S D+ ++I  +G           +D ++           F   +  +F M  + E+  F+
Subjt:  RVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQG-----------IDFLI-----------FVAQIKGEFEMNMVGELTFFL

Query:  GTLYISLEWIRSDLKELTHLKMTKD----TNGKKVDTNL------------------YRSIIGSLLY-LTASRPDIAFAGGVYARYQADPRTSHLHLAKR
        G + I ++  +  L +  ++K         N   V T L                   RS+IG L+Y +  +RPD+  A  + +RY +   +      KR
Subjt:  GTLYISLEWIRSDLKELTHLKMTKD----TNGKKVDTNL------------------YRSIIGSLLY-LTASRPDIAFAGGVYARYQADPRTSHLHLAKR

Query:  ILKYISA-------------------------------------------------AWFSKKQNSVSLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQS
        +L+Y+                                                    W +K+QNSV+ S+ EAEY+    +  + LW+K +L    I   
Subjt:  ILKYISA-------------------------------------------------AWFSKKQNSVSLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQS

Query:  SMI-LYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVCQ
        + I +Y DN   ISI+ N   H R +HIDI++HF RE V+ N+I LE++ +  QLADIF KPL  A F  LR  +G+ Q
Subjt:  SMI-LYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVCQ

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.3e-10228.23Show/hide
Query:  SDWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTI---NRPGLPFLL-DVRLVQGLAANLISITQLYDQGYQVSFNNDRC--------
        S+W  D+  S H T   D F        G++  G+    KI G G I      G   +L DVR V  L  NLIS   L   GY+  F N +         
Subjt:  SDWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTI---NRPGLPFLL-DVRLVQGLAANLISITQLYDQGYQVSFNNDRC--------

Query:  ------------NDAEVTLCNLLKVEE---VGLWHKRLRHLSGSTISKVTKVDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHID
                     +AE+    L   ++   V LWHKR+ H+S   +  + K   I   S    ++++ C  C  GKQ + V    +     +IL+L++ D
Subjt:  ------------NDAEVTLCNLLKVEE---VGLWHKRLRHLSGSTISKVTKVDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHID

Query:  LMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVER
        + GPM+ ES+G  +Y V  +DD  R  W+  +  K + F+  Q     ++RE    + R+R+D+G E+ ++ F E+C + GI HE + P TPQ NGV ER
Subjt:  LMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVER

Query:  RNRTLEEMARA-------------EGLNTAGHIHNRVILHSGTTITSYEL----WKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANS
         NRT+ E  R+             E + TA ++ NR    S +   ++E+    W  ++ +  +  +FG   F    ++ R K D KS   IF+GY    
Subjt:  RNRTLEEMARA-------------EGLNTAGHIHNRVILHSGTTITSYEL----WKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANS

Query:  RAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHT
          YR+++   K V+ S +V+  +      R   D  E   N + P  V               +PS   +S +  S  ST+   ++  E    +      
Subjt:  RAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSRSDMSTPSTSAIQTDTHESEASISASQHT

Query:  LERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALS--DKHWIL-AMQEELLQFERNQVWELVPK
        L+      +        PT   + H    +       + +R+    +Y      V  +   EP ++   LS  +K+ ++ AMQEE+   ++N  ++LV  
Subjt:  LERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALS--DKHWIL-AMQEELLQFERNQVWELVPK

Query:  PPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRV
        P     +  KW+FK K D + +++R KARLV +G+ Q +G+DF E F+PV ++ +IR +LS       ++ Q+DVK+AFL+G L EE+Y+ QP+GF    
Subjt:  PPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRV

Query:  HQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQG----------IDFLIFVAQIKG-----------EFEMNMVGELTFFLG--
         +  V KL K+LYGLKQAPR WY +  +++  Q   K  +D  ++  R            +D ++ V + KG            F+M  +G     LG  
Subjt:  HQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQG----------IDFLIFVAQIKG-----------EFEMNMVGELTFFLG--

Query:  --------TLYISLE-WIRSDLKELT-------------HLKMTKDTNGKKVDTN------LYRSIIGSLLY-LTASRPDIAFAGGVYARYQADPRTSHL
                 L++S E +I   L+                HLK++K      V+         Y S +GSL+Y +  +RPDIA A GV +R+  +P   H 
Subjt:  --------TLYISLE-WIRSDLKELT-------------HLKMTKDTNGKKVDTN------LYRSIIGSLLY-LTASRPDIAFAGGVYARYQADPRTSHL

Query:  HLAKRILKYI---------------------------------------------SAAWFSKKQNSVSLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQ
           K IL+Y+                                             + +W SK Q  V+LST EAEYI A  +  +++W+K+ L E G+ Q
Subjt:  HLAKRILKYI---------------------------------------------SAAWFSKKQNSVSLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQ

Query:  SSMILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGV
           ++YCD+ SAI +SKN + H+R +HID+++H+IRE+V+   +++  + +    AD+  K +    FE  +  VG+
Subjt:  SSMILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGV

P92520 Uncharacterized mitochondrial protein AtMg008201.5e-1944.72Show/hide
Query:  IITRKKE--RKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQI
        ++TR K    K   K    +  T   EP +V  AL D  W  AMQEEL    RN+ W LVP P   NI+G KW+FK K   +G + R KARLVA+G+ Q 
Subjt:  IITRKKE--RKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQI

Query:  EGLDFGETFAPVARLEAIRLLLS
        EG+ F ET++PV R   IR +L+
Subjt:  EGLDFGETFAPVARLEAIRLLLS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.1e-8928.66Show/hide
Query:  WHKRLRHLSGSTISKVTKVDAIIGLSLLSFS-SLERCLECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFI
        WH RL H + S ++ V    +   LS+L+ S     C +C   K  K       I+ST   LE ++ D+       S    RY V+ VD F RYTW+  +
Subjt:  WHKRLRHLSGSTISKVTKVDAIIGLSLLSFS-SLERCLECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFI

Query:  LDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEE-----MARAE--------GLNTAGH
          KS+  +T  T    L+    T +G   +D+G EF      E+    GI H  S P TP+ NG+ ER++R + E     ++ A             A +
Subjt:  LDKSETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEE-----MARAE--------GLNTAGH

Query:  IHNRVILHSGTTITSYELWKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDD---------LGKE
        + NR+        + ++   G  PN     +FG  C+      ++ K D KS + +FLGYS    AY   + ++  +  S +V  D+             
Subjt:  IHNRVILHSGTTITSYELWKGRKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDD---------LGKE

Query:  PNRNLDDEDEVFWNSLSPKPVEGEL-----------------ESTAPTNETTYLPSHLGSSRSDM--STPSTSA--------------IQTDTHESE---
        P +    E    W+  +  P    +                   +AP   +    S+L SS S    S+P  +A               QT TH S+   
Subjt:  PNRNLDDEDEVFWNSLSPKPVEGEL-----------------ESTAPTNETTYLPSHLGSSRSDM--STPSTSA--------------IQTDTHESE---

Query:  -----------------ASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGII------TRKKER--KDYAKMVANVCYTSSLEPTTV
                             +S  +   TT A+ S      P   I    P + I+ + +   +      TR K    K   K    V   +  EP T 
Subjt:  -----------------ASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGII------TRKKER--KDYAKMVANVCYTSSLEPTTV

Query:  SAALSDKHWILAMQEELLQFERNQVWELVPKPP-YANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRF
          AL D+ W  AM  E+     N  W+LVP PP +  I+G +WIF  K + +G + R KARLVA+GY+Q  GLD+ ETF+PV +  +IR++L     R +
Subjt:  SAALSDKHWILAMQEELLQFERNQVWELVPKPP-YANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRF

Query:  KLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQG---IDFLIFVAQI-----
         + Q+DV +AFL G L ++VY++QP GF+++   ++V KLRKALYGLKQAPRAWY  L  YLL  G     +D ++F+ ++G   +  L++V  I     
Subjt:  KLFQMDVKSAFLNGYLYEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQG---IDFLIFVAQI-----

Query:  ------------KGEFEMNMVGELTFFLGT--------LYISLEWIRSDLKELTHL--------------KMTKDTNGKKVDTNLYRSIIGSLLYLTASR
                       F +    EL +FLG         L++S      DL   T++              K++  +  K  D   YR I+GSL YL  +R
Subjt:  ------------KGEFEMNMVGELTFFLGT--------LYISLEWIRSDLKELTHL--------------KMTKDTNGKKVDTNLYRSIIGSLLYLTASR

Query:  PDIAFAGGVYARYQADPRTSHLHLAKRILKYISA----------------------------------------------AWFSKKQNSVSLSTAEAEYI
        PDI++A    +++   P   HL   KRIL+Y++                                               +W SKKQ  V  S+ EAEY 
Subjt:  PDIAFAGGVYARYQADPRTSHLHLAKRILKYISA----------------------------------------------AWFSKKQNSVSLSTAEAEYI

Query:  VAGSSCSQLLWMKQMLDEYGITQS-SMILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVG
           ++ S++ W+  +L E GI  +   ++YCDN+ A  +  N V HSR +HI I +HFIR  V++  +R+ HV +  QLAD   KPL    F+   + +G
Subjt:  VAGSSCSQLLWMKQMLDEYGITQS-SMILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVG

Query:  VCQRP
        V + P
Subjt:  VCQRP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.1e-8926.75Show/hide
Query:  HRQKEDRRSRSYPATTRTNSRSSGREP-----KLPILYSLNSSDWYFDSRCSRHMTGNADFFSDLSECKAG-SIVFGDGGKGKIIGKGTINRPGLPFLLD
        H  K   +   + +TT     +S   P      L +    N+++W  DS  + H+T + +  S       G  ++  DG    I   G+ + P     LD
Subjt:  HRQKEDRRSRSYPATTRTNSRSSGREP-----KLPILYSLNSSDWYFDSRCSRHMTGNADFFSDLSECKAG-SIVFGDGGKGKIIGKGTINRPGLPFLLD

Query:  ---VRLVQGLAANLISITQLYD------QGYQVSFNNDRCNDAEVTLCNLLKVE----------------------EVGLWHKRLRHLSGSTISKVTKVD
           V  V  +  NLIS+ +L +      + +  SF     N     L    K E                          WH RL H S + ++ V    
Subjt:  ---VRLVQGLAANLISITQLYD------QGYQVSFNNDRCNDAEVTLCNLLKVE----------------------EVGLWHKRLRHLSGSTISKVTKVD

Query:  AIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREK
        ++  L+      L  C +C   K  K       I+S S  LE ++ D+       S+   RY V+ VD F RYTW+  +  KS+   T     + ++   
Subjt:  AIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDKSETFKTCQTLFTQLQREK

Query:  NTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMARA-------------EGLNTAGHIHNRVILHSGTTITSYELWKG
         T +G + +D+G EF      ++    GI H  S P TP+ NG+ ER++R + EM                   + A ++ NR+        + ++   G
Subjt:  NTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMARA-------------EGLNTAGHIHNRVILHSGTTITSYELWKG

Query:  RKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDD---------LGKEPNRNLDDEDEVFWNS------
        + PN +   +FG  C+      +R K + KS +  F+GYS    AY   +  +  +  S +V  D+          G   ++    +    W S      
Subjt:  RKPNVKYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDD---------LGKEPNRNLDDEDEVFWNS------

Query:  ----LSPKPVEGELESTAPTNETTYLP---SHLGSSR---SDMSTPSTSAIQTDTHESEASISASQHTLERTTGA------------------TDSLKCD
            L   P  G    T+P   ++  P   + + SS    S +S+PS+S     +H      +    T    + +                     L   
Subjt:  ----LSPKPVEGELESTAPTNETTYLP---SHLGSSR---SDMSTPSTSAIQTDTHESEASISASQHTLERTTGA------------------TDSLKCD

Query:  LIPPTHI--------AKNHPSSFIIG----------------------DIHSGIITRKKE--RKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEE
         I   HI          N PSS                          + HS + TR K+  RK   K        ++ EP T   A+ D  W  AM  E
Subjt:  LIPPTHI--------AKNHPSSFIIG----------------------DIHSGIITRKKE--RKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEE

Query:  LLQFERNQVWELV-PKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYL
        +     N  W+LV P PP   I+G +WIF  K + +G + R KARLVA+GY+Q  GLD+ ETF+PV +  +IR++L     R + + Q+DV +AFL G L
Subjt:  LLQFERNQVWELV-PKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYL

Query:  YEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQG---IDFLIFVAQI-----------------KGEF
         +EVY++QP GFV++   D+V +LRKA+YGLKQAPRAWY  L TYLL  G     +D ++F+ ++G   I  L++V  I                    F
Subjt:  YEEVYVAQPKGFVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQG---IDFLIFVAQI-----------------KGEF

Query:  EMNMVGELTFFLGT--------LYISLEWIRSDLKELTHL--------------KMTKDTNGKKVDTNLYRSIIGSLLYLTASRPDIAFAGGVYARYQAD
         +    +L +FLG         L++S      DL   T++              K+T  +  K  D   YR I+GSL YL  +RPD+++A    ++Y   
Subjt:  EMNMVGELTFFLGT--------LYISLEWIRSDLKELTHL--------------KMTKDTNGKKVDTNLYRSIIGSLLYLTASRPDIAFAGGVYARYQAD

Query:  PRTSHLHLAKRILKYISA----------------------------------------------AWFSKKQNSVSLSTAEAEYIVAGSSCSQLLWMKQML
        P   H +  KR+L+Y++                                               +W SKKQ  V  S+ EAEY    ++ S+L W+  +L
Subjt:  PRTSHLHLAKRILKYISA----------------------------------------------AWFSKKQNSVSLSTAEAEYIVAGSSCSQLLWMKQML

Query:  DEYGITQS-SMILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVCQRP
         E GI  S   ++YCDN+ A  +  N V HSR +HI + +HFIR  V++  +R+ HV +  QLAD   KPL    F+     +GV + P
Subjt:  DEYGITQS-SMILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVCQRP

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 88.1e-5631.3Show/hide
Query:  VCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIR
        VC   + EP+T + A     W  AM +E+   E    WE+   PP    IG KW++K K + +G + R KARLVA+GY+Q EG+DF ETF+PV +L +++
Subjt:  VCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIR

Query:  LLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNR----VHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDF
        L+L+ +    F L Q+D+ +AFLNG L EE+Y+  P G+  R    +  + V  L+K++YGLKQA R W+ + S  L+  G  +  +D T F+      F
Subjt:  LLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKGFVNR----VHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDF

Query:  ---LIFV-----------------AQIKGEFEMNMVGELTFFLGT--------LYISLEWIRSDLKELTHLKMTKDTN--------------GKKVDTNL
           L++V                 +Q+K  F++  +G L +FLG         + I       DL + T L   K ++              G  VD   
Subjt:  ---LIFV-----------------AQIKGEFEMNMVGELTFFLGT--------LYISLEWIRSDLKELTHLKMTKDTN--------------GKKVDTNL

Query:  YRSIIGSLLYLTASRPDIAFAGGVYARYQADPRTSHLHLAKRILKYISA----------------------------------------------AWFSK
        YR +IG L+YL  +R DI+FA    +++   PR +H     +IL YI                                                +W SK
Subjt:  YRSIIGSLLYLTASRPDIAFAGGVYARYQADPRTSHLHLAKRILKYISA----------------------------------------------AWFSK

Query:  KQNSVSLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQSS-MILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRE
        KQ  VS S+AEAEY     +  +++W+ Q   E  +  S   +L+CDN +AI I+ N V H R +HI+   H +RE
Subjt:  KQNSVSLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQSS-MILYCDNLSAISISKNLVQHSRRRHIDIQHHFIRE

ATMG00810.1 DNA/RNA polymerases superfamily protein3.8e-0523.53Show/hide
Query:  TNGKKVDTNLYRSIIGSLLYLTASRPDIAFAGGVYARYQADPRTSHLHLAKRILKYISA-----------------------------------------
        +  K  D + +RSI+G+L YLT +RPDI++A  +  +   +P  +   L KR+L+Y+                                           
Subjt:  TNGKKVDTNLYRSIIGSLLYLTASRPDIAFAGGVYARYQADPRTSHLHLAKRILKYISA-----------------------------------------

Query:  -----AWFSKKQNSVSLSTAEAEYIVAGSSCSQLLW
             +W +K+Q +VS S+ E EY     + ++L W
Subjt:  -----AWFSKKQNSVSLSTAEAEYIVAGSSCSQLLW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.1e-2044.72Show/hide
Query:  IITRKKE--RKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQI
        ++TR K    K   K    +  T   EP +V  AL D  W  AMQEEL    RN+ W LVP P   NI+G KW+FK K   +G + R KARLVA+G+ Q 
Subjt:  IITRKKE--RKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEELLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQI

Query:  EGLDFGETFAPVARLEAIRLLLS
        EG+ F ET++PV R   IR +L+
Subjt:  EGLDFGETFAPVARLEAIRLLLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATTGGATGAACTGTTTGGGTCACTGCGAACCTTCGAAATTCACTTGAGACACACAGCCAGCAGAAGGAAACTGGGACTAGCACTAACCTCAGTAAAGGAAGAGCC
AATCGAGGAACACCGAGTGATGCAGGGTAATGATGCTCTTACAGAATCAGTAGTTGTGCTGAAAAAACAGGTTGCGAAACTTAAAAACCAGTTTCACAAATACATGGGAA
ACCAACGCAACAACAGAGAGGACCAGTCCTTAAGACACAGACAGAAAGAAGACAGAAGATCAAGAAGTTATCCTGCAACAACAAGAACAAATTCAAGATCTAGTGGAAGA
GAACCAAAGCTTCCTATCCTCTATAGCCTCAACTCTAGTGACTGGTACTTTGACAGTAGGTGTTCCAGACACATGACAGGTAATGCAGATTTCTTTTCTGATCTGAGTGA
ATGCAAAGCTGGGTCAATAGTATTTGGAGATGGAGGAAAAGGAAAAATAATTGGCAAAGGAACGATTAACCGTCCAGGTCTACCATTTCTTCTTGATGTTCGACTAGTAC
AAGGACTGGCTGCAAATCTCATAAGCATCACCCAATTATATGACCAAGGCTATCAAGTCAGTTTCAATAATGATAGATGTAATGATGCAGAGGTGACCTTATGCAATCTA
TTAAAAGTGGAAGAAGTTGGACTCTGGCACAAACGACTTAGACACCTTAGTGGCTCTACTATCTCAAAAGTCACCAAAGTTGATGCCATCATCGGTCTTTCCTTGCTATC
ATTCTCGTCATTAGAAAGATGTTTAGAGTGCCCAGCTGGCAAGCAAGTCAAGTCTGTGCACAAGCCTATAAATATCTCCTCGACGTCCCATATTCTGGAACTTCTTCATA
TAGACCTAATGGGGCCCATGCAAACAGAAAGCTTGGGAAGAAAACGGTATGCAGTAGTGTGTGTAGACGATTTCTTCCGCTACACCTGGATAAAATTTATCCTTGACAAA
TCAGAAACCTTTAAGACATGTCAGACCTTGTTCACTCAACTCCAAAGAGAGAAAAATACTGGTGTTGGCCGAATACGAACTGATCATGGGCATGAATTTGAGAATCAACA
CTTTGCTGAGTTCTGTGATAATGAAGGCATCTTTCATGAGTTCTCTGCCCCTTTAACACCACAACAAAATGGAGTTGTAGAGAGAAGGAATCGAACCTTAGAGGAGATGG
CCCGGGCTGAAGGTCTAAACACTGCAGGCCATATACATAACAGAGTCATTCTCCATTCAGGGACCACTATTACCTCATATGAGCTGTGGAAAGGGAGAAAACCAAATGTA
AAGTATTTTCACATCTTTGGCAGCACGTGCTTTATCATGAGTGATAGGGATCATCGCAGAAAGTGGGACTCAAAGTCTGATCGTGGAATATTTCTGGGATATTCTGCTAA
CAGCCGAGCCTACAGGGTCTACAACCAACGTTCCAAAACAGTAATGGAATCCATTAACGTCATTATTGATGACCTTGGTAAGGAACCCAATAGAAATCTTGATGATGAAG
ATGAGGTTTTTTGGAATTCCCTTTCTCCTAAACCTGTTGAAGGAGAGTTAGAATCGACGGCCCCCACTAATGAAACAACATACTTACCCTCTCATCTCGGTTCAAGCAGA
AGTGACATGTCAACACCTTCTACATCAGCCATTCAGACTGACACACATGAAAGTGAAGCATCAATATCTGCAAGTCAGCACACTCTAGAGCGAACTACGGGTGCAACTGA
TTCTTTAAAGTGTGACCTCATACCTCCTACGCATATAGCCAAAAACCACCCCTCCAGCTTCATTATTGGAGATATTCACAGTGGAATCATAACTCGGAAGAAGGAGAGGA
AAGATTATGCGAAAATGGTTGCCAATGTATGCTACACATCTTCACTAGAACCGACCACGGTCTCTGCAGCACTCTCCGACAAACACTGGATCTTGGCTATGCAGGAAGAG
CTACTCCAGTTTGAAAGAAACCAAGTATGGGAATTAGTGCCAAAGCCACCTTATGCTAACATAATTGGTACCAAATGGATCTTTAAGAACAAAACGGATGAAGAAGGTAG
AGTTATTCGTAATAAAGCTAGACTGGTTGCTCAAGGATATTCTCAAATAGAAGGGCTGGATTTTGGAGAAACATTTGCCCCAGTTGCCAGATTAGAAGCCATCCGACTAC
TGCTAAGCTACACATGTTTTCGGAGATTCAAACTATTCCAAATGGATGTAAAGAGTGCGTTCCTAAATGGGTACTTATATGAGGAAGTATATGTGGCCCAGCCAAAAGGA
TTTGTTAATCGAGTACATCAGGATCATGTTTACAAACTTCGAAAGGCACTCTATGGACTTAAACAAGCTCCTAGAGCTTGGTATGAGAGACTCTCCACTTACCTGTTACA
ACAAGGATGTAAAAAGGGCAGTGCGGATCAAACTATGTTTATATATCGTCAAGGCATTGACTTTCTGATCTTTGTTGCCCAGATAAAGGGAGAATTCGAAATGAACATGG
TTGGGGAACTAACCTTCTTCCTGGGAACCTTATATATAAGTTTGGAATGGATAAGGTCAGACCTAAAAGAACTTACACATCTGAAAATGACTAAGGACACAAATGGTAAA
AAAGTTGATACAAACTTGTACCGAAGCATAATTGGGAGCTTGCTCTATCTAACAGCTAGCAGACCGGATATAGCCTTCGCAGGAGGAGTTTATGCTCGGTATCAGGCTGA
CCCGCGTACGTCACATCTTCACTTGGCAAAACGAATACTCAAATATATATCAGCTGCTTGGTTCAGCAAGAAACAAAACAGTGTCTCCCTCTCAACAGCCGAAGCTGAAT
ACATTGTTGCAGGGAGTAGCTGCTCTCAACTGTTGTGGATGAAACAAATGCTTGATGAATATGGGATAACTCAGTCTTCCATGATTCTCTACTGTGATAATCTGAGCGCA
ATAAGCATCTCCAAAAATCTAGTTCAACATAGTCGAAGAAGGCACATAGATATCCAACATCATTTTATTCGAGAACTCGTTGAAGCCAATATTATAAGATTAGAACATGT
TCAAAGTGCCTTTCAGCTAGCAGACATATTCATCAAGCCTCTGGATGTTGCAACATTTGAAGGACTGAGGGCCAATGTTGGAGTCTGTCAACGGCCCACATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAATTGGATGAACTGTTTGGGTCACTGCGAACCTTCGAAATTCACTTGAGACACACAGCCAGCAGAAGGAAACTGGGACTAGCACTAACCTCAGTAAAGGAAGAGCC
AATCGAGGAACACCGAGTGATGCAGGGTAATGATGCTCTTACAGAATCAGTAGTTGTGCTGAAAAAACAGGTTGCGAAACTTAAAAACCAGTTTCACAAATACATGGGAA
ACCAACGCAACAACAGAGAGGACCAGTCCTTAAGACACAGACAGAAAGAAGACAGAAGATCAAGAAGTTATCCTGCAACAACAAGAACAAATTCAAGATCTAGTGGAAGA
GAACCAAAGCTTCCTATCCTCTATAGCCTCAACTCTAGTGACTGGTACTTTGACAGTAGGTGTTCCAGACACATGACAGGTAATGCAGATTTCTTTTCTGATCTGAGTGA
ATGCAAAGCTGGGTCAATAGTATTTGGAGATGGAGGAAAAGGAAAAATAATTGGCAAAGGAACGATTAACCGTCCAGGTCTACCATTTCTTCTTGATGTTCGACTAGTAC
AAGGACTGGCTGCAAATCTCATAAGCATCACCCAATTATATGACCAAGGCTATCAAGTCAGTTTCAATAATGATAGATGTAATGATGCAGAGGTGACCTTATGCAATCTA
TTAAAAGTGGAAGAAGTTGGACTCTGGCACAAACGACTTAGACACCTTAGTGGCTCTACTATCTCAAAAGTCACCAAAGTTGATGCCATCATCGGTCTTTCCTTGCTATC
ATTCTCGTCATTAGAAAGATGTTTAGAGTGCCCAGCTGGCAAGCAAGTCAAGTCTGTGCACAAGCCTATAAATATCTCCTCGACGTCCCATATTCTGGAACTTCTTCATA
TAGACCTAATGGGGCCCATGCAAACAGAAAGCTTGGGAAGAAAACGGTATGCAGTAGTGTGTGTAGACGATTTCTTCCGCTACACCTGGATAAAATTTATCCTTGACAAA
TCAGAAACCTTTAAGACATGTCAGACCTTGTTCACTCAACTCCAAAGAGAGAAAAATACTGGTGTTGGCCGAATACGAACTGATCATGGGCATGAATTTGAGAATCAACA
CTTTGCTGAGTTCTGTGATAATGAAGGCATCTTTCATGAGTTCTCTGCCCCTTTAACACCACAACAAAATGGAGTTGTAGAGAGAAGGAATCGAACCTTAGAGGAGATGG
CCCGGGCTGAAGGTCTAAACACTGCAGGCCATATACATAACAGAGTCATTCTCCATTCAGGGACCACTATTACCTCATATGAGCTGTGGAAAGGGAGAAAACCAAATGTA
AAGTATTTTCACATCTTTGGCAGCACGTGCTTTATCATGAGTGATAGGGATCATCGCAGAAAGTGGGACTCAAAGTCTGATCGTGGAATATTTCTGGGATATTCTGCTAA
CAGCCGAGCCTACAGGGTCTACAACCAACGTTCCAAAACAGTAATGGAATCCATTAACGTCATTATTGATGACCTTGGTAAGGAACCCAATAGAAATCTTGATGATGAAG
ATGAGGTTTTTTGGAATTCCCTTTCTCCTAAACCTGTTGAAGGAGAGTTAGAATCGACGGCCCCCACTAATGAAACAACATACTTACCCTCTCATCTCGGTTCAAGCAGA
AGTGACATGTCAACACCTTCTACATCAGCCATTCAGACTGACACACATGAAAGTGAAGCATCAATATCTGCAAGTCAGCACACTCTAGAGCGAACTACGGGTGCAACTGA
TTCTTTAAAGTGTGACCTCATACCTCCTACGCATATAGCCAAAAACCACCCCTCCAGCTTCATTATTGGAGATATTCACAGTGGAATCATAACTCGGAAGAAGGAGAGGA
AAGATTATGCGAAAATGGTTGCCAATGTATGCTACACATCTTCACTAGAACCGACCACGGTCTCTGCAGCACTCTCCGACAAACACTGGATCTTGGCTATGCAGGAAGAG
CTACTCCAGTTTGAAAGAAACCAAGTATGGGAATTAGTGCCAAAGCCACCTTATGCTAACATAATTGGTACCAAATGGATCTTTAAGAACAAAACGGATGAAGAAGGTAG
AGTTATTCGTAATAAAGCTAGACTGGTTGCTCAAGGATATTCTCAAATAGAAGGGCTGGATTTTGGAGAAACATTTGCCCCAGTTGCCAGATTAGAAGCCATCCGACTAC
TGCTAAGCTACACATGTTTTCGGAGATTCAAACTATTCCAAATGGATGTAAAGAGTGCGTTCCTAAATGGGTACTTATATGAGGAAGTATATGTGGCCCAGCCAAAAGGA
TTTGTTAATCGAGTACATCAGGATCATGTTTACAAACTTCGAAAGGCACTCTATGGACTTAAACAAGCTCCTAGAGCTTGGTATGAGAGACTCTCCACTTACCTGTTACA
ACAAGGATGTAAAAAGGGCAGTGCGGATCAAACTATGTTTATATATCGTCAAGGCATTGACTTTCTGATCTTTGTTGCCCAGATAAAGGGAGAATTCGAAATGAACATGG
TTGGGGAACTAACCTTCTTCCTGGGAACCTTATATATAAGTTTGGAATGGATAAGGTCAGACCTAAAAGAACTTACACATCTGAAAATGACTAAGGACACAAATGGTAAA
AAAGTTGATACAAACTTGTACCGAAGCATAATTGGGAGCTTGCTCTATCTAACAGCTAGCAGACCGGATATAGCCTTCGCAGGAGGAGTTTATGCTCGGTATCAGGCTGA
CCCGCGTACGTCACATCTTCACTTGGCAAAACGAATACTCAAATATATATCAGCTGCTTGGTTCAGCAAGAAACAAAACAGTGTCTCCCTCTCAACAGCCGAAGCTGAAT
ACATTGTTGCAGGGAGTAGCTGCTCTCAACTGTTGTGGATGAAACAAATGCTTGATGAATATGGGATAACTCAGTCTTCCATGATTCTCTACTGTGATAATCTGAGCGCA
ATAAGCATCTCCAAAAATCTAGTTCAACATAGTCGAAGAAGGCACATAGATATCCAACATCATTTTATTCGAGAACTCGTTGAAGCCAATATTATAAGATTAGAACATGT
TCAAAGTGCCTTTCAGCTAGCAGACATATTCATCAAGCCTCTGGATGTTGCAACATTTGAAGGACTGAGGGCCAATGTTGGAGTCTGTCAACGGCCCACATGA
Protein sequenceShow/hide protein sequence
MKLDELFGSLRTFEIHLRHTASRRKLGLALTSVKEEPIEEHRVMQGNDALTESVVVLKKQVAKLKNQFHKYMGNQRNNREDQSLRHRQKEDRRSRSYPATTRTNSRSSGR
EPKLPILYSLNSSDWYFDSRCSRHMTGNADFFSDLSECKAGSIVFGDGGKGKIIGKGTINRPGLPFLLDVRLVQGLAANLISITQLYDQGYQVSFNNDRCNDAEVTLCNL
LKVEEVGLWHKRLRHLSGSTISKVTKVDAIIGLSLLSFSSLERCLECPAGKQVKSVHKPINISSTSHILELLHIDLMGPMQTESLGRKRYAVVCVDDFFRYTWIKFILDK
SETFKTCQTLFTQLQREKNTGVGRIRTDHGHEFENQHFAEFCDNEGIFHEFSAPLTPQQNGVVERRNRTLEEMARAEGLNTAGHIHNRVILHSGTTITSYELWKGRKPNV
KYFHIFGSTCFIMSDRDHRRKWDSKSDRGIFLGYSANSRAYRVYNQRSKTVMESINVIIDDLGKEPNRNLDDEDEVFWNSLSPKPVEGELESTAPTNETTYLPSHLGSSR
SDMSTPSTSAIQTDTHESEASISASQHTLERTTGATDSLKCDLIPPTHIAKNHPSSFIIGDIHSGIITRKKERKDYAKMVANVCYTSSLEPTTVSAALSDKHWILAMQEE
LLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYTCFRRFKLFQMDVKSAFLNGYLYEEVYVAQPKG
FVNRVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGCKKGSADQTMFIYRQGIDFLIFVAQIKGEFEMNMVGELTFFLGTLYISLEWIRSDLKELTHLKMTKDTNGK
KVDTNLYRSIIGSLLYLTASRPDIAFAGGVYARYQADPRTSHLHLAKRILKYISAAWFSKKQNSVSLSTAEAEYIVAGSSCSQLLWMKQMLDEYGITQSSMILYCDNLSA
ISISKNLVQHSRRRHIDIQHHFIRELVEANIIRLEHVQSAFQLADIFIKPLDVATFEGLRANVGVCQRPT