; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh04G021440 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh04G021440
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionGag/pol protein
Genome locationCmo_Chr04:14906602..14914959
RNA-Seq ExpressionCmoCh04G021440
SyntenyCmoCh04G021440
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]9.0e-28746.66Show/hide
Query:  MTNSIVQLLASEKLNGNNYTTWKSNLNTILVIDDLRFVLTEECPPNPSSNANRTVRDAYDRWTKENDKARVYILASISDVLAKKHNVMGTAKEIMESLKG
        M  SIVQLLASEKLNG+NY+ WKSNLNTILV+DDLRFVLTEECP  P+ NANRTVR+AYDRW K NDKARVYILAS++DVLAKKH+ + TAK IM+SL+ 
Subjt:  MTNSIVQLLASEKLNGNNYTTWKSNLNTILVIDDLRFVLTEECPPNPSSNANRTVRDAYDRWTKENDKARVYILASISDVLAKKHNVMGTAKEIMESLKG

Query:  MFGQPSFSLRYDAIKYIYNYRMKEGTSVREHVLDMMVHFNVTKKNEAVIDEKSQVSFIMESLPKSFFQFRTNVIMNKKEYNLTALPNELQTYQFLLTNKG
        MFGQPS+SLR++AIK+IY  RMKEGTSVREHVLDMM+HFN+ + N   IDE +QVSFI++SLPKSF  F+TN  +NK E+NLT L NELQ +Q L  +KG
Subjt:  MFGQPSFSLRYDAIKYIYNYRMKEGTSVREHVLDMMVHFNVTKKNEAVIDEKSQVSFIMESLPKSFFQFRTNVIMNKKEYNLTALPNELQTYQFLLTNKG

Query:  QTGEANVAIFK-KLLRGLSSKN------------------------------------------------------------KYDYLYL-----------
        +  EANVA+ K K +RG SSKN                                                            KYD L +           
Subjt:  QTGEANVAIFK-KLLRGLSSKN------------------------------------------------------------KYDYLYL-----------

Query:  -------MHHKSEALEKFREYKTEVENLLGKTIKT---LRSDRGGEYMDLRFQD-YMIEHGI-------RSQLSAP-----VETATYILNMV--------
                +H   + ++   +K   E  +   + T   + ++  G+ + L FQD Y+I   +       R+ +S       + T ++ +N V        
Subjt:  -------MHHKSEALEKFREYKTEVENLLGKTIKT---LRSDRGGEYMDLRFQD-YMIEHGI-------RSQLSAP-----VETATYILNMV--------

Query:  ----------------------------------PSKSVSETPYELWKGR--------------------------------------------KGYRAK
                                            + VS   Y LW  R                                            KG RAK
Subjt:  ----------------------------------PSKSVSETPYELWKGR--------------------------------------------KGYRAK

Query:  ETLELVHTDLCGPMNIKAQGGCEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAP--
          LELVH+DLCGPMN+KA+GG EYFISFIDD+SRY ++YL+HHKSE+ EKF+EYK EVEN +GKTIKTLRSDRGGEYMD +FQDY+IE GI+SQLSAP  
Subjt:  ETLELVHTDLCGPMNIKAQGGCEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAP--

Query:  -----------------------------------VETATYILNMVPSKSVSETPYELWKGRK-------------------------------------
                                           +ETA +ILN VPSKSV ETPYELWKGRK                                     
Subjt:  -----------------------------------VETATYILNMVPSKSVSETPYELWKGRK-------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------DQPDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKSI
                                                  D P  VKPIGCKWIYKRKRDQ GKVQ FKARLVAK YTQ+E VDYEETFSP+AMLKSI
Subjt:  ------------------------------------------DQPDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKSI

Query:  RILLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQDREQRVCKLKRFIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFL
        RILLSIATFY+YEIW+MDVKTAFLN NLEESIYM Q EGFI QD+EQ+VCKL++ IYGLKQASRSWNI+FDTAIKSYGF+QNVDEPCVYK+IVNS VAFL
Subjt:  RILLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQDREQRVCKLKRFIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFL

Query:  VLY-DYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSETPYELWKGRKDQPDG
        +LY D + L+ +         EY T+V+                ++++ +FQ               +  A YIL +           ++ + RK++   
Subjt:  VLY-DYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSETPYELWKGRKDQPDG

Query:  VKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQSPKTPQEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTAVKNILKY
        +      +I K       ++SKKG L FRHG+HLSKEQ PKTPQEVEDMR+IPY+SAVGSLMYAMLCT+PDICY++GIVSRYQSN G  HWTAVKNILKY
Subjt:  VKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQSPKTPQEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTAVKNILKY

Query:  LRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSIKQGCIANSTMEAEYVAVYEAAKESVWHRKFLTHLEVVPNMHLPVTLY
        LRRTR+YML+YGAKDLILTGYTDSDFQ+D D+RKSTS SVFTLNGGA++WRS+KQ CIA+STMEAEYVA  EAAKE+VW RKFLT LEVVPNMHLP+TLY
Subjt:  LRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSIKQGCIANSTMEAEYVAVYEAAKESVWHRKFLTHLEVVPNMHLPVTLY

Query:  CDNSGAVANSKEPRSHKRGKHIERKYHLIREIV
        CDNSGAVANSKEPRSHKRGKHIERKYHLIREIV
Subjt:  CDNSGAVANSKEPRSHKRGKHIERKYHLIREIV

KAA0026233.1 gag/pol protein [Cucumis melo var. makuwa]3.8e-28550Show/hide
Query:  MTNSIVQLLASEKLNGNNYTTWKSNLNTILVIDDLRFVLTEECPPNPSSNANRTVRDAYDRWTKENDKARVYILASISDVLAKKHNVMGTAKEIMESLKG
        ++++ + +LA++KLNGNNY +WK+ +N +L+IDDLRFVL E+CP  P++NA RTVR+ Y+RW K N+KAR YILAS+S+VLAKKH  M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGNNYTTWKSNLNTILVIDDLRFVLTEECPPNPSSNANRTVRDAYDRWTKENDKARVYILASISDVLAKKHNVMGTAKEIMESLKG

Query:  MFGQPSFSLRYDAIKYIYNYRMKEGTSVREHVLDMMVHFNVTKKNEAVIDEKSQVSFIMESLPKSFFQFRTNVIMNKKEYNLTALPNELQTYQFLLTNKG
        MFGQ S+ +++DA+KYIYN RM EG SVREHVL+MMVHFNV   NEAVIDE SQVSFI+ESLP+SF QFR+N +MNK  Y LT L NELQT++ L+  KG
Subjt:  MFGQPSFSLRYDAIKYIYNYRMKEGTSVREHVLDMMVHFNVTKKNEAVIDEKSQVSFIMESLPKSFFQFRTNVIMNKKEYNLTALPNELQTYQFLLTNKG

Query:  QTGEANVAI-FKKLLRGLSSKNKYDYLYLMHHKSEALEKFREYK----TEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILN
        Q GEANVA   +K  RG +S  K          S   +K+++ K     +      KT K  ++ +G     + F      H  R+          Y+  
Subjt:  QTGEANVAI-FKKLLRGLSSKNKYDYLYLMHHKSEALEKFREYK----TEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILN

Query:  MVPSK--SVSETPYELWKGRKGYRAKETLELVHTDLCGPMNIKAQGGCEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRG
           +K   +++ P+      KG+RAKE LELVH+DLCGPMN+KA+GG EYFI+F DDYSRY Y+YLM HKSEALEKF+EYK EVEN L KTIKT RSDRG
Subjt:  MVPSK--SVSETPYELWKGRKGYRAKETLELVHTDLCGPMNIKAQGGCEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRG

Query:  GEYMDLRFQDYMIEHGIRSQLSAP-------------------------------------VETATYILNMVPSKSVSETPYELWKGRK-----------
        GEYMDL+FQ+Y++E GI SQLSAP                                     V+TA YILN VPSKSVSETP +LW G K           
Subjt:  GEYMDLRFQDYMIEHGIRSQLSAP-------------------------------------VETATYILNMVPSKSVSETPYELWKGRK-----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------DQPDGVKPIGCKWIYKRKRDQTGKVQ
                                                                                  DQPDGVKPIGCKWIYKRKR   GKVQ
Subjt:  --------------------------------------------------------------------------DQPDGVKPIGCKWIYKRKRDQTGKVQ

Query:  IFKARLVAKCYTQRERVDYEETFSPIAMLKSIRILLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQDREQRVCKLKRFIYGLKQASRSWNI
         FKARLVAK YTQ E VDYEETFSP+AMLKSIRILLSIA ++DYEIW+MDVKTAFLN NLEE+IYM Q EGFI   +EQ++CKL R IYGLKQASRSWNI
Subjt:  IFKARLVAKCYTQRERVDYEETFSPIAMLKSIRILLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQDREQRVCKLKRFIYGLKQASRSWNI

Query:  KFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLY--DYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAP
        +FDTAIKSYGF Q VDEPCVYKRI+N +VAFLVLY  D L                      L+G  I  L   +  +++  +FQ               
Subjt:  KFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLY--DYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAP

Query:  VETATYILNMVPSKSVSETPYELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQSPKTPQEVEDMRHIPYASAVGSLMYAMLC
        +  A ++L +           ++++ RK++   +      +I K     + ++SK+GLL FRHGV LSKEQ PKTPQ+VE+MRHIPYASAVGSLMYAMLC
Subjt:  VETATYILNMVPSKSVSETPYELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQSPKTPQEVEDMRHIPYASAVGSLMYAMLC

Query:  TQPDICYAIGIVSRYQSNSGHAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSIKQGCIANSTMEAEY
        T+PDICYA+GIVSRYQSN G AHWTAVK ILKYLRRTRDY L+YG+KDLILTGYTDSDFQTD DSRKSTS SVFTLNGGA++WRSIKQGCIA+STMEAEY
Subjt:  TQPDICYAIGIVSRYQSNSGHAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSIKQGCIANSTMEAEY

Query:  VAVYEAAKESVWHRKFLTHLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIV
        VA  EAAKE+VW R FL  LEVVPNM  P+TLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV
Subjt:  VAVYEAAKESVWHRKFLTHLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIV

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]3.3e-27344.54Show/hide
Query:  MTNSIVQLLASEKLNGNNYTTWKSNLNTILVIDDLRFVLTEECPPNPSSNANRTVRDAYDRWTKENDKARVYILASISDVLAKKHNVMGTAKEIMESLKG
        MT++ + +LA++KLNGNNY +WK+ +NT+L+IDDLRFVL EECP  P++NA RTVR+ Y+RW K N+KAR YILAS+S+VLAKKH  M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGNNYTTWKSNLNTILVIDDLRFVLTEECPPNPSSNANRTVRDAYDRWTKENDKARVYILASISDVLAKKHNVMGTAKEIMESLKG

Query:  MFGQPSFSLRYDAIKYIYNYRMKEGTSVREHVLDMMVHFNVTKKNEAVIDEKSQVSFIMESLPKSFFQFRTNVIMNKKEYNLTALPNELQTYQFLLTNKG
        MFGQ S+ +++DA+KYIYN RM EG SVREHVL+MMVHFNV + N AVIDE SQVSFI+ESLP+SF QFR+N +MNK  Y LT L NELQT++ L+  KG
Subjt:  MFGQPSFSLRYDAIKYIYNYRMKEGTSVREHVLDMMVHFNVTKKNEAVIDEKSQVSFIMESLPKSFFQFRTNVIMNKKEYNLTALPNELQTYQFLLTNKG

Query:  QTGEANVAI-FKKLLRGLSS-------------------------------------------------------------------KNKYDYLYL----
        Q GEANVA   +K  RG +S                                                                   + KYD L L    
Subjt:  QTGEANVAI-FKKLLRGLSS-------------------------------------------------------------------KNKYDYLYL----

Query:  -----------------MHHKSEALEKFREYKT-EVENLLGKTIKTLRSDRGGEYMDLR--------------------------FQDYMIEHGIRSQL-
                         +    + +  +R+ +T E+   +G          GG  + L+                           Q Y +   +     
Subjt:  -----------------MHHKSEALEKFREYKT-EVENLLGKTIKTLRSDRGGEYMDLR--------------------------FQDYMIEHGIRSQL-

Query:  --------SAPVETATYILNMVPSKSVSET----------------PYE---LWKGR-------------------------------------------
                SA +E   Y+L  + SK++  T                P E   LW  R                                           
Subjt:  --------SAPVETATYILNMVPSKSVSET----------------PYE---LWKGR-------------------------------------------

Query:  -KGYRAKETLELVHTDLCGPMNIKAQGGCEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS
         KG+RAKE LELVH+DLCGPMN+KA+GG EYFI+F DDYSRY Y+YLM HKSEALEKF+EYK EVEN L KTIKT RSDRGGEYMDL+FQ+Y++E GI S
Subjt:  -KGYRAKETLELVHTDLCGPMNIKAQGGCEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS

Query:  QLSAP-------------------------------------VETATYILNMVPSKSVSETPYELWKGRK------------------------------
        QLSAP                                     V+TA YILN VPSKSVSETP +LW GRK                              
Subjt:  QLSAP-------------------------------------VETATYILNMVPSKSVSETPYELWKGRK------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------DQPDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDY
                                                               DQPDGVKPIGCKWIYKRKR   GKVQ FKARLVAK YTQ E VDY
Subjt:  -------------------------------------------------------DQPDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDY

Query:  EETFSPIAMLKSIRILLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQDREQRVCKLKRFIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPC
        EETFSP+AMLKSIRILLSIA ++DYEIW+MDVKTAFLN NLEE+IYM Q EGFI   +EQ++CKL R IYGLKQASRSWNI+FDTAIKSYGF Q VDEPC
Subjt:  EETFSPIAMLKSIRILLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQDREQRVCKLKRFIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPC

Query:  VYKRIVNSTVAFLVLY--DYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSET
        VYKRI+N +VAFLVLY  D L                      L+G  I  L   +  +++  +FQ               +  A ++L +         
Subjt:  VYKRIVNSTVAFLVLY--DYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSET

Query:  PYELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQSPKTPQEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNS
          ++++ RK++   +      +I K     + ++SK+GLL FRHGV LSKEQ PKTPQ+VE+MRHIPYASAVGSLMYAMLCT+PDICYA+GIVSRYQSN 
Subjt:  PYELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQSPKTPQEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNS

Query:  GHAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSIKQGCIANSTMEAEYVAVYEAAKESVWHRKFLTH
        G AHWTAVK ILKYLRRTRDY L+YG+KDLILTGYTDSDFQTD DSRKSTS SVFTLNGGA++WRSIKQGCIA+STMEAEYVA  EAAKE+VW R FL  
Subjt:  GHAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSIKQGCIANSTMEAEYVAVYEAAKESVWHRKFLTH

Query:  LEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIV
        LEVVPNM  P+TLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV
Subjt:  LEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIV

KAA0067084.1 gag/pol protein [Cucumis melo var. makuwa]2.0e-27354.86Show/hide
Query:  MTNSIVQLLASEKLNGNNYTTWKSNLNTILVIDDLRFVLTEECPPNPSSNANRTVRDAYDRWTKENDKARVYILASISDVLAKKHNVMGTAKEIMESLKG
        MT++ + +LA++KLNGNNY +WK+  NT+L+IDDLRFVL EECP  P++NA RTVR+ Y+RW K N+KAR YILAS+S+VLAKKH  M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGNNYTTWKSNLNTILVIDDLRFVLTEECPPNPSSNANRTVRDAYDRWTKENDKARVYILASISDVLAKKHNVMGTAKEIMESLKG

Query:  MFGQPSFSLRYDAIKYIYNYRMKEGTSVREHVLDMMVHFNVTKKNEAVIDEKSQVSFIMESLPKSFFQFRTNVIMNKKEYNLTALPNELQTYQFLLTNKG
        MFGQ S+ +++DA+KYIYN RM +  SVREHVL MMVHFNV K N AVIDE SQVSFI+ESLP+S  QFR+N +MNK  Y LT L +ELQT++ L+  KG
Subjt:  MFGQPSFSLRYDAIKYIYNYRMKEGTSVREHVLDMMVHFNVTKKNEAVIDEKSQVSFIMESLPKSFFQFRTNVIMNKKEYNLTALPNELQTYQFLLTNKG

Query:  QTGEANVAI-FKKLLRGLSSKNKYDYLYLMHHKSEALEKFREYK----TEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILN
        Q GE NVA   +K  +G +S  K          S   +K+++ K     +      KT K  ++ +G     + F      H  R+      E      N
Subjt:  QTGEANVAI-FKKLLRGLSSKNKYDYLYLMHHKSEALEKFREYK----TEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILN

Query:  MVPSKSVSETPYELWKGRKGYRAKETLELVHTDLCGPMNIKAQGGCEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGE
              ++  P+      KG+RAKE LELVH+DLCGPMN+KA+G  EYFI+F DDYSRY Y+YLM HKSEALEKF+EYK EVEN L KTIKT RSDRGGE
Subjt:  MVPSKSVSETPYELWKGRKGYRAKETLELVHTDLCGPMNIKAQGGCEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGE

Query:  YMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSK---------SVSETPYELWKGRK---------------------------------------DQ
        YMDL+FQ+Y++E GI SQLS+P    T   N VP +          VSETP +LW GRK                                       +Q
Subjt:  YMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSK---------SVSETPYELWKGRK---------------------------------------DQ

Query:  PDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKSIRILLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQD
        PD  K IGCKWIYKRKR   GKVQ FKARLVAK YTQ E VDYEETFSP+AMLKSIRILLSIA ++DYEIW+MDVKTAFLN NLEE+IYM Q EGFI   
Subjt:  PDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKSIRILLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQD

Query:  REQRVCKLKRFIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGE
        +EQ+VCKL R IYGLKQASRSWNI+FDTAIKSYGF Q VDEPCVYKRI+N+++AFLVLY                      V+++L      L  + GG 
Subjt:  REQRVCKLKRFIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGE

Query:  YMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSETPYELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQSPKTPQE
          D++                      +++     K + E  + L                        D      +  LL  R+ V           Q+
Subjt:  YMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSETPYELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQSPKTPQE

Query:  VEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSESVFTLNG
        VE+MRHIPYASAVGSLMYAMLCT+PDICYA+GI+SRYQSN G A+WTA+K ILKYLRRTRDYML+YG+KDLILT YTDSDFQTD DSRKSTS SVFTLNG
Subjt:  VEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSESVFTLNG

Query:  GAIIWRSIKQGCIANSTMEAEYVAVYEAAKESVWHRKFLTHLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIV
        GA++WRSIKQGCI +S MEAEYVA  EAAKE+VW RKFL  LEVV N+  P+TLYCDNSGAVANSKEPRSHKR KHIE KYHLIREIV
Subjt:  GAIIWRSIKQGCIANSTMEAEYVAVYEAAKESVWHRKFLTHLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIV

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]3.3e-27344.54Show/hide
Query:  MTNSIVQLLASEKLNGNNYTTWKSNLNTILVIDDLRFVLTEECPPNPSSNANRTVRDAYDRWTKENDKARVYILASISDVLAKKHNVMGTAKEIMESLKG
        MT++ + +LA++KLNGNNY +WK+ +NT+L+IDDLRFVL EECP  P++NA RTVR+ Y+RW K N+KAR YILAS+S+VLAKKH  M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGNNYTTWKSNLNTILVIDDLRFVLTEECPPNPSSNANRTVRDAYDRWTKENDKARVYILASISDVLAKKHNVMGTAKEIMESLKG

Query:  MFGQPSFSLRYDAIKYIYNYRMKEGTSVREHVLDMMVHFNVTKKNEAVIDEKSQVSFIMESLPKSFFQFRTNVIMNKKEYNLTALPNELQTYQFLLTNKG
        MFGQ S+ +++DA+KYIYN RM EG SVREHVL+MMVHFNV + N AVIDE SQVSFI+ESLP+SF QFR+N +MNK  Y LT L NELQT++ L+  KG
Subjt:  MFGQPSFSLRYDAIKYIYNYRMKEGTSVREHVLDMMVHFNVTKKNEAVIDEKSQVSFIMESLPKSFFQFRTNVIMNKKEYNLTALPNELQTYQFLLTNKG

Query:  QTGEANVAI-FKKLLRGLSS-------------------------------------------------------------------KNKYDYLYL----
        Q GEANVA   +K  RG +S                                                                   + KYD L L    
Subjt:  QTGEANVAI-FKKLLRGLSS-------------------------------------------------------------------KNKYDYLYL----

Query:  -----------------MHHKSEALEKFREYKT-EVENLLGKTIKTLRSDRGGEYMDLR--------------------------FQDYMIEHGIRSQL-
                         +    + +  +R+ +T E+   +G          GG  + L+                           Q Y +   +     
Subjt:  -----------------MHHKSEALEKFREYKT-EVENLLGKTIKTLRSDRGGEYMDLR--------------------------FQDYMIEHGIRSQL-

Query:  --------SAPVETATYILNMVPSKSVSET----------------PYE---LWKGR-------------------------------------------
                SA +E   Y+L  + SK++  T                P E   LW  R                                           
Subjt:  --------SAPVETATYILNMVPSKSVSET----------------PYE---LWKGR-------------------------------------------

Query:  -KGYRAKETLELVHTDLCGPMNIKAQGGCEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS
         KG+RAKE LELVH+DLCGPMN+KA+GG EYFI+F DDYSRY Y+YLM HKSEALEKF+EYK EVEN L KTIKT RSDRGGEYMDL+FQ+Y++E GI S
Subjt:  -KGYRAKETLELVHTDLCGPMNIKAQGGCEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS

Query:  QLSAP-------------------------------------VETATYILNMVPSKSVSETPYELWKGRK------------------------------
        QLSAP                                     V+TA YILN VPSKSVSETP +LW GRK                              
Subjt:  QLSAP-------------------------------------VETATYILNMVPSKSVSETPYELWKGRK------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------DQPDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDY
                                                               DQPDGVKPIGCKWIYKRKR   GKVQ FKARLVAK YTQ E VDY
Subjt:  -------------------------------------------------------DQPDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDY

Query:  EETFSPIAMLKSIRILLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQDREQRVCKLKRFIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPC
        EETFSP+AMLKSIRILLSIA ++DYEIW+MDVKTAFLN NLEE+IYM Q EGFI   +EQ++CKL R IYGLKQASRSWNI+FDTAIKSYGF Q VDEPC
Subjt:  EETFSPIAMLKSIRILLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQDREQRVCKLKRFIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPC

Query:  VYKRIVNSTVAFLVLY--DYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSET
        VYKRI+N +VAFLVLY  D L                      L+G  I  L   +  +++  +FQ               +  A ++L +         
Subjt:  VYKRIVNSTVAFLVLY--DYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSET

Query:  PYELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQSPKTPQEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNS
          ++++ RK++   +      +I K     + ++SK+GLL FRHGV LSKEQ PKTPQ+VE+MRHIPYASAVGSLMYAMLCT+PDICYA+GIVSRYQSN 
Subjt:  PYELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQSPKTPQEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNS

Query:  GHAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSIKQGCIANSTMEAEYVAVYEAAKESVWHRKFLTH
        G AHWTAVK ILKYLRRTRDY L+YG+KDLILTGYTDSDFQTD DSRKSTS SVFTLNGGA++WRSIKQGCIA+STMEAEYVA  EAAKE+VW R FL  
Subjt:  GHAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSIKQGCIANSTMEAEYVAVYEAAKESVWHRKFLTH

Query:  LEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIV
        LEVVPNM  P+TLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV
Subjt:  LEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIV

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein1.6e-27344.54Show/hide
Query:  MTNSIVQLLASEKLNGNNYTTWKSNLNTILVIDDLRFVLTEECPPNPSSNANRTVRDAYDRWTKENDKARVYILASISDVLAKKHNVMGTAKEIMESLKG
        MT++ + +LA++KLNGNNY +WK+ +NT+L+IDDLRFVL EECP  P++NA RTVR+ Y+RW K N+KAR YILAS+S+VLAKKH  M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGNNYTTWKSNLNTILVIDDLRFVLTEECPPNPSSNANRTVRDAYDRWTKENDKARVYILASISDVLAKKHNVMGTAKEIMESLKG

Query:  MFGQPSFSLRYDAIKYIYNYRMKEGTSVREHVLDMMVHFNVTKKNEAVIDEKSQVSFIMESLPKSFFQFRTNVIMNKKEYNLTALPNELQTYQFLLTNKG
        MFGQ S+ +++DA+KYIYN RM EG SVREHVL+MMVHFNV + N AVIDE SQVSFI+ESLP+SF QFR+N +MNK  Y LT L NELQT++ L+  KG
Subjt:  MFGQPSFSLRYDAIKYIYNYRMKEGTSVREHVLDMMVHFNVTKKNEAVIDEKSQVSFIMESLPKSFFQFRTNVIMNKKEYNLTALPNELQTYQFLLTNKG

Query:  QTGEANVAI-FKKLLRGLSS-------------------------------------------------------------------KNKYDYLYL----
        Q GEANVA   +K  RG +S                                                                   + KYD L L    
Subjt:  QTGEANVAI-FKKLLRGLSS-------------------------------------------------------------------KNKYDYLYL----

Query:  -----------------MHHKSEALEKFREYKT-EVENLLGKTIKTLRSDRGGEYMDLR--------------------------FQDYMIEHGIRSQL-
                         +    + +  +R+ +T E+   +G          GG  + L+                           Q Y +   +     
Subjt:  -----------------MHHKSEALEKFREYKT-EVENLLGKTIKTLRSDRGGEYMDLR--------------------------FQDYMIEHGIRSQL-

Query:  --------SAPVETATYILNMVPSKSVSET----------------PYE---LWKGR-------------------------------------------
                SA +E   Y+L  + SK++  T                P E   LW  R                                           
Subjt:  --------SAPVETATYILNMVPSKSVSET----------------PYE---LWKGR-------------------------------------------

Query:  -KGYRAKETLELVHTDLCGPMNIKAQGGCEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS
         KG+RAKE LELVH+DLCGPMN+KA+GG EYFI+F DDYSRY Y+YLM HKSEALEKF+EYK EVEN L KTIKT RSDRGGEYMDL+FQ+Y++E GI S
Subjt:  -KGYRAKETLELVHTDLCGPMNIKAQGGCEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS

Query:  QLSAP-------------------------------------VETATYILNMVPSKSVSETPYELWKGRK------------------------------
        QLSAP                                     V+TA YILN VPSKSVSETP +LW GRK                              
Subjt:  QLSAP-------------------------------------VETATYILNMVPSKSVSETPYELWKGRK------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------DQPDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDY
                                                               DQPDGVKPIGCKWIYKRKR   GKVQ FKARLVAK YTQ E VDY
Subjt:  -------------------------------------------------------DQPDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDY

Query:  EETFSPIAMLKSIRILLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQDREQRVCKLKRFIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPC
        EETFSP+AMLKSIRILLSIA ++DYEIW+MDVKTAFLN NLEE+IYM Q EGFI   +EQ++CKL R IYGLKQASRSWNI+FDTAIKSYGF Q VDEPC
Subjt:  EETFSPIAMLKSIRILLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQDREQRVCKLKRFIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPC

Query:  VYKRIVNSTVAFLVLY--DYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSET
        VYKRI+N +VAFLVLY  D L                      L+G  I  L   +  +++  +FQ               +  A ++L +         
Subjt:  VYKRIVNSTVAFLVLY--DYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSET

Query:  PYELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQSPKTPQEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNS
          ++++ RK++   +      +I K     + ++SK+GLL FRHGV LSKEQ PKTPQ+VE+MRHIPYASAVGSLMYAMLCT+PDICYA+GIVSRYQSN 
Subjt:  PYELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQSPKTPQEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNS

Query:  GHAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSIKQGCIANSTMEAEYVAVYEAAKESVWHRKFLTH
        G AHWTAVK ILKYLRRTRDY L+YG+KDLILTGYTDSDFQTD DSRKSTS SVFTLNGGA++WRSIKQGCIA+STMEAEYVA  EAAKE+VW R FL  
Subjt:  GHAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSIKQGCIANSTMEAEYVAVYEAAKESVWHRKFLTH

Query:  LEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIV
        LEVVPNM  P+TLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV
Subjt:  LEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIV

A0A5A7SNP8 Gag/pol protein1.8e-28550Show/hide
Query:  MTNSIVQLLASEKLNGNNYTTWKSNLNTILVIDDLRFVLTEECPPNPSSNANRTVRDAYDRWTKENDKARVYILASISDVLAKKHNVMGTAKEIMESLKG
        ++++ + +LA++KLNGNNY +WK+ +N +L+IDDLRFVL E+CP  P++NA RTVR+ Y+RW K N+KAR YILAS+S+VLAKKH  M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGNNYTTWKSNLNTILVIDDLRFVLTEECPPNPSSNANRTVRDAYDRWTKENDKARVYILASISDVLAKKHNVMGTAKEIMESLKG

Query:  MFGQPSFSLRYDAIKYIYNYRMKEGTSVREHVLDMMVHFNVTKKNEAVIDEKSQVSFIMESLPKSFFQFRTNVIMNKKEYNLTALPNELQTYQFLLTNKG
        MFGQ S+ +++DA+KYIYN RM EG SVREHVL+MMVHFNV   NEAVIDE SQVSFI+ESLP+SF QFR+N +MNK  Y LT L NELQT++ L+  KG
Subjt:  MFGQPSFSLRYDAIKYIYNYRMKEGTSVREHVLDMMVHFNVTKKNEAVIDEKSQVSFIMESLPKSFFQFRTNVIMNKKEYNLTALPNELQTYQFLLTNKG

Query:  QTGEANVAI-FKKLLRGLSSKNKYDYLYLMHHKSEALEKFREYK----TEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILN
        Q GEANVA   +K  RG +S  K          S   +K+++ K     +      KT K  ++ +G     + F      H  R+          Y+  
Subjt:  QTGEANVAI-FKKLLRGLSSKNKYDYLYLMHHKSEALEKFREYK----TEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILN

Query:  MVPSK--SVSETPYELWKGRKGYRAKETLELVHTDLCGPMNIKAQGGCEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRG
           +K   +++ P+      KG+RAKE LELVH+DLCGPMN+KA+GG EYFI+F DDYSRY Y+YLM HKSEALEKF+EYK EVEN L KTIKT RSDRG
Subjt:  MVPSK--SVSETPYELWKGRKGYRAKETLELVHTDLCGPMNIKAQGGCEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRG

Query:  GEYMDLRFQDYMIEHGIRSQLSAP-------------------------------------VETATYILNMVPSKSVSETPYELWKGRK-----------
        GEYMDL+FQ+Y++E GI SQLSAP                                     V+TA YILN VPSKSVSETP +LW G K           
Subjt:  GEYMDLRFQDYMIEHGIRSQLSAP-------------------------------------VETATYILNMVPSKSVSETPYELWKGRK-----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------DQPDGVKPIGCKWIYKRKRDQTGKVQ
                                                                                  DQPDGVKPIGCKWIYKRKR   GKVQ
Subjt:  --------------------------------------------------------------------------DQPDGVKPIGCKWIYKRKRDQTGKVQ

Query:  IFKARLVAKCYTQRERVDYEETFSPIAMLKSIRILLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQDREQRVCKLKRFIYGLKQASRSWNI
         FKARLVAK YTQ E VDYEETFSP+AMLKSIRILLSIA ++DYEIW+MDVKTAFLN NLEE+IYM Q EGFI   +EQ++CKL R IYGLKQASRSWNI
Subjt:  IFKARLVAKCYTQRERVDYEETFSPIAMLKSIRILLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQDREQRVCKLKRFIYGLKQASRSWNI

Query:  KFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLY--DYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAP
        +FDTAIKSYGF Q VDEPCVYKRI+N +VAFLVLY  D L                      L+G  I  L   +  +++  +FQ               
Subjt:  KFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLY--DYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAP

Query:  VETATYILNMVPSKSVSETPYELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQSPKTPQEVEDMRHIPYASAVGSLMYAMLC
        +  A ++L +           ++++ RK++   +      +I K     + ++SK+GLL FRHGV LSKEQ PKTPQ+VE+MRHIPYASAVGSLMYAMLC
Subjt:  VETATYILNMVPSKSVSETPYELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQSPKTPQEVEDMRHIPYASAVGSLMYAMLC

Query:  TQPDICYAIGIVSRYQSNSGHAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSIKQGCIANSTMEAEY
        T+PDICYA+GIVSRYQSN G AHWTAVK ILKYLRRTRDY L+YG+KDLILTGYTDSDFQTD DSRKSTS SVFTLNGGA++WRSIKQGCIA+STMEAEY
Subjt:  TQPDICYAIGIVSRYQSNSGHAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSIKQGCIANSTMEAEY

Query:  VAVYEAAKESVWHRKFLTHLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIV
        VA  EAAKE+VW R FL  LEVVPNM  P+TLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV
Subjt:  VAVYEAAKESVWHRKFLTHLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIV

A0A5A7VI97 Gag/pol protein9.5e-27454.86Show/hide
Query:  MTNSIVQLLASEKLNGNNYTTWKSNLNTILVIDDLRFVLTEECPPNPSSNANRTVRDAYDRWTKENDKARVYILASISDVLAKKHNVMGTAKEIMESLKG
        MT++ + +LA++KLNGNNY +WK+  NT+L+IDDLRFVL EECP  P++NA RTVR+ Y+RW K N+KAR YILAS+S+VLAKKH  M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGNNYTTWKSNLNTILVIDDLRFVLTEECPPNPSSNANRTVRDAYDRWTKENDKARVYILASISDVLAKKHNVMGTAKEIMESLKG

Query:  MFGQPSFSLRYDAIKYIYNYRMKEGTSVREHVLDMMVHFNVTKKNEAVIDEKSQVSFIMESLPKSFFQFRTNVIMNKKEYNLTALPNELQTYQFLLTNKG
        MFGQ S+ +++DA+KYIYN RM +  SVREHVL MMVHFNV K N AVIDE SQVSFI+ESLP+S  QFR+N +MNK  Y LT L +ELQT++ L+  KG
Subjt:  MFGQPSFSLRYDAIKYIYNYRMKEGTSVREHVLDMMVHFNVTKKNEAVIDEKSQVSFIMESLPKSFFQFRTNVIMNKKEYNLTALPNELQTYQFLLTNKG

Query:  QTGEANVAI-FKKLLRGLSSKNKYDYLYLMHHKSEALEKFREYK----TEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILN
        Q GE NVA   +K  +G +S  K          S   +K+++ K     +      KT K  ++ +G     + F      H  R+      E      N
Subjt:  QTGEANVAI-FKKLLRGLSSKNKYDYLYLMHHKSEALEKFREYK----TEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILN

Query:  MVPSKSVSETPYELWKGRKGYRAKETLELVHTDLCGPMNIKAQGGCEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGE
              ++  P+      KG+RAKE LELVH+DLCGPMN+KA+G  EYFI+F DDYSRY Y+YLM HKSEALEKF+EYK EVEN L KTIKT RSDRGGE
Subjt:  MVPSKSVSETPYELWKGRKGYRAKETLELVHTDLCGPMNIKAQGGCEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGE

Query:  YMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSK---------SVSETPYELWKGRK---------------------------------------DQ
        YMDL+FQ+Y++E GI SQLS+P    T   N VP +          VSETP +LW GRK                                       +Q
Subjt:  YMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSK---------SVSETPYELWKGRK---------------------------------------DQ

Query:  PDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKSIRILLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQD
        PD  K IGCKWIYKRKR   GKVQ FKARLVAK YTQ E VDYEETFSP+AMLKSIRILLSIA ++DYEIW+MDVKTAFLN NLEE+IYM Q EGFI   
Subjt:  PDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKSIRILLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQD

Query:  REQRVCKLKRFIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGE
        +EQ+VCKL R IYGLKQASRSWNI+FDTAIKSYGF Q VDEPCVYKRI+N+++AFLVLY                      V+++L      L  + GG 
Subjt:  REQRVCKLKRFIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGE

Query:  YMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSETPYELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQSPKTPQE
          D++                      +++     K + E  + L                        D      +  LL  R+ V           Q+
Subjt:  YMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSETPYELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQSPKTPQE

Query:  VEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSESVFTLNG
        VE+MRHIPYASAVGSLMYAMLCT+PDICYA+GI+SRYQSN G A+WTA+K ILKYLRRTRDYML+YG+KDLILT YTDSDFQTD DSRKSTS SVFTLNG
Subjt:  VEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSESVFTLNG

Query:  GAIIWRSIKQGCIANSTMEAEYVAVYEAAKESVWHRKFLTHLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIV
        GA++WRSIKQGCI +S MEAEYVA  EAAKE+VW RKFL  LEVV N+  P+TLYCDNSGAVANSKEPRSHKR KHIE KYHLIREIV
Subjt:  GAIIWRSIKQGCIANSTMEAEYVAVYEAAKESVWHRKFLTHLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIV

A0A5D3CPJ6 Gag/pol protein1.6e-27344.54Show/hide
Query:  MTNSIVQLLASEKLNGNNYTTWKSNLNTILVIDDLRFVLTEECPPNPSSNANRTVRDAYDRWTKENDKARVYILASISDVLAKKHNVMGTAKEIMESLKG
        MT++ + +LA++KLNGNNY +WK+ +NT+L+IDDLRFVL EECP  P++NA RTVR+ Y+RW K N+KAR YILAS+S+VLAKKH  M TA+EIM+SL+ 
Subjt:  MTNSIVQLLASEKLNGNNYTTWKSNLNTILVIDDLRFVLTEECPPNPSSNANRTVRDAYDRWTKENDKARVYILASISDVLAKKHNVMGTAKEIMESLKG

Query:  MFGQPSFSLRYDAIKYIYNYRMKEGTSVREHVLDMMVHFNVTKKNEAVIDEKSQVSFIMESLPKSFFQFRTNVIMNKKEYNLTALPNELQTYQFLLTNKG
        MFGQ S+ +++DA+KYIYN RM EG SVREHVL+MMVHFNV + N AVIDE SQVSFI+ESLP+SF QFR+N +MNK  Y LT L NELQT++ L+  KG
Subjt:  MFGQPSFSLRYDAIKYIYNYRMKEGTSVREHVLDMMVHFNVTKKNEAVIDEKSQVSFIMESLPKSFFQFRTNVIMNKKEYNLTALPNELQTYQFLLTNKG

Query:  QTGEANVAI-FKKLLRGLSS-------------------------------------------------------------------KNKYDYLYL----
        Q GEANVA   +K  RG +S                                                                   + KYD L L    
Subjt:  QTGEANVAI-FKKLLRGLSS-------------------------------------------------------------------KNKYDYLYL----

Query:  -----------------MHHKSEALEKFREYKT-EVENLLGKTIKTLRSDRGGEYMDLR--------------------------FQDYMIEHGIRSQL-
                         +    + +  +R+ +T E+   +G          GG  + L+                           Q Y +   +     
Subjt:  -----------------MHHKSEALEKFREYKT-EVENLLGKTIKTLRSDRGGEYMDLR--------------------------FQDYMIEHGIRSQL-

Query:  --------SAPVETATYILNMVPSKSVSET----------------PYE---LWKGR-------------------------------------------
                SA +E   Y+L  + SK++  T                P E   LW  R                                           
Subjt:  --------SAPVETATYILNMVPSKSVSET----------------PYE---LWKGR-------------------------------------------

Query:  -KGYRAKETLELVHTDLCGPMNIKAQGGCEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS
         KG+RAKE LELVH+DLCGPMN+KA+GG EYFI+F DDYSRY Y+YLM HKSEALEKF+EYK EVEN L KTIKT RSDRGGEYMDL+FQ+Y++E GI S
Subjt:  -KGYRAKETLELVHTDLCGPMNIKAQGGCEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRS

Query:  QLSAP-------------------------------------VETATYILNMVPSKSVSETPYELWKGRK------------------------------
        QLSAP                                     V+TA YILN VPSKSVSETP +LW GRK                              
Subjt:  QLSAP-------------------------------------VETATYILNMVPSKSVSETPYELWKGRK------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------DQPDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDY
                                                               DQPDGVKPIGCKWIYKRKR   GKVQ FKARLVAK YTQ E VDY
Subjt:  -------------------------------------------------------DQPDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDY

Query:  EETFSPIAMLKSIRILLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQDREQRVCKLKRFIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPC
        EETFSP+AMLKSIRILLSIA ++DYEIW+MDVKTAFLN NLEE+IYM Q EGFI   +EQ++CKL R IYGLKQASRSWNI+FDTAIKSYGF Q VDEPC
Subjt:  EETFSPIAMLKSIRILLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQDREQRVCKLKRFIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPC

Query:  VYKRIVNSTVAFLVLY--DYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSET
        VYKRI+N +VAFLVLY  D L                      L+G  I  L   +  +++  +FQ               +  A ++L +         
Subjt:  VYKRIVNSTVAFLVLY--DYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSET

Query:  PYELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQSPKTPQEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNS
          ++++ RK++   +      +I K     + ++SK+GLL FRHGV LSKEQ PKTPQ+VE+MRHIPYASAVGSLMYAMLCT+PDICYA+GIVSRYQSN 
Subjt:  PYELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQSPKTPQEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNS

Query:  GHAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSIKQGCIANSTMEAEYVAVYEAAKESVWHRKFLTH
        G AHWTAVK ILKYLRRTRDY L+YG+KDLILTGYTDSDFQTD DSRKSTS SVFTLNGGA++WRSIKQGCIA+STMEAEYVA  EAAKE+VW R FL  
Subjt:  GHAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSIKQGCIANSTMEAEYVAVYEAAKESVWHRKFLTH

Query:  LEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIV
        LEVVPNM  P+TLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV
Subjt:  LEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIV

E2GK51 Gag/pol protein (Fragment)4.4e-28746.66Show/hide
Query:  MTNSIVQLLASEKLNGNNYTTWKSNLNTILVIDDLRFVLTEECPPNPSSNANRTVRDAYDRWTKENDKARVYILASISDVLAKKHNVMGTAKEIMESLKG
        M  SIVQLLASEKLNG+NY+ WKSNLNTILV+DDLRFVLTEECP  P+ NANRTVR+AYDRW K NDKARVYILAS++DVLAKKH+ + TAK IM+SL+ 
Subjt:  MTNSIVQLLASEKLNGNNYTTWKSNLNTILVIDDLRFVLTEECPPNPSSNANRTVRDAYDRWTKENDKARVYILASISDVLAKKHNVMGTAKEIMESLKG

Query:  MFGQPSFSLRYDAIKYIYNYRMKEGTSVREHVLDMMVHFNVTKKNEAVIDEKSQVSFIMESLPKSFFQFRTNVIMNKKEYNLTALPNELQTYQFLLTNKG
        MFGQPS+SLR++AIK+IY  RMKEGTSVREHVLDMM+HFN+ + N   IDE +QVSFI++SLPKSF  F+TN  +NK E+NLT L NELQ +Q L  +KG
Subjt:  MFGQPSFSLRYDAIKYIYNYRMKEGTSVREHVLDMMVHFNVTKKNEAVIDEKSQVSFIMESLPKSFFQFRTNVIMNKKEYNLTALPNELQTYQFLLTNKG

Query:  QTGEANVAIFK-KLLRGLSSKN------------------------------------------------------------KYDYLYL-----------
        +  EANVA+ K K +RG SSKN                                                            KYD L +           
Subjt:  QTGEANVAIFK-KLLRGLSSKN------------------------------------------------------------KYDYLYL-----------

Query:  -------MHHKSEALEKFREYKTEVENLLGKTIKT---LRSDRGGEYMDLRFQD-YMIEHGI-------RSQLSAP-----VETATYILNMV--------
                +H   + ++   +K   E  +   + T   + ++  G+ + L FQD Y+I   +       R+ +S       + T ++ +N V        
Subjt:  -------MHHKSEALEKFREYKTEVENLLGKTIKT---LRSDRGGEYMDLRFQD-YMIEHGI-------RSQLSAP-----VETATYILNMV--------

Query:  ----------------------------------PSKSVSETPYELWKGR--------------------------------------------KGYRAK
                                            + VS   Y LW  R                                            KG RAK
Subjt:  ----------------------------------PSKSVSETPYELWKGR--------------------------------------------KGYRAK

Query:  ETLELVHTDLCGPMNIKAQGGCEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAP--
          LELVH+DLCGPMN+KA+GG EYFISFIDD+SRY ++YL+HHKSE+ EKF+EYK EVEN +GKTIKTLRSDRGGEYMD +FQDY+IE GI+SQLSAP  
Subjt:  ETLELVHTDLCGPMNIKAQGGCEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAP--

Query:  -----------------------------------VETATYILNMVPSKSVSETPYELWKGRK-------------------------------------
                                           +ETA +ILN VPSKSV ETPYELWKGRK                                     
Subjt:  -----------------------------------VETATYILNMVPSKSVSETPYELWKGRK-------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------DQPDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKSI
                                                  D P  VKPIGCKWIYKRKRDQ GKVQ FKARLVAK YTQ+E VDYEETFSP+AMLKSI
Subjt:  ------------------------------------------DQPDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKSI

Query:  RILLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQDREQRVCKLKRFIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFL
        RILLSIATFY+YEIW+MDVKTAFLN NLEESIYM Q EGFI QD+EQ+VCKL++ IYGLKQASRSWNI+FDTAIKSYGF+QNVDEPCVYK+IVNS VAFL
Subjt:  RILLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQDREQRVCKLKRFIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFL

Query:  VLY-DYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSETPYELWKGRKDQPDG
        +LY D + L+ +         EY T+V+                ++++ +FQ               +  A YIL +           ++ + RK++   
Subjt:  VLY-DYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSETPYELWKGRKDQPDG

Query:  VKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQSPKTPQEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTAVKNILKY
        +      +I K       ++SKKG L FRHG+HLSKEQ PKTPQEVEDMR+IPY+SAVGSLMYAMLCT+PDICY++GIVSRYQSN G  HWTAVKNILKY
Subjt:  VKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQSPKTPQEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTAVKNILKY

Query:  LRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSIKQGCIANSTMEAEYVAVYEAAKESVWHRKFLTHLEVVPNMHLPVTLY
        LRRTR+YML+YGAKDLILTGYTDSDFQ+D D+RKSTS SVFTLNGGA++WRS+KQ CIA+STMEAEYVA  EAAKE+VW RKFLT LEVVPNMHLP+TLY
Subjt:  LRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSIKQGCIANSTMEAEYVAVYEAAKESVWHRKFLTHLEVVPNMHLPVTLY

Query:  CDNSGAVANSKEPRSHKRGKHIERKYHLIREIV
        CDNSGAVANSKEPRSHKRGKHIERKYHLIREIV
Subjt:  CDNSGAVANSKEPRSHKRGKHIERKYHLIREIV

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.3e-5928.74Show/hide
Query:  IDDYSRYDYLYLMHHKSEALE-KFREYKTEVENLLGKTI---KTLRSDRGGEYMDLRFQD--YMIEHGIRSQLSAPVETATYILNMVPSKSVSETPYELW
        ID+ ++ D + +++ +SE L+ K +    E +N L K +    T+ +D    + +++++D     E  I ++L+A     T+ +                
Subjt:  IDDYSRYDYLYLMHHKSEALE-KFREYKTEVENLLGKTI---KTLRSDRGGEYMDLRFQD--YMIEHGIRSQLSAPVETATYILNMVPSKSVSETPYELW

Query:  KGRKDQPDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKSIRILLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLE
             +P+    +  +W++  K ++ G    +KARLVA+ +TQ+ ++DYEETF+P+A + S R +LS+   Y+ ++ +MDVKTAFLN  L+E IYM   +
Subjt:  KGRKDQPDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKSIRILLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLE

Query:  GFIEQDREQRVCKLKRFIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVY---KRIVNSTVAFLVLYDYLYLMHHKSEALEKFREYKTEVENLLGKTIK
        G         VCKL + IYGLKQA+R W   F+ A+K   F  +  + C+Y   K  +N  +  L+  D + +       +  F+ Y       L +  +
Subjt:  GFIEQDREQRVCKLKRFIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVY---KRIVNSTVAFLVLYDYLYLMHHKSEALEKFREYKTEVENLLGKTIK

Query:  TLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSETPYELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSK
            +    ++ +R +  M E  I    SA V+      NM    +VS                  P+  K  Y+                      L+ 
Subjt:  TLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSETPYELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSK

Query:  EQSPKTPQEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTAVKNILKYLRRTRDYMLMYG---AKDLILTGYTDSDFQTDVDSR
        ++   T          P  S +G LMY MLCT+PD+  A+ I+SRY S +    W  +K +L+YL+ T D  L++    A +  + GY DSD+      R
Subjt:  EQSPKTPQEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTAVKNILKYLRRTRDYMLMYG---AKDLILTGYTDSDFQTDVDSR

Query:  KSTSESVFTL-NGGAIIWRSIKQGCIANSTMEAEYVAVYEAAKESVWHRKFLTHLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREI
        KST+  +F + +   I W + +Q  +A S+ EAEY+A++EA +E++W +  LT + +   +  P+ +Y DN G ++ +  P  HKR KHI+ KYH  RE 
Subjt:  KSTSESVFTL-NGGAIIWRSIKQGCIANSTMEAEYVAVYEAAKESVWHRKFLTHLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREI

Query:  VQ
        VQ
Subjt:  VQ

P0CV72 Secreted RxLR effector protein 1618.0e-2846.21Show/hide
Query:  MRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTAVKNILKYLRRTRDYMLMY-GAKDLILTGYTDSDFQTDVDSRKSTSESVFTLNGGA
        M+++PY SAVG++MY M+ T+PD+  A+G++S++ S+    HW A+K +L+YL+ T+ Y L +  A    L GY+D+D+  DV+SR+STS  +F LNGG 
Subjt:  MRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTAVKNILKYLRRTRDYMLMY-GAKDLILTGYTDSDFQTDVDSRKSTSESVFTLNGGA

Query:  IIWRSIKQGCIANSTMEAEYVAVYEAAKESVW
        + WRS KQ  +A S+ E EY+A+ EA +E+VW
Subjt:  IIWRSIKQGCIANSTMEAEYVAVYEAAKESVW

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.8e-8930.72Show/hide
Query:  YLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAP-------------------------------------V
        ++Y++  K +  + F+++   VE   G+ +K LRSD GGEY    F++Y   HGIR + + P                                     V
Subjt:  YLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAP-------------------------------------V

Query:  ETATYILNMVPSKSVS-ETPYELWKGRK---------GYRAKETLELVHTDLCGPMNIKAQGGCEYFISFIDD---YSRYDYLYLMHHKSEALEKFREYK
        +TA Y++N  PS  ++ E P  +W  ++         G RA   +           +I        FI + D+   Y  +D +     +S  +  FRE +
Subjt:  ETATYILNMVPSKSVS-ETPYELWKGRK---------GYRAKETLELVHTDLCGPMNIKAQGGCEYFISFIDD---YSRYDYLYLMHHKSEALEKFREYK

Query:  T--------EVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIR--------SQLSAPVETATYILN--------------MVPSKSVSETPY------
                 +V+N +     T+ S            D + E G +         QL   VE   +                  V S+    T Y      
Subjt:  T--------EVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIR--------SQLSAPVETATYILN--------------MVPSKSVSETPY------

Query:  ----------------ELWKGRKDQ---------------PDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKSIRILL
                        +L K  +++               P G +P+ CKW++K K+D   K+  +KARLV K + Q++ +D++E FSP+  + SIR +L
Subjt:  ----------------ELWKGRKDQ---------------PDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKSIRILL

Query:  SIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQDREQRVCKLKRFIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVY-KRIVNSTVAFLVLY
        S+A   D E+ ++DVKTAFL+ +LEE IYM Q EGF    ++  VCKL + +YGLKQA R W +KFD+ +KS  + +   +PCVY KR   +    L+LY
Subjt:  SIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQDREQRVCKLKRFIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVY-KRIVNSTVAFLVLY

Query:  DYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSE-TPYELWKGRKDQPDGVKP
            L+  K + L                 I  L+ D    + D++                 +  A  IL M   K V E T  +LW  ++        
Subjt:  DYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSE-TPYELWKGRKDQPDGVKP

Query:  IGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQSPKTPQEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTAVKNILKYLRR
           K+I +       K++K         + LSK+  P T +E  +M  +PY+SAVGSLMYAM+CT+PDI +A+G+VSR+  N G  HW AVK IL+YLR 
Subjt:  IGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQSPKTPQEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTAVKNILKYLRR

Query:  TRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSIKQGCIANSTMEAEYVAVYEAAKESVWHRKFLTHLEVVPNMHLPVTLYCDN
        T    L +G  D IL GYTD+D   D+D+RKS++  +FT +GGAI W+S  Q C+A ST EAEY+A  E  KE +W ++FL  L +    ++   +YCD+
Subjt:  TRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSIKQGCIANSTMEAEYVAVYEAAKESVWHRKFLTHLEVVPNMHLPVTLYCDN

Query:  SGAVANSKEPRSHKRGKHIERKYHLIREIV
          A+  SK    H R KHI+ +YH IRE+V
Subjt:  SGAVANSKEPRSHKRGKHIERKYHLIREIV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.7e-5030Show/hide
Query:  NMVPSKSVSETPYELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKSIRILLSIATFYDYEIWEMDVKTA
        N + S+  ++     W      P  V  +GC+WI+ +K +  G +  +KARLVAK Y QR  +DY ETFSP+    SIRI+L +A    + I ++DV  A
Subjt:  NMVPSKSVSETPYELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKSIRILLSIATFYDYEIWEMDVKTA

Query:  FLNDNLEESIYMTQLEGFIEQDREQRVCKLKRFIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLY--DYLY------LMHHKSE
        FL   L + +YM+Q  GFI++DR   VCKL++ +YGLKQA R+W ++    + + GF  +V +  ++      ++ ++++Y  D L       L+H+  +
Subjt:  FLNDNLEESIYMTQLEGFIEQDREQRVCKLKRFIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLY--DYLY------LMHHKSE

Query:  ALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSETPYELWKGRKDQPDGVKPIGCKWIYKRKRD
         L +    K   E      I+  R   G   + L  + Y+++   R+             NM+ +K V+ TP                            
Subjt:  ALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSETPYELWKGRKDQPDGVKPIGCKWIYKRKRD

Query:  QTGKDSKKGLLSFRHGVHLSKEQSPKTPQEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTAVKNILKYLRRTRDY-MLMYGAK
             +    LS   G  L+       P E        Y   VGSL Y +  T+PDI YA+  +S++       H  A+K IL+YL  T ++ + +    
Subjt:  QTGKDSKKGLLSFRHGVHLSKEQSPKTPQEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTAVKNILKYLRRTRDY-MLMYGAK

Query:  DLILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSIKQGCIANSTMEAEYVAVYEAAKESVWHRKFLTHLEVVPNMHLPVTLYCDNSGAVANSKEPR
         L L  Y+D+D+  D D   ST+  +  L    I W S KQ  +  S+ EAEY +V   + E  W    LT L +   +  P  +YCDN GA      P 
Subjt:  DLILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSIKQGCIANSTMEAEYVAVYEAAKESVWHRKFLTHLEVVPNMHLPVTLYCDNSGAVANSKEPR

Query:  SHKRGKHIERKYHLIREIVQ
         H R KHI   YH IR  VQ
Subjt:  SHKRGKHIERKYHLIREIVQ

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.6e-0734.34Show/hide
Query:  LELVHTDLCGPMNIKAQGGCEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVET
        LE +++D+     I +     Y++ F+D ++RY +LY +  KS+  E F  +K  +EN     I T  SD GGE++ L   +Y  +HGI S L++P  T
Subjt:  LELVHTDLCGPMNIKAQGGCEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVET

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.1e-5230.32Show/hide
Query:  PDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKSIRILLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQD
        P  V  +GC+WI+ +K +  G +  +KARLVAK Y QR  +DY ETFSP+    SIRI+L +A    + I ++DV  AFL   L + +YM+Q  GF+++D
Subjt:  PDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKSIRILLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQD

Query:  REQRVCKLKRFIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLY--DYL------YLMHHKSEALEKFREYKTEVENLLGKTIKT
        R   VC+L++ IYGLKQA R+W ++  T + + GF  ++ +  ++      ++ ++++Y  D L       L+ H  +AL +    K   +      I+ 
Subjt:  REQRVCKLKRFIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLY--DYL------YLMHHKSEALEKFREYKTEVENLLGKTIKT

Query:  LRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSETPYELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKE
         R  +G   + L  + Y ++   R+             NM+ +K V+             P    P                              L+  
Subjt:  LRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSETPYELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKE

Query:  QSPKTPQEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTAVKNILKYLRRTRDY-MLMYGAKDLILTGYTDSDFQTDVDSRKST
           K P   E      Y   VGSL Y +  T+PD+ YA+  +S+Y       HW A+K +L+YL  T D+ + +     L L  Y+D+D+  D D   ST
Subjt:  QSPKTPQEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTAVKNILKYLRRTRDY-MLMYGAKDLILTGYTDSDFQTDVDSRKST

Query:  SESVFTLNGGAIIWRSIKQGCIANSTMEAEYVAVYEAAKESVWHRKFLTHLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ
        +  +  L    I W S KQ  +  S+ EAEY +V   + E  W    LT L +   +  P  +YCDN GA      P  H R KHI   YH IR  VQ
Subjt:  SESVFTLNGGAIIWRSIKQGCIANSTMEAEYVAVYEAAKESVWHRKFLTHLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.1e-0835Show/hide
Query:  AKETLELVHTDLCGPMNIKAQGGCEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAP
        + + LE +++D+     I +     Y++ F+D ++RY +LY +  KS+  + F  +K+ VEN     I TL SD GGE++ LR  DY+ +HGI    S P
Subjt:  AKETLELVHTDLCGPMNIKAQGGCEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAP

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.8e-5129.47Show/hide
Query:  PDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKSIRILLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQD
        P   KPIGCKW+YK K +  G ++ +KARLVAK YTQ+E +D+ ETFSP+  L S++++L+I+  Y++ + ++D+  AFLN +L+E IYM    G+  + 
Subjt:  PDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKSIRILLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQD

Query:  RE----QRVCKLKRFIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNST-VAFLVLYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRS
         +      VC LK+ IYGLKQASR W +KF   +  +GF Q+  +   + +I  +  +  LV  D + +  +   A++   E K+++++           
Subjt:  RE----QRVCKLKRFIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNST-VAFLVLYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRS

Query:  DRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSETPYELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQSP
            +  DL                 P++   Y L +  ++S +       K   D  D    +GC      K      D      +   G  +  +   
Subjt:  DRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSETPYELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQSP

Query:  KTPQEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTAVKNILKYLRRTRDYMLMYGAK-DLILTGYTDSDFQTDVDSRKSTSES
                     Y   +G LMY  + T+ DI +A+  +S++      AH  AV  IL Y++ T    L Y ++ ++ L  ++D+ FQ+  D+R+ST+  
Subjt:  KTPQEVEDMRHIPYASAVGSLMYAMLCTQPDICYAIGIVSRYQSNSGHAHWTAVKNILKYLRRTRDYMLMYGAK-DLILTGYTDSDFQTDVDSRKSTSES

Query:  VFTLNGGAIIWRSIKQGCIANSTMEAEYVAVYEAAKESVWHRKFLTHLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIRE
           L    I W+S KQ  ++ S+ EAEY A+  A  E +W  +F   L++   +  P  L+CDN+ A+  +     H+R KHIE   H +RE
Subjt:  VFTLNGGAIIWRSIKQGCIANSTMEAEYVAVYEAAKESVWHRKFLTHLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIRE

ATMG00240.1 Gag-Pol-related retrotransposon family protein6.3e-0435.21Show/hide
Query:  TQPDICYAIGIVSRYQSNSGHAHWTAVKNILKYLRRTRDYMLMYGA-KDLILTGYTDSDFQTDVDSRKSTS
        T+PD+ +A+  +S++ S S  A   AV  +L Y++ T    L Y A  DL L  + DSD+ +  D+R+S +
Subjt:  TQPDICYAIGIVSRYQSNSGHAHWTAVKNILKYLRRTRDYMLMYGA-KDLILTGYTDSDFQTDVDSRKSTS

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.7e-0742.11Show/hide
Query:  IGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKSIRILLSIA
        +GCKW++K K    G +   KARLVAK + Q E + + ET+SP+    +IR +L++A
Subjt:  IGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKSIRILLSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAAACTCAATAGTACAATTACTTGCTTCTGAGAAATTAAACGGCAACAATTACACAACTTGGAAATCAAACCTAAATACAATACTTGTAATTGATGATTTGAGGTT
TGTTTTAACTGAGGAATGTCCTCCAAACCCTAGCTCAAATGCAAACCGAACAGTTCGGGATGCATATGACAGATGGACAAAGGAAAATGACAAAGCCCGAGTATACATTT
TAGCCAGCATATCTGATGTTTTAGCTAAGAAACACAATGTCATGGGTACTGCTAAAGAGATTATGGAATCTCTGAAAGGGATGTTTGGACAACCGTCCTTCTCCCTTAGA
TATGACGCCATAAAATACATTTACAACTACCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCATTTCAATGTGACAAAAAAAAATGAAGC
TGTCATTGATGAGAAGAGTCAAGTTAGTTTTATCATGGAGTCTCTTCCGAAGAGTTTCTTCCAGTTTCGCACAAATGTGATAATGAACAAGAAAGAATATAACTTGACTG
CTCTTCCCAATGAGCTACAGACTTATCAGTTCCTCTTAACGAACAAGGGACAAACAGGAGAAGCAAATGTTGCTATCTTTAAGAAATTACTACGAGGATTGTCCTCCAAA
AATAAGTATGACTATCTTTACCTAATGCATCACAAGTCCGAAGCTCTTGAAAAATTCAGAGAGTATAAGACTGAGGTTGAGAATCTATTAGGTAAAACTATTAAAACACT
TCGATCAGATCGAGGTGGAGAGTATATGGACTTAAGATTTCAGGACTATATGATAGAACATGGAATAAGGTCTCAACTCTCAGCCCCTGTGGAAACTGCTACATACATTT
TGAACATGGTTCCTTCTAAGAGTGTTTCAGAAACACCTTATGAGTTATGGAAAGGGCGTAAAGGTTATAGAGCCAAAGAAACCTTAGAACTTGTGCATACAGATCTCTGT
GGTCCAATGAATATCAAAGCACAAGGAGGGTGTGAATATTTCATCAGTTTTATTGATGATTATTCAAGGTATGACTATCTTTACCTAATGCATCACAAGTCCGAAGCTCT
TGAAAAATTCAGAGAGTATAAGACTGAGGTTGAGAATCTATTAGGTAAAACTATTAAAACACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTAAGATTTCAGGACT
ATATGATAGAACATGGAATAAGGTCTCAACTCTCAGCCCCTGTGGAAACTGCTACATACATTTTGAACATGGTTCCTTCTAAGAGTGTTTCAGAAACACCTTATGAGTTA
TGGAAAGGGCGTAAAGATCAACCAGATGGGGTAAAACCTATTGGTTGCAAATGGATCTATAAGAGGAAACGAGACCAAACTGGTAAGGTACAGATCTTTAAAGCTCGACT
TGTAGCAAAGTGTTATACCCAAAGAGAGAGGGTGGACTATGAAGAAACTTTCTCTCCTATTGCTATGCTTAAATCGATTAGAATACTCTTGTCCATTGCCACATTTTATG
ATTATGAAATTTGGGAAATGGATGTTAAGACAGCCTTTCTAAACGACAATCTTGAAGAGAGTATCTATATGACTCAACTAGAGGGGTTCATTGAACAGGATCGTGAGCAA
AGGGTTTGCAAGCTTAAAAGATTCATTTATGGGTTAAAGCAAGCATCTCGATCCTGGAATATAAAGTTTGATACTGCGATCAAATCTTATGGCTTTAAACAGAATGTTGA
CGAACCTTGTGTTTATAAAAGGATAGTCAACTCTACAGTAGCTTTCCTAGTATTGTATGACTATCTTTACCTAATGCATCACAAGTCCGAAGCTCTTGAAAAATTCAGAG
AGTATAAGACTGAGGTTGAGAATCTATTAGGTAAAACTATTAAAACACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTAAGATTTCAGGACTATATGATAGAACAT
GGAATAAGGTCTCAACTCTCAGCCCCTGTGGAAACTGCTACATACATTTTGAACATGGTTCCTTCTAAGAGTGTTTCAGAAACACCTTATGAGTTATGGAAAGGGCGTAA
AGATCAACCAGATGGGGTAAAACCTATTGGTTGCAAATGGATCTATAAGAGGAAACGAGACCAAACTGGTAAGGACTCCAAGAAAGGATTATTATCTTTCAGGCATGGAG
TTCATTTGTCGAAGGAACAAAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACATATTCCCTATGCATCAGCAGTCGGTAGTCTGATGTATGCCATGCTATGTACC
CAACCCGACATATGCTATGCTATTGGAATTGTCAGCAGATATCAGTCCAATTCGGGGCATGCTCATTGGACTGCTGTTAAGAATATCCTCAAGTATCTTCGGAGAACGAG
GGACTATATGCTAATGTACGGTGCTAAGGATCTGATCCTTACAGGGTACACTGACTCAGATTTTCAGACCGATGTAGATTCAAGGAAATCGACATCAGAATCTGTCTTCA
CTCTGAACGGAGGAGCAATAATATGGAGAAGCATAAAGCAAGGTTGCATTGCTAATTCCACCATGGAGGCTGAGTATGTTGCTGTTTATGAAGCAGCGAAAGAATCTGTA
TGGCATAGGAAGTTCTTAACTCATTTGGAAGTCGTTCCAAATATGCATCTTCCCGTCACTCTTTATTGTGATAACAGTGGAGCAGTTGCAAATTCAAAAGAACCAAGAAG
CCATAAGCGAGGAAAACATATTGAGCGCAAATATCATCTCATAAGGGAGATTGTGCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGACAAACTCAATAGTACAATTACTTGCTTCTGAGAAATTAAACGGCAACAATTACACAACTTGGAAATCAAACCTAAATACAATACTTGTAATTGATGATTTGAGGTT
TGTTTTAACTGAGGAATGTCCTCCAAACCCTAGCTCAAATGCAAACCGAACAGTTCGGGATGCATATGACAGATGGACAAAGGAAAATGACAAAGCCCGAGTATACATTT
TAGCCAGCATATCTGATGTTTTAGCTAAGAAACACAATGTCATGGGTACTGCTAAAGAGATTATGGAATCTCTGAAAGGGATGTTTGGACAACCGTCCTTCTCCCTTAGA
TATGACGCCATAAAATACATTTACAACTACCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCATTTCAATGTGACAAAAAAAAATGAAGC
TGTCATTGATGAGAAGAGTCAAGTTAGTTTTATCATGGAGTCTCTTCCGAAGAGTTTCTTCCAGTTTCGCACAAATGTGATAATGAACAAGAAAGAATATAACTTGACTG
CTCTTCCCAATGAGCTACAGACTTATCAGTTCCTCTTAACGAACAAGGGACAAACAGGAGAAGCAAATGTTGCTATCTTTAAGAAATTACTACGAGGATTGTCCTCCAAA
AATAAGTATGACTATCTTTACCTAATGCATCACAAGTCCGAAGCTCTTGAAAAATTCAGAGAGTATAAGACTGAGGTTGAGAATCTATTAGGTAAAACTATTAAAACACT
TCGATCAGATCGAGGTGGAGAGTATATGGACTTAAGATTTCAGGACTATATGATAGAACATGGAATAAGGTCTCAACTCTCAGCCCCTGTGGAAACTGCTACATACATTT
TGAACATGGTTCCTTCTAAGAGTGTTTCAGAAACACCTTATGAGTTATGGAAAGGGCGTAAAGGTTATAGAGCCAAAGAAACCTTAGAACTTGTGCATACAGATCTCTGT
GGTCCAATGAATATCAAAGCACAAGGAGGGTGTGAATATTTCATCAGTTTTATTGATGATTATTCAAGGTATGACTATCTTTACCTAATGCATCACAAGTCCGAAGCTCT
TGAAAAATTCAGAGAGTATAAGACTGAGGTTGAGAATCTATTAGGTAAAACTATTAAAACACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTAAGATTTCAGGACT
ATATGATAGAACATGGAATAAGGTCTCAACTCTCAGCCCCTGTGGAAACTGCTACATACATTTTGAACATGGTTCCTTCTAAGAGTGTTTCAGAAACACCTTATGAGTTA
TGGAAAGGGCGTAAAGATCAACCAGATGGGGTAAAACCTATTGGTTGCAAATGGATCTATAAGAGGAAACGAGACCAAACTGGTAAGGTACAGATCTTTAAAGCTCGACT
TGTAGCAAAGTGTTATACCCAAAGAGAGAGGGTGGACTATGAAGAAACTTTCTCTCCTATTGCTATGCTTAAATCGATTAGAATACTCTTGTCCATTGCCACATTTTATG
ATTATGAAATTTGGGAAATGGATGTTAAGACAGCCTTTCTAAACGACAATCTTGAAGAGAGTATCTATATGACTCAACTAGAGGGGTTCATTGAACAGGATCGTGAGCAA
AGGGTTTGCAAGCTTAAAAGATTCATTTATGGGTTAAAGCAAGCATCTCGATCCTGGAATATAAAGTTTGATACTGCGATCAAATCTTATGGCTTTAAACAGAATGTTGA
CGAACCTTGTGTTTATAAAAGGATAGTCAACTCTACAGTAGCTTTCCTAGTATTGTATGACTATCTTTACCTAATGCATCACAAGTCCGAAGCTCTTGAAAAATTCAGAG
AGTATAAGACTGAGGTTGAGAATCTATTAGGTAAAACTATTAAAACACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTAAGATTTCAGGACTATATGATAGAACAT
GGAATAAGGTCTCAACTCTCAGCCCCTGTGGAAACTGCTACATACATTTTGAACATGGTTCCTTCTAAGAGTGTTTCAGAAACACCTTATGAGTTATGGAAAGGGCGTAA
AGATCAACCAGATGGGGTAAAACCTATTGGTTGCAAATGGATCTATAAGAGGAAACGAGACCAAACTGGTAAGGACTCCAAGAAAGGATTATTATCTTTCAGGCATGGAG
TTCATTTGTCGAAGGAACAAAGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACATATTCCCTATGCATCAGCAGTCGGTAGTCTGATGTATGCCATGCTATGTACC
CAACCCGACATATGCTATGCTATTGGAATTGTCAGCAGATATCAGTCCAATTCGGGGCATGCTCATTGGACTGCTGTTAAGAATATCCTCAAGTATCTTCGGAGAACGAG
GGACTATATGCTAATGTACGGTGCTAAGGATCTGATCCTTACAGGGTACACTGACTCAGATTTTCAGACCGATGTAGATTCAAGGAAATCGACATCAGAATCTGTCTTCA
CTCTGAACGGAGGAGCAATAATATGGAGAAGCATAAAGCAAGGTTGCATTGCTAATTCCACCATGGAGGCTGAGTATGTTGCTGTTTATGAAGCAGCGAAAGAATCTGTA
TGGCATAGGAAGTTCTTAACTCATTTGGAAGTCGTTCCAAATATGCATCTTCCCGTCACTCTTTATTGTGATAACAGTGGAGCAGTTGCAAATTCAAAAGAACCAAGAAG
CCATAAGCGAGGAAAACATATTGAGCGCAAATATCATCTCATAAGGGAGATTGTGCAATGA
Protein sequenceShow/hide protein sequence
MTNSIVQLLASEKLNGNNYTTWKSNLNTILVIDDLRFVLTEECPPNPSSNANRTVRDAYDRWTKENDKARVYILASISDVLAKKHNVMGTAKEIMESLKGMFGQPSFSLR
YDAIKYIYNYRMKEGTSVREHVLDMMVHFNVTKKNEAVIDEKSQVSFIMESLPKSFFQFRTNVIMNKKEYNLTALPNELQTYQFLLTNKGQTGEANVAIFKKLLRGLSSK
NKYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSETPYELWKGRKGYRAKETLELVHTDLC
GPMNIKAQGGCEYFISFIDDYSRYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEHGIRSQLSAPVETATYILNMVPSKSVSETPYEL
WKGRKDQPDGVKPIGCKWIYKRKRDQTGKVQIFKARLVAKCYTQRERVDYEETFSPIAMLKSIRILLSIATFYDYEIWEMDVKTAFLNDNLEESIYMTQLEGFIEQDREQ
RVCKLKRFIYGLKQASRSWNIKFDTAIKSYGFKQNVDEPCVYKRIVNSTVAFLVLYDYLYLMHHKSEALEKFREYKTEVENLLGKTIKTLRSDRGGEYMDLRFQDYMIEH
GIRSQLSAPVETATYILNMVPSKSVSETPYELWKGRKDQPDGVKPIGCKWIYKRKRDQTGKDSKKGLLSFRHGVHLSKEQSPKTPQEVEDMRHIPYASAVGSLMYAMLCT
QPDICYAIGIVSRYQSNSGHAHWTAVKNILKYLRRTRDYMLMYGAKDLILTGYTDSDFQTDVDSRKSTSESVFTLNGGAIIWRSIKQGCIANSTMEAEYVAVYEAAKESV
WHRKFLTHLEVVPNMHLPVTLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQ