; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007735 (gene) of Snake gourd v1 genome

Gene IDTan0007735
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG09:35785729..35789672
RNA-Seq ExpressionTan0007735
SyntenyTan0007735
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026233.1 gag/pol protein [Cucumis melo var. makuwa]1.3e-15636.77Show/hide
Query:  MSSSFIQLLASDKLNGDNFRTLKSNLNTVLVTDDLRFVLTKECPPPPSLSATRTVRYTFDKLTRANDKAPVYILVNISDVLSKKHEGMITAKEIMGSLQA
        +SS+ + +LA+DKLNG+N+ + K+ +N VL+ DDLRFVL ++CP  P+ +ATRTVR  +++  +AN+KA  YIL ++S+VL+KKHE M+TA+EIM SLQ 
Subjt:  MSSSFIQLLASDKLNGDNFRTLKSNLNTVLVTDDLRFVLTKECPPPPSLSATRTVRYTFDKLTRANDKAPVYILVNISDVLSKKHEGMITAKEIMGSLQA

Query:  MFGQPSSSVRYDALKYIYNSRMKEGTSVREHVLNMMVHFNVAEVNGVVIDENS--------------QFRTNSVMNKIEYNLTTLFNELQTFETLMKSKG
        MFGQ S  +++DALKYIYN+RM EG SVREHVLNMMVHFNVA +N  VIDE S              QFR+N+VMNKI Y LTTL NELQTFE+LMK KG
Subjt:  MFGQPSSSVRYDALKYIYNSRMKEGTSVREHVLNMMVHFNVAEVNGVVIDENS--------------QFRTNSVMNKIEYNLTTLFNELQTFETLMKSKG

Query:  KKKEANVVTS-KKFLRGSSSWTKSGPFFLRILVFRKRRRIMGRK------------------------------KLP--YARLKPQENVSIVVKMGTGRG
        +K EANV TS +KF RGS+S TKS P       ++K++   G K                                P   A+ K  +   +  +  TG+G
Subjt:  KKKEANVVTS-KKFLRGSSSWTKSGPFFLRILVFRKRRRIMGRK------------------------------KLP--YARLKPQENVSIVVKMGTGRG

Query:  -TSQSPLQ---------------------------------------KRKLRKKNKETSSWRQLADSQI--TLRE-------------------------
          ++ PL+                                       K +  +K KE  +  + A S+   T R                          
Subjt:  -TSQSPLQ---------------------------------------KRKLRKKNKETSSWRQLADSQI--TLRE-------------------------

Query:  -----KQNLV---RHGSIHDELRSAAS------SFWGYAVETVAYILNMVHSKSVSETPYELWKWRK---------------------------------
             +QN V   R+ ++ D +RS  S      SFWGYAV+T  YILN V SKSVSETP +LW   K                                 
Subjt:  -----KQNLV---RHGSIHDELRSAAS------SFWGYAVETVAYILNMVHSKSVSETPYELWKWRK---------------------------------

Query:  GYSKETKRGLFYDPQEDKV---------------------------------------------------------------------------------
        GY K T+ G FYDP+++KV                                                                                 
Subjt:  GYSKETKRGLFYDPQEDKV---------------------------------------------------------------------------------

Query:  -------VIPDDCIEDPLTYKHVMNDTDRDQWIKAMDLEMESMYFNLVWELVDQPDGGRPIRCK------------------------------------
               VI D  IEDPLT+K  M D D+D+WIKAM+LE+ESMYFN VW+LVDQPDG +PI CK                                    
Subjt:  -------VIPDDCIEDPLTYKHVMNDTDRDQWIKAMDLEMESMYFNLVWELVDQPDGGRPIRCK------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------SSQHGIHLSKEQ
                                                                                                  +HG+ LSKEQ
Subjt:  ----------------------------------------------------------------------------------------SSQHGIHLSKEQ

Query:  CLKTPQEVEDMRHIPYASVVGSLMYAMLCTRPDICYVVGIVSRIQ-------------------------------DLILTGYTNSSFQTDVDSRKSTSR
        C KTPQ+VE+MRHIPYAS VGSLMYAMLCTRPDICY VGIVSR Q                               DLILTGYT+S FQTD DSRKSTS 
Subjt:  CLKTPQEVEDMRHIPYASVVGSLMYAMLCTRPDICYVVGIVSRIQ-------------------------------DLILTGYTNSSFQTDVDSRKSTSR

Query:  LVFTLNEGAIVRRSIKQGCITDSTMDAEYVAACEAAKEVEWLRKFLADLEVVPKMNLHITLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDV
         VFTLN GA+V RSIKQGCI DSTM+AEYVAACEAAKE  WLR FL DLEVVP M+  ITLYCDNSGAV NS+EPRSHKRGKHIERKYHLIREIV RGDV
Subjt:  LVFTLNEGAIVRRSIKQGCITDSTMDAEYVAACEAAKEVEWLRKFLADLEVVPKMNLHITLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDV

Query:  IVTQIASKHNIDDPFTKPLTAKVFEGHLESLGLR
        IVTQIAS HN+ DPFTKPLTAKVFEGHLESLGLR
Subjt:  IVTQIASKHNIDDPFTKPLTAKVFEGHLESLGLR

KAA0032972.1 gag/pol protein [Cucumis melo var. makuwa]3.5e-14941.63Show/hide
Query:  MSSSFIQLLASDKLNGDNFRTLKSNLNTVLVTDDLRFVLTKECPPPPSLSATRTVRYTFDKLTRANDKAPVYILVNISDVLSKKHEGMITAKEIMGSLQA
        M+SS +QLLAS KLNGDN+   KSNLNT+LV D L FVLT+ECP  PS +A R  R  +D+  + N+KA VYIL ++ DVL+KKHE + TAKEIM SL+ 
Subjt:  MSSSFIQLLASDKLNGDNFRTLKSNLNTVLVTDDLRFVLTKECPPPPSLSATRTVRYTFDKLTRANDKAPVYILVNISDVLSKKHEGMITAKEIMGSLQA

Query:  MFGQPSSSVRYDALKYIYNSRMKEGTSVREHVLNMMVHFNVAEVNGVVIDENSQFRTNSVMNKIEYNLTTLFNELQTFETLMKSKGKKKEANVVTSK-KF
        MFGQP  S+R++A+KYIY  RMKEGTSVREHV++MM+HFN+AEVN   IDE +Q            NLT             KSKGK+ EANV T+K KF
Subjt:  MFGQPSSSVRYDALKYIYNSRMKEGTSVREHVLNMMVHFNVAEVNGVVIDENSQFRTNSVMNKIEYNLTTLFNELQTFETLMKSKGKKKEANVVTSK-KF

Query:  LRGSSSWTKSGPFFLRILVFRKRRRIMGRKKLPYARLKPQENVSIVVKMGTGRGTSQSP-------------LQKRKLRKKNKETSSWRQLADSQIT---
         RGSSS +K+GP      + +K +    ++    +  K      I +K+GTG   S                L K  L ++  +     QL D+ +    
Subjt:  LRGSSSWTKSGPFFLRILVFRKRRRIMGRKKLPYARLKPQENVSIVVKMGTGRGTSQSP-------------LQKRKLRKKNKETSSWRQLADSQIT---

Query:  ----------------LREKQNL-VRHGSIHDELRSAASSFWGYAVETVAYI-----LNMVHSKSVSETPYELWKW----RKGYSKETKRGLFYDPQEDK
                        LR K  L + H ++   +   A   + Y +  +        + ++H+KS S   ++ +K       GYSKE++ GLFYDPQE+K
Subjt:  ----------------LREKQNL-VRHGSIHDELRSAASSFWGYAVETVAYI-----LNMVHSKSVSETPYELWKW----RKGYSKETKRGLFYDPQEDK

Query:  V----------------------------------------------------------------------------------VIPDDCIEDPLTYKHVM
        V                                                                                  +IPDD IEDPL +K VM
Subjt:  V----------------------------------------------------------------------------------VIPDDCIEDPLTYKHVM

Query:  NDTDRDQWIKAMDLEMESMYFNLVWELVDQPDGGRPIRCK------------------------------------------------------------
        ND D DQWIK+MDL+MESMY N VW LVDQP+  R I CK                                                            
Subjt:  NDTDRDQWIKAMDLEMESMYFNLVWELVDQPDGGRPIRCK------------------------------------------------------------

Query:  --------------SSQHGIHLSKEQCLKTPQEVEDMRHIPYASVVGSLMYAMLCTRPDICYVVGIVSRIQ-----------------------------
                        ++GIHLSKEQC KTPQEVEDM +IPYASVV SLMY MLCTRPDI Y VGIVS+ Q                             
Subjt:  --------------SSQHGIHLSKEQCLKTPQEVEDMRHIPYASVVGSLMYAMLCTRPDICYVVGIVSRIQ-----------------------------

Query:  --DLILTGYTNSSFQTDVDSRKSTSRLVFTLNEGAIVRRSIKQGCITDSTMDAEYVAACEAAKEVEWLRKFLADLEVVPKMNLHITLYCDNSGAVTNSKE
          DLILTGYT+S FQ D D+R ST   VFTLN GA+V RSIKQ CI DSTM+AEYVA CEA KEV WL+KFL  L+ VP M+L I LYCDNSGA+ NS+E
Subjt:  --DLILTGYTNSSFQTDVDSRKSTSRLVFTLNEGAIVRRSIKQGCITDSTMDAEYVAACEAAKEVEWLRKFLADLEVVPKMNLHITLYCDNSGAVTNSKE

Query:  PRSHKRGKHIERKYHLIREIVQRGDVIVTQIASKHNIDDPFTKPLTAKVFEGHLESLGLR
        PRSHKRGKHIERKYHLIRE V R DVIVT+I+ + N+ DPFTK LT KVFE HL  LGLR
Subjt:  PRSHKRGKHIERKYHLIREIVQRGDVIVTQIASKHNIDDPFTKPLTAKVFEGHLESLGLR

KAA0054309.1 gag/pol protein [Cucumis melo var. makuwa]8.0e-16239.42Show/hide
Query:  MSSSFIQLLASDKLNGDNFRTLKSNLNTVLVTDDLRFVLTKECPPPPSLSATRTVRYTFDKLTRANDKAPVYILVNISDVLSKKHEGMITAKEIMGSLQA
        M+S+ + +LA+DKLNG+N+ + K+ +NTVL+ DDLRFVL +ECP  P+ +ATRTVR  +++  + N+KA  YIL ++S+VL+KKHE M+TA+EIM SLQ 
Subjt:  MSSSFIQLLASDKLNGDNFRTLKSNLNTVLVTDDLRFVLTKECPPPPSLSATRTVRYTFDKLTRANDKAPVYILVNISDVLSKKHEGMITAKEIMGSLQA

Query:  MFGQPSSSVRYDALKYIYNSRMKEGTSVREHVLNMMVHFNVAEVNGVVIDENS--------------QFRTNSVMNKIEYNLTTLFNELQTFETLMKSKG
        MFGQ S  +++DALKYI N+RM EG SVREHV+NMMVHFNVAE+NG VIDE S              QFR+N+VMNKI Y LTTL NELQTFE+LMK KG
Subjt:  MFGQPSSSVRYDALKYIYNSRMKEGTSVREHVLNMMVHFNVAEVNGVVIDENS--------------QFRTNSVMNKIEYNLTTLFNELQTFETLMKSKG

Query:  KKKEANVVTS-KKFLRGSSSWTKSGPFFLRILVFRKRRRIMGRKK----------------------------------LPYARLKPQENVSI-----VV
        +K E NV TS +KF RGS+S TKS P       ++K++   G K                                   L   +   Q ++++     +V
Subjt:  KKKEANVVTS-KKFLRGSSSWTKSGPFFLRILVFRKRRRIMGRKK----------------------------------LPYARLKPQENVSI-----VV

Query:  KMG--------------------------TGRG-TSQSPLQ---------------------------------------KRKLRKKNKETSSWRQLADS
        K G                          T +G  ++ PL+                                       K +  +K KE  +  + A S
Subjt:  KMG--------------------------TGRG-TSQSPLQ---------------------------------------KRKLRKKNKETSSWRQLADS

Query:  QI--TLRE------------------------------KQNLV---RHGSIHDELRSAAS------SFWGYAVETVAYILNMVHSKSVSETPYELWKWRK
        +   T R                               +QN V   R+ ++ D +RS  S      SFWGYAV+T  YILN V SKSVSETP +LW  RK
Subjt:  QI--TLRE------------------------------KQNLV---RHGSIHDELRSAAS------SFWGYAVETVAYILNMVHSKSVSETPYELWKWRK

Query:  --------------------GYSKETKRGLFYDPQ----------------------------------------EDKVVIPDDCIEDPLTYKHVMNDTD
                             Y+ E    +  +P                                         E   VI D  IEDPLT+K  M D D
Subjt:  --------------------GYSKETKRGLFYDPQ----------------------------------------EDKVVIPDDCIEDPLTYKHVMNDTD

Query:  RDQWIKAMDLEMESMYFNLVWELVDQPDGGRPIRCK----------------------------------------------------------------
        +D+WIKAM+LE+ESMYFN VW+LVDQPDG +PI CK                                                                
Subjt:  RDQWIKAMDLEMESMYFNLVWELVDQPDGGRPIRCK----------------------------------------------------------------

Query:  -----------------------------------------------------------------------------------------SSQHGIHLSKE
                                                                                                   +HG+ LSKE
Subjt:  -----------------------------------------------------------------------------------------SSQHGIHLSKE

Query:  QCLKTPQEVEDMRHIPYASVVGSLMYAMLCTRPDICYVVGIVSRIQ-------------------------------DLILTGYTNSSFQTDVDSRKSTS
        QC KTPQ+VE+MRHIPYAS VGSLMYAMLCTRPDICY VGIVSR Q                               DLILTGYT+S FQTD DSRKSTS
Subjt:  QCLKTPQEVEDMRHIPYASVVGSLMYAMLCTRPDICYVVGIVSRIQ-------------------------------DLILTGYTNSSFQTDVDSRKSTS

Query:  RLVFTLNEGAIVRRSIKQGCITDSTMDAEYVAACEAAKEVEWLRKFLADLEVVPKMNLHITLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGD
          VFTLN GA+V RSIKQGCI DSTM+ EYVAACEAAKEV WLR FL DLEVVP M+  ITLYCDN+G V NS+EPRSHKRGKHIERKYHLIREIV RGD
Subjt:  RLVFTLNEGAIVRRSIKQGCITDSTMDAEYVAACEAAKEVEWLRKFLADLEVVPKMNLHITLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGD

Query:  VIVTQIASKHNIDDPFTKPLTAKVFEGHLESLGLR
        VIVTQI S HN+ DPFTKPLTAKVFEGHLESLG+R
Subjt:  VIVTQIASKHNIDDPFTKPLTAKVFEGHLESLGLR

KAA0061339.1 gag/pol protein [Cucumis melo var. makuwa]2.1e-17040.55Show/hide
Query:  MSSSFIQLLASDKLNGDNFRTLKSNLNTVLVTDDLRFVLTKECPPPPSLSATRTVRYTFDKLTRANDKAPVYILVNISDVLSKKHEGMITAKEIMGSLQA
        M+S+ + +LA+DKLNG+N+   K+ +NTVL+ DDLRFVL +ECP  P+ +AT+TVR  +++  +AN+KA  YIL ++S+VL+KKHE M+TA+EIM SLQ 
Subjt:  MSSSFIQLLASDKLNGDNFRTLKSNLNTVLVTDDLRFVLTKECPPPPSLSATRTVRYTFDKLTRANDKAPVYILVNISDVLSKKHEGMITAKEIMGSLQA

Query:  MFGQPSSSVRYDALKYIYNSRMKEGTSVREHVLNMMVHFNVAEVNGVVIDENS--------------QFRTNSVMNKIEYNLTTLFNELQTFETLMKSKG
        MFGQ S  +++DALKYIYN+RM EG SVREHVLN+MVHFNVAE+NG VIDE S              QFR+N+VMNKI Y LTTL NELQTFE+LMK KG
Subjt:  MFGQPSSSVRYDALKYIYNSRMKEGTSVREHVLNMMVHFNVAEVNGVVIDENS--------------QFRTNSVMNKIEYNLTTLFNELQTFETLMKSKG

Query:  KKKEANVVTS-KKFLRGSSSWTKSGPFFLRILVFRKRRRIMGRK-KLPYARLKPQENVSIVVKMGTGRGTSQSPLQKRKLRKKNKETSSWRQLADSQITL
        +K EANV TS +KF RGS+S TKS P       ++K++   G K  L  A+   +   +  +  GT +    S  + R L    +   S+  L       
Subjt:  KKKEANVVTS-KKFLRGSSSWTKSGPFFLRILVFRKRRRIMGRK-KLPYARLKPQENVSIVVKMGTGRGTSQSPLQKRKLRKKNKETSSWRQLADSQITL

Query:  REKQNLVRHGSIHDELRSAASSFWGYAVETVAYILNMVHSKSVSETPYELWKWRKGYSKETKRGLFYDPQEDKV--------------------------
                            +SFWGYAV+T  YILN V SKSVSETP +LW   KGY K T+ G FYDP+++KV                          
Subjt:  REKQNLVRHGSIHDELRSAASSFWGYAVETVAYILNMVHSKSVSETPYELWKWRKGYSKETKRGLFYDPQEDKV--------------------------

Query:  --------------------------------------------------------------VIPDDCIEDPLTYKHVMNDTDRDQWIKAMDLEMESMYF
                                                                      VI D  IEDPLT+K  M D D+D+WIKAM+LE+ESMYF
Subjt:  --------------------------------------------------------------VIPDDCIEDPLTYKHVMNDTDRDQWIKAMDLEMESMYF

Query:  NLVWELVDQPDGGRPIRCK---------------------------------------------------------------------------------
        N VW+L+DQPDG +PI CK                                                                                 
Subjt:  NLVWELVDQPDGGRPIRCK---------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------SSQHGIHLSKEQCLKTPQEVEDMRHIPYASVVGSLMYAMLCTRPDICYVVGIVSRIQ
                                                     +HG+ LSKEQC KTPQ+VE+MRHIPYAS VGSLMYAMLCTRPDICY VGIVSR Q
Subjt:  -------------------------------------------SSQHGIHLSKEQCLKTPQEVEDMRHIPYASVVGSLMYAMLCTRPDICYVVGIVSRIQ

Query:  -------------------------------DLILTGYTNSSFQTDVDSRKSTSRLVFTLNEGAIVRRSIKQGCITDSTMDAEYVAACEAAKEVEWLRKF
                                       DLILTGYT+S FQTD DSRKST   VFTLN GA+V RSIKQGCI DSTM+AEYV ACEAAKE  WLR F
Subjt:  -------------------------------DLILTGYTNSSFQTDVDSRKSTSRLVFTLNEGAIVRRSIKQGCITDSTMDAEYVAACEAAKEVEWLRKF

Query:  LADLEVVPKMNLHITLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASKHNIDDPFTKPLTAKVFEGHLESLGLR
        L DLEVVP M+  ITLYCDNSGAV NS+EPRSHKRGKHIERKYHLIREIV RGDVIVTQIAS HNI D FTKPLTAKVFEGHLESLGLR
Subjt:  LADLEVVPKMNLHITLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASKHNIDDPFTKPLTAKVFEGHLESLGLR

TYJ96755.1 gag/pol protein [Cucumis melo var. makuwa]3.8e-15644.5Show/hide
Query:  MSSSFIQLLASDKLNGDNFRTLKSNLNTVLVTDDLRFVLTKECPPPPSLSATRTVRYTFDKLTRANDKAPVYILVNISDVLSKKHEGMITAKEIMGSLQA
        M+S+ + +LA+DKLNG+N+ + K+ +NTVL+ DDLRF L +ECP  P+ +ATRTVR  +++  +AN+KA  YIL ++S+                     
Subjt:  MSSSFIQLLASDKLNGDNFRTLKSNLNTVLVTDDLRFVLTKECPPPPSLSATRTVRYTFDKLTRANDKAPVYILVNISDVLSKKHEGMITAKEIMGSLQA

Query:  MFGQPSSSVRYDALKYIYNSRMKEGTSVREHVLNMMVHFNVAEVNGVVIDENS--------------QFRTNSVMNKIEYNLTTLFNELQTFETLMKSKG
             S  +++DALKYIYN+RM EG S+REHVLNMMVHFN+AE+N  VIDE S              QFR+N+VMNKI Y LTTL NELQTFE+LMK KG
Subjt:  MFGQPSSSVRYDALKYIYNSRMKEGTSVREHVLNMMVHFNVAEVNGVVIDENS--------------QFRTNSVMNKIEYNLTTLFNELQTFETLMKSKG

Query:  KKKEANVVTS-KKFLRGSSSWTKSGPFFLRILVFRKRRRIMGRK-KLPYARLKPQ--------------------------ENVSIVVKMGTGR------
        +K EANV TS +KF RGS S TKS P+       +K++   G K  L  A++  +                          E   + +++GTG       
Subjt:  KKKEANVVTS-KKFLRGSSSWTKSGPFFLRILVFRKRRRIMGRK-KLPYARLKPQ--------------------------ENVSIVVKMGTGR------

Query:  --GTSQSPLQKRKLRKKNKETSSWRQLADSQIT--------------LREKQNLVRHGSIH---------DELRSAASSFWGYAVETVAYILNMVHSKSV
          G     + KR    K        +L  S +               +    +  R+G ++         ++ +   +      ++T   ILN V SKSV
Subjt:  --GTSQSPLQKRKLRKKNKETSSWRQLADSQIT--------------LREKQNLVRHGSIH---------DELRSAASSFWGYAVETVAYILNMVHSKSV

Query:  SETPYELWKWRKGYSKETKRGLFYDPQEDKV-------------------------------------------------------------------VI
        SETP +LW  RKGY K  + G FYDP+++KV                                                                   VI
Subjt:  SETPYELWKWRKGYSKETKRGLFYDPQEDKV-------------------------------------------------------------------VI

Query:  PDDCIEDPLTYKHVMNDTDRDQWIKAMDLEMESMYFNLVWELVDQPDGGRPIRCKSS---------------------------------------QHGI
         D  IEDPLT+K  + D D+D+WIKAM+LE+ESMYFN VW+LVDQ DG +PI CK S                                        HG+
Subjt:  PDDCIEDPLTYKHVMNDTDRDQWIKAMDLEMESMYFNLVWELVDQPDGGRPIRCKSS---------------------------------------QHGI

Query:  HLSKEQCLKTPQEVEDMRHIPYASVVGSLMYAMLCTRPDICYVVGIVSRIQDLILTGYTNSSFQTDVDSRKSTSRLVFTLNEGAIVRRSIKQGCITDSTM
         LSKEQC KTPQ+VE+MRHIPYAS VGSLM     TR D   V G     +DLILT Y +S FQTD DSRKSTS  VFTLN GA+V RSIKQGCI DSTM
Subjt:  HLSKEQCLKTPQEVEDMRHIPYASVVGSLMYAMLCTRPDICYVVGIVSRIQDLILTGYTNSSFQTDVDSRKSTSRLVFTLNEGAIVRRSIKQGCITDSTM

Query:  DAEYVAACEAAKEVEWLRKFLADLEVVPKMNLHITLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASKHNIDDPFTKPLTAKVFE
        +AEY+AACEAAKE  WLRKFL DLEVVP M   ITLYCDNSGAV NS+EPRSHKRGKHIERKYHLIREI  RGDVIVTQIA  HN+ DPFTKPLTAKVFE
Subjt:  DAEYVAACEAAKEVEWLRKFLADLEVVPKMNLHITLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASKHNIDDPFTKPLTAKVFE

Query:  GHLESLGLR
        GHL SLGLR
Subjt:  GHLESLGLR

TrEMBL top hitse value%identityAlignment
A0A5A7SNP8 Gag/pol protein6.4e-15736.77Show/hide
Query:  MSSSFIQLLASDKLNGDNFRTLKSNLNTVLVTDDLRFVLTKECPPPPSLSATRTVRYTFDKLTRANDKAPVYILVNISDVLSKKHEGMITAKEIMGSLQA
        +SS+ + +LA+DKLNG+N+ + K+ +N VL+ DDLRFVL ++CP  P+ +ATRTVR  +++  +AN+KA  YIL ++S+VL+KKHE M+TA+EIM SLQ 
Subjt:  MSSSFIQLLASDKLNGDNFRTLKSNLNTVLVTDDLRFVLTKECPPPPSLSATRTVRYTFDKLTRANDKAPVYILVNISDVLSKKHEGMITAKEIMGSLQA

Query:  MFGQPSSSVRYDALKYIYNSRMKEGTSVREHVLNMMVHFNVAEVNGVVIDENS--------------QFRTNSVMNKIEYNLTTLFNELQTFETLMKSKG
        MFGQ S  +++DALKYIYN+RM EG SVREHVLNMMVHFNVA +N  VIDE S              QFR+N+VMNKI Y LTTL NELQTFE+LMK KG
Subjt:  MFGQPSSSVRYDALKYIYNSRMKEGTSVREHVLNMMVHFNVAEVNGVVIDENS--------------QFRTNSVMNKIEYNLTTLFNELQTFETLMKSKG

Query:  KKKEANVVTS-KKFLRGSSSWTKSGPFFLRILVFRKRRRIMGRK------------------------------KLP--YARLKPQENVSIVVKMGTGRG
        +K EANV TS +KF RGS+S TKS P       ++K++   G K                                P   A+ K  +   +  +  TG+G
Subjt:  KKKEANVVTS-KKFLRGSSSWTKSGPFFLRILVFRKRRRIMGRK------------------------------KLP--YARLKPQENVSIVVKMGTGRG

Query:  -TSQSPLQ---------------------------------------KRKLRKKNKETSSWRQLADSQI--TLRE-------------------------
          ++ PL+                                       K +  +K KE  +  + A S+   T R                          
Subjt:  -TSQSPLQ---------------------------------------KRKLRKKNKETSSWRQLADSQI--TLRE-------------------------

Query:  -----KQNLV---RHGSIHDELRSAAS------SFWGYAVETVAYILNMVHSKSVSETPYELWKWRK---------------------------------
             +QN V   R+ ++ D +RS  S      SFWGYAV+T  YILN V SKSVSETP +LW   K                                 
Subjt:  -----KQNLV---RHGSIHDELRSAAS------SFWGYAVETVAYILNMVHSKSVSETPYELWKWRK---------------------------------

Query:  GYSKETKRGLFYDPQEDKV---------------------------------------------------------------------------------
        GY K T+ G FYDP+++KV                                                                                 
Subjt:  GYSKETKRGLFYDPQEDKV---------------------------------------------------------------------------------

Query:  -------VIPDDCIEDPLTYKHVMNDTDRDQWIKAMDLEMESMYFNLVWELVDQPDGGRPIRCK------------------------------------
               VI D  IEDPLT+K  M D D+D+WIKAM+LE+ESMYFN VW+LVDQPDG +PI CK                                    
Subjt:  -------VIPDDCIEDPLTYKHVMNDTDRDQWIKAMDLEMESMYFNLVWELVDQPDGGRPIRCK------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------SSQHGIHLSKEQ
                                                                                                  +HG+ LSKEQ
Subjt:  ----------------------------------------------------------------------------------------SSQHGIHLSKEQ

Query:  CLKTPQEVEDMRHIPYASVVGSLMYAMLCTRPDICYVVGIVSRIQ-------------------------------DLILTGYTNSSFQTDVDSRKSTSR
        C KTPQ+VE+MRHIPYAS VGSLMYAMLCTRPDICY VGIVSR Q                               DLILTGYT+S FQTD DSRKSTS 
Subjt:  CLKTPQEVEDMRHIPYASVVGSLMYAMLCTRPDICYVVGIVSRIQ-------------------------------DLILTGYTNSSFQTDVDSRKSTSR

Query:  LVFTLNEGAIVRRSIKQGCITDSTMDAEYVAACEAAKEVEWLRKFLADLEVVPKMNLHITLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDV
         VFTLN GA+V RSIKQGCI DSTM+AEYVAACEAAKE  WLR FL DLEVVP M+  ITLYCDNSGAV NS+EPRSHKRGKHIERKYHLIREIV RGDV
Subjt:  LVFTLNEGAIVRRSIKQGCITDSTMDAEYVAACEAAKEVEWLRKFLADLEVVPKMNLHITLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDV

Query:  IVTQIASKHNIDDPFTKPLTAKVFEGHLESLGLR
        IVTQIAS HN+ DPFTKPLTAKVFEGHLESLGLR
Subjt:  IVTQIASKHNIDDPFTKPLTAKVFEGHLESLGLR

A0A5A7UYX7 Gag/pol protein1.3e-14638.09Show/hide
Query:  MSSSFIQLLASDKLNGDNFRTLKSNLNTVLVTDDLRFVLTKECPPPPSLSATRTVRYTFDKLTRANDKAPVYILVNISDVLSKKHEGMITAKEIMGSLQA
        M+++ + +LA+DKLNG+N+ + K+ +N VL+ DDL+FVL +ECP  P+ +AT+TVR  +++  + N+K   YIL ++S+VL+KKHE M+TA+EIM SLQ 
Subjt:  MSSSFIQLLASDKLNGDNFRTLKSNLNTVLVTDDLRFVLTKECPPPPSLSATRTVRYTFDKLTRANDKAPVYILVNISDVLSKKHEGMITAKEIMGSLQA

Query:  MFGQPSSSVRYDALKYIYNSRMKEGTSVREHVLNMMVHFNVAEVNGVVIDENSQFRTNSVMNKIEYNLTTLFNELQTFET-----LMKSKGKKK------
        MFGQ S  + +DALKYIYN+RM EG SVREHVLNMMVHFNVAE+NG VIDE SQ +      K E N+ T   +     T     +  S G KK      
Subjt:  MFGQPSSSVRYDALKYIYNSRMKEGTSVREHVLNMMVHFNVAEVNGVVIDENSQFRTNSVMNKIEYNLTTLFNELQTFET-----LMKSKGKKK------

Query:  ----EANVVTSKKFLRGSSS------------WTKSGPFFLRILVFRKRRRI-MGR--KKLPYARLKPQENVSIVV-----------KMGTGRG-TSQSP
            +AN+  +K   +  ++            W ++ P +L      K+  I + R  + +    L   E  S+ +           +  TG+G  ++ P
Subjt:  ----EANVVTSKKFLRGSSS------------WTKSGPFFLRILVFRKRRRI-MGR--KKLPYARLKPQENVSIVV-----------KMGTGRG-TSQSP

Query:  LQ---------------------------------------KRKLRKKNKETSSWRQLADSQI--TLRE------------------------------K
        L+                                       K +  +K KE  +  + A S+   T R                               +
Subjt:  LQ---------------------------------------KRKLRKKNKETSSWRQLADSQI--TLRE------------------------------K

Query:  QNLV---RHGSIHDELRSAAS------SFWGYAVETVAYILNMVHSKSVSETPYELWKWRK---------------------------------GYSKET
        QN V   R+ ++ D +RS  S      SFWGYAV+T  YILN V SKSV ETP +LW  RK                                 GY K T
Subjt:  QNLV---RHGSIHDELRSAAS------SFWGYAVETVAYILNMVHSKSVSETPYELWKWRK---------------------------------GYSKET

Query:  KRGLFYDPQEDKV---------------------------------------------------------------------------------------
        + G FYD +++KV                                                                                       
Subjt:  KRGLFYDPQEDKV---------------------------------------------------------------------------------------

Query:  -VIPDDCIEDPLTYKHVMNDTDRDQWIKAMDLEMESMYFNLVWELVDQPDGGRPIRCK------------------------------------------
         VI D  IEDPLT+K  M D D+D+WIKAM+LE+ESMYFN VW+LVDQPDG +PI CK                                          
Subjt:  -VIPDDCIEDPLTYKHVMNDTDRDQWIKAMDLEMESMYFNLVWELVDQPDGGRPIRCK------------------------------------------

Query:  ---------------------------------------------------------SSQHGIHLSKEQCLKTPQEVEDMRHIPYASVVGSLMYAMLCTR
                                                                   +H + LSKEQC KTPQ+VE+MRHIPYASVVGSLMY MLC R
Subjt:  ---------------------------------------------------------SSQHGIHLSKEQCLKTPQEVEDMRHIPYASVVGSLMYAMLCTR

Query:  PDICYVVGIVSRIQ-------------------------------DLILTGYTNSSFQTDVDSRKSTSRLVFTLNEGAIVRRSIKQGCITDSTMDAEYVA
        PDICY VGIVSR Q                               DLILT YT+S FQTD DSRKS    VFTLN GA+V RSIKQGCI DSTM+AEYVA
Subjt:  PDICYVVGIVSRIQ-------------------------------DLILTGYTNSSFQTDVDSRKSTSRLVFTLNEGAIVRRSIKQGCITDSTMDAEYVA

Query:  ACEAAKEVEWLRKFLADLEVVPKMNLHITLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASKHNIDDPFTKPLTAKVFEGHLESL
        ACE AKE  WLRKFL DLEVVP M+  ITLYCDNSGAV NS+EPRSHKRGKHI+RKYHLIREIVQRGDVIVTQIAS HN+ DPFTKPLTAKVFEGHLESL
Subjt:  ACEAAKEVEWLRKFLADLEVVPKMNLHITLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASKHNIDDPFTKPLTAKVFEGHLESL

Query:  GLR
        GLR
Subjt:  GLR

A0A5A7V6N0 Gag/pol protein1.0e-17040.55Show/hide
Query:  MSSSFIQLLASDKLNGDNFRTLKSNLNTVLVTDDLRFVLTKECPPPPSLSATRTVRYTFDKLTRANDKAPVYILVNISDVLSKKHEGMITAKEIMGSLQA
        M+S+ + +LA+DKLNG+N+   K+ +NTVL+ DDLRFVL +ECP  P+ +AT+TVR  +++  +AN+KA  YIL ++S+VL+KKHE M+TA+EIM SLQ 
Subjt:  MSSSFIQLLASDKLNGDNFRTLKSNLNTVLVTDDLRFVLTKECPPPPSLSATRTVRYTFDKLTRANDKAPVYILVNISDVLSKKHEGMITAKEIMGSLQA

Query:  MFGQPSSSVRYDALKYIYNSRMKEGTSVREHVLNMMVHFNVAEVNGVVIDENS--------------QFRTNSVMNKIEYNLTTLFNELQTFETLMKSKG
        MFGQ S  +++DALKYIYN+RM EG SVREHVLN+MVHFNVAE+NG VIDE S              QFR+N+VMNKI Y LTTL NELQTFE+LMK KG
Subjt:  MFGQPSSSVRYDALKYIYNSRMKEGTSVREHVLNMMVHFNVAEVNGVVIDENS--------------QFRTNSVMNKIEYNLTTLFNELQTFETLMKSKG

Query:  KKKEANVVTS-KKFLRGSSSWTKSGPFFLRILVFRKRRRIMGRK-KLPYARLKPQENVSIVVKMGTGRGTSQSPLQKRKLRKKNKETSSWRQLADSQITL
        +K EANV TS +KF RGS+S TKS P       ++K++   G K  L  A+   +   +  +  GT +    S  + R L    +   S+  L       
Subjt:  KKKEANVVTS-KKFLRGSSSWTKSGPFFLRILVFRKRRRIMGRK-KLPYARLKPQENVSIVVKMGTGRGTSQSPLQKRKLRKKNKETSSWRQLADSQITL

Query:  REKQNLVRHGSIHDELRSAASSFWGYAVETVAYILNMVHSKSVSETPYELWKWRKGYSKETKRGLFYDPQEDKV--------------------------
                            +SFWGYAV+T  YILN V SKSVSETP +LW   KGY K T+ G FYDP+++KV                          
Subjt:  REKQNLVRHGSIHDELRSAASSFWGYAVETVAYILNMVHSKSVSETPYELWKWRKGYSKETKRGLFYDPQEDKV--------------------------

Query:  --------------------------------------------------------------VIPDDCIEDPLTYKHVMNDTDRDQWIKAMDLEMESMYF
                                                                      VI D  IEDPLT+K  M D D+D+WIKAM+LE+ESMYF
Subjt:  --------------------------------------------------------------VIPDDCIEDPLTYKHVMNDTDRDQWIKAMDLEMESMYF

Query:  NLVWELVDQPDGGRPIRCK---------------------------------------------------------------------------------
        N VW+L+DQPDG +PI CK                                                                                 
Subjt:  NLVWELVDQPDGGRPIRCK---------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------SSQHGIHLSKEQCLKTPQEVEDMRHIPYASVVGSLMYAMLCTRPDICYVVGIVSRIQ
                                                     +HG+ LSKEQC KTPQ+VE+MRHIPYAS VGSLMYAMLCTRPDICY VGIVSR Q
Subjt:  -------------------------------------------SSQHGIHLSKEQCLKTPQEVEDMRHIPYASVVGSLMYAMLCTRPDICYVVGIVSRIQ

Query:  -------------------------------DLILTGYTNSSFQTDVDSRKSTSRLVFTLNEGAIVRRSIKQGCITDSTMDAEYVAACEAAKEVEWLRKF
                                       DLILTGYT+S FQTD DSRKST   VFTLN GA+V RSIKQGCI DSTM+AEYV ACEAAKE  WLR F
Subjt:  -------------------------------DLILTGYTNSSFQTDVDSRKSTSRLVFTLNEGAIVRRSIKQGCITDSTMDAEYVAACEAAKEVEWLRKF

Query:  LADLEVVPKMNLHITLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASKHNIDDPFTKPLTAKVFEGHLESLGLR
        L DLEVVP M+  ITLYCDNSGAV NS+EPRSHKRGKHIERKYHLIREIV RGDVIVTQIAS HNI D FTKPLTAKVFEGHLESLGLR
Subjt:  LADLEVVPKMNLHITLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASKHNIDDPFTKPLTAKVFEGHLESLGLR

A0A5D3BE74 Gag/pol protein1.9e-15644.5Show/hide
Query:  MSSSFIQLLASDKLNGDNFRTLKSNLNTVLVTDDLRFVLTKECPPPPSLSATRTVRYTFDKLTRANDKAPVYILVNISDVLSKKHEGMITAKEIMGSLQA
        M+S+ + +LA+DKLNG+N+ + K+ +NTVL+ DDLRF L +ECP  P+ +ATRTVR  +++  +AN+KA  YIL ++S+                     
Subjt:  MSSSFIQLLASDKLNGDNFRTLKSNLNTVLVTDDLRFVLTKECPPPPSLSATRTVRYTFDKLTRANDKAPVYILVNISDVLSKKHEGMITAKEIMGSLQA

Query:  MFGQPSSSVRYDALKYIYNSRMKEGTSVREHVLNMMVHFNVAEVNGVVIDENS--------------QFRTNSVMNKIEYNLTTLFNELQTFETLMKSKG
             S  +++DALKYIYN+RM EG S+REHVLNMMVHFN+AE+N  VIDE S              QFR+N+VMNKI Y LTTL NELQTFE+LMK KG
Subjt:  MFGQPSSSVRYDALKYIYNSRMKEGTSVREHVLNMMVHFNVAEVNGVVIDENS--------------QFRTNSVMNKIEYNLTTLFNELQTFETLMKSKG

Query:  KKKEANVVTS-KKFLRGSSSWTKSGPFFLRILVFRKRRRIMGRK-KLPYARLKPQ--------------------------ENVSIVVKMGTGR------
        +K EANV TS +KF RGS S TKS P+       +K++   G K  L  A++  +                          E   + +++GTG       
Subjt:  KKKEANVVTS-KKFLRGSSSWTKSGPFFLRILVFRKRRRIMGRK-KLPYARLKPQ--------------------------ENVSIVVKMGTGR------

Query:  --GTSQSPLQKRKLRKKNKETSSWRQLADSQIT--------------LREKQNLVRHGSIH---------DELRSAASSFWGYAVETVAYILNMVHSKSV
          G     + KR    K        +L  S +               +    +  R+G ++         ++ +   +      ++T   ILN V SKSV
Subjt:  --GTSQSPLQKRKLRKKNKETSSWRQLADSQIT--------------LREKQNLVRHGSIH---------DELRSAASSFWGYAVETVAYILNMVHSKSV

Query:  SETPYELWKWRKGYSKETKRGLFYDPQEDKV-------------------------------------------------------------------VI
        SETP +LW  RKGY K  + G FYDP+++KV                                                                   VI
Subjt:  SETPYELWKWRKGYSKETKRGLFYDPQEDKV-------------------------------------------------------------------VI

Query:  PDDCIEDPLTYKHVMNDTDRDQWIKAMDLEMESMYFNLVWELVDQPDGGRPIRCKSS---------------------------------------QHGI
         D  IEDPLT+K  + D D+D+WIKAM+LE+ESMYFN VW+LVDQ DG +PI CK S                                        HG+
Subjt:  PDDCIEDPLTYKHVMNDTDRDQWIKAMDLEMESMYFNLVWELVDQPDGGRPIRCKSS---------------------------------------QHGI

Query:  HLSKEQCLKTPQEVEDMRHIPYASVVGSLMYAMLCTRPDICYVVGIVSRIQDLILTGYTNSSFQTDVDSRKSTSRLVFTLNEGAIVRRSIKQGCITDSTM
         LSKEQC KTPQ+VE+MRHIPYAS VGSLM     TR D   V G     +DLILT Y +S FQTD DSRKSTS  VFTLN GA+V RSIKQGCI DSTM
Subjt:  HLSKEQCLKTPQEVEDMRHIPYASVVGSLMYAMLCTRPDICYVVGIVSRIQDLILTGYTNSSFQTDVDSRKSTSRLVFTLNEGAIVRRSIKQGCITDSTM

Query:  DAEYVAACEAAKEVEWLRKFLADLEVVPKMNLHITLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASKHNIDDPFTKPLTAKVFE
        +AEY+AACEAAKE  WLRKFL DLEVVP M   ITLYCDNSGAV NS+EPRSHKRGKHIERKYHLIREI  RGDVIVTQIA  HN+ DPFTKPLTAKVFE
Subjt:  DAEYVAACEAAKEVEWLRKFLADLEVVPKMNLHITLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASKHNIDDPFTKPLTAKVFE

Query:  GHLESLGLR
        GHL SLGLR
Subjt:  GHLESLGLR

A0A5D3DE90 Gag/pol protein1.7e-14941.63Show/hide
Query:  MSSSFIQLLASDKLNGDNFRTLKSNLNTVLVTDDLRFVLTKECPPPPSLSATRTVRYTFDKLTRANDKAPVYILVNISDVLSKKHEGMITAKEIMGSLQA
        M+SS +QLLAS KLNGDN+   KSNLNT+LV D L FVLT+ECP  PS +A R  R  +D+  + N+KA VYIL ++ DVL+KKHE + TAKEIM SL+ 
Subjt:  MSSSFIQLLASDKLNGDNFRTLKSNLNTVLVTDDLRFVLTKECPPPPSLSATRTVRYTFDKLTRANDKAPVYILVNISDVLSKKHEGMITAKEIMGSLQA

Query:  MFGQPSSSVRYDALKYIYNSRMKEGTSVREHVLNMMVHFNVAEVNGVVIDENSQFRTNSVMNKIEYNLTTLFNELQTFETLMKSKGKKKEANVVTSK-KF
        MFGQP  S+R++A+KYIY  RMKEGTSVREHV++MM+HFN+AEVN   IDE +Q            NLT             KSKGK+ EANV T+K KF
Subjt:  MFGQPSSSVRYDALKYIYNSRMKEGTSVREHVLNMMVHFNVAEVNGVVIDENSQFRTNSVMNKIEYNLTTLFNELQTFETLMKSKGKKKEANVVTSK-KF

Query:  LRGSSSWTKSGPFFLRILVFRKRRRIMGRKKLPYARLKPQENVSIVVKMGTGRGTSQSP-------------LQKRKLRKKNKETSSWRQLADSQIT---
         RGSSS +K+GP      + +K +    ++    +  K      I +K+GTG   S                L K  L ++  +     QL D+ +    
Subjt:  LRGSSSWTKSGPFFLRILVFRKRRRIMGRKKLPYARLKPQENVSIVVKMGTGRGTSQSP-------------LQKRKLRKKNKETSSWRQLADSQIT---

Query:  ----------------LREKQNL-VRHGSIHDELRSAASSFWGYAVETVAYI-----LNMVHSKSVSETPYELWKW----RKGYSKETKRGLFYDPQEDK
                        LR K  L + H ++   +   A   + Y +  +        + ++H+KS S   ++ +K       GYSKE++ GLFYDPQE+K
Subjt:  ----------------LREKQNL-VRHGSIHDELRSAASSFWGYAVETVAYI-----LNMVHSKSVSETPYELWKW----RKGYSKETKRGLFYDPQEDK

Query:  V----------------------------------------------------------------------------------VIPDDCIEDPLTYKHVM
        V                                                                                  +IPDD IEDPL +K VM
Subjt:  V----------------------------------------------------------------------------------VIPDDCIEDPLTYKHVM

Query:  NDTDRDQWIKAMDLEMESMYFNLVWELVDQPDGGRPIRCK------------------------------------------------------------
        ND D DQWIK+MDL+MESMY N VW LVDQP+  R I CK                                                            
Subjt:  NDTDRDQWIKAMDLEMESMYFNLVWELVDQPDGGRPIRCK------------------------------------------------------------

Query:  --------------SSQHGIHLSKEQCLKTPQEVEDMRHIPYASVVGSLMYAMLCTRPDICYVVGIVSRIQ-----------------------------
                        ++GIHLSKEQC KTPQEVEDM +IPYASVV SLMY MLCTRPDI Y VGIVS+ Q                             
Subjt:  --------------SSQHGIHLSKEQCLKTPQEVEDMRHIPYASVVGSLMYAMLCTRPDICYVVGIVSRIQ-----------------------------

Query:  --DLILTGYTNSSFQTDVDSRKSTSRLVFTLNEGAIVRRSIKQGCITDSTMDAEYVAACEAAKEVEWLRKFLADLEVVPKMNLHITLYCDNSGAVTNSKE
          DLILTGYT+S FQ D D+R ST   VFTLN GA+V RSIKQ CI DSTM+AEYVA CEA KEV WL+KFL  L+ VP M+L I LYCDNSGA+ NS+E
Subjt:  --DLILTGYTNSSFQTDVDSRKSTSRLVFTLNEGAIVRRSIKQGCITDSTMDAEYVAACEAAKEVEWLRKFLADLEVVPKMNLHITLYCDNSGAVTNSKE

Query:  PRSHKRGKHIERKYHLIREIVQRGDVIVTQIASKHNIDDPFTKPLTAKVFEGHLESLGLR
        PRSHKRGKHIERKYHLIRE V R DVIVT+I+ + N+ DPFTK LT KVFE HL  LGLR
Subjt:  PRSHKRGKHIERKYHLIREIVQRGDVIVTQIASKHNIDDPFTKPLTAKVFEGHLESLGLR

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.6e-2232.29Show/hide
Query:  PYASVVGSLMYAMLCTRPDICYVVGIVSRIQD------------------------LI----------LTGYTNSSFQTDVDSRKSTSRLVFTLNE-GAI
        P  S++G LMY MLCTRPD+   V I+SR                           LI          + GY +S +      RKST+  +F + +   I
Subjt:  PYASVVGSLMYAMLCTRPDICYVVGIVSRIQD------------------------LI----------LTGYTNSSFQTDVDSRKSTSRLVFTLNE-GAI

Query:  VRRSIKQGCITDSTMDAEYVAACEAAKEVEWLRKFLADLEVVPKMNLHITLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASKHN
           + +Q  +  S+ +AEY+A  EA +E  WL+  L  + +  K+   I +Y DN G ++ +  P  HKR KHI+ KYH  RE VQ   + +  I +++ 
Subjt:  VRRSIKQGCITDSTMDAEYVAACEAAKEVEWLRKFLADLEVVPKMNLHITLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASKHN

Query:  IDDPFTKPLTAKVFEGHLESLGL
        + D FTKPL A  F    + LGL
Subjt:  IDDPFTKPLTAKVFEGHLESLGL

P0CV72 Secreted RxLR effector protein 1611.6e-1133.08Show/hide
Query:  MRHIPYASVVGSLMYAMLCTRPDICYVVGIVS--------------------------------RIQDLILTGYTNSSFQTDVDSRKSTSRLVFTLNEGA
        M+++PY S VG++MY M+ TRPD+   VG++S                                R     L GY+++ +  DV+SR+STS  +F LN G 
Subjt:  MRHIPYASVVGSLMYAMLCTRPDICYVVGIVS--------------------------------RIQDLILTGYTNSSFQTDVDSRKSTSRLVFTLNEGA

Query:  IVRRSIKQGCITDSTMDAEYVAACEAAKEVEWL
        +  RS KQ  +  S+ + EY+A  EA +E  WL
Subjt:  IVRRSIKQGCITDSTMDAEYVAACEAAKEVEWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-949.8e-3834.93Show/hide
Query:  LTYKHVMNDTDRDQWIKAMDLEMESMYFNLVWELVDQPDGGRPIRCKSSQHGIHLSKEQCLKTPQEVEDMRHIPYASVVGSLMYAMLCTRPDICYVVGIV
        L  K V   T R  W+       +  Y   V E  +  +  +P+    + H + LSK+ C  T +E  +M  +PY+S VGSLMYAM+CTRPDI + VG+V
Subjt:  LTYKHVMNDTDRDQWIKAMDLEMESMYFNLVWELVDQPDGGRPIRCKSSQHGIHLSKEQCLKTPQEVEDMRHIPYASVVGSLMYAMLCTRPDICYVVGIV

Query:  SRI-------------------------------QDLILTGYTNSSFQTDVDSRKSTSRLVFTLNEGAIVRRSIKQGCITDSTMDAEYVAACEAAKEVEW
        SR                                 D IL GYT++    D+D+RKS++  +FT + GAI  +S  Q C+  ST +AEY+AA E  KE+ W
Subjt:  SRI-------------------------------QDLILTGYTNSSFQTDVDSRKSTSRLVFTLNEGAIVRRSIKQGCITDSTMDAEYVAACEAAKEVEW

Query:  LRKFLADLEVVPKMNLHITLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASKHNIDDPFTKPLTAKVFEGHLESLGL
        L++FL +L +  K      +YCD+  A+  SK    H R KHI+ +YH IRE+V    + V +I++  N  D  TK +    FE   E +G+
Subjt:  LRKFLADLEVVPKMNLHITLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASKHNIDDPFTKPLTAKVFEGHLESLGL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.9e-0539.68Show/hide
Query:  PQEDKVVIPDDCIEDPLTYKHVMNDTDRDQWIKAMDLEMESMYFNLVWELVDQPDGGRPIRCK
        P  + V+I DD   +P + K V++  +++Q +KAM  EMES+  N  ++LV+ P G RP++CK
Subjt:  PQEDKVVIPDDCIEDPLTYKHVMNDTDRDQWIKAMDLEMESMYFNLVWELVDQPDGGRPIRCK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.2e-1627.85Show/hide
Query:  YASVVGSLMYAMLCTRPDICYVVGIVSRI--------------------------------QDLILTGYTNSSFQTDVDSRKSTSRLVFTLNEGAIVRRS
        Y  +VGSL Y +  TRPDI Y V  +S+                                   L L  Y+++ +  D D   ST+  +  L    I   S
Subjt:  YASVVGSLMYAMLCTRPDICYVVGIVSRI--------------------------------QDLILTGYTNSSFQTDVDSRKSTSRLVFTLNEGAIVRRS

Query:  IKQGCITDSTMDAEYVAACEAAKEVEWLRKFLADLEVVPKMNLHITLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASKHNIDDP
         KQ  +  S+ +AEY +    + E++W+   L +L +  ++     +YCDN GA      P  H R KHI   YH IR  VQ G + V  +++   + D 
Subjt:  IKQGCITDSTMDAEYVAACEAAKEVEWLRKFLADLEVVPKMNLHITLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASKHNIDDP

Query:  FTKPLTAKVFEGHLESLGL
         TKPL+   F+     +G+
Subjt:  FTKPLTAKVFEGHLESLGL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.4e-1527.4Show/hide
Query:  YASVVGSLMYAMLCTRPDICYVVGIVSRI--------------------------------QDLILTGYTNSSFQTDVDSRKSTSRLVFTLNEGAIVRRS
        Y  +VGSL Y +  TRPD+ Y V  +S+                                   L L  Y+++ +  D D   ST+  +  L    I   S
Subjt:  YASVVGSLMYAMLCTRPDICYVVGIVSRI--------------------------------QDLILTGYTNSSFQTDVDSRKSTSRLVFTLNEGAIVRRS

Query:  IKQGCITDSTMDAEYVAACEAAKEVEWLRKFLADLEVVPKMNLHITLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASKHNIDDP
         KQ  +  S+ +AEY +    + E++W+   L +L +  +++    +YCDN GA      P  H R KHI   YH IR  VQ G + V  +++   + D 
Subjt:  IKQGCITDSTMDAEYVAACEAAKEVEWLRKFLADLEVVPKMNLHITLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASKHNIDDP

Query:  FTKPLTAKVFEGHLESLGL
         TKPL+   F+     +G+
Subjt:  FTKPLTAKVFEGHLESLGL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.4e-1027.37Show/hide
Query:  YASVVGSLMYAMLCTRPDICYVVGIVSRIQ--------------------------------DLILTGYTNSSFQTDVDSRKSTSRLVFTLNEGAIVRRS
        Y  ++G LMY  + TR DI + V  +S+                                  ++ L  ++++SFQ+  D+R+ST+     L    I  +S
Subjt:  YASVVGSLMYAMLCTRPDICYVVGIVSRIQ--------------------------------DLILTGYTNSSFQTDVDSRKSTSRLVFTLNEGAIVRRS

Query:  IKQGCITDSTMDAEYVAACEAAKEVEWLRKFLADLEVVPKMNLHITLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIRE
         KQ  ++ S+ +AEY A   A  E+ WL +F  +L++   ++    L+CDN+ A+  +     H+R KHIE   H +RE
Subjt:  IKQGCITDSTMDAEYVAACEAAKEVEWLRKFLADLEVVPKMNLHITLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIRE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGAGCTCTTTTATTCAATTACTCGCCTCAGACAAACTTAACGGCGATAACTTTCGAACTTTGAAATCAAACTTGAATACGGTTCTTGTAACTGATGATTTAAGGTT
CGTCTTGACGAAGGAATGTCCTCCCCCTCCCAGCTTGTCTGCAACCCGAACAGTTCGGTATACATTTGACAAATTGACTAGGGCTAATGATAAAGCCCCGGTCTACATCT
TAGTCAACATATCTGATGTGTTGTCTAAGAAACATGAGGGCATGATCACCGCAAAGGAGATCATGGGATCATTACAGGCCATGTTTGGACAACCGTCCTCGTCGGTTCGT
TATGATGCTCTCAAGTACATTTACAACTCTCGAATGAAGGAGGGAACTTCTGTTAGGGAGCATGTCCTTAATATGATGGTCCACTTCAACGTGGCAGAGGTAAACGGGGT
TGTCATAGATGAGAACAGTCAGTTCCGCACCAATTCGGTGATGAACAAGATAGAGTATAACCTGACTACTCTATTTAACGAGCTACAGACATTCGAGACCCTAATGAAAT
CAAAGGGAAAAAAGAAGGAGGCAAACGTTGTCACTTCGAAGAAGTTTCTAAGAGGATCGTCCTCATGGACTAAGTCTGGTCCTTTTTTTCTAAGAATATTGGTATTCAGA
AAAAGAAGAAGGATAATGGGAAGGAAAAAGCTCCCATACGCAAGGTTAAAGCCACAGGAAAATGTTTCTATTGTGGTGAAGATGGGCACTGGAAGAGGAACTTCCCAAAG
TCCCTTGCAGAAAAGAAAGCTGAGAAAGAAAAATAAGGAAACTAGTTCCTGGAGGCAGCTTGCTGATAGTCAAATAACTCTCAGGGAGAAACAGAACCTTGTTAGACATG
GTTCGATCCATGATGAGCTACGCTCAGCTGCCAGTTCGTTTTGGGGATATGCAGTAGAAACTGTTGCATATATTTTGAACATGGTTCATTCTAAGAGTGTTTCAGAAACA
CCCTATGAGTTATGGAAATGGCGTAAAGGATACTCCAAGGAAACGAAACGTGGTCTGTTTTATGATCCTCAGGAGGACAAGGTCGTCATACCTGATGATTGCATTGAGGA
TCCACTGACCTATAAACATGTAATGAATGATACAGATAGGGACCAGTGGATTAAAGCCATGGACCTCGAAATGGAGTCAATGTACTTCAATTTAGTCTGGGAACTTGTAG
ATCAACCAGATGGTGGAAGACCTATCAGATGCAAATCAAGCCAGCATGGAATTCACCTATCTAAGGAACAGTGTCTTAAGACACCTCAAGAAGTTGAGGATATGAGACAC
ATTCCCTATGCATCAGTTGTCGGTAGTCTGATGTATGCCATGCTATGTACTCGACCCGACATATGCTATGTAGTGGGAATTGTCAGCAGGATACAGGATTTGATCCTTAC
AGGATACACTAACTCTAGTTTTCAGACTGATGTAGATTCGAGGAAATCCACATCGAGATTAGTGTTCACTCTTAACGAAGGAGCTATAGTACGGAGGAGTATAAAGCAAG
GTTGTATTACTGACTCTACCATGGATGCTGAGTATGTCGCAGCTTGCGAAGCAGCGAAGGAGGTTGAATGGCTTAGGAAGTTCTTGGCTGATTTGGAAGTTGTTCCAAAG
ATGAATTTGCATATCACCCTTTATTGTGATAATAGTGGTGCAGTGACAAATTCTAAAGAACCTAGAAGCCATAAGCGCGGAAAGCACATAGAGCGCAAATATCATCTCAT
CAGAGAGATTGTGCAACGAGGAGACGTGATCGTCACGCAGATTGCTTCGAAGCACAACATTGATGATCCATTTACAAAGCCCCTCACGGCTAAAGTGTTTGAGGGACACC
TAGAGAGTCTAGGACTACGAGTTGTACAATAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGAGCTCTTTTATTCAATTACTCGCCTCAGACAAACTTAACGGCGATAACTTTCGAACTTTGAAATCAAACTTGAATACGGTTCTTGTAACTGATGATTTAAGGTT
CGTCTTGACGAAGGAATGTCCTCCCCCTCCCAGCTTGTCTGCAACCCGAACAGTTCGGTATACATTTGACAAATTGACTAGGGCTAATGATAAAGCCCCGGTCTACATCT
TAGTCAACATATCTGATGTGTTGTCTAAGAAACATGAGGGCATGATCACCGCAAAGGAGATCATGGGATCATTACAGGCCATGTTTGGACAACCGTCCTCGTCGGTTCGT
TATGATGCTCTCAAGTACATTTACAACTCTCGAATGAAGGAGGGAACTTCTGTTAGGGAGCATGTCCTTAATATGATGGTCCACTTCAACGTGGCAGAGGTAAACGGGGT
TGTCATAGATGAGAACAGTCAGTTCCGCACCAATTCGGTGATGAACAAGATAGAGTATAACCTGACTACTCTATTTAACGAGCTACAGACATTCGAGACCCTAATGAAAT
CAAAGGGAAAAAAGAAGGAGGCAAACGTTGTCACTTCGAAGAAGTTTCTAAGAGGATCGTCCTCATGGACTAAGTCTGGTCCTTTTTTTCTAAGAATATTGGTATTCAGA
AAAAGAAGAAGGATAATGGGAAGGAAAAAGCTCCCATACGCAAGGTTAAAGCCACAGGAAAATGTTTCTATTGTGGTGAAGATGGGCACTGGAAGAGGAACTTCCCAAAG
TCCCTTGCAGAAAAGAAAGCTGAGAAAGAAAAATAAGGAAACTAGTTCCTGGAGGCAGCTTGCTGATAGTCAAATAACTCTCAGGGAGAAACAGAACCTTGTTAGACATG
GTTCGATCCATGATGAGCTACGCTCAGCTGCCAGTTCGTTTTGGGGATATGCAGTAGAAACTGTTGCATATATTTTGAACATGGTTCATTCTAAGAGTGTTTCAGAAACA
CCCTATGAGTTATGGAAATGGCGTAAAGGATACTCCAAGGAAACGAAACGTGGTCTGTTTTATGATCCTCAGGAGGACAAGGTCGTCATACCTGATGATTGCATTGAGGA
TCCACTGACCTATAAACATGTAATGAATGATACAGATAGGGACCAGTGGATTAAAGCCATGGACCTCGAAATGGAGTCAATGTACTTCAATTTAGTCTGGGAACTTGTAG
ATCAACCAGATGGTGGAAGACCTATCAGATGCAAATCAAGCCAGCATGGAATTCACCTATCTAAGGAACAGTGTCTTAAGACACCTCAAGAAGTTGAGGATATGAGACAC
ATTCCCTATGCATCAGTTGTCGGTAGTCTGATGTATGCCATGCTATGTACTCGACCCGACATATGCTATGTAGTGGGAATTGTCAGCAGGATACAGGATTTGATCCTTAC
AGGATACACTAACTCTAGTTTTCAGACTGATGTAGATTCGAGGAAATCCACATCGAGATTAGTGTTCACTCTTAACGAAGGAGCTATAGTACGGAGGAGTATAAAGCAAG
GTTGTATTACTGACTCTACCATGGATGCTGAGTATGTCGCAGCTTGCGAAGCAGCGAAGGAGGTTGAATGGCTTAGGAAGTTCTTGGCTGATTTGGAAGTTGTTCCAAAG
ATGAATTTGCATATCACCCTTTATTGTGATAATAGTGGTGCAGTGACAAATTCTAAAGAACCTAGAAGCCATAAGCGCGGAAAGCACATAGAGCGCAAATATCATCTCAT
CAGAGAGATTGTGCAACGAGGAGACGTGATCGTCACGCAGATTGCTTCGAAGCACAACATTGATGATCCATTTACAAAGCCCCTCACGGCTAAAGTGTTTGAGGGACACC
TAGAGAGTCTAGGACTACGAGTTGTACAATAG
Protein sequenceShow/hide protein sequence
MSSSFIQLLASDKLNGDNFRTLKSNLNTVLVTDDLRFVLTKECPPPPSLSATRTVRYTFDKLTRANDKAPVYILVNISDVLSKKHEGMITAKEIMGSLQAMFGQPSSSVR
YDALKYIYNSRMKEGTSVREHVLNMMVHFNVAEVNGVVIDENSQFRTNSVMNKIEYNLTTLFNELQTFETLMKSKGKKKEANVVTSKKFLRGSSSWTKSGPFFLRILVFR
KRRRIMGRKKLPYARLKPQENVSIVVKMGTGRGTSQSPLQKRKLRKKNKETSSWRQLADSQITLREKQNLVRHGSIHDELRSAASSFWGYAVETVAYILNMVHSKSVSET
PYELWKWRKGYSKETKRGLFYDPQEDKVVIPDDCIEDPLTYKHVMNDTDRDQWIKAMDLEMESMYFNLVWELVDQPDGGRPIRCKSSQHGIHLSKEQCLKTPQEVEDMRH
IPYASVVGSLMYAMLCTRPDICYVVGIVSRIQDLILTGYTNSSFQTDVDSRKSTSRLVFTLNEGAIVRRSIKQGCITDSTMDAEYVAACEAAKEVEWLRKFLADLEVVPK
MNLHITLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIASKHNIDDPFTKPLTAKVFEGHLESLGLRVVQ