; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0015119 (gene) of Snake gourd v1 genome

Gene IDTan0015119
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG03:31672179..31675895
RNA-Seq ExpressionTan0015119
SyntenyTan0015119
Gene Ontology termsNA
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026233.1 gag/pol protein [Cucumis melo var. makuwa]7.5e-23446.96Show/hide
Query:  MSSAIIALLASDKLVGDNYQSWKNNINTIFVTDDIKFVLSEECPQMPSSTPSRSVRVAYDRWIRANEKAKVYIIASMSDVLAKKHELMVTAKEIMESLQE
        +SSA + +LA+DKL G+NY SWKN IN + + DD++FVL E+CPQ+P++  +R+VR  Y+RW +ANEKA+ YI+AS+S+VLAKKHE M+TA+EIM+SLQE
Subjt:  MSSAIIALLASDKLVGDNYQSWKNNINTIFVTDDIKFVLSEECPQMPSSTPSRSVRVAYDRWIRANEKAKVYIIASMSDVLAKKHELMVTAKEIMESLQE

Query:  MFGQQSFQVWHDSLKYVFNARMKEGSFVREHVLDMMTHFNLAEMNGASIDESSQESFILETLPKSFLQLRINVVMNKISYTLTTLLNELQNFQSLNRVKA
        MFGQ S+Q+ HD+LKY++NARM EG+ VREHVL+MM HFN+A MN A IDE+SQ SFILE+LP+SFLQ R N VMNKI+YTLTTLLNELQ F+SL ++K 
Subjt:  MFGQQSFQVWHDSLKYVFNARMKEGSFVREHVLDMMTHFNLAEMNGASIDESSQESFILETLPKSFLQLRINVVMNKISYTLTTLLNELQNFQSLNRVKA

Query:  SESEANVA--YRSYHRGSTSGTKPVAPSRPKGKKRMKRG----KTDRGAAHKGKKVKDVAEKGKGFHRNGADIGRGGIP---------------------
         + EANVA   R +HRGSTSGTK +  S    K + K+G    K +  AA   KK K  A KG  F  N     +   P                     
Subjt:  SESEANVA--YRSYHRGSTSGTKPVAPSRPKGKKRMKRG----KTDRGAAHKGKKVKDVAEKGKGFHRNGADIGRGGIP---------------------

Query:  NSGREEESRGLIPGSRC-------------------------------------ERFKEYKTEVENLLGKSLKTLRLDRGGEYMDIEFQDYMIEHEITPQ
           R +E   L+    C                                     E+FKEYK EVEN L K++KT R DRGGEYMD++FQ+Y++E  I  Q
Subjt:  NSGREEESRGLIPGSRC-------------------------------------ERFKEYKTEVENLLGKSLKTLRLDRGGEYMDIEFQDYMIEHEITPQ

Query:  LSK-------------------------------------TVETAVYILNNVPSKSVCETPFEVLSGCKSSLHHFRIWGCPAHVLVSNPKKLEPRSKLCL
        LS                                       V+TAVYILN VPSKSV ETP ++ +G K SL HFRIWGCPAHVL +NPKKLEPRSKLCL
Subjt:  LSK-------------------------------------TVETAVYILNNVPSKSVCETPFEVLSGCKSSLHHFRIWGCPAHVLVSNPKKLEPRSKLCL

Query:  FVGYPKETRGGLFYDPKKNRVCVSTNATFLEEDHVRDHLPRSKFVLNEMD----STSAKVVNGASTSTSVVDPSTSSQIRSPE-LGMPRRSGRVVRQRDR
        FVGYPK TRGG FYDPK N+V VSTNATFLEEDH+R+H PRSK VLNE+       S +VV   S    VV   +S++   P+ L  PRRSGRV     R
Subjt:  FVGYPKETRGGLFYDPKKNRVCVSTNATFLEEDHVRDHLPRSKFVLNEMD----STSAKVVNGASTSTSVVDPSTSSQIRSPE-LGMPRRSGRVVRQRDR

Query:  YMGLAETSVIAPDDDCEDPLTYDQAMVDVGKDEWIKAIN-------------------GV------YVLQFCLGACGSTG--------------------
        YM L ET  +  D D EDPLT+ +AM DV KDEWIKA+N                   GV      ++ +   GA G                       
Subjt:  YMGLAETSVIAPDDDCEDPLTYDQAMVDVGKDEWIKAIN-------------------GV------YVLQFCLGACGSTG--------------------

Query:  --------------------------W----------------------------GQEQKVCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPCV
                                  W                            GQEQK+C+L+RSIYGLKQASRSWNIRFD AIKSY FDQ VD PCV
Subjt:  --------------------------W----------------------------GQEQKVCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPCV

Query:  YKKIVDKTVAFLVLYVHDILLFGN----------------------------------------------------------ENSKKGLLPFRHGVHLSK
        YK+I++K+VAFLVLYV DILL GN                                                          +NSK+GLLPFRHGV LSK
Subjt:  YKKIVDKTVAFLVLYVHDILLFGN----------------------------------------------------------ENSKKGLLPFRHGVHLSK

Query:  DQCPKTPQDVQNMRRIPYASAIGSLMYAMLCTRPDICYAVGIVNRYQSNSGLDHWTTWGFDLTYGIQIRLFGPI-----RILGNPLGVDLHYEWRSR---
        +QCPKTPQDV+ MR IPYASA+GSLMYAMLCTRPDICYAVGIV+RYQSN GL HWT     L Y  + R +  +      IL      D   +  SR   
Subjt:  DQCPKTPQDVQNMRRIPYASAIGSLMYAMLCTRPDICYAVGIVNRYQSNSGLDHWTTWGFDLTYGIQIRLFGPI-----RILGNPLGVDLHYEWRSR---

Query:  ------------VWRSIKQGCIADSTMEAEYVVACEAAKEVVWLRKFMTDLEVVPNMILPITLFCDKRG
                    VWRSIKQGCIADSTMEAEYV ACEAAKE VWLR F+ DLEVVPNM  PITL+CD  G
Subjt:  ------------VWRSIKQGCIADSTMEAEYVVACEAAKEVVWLRKFMTDLEVVPNMILPITLFCDKRG

KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-21640.08Show/hide
Query:  MSSAIIALLASDKLVGDNYQSWKNNINTIFVTDDIKFVLSEECPQMPSSTPSRSVRVAYDRWIRANEKAKVYIIASMSDVLAKKHELMVTAKEIMESLQE
        M+SA + +LA+DKL G+NY SWKN INT+ + DD++FVL EECPQ+P++  +R+VR  Y+RW +ANEKA+ YI+AS+S+VLAKKHE M+TA+EIM+SLQE
Subjt:  MSSAIIALLASDKLVGDNYQSWKNNINTIFVTDDIKFVLSEECPQMPSSTPSRSVRVAYDRWIRANEKAKVYIIASMSDVLAKKHELMVTAKEIMESLQE

Query:  MFGQQSFQVWHDSLKYVFNARMKEGSFVREHVLDMMTHFNLAEMNGASIDESSQESFILETLPKSFLQLRINVVMNKISYTLTTLLNELQNFQSLNRVKA
        MFGQ S+Q+ HD+LKY++NARM EG+ VREHVL+MM HFN+AEMNGA IDE+SQ SFILE+LP+SFLQ R N VMNKI+YTLTTLLNELQ F+SL ++K 
Subjt:  MFGQQSFQVWHDSLKYVFNARMKEGSFVREHVLDMMTHFNLAEMNGASIDESSQESFILETLPKSFLQLRINVVMNKISYTLTTLLNELQNFQSLNRVKA

Query:  SESEANVA--YRSYHRGSTSGTKPVAPSRPKGKKRMKRG----KTDRGAAHKGKKVKD---------------------VAEKGKG--------------
         + EANVA   R +HRGSTSGTK +  S    K + K+G    K +  AA   KK K                      +AEK K               
Subjt:  SESEANVA--YRSYHRGSTSGTKPVAPSRPKGKKRMKRG----KTDRGAAHKGKKVKD---------------------VAEKGKG--------------

Query:  ---------------------------------------------------------------------------------------------------F
                                                                                                            
Subjt:  ---------------------------------------------------------------------------------------------------F

Query:  HRNGADIGRGGIPNS-------------------------------------------------------------------------------------
        ++NG +I    + N+                                                                                     
Subjt:  HRNGADIGRGGIPNS-------------------------------------------------------------------------------------

Query:  ---GREEESRGLIPGSRC-------------------------------------ERFKEYKTEVENLLGKSLKTLRLDRGGEYMDIEFQDYMIEHEITP
            R +E   L+    C                                     E+FKEYK EVEN L K++KT R DRGGEYMD++FQ+Y++E  I  
Subjt:  ---GREEESRGLIPGSRC-------------------------------------ERFKEYKTEVENLLGKSLKTLRLDRGGEYMDIEFQDYMIEHEITP

Query:  QLSK-------------------------------------TVETAVYILNNVPSKSVCETPFEVLSGCKSSLHHFRIWGCPAHVLVSNPKKLEPRSKLC
        QLS                                       V+TAVYILN VPSKSV ETP ++ +G K SL HFRIWGCPAHVL +NPKKLEPRSKLC
Subjt:  QLSK-------------------------------------TVETAVYILNNVPSKSVCETPFEVLSGCKSSLHHFRIWGCPAHVLVSNPKKLEPRSKLC

Query:  LFVGYPKETRGGLFYDPKKNRVCVSTNATFLEEDHVRDHLPRSKFVLNEMD----STSAKVVNGASTSTSVVDPSTSSQIRSPE-LGMPRRSGRVVRQRD
        LFVGYPK TRGG FYDPK N+V VSTNATFLEEDH+R+H PRSK VLNE+       S +VV   S  T VV   +S++   P+ L  PRRSGRV     
Subjt:  LFVGYPKETRGGLFYDPKKNRVCVSTNATFLEEDHVRDHLPRSKFVLNEMD----STSAKVVNGASTSTSVVDPSTSSQIRSPE-LGMPRRSGRVVRQRD

Query:  RYMGLAETSVIAPDDDCEDPLTYDQAMVDVGKDEWIKAIN-------------------GV------YVLQFCLGACGSTG-------------------
        RYM L ET  +  D D EDPLT+ +AM DV KDEWIKA+N                   GV      ++ +   GA G                      
Subjt:  RYMGLAETSVIAPDDDCEDPLTYDQAMVDVGKDEWIKAIN-------------------GV------YVLQFCLGACGSTG-------------------

Query:  ---------------------------W----------------------------GQEQKVCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPC
                                   W                            GQEQK+C+L+RSIYGLKQASRSWNIRFD AIKSY FDQ VD PC
Subjt:  ---------------------------W----------------------------GQEQKVCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPC

Query:  VYKKIVDKTVAFLVLYVHDILLFGN----------------------------------------------------------ENSKKGLLPFRHGVHLS
        VYK+I++K+VAFLVLYV DILL GN                                                          +NSK+GLLPFRHGV LS
Subjt:  VYKKIVDKTVAFLVLYVHDILLFGN----------------------------------------------------------ENSKKGLLPFRHGVHLS

Query:  KDQCPKTPQDVQNMRRIPYASAIGSLMYAMLCTRPDICYAVGIVNRYQSNSGLDHWTTWGFDLTYGIQIRLFGPI-----RILGNPLGVDLHYEWRSR--
        K+QCPKTPQDV+ MR IPYASA+GSLMYAMLCTRPDICYAVGIV+RYQSN GL HWT     L Y  + R +  +      IL      D   +  SR  
Subjt:  KDQCPKTPQDVQNMRRIPYASAIGSLMYAMLCTRPDICYAVGIVNRYQSNSGLDHWTTWGFDLTYGIQIRLFGPI-----RILGNPLGVDLHYEWRSR--

Query:  -------------VWRSIKQGCIADSTMEAEYVVACEAAKEVVWLRKFMTDLEVVPNMILPITLFCDKRG
                     VWRSIKQGCIADSTMEAEYV ACEAAKE VWLR F+ DLEVVPNM  PITL+CD  G
Subjt:  -------------VWRSIKQGCIADSTMEAEYVVACEAAKEVVWLRKFMTDLEVVPNMILPITLFCDKRG

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-21640.08Show/hide
Query:  MSSAIIALLASDKLVGDNYQSWKNNINTIFVTDDIKFVLSEECPQMPSSTPSRSVRVAYDRWIRANEKAKVYIIASMSDVLAKKHELMVTAKEIMESLQE
        M+SA + +LA+DKL G+NY SWKN INT+ + DD++FVL EECPQ+P++  +R+VR  Y+RW +ANEKA+ YI+AS+S+VLAKKHE M+TA+EIM+SLQE
Subjt:  MSSAIIALLASDKLVGDNYQSWKNNINTIFVTDDIKFVLSEECPQMPSSTPSRSVRVAYDRWIRANEKAKVYIIASMSDVLAKKHELMVTAKEIMESLQE

Query:  MFGQQSFQVWHDSLKYVFNARMKEGSFVREHVLDMMTHFNLAEMNGASIDESSQESFILETLPKSFLQLRINVVMNKISYTLTTLLNELQNFQSLNRVKA
        MFGQ S+Q+ HD+LKY++NARM EG+ VREHVL+MM HFN+AEMNGA IDE+SQ SFILE+LP+SFLQ R N VMNKI+YTLTTLLNELQ F+SL ++K 
Subjt:  MFGQQSFQVWHDSLKYVFNARMKEGSFVREHVLDMMTHFNLAEMNGASIDESSQESFILETLPKSFLQLRINVVMNKISYTLTTLLNELQNFQSLNRVKA

Query:  SESEANVA--YRSYHRGSTSGTKPVAPSRPKGKKRMKRG----KTDRGAAHKGKKVKD---------------------VAEKGKG--------------
         + EANVA   R +HRGSTSGTK +  S    K + K+G    K +  AA   KK K                      +AEK K               
Subjt:  SESEANVA--YRSYHRGSTSGTKPVAPSRPKGKKRMKRG----KTDRGAAHKGKKVKD---------------------VAEKGKG--------------

Query:  ---------------------------------------------------------------------------------------------------F
                                                                                                            
Subjt:  ---------------------------------------------------------------------------------------------------F

Query:  HRNGADIGRGGIPNS-------------------------------------------------------------------------------------
        ++NG +I    + N+                                                                                     
Subjt:  HRNGADIGRGGIPNS-------------------------------------------------------------------------------------

Query:  ---GREEESRGLIPGSRC-------------------------------------ERFKEYKTEVENLLGKSLKTLRLDRGGEYMDIEFQDYMIEHEITP
            R +E   L+    C                                     E+FKEYK EVEN L K++KT R DRGGEYMD++FQ+Y++E  I  
Subjt:  ---GREEESRGLIPGSRC-------------------------------------ERFKEYKTEVENLLGKSLKTLRLDRGGEYMDIEFQDYMIEHEITP

Query:  QLSK-------------------------------------TVETAVYILNNVPSKSVCETPFEVLSGCKSSLHHFRIWGCPAHVLVSNPKKLEPRSKLC
        QLS                                       V+TAVYILN VPSKSV ETP ++ +G K SL HFRIWGCPAHVL +NPKKLEPRSKLC
Subjt:  QLSK-------------------------------------TVETAVYILNNVPSKSVCETPFEVLSGCKSSLHHFRIWGCPAHVLVSNPKKLEPRSKLC

Query:  LFVGYPKETRGGLFYDPKKNRVCVSTNATFLEEDHVRDHLPRSKFVLNEMD----STSAKVVNGASTSTSVVDPSTSSQIRSPE-LGMPRRSGRVVRQRD
        LFVGYPK TRGG FYDPK N+V VSTNATFLEEDH+R+H PRSK VLNE+       S +VV   S  T VV   +S++   P+ L  PRRSGRV     
Subjt:  LFVGYPKETRGGLFYDPKKNRVCVSTNATFLEEDHVRDHLPRSKFVLNEMD----STSAKVVNGASTSTSVVDPSTSSQIRSPE-LGMPRRSGRVVRQRD

Query:  RYMGLAETSVIAPDDDCEDPLTYDQAMVDVGKDEWIKAIN-------------------GV------YVLQFCLGACGSTG-------------------
        RYM L ET  +  D D EDPLT+ +AM DV KDEWIKA+N                   GV      ++ +   GA G                      
Subjt:  RYMGLAETSVIAPDDDCEDPLTYDQAMVDVGKDEWIKAIN-------------------GV------YVLQFCLGACGSTG-------------------

Query:  ---------------------------W----------------------------GQEQKVCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPC
                                   W                            GQEQK+C+L+RSIYGLKQASRSWNIRFD AIKSY FDQ VD PC
Subjt:  ---------------------------W----------------------------GQEQKVCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPC

Query:  VYKKIVDKTVAFLVLYVHDILLFGN----------------------------------------------------------ENSKKGLLPFRHGVHLS
        VYK+I++K+VAFLVLYV DILL GN                                                          +NSK+GLLPFRHGV LS
Subjt:  VYKKIVDKTVAFLVLYVHDILLFGN----------------------------------------------------------ENSKKGLLPFRHGVHLS

Query:  KDQCPKTPQDVQNMRRIPYASAIGSLMYAMLCTRPDICYAVGIVNRYQSNSGLDHWTTWGFDLTYGIQIRLFGPI-----RILGNPLGVDLHYEWRSR--
        K+QCPKTPQDV+ MR IPYASA+GSLMYAMLCTRPDICYAVGIV+RYQSN GL HWT     L Y  + R +  +      IL      D   +  SR  
Subjt:  KDQCPKTPQDVQNMRRIPYASAIGSLMYAMLCTRPDICYAVGIVNRYQSNSGLDHWTTWGFDLTYGIQIRLFGPI-----RILGNPLGVDLHYEWRSR--

Query:  -------------VWRSIKQGCIADSTMEAEYVVACEAAKEVVWLRKFMTDLEVVPNMILPITLFCDKRG
                     VWRSIKQGCIADSTMEAEYV ACEAAKE VWLR F+ DLEVVPNM  PITL+CD  G
Subjt:  -------------VWRSIKQGCIADSTMEAEYVVACEAAKEVVWLRKFMTDLEVVPNMILPITLFCDKRG

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-21640.08Show/hide
Query:  MSSAIIALLASDKLVGDNYQSWKNNINTIFVTDDIKFVLSEECPQMPSSTPSRSVRVAYDRWIRANEKAKVYIIASMSDVLAKKHELMVTAKEIMESLQE
        M+SA + +LA+DKL G+NY SWKN INT+ + DD++FVL EECPQ+P++  +R+VR  Y+RW +ANEKA+ YI+AS+S+VLAKKHE M+TA+EIM+SLQE
Subjt:  MSSAIIALLASDKLVGDNYQSWKNNINTIFVTDDIKFVLSEECPQMPSSTPSRSVRVAYDRWIRANEKAKVYIIASMSDVLAKKHELMVTAKEIMESLQE

Query:  MFGQQSFQVWHDSLKYVFNARMKEGSFVREHVLDMMTHFNLAEMNGASIDESSQESFILETLPKSFLQLRINVVMNKISYTLTTLLNELQNFQSLNRVKA
        MFGQ S+Q+ HD+LKY++NARM EG+ VREHVL+MM HFN+AEMNGA IDE+SQ SFILE+LP+SFLQ R N VMNKI+YTLTTLLNELQ F+SL ++K 
Subjt:  MFGQQSFQVWHDSLKYVFNARMKEGSFVREHVLDMMTHFNLAEMNGASIDESSQESFILETLPKSFLQLRINVVMNKISYTLTTLLNELQNFQSLNRVKA

Query:  SESEANVA--YRSYHRGSTSGTKPVAPSRPKGKKRMKRG----KTDRGAAHKGKKVKD---------------------VAEKGKG--------------
         + EANVA   R +HRGSTSGTK +  S    K + K+G    K +  AA   KK K                      +AEK K               
Subjt:  SESEANVA--YRSYHRGSTSGTKPVAPSRPKGKKRMKRG----KTDRGAAHKGKKVKD---------------------VAEKGKG--------------

Query:  ---------------------------------------------------------------------------------------------------F
                                                                                                            
Subjt:  ---------------------------------------------------------------------------------------------------F

Query:  HRNGADIGRGGIPNS-------------------------------------------------------------------------------------
        ++NG +I    + N+                                                                                     
Subjt:  HRNGADIGRGGIPNS-------------------------------------------------------------------------------------

Query:  ---GREEESRGLIPGSRC-------------------------------------ERFKEYKTEVENLLGKSLKTLRLDRGGEYMDIEFQDYMIEHEITP
            R +E   L+    C                                     E+FKEYK EVEN L K++KT R DRGGEYMD++FQ+Y++E  I  
Subjt:  ---GREEESRGLIPGSRC-------------------------------------ERFKEYKTEVENLLGKSLKTLRLDRGGEYMDIEFQDYMIEHEITP

Query:  QLSK-------------------------------------TVETAVYILNNVPSKSVCETPFEVLSGCKSSLHHFRIWGCPAHVLVSNPKKLEPRSKLC
        QLS                                       V+TAVYILN VPSKSV ETP ++ +G K SL HFRIWGCPAHVL +NPKKLEPRSKLC
Subjt:  QLSK-------------------------------------TVETAVYILNNVPSKSVCETPFEVLSGCKSSLHHFRIWGCPAHVLVSNPKKLEPRSKLC

Query:  LFVGYPKETRGGLFYDPKKNRVCVSTNATFLEEDHVRDHLPRSKFVLNEMD----STSAKVVNGASTSTSVVDPSTSSQIRSPE-LGMPRRSGRVVRQRD
        LFVGYPK TRGG FYDPK N+V VSTNATFLEEDH+R+H PRSK VLNE+       S +VV   S  T VV   +S++   P+ L  PRRSGRV     
Subjt:  LFVGYPKETRGGLFYDPKKNRVCVSTNATFLEEDHVRDHLPRSKFVLNEMD----STSAKVVNGASTSTSVVDPSTSSQIRSPE-LGMPRRSGRVVRQRD

Query:  RYMGLAETSVIAPDDDCEDPLTYDQAMVDVGKDEWIKAIN-------------------GV------YVLQFCLGACGSTG-------------------
        RYM L ET  +  D D EDPLT+ +AM DV KDEWIKA+N                   GV      ++ +   GA G                      
Subjt:  RYMGLAETSVIAPDDDCEDPLTYDQAMVDVGKDEWIKAIN-------------------GV------YVLQFCLGACGSTG-------------------

Query:  ---------------------------W----------------------------GQEQKVCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPC
                                   W                            GQEQK+C+L+RSIYGLKQASRSWNIRFD AIKSY FDQ VD PC
Subjt:  ---------------------------W----------------------------GQEQKVCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPC

Query:  VYKKIVDKTVAFLVLYVHDILLFGN----------------------------------------------------------ENSKKGLLPFRHGVHLS
        VYK+I++K+VAFLVLYV DILL GN                                                          +NSK+GLLPFRHGV LS
Subjt:  VYKKIVDKTVAFLVLYVHDILLFGN----------------------------------------------------------ENSKKGLLPFRHGVHLS

Query:  KDQCPKTPQDVQNMRRIPYASAIGSLMYAMLCTRPDICYAVGIVNRYQSNSGLDHWTTWGFDLTYGIQIRLFGPI-----RILGNPLGVDLHYEWRSR--
        K+QCPKTPQDV+ MR IPYASA+GSLMYAMLCTRPDICYAVGIV+RYQSN GL HWT     L Y  + R +  +      IL      D   +  SR  
Subjt:  KDQCPKTPQDVQNMRRIPYASAIGSLMYAMLCTRPDICYAVGIVNRYQSNSGLDHWTTWGFDLTYGIQIRLFGPI-----RILGNPLGVDLHYEWRSR--

Query:  -------------VWRSIKQGCIADSTMEAEYVVACEAAKEVVWLRKFMTDLEVVPNMILPITLFCDKRG
                     VWRSIKQGCIADSTMEAEYV ACEAAKE VWLR F+ DLEVVPNM  PITL+CD  G
Subjt:  -------------VWRSIKQGCIADSTMEAEYVVACEAAKEVVWLRKFMTDLEVVPNMILPITLFCDKRG

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-21640.08Show/hide
Query:  MSSAIIALLASDKLVGDNYQSWKNNINTIFVTDDIKFVLSEECPQMPSSTPSRSVRVAYDRWIRANEKAKVYIIASMSDVLAKKHELMVTAKEIMESLQE
        M+SA + +LA+DKL G+NY SWKN INT+ + DD++FVL EECPQ+P++  +R+VR  Y+RW +ANEKA+ YI+AS+S+VLAKKHE M+TA+EIM+SLQE
Subjt:  MSSAIIALLASDKLVGDNYQSWKNNINTIFVTDDIKFVLSEECPQMPSSTPSRSVRVAYDRWIRANEKAKVYIIASMSDVLAKKHELMVTAKEIMESLQE

Query:  MFGQQSFQVWHDSLKYVFNARMKEGSFVREHVLDMMTHFNLAEMNGASIDESSQESFILETLPKSFLQLRINVVMNKISYTLTTLLNELQNFQSLNRVKA
        MFGQ S+Q+ HD+LKY++NARM EG+ VREHVL+MM HFN+AEMNGA IDE+SQ SFILE+LP+SFLQ R N VMNKI+YTLTTLLNELQ F+SL ++K 
Subjt:  MFGQQSFQVWHDSLKYVFNARMKEGSFVREHVLDMMTHFNLAEMNGASIDESSQESFILETLPKSFLQLRINVVMNKISYTLTTLLNELQNFQSLNRVKA

Query:  SESEANVA--YRSYHRGSTSGTKPVAPSRPKGKKRMKRG----KTDRGAAHKGKKVKD---------------------VAEKGKG--------------
         + EANVA   R +HRGSTSGTK +  S    K + K+G    K +  AA   KK K                      +AEK K               
Subjt:  SESEANVA--YRSYHRGSTSGTKPVAPSRPKGKKRMKRG----KTDRGAAHKGKKVKD---------------------VAEKGKG--------------

Query:  ---------------------------------------------------------------------------------------------------F
                                                                                                            
Subjt:  ---------------------------------------------------------------------------------------------------F

Query:  HRNGADIGRGGIPNS-------------------------------------------------------------------------------------
        ++NG +I    + N+                                                                                     
Subjt:  HRNGADIGRGGIPNS-------------------------------------------------------------------------------------

Query:  ---GREEESRGLIPGSRC-------------------------------------ERFKEYKTEVENLLGKSLKTLRLDRGGEYMDIEFQDYMIEHEITP
            R +E   L+    C                                     E+FKEYK EVEN L K++KT R DRGGEYMD++FQ+Y++E  I  
Subjt:  ---GREEESRGLIPGSRC-------------------------------------ERFKEYKTEVENLLGKSLKTLRLDRGGEYMDIEFQDYMIEHEITP

Query:  QLSK-------------------------------------TVETAVYILNNVPSKSVCETPFEVLSGCKSSLHHFRIWGCPAHVLVSNPKKLEPRSKLC
        QLS                                       V+TAVYILN VPSKSV ETP ++ +G K SL HFRIWGCPAHVL +NPKKLEPRSKLC
Subjt:  QLSK-------------------------------------TVETAVYILNNVPSKSVCETPFEVLSGCKSSLHHFRIWGCPAHVLVSNPKKLEPRSKLC

Query:  LFVGYPKETRGGLFYDPKKNRVCVSTNATFLEEDHVRDHLPRSKFVLNEMD----STSAKVVNGASTSTSVVDPSTSSQIRSPE-LGMPRRSGRVVRQRD
        LFVGYPK TRGG FYDPK N+V VSTNATFLEEDH+R+H PRSK VLNE+       S +VV   S  T VV   +S++   P+ L  PRRSGRV     
Subjt:  LFVGYPKETRGGLFYDPKKNRVCVSTNATFLEEDHVRDHLPRSKFVLNEMD----STSAKVVNGASTSTSVVDPSTSSQIRSPE-LGMPRRSGRVVRQRD

Query:  RYMGLAETSVIAPDDDCEDPLTYDQAMVDVGKDEWIKAIN-------------------GV------YVLQFCLGACGSTG-------------------
        RYM L ET  +  D D EDPLT+ +AM DV KDEWIKA+N                   GV      ++ +   GA G                      
Subjt:  RYMGLAETSVIAPDDDCEDPLTYDQAMVDVGKDEWIKAIN-------------------GV------YVLQFCLGACGSTG-------------------

Query:  ---------------------------W----------------------------GQEQKVCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPC
                                   W                            GQEQK+C+L+RSIYGLKQASRSWNIRFD AIKSY FDQ VD PC
Subjt:  ---------------------------W----------------------------GQEQKVCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPC

Query:  VYKKIVDKTVAFLVLYVHDILLFGN----------------------------------------------------------ENSKKGLLPFRHGVHLS
        VYK+I++K+VAFLVLYV DILL GN                                                          +NSK+GLLPFRHGV LS
Subjt:  VYKKIVDKTVAFLVLYVHDILLFGN----------------------------------------------------------ENSKKGLLPFRHGVHLS

Query:  KDQCPKTPQDVQNMRRIPYASAIGSLMYAMLCTRPDICYAVGIVNRYQSNSGLDHWTTWGFDLTYGIQIRLFGPI-----RILGNPLGVDLHYEWRSR--
        K+QCPKTPQDV+ MR IPYASA+GSLMYAMLCTRPDICYAVGIV+RYQSN GL HWT     L Y  + R +  +      IL      D   +  SR  
Subjt:  KDQCPKTPQDVQNMRRIPYASAIGSLMYAMLCTRPDICYAVGIVNRYQSNSGLDHWTTWGFDLTYGIQIRLFGPI-----RILGNPLGVDLHYEWRSR--

Query:  -------------VWRSIKQGCIADSTMEAEYVVACEAAKEVVWLRKFMTDLEVVPNMILPITLFCDKRG
                     VWRSIKQGCIADSTMEAEYV ACEAAKE VWLR F+ DLEVVPNM  PITL+CD  G
Subjt:  -------------VWRSIKQGCIADSTMEAEYVVACEAAKEVVWLRKFMTDLEVVPNMILPITLFCDKRG

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein6.9e-21740.08Show/hide
Query:  MSSAIIALLASDKLVGDNYQSWKNNINTIFVTDDIKFVLSEECPQMPSSTPSRSVRVAYDRWIRANEKAKVYIIASMSDVLAKKHELMVTAKEIMESLQE
        M+SA + +LA+DKL G+NY SWKN INT+ + DD++FVL EECPQ+P++  +R+VR  Y+RW +ANEKA+ YI+AS+S+VLAKKHE M+TA+EIM+SLQE
Subjt:  MSSAIIALLASDKLVGDNYQSWKNNINTIFVTDDIKFVLSEECPQMPSSTPSRSVRVAYDRWIRANEKAKVYIIASMSDVLAKKHELMVTAKEIMESLQE

Query:  MFGQQSFQVWHDSLKYVFNARMKEGSFVREHVLDMMTHFNLAEMNGASIDESSQESFILETLPKSFLQLRINVVMNKISYTLTTLLNELQNFQSLNRVKA
        MFGQ S+Q+ HD+LKY++NARM EG+ VREHVL+MM HFN+AEMNGA IDE+SQ SFILE+LP+SFLQ R N VMNKI+YTLTTLLNELQ F+SL ++K 
Subjt:  MFGQQSFQVWHDSLKYVFNARMKEGSFVREHVLDMMTHFNLAEMNGASIDESSQESFILETLPKSFLQLRINVVMNKISYTLTTLLNELQNFQSLNRVKA

Query:  SESEANVA--YRSYHRGSTSGTKPVAPSRPKGKKRMKRG----KTDRGAAHKGKKVKD---------------------VAEKGKG--------------
         + EANVA   R +HRGSTSGTK +  S    K + K+G    K +  AA   KK K                      +AEK K               
Subjt:  SESEANVA--YRSYHRGSTSGTKPVAPSRPKGKKRMKRG----KTDRGAAHKGKKVKD---------------------VAEKGKG--------------

Query:  ---------------------------------------------------------------------------------------------------F
                                                                                                            
Subjt:  ---------------------------------------------------------------------------------------------------F

Query:  HRNGADIGRGGIPNS-------------------------------------------------------------------------------------
        ++NG +I    + N+                                                                                     
Subjt:  HRNGADIGRGGIPNS-------------------------------------------------------------------------------------

Query:  ---GREEESRGLIPGSRC-------------------------------------ERFKEYKTEVENLLGKSLKTLRLDRGGEYMDIEFQDYMIEHEITP
            R +E   L+    C                                     E+FKEYK EVEN L K++KT R DRGGEYMD++FQ+Y++E  I  
Subjt:  ---GREEESRGLIPGSRC-------------------------------------ERFKEYKTEVENLLGKSLKTLRLDRGGEYMDIEFQDYMIEHEITP

Query:  QLSK-------------------------------------TVETAVYILNNVPSKSVCETPFEVLSGCKSSLHHFRIWGCPAHVLVSNPKKLEPRSKLC
        QLS                                       V+TAVYILN VPSKSV ETP ++ +G K SL HFRIWGCPAHVL +NPKKLEPRSKLC
Subjt:  QLSK-------------------------------------TVETAVYILNNVPSKSVCETPFEVLSGCKSSLHHFRIWGCPAHVLVSNPKKLEPRSKLC

Query:  LFVGYPKETRGGLFYDPKKNRVCVSTNATFLEEDHVRDHLPRSKFVLNEMD----STSAKVVNGASTSTSVVDPSTSSQIRSPE-LGMPRRSGRVVRQRD
        LFVGYPK TRGG FYDPK N+V VSTNATFLEEDH+R+H PRSK VLNE+       S +VV   S  T VV   +S++   P+ L  PRRSGRV     
Subjt:  LFVGYPKETRGGLFYDPKKNRVCVSTNATFLEEDHVRDHLPRSKFVLNEMD----STSAKVVNGASTSTSVVDPSTSSQIRSPE-LGMPRRSGRVVRQRD

Query:  RYMGLAETSVIAPDDDCEDPLTYDQAMVDVGKDEWIKAIN-------------------GV------YVLQFCLGACGSTG-------------------
        RYM L ET  +  D D EDPLT+ +AM DV KDEWIKA+N                   GV      ++ +   GA G                      
Subjt:  RYMGLAETSVIAPDDDCEDPLTYDQAMVDVGKDEWIKAIN-------------------GV------YVLQFCLGACGSTG-------------------

Query:  ---------------------------W----------------------------GQEQKVCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPC
                                   W                            GQEQK+C+L+RSIYGLKQASRSWNIRFD AIKSY FDQ VD PC
Subjt:  ---------------------------W----------------------------GQEQKVCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPC

Query:  VYKKIVDKTVAFLVLYVHDILLFGN----------------------------------------------------------ENSKKGLLPFRHGVHLS
        VYK+I++K+VAFLVLYV DILL GN                                                          +NSK+GLLPFRHGV LS
Subjt:  VYKKIVDKTVAFLVLYVHDILLFGN----------------------------------------------------------ENSKKGLLPFRHGVHLS

Query:  KDQCPKTPQDVQNMRRIPYASAIGSLMYAMLCTRPDICYAVGIVNRYQSNSGLDHWTTWGFDLTYGIQIRLFGPI-----RILGNPLGVDLHYEWRSR--
        K+QCPKTPQDV+ MR IPYASA+GSLMYAMLCTRPDICYAVGIV+RYQSN GL HWT     L Y  + R +  +      IL      D   +  SR  
Subjt:  KDQCPKTPQDVQNMRRIPYASAIGSLMYAMLCTRPDICYAVGIVNRYQSNSGLDHWTTWGFDLTYGIQIRLFGPI-----RILGNPLGVDLHYEWRSR--

Query:  -------------VWRSIKQGCIADSTMEAEYVVACEAAKEVVWLRKFMTDLEVVPNMILPITLFCDKRG
                     VWRSIKQGCIADSTMEAEYV ACEAAKE VWLR F+ DLEVVPNM  PITL+CD  G
Subjt:  -------------VWRSIKQGCIADSTMEAEYVVACEAAKEVVWLRKFMTDLEVVPNMILPITLFCDKRG

A0A5A7SNP8 Gag/pol protein3.6e-23446.96Show/hide
Query:  MSSAIIALLASDKLVGDNYQSWKNNINTIFVTDDIKFVLSEECPQMPSSTPSRSVRVAYDRWIRANEKAKVYIIASMSDVLAKKHELMVTAKEIMESLQE
        +SSA + +LA+DKL G+NY SWKN IN + + DD++FVL E+CPQ+P++  +R+VR  Y+RW +ANEKA+ YI+AS+S+VLAKKHE M+TA+EIM+SLQE
Subjt:  MSSAIIALLASDKLVGDNYQSWKNNINTIFVTDDIKFVLSEECPQMPSSTPSRSVRVAYDRWIRANEKAKVYIIASMSDVLAKKHELMVTAKEIMESLQE

Query:  MFGQQSFQVWHDSLKYVFNARMKEGSFVREHVLDMMTHFNLAEMNGASIDESSQESFILETLPKSFLQLRINVVMNKISYTLTTLLNELQNFQSLNRVKA
        MFGQ S+Q+ HD+LKY++NARM EG+ VREHVL+MM HFN+A MN A IDE+SQ SFILE+LP+SFLQ R N VMNKI+YTLTTLLNELQ F+SL ++K 
Subjt:  MFGQQSFQVWHDSLKYVFNARMKEGSFVREHVLDMMTHFNLAEMNGASIDESSQESFILETLPKSFLQLRINVVMNKISYTLTTLLNELQNFQSLNRVKA

Query:  SESEANVA--YRSYHRGSTSGTKPVAPSRPKGKKRMKRG----KTDRGAAHKGKKVKDVAEKGKGFHRNGADIGRGGIP---------------------
         + EANVA   R +HRGSTSGTK +  S    K + K+G    K +  AA   KK K  A KG  F  N     +   P                     
Subjt:  SESEANVA--YRSYHRGSTSGTKPVAPSRPKGKKRMKRG----KTDRGAAHKGKKVKDVAEKGKGFHRNGADIGRGGIP---------------------

Query:  NSGREEESRGLIPGSRC-------------------------------------ERFKEYKTEVENLLGKSLKTLRLDRGGEYMDIEFQDYMIEHEITPQ
           R +E   L+    C                                     E+FKEYK EVEN L K++KT R DRGGEYMD++FQ+Y++E  I  Q
Subjt:  NSGREEESRGLIPGSRC-------------------------------------ERFKEYKTEVENLLGKSLKTLRLDRGGEYMDIEFQDYMIEHEITPQ

Query:  LSK-------------------------------------TVETAVYILNNVPSKSVCETPFEVLSGCKSSLHHFRIWGCPAHVLVSNPKKLEPRSKLCL
        LS                                       V+TAVYILN VPSKSV ETP ++ +G K SL HFRIWGCPAHVL +NPKKLEPRSKLCL
Subjt:  LSK-------------------------------------TVETAVYILNNVPSKSVCETPFEVLSGCKSSLHHFRIWGCPAHVLVSNPKKLEPRSKLCL

Query:  FVGYPKETRGGLFYDPKKNRVCVSTNATFLEEDHVRDHLPRSKFVLNEMD----STSAKVVNGASTSTSVVDPSTSSQIRSPE-LGMPRRSGRVVRQRDR
        FVGYPK TRGG FYDPK N+V VSTNATFLEEDH+R+H PRSK VLNE+       S +VV   S    VV   +S++   P+ L  PRRSGRV     R
Subjt:  FVGYPKETRGGLFYDPKKNRVCVSTNATFLEEDHVRDHLPRSKFVLNEMD----STSAKVVNGASTSTSVVDPSTSSQIRSPE-LGMPRRSGRVVRQRDR

Query:  YMGLAETSVIAPDDDCEDPLTYDQAMVDVGKDEWIKAIN-------------------GV------YVLQFCLGACGSTG--------------------
        YM L ET  +  D D EDPLT+ +AM DV KDEWIKA+N                   GV      ++ +   GA G                       
Subjt:  YMGLAETSVIAPDDDCEDPLTYDQAMVDVGKDEWIKAIN-------------------GV------YVLQFCLGACGSTG--------------------

Query:  --------------------------W----------------------------GQEQKVCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPCV
                                  W                            GQEQK+C+L+RSIYGLKQASRSWNIRFD AIKSY FDQ VD PCV
Subjt:  --------------------------W----------------------------GQEQKVCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPCV

Query:  YKKIVDKTVAFLVLYVHDILLFGN----------------------------------------------------------ENSKKGLLPFRHGVHLSK
        YK+I++K+VAFLVLYV DILL GN                                                          +NSK+GLLPFRHGV LSK
Subjt:  YKKIVDKTVAFLVLYVHDILLFGN----------------------------------------------------------ENSKKGLLPFRHGVHLSK

Query:  DQCPKTPQDVQNMRRIPYASAIGSLMYAMLCTRPDICYAVGIVNRYQSNSGLDHWTTWGFDLTYGIQIRLFGPI-----RILGNPLGVDLHYEWRSR---
        +QCPKTPQDV+ MR IPYASA+GSLMYAMLCTRPDICYAVGIV+RYQSN GL HWT     L Y  + R +  +      IL      D   +  SR   
Subjt:  DQCPKTPQDVQNMRRIPYASAIGSLMYAMLCTRPDICYAVGIVNRYQSNSGLDHWTTWGFDLTYGIQIRLFGPI-----RILGNPLGVDLHYEWRSR---

Query:  ------------VWRSIKQGCIADSTMEAEYVVACEAAKEVVWLRKFMTDLEVVPNMILPITLFCDKRG
                    VWRSIKQGCIADSTMEAEYV ACEAAKE VWLR F+ DLEVVPNM  PITL+CD  G
Subjt:  ------------VWRSIKQGCIADSTMEAEYVVACEAAKEVVWLRKFMTDLEVVPNMILPITLFCDKRG

A0A5A7U869 Gag/pol protein1.4e-21449.02Show/hide
Query:  MSSAIIALLASDKLVGDNYQSWKNNINTIFVTDDIKFVLSEECPQMPSSTPSRSVRVAYDRWIRANEKAKVYIIASMSDVLAKKHELMVTAKEIMESLQE
        M+SA + +LA+DKL G+NY SWKN INT+ + DD++FVL EECPQ+P++  +R+VR  Y+RW +ANEKA+ YI+AS+S+VLAKKHE M+TA+EIM+SLQE
Subjt:  MSSAIIALLASDKLVGDNYQSWKNNINTIFVTDDIKFVLSEECPQMPSSTPSRSVRVAYDRWIRANEKAKVYIIASMSDVLAKKHELMVTAKEIMESLQE

Query:  MFGQQSFQVWHDSLKYVFNARMKEGSFVREHVLDMMTHFNLAEMNGASIDESSQESFILETLPKSFLQLRINVVMNKISYTLTTLLNELQNFQSLNRVKA
        MFGQ S+Q+ HD+LKY++NARM EG+ VREHVL+MM HFN+AEMNGA IDE+SQ SFILE+LP+SFLQ R N VMNKI+YTLTTLLNELQ F+SL ++K 
Subjt:  MFGQQSFQVWHDSLKYVFNARMKEGSFVREHVLDMMTHFNLAEMNGASIDESSQESFILETLPKSFLQLRINVVMNKISYTLTTLLNELQNFQSLNRVKA

Query:  SESEANVA--YRSYHRGSTSGTKPVAPSRPKGKKRMKRG----KTDRGAAHKGKKVKDVAEKGKGFHRNGADIGRGGIP---------------------
         + EANVA   R +HRGSTSGTK +  S    K + K+G    K +  AA   KK K  A KG  FH N     +   P                     
Subjt:  SESEANVA--YRSYHRGSTSGTKPVAPSRPKGKKRMKRG----KTDRGAAHKGKKVKDVAEKGKGFHRNGADIGRGGIP---------------------

Query:  NSGREEESRGLIPGSRC-------------------------------------ERFKEYKTEVENLLGKSLKTLRLDRGGEYMDIEFQDYMIEHEITPQ
           + +E   L+    C                                     E+FKEYK EVEN L K++KT R DRGGEYMD++FQ+Y++E  I  Q
Subjt:  NSGREEESRGLIPGSRC-------------------------------------ERFKEYKTEVENLLGKSLKTLRLDRGGEYMDIEFQDYMIEHEITPQ

Query:  LS-------------------------------------KTVETAVYILNNVPSKSVCETPFEVLSGCKSSLHHFRIWGCPAHVLVSNPKKLEPRSKLCL
        LS                                       V+TAVYILN VPSKSV +TP ++ +G K SL HFRIWGCPAHVL +NPKKLEPRSKLCL
Subjt:  LS-------------------------------------KTVETAVYILNNVPSKSVCETPFEVLSGCKSSLHHFRIWGCPAHVLVSNPKKLEPRSKLCL

Query:  FVGYPKETRGGLFYDPKKNRVCVSTNATFLEEDHVRDHLPRSKFVLNEMD----STSAKVVNGASTSTSVVDPSTSSQIRSPE-LGMPRRSGRVVRQRDR
        FVGYPK TRGG FYDPK N+V VSTNATFLEEDH+R+H PRSK VLNE+       S +VV   S  T VV   +S++   P+ L  PRRSGRV      
Subjt:  FVGYPKETRGGLFYDPKKNRVCVSTNATFLEEDHVRDHLPRSKFVLNEMD----STSAKVVNGASTSTSVVDPSTSSQIRSPE-LGMPRRSGRVVRQRDR

Query:  YMGLAETSVIAPDDDCEDPLTYDQAMVDVGKDEWIKAINGVYVLQFCLGACGSTG---------------------------------W-----------
        YM L ET  +  D D EDPLT+ +AM DV KDEWIKA+N    L+  L A G T                                  W           
Subjt:  YMGLAETSVIAPDDDCEDPLTYDQAMVDVGKDEWIKAINGVYVLQFCLGACGSTG---------------------------------W-----------

Query:  -----------------GQEQKVCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPCVYKKIVDKTVAFLVLYVHDILLFGN--------------
                         GQEQK+C+L+RSIYGLKQASRSWNIRFD AIKSY FDQ VD PCVYK+I++K+VAFLVLYV DILL GN              
Subjt:  -----------------GQEQKVCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPCVYKKIVDKTVAFLVLYVHDILLFGN--------------

Query:  --------------------------------------------ENSKKGLLPFRHGVHLSKDQCPKTPQDVQNMRRIPYASAIGSLMYAMLCTRPDICY
                                                    +NSK+GLLPFRHGV LSK+QCPKTPQDV+ MR IPYASAIGSLMYAMLCTRPDICY
Subjt:  --------------------------------------------ENSKKGLLPFRHGVHLSKDQCPKTPQDVQNMRRIPYASAIGSLMYAMLCTRPDICY

Query:  AVGIVNRYQSNSGLDH
        AVGIV+RYQSN GL H
Subjt:  AVGIVNRYQSNSGLDH

A0A5A7VH46 Gag/pol protein2.4e-21446.92Show/hide
Query:  MSSAIIALLASDKLVGDNYQSWKNNINTIFVTDDIKFVLSEECPQMPSSTPSRSVRVAYDRWIRANEKAKVYIIASMSDVLAKKHELMVTAKEIMESLQE
        M+SA + +LA+DKL G+NY SWKN INT+ + DD++FVL EECPQ+P++  +R+VR  Y+RW +ANEKA+ YI+AS+S+VLAKKHE ++TA+EIM+SLQE
Subjt:  MSSAIIALLASDKLVGDNYQSWKNNINTIFVTDDIKFVLSEECPQMPSSTPSRSVRVAYDRWIRANEKAKVYIIASMSDVLAKKHELMVTAKEIMESLQE

Query:  MFGQQSFQVWHDSLKYVFNARMKEGSFVREHVLDMMTHFNLAEMNGASIDESSQESFILETLPKSFLQLRINVVMNKISYTLTTLLNELQNFQSLNRVKA
        MFGQ S+Q+ HD+LKY++NARM EG+ VREHVL+MM HF++AEMNGA IDE+SQ SFILE+LP+SFLQ R N VMNKI+YTLTTLLNELQ F+SL ++K 
Subjt:  MFGQQSFQVWHDSLKYVFNARMKEGSFVREHVLDMMTHFNLAEMNGASIDESSQESFILETLPKSFLQLRINVVMNKISYTLTTLLNELQNFQSLNRVKA

Query:  SESEANVA--YRSYHRGSTSGTKPVAPSRPKGKKRMKRG----KTDRGAAHKGKKVKDVAEKGKGFHRNGADIGRGGIPNSGREEESRGLIPGSRCERFK
         + EANVA   R +HRGSTSGTK +  S    K + K+G    K +  AA   KK K  A KG  FH N     +   P    E++          E+FK
Subjt:  SESEANVA--YRSYHRGSTSGTKPVAPSRPKGKKRMKRG----KTDRGAAHKGKKVKDVAEKGKGFHRNGADIGRGGIPNSGREEESRGLIPGSRCERFK

Query:  EYKTEVENLLGKSLKTLRLDRGGEYMDIEFQDYMIEHEITPQLSK-------------------------------------TVETAVYILNNVPSKSVC
        EYK EVEN L K++KT R DR GEYMD++FQ+Y++E  I  QLS                                       V+TAVYILN VPSKSV 
Subjt:  EYKTEVENLLGKSLKTLRLDRGGEYMDIEFQDYMIEHEITPQLSK-------------------------------------TVETAVYILNNVPSKSVC

Query:  ETPFEVLSGCKSSLHHFRIWGCPAHVLVSNPKKLEPRSKLCLFVGYPKETRGGLFYDPKKNRVCVSTNATFLEEDHVRDHLPRSKFVLNEMD----STSA
        ETP ++ +G K SL HFRIWGC AHVL +NPKKLEPRSKLCLFVGYPK TRGG FYDPK N+V VSTNATFLEEDH+R+H PRSK VLNE+       S 
Subjt:  ETPFEVLSGCKSSLHHFRIWGCPAHVLVSNPKKLEPRSKLCLFVGYPKETRGGLFYDPKKNRVCVSTNATFLEEDHVRDHLPRSKFVLNEMD----STSA

Query:  KVVNGASTSTSVVDPSTSSQIRSPE-LGMPRRSGRVVRQRDRYMGLAETSVIAPDDDCEDPLTYDQAMVDVGKDEWIKAIN-------------------
        +VV   S    VV   +S++   P+ L  PRRSGRV     RYM L ET  +  D D EDPLT+ +AM DV KDEWIKA+N                   
Subjt:  KVVNGASTSTSVVDPSTSSQIRSPE-LGMPRRSGRVVRQRDRYMGLAETSVIAPDDDCEDPLTYDQAMVDVGKDEWIKAIN-------------------

Query:  -------------------GVYVLQFCLGACGSTG---------------------------------W----------------------------GQE
                            V   Q  L A G T                                  W                            GQE
Subjt:  -------------------GVYVLQFCLGACGSTG---------------------------------W----------------------------GQE

Query:  QKVCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPCVYKKIVDKTVAFLVLYVHDILLFGN----------------------------------
        QK+C+L+RSIYGLKQASRSWNIRFD AIKSY FDQ VD PCVYK+I++K+VAFLVLYV DILL  N                                  
Subjt:  QKVCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPCVYKKIVDKTVAFLVLYVHDILLFGN----------------------------------

Query:  ------------------------ENSKKGLLPFRHGVHLSKDQCPKTPQDVQNMRRIPYASAIGSLMYAMLCTRPDICYAVGIVNRYQSNSGLDHWTTW
                                +NSK+GLLPFRHGV LSK+QCPKTPQDV+ MR       + +L   M  TR D     G       +  L  +T  
Subjt:  ------------------------ENSKKGLLPFRHGVHLSKDQCPKTPQDVQNMRRIPYASAIGSLMYAMLCTRPDICYAVGIVNRYQSNSGLDHWTTW

Query:  GFDLTYGIQIRLFGPIRILGNPLGVDLHYEWRSRVWRSIKQGCIADSTMEAEYVVACEAAKEVVWLRKFMTDLEVVPNMILPITLFCDKRG
         F      +    G +  L             + VWRSIKQGCIADSTMEAEY  ACE AKE VWLRKF+ DLEVVPNM+ PITL+CD  G
Subjt:  GFDLTYGIQIRLFGPIRILGNPLGVDLHYEWRSRVWRSIKQGCIADSTMEAEYVVACEAAKEVVWLRKFMTDLEVVPNMILPITLFCDKRG

A0A5D3CPJ6 Gag/pol protein6.9e-21740.08Show/hide
Query:  MSSAIIALLASDKLVGDNYQSWKNNINTIFVTDDIKFVLSEECPQMPSSTPSRSVRVAYDRWIRANEKAKVYIIASMSDVLAKKHELMVTAKEIMESLQE
        M+SA + +LA+DKL G+NY SWKN INT+ + DD++FVL EECPQ+P++  +R+VR  Y+RW +ANEKA+ YI+AS+S+VLAKKHE M+TA+EIM+SLQE
Subjt:  MSSAIIALLASDKLVGDNYQSWKNNINTIFVTDDIKFVLSEECPQMPSSTPSRSVRVAYDRWIRANEKAKVYIIASMSDVLAKKHELMVTAKEIMESLQE

Query:  MFGQQSFQVWHDSLKYVFNARMKEGSFVREHVLDMMTHFNLAEMNGASIDESSQESFILETLPKSFLQLRINVVMNKISYTLTTLLNELQNFQSLNRVKA
        MFGQ S+Q+ HD+LKY++NARM EG+ VREHVL+MM HFN+AEMNGA IDE+SQ SFILE+LP+SFLQ R N VMNKI+YTLTTLLNELQ F+SL ++K 
Subjt:  MFGQQSFQVWHDSLKYVFNARMKEGSFVREHVLDMMTHFNLAEMNGASIDESSQESFILETLPKSFLQLRINVVMNKISYTLTTLLNELQNFQSLNRVKA

Query:  SESEANVA--YRSYHRGSTSGTKPVAPSRPKGKKRMKRG----KTDRGAAHKGKKVKD---------------------VAEKGKG--------------
         + EANVA   R +HRGSTSGTK +  S    K + K+G    K +  AA   KK K                      +AEK K               
Subjt:  SESEANVA--YRSYHRGSTSGTKPVAPSRPKGKKRMKRG----KTDRGAAHKGKKVKD---------------------VAEKGKG--------------

Query:  ---------------------------------------------------------------------------------------------------F
                                                                                                            
Subjt:  ---------------------------------------------------------------------------------------------------F

Query:  HRNGADIGRGGIPNS-------------------------------------------------------------------------------------
        ++NG +I    + N+                                                                                     
Subjt:  HRNGADIGRGGIPNS-------------------------------------------------------------------------------------

Query:  ---GREEESRGLIPGSRC-------------------------------------ERFKEYKTEVENLLGKSLKTLRLDRGGEYMDIEFQDYMIEHEITP
            R +E   L+    C                                     E+FKEYK EVEN L K++KT R DRGGEYMD++FQ+Y++E  I  
Subjt:  ---GREEESRGLIPGSRC-------------------------------------ERFKEYKTEVENLLGKSLKTLRLDRGGEYMDIEFQDYMIEHEITP

Query:  QLSK-------------------------------------TVETAVYILNNVPSKSVCETPFEVLSGCKSSLHHFRIWGCPAHVLVSNPKKLEPRSKLC
        QLS                                       V+TAVYILN VPSKSV ETP ++ +G K SL HFRIWGCPAHVL +NPKKLEPRSKLC
Subjt:  QLSK-------------------------------------TVETAVYILNNVPSKSVCETPFEVLSGCKSSLHHFRIWGCPAHVLVSNPKKLEPRSKLC

Query:  LFVGYPKETRGGLFYDPKKNRVCVSTNATFLEEDHVRDHLPRSKFVLNEMD----STSAKVVNGASTSTSVVDPSTSSQIRSPE-LGMPRRSGRVVRQRD
        LFVGYPK TRGG FYDPK N+V VSTNATFLEEDH+R+H PRSK VLNE+       S +VV   S  T VV   +S++   P+ L  PRRSGRV     
Subjt:  LFVGYPKETRGGLFYDPKKNRVCVSTNATFLEEDHVRDHLPRSKFVLNEMD----STSAKVVNGASTSTSVVDPSTSSQIRSPE-LGMPRRSGRVVRQRD

Query:  RYMGLAETSVIAPDDDCEDPLTYDQAMVDVGKDEWIKAIN-------------------GV------YVLQFCLGACGSTG-------------------
        RYM L ET  +  D D EDPLT+ +AM DV KDEWIKA+N                   GV      ++ +   GA G                      
Subjt:  RYMGLAETSVIAPDDDCEDPLTYDQAMVDVGKDEWIKAIN-------------------GV------YVLQFCLGACGSTG-------------------

Query:  ---------------------------W----------------------------GQEQKVCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPC
                                   W                            GQEQK+C+L+RSIYGLKQASRSWNIRFD AIKSY FDQ VD PC
Subjt:  ---------------------------W----------------------------GQEQKVCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPC

Query:  VYKKIVDKTVAFLVLYVHDILLFGN----------------------------------------------------------ENSKKGLLPFRHGVHLS
        VYK+I++K+VAFLVLYV DILL GN                                                          +NSK+GLLPFRHGV LS
Subjt:  VYKKIVDKTVAFLVLYVHDILLFGN----------------------------------------------------------ENSKKGLLPFRHGVHLS

Query:  KDQCPKTPQDVQNMRRIPYASAIGSLMYAMLCTRPDICYAVGIVNRYQSNSGLDHWTTWGFDLTYGIQIRLFGPI-----RILGNPLGVDLHYEWRSR--
        K+QCPKTPQDV+ MR IPYASA+GSLMYAMLCTRPDICYAVGIV+RYQSN GL HWT     L Y  + R +  +      IL      D   +  SR  
Subjt:  KDQCPKTPQDVQNMRRIPYASAIGSLMYAMLCTRPDICYAVGIVNRYQSNSGLDHWTTWGFDLTYGIQIRLFGPI-----RILGNPLGVDLHYEWRSR--

Query:  -------------VWRSIKQGCIADSTMEAEYVVACEAAKEVVWLRKFMTDLEVVPNMILPITLFCDKRG
                     VWRSIKQGCIADSTMEAEYV ACEAAKE VWLR F+ DLEVVPNM  PITL+CD  G
Subjt:  -------------VWRSIKQGCIADSTMEAEYVVACEAAKEVVWLRKFMTDLEVVPNMILPITLFCDKRG

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.1e-1726.3Show/hide
Query:  VCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPCVYKKIVDK----TVAFLVLYVHDILLFGNE-----NSKKGLL-PFR---------------
        VC+L+++IYGLKQA+R W   F++A+K  +F  +    C+Y  I+DK       +++LYV D+++   +     N K+ L+  FR               
Subjt:  VCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPCVYKKIVDK----TVAFLVLYVHDILLFGNE-----NSKKGLL-PFR---------------

Query:  ----HGVHLSKDQCPKTPQDVQNMR------------------------RIPYASAIGSLMYAMLCTRPDICYAVGIVNRYQSNSGLDHWTTWGFDLTY-
              ++LS+    K      NM                           P  S IG LMY MLCTRPD+  AV I++RY S +  + W      L Y 
Subjt:  ----HGVHLSKDQCPKTPQDVQNMR------------------------RIPYASAIGSLMYAMLCTRPDICYAVGIVNRYQSNSGLDHWTTWGFDLTY-

Query:  --GIQIRL-FGPIRILGNPLGVDLHYEWRSR--------------------VWRSIKQGCIADSTMEAEYVVACEAAKEVVWLRKFMTDLEVVPNMILPI
           I ++L F       N +   +  +W                        W + +Q  +A S+ EAEY+   EA +E +WL+  +T + +   +  PI
Subjt:  --GIQIRL-FGPIRILGNPLGVDLHYEWRSR--------------------VWRSIKQGCIADSTMEAEYVVACEAAKEVVWLRKFMTDLEVVPNMILPI

Query:  TLFCDKRG
         ++ D +G
Subjt:  TLFCDKRG

P0CV72 Secreted RxLR effector protein 1611.5e-1132.84Show/hide
Query:  MRRIPYASAIGSLMYAMLCTRPDICYAVGIVNRYQSNSGLDHWTTWGFDL-------TYGIQIRLFGPIRILGNPLGVDLHYEWRSR-------------
        M+ +PY SA+G++MY M+ TRPD+  AVG+++++ S+    HW      L       TYG++    G  +++G     D   +  SR             
Subjt:  MRRIPYASAIGSLMYAMLCTRPDICYAVGIVNRYQSNSGLDHWTTWGFDL-------TYGIQIRLFGPIRILGNPLGVDLHYEWRSR-------------

Query:  --VWRSIKQGCIADSTMEAEYVVACEAAKEVVWL
           WRS KQ  +A S+ E EY+   EA +E VWL
Subjt:  --VWRSIKQGCIADSTMEAEYVVACEAAKEVVWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.9e-4525.1Show/hide
Query:  FKEYKTEVENLLGKSLKTLRLDRGGEYMDIEFQDYMIEHEI--------TPQ-----------------------------LSKTVETAVYILNNVPSKS
        F+++   VE   G+ LK LR D GGEY   EF++Y   H I        TPQ                               + V+TA Y++N  PS  
Subjt:  FKEYKTEVENLLGKSLKTLRLDRGGEYMDIEFQDYMIEHEI--------TPQ-----------------------------LSKTVETAVYILNNVPSKS

Query:  VC-ETPFEVLSGCKSSLHHFRIWGCP--AHVLVSNPKKLEPRSKLCLFVGYPKETRGGLFYDPKKNRVCVSTNATFLEEDHVRDHLPRSKFVLN------
        +  E P  V +  + S  H +++GC   AHV      KL+ +S  C+F+GY  E  G   +DP K +V  S +  F  E  VR     S+ V N      
Subjt:  VC-ETPFEVLSGCKSSLHHFRIWGCP--AHVLVSNPKKLEPRSKLCLFVGYPKETRGGLFYDPKKNRVCVSTNATFLEEDHVRDHLPRSKFVLN------

Query:  -EMDSTSAKVVNGASTSTSVVDPSTS---------------SQIRSPELGMP-----RRSGRVVRQRDRYMGLAETSVIAPDDDCEDPLTYDQAMVDVGK
          + STS    +  ST+  V +                    ++  P  G       RRS R   +  RY     T  +   DD  +P +  + +    K
Subjt:  -EMDSTSAKVVNGASTSTSVVDPSTS---------------SQIRSPELGMP-----RRSGRVVRQRDRYMGLAETSVIAPDDDCEDPLTYDQAMVDVGK

Query:  DEWIKAI---------NGVYVL--------------QFCL---GAC------------------------------------------------------
        ++ +KA+         NG Y L               F L   G C                                                      
Subjt:  DEWIKAI---------NGVYVL--------------QFCL---GAC------------------------------------------------------

Query:  -------------------GSTGWGQEQKVCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPCVY-KKIVDKTVAFLVLYVHDILLFGNE-----
                           G    G++  VC+L++S+YGLKQA R W ++FD  +KS  + +    PCVY K+  +     L+LYV D+L+ G +     
Subjt:  -------------------GSTGWGQEQKVCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPCVY-KKIVDKTVAFLVLYVHDILLFGNE-----

Query:  -----------------------------------------------------NSKKGLLPFRHGVHLSKDQCPKTPQDVQNMRRIPYASAIGSLMYAML
                                                             N+K    P    + LSK  CP T ++  NM ++PY+SA+GSLMYAM+
Subjt:  -----------------------------------------------------NSKKGLLPFRHGVHLSKDQCPKTPQDVQNMRRIPYASAIGSLMYAML

Query:  CTRPDICYAVGIVNRYQSNSGLDHWT-----------TWGFDLTYG---------IQIRLFGPIRILGNPLGVDLHYEWRSRVWRSIKQGCIADSTMEAE
        CTRPDI +AVG+V+R+  N G +HW            T G  L +G             + G I    +  G    +   +  W+S  Q C+A ST EAE
Subjt:  CTRPDICYAVGIVNRYQSNSGLDHWT-----------TWGFDLTYG---------IQIRLFGPIRILGNPLGVDLHYEWRSRVWRSIKQGCIADSTMEAE

Query:  YVVACEAAKEVVWLRKFMTDL
        Y+ A E  KE++WL++F+ +L
Subjt:  YVVACEAAKEVVWLRKFMTDL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.6e-0524.78Show/hide
Query:  ERFKEYKTEVENLLGKSLKTLRLDRGGEYMDI--EFQDYMIEHEITPQ-------LSK-----TVET---------------------AVYILNNVPSKS
        E F  +K  +EN     + T   D GGE++ +   F  + I H  +P        LS+      VET                     AVY++N +P+  
Subjt:  ERFKEYKTEVENLLGKSLKTLRLDRGGEYMDI--EFQDYMIEHEITPQ-------LSK-----TVET---------------------AVYILNNVPSKS

Query:  V-CETPFEVLSGCKSSLHHFRIWGCPAHVLVS--NPKKLEPRSKLCLFVGYPKETRGGLFYDPKKNRVCVSTNATFLEEDHVRDHLPRSKFVLNEMDSTS
        +  E+PF+ L G   +    R++GC  +  +   N  KL+ +S+ C+F+GY       L    + +R+ +S +  F E     +  P S ++     +T 
Subjt:  V-CETPFEVLSGCKSSLHHFRIWGCPAHVLVS--NPKKLEPRSKLCLFVGYPKETRGGLFYDPKKNRVCVSTNATFLEEDHVRDHLPRSKFVLNEMDSTS

Query:  AKVVNGASTSTSVVDPSTSSQIRSPELGMP
        + V      S+ V  P T+   R+P L  P
Subjt:  AKVVNGASTSTSVVDPSTSSQIRSPELGMP

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.7e-0532.31Show/hide
Query:  VCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPCVYKKIVDKTVAFLVLYVHDILLFGNE
        VC+L +++YGLKQA R+W +     + +  F  +V    ++     K++ ++++YV DIL+ GN+
Subjt:  VCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPCVYKKIVDKTVAFLVLYVHDILLFGNE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.9e-0723.53Show/hide
Query:  VCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPCVYKKIVDKTVAFLVLYVHDILLFGNE----NSKKGLLPFRHGVHLSKD-------QCPKTP
        VCRL ++IYGLKQA R+W +     + +  F  ++    ++     +++ ++++YV DIL+ GN+          L  R  V   +D       +  + P
Subjt:  VCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPCVYKKIVDKTVAFLVLYVHDILLFGNE----NSKKGLLPFRHGVHLSKD-------QCPKTP

Query:  QDVQNMRR---------------------------------------IPYASAIGSLMYAMLCTRPDICYAVGIVNRYQSNSGLDHW
        Q +   +R                                         Y   +GSL Y +  TRPD+ YAV  +++Y      DHW
Subjt:  QDVQNMRR---------------------------------------IPYASAIGSLMYAMLCTRPDICYAVGIVNRYQSNSGLDHW

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 84.4e-0635.82Show/hide
Query:  VCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPCVYKKIVDKTVAFLVLYVHDILLFGNENS
        VC L +SIYGLKQASR W ++F   +  + F Q+      + KI       +++YV DI++  N ++
Subjt:  VCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPCVYKKIVDKTVAFLVLYVHDILLFGNENS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTAGCGCAATAATAGCTCTACTCGCGTCCGATAAATTAGTGGGAGATAATTACCAAAGTTGGAAAAATAACATAAACACAATTTTTGTAACTGACGACATTAAGTT
CGTGCTCTCTGAGGAGTGTCCTCAGATGCCGAGCTCGACTCCATCGCGAAGTGTTCGCGTTGCATACGATCGATGGATCAGGGCCAATGAAAAGGCCAAGGTCTACATCA
TTGCCAGTATGTCTGATGTTTTGGCAAAGAAGCATGAGTTGATGGTCACTGCCAAGGAGATCATGGAGTCCTTGCAGGAAATGTTTGGACAACAGTCCTTTCAGGTCTGG
CATGACTCGCTCAAATACGTATTCAATGCACGGATGAAAGAGGGGTCGTTTGTCCGTGAACATGTTCTAGACATGATGACCCACTTTAATCTAGCTGAGATGAACGGAGC
TTCGATCGACGAGTCGAGCCAAGAAAGTTTTATTTTGGAGACTCTTCCGAAGAGTTTCCTTCAACTTCGTATTAATGTTGTTATGAACAAAATTAGCTACACTCTGACTA
CCCTCCTCAATGAGCTACAGAATTTCCAGTCCTTGAATAGGGTCAAGGCATCAGAATCTGAGGCAAATGTTGCCTACAGGTCTTATCACAGGGGTTCGACCTCTGGGACG
AAACCTGTTGCTCCTTCACGCCCGAAAGGGAAGAAGAGGATGAAGAGGGGTAAAACTGACCGAGGTGCCGCCCACAAGGGAAAGAAGGTCAAGGACGTTGCAGAGAAAGG
AAAGGGTTTCCACCGCAATGGGGCGGACATCGGAAGAGGAGGCATCCCAAATTCTGGCCGAGAGGAAGAATCAAGGGGATTGATTCCTGGCAGCCGCTGCGAGAGGTTCA
AGGAGTACAAGACTGAGGTTGAGAACCTCTTAGGTAAATCGCTTAAAACACTTCGATTGGATCGAGGTGGAGAGTACATGGACATAGAATTCCAGGACTATATGATAGAA
CATGAAATTACGCCCCAACTCTCAAAGACTGTGGAGACTGCGGTTTATATTTTGAACAATGTTCCGTCGAAGAGTGTTTGTGAAACACCTTTCGAAGTCTTGAGTGGATG
TAAAAGCAGTTTACATCACTTCAGAATTTGGGGATGCCCGGCCCACGTGTTGGTGTCAAACCCGAAAAAGTTGGAACCCCGTTCGAAATTGTGCCTATTCGTAGGTTACC
CTAAAGAGACTAGGGGTGGTCTATTTTATGATCCTAAGAAAAATAGAGTGTGTGTGTCAACAAACGCCACTTTCCTAGAGGAAGACCACGTCAGGGATCATTTACCAAGG
AGTAAATTTGTATTAAATGAAATGGACAGTACTTCAGCAAAAGTTGTTAATGGGGCTAGTACGTCAACAAGTGTTGTTGATCCTAGCACGTCTAGTCAAATCCGTTCCCC
AGAGTTGGGAATGCCTCGACGTAGTGGGAGGGTTGTGAGACAGCGTGATCGCTACATGGGTTTAGCTGAAACCTCAGTCATCGCTCCTGATGATGACTGTGAGGATCCAT
TGACCTATGATCAGGCAATGGTTGATGTTGGCAAAGACGAATGGATTAAAGCTATAAATGGAGTCTATGTACTTCAATTCTGTCTGGGAGCTTGTGGATCAACCGGATGG
GGCCAAGAGCAAAAGGTTTGTCGGCTTCATAGGTCCATTTATGGGTTGAAACAAGCTTCGAGGTCTTGGAATATAAGGTTTGATGAGGCGATCAAATCTTATGATTTTGA
TCAAAATGTCGACGGGCCTTGTGTCTACAAGAAAATCGTTGACAAAACTGTCGCATTTTTAGTGTTGTATGTGCACGATATTCTTCTCTTTGGGAATGAGAACTCCAAGA
AGGGTTTGTTGCCTTTCAGGCATGGGGTTCACTTGTCTAAGGATCAGTGTCCTAAGACTCCTCAAGATGTTCAGAATATGAGACGGATTCCATATGCTTCGGCTATAGGG
AGTCTGATGTATGCCATGTTGTGTACTAGGCCCGACATCTGTTATGCAGTAGGAATCGTCAATAGATATCAGTCCAATTCAGGATTAGATCACTGGACAACCTGGGGATT
TGATCTCACTTACGGGATACAGATTCGACTTTTCGGACCGATAAGGATTCTAGGAAATCCACTTGGGGTCGACCTTCATTACGAATGGAGGAGTAGAGTATGGCGAAGCA
TCAAGCAGGGATGCATTGCCGATTCCACTATGGAAGCGGAGTATGTTGTGGCTTGTGAAGCTGCAAAGGAAGTTGTTTGGCTTAGGAAGTTCATGACGGATTTGGAAGTT
GTTCCAAATATGATCTTGCCGATCACACTGTTCTGTGATAAGAGAGGCAAACTCACGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTAGCGCAATAATAGCTCTACTCGCGTCCGATAAATTAGTGGGAGATAATTACCAAAGTTGGAAAAATAACATAAACACAATTTTTGTAACTGACGACATTAAGTT
CGTGCTCTCTGAGGAGTGTCCTCAGATGCCGAGCTCGACTCCATCGCGAAGTGTTCGCGTTGCATACGATCGATGGATCAGGGCCAATGAAAAGGCCAAGGTCTACATCA
TTGCCAGTATGTCTGATGTTTTGGCAAAGAAGCATGAGTTGATGGTCACTGCCAAGGAGATCATGGAGTCCTTGCAGGAAATGTTTGGACAACAGTCCTTTCAGGTCTGG
CATGACTCGCTCAAATACGTATTCAATGCACGGATGAAAGAGGGGTCGTTTGTCCGTGAACATGTTCTAGACATGATGACCCACTTTAATCTAGCTGAGATGAACGGAGC
TTCGATCGACGAGTCGAGCCAAGAAAGTTTTATTTTGGAGACTCTTCCGAAGAGTTTCCTTCAACTTCGTATTAATGTTGTTATGAACAAAATTAGCTACACTCTGACTA
CCCTCCTCAATGAGCTACAGAATTTCCAGTCCTTGAATAGGGTCAAGGCATCAGAATCTGAGGCAAATGTTGCCTACAGGTCTTATCACAGGGGTTCGACCTCTGGGACG
AAACCTGTTGCTCCTTCACGCCCGAAAGGGAAGAAGAGGATGAAGAGGGGTAAAACTGACCGAGGTGCCGCCCACAAGGGAAAGAAGGTCAAGGACGTTGCAGAGAAAGG
AAAGGGTTTCCACCGCAATGGGGCGGACATCGGAAGAGGAGGCATCCCAAATTCTGGCCGAGAGGAAGAATCAAGGGGATTGATTCCTGGCAGCCGCTGCGAGAGGTTCA
AGGAGTACAAGACTGAGGTTGAGAACCTCTTAGGTAAATCGCTTAAAACACTTCGATTGGATCGAGGTGGAGAGTACATGGACATAGAATTCCAGGACTATATGATAGAA
CATGAAATTACGCCCCAACTCTCAAAGACTGTGGAGACTGCGGTTTATATTTTGAACAATGTTCCGTCGAAGAGTGTTTGTGAAACACCTTTCGAAGTCTTGAGTGGATG
TAAAAGCAGTTTACATCACTTCAGAATTTGGGGATGCCCGGCCCACGTGTTGGTGTCAAACCCGAAAAAGTTGGAACCCCGTTCGAAATTGTGCCTATTCGTAGGTTACC
CTAAAGAGACTAGGGGTGGTCTATTTTATGATCCTAAGAAAAATAGAGTGTGTGTGTCAACAAACGCCACTTTCCTAGAGGAAGACCACGTCAGGGATCATTTACCAAGG
AGTAAATTTGTATTAAATGAAATGGACAGTACTTCAGCAAAAGTTGTTAATGGGGCTAGTACGTCAACAAGTGTTGTTGATCCTAGCACGTCTAGTCAAATCCGTTCCCC
AGAGTTGGGAATGCCTCGACGTAGTGGGAGGGTTGTGAGACAGCGTGATCGCTACATGGGTTTAGCTGAAACCTCAGTCATCGCTCCTGATGATGACTGTGAGGATCCAT
TGACCTATGATCAGGCAATGGTTGATGTTGGCAAAGACGAATGGATTAAAGCTATAAATGGAGTCTATGTACTTCAATTCTGTCTGGGAGCTTGTGGATCAACCGGATGG
GGCCAAGAGCAAAAGGTTTGTCGGCTTCATAGGTCCATTTATGGGTTGAAACAAGCTTCGAGGTCTTGGAATATAAGGTTTGATGAGGCGATCAAATCTTATGATTTTGA
TCAAAATGTCGACGGGCCTTGTGTCTACAAGAAAATCGTTGACAAAACTGTCGCATTTTTAGTGTTGTATGTGCACGATATTCTTCTCTTTGGGAATGAGAACTCCAAGA
AGGGTTTGTTGCCTTTCAGGCATGGGGTTCACTTGTCTAAGGATCAGTGTCCTAAGACTCCTCAAGATGTTCAGAATATGAGACGGATTCCATATGCTTCGGCTATAGGG
AGTCTGATGTATGCCATGTTGTGTACTAGGCCCGACATCTGTTATGCAGTAGGAATCGTCAATAGATATCAGTCCAATTCAGGATTAGATCACTGGACAACCTGGGGATT
TGATCTCACTTACGGGATACAGATTCGACTTTTCGGACCGATAAGGATTCTAGGAAATCCACTTGGGGTCGACCTTCATTACGAATGGAGGAGTAGAGTATGGCGAAGCA
TCAAGCAGGGATGCATTGCCGATTCCACTATGGAAGCGGAGTATGTTGTGGCTTGTGAAGCTGCAAAGGAAGTTGTTTGGCTTAGGAAGTTCATGACGGATTTGGAAGTT
GTTCCAAATATGATCTTGCCGATCACACTGTTCTGTGATAAGAGAGGCAAACTCACGTGA
Protein sequenceShow/hide protein sequence
MSSAIIALLASDKLVGDNYQSWKNNINTIFVTDDIKFVLSEECPQMPSSTPSRSVRVAYDRWIRANEKAKVYIIASMSDVLAKKHELMVTAKEIMESLQEMFGQQSFQVW
HDSLKYVFNARMKEGSFVREHVLDMMTHFNLAEMNGASIDESSQESFILETLPKSFLQLRINVVMNKISYTLTTLLNELQNFQSLNRVKASESEANVAYRSYHRGSTSGT
KPVAPSRPKGKKRMKRGKTDRGAAHKGKKVKDVAEKGKGFHRNGADIGRGGIPNSGREEESRGLIPGSRCERFKEYKTEVENLLGKSLKTLRLDRGGEYMDIEFQDYMIE
HEITPQLSKTVETAVYILNNVPSKSVCETPFEVLSGCKSSLHHFRIWGCPAHVLVSNPKKLEPRSKLCLFVGYPKETRGGLFYDPKKNRVCVSTNATFLEEDHVRDHLPR
SKFVLNEMDSTSAKVVNGASTSTSVVDPSTSSQIRSPELGMPRRSGRVVRQRDRYMGLAETSVIAPDDDCEDPLTYDQAMVDVGKDEWIKAINGVYVLQFCLGACGSTGW
GQEQKVCRLHRSIYGLKQASRSWNIRFDEAIKSYDFDQNVDGPCVYKKIVDKTVAFLVLYVHDILLFGNENSKKGLLPFRHGVHLSKDQCPKTPQDVQNMRRIPYASAIG
SLMYAMLCTRPDICYAVGIVNRYQSNSGLDHWTTWGFDLTYGIQIRLFGPIRILGNPLGVDLHYEWRSRVWRSIKQGCIADSTMEAEYVVACEAAKEVVWLRKFMTDLEV
VPNMILPITLFCDKRGKLT