; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g15710 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g15710
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr4:11766324..11779549
RNA-Seq ExpressionMoc04g15710
SyntenyMoc04g15710
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-11036.82Show/hide
Query:  VDDLRFVLQEDCPQAPAPNATVAMRNAYDRWIKADYACLHAINAQSKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSRQSMFGQPSSQARHEALKFI
        +DDLRFVL E+CPQ PA NAT  +R  Y+RW              +KAN+KA+ YILAS+S+VLAKKHE  +T +EIMDS Q MFGQ S Q +H+ALK+I
Subjt:  VDDLRFVLQEDCPQAPAPNATVAMRNAYDRWIKADYACLHAINAQSKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSRQSMFGQPSSQARHEALKFI

Query:  YNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLPKSFLPFGSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSNR-FNRG
        YN+RM EG+SVREHVLN+MVHFNVAE NGAVIDE SQVSFILESLP+SFL F SNAVMNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS R F+RG
Subjt:  YNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLPKSFLPFGSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSNR-FNRG

Query:  SSSGTRSAPSSFGSKTFKKKKAAGKGSKPDSAAA-------AQKG--------------------KVKKANEGKYDLLVLETYLVENDDSAWILDSGATN
        S+SGT+S PSS G+K +KKKK  G+G+K + AAA       A KG                    + KKA +GKYDLLVLET LVENDDSAWI+DSGATN
Subjt:  SSSGTRSAPSSFGSKTFKKKKAAGKGSKPDSAAA-------AQKG--------------------KVKKANEGKYDLLVLETYLVENDDSAWILDSGATN

Query:  HVRSSFQGISSWRQRDAGEMTLKVGTREVVSAVAM-------------------------------------YSISFHLNEAFISRNGVNICSALLENNL
        HV SSFQGISSWRQ + GEMT++VGT  VVSA+A+                                     YS++F++N+ FI +NGV ICSA LENNL
Subjt:  HVRSSFQGISSWRQRDAGEMTLKVGTREVVSAVAM-------------------------------------YSISFHLNEAFISRNGVNICSALLENNL

Query:  YVLR------------------------------------------------------------------------------------------------
        YVLR                                                                                                
Subjt:  YVLR------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------PTIK------------------------------------------
                                                               TIK                                          
Subjt:  ------------------------------------------------------PTIK------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------EGHVRDHKPRSKVVFNEISEEATNTSTRVVDQTSTTTRVVDEAITSCQLYPPQVLKVPRRSGRIVSQPDCYVGLTETQVVIPDD
                        E H+R+HKPRSK+V NE+S+E T  STRVV++ S  TRVV    +S + + PQ L+ PRRSGR+ + P  Y+ LTET  VI D 
Subjt:  ----------------EGHVRDHKPRSKVVFNEISEEATNTSTRVVDQTSTTTRVVDEAITSCQLYPPQVLKVPRRSGRIVSQPDCYVGLTETQVVIPDD

Query:  GVEDPLTYKKAMEDTDKDKWVNAMDLEMESM
         +EDPLT+KKAMED DKD+W+ AM+LE+ESM
Subjt:  GVEDPLTYKKAMEDTDKDKWVNAMDLEMESM

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-11036.82Show/hide
Query:  VDDLRFVLQEDCPQAPAPNATVAMRNAYDRWIKADYACLHAINAQSKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSRQSMFGQPSSQARHEALKFI
        +DDLRFVL E+CPQ PA NAT  +R  Y+RW              +KAN+KA+ YILAS+S+VLAKKHE  +T +EIMDS Q MFGQ S Q +H+ALK+I
Subjt:  VDDLRFVLQEDCPQAPAPNATVAMRNAYDRWIKADYACLHAINAQSKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSRQSMFGQPSSQARHEALKFI

Query:  YNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLPKSFLPFGSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSNR-FNRG
        YN+RM EG+SVREHVLN+MVHFNVAE NGAVIDE SQVSFILESLP+SFL F SNAVMNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS R F+RG
Subjt:  YNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLPKSFLPFGSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSNR-FNRG

Query:  SSSGTRSAPSSFGSKTFKKKKAAGKGSKPDSAAA-------AQKG--------------------KVKKANEGKYDLLVLETYLVENDDSAWILDSGATN
        S+SGT+S PSS G+K +KKKK  G+G+K + AAA       A KG                    + KKA +GKYDLLVLET LVENDDSAWI+DSGATN
Subjt:  SSSGTRSAPSSFGSKTFKKKKAAGKGSKPDSAAA-------AQKG--------------------KVKKANEGKYDLLVLETYLVENDDSAWILDSGATN

Query:  HVRSSFQGISSWRQRDAGEMTLKVGTREVVSAVAM-------------------------------------YSISFHLNEAFISRNGVNICSALLENNL
        HV SSFQGISSWRQ + GEMT++VGT  VVSA+A+                                     YS++F++N+ FI +NGV ICSA LENNL
Subjt:  HVRSSFQGISSWRQRDAGEMTLKVGTREVVSAVAM-------------------------------------YSISFHLNEAFISRNGVNICSALLENNL

Query:  YVLR------------------------------------------------------------------------------------------------
        YVLR                                                                                                
Subjt:  YVLR------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------PTIK------------------------------------------
                                                               TIK                                          
Subjt:  ------------------------------------------------------PTIK------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------EGHVRDHKPRSKVVFNEISEEATNTSTRVVDQTSTTTRVVDEAITSCQLYPPQVLKVPRRSGRIVSQPDCYVGLTETQVVIPDD
                        E H+R+HKPRSK+V NE+S+E T  STRVV++ S  TRVV    +S + + PQ L+ PRRSGR+ + P  Y+ LTET  VI D 
Subjt:  ----------------EGHVRDHKPRSKVVFNEISEEATNTSTRVVDQTSTTTRVVDEAITSCQLYPPQVLKVPRRSGRIVSQPDCYVGLTETQVVIPDD

Query:  GVEDPLTYKKAMEDTDKDKWVNAMDLEMESM
         +EDPLT+KKAMED DKD+W+ AM+LE+ESM
Subjt:  GVEDPLTYKKAMEDTDKDKWVNAMDLEMESM

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-11036.82Show/hide
Query:  VDDLRFVLQEDCPQAPAPNATVAMRNAYDRWIKADYACLHAINAQSKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSRQSMFGQPSSQARHEALKFI
        +DDLRFVL E+CPQ PA NAT  +R  Y+RW              +KAN+KA+ YILAS+S+VLAKKHE  +T +EIMDS Q MFGQ S Q +H+ALK+I
Subjt:  VDDLRFVLQEDCPQAPAPNATVAMRNAYDRWIKADYACLHAINAQSKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSRQSMFGQPSSQARHEALKFI

Query:  YNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLPKSFLPFGSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSNR-FNRG
        YN+RM EG+SVREHVLN+MVHFNVAE NGAVIDE SQVSFILESLP+SFL F SNAVMNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS R F+RG
Subjt:  YNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLPKSFLPFGSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSNR-FNRG

Query:  SSSGTRSAPSSFGSKTFKKKKAAGKGSKPDSAAA-------AQKG--------------------KVKKANEGKYDLLVLETYLVENDDSAWILDSGATN
        S+SGT+S PSS G+K +KKKK  G+G+K + AAA       A KG                    + KKA +GKYDLLVLET LVENDDSAWI+DSGATN
Subjt:  SSSGTRSAPSSFGSKTFKKKKAAGKGSKPDSAAA-------AQKG--------------------KVKKANEGKYDLLVLETYLVENDDSAWILDSGATN

Query:  HVRSSFQGISSWRQRDAGEMTLKVGTREVVSAVAM-------------------------------------YSISFHLNEAFISRNGVNICSALLENNL
        HV SSFQGISSWRQ + GEMT++VGT  VVSA+A+                                     YS++F++N+ FI +NGV ICSA LENNL
Subjt:  HVRSSFQGISSWRQRDAGEMTLKVGTREVVSAVAM-------------------------------------YSISFHLNEAFISRNGVNICSALLENNL

Query:  YVLR------------------------------------------------------------------------------------------------
        YVLR                                                                                                
Subjt:  YVLR------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------PTIK------------------------------------------
                                                               TIK                                          
Subjt:  ------------------------------------------------------PTIK------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------EGHVRDHKPRSKVVFNEISEEATNTSTRVVDQTSTTTRVVDEAITSCQLYPPQVLKVPRRSGRIVSQPDCYVGLTETQVVIPDD
                        E H+R+HKPRSK+V NE+S+E T  STRVV++ S  TRVV    +S + + PQ L+ PRRSGR+ + P  Y+ LTET  VI D 
Subjt:  ----------------EGHVRDHKPRSKVVFNEISEEATNTSTRVVDQTSTTTRVVDEAITSCQLYPPQVLKVPRRSGRIVSQPDCYVGLTETQVVIPDD

Query:  GVEDPLTYKKAMEDTDKDKWVNAMDLEMESM
         +EDPLT+KKAMED DKD+W+ AM+LE+ESM
Subjt:  GVEDPLTYKKAMEDTDKDKWVNAMDLEMESM

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-11036.82Show/hide
Query:  VDDLRFVLQEDCPQAPAPNATVAMRNAYDRWIKADYACLHAINAQSKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSRQSMFGQPSSQARHEALKFI
        +DDLRFVL E+CPQ PA NAT  +R  Y+RW              +KAN+KA+ YILAS+S+VLAKKHE  +T +EIMDS Q MFGQ S Q +H+ALK+I
Subjt:  VDDLRFVLQEDCPQAPAPNATVAMRNAYDRWIKADYACLHAINAQSKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSRQSMFGQPSSQARHEALKFI

Query:  YNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLPKSFLPFGSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSNR-FNRG
        YN+RM EG+SVREHVLN+MVHFNVAE NGAVIDE SQVSFILESLP+SFL F SNAVMNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS R F+RG
Subjt:  YNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLPKSFLPFGSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSNR-FNRG

Query:  SSSGTRSAPSSFGSKTFKKKKAAGKGSKPDSAAA-------AQKG--------------------KVKKANEGKYDLLVLETYLVENDDSAWILDSGATN
        S+SGT+S PSS G+K +KKKK  G+G+K + AAA       A KG                    + KKA +GKYDLLVLET LVENDDSAWI+DSGATN
Subjt:  SSSGTRSAPSSFGSKTFKKKKAAGKGSKPDSAAA-------AQKG--------------------KVKKANEGKYDLLVLETYLVENDDSAWILDSGATN

Query:  HVRSSFQGISSWRQRDAGEMTLKVGTREVVSAVAM-------------------------------------YSISFHLNEAFISRNGVNICSALLENNL
        HV SSFQGISSWRQ + GEMT++VGT  VVSA+A+                                     YS++F++N+ FI +NGV ICSA LENNL
Subjt:  HVRSSFQGISSWRQRDAGEMTLKVGTREVVSAVAM-------------------------------------YSISFHLNEAFISRNGVNICSALLENNL

Query:  YVLR------------------------------------------------------------------------------------------------
        YVLR                                                                                                
Subjt:  YVLR------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------PTIK------------------------------------------
                                                               TIK                                          
Subjt:  ------------------------------------------------------PTIK------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------EGHVRDHKPRSKVVFNEISEEATNTSTRVVDQTSTTTRVVDEAITSCQLYPPQVLKVPRRSGRIVSQPDCYVGLTETQVVIPDD
                        E H+R+HKPRSK+V NE+S+E T  STRVV++ S  TRVV    +S + + PQ L+ PRRSGR+ + P  Y+ LTET  VI D 
Subjt:  ----------------EGHVRDHKPRSKVVFNEISEEATNTSTRVVDQTSTTTRVVDEAITSCQLYPPQVLKVPRRSGRIVSQPDCYVGLTETQVVIPDD

Query:  GVEDPLTYKKAMEDTDKDKWVNAMDLEMESM
         +EDPLT+KKAMED DKD+W+ AM+LE+ESM
Subjt:  GVEDPLTYKKAMEDTDKDKWVNAMDLEMESM

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-11036.82Show/hide
Query:  VDDLRFVLQEDCPQAPAPNATVAMRNAYDRWIKADYACLHAINAQSKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSRQSMFGQPSSQARHEALKFI
        +DDLRFVL E+CPQ PA NAT  +R  Y+RW              +KAN+KA+ YILAS+S+VLAKKHE  +T +EIMDS Q MFGQ S Q +H+ALK+I
Subjt:  VDDLRFVLQEDCPQAPAPNATVAMRNAYDRWIKADYACLHAINAQSKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSRQSMFGQPSSQARHEALKFI

Query:  YNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLPKSFLPFGSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSNR-FNRG
        YN+RM EG+SVREHVLN+MVHFNVAE NGAVIDE SQVSFILESLP+SFL F SNAVMNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS R F+RG
Subjt:  YNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLPKSFLPFGSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSNR-FNRG

Query:  SSSGTRSAPSSFGSKTFKKKKAAGKGSKPDSAAA-------AQKG--------------------KVKKANEGKYDLLVLETYLVENDDSAWILDSGATN
        S+SGT+S PSS G+K +KKKK  G+G+K + AAA       A KG                    + KKA +GKYDLLVLET LVENDDSAWI+DSGATN
Subjt:  SSSGTRSAPSSFGSKTFKKKKAAGKGSKPDSAAA-------AQKG--------------------KVKKANEGKYDLLVLETYLVENDDSAWILDSGATN

Query:  HVRSSFQGISSWRQRDAGEMTLKVGTREVVSAVAM-------------------------------------YSISFHLNEAFISRNGVNICSALLENNL
        HV SSFQGISSWRQ + GEMT++VGT  VVSA+A+                                     YS++F++N+ FI +NGV ICSA LENNL
Subjt:  HVRSSFQGISSWRQRDAGEMTLKVGTREVVSAVAM-------------------------------------YSISFHLNEAFISRNGVNICSALLENNL

Query:  YVLR------------------------------------------------------------------------------------------------
        YVLR                                                                                                
Subjt:  YVLR------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------PTIK------------------------------------------
                                                               TIK                                          
Subjt:  ------------------------------------------------------PTIK------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------EGHVRDHKPRSKVVFNEISEEATNTSTRVVDQTSTTTRVVDEAITSCQLYPPQVLKVPRRSGRIVSQPDCYVGLTETQVVIPDD
                        E H+R+HKPRSK+V NE+S+E T  STRVV++ S  TRVV    +S + + PQ L+ PRRSGR+ + P  Y+ LTET  VI D 
Subjt:  ----------------EGHVRDHKPRSKVVFNEISEEATNTSTRVVDQTSTTTRVVDEAITSCQLYPPQVLKVPRRSGRIVSQPDCYVGLTETQVVIPDD

Query:  GVEDPLTYKKAMEDTDKDKWVNAMDLEMESM
         +EDPLT+KKAMED DKD+W+ AM+LE+ESM
Subjt:  GVEDPLTYKKAMEDTDKDKWVNAMDLEMESM

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein6.9e-11136.82Show/hide
Query:  VDDLRFVLQEDCPQAPAPNATVAMRNAYDRWIKADYACLHAINAQSKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSRQSMFGQPSSQARHEALKFI
        +DDLRFVL E+CPQ PA NAT  +R  Y+RW              +KAN+KA+ YILAS+S+VLAKKHE  +T +EIMDS Q MFGQ S Q +H+ALK+I
Subjt:  VDDLRFVLQEDCPQAPAPNATVAMRNAYDRWIKADYACLHAINAQSKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSRQSMFGQPSSQARHEALKFI

Query:  YNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLPKSFLPFGSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSNR-FNRG
        YN+RM EG+SVREHVLN+MVHFNVAE NGAVIDE SQVSFILESLP+SFL F SNAVMNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS R F+RG
Subjt:  YNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLPKSFLPFGSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSNR-FNRG

Query:  SSSGTRSAPSSFGSKTFKKKKAAGKGSKPDSAAA-------AQKG--------------------KVKKANEGKYDLLVLETYLVENDDSAWILDSGATN
        S+SGT+S PSS G+K +KKKK  G+G+K + AAA       A KG                    + KKA +GKYDLLVLET LVENDDSAWI+DSGATN
Subjt:  SSSGTRSAPSSFGSKTFKKKKAAGKGSKPDSAAA-------AQKG--------------------KVKKANEGKYDLLVLETYLVENDDSAWILDSGATN

Query:  HVRSSFQGISSWRQRDAGEMTLKVGTREVVSAVAM-------------------------------------YSISFHLNEAFISRNGVNICSALLENNL
        HV SSFQGISSWRQ + GEMT++VGT  VVSA+A+                                     YS++F++N+ FI +NGV ICSA LENNL
Subjt:  HVRSSFQGISSWRQRDAGEMTLKVGTREVVSAVAM-------------------------------------YSISFHLNEAFISRNGVNICSALLENNL

Query:  YVLR------------------------------------------------------------------------------------------------
        YVLR                                                                                                
Subjt:  YVLR------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------PTIK------------------------------------------
                                                               TIK                                          
Subjt:  ------------------------------------------------------PTIK------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------EGHVRDHKPRSKVVFNEISEEATNTSTRVVDQTSTTTRVVDEAITSCQLYPPQVLKVPRRSGRIVSQPDCYVGLTETQVVIPDD
                        E H+R+HKPRSK+V NE+S+E T  STRVV++ S  TRVV    +S + + PQ L+ PRRSGR+ + P  Y+ LTET  VI D 
Subjt:  ----------------EGHVRDHKPRSKVVFNEISEEATNTSTRVVDQTSTTTRVVDEAITSCQLYPPQVLKVPRRSGRIVSQPDCYVGLTETQVVIPDD

Query:  GVEDPLTYKKAMEDTDKDKWVNAMDLEMESM
         +EDPLT+KKAMED DKD+W+ AM+LE+ESM
Subjt:  GVEDPLTYKKAMEDTDKDKWVNAMDLEMESM

A0A5A7TWB9 Gag/pol protein6.9e-11136.82Show/hide
Query:  VDDLRFVLQEDCPQAPAPNATVAMRNAYDRWIKADYACLHAINAQSKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSRQSMFGQPSSQARHEALKFI
        +DDLRFVL E+CPQ PA NAT  +R  Y+RW              +KAN+KA+ YILAS+S+VLAKKHE  +T +EIMDS Q MFGQ S Q +H+ALK+I
Subjt:  VDDLRFVLQEDCPQAPAPNATVAMRNAYDRWIKADYACLHAINAQSKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSRQSMFGQPSSQARHEALKFI

Query:  YNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLPKSFLPFGSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSNR-FNRG
        YN+RM EG+SVREHVLN+MVHFNVAE NGAVIDE SQVSFILESLP+SFL F SNAVMNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS R F+RG
Subjt:  YNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLPKSFLPFGSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSNR-FNRG

Query:  SSSGTRSAPSSFGSKTFKKKKAAGKGSKPDSAAA-------AQKG--------------------KVKKANEGKYDLLVLETYLVENDDSAWILDSGATN
        S+SGT+S PSS G+K +KKKK  G+G+K + AAA       A KG                    + KKA +GKYDLLVLET LVENDDSAWI+DSGATN
Subjt:  SSSGTRSAPSSFGSKTFKKKKAAGKGSKPDSAAA-------AQKG--------------------KVKKANEGKYDLLVLETYLVENDDSAWILDSGATN

Query:  HVRSSFQGISSWRQRDAGEMTLKVGTREVVSAVAM-------------------------------------YSISFHLNEAFISRNGVNICSALLENNL
        HV SSFQGISSWRQ + GEMT++VGT  VVSA+A+                                     YS++F++N+ FI +NGV ICSA LENNL
Subjt:  HVRSSFQGISSWRQRDAGEMTLKVGTREVVSAVAM-------------------------------------YSISFHLNEAFISRNGVNICSALLENNL

Query:  YVLR------------------------------------------------------------------------------------------------
        YVLR                                                                                                
Subjt:  YVLR------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------PTIK------------------------------------------
                                                               TIK                                          
Subjt:  ------------------------------------------------------PTIK------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------EGHVRDHKPRSKVVFNEISEEATNTSTRVVDQTSTTTRVVDEAITSCQLYPPQVLKVPRRSGRIVSQPDCYVGLTETQVVIPDD
                        E H+R+HKPRSK+V NE+S+E T  STRVV++ S  TRVV    +S + + PQ L+ PRRSGR+ + P  Y+ LTET  VI D 
Subjt:  ----------------EGHVRDHKPRSKVVFNEISEEATNTSTRVVDQTSTTTRVVDEAITSCQLYPPQVLKVPRRSGRIVSQPDCYVGLTETQVVIPDD

Query:  GVEDPLTYKKAMEDTDKDKWVNAMDLEMESM
         +EDPLT+KKAMED DKD+W+ AM+LE+ESM
Subjt:  GVEDPLTYKKAMEDTDKDKWVNAMDLEMESM

A0A5A7TZD7 Gag/pol protein6.9e-11136.82Show/hide
Query:  VDDLRFVLQEDCPQAPAPNATVAMRNAYDRWIKADYACLHAINAQSKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSRQSMFGQPSSQARHEALKFI
        +DDLRFVL E+CPQ PA NAT  +R  Y+RW              +KAN+KA+ YILAS+S+VLAKKHE  +T +EIMDS Q MFGQ S Q +H+ALK+I
Subjt:  VDDLRFVLQEDCPQAPAPNATVAMRNAYDRWIKADYACLHAINAQSKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSRQSMFGQPSSQARHEALKFI

Query:  YNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLPKSFLPFGSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSNR-FNRG
        YN+RM EG+SVREHVLN+MVHFNVAE NGAVIDE SQVSFILESLP+SFL F SNAVMNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS R F+RG
Subjt:  YNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLPKSFLPFGSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSNR-FNRG

Query:  SSSGTRSAPSSFGSKTFKKKKAAGKGSKPDSAAA-------AQKG--------------------KVKKANEGKYDLLVLETYLVENDDSAWILDSGATN
        S+SGT+S PSS G+K +KKKK  G+G+K + AAA       A KG                    + KKA +GKYDLLVLET LVENDDSAWI+DSGATN
Subjt:  SSSGTRSAPSSFGSKTFKKKKAAGKGSKPDSAAA-------AQKG--------------------KVKKANEGKYDLLVLETYLVENDDSAWILDSGATN

Query:  HVRSSFQGISSWRQRDAGEMTLKVGTREVVSAVAM-------------------------------------YSISFHLNEAFISRNGVNICSALLENNL
        HV SSFQGISSWRQ + GEMT++VGT  VVSA+A+                                     YS++F++N+ FI +NGV ICSA LENNL
Subjt:  HVRSSFQGISSWRQRDAGEMTLKVGTREVVSAVAM-------------------------------------YSISFHLNEAFISRNGVNICSALLENNL

Query:  YVLR------------------------------------------------------------------------------------------------
        YVLR                                                                                                
Subjt:  YVLR------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------PTIK------------------------------------------
                                                               TIK                                          
Subjt:  ------------------------------------------------------PTIK------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------EGHVRDHKPRSKVVFNEISEEATNTSTRVVDQTSTTTRVVDEAITSCQLYPPQVLKVPRRSGRIVSQPDCYVGLTETQVVIPDD
                        E H+R+HKPRSK+V NE+S+E T  STRVV++ S  TRVV    +S + + PQ L+ PRRSGR+ + P  Y+ LTET  VI D 
Subjt:  ----------------EGHVRDHKPRSKVVFNEISEEATNTSTRVVDQTSTTTRVVDEAITSCQLYPPQVLKVPRRSGRIVSQPDCYVGLTETQVVIPDD

Query:  GVEDPLTYKKAMEDTDKDKWVNAMDLEMESM
         +EDPLT+KKAMED DKD+W+ AM+LE+ESM
Subjt:  GVEDPLTYKKAMEDTDKDKWVNAMDLEMESM

A0A5D3CPJ6 Gag/pol protein6.9e-11136.82Show/hide
Query:  VDDLRFVLQEDCPQAPAPNATVAMRNAYDRWIKADYACLHAINAQSKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSRQSMFGQPSSQARHEALKFI
        +DDLRFVL E+CPQ PA NAT  +R  Y+RW              +KAN+KA+ YILAS+S+VLAKKHE  +T +EIMDS Q MFGQ S Q +H+ALK+I
Subjt:  VDDLRFVLQEDCPQAPAPNATVAMRNAYDRWIKADYACLHAINAQSKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSRQSMFGQPSSQARHEALKFI

Query:  YNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLPKSFLPFGSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSNR-FNRG
        YN+RM EG+SVREHVLN+MVHFNVAE NGAVIDE SQVSFILESLP+SFL F SNAVMNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS R F+RG
Subjt:  YNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLPKSFLPFGSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSNR-FNRG

Query:  SSSGTRSAPSSFGSKTFKKKKAAGKGSKPDSAAA-------AQKG--------------------KVKKANEGKYDLLVLETYLVENDDSAWILDSGATN
        S+SGT+S PSS G+K +KKKK  G+G+K + AAA       A KG                    + KKA +GKYDLLVLET LVENDDSAWI+DSGATN
Subjt:  SSSGTRSAPSSFGSKTFKKKKAAGKGSKPDSAAA-------AQKG--------------------KVKKANEGKYDLLVLETYLVENDDSAWILDSGATN

Query:  HVRSSFQGISSWRQRDAGEMTLKVGTREVVSAVAM-------------------------------------YSISFHLNEAFISRNGVNICSALLENNL
        HV SSFQGISSWRQ + GEMT++VGT  VVSA+A+                                     YS++F++N+ FI +NGV ICSA LENNL
Subjt:  HVRSSFQGISSWRQRDAGEMTLKVGTREVVSAVAM-------------------------------------YSISFHLNEAFISRNGVNICSALLENNL

Query:  YVLR------------------------------------------------------------------------------------------------
        YVLR                                                                                                
Subjt:  YVLR------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------PTIK------------------------------------------
                                                               TIK                                          
Subjt:  ------------------------------------------------------PTIK------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------EGHVRDHKPRSKVVFNEISEEATNTSTRVVDQTSTTTRVVDEAITSCQLYPPQVLKVPRRSGRIVSQPDCYVGLTETQVVIPDD
                        E H+R+HKPRSK+V NE+S+E T  STRVV++ S  TRVV    +S + + PQ L+ PRRSGR+ + P  Y+ LTET  VI D 
Subjt:  ----------------EGHVRDHKPRSKVVFNEISEEATNTSTRVVDQTSTTTRVVDEAITSCQLYPPQVLKVPRRSGRIVSQPDCYVGLTETQVVIPDD

Query:  GVEDPLTYKKAMEDTDKDKWVNAMDLEMESM
         +EDPLT+KKAMED DKD+W+ AM+LE+ESM
Subjt:  GVEDPLTYKKAMEDTDKDKWVNAMDLEMESM

A0A5D3CSZ6 Gag/pol protein6.9e-11136.82Show/hide
Query:  VDDLRFVLQEDCPQAPAPNATVAMRNAYDRWIKADYACLHAINAQSKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSRQSMFGQPSSQARHEALKFI
        +DDLRFVL E+CPQ PA NAT  +R  Y+RW              +KAN+KA+ YILAS+S+VLAKKHE  +T +EIMDS Q MFGQ S Q +H+ALK+I
Subjt:  VDDLRFVLQEDCPQAPAPNATVAMRNAYDRWIKADYACLHAINAQSKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSRQSMFGQPSSQARHEALKFI

Query:  YNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLPKSFLPFGSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSNR-FNRG
        YN+RM EG+SVREHVLN+MVHFNVAE NGAVIDE SQVSFILESLP+SFL F SNAVMNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS R F+RG
Subjt:  YNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLPKSFLPFGSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSNR-FNRG

Query:  SSSGTRSAPSSFGSKTFKKKKAAGKGSKPDSAAA-------AQKG--------------------KVKKANEGKYDLLVLETYLVENDDSAWILDSGATN
        S+SGT+S PSS G+K +KKKK  G+G+K + AAA       A KG                    + KKA +GKYDLLVLET LVENDDSAWI+DSGATN
Subjt:  SSSGTRSAPSSFGSKTFKKKKAAGKGSKPDSAAA-------AQKG--------------------KVKKANEGKYDLLVLETYLVENDDSAWILDSGATN

Query:  HVRSSFQGISSWRQRDAGEMTLKVGTREVVSAVAM-------------------------------------YSISFHLNEAFISRNGVNICSALLENNL
        HV SSFQGISSWRQ + GEMT++VGT  VVSA+A+                                     YS++F++N+ FI +NGV ICSA LENNL
Subjt:  HVRSSFQGISSWRQRDAGEMTLKVGTREVVSAVAM-------------------------------------YSISFHLNEAFISRNGVNICSALLENNL

Query:  YVLR------------------------------------------------------------------------------------------------
        YVLR                                                                                                
Subjt:  YVLR------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------PTIK------------------------------------------
                                                               TIK                                          
Subjt:  ------------------------------------------------------PTIK------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------EGHVRDHKPRSKVVFNEISEEATNTSTRVVDQTSTTTRVVDEAITSCQLYPPQVLKVPRRSGRIVSQPDCYVGLTETQVVIPDD
                        E H+R+HKPRSK+V NE+S+E T  STRVV++ S  TRVV    +S + + PQ L+ PRRSGR+ + P  Y+ LTET  VI D 
Subjt:  ----------------EGHVRDHKPRSKVVFNEISEEATNTSTRVVDQTSTTTRVVDEAITSCQLYPPQVLKVPRRSGRIVSQPDCYVGLTETQVVIPDD

Query:  GVEDPLTYKKAMEDTDKDKWVNAMDLEMESM
         +EDPLT+KKAMED DKD+W+ AM+LE+ESM
Subjt:  GVEDPLTYKKAMEDTDKDKWVNAMDLEMESM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGGTTGCGAGATCTGTTCATGGGTCACTGCCGGTCGCCAATATCACCCTCAGCCACATTTTAAGTGGTTTCGCTGTCAGGCGTTGTCCTAAGTGGTCTCGCCGGAC
GCGCTGCATGTGGCTGAAGGTCGTCGCATGTGGATCATGGTGGGGTGCGGTCGCTGCCGGGTTGTTGCTTGTTGCTGGAGAAGACAGGTGGTTTGAGCTACACAACGTTT
CAGTTGGGTTGTTGTATCCTGGAGGGGACAGACCAGAGAATACCTGCTGCACCGGGAGATTCAACATCAATTTAAGAAGAATTGATATGGAGTCTGCTGTTGAGAAACCT
AATGAGAATGTTGGAGATCACAAGGTGGCCTATGTCCTCACAGAAAATAACCGAAAGAAAATCTATATTGAAAGCATGAACGTTGAAGAGTTCATGGAACACCAGGAGAA
AATCAACAAGTATGAGAAGAGTAAATATGATACTGAAGAGGCAGGGGCGAAAAAATATGCTGCAAGTCGTTTCTTTCGCTTTCATATGGTGGATGAAAAATCTGTAGTGG
AACAAGAGCAAGACTTTCAGGTGTGTTCGGGGATAAAATATCTGATTGCCACATGGAGGACGGGGATTGGCTGTCAGAAGAAATTGTTTGAAAGAATTGTTCAGCAAAGC
TCCTCCTCGGAACCTTCCGAGGAGGGAAGTTCATTACACCTAATTGTACTAGAACCCCGAAAGTATCCGAGTTCGATCTACTCGAACCCGACACGTCTGACCTCAAAACT
GACCAAGGTCTTCCCATTGCCCTTTCAAGTCGAAGCTAGAACCGGGTTCGAGCTTAGTATCCGTTCGAACTCTGCCAACACGACAGAACGAAAAGGTCTGAACGGTAACA
ACGGCACTCAACGCGAGCTCGATGCCAAAACAGTCGAGGACCAGGCCTCCCGAATGGCTACCCCAGTTGAGGACCCCGAAGTTCTGGCTACCCTCCAACTGGAGTTGGAT
GACATGCGCAATCAAGTGAGCAATATGGATGAAATGTACATCGAGATGATGCGGGCCAATCGGACGGGATCTCCTTGCAAAGTCCCAGGCGGAGATGACATGGTTGATGA
TGAGAATGGACATGATCTACCCCTCTATCATGACGATCACGAACCCGTCGCTAATAACGAAGGGATTGACTATAGCCTGAGGGGTTATGATCTCAGGAAGCACCTCGTTG
ACAAGAAGAGGAGAGCCTCACGAGGAACGGGGGTCTCTCCGTCCTGCTCCCGAGGATACTCTAACTCCGACCTCAAGGCTCAGTCGAAGTACACGCCTTTAACACCAGAG
GCTGTGGTAAAGAGAAAAGAGTTCGATCTAATGAAAAACAAGTTTGACGAACAAGTTGAGGCACTTAGGGCGAAGTGCGAAAAGAAAGAAAGTCCTTTTGATAATGGCGA
GATAGTCGAATCGTCGTTCACCTCGGACATCCTAGAGGCGCCCATTCCTCTAAAGTTCAAGACGTCCGCCATGAAGCCCTACGACAGGTCCAAGGACCCAAAGGATTATG
TGGAAGTCTTTCAGGGTCTCATGGACTTCCAGCCATTAAGTGTCGAGCCTTTCAAATCGCTCTCACGGGAAACGCTCTACTCTGGTACAGGAGGCTTCCAGGGAGTAGAT
GATCTTAGGTTCGTCTTGCAAGAGGATTGTCCTCAAGCTCCTGCGCCTAACGCCACTGTGGCGATGCGCAATGCCTATGACAGGTGGATCAAGGCTGATTACGCTTGTTT
ACATGCCATAAACGCCCAGTCAAAGGCCAATGACAAGGCCAAGGTCTACATCTTGGCGAGCATATCTGATGTGCTTGCCAAGAAGCACGAGGACACAGTCACCGTTAAGG
AGATCATGGACTCACGGCAGAGCATGTTTGGACAACCGTCCTCACAGGCTAGACATGAAGCCCTTAAGTTCATTTACAACTCCCGCATGAAGGAGGGTTCATCAGTGCGA
GAACACGTTCTCAACCTGATGGTCCACTTCAACGTGGCTGAGTCGAACGGGGCCGTCATAGACGAGCAGAGTCAGGTCAGCTTTATTCTGGAATCTCTTCCGAAGAGTTT
CCTTCCATTCGGCAGCAATGCGGTTATGAATAAGCTGGAGTACACTCTTACCACACTCCTAAACGAGCTTCAGACCTACCAGTCTCTTATGAAATGTAAGGGACAAGAAG
GGGAGGCAAATGTTGCCACCTCAAATAGGTTCAACAGAGGATCGTCCTCTGGAACCAGGTCTGCGCCCTCTTCTTTTGGAAGTAAGACTTTTAAGAAGAAGAAGGCTGCT
GGTAAGGGGTCTAAACCTGACTCAGCTGCTGCTGCCCAGAAAGGCAAGGTCAAGAAAGCCAACGAAGGTAAATATGATTTACTTGTATTGGAAACATATTTAGTGGAGAA
TGATGACTCCGCCTGGATACTGGATTCAGGAGCCACTAATCACGTTCGTTCTTCATTTCAGGGAATTAGTTCCTGGAGGCAGCGTGACGCCGGAGAGATGACTCTCAAGG
TCGGAACGAGAGAGGTCGTCTCAGCTGTGGCGATGTATTCAATTTCATTTCATTTAAATGAAGCGTTCATTTCAAGAAATGGTGTTAACATTTGTTCAGCTTTACTTGAA
AACAACTTGTATGTGCTAAGACCAACCATAAAAGAAGGCCATGTTCGAGATCATAAACCACGAAGTAAGGTAGTATTTAACGAGATTTCCGAAGAGGCTACAAACACGTC
AACAAGAGTTGTTGATCAAACTAGCACTACAACAAGAGTTGTTGATGAAGCCATCACATCGTGTCAGTTGTATCCACCTCAAGTGTTGAAGGTGCCTCGACGTAGTGGGA
GGATTGTGTCACAACCTGACTGCTACGTGGGTTTAACTGAAACTCAAGTTGTCATACCTGATGACGGCGTCGAGGATCCATTGACCTATAAGAAGGCAATGGAAGATACT
GACAAGGACAAATGGGTCAACGCAATGGACCTGGAAATGGAGTCGATGCCTAGCACCACTACTTCGCAAGCACCACTCTACTTTAGTCCTTCAAGCCTAACTCCTCGAGT
CGCATTGGCCCTTGACATTTATCCCCAAGGCCTCAATACTCCTCTACAGAAACTTGATGTTCGGCCTTGCCATTTATTCCCAAGGCCTGAATTGTGCACCGGTCATTGCC
GTTTATCCCCAAGGTACCGATGA
mRNA sequenceShow/hide mRNA sequence
ATGTGGGTTGCGAGATCTGTTCATGGGTCACTGCCGGTCGCCAATATCACCCTCAGCCACATTTTAAGTGGTTTCGCTGTCAGGCGTTGTCCTAAGTGGTCTCGCCGGAC
GCGCTGCATGTGGCTGAAGGTCGTCGCATGTGGATCATGGTGGGGTGCGGTCGCTGCCGGGTTGTTGCTTGTTGCTGGAGAAGACAGGTGGTTTGAGCTACACAACGTTT
CAGTTGGGTTGTTGTATCCTGGAGGGGACAGACCAGAGAATACCTGCTGCACCGGGAGATTCAACATCAATTTAAGAAGAATTGATATGGAGTCTGCTGTTGAGAAACCT
AATGAGAATGTTGGAGATCACAAGGTGGCCTATGTCCTCACAGAAAATAACCGAAAGAAAATCTATATTGAAAGCATGAACGTTGAAGAGTTCATGGAACACCAGGAGAA
AATCAACAAGTATGAGAAGAGTAAATATGATACTGAAGAGGCAGGGGCGAAAAAATATGCTGCAAGTCGTTTCTTTCGCTTTCATATGGTGGATGAAAAATCTGTAGTGG
AACAAGAGCAAGACTTTCAGGTGTGTTCGGGGATAAAATATCTGATTGCCACATGGAGGACGGGGATTGGCTGTCAGAAGAAATTGTTTGAAAGAATTGTTCAGCAAAGC
TCCTCCTCGGAACCTTCCGAGGAGGGAAGTTCATTACACCTAATTGTACTAGAACCCCGAAAGTATCCGAGTTCGATCTACTCGAACCCGACACGTCTGACCTCAAAACT
GACCAAGGTCTTCCCATTGCCCTTTCAAGTCGAAGCTAGAACCGGGTTCGAGCTTAGTATCCGTTCGAACTCTGCCAACACGACAGAACGAAAAGGTCTGAACGGTAACA
ACGGCACTCAACGCGAGCTCGATGCCAAAACAGTCGAGGACCAGGCCTCCCGAATGGCTACCCCAGTTGAGGACCCCGAAGTTCTGGCTACCCTCCAACTGGAGTTGGAT
GACATGCGCAATCAAGTGAGCAATATGGATGAAATGTACATCGAGATGATGCGGGCCAATCGGACGGGATCTCCTTGCAAAGTCCCAGGCGGAGATGACATGGTTGATGA
TGAGAATGGACATGATCTACCCCTCTATCATGACGATCACGAACCCGTCGCTAATAACGAAGGGATTGACTATAGCCTGAGGGGTTATGATCTCAGGAAGCACCTCGTTG
ACAAGAAGAGGAGAGCCTCACGAGGAACGGGGGTCTCTCCGTCCTGCTCCCGAGGATACTCTAACTCCGACCTCAAGGCTCAGTCGAAGTACACGCCTTTAACACCAGAG
GCTGTGGTAAAGAGAAAAGAGTTCGATCTAATGAAAAACAAGTTTGACGAACAAGTTGAGGCACTTAGGGCGAAGTGCGAAAAGAAAGAAAGTCCTTTTGATAATGGCGA
GATAGTCGAATCGTCGTTCACCTCGGACATCCTAGAGGCGCCCATTCCTCTAAAGTTCAAGACGTCCGCCATGAAGCCCTACGACAGGTCCAAGGACCCAAAGGATTATG
TGGAAGTCTTTCAGGGTCTCATGGACTTCCAGCCATTAAGTGTCGAGCCTTTCAAATCGCTCTCACGGGAAACGCTCTACTCTGGTACAGGAGGCTTCCAGGGAGTAGAT
GATCTTAGGTTCGTCTTGCAAGAGGATTGTCCTCAAGCTCCTGCGCCTAACGCCACTGTGGCGATGCGCAATGCCTATGACAGGTGGATCAAGGCTGATTACGCTTGTTT
ACATGCCATAAACGCCCAGTCAAAGGCCAATGACAAGGCCAAGGTCTACATCTTGGCGAGCATATCTGATGTGCTTGCCAAGAAGCACGAGGACACAGTCACCGTTAAGG
AGATCATGGACTCACGGCAGAGCATGTTTGGACAACCGTCCTCACAGGCTAGACATGAAGCCCTTAAGTTCATTTACAACTCCCGCATGAAGGAGGGTTCATCAGTGCGA
GAACACGTTCTCAACCTGATGGTCCACTTCAACGTGGCTGAGTCGAACGGGGCCGTCATAGACGAGCAGAGTCAGGTCAGCTTTATTCTGGAATCTCTTCCGAAGAGTTT
CCTTCCATTCGGCAGCAATGCGGTTATGAATAAGCTGGAGTACACTCTTACCACACTCCTAAACGAGCTTCAGACCTACCAGTCTCTTATGAAATGTAAGGGACAAGAAG
GGGAGGCAAATGTTGCCACCTCAAATAGGTTCAACAGAGGATCGTCCTCTGGAACCAGGTCTGCGCCCTCTTCTTTTGGAAGTAAGACTTTTAAGAAGAAGAAGGCTGCT
GGTAAGGGGTCTAAACCTGACTCAGCTGCTGCTGCCCAGAAAGGCAAGGTCAAGAAAGCCAACGAAGGTAAATATGATTTACTTGTATTGGAAACATATTTAGTGGAGAA
TGATGACTCCGCCTGGATACTGGATTCAGGAGCCACTAATCACGTTCGTTCTTCATTTCAGGGAATTAGTTCCTGGAGGCAGCGTGACGCCGGAGAGATGACTCTCAAGG
TCGGAACGAGAGAGGTCGTCTCAGCTGTGGCGATGTATTCAATTTCATTTCATTTAAATGAAGCGTTCATTTCAAGAAATGGTGTTAACATTTGTTCAGCTTTACTTGAA
AACAACTTGTATGTGCTAAGACCAACCATAAAAGAAGGCCATGTTCGAGATCATAAACCACGAAGTAAGGTAGTATTTAACGAGATTTCCGAAGAGGCTACAAACACGTC
AACAAGAGTTGTTGATCAAACTAGCACTACAACAAGAGTTGTTGATGAAGCCATCACATCGTGTCAGTTGTATCCACCTCAAGTGTTGAAGGTGCCTCGACGTAGTGGGA
GGATTGTGTCACAACCTGACTGCTACGTGGGTTTAACTGAAACTCAAGTTGTCATACCTGATGACGGCGTCGAGGATCCATTGACCTATAAGAAGGCAATGGAAGATACT
GACAAGGACAAATGGGTCAACGCAATGGACCTGGAAATGGAGTCGATGCCTAGCACCACTACTTCGCAAGCACCACTCTACTTTAGTCCTTCAAGCCTAACTCCTCGAGT
CGCATTGGCCCTTGACATTTATCCCCAAGGCCTCAATACTCCTCTACAGAAACTTGATGTTCGGCCTTGCCATTTATTCCCAAGGCCTGAATTGTGCACCGGTCATTGCC
GTTTATCCCCAAGGTACCGATGA
Protein sequenceShow/hide protein sequence
MWVARSVHGSLPVANITLSHILSGFAVRRCPKWSRRTRCMWLKVVACGSWWGAVAAGLLLVAGEDRWFELHNVSVGLLYPGGDRPENTCCTGRFNINLRRIDMESAVEKP
NENVGDHKVAYVLTENNRKKIYIESMNVEEFMEHQEKINKYEKSKYDTEEAGAKKYAASRFFRFHMVDEKSVVEQEQDFQVCSGIKYLIATWRTGIGCQKKLFERIVQQS
SSSEPSEEGSSLHLIVLEPRKYPSSIYSNPTRLTSKLTKVFPLPFQVEARTGFELSIRSNSANTTERKGLNGNNGTQRELDAKTVEDQASRMATPVEDPEVLATLQLELD
DMRNQVSNMDEMYIEMMRANRTGSPCKVPGGDDMVDDENGHDLPLYHDDHEPVANNEGIDYSLRGYDLRKHLVDKKRRASRGTGVSPSCSRGYSNSDLKAQSKYTPLTPE
AVVKRKEFDLMKNKFDEQVEALRAKCEKKESPFDNGEIVESSFTSDILEAPIPLKFKTSAMKPYDRSKDPKDYVEVFQGLMDFQPLSVEPFKSLSRETLYSGTGGFQGVD
DLRFVLQEDCPQAPAPNATVAMRNAYDRWIKADYACLHAINAQSKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSRQSMFGQPSSQARHEALKFIYNSRMKEGSSVR
EHVLNLMVHFNVAESNGAVIDEQSQVSFILESLPKSFLPFGSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSNRFNRGSSSGTRSAPSSFGSKTFKKKKAA
GKGSKPDSAAAAQKGKVKKANEGKYDLLVLETYLVENDDSAWILDSGATNHVRSSFQGISSWRQRDAGEMTLKVGTREVVSAVAMYSISFHLNEAFISRNGVNICSALLE
NNLYVLRPTIKEGHVRDHKPRSKVVFNEISEEATNTSTRVVDQTSTTTRVVDEAITSCQLYPPQVLKVPRRSGRIVSQPDCYVGLTETQVVIPDDGVEDPLTYKKAMEDT
DKDKWVNAMDLEMESMPSTTTSQAPLYFSPSSLTPRVALALDIYPQGLNTPLQKLDVRPCHLFPRPELCTGHCRLSPRYR