; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc07G06120 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc07G06120
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionGag/pol protein
Genome locationClcChr07:10735526..10748075
RNA-Seq ExpressionClc07G06120
SyntenyClc07G06120
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]1.7e-20140.66Show/hide
Query:  RDAHDRWTKANEKVKVYILASISDILSKKHEK-------MNSLQVLFGQLSSSAMHDAIKYVYNCRMKEGTNVKEHVLDMMVHFNIAEVNRVVMNEKSQI
        R+A+DRW KAN+K +VYILAS++D+L+KKH+        M+SL+ +FGQ S S  H+AIK++Y  RMKEGT+V+EHVLDMM+HFNIAEVN   ++E +Q+
Subjt:  RDAHDRWTKANEKVKVYILASISDILSKKHEK-------MNSLQVLFGQLSSSAMHDAIKYVYNCRMKEGTNVKEHVLDMMVHFNIAEVNRVVMNEKSQI

Query:  ------------------------------------YQSLLKNKGFEAKANVATTSKRRFQKGPTSGNK--------------------SVPKNKEKGKC
                                            +Q+L  +KG E +ANVA T KR+F +G +S NK                     V KN +KGKC
Subjt:  ------------------------------------YQSLLKNKGFEAKANVATTSKRRFQKGPTSGNK--------------------SVPKNKEKGKC

Query:  FYCNGDGHWKRNCPKYLADKKAGKENQ------------------------GATNHVCSSFQETSSWKQLEDGEITLWVGLGELISALAVGDV-------
        F+CN DGHWKRNCPKYLA+KKA K  Q                        GATNH+C SFQETSSWK+L++GEITL VG GE++SA AVGD+       
Subjt:  FYCNGDGHWKRNCPKYLADKKAGKENQ------------------------GATNHVCSSFQETSSWKQLEDGEITLWVGLGELISALAVGDV-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -KIERLVKSGLLNKLEDDSLPPCESCLEGKMTKRSFTRKGLRAKEPLELVHLDLCGPMNVKARGRYEYFINFIDDYS-----------------------
         +IERLVKSG+LN+LED+SLPPCESCLEGKMTKRSFT KGLRAK PLELVH DLCGPMNVKARG YEYFI+FIDD+S                       
Subjt:  -KIERLVKSGLLNKLEDDSLPPCESCLEGKMTKRSFTRKGLRAKEPLELVHLDLCGPMNVKARGRYEYFINFIDDYS-----------------------

Query:  ------------------------------------------STPQQNGVSERRNRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVPSKSVLGTPY
                                                  STPQQNGVSERRNRTLLDMVRSMMSYA LP+ FW +A+ET ++ILNNVPSKSVL TPY
Subjt:  ------------------------------------------STPQQNGVSERRNRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVPSKSVLGTPY

Query:  ELWTGHKASLRHFRIWGCPAYVLVKNPKKLEHCSKLCLFIGYPKETRGGLFYDPQDEKVIVSTNATFLEEDHIRDHKPRSKLVLEAISKVATNRSTKVAD
        ELW G K+SLR+FRIWGCPA+VLV+NPKKLE  SKLCLF+GYPKE+RGGLFY PQ+ KV VSTNATFLEEDH R+H+PRSK+VL+ + K AT       D
Subjt:  ELWTGHKASLRHFRIWGCPAYVLVKNPKKLEHCSKLCLFIGYPKETRGGLFYDPQDEKVIVSTNATFLEEDHIRDHKPRSKLVLEAISKVATNRSTKVAD

Query:  QADQSTRVVDRPSTSSPSRPSQELREPR-----------------------------------AMNDVDRDQWVKAMNLEMESMYFNLVWELVNQPDEIR
        +   ST+VVD+ + S  S  SQELR PR                                   AMNDVDRDQW+KAMNLEMESMYFN VW LV+ P +++
Subjt:  QADQSTRVVDRPSTSSPSRPSQELREPR-----------------------------------AMNDVDRDQWVKAMNLEMESMYFNLVWELVNQPDEIR

Query:  PIYCKWIYKHKRDYMGNVQTYKAQLVAKGYTQ--------------------------------------------------------------------
        PI CKWIYK KRD  G VQT+KA+LVAKGYTQ                                                                    
Subjt:  PIYCKWIYKHKRDYMGNVQTYKAQLVAKGYTQ--------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------EQCPKTPQEVEDMRRVPYASAVGSLMYAMLCTRPNICFAVGIVSRYQSNPEFALWTVVKNI
                                               EQCPKTPQEVEDMR +PY+SAVGSLMYAMLCTRP+IC++VGIVSRYQSNP    WT VKNI
Subjt:  ---------------------------------------EQCPKTPQEVEDMRRVPYASAVGSLMYAMLCTRPNICFAVGIVSRYQSNPEFALWTVVKNI

Query:  LKYLRRTRNYVLVYENKDLILTGYTDFDF
        LKYLRRTRNY+LVY  KDLILTGYTD DF
Subjt:  LKYLRRTRNYVLVYENKDLILTGYTDFDF

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]2.5e-19239.02Show/hide
Query:  RDAHDRWTKANEKVKVYILASISDILSKKHEK-------MNSLQVLFGQLSSSAMHDAIKYVYNCRMKEGTNVKEHVLDMMVHFNIAEVNRVVMNEKSQI
        R+ ++RW KANEK + YILAS+S++L+KKHE        M+SLQ +FGQ S    HDA+KY+YN RM EG +V+EHVL+MMVHFN+AE+N  V++E SQ+
Subjt:  RDAHDRWTKANEKVKVYILASISDILSKKHEK-------MNSLQVLFGQLSSSAMHDAIKYVYNCRMKEGTNVKEHVLDMMVHFNIAEVNRVVMNEKSQI

Query:  ------------------------------------YQSLLKNKGFEAKANVATTSKRRFQKGPTSGNKSVPK---NKE---------------------
                                            ++SL+K KG + +ANVA TS R+F +G TSG KS+P    NK+                     
Subjt:  ------------------------------------YQSLLKNKGFEAKANVATTSKRRFQKGPTSGNKSVPK---NKE---------------------

Query:  -----KGKCFYCNGDGHWKRNCPKYLADKKAGKE----------------------NQGATNHVCSSFQETSSWKQLEDGEITLWVGLGELISALAVGDV
             KG CF+CN +GHWKRNCPKYLA+KK  K+                      + GATNHVCSSFQ  SSW+QLE GE+T+ VG G ++SA+AVG +
Subjt:  -----KGKCFYCNGDGHWKRNCPKYLADKKAGKE----------------------NQGATNHVCSSFQETSSWKQLEDGEITLWVGLGELISALAVGDV

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------KIERLVKSGLLNKLEDDSLPPCESCLEGKMTKRSFTRKGLRAKEPLELVHLDLCGPMNVKARGRYEYFINFIDDYS--------------
                  +IERLVK+GLL++LE++SLP CESCLEGKMTKR FT KG RAKEPLELVH DLCGPMNVKARG +EYFI F DDYS              
Subjt:  ----------KIERLVKSGLLNKLEDDSLPPCESCLEGKMTKRSFTRKGLRAKEPLELVHLDLCGPMNVKARGRYEYFINFIDDYS--------------

Query:  ---------------------------------------------------STPQQNGVSERRNRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVP
                                                            TPQQNGVSERRNRTLLDMVRSMMSYA LPN FW +AV+T VYILN VP
Subjt:  ---------------------------------------------------STPQQNGVSERRNRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVP

Query:  SKSVLGTPYELWTGHKASLRHFRIWGCPAYVLVKNPKKLEHCSKLCLFIGYPKETRGGLFYDPQDEKVIVSTNATFLEEDHIRDHKPRSKLVLEAISKVA
        SKSV  TP +LW G K SLRHFRIWGCPA+VL  NPKKLE  SKLCLF+GYPK TRGG FYDP+D KV VSTNATFLEEDHIR+HKPRSK+VL  +SK  
Subjt:  SKSVLGTPYELWTGHKASLRHFRIWGCPAYVLVKNPKKLEHCSKLCLFIGYPKETRGGLFYDPQDEKVIVSTNATFLEEDHIRDHKPRSKLVLEAISKVA

Query:  TNRSTKVADQADQSTRVVDRPSTSSPSRPSQELREPR-----------------------------------AMNDVDRDQWVKAMNLEMESMYFNLVWE
        T  ST+V ++    TRVV   S++   +P Q LREPR                                   AM DVD+D+W+KAMNLE+ESMYFN VW+
Subjt:  TNRSTKVADQADQSTRVVDRPSTSSPSRPSQELREPR-----------------------------------AMNDVDRDQWVKAMNLEMESMYFNLVWE

Query:  LVNQPDEIRPIYCKWIYKHKRDYMGNVQTYKAQLVAKGYTQ-----------------------------------------------------------
        LV+QPD ++PI CKWIYK KR   G VQT+KA+LVAKGYTQ                                                           
Subjt:  LVNQPDEIRPIYCKWIYKHKRDYMGNVQTYKAQLVAKGYTQ-----------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------EQCPKTPQEVEDMRRVPYASAVGSLMYAMLCTRPNICFAVGIVSRYQSNPEF
                                                        EQCPKTPQ+VE+MR +PYASAVGSLMYAMLCTRP+IC+AVGIVSRYQSNP  
Subjt:  ------------------------------------------------EQCPKTPQEVEDMRRVPYASAVGSLMYAMLCTRPNICFAVGIVSRYQSNPEF

Query:  ALWTVVKNILKYLRRTRNYVLVYENKDLILTGYTDFDF
        A WT VK ILKYLRRTR+Y+LVY +KDLILTGYTD DF
Subjt:  ALWTVVKNILKYLRRTRNYVLVYENKDLILTGYTDFDF

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]3.3e-19239.02Show/hide
Query:  RDAHDRWTKANEKVKVYILASISDILSKKHEK-------MNSLQVLFGQLSSSAMHDAIKYVYNCRMKEGTNVKEHVLDMMVHFNIAEVNRVVMNEKSQI
        R+ ++RW KANEK + YILAS+S++L+KKHE        M+SLQ +FGQ S    HDA+KY+YN RM EG +V+EHVL+MMVHFN+AE+N  V++E SQ+
Subjt:  RDAHDRWTKANEKVKVYILASISDILSKKHEK-------MNSLQVLFGQLSSSAMHDAIKYVYNCRMKEGTNVKEHVLDMMVHFNIAEVNRVVMNEKSQI

Query:  ------------------------------------YQSLLKNKGFEAKANVATTSKRRFQKGPTSGNKSVPK---NKE---------------------
                                            ++SL+K KG + +ANVA TS R+F +G TSG KS+P    NK+                     
Subjt:  ------------------------------------YQSLLKNKGFEAKANVATTSKRRFQKGPTSGNKSVPK---NKE---------------------

Query:  -----KGKCFYCNGDGHWKRNCPKYLADKKAGKE----------------------NQGATNHVCSSFQETSSWKQLEDGEITLWVGLGELISALAVGDV
             KG CF+CN +GHWKRNCPKYLA+KK  K+                      + GATNHVCSSFQ  SSW+QLE GE+T+ VG G ++SA+AVG +
Subjt:  -----KGKCFYCNGDGHWKRNCPKYLADKKAGKE----------------------NQGATNHVCSSFQETSSWKQLEDGEITLWVGLGELISALAVGDV

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------KIERLVKSGLLNKLEDDSLPPCESCLEGKMTKRSFTRKGLRAKEPLELVHLDLCGPMNVKARGRYEYFINFIDDYS--------------
                  +IERLVK+GLL++LE++SLP CESCLEGKMTKR FT KG RAKEPLELVH DLCGPMNVKARG +EYFI F DDYS              
Subjt:  ----------KIERLVKSGLLNKLEDDSLPPCESCLEGKMTKRSFTRKGLRAKEPLELVHLDLCGPMNVKARGRYEYFINFIDDYS--------------

Query:  ---------------------------------------------------STPQQNGVSERRNRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVP
                                                            TPQQNGVSERRNRTLLDMVRSMMSYA LPN FW +AV+T VYILN VP
Subjt:  ---------------------------------------------------STPQQNGVSERRNRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVP

Query:  SKSVLGTPYELWTGHKASLRHFRIWGCPAYVLVKNPKKLEHCSKLCLFIGYPKETRGGLFYDPQDEKVIVSTNATFLEEDHIRDHKPRSKLVLEAISKVA
        SKSV  TP +LW G K SLRHFRIWGCPA+VL  NPKKLE  SKLCLF+GYPK TRGG FYDP+D KV VSTNATFLEEDHIR+HKPRSK+VL  +SK  
Subjt:  SKSVLGTPYELWTGHKASLRHFRIWGCPAYVLVKNPKKLEHCSKLCLFIGYPKETRGGLFYDPQDEKVIVSTNATFLEEDHIRDHKPRSKLVLEAISKVA

Query:  TNRSTKVADQADQSTRVVDRPSTSSPSRPSQELREPR-----------------------------------AMNDVDRDQWVKAMNLEMESMYFNLVWE
        T  ST+V ++    TRVV   S++   +P Q LREPR                                   AM DVD+D+W+KAMNLE+ESMYFN VW+
Subjt:  TNRSTKVADQADQSTRVVDRPSTSSPSRPSQELREPR-----------------------------------AMNDVDRDQWVKAMNLEMESMYFNLVWE

Query:  LVNQPDEIRPIYCKWIYKHKRDYMGNVQTYKAQLVAKGYTQ-----------------------------------------------------------
        LV+QPD ++PI CKWIYK KR   G VQT+KA+LVAKGYTQ                                                           
Subjt:  LVNQPDEIRPIYCKWIYKHKRDYMGNVQTYKAQLVAKGYTQ-----------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------EQCPKTPQEVEDMRRVPYASAVGSLMYAMLCTRPNICFAVGIVSRYQSNPEF
                                                        EQCPKTPQ+VE+MR +PYASAVGSLMYAMLCTRP+IC+AVGIVSRYQSNP  
Subjt:  ------------------------------------------------EQCPKTPQEVEDMRRVPYASAVGSLMYAMLCTRPNICFAVGIVSRYQSNPEF

Query:  ALWTVVKNILKYLRRTRNYVLVYENKDLILTGYTDFDF
        A WT VK ILKYLRRTR+Y LVY +KDLILTGYTD DF
Subjt:  ALWTVVKNILKYLRRTRNYVLVYENKDLILTGYTDFDF

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]3.3e-19239.02Show/hide
Query:  RDAHDRWTKANEKVKVYILASISDILSKKHEK-------MNSLQVLFGQLSSSAMHDAIKYVYNCRMKEGTNVKEHVLDMMVHFNIAEVNRVVMNEKSQI
        R+ ++RW KANEK + YILAS+S++L+KKHE        M+SLQ +FGQ S    HDA+KY+YN RM EG +V+EHVL+MMVHFN+AE+N  V++E SQ+
Subjt:  RDAHDRWTKANEKVKVYILASISDILSKKHEK-------MNSLQVLFGQLSSSAMHDAIKYVYNCRMKEGTNVKEHVLDMMVHFNIAEVNRVVMNEKSQI

Query:  ------------------------------------YQSLLKNKGFEAKANVATTSKRRFQKGPTSGNKSVPK---NKE---------------------
                                            ++SL+K KG + +ANVA TS R+F +G TSG KS+P    NK+                     
Subjt:  ------------------------------------YQSLLKNKGFEAKANVATTSKRRFQKGPTSGNKSVPK---NKE---------------------

Query:  -----KGKCFYCNGDGHWKRNCPKYLADKKAGKE----------------------NQGATNHVCSSFQETSSWKQLEDGEITLWVGLGELISALAVGDV
             KG CF+CN +GHWKRNCPKYLA+KK  K+                      + GATNHVCSSFQ  SSW+QLE GE+T+ VG G ++SA+AVG +
Subjt:  -----KGKCFYCNGDGHWKRNCPKYLADKKAGKE----------------------NQGATNHVCSSFQETSSWKQLEDGEITLWVGLGELISALAVGDV

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------KIERLVKSGLLNKLEDDSLPPCESCLEGKMTKRSFTRKGLRAKEPLELVHLDLCGPMNVKARGRYEYFINFIDDYS--------------
                  +IERLVK+GLL++LE++SLP CESCLEGKMTKR FT KG RAKEPLELVH DLCGPMNVKARG +EYFI F DDYS              
Subjt:  ----------KIERLVKSGLLNKLEDDSLPPCESCLEGKMTKRSFTRKGLRAKEPLELVHLDLCGPMNVKARGRYEYFINFIDDYS--------------

Query:  ---------------------------------------------------STPQQNGVSERRNRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVP
                                                            TPQQNGVSERRNRTLLDMVRSMMSYA LPN FW +AV+T VYILN VP
Subjt:  ---------------------------------------------------STPQQNGVSERRNRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVP

Query:  SKSVLGTPYELWTGHKASLRHFRIWGCPAYVLVKNPKKLEHCSKLCLFIGYPKETRGGLFYDPQDEKVIVSTNATFLEEDHIRDHKPRSKLVLEAISKVA
        SKSV  TP +LW G K SLRHFRIWGCPA+VL  NPKKLE  SKLCLF+GYPK TRGG FYDP+D KV VSTNATFLEEDHIR+HKPRSK+VL  +SK  
Subjt:  SKSVLGTPYELWTGHKASLRHFRIWGCPAYVLVKNPKKLEHCSKLCLFIGYPKETRGGLFYDPQDEKVIVSTNATFLEEDHIRDHKPRSKLVLEAISKVA

Query:  TNRSTKVADQADQSTRVVDRPSTSSPSRPSQELREPR-----------------------------------AMNDVDRDQWVKAMNLEMESMYFNLVWE
        T  ST+V ++    TRVV   S++   +P Q LREPR                                   AM DVD+D+W+KAMNLE+ESMYFN VW+
Subjt:  TNRSTKVADQADQSTRVVDRPSTSSPSRPSQELREPR-----------------------------------AMNDVDRDQWVKAMNLEMESMYFNLVWE

Query:  LVNQPDEIRPIYCKWIYKHKRDYMGNVQTYKAQLVAKGYTQ-----------------------------------------------------------
        LV+QPD ++PI CKWIYK KR   G VQT+KA+LVAKGYTQ                                                           
Subjt:  LVNQPDEIRPIYCKWIYKHKRDYMGNVQTYKAQLVAKGYTQ-----------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------EQCPKTPQEVEDMRRVPYASAVGSLMYAMLCTRPNICFAVGIVSRYQSNPEF
                                                        EQCPKTPQ+VE+MR +PYASAVGSLMYAMLCTRP+IC+AVGIVSRYQSNP  
Subjt:  ------------------------------------------------EQCPKTPQEVEDMRRVPYASAVGSLMYAMLCTRPNICFAVGIVSRYQSNPEF

Query:  ALWTVVKNILKYLRRTRNYVLVYENKDLILTGYTDFDF
        A WT VK ILKYLRRTR+Y LVY +KDLILTGYTD DF
Subjt:  ALWTVVKNILKYLRRTRNYVLVYENKDLILTGYTDFDF

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]3.3e-19239.02Show/hide
Query:  RDAHDRWTKANEKVKVYILASISDILSKKHEK-------MNSLQVLFGQLSSSAMHDAIKYVYNCRMKEGTNVKEHVLDMMVHFNIAEVNRVVMNEKSQI
        R+ ++RW KANEK + YILAS+S++L+KKHE        M+SLQ +FGQ S    HDA+KY+YN RM EG +V+EHVL+MMVHFN+AE+N  V++E SQ+
Subjt:  RDAHDRWTKANEKVKVYILASISDILSKKHEK-------MNSLQVLFGQLSSSAMHDAIKYVYNCRMKEGTNVKEHVLDMMVHFNIAEVNRVVMNEKSQI

Query:  ------------------------------------YQSLLKNKGFEAKANVATTSKRRFQKGPTSGNKSVPK---NKE---------------------
                                            ++SL+K KG + +ANVA TS R+F +G TSG KS+P    NK+                     
Subjt:  ------------------------------------YQSLLKNKGFEAKANVATTSKRRFQKGPTSGNKSVPK---NKE---------------------

Query:  -----KGKCFYCNGDGHWKRNCPKYLADKKAGKE----------------------NQGATNHVCSSFQETSSWKQLEDGEITLWVGLGELISALAVGDV
             KG CF+CN +GHWKRNCPKYLA+KK  K+                      + GATNHVCSSFQ  SSW+QLE GE+T+ VG G ++SA+AVG +
Subjt:  -----KGKCFYCNGDGHWKRNCPKYLADKKAGKE----------------------NQGATNHVCSSFQETSSWKQLEDGEITLWVGLGELISALAVGDV

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------KIERLVKSGLLNKLEDDSLPPCESCLEGKMTKRSFTRKGLRAKEPLELVHLDLCGPMNVKARGRYEYFINFIDDYS--------------
                  +IERLVK+GLL++LE++SLP CESCLEGKMTKR FT KG RAKEPLELVH DLCGPMNVKARG +EYFI F DDYS              
Subjt:  ----------KIERLVKSGLLNKLEDDSLPPCESCLEGKMTKRSFTRKGLRAKEPLELVHLDLCGPMNVKARGRYEYFINFIDDYS--------------

Query:  ---------------------------------------------------STPQQNGVSERRNRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVP
                                                            TPQQNGVSERRNRTLLDMVRSMMSYA LPN FW +AV+T VYILN VP
Subjt:  ---------------------------------------------------STPQQNGVSERRNRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVP

Query:  SKSVLGTPYELWTGHKASLRHFRIWGCPAYVLVKNPKKLEHCSKLCLFIGYPKETRGGLFYDPQDEKVIVSTNATFLEEDHIRDHKPRSKLVLEAISKVA
        SKSV  TP +LW G K SLRHFRIWGCPA+VL  NPKKLE  SKLCLF+GYPK TRGG FYDP+D KV VSTNATFLEEDHIR+HKPRSK+VL  +SK  
Subjt:  SKSVLGTPYELWTGHKASLRHFRIWGCPAYVLVKNPKKLEHCSKLCLFIGYPKETRGGLFYDPQDEKVIVSTNATFLEEDHIRDHKPRSKLVLEAISKVA

Query:  TNRSTKVADQADQSTRVVDRPSTSSPSRPSQELREPR-----------------------------------AMNDVDRDQWVKAMNLEMESMYFNLVWE
        T  ST+V ++    TRVV   S++   +P Q LREPR                                   AM DVD+D+W+KAMNLE+ESMYFN VW+
Subjt:  TNRSTKVADQADQSTRVVDRPSTSSPSRPSQELREPR-----------------------------------AMNDVDRDQWVKAMNLEMESMYFNLVWE

Query:  LVNQPDEIRPIYCKWIYKHKRDYMGNVQTYKAQLVAKGYTQ-----------------------------------------------------------
        LV+QPD ++PI CKWIYK KR   G VQT+KA+LVAKGYTQ                                                           
Subjt:  LVNQPDEIRPIYCKWIYKHKRDYMGNVQTYKAQLVAKGYTQ-----------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------EQCPKTPQEVEDMRRVPYASAVGSLMYAMLCTRPNICFAVGIVSRYQSNPEF
                                                        EQCPKTPQ+VE+MR +PYASAVGSLMYAMLCTRP+IC+AVGIVSRYQSNP  
Subjt:  ------------------------------------------------EQCPKTPQEVEDMRRVPYASAVGSLMYAMLCTRPNICFAVGIVSRYQSNPEF

Query:  ALWTVVKNILKYLRRTRNYVLVYENKDLILTGYTDFDF
        A WT VK ILKYLRRTR+Y LVY +KDLILTGYTD DF
Subjt:  ALWTVVKNILKYLRRTRNYVLVYENKDLILTGYTDFDF

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein1.6e-19239.02Show/hide
Query:  RDAHDRWTKANEKVKVYILASISDILSKKHEK-------MNSLQVLFGQLSSSAMHDAIKYVYNCRMKEGTNVKEHVLDMMVHFNIAEVNRVVMNEKSQI
        R+ ++RW KANEK + YILAS+S++L+KKHE        M+SLQ +FGQ S    HDA+KY+YN RM EG +V+EHVL+MMVHFN+AE+N  V++E SQ+
Subjt:  RDAHDRWTKANEKVKVYILASISDILSKKHEK-------MNSLQVLFGQLSSSAMHDAIKYVYNCRMKEGTNVKEHVLDMMVHFNIAEVNRVVMNEKSQI

Query:  ------------------------------------YQSLLKNKGFEAKANVATTSKRRFQKGPTSGNKSVPK---NKE---------------------
                                            ++SL+K KG + +ANVA TS R+F +G TSG KS+P    NK+                     
Subjt:  ------------------------------------YQSLLKNKGFEAKANVATTSKRRFQKGPTSGNKSVPK---NKE---------------------

Query:  -----KGKCFYCNGDGHWKRNCPKYLADKKAGKE----------------------NQGATNHVCSSFQETSSWKQLEDGEITLWVGLGELISALAVGDV
             KG CF+CN +GHWKRNCPKYLA+KK  K+                      + GATNHVCSSFQ  SSW+QLE GE+T+ VG G ++SA+AVG +
Subjt:  -----KGKCFYCNGDGHWKRNCPKYLADKKAGKE----------------------NQGATNHVCSSFQETSSWKQLEDGEITLWVGLGELISALAVGDV

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------KIERLVKSGLLNKLEDDSLPPCESCLEGKMTKRSFTRKGLRAKEPLELVHLDLCGPMNVKARGRYEYFINFIDDYS--------------
                  +IERLVK+GLL++LE++SLP CESCLEGKMTKR FT KG RAKEPLELVH DLCGPMNVKARG +EYFI F DDYS              
Subjt:  ----------KIERLVKSGLLNKLEDDSLPPCESCLEGKMTKRSFTRKGLRAKEPLELVHLDLCGPMNVKARGRYEYFINFIDDYS--------------

Query:  ---------------------------------------------------STPQQNGVSERRNRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVP
                                                            TPQQNGVSERRNRTLLDMVRSMMSYA LPN FW +AV+T VYILN VP
Subjt:  ---------------------------------------------------STPQQNGVSERRNRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVP

Query:  SKSVLGTPYELWTGHKASLRHFRIWGCPAYVLVKNPKKLEHCSKLCLFIGYPKETRGGLFYDPQDEKVIVSTNATFLEEDHIRDHKPRSKLVLEAISKVA
        SKSV  TP +LW G K SLRHFRIWGCPA+VL  NPKKLE  SKLCLF+GYPK TRGG FYDP+D KV VSTNATFLEEDHIR+HKPRSK+VL  +SK  
Subjt:  SKSVLGTPYELWTGHKASLRHFRIWGCPAYVLVKNPKKLEHCSKLCLFIGYPKETRGGLFYDPQDEKVIVSTNATFLEEDHIRDHKPRSKLVLEAISKVA

Query:  TNRSTKVADQADQSTRVVDRPSTSSPSRPSQELREPR-----------------------------------AMNDVDRDQWVKAMNLEMESMYFNLVWE
        T  ST+V ++    TRVV   S++   +P Q LREPR                                   AM DVD+D+W+KAMNLE+ESMYFN VW+
Subjt:  TNRSTKVADQADQSTRVVDRPSTSSPSRPSQELREPR-----------------------------------AMNDVDRDQWVKAMNLEMESMYFNLVWE

Query:  LVNQPDEIRPIYCKWIYKHKRDYMGNVQTYKAQLVAKGYTQ-----------------------------------------------------------
        LV+QPD ++PI CKWIYK KR   G VQT+KA+LVAKGYTQ                                                           
Subjt:  LVNQPDEIRPIYCKWIYKHKRDYMGNVQTYKAQLVAKGYTQ-----------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------EQCPKTPQEVEDMRRVPYASAVGSLMYAMLCTRPNICFAVGIVSRYQSNPEF
                                                        EQCPKTPQ+VE+MR +PYASAVGSLMYAMLCTRP+IC+AVGIVSRYQSNP  
Subjt:  ------------------------------------------------EQCPKTPQEVEDMRRVPYASAVGSLMYAMLCTRPNICFAVGIVSRYQSNPEF

Query:  ALWTVVKNILKYLRRTRNYVLVYENKDLILTGYTDFDF
        A WT VK ILKYLRRTR+Y LVY +KDLILTGYTD DF
Subjt:  ALWTVVKNILKYLRRTRNYVLVYENKDLILTGYTDFDF

A0A5A7TWB9 Gag/pol protein1.2e-19239.02Show/hide
Query:  RDAHDRWTKANEKVKVYILASISDILSKKHEK-------MNSLQVLFGQLSSSAMHDAIKYVYNCRMKEGTNVKEHVLDMMVHFNIAEVNRVVMNEKSQI
        R+ ++RW KANEK + YILAS+S++L+KKHE        M+SLQ +FGQ S    HDA+KY+YN RM EG +V+EHVL+MMVHFN+AE+N  V++E SQ+
Subjt:  RDAHDRWTKANEKVKVYILASISDILSKKHEK-------MNSLQVLFGQLSSSAMHDAIKYVYNCRMKEGTNVKEHVLDMMVHFNIAEVNRVVMNEKSQI

Query:  ------------------------------------YQSLLKNKGFEAKANVATTSKRRFQKGPTSGNKSVPK---NKE---------------------
                                            ++SL+K KG + +ANVA TS R+F +G TSG KS+P    NK+                     
Subjt:  ------------------------------------YQSLLKNKGFEAKANVATTSKRRFQKGPTSGNKSVPK---NKE---------------------

Query:  -----KGKCFYCNGDGHWKRNCPKYLADKKAGKE----------------------NQGATNHVCSSFQETSSWKQLEDGEITLWVGLGELISALAVGDV
             KG CF+CN +GHWKRNCPKYLA+KK  K+                      + GATNHVCSSFQ  SSW+QLE GE+T+ VG G ++SA+AVG +
Subjt:  -----KGKCFYCNGDGHWKRNCPKYLADKKAGKE----------------------NQGATNHVCSSFQETSSWKQLEDGEITLWVGLGELISALAVGDV

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------KIERLVKSGLLNKLEDDSLPPCESCLEGKMTKRSFTRKGLRAKEPLELVHLDLCGPMNVKARGRYEYFINFIDDYS--------------
                  +IERLVK+GLL++LE++SLP CESCLEGKMTKR FT KG RAKEPLELVH DLCGPMNVKARG +EYFI F DDYS              
Subjt:  ----------KIERLVKSGLLNKLEDDSLPPCESCLEGKMTKRSFTRKGLRAKEPLELVHLDLCGPMNVKARGRYEYFINFIDDYS--------------

Query:  ---------------------------------------------------STPQQNGVSERRNRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVP
                                                            TPQQNGVSERRNRTLLDMVRSMMSYA LPN FW +AV+T VYILN VP
Subjt:  ---------------------------------------------------STPQQNGVSERRNRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVP

Query:  SKSVLGTPYELWTGHKASLRHFRIWGCPAYVLVKNPKKLEHCSKLCLFIGYPKETRGGLFYDPQDEKVIVSTNATFLEEDHIRDHKPRSKLVLEAISKVA
        SKSV  TP +LW G K SLRHFRIWGCPA+VL  NPKKLE  SKLCLF+GYPK TRGG FYDP+D KV VSTNATFLEEDHIR+HKPRSK+VL  +SK  
Subjt:  SKSVLGTPYELWTGHKASLRHFRIWGCPAYVLVKNPKKLEHCSKLCLFIGYPKETRGGLFYDPQDEKVIVSTNATFLEEDHIRDHKPRSKLVLEAISKVA

Query:  TNRSTKVADQADQSTRVVDRPSTSSPSRPSQELREPR-----------------------------------AMNDVDRDQWVKAMNLEMESMYFNLVWE
        T  ST+V ++    TRVV   S++   +P Q LREPR                                   AM DVD+D+W+KAMNLE+ESMYFN VW+
Subjt:  TNRSTKVADQADQSTRVVDRPSTSSPSRPSQELREPR-----------------------------------AMNDVDRDQWVKAMNLEMESMYFNLVWE

Query:  LVNQPDEIRPIYCKWIYKHKRDYMGNVQTYKAQLVAKGYTQ-----------------------------------------------------------
        LV+QPD ++PI CKWIYK KR   G VQT+KA+LVAKGYTQ                                                           
Subjt:  LVNQPDEIRPIYCKWIYKHKRDYMGNVQTYKAQLVAKGYTQ-----------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------EQCPKTPQEVEDMRRVPYASAVGSLMYAMLCTRPNICFAVGIVSRYQSNPEF
                                                        EQCPKTPQ+VE+MR +PYASAVGSLMYAMLCTRP+IC+AVGIVSRYQSNP  
Subjt:  ------------------------------------------------EQCPKTPQEVEDMRRVPYASAVGSLMYAMLCTRPNICFAVGIVSRYQSNPEF

Query:  ALWTVVKNILKYLRRTRNYVLVYENKDLILTGYTDFDF
        A WT VK ILKYLRRTR+Y+LVY +KDLILTGYTD DF
Subjt:  ALWTVVKNILKYLRRTRNYVLVYENKDLILTGYTDFDF

A0A5D3CPJ6 Gag/pol protein1.6e-19239.02Show/hide
Query:  RDAHDRWTKANEKVKVYILASISDILSKKHEK-------MNSLQVLFGQLSSSAMHDAIKYVYNCRMKEGTNVKEHVLDMMVHFNIAEVNRVVMNEKSQI
        R+ ++RW KANEK + YILAS+S++L+KKHE        M+SLQ +FGQ S    HDA+KY+YN RM EG +V+EHVL+MMVHFN+AE+N  V++E SQ+
Subjt:  RDAHDRWTKANEKVKVYILASISDILSKKHEK-------MNSLQVLFGQLSSSAMHDAIKYVYNCRMKEGTNVKEHVLDMMVHFNIAEVNRVVMNEKSQI

Query:  ------------------------------------YQSLLKNKGFEAKANVATTSKRRFQKGPTSGNKSVPK---NKE---------------------
                                            ++SL+K KG + +ANVA TS R+F +G TSG KS+P    NK+                     
Subjt:  ------------------------------------YQSLLKNKGFEAKANVATTSKRRFQKGPTSGNKSVPK---NKE---------------------

Query:  -----KGKCFYCNGDGHWKRNCPKYLADKKAGKE----------------------NQGATNHVCSSFQETSSWKQLEDGEITLWVGLGELISALAVGDV
             KG CF+CN +GHWKRNCPKYLA+KK  K+                      + GATNHVCSSFQ  SSW+QLE GE+T+ VG G ++SA+AVG +
Subjt:  -----KGKCFYCNGDGHWKRNCPKYLADKKAGKE----------------------NQGATNHVCSSFQETSSWKQLEDGEITLWVGLGELISALAVGDV

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------KIERLVKSGLLNKLEDDSLPPCESCLEGKMTKRSFTRKGLRAKEPLELVHLDLCGPMNVKARGRYEYFINFIDDYS--------------
                  +IERLVK+GLL++LE++SLP CESCLEGKMTKR FT KG RAKEPLELVH DLCGPMNVKARG +EYFI F DDYS              
Subjt:  ----------KIERLVKSGLLNKLEDDSLPPCESCLEGKMTKRSFTRKGLRAKEPLELVHLDLCGPMNVKARGRYEYFINFIDDYS--------------

Query:  ---------------------------------------------------STPQQNGVSERRNRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVP
                                                            TPQQNGVSERRNRTLLDMVRSMMSYA LPN FW +AV+T VYILN VP
Subjt:  ---------------------------------------------------STPQQNGVSERRNRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVP

Query:  SKSVLGTPYELWTGHKASLRHFRIWGCPAYVLVKNPKKLEHCSKLCLFIGYPKETRGGLFYDPQDEKVIVSTNATFLEEDHIRDHKPRSKLVLEAISKVA
        SKSV  TP +LW G K SLRHFRIWGCPA+VL  NPKKLE  SKLCLF+GYPK TRGG FYDP+D KV VSTNATFLEEDHIR+HKPRSK+VL  +SK  
Subjt:  SKSVLGTPYELWTGHKASLRHFRIWGCPAYVLVKNPKKLEHCSKLCLFIGYPKETRGGLFYDPQDEKVIVSTNATFLEEDHIRDHKPRSKLVLEAISKVA

Query:  TNRSTKVADQADQSTRVVDRPSTSSPSRPSQELREPR-----------------------------------AMNDVDRDQWVKAMNLEMESMYFNLVWE
        T  ST+V ++    TRVV   S++   +P Q LREPR                                   AM DVD+D+W+KAMNLE+ESMYFN VW+
Subjt:  TNRSTKVADQADQSTRVVDRPSTSSPSRPSQELREPR-----------------------------------AMNDVDRDQWVKAMNLEMESMYFNLVWE

Query:  LVNQPDEIRPIYCKWIYKHKRDYMGNVQTYKAQLVAKGYTQ-----------------------------------------------------------
        LV+QPD ++PI CKWIYK KR   G VQT+KA+LVAKGYTQ                                                           
Subjt:  LVNQPDEIRPIYCKWIYKHKRDYMGNVQTYKAQLVAKGYTQ-----------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------EQCPKTPQEVEDMRRVPYASAVGSLMYAMLCTRPNICFAVGIVSRYQSNPEF
                                                        EQCPKTPQ+VE+MR +PYASAVGSLMYAMLCTRP+IC+AVGIVSRYQSNP  
Subjt:  ------------------------------------------------EQCPKTPQEVEDMRRVPYASAVGSLMYAMLCTRPNICFAVGIVSRYQSNPEF

Query:  ALWTVVKNILKYLRRTRNYVLVYENKDLILTGYTDFDF
        A WT VK ILKYLRRTR+Y LVY +KDLILTGYTD DF
Subjt:  ALWTVVKNILKYLRRTRNYVLVYENKDLILTGYTDFDF

A0A5D3CSZ6 Gag/pol protein1.6e-19239.02Show/hide
Query:  RDAHDRWTKANEKVKVYILASISDILSKKHEK-------MNSLQVLFGQLSSSAMHDAIKYVYNCRMKEGTNVKEHVLDMMVHFNIAEVNRVVMNEKSQI
        R+ ++RW KANEK + YILAS+S++L+KKHE        M+SLQ +FGQ S    HDA+KY+YN RM EG +V+EHVL+MMVHFN+AE+N  V++E SQ+
Subjt:  RDAHDRWTKANEKVKVYILASISDILSKKHEK-------MNSLQVLFGQLSSSAMHDAIKYVYNCRMKEGTNVKEHVLDMMVHFNIAEVNRVVMNEKSQI

Query:  ------------------------------------YQSLLKNKGFEAKANVATTSKRRFQKGPTSGNKSVPK---NKE---------------------
                                            ++SL+K KG + +ANVA TS R+F +G TSG KS+P    NK+                     
Subjt:  ------------------------------------YQSLLKNKGFEAKANVATTSKRRFQKGPTSGNKSVPK---NKE---------------------

Query:  -----KGKCFYCNGDGHWKRNCPKYLADKKAGKE----------------------NQGATNHVCSSFQETSSWKQLEDGEITLWVGLGELISALAVGDV
             KG CF+CN +GHWKRNCPKYLA+KK  K+                      + GATNHVCSSFQ  SSW+QLE GE+T+ VG G ++SA+AVG +
Subjt:  -----KGKCFYCNGDGHWKRNCPKYLADKKAGKE----------------------NQGATNHVCSSFQETSSWKQLEDGEITLWVGLGELISALAVGDV

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------KIERLVKSGLLNKLEDDSLPPCESCLEGKMTKRSFTRKGLRAKEPLELVHLDLCGPMNVKARGRYEYFINFIDDYS--------------
                  +IERLVK+GLL++LE++SLP CESCLEGKMTKR FT KG RAKEPLELVH DLCGPMNVKARG +EYFI F DDYS              
Subjt:  ----------KIERLVKSGLLNKLEDDSLPPCESCLEGKMTKRSFTRKGLRAKEPLELVHLDLCGPMNVKARGRYEYFINFIDDYS--------------

Query:  ---------------------------------------------------STPQQNGVSERRNRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVP
                                                            TPQQNGVSERRNRTLLDMVRSMMSYA LPN FW +AV+T VYILN VP
Subjt:  ---------------------------------------------------STPQQNGVSERRNRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVP

Query:  SKSVLGTPYELWTGHKASLRHFRIWGCPAYVLVKNPKKLEHCSKLCLFIGYPKETRGGLFYDPQDEKVIVSTNATFLEEDHIRDHKPRSKLVLEAISKVA
        SKSV  TP +LW G K SLRHFRIWGCPA+VL  NPKKLE  SKLCLF+GYPK TRGG FYDP+D KV VSTNATFLEEDHIR+HKPRSK+VL  +SK  
Subjt:  SKSVLGTPYELWTGHKASLRHFRIWGCPAYVLVKNPKKLEHCSKLCLFIGYPKETRGGLFYDPQDEKVIVSTNATFLEEDHIRDHKPRSKLVLEAISKVA

Query:  TNRSTKVADQADQSTRVVDRPSTSSPSRPSQELREPR-----------------------------------AMNDVDRDQWVKAMNLEMESMYFNLVWE
        T  ST+V ++    TRVV   S++   +P Q LREPR                                   AM DVD+D+W+KAMNLE+ESMYFN VW+
Subjt:  TNRSTKVADQADQSTRVVDRPSTSSPSRPSQELREPR-----------------------------------AMNDVDRDQWVKAMNLEMESMYFNLVWE

Query:  LVNQPDEIRPIYCKWIYKHKRDYMGNVQTYKAQLVAKGYTQ-----------------------------------------------------------
        LV+QPD ++PI CKWIYK KR   G VQT+KA+LVAKGYTQ                                                           
Subjt:  LVNQPDEIRPIYCKWIYKHKRDYMGNVQTYKAQLVAKGYTQ-----------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------EQCPKTPQEVEDMRRVPYASAVGSLMYAMLCTRPNICFAVGIVSRYQSNPEF
                                                        EQCPKTPQ+VE+MR +PYASAVGSLMYAMLCTRP+IC+AVGIVSRYQSNP  
Subjt:  ------------------------------------------------EQCPKTPQEVEDMRRVPYASAVGSLMYAMLCTRPNICFAVGIVSRYQSNPEF

Query:  ALWTVVKNILKYLRRTRNYVLVYENKDLILTGYTDFDF
        A WT VK ILKYLRRTR+Y LVY +KDLILTGYTD DF
Subjt:  ALWTVVKNILKYLRRTRNYVLVYENKDLILTGYTDFDF

E2GK51 Gag/pol protein (Fragment)8.4e-20240.66Show/hide
Query:  RDAHDRWTKANEKVKVYILASISDILSKKHEK-------MNSLQVLFGQLSSSAMHDAIKYVYNCRMKEGTNVKEHVLDMMVHFNIAEVNRVVMNEKSQI
        R+A+DRW KAN+K +VYILAS++D+L+KKH+        M+SL+ +FGQ S S  H+AIK++Y  RMKEGT+V+EHVLDMM+HFNIAEVN   ++E +Q+
Subjt:  RDAHDRWTKANEKVKVYILASISDILSKKHEK-------MNSLQVLFGQLSSSAMHDAIKYVYNCRMKEGTNVKEHVLDMMVHFNIAEVNRVVMNEKSQI

Query:  ------------------------------------YQSLLKNKGFEAKANVATTSKRRFQKGPTSGNK--------------------SVPKNKEKGKC
                                            +Q+L  +KG E +ANVA T KR+F +G +S NK                     V KN +KGKC
Subjt:  ------------------------------------YQSLLKNKGFEAKANVATTSKRRFQKGPTSGNK--------------------SVPKNKEKGKC

Query:  FYCNGDGHWKRNCPKYLADKKAGKENQ------------------------GATNHVCSSFQETSSWKQLEDGEITLWVGLGELISALAVGDV-------
        F+CN DGHWKRNCPKYLA+KKA K  Q                        GATNH+C SFQETSSWK+L++GEITL VG GE++SA AVGD+       
Subjt:  FYCNGDGHWKRNCPKYLADKKAGKENQ------------------------GATNHVCSSFQETSSWKQLEDGEITLWVGLGELISALAVGDV-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -KIERLVKSGLLNKLEDDSLPPCESCLEGKMTKRSFTRKGLRAKEPLELVHLDLCGPMNVKARGRYEYFINFIDDYS-----------------------
         +IERLVKSG+LN+LED+SLPPCESCLEGKMTKRSFT KGLRAK PLELVH DLCGPMNVKARG YEYFI+FIDD+S                       
Subjt:  -KIERLVKSGLLNKLEDDSLPPCESCLEGKMTKRSFTRKGLRAKEPLELVHLDLCGPMNVKARGRYEYFINFIDDYS-----------------------

Query:  ------------------------------------------STPQQNGVSERRNRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVPSKSVLGTPY
                                                  STPQQNGVSERRNRTLLDMVRSMMSYA LP+ FW +A+ET ++ILNNVPSKSVL TPY
Subjt:  ------------------------------------------STPQQNGVSERRNRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVPSKSVLGTPY

Query:  ELWTGHKASLRHFRIWGCPAYVLVKNPKKLEHCSKLCLFIGYPKETRGGLFYDPQDEKVIVSTNATFLEEDHIRDHKPRSKLVLEAISKVATNRSTKVAD
        ELW G K+SLR+FRIWGCPA+VLV+NPKKLE  SKLCLF+GYPKE+RGGLFY PQ+ KV VSTNATFLEEDH R+H+PRSK+VL+ + K AT       D
Subjt:  ELWTGHKASLRHFRIWGCPAYVLVKNPKKLEHCSKLCLFIGYPKETRGGLFYDPQDEKVIVSTNATFLEEDHIRDHKPRSKLVLEAISKVATNRSTKVAD

Query:  QADQSTRVVDRPSTSSPSRPSQELREPR-----------------------------------AMNDVDRDQWVKAMNLEMESMYFNLVWELVNQPDEIR
        +   ST+VVD+ + S  S  SQELR PR                                   AMNDVDRDQW+KAMNLEMESMYFN VW LV+ P +++
Subjt:  QADQSTRVVDRPSTSSPSRPSQELREPR-----------------------------------AMNDVDRDQWVKAMNLEMESMYFNLVWELVNQPDEIR

Query:  PIYCKWIYKHKRDYMGNVQTYKAQLVAKGYTQ--------------------------------------------------------------------
        PI CKWIYK KRD  G VQT+KA+LVAKGYTQ                                                                    
Subjt:  PIYCKWIYKHKRDYMGNVQTYKAQLVAKGYTQ--------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------EQCPKTPQEVEDMRRVPYASAVGSLMYAMLCTRPNICFAVGIVSRYQSNPEFALWTVVKNI
                                               EQCPKTPQEVEDMR +PY+SAVGSLMYAMLCTRP+IC++VGIVSRYQSNP    WT VKNI
Subjt:  ---------------------------------------EQCPKTPQEVEDMRRVPYASAVGSLMYAMLCTRPNICFAVGIVSRYQSNPEFALWTVVKNI

Query:  LKYLRRTRNYVLVYENKDLILTGYTDFDF
        LKYLRRTRNY+LVY  KDLILTGYTD DF
Subjt:  LKYLRRTRNYVLVYENKDLILTGYTDFDF

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.0e-2328.18Show/hide
Query:  LLNKLEDDSLPPCESCLEGKMTKRSF--TRKGLRAKEPLELVHLDLCGPMNVKARGRYEYFINFIDDYS-------------------------------
        LLN LE  S   CE CL GK  +  F   +     K PL +VH D+CGP+         YF+ F+D ++                               
Subjt:  LLNKLEDDSLPPCESCLEGKMTKRSF--TRKGLRAKEPLELVHLDLCGPMNVKARGRYEYFINFIDDYS-------------------------------

Query:  ----------------------------------STPQQNGVSERRNRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVPSKSVLG---TPYELWTG
                                           TPQ NGVSER  RT+ +  R+M+S A L   FW  AV T  Y++N +PS++++    TPYE+W  
Subjt:  ----------------------------------STPQQNGVSERRNRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVPSKSVLG---TPYELWTG

Query:  HKASLRHFRIWGCPAYVLVKNPK-KLEHCSKLCLFIGYPKETRGGLFYDPQDEKVIVSTNATFLEEDHIRDHKPRSKLVLEAISKVATNRS
         K  L+H R++G   YV +KN + K +  S   +F+GY  E  G   +D  +EK IV+ +    E + +     + + V    SK + N++
Subjt:  HKASLRHFRIWGCPAYVLVKNPK-KLEHCSKLCLFIGYPKETRGGLFYDPQDEKVIVSTNATFLEEDHIRDHKPRSKLVLEAISKVATNRS

P04146 Copia protein6.1e-0842.5Show/hide
Query:  PYASAVGSLMYAMLCTRPNICFAVGIVSRYQSNPEFALWTVVKNILKYLRRTRNYVLV------YENKDLILTGYTDFDF
        P  S +G LMY MLCTRP++  AV I+SRY S     LW  +K +L+YL+ T +  L+      +ENK   + GY D D+
Subjt:  PYASAVGSLMYAMLCTRPNICFAVGIVSRYQSNPEFALWTVVKNILKYLRRTRNYVLV------YENKDLILTGYTDFDF

P04146 Copia protein2.2e-0532.31Show/hide
Query:  DRDQWVKAMNLEMESMYFNLVWELVNQPDEIRPIYCKWIYKHKRDYMGNVQTYKAQLVAKGYTQE
        D+  W +A+N E+ +   N  W +  +P+    +  +W++  K + +GN   YKA+LVA+G+TQ+
Subjt:  DRDQWVKAMNLEMESMYFNLVWELVNQPDEIRPIYCKWIYKHKRDYMGNVQTYKAQLVAKGYTQE

P0CV72 Secreted RxLR effector protein 1616.6e-1041.77Show/hide
Query:  MRRVPYASAVGSLMYAMLCTRPNICFAVGIVSRYQSNPEFALWTVVKNILKYLRRTRNYVLVYENKDLI-LTGYTDFDF
        M+ VPY SAVG++MY M+ TRP++  AVG++S++ S+P    W  +K +L+YL+ T+ Y L +       L GY+D D+
Subjt:  MRRVPYASAVGSLMYAMLCTRPNICFAVGIVSRYQSNPEFALWTVVKNILKYLRRTRNYVLVYENKDLI-LTGYTDFDF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.1e-4423.83Show/hide
Query:  IERLVKSGLLNKLEDDSLPPCESCLEGKMTKRSFTRKGLRAKEPLELVHLDLCGPMNVKARGRYEYFINFIDDYS-------------------------
        ++ L K  L++  +  ++ PC+ CL GK  + SF     R    L+LV+ D+CGPM +++ G  +YF+ FIDD S                         
Subjt:  IERLVKSGLLNKLEDDSLPPCESCLEGKMTKRSFTRKGLRAKEPLELVHLDLCGPMNVKARGRYEYFINFIDDYS-------------------------

Query:  ----------------------------------------STPQQNGVSERRNRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVPSKSV-LGTPYE
                                                 TPQ NGV+ER NRT+++ VRSM+  A LP  FW  AV+T  Y++N  PS  +    P  
Subjt:  ----------------------------------------STPQQNGVSERRNRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVPSKSV-LGTPYE

Query:  LWTGHKASLRHFRIWGCPAYVLVKNPK--KLEHCSKLCLFIGYPKETRGGLFYDPQDEKVIVSTNATFLEEDHIRDHKPRSKLVLEAI------------
        +WT  + S  H +++GC A+  V   +  KL+  S  C+FIGY  E  G   +DP  +KVI S +  F  E  +R     S+ V   I            
Subjt:  LWTGHKASLRHFRIWGCPAYVLVKNPK--KLEHCSKLCLFIGYPKETRGGLFYDPQDEKVIVSTNATFLEEDHIRDHKPRSKLVLEAI------------

Query:  -SKVATNRSTKVADQADQSTRVV-----------------------------DRPSTSSPSRPSQEL------REPRAMNDV----DRDQWVKAMNLEME
            A + + +V++Q +Q   V+                             +RP   S   PS E       REP ++ +V    +++Q +KAM  EME
Subjt:  -SKVATNRSTKVADQADQSTRVV-----------------------------DRPSTSSPSRPSQEL------REPRAMNDV----DRDQWVKAMNLEME

Query:  SMYFNLVWELVNQPDEIRPIYCKWIYKHKRDYMGNVQTYKAQLVAKGYTQEQ------------------------------------------------
        S+  N  ++LV  P   RP+ CKW++K K+D    +  YKA+LV KG+ Q++                                                
Subjt:  SMYFNLVWELVNQPDEIRPIYCKWIYKHKRDYMGNVQTYKAQLVAKGYTQEQ------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------CPKTPQEVEDMRRVPYASAVGSLMYAMLCTRPNICFAVGI
                                                                    CP T +E  +M +VPY+SAVGSLMYAM+CTRP+I  AVG+
Subjt:  ------------------------------------------------------------CPKTPQEVEDMRRVPYASAVGSLMYAMLCTRPNICFAVGI

Query:  VSRYQSNPEFALWTVVKNILKYLRRTRNYVLVYENKDLILTGYTDFD
        VSR+  NP    W  VK IL+YLR T    L +   D IL GYTD D
Subjt:  VSRYQSNPEFALWTVVKNILKYLRRTRNYVLVYENKDLILTGYTDFD

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.3e-1325.12Show/hide
Query:  YEYF----INFIDDYSSTPQQNGVSERRNRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVPSKSV-LGTPYELWTGHKASLRHFRIWGCPAYVLVK
        +EYF    I+ +     TP+ NG+SER++R +++   +++S+AS+P  +W +A    VY++N +P+  + L +P++   G   +    R++GC  Y  ++
Subjt:  YEYF----INFIDDYSSTPQQNGVSERRNRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVPSKSV-LGTPYELWTGHKASLRHFRIWGCPAYVLVK

Query:  --NPKKLEHCSKLCLFIGYPKETRGGLFYDPQDEKVIVSTNATFLEE-----------DHIRDHKPRSKLVLEAISKVATNRSTKVADQADQSTRVVDRP
          N  KL+  S+ C+F+GY       L    Q  ++ +S +  F E              +++ +  S  V    + + T      A            P
Subjt:  --NPKKLEHCSKLCLFIGYPKETRGGLFYDPQDEKVIVSTNATFLEE-----------DHIRDHKPRSKLVLEAISKVATNRSTKVADQADQSTRVVDRP

Query:  ST-SSPSRPSQ
        S+ S+P R SQ
Subjt:  ST-SSPSRPSQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.8e-1334.31Show/hide
Query:  TPQQNGVSERRNRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVPSKSV-LGTPYELWTGHKASLRHFRIWGCPAYVLVK--NPKKLEHCSKLCLFI
        TP+ NG+SER++R +++M  +++S+AS+P  +W +A    VY++N +P+  + L +P++   G   +    +++GC  Y  ++  N  KLE  SK C F+
Subjt:  TPQQNGVSERRNRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVPSKSV-LGTPYELWTGHKASLRHFRIWGCPAYVLVK--NPKKLEHCSKLCLFI

Query:  GY
        GY
Subjt:  GY

Arabidopsis top hitse value%identityAlignment
AT2G24100.1 unknown protein2.7e-1170.83Show/hide
Query:  NNPLDEPSPLAKTLPCQLARRPLFFRETNPQPRKHTLWQATTDFTNSE
        N P DEP     TL   LARRPLFFRETNPQPRKHTLWQAT+DFT+ +
Subjt:  NNPLDEPSPLAKTLPCQLARRPLFFRETNPQPRKHTLWQATTDFTNSE

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.0e-0940.79Show/hide
Query:  REPRAMNDVDRD-QWVKAMNLEMESMYFNLVWELVNQPDEIRPIYCKWIYKHKRDYMGNVQTYKAQLVAKGYTQEQ
        +EP   N+      W  AM+ E+ +M     WE+   P   +PI CKW+YK K +  G ++ YKA+LVAKGYTQ++
Subjt:  REPRAMNDVDRD-QWVKAMNLEMESMYFNLVWELVNQPDEIRPIYCKWIYKHKRDYMGNVQTYKAQLVAKGYTQEQ

AT4G30780.1 unknown protein5.2e-1066.67Show/hide
Query:  NNPLDEPSPLAKTLPCQLARRPLFFRETNPQPRKHTLWQATTDFTNSE
        N P D P     TL   LAR+PLFFRETNPQPRKHTLWQAT+DFT+ +
Subjt:  NNPLDEPSPLAKTLPCQLARRPLFFRETNPQPRKHTLWQATTDFTNSE

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein5.9e-0636.23Show/hide
Query:  NRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVPSKSV-LGTPYELWTGHKASLRHFRIWGCPAYV
        NRT+++ VRSM+    LP  F   A  T V+I+N  PS ++    P E+W     +  + R +GC AY+
Subjt:  NRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVPSKSV-LGTPYELWTGHKASLRHFRIWGCPAYV

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)7.7e-0635.53Show/hide
Query:  REPRAMNDVDRDQ-WVKAMNLEMESMYFNLVWELVNQPDEIRPIYCKWIYKHKRDYMGNVQTYKAQLVAKGYTQEQ
        +EP+++    +D  W +AM  E++++  N  W LV  P     + CKW++K K    G +   KA+LVAKG+ QE+
Subjt:  REPRAMNDVDRDQ-WVKAMNLEMESMYFNLVWELVNQPDEIRPIYCKWIYKHKRDYMGNVQTYKAQLVAKGYTQEQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAAACTTTTGCTCATGGTTCTCAAGATGACTAGTGCATCACCATCTAGTATCTCAAAGCCACCTGGTACCTCAGAAGTCAGAATTTTGTCCATGTCCATAAGATT
CACATCCCGTATGTACCGAATGAGGAGATTGTGGGTTTCTTATGGCCAGAGGAAAAGGAAATCGAATTTGACGACAATACTGGCAGACACCAATGATAGTAGAGGTGGAA
AACGACGACAACATCGAGAGTTTAGGACGACGTTGAGGACGAAGTATAGTGGTAATAGGGGCGCAAAACATCATTTTTCTCCATTAGCAGTATTACAAGTGGAAATATGG
GTTGCCGACGGAACTGTGCACGAGTTGTTAATGGATCTGCGCATGGGTTGCCGACGGATCTATAGATGGACATATCTGTGCACAGGTGGATTTGCAGTCAGTTCTGAGTC
TGAATGGATGGATCTGGTCAGATCTGAAAGGTTGGATCTAGTGAGAACATGGGTCGGATTTGTGCACGGGTGGATCTGGTCGGATCTGCGCGCATTGGTGGATCTGTACC
TGGGTTGCCGACGAAGAAAGACAGATCTGAGAGGTTTACCGACGAAAACGCGACAGGAGACTAATTTAGCAATGATTGAGTACAACCAAAACGAGGAAGAACATATGGAG
TTGCTTGATCTGGTCGACGGCAGGAGATTTGGTAGCGGTGGCGAATCTAGTAGGTGCAGAGATCCGGATCTAGCGGCTACAGACTGGAACGAGTGGGCAGCGACGGCTAA
TAAAACCCTAGGAACAAGAGATTGGCAGTCTGTTTCGAGACCTGACGCATTCTCTACAACCTCTTCACACAATAATCCACTTGATGAGCCTAGTCCTCTTGCCAAGACGC
TACCATGTCAGTTGGCTAGAAGGCCTCTTTTCTTCCGGGAGACCAACCCTCAACCAAGGAAGCACACGTTGTGGCAAGCAACAACTGATTTTACTAATAGTGAGAATTAT
CCCTTGATCTGCATAGGTGAGAGTAAGCTCAATAGCGCTGCTCAACATTCCTCCCATTTCAGGGATAAGACCGGGGTTAATTTTGCAAAATTCCATCCATTGTTCATCTT
CTTCCCAACCTTCATTCCTTCTCAGATTCACCACAAATTTTGGATCCCACCATCAAATCTAAGCCTAGAGAATAGTAGAGAAGACTCTTGTGGTGGTCTACAACCAAGTG
GTGCTGAAATTCATCAAGAATTGAAGGAGAATTTGGAGCTTCAATGGTCACACCGTGAGATTCCTATGTTGCACCTGCTTGGCGTTTATAGGCGCCTATCCCTTCGGAGG
GACGCTCATGATAGATGGACTAAGGCTAATGAAAAAGTCAAAGTCTATATTTTGGCAAGTATATCAGATATACTTTCCAAAAAGCATGAGAAAATGAACTCACTGCAAGT
CCTATTTGGACAACTGTCCTCCTCGGCTATGCATGATGCTATCAAATATGTTTACAATTGCCGGATGAAAGAAGGAACGAATGTTAAGGAACATGTTCTGGACATGATGG
TCCATTTCAACATCGCGGAAGTAAACAGGGTTGTCATGAACGAGAAGAGCCAAATTTACCAATCCTTATTGAAAAATAAGGGTTTTGAGGCTAAGGCAAATGTTGCTACT
ACCTCAAAGAGGAGATTCCAAAAGGGACCTACCTCTGGAAATAAATCCGTCCCAAAAAATAAAGAGAAAGGAAAATGTTTCTACTGCAACGGTGATGGGCACTGGAAAAG
GAACTGCCCAAAGTACCTTGCCGATAAGAAGGCTGGAAAAGAGAATCAAGGGGCCACTAATCATGTTTGTTCTTCTTTTCAGGAAACTAGTTCCTGGAAACAACTAGAAG
ATGGTGAGATAACTCTCTGGGTTGGATTAGGAGAGCTCATCTCAGCTTTGGCAGTAGGAGATGTAAAGATTGAGAGATTGGTCAAGAGTGGACTTCTAAACAAATTAGAA
GATGACTCTCTACCACCATGTGAGTCTTGTCTTGAGGGAAAAATGACAAAGAGATCTTTTACTAGAAAAGGTCTTCGAGCCAAAGAACCCTTAGAACTTGTGCATTTGGA
CCTTTGTGGACCAATGAATGTAAAGGCTCGAGGACGGTATGAATATTTCATCAACTTTATTGATGATTATTCAAGTACGCCACAGCAGAACGGAGTATCTGAAAGGAGAA
ACAGAACCTTGTTGGACATGGTTCGTTCGATGATGAGTTACGCATCGTTGCCTAACTTGTTTTGGGAATTCGCAGTGGAGACCGAGGTTTATATTTTGAATAATGTTCCC
TCTAAAAGTGTTTTAGGAACACCCTATGAGTTATGGACAGGGCATAAAGCTAGTTTACGTCACTTCCGGATATGGGGTTGCCCAGCATATGTGTTGGTGAAAAATCCTAA
AAAATTGGAACATTGTTCAAAATTATGCCTATTTATAGGATACCCTAAAGAAACGAGAGGTGGTCTATTTTATGATCCTCAAGATGAAAAGGTAATTGTATCGACAAATG
CCACATTCCTAGAGGAAGATCACATACGTGATCATAAACCACGCAGTAAACTAGTATTGGAAGCGATTTCAAAAGTTGCTACAAATAGATCAACAAAAGTTGCTGATCAA
GCTGACCAATCAACAAGAGTTGTTGATAGACCAAGCACATCTAGTCCGTCACGTCCATCTCAAGAGTTGAGAGAGCCTCGAGCAATGAATGATGTTGATCGAGATCAGTG
GGTCAAAGCCATGAACCTCGAAATGGAGTCTATGTACTTCAATTTAGTCTGGGAACTTGTAAATCAACCTGATGAGATAAGACCTATCTATTGTAAATGGATTTACAAAC
ATAAACGAGACTATATGGGGAACGTACAGACCTATAAAGCCCAGTTAGTAGCAAAGGGTTATACCCAGGAACAATGTCCTAAAACACCTCAAGAGGTTGAGGATATGAGA
CGAGTTCCCTACGCATCAGCTGTCGGGAGTCTGATGTATGCAATGCTATGTACACGACCAAACATATGTTTTGCAGTAGGAATTGTCAGTAGGTATCAGTCCAATCCCGA
GTTTGCACTTTGGACTGTCGTTAAGAATATCCTCAAGTATCTAAGGAGAACGAGGAACTACGTGCTTGTGTATGAAAATAAGGATCTGATCCTTACCGGATACACTGATT
TCGATTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAAACTTTTGCTCATGGTTCTCAAGATGACTAGTGCATCACCATCTAGTATCTCAAAGCCACCTGGTACCTCAGAAGTCAGAATTTTGTCCATGTCCATAAGATT
CACATCCCGTATGTACCGAATGAGGAGATTGTGGGTTTCTTATGGCCAGAGGAAAAGGAAATCGAATTTGACGACAATACTGGCAGACACCAATGATAGTAGAGGTGGAA
AACGACGACAACATCGAGAGTTTAGGACGACGTTGAGGACGAAGTATAGTGGTAATAGGGGCGCAAAACATCATTTTTCTCCATTAGCAGTATTACAAGTGGAAATATGG
GTTGCCGACGGAACTGTGCACGAGTTGTTAATGGATCTGCGCATGGGTTGCCGACGGATCTATAGATGGACATATCTGTGCACAGGTGGATTTGCAGTCAGTTCTGAGTC
TGAATGGATGGATCTGGTCAGATCTGAAAGGTTGGATCTAGTGAGAACATGGGTCGGATTTGTGCACGGGTGGATCTGGTCGGATCTGCGCGCATTGGTGGATCTGTACC
TGGGTTGCCGACGAAGAAAGACAGATCTGAGAGGTTTACCGACGAAAACGCGACAGGAGACTAATTTAGCAATGATTGAGTACAACCAAAACGAGGAAGAACATATGGAG
TTGCTTGATCTGGTCGACGGCAGGAGATTTGGTAGCGGTGGCGAATCTAGTAGGTGCAGAGATCCGGATCTAGCGGCTACAGACTGGAACGAGTGGGCAGCGACGGCTAA
TAAAACCCTAGGAACAAGAGATTGGCAGTCTGTTTCGAGACCTGACGCATTCTCTACAACCTCTTCACACAATAATCCACTTGATGAGCCTAGTCCTCTTGCCAAGACGC
TACCATGTCAGTTGGCTAGAAGGCCTCTTTTCTTCCGGGAGACCAACCCTCAACCAAGGAAGCACACGTTGTGGCAAGCAACAACTGATTTTACTAATAGTGAGAATTAT
CCCTTGATCTGCATAGGTGAGAGTAAGCTCAATAGCGCTGCTCAACATTCCTCCCATTTCAGGGATAAGACCGGGGTTAATTTTGCAAAATTCCATCCATTGTTCATCTT
CTTCCCAACCTTCATTCCTTCTCAGATTCACCACAAATTTTGGATCCCACCATCAAATCTAAGCCTAGAGAATAGTAGAGAAGACTCTTGTGGTGGTCTACAACCAAGTG
GTGCTGAAATTCATCAAGAATTGAAGGAGAATTTGGAGCTTCAATGGTCACACCGTGAGATTCCTATGTTGCACCTGCTTGGCGTTTATAGGCGCCTATCCCTTCGGAGG
GACGCTCATGATAGATGGACTAAGGCTAATGAAAAAGTCAAAGTCTATATTTTGGCAAGTATATCAGATATACTTTCCAAAAAGCATGAGAAAATGAACTCACTGCAAGT
CCTATTTGGACAACTGTCCTCCTCGGCTATGCATGATGCTATCAAATATGTTTACAATTGCCGGATGAAAGAAGGAACGAATGTTAAGGAACATGTTCTGGACATGATGG
TCCATTTCAACATCGCGGAAGTAAACAGGGTTGTCATGAACGAGAAGAGCCAAATTTACCAATCCTTATTGAAAAATAAGGGTTTTGAGGCTAAGGCAAATGTTGCTACT
ACCTCAAAGAGGAGATTCCAAAAGGGACCTACCTCTGGAAATAAATCCGTCCCAAAAAATAAAGAGAAAGGAAAATGTTTCTACTGCAACGGTGATGGGCACTGGAAAAG
GAACTGCCCAAAGTACCTTGCCGATAAGAAGGCTGGAAAAGAGAATCAAGGGGCCACTAATCATGTTTGTTCTTCTTTTCAGGAAACTAGTTCCTGGAAACAACTAGAAG
ATGGTGAGATAACTCTCTGGGTTGGATTAGGAGAGCTCATCTCAGCTTTGGCAGTAGGAGATGTAAAGATTGAGAGATTGGTCAAGAGTGGACTTCTAAACAAATTAGAA
GATGACTCTCTACCACCATGTGAGTCTTGTCTTGAGGGAAAAATGACAAAGAGATCTTTTACTAGAAAAGGTCTTCGAGCCAAAGAACCCTTAGAACTTGTGCATTTGGA
CCTTTGTGGACCAATGAATGTAAAGGCTCGAGGACGGTATGAATATTTCATCAACTTTATTGATGATTATTCAAGTACGCCACAGCAGAACGGAGTATCTGAAAGGAGAA
ACAGAACCTTGTTGGACATGGTTCGTTCGATGATGAGTTACGCATCGTTGCCTAACTTGTTTTGGGAATTCGCAGTGGAGACCGAGGTTTATATTTTGAATAATGTTCCC
TCTAAAAGTGTTTTAGGAACACCCTATGAGTTATGGACAGGGCATAAAGCTAGTTTACGTCACTTCCGGATATGGGGTTGCCCAGCATATGTGTTGGTGAAAAATCCTAA
AAAATTGGAACATTGTTCAAAATTATGCCTATTTATAGGATACCCTAAAGAAACGAGAGGTGGTCTATTTTATGATCCTCAAGATGAAAAGGTAATTGTATCGACAAATG
CCACATTCCTAGAGGAAGATCACATACGTGATCATAAACCACGCAGTAAACTAGTATTGGAAGCGATTTCAAAAGTTGCTACAAATAGATCAACAAAAGTTGCTGATCAA
GCTGACCAATCAACAAGAGTTGTTGATAGACCAAGCACATCTAGTCCGTCACGTCCATCTCAAGAGTTGAGAGAGCCTCGAGCAATGAATGATGTTGATCGAGATCAGTG
GGTCAAAGCCATGAACCTCGAAATGGAGTCTATGTACTTCAATTTAGTCTGGGAACTTGTAAATCAACCTGATGAGATAAGACCTATCTATTGTAAATGGATTTACAAAC
ATAAACGAGACTATATGGGGAACGTACAGACCTATAAAGCCCAGTTAGTAGCAAAGGGTTATACCCAGGAACAATGTCCTAAAACACCTCAAGAGGTTGAGGATATGAGA
CGAGTTCCCTACGCATCAGCTGTCGGGAGTCTGATGTATGCAATGCTATGTACACGACCAAACATATGTTTTGCAGTAGGAATTGTCAGTAGGTATCAGTCCAATCCCGA
GTTTGCACTTTGGACTGTCGTTAAGAATATCCTCAAGTATCTAAGGAGAACGAGGAACTACGTGCTTGTGTATGAAAATAAGGATCTGATCCTTACCGGATACACTGATT
TCGATTTTTAAACCGATATAGACTCAAGGAGATCACTTTGGGATCGGTATTCACTCTTATTGGAGGAGCAGTAGTGTGGAGAAATATTAAGCAGAGCTGCATTGCCGACT
CCACAATGGAAGCATAATATGTTGCTACATGCAAGGCAGCCAAAGAAGCAGTATGACTTAGGAAGTTTCTAAATTCACCACAAATTTTACATTGTGATAACAGTGAGGCA
GTAACGAACTCCAAAGAACCAAGAAGCCATAATCGTGGAAAGCATAATGAGCACAAATACCATCTCATCAGAGAAATTGTGCAGAGAGGAGACGTTATTGTAAAACAAAT
AGCATTTGAG
Protein sequenceShow/hide protein sequence
MAKLLLMVLKMTSASPSSISKPPGTSEVRILSMSIRFTSRMYRMRRLWVSYGQRKRKSNLTTILADTNDSRGGKRRQHREFRTTLRTKYSGNRGAKHHFSPLAVLQVEIW
VADGTVHELLMDLRMGCRRIYRWTYLCTGGFAVSSESEWMDLVRSERLDLVRTWVGFVHGWIWSDLRALVDLYLGCRRRKTDLRGLPTKTRQETNLAMIEYNQNEEEHME
LLDLVDGRRFGSGGESSRCRDPDLAATDWNEWAATANKTLGTRDWQSVSRPDAFSTTSSHNNPLDEPSPLAKTLPCQLARRPLFFRETNPQPRKHTLWQATTDFTNSENY
PLICIGESKLNSAAQHSSHFRDKTGVNFAKFHPLFIFFPTFIPSQIHHKFWIPPSNLSLENSREDSCGGLQPSGAEIHQELKENLELQWSHREIPMLHLLGVYRRLSLRR
DAHDRWTKANEKVKVYILASISDILSKKHEKMNSLQVLFGQLSSSAMHDAIKYVYNCRMKEGTNVKEHVLDMMVHFNIAEVNRVVMNEKSQIYQSLLKNKGFEAKANVAT
TSKRRFQKGPTSGNKSVPKNKEKGKCFYCNGDGHWKRNCPKYLADKKAGKENQGATNHVCSSFQETSSWKQLEDGEITLWVGLGELISALAVGDVKIERLVKSGLLNKLE
DDSLPPCESCLEGKMTKRSFTRKGLRAKEPLELVHLDLCGPMNVKARGRYEYFINFIDDYSSTPQQNGVSERRNRTLLDMVRSMMSYASLPNLFWEFAVETEVYILNNVP
SKSVLGTPYELWTGHKASLRHFRIWGCPAYVLVKNPKKLEHCSKLCLFIGYPKETRGGLFYDPQDEKVIVSTNATFLEEDHIRDHKPRSKLVLEAISKVATNRSTKVADQ
ADQSTRVVDRPSTSSPSRPSQELREPRAMNDVDRDQWVKAMNLEMESMYFNLVWELVNQPDEIRPIYCKWIYKHKRDYMGNVQTYKAQLVAKGYTQEQCPKTPQEVEDMR
RVPYASAVGSLMYAMLCTRPNICFAVGIVSRYQSNPEFALWTVVKNILKYLRRTRNYVLVYENKDLILTGYTDFDF