; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G01185 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G01185
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionGag/pol protein
Genome locationClcChr04:3332872..3340937
RNA-Seq ExpressionClc04G01185
SyntenyClc04G01185
Gene Ontology termsGO:0006334 - nucleosome assembly (biological process)
GO:0000786 - nucleosome (cellular component)
GO:0003677 - DNA binding (molecular function)
InterPro domainsIPR005819 - Linker histone H1/H5
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026233.1 gag/pol protein [Cucumis melo var. makuwa]2.0e-14650.72Show/hide
Query:  DLKFAITEECSQVPTQNVSQNLRDAYEKWIRANEKASAYILANFSKVFAKKHETMVTAH---------YGFIAKDV-WTVRSWIHCKRCLDNRPIRSKHV
        DL+F + E+C QVP  N ++ +R+ YE+W +ANEKA AYILA+ S+V AKKHE+M+TA          +G  +  +      +I+  R  +   +R +HV
Subjt:  DLKFAITEECSQVPTQNVSQNLRDAYEKWIRANEKASAYILANFSKVFAKKHETMVTAH---------YGFIAKDV-WTVRSWIHCKRCLDNRPIRSKHV

Query:  LNMMVHFNVAEMNGAIIVEANYVSFIMESLPENFLQFRINA--------------------SMHKGKRPDGEANVARFSKKFHKGLTSGTKS-ASSSTRK
        LNMMVHFNVA MN A+I EA+ VSFI+ESLPE+FLQFR NA                    S+ K K   GEANVA  ++KFH+G TSGTKS  SSS  K
Subjt:  LNMMVHFNVAEMNGAIIVEANYVSFIMESLPENFLQFRINA--------------------SMHKGKRPDGEANVARFSKKFHKGLTSGTKS-ASSSTRK

Query:  KWRKKKGNKGGNKTNQAAGQ--KGKQKALGI-------SSWHQLGTSDMTLQVGTGQIVSTITVGGKMTKRSFTGKGYRAKEPLKLVYSDLCGPMNVKAR
        KW+KKKG + GNK N AA +  K  + A GI         W +     +  +    Q        GKMTKR FTGKG+RAKEPL+LV+SDLCGPMNVKAR
Subjt:  KWRKKKGNKGGNKTNQAAGQ--KGKQKALGI-------SSWHQLGTSDMTLQVGTGQIVSTITVGGKMTKRSFTGKGYRAKEPLKLVYSDLCGPMNVKAR

Query:  GG-------------YGYLYLMQHKSEALEKFEKFKAEVQNLL---------------------------------------------ERRNRTLLDMVW
        GG             YGY+YLMQHKSEALEKF+++KAEV+N L                                             ERRNRTLLDMV 
Subjt:  GG-------------YGYLYLMQHKSEALEKFEKFKAEVQNLL---------------------------------------------ERRNRTLLDMVW

Query:  FMMSYAQLLDSFWGYAITTALQILNNVPSKSVSKTPLELWRGRKYSLRHFRIWGSLAYVLETNPKKLEPRSRVCLFVGYPKETRGGFFFDPKLNKVFVST
         MMSYA L +SFWGYA+ TA+ ILN VPSKSVS+TPL+LW G K SLRHFRIWG  A+VLE NPKKLEPRS++CLFVGYPK TRGG+F+DPK NKVFVST
Subjt:  FMMSYAQLLDSFWGYAITTALQILNNVPSKSVSKTPLELWRGRKYSLRHFRIWGSLAYVLETNPKKLEPRSRVCLFVGYPKETRGGFFFDPKLNKVFVST

Query:  NAKFLEEDHIREH------------------ISHIVEEASTSTRVVNGASSSRTHPPQEWREPHRTGRVVNPPTLYLGLIEAQDIIPDDEVDDPLTYNQA
        NA FLEEDHIREH                   + +VEE S   RVV+  SS+RTH PQ  REP R+GRV N P  Y+ L E   +I D +++DPLT+ +A
Subjt:  NAKFLEEDHIREH------------------ISHIVEEASTSTRVVNGASSSRTHPPQEWREPHRTGRVVNPPTLYLGLIEAQDIIPDDEVDDPLTYNQA

Query:  MTDVDKDEWVKAKDLEMESMFFNSL
        M DVDKDEW+KA +LE+ESM+FNS+
Subjt:  MTDVDKDEWVKAKDLEMESMFFNSL

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]8.3e-14041.46Show/hide
Query:  DLKFAITEECSQVPTQNVSQNLRDAYEKWIRANEKASAYILANFSKVFAKKHETMVTAH---------YGFIAKDV-WTVRSWIHCKRCLDNRPIRSKHV
        DL+F + EEC QVP  N ++ +R+ YE+W +ANEKA AYILA+ S+V AKKHE+M+TA          +G  +  +      +I+  R  +   +R +HV
Subjt:  DLKFAITEECSQVPTQNVSQNLRDAYEKWIRANEKASAYILANFSKVFAKKHETMVTAH---------YGFIAKDV-WTVRSWIHCKRCLDNRPIRSKHV

Query:  LNMMVHFNVAEMNGAIIVEANYVSFIMESLPENFLQFRINA--------------------SMHKGKRPDGEANVARFSKKFHKGLTSGTKS-ASSSTRK
        LNMMVHFNVAEMNGA+I EA+ VSFI+ESLPE+FLQFR NA                    S+ K K   GEANVA  ++KFH+G TSGTKS  SSS  K
Subjt:  LNMMVHFNVAEMNGAIIVEANYVSFIMESLPENFLQFRINA--------------------SMHKGKRPDGEANVARFSKKFHKGLTSGTKS-ASSSTRK

Query:  KWRKKKGNKGGNKTNQAA-----------------GQKG------------KQKA------------------------------------LGISSWHQL
        KW+KKKG + GNK N AA                  Q+G            K+KA                                     GISSW QL
Subjt:  KWRKKKGNKGGNKTNQAA-----------------GQKG------------KQKA------------------------------------LGISSWHQL

Query:  GTSDMTLQVGTGQIVSTITVG-------------------------------------------------------------------------------
         T +MT++VGTG +VS I VG                                                                               
Subjt:  GTSDMTLQVGTGQIVSTITVG-------------------------------------------------------------------------------

Query:  ------------------------------------------------------------GKMTKRSFTGKGYRAKEPLKLVYSDLCGPMNVKARGG---
                                                                    GKMTKR FTGKG+RAKEPL+LV+SDLCGPMNVKARGG   
Subjt:  ------------------------------------------------------------GKMTKRSFTGKGYRAKEPLKLVYSDLCGPMNVKARGG---

Query:  ----------YGYLYLMQHKSEALEKFEKFKAEVQNLL---------------------------------------------ERRNRTLLDMVWFMMSY
                  YGY+YLMQHKSEALEKF+++KAEV+N L                                             ERRNRTLLDMV  MMSY
Subjt:  ----------YGYLYLMQHKSEALEKFEKFKAEVQNLL---------------------------------------------ERRNRTLLDMVWFMMSY

Query:  AQLLDSFWGYAITTALQILNNVPSKSVSKTPLELWRGRKYSLRHFRIWGSLAYVLETNPKKLEPRSRVCLFVGYPKETRGGFFFDPKLNKVFVSTNAKFL
        A L +SFWGYA+ TA+ ILN VPSKSVS+TPL+LW GRK SLRHFRIWG  A+VLE NPKKLEPRS++CLFVGYPK TRGG+F+DPK NKVFVSTNA FL
Subjt:  AQLLDSFWGYAITTALQILNNVPSKSVSKTPLELWRGRKYSLRHFRIWGSLAYVLETNPKKLEPRSRVCLFVGYPKETRGGFFFDPKLNKVFVSTNAKFL

Query:  EEDHIREH------------------ISHIVEEASTSTRVVNGASSSRTHPPQEWREPHRTGRVVNPPTLYLGLIEAQDIIPDDEVDDPLTYNQAMTDVD
        EEDHIREH                   + +VEE S  TRVV+  SS+RTH PQ  REP R+GRV N P  Y+ L E   +I D +++DPLT+ +AM DVD
Subjt:  EEDHIREH------------------ISHIVEEASTSTRVVNGASSSRTHPPQEWREPHRTGRVVNPPTLYLGLIEAQDIIPDDEVDDPLTYNQAMTDVD

Query:  KDEWVKAKDLEMESMFFNSL
        KDEW+KA +LE+ESM+FNS+
Subjt:  KDEWVKAKDLEMESMFFNSL

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa]4.6e-14651.31Show/hide
Query:  DLKFAITEECSQVPTQNVSQNLRDAYEKWIRANEKASAYILANFSKVFAKKHETMVTAH---------YGFIAKDV-WTVRSWIHCKRCLDNRPIRSKHV
        DL+F + EEC QVP  N ++ +R+ YE+W +ANEKA AYILA+ S+V AKKHE+M+TA          +G  +  +      +I+  R  +   +R +HV
Subjt:  DLKFAITEECSQVPTQNVSQNLRDAYEKWIRANEKASAYILANFSKVFAKKHETMVTAH---------YGFIAKDV-WTVRSWIHCKRCLDNRPIRSKHV

Query:  LNMMVHFNVAEMNGAIIVEANYVSFIMESLPENFLQFRINA--------------------SMHKGKRPDGEANVARFSKKFHKGLTSGTKS-ASSSTRK
        LNMMVHFNVAEMNGA+I EA+ VSFI+ESLPE+FLQFR NA                    S+ K K   GEANVA  ++KFH+G TSGTKS  SSS  K
Subjt:  LNMMVHFNVAEMNGAIIVEANYVSFIMESLPENFLQFRINA--------------------SMHKGKRPDGEANVARFSKKFHKGLTSGTKS-ASSSTRK

Query:  KWRKKKGNKGGNKTNQAAGQKGKQ-KALGISSWHQLGTSDMTLQVGTGQIVSTITVGGKMTKRSFTGKGYRAKEPLKLVYSDLCGPMNVKARGG------
        KW+KKKG + GNK N AA +  K+ KA     +H                       GKMTKR FTGKG++AKEPL+LV+SDLCGPMNVKARGG      
Subjt:  KWRKKKGNKGGNKTNQAAGQKGKQ-KALGISSWHQLGTSDMTLQVGTGQIVSTITVGGKMTKRSFTGKGYRAKEPLKLVYSDLCGPMNVKARGG------

Query:  -------YGYLYLMQHKSEALEKFEKFKAEVQNLL---------------------------------------------ERRNRTLLDMVWFMMSYAQL
               YGY+YLMQHKSEALEKF+++KAEV+N L                                             +RRNRTLLDMV  MMSY  L
Subjt:  -------YGYLYLMQHKSEALEKFEKFKAEVQNLL---------------------------------------------ERRNRTLLDMVWFMMSYAQL

Query:  LDSFWGYAITTALQILNNVPSKSVSKTPLELWRGRKYSLRHFRIWGSLAYVLETNPKKLEPRSRVCLFVGYPKETRGGFFFDPKLNKVFVSTNAKFLEED
         +SFWGYA+ TA+ ILN VPSKSVSKTPL+LW GRK SLRHFRIWG  A+VLE NPKKLEPRS++CLFVGYPK TRGG+F+DPK NKVFVSTNA FLEED
Subjt:  LDSFWGYAITTALQILNNVPSKSVSKTPLELWRGRKYSLRHFRIWGSLAYVLETNPKKLEPRSRVCLFVGYPKETRGGFFFDPKLNKVFVSTNAKFLEED

Query:  HIREH------------------ISHIVEEASTSTRVVNGASSSRTHPPQEWREPHRTGRVVNPPTLYLGLIEAQDIIPDDEVDDPLTYNQAMTDVDKDE
        HIREH                   + +VEE S  TRVV+  SS+RTH PQ  REP R+GRV N P  Y+ L E   +I D +++DPLT+ +AM DVDKDE
Subjt:  HIREH------------------ISHIVEEASTSTRVVNGASSSRTHPPQEWREPHRTGRVVNPPTLYLGLIEAQDIIPDDEVDDPLTYNQAMTDVDKDE

Query:  WVKAKDLEMESM
        W+KA +LE+ES+
Subjt:  WVKAKDLEMESM

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]8.3e-14041.46Show/hide
Query:  DLKFAITEECSQVPTQNVSQNLRDAYEKWIRANEKASAYILANFSKVFAKKHETMVTAH---------YGFIAKDV-WTVRSWIHCKRCLDNRPIRSKHV
        DL+F + EEC QVP  N ++ +R+ YE+W +ANEKA AYILA+ S+V AKKHE+M+TA          +G  +  +      +I+  R  +   +R +HV
Subjt:  DLKFAITEECSQVPTQNVSQNLRDAYEKWIRANEKASAYILANFSKVFAKKHETMVTAH---------YGFIAKDV-WTVRSWIHCKRCLDNRPIRSKHV

Query:  LNMMVHFNVAEMNGAIIVEANYVSFIMESLPENFLQFRINA--------------------SMHKGKRPDGEANVARFSKKFHKGLTSGTKS-ASSSTRK
        LNMMVHFNVAEMNGA+I EA+ VSFI+ESLPE+FLQFR NA                    S+ K K   GEANVA  ++KFH+G TSGTKS  SSS  K
Subjt:  LNMMVHFNVAEMNGAIIVEANYVSFIMESLPENFLQFRINA--------------------SMHKGKRPDGEANVARFSKKFHKGLTSGTKS-ASSSTRK

Query:  KWRKKKGNKGGNKTNQAA-----------------GQKG------------KQKA------------------------------------LGISSWHQL
        KW+KKKG + GNK N AA                  Q+G            K+KA                                     GISSW QL
Subjt:  KWRKKKGNKGGNKTNQAA-----------------GQKG------------KQKA------------------------------------LGISSWHQL

Query:  GTSDMTLQVGTGQIVSTITVG-------------------------------------------------------------------------------
         T +MT++VGTG +VS I VG                                                                               
Subjt:  GTSDMTLQVGTGQIVSTITVG-------------------------------------------------------------------------------

Query:  ------------------------------------------------------------GKMTKRSFTGKGYRAKEPLKLVYSDLCGPMNVKARGG---
                                                                    GKMTKR FTGKG+RAKEPL+LV+SDLCGPMNVKARGG   
Subjt:  ------------------------------------------------------------GKMTKRSFTGKGYRAKEPLKLVYSDLCGPMNVKARGG---

Query:  ----------YGYLYLMQHKSEALEKFEKFKAEVQNLL---------------------------------------------ERRNRTLLDMVWFMMSY
                  YGY+YLMQHKSEALEKF+++KAEV+N L                                             ERRNRTLLDMV  MMSY
Subjt:  ----------YGYLYLMQHKSEALEKFEKFKAEVQNLL---------------------------------------------ERRNRTLLDMVWFMMSY

Query:  AQLLDSFWGYAITTALQILNNVPSKSVSKTPLELWRGRKYSLRHFRIWGSLAYVLETNPKKLEPRSRVCLFVGYPKETRGGFFFDPKLNKVFVSTNAKFL
        A L +SFWGYA+ TA+ ILN VPSKSVS+TPL+LW GRK SLRHFRIWG  A+VLE NPKKLEPRS++CLFVGYPK TRGG+F+DPK NKVFVSTNA FL
Subjt:  AQLLDSFWGYAITTALQILNNVPSKSVSKTPLELWRGRKYSLRHFRIWGSLAYVLETNPKKLEPRSRVCLFVGYPKETRGGFFFDPKLNKVFVSTNAKFL

Query:  EEDHIREH------------------ISHIVEEASTSTRVVNGASSSRTHPPQEWREPHRTGRVVNPPTLYLGLIEAQDIIPDDEVDDPLTYNQAMTDVD
        EEDHIREH                   + +VEE S  TRVV+  SS+RTH PQ  REP R+GRV N P  Y+ L E   +I D +++DPLT+ +AM DVD
Subjt:  EEDHIREH------------------ISHIVEEASTSTRVVNGASSSRTHPPQEWREPHRTGRVVNPPTLYLGLIEAQDIIPDDEVDDPLTYNQAMTDVD

Query:  KDEWVKAKDLEMESMFFNSL
        KDEW+KA +LE+ESM+FNS+
Subjt:  KDEWVKAKDLEMESMFFNSL

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]8.3e-14041.46Show/hide
Query:  DLKFAITEECSQVPTQNVSQNLRDAYEKWIRANEKASAYILANFSKVFAKKHETMVTAH---------YGFIAKDV-WTVRSWIHCKRCLDNRPIRSKHV
        DL+F + EEC QVP  N ++ +R+ YE+W +ANEKA AYILA+ S+V AKKHE+M+TA          +G  +  +      +I+  R  +   +R +HV
Subjt:  DLKFAITEECSQVPTQNVSQNLRDAYEKWIRANEKASAYILANFSKVFAKKHETMVTAH---------YGFIAKDV-WTVRSWIHCKRCLDNRPIRSKHV

Query:  LNMMVHFNVAEMNGAIIVEANYVSFIMESLPENFLQFRINA--------------------SMHKGKRPDGEANVARFSKKFHKGLTSGTKS-ASSSTRK
        LNMMVHFNVAEMNGA+I EA+ VSFI+ESLPE+FLQFR NA                    S+ K K   GEANVA  ++KFH+G TSGTKS  SSS  K
Subjt:  LNMMVHFNVAEMNGAIIVEANYVSFIMESLPENFLQFRINA--------------------SMHKGKRPDGEANVARFSKKFHKGLTSGTKS-ASSSTRK

Query:  KWRKKKGNKGGNKTNQAA-----------------GQKG------------KQKA------------------------------------LGISSWHQL
        KW+KKKG + GNK N AA                  Q+G            K+KA                                     GISSW QL
Subjt:  KWRKKKGNKGGNKTNQAA-----------------GQKG------------KQKA------------------------------------LGISSWHQL

Query:  GTSDMTLQVGTGQIVSTITVG-------------------------------------------------------------------------------
         T +MT++VGTG +VS I VG                                                                               
Subjt:  GTSDMTLQVGTGQIVSTITVG-------------------------------------------------------------------------------

Query:  ------------------------------------------------------------GKMTKRSFTGKGYRAKEPLKLVYSDLCGPMNVKARGG---
                                                                    GKMTKR FTGKG+RAKEPL+LV+SDLCGPMNVKARGG   
Subjt:  ------------------------------------------------------------GKMTKRSFTGKGYRAKEPLKLVYSDLCGPMNVKARGG---

Query:  ----------YGYLYLMQHKSEALEKFEKFKAEVQNLL---------------------------------------------ERRNRTLLDMVWFMMSY
                  YGY+YLMQHKSEALEKF+++KAEV+N L                                             ERRNRTLLDMV  MMSY
Subjt:  ----------YGYLYLMQHKSEALEKFEKFKAEVQNLL---------------------------------------------ERRNRTLLDMVWFMMSY

Query:  AQLLDSFWGYAITTALQILNNVPSKSVSKTPLELWRGRKYSLRHFRIWGSLAYVLETNPKKLEPRSRVCLFVGYPKETRGGFFFDPKLNKVFVSTNAKFL
        A L +SFWGYA+ TA+ ILN VPSKSVS+TPL+LW GRK SLRHFRIWG  A+VLE NPKKLEPRS++CLFVGYPK TRGG+F+DPK NKVFVSTNA FL
Subjt:  AQLLDSFWGYAITTALQILNNVPSKSVSKTPLELWRGRKYSLRHFRIWGSLAYVLETNPKKLEPRSRVCLFVGYPKETRGGFFFDPKLNKVFVSTNAKFL

Query:  EEDHIREH------------------ISHIVEEASTSTRVVNGASSSRTHPPQEWREPHRTGRVVNPPTLYLGLIEAQDIIPDDEVDDPLTYNQAMTDVD
        EEDHIREH                   + +VEE S  TRVV+  SS+RTH PQ  REP R+GRV N P  Y+ L E   +I D +++DPLT+ +AM DVD
Subjt:  EEDHIREH------------------ISHIVEEASTSTRVVNGASSSRTHPPQEWREPHRTGRVVNPPTLYLGLIEAQDIIPDDEVDDPLTYNQAMTDVD

Query:  KDEWVKAKDLEMESMFFNSL
        KDEW+KA +LE+ESM+FNS+
Subjt:  KDEWVKAKDLEMESMFFNSL

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein4.0e-14041.46Show/hide
Query:  DLKFAITEECSQVPTQNVSQNLRDAYEKWIRANEKASAYILANFSKVFAKKHETMVTAH---------YGFIAKDV-WTVRSWIHCKRCLDNRPIRSKHV
        DL+F + EEC QVP  N ++ +R+ YE+W +ANEKA AYILA+ S+V AKKHE+M+TA          +G  +  +      +I+  R  +   +R +HV
Subjt:  DLKFAITEECSQVPTQNVSQNLRDAYEKWIRANEKASAYILANFSKVFAKKHETMVTAH---------YGFIAKDV-WTVRSWIHCKRCLDNRPIRSKHV

Query:  LNMMVHFNVAEMNGAIIVEANYVSFIMESLPENFLQFRINA--------------------SMHKGKRPDGEANVARFSKKFHKGLTSGTKS-ASSSTRK
        LNMMVHFNVAEMNGA+I EA+ VSFI+ESLPE+FLQFR NA                    S+ K K   GEANVA  ++KFH+G TSGTKS  SSS  K
Subjt:  LNMMVHFNVAEMNGAIIVEANYVSFIMESLPENFLQFRINA--------------------SMHKGKRPDGEANVARFSKKFHKGLTSGTKS-ASSSTRK

Query:  KWRKKKGNKGGNKTNQAA-----------------GQKG------------KQKA------------------------------------LGISSWHQL
        KW+KKKG + GNK N AA                  Q+G            K+KA                                     GISSW QL
Subjt:  KWRKKKGNKGGNKTNQAA-----------------GQKG------------KQKA------------------------------------LGISSWHQL

Query:  GTSDMTLQVGTGQIVSTITVG-------------------------------------------------------------------------------
         T +MT++VGTG +VS I VG                                                                               
Subjt:  GTSDMTLQVGTGQIVSTITVG-------------------------------------------------------------------------------

Query:  ------------------------------------------------------------GKMTKRSFTGKGYRAKEPLKLVYSDLCGPMNVKARGG---
                                                                    GKMTKR FTGKG+RAKEPL+LV+SDLCGPMNVKARGG   
Subjt:  ------------------------------------------------------------GKMTKRSFTGKGYRAKEPLKLVYSDLCGPMNVKARGG---

Query:  ----------YGYLYLMQHKSEALEKFEKFKAEVQNLL---------------------------------------------ERRNRTLLDMVWFMMSY
                  YGY+YLMQHKSEALEKF+++KAEV+N L                                             ERRNRTLLDMV  MMSY
Subjt:  ----------YGYLYLMQHKSEALEKFEKFKAEVQNLL---------------------------------------------ERRNRTLLDMVWFMMSY

Query:  AQLLDSFWGYAITTALQILNNVPSKSVSKTPLELWRGRKYSLRHFRIWGSLAYVLETNPKKLEPRSRVCLFVGYPKETRGGFFFDPKLNKVFVSTNAKFL
        A L +SFWGYA+ TA+ ILN VPSKSVS+TPL+LW GRK SLRHFRIWG  A+VLE NPKKLEPRS++CLFVGYPK TRGG+F+DPK NKVFVSTNA FL
Subjt:  AQLLDSFWGYAITTALQILNNVPSKSVSKTPLELWRGRKYSLRHFRIWGSLAYVLETNPKKLEPRSRVCLFVGYPKETRGGFFFDPKLNKVFVSTNAKFL

Query:  EEDHIREH------------------ISHIVEEASTSTRVVNGASSSRTHPPQEWREPHRTGRVVNPPTLYLGLIEAQDIIPDDEVDDPLTYNQAMTDVD
        EEDHIREH                   + +VEE S  TRVV+  SS+RTH PQ  REP R+GRV N P  Y+ L E   +I D +++DPLT+ +AM DVD
Subjt:  EEDHIREH------------------ISHIVEEASTSTRVVNGASSSRTHPPQEWREPHRTGRVVNPPTLYLGLIEAQDIIPDDEVDDPLTYNQAMTDVD

Query:  KDEWVKAKDLEMESMFFNSL
        KDEW+KA +LE+ESM+FNS+
Subjt:  KDEWVKAKDLEMESMFFNSL

A0A5A7SNP8 Gag/pol protein9.9e-14750.72Show/hide
Query:  DLKFAITEECSQVPTQNVSQNLRDAYEKWIRANEKASAYILANFSKVFAKKHETMVTAH---------YGFIAKDV-WTVRSWIHCKRCLDNRPIRSKHV
        DL+F + E+C QVP  N ++ +R+ YE+W +ANEKA AYILA+ S+V AKKHE+M+TA          +G  +  +      +I+  R  +   +R +HV
Subjt:  DLKFAITEECSQVPTQNVSQNLRDAYEKWIRANEKASAYILANFSKVFAKKHETMVTAH---------YGFIAKDV-WTVRSWIHCKRCLDNRPIRSKHV

Query:  LNMMVHFNVAEMNGAIIVEANYVSFIMESLPENFLQFRINA--------------------SMHKGKRPDGEANVARFSKKFHKGLTSGTKS-ASSSTRK
        LNMMVHFNVA MN A+I EA+ VSFI+ESLPE+FLQFR NA                    S+ K K   GEANVA  ++KFH+G TSGTKS  SSS  K
Subjt:  LNMMVHFNVAEMNGAIIVEANYVSFIMESLPENFLQFRINA--------------------SMHKGKRPDGEANVARFSKKFHKGLTSGTKS-ASSSTRK

Query:  KWRKKKGNKGGNKTNQAAGQ--KGKQKALGI-------SSWHQLGTSDMTLQVGTGQIVSTITVGGKMTKRSFTGKGYRAKEPLKLVYSDLCGPMNVKAR
        KW+KKKG + GNK N AA +  K  + A GI         W +     +  +    Q        GKMTKR FTGKG+RAKEPL+LV+SDLCGPMNVKAR
Subjt:  KWRKKKGNKGGNKTNQAAGQ--KGKQKALGI-------SSWHQLGTSDMTLQVGTGQIVSTITVGGKMTKRSFTGKGYRAKEPLKLVYSDLCGPMNVKAR

Query:  GG-------------YGYLYLMQHKSEALEKFEKFKAEVQNLL---------------------------------------------ERRNRTLLDMVW
        GG             YGY+YLMQHKSEALEKF+++KAEV+N L                                             ERRNRTLLDMV 
Subjt:  GG-------------YGYLYLMQHKSEALEKFEKFKAEVQNLL---------------------------------------------ERRNRTLLDMVW

Query:  FMMSYAQLLDSFWGYAITTALQILNNVPSKSVSKTPLELWRGRKYSLRHFRIWGSLAYVLETNPKKLEPRSRVCLFVGYPKETRGGFFFDPKLNKVFVST
         MMSYA L +SFWGYA+ TA+ ILN VPSKSVS+TPL+LW G K SLRHFRIWG  A+VLE NPKKLEPRS++CLFVGYPK TRGG+F+DPK NKVFVST
Subjt:  FMMSYAQLLDSFWGYAITTALQILNNVPSKSVSKTPLELWRGRKYSLRHFRIWGSLAYVLETNPKKLEPRSRVCLFVGYPKETRGGFFFDPKLNKVFVST

Query:  NAKFLEEDHIREH------------------ISHIVEEASTSTRVVNGASSSRTHPPQEWREPHRTGRVVNPPTLYLGLIEAQDIIPDDEVDDPLTYNQA
        NA FLEEDHIREH                   + +VEE S   RVV+  SS+RTH PQ  REP R+GRV N P  Y+ L E   +I D +++DPLT+ +A
Subjt:  NAKFLEEDHIREH------------------ISHIVEEASTSTRVVNGASSSRTHPPQEWREPHRTGRVVNPPTLYLGLIEAQDIIPDDEVDDPLTYNQA

Query:  MTDVDKDEWVKAKDLEMESMFFNSL
        M DVDKDEW+KA +LE+ESM+FNS+
Subjt:  MTDVDKDEWVKAKDLEMESMFFNSL

A0A5A7TWB9 Gag/pol protein4.0e-14041.46Show/hide
Query:  DLKFAITEECSQVPTQNVSQNLRDAYEKWIRANEKASAYILANFSKVFAKKHETMVTAH---------YGFIAKDV-WTVRSWIHCKRCLDNRPIRSKHV
        DL+F + EEC QVP  N ++ +R+ YE+W +ANEKA AYILA+ S+V AKKHE+M+TA          +G  +  +      +I+  R  +   +R +HV
Subjt:  DLKFAITEECSQVPTQNVSQNLRDAYEKWIRANEKASAYILANFSKVFAKKHETMVTAH---------YGFIAKDV-WTVRSWIHCKRCLDNRPIRSKHV

Query:  LNMMVHFNVAEMNGAIIVEANYVSFIMESLPENFLQFRINA--------------------SMHKGKRPDGEANVARFSKKFHKGLTSGTKS-ASSSTRK
        LNMMVHFNVAEMNGA+I EA+ VSFI+ESLPE+FLQFR NA                    S+ K K   GEANVA  ++KFH+G TSGTKS  SSS  K
Subjt:  LNMMVHFNVAEMNGAIIVEANYVSFIMESLPENFLQFRINA--------------------SMHKGKRPDGEANVARFSKKFHKGLTSGTKS-ASSSTRK

Query:  KWRKKKGNKGGNKTNQAA-----------------GQKG------------KQKA------------------------------------LGISSWHQL
        KW+KKKG + GNK N AA                  Q+G            K+KA                                     GISSW QL
Subjt:  KWRKKKGNKGGNKTNQAA-----------------GQKG------------KQKA------------------------------------LGISSWHQL

Query:  GTSDMTLQVGTGQIVSTITVG-------------------------------------------------------------------------------
         T +MT++VGTG +VS I VG                                                                               
Subjt:  GTSDMTLQVGTGQIVSTITVG-------------------------------------------------------------------------------

Query:  ------------------------------------------------------------GKMTKRSFTGKGYRAKEPLKLVYSDLCGPMNVKARGG---
                                                                    GKMTKR FTGKG+RAKEPL+LV+SDLCGPMNVKARGG   
Subjt:  ------------------------------------------------------------GKMTKRSFTGKGYRAKEPLKLVYSDLCGPMNVKARGG---

Query:  ----------YGYLYLMQHKSEALEKFEKFKAEVQNLL---------------------------------------------ERRNRTLLDMVWFMMSY
                  YGY+YLMQHKSEALEKF+++KAEV+N L                                             ERRNRTLLDMV  MMSY
Subjt:  ----------YGYLYLMQHKSEALEKFEKFKAEVQNLL---------------------------------------------ERRNRTLLDMVWFMMSY

Query:  AQLLDSFWGYAITTALQILNNVPSKSVSKTPLELWRGRKYSLRHFRIWGSLAYVLETNPKKLEPRSRVCLFVGYPKETRGGFFFDPKLNKVFVSTNAKFL
        A L +SFWGYA+ TA+ ILN VPSKSVS+TPL+LW GRK SLRHFRIWG  A+VLE NPKKLEPRS++CLFVGYPK TRGG+F+DPK NKVFVSTNA FL
Subjt:  AQLLDSFWGYAITTALQILNNVPSKSVSKTPLELWRGRKYSLRHFRIWGSLAYVLETNPKKLEPRSRVCLFVGYPKETRGGFFFDPKLNKVFVSTNAKFL

Query:  EEDHIREH------------------ISHIVEEASTSTRVVNGASSSRTHPPQEWREPHRTGRVVNPPTLYLGLIEAQDIIPDDEVDDPLTYNQAMTDVD
        EEDHIREH                   + +VEE S  TRVV+  SS+RTH PQ  REP R+GRV N P  Y+ L E   +I D +++DPLT+ +AM DVD
Subjt:  EEDHIREH------------------ISHIVEEASTSTRVVNGASSSRTHPPQEWREPHRTGRVVNPPTLYLGLIEAQDIIPDDEVDDPLTYNQAMTDVD

Query:  KDEWVKAKDLEMESMFFNSL
        KDEW+KA +LE+ESM+FNS+
Subjt:  KDEWVKAKDLEMESMFFNSL

A0A5A7U869 Gag/pol protein2.2e-14651.31Show/hide
Query:  DLKFAITEECSQVPTQNVSQNLRDAYEKWIRANEKASAYILANFSKVFAKKHETMVTAH---------YGFIAKDV-WTVRSWIHCKRCLDNRPIRSKHV
        DL+F + EEC QVP  N ++ +R+ YE+W +ANEKA AYILA+ S+V AKKHE+M+TA          +G  +  +      +I+  R  +   +R +HV
Subjt:  DLKFAITEECSQVPTQNVSQNLRDAYEKWIRANEKASAYILANFSKVFAKKHETMVTAH---------YGFIAKDV-WTVRSWIHCKRCLDNRPIRSKHV

Query:  LNMMVHFNVAEMNGAIIVEANYVSFIMESLPENFLQFRINA--------------------SMHKGKRPDGEANVARFSKKFHKGLTSGTKS-ASSSTRK
        LNMMVHFNVAEMNGA+I EA+ VSFI+ESLPE+FLQFR NA                    S+ K K   GEANVA  ++KFH+G TSGTKS  SSS  K
Subjt:  LNMMVHFNVAEMNGAIIVEANYVSFIMESLPENFLQFRINA--------------------SMHKGKRPDGEANVARFSKKFHKGLTSGTKS-ASSSTRK

Query:  KWRKKKGNKGGNKTNQAAGQKGKQ-KALGISSWHQLGTSDMTLQVGTGQIVSTITVGGKMTKRSFTGKGYRAKEPLKLVYSDLCGPMNVKARGG------
        KW+KKKG + GNK N AA +  K+ KA     +H                       GKMTKR FTGKG++AKEPL+LV+SDLCGPMNVKARGG      
Subjt:  KWRKKKGNKGGNKTNQAAGQKGKQ-KALGISSWHQLGTSDMTLQVGTGQIVSTITVGGKMTKRSFTGKGYRAKEPLKLVYSDLCGPMNVKARGG------

Query:  -------YGYLYLMQHKSEALEKFEKFKAEVQNLL---------------------------------------------ERRNRTLLDMVWFMMSYAQL
               YGY+YLMQHKSEALEKF+++KAEV+N L                                             +RRNRTLLDMV  MMSY  L
Subjt:  -------YGYLYLMQHKSEALEKFEKFKAEVQNLL---------------------------------------------ERRNRTLLDMVWFMMSYAQL

Query:  LDSFWGYAITTALQILNNVPSKSVSKTPLELWRGRKYSLRHFRIWGSLAYVLETNPKKLEPRSRVCLFVGYPKETRGGFFFDPKLNKVFVSTNAKFLEED
         +SFWGYA+ TA+ ILN VPSKSVSKTPL+LW GRK SLRHFRIWG  A+VLE NPKKLEPRS++CLFVGYPK TRGG+F+DPK NKVFVSTNA FLEED
Subjt:  LDSFWGYAITTALQILNNVPSKSVSKTPLELWRGRKYSLRHFRIWGSLAYVLETNPKKLEPRSRVCLFVGYPKETRGGFFFDPKLNKVFVSTNAKFLEED

Query:  HIREH------------------ISHIVEEASTSTRVVNGASSSRTHPPQEWREPHRTGRVVNPPTLYLGLIEAQDIIPDDEVDDPLTYNQAMTDVDKDE
        HIREH                   + +VEE S  TRVV+  SS+RTH PQ  REP R+GRV N P  Y+ L E   +I D +++DPLT+ +AM DVDKDE
Subjt:  HIREH------------------ISHIVEEASTSTRVVNGASSSRTHPPQEWREPHRTGRVVNPPTLYLGLIEAQDIIPDDEVDDPLTYNQAMTDVDKDE

Query:  WVKAKDLEMESM
        W+KA +LE+ES+
Subjt:  WVKAKDLEMESM

A0A5D3CPJ6 Gag/pol protein4.0e-14041.46Show/hide
Query:  DLKFAITEECSQVPTQNVSQNLRDAYEKWIRANEKASAYILANFSKVFAKKHETMVTAH---------YGFIAKDV-WTVRSWIHCKRCLDNRPIRSKHV
        DL+F + EEC QVP  N ++ +R+ YE+W +ANEKA AYILA+ S+V AKKHE+M+TA          +G  +  +      +I+  R  +   +R +HV
Subjt:  DLKFAITEECSQVPTQNVSQNLRDAYEKWIRANEKASAYILANFSKVFAKKHETMVTAH---------YGFIAKDV-WTVRSWIHCKRCLDNRPIRSKHV

Query:  LNMMVHFNVAEMNGAIIVEANYVSFIMESLPENFLQFRINA--------------------SMHKGKRPDGEANVARFSKKFHKGLTSGTKS-ASSSTRK
        LNMMVHFNVAEMNGA+I EA+ VSFI+ESLPE+FLQFR NA                    S+ K K   GEANVA  ++KFH+G TSGTKS  SSS  K
Subjt:  LNMMVHFNVAEMNGAIIVEANYVSFIMESLPENFLQFRINA--------------------SMHKGKRPDGEANVARFSKKFHKGLTSGTKS-ASSSTRK

Query:  KWRKKKGNKGGNKTNQAA-----------------GQKG------------KQKA------------------------------------LGISSWHQL
        KW+KKKG + GNK N AA                  Q+G            K+KA                                     GISSW QL
Subjt:  KWRKKKGNKGGNKTNQAA-----------------GQKG------------KQKA------------------------------------LGISSWHQL

Query:  GTSDMTLQVGTGQIVSTITVG-------------------------------------------------------------------------------
         T +MT++VGTG +VS I VG                                                                               
Subjt:  GTSDMTLQVGTGQIVSTITVG-------------------------------------------------------------------------------

Query:  ------------------------------------------------------------GKMTKRSFTGKGYRAKEPLKLVYSDLCGPMNVKARGG---
                                                                    GKMTKR FTGKG+RAKEPL+LV+SDLCGPMNVKARGG   
Subjt:  ------------------------------------------------------------GKMTKRSFTGKGYRAKEPLKLVYSDLCGPMNVKARGG---

Query:  ----------YGYLYLMQHKSEALEKFEKFKAEVQNLL---------------------------------------------ERRNRTLLDMVWFMMSY
                  YGY+YLMQHKSEALEKF+++KAEV+N L                                             ERRNRTLLDMV  MMSY
Subjt:  ----------YGYLYLMQHKSEALEKFEKFKAEVQNLL---------------------------------------------ERRNRTLLDMVWFMMSY

Query:  AQLLDSFWGYAITTALQILNNVPSKSVSKTPLELWRGRKYSLRHFRIWGSLAYVLETNPKKLEPRSRVCLFVGYPKETRGGFFFDPKLNKVFVSTNAKFL
        A L +SFWGYA+ TA+ ILN VPSKSVS+TPL+LW GRK SLRHFRIWG  A+VLE NPKKLEPRS++CLFVGYPK TRGG+F+DPK NKVFVSTNA FL
Subjt:  AQLLDSFWGYAITTALQILNNVPSKSVSKTPLELWRGRKYSLRHFRIWGSLAYVLETNPKKLEPRSRVCLFVGYPKETRGGFFFDPKLNKVFVSTNAKFL

Query:  EEDHIREH------------------ISHIVEEASTSTRVVNGASSSRTHPPQEWREPHRTGRVVNPPTLYLGLIEAQDIIPDDEVDDPLTYNQAMTDVD
        EEDHIREH                   + +VEE S  TRVV+  SS+RTH PQ  REP R+GRV N P  Y+ L E   +I D +++DPLT+ +AM DVD
Subjt:  EEDHIREH------------------ISHIVEEASTSTRVVNGASSSRTHPPQEWREPHRTGRVVNPPTLYLGLIEAQDIIPDDEVDDPLTYNQAMTDVD

Query:  KDEWVKAKDLEMESMFFNSL
        KDEW+KA +LE+ESM+FNS+
Subjt:  KDEWVKAKDLEMESMFFNSL

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.5e-1426.2Show/hide
Query:  KEPLKLVYSDLCGPMN-------------VKARGGYGYLYLMQHKSEALEKFEKFKA-------------------------------------------
        K PL +V+SD+CGP+              V     Y   YL+++KS+    F+ F A                                           
Subjt:  KEPLKLVYSDLCGPMN-------------VKARGGYGYLYLMQHKSEALEKFEKFKA-------------------------------------------

Query:  --EVQNLLERRNRTLLDMVWFMMSYAQLLDSFWGYAITTALQILNNVPSKSV---SKTPLELWRGRKYSLRHFRIWGSLAYV-LETNPKKLEPRSRVCLF
          ++  + ER  RT+ +    M+S A+L  SFWG A+ TA  ++N +PS+++   SKTP E+W  +K  L+H R++G+  YV ++    K + +S   +F
Subjt:  --EVQNLLERRNRTLLDMVWFMMSYAQLLDSFWGYAITTALQILNNVPSKSV---SKTPLELWRGRKYSLRHFRIWGSLAYV-LETNPKKLEPRSRVCLF

Query:  VGYPKETRGGFFFDPKLNKVFVSTNAKFL
        VGY          +P   K++ + N KF+
Subjt:  VGYPKETRGGFFFDPKLNKVFVSTNAKFL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.6e-1927.24Show/hide
Query:  GKMTKRSFTGKGYRAKEPLKLVYSDLCGPMNVKARGGYGY-------------LYLMQHKSEALEKFEKFKAEVQ-------------------------
        GK  + SF     R    L LVYSD+CGPM +++ GG  Y             +Y+++ K +  + F+KF A V+                         
Subjt:  GKMTKRSFTGKGYRAKEPLKLVYSDLCGPMNVKARGGYGY-------------LYLMQHKSEALEKFEKFKAEVQ-------------------------

Query:  --------------------NLLERRNRTLLDMVWFMMSYAQLLDSFWGYAITTALQILNNVPSKSVS-KTPLELWRGRKYSLRHFRIWG--SLAYVLET
                             + ER NRT+++ V  M+  A+L  SFWG A+ TA  ++N  PS  ++ + P  +W  ++ S  H +++G  + A+V + 
Subjt:  --------------------NLLERRNRTLLDMVWFMMSYAQLLDSFWGYAITTALQILNNVPSKSVS-KTPLELWRGRKYSLRHFRIWG--SLAYVLET

Query:  NPKKLEPRSRVCLFVGYPKETRGGFFFDPKLNKVFVSTNAKFLEEDHIREHISHIVEEASTSTRVVNG
           KL+ +S  C+F+GY  E  G   +DP   KV  S +  F E        S +   A  S +V NG
Subjt:  NPKKLEPRSRVCLFVGYPKETRGGFFFDPKLNKVFVSTNAKFLEEDHIREHISHIVEEASTSTRVVNG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.3e-0623.62Show/hide
Query:  EVQNLLERRNRTLLDMVWFMMSYAQLLDSFWGYAITTALQILNNVPSKSVS-KTPLELWRGRKYSLRHFRIWGSLAY--VLETNPKKLEPRSRVCLFVGY
        E   L ER++R +++M   ++S+A +  ++W YA + A+ ++N +P+  +  ++P +   G+  +    +++G   Y  +   N  KLE +S+ C F+GY
Subjt:  EVQNLLERRNRTLLDMVWFMMSYAQLLDSFWGYAITTALQILNNVPSKSVS-KTPLELWRGRKYSLRHFRIWGSLAY--VLETNPKKLEPRSRVCLFVGY

Query:  PKETRGGFFFDPKLNKVFVSTNAKFLE
                       +++ S + +F E
Subjt:  PKETRGGFFFDPKLNKVFVSTNAKFLE

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTCCCTTCTATCCTCAACTCATGCGTGAGTTGATTGTGAACTTGCCTGCTCAGTTCATTGATCCTAGCAGTCCAGACTTTCAGAAGGTTCATATTCATGGAACCTA
TTTTCATCTATCTTCGACCATTATCAATGCATATATGGGTTGTGATGAGTCTCCATCTCTCGAAAAACCTCATCCTTCCACTTCAGACCTTGTAGTGGAGTTGATTGGTG
GTACTAAGTCTTCCTCGCCTGCTTTTGGGCAATTACTTGTATCAACTCTAAGTGTTAAATATGCCCTTCTTCACAAGGTTGGCATAGCAAATTGGTGTCCCTCAACTCAT
GGGTCTGGTTGTAGGGATGAAAGCCCTAACGCAGCAGAAAAAATACAAGTTCTGAATATTGAAGCTCTGAAGAATACAAAACCTCCTAAAGCTTTGGAGATCAAAAGACT
TGCTCTAAAGAGTAAACTCCAAAGGTCAATCAACAAGCCTAAAGGTCGATCACCCAAGAAGATCATCAAGCCCAAAGACCGATCTTCACCTAAGAAGTTCATCAAGCCCA
AAGGTCGATCATCACCCAAGAAGATCATCAAGCCCAAAGGCCGATCACCCAAGATGATCATCAAGCCCAAAACACCCAAGAAGATCATTAAGCTCAAAGGCCAATCATCA
CCCAAGAAGATCATCAAGCCCAAAGGTCGATCATCACCTAAGAAGATCATCAAGCCCAAAGTTCGATCGCCCAAGAAGATCATCAAGCTTAAAGTGTTGCTAAGGAAGAG
GATCGAAAACAGAGCAAAAACGAAGCAGAAAAAGCTTTCTATGCTTCAGCCTAGAGCACAACATCGCTACTCTGTGTGCTATGACTTCAAAAGCCAGAGCGACCTAAAGT
TTGCCATCACTGAGGAATGTTCTCAAGTTCCTACTCAAAATGTGTCACAAAATCTTCGTGATGCGTATGAGAAATGGATAAGGGCAAACGAAAAGGCCAGCGCATATATT
CTTGCCAACTTTTCTAAAGTTTTCGCAAAGAAGCATGAAACCATGGTCACTGCTCATTATGGATTCATTGCAAAAGATGTTTGGACCGTGAGATCATGGATTCATTGCAA
AAGATGTTTGGACAATCGTCCTATTAGATCAAAACATGTTCTCAATATGATGGTCCACTTCAACGTTGCTGAAATGAATGGGGCCATTATCGTTGAGGCCAATTATGTTA
GTTTTATTATGGAATCTCTACCTGAGAATTTCCTACAGTTTAGAATCAATGCTTCTATGCATAAAGGCAAAAGACCAGATGGGGAGGCAAATGTTGCCAGATTCTCTAAA
AAGTTCCATAAAGGTTTGACCTCTGGAACTAAATCTGCTTCTTCCTCTACCAGAAAGAAATGGAGGAAGAAGAAGGGTAATAAGGGTGGTAATAAAACTAACCAAGCTGC
AGGCCAAAAAGGAAAACAAAAGGCTCTGGGGATTAGTTCTTGGCATCAACTTGGGACTAGCGATATGACTTTGCAGGTTGGAACAGGACAGATCGTCTCAACGATTACAG
TGGGAGGCAAAATGACCAAAAGATCTTTTACTGGAAAAGGTTATAGAGCCAAAGAACCCTTAAAACTTGTATATTCAGACCTTTGTGGTCCTATGAATGTTAAGGCAAGA
GGAGGATATGGATATCTTTACCTAATGCAACACAAGTCTGAAGCTCTTGAAAAGTTCGAAAAGTTTAAGGCTGAAGTTCAAAACTTATTAGAAAGGAGAAATAGAACCTT
GTTAGACATGGTTTGGTTTATGATGAGTTATGCTCAATTGCTTGACTCATTTTGGGGTTATGCAATAACGACTGCATTACAAATTTTGAACAATGTTCCATCCAAAAGTG
TTTCTAAAACTCCTTTGGAATTATGGAGAGGTCGTAAATATAGTTTACGACACTTCAGAATTTGGGGTTCTCTAGCATATGTGCTTGAGACAAATCCTAAGAAATTGGAA
CCTCGTTCAAGAGTATGCCTATTTGTAGGTTATCCAAAGGAAACGAGAGGTGGTTTTTTCTTTGATCCAAAACTGAATAAAGTATTTGTATCGACAAATGCTAAATTTTT
GGAAGAAGACCACATAAGAGAGCATATATCGCATATTGTTGAAGAAGCTAGTACATCAACAAGAGTTGTTAATGGGGCTTCATCAAGTCGTACACATCCTCCTCAAGAGT
GGAGGGAGCCTCATCGTACTGGGAGGGTTGTAAACCCACCTACTCTATATTTGGGTTTAATTGAAGCTCAAGACATCATACCAGATGATGAAGTTGATGATCCATTGACC
TATAATCAGGCAATGACTGATGTAGACAAGGATGAATGGGTTAAAGCTAAGGATCTTGAAATGGAGTCTATGTTCTTCAATTCACTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTCCCTTCTATCCTCAACTCATGCGTGAGTTGATTGTGAACTTGCCTGCTCAGTTCATTGATCCTAGCAGTCCAGACTTTCAGAAGGTTCATATTCATGGAACCTA
TTTTCATCTATCTTCGACCATTATCAATGCATATATGGGTTGTGATGAGTCTCCATCTCTCGAAAAACCTCATCCTTCCACTTCAGACCTTGTAGTGGAGTTGATTGGTG
GTACTAAGTCTTCCTCGCCTGCTTTTGGGCAATTACTTGTATCAACTCTAAGTGTTAAATATGCCCTTCTTCACAAGGTTGGCATAGCAAATTGGTGTCCCTCAACTCAT
GGGTCTGGTTGTAGGGATGAAAGCCCTAACGCAGCAGAAAAAATACAAGTTCTGAATATTGAAGCTCTGAAGAATACAAAACCTCCTAAAGCTTTGGAGATCAAAAGACT
TGCTCTAAAGAGTAAACTCCAAAGGTCAATCAACAAGCCTAAAGGTCGATCACCCAAGAAGATCATCAAGCCCAAAGACCGATCTTCACCTAAGAAGTTCATCAAGCCCA
AAGGTCGATCATCACCCAAGAAGATCATCAAGCCCAAAGGCCGATCACCCAAGATGATCATCAAGCCCAAAACACCCAAGAAGATCATTAAGCTCAAAGGCCAATCATCA
CCCAAGAAGATCATCAAGCCCAAAGGTCGATCATCACCTAAGAAGATCATCAAGCCCAAAGTTCGATCGCCCAAGAAGATCATCAAGCTTAAAGTGTTGCTAAGGAAGAG
GATCGAAAACAGAGCAAAAACGAAGCAGAAAAAGCTTTCTATGCTTCAGCCTAGAGCACAACATCGCTACTCTGTGTGCTATGACTTCAAAAGCCAGAGCGACCTAAAGT
TTGCCATCACTGAGGAATGTTCTCAAGTTCCTACTCAAAATGTGTCACAAAATCTTCGTGATGCGTATGAGAAATGGATAAGGGCAAACGAAAAGGCCAGCGCATATATT
CTTGCCAACTTTTCTAAAGTTTTCGCAAAGAAGCATGAAACCATGGTCACTGCTCATTATGGATTCATTGCAAAAGATGTTTGGACCGTGAGATCATGGATTCATTGCAA
AAGATGTTTGGACAATCGTCCTATTAGATCAAAACATGTTCTCAATATGATGGTCCACTTCAACGTTGCTGAAATGAATGGGGCCATTATCGTTGAGGCCAATTATGTTA
GTTTTATTATGGAATCTCTACCTGAGAATTTCCTACAGTTTAGAATCAATGCTTCTATGCATAAAGGCAAAAGACCAGATGGGGAGGCAAATGTTGCCAGATTCTCTAAA
AAGTTCCATAAAGGTTTGACCTCTGGAACTAAATCTGCTTCTTCCTCTACCAGAAAGAAATGGAGGAAGAAGAAGGGTAATAAGGGTGGTAATAAAACTAACCAAGCTGC
AGGCCAAAAAGGAAAACAAAAGGCTCTGGGGATTAGTTCTTGGCATCAACTTGGGACTAGCGATATGACTTTGCAGGTTGGAACAGGACAGATCGTCTCAACGATTACAG
TGGGAGGCAAAATGACCAAAAGATCTTTTACTGGAAAAGGTTATAGAGCCAAAGAACCCTTAAAACTTGTATATTCAGACCTTTGTGGTCCTATGAATGTTAAGGCAAGA
GGAGGATATGGATATCTTTACCTAATGCAACACAAGTCTGAAGCTCTTGAAAAGTTCGAAAAGTTTAAGGCTGAAGTTCAAAACTTATTAGAAAGGAGAAATAGAACCTT
GTTAGACATGGTTTGGTTTATGATGAGTTATGCTCAATTGCTTGACTCATTTTGGGGTTATGCAATAACGACTGCATTACAAATTTTGAACAATGTTCCATCCAAAAGTG
TTTCTAAAACTCCTTTGGAATTATGGAGAGGTCGTAAATATAGTTTACGACACTTCAGAATTTGGGGTTCTCTAGCATATGTGCTTGAGACAAATCCTAAGAAATTGGAA
CCTCGTTCAAGAGTATGCCTATTTGTAGGTTATCCAAAGGAAACGAGAGGTGGTTTTTTCTTTGATCCAAAACTGAATAAAGTATTTGTATCGACAAATGCTAAATTTTT
GGAAGAAGACCACATAAGAGAGCATATATCGCATATTGTTGAAGAAGCTAGTACATCAACAAGAGTTGTTAATGGGGCTTCATCAAGTCGTACACATCCTCCTCAAGAGT
GGAGGGAGCCTCATCGTACTGGGAGGGTTGTAAACCCACCTACTCTATATTTGGGTTTAATTGAAGCTCAAGACATCATACCAGATGATGAAGTTGATGATCCATTGACC
TATAATCAGGCAATGACTGATGTAGACAAGGATGAATGGGTTAAAGCTAAGGATCTTGAAATGGAGTCTATGTTCTTCAATTCACTCTGA
Protein sequenceShow/hide protein sequence
MGPFYPQLMRELIVNLPAQFIDPSSPDFQKVHIHGTYFHLSSTIINAYMGCDESPSLEKPHPSTSDLVVELIGGTKSSSPAFGQLLVSTLSVKYALLHKVGIANWCPSTH
GSGCRDESPNAAEKIQVLNIEALKNTKPPKALEIKRLALKSKLQRSINKPKGRSPKKIIKPKDRSSPKKFIKPKGRSSPKKIIKPKGRSPKMIIKPKTPKKIIKLKGQSS
PKKIIKPKGRSSPKKIIKPKVRSPKKIIKLKVLLRKRIENRAKTKQKKLSMLQPRAQHRYSVCYDFKSQSDLKFAITEECSQVPTQNVSQNLRDAYEKWIRANEKASAYI
LANFSKVFAKKHETMVTAHYGFIAKDVWTVRSWIHCKRCLDNRPIRSKHVLNMMVHFNVAEMNGAIIVEANYVSFIMESLPENFLQFRINASMHKGKRPDGEANVARFSK
KFHKGLTSGTKSASSSTRKKWRKKKGNKGGNKTNQAAGQKGKQKALGISSWHQLGTSDMTLQVGTGQIVSTITVGGKMTKRSFTGKGYRAKEPLKLVYSDLCGPMNVKAR
GGYGYLYLMQHKSEALEKFEKFKAEVQNLLERRNRTLLDMVWFMMSYAQLLDSFWGYAITTALQILNNVPSKSVSKTPLELWRGRKYSLRHFRIWGSLAYVLETNPKKLE
PRSRVCLFVGYPKETRGGFFFDPKLNKVFVSTNAKFLEEDHIREHISHIVEEASTSTRVVNGASSSRTHPPQEWREPHRTGRVVNPPTLYLGLIEAQDIIPDDEVDDPLT
YNQAMTDVDKDEWVKAKDLEMESMFFNSL