; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022427 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022427
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr7:28231398..28239935
RNA-Seq ExpressionLag0022427
SyntenyLag0022427
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]4.3e-12058.75Show/hide
Query:  HLQDGEMTL-KIGRLVKTRLLTDLEDDSLPPCESCLEDKMTKRPFTGKGYRAKEPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGHKSEA
        HL+ G + L +IGRLVK  LL  L+D SLPPCESCLE KMTKRPFTGKGYRAKEPLELIHS+LC PMNVKARG  EYFISFIDDYS+YGYL+LM HKSEA
Subjt:  HLQDGEMTL-KIGRLVKTRLLTDLEDDSLPPCESCLEDKMTKRPFTGKGYRAKEPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGHKSEA

Query:  LEKFKEFKTEVESLLGKTIKTLQLDGSGEYMDKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFWG--------------
        LEKFKE+KTEVE+LL K IK L+ D  GEYMD RFQDYMIEHGIQ QLSAPGTPQQN VSERRNRTLLDMVRSMMSYAQLPSSFWG              
Subjt:  LEKFKEFKTEVESLLGKTIKTLQLDGSGEYMDKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFWG--------------

Query:  -------------GCKPSLHYFCIWHCPAHVLVTNPKKLD--------------------------------------------------LIL-------
                     G KPSL +F IW CPAHVLVTNPKKL+                                                  L+L       
Subjt:  -------------GCKPSLHYFCIWHCPAHVLVTNPKKLD--------------------------------------------------LIL-------

Query:  ---------------------------------DGRVVIQPNRYLGLTETQVAIPDDGVDDPLSYHQAKNYVDKDEWVKAMDLEMESMYFSQVWDHVDPP
                                          GRVV QPNRYLGLTETQV IPDDGV+DPLSY QA N VDKD+WVKAMDLEMESMYF+ VW+ VD P
Subjt:  ---------------------------------DGRVVIQPNRYLGLTETQVAIPDDGVDDPLSYHQAKNYVDKDEWVKAMDLEMESMYFSQVWDHVDPP

Query:  EGVKPIGCKWTYKRKRE
        EGVKPIGCKW YKRKR+
Subjt:  EGVKPIGCKWTYKRKRE

KAA0056413.1 gag/pol protein [Cucumis melo var. makuwa]1.9e-12347.29Show/hide
Query:  VLTKRYETVEIAREIMNSLQEMFGLPSYQLHHDALKNFFNARMQEGHLVREHVLDMMNQFNIAEANGGVVFEQSQEPNLYLRLLQCGHSNKQIPKRKGDK
        +L+K++E +  AR+I +SL+EMFG PS Q+  +A+K  +NARM++G  V+EH+L+M+  FN+ + NG V  E+SQ   +   LL+      +  KRKG K
Subjt:  VLTKRYETVEIAREIMNSLQEMFGLPSYQLHHDALKNFFNARMQEGHLVREHVLDMMNQFNIAEANGGVVFEQSQEPNLYLRLLQCGHSNKQIPKRKGDK

Query:  EKAPAQAVNMARERPRSCLTRTSVSTAMWMVTRRGTALVTLLRKREKRKGNDFKHLQDGEMTLK------------------------IGRLVKTRLLTD
        EK P  A                                    K + ++ + F+ L++GEMTLK                        IGRLVK  LL +
Subjt:  EKAPAQAVNMARERPRSCLTRTSVSTAMWMVTRRGTALVTLLRKREKRKGNDFKHLQDGEMTLK------------------------IGRLVKTRLLTD

Query:  LEDDSLPPCESCLEDKMTKRPFTGKGYRAKEPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGHKSEALEKFKEFKTEVESLLGKTIKTLQ
        LEDDSLPPCESCLE KMTKR FTGKGYRAKEPLELIHS+LC PMNVK R   EYFISFIDDYS+YGYL+LM +KSE LEKFKE+K EVE+LL K IK L+
Subjt:  LEDDSLPPCESCLEDKMTKRPFTGKGYRAKEPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGHKSEALEKFKEFKTEVESLLGKTIKTLQ

Query:  LDGSGEYMDKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFWG---------------------------GCKPSLHYFC
         D  GEYMD RFQDYMI+H IQ QLSAP TPQQN VSERRNRTLLDMV SMMSYA LPSSFWG                           G KPSL +F 
Subjt:  LDGSGEYMDKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFWG---------------------------GCKPSLHYFC

Query:  IWHCPAHVLVTNPKKLD--------------------------------------------------LILD------GRVV--IQPNRYLGLTET-----
        IW C AHVLVTNPKKL+                                                  L+L+       RVV  + P+  +  T T     
Subjt:  IWHCPAHVLVTNPKKLD--------------------------------------------------LILD------GRVV--IQPNRYLGLTET-----

Query:  ---QVAIPD-DGVDDPLSYHQAKNYVDKDEWVKAMDLEMESMYFSQVWDHVDPPEGVKPIGCKWTYKRKRE
            + +P  DG++DPLSY +A NYVDKD+WVKAM+LEMESMYF+ VW+ V+ PEGVKPIGCKW YKRKR+
Subjt:  ---QVAIPD-DGVDDPLSYHQAKNYVDKDEWVKAMDLEMESMYFSQVWDHVDPPEGVKPIGCKWTYKRKRE

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]4.9e-12446.93Show/hide
Query:  VLTKRYETVEIAREIMNSLQEMFGLPSYQLHHDALKNFFNARMQEGHLVREHVLDMMNQFNIAEANGGVVFEQSQEPNLYLRLLQCGHSNKQIPKRKGDK
        +L+K++E +  AR+IM+SL+EMFG PS Q+  +A                          N+A +                R +     +++I KRK  K
Subjt:  VLTKRYETVEIAREIMNSLQEMFGLPSYQLHHDALKNFFNARMQEGHLVREHVLDMMNQFNIAEANGGVVFEQSQEPNLYLRLLQCGHSNKQIPKRKGDK

Query:  EKAPAQAVNMARER----PRSCL-----TRTSVSTAMWMVTRRGTALVTLLRKREKRKGNDFKHLQDGEMTLK------------------------IGR
         K P  AV    +      R C           +   ++V ++     T       ++ + FK L+D EMTLK                        IGR
Subjt:  EKAPAQAVNMARER----PRSCL-----TRTSVSTAMWMVTRRGTALVTLLRKREKRKGNDFKHLQDGEMTLK------------------------IGR

Query:  LVKTRLLTDLEDDSLPPCESCLEDKMTKRPFTGKGYRAKEPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGHKSEALEKFKEFKTEVESL
        LVK  LL  L+D SLPPCESCLE KMTKRPFTGKGYRAKEPLELIHS+LC PMNVKARG  EYFISFIDDYS+YGYL+LM HKSEALEKFKE+KTEVE+L
Subjt:  LVKTRLLTDLEDDSLPPCESCLEDKMTKRPFTGKGYRAKEPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGHKSEALEKFKEFKTEVESL

Query:  LGKTIKTLQLDGSGEYMDKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFWG---------------------------G
        L K IK L+ D  GEYMD RFQDYMIEHGIQ QLSAPGTPQQN VSERRNRTLLDMVRSMMSYAQLPSSFWG                           G
Subjt:  LGKTIKTLQLDGSGEYMDKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFWG---------------------------G

Query:  CKPSLHYFCIWHCPAHVLVTNPKKLD--------------------------------------------------LIL---------------------
         KPSL +F IW CPAHVLVTNPKKL+                                                  L+L                     
Subjt:  CKPSLHYFCIWHCPAHVLVTNPKKLD--------------------------------------------------LIL---------------------

Query:  -------------------DGRVVIQPNRYLGLTETQVAIPDDGVDDPLSYHQAKNYVDKDEWVKAMDLEMESMYFSQVWDHVDPPEGVKPIGCKWTYKR
                            GRVV QPNRYLGLTETQV IPDDGV+DPLSY QA N VDKD+WVKAMDLEMESMYF+ VW+ VD PEGVKPIGCKW YKR
Subjt:  -------------------DGRVVIQPNRYLGLTETQVAIPDDGVDDPLSYHQAKNYVDKDEWVKAMDLEMESMYFSQVWDHVDPPEGVKPIGCKWTYKR

Query:  KRE
        KR+
Subjt:  KRE

TYJ97618.1 gag/pol protein [Cucumis melo var. makuwa]3.7e-11944.3Show/hide
Query:  ANRNHFAAAELGVAECSVLTKRYETVEIAREIMNSLQEMFGLPSYQLHHDALKNFFNARMQEGHLVREHVLDMMNQFNIAEANGGVVFEQSQEPNLYLRL
        A  N    A +  +   VL K++E++  AREIM+SLQEMFG  SYQ++HDALK  +NARM EG  VREHVL+MM  FN+AE NG V+ E SQ        
Subjt:  ANRNHFAAAELGVAECSVLTKRYETVEIAREIMNSLQEMFGLPSYQLHHDALKNFFNARMQEGHLVREHVLDMMNQFNIAEANGGVVFEQSQEPNLYLRL

Query:  LQCGHSNKQIPKRK-------GDKEKAPAQAVNMARERPRSCLTRTSVSTAMWMVTRRGTALVTLLRKREKR-KGNDFKHL------QDGEMTL-KIGRL
         Q G +N     RK       G K    +      +++      + +++ A      + T  +     +E   K N  K+L      + G + L +I RL
Subjt:  LQCGHSNKQIPKRK-------GDKEKAPAQAVNMARERPRSCLTRTSVSTAMWMVTRRGTALVTLLRKREKR-KGNDFKHL------QDGEMTL-KIGRL

Query:  VKTRLLTDLEDDSLPPCESCLEDKMTKRPFTGKGYRAKEPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGHKSEALEKFKEFKTEVESLL
        VK  +L++LE++SLP CESCLE KMTKRPFTGKG+RAKEPLEL+HS+LC PMNVKARG  EYFI+F DDYS+YGY++LM HKSEALEKFKE+K EVE+ L
Subjt:  VKTRLLTDLEDDSLPPCESCLEDKMTKRPFTGKGYRAKEPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGHKSEALEKFKEFKTEVESLL

Query:  GKTIKTLQLDGSGEYMDKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFWG---------------------------GC
         KTIKT + D  GEYMD +FQ+Y++E  I  QLSAPGTPQQN VSERRNRTLLDMVRSM+SYA LP+SFWG                           G 
Subjt:  GKTIKTLQLDGSGEYMDKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFWG---------------------------GC

Query:  KPSLHYFCIWHCPAHVLVTNPKKLD--------------------------------------------------LILD---------------------
        K SL +F IW CPAHVL  NPKKL+                                                  ++L+                     
Subjt:  KPSLHYFCIWHCPAHVLVTNPKKLD--------------------------------------------------LILD---------------------

Query:  -----------------------GRVVIQPNRYLGLTETQVAIPDDGVDDPLSYHQAKNYVDKDEWVKAMDLEMESMYFSQVWDHVDPPEGVKPIGCKWT
                               GRV   P RY+ LTET   I D  ++DPL++ +A   VDKDEW+KAM+LE+ESMYF+ VWD VD P+GVKPIGCKW 
Subjt:  -----------------------GRVVIQPNRYLGLTETQVAIPDDGVDDPLSYHQAKNYVDKDEWVKAMDLEMESMYFSQVWDHVDPPEGVKPIGCKWT

Query:  YKRKR
        YKRKR
Subjt:  YKRKR

TYK02840.1 gag/pol protein [Cucumis melo var. makuwa]4.9e-12446.93Show/hide
Query:  VLTKRYETVEIAREIMNSLQEMFGLPSYQLHHDALKNFFNARMQEGHLVREHVLDMMNQFNIAEANGGVVFEQSQEPNLYLRLLQCGHSNKQIPKRKGDK
        +L+K++E +  AR+IM+SL+EMFG PS Q+  +A                          N+A +                R +     +++I KRK  K
Subjt:  VLTKRYETVEIAREIMNSLQEMFGLPSYQLHHDALKNFFNARMQEGHLVREHVLDMMNQFNIAEANGGVVFEQSQEPNLYLRLLQCGHSNKQIPKRKGDK

Query:  EKAPAQAVNMARER----PRSCL-----TRTSVSTAMWMVTRRGTALVTLLRKREKRKGNDFKHLQDGEMTLK------------------------IGR
         K P  AV    +      R C           +   ++V ++     T       ++ + FK L+D EMTLK                        IGR
Subjt:  EKAPAQAVNMARER----PRSCL-----TRTSVSTAMWMVTRRGTALVTLLRKREKRKGNDFKHLQDGEMTLK------------------------IGR

Query:  LVKTRLLTDLEDDSLPPCESCLEDKMTKRPFTGKGYRAKEPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGHKSEALEKFKEFKTEVESL
        LVK  LL  L+D SLPPCESCLE KMTKRPFTGKGYRAKEPLELIHS+LC PMNVKARG  EYFISFIDDYS+YGYL+LM HKSEALEKFKE+KTEVE+L
Subjt:  LVKTRLLTDLEDDSLPPCESCLEDKMTKRPFTGKGYRAKEPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGHKSEALEKFKEFKTEVESL

Query:  LGKTIKTLQLDGSGEYMDKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFWG---------------------------G
        L K IK L+ D  GEYMD RFQDYMIEHGIQ QLSAPGTPQQN VSERRNRTLLDMVRSMMSYAQLPSSFWG                           G
Subjt:  LGKTIKTLQLDGSGEYMDKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFWG---------------------------G

Query:  CKPSLHYFCIWHCPAHVLVTNPKKLD--------------------------------------------------LIL---------------------
         KPSL +F IW CPAHVLVTNPKKL+                                                  L+L                     
Subjt:  CKPSLHYFCIWHCPAHVLVTNPKKLD--------------------------------------------------LIL---------------------

Query:  -------------------DGRVVIQPNRYLGLTETQVAIPDDGVDDPLSYHQAKNYVDKDEWVKAMDLEMESMYFSQVWDHVDPPEGVKPIGCKWTYKR
                            GRVV QPNRYLGLTETQV IPDDGV+DPLSY QA N VDKD+WVKAMDLEMESMYF+ VW+ VD PEGVKPIGCKW YKR
Subjt:  -------------------DGRVVIQPNRYLGLTETQVAIPDDGVDDPLSYHQAKNYVDKDEWVKAMDLEMESMYFSQVWDHVDPPEGVKPIGCKWTYKR

Query:  KRE
        KR+
Subjt:  KRE

TrEMBL top hitse value%identityAlignment
A0A5A7TZD0 Gag/pol protein2.1e-12058.75Show/hide
Query:  HLQDGEMTL-KIGRLVKTRLLTDLEDDSLPPCESCLEDKMTKRPFTGKGYRAKEPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGHKSEA
        HL+ G + L +IGRLVK  LL  L+D SLPPCESCLE KMTKRPFTGKGYRAKEPLELIHS+LC PMNVKARG  EYFISFIDDYS+YGYL+LM HKSEA
Subjt:  HLQDGEMTL-KIGRLVKTRLLTDLEDDSLPPCESCLEDKMTKRPFTGKGYRAKEPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGHKSEA

Query:  LEKFKEFKTEVESLLGKTIKTLQLDGSGEYMDKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFWG--------------
        LEKFKE+KTEVE+LL K IK L+ D  GEYMD RFQDYMIEHGIQ QLSAPGTPQQN VSERRNRTLLDMVRSMMSYAQLPSSFWG              
Subjt:  LEKFKEFKTEVESLLGKTIKTLQLDGSGEYMDKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFWG--------------

Query:  -------------GCKPSLHYFCIWHCPAHVLVTNPKKLD--------------------------------------------------LIL-------
                     G KPSL +F IW CPAHVLVTNPKKL+                                                  L+L       
Subjt:  -------------GCKPSLHYFCIWHCPAHVLVTNPKKLD--------------------------------------------------LIL-------

Query:  ---------------------------------DGRVVIQPNRYLGLTETQVAIPDDGVDDPLSYHQAKNYVDKDEWVKAMDLEMESMYFSQVWDHVDPP
                                          GRVV QPNRYLGLTETQV IPDDGV+DPLSY QA N VDKD+WVKAMDLEMESMYF+ VW+ VD P
Subjt:  ---------------------------------DGRVVIQPNRYLGLTETQVAIPDDGVDDPLSYHQAKNYVDKDEWVKAMDLEMESMYFSQVWDHVDPP

Query:  EGVKPIGCKWTYKRKRE
        EGVKPIGCKW YKRKR+
Subjt:  EGVKPIGCKWTYKRKRE

A0A5A7UYE8 Gag/pol protein2.4e-12446.93Show/hide
Query:  VLTKRYETVEIAREIMNSLQEMFGLPSYQLHHDALKNFFNARMQEGHLVREHVLDMMNQFNIAEANGGVVFEQSQEPNLYLRLLQCGHSNKQIPKRKGDK
        +L+K++E +  AR+IM+SL+EMFG PS Q+  +A                          N+A +                R +     +++I KRK  K
Subjt:  VLTKRYETVEIAREIMNSLQEMFGLPSYQLHHDALKNFFNARMQEGHLVREHVLDMMNQFNIAEANGGVVFEQSQEPNLYLRLLQCGHSNKQIPKRKGDK

Query:  EKAPAQAVNMARER----PRSCL-----TRTSVSTAMWMVTRRGTALVTLLRKREKRKGNDFKHLQDGEMTLK------------------------IGR
         K P  AV    +      R C           +   ++V ++     T       ++ + FK L+D EMTLK                        IGR
Subjt:  EKAPAQAVNMARER----PRSCL-----TRTSVSTAMWMVTRRGTALVTLLRKREKRKGNDFKHLQDGEMTLK------------------------IGR

Query:  LVKTRLLTDLEDDSLPPCESCLEDKMTKRPFTGKGYRAKEPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGHKSEALEKFKEFKTEVESL
        LVK  LL  L+D SLPPCESCLE KMTKRPFTGKGYRAKEPLELIHS+LC PMNVKARG  EYFISFIDDYS+YGYL+LM HKSEALEKFKE+KTEVE+L
Subjt:  LVKTRLLTDLEDDSLPPCESCLEDKMTKRPFTGKGYRAKEPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGHKSEALEKFKEFKTEVESL

Query:  LGKTIKTLQLDGSGEYMDKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFWG---------------------------G
        L K IK L+ D  GEYMD RFQDYMIEHGIQ QLSAPGTPQQN VSERRNRTLLDMVRSMMSYAQLPSSFWG                           G
Subjt:  LGKTIKTLQLDGSGEYMDKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFWG---------------------------G

Query:  CKPSLHYFCIWHCPAHVLVTNPKKLD--------------------------------------------------LIL---------------------
         KPSL +F IW CPAHVLVTNPKKL+                                                  L+L                     
Subjt:  CKPSLHYFCIWHCPAHVLVTNPKKLD--------------------------------------------------LIL---------------------

Query:  -------------------DGRVVIQPNRYLGLTETQVAIPDDGVDDPLSYHQAKNYVDKDEWVKAMDLEMESMYFSQVWDHVDPPEGVKPIGCKWTYKR
                            GRVV QPNRYLGLTETQV IPDDGV+DPLSY QA N VDKD+WVKAMDLEMESMYF+ VW+ VD PEGVKPIGCKW YKR
Subjt:  -------------------DGRVVIQPNRYLGLTETQVAIPDDGVDDPLSYHQAKNYVDKDEWVKAMDLEMESMYFSQVWDHVDPPEGVKPIGCKWTYKR

Query:  KRE
        KR+
Subjt:  KRE

A0A5A7UYX7 Gag/pol protein1.8e-11944.3Show/hide
Query:  ANRNHFAAAELGVAECSVLTKRYETVEIAREIMNSLQEMFGLPSYQLHHDALKNFFNARMQEGHLVREHVLDMMNQFNIAEANGGVVFEQSQEPNLYLRL
        A  N    A +  +   VL K++E++  AREIM+SLQEMFG  SYQ++HDALK  +NARM EG  VREHVL+MM  FN+AE NG V+ E SQ        
Subjt:  ANRNHFAAAELGVAECSVLTKRYETVEIAREIMNSLQEMFGLPSYQLHHDALKNFFNARMQEGHLVREHVLDMMNQFNIAEANGGVVFEQSQEPNLYLRL

Query:  LQCGHSNKQIPKRK-------GDKEKAPAQAVNMARERPRSCLTRTSVSTAMWMVTRRGTALVTLLRKREKR-KGNDFKHL------QDGEMTL-KIGRL
         Q G +N     RK       G K    +      +++      + +++ A      + T  +     +E   K N  K+L      + G + L +I RL
Subjt:  LQCGHSNKQIPKRK-------GDKEKAPAQAVNMARERPRSCLTRTSVSTAMWMVTRRGTALVTLLRKREKR-KGNDFKHL------QDGEMTL-KIGRL

Query:  VKTRLLTDLEDDSLPPCESCLEDKMTKRPFTGKGYRAKEPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGHKSEALEKFKEFKTEVESLL
        VK  +L++LE++SLP CESCLE KMTKRPFTGKG+RAKEPLEL+HS+LC PMNVKARG  EYFI+F DDYS+YGY++LM HKSEALEKFKE+K EVE+ L
Subjt:  VKTRLLTDLEDDSLPPCESCLEDKMTKRPFTGKGYRAKEPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGHKSEALEKFKEFKTEVESLL

Query:  GKTIKTLQLDGSGEYMDKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFWG---------------------------GC
         KTIKT + D  GEYMD +FQ+Y++E  I  QLSAPGTPQQN VSERRNRTLLDMVRSM+SYA LP+SFWG                           G 
Subjt:  GKTIKTLQLDGSGEYMDKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFWG---------------------------GC

Query:  KPSLHYFCIWHCPAHVLVTNPKKLD--------------------------------------------------LILD---------------------
        K SL +F IW CPAHVL  NPKKL+                                                  ++L+                     
Subjt:  KPSLHYFCIWHCPAHVLVTNPKKLD--------------------------------------------------LILD---------------------

Query:  -----------------------GRVVIQPNRYLGLTETQVAIPDDGVDDPLSYHQAKNYVDKDEWVKAMDLEMESMYFSQVWDHVDPPEGVKPIGCKWT
                               GRV   P RY+ LTET   I D  ++DPL++ +A   VDKDEW+KAM+LE+ESMYF+ VWD VD P+GVKPIGCKW 
Subjt:  -----------------------GRVVIQPNRYLGLTETQVAIPDDGVDDPLSYHQAKNYVDKDEWVKAMDLEMESMYFSQVWDHVDPPEGVKPIGCKWT

Query:  YKRKR
        YKRKR
Subjt:  YKRKR

A0A5D3BUN8 Gag/pol protein2.4e-12446.93Show/hide
Query:  VLTKRYETVEIAREIMNSLQEMFGLPSYQLHHDALKNFFNARMQEGHLVREHVLDMMNQFNIAEANGGVVFEQSQEPNLYLRLLQCGHSNKQIPKRKGDK
        +L+K++E +  AR+IM+SL+EMFG PS Q+  +A                          N+A +                R +     +++I KRK  K
Subjt:  VLTKRYETVEIAREIMNSLQEMFGLPSYQLHHDALKNFFNARMQEGHLVREHVLDMMNQFNIAEANGGVVFEQSQEPNLYLRLLQCGHSNKQIPKRKGDK

Query:  EKAPAQAVNMARER----PRSCL-----TRTSVSTAMWMVTRRGTALVTLLRKREKRKGNDFKHLQDGEMTLK------------------------IGR
         K P  AV    +      R C           +   ++V ++     T       ++ + FK L+D EMTLK                        IGR
Subjt:  EKAPAQAVNMARER----PRSCL-----TRTSVSTAMWMVTRRGTALVTLLRKREKRKGNDFKHLQDGEMTLK------------------------IGR

Query:  LVKTRLLTDLEDDSLPPCESCLEDKMTKRPFTGKGYRAKEPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGHKSEALEKFKEFKTEVESL
        LVK  LL  L+D SLPPCESCLE KMTKRPFTGKGYRAKEPLELIHS+LC PMNVKARG  EYFISFIDDYS+YGYL+LM HKSEALEKFKE+KTEVE+L
Subjt:  LVKTRLLTDLEDDSLPPCESCLEDKMTKRPFTGKGYRAKEPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGHKSEALEKFKEFKTEVESL

Query:  LGKTIKTLQLDGSGEYMDKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFWG---------------------------G
        L K IK L+ D  GEYMD RFQDYMIEHGIQ QLSAPGTPQQN VSERRNRTLLDMVRSMMSYAQLPSSFWG                           G
Subjt:  LGKTIKTLQLDGSGEYMDKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFWG---------------------------G

Query:  CKPSLHYFCIWHCPAHVLVTNPKKLD--------------------------------------------------LIL---------------------
         KPSL +F IW CPAHVLVTNPKKL+                                                  L+L                     
Subjt:  CKPSLHYFCIWHCPAHVLVTNPKKLD--------------------------------------------------LIL---------------------

Query:  -------------------DGRVVIQPNRYLGLTETQVAIPDDGVDDPLSYHQAKNYVDKDEWVKAMDLEMESMYFSQVWDHVDPPEGVKPIGCKWTYKR
                            GRVV QPNRYLGLTETQV IPDDGV+DPLSY QA N VDKD+WVKAMDLEMESMYF+ VW+ VD PEGVKPIGCKW YKR
Subjt:  -------------------DGRVVIQPNRYLGLTETQVAIPDDGVDDPLSYHQAKNYVDKDEWVKAMDLEMESMYFSQVWDHVDPPEGVKPIGCKWTYKR

Query:  KRE
        KR+
Subjt:  KRE

A0A5D3DZL3 Gag/pol protein9.1e-12447.29Show/hide
Query:  VLTKRYETVEIAREIMNSLQEMFGLPSYQLHHDALKNFFNARMQEGHLVREHVLDMMNQFNIAEANGGVVFEQSQEPNLYLRLLQCGHSNKQIPKRKGDK
        +L+K++E +  AR+I +SL+EMFG PS Q+  +A+K  +NARM++G  V+EH+L+M+  FN+ + NG V  E+SQ   +   LL+      +  KRKG K
Subjt:  VLTKRYETVEIAREIMNSLQEMFGLPSYQLHHDALKNFFNARMQEGHLVREHVLDMMNQFNIAEANGGVVFEQSQEPNLYLRLLQCGHSNKQIPKRKGDK

Query:  EKAPAQAVNMARERPRSCLTRTSVSTAMWMVTRRGTALVTLLRKREKRKGNDFKHLQDGEMTLK------------------------IGRLVKTRLLTD
        EK P  A                                    K + ++ + F+ L++GEMTLK                        IGRLVK  LL +
Subjt:  EKAPAQAVNMARERPRSCLTRTSVSTAMWMVTRRGTALVTLLRKREKRKGNDFKHLQDGEMTLK------------------------IGRLVKTRLLTD

Query:  LEDDSLPPCESCLEDKMTKRPFTGKGYRAKEPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGHKSEALEKFKEFKTEVESLLGKTIKTLQ
        LEDDSLPPCESCLE KMTKR FTGKGYRAKEPLELIHS+LC PMNVK R   EYFISFIDDYS+YGYL+LM +KSE LEKFKE+K EVE+LL K IK L+
Subjt:  LEDDSLPPCESCLEDKMTKRPFTGKGYRAKEPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGHKSEALEKFKEFKTEVESLLGKTIKTLQ

Query:  LDGSGEYMDKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFWG---------------------------GCKPSLHYFC
         D  GEYMD RFQDYMI+H IQ QLSAP TPQQN VSERRNRTLLDMV SMMSYA LPSSFWG                           G KPSL +F 
Subjt:  LDGSGEYMDKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFWG---------------------------GCKPSLHYFC

Query:  IWHCPAHVLVTNPKKLD--------------------------------------------------LILD------GRVV--IQPNRYLGLTET-----
        IW C AHVLVTNPKKL+                                                  L+L+       RVV  + P+  +  T T     
Subjt:  IWHCPAHVLVTNPKKLD--------------------------------------------------LILD------GRVV--IQPNRYLGLTET-----

Query:  ---QVAIPD-DGVDDPLSYHQAKNYVDKDEWVKAMDLEMESMYFSQVWDHVDPPEGVKPIGCKWTYKRKRE
            + +P  DG++DPLSY +A NYVDKD+WVKAM+LEMESMYF+ VW+ V+ PEGVKPIGCKW YKRKR+
Subjt:  ---QVAIPD-DGVDDPLSYHQAKNYVDKDEWVKAMDLEMESMYFSQVWDHVDPPEGVKPIGCKWTYKRKRE

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.6e-2430.37Show/hide
Query:  FKHLQDGE-MTLKIGRLVKTRLLTDLEDDSLPPCESCLEDKMTKRPFTGKGYRA--KEPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGH
        F H+ DG+ + +K   +   + L +  + S   CE CL  K  + PF     +   K PL ++HS++C P+         YF+ F+D ++ Y   +L+ +
Subjt:  FKHLQDGE-MTLKIGRLVKTRLLTDLEDDSLPPCESCLEDKMTKRPFTGKGYRA--KEPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGH

Query:  KSEALEKFKEFKTEVESLLGKTIKTLQLDGSGEYMDKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFWGGCKPSLHYFC
        KS+    F++F  + E+     +  L +D   EY+    + + ++ GI + L+ P TPQ N VSER  RT+ +  R+M+S A+L  SFWG    +  Y  
Subjt:  KSEALEKFKEFKTEVESLLGKTIKTLQLDGSGEYMDKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFWGGCKPSLHYFC

Query:  IWHCPAHVLVTNPK
        I   P+  LV + K
Subjt:  IWHCPAHVLVTNPK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-3236.51Show/hide
Query:  DFKHLQDGEMTLK-IGRLVKTRLLTDLEDDSLPPCESCLEDKMTKRPFTGKGYRAKEPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGHK
        D  H + G M+ K +  L K  L++  +  ++ PC+ CL  K  +  F     R    L+L++S++C PM +++ G  +YF++FIDD S+  +++++  K
Subjt:  DFKHLQDGEMTLK-IGRLVKTRLLTDLEDDSLPPCESCLEDKMTKRPFTGKGYRAKEPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGHK

Query:  SEALEKFKEFKTEVESLLGKTIKTLQLDGSGEYMDKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFWG
         +  + F++F   VE   G+ +K L+ D  GEY  + F++Y   HGI+ + + PGTPQ N V+ER NRT+++ VRSM+  A+LP SFWG
Subjt:  SEALEKFKEFKTEVESLLGKTIKTLQLDGSGEYMDKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFWG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-0334.85Show/hide
Query:  VAIPDDGVDDPLSYHQAKNYVDKDEWVKAMDLEMESMYFSQVWDHVDPPEGVKPIGCKWTYKRKRE
        V I DD   +P S  +  ++ +K++ +KAM  EMES+  +  +  V+ P+G +P+ CKW +K K++
Subjt:  VAIPDDGVDDPLSYHQAKNYVDKDEWVKAMDLEMESMYFSQVWDHVDPPEGVKPIGCKWTYKRKRE

Q12491 Transposon Ty2-B Gag-Pol polyprotein7.7e-1126.09Show/hide
Query:  CESCLEDKMTKRPFTGKGYRAK-----EPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGHKSE--ALEKFKEFKTEVESLLGKTIKTLQL
        C  CL  K TK     KG R K     EP + +H+++  P++   +    YFISF D+ +++ +++ +  + E   L  F      +++     +  +Q+
Subjt:  CESCLEDKMTKRPFTGKGYRAK-----EPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGHKSE--ALEKFKEFKTEVESLLGKTIKTLQL

Query:  DGSGEYMDKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFW
        D   EY +K    +    GI    +     + + V+ER NRTLL+  R+++  + LP+  W
Subjt:  DGSGEYMDKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.5e-1729.87Show/hide
Query:  CESCLEDKMTKRPFTGKGYRAKEPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGHKSEALEKFKEFKTEVESLLGKTIKTLQLDGSGEYM
        C  CL +K  K PF+     +  PLE I+S++     + +     Y++ F+D +++Y +L+ +  KS+  E F  FK  +E+     I T   D  GE++
Subjt:  CESCLEDKMTKRPFTGKGYRAKEPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGHKSEALEKFKEFKTEVESLLGKTIKTLQLDGSGEYM

Query:  DKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFW
             +Y  +HGI    S P TP+ N +SER++R +++   +++S+A +P ++W
Subjt:  DKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFW

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.0e-1831.82Show/hide
Query:  CESCLEDKMTKRPFTGKGYRAKEPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGHKSEALEKFKEFKTEVESLLGKTIKTLQLDGSGEYM
        C  C  +K  K PF+     + +PLE I+S++     +       Y++ F+D +++Y +L+ +  KS+  + F  FK+ VE+     I TL  D  GE++
Subjt:  CESCLEDKMTKRPFTGKGYRAKEPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGHKSEALEKFKEFKTEVESLLGKTIKTLQLDGSGEYM

Query:  DKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFW
          R  DY+ +HGI    S P TP+ N +SER++R +++M  +++S+A +P ++W
Subjt:  DKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFW

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.8e-0540Show/hide
Query:  DPLSYHQAKNYVDKDEWVKAMDLEMESMYFSQVWDHVDPPEGVKPIGCKWTYKRK
        +P +Y++AK ++    W  AMD E+ +M  +  W+    P   KPIGCKW YK K
Subjt:  DPLSYHQAKNYVDKDEWVKAMDLEMESMYFSQVWDHVDPPEGVKPIGCKWTYKRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTCTTACTTATCTTTTAGAAGTTGAATTATTCGATGTATGGGGTATTGATTTTATGGGGCCATTTCCCCCTTCTATTGGCAATGTTTTTATCTTATTAGCAGTTGA
TTATGTGTCCAAATGGGTGGAGGCCATTGCATGCCATCAGAGTGATGCCAAGACAGTTGCAAGGTTGGATGAGGCTCTTTGGGCTTATAGGACAGCCTATAAGACTCCTC
TAGGAGCAATAAGAATGCTGCAGCTTAATGAATTAGAGGAGTTTCGCCAATTTTCTTATGAGAATGCAAAAATGTATAAAGAAAAGACTAAGTTGTGGCATGACAAGAGA
ATTAAATCTAAAGAGTTTGTCAAGGGTCAAAAAGTCTTGCTTTATAATTCTAGATTGAAATTGTTTCTTGGGAAACTAAAATCTAAATGGTCAGGACCATTTACTGTGGT
TGAGGTTCAGAAGATTGTTGCGTCAAAGATAAAGCTGGAGCAAATTTTTCGTACGAATAGGAAGGATTTTAGTACTTTGGTAAGTTTCTCCTCACTTCGTCTTCTGGTTT
CAATCTTGCCATCTACATTATTTCTTTCTCCTTTTTCATTATCTGTAAAACCATTTGAGCTATCTATGGCCAAAACAAGATCTAGGAAAGAGAGGGAGAGTGAAGAGGAG
GAGGTACCGATCACGCCGGAAGTGCAAAAAGGGAAAACCAAAAAGAAAAGAACGCCGGAGGAAAAAGAAGCAAAAAGAAAAGAACAGGAGGAAGTTCAGGAGGTGGCAGA
AGTTGTTGCCACTACTGCGGAAGAAGGAATTACTCAAGAACCTGAAGTGCAAAACCCAGATACGGTTCAAGAAAAGATTGCTGAGAAAAATCAAGAAACAGAGGTTGAGG
AGCAGGCTGTAGGTGAGCCTGACAAGGAGAAAACACCGGCGCAGGAGGCTCATGTTGAAGTCATTATACCTGAACCACCCAGACGTCGCCGCATCAAGTGGAAGGCGGGT
CGCGTGAGAGTGATTCGGAGCACTCCATCGCCTCTGACGTCAGACTTTGAGGAAGATAAAAGAGAAGAGGAAAATACGACAAAAGAAGAAGAGGCAAGGAAGGCAGAAGA
TGAGCGTTTGCGAGAACAGAGAGAAAGCAAGGGCAAAGGAAATGCCGAAGCGTCGGGTGAGATTGAGGAGCCGAGGGCACCATTCATTCGCTTCGTCAATGAACTTGCCA
GAGCAAAATATCAGGAAGTACTGAAGCGTGATTTCTTGTTCGAGCGAGGATTTGGCAGTGATTTGCCCAGGTTTTTGGAGTCTGGAATAGCGAGCCTAGGGTGGAGGCAG
TTTTGTGCGAAGCCTGATCCTGTCAATGCTAACTTCGTTCGGGAATTCTACGCCAATCTTGACGTGAAGGATGATTTCGAAGTCATAGTGCGAGGAGTGCCTGTCCAATG
GAGCCTAGAGGCCATTAATAATTTGTTTGATCTTCAGTACTTTCCACACGCAGTTTTCAATGAAATGGTGGTTGCCCCATCGAGCGACCAATTAAGCGCGGTGGTCCGAG
AGCGCAAGAAGGTAGGGAAGCTGTTCTTTCTAAACACGATTACAGTGTTATGCAGCAGGGCGAGAGTGCCCATGATTCCAAAAGATATGATTATGCTTGATAAGGGAGTC
ATTGACACACCTAATCTGGCGAGGCTTCAGCGTACGCAAAAGGCTCGCAAGGGTGGGCTTGTGTATGGCGTTCATCAGATCCTAGAGCAACTGACATTGGTGGCCAACAG
GAATCATTTTGCTGCAGCAGAGCTCGGTGTTGCAGAGTGCTCAGTTTTGACCAAGAGGTATGAGACCGTGGAGATAGCAAGGGAAATTATGAATTCCCTTCAGGAGATGT
TTGGACTTCCGTCCTATCAACTCCACCATGACGCTCTGAAGAATTTTTTCAATGCCCGCATGCAAGAGGGGCATCTTGTTCGGGAACACGTTCTCGACATGATGAACCAA
TTTAACATCGCGGAGGCAAATGGCGGGGTCGTCTTCGAGCAAAGTCAGGAACCAAATCTGTACCTCAGGCTTCTTCAATGTGGCCATTCCAATAAGCAGATTCCAAAGAG
GAAGGGTGACAAGGAGAAGGCTCCTGCTCAAGCTGTGAACATGGCAAGGGAAAGGCCAAGGTCGTGCCTGACAAGGACAAGTGTTTCCACCGCAATGTGGATGGTCACTA
GAAGAGGAACTGCCCTCGTTACCTTGCTGAGAAAAAGAGAGAAAAGGAAGGGAAATGATTTCAAGCATCTTCAAGATGGTGAGATGACTCTCAAGATCGGTCGTTTGGTA
AAAACTAGACTTCTAACTGATTTAGAAGATGATTCTTTACCACCCTGTGAATCGTGTCTCGAGGATAAAATGACTAAGAGGCCTTTTACTGGAAAAGGTTATCGTGCCAA
AGAACCTCTAGAGTTGATACATTCAAATCTTTGTTGTCCAATGAATGTAAAAGCTCGAGGAGTTTGTGAATATTTCATCTCTTTCATAGATGATTATTCTCAATATGGTT
ACTTACATCTAATGGGTCATAAGTCTGAAGCTCTTGAAAAGTTTAAAGAGTTTAAGACTGAAGTAGAAAGCCTATTGGGTAAAACAATCAAAACACTTCAATTAGATGGA
AGTGGAGAGTATATGGACAAAAGATTCCAGGACTATATGATAGAACATGGAATCCAATTCCAACTCTCAGCACCTGGCACACCTCAACAAAATGATGTATCAGAAAGGAG
AAATAGAACCTTGTTAGACATGGTTCGTTCAATGATGAGCTATGCTCAATTGCCTAGCTCATTTTGGGGGGGGTGTAAGCCTAGTTTGCATTACTTCTGTATCTGGCATT
GTCCTGCACACGTGCTAGTGACAAATCCTAAGAAACTGGACCTCATTCTAGATGGGAGGGTTGTAATACAACCTAACCGTTACTTGGGTTTAACTGAAACACAAGTAGCC
ATACCTGATGACGGTGTTGATGATCCATTGTCTTATCATCAGGCAAAAAATTATGTAGATAAAGACGAATGGGTCAAAGCAATGGACCTTGAGATGGAGTCTATGTACTT
CAGTCAAGTTTGGGATCATGTAGATCCACCTGAAGGGGTCAAACCCATAGGGTGTAAATGGACCTATAAGAGGAAAAGAGAGATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCTCTTACTTATCTTTTAGAAGTTGAATTATTCGATGTATGGGGTATTGATTTTATGGGGCCATTTCCCCCTTCTATTGGCAATGTTTTTATCTTATTAGCAGTTGA
TTATGTGTCCAAATGGGTGGAGGCCATTGCATGCCATCAGAGTGATGCCAAGACAGTTGCAAGGTTGGATGAGGCTCTTTGGGCTTATAGGACAGCCTATAAGACTCCTC
TAGGAGCAATAAGAATGCTGCAGCTTAATGAATTAGAGGAGTTTCGCCAATTTTCTTATGAGAATGCAAAAATGTATAAAGAAAAGACTAAGTTGTGGCATGACAAGAGA
ATTAAATCTAAAGAGTTTGTCAAGGGTCAAAAAGTCTTGCTTTATAATTCTAGATTGAAATTGTTTCTTGGGAAACTAAAATCTAAATGGTCAGGACCATTTACTGTGGT
TGAGGTTCAGAAGATTGTTGCGTCAAAGATAAAGCTGGAGCAAATTTTTCGTACGAATAGGAAGGATTTTAGTACTTTGGTAAGTTTCTCCTCACTTCGTCTTCTGGTTT
CAATCTTGCCATCTACATTATTTCTTTCTCCTTTTTCATTATCTGTAAAACCATTTGAGCTATCTATGGCCAAAACAAGATCTAGGAAAGAGAGGGAGAGTGAAGAGGAG
GAGGTACCGATCACGCCGGAAGTGCAAAAAGGGAAAACCAAAAAGAAAAGAACGCCGGAGGAAAAAGAAGCAAAAAGAAAAGAACAGGAGGAAGTTCAGGAGGTGGCAGA
AGTTGTTGCCACTACTGCGGAAGAAGGAATTACTCAAGAACCTGAAGTGCAAAACCCAGATACGGTTCAAGAAAAGATTGCTGAGAAAAATCAAGAAACAGAGGTTGAGG
AGCAGGCTGTAGGTGAGCCTGACAAGGAGAAAACACCGGCGCAGGAGGCTCATGTTGAAGTCATTATACCTGAACCACCCAGACGTCGCCGCATCAAGTGGAAGGCGGGT
CGCGTGAGAGTGATTCGGAGCACTCCATCGCCTCTGACGTCAGACTTTGAGGAAGATAAAAGAGAAGAGGAAAATACGACAAAAGAAGAAGAGGCAAGGAAGGCAGAAGA
TGAGCGTTTGCGAGAACAGAGAGAAAGCAAGGGCAAAGGAAATGCCGAAGCGTCGGGTGAGATTGAGGAGCCGAGGGCACCATTCATTCGCTTCGTCAATGAACTTGCCA
GAGCAAAATATCAGGAAGTACTGAAGCGTGATTTCTTGTTCGAGCGAGGATTTGGCAGTGATTTGCCCAGGTTTTTGGAGTCTGGAATAGCGAGCCTAGGGTGGAGGCAG
TTTTGTGCGAAGCCTGATCCTGTCAATGCTAACTTCGTTCGGGAATTCTACGCCAATCTTGACGTGAAGGATGATTTCGAAGTCATAGTGCGAGGAGTGCCTGTCCAATG
GAGCCTAGAGGCCATTAATAATTTGTTTGATCTTCAGTACTTTCCACACGCAGTTTTCAATGAAATGGTGGTTGCCCCATCGAGCGACCAATTAAGCGCGGTGGTCCGAG
AGCGCAAGAAGGTAGGGAAGCTGTTCTTTCTAAACACGATTACAGTGTTATGCAGCAGGGCGAGAGTGCCCATGATTCCAAAAGATATGATTATGCTTGATAAGGGAGTC
ATTGACACACCTAATCTGGCGAGGCTTCAGCGTACGCAAAAGGCTCGCAAGGGTGGGCTTGTGTATGGCGTTCATCAGATCCTAGAGCAACTGACATTGGTGGCCAACAG
GAATCATTTTGCTGCAGCAGAGCTCGGTGTTGCAGAGTGCTCAGTTTTGACCAAGAGGTATGAGACCGTGGAGATAGCAAGGGAAATTATGAATTCCCTTCAGGAGATGT
TTGGACTTCCGTCCTATCAACTCCACCATGACGCTCTGAAGAATTTTTTCAATGCCCGCATGCAAGAGGGGCATCTTGTTCGGGAACACGTTCTCGACATGATGAACCAA
TTTAACATCGCGGAGGCAAATGGCGGGGTCGTCTTCGAGCAAAGTCAGGAACCAAATCTGTACCTCAGGCTTCTTCAATGTGGCCATTCCAATAAGCAGATTCCAAAGAG
GAAGGGTGACAAGGAGAAGGCTCCTGCTCAAGCTGTGAACATGGCAAGGGAAAGGCCAAGGTCGTGCCTGACAAGGACAAGTGTTTCCACCGCAATGTGGATGGTCACTA
GAAGAGGAACTGCCCTCGTTACCTTGCTGAGAAAAAGAGAGAAAAGGAAGGGAAATGATTTCAAGCATCTTCAAGATGGTGAGATGACTCTCAAGATCGGTCGTTTGGTA
AAAACTAGACTTCTAACTGATTTAGAAGATGATTCTTTACCACCCTGTGAATCGTGTCTCGAGGATAAAATGACTAAGAGGCCTTTTACTGGAAAAGGTTATCGTGCCAA
AGAACCTCTAGAGTTGATACATTCAAATCTTTGTTGTCCAATGAATGTAAAAGCTCGAGGAGTTTGTGAATATTTCATCTCTTTCATAGATGATTATTCTCAATATGGTT
ACTTACATCTAATGGGTCATAAGTCTGAAGCTCTTGAAAAGTTTAAAGAGTTTAAGACTGAAGTAGAAAGCCTATTGGGTAAAACAATCAAAACACTTCAATTAGATGGA
AGTGGAGAGTATATGGACAAAAGATTCCAGGACTATATGATAGAACATGGAATCCAATTCCAACTCTCAGCACCTGGCACACCTCAACAAAATGATGTATCAGAAAGGAG
AAATAGAACCTTGTTAGACATGGTTCGTTCAATGATGAGCTATGCTCAATTGCCTAGCTCATTTTGGGGGGGGTGTAAGCCTAGTTTGCATTACTTCTGTATCTGGCATT
GTCCTGCACACGTGCTAGTGACAAATCCTAAGAAACTGGACCTCATTCTAGATGGGAGGGTTGTAATACAACCTAACCGTTACTTGGGTTTAACTGAAACACAAGTAGCC
ATACCTGATGACGGTGTTGATGATCCATTGTCTTATCATCAGGCAAAAAATTATGTAGATAAAGACGAATGGGTCAAAGCAATGGACCTTGAGATGGAGTCTATGTACTT
CAGTCAAGTTTGGGATCATGTAGATCCACCTGAAGGGGTCAAACCCATAGGGTGTAAATGGACCTATAAGAGGAAAAGAGAGATGTAG
Protein sequenceShow/hide protein sequence
MPLTYLLEVELFDVWGIDFMGPFPPSIGNVFILLAVDYVSKWVEAIACHQSDAKTVARLDEALWAYRTAYKTPLGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKR
IKSKEFVKGQKVLLYNSRLKLFLGKLKSKWSGPFTVVEVQKIVASKIKLEQIFRTNRKDFSTLVSFSSLRLLVSILPSTLFLSPFSLSVKPFELSMAKTRSRKERESEEE
EVPITPEVQKGKTKKKRTPEEKEAKRKEQEEVQEVAEVVATTAEEGITQEPEVQNPDTVQEKIAEKNQETEVEEQAVGEPDKEKTPAQEAHVEVIIPEPPRRRRIKWKAG
RVRVIRSTPSPLTSDFEEDKREEENTTKEEEARKAEDERLREQRESKGKGNAEASGEIEEPRAPFIRFVNELARAKYQEVLKRDFLFERGFGSDLPRFLESGIASLGWRQ
FCAKPDPVNANFVREFYANLDVKDDFEVIVRGVPVQWSLEAINNLFDLQYFPHAVFNEMVVAPSSDQLSAVVRERKKVGKLFFLNTITVLCSRARVPMIPKDMIMLDKGV
IDTPNLARLQRTQKARKGGLVYGVHQILEQLTLVANRNHFAAAELGVAECSVLTKRYETVEIAREIMNSLQEMFGLPSYQLHHDALKNFFNARMQEGHLVREHVLDMMNQ
FNIAEANGGVVFEQSQEPNLYLRLLQCGHSNKQIPKRKGDKEKAPAQAVNMARERPRSCLTRTSVSTAMWMVTRRGTALVTLLRKREKRKGNDFKHLQDGEMTLKIGRLV
KTRLLTDLEDDSLPPCESCLEDKMTKRPFTGKGYRAKEPLELIHSNLCCPMNVKARGVCEYFISFIDDYSQYGYLHLMGHKSEALEKFKEFKTEVESLLGKTIKTLQLDG
SGEYMDKRFQDYMIEHGIQFQLSAPGTPQQNDVSERRNRTLLDMVRSMMSYAQLPSSFWGGCKPSLHYFCIWHCPAHVLVTNPKKLDLILDGRVVIQPNRYLGLTETQVA
IPDDGVDDPLSYHQAKNYVDKDEWVKAMDLEMESMYFSQVWDHVDPPEGVKPIGCKWTYKRKREM