; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0010628 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0010628
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionReverse transcriptase
Genome locationchr11:24030759..24033516
RNA-Seq ExpressionPay0010628
SyntenyPay0010628
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048442.1 pol protein [Cucumis melo var. makuwa]1.1e-21954.36Show/hide
Query:  MLQAALAPFLAAQQNQAAPVQAQTVVPLAPAEAQPVPFQLSAEAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA-------
        MLQAALAPFLAAQQNQAAPVQAQ V P AP EAQPVP QLSAEAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA       
Subjt:  MLQAALAPFLAAQQNQAAPVQAQTVVPLAPAEAQPVPFQLSAEAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA-------

Query:  ------TAERMLGGDVSKITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAELDMLSRFAPDVVRDEAARTEKFVRGLRLDLQGIVRALRP
              TAERMLGGDV+KITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAE DMLSRFAPD++RDEAARTEKFVRGLRLDLQGIVRALRP
Subjt:  ------TAERMLGGDVSKITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAELDMLSRFAPDVVRDEAARTEKFVRGLRLDLQGIVRALRP

Query:  GTHADALRIALDLSLHERVDSSKAASRGTALGQKRKVETQPDVISQRTLRSEGVFQRHRRELAAAGRTLRELPACTTCGR--------------------
         THADALRIALDLSL ER DSSKAA RG+ALGQKRKVETQPDV  QRTLRS GVFQRHRRELAAAGRTLRELPACTTCGR                    
Subjt:  GTHADALRIALDLSLHERVDSSKAASRGTALGQKRKVETQPDVISQRTLRSEGVFQRHRRELAAAGRTLRELPACTTCGR--------------------

Query:  -----------------------GRVFATTRQEAERAGTVVT----------------------------------------------------------
                               GRVFATTRQEAERAGTVVT                                                          
Subjt:  -----------------------GRVFATTRQEAERAGTVVT----------------------------------------------------------

Query:  -----DCFGKEVVFNPPSGASFKFRGAGMVCIPKVISTMKASKLLSQ-----------------------VVREYPDVFPNELPGLPPPREVDFAIELER
             DCFGKEVVFNPPS ASFKFRGAGMVCIPKVIS MKASKLLSQ                       VVREYPDVFP+ELPGLP PREVDFAIELE 
Subjt:  -----DCFGKEVVFNPPSGASFKFRGAGMVCIPKVISTMKASKLLSQ-----------------------VVREYPDVFPNELPGLPPPREVDFAIELER

Query:  DTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGALVLF------------------------------------------------------
         TAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGA VLF                                                      
Subjt:  DTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGALVLF------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------LKQKLVTAPVLTVPDGSGSF
                                                                                        LKQKLVTAPVLTVPDGSG+F
Subjt:  --------------------------------------------------------------------------------LKQKLVTAPVLTVPDGSGSF

Query:  VIYSDASKKGLGCVLMQQGKVVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDPKSLKYFFTQKELNMR
        VIYSDASKKGLGCVLMQQGKVVAYASRQLK HEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTD KSLKYFFTQKELNMR
Subjt:  VIYSDASKKGLGCVLMQQGKVVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDPKSLKYFFTQKELNMR

KAA0053574.1 pol protein [Cucumis melo var. makuwa]3.4e-22459.07Show/hide
Query:  MLQAALAPFLAAQQNQAAPVQAQTVVPLAPAEAQPVPFQLSAEAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA-------
        MLQAALAPFLAAQQNQAA VQAQTV+P AP  +QPVP QLSAEAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA       
Subjt:  MLQAALAPFLAAQQNQAAPVQAQTVVPLAPAEAQPVPFQLSAEAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA-------

Query:  ------TAERMLGGDVSKITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAELDMLSRFAPDVVRDEAARTEKFVRGLRLDLQGIVRALRP
              TAERMLGGDVSKITWEQFK+NFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAE DMLSRFAPDVVRDEAARTEKF+R LRLDLQGIVRALRP
Subjt:  ------TAERMLGGDVSKITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAELDMLSRFAPDVVRDEAARTEKFVRGLRLDLQGIVRALRP

Query:  GTHADALRIALDLSLHERVDSSKAASRGTALGQKRKVETQPDVISQRTLRSEGVFQRHRRELAAAGRTLRELPACTTCGR--------------------
         THADALRIALDLSL+ER DSSKA  RG ALGQKRKVETQPDVI QRTLRS GVFQRHRRELAAAGRTLR LPACTTCGR                    
Subjt:  GTHADALRIALDLSLHERVDSSKAASRGTALGQKRKVETQPDVISQRTLRSEGVFQRHRRELAAAGRTLRELPACTTCGR--------------------

Query:  -----------------------GRVFATTRQEAERAGTVVT------------DCFGKEVVFNPPSGASFKFRGAGMVCIPKVISTMKASKLLSQ----
                               G  FATTRQEAERAGTVVT            DCFGKEVVFN PSG SFKFRGAG+VC+PKVIS MKASKLLSQ    
Subjt:  -----------------------GRVFATTRQEAERAGTVVT------------DCFGKEVVFNPPSGASFKFRGAGMVCIPKVISTMKASKLLSQ----

Query:  -------------------VVREYPDVFPNELPGLPPPREVDFAIELERDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGALVLF-----
                           VVREYPDVFPNEL GLPPPRE+DFAIELE  TAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGA VLF     
Subjt:  -------------------VVREYPDVFPNELPGLPPPREVDFAIELERDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGALVLF-----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------------------------------LKQKLVTAP
                                                                                                   LKQKLVTAP
Subjt:  -------------------------------------------------------------------------------------------LKQKLVTAP

Query:  VLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDPKSLKYFFTQKELNMR
        VLTVPDGSGSF IYSDASKKGLGCV+MQQGKVVAYA RQLK+HEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTD KSLKYFFT+KELNMR
Subjt:  VLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDPKSLKYFFTQKELNMR

KAA0066365.1 pol protein [Cucumis melo var. makuwa]4.7e-22153.9Show/hide
Query:  MLQAALAPFLAAQQNQAAPVQAQTVVPLAPAEAQPVPFQLSAEAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA-------
        MLQAALAPFLAAQQNQAAPVQAQ V P AP EAQPVP QLSAEAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA       
Subjt:  MLQAALAPFLAAQQNQAAPVQAQTVVPLAPAEAQPVPFQLSAEAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA-------

Query:  ------TAERMLGGDVSKITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAELDMLSRFAPDVVRDEAARTEKFVRGLRLDLQGIVRALRP
              TAERMLGGDVSKITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAE DMLSRFAPD+VRDEAARTEKFVRGLRLDLQGIVRALRP
Subjt:  ------TAERMLGGDVSKITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAELDMLSRFAPDVVRDEAARTEKFVRGLRLDLQGIVRALRP

Query:  GTHADALRIALDLSLHERVDSSKAASRGTALGQKRKVETQPDVISQRTLRSEGVFQRHRRELAAAGRTLRELPACTTCGR--------------------
         THADALRIALDLSL ER DSSKAA RG+ALGQKRKVETQPDV  QRTLRS GVFQRHRRELAAAG+TLRELPACTTCGR                    
Subjt:  GTHADALRIALDLSLHERVDSSKAASRGTALGQKRKVETQPDVISQRTLRSEGVFQRHRRELAAAGRTLRELPACTTCGR--------------------

Query:  -----------------------GRVFATTRQEAERAGTVVT----------------------------------------------------------
                               GRVFATTRQEAERAGTVVT                                                          
Subjt:  -----------------------GRVFATTRQEAERAGTVVT----------------------------------------------------------

Query:  ----------------------------------------DCFGKEVVFNPPSGASFKFRGAGMVCIPKVISTMKASKLLSQ------------------
                                                DCFGKEVVFNPPSGASFKFRGAGMVCIPKVIS MKASKLLSQ                  
Subjt:  ----------------------------------------DCFGKEVVFNPPSGASFKFRGAGMVCIPKVISTMKASKLLSQ------------------

Query:  -----VVREYPDVFPNELPGLPPPREVDFAIELERDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGALVLF-------------------
             VVREYPDVFP+ELPGLPPPREVDFAIELE DTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGA VLF                   
Subjt:  -----VVREYPDVFPNELPGLPPPREVDFAIELERDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGALVLF-------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------------------------------------LKQKL
                                                                                                       LKQKL
Subjt:  -----------------------------------------------------------------------------------------------LKQKL

Query:  VTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDPKSLKYFFTQKELNMR
        VTAPVLTVPDGSG+FVIYSDASKKGLGCVLMQQGKVVAYASRQLK HEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTD KSLKYFFTQKELNMR
Subjt:  VTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDPKSLKYFFTQKELNMR

KAA0067484.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.0e-22054.67Show/hide
Query:  MLQAALAPFLAAQQNQAAPVQAQTVVPLAPAEAQPVPFQLSAEAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA-------
        MLQAALAPFLAAQQNQAAPVQAQ V P AP EAQPVP QLSAEAKHLRDFRKYN KTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA       
Subjt:  MLQAALAPFLAAQQNQAAPVQAQTVVPLAPAEAQPVPFQLSAEAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA-------

Query:  ------TAERMLGGDVSKITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAELDMLSRFAPDVVRDEAARTEKFVRGLRLDLQGIVRALRP
              TAERMLGGDVSKITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAE DMLSRFAPD+VRDE+ARTEKFVRGLRLDLQGIVRALRP
Subjt:  ------TAERMLGGDVSKITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAELDMLSRFAPDVVRDEAARTEKFVRGLRLDLQGIVRALRP

Query:  GTHADALRIALDLSLHERVDSSKAASRGTALGQKRKVETQPDVISQRTLRSEGVFQRHRRELAAAGRTLRELPACTTCGR--------------------
         THADALRIALDLSL ER DSSKAA RG+ALGQKRKVETQPDV+ QRTLRS GVFQRHRRELAAAGRTLRELPACTTCGR                    
Subjt:  GTHADALRIALDLSLHERVDSSKAASRGTALGQKRKVETQPDVISQRTLRSEGVFQRHRRELAAAGRTLRELPACTTCGR--------------------

Query:  -----------------------GRVFATTRQEAERAGTVVT----------------------------------------------------------
                               GRVFATTRQEAERAGTVVT                                                          
Subjt:  -----------------------GRVFATTRQEAERAGTVVT----------------------------------------------------------

Query:  ----------------------------------------DCFGKEVVFNPPSGASFKFRGAGMVCIPKVISTMKASKLLSQ------------------
                                                DCFGKEVVFNPPSGASFKFRGAGMVCIPK IS MKASKLLSQ                  
Subjt:  ----------------------------------------DCFGKEVVFNPPSGASFKFRGAGMVCIPKVISTMKASKLLSQ------------------

Query:  -----VVREYPDVFPNELPGLPPPREVDFAIELERDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGALVLF-------------------
             VVREYPDVFP+ELPGLPPPREVDFAIELE  TAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGA VLF                   
Subjt:  -----VVREYPDVFPNELPGLPPPREVDFAIELERDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGALVLF-------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------LKQKLVTAPVLTVPDGSGSFVIYSD
                                                                                   LKQKLVTAPVLTVPDGSG+FVIYSD
Subjt:  ---------------------------------------------------------------------------LKQKLVTAPVLTVPDGSGSFVIYSD

Query:  ASKKGLGCVLMQQGKVVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDPKSLKYFFTQKELNMR
        ASKKGLGCVLMQQGKVVAYASRQLK HEQNYP HDLELAAVVFALKIWRHYLYGEKIQIYTD KSLKYFFTQKELNMR
Subjt:  ASKKGLGCVLMQQGKVVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDPKSLKYFFTQKELNMR

TYK19164.1 pol protein [Cucumis melo var. makuwa]3.4e-22459.07Show/hide
Query:  MLQAALAPFLAAQQNQAAPVQAQTVVPLAPAEAQPVPFQLSAEAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA-------
        MLQAALAPFLAAQQNQAA VQAQTV+P AP  +QPVP QLSAEAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA       
Subjt:  MLQAALAPFLAAQQNQAAPVQAQTVVPLAPAEAQPVPFQLSAEAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA-------

Query:  ------TAERMLGGDVSKITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAELDMLSRFAPDVVRDEAARTEKFVRGLRLDLQGIVRALRP
              TAERMLGGDVSKITWEQFK+NFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAE DMLSRFAPDVVRDEAARTEKF+R LRLDLQGIVRALRP
Subjt:  ------TAERMLGGDVSKITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAELDMLSRFAPDVVRDEAARTEKFVRGLRLDLQGIVRALRP

Query:  GTHADALRIALDLSLHERVDSSKAASRGTALGQKRKVETQPDVISQRTLRSEGVFQRHRRELAAAGRTLRELPACTTCGR--------------------
         THADALRIALDLSL+ER DSSKA  RG ALGQKRKVETQPDVI QRTLRS GVFQRHRRELAAAGRTLR LPACTTCGR                    
Subjt:  GTHADALRIALDLSLHERVDSSKAASRGTALGQKRKVETQPDVISQRTLRSEGVFQRHRRELAAAGRTLRELPACTTCGR--------------------

Query:  -----------------------GRVFATTRQEAERAGTVVT------------DCFGKEVVFNPPSGASFKFRGAGMVCIPKVISTMKASKLLSQ----
                               G  FATTRQEAERAGTVVT            DCFGKEVVFN PSG SFKFRGAG+VC+PKVIS MKASKLLSQ    
Subjt:  -----------------------GRVFATTRQEAERAGTVVT------------DCFGKEVVFNPPSGASFKFRGAGMVCIPKVISTMKASKLLSQ----

Query:  -------------------VVREYPDVFPNELPGLPPPREVDFAIELERDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGALVLF-----
                           VVREYPDVFPNEL GLPPPRE+DFAIELE  TAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGA VLF     
Subjt:  -------------------VVREYPDVFPNELPGLPPPREVDFAIELERDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGALVLF-----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------------------------------LKQKLVTAP
                                                                                                   LKQKLVTAP
Subjt:  -------------------------------------------------------------------------------------------LKQKLVTAP

Query:  VLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDPKSLKYFFTQKELNMR
        VLTVPDGSGSF IYSDASKKGLGCV+MQQGKVVAYA RQLK+HEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTD KSLKYFFT+KELNMR
Subjt:  VLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDPKSLKYFFTQKELNMR

TrEMBL top hitse value%identityAlignment
A0A5A7TY28 Reverse transcriptase5.6e-22054.36Show/hide
Query:  MLQAALAPFLAAQQNQAAPVQAQTVVPLAPAEAQPVPFQLSAEAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA-------
        MLQAALAPFLAAQQNQAAPVQAQ V P AP EAQPVP QLSAEAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA       
Subjt:  MLQAALAPFLAAQQNQAAPVQAQTVVPLAPAEAQPVPFQLSAEAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA-------

Query:  ------TAERMLGGDVSKITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAELDMLSRFAPDVVRDEAARTEKFVRGLRLDLQGIVRALRP
              TAERMLGGDV+KITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAE DMLSRFAPD++RDEAARTEKFVRGLRLDLQGIVRALRP
Subjt:  ------TAERMLGGDVSKITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAELDMLSRFAPDVVRDEAARTEKFVRGLRLDLQGIVRALRP

Query:  GTHADALRIALDLSLHERVDSSKAASRGTALGQKRKVETQPDVISQRTLRSEGVFQRHRRELAAAGRTLRELPACTTCGR--------------------
         THADALRIALDLSL ER DSSKAA RG+ALGQKRKVETQPDV  QRTLRS GVFQRHRRELAAAGRTLRELPACTTCGR                    
Subjt:  GTHADALRIALDLSLHERVDSSKAASRGTALGQKRKVETQPDVISQRTLRSEGVFQRHRRELAAAGRTLRELPACTTCGR--------------------

Query:  -----------------------GRVFATTRQEAERAGTVVT----------------------------------------------------------
                               GRVFATTRQEAERAGTVVT                                                          
Subjt:  -----------------------GRVFATTRQEAERAGTVVT----------------------------------------------------------

Query:  -----DCFGKEVVFNPPSGASFKFRGAGMVCIPKVISTMKASKLLSQ-----------------------VVREYPDVFPNELPGLPPPREVDFAIELER
             DCFGKEVVFNPPS ASFKFRGAGMVCIPKVIS MKASKLLSQ                       VVREYPDVFP+ELPGLP PREVDFAIELE 
Subjt:  -----DCFGKEVVFNPPSGASFKFRGAGMVCIPKVISTMKASKLLSQ-----------------------VVREYPDVFPNELPGLPPPREVDFAIELER

Query:  DTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGALVLF------------------------------------------------------
         TAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGA VLF                                                      
Subjt:  DTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGALVLF------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------LKQKLVTAPVLTVPDGSGSF
                                                                                        LKQKLVTAPVLTVPDGSG+F
Subjt:  --------------------------------------------------------------------------------LKQKLVTAPVLTVPDGSGSF

Query:  VIYSDASKKGLGCVLMQQGKVVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDPKSLKYFFTQKELNMR
        VIYSDASKKGLGCVLMQQGKVVAYASRQLK HEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTD KSLKYFFTQKELNMR
Subjt:  VIYSDASKKGLGCVLMQQGKVVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDPKSLKYFFTQKELNMR

A0A5A7UED4 Pol protein1.7e-22459.07Show/hide
Query:  MLQAALAPFLAAQQNQAAPVQAQTVVPLAPAEAQPVPFQLSAEAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA-------
        MLQAALAPFLAAQQNQAA VQAQTV+P AP  +QPVP QLSAEAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA       
Subjt:  MLQAALAPFLAAQQNQAAPVQAQTVVPLAPAEAQPVPFQLSAEAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA-------

Query:  ------TAERMLGGDVSKITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAELDMLSRFAPDVVRDEAARTEKFVRGLRLDLQGIVRALRP
              TAERMLGGDVSKITWEQFK+NFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAE DMLSRFAPDVVRDEAARTEKF+R LRLDLQGIVRALRP
Subjt:  ------TAERMLGGDVSKITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAELDMLSRFAPDVVRDEAARTEKFVRGLRLDLQGIVRALRP

Query:  GTHADALRIALDLSLHERVDSSKAASRGTALGQKRKVETQPDVISQRTLRSEGVFQRHRRELAAAGRTLRELPACTTCGR--------------------
         THADALRIALDLSL+ER DSSKA  RG ALGQKRKVETQPDVI QRTLRS GVFQRHRRELAAAGRTLR LPACTTCGR                    
Subjt:  GTHADALRIALDLSLHERVDSSKAASRGTALGQKRKVETQPDVISQRTLRSEGVFQRHRRELAAAGRTLRELPACTTCGR--------------------

Query:  -----------------------GRVFATTRQEAERAGTVVT------------DCFGKEVVFNPPSGASFKFRGAGMVCIPKVISTMKASKLLSQ----
                               G  FATTRQEAERAGTVVT            DCFGKEVVFN PSG SFKFRGAG+VC+PKVIS MKASKLLSQ    
Subjt:  -----------------------GRVFATTRQEAERAGTVVT------------DCFGKEVVFNPPSGASFKFRGAGMVCIPKVISTMKASKLLSQ----

Query:  -------------------VVREYPDVFPNELPGLPPPREVDFAIELERDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGALVLF-----
                           VVREYPDVFPNEL GLPPPRE+DFAIELE  TAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGA VLF     
Subjt:  -------------------VVREYPDVFPNELPGLPPPREVDFAIELERDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGALVLF-----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------------------------------LKQKLVTAP
                                                                                                   LKQKLVTAP
Subjt:  -------------------------------------------------------------------------------------------LKQKLVTAP

Query:  VLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDPKSLKYFFTQKELNMR
        VLTVPDGSGSF IYSDASKKGLGCV+MQQGKVVAYA RQLK+HEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTD KSLKYFFT+KELNMR
Subjt:  VLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDPKSLKYFFTQKELNMR

A0A5A7VJF1 Reverse transcriptase5.0e-22154.67Show/hide
Query:  MLQAALAPFLAAQQNQAAPVQAQTVVPLAPAEAQPVPFQLSAEAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA-------
        MLQAALAPFLAAQQNQAAPVQAQ V P AP EAQPVP QLSAEAKHLRDFRKYN KTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA       
Subjt:  MLQAALAPFLAAQQNQAAPVQAQTVVPLAPAEAQPVPFQLSAEAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA-------

Query:  ------TAERMLGGDVSKITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAELDMLSRFAPDVVRDEAARTEKFVRGLRLDLQGIVRALRP
              TAERMLGGDVSKITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAE DMLSRFAPD+VRDE+ARTEKFVRGLRLDLQGIVRALRP
Subjt:  ------TAERMLGGDVSKITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAELDMLSRFAPDVVRDEAARTEKFVRGLRLDLQGIVRALRP

Query:  GTHADALRIALDLSLHERVDSSKAASRGTALGQKRKVETQPDVISQRTLRSEGVFQRHRRELAAAGRTLRELPACTTCGR--------------------
         THADALRIALDLSL ER DSSKAA RG+ALGQKRKVETQPDV+ QRTLRS GVFQRHRRELAAAGRTLRELPACTTCGR                    
Subjt:  GTHADALRIALDLSLHERVDSSKAASRGTALGQKRKVETQPDVISQRTLRSEGVFQRHRRELAAAGRTLRELPACTTCGR--------------------

Query:  -----------------------GRVFATTRQEAERAGTVVT----------------------------------------------------------
                               GRVFATTRQEAERAGTVVT                                                          
Subjt:  -----------------------GRVFATTRQEAERAGTVVT----------------------------------------------------------

Query:  ----------------------------------------DCFGKEVVFNPPSGASFKFRGAGMVCIPKVISTMKASKLLSQ------------------
                                                DCFGKEVVFNPPSGASFKFRGAGMVCIPK IS MKASKLLSQ                  
Subjt:  ----------------------------------------DCFGKEVVFNPPSGASFKFRGAGMVCIPKVISTMKASKLLSQ------------------

Query:  -----VVREYPDVFPNELPGLPPPREVDFAIELERDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGALVLF-------------------
             VVREYPDVFP+ELPGLPPPREVDFAIELE  TAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGA VLF                   
Subjt:  -----VVREYPDVFPNELPGLPPPREVDFAIELERDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGALVLF-------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------LKQKLVTAPVLTVPDGSGSFVIYSD
                                                                                   LKQKLVTAPVLTVPDGSG+FVIYSD
Subjt:  ---------------------------------------------------------------------------LKQKLVTAPVLTVPDGSGSFVIYSD

Query:  ASKKGLGCVLMQQGKVVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDPKSLKYFFTQKELNMR
        ASKKGLGCVLMQQGKVVAYASRQLK HEQNYP HDLELAAVVFALKIWRHYLYGEKIQIYTD KSLKYFFTQKELNMR
Subjt:  ASKKGLGCVLMQQGKVVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDPKSLKYFFTQKELNMR

A0A5A7VKS7 Reverse transcriptase2.3e-22153.9Show/hide
Query:  MLQAALAPFLAAQQNQAAPVQAQTVVPLAPAEAQPVPFQLSAEAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA-------
        MLQAALAPFLAAQQNQAAPVQAQ V P AP EAQPVP QLSAEAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA       
Subjt:  MLQAALAPFLAAQQNQAAPVQAQTVVPLAPAEAQPVPFQLSAEAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA-------

Query:  ------TAERMLGGDVSKITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAELDMLSRFAPDVVRDEAARTEKFVRGLRLDLQGIVRALRP
              TAERMLGGDVSKITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAE DMLSRFAPD+VRDEAARTEKFVRGLRLDLQGIVRALRP
Subjt:  ------TAERMLGGDVSKITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAELDMLSRFAPDVVRDEAARTEKFVRGLRLDLQGIVRALRP

Query:  GTHADALRIALDLSLHERVDSSKAASRGTALGQKRKVETQPDVISQRTLRSEGVFQRHRRELAAAGRTLRELPACTTCGR--------------------
         THADALRIALDLSL ER DSSKAA RG+ALGQKRKVETQPDV  QRTLRS GVFQRHRRELAAAG+TLRELPACTTCGR                    
Subjt:  GTHADALRIALDLSLHERVDSSKAASRGTALGQKRKVETQPDVISQRTLRSEGVFQRHRRELAAAGRTLRELPACTTCGR--------------------

Query:  -----------------------GRVFATTRQEAERAGTVVT----------------------------------------------------------
                               GRVFATTRQEAERAGTVVT                                                          
Subjt:  -----------------------GRVFATTRQEAERAGTVVT----------------------------------------------------------

Query:  ----------------------------------------DCFGKEVVFNPPSGASFKFRGAGMVCIPKVISTMKASKLLSQ------------------
                                                DCFGKEVVFNPPSGASFKFRGAGMVCIPKVIS MKASKLLSQ                  
Subjt:  ----------------------------------------DCFGKEVVFNPPSGASFKFRGAGMVCIPKVISTMKASKLLSQ------------------

Query:  -----VVREYPDVFPNELPGLPPPREVDFAIELERDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGALVLF-------------------
             VVREYPDVFP+ELPGLPPPREVDFAIELE DTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGA VLF                   
Subjt:  -----VVREYPDVFPNELPGLPPPREVDFAIELERDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGALVLF-------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------------------------------------LKQKL
                                                                                                       LKQKL
Subjt:  -----------------------------------------------------------------------------------------------LKQKL

Query:  VTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDPKSLKYFFTQKELNMR
        VTAPVLTVPDGSG+FVIYSDASKKGLGCVLMQQGKVVAYASRQLK HEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTD KSLKYFFTQKELNMR
Subjt:  VTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDPKSLKYFFTQKELNMR

A0A5D3D6H0 Pol protein1.7e-22459.07Show/hide
Query:  MLQAALAPFLAAQQNQAAPVQAQTVVPLAPAEAQPVPFQLSAEAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA-------
        MLQAALAPFLAAQQNQAA VQAQTV+P AP  +QPVP QLSAEAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA       
Subjt:  MLQAALAPFLAAQQNQAAPVQAQTVVPLAPAEAQPVPFQLSAEAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCA-------

Query:  ------TAERMLGGDVSKITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAELDMLSRFAPDVVRDEAARTEKFVRGLRLDLQGIVRALRP
              TAERMLGGDVSKITWEQFK+NFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAE DMLSRFAPDVVRDEAARTEKF+R LRLDLQGIVRALRP
Subjt:  ------TAERMLGGDVSKITWEQFKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAELDMLSRFAPDVVRDEAARTEKFVRGLRLDLQGIVRALRP

Query:  GTHADALRIALDLSLHERVDSSKAASRGTALGQKRKVETQPDVISQRTLRSEGVFQRHRRELAAAGRTLRELPACTTCGR--------------------
         THADALRIALDLSL+ER DSSKA  RG ALGQKRKVETQPDVI QRTLRS GVFQRHRRELAAAGRTLR LPACTTCGR                    
Subjt:  GTHADALRIALDLSLHERVDSSKAASRGTALGQKRKVETQPDVISQRTLRSEGVFQRHRRELAAAGRTLRELPACTTCGR--------------------

Query:  -----------------------GRVFATTRQEAERAGTVVT------------DCFGKEVVFNPPSGASFKFRGAGMVCIPKVISTMKASKLLSQ----
                               G  FATTRQEAERAGTVVT            DCFGKEVVFN PSG SFKFRGAG+VC+PKVIS MKASKLLSQ    
Subjt:  -----------------------GRVFATTRQEAERAGTVVT------------DCFGKEVVFNPPSGASFKFRGAGMVCIPKVISTMKASKLLSQ----

Query:  -------------------VVREYPDVFPNELPGLPPPREVDFAIELERDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGALVLF-----
                           VVREYPDVFPNEL GLPPPRE+DFAIELE  TAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGA VLF     
Subjt:  -------------------VVREYPDVFPNELPGLPPPREVDFAIELERDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGALVLF-----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------------------------------LKQKLVTAP
                                                                                                   LKQKLVTAP
Subjt:  -------------------------------------------------------------------------------------------LKQKLVTAP

Query:  VLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDPKSLKYFFTQKELNMR
        VLTVPDGSGSF IYSDASKKGLGCV+MQQGKVVAYA RQLK+HEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTD KSLKYFFT+KELNMR
Subjt:  VLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDPKSLKYFFTQKELNMR

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.69.4e-1540.78Show/hide
Query:  LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDPKSLKYFFTQKEL
        LK  +   P+L VPD +  F + +DAS   LG VL Q G  ++Y SR L  HE NY T + EL A+V+A K +RHYL G   +I +D + L + +  K+ 
Subjt:  LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDPKSLKYFFTQKEL

Query:  NMR
        N +
Subjt:  NMR

P10394 Retrovirus-related Pol polyprotein from transposon 4121.8e-1039.22Show/hide
Query:  LFLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK----VVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDPKSLKYF
        + LK +L+   +L  PD S  F I +DASK+  G VL Q        VAYASR     E N  T + ELAA+ +A+  +R Y+YG+   + TD + L Y 
Subjt:  LFLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK----VVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDPKSLKYF

Query:  FT
        F+
Subjt:  FT

P10401 Retrovirus-related Pol polyprotein from transposon gypsy2.8e-1138.95Show/hide
Query:  VLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGEK-IQIYTDPKSLKYFFTQKELNMR
        +L  PD    F + +DAS  G+G VL Q+G+ +   SR LK  EQNY T++ EL A+V+AL   +++LYG + I I+TD + L +    +  N +
Subjt:  VLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGEK-IQIYTDPKSLKYFFTQKELNMR

P20825 Retrovirus-related Pol polyprotein from transposon 2973.6e-1440.4Show/hide
Query:  LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDPKSLKYFFTQKE
        LK  ++  P+L +PD    FV+ +DAS   LG VL Q G  +++ SR L +HE NY   + EL A+V+A K +RHYL G +  I +D + L++    KE
Subjt:  LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDPKSLKYFFTQKE

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus2.9e-0833.33Show/hide
Query:  LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQ----QGKVVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGE-KIQIYTDPKSLKYFF
        LK  L ++ +L  P  +  F + +DAS   +G VL Q    + + +AY SR L   E+NY T + E+ A++++L   R YLYG   I++YTD + L +  
Subjt:  LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQ----QGKVVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGE-KIQIYTDPKSLKYFF

Query:  TQKELNMR
          +  N +
Subjt:  TQKELNMR

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGCAAGCTGCTTTGGCGCCTTTCCTCGCCGCCCAACAGAACCAGGCCGCCCCTGTTCAGGCCCAGACCGTCGTTCCTCTAGCCCCAGCGGAAGCTCAACCCGTGCC
ATTTCAACTGTCGGCCGAGGCTAAACACTTACGGGACTTTAGGAAGTATAATCCTAAGACCTTCGATGGATCCATGGACAACCCCACAAAGGCCCAAATGTGGTTGACGT
CAATAGAGACTATCTTCCGGTACATGAAGTGCCCAGAAGACCAGAAGGTGCAGTGTGCAACTGCTGAGAGAATGCTGGGAGGGGACGTTAGCAAGATAACTTGGGAGCAG
TTCAAGGAGAACTTCTATGCTAAGTTCTTCTCCGCCAATGTGAAACACGCCAAGCTGCAAGAGTTCCTAAACTTGGAGCAAGGCGACATGACTGTGGAGCAGTATGACGC
CGAGCTCGACATGCTGTCCCGTTTCGCTCCCGATGTGGTAAGGGATGAGGCCGCCAGGACTGAGAAATTCGTTAGAGGTCTCAGACTAGACCTTCAGGGCATCGTTCGAG
CTCTCCGGCCAGGCACTCATGCGGATGCATTACGCATAGCACTGGATTTGAGCCTGCATGAGAGAGTTGATTCGTCTAAGGCTGCCAGCAGAGGGACAGCCTTAGGACAG
AAGAGAAAGGTGGAGACGCAGCCTGACGTGATATCACAGCGGACTCTGAGGTCAGAAGGTGTCTTTCAGAGACACCGACGAGAGCTTGCAGCAGCCGGGAGGACCCTGAG
AGAGCTACCCGCCTGTACCACCTGTGGGAGAGGGAGAGTTTTTGCCACTACCCGGCAGGAGGCCGAGCGAGCTGGTACAGTGGTGACAGACTGTTTTGGTAAGGAAGTTG
TCTTTAACCCTCCCTCCGGGGCTAGTTTCAAATTTAGGGGGGCAGGCATGGTATGTATACCCAAGGTCATCTCAACCATGAAGGCTAGTAAACTACTCAGCCAGGTGGTA
AGGGAGTACCCTGATGTTTTCCCCAACGAACTCCCAGGACTTCCGCCTCCCAGGGAGGTAGACTTCGCCATCGAGTTAGAGCGGGACACTGCCCCTATCTCGAGGGCCCC
TTACAGAATGGCTCCAGCCGAGCTAAAAGAGTTGAAGGTCCAGTTACAGGAGTTGCTGGACAAGGGTTTCATCCGGCCCAGTGTGTCACCTTGGGGAGCCCTAGTGTTGT
TTCTCAAGCAGAAGCTGGTGACTGCACCAGTTCTGACAGTGCCCGATGGGTCGGGAAGCTTTGTGATCTATAGTGATGCCTCCAAGAAGGGATTGGGCTGTGTCCTGATG
CAGCAAGGTAAGGTAGTTGCTTATGCCTCCCGCCAGTTGAAGAATCATGAGCAGAACTACCCTACCCACGACTTGGAGTTGGCAGCTGTAGTCTTTGCACTGAAGATATG
GAGGCACTACCTGTACGGTGAGAAGATACAGATTTACACTGACCCTAAGAGCCTGAAGTACTTCTTCACTCAGAAGGAGTTGAACATGAGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTGCAAGCTGCTTTGGCGCCTTTCCTCGCCGCCCAACAGAACCAGGCCGCCCCTGTTCAGGCCCAGACCGTCGTTCCTCTAGCCCCAGCGGAAGCTCAACCCGTGCC
ATTTCAACTGTCGGCCGAGGCTAAACACTTACGGGACTTTAGGAAGTATAATCCTAAGACCTTCGATGGATCCATGGACAACCCCACAAAGGCCCAAATGTGGTTGACGT
CAATAGAGACTATCTTCCGGTACATGAAGTGCCCAGAAGACCAGAAGGTGCAGTGTGCAACTGCTGAGAGAATGCTGGGAGGGGACGTTAGCAAGATAACTTGGGAGCAG
TTCAAGGAGAACTTCTATGCTAAGTTCTTCTCCGCCAATGTGAAACACGCCAAGCTGCAAGAGTTCCTAAACTTGGAGCAAGGCGACATGACTGTGGAGCAGTATGACGC
CGAGCTCGACATGCTGTCCCGTTTCGCTCCCGATGTGGTAAGGGATGAGGCCGCCAGGACTGAGAAATTCGTTAGAGGTCTCAGACTAGACCTTCAGGGCATCGTTCGAG
CTCTCCGGCCAGGCACTCATGCGGATGCATTACGCATAGCACTGGATTTGAGCCTGCATGAGAGAGTTGATTCGTCTAAGGCTGCCAGCAGAGGGACAGCCTTAGGACAG
AAGAGAAAGGTGGAGACGCAGCCTGACGTGATATCACAGCGGACTCTGAGGTCAGAAGGTGTCTTTCAGAGACACCGACGAGAGCTTGCAGCAGCCGGGAGGACCCTGAG
AGAGCTACCCGCCTGTACCACCTGTGGGAGAGGGAGAGTTTTTGCCACTACCCGGCAGGAGGCCGAGCGAGCTGGTACAGTGGTGACAGACTGTTTTGGTAAGGAAGTTG
TCTTTAACCCTCCCTCCGGGGCTAGTTTCAAATTTAGGGGGGCAGGCATGGTATGTATACCCAAGGTCATCTCAACCATGAAGGCTAGTAAACTACTCAGCCAGGTGGTA
AGGGAGTACCCTGATGTTTTCCCCAACGAACTCCCAGGACTTCCGCCTCCCAGGGAGGTAGACTTCGCCATCGAGTTAGAGCGGGACACTGCCCCTATCTCGAGGGCCCC
TTACAGAATGGCTCCAGCCGAGCTAAAAGAGTTGAAGGTCCAGTTACAGGAGTTGCTGGACAAGGGTTTCATCCGGCCCAGTGTGTCACCTTGGGGAGCCCTAGTGTTGT
TTCTCAAGCAGAAGCTGGTGACTGCACCAGTTCTGACAGTGCCCGATGGGTCGGGAAGCTTTGTGATCTATAGTGATGCCTCCAAGAAGGGATTGGGCTGTGTCCTGATG
CAGCAAGGTAAGGTAGTTGCTTATGCCTCCCGCCAGTTGAAGAATCATGAGCAGAACTACCCTACCCACGACTTGGAGTTGGCAGCTGTAGTCTTTGCACTGAAGATATG
GAGGCACTACCTGTACGGTGAGAAGATACAGATTTACACTGACCCTAAGAGCCTGAAGTACTTCTTCACTCAGAAGGAGTTGAACATGAGGTAG
Protein sequenceShow/hide protein sequence
MLQAALAPFLAAQQNQAAPVQAQTVVPLAPAEAQPVPFQLSAEAKHLRDFRKYNPKTFDGSMDNPTKAQMWLTSIETIFRYMKCPEDQKVQCATAERMLGGDVSKITWEQ
FKENFYAKFFSANVKHAKLQEFLNLEQGDMTVEQYDAELDMLSRFAPDVVRDEAARTEKFVRGLRLDLQGIVRALRPGTHADALRIALDLSLHERVDSSKAASRGTALGQ
KRKVETQPDVISQRTLRSEGVFQRHRRELAAAGRTLRELPACTTCGRGRVFATTRQEAERAGTVVTDCFGKEVVFNPPSGASFKFRGAGMVCIPKVISTMKASKLLSQVV
REYPDVFPNELPGLPPPREVDFAIELERDTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGALVLFLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLM
QQGKVVAYASRQLKNHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIYTDPKSLKYFFTQKELNMR