; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005546 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005546
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr6:21408504..21429410
RNA-Seq ExpressionLag0005546
SyntenyLag0005546
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIM97577.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]4.0e-17238.1Show/hide
Query:  WRNHPNFSWG------------------------DKEVMCKHNKRCRIQSNQASMRALELQVGQLANELKARPQGKLP---------------RILNTLE
        WR HPNFSW                         +K+   +      + S  A+ + ++ Q+GQLAN + +RPQG LP               + +    
Subjt:  WRNHPNFSWG------------------------DKEVMCKHNKRCRIQSNQASMRALELQVGQLANELKARPQGKLP---------------RILNTLE

Query:  GKGAGGSNKNSRASGSVPMWNHLMCRP----------HLMYHLYLFHKGKRLRIRMDILTKKKRLGEFEIVSLTEECSAILKMGYHPRLRIQG-------
        G+    + K    S    + +    +           ++ +   L      ++   DIL+KK+ LG++E V+LTEECSAI++    P+L+  G       
Subjt:  GKGAGGSNKNSRASGSVPMWNHLMCRP----------HLMYHLYLFHKGKRLRIRMDILTKKKRLGEFEIVSLTEECSAILKMGYHPRLRIQG-------

Query:  ---HL---------LYSWYGEARPTTVTLQLADRSITYPEGKIEDVLVK--------------------------SAILATGRALIDVQKGELTMRVYNE
           HL              GEA+PT++TLQLADRS+TYP+G IED+LVK                             LATGR LIDVQ           
Subjt:  ---HL---------LYSWYGEARPTTVTLQLADRSITYPEGKIEDVLVK--------------------------SAILATGRALIDVQKGELTMRVYNE

Query:  EVKFNVFKAMKYPDEMEDCSFIRILKSTV------------IETTMQD------------SSNLD-------------QRKAPP--IKPSLIEAPTLDLK
               KAMK+P+E ++C  + +  + V            +E  + D               LD             +R AP   +KPS+ E PTL+LK
Subjt:  EVKFNVFKAMKYPDEMEDCSFIRILKSTV------------IETTMQD------------SSNLD-------------QRKAPP--IKPSLIEAPTLDLK

Query:  SLSDHLKYVYLGESETLPIIVASNLMPEHEEALIKLLQQYRKAIGLTL----------------------------------------------LDDGII
         L +HL Y YLGES+TLP+I++S+L     E L+++L+ ++ AIG T+                                              LD GII
Subjt:  SLSDHLKYVYLGESETLPIIVASNLMPEHEEALIKLLQQYRKAIGLTL----------------------------------------------LDDGII

Query:  YPIANSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNHFPLLFIDQMLDQLASQAYYCFLDGYSGYNQITIAPEDQEKTTF
        YPI++S+WVSPVQCVPKKGG+TVV N +NELIPTRTVTGWRVCMDYR+LNKATRK+HFPLLFIDQMLD+LA + +YCFLDGYSGYNQI IAPEDQEK TF
Subjt:  YPIANSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNHFPLLFIDQMLDQLASQAYYCFLDGYSGYNQITIAPEDQEKTTF

Query:  TCPYGTFTFRRMPFGLYNAPATFQQ---------------------------------------------------------------------------
        TCPYGTF FRRMPFGL NAPATFQ+                                                                           
Subjt:  TCPYGTFTFRRMPFGLYNAPATFQQ---------------------------------------------------------------------------

Query:  -------------------------------------------------------------------------TPILCAPNWNLPFEVMCDASDAAV---
                                                                                  PI+  P+W+ PFE+MCDASD AV   
Subjt:  -------------------------------------------------------------------------TPILCAPNWNLPFEVMCDASDAAV---

Query:  ------------------LNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENIIAD
                          LN+AQ+NYTTTEKELLAVVFAF+KFR YLVG+KV V+TDHAAIRYL+ KKDAKPRLIRW+LLLQEFDLEI+D+KG+EN IAD
Subjt:  ------------------LNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENIIAD

Query:  HLSRLDPSSSLLEQSAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEPFMYKQCSDGIIRRCVSGAEAKEIL
        HLSRL+  +   E + I+D+FPDEQL A+   V  DVPWYADI N+L  G+ P D+  +QKKKF  D + ++WD+PF++KQ  D I+RRCV   E  +IL
Subjt:  HLSRLDPSSSLLEQSAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEPFMYKQCSDGIIRRCVSGAEAKEIL

Query:  EQCHSSPYGGHFSDQRIAMRILHCGFFWP
        EQCH+SPYGGHF   R A +IL  GFFWP
Subjt:  EQCHSSPYGGHFSDQRIAMRILHCGFFWP

PIN12235.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]1.4e-17240.08Show/hide
Query:  WRNHPNFSWGDKEVMCKHNKRCRIQSNQASMRALELQVGQLANELKARPQGKL-------PR-----------ILNTLEGKGAGGSNKNSRASGSVPMWN
        WR HPNFSW                +NQ    A   Q GQLAN + +RPQG L       PR           + N  E +        S+    +P   
Subjt:  WRNHPNFSWGDKEVMCKHNKRCRIQSNQASMRALELQVGQLANELKARPQGKL-------PR-----------ILNTLEGKGAGGSNKNSRASGSVPMWN

Query:  H-----LMCRPH--LMYHLYLFHKGKRLRIRMDILTKKKRLGEFEIVSLTEECSAILKMGYHPRLRIQGH-------------------------LLYSW
              L+ + H  + +   L      ++   DIL+KK+RLG++E+V+LTEECSAI++    P+L+  G                          + YS 
Subjt:  H-----LMCRPH--LMYHLYLFHKGKRLRIRMDILTKKKRLGEFEIVSLTEECSAILKMGYHPRLRIQGH-------------------------LLYSW

Query:  Y-----GEARPTTVTLQLADRSITYPEGKIEDVLVK--------------------------SAILATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYP
        Y     GEA+ T++TLQLADRS+TYP+G IED+LVK                             LATGR LIDVQKGELTMRV ++++ FNVFKAMK+P
Subjt:  Y-----GEARPTTVTLQLADRSITYPEGKIEDVLVK--------------------------SAILATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYP

Query:  DEMEDCSFIRI-----------------LKSTVIETTMQDS-------SNLD-------------QRKAPP--IKPSLIEAPTLDLKSLSDHLKYVYLGE
        +E ++C  + +                 L+  +++   +D+         LD             +R AP   +KPS+ + PTL+LK L  HL Y YLGE
Subjt:  DEMEDCSFIRI-----------------LKSTVIETTMQDS-------SNLD-------------QRKAPP--IKPSLIEAPTLDLKSLSDHLKYVYLGE

Query:  SETLPIIVASNLMPEHEEALIKLLQQYRKAIGLTL----------------------------------------------LDDGIIYPIANSNWVSPVQ
         +TLP+I++S+L     E L+++++ ++ AIG T+                                              LD GIIYPI++S+WVSPVQ
Subjt:  SETLPIIVASNLMPEHEEALIKLLQQYRKAIGLTL----------------------------------------------LDDGIIYPIANSNWVSPVQ

Query:  CVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNHFPLLFIDQMLDQLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFTFRRMP
        CVPKKGG+TVV N +NELIPTRTVTG RVCMDYR+LNKATRK+HFPL FIDQMLD+LA + +YCFLDGYSGYNQI IAPEDQEKTTFTCPYGTF FRRMP
Subjt:  CVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNHFPLLFIDQMLDQLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFTFRRMP

Query:  FGLYNAPATFQQ----------------------------------------------------------------------------------------
        FGL NAPATFQ+                                                                                        
Subjt:  FGLYNAPATFQQ----------------------------------------------------------------------------------------

Query:  ----------------------TPILCAPNWNLPFEVMCDASDAAV---------------------LNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSK
                               PI+  P+W  PFE+MCDASD A+                     LN+AQ+NYTTTEKELLAVVFAF+KFR YLVG+K
Subjt:  ----------------------TPILCAPNWNLPFEVMCDASDAAV---------------------LNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSK

Query:  VTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENIIADHLSRLDPSSSLLEQSAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGV
        V V+TDHAAIRYL+ KKDA P LIRW+LLLQEFDLEI+D+KG+EN IADHLSRLD  +   E + I+D+FPDEQL A+   V  DVP             
Subjt:  VTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENIIADHLSRLDPSSSLLEQSAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGV

Query:  TPIDMDWRQKKKFKHDAKFFYWDEPFMYKQCSDGIIRRCVSGAEAKEILEQCHSSPYGGHFSDQRIAMRILHCGFFWP
               +QKKKF  D + ++WD+PF++KQ  D I+RRCV   E  +ILEQCH+SPYGGHF   R A +IL  GFFWP
Subjt:  TPIDMDWRQKKKFKHDAKFFYWDEPFMYKQCSDGIIRRCVSGAEAKEILEQCHSSPYGGHFSDQRIAMRILHCGFFWP

PIN21854.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]2.2e-17839.52Show/hide
Query:  WRNHPNFSWGDKEVM---------CKHN-KRCRIQSNQASMRALELQVGQLANELKARP-QGKLPRILNTLEGKGAGGSNKNSRASGSVPMWNHLMCRPH
        WR HPNFSW + +            +HN K    Q  +A  +A+ L+ G+   E+   P + K   + +  + K      + S+ +   P +   + +  
Subjt:  WRNHPNFSWGDKEVM---------CKHN-KRCRIQSNQASMRALELQVGQLANELKARP-QGKLPRILNTLEGKGAGGSNKNSRASGSVPMWNHLMCRPH

Query:  LMYHLYLF-HKGKRLRIRM-----------------DILTKKKRLGEFEIVSLTEECSAILKMGYHPRLRIQGH------------LLYSWYG-----EA
        L      F    K+L I +                 DIL+KK+RLG++E V+LTEECSAI++    P+L+  G             + YS Y      EA
Subjt:  LMYHLYLF-HKGKRLRIRM-----------------DILTKKKRLGEFEIVSLTEECSAILKMGYHPRLRIQGH------------LLYSWYG-----EA

Query:  RPTTVTLQLADRSITYPEGKIEDVLVK--------------------------SAILATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEMEDCSFI
        +PT++TLQLADRS+TYP+G IED+LVK                             LATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  +
Subjt:  RPTTVTLQLADRSITYPEGKIEDVLVK--------------------------SAILATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEMEDCSFI

Query:  RIL-----KSTVIETTMQ----------DSSN---------LD-------------QRKAPP--IKPSLIEAPTLDLKSLSDHLKYVYLGESETLPIIVA
         +      K ++ E  +           D  N         LD             +R AP   +KPS+ E PTL+LK L  HL Y YLGES+TLP+I++
Subjt:  RIL-----KSTVIETTMQ----------DSSN---------LD-------------QRKAPP--IKPSLIEAPTLDLKSLSDHLKYVYLGESETLPIIVA

Query:  SNLMPEHEEALIKLLQQYRKAIGLTL----------------------------------------------LDDGIIYPIANSNWVSPVQCVPKKGGVT
        S+L     E L+++L+ ++ AIG T+                                              LD GIIYPI++ +W+SPVQCVPKKGG+T
Subjt:  SNLMPEHEEALIKLLQQYRKAIGLTL----------------------------------------------LDDGIIYPIANSNWVSPVQCVPKKGGVT

Query:  VVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNHFPLLFIDQMLDQLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFTFRRMPFGLYNAPAT
        VV N +NE IPT+TVTGWRVCMDYR+LNKATRK+HFPL FIDQMLD+LA + +YCFLDGYSGYNQI IAPEDQEKTTFTCPYGTF FRR+PF L NAPAT
Subjt:  VVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNHFPLLFIDQMLDQLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFTFRRMPFGLYNAPAT

Query:  FQQ-------------------------------------------------------------------------------------------------
        FQ+                                                                                                 
Subjt:  FQQ-------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------TPILCAPNWNLPFEVMCDASDAAV---------------------LNEA
                                                            PI+  P+W+ PFE+MCDASD A+                     LN+A
Subjt:  ---------------------------------------------------TPILCAPNWNLPFEVMCDASDAAV---------------------LNEA

Query:  QVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENIIADHLSRLDPSSSLLEQSAISDSFP
        Q+NYTTTEKELLAVVFAF+KFR YLVG+KV V+TDHAAIRYL+ KKDAKPRLIRW+LLLQEFDLEI+D+KG EN IADHLSRL+  +   E + I+D+FP
Subjt:  QVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENIIADHLSRLDPSSSLLEQSAISDSFP

Query:  DEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEPFMYKQCSDGIIRRCVSGAEAKEILEQCHSSPYGGHFSDQRIAMRIL
        DEQL A+   V  DVPWYADI N+L  G+ P D+  +QKKKF  D + ++WD+PF++KQ  D I+RRCV   E  +I EQCH+SPYGGHF   R A +IL
Subjt:  DEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEPFMYKQCSDGIIRRCVSGAEAKEILEQCHSSPYGGHFSDQRIAMRIL

Query:  HCGFFWP
          GFFWP
Subjt:  HCGFFWP

PIN22487.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]1.8e-18037.78Show/hide
Query:  WRNHPNFSWG------------------------DKEVMCKHNKRCRIQSNQASMRALELQVGQLANELKARPQGKLP----------------------
        WR HPNFSW                         +K+   +      + S  A+ + +E Q+GQLAN + +RPQG LP                      
Subjt:  WRNHPNFSWG------------------------DKEVMCKHNKRCRIQSNQASMRALELQVGQLANELKARPQGKLP----------------------

Query:  -----------------RILNTLEGKGAGGSNKNSRASGSVPMWNHLMCRPHL-MYHLYLFHKGKRLRIRM-----------------DILTKKKRLGEF
                          +++  + K      + S+ +   P +   + +  L    L      K+L I +                 DIL+KK+RLG++
Subjt:  -----------------RILNTLEGKGAGGSNKNSRASGSVPMWNHLMCRPHL-MYHLYLFHKGKRLRIRM-----------------DILTKKKRLGEF

Query:  EIVSLTEECSAILKMGYHPRLRIQGH-------------------------LLYSWY-----GEARPTTVTLQLADRSITYPEGKIEDVLVK--------
        E V+LTEECSAI++    P+L+  G                          + YS Y     GEA+PT++TLQLADRS+TYP+G IED+LVK        
Subjt:  EIVSLTEECSAILKMGYHPRLRIQGH-------------------------LLYSWY-----GEARPTTVTLQLADRSITYPEGKIEDVLVK--------

Query:  ------------------SAILATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEMEDCSFIRIL------------------------------KS
                             LATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + +                               + 
Subjt:  ------------------SAILATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEMEDCSFIRIL------------------------------KS

Query:  TVIETTMQDSSNLDQRK---------APPIKPSLIEAPTLDLKSLSDHLKYVYLGESETLPIIVASNLMPEHEEALIKLLQQYRKAIGLTL---------
          +  T+  S  L  R+         +  +KPS+ + PTL+LK L  HL Y YLGES+TLP+I++S+L     E L+++L+ ++ AIG T+         
Subjt:  TVIETTMQDSSNLDQRK---------APPIKPSLIEAPTLDLKSLSDHLKYVYLGESETLPIIVASNLMPEHEEALIKLLQQYRKAIGLTL---------

Query:  -------------------------------------LDDGIIYPIANSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNH
                                             LD GIIYPI++S+WVSPVQCVPKKGG+TVV N +NELIPTRTVTGWRVCMDYR+LNKATRK+H
Subjt:  -------------------------------------LDDGIIYPIANSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNH

Query:  FPLLFIDQMLDQLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFTFRRMPFGLYNAPATFQQ--------------------------------
        FPL FIDQMLD+LA + +YCFLDGYSGYNQI IAPEDQEKTTFTCPYGTF FRRMPFGL NAPATFQ+                                
Subjt:  FPLLFIDQMLDQLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFTFRRMPFGLYNAPATFQQ--------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------TPILCAPNWNLPFEVMCDASDAAV---------------------LNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTD
                         PI+  P+W+ PFE+MCDASD AV                     LN+AQ+NYTTTEKELLAVVFAF+KFR YLVG+KV V+TD
Subjt:  ----------------TPILCAPNWNLPFEVMCDASDAAV---------------------LNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTD

Query:  HAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENIIADHLSRLDPSSSLLEQSAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMD
        HAAIRYL+ KKDAKPRLIRW+LLLQEFDLEI+D+KG+EN IADHLSRL+  +   E + I+D+FPDEQL A+   V  +VPWYADI N+L  G+ P D+ 
Subjt:  HAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENIIADHLSRLDPSSSLLEQSAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMD

Query:  WRQKKKFKHDAKFFYWDEPFMYKQCSDGIIRRCVSGAEAKEILEQCHSSPYGGHFSDQRIAMRILHCGFFWP
         +QKKKF  D + ++WD+PF++KQ  D I+RRCV   E  +ILEQCH+SPYGGHF   R A +IL  GFFWP
Subjt:  WRQKKKFKHDAKFFYWDEPFMYKQCSDGIIRRCVSGAEAKEILEQCHSSPYGGHFSDQRIAMRILHCGFFWP

PIN22518.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]2.3e-18037.25Show/hide
Query:  RAIRAYVVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHVISHQQPPATELAAVVNQVTEEACWRNHPNFSWG---------------------
        +A R   V     LN  I        NFE  P   Q   +V     +S+ + P           T    WR HPNFSW                      
Subjt:  RAIRAYVVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHVISHQQPPATELAAVVNQVTEEACWRNHPNFSWG---------------------

Query:  ---DKEVMCKHNKRCRIQSNQASMRALELQVGQLANELKARPQGKLP-----------------------RILNTLEGKGAGGSNKN-------------
           +K+   +      + S  A+ + +E Q+GQLAN + +RPQG LP                       R L  +  K      K              
Subjt:  ---DKEVMCKHNKRCRIQSNQASMRALELQVGQLANELKARPQGKLP-----------------------RILNTLEGKGAGGSNKN-------------

Query:  ---SRASGSVPMWNHLMCRPHL-MYHLYLFHKGKRLRIRM-----------------DILTKKKRLGEFEIVSLTEECSAILKMGYHPRLRIQGH-----
           S+ +   P +   + +  L    L      K+L I +                 DIL+KK+RLG++E  +LTEEC+AI++    P+L+  G      
Subjt:  ---SRASGSVPMWNHLMCRPHL-MYHLYLFHKGKRLRIRM-----------------DILTKKKRLGEFEIVSLTEECSAILKMGYHPRLRIQGH-----

Query:  --------------------LLYSWY-----GEARPTTVTLQLADRSITYPEGKIEDVLVK--------------------------SAILATGRALIDV
                            + YS Y     GEA+PT++TLQLADRS+TYP+G IED+LVK                             LATGR LIDV
Subjt:  --------------------LLYSWY-----GEARPTTVTLQLADRSITYPEGKIEDVLVK--------------------------SAILATGRALIDV

Query:  QKGELTMRVYNEEVKFNVFKAMKYPDEMEDC---------------------SFIRILKSTVIETTMQD--------------SSNLD--QRKAPP--IK
        QKGELTMRV ++++ FNVFKAMK+P+E ++C                     S  R L   + E   +D              S  ++  +R  P   +K
Subjt:  QKGELTMRVYNEEVKFNVFKAMKYPDEMEDC---------------------SFIRILKSTVIETTMQD--------------SSNLD--QRKAPP--IK

Query:  PSLIEAPTLDLKSLSDHLKYVYLGESETLPIIVASNLMPEHEEALIKLLQQYRKAIGLTL----------------------------------------
        PS+ + PTL+LK L +HL YVYLGES+TLP+I++S+L     E L+++L+ ++ AIG T+                                        
Subjt:  PSLIEAPTLDLKSLSDHLKYVYLGESETLPIIVASNLMPEHEEALIKLLQQYRKAIGLTL----------------------------------------

Query:  ------LDDGIIYPIANSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNHFPLLFIDQMLDQLASQAYYCFLDGYSGYNQI
              LD GIIYPI++S+WVSPVQCVPKKGG+TVV N +NELIPTRTVTGWRVCMDYR+LNKATRK+HFPL FIDQMLD+LA + +YCFLDGYSGYNQI
Subjt:  ------LDDGIIYPIANSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNHFPLLFIDQMLDQLASQAYYCFLDGYSGYNQI

Query:  TIAPEDQEKTTFTCPYGTFTFRRMPFGLYNAPATFQQ---------------------------------------------------------------
         IAPEDQEKTTFTCPYGTF FRRMPFGL NAPATFQ+                                                               
Subjt:  TIAPEDQEKTTFTCPYGTFTFRRMPFGLYNAPATFQQ---------------------------------------------------------------

Query:  -------------------------------------------------------------------------------------TPILCAPNWNLPFEV
                                                                                              PI+  P+W+ PFE+
Subjt:  -------------------------------------------------------------------------------------TPILCAPNWNLPFEV

Query:  MCDASDAAV---------------------LNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEI
        MCDASD A+                     LN+AQ+NYTTTEKELLAVVFAF+KFR YLVG+KV V+TDHAAIRYL+ KKDAKPRLIRW+LLLQEFDLEI
Subjt:  MCDASDAAV---------------------LNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEI

Query:  KDKKGSENIIADHLSRLDPSSSLLEQSAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEPFMYKQCSDGIIR
        +D+KG+EN IADHLSRL+  +   E + I+D+FPDEQL A+   V  +VPWYADI N+L  G+ P D+  +QKKKF  D + ++WD+PF++KQ  D I+R
Subjt:  KDKKGSENIIADHLSRLDPSSSLLEQSAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEPFMYKQCSDGIIR

Query:  RCVSGAEAKEILEQCHSSPYGGHFSDQRIAMRILHCGFFWP
        RCV   E  +ILEQCH+SPYGGHF   R A +IL  GFFWP
Subjt:  RCVSGAEAKEILEQCHSSPYGGHFSDQRIAMRILHCGFFWP

TrEMBL top hitse value%identityAlignment
A0A2G9FWY3 Reverse transcriptase1.9e-17238.1Show/hide
Query:  WRNHPNFSWG------------------------DKEVMCKHNKRCRIQSNQASMRALELQVGQLANELKARPQGKLP---------------RILNTLE
        WR HPNFSW                         +K+   +      + S  A+ + ++ Q+GQLAN + +RPQG LP               + +    
Subjt:  WRNHPNFSWG------------------------DKEVMCKHNKRCRIQSNQASMRALELQVGQLANELKARPQGKLP---------------RILNTLE

Query:  GKGAGGSNKNSRASGSVPMWNHLMCRP----------HLMYHLYLFHKGKRLRIRMDILTKKKRLGEFEIVSLTEECSAILKMGYHPRLRIQG-------
        G+    + K    S    + +    +           ++ +   L      ++   DIL+KK+ LG++E V+LTEECSAI++    P+L+  G       
Subjt:  GKGAGGSNKNSRASGSVPMWNHLMCRP----------HLMYHLYLFHKGKRLRIRMDILTKKKRLGEFEIVSLTEECSAILKMGYHPRLRIQG-------

Query:  ---HL---------LYSWYGEARPTTVTLQLADRSITYPEGKIEDVLVK--------------------------SAILATGRALIDVQKGELTMRVYNE
           HL              GEA+PT++TLQLADRS+TYP+G IED+LVK                             LATGR LIDVQ           
Subjt:  ---HL---------LYSWYGEARPTTVTLQLADRSITYPEGKIEDVLVK--------------------------SAILATGRALIDVQKGELTMRVYNE

Query:  EVKFNVFKAMKYPDEMEDCSFIRILKSTV------------IETTMQD------------SSNLD-------------QRKAPP--IKPSLIEAPTLDLK
               KAMK+P+E ++C  + +  + V            +E  + D               LD             +R AP   +KPS+ E PTL+LK
Subjt:  EVKFNVFKAMKYPDEMEDCSFIRILKSTV------------IETTMQD------------SSNLD-------------QRKAPP--IKPSLIEAPTLDLK

Query:  SLSDHLKYVYLGESETLPIIVASNLMPEHEEALIKLLQQYRKAIGLTL----------------------------------------------LDDGII
         L +HL Y YLGES+TLP+I++S+L     E L+++L+ ++ AIG T+                                              LD GII
Subjt:  SLSDHLKYVYLGESETLPIIVASNLMPEHEEALIKLLQQYRKAIGLTL----------------------------------------------LDDGII

Query:  YPIANSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNHFPLLFIDQMLDQLASQAYYCFLDGYSGYNQITIAPEDQEKTTF
        YPI++S+WVSPVQCVPKKGG+TVV N +NELIPTRTVTGWRVCMDYR+LNKATRK+HFPLLFIDQMLD+LA + +YCFLDGYSGYNQI IAPEDQEK TF
Subjt:  YPIANSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNHFPLLFIDQMLDQLASQAYYCFLDGYSGYNQITIAPEDQEKTTF

Query:  TCPYGTFTFRRMPFGLYNAPATFQQ---------------------------------------------------------------------------
        TCPYGTF FRRMPFGL NAPATFQ+                                                                           
Subjt:  TCPYGTFTFRRMPFGLYNAPATFQQ---------------------------------------------------------------------------

Query:  -------------------------------------------------------------------------TPILCAPNWNLPFEVMCDASDAAV---
                                                                                  PI+  P+W+ PFE+MCDASD AV   
Subjt:  -------------------------------------------------------------------------TPILCAPNWNLPFEVMCDASDAAV---

Query:  ------------------LNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENIIAD
                          LN+AQ+NYTTTEKELLAVVFAF+KFR YLVG+KV V+TDHAAIRYL+ KKDAKPRLIRW+LLLQEFDLEI+D+KG+EN IAD
Subjt:  ------------------LNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENIIAD

Query:  HLSRLDPSSSLLEQSAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEPFMYKQCSDGIIRRCVSGAEAKEIL
        HLSRL+  +   E + I+D+FPDEQL A+   V  DVPWYADI N+L  G+ P D+  +QKKKF  D + ++WD+PF++KQ  D I+RRCV   E  +IL
Subjt:  HLSRLDPSSSLLEQSAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEPFMYKQCSDGIIRRCVSGAEAKEIL

Query:  EQCHSSPYGGHFSDQRIAMRILHCGFFWP
        EQCH+SPYGGHF   R A +IL  GFFWP
Subjt:  EQCHSSPYGGHFSDQRIAMRILHCGFFWP

A0A2G9H400 Reverse transcriptase6.6e-17340.08Show/hide
Query:  WRNHPNFSWGDKEVMCKHNKRCRIQSNQASMRALELQVGQLANELKARPQGKL-------PR-----------ILNTLEGKGAGGSNKNSRASGSVPMWN
        WR HPNFSW                +NQ    A   Q GQLAN + +RPQG L       PR           + N  E +        S+    +P   
Subjt:  WRNHPNFSWGDKEVMCKHNKRCRIQSNQASMRALELQVGQLANELKARPQGKL-------PR-----------ILNTLEGKGAGGSNKNSRASGSVPMWN

Query:  H-----LMCRPH--LMYHLYLFHKGKRLRIRMDILTKKKRLGEFEIVSLTEECSAILKMGYHPRLRIQGH-------------------------LLYSW
              L+ + H  + +   L      ++   DIL+KK+RLG++E+V+LTEECSAI++    P+L+  G                          + YS 
Subjt:  H-----LMCRPH--LMYHLYLFHKGKRLRIRMDILTKKKRLGEFEIVSLTEECSAILKMGYHPRLRIQGH-------------------------LLYSW

Query:  Y-----GEARPTTVTLQLADRSITYPEGKIEDVLVK--------------------------SAILATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYP
        Y     GEA+ T++TLQLADRS+TYP+G IED+LVK                             LATGR LIDVQKGELTMRV ++++ FNVFKAMK+P
Subjt:  Y-----GEARPTTVTLQLADRSITYPEGKIEDVLVK--------------------------SAILATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYP

Query:  DEMEDCSFIRI-----------------LKSTVIETTMQDS-------SNLD-------------QRKAPP--IKPSLIEAPTLDLKSLSDHLKYVYLGE
        +E ++C  + +                 L+  +++   +D+         LD             +R AP   +KPS+ + PTL+LK L  HL Y YLGE
Subjt:  DEMEDCSFIRI-----------------LKSTVIETTMQDS-------SNLD-------------QRKAPP--IKPSLIEAPTLDLKSLSDHLKYVYLGE

Query:  SETLPIIVASNLMPEHEEALIKLLQQYRKAIGLTL----------------------------------------------LDDGIIYPIANSNWVSPVQ
         +TLP+I++S+L     E L+++++ ++ AIG T+                                              LD GIIYPI++S+WVSPVQ
Subjt:  SETLPIIVASNLMPEHEEALIKLLQQYRKAIGLTL----------------------------------------------LDDGIIYPIANSNWVSPVQ

Query:  CVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNHFPLLFIDQMLDQLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFTFRRMP
        CVPKKGG+TVV N +NELIPTRTVTG RVCMDYR+LNKATRK+HFPL FIDQMLD+LA + +YCFLDGYSGYNQI IAPEDQEKTTFTCPYGTF FRRMP
Subjt:  CVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNHFPLLFIDQMLDQLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFTFRRMP

Query:  FGLYNAPATFQQ----------------------------------------------------------------------------------------
        FGL NAPATFQ+                                                                                        
Subjt:  FGLYNAPATFQQ----------------------------------------------------------------------------------------

Query:  ----------------------TPILCAPNWNLPFEVMCDASDAAV---------------------LNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSK
                               PI+  P+W  PFE+MCDASD A+                     LN+AQ+NYTTTEKELLAVVFAF+KFR YLVG+K
Subjt:  ----------------------TPILCAPNWNLPFEVMCDASDAAV---------------------LNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSK

Query:  VTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENIIADHLSRLDPSSSLLEQSAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGV
        V V+TDHAAIRYL+ KKDA P LIRW+LLLQEFDLEI+D+KG+EN IADHLSRLD  +   E + I+D+FPDEQL A+   V  DVP             
Subjt:  VTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENIIADHLSRLDPSSSLLEQSAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGV

Query:  TPIDMDWRQKKKFKHDAKFFYWDEPFMYKQCSDGIIRRCVSGAEAKEILEQCHSSPYGGHFSDQRIAMRILHCGFFWP
               +QKKKF  D + ++WD+PF++KQ  D I+RRCV   E  +ILEQCH+SPYGGHF   R A +IL  GFFWP
Subjt:  TPIDMDWRQKKKFKHDAKFFYWDEPFMYKQCSDGIIRRCVSGAEAKEILEQCHSSPYGGHFSDQRIAMRILHCGFFWP

A0A2G9HWF8 Reverse transcriptase1.0e-17839.52Show/hide
Query:  WRNHPNFSWGDKEVM---------CKHN-KRCRIQSNQASMRALELQVGQLANELKARP-QGKLPRILNTLEGKGAGGSNKNSRASGSVPMWNHLMCRPH
        WR HPNFSW + +            +HN K    Q  +A  +A+ L+ G+   E+   P + K   + +  + K      + S+ +   P +   + +  
Subjt:  WRNHPNFSWGDKEVM---------CKHN-KRCRIQSNQASMRALELQVGQLANELKARP-QGKLPRILNTLEGKGAGGSNKNSRASGSVPMWNHLMCRPH

Query:  LMYHLYLF-HKGKRLRIRM-----------------DILTKKKRLGEFEIVSLTEECSAILKMGYHPRLRIQGH------------LLYSWYG-----EA
        L      F    K+L I +                 DIL+KK+RLG++E V+LTEECSAI++    P+L+  G             + YS Y      EA
Subjt:  LMYHLYLF-HKGKRLRIRM-----------------DILTKKKRLGEFEIVSLTEECSAILKMGYHPRLRIQGH------------LLYSWYG-----EA

Query:  RPTTVTLQLADRSITYPEGKIEDVLVK--------------------------SAILATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEMEDCSFI
        +PT++TLQLADRS+TYP+G IED+LVK                             LATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  +
Subjt:  RPTTVTLQLADRSITYPEGKIEDVLVK--------------------------SAILATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEMEDCSFI

Query:  RIL-----KSTVIETTMQ----------DSSN---------LD-------------QRKAPP--IKPSLIEAPTLDLKSLSDHLKYVYLGESETLPIIVA
         +      K ++ E  +           D  N         LD             +R AP   +KPS+ E PTL+LK L  HL Y YLGES+TLP+I++
Subjt:  RIL-----KSTVIETTMQ----------DSSN---------LD-------------QRKAPP--IKPSLIEAPTLDLKSLSDHLKYVYLGESETLPIIVA

Query:  SNLMPEHEEALIKLLQQYRKAIGLTL----------------------------------------------LDDGIIYPIANSNWVSPVQCVPKKGGVT
        S+L     E L+++L+ ++ AIG T+                                              LD GIIYPI++ +W+SPVQCVPKKGG+T
Subjt:  SNLMPEHEEALIKLLQQYRKAIGLTL----------------------------------------------LDDGIIYPIANSNWVSPVQCVPKKGGVT

Query:  VVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNHFPLLFIDQMLDQLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFTFRRMPFGLYNAPAT
        VV N +NE IPT+TVTGWRVCMDYR+LNKATRK+HFPL FIDQMLD+LA + +YCFLDGYSGYNQI IAPEDQEKTTFTCPYGTF FRR+PF L NAPAT
Subjt:  VVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNHFPLLFIDQMLDQLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFTFRRMPFGLYNAPAT

Query:  FQQ-------------------------------------------------------------------------------------------------
        FQ+                                                                                                 
Subjt:  FQQ-------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------TPILCAPNWNLPFEVMCDASDAAV---------------------LNEA
                                                            PI+  P+W+ PFE+MCDASD A+                     LN+A
Subjt:  ---------------------------------------------------TPILCAPNWNLPFEVMCDASDAAV---------------------LNEA

Query:  QVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENIIADHLSRLDPSSSLLEQSAISDSFP
        Q+NYTTTEKELLAVVFAF+KFR YLVG+KV V+TDHAAIRYL+ KKDAKPRLIRW+LLLQEFDLEI+D+KG EN IADHLSRL+  +   E + I+D+FP
Subjt:  QVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENIIADHLSRLDPSSSLLEQSAISDSFP

Query:  DEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEPFMYKQCSDGIIRRCVSGAEAKEILEQCHSSPYGGHFSDQRIAMRIL
        DEQL A+   V  DVPWYADI N+L  G+ P D+  +QKKKF  D + ++WD+PF++KQ  D I+RRCV   E  +I EQCH+SPYGGHF   R A +IL
Subjt:  DEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEPFMYKQCSDGIIRRCVSGAEAKEILEQCHSSPYGGHFSDQRIAMRIL

Query:  HCGFFWP
          GFFWP
Subjt:  HCGFFWP

A0A2G9HYA0 Reverse transcriptase8.6e-18137.78Show/hide
Query:  WRNHPNFSWG------------------------DKEVMCKHNKRCRIQSNQASMRALELQVGQLANELKARPQGKLP----------------------
        WR HPNFSW                         +K+   +      + S  A+ + +E Q+GQLAN + +RPQG LP                      
Subjt:  WRNHPNFSWG------------------------DKEVMCKHNKRCRIQSNQASMRALELQVGQLANELKARPQGKLP----------------------

Query:  -----------------RILNTLEGKGAGGSNKNSRASGSVPMWNHLMCRPHL-MYHLYLFHKGKRLRIRM-----------------DILTKKKRLGEF
                          +++  + K      + S+ +   P +   + +  L    L      K+L I +                 DIL+KK+RLG++
Subjt:  -----------------RILNTLEGKGAGGSNKNSRASGSVPMWNHLMCRPHL-MYHLYLFHKGKRLRIRM-----------------DILTKKKRLGEF

Query:  EIVSLTEECSAILKMGYHPRLRIQGH-------------------------LLYSWY-----GEARPTTVTLQLADRSITYPEGKIEDVLVK--------
        E V+LTEECSAI++    P+L+  G                          + YS Y     GEA+PT++TLQLADRS+TYP+G IED+LVK        
Subjt:  EIVSLTEECSAILKMGYHPRLRIQGH-------------------------LLYSWY-----GEARPTTVTLQLADRSITYPEGKIEDVLVK--------

Query:  ------------------SAILATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEMEDCSFIRIL------------------------------KS
                             LATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + +                               + 
Subjt:  ------------------SAILATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEMEDCSFIRIL------------------------------KS

Query:  TVIETTMQDSSNLDQRK---------APPIKPSLIEAPTLDLKSLSDHLKYVYLGESETLPIIVASNLMPEHEEALIKLLQQYRKAIGLTL---------
          +  T+  S  L  R+         +  +KPS+ + PTL+LK L  HL Y YLGES+TLP+I++S+L     E L+++L+ ++ AIG T+         
Subjt:  TVIETTMQDSSNLDQRK---------APPIKPSLIEAPTLDLKSLSDHLKYVYLGESETLPIIVASNLMPEHEEALIKLLQQYRKAIGLTL---------

Query:  -------------------------------------LDDGIIYPIANSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNH
                                             LD GIIYPI++S+WVSPVQCVPKKGG+TVV N +NELIPTRTVTGWRVCMDYR+LNKATRK+H
Subjt:  -------------------------------------LDDGIIYPIANSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNH

Query:  FPLLFIDQMLDQLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFTFRRMPFGLYNAPATFQQ--------------------------------
        FPL FIDQMLD+LA + +YCFLDGYSGYNQI IAPEDQEKTTFTCPYGTF FRRMPFGL NAPATFQ+                                
Subjt:  FPLLFIDQMLDQLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFTFRRMPFGLYNAPATFQQ--------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------TPILCAPNWNLPFEVMCDASDAAV---------------------LNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTD
                         PI+  P+W+ PFE+MCDASD AV                     LN+AQ+NYTTTEKELLAVVFAF+KFR YLVG+KV V+TD
Subjt:  ----------------TPILCAPNWNLPFEVMCDASDAAV---------------------LNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTD

Query:  HAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENIIADHLSRLDPSSSLLEQSAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMD
        HAAIRYL+ KKDAKPRLIRW+LLLQEFDLEI+D+KG+EN IADHLSRL+  +   E + I+D+FPDEQL A+   V  +VPWYADI N+L  G+ P D+ 
Subjt:  HAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENIIADHLSRLDPSSSLLEQSAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMD

Query:  WRQKKKFKHDAKFFYWDEPFMYKQCSDGIIRRCVSGAEAKEILEQCHSSPYGGHFSDQRIAMRILHCGFFWP
         +QKKKF  D + ++WD+PF++KQ  D I+RRCV   E  +ILEQCH+SPYGGHF   R A +IL  GFFWP
Subjt:  WRQKKKFKHDAKFFYWDEPFMYKQCSDGIIRRCVSGAEAKEILEQCHSSPYGGHFSDQRIAMRILHCGFFWP

A0A2G9HYD8 Reverse transcriptase1.1e-18037.25Show/hide
Query:  RAIRAYVVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHVISHQQPPATELAAVVNQVTEEACWRNHPNFSWG---------------------
        +A R   V     LN  I        NFE  P   Q   +V     +S+ + P           T    WR HPNFSW                      
Subjt:  RAIRAYVVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHVISHQQPPATELAAVVNQVTEEACWRNHPNFSWG---------------------

Query:  ---DKEVMCKHNKRCRIQSNQASMRALELQVGQLANELKARPQGKLP-----------------------RILNTLEGKGAGGSNKN-------------
           +K+   +      + S  A+ + +E Q+GQLAN + +RPQG LP                       R L  +  K      K              
Subjt:  ---DKEVMCKHNKRCRIQSNQASMRALELQVGQLANELKARPQGKLP-----------------------RILNTLEGKGAGGSNKN-------------

Query:  ---SRASGSVPMWNHLMCRPHL-MYHLYLFHKGKRLRIRM-----------------DILTKKKRLGEFEIVSLTEECSAILKMGYHPRLRIQGH-----
           S+ +   P +   + +  L    L      K+L I +                 DIL+KK+RLG++E  +LTEEC+AI++    P+L+  G      
Subjt:  ---SRASGSVPMWNHLMCRPHL-MYHLYLFHKGKRLRIRM-----------------DILTKKKRLGEFEIVSLTEECSAILKMGYHPRLRIQGH-----

Query:  --------------------LLYSWY-----GEARPTTVTLQLADRSITYPEGKIEDVLVK--------------------------SAILATGRALIDV
                            + YS Y     GEA+PT++TLQLADRS+TYP+G IED+LVK                             LATGR LIDV
Subjt:  --------------------LLYSWY-----GEARPTTVTLQLADRSITYPEGKIEDVLVK--------------------------SAILATGRALIDV

Query:  QKGELTMRVYNEEVKFNVFKAMKYPDEMEDC---------------------SFIRILKSTVIETTMQD--------------SSNLD--QRKAPP--IK
        QKGELTMRV ++++ FNVFKAMK+P+E ++C                     S  R L   + E   +D              S  ++  +R  P   +K
Subjt:  QKGELTMRVYNEEVKFNVFKAMKYPDEMEDC---------------------SFIRILKSTVIETTMQD--------------SSNLD--QRKAPP--IK

Query:  PSLIEAPTLDLKSLSDHLKYVYLGESETLPIIVASNLMPEHEEALIKLLQQYRKAIGLTL----------------------------------------
        PS+ + PTL+LK L +HL YVYLGES+TLP+I++S+L     E L+++L+ ++ AIG T+                                        
Subjt:  PSLIEAPTLDLKSLSDHLKYVYLGESETLPIIVASNLMPEHEEALIKLLQQYRKAIGLTL----------------------------------------

Query:  ------LDDGIIYPIANSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNHFPLLFIDQMLDQLASQAYYCFLDGYSGYNQI
              LD GIIYPI++S+WVSPVQCVPKKGG+TVV N +NELIPTRTVTGWRVCMDYR+LNKATRK+HFPL FIDQMLD+LA + +YCFLDGYSGYNQI
Subjt:  ------LDDGIIYPIANSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNHFPLLFIDQMLDQLASQAYYCFLDGYSGYNQI

Query:  TIAPEDQEKTTFTCPYGTFTFRRMPFGLYNAPATFQQ---------------------------------------------------------------
         IAPEDQEKTTFTCPYGTF FRRMPFGL NAPATFQ+                                                               
Subjt:  TIAPEDQEKTTFTCPYGTFTFRRMPFGLYNAPATFQQ---------------------------------------------------------------

Query:  -------------------------------------------------------------------------------------TPILCAPNWNLPFEV
                                                                                              PI+  P+W+ PFE+
Subjt:  -------------------------------------------------------------------------------------TPILCAPNWNLPFEV

Query:  MCDASDAAV---------------------LNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEI
        MCDASD A+                     LN+AQ+NYTTTEKELLAVVFAF+KFR YLVG+KV V+TDHAAIRYL+ KKDAKPRLIRW+LLLQEFDLEI
Subjt:  MCDASDAAV---------------------LNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEI

Query:  KDKKGSENIIADHLSRLDPSSSLLEQSAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEPFMYKQCSDGIIR
        +D+KG+EN IADHLSRL+  +   E + I+D+FPDEQL A+   V  +VPWYADI N+L  G+ P D+  +QKKKF  D + ++WD+PF++KQ  D I+R
Subjt:  KDKKGSENIIADHLSRLDPSSSLLEQSAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEPFMYKQCSDGIIR

Query:  RCVSGAEAKEILEQCHSSPYGGHFSDQRIAMRILHCGFFWP
        RCV   E  +ILEQCH+SPYGGHF   R A +IL  GFFWP
Subjt:  RCVSGAEAKEILEQCHSSPYGGHFSDQRIAMRILHCGFFWP

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.66.4e-2421.9Show/hide
Query:  PTTVTLQLADRSITYPEGKIEDVLVKSAILATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEMEDCSFI--RILK-----STVIETTMQDSSNLDQ
        P+ +     +  + +P  +  D+L+   +LA  +A I  +  E+T+  YN + K     A       ++ + I   +L+     S ++E+ +    +L+ 
Subjt:  PTTVTLQLADRSITYPEGKIEDVLVKSAILATGRALIDVQKGELTMRVYNEEVKFNVFKAMKYPDEMEDCSFI--RILK-----STVIETTMQDSSNLDQ

Query:  RKAPPIKPSL-----IEAPTLDLKSLSDHLKYVYLGESETLPIIVASNLMPEHEEALIKLLQQYRKAIGLTLLDDGIIYPIANSNWVSPVQCVPKKGGVT
         +   +   L     I+    D  + ++  K+  +     LP+    +    +E+ +   +Q         +L+ GII   +NS + SP+  VPKK    
Subjt:  RKAPPIKPSL-----IEAPTLDLKSLSDHLKYVYLGESETLPIIVASNLMPEHEEALIKLLQQYRKAIGLTLLDDGIIYPIANSNWVSPVQCVPKKGGVT

Query:  VVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNHFPLLFIDQMLDQLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFTFRRMPFGLYNAPAT
          S K            +R+ +DYR+LN+ T  +  P+  +D++L +L    Y+  +D   G++QI + PE   KT F+  +G + + RMPFGL NAPAT
Subjt:  VVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNHFPLLFIDQMLDQLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFTFRRMPFGLYNAPAT

Query:  FQ--------------------------------------------------------------------------------------------------
        FQ                                                                                                  
Subjt:  FQ--------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------QTPILCAPNWNLPFEVMCDASDAAV-----------------LNEAQVN
                                                           + PIL  P++   F +  DASD A+                 LNE ++N
Subjt:  ---------------------------------------------------QTPILCAPNWNLPFEVMCDASDAAV-----------------LNEAQVN

Query:  YTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENIIADHLSRLDPSSSLLEQ
        Y+T EKELLA+V+A + FR YL+G    + +DH  + +L   KD   +L RW + L EFD +IK  KG EN +AD LSR+    + L +
Subjt:  YTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENIIADHLSRLDPSSSLLEQ

P20825 Retrovirus-related Pol polyprotein from transposon 2978.9e-1822.98Show/hide
Query:  LLDDGIIYPIANSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNHFPLLFIDQMLDQLASQAYYCFLDGYSGYNQITIAPE
        +L+ G+I   +NS + SP   VPKK   +  +NK            +RV +DYR+LN+ T  + +P+  +D++L +L    Y+  +D   G++QI +  E
Subjt:  LLDDGIIYPIANSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNHFPLLFIDQMLDQLASQAYYCFLDGYSGYNQITIAPE

Query:  DQEKTTFTCPYGTFTFRRMPFGLYNAPATFQ---------------------------------------------------------------------
           KT F+   G + + RMPFGL NAPATFQ                                                                     
Subjt:  DQEKTTFTCPYGTFTFRRMPFGLYNAPATFQ---------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------QTPILCAPNWNLPFEVMCDA
                                                                                        + PIL  P++   F +  DA
Subjt:  --------------------------------------------------------------------------------QTPILCAPNWNLPFEVMCDA

Query:  SDAAV-----------------LNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSEN
        S+ A+                 LN+ ++NY+  EKELLA+V+A + FR YL+G +  + +DH  +R+L + K+   +L RW + L E+  +I   KG EN
Subjt:  SDAAV-----------------LNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSEN

Query:  IIADHLSRL
         +AD LSR+
Subjt:  IIADHLSRL

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.1e-1836.65Show/hide
Query:  LPIIVASNLMPEHEEALIKLLQQYRKAIGLTLLDDGIIYPIANSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNHFPLLF
        LP +   ++  ++E+ + K++Q+        LLD+  I P + S   SPV  VPKK G                   +R+C+DYR LNKAT  + FPL  
Subjt:  LPIIVASNLMPEHEEALIKLLQQYRKAIGLTLLDDGIIYPIANSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNHFPLLF

Query:  IDQMLDQLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFTFRRMPFGLYNAPATF
        ID +L ++ +   +  LD +SGY+QI + P+D+ KT F  P G + +  MPFGL NAP+TF
Subjt:  IDQMLDQLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFTFRRMPFGLYNAPATF

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.2e-0636.05Show/hide
Query:  LNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENIIADHLSR
        L  AQ NY   E ELL ++ A   FR  L G   T+ TDH ++  L +K +   R+ RW+  L  +D  ++   G +N++AD +SR
Subjt:  LNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENIIADHLSR

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus2.2e-1621.32Show/hide
Query:  LLDDGIIYPIANSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNHFPLLFIDQMLDQLASQAYYCFLDGYSGYNQITIAPE
        LL DGII P +NS + SP+  VPKK         N E         +R+ +D++RLN  T  + +P+  I+  L  L +  Y+  LD  SG++QI +   
Subjt:  LLDDGIIYPIANSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNHFPLLFIDQMLDQLASQAYYCFLDGYSGYNQITIAPE

Query:  DQEKTTFTCPYGTFTFRRMPFGLYNAPATFQQ--------------------------------------------------------------------
        D  KT F+   G + F R+PFGL NAPA FQ+                                                                    
Subjt:  DQEKTTFTCPYGTFTFRRMPFGLYNAPATFQQ--------------------------------------------------------------------

Query:  -------------------------------------------------------------------------------------------TPILCAPNW
                                                                                                   + IL  P +
Subjt:  -------------------------------------------------------------------------------------------TPILCAPNW

Query:  NLPFEVMCDASDAAV---------------------LNEAQVNYTTTEKELLAVVFAFEKFRPYLVGS-KVTVFTDHAAIRYLMSKKDAKPRLIRWILLL
          PF +  DAS+ A+                     LN+ + NY T EKE+LA++++ +  R YL G+  + V+TDH  + + +  ++   +L RW   +
Subjt:  NLPFEVMCDASDAAV---------------------LNEAQVNYTTTEKELLAVVFAFEKFRPYLVGS-KVTVFTDHAAIRYLMSKKDAKPRLIRWILLL

Query:  QEFDLEIKDKKGSENIIADHLSRLDPSSSLLEQSAISDSFPDEQLFAVEVKVVRD
        +E++ E+  K G  N++AD LSR+ P  + L     ++   D Q  A     + D
Subjt:  QEFDLEIKDKKGSENIIADHLSRLDPSSSLLEQSAISDSFPDEQLFAVEVKVVRD

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.1e-1836.65Show/hide
Query:  LPIIVASNLMPEHEEALIKLLQQYRKAIGLTLLDDGIIYPIANSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNHFPLLF
        LP +   ++  ++E+ + K++Q+        LLD+  I P + S   SPV  VPKK G                   +R+C+DYR LNKAT  + FPL  
Subjt:  LPIIVASNLMPEHEEALIKLLQQYRKAIGLTLLDDGIIYPIANSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNHFPLLF

Query:  IDQMLDQLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFTFRRMPFGLYNAPATF
        ID +L ++ +   +  LD +SGY+QI + P+D+ KT F  P G + +  MPFGL NAP+TF
Subjt:  IDQMLDQLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFTFRRMPFGLYNAPATF

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.2e-0636.05Show/hide
Query:  LNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENIIADHLSR
        L  AQ NY   E ELL ++ A   FR  L G   T+ TDH ++  L +K +   R+ RW+  L  +D  ++   G +N++AD +SR
Subjt:  LNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRLIRWILLLQEFDLEIKDKKGSENIIADHLSR

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATAGGTGAGATTAAGTCTACTCCTGTAAAGCTCCAATTGGCTGATCAATCTGTGGTGACACCAGTTGGTATTGTAGAAAATGTATTAATCAGAGTAGGTAGATTTTTCCT
CCCTATTGACTTGTATGTTATGGACATGATAGAAAATCCTTCGATGCCTGTCATATTAGGACGACCATTCCTCGCAACTGGGCGAGTGATTATAGATATCGAGCGTAGGG
AGCTCACTATTAGAGTCAAGAAAGAAAAAGAAATCTTTAAAGCAGTTGAAGACTCTAAAGATGATGTGCTTTTCATGGGATATAGGAAAGGTCTTGTTTTCTTTCTCTCC
TGTGCTTTCAAGCTTTCAAGATTCCAAGTTCCAAGCAAGAAGTCATCATCATTCCAAGCCAGTATTGTTCACTCCTATCATTCTCTTATTGTTCTCATGCTTGTTTATGC
CTTATTTTCTTTATACATTGAGGACAATGCATATTTTAAGTTTGGGGGTGGATTAGCATGGCTAGTAGAGTCTGATCTTGTTGAGTTAGATCTCGCTTGTGCAATTTCCT
CGAGGACTAGCAAAAGCTTAAGTTTGGGGGAAATGGTTGATTGGAGCTATATTGCCATATTAGAGTTAATCGGGTGCTCGGGGCGTGAAAAGATGCAAAGGAATGAAAAG
AGTAAAAGAGGAAAAAAGTCAAATTTCAGTCAACAGCAGGCTAGCGTCGAGACGCTAGCTCTTGAGCGTCTCGACGCTCACATTACATATCAGATTAGGCGCGTAAAGCT
TACAGCGTCGGGACGCTTCCTATTACAGCGTCGGGATGCTTCCTATTACGCTTCCTATTACCGTTTTTCCCTTATTCAGAACGCGCGAAAAGTTCACATCAAATCTCGTT
TCCATATCTTCAGGCCAAGAGGAAATAGGTCGAGTCTACAGACCAAAGAACTAGCTAAGACACCCGAGGCTATACGGTACCGTGTGCACACAGGTTATATCCCGTTGTTG
ACGTTGAGTGTACTCCGTGACAACGATGTTGTCGTAGAGATCGAGCTCCCGGTGCCTGATACACTGCCAACGTCTGCTGAAAGTTCCAAAACAAGCTCCGGTTATTATCC
GAACCAAGTTATGTCCGGGCGGCACCCCGTGTATGTTGGATGGTCTCAGGGTTTCCGCTGTGTTGTGTTTCATATAGAATATTTTATATCTAAGATTAGTATGTTTATCA
AGTGGAACGTTTCAGGAAGGTCAACGGTTTTGCCACACAATCCCGATGGCCAACAAAGTTCTCGTAGCCAATTTGGAGATTATGCTGCTGGGCGACTGGAGGGAGCAAAT
TCTGTGTTGCAGCAAAACTGGGAACAAAACTGCCACATCACAGCTCGTGTGAGTTTGGTGCATGAGCGATCCACCTGGGGTACGGTCCAGCGAACCCCAGCAGAATCGTT
GGTGCAGCAAAACCCGCTGTTTGAGCAAAATGAGCAGCAAAATAATCAGGCGGAAAATCCTATCTTGATTGCGAACGATAGGACCAGAGCCATTCGAGCGTATGTTGTCC
CAATGTTTAATGAGTTGAATCCAGGGATTGCACGTCCCCAAATCCAAGCGGCGAATTTTGAAATGAAACCGGTAATGTTTCAGATGTTGCAAACCGTGGGACAATTTCAT
GTGATTAGTCATCAGCAGCCACCAGCTACGGAGCTTGCAGCAGTGGTGAACCAAGTCACAGAAGAAGCATGTTGGCGCAACCACCCCAACTTCTCATGGGGAGACAAGGA
AGTAATGTGCAAGCACAACAAAAGATGCCGAATTCAAAGTAATCAAGCTTCGATGAGAGCCCTGGAATTGCAAGTGGGTCAGCTAGCTAATGAGCTGAAGGCAAGGCCTC
AAGGGAAACTTCCTCGGATACTGAACACCCTCGAAGGGAAGGGTGCTGGAGGCAGCAATAAAAATTCTCGAGCATCTGGTTCTGTTCCAATGTGGAACCACCTTATGTGC
CGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAACGCCTAAGAATCAGGATGGATATTTTAACTAAAAAGAAGAGGCTAGGTGAGTTTGAAATTGTATC
TCTTACTGAGGAGTGTAGTGCTATTCTTAAAATGGGCTACCACCCAAGGCTAAGGATCCAGGGTCATTTACTATACTCTTGGTATGGTGAAGCTAGGCCAACCACAGTCA
CACTCCAACTAGCTGATAGGTCTATCACATATCCAGAGGGTAAAATTGAGGATGTCTTAGTAAAGTCGGCCATTTTGGCTACTGGTAGGGCGTTAATAGATGTTCAAAAA
GGGGAGTTAACAATGAGAGTCTATAATGAGGAAGTGAAATTTAATGTGTTTAAAGCCATGAAGTATCCAGACGAAATGGAGGATTGCTCTTTCATTAGGATTCTGAAGAG
CACAGTTATTGAGACAACAATGCAGGATTCGTCTAATTTAGATCAGAGGAAAGCTCCTCCCATTAAGCCATCCCTGATTGAGGCACCTACTTTAGATTTGAAGTCCTTGT
CGGATCATCTAAAGTATGTGTATCTTGGGGAAAGTGAGACGTTGCCCATTATTGTTGCATCAAATTTAATGCCAGAGCATGAAGAGGCCTTAATAAAATTGCTGCAGCAA
TACCGCAAGGCTATAGGTTTGACATTGTTGGATGATGGGATCATTTATCCAATTGCCAATAGCAATTGGGTAAGCCCTGTCCAATGTGTTCCTAAGAAGGGAGGTGTCAC
TGTGGTGAGCAATAAAAACAATGAGTTGATCCCAACCAGGACAGTAACTGGCTGGAGGGTTTGCATGGATTACAGGAGGCTTAATAAAGCTACCCGTAAGAACCATTTCC
CTCTACTATTTATCGACCAGATGTTGGATCAATTGGCTAGTCAGGCCTACTACTGTTTCTTGGATGGTTATTCTGGGTATAATCAGATTACTATTGCTCCTGAGGATCAG
GAGAAAACCACTTTCACCTGTCCTTATGGGACATTCACTTTTAGGAGAATGCCTTTCGGCCTCTACAATGCTCCAGCAACATTTCAGCAGACACCCATTCTTTGCGCACC
TAATTGGAATTTACCATTCGAGGTAATGTGTGATGCGAGTGATGCTGCGGTTTTAAATGAAGCACAAGTCAACTATACAACTACTGAAAAGGAGTTGTTAGCTGTGGTGT
TCGCTTTTGAGAAGTTCCGGCCATATTTGGTTGGATCCAAAGTCACGGTTTTTACGGATCATGCAGCAATAAGGTATCTAATGTCTAAGAAAGATGCAAAGCCTAGACTA
ATTCGTTGGATTTTACTGTTGCAGGAATTCGACTTGGAGATAAAGGACAAGAAGGGATCAGAAAACATCATTGCAGATCATTTATCTCGTCTTGATCCGTCATCATCTTT
GCTGGAGCAATCTGCCATTTCCGATTCTTTTCCAGATGAACAGCTCTTTGCTGTTGAGGTAAAGGTAGTTAGGGATGTCCCTTGGTATGCTGATATTGCCAACTTTTTGG
TAAAGGGAGTCACTCCTATTGACATGGATTGGAGGCAGAAGAAAAAGTTTAAGCACGATGCGAAATTTTTCTATTGGGATGAGCCATTTATGTATAAGCAATGCTCTGAC
GGTATTATTCGTAGGTGTGTTTCAGGTGCTGAAGCAAAGGAAATCCTGGAGCAATGTCACTCTTCGCCGTATGGAGGTCATTTCAGCGATCAGAGGATAGCTATGAGGAT
TTTGCACTGCGGATTCTTCTGGCCAACTTATACCCCTTTAAAAGAAAATTCTTCCTTGGAAAGCATAATTCAAAATTATATAGAGGAGTCTAGAGCGAGGAGTCTAAAAA
TTGATGAAATCATAAGAGGACTTCAAACTCCACAGGAAGTTAGTGAGGTGGGATCCACTTCCATAGGTTGTCTAGTGAGTCATGTGGTCGATGATTCTTTGGCACCCATT
CATGAGCTTAGTTCTTTCAATCTTGATGAGGATGAGCAGTTAGGGGTGAGTTGCATAGATAGTAGAGAAGAATTGGAGAGTTGTAGCACCTTTCAAGAACATGTTTGTGA
GGAAGAAAAAGAAAATGAGCTTGCATTGACGGTTCAGGAGGTAGAGTTTGAGGTTGAAAAGCCTTCGTCTGTTTTATCTTCTCCATCCTTTATAGATGTTGATTGTTCTT
TTTTCTATTCTCACGAGTCAGTTTTAGAGTCAGGGTCTCTTTATTTTGATATTTCTCCTTTTATGCTTCTTTTATTCAAAAATACCAAAAATAATTCTGTTTCAGGGGCC
CATGAGAAACTAACTGCACGAGCACGTCGAGCTAGAGACGTTAAACAAGCGCTTATGGGAGGCAACCCAAGAGTTCCCGTGTTCTTGCCTTTGCCAAGCATTCATGAAAA
GAGGGCTGAGGAGCAAGAAGCCATTCGAGAAGAAACAGTGGATGACGTGGATAAAGAGAGAGCTAAAAATCCTGAGGAAGAAACGAGAATTTATGATACGGTTCAAGAAA
AGATTGTTGAGAAAAATCAAGAAACGGAGGTTGAGGAGCAGGCCGCAGGTATGCCTGAAAAAGAGAAAACACCGGAGCCGGTGCAGGAGGCTCATGTTGAAGTGGTAATG
CCTGAACCACCAAAGCGCCGCCGCATCAAACGGAAGGCGGGTCGCGTGAGGGTGATTCGGAACACTCCATCGCCTCCGACGTCGGACTTTGAGGAAGAAAAGGGGAAGCT
GAAAATAAGGCAAAGGAAGAAGAGGCAAGGAAGGCAGAAGAAGCGATTTTGCGCGAACAAAGAGAAGACAAGGGCAAAGGAATTACCGAAGCATCGGGTGAGATTGAGGA
ACCAAGGTGATTTGCCAAGGTTCTTAGAGTCTGAAATAGCGAACCTTGGGTGGAGGCAGTTTTGTGCGAAGCCTGAACCTGTCAATACCAACATTGTTCGAGAATTCTAC
GTTAATCTTGACGTTAAGGATGACTTTGAAGTTATAGTGCAAGGAGTGCTTGTACAATGGAGCCCAGAAGCCATTAATAGTTTGTTTGATCTTCAGGATTTTCCGCATGC
AGTTTTTAATGAGATGATGGTTGCGCCATCGAGCGACCAATTAAGTGCGGCGGTCCGAGAGAGTGAAGCCAACACTTGGATGGGTTTCATTAGGCTACGCTTACTGCCGA
CAACACACGACTCCACTGTATCTCGGGACAGAGTATTGCTTGCCTTTGCCATCCTTCGCTCAATGAGTATAGATGTTGGAAAAATAATTTCTACTGAGATTGCTGACTGT
TGGCGCAAAAAGGTGGGGAAGCTGTTTTTTCCAAACACGATTACGATGTTATGCAGCAGGGCAGGAGTGCCCACGGTTCCAGAGGATATGATTATGCTTGATAAGGGAAT
CATTGACACACCTAATCTGGTGCGGCTTCAGCGTACGCAAGATGCTCGCCAGGGTGGGCTTGTGTATGGAGTTCATCAGATCCTAGAGCAACTGACACTGTTGGCCAGTA
GGTTAGAGTTTGCTGAAAGGCAAGCTCAGACCTATTGGACTTATGCTAAAAGGAGAGATGATGCGCTCAGGAGGGCCTTGCAAACCAATTTCTCAAAACCATATCAGGCC
TTCCCAGTGTTTCCCGATGACTTATTTAATCTGTGGATACCACCCCCACCTGTTGAACGAGAAGAGGATGTTATTGAGGAGCAGGGTCAGGAAGACTGA
mRNA sequenceShow/hide mRNA sequence
ATAGGTGAGATTAAGTCTACTCCTGTAAAGCTCCAATTGGCTGATCAATCTGTGGTGACACCAGTTGGTATTGTAGAAAATGTATTAATCAGAGTAGGTAGATTTTTCCT
CCCTATTGACTTGTATGTTATGGACATGATAGAAAATCCTTCGATGCCTGTCATATTAGGACGACCATTCCTCGCAACTGGGCGAGTGATTATAGATATCGAGCGTAGGG
AGCTCACTATTAGAGTCAAGAAAGAAAAAGAAATCTTTAAAGCAGTTGAAGACTCTAAAGATGATGTGCTTTTCATGGGATATAGGAAAGGTCTTGTTTTCTTTCTCTCC
TGTGCTTTCAAGCTTTCAAGATTCCAAGTTCCAAGCAAGAAGTCATCATCATTCCAAGCCAGTATTGTTCACTCCTATCATTCTCTTATTGTTCTCATGCTTGTTTATGC
CTTATTTTCTTTATACATTGAGGACAATGCATATTTTAAGTTTGGGGGTGGATTAGCATGGCTAGTAGAGTCTGATCTTGTTGAGTTAGATCTCGCTTGTGCAATTTCCT
CGAGGACTAGCAAAAGCTTAAGTTTGGGGGAAATGGTTGATTGGAGCTATATTGCCATATTAGAGTTAATCGGGTGCTCGGGGCGTGAAAAGATGCAAAGGAATGAAAAG
AGTAAAAGAGGAAAAAAGTCAAATTTCAGTCAACAGCAGGCTAGCGTCGAGACGCTAGCTCTTGAGCGTCTCGACGCTCACATTACATATCAGATTAGGCGCGTAAAGCT
TACAGCGTCGGGACGCTTCCTATTACAGCGTCGGGATGCTTCCTATTACGCTTCCTATTACCGTTTTTCCCTTATTCAGAACGCGCGAAAAGTTCACATCAAATCTCGTT
TCCATATCTTCAGGCCAAGAGGAAATAGGTCGAGTCTACAGACCAAAGAACTAGCTAAGACACCCGAGGCTATACGGTACCGTGTGCACACAGGTTATATCCCGTTGTTG
ACGTTGAGTGTACTCCGTGACAACGATGTTGTCGTAGAGATCGAGCTCCCGGTGCCTGATACACTGCCAACGTCTGCTGAAAGTTCCAAAACAAGCTCCGGTTATTATCC
GAACCAAGTTATGTCCGGGCGGCACCCCGTGTATGTTGGATGGTCTCAGGGTTTCCGCTGTGTTGTGTTTCATATAGAATATTTTATATCTAAGATTAGTATGTTTATCA
AGTGGAACGTTTCAGGAAGGTCAACGGTTTTGCCACACAATCCCGATGGCCAACAAAGTTCTCGTAGCCAATTTGGAGATTATGCTGCTGGGCGACTGGAGGGAGCAAAT
TCTGTGTTGCAGCAAAACTGGGAACAAAACTGCCACATCACAGCTCGTGTGAGTTTGGTGCATGAGCGATCCACCTGGGGTACGGTCCAGCGAACCCCAGCAGAATCGTT
GGTGCAGCAAAACCCGCTGTTTGAGCAAAATGAGCAGCAAAATAATCAGGCGGAAAATCCTATCTTGATTGCGAACGATAGGACCAGAGCCATTCGAGCGTATGTTGTCC
CAATGTTTAATGAGTTGAATCCAGGGATTGCACGTCCCCAAATCCAAGCGGCGAATTTTGAAATGAAACCGGTAATGTTTCAGATGTTGCAAACCGTGGGACAATTTCAT
GTGATTAGTCATCAGCAGCCACCAGCTACGGAGCTTGCAGCAGTGGTGAACCAAGTCACAGAAGAAGCATGTTGGCGCAACCACCCCAACTTCTCATGGGGAGACAAGGA
AGTAATGTGCAAGCACAACAAAAGATGCCGAATTCAAAGTAATCAAGCTTCGATGAGAGCCCTGGAATTGCAAGTGGGTCAGCTAGCTAATGAGCTGAAGGCAAGGCCTC
AAGGGAAACTTCCTCGGATACTGAACACCCTCGAAGGGAAGGGTGCTGGAGGCAGCAATAAAAATTCTCGAGCATCTGGTTCTGTTCCAATGTGGAACCACCTTATGTGC
CGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAACGCCTAAGAATCAGGATGGATATTTTAACTAAAAAGAAGAGGCTAGGTGAGTTTGAAATTGTATC
TCTTACTGAGGAGTGTAGTGCTATTCTTAAAATGGGCTACCACCCAAGGCTAAGGATCCAGGGTCATTTACTATACTCTTGGTATGGTGAAGCTAGGCCAACCACAGTCA
CACTCCAACTAGCTGATAGGTCTATCACATATCCAGAGGGTAAAATTGAGGATGTCTTAGTAAAGTCGGCCATTTTGGCTACTGGTAGGGCGTTAATAGATGTTCAAAAA
GGGGAGTTAACAATGAGAGTCTATAATGAGGAAGTGAAATTTAATGTGTTTAAAGCCATGAAGTATCCAGACGAAATGGAGGATTGCTCTTTCATTAGGATTCTGAAGAG
CACAGTTATTGAGACAACAATGCAGGATTCGTCTAATTTAGATCAGAGGAAAGCTCCTCCCATTAAGCCATCCCTGATTGAGGCACCTACTTTAGATTTGAAGTCCTTGT
CGGATCATCTAAAGTATGTGTATCTTGGGGAAAGTGAGACGTTGCCCATTATTGTTGCATCAAATTTAATGCCAGAGCATGAAGAGGCCTTAATAAAATTGCTGCAGCAA
TACCGCAAGGCTATAGGTTTGACATTGTTGGATGATGGGATCATTTATCCAATTGCCAATAGCAATTGGGTAAGCCCTGTCCAATGTGTTCCTAAGAAGGGAGGTGTCAC
TGTGGTGAGCAATAAAAACAATGAGTTGATCCCAACCAGGACAGTAACTGGCTGGAGGGTTTGCATGGATTACAGGAGGCTTAATAAAGCTACCCGTAAGAACCATTTCC
CTCTACTATTTATCGACCAGATGTTGGATCAATTGGCTAGTCAGGCCTACTACTGTTTCTTGGATGGTTATTCTGGGTATAATCAGATTACTATTGCTCCTGAGGATCAG
GAGAAAACCACTTTCACCTGTCCTTATGGGACATTCACTTTTAGGAGAATGCCTTTCGGCCTCTACAATGCTCCAGCAACATTTCAGCAGACACCCATTCTTTGCGCACC
TAATTGGAATTTACCATTCGAGGTAATGTGTGATGCGAGTGATGCTGCGGTTTTAAATGAAGCACAAGTCAACTATACAACTACTGAAAAGGAGTTGTTAGCTGTGGTGT
TCGCTTTTGAGAAGTTCCGGCCATATTTGGTTGGATCCAAAGTCACGGTTTTTACGGATCATGCAGCAATAAGGTATCTAATGTCTAAGAAAGATGCAAAGCCTAGACTA
ATTCGTTGGATTTTACTGTTGCAGGAATTCGACTTGGAGATAAAGGACAAGAAGGGATCAGAAAACATCATTGCAGATCATTTATCTCGTCTTGATCCGTCATCATCTTT
GCTGGAGCAATCTGCCATTTCCGATTCTTTTCCAGATGAACAGCTCTTTGCTGTTGAGGTAAAGGTAGTTAGGGATGTCCCTTGGTATGCTGATATTGCCAACTTTTTGG
TAAAGGGAGTCACTCCTATTGACATGGATTGGAGGCAGAAGAAAAAGTTTAAGCACGATGCGAAATTTTTCTATTGGGATGAGCCATTTATGTATAAGCAATGCTCTGAC
GGTATTATTCGTAGGTGTGTTTCAGGTGCTGAAGCAAAGGAAATCCTGGAGCAATGTCACTCTTCGCCGTATGGAGGTCATTTCAGCGATCAGAGGATAGCTATGAGGAT
TTTGCACTGCGGATTCTTCTGGCCAACTTATACCCCTTTAAAAGAAAATTCTTCCTTGGAAAGCATAATTCAAAATTATATAGAGGAGTCTAGAGCGAGGAGTCTAAAAA
TTGATGAAATCATAAGAGGACTTCAAACTCCACAGGAAGTTAGTGAGGTGGGATCCACTTCCATAGGTTGTCTAGTGAGTCATGTGGTCGATGATTCTTTGGCACCCATT
CATGAGCTTAGTTCTTTCAATCTTGATGAGGATGAGCAGTTAGGGGTGAGTTGCATAGATAGTAGAGAAGAATTGGAGAGTTGTAGCACCTTTCAAGAACATGTTTGTGA
GGAAGAAAAAGAAAATGAGCTTGCATTGACGGTTCAGGAGGTAGAGTTTGAGGTTGAAAAGCCTTCGTCTGTTTTATCTTCTCCATCCTTTATAGATGTTGATTGTTCTT
TTTTCTATTCTCACGAGTCAGTTTTAGAGTCAGGGTCTCTTTATTTTGATATTTCTCCTTTTATGCTTCTTTTATTCAAAAATACCAAAAATAATTCTGTTTCAGGGGCC
CATGAGAAACTAACTGCACGAGCACGTCGAGCTAGAGACGTTAAACAAGCGCTTATGGGAGGCAACCCAAGAGTTCCCGTGTTCTTGCCTTTGCCAAGCATTCATGAAAA
GAGGGCTGAGGAGCAAGAAGCCATTCGAGAAGAAACAGTGGATGACGTGGATAAAGAGAGAGCTAAAAATCCTGAGGAAGAAACGAGAATTTATGATACGGTTCAAGAAA
AGATTGTTGAGAAAAATCAAGAAACGGAGGTTGAGGAGCAGGCCGCAGGTATGCCTGAAAAAGAGAAAACACCGGAGCCGGTGCAGGAGGCTCATGTTGAAGTGGTAATG
CCTGAACCACCAAAGCGCCGCCGCATCAAACGGAAGGCGGGTCGCGTGAGGGTGATTCGGAACACTCCATCGCCTCCGACGTCGGACTTTGAGGAAGAAAAGGGGAAGCT
GAAAATAAGGCAAAGGAAGAAGAGGCAAGGAAGGCAGAAGAAGCGATTTTGCGCGAACAAAGAGAAGACAAGGGCAAAGGAATTACCGAAGCATCGGGTGAGATTGAGGA
ACCAAGGTGATTTGCCAAGGTTCTTAGAGTCTGAAATAGCGAACCTTGGGTGGAGGCAGTTTTGTGCGAAGCCTGAACCTGTCAATACCAACATTGTTCGAGAATTCTAC
GTTAATCTTGACGTTAAGGATGACTTTGAAGTTATAGTGCAAGGAGTGCTTGTACAATGGAGCCCAGAAGCCATTAATAGTTTGTTTGATCTTCAGGATTTTCCGCATGC
AGTTTTTAATGAGATGATGGTTGCGCCATCGAGCGACCAATTAAGTGCGGCGGTCCGAGAGAGTGAAGCCAACACTTGGATGGGTTTCATTAGGCTACGCTTACTGCCGA
CAACACACGACTCCACTGTATCTCGGGACAGAGTATTGCTTGCCTTTGCCATCCTTCGCTCAATGAGTATAGATGTTGGAAAAATAATTTCTACTGAGATTGCTGACTGT
TGGCGCAAAAAGGTGGGGAAGCTGTTTTTTCCAAACACGATTACGATGTTATGCAGCAGGGCAGGAGTGCCCACGGTTCCAGAGGATATGATTATGCTTGATAAGGGAAT
CATTGACACACCTAATCTGGTGCGGCTTCAGCGTACGCAAGATGCTCGCCAGGGTGGGCTTGTGTATGGAGTTCATCAGATCCTAGAGCAACTGACACTGTTGGCCAGTA
GGTTAGAGTTTGCTGAAAGGCAAGCTCAGACCTATTGGACTTATGCTAAAAGGAGAGATGATGCGCTCAGGAGGGCCTTGCAAACCAATTTCTCAAAACCATATCAGGCC
TTCCCAGTGTTTCCCGATGACTTATTTAATCTGTGGATACCACCCCCACCTGTTGAACGAGAAGAGGATGTTATTGAGGAGCAGGGTCAGGAAGACTGA
Protein sequenceShow/hide protein sequence
IGEIKSTPVKLQLADQSVVTPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRPFLATGRVIIDIERRELTIRVKKEKEIFKAVEDSKDDVLFMGYRKGLVFFLS
CAFKLSRFQVPSKKSSSFQASIVHSYHSLIVLMLVYALFSLYIEDNAYFKFGGGLAWLVESDLVELDLACAISSRTSKSLSLGEMVDWSYIAILELIGCSGREKMQRNEK
SKRGKKSNFSQQQASVETLALERLDAHITYQIRRVKLTASGRFLLQRRDASYYASYYRFSLIQNARKVHIKSRFHIFRPRGNRSSLQTKELAKTPEAIRYRVHTGYIPLL
TLSVLRDNDVVVEIELPVPDTLPTSAESSKTSSGYYPNQVMSGRHPVYVGWSQGFRCVVFHIEYFISKISMFIKWNVSGRSTVLPHNPDGQQSSRSQFGDYAAGRLEGAN
SVLQQNWEQNCHITARVSLVHERSTWGTVQRTPAESLVQQNPLFEQNEQQNNQAENPILIANDRTRAIRAYVVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFH
VISHQQPPATELAAVVNQVTEEACWRNHPNFSWGDKEVMCKHNKRCRIQSNQASMRALELQVGQLANELKARPQGKLPRILNTLEGKGAGGSNKNSRASGSVPMWNHLMC
RPHLMYHLYLFHKGKRLRIRMDILTKKKRLGEFEIVSLTEECSAILKMGYHPRLRIQGHLLYSWYGEARPTTVTLQLADRSITYPEGKIEDVLVKSAILATGRALIDVQK
GELTMRVYNEEVKFNVFKAMKYPDEMEDCSFIRILKSTVIETTMQDSSNLDQRKAPPIKPSLIEAPTLDLKSLSDHLKYVYLGESETLPIIVASNLMPEHEEALIKLLQQ
YRKAIGLTLLDDGIIYPIANSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKNHFPLLFIDQMLDQLASQAYYCFLDGYSGYNQITIAPEDQ
EKTTFTCPYGTFTFRRMPFGLYNAPATFQQTPILCAPNWNLPFEVMCDASDAAVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHAAIRYLMSKKDAKPRL
IRWILLLQEFDLEIKDKKGSENIIADHLSRLDPSSSLLEQSAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEPFMYKQCSD
GIIRRCVSGAEAKEILEQCHSSPYGGHFSDQRIAMRILHCGFFWPTYTPLKENSSLESIIQNYIEESRARSLKIDEIIRGLQTPQEVSEVGSTSIGCLVSHVVDDSLAPI
HELSSFNLDEDEQLGVSCIDSREELESCSTFQEHVCEEEKENELALTVQEVEFEVEKPSSVLSSPSFIDVDCSFFYSHESVLESGSLYFDISPFMLLLFKNTKNNSVSGA
HEKLTARARRARDVKQALMGGNPRVPVFLPLPSIHEKRAEEQEAIREETVDDVDKERAKNPEEETRIYDTVQEKIVEKNQETEVEEQAAGMPEKEKTPEPVQEAHVEVVM
PEPPKRRRIKRKAGRVRVIRNTPSPPTSDFEEEKGKLKIRQRKKRQGRQKKRFCANKEKTRAKELPKHRVRLRNQGDLPRFLESEIANLGWRQFCAKPEPVNTNIVREFY
VNLDVKDDFEVIVQGVLVQWSPEAINSLFDLQDFPHAVFNEMMVAPSSDQLSAAVRESEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISTEIADC
WRKKVGKLFFPNTITMLCSRAGVPTVPEDMIMLDKGIIDTPNLVRLQRTQDARQGGLVYGVHQILEQLTLLASRLEFAERQAQTYWTYAKRRDDALRRALQTNFSKPYQA
FPVFPDDLFNLWIPPPPVEREEDVIEEQGQED