; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001077 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001077
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr4:23666733..23674606
RNA-Seq ExpressionLag0001077
SyntenyLag0001077
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIN21854.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]5.4e-25740.24Show/hide
Query:  ILVANDRARAIRAYAFPMFDELNPRITRPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRKALRLTLFPYSLRDGAKAW
        ++  N     +R  A P   E    +  P++  A  +++  M QM+Q   QF GLS E+P+ H+ +FL + D+   +GV + ALRL LF +SL   A  W
Subjt:  ILVANDRARAIRAYAFPMFDELNPRITRPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRKALRLTLFPYSLRDGAKAW

Query:  LNSFAPRSIRTWNELAENFLSKYFSPNRNAKLRSEIVGFRQLEDETFSEAWER--------------------------------------------ADL
          S    SI TW +L E F+SK+FSP + A LR+EI+ FRQ   ET  EAW R                                               
Subjt:  LNSFAPRSIRTWNELAENFLSKYFSPNRNAKLRSEIVGFRQLEDETFSEAWER--------------------------------------------ADL

Query:  AMIANAFKNVTVISHQQ------PPAVEPAALVNQVT-----------EEACVYCGEDHNYEFCPNNPTSMFFV-----------------GWRNHPNFS
        A   N   N+ V  +++      PP       V+QVT                 CGE H  + CP++  S+ FV                 GWR HPNFS
Subjt:  AMIANAFKNVTVISHQQ------PPAVEPAALVNQVT-----------EEACVYCGEDHNYEFCPNNPTSMFFV-----------------GWRNHPNFS

Query:  WG---GQGSNVQTQQKVNQSGFAKAQVMPQQTKQALPQQ---NSGSSLEAMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPLGNFPQIL--NT
        W    GQGS  + Q    Q G    +  P+Q  +A  Q     +G  L+ ++KE        + S +   + +E  + +++     +P   FPQ L    
Subjt:  WG---GQGSNVQTQQKVNQSGFAKAQVMPQQTKQALPQQ---NSGSSLEAMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPLGNFPQIL--NT

Query:  LEGKGAGGSNKDAGASGSVPDVE-----PPYVP------PPPYRLGEFETVSLTEECSAILKNGLPSKAKDPESFTIPVSIGGKEFGRALCDLGASINVM
        L+ +             ++P  E     P YV           RLG++ETV+LTEECSAI++N LP K KDP          G+   RALCDLGASIN+M
Subjt:  LEGKGAGGSNKDAGASGSVPDVE-----PPYVP------PPPYRLGEFETVSLTEECSAILKNGLPSKAKDPESFTIPVSIGGKEFGRALCDLGASINVM

Query:  PLSVYRKLGIGEARPTTVTLQLADRSITYPDGKIEDVLVKEDKFIFPVDFIILDYEADKDVPIILGRPFLATGGALIDVQKGELTMRVCNEEVKFNVFKV
        P S+YR LG+ EA+PT++TLQLADRS+TYP G IED+LVK DKFIFP DF++LD E D +VPIILGRPFLATG  LIDVQKGELTMRV ++++ FNVFK 
Subjt:  PLSVYRKLGIGEARPTTVTLQLADRSITYPDGKIEDVLVKEDKFIFPVDFIILDYEADKDVPIILGRPFLATGGALIDVQKGELTMRVCNEEVKFNVFKV

Query:  MKYPDDMEDCSFI---------------------RTLESTIVETTTQD-------SADKHLEDHG------EAPP--IKPSLIEAPTLDLKPLPDHLKYV
        MK+P++ ++C  +                     R L   + E   +D        A K+ +  G       AP   +KPS+ E PTL+LKPLP HL Y 
Subjt:  MKYPDDMEDCSFI---------------------RTLESTIVETTTQD-------SADKHLEDHG------EAPP--IKPSLIEAPTLDLKPLPDHLKYV

Query:  YLGE---------------------------------------GISPSFCMHKITLEEGSFRSVEQQRRLNPAMKEVVKKEVIKWLDADIIYPIADRNWV
        YLGE                                       GISPSFCMHKI LE+    SVE QRRLNP MKEVVKKE+IKWLDA IIYPI+DR+W+
Subjt:  YLGE---------------------------------------GISPSFCMHKITLEEGSFRSVEQQRRLNPAMKEVVKKEVIKWLDADIIYPIADRNWV

Query:  SPVQCVPKKGRVTVVSNKDNELIPTRIITGWRVCMDYKRLNKATRKDHFPLLFIDQMLDRLAGQAYYCFLYGYFGYNQITIAPEDQEKTTFTCPYGTF--
        SPVQCVPKKG +TVV N  NE IPT+ +TGWRVCMDY++LNKATRKDHFPL FIDQMLDRLAG+ +YCFL GY GYNQI IAPEDQEKTTFTCPYGTF  
Subjt:  SPVQCVPKKGRVTVVSNKDNELIPTRIITGWRVCMDYKRLNKATRKDHFPLLFIDQMLDRLAGQAYYCFLYGYFGYNQITIAPEDQEKTTFTCPYGTF--

Query:  ------------VLGGCLLA---------FAMLQQHFS------SSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKDGIVLD-------------------
                        C++A           +    FS        CL+NL  VLKRCEDT+LVLNWEKCHFMV++GIVL                    
Subjt:  ------------VLGGCLLA---------FAMLQQHFS------SSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKDGIVLD-------------------

Query:  --------------------------------------------------CRKNFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMFGQKQGKFI
                                                          C   F+ LK  LISAPI+  P+W+ PFE+MCDASD A+GA+ GQ++ K  
Subjt:  --------------------------------------------------CRKNFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMFGQKQGKFI

Query:  HPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHA-----------------------EFDLEIKDKKGSENVIADHLSRRDPSS
          IYYASK LN+AQ+NYTTTEKELLAVVFAF+KFR YLVG+KV V+TDHA                       EFDLEI+D+KG EN IADHLSR +  +
Subjt:  HPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHA-----------------------EFDLEIKDKKGSENVIADHLSRRDPSS

Query:  CLLNNSAISDSFPDEQLFTVEVKVVRDAPW--------------------------------------------------CVSGDEAKEILEQCHSSLYG
             + I+D+FPDEQL  +   V  D PW                                                  CV   E  +I EQCH+S YG
Subjt:  CLLNNSAISDSFPDEQLFTVEVKVVRDAPW--------------------------------------------------CVSGDEAKEILEQCHSSLYG

Query:  GHFSSQRTAMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPLTYILEIELFDL---------------------ITCPSEW-RPPACHMSDA
        GHF   RTA +IL  GFFWP LFKD H F   CD CQR GN+  R EMPL  ILE+ELFD+                     +   S+W    A   +D+
Subjt:  GHFSSQRTAMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPLTYILEIELFDL---------------------ITCPSEW-RPPACHMSDA

Query:  KIIARFLQSHIFARFGTPRALVSD
        K++  F++ +IF RFGTPRA++SD
Subjt:  KIIARFLQSHIFARFGTPRALVSD

XP_038972405.1 uncharacterized protein LOC120104748 [Phoenix dactylifera]3.9e-26338.47Show/hide
Query:  NDRARAIRAYAFPMFDELNPRITRPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRKALRLTLFPYSLRDGAKAWLNSF
        N   R +  YA P  +   P I RP + A NFE+KP + QM+Q   QF G  SEDPH HL +FL + D+  + GV   A+RL LFP+SL+D AKAWLNS 
Subjt:  NDRARAIRAYAFPMFDELNPRITRPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRKALRLTLFPYSLRDGAKAWLNSF

Query:  APRSIRTWNELAENFLSKYFSPNRNAKLRSEIVGFRQLEDETFSEAWER-ADLA-----------MIANAFKNVTVISHQQPPAVEPAA---LVNQVTEE
        AP S  TWN L++ FLSKYF P + AKLR++I  F Q + E+  EAWER  DL            +I   F N   ++H     ++ AA   L+++ TEE
Subjt:  APRSIRTWNELAENFLSKYFSPNRNAKLRSEIVGFRQLEDETFSEAWER-ADLA-----------MIANAFKNVTVISHQQPPAVEPAA---LVNQVTEE

Query:  A--------------------------------------------------------------CVYCGEDHNYEFC--------------PNNP-TSMFF
        A                                                              C  CG  H    C               NNP ++ + 
Subjt:  A--------------------------------------------------------------CVYCGEDHNYEFC--------------PNNP-TSMFF

Query:  VGWRNHPNFSWGGQGSNVQTQQKVNQSGFAKAQVMPQQTKQALPQQNSGSSLEAMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPLGNFP---
         GWRNHPNFSW  QG+   + + ++  GF   Q  P Q +     + +   L     E   R  A +    +S R +E+Q+GQLAN + +R  GN P   
Subjt:  VGWRNHPNFSWGGQGSNVQTQQKVNQSGFAKAQVMPQQTKQALPQQNSGSSLEAMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPLGNFP---

Query:  ----------------QILNTLEGKGAGGSNKD---------------AGASGSVPDVEPPYVPPPPY--------------------------------
                        + L  + G+   G   D               A     +P VE PYVPP P+                                
Subjt:  ----------------QILNTLEGKGAGGSNKD---------------AGASGSVPDVEPPYVPPPPY--------------------------------

Query:  -------------------RLGEFETVSLTEECSAILKNGLPSKAKDPESFTIPVSIGGKEFGRALCDLGASINVMPLSVYRKLGIGEARPTTVTLQLAD
                           +L +FET++LTEECSAI++N LP K +DP SF+IP +IG  +F RALCDLGAS+++MPLSV RKLG+ E +PTT++LQLAD
Subjt:  -------------------RLGEFETVSLTEECSAILKNGLPSKAKDPESFTIPVSIGGKEFGRALCDLGASINVMPLSVYRKLGIGEARPTTVTLQLAD

Query:  RSITYPDGKIEDVLVKEDKFIFPVDFIILDYEADKDVPIILGRPFLATGGALIDVQKGELTMRVCNEEVKFNVFKVMKYPD-----------DMEDCSFI
        RS+ YP G +E+VL+K  KFI PVDFI+L+ E D ++PIILGRPFLAT GA+IDV+ G LT++V  EEV+FN+F+  KYP            D     F 
Subjt:  RSITYPDGKIEDVLVKEDKFIFPVDFIILDYEADKDVPIILGRPFLATGGALIDVQKGELTMRVCNEEVKFNVFKVMKYPD-----------DMEDCSFI

Query:  RT------LESTIVE--TTTQDSAD-------------------KHLEDHGEA-PPIKPSLIEAPTLDLKPLPDHLKYVYLGE-----------------
        +       LE+ +V   T+  D+ +                    + ED G+  PP  PS ++AP L+LKPLP HL Y +LGE                 
Subjt:  RT------LESTIVE--TTTQDSAD-------------------KHLEDHGEA-PPIKPSLIEAPTLDLKPLPDHLKYVYLGE-----------------

Query:  ----------------------GISPSFCMHKITLEEGSFRSVEQQRRLNPAMKEVVKKEVIKWLDADIIYPIADRNWVSPVQCVPKKGRVTVVSNKDNE
                              GISPS CMH+I +E+     VE QRRLNP MKEVV+ EV+KWLDA IIYPI+D  W+SPVQ VPKKG +TVV N++NE
Subjt:  ----------------------GISPSFCMHKITLEEGSFRSVEQQRRLNPAMKEVVKKEVIKWLDADIIYPIADRNWVSPVQCVPKKGRVTVVSNKDNE

Query:  LIPTRIITGWRVCMDYKRLNKATRKDHFPLLFIDQMLDRLAGQAYYCFLYGYFGYNQITIAPEDQEKTTFTCPYGTF--------------VLGGCLLA-
        LIPTR +TGWRVC+DY++LN  TRKDHFPL F+DQ+L+RLAG AYYCFL GY GYNQI+I+PEDQEKTTFTCPYGTF                  C++A 
Subjt:  LIPTRIITGWRVCMDYKRLNKATRKDHFPLLFIDQMLDRLAGQAYYCFLYGYFGYNQITIAPEDQEKTTFTCPYGTF--------------VLGGCLLA-

Query:  ---------------FAMLQQHFSSSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKDGIVL----------------------------------------
                       F++    F  SCLDNL +VL+RCE+T+LVLNWEKCHFMV++GIVL                                        
Subjt:  ---------------FAMLQQHFSSSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKDGIVL----------------------------------------

Query:  -----------------------------DCRKNFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMFGQKQGKFIHPIYYASKVLNEAQVNYTTT
                                     DC   F  LK  L+SAPI+ AP+W+LPFE+MCDASD A+GA+ GQ++ + +H IYYAS+VLN AQ+NY TT
Subjt:  -----------------------------DCRKNFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMFGQKQGKFIHPIYYASKVLNEAQVNYTTT

Query:  EKELLAVVFAFEKFRPYLVGSKVTVFTDHA-----------------------EFDLEIKDKKGSENVIADHLSRRDPSSCLLNNSAISDSFPDEQLFTV
        EKELLAVVFAF+KFR YLVGSKV V+TDH+                       EFDLEI+DK+G ENV+ADHLSR +  S   +   I++SFPDEQL  V
Subjt:  EKELLAVVFAFEKFRPYLVGSKVTVFTDHA-----------------------EFDLEIKDKKGSENVIADHLSRRDPSSCLLNNSAISDSFPDEQLFTV

Query:  EV--------------------------KVVRDAP---W----------------CVSGDEAKEILEQCHSSLYGGHFSSQRTAMRILHCGFFWPTLFKD
         V                          K + D     W                CV  DE ++IL+ CHS   GGHFS+ +T  ++   GF+WPT+++D
Subjt:  EV--------------------------KVVRDAP---W----------------CVSGDEAKEILEQCHSSLYGGHFSSQRTAMRILHCGFFWPTLFKD

Query:  AHWFYKQCDVCQRRGNLGPRDEMPLTYILEIELFDL---------------------ITCPSEW-RPPACHMSDAKIIARFLQSHIFARFGTPRALVSDE
           +   CD CQR GN+  ++EMPLT ILE+ELFDL                     +   S+W    A   +D++++ RF++ +IF+RFG PRA++SDE
Subjt:  AHWFYKQCDVCQRRGNLGPRDEMPLTYILEIELFDL---------------------ITCPSEW-RPPACHMSDAKIIARFLQSHIFARFGTPRALVSDE

XP_038973683.1 uncharacterized protein LOC120105384 [Phoenix dactylifera]3.0e-26338.47Show/hide
Query:  NDRARAIRAYAFPMFDELNPRITRPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRKALRLTLFPYSLRDGAKAWLNSF
        N   R +  YA P  +   P I RP + A NFE+KP + QM+Q   QF G  SEDPH HL +FL + D+  + GV   A+RL LFP+SL+D AKAWLNS 
Subjt:  NDRARAIRAYAFPMFDELNPRITRPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRKALRLTLFPYSLRDGAKAWLNSF

Query:  APRSIRTWNELAENFLSKYFSPNRNAKLRSEIVGFRQLEDETFSEAWER-ADLA-----------MIANAFKNVTVISHQQPPAVEPAA---LVNQVTEE
        AP S  TWN L++ FLSKYF P + AKLR++I  F Q + E+  EAWER  DL            +I   F N   ++H     ++ AA   L+++ TEE
Subjt:  APRSIRTWNELAENFLSKYFSPNRNAKLRSEIVGFRQLEDETFSEAWER-ADLA-----------MIANAFKNVTVISHQQPPAVEPAA---LVNQVTEE

Query:  A--------------------------------------------------------------CVYCGEDHNYEFC--------------PNNP-TSMFF
        A                                                              C  CG  H    C               NNP ++ + 
Subjt:  A--------------------------------------------------------------CVYCGEDHNYEFC--------------PNNP-TSMFF

Query:  VGWRNHPNFSWGGQGSNVQTQQKVNQSGFAKAQVMPQQTKQALPQQNSGSSLEAMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPLGNFP---
         GWRNHPNFSW  QG+   + + ++  GF   Q  P Q +     + +   L     E   R  A +    +S R +E+Q+GQLAN + +R  GN P   
Subjt:  VGWRNHPNFSWGGQGSNVQTQQKVNQSGFAKAQVMPQQTKQALPQQNSGSSLEAMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPLGNFP---

Query:  ----------------QILNTLEGKGAGGSNKD---------------AGASGSVPDVEPPYVPPPPY--------------------------------
                        + L  + G+   G   D               A     +P VE PYVPP P+                                
Subjt:  ----------------QILNTLEGKGAGGSNKD---------------AGASGSVPDVEPPYVPPPPY--------------------------------

Query:  -------------------RLGEFETVSLTEECSAILKNGLPSKAKDPESFTIPVSIGGKEFGRALCDLGASINVMPLSVYRKLGIGEARPTTVTLQLAD
                           +L +FET++LTEECSAI++N LP K +DP SF+IP +IG  +F RALCDLGAS+++MPLSV RKLG+ E +PTT++LQLAD
Subjt:  -------------------RLGEFETVSLTEECSAILKNGLPSKAKDPESFTIPVSIGGKEFGRALCDLGASINVMPLSVYRKLGIGEARPTTVTLQLAD

Query:  RSITYPDGKIEDVLVKEDKFIFPVDFIILDYEADKDVPIILGRPFLATGGALIDVQKGELTMRVCNEEVKFNVFKVMKYPD-----------DMEDCSFI
        RS+ YP G +E+VL+K  KFI PVDFI+L+ E D ++PIILGRPFLAT GA+IDV+ G LT++V  EEV+FN+F+  KYP            D     F 
Subjt:  RSITYPDGKIEDVLVKEDKFIFPVDFIILDYEADKDVPIILGRPFLATGGALIDVQKGELTMRVCNEEVKFNVFKVMKYPD-----------DMEDCSFI

Query:  RT------LESTIVE--TTTQDSAD-------------------KHLEDHGEA-PPIKPSLIEAPTLDLKPLPDHLKYVYLGE-----------------
        +       LE+ +V   T+  D+ +                    + ED G+  PP  PS ++AP L+LKPLP HL Y +LGE                 
Subjt:  RT------LESTIVE--TTTQDSAD-------------------KHLEDHGEA-PPIKPSLIEAPTLDLKPLPDHLKYVYLGE-----------------

Query:  ----------------------GISPSFCMHKITLEEGSFRSVEQQRRLNPAMKEVVKKEVIKWLDADIIYPIADRNWVSPVQCVPKKGRVTVVSNKDNE
                              GISPS CMH+I +E+     VE QRRLNP MKEVV+ EV+KWLDA IIYPI+D  W+SPVQ VPKKG +TVV N++NE
Subjt:  ----------------------GISPSFCMHKITLEEGSFRSVEQQRRLNPAMKEVVKKEVIKWLDADIIYPIADRNWVSPVQCVPKKGRVTVVSNKDNE

Query:  LIPTRIITGWRVCMDYKRLNKATRKDHFPLLFIDQMLDRLAGQAYYCFLYGYFGYNQITIAPEDQEKTTFTCPYGTF--------------VLGGCLLA-
        LIPTR +TGWRVC+DY++LN  TRKDHFPL F+DQ+L+RLAG AYYCFL GY GYNQI+I+PEDQEKTTFTCPYGTF                  C++A 
Subjt:  LIPTRIITGWRVCMDYKRLNKATRKDHFPLLFIDQMLDRLAGQAYYCFLYGYFGYNQITIAPEDQEKTTFTCPYGTF--------------VLGGCLLA-

Query:  ---------------FAMLQQHFSSSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKDGIVL----------------------------------------
                       F++    F  SCLDNL +VL+RCE+T+LVLNWEKCHFMV++GIVL                                        
Subjt:  ---------------FAMLQQHFSSSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKDGIVL----------------------------------------

Query:  -----------------------------DCRKNFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMFGQKQGKFIHPIYYASKVLNEAQVNYTTT
                                     DC   F  LK  L+SAPI+ AP+W+LPFE+MCDASD A+GA+ GQ++ + +H IYYAS+VLN AQ+NY TT
Subjt:  -----------------------------DCRKNFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMFGQKQGKFIHPIYYASKVLNEAQVNYTTT

Query:  EKELLAVVFAFEKFRPYLVGSKVTVFTDHA-----------------------EFDLEIKDKKGSENVIADHLSRRDPSSCLLNNSAISDSFPDEQLFTV
        EKELLAVVFAF+KFR YLVGSKV V+TDH+                       EFDLEI+DK+G ENV+ADHLSR +  S   +   I++SFPDEQL  V
Subjt:  EKELLAVVFAFEKFRPYLVGSKVTVFTDHA-----------------------EFDLEIKDKKGSENVIADHLSRRDPSSCLLNNSAISDSFPDEQLFTV

Query:  EV--------------------------KVVRDAP---W----------------CVSGDEAKEILEQCHSSLYGGHFSSQRTAMRILHCGFFWPTLFKD
         V                          K + D     W                CV  DE ++IL+ CHS   GGHFS+ +T  ++   GF+WPT+++D
Subjt:  EV--------------------------KVVRDAP---W----------------CVSGDEAKEILEQCHSSLYGGHFSSQRTAMRILHCGFFWPTLFKD

Query:  AHWFYKQCDVCQRRGNLGPRDEMPLTYILEIELFDL---------------------ITCPSEW-RPPACHMSDAKIIARFLQSHIFARFGTPRALVSDE
           +   CD CQR GN+  ++EMPLT ILE+ELFDL                     +   S+W    A   +D++++ RF++ +IF+RFG PRA++SDE
Subjt:  AHWFYKQCDVCQRRGNLGPRDEMPLTYILEIELFDL---------------------ITCPSEW-RPPACHMSDAKIIARFLQSHIFARFGTPRALVSDE

XP_038976300.1 uncharacterized protein LOC120107204 [Phoenix dactylifera]3.9e-26338.4Show/hide
Query:  NDRARAIRAYAFPMFDELNPRITRPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRKALRLTLFPYSLRDGAKAWLNSF
        N   R +  YA P  +   P I RP + A NFE+KP + QM+Q   QF G  SEDPH HL +FL + D+  + GV   A+RL LFP+SL+D AKAWLNS 
Subjt:  NDRARAIRAYAFPMFDELNPRITRPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRKALRLTLFPYSLRDGAKAWLNSF

Query:  APRSIRTWNELAENFLSKYFSPNRNAKLRSEIVGFRQLEDETFSEAWER-ADLA-----------MIANAFKNVTVISHQQPPAVEPAA---LVNQVTEE
        AP S  TWN L++ FLSKYF P + AKLR++I  F Q + E+  EAWER  DL            +I   F N   ++H     ++ AA   L+++ TEE
Subjt:  APRSIRTWNELAENFLSKYFSPNRNAKLRSEIVGFRQLEDETFSEAWER-ADLA-----------MIANAFKNVTVISHQQPPAVEPAA---LVNQVTEE

Query:  A--------------------------------------------------------------CVYCGEDHNYEFC--------------PNNP-TSMFF
        A                                                              C  CG  H    C               NNP ++ + 
Subjt:  A--------------------------------------------------------------CVYCGEDHNYEFC--------------PNNP-TSMFF

Query:  VGWRNHPNFSWGGQGSNVQTQQKVNQSGFAKAQVMPQQTKQALPQQNSGSSLEAMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPLGNFP---
         GWRNHPNFSW  QG+   + + ++  GF   Q  P Q +     + +   L     E   R  A +    +S R +E+Q+GQLAN + +R  GN P   
Subjt:  VGWRNHPNFSWGGQGSNVQTQQKVNQSGFAKAQVMPQQTKQALPQQNSGSSLEAMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPLGNFP---

Query:  ----------------QILNTLEGKGAGGSNKD---------------AGASGSVPDVEPPYVPPPPY--------------------------------
                        + L  + G+   G   D               A     +P VE PYVPP P+                                
Subjt:  ----------------QILNTLEGKGAGGSNKD---------------AGASGSVPDVEPPYVPPPPY--------------------------------

Query:  -------------------RLGEFETVSLTEECSAILKNGLPSKAKDPESFTIPVSIGGKEFGRALCDLGASINVMPLSVYRKLGIGEARPTTVTLQLAD
                           +L +FET++LTEECSAI++N LP K +DP SF+IP +IG  +F RALCDLGAS+++MPLSV RKLG+ E +PTT++LQLAD
Subjt:  -------------------RLGEFETVSLTEECSAILKNGLPSKAKDPESFTIPVSIGGKEFGRALCDLGASINVMPLSVYRKLGIGEARPTTVTLQLAD

Query:  RSITYPDGKIEDVLVKEDKFIFPVDFIILDYEADKDVPIILGRPFLATGGALIDVQKGELTMRVCNEEVKFNVFKVMKYPD-----------DMEDCSFI
        RS+ YP G +E+VL+K  KFI PVDFI+L+ E D ++PIILGRPFLAT GA+IDV+ G LT++V  EEV+FN+F+  KYP            D     F 
Subjt:  RSITYPDGKIEDVLVKEDKFIFPVDFIILDYEADKDVPIILGRPFLATGGALIDVQKGELTMRVCNEEVKFNVFKVMKYPD-----------DMEDCSFI

Query:  RT------LESTIVE--TTTQDSAD-------------------KHLEDHGEA-PPIKPSLIEAPTLDLKPLPDHLKYVYLGE-----------------
        +       LE+ +V   T+  D+ +                    + ED G+  PP  PS ++AP L+LKPLP HL Y +LGE                 
Subjt:  RT------LESTIVE--TTTQDSAD-------------------KHLEDHGEA-PPIKPSLIEAPTLDLKPLPDHLKYVYLGE-----------------

Query:  ----------------------GISPSFCMHKITLEEGSFRSVEQQRRLNPAMKEVVKKEVIKWLDADIIYPIADRNWVSPVQCVPKKGRVTVVSNKDNE
                              GISPS CMH+I +E+     VE QRRLNP MKEVV+ EV+KWLDA IIYPI+D  W+SPVQ VPKKG +TVV N++NE
Subjt:  ----------------------GISPSFCMHKITLEEGSFRSVEQQRRLNPAMKEVVKKEVIKWLDADIIYPIADRNWVSPVQCVPKKGRVTVVSNKDNE

Query:  LIPTRIITGWRVCMDYKRLNKATRKDHFPLLFIDQMLDRLAGQAYYCFLYGYFGYNQITIAPEDQEKTTFTCPYGTF--------------VLGGCLLA-
        LIPTR +TGWRVC+DY++LN  TRKDHFPL F+DQ+L+RLAG AYYCFL GY GYNQI+I+PEDQEKTTFTCPYGTF                  C++A 
Subjt:  LIPTRIITGWRVCMDYKRLNKATRKDHFPLLFIDQMLDRLAGQAYYCFLYGYFGYNQITIAPEDQEKTTFTCPYGTF--------------VLGGCLLA-

Query:  ---------------FAMLQQHFSSSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKDGIVL----------------------------------------
                       F++    F  SCLDNL +VL+RCE+T+LVLNWEKCHFMV++GI+L                                        
Subjt:  ---------------FAMLQQHFSSSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKDGIVL----------------------------------------

Query:  -----------------------------DCRKNFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMFGQKQGKFIHPIYYASKVLNEAQVNYTTT
                                     DC   F  LK  L+SAPI+ AP+W+LPFE+MCDASD A+GA+ GQ++ + +H IYYAS+VLN AQ+NY TT
Subjt:  -----------------------------DCRKNFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMFGQKQGKFIHPIYYASKVLNEAQVNYTTT

Query:  EKELLAVVFAFEKFRPYLVGSKVTVFTDHA-----------------------EFDLEIKDKKGSENVIADHLSRRDPSSCLLNNSAISDSFPDEQLFTV
        EKELLAVVFAF+KFR YLVGSKV V+TDH+                       EFDLEI+DK+G ENV+ADHLSR +  S   +   I++SFPDEQL  V
Subjt:  EKELLAVVFAFEKFRPYLVGSKVTVFTDHA-----------------------EFDLEIKDKKGSENVIADHLSRRDPSSCLLNNSAISDSFPDEQLFTV

Query:  EV--------------------------KVVRDAP---W----------------CVSGDEAKEILEQCHSSLYGGHFSSQRTAMRILHCGFFWPTLFKD
         V                          K + D     W                CV  DE ++IL+ CHS   GGHFS+ +T  ++   GF+WPT+++D
Subjt:  EV--------------------------KVVRDAP---W----------------CVSGDEAKEILEQCHSSLYGGHFSSQRTAMRILHCGFFWPTLFKD

Query:  AHWFYKQCDVCQRRGNLGPRDEMPLTYILEIELFDL---------------------ITCPSEW-RPPACHMSDAKIIARFLQSHIFARFGTPRALVSDE
           +   CD CQR GN+  ++EMPLT ILE+ELFDL                     +   S+W    A   +D++++ RF++ +IF+RFG PRA++SDE
Subjt:  AHWFYKQCDVCQRRGNLGPRDEMPLTYILEIELFDL---------------------ITCPSEW-RPPACHMSDAKIIARFLQSHIFARFGTPRALVSDE

XP_038976409.1 uncharacterized protein LOC113461320 [Phoenix dactylifera]4.3e-26238.53Show/hide
Query:  NDRARAIRAYAFPMFDELNPRITRPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRKALRLTLFPYSLRDGAKAWLNSF
        N   R +  YA P  +   P I RP + A NFE+KP + QM+Q   QF G  SEDPH HL +FL + D+  + GV   A+RL LFP+SL+D AKAWLNS 
Subjt:  NDRARAIRAYAFPMFDELNPRITRPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRKALRLTLFPYSLRDGAKAWLNSF

Query:  APRSIRTWNELAENFLSKYFSPNRNAKLRSEIVGFRQLEDETFSEAWER-ADLA-----------MIANAFKNVTVISHQQPPAVEPAA---LVNQVTEE
        AP S  TWN L++ FLSKYF P + AKLR++I  F Q + E+  EAWER  DL            +I   F N   ++H     ++ AA   L+++ TEE
Subjt:  APRSIRTWNELAENFLSKYFSPNRNAKLRSEIVGFRQLEDETFSEAWER-ADLA-----------MIANAFKNVTVISHQQPPAVEPAA---LVNQVTEE

Query:  A--------------------------------------------------------------CVYCGEDH--------------NYEFCPNNP-TSMFF
        A                                                              C  CG  H              N +   NNP ++ + 
Subjt:  A--------------------------------------------------------------CVYCGEDH--------------NYEFCPNNP-TSMFF

Query:  VGWRNHPNFSWGGQGSNVQTQQKVNQSGFAKAQVMPQQTKQALPQQNSGSSLEAMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPLGNFP---
         GWRNHPNFSW  QG+   + + ++  GF      P ++KQ+   + +   L     E   R  A +    +S R +E+Q+GQLAN + +R  GN P   
Subjt:  VGWRNHPNFSWGGQGSNVQTQQKVNQSGFAKAQVMPQQTKQALPQQNSGSSLEAMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPLGNFP---

Query:  --------QILNTLEGKGAG--------GSNKD---------------AGASGSVPDVEPPYVPPPPY--------------------------------
                + +    GK  G        G   D               A     +P VE PYVPP P+                                
Subjt:  --------QILNTLEGKGAG--------GSNKD---------------AGASGSVPDVEPPYVPPPPY--------------------------------

Query:  -------------------RLGEFETVSLTEECSAILKNGLPSKAKDPESFTIPVSIGGKEFGRALCDLGASINVMPLSVYRKLGIGEARPTTVTLQLAD
                           +L +FET++LTEECSAI++N LP K +DP SF+IP +IG  +F RALCDLGAS+++MPLSV RKLG+ E +PTT++LQLAD
Subjt:  -------------------RLGEFETVSLTEECSAILKNGLPSKAKDPESFTIPVSIGGKEFGRALCDLGASINVMPLSVYRKLGIGEARPTTVTLQLAD

Query:  RSITYPDGKIEDVLVKEDKFIFPVDFIILDYEADKDVPIILGRPFLATGGALIDVQKGELTMRVCNEEVKFNVFKVMKYPD-----------DMEDCSFI
        RS+ YP G +E+VL+K  KFI PVDFI+L+ E D ++PIILGRPFLAT GA+IDV+ G LT++V  EEV+FN+F+  KYP            D     F 
Subjt:  RSITYPDGKIEDVLVKEDKFIFPVDFIILDYEADKDVPIILGRPFLATGGALIDVQKGELTMRVCNEEVKFNVFKVMKYPD-----------DMEDCSFI

Query:  RT------LESTIVE--TTTQDSAD-------------------KHLEDHGEA-PPIKPSLIEAPTLDLKPLPDHLKYVYLGE-----------------
        +       LE+ +V   T+  D+ +                    + ED G+  PP  PS ++AP L+LKPLP HL Y +LGE                 
Subjt:  RT------LESTIVE--TTTQDSAD-------------------KHLEDHGEA-PPIKPSLIEAPTLDLKPLPDHLKYVYLGE-----------------

Query:  ----------------------GISPSFCMHKITLEEGSFRSVEQQRRLNPAMKEVVKKEVIKWLDADIIYPIADRNWVSPVQCVPKKGRVTVVSNKDNE
                              GISPS CMH+I +E+     VE QRRLNP MKEVV+ EV+KWLDA IIYPI+D  W+SPVQ VPKKG +TVV N++NE
Subjt:  ----------------------GISPSFCMHKITLEEGSFRSVEQQRRLNPAMKEVVKKEVIKWLDADIIYPIADRNWVSPVQCVPKKGRVTVVSNKDNE

Query:  LIPTRIITGWRVCMDYKRLNKATRKDHFPLLFIDQMLDRLAGQAYYCFLYGYFGYNQITIAPEDQEKTTFTCPYGTF--------------VLGGCLLA-
        LIPTR +TGWRVC+DY++LN  TRKDHFPL F+DQ+L+RLAG AYYCFL GY GYNQI+I+PEDQEKTTFTCPYGTF                  C++A 
Subjt:  LIPTRIITGWRVCMDYKRLNKATRKDHFPLLFIDQMLDRLAGQAYYCFLYGYFGYNQITIAPEDQEKTTFTCPYGTF--------------VLGGCLLA-

Query:  ---------------FAMLQQHFSSSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKDGIVL----------------------------------------
                       F++    F  SCLDNL +VL+RCE+T+LVLNWEKCHFMV++GIVL                                        
Subjt:  ---------------FAMLQQHFSSSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKDGIVL----------------------------------------

Query:  -----------------------------DCRKNFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMFGQKQGKFIHPIYYASKVLNEAQVNYTTT
                                     DC   F  LK  L+SAPI+ AP+W+LPFE+MCDASD A+GA+ GQ++ + +H IYYAS+VLN AQ+NY TT
Subjt:  -----------------------------DCRKNFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMFGQKQGKFIHPIYYASKVLNEAQVNYTTT

Query:  EKELLAVVFAFEKFRPYLVGSKVTVFTDHA-----------------------EFDLEIKDKKGSENVIADHLSRRDPSSCLLNNSAISDSFPDEQLFTV
        EKELLAVVFAF+KFR YLVGSKV V+TDH+                       EFDLEI+DK+G ENV+ADHLSR +  S   +   I++SFPDEQL  V
Subjt:  EKELLAVVFAFEKFRPYLVGSKVTVFTDHA-----------------------EFDLEIKDKKGSENVIADHLSRRDPSSCLLNNSAISDSFPDEQLFTV

Query:  EV--------------------------KVVRDAP---W----------------CVSGDEAKEILEQCHSSLYGGHFSSQRTAMRILHCGFFWPTLFKD
         V                          K + D     W                CV  DE ++IL+ CHS   GGHFS+ +T  ++   GF+WPT+++D
Subjt:  EV--------------------------KVVRDAP---W----------------CVSGDEAKEILEQCHSSLYGGHFSSQRTAMRILHCGFFWPTLFKD

Query:  AHWFYKQCDVCQRRGNLGPRDEMPLTYILEIELFDL---------------------ITCPSEW-RPPACHMSDAKIIARFLQSHIFARFGTPRALVSDE
           +   CD CQR GN+  ++EMPLT ILE+ELFDL                     +   S+W    A   +D++++ RF++ +IF+RFG PRA++SDE
Subjt:  AHWFYKQCDVCQRRGNLGPRDEMPLTYILEIELFDL---------------------ITCPSEW-RPPACHMSDAKIIARFLQSHIFARFGTPRALVSDE

TrEMBL top hitse value%identityAlignment
A0A2G9FWY3 Reverse transcriptase4.2e-23939.32Show/hide
Query:  MLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRKALRLTLFPYSLRDGAKAWLNSFAPRSIRTW--NELAENFLSKYFS-----PNRNAKLRSEI-
        M+Q   QF GLS E+P+ H+ +FL + D+   +GV + ALRL LF +SL   A  W  S    SI TW  +E      S++       PN +     ++ 
Subjt:  MLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRKALRLTLFPYSLRDGAKAWLNSFAPRSIRTW--NELAENFLSKYFS-----PNRNAKLRSEI-

Query:  -------VGFRQLEDETFSEAWERADLAMIANAFKNVTVISHQQ------PPAVEPAALVNQVT------------------------EEACVYCGEDHN
                G +   D    +++     A   N   N+    +++      PP       V+QVT                           C  CGE H 
Subjt:  -------VGFRQLEDETFSEAWERADLAMIANAFKNVTVISHQQ------PPAVEPAALVNQVT------------------------EEACVYCGEDHN

Query:  YEFCPNNPTSMFFV-----------------GWRNHPNFSWG---GQGSNVQTQQKVNQSGFAKAQVMPQQTKQALPQQNSGSSLEAMMKEFMARTYAAI
         + CP++  S+ FV                 GWR HPNFSW    GQGS  + QQ              QQ +Q  P Q    SLE  + +FMA      
Subjt:  YEFCPNNPTSMFFV-----------------GWRNHPNFSWG---GQGSNVQTQQKVNQSGFAKAQVMPQQTKQALPQQNSGSSLEAMMKEFMARTYAAI

Query:  QSNQASMRALELQVGQLANELKARPLGNFP---------------QILNTLEGKGAGGSNKDAGASG----------------------SVPDVE-----
         S  A+ + ++ Q+GQLAN + +RP G+ P               Q +    G+    + K+   S                       ++P  E     
Subjt:  QSNQASMRALELQVGQLANELKARPLGNFP---------------QILNTLEGKGAGGSNKDAGASG----------------------SVPDVE-----

Query:  PPYVP------PPPYRLGEFETVSLTEECSAILKNGLPSKAKDPESFTIPVSIGGKEFGRALCDLGASINVMPLSVYRKLGIGEARPTTVTLQLADRSIT
        P YV            LG++ETV+LTEECSAI++N LP K KDP SFTIP +IG    GRALCDLG            +  +GEA+PT++TLQLADRS+T
Subjt:  PPYVP------PPPYRLGEFETVSLTEECSAILKNGLPSKAKDPESFTIPVSIGGKEFGRALCDLGASINVMPLSVYRKLGIGEARPTTVTLQLADRSIT

Query:  YPDGKIEDVLVKEDKFIFPVDFIILDYEADKDVPIILGRPFLATGGALIDVQKGELTMRVCNEEVKFNVF-------KVMKYP--------------DDM
        YP G IED+LVK DKFIFP DF++LD E D +VPIILGRPFLATG  LIDVQK        +E    ++F        + + P              ++ 
Subjt:  YPDGKIEDVLVKEDKFIFPVDFIILDYEADKDVPIILGRPFLATGGALIDVQKGELTMRVCNEEVKFNVF-------KVMKYP--------------DDM

Query:  EDCSFIRTLESTIVETTTQDSADKHLEDHGEAPPIKPSLIEAPTLDLKPLPDHLKYVYLGE---------------------------------------
        ED   ++TL+++      +    + LE    +  +KPS+ E PTL+LKPLP+HL Y YLGE                                       
Subjt:  EDCSFIRTLESTIVETTTQDSADKHLEDHGEAPPIKPSLIEAPTLDLKPLPDHLKYVYLGE---------------------------------------

Query:  GISPSFCMHKITLEEGSFRSVEQQRRLNPAMKEVVKKEVIKWLDADIIYPIADRNWVSPVQCVPKKGRVTVVSNKDNELIPTRIITGWRVCMDYKRLNKA
        GISPSFCMHKI LE+    SVE QRRLNP MKEVVKKE+IKWLDA IIYPI+D +WVSPVQCVPKKG +TVV N  NELIPTR +TGWRVCMDY++LNKA
Subjt:  GISPSFCMHKITLEEGSFRSVEQQRRLNPAMKEVVKKEVIKWLDADIIYPIADRNWVSPVQCVPKKGRVTVVSNKDNELIPTRIITGWRVCMDYKRLNKA

Query:  TRKDHFPLLFIDQMLDRLAGQAYYCFLYGYFGYNQITIAPEDQEKTTFTCPYGTF--------------VLGGCLLA---------FAMLQQHFS-----
        TRKDHFPLLFIDQMLDRLAG+ +YCFL GY GYNQI IAPEDQEK TFTCPYGTF                  C++A           +    FS     
Subjt:  TRKDHFPLLFIDQMLDRLAGQAYYCFLYGYFGYNQITIAPEDQEKTTFTCPYGTF--------------VLGGCLLA---------FAMLQQHFS-----

Query:  -SSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKDGIVLD--------------------------------------------------------------
           CL+NL  VLKRCEDT+L+LNWEKCHFMV++GIVL                                                               
Subjt:  -SSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKDGIVLD--------------------------------------------------------------

Query:  -------CRKNFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMFGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKV
               CR  F  LK  LISAPI+  P+W+ PFE+MCDASD AVGA+ GQ++ K    IYYASK LN+AQ+NYTTTEKELLAVVFAF+KFR YLVG+KV
Subjt:  -------CRKNFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMFGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKV

Query:  TVFTDHA-----------------------EFDLEIKDKKGSENVIADHLSRRDPSSCLLNNSAISDSFPDEQLFTVEVKVVRDAPW-------------
         V+TDHA                       EFDLEI+D+KG+EN IADHLSR +  +     + I+D+FPDEQL  +   V  D PW             
Subjt:  TVFTDHA-----------------------EFDLEIKDKKGSENVIADHLSRRDPSSCLLNNSAISDSFPDEQLFTVEVKVVRDAPW-------------

Query:  -------------------------------------CVSGDEAKEILEQCHSSLYGGHFSSQRTAMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLG
                                             CV   E  +ILEQCH+S YGGHF   RTA +IL  GFFWP LFKDAH F   CD CQR GN+ 
Subjt:  -------------------------------------CVSGDEAKEILEQCHSSLYGGHFSSQRTAMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLG

Query:  PRDEMPLTYILEIELFDL---------------------ITCPSEW-RPPACHMSDAKIIARFLQSHIFARFGTPRALVSD
         R EMPL  ILE+ELFD+                     +   S+W    A   +D+K++  F++ +IF RFGTPRA++SD
Subjt:  PRDEMPLTYILEIELFDL---------------------ITCPSEW-RPPACHMSDAKIIARFLQSHIFARFGTPRALVSD

A0A2G9G6G2 Reverse transcriptase6.1e-23839.59Show/hide
Query:  MLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRKALRLTLFPYSLRDGAKAWLNSFAPRSIRTW--NELAENFLSKYFS-----PNRNAKLRSEI-
        M+Q   QF GLS E+P+ H+ +FL + D+   +GV + ALRL LF +SL   A  W  S    SI TW  +E      S++       PN +     ++ 
Subjt:  MLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRKALRLTLFPYSLRDGAKAWLNSFAPRSIRTW--NELAENFLSKYFS-----PNRNAKLRSEI-

Query:  -------VGFRQLEDETFSEAWERADLA--------MIANAF----------KNVTVISHQQPPAVEPAA----------LVNQV--TEEACVYCGEDHN
                G +   D    +++     A        ++AN +          K V VI   Q  A+               VNQV  T   C  CGE H 
Subjt:  -------VGFRQLEDETFSEAWERADLA--------MIANAF----------KNVTVISHQQPPAVEPAA----------LVNQV--TEEACVYCGEDHN

Query:  YEFCPNNPTSMFFV-----------------GWRNHPNFSWG---GQGSNVQTQQKVNQSGFAKAQVMPQQTKQALPQQNSGSSLEAMMKEFMARTYAAI
         + CP++  S+ FV                 GWR HPNFSW    GQGS  + QQ+  Q      QV P       P Q    SLE  + +FMA      
Subjt:  YEFCPNNPTSMFFV-----------------GWRNHPNFSWG---GQGSNVQTQQKVNQSGFAKAQVMPQQTKQALPQQNSGSSLEAMMKEFMARTYAAI

Query:  QSNQASMRALELQVGQLANELKARPLGNFP----------------QILNTLEGKGAGGSNKDAGASGSVPDVEPPY------VP-------PPPY----
         S  A+ + +E Q+GQLAN + +RP G+ P                + L  +  +      K+  +     +VE P       +P        P Y    
Subjt:  QSNQASMRALELQVGQLANELKARPLGNFP----------------QILNTLEGKGAGGSNKDAGASGSVPDVEPPY------VP-------PPPY----

Query:  --------RLGEFETVSLTEECSAILKNGLPSKAKDPESFTIPVSIGGKEFGRALCDLGASINVMPLSVYRKLGIGEARPTTVTLQLADRSITYPDGKIE
                RLG++E V+LTEECS I++N LP K K+P SFTIP +IG    GRALCDLGASIN+MP S+YR LG+GEA+PT++TLQLADRS+TYP G I+
Subjt:  --------RLGEFETVSLTEECSAILKNGLPSKAKDPESFTIPVSIGGKEFGRALCDLGASINVMPLSVYRKLGIGEARPTTVTLQLADRSITYPDGKIE

Query:  DVLVKEDKFIFPVDFIILDYEADKDVPIILGRPFLATGGALIDVQKGELTMRVCNEEVKFNVF-------KVMKYP--------------DDMEDCSFIR
        D+LVK DKFIFP DF++LD E D +VPIILGRPFLATG  LIDVQK        +E    ++F        + + P              ++ +DC  ++
Subjt:  DVLVKEDKFIFPVDFIILDYEADKDVPIILGRPFLATGGALIDVQKGELTMRVCNEEVKFNVF-------KVMKYP--------------DDMEDCSFIR

Query:  TLESTIVETTTQDSADKHLEDHGEAPPIKPSLIEAPTLDLKPLPDHLKYVYLGE---------------------------------------GISPSFC
        TL+++      +    + LE    +  +KPS+ E PTL+LKPLP HL Y YLGE                                       GISPSFC
Subjt:  TLESTIVETTTQDSADKHLEDHGEAPPIKPSLIEAPTLDLKPLPDHLKYVYLGE---------------------------------------GISPSFC

Query:  MHKITLEEGSFRSVEQQRRLNPAMKEVVKKEVIKWLDADIIYPIADRNWVSPVQCVPKKGRVTVVSNKDNELIPTRIITGWRVCMDYKRLNKATRKDHFP
        MHKI LE+    S+E QRRLNP MKEVVKKE+IKWLDA IIYPI+D +WVSPVQCVPKKG +TVV N  NELIPTR +TGWRVCMDY++LNKATRKDHFP
Subjt:  MHKITLEEGSFRSVEQQRRLNPAMKEVVKKEVIKWLDADIIYPIADRNWVSPVQCVPKKGRVTVVSNKDNELIPTRIITGWRVCMDYKRLNKATRKDHFP

Query:  LLFIDQMLDRLAGQAYYCFLYGYFGYNQITIAPEDQEKTTFTCPYGTF--------------VLGGCLLA---------FAMLQQHFS------SSCLDN
        L FIDQMLDRLAG+ +YCFL GY GYNQI IAPEDQEKTTFTCPYGTF                  C++A           +   +FS        CL+N
Subjt:  LLFIDQMLDRLAGQAYYCFLYGYFGYNQITIAPEDQEKTTFTCPYGTF--------------VLGGCLLA---------FAMLQQHFS------SSCLDN

Query:  LGKVLKRCEDTHLVLNWEKCHFMVKDGIVLD---------------------------------------------------------------------
        L  VLKRCEDT+LVLNWEKCHFMV++GIVL                                                                      
Subjt:  LGKVLKRCEDTHLVLNWEKCHFMVKDGIVLD---------------------------------------------------------------------

Query:  CRKNFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMFGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHA
        C   F  LK  LISAPI+  P+            D AVGA+ GQ++ K    IYYASK LN+AQ+NYTTTEKELLAVVFAF+KFR YLV +KV V+TDHA
Subjt:  CRKNFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMFGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHA

Query:  -----------------------EFDLEIKDKKGSENVIADHLSRRDPSSCLLNNSAISDSFPDEQLFTVEVKVVRDAPW--------------------
                               EFDLEI+D+KG+EN IADHLSR +  +     + I+D+FPDEQL  +   V  + PW                    
Subjt:  -----------------------EFDLEIKDKKGSENVIADHLSRRDPSSCLLNNSAISDSFPDEQLFTVEVKVVRDAPW--------------------

Query:  ------------------------------CVSGDEAKEILEQCHSSLYGGHFSSQRTAMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPL
                                      CV   E  +ILEQCH+S YGGHF   RTA +IL  GFFWP LFKDAH F   CD CQR  N+  R EMPL
Subjt:  ------------------------------CVSGDEAKEILEQCHSSLYGGHFSSQRTAMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPL

Query:  TYILEIELFDL---------------------ITCPSEW-RPPACHMSDAKIIARFLQSHIFARFGTPRALVSD
          ILE+ELFD+                     +   S+W    A   +D+K++  F++ +IF RFGTPRA++SD
Subjt:  TYILEIELFDL---------------------ITCPSEW-RPPACHMSDAKIIARFLQSHIFARFGTPRALVSD

A0A2G9HWF8 Reverse transcriptase2.6e-25740.24Show/hide
Query:  ILVANDRARAIRAYAFPMFDELNPRITRPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRKALRLTLFPYSLRDGAKAW
        ++  N     +R  A P   E    +  P++  A  +++  M QM+Q   QF GLS E+P+ H+ +FL + D+   +GV + ALRL LF +SL   A  W
Subjt:  ILVANDRARAIRAYAFPMFDELNPRITRPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRKALRLTLFPYSLRDGAKAW

Query:  LNSFAPRSIRTWNELAENFLSKYFSPNRNAKLRSEIVGFRQLEDETFSEAWER--------------------------------------------ADL
          S    SI TW +L E F+SK+FSP + A LR+EI+ FRQ   ET  EAW R                                               
Subjt:  LNSFAPRSIRTWNELAENFLSKYFSPNRNAKLRSEIVGFRQLEDETFSEAWER--------------------------------------------ADL

Query:  AMIANAFKNVTVISHQQ------PPAVEPAALVNQVT-----------EEACVYCGEDHNYEFCPNNPTSMFFV-----------------GWRNHPNFS
        A   N   N+ V  +++      PP       V+QVT                 CGE H  + CP++  S+ FV                 GWR HPNFS
Subjt:  AMIANAFKNVTVISHQQ------PPAVEPAALVNQVT-----------EEACVYCGEDHNYEFCPNNPTSMFFV-----------------GWRNHPNFS

Query:  WG---GQGSNVQTQQKVNQSGFAKAQVMPQQTKQALPQQ---NSGSSLEAMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPLGNFPQIL--NT
        W    GQGS  + Q    Q G    +  P+Q  +A  Q     +G  L+ ++KE        + S +   + +E  + +++     +P   FPQ L    
Subjt:  WG---GQGSNVQTQQKVNQSGFAKAQVMPQQTKQALPQQ---NSGSSLEAMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPLGNFPQIL--NT

Query:  LEGKGAGGSNKDAGASGSVPDVE-----PPYVP------PPPYRLGEFETVSLTEECSAILKNGLPSKAKDPESFTIPVSIGGKEFGRALCDLGASINVM
        L+ +             ++P  E     P YV           RLG++ETV+LTEECSAI++N LP K KDP          G+   RALCDLGASIN+M
Subjt:  LEGKGAGGSNKDAGASGSVPDVE-----PPYVP------PPPYRLGEFETVSLTEECSAILKNGLPSKAKDPESFTIPVSIGGKEFGRALCDLGASINVM

Query:  PLSVYRKLGIGEARPTTVTLQLADRSITYPDGKIEDVLVKEDKFIFPVDFIILDYEADKDVPIILGRPFLATGGALIDVQKGELTMRVCNEEVKFNVFKV
        P S+YR LG+ EA+PT++TLQLADRS+TYP G IED+LVK DKFIFP DF++LD E D +VPIILGRPFLATG  LIDVQKGELTMRV ++++ FNVFK 
Subjt:  PLSVYRKLGIGEARPTTVTLQLADRSITYPDGKIEDVLVKEDKFIFPVDFIILDYEADKDVPIILGRPFLATGGALIDVQKGELTMRVCNEEVKFNVFKV

Query:  MKYPDDMEDCSFI---------------------RTLESTIVETTTQD-------SADKHLEDHG------EAPP--IKPSLIEAPTLDLKPLPDHLKYV
        MK+P++ ++C  +                     R L   + E   +D        A K+ +  G       AP   +KPS+ E PTL+LKPLP HL Y 
Subjt:  MKYPDDMEDCSFI---------------------RTLESTIVETTTQD-------SADKHLEDHG------EAPP--IKPSLIEAPTLDLKPLPDHLKYV

Query:  YLGE---------------------------------------GISPSFCMHKITLEEGSFRSVEQQRRLNPAMKEVVKKEVIKWLDADIIYPIADRNWV
        YLGE                                       GISPSFCMHKI LE+    SVE QRRLNP MKEVVKKE+IKWLDA IIYPI+DR+W+
Subjt:  YLGE---------------------------------------GISPSFCMHKITLEEGSFRSVEQQRRLNPAMKEVVKKEVIKWLDADIIYPIADRNWV

Query:  SPVQCVPKKGRVTVVSNKDNELIPTRIITGWRVCMDYKRLNKATRKDHFPLLFIDQMLDRLAGQAYYCFLYGYFGYNQITIAPEDQEKTTFTCPYGTF--
        SPVQCVPKKG +TVV N  NE IPT+ +TGWRVCMDY++LNKATRKDHFPL FIDQMLDRLAG+ +YCFL GY GYNQI IAPEDQEKTTFTCPYGTF  
Subjt:  SPVQCVPKKGRVTVVSNKDNELIPTRIITGWRVCMDYKRLNKATRKDHFPLLFIDQMLDRLAGQAYYCFLYGYFGYNQITIAPEDQEKTTFTCPYGTF--

Query:  ------------VLGGCLLA---------FAMLQQHFS------SSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKDGIVLD-------------------
                        C++A           +    FS        CL+NL  VLKRCEDT+LVLNWEKCHFMV++GIVL                    
Subjt:  ------------VLGGCLLA---------FAMLQQHFS------SSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKDGIVLD-------------------

Query:  --------------------------------------------------CRKNFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMFGQKQGKFI
                                                          C   F+ LK  LISAPI+  P+W+ PFE+MCDASD A+GA+ GQ++ K  
Subjt:  --------------------------------------------------CRKNFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMFGQKQGKFI

Query:  HPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHA-----------------------EFDLEIKDKKGSENVIADHLSRRDPSS
          IYYASK LN+AQ+NYTTTEKELLAVVFAF+KFR YLVG+KV V+TDHA                       EFDLEI+D+KG EN IADHLSR +  +
Subjt:  HPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHA-----------------------EFDLEIKDKKGSENVIADHLSRRDPSS

Query:  CLLNNSAISDSFPDEQLFTVEVKVVRDAPW--------------------------------------------------CVSGDEAKEILEQCHSSLYG
             + I+D+FPDEQL  +   V  D PW                                                  CV   E  +I EQCH+S YG
Subjt:  CLLNNSAISDSFPDEQLFTVEVKVVRDAPW--------------------------------------------------CVSGDEAKEILEQCHSSLYG

Query:  GHFSSQRTAMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPLTYILEIELFDL---------------------ITCPSEW-RPPACHMSDA
        GHF   RTA +IL  GFFWP LFKD H F   CD CQR GN+  R EMPL  ILE+ELFD+                     +   S+W    A   +D+
Subjt:  GHFSSQRTAMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPLTYILEIELFDL---------------------ITCPSEW-RPPACHMSDA

Query:  KIIARFLQSHIFARFGTPRALVSD
        K++  F++ +IF RFGTPRA++SD
Subjt:  KIIARFLQSHIFARFGTPRALVSD

A0A2G9HYA0 Reverse transcriptase2.7e-23842.82Show/hide
Query:  VNQV--TEEACVYCGEDHNYEFCPNNPTSMFFV-----------------GWRNHPNFSWG---GQGSNVQTQQKVNQSGFAKAQVMPQQTKQALPQQNS
        VNQV  T   C  CGE H  + CP++  S+ FV                 GWR HPNFSW    GQGS  + QQ              QQ +Q  P Q  
Subjt:  VNQV--TEEACVYCGEDHNYEFCPNNPTSMFFV-----------------GWRNHPNFSWG---GQGSNVQTQQKVNQSGFAKAQVMPQQTKQALPQQNS

Query:  GSSLEAMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPLGNFP-----------------------QILNTLEGKGAGGSNKDAGASGSVPDVE
          SLE  + +FMA       S  A+ + +E Q+GQLAN + +RP G+ P                       + L  +  +      K+  +     +VE
Subjt:  GSSLEAMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPLGNFP-----------------------QILNTLEGKGAGGSNKDAGASGSVPDVE

Query:  PP--------YVPP-------------------------------------PPY------------RLGEFETVSLTEECSAILKNGLPSKAKDPESFTI
         P          PP                                     P Y            RLG++ETV+LTEECSAI++N LP K KDP SFTI
Subjt:  PP--------YVPP-------------------------------------PPY------------RLGEFETVSLTEECSAILKNGLPSKAKDPESFTI

Query:  PVSIGGKEFGRALCDLGASINVMPLSVYRKLGIGEARPTTVTLQLADRSITYPDGKIEDVLVKEDKFIFPVDFIILDYEADKDVPIILGRPFLATGGALI
        P +IG    GRALCDLGASIN+MP S+YR LG+GEA+PT++TLQLADRS+TYP G IED+LVK DKFIFP DF++LD E D +VPIILGRPFLATG  LI
Subjt:  PVSIGGKEFGRALCDLGASINVMPLSVYRKLGIGEARPTTVTLQLADRSITYPDGKIEDVLVKEDKFIFPVDFIILDYEADKDVPIILGRPFLATGGALI

Query:  DVQKGELTMRVCNEEVKFNVFKVMKYPDDMEDCSFIR-----------------TLESTIVETTTQDSAD-------------------KHLEDHGEAPP
        DVQKGELTMRV ++++ FNVFK MK+P++ ++C  +                   LE  +++   +++ +                   + LE    +  
Subjt:  DVQKGELTMRVCNEEVKFNVFKVMKYPDDMEDCSFIR-----------------TLESTIVETTTQDSAD-------------------KHLEDHGEAPP

Query:  IKPSLIEAPTLDLKPLPDHLKYVYLGE---------------------------------------GISPSFCMHKITLEEGSFRSVEQQRRLNPAMKEV
        +KPS+ + PTL+LKPLP HL Y YLGE                                       GISPSFCMHKI LE+    SVE QRRLNP MKEV
Subjt:  IKPSLIEAPTLDLKPLPDHLKYVYLGE---------------------------------------GISPSFCMHKITLEEGSFRSVEQQRRLNPAMKEV

Query:  VKKEVIKWLDADIIYPIADRNWVSPVQCVPKKGRVTVVSNKDNELIPTRIITGWRVCMDYKRLNKATRKDHFPLLFIDQMLDRLAGQAYYCFLYGYFGYN
        VKKE+IKWLDA IIYPI+D +WVSPVQCVPKKG +TVV N  NELIPTR +TGWRVCMDY++LNKATRKDHFPL FIDQMLDRLAG+ +YCFL GY GYN
Subjt:  VKKEVIKWLDADIIYPIADRNWVSPVQCVPKKGRVTVVSNKDNELIPTRIITGWRVCMDYKRLNKATRKDHFPLLFIDQMLDRLAGQAYYCFLYGYFGYN

Query:  QITIAPEDQEKTTFTCPYGTF--------------VLGGCLLA---------FAMLQQHFS------SSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKDG
        QI IAPEDQEKTTFTCPYGTF                  C++A           +    FS        CL+NL  VLKRCEDT+L+LNWEKCHFMV++G
Subjt:  QITIAPEDQEKTTFTCPYGTF--------------VLGGCLLA---------FAMLQQHFS------SSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKDG

Query:  IVLD---------------------------------------------------------------------CRKNFETLKAALISAPILCAPNWNLPF
        IVL                                                                      C   F  LK  LISAPI+  P+W+ PF
Subjt:  IVLD---------------------------------------------------------------------CRKNFETLKAALISAPILCAPNWNLPF

Query:  EVMCDASDVAVGAMFGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHA-----------------------EFDL
        E+MCDASD AVGA+ GQ++ K    IYYASK LN+AQ+NYTTTEKELLAVVFAF+KFR YLVG+KV V+TDHA                       EFDL
Subjt:  EVMCDASDVAVGAMFGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHA-----------------------EFDL

Query:  EIKDKKGSENVIADHLSRRDPSSCLLNNSAISDSFPDEQLFTVEVKVVRDAPW-----------------------------------------------
        EI+D+KG+EN IADHLSR +  +    ++ I+D+FPDEQL  +   V  + PW                                               
Subjt:  EIKDKKGSENVIADHLSRRDPSSCLLNNSAISDSFPDEQLFTVEVKVVRDAPW-----------------------------------------------

Query:  ---CVSGDEAKEILEQCHSSLYGGHFSSQRTAMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPLTYILEIELFDL
           CV   E  +ILEQCH+S YGGHF   RTA +IL  GFFWP LFKDAH F   CD CQR GN+  R EMPL  ILE+ELFD+
Subjt:  ---CVSGDEAKEILEQCHSSLYGGHFSSQRTAMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPLTYILEIELFDL

A0A2G9HYD8 Reverse transcriptase1.3e-23540.99Show/hide
Query:  VPRKALRLTLFPYSLRDGAKAWL-----NSFAPRSIRTWNELAENFLSKYFSPNRNAKLRSEIVGFRQLEDETFSEAWERADLAMIANAFKNV-TVISHQ
        +PR  +++  F + L +G K  L     NSF   +    + L  N ++ ++   + ++  +     R +E +  +    + D  M   + KN  +  S Q
Subjt:  VPRKALRLTLFPYSLRDGAKAWL-----NSFAPRSIRTWNELAENFLSKYFSPNRNAKLRSEIVGFRQLEDETFSEAWERADLAMIANAFKNV-TVISHQ

Query:  QPPAVEPAALVNQVTEEACVYCGEDHNYEFCPNNP-TSMFFVGWRNHPNFSWG---GQGSNVQTQQKVNQSGFAKAQVMPQQTKQALPQQNSGSSLEAMM
         P +VE    V+   +                NNP ++ +  GWR HPNFSW    GQGS  + QQ   Q         P Q     P Q    SLE  +
Subjt:  QPPAVEPAALVNQVTEEACVYCGEDHNYEFCPNNP-TSMFFVGWRNHPNFSWG---GQGSNVQTQQKVNQSGFAKAQVMPQQTKQALPQQNSGSSLEAMM

Query:  KEFMARTYAAIQSNQASMRALELQVGQLANELKARPLGNFP-----------------------QILNTLEGKGAGGSNKDAGASGSVPDVEPP------
         +FMA       S  A+ + +E Q+GQLAN + +RP G+ P                       + L  +  K      K+  +     +VE P      
Subjt:  KEFMARTYAAIQSNQASMRALELQVGQLANELKARPLGNFP-----------------------QILNTLEGKGAGGSNKDAGASGSVPDVEPP------

Query:  --YVPP-------------------------------------PPY------------RLGEFETVSLTEECSAILKNGLPSKAKDPESFTIPVSIGGKE
            PP                                     P Y            RLG++ET +LTEEC+AI++N LP K KDP SFTIP +IG   
Subjt:  --YVPP-------------------------------------PPY------------RLGEFETVSLTEECSAILKNGLPSKAKDPESFTIPVSIGGKE

Query:  FGRALCDLGASINVMPLSVYRKLGIGEARPTTVTLQLADRSITYPDGKIEDVLVKEDKFIFPVDFIILDYEADKDVPIILGRPFLATGGALIDVQKGELT
         GRALCDLGASIN+MP S+YR LG+GEA+PT++TLQLADRS+TYP G IED+LVK DKFIFP DF++LD E D +VPIILGRPFLATG  LIDVQKGELT
Subjt:  FGRALCDLGASINVMPLSVYRKLGIGEARPTTVTLQLADRSITYPDGKIEDVLVKEDKFIFPVDFIILDYEADKDVPIILGRPFLATGGALIDVQKGELT

Query:  MRVCNEEVKFNVFKVMKYPDDMEDC---------------------SFIRTLESTIVETTTQD-------SADKHLEDHG--------EAPPIKPSLIEA
        MRV ++++ FNVFK MK+P++ ++C                     S  R L   I E   +D       +A K  +  G         +  +KPS+ + 
Subjt:  MRVCNEEVKFNVFKVMKYPDDMEDC---------------------SFIRTLESTIVETTTQD-------SADKHLEDHG--------EAPPIKPSLIEA

Query:  PTLDLKPLPDHLKYVYLGE---------------------------------------GISPSFCMHKITLEEGSFRSVEQQRRLNPAMKEVVKKEVIKW
        PTL+LKPLP+HL YVYLGE                                       GISPSFCMHKI LE+    SVE QRRLN  MKEVVKKE+IKW
Subjt:  PTLDLKPLPDHLKYVYLGE---------------------------------------GISPSFCMHKITLEEGSFRSVEQQRRLNPAMKEVVKKEVIKW

Query:  LDADIIYPIADRNWVSPVQCVPKKGRVTVVSNKDNELIPTRIITGWRVCMDYKRLNKATRKDHFPLLFIDQMLDRLAGQAYYCFLYGYFGYNQITIAPED
        LDA IIYPI+D +WVSPVQCVPKKG +TVV N  NELIPTR +TGWRVCMDY++LNKATRKDHFPL FIDQMLDRLAG+ +YCFL GY GYNQI IAPED
Subjt:  LDADIIYPIADRNWVSPVQCVPKKGRVTVVSNKDNELIPTRIITGWRVCMDYKRLNKATRKDHFPLLFIDQMLDRLAGQAYYCFLYGYFGYNQITIAPED

Query:  QEKTTFTCPYGTF--------------VLGGCLLA---------FAMLQQHFS------SSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKDGIVLD----
        QEKTTFTCPYGTF                  C++A           +    FS        CL+NL  VLKRCEDT+LVLNWEKCHFMV++GIVL     
Subjt:  QEKTTFTCPYGTF--------------VLGGCLLA---------FAMLQQHFS------SSCLDNLGKVLKRCEDTHLVLNWEKCHFMVKDGIVLD----

Query:  -----------------------------------------------------------------CRKNFETLKAALISAPILCAPNWNLPFEVMCDASD
                                                                         C   F+ LK  LISAPI+  P+W+ PFE+MCDASD
Subjt:  -----------------------------------------------------------------CRKNFETLKAALISAPILCAPNWNLPFEVMCDASD

Query:  VAVGAMFGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHA-----------------------EFDLEIKDKKGS
         A+GA+ GQ++ K    IYYASK LN+AQ+NYTTTEKELLAVVFAF+KFR YLVG+KV V+TDHA                       EFDLEI+D+KG+
Subjt:  VAVGAMFGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDHA-----------------------EFDLEIKDKKGS

Query:  ENVIADHLSRRDPSSCLLNNSAISDSFPDEQLFTVEVKVVRDAPW--------------------------------------------------CVSGD
        EN IADHLSR +  +    ++ I+D+FPDEQL  +   V  + PW                                                  CV   
Subjt:  ENVIADHLSRRDPSSCLLNNSAISDSFPDEQLFTVEVKVVRDAPW--------------------------------------------------CVSGD

Query:  EAKEILEQCHSSLYGGHFSSQRTAMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPLTYILEIELFDL
        E  +ILEQCH+S YGGHF   RTA +IL  GFFWP LFKDAH F   CD CQR GN+  R EMPL  ILE+ELFD+
Subjt:  EAKEILEQCHSSLYGGHFSSQRTAMRILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPLTYILEIELFDL

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.66.7e-2425.94Show/hide
Query:  AMKEVVKKEVIKWLDADIIYPIADRNWVSPVQCVPKKGRVTVVSNKDNELIPTRIITGWRVCMDYKRLNKATRKDHFPLLFIDQMLDRLAGQAYYCFLYG
        A ++ V+ ++   L+  II   ++  + SP+  VPKK      S K            +R+ +DY++LN+ T  D  P+  +D++L +L    Y+  +  
Subjt:  AMKEVVKKEVIKWLDADIIYPIADRNWVSPVQCVPKKGRVTVVSNKDNELIPTRIITGWRVCMDYKRLNKATRKDHFPLLFIDQMLDRLAGQAYYCFLYG

Query:  YFGYNQITIAPEDQEKTTFTCPYGTF--------------VLGGCL--LAFAMLQQH----------FSSSC---LDNLGKVLKRCEDTHLVLNWEKC--
          G++QI + PE   KT F+  +G +                  C+  +   +L +H          FS+S    L +LG V ++    +L L  +KC  
Subjt:  YFGYNQITIAPEDQEKTTFTCPYGTF--------------VLGGCL--LAFAMLQQH----------FSSSC---LDNLGKVLKRCEDTHLVLNWEKC--

Query:  ---------HFMVKDGI---------------------------------------------VLDCRKN--------------FETLKAALISAPILCAP
                 H +  DGI                                             +  C K               F+ LK  +   PIL  P
Subjt:  ---------HFMVKDGI---------------------------------------------VLDCRKN--------------FETLKAALISAPILCAP

Query:  NWNLPFEVMCDASDVAVGAMFGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDH----------------------
        ++   F +  DASDVA+GA+  Q      HP+ Y S+ LNE ++NY+T EKELLA+V+A + FR YL+G    + +DH                      
Subjt:  NWNLPFEVMCDASDVAVGAMFGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDH----------------------

Query:  -AEFDLEIKDKKGSENVIADHLSR
         +EFD +IK  KG EN +AD LSR
Subjt:  -AEFDLEIKDKKGSENVIADHLSR

P10394 Retrovirus-related Pol polyprotein from transposon 4123.1e-2124.76Show/hide
Query:  EVVKKEVIKWLDADIIYPIADRNWVSPVQCVPKKGRVTVVSNKDNELIPTRIITGWRVCMDYKRLNKATRKDHFPLLFIDQMLDRLAGQAYYCFLYGYFG
        E ++ +V K +   I+ P   + + SP+  VPKK       N D +         WR+ +DY+++NK    D FPL  ID +LD+L    Y+  L    G
Subjt:  EVVKKEVIKWLDADIIYPIADRNWVSPVQCVPKKGRVTVVSNKDNELIPTRIITGWRVCMDYKRLNKATRKDHFPLLFIDQMLDRLAGQAYYCFLYGYFG

Query:  YNQITIAPEDQEKTTFT----------CPYG------------TFVLGGC--LLAFAMLQQHFSSSC-----LDNLGKVLKRCEDTHLVLNWEKCHFMVK
        ++QI +    ++ T+F+           P+G            T    G     AF  +       C     L NL +V  +C + +L L+ EKC F + 
Subjt:  YNQITIAPEDQEKTTFT----------CPYG------------TFVLGGC--LLAFAMLQQHFSSSC-----LDNLGKVLKRCEDTHLVLNWEKCHFMVK

Query:  D-----------GIVLD----------------------------------------------------------CRKNFETLKAALISAPILCAPNWNL
        +           GI+ D                                                          C+K F  LK+ LI+  +L  P+++ 
Subjt:  D-----------GIVLD----------------------------------------------------------CRKNFETLKAALISAPILCAPNWNL

Query:  PFEVMCDASDVAVGAMFGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDH-----------------------AEF
         F +  DAS  A GA+  Q       P+ YAS+   + + N +TTE+EL A+ +A   FRPY+ G   TV TDH                        E+
Subjt:  PFEVMCDASDVAVGAMFGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDH-----------------------AEF

Query:  DLEIKDKKGSENVIADHLSR
        +  ++  KG +N +AD LSR
Subjt:  DLEIKDKKGSENVIADHLSR

P20825 Retrovirus-related Pol polyprotein from transposon 2972.6e-2025.96Show/hide
Query:  WRVCMDYKRLNKATRKDHFPLLFIDQMLDRLAGQAYYCFLYGYFGYNQITIAPEDQEKTTFT----------CPYG------TFVLGGCLLAFAMLQQH-
        +RV +DY++LN+ T  D +P+  +D++L +L    Y+  +    G++QI +  E   KT F+           P+G      TF      +   +L +H 
Subjt:  WRVCMDYKRLNKATRKDHFPLLFIDQMLDRLAGQAYYCFLYGYFGYNQITIAPEDQEKTTFT----------CPYG------TFVLGGCLLAFAMLQQH-

Query:  ---------FSSSCLDNLGK---VLKRCEDTHLVLNWEKCHFMVK-----------DGI-----------------------------------------
                 FS+S  ++L     V  +  D +L L  +KC F+ K           DGI                                         
Subjt:  ---------FSSSCLDNLGK---VLKRCEDTHLVLNWEKCHFMVK-----------DGI-----------------------------------------

Query:  ----VLDCRKN--------------FETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMFGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVF
            +  C K               FE LKA +I  PIL  P++   F +  DAS++A+GA+  Q      HPI + S+ LN+ ++NY+  EKELLA+V+
Subjt:  ----VLDCRKN--------------FETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMFGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVF

Query:  AFEKFRPYLVGSKVTVFTDH-----------------------AEFDLEIKDKKGSENVIADHLSR
        A + FR YL+G +  + +DH                       +E+  +I   KG EN +AD LSR
Subjt:  AFEKFRPYLVGSKVTVFTDH-----------------------AEFDLEIKDKKGSENVIADHLSR

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein4.5e-2024.43Show/hide
Query:  HKITLEEGSFRSVEQQRRLNPAMKEVVKKEVIKWLDADIIYPIADRNWVSPVQCVPKKGRVTVVSNKDNELIPTRIITGWRVCMDYKRLNKATRKDHFPL
        H I ++ G+     Q   +    ++ + K V K LD   I P +     SPV  VPKK                     +R+C+DY+ LNKAT  D FPL
Subjt:  HKITLEEGSFRSVEQQRRLNPAMKEVVKKEVIKWLDADIIYPIADRNWVSPVQCVPKKGRVTVVSNKDNELIPTRIITGWRVCMDYKRLNKATRKDHFPL

Query:  LFIDQMLDRLAGQAYYCFLYGYFGYNQITIAPEDQEKTTFTCPYG----TFVLGGCLLAFAMLQQHFSSSCLD-----------------------NLGK
          ID +L R+     +  L  + GY+QI + P+D+ KT F  P G    T +  G + A +   ++ + +  D                       +L  
Subjt:  LFIDQMLDRLAGQAYYCFLYGYFGYNQITIAPEDQEKTTFTCPYG----TFVLGGCLLAFAMLQQHFSSSCLD-----------------------NLGK

Query:  VLKRCEDTHLVLNWEKCHFMVKD--------GI--------------------------------------VLDCR---------------------KNF
        VL+R ++ +L++  +KC F  ++        GI                                      + +C                      K  
Subjt:  VLKRCEDTHLVLNWEKCHFMVKD--------GI--------------------------------------VLDCR---------------------KNF

Query:  ETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMFGQ--KQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDH----
        E LKAAL ++P+L   N    + +  DAS   +GA+  +   + K +  + Y SK L  AQ NY   E ELL ++ A   FR  L G   T+ TDH    
Subjt:  ETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMFGQ--KQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVFTDH----

Query:  -------------------AEFDLEIKDKKGSENVIADHLSR
                           A +D  ++   G +NV+AD +SR
Subjt:  -------------------AEFDLEIKDKKGSENVIADHLSR

Q9UR07 Transposon Tf2-11 polyprotein2.7e-1721.46Show/hide
Query:  KITLEEGSFRSVEQQRRLNPAMKEVVKKEVIKWLDADIIYPIADRNWVSPVQCVPKKGRVTVVSNKDNELIPTRIITGWRVCMDYKRLNKATRKDHFPLL
        ++ L + ++R   +   L P   + +  E+ + L + II      N   PV  VPKK                      R+ +DYK LNK  + + +PL 
Subjt:  KITLEEGSFRSVEQQRRLNPAMKEVVKKEVIKWLDADIIYPIADRNWVSPVQCVPKKGRVTVVSNKDNELIPTRIITGWRVCMDYKRLNKATRKDHFPLL

Query:  FIDQMLDRLAGQAYYCFLYGYFGYNQITIAPEDQEKTTFTCPYGTF----------------------VLGG-------CLLAFAMLQQHFSSSCLDNLG
         I+Q+L ++ G   +  L     Y+ I +   D+ K  F CP G F                      +LG        C +   ++     S  + ++ 
Subjt:  FIDQMLDRLAGQAYYCFLYGYFGYNQITIAPEDQEKTTFTCPYGTF----------------------VLGG-------CLLAFAMLQQHFSSSCLDNLG

Query:  KVLKRCEDTHLVLNWEKCHF----------MVKDGIVLDCRKN---------------------------------------------------------
         VL++ ++ +L++N  KC F           + +     C++N                                                         
Subjt:  KVLKRCEDTHLVLNWEKCHF----------MVKDGIVLDCRKN---------------------------------------------------------

Query:  --FETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMFGQK-QGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVGS--KVTVFTDH
           E +K  L+S P+L   +++    +  DASDVAVGA+  QK      +P+ Y S  +++AQ+NY+ ++KE+LA++ + + +R YL  +     + TDH
Subjt:  --FETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMFGQK-QGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVGS--KVTVFTDH

Query:  -------------------------AEFDLEIKDKKGSENVIADHLSR------------RDPSSCLLNNSAISDSFPDE
                                  +F+ EI  + GS N IAD LSR             D S   +N  +I+D F ++
Subjt:  -------------------------AEFDLEIKDKKGSENVIADHLSR------------RDPSSCLLNNSAISDSFPDE

Arabidopsis top hitse value%identityAlignment
ATMG00750.1 GAG/POL/ENV polyprotein1.9e-1358Show/hide
Query:  ILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPLTYILEIELFDL
        +L  GF+WPT FKDAH F   CD CQR+GN   R+EMP  +ILE+E+FD+
Subjt:  ILHCGFFWPTLFKDAHWFYKQCDVCQRRGNLGPRDEMPLTYILEIELFDL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAGCAATATCTTGGTAGCGAATGATAGGGCCAGAGCCATTCGAGCGTATGCTTTTCCAATGTTTGATGAGCTAAATCCAAGGATTACACGTCCTCAAATTCAAGC
GGCAAATTTTGAAATGAAACCGGTAATGTTTCAGATGTTGCAAACCGTGGGTCAATTCCATGGTTTGTCATCTGAAGACCCTCATTTGCATCTTAAGTCTTTTCTAGGAG
TTAGTGATTCTTTTGTAATTCAAGGAGTGCCTAGAAAGGCTCTTCGATTAACTTTGTTCCCGTATTCTCTTAGAGATGGAGCAAAGGCATGGTTAAATTCTTTTGCTCCA
AGATCAATTAGGACATGGAATGAATTAGCTGAAAATTTTCTTAGTAAATATTTCTCACCAAATAGAAATGCTAAATTAAGGAGTGAAATAGTAGGGTTTAGGCAACTTGA
AGATGAAACTTTTAGTGAGGCTTGGGAAAGGGCTGATCTTGCAATGATTGCTAACGCTTTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCACCAGCTGTGGAGCCTG
CTGCATTGGTGAACCAAGTCACAGAGGAAGCATGTGTCTATTGTGGTGAAGATCACAACTACGAGTTTTGCCCCAACAATCCAACTTCTATGTTTTTTGTAGGTTGGCGC
AACCACCCCAACTTCTCATGGGGAGGACAAGGAAGTAATGTGCAAACACAACAAAAGGTGAACCAGTCGGGATTTGCGAAAGCGCAGGTAATGCCCCAACAAACTAAGCA
GGCTTTGCCCCAGCAAAATTCAGGGAGTTCTCTCGAGGCGATGATGAAAGAATTTATGGCTCGTACATATGCTGCAATTCAAAGTAATCAAGCTTCAATGAGAGCCTTGG
AATTGCAAGTGGGTCAACTAGCTAATGAGCTAAAGGCAAGGCCTCTGGGAAACTTCCCTCAGATATTGAACACCCTCGAAGGGAAGGGTGCTGGAGGCAGCAATAAAGAT
GCTGGAGCATCTGGTTCTGTTCCAGATGTGGAACCACCTTATGTGCCGCCCCCACCTTATAGGTTAGGTGAGTTTGAAACTGTATCTCTTACTGAGGAATGTAGTGCTAT
TCTTAAGAATGGGCTACCATCCAAGGCTAAGGATCCAGAATCATTTACTATACCTGTATCTATAGGTGGAAAAGAGTTTGGTAGAGCACTCTGTGATTTAGGTGCAAGCA
TTAACGTTATGCCTCTTTCGGTCTATCGAAAGTTAGGTATTGGTGAAGCTAGGCCTACCACAGTCACGCTCCAACTCGCTGATAGGTCTATCACATATCCAGATGGTAAA
ATTGAGGATGTCTTAGTGAAAGAGGATAAATTCATATTTCCTGTTGATTTTATTATTTTAGACTATGAGGCTGATAAAGATGTCCCAATTATTTTAGGTCGTCCATTTTT
GGCTACTGGTGGGGCATTAATAGATGTTCAAAAAGGGGAATTAACAATGAGAGTCTGTAATGAGGAAGTGAAATTTAATGTGTTTAAGGTCATGAAGTATCCAGATGATA
TGGAAGATTGTTCTTTCATTAGGACTCTAGAGAGCACAATTGTTGAGACAACAACTCAGGATTCGGCTGACAAGCATTTGGAAGATCATGGAGAGGCTCCTCCTATTAAA
CCATCCCTGATTGAGGCACCCACTTTAGATTTGAAGCCCTTGCCGGATCATCTAAAGTATGTGTATCTTGGGGAAGGGATTAGCCCATCTTTTTGTATGCACAAAATCAC
TTTAGAGGAGGGATCCTTTAGGAGTGTTGAGCAACAAAGAAGGCTTAACCCTGCAATGAAAGAGGTTGTTAAAAAGGAGGTGATTAAATGGTTGGATGCTGACATCATTT
ATCCAATTGCAGACAGAAATTGGGTGAGCCCAGTCCAATGTGTTCCTAAGAAAGGACGTGTCACTGTGGTAAGCAATAAAGACAATGAGTTGATCCCAACCAGGATAATA
ACTGGCTGGAGGGTTTGCATGGATTACAAGAGGCTTAATAAAGCTACCCGTAAGGACCATTTCCCTCTACTATTTATTGACCAGATGTTGGATAGATTGGCTGGTCAGGC
CTACTACTGTTTCTTATATGGTTATTTCGGGTATAACCAGATTACTATTGCTCCTGAGGATCAGGAAAAAACCACTTTCACCTGTCCTTATGGGACGTTTGTTTTAGGCG
GATGCCTTTTGGCCTTTGCAATGCTTCAGCAACATTTCAGCAGTAGTTGTTTAGATAATTTAGGTAAGGTGTTAAAGAGATGTGAGGATACCCATCTAGTTCTCAATTGG
GAGAAATGCCACTTCATGGTGAAGGACGGCATAGTGTTAGATTGTAGGAAGAATTTTGAAACTTTAAAAGCTGCTTTAATCTCAGCACCCATTCTTTGTGCACCTAATTG
GAATTTACCATTTGAAGTAATGTGTGATGCAAGTGATGTTGCAGTAGGTGCGATGTTTGGGCAAAAGCAGGGCAAATTTATCCATCCTATATATTATGCAAGCAAAGTTT
TAAATGAGGCACAAGTCAACTACACAACTACTGAAAAGGAGTTGTTAGCTGTTGTGTTCGCTTTTGAGAAATTCCGGCCATATTTGGTTGGATCCAAAGTCACGGTGTTC
ACGGATCATGCAGAATTCGACTTGGAGATAAAGGACAAGAAGGGATCAGAGAATGTCATTGCAGATCATTTGTCTCGTCGTGATCCATCATCATGTTTGCTAAATAATTC
TGCCATTTCAGATTCTTTTCCAGATGAACAGCTTTTTACTGTTGAGGTAAAGGTAGTCAGGGATGCCCCTTGGTGTGTTTCAGGTGATGAAGCGAAGGAAATCTTGGAGC
AATGTCACTCTTCGCTGTATGGAGGTCATTTCAGCAGTCAGAGGACAGCTATGAGGATTTTGCATTGTGGATTCTTCTGGCCTACCTTGTTTAAGGATGCCCATTGGTTC
TACAAGCAATGTGATGTTTGCCAAAGGAGAGGAAACTTAGGGCCTAGAGATGAAATGCCTCTTACTTACATTTTAGAAATTGAATTATTCGATTTGATTACGTGTCCAAG
TGAGTGGAGGCCACCCGCATGCCATATGAGTGATGCCAAGATAATAGCAAGGTTTCTTCAATCGCACATCTTTGCGCGGTTTGGGACACCTAGAGCTCTAGTGAGTGATG
AGGATTGTCCTTTGATTTGTATGGGTGAGAGTGGCCAATACGCCGACTCAATAAGCCTACCACTTTGGGGACAAGACCGAATGTTGAGCTGGGAACATAATCGTACAAGA
TGGAATTCACTCCTTCCCGACTTTAGGGAAGCAGATGAGTGTTCCCTTAAGTGGTTACTCCGAGTCTTGAACAAAGGGCCCTACCCTCTCAATGGCACGAGAGGGTTTTC
TGTTGGATGGTTGGACCACAAACAGATCCGTGGACATAGAAAATATGTCTGCAGTGAGAAGAGTGCAACTGTCCATTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGCAGCAATATCTTGGTAGCGAATGATAGGGCCAGAGCCATTCGAGCGTATGCTTTTCCAATGTTTGATGAGCTAAATCCAAGGATTACACGTCCTCAAATTCAAGC
GGCAAATTTTGAAATGAAACCGGTAATGTTTCAGATGTTGCAAACCGTGGGTCAATTCCATGGTTTGTCATCTGAAGACCCTCATTTGCATCTTAAGTCTTTTCTAGGAG
TTAGTGATTCTTTTGTAATTCAAGGAGTGCCTAGAAAGGCTCTTCGATTAACTTTGTTCCCGTATTCTCTTAGAGATGGAGCAAAGGCATGGTTAAATTCTTTTGCTCCA
AGATCAATTAGGACATGGAATGAATTAGCTGAAAATTTTCTTAGTAAATATTTCTCACCAAATAGAAATGCTAAATTAAGGAGTGAAATAGTAGGGTTTAGGCAACTTGA
AGATGAAACTTTTAGTGAGGCTTGGGAAAGGGCTGATCTTGCAATGATTGCTAACGCTTTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCACCAGCTGTGGAGCCTG
CTGCATTGGTGAACCAAGTCACAGAGGAAGCATGTGTCTATTGTGGTGAAGATCACAACTACGAGTTTTGCCCCAACAATCCAACTTCTATGTTTTTTGTAGGTTGGCGC
AACCACCCCAACTTCTCATGGGGAGGACAAGGAAGTAATGTGCAAACACAACAAAAGGTGAACCAGTCGGGATTTGCGAAAGCGCAGGTAATGCCCCAACAAACTAAGCA
GGCTTTGCCCCAGCAAAATTCAGGGAGTTCTCTCGAGGCGATGATGAAAGAATTTATGGCTCGTACATATGCTGCAATTCAAAGTAATCAAGCTTCAATGAGAGCCTTGG
AATTGCAAGTGGGTCAACTAGCTAATGAGCTAAAGGCAAGGCCTCTGGGAAACTTCCCTCAGATATTGAACACCCTCGAAGGGAAGGGTGCTGGAGGCAGCAATAAAGAT
GCTGGAGCATCTGGTTCTGTTCCAGATGTGGAACCACCTTATGTGCCGCCCCCACCTTATAGGTTAGGTGAGTTTGAAACTGTATCTCTTACTGAGGAATGTAGTGCTAT
TCTTAAGAATGGGCTACCATCCAAGGCTAAGGATCCAGAATCATTTACTATACCTGTATCTATAGGTGGAAAAGAGTTTGGTAGAGCACTCTGTGATTTAGGTGCAAGCA
TTAACGTTATGCCTCTTTCGGTCTATCGAAAGTTAGGTATTGGTGAAGCTAGGCCTACCACAGTCACGCTCCAACTCGCTGATAGGTCTATCACATATCCAGATGGTAAA
ATTGAGGATGTCTTAGTGAAAGAGGATAAATTCATATTTCCTGTTGATTTTATTATTTTAGACTATGAGGCTGATAAAGATGTCCCAATTATTTTAGGTCGTCCATTTTT
GGCTACTGGTGGGGCATTAATAGATGTTCAAAAAGGGGAATTAACAATGAGAGTCTGTAATGAGGAAGTGAAATTTAATGTGTTTAAGGTCATGAAGTATCCAGATGATA
TGGAAGATTGTTCTTTCATTAGGACTCTAGAGAGCACAATTGTTGAGACAACAACTCAGGATTCGGCTGACAAGCATTTGGAAGATCATGGAGAGGCTCCTCCTATTAAA
CCATCCCTGATTGAGGCACCCACTTTAGATTTGAAGCCCTTGCCGGATCATCTAAAGTATGTGTATCTTGGGGAAGGGATTAGCCCATCTTTTTGTATGCACAAAATCAC
TTTAGAGGAGGGATCCTTTAGGAGTGTTGAGCAACAAAGAAGGCTTAACCCTGCAATGAAAGAGGTTGTTAAAAAGGAGGTGATTAAATGGTTGGATGCTGACATCATTT
ATCCAATTGCAGACAGAAATTGGGTGAGCCCAGTCCAATGTGTTCCTAAGAAAGGACGTGTCACTGTGGTAAGCAATAAAGACAATGAGTTGATCCCAACCAGGATAATA
ACTGGCTGGAGGGTTTGCATGGATTACAAGAGGCTTAATAAAGCTACCCGTAAGGACCATTTCCCTCTACTATTTATTGACCAGATGTTGGATAGATTGGCTGGTCAGGC
CTACTACTGTTTCTTATATGGTTATTTCGGGTATAACCAGATTACTATTGCTCCTGAGGATCAGGAAAAAACCACTTTCACCTGTCCTTATGGGACGTTTGTTTTAGGCG
GATGCCTTTTGGCCTTTGCAATGCTTCAGCAACATTTCAGCAGTAGTTGTTTAGATAATTTAGGTAAGGTGTTAAAGAGATGTGAGGATACCCATCTAGTTCTCAATTGG
GAGAAATGCCACTTCATGGTGAAGGACGGCATAGTGTTAGATTGTAGGAAGAATTTTGAAACTTTAAAAGCTGCTTTAATCTCAGCACCCATTCTTTGTGCACCTAATTG
GAATTTACCATTTGAAGTAATGTGTGATGCAAGTGATGTTGCAGTAGGTGCGATGTTTGGGCAAAAGCAGGGCAAATTTATCCATCCTATATATTATGCAAGCAAAGTTT
TAAATGAGGCACAAGTCAACTACACAACTACTGAAAAGGAGTTGTTAGCTGTTGTGTTCGCTTTTGAGAAATTCCGGCCATATTTGGTTGGATCCAAAGTCACGGTGTTC
ACGGATCATGCAGAATTCGACTTGGAGATAAAGGACAAGAAGGGATCAGAGAATGTCATTGCAGATCATTTGTCTCGTCGTGATCCATCATCATGTTTGCTAAATAATTC
TGCCATTTCAGATTCTTTTCCAGATGAACAGCTTTTTACTGTTGAGGTAAAGGTAGTCAGGGATGCCCCTTGGTGTGTTTCAGGTGATGAAGCGAAGGAAATCTTGGAGC
AATGTCACTCTTCGCTGTATGGAGGTCATTTCAGCAGTCAGAGGACAGCTATGAGGATTTTGCATTGTGGATTCTTCTGGCCTACCTTGTTTAAGGATGCCCATTGGTTC
TACAAGCAATGTGATGTTTGCCAAAGGAGAGGAAACTTAGGGCCTAGAGATGAAATGCCTCTTACTTACATTTTAGAAATTGAATTATTCGATTTGATTACGTGTCCAAG
TGAGTGGAGGCCACCCGCATGCCATATGAGTGATGCCAAGATAATAGCAAGGTTTCTTCAATCGCACATCTTTGCGCGGTTTGGGACACCTAGAGCTCTAGTGAGTGATG
AGGATTGTCCTTTGATTTGTATGGGTGAGAGTGGCCAATACGCCGACTCAATAAGCCTACCACTTTGGGGACAAGACCGAATGTTGAGCTGGGAACATAATCGTACAAGA
TGGAATTCACTCCTTCCCGACTTTAGGGAAGCAGATGAGTGTTCCCTTAAGTGGTTACTCCGAGTCTTGAACAAAGGGCCCTACCCTCTCAATGGCACGAGAGGGTTTTC
TGTTGGATGGTTGGACCACAAACAGATCCGTGGACATAGAAAATATGTCTGCAGTGAGAAGAGTGCAACTGTCCATTAG
Protein sequenceShow/hide protein sequence
MSSNILVANDRARAIRAYAFPMFDELNPRITRPQIQAANFEMKPVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRKALRLTLFPYSLRDGAKAWLNSFAP
RSIRTWNELAENFLSKYFSPNRNAKLRSEIVGFRQLEDETFSEAWERADLAMIANAFKNVTVISHQQPPAVEPAALVNQVTEEACVYCGEDHNYEFCPNNPTSMFFVGWR
NHPNFSWGGQGSNVQTQQKVNQSGFAKAQVMPQQTKQALPQQNSGSSLEAMMKEFMARTYAAIQSNQASMRALELQVGQLANELKARPLGNFPQILNTLEGKGAGGSNKD
AGASGSVPDVEPPYVPPPPYRLGEFETVSLTEECSAILKNGLPSKAKDPESFTIPVSIGGKEFGRALCDLGASINVMPLSVYRKLGIGEARPTTVTLQLADRSITYPDGK
IEDVLVKEDKFIFPVDFIILDYEADKDVPIILGRPFLATGGALIDVQKGELTMRVCNEEVKFNVFKVMKYPDDMEDCSFIRTLESTIVETTTQDSADKHLEDHGEAPPIK
PSLIEAPTLDLKPLPDHLKYVYLGEGISPSFCMHKITLEEGSFRSVEQQRRLNPAMKEVVKKEVIKWLDADIIYPIADRNWVSPVQCVPKKGRVTVVSNKDNELIPTRII
TGWRVCMDYKRLNKATRKDHFPLLFIDQMLDRLAGQAYYCFLYGYFGYNQITIAPEDQEKTTFTCPYGTFVLGGCLLAFAMLQQHFSSSCLDNLGKVLKRCEDTHLVLNW
EKCHFMVKDGIVLDCRKNFETLKAALISAPILCAPNWNLPFEVMCDASDVAVGAMFGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVGSKVTVF
TDHAEFDLEIKDKKGSENVIADHLSRRDPSSCLLNNSAISDSFPDEQLFTVEVKVVRDAPWCVSGDEAKEILEQCHSSLYGGHFSSQRTAMRILHCGFFWPTLFKDAHWF
YKQCDVCQRRGNLGPRDEMPLTYILEIELFDLITCPSEWRPPACHMSDAKIIARFLQSHIFARFGTPRALVSDEDCPLICMGESGQYADSISLPLWGQDRMLSWEHNRTR
WNSLLPDFREADECSLKWLLRVLNKGPYPLNGTRGFSVGWLDHKQIRGHRKYVCSEKSATVH