; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022090 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022090
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr7:17955259..17966459
RNA-Seq ExpressionLag0022090
SyntenyLag0022090
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR005162 - Retrotransposon gag domain
IPR012337 - Ribonuclease H-like superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_017239676.1 PREDICTED: uncharacterized protein LOC108212460 [Daucus carota subsp. sativus]0.0e+0041.72Show/hide
Query:  MSRLPQGLKDPRNPQQNRLLQQNQQLEQNEQQNNQAENPILVANDRARAIRAYTFPMFDELNPGIARPQIEAENFEMKPVMFQMLHTVGQFHGLSFEDPH
        M RL      P + +  +  + N++  +  +  +           R   ++ Y  P FD ++  IARP I A NF +     Q +    +F+GLS EDP+
Subjt:  MSRLPQGLKDPRNPQQNRLLQQNQQLEQNEQQNNQAENPILVANDRARAIRAYTFPMFDELNPGIARPQIEAENFEMKPVMFQMLHTVGQFHGLSFEDPH

Query:  LHLKSFIGVSDSFVIQGVPRDALRLTLFLYSLRDGAKAWLNSFAPGSIRTLNELAEKFLSKYFPPNRNDKSRSEIVGFKQLEDETFSEAWERFKELLRKC
         HL++F+ + D+F + GVP + +RL LF  SL   A+ WL+S    SI T N+L EKFL+KYF P + ++   EI  F+Q + E+  EA+ERFK+LLRKC
Subjt:  LHLKSFIGVSDSFVIQGVPRDALRLTLFLYSLRDGAKAWLNSFAPGSIRTLNELAEKFLSKYFPPNRNDKSRSEIVGFKQLEDETFSEAWERFKELLRKC

Query:  PHHGLPHCIQMETFYNGLYGATQGMVDASTGGALLVKTFNEAHEILERISTNSCQWSDVRGTNKKVKSVLEVDGVSTIRDDLAMIANSLNVGIEHACFRS
        PHHGL    ++ TFYNGL       VD +  G+L  +   +A EILE I++N+C+  D R ++KKV  V EVD +++    +     +LN          
Subjt:  PHHGLPHCIQMETFYNGLYGATQGMVDASTGGALLVKTFNEAHEILERISTNSCQWSDVRGTNKKVKSVLEVDGVSTIRDDLAMIANSLNVGIEHACFRS

Query:  KIECLKCFRSSIASACVICTPSLKNVTVISHQQPLALEPAALVNQVAEEACVYCGEDHNYEFCP------NNPASVFFVG---NQRNNPYSIFYNPGWRN
        K+E L                       +  QQP+    +A   Q     C  CGE H  + CP         +SV +VG   NQ+NNP+S  YNPGWRN
Subjt:  KIECLKCFRSSIASACVICTPSLKNVTVISHQQPLALEPAALVNQVAEEACVYCGEDHNYEFCP------NNPASVFFVG---NQRNNPYSIFYNPGWRN

Query:  HPNFSWGG-----------------QGSNVQAQQK-------------------VNQSGFAKA------QLANELKARPQGKLPSDTE-HPRREGKEQVK
        HPNFSW                   Q    Q  +K                    +QS   +A      QLA+ +  RP G LPS+TE +P+ + +E  K
Subjt:  HPNFSWGG-----------------QGSNVQAQQK-------------------VNQSGFAKA------QLANELKARPQGKLPSDTE-HPRREGKEQVK

Query:  AVTLRIELETGQGAGGSNNDAGAYGSVPDVEPPYV---------PPPPYVPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKD
        A+TLR   E  +G     +D G    V + EP  +         P     PP PFPQR + + QD QF+KF+++ K+L INIP  EA+EQM +Y KF+KD
Subjt:  AVTLRIELETGQGAGGSNNDAGAYGSVPDVEPPYV---------PPPPYVPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKD

Query:  ILTKKKRLCEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRALCDLGANINLMPLSVYRKLGIGEARPTTGTFQLANRSITYPEGKIEDV
        IL++K+RL EFETV+LTEECS IL+  LP K KDPGSFTIP +IG +  G+ALCDLGA++NLMPLS++ KLG+GE +PT+   QLA+RS+ YP G +EDV
Subjt:  ILTKKKRLCEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRALCDLGANINLMPLSVYRKLGIGEARPTTGTFQLANRSITYPEGKIEDV

Query:  LVKVDKFIFPVDFIILDYEAYKDVPIILGRPFLATGRTLIDVQKGELTMRVYNEEVKFNVFKVMKYPDEMEDCSFVRILEN-----TIVETAMQESTNKH
        LVKVDKFIFP DFI+LD E   D+P++LGRPFLATGRTLIDVQKGELTMRV +E+V FNVF  MK+ ++ E C  V           ++E    +     
Subjt:  LVKVDKFIFPVDFIILDYEAYKDVPIILGRPFLATGRTLIDVQKGELTMRVYNEEVKFNVFKVMKYPDEMEDCSFVRILEN-----TIVETAMQESTNKH

Query:  LEDHGEDSIEDLEVCSLERKNEKEVFRCEDVFESLDLDQRKAPPIKPSLIEAPNLDLKPLSDHLKYVYLGEGDTLPIIVASDLMAEHEKALIKLLQQYRK
        L + G++S E++  C  E        R    FES ++   K+   KPS+ E P L+LK L  HLKY +LGE  TLP+I++S L AEHE+ L+++L++Y++
Subjt:  LEDHGEDSIEDLEVCSLERKNEKEVFRCEDVFESLDLDQRKAPPIKPSLIEAPNLDLKPLSDHLKYVYLGEGDTLPIIVASDLMAEHEKALIKLLQQYRK

Query:  AIGWTLADIQEISPSFCMHKITLEEGSFRLNRNYARYHPAMKEVVKKKVIKWLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRV
        AIGW +ADI+ ISPSFCMHKI++E+       +  R +P MKEVVKK++IKWLDAGIIYPI+D++WVSP+QCVPKKGG+TVV+N+ NELIPTRTVTGWRV
Subjt:  AIGWTLADIQEISPSFCMHKITLEEGSFRLNRNYARYHPAMKEVVKKKVIKWLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRV

Query:  CMDYRKLNKATHKDHFPLPFIDQMLDRLDGQTYYCFLDGYSG--------------------------RMPFDLCYAPTTFQRCMLAIFSYMIESTVE--
        CMDYRKLNKAT KDHFPLPFIDQMLDRL G+ +YCFLDGYSG                          ++ F LC AP+TFQRCM+AIFS MIE  VE  
Subjt:  CMDYRKLNKATHKDHFPLPFIDQMLDRLDGQTYYCFLDGYSG--------------------------RMPFDLCYAPTTFQRCMLAIFSYMIESTVE--

Query:  --EGIMLGH-------RISKNGLEVDRANIEVIERLEPPNSVKEIRSFLGHAGFYRR-----------------------------KAFETLKTVLISTP
          +  +LG         +++   +VD+A +E I +L PP+SVK +RSFLGHAGFYRR                             +A+E LK  L+S P
Subjt:  --EGIMLGH-------RISKNGLEVDRANIEVIERLEPPNSVKEIRSFLGHAGFYRR-----------------------------KAFETLKTVLISTP

Query:  ILCAPNWNLPFEVMCAASDAA-------------------------------------------------------------------------------
        I+ AP+W+LPFE+MC ASD A                                                                               
Subjt:  ILCAPNWNLPFEVMCAASDAA-------------------------------------------------------------------------------

Query:  -------EFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISDSFPDEQLFIVEVKMVKDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDE
               EFDLEIK +KGSEN +ADHLSRL+ +        I+DSFPDE+L  +     K +PW+AD  N+LV  + P +++++QKKKF H  K F WDE
Subjt:  -------EFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISDSFPDEQLFIVEVKMVKDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDE

Query:  PFMYKQCSDGIILRRMEAILVCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGSFPPSNGNVFILLAVDYVFKWVEAIACHQNDAKTVARFLQSHI
        P+++K   D ++ R +    +CD CQR GN+  R+EMPL YILEVELFDVWGIDFMG F  S  N +ILLAVDYV KWVEAIA   +DA  V  F++ +I
Subjt:  PFMYKQCSDGIILRRMEAILVCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGSFPPSNGNVFILLAVDYVFKWVEAIACHQNDAKTVARFLQSHI

Query:  FVWFGTPRALVSDEGTHSVNNILTKLLAKYEIKHKIATPYHPQANGQAEI--RAI---------------------------------------------
        F  FGTPRA++SDEGTH  N     LL KY + HK+A  YHPQ NGQAE+  R I                                             
Subjt:  FVWFGTPRALVSDEGTHSVNNILTKLLAKYEIKHKIATPYHPQANGQAEI--RAI---------------------------------------------

Query:  -----------------------------RMLQLNELEEFRQFSYENSKMYKEKTKLWHDKKIKSK
                                     R+L+LNELEEFR  ++E+S++YKEK K WHD+KI  K
Subjt:  -----------------------------RMLQLNELEEFRQFSYENSKMYKEKTKLWHDKKIKSK

XP_038972405.1 uncharacterized protein LOC120104748 [Phoenix dactylifera]0.0e+0040.31Show/hide
Query:  NDRARAIRAYTFPMFDELNPGIARPQIEAENFEMKPVMFQMLHTVGQFHGLSFEDPHLHLKSFIGVSDSFVIQGVPRDALRLTLFLYSLRDGAKAWLNSF
        N   R +  Y  P  +   P I RP + A NFE+KP + QM+    QF G   EDPH HL +F+ + D+  + GV  DA+RL LF +SL+D AKAWLNS 
Subjt:  NDRARAIRAYTFPMFDELNPGIARPQIEAENFEMKPVMFQMLHTVGQFHGLSFEDPHLHLKSFIGVSDSFVIQGVPRDALRLTLFLYSLRDGAKAWLNSF

Query:  APGSIRTLNELAEKFLSKYFPPNRNDKSRSEIVGFKQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLYGATQGMVDASTGGALLVKTFNEAH
        AP S  T N L++ FLSKYFPP +  K R++I  F Q + E+  EAWERFK+L RKCPHHGLP  + ++TFYNGL  + +  +DA+ GG L+ K+  EA+
Subjt:  APGSIRTLNELAEKFLSKYFPPNRNDKSRSEIVGFKQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLYGATQGMVDASTGGALLVKTFNEAH

Query:  EILERISTNSCQWSDVRGTNKKVKSVLEVDGVSTIRDDLAMIANSLNVGIEHACFRSKIECLKCFRSSIASACVICTPSLKNVTVISHQQPLALEPAALV
        E+LE +++N+ QWS+ R   KKV  + +VDG+           N LN         +K++ L      + +   + +P L                    
Subjt:  EILERISTNSCQWSDVRGTNKKVKSVLEVDGVSTIRDDLAMIANSLNVGIEHACFRSKIECLKCFRSSIASACVICTPSLKNVTVISHQQPLALEPAALV

Query:  NQVAEEACVYCGEDHNYEFCPNNPASVFFVGN-----QRNNPYSIFYNPGWRNHPNFSWGGQGSNVQAQQKVNQSGF-----------------------
              +C  CG  H    C      V FV N     Q+NNPYS  YNPGWRNHPNFSW  QG+   + + ++  GF                       
Subjt:  NQVAEEACVYCGEDHNYEFCPNNPASVFFVGN-----QRNNPYSIFYNPGWRNHPNFSWGGQGSNVQAQQKVNQSGF-----------------------

Query:  ---------AK---------------AQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRIELETGQGAGGS-NNDAGAYGSV-----PDVE------
                 AK                QLAN + +R QG LPS TE      KE  KAVTLR   + GQ +G +   D   Y  V      +VE      
Subjt:  ---------AK---------------AQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRIELETGQGAGGS-NNDAGAYGSV-----PDVE------

Query:  PPYVPPPPYVPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLCEFETVSLTEECSVILKNGLPTKAKDPGSFTI
         P  P  PYVPP+PFPQR +    D QF+KFL++ +QLHINIP  +A+ Q+P Y KFLK+I++KK++L +FET++LTEECS I++N LP K +DPGSF+I
Subjt:  PPYVPPPPYVPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLCEFETVSLTEECSVILKNGLPTKAKDPGSFTI

Query:  PVSIGGKELGRALCDLGANINLMPLSVYRKLGIGEARPTTGTFQLANRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEAYKDVPIILGRPFLATGRTLI
        P +IG  +  RALCDLGA+++LMPLSV RKLG+ E +PTT + QLA+RS+ YP G +E+VL+KV KFI PVDFI+L+ E   ++PIILGRPFLAT   +I
Subjt:  PVSIGGKELGRALCDLGANINLMPLSVYRKLGIGEARPTTGTFQLANRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEAYKDVPIILGRPFLATGRTLI

Query:  DVQKGELTMRVYNEEVKFNVFKVMKYPDEMEDCSFVRILENTIVETAMQESTNKHLE----DHGEDSIEDLEV----CSLER---KNEKEVFRCEDVFES
        DV+ G LT++V  EEV+FN+F+  KYP   +    V +++ +  E    E+T + LE      G    ++LE+    C+LE    K +K     ED+   
Subjt:  DVQKGELTMRVYNEEVKFNVFKVMKYPDEMEDCSFVRILENTIVETAMQESTNKHLE----DHGEDSIEDLEV----CSLER---KNEKEVFRCEDVFES

Query:  LDLDQRKAPPIKPSLIEAPNLDLKPLSDHLKYVYLGEGDTLPIIVASDLMAEHEKALIKLLQQYRKAIGWTLADIQEISPSFCMHKITLEEGSFRLNRNY
             +  PP  PS ++AP L+LKPL  HL Y +LGE +TLP+IV+  L  E    LI++L+  +KAIGWT++D++ ISPS CMH+I +E+    +  N 
Subjt:  LDLDQRKAPPIKPSLIEAPNLDLKPLSDHLKYVYLGEGDTLPIIVASDLMAEHEKALIKLLQQYRKAIGWTLADIQEISPSFCMHKITLEEGSFRLNRNY

Query:  ARYHPAMKEVVKKKVIKWLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRKLNKATHKDHFPLPFIDQMLDRLDGQTYY
         R +P MKEVV+ +V+KWLDAGIIYPI+D+ W+SPVQ VPKKGG+TVV N++NELIPTRTVTGWRVC+DYRKLN  T KDHFPLPF+DQ+L+RL G  YY
Subjt:  ARYHPAMKEVVKKKVIKWLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRKLNKATHKDHFPLPFIDQMLDRLDGQTYY

Query:  CFLDGYSG--------------------------RMPFDLCYAPTTFQRCMLAIFSYMIES---------------------------------------
        CFLDGYSG                          RMPF LC AP TFQRCM+AIFS  +E                                        
Subjt:  CFLDGYSG--------------------------RMPFDLCYAPTTFQRCMLAIFSYMIES---------------------------------------

Query:  -----TVEEGIMLGHRISKNGLEVDRANIEVIERLEPPNSVKEIRSFLGHAGFYRR-----------------------------KAFETLKTVLISTPI
              V+EGI+LGH+IS  GLEVDRA IE+IE+L PP +VK +RSFLGH GFYRR                              AF  LK  L+S PI
Subjt:  -----TVEEGIMLGHRISKNGLEVDRANIEVIERLEPPNSVKEIRSFLGHAGFYRR-----------------------------KAFETLKTVLISTPI

Query:  LCAPNWNLPFEVMCAASDAA--------------------------------------------------------------------------------
        + AP+W+LPFE+MC ASD A                                                                                
Subjt:  LCAPNWNLPFEVMCAASDAA--------------------------------------------------------------------------------

Query:  ------EFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISDSFPDEQLFIVEVKMVKDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEP
              EFDLEI+DK+G ENV+ADHLSRL+   S   +  I++SFPDEQL  V V     +PWYAD+ N+LV G+ P D+ + QKKKF  D K ++W+EP
Subjt:  ------EFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISDSFPDEQLFIVEVKMVKDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEP

Query:  FMYKQCSDGIILR-----RMEAIL----------------------------------------VCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFM
         +YK C+DG+I R      ME IL                                        +CD CQR GN+  ++EMPLT ILEVELFD+WGIDFM
Subjt:  FMYKQCSDGIILR-----RMEAIL----------------------------------------VCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFM

Query:  GSFPPSNGNVFILLAVDYVFKWVEAIACHQNDAKTVARFLQSHIFVWFGTPRALVSDEGTHSVNNILTKLLAKYEIKHKIATPYHPQANGQAEI------
        G FP S  N +IL+AVDYV KWVEA A   ND++ V RF++ +IF  FG PRA++SDEG+H  N     LL KY + HK+A  YHPQ NGQ E+      
Subjt:  GSFPPSNGNVFILLAVDYVFKWVEAIACHQNDAKTVARFLQSHIFVWFGTPRALVSDEGTHSVNNILTKLLAKYEIKHKIATPYHPQANGQAEI------

Query:  ------------------------------------------------------RAI----------------RMLQLNELEEFRQFSYENSKMYKEKTK
                                                              RA                 R+LQL+ELEEFR  +YEN+++YKEKTK
Subjt:  ------------------------------------------------------RAI----------------RMLQLNELEEFRQFSYENSKMYKEKTK

Query:  LWHDKKIKSKDF----SKFLFTSPSRFIP
         WHDK ++ ++F       LF S  +  P
Subjt:  LWHDKKIKSKDF----SKFLFTSPSRFIP

XP_038973683.1 uncharacterized protein LOC120105384 [Phoenix dactylifera]0.0e+0040.31Show/hide
Query:  NDRARAIRAYTFPMFDELNPGIARPQIEAENFEMKPVMFQMLHTVGQFHGLSFEDPHLHLKSFIGVSDSFVIQGVPRDALRLTLFLYSLRDGAKAWLNSF
        N   R +  Y  P  +   P I RP + A NFE+KP + QM+    QF G   EDPH HL +F+ + D+  + GV  DA+RL LF +SL+D AKAWLNS 
Subjt:  NDRARAIRAYTFPMFDELNPGIARPQIEAENFEMKPVMFQMLHTVGQFHGLSFEDPHLHLKSFIGVSDSFVIQGVPRDALRLTLFLYSLRDGAKAWLNSF

Query:  APGSIRTLNELAEKFLSKYFPPNRNDKSRSEIVGFKQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLYGATQGMVDASTGGALLVKTFNEAH
        AP S  T N L++ FLSKYFPP +  K R++I  F Q + E+  EAWERFK+L RKCPHHGLP  + ++TFYNGL  + +  +DA+ GG L+ K+  EA+
Subjt:  APGSIRTLNELAEKFLSKYFPPNRNDKSRSEIVGFKQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLYGATQGMVDASTGGALLVKTFNEAH

Query:  EILERISTNSCQWSDVRGTNKKVKSVLEVDGVSTIRDDLAMIANSLNVGIEHACFRSKIECLKCFRSSIASACVICTPSLKNVTVISHQQPLALEPAALV
        E+LE +++N+ QWS+ R   KKV  + +VDG+           N LN         +K++ L      + +   + +P L                    
Subjt:  EILERISTNSCQWSDVRGTNKKVKSVLEVDGVSTIRDDLAMIANSLNVGIEHACFRSKIECLKCFRSSIASACVICTPSLKNVTVISHQQPLALEPAALV

Query:  NQVAEEACVYCGEDHNYEFCPNNPASVFFVGN-----QRNNPYSIFYNPGWRNHPNFSWGGQGSNVQAQQKVNQSGF-----------------------
              +C  CG  H    C      V FV N     Q+NNPYS  YNPGWRNHPNFSW  QG+   + + ++  GF                       
Subjt:  NQVAEEACVYCGEDHNYEFCPNNPASVFFVGN-----QRNNPYSIFYNPGWRNHPNFSWGGQGSNVQAQQKVNQSGF-----------------------

Query:  ---------AK---------------AQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRIELETGQGAGGS-NNDAGAYGSV-----PDVE------
                 AK                QLAN + +R QG LPS TE      KE  KAVTLR   + GQ +G +   D   Y  V      +VE      
Subjt:  ---------AK---------------AQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRIELETGQGAGGS-NNDAGAYGSV-----PDVE------

Query:  PPYVPPPPYVPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLCEFETVSLTEECSVILKNGLPTKAKDPGSFTI
         P  P  PYVPP+PFPQR +    D QF+KFL++ +QLHINIP  +A+ Q+P Y KFLK+I++KK++L +FET++LTEECS I++N LP K +DPGSF+I
Subjt:  PPYVPPPPYVPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLCEFETVSLTEECSVILKNGLPTKAKDPGSFTI

Query:  PVSIGGKELGRALCDLGANINLMPLSVYRKLGIGEARPTTGTFQLANRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEAYKDVPIILGRPFLATGRTLI
        P +IG  +  RALCDLGA+++LMPLSV RKLG+ E +PTT + QLA+RS+ YP G +E+VL+KV KFI PVDFI+L+ E   ++PIILGRPFLAT   +I
Subjt:  PVSIGGKELGRALCDLGANINLMPLSVYRKLGIGEARPTTGTFQLANRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEAYKDVPIILGRPFLATGRTLI

Query:  DVQKGELTMRVYNEEVKFNVFKVMKYPDEMEDCSFVRILENTIVETAMQESTNKHLE----DHGEDSIEDLEV----CSLER---KNEKEVFRCEDVFES
        DV+ G LT++V  EEV+FN+F+  KYP   +    V +++ +  E    E+T + LE      G    ++LE+    C+LE    K +K     ED+   
Subjt:  DVQKGELTMRVYNEEVKFNVFKVMKYPDEMEDCSFVRILENTIVETAMQESTNKHLE----DHGEDSIEDLEV----CSLER---KNEKEVFRCEDVFES

Query:  LDLDQRKAPPIKPSLIEAPNLDLKPLSDHLKYVYLGEGDTLPIIVASDLMAEHEKALIKLLQQYRKAIGWTLADIQEISPSFCMHKITLEEGSFRLNRNY
             +  PP  PS ++AP L+LKPL  HL Y +LGE +TLP+IV+  L  E    LI++L+  +KAIGWT++D++ ISPS CMH+I +E+    +  N 
Subjt:  LDLDQRKAPPIKPSLIEAPNLDLKPLSDHLKYVYLGEGDTLPIIVASDLMAEHEKALIKLLQQYRKAIGWTLADIQEISPSFCMHKITLEEGSFRLNRNY

Query:  ARYHPAMKEVVKKKVIKWLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRKLNKATHKDHFPLPFIDQMLDRLDGQTYY
         R +P MKEVV+ +V+KWLDAGIIYPI+D+ W+SPVQ VPKKGG+TVV N++NELIPTRTVTGWRVC+DYRKLN  T KDHFPLPF+DQ+L+RL G  YY
Subjt:  ARYHPAMKEVVKKKVIKWLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRKLNKATHKDHFPLPFIDQMLDRLDGQTYY

Query:  CFLDGYSG--------------------------RMPFDLCYAPTTFQRCMLAIFSYMIES---------------------------------------
        CFLDGYSG                          RMPF LC AP TFQRCM+AIFS  +E                                        
Subjt:  CFLDGYSG--------------------------RMPFDLCYAPTTFQRCMLAIFSYMIES---------------------------------------

Query:  -----TVEEGIMLGHRISKNGLEVDRANIEVIERLEPPNSVKEIRSFLGHAGFYRR-----------------------------KAFETLKTVLISTPI
              V+EGI+LGH+IS  GLEVDRA IE+IE+L PP +VK +RSFLGH GFYRR                              AF  LK  L+S PI
Subjt:  -----TVEEGIMLGHRISKNGLEVDRANIEVIERLEPPNSVKEIRSFLGHAGFYRR-----------------------------KAFETLKTVLISTPI

Query:  LCAPNWNLPFEVMCAASDAA--------------------------------------------------------------------------------
        + AP+W+LPFE+MC ASD A                                                                                
Subjt:  LCAPNWNLPFEVMCAASDAA--------------------------------------------------------------------------------

Query:  ------EFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISDSFPDEQLFIVEVKMVKDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEP
              EFDLEI+DK+G ENV+ADHLSRL+   S   +  I++SFPDEQL  V V     +PWYAD+ N+LV G+ P D+ + QKKKF  D K ++W+EP
Subjt:  ------EFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISDSFPDEQLFIVEVKMVKDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEP

Query:  FMYKQCSDGIILR-----RMEAIL----------------------------------------VCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFM
         +YK C+DG+I R      ME IL                                        +CD CQR GN+  ++EMPLT ILEVELFD+WGIDFM
Subjt:  FMYKQCSDGIILR-----RMEAIL----------------------------------------VCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFM

Query:  GSFPPSNGNVFILLAVDYVFKWVEAIACHQNDAKTVARFLQSHIFVWFGTPRALVSDEGTHSVNNILTKLLAKYEIKHKIATPYHPQANGQAEI------
        G FP S  N +IL+AVDYV KWVEA A   ND++ V RF++ +IF  FG PRA++SDEG+H  N     LL KY + HK+A  YHPQ NGQ E+      
Subjt:  GSFPPSNGNVFILLAVDYVFKWVEAIACHQNDAKTVARFLQSHIFVWFGTPRALVSDEGTHSVNNILTKLLAKYEIKHKIATPYHPQANGQAEI------

Query:  ------------------------------------------------------RAI----------------RMLQLNELEEFRQFSYENSKMYKEKTK
                                                              RA                 R+LQL+ELEEFR  +YEN+++YKEKTK
Subjt:  ------------------------------------------------------RAI----------------RMLQLNELEEFRQFSYENSKMYKEKTK

Query:  LWHDKKIKSKDF----SKFLFTSPSRFIP
         WHDK ++ ++F       LF S  +  P
Subjt:  LWHDKKIKSKDF----SKFLFTSPSRFIP

XP_038976300.1 uncharacterized protein LOC120107204 [Phoenix dactylifera]0.0e+0040.43Show/hide
Query:  NDRARAIRAYTFPMFDELNPGIARPQIEAENFEMKPVMFQMLHTVGQFHGLSFEDPHLHLKSFIGVSDSFVIQGVPRDALRLTLFLYSLRDGAKAWLNSF
        N   R +  Y  P  +   P I RP + A NFE+KP + QM+    QF G   EDPH HL +F+ + D+  + GV  DA+RL LF +SL+D AKAWLNS 
Subjt:  NDRARAIRAYTFPMFDELNPGIARPQIEAENFEMKPVMFQMLHTVGQFHGLSFEDPHLHLKSFIGVSDSFVIQGVPRDALRLTLFLYSLRDGAKAWLNSF

Query:  APGSIRTLNELAEKFLSKYFPPNRNDKSRSEIVGFKQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLYGATQGMVDASTGGALLVKTFNEAH
        AP S  T N L++ FLSKYFPP +  K R++I  F Q + E+  EAWERFK+L RKCPHHGLP  + ++TFYNGL  + +  +DA+ GG L+ K+  EA+
Subjt:  APGSIRTLNELAEKFLSKYFPPNRNDKSRSEIVGFKQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLYGATQGMVDASTGGALLVKTFNEAH

Query:  EILERISTNSCQWSDVRGTNKKVKSVLEVDGVSTIRDDLAMIANSLNVGIEHACFRSKIECLKCFRSSIASACVICTPSLKNVTVISHQQPLALEPAALV
        E+LE +++N+ QWS+ R   KKV  + +VDG+           N LN         +K++ L    S + +   + +P L                    
Subjt:  EILERISTNSCQWSDVRGTNKKVKSVLEVDGVSTIRDDLAMIANSLNVGIEHACFRSKIECLKCFRSSIASACVICTPSLKNVTVISHQQPLALEPAALV

Query:  NQVAEEACVYCGEDHNYEFCPNNPASVFFVGN-----QRNNPYSIFYNPGWRNHPNFSWGGQGSNVQAQQKVNQSGF-----------------------
              +C  CG  H    C      V FV N     Q+NNPYS  YNPGWRNHPNFSW  QG+   + + ++  GF                       
Subjt:  NQVAEEACVYCGEDHNYEFCPNNPASVFFVGN-----QRNNPYSIFYNPGWRNHPNFSWGGQGSNVQAQQKVNQSGF-----------------------

Query:  ---------AK---------------AQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRIELETGQGAGGS-NNDAGAYGSV-----PDVE------
                 AK                QLAN + +R QG LPS TE      KE  KAVTLR   + GQ +G +   D   Y  V      +VE      
Subjt:  ---------AK---------------AQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRIELETGQGAGGS-NNDAGAYGSV-----PDVE------

Query:  PPYVPPPPYVPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLCEFETVSLTEECSVILKNGLPTKAKDPGSFTI
         P  P  PYVPP+PFPQR +    D QF+KFL++ +QLHINIP  +A+ Q+P Y KFLK+I++KK++L +FET++LTEECS I++N LP K +DPGSF+I
Subjt:  PPYVPPPPYVPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLCEFETVSLTEECSVILKNGLPTKAKDPGSFTI

Query:  PVSIGGKELGRALCDLGANINLMPLSVYRKLGIGEARPTTGTFQLANRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEAYKDVPIILGRPFLATGRTLI
        P +IG  +  RALCDLGA+++LMPLSV RKLG+ E +PTT + QLA+RS+ YP G +E+VL+KV KFI PVDFI+L+ E   ++PIILGRPFLAT   +I
Subjt:  PVSIGGKELGRALCDLGANINLMPLSVYRKLGIGEARPTTGTFQLANRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEAYKDVPIILGRPFLATGRTLI

Query:  DVQKGELTMRVYNEEVKFNVFKVMKYPDEMEDCSFVRILENTIVETAMQESTNKHLE----DHGEDSIEDLEV----CSLER---KNEKEVFRCEDVFES
        DV+ G LT++V  EEV+FN+F+  KYP   +    V +++ +  E    E+T + LE      G    ++LE+    C+LE    K +K     ED+   
Subjt:  DVQKGELTMRVYNEEVKFNVFKVMKYPDEMEDCSFVRILENTIVETAMQESTNKHLE----DHGEDSIEDLEV----CSLER---KNEKEVFRCEDVFES

Query:  LDLDQRKAPPIKPSLIEAPNLDLKPLSDHLKYVYLGEGDTLPIIVASDLMAEHEKALIKLLQQYRKAIGWTLADIQEISPSFCMHKITLEEGSFRLNRNY
             +  PP  PS ++AP L+LKPL  HL Y +LGE +TLP+IV+  L AE    LI++L+  +KAIGWT++D++ ISPS CMH+I +E+    +  N 
Subjt:  LDLDQRKAPPIKPSLIEAPNLDLKPLSDHLKYVYLGEGDTLPIIVASDLMAEHEKALIKLLQQYRKAIGWTLADIQEISPSFCMHKITLEEGSFRLNRNY

Query:  ARYHPAMKEVVKKKVIKWLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRKLNKATHKDHFPLPFIDQMLDRLDGQTYY
         R +P MKEVV+ +V+KWLDAGIIYPI+D+ W+SPVQ VPKKGG+TVV N++NELIPTRTVTGWRVC+DYRKLN  T KDHFPLPF+DQ+L+RL G  YY
Subjt:  ARYHPAMKEVVKKKVIKWLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRKLNKATHKDHFPLPFIDQMLDRLDGQTYY

Query:  CFLDGYSG--------------------------RMPFDLCYAPTTFQRCMLAIFSYMIES---------------------------------------
        CFLDGYSG                          RMPF LC AP TFQRCM+AIFS  +E                                        
Subjt:  CFLDGYSG--------------------------RMPFDLCYAPTTFQRCMLAIFSYMIES---------------------------------------

Query:  -----TVEEGIMLGHRISKNGLEVDRANIEVIERLEPPNSVKEIRSFLGHAGFYRR-----------------------------KAFETLKTVLISTPI
              V+EGI+LGH+IS  GLEVDRA IE+IE+L PP +VK +RSFLGH GFYRR                              AF  LK  L+S PI
Subjt:  -----TVEEGIMLGHRISKNGLEVDRANIEVIERLEPPNSVKEIRSFLGHAGFYRR-----------------------------KAFETLKTVLISTPI

Query:  LCAPNWNLPFEVMCAASDAA--------------------------------------------------------------------------------
        + AP+W+LPFE+MC ASD A                                                                                
Subjt:  LCAPNWNLPFEVMCAASDAA--------------------------------------------------------------------------------

Query:  ------EFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISDSFPDEQLFIVEVKMVKDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEP
              EFDLEI+DK+G ENV+ADHLSRL+   S   +  I++SFPDEQL  V V     +PWYAD+ N+LV G+ P D+ + QKKKF  D K ++W+EP
Subjt:  ------EFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISDSFPDEQLFIVEVKMVKDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEP

Query:  FMYKQCSDGIILR-----RMEAIL----------------------------------------VCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFM
         +YK C+DG+I R      ME IL                                        +CD CQR GN+  ++EMPLT ILEVELFD+WGIDFM
Subjt:  FMYKQCSDGIILR-----RMEAIL----------------------------------------VCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFM

Query:  GSFPPSNGNVFILLAVDYVFKWVEAIACHQNDAKTVARFLQSHIFVWFGTPRALVSDEGTHSVNNILTKLLAKYEIKHKIATPYHPQANGQAEI------
        G FP S  N +IL+AVDYV KWVEA A   ND++ V RF++ +IF  FG PRA++SDEG+H  N     LL KY + HK+A  YHPQ NGQ E+      
Subjt:  GSFPPSNGNVFILLAVDYVFKWVEAIACHQNDAKTVARFLQSHIFVWFGTPRALVSDEGTHSVNNILTKLLAKYEIKHKIATPYHPQANGQAEI------

Query:  ------------------------------------------------------RAI----------------RMLQLNELEEFRQFSYENSKMYKEKTK
                                                              RA                 R+LQL+ELEEFR  +YEN+++YKEKTK
Subjt:  ------------------------------------------------------RAI----------------RMLQLNELEEFRQFSYENSKMYKEKTK

Query:  LWHDKKIKSKDF----SKFLFTSPSRFIP
         WHDK ++ ++F       LF S  +  P
Subjt:  LWHDKKIKSKDF----SKFLFTSPSRFIP

XP_038976409.1 uncharacterized protein LOC113461320 [Phoenix dactylifera]0.0e+0040.2Show/hide
Query:  NDRARAIRAYTFPMFDELNPGIARPQIEAENFEMKPVMFQMLHTVGQFHGLSFEDPHLHLKSFIGVSDSFVIQGVPRDALRLTLFLYSLRDGAKAWLNSF
        N   R +  Y  P  +   P I RP + A NFE+KP + QM+    QF G   EDPH HL +F+ + D+  + GV  DA+RL LF +SL+D AKAWLNS 
Subjt:  NDRARAIRAYTFPMFDELNPGIARPQIEAENFEMKPVMFQMLHTVGQFHGLSFEDPHLHLKSFIGVSDSFVIQGVPRDALRLTLFLYSLRDGAKAWLNSF

Query:  APGSIRTLNELAEKFLSKYFPPNRNDKSRSEIVGFKQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLYGATQGMVDASTGGALLVKTFNEAH
        AP S  T N L++ FLSKYFPP +  K R++I  F Q + E+  EAWERFK+L RKCPHHGLP  + ++TFYNGL  + +  +DA+ GG L+ K+  EA+
Subjt:  APGSIRTLNELAEKFLSKYFPPNRNDKSRSEIVGFKQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLYGATQGMVDASTGGALLVKTFNEAH

Query:  EILERISTNSCQWSDVRGTNKKVKSVLEVDGVSTIRDDLAMIANSLNVGIEHACFRSKIECLKCFRSSIASACVICTPSLKNVTVISHQQPLALEPAALV
        E+LE +++N+ QWS+ R   KKV  + +VDG+           N LN         +K++ L      + +   + +P L                    
Subjt:  EILERISTNSCQWSDVRGTNKKVKSVLEVDGVSTIRDDLAMIANSLNVGIEHACFRSKIECLKCFRSSIASACVICTPSLKNVTVISHQQPLALEPAALV

Query:  NQVAEEACVYCGEDHNYEFCPNNPASVFFVGN-----QRNNPYSIFYNPGWRNHPNFSWGGQGSNVQAQQKVNQSGF-----------------------
              +C  CG  H      ++   V FV N     Q+NNPYS  YNPGWRNHPNFSW  QG+   + + ++  GF                       
Subjt:  NQVAEEACVYCGEDHNYEFCPNNPASVFFVGN-----QRNNPYSIFYNPGWRNHPNFSWGGQGSNVQAQQKVNQSGF-----------------------

Query:  ---------AK---------------AQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRIELETGQGAGGS-NNDAGAYGSV-----PDVE------
                 AK                QLAN + +R QG LPS TE      KE  KAVTLR   + GQ +  +   D   Y  V      +VE      
Subjt:  ---------AK---------------AQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRIELETGQGAGGS-NNDAGAYGSV-----PDVE------

Query:  PPYVPPPPYVPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLCEFETVSLTEECSVILKNGLPTKAKDPGSFTI
         P  P  PYVPP+PFPQR +    D QF+KFL++ +QLHINIP  +A+ Q+P Y KFLK+I++KK++L +FET++LTEECS I++N LP K +DPGSF+I
Subjt:  PPYVPPPPYVPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLCEFETVSLTEECSVILKNGLPTKAKDPGSFTI

Query:  PVSIGGKELGRALCDLGANINLMPLSVYRKLGIGEARPTTGTFQLANRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEAYKDVPIILGRPFLATGRTLI
        P +IG  +  RALCDLGA+++LMPLSV RKLG+ E +PTT + QLA+RS+ YP G +E+VL+KV KFI PVDFI+L+ E   ++PIILGRPFLAT   +I
Subjt:  PVSIGGKELGRALCDLGANINLMPLSVYRKLGIGEARPTTGTFQLANRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEAYKDVPIILGRPFLATGRTLI

Query:  DVQKGELTMRVYNEEVKFNVFKVMKYPDEMEDCSFVRILENTIVETAMQESTNKHLE----DHGEDSIEDLEV----CSLER---KNEKEVFRCEDVFES
        DV+ G LT++V  EEV+FN+F+  KYP   +    V +++ +  E    E+T + LE      G    ++LE+    C+LE    K +K     ED+   
Subjt:  DVQKGELTMRVYNEEVKFNVFKVMKYPDEMEDCSFVRILENTIVETAMQESTNKHLE----DHGEDSIEDLEV----CSLER---KNEKEVFRCEDVFES

Query:  LDLDQRKAPPIKPSLIEAPNLDLKPLSDHLKYVYLGEGDTLPIIVASDLMAEHEKALIKLLQQYRKAIGWTLADIQEISPSFCMHKITLEEGSFRLNRNY
             +  PP  PS ++AP L+LKPL  HL Y +LGE +TLP+IV+  L  E    LI++L+  +KAIGWT++D++ ISPS CMH+I +E+    +  N 
Subjt:  LDLDQRKAPPIKPSLIEAPNLDLKPLSDHLKYVYLGEGDTLPIIVASDLMAEHEKALIKLLQQYRKAIGWTLADIQEISPSFCMHKITLEEGSFRLNRNY

Query:  ARYHPAMKEVVKKKVIKWLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRKLNKATHKDHFPLPFIDQMLDRLDGQTYY
         R +P MKEVV+ +V+KWLDAGIIYPI+D+ W+SPVQ VPKKGG+TVV N++NELIPTRTVTGWRVC+DYRKLN  T KDHFPLPF+DQ+L+RL G  YY
Subjt:  ARYHPAMKEVVKKKVIKWLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRKLNKATHKDHFPLPFIDQMLDRLDGQTYY

Query:  CFLDGYSG--------------------------RMPFDLCYAPTTFQRCMLAIFSYMIES---------------------------------------
        CFLDGYSG                          RMPF LC AP TFQRCM+AIFS  +E                                        
Subjt:  CFLDGYSG--------------------------RMPFDLCYAPTTFQRCMLAIFSYMIES---------------------------------------

Query:  -----TVEEGIMLGHRISKNGLEVDRANIEVIERLEPPNSVKEIRSFLGHAGFYRR-----------------------------KAFETLKTVLISTPI
              V+EGI+LGH+IS  GLEVDRA IE+IE+L PP +VK +RSFLGH GFYRR                              AF  LK  L+S PI
Subjt:  -----TVEEGIMLGHRISKNGLEVDRANIEVIERLEPPNSVKEIRSFLGHAGFYRR-----------------------------KAFETLKTVLISTPI

Query:  LCAPNWNLPFEVMCAASDAA--------------------------------------------------------------------------------
        + AP+W+LPFE+MC ASD A                                                                                
Subjt:  LCAPNWNLPFEVMCAASDAA--------------------------------------------------------------------------------

Query:  ------EFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISDSFPDEQLFIVEVKMVKDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEP
              EFDLEI+DK+G ENV+ADHLSRL+   S   +  I++SFPDEQL  V V     +PWYAD+ N+LV G+ P D+ + QKKKF  D K ++W+EP
Subjt:  ------EFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISDSFPDEQLFIVEVKMVKDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEP

Query:  FMYKQCSDGIILR-----RMEAIL----------------------------------------VCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFM
         +YK C+DG+I R      ME IL                                        +CD CQR GN+  ++EMPLT ILEVELFD+WGIDFM
Subjt:  FMYKQCSDGIILR-----RMEAIL----------------------------------------VCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFM

Query:  GSFPPSNGNVFILLAVDYVFKWVEAIACHQNDAKTVARFLQSHIFVWFGTPRALVSDEGTHSVNNILTKLLAKYEIKHKIATPYHPQANGQAEI------
        G FP S  N +IL+AVDYV KWVEA A   ND++ V RF++ +IF  FG PRA++SDEG+H  N     LL KY + HK+A  YHPQ NGQ E+      
Subjt:  GSFPPSNGNVFILLAVDYVFKWVEAIACHQNDAKTVARFLQSHIFVWFGTPRALVSDEGTHSVNNILTKLLAKYEIKHKIATPYHPQANGQAEI------

Query:  ------------------------------------------------------RAI----------------RMLQLNELEEFRQFSYENSKMYKEKTK
                                                              RA                 R+LQL+ELEEFR  +YEN+++YKEKTK
Subjt:  ------------------------------------------------------RAI----------------RMLQLNELEEFRQFSYENSKMYKEKTK

Query:  LWHDKKIKSKDF----SKFLFTSPSRFIP
         WHDK ++ ++F       LF S  +  P
Subjt:  LWHDKKIKSKDF----SKFLFTSPSRFIP

TrEMBL top hitse value%identityAlignment
A0A2G9FWY3 Reverse transcriptase1.6e-29339.22Show/hide
Query:  MLHTVGQFHGLSFEDPHLHLKSFIGVSDSFVIQGVPRDALRLTLFLYSLRDGAKAWLNSFAPGSIRTLNELAEKFLSKYFPPNRNDKSRSEIVGFKQLED
        M+    QF GLS E+P+ H+ +F+ + D+   +GV +DALRL LF +SL   A  W  S    SI T                                 
Subjt:  MLHTVGQFHGLSFEDPHLHLKSFIGVSDSFVIQGVPRDALRLTLFLYSLRDGAKAWLNSFAPGSIRTLNELAEKFLSKYFPPNRNDKSRSEIVGFKQLED

Query:  ETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLYGATQGMVDASTGGALLVKTFNEAHEILERISTNSCQWSDVRGTNKKVKSVLEVDGVSTIRDDLA
        ET  EAW RF+++LR CP+H +P  IQ+ TFY+GL    +  +D   G + L  T  E H +L  +  N  +    R T  K   V+EVD V+ +     
Subjt:  ETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLYGATQGMVDASTGGALLVKTFNEAHEILERISTNSCQWSDVRGTNKKVKSVLEVDGVSTIRDDLA

Query:  MIANSLNVGIEHACFRSKIECLKCFRSSIASACVICTPSLKNVTVISHQQPLALEPAALVNQVAE--EACVYCGEDHNYEFCPNNPASVFFVGNQR---N
                        +KI+ L                S+KN                 VNQV      C  CGE H  + CP++  S+ FV N R   N
Subjt:  MIANSLNVGIEHACFRSKIECLKCFRSSIASACVICTPSLKNVTVISHQQPLALEPAALVNQVAE--EACVYCGEDHNYEFCPNNPASVFFVGNQR---N

Query:  NPYSIFYNPGWRNHPNFSWG---GQGSNVQ----AQQKVNQSGFAK--------------------------AQLANELKARPQGKLPSDTE-HPRREGK
        NPYS  YNPGWR HPNFSW    GQGS  +     QQ+V Q    K                           QLAN + +RPQG LPS+TE +PR++GK
Subjt:  NPYSIFYNPGWRNHPNFSWG---GQGSNVQ----AQQKVNQSGFAK--------------------------AQLANELKARPQGKLPSDTE-HPRREGK

Query:  EQVKAVTLRIELETGQGAGGSNNDAGAYGSVPDVEPPYVPPPPYVPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKK
         Q +AVTLR   E  +                                P   +++    + + K+    L++LHINIP  EA+EQMP+Y KF+KDIL+KK
Subjt:  EQVKAVTLRIELETGQGAGGSNNDAGAYGSVPDVEPPYVPPPPYVPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKK

Query:  KRLCEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRALCDLGANINLMPLSVYRKLGIGEARPTTGTFQLANRSITYPEGKIEDVLVKVD
        + L ++ETV+LTEECS I++N LP K KDPGSFTIP +IG    GRALCDLG    L           GEA+PT+ T QLA+RS+TYP+G IED+LVKVD
Subjt:  KRLCEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRALCDLGANINLMPLSVYRKLGIGEARPTTGTFQLANRSITYPEGKIEDVLVKVD

Query:  KFIFPVDFIILDYEAYKDVPIILGRPFLATGRTLIDVQKGELTMRVYNEEVKFNVFKVMKYPDEMEDCSFVRILENTIVETAMQESTNKHLEDH-----G
        KFIFP DF++LD E   +VPIILGRPFLATGRTLIDVQ                  K MK+P+E ++C  V + +N +   ++ E     LE        
Subjt:  KFIFPVDFIILDYEAYKDVPIILGRPFLATGRTLIDVQKGELTMRVYNEEVKFNVFKVMKYPDEMEDCSFVRILENTIVETAMQESTNKHLEDH-----G

Query:  EDSIEDLEVCSLERKNEKEVFRCEDVFESLDLDQRKAPP--IKPSLIEAPNLDLKPLSDHLKYVYLGEGDTLPIIVASDLMAEHEKALIKLLQQYRKAIG
        E++ ED EV  ++  +  + F+   V ESL   +R AP   +KPS+ E P L+LKPL +HL Y YLGE DTLP+I++S L     + L+++L+ ++ AIG
Subjt:  EDSIEDLEVCSLERKNEKEVFRCEDVFESLDLDQRKAPP--IKPSLIEAPNLDLKPLSDHLKYVYLGEGDTLPIIVASDLMAEHEKALIKLLQQYRKAIG

Query:  WTLADIQEISPSFCMHKITLEEGSFRLNRNYARYHPAMKEVVKKKVIKWLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMD
        WT+ADI+ ISPSFCMHKI LE+       +  R +P MKEVVKK++IKWLDAGIIYPI+D++WVSPVQCVPKKGG+TVV N  NELIPTRTVTGWRVCMD
Subjt:  WTLADIQEISPSFCMHKITLEEGSFRLNRNYARYHPAMKEVVKKKVIKWLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMD

Query:  YRKLNKATHKDHFPLPFIDQMLDRLDGQTYYCFLDGYSG--------------------------RMPFDLCYAPTTFQRCMLAIFSYMIES--------
        YRKLNKAT KDHFPL FIDQMLDRL G+ +YCFLDGYSG                          RMPF LC AP TFQRCM+AIF+ M+E+        
Subjt:  YRKLNKATHKDHFPLPFIDQMLDRLDGQTYYCFLDGYSG--------------------------RMPFDLCYAPTTFQRCMLAIFSYMIES--------

Query:  ------------------------------------TVEEGIMLGHRISKNGLEVDRANIEVIERLEPPNSVKEIRSFLGHAGFYR--------------
                                             V+EGI+LGH++S  G+EVD+A +E IE+L PP SVK +RSFLGHAGFYR              
Subjt:  ------------------------------------TVEEGIMLGHRISKNGLEVDRANIEVIERLEPPNSVKEIRSFLGHAGFYR--------------

Query:  ---------------RKAFETLKTVLISTPILCAPNWNLPFEVMCAASDAA-------------------------------------------------
                       R AF  LK  LIS PI+  P+W+ PFE+MC ASD A                                                 
Subjt:  ---------------RKAFETLKTVLISTPILCAPNWNLPFEVMCAASDAA-------------------------------------------------

Query:  -------------------------------------EFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISDSFPDEQLFIVEVKMVKDVPWYADIAN
                                             EFDLEI+D+KG+EN IADHLSRL+  +   + + I+D+FPDEQL  +   +  DVPWYADI N
Subjt:  -------------------------------------EFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISDSFPDEQLFIVEVKMVKDVPWYADIAN

Query:  FLVKGVTPIDMDWRQKKKFKHDAKFFYWDEPFMYKQCSDGIILR-----RMEAILV----------------------------------------CDAC
        +L  G+ P D+  +QKKKF  D + ++WD+PF++KQ  D I+ R      M  IL                                         CD C
Subjt:  FLVKGVTPIDMDWRQKKKFKHDAKFFYWDEPFMYKQCSDGIILR-----RMEAILV----------------------------------------CDAC

Query:  QRRGNLGPRDEMPLTYILEVELFDVWGIDFMGSFPPSNGNVFILLAVDYVFKWVEAIACHQNDAKTVARFLQSHIFVWFGTPRALVSDEGTHSVNNILTK
        QR GN+  R EMPL  ILEVELFDVWGIDFMG F PS GN++IL+AVDYV KWVEA A   ND+K V  F++ +IF  FGTPRA++SD GTH  N     
Subjt:  QRRGNLGPRDEMPLTYILEVELFDVWGIDFMGSFPPSNGNVFILLAVDYVFKWVEAIACHQNDAKTVARFLQSHIFVWFGTPRALVSDEGTHSVNNILTK

Query:  LLAKYEIKHKIATPYHPQANGQAEI--RAI----------------------------------------------------------------------
        LL+KY +KHKI+TPYHPQ +GQ E+  R I                                                                      
Subjt:  LLAKYEIKHKIATPYHPQANGQAEI--RAI----------------------------------------------------------------------

Query:  ----RMLQLNELEEFRQFSYENSKMYKEKTKLWHDKKIKSKDFSK----FLFTSPSRFIP
            R+LQLNEL+EFR  +YEN+K+YKEK K WH+KKI  + F       LF S  +  P
Subjt:  ----RMLQLNELEEFRQFSYENSKMYKEKTKLWHDKKIKSKDFSK----FLFTSPSRFIP

A0A2G9HWF8 Reverse transcriptase0.0e+0040.92Show/hide
Query:  DPKIERTFRRRMREQRTNQNQIDNMSRLPQGLKDPRNPQQNRLLQQNQQLEQNEQQNNQAENPILVANDRARAIRAYTFPMFDELNPGIARPQIEAENFE
        DP+IERTFR R R             +L + ++     ++N+++      E    +N        +A  RAR  ++    +F EL  G+          +
Subjt:  DPKIERTFRRRMREQRTNQNQIDNMSRLPQGLKDPRNPQQNRLLQQNQQLEQNEQQNNQAENPILVANDRARAIRAYTFPMFDELNPGIARPQIEAENFE

Query:  MKPVMFQMLHTVGQFHGLSFEDPHLHLKSFIGVSDSFVIQGVPRDALRLTLFLYSLRDGAKAWLNSFAPGSIRTLNELAEKFLSKYFPPNRNDKSRSEIV
        ++  M QM+    QF GLS E+P+ H+ +F+ + D+   +GV +DALRL LF +SL   A  W  S    SI T  +L E+F+SK+F P +    R+EI+
Subjt:  MKPVMFQMLHTVGQFHGLSFEDPHLHLKSFIGVSDSFVIQGVPRDALRLTLFLYSLRDGAKAWLNSFAPGSIRTLNELAEKFLSKYFPPNRNDKSRSEIV

Query:  GFKQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLYGATQGMVDASTGGALLVKTFNEAHEILERISTNSCQWSDVRGTNKKVKSVLEVDGVS
         F+Q   ET  EAW RF+++LR CP+H +P  IQ+ TFY+GL    +  +D   G + L  T  E H +L  +  N  +    R T  K   V+EVD V+
Subjt:  GFKQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLYGATQGMVDASTGGALLVKTFNEAHEILERISTNSCQWSDVRGTNKKVKSVLEVDGVS

Query:  TIRDDLAMIANSLNVGIEHACFRSKIECLKCFRSSIASACVICTPSLKNVTVISHQQPLALEPAALVNQVAEEACVYCGEDHNYEFCPNNPASVFFVGNQ
         +                     +KI+ L                S+KN                            CGE H  + CP++  S+ FV N 
Subjt:  TIRDDLAMIANSLNVGIEHACFRSKIECLKCFRSSIASACVICTPSLKNVTVISHQQPLALEPAALVNQVAEEACVYCGEDHNYEFCPNNPASVFFVGNQ

Query:  R---NNPYSIFYNPGWRNHPNFSWG---GQGSNVQAQQKVNQSGFAKAQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLR----IELETGQGAGGSN
        R   NNPYS  YNPGWR HPNFSW    GQGS  + Q                     QG   +   +PR++GK Q +AVTLR    ++    +      
Subjt:  R---NNPYSIFYNPGWRNHPNFSWG---GQGSNVQAQQKVNQSGFAKAQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLR----IELETGQGAGGSN

Query:  NDAGAYGSVPDVEPPYVPPPPYVPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLCEFETVSLTEECSVILKNG
         +  +     +VE P     P     PFPQR + +    QF KFLE+ K+LHINIP  EA+EQMP+Y KF+KDIL+KK+RL ++ETV+LTEECS I++N 
Subjt:  NDAGAYGSVPDVEPPYVPPPPYVPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLCEFETVSLTEECSVILKNG

Query:  LPTKAKDPGSFTIPVSIGGKELGRALCDLGANINLMPLSVYRKLGIGEARPTTGTFQLANRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEAYKDVPII
        LP K KDPG              RALCDLGA+INLMP S+YR LG+ EA+PT+ T QLA+RS+TYP+G IED+LVKVDKFIFP DF++LD E   +VPII
Subjt:  LPTKAKDPGSFTIPVSIGGKELGRALCDLGANINLMPLSVYRKLGIGEARPTTGTFQLANRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEAYKDVPII

Query:  LGRPFLATGRTLIDVQKGELTMRVYNEEVKFNVFKVMKYPDEMEDCSFVRILENTIVETAMQESTNKHLEDH-----GEDSIEDLEVCSLERKNEKEVFR
        LGRPFLATGRTLIDVQKGELTMRV ++++ FNVFK MK+P+E ++C  V + +N   + ++ E     LE        ED+ ED EV  ++  +  + F+
Subjt:  LGRPFLATGRTLIDVQKGELTMRVYNEEVKFNVFKVMKYPDEMEDCSFVRILENTIVETAMQESTNKHLEDH-----GEDSIEDLEVCSLERKNEKEVFR

Query:  CEDVFESLDLDQRKAPP--IKPSLIEAPNLDLKPLSDHLKYVYLGEGDTLPIIVASDLMAEHEKALIKLLQQYRKAIGWTLADIQEISPSFCMHKITLEE
           V ESL   +R AP   +KPS+ E P L+LKPL  HL Y YLGE DTLP+I++S L     + L+++L+ ++ AIGWT+ADI+ ISPSFCMHKI LE+
Subjt:  CEDVFESLDLDQRKAPP--IKPSLIEAPNLDLKPLSDHLKYVYLGEGDTLPIIVASDLMAEHEKALIKLLQQYRKAIGWTLADIQEISPSFCMHKITLEE

Query:  GSFRLNRNYARYHPAMKEVVKKKVIKWLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRKLNKATHKDHFPLPFIDQML
               +  R +P MKEVVKK++IKWLDAGIIYPI+D +W+SPVQCVPKKGG+TVV N  NE IPT+TVTGWRVCMDYRKLNKAT KDHFPLPFIDQML
Subjt:  GSFRLNRNYARYHPAMKEVVKKKVIKWLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRKLNKATHKDHFPLPFIDQML

Query:  DRLDGQTYYCFLDGYSG--------------------------RMPFDLCYAPTTFQRCMLAIFSYMIES------------------------------
        DRL G+ +YCFLDGYSG                          R+PF LC AP TFQRCM+AIF+ M+E+                              
Subjt:  DRLDGQTYYCFLDGYSG--------------------------RMPFDLCYAPTTFQRCMLAIFSYMIES------------------------------

Query:  --------------TVEEGIMLGHRISKNGLEVDRANIEVIERLEPPNSVKEIRSFLGHAGFYRR-----------------------------KAFETL
                       V+EGI+LGH++S  G+EVD+A +E IE+L P  SVK +RSFLGHAGFYRR                              AF+ L
Subjt:  --------------TVEEGIMLGHRISKNGLEVDRANIEVIERLEPPNSVKEIRSFLGHAGFYRR-----------------------------KAFETL

Query:  KTVLISTPILCAPNWNLPFEVMCAASDAA-----------------------------------------------------------------------
        K  LIS PI+  P+W+ PFE+MC ASD A                                                                       
Subjt:  KTVLISTPILCAPNWNLPFEVMCAASDAA-----------------------------------------------------------------------

Query:  ---------------EFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISDSFPDEQLFIVEVKMVKDVPWYADIANFLVKGVTPIDMDWRQKKKFKHD
                       EFDLEI+D+KG EN IADHLSRL+  +   + + I+D+FPDEQL  +   +  DVPWYADI N+L  G+ P D+  +QKKKF  D
Subjt:  ---------------EFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISDSFPDEQLFIVEVKMVKDVPWYADIANFLVKGVTPIDMDWRQKKKFKHD

Query:  AKFFYWDEPFMYKQCSDGIILRRMEAILV---------------------------------------------CDACQRRGNLGPRDEMPLTYILEVEL
         + ++WD+PF++KQ  D I+ R +  I +                                             CD CQR GN+  R EMPL  ILEVEL
Subjt:  AKFFYWDEPFMYKQCSDGIILRRMEAILV---------------------------------------------CDACQRRGNLGPRDEMPLTYILEVEL

Query:  FDVWGIDFMGSFPPSNGNVFILLAVDYVFKWVEAIACHQNDAKTVARFLQSHIFVWFGTPRALVSDEGTHSVNNILTKLLAKYEIKHKIATPYHPQANGQ
        FDVWGIDFMG F PS GN++IL+AVDY+ KWVEA+A   ND+K V  F++ +IF  FGTPRA++SD GTH  N     LL+KY +KHKI+TPYHPQ +GQ
Subjt:  FDVWGIDFMGSFPPSNGNVFILLAVDYVFKWVEAIACHQNDAKTVARFLQSHIFVWFGTPRALVSDEGTHSVNNILTKLLAKYEIKHKIATPYHPQANGQ

Query:  AEI
         E+
Subjt:  AEI

A0A2G9IA86 DNA-directed DNA polymerase7.6e-29641.54Show/hide
Query:  FKQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLYGATQGMVDASTGGALLVKTFNEAHEILERISTNSCQWSDVRGTNKKVKSVLEVDGVST
        F+Q   ET  EAW RF+++LR CP+H +P  IQ+ TFY+GL    +  +D   G + L  T  E H +L  +  N  +    R T  K   V+EVD V+ 
Subjt:  FKQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLYGATQGMVDASTGGALLVKTFNEAHEILERISTNSCQWSDVRGTNKKVKSVLEVDGVST

Query:  IRDDLAMIANSLNVGIEHACFRSKIECLKCFRSSIASACVICTPSLKNVTVISHQQPLALEPAALVNQVAEE--ACVYCGEDHNYEFCPNNPASVFFVGN
        +                     +KI+ L                S+KN                 VNQV      C  CGE H    CPN+  S+ FV N
Subjt:  IRDDLAMIANSLNVGIEHACFRSKIECLKCFRSSIASACVICTPSLKNVTVISHQQPLALEPAALVNQVAEE--ACVYCGEDHNYEFCPNNPASVFFVGN

Query:  QR---NNPYSIFYNPGWRNHPNFSWGG---QGS----NVQAQQKVNQSGFAK--------------------------AQLANELKARPQGKLPSDTE-H
         R   NNPYS  YNPGWR HPNFSW     QGS        QQ+V Q    K                           QLAN + +RPQG L S+TE +
Subjt:  QR---NNPYSIFYNPGWRNHPNFSWGG---QGS----NVQAQQKVNQSGFAK--------------------------AQLANELKARPQGKLPSDTE-H

Query:  PRREGKEQVKAVTLRIELETGQGAGGSNNDAGAYGSVPDVEPPYVPPPPYVPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLK
        PR++GK Q +AVTLR   E  +         G    + + E   V       PL   Q+Q+ K    QF KFLE+ K+LHIN P  EA+EQMP+Y KF+K
Subjt:  PRREGKEQVKAVTLRIELETGQGAGGSNNDAGAYGSVPDVEPPYVPPPPYVPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLK

Query:  DILTKKKRLCEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRALCDLGANINLMPLSVYRKLGIGEARPTTGTFQLANRSITYPEGKIED
         IL+KK+RL ++ETV+LTEECS I++N LP K KDPGSFTIP +IG    GRALCDLGA+INLMP S+YR LG+GEA+PT+ T QLANRS+TYP+G IED
Subjt:  DILTKKKRLCEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRALCDLGANINLMPLSVYRKLGIGEARPTTGTFQLANRSITYPEGKIED

Query:  VLVKVDKFIFPVDFIILDYEAYKDVPIILGRPFLATGRTLIDVQKGELTMRVYNEEVKFNVFKVMKYPDEMEDCSFVRILENTIVETAMQESTNKHLED-
        +LVKVDKFIFP DF++LD E   +VPIILGRPFLATGRTLIDVQKG+LTMRV ++++ FNVFK MK+P+E ++C  V + +N     ++ +   + L D 
Subjt:  VLVKVDKFIFPVDFIILDYEAYKDVPIILGRPFLATGRTLIDVQKGELTMRVYNEEVKFNVFKVMKYPDEMEDCSFVRILENTIVETAMQESTNKHLED-

Query:  HGEDSIEDLEVCSLERKNEKEVFRCEDVFESLDLDQRKAPP--IKPSLIEAPNLDLKPLSDHLKYVYLGEGDTLPIIVASDLMAEHEKALIKLLQQYRKA
          ED+ ED EV  ++  +  + F+   V ESL   +R AP   +KPS+ E+P L+LKPL  HL Y YLGE DTLP+I++S L     + L+++ + ++ A
Subjt:  HGEDSIEDLEVCSLERKNEKEVFRCEDVFESLDLDQRKAPP--IKPSLIEAPNLDLKPLSDHLKYVYLGEGDTLPIIVASDLMAEHEKALIKLLQQYRKA

Query:  IGWTLADIQEISPSFCMHKITLEEGSFRLNRNYARYHPAMKEVVKKKVIKWLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVC
        IGWT+ADI+ IS SFCMHKI LE+       +  R +P MKEVVKK++IKW+DAGIIYPI+D++WVSPVQCVPKKGG+TVV N  NELIPTRTVTGWRVC
Subjt:  IGWTLADIQEISPSFCMHKITLEEGSFRLNRNYARYHPAMKEVVKKKVIKWLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVC

Query:  MDYRKLNKATHKDHFPLPFIDQMLDRLDGQTYYCFLDGYSG----------------RMPFDLCYAPTTFQRCMLAIFSYMIES----------------
        MDYRKLNKAT KDHFPLPFIDQMLDRL G+ +YCFLDGY                  R+PF LC AP TFQRCM+AIF+ M+E+                
Subjt:  MDYRKLNKATHKDHFPLPFIDQMLDRLDGQTYYCFLDGYSG----------------RMPFDLCYAPTTFQRCMLAIFSYMIES----------------

Query:  ----------------------------TVEEGIMLGHRISKNGLEVDRANIEVIERLEPPNSVKEIRSFLGHAGFYRR---------------------
                                     V+EGI+L H++S  G+EV++A +E IE+L PP SVK IRSFLGHAGFYRR                     
Subjt:  ----------------------------TVEEGIMLGHRISKNGLEVDRANIEVIERLEPPNSVKEIRSFLGHAGFYRR---------------------

Query:  --------KAFETLKTVLISTPILCAPNWNLPFEVMCAASDAA------------------------------------------EFDLEIKDKKGSENV
                 AF  LK  LIS PI+  P+W+ PFE+MC ASD A                                           FD       G++ +
Subjt:  --------KAFETLKTVLISTPILCAPNWNLPFEVMCAASDAA------------------------------------------EFDLEIKDKKGSENV

Query:  I-ADHLS------------RLDPSSSLLKQSAISDSFPDEQLFIVEVKMVKDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEPFMYKQCSD
        +  DH +            RL+  +   + + I+D+FPDEQL  +   +  DVPWY+DI N+L  G+ P D+  +QKKKF  D + ++WD+ F++KQ  D
Subjt:  I-ADHLS------------RLDPSSSLLKQSAISDSFPDEQLFIVEVKMVKDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEPFMYKQCSD

Query:  GIILR-----RMEAILV----------------------------------------CDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGSFPPSNG
         I+ R      M  IL                                         CD CQR GN+  R EMPL  IL+VELFDVWGIDF+G F PS G
Subjt:  GIILR-----RMEAILV----------------------------------------CDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGSFPPSNG

Query:  NVFILLAVDYVFKWVEAIACHQNDAKTVARFLQSHIFVWFGTPRALVSDEGTHSVNNILTKLLAKYEIKHKIATPYHPQANGQAEI--------------
        N++IL+AVDYV KWVEA+A   ND+K V  F++ +IF  FGTPRA++SD G H  N      L+KY +KHKI TPYHPQ +GQ E+              
Subjt:  NVFILLAVDYVFKWVEAIACHQNDAKTVARFLQSHIFVWFGTPRALVSDEGTHSVNNILTKLLAKYEIKHKIATPYHPQANGQAEI--------------

Query:  -----------------RAI---------------------------------------------RMLQLNELEEFRQFSYENSKMYKEKTKLWHDKKIK
                         R +                                             R+LQLNEL+EFR  +YEN+K+YKEKTK WHDKKI 
Subjt:  -----------------RAI---------------------------------------------RMLQLNELEEFRQFSYENSKMYKEKTKLWHDKKIK

Query:  SKDFSK----FLFTSPSRFIPLLFVFLLSFVVRCTK
         + F       LF S  +  P    F  S   R T+
Subjt:  SKDFSK----FLFTSPSRFIPLLFVFLLSFVVRCTK

A0A6P6XAQ1 Reverse transcriptase2.1e-30639.17Show/hide
Query:  RAIRAYTFPMFDELNPGIARPQIEAENFEMKPVMFQMLHTVGQFHGLSFEDPHLHLKSFIGVSDSFVIQGVPRDALRLTLFLYSLRDGAKAWLNSFAPGS
        R +R +  P        I RP + A NFE+KP + QM+    Q+ G + EDP+ HL +F+ + D+    GV  DA++L LF +SLRD AK WL S  P +
Subjt:  RAIRAYTFPMFDELNPGIARPQIEAENFEMKPVMFQMLHTVGQFHGLSFEDPHLHLKSFIGVSDSFVIQGVPRDALRLTLFLYSLRDGAKAWLNSFAPGS

Query:  IRTLNELAEKFLSKYFPPNRNDKSRSEIVGFKQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLYGATQGMVDASTGGALLVKTFNEAHEILE
          T +ELA+ FL+K+FPP +  K R +I  F Q E ET  EAWER++EL R+CPHHGLP  + ++TFYNGL   T+  VDA+ GGAL+ KT  EA +++E
Subjt:  IRTLNELAEKFLSKYFPPNRNDKSRSEIVGFKQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLYGATQGMVDASTGGALLVKTFNEAHEILE

Query:  RISTNSCQWSDVRGTNKKVKSVLEVDGVSTIRDDLAMIANSLNVGIEHACFRSKIECLKCFRSSIASACVICTPSLKNVTVISHQQPLALEPAALVNQVA
         ++ N+ QW++ RG +++   +LEVD ++ +   +  +   LN                                 + V   S+Q             V 
Subjt:  RISTNSCQWSDVRGTNKKVKSVLEVDGVSTIRDDLAMIANSLNVGIEHACFRSKIECLKCFRSSIASACVICTPSLKNVTVISHQQPLALEPAALVNQVA

Query:  EEACVYCGEDHNYEFCPNNPASVFFVGN----QRNNPYSIFYNPGWRNHPNFSWGGQGSNVQAQQKVNQSGFAKAQLANELKARPQ--------------
          +C  CG DH+ +F  ++   V ++ N     +NNPYS  YNPGWRNHPNF W  QG+    Q+ VN  GF + Q  +E K   +              
Subjt:  EEACVYCGEDHNYEFCPNNPASVFFVGN----QRNNPYSIFYNPGWRNHPNFSWGGQGSNVQAQQKVNQSGFAKAQLANELKARPQ--------------

Query:  GKLPSDT--EHPRREGK-EQVKAVTLRIELETGQGAGGSNN-DAGAYGSVPDVEP-------------PYVPPPPYVPPLPFPQRQRPK----NQDGQFK
         KL S T     R EG+ +Q+  +   +E++ GQ A   NN + G   S  +V P               V PP       F +R+  K     +  + +
Subjt:  GKLPSDT--EHPRREGK-EQVKAVTLRIELETGQGAGGSNN-DAGAYGSVPDVEP-------------PYVPPPPYVPPLPFPQRQRPK----NQDGQFK

Query:  KFLEILKQLHINIPLVEAI-EQMPNYAKFLKDILTKKKRLCEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRALCDLGANINLMPLSVY
        K  E +++  + +     I   +P+YAKFLK+I+TKK++L + ET++LTEECS I++N LP K KDPGSFT+P +IG  E  +ALCDLGA+++L+PL+V 
Subjt:  KFLEILKQLHINIPLVEAI-EQMPNYAKFLKDILTKKKRLCEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRALCDLGANINLMPLSVY

Query:  RKLGIGEARPTTGTFQLANRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEAYKDVPIILGRPFLATGRTLIDVQKGELTMRVYNEEVKFNVFKVMKYPD
        R+LG+ E + T  + QLA+RSI +P G +E+VL+KV KFI PVDF++LD E   +VPIILGRPFLAT  T+IDV++G+   ++  EEV+F++ KV KYP 
Subjt:  RKLGIGEARPTTGTFQLANRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEAYKDVPIILGRPFLATGRTLIDVQKGELTMRVYNEEVKFNVFKVMKYPD

Query:  EMEDCSFVRILENTIVETAMQESTNKHLE------DHGEDSIEDLEVCSLERKNEKEVFRCEDVFESLDLDQRKAPPIKPSLIEAPNLDLKPLSDHLKYV
          +    V I +   +E +     N  LE         E+ IE++     E    +  ++  + +E+L L +   P   PS  +AP L+LKPL  HLKY 
Subjt:  EMEDCSFVRILENTIVETAMQESTNKHLE------DHGEDSIEDLEVCSLERKNEKEVFRCEDVFESLDLDQRKAPPIKPSLIEAPNLDLKPLSDHLKYV

Query:  YLGEGDTLPIIVASDLMAEHEKALIKLLQQYRKAIGWTLADIQEISPSFCMHKITLEEGSFRLNRNYARYHPAMKEVVKKKVIKWLDAGIIYPIADNNWV
        +LGE +TLP+IV S L  E    L+++L+++ KAIGWT++DI+ ISP+ CMH+I LEE S  +     R +P MKEVV+ +++KWLDAGII+PI+D+ W+
Subjt:  YLGEGDTLPIIVASDLMAEHEKALIKLLQQYRKAIGWTLADIQEISPSFCMHKITLEEGSFRLNRNYARYHPAMKEVVKKKVIKWLDAGIIYPIADNNWV

Query:  SPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRKLNKATHKDHFPLPFIDQMLDRLDGQTYYCFLDGYSG-------------------------
        SP+  VPKKGG+T+V  K++E+IP+R V GWRVC+DYRKLN AT KDHFPLPF+DQM++RL G  +YCFLDG+SG                         
Subjt:  SPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRKLNKATHKDHFPLPFIDQMLDRLDGQTYYCFLDGYSG-------------------------

Query:  -RMPFDLCYAPTTFQRCMLAIFSYMIESTVE--------------------------------------------EGIMLGHRISKNGLEVDRANIEVIE
         RMPF LC AP TFQRCM+AIFS   E  +E                                            EGI+LGH+IS  G+EVD+A IEVIE
Subjt:  -RMPFDLCYAPTTFQRCMLAIFSYMIESTVE--------------------------------------------EGIMLGHRISKNGLEVDRANIEVIE

Query:  RLEPPNSVKEIRSFLGHAGFYRR-----------------------------KAFETLKTVLISTPILCAPNWNLPFEVMCAASDAA-------------
        R+ PP +VK IRSFLGH GFYRR                              AF  LK  L+S PI+ +P+W+LPFE+MC ASD A             
Subjt:  RLEPPNSVKEIRSFLGHAGFYRR-----------------------------KAFETLKTVLISTPILCAPNWNLPFEVMCAASDAA-------------

Query:  -------------------------------------------------------------------------EFDLEIKDKKGSENVIADHLSRLDPSS
                                                                                 EFDLEIKDKKGSEN++ADHLSRL+ + 
Subjt:  -------------------------------------------------------------------------EFDLEIKDKKGSENVIADHLSRLDPSS

Query:  SLLKQSAISDSFPDEQLFIVEVKMVKDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEPFMYKQCSDGIILRRMEAILVCDACQRRGNLGPR
              +I + FPDE L  +        PWYAD+ NF+  GV P D++  Q+KKF +DAK ++W+EPF YK C+DGII R +     CD CQR GN+  R
Subjt:  SLLKQSAISDSFPDEQLFIVEVKMVKDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEPFMYKQCSDGIILRRMEAILVCDACQRRGNLGPR

Query:  DEMPLTYILEVELFDVWGIDFMGSFPPSNGNVFILLAVDYVFKWVEAIACHQNDAKTVARFLQSHIFVWFGTPRALVSDEGTHSVNNILTKLLAKYEIKH
        +EMPLT  LEVELFDVWGIDFMG FP S  N +ILLAVDYV KWVEAI    N+AK V +FL+ +IF  FG P+A++SDEG H  N  +  LL KY  +H
Subjt:  DEMPLTYILEVELFDVWGIDFMGSFPPSNGNVFILLAVDYVFKWVEAIACHQNDAKTVARFLQSHIFVWFGTPRALVSDEGTHSVNNILTKLLAKYEIKH

Query:  KIATPYHPQANGQAE------------------------------------------------------------------IRAI----------RMLQL
        K + PYHPQANGQAE                                                                  ++AI          RML+L
Subjt:  KIATPYHPQANGQAE------------------------------------------------------------------IRAI----------RMLQL

Query:  NELEEFRQFSYENSKMYKEKTKLWHDKKIKSKDF
        +ELEE R  SYEN K+YKEK K WHDK I  K F
Subjt:  NELEEFRQFSYENSKMYKEKTKLWHDKKIKSKDF

A0A6P8CBX2 Reverse transcriptase7.1e-30238.51Show/hide
Query:  QFHGLSFEDPHLHLKSFIGVSDSFVIQGVPRDALRLTLFLYSLRDGAKAWLNSFAPGSIRTLNELAEKFLSKYFPPNRNDKSRSEIVGFKQLEDETFSEA
        QF G   E P  H+  F+   ++  +  V  D +RL LF +SLRD A+AW NS    SI T  +L+ KFL ++FPP R  + R+EI  F +   E+  EA
Subjt:  QFHGLSFEDPHLHLKSFIGVSDSFVIQGVPRDALRLTLFLYSLRDGAKAWLNSFAPGSIRTLNELAEKFLSKYFPPNRNDKSRSEIVGFKQLEDETFSEA

Query:  WERFKELLRKCPHHGLPHCIQMETFYNGLYGATQGMVDASTGGALLVKTFNEAHEILERISTNSCQWSDVRGTNKKVKSVLEVDGVSTIRDDLAMIANSL
        WERFKE +RKCPHHGLP  + +E FY  L    + +VDA+ GGAL+ K ++EA  ++E +++++  W + R +  +V SV ++D ++ +   ++ +   +
Subjt:  WERFKELLRKCPHHGLPHCIQMETFYNGLYGATQGMVDASTGGALLVKTFNEAHEILERISTNSCQWSDVRGTNKKVKSVLEVDGVSTIRDDLAMIANSL

Query:  NVGIEHACFRSKIECLKCFRSSIASACVICTPSLKNVTVISHQQPLALEPAALVNQVAEEACVYCGEDHNYEFCPNNPASVFFVGN-QRNN--PYSIFYN
                  SK+     F ++  + C +C+     +  +S        P+A  N          GE  N            FV N QR+N  PYS  YN
Subjt:  NVGIEHACFRSKIECLKCFRSSIASACVICTPSLKNVTVISHQQPLALEPAALVNQVAEEACVYCGEDHNYEFCPNNPASVFFVGN-QRNN--PYSIFYN

Query:  PGWRNHPNFSW---------------GGQGSNVQAQQK-------------------VNQSGFAK------AQLANELKARPQGKLPSDTEHPRREGKEQ
        PGWRNHPNFSW                G   N   QQ                     NQ    +      +Q++ +L  RP G LPS+TE    E  + 
Subjt:  PGWRNHPNFSW---------------GGQGSNVQAQQK-------------------VNQSGFAK------AQLANELKARPQGKLPSDTEHPRREGKEQ

Query:  VKAVTLRIELETGQGAGGSNNDAGAYGSVPD-------VEPP---YVPPPPYVPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKF
        V A+ LR    +G+     N  A      P+       VE P    +   PYVPP+PFP R + +  D QF KFL++ K+L INIP  EA++QMP+YA+F
Subjt:  VKAVTLRIELETGQGAGGSNNDAGAYGSVPD-------VEPP---YVPPPPYVPPLPFPQRQRPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKF

Query:  LKDILTKKKRLCEFETVSLTEECSVILKN---GLPTKAKDPGSFTIPVSIGGKELGRALCDLGANINLMPLSVYRKLGIGEARPTTGTFQLANRSITYPE
        +KD+LTKK++    E V LT ECS+IL+     LP K +D GSFT+P +IG       L D GA+INLMPLS++RKLG+GE + T  T QLA+RSI YP+
Subjt:  LKDILTKKKRLCEFETVSLTEECSVILKN---GLPTKAKDPGSFTIPVSIGGKELGRALCDLGANINLMPLSVYRKLGIGEARPTTGTFQLANRSITYPE

Query:  GKIEDVLVKVDKFIFPVDFIILDYEAYKDVPIILGRPFLATGRTLIDVQKGELTMRVYNEEVKFNVFKVMKYPDEMEDCSFVRILENTIVETAMQESTNK
        G +E+VLVKVDKFIFPVDFI+L+ E  ++VP+ILGRPFLATG+ LIDV++G+LT+RV NE++ FNV+  +K  D+ + C  + I++  I E+  +++   
Subjt:  GKIEDVLVKVDKFIFPVDFIILDYEAYKDVPIILGRPFLATGRTLIDVQKGELTMRVYNEEVKFNVFKVMKYPDEMEDCSFVRILENTIVETAMQESTNK

Query:  HLEDHGEDSIEDLEVCSLERKNEKEVFRCEDVFESLDLDQRKAPPIKP--SLIEAPNLDLKPLSDHLKYVYLGEGDTLPIIVASDLMAEHEKALIKLLQQ
           D  E  + DL+  S + ++E+E        ++   ++      KP  SL ++P L+LKPL  HLKY YLG  DTLPII++S L  + E+ L+ +L++
Subjt:  HLEDHGEDSIEDLEVCSLERKNEKEVFRCEDVFESLDLDQRKAPPIKP--SLIEAPNLDLKPLSDHLKYVYLGEGDTLPIIVASDLMAEHEKALIKLLQQ

Query:  YRKAIGWTLADIQEISPSFCMHKITLEEGSFRLNRNYARYHPAMKEVVKKKVIKWLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTG
        +++AIGWT+ADI+ ISP  C H+I LE     + +   R +P +KEVVKK+V+K LDAGIIYPI+D+ WVSPVQ VPKKGG+TVV N+ N+LIPTRTVTG
Subjt:  YRKAIGWTLADIQEISPSFCMHKITLEEGSFRLNRNYARYHPAMKEVVKKKVIKWLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTG

Query:  WRVCMDYRKLNKATHKDHFPLPFIDQMLDRLDGQTYYCFLDGYSG--------------------------RMPFDLCYAPTTFQRCMLAIFSYMIESTV
        WRVC+DYRKLN AT KDHFPLPFIDQML++L G  YYCFLDGYSG                          RMPF LC AP TFQRCM++IFS M+E+ +
Subjt:  WRVCMDYRKLNKATHKDHFPLPFIDQMLDRLDGQTYYCFLDGYSG--------------------------RMPFDLCYAPTTFQRCMLAIFSYMIESTV

Query:  E--------------------------------------------EGIMLGHRISKNGLEVDRANIEVIERLEPPNSVKEIRSFLGHAGFYRR-------
        E                                            EGI+LGH++SK G+EVDRA +E+IE+L PP S K +RSFLGHAGFYRR       
Subjt:  E--------------------------------------------EGIMLGHRISKNGLEVDRANIEVIERLEPPNSVKEIRSFLGHAGFYRR-------

Query:  ----------------------KAFETLKTVLISTPILCAPNWNLPFEVMCAASDAA-------------------------------------------
                              +AF  LK  L S P++ APNW LPFE+MC ASD A                                           
Subjt:  ----------------------KAFETLKTVLISTPILCAPNWNLPFEVMCAASDAA-------------------------------------------

Query:  -------------------------------------------EFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISDSFPDEQLFIVEVKMVKDVPW
                                                   EFDLEI+D KG+ENV+ADHLSRL+   S    S I++ FPDEQL + E   ++ +PW
Subjt:  -------------------------------------------EFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISDSFPDEQLFIVEVKMVKDVPW

Query:  YADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEPFMYKQCSDGIILRRM---------------------------------------------EAI
        YADI N++V  +TP  +  +QKKKF HD K+++WDEP+++K C+D +I R +                                               I
Subjt:  YADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDEPFMYKQCSDGIILRRM---------------------------------------------EAI

Query:  LVCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGSFPPSNGNVFILLAVDYVFKWVEAIACHQNDAKTVARFLQSHIFVWFGTPRALVSDEGTHSV
        + C  CQR GN+  R E+P   IL +ELFDVWGIDFMG FP S  N +IL+AVDYV KWVEA+A   NDA+ V RFL+ +IF  FG PRA++SD G+H  
Subjt:  LVCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGSFPPSNGNVFILLAVDYVFKWVEAIACHQNDAKTVARFLQSHIFVWFGTPRALVSDEGTHSV

Query:  NNILTKLLAKYEIKHKIATPYHPQANGQAEI--RAI----------------------------------------------------------------
        N    KLL+KY + HKIATPYHPQ  GQ E+  R I                                                                
Subjt:  NNILTKLLAKYEIKHKIATPYHPQANGQAEI--RAI----------------------------------------------------------------

Query:  ----------RMLQLNELEEFRQFSYENSKMYKEKTKLWHDKKIKSKDF
                  R+LQLN++ E R+ +YEN+++YKE+ K WHD+ I  ++F
Subjt:  ----------RMLQLNELEEFRQFSYENSKMYKEKTKLWHDKKIKSKDF

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.65.2e-1524.45Show/hide
Query:  YHPAMKEVVKKKVIKWLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRKLNKATHKDHFPLPFIDQMLDRLDGQTYYCF
        Y  A ++ V+ ++   L+ GII   +++ + SP+  VPKK      S K            +R+ +DYRKLN+ T  D  P+P +D++L +L    Y+  
Subjt:  YHPAMKEVVKKKVIKWLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRKLNKATHKDHFPLPFIDQMLDRLDGQTYYCF

Query:  LDGYSG--------------------------RMPFDLCYAPTTFQRCM------------------LAIFSYMIESTV---------------------
        +D   G                          RMPF L  AP TFQRCM                  + +FS  ++  +                     
Subjt:  LDGYSG--------------------------RMPFDLCYAPTTFQRCM------------------LAIFSYMIESTV---------------------

Query:  -----EEGIMLGHRISKNGLEVDRANIEVIERLEPPNSVKEIRSFLGHAGFYRR------------------------------KAFETLKTVLISTPIL
             +E   LGH ++ +G++ +   IE I++   P   KEI++FLG  G+YR+                               AF+ LK ++   PIL
Subjt:  -----EEGIMLGHRISKNGLEVDRANIEVIERLEPPNSVKEIRSFLGHAGFYRR------------------------------KAFETLKTVLISTPIL

Query:  CAPNWNLPFEVMCAASDAA
          P++   F +   ASD A
Subjt:  CAPNWNLPFEVMCAASDAA

P20825 Retrovirus-related Pol polyprotein from transposon 2977.5e-1425.4Show/hide
Query:  VKKKVIKWLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRKLNKATHKDHFPLPFIDQMLDRLDGQTYYCFLDGYSG--
        V+ +V + L+ G+I   +++ + SP   VPKK   +  +NK            +RV +DYRKLN+ T  D +P+P +D++L +L    Y+  +D   G  
Subjt:  VKKKVIKWLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRKLNKATHKDHFPLPFIDQMLDRLDGQTYYCFLDGYSG--

Query:  ------------------------RMPFDLCYAPTTFQRCM------------------LAIFSYMI--------------------------ESTVEEG
                                RMPF L  AP TFQRCM                  + IFS  +                          E   +E 
Subjt:  ------------------------RMPFDLCYAPTTFQRCM------------------LAIFSYMI--------------------------ESTVEEG

Query:  IMLGHRISKNGLEVDRANIEVIERLEPPNSVKEIRSFLGHAGFYRR------------------------------KAFETLKTVLISTPILCAPNWNLP
          LGH ++ +G++ +   ++ I     P   KEIR+FLG  G+YR+                              +AFE LK ++I  PIL  P++   
Subjt:  IMLGHRISKNGLEVDRANIEVIERLEPPNSVKEIRSFLGHAGFYRR------------------------------KAFETLKTVLISTPILCAPNWNLP

Query:  FEVMCAASDAA
        F +   AS+ A
Subjt:  FEVMCAASDAA

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein2.2e-1033.66Show/hide
Query:  LLQQYRKAIGWTL----ADIQEISPSFCMHKITLEEGSFRLNRNYARYHPAMK--EVVKKKVIKWLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDN
        L Q+YR+ I   L    ADI  I      H I ++ G+ RL R    YH   K  + + K V K LD   I P + +   SPV  VPKK G         
Subjt:  LLQQYRKAIGWTL----ADIQEISPSFCMHKITLEEGSFRLNRNYARYHPAMK--EVVKKKVIKWLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDN

Query:  ELIPTRTVTGWRVCMDYRKLNKATHKDHFPLPFIDQMLDRLDGQTYYCFLDGYSGR--------------------------MPFDLCYAPTTFQRCMLA
                  +R+C+DYR LNKAT  D FPLP ID +L R+     +  LD +SG                           MPF L  AP+TF R M  
Subjt:  ELIPTRTVTGWRVCMDYRKLNKATHKDHFPLPFIDQMLDRLDGQTYYCFLDGYSGR--------------------------MPFDLCYAPTTFQRCMLA

Query:  IF
         F
Subjt:  IF

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.2e-1122.5Show/hide
Query:  YARYHP---AMKEVVKKKVIKWLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRKLNKATHKDHFPLPFIDQMLDRLDG
        YA+ +P    M+  V++++ + L  GII P +++ + SP+  VPKK       N + +         +R+ +D+++LN  T  D +P+P I+  L  L  
Subjt:  YARYHP---AMKEVVKKKVIKWLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRKLNKATHKDHFPLPFIDQMLDRLDG

Query:  QTYYCFLDGYSG--------------------------RMPFDLCYAPTTFQRCM------------------LAIF-----------------------
          Y+  LD  SG                          R+PF L  AP  FQR +                  + +F                       
Subjt:  QTYYCFLDGYSG--------------------------RMPFDLCYAPTTFQRCM------------------LAIF-----------------------

Query:  ------SYMIESTVEEGIMLGHRISKNGLEVDRANIEVIERLEPPNSVKEIRSFLGHAGFYRR-------------------------------------
              S+ +++ VE    LG+ ++ +G++ D   +  I  + PP SVKE++ FLG   +YR+                                     
Subjt:  ------SYMIESTVEEGIMLGHRISKNGLEVDRANIEVIERLEPPNSVKEIRSFLGHAGFYRR-------------------------------------

Query:  ---KAFETLKTVLISTPILCAPNWNLPFEVMCAASDAAEFDLEIKDKKGSENVIADHLSR
           ++F  LK++L S+ IL  P +  PF +   AS+ A   +  +D +G +  IA ++SR
Subjt:  ---KAFETLKTVLISTPILCAPNWNLPFEVMCAASDAAEFDLEIKDKKGSENVIADHLSR

Q99315 Transposon Ty3-G Gag-Pol polyprotein2.2e-1033.66Show/hide
Query:  LLQQYRKAIGWTL----ADIQEISPSFCMHKITLEEGSFRLNRNYARYHPAMK--EVVKKKVIKWLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDN
        L Q+YR+ I   L    ADI  I      H I ++ G+ RL R    YH   K  + + K V K LD   I P + +   SPV  VPKK G         
Subjt:  LLQQYRKAIGWTL----ADIQEISPSFCMHKITLEEGSFRLNRNYARYHPAMK--EVVKKKVIKWLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDN

Query:  ELIPTRTVTGWRVCMDYRKLNKATHKDHFPLPFIDQMLDRLDGQTYYCFLDGYSGR--------------------------MPFDLCYAPTTFQRCMLA
                  +R+C+DYR LNKAT  D FPLP ID +L R+     +  LD +SG                           MPF L  AP+TF R M  
Subjt:  ELIPTRTVTGWRVCMDYRKLNKATHKDHFPLPFIDQMLDRLDGQTYYCFLDGYSGR--------------------------MPFDLCYAPTTFQRCMLA

Query:  IF
         F
Subjt:  IF

Arabidopsis top hitse value%identityAlignment
ATMG00750.1 GAG/POL/ENV polyprotein2.2e-0871.43Show/hide
Query:  CDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFM
        CDACQR+GN   R+EMP  +ILEVE+FDVWGI FM
Subjt:  CDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCGATCTGCCTGGGGTAAGGTTCGAGCTTGATCCAAAAATCGAGAGGACATTTAGGAGAAGAATGAGAGAGCAACGTACAAACCAAAATCAAATAGATAACATGTC
GCGTCTCCCGCAGGGTCTTAAAGATCCAAGGAACCCCCAGCAGAATCGTTTGCTGCAGCAAAACCAGCAGCTGGAACAAAATGAGCAGCAAAATAATCAGGCTGAGAATC
CTATCTTGGTAGCAAACGATAGGGCCAGAGCCATTCGAGCGTATACTTTTCCAATGTTTGATGAGTTAAATCCAGGGATTGCACGTCCTCAAATTGAGGCAGAAAATTTT
GAAATGAAACCGGTAATGTTTCAGATGTTGCACACTGTGGGTCAATTTCATGGTTTGTCATTTGAAGACCCTCATTTACATCTCAAGTCTTTTATAGGAGTTAGTGATTC
TTTTGTGATTCAAGGAGTTCCTAGGGATGCCCTTAGATTAACTTTGTTCTTGTATTCTCTTAGAGATGGAGCAAAGGCATGGTTAAATTCTTTTGCTCCAGGATCAATTA
GGACATTGAATGAATTAGCTGAGAAATTTTTGAGTAAATATTTCCCACCAAATAGAAATGATAAATCGAGGAGTGAAATAGTAGGGTTTAAGCAACTTGAAGATGAAACT
TTTAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCATGGTTTACCACATTGTATTCAAATGGAAACATTTTACAATGGGTTATATGGAGCAAC
CCAAGGTATGGTTGATGCTTCGACTGGAGGGGCCCTTTTGGTAAAAACTTTTAATGAAGCCCATGAAATTTTGGAAAGAATATCTACTAATAGTTGTCAGTGGTCGGATG
TTAGAGGCACAAATAAAAAGGTTAAGAGTGTGTTAGAGGTTGATGGTGTGTCCACCATTAGGGATGATCTTGCAATGATAGCTAACTCTCTTAATGTTGGGATAGAACAT
GCATGCTTTAGGAGTAAAATTGAATGTCTTAAGTGCTTTCGATCTTCAATTGCTTCCGCATGTGTAATATGTACACCTTCACTTAAGAATGTGACAGTGATTAGTCATCA
GCAGCCACTAGCTTTGGAGCCTGCTGCATTGGTGAACCAAGTTGCAGAGGAAGCATGTGTCTATTGTGGTGAAGATCACAACTACGAGTTTTGCCCCAACAATCCAGCTT
CTGTGTTTTTTGTAGGTAATCAGAGGAATAACCCTTATTCTATCTTTTATAATCCAGGTTGGCGCAACCACCCCAACTTCTCATGGGGAGGACAAGGAAGTAATGTGCAA
GCACAACAGAAGGTGAACCAATCGGGATTTGCTAAAGCACAGCTAGCTAATGAGCTGAAGGCAAGGCCTCAAGGGAAACTTCCTTCAGATACTGAACACCCTAGAAGGGA
AGGTAAGGAGCAGGTAAAGGCAGTGACTCTTAGGATTGAGTTGGAGACTGGTCAGGGTGCTGGAGGCAGCAATAATGATGCTGGAGCATATGGTTCTGTTCCAGATGTGG
AACCACCTTATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAGGCCTAAGAATCAGGATGGTCAATTTAAAAAGTTTTTAGAGATTCTTAAG
CAATTGCATATAAATATCCCTTTAGTAGAAGCCATTGAGCAAATGCCTAATTATGCTAAATTTCTTAAGGATATTTTAACTAAAAAAAAGAGGTTATGTGAGTTTGAAAC
TGTATCTCTTACTGAGGAATGTAGTGTTATTCTTAAGAATGGGCTACCAACCAAGGCTAAGGATCCAGGGTCATTTACTATACCTGTGTCTATAGGTGGAAAAGAGTTAG
GTAGAGCACTCTGTGATTTAGGTGCAAACATTAACCTTATGCCTCTCTCGGTCTATCGAAAGCTAGGTATTGGTGAAGCTAGGCCTACCACAGGCACCTTCCAACTGGCT
AATAGGTCTATCACATATCCAGAGGGTAAAATTGAGGATGTCTTAGTGAAAGTAGATAAATTCATATTTCCTGTTGATTTTATTATCTTAGACTATGAGGCTTATAAAGA
TGTCCCAATTATTCTAGGTCGTCCATTTTTGGCTACTGGTAGGACATTGATAGATGTTCAAAAAGGAGAATTAACAATGAGAGTCTATAATGAGGAAGTAAAATTTAATG
TGTTTAAGGTCATGAAGTATCCAGACGAAATGGAAGATTGCTCTTTCGTTAGGATTCTGGAGAACACAATTGTTGAGACAGCAATGCAGGAGTCGACTAACAAACATTTG
GAAGATCATGGAGAGGATAGTATAGAAGATTTAGAAGTTTGTTCTTTAGAAAGAAAAAATGAAAAAGAAGTGTTTAGGTGTGAGGATGTTTTTGAGTCTTTAGATTTAGA
TCAAAGGAAGGCTCCTCCTATTAAGCCATCCCTGATTGAGGCACCCAATTTAGATTTGAAACCCTTGTCGGATCATCTAAAGTATGTGTATCTTGGGGAGGGTGATACGT
TGCCCATTATTGTTGCATCAGATTTAATGGCAGAGCATGAAAAGGCCTTAATTAAATTGTTGCAGCAATATCGCAAAGCAATAGGTTGGACATTGGCTGACATTCAGGAA
ATTAGCCCATCTTTCTGTATGCACAAAATCACTCTAGAGGAGGGATCCTTTAGGCTTAACCGCAACTATGCTCGTTATCACCCTGCAATGAAAGAAGTGGTTAAAAAGAA
GGTGATTAAATGGTTGGATGCTGGCATTATTTATCCAATTGCAGACAACAATTGGGTAAGCCCTGTCCAATGTGTTCCTAAGAAAGGAGGTGTCACTGTGGTGAGCAATA
AAGATAATGAGTTGATCCCAACCAGGACAGTAACTGGCTGGAGGGTTTGCATGGATTACAGGAAGCTTAATAAAGCTACCCATAAGGACCATTTCCCTCTACCATTTATT
GACCAGATGTTGGATAGATTGGATGGTCAGACCTACTACTGTTTCTTAGATGGTTATTCTGGGCGAATGCCTTTTGACCTCTGCTATGCTCCAACAACATTTCAGCGTTG
TATGTTAGCAATTTTTTCCTATATGATTGAGTCCACTGTCGAGGAGGGCATTATGTTAGGTCATAGGATTTCTAAGAATGGTCTAGAAGTTGATAGAGCAAATATTGAGG
TTATCGAAAGATTAGAACCACCGAATTCAGTGAAAGAGATTCGAAGTTTTCTAGGTCATGCAGGATTTTATAGGAGGAAGGCTTTTGAAACTTTAAAGACTGTTTTAATC
TCAACACCCATTCTTTGTGCACCTAATTGGAACTTACCATTTGAGGTAATGTGTGCTGCGAGTGATGCTGCGGAATTTGACTTGGAGATAAAGGATAAGAAGGGATCAGA
AAATGTCATTGCAGATCATTTGTCTCGTCTTGATCCATCATCATCTTTGTTGAAGCAATCTGCCATTTCAGATTCTTTTCCAGATGAACAACTCTTTATTGTTGAGGTAA
AGATGGTCAAGGATGTCCCTTGGTATGCTGATATTGCTAACTTTTTGGTAAAGGGAGTCACTCCTATTGATATGGATTGGAGGCAGAAGAAAAAGTTTAAGCATGATGCA
AAATTTTTCTATTGGGATGAGCCATTTATGTATAAGCAATGCTCTGATGGTATTATTCTTCGTCGTATGGAGGCCATTTTAGTGTGTGATGCTTGCCAAAGGAGAGGAAA
CTTGGGGCCTAGAGATGAAATGCCTCTTACTTACATTTTGGAAGTAGAATTATTTGATGTATGGGGTATTGACTTTATGGGGTCATTTCCCCCTTCTAATGGCAATGTTT
TTATCTTATTGGCAGTTGATTATGTGTTCAAGTGGGTGGAGGCCATCGCATGCCATCAGAATGATGCCAAGACAGTAGCAAGGTTTCTTCAATCGCACATCTTTGTGTGG
TTTGGGACACCTAGAGCTCTAGTGAGTGATGAGGGTACACATTCTGTTAATAATATCTTAACTAAGCTGTTAGCTAAGTATGAGATTAAGCATAAGATAGCTACCCCTTA
TCATCCACAAGCAAATGGTCAAGCTGAGATTAGGGCAATAAGAATGCTGCAGCTTAATGAATTAGAGGAATTTCGACAATTTTCTTATGAAAATTCGAAAATGTATAAAG
AGAAGACTAAGTTGTGGCATGACAAGAAAATTAAATCTAAAGACTTCAGTAAGTTTCTTTTCACTTCACCTTCTCGTTTCATTCCTTTGCTATTCGTATTTCTCCTTTCT
TTTGTTGTTCGCTGTACAAAAAACCCTTGGGTCTCTAATGGCAAAACGAGGGCAAGAAAAGAGAGAGACAACGAGGAAGAAGAGGTGTTGGTCACCCCTGAGACACAGAA
AGTTAAAACAAAGAAGAAGAAAACGCCGAGGAAAAAGAAGCCAAAAGAAGGAGAAGGAAACAGAGGGTTGATGACCAAGAAGCTGTCCAGAAAGCGGCAGAAGATGTTGT
TACTGCGGAGGTGTGAGAAGAGAATCGGAAGAAGTTCAAGAAAAGCAGGTGAGGATGTGCGAGGAGGACAAGCAGAGGTTGCGCCTGAAGAAGATAATGAGCCAGTGCAG
GAGGCTCGAGTAGAGGTGATCATGCCGGAAGTGCCCAAACGTCGCCGCATTAAGCGAAAAGCGGGTCGCATCAAGGGTTTTGGAGATGATCTACCATGGTTCTTGAGGAC
GAGAATAGTGAACCATGGCTGGAGTCAGTTCTGTTCGAAGCCGGAGCCGGTCAATTCCAACATTGTTCGAGAGTTTTATGCGAATGTTGATAATGCAGAGAAATTTCAGG
CTATAGTTCGAGGAGTACCCGTTGACTGGAGCCCAGGAGCTATTAATTCTTTATTCAACCTCCAGGACTTTCCACAAGCAGGCTTTAATGAGATGGTGGTGGCACCATCG
ATCGATCAATTGAATTCGGTTGTCCGAGAGGTTGGCATCGAGGGGGCTCAATGGAGATTATCGAAGACAGAAAAGCGTACATTTCAAGCTGCTTATCTGAAGAGTGAGAC
TAATACATGGATGGGCTTCGTCAAGCTACGCTTACTGCCGACAACTCACGATTCAACGGTGTCTCGGGACCGGGTACTACTTGTGTTTGCTATTCTTCGTTCGATGAGTA
TTGATGTGGGGGTTCCAGAGAATGAGGATGATGTGACTCTGATAGACAAGGGAATAATCGACACACCAAATTTGGCTAGGTTCAAAGGTCACAAGAAGCACGCCAAGGAG
GATTGGGATGTTGCGTTGAGGGTAGCCTTGCAATCGAATTTTTCCAAACCATATCCGACTTTACCCATATTCCCTGATGACCTACTGAACCCCTGGATTCCGCCACCGCC
TGTTGAAAGAGAAGGAGATGATGAAGAAGATCCTGATCAGGAGGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCGATCTGCCTGGGGTAAGGTTCGAGCTTGATCCAAAAATCGAGAGGACATTTAGGAGAAGAATGAGAGAGCAACGTACAAACCAAAATCAAATAGATAACATGTC
GCGTCTCCCGCAGGGTCTTAAAGATCCAAGGAACCCCCAGCAGAATCGTTTGCTGCAGCAAAACCAGCAGCTGGAACAAAATGAGCAGCAAAATAATCAGGCTGAGAATC
CTATCTTGGTAGCAAACGATAGGGCCAGAGCCATTCGAGCGTATACTTTTCCAATGTTTGATGAGTTAAATCCAGGGATTGCACGTCCTCAAATTGAGGCAGAAAATTTT
GAAATGAAACCGGTAATGTTTCAGATGTTGCACACTGTGGGTCAATTTCATGGTTTGTCATTTGAAGACCCTCATTTACATCTCAAGTCTTTTATAGGAGTTAGTGATTC
TTTTGTGATTCAAGGAGTTCCTAGGGATGCCCTTAGATTAACTTTGTTCTTGTATTCTCTTAGAGATGGAGCAAAGGCATGGTTAAATTCTTTTGCTCCAGGATCAATTA
GGACATTGAATGAATTAGCTGAGAAATTTTTGAGTAAATATTTCCCACCAAATAGAAATGATAAATCGAGGAGTGAAATAGTAGGGTTTAAGCAACTTGAAGATGAAACT
TTTAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCATGGTTTACCACATTGTATTCAAATGGAAACATTTTACAATGGGTTATATGGAGCAAC
CCAAGGTATGGTTGATGCTTCGACTGGAGGGGCCCTTTTGGTAAAAACTTTTAATGAAGCCCATGAAATTTTGGAAAGAATATCTACTAATAGTTGTCAGTGGTCGGATG
TTAGAGGCACAAATAAAAAGGTTAAGAGTGTGTTAGAGGTTGATGGTGTGTCCACCATTAGGGATGATCTTGCAATGATAGCTAACTCTCTTAATGTTGGGATAGAACAT
GCATGCTTTAGGAGTAAAATTGAATGTCTTAAGTGCTTTCGATCTTCAATTGCTTCCGCATGTGTAATATGTACACCTTCACTTAAGAATGTGACAGTGATTAGTCATCA
GCAGCCACTAGCTTTGGAGCCTGCTGCATTGGTGAACCAAGTTGCAGAGGAAGCATGTGTCTATTGTGGTGAAGATCACAACTACGAGTTTTGCCCCAACAATCCAGCTT
CTGTGTTTTTTGTAGGTAATCAGAGGAATAACCCTTATTCTATCTTTTATAATCCAGGTTGGCGCAACCACCCCAACTTCTCATGGGGAGGACAAGGAAGTAATGTGCAA
GCACAACAGAAGGTGAACCAATCGGGATTTGCTAAAGCACAGCTAGCTAATGAGCTGAAGGCAAGGCCTCAAGGGAAACTTCCTTCAGATACTGAACACCCTAGAAGGGA
AGGTAAGGAGCAGGTAAAGGCAGTGACTCTTAGGATTGAGTTGGAGACTGGTCAGGGTGCTGGAGGCAGCAATAATGATGCTGGAGCATATGGTTCTGTTCCAGATGTGG
AACCACCTTATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAGGCCTAAGAATCAGGATGGTCAATTTAAAAAGTTTTTAGAGATTCTTAAG
CAATTGCATATAAATATCCCTTTAGTAGAAGCCATTGAGCAAATGCCTAATTATGCTAAATTTCTTAAGGATATTTTAACTAAAAAAAAGAGGTTATGTGAGTTTGAAAC
TGTATCTCTTACTGAGGAATGTAGTGTTATTCTTAAGAATGGGCTACCAACCAAGGCTAAGGATCCAGGGTCATTTACTATACCTGTGTCTATAGGTGGAAAAGAGTTAG
GTAGAGCACTCTGTGATTTAGGTGCAAACATTAACCTTATGCCTCTCTCGGTCTATCGAAAGCTAGGTATTGGTGAAGCTAGGCCTACCACAGGCACCTTCCAACTGGCT
AATAGGTCTATCACATATCCAGAGGGTAAAATTGAGGATGTCTTAGTGAAAGTAGATAAATTCATATTTCCTGTTGATTTTATTATCTTAGACTATGAGGCTTATAAAGA
TGTCCCAATTATTCTAGGTCGTCCATTTTTGGCTACTGGTAGGACATTGATAGATGTTCAAAAAGGAGAATTAACAATGAGAGTCTATAATGAGGAAGTAAAATTTAATG
TGTTTAAGGTCATGAAGTATCCAGACGAAATGGAAGATTGCTCTTTCGTTAGGATTCTGGAGAACACAATTGTTGAGACAGCAATGCAGGAGTCGACTAACAAACATTTG
GAAGATCATGGAGAGGATAGTATAGAAGATTTAGAAGTTTGTTCTTTAGAAAGAAAAAATGAAAAAGAAGTGTTTAGGTGTGAGGATGTTTTTGAGTCTTTAGATTTAGA
TCAAAGGAAGGCTCCTCCTATTAAGCCATCCCTGATTGAGGCACCCAATTTAGATTTGAAACCCTTGTCGGATCATCTAAAGTATGTGTATCTTGGGGAGGGTGATACGT
TGCCCATTATTGTTGCATCAGATTTAATGGCAGAGCATGAAAAGGCCTTAATTAAATTGTTGCAGCAATATCGCAAAGCAATAGGTTGGACATTGGCTGACATTCAGGAA
ATTAGCCCATCTTTCTGTATGCACAAAATCACTCTAGAGGAGGGATCCTTTAGGCTTAACCGCAACTATGCTCGTTATCACCCTGCAATGAAAGAAGTGGTTAAAAAGAA
GGTGATTAAATGGTTGGATGCTGGCATTATTTATCCAATTGCAGACAACAATTGGGTAAGCCCTGTCCAATGTGTTCCTAAGAAAGGAGGTGTCACTGTGGTGAGCAATA
AAGATAATGAGTTGATCCCAACCAGGACAGTAACTGGCTGGAGGGTTTGCATGGATTACAGGAAGCTTAATAAAGCTACCCATAAGGACCATTTCCCTCTACCATTTATT
GACCAGATGTTGGATAGATTGGATGGTCAGACCTACTACTGTTTCTTAGATGGTTATTCTGGGCGAATGCCTTTTGACCTCTGCTATGCTCCAACAACATTTCAGCGTTG
TATGTTAGCAATTTTTTCCTATATGATTGAGTCCACTGTCGAGGAGGGCATTATGTTAGGTCATAGGATTTCTAAGAATGGTCTAGAAGTTGATAGAGCAAATATTGAGG
TTATCGAAAGATTAGAACCACCGAATTCAGTGAAAGAGATTCGAAGTTTTCTAGGTCATGCAGGATTTTATAGGAGGAAGGCTTTTGAAACTTTAAAGACTGTTTTAATC
TCAACACCCATTCTTTGTGCACCTAATTGGAACTTACCATTTGAGGTAATGTGTGCTGCGAGTGATGCTGCGGAATTTGACTTGGAGATAAAGGATAAGAAGGGATCAGA
AAATGTCATTGCAGATCATTTGTCTCGTCTTGATCCATCATCATCTTTGTTGAAGCAATCTGCCATTTCAGATTCTTTTCCAGATGAACAACTCTTTATTGTTGAGGTAA
AGATGGTCAAGGATGTCCCTTGGTATGCTGATATTGCTAACTTTTTGGTAAAGGGAGTCACTCCTATTGATATGGATTGGAGGCAGAAGAAAAAGTTTAAGCATGATGCA
AAATTTTTCTATTGGGATGAGCCATTTATGTATAAGCAATGCTCTGATGGTATTATTCTTCGTCGTATGGAGGCCATTTTAGTGTGTGATGCTTGCCAAAGGAGAGGAAA
CTTGGGGCCTAGAGATGAAATGCCTCTTACTTACATTTTGGAAGTAGAATTATTTGATGTATGGGGTATTGACTTTATGGGGTCATTTCCCCCTTCTAATGGCAATGTTT
TTATCTTATTGGCAGTTGATTATGTGTTCAAGTGGGTGGAGGCCATCGCATGCCATCAGAATGATGCCAAGACAGTAGCAAGGTTTCTTCAATCGCACATCTTTGTGTGG
TTTGGGACACCTAGAGCTCTAGTGAGTGATGAGGGTACACATTCTGTTAATAATATCTTAACTAAGCTGTTAGCTAAGTATGAGATTAAGCATAAGATAGCTACCCCTTA
TCATCCACAAGCAAATGGTCAAGCTGAGATTAGGGCAATAAGAATGCTGCAGCTTAATGAATTAGAGGAATTTCGACAATTTTCTTATGAAAATTCGAAAATGTATAAAG
AGAAGACTAAGTTGTGGCATGACAAGAAAATTAAATCTAAAGACTTCAGTAAGTTTCTTTTCACTTCACCTTCTCGTTTCATTCCTTTGCTATTCGTATTTCTCCTTTCT
TTTGTTGTTCGCTGTACAAAAAACCCTTGGGTCTCTAATGGCAAAACGAGGGCAAGAAAAGAGAGAGACAACGAGGAAGAAGAGGTGTTGGTCACCCCTGAGACACAGAA
AGTTAAAACAAAGAAGAAGAAAACGCCGAGGAAAAAGAAGCCAAAAGAAGGAGAAGGAAACAGAGGGTTGATGACCAAGAAGCTGTCCAGAAAGCGGCAGAAGATGTTGT
TACTGCGGAGGTGTGAGAAGAGAATCGGAAGAAGTTCAAGAAAAGCAGGTGAGGATGTGCGAGGAGGACAAGCAGAGGTTGCGCCTGAAGAAGATAATGAGCCAGTGCAG
GAGGCTCGAGTAGAGGTGATCATGCCGGAAGTGCCCAAACGTCGCCGCATTAAGCGAAAAGCGGGTCGCATCAAGGGTTTTGGAGATGATCTACCATGGTTCTTGAGGAC
GAGAATAGTGAACCATGGCTGGAGTCAGTTCTGTTCGAAGCCGGAGCCGGTCAATTCCAACATTGTTCGAGAGTTTTATGCGAATGTTGATAATGCAGAGAAATTTCAGG
CTATAGTTCGAGGAGTACCCGTTGACTGGAGCCCAGGAGCTATTAATTCTTTATTCAACCTCCAGGACTTTCCACAAGCAGGCTTTAATGAGATGGTGGTGGCACCATCG
ATCGATCAATTGAATTCGGTTGTCCGAGAGGTTGGCATCGAGGGGGCTCAATGGAGATTATCGAAGACAGAAAAGCGTACATTTCAAGCTGCTTATCTGAAGAGTGAGAC
TAATACATGGATGGGCTTCGTCAAGCTACGCTTACTGCCGACAACTCACGATTCAACGGTGTCTCGGGACCGGGTACTACTTGTGTTTGCTATTCTTCGTTCGATGAGTA
TTGATGTGGGGGTTCCAGAGAATGAGGATGATGTGACTCTGATAGACAAGGGAATAATCGACACACCAAATTTGGCTAGGTTCAAAGGTCACAAGAAGCACGCCAAGGAG
GATTGGGATGTTGCGTTGAGGGTAGCCTTGCAATCGAATTTTTCCAAACCATATCCGACTTTACCCATATTCCCTGATGACCTACTGAACCCCTGGATTCCGCCACCGCC
TGTTGAAAGAGAAGGAGATGATGAAGAAGATCCTGATCAGGAGGATTGA
Protein sequenceShow/hide protein sequence
MSDLPGVRFELDPKIERTFRRRMREQRTNQNQIDNMSRLPQGLKDPRNPQQNRLLQQNQQLEQNEQQNNQAENPILVANDRARAIRAYTFPMFDELNPGIARPQIEAENF
EMKPVMFQMLHTVGQFHGLSFEDPHLHLKSFIGVSDSFVIQGVPRDALRLTLFLYSLRDGAKAWLNSFAPGSIRTLNELAEKFLSKYFPPNRNDKSRSEIVGFKQLEDET
FSEAWERFKELLRKCPHHGLPHCIQMETFYNGLYGATQGMVDASTGGALLVKTFNEAHEILERISTNSCQWSDVRGTNKKVKSVLEVDGVSTIRDDLAMIANSLNVGIEH
ACFRSKIECLKCFRSSIASACVICTPSLKNVTVISHQQPLALEPAALVNQVAEEACVYCGEDHNYEFCPNNPASVFFVGNQRNNPYSIFYNPGWRNHPNFSWGGQGSNVQ
AQQKVNQSGFAKAQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRIELETGQGAGGSNNDAGAYGSVPDVEPPYVPPPPYVPPLPFPQRQRPKNQDGQFKKFLEILK
QLHINIPLVEAIEQMPNYAKFLKDILTKKKRLCEFETVSLTEECSVILKNGLPTKAKDPGSFTIPVSIGGKELGRALCDLGANINLMPLSVYRKLGIGEARPTTGTFQLA
NRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEAYKDVPIILGRPFLATGRTLIDVQKGELTMRVYNEEVKFNVFKVMKYPDEMEDCSFVRILENTIVETAMQESTNKHL
EDHGEDSIEDLEVCSLERKNEKEVFRCEDVFESLDLDQRKAPPIKPSLIEAPNLDLKPLSDHLKYVYLGEGDTLPIIVASDLMAEHEKALIKLLQQYRKAIGWTLADIQE
ISPSFCMHKITLEEGSFRLNRNYARYHPAMKEVVKKKVIKWLDAGIIYPIADNNWVSPVQCVPKKGGVTVVSNKDNELIPTRTVTGWRVCMDYRKLNKATHKDHFPLPFI
DQMLDRLDGQTYYCFLDGYSGRMPFDLCYAPTTFQRCMLAIFSYMIESTVEEGIMLGHRISKNGLEVDRANIEVIERLEPPNSVKEIRSFLGHAGFYRRKAFETLKTVLI
STPILCAPNWNLPFEVMCAASDAAEFDLEIKDKKGSENVIADHLSRLDPSSSLLKQSAISDSFPDEQLFIVEVKMVKDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDA
KFFYWDEPFMYKQCSDGIILRRMEAILVCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGSFPPSNGNVFILLAVDYVFKWVEAIACHQNDAKTVARFLQSHIFVW
FGTPRALVSDEGTHSVNNILTKLLAKYEIKHKIATPYHPQANGQAEIRAIRMLQLNELEEFRQFSYENSKMYKEKTKLWHDKKIKSKDFSKFLFTSPSRFIPLLFVFLLS
FVVRCTKNPWVSNGKTRARKERDNEEEEVLVTPETQKVKTKKKKTPRKKKPKEGEGNRGLMTKKLSRKRQKMLLLRRCEKRIGRSSRKAGEDVRGGQAEVAPEEDNEPVQ
EARVEVIMPEVPKRRRIKRKAGRIKGFGDDLPWFLRTRIVNHGWSQFCSKPEPVNSNIVREFYANVDNAEKFQAIVRGVPVDWSPGAINSLFNLQDFPQAGFNEMVVAPS
IDQLNSVVREVGIEGAQWRLSKTEKRTFQAAYLKSETNTWMGFVKLRLLPTTHDSTVSRDRVLLVFAILRSMSIDVGVPENEDDVTLIDKGIIDTPNLARFKGHKKHAKE
DWDVALRVALQSNFSKPYPTLPIFPDDLLNPWIPPPPVEREGDDEEDPDQED