; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028361 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028361
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr8:19466225..19471685
RNA-Seq ExpressionLag0028361
SyntenyLag0028361
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIN16590.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]5.5e-22739.61Show/hide
Query:  SRINPLFEQNEQQNNQAENPILIAND--RTRAIRAYVVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHLKSFLGVSDSFVIQA
        +R   L E  EQ+    EN I+I  D      +R   +P   E    +  P++  A  +++  M +M+Q   QF GL+ E+P+ H+ +FL + D+   + 
Subjt:  SRINPLFEQNEQQNNQAENPILIAND--RTRAIRAYVVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHLKSFLGVSDSFVIQA

Query:  VPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNG
        V +DALRL LF +SL   A  W  S    SI TW +L E+F+SK+F P + A LR+EI+ FRQ   ET  EAW RF+++LR CP+H +P  IQ+ TFY+G
Subjt:  VPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNG

Query:  -----------------LNGVTQE------RISINSCQWSDIRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEAC
                         L+G T E       +  N  +    R T  K   V+EVD V+ + A +  +  ++KN  V   Q  P               C
Subjt:  -----------------LNGVTQE------RISINSCQWSDIRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEAC

Query:  VYCGEDHNYEFCPNNPASVFFVGNQR---NNPYSNFYNPGWHNHPNFSWGG-----------QGNNVQAQQKMNQ--PGFAKAFDESPGIAS--------
          CGE H  + CP++  S+ FV N R   NNPYSN YNPGW  HPNFSW             QG   Q QQ M +  P   +   +     +        
Subjt:  VYCGEDHNYEFCPNNPASVFFVGNQR---NNPYSNFYNPGWHNHPNFSWGG-----------QGNNVQAQQKMNQ--PGFAKAFDESPGIAS--------

Query:  --GSLASELKARPQGKLPSDTE-HPRREVKEQVKAVSLRSGKPLEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYVPPHLMY
          G LA+ + +RP+  LPS+TE +PR++ K Q +AV+LR+G  L+E  K  +   + +K ++ E++        G    A       +  P YV      
Subjt:  --GSLASELKARPQGKLPSDTE-HPRREVKEQVKAVSLRSGKPLEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYVPPHLMY

Query:  HLYLFHKGKSLRIRMDILTKKKRLGEFETVSLTDECSAILKNGLPPKGKDPGSFIIHVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQ
                   +   DIL+KK+RLG++ETV+LT+ECSAI++N LPPK KDPGSF I  +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQ
Subjt:  HLYLFHKGKSLRIRMDILTKKKRLGEFETVSLTDECSAILKNGLPPKGKDPGSFIIHVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQ

Query:  LADRTITYPEGKTEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV-
        LADR++TYP G  ED+LVKVDKFIFP DF++LD E D +VPIILGRPFLATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + + +    
Subjt:  LADRTITYPEGKTEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV-

Query:  -----------IETAIQDSADKHSEK--------------------------------------------------------HGETLPIIVASDLMSEHE
                   +E A+ D  D+ +E+                                                          +TLP+I++S L     
Subjt:  -----------IETAIQDSADKHSEK--------------------------------------------------------HGETLPIIVASDLMSEHE

Query:  EALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLGAGIIYPIGDSNWVSPFQCVPKKGGVTVVSNKDNE
        E L+++L+ ++ AIGWT+ADI+GIS SFCMHKI LE+G   S+E QRRLNP MKEVVKKE+IKWL AGIIYPI DS+WVSP QCVPKKGG+TVV N  NE
Subjt:  EALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLGAGIIYPIGDSNWVSPFQCVPKKGGVTVVSNKDNE

Query:  LIPTRT-----------------------------MLDRLAGQAYYCFLDGYSGYNQVTIAPEDQEKTTFTCPYGTFAFRRMPFGL--------------
        LIPTRT                             MLDRLAG+ +YCFLDGYSGYNQ+ I PEDQEKTTFTCPYGTF FR+MPFGL              
Subjt:  LIPTRT-----------------------------MLDRLAGQAYYCFLDGYSGYNQVTIAPEDQEKTTFTCPYGTFAFRRMPFGL--------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------YCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTE
                                     C   F  LK  LISAPI+ AP+W+ PFE+MCDASD A+ A+LGQ++ K    IYYASK LN+AQ+NYTTTE
Subjt:  ----------------------------YCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTE

Query:  KELLAVVFAFEKFRPYLVASKVTVFTDHAAIR
        KELLAVVFAF+KFR YLV +KV V+TDHAAIR
Subjt:  KELLAVVFAFEKFRPYLVASKVTVFTDHAAIR

XP_017239676.1 PREDICTED: uncharacterized protein LOC108212460 [Daucus carota subsp. sativus]4.0e-23040.33Show/hide
Query:  RTRAIRAYVVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHLKSFLGVSDSFVIQAVPRDALRLTLFPYSLRDGAKSWLNSFAP
        R   ++ Y++P F+ ++  IARP I A NF +     Q ++   +F+GL++EDP+ HL++FL + D+F +  VP + +RL LF  SL   A+ WL+S   
Subjt:  RTRAIRAYVVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHLKSFLGVSDSFVIQAVPRDALRLTLFPYSLRDGAKSWLNSFAP

Query:  GSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGL--------NGVTQ--------------
         SI TW++L EKFL+KYF P +  +L  EI  F+Q + E+  EA+ERFK+LLRKCPHHGL    ++ TFYNGL        +G  +              
Subjt:  GSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGL--------NGVTQ--------------

Query:  -ERISINSCQWSDIRGTNKKVKSVLEVDGVSTIRADLAMIANAL--KNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCP------NNPASVF
         E I+ N+C+  D R ++KKV  V EVD +++  A +     AL  K  ++   +Q P      V N      C  CGE H  + CP         +SV 
Subjt:  -ERISINSCQWSDIRGTNKKVKSVLEVDGVSTIRADLAMIANAL--KNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCP------NNPASVF

Query:  FVG---NQRNNPYSNFYNPGWHNHPNFSWGGQGNNVQAQQKMNQ---PGF-----AKAFDESPG---------------------------IASGSLASE
        +VG   NQ+NNP+SN YNPGW NHPNFSW    NNV+      Q   PGF      +  ++ P                            +  G LAS 
Subjt:  FVG---NQRNNPYSNFYNPGWHNHPNFSWGGQGNNVQAQQKMNQ---PGF-----AKAFDESPG---------------------------IASGSLASE

Query:  LKARPQGKLPSDTE-HPRREVKEQVKAVSLRSGKPLEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYVPP------------
        +  RP G LPS+TE +P+ + +E  KA++LRSGK +E + K  D   + +K +  E  +        SN  A AS     V PP  PP            
Subjt:  LKARPQGKLPSDTE-HPRREVKEQVKAVSLRSGKPLEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYVPP------------

Query:  --------------HLMYHLYLFHKGKSLRIRMDILTKKKRLGEFETVSLTDECSAILKNGLPPKGKDPGSFIIHVSIGGKELGRALCDLGASINLMPLS
                      ++ +   L      ++   DIL++K+RL EFETV+LT+ECSAIL+  LPPK KDPGSF I  +IG +  G+ALCDLGAS+NLMPLS
Subjt:  --------------HLMYHLYLFHKGKSLRIRMDILTKKKRLGEFETVSLTDECSAILKNGLPPKGKDPGSFIIHVSIGGKELGRALCDLGASINLMPLS

Query:  VYRKLGIGEARPTTVTLQLADRTITYPEGKTEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKY
        ++ KLG+GE +PT+V LQLADR++ YP G  EDVLVKVDKFIFP DFI+LD E D D+P++LGRPFLATGR LIDVQKGELTMRV +E+V FNVF AMK+
Subjt:  VYRKLGIGEARPTTVTLQLADRTITYPEGKTEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKY

Query:  PDEMEDCSFIRIL-------------ESTVIETAIQDSADKHSEKHGE----------------------------------------------------
         ++ E C  +                 +  +E +++++ D+ +E+  E                                                    
Subjt:  PDEMEDCSFIRIL-------------ESTVIETAIQDSADKHSEKHGE----------------------------------------------------

Query:  ------TLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLGAGIIYPIGDSNWV
              TLP+I++S L +EHEE L+++L++Y++AIGW +ADI+GIS SFCMHKI++E+    +IE QRRLNP MKEVVKKE+IKWL AGIIYPI DS+WV
Subjt:  ------TLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLGAGIIYPIGDSNWV

Query:  SPFQCVPKKGGVTVVSNKDNELIPTRT-----------------------------MLDRLAGQAYYCFLDGYSGYNQVTIAPEDQEKTTFTCPYGTFAF
        SP QCVPKKGG+TVV+N+ NELIPTRT                             MLDRLAG+ +YCFLDGYSGY+Q+ IAPEDQEKTTFTCP+GTFAF
Subjt:  SPFQCVPKKGGVTVVSNKDNELIPTRT-----------------------------MLDRLAGQAYYCFLDGYSGYNQVTIAPEDQEKTTFTCPYGTFAF

Query:  RRMPFGL---------------------------------------------------------------------------------------------
        R++ FGL                                                                                             
Subjt:  RRMPFGL---------------------------------------------------------------------------------------------

Query:  ----------------YCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEK
                         C +A+E LK  L+SAPI+ AP+W+LPFE+MCDASD A+  +LGQ+  K  H IYYASK L+ AQ+NYTTTEKELLAVV+AFEK
Subjt:  ----------------YCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEK

Query:  FRPYLVASKVTVFTDHAAIR
        FR YLV +KV V+TDH+AI+
Subjt:  FRPYLVASKVTVFTDHAAIR

XP_038972405.1 uncharacterized protein LOC120104748 [Phoenix dactylifera]9.3e-21938.42Show/hide
Query:  NDRTRAIRAYVVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHLKSFLGVSDSFVIQAVPRDALRLTLFPYSLRDGAKSWLNSF
        N   R +  Y VP  N   P I RP + A NFE+KP + QM+Q   QF G  SEDPH HL +FL + D+  +  V  DA+RL LFP+SL+D AK+WLNS 
Subjt:  NDRTRAIRAYVVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHLKSFLGVSDSFVIQAVPRDALRLTLFPYSLRDGAKSWLNSF

Query:  APGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQ--------------------
        AP S  TW+ L++ FLSKYFPP + AKLR++I  F Q + E+  EAWERFK+L RKCPHHGLP  + ++TFYNGL    +                    
Subjt:  APGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQ--------------------

Query:  ---ERISINSCQWSDIRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPNNPASVFFVGN--
           E ++ N+ QWS+ R   KKV  + +VDG++ + A +  +      +  ++    P +            +C  CG  H    C      V FV N  
Subjt:  ---ERISINSCQWSDIRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPNNPASVFFVGN--

Query:  ---QRNNPYSNFYNPGWHNHPNFSWGGQGNNVQAQQKMNQPGF--------------------AKAFDE-----------------SPGIASGSLASELK
           Q+NNPYSN YNPGW NHPNFSW  QGN   + + ++ PGF                    A A  E                 +  I  G LA+ + 
Subjt:  ---QRNNPYSNFYNPGWHNHPNFSWGGQGNNVQAQQKMNQPGF--------------------AKAFDE-----------------SPGIASGSLASELK

Query:  ARPQGKLPSDTEHPRREVKEQVKAVSLRSGKPLEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYVPP---------------
        +R QG LPS TE      KE  KAV+LRSGK L +      + +  D   V +K  E  +        A     +P VE PYVPP               
Subjt:  ARPQGKLPSDTEHPRREVKEQVKAVSLRSGKPLEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYVPP---------------

Query:  -------------HLMYHLYLFHKGKSLRIRMDILTKKKRLGEFETVSLTDECSAILKNGLPPKGKDPGSFIIHVSIGGKELGRALCDLGASINLMPLSV
                     ++ +   L       +   +I++KK++L +FET++LT+ECSAI++N LPPK +DPGSF I  +IG  +  RALCDLGAS++LMPLSV
Subjt:  -------------HLMYHLYLFHKGKSLRIRMDILTKKKRLGEFETVSLTDECSAILKNGLPPKGKDPGSFIIHVSIGGKELGRALCDLGASINLMPLSV

Query:  YRKLGIGEARPTTVTLQLADRTITYPEGKTEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYP
         RKLG+ E +PTT++LQLADR++ YP G  E+VL+KV KFI PVDFI+L+ E D ++PIILGRPFLAT  A+IDV+ G LT++V  EEV+FN+F+A KYP
Subjt:  YRKLGIGEARPTTVTLQLADRTITYPEGKTEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYP

Query:  --------------------------DEMEDC------SFIRILESTVIETAIQDSADKHSEK-------------------------------------
                                  + +E C      S    LE   +  A++ +  K  ++                                     
Subjt:  --------------------------DEMEDC------SFIRILESTVIETAIQDSADKHSEK-------------------------------------

Query:  --HGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLGAGIIYPIGDSNWVS
             TLP+IV+  L  E  + LI++L+  +KAIGWT++D++GIS S CMH+I +E+     +E QRRLNP MKEVV+ EV+KWL AGIIYPI DS W+S
Subjt:  --HGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLGAGIIYPIGDSNWVS

Query:  PFQCVPKKGGVTVVSNKDNELIPTRT-----------------------------MLDRLAGQAYYCFLDGYSGYNQVTIAPEDQEKTTFTCPYGTFAFR
        P Q VPKKGG+TVV N++NELIPTRT                             +L+RLAG AYYCFLDGYSGYNQ++I+PEDQEKTTFTCPYGTFAFR
Subjt:  PFQCVPKKGGVTVVSNKDNELIPTRT-----------------------------MLDRLAGQAYYCFLDGYSGYNQVTIAPEDQEKTTFTCPYGTFAFR

Query:  RMPFGLY---------------------------------------------------------------------------------------------
        RMPFGL                                                                                              
Subjt:  RMPFGLY---------------------------------------------------------------------------------------------

Query:  -------------------------------------------------CRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIH
                                                         C  AF  LK  L+SAPI+ AP+W+LPFE+MCDASD A+ A+LGQ++ + +H
Subjt:  -------------------------------------------------CRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIH

Query:  PIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVASKVTVFTDHAAIR
         IYYAS+VLN AQ+NY TTEKELLAVVFAF+KFR YLV SKV V+TDH+AI+
Subjt:  PIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVASKVTVFTDHAAIR

XP_038973683.1 uncharacterized protein LOC120105384 [Phoenix dactylifera]9.3e-21938.42Show/hide
Query:  NDRTRAIRAYVVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHLKSFLGVSDSFVIQAVPRDALRLTLFPYSLRDGAKSWLNSF
        N   R +  Y VP  N   P I RP + A NFE+KP + QM+Q   QF G  SEDPH HL +FL + D+  +  V  DA+RL LFP+SL+D AK+WLNS 
Subjt:  NDRTRAIRAYVVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHLKSFLGVSDSFVIQAVPRDALRLTLFPYSLRDGAKSWLNSF

Query:  APGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQ--------------------
        AP S  TW+ L++ FLSKYFPP + AKLR++I  F Q + E+  EAWERFK+L RKCPHHGLP  + ++TFYNGL    +                    
Subjt:  APGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQ--------------------

Query:  ---ERISINSCQWSDIRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPNNPASVFFVGN--
           E ++ N+ QWS+ R   KKV  + +VDG++ + A +  +      +  ++    P +            +C  CG  H    C      V FV N  
Subjt:  ---ERISINSCQWSDIRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPNNPASVFFVGN--

Query:  ---QRNNPYSNFYNPGWHNHPNFSWGGQGNNVQAQQKMNQPGF--------------------AKAFDE-----------------SPGIASGSLASELK
           Q+NNPYSN YNPGW NHPNFSW  QGN   + + ++ PGF                    A A  E                 +  I  G LA+ + 
Subjt:  ---QRNNPYSNFYNPGWHNHPNFSWGGQGNNVQAQQKMNQPGF--------------------AKAFDE-----------------SPGIASGSLASELK

Query:  ARPQGKLPSDTEHPRREVKEQVKAVSLRSGKPLEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYVPP---------------
        +R QG LPS TE      KE  KAV+LRSGK L +      + +  D   V +K  E  +        A     +P VE PYVPP               
Subjt:  ARPQGKLPSDTEHPRREVKEQVKAVSLRSGKPLEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYVPP---------------

Query:  -------------HLMYHLYLFHKGKSLRIRMDILTKKKRLGEFETVSLTDECSAILKNGLPPKGKDPGSFIIHVSIGGKELGRALCDLGASINLMPLSV
                     ++ +   L       +   +I++KK++L +FET++LT+ECSAI++N LPPK +DPGSF I  +IG  +  RALCDLGAS++LMPLSV
Subjt:  -------------HLMYHLYLFHKGKSLRIRMDILTKKKRLGEFETVSLTDECSAILKNGLPPKGKDPGSFIIHVSIGGKELGRALCDLGASINLMPLSV

Query:  YRKLGIGEARPTTVTLQLADRTITYPEGKTEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYP
         RKLG+ E +PTT++LQLADR++ YP G  E+VL+KV KFI PVDFI+L+ E D ++PIILGRPFLAT  A+IDV+ G LT++V  EEV+FN+F+A KYP
Subjt:  YRKLGIGEARPTTVTLQLADRTITYPEGKTEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYP

Query:  --------------------------DEMEDC------SFIRILESTVIETAIQDSADKHSEK-------------------------------------
                                  + +E C      S    LE   +  A++ +  K  ++                                     
Subjt:  --------------------------DEMEDC------SFIRILESTVIETAIQDSADKHSEK-------------------------------------

Query:  --HGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLGAGIIYPIGDSNWVS
             TLP+IV+  L  E  + LI++L+  +KAIGWT++D++GIS S CMH+I +E+     +E QRRLNP MKEVV+ EV+KWL AGIIYPI DS W+S
Subjt:  --HGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLGAGIIYPIGDSNWVS

Query:  PFQCVPKKGGVTVVSNKDNELIPTRT-----------------------------MLDRLAGQAYYCFLDGYSGYNQVTIAPEDQEKTTFTCPYGTFAFR
        P Q VPKKGG+TVV N++NELIPTRT                             +L+RLAG AYYCFLDGYSGYNQ++I+PEDQEKTTFTCPYGTFAFR
Subjt:  PFQCVPKKGGVTVVSNKDNELIPTRT-----------------------------MLDRLAGQAYYCFLDGYSGYNQVTIAPEDQEKTTFTCPYGTFAFR

Query:  RMPFGLY---------------------------------------------------------------------------------------------
        RMPFGL                                                                                              
Subjt:  RMPFGLY---------------------------------------------------------------------------------------------

Query:  -------------------------------------------------CRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIH
                                                         C  AF  LK  L+SAPI+ AP+W+LPFE+MCDASD A+ A+LGQ++ + +H
Subjt:  -------------------------------------------------CRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIH

Query:  PIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVASKVTVFTDHAAIR
         IYYAS+VLN AQ+NY TTEKELLAVVFAF+KFR YLV SKV V+TDH+AI+
Subjt:  PIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVASKVTVFTDHAAIR

XP_038976300.1 uncharacterized protein LOC120107204 [Phoenix dactylifera]4.2e-21938.42Show/hide
Query:  NDRTRAIRAYVVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHLKSFLGVSDSFVIQAVPRDALRLTLFPYSLRDGAKSWLNSF
        N   R +  Y VP  N   P I RP + A NFE+KP + QM+Q   QF G  SEDPH HL +FL + D+  +  V  DA+RL LFP+SL+D AK+WLNS 
Subjt:  NDRTRAIRAYVVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHLKSFLGVSDSFVIQAVPRDALRLTLFPYSLRDGAKSWLNSF

Query:  APGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQ--------------------
        AP S  TW+ L++ FLSKYFPP + AKLR++I  F Q + E+  EAWERFK+L RKCPHHGLP  + ++TFYNGL    +                    
Subjt:  APGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQ--------------------

Query:  ---ERISINSCQWSDIRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPNNPASVFFVGN--
           E ++ N+ QWS+ R   KKV  + +VDG++ + A +  +      +  ++    P +            +C  CG  H    C      V FV N  
Subjt:  ---ERISINSCQWSDIRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPNNPASVFFVGN--

Query:  ---QRNNPYSNFYNPGWHNHPNFSWGGQGNNVQAQQKMNQPGF--------------------AKAFDE-----------------SPGIASGSLASELK
           Q+NNPYSN YNPGW NHPNFSW  QGN   + + ++ PGF                    A A  E                 +  I  G LA+ + 
Subjt:  ---QRNNPYSNFYNPGWHNHPNFSWGGQGNNVQAQQKMNQPGF--------------------AKAFDE-----------------SPGIASGSLASELK

Query:  ARPQGKLPSDTEHPRREVKEQVKAVSLRSGKPLEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYVPP---------------
        +R QG LPS TE      KE  KAV+LRSGK L +      + +  D   V +K  E  +        A     +P VE PYVPP               
Subjt:  ARPQGKLPSDTEHPRREVKEQVKAVSLRSGKPLEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYVPP---------------

Query:  -------------HLMYHLYLFHKGKSLRIRMDILTKKKRLGEFETVSLTDECSAILKNGLPPKGKDPGSFIIHVSIGGKELGRALCDLGASINLMPLSV
                     ++ +   L       +   +I++KK++L +FET++LT+ECSAI++N LPPK +DPGSF I  +IG  +  RALCDLGAS++LMPLSV
Subjt:  -------------HLMYHLYLFHKGKSLRIRMDILTKKKRLGEFETVSLTDECSAILKNGLPPKGKDPGSFIIHVSIGGKELGRALCDLGASINLMPLSV

Query:  YRKLGIGEARPTTVTLQLADRTITYPEGKTEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYP
         RKLG+ E +PTT++LQLADR++ YP G  E+VL+KV KFI PVDFI+L+ E D ++PIILGRPFLAT  A+IDV+ G LT++V  EEV+FN+F+A KYP
Subjt:  YRKLGIGEARPTTVTLQLADRTITYPEGKTEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYP

Query:  --------------------------DEMEDC------SFIRILESTVIETAIQDSADKHSEK-------------------------------------
                                  + +E C      S    LE   +  A++ +  K  ++                                     
Subjt:  --------------------------DEMEDC------SFIRILESTVIETAIQDSADKHSEK-------------------------------------

Query:  --HGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLGAGIIYPIGDSNWVS
             TLP+IV+  L +E  + LI++L+  +KAIGWT++D++GIS S CMH+I +E+     +E QRRLNP MKEVV+ EV+KWL AGIIYPI DS W+S
Subjt:  --HGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLGAGIIYPIGDSNWVS

Query:  PFQCVPKKGGVTVVSNKDNELIPTRT-----------------------------MLDRLAGQAYYCFLDGYSGYNQVTIAPEDQEKTTFTCPYGTFAFR
        P Q VPKKGG+TVV N++NELIPTRT                             +L+RLAG AYYCFLDGYSGYNQ++I+PEDQEKTTFTCPYGTFAFR
Subjt:  PFQCVPKKGGVTVVSNKDNELIPTRT-----------------------------MLDRLAGQAYYCFLDGYSGYNQVTIAPEDQEKTTFTCPYGTFAFR

Query:  RMPFGLY---------------------------------------------------------------------------------------------
        RMPFGL                                                                                              
Subjt:  RMPFGLY---------------------------------------------------------------------------------------------

Query:  -------------------------------------------------CRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIH
                                                         C  AF  LK  L+SAPI+ AP+W+LPFE+MCDASD A+ A+LGQ++ + +H
Subjt:  -------------------------------------------------CRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIH

Query:  PIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVASKVTVFTDHAAIR
         IYYAS+VLN AQ+NY TTEKELLAVVFAF+KFR YLV SKV V+TDH+AI+
Subjt:  PIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVASKVTVFTDHAAIR

TrEMBL top hitse value%identityAlignment
A0A2G9HH15 Reverse transcriptase2.6e-22739.61Show/hide
Query:  SRINPLFEQNEQQNNQAENPILIAND--RTRAIRAYVVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHLKSFLGVSDSFVIQA
        +R   L E  EQ+    EN I+I  D      +R   +P   E    +  P++  A  +++  M +M+Q   QF GL+ E+P+ H+ +FL + D+   + 
Subjt:  SRINPLFEQNEQQNNQAENPILIAND--RTRAIRAYVVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHLKSFLGVSDSFVIQA

Query:  VPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNG
        V +DALRL LF +SL   A  W  S    SI TW +L E+F+SK+F P + A LR+EI+ FRQ   ET  EAW RF+++LR CP+H +P  IQ+ TFY+G
Subjt:  VPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNG

Query:  -----------------LNGVTQE------RISINSCQWSDIRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEAC
                         L+G T E       +  N  +    R T  K   V+EVD V+ + A +  +  ++KN  V   Q  P               C
Subjt:  -----------------LNGVTQE------RISINSCQWSDIRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEAC

Query:  VYCGEDHNYEFCPNNPASVFFVGNQR---NNPYSNFYNPGWHNHPNFSWGG-----------QGNNVQAQQKMNQ--PGFAKAFDESPGIAS--------
          CGE H  + CP++  S+ FV N R   NNPYSN YNPGW  HPNFSW             QG   Q QQ M +  P   +   +     +        
Subjt:  VYCGEDHNYEFCPNNPASVFFVGNQR---NNPYSNFYNPGWHNHPNFSWGG-----------QGNNVQAQQKMNQ--PGFAKAFDESPGIAS--------

Query:  --GSLASELKARPQGKLPSDTE-HPRREVKEQVKAVSLRSGKPLEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYVPPHLMY
          G LA+ + +RP+  LPS+TE +PR++ K Q +AV+LR+G  L+E  K  +   + +K ++ E++        G    A       +  P YV      
Subjt:  --GSLASELKARPQGKLPSDTE-HPRREVKEQVKAVSLRSGKPLEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYVPPHLMY

Query:  HLYLFHKGKSLRIRMDILTKKKRLGEFETVSLTDECSAILKNGLPPKGKDPGSFIIHVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQ
                   +   DIL+KK+RLG++ETV+LT+ECSAI++N LPPK KDPGSF I  +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQ
Subjt:  HLYLFHKGKSLRIRMDILTKKKRLGEFETVSLTDECSAILKNGLPPKGKDPGSFIIHVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQ

Query:  LADRTITYPEGKTEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV-
        LADR++TYP G  ED+LVKVDKFIFP DF++LD E D +VPIILGRPFLATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + + +    
Subjt:  LADRTITYPEGKTEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV-

Query:  -----------IETAIQDSADKHSEK--------------------------------------------------------HGETLPIIVASDLMSEHE
                   +E A+ D  D+ +E+                                                          +TLP+I++S L     
Subjt:  -----------IETAIQDSADKHSEK--------------------------------------------------------HGETLPIIVASDLMSEHE

Query:  EALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLGAGIIYPIGDSNWVSPFQCVPKKGGVTVVSNKDNE
        E L+++L+ ++ AIGWT+ADI+GIS SFCMHKI LE+G   S+E QRRLNP MKEVVKKE+IKWL AGIIYPI DS+WVSP QCVPKKGG+TVV N  NE
Subjt:  EALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLGAGIIYPIGDSNWVSPFQCVPKKGGVTVVSNKDNE

Query:  LIPTRT-----------------------------MLDRLAGQAYYCFLDGYSGYNQVTIAPEDQEKTTFTCPYGTFAFRRMPFGL--------------
        LIPTRT                             MLDRLAG+ +YCFLDGYSGYNQ+ I PEDQEKTTFTCPYGTF FR+MPFGL              
Subjt:  LIPTRT-----------------------------MLDRLAGQAYYCFLDGYSGYNQVTIAPEDQEKTTFTCPYGTFAFRRMPFGL--------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------YCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTE
                                     C   F  LK  LISAPI+ AP+W+ PFE+MCDASD A+ A+LGQ++ K    IYYASK LN+AQ+NYTTTE
Subjt:  ----------------------------YCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTE

Query:  KELLAVVFAFEKFRPYLVASKVTVFTDHAAIR
        KELLAVVFAF+KFR YLV +KV V+TDHAAIR
Subjt:  KELLAVVFAFEKFRPYLVASKVTVFTDHAAIR

A0A2G9HWF8 Reverse transcriptase8.3e-21338.1Show/hide
Query:  SRINPLFEQNEQQNNQAENPILIAND--RTRAIRAYVVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHLKSFLGVSDSFVIQA
        +R   L E  EQ+    EN I++  D      +R   +P   E    +  P++  A  +++  M QM+Q   QF GL+ E+P+ H+ +FL + D+   + 
Subjt:  SRINPLFEQNEQQNNQAENPILIAND--RTRAIRAYVVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHLKSFLGVSDSFVIQA

Query:  VPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNG
        V +DALRL LF +SL   A  W  S    SI TW +L E+F+SK+F P + A LR+EI+ FRQ   ET  EAW RF+++LR CP+H +P  IQ+ TFY+G
Subjt:  VPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNG

Query:  -----------------LNGVTQE------RISINSCQWSDIRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEAC
                         L+G T E       + +N  +    R T  K   V+EVD V+ + A +  +  ++KN                          
Subjt:  -----------------LNGVTQE------RISINSCQWSDIRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEAC

Query:  VYCGEDHNYEFCPNNPASVFFVGNQR---NNPYSNFYNPGWHNHPNFSWGGQGNNVQAQQKMNQPGFAKAFDESPGIASGSLASELKARPQGKLPSDTEH
          CGE H  + CP++  S+ FV N R   NNPYSN YNPGW  HPNFSW    NN Q Q      G A  F                   QG   +   +
Subjt:  VYCGEDHNYEFCPNNPASVFFVGNQR---NNPYSNFYNPGWHNHPNFSWGGQGNNVQAQQKMNQPGFAKAFDESPGIASGSLASELKARPQGKLPSDTEH

Query:  PRREVKEQVKAVSLRSGKPLEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPD-VEPPYVPPHLMYHLYLFHK--------------GK
        PR++ K Q +AV+LR+G+ L+E  K  +   + +K +  E++ +  +     +K        P  ++   +    +  L +F K                
Subjt:  PRREVKEQVKAVSLRSGKPLEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPD-VEPPYVPPHLMYHLYLFHK--------------GK

Query:  SLRIRMDILTKKKRLGEFETVSLTDECSAILKNGLPPKGKDPGSFIIHVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRTITYP
         ++   DIL+KK+RLG++ETV+LT+ECSAI++N LPPK KDPG              RALCDLGASINLMP S+YR LG+ EA+PT++TLQLADR++TYP
Subjt:  SLRIRMDILTKKKRLGEFETVSLTDECSAILKNGLPPKGKDPGSFIIHVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRTITYP

Query:  EGKTEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV----------
        +G  ED+LVKVDKFIFP DF++LD E D +VPIILGRPFLATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + + ++            
Subjt:  EGKTEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV----------

Query:  --IETAIQDSADKHSEK--------------------------------------------------------HGETLPIIVASDLMSEHEEALIKLLQQ
          +E A+ D  D+ +E+                                                          +TLP+I++S L     E L+++L+ 
Subjt:  --IETAIQDSADKHSEK--------------------------------------------------------HGETLPIIVASDLMSEHEEALIKLLQQ

Query:  YRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLGAGIIYPIGDSNWVSPFQCVPKKGGVTVVSNKDNELIPTRT---
        ++ AIGWT+ADI+GIS SFCMHKI LE+    S+E QRRLNP MKEVVKKE+IKWL AGIIYPI D +W+SP QCVPKKGG+TVV N  NE IPT+T   
Subjt:  YRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLGAGIIYPIGDSNWVSPFQCVPKKGGVTVVSNKDNELIPTRT---

Query:  --------------------------MLDRLAGQAYYCFLDGYSGYNQVTIAPEDQEKTTFTCPYGTFAFRRMPF-------------------------
                                  MLDRLAG+ +YCFLDGYSGYNQ+ IAPEDQEKTTFTCPYGTFAFRR+PF                         
Subjt:  --------------------------MLDRLAGQAYYCFLDGYSGYNQVTIAPEDQEKTTFTCPYGTFAFRRMPF-------------------------

Query:  ----------------------------------------------------------------------------------------GLY---------
                                                                                                G Y         
Subjt:  ----------------------------------------------------------------------------------------GLY---------

Query:  --------------------CRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFA
                            C  AF+ LK  LISAPI+  P+W+ PFE+MCDASD A+ A+LGQ++ K    IYYASK LN+AQ+NYTTTEKELLAVVFA
Subjt:  --------------------CRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFA

Query:  FEKFRPYLVASKVTVFTDHAAIR
        F+KFR YLV +KV V+TDHAAIR
Subjt:  FEKFRPYLVASKVTVFTDHAAIR

A0A2G9HYA0 Reverse transcriptase5.6e-20140.11Show/hide
Query:  FRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNG-----------------LNGVTQE------RISINSCQWSDIRGTNKKVKSVLEVDGVST
        FRQ   ET  EAW RF+++LR CP+H +P  IQ+ TFY+G                 L+G T E       +  N  +    R T  K   V+EVD V+ 
Subjt:  FRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNG-----------------LNGVTQE------RISINSCQWSDIRGTNKKVKSVLEVDGVST

Query:  IRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPNNPASVFFVGNQR---NNPYSNFYNPGWHNHPNFSWG---GQGNNVQ
        + A +  +  ++KN  V   Q  P               C  CGE H  + CP++  S+ FV N R   NNPYSN YNPGW  HPNFSW    GQG+  +
Subjt:  IRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPNNPASVFFVGNQR---NNPYSNFYNPGWHNHPNFSWG---GQGNNVQ

Query:  AQQKMNQPGFAKAFDESPGIAS--------------------GSLASELKARPQGKLPSDTE-HPRREVKEQVKAVSLRSGKPLEESRKTQDLNNNSDKN
         QQ   Q       ++ P +                      G LA+ + +RPQG LPS+TE +PR++ K Q +AV+LR+G+ L+E  K  +   + +K 
Subjt:  AQQKMNQPGFAKAFDESPGIAS--------------------GSLASELKARPQGKLPSDTE-HPRREVKEQVKAVSLRSGKPLEESRKTQDLNNNSDKN

Query:  IVVEKELESGQGAGGSNKNAGASGSVPD-VEPPYVPPHLMYHLYLFHK--------------GKSLRIRMDILTKKKRLGEFETVSLTDECSAILKNGLP
        ++ E++ +  +     +K        P  ++   +    +  L +F K                 ++   DIL+KK+RLG++ETV+LT+ECSAI++N LP
Subjt:  IVVEKELESGQGAGGSNKNAGASGSVPD-VEPPYVPPHLMYHLYLFHK--------------GKSLRIRMDILTKKKRLGEFETVSLTDECSAILKNGLP

Query:  PKGKDPGSFIIHVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRTITYPEGKTEDVLVKVDKFIFPVDFIILDYEADKDVPIILG
        PK KDPGSF I  +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADR++TYP+G  ED+LVKVDKFIFP DF++LD E D +VPIILG
Subjt:  PKGKDPGSFIIHVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRTITYPEGKTEDVLVKVDKFIFPVDFIILDYEADKDVPIILG

Query:  RPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IETAIQDSADKHSEK-------------------
        RPFLATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + + +               +E A+ D  D+ +E+                   
Subjt:  RPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IETAIQDSADKHSEK-------------------

Query:  -------------------------------------HGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQ
                                               +TLP+I++S L     E L+++L+ ++ AIGWT+ADI+GIS SFCMHKI LE+    S+E 
Subjt:  -------------------------------------HGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQ

Query:  QRRLNPAMKEVVKKEVIKWLGAGIIYPIGDSNWVSPFQCVPKKGGVTVVSNKDNELIPTRT-----------------------------MLDRLAGQAY
        QRRLNP MKEVVKKE+IKWL AGIIYPI DS+WVSP QCVPKKGG+TVV N  NELIPTRT                             MLDRLAG+ +
Subjt:  QRRLNPAMKEVVKKEVIKWLGAGIIYPIGDSNWVSPFQCVPKKGGVTVVSNKDNELIPTRT-----------------------------MLDRLAGQAY

Query:  YCFLDGYSGYNQVTIAPEDQEKTTFTCPYGTFAFRRMPFGL-----------------------------------------------------------
        YCFLDGYSGYNQ+ IAPEDQEKTTFTCPYGTFAFRRMPFGL                                                           
Subjt:  YCFLDGYSGYNQVTIAPEDQEKTTFTCPYGTFAFRRMPFGL-----------------------------------------------------------

Query:  -----------------------------------------------------------------------------------YCRKAFETLKAALISAP
                                                                                            C  AF  LK  LISAP
Subjt:  -----------------------------------------------------------------------------------YCRKAFETLKAALISAP

Query:  ILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVASKVTVFTDHAAIR
        I+  P+W+ PFE+MCDASD AV A+LGQ++ K    IYYASK LN+AQ+NYTTTEKELLAVVFAF+KFR YLV +KV V+TDHAAIR
Subjt:  ILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVASKVTVFTDHAAIR

A0A2G9HYD8 Reverse transcriptase7.8e-19539.45Show/hide
Query:  FRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNG-----------------LNGVTQE------RISINSCQWSDIRGTNKKVKSVLEVDGVST
        FRQ   ET  EAW RF+++LR CP+H +P  IQ+ TFY+G                 L+G T E       +  N  +    R T  K   V+EVD V+ 
Subjt:  FRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNG-----------------LNGVTQE------RISINSCQWSDIRGTNKKVKSVLEVDGVST

Query:  IRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPNNPASVFFVGNQR---NNPYSNFYNPGWHNHPNFSWG---GQGNNVQ
        + A +  +  ++KN                              E H  + CP++  S+ FV N R   NNPYSN YNPGW  HPNFSW    GQG+  +
Subjt:  IRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPNNPASVFFVGNQR---NNPYSNFYNPGWHNHPNFSWG---GQGNNVQ

Query:  AQQKMNQPGFAKAFDESPGIAS--------------------GSLASELKARPQGKLPSDTE-HPRREVKEQVKAVSLRSGKPLEESRKTQDLNNNSDKN
         QQ   QP      ++ P +                      G LA+ + +RPQG LPS+TE +PR++ K Q +AV+LR+G+ L+E  K       S + 
Subjt:  AQQKMNQPGFAKAFDESPGIAS--------------------GSLASELKARPQGKLPSDTE-HPRREVKEQVKAVSLRSGKPLEESRKTQDLNNNSDKN

Query:  IVVEKELESGQGAGGSNKNAGASGSVP-DVEPPY--------VPPHLMYHLYLFHK--------------GKSLRIRMDILTKKKRLGEFETVSLTDECS
         V+ KE E           A    S P  ++PP+        +    +  L +F K                 ++   DIL+KK+RLG++ET +LT+EC+
Subjt:  IVVEKELESGQGAGGSNKNAGASGSVP-DVEPPY--------VPPHLMYHLYLFHK--------------GKSLRIRMDILTKKKRLGEFETVSLTDECS

Query:  AILKNGLPPKGKDPGSFIIHVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRTITYPEGKTEDVLVKVDKFIFPVDFIILDYEAD
        AI++N LPPK KDPGSF I  +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADR++TYP+G  ED+LVKVDKFIFP DF++LD E D
Subjt:  AILKNGLPPKGKDPGSFIIHVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRTITYPEGKTEDVLVKVDKFIFPVDFIILDYEAD

Query:  KDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IETAIQDSADKHSEK-----------
         +VPIILGRPFLATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + + ++              +E A+ D  ++ +E+           
Subjt:  KDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IETAIQDSADKHSEK-----------

Query:  ---------------------------------------------HGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEE
                                                       +TLP+I++S L     E L+++L+ ++ AIGWT+ADI+GIS SFCMHKI LE+
Subjt:  ---------------------------------------------HGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEE

Query:  GSFRSIEQQRRLNPAMKEVVKKEVIKWLGAGIIYPIGDSNWVSPFQCVPKKGGVTVVSNKDNELIPTRT-----------------------------ML
            S+E QRRLN  MKEVVKKE+IKWL AGIIYPI DS+WVSP QCVPKKGG+TVV N  NELIPTRT                             ML
Subjt:  GSFRSIEQQRRLNPAMKEVVKKEVIKWLGAGIIYPIGDSNWVSPFQCVPKKGGVTVVSNKDNELIPTRT-----------------------------ML

Query:  DRLAGQAYYCFLDGYSGYNQVTIAPEDQEKTTFTCPYGTFAFRRMPFGL---------------------------------------------------
        DRLAG+ +YCFLDGYSGYNQ+ IAPEDQEKTTFTCPYGTFAFRRMPFGL                                                   
Subjt:  DRLAGQAYYCFLDGYSGYNQVTIAPEDQEKTTFTCPYGTFAFRRMPFGL---------------------------------------------------

Query:  -------------------------------------------------------------------------------------------YCRKAFETL
                                                                                                    C  AF+ L
Subjt:  -------------------------------------------------------------------------------------------YCRKAFETL

Query:  KAALISAPILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVASKVTVFTDHAAIR
        K  LISAPI+  P+W+ PFE+MCDASD A+ A+LGQ++ K    IYYASK LN+AQ+NYTTTEKELLAVVFAF+KFR YLV +KV V+TDHAAIR
Subjt:  KAALISAPILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVASKVTVFTDHAAIR

A0A6P6XAQ1 Reverse transcriptase4.4e-20637.1Show/hide
Query:  RAIRAYVVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHLKSFLGVSDSFVIQAVPRDALRLTLFPYSLRDGAKSWLNSFAPGS
        R +R + +P        I RP + A NFE+KP + QM+Q   Q+ G  +EDP+ HL +FL + D+     V  DA++L LFP+SLRD AK WL S  P +
Subjt:  RAIRAYVVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHLKSFLGVSDSFVIQAVPRDALRLTLFPYSLRDGAKSWLNSFAPGS

Query:  IRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQ-----------------------E
          TWDELA+ FL+K+FPP + AKLR +I  F Q E ET  EAWER++EL R+CPHHGLP  + ++TFYNGL   T+                       E
Subjt:  IRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQ-----------------------E

Query:  RISINSCQWSDIRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPNNPASVFFVGN----QR
         ++ N+ QW++ RG +++   +LEVD        L M++  + NV  + ++Q  +     VV      +C  CG DH+ +F  ++   V ++ N     +
Subjt:  RISINSCQWSDIRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPNNPASVFFVGN----QR

Query:  NNPYSNFYNPGWHNHPNFSWGGQGNNVQAQQKMNQPGF-----------------------------------AKAFDESPG-------------IASGS
        NNPYSN YNPGW NHPNF W  QGN    Q+ +N PGF                                    + F+   G             +  G 
Subjt:  NNPYSNFYNPGWHNHPNFSWGGQGNNVQAQQKMNQPGF-----------------------------------AKAFDESPG-------------IASGS

Query:  LASELKARPQGKLPSDTE-HPRREVKEQVKAVSLRSGKPLEES---RKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYVPPHLMYH
        +A+ +  R QG LPS TE +PR    E VKA++LRSGK L E       ++     +K +    EL+ G       +    +    +   P  PP   Y 
Subjt:  LASELKARPQGKLPSDTE-HPRREVKEQVKAVSLRSGKPLEES---RKTQDLNNNSDKNIVVEKELESGQGAGGSNKNAGASGSVPDVEPPYVPPHLMYH

Query:  LYLFHKGKSLRIRMDILTKKKRLGEFETVSLTDECSAILKNGLPPKGKDPGSFIIHVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQL
         +L           +I+TKK++L + ET++LT+ECSAI++N LPPK KDPGSF +  +IG  E  +ALCDLGAS++L+PL+V R+LG+ E + T ++LQL
Subjt:  LYLFHKGKSLRIRMDILTKKKRLGEFETVSLTDECSAILKNGLPPKGKDPGSFIIHVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQL

Query:  ADRTITYPEGKTEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIE
        ADR+I +P G  E+VL+KV KFI PVDF++LD E D +VPIILGRPFLAT   +IDV++G+   ++  EEV+F++ K  KYP   +    + I +   +E
Subjt:  ADRTITYPEGKTEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIE

Query:  TA-----------------IQD----------------------------------SADKHSE-------KH--------GETLPIIVASDLMSEHEEAL
         +                 IQ+                                  S +K  +       KH         ETLP+IV S L  E  + L
Subjt:  TA-----------------IQD----------------------------------SADKHSE-------KH--------GETLPIIVASDLMSEHEEAL

Query:  IKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLGAGIIYPIGDSNWVSPFQCVPKKGGVTVVSNKDNELIP
        +++L+++ KAIGWT++DI+GIS + CMH+I LEE S   +E QRRLNP MKEVV+ E++KWL AGII+PI DS W+SP   VPKKGG+T+V  K++E+IP
Subjt:  IKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLGAGIIYPIGDSNWVSPFQCVPKKGGVTVVSNKDNELIP

Query:  TRT-----------------------------MLDRLAGQAYYCFLDGYSGYNQVTIAPEDQEKTTFTCPYGTFAFRRMPFGL-----------------
        +R                              M++RLAG  +YCFLDG+SGYNQ+ IAPEDQEKTTFTCPYGTFAFRRMPFGL                 
Subjt:  TRT-----------------------------MLDRLAGQAYYCFLDGYSGYNQVTIAPEDQEKTTFTCPYGTFAFRRMPFGL-----------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------YCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKEL
                                  C  AF  LK  L+SAPI+ +P+W+LPFE+MCDASD AV A+LGQK  K +H IYYASK+LNEAQVNY TTEKEL
Subjt:  -------------------------YCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKEL

Query:  LAVVFAFEKFRPYLVASKVTVFTDHAAIR
        LAV+FA +KFR YLV SKV V+TDHAA++
Subjt:  LAVVFAFEKFRPYLVASKVTVFTDHAAIR

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.9e-1343.75Show/hide
Query:  AFETLKAALISAPILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVASKVTVFTDH
        AF+ LK  +   PIL  P++   F +  DASDVA+ A+L Q      HP+ Y S+ LNE ++NY+T EKELLA+V+A + FR YL+     + +DH
Subjt:  AFETLKAALISAPILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVASKVTVFTDH

P10394 Retrovirus-related Pol polyprotein from transposon 4127.9e-1139.39Show/hide
Query:  CRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVASKVTVFTDH
        C+KAF  LK+ LI+  +L  P+++  F +  DAS  A  A+L Q       P+ YAS+   + + N +TTE+EL A+ +A   FRPY+     TV TDH
Subjt:  CRKAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVASKVTVFTDH

P10401 Retrovirus-related Pol polyprotein from transposon gypsy2.2e-0838Show/hide
Query:  RKAFETLKAALISAP-ILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVASK-VTVFTDH
        R AF+ L+  L S   IL  P++  PF++  DAS   + A+L Q+      PI   S+ L + + NY T E+ELLA+V+A  K + +L  S+ + +FTDH
Subjt:  RKAFETLKAALISAP-ILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVASK-VTVFTDH

P10401 Retrovirus-related Pol polyprotein from transposon gypsy3.4e-0634.07Show/hide
Query:  MKEVVKKEVIKWLGAGIIYPIGDSNWVSPFQCVPKKG---------GVTVVSNKDNE-LIPTR-------TMLDRLAGQAYYCFLDGYSGYNQVTIAPED
        + + V  EV + L  GII P   S + SP   V KKG          + +   K NE  IP R        +L  L    ++  LD  SGY+Q+ +A  D
Subjt:  MKEVVKKEVIKWLGAGIIYPIGDSNWVSPFQCVPKKG---------GVTVVSNKDNE-LIPTR-------TMLDRLAGQAYYCFLDGYSGYNQVTIAPED

Query:  QEKTTFTCPYGTFAFRRMPFGLYCRKAFETLKAAL
        +EKT+F+   G + F R+PFGL  R A    + AL
Subjt:  QEKTTFTCPYGTFAFRRMPFGLYCRKAFETLKAAL

P20825 Retrovirus-related Pol polyprotein from transposon 2971.7e-1341.58Show/hide
Query:  KAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVASKVTVFTDHAAI
        +AFE LKA +I  PIL  P++   F +  DAS++A+ A+L Q      HPI + S+ LN+ ++NY+  EKELLA+V+A + FR YL+  +  + +DH  +
Subjt:  KAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVASKVTVFTDHAAI

Query:  R
        R
Subjt:  R

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus9.3e-1239.8Show/hide
Query:  KAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLV-ASKVTVFTDH
        ++F  LK+ L S+ IL  P +  PF +  DAS+ A+ A+L Q       PI Y S+ LN+ + NY T EKE+LA++++ +  R YL  A  + V+TDH
Subjt:  KAFETLKAALISAPILCAPNWNLPFEVMCDASDVAVAAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLV-ASKVTVFTDH

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTCGACGCAGTGGGAGGGTTGCATTACAACCTGATCGTTACTTGGGTCTAACTGAAACTCAAGTTGTCATACCTAATTATGGCATTGAGGATCCATTGTTCTATCG
ACAGACAATGAATGACGAAGATAAGGATGAATGGGTCAAAGCCATGGACCTGGAAATGGAGTCAATGTACTCCAATTCTGTATGGGAACTCGCAGATCTACCAGATGAGA
CTGCCTTTCTGAATGGTAATCTTGAAGAGAATATCTTCATGTCTCAACACGAAGGGTTTGTAGCCCAAGGTCAGGAACAAAAGGTTTGTGTGAGTTTGTTGCATGAACGA
TCCGCCTGGGGTAAGATCCTGAAGATGCAGCAAATCCCCAGCAGAATCAACCCACTGTTTGAGCAAAATGAGCAGCAAAATAATCAGGCTGAGAATCCTATCTTGATAGC
AAACGATAGGACTAGAGCCATTCGAGCATATGTTGTCCCGATGTTTAATGAGTTGAATCCAGGGATTGCACGTCCCCAAATCCAAGCGGCAAATTTTGAAATGAAACCGG
TAATGTTTCAGATGTTGCAAACTGTGGGGCAATTCCATGGTTTGACATCTGAAGACCCTCATTTACATCTTAAGTCTTTTCTAGGAGTTAGTGATTCTTTTGTAATTCAA
GCAGTGCCTAGAGATGCTCTTAGATTAACTTTGTTCCCGTATTCTCTTAGAGATGGAGCAAAGTCATGGTTAAACTCTTTTGCTCCAGGATCAATTAGGACGTGGGATGA
GTTAGCTGAAAAATTTTTGAGTAAATATTTCCCACCTAATAGAAATGCTAAATTGAGGAGTGAAATAGTAGGGTTTAGGCAACTTGAAGATGAGACTTTTAGTGAGGCTT
GGGAAAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCATGGTTTACCTCATTGTATTCAAATGGAAACATTCTACAATGGTTTAAATGGAGTAACCCAAGAGAGAATA
TCTATTAATAGTTGTCAGTGGTCGGATATTAGAGGCACGAATAAAAAAGTTAAGAGTGTATTAGAGGTTGATGGTGTGTCCACCATTAGGGCTGATCTTGCTATGATTGC
TAATGCTCTTAAGAATGTGACAGTGATTAGTCATCAACAGCCACCAGCTATGGAGCCTACTGCAGTGGTGAACCAAGTCACGGACGAAGCATGTGTCTATTGTGGTGAAG
ACCACAACTACGAGTTTTGCCCCAACAATCCAGCTTCTGTGTTTTTTGTAGGTAATCAGAGGAACAACCCTTATTCTAACTTCTATAATCCAGGTTGGCACAACCACCCC
AACTTCTCGTGGGGAGGTCAGGGAAATAACGTACAAGCGCAACAGAAGATGAACCAGCCGGGATTTGCTAAAGCCTTCGATGAGAGCCCTGGAATTGCAAGTGGGTCACT
GGCCAGTGAGCTGAAGGCAAGGCCTCAAGGGAAACTTCCCTCGGATACTGAACACCCTCGAAGGGAAGTTAAGGAGCAGGTAAAGGCAGTATCTCTTAGGAGTGGTAAGC
CACTAGAAGAGTCTAGAAAGACCCAGGATTTAAATAATAATAGTGATAAAAATATTGTTGTTGAGAAAGAGTTGGAGTCTGGTCAGGGTGCTGGAGGCAGCAATAAAAAT
GCTGGAGCATCTGGTTCTGTGCCAGATGTAGAACCACCATATGTGCCGCCCCACCTTATGTACCACCTCTACCTTTTTCACAAAGGCAAAAGCCTAAGAATCAGGATGGA
TATTTTGACTAAAAAGAAGAGGTTGGGCGAGTTTGAAACTGTATCTCTGACTGACGAGTGTAGTGCTATTCTTAAGAATGGGCTACCACCCAAGGGTAAGGATCCAGGGT
CATTTATTATACATGTGTCTATAGGTGGAAAAGAATTAGGTAGAGCACTCTGTGATTTAGGCGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGGAAGCTTGGTATT
GGTGAAGCTAGGCCTACCACAGTCACACTCCAACTAGCTGATAGGACTATCACATATCCAGAGGGTAAAACTGAGGACGTCTTAGTAAAAGTTGATAAGTTCATATTTCC
TGTTGATTTTATTATTTTAGACTACGAGGCTGATAAGGATGTCCCAATTATTCTAGGTCGTCCATTTTTGGCTACTGGTAGGGCGTTAATAGATGTTCAAAAAGGGGAAT
TAACAATGAGAGTCTGTAATGAGGAAGTAAAATTTAATGTCTTTAAAGCCATGAAATATCCAGACGAAATGGAGGATTGCTCCTTCATCAGGATTCTGGAGAGCACAGTT
ATTGAGACAGCAATACAGGATTCAGCTGATAAGCATTCGGAAAAGCATGGAGAGACGTTGCCCATTATTGTTGCATCAGATTTAATGTCGGAGCATGAAGAGGCCTTAAT
AAAATTGTTGCAACAATACCGCAAAGCTATAGGTTGGACATTGGCTGATATTCAGGGAATTAGCTCATCTTTTTGTATGCACAAAATCACTCTAGAGGAGGGATCATTTA
GGAGTATTGAGCAACAAAGAAGGCTTAACCCTGCAATGAAAGAGGTTGTTAAAAAGGAGGTAATTAAATGGTTGGGTGCTGGGATCATCTATCCAATTGGCGATAGCAAT
TGGGTGAGCCCTTTCCAATGTGTTCCTAAGAAAGGAGGTGTCACTGTGGTGAGCAATAAAGATAATGAGTTGATCCCCACTAGGACAATGTTGGATCGATTGGCTGGTCA
GGCCTACTACTGTTTCTTGGATGGTTATTCTGGGTATAACCAGGTTACTATTGCTCCTGAGGATCAGGAGAAAACCACTTTCACCTGTCCTTATGGGACATTCGCTTTTA
GGCGAATGCCTTTTGGCCTCTATTGTAGGAAGGCTTTTGAGACTTTAAAGGCTGCTTTAATCTCAGCGCCCATTCTTTGCGCACCTAATTGGAATTTACCGTTTGAGGTA
ATGTGTGATGCGAGTGATGTTGCGGTAGCTGCGATGTTGGGGCAAAAGCAGGGAAAATTTATCCATCCTATATACTATGCAAGCAAGGTTTTAAATGAAGCACAAGTCAA
CTATACAACTACTGAAAAGGAGTTGTTAGCTGTGGTGTTCGCTTTTGAGAAGTTCCGGCCATATTTGGTTGCATCCAAAGTCACGGTTTTCACGGATCATGCAGCAATAA
GGAACCCAACAGAGTTCTGTCTAATAGTATGTAATCTCAAGTCATCATGTTCTTTAAGATTTGCTCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGCCTCGACGCAGTGGGAGGGTTGCATTACAACCTGATCGTTACTTGGGTCTAACTGAAACTCAAGTTGTCATACCTAATTATGGCATTGAGGATCCATTGTTCTATCG
ACAGACAATGAATGACGAAGATAAGGATGAATGGGTCAAAGCCATGGACCTGGAAATGGAGTCAATGTACTCCAATTCTGTATGGGAACTCGCAGATCTACCAGATGAGA
CTGCCTTTCTGAATGGTAATCTTGAAGAGAATATCTTCATGTCTCAACACGAAGGGTTTGTAGCCCAAGGTCAGGAACAAAAGGTTTGTGTGAGTTTGTTGCATGAACGA
TCCGCCTGGGGTAAGATCCTGAAGATGCAGCAAATCCCCAGCAGAATCAACCCACTGTTTGAGCAAAATGAGCAGCAAAATAATCAGGCTGAGAATCCTATCTTGATAGC
AAACGATAGGACTAGAGCCATTCGAGCATATGTTGTCCCGATGTTTAATGAGTTGAATCCAGGGATTGCACGTCCCCAAATCCAAGCGGCAAATTTTGAAATGAAACCGG
TAATGTTTCAGATGTTGCAAACTGTGGGGCAATTCCATGGTTTGACATCTGAAGACCCTCATTTACATCTTAAGTCTTTTCTAGGAGTTAGTGATTCTTTTGTAATTCAA
GCAGTGCCTAGAGATGCTCTTAGATTAACTTTGTTCCCGTATTCTCTTAGAGATGGAGCAAAGTCATGGTTAAACTCTTTTGCTCCAGGATCAATTAGGACGTGGGATGA
GTTAGCTGAAAAATTTTTGAGTAAATATTTCCCACCTAATAGAAATGCTAAATTGAGGAGTGAAATAGTAGGGTTTAGGCAACTTGAAGATGAGACTTTTAGTGAGGCTT
GGGAAAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCATGGTTTACCTCATTGTATTCAAATGGAAACATTCTACAATGGTTTAAATGGAGTAACCCAAGAGAGAATA
TCTATTAATAGTTGTCAGTGGTCGGATATTAGAGGCACGAATAAAAAAGTTAAGAGTGTATTAGAGGTTGATGGTGTGTCCACCATTAGGGCTGATCTTGCTATGATTGC
TAATGCTCTTAAGAATGTGACAGTGATTAGTCATCAACAGCCACCAGCTATGGAGCCTACTGCAGTGGTGAACCAAGTCACGGACGAAGCATGTGTCTATTGTGGTGAAG
ACCACAACTACGAGTTTTGCCCCAACAATCCAGCTTCTGTGTTTTTTGTAGGTAATCAGAGGAACAACCCTTATTCTAACTTCTATAATCCAGGTTGGCACAACCACCCC
AACTTCTCGTGGGGAGGTCAGGGAAATAACGTACAAGCGCAACAGAAGATGAACCAGCCGGGATTTGCTAAAGCCTTCGATGAGAGCCCTGGAATTGCAAGTGGGTCACT
GGCCAGTGAGCTGAAGGCAAGGCCTCAAGGGAAACTTCCCTCGGATACTGAACACCCTCGAAGGGAAGTTAAGGAGCAGGTAAAGGCAGTATCTCTTAGGAGTGGTAAGC
CACTAGAAGAGTCTAGAAAGACCCAGGATTTAAATAATAATAGTGATAAAAATATTGTTGTTGAGAAAGAGTTGGAGTCTGGTCAGGGTGCTGGAGGCAGCAATAAAAAT
GCTGGAGCATCTGGTTCTGTGCCAGATGTAGAACCACCATATGTGCCGCCCCACCTTATGTACCACCTCTACCTTTTTCACAAAGGCAAAAGCCTAAGAATCAGGATGGA
TATTTTGACTAAAAAGAAGAGGTTGGGCGAGTTTGAAACTGTATCTCTGACTGACGAGTGTAGTGCTATTCTTAAGAATGGGCTACCACCCAAGGGTAAGGATCCAGGGT
CATTTATTATACATGTGTCTATAGGTGGAAAAGAATTAGGTAGAGCACTCTGTGATTTAGGCGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGGAAGCTTGGTATT
GGTGAAGCTAGGCCTACCACAGTCACACTCCAACTAGCTGATAGGACTATCACATATCCAGAGGGTAAAACTGAGGACGTCTTAGTAAAAGTTGATAAGTTCATATTTCC
TGTTGATTTTATTATTTTAGACTACGAGGCTGATAAGGATGTCCCAATTATTCTAGGTCGTCCATTTTTGGCTACTGGTAGGGCGTTAATAGATGTTCAAAAAGGGGAAT
TAACAATGAGAGTCTGTAATGAGGAAGTAAAATTTAATGTCTTTAAAGCCATGAAATATCCAGACGAAATGGAGGATTGCTCCTTCATCAGGATTCTGGAGAGCACAGTT
ATTGAGACAGCAATACAGGATTCAGCTGATAAGCATTCGGAAAAGCATGGAGAGACGTTGCCCATTATTGTTGCATCAGATTTAATGTCGGAGCATGAAGAGGCCTTAAT
AAAATTGTTGCAACAATACCGCAAAGCTATAGGTTGGACATTGGCTGATATTCAGGGAATTAGCTCATCTTTTTGTATGCACAAAATCACTCTAGAGGAGGGATCATTTA
GGAGTATTGAGCAACAAAGAAGGCTTAACCCTGCAATGAAAGAGGTTGTTAAAAAGGAGGTAATTAAATGGTTGGGTGCTGGGATCATCTATCCAATTGGCGATAGCAAT
TGGGTGAGCCCTTTCCAATGTGTTCCTAAGAAAGGAGGTGTCACTGTGGTGAGCAATAAAGATAATGAGTTGATCCCCACTAGGACAATGTTGGATCGATTGGCTGGTCA
GGCCTACTACTGTTTCTTGGATGGTTATTCTGGGTATAACCAGGTTACTATTGCTCCTGAGGATCAGGAGAAAACCACTTTCACCTGTCCTTATGGGACATTCGCTTTTA
GGCGAATGCCTTTTGGCCTCTATTGTAGGAAGGCTTTTGAGACTTTAAAGGCTGCTTTAATCTCAGCGCCCATTCTTTGCGCACCTAATTGGAATTTACCGTTTGAGGTA
ATGTGTGATGCGAGTGATGTTGCGGTAGCTGCGATGTTGGGGCAAAAGCAGGGAAAATTTATCCATCCTATATACTATGCAAGCAAGGTTTTAAATGAAGCACAAGTCAA
CTATACAACTACTGAAAAGGAGTTGTTAGCTGTGGTGTTCGCTTTTGAGAAGTTCCGGCCATATTTGGTTGCATCCAAAGTCACGGTTTTCACGGATCATGCAGCAATAA
GGAACCCAACAGAGTTCTGTCTAATAGTATGTAATCTCAAGTCATCATGTTCTTTAAGATTTGCTCCATGA
Protein sequenceShow/hide protein sequence
MPRRSGRVALQPDRYLGLTETQVVIPNYGIEDPLFYRQTMNDEDKDEWVKAMDLEMESMYSNSVWELADLPDETAFLNGNLEENIFMSQHEGFVAQGQEQKVCVSLLHER
SAWGKILKMQQIPSRINPLFEQNEQQNNQAENPILIANDRTRAIRAYVVPMFNELNPGIARPQIQAANFEMKPVMFQMLQTVGQFHGLTSEDPHLHLKSFLGVSDSFVIQ
AVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQERI
SINSCQWSDIRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPNNPASVFFVGNQRNNPYSNFYNPGWHNHP
NFSWGGQGNNVQAQQKMNQPGFAKAFDESPGIASGSLASELKARPQGKLPSDTEHPRREVKEQVKAVSLRSGKPLEESRKTQDLNNNSDKNIVVEKELESGQGAGGSNKN
AGASGSVPDVEPPYVPPHLMYHLYLFHKGKSLRIRMDILTKKKRLGEFETVSLTDECSAILKNGLPPKGKDPGSFIIHVSIGGKELGRALCDLGASINLMPLSVYRKLGI
GEARPTTVTLQLADRTITYPEGKTEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV
IETAIQDSADKHSEKHGETLPIIVASDLMSEHEEALIKLLQQYRKAIGWTLADIQGISSSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLGAGIIYPIGDSN
WVSPFQCVPKKGGVTVVSNKDNELIPTRTMLDRLAGQAYYCFLDGYSGYNQVTIAPEDQEKTTFTCPYGTFAFRRMPFGLYCRKAFETLKAALISAPILCAPNWNLPFEV
MCDASDVAVAAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTEKELLAVVFAFEKFRPYLVASKVTVFTDHAAIRNPTEFCLIVCNLKSSCSLRFAP