CuGenDBv2

Lag0028738 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0028738
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr8:29512491..29527734
RNA-Seq Expression: Lag0028738
Synteny: Lag0028738
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR005162 - Retrotransposon gag domain
IPR012337 - Ribonuclease H-like superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology

GenBank top hits (e value, %identity, alignment)

XP_012842899.1 PREDICTED: uncharacterized protein LOC105963074 [Erythranthe guttata]  e value: 3.2e-142  %identity: 26.09
Query:  DLEIDRTLRTIRRLK-RLAEVMA--------------------HQDEAP-----KAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMAHDNSFKGH
        D EI+RTLR +R  + + +E MA                    H+   P     + +R++  P +    SGI    I A NFELKTGLI M   N F G 
Subjt:  DLEIDRTLRTIRRLK-RLAEVMA--------------------HQDEAP-----KAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMAHDNSFKGH

Query:  PSDDPHSHLQSFLEICGTVKMNRVPTDAIRLRLFPFSLQGKANDWLESVETDSISTWDELAHAFLTKFFPPVETTKVGTEIETFRQLEEEQLYEAWERYK
         + DP+ HL +FLEIC T+K+N V  DAIRL+LF FS++ KA  WL S+   S++ W+EL+ AFL +FFPP +T ++  ++  FRQ+ +E ++EAWER+K
Subjt:  PSDDPHSHLQSFLEICGTVKMNRVPTDAIRLRLFPFSLQGKANDWLESVETDSISTWDELAHAFLTKFFPPVETTKVGTEIETFRQLEEEQLYEAWERYK

Query:  EMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTKAKDLLEEMVATSYQWSTERGVISKKAEIYELDESSSLKAQMSSLTNALNKLT-
        ++LR+CPQHG+  W Q++LFYNGL+   ++++D +AGGS  +KT T A+D++E M   +Y W +ER  I K A +++LD  +++ AQ+++L+N + +L+ 
Subjt:  EMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTKAKDLLEEMVATSYQWSTERGVISKKAEIYELDESSSLKAQMSSLTNALNKLT-

Query:  ---SSEVVKSISTLAEGYLKKEGQDVEEVQYIGNR---------------PFTQGVPN-----FYHPVCAIMRT---SHTRTRRMFC----SHRQ-TTVN
            +E V + ST      +    D E+  ++ +R                +  G+ N     + +P  A+      +H R +R        HRQ   + 
Subjt:  ---SSEVVKSISTLAEGYLKKEGQDVEEVQYIGNR---------------PFTQGVPN-----FYHPVCAIMRT---SHTRTRRMFC----SHRQ-TTVN

Query:  NHDTALKNMEVQIGQIASVVNALQKGKFPSETEPNPREQCKMVRLRSGRNLEINSEKKMKKKRARMKMKGLRHKKPPLKDYAAERLEGANSVLQQNWEQN
           + +KNME QIGQIA  ++ + KG FPS TE NP+E C+ +  RS                      GL+   PP     + R               
Subjt:  NHDTALKNMEVQIGQIASVVNALQKGKFPSETEPNPREQCKMVRLRSGRNLEINSEKKMKKKRARMKMKGLRHKKPPLKDYAAERLEGANSVLQQNWEQN

Query:  CHITARVIWCMSDPPGVRFELDPEIERTLGTEGESSAETRWRTCRVFRGPEGPATPQNPLLQQNPLFEQNEQQNNQAENPILIATIGPEPFEHVANRGAI
                     P      ++PEI  +  +  E+S             P   + P NP L   P+                       PF         
Subjt:  CHITARVIWCMSDPPGVRFELDPEIERTLGTEGESSAETRWRTCRVFRGPEGPATPQNPLLQQNPLFEQNEQQNNQAENPILIATIGPEPFEHVANRGAI

Query:  PWFASEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFRILLEMEQRHERISINSCQWSDVRGTNKKVKSVLE--VDGVSTIRADLAMIANALKNVTVISH
                                                                    R   KK K  L+  ++ +  IR ++               
Subjt:  PWFASEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFRILLEMEQRHERISINSCQWSDVRGTNKKVKSVLE--VDGVSTIRADLAMIANALKNVTVISH

Query:  QQPPAMEPTAVVNQVAEEACVYCGFAKA-QVMPQQN---KQALPQQIREFSRGDDERIYGSYRCAIQSNQASMRALELQVGAGGSNKNAGASGSVPDVEP
                                FA+A +VMP      K+ L ++IR                 I+ +                               
Subjt:  QQPPAMEPTAVVNQVAEEACVYCGFAKA-QVMPQQN---KQALPQQIREFSRGDDERIYGSYRCAIQSNQASMRALELQVGAGGSNKNAGASGSVPDVEP

Query:  PYVPPTLCTTSTFSTKAKPKNQDGQFKNAILKNELPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRK-----------------------
          +P TL  T                 +AIL++ LPPK KDPGS+TIP  IG     +ALCDLGASINLMP+SV+ K                       
Subjt:  PYVPPTLCTTSTFSTKAKPKNQDGQFKNAILKNELPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRK-----------------------

Query:  ----------------------LDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVI---------
                              L+   D + PIILGRPFLATG+A+ID++ G L +RV  EEV FN+  A K+ + +E CS I ++E  V          
Subjt:  ----------------------LDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVI---------

Query:  ------------------ETAIQDSL--------------------------------------------------------------------------
                          ET I++ +                                                                          
Subjt:  ------------------ETAIQDSL--------------------------------------------------------------------------

Query:  -QYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNW-----------------------------
         +YR AIGW++ DI+GISPS   HKI +E+     ++ QRRLNP+MKEVVKKEV+K L AG+IY I+DS W                             
Subjt:  -QYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNW-----------------------------

Query:  --------------------------------------------------ITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQR------------
                                                          I+IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQR            
Subjt:  --------------------------------------------------ITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQR------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------YCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAA-----------------------------------------
                              C +AF  LK  L  +PI+  PNW  PFE+MCDASD A                                         
Subjt:  ---------------------YCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAA-----------------------------------------

Query:  ---------------------------------------------EFDLEIKDKKGSENVIADHLSRL--------------DPSSSLLEQSAISDLFQM
                                                     EFDLEI+DKKGSENV+ADHLSRL               P   LL  SA +  +  
Subjt:  ---------------------------------------------EFDLEIKDKKGSENVIADHLSRL--------------DPSSSLLEQSAISDLFQM

Query:  NSSLLLSKGNPGAMSL-----------FAVW-----------RSFQR-------------------------SKDNYEDFALWILLATLFKDAHWFYKQC
         ++ L S   P  +S            F +W           R  +R                         S+   +   L     TLF+D++ F K+C
Subjt:  NSSLLLSKGNPGAMSL-----------FAVW-----------RSFQR-------------------------SKDNYEDFALWILLATLFKDAHWFYKQC

Query:  DACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAFACHQSDAKTI------------------------------
        D CQR GNL  + +MPL  + EVELFDVWGIDFMGPFP SNG ++ILLAVDYVSKWVEA A   +DA+T+                              
Subjt:  DACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAFACHQSDAKTI------------------------------

Query:  -------------------------------------------------AKLDEALWAYRTAYKTPLGAI------------------------------
                                                          KLD+ALWAYRTA+KTP+G                                
Subjt:  -------------------------------------------------AKLDEALWAYRTAYKTPLGAI------------------------------

Query:  -------RMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQK
               R+LQLNE+EEFR  +YENAK+YKEKTK WHDK+I  +EF  G +
Subjt:  -------RMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQK

XP_016646912.1 PREDICTED: uncharacterized protein LOC103318979 [Prunus mume]  e value: 3.1e-145  %identity: 27.34
Query:  DEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMAHDNS-FKGHPSDDPHSHLQSFLEICGTVKMNRVPTDAIRLRLFPFSLQGKANDWLESV
        +EA +A+ +F  PV     S I    I A NFE+K  +I M  ++S F G P++DP+ HL  FLEIC T K N V  DAIRLRLFPFSL+ KA  WL S 
Subjt:  DEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMAHDNS-FKGHPSDDPHSHLQSFLEICGTVKMNRVPTDAIRLRLFPFSLQGKANDWLESV

Query:  ETDSISTWDELAHAFLTKFFPPVETTKVGTEIETFRQLEEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTKAK
          DSI TWD+L+  FL KFFPP +T K   +I +F Q ++E LYEAWER+K++LR+CP H  P W+QVQ FYNGL+ +++T++D +AGG+ ++KT T+A 
Subjt:  ETDSISTWDELAHAFLTKFFPPVETTKVGTEIETFRQLEEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTKAK

Query:  DLLEEMVATSYQWSTERGVISKKAEIYELDESSSLKAQMSSLTNALNKLTSSEVVKSIS----------TLAEGYLKKEGQDVEEVQYIG------NRPF
        +LLE M + +YQW +ER +  K A + E+D  + L AQ+S+LT  ++ L+ + +  S +            +E          E+V  +G      N P+
Subjt:  DLLEEMVATSYQWSTERGVISKKAEIYELDESSSLKAQMSSLTNALNKLTSSEVVKSIS----------TLAEGYLKKEGQDVEEVQYIG------NRPF

Query:  T-------QGVPNF-YHPVCAIMR------------------TSHTRTRRMFCSHRQTTVNNHDTALKNMEVQIGQIASVVNALQKGKFPSETEPNPR--
        +       +  PNF +     + R                  T  T +   F +  +T   N   +++N+EVQ+GQ+A+V++   +G FPS+ E NP+  
Subjt:  T-------QGVPNF-YHPVCAIMR------------------TSHTRTRRMFCSHRQTTVNNHDTALKNMEVQIGQIASVVNALQKGKFPSETEPNPR--

Query:  EQCKMVRLRSGRNLEINSEKKMKKKRARMKMKGLRHKKPPLKDYAAERLEGANSVLQQNWEQNCHITARVIWCMSDPPGVRFELDPEIERTLGTEGESSA
        EQ K + LR G+  ++N+   ++K+           K+   K +AAE                                +     P I  T         
Subjt:  EQCKMVRLRSGRNLEINSEKKMKKKRARMKMKGLRHKKPPLKDYAAERLEGANSVLQQNWEQNCHITARVIWCMSDPPGVRFELDPEIERTLGTEGESSA

Query:  ETRWRTCRVFRGPEGPATPQNPLLQQNPLFEQNEQQNNQAENPILIATIGPEPFEHVANRGAIPWFASEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLF
                                          +++ + EN I I                        P L LK +           VP+        
Subjt:  ETRWRTCRVFRGPEGPATPQNPLLQQNPLFEQNEQQNNQAENPILIATIGPEPFEHVANRGAIPWFASEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLF

Query:  RILLEMEQRHERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVAEEACVYCGFAKAQVMPQQNKQAL
           +   QR  +              NK       VDG           A  L+    +    P A        +  E+   Y  F K  ++ ++ K   
Subjt:  RILLEMEQRHERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVAEEACVYCGFAKAQVMPQQNKQAL

Query:  PQQIREFSRGDDERIYGSYRCAIQSNQASMRALELQVGAGGSNKNAGASGSVPDVEPPYVPPTLCTTSTFSTKAKPKNQDGQFKNAILKNELPPKAKDPG
                 G+ E+I  +  C                                                               +AIL+ +LPPK KD G
Subjt:  PQQIREFSRGDDERIYGSYRCAIQSNQASMRALELQVGAGGSNKNAGASGSVPDVEPPYVPPTLCTTSTFSTKAKPKNQDGQFKNAILKNELPPKAKDPG

Query:  SFTIPVSIGGKELGRALCDLGASINLMPLSVYRK---------------------------------------------LDYEADKDVPIILGRPFLATG
        SF IP +IG     RALCDLG+SINL+PLSV +K                                             LD E D D  +ILGRPFL T 
Subjt:  SFTIPVSIGGKELGRALCDLGASINLMPLSVYRK---------------------------------------------LDYEADKDVPIILGRPFLATG

Query:  RALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRI-----------------LESTVIETAI-QDS---------------------------
        R LIDV++G LT+RV NE+  F VF+A+K+P E EDC  I +                 LEST++  A  QD                            
Subjt:  RALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRI-----------------LESTVIETAI-QDS---------------------------

Query:  ----------------------------------------------------------LQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRL
                                                                   +++ AIGWT+ADI+GISPS CMH+I +EE    S+E QRRL
Subjt:  ----------------------------------------------------------LQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRL

Query:  NPAMKEVVKKEVIKWLDAGIIYPIADSNWIT---------------------------------------------------------------------
        NP MKEVV+ EV+K LDAGIIYPI+DS+W++                                                                     
Subjt:  NPAMKEVVKKEVIKWLDAGIIYPIADSNWIT---------------------------------------------------------------------

Query:  ----------IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRY-----------------------------------------------------
                  IAPEDQEKTTFTCP+GTFA+RRMPFGLCNAPATFQR                                                      
Subjt:  ----------IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRY-----------------------------------------------------

Query:  -----------------------------------------------CRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAA----------------
                                                       C +AF  LK  L +AP++ AP+W LPFE+MCDASD A                
Subjt:  -----------------------------------------------CRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAA----------------

Query:  ----------------------------------------------------------------------EFDLEIKDKKGSENVIADHLSRL-------
                                                                              EFD+EI+DKKGSENV+ADHLSRL       
Subjt:  ----------------------------------------------------------------------EFDLEIKDKKGSENVIADHLSRL-------

Query:  DPSSSLLEQSAISDLFQMNS-------------SLLLSKGNPGAMSLFAVWRSFQRSKDNY-EDFALW------ILLATLFKDAHWFYKQCDACQRRGNL
        +    +LE      LF +NS             + L     P  MS +   +     K  Y +D  LW      I+   + +        CD CQR GN+
Subjt:  DPSSSLLEQSAISDLFQMNS-------------SLLLSKGNPGAMSLFAVWRSFQRSKDNY-EDFALW------ILLATLFKDAHWFYKQCDACQRRGNL

Query:  GPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAFACHQSDAKTIA--------------------------------------
          R +MPL  ILEVELFDVWGIDFMGPFP S GN++IL+AVDYVSKWVEA A   +DAK +                                       
Subjt:  GPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAFACHQSDAKTIA--------------------------------------

Query:  -----------------------------------------KLDEALWAYRTAYKTPLG------------------------AI-------------RM
                                                 KLD+ALWAYRTA+K P+G                        AI             R 
Subjt:  -----------------------------------------KLDEALWAYRTAYKTPLG------------------------AI-------------RM

Query:  LQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQ-------------------------------------KDEKDGRVFKVNRQRVK
        LQLNELEE R  SYENAK+YK++TK WHDK I  KEF  GQ                                     K+++DG  FKVN  R+K
Subjt:  LQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQ-------------------------------------KDEKDGRVFKVNRQRVK

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]  e value: 1.0e-164  %identity: 27.31
Query:  QDEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMAHDNSFKGHPSDDPHSHLQSFLEICGTVKMNRVPTDAIRLRLFPFSLQGKANDWLESV
        Q+  P+ ++D+++P++    SGI    I A NFELK  LI M     F G P DDP+ HL  FLEIC T+KMN V  D IRLRLFPFSL+ KA  WL+S+
Subjt:  QDEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMAHDNSFKGHPSDDPHSHLQSFLEICGTVKMNRVPTDAIRLRLFPFSLQGKANDWLESV

Query:  ETDSISTWDELAHAFLTKFFPPVETTKVGTEIETFRQLEEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTKAK
        +  SI++W ++A  FL KFFPP +T ++ +EI  FRQ + E LYEAWERYK+++R CPQHG PDWLQVQ+FYNGLN  T+T++D ++GG+ +SKT   A 
Subjt:  ETDSISTWDELAHAFLTKFFPPVETTKVGTEIETFRQLEEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTKAK

Query:  DLLEEMVATSYQWSTERGVISKKAEIYELDESSSLKAQMSSLTNALNKLTSSEVVKSISTLAEGYLKKEGQDV--EEVQYIGNRPFT---QGVPNFYHP-
         LLEEM + +YQW TER +  K A I+EL+  ++L AQ++SL++ ++ LT+  + +    +A   +     +   E+VQYI NR +      +PN+YHP 
Subjt:  DLLEEMVATSYQWSTERGVISKKAEIYELDESSSLKAQMSSLTNALNKLTSSEVVKSISTLAEGYLKKEGQDV--EEVQYIGNRPFT---QGVPNFYHP-

Query:  -------------------------------------VCAIMRTSHT-RTRRMFCSHRQTTVNNHDTALKNMEVQIGQIASVVNALQKGKFPSETEPNPR
                                             V  +  T  T +       + +T  +N    +KN+EVQIGQ+A+ +NA Q+G FPS TE NP+
Subjt:  -------------------------------------VCAIMRTSHT-RTRRMFCSHRQTTVNNHDTALKNMEVQIGQIASVVNALQKGKFPSETEPNPR

Query:  EQCKMVRLRSGRNLEINSEKKMKKKRARMKMKGLRHKKPPLKDYAAERLEGANSVLQQNWEQNCHITARVIWCMSDPPGVRFELDPEIERTLGTEGESSA
        EQCK + LRSGR +E +  K+ +                             N+   +N  +   I    +     PP + F                  
Subjt:  EQCKMVRLRSGRNLEINSEKKMKKKRARMKMKGLRHKKPPLKDYAAERLEGANSVLQQNWEQNCHITARVIWCMSDPPGVRFELDPEIERTLGTEGESSA

Query:  ETRWRTCRVFRGPEGPATPQNPLLQQNPLFEQNEQQNNQAENPILIATIGPEPFEHVANRGAIPWFASEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLF
                          P NP +   PL      Q  + +                                  K F    D F               
Subjt:  ETRWRTCRVFRGPEGPATPQNPLLQQNPLFEQNEQQNNQAENPILIATIGPEPFEHVANRGAIPWFASEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLF

Query:  RILLEMEQRHERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVAEEACVYCGFAKAQVMPQQNKQAL
                  ++I IN                                 A+AL                        E+   Y  F K  +  +      
Subjt:  RILLEMEQRHERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVAEEACVYCGFAKAQVMPQQNKQAL

Query:  PQQIREFSRGDDERIYGSYRCAIQSNQASMRALELQVGAGGSNKNAGASGSVPDVEPPYVPPTLCTTSTFSTKAKPKNQDGQFKNAILKNELPPKAKDPG
         +++ EF     E +  S  C                                                               +AI++ +LP K KDPG
Subjt:  PQQIREFSRGDDERIYGSYRCAIQSNQASMRALELQVGAGGSNKNAGASGSVPDVEPPYVPPTLCTTSTFSTKAKPKNQDGQFKNAILKNELPPKAKDPG

Query:  SFTIPVSIGGKELGRALCDLGASINLMPLSVYRK---------------------------------------------LDYEADKDVPIILGRPFLATG
        SFT+P +IG     + LCDLGASINLMPLSVYRK                                             LD E D++VP+ILGRPFLATG
Subjt:  SFTIPVSIGGKELGRALCDLGASINLMPLSVYRK---------------------------------------------LDYEADKDVPIILGRPFLATG

Query:  RALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIET--------------------------AIQDS-------------------
        RAL+DVQKGELT+RV  EEV FN+++AMK+P++   C  + ++E  V+E                           A+ DS                   
Subjt:  RALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIET--------------------------AIQDS-------------------

Query:  ---------------------------------------------------------------------------LQYRKAIGWTLADIQGISPSFCMHK
                                                                                    ++R A+GWT++DI+GISPS CMHK
Subjt:  ---------------------------------------------------------------------------LQYRKAIGWTLADIQGISPSFCMHK

Query:  ITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNW------------------------------------------------------
        I +EE    SIE QRRLNPAMKEVV+ E++K L+AGIIY I+DS+W                                                      
Subjt:  ITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNW------------------------------------------------------

Query:  -------------------------ITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQR-------------------------------------
                                 I IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQR                                     
Subjt:  -------------------------ITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQR-------------------------------------

Query:  ------------------------------------------------------------------------------------------------YCRK
                                                                                                         C +
Subjt:  ------------------------------------------------------------------------------------------------YCRK

Query:  AFETLKAALISAPILCAPNWNLPFEVMCDASDAA------------------------------------------------------------------
        AF  +K  LISAP++  P+W+ PFEVMCDASD A                                                                  
Subjt:  AFETLKAALISAPILCAPNWNLPFEVMCDASDAA------------------------------------------------------------------

Query:  --------------------EFDLEIKDKKGSENVIADHLSRLDPSS---SLLEQSAISD--LFQMNSSL---------LLSKGNPGAMSL---------
                            EFDLE++DKKGSEN +ADHLSRL+       L+ Q A  D  LF     L         L  K  P  ++          
Subjt:  --------------------EFDLEIKDKKGSENVIADHLSRLDPSS---SLLEQSAISD--LFQMNSSL---------LLSKGNPGAMSL---------

Query:  --FAVWRS---FQRSKDNY------EDFALWIL---------------------------LATLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVEL
          + +W     F+R  D        E+    IL                             ++F+D++   K CD CQR GN+  R E+PL  ILEVEL
Subjt:  --FAVWRS---FQRSKDNY------EDFALWIL---------------------------LATLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVEL

Query:  FDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAFACHQSDAKTI-------------------------------------------------------
        FDVWGIDFMGPFPPS G V+ILLAVDYVSKWVEA A   +DAK +                                                       
Subjt:  FDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAFACHQSDAKTI-------------------------------------------------------

Query:  ------------------------AKLDEALWAYRTAYKTPLGAI-------------------------------------RMLQLNELEEFRQFSYEN
                                 KLD+ALWAYRTA+KTP+G                                       R+LQLNE++EFR  +YEN
Subjt:  ------------------------AKLDEALWAYRTAYKTPLGAI-------------------------------------RMLQLNELEEFRQFSYEN

Query:  AKMYKEKTKLWHDKKIKSKEFVKGQK
        AK+YKE+TK WHDK+I  +EF  GQ+
Subjt:  AKMYKEKTKLWHDKKIKSKEFVKGQK

XP_034899370.1 LOW QUALITY PROTEIN: uncharacterized protein LOC118037487 [Populus alba]  e value: 1.7e-146  %identity: 26.71
Query:  DEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMAHDN-SFKGHPSDDPHSHLQSFLEICGTVKMNRVPTDAIRLRLFPFSLQGKANDWLESV
        ++  +A+RDF  P +    S I    I+A NFE+K  ++QM   +  F G PSDDP++H+ SFLEIC T K N V  DAIRLRLFPFSL+ +A +WL S+
Subjt:  DEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMAHDN-SFKGHPSDDPHSHLQSFLEICGTVKMNRVPTDAIRLRLFPFSLQGKANDWLESV

Query:  ETDSISTWDELAHAFLTKFFPPVETTKVGTEIETFRQLEEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTKAK
          DS+ +W++LA  FL KFFPP +T K+  EI  F QLE E LYE WERYK++LRRCP HG P W+QVQ FYNGLN ST+T++D ++GG+F+SK+   A 
Subjt:  ETDSISTWDELAHAFLTKFFPPVETTKVGTEIETFRQLEEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTKAK

Query:  DLLEEMVATSYQWSTERGVISKKAEIYELDESSSLKAQMSSLTNAL--NKLTSSEVVKSISTLAEGYLKKEGQ------DVEEVQYIGN-----------
        +LLEEM   +YQW  ER V  K   ++E+D  ++L AQ+ SLT  L   +L+++ +  +       +  +E Q        E   ++ N           
Subjt:  DLLEEMVATSYQWSTERGVISKKAEIYELDESSSLKAQMSSLTNAL--NKLTSSEVVKSISTLAEGYLKKEGQ------DVEEVQYIGN-----------

Query:  -RPFTQGVPNFYHPVCAIMR-------TSHTRTR--------------RMFCSHRQTTVNNHDTALKNMEVQIGQIASVVNALQKGKFPSETEPNPREQC
          P  +  PNF      +M+       + H + +                F +   T + N   +++N+EVQ+GQ+A+++   Q+G  PS TE NP+EQC
Subjt:  -RPFTQGVPNFYHPVCAIMR-------TSHTRTR--------------RMFCSHRQTTVNNHDTALKNMEVQIGQIASVVNALQKGKFPSETEPNPREQC

Query:  KMVRLRSGRNLEINSEKKMKKKRARMKMKGLRHKKPPLKDYAAERLEGANSVLQQNWEQNCHITARVIWCMSDPPGVRFELDPEIERTLGTEGESSAETR
        K + LRSG+                                                                          E+E+T G +     E  
Subjt:  KMVRLRSGRNLEINSEKKMKKKRARMKMKGLRHKKPPLKDYAAERLEGANSVLQQNWEQNCHITARVIWCMSDPPGVRFELDPEIERTLGTEGESSAETR

Query:  WRTCRVFRGPEGPATPQNPLLQQNPLFEQNEQQNNQAENPILIATIGPEPFEHVANRGAIPWFASEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFRIL
                  E    P   + + +PL                     PEP + +  R   P                                       
Subjt:  WRTCRVFRGPEGPATPQNPLLQQNPLFEQNEQQNNQAENPILIATIGPEPFEHVANRGAIPWFASEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFRIL

Query:  LEMEQRHERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVAEEACVYCGFAKAQVMPQQNKQALPQQ
               +R+  N          +K+    L+V                                                 F K Q+            
Subjt:  LEMEQRHERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVAEEACVYCGFAKAQVMPQQNKQALPQQ

Query:  IREFSRGDDERIYGSYRCAIQSNQASMRALELQVGAGGSNKNAGASGSVPDVEPPYVPPTLCTTSTFSTKAKPKNQDGQFKNAILKNELPPKAKDPGSFT
                              N     ALE                      P Y   T+  T   S              AIL+ +LPPK KDPGSFT
Subjt:  IREFSRGDDERIYGSYRCAIQSNQASMRALELQVGAGGSNKNAGASGSVPDVEPPYVPPTLCTTSTFSTKAKPKNQDGQFKNAILKNELPPKAKDPGSFT

Query:  IPVSIGGKELGRALCDLGASINLMPLSVYRK---------------------------------------------LDYEADKDVPIILGRPFLATGRAL
        IP SIG     +ALCDLGASINLMPLS+++K                                             LD E D ++PI+LGRPFLATG AL
Subjt:  IPVSIGGKELGRALCDLGASINLMPLSVYRK---------------------------------------------LDYEADKDVPIILGRPFLATGRAL

Query:  IDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIE---------------------------TAIQDSLQ-------------------
        IDV+KGEL +RV  EEV FNVFKA+K PD  E C  I++++S + E                           T   DS +                   
Subjt:  IDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIE---------------------------TAIQDSLQ-------------------

Query:  --------------------------------------------------YRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVV-K
                                                          ++ A+GW LADI+GISPS CMHKI LE+    ++E QRRLNP MKEV   
Subjt:  --------------------------------------------------YRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVV-K

Query:  KEVIKWLDAGII---------------------------YPIADS--------------------NWITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAP
          V+K  +  +I                            P  D                     N I IAPEDQEKTTFTCPYGTF FRRMPFGLCNAP
Subjt:  KEVIKWLDAGII---------------------------YPIADS--------------------NWITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAP

Query:  ATFQR-----------------------------------------------------------------------------------------------
        ATFQR                                                                                               
Subjt:  ATFQR-----------------------------------------------------------------------------------------------

Query:  --------------------------------------YCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAA------------------------
                                               C+ AF  LK  L++API+ AP+W  PFE+MCDASD A                        
Subjt:  --------------------------------------YCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAA------------------------

Query:  --------------------------------------------------------------EFDLEIKDKKGSENVIADHLSRLD-PSSSLLEQSAISD
                                                                      EFD+EIKD KG+ENV+ADHLSRL+   +   + + I++
Subjt:  --------------------------------------------------------------EFDLEIKDKKGSENVIADHLSRLD-PSSSLLEQSAISD

Query:  LFQMNSSLLLSKG----------NPGAMSLFAVWRSFQRSKDNYEDFA--LW------------------------------------------------
         F     L +S            N  A  +     S+Q+ K  + +     W                                                
Subjt:  LFQMNSSLLLSKG----------NPGAMSLFAVWRSFQRSKDNYEDFA--LW------------------------------------------------

Query:  ----ILLATLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAFACHQSDAKTI---------
                T+FKDAH F   CD CQ  GN+  R+EMPL  ILEVELFDVWGIDFMGPFP S  N +ILLAVDYVSKW+EA A   +D K +         
Subjt:  ----ILLATLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAFACHQSDAKTI---------

Query:  ----------------------------------------------------------------------AKLDEALWAYRTAYKTPLGAI---------
                                                                               KLD+ALWAYRTA+KTPLG           
Subjt:  ----------------------------------------------------------------------AKLDEALWAYRTAYKTPLGAI---------

Query:  ----------------------------RMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQK
                                    R+LQLNE++EF   SYENAK+YKE+TK WHDK I  KEFV GQ+
Subjt:  ----------------------------RMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQK

XP_038976653.1 uncharacterized protein LOC120107448 [Phoenix dactylifera]  e value: 1.3e-159  %identity: 29.18
Query:  KAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMAHDNSFKGHPSDDPHSHLQSFLEICGTVKMNRVPTDAIRLRLFPFSLQGKANDWLESVETDSI
        + + D+  P +      IV   + A NFE+K GLIQM     F G PS+DPH+HL +FLEIC T+KMN V  DAIRLRLFPFSL+ KA  WL S   +S 
Subjt:  KAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMAHDNSFKGHPSDDPHSHLQSFLEICGTVKMNRVPTDAIRLRLFPFSLQGKANDWLESVETDSI

Query:  STWDELAHAFLTKFFPPVETTKVGTEIETFRQLEEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTKAKDLLEE
        +TW+ L+ AFL+K+FPP +T K+  +I +F Q + E LYEAWER K++ R+CP HG PDWL VQ FYNGL  S +  +D +AGG+ +SK+  +A +LLEE
Subjt:  STWDELAHAFLTKFFPPVETTKVGTEIETFRQLEEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTKAKDLLEE

Query:  MVATSYQWSTERGVISKKAEIYELDESSSLKAQMSSLTNALNKL----TSSEVVKSISTLAEGYLKKEGQDVEEV-----QYIGNRPFT-------QGVP
        +V+ +YQWS+ERG+  K   +Y++D  + L A++ SL     KL    + S  V S       ++  +   V+ V     Q   N P++       +  P
Subjt:  MVATSYQWSTERGVISKKAEIYELDESSSLKAQMSSLTNALNKL----TSSEVVKSISTLAEGYLKKEGQDVEEV-----QYIGNRPFT-------QGVP

Query:  NF--------------YHP----------------VCAIMRTSHTRTRRMFCSHRQTTVNNHDTALKNMEVQIGQIASVVNALQKGKFPSETEPNPREQC
        NF               HP                  AI + ++  + R      +  V+   ++ +N+E+Q+GQ+A+ +N   +   PS+TE NP+E C
Subjt:  NF--------------YHP----------------VCAIMRTSHTRTRRMFCSHRQTTVNNHDTALKNMEVQIGQIASVVNALQKGKFPSETEPNPREQC

Query:  KMVRLRSGRNL-EINSEKKMKKKRARMKMKGLRHKKPPLKDYAAERLEGANSVLQQNWEQNCHITARVIWCMSDPPGVRFELDPEIERTLGTEGESSAET
        K V LRSG+ L +++ E  +  K                 DY                                          E+ + +  E E  A+T
Subjt:  KMVRLRSGRNL-EINSEKKMKKKRARMKMKGLRHKKPPLKDYAAERLEGANSVLQQNWEQNCHITARVIWCMSDPPGVRFELDPEIERTLGTEGESSAET

Query:  RWRTCRVFRGPEGPATPQNPLLQQNPLFEQNEQQNNQAENPILIATIGPEPFEHVANRGAIPWFASEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFRI
                     P  P  P +   P F Q  +QN                                                                 
Subjt:  RWRTCRVFRGPEGPATPQNPLLQQNPLFEQNEQQNNQAENPILIATIGPEPFEHVANRGAIPWFASEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFRI

Query:  LLEMEQRHERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVAEEACVYCGFAKAQVMPQQNKQALPQ
          +++Q+ E+                KV   L ++            A+AL  +        PA                Y  F K ++M ++ K     
Subjt:  LLEMEQRHERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVAEEACVYCGFAKAQVMPQQNKQALPQ

Query:  QIREFSRGDDERIYGSYRCAIQSNQASMRALELQVGAGGSNKNAGASGSVPDVEPPYVPPTLCTTSTFSTKAKPKNQDGQFKNAILKNELPPKAKDPGSF
                D E I  +  C                                                               +AI++N+LPPK +DPGSF
Subjt:  QIREFSRGDDERIYGSYRCAIQSNQASMRALELQVGAGGSNKNAGASGSVPDVEPPYVPPTLCTTSTFSTKAKPKNQDGQFKNAILKNELPPKAKDPGSF

Query:  TIPVSIGGKELGRALCDLGASINLMPLSVYRK---------------------------------------------LDYEADKDVPIILGRPFLATGRA
        +I  +IG  +  RALCDLGAS++LMPLSV RK                                             L+ E D ++PIILGRPFLAT  A
Subjt:  TIPVSIGGKELGRALCDLGASINLMPLSVYRK---------------------------------------------LDYEADKDVPIILGRPFLATGRA

Query:  LIDVQKGELTMRVCNEEVKFNVFKAMKYP-----------------------------------------DEMEDCSFIRILESTVIET-----------
        +IDV+ G LT++V  EEV+ N+F+A KYP                                         D +E       LE+T  +T           
Subjt:  LIDVQKGELTMRVCNEEVKFNVFKAMKYP-----------------------------------------DEMEDCSFIRILESTVIET-----------

Query:  -------------------AIQDSLQY--------------------------------RKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNP
                            +   L+Y                                +KAIGWT++D++GISPS CMH+I +E+     +E QRRLNP
Subjt:  -------------------AIQDSLQY--------------------------------RKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNP

Query:  AMKEVVKKEVIKWLDAGIIYPIADSNW---------------------------ITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRY-------
         MKEVV+ EV+KWLDAGIIYPI+DS+W                           I+I+PEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQR        
Subjt:  AMKEVVKKEVIKWLDAGIIYPIADSNW---------------------------ITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRY-------

Query:  ------------------------------------------------CRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAA---------------
                                                        C  AF  LK  L+SAPI+ AP+W+LPFE+MCDAS+ A               
Subjt:  ------------------------------------------------CRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAA---------------

Query:  --------------------------------------EFDLEIKDKKGSENVIADHLSRLDPSSSLLEQSAISDLFQMNSSL--------LLSKGNP-G
                                              EFDLEI+DK+G ENV+ADHLSRL+   S  ++  I++ F     L         L+   P  
Subjt:  --------------------------------------EFDLEIKDKKGSENVIADHLSRLDPSSSLLEQSAISDLFQMNSSL--------LLSKGNP-G

Query:  AMSLFAVWRSFQRSKDNYEDFALWILLATLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEA
         + +    R  QR  +      L        K        CD CQR GN+  ++EMPLT ILEVELFD+W IDFMGPFP S  N +IL+AVDYVSKWVEA
Subjt:  AMSLFAVWRSFQRSKDNYEDFALWILLATLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEA

Query:  FACHQSDAKTIA-------------------------------------------------------------------------------KLDEALWAY
         A   +D++ +                                                                                KLD+ALWAY
Subjt:  FACHQSDAKTIA-------------------------------------------------------------------------------KLDEALWAY

Query:  RTAYKTPLG------------------------AI-------------RMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQ--KDEKDGR
        RTA+KTPLG                        AI             R+LQL+ELEEFR  +YEN ++YKEKTK WHDK ++ + F  GQ  + E  G 
Subjt:  RTAYKTPLG------------------------AI-------------RMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQ--KDEKDGR

Query:  VFKVNRQRVK
         FKVN QR+K
Subjt:  VFKVNRQRVK

TrEMBL top hits    e value    % identity    Alignment
A0A1B5Z879 Reverse transcriptase (Fragment)    2.2e-120    25.5
Query:  DLEIDRTLRTIRRLKRL---------AEVMAHQDE-APKAIRDFLQPVLPTE--------------NSGIVYAPIQATNFELKTGLIQMAHDNSFKGHPS
        D EI++T R  R+  RL         +EV +  DE   K + + ++     E                 I   P+    FE+   +++   DN F G  +
Subjt:  DLEIDRTLRTIRRLKRL---------AEVMAHQDE-APKAIRDFLQPVLPTE--------------NSGIVYAPIQATNFELKTGLIQMAHDNSFKGHPS

Query:  DDPHSHLQSFLEICGTVKMNRVPTDAIRLRLFPFSLQGKANDWLESVETDSISTWDELAHAFLTKFFPPVETTKVGTEIETFRQLEEEQLYEAWERYKEM
        +D + HL++F+ +C T+K+     +  RLRLFPF+L+  A +W + +  DSI+TWD++   FL +FFP     +   EI  F Q E E L +A++R+K+ 
Subjt:  DDPHSHLQSFLEICGTVKMNRVPTDAIRLRLFPFSLQGKANDWLESVETDSISTWDELAHAFLTKFFPPVETTKVGTEIETFRQLEEEQLYEAWERYKEM

Query:  LRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTKAKDLLEEMVATSYQWSTERGVISKKAEIYELDESSSLKAQMSSLTNALNKLTSSE
        L  CP+H +    Q+Q F NGL  ST+ VLDT+AGGS   KT T  K ++E + A       +R     KA + +L+      AQ +     + +   +E
Subjt:  LRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTKAKDLLEEMVATSYQWSTERGVISKKAEIYELDESSSLKAQMSSLTNALNKLTSSE

Query:  VVKSISTLAEGYLKKEGQDVEEVQYIGNRPFTQGVPNFYHPVCAIMRTSHTRTRRMFCSHRQTTVNNHDTALKNMEVQIGQIASVVNALQKGKFPSETEP
        V K ++ L     +     V ++  +G      G P+F                 M C               N+ +   Q+ +V          S  + 
Subjt:  VVKSISTLAEGYLKKEGQDVEEVQYIGNRPFTQGVPNFYHPVCAIMRTSHTRTRRMFCSHRQTTVNNHDTALKNMEVQIGQIASVVNALQKGKFPSETEP

Query:  NPREQCKMVRLRSGRNLEINSEKKMKKKRARMKMKGLRHKKPPLKDYAAERLEGANSVLQQNWEQNCHITARVIWCMSDPPGVRFELDPEIERTLGTEGE
        NP         R+  N    S K  +    R +  G +         +    + A    + +W+      A  I   +D                    E
Subjt:  NPREQCKMVRLRSGRNLEINSEKKMKKKRARMKMKGLRHKKPPLKDYAAERLEGANSVLQQNWEQNCHITARVIWCMSDPPGVRFELDPEIERTLGTEGE

Query:  SSAETRWRTCRVFRGPEGPATPQNPLLQQNPLFEQNEQQNNQAENPILIATIGPEPFEHVANRGAIPWFASEDPHLHLKSFLGVSDSFVIQGVPRD--AL
        + A  +  T          A+ +N  +Q   + +Q  Q++           +G  P   V N                               PRD   +
Subjt:  SSAETRWRTCRVFRGPEGPATPQNPLLQQNPLFEQNEQQNNQAENPILIATIGPEPFEHVANRGAIPWFASEDPHLHLKSFLGVSDSFVIQGVPRD--AL

Query:  RLTLFRILLEMEQRHERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVAEEACVYCGFAKAQVMPQQ
           + RI    E +  ++          +G ++   +++EV+    +R+      N   +V  +  Q+ P ++                   K  +   +
Subjt:  RLTLFRILLEMEQRHERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVAEEACVYCGFAKAQVMPQQ

Query:  NKQALPQQIREFSRGDDERIYGSYRCAIQSNQASMRALELQVGAGGSNKNAGASGSVPDVEPPYVPPTLCTTSTFSTKAKPKNQDGQFKNAILKNELPPK
         K   PQ+++   +  DE+ +G +    +  Q ++   E                   +  P Y        S   T    K +  +  +AIL+ ++P K
Subjt:  NKQALPQQIREFSRGDDERIYGSYRCAIQSNQASMRALELQVGAGGSNKNAGASGSVPDVEPPYVPPTLCTTSTFSTKAKPKNQDGQFKNAILKNELPPK

Query:  AKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRK---------------------------------------------LDYEADKDVPIILGRP
         KDPGS TIP +IG +   +AL DLGAS++LMPLS+Y+                                              L+   D D+P+ILGRP
Subjt:  AKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRK---------------------------------------------LDYEADKDVPIILGRP

Query:  FLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIETAIQDSL---------------------------------------
        +L  GR LID++ G LT++V +E VK NV +AMK+P E E+C  + IL S VIE  I++ +                                       
Subjt:  FLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIETAIQDSL---------------------------------------

Query:  -------------------------------------------------------------------QYRKAIGWTLADIQGISPSFCMHKITLEEGSFR
                                                                           QY+ AIGW++ D++GISP+FCMHKI +E+    
Subjt:  -------------------------------------------------------------------QYRKAIGWTLADIQGISPSFCMHKITLEEGSFR

Query:  SIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNW---------------------------------------------------------------
         ++ QRRLNPAMKEV++KEV+K L+AG+IYPI+DS+W                                                               
Subjt:  SIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNW---------------------------------------------------------------

Query:  ----------------ITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRYCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAA--------
                        I +AP+DQEKT FTC YG FA+RRMPFGLCNAPATFQR            L++AP++ AP+W+LPFE+MCDASD A        
Subjt:  ----------------ITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRYCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAA--------

Query:  ----------------------------------EFDL---------------------EIKDKKGSENVIADHLSRLDPSSSLLEQSAISDLFQMNSSL
                                           FD                       I+DKKGSEN +ADHLSRL+      ++  I DLF     L
Subjt:  ----------------------------------EFDL---------------------EIKDKKGSENVIADHLSRLDPSSSLLEQSAISDLFQMNSSL

Query:  LLSKGNPGA-MSLFAVWR-------SFQRSKDNYE-DFALW-------------------------ILLATLFKDAHWFYKQCDACQRRGNLGPRDEMPL
         ++     A  + + V R       S QR K  ++  F +W                         +   TLFKDA  + K+CD CQR  N+  R+EMP 
Subjt:  LLSKGNPGA-MSLFAVWR-------SFQRSKDNYE-DFALW-------------------------ILLATLFKDAHWFYKQCDACQRRGNLGPRDEMPL

Query:  TYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAFACHQSDAKTI-----------------------------------------------
          +LEVE+FDVWGIDFMGPFP S   ++IL+AV+YVSKWVEA A   +DA+ +                                               
Subjt:  TYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAFACHQSDAKTI-----------------------------------------------

Query:  --------------------------------AKLDEALWAYRTAYKTPL-------------------------------------GAIRMLQLNELEE
                                         KLD+ALWAYRTA+KTP+                                     G  R+LQL+EL+E
Subjt:  --------------------------------AKLDEALWAYRTAYKTPL-------------------------------------GAIRMLQLNELEE

Query:  FRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQ
        FR ++YENAK++KEKTK WHDKKI+++EF +GQ
Subjt:  FRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQ

A0A5N6MBJ1 Reverse transcriptase    1.7e-117    24.62
Query:  FKGHPSDDPHSHLQSFLEICGTVKMNRVPTDAIRLRLFPFSLQGKANDWLESVETDSISTWDELAHAFLTKFFPPVETTKVGTEIETFRQLEEEQLYEAW
        F G   +DP +H+ SF+EIC T K N V  DAI+LR+FPFSL+ +A  WL S+   S++TW++LA  FL K+FPP +T ++   I +F Q + E LY+AW
Subjt:  FKGHPSDDPHSHLQSFLEICGTVKMNRVPTDAIRLRLFPFSLQGKANDWLESVETDSISTWDELAHAFLTKFFPPVETTKVGTEIETFRQLEEEQLYEAW

Query:  ERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTKAKDLLEEMVATSYQWSTERGVISKKAEIYELDESSSLKAQMSSLTNALN
        ERYK+++R+CP HG   W+QV  FYNGL P  + ++D +AGG+F  KT  +   LLE++ A ++QW   RG   K+  ++++D+ +SL AQ+ ++T  +N
Subjt:  ERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTKAKDLLEEMVATSYQWSTERGVISKKAEIYELDESSSLKAQMSSLTNALN

Query:  KLTSSEVVKSISTLAEGYLKKEGQDVEEVQYIGNRPFTQGVPNFYHPVCAIMRTSHTRTRRMFCSHRQTTVNNHDTALKNMEVQIGQIASVVNALQKGKF
        ++  ++V                                                  +T   FC     +VN     LK    Q+  + S  N  Q   +
Subjt:  KLTSSEVVKSISTLAEGYLKKEGQDVEEVQYIGNRPFTQGVPNFYHPVCAIMRTSHTRTRRMFCSHRQTTVNNHDTALKNMEVQIGQIASVVNALQKGKF

Query:  PSETEPNPREQCKMVRLRSGRNLEINSEKKMKKKRARMKMKGLRHKKPPLKDYAAERLEGANSVLQQNWEQNCHITARVIWCMSDPPGVRFELDPEIERT
         +   P  +         SG N                +  G + + P  ++         N    QN  QN +   +         G + E    +E+ 
Subjt:  PSETEPNPREQCKMVRLRSGRNLEINSEKKMKKKRARMKMKGLRHKKPPLKDYAAERLEGANSVLQQNWEQNCHITARVIWCMSDPPGVRFELDPEIERT

Query:  LGTEGESSAETRWRTCRVFRGPEGPATPQ---NPLLQQNPLFEQNEQQNNQAENPILIATIGPEPFEHVANRGAIPWFASEDPHLHLKSFLGVSDSFVIQ
        + T+  S+AE R +     R  +  A  Q   N L  Q       E Q  Q     L+A            +G +P     +P  H K+           
Subjt:  LGTEGESSAETRWRTCRVFRGPEGPATPQ---NPLLQQNPLFEQNEQQNNQAENPILIATIGPEPFEHVANRGAIPWFASEDPHLHLKSFLGVSDSFVIQ

Query:  GVPRDALRLTLFRILLEMEQRHERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVAEEACVYCGFAK
              LR                 S  + +  D+  T+K +    EV+           + + +KN    S  + P  EP     +V +    Y G  K
Subjt:  GVPRDALRLTLFRILLEMEQRHERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVAEEACVYCGFAK

Query:  AQVMPQQNKQALPQQIREFSRGDDERIYGSYRCAIQSNQASMRALELQVGAGGSNKNAGASGSVPDVEPPYVPPTLCTTSTFSTKAKPKNQDGQFKN---
         + M                    E+ YG +    +    ++  +E             A   +P                 + K K +       N   
Subjt:  AQVMPQQNKQALPQQIREFSRGDDERIYGSYRCAIQSNQASMRALELQVGAGGSNKNAGASGSVPDVEPPYVPPTLCTTSTFSTKAKPKNQDGQFKN---

Query:  -AILKNELPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRK---------------------------------------------LDYEA
         A+L+N+LP K KDPGSFTIP  IGG  +  AL DLGASINLMP S++ K                                             LD + 
Subjt:  -AILKNELPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRK---------------------------------------------LDYEA

Query:  DKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFI-------------RILESTVIETAI-----QDSLQ---------
        D++VP+ILGRPFLAT RAL+DV +G+LT+RV  EEV F +  +M++    +D  +               ILE  V++T +      DS+Q         
Subjt:  DKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFI-------------RILESTVIETAI-----QDSLQ---------

Query:  --------------------------------------------------------------------------------YRKAIGWTLADIQGISPSFC
                                                                                        ++KA+ W + DI+GI+PSFC
Subjt:  --------------------------------------------------------------------------------YRKAIGWTLADIQGISPSFC

Query:  MHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWIT-------------------------------------------------
         HKI +E+     ++ QRRLNP M+EVVKKEVIK LDAG+IYPI+DS W++                                                 
Subjt:  MHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWIT-------------------------------------------------

Query:  ------------------------------IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQR----------------------------------
                                      IAPEDQEKTTFTCPYGTFA+RRMPFGLCNAPATFQR                                  
Subjt:  ------------------------------IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQR----------------------------------

Query:  ---------------------------------------------------------------------------------------------------Y
                                                                                                            
Subjt:  ---------------------------------------------------------------------------------------------------Y

Query:  CRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAA---------------------------------------------------------------
        C KAF  LK  L++API+ AP+W LPFE+MCDASD A                                                               
Subjt:  CRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAA---------------------------------------------------------------

Query:  -----------------------EFDLEIKDKKGSENVIADHLSRLD-PSSSLLEQSAISDLFQMNSSLLLSK----------GNPGAMSLFAVWRSFQR
                               EFD+EI+DKKG+ENV ADHLSRL+ PS   L +S I+D F     L +             N  A  +     + Q+
Subjt:  -----------------------EFDLEIKDKKGSENVIADHLSRLD-PSSSLLEQSAISDLFQMNSSLLLSK----------GNPGAMSLFAVWRSFQR

Query:  SKDNYED--FALW----------------------------------------------------ILLATLFKDAHWFYKQCDACQRRGNLGPRDEMPLT
         +  + D  +  W                                                        T+FKDAH   K CDACQR GN+  RDEMP  
Subjt:  SKDNYED--FALW----------------------------------------------------ILLATLFKDAHWFYKQCDACQRRGNLGPRDEMPLT

Query:  YILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAFACHQSDAKTIA-----------------------------------------------
         I   E+FDVWGIDFMGPFP S G+ +IL+AVDYVSKWVEA A   +DA+ +                                                
Subjt:  YILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAFACHQSDAKTIA-----------------------------------------------

Query:  -------------------------------KLDEALWAYRTAYKTPLGAI-------------------------------------RMLQLNELEEFR
                                       KLD+ALWA+RTAYKTP+G                                       R +Q++ELE+ R
Subjt:  -------------------------------KLDEALWAYRTAYKTPLGAI-------------------------------------RMLQLNELEEFR

Query:  QFSYENAKMYKEKTKLWHDKKIK-SKEFVKGQK------------------------------------DEKDGRVFKVNRQRVK
          +YEN+++YKE+TK  HD  +K +K+F  G +                                    +  DGR+FKVN  R+K
Subjt:  QFSYENAKMYKEKTKLWHDKKIK-SKEFVKGQK------------------------------------DEKDGRVFKVNRQRVK

A0A5N6N4K2 Reverse transcriptase    1.7e-117    24.6
Query:  HPSDDPHSHLQSFLEICGTVKMNRVPTDAIRLRLFPFSLQGKANDWLESVETDSISTWDELAHAFLTKFFPPVETTKVGTEIETFRQLEEEQLYEAWERY
        H  +DP +H+ SF+EIC T K N V  DAI+LR+FPFSL+ +A  WL S+   S++TW++LA  FL K+FPP +T ++   I +F Q + E LY+AWERY
Subjt:  HPSDDPHSHLQSFLEICGTVKMNRVPTDAIRLRLFPFSLQGKANDWLESVETDSISTWDELAHAFLTKFFPPVETTKVGTEIETFRQLEEEQLYEAWERY

Query:  KEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTKAKDLLEEMVATSYQWSTERGVISKKAEIYELDESSSLKAQMSSLTNALNKLT
        K+++R+CP HG   W+QV  FYNGL P  + ++D +AGG+F  KT  +   LLE++ A ++QW   RG   K+  ++++D+ +SL AQ+ ++T  +N++ 
Subjt:  KEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVTKAKDLLEEMVATSYQWSTERGVISKKAEIYELDESSSLKAQMSSLTNALNKLT

Query:  SSEVVKSISTLAEGYLKKEGQDVEEVQYIGNRPFTQGVPNFYHPVCAIMRTSHTRTRRMFCSHRQTTVNNHDTALKNMEVQIGQIASVVNALQKGKFPSE
         ++V                                                  +T   FC     +VN     LK    Q+  + S  N  Q   + + 
Subjt:  SSEVVKSISTLAEGYLKKEGQDVEEVQYIGNRPFTQGVPNFYHPVCAIMRTSHTRTRRMFCSHRQTTVNNHDTALKNMEVQIGQIASVVNALQKGKFPSE

Query:  TEPNPREQCKMVRLRSGRNLEINSEKKMKKKRARMKMKGLRHKKPPLKDYAAERLEGANSVLQQNWEQNCHITARVIWCMSDPPGVRFELDPEIERTLGT
          P  +         SG N                +  G + + P  ++         N    QN  QN +   +         G + E    +E+ + T
Subjt:  TEPNPREQCKMVRLRSGRNLEINSEKKMKKKRARMKMKGLRHKKPPLKDYAAERLEGANSVLQQNWEQNCHITARVIWCMSDPPGVRFELDPEIERTLGT

Query:  EGESSAETRWRTCRVFRGPEGPATPQ---NPLLQQNPLFEQNEQQNNQAENPILIATIGPEPFEHVANRGAIPWFASEDPHLHLKSFLGVSDSFVIQGVP
        +  S+AE R +     R  +  A  Q   N L  Q       E Q  Q     L+A            +G +P     +P  H K+              
Subjt:  EGESSAETRWRTCRVFRGPEGPATPQ---NPLLQQNPLFEQNEQQNNQAENPILIATIGPEPFEHVANRGAIPWFASEDPHLHLKSFLGVSDSFVIQGVP

Query:  RDALRLTLFRILLEMEQRHERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVAEEACVYCGFAKAQV
           LR                 S  + +  D+  T+K +    EV+           + + +KN    S  + P  EP     +V +    Y G  K + 
Subjt:  RDALRLTLFRILLEMEQRHERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVAEEACVYCGFAKAQV

Query:  MPQQNKQALPQQIREFSRGDDERIYGSYRCAIQSNQASMRALELQVGAGGSNKNAGASGSVPDVEPPYVPPTLCTTSTFSTKAKPKNQDGQFKN----AI
        M                    E+ YG +    +    ++  +E             A   +P                 + K K +       N    A+
Subjt:  MPQQNKQALPQQIREFSRGDDERIYGSYRCAIQSNQASMRALELQVGAGGSNKNAGASGSVPDVEPPYVPPTLCTTSTFSTKAKPKNQDGQFKN----AI

Query:  LKNELPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRK---------------------------------------------LDYEADKD
        L+N+LP K KDPGSFTIP  IGG  +  AL DLGASINLMP S++ K                                             LD + D++
Subjt:  LKNELPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRK---------------------------------------------LDYEADKD

Query:  VPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFI-------------RILESTVIETAI-----QDSLQ------------
        VP+ILGRPFLAT RAL+DV +G+LT+RV  EEV F +  +M++    +D  +               ILE  V++T +      DS+Q            
Subjt:  VPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFI-------------RILESTVIETAI-----QDSLQ------------

Query:  -----------------------------------------------------------------------------YRKAIGWTLADIQGISPSFCMHK
                                                                                     ++KA+ W + DI+GI+PSFC HK
Subjt:  -----------------------------------------------------------------------------YRKAIGWTLADIQGISPSFCMHK

Query:  ITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWIT----------------------------------------------------
        I +E+     ++ QRRLNP M+EVVKKEVIK LDAG+IYPI+DS W++                                                    
Subjt:  ITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWIT----------------------------------------------------

Query:  ---------------------------IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQR-------------------------------------
                                   IAPEDQEKTTFTCPYGTFA+RRMPFGLCNAPATFQR                                     
Subjt:  ---------------------------IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQR-------------------------------------

Query:  ------------------------------------------------------------------------------------------------YCRK
                                                                                                         C K
Subjt:  ------------------------------------------------------------------------------------------------YCRK

Query:  AFETLKAALISAPILCAPNWNLPFEVMCDASDAA------------------------------------------------------------------
        AF  LK  L++API+ AP+W LPFE+MCDASD A                                                                  
Subjt:  AFETLKAALISAPILCAPNWNLPFEVMCDASDAA------------------------------------------------------------------

Query:  --------------------EFDLEIKDKKGSENVIADHLSRLD-PSSSLLEQSAISDLFQMNSSLLLSK----------GNPGAMSLFAVWRSFQRSKD
                            EFD++I+DKKG+ENV ADHLSRL+ PS   L +S I+D F     L +             N  A  +     + Q+ + 
Subjt:  --------------------EFDLEIKDKKGSENVIADHLSRLD-PSSSLLEQSAISDLFQMNSSLLLSK----------GNPGAMSLFAVWRSFQRSKD

Query:  NYED--FALW----------------------------------------------------ILLATLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYIL
         + D  +  W                                                        T+FKDAH   K CDACQR GN+  RDEMP   I 
Subjt:  NYED--FALW----------------------------------------------------ILLATLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYIL

Query:  EVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAFACHQSDAKTIA--------------------------------------------------
          E+FDVWGIDFMGPFP S G+ +IL+AVDYVSKWVEA A   +DA+ +                                                   
Subjt:  EVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAFACHQSDAKTIA--------------------------------------------------

Query:  ----------------------------KLDEALWAYRTAYKTPLGAI-------------------------------------RMLQLNELEEFRQFS
                                    KLD+ALWA+RTAYKTP+G                                       R +Q++ELEE R  +
Subjt:  ----------------------------KLDEALWAYRTAYKTPLGAI-------------------------------------RMLQLNELEEFRQFS

Query:  YENAKMYKEKTKLWHDKKIK-SKEFVKGQK------------------------------------DEKDGRVFKVNRQRVK
        YEN+++YKE+TK  HD  +K +K+F  G +                                    +  DGR+FKVN  R+K
Subjt:  YENAKMYKEKTKLWHDKKIK-SKEFVKGQK------------------------------------DEKDGRVFKVNRQRVK

A0A6A3CBX2 Uncharacterized protein    3.9e-125    27.11
Query:  SRLLPPRLRLSPPRLRSSDIGFQFQSSSFSSSKASPLVLWLSAPPMLFKSRRWFKSNLRSKFLRISASLLCYHHHDLPRIRGNNPDLEIDRTLRTIRRLK
        S L PP L ++PP    +D+G  +       + A+ L   LS  P L+ S              + +  L Y   + P       D EI+R  R  R  +
Subjt:  SRLLPPRLRLSPPRLRSSDIGFQFQSSSFSSSKASPLVLWLSAPPMLFKSRRWFKSNLRSKFLRISASLLCYHHHDLPRIRGNNPDLEIDRTLRTIRRLK

Query:  RL------AEVMAHQDE-------------APKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMAHD-NSFKGHPSDDPHSHLQSFLEICGTVKM
        R+       + + H+++              P+AI D L P+L   N GI+   IQAT+FELK  +  M      F G P++D   H+++FL++C + + 
Subjt:  RL------AEVMAHQDE-------------APKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMAHD-NSFKGHPSDDPHSHLQSFLEICGTVKM

Query:  NRVPTDAIRLRLFPFSLQGKANDWLESVETDSISTWDELAHAFLTKFFPPVETTKVGTEIETFRQLEEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFY
          V  D ++L+LF +SL+ +A  WL  V   S+ +W +L   FL ++ PP   T++ +    FRQ ++E +YE W+RYK +LR+C  HG+ DW QV +FY
Subjt:  NRVPTDAIRLRLFPFSLQGKANDWLESVETDSISTWDELAHAFLTKFFPPVETTKVGTEIETFRQLEEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFY

Query:  NGLNPSTKTVLDTSAGGSFLSKTVTKAKDLLEEMVATSYQWSTERGVISKKA-EIYELDESSSLKAQMSSLTNALNKLTSSEVVKSISTLAEGYLKKEGQ
        NG+N  T+ +LD SA  + L K+ T+  D+L+++    YQ+ + R  + +K+ E +ELD                                         
Subjt:  NGLNPSTKTVLDTSAGGSFLSKTVTKAKDLLEEMVATSYQWSTERGVISKKA-EIYELDESSSLKAQMSSLTNALNKLTSSEVVKSISTLAEGYLKKEGQ

Query:  DVEEVQYIGNRPFTQGVPNFYHPVCAIMRTSHTRTRRMFCSHRQTTVNNHDTALKNMEVQIGQIASVVNALQKGKFPSETE---PNPREQCKMVRLRSGR
                 N P  Q                                 +H ++L+ +E Q+GQI +     + G+  S+TE    + +E C ++ LRSG 
Subjt:  DVEEVQYIGNRPFTQGVPNFYHPVCAIMRTSHTRTRRMFCSHRQTTVNNHDTALKNMEVQIGQIASVVNALQKGKFPSETE---PNPREQCKMVRLRSGR

Query:  NLEINSEKKMKKKRARMKMKGLRHKKPPLKDYAAERLEGANSVLQQNWEQNCHITARVIWCMSDPPGVRFELDPEIERTLGTEGESSAETRWRTCRVFRG
          +IN E K           G R K                                        P V  + +PE++                       
Subjt:  NLEINSEKKMKKKRARMKMKGLRHKKPPLKDYAAERLEGANSVLQQNWEQNCHITARVIWCMSDPPGVRFELDPEIERTLGTEGESSAETRWRTCRVFRG

Query:  PEGPATPQNPLLQQNPLFEQNEQQNNQAENPILIATIGPEPFEHVANRGAIPWFASEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFRILLEMEQRHER
                    ++ P+ E+  + ++  E                                                                       
Subjt:  PEGPATPQNPLLQQNPLFEQNEQQNNQAENPILIATIGPEPFEHVANRGAIPWFASEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFRILLEMEQRHER

Query:  ISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVAEEACVYCGFAKAQVMPQQNKQALPQQIREFSRGDD
                    G NK  K+      VST                ++   +PP                                   PQ++++++    
Subjt:  ISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVAEEACVYCGFAKAQVMPQQNKQALPQQIREFSRGDD

Query:  ERIYGSYRCAIQSNQASMRALELQVGAGGSNKNAGASGSVPDVEPPYVPPTLCTTSTFSTKAKPKNQDGQFKNAILKNELPPKAKDPGSFTIPVSIGGKE
         + +      +Q N                                           + T A  K     F      ++LPPK  DP SF IP  IG K 
Subjt:  ERIYGSYRCAIQSNQASMRALELQVGAGGSNKNAGASGSVPDVEPPYVPPTLCTTSTFSTKAKPKNQDGQFKNAILKNELPPKAKDPGSFTIPVSIGGKE

Query:  LGRALCDLGASINLMPLSVYRKLDY------EADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDC------SFIRI---
        +G ALCDLG+S+NLMP S++ KL          D   PIILGRPFLATGR LID ++GELTMRV ++ V  NVF+++KY D+ E+C      SF+++   
Subjt:  LGRALCDLGASINLMPLSVYRKLDY------EADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDC------SFIRI---

Query:  -----------------------LE-------------------STVIETAI----QDSL-----QYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSI
                               LE                     +I +A+    + SL     Q +KA+GW +AD++GISP+ CMHKI LEE   +SI
Subjt:  -----------------------LE-------------------STVIETAI----QDSL-----QYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSI

Query:  EQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNW---ITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRYCRKAFETLKAALISAPILCAPNW
        E QRRLNP MK+VV KE++KW DAG+IYPI+DS+W   +   P+    T            R      N P    + C  AF+ LK  L  API+  P+W
Subjt:  EQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNW---ITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRYCRKAFETLKAALISAPILCAPNW

Query:  NLPFEVMCDASDAAEFDLEIKDKKGSENVIADHLSRLD---------------PSSSLLEQSAI---SDL----------FQMNSSLLLSKGNPGAMSLF
          PFE+MCD S     D ++    G      DHLSRL+               P   +L   AI   +DL          +++N    +S     A+S  
Subjt:  NLPFEVMCDASDAAEFDLEIKDKKGSENVIADHLSRLD---------------PSSSLLEQSAI---SDL----------FQMNSSLLLSKGNPGAMSLF

Query:  AVWRSFQRSKDNYEDFALWILLATLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAFACHQ
            + Q++    +    W    +LFKDAH F K CD C R  NL  R EMPL  I+E+ELFDVWGIDFMGPFP    +++ILLAVDYVSKWVEA A  +
Subjt:  AVWRSFQRSKDNYEDFALWILLATLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAFACHQ

Query:  SDAKTI-----------------------------------------AKLDEALWAYRTAYKTPLGAI--------------------------------
        +D+KTI                                          KLDEALW YRT +KTPLG                                  
Subjt:  SDAKTI-----------------------------------------AKLDEALWAYRTAYKTPLGAI--------------------------------

Query:  -----RMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQK
             R+L LNE+EEFR  +YEN K+YKEK K WHDK +  + F +GQK
Subjt:  -----RMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQK

A0A6P6XAQ1 Reverse transcriptase    1.2e-137    26.27
Query:  MAHQDEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMAHDNSFKGHPSDDPHSHLQSFLEICGTVKMNRVPTDAIRLRLFPFSLQGKANDWL
        MA  +   + +RDF  P      + IV   + A NFE+K  LIQM   + + G+ ++DP+SHL +FLEIC T+K N V  DAI+LRLFPFSL+ KA  WL
Subjt:  MAHQDEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMAHDNSFKGHPSDDPHSHLQSFLEICGTVKMNRVPTDAIRLRLFPFSLQGKANDWL

Query:  ESVETDSISTWDELAHAFLTKFFPPVETTKVGTEIETFRQLEEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVT
        +S   ++ +TWDELA AFL KFFPP +T K+  +I +F Q E E LYEAWERY+E+ RRCP HG PDWL VQ FYNGL   TKT +D +AGG+ + KT  
Subjt:  ESVETDSISTWDELAHAFLTKFFPPVETTKVGTEIETFRQLEEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGLNPSTKTVLDTSAGGSFLSKTVT

Query:  KAKDLLEEMVATSYQWSTERGVISKKAEIYELDESSSLKAQMSSLTNALNKLTSSEVVKSISTLAEGYLKKEGQDVEEVQYIGNRPFTQGVPNFYHPVCA
        +A+ L+EEM A +YQW+ ERG   + A + E+D  + L A+M ++   LN+   S                                 QGV      +C 
Subjt:  KAKDLLEEMVATSYQWSTERGVISKKAEIYELDESSSLKAQMSSLTNALNKLTSSEVVKSISTLAEGYLKKEGQDVEEVQYIGNRPFTQGVPNFYHPVCA

Query:  IMRTSHTRTRRMFCSHRQTTVNNHDTALKNMEVQIGQIASVVNALQKGKFPSETEPNPREQCKMVRLRSGRNLEINSEKKMKKKRARMKMKGLRHKKPPL
                              +HD  + +   Q+  + +     Q   + +   P  R          G   + N ++ +       + K   H+  P 
Subjt:  IMRTSHTRTRRMFCSHRQTTVNNHDTALKNMEVQIGQIASVVNALQKGKFPSETEPNPREQCKMVRLRSGRNLEINSEKKMKKKRARMKMKGLRHKKPPL

Query:  KDYAAERLEGANSVLQQNWEQNCHITARVIWCMSDPPGVRFELDPEIERTLGTEGESSAETRWRTCRVFRGPEGPATPQNPLLQQNPLFEQNEQQNNQAE
         + A E+L  A+                               + +IE+               T + F   EG       + Q   ++   E Q  Q  
Subjt:  KDYAAERLEGANSVLQQNWEQNCHITARVIWCMSDPPGVRFELDPEIERTLGTEGESSAETRWRTCRVFRGPEGPATPQNPLLQQNPLFEQNEQQNNQAE

Query:  NPILIATIGPEPFEHVANRGAIPWFASEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFRILLEMEQRHERISINSCQWSDVRGTNKKVKSVLEVDGVST
        N +           +  N+G +P     +P  H+K+    S   +++     + R        E E+R  +      + S+++  +K+ K   +++    
Subjt:  NPILIATIGPEPFEHVANRGAIPWFASEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFRILLEMEQRHERISINSCQWSDVRGTNKKVKSVLEVDGVST

Query:  IRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVAEEACVYCGFAKAQVMPQQNKQALPQQIREFSRGDDERIYGSYRCAIQSNQASMRALELQVGAGG
           D   I              PP +               Y  F K ++M ++ K             D E I  +  C                    
Subjt:  IRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVAEEACVYCGFAKAQVMPQQNKQALPQQIREFSRGDDERIYGSYRCAIQSNQASMRALELQVGAGG

Query:  SNKNAGASGSVPDVEPPYVPPTLCTTSTFSTKAKPKNQDGQFKNAILKNELPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRK-------
                                                   +AI++N+LPPK KDPGSFT+P +IG  E  +ALCDLGAS++L+PL+V R+       
Subjt:  SNKNAGASGSVPDVEPPYVPPTLCTTSTFSTKAKPKNQDGQFKNAILKNELPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRK-------

Query:  --------------------------------------LDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYP----------
                                              LD E D +VPIILGRPFLAT   +IDV++G+   ++  EEV+F++ K  KYP          
Subjt:  --------------------------------------LDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYP----------

Query:  ----------------DEMEDC-SFIRILESTVIETA--IQDSLQYR-----------------------------------------------------
                        D +E C + I I E  + E    +Q  + Y+                                                     
Subjt:  ----------------DEMEDC-SFIRILESTVIETA--IQDSLQYR-----------------------------------------------------

Query:  ----------------KAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWIT---------------
                        KAIGWT++DI+GISP+ CMH+I LEE S   +E QRRLNP MKEVV+ E++KWLDAGII+PI+DS WI+               
Subjt:  ----------------KAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWIT---------------

Query:  ----------------------------------------------------------------IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQR
                                                                        IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQR
Subjt:  ----------------------------------------------------------------IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQR

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------YCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAA-----------------------------
                                          C  AF  LK  L+SAPI+ +P+W+LPFE+MCDASD A                             
Subjt:  ---------------------------------YCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAA-----------------------------

Query:  ---------------------------------------------------------EFDLEIKDKKGSENVIADHLSRLDPSSSLLEQSAISDLFQMNS
                                                                 EFDLEIKDKKGSEN++ADHLSRL+               Q + 
Subjt:  ---------------------------------------------------------EFDLEIKDKKGSENVIADHLSRLDPSSSLLEQSAISDLFQMNS

Query:  SLLLSKGNPGAMSLFAVWRSFQRSKDNYEDFALWILLATLFKD-------------AHWFYKQ---------------------CDACQRRGNLGPRDEM
         L + +  P    L A+++S       Y D   +I    +  D              H+F+++                     CD CQR GN+  R+EM
Subjt:  SLLLSKGNPGAMSLFAVWRSFQRSKDNYEDFALWILLATLFKD-------------AHWFYKQ---------------------CDACQRRGNLGPRDEM

Query:  PLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAFACHQSDAKTIA--------------------------------------------
        PLT  LEVELFDVWGIDFMGPFP S  N +ILLAVDYVSKWVEA     ++AK +                                             
Subjt:  PLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAFACHQSDAKTIA--------------------------------------------

Query:  -----------------------------------KLDEALWAYRTAYKTPLG---------------------------AI----------RMLQLNEL
                                           KL++ALWAYRTA+KTPLG                           AI          RML+L+EL
Subjt:  -----------------------------------KLDEALWAYRTAYKTPLG---------------------------AI----------RMLQLNEL

Query:  EEFRQFSYENAKMYKEKTKLWHDKKIKSKEF
        EE R  SYEN K+YKEK K WHDK I  K F
Subjt:  EEFRQFSYENAKMYKEKTKLWHDKKIKSKEF

SwissProt top hits    e value    % identity    Alignment
P92516 Uncharacterized mitochondrial protein AtMg00750    1.5e-12    46.51
Query:  KGNPG-AMSLFAVWRSFQRSKDNYEDFALWILLA-----TLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFM
        +G+ G ++ L  + RS Q    + + +  ++L A     T FKDAH F   CDACQR+GN   R+EMP  +ILEVE+FDVWGI FM
Subjt:  KGNPG-AMSLFAVWRSFQRSKDNYEDFALWILLA-----TLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFM

Arabidopsis top hits    e value    % identity    Alignment
ATMG00750.1 GAG/POL/ENV polyprotein    1.0e-13    46.51
Query:  KGNPG-AMSLFAVWRSFQRSKDNYEDFALWILLA-----TLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFM
        +G+ G ++ L  + RS Q    + + +  ++L A     T FKDAH F   CDACQR+GN   R+EMP  +ILEVE+FDVWGI FM
Subjt:  KGNPG-AMSLFAVWRSFQRSKDNYEDFALWILLA-----TLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFM


Sequences
CDS sequence
ATGAGCTGTTCGTGTAAAGAAACAACCAACGAGATGCTTGAATTCGGAGTTCAAAGAGACTTAGGGTTTCAAGTCTTCGTCTCCATCTTCGCCTCCGCCACCTCGTCTTC
CGCCTCCTCGTCTTCGCCTCCGCCTCCGCCTTCTCGTCTTCTGCCTCCTCGTCTTCGCCTCTCGCCTCCTCGTCTTCGAAGCTCAGATATAGGGTTTCAATTTCAATCGT
CTTCGTTTTCGTCTTCAAAAGCCTCTCCTCTCGTGCTATGGTTGTCGGCGCCGCCGATGCTGTTTAAGAGCCGTCGATGGTTTAAGTCGAATCTCAGATCGAAATTTCTT
AGGATTTCGGCCTCTTTGCTCTGCTACCACCACCACGACCTCCCTAGAATCCGCGGAAACAACCCAGATCTTGAAATTGACAGGACTCTTAGAACCATTCGTAGATTGAA
AAGATTAGCAGAAGTAATGGCCCATCAAGATGAAGCTCCCAAGGCAATCAGAGACTTTTTACAGCCAGTTCTTCCCACCGAGAATTCTGGAATTGTTTACGCTCCGATCC
AAGCTACCAATTTTGAGTTAAAGACAGGATTGATTCAGATGGCGCACGATAACTCTTTTAAGGGACATCCTTCTGATGACCCACACTCACATCTGCAATCATTCTTGGAA
ATATGTGGGACGGTAAAAATGAACAGAGTTCCGACCGACGCTATAAGATTGAGGTTGTTCCCATTTTCTCTTCAAGGCAAAGCAAACGATTGGCTCGAATCAGTCGAGAC
GGACAGTATTAGTACATGGGACGAGCTTGCCCATGCTTTTCTGACGAAATTTTTCCCACCTGTCGAGACCACAAAGGTCGGGACTGAGATCGAAACGTTTAGGCAGCTTG
AAGAGGAGCAGTTGTACGAGGCATGGGAGAGATACAAGGAGATGCTTAGGCGGTGTCCCCAACATGGATATCCGGATTGGCTCCAAGTGCAGTTGTTTTACAATGGTTTG
AATCCCTCCACCAAGACAGTTCTAGACACATCAGCTGGAGGAAGTTTTCTTTCCAAGACAGTAACGAAAGCCAAAGATCTGCTTGAGGAAATGGTGGCAACCAGTTATCA
GTGGTCGACCGAGAGGGGAGTAATTTCTAAGAAGGCTGAAATTTATGAATTAGATGAGTCGAGTTCGTTGAAGGCACAAATGTCATCTTTGACCAACGCCCTGAACAAGC
TAACTTCATCTGAGGTGGTCAAATCCATTTCCACCTTAGCTGAAGGTTATTTAAAGAAGGAAGGTCAAGATGTGGAAGAAGTCCAGTACATAGGAAACAGACCATTTACT
CAAGGAGTACCGAACTTCTACCACCCAGTCTGCGCAATCATGAGAACTTCTCATACTCGAACACGAAGAATGTTTTGCAGCCACCGCCAGACGACAGTTAATAACCACGA
CACAGCTCTGAAAAACATGGAAGTTCAGATAGGTCAGATAGCTTCAGTAGTGAATGCTCTTCAGAAGGGAAAATTTCCAAGCGAAACTGAACCTAACCCAAGAGAGCAGT
GCAAGATGGTGAGACTAAGAAGTGGTAGGAATCTGGAGATCAATTCAGAAAAGAAAATGAAGAAGAAAAGAGCAAGGATGAAGATGAAAGGGTTGAGGCACAAAAAGCCT
CCTCTGAAAGATTATGCTGCTGAGCGACTGGAGGGAGCAAATTCTGTGCTGCAGCAAAACTGGGAACAGAACTGCCACATCACAGCTCGTGTGATTTGGTGCATGAGCGA
TCCGCCTGGGGTAAGGTTTGAGCTTGATCCAGAAATTGAAAGGACATTAGGAACAGAAGGAGAGAGCAGCGCAGAAACCAGATGGAGAACGTGTCGCGTCTTCCGAGGTC
CTGAAGGTCCAGCAACCCCACAGAATCCGTTGCTGCAGCAAAACCCGCTGTTTGAGCAAAATGAGCAGCAAAATAATCAGGCTGAGAATCCTATCTTGATAGCAACGATA
GGACCAGAGCCATTCGAGCATGTTGCAAACCGTGGGGCAATTCCATGGTTTGCATCTGAAGACCCTCATTTACATCTTAAGTCTTTTCTAGGAGTTAGTGATTCTTTTGT
AATTCAAGGAGTGCCTAGAGATGCTCTTAGATTAACTTTGTTCCGTATTCTCTTAGAGATGGAGCAAAGGCATGAAAGAATATCTATTAATAGTTGTCAGTGGTCGGATG
TTAGAGGCACAAATAAAAAGGTTAAGAGTGTATTAGAGGTTGATGGTGTGTCCACCATTAGGGCTGATCTTGCTATGATTGCTAACGCTCTTAAGAATGTGACAGTGATT
AGTCATCAGCAGCCACCAGCTATGGAGCCTACTGCAGTGGTGAACCAAGTGGCAGAAGAAGCATGTGTCTATTGTGGATTTGCTAAAGCGCAGGTAATGCCCCAGCAAAA
TAAGCAGGCTTTGCCCCAGCAAATTCGGGAATTCTCTCGAGGCGATGATGAAAGAATTTATGGCTCGTACAGATGCGCAATTCAAAGTAATCAAGCTTCGATGAGAGCCC
TGGAATTGCAAGTGGGTGCTGGAGGCAGCAATAAAAATGCTGGAGCATCTGGTTCTGTTCCAGATGTGGAACCACCTTATGTGCCCCCCACCTTATGTACCACCTCTACC
TTTTCCACAAAGGCAAAACCTAAGAATCAGGATGGTCAATTTAAAAATGCTATTCTTAAGAATGAGCTACCACCCAAGGCTAAGGATCCAGGATCATTTACTATACCTGT
GTCTATAGGTGGAAAAGAGTTAGGTAGAGCACTCTGTGATTTAGGCGCGAGCATTAACCTAATGCCTCTTTCGGTCTATCGAAAGTTAGACTATGAGGCTGATAAAGATG
TCCCAATTATTCTAGGTCGTCCATTTTTGGCTACTGGTAGGGCATTAATAGATGTTCAAAAAGGAGAATTAACAATGAGAGTCTGTAATGAGGAAGTGAAATTTAATGTG
TTTAAAGCCATGAAGTATCCAGACGAAATGGAGGATTGCTCCTTCATTAGGATTCTGGAGAGCACAGTTATTGAGACAGCAATACAGGATTCGCTACAATACCGCAAGGC
TATAGGTTGGACATTGGCTGACATTCAGGGAATTAGCCCATCTTTTTGTATGCACAAAATCACTCTAGAGGAGGGATCCTTTAGGAGTATTGAGCAACAAAGAAGGCTTA
ACCCTGCAATGAAAGAGGTTGTTAAAAAGGAGGTGATTAAATGGTTGGATGCTGGGATCATTTATCCAATTGCCGATAGCAATTGGATTACTATTGCTCCTGAGGATCAG
GAGAAAACCACTTTCACCTGCCCTTATGGGACGTTTGCTTTTAGGCGAATGCCTTTTGGCCTCTGCAATGCTCCAGCAACATTTCAGCGGTATTGTAGGAAGGCTTTTGA
GACTTTAAAGGCTGCTTTAATCTCAGCACCCATTCTTTGCGCACCTAATTGGAATTTACCATTCGAGGTAATGTGTGATGCGAGTGATGCTGCGGAATTCGACTTGGAAA
TAAAGGATAAGAAGGGATCAGAAAATGTCATTGCAGATCATTTATCTCGTCTTGATCCGTCATCATCTTTGCTGGAGCAATCTGCCATTTCAGATCTTTTCCAGATGAAC
AGCTCTTTGCTGTTGAGCAAAGGAAATCCTGGAGCAATGTCACTCTTCGCCGTATGGAGGTCATTTCAGCGGTCAAAGGACAACTATGAGGATTTTGCATTGTGGATTCT
TCTGGCCACCTTATTCAAAGATGCCCATTGGTTCTACAAGCAATGTGATGCTTGCCAAAGGAGAGGAAATTTAGGGCCTAGAGATGAAATGCCTCTTACTTACATTTTAG
AAGTTGAATTATTCGATGTATGGGGTATTGATTTTATGGGGCCATTTCCCCCTTCTAATGGCAATGTTTTTATCTTATTGGCAGTTGATTATGTGTCCAAGTGGGTGGAG
GCCTTCGCATGCCATCAGAGTGATGCCAAGACAATAGCTAAGTTGGATGAGGCTCTTTGGGCTTATAGGACAGCCTATAAGACTCCTCTAGGAGCAATCAGAATGCTGCA
GCTTAATGAATTAGAGGAATTTCGCCAATTTTCTTATGAGAATGCGAAAATGTATAAAGAAAAGACTAAGCTATGGCATGACAAGAAAATAAAATCTAAAGAGTTTGTCA
AGGGTCAAAAAGATGAAAAAGATGGGAGAGTGTTCAAAGTGAATAGACAGCGTGTGAAAATTATTGGGGAGAGGAATTTCAGCGAAATATCCTTCCCTAAGGTAAATTTG
ATGTATCATCTGACGAGGCCGCGTGTCCCACAGCAATCATCCAAAGTTATCTATGGCAAAACAAGAGCTAGGAAAGAGAGGGAAAGTGAAGAGGAGGAGGTACCGGTCAC
CCGGAAGTCAAAAAGGGAAAACCAAGAAGAAAAGAACGCCGGAGGAAAAGGAAGCAAAGAAAAGGAGAAGGCAGCAAAGGGCTGCAGAACAGGAGGAAGTTCAGGAGGTG
GCAGAAGTTGTTGCCACTACTGCGGAGGAAGAAGTACTCAAGAACCTGAAGTGCAAAACCCAGATACGGTTCAAGAAAAGATTGCTGAGAAAAATCAAGAAACAGAGGTT
GAGGAGCAGGTCGCAGGTATGCCTGAAAAAGAGAAAACACCGGAGCCGGTGCAGGAGGCTCATGCTGAAGTCGTAATGCCTGAACCACCAAAGCGCCGCCGCATCAAACA
GAAGGCGGGTCGCGTGAGGGCAAAGGAAGAAGAGGCAAGGAAGGCAGAGGAAGAGACTTTGCGAGAGCAACGAGAAGACAAGGGCAAAGGAATTGTCGAAGCATCGGGTG
AGATTGAGGAACCAACGGTACCGTTCATTCGCTTTGTCAACGAGCTTGCGAGAGCAAAATACCAAGAGGTGCTGAAGCGTGATTTCTTATTCGAGCGGGGATTTGGCAGT
GATTTGCCAAGCTTCTTAGAGTCTGGAATAGCGAACCTTGGGTGGAGGCAGTTTTGTGCGAAGCCTGAACCTACAATGCCAACATTTTATAGTTCGAGGAGTGCCTGTAC
AGTGGAGCCCAAAGCCATTAATAATTTGTTTGATCTCCAGGATTTTCCGCATGCAGTTTTCAATGAGATGATGGTTGCGCCATCGAGCGACCAATTAAGTGCGGCGGTCC
GAGAGAGTGAAGCCAACACTTGGATGGGTTTCATTAGGCTACGCTTACTGCCGACACCACACGACTCCACTGTATCTCGGGACAGAGTATTGCTTGCCTTTACCATCCTT
CGCTCAATGAGTATAGATGTTGGAAAAATAATTTCTACTGAGATTGCTGACTGTTGGCGCAAAAAGGTGGGGAAGCTGTTTTTTCCAAACACGATTACGATGTTATGCAG
CAGGACAGGAGTGCCCACGGTTCCAGAGGATATGATTATGCTTGATAAGGGAATCATTGACACACCTAATCTGGCGCGGCTTCAGCAATTTGCTGAAAGGCAAGCTCAGA
CCTATTGGACTTATGCTAAAAGGAGAGATGATGCGCTCAGGAGGGCCTTGCAAACCAATTTCTCAAAACCATATCAGGCCTTCCCAGTGTTTCCCGATGACTTATTTAAT
CTGTGGATACCACCCCCACCTGTTGAACGAGAAGAGGATGTTGATGAGGAGCAGGGTCAGGAAGATTGA
mRNA sequence
ATGAGCTGTTCGTGTAAAGAAACAACCAACGAGATGCTTGAATTCGGAGTTCAAAGAGACTTAGGGTTTCAAGTCTTCGTCTCCATCTTCGCCTCCGCCACCTCGTCTTC
CGCCTCCTCGTCTTCGCCTCCGCCTCCGCCTTCTCGTCTTCTGCCTCCTCGTCTTCGCCTCTCGCCTCCTCGTCTTCGAAGCTCAGATATAGGGTTTCAATTTCAATCGT
CTTCGTTTTCGTCTTCAAAAGCCTCTCCTCTCGTGCTATGGTTGTCGGCGCCGCCGATGCTGTTTAAGAGCCGTCGATGGTTTAAGTCGAATCTCAGATCGAAATTTCTT
AGGATTTCGGCCTCTTTGCTCTGCTACCACCACCACGACCTCCCTAGAATCCGCGGAAACAACCCAGATCTTGAAATTGACAGGACTCTTAGAACCATTCGTAGATTGAA
AAGATTAGCAGAAGTAATGGCCCATCAAGATGAAGCTCCCAAGGCAATCAGAGACTTTTTACAGCCAGTTCTTCCCACCGAGAATTCTGGAATTGTTTACGCTCCGATCC
AAGCTACCAATTTTGAGTTAAAGACAGGATTGATTCAGATGGCGCACGATAACTCTTTTAAGGGACATCCTTCTGATGACCCACACTCACATCTGCAATCATTCTTGGAA
ATATGTGGGACGGTAAAAATGAACAGAGTTCCGACCGACGCTATAAGATTGAGGTTGTTCCCATTTTCTCTTCAAGGCAAAGCAAACGATTGGCTCGAATCAGTCGAGAC
GGACAGTATTAGTACATGGGACGAGCTTGCCCATGCTTTTCTGACGAAATTTTTCCCACCTGTCGAGACCACAAAGGTCGGGACTGAGATCGAAACGTTTAGGCAGCTTG
AAGAGGAGCAGTTGTACGAGGCATGGGAGAGATACAAGGAGATGCTTAGGCGGTGTCCCCAACATGGATATCCGGATTGGCTCCAAGTGCAGTTGTTTTACAATGGTTTG
AATCCCTCCACCAAGACAGTTCTAGACACATCAGCTGGAGGAAGTTTTCTTTCCAAGACAGTAACGAAAGCCAAAGATCTGCTTGAGGAAATGGTGGCAACCAGTTATCA
GTGGTCGACCGAGAGGGGAGTAATTTCTAAGAAGGCTGAAATTTATGAATTAGATGAGTCGAGTTCGTTGAAGGCACAAATGTCATCTTTGACCAACGCCCTGAACAAGC
TAACTTCATCTGAGGTGGTCAAATCCATTTCCACCTTAGCTGAAGGTTATTTAAAGAAGGAAGGTCAAGATGTGGAAGAAGTCCAGTACATAGGAAACAGACCATTTACT
CAAGGAGTACCGAACTTCTACCACCCAGTCTGCGCAATCATGAGAACTTCTCATACTCGAACACGAAGAATGTTTTGCAGCCACCGCCAGACGACAGTTAATAACCACGA
CACAGCTCTGAAAAACATGGAAGTTCAGATAGGTCAGATAGCTTCAGTAGTGAATGCTCTTCAGAAGGGAAAATTTCCAAGCGAAACTGAACCTAACCCAAGAGAGCAGT
GCAAGATGGTGAGACTAAGAAGTGGTAGGAATCTGGAGATCAATTCAGAAAAGAAAATGAAGAAGAAAAGAGCAAGGATGAAGATGAAAGGGTTGAGGCACAAAAAGCCT
CCTCTGAAAGATTATGCTGCTGAGCGACTGGAGGGAGCAAATTCTGTGCTGCAGCAAAACTGGGAACAGAACTGCCACATCACAGCTCGTGTGATTTGGTGCATGAGCGA
TCCGCCTGGGGTAAGGTTTGAGCTTGATCCAGAAATTGAAAGGACATTAGGAACAGAAGGAGAGAGCAGCGCAGAAACCAGATGGAGAACGTGTCGCGTCTTCCGAGGTC
CTGAAGGTCCAGCAACCCCACAGAATCCGTTGCTGCAGCAAAACCCGCTGTTTGAGCAAAATGAGCAGCAAAATAATCAGGCTGAGAATCCTATCTTGATAGCAACGATA
GGACCAGAGCCATTCGAGCATGTTGCAAACCGTGGGGCAATTCCATGGTTTGCATCTGAAGACCCTCATTTACATCTTAAGTCTTTTCTAGGAGTTAGTGATTCTTTTGT
AATTCAAGGAGTGCCTAGAGATGCTCTTAGATTAACTTTGTTCCGTATTCTCTTAGAGATGGAGCAAAGGCATGAAAGAATATCTATTAATAGTTGTCAGTGGTCGGATG
TTAGAGGCACAAATAAAAAGGTTAAGAGTGTATTAGAGGTTGATGGTGTGTCCACCATTAGGGCTGATCTTGCTATGATTGCTAACGCTCTTAAGAATGTGACAGTGATT
AGTCATCAGCAGCCACCAGCTATGGAGCCTACTGCAGTGGTGAACCAAGTGGCAGAAGAAGCATGTGTCTATTGTGGATTTGCTAAAGCGCAGGTAATGCCCCAGCAAAA
TAAGCAGGCTTTGCCCCAGCAAATTCGGGAATTCTCTCGAGGCGATGATGAAAGAATTTATGGCTCGTACAGATGCGCAATTCAAAGTAATCAAGCTTCGATGAGAGCCC
TGGAATTGCAAGTGGGTGCTGGAGGCAGCAATAAAAATGCTGGAGCATCTGGTTCTGTTCCAGATGTGGAACCACCTTATGTGCCCCCCACCTTATGTACCACCTCTACC
TTTTCCACAAAGGCAAAACCTAAGAATCAGGATGGTCAATTTAAAAATGCTATTCTTAAGAATGAGCTACCACCCAAGGCTAAGGATCCAGGATCATTTACTATACCTGT
GTCTATAGGTGGAAAAGAGTTAGGTAGAGCACTCTGTGATTTAGGCGCGAGCATTAACCTAATGCCTCTTTCGGTCTATCGAAAGTTAGACTATGAGGCTGATAAAGATG
TCCCAATTATTCTAGGTCGTCCATTTTTGGCTACTGGTAGGGCATTAATAGATGTTCAAAAAGGAGAATTAACAATGAGAGTCTGTAATGAGGAAGTGAAATTTAATGTG
TTTAAAGCCATGAAGTATCCAGACGAAATGGAGGATTGCTCCTTCATTAGGATTCTGGAGAGCACAGTTATTGAGACAGCAATACAGGATTCGCTACAATACCGCAAGGC
TATAGGTTGGACATTGGCTGACATTCAGGGAATTAGCCCATCTTTTTGTATGCACAAAATCACTCTAGAGGAGGGATCCTTTAGGAGTATTGAGCAACAAAGAAGGCTTA
ACCCTGCAATGAAAGAGGTTGTTAAAAAGGAGGTGATTAAATGGTTGGATGCTGGGATCATTTATCCAATTGCCGATAGCAATTGGATTACTATTGCTCCTGAGGATCAG
GAGAAAACCACTTTCACCTGCCCTTATGGGACGTTTGCTTTTAGGCGAATGCCTTTTGGCCTCTGCAATGCTCCAGCAACATTTCAGCGGTATTGTAGGAAGGCTTTTGA
GACTTTAAAGGCTGCTTTAATCTCAGCACCCATTCTTTGCGCACCTAATTGGAATTTACCATTCGAGGTAATGTGTGATGCGAGTGATGCTGCGGAATTCGACTTGGAAA
TAAAGGATAAGAAGGGATCAGAAAATGTCATTGCAGATCATTTATCTCGTCTTGATCCGTCATCATCTTTGCTGGAGCAATCTGCCATTTCAGATCTTTTCCAGATGAAC
AGCTCTTTGCTGTTGAGCAAAGGAAATCCTGGAGCAATGTCACTCTTCGCCGTATGGAGGTCATTTCAGCGGTCAAAGGACAACTATGAGGATTTTGCATTGTGGATTCT
TCTGGCCACCTTATTCAAAGATGCCCATTGGTTCTACAAGCAATGTGATGCTTGCCAAAGGAGAGGAAATTTAGGGCCTAGAGATGAAATGCCTCTTACTTACATTTTAG
AAGTTGAATTATTCGATGTATGGGGTATTGATTTTATGGGGCCATTTCCCCCTTCTAATGGCAATGTTTTTATCTTATTGGCAGTTGATTATGTGTCCAAGTGGGTGGAG
GCCTTCGCATGCCATCAGAGTGATGCCAAGACAATAGCTAAGTTGGATGAGGCTCTTTGGGCTTATAGGACAGCCTATAAGACTCCTCTAGGAGCAATCAGAATGCTGCA
GCTTAATGAATTAGAGGAATTTCGCCAATTTTCTTATGAGAATGCGAAAATGTATAAAGAAAAGACTAAGCTATGGCATGACAAGAAAATAAAATCTAAAGAGTTTGTCA
AGGGTCAAAAAGATGAAAAAGATGGGAGAGTGTTCAAAGTGAATAGACAGCGTGTGAAAATTATTGGGGAGAGGAATTTCAGCGAAATATCCTTCCCTAAGGTAAATTTG
ATGTATCATCTGACGAGGCCGCGTGTCCCACAGCAATCATCCAAAGTTATCTATGGCAAAACAAGAGCTAGGAAAGAGAGGGAAAGTGAAGAGGAGGAGGTACCGGTCAC
CCGGAAGTCAAAAAGGGAAAACCAAGAAGAAAAGAACGCCGGAGGAAAAGGAAGCAAAGAAAAGGAGAAGGCAGCAAAGGGCTGCAGAACAGGAGGAAGTTCAGGAGGTG
GCAGAAGTTGTTGCCACTACTGCGGAGGAAGAAGTACTCAAGAACCTGAAGTGCAAAACCCAGATACGGTTCAAGAAAAGATTGCTGAGAAAAATCAAGAAACAGAGGTT
GAGGAGCAGGTCGCAGGTATGCCTGAAAAAGAGAAAACACCGGAGCCGGTGCAGGAGGCTCATGCTGAAGTCGTAATGCCTGAACCACCAAAGCGCCGCCGCATCAAACA
GAAGGCGGGTCGCGTGAGGGCAAAGGAAGAAGAGGCAAGGAAGGCAGAGGAAGAGACTTTGCGAGAGCAACGAGAAGACAAGGGCAAAGGAATTGTCGAAGCATCGGGTG
AGATTGAGGAACCAACGGTACCGTTCATTCGCTTTGTCAACGAGCTTGCGAGAGCAAAATACCAAGAGGTGCTGAAGCGTGATTTCTTATTCGAGCGGGGATTTGGCAGT
GATTTGCCAAGCTTCTTAGAGTCTGGAATAGCGAACCTTGGGTGGAGGCAGTTTTGTGCGAAGCCTGAACCTACAATGCCAACATTTTATAGTTCGAGGAGTGCCTGTAC
AGTGGAGCCCAAAGCCATTAATAATTTGTTTGATCTCCAGGATTTTCCGCATGCAGTTTTCAATGAGATGATGGTTGCGCCATCGAGCGACCAATTAAGTGCGGCGGTCC
GAGAGAGTGAAGCCAACACTTGGATGGGTTTCATTAGGCTACGCTTACTGCCGACACCACACGACTCCACTGTATCTCGGGACAGAGTATTGCTTGCCTTTACCATCCTT
CGCTCAATGAGTATAGATGTTGGAAAAATAATTTCTACTGAGATTGCTGACTGTTGGCGCAAAAAGGTGGGGAAGCTGTTTTTTCCAAACACGATTACGATGTTATGCAG
CAGGACAGGAGTGCCCACGGTTCCAGAGGATATGATTATGCTTGATAAGGGAATCATTGACACACCTAATCTGGCGCGGCTTCAGCAATTTGCTGAAAGGCAAGCTCAGA
CCTATTGGACTTATGCTAAAAGGAGAGATGATGCGCTCAGGAGGGCCTTGCAAACCAATTTCTCAAAACCATATCAGGCCTTCCCAGTGTTTCCCGATGACTTATTTAAT
CTGTGGATACCACCCCCACCTGTTGAACGAGAAGAGGATGTTGATGAGGAGCAGGGTCAGGAAGATTGA
Protein sequence
MSCSCKETTNEMLEFGVQRDLGFQVFVSIFASATSSSASSSSPPPPPSRLLPPRLRLSPPRLRSSDIGFQFQSSSFSSSKASPLVLWLSAPPMLFKSRRWFKSNLRSKFL
RISASLLCYHHHDLPRIRGNNPDLEIDRTLRTIRRLKRLAEVMAHQDEAPKAIRDFLQPVLPTENSGIVYAPIQATNFELKTGLIQMAHDNSFKGHPSDDPHSHLQSFLE
ICGTVKMNRVPTDAIRLRLFPFSLQGKANDWLESVETDSISTWDELAHAFLTKFFPPVETTKVGTEIETFRQLEEEQLYEAWERYKEMLRRCPQHGYPDWLQVQLFYNGL
NPSTKTVLDTSAGGSFLSKTVTKAKDLLEEMVATSYQWSTERGVISKKAEIYELDESSSLKAQMSSLTNALNKLTSSEVVKSISTLAEGYLKKEGQDVEEVQYIGNRPFT
QGVPNFYHPVCAIMRTSHTRTRRMFCSHRQTTVNNHDTALKNMEVQIGQIASVVNALQKGKFPSETEPNPREQCKMVRLRSGRNLEINSEKKMKKKRARMKMKGLRHKKP
PLKDYAAERLEGANSVLQQNWEQNCHITARVIWCMSDPPGVRFELDPEIERTLGTEGESSAETRWRTCRVFRGPEGPATPQNPLLQQNPLFEQNEQQNNQAENPILIATI
GPEPFEHVANRGAIPWFASEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFRILLEMEQRHERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVI
SHQQPPAMEPTAVVNQVAEEACVYCGFAKAQVMPQQNKQALPQQIREFSRGDDERIYGSYRCAIQSNQASMRALELQVGAGGSNKNAGASGSVPDVEPPYVPPTLCTTST
FSTKAKPKNQDGQFKNAILKNELPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLDYEADKDVPIILGRPFLATGRALIDVQKGELTMRVCNEEVKFNV
FKAMKYPDEMEDCSFIRILESTVIETAIQDSLQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWITIAPEDQ
EKTTFTCPYGTFAFRRMPFGLCNAPATFQRYCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAAEFDLEIKDKKGSENVIADHLSRLDPSSSLLEQSAISDLFQMN
SSLLLSKGNPGAMSLFAVWRSFQRSKDNYEDFALWILLATLFKDAHWFYKQCDACQRRGNLGPRDEMPLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVE
AFACHQSDAKTIAKLDEALWAYRTAYKTPLGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFVKGQKDEKDGRVFKVNRQRVKIIGERNFSEISFPKVNL
MYHLTRPRVPQQSSKVIYGKTRARKERESEEEEVPVTRKSKRENQEEKNAGGKGSKEKEKAAKGCRTGGSSGGGRSCCHYCGGRSTQEPEVQNPDTVQEKIAEKNQETEV
EEQVAGMPEKEKTPEPVQEAHAEVVMPEPPKRRRIKQKAGRVRAKEEEARKAEEETLREQREDKGKGIVEASGEIEEPTVPFIRFVNELARAKYQEVLKRDFLFERGFGS
DLPSFLESGIANLGWRQFCAKPEPTMPTFYSSRSACTVEPKAINNLFDLQDFPHAVFNEMMVAPSSDQLSAAVRESEANTWMGFIRLRLLPTPHDSTVSRDRVLLAFTIL
RSMSIDVGKIISTEIADCWRKKVGKLFFPNTITMLCSRTGVPTVPEDMIMLDKGIIDTPNLARLQQFAERQAQTYWTYAKRRDDALRRALQTNFSKPYQAFPVFPDDLFN
LWIPPPPVEREEDVDEEQGQED
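
The protein sequence listed above appears to be the conceptual translation of the CDS. A minimal sketch of that translation follows, assuming the standard genetic code (NCBI translation table 1) and reading frame 1. The names `CODON_TABLE` and `translate` are illustrative and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence
    lines can be pasted in as-is. Unrecognised codons become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 45 nt of the CDS above; the result matches the start of the protein.
print(translate("ATGAGCTGTTCGTGTAAAGAAACAACCAACGAGATGCTTGAATTC"))  # MSCSCKETTNEMLEF
```

Passing the full CDS block to `translate` should, under the same assumptions, yield a sequence that can be compared against the protein sequence above.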