CuGenDBv2

Lag0031570 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0031570
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: RNA-directed DNA polymerase
Genome location: chr11:10144511..10158297
RNA-Seq Expression: Lag0031570
Synteny: Lag0031570
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016620 - oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
IPR015590 - Aldehyde dehydrogenase domain
IPR043502 - DNA/RNA polymerase superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR041588 - Integrase zinc-binding domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR036875 - Zinc finger, CCHC-type superfamily
IPR036397 - Ribonuclease H superfamily
IPR029510 - Aldehyde dehydrogenase, glutamic acid active site
IPR021109 - Aspartic peptidase domain superfamily
IPR016163 - Aldehyde dehydrogenase, C-terminal
IPR016162 - Aldehyde dehydrogenase, N-terminal
IPR016161 - Aldehyde/histidinol dehydrogenase
IPR012337 - Ribonuclease H-like superfamily
IPR005162 - Retrotransposon gag domain
IPR001969 - Aspartic peptidase, active site
IPR001878 - Zinc finger, CCHC-type
IPR001584 - Integrase, catalytic core
IPR000477 - Reverse transcriptase domain
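
The genome location above is given as a chromosome name followed by a start..end range. As a rough illustration only, the sketch below splits such a string and computes the span of the gene model. It assumes the coordinates are 1-based and inclusive, which this record does not state; the function names parse_location and span_length are hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


# Location string copied from the record above.
chrom, start, end = parse_location("chr11:10144511..10158297")
length = span_length(start, end)
print(chrom, start, end, length)
```

If the coordinates were instead 0-based and half-open, the span would be end - start, one base shorter.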


Homology
GenBank top hits (e value, %identity, alignment)
PKU68162.1 RNA-directed DNA polymerase [Dendrobium catenatum]  e value: 3.5e-293  %identity: 41.62
Query:  YHKQESDFKMKVDIPTYNGKMEIEAFLEWIRHVENFFDYMNTPESKKVKLVALKLKGGAQAWWDQLEINRQRFGKRPIRKWERMKRLMRDRFLPANFEQI
        Y   + +FK+K+DIP ++G M IE +L+W + VENFF+YM     K+VK VA +LKGGA AWW Q+   RQR GK  IR W RMK+L+R +FLP +FEQI
Subjt:  YHKQESDFKMKVDIPTYNGKMEIEAFLEWIRHVENFFDYMNTPESKKVKLVALKLKGGAQAWWDQLEINRQRFGKRPIRKWERMKRLMRDRFLPANFEQI

Query:  LYTQYQNCRQGTRTVMEYTEEFHRLGACTNLGENQQCLVARFKGGLRLDIQERISFQPIAFLNEAITAAATAEEQITNRFKRQYTRRNVVESSTSNPR-N
        LY +YQ+C QGTR+V EYTEEF+RL A  NL E+   LVAR+ GGL+ +IQ+R+    +  + +A+  A   E Q   + K  Y RR+  ES+T+N + N
Subjt:  LYTQYQNCRQGTRTVMEYTEEFHRLGACTNLGENQQCLVARFKGGLRLDIQERISFQPIAFLNEAITAAATAEEQITNRFKRQYTRRNVVESSTSNPR-N

Query:  TTVDKT---TSPTTSVGAKGKQI-DAGTSKDLVPKRNTNNYSRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVDYQAESDE-TDSDEDIDYLEPDEGDA
         +  KT    SP  S  +    + +    +   P +  N Y++P   KCFRC Q GH SNECPQR+ + +A+  +   +++E  D D DI+ L+ D+G+ 
Subjt:  TTVDKT---TSPTTSVGAKGKQI-DAGTSKDLVPKRNTNNYSRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVDYQAESDE-TDSDEDIDYLEPDEGDA

Query:  LSLVIQRLLLAPKIDQSYQRHALFKTRCTINGKICNVIIDSGSTENVVASKLVSTLNLPLTPHPAPYK--------------------------------
        L  V+++LLLAP+ + S QRHA+F+TRCT+ GK+C+++ID+G TENV+A  +V  LNL  T +  PYK                                
Subjt:  LSLVIQRLLLAPKIDQSYQRHALFKTRCTINGKICNVIIDSGSTENVVASKLVSTLNLPLTPHPAPYK--------------------------------

Query:  ---------------------AIHKGRDNTYEFTWMGKKIVLLPINPSKQVGTTTNKKGQLFSISSGKK--NFQEKQFPIIGLVIKYFDNLDPNIA-IPS
                             A++  R N+Y F W G+++ LLP     +     + K     + SG    +  ++Q P+  L++    ++DP +   P 
Subjt:  ---------------------AIHKGRDNTYEFTWMGKKIVLLPINPSKQVGTTTNKKGQLFSISSGKK--NFQEKQFPIIGLVIKYFDNLDPNIA-IPS

Query:  EVQNLLQDFKEII--DTPIELPPLRDIQHTIEFLPGATLPNLPHYRMSPNEYKILHEQVQELLEKGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINK
        EV  LL+ +++I+  + P+ LPP+R++QH ++ LPGA+LPN+PHYR++P E  IL E + ELL+K  IQ SLSPCAVPALL PKKDGSWRMC+DSRAINK
Subjt:  EVQNLLQDFKEII--DTPIELPPLRDIQHTIEFLPGATLPNLPHYRMSPNEYKILHEQVQELLEKGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINK

Query:  ITVEYRFPILRINDLLNQLSGATIFPKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQ----------------------
        ITV+YRFP+ RI++LL+ LSG+++F K+DL+SGYHQIRIR GDEWKTAFKT +GL+EW VMPFGL NAPSTFMR+M++                      
Subjt:  ITVEYRFPILRINDLLNQLSGATIFPKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQ----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------EKIITSPVLALPNFDLPFEVAVDASGIGVGAVLSQNNHPIEFFSEKLSPSRQ--------------------------------
                          +I++PVLALPNFD  F V  DASG+G+GAVLSQ   P+ FFSEKL  +RQ                                
Subjt:  ----------------EKIITSPVLALPNFDLPFEVAVDASGIGVGAVLSQNNHPIEFFSEKLSPSRQ--------------------------------

Query:  ----------------------------------SPG---------------------EIIAFNSLPEDYHSDKDFGSIGKNCTEHKPTGDFHLVDGFLF
                                           PG                     E      L E Y +D DFG   K C + + +  + +  G+LF
Subjt:  ----------------------------------SPG---------------------EIIAFNSLPEDYHSDKDFGSIGKNCTEHKPTGDFHLVDGFLF

Query:  KGNLLCIPFSSLREALLQEAHAGGLAGHLGRDKTLHMLEERYYWPQLKKDVQNYVSRCFTCQTCKGTSQNTGLYLPLPIPDNIWEDLSMDFILGLPRTQR
        K NLLCIP SS R+ L++EAH+GGLA H GRD TL  L+ R++WP+L++DV   V  C  CQ  KG  QNTGLY+PLP+P++IWEDLS+DF+LGLPRT+R
Subjt:  KGNLLCIPFSSLREALLQEAHAGGLAGHLGRDKTLHMLEERYYWPQLKKDVQNYVSRCFTCQTCKGTSQNTGLYLPLPIPDNIWEDLSMDFILGLPRTQR

Query:  GFDSVLVVVDRYSKMAHFLACKKTSDASYIANLFFREVVRLHGIPKSIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNRTLGNLIRCL
        G DS++VVVDR+SKMAHF++CKKTS+A +IANLFF E+VRLHG+P+S+ SDRDVKF+SHFW+ LWK+  T L+ S+  HPQTDGQTEV NRTLGN++RCL
Subjt:  GFDSVLVVVDRYSKMAHFLACKKTSDASYIANLFFREVVRLHGIPKSIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNRTLGNLIRCL

Query:  SGDKPRQWDMTLAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTLDLTLIPSSVDLSLEAEEMATRIRDLHQQVHKHIEEQTIKYKAAADKKRRYKEFAVG
          D+P++W+  L++AEFA+N M NR+TGKSPF +VYTK P + LDL+++P     S  A        DL  +V +H+     +YK AAD +RR + F VG
Subjt:  SGDKPRQWDMTLAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTLDLTLIPSSVDLSLEAEEMATRIRDLHQQVHKHIEEQTIKYKAAADKKRRYKEFAVG

Query:  DLVMVHLRKNRFPAGTYSKLKEKKIGPLPILEKYGDNAYRVQLPATYNISSTFNIADIFPYNPPDES
        DLVMV LR+ RFP GTYSKL  +KIGP+PI  K  DNAY V LPA  N S+TFN++DI  Y P DE+
Subjt:  DLVMVHLRKNRFPAGTYSKLKEKKIGPLPILEKYGDNAYRVQLPATYNISSTFNIADIFPYNPPDES

PWA81295.1 transposon Ty3-I Gag-Pol polyprotein [Artemisia annua]  e value: 8.4e-295  %identity: 40.89
Query:  HKRYKELKENPLFRRYQDWSESSSEDEKEPRVFYQQNPRYDHAERKEDARIPYHPYGHQEFPMYNHPEGRDDARMFYHKQESDFKMKVDIPTYNGKMEIE
        H  Y+    N  F R    ++S  +  +   V +         E  E+        G +E      P  R +     + +   +++K +IP + G ++IE
Subjt:  HKRYKELKENPLFRRYQDWSESSSEDEKEPRVFYQQNPRYDHAERKEDARIPYHPYGHQEFPMYNHPEGRDDARMFYHKQESDFKMKVDIPTYNGKMEIE

Query:  AFLEWIRHVENFFDYMNTPESKKVKLVALKLKGGAQAWWDQLEINRQRFGKRPIRKWERMKRLMRDRFLPANFEQILYTQYQNCRQGTRTVMEYTEEFHR
        A L+W+  V+ FFD M+ P+ ++VK+VA KL+GGA AWW + + NR+  G+RP+  W  MKR+++ RFLP + EQILY QY NC QG RTV EYT EF R
Subjt:  AFLEWIRHVENFFDYMNTPESKKVKLVALKLKGGAQAWWDQLEINRQRFGKRPIRKWERMKRLMRDRFLPANFEQILYTQYQNCRQGTRTVMEYTEEFHR

Query:  LGACTNLGENQQCLVARFKGGLRLDIQERISFQPIAFLNEAITAAATAEEQITNRFK--RQYTRRNVVESSTSNPRNTTVDKTTSPTTSVGAKGKQIDAG
        L A  NL E  +   AR+  GL   IQE++S   I  +++A   A  AE   T      R+ T  +         +N     +T+ +TS  +K       
Subjt:  LGACTNLGENQQCLVARFKGGLRLDIQERISFQPIAFLNEAITAAATAEEQITNRFK--RQYTRRNVVESSTSNPRNTTVDKTTSPTTSVGAKGKQIDAG

Query:  TSKDLVPKRNTNNYSRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVDYQAESDET-DSDEDIDYLEPDEG--DALSLVIQRLLLAPKIDQSYQRHALFK
         SK+  P + +N Y++P   KCFRC + GH SN CP+R TL  ++  +     DE+   ++D++Y EP +G  + ++ VIQR L +PK+  S QR+ +F+
Subjt:  TSKDLVPKRNTNNYSRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVDYQAESDET-DSDEDIDYLEPDEG--DALSLVIQRLLLAPKIDQSYQRHALFK

Query:  TRCTINGKICNVIIDSGSTENVVASKLVSTLNLPLTPHPAPYK-----------------------------------------------------AIHK
        T+C +  KIC++IID GS EN+V+  LV    LP  PHP PY+                                                     A H+
Subjt:  TRCTINGKICNVIIDSGSTENVVASKLVSTLNLPLTPHPAPYK-----------------------------------------------------AIHK

Query:  GRDNTYEFTWMGKKIVLLPI---NPSKQVGTTTNKKGQLFSISSGKKNFQEKQFPI---IGLVIKYFDNLDPNIAIPSEVQNLLQDFKEII--DTPIELP
        G+ N Y F W GK I +LP+   +P K+V   T     L ++ S  K FQ ++        LV+K  ++   N+ IP  ++ +L++F +++  DTP  LP
Subjt:  GRDNTYEFTWMGKKIVLLPI---NPSKQVGTTTNKKGQLFSISSGKKNFQEKQFPI---IGLVIKYFDNLDPNIAIPSEVQNLLQDFKEII--DTPIELP

Query:  PLRDIQHTIEFLPGATLPNLPHYRMSPNEYKILHEQVQELLEKGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINKITVEYRFPILRINDLLNQLSGA
        PLR+IQH I+ +PGA+LPNLPHYRMSP E  IL E+V+ELL KGHIQ S+SPCAVPALLTPKKDGSWRMCVDSRAINKITV YRFPI R++DLL+QLSGA
Subjt:  PLRDIQHTIEFLPGATLPNLPHYRMSPNEYKILHEQVQELLEKGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINKITVEYRFPILRINDLLNQLSGA

Query:  TIFPKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQ--------------------------------------------
         +F KIDL+SGYHQIRI+PGDEWKTAFKT +GL+EWLVMPFGLSNAPSTFMRLM Q                                            
Subjt:  TIFPKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQ--------------------------------------------

Query:  ----------------------------------------------------------------------------------------------EKIITS
                                                                                                      E++ T+
Subjt:  ----------------------------------------------------------------------------------------------EKIITS

Query:  PVLALPNFDLPFEVAVDASGIGVGAVLSQNNHPIEFFSEKLSPSRQS-----------------------------------------------------
        PVL+LPNFD  FE+  DA G G+GAVLSQ   P+ F SEKL+ +RQ                                                      
Subjt:  PVLALPNFDLPFEVAVDASGIGVGAVLSQNNHPIEFFSEKLSPSRQS-----------------------------------------------------

Query:  ----------------------------------PGEIIAFNSLPEDYHSDKDFGSIGKNCTEHKPTGDFHLVDGFLFKGNLLCIPFSSLREALLQEAHA
                                            +++ F S+   Y +D+DF S  +     +  G+F L+DG+LFKGN LCIP +SLR  L++E HA
Subjt:  ----------------------------------PGEIIAFNSLPEDYHSDKDFGSIGKNCTEHKPTGDFHLVDGFLFKGNLLCIPFSSLREALLQEAHA

Query:  GGLAGHLGRDKTLHMLEERYYWPQLKKDVQNYVSRCFTCQTCKGTSQNTGLYLPLPIPDNIWEDLSMDFILGLPRTQRGFDSVLVVVDRYSKMAHFLACK
        GGL+ HLGRDKT+  +E R+YWPQLK+DV ++V RC  CQ  KG +QNTGLY+PLP+P++ W D+SMDF+LGLPRTQRG DSV VVVDR+SKMAHF+ CK
Subjt:  GGLAGHLGRDKTLHMLEERYYWPQLKKDVQNYVSRCFTCQTCKGTSQNTGLYLPLPIPDNIWEDLSMDFILGLPRTQRGFDSVLVVVDRYSKMAHFLACK

Query:  KTSDASYIANLFFREVVRLHGIPKSIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNRTLGNLIRCLSGDKPRQWDMTLAQAEFAYNHM
        KTSDA++IA LFF+EVVRLHG+PKSI SDRD KFL+HFW TLW++  T+L +S+T+HPQTDGQTEV NRTLGN+IRCL G+KP+ WD++LAQAEFAYN  
Subjt:  KTSDASYIANLFFREVVRLHGIPKSIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNRTLGNLIRCLSGDKPRQWDMTLAQAEFAYNHM

Query:  KNRTTGKSPFELVYTKLPRLTLDLTLIPSSVDLSLEAEEMATRIRDLHQQVHKHIEEQTIKYKAAADKKRRYKEFAVGDLVMVHLRKNRFPAGTYSKLKE
         + +TG SPF++VY   PR  +DL  +P     +++A +M   ++  H+ V   I E   KYKAAADK RR K F VGD VMV LRK RFP GTYSKL+ 
Subjt:  KNRTTGKSPFELVYTKLPRLTLDLTLIPSSVDLSLEAEEMATRIRDLHQQVHKHIEEQTIKYKAAADKKRRYKEFAVGDLVMVHLRKNRFPAGTYSKLKE

Query:  KKIGPLPILEKYGDNAYRVQLPATYNISSTFNIADIFPYNPPD
        KK GP  IL K  DNAY V LP T +IS TFN++DI+ ++P D
Subjt:  KKIGPLPILEKYGDNAYRVQLPATYNISSTFNIADIFPYNPPD

TXG62763.1 hypothetical protein EZV62_009757 [Acer yangbiense]  e value: 1.9e-302  %identity: 42.81
Query:  HKQESDFKMKVDIPTYNGKMEIEAFLEWIRHVENFFDYMNTPESKKVKLVALKLKGGAQAWWDQLEINRQRFGKRPIRKWERMKRLMRDRFLPANFEQIL
        ++ + D++++ DIP + GK++IE FL+WI  VE FF++    E ++VKLVA KL+ GA  WW++L+++R+R GK PIR W RMK+LM +RFLP ++EQ +
Subjt:  HKQESDFKMKVDIPTYNGKMEIEAFLEWIRHVENFFDYMNTPESKKVKLVALKLKGGAQAWWDQLEINRQRFGKRPIRKWERMKRLMRDRFLPANFEQIL

Query:  YTQYQNCRQGTRTVMEYTEEFHRLGACTNLGENQQCLVARFKGGLRLDIQERISFQPIAFLNEAITAAATAE--EQITNRFKRQYTRRNVVESSTS----
        +  YQNC QG +TV +YTEE+ RL    NL E +   V+R+ GGL+  I+++I  Q +  ++EA   A  AE  E+ +NRF   + ++++ ESS +    
Subjt:  YTQYQNCRQGTRTVMEYTEEFHRLGACTNLGENQQCLVARFKGGLRLDIQERISFQPIAFLNEAITAAATAE--EQITNRFKRQYTRRNVVESSTS----

Query:  ---------NPRNTTVDKTTSPTTSVGAKGKQIDAGTSKDLVPKRNTNNYSRPTLGKCFRCNQTGHLSNECPQRKTL----AIADNVDYQAESDETDSDE
                      T D+    +      G +  A  +    P+   N YSRP + KC+RC Q GH SNECP RK +    A  +  D Q   +E + DE
Subjt:  ---------NPRNTTVDKTTSPTTSVGAKGKQIDAGTSKDLVPKRNTNNYSRPTLGKCFRCNQTGHLSNECPQRKTL----AIADNVDYQAESDETDSDE

Query:  DID--YLEPDEGDALSLVIQRLLLAPKIDQSYQRHALFKTRCTINGKICNVIIDSGSTENVVASKLVSTLNLPLTPHPAPY-------------------
         ++   +  ++G+ ++ VIQR++ A K+D   QR+++FKT C+I GK+C++I+DSGS EN V+ KL+  L L    HP PY                   
Subjt:  DID--YLEPDEGDALSLVIQRLLLAPKIDQSYQRHALFKTRCTINGKICNVIIDSGSTENVVASKLVSTLNLPLTPHPAPY-------------------

Query:  ----------------------------------KAIHKGRDNTYEFTWMGKKIVLLPINPSKQVGT--TTNKKGQLFSISSGK------KNFQEKQFPI
                                             ++GRDN   F+W GK+I ++P   S    T  T N++  +  I+S        K  QE    +
Subjt:  ----------------------------------KAIHKGRDNTYEFTWMGKKIVLLPINPSKQVGT--TTNKKGQLFSISSGK------KNFQEKQFPI

Query:  I-GLVIKYFDNLDPNIAIPSEVQNLLQDFKEII--DTPIELPPLRDIQHTIEFLPGATLPNLPHYRMSPNEYKILHEQVQELLEKGHIQPSLSPCAVPAL
        +  LVI+     +    +P +V  LL +F +++  D P +LPPLRDIQH I+ +PGA+LPNLPHYRMSP E +IL  +V+EL++KG I+ S+SPCAVP L
Subjt:  I-GLVIKYFDNLDPNIAIPSEVQNLLQDFKEII--DTPIELPPLRDIQHTIEFLPGATLPNLPHYRMSPNEYKILHEQVQELLEKGHIQPSLSPCAVPAL

Query:  LTPKKDGSWRMCVDSRAINKITVEYRFPILRINDLLNQLSGATIFPKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQ--
        L PKKDGSWRMCVDSRAINKITV Y FPI R++D+L+ L G+ +F KIDL+SGYHQIR++PGDEWKTAFKT +GL+EWLVMPFGLSNAPSTFMRLMNQ  
Subjt:  LTPKKDGSWRMCVDSRAINKITVEYRFPILRINDLLNQLSGATIFPKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQ--

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----EKIITSPVLALPNFDLPFEVAVDASGIGVGAVLSQNNHPIEFFSEKLSPSRQS------------------------------------------
             EK+ T+PVLALP+F+  FEV  DASG+G+GAVLSQ   P+ FFSEKLS +R+                                           
Subjt:  -----EKIITSPVLALPNFDLPFEVAVDASGIGVGAVLSQNNHPIEFFSEKLSPSRQS------------------------------------------

Query:  ---------------------------------------------PGEIIAFNSLPEDYHSDKDFGSIGKNCTEHKPTGDFHLVDGFLFKGNLLCIPFSS
                                                       EII F  L E Y  D+DFG + + C   +  G+FH+ +G+LF GN LCIP SS
Subjt:  ---------------------------------------------PGEIIAFNSLPEDYHSDKDFGSIGKNCTEHKPTGDFHLVDGFLFKGNLLCIPFSS

Query:  LREALLQEAHAGGLAGHLGRDKTLHMLEERYYWPQLKKDVQNYVSRCFTCQTCKGTSQNTGLYLPLPIPDNIWEDLSMDFILGLPRTQRGFDSVLVVVDR
        LRE L++E H GGL GHLGRDKT+  + ERYYWPQLK+DV N+V +C+  QT KG +QNTGLY+PLP+P+ IWEDL+MDF+LGLPRTQRG DSV VVVDR
Subjt:  LREALLQEAHAGGLAGHLGRDKTLHMLEERYYWPQLKKDVQNYVSRCFTCQTCKGTSQNTGLYLPLPIPDNIWEDLSMDFILGLPRTQRGFDSVLVVVDR

Query:  YSKMAHFLACKKTSDASYIANLFFREVVRLHGIPKSIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNRTLGNLIRCLSGDKPRQWDMT
        +SKM HF+ C+KTSDAS++A LFFREVVRLHG+P+SI SDRD KFLSHFW TLW++ DT LK+S+T+HPQTDGQTE  NRTLGNLIR + GDKP+QWD+ 
Subjt:  YSKMAHFLACKKTSDASYIANLFFREVVRLHGIPKSIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNRTLGNLIRCLSGDKPRQWDMT

Query:  LAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTLDLTLIPSSVDLSLEAEEMATRIRDLHQQVHKHIEEQTIKYKAAADKKRRYKEFAVGDLVMVHLRKNR
        LAQAEFAYN+  +  TGKSPF LVY + P+  LDL  +P    +S+ AE MA ++RD+  +V   +EE+  KYKAAAD KRR K FA GD VMV LRK R
Subjt:  LAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTLDLTLIPSSVDLSLEAEEMATRIRDLHQQVHKHIEEQTIKYKAAADKKRRYKEFAVGDLVMVHLRKNR

Query:  FPAGTYSKLKEKKIGPLPILEKYGDNAYRVQLPATYNISSTFNIADIFPY
        FP G+Y+KLK +K GP  +++K  +NAY + LPA  NIS TFN+AD++ +
Subjt:  FPAGTYSKLKEKKIGPLPILEKYGDNAYRVQLPATYNISSTFNIADIFPY

XP_025979678.1 uncharacterized protein LOC112997809 [Glycine max]  e value: 7.1e-294  %identity: 42.07
Query:  DFKMKVDIPTYNGKMEIEAFLEWIRHVENFFDYMNTPESKKVKLVALKLKGGAQAWWDQLEINRQRFGKRPIRKWERMKRLMRDRFLPANFEQILYTQYQ
        D+++K DIP + G M +E FL+W   V+ FFD M+ PE K+VK+VA++LK  A  WWD+L + R R  K PIR W +MK+LM +RFLP ++EQILY  Y 
Subjt:  DFKMKVDIPTYNGKMEIEAFLEWIRHVENFFDYMNTPESKKVKLVALKLKGGAQAWWDQLEINRQRFGKRPIRKWERMKRLMRDRFLPANFEQILYTQYQ

Query:  NCRQGTRTVMEYTEEFHRLGACTNLGENQQCLVARFKGGLRLDIQERISFQPIAFLNEAITAAATAEEQITNRFKRQYTRRNVVESSTSNPRNTTVDKTT
         C QG R+V EYT EF R      LGE++   VAR+  GL+  +QE++  Q +  + EA   A  AE  +  +  R ++  +  +SS  N   ++ DK  
Subjt:  NCRQGTRTVMEYTEEFHRLGACTNLGENQQCLVARFKGGLRLDIQERISFQPIAFLNEAITAAATAEEQITNRFKRQYTRRNVVESSTSNPRNTTVDKTT

Query:  SPTTSVGAKGKQIDAGTS---KDLVPKRNTNN-YSRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVDYQAESDETDSDE--DIDYLEPDEGDALSLVIQ
        S        G +     S   +   P +  NN Y +PT+  CFRCN  GH SN CP R+  A+    D   E+++ D  E  ++++ E +  + ++ V+Q
Subjt:  SPTTSVGAKGKQIDAGTS---KDLVPKRNTNN-YSRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVDYQAESDETDSDE--DIDYLEPDEGDALSLVIQ

Query:  RLLLAPKIDQSYQRHALFKTRCTINGKICNVIIDSGSTENVVASKLVSTLNLPLTPHPAPY---------------------------------------
        R+LLA K  +  QR  LFKT C++  K+C++IID+GSTEN+V+ KLV+   LP  PH  PY                                       
Subjt:  RLLLAPKIDQSYQRHALFKTRCTINGKICNVIIDSGSTENVVASKLVSTLNLPLTPHPAPY---------------------------------------

Query:  --------------KAIHKGRDNTYEFTWMGKKIVLLPI-----NPSKQVGTTTNKKGQLFSISSGKKNFQE---KQFPIIGLVIK-YFDNLDPNIAIPS
                         ++GRDN   FTW   KI + PI     NP        +KK     ++  ++   +   +      +V K     +   + IP 
Subjt:  --------------KAIHKGRDNTYEFTWMGKKIVLLPI-----NPSKQVGTTTNKKGQLFSISSGKKNFQE---KQFPIIGLVIK-YFDNLDPNIAIPS

Query:  EVQNLLQDFKEII--DTPIELPPLRDIQHTIEFLPGATLPNLPHYRMSPNEYKILHEQVQELLEKGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINK
        E+  +L+DF E+I  + P +LPP+RDIQH I+ +PG++LPNLPHYRMSP E +IL EQ+++LL KG I+ S+SPCAVP LL PKK   WRMCVDSRAINK
Subjt:  EVQNLLQDFKEII--DTPIELPPLRDIQHTIEFLPGATLPNLPHYRMSPNEYKILHEQVQELLEKGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINK

Query:  ITVEYRFPILRINDLLNQLSGATIFPKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQ----------------------
        IT++YRFPI R+ D+L++L+G+ +F KIDL+SGYHQIRIRPGDEWKTAFK+ +GL+EWLVMPFGLSNAPSTFMRLMNQ                      
Subjt:  ITVEYRFPILRINDLLNQLSGATIFPKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQ----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------EKIITSPVLALPNFDLPFEVAVDASGIGVGAVLSQNNHPIEFFSEKLSPSRQS-------------------------------
                        EK+ T+PVLALP+FD  F+V  DASGIG+GAVLSQ   PI FFSEKLS +R+                                
Subjt:  ----------------EKIITSPVLALPNFDLPFEVAVDASGIGVGAVLSQNNHPIEFFSEKLSPSRQS-------------------------------

Query:  --------------------------------------------------------PGEIIAFNSLPEDYHSDKDFGSIGKNCTEHKPTGDFHLVDGFLF
                                                                  E++ F  L E Y +D +F  +   C EH P  DFH+ +GFLF
Subjt:  --------------------------------------------------------PGEIIAFNSLPEDYHSDKDFGSIGKNCTEHKPTGDFHLVDGFLF

Query:  KGNLLCIPFSSLREALLQEAHAGGLAGHLGRDKTLHMLEERYYWPQLKKDVQNYVSRCFTCQTCKGTSQNTGLYLPLPIPDNIWEDLSMDFILGLPRTQR
        KGN LCIP SSLRE L+++ H GGL+GH+GRDKT+  LEER+YWP L+KD    V +C+TCQ  KG SQNTGLY+PLPIPD+IW+DL+MDF+LGLPRTQR
Subjt:  KGNLLCIPFSSLREALLQEAHAGGLAGHLGRDKTLHMLEERYYWPQLKKDVQNYVSRCFTCQTCKGTSQNTGLYLPLPIPDNIWEDLSMDFILGLPRTQR

Query:  GFDSVLVVVDRYSKMAHFLACKKTSDASYIANLFFREVVRLHGIPKSIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNRTLGNLIRCL
        G DSV VVVDR+SKM+HF+ACKKT+DAS IA LFFREVV LHG+PKSI SDRD KFLSHFW TLWK FDT+L  S+T+HPQTDGQTEVTNRTLGN+IRC+
Subjt:  GFDSVLVVVDRYSKMAHFLACKKTSDASYIANLFFREVVRLHGIPKSIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNRTLGNLIRCL

Query:  SGDKPRQWDMTLAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTLDLTLIPSSVDLSLEAEEMATRIRDLHQQVHKHIEEQTIKYKAAADKKRRYKEFAVG
         GDKP+QWD+ L Q EFAYN   +  TGK+PF LVYT +PR  +DL  +P +   S+ AE MA  I  +   V   +E   +K K AADK++R K F VG
Subjt:  SGDKPRQWDMTLAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTLDLTLIPSSVDLSLEAEEMATRIRDLHQQVHKHIEEQTIKYKAAADKKRRYKEFAVG

Query:  DLVMVHLRKNRFPAGTYSKLKEKKIGPLPILEKYGDNAYRVQLPATYNISSTFNIADIFPYN
        D VMV LRK RFP GTYSKL+ +K GP  +  K  DNAY V LPA+ NIS+TFN+ADI+ Y+
Subjt:  DLVMVHLRKNRFPAGTYSKLKEKKIGPLPILEKYGDNAYRVQLPATYNISSTFNIADIFPYN

XP_028555656.1 uncharacterized protein LOC114580913 [Dendrobium catenatum]  e value: 1.0e-292  %identity: 39.66
Query:  ATRGKE--TQIRQGESSKLTNQHEDRTKFQQERCFNEQKIT----QDWAAAPTKYQEWEMAHKRYKELK---------ENPLFRRY-----QDWSESSSE
        A RGKE  T   +G  +   NQ E+  +   +   + Q++T    +++    T+  + +  H  +++L+           P  RR+     QD+SES   
Subjt:  ATRGKE--TQIRQGESSKLTNQHEDRTKFQQERCFNEQKIT----QDWAAAPTKYQEWEMAHKRYKELK---------ENPLFRRY-----QDWSESSSE

Query:  DEKEPRVFYQQNPRYDHAERKEDARIPYHPYGHQEFPMYNHPEGRDDARMFYHKQESDFKMKVDIPTYNGKMEIEAFLEWIRHVENFFDYMNTPESKKVK
        DE+ P  F ++    D AE  EDA        HQ +   +H   R     F   Q  +FK K+DIP ++G+M IE +L+W + VENFF+YM     ++V+
Subjt:  DEKEPRVFYQQNPRYDHAERKEDARIPYHPYGHQEFPMYNHPEGRDDARMFYHKQESDFKMKVDIPTYNGKMEIEAFLEWIRHVENFFDYMNTPESKKVK

Query:  LVALKLKGGAQAWWDQLEINRQRFGKRPIRKWERMKRLMRDRFLPANFEQILYTQYQNCRQGTRTVMEYTEEFHRLGACTNLGENQQCLVARFKGGLRLD
         VA +LKGGA AWW+Q+   RQR GK  +R W+RMK+L++ +FLP ++EQILY +YQ+C QG+RTV +YTEEFHRL A  NL E+   LVARF GGL+  
Subjt:  LVALKLKGGAQAWWDQLEINRQRFGKRPIRKWERMKRLMRDRFLPANFEQILYTQYQNCRQGTRTVMEYTEEFHRLGACTNLGENQQCLVARFKGGLRLD

Query:  IQERISFQPIAFLNEAITAAATAEEQITNRFKRQYTRRNVVESSTSNPRNTTVDKTTSPTTSVGAKGKQIDAGTSKDLVPKRNT----NNYSRPTLGKCF
        IQ+R+    +  + +A+  A   E Q     +  Y RR+  E   S   +T+V K+T             +  T     PK  T    N Y++P+  KCF
Subjt:  IQERISFQPIAFLNEAITAAATAEEQITNRFKRQYTRRNVVESSTSNPRNTTVDKTTSPTTSVGAKGKQIDAGTSKDLVPKRNT----NNYSRPTLGKCF

Query:  RCNQTGHLSNECPQRKTLAIADNVDY-QAESDETDSDEDIDYLEPDEGDALSLVIQRLLLAPKIDQSYQRHALFKTRCTINGKICNVIIDSGSTENVVAS
        RC Q GH SNECPQR+ + +A+  +  +  +DE + + + + +  DEG+AL  V+++LLLAP+ +   QRH++FKTRCTI GK+C+++ID+G TENV++ 
Subjt:  RCNQTGHLSNECPQRKTLAIADNVDY-QAESDETDSDEDIDYLEPDEGDALSLVIQRLLLAPKIDQSYQRHALFKTRCTINGKICNVIIDSGSTENVVAS

Query:  KLVSTLNLPLTPHPAPYK-----------------------------------------------------AIHKGRDNTYEFTWMGKKIVLLPINPSKQ
         +V +L+L    +  PYK                                                     A +  R N Y F W G+++ LLP      
Subjt:  KLVSTLNLPLTPHPAPYK-----------------------------------------------------AIHKGRDNTYEFTWMGKKIVLLPINPSKQ

Query:  VGTTTNKKGQLFSISSGKK--NFQEKQFPIIGLVIKYFDNLDPNIAIPSEVQNLLQDFKEII--DTPIELPPLRDIQHTIEFLPGATLPNLPHYRMSPNE
        V TT +++     + +G    +  ++  PI  L++           +PS VQ LLQ + +I+  + P ELPPLR+IQH I+ +PGA LPNLPHYR++P E
Subjt:  VGTTTNKKGQLFSISSGKK--NFQEKQFPIIGLVIKYFDNLDPNIAIPSEVQNLLQDFKEII--DTPIELPPLRDIQHTIEFLPGATLPNLPHYRMSPNE

Query:  YKILHEQVQELLEKGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINKITVEYRFPILRINDLLNQLSGATIFPKIDLKSGYHQIRIRPGDEWKTAFKT
          IL + + +LL +  IQPSLSPCAVPALL PKKDG+WRMCVDSRAINKIT++Y FP+ RI +LL++L+G  IF K+DL+SGYHQIR+RPGDEWKTAFKT
Subjt:  YKILHEQVQELLEKGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINKITVEYRFPILRINDLLNQLSGATIFPKIDLKSGYHQIRIRPGDEWKTAFKT

Query:  NEGLFEWLVMPFGLSNAPSTFMRLMNQ-------------------------------------------------------------------------
         +GL+EW VMPFGL NAPSTFMR+MN+                                                                         
Subjt:  NEGLFEWLVMPFGLSNAPSTFMRLMNQ-------------------------------------------------------------------------

Query:  -----------------------------------------------------------------EKIITSPVLALPNFDLPFEVAVDASGIGVGAVLSQ
                                                                         + +I++PVL LP+FD  F V  DAS IG+GAVLSQ
Subjt:  -----------------------------------------------------------------EKIITSPVLALPNFDLPFEVAVDASGIGVGAVLSQ

Query:  NNHPIEFFSEKLSPSRQS------------------------------------------------------------------PG--------------
           P+ FFSEKLS +R++                                                                  PG              
Subjt:  NNHPIEFFSEKLSPSRQS------------------------------------------------------------------PG--------------

Query:  -------EIIAFNSLPEDYHSDKDFGSIGKNCTEHKPTGDFHLVDGFLFKGNLLCIPFSSLREALLQEAHAGGLAGHLGRDKTLHMLEERYYWPQLKKDV
               E    + L E Y  D DF  + K C E +  GD+ +  G+LFK NLLCIP SS R  L++EAH GGLA H+GRDKTL  ++ R++WP+L++DV
Subjt:  -------EIIAFNSLPEDYHSDKDFGSIGKNCTEHKPTGDFHLVDGFLFKGNLLCIPFSSLREALLQEAHAGGLAGHLGRDKTLHMLEERYYWPQLKKDV

Query:  QNYVSRCFTCQTCKGTSQNTGLYLPLPIPDNIWEDLSMDFILGLPRTQRGFDSVLVVVDRYSKMAHFLACKKTSDASYIANLFFREVVRLHGIPKSIVSD
           V  C  CQ+ KG +QNTGLY+PLP+PD+IWED+S+DFILGLPRT++G DS+LVVVDR+SKMAHFLACKKTS+A  IA LFF ++VRLHG+P+S+ SD
Subjt:  QNYVSRCFTCQTCKGTSQNTGLYLPLPIPDNIWEDLSMDFILGLPRTQRGFDSVLVVVDRYSKMAHFLACKKTSDASYIANLFFREVVRLHGIPKSIVSD

Query:  RDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNRTLGNLIRCLSGDKPRQWDMTLAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTLDLTLIPS
        RDVKF+SHFW+ LWK+  T +  S+  HPQTDGQTEV NRTLGN++RCL  D P+QW+  L +AEFAYN M NR+TGKSPF +VYTK P L +D+ ++P 
Subjt:  RDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNRTLGNLIRCLSGDKPRQWDMTLAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTLDLTLIPS

Query:  SVDLSLEAEEMATRIRDLHQQVHKHIEEQTIKYKAAADKKRRYKEFAVGDLVMVHLRKNRFPAGTYSKLKEKKIGPLPILEKYGDNAYRVQLPATYNISS
            SL AE  +    D+  +V  HI +   +YK  AD  RR + F++GDLVMV +RK RFP GTYSKL  +KIGP+PI  K  DNAY V LPA YN+SS
Subjt:  SVDLSLEAEEMATRIRDLHQQVHKHIEEQTIKYKAAADKKRRYKEFAVGDLVMVHLRKNRFPAGTYSKLKEKKIGPLPILEKYGDNAYRVQLPATYNISS

Query:  TFNIADIFPYNPPDESCG
        TFN++DI+ Y+PPD++ G
Subjt:  TFNIADIFPYNPPDESCG

TrEMBL top hits (e value, %identity, alignment)
A0A2I0VXL7 RNA-directed DNA polymerase  e value: 1.7e-293  %identity: 41.62
Query:  YHKQESDFKMKVDIPTYNGKMEIEAFLEWIRHVENFFDYMNTPESKKVKLVALKLKGGAQAWWDQLEINRQRFGKRPIRKWERMKRLMRDRFLPANFEQI
        Y   + +FK+K+DIP ++G M IE +L+W + VENFF+YM     K+VK VA +LKGGA AWW Q+   RQR GK  IR W RMK+L+R +FLP +FEQI
Subjt:  YHKQESDFKMKVDIPTYNGKMEIEAFLEWIRHVENFFDYMNTPESKKVKLVALKLKGGAQAWWDQLEINRQRFGKRPIRKWERMKRLMRDRFLPANFEQI

Query:  LYTQYQNCRQGTRTVMEYTEEFHRLGACTNLGENQQCLVARFKGGLRLDIQERISFQPIAFLNEAITAAATAEEQITNRFKRQYTRRNVVESSTSNPR-N
        LY +YQ+C QGTR+V EYTEEF+RL A  NL E+   LVAR+ GGL+ +IQ+R+    +  + +A+  A   E Q   + K  Y RR+  ES+T+N + N
Subjt:  LYTQYQNCRQGTRTVMEYTEEFHRLGACTNLGENQQCLVARFKGGLRLDIQERISFQPIAFLNEAITAAATAEEQITNRFKRQYTRRNVVESSTSNPR-N

Query:  TTVDKT---TSPTTSVGAKGKQI-DAGTSKDLVPKRNTNNYSRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVDYQAESDE-TDSDEDIDYLEPDEGDA
         +  KT    SP  S  +    + +    +   P +  N Y++P   KCFRC Q GH SNECPQR+ + +A+  +   +++E  D D DI+ L+ D+G+ 
Subjt:  TTVDKT---TSPTTSVGAKGKQI-DAGTSKDLVPKRNTNNYSRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVDYQAESDE-TDSDEDIDYLEPDEGDA

Query:  LSLVIQRLLLAPKIDQSYQRHALFKTRCTINGKICNVIIDSGSTENVVASKLVSTLNLPLTPHPAPYK--------------------------------
        L  V+++LLLAP+ + S QRHA+F+TRCT+ GK+C+++ID+G TENV+A  +V  LNL  T +  PYK                                
Subjt:  LSLVIQRLLLAPKIDQSYQRHALFKTRCTINGKICNVIIDSGSTENVVASKLVSTLNLPLTPHPAPYK--------------------------------

Query:  ---------------------AIHKGRDNTYEFTWMGKKIVLLPINPSKQVGTTTNKKGQLFSISSGKK--NFQEKQFPIIGLVIKYFDNLDPNIA-IPS
                             A++  R N+Y F W G+++ LLP     +     + K     + SG    +  ++Q P+  L++    ++DP +   P 
Subjt:  ---------------------AIHKGRDNTYEFTWMGKKIVLLPINPSKQVGTTTNKKGQLFSISSGKK--NFQEKQFPIIGLVIKYFDNLDPNIA-IPS

Query:  EVQNLLQDFKEII--DTPIELPPLRDIQHTIEFLPGATLPNLPHYRMSPNEYKILHEQVQELLEKGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINK
        EV  LL+ +++I+  + P+ LPP+R++QH ++ LPGA+LPN+PHYR++P E  IL E + ELL+K  IQ SLSPCAVPALL PKKDGSWRMC+DSRAINK
Subjt:  EVQNLLQDFKEII--DTPIELPPLRDIQHTIEFLPGATLPNLPHYRMSPNEYKILHEQVQELLEKGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINK

Query:  ITVEYRFPILRINDLLNQLSGATIFPKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQ----------------------
        ITV+YRFP+ RI++LL+ LSG+++F K+DL+SGYHQIRIR GDEWKTAFKT +GL+EW VMPFGL NAPSTFMR+M++                      
Subjt:  ITVEYRFPILRINDLLNQLSGATIFPKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQ----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------EKIITSPVLALPNFDLPFEVAVDASGIGVGAVLSQNNHPIEFFSEKLSPSRQ--------------------------------
                          +I++PVLALPNFD  F V  DASG+G+GAVLSQ   P+ FFSEKL  +RQ                                
Subjt:  ----------------EKIITSPVLALPNFDLPFEVAVDASGIGVGAVLSQNNHPIEFFSEKLSPSRQ--------------------------------

Query:  ----------------------------------SPG---------------------EIIAFNSLPEDYHSDKDFGSIGKNCTEHKPTGDFHLVDGFLF
                                           PG                     E      L E Y +D DFG   K C + + +  + +  G+LF
Subjt:  ----------------------------------SPG---------------------EIIAFNSLPEDYHSDKDFGSIGKNCTEHKPTGDFHLVDGFLF

Query:  KGNLLCIPFSSLREALLQEAHAGGLAGHLGRDKTLHMLEERYYWPQLKKDVQNYVSRCFTCQTCKGTSQNTGLYLPLPIPDNIWEDLSMDFILGLPRTQR
        K NLLCIP SS R+ L++EAH+GGLA H GRD TL  L+ R++WP+L++DV   V  C  CQ  KG  QNTGLY+PLP+P++IWEDLS+DF+LGLPRT+R
Subjt:  KGNLLCIPFSSLREALLQEAHAGGLAGHLGRDKTLHMLEERYYWPQLKKDVQNYVSRCFTCQTCKGTSQNTGLYLPLPIPDNIWEDLSMDFILGLPRTQR

Query:  GFDSVLVVVDRYSKMAHFLACKKTSDASYIANLFFREVVRLHGIPKSIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNRTLGNLIRCL
        G DS++VVVDR+SKMAHF++CKKTS+A +IANLFF E+VRLHG+P+S+ SDRDVKF+SHFW+ LWK+  T L+ S+  HPQTDGQTEV NRTLGN++RCL
Subjt:  GFDSVLVVVDRYSKMAHFLACKKTSDASYIANLFFREVVRLHGIPKSIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNRTLGNLIRCL

Query:  SGDKPRQWDMTLAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTLDLTLIPSSVDLSLEAEEMATRIRDLHQQVHKHIEEQTIKYKAAADKKRRYKEFAVG
          D+P++W+  L++AEFA+N M NR+TGKSPF +VYTK P + LDL+++P     S  A        DL  +V +H+     +YK AAD +RR + F VG
Subjt:  SGDKPRQWDMTLAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTLDLTLIPSSVDLSLEAEEMATRIRDLHQQVHKHIEEQTIKYKAAADKKRRYKEFAVG

Query:  DLVMVHLRKNRFPAGTYSKLKEKKIGPLPILEKYGDNAYRVQLPATYNISSTFNIADIFPYNPPDES
        DLVMV LR+ RFP GTYSKL  +KIGP+PI  K  DNAY V LPA  N S+TFN++DI  Y P DE+
Subjt:  DLVMVHLRKNRFPAGTYSKLKEKKIGPLPILEKYGDNAYRVQLPATYNISSTFNIADIFPYNPPDES

A0A2U1P6A2 Transposon Ty3-I Gag-Pol polyprotein  e value: 4.1e-295  %identity: 40.89
Query:  HKRYKELKENPLFRRYQDWSESSSEDEKEPRVFYQQNPRYDHAERKEDARIPYHPYGHQEFPMYNHPEGRDDARMFYHKQESDFKMKVDIPTYNGKMEIE
        H  Y+    N  F R    ++S  +  +   V +         E  E+        G +E      P  R +     + +   +++K +IP + G ++IE
Subjt:  HKRYKELKENPLFRRYQDWSESSSEDEKEPRVFYQQNPRYDHAERKEDARIPYHPYGHQEFPMYNHPEGRDDARMFYHKQESDFKMKVDIPTYNGKMEIE

Query:  AFLEWIRHVENFFDYMNTPESKKVKLVALKLKGGAQAWWDQLEINRQRFGKRPIRKWERMKRLMRDRFLPANFEQILYTQYQNCRQGTRTVMEYTEEFHR
        A L+W+  V+ FFD M+ P+ ++VK+VA KL+GGA AWW + + NR+  G+RP+  W  MKR+++ RFLP + EQILY QY NC QG RTV EYT EF R
Subjt:  AFLEWIRHVENFFDYMNTPESKKVKLVALKLKGGAQAWWDQLEINRQRFGKRPIRKWERMKRLMRDRFLPANFEQILYTQYQNCRQGTRTVMEYTEEFHR

Query:  LGACTNLGENQQCLVARFKGGLRLDIQERISFQPIAFLNEAITAAATAEEQITNRFK--RQYTRRNVVESSTSNPRNTTVDKTTSPTTSVGAKGKQIDAG
        L A  NL E  +   AR+  GL   IQE++S   I  +++A   A  AE   T      R+ T  +         +N     +T+ +TS  +K       
Subjt:  LGACTNLGENQQCLVARFKGGLRLDIQERISFQPIAFLNEAITAAATAEEQITNRFK--RQYTRRNVVESSTSNPRNTTVDKTTSPTTSVGAKGKQIDAG

Query:  TSKDLVPKRNTNNYSRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVDYQAESDET-DSDEDIDYLEPDEG--DALSLVIQRLLLAPKIDQSYQRHALFK
         SK+  P + +N Y++P   KCFRC + GH SN CP+R TL  ++  +     DE+   ++D++Y EP +G  + ++ VIQR L +PK+  S QR+ +F+
Subjt:  TSKDLVPKRNTNNYSRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVDYQAESDET-DSDEDIDYLEPDEG--DALSLVIQRLLLAPKIDQSYQRHALFK

Query:  TRCTINGKICNVIIDSGSTENVVASKLVSTLNLPLTPHPAPYK-----------------------------------------------------AIHK
        T+C +  KIC++IID GS EN+V+  LV    LP  PHP PY+                                                     A H+
Subjt:  TRCTINGKICNVIIDSGSTENVVASKLVSTLNLPLTPHPAPYK-----------------------------------------------------AIHK

Query:  GRDNTYEFTWMGKKIVLLPI---NPSKQVGTTTNKKGQLFSISSGKKNFQEKQFPI---IGLVIKYFDNLDPNIAIPSEVQNLLQDFKEII--DTPIELP
        G+ N Y F W GK I +LP+   +P K+V   T     L ++ S  K FQ ++        LV+K  ++   N+ IP  ++ +L++F +++  DTP  LP
Subjt:  GRDNTYEFTWMGKKIVLLPI---NPSKQVGTTTNKKGQLFSISSGKKNFQEKQFPI---IGLVIKYFDNLDPNIAIPSEVQNLLQDFKEII--DTPIELP

Query:  PLRDIQHTIEFLPGATLPNLPHYRMSPNEYKILHEQVQELLEKGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINKITVEYRFPILRINDLLNQLSGA
        PLR+IQH I+ +PGA+LPNLPHYRMSP E  IL E+V+ELL KGHIQ S+SPCAVPALLTPKKDGSWRMCVDSRAINKITV YRFPI R++DLL+QLSGA
Subjt:  PLRDIQHTIEFLPGATLPNLPHYRMSPNEYKILHEQVQELLEKGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINKITVEYRFPILRINDLLNQLSGA

Query:  TIFPKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQ--------------------------------------------
         +F KIDL+SGYHQIRI+PGDEWKTAFKT +GL+EWLVMPFGLSNAPSTFMRLM Q                                            
Subjt:  TIFPKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQ--------------------------------------------

Query:  ----------------------------------------------------------------------------------------------EKIITS
                                                                                                      E++ T+
Subjt:  ----------------------------------------------------------------------------------------------EKIITS

Query:  PVLALPNFDLPFEVAVDASGIGVGAVLSQNNHPIEFFSEKLSPSRQS-----------------------------------------------------
        PVL+LPNFD  FE+  DA G G+GAVLSQ   P+ F SEKL+ +RQ                                                      
Subjt:  PVLALPNFDLPFEVAVDASGIGVGAVLSQNNHPIEFFSEKLSPSRQS-----------------------------------------------------

Query:  ----------------------------------PGEIIAFNSLPEDYHSDKDFGSIGKNCTEHKPTGDFHLVDGFLFKGNLLCIPFSSLREALLQEAHA
                                            +++ F S+   Y +D+DF S  +     +  G+F L+DG+LFKGN LCIP +SLR  L++E HA
Subjt:  ----------------------------------PGEIIAFNSLPEDYHSDKDFGSIGKNCTEHKPTGDFHLVDGFLFKGNLLCIPFSSLREALLQEAHA

Query:  GGLAGHLGRDKTLHMLEERYYWPQLKKDVQNYVSRCFTCQTCKGTSQNTGLYLPLPIPDNIWEDLSMDFILGLPRTQRGFDSVLVVVDRYSKMAHFLACK
        GGL+ HLGRDKT+  +E R+YWPQLK+DV ++V RC  CQ  KG +QNTGLY+PLP+P++ W D+SMDF+LGLPRTQRG DSV VVVDR+SKMAHF+ CK
Subjt:  GGLAGHLGRDKTLHMLEERYYWPQLKKDVQNYVSRCFTCQTCKGTSQNTGLYLPLPIPDNIWEDLSMDFILGLPRTQRGFDSVLVVVDRYSKMAHFLACK

Query:  KTSDASYIANLFFREVVRLHGIPKSIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNRTLGNLIRCLSGDKPRQWDMTLAQAEFAYNHM
        KTSDA++IA LFF+EVVRLHG+PKSI SDRD KFL+HFW TLW++  T+L +S+T+HPQTDGQTEV NRTLGN+IRCL G+KP+ WD++LAQAEFAYN  
Subjt:  KTSDASYIANLFFREVVRLHGIPKSIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNRTLGNLIRCLSGDKPRQWDMTLAQAEFAYNHM

Query:  KNRTTGKSPFELVYTKLPRLTLDLTLIPSSVDLSLEAEEMATRIRDLHQQVHKHIEEQTIKYKAAADKKRRYKEFAVGDLVMVHLRKNRFPAGTYSKLKE
         + +TG SPF++VY   PR  +DL  +P     +++A +M   ++  H+ V   I E   KYKAAADK RR K F VGD VMV LRK RFP GTYSKL+ 
Subjt:  KNRTTGKSPFELVYTKLPRLTLDLTLIPSSVDLSLEAEEMATRIRDLHQQVHKHIEEQTIKYKAAADKKRRYKEFAVGDLVMVHLRKNRFPAGTYSKLKE

Query:  KKIGPLPILEKYGDNAYRVQLPATYNISSTFNIADIFPYNPPD
        KK GP  IL K  DNAY V LP T +IS TFN++DI+ ++P D
Subjt:  KKIGPLPILEKYGDNAYRVQLPATYNISSTFNIADIFPYNPPD

A0A5B7BER3 Uncharacterized protein | e value: 0.0e+00 | % identity: 45.39
Query:  DFKMKVDIPTYNGKMEIEAFLEWIRHVENFFDYMNTPESKKVKLVALKLKGGAQAWWDQLEINRQRFGKRPIRKWERMKRLMRDRFLPANFEQILYTQYQ
        +++MK+D+P++NG + IE+FL+WI  VE FFD M   + K+VKLVA KLKGGA AWWDQ++ NR+R GK+P+R W++M+RL+R+RFLP ++EQ+LY QYQ
Subjt:  DFKMKVDIPTYNGKMEIEAFLEWIRHVENFFDYMNTPESKKVKLVALKLKGGAQAWWDQLEINRQRFGKRPIRKWERMKRLMRDRFLPANFEQILYTQYQ

Query:  NCRQGTRTVMEYTEEFHRLGACTNLGENQQCLVARFKGGLRLDIQERISFQPIAFLNEAITAAATAEEQITNRFKRQYTRRNVVESSTSNPRNTTVDKTT
        NCRQG R+V EY++EF+ L +  NL E +   VAR+ GGLR  IQ++++ + I  LNEA + A   E Q + +  R          S+ N +N   DK  
Subjt:  NCRQGTRTVMEYTEEFHRLGACTNLGENQQCLVARFKGGLRLDIQERISFQPIAFLNEAITAAATAEEQITNRFKRQYTRRNVVESSTSNPRNTTVDKTT

Query:  SPTTSVGAKGKQID-AGTSKD----LVP-KRNTNNYSRPTLGKCFRCNQTGHLSNECPQRKTLAIA-----DNVDYQAESDETDSDE--DIDYLEPDEGD
                K    D A +SK+    + P +++TN Y+RP  GKCFRC Q GH SNECP R+ + +      ++ D++ E +    DE    +  E DEG+
Subjt:  SPTTSVGAKGKQID-AGTSKD----LVP-KRNTNNYSRPTLGKCFRCNQTGHLSNECPQRKTLAIA-----DNVDYQAESDETDSDE--DIDYLEPDEGD

Query:  ALSLVIQRLLLAPKIDQSYQRHALFKTRCTINGKICNVIIDSGSTENVVASKLVSTLNLPLTPHPAPYK-------------------------------
         +S V+QRLLL PK +   QRH +F+TRCTIN K+C+VIIDSGS+EN+V+  LV  L L    HP PYK                               
Subjt:  ALSLVIQRLLLAPKIDQSYQRHALFKTRCTINGKICNVIIDSGSTENVVASKLVSTLNLPLTPHPAPYK-------------------------------

Query:  ----------------------AIHKGRDNTYEFTWMGKKIVLLPINPSKQVGTTTNKKGQLFSISSGKKNFQE--KQFPIIGLVIKYFDNLDPNIAIPS
                              A HKG+DNTY F W  KK+VL+P      +  T+  +G+     +G +  ++  +   II +++K     +P   +P 
Subjt:  ----------------------AIHKGRDNTYEFTWMGKKIVLLPINPSKQVGTTTNKKGQLFSISSGKKNFQE--KQFPIIGLVIKYFDNLDPNIAIPS

Query:  EVQNLLQDFKEII--DTPIELPPLRDIQHTIEFLPGATLPNLPHYRMSPNEYKILHEQVQELLEKGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINK
         +Q LL +F++I   + P  LPP+RDIQH I+ +PGA+LPNLPHYRMSP E +IL +QV++L+ KG IQ S+SPCAVPALLTPKKDGSWRMCVDSRAINK
Subjt:  EVQNLLQDFKEII--DTPIELPPLRDIQHTIEFLPGATLPNLPHYRMSPNEYKILHEQVQELLEKGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINK

Query:  ITVEYRFPILRINDLLNQLSGATIFPKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQ----------------------
        ITV+YRFPI R+ND+L+ L G+ IF KIDL+SGYHQIRIRPGDEWKTAFKT EGL+EWLVMPFGLSNAPSTFMR+MNQ                      
Subjt:  ITVEYRFPILRINDLLNQLSGATIFPKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQ----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------EKIITSPVLALPNFDLPFEVAVDASGIGVGAVLSQNNHPIEFFSEKLSPSRQS-------------------------------
                        EK+ T+PVLALP+F+  F+V  DAS  G+GAVLSQ   P+EFFSEKL+ +RQ                                
Subjt:  ----------------EKIITSPVLALPNFDLPFEVAVDASGIGVGAVLSQNNHPIEFFSEKLSPSRQS-------------------------------

Query:  --------------------------------------------------------PGEIIAFNSLPEDYHSDKDFGSIGKNCTEHKPTGDFHLVDGFLF
                                                                  EI +F SL E Y  D+DF      C   + + +FH+ DG+LF
Subjt:  --------------------------------------------------------PGEIIAFNSLPEDYHSDKDFGSIGKNCTEHKPTGDFHLVDGFLF

Query:  KGNLLCIPFSSLREALLQEAHAGGLAGHLGRDKTLHMLEERYYWPQLKKDVQNYVSRCFTCQTCKGTSQNTGLYLPLPIPDNIWEDLSMDFILGLPRTQR
        KGN LCIP +SLRE +L++ H+GGL GHLGRDKT+ ++EERYYWPQLK+DV  +V +C  CQT KG +QNTGLY PLP+P++IWEDL+MDFILGLPRTQR
Subjt:  KGNLLCIPFSSLREALLQEAHAGGLAGHLGRDKTLHMLEERYYWPQLKKDVQNYVSRCFTCQTCKGTSQNTGLYLPLPIPDNIWEDLSMDFILGLPRTQR

Query:  GFDSVLVVVDRYSKMAHFLACKKTSDASYIANLFFREVVRLHGIPKSIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNRTLGNLIRCL
        G DSV VVVDR+SKMAHF+ CKKTSDAS++ANLFFRE+VRLHG+PKSI SDRDVKFLSHFW+TLW+KFDT+L+YS+T+HPQTDGQTEVTNRTLGNLIRC 
Subjt:  GFDSVLVVVDRYSKMAHFLACKKTSDASYIANLFFREVVRLHGIPKSIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNRTLGNLIRCL

Query:  SGDKPRQWDMTLAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTLDLTLIPSSVDLSLEAEEMATRIRDLHQQVHKHIEEQTIKYKAAADKKRRYKEFAVG
        SGD+P+QWD+ L Q EFAYN M NR+T K+PFE+VYTK P+  LDL  +P     S+ AE  A R   + ++V +++E+    YKAAADK RR K F  G
Subjt:  SGDKPRQWDMTLAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTLDLTLIPSSVDLSLEAEEMATRIRDLHQQVHKHIEEQTIKYKAAADKKRRYKEFAVG

Query:  DLVMVHLRKNRFPAGTYSKLKEKKIGPLPILEKYGDNAYRVQLPATYNISSTFNIADIFPYNPPDE
        DLVMV LRKNRFP GTY+KLK +K GP  +  K  DNAY V+LP    ISSTFN+AD+F Y+PPDE
Subjt:  DLVMVHLRKNRFPAGTYSKLKEKKIGPLPILEKYGDNAYRVQLPATYNISSTFNIADIFPYNPPDE

A0A5C7HZV4 Reverse transcriptase | e value: 9.0e-303 | % identity: 42.81
Query:  HKQESDFKMKVDIPTYNGKMEIEAFLEWIRHVENFFDYMNTPESKKVKLVALKLKGGAQAWWDQLEINRQRFGKRPIRKWERMKRLMRDRFLPANFEQIL
        ++ + D++++ DIP + GK++IE FL+WI  VE FF++    E ++VKLVA KL+ GA  WW++L+++R+R GK PIR W RMK+LM +RFLP ++EQ +
Subjt:  HKQESDFKMKVDIPTYNGKMEIEAFLEWIRHVENFFDYMNTPESKKVKLVALKLKGGAQAWWDQLEINRQRFGKRPIRKWERMKRLMRDRFLPANFEQIL

Query:  YTQYQNCRQGTRTVMEYTEEFHRLGACTNLGENQQCLVARFKGGLRLDIQERISFQPIAFLNEAITAAATAE--EQITNRFKRQYTRRNVVESSTS----
        +  YQNC QG +TV +YTEE+ RL    NL E +   V+R+ GGL+  I+++I  Q +  ++EA   A  AE  E+ +NRF   + ++++ ESS +    
Subjt:  YTQYQNCRQGTRTVMEYTEEFHRLGACTNLGENQQCLVARFKGGLRLDIQERISFQPIAFLNEAITAAATAE--EQITNRFKRQYTRRNVVESSTS----

Query:  ---------NPRNTTVDKTTSPTTSVGAKGKQIDAGTSKDLVPKRNTNNYSRPTLGKCFRCNQTGHLSNECPQRKTL----AIADNVDYQAESDETDSDE
                      T D+    +      G +  A  +    P+   N YSRP + KC+RC Q GH SNECP RK +    A  +  D Q   +E + DE
Subjt:  ---------NPRNTTVDKTTSPTTSVGAKGKQIDAGTSKDLVPKRNTNNYSRPTLGKCFRCNQTGHLSNECPQRKTL----AIADNVDYQAESDETDSDE

Query:  DID--YLEPDEGDALSLVIQRLLLAPKIDQSYQRHALFKTRCTINGKICNVIIDSGSTENVVASKLVSTLNLPLTPHPAPY-------------------
         ++   +  ++G+ ++ VIQR++ A K+D   QR+++FKT C+I GK+C++I+DSGS EN V+ KL+  L L    HP PY                   
Subjt:  DID--YLEPDEGDALSLVIQRLLLAPKIDQSYQRHALFKTRCTINGKICNVIIDSGSTENVVASKLVSTLNLPLTPHPAPY-------------------

Query:  ----------------------------------KAIHKGRDNTYEFTWMGKKIVLLPINPSKQVGT--TTNKKGQLFSISSGK------KNFQEKQFPI
                                             ++GRDN   F+W GK+I ++P   S    T  T N++  +  I+S        K  QE    +
Subjt:  ----------------------------------KAIHKGRDNTYEFTWMGKKIVLLPINPSKQVGT--TTNKKGQLFSISSGK------KNFQEKQFPI

Query:  I-GLVIKYFDNLDPNIAIPSEVQNLLQDFKEII--DTPIELPPLRDIQHTIEFLPGATLPNLPHYRMSPNEYKILHEQVQELLEKGHIQPSLSPCAVPAL
        +  LVI+     +    +P +V  LL +F +++  D P +LPPLRDIQH I+ +PGA+LPNLPHYRMSP E +IL  +V+EL++KG I+ S+SPCAVP L
Subjt:  I-GLVIKYFDNLDPNIAIPSEVQNLLQDFKEII--DTPIELPPLRDIQHTIEFLPGATLPNLPHYRMSPNEYKILHEQVQELLEKGHIQPSLSPCAVPAL

Query:  LTPKKDGSWRMCVDSRAINKITVEYRFPILRINDLLNQLSGATIFPKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQ--
        L PKKDGSWRMCVDSRAINKITV Y FPI R++D+L+ L G+ +F KIDL+SGYHQIR++PGDEWKTAFKT +GL+EWLVMPFGLSNAPSTFMRLMNQ  
Subjt:  LTPKKDGSWRMCVDSRAINKITVEYRFPILRINDLLNQLSGATIFPKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQ--

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----EKIITSPVLALPNFDLPFEVAVDASGIGVGAVLSQNNHPIEFFSEKLSPSRQS------------------------------------------
             EK+ T+PVLALP+F+  FEV  DASG+G+GAVLSQ   P+ FFSEKLS +R+                                           
Subjt:  -----EKIITSPVLALPNFDLPFEVAVDASGIGVGAVLSQNNHPIEFFSEKLSPSRQS------------------------------------------

Query:  ---------------------------------------------PGEIIAFNSLPEDYHSDKDFGSIGKNCTEHKPTGDFHLVDGFLFKGNLLCIPFSS
                                                       EII F  L E Y  D+DFG + + C   +  G+FH+ +G+LF GN LCIP SS
Subjt:  ---------------------------------------------PGEIIAFNSLPEDYHSDKDFGSIGKNCTEHKPTGDFHLVDGFLFKGNLLCIPFSS

Query:  LREALLQEAHAGGLAGHLGRDKTLHMLEERYYWPQLKKDVQNYVSRCFTCQTCKGTSQNTGLYLPLPIPDNIWEDLSMDFILGLPRTQRGFDSVLVVVDR
        LRE L++E H GGL GHLGRDKT+  + ERYYWPQLK+DV N+V +C+  QT KG +QNTGLY+PLP+P+ IWEDL+MDF+LGLPRTQRG DSV VVVDR
Subjt:  LREALLQEAHAGGLAGHLGRDKTLHMLEERYYWPQLKKDVQNYVSRCFTCQTCKGTSQNTGLYLPLPIPDNIWEDLSMDFILGLPRTQRGFDSVLVVVDR

Query:  YSKMAHFLACKKTSDASYIANLFFREVVRLHGIPKSIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNRTLGNLIRCLSGDKPRQWDMT
        +SKM HF+ C+KTSDAS++A LFFREVVRLHG+P+SI SDRD KFLSHFW TLW++ DT LK+S+T+HPQTDGQTE  NRTLGNLIR + GDKP+QWD+ 
Subjt:  YSKMAHFLACKKTSDASYIANLFFREVVRLHGIPKSIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNRTLGNLIRCLSGDKPRQWDMT

Query:  LAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTLDLTLIPSSVDLSLEAEEMATRIRDLHQQVHKHIEEQTIKYKAAADKKRRYKEFAVGDLVMVHLRKNR
        LAQAEFAYN+  +  TGKSPF LVY + P+  LDL  +P    +S+ AE MA ++RD+  +V   +EE+  KYKAAAD KRR K FA GD VMV LRK R
Subjt:  LAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTLDLTLIPSSVDLSLEAEEMATRIRDLHQQVHKHIEEQTIKYKAAADKKRRYKEFAVGDLVMVHLRKNR

Query:  FPAGTYSKLKEKKIGPLPILEKYGDNAYRVQLPATYNISSTFNIADIFPY
        FP G+Y+KLK +K GP  +++K  +NAY + LPA  NIS TFN+AD++ +
Subjt:  FPAGTYSKLKEKKIGPLPILEKYGDNAYRVQLPATYNISSTFNIADIFPY

A0A6N2LVR1 Uncharacterized protein | e value: 0.0e+00 | % identity: 44.48
Query:  YHK----QESDFKMKVDIPTYNGKMEIEAFLEWIRHVENFFDYMNTPESKKVKLVALKLKGGAQAWWDQLEINRQRFGKRPIRKWERMKRLMRDRFLPAN
        YHK    ++  F+MK+D+P++NG+++IE FL+W+  VE FFDYM  PE KKVKLVA +L GGA AWW+QL++ R R  K  ++ W +M+RL+R R+LP +
Subjt:  YHK----QESDFKMKVDIPTYNGKMEIEAFLEWIRHVENFFDYMNTPESKKVKLVALKLKGGAQAWWDQLEINRQRFGKRPIRKWERMKRLMRDRFLPAN

Query:  FEQILYTQYQNCRQGTRTVMEYTEEFHRLGACTNLGENQQCLVARFKGGLRLDIQERISFQPIAFLNEAITAAATAEEQITNRFKRQYTRRNVVESSTSN
        +EQIL+ QYQNC+QG R V  Y EEFHRL +  NL E     VARF GGLR +IQ+R+S   I  L EAI  A  AE Q+T          +  ++S   
Subjt:  FEQILYTQYQNCRQGTRTVMEYTEEFHRLGACTNLGENQQCLVARFKGGLRLDIQERISFQPIAFLNEAITAAATAEEQITNRFKRQYTRRNVVESSTSN

Query:  ---PRNTTVDKTTSPTTSVGAKGKQIDAGT-SKDLVPKRNTNNYSRPTLGKCFRCNQTGHLSNECPQRKTL-AIADNVDYQAESDETDSDEDIDYLEP--
           P N   +  + PT    +   Q+   T +  + P+   N YSRPT  KC+RC Q GH SN CP+R  +  I    +   E +  ++++D  Y E   
Subjt:  ---PRNTTVDKTTSPTTSVGAKGKQIDAGT-SKDLVPKRNTNNYSRPTLGKCFRCNQTGHLSNECPQRKTL-AIADNVDYQAESDETDSDEDIDYLEP--

Query:  ---DEGDALS--LVIQRLLLAPKIDQSYQRHALFKTRCTINGKICNVIIDSGSTENVVASKLVSTLNLPLTPHPAPYK----------------------
           DEG+ LS  LV++R++LAPK++   QR+ +F+TRCT+N K+C+VIIDSGS+EN+++  +V+ L L    H APYK                      
Subjt:  ---DEGDALS--LVIQRLLLAPKIDQSYQRHALFKTRCTINGKICNVIIDSGSTENVVASKLVSTLNLPLTPHPAPYK----------------------

Query:  -------------------------------AIHKGRDNTYEFTWMGKKIVLLPINPS-KQVGTTTNKKGQLFSISSGKKNFQEKQFPIIGLVI-KYFDN
                                         +KG+DN Y F   G+K++L P+      VG    ++  L           +K   +  +++    + 
Subjt:  -------------------------------AIHKGRDNTYEFTWMGKKIVLLPINPS-KQVGTTTNKKGQLFSISSGKKNFQEKQFPIIGLVI-KYFDN

Query:  LDPNIAIPSEVQNLLQDFKEII--DTPIELPPLRDIQHTIEFLPGATLPNLPHYRMSPNEYKILHEQVQELLEKGHIQPSLSPCAVPALLTPKKDGSWRM
          PN  IP  +Q LL +F  II  + P  LPP+RDIQH I+ +PGA+LPN PHYRMSP E  IL  QV+EL++KG +Q S+SPCAVPALL PKKDGSWRM
Subjt:  LDPNIAIPSEVQNLLQDFKEII--DTPIELPPLRDIQHTIEFLPGATLPNLPHYRMSPNEYKILHEQVQELLEKGHIQPSLSPCAVPALLTPKKDGSWRM

Query:  CVDSRAINKITVEYRFPILRINDLLNQLSGATIFPKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQ-------------
        C+DSRAINKIT++YRFPI R+ D+L+ L+G+ IF KIDL+SGYHQIRIRPGDEWKTAFKT EGL+EWLVMPFGLSNAPSTFMRLMNQ             
Subjt:  CVDSRAINKITVEYRFPILRINDLLNQLSGATIFPKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQ-------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------EKIITSPVLALPNFDLPFEVAVDASGIGVGAVLSQNNHPIEFFSEKLSPSRQS----------------------
                                 EK+ ++PVLALP+FD  FEV  DAS IG+GAVLSQ N P+ F+SEKLS +R+                       
Subjt:  -------------------------EKIITSPVLALPNFDLPFEVAVDASGIGVGAVLSQNNHPIEFFSEKLSPSRQS----------------------

Query:  -----------------------------------------------------------------PGEIIAFNSLPEDYHSDKDFGSIGKNCTEHKPTGD
                                                                           E+I F  + + Y  D+DFG+    C +      
Subjt:  -----------------------------------------------------------------PGEIIAFNSLPEDYHSDKDFGSIGKNCTEHKPTGD

Query:  FHLVDGFLFKGNLLCIPFSSLREALLQEAHAGGLAGHLGRDKTLHMLEERYYWPQLKKDVQNYVSRCFTCQTCKGTSQNTGLYLPLPIPDNIWEDLSMDF
         H  DG+LF+GN LCIP SSLRE ++ E H GGL GHLGRDKT+ + EERYYWPQLK+D+ N+V RC TCQ  KG +QNTGLYLPLPIP   WEDLSMDF
Subjt:  FHLVDGFLFKGNLLCIPFSSLREALLQEAHAGGLAGHLGRDKTLHMLEERYYWPQLKKDVQNYVSRCFTCQTCKGTSQNTGLYLPLPIPDNIWEDLSMDF

Query:  ILGLPRTQRGFDSVLVVVDRYSKMAHFLACKKTSDASYIANLFFREVVRLHGIPKSIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNR
        ILGLPRTQRG DSV VVVDR+SKMAHF+ACKKTSDA ++ANLFF+EVVRLHG+PKSI SDRD KFLSHFW+TLW++FDTTL +S+TSHPQTDGQTEV NR
Subjt:  ILGLPRTQRGFDSVLVVVDRYSKMAHFLACKKTSDASYIANLFFREVVRLHGIPKSIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNR

Query:  TLGNLIRCLSGDKPRQWDMTLAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTLDLTLIPSSVDLSLEAEEMATRIRDLHQQVHKHIEEQTIKYKAAADKK
        TLGNLIRCLSG++P+QWD+TLAQAEFAYN M NR+TGK+PF++VY + P+  LDL  +P    +++ AE MA R+R + ++V K++E    KYKAAADKK
Subjt:  TLGNLIRCLSGDKPRQWDMTLAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTLDLTLIPSSVDLSLEAEEMATRIRDLHQQVHKHIEEQTIKYKAAADKK

Query:  RRYKEFAVGDLVMVHLRKNRFPAGTYSKLKEKKIGPLPILEKYGDNAYRVQLPATYNISSTFNIADIFPYNPPDES
        RR K F  GDLVMV+LRK R P GT  KL +KK GP  IL+K  DNAYRV LPA   IS TFN+AD+F Y+PPDE+
Subjt:  RRYKEFAVGDLVMVHLRKNRFPAGTYSKLKEKKIGPLPILEKYGDNAYRVQLPATYNISSTFNIADIFPYNPPDES

SwissProt top hits | e value | % identity | Alignment
P38694 Putative aldehyde dehydrogenase-like protein YHR039C | e value: 9.0e-82 | % identity: 40.28
Query:  HKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNGIVVKVSEHASWSGCFYIRIIQAALAAIGAPESLVDVI-----TGFAETGEALVS--SVDK
        +K A + + PLGVI +IV WNYPFHN+  P++AA+F GN IVVK SE   WS  F++ +I+  L A      LV +      T   ++     S      
Subjt:  HKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNGIVVKVSEHASWSGCFYIRIIQAALAAIGAPESLVDVI-----TGFAETGEALVS--SVDK

Query:  MIFVGSTGVGRMIMKTAAETLIPVTLELGGKDAFIVCEDV-DLDHVVNVALRASFTSSGHNCTGAERYYVHKNIYSSFVDKISERVKAITVGPPLAG---
        + F+GS  V   I+K AA++L PV +ELGGKDAFIV +   +LD + ++ +R +F SSG NC G ER  V K  Y   V  +++R   +T  P   G   
Subjt:  MIFVGSTGVGRMIMKTAAETLIPVTLELGGKDAFIVCEDV-DLDHVVNVALRASFTSSGHNCTGAERYYVHKNIYSSFVDKISERVKAITVGPPLAG---

Query:  ----KYDMGAICTQEQSEKLQSLVNDALDRGAKIVARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSRFGLGC
              DMGA+ +  + ++L++LV DA+ +GA+++  G+    P+     YF PT++VDV   MK+ Q E FGPIL +MK    D  V+LAN + FGLG 
Subjt:  ----KYDMGAICTQEQSEKLQSLVNDALDRGAKIVARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSRFGLGC

Query:  AVFSGSKQRAKNIASQIRSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGLRACCLVKAVVEDRWWPYIQTKHPKPLTYPVADN--AFEFQMSLVE
        +VF    +    +A+ +++G+VAINDFAT Y+CQ LPFGG+  SG+G+F G EGL   C  K+V  D   P++ T+ PKPL YP+ +N  A+ F  S + 
Subjt:  AVFSGSKQRAKNIASQIRSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGLRACCLVKAVVEDRWWPYIQTKHPKPLTYPVADN--AFEFQMSLVE

Query:  ATYGLNIWDRLKALVNVLKMLS
          Y  + W R+K+L ++ K  S
Subjt:  ATYGLNIWDRLKALVNVLKMLS

P38694 Putative aldehyde dehydrogenase-like protein YHR039C | e value: 5.7e-20 | % identity: 45.28
Query:  VQCYEPATMKYLGFFPALSRDEVKERVASARKAQKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSRDTGKTIVDASMGEIMATCEKITWLLSEGEKW
        +QC+ PAT +YLG FP+ +  ++ E V+ A KAQ  W  S F +R  +L  L  YI+ +Q+LI  ++ RD+GKT++DASMGEI+ T EKI W +  G++ 
Subjt:  VQCYEPATMKYLGFFPALSRDEVKERVASARKAQKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSRDTGKTIVDASMGEIMATCEKITWLLSEGEKW

Query:  LKPESR
        L+P  R
Subjt:  LKPESR

Q0WSF1 Aldehyde dehydrogenase 22A1 | e value: 6.0e-195 | % identity: 75.95
Query:  PPDESCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNGIVVKVSEHASWSGCFYIRIIQAALAAIGAPESLVDVITGFAETGEALVS
        P   S GRA LHK +RVEFHPLGVIGAIVPWNYPFHNIFNP+LAAVF+GNGIV+KVSEHASWSGCFY RIIQAALAA+GAPE+LVDVITGFAETGEALVS
Subjt:  PPDESCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNGIVVKVSEHASWSGCFYIRIIQAALAAIGAPESLVDVITGFAETGEALVS

Query:  SVDKMIFVGSTGVGRMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASFTSSGHNCTGAERYYVHKNIYSSFVDKISERVKAITVGPPLAG
        SVDKMIFVGST VG+MIM+ AAETL PVTLELGGKDAFI+CED D+ HV  VA+R +  SSG NC GAER+YVHK+IY++F+ ++++ VK+++ GPPL G
Subjt:  SVDKMIFVGSTGVGRMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASFTSSGHNCTGAERYYVHKNIYSSFVDKISERVKAITVGPPLAG

Query:  KYDMGAICTQEQSEKLQSLVNDALDRGAKIVARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSRFGLGCAVFS
        +YDMGAIC QE SE LQSLVNDALD+GA+I  RG+FGHL E AVDQYFPPTV+++VNH MK+M+EEAFGPI+PIM+FSTD+E +KLANDSR+ LGCAVFS
Subjt:  KYDMGAICTQEQSEKLQSLVNDALDRGAKIVARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSRFGLGCAVFS

Query:  GSKQRAKNIASQIRSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGLRACCLVKAVVEDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLN
        GSK RAK IASQI+ G  AINDFA+NY+CQSLPFGGVK+SGFGRFAG+EGLRACCLVK+VVEDR+WP I+TK PKP+ YPVA+NAFEFQ +LVE  YGLN
Subjt:  GSKQRAKNIASQIRSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGLRACCLVKAVVEDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLN

Query:  IWDRLKALVNVLKMLSEQNS
        IWDRL++L++VLK L++Q+S
Subjt:  IWDRLKALVNVLKMLSEQNS

Q0WSF1 Aldehyde dehydrogenase 22A1 | e value: 9.0e-74 | % identity: 80.12
Query:  MAFWWTLLVLGFAYAICRFLLMLIPPNVPSIEVDASDVTDDGNQTQENSYIYIPPRVKTQQQDKKVQCYEPATMKYLGFFPALSRDEVKERVASARKAQK
        M FWW L+VL FAYAIC+FLLMLIPPNVPSI+VDASDV   G  T+ENS+IYIPPR ++QQ DKKVQCYEPATMKYLG+FPALS  EV+ERV  +RKAQK
Subjt:  MAFWWTLLVLGFAYAICRFLLMLIPPNVPSIEVDASDVTDDGNQTQENSYIYIPPRVKTQQQDKKVQCYEPATMKYLGFFPALSRDEVKERVASARKAQK

Query:  EWAKSSFKQRRLLLRILLKYIIEHQELICEISSRDTGKTIVDASMGEIMATCEKITWLLSEGEKWLKPESR
         WA+SSFK RR  LRILLKYIIEHQELICE+SSRDTGKT+VDAS+GEIM TCEKITWLLSEGE+WLKPESR
Subjt:  EWAKSSFKQRRLLLRILLKYIIEHQELICEISSRDTGKTIVDASMGEIMATCEKITWLLSEGEKWLKPESR

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein | e value: 2.4e-82 | % identity: 27.89
Query:  LLQDFKEIIDTPIELPPLR------DIQHTIEFLPGATLPNLPHYRMSPNEYKILHEQVQELLEKGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINK
        L Q ++EII    +LPP         ++H IE  PGA LP L  Y ++    + +++ VQ+LL+   I PS SPC+ P +L PKKDG++R+CVD R +NK
Subjt:  LLQDFKEIIDTPIELPPLR------DIQHTIEFLPGATLPNLPHYRMSPNEYKILHEQVQELLEKGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINK

Query:  ITVEYRFPILRINDLLNQLSGATIFPKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLM------------------------
         T+   FP+ RI++LL+++  A IF  +DL SGYHQI + P D +KTAF T  G +E+ VMPFGL NAPSTF R M                        
Subjt:  ITVEYRFPILRINDLLNQLSGATIFPKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLM------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---NQEKII--------TSPVLALPNFDLPFEVAVDASGIGVGAVL----------------------SQNNHP--------------------------
            Q+K I         SPVL   N    + +  DAS  G+GAVL                      +Q N+P                          
Subjt:  ---NQEKII--------TSPVLALPNFDLPFEVAVDASGIGVGAVL----------------------SQNNHP--------------------------

Query:  -------IEFFSEKLSPSRQ-------------------SPGEIIA-------FNSLPED------------YHSDKDFGSI---GKNCTEHKPTGD---
               +     K  P+R+                    P  ++A       +   PE             Y SD    ++    K  T+H  T +   
Subjt:  -------IEFFSEKLSPSRQ-------------------SPGEIIA-------FNSLPED------------YHSDKDFGSI---GKNCTEHKPTGD---

Query:  ------------------FHLVDGFLFKGNLLCIPFSSLREALLQEAHAGGL-AGHLGRDKTLHMLEERYYWPQLKKDVQNYVSRCFTCQTCKGTSQNT-
                          + L D  ++  + L +P    + A+++  H   L  GH G   TL  +   YYWP+L+  +  Y+  C  CQ  K       
Subjt:  ------------------FHLVDGFLFKGNLLCIPFSSLREALLQEAHAGGL-AGHLGRDKTLHMLEERYYWPQLKKDVQNYVSRCFTCQTCKGTSQNT-

Query:  GLYLPLPIPDNIWEDLSMDFILGLPRTQRGFDSVLVVVDRYSKMAHFLACKKTSDASYIANLFFREVVRLHGIPKSIVSDRDVKFLSHFWKTLWKKFDTT
        GL  PLPI +  W D+SMDF+ GLP T    + +LVVVDR+SK AHF+A +KT DA+ + +L FR +   HG P++I SDRDV+  +  ++ L K+    
Subjt:  GLYLPLPIPDNIWEDLSMDFILGLPRTQRGFDSVLVVVDRYSKMAHFLACKKTSDASYIANLFFREVVRLHGIPKSIVSDRDVKFLSHFWKTLWKKFDTT

Query:  LKYSTTSHPQTDGQTEVTNRTLGNLIRCLSGDKPRQWDMTLAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTLDLTLIPSSVDL---SLEAEEMATRIRD
           S+ +HPQTDGQ+E T +TL  L+R       + W + L Q EF YN    RT GKSPFE+    LP    +   I S  ++   S  A E+A  ++ 
Subjt:  LKYSTTSHPQTDGQTEVTNRTLGNLIRCLSGDKPRQWDMTLAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTLDLTLIPSSVDL---SLEAEEMATRIRD

Query:  LHQQVHKHIEEQTIKYKAAADKKRRYKEFAVGDLVMVHLRKNRFPAGTYSKLKEKKIGPLPILEKYGDNAYRVQL
        L  Q  + +E   I+ +   +++R+     +GD V+VH R   F  G Y K+++  +GP  +++K  DNAY + L
Subjt:  LHQQVHKHIEEQTIKYKAAADKKRRYKEFAVGDLVMVHLRKNRFPAGTYSKLKEKKIGPLPILEKYGDNAYRVQL

Q99315 Transposon Ty3-G Gag-Pol polyprotein | e value: 1.3e-83 | % identity: 27.55
Query:  LLQDFKEIIDTPIELPPLR------DIQHTIEFLPGATLPNLPHYRMSPNEYKILHEQVQELLEKGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINK
        L Q ++EII    +LPP         ++H IE  PGA LP L  Y ++    + +++ VQ+LL+   I PS SPC+ P +L PKKDG++R+CVD R +NK
Subjt:  LLQDFKEIIDTPIELPPLR------DIQHTIEFLPGATLPNLPHYRMSPNEYKILHEQVQELLEKGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINK

Query:  ITVEYRFPILRINDLLNQLSGATIFPKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLM------------------------
         T+   FP+ RI++LL+++  A IF  +DL SGYHQI + P D +KTAF T  G +E+ VMPFGL NAPSTF R M                        
Subjt:  ITVEYRFPILRINDLLNQLSGATIFPKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLM------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---NQEKII--------TSPVLALPNFDLPFEVAVDASGIGVGAVL----------------------SQNNHP--------------------------
            Q+K I         SPVL   N    + +  DAS  G+GAVL                      +Q N+P                          
Subjt:  ---NQEKII--------TSPVLALPNFDLPFEVAVDASGIGVGAVL----------------------SQNNHP--------------------------

Query:  -------IEFFSEKLSPSRQ-------------------SPGEIIA-------FNSLPED------------YHSDKDFGSI---GKNCTEHKPTGD---
               +     K  P+R+                    P  ++A       +   PE             Y SD    ++    K  T+H  T +   
Subjt:  -------IEFFSEKLSPSRQ-------------------SPGEIIA-------FNSLPED------------YHSDKDFGSI---GKNCTEHKPTGD---

Query:  ------------------FHLVDGFLFKGNLLCIPFSSLREALLQEAHAGGL-AGHLGRDKTLHMLEERYYWPQLKKDVQNYVSRCFTCQTCKGTSQNT-
                          + L D  ++  + L +P    + A+++  H   L  GH G   TL  +   YYWP+L+  +  Y+  C  CQ  K       
Subjt:  ------------------FHLVDGFLFKGNLLCIPFSSLREALLQEAHAGGL-AGHLGRDKTLHMLEERYYWPQLKKDVQNYVSRCFTCQTCKGTSQNT-

Query:  GLYLPLPIPDNIWEDLSMDFILGLPRTQRGFDSVLVVVDRYSKMAHFLACKKTSDASYIANLFFREVVRLHGIPKSIVSDRDVKFLSHFWKTLWKKFDTT
        GL  PLPI +  W D+SMDF+ GLP T    + +LVVVDR+SK AHF+A +KT DA+ + +L FR +   HG P++I SDRDV+  +  ++ L K+    
Subjt:  GLYLPLPIPDNIWEDLSMDFILGLPRTQRGFDSVLVVVDRYSKMAHFLACKKTSDASYIANLFFREVVRLHGIPKSIVSDRDVKFLSHFWKTLWKKFDTT

Query:  LKYSTTSHPQTDGQTEVTNRTLGNLIRCLSGDKPRQWDMTLAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTLDLTLIPSSVDL---SLEAEEMATRIRD
           S+ +HPQTDGQ+E T +TL  L+R  +    + W + L Q EF YN    RT GKSPFE+    LP    +   I S  ++   S  A E+A  ++ 
Subjt:  LKYSTTSHPQTDGQTEVTNRTLGNLIRCLSGDKPRQWDMTLAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTLDLTLIPSSVDL---SLEAEEMATRIRD

Query:  LHQQVHKHIEEQTIKYKAAADKKRRYKEFAVGDLVMVHLRKNRFPAGTYSKLKEKKIGPLPILEKYGDNAYRVQLPATYNISSTFNI---------ADIF
        L  Q  + +E   I+ +   +++R+     +GD V+VH R   F  G Y K+++  +GP  +++K  DNAY + L +        N+          D +
Subjt:  LHQQVHKHIEEQTIKYKAAADKKRRYKEFAVGDLVMVHLRKNRFPAGTYSKLKEKKIGPLPILEKYGDNAYRVQLPATYNISSTFNI---------ADIF

Query:  PYNPPDESCGR
        P N P  S  R
Subjt:  PYNPPDESCGR

Q9P7K9 Putative aldehyde dehydrogenase-like protein C21C3 | e value: 3.0e-77 | % identity: 37.91
Query:  TLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNGIVVKVSEHASWSGCFYIRIIQAALAAIGAPESLVDVITGFAETGEALV--SSVDKMIF
        T +K   V++ PLGVI A+V WNYP HN   P+++A+FAGN IVVK SE  +WS   Y  ++++ L ++G    LV  IT   +  + L   S +  + F
Subjt:  TLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNGIVVKVSEHASWSGCFYIRIIQAALAAIGAPESLVDVITGFAETGEALV--SSVDKMIF

Query:  VGSTGVGRMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASFTSSGHNCTGAERYYVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAI
        +GS  + +++  +AA+ L P+ LELGGKD  I+ +D  L+ ++++ +R  F S+G NC G ER      +Y + + K+  R+  + +G       DMGA+
Subjt:  VGSTGVGRMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASFTSSGHNCTGAERYYVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAI

Query:  CTQEQSEKLQSLVNDALDRGAKIVARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSRFGLGCAVFSGSKQRAK
         +  + + L+SL+ DA+ +GA++V  G     P+     YF PT++VD  + MK+ QEE F PI  + +  + + A+++AN + FGLG +VF   KQ  +
Subjt:  CTQEQSEKLQSLVNDALDRGAKIVARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSRFGLGCAVFSGSKQRAK

Query:  NIASQIRSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGLRACCLVKAVVEDRWWPYIQTKHPKPLTYPVADN--AFEFQMSLVEATYG
             + +G VA+NDF   YL Q +PFGG K+SG+GRFAG EGLR  C  KA+  DR +  I T  P  + YP+ D+  A++F   L+   YG
Subjt:  NIASQIRSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGLRACCLVKAVVEDRWWPYIQTKHPKPLTYPVADN--AFEFQMSLVEATYG

Q9P7K9 Putative aldehyde dehydrogenase-like protein C21C3 | e value: 1.0e-16 | % identity: 42.31
Query:  CYEPATMKYLGFFPALSRDEVKERVASARKAQKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSRDTGKTIVDASMGEIMATCEKITWLLSEGEKWLK
        CY P     LG     ++ ++ + +  A +AQKEW  +SF +RR  L+ L + II +Q+   EI+ +DTGKT+VDA+ GEI+ T EKI W L+ GE+ L+
Subjt:  CYEPATMKYLGFFPALSRDEVKERVASARKAQKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSRDTGKTIVDASMGEIMATCEKITWLLSEGEKWLK

Query:  PESR
        P  R
Subjt:  PESR

Arabidopsis top hits | e value | % identity | Alignment
AT1G23800.1 aldehyde dehydrogenase 2B7 | e value: 1.1e-55 | % identity: 37.1
Query:  PLGVIGAIVPWNYPFHNIFNPVLAAVFAGNGIVVKVSEHASWSGCFYIRIIQAALAAIGAPESLVDVITGFAETGEALVSS---VDKMIFVGSTGVGRMI
        P+GV G I+PWN+P   +   +  A+  GN +V+K +E    S     +++  A    G P+ +V++++GF  T  A ++S   VDK+ F GST VG++I
Subjt:  PLGVIGAIVPWNYPFHNIFNPVLAAVFAGNGIVVKVSEHASWSGCFYIRIIQAALAAIGAPESLVDVITGFAETGEALVSS---VDKMIFVGSTGVGRMI

Query:  MKTAAET-LIPVTLELGGKDAFIVCEDVDLDHVVNVALRASFTSSGHNCTGAERYYVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKL
        ++ A+++ L  VTLELGGK  FIVCED D+D  V +A  A F + G  C    R +VH+ +Y  FV+K   R     VG P     + G     EQ  K+
Subjt:  MKTAAET-LIPVTLELGGKDAFIVCEDVDLDHVVNVALRASFTSSGHNCTGAERYYVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKL

Query:  QSLVNDALDRGAKIVARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSRFGLGCAVFSGSKQRAKNIASQIRSG
           +   ++ GA + A G       G+   Y  PTV  DV   M +  +E FGP+  I+KF   DE +  AN+SR+GL   VF+ +   A  +   +R G
Subjt:  QSLVNDALDRGAKIVARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSRFGLGCAVFSGSKQRAKNIASQIRSG

Query:  SVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGLRACCLVKAVV
        +V IN F  + L  S+PFGG K SG GR  G+  L     VKAVV
Subjt:  SVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGLRACCLVKAVV

AT1G74920.1 aldehyde dehydrogenase 10A8 | e value: 4.3e-55 | % identity: 37.47
Query:  KARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNGIVVKVSEHASWSGCFYIRIIQAALAAIGAPESLVDVITGF-AETGEALVS--SVDKMIFVGS
        K+ V   PLGV+G I PWNYP       V  ++ AG   ++K SE AS + C  +  I      +G P  +++V+TGF +E G  L S   VDK+ F GS
Subjt:  KARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNGIVVKVSEHASWSGCFYIRIIQAALAAIGAPESLVDVITGF-AETGEALVS--SVDKMIFVGS

Query:  TGVGRMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASFTSSGHNCTGAERYYVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQ
           G  +M  AA+ + PV++ELGGK   IV +DVDLD     AL   F ++G  C+   R  VH++I S F++K+ +  K I +  P+     +G + ++
Subjt:  TGVGRMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASFTSSGHNCTGAERYYVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQ

Query:  EQSEKLQSLVNDALDRGAKIVARGTF-GHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSRFGLGCAVFSGSKQRAKNI
         Q EK+   ++ A   GA I+  G+   HL +G    +  PT+I DV  +M++ +EE FGP+L +  F+++DEA++LANDS +GLG AV S   +R   I
Subjt:  EQSEKLQSLVNDALDRGAKIVARGTF-GHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSRFGLGCAVFSGSKQRAKNI

Query:  ASQIRSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGLRACCLVKAVV----EDRWWPY
        +    +G V IN   +       P+GGVK SGFGR  G  GL     VK V      D W  Y
Subjt:  ASQIRSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGLRACCLVKAVV----EDRWWPY

AT1G74920.2 aldehyde dehydrogenase 10A8 | e value: 4.3e-55 | % identity: 37.47
Query:  KARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNGIVVKVSEHASWSGCFYIRIIQAALAAIGAPESLVDVITGF-AETGEALVS--SVDKMIFVGS
        K+ V   PLGV+G I PWNYP       V  ++ AG   ++K SE AS + C  +  I      +G P  +++V+TGF +E G  L S   VDK+ F GS
Subjt:  KARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNGIVVKVSEHASWSGCFYIRIIQAALAAIGAPESLVDVITGF-AETGEALVS--SVDKMIFVGS

Query:  TGVGRMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASFTSSGHNCTGAERYYVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQ
           G  +M  AA+ + PV++ELGGK   IV +DVDLD     AL   F ++G  C+   R  VH++I S F++K+ +  K I +  P+     +G + ++
Subjt:  TGVGRMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASFTSSGHNCTGAERYYVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQ

Query:  EQSEKLQSLVNDALDRGAKIVARGTF-GHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSRFGLGCAVFSGSKQRAKNI
         Q EK+   ++ A   GA I+  G+   HL +G    +  PT+I DV  +M++ +EE FGP+L +  F+++DEA++LANDS +GLG AV S   +R   I
Subjt:  EQSEKLQSLVNDALDRGAKIVARGTF-GHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSRFGLGCAVFSGSKQRAKNI

Query:  ASQIRSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGLRACCLVKAVV----EDRWWPY
        +    +G V IN   +       P+GGVK SGFGR  G  GL     VK V      D W  Y
Subjt:  ASQIRSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGLRACCLVKAVV----EDRWWPY

AT3G66658.1 aldehyde dehydrogenase 22A1 | e value: 2.0e-177 | % identity: 76.92
Query:  PPDESCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNGIVVKVSEHASWSGCFYIRIIQAALAAIGAPESLVDVITGFAETGEALVS
        P   S GRA LHK +RVEFHPLGVIGAIVPWNYPFHNIFNP+LAAVF+GNGIV+KVSEHASWSGCFY RIIQAALAA+GAPE+LVDVITGFAETGEALVS
Subjt:  PPDESCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNGIVVKVSEHASWSGCFYIRIIQAALAAIGAPESLVDVITGFAETGEALVS

Query:  SVDKMIFVGSTGVGRMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASFTSSGHNCTGAERYYVHKNIYSSFVDKISERVKAITVGPPLAG
        SVDKMIFVGST VG+MIM+ AAETL PVTLELGGKDAFI+CED D+ HV  VA+R +  SSG NC GAER+YVHK+IY++F+ ++++ VK+++ GPPL G
Subjt:  SVDKMIFVGSTGVGRMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASFTSSGHNCTGAERYYVHKNIYSSFVDKISERVKAITVGPPLAG

Query:  KYDMGAICTQEQSEKLQSLVNDALDRGAKIVARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSRFGLGCAVFS
        +YDMGAIC QE SE LQSLVNDALD+GA+I  RG+FGHL E AVDQYFPPTV+++VNH MK+M+EEAFGPI+PIM+FSTD+E +KLANDSR+ LGCAVFS
Subjt:  KYDMGAICTQEQSEKLQSLVNDALDRGAKIVARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSRFGLGCAVFS

Query:  GSKQRAKNIASQIRSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGLRACCLVKAVVEDRWWPYIQTKHPKPL
        GSK RAK IASQI+ G  AINDFA+NY+CQSLPFGGVK+SGFGRFAG+EGLRACCLVK+VVEDR+WP I+TK PKP+
Subjt:  GSKQRAKNIASQIRSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGLRACCLVKAVVEDRWWPYIQTKHPKPL

AT3G66658.1  aldehyde dehydrogenase 22A1  (e value: 6.4e-75; %identity: 80.12)
Query:  MAFWWTLLVLGFAYAICRFLLMLIPPNVPSIEVDASDVTDDGNQTQENSYIYIPPRVKTQQQDKKVQCYEPATMKYLGFFPALSRDEVKERVASARKAQK
        M FWW L+VL FAYAIC+FLLMLIPPNVPSI+VDASDV   G  T+ENS+IYIPPR ++QQ DKKVQCYEPATMKYLG+FPALS  EV+ERV  +RKAQK
Subjt:  MAFWWTLLVLGFAYAICRFLLMLIPPNVPSIEVDASDVTDDGNQTQENSYIYIPPRVKTQQQDKKVQCYEPATMKYLGFFPALSRDEVKERVASARKAQK

Query:  EWAKSSFKQRRLLLRILLKYIIEHQELICEISSRDTGKTIVDASMGEIMATCEKITWLLSEGEKWLKPESR
         WA+SSFK RR  LRILLKYIIEHQELICE+SSRDTGKT+VDAS+GEIM TCEKITWLLSEGE+WLKPESR
Subjt:  EWAKSSFKQRRLLLRILLKYIIEHQELICEISSRDTGKTIVDASMGEIMATCEKITWLLSEGEKWLKPESR

AT3G66658.2  aldehyde dehydrogenase 22A1  (e value: 4.3e-196; %identity: 75.95)
Query:  PPDESCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNGIVVKVSEHASWSGCFYIRIIQAALAAIGAPESLVDVITGFAETGEALVS
        P   S GRA LHK +RVEFHPLGVIGAIVPWNYPFHNIFNP+LAAVF+GNGIV+KVSEHASWSGCFY RIIQAALAA+GAPE+LVDVITGFAETGEALVS
Subjt:  PPDESCGRATLHKKARVEFHPLGVIGAIVPWNYPFHNIFNPVLAAVFAGNGIVVKVSEHASWSGCFYIRIIQAALAAIGAPESLVDVITGFAETGEALVS

Query:  SVDKMIFVGSTGVGRMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASFTSSGHNCTGAERYYVHKNIYSSFVDKISERVKAITVGPPLAG
        SVDKMIFVGST VG+MIM+ AAETL PVTLELGGKDAFI+CED D+ HV  VA+R +  SSG NC GAER+YVHK+IY++F+ ++++ VK+++ GPPL G
Subjt:  SVDKMIFVGSTGVGRMIMKTAAETLIPVTLELGGKDAFIVCEDVDLDHVVNVALRASFTSSGHNCTGAERYYVHKNIYSSFVDKISERVKAITVGPPLAG

Query:  KYDMGAICTQEQSEKLQSLVNDALDRGAKIVARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSRFGLGCAVFS
        +YDMGAIC QE SE LQSLVNDALD+GA+I  RG+FGHL E AVDQYFPPTV+++VNH MK+M+EEAFGPI+PIM+FSTD+E +KLANDSR+ LGCAVFS
Subjt:  KYDMGAICTQEQSEKLQSLVNDALDRGAKIVARGTFGHLPEGAVDQYFPPTVIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSRFGLGCAVFS

Query:  GSKQRAKNIASQIRSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGLRACCLVKAVVEDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLN
        GSK RAK IASQI+ G  AINDFA+NY+CQSLPFGGVK+SGFGRFAG+EGLRACCLVK+VVEDR+WP I+TK PKP+ YPVA+NAFEFQ +LVE  YGLN
Subjt:  GSKQRAKNIASQIRSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGLRACCLVKAVVEDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLN

Query:  IWDRLKALVNVLKMLSEQNS
        IWDRL++L++VLK L++Q+S
Subjt:  IWDRLKALVNVLKMLSEQNS
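
The %identity column is most plausibly the share of identical positions over the aligned length. As a minimal sketch under that assumption, the helper below (`percent_identity` is a hypothetical name, not part of the database) counts the letters in the middle line of one alignment segment, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch. It covers only the final segment above, so its result is not expected to match the value reported for the whole hit.

```python
def percent_identity(midline: str) -> float:
    """Percent of positions in a BLAST match line that are identical.

    Assumes the match line has been cut to the aligned columns only,
    so every character (letter, '+', or space) is one alignment column.
    """
    identical = sum(ch.isalpha() for ch in midline)  # letters = identical residues
    return 100.0 * identical / len(midline)


# Match line of the last segment of the AT3G66658.2 alignment above.
segment_midline = "IWDRL++L++VLK L++Q+S"
print(percent_identity(segment_midline))
```
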

AT3G66658.2  aldehyde dehydrogenase 22A1  (e value: 6.4e-75; %identity: 80.12)
Query:  MAFWWTLLVLGFAYAICRFLLMLIPPNVPSIEVDASDVTDDGNQTQENSYIYIPPRVKTQQQDKKVQCYEPATMKYLGFFPALSRDEVKERVASARKAQK
        M FWW L+VL FAYAIC+FLLMLIPPNVPSI+VDASDV   G  T+ENS+IYIPPR ++QQ DKKVQCYEPATMKYLG+FPALS  EV+ERV  +RKAQK
Subjt:  MAFWWTLLVLGFAYAICRFLLMLIPPNVPSIEVDASDVTDDGNQTQENSYIYIPPRVKTQQQDKKVQCYEPATMKYLGFFPALSRDEVKERVASARKAQK

Query:  EWAKSSFKQRRLLLRILLKYIIEHQELICEISSRDTGKTIVDASMGEIMATCEKITWLLSEGEKWLKPESR
         WA+SSFK RR  LRILLKYIIEHQELICE+SSRDTGKT+VDAS+GEIM TCEKITWLLSEGE+WLKPESR
Subjt:  EWAKSSFKQRRLLLRILLKYIIEHQELICEISSRDTGKTIVDASMGEIMATCEKITWLLSEGEKWLKPESR


Sequences
CDS sequence
ATGCCAATCTGCTTGAGGTTCGACACAGTGCCTCCATCAGTGTCAGTCGCAGTTCTATGTCTGTATCGTCTTTTCCCGCGATCTCTCCTCTTGAACGCCATGGCATTTTG
GTGGACTCTGCTTGTTCTAGGATTTGCCTACGCGATCTGTAGGTTTCTCTTGATGCTCATCCCTCCGAATGTGCCTTCAATTGAAGTTGACGCATCTGACGTTACGGACG
ACGGGAATCAGACGCAGGAGAATAGCTACATCTATATTCCTCCTAGGGTAAAGACACAACAGCAAGACAAGAAAGTCCAGTGTTACGAGCCTGCAACTATGAAATACTTG
GGCTTTTTTCCTGCATTGTCACGAGATGAGGTCAAGGAGCGTGTTGCTTCAGCCAGGAAAGCACAGAAAGAATGGGCCAAAAGTAGTTTCAAGCAAAGGCGTTTGCTTTT
ACGGATACTTTTGAAGTATATAATTGAACATCAAGAGCTTATTTGCGAGATTTCCTCCCGTGACACTGGAAAAACAATAGTGGATGCTAGTATGGGAGAAATAATGGCGA
CATGTGAAAAAATTACTTGGCTTCTTTCAGAGGGTGAGAAATGGCTGAAGCCTGAGAGCCGCCATGTGCTTGCCATGTGCTCCTTGGTTATTCACTTCTTCGGCTCCATC
AAATTGGTATCAGAGCGCTCGATTCTCGGGACACCTTTTACCATGACAACCAAGAAAGTCTCGACTAATGTGAGTGGAGATCCTTCTTTGGTTGATAAAGAGGCGGAATC
TATCCCCGTCCTCTCACCACAAGAGACAACTGTACGCTTGCTGTCAGTTGAAGACGACGTGCGTGAGATCAAGAAAATTCTAAAGCTAATATGGGAAAAGATAGGCGTCC
AAACCGAGCAACTAACGATAAATCCGGATGCAGAAGCAACTAGAGGAAAAGAAACACAGATTCGGCAAGGGGAAAGCAGCAAATTGACGAACCAACATGAAGACAGAACC
AAATTCCAGCAAGAAAGGTGTTTTAATGAACAAAAGATAACCCAAGATTGGGCTGCAGCTCCAACAAAATACCAAGAATGGGAAATGGCGCATAAAAGGTACAAAGAACT
TAAGGAAAATCCGCTATTTCGAAGGTATCAAGACTGGTCAGAATCTTCAAGTGAGGACGAGAAAGAACCAAGAGTCTTTTATCAACAAAATCCAAGGTATGACCATGCGG
AAAGAAAAGAAGATGCAAGAATCCCGTACCATCCATATGGTCATCAAGAATTCCCAATGTATAACCACCCGGAAGGAAGAGATGATGCAAGAATGTTCTACCATAAACAA
GAAAGCGATTTCAAAATGAAGGTGGACATCCCAACTTACAATGGTAAAATGGAAATTGAAGCATTCTTGGAATGGATACGCCATGTGGAGAATTTTTTCGATTACATGAA
TACCCCGGAATCAAAGAAGGTCAAATTAGTAGCACTCAAATTAAAGGGAGGAGCACAAGCATGGTGGGATCAACTCGAAATTAACCGCCAAAGATTTGGCAAGAGACCAA
TCCGTAAATGGGAAAGAATGAAGCGGTTGATGAGGGATAGATTTCTCCCCGCCAATTTTGAACAGATTTTATACACCCAATACCAAAATTGCCGCCAAGGGACACGCACG
GTAATGGAGTATACCGAAGAGTTCCATAGATTGGGGGCTTGTACGAATCTTGGAGAAAATCAGCAATGTTTGGTTGCAAGATTCAAAGGTGGTCTGCGTTTGGATATCCA
AGAAAGAATTTCTTTCCAACCTATAGCTTTCTTGAATGAAGCTATTACGGCTGCTGCTACGGCTGAAGAGCAAATTACCAACAGGTTTAAAAGGCAATACACACGAAGGA
ATGTCGTAGAATCATCAACTTCCAATCCAAGAAACACCACAGTTGACAAGACCACATCCCCAACCACATCGGTCGGGGCAAAAGGCAAGCAAATAGACGCTGGAACAAGT
AAGGATTTGGTTCCTAAGAGGAACACCAACAATTACTCGAGACCTACTTTGGGAAAATGCTTCCGCTGTAACCAAACTGGACATTTATCGAATGAATGCCCACAAAGAAA
GACCTTAGCGATTGCGGACAATGTTGATTATCAAGCGGAAAGTGACGAAACAGATAGTGATGAAGACATAGATTATCTTGAGCCAGATGAAGGGGATGCCTTATCTTTGG
TCATTCAACGACTACTTTTGGCCCCAAAGATCGACCAAAGTTACCAAAGACATGCTTTATTCAAGACCCGCTGCACTATAAATGGGAAAATCTGCAATGTCATCATCGAC
AGTGGCAGTACCGAGAATGTGGTTGCAAGTAAGTTAGTTTCTACTCTAAACTTACCTTTGACTCCACATCCAGCCCCATATAAGGCAATCCACAAAGGAAGAGACAACAC
CTACGAATTTACATGGATGGGGAAGAAAATCGTGCTTCTACCTATCAATCCTTCCAAGCAAGTAGGTACAACCACCAATAAAAAAGGTCAACTCTTCTCTATATCTTCTG
GAAAAAAAAATTTCCAAGAAAAACAATTTCCTATCATAGGTTTAGTGATTAAATATTTCGATAATCTTGATCCTAACATTGCTATTCCAAGTGAAGTTCAAAATCTTTTA
CAAGATTTTAAGGAAATCATTGACACTCCAATCGAACTACCCCCATTGCGTGACATCCAGCACACCATCGAATTCCTCCCAGGCGCAACCTTGCCCAACTTACCACACTA
CCGAATGAGTCCTAACGAATACAAAATATTGCATGAACAAGTGCAAGAATTGCTGGAAAAAGGTCACATACAACCAAGCCTCAGTCCCTGTGCAGTACCCGCTCTTCTAA
CACCAAAAAAAGACGGATCTTGGCGCATGTGTGTTGACAGTCGAGCAATTAACAAGATCACAGTGGAATACAGATTTCCAATCCTCAGAATCAATGATCTCCTTAACCAA
TTAAGTGGGGCAACCATATTTCCAAAGATAGATCTCAAGAGTGGATATCACCAAATACGGATTCGGCCAGGGGATGAATGGAAGACTGCATTTAAGACGAATGAAGGATT
GTTTGAGTGGTTGGTCATGCCATTCGGCCTCTCCAATGCTCCAAGCACATTTATGAGGCTCATGAACCAGGAAAAAATTATCACTAGCCCGGTGCTCGCACTTCCAAACT
TTGACTTACCTTTTGAAGTAGCAGTGGATGCCTCCGGCATTGGTGTTGGTGCAGTCCTATCCCAAAATAACCACCCTATTGAGTTCTTTAGTGAAAAACTAAGCCCCTCA
AGACAAAGTCCGGGCGAAATTATAGCATTCAATAGCCTACCGGAAGACTACCACAGTGACAAAGATTTTGGCTCCATCGGGAAAAATTGCACTGAACATAAACCAACTGG
AGATTTCCACTTGGTCGACGGATTCCTTTTTAAAGGGAACCTTCTGTGTATACCTTTCTCATCCTTACGAGAAGCCTTACTCCAAGAAGCACACGCGGGAGGATTGGCTG
GACATTTAGGCCGAGATAAAACTCTACATATGCTGGAAGAACGATATTACTGGCCGCAACTTAAAAAAGATGTTCAAAACTATGTCTCACGATGTTTTACATGCCAAACT
TGTAAAGGAACAAGCCAAAACACAGGTTTATATTTGCCTTTACCTATCCCCGACAACATTTGGGAAGACTTATCCATGGATTTTATACTCGGCCTCCCTCGAACCCAACG
GGGTTTTGATTCCGTCTTGGTGGTAGTAGATCGTTATAGCAAGATGGCACATTTTCTTGCTTGTAAAAAAACTTCTGATGCAAGTTACATTGCAAACCTATTTTTTCGAG
AGGTTGTCCGCCTGCATGGAATCCCAAAATCCATAGTTTCAGACAGGGATGTCAAGTTTTTGAGCCATTTTTGGAAAACCTTGTGGAAAAAATTTGATACTACTCTCAAA
TATAGTACTACTAGCCATCCACAAACAGACGGCCAAACGGAGGTAACAAATAGAACTCTTGGAAACCTTATCAGATGTCTAAGTGGCGACAAACCGAGACAATGGGACAT
GACTCTAGCACAAGCTGAATTTGCCTACAATCATATGAAGAACCGGACAACAGGGAAGTCACCATTCGAGCTTGTTTACACTAAACTCCCACGCCTAACTCTTGATCTAA
CACTAATTCCTTCCTCTGTTGATCTTAGCCTTGAAGCAGAAGAGATGGCAACTAGGATTCGAGATCTACATCAGCAAGTGCACAAACACATCGAAGAACAAACAATAAAG
TACAAGGCTGCCGCAGACAAAAAACGAAGATACAAGGAATTTGCTGTTGGGGACTTAGTGATGGTCCACTTACGCAAAAACAGATTTCCAGCTGGAACTTACAGCAAGCT
TAAGGAAAAGAAGATTGGCCCACTTCCTATTCTGGAAAAATACGGAGATAATGCCTACCGCGTGCAACTCCCAGCCACTTACAACATCAGTTCTACTTTCAACATAGCTG
ATATCTTTCCGTACAATCCTCCGGATGAATCATGTGGAAGGGCAACACTTCACAAGAAAGCCCGAGTAGAATTTCATCCCCTTGGAGTTATTGGGGCAATAGTCCCATGG
AACTATCCATTCCATAACATTTTCAATCCAGTGCTTGCAGCAGTCTTTGCAGGAAACGGTATTGTAGTTAAGGTTTCAGAACATGCAAGTTGGTCCGGATGCTTCTACAT
CAGGATTATCCAAGCTGCACTTGCTGCTATTGGAGCTCCTGAGAGCTTGGTCGATGTGATAACAGGGTTTGCTGAAACAGGTGAAGCGCTAGTATCTTCAGTAGACAAAA
TGATATTTGTTGGATCTACTGGTGTGGGTAGGATGATAATGAAAACTGCTGCTGAGACACTTATTCCAGTTACTCTTGAGTTGGGTGGAAAGGACGCATTTATTGTGTGT
GAGGATGTAGATTTGGATCATGTTGTAAACGTTGCACTCAGGGCTTCTTTTACATCAAGTGGACATAATTGTACTGGAGCTGAGAGATATTATGTCCACAAGAACATTTA
CTCCTCATTTGTGGATAAAATATCAGAACGTGTAAAGGCTATTACAGTTGGTCCACCGTTAGCTGGGAAATATGATATGGGTGCCATCTGTACGCAAGAGCAGTCTGAGA
AACTTCAAAGCCTTGTAAATGATGCCTTAGACAGGGGGGCAAAAATTGTTGCTCGTGGAACTTTTGGACACTTACCTGAAGGTGCCGTTGATCAATATTTCCCACCTACC
GTTATTGTTGATGTCAACCATACCATGAAGTTGATGCAAGAGGAGGCATTTGGACCAATCCTGCCTATAATGAAATTCAGCACTGATGATGAGGCCGTGAAGCTTGCAAA
TGATTCAAGATTTGGACTTGGCTGTGCTGTGTTTTCTGGCAGTAAACAACGTGCCAAGAATATTGCTTCGCAGATACGTTCTGGAAGTGTTGCAATTAATGACTTTGCCA
CAAATTATTTGTGTCAGTCCTTGCCATTTGGTGGCGTGAAAGAGAGTGGATTCGGTCGGTTTGCAGGCGTTGAAGGATTACGAGCTTGTTGTCTCGTCAAAGCAGTGGTT
GAGGATAGGTGGTGGCCGTACATTCAAACTAAGCATCCTAAGCCTCTTACGTATCCTGTTGCAGACAACGCTTTTGAGTTTCAGATGTCACTCGTTGAAGCAACGTACGG
CCTCAACATATGGGATCGATTAAAAGCATTGGTCAATGTTCTGAAGATGCTGTCCGAGCAAAACAGTCCGGCTAAAGATAACTCTGCCATTGGCGATGGAAGTAAGAGAG
CGGAGTGA
mRNA sequence
ATGCCAATCTGCTTGAGGTTCGACACAGTGCCTCCATCAGTGTCAGTCGCAGTTCTATGTCTGTATCGTCTTTTCCCGCGATCTCTCCTCTTGAACGCCATGGCATTTTG
GTGGACTCTGCTTGTTCTAGGATTTGCCTACGCGATCTGTAGGTTTCTCTTGATGCTCATCCCTCCGAATGTGCCTTCAATTGAAGTTGACGCATCTGACGTTACGGACG
ACGGGAATCAGACGCAGGAGAATAGCTACATCTATATTCCTCCTAGGGTAAAGACACAACAGCAAGACAAGAAAGTCCAGTGTTACGAGCCTGCAACTATGAAATACTTG
GGCTTTTTTCCTGCATTGTCACGAGATGAGGTCAAGGAGCGTGTTGCTTCAGCCAGGAAAGCACAGAAAGAATGGGCCAAAAGTAGTTTCAAGCAAAGGCGTTTGCTTTT
ACGGATACTTTTGAAGTATATAATTGAACATCAAGAGCTTATTTGCGAGATTTCCTCCCGTGACACTGGAAAAACAATAGTGGATGCTAGTATGGGAGAAATAATGGCGA
CATGTGAAAAAATTACTTGGCTTCTTTCAGAGGGTGAGAAATGGCTGAAGCCTGAGAGCCGCCATGTGCTTGCCATGTGCTCCTTGGTTATTCACTTCTTCGGCTCCATC
AAATTGGTATCAGAGCGCTCGATTCTCGGGACACCTTTTACCATGACAACCAAGAAAGTCTCGACTAATGTGAGTGGAGATCCTTCTTTGGTTGATAAAGAGGCGGAATC
TATCCCCGTCCTCTCACCACAAGAGACAACTGTACGCTTGCTGTCAGTTGAAGACGACGTGCGTGAGATCAAGAAAATTCTAAAGCTAATATGGGAAAAGATAGGCGTCC
AAACCGAGCAACTAACGATAAATCCGGATGCAGAAGCAACTAGAGGAAAAGAAACACAGATTCGGCAAGGGGAAAGCAGCAAATTGACGAACCAACATGAAGACAGAACC
AAATTCCAGCAAGAAAGGTGTTTTAATGAACAAAAGATAACCCAAGATTGGGCTGCAGCTCCAACAAAATACCAAGAATGGGAAATGGCGCATAAAAGGTACAAAGAACT
TAAGGAAAATCCGCTATTTCGAAGGTATCAAGACTGGTCAGAATCTTCAAGTGAGGACGAGAAAGAACCAAGAGTCTTTTATCAACAAAATCCAAGGTATGACCATGCGG
AAAGAAAAGAAGATGCAAGAATCCCGTACCATCCATATGGTCATCAAGAATTCCCAATGTATAACCACCCGGAAGGAAGAGATGATGCAAGAATGTTCTACCATAAACAA
GAAAGCGATTTCAAAATGAAGGTGGACATCCCAACTTACAATGGTAAAATGGAAATTGAAGCATTCTTGGAATGGATACGCCATGTGGAGAATTTTTTCGATTACATGAA
TACCCCGGAATCAAAGAAGGTCAAATTAGTAGCACTCAAATTAAAGGGAGGAGCACAAGCATGGTGGGATCAACTCGAAATTAACCGCCAAAGATTTGGCAAGAGACCAA
TCCGTAAATGGGAAAGAATGAAGCGGTTGATGAGGGATAGATTTCTCCCCGCCAATTTTGAACAGATTTTATACACCCAATACCAAAATTGCCGCCAAGGGACACGCACG
GTAATGGAGTATACCGAAGAGTTCCATAGATTGGGGGCTTGTACGAATCTTGGAGAAAATCAGCAATGTTTGGTTGCAAGATTCAAAGGTGGTCTGCGTTTGGATATCCA
AGAAAGAATTTCTTTCCAACCTATAGCTTTCTTGAATGAAGCTATTACGGCTGCTGCTACGGCTGAAGAGCAAATTACCAACAGGTTTAAAAGGCAATACACACGAAGGA
ATGTCGTAGAATCATCAACTTCCAATCCAAGAAACACCACAGTTGACAAGACCACATCCCCAACCACATCGGTCGGGGCAAAAGGCAAGCAAATAGACGCTGGAACAAGT
AAGGATTTGGTTCCTAAGAGGAACACCAACAATTACTCGAGACCTACTTTGGGAAAATGCTTCCGCTGTAACCAAACTGGACATTTATCGAATGAATGCCCACAAAGAAA
GACCTTAGCGATTGCGGACAATGTTGATTATCAAGCGGAAAGTGACGAAACAGATAGTGATGAAGACATAGATTATCTTGAGCCAGATGAAGGGGATGCCTTATCTTTGG
TCATTCAACGACTACTTTTGGCCCCAAAGATCGACCAAAGTTACCAAAGACATGCTTTATTCAAGACCCGCTGCACTATAAATGGGAAAATCTGCAATGTCATCATCGAC
AGTGGCAGTACCGAGAATGTGGTTGCAAGTAAGTTAGTTTCTACTCTAAACTTACCTTTGACTCCACATCCAGCCCCATATAAGGCAATCCACAAAGGAAGAGACAACAC
CTACGAATTTACATGGATGGGGAAGAAAATCGTGCTTCTACCTATCAATCCTTCCAAGCAAGTAGGTACAACCACCAATAAAAAAGGTCAACTCTTCTCTATATCTTCTG
GAAAAAAAAATTTCCAAGAAAAACAATTTCCTATCATAGGTTTAGTGATTAAATATTTCGATAATCTTGATCCTAACATTGCTATTCCAAGTGAAGTTCAAAATCTTTTA
CAAGATTTTAAGGAAATCATTGACACTCCAATCGAACTACCCCCATTGCGTGACATCCAGCACACCATCGAATTCCTCCCAGGCGCAACCTTGCCCAACTTACCACACTA
CCGAATGAGTCCTAACGAATACAAAATATTGCATGAACAAGTGCAAGAATTGCTGGAAAAAGGTCACATACAACCAAGCCTCAGTCCCTGTGCAGTACCCGCTCTTCTAA
CACCAAAAAAAGACGGATCTTGGCGCATGTGTGTTGACAGTCGAGCAATTAACAAGATCACAGTGGAATACAGATTTCCAATCCTCAGAATCAATGATCTCCTTAACCAA
TTAAGTGGGGCAACCATATTTCCAAAGATAGATCTCAAGAGTGGATATCACCAAATACGGATTCGGCCAGGGGATGAATGGAAGACTGCATTTAAGACGAATGAAGGATT
GTTTGAGTGGTTGGTCATGCCATTCGGCCTCTCCAATGCTCCAAGCACATTTATGAGGCTCATGAACCAGGAAAAAATTATCACTAGCCCGGTGCTCGCACTTCCAAACT
TTGACTTACCTTTTGAAGTAGCAGTGGATGCCTCCGGCATTGGTGTTGGTGCAGTCCTATCCCAAAATAACCACCCTATTGAGTTCTTTAGTGAAAAACTAAGCCCCTCA
AGACAAAGTCCGGGCGAAATTATAGCATTCAATAGCCTACCGGAAGACTACCACAGTGACAAAGATTTTGGCTCCATCGGGAAAAATTGCACTGAACATAAACCAACTGG
AGATTTCCACTTGGTCGACGGATTCCTTTTTAAAGGGAACCTTCTGTGTATACCTTTCTCATCCTTACGAGAAGCCTTACTCCAAGAAGCACACGCGGGAGGATTGGCTG
GACATTTAGGCCGAGATAAAACTCTACATATGCTGGAAGAACGATATTACTGGCCGCAACTTAAAAAAGATGTTCAAAACTATGTCTCACGATGTTTTACATGCCAAACT
TGTAAAGGAACAAGCCAAAACACAGGTTTATATTTGCCTTTACCTATCCCCGACAACATTTGGGAAGACTTATCCATGGATTTTATACTCGGCCTCCCTCGAACCCAACG
GGGTTTTGATTCCGTCTTGGTGGTAGTAGATCGTTATAGCAAGATGGCACATTTTCTTGCTTGTAAAAAAACTTCTGATGCAAGTTACATTGCAAACCTATTTTTTCGAG
AGGTTGTCCGCCTGCATGGAATCCCAAAATCCATAGTTTCAGACAGGGATGTCAAGTTTTTGAGCCATTTTTGGAAAACCTTGTGGAAAAAATTTGATACTACTCTCAAA
TATAGTACTACTAGCCATCCACAAACAGACGGCCAAACGGAGGTAACAAATAGAACTCTTGGAAACCTTATCAGATGTCTAAGTGGCGACAAACCGAGACAATGGGACAT
GACTCTAGCACAAGCTGAATTTGCCTACAATCATATGAAGAACCGGACAACAGGGAAGTCACCATTCGAGCTTGTTTACACTAAACTCCCACGCCTAACTCTTGATCTAA
CACTAATTCCTTCCTCTGTTGATCTTAGCCTTGAAGCAGAAGAGATGGCAACTAGGATTCGAGATCTACATCAGCAAGTGCACAAACACATCGAAGAACAAACAATAAAG
TACAAGGCTGCCGCAGACAAAAAACGAAGATACAAGGAATTTGCTGTTGGGGACTTAGTGATGGTCCACTTACGCAAAAACAGATTTCCAGCTGGAACTTACAGCAAGCT
TAAGGAAAAGAAGATTGGCCCACTTCCTATTCTGGAAAAATACGGAGATAATGCCTACCGCGTGCAACTCCCAGCCACTTACAACATCAGTTCTACTTTCAACATAGCTG
ATATCTTTCCGTACAATCCTCCGGATGAATCATGTGGAAGGGCAACACTTCACAAGAAAGCCCGAGTAGAATTTCATCCCCTTGGAGTTATTGGGGCAATAGTCCCATGG
AACTATCCATTCCATAACATTTTCAATCCAGTGCTTGCAGCAGTCTTTGCAGGAAACGGTATTGTAGTTAAGGTTTCAGAACATGCAAGTTGGTCCGGATGCTTCTACAT
CAGGATTATCCAAGCTGCACTTGCTGCTATTGGAGCTCCTGAGAGCTTGGTCGATGTGATAACAGGGTTTGCTGAAACAGGTGAAGCGCTAGTATCTTCAGTAGACAAAA
TGATATTTGTTGGATCTACTGGTGTGGGTAGGATGATAATGAAAACTGCTGCTGAGACACTTATTCCAGTTACTCTTGAGTTGGGTGGAAAGGACGCATTTATTGTGTGT
GAGGATGTAGATTTGGATCATGTTGTAAACGTTGCACTCAGGGCTTCTTTTACATCAAGTGGACATAATTGTACTGGAGCTGAGAGATATTATGTCCACAAGAACATTTA
CTCCTCATTTGTGGATAAAATATCAGAACGTGTAAAGGCTATTACAGTTGGTCCACCGTTAGCTGGGAAATATGATATGGGTGCCATCTGTACGCAAGAGCAGTCTGAGA
AACTTCAAAGCCTTGTAAATGATGCCTTAGACAGGGGGGCAAAAATTGTTGCTCGTGGAACTTTTGGACACTTACCTGAAGGTGCCGTTGATCAATATTTCCCACCTACC
GTTATTGTTGATGTCAACCATACCATGAAGTTGATGCAAGAGGAGGCATTTGGACCAATCCTGCCTATAATGAAATTCAGCACTGATGATGAGGCCGTGAAGCTTGCAAA
TGATTCAAGATTTGGACTTGGCTGTGCTGTGTTTTCTGGCAGTAAACAACGTGCCAAGAATATTGCTTCGCAGATACGTTCTGGAAGTGTTGCAATTAATGACTTTGCCA
CAAATTATTTGTGTCAGTCCTTGCCATTTGGTGGCGTGAAAGAGAGTGGATTCGGTCGGTTTGCAGGCGTTGAAGGATTACGAGCTTGTTGTCTCGTCAAAGCAGTGGTT
GAGGATAGGTGGTGGCCGTACATTCAAACTAAGCATCCTAAGCCTCTTACGTATCCTGTTGCAGACAACGCTTTTGAGTTTCAGATGTCACTCGTTGAAGCAACGTACGG
CCTCAACATATGGGATCGATTAAAAGCATTGGTCAATGTTCTGAAGATGCTGTCCGAGCAAAACAGTCCGGCTAAAGATAACTCTGCCATTGGCGATGGAAGTAAGAGAG
CGGAGTGA
Protein sequence
MPICLRFDTVPPSVSVAVLCLYRLFPRSLLLNAMAFWWTLLVLGFAYAICRFLLMLIPPNVPSIEVDASDVTDDGNQTQENSYIYIPPRVKTQQQDKKVQCYEPATMKYL
GFFPALSRDEVKERVASARKAQKEWAKSSFKQRRLLLRILLKYIIEHQELICEISSRDTGKTIVDASMGEIMATCEKITWLLSEGEKWLKPESRHVLAMCSLVIHFFGSI
KLVSERSILGTPFTMTTKKVSTNVSGDPSLVDKEAESIPVLSPQETTVRLLSVEDDVREIKKILKLIWEKIGVQTEQLTINPDAEATRGKETQIRQGESSKLTNQHEDRT
KFQQERCFNEQKITQDWAAAPTKYQEWEMAHKRYKELKENPLFRRYQDWSESSSEDEKEPRVFYQQNPRYDHAERKEDARIPYHPYGHQEFPMYNHPEGRDDARMFYHKQ
ESDFKMKVDIPTYNGKMEIEAFLEWIRHVENFFDYMNTPESKKVKLVALKLKGGAQAWWDQLEINRQRFGKRPIRKWERMKRLMRDRFLPANFEQILYTQYQNCRQGTRT
VMEYTEEFHRLGACTNLGENQQCLVARFKGGLRLDIQERISFQPIAFLNEAITAAATAEEQITNRFKRQYTRRNVVESSTSNPRNTTVDKTTSPTTSVGAKGKQIDAGTS
KDLVPKRNTNNYSRPTLGKCFRCNQTGHLSNECPQRKTLAIADNVDYQAESDETDSDEDIDYLEPDEGDALSLVIQRLLLAPKIDQSYQRHALFKTRCTINGKICNVIID
SGSTENVVASKLVSTLNLPLTPHPAPYKAIHKGRDNTYEFTWMGKKIVLLPINPSKQVGTTTNKKGQLFSISSGKKNFQEKQFPIIGLVIKYFDNLDPNIAIPSEVQNLL
QDFKEIIDTPIELPPLRDIQHTIEFLPGATLPNLPHYRMSPNEYKILHEQVQELLEKGHIQPSLSPCAVPALLTPKKDGSWRMCVDSRAINKITVEYRFPILRINDLLNQ
LSGATIFPKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQEKIITSPVLALPNFDLPFEVAVDASGIGVGAVLSQNNHPIEFFSEKLSPS
RQSPGEIIAFNSLPEDYHSDKDFGSIGKNCTEHKPTGDFHLVDGFLFKGNLLCIPFSSLREALLQEAHAGGLAGHLGRDKTLHMLEERYYWPQLKKDVQNYVSRCFTCQT
CKGTSQNTGLYLPLPIPDNIWEDLSMDFILGLPRTQRGFDSVLVVVDRYSKMAHFLACKKTSDASYIANLFFREVVRLHGIPKSIVSDRDVKFLSHFWKTLWKKFDTTLK
YSTTSHPQTDGQTEVTNRTLGNLIRCLSGDKPRQWDMTLAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTLDLTLIPSSVDLSLEAEEMATRIRDLHQQVHKHIEEQTIK
YKAAADKKRRYKEFAVGDLVMVHLRKNRFPAGTYSKLKEKKIGPLPILEKYGDNAYRVQLPATYNISSTFNIADIFPYNPPDESCGRATLHKKARVEFHPLGVIGAIVPW
NYPFHNIFNPVLAAVFAGNGIVVKVSEHASWSGCFYIRIIQAALAAIGAPESLVDVITGFAETGEALVSSVDKMIFVGSTGVGRMIMKTAAETLIPVTLELGGKDAFIVC
EDVDLDHVVNVALRASFTSSGHNCTGAERYYVHKNIYSSFVDKISERVKAITVGPPLAGKYDMGAICTQEQSEKLQSLVNDALDRGAKIVARGTFGHLPEGAVDQYFPPT
VIVDVNHTMKLMQEEAFGPILPIMKFSTDDEAVKLANDSRFGLGCAVFSGSKQRAKNIASQIRSGSVAINDFATNYLCQSLPFGGVKESGFGRFAGVEGLRACCLVKAVV
EDRWWPYIQTKHPKPLTYPVADNAFEFQMSLVEATYGLNIWDRLKALVNVLKMLSEQNSPAKDNSAIGDGSKRAE
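
The protein sequence should follow from the CDS by codon-wise translation. The sketch below illustrates this on the first 24 nucleotides of the CDS, assuming the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and not taken from the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nucleotides of the CDS listed above.
cds_start = "ATGCCAATCTGCTTGAGGTTCGAC"
print(translate(cds_start))  # MPICLRFD, the first eight residues of the protein
```

Passing the full CDS, with its lines joined into one string, to the same function would be one way to check the whole protein sequence against the record.
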