; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025318 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025318
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRNA-directed DNA polymerase
Genome locationchr10:11267710..11279712
RNA-Seq ExpressionLag0025318
SyntenyLag0025318
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR003959 - ATPase, AAA-type, core
IPR043502 - DNA/RNA polymerase superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR041588 - Integrase zinc-binding domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR036875 - Zinc finger, CCHC-type superfamily
IPR036397 - Ribonuclease H superfamily
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR021109 - Aspartic peptidase domain superfamily
IPR012337 - Ribonuclease H-like superfamily
IPR005162 - Retrotransposon gag domain
IPR003960 - ATPase, AAA-type, conserved site
IPR003593 - AAA+ ATPase domain
IPR001969 - Aspartic peptidase, active site
IPR001878 - Zinc finger, CCHC-type
IPR001584 - Integrase, catalytic core
IPR000477 - Reverse transcriptase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7586297.1 Integrase catalytic core [Arabidopsis thaliana x Arabidopsis arenosa]6.7e-25144.93Show/hide
Query:  MKERFLPMNFEQILYDQYQNCKQGTRSVMEYTEEFHRLGARTNLGESQQYLVARYKGGLRNDIKEQLSLQPIGYLNEAISAA--AVIEEQITNRFK--RT
        M ERFLP ++EQILY  Y  C QG R+V EYT EF R   R +LGE++   VARY  GL++ I+E++ LQ +  + EA S A  A + E+    F   R 
Subjt:  MKERFLPMNFEQILYDQYQNCKQGTRSVMEYTEEFHRLGARTNLGESQQYLVARYKGGLRNDIKEQLSLQPIGYLNEAISAA--AVIEEQITNRFK--RT

Query:  YSRRATGEQTNPFSKTTSGERLPQQASSAISKGKQIDNGSNKDVKNQENPKRSTNTYSRPTMGKCFRCNQTGHLSNECPQRRTLTIADDVEYVEERDGTE
        YS +   E T    K+     + + A+         +NGSN   +++   +R  N Y+RP++ KC+RC   GH SN CP RRTL +   +E  EER+  +
Subjt:  YSRRATGEQTNPFSKTTSGERLPQQASSAISKGKQIDNGSNKDVKNQENPKRSTNTYSRPTMGKCFRCNQTGHLSNECPQRRTLTIADDVEYVEERDGTE

Query:  SEENTDFLEPYEGEPVSLVVQRLLLAPKKDPSYQRHALFKTRCTINGKICNVIIDSGSTENIVASKLVTSLNLPLHPHSAPYKVSWIKKGGEAQVTHTST
             +F E    E ++LV+QR+LL+ K++   QR  LF+TRC++N K+CN+I+D+GSTEN+V+ KLV  L LP   H  PY + W+ KG +  VT +  
Subjt:  SEENTDFLEPYEGEPVSLVVQRLLLAPKKDPSYQRHALFKTRCTINGKICNVIIDSGSTENIVASKLVTSLNLPLHPHSAPYKVSWIKKGGEAQVTHTST

Query:  IPLSIGASYKDQIICDVLDMDACHILLGRLWQYDVQAVHKGRDNTYEFHWMGKKI------------------------------------QHIIGFVVK
        +P+SIG  YK+++ICDVLDMD CHI+LGR WQYD   +++GRDN   F W G KI                                    + I   V+K
Subjt:  IPLSIGASYKDQIICDVLDMDACHILLGRLWQYDVQAVHKGRDNTYEFHWMGKKI------------------------------------QHIIGFVVK

Query:  HFDTSISDHV-IPNEVQKLLQQHTTIV--DNPTKLPPLRDIQHSIDLIPGATLPNLPHYKMSPTEYQILHEQIQELLDKGHIQPSLSPCAVPALLTPKND
           +++ + V  P EV ++L+    ++  + P +LPP+R+IQH IDLIPG++LPNLPHY+MSP E +IL EQI++LL KG I+ S+SPCAVP LL PK  
Subjt:  HFDTSISDHV-IPNEVQKLLQQHTTIV--DNPTKLPPLRDIQHSIDLIPGATLPNLPHYKMSPTEYQILHEQIQELLDKGHIQPSLSPCAVPALLTPKND

Query:  GTWRMCVDSRAINKITVKYRFPIPRIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQGKFLWGLE
          WRMCVDSRAINKIT+KYRFPIPR+ D+LD+L G+ +FSKIDL+SGYHQIRI+PGDEWKTAFK+ +GL+EWLVMPFGLSNAPSTFMRLMNQ        
Subjt:  GTWRMCVDSRAINKITVKYRFPIPRIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQGKFLWGLE

Query:  QQKSFGVIKEKLASSPVLALPNFNTPFEVAVDASGIGVGAVLSQNSHPIDFFSEKLSPSRQNWSTFD---------------------------------
                       P  ALP+F+  F+V  DASG+G+GAVLSQ   P+ FFSEKLS +RQ WST+D                                 
Subjt:  QQKSFGVIKEKLASSPVLALPNFNTPFEVAVDASGIGVGAVLSQNSHPIDFFSEKLSPSRQNWSTFD---------------------------------

Query:  ----------------------FVIKHTSGITNKVADTLSRKESILTILKGEIIAMDHLPETYADDKDFGTTWKKCIDHTPASGFHILDGFLFKGDVLCI
                              F+I+H SG  NKVAD LSR+ S+L  L  EI+  + + E Y +D +F   W KC +  P++ FH+ +G+LFKGD LCI
Subjt:  ----------------------FVIKHTSGITNKVADTLSRKESILTILKGEIIAMDHLPETYADDKDFGTTWKKCIDHTPASGFHILDGFLFKGDVLCI

Query:  PHSSLREALIREAHAGGLAGHLGRDKTLQILETRYFWSQLKRDVTNFVARY---------RYSKMAHFLPCKKTLDAVYIANLFFKEIVRLHGIPKSIVS
        P SSLRE LIR+ H GGL+GHLGRDKT+  LE RYFW  L+RD    V R          R+SKM HF+ CKKT DA  IA LFFKE+VRLHG+PKSI S
Subjt:  PHSSLREALIREAHAGGLAGHLGRDKTLQILETRYFWSQLKRDVTNFVARY---------RYSKMAHFLPCKKTLDAVYIANLFFKEIVRLHGIPKSIVS

Query:  DRDVKFLGHFWKTLWKKFDTKLKYSTTSHPQTDGQTEVTNRTLGNLL----------------------------STEAEEMATRITELHNQVQDHIQ--
        DRD KFL HFW TLW+ F T L  S+T+HPQTDGQTEVTNRTLGN++                            +T     +   T +   V D ++  
Subjt:  DRDVKFLGHFWKTLWKKFDTKLKYSTTSHPQTDGQTEVTNRTLGNLL----------------------------STEAEEMATRITELHNQVQDHIQ--

Query:  -------------------KQTAKYKEAADRKKYKEAADRKKRFKEFEVGDLVMIHLRKSRLPAGTYNKLKKRKLGPFPILERL
                           K+  K K  A  KK K AAD+++RFK F+ GD VM+ LRK R P GTYNKLK RK GPF +L ++
Subjt:  -------------------KQTAKYKEAADRKKYKEAADRKKRFKEFEVGDLVMIHLRKSRLPAGTYNKLKKRKLGPFPILERL

PKU86669.1 RNA-directed DNA polymerase [Dendrobium catenatum]4.5e-23940.1Show/hide
Query:  MKERFLPMNFEQILYDQYQNCKQGTRSVMEYTEEFHRLGARTNLGESQQYLVARYKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQITNRFKRTYSRR
        M+  FLP +FEQ+LY QYQ C+Q  R+V EYTEEF+RL AR NL ES+  LVARY GGL+  ++++L L  +  L++A++ A   E Q+  ++K + SRR
Subjt:  MKERFLPMNFEQILYDQYQNCKQGTRSVMEYTEEFHRLGARTNLGESQQYLVARYKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQITNRFKRTYSRR

Query:  A-TGEQTNPFSKTTSGERLPQQASSAISKGKQIDNGSNKDVKNQENPKRST--NTYSRPTMGKCFRCNQTGHLSNECPQRRTLTIADDVEYVEERDGTES
          T   T+    T   ++ P  ++S  +   Q       +VK+   PK     N Y++P   KCFRC Q GH SNECP R  L +A+  E + + +  E 
Subjt:  A-TGEQTNPFSKTTSGERLPQQASSAISKGKQIDNGSNKDVKNQENPKRST--NTYSRPTMGKCFRCNQTGHLSNECPQRRTLTIADDVEYVEERDGTES

Query:  -EENTDFLEPYEGEPVSLVVQRLLLAPKKDPSYQRHALFKTRCTINGKICNVIIDSGSTENIVASKLVTSLNLPLHPHSAPYKVSWIKKGGEAQVTHTST
         EE  + L   EG PV  V+++LLLAP++    QRH+LF+T+CTIN K+C+++IDSG TENI++   V +L +    +  P+K+SW+KKG +  V+    
Subjt:  -EENTDFLEPYEGEPVSLVVQRLLLAPKKDPSYQRHALFKTRCTINGKICNVIIDSGSTENIVASKLVTSLNLPLHPHSAPYKVSWIKKGGEAQVTHTST

Query:  IPLSIGASYKDQIICDVLDMDACHILLGRLWQYDVQAVHKGRDNTYEFHWMGKKIQHIIGFVV----------------KHFDTSISDH-----------
        +  SIG  Y  Q+ CDVL+MD CH++LGR WQYD  AVH+GR N+Y   W G K++ +   V+                + F   ISD+           
Subjt:  IPLSIGASYKDQIICDVLDMDACHILLGRLWQYDVQAVHKGRDNTYEFHWMGKKIQHIIGFVV----------------KHFDTSISDH-----------

Query:  -----VIPN-EVQKLLQQ--HTTIVDNPTKLPPLRDIQHSIDLIPGATLPNLPHYKMSPTEYQILHEQIQELLDKGHIQPSLSPCAVPALLTPKNDGTWR
               P+ EVQ LL      T VD P  LPP+R IQH I+L+PGA LPNLPHYKMSP E+ IL + + ELL+K  IQPSLS CAVPALL PK DG WR
Subjt:  -----VIPN-EVQKLLQQ--HTTIVDNPTKLPPLRDIQHSIDLIPGATLPNLPHYKMSPTEYQILHEQIQELLDKGHIQPSLSPCAVPALLTPKNDGTWR

Query:  MCVDSRAINKITVKYRFPIPRIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQ------------
        MC+DSRAIN+ITVKYRFPIPRI +LLD+L GATIFSK+DL+SGYHQIRIRPGDEWKTAFKT +GLFEW VMPFGL NAPSTFMRLM++            
Subjt:  MCVDSRAINKITVKYRFPIPRIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQ------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------GKFLWGLEQQKSFGVIKEKLASSPVLALPNFNTPFEVAVDASGIGVGAVLSQNSHPIDFFSEKLSPSRQNWST------------------
                   F W   QQKSF  IK  L+S+PVLA P+F+ PF+V  DAS IGVGAV++Q+  PI++FSEKLS +RQ WS                   
Subjt:  ---------GKFLWGLEQQKSFGVIKEKLASSPVLALPNFNTPFEVAVDASGIGVGAVLSQNSHPIDFFSEKLSPSRQNWST------------------

Query:  -------------------------------------FDFVIKHTSGITNKVADTLSRKESILTILKGEIIAMDHLPETYADDKDFGTTWKKCIDHTPAS
                                             F FVIKH +G  N+VAD LSR  ++L  L+ E+  +  L E YADDKDF   WK C +  P  
Subjt:  -------------------------------------FDFVIKHTSGITNKVADTLSRKESILTILKGEIIAMDHLPETYADDKDFGTTWKKCIDHTPAS

Query:  GFHILDGFLFKGDVLCIPHSSLREALIREAHAGGLAGHLGRDKTLQILETRYFWSQLKRDVTNFVARY--------------------------------
         F +  GFLFKG+ LCIP SS R  LI+E H+GGLA H GRD+TL++L+ R++W  L+RDV  FV +                                 
Subjt:  GFHILDGFLFKGDVLCIPHSSLREALIREAHAGGLAGHLGRDKTLQILETRYFWSQLKRDVTNFVARY--------------------------------

Query:  --------------------RYSKMAHFLPCKKTLDAVYIANLFFKEIVRLHGIPKSIVSDRDVKFLGHFWKTLWKKFDTKLKYSTTSHPQTDGQTEVTN
                            R+SKM HF+PCKKT DAV IANLFFKEIVRLHG+P+S+ SDRDVKF+ HFW+ LW++  T+LK+S+  HPQTDGQTEV N
Subjt:  --------------------RYSKMAHFLPCKKTLDAVYIANLFFKEIVRLHGIPKSIVSDRDVKFLGHFWKTLWKKFDTKLKYSTTSHPQTDGQTEVTN

Query:  RTLGNLLSTEAEEMATRITELHNQVQ---DHIQKQTAKY--------------------------------KEAAD------------RKKYKEAADRKK
        RTLGN+L    ++   R  EL  Q +   + +  ++ K+                                 + AD              KYK AAD  +
Subjt:  RTLGNLLSTEAEEMATRITELHNQVQ---DHIQKQTAKY--------------------------------KEAAD------------RKKYKEAADRKK

Query:  RFKEFEVGDLVMIHLRKSRLPAGTYNKLKKRKLGPFPILERL
        R K F  G+LVM+ +R+ R P GTY+KL  RK+GPFPI  ++
Subjt:  RFKEFEVGDLVMIHLRKSRLPAGTYNKLKKRKLGPFPILERL

PWA81295.1 transposon Ty3-I Gag-Pol polyprotein [Artemisia annua]9.7e-24239.78Show/hide
Query:  MKERFLPMNFEQILYDQYQNCKQGTRSVMEYTEEFHRLGARTNLGESQQYLVARYKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQITNRFKRTYSRR
        +K RFLP + EQILY QY NC QG R+V EYT EF RL AR NL E+ +   ARY  GL + I+E+LSL  I  +++A + A   E   T        RR
Subjt:  MKERFLPMNFEQILYDQYQNCKQGTRSVMEYTEEFHRLGARTNLGESQQYLVARYKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQITNRFKRTYSRR

Query:  ATGEQTNPFSKTTSGER--LPQQASSAISKGKQIDNGSNKDVKNQENPKRSTNTYSRPTMGKCFRCNQTGHLSNECPQRRTL---TIADDVEYVEERDGT
         T E T+ +    +  +  LP   +S  +  K   +GS  D   +  P +S N Y++P   KCFRC + GH SN CP+R TL    + +D  + +E    
Subjt:  ATGEQTNPFSKTTSGER--LPQQASSAISKGKQIDNGSNKDVKNQENPKRSTNTYSRPTMGKCFRCNQTGHLSNECPQRRTL---TIADDVEYVEERDGT

Query:  ESEENTDFLEPYEG--EPVSLVVQRLLLAPKKDPSYQRHALFKTRCTINGKICNVIIDSGSTENIVASKLVTSLNLPLHPHSAPYKVSWIKKGGEAQVTH
        + E++ ++ EP +G  E ++ V+QR L +PK   S QR+ +F+T+C +  KIC++IID GS EN+V+  LV +  LP  PH  PY++ WIKKG   +VT 
Subjt:  ESEENTDFLEPYEG--EPVSLVVQRLLLAPKKDPSYQRHALFKTRCTINGKICNVIIDSGSTENIVASKLVTSLNLPLHPHSAPYKVSWIKKGGEAQVTH

Query:  TSTIPLSIGASYKDQIICDVLDMDACHILLGRLWQYDVQAVHKGRDNTYEFHWMGKKIQHI-IG----------------------FVVKHFDTSIS---
           +PL+IG  Y + + CDV+DM+ACH+LLGR WQ+DV A H+G+ N Y F W GK I  + +G                      F  +  +T +S   
Subjt:  TSTIPLSIGASYKDQIICDVLDMDACHILLGRLWQYDVQAVHKGRDNTYEFHWMGKKIQHI-IG----------------------FVVKHFDTSIS---

Query:  ---------DHVIPNEVQKLLQQHTTIV--DNPTKLPPLRDIQHSIDLIPGATLPNLPHYKMSPTEYQILHEQIQELLDKGHIQPSLSPCAVPALLTPKN
                 D+VIP  ++ +L++ + +V  D P  LPPLR+IQH IDL+PGA+LPNLPHY+MSP E  IL E+++ELL KGHIQ S+SPCAVPALLTPK 
Subjt:  ---------DHVIPNEVQKLLQQHTTIV--DNPTKLPPLRDIQHSIDLIPGATLPNLPHYKMSPTEYQILHEQIQELLDKGHIQPSLSPCAVPALLTPKN

Query:  DGTWRMCVDSRAINKITVKYRFPIPRIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQ-------
        DG+WRMCVDSRAINKITV+YRFPIPR+ DLLDQL GA +FSKIDL+SGYHQIRI+PGDEWKTAFKT +GL+EWLVMPFGLSNAPSTFMRLM Q       
Subjt:  DGTWRMCVDSRAINKITVKYRFPIPRIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQ-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------GKFLWGLEQQKSFGVIKEKLASSPVLALPNFNTPFEVAVDASGIGVGAVLSQNSHPIDFFSEKLSPSRQNWST-------------
                      G F W  E ++SF +IKE+L ++PVL+LPNF+  FE+  DA G G+GAVLSQ   P+ F SEKL+ +RQ WST             
Subjt:  --------------GKFLWGLEQQKSFGVIKEKLASSPVLALPNFNTPFEVAVDASGIGVGAVLSQNSHPIDFFSEKLSPSRQNWST-------------

Query:  ------------------------------------------FDFVIKHTSGITNKVADTLSRKESILTILKGEIIAMDHLPETYADDKDFGTTWKKCID
                                                  F++VIKH SG +NKVAD LSRK ++L  +  +++  + +   Y +D+DF +TW++   
Subjt:  ------------------------------------------FDFVIKHTSGITNKVADTLSRKESILTILKGEIIAMDHLPETYADDKDFGTTWKKCID

Query:  HTPASGFHILDGFLFKGDVLCIPHSSLREALIREAHAGGLAGHLGRDKTLQILETRYFWSQLKRDVTNFVARY---------------------------
              F +LDG+LFKG+ LCIP +SLR  LI+E HAGGL+ HLGRDKT+  +E+R++W QLKRDV +FV R                            
Subjt:  HTPASGFHILDGFLFKGDVLCIPHSSLREALIREAHAGGLAGHLGRDKTLQILETRYFWSQLKRDVTNFVARY---------------------------

Query:  -------------------------RYSKMAHFLPCKKTLDAVYIANLFFKEIVRLHGIPKSIVSDRDVKFLGHFWKTLWKKFDTKLKYSTTSHPQTDGQ
                                 R+SKMAHF+PCKKT DA +IA LFF+E+VRLHG+PKSI SDRD KFL HFW TLW++  T L +S+T+HPQTDGQ
Subjt:  -------------------------RYSKMAHFLPCKKTLDAVYIANLFFKEIVRLHGIPKSIVSDRDVKFLGHFWKTLWKKFDTKLKYSTTSHPQTDGQ

Query:  TEVTNRTLGNLL-------------STEAEEMA----------------------TRITEL----------HNQVQDHIQ--KQTAKYKEAADRKKYKEA
        TEV NRTLGN++             S    E A                       ++ +L           N++ + +Q   +  + K +    KYK A
Subjt:  TEVTNRTLGNLL-------------STEAEEMA----------------------TRITEL----------HNQVQDHIQ--KQTAKYKEAADRKKYKEA

Query:  ADRKKRFKEFEVGDLVMIHLRKSRLPAGTYNKLKKRKLGPFPILERL
        AD+ +R K F+VGD VM+ LRK R P GTY+KL+ +K GP+ IL ++
Subjt:  ADRKKRFKEFEVGDLVMIHLRKSRLPAGTYNKLKKRKLGPFPILERL

TXG62763.1 hypothetical protein EZV62_009757 [Acer yangbiense]3.8e-24639.92Show/hide
Query:  MKERFLPMNFEQILYDQYQNCKQGTRSVMEYTEEFHRLGARTNLGESQQYLVARYKGGLRNDIKEQLSLQPIGYLNEAISAA--AVIEEQITNRFKRTYS
        M  RFLP ++EQ ++  YQNC QG ++V +YTEE+ RL  R NL E++   V+RY GGL++ I++++ LQ +  ++EA + A  A + E+ +NRF  ++ 
Subjt:  MKERFLPMNFEQILYDQYQNCKQGTRSVMEYTEEFHRLGARTNLGESQQYLVARYKGGLRNDIKEQLSLQPIGYLNEAISAA--AVIEEQITNRFKRTYS

Query:  RRATGEQTNPFS-----KTTSGERLPQQASSAISKGKQIDNGSNK----DVKNQENPKRSTNTYSRPTMGKCFRCNQTGHLSNECPQRRTLTIA------
        ++  GE +N  +       + G+   +  +  + +G   +N S          +E P+ + N YSRP + KC+RC Q GH SNECP R+ +         
Subjt:  RRATGEQTNPFS-----KTTSGERLPQQASSAISKGKQIDNGSNK----DVKNQENPKRSTNTYSRPTMGKCFRCNQTGHLSNECPQRRTLTIA------

Query:  -DDVEYVEERDGTESEENTDFLEPYEGEPVSLVVQRLLLAPKKDPSYQRHALFKTRCTINGKICNVIIDSGSTENIVASKLVTSLNLPLHPHSAPYKVSW
         D+  Y EE +  E  E  D  E  +GEPV+ V+QR++ A K D   QR+++FKT C+I GK+C++I+DSGS EN V+ KL+  L L    H  PY + W
Subjt:  -DDVEYVEERDGTESEENTDFLEPYEGEPVSLVVQRLLLAPKKDPSYQRHALFKTRCTINGKICNVIIDSGSTENIVASKLVTSLNLPLHPHSAPYKVSW

Query:  IKKGGEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRLWQYDVQAVHKGRDNTYEFHWMGKKIQHI-----IGFVVK---------HFDTSISD
        I+KG +A VT    +P+SIG  Y++++ CDV+DMDA H+LLGR WQ+DV   ++GRDN   F W GK+I  +     +G   K            TS+SD
Subjt:  IKKGGEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRLWQYDVQAVHKGRDNTYEFHWMGKKIQHI-----IGFVVK---------HFDTSISD

Query:  ----------------------------HVIPNEVQKLLQQHTTIV--DNPTKLPPLRDIQHSIDLIPGATLPNLPHYKMSPTEYQILHEQIQELLDKGH
                                     V+P +V  LL + + +V  D P  LPPLRDIQH IDL+PGA+LPNLPHY+MSP E QIL  +++EL+ KG 
Subjt:  ----------------------------HVIPNEVQKLLQQHTTIV--DNPTKLPPLRDIQHSIDLIPGATLPNLPHYKMSPTEYQILHEQIQELLDKGH

Query:  IQPSLSPCAVPALLTPKNDGTWRMCVDSRAINKITVKYRFPIPRIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSN
        I+ S+SPCAVP LL PK DG+WRMCVDSRAINKITV+Y FPIPR+ D+LD L G+ +FSKIDL+SGYHQIR++PGDEWKTAFKT +GL+EWLVMPFGLSN
Subjt:  IQPSLSPCAVPALLTPKNDGTWRMCVDSRAINKITVKYRFPIPRIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSN

Query:  APSTFMRLMNQ-----------------------------------------------------------------------------------------
        APSTFMRLMNQ                                                                                         
Subjt:  APSTFMRLMNQ-----------------------------------------------------------------------------------------

Query:  -GKFLWGLEQQKSFGVIKEKLASSPVLALPNFNTPFEVAVDASGIGVGAVLSQNSHPIDFFSEKLSPSRQNWSTFD------------------------
         GKF W      +F VIKEKL ++PVLALP+F   FEV  DASG+G+GAVLSQ   P+ FFSEKLS +R+ WST+D                        
Subjt:  -GKFLWGLEQQKSFGVIKEKLASSPVLALPNFNTPFEVAVDASGIGVGAVLSQNSHPIDFFSEKLSPSRQNWSTFD------------------------

Query:  -------------------------------FVIKHTSGITNKVADTLSRKESILTILKGEIIAMDHLPETYADDKDFGTTWKKCIDHTPASGFHILDGF
                                       FV+KH SG+ NKVAD LSR+ S+L  ++ EII  + L E Y+DD+DFG  W+ C+       FH+ +G+
Subjt:  -------------------------------FVIKHTSGITNKVADTLSRKESILTILKGEIIAMDHLPETYADDKDFGTTWKKCIDHTPASGFHILDGF

Query:  LFKGDVLCIPHSSLREALIREAHAGGLAGHLGRDKTLQILETRYFWSQLKRDVTNFVARY----------------------------------------
        LF G+ LCIP SSLRE LIRE H GGL GHLGRDKT+  +  RY+W QLKRDV NFV +                                         
Subjt:  LFKGDVLCIPHSSLREALIREAHAGGLAGHLGRDKTLQILETRYFWSQLKRDVTNFVARY----------------------------------------

Query:  ------------RYSKMAHFLPCKKTLDAVYIANLFFKEIVRLHGIPKSIVSDRDVKFLGHFWKTLWKKFDTKLKYSTTSHPQTDGQTEVTNRTLGNL--
                    R+SKM HF+PC+KT DA ++A LFF+E+VRLHG+P+SI SDRD KFL HFW TLW++ DT LK+S+T+HPQTDGQTE  NRTLGNL  
Subjt:  ------------RYSKMAHFLPCKKTLDAVYIANLFFKEIVRLHGIPKSIVSDRDVKFLGHFWKTLWKKFDTKLKYSTTSHPQTDGQTEVTNRTLGNL--

Query:  --------------------------------------------------------LSTEAEEMATRITELHNQVQDHIQKQTAKYKEAADRKKYKEAAD
                                                                +S  AE MA ++ ++  +V+  ++++ AKYK AAD K+      
Subjt:  --------------------------------------------------------LSTEAEEMATRITELHNQVQDHIQKQTAKYKEAADRKKYKEAAD

Query:  RKKRFKEFEVGDLVMIHLRKSRLPAGTYNKLKKRKLGPFPILERL
        R+K F E   GD VM+ LRK R P G+YNKLK RK GP+ +++++
Subjt:  RKKRFKEFEVGDLVMIHLRKSRLPAGTYNKLKKRKLGPFPILERL

XP_034219000.1 uncharacterized protein LOC117630375 [Prunus dulcis]2.9e-23839.65Show/hide
Query:  MKERFLPMNFEQILYDQYQNCKQGTRSVMEYTEEFHRLGARTNLGESQQYLVARYKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQITNRFKRTYS-R
        M ERFLP ++EQILY  Y  C QG RSV EYTEEF RL  R +L E+    VARY  GL+  I+E++ +Q I  L EAI+ A  ++ ++  + KR  + R
Subjt:  MKERFLPMNFEQILYDQYQNCKQGTRSVMEYTEEFHRLGARTNLGESQQYLVARYKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQITNRFKRTYS-R

Query:  RATGEQTNPFSKTTSGE----RLPQQASSAIS-----KGKQIDNGSNKDVKNQENPKRSTNTYSRPTMGKCFRCNQTGHLSNECPQRRTLTIADDVEYVE
        R T E +   +  +SG     ++ QQ           + K  +  S++     ++  +S N Y++P    C+RC + GH SN CP+R+     ++V+  E
Subjt:  RATGEQTNPFSKTTSGE----RLPQQASSAIS-----KGKQIDNGSNKDVKNQENPKRSTNTYSRPTMGKCFRCNQTGHLSNECPQRRTLTIADDVEYVE

Query:  ERD--GTESEENTDFLEPYEGEPVSLVVQRLLLAPKKDPSYQRHALFKTRCTINGKICNVIIDSGSTENIVASKLVTSLNLPLHPHSAPYKVSWIKKGGE
        E+D  G +     +F      E ++LV+QR+LLAPK++   QRH++F++ C+I  K+C+VI+D+GS EN V+ KLV  L L   PH +PY + W+KKG  
Subjt:  ERD--GTESEENTDFLEPYEGEPVSLVVQRLLLAPKKDPSYQRHALFKTRCTINGKICNVIIDSGSTENIVASKLVTSLNLPLHPHSAPYKVSWIKKGGE

Query:  AQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRLWQYDVQAVHKGRDNTYEFHWMGKKI-------------------------------------
         +V  T  +PLSIG  Y D ++CDV+DMDACHILLGR WQ+DV A +KGRDN   F W  +KI                                     
Subjt:  AQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRLWQYDVQAVHKGRDNTYEFHWMGKKI-------------------------------------

Query:  -QHIIGFVVKH-FDTSISDHVIPNEVQKLLQQHTTIVDN--PTKLPPLRDIQHSIDLIPGATLPNLPHYKMSPTEYQILHEQIQELLDKGHIQPSLSPCA
         ++    V+K        +  IP +VQ++L Q   ++    P +LPP+RDIQH IDL+PGA+LPNLPHY+ SP E  IL EQI+ELL KG I+ SLSPCA
Subjt:  -QHIIGFVVKH-FDTSISDHVIPNEVQKLLQQHTTIVDN--PTKLPPLRDIQHSIDLIPGATLPNLPHYKMSPTEYQILHEQIQELLDKGHIQPSLSPCA

Query:  VPALLTPKNDGTWRMCVDSRAINKITVKYRFPIPRIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLM
        VP LL    D TWRMCVDSRAINKITVKYRFPIPR+ D+LD L G+ +FSKIDL+SGYHQIRIRPGDEWKTAFK+ +GLFEWLVMPFGLSNAPSTFMRLM
Subjt:  VPALLTPKNDGTWRMCVDSRAINKITVKYRFPIPRIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLM

Query:  NQ--------------------------------------------------------------------------------------------------
        NQ                                                                                                  
Subjt:  NQ--------------------------------------------------------------------------------------------------

Query:  -----------------------GKFLWGLEQQKSFGVIKEKLASSPVLALPNFNTPFEVAVDASGIGVGAVLSQNSHPIDFFSEKLSPSRQNWSTFD--
                               G+F WG EQ++SF  IKEKL ++PVLALPNF   FEV  DASG+GVGAVLSQ+  P+ FFS+KLS +RQ WST+D  
Subjt:  -----------------------GKFLWGLEQQKSFGVIKEKLASSPVLALPNFNTPFEVAVDASGIGVGAVLSQNSHPIDFFSEKLSPSRQNWSTFD--

Query:  -----------------------------------------------------FVIKHTSGITNKVADTLSRKESILTILKGEIIAMDHLPETYADDKDF
                                                             FVIKHTSG TN+VAD LSR+ S+L  L  E++  + L E Y  D DF
Subjt:  -----------------------------------------------------FVIKHTSGITNKVADTLSRKESILTILKGEIIAMDHLPETYADDKDF

Query:  GTTWKKCIDHTPASGFHILDGFLFKGDVLCIPHSSLREALIREAHAGGLAGHLGRDKTLQILETRYFWSQLKRDV-------------------------
           W KC +    + + + +G+LFKG+ LCIP SSLRE LIR+ H GGL+GHLGRDKT+  +E R++W QLKRDV                         
Subjt:  GTTWKKCIDHTPASGFHILDGFLFKGDVLCIPHSSLREALIREAHAGGLAGHLGRDKTLQILETRYFWSQLKRDV-------------------------

Query:  ---------------------------TNFVARYRYSKMAHFLPCKKTLDAVYIANLFFKEIVRLHGIPKSIVSDRDVKFLGHFWKTLWKKFDTKLKYST
                                   + FV   R+SKMAHF+ CKKT DA  IA LFF+E+VRLHG+P SI SDRD KFL HFW TLW+ F T L  S+
Subjt:  ---------------------------TNFVARYRYSKMAHFLPCKKTLDAVYIANLFFKEIVRLHGIPKSIVSDRDVKFLGHFWKTLWKKFDTKLKYST

Query:  TSHPQTDGQTEVTNRTLGNLL----STEAEEMA-------------------TRITELHNQVQD--------HIQKQT----------------------
        T+HPQTDGQTEVTNRTLGN+     ST + ++                    +R T   + + +         I++Q                       
Subjt:  TSHPQTDGQTEVTNRTLGNLL----STEAEEMA-------------------TRITELHNQVQD--------HIQKQT----------------------

Query:  ---------AKYKEAADRKKYKEAADRKKRFKEFEVGDLVMIHLRKSRLPAGTYNKLKKRKLGPFPILERL
                  K K      KYK AA R +R K F+ GD VMI LRK R P GTY+KLK +K GP+ +L+R+
Subjt:  ---------AKYKEAADRKKYKEAADRKKRFKEFEVGDLVMIHLRKSRLPAGTYNKLKKRKLGPFPILERL

TrEMBL top hitse value%identityAlignment
A0A5B7BER3 Uncharacterized protein3.1e-26241.93Show/hide
Query:  MKERFLPMNFEQILYDQYQNCKQGTRSVMEYTEEFHRLGARTNLGESQQYLVARYKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQITNRFKRTYSRR
        ++ERFLP+++EQ+LY QYQNC+QG RSV EY++EF+ L +R NL E++   VARY GGLR  I++QL+L+ I  LNEA S A  +E Q +    R   R 
Subjt:  MKERFLPMNFEQILYDQYQNCKQGTRSVMEYTEEFHRLGARTNLGESQQYLVARYKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQITNRFKRTYSRR

Query:  ATGEQTNPFSKTTSGERLPQQASSAISKGKQIDNGSNKDVKNQENP----KRSTNTYSRPTMGKCFRCNQTGHLSNECPQRRTLTI-------ADDVEYV
            ++ P S      R  Q         K          KNQ  P    ++STN Y+RP  GKCFRC Q GH SNECP RR + +       + D E  
Subjt:  ATGEQTNPFSKTTSGERLPQQASSAISKGKQIDNGSNKDVKNQENP----KRSTNTYSRPTMGKCFRCNQTGHLSNECPQRRTLTI-------ADDVEYV

Query:  EERDGTESEENTDFLEPYEGEPVSLVVQRLLLAPKKDPSYQRHALFKTRCTINGKICNVIIDSGSTENIVASKLVTSLNLPLHPHSAPYKVSWIKKGGEA
        EE +  +     +  E  EGE VS VVQRLLL PK++   QRH +F+TRCTIN K+C+VIIDSGS+ENIV+  LV +L L    H  PYK+ WIKKG E 
Subjt:  EERDGTESEENTDFLEPYEGEPVSLVVQRLLLAPKKDPSYQRHALFKTRCTINGKICNVIIDSGSTENIVASKLVTSLNLPLHPHSAPYKVSWIKKGGEA

Query:  QVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRLWQYDVQAVHKGRDNTYEFHWMGKKI-------------------------------------Q
        +VT    +P SIG  YKD++ CD++DMDACH+LLGR WQ+DV A HKG+DNTY F W  KK+                                      
Subjt:  QVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRLWQYDVQAVHKGRDNTYEFHWMGKKI-------------------------------------Q

Query:  HIIGFVVKHFDTSISDHVIPNEVQKLLQQHTTIVDN--PTKLPPLRDIQHSIDLIPGATLPNLPHYKMSPTEYQILHEQIQELLDKGHIQPSLSPCAVPA
         II  +VK   T      +P  +Q LL +   I  +  P  LPP+RDIQH IDL+PGA+LPNLPHY+MSP E +IL +Q+++L++KG IQ S+SPCAVPA
Subjt:  HIIGFVVKHFDTSISDHVIPNEVQKLLQQHTTIVDN--PTKLPPLRDIQHSIDLIPGATLPNLPHYKMSPTEYQILHEQIQELLDKGHIQPSLSPCAVPA

Query:  LLTPKNDGTWRMCVDSRAINKITVKYRFPIPRIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQ-
        LLTPK DG+WRMCVDSRAINKITVKYRFPIPR+ D+LD L G+ IFSKIDL+SGYHQIRIRPGDEWKTAFKT EGL+EWLVMPFGLSNAPSTFMR+MNQ 
Subjt:  LLTPKNDGTWRMCVDSRAINKITVKYRFPIPRIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQ-

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------GKFLWGLEQQKSFGVIKEKLASSPVLALPNFNTPFEVAVDASGIGVGAVLSQNSHPIDFFSEKLSPSRQNWST-------
                            GKF W  +Q+ SF +IKEKL+++PVLALP+F   F+V  DAS  G+GAVLSQ   P++FFSEKL+ +RQ W+T       
Subjt:  --------------------GKFLWGLEQQKSFGVIKEKLASSPVLALPNFNTPFEVAVDASGIGVGAVLSQNSHPIDFFSEKLSPSRQNWST-------

Query:  ------------------------------------------------FDFVIKHTSGITNKVADTLSRKESILTILKGEIIAMDHLPETYADDKDFGTT
                                                        F FV+KH +G  NKVAD LSR+ ++L ++  EI + + L E Y +D+DF   
Subjt:  ------------------------------------------------FDFVIKHTSGITNKVADTLSRKESILTILKGEIIAMDHLPETYADDKDFGTT

Query:  WKKCIDHTPASGFHILDGFLFKGDVLCIPHSSLREALIREAHAGGLAGHLGRDKTLQILETRYFWSQLKRDVTNFVARY---------------------
        W KC     ++ FHI DG+LFKG+ LCIP +SLRE ++R+ H+GGL GHLGRDKT+ ++E RY+W QLKRDV  FV +                      
Subjt:  WKKCIDHTPASGFHILDGFLFKGDVLCIPHSSLREALIREAHAGGLAGHLGRDKTLQILETRYFWSQLKRDVTNFVARY---------------------

Query:  -------------------------------RYSKMAHFLPCKKTLDAVYIANLFFKEIVRLHGIPKSIVSDRDVKFLGHFWKTLWKKFDTKLKYSTTSH
                                       R+SKMAHF+PCKKT DA ++ANLFF+EIVRLHG+PKSI SDRDVKFL HFW+TLW+KFDT L+YS+T+H
Subjt:  -------------------------------RYSKMAHFLPCKKTLDAVYIANLFFKEIVRLHGIPKSIVSDRDVKFLGHFWKTLWKKFDTKLKYSTTSH

Query:  PQTDGQTEVTNRTLGNLL----------------------------------------------------------STEAEEMATRITELHNQVQDHIQK
        PQTDGQTEVTNRTLGNL+                                                          S  AE  A R   +  +V+ +++K
Subjt:  PQTDGQTEVTNRTLGNLL----------------------------------------------------------STEAEEMATRITELHNQVQDHIQK

Query:  QTAKYKEAADRKKYKEAADRKKRFKEFEVGDLVMIHLRKSRLPAGTYNKLKKRKLGPFPILERL
                     YK AAD+ +R K F  GDLVM+ LRK+R P GTYNKLK RK GPF +  ++
Subjt:  QTAKYKEAADRKKYKEAADRKKRFKEFEVGDLVMIHLRKSRLPAGTYNKLKKRKLGPFPILERL

A0A5C7HZV4 Reverse transcriptase1.9e-24639.92Show/hide
Query:  MKERFLPMNFEQILYDQYQNCKQGTRSVMEYTEEFHRLGARTNLGESQQYLVARYKGGLRNDIKEQLSLQPIGYLNEAISAA--AVIEEQITNRFKRTYS
        M  RFLP ++EQ ++  YQNC QG ++V +YTEE+ RL  R NL E++   V+RY GGL++ I++++ LQ +  ++EA + A  A + E+ +NRF  ++ 
Subjt:  MKERFLPMNFEQILYDQYQNCKQGTRSVMEYTEEFHRLGARTNLGESQQYLVARYKGGLRNDIKEQLSLQPIGYLNEAISAA--AVIEEQITNRFKRTYS

Query:  RRATGEQTNPFS-----KTTSGERLPQQASSAISKGKQIDNGSNK----DVKNQENPKRSTNTYSRPTMGKCFRCNQTGHLSNECPQRRTLTIA------
        ++  GE +N  +       + G+   +  +  + +G   +N S          +E P+ + N YSRP + KC+RC Q GH SNECP R+ +         
Subjt:  RRATGEQTNPFS-----KTTSGERLPQQASSAISKGKQIDNGSNK----DVKNQENPKRSTNTYSRPTMGKCFRCNQTGHLSNECPQRRTLTIA------

Query:  -DDVEYVEERDGTESEENTDFLEPYEGEPVSLVVQRLLLAPKKDPSYQRHALFKTRCTINGKICNVIIDSGSTENIVASKLVTSLNLPLHPHSAPYKVSW
         D+  Y EE +  E  E  D  E  +GEPV+ V+QR++ A K D   QR+++FKT C+I GK+C++I+DSGS EN V+ KL+  L L    H  PY + W
Subjt:  -DDVEYVEERDGTESEENTDFLEPYEGEPVSLVVQRLLLAPKKDPSYQRHALFKTRCTINGKICNVIIDSGSTENIVASKLVTSLNLPLHPHSAPYKVSW

Query:  IKKGGEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRLWQYDVQAVHKGRDNTYEFHWMGKKIQHI-----IGFVVK---------HFDTSISD
        I+KG +A VT    +P+SIG  Y++++ CDV+DMDA H+LLGR WQ+DV   ++GRDN   F W GK+I  +     +G   K            TS+SD
Subjt:  IKKGGEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRLWQYDVQAVHKGRDNTYEFHWMGKKIQHI-----IGFVVK---------HFDTSISD

Query:  ----------------------------HVIPNEVQKLLQQHTTIV--DNPTKLPPLRDIQHSIDLIPGATLPNLPHYKMSPTEYQILHEQIQELLDKGH
                                     V+P +V  LL + + +V  D P  LPPLRDIQH IDL+PGA+LPNLPHY+MSP E QIL  +++EL+ KG 
Subjt:  ----------------------------HVIPNEVQKLLQQHTTIV--DNPTKLPPLRDIQHSIDLIPGATLPNLPHYKMSPTEYQILHEQIQELLDKGH

Query:  IQPSLSPCAVPALLTPKNDGTWRMCVDSRAINKITVKYRFPIPRIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSN
        I+ S+SPCAVP LL PK DG+WRMCVDSRAINKITV+Y FPIPR+ D+LD L G+ +FSKIDL+SGYHQIR++PGDEWKTAFKT +GL+EWLVMPFGLSN
Subjt:  IQPSLSPCAVPALLTPKNDGTWRMCVDSRAINKITVKYRFPIPRIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSN

Query:  APSTFMRLMNQ-----------------------------------------------------------------------------------------
        APSTFMRLMNQ                                                                                         
Subjt:  APSTFMRLMNQ-----------------------------------------------------------------------------------------

Query:  -GKFLWGLEQQKSFGVIKEKLASSPVLALPNFNTPFEVAVDASGIGVGAVLSQNSHPIDFFSEKLSPSRQNWSTFD------------------------
         GKF W      +F VIKEKL ++PVLALP+F   FEV  DASG+G+GAVLSQ   P+ FFSEKLS +R+ WST+D                        
Subjt:  -GKFLWGLEQQKSFGVIKEKLASSPVLALPNFNTPFEVAVDASGIGVGAVLSQNSHPIDFFSEKLSPSRQNWSTFD------------------------

Query:  -------------------------------FVIKHTSGITNKVADTLSRKESILTILKGEIIAMDHLPETYADDKDFGTTWKKCIDHTPASGFHILDGF
                                       FV+KH SG+ NKVAD LSR+ S+L  ++ EII  + L E Y+DD+DFG  W+ C+       FH+ +G+
Subjt:  -------------------------------FVIKHTSGITNKVADTLSRKESILTILKGEIIAMDHLPETYADDKDFGTTWKKCIDHTPASGFHILDGF

Query:  LFKGDVLCIPHSSLREALIREAHAGGLAGHLGRDKTLQILETRYFWSQLKRDVTNFVARY----------------------------------------
        LF G+ LCIP SSLRE LIRE H GGL GHLGRDKT+  +  RY+W QLKRDV NFV +                                         
Subjt:  LFKGDVLCIPHSSLREALIREAHAGGLAGHLGRDKTLQILETRYFWSQLKRDVTNFVARY----------------------------------------

Query:  ------------RYSKMAHFLPCKKTLDAVYIANLFFKEIVRLHGIPKSIVSDRDVKFLGHFWKTLWKKFDTKLKYSTTSHPQTDGQTEVTNRTLGNL--
                    R+SKM HF+PC+KT DA ++A LFF+E+VRLHG+P+SI SDRD KFL HFW TLW++ DT LK+S+T+HPQTDGQTE  NRTLGNL  
Subjt:  ------------RYSKMAHFLPCKKTLDAVYIANLFFKEIVRLHGIPKSIVSDRDVKFLGHFWKTLWKKFDTKLKYSTTSHPQTDGQTEVTNRTLGNL--

Query:  --------------------------------------------------------LSTEAEEMATRITELHNQVQDHIQKQTAKYKEAADRKKYKEAAD
                                                                +S  AE MA ++ ++  +V+  ++++ AKYK AAD K+      
Subjt:  --------------------------------------------------------LSTEAEEMATRITELHNQVQDHIQKQTAKYKEAADRKKYKEAAD

Query:  RKKRFKEFEVGDLVMIHLRKSRLPAGTYNKLKKRKLGPFPILERL
        R+K F E   GD VM+ LRK R P G+YNKLK RK GP+ +++++
Subjt:  RKKRFKEFEVGDLVMIHLRKSRLPAGTYNKLKKRKLGPFPILERL

A0A6N2LVR1 Uncharacterized protein3.9e-25240.79Show/hide
Query:  MKERFLPMNFEQILYDQYQNCKQGTRSVMEYTEEFHRLGARTNLGESQQYLVARYKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQITNRFKRTYSRR
        ++ R+LP ++EQIL+ QYQNCKQG R+V  Y EEFHRL +R NL E+    VAR+ GGLR +I++++S+  I  L EAI+ A   E Q+T         R
Subjt:  MKERFLPMNFEQILYDQYQNCKQGTRSVMEYTEEFHRLGARTNLGESQQYLVARYKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQITNRFKRTYSRR

Query:  ATGEQTNPFSKTTS----GERLPQQASSAISKGKQIDNGSNKDVKNQE-----NPKRSTNTYSRPTMGKCFRCNQTGHLSNECPQRRTLTIADDVEYVE-
         T     PF  T +    G+      ++++S   +  + S   V          P+   N YSRPT  KC+RC Q GH SN CP+R  + + +  E  + 
Subjt:  ATGEQTNPFSKTTS----GERLPQQASSAISKGKQIDNGSNKDVKNQE-----NPKRSTNTYSRPTMGKCFRCNQTGHLSNECPQRRTLTIADDVEYVE-

Query:  ERDGTESEENTDFLEPY-----EGEPVS--LVVQRLLLAPKKDPSYQRHALFKTRCTINGKICNVIIDSGSTENIVASKLVTSLNLPLHPHSAPYKVSWI
        E +G E+E++  + E       EGE +S  LVV+R++LAPK +   QR+ +F+TRCT+N K+C+VIIDSGS+ENI++  +V  L L    H APYK+ WI
Subjt:  ERDGTESEENTDFLEPY-----EGEPVS--LVVQRLLLAPKKDPSYQRHALFKTRCTINGKICNVIIDSGSTENIVASKLVTSLNLPLHPHSAPYKVSWI

Query:  KKGGEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRLWQYDVQAVHKGRDNTYEFHWMGKKIQHIIGFVVKHF---------------------
        KKG E +VT T     SIG +Y D+I+CDV++MDACH++LGR WQYDV   +KG+DN Y F   G+K+  I+G + + F                     
Subjt:  KKGGEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRLWQYDVQAVHKGRDNTYEFHWMGKKIQHIIGFVVKHF---------------------

Query:  ----------------DTSISDHVIPNEVQKLLQQHTTIV--DNPTKLPPLRDIQHSIDLIPGATLPNLPHYKMSPTEYQILHEQIQELLDKGHIQPSLS
                        +   +   IP  +Q LL +  TI+  + P  LPP+RDIQH IDLIPGA+LPN PHY+MSP E  IL  Q++EL+ KG +Q S+S
Subjt:  ----------------DTSISDHVIPNEVQKLLQQHTTIV--DNPTKLPPLRDIQHSIDLIPGATLPNLPHYKMSPTEYQILHEQIQELLDKGHIQPSLS

Query:  PCAVPALLTPKNDGTWRMCVDSRAINKITVKYRFPIPRIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFM
        PCAVPALL PK DG+WRMC+DSRAINKIT+KYRFPIPR+ D+LD L G+ IFSKIDL+SGYHQIRIRPGDEWKTAFKT EGL+EWLVMPFGLSNAPSTFM
Subjt:  PCAVPALLTPKNDGTWRMCVDSRAINKITVKYRFPIPRIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFM

Query:  RLMNQ-----------------------------------------------------------------------------------------------
        RLMNQ                                                                                               
Subjt:  RLMNQ-----------------------------------------------------------------------------------------------

Query:  --------------------------GKFLWGLEQQKSFGVIKEKLASSPVLALPNFNTPFEVAVDASGIGVGAVLSQNSHPIDFFSEKLSPSRQNWST-
                                  GKF WG E + SF +IKEKLAS+PVLALP+F+  FEV  DAS IG+GAVLSQ + P+ F+SEKLS +R+ WST 
Subjt:  --------------------------GKFLWGLEQQKSFGVIKEKLASSPVLALPNFNTPFEVAVDASGIGVGAVLSQNSHPIDFFSEKLSPSRQNWST-

Query:  ------------------------------------------------------FDFVIKHTSGITNKVADTLSRKESILTILKGEIIAMDHLPETYADD
                                                              F+F +KH SG  NKVAD LSRK S+LT L+ E+I  + + + YA D
Subjt:  ------------------------------------------------------FDFVIKHTSGITNKVADTLSRKESILTILKGEIIAMDHLPETYADD

Query:  KDFGTTWKKCIDHTPASGFHILDGFLFKGDVLCIPHSSLREALIREAHAGGLAGHLGRDKTLQILETRYFWSQLKRDVTNFVARY---------------
        +DFG TW KC       G H  DG+LF+G+ LCIP SSLRE +I E H GGL GHLGRDKT+ + E RY+W QLKRD+ N V R                
Subjt:  KDFGTTWKKCIDHTPASGFHILDGFLFKGDVLCIPHSSLREALIREAHAGGLAGHLGRDKTLQILETRYFWSQLKRDVTNFVARY---------------

Query:  -------------------------------------RYSKMAHFLPCKKTLDAVYIANLFFKEIVRLHGIPKSIVSDRDVKFLGHFWKTLWKKFDTKLK
                                             R+SKMAHF+ CKKT DAV++ANLFFKE+VRLHG+PKSI SDRD KFL HFW+TLW++FDT L 
Subjt:  -------------------------------------RYSKMAHFLPCKKTLDAVYIANLFFKEIVRLHGIPKSIVSDRDVKFLGHFWKTLWKKFDTKLK

Query:  YSTTSHPQTDGQTEVTNRTLGNL----------------------------------------------------------LSTEAEEMATRITELHNQV
        +S+TSHPQTDGQTEV NRTLGNL                                                          ++  AE MA R+  +  +V
Subjt:  YSTTSHPQTDGQTEVTNRTLGNL----------------------------------------------------------LSTEAEEMATRITELHNQV

Query:  QDHIQKQTAKYKEAADRKKYKEAADRKKRFKEFEVGDLVMIHLRKSRLPAGTYNKLKKRKLGPFPILERL
        + +++         A  +KYK AAD+K+R K F+ GDLVM++LRK R+P GT +KL  +K GP+ IL+++
Subjt:  QDHIQKQTAKYKEAADRKKYKEAADRKKRFKEFEVGDLVMIHLRKSRLPAGTYNKLKKRKLGPFPILERL

M5W531 Reverse transcriptase2.7e-24541.02Show/hide
Query:  MKERFLPMNFEQILYDQYQNCKQGTRSVMEYTEEFHRLGARTNLGESQQYLVARYKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQITNRFKRTYS-R
        M ERFLP ++EQILY  Y  C QGTRSV EYTEEF RL  R +L E+    VARY  GL++ I+E++ +Q I  L EAI+ A  ++ ++  + KR  + R
Subjt:  MKERFLPMNFEQILYDQYQNCKQGTRSVMEYTEEFHRLGARTNLGESQQYLVARYKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQITNRFKRTYS-R

Query:  RATGEQTNPFSKTTSG----ERLPQQASSAISK------GKQIDNGSNKDVKNQENPKRSTNTYSRPTMGKCFRCNQTGHLSNECPQRRTLTIADDVEYV
        R   E ++  +  +SG    E+  QQ S  ++K       K  + GS+++    +   +S N Y++P    C+RC + GH SN CP+R+     ++ +  
Subjt:  RATGEQTNPFSKTTSG----ERLPQQASSAISK------GKQIDNGSNKDVKNQENPKRSTNTYSRPTMGKCFRCNQTGHLSNECPQRRTLTIADDVEYV

Query:  EERD--GTESEENTDFLEPYEGEPVSLVVQRLLLAPKKDPSYQRHALFKTRCTINGKICNVIIDSGSTENIVASKLVTSLNLPLHPHSAPYKVSWIKKGG
        EE+D  G       +F      E ++LV+QR+LLAPK++   QRH +F++ C+I  K+C+VI+D+GS EN V+ KLV  L L   PH +PY + W+KKG 
Subjt:  EERD--GTESEENTDFLEPYEGEPVSLVVQRLLLAPKKDPSYQRHALFKTRCTINGKICNVIIDSGSTENIVASKLVTSLNLPLHPHSAPYKVSWIKKGG

Query:  EAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRLWQYDVQAVHKGRDNTYEFHWMGKKIQHII-------GFVVKHFDTSISDHV----------
          +V  T  +PLSIG  Y+D ++CDV+DMDACHILLGR WQ+DV A  KGRDN   F W  +KI                 F T IS+            
Subjt:  EAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRLWQYDVQAVHKGRDNTYEFHWMGKKIQHII-------GFVVKHFDTSISDHV----------

Query:  ----IPNEVQKLLQQHTTIVDN--PTKLPPLRDIQHSIDLIPGATLPNLPHYKMSPTEYQILHEQIQELLDKGHIQPSLSPCAVPALLTPKNDGTWRMCV
            IP +VQ++L Q   ++    P +LPP+RDIQH IDL+ GA+LPNLPHY+MSP E  IL EQI+ELL KG I+ SLSPCAVP LL PK D TWRMCV
Subjt:  ----IPNEVQKLLQQHTTIVDN--PTKLPPLRDIQHSIDLIPGATLPNLPHYKMSPTEYQILHEQIQELLDKGHIQPSLSPCAVPALLTPKNDGTWRMCV

Query:  DSRAINKITVKYRFPIPRIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQ---------------
        DSRA+NKI VKYRF IPR+ D+LD L G+ +FSKIDL+SGYHQIRIRPGDEWKTAFK+ +GLFEWLVMPFGLSNAPSTFMRLMNQ               
Subjt:  DSRAINKITVKYRFPIPRIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQ---------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------GKFLWGLEQQKSFGVIKEKLASSPVLALPNFNTPFEVAVDASGIGVGAVLSQNSHPIDFFSEKLSPSRQNWSTFD-------------------
              G+F WG EQ++SF  IKEKL ++PVLALPNF   FEV  DASG+GVGAVL Q+  P+ FFSEKLS +RQ WST+D                   
Subjt:  ------GKFLWGLEQQKSFGVIKEKLASSPVLALPNFNTPFEVAVDASGIGVGAVLSQNSHPIDFFSEKLSPSRQNWSTFD-------------------

Query:  ------------------------------------FVIKHTSGITNKVADTLSRKESILTILKGEIIAMDHLPETYADDKDFGTTWKKCIDHTPASGFH
                                            FVIKHTSG TN+VAD LSR+ S+L  L  E++  + L E Y  D DF   W KC +  P + + 
Subjt:  ------------------------------------FVIKHTSGITNKVADTLSRKESILTILKGEIIAMDHLPETYADDKDFGTTWKKCIDHTPASGFH

Query:  ILDGFLFKGDVLCIPHSSLREALIREAHAGGLAGHLGRDKTLQILETRYFWSQLKRDV------------------------------------------
        + +G+LFKG+ LCIP SSLRE LIR+ H GGL+GHLGRDKT+  +E R++W QLKRDV                                          
Subjt:  ILDGFLFKGDVLCIPHSSLREALIREAHAGGLAGHLGRDKTLQILETRYFWSQLKRDV------------------------------------------

Query:  ----------TNFVARYRYSKMAHFLPCKKTLDAVYIANLFFKEIVRLHGIPKSIVSDRDVKFLGHFWKTLWKKFDTKLKYSTTSHPQTDGQTEVTNRTL
                  + FV   R+SKMAHF+ CKKT DA  IA LFF+E+VRLHG+P SI SDRD KFL HFW TLW+ F T L  S+T+HPQTDGQTEVTNRTL
Subjt:  ----------TNFVARYRYSKMAHFLPCKKTLDAVYIANLFFKEIVRLHGIPKSIVSDRDVKFLGHFWKTLWKKFDTKLKYSTTSHPQTDGQTEVTNRTL

Query:  GNLL----------------------------STEAEEMATRITELHNQVQDHIQ---------------------KQTAKYKEAADRKKYKEAADRKKR
        GN++                            +T     +   T   N V D ++                     +   K K      KYK AADR +R
Subjt:  GNLL----------------------------STEAEEMATRITELHNQVQDHIQ---------------------KQTAKYKEAADRKKYKEAADRKKR

Query:  FKEFEVGDLVMIHLRKSRLPAGTYNKLKKRKLGPFPILERL
         K F+ GD VM+ LRK R PAGTY+KLK +K GP+ +L+R+
Subjt:  FKEFEVGDLVMIHLRKSRLPAGTYNKLKKRKLGPFPILERL

M5WCC7 Reverse transcriptase4.6e-24540.84Show/hide
Query:  MKERFLPMNFEQILYDQYQNCKQGTRSVMEYTEEFHRLGARTNLGESQQYLVARYKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQITNRFKRTYS-R
        M E+FLP ++EQILY  Y  C QGT SV EYTEEF RL  R +L E+    VARY  GL+  I+E++ +Q I  L EAI+ A  ++ ++  + KR  + R
Subjt:  MKERFLPMNFEQILYDQYQNCKQGTRSVMEYTEEFHRLGARTNLGESQQYLVARYKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQITNRFKRTYS-R

Query:  RATGEQTNPFSKTTSGE----RLPQQASSAISK------GKQIDNGSNKDVKNQENPKRSTNTYSRPTMGKCFRCNQTGHLSNECPQRRTLTIADDVEYV
        R T E ++  +  +SG     +  QQ+S  ++K       K  + GS+++    +   +S N Y++P    C+RC + GH SN CP+ +     ++ +  
Subjt:  RATGEQTNPFSKTTSGE----RLPQQASSAISK------GKQIDNGSNKDVKNQENPKRSTNTYSRPTMGKCFRCNQTGHLSNECPQRRTLTIADDVEYV

Query:  EERD--GTESEENTDFLEPYEGEPVSLVVQRLLLAPKKDPSYQRHALFKTRCTINGKICNVIIDSGSTENIVASKLVTSLNLPLHPHSAPYKVSWIKKGG
        EE D  G       +F      E ++LV+QR+LLAP+++   QRH++F++ C+I  K+C+VI+D+GS EN V+ KLV  L L   PH +PY + W+KKG 
Subjt:  EERD--GTESEENTDFLEPYEGEPVSLVVQRLLLAPKKDPSYQRHALFKTRCTINGKICNVIIDSGSTENIVASKLVTSLNLPLHPHSAPYKVSWIKKGG

Query:  EAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRLWQYDVQAVHKGRDNTYEFHWMGKKIQHIIGFVVK----------HFDTSISDHV-------
          +V  T  +PLSIG  Y+D+++CDV+DMDACHILLGR WQ+DV A  KGRDN   F W  +KI        K           F T IS+         
Subjt:  EAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRLWQYDVQAVHKGRDNTYEFHWMGKKIQHIIGFVVK----------HFDTSISDHV-------

Query:  -------IPNEVQKLLQQHTTIVDN--PTKLPPLRDIQHSIDLIPGATLPNLPHYKMSPTEYQILHEQIQELLDKGHIQPSLSPCAVPALLTPKNDGTWR
               IP +VQ++L Q   +     P +LPP+RDIQH IDL+PGA+L NLPHY+MSP E  IL EQI+ELL KG I+ SLSPCAVP LL PK D TWR
Subjt:  -------IPNEVQKLLQQHTTIVDN--PTKLPPLRDIQHSIDLIPGATLPNLPHYKMSPTEYQILHEQIQELLDKGHIQPSLSPCAVPALLTPKNDGTWR

Query:  MCVDSRAINKITVKYRFPIPRIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQ------------
        MCVDSRAINKITVKYRFPIPR+ D+LD L G+ +FSKIDL+SGYHQIRIRPGDEWKTAFK+ +GLFEWLVMPFGLSN PSTFMRLMNQ            
Subjt:  MCVDSRAINKITVKYRFPIPRIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQ------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------GKFLWGLEQQKSFGVIKEKLASSPVLALPNFNTPFEVAVDASGIGVGAVLSQNSHPIDFFSEKLSPSRQNWSTFD----------------
                 G+F WG EQ++SF  IKEKL ++PVLALPNF   FEV  DASG+GVGAVLSQ+  P+ FFSEKLS +RQ WST+D                
Subjt:  ---------GKFLWGLEQQKSFGVIKEKLASSPVLALPNFNTPFEVAVDASGIGVGAVLSQNSHPIDFFSEKLSPSRQNWSTFD----------------

Query:  ---------------------------------------FVIKHTSGITNKVADTLSRKESILTILKGEIIAMDHLPETYADDKDFGTTWKKCIDHTPAS
                                               FVIKHTSG TN+VAD LSR+ S+L  L  E++  + L E Y  D DFG  W KC +  P +
Subjt:  ---------------------------------------FVIKHTSGITNKVADTLSRKESILTILKGEIIAMDHLPETYADDKDFGTTWKKCIDHTPAS

Query:  GFHILDGFLFKGDVLCIPHSSLREALIREAHAGGLAGHLGRDKTLQILETRYFWSQLKRDV---------------------------------------
         + + +G+LFKG+ LCIP SSLRE LIR+ H GGL+GHLGRDKT+  +E R++W QLKRDV                                       
Subjt:  GFHILDGFLFKGDVLCIPHSSLREALIREAHAGGLAGHLGRDKTLQILETRYFWSQLKRDV---------------------------------------

Query:  -------------TNFVARYRYSKMAHFLPCKKTLDAVYIANLFFKEIVRLHGIPKSIVSDRDVKFLGHFWKTLWKKFDTKLKYSTTSHPQTDGQTEVTN
                     + FV   R+SKMAHF+ C+KT DA  IA LFF+E+VRLHG+P SI SDRD KFL HFW TLW+ F T L  S+T+HPQTDGQTEVTN
Subjt:  -------------TNFVARYRYSKMAHFLPCKKTLDAVYIANLFFKEIVRLHGIPKSIVSDRDVKFLGHFWKTLWKKFDTKLKYSTTSHPQTDGQTEVTN

Query:  RTLGNLL----------------------------STEAEEMATRITELHNQVQDHIQ---------------------KQTAKYKEAADRKKYKEAADR
        RTLGN++                            +T     +   T + N V D ++                     +   K K      KYK AAD+
Subjt:  RTLGNLL----------------------------STEAEEMATRITELHNQVQDHIQ---------------------KQTAKYKEAADRKKYKEAADR

Query:  KKRFKEFEVGDLVMIHLRKSRLPAGTYNKLKKRKLGPFPILERL
         +R K F+ GD VMI LRK R P GTY+KLK +K GP+ +L+R+
Subjt:  KKRFKEFEVGDLVMIHLRKSRLPAGTYNKLKKRKLGPFPILERL

SwissProt top hitse value%identityAlignment
O43933 Peroxisome biogenesis factor 16.4e-8745.81Show/hide
Query:  VSCSRLASEKAQTIRQSLLNYISEALDHAPSLIVLDDLDSI--VLSTSDSEGFQPSTSMSAMTEFLTDVIDEYEEKRKSSCQVGPIAFVASVQTLDKIPQ
        V C  L  ++ + I+++L    SEA+   PS+++LDDLD I  + +  + E    +     +   L D+I E+         V  IA   S Q+L  +  
Subjt:  VSCSRLASEKAQTIRQSLLNYISEALDHAPSLIVLDDLDSI--VLSTSDSEGFQPSTSMSAMTEFLTDVIDEYEEKRKSSCQVGPIAFVASVQTLDKIPQ

Query:  SLRSSGRFDFHVELPAPAASERTAILKHEIQRRSLDC-----SDVTLQDIASKCDGYDAYDLEILVDRAVHGAVSRFLPLSLNQKQSPTLIESDFSLAMK
        S +    F     +  P   +R  IL + I+ + LDC     +D+ LQ +A +  G+ A D  +LVDRA+H  +SR    S++ ++   L   DF  A++
Subjt:  SLRSSGRFDFHVELPAPAASERTAILKHEIQRRSLDC-----SDVTLQDIASKCDGYDAYDLEILVDRAVHGAVSRFLPLSLNQKQSPTLIESDFSLAMK

Query:  EFVPASMRDIT--KPSAEGGRSGWDDVGGLVEVKNSIKEMLVFPSKFPNIFAQAPLRLRSNVLLYGPPGCGKTHIVGAAAAACSLRFISVKGPELLNKYI
         F+PAS+R +   KP       GWD +GGL EV+  + + +  P+K+P +FA  P+R R+ +LLYGPPG GKT + G  A    + FISVKGPELL+KYI
Subjt:  EFVPASMRDIT--KPSAEGGRSGWDDVGGLVEVKNSIKEMLVFPSKFPNIFAQAPLRLRSNVLLYGPPGCGKTHIVGAAAAACSLRFISVKGPELLNKYI

Query:  GASEQAVRDIFSKATAAAPCILFFDEFDSIAPKRGHDNTGVTDRVVNQFLTELDGVEVLTGVFVFAATSRPDLLDAALLRPGRLDRLLFCDFPSPVERLN
        GASEQAVRDIF +A AA PCILFFDEF+SIAP+RGHDNTGVTDRVVNQ LT+LDGVE L GV+V AATSRPDL+D ALLRPGRLD+ ++C  P  V RL 
Subjt:  GASEQAVRDIFSKATAAAPCILFFDEFDSIAPKRGHDNTGVTDRVVNQFLTELDGVEVLTGVFVFAATSRPDLLDAALLRPGRLDRLLFCDFPSPVERLN

Query:  ILQVLS
        IL VLS
Subjt:  ILQVLS

P46463 Peroxisome biosynthesis protein PAS12.8e-8238.01Show/hide
Query:  LIKSNYHLIVFVSCSRLASEKAQTIRQSLLNYISEALDHAPSLIVLDDLDSIVLSTSDSEGFQPSTSMSAMTEFLTDVIDEYEEKRKSSCQVG-PIAFVA
        ++ +  H +  ++C ++ SE    +R    +  SE    APSL++L+DLDS++ +  +      S  +S           EY   + S+  +   I  +A
Subjt:  LIKSNYHLIVFVSCSRLASEKAQTIRQSLLNYISEALDHAPSLIVLDDLDSIVLSTSDSEGFQPSTSMSAMTEFLTDVIDEYEEKRKSSCQVG-PIAFVA

Query:  SVQTLDKIPQSLRSSGRFDFHVELPAPAASERTAILKHEIQRRSLDCSD-VTLQDIASKCDGYDAYDLEILVDRAVHGAVSRFL------PLSLNQKQSP
        S ++ + +   + ++   +   +L AP    R  IL+  +   ++ CS+   L +IA + +GY   DL++L DRA H  +SR +       L + +  +P
Subjt:  SVQTLDKIPQSLRSSGRFDFHVELPAPAASERTAILKHEIQRRSLDCSD-VTLQDIASKCDGYDAYDLEILVDRAVHGAVSRFL------PLSLNQKQSP

Query:  ----------------------------------------TLIESDFSLAMKEFVPASMRDITKPSAEGGRSGWDDVGGLVEVKNSIKEMLVFPSKFPNI
                                                T+ + +F  A+  ++P S+R +    ++     WDD+GGL + K+ + E L +P+K+  I
Subjt:  ----------------------------------------TLIESDFSLAMKEFVPASMRDITKPSAEGGRSGWDDVGGLVEVKNSIKEMLVFPSKFPNI

Query:  FAQAPLRLRSNVLLYGPPGCGKTHIVGAAAAACSLRFISVKGPELLNKYIGASEQAVRDIFSKATAAAPCILFFDEFDSIAPKRGHDNTGVTDRVVNQFL
        F+  PLRLRS +LLYG PGCGKT +  A AA C L FIS+KGPE+LNKYIG SEQ+VR++F +A AA PCILFFDEFDSIAPKRGHD+TGVTDRVVNQ L
Subjt:  FAQAPLRLRSNVLLYGPPGCGKTHIVGAAAAACSLRFISVKGPELLNKYIGASEQAVRDIFSKATAAAPCILFFDEFDSIAPKRGHDNTGVTDRVVNQFL

Query:  TELDGVEVLTGVFVFAATSRPDLLDAALLRPGRLDRLLFCDFPSPVERLNILQVLSTKVGGSK
        T++DG E L GV+V AATSRPDL+D+ALLRPGRLD+ + CD P   +RL+ILQ ++  +  SK
Subjt:  TELDGVEVLTGVFVFAATSRPDLLDAALLRPGRLDRLLFCDFPSPVERLNILQVLSTKVGGSK

Q54GX5 Peroxisome biogenesis factor 14.3e-8340.94Show/hide
Query:  LERLEKMLIKSNYHLIVFVSCSRLASEKAQTIRQSLLNYISEALDHAPSLIVLDDLDSIVLSTSDSEGFQPSTSMSAMTEFLTDVIDEYEEKRKSSCQVG
        L+ L+   I+  ++ + + SC    +  + T         S      P +I+L+ LD ++L T + +          +   +  +  +Y+ +        
Subjt:  LERLEKMLIKSNYHLIVFVSCSRLASEKAQTIRQSLLNYISEALDHAPSLIVLDDLDSIVLSTSDSEGFQPSTSMSAMTEFLTDVIDEYEEKRKSSCQVG

Query:  PIAFVASVQTLDKIPQSLRSSGRFDFHVELPAPAASERTAILKHEIQ---RRSLDCSDVTLQDIASKCDGYDAYDLEILVDRAVHGAVSRFLPLSLN---
        PI  +A+V +   + QS++    F   +EL AP   ER  IL+  ++   ++  D   + L   ++  +GY   D+E +VDR++H +  + +  + N   
Subjt:  PIAFVASVQTLDKIPQSLRSSGRFDFHVELPAPAASERTAILKHEIQ---RRSLDCSDVTLQDIASKCDGYDAYDLEILVDRAVHGAVSRFLPLSLN---

Query:  QKQSPTLIE-SDFSLAMKEFVPASMRDITKPSAEGGRSGWDDVGGLVEVKNSIKEMLVFPSKFPNIFAQAPLRLRSNVLLYGPPGCGKTHIVGAAAAACS
              +IE S    A + + P +++ I   S+E     W D+GGL  V+  +KE + +P+K+P +F  +PLRLRS +LLYGP GCGKT +  A A  C 
Subjt:  QKQSPTLIE-SDFSLAMKEFVPASMRDITKPSAEGGRSGWDDVGGLVEVKNSIKEMLVFPSKFPNIFAQAPLRLRSNVLLYGPPGCGKTHIVGAAAAACS

Query:  LRFISVKGPELLNKYIGASEQAVRDIFSKATAAAPCILFFDEFDSIAPKRGHDNTGVTDRVVNQFLTELDGVEVLTGVFVFAATSRPDLLDAALLRPGRL
        L FISVKGPELLNKYIG+SEQ VRD+FS+A++A PC+LFFDEFDSIAP+RGHDN+GVTDRVVNQFLT+LDGVE LTGV+V AATSRPDL+D ALLRPGRL
Subjt:  LRFISVKGPELLNKYIGASEQAVRDIFSKATAAAPCILFFDEFDSIAPKRGHDNTGVTDRVVNQFLTELDGVEVLTGVFVFAATSRPDLLDAALLRPGRL

Query:  DRLLFCDFPSPVERLNILQVLSTKV
        D+ L+C+ P   ERL+IL  L +K+
Subjt:  DRLLFCDFPSPVERLNILQVLSTKV

Q5BL07 Peroxisome biogenesis factor 14.9e-8745.48Show/hide
Query:  VSCSRLASEKAQTIRQSLLNYISEALDHAPSLIVLDDLDSI--VLSTSDSEGFQPSTSMSAMTEFLTDVIDEYEEKRKSSCQVGPIAFVASVQTLDKIPQ
        V C  L  ++ ++I+++L    SEA    PS+I+LDDLD I  + S  + E    +     +   L D+I E+            +A +A+ Q    +  
Subjt:  VSCSRLASEKAQTIRQSLLNYISEALDHAPSLIVLDDLDSI--VLSTSDSEGFQPSTSMSAMTEFLTDVIDEYEEKRKSSCQVGPIAFVASVQTLDKIPQ

Query:  SLRSS---GRFDFHVELPAPAASERTAILKHEIQRRSLDCS-----DVTLQDIASKCDGYDAYDLEILVDRAVHGAVSRFLPLSLNQKQSPTLIESDFSL
        SL S+     F     L  P   +R  IL H + +  L C      D+ LQ IA   + + A D  +LVDRA+H ++SR      + ++  TL  SDF  
Subjt:  SLRSS---GRFDFHVELPAPAASERTAILKHEIQRRSLDCS-----DVTLQDIASKCDGYDAYDLEILVDRAVHGAVSRFLPLSLNQKQSPTLIESDFSL

Query:  AMKEFVPASMRDIT--KPSAEGGRSGWDDVGGLVEVKNSIKEMLVFPSKFPNIFAQAPLRLRSNVLLYGPPGCGKTHIVGAAAAACSLRFISVKGPELLN
        A++ F+PAS+R++   KP       GWD +GGL EV+  + + +  P+K+P +FA  P+R R+ +LLYGPPG GKT + G  A    + FIS+KGPELL+
Subjt:  AMKEFVPASMRDIT--KPSAEGGRSGWDDVGGLVEVKNSIKEMLVFPSKFPNIFAQAPLRLRSNVLLYGPPGCGKTHIVGAAAAACSLRFISVKGPELLN

Query:  KYIGASEQAVRDIFSKATAAAPCILFFDEFDSIAPKRGHDNTGVTDRVVNQFLTELDGVEVLTGVFVFAATSRPDLLDAALLRPGRLDRLLFCDFPSPVE
        KYIGASEQAVRD+F +A AA PCILFFDEF+SIAP+RGHDNTGVTDRVVNQ LT+LDGVE L GV+V AATSRPDL+D ALLRPGRLD+ ++C  P  V 
Subjt:  KYIGASEQAVRDIFSKATAAAPCILFFDEFDSIAPKRGHDNTGVTDRVVNQFLTELDGVEVLTGVFVFAATSRPDLLDAALLRPGRLDRLLFCDFPSPVE

Query:  RLNILQVLS
        RL IL VLS
Subjt:  RLNILQVLS

Q9FNP1 Peroxisome biogenesis protein 16.5e-17274.69Show/hide
Query:  IVFVSCSRLASEKAQTIRQSLLNYISEALDHAPSLIVLDDLDSIVLSTSDSEGFQPSTSMSAMTEFLTDVIDEYEEKRKSSCQVGPIAFVASVQTLDKIP
        ++ VSCS LA EK Q I   L + I+E L+HAPS+I+LDDLDSI+ S+SD+EG Q S  ++ +T+FLTDVID+Y E R SSC +GP+AFVASVQ+L++IP
Subjt:  IVFVSCSRLASEKAQTIRQSLLNYISEALDHAPSLIVLDDLDSIVLSTSDSEGFQPSTSMSAMTEFLTDVIDEYEEKRKSSCQVGPIAFVASVQTLDKIP

Query:  QSLRSSGRFDFHVELPAPAASERTAILKHEIQRRSLDCSDVTLQDIASKCDGYDAYDLEILVDRAVHGAVSRFLPLSLNQKQSPTLIESDFSLAMKEFVP
        Q+L SSGRFDFHV+L APA SER AILKHEIQ+R LDCS+  L ++A+KC+GYDAYDLEILVDRAVH A+ R LPL  N  +   L++ DF+ AM +FVP
Subjt:  QSLRSSGRFDFHVELPAPAASERTAILKHEIQRRSLDCSDVTLQDIASKCDGYDAYDLEILVDRAVHGAVSRFLPLSLNQKQSPTLIESDFSLAMKEFVP

Query:  ASMRDITKPSAEGGRSGWDDVGGLVEVKNSIKEMLVFPSKFPNIFAQAPLRLRSNVLLYGPPGCGKTHIVGAAAAACSLRFISVKGPELLNKYIGASEQA
         +MRDITK ++EGGR GW+DVGG+ ++KN+IKEM+  PSKFP IFA++PLRLRSNVLLYGPPGCGKTHIVGAAAAACSLRFISVKGPELLNKYIGASEQA
Subjt:  ASMRDITKPSAEGGRSGWDDVGGLVEVKNSIKEMLVFPSKFPNIFAQAPLRLRSNVLLYGPPGCGKTHIVGAAAAACSLRFISVKGPELLNKYIGASEQA

Query:  VRDIFSKATAAAPCILFFDEFDSIAPKRGHDNTGVTDRVVNQFLTELDGVEVLTGVFVFAATSRPDLLDAALLRPGRLDRLLFCDFPSPVERLNILQVLS
        VRDIFSKA AAAPCILFFDEFDSIAPKRGHDNTGVTDRVVNQFLTELDGVEVLTGVFVFAATSRPDLLD ALLRPGRLDRLL CDFPSP ERL IL VLS
Subjt:  VRDIFSKATAAAPCILFFDEFDSIAPKRGHDNTGVTDRVVNQFLTELDGVEVLTGVFVFAATSRPDLLDAALLRPGRLDRLLFCDFPSPVERLNILQVLS

Query:  TKV
         K+
Subjt:  TKV

Arabidopsis top hitse value%identityAlignment
AT2G03670.1 cell division cycle 48B6.6e-5534.86Show/hide
Query:  ASEKAQTIRQSLLNYISEALDHAPSLIVLDDLDSIVLSTSDSEGFQPSTSMSAMTEFLTDVIDEYEEKRKSSCQVGPIAFVASVQTLDKIPQSLRSSGRF
        A E  + +R++     S A+   PS+I +D++D ++    D+   Q     S +   +        +  K S     +  VAS   +D I  +LR +GRF
Subjt:  ASEKAQTIRQSLLNYISEALDHAPSLIVLDDLDSIVLSTSDSEGFQPSTSMSAMTEFLTDVIDEYEEKRKSSCQVGPIAFVASVQTLDKIPQSLRSSGRF

Query:  DFHVELPAPAASERTAILKHEIQRRSLDCSDVTLQDIASKCDGYDAYDLEILVDRAVHGAVSRFLPLSLNQKQSPTLIESDFSLAMKEFVPASMRDITKP
        D  VE+  P   +R  IL+   ++ +LD S V LQ IA  C+GY   DLE L   A   A  R          S  L   DF +A     P+  R IT  
Subjt:  DFHVELPAPAASERTAILKHEIQRRSLDCSDVTLQDIASKCDGYDAYDLEILVDRAVHGAVSRFLPLSLNQKQSPTLIESDFSLAMKEFVPASMRDITKP

Query:  SAEGGRSGWDDVGGLVEVKNSIKEMLVFPSKFPNIFAQAPLRLRSNVLLYGPPGCGKTHIVGAAAAACSLRFISVKGPELLNKYIGASEQAVRDIFSKAT
          E  +  WDDVGGL ++K  +++ + +P K    F +  +     +LL+GPPGC KT +  AAA A    F S+   EL + Y+G  E  +R+ F +A 
Subjt:  SAEGGRSGWDDVGGLVEVKNSIKEMLVFPSKFPNIFAQAPLRLRSNVLLYGPPGCGKTHIVGAAAAACSLRFISVKGPELLNKYIGASEQAVRDIFSKAT

Query:  AAAPCILFFDEFDSIAPKRGHDNTG----VTDRVVNQFLTELDGVEVLTGVFVFAATSRPDLLDAALLRPGRLDRLLFCDFPSPVERLNILQV
         A+P I+FFDE D +A KRG +++     V +R+++  LTE+DG+E   G+ V AAT+RP  +DAAL+RPGR D +L+   P    R  ILQV
Subjt:  AAAPCILFFDEFDSIAPKRGHDNTG----VTDRVVNQFLTELDGVEVLTGVFVFAATSRPDLLDAALLRPGRLDRLLFCDFPSPVERLNILQV

AT3G09840.1 cell division cycle 489.9e-5934.56Show/hide
Query:  SRLASEKAQTIRQSLLNYISEALDHAPSLIVLDDLDSIVLSTSDSEGFQPSTSMSAMTEFLTDVIDEYEEKRKSSCQVGPIAFVASVQTLDKIPQSLRSS
        S+LA E    +R++      EA  +APS+I +D++DSI      + G         +++ LT ++D  + +         +  + +    + I  +LR  
Subjt:  SRLASEKAQTIRQSLLNYISEALDHAPSLIVLDDLDSIVLSTSDSEGFQPSTSMSAMTEFLTDVIDEYEEKRKSSCQVGPIAFVASVQTLDKIPQSLRSS

Query:  GRFDFHVELPAPAASERTAILKHEIQRRSLDCSDVTLQDIASKCDGYDAYDLEILVDRAVHGAVSRFLPL------SLNQK--QSPTLIESDFSLAMKEF
        GRFD  +++  P    R  +L+   +   L   DV L+ I+    GY   DL  L   A    +   + +      S++ +   S  +    F  A+   
Subjt:  GRFDFHVELPAPAASERTAILKHEIQRRSLDCSDVTLQDIASKCDGYDAYDLEILVDRAVHGAVSRFLPL------SLNQK--QSPTLIESDFSLAMKEF

Query:  VPASMRDITKPSAEGGRSGWDDVGGLVEVKNSIKEMLVFPSKFPNIFAQAPLRLRSNVLLYGPPGCGKTHIVGAAAAACSLRFISVKGPELLNKYIGASE
         P+++R+      E     W+D+GGL  VK  ++E + +P + P  F +  +     VL YGPPGCGKT +  A A  C   FISVKGPELL  + G SE
Subjt:  VPASMRDITKPSAEGGRSGWDDVGGLVEVKNSIKEMLVFPSKFPNIFAQAPLRLRSNVLLYGPPGCGKTHIVGAAAAACSLRFISVKGPELLNKYIGASE

Query:  QAVRDIFSKATAAAPCILFFDEFDSIAPKR----GHDNTGVTDRVVNQFLTELDGVEVLTGVFVFAATSRPDLLDAALLRPGRLDRLLFCDFPSPVERLN
          VR+IF KA  +APC+LFFDE DSIA +R    G D  G  DRV+NQ LTE+DG+     VF+  AT+RPD++D+ALLRPGRLD+L++   P    RLN
Subjt:  QAVRDIFSKATAAAPCILFFDEFDSIAPKR----GHDNTGVTDRVVNQFLTELDGVEVLTGVFVFAATSRPDLLDAALLRPGRLDRLLFCDFPSPVERLN

Query:  ILQVLSTK
        I +    K
Subjt:  ILQVLSTK

AT3G53230.1 ATPase, AAA-type, CDC48 protein1.9e-5733.83Show/hide
Query:  SRLASEKAQTIRQSLLNYISEALDHAPSLIVLDDLDSIVLSTSDSEGFQPSTSMSAMTEFLTDVIDEYEEKRKSSCQVGPIAFVASVQTLDKIPQSLRSS
        S+LA E    +R++      EA  +APS+I +D++DSI      + G         +++ LT ++D  + +         +  + +    + I  +LR  
Subjt:  SRLASEKAQTIRQSLLNYISEALDHAPSLIVLDDLDSIVLSTSDSEGFQPSTSMSAMTEFLTDVIDEYEEKRKSSCQVGPIAFVASVQTLDKIPQSLRSS

Query:  GRFDFHVELPAPAASERTAILKHEIQRRSLDCSDVTLQDIASKCDGYDAYDLEILVDRAVHGAVSRFLP-LSLNQKQ-------SPTLIESDFSLAMKEF
        GRFD  +++  P    R  +L+   +   L   DV L+ ++    GY   DL  L   A    +   +  + L+ ++       S  +    F  A+   
Subjt:  GRFDFHVELPAPAASERTAILKHEIQRRSLDCSDVTLQDIASKCDGYDAYDLEILVDRAVHGAVSRFLP-LSLNQKQ-------SPTLIESDFSLAMKEF

Query:  VPASMRDITKPSAEGGRSGWDDVGGLVEVKNSIKEMLVFPSKFPNIFAQAPLRLRSNVLLYGPPGCGKTHIVGAAAAACSLRFISVKGPELLNKYIGASE
         P+++R+      E     W+D+GGL  VK  ++E + +P + P  F +  +     VL YGPPGCGKT +  A A  C   FIS+KGPELL  + G SE
Subjt:  VPASMRDITKPSAEGGRSGWDDVGGLVEVKNSIKEMLVFPSKFPNIFAQAPLRLRSNVLLYGPPGCGKTHIVGAAAAACSLRFISVKGPELLNKYIGASE

Query:  QAVRDIFSKATAAAPCILFFDEFDSIAPKRGH---DNTGVTDRVVNQFLTELDGVEVLTGVFVFAATSRPDLLDAALLRPGRLDRLLFCDFPSPVERLNI
          VR+IF KA  +APC+LFFDE DSIA +RG+   D  G  DRV+NQ LTE+DG+     VF+  AT+RPD++D ALLRPGRLD+L++   P    R  I
Subjt:  QAVRDIFSKATAAAPCILFFDEFDSIAPKRGH---DNTGVTDRVVNQFLTELDGVEVLTGVFVFAATSRPDLLDAALLRPGRLDRLLFCDFPSPVERLNI

Query:  LQ
         +
Subjt:  LQ

AT5G03340.1 ATPase, AAA-type, CDC48 protein5.8e-5934.64Show/hide
Query:  SRLASEKAQTIRQSLLNYISEALDHAPSLIVLDDLDSIVLSTSDSEGFQPSTSMSAMTEFLTDVIDEYEEKRKSSCQVGPIAFVASVQTLDKIPQSLRSS
        S+LA E    +R++      EA  +APS+I +D++DSI      + G         +++ LT ++D  + +         +  + +    + I  +LR  
Subjt:  SRLASEKAQTIRQSLLNYISEALDHAPSLIVLDDLDSIVLSTSDSEGFQPSTSMSAMTEFLTDVIDEYEEKRKSSCQVGPIAFVASVQTLDKIPQSLRSS

Query:  GRFDFHVELPAPAASERTAILKHEIQRRSLDCSDVTLQDIASKCDGYDAYDLEILVDRAVHGAVSRFLPL------SLNQK--QSPTLIESDFSLAMKEF
        GRFD  +++  P    R  +L+   +   L   DV L+ I+    GY   DL  L   A    +   + +      S++ +   S  +    F  A+   
Subjt:  GRFDFHVELPAPAASERTAILKHEIQRRSLDCSDVTLQDIASKCDGYDAYDLEILVDRAVHGAVSRFLPL------SLNQK--QSPTLIESDFSLAMKEF

Query:  VPASMRDITKPSAEGGRSGWDDVGGLVEVKNSIKEMLVFPSKFPNIFAQAPLRLRSNVLLYGPPGCGKTHIVGAAAAACSLRFISVKGPELLNKYIGASE
         P+++R+      E     W+D+GGL  VK  ++E + +P + P  F +  +     VL YGPPGCGKT +  A A  C   FISVKGPELL  + G SE
Subjt:  VPASMRDITKPSAEGGRSGWDDVGGLVEVKNSIKEMLVFPSKFPNIFAQAPLRLRSNVLLYGPPGCGKTHIVGAAAAACSLRFISVKGPELLNKYIGASE

Query:  QAVRDIFSKATAAAPCILFFDEFDSIAPKRGH---DNTGVTDRVVNQFLTELDGVEVLTGVFVFAATSRPDLLDAALLRPGRLDRLLFCDFPSPVERLNI
          VR+IF KA  +APC+LFFDE DSIA +RG+   D  G  DRV+NQ LTE+DG+     VF+  AT+RPD++D+ALLRPGRLD+L++   P    RLNI
Subjt:  QAVRDIFSKATAAAPCILFFDEFDSIAPKRGH---DNTGVTDRVVNQFLTELDGVEVLTGVFVFAATSRPDLLDAALLRPGRLDRLLFCDFPSPVERLNI

Query:  LQVLSTK
         +    K
Subjt:  LQVLSTK

AT5G08470.1 peroxisome 14.6e-17374.69Show/hide
Query:  IVFVSCSRLASEKAQTIRQSLLNYISEALDHAPSLIVLDDLDSIVLSTSDSEGFQPSTSMSAMTEFLTDVIDEYEEKRKSSCQVGPIAFVASVQTLDKIP
        ++ VSCS LA EK Q I   L + I+E L+HAPS+I+LDDLDSI+ S+SD+EG Q S  ++ +T+FLTDVID+Y E R SSC +GP+AFVASVQ+L++IP
Subjt:  IVFVSCSRLASEKAQTIRQSLLNYISEALDHAPSLIVLDDLDSIVLSTSDSEGFQPSTSMSAMTEFLTDVIDEYEEKRKSSCQVGPIAFVASVQTLDKIP

Query:  QSLRSSGRFDFHVELPAPAASERTAILKHEIQRRSLDCSDVTLQDIASKCDGYDAYDLEILVDRAVHGAVSRFLPLSLNQKQSPTLIESDFSLAMKEFVP
        Q+L SSGRFDFHV+L APA SER AILKHEIQ+R LDCS+  L ++A+KC+GYDAYDLEILVDRAVH A+ R LPL  N  +   L++ DF+ AM +FVP
Subjt:  QSLRSSGRFDFHVELPAPAASERTAILKHEIQRRSLDCSDVTLQDIASKCDGYDAYDLEILVDRAVHGAVSRFLPLSLNQKQSPTLIESDFSLAMKEFVP

Query:  ASMRDITKPSAEGGRSGWDDVGGLVEVKNSIKEMLVFPSKFPNIFAQAPLRLRSNVLLYGPPGCGKTHIVGAAAAACSLRFISVKGPELLNKYIGASEQA
         +MRDITK ++EGGR GW+DVGG+ ++KN+IKEM+  PSKFP IFA++PLRLRSNVLLYGPPGCGKTHIVGAAAAACSLRFISVKGPELLNKYIGASEQA
Subjt:  ASMRDITKPSAEGGRSGWDDVGGLVEVKNSIKEMLVFPSKFPNIFAQAPLRLRSNVLLYGPPGCGKTHIVGAAAAACSLRFISVKGPELLNKYIGASEQA

Query:  VRDIFSKATAAAPCILFFDEFDSIAPKRGHDNTGVTDRVVNQFLTELDGVEVLTGVFVFAATSRPDLLDAALLRPGRLDRLLFCDFPSPVERLNILQVLS
        VRDIFSKA AAAPCILFFDEFDSIAPKRGHDNTGVTDRVVNQFLTELDGVEVLTGVFVFAATSRPDLLD ALLRPGRLDRLL CDFPSP ERL IL VLS
Subjt:  VRDIFSKATAAAPCILFFDEFDSIAPKRGHDNTGVTDRVVNQFLTELDGVEVLTGVFVFAATSRPDLLDAALLRPGRLDRLLFCDFPSPVERLNILQVLS

Query:  TKV
         K+
Subjt:  TKV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGAAAGGTTCTTGCCCATGAATTTCGAGCAAATCCTATACGATCAATACCAAAATTGTAAGCAAGGTACAAGATCGGTGATGGAGTACACTGAGGAATTCCATCG
ATTAGGAGCACGAACGAATCTTGGAGAAAGCCAACAATATTTAGTTGCAAGATACAAAGGAGGTCTTCGAAATGACATCAAAGAACAACTCTCTTTACAACCTATTGGAT
ATCTAAATGAGGCCATCTCAGCAGCAGCCGTCATTGAAGAACAAATAACCAACCGCTTTAAAAGAACATATTCAAGAAGAGCCACAGGTGAACAAACCAATCCATTCAGC
AAAACTACCTCGGGTGAAAGATTGCCACAGCAGGCCTCCTCGGCAATTTCAAAGGGAAAGCAAATCGACAATGGATCAAATAAAGATGTCAAGAATCAAGAAAATCCAAA
AAGATCTACAAATACATATTCAAGACCAACTATGGGAAAGTGTTTTCGATGCAACCAAACCGGTCATTTGTCAAACGAATGTCCACAAAGAAGAACTCTCACTATTGCAG
ATGATGTCGAATATGTGGAAGAACGGGATGGCACTGAATCCGAAGAAAACACAGATTTCCTTGAACCATATGAAGGAGAACCGGTATCATTAGTAGTCCAAAGACTTTTG
CTCGCACCCAAGAAAGATCCAAGTTATCAAAGACATGCTCTTTTTAAAACAAGATGCACCATCAATGGGAAAATTTGCAACGTTATCATCGACAGTGGGAGCACCGAGAA
CATTGTGGCAAGCAAGTTAGTAACCTCCCTAAACCTACCTTTACACCCTCATTCAGCACCTTACAAGGTGAGTTGGATAAAGAAAGGGGGCGAGGCTCAAGTGACACATA
CCAGCACTATTCCTTTATCTATTGGAGCAAGCTACAAGGACCAAATAATTTGTGATGTCTTAGACATGGATGCTTGTCATATCCTCCTCGGCCGACTATGGCAATATGAT
GTGCAAGCAGTTCACAAGGGCCGTGATAACACATATGAATTCCATTGGATGGGAAAGAAAATCCAACATATTATTGGCTTTGTTGTTAAACATTTTGATACTTCCATTTC
TGACCATGTCATTCCTAATGAGGTACAAAAATTGCTACAACAACACACGACCATAGTCGACAACCCAACAAAACTTCCTCCTTTAAGAGATATCCAACATTCTATTGATC
TTATCCCGGGCGCTACACTCCCCAACCTTCCACACTATAAGATGAGCCCAACAGAATATCAAATACTTCATGAACAAATACAAGAATTGCTGGATAAAGGACATATTCAA
CCCAGCCTCAGTCCTTGTGCCGTTCCTGCCCTCCTCACTCCCAAAAACGATGGCACTTGGAGGATGTGTGTAGATAGCCGAGCAATTAACAAGATTACGGTGAAATACCG
TTTTCCAATTCCAAGGATTGGTGACTTACTAGATCAATTAGGAGGGGCAACAATATTCTCGAAGATTGATTTAAAGAGTGGCTACCACCAAATCCGAATCCGACCGGGAG
ATGAGTGGAAAACAGCGTTCAAAACCAATGAAGGTCTCTTTGAATGGCTTGTAATGCCTTTCGGATTATCAAATGCCCCAAGTACCTTCATGAGGCTTATGAACCAGGGG
AAATTTCTTTGGGGACTTGAACAACAAAAGAGCTTCGGTGTTATCAAAGAAAAACTTGCCAGTAGTCCCGTATTAGCCCTCCCAAATTTCAATACTCCTTTTGAAGTAGC
AGTTGATGCTTCAGGCATAGGTGTAGGAGCTGTTTTATCACAAAATTCCCATCCAATCGATTTTTTTAGTGAAAAACTAAGCCCCTCAAGACAAAACTGGTCCACATTTG
ATTTTGTGATCAAACATACGTCTGGTATTACTAACAAAGTAGCAGATACATTGAGCCGAAAGGAAAGCATTTTGACCATTTTAAAAGGAGAAATAATAGCTATGGACCAC
CTACCGGAAACATATGCTGATGATAAAGATTTTGGCACCACATGGAAAAAATGTATAGACCATACGCCGGCCAGTGGCTTCCATATCTTGGATGGATTCCTCTTCAAGGG
GGATGTCTTATGCATTCCACATTCGTCTTTAAGGGAAGCGCTAATTCGAGAAGCTCATGCTGGAGGTTTAGCCGGACATCTTGGGCGAGACAAGACATTACAAATTCTAG
AAACCAGATATTTTTGGTCGCAATTAAAGAGGGATGTCACCAATTTCGTAGCACGATACCGATACAGCAAAATGGCTCATTTTTTGCCATGTAAGAAAACGTTAGATGCT
GTATACATCGCAAACCTATTCTTCAAAGAGATAGTACGACTTCATGGTATTCCAAAATCGATTGTTTCCGATAGGGATGTGAAATTCCTAGGACATTTCTGGAAAACCCT
TTGGAAGAAGTTCGATACAAAACTCAAGTACAGCACTACAAGCCATCCTCAAACTGATGGTCAAACGGAAGTAACAAACCGGACCTTGGGCAACTTGTTAAGTACGGAAG
CTGAGGAGATGGCTACCAGAATCACGGAGTTGCATAACCAAGTCCAAGACCACATTCAAAAGCAAACTGCCAAGTACAAAGAAGCTGCAGATCGGAAGAAGTACAAAGAA
GCTGCAGATCGGAAGAAGAGATTCAAGGAATTCGAAGTTGGAGATCTAGTGATGATACATTTGAGGAAAAGCAGGCTCCCAGCAGGAACCTACAACAAGCTCAAGAAAAG
AAAGCTTGGCCCATTCCCAATATTGGAAAGATTGGAGAAAATGCTTATAAAATCCAACTACCATTTAATAGTTTTTGTATCTTGTTCTCGACTTGCTTCTGAAAAAGCCC
AGACTATACGCCAGTCACTATTGAACTATATATCAGAAGCTTTAGATCACGCGCCTTCTCTTATTGTTTTGGATGACTTAGACAGTATTGTCTTGTCAACATCTGACTCG
GAAGGATTTCAGCCATCCACATCAATGTCTGCAATGACAGAGTTTCTCACTGACGTGATAGATGAATATGAGGAAAAGAGGAAAAGCTCATGTCAGGTTGGTCCTATTGC
ATTTGTGGCTTCTGTTCAGACTCTAGATAAGATACCACAATCATTGAGATCTTCAGGAAGATTCGACTTTCATGTGGAATTGCCTGCTCCTGCAGCATCAGAGCGAACTG
CTATTCTGAAGCATGAAATACAGAGACGTTCCTTAGATTGTTCGGATGTGACCTTACAAGACATTGCCTCAAAATGTGATGGATATGATGCTTATGACTTAGAAATTTTG
GTCGATAGAGCTGTCCATGGTGCTGTTAGCCGTTTTCTGCCACTGAGCCTTAACCAGAAGCAAAGTCCCACTTTAATTGAAAGTGATTTTTCTCTAGCAATGAAAGAATT
TGTTCCGGCTTCAATGCGTGATATAACAAAACCATCTGCAGAAGGTGGTCGCTCTGGCTGGGATGATGTTGGTGGTTTAGTGGAAGTTAAGAATTCTATCAAAGAGATGC
TTGTATTTCCTTCAAAGTTTCCCAATATCTTTGCTCAAGCCCCTTTAAGGTTACGGTCAAATGTTCTCCTCTACGGACCTCCTGGTTGTGGCAAGACGCACATAGTTGGT
GCTGCTGCTGCTGCTTGTTCATTGCGATTTATATCAGTCAAAGGGCCTGAACTTTTAAACAAATACATTGGTGCTTCTGAGCAAGCTGTTCGGGATATTTTCTCTAAAGC
AACCGCAGCTGCACCTTGCATACTCTTCTTTGACGAGTTTGATTCAATTGCCCCAAAGAGAGGGCACGATAATACTGGAGTGACTGATCGTGTAGTCAATCAATTTCTTA
CTGAATTGGATGGTGTTGAAGTTCTAACCGGTGTATTTGTTTTTGCTGCAACAAGTAGGCCAGATTTGCTTGATGCTGCCCTGCTAAGACCTGGCAGACTGGATCGCCTT
CTGTTTTGTGATTTCCCCTCTCCAGTGGAACGGCTAAATATTCTTCAAGTTCTTTCCACAAAGGTTGGTGGCTCTAAACTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAGAAAGGTTCTTGCCCATGAATTTCGAGCAAATCCTATACGATCAATACCAAAATTGTAAGCAAGGTACAAGATCGGTGATGGAGTACACTGAGGAATTCCATCG
ATTAGGAGCACGAACGAATCTTGGAGAAAGCCAACAATATTTAGTTGCAAGATACAAAGGAGGTCTTCGAAATGACATCAAAGAACAACTCTCTTTACAACCTATTGGAT
ATCTAAATGAGGCCATCTCAGCAGCAGCCGTCATTGAAGAACAAATAACCAACCGCTTTAAAAGAACATATTCAAGAAGAGCCACAGGTGAACAAACCAATCCATTCAGC
AAAACTACCTCGGGTGAAAGATTGCCACAGCAGGCCTCCTCGGCAATTTCAAAGGGAAAGCAAATCGACAATGGATCAAATAAAGATGTCAAGAATCAAGAAAATCCAAA
AAGATCTACAAATACATATTCAAGACCAACTATGGGAAAGTGTTTTCGATGCAACCAAACCGGTCATTTGTCAAACGAATGTCCACAAAGAAGAACTCTCACTATTGCAG
ATGATGTCGAATATGTGGAAGAACGGGATGGCACTGAATCCGAAGAAAACACAGATTTCCTTGAACCATATGAAGGAGAACCGGTATCATTAGTAGTCCAAAGACTTTTG
CTCGCACCCAAGAAAGATCCAAGTTATCAAAGACATGCTCTTTTTAAAACAAGATGCACCATCAATGGGAAAATTTGCAACGTTATCATCGACAGTGGGAGCACCGAGAA
CATTGTGGCAAGCAAGTTAGTAACCTCCCTAAACCTACCTTTACACCCTCATTCAGCACCTTACAAGGTGAGTTGGATAAAGAAAGGGGGCGAGGCTCAAGTGACACATA
CCAGCACTATTCCTTTATCTATTGGAGCAAGCTACAAGGACCAAATAATTTGTGATGTCTTAGACATGGATGCTTGTCATATCCTCCTCGGCCGACTATGGCAATATGAT
GTGCAAGCAGTTCACAAGGGCCGTGATAACACATATGAATTCCATTGGATGGGAAAGAAAATCCAACATATTATTGGCTTTGTTGTTAAACATTTTGATACTTCCATTTC
TGACCATGTCATTCCTAATGAGGTACAAAAATTGCTACAACAACACACGACCATAGTCGACAACCCAACAAAACTTCCTCCTTTAAGAGATATCCAACATTCTATTGATC
TTATCCCGGGCGCTACACTCCCCAACCTTCCACACTATAAGATGAGCCCAACAGAATATCAAATACTTCATGAACAAATACAAGAATTGCTGGATAAAGGACATATTCAA
CCCAGCCTCAGTCCTTGTGCCGTTCCTGCCCTCCTCACTCCCAAAAACGATGGCACTTGGAGGATGTGTGTAGATAGCCGAGCAATTAACAAGATTACGGTGAAATACCG
TTTTCCAATTCCAAGGATTGGTGACTTACTAGATCAATTAGGAGGGGCAACAATATTCTCGAAGATTGATTTAAAGAGTGGCTACCACCAAATCCGAATCCGACCGGGAG
ATGAGTGGAAAACAGCGTTCAAAACCAATGAAGGTCTCTTTGAATGGCTTGTAATGCCTTTCGGATTATCAAATGCCCCAAGTACCTTCATGAGGCTTATGAACCAGGGG
AAATTTCTTTGGGGACTTGAACAACAAAAGAGCTTCGGTGTTATCAAAGAAAAACTTGCCAGTAGTCCCGTATTAGCCCTCCCAAATTTCAATACTCCTTTTGAAGTAGC
AGTTGATGCTTCAGGCATAGGTGTAGGAGCTGTTTTATCACAAAATTCCCATCCAATCGATTTTTTTAGTGAAAAACTAAGCCCCTCAAGACAAAACTGGTCCACATTTG
ATTTTGTGATCAAACATACGTCTGGTATTACTAACAAAGTAGCAGATACATTGAGCCGAAAGGAAAGCATTTTGACCATTTTAAAAGGAGAAATAATAGCTATGGACCAC
CTACCGGAAACATATGCTGATGATAAAGATTTTGGCACCACATGGAAAAAATGTATAGACCATACGCCGGCCAGTGGCTTCCATATCTTGGATGGATTCCTCTTCAAGGG
GGATGTCTTATGCATTCCACATTCGTCTTTAAGGGAAGCGCTAATTCGAGAAGCTCATGCTGGAGGTTTAGCCGGACATCTTGGGCGAGACAAGACATTACAAATTCTAG
AAACCAGATATTTTTGGTCGCAATTAAAGAGGGATGTCACCAATTTCGTAGCACGATACCGATACAGCAAAATGGCTCATTTTTTGCCATGTAAGAAAACGTTAGATGCT
GTATACATCGCAAACCTATTCTTCAAAGAGATAGTACGACTTCATGGTATTCCAAAATCGATTGTTTCCGATAGGGATGTGAAATTCCTAGGACATTTCTGGAAAACCCT
TTGGAAGAAGTTCGATACAAAACTCAAGTACAGCACTACAAGCCATCCTCAAACTGATGGTCAAACGGAAGTAACAAACCGGACCTTGGGCAACTTGTTAAGTACGGAAG
CTGAGGAGATGGCTACCAGAATCACGGAGTTGCATAACCAAGTCCAAGACCACATTCAAAAGCAAACTGCCAAGTACAAAGAAGCTGCAGATCGGAAGAAGTACAAAGAA
GCTGCAGATCGGAAGAAGAGATTCAAGGAATTCGAAGTTGGAGATCTAGTGATGATACATTTGAGGAAAAGCAGGCTCCCAGCAGGAACCTACAACAAGCTCAAGAAAAG
AAAGCTTGGCCCATTCCCAATATTGGAAAGATTGGAGAAAATGCTTATAAAATCCAACTACCATTTAATAGTTTTTGTATCTTGTTCTCGACTTGCTTCTGAAAAAGCCC
AGACTATACGCCAGTCACTATTGAACTATATATCAGAAGCTTTAGATCACGCGCCTTCTCTTATTGTTTTGGATGACTTAGACAGTATTGTCTTGTCAACATCTGACTCG
GAAGGATTTCAGCCATCCACATCAATGTCTGCAATGACAGAGTTTCTCACTGACGTGATAGATGAATATGAGGAAAAGAGGAAAAGCTCATGTCAGGTTGGTCCTATTGC
ATTTGTGGCTTCTGTTCAGACTCTAGATAAGATACCACAATCATTGAGATCTTCAGGAAGATTCGACTTTCATGTGGAATTGCCTGCTCCTGCAGCATCAGAGCGAACTG
CTATTCTGAAGCATGAAATACAGAGACGTTCCTTAGATTGTTCGGATGTGACCTTACAAGACATTGCCTCAAAATGTGATGGATATGATGCTTATGACTTAGAAATTTTG
GTCGATAGAGCTGTCCATGGTGCTGTTAGCCGTTTTCTGCCACTGAGCCTTAACCAGAAGCAAAGTCCCACTTTAATTGAAAGTGATTTTTCTCTAGCAATGAAAGAATT
TGTTCCGGCTTCAATGCGTGATATAACAAAACCATCTGCAGAAGGTGGTCGCTCTGGCTGGGATGATGTTGGTGGTTTAGTGGAAGTTAAGAATTCTATCAAAGAGATGC
TTGTATTTCCTTCAAAGTTTCCCAATATCTTTGCTCAAGCCCCTTTAAGGTTACGGTCAAATGTTCTCCTCTACGGACCTCCTGGTTGTGGCAAGACGCACATAGTTGGT
GCTGCTGCTGCTGCTTGTTCATTGCGATTTATATCAGTCAAAGGGCCTGAACTTTTAAACAAATACATTGGTGCTTCTGAGCAAGCTGTTCGGGATATTTTCTCTAAAGC
AACCGCAGCTGCACCTTGCATACTCTTCTTTGACGAGTTTGATTCAATTGCCCCAAAGAGAGGGCACGATAATACTGGAGTGACTGATCGTGTAGTCAATCAATTTCTTA
CTGAATTGGATGGTGTTGAAGTTCTAACCGGTGTATTTGTTTTTGCTGCAACAAGTAGGCCAGATTTGCTTGATGCTGCCCTGCTAAGACCTGGCAGACTGGATCGCCTT
CTGTTTTGTGATTTCCCCTCTCCAGTGGAACGGCTAAATATTCTTCAAGTTCTTTCCACAAAGGTTGGTGGCTCTAAACTCTAA
Protein sequenceShow/hide protein sequence
MKERFLPMNFEQILYDQYQNCKQGTRSVMEYTEEFHRLGARTNLGESQQYLVARYKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQITNRFKRTYSRRATGEQTNPFS
KTTSGERLPQQASSAISKGKQIDNGSNKDVKNQENPKRSTNTYSRPTMGKCFRCNQTGHLSNECPQRRTLTIADDVEYVEERDGTESEENTDFLEPYEGEPVSLVVQRLL
LAPKKDPSYQRHALFKTRCTINGKICNVIIDSGSTENIVASKLVTSLNLPLHPHSAPYKVSWIKKGGEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRLWQYD
VQAVHKGRDNTYEFHWMGKKIQHIIGFVVKHFDTSISDHVIPNEVQKLLQQHTTIVDNPTKLPPLRDIQHSIDLIPGATLPNLPHYKMSPTEYQILHEQIQELLDKGHIQ
PSLSPCAVPALLTPKNDGTWRMCVDSRAINKITVKYRFPIPRIGDLLDQLGGATIFSKIDLKSGYHQIRIRPGDEWKTAFKTNEGLFEWLVMPFGLSNAPSTFMRLMNQG
KFLWGLEQQKSFGVIKEKLASSPVLALPNFNTPFEVAVDASGIGVGAVLSQNSHPIDFFSEKLSPSRQNWSTFDFVIKHTSGITNKVADTLSRKESILTILKGEIIAMDH
LPETYADDKDFGTTWKKCIDHTPASGFHILDGFLFKGDVLCIPHSSLREALIREAHAGGLAGHLGRDKTLQILETRYFWSQLKRDVTNFVARYRYSKMAHFLPCKKTLDA
VYIANLFFKEIVRLHGIPKSIVSDRDVKFLGHFWKTLWKKFDTKLKYSTTSHPQTDGQTEVTNRTLGNLLSTEAEEMATRITELHNQVQDHIQKQTAKYKEAADRKKYKE
AADRKKRFKEFEVGDLVMIHLRKSRLPAGTYNKLKKRKLGPFPILERLEKMLIKSNYHLIVFVSCSRLASEKAQTIRQSLLNYISEALDHAPSLIVLDDLDSIVLSTSDS
EGFQPSTSMSAMTEFLTDVIDEYEEKRKSSCQVGPIAFVASVQTLDKIPQSLRSSGRFDFHVELPAPAASERTAILKHEIQRRSLDCSDVTLQDIASKCDGYDAYDLEIL
VDRAVHGAVSRFLPLSLNQKQSPTLIESDFSLAMKEFVPASMRDITKPSAEGGRSGWDDVGGLVEVKNSIKEMLVFPSKFPNIFAQAPLRLRSNVLLYGPPGCGKTHIVG
AAAAACSLRFISVKGPELLNKYIGASEQAVRDIFSKATAAAPCILFFDEFDSIAPKRGHDNTGVTDRVVNQFLTELDGVEVLTGVFVFAATSRPDLLDAALLRPGRLDRL
LFCDFPSPVERLNILQVLSTKVGGSKL