; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc07G07425 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc07G07425
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionBeta-galactosidase
Genome locationClcChr07:19806296..19811299
RNA-Seq ExpressionClc07G07425
SyntenyClc07G07425
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR029472 - Retrotransposon Copia-like, N-terminal
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052172.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.6e-29144.53Show/hide
Query:  PSSAKVQQQLANLQASFQQQIAALGAALGASTIFDQGPINSGIQSS------LPIYLKNPITVYLTLTTAPHMSGPMGTSMGLIVGEKLNGQNYFSWSQS
        PS+  +  Q +  Q         L +  G     ++  I +G  S+      LP+Y KNP+T +   + + +++G +G+S G   GEKLNGQNYFSWSQS
Subjt:  PSSAKVQQQLANLQASFQQQIAALGAALGASTIFDQGPINSGIQSS------LPIYLKNPITVYLTLTTAPHMSGPMGTSMGLIVGEKLNGQNYFSWSQS

Query:  IKMELEGRHKFGYLTGEISKPKPGDPQEPI-----------------------------AHDIWDAIQKLYSKRQNASRLYTLRKQIHECKQGSMD----
        IKM LEGR++FG+LTGE  +P PGD  E +                             A D+WD  Q LYSKRQNASRLYTLRKQ+H CKQG++D    
Subjt:  IKMELEGRHKFGYLTGEISKPKPGDPQEPI-----------------------------AHDIWDAIQKLYSKRQNASRLYTLRKQIHECKQGSMD----

Query:  -------------------DCPCGGVQYYKLEEVDRVYD-----------------------------FSVRLEEDRSSAMNITTTSITDSASFSAKSSG
                           D P    QY KLEE DRVYD                             F VRLEEDR++AM + TT   DSA+FSA+SS 
Subjt:  -------------------DCPCGGVQYYKLEEVDRVYD-----------------------------FSVRLEEDRSSAMNITTTSITDSASFSAKSSG

Query:  TTKQR---------------------------RPPNNK---------PSQALVSETTSGSQPQYQENCPVDTGTSSLGVIAQ------------------
            +                           RPP  K           +A +SET   S  Q  +     T T +LG IAQ                  
Subjt:  TTKQR---------------------------RPPNNK---------PSQALVSETTSGSQPQYQENCPVDTGTSSLGVIAQ------------------

Query:  ---SGATDHLTGTSDNFLLYHPCAGNEKIWIVEGTLATVAGKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNYHMAFSPDDVIFQDLSSGRMIGT
           SGATDHLTG+S++F+ Y  CAGNEKI I +G+LA +AGKG I PFDG  LQNVLHVPK+SYNLLS+SKITR+L+    F P+ V FQD+SSGR IGT
Subjt:  ---SGATDHLTGTSDNFLLYHPCAGNEKIWIVEGTLATVAGKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNYHMAFSPDDVIFQDLSSGRMIGT

Query:  ARHNTGLYFLDGDSSSRHSYRASL-------------------------------------------------------ITTSSGKRWFVTFIDDHTRLT
        ARH+ GLY LD D+S     R SL                                                       +TTSSGKRWFVTFIDDHTRLT
Subjt:  ARHNTGLYFLDGDSSSRHSYRASL-------------------------------------------------------ITTSSGKRWFVTFIDDHTRLT

Query:  WVFLLTDKSEVSSIFQQLYTTIITQFNAKISILRSDNGRKFLPNTLREFLSTKG-VHQSSCAYTPQQNGVAERKNRHLLEVARPSNKSDA----------
        WV+L++DKSEV SIFQ  Y TI TQF+ KI+ILRSDNGR+F  + L EFL++KG VHQ+SCAYTPQQNGVAERKNRHL+EVAR    S +          
Subjt:  WVFLLTDKSEVSSIFQQLYTTIITQFNAKISILRSDNGRKFLPNTLREFLSTKG-VHQSSCAYTPQQNGVAERKNRHLLEVARPSNKSDA----------

Query:  -----------SRVLNFQTPLDCLKLSYPTTRLLSDVPLRVFGCSAF--------------------------------------KYFISMDITFLKDKP
                   SR+L+ QTPLDCLK SYP+TRL+S+VPLRVFGC+A+                                      KYF++MD+TF +++P
Subjt:  -----------SRVLNFQTPLDCLKLSYPTTRLLSDVPLRVFGCSAF--------------------------------------KYFISMDITFLKDKP

Query:  FFPVSPLQEESSNEETNYSNSLPEPILTT------HNTVLPTRQAQGTT------------------DPDNNIVFAEDVCVENDIVDLTKLPI-ENDRDD
        +FPVS LQ E+ +EE+N +    EP L T      H  +LPT Q    T                   P  N     D  +EN     T   + END+ D
Subjt:  FFPVSPLQEESSNEETNYSNSLPEPILTT------HNTVLPTRQAQGTT------------------DPDNNIVFAEDVCVENDIVDLTKLPI-ENDRDD

Query:  ------MTENNQENENEIMPQNFTTKEE---LDKPGEYDATLDMPIALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTVSLNTVTIPRNIYVAMEIPEWKA
              M E N ++E E+  +    + E     K  EYD +LD+PIALRKGTRS TK+P+ +++SY+NLS +FRAFT +L++  IP+NIY A+E PEWK 
Subjt:  ------MTENNQENENEIMPQNFTTKEE---LDKPGEYDATLDMPIALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTVSLNTVTIPRNIYVAMEIPEWKA

Query:  T----------------------HKPVGCKWVFTMKYKSDGTLDKYKARLVAKGFTQTFGIDYSETFSPVAKLNTIRVLLLVAVNKDWPLYQLDEKNAFL
                               HK VGCKWVF++KYK+DGTLD++KARLVAKGFTQT+GIDYSETFSPVAKLNT+RVLL VAVNKDWPLYQLD KNAFL
Subjt:  T----------------------HKPVGCKWVFTMKYKSDGTLDKYKARLVAKGFTQTFGIDYSETFSPVAKLNTIRVLLLVAVNKDWPLYQLDEKNAFL

Query:  NGDLKEEVYMSHPPGFEAPFDHCVCRLRKALYGL--------------------NQGHSDPTLFTKRSVSSKEMILLK---------------STSKNRK
        NGDL EEVYMS PPGFEA F   VC+L+K+LYGL                    +QGHSD TLFTK S + K  IL+                S  K R 
Subjt:  NGDLKEEVYMSHPPGFEAPFDHCVCRLRKALYGL--------------------NQGHSDPTLFTKRSVSSKEMILLK---------------STSKNRK

Query:  WRTNLIKNLGNLKYFLGMEMA--------------------------RPADTPIECNANLGESVDKVPVNKE---------------RPDISYAVSVVCQ
             IK+LGNLKYFLGME+A                          RPADTPIE N  LG S D+VPV+KE               RPDIS+AVSVV Q
Subjt:  WRTNLIKNLGNLKYFLGMEMA--------------------------RPADTPIECNANLGESVDKVPVNKE---------------RPDISYAVSVVCQ

Query:  FMQAPYEEHVEAINMILRCLKSTS------RKENALNPIPTLT----GRSVIDRKSTS-------------RSKKQSVVARSSAKAKYKAMSHGICEEIW
        FMQAPYE+H+EA+N ILR LK+T       RK N    I   T      SVIDRKSTS             RSKKQSVVARSSA+A+Y+AMS GICEEIW
Subjt:  FMQAPYEEHVEAINMILRCLKSTS------RKENALNPIPTLT----GRSVIDRKSTS-------------RSKKQSVVARSSAKAKYKAMSHGICEEIW

Query:  LKKVLSDIHQTGDLPMKLYCDNKAAISIANNP-----------------EKLENGSICIPYIPSSQQVTDILTKGLLKQNFD
        L+KVLSD+HQ  + P+KL+CDNKAAISIANNP                 E+L++GSICIPYIPSSQQ+ D+LTKGLL+ +FD
Subjt:  LKKVLSDIHQTGDLPMKLYCDNKAAISIANNP-----------------EKLENGSICIPYIPSSQQVTDILTKGLLKQNFD

KAA0057250.1 Beta-galactosidase [Cucumis melo var. makuwa]7.8e-29143.94Show/hide
Query:  QHGSVVPSSAKVQQQLANLQASFQQQIAALGAALGASTIFDQGPINSGIQ----------SSLPIYLKNPITVYLTLTTAPHMSGPMGTSMGLIVGEKLN
        QH S+V         L+N  +  Q  +  L   L +    DQ    S I+          + LP+Y KNP+T +   + + +++G +G+S G   GEKLN
Subjt:  QHGSVVPSSAKVQQQLANLQASFQQQIAALGAALGASTIFDQGPINSGIQ----------SSLPIYLKNPITVYLTLTTAPHMSGPMGTSMGLIVGEKLN

Query:  GQNYFSWSQSIKMELEGRHKFGYLTGEISKPKPGDPQEPI-----------------------------AHDIWDAIQKLYSKRQNASRLYTLRKQIHEC
        GQNYFSWSQSIKM LEGR++FG+LTGEI +P PGD  E +                             A D+WD  Q LYSKRQNASRLYTLRKQ+H C
Subjt:  GQNYFSWSQSIKMELEGRHKFGYLTGEISKPKPGDPQEPI-----------------------------AHDIWDAIQKLYSKRQNASRLYTLRKQIHEC

Query:  KQGSMD-----------------------DCPCGGVQYYKLEEVDRVYD-----------------------------FSVRLEEDRSSAMNITTTSITD
        KQG++D                       D P    QY KLEE DRVYD                             F VRLEEDR++AM + TT   D
Subjt:  KQGSMD-----------------------DCPCGGVQYYKLEEVDRVYD-----------------------------FSVRLEEDRSSAMNITTTSITD

Query:  SASFSAKSSGTTKQR---------------------------RPPNNK---------PSQALVSETTSGSQPQYQENCPVDTGTSSLGVIAQ--------
        SA+FSA+SS     +                           RPP  K           +A +SETT  S  Q  +     T T +LG IAQ        
Subjt:  SASFSAKSSGTTKQR---------------------------RPPNNK---------PSQALVSETTSGSQPQYQENCPVDTGTSSLGVIAQ--------

Query:  -------------SGATDHLTGTSDNFLLYHPCAGNEKIWIVEGTLATVAGKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNYHMAFSPDDVIFQ
                     SGATDHLTG+S++F+ Y PCAGNEKI I +G+LA +AGKG I PFDG  LQNVLHVPK+SYNLLS+SKITR+L+    F P+ V FQ
Subjt:  -------------SGATDHLTGTSDNFLLYHPCAGNEKIWIVEGTLATVAGKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNYHMAFSPDDVIFQ

Query:  DLSSGRMIGTARHNTGLYFLDGDSSSRHSYRASL------------------------------------------------------------------
        D+SSGR IGTARH+ GLY LD D+S     R SL                                                                  
Subjt:  DLSSGRMIGTARHNTGLYFLDGDSSSRHSYRASL------------------------------------------------------------------

Query:  ---------------ITTSSGKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQLYTTIITQFNAKISILRSDNGRKFLPNTLREFLSTKG-VHQSSCAYT
                       +TTSSGKRWFVTFIDDHTRLTWV+L++DKSEV SIFQ  Y TI TQF+ KI+ILRSDNGR+F  + L EFL++KG VHQ+SCAYT
Subjt:  ---------------ITTSSGKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQLYTTIITQFNAKISILRSDNGRKFLPNTLREFLSTKG-VHQSSCAYT

Query:  PQQNGVAERKNRHLLEVARPSNKSDA---------------------SRVLNFQTPLDCLKLSYPTTRLLSDVPLRVFGCSAF-----------------
        PQQNGVAERKNRHL+EVAR    S +                     SR+L+ QTPLDCLK SYP+TRL+S+VPLRVFGC+A+                 
Subjt:  PQQNGVAERKNRHLLEVARPSNKSDA---------------------SRVLNFQTPLDCLKLSYPTTRLLSDVPLRVFGCSAF-----------------

Query:  ---------------------KYFISMDITFLKDKPFFPVSPLQEESSNEETNYSNSLPEPILTT------HNTVLPTRQAQGTT---------------
                             KYF++MD+TF +++P+FPVS LQ E+ +EE+N +    EP L T      H  +LPT Q    T               
Subjt:  ---------------------KYFISMDITFLKDKPFFPVSPLQEESSNEETNYSNSLPEPILTT------HNTVLPTRQAQGTT---------------

Query:  ---DPDNNIVFAEDVCVENDIVDLTKLPI-ENDRDD------MTENNQENENEIMPQNFTTKEE---LDKPGEYDATLDMPIALRKGTRSYTKYPMYSFL
            P  N     D  +EN     T   + END+ D      M E N ++E E+  +    + E     K  EYD +LD+PIALRKGTRS TK+P+ +++
Subjt:  ---DPDNNIVFAEDVCVENDIVDLTKLPI-ENDRDD------MTENNQENENEIMPQNFTTKEE---LDKPGEYDATLDMPIALRKGTRSYTKYPMYSFL

Query:  SYNNLSSKFRAFTVSLNTVTIPRNIYVAMEIPEWKAT----------------------HKPVGCKWVFTMKYKSDGTLDKYKARLVAKGFTQTFGIDYS
        SY+NLS +FRAFT +L++  IP+NIY A+E PEWK                        HK VGCKWVF++KYK+DGTLD++KARLVAKGFTQT+GIDYS
Subjt:  SYNNLSSKFRAFTVSLNTVTIPRNIYVAMEIPEWKAT----------------------HKPVGCKWVFTMKYKSDGTLDKYKARLVAKGFTQTFGIDYS

Query:  ETFSPVAKLNTIRVLLLVAVNKDWPLYQLDEKNAFLNGDLKEEVYMSHPPGFEAPFDHCVCRLRKALYGL--------------------NQGHSDPTLF
        ETFSPVAKLNT+RVLL VAVNKDWPLYQLD KNAFLNGDL EEVYMS PPGFEA F   VC+L+K+LYGL                    +QGHSD TLF
Subjt:  ETFSPVAKLNTIRVLLLVAVNKDWPLYQLDEKNAFLNGDLKEEVYMSHPPGFEAPFDHCVCRLRKALYGL--------------------NQGHSDPTLF

Query:  TKRSVSSKEMILLK---------------STSKNRKWRTNLIKNLGNLKYFLGMEMA--------------------------RPADTPIECNANLGESV
        TK S + K  IL+                S  K R      IK+LGNLKYFLGME+A                          RPADTPIE N  LG S 
Subjt:  TKRSVSSKEMILLK---------------STSKNRKWRTNLIKNLGNLKYFLGMEMA--------------------------RPADTPIECNANLGESV

Query:  DKVPVNKE---------------RPDISYAVSVVCQFMQAPYEEHVEAINMILRCLKSTS------RKENALNPIPTLT----GRSVIDRKSTS------
        D+VPV+KE               RPDIS+AVSVV QFMQAPYE+H+EA+N ILR LK+T       RK N    I   T      SVIDRKSTS      
Subjt:  DKVPVNKE---------------RPDISYAVSVVCQFMQAPYEEHVEAINMILRCLKSTS------RKENALNPIPTLT----GRSVIDRKSTS------

Query:  -------RSKKQSVVARSSAKAKYKAMSHGICEEIWLKKVLSDIHQTGDLPMKLYCDNKAAISIANNP-----------------EKLENGSICIPYIPS
               RSKKQSVVARSSA+A+Y+AMS GICEEIWL+KVLSD+HQ  + P+KL+CDNKAAISIANNP                 E+L++GSICIPYIPS
Subjt:  -------RSKKQSVVARSSAKAKYKAMSHGICEEIWLKKVLSDIHQTGDLPMKLYCDNKAAISIANNP-----------------EKLENGSICIPYIPS

Query:  SQQVTDILTKGLLKQNFD
        SQQ+ D+LTKGLL+ +FD
Subjt:  SQQVTDILTKGLLKQNFD

KAA0061823.1 Beta-galactosidase [Cucumis melo var. makuwa]2.7e-29144.86Show/hide
Query:  SSLPIYLKNPITVYLTLTTAPHMSGPMGTSMGLIVGEKLNGQNYFSWSQSIKMELEGRHKFGYLTGEISKPKPGDPQEPI--------------------
        + LP+Y KNP+T +   + + +++G +G+S G   GEKLNGQNYFSWSQSIKM LEGR++FG+LTGEI +P PGD  E +                    
Subjt:  SSLPIYLKNPITVYLTLTTAPHMSGPMGTSMGLIVGEKLNGQNYFSWSQSIKMELEGRHKFGYLTGEISKPKPGDPQEPI--------------------

Query:  ---------AHDIWDAIQKLYSKRQNASRLYTLRKQIHECKQGSMD-----------------------DCPCGGVQYYKLEEVDRVYD-----------
                 A D+WD  Q LYSKRQNASRLYTLRKQ+H CKQG++D                       D P    QY KLEE DRVYD           
Subjt:  ---------AHDIWDAIQKLYSKRQNASRLYTLRKQIHECKQGSMD-----------------------DCPCGGVQYYKLEEVDRVYD-----------

Query:  ------------------FSVRLEEDRSSAMNITTTSITDSASFSAKSSGTTKQR---------------------------RPPNNK---------PSQ
                          F VRLEEDR++AM + TT   DSA+FSA+SS     +                           RPP  K           +
Subjt:  ------------------FSVRLEEDRSSAMNITTTSITDSASFSAKSSGTTKQR---------------------------RPPNNK---------PSQ

Query:  ALVSETTSGSQPQYQENCPVDTGTSSLGVIAQ---------------------SGATDHLTGTSDNFLLYHPCAGNEKIWIVEGTLATVAGKGHISPFDG
        A +SETT  S  Q  +     T T +LG IAQ                     SGATDHLTG+S++F+ Y PCAGNEKI I +G+LA +AGKG I PFDG
Subjt:  ALVSETTSGSQPQYQENCPVDTGTSSLGVIAQ---------------------SGATDHLTGTSDNFLLYHPCAGNEKIWIVEGTLATVAGKGHISPFDG

Query:  LILQNVLHVPKISYNLLSVSKITRDLNYHMAFSPDDVIFQDLSSGRMIGTARHNTGLYFLDGDSSSRHSYRASL--------------------------
          LQNVLHVPK+SYNLLS+SKITR+L+    F P+ V FQD+SSGR IGTARH+ GLY LD D+S     R SL                          
Subjt:  LILQNVLHVPKISYNLLSVSKITRDLNYHMAFSPDDVIFQDLSSGRMIGTARHNTGLYFLDGDSSSRHSYRASL--------------------------

Query:  -------------------------------------------------------ITTSSGKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQLYTTIIT
                                                               +TTSSGKRWFVTFIDDHTRLTWV+L++DKSEV SIFQ  Y TI T
Subjt:  -------------------------------------------------------ITTSSGKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQLYTTIIT

Query:  QFNAKISILRSDNGRKFLPNTLREFLSTKG-VHQSSCAYTPQQNGVAERKNRHLLEVARPSNKSDA---------------------SRVLNFQTPLDCL
        QF+ KI+ILRSDNGR+F  + L EFL++KG VHQ+SCAYTPQQNGVAERKNRHL+EVAR    S +                     SR+L+ QTPLDCL
Subjt:  QFNAKISILRSDNGRKFLPNTLREFLSTKG-VHQSSCAYTPQQNGVAERKNRHLLEVARPSNKSDA---------------------SRVLNFQTPLDCL

Query:  KLSYPTTRLLSDVPLRVFGCSAF--------------------------------------KYFISMDITFLKDKPFFPVSPLQEESSNEETNYSNSLPE
        K SYP+TRL+S+VPLRVFGC+A+                                      KYF++MD+TF +++P+FPVS LQ E+ +EE+N +    E
Subjt:  KLSYPTTRLLSDVPLRVFGCSAF--------------------------------------KYFISMDITFLKDKPFFPVSPLQEESSNEETNYSNSLPE

Query:  PILTT------HNTVLPTRQAQGTT------------------DPDNNIVFAEDVCVENDIVDLTKLPI-ENDRDD------MTENNQENENEIMPQNFT
        P L T      H  +LPT Q    T                   P  N     D  +EN     T   + END+ D      M E N ++E E+  +   
Subjt:  PILTT------HNTVLPTRQAQGTT------------------DPDNNIVFAEDVCVENDIVDLTKLPI-ENDRDD------MTENNQENENEIMPQNFT

Query:  TKEE---LDKPGEYDATLDMPIALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTVSLNTVTIPRNIYVAMEIPEWKAT----------------------H
         + E     K  EYD +LD+PIALRKGTRS TK+P+ +++SY+NLS +FRAFT +L++  IP+NIY A+E PEWK                        H
Subjt:  TKEE---LDKPGEYDATLDMPIALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTVSLNTVTIPRNIYVAMEIPEWKAT----------------------H

Query:  KPVGCKWVFTMKYKSDGTLDKYKARLVAKGFTQTFGIDYSETFSPVAKLNTIRVLLLVAVNKDWPLYQLDEKNAFLNGDLKEEVYMSHPPGFEAPFDHCV
        K VGCKWVF++KYK+DGTLD++KARLVAKGFTQT+GIDYSETFSPVAKLNT+RVLL VAVNKDWPLYQLD KNAFLNGDL EEVYMS PPGFEA F   V
Subjt:  KPVGCKWVFTMKYKSDGTLDKYKARLVAKGFTQTFGIDYSETFSPVAKLNTIRVLLLVAVNKDWPLYQLDEKNAFLNGDLKEEVYMSHPPGFEAPFDHCV

Query:  CRLRKALYGL--------------------NQGHSDPTLFTKRSVSSKEMILLK---------------STSKNRKWRTNLIKNLGNLKYFLGMEMA---
        C+L+K+LYGL                    +QGHSD TLFTK S + K  IL+                S  K R      IK+LGNLKYFLGME+A   
Subjt:  CRLRKALYGL--------------------NQGHSDPTLFTKRSVSSKEMILLK---------------STSKNRKWRTNLIKNLGNLKYFLGMEMA---

Query:  -----------------------RPADTPIECNANLGESVDKVPVNKE---------------RPDISYAVSVVCQFMQAPYEEHVEAINMILRCLKSTS
                               RPADTPIE N  LG S D+VPV+KE               RPDIS+AVSVV QFMQAPYE+H+EA+N ILR LK+T 
Subjt:  -----------------------RPADTPIECNANLGESVDKVPVNKE---------------RPDISYAVSVVCQFMQAPYEEHVEAINMILRCLKSTS

Query:  ------RKENALNPIPTLT----GRSVIDRKSTS-------------RSKKQSVVARSSAKAKYKAMSHGICEEIWLKKVLSDIHQTGDLPMKLYCDNKA
              RK N    I   T      SVIDRKSTS             RSKKQSVVARSSA+A+Y+AMS GICEEIWL+KVLSD+HQ  + P+KL+CDNKA
Subjt:  ------RKENALNPIPTLT----GRSVIDRKSTS-------------RSKKQSVVARSSAKAKYKAMSHGICEEIWLKKVLSDIHQTGDLPMKLYCDNKA

Query:  AISIANNP-----------------EKLENGSICIPYIPSSQQVTDILTKGLLKQNFD
        AISIANNP                 E+L++GSICIPYIPSSQQ+ D+LTKGLL+ +FD
Subjt:  AISIANNP-----------------EKLENGSICIPYIPSSQQVTDILTKGLLKQNFD

TYK08054.1 Beta-galactosidase [Cucumis melo var. makuwa]7.8e-29143.94Show/hide
Query:  QHGSVVPSSAKVQQQLANLQASFQQQIAALGAALGASTIFDQGPINSGIQ----------SSLPIYLKNPITVYLTLTTAPHMSGPMGTSMGLIVGEKLN
        QH S+V         L+N  +  Q  +  L   L +    DQ    S I+          + LP+Y KNP+T +   + + +++G +G+S G   GEKLN
Subjt:  QHGSVVPSSAKVQQQLANLQASFQQQIAALGAALGASTIFDQGPINSGIQ----------SSLPIYLKNPITVYLTLTTAPHMSGPMGTSMGLIVGEKLN

Query:  GQNYFSWSQSIKMELEGRHKFGYLTGEISKPKPGDPQEPI-----------------------------AHDIWDAIQKLYSKRQNASRLYTLRKQIHEC
        GQNYFSWSQSIKM LEGR++FG+LTGEI +P PGD  E +                             A D+WD  Q LYSKRQNASRLYTLRKQ+H C
Subjt:  GQNYFSWSQSIKMELEGRHKFGYLTGEISKPKPGDPQEPI-----------------------------AHDIWDAIQKLYSKRQNASRLYTLRKQIHEC

Query:  KQGSMD-----------------------DCPCGGVQYYKLEEVDRVYD-----------------------------FSVRLEEDRSSAMNITTTSITD
        KQG++D                       D P    QY KLEE DRVYD                             F VRLEEDR++AM + TT   D
Subjt:  KQGSMD-----------------------DCPCGGVQYYKLEEVDRVYD-----------------------------FSVRLEEDRSSAMNITTTSITD

Query:  SASFSAKSSGTTKQR---------------------------RPPNNK---------PSQALVSETTSGSQPQYQENCPVDTGTSSLGVIAQ--------
        SA+FSA+SS     +                           RPP  K           +A +SETT  S  Q  +     T T +LG IAQ        
Subjt:  SASFSAKSSGTTKQR---------------------------RPPNNK---------PSQALVSETTSGSQPQYQENCPVDTGTSSLGVIAQ--------

Query:  -------------SGATDHLTGTSDNFLLYHPCAGNEKIWIVEGTLATVAGKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNYHMAFSPDDVIFQ
                     SGATDHLTG+S++F+ Y PCAGNEKI I +G+LA +AGKG I PFDG  LQNVLHVPK+SYNLLS+SKITR+L+    F P+ V FQ
Subjt:  -------------SGATDHLTGTSDNFLLYHPCAGNEKIWIVEGTLATVAGKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNYHMAFSPDDVIFQ

Query:  DLSSGRMIGTARHNTGLYFLDGDSSSRHSYRASL------------------------------------------------------------------
        D+SSGR IGTARH+ GLY LD D+S     R SL                                                                  
Subjt:  DLSSGRMIGTARHNTGLYFLDGDSSSRHSYRASL------------------------------------------------------------------

Query:  ---------------ITTSSGKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQLYTTIITQFNAKISILRSDNGRKFLPNTLREFLSTKG-VHQSSCAYT
                       +TTSSGKRWFVTFIDDHTRLTWV+L++DKSEV SIFQ  Y TI TQF+ KI+ILRSDNGR+F  + L EFL++KG VHQ+SCAYT
Subjt:  ---------------ITTSSGKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQLYTTIITQFNAKISILRSDNGRKFLPNTLREFLSTKG-VHQSSCAYT

Query:  PQQNGVAERKNRHLLEVARPSNKSDA---------------------SRVLNFQTPLDCLKLSYPTTRLLSDVPLRVFGCSAF-----------------
        PQQNGVAERKNRHL+EVAR    S +                     SR+L+ QTPLDCLK SYP+TRL+S+VPLRVFGC+A+                 
Subjt:  PQQNGVAERKNRHLLEVARPSNKSDA---------------------SRVLNFQTPLDCLKLSYPTTRLLSDVPLRVFGCSAF-----------------

Query:  ---------------------KYFISMDITFLKDKPFFPVSPLQEESSNEETNYSNSLPEPILTT------HNTVLPTRQAQGTT---------------
                             KYF++MD+TF +++P+FPVS LQ E+ +EE+N +    EP L T      H  +LPT Q    T               
Subjt:  ---------------------KYFISMDITFLKDKPFFPVSPLQEESSNEETNYSNSLPEPILTT------HNTVLPTRQAQGTT---------------

Query:  ---DPDNNIVFAEDVCVENDIVDLTKLPI-ENDRDD------MTENNQENENEIMPQNFTTKEE---LDKPGEYDATLDMPIALRKGTRSYTKYPMYSFL
            P  N     D  +EN     T   + END+ D      M E N ++E E+  +    + E     K  EYD +LD+PIALRKGTRS TK+P+ +++
Subjt:  ---DPDNNIVFAEDVCVENDIVDLTKLPI-ENDRDD------MTENNQENENEIMPQNFTTKEE---LDKPGEYDATLDMPIALRKGTRSYTKYPMYSFL

Query:  SYNNLSSKFRAFTVSLNTVTIPRNIYVAMEIPEWKAT----------------------HKPVGCKWVFTMKYKSDGTLDKYKARLVAKGFTQTFGIDYS
        SY+NLS +FRAFT +L++  IP+NIY A+E PEWK                        HK VGCKWVF++KYK+DGTLD++KARLVAKGFTQT+GIDYS
Subjt:  SYNNLSSKFRAFTVSLNTVTIPRNIYVAMEIPEWKAT----------------------HKPVGCKWVFTMKYKSDGTLDKYKARLVAKGFTQTFGIDYS

Query:  ETFSPVAKLNTIRVLLLVAVNKDWPLYQLDEKNAFLNGDLKEEVYMSHPPGFEAPFDHCVCRLRKALYGL--------------------NQGHSDPTLF
        ETFSPVAKLNT+RVLL VAVNKDWPLYQLD KNAFLNGDL EEVYMS PPGFEA F   VC+L+K+LYGL                    +QGHSD TLF
Subjt:  ETFSPVAKLNTIRVLLLVAVNKDWPLYQLDEKNAFLNGDLKEEVYMSHPPGFEAPFDHCVCRLRKALYGL--------------------NQGHSDPTLF

Query:  TKRSVSSKEMILLK---------------STSKNRKWRTNLIKNLGNLKYFLGMEMA--------------------------RPADTPIECNANLGESV
        TK S + K  IL+                S  K R      IK+LGNLKYFLGME+A                          RPADTPIE N  LG S 
Subjt:  TKRSVSSKEMILLK---------------STSKNRKWRTNLIKNLGNLKYFLGMEMA--------------------------RPADTPIECNANLGESV

Query:  DKVPVNKE---------------RPDISYAVSVVCQFMQAPYEEHVEAINMILRCLKSTS------RKENALNPIPTLT----GRSVIDRKSTS------
        D+VPV+KE               RPDIS+AVSVV QFMQAPYE+H+EA+N ILR LK+T       RK N    I   T      SVIDRKSTS      
Subjt:  DKVPVNKE---------------RPDISYAVSVVCQFMQAPYEEHVEAINMILRCLKSTS------RKENALNPIPTLT----GRSVIDRKSTS------

Query:  -------RSKKQSVVARSSAKAKYKAMSHGICEEIWLKKVLSDIHQTGDLPMKLYCDNKAAISIANNP-----------------EKLENGSICIPYIPS
               RSKKQSVVARSSA+A+Y+AMS GICEEIWL+KVLSD+HQ  + P+KL+CDNKAAISIANNP                 E+L++GSICIPYIPS
Subjt:  -------RSKKQSVVARSSAKAKYKAMSHGICEEIWLKKVLSDIHQTGDLPMKLYCDNKAAISIANNP-----------------EKLENGSICIPYIPS

Query:  SQQVTDILTKGLLKQNFD
        SQQ+ D+LTKGLL+ +FD
Subjt:  SQQVTDILTKGLLKQNFD

TYK10823.1 Beta-galactosidase [Cucumis melo var. makuwa]6.0e-29143.35Show/hide
Query:  HYFPPTAVVVVGIPH------QGVENYYEQHGSVVPSSAKVQQQLANLQASFQQQ---IAALGAALGASTIFDQGPINSGIQ----------SSLPIYLK
        H  PP  V    +P       Q    +   H   + +S+  Q    NL   + +Q   + +L   L +    DQ    S I+          + LP+Y K
Subjt:  HYFPPTAVVVVGIPH------QGVENYYEQHGSVVPSSAKVQQQLANLQASFQQQ---IAALGAALGASTIFDQGPINSGIQ----------SSLPIYLK

Query:  NPITVYLTLTTAPHMSGPMGTSMGLIVGEKLNGQNYFSWSQSIKMELEGRHKFGYLTGEISKPKPGDPQEPI----------------------------
        NP+T +   + + +++G +G+S G   GEKLNGQNYFSWSQSIKM LEGR++FG+LTGEI +P PGD  E +                            
Subjt:  NPITVYLTLTTAPHMSGPMGTSMGLIVGEKLNGQNYFSWSQSIKMELEGRHKFGYLTGEISKPKPGDPQEPI----------------------------

Query:  -AHDIWDAIQKLYSKRQNASRLYTLRKQIHECKQGSMD-----------------------DCPCGGVQYYKLEEVDRVYD-------------------
         A D+WD  Q LYSKRQNASRLYTLRKQ+H CKQG++D                       D P    QY KLEE DRVYD                   
Subjt:  -AHDIWDAIQKLYSKRQNASRLYTLRKQIHECKQGSMD-----------------------DCPCGGVQYYKLEEVDRVYD-------------------

Query:  ----------FSVRLEEDRSSAMNITTTSITDSASFSAKSSGTTKQR---------------------------RPPNNK---------PSQALVSETTS
                  F VRLEEDR++AM + TT   DSA+FSA+SS     +                           RPP  K           +A +SETT 
Subjt:  ----------FSVRLEEDRSSAMNITTTSITDSASFSAKSSGTTKQR---------------------------RPPNNK---------PSQALVSETTS

Query:  GSQPQYQENCPVDTGTSSLGVIAQ---------------------SGATDHLTGTSDNFLLYHPCAGNEKIWIVEGTLATVAGKGHISPFDGLILQNVLH
         S  Q  +     T T +LG IAQ                     SGATDHLTG+S++F+ Y PCAGNEKI I +G+LA +AGKG I PFDG  LQNVLH
Subjt:  GSQPQYQENCPVDTGTSSLGVIAQ---------------------SGATDHLTGTSDNFLLYHPCAGNEKIWIVEGTLATVAGKGHISPFDGLILQNVLH

Query:  VPKISYNLLSVSKITRDLNYHMAFSPDDVIFQDLSSGRMIGTARHNTGLYFLDGDSSSRHSYRASL----------------------------------
        VPK+SYNLLS+SKITR+L+    F P+ V FQD+SSGR IGTARH+ GLY LD D+S     R SL                                  
Subjt:  VPKISYNLLSVSKITRDLNYHMAFSPDDVIFQDLSSGRMIGTARHNTGLYFLDGDSSSRHSYRASL----------------------------------

Query:  -----------------------------------------------ITTSSGKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQLYTTIITQFNAKISI
                                                       +TTSSGKRWFVTFIDDHTRLTWV+L++DKSEV SIFQ  Y TI TQF+ KI+I
Subjt:  -----------------------------------------------ITTSSGKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQLYTTIITQFNAKISI

Query:  LRSDNGRKFLPNTLREFLSTKG-VHQSSCAYTPQQNGVAERKNRHLLEVARPSNKSDA---------------------SRVLNFQTPLDCLKLSYPTTR
        LRSDNGR+F  + L EFL++KG VHQ+SCAYTPQQNGVAERKNRHL+EVAR    S +                     SR+L+ QTPLDCLK SYP+TR
Subjt:  LRSDNGRKFLPNTLREFLSTKG-VHQSSCAYTPQQNGVAERKNRHLLEVARPSNKSDA---------------------SRVLNFQTPLDCLKLSYPTTR

Query:  LLSDVPLRVFGCSAF--------------------------------------KYFISMDITFLKDKPFFPVSPLQEESSNEETNYSNSLPEPILTT---
        L+S+VPLRVFGC+A+                                      KYF++MD+TF +++P+FPVS LQ E+ +EE+N +    EP L T   
Subjt:  LLSDVPLRVFGCSAF--------------------------------------KYFISMDITFLKDKPFFPVSPLQEESSNEETNYSNSLPEPILTT---

Query:  ---HNTVLPTRQAQGTT------------------DPDNNIVFAEDVCVENDIVDLTKLPI-ENDRDD------MTENNQENENEIMPQNFTTKEE---L
           H  +LPT Q    T                   P  N     D  +EN     T   + END+ D      M E N ++E E+  +    + E    
Subjt:  ---HNTVLPTRQAQGTT------------------DPDNNIVFAEDVCVENDIVDLTKLPI-ENDRDD------MTENNQENENEIMPQNFTTKEE---L

Query:  DKPGEYDATLDMPIALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTVSLNTVTIPRNIYVAMEIPEWKAT----------------------HKPVGCKWV
         K  EYD +LD+PIALRKGTRS TK+P+ +++SY+NLS +FRAFT +L++  IP+NIY A+E PEWK                        HK VGCKWV
Subjt:  DKPGEYDATLDMPIALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTVSLNTVTIPRNIYVAMEIPEWKAT----------------------HKPVGCKWV

Query:  FTMKYKSDGTLDKYKARLVAKGFTQTFGIDYSETFSPVAKLNTIRVLLLVAVNKDWPLYQLDEKNAFLNGDLKEEVYMSHPPGFEAPFDHCVCRLRKALY
        F++KYK+DGTLD++KARLVAKGFTQT+GIDYSETFSPVAKLNT+RVLL VAVNKDWPLYQLD KNAFLNGDL EEVYMS PPGFEA F   VC+L+K+LY
Subjt:  FTMKYKSDGTLDKYKARLVAKGFTQTFGIDYSETFSPVAKLNTIRVLLLVAVNKDWPLYQLDEKNAFLNGDLKEEVYMSHPPGFEAPFDHCVCRLRKALY

Query:  GL--------------------NQGHSDPTLFTKRSVSSKEMILLK---------------STSKNRKWRTNLIKNLGNLKYFLGMEMA-----------
        GL                    +QGHSD TLFTK S + K  IL+                S  K R      IK+LGNLKYFLGME+A           
Subjt:  GL--------------------NQGHSDPTLFTKRSVSSKEMILLK---------------STSKNRKWRTNLIKNLGNLKYFLGMEMA-----------

Query:  ---------------RPADTPIECNANLGESVDKVPVNKE---------------RPDISYAVSVVCQFMQAPYEEHVEAINMILRCLKSTS------RK
                       RPADTPIE N  LG S D+VPV+KE               RPDIS+AVSVV QFMQAPYE+H+EA+N ILR LK+T       RK
Subjt:  ---------------RPADTPIECNANLGESVDKVPVNKE---------------RPDISYAVSVVCQFMQAPYEEHVEAINMILRCLKSTS------RK

Query:  ENALNPIPTLT----GRSVIDRKSTS-------------RSKKQSVVARSSAKAKYKAMSHGICEEIWLKKVLSDIHQTGDLPMKLYCDNKAAISIANNP
         N    I   T      SVIDRKSTS             RSKKQSVVARSSA+A+Y+AMS GICEEIWL+KVLSD+HQ  + P+KL+CDNKAAISIANNP
Subjt:  ENALNPIPTLT----GRSVIDRKSTS-------------RSKKQSVVARSSAKAKYKAMSHGICEEIWLKKVLSDIHQTGDLPMKLYCDNKAAISIANNP

Query:  -----------------EKLENGSICIPYIPSSQQVTDILTKGLLKQNFD
                         E+L++GSICIPYIPSSQQ+ D+LTKGLL+ +FD
Subjt:  -----------------EKLENGSICIPYIPSSQQVTDILTKGLLKQNFD

TrEMBL top hitse value%identityAlignment
A0A5A7U8U2 Retrovirus-related Pol polyprotein from transposon TNT 1-947.6e-29244.53Show/hide
Query:  PSSAKVQQQLANLQASFQQQIAALGAALGASTIFDQGPINSGIQSS------LPIYLKNPITVYLTLTTAPHMSGPMGTSMGLIVGEKLNGQNYFSWSQS
        PS+  +  Q +  Q         L +  G     ++  I +G  S+      LP+Y KNP+T +   + + +++G +G+S G   GEKLNGQNYFSWSQS
Subjt:  PSSAKVQQQLANLQASFQQQIAALGAALGASTIFDQGPINSGIQSS------LPIYLKNPITVYLTLTTAPHMSGPMGTSMGLIVGEKLNGQNYFSWSQS

Query:  IKMELEGRHKFGYLTGEISKPKPGDPQEPI-----------------------------AHDIWDAIQKLYSKRQNASRLYTLRKQIHECKQGSMD----
        IKM LEGR++FG+LTGE  +P PGD  E +                             A D+WD  Q LYSKRQNASRLYTLRKQ+H CKQG++D    
Subjt:  IKMELEGRHKFGYLTGEISKPKPGDPQEPI-----------------------------AHDIWDAIQKLYSKRQNASRLYTLRKQIHECKQGSMD----

Query:  -------------------DCPCGGVQYYKLEEVDRVYD-----------------------------FSVRLEEDRSSAMNITTTSITDSASFSAKSSG
                           D P    QY KLEE DRVYD                             F VRLEEDR++AM + TT   DSA+FSA+SS 
Subjt:  -------------------DCPCGGVQYYKLEEVDRVYD-----------------------------FSVRLEEDRSSAMNITTTSITDSASFSAKSSG

Query:  TTKQR---------------------------RPPNNK---------PSQALVSETTSGSQPQYQENCPVDTGTSSLGVIAQ------------------
            +                           RPP  K           +A +SET   S  Q  +     T T +LG IAQ                  
Subjt:  TTKQR---------------------------RPPNNK---------PSQALVSETTSGSQPQYQENCPVDTGTSSLGVIAQ------------------

Query:  ---SGATDHLTGTSDNFLLYHPCAGNEKIWIVEGTLATVAGKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNYHMAFSPDDVIFQDLSSGRMIGT
           SGATDHLTG+S++F+ Y  CAGNEKI I +G+LA +AGKG I PFDG  LQNVLHVPK+SYNLLS+SKITR+L+    F P+ V FQD+SSGR IGT
Subjt:  ---SGATDHLTGTSDNFLLYHPCAGNEKIWIVEGTLATVAGKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNYHMAFSPDDVIFQDLSSGRMIGT

Query:  ARHNTGLYFLDGDSSSRHSYRASL-------------------------------------------------------ITTSSGKRWFVTFIDDHTRLT
        ARH+ GLY LD D+S     R SL                                                       +TTSSGKRWFVTFIDDHTRLT
Subjt:  ARHNTGLYFLDGDSSSRHSYRASL-------------------------------------------------------ITTSSGKRWFVTFIDDHTRLT

Query:  WVFLLTDKSEVSSIFQQLYTTIITQFNAKISILRSDNGRKFLPNTLREFLSTKG-VHQSSCAYTPQQNGVAERKNRHLLEVARPSNKSDA----------
        WV+L++DKSEV SIFQ  Y TI TQF+ KI+ILRSDNGR+F  + L EFL++KG VHQ+SCAYTPQQNGVAERKNRHL+EVAR    S +          
Subjt:  WVFLLTDKSEVSSIFQQLYTTIITQFNAKISILRSDNGRKFLPNTLREFLSTKG-VHQSSCAYTPQQNGVAERKNRHLLEVARPSNKSDA----------

Query:  -----------SRVLNFQTPLDCLKLSYPTTRLLSDVPLRVFGCSAF--------------------------------------KYFISMDITFLKDKP
                   SR+L+ QTPLDCLK SYP+TRL+S+VPLRVFGC+A+                                      KYF++MD+TF +++P
Subjt:  -----------SRVLNFQTPLDCLKLSYPTTRLLSDVPLRVFGCSAF--------------------------------------KYFISMDITFLKDKP

Query:  FFPVSPLQEESSNEETNYSNSLPEPILTT------HNTVLPTRQAQGTT------------------DPDNNIVFAEDVCVENDIVDLTKLPI-ENDRDD
        +FPVS LQ E+ +EE+N +    EP L T      H  +LPT Q    T                   P  N     D  +EN     T   + END+ D
Subjt:  FFPVSPLQEESSNEETNYSNSLPEPILTT------HNTVLPTRQAQGTT------------------DPDNNIVFAEDVCVENDIVDLTKLPI-ENDRDD

Query:  ------MTENNQENENEIMPQNFTTKEE---LDKPGEYDATLDMPIALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTVSLNTVTIPRNIYVAMEIPEWKA
              M E N ++E E+  +    + E     K  EYD +LD+PIALRKGTRS TK+P+ +++SY+NLS +FRAFT +L++  IP+NIY A+E PEWK 
Subjt:  ------MTENNQENENEIMPQNFTTKEE---LDKPGEYDATLDMPIALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTVSLNTVTIPRNIYVAMEIPEWKA

Query:  T----------------------HKPVGCKWVFTMKYKSDGTLDKYKARLVAKGFTQTFGIDYSETFSPVAKLNTIRVLLLVAVNKDWPLYQLDEKNAFL
                               HK VGCKWVF++KYK+DGTLD++KARLVAKGFTQT+GIDYSETFSPVAKLNT+RVLL VAVNKDWPLYQLD KNAFL
Subjt:  T----------------------HKPVGCKWVFTMKYKSDGTLDKYKARLVAKGFTQTFGIDYSETFSPVAKLNTIRVLLLVAVNKDWPLYQLDEKNAFL

Query:  NGDLKEEVYMSHPPGFEAPFDHCVCRLRKALYGL--------------------NQGHSDPTLFTKRSVSSKEMILLK---------------STSKNRK
        NGDL EEVYMS PPGFEA F   VC+L+K+LYGL                    +QGHSD TLFTK S + K  IL+                S  K R 
Subjt:  NGDLKEEVYMSHPPGFEAPFDHCVCRLRKALYGL--------------------NQGHSDPTLFTKRSVSSKEMILLK---------------STSKNRK

Query:  WRTNLIKNLGNLKYFLGMEMA--------------------------RPADTPIECNANLGESVDKVPVNKE---------------RPDISYAVSVVCQ
             IK+LGNLKYFLGME+A                          RPADTPIE N  LG S D+VPV+KE               RPDIS+AVSVV Q
Subjt:  WRTNLIKNLGNLKYFLGMEMA--------------------------RPADTPIECNANLGESVDKVPVNKE---------------RPDISYAVSVVCQ

Query:  FMQAPYEEHVEAINMILRCLKSTS------RKENALNPIPTLT----GRSVIDRKSTS-------------RSKKQSVVARSSAKAKYKAMSHGICEEIW
        FMQAPYE+H+EA+N ILR LK+T       RK N    I   T      SVIDRKSTS             RSKKQSVVARSSA+A+Y+AMS GICEEIW
Subjt:  FMQAPYEEHVEAINMILRCLKSTS------RKENALNPIPTLT----GRSVIDRKSTS-------------RSKKQSVVARSSAKAKYKAMSHGICEEIW

Query:  LKKVLSDIHQTGDLPMKLYCDNKAAISIANNP-----------------EKLENGSICIPYIPSSQQVTDILTKGLLKQNFD
        L+KVLSD+HQ  + P+KL+CDNKAAISIANNP                 E+L++GSICIPYIPSSQQ+ D+LTKGLL+ +FD
Subjt:  LKKVLSDIHQTGDLPMKLYCDNKAAISIANNP-----------------EKLENGSICIPYIPSSQQVTDILTKGLLKQNFD

A0A5A7UQ67 Beta-galactosidase3.8e-29143.94Show/hide
Query:  QHGSVVPSSAKVQQQLANLQASFQQQIAALGAALGASTIFDQGPINSGIQ----------SSLPIYLKNPITVYLTLTTAPHMSGPMGTSMGLIVGEKLN
        QH S+V         L+N  +  Q  +  L   L +    DQ    S I+          + LP+Y KNP+T +   + + +++G +G+S G   GEKLN
Subjt:  QHGSVVPSSAKVQQQLANLQASFQQQIAALGAALGASTIFDQGPINSGIQ----------SSLPIYLKNPITVYLTLTTAPHMSGPMGTSMGLIVGEKLN

Query:  GQNYFSWSQSIKMELEGRHKFGYLTGEISKPKPGDPQEPI-----------------------------AHDIWDAIQKLYSKRQNASRLYTLRKQIHEC
        GQNYFSWSQSIKM LEGR++FG+LTGEI +P PGD  E +                             A D+WD  Q LYSKRQNASRLYTLRKQ+H C
Subjt:  GQNYFSWSQSIKMELEGRHKFGYLTGEISKPKPGDPQEPI-----------------------------AHDIWDAIQKLYSKRQNASRLYTLRKQIHEC

Query:  KQGSMD-----------------------DCPCGGVQYYKLEEVDRVYD-----------------------------FSVRLEEDRSSAMNITTTSITD
        KQG++D                       D P    QY KLEE DRVYD                             F VRLEEDR++AM + TT   D
Subjt:  KQGSMD-----------------------DCPCGGVQYYKLEEVDRVYD-----------------------------FSVRLEEDRSSAMNITTTSITD

Query:  SASFSAKSSGTTKQR---------------------------RPPNNK---------PSQALVSETTSGSQPQYQENCPVDTGTSSLGVIAQ--------
        SA+FSA+SS     +                           RPP  K           +A +SETT  S  Q  +     T T +LG IAQ        
Subjt:  SASFSAKSSGTTKQR---------------------------RPPNNK---------PSQALVSETTSGSQPQYQENCPVDTGTSSLGVIAQ--------

Query:  -------------SGATDHLTGTSDNFLLYHPCAGNEKIWIVEGTLATVAGKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNYHMAFSPDDVIFQ
                     SGATDHLTG+S++F+ Y PCAGNEKI I +G+LA +AGKG I PFDG  LQNVLHVPK+SYNLLS+SKITR+L+    F P+ V FQ
Subjt:  -------------SGATDHLTGTSDNFLLYHPCAGNEKIWIVEGTLATVAGKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNYHMAFSPDDVIFQ

Query:  DLSSGRMIGTARHNTGLYFLDGDSSSRHSYRASL------------------------------------------------------------------
        D+SSGR IGTARH+ GLY LD D+S     R SL                                                                  
Subjt:  DLSSGRMIGTARHNTGLYFLDGDSSSRHSYRASL------------------------------------------------------------------

Query:  ---------------ITTSSGKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQLYTTIITQFNAKISILRSDNGRKFLPNTLREFLSTKG-VHQSSCAYT
                       +TTSSGKRWFVTFIDDHTRLTWV+L++DKSEV SIFQ  Y TI TQF+ KI+ILRSDNGR+F  + L EFL++KG VHQ+SCAYT
Subjt:  ---------------ITTSSGKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQLYTTIITQFNAKISILRSDNGRKFLPNTLREFLSTKG-VHQSSCAYT

Query:  PQQNGVAERKNRHLLEVARPSNKSDA---------------------SRVLNFQTPLDCLKLSYPTTRLLSDVPLRVFGCSAF-----------------
        PQQNGVAERKNRHL+EVAR    S +                     SR+L+ QTPLDCLK SYP+TRL+S+VPLRVFGC+A+                 
Subjt:  PQQNGVAERKNRHLLEVARPSNKSDA---------------------SRVLNFQTPLDCLKLSYPTTRLLSDVPLRVFGCSAF-----------------

Query:  ---------------------KYFISMDITFLKDKPFFPVSPLQEESSNEETNYSNSLPEPILTT------HNTVLPTRQAQGTT---------------
                             KYF++MD+TF +++P+FPVS LQ E+ +EE+N +    EP L T      H  +LPT Q    T               
Subjt:  ---------------------KYFISMDITFLKDKPFFPVSPLQEESSNEETNYSNSLPEPILTT------HNTVLPTRQAQGTT---------------

Query:  ---DPDNNIVFAEDVCVENDIVDLTKLPI-ENDRDD------MTENNQENENEIMPQNFTTKEE---LDKPGEYDATLDMPIALRKGTRSYTKYPMYSFL
            P  N     D  +EN     T   + END+ D      M E N ++E E+  +    + E     K  EYD +LD+PIALRKGTRS TK+P+ +++
Subjt:  ---DPDNNIVFAEDVCVENDIVDLTKLPI-ENDRDD------MTENNQENENEIMPQNFTTKEE---LDKPGEYDATLDMPIALRKGTRSYTKYPMYSFL

Query:  SYNNLSSKFRAFTVSLNTVTIPRNIYVAMEIPEWKAT----------------------HKPVGCKWVFTMKYKSDGTLDKYKARLVAKGFTQTFGIDYS
        SY+NLS +FRAFT +L++  IP+NIY A+E PEWK                        HK VGCKWVF++KYK+DGTLD++KARLVAKGFTQT+GIDYS
Subjt:  SYNNLSSKFRAFTVSLNTVTIPRNIYVAMEIPEWKAT----------------------HKPVGCKWVFTMKYKSDGTLDKYKARLVAKGFTQTFGIDYS

Query:  ETFSPVAKLNTIRVLLLVAVNKDWPLYQLDEKNAFLNGDLKEEVYMSHPPGFEAPFDHCVCRLRKALYGL--------------------NQGHSDPTLF
        ETFSPVAKLNT+RVLL VAVNKDWPLYQLD KNAFLNGDL EEVYMS PPGFEA F   VC+L+K+LYGL                    +QGHSD TLF
Subjt:  ETFSPVAKLNTIRVLLLVAVNKDWPLYQLDEKNAFLNGDLKEEVYMSHPPGFEAPFDHCVCRLRKALYGL--------------------NQGHSDPTLF

Query:  TKRSVSSKEMILLK---------------STSKNRKWRTNLIKNLGNLKYFLGMEMA--------------------------RPADTPIECNANLGESV
        TK S + K  IL+                S  K R      IK+LGNLKYFLGME+A                          RPADTPIE N  LG S 
Subjt:  TKRSVSSKEMILLK---------------STSKNRKWRTNLIKNLGNLKYFLGMEMA--------------------------RPADTPIECNANLGESV

Query:  DKVPVNKE---------------RPDISYAVSVVCQFMQAPYEEHVEAINMILRCLKSTS------RKENALNPIPTLT----GRSVIDRKSTS------
        D+VPV+KE               RPDIS+AVSVV QFMQAPYE+H+EA+N ILR LK+T       RK N    I   T      SVIDRKSTS      
Subjt:  DKVPVNKE---------------RPDISYAVSVVCQFMQAPYEEHVEAINMILRCLKSTS------RKENALNPIPTLT----GRSVIDRKSTS------

Query:  -------RSKKQSVVARSSAKAKYKAMSHGICEEIWLKKVLSDIHQTGDLPMKLYCDNKAAISIANNP-----------------EKLENGSICIPYIPS
               RSKKQSVVARSSA+A+Y+AMS GICEEIWL+KVLSD+HQ  + P+KL+CDNKAAISIANNP                 E+L++GSICIPYIPS
Subjt:  -------RSKKQSVVARSSAKAKYKAMSHGICEEIWLKKVLSDIHQTGDLPMKLYCDNKAAISIANNP-----------------EKLENGSICIPYIPS

Query:  SQQVTDILTKGLLKQNFD
        SQQ+ D+LTKGLL+ +FD
Subjt:  SQQVTDILTKGLLKQNFD

A0A5A7V0Y9 Beta-galactosidase1.3e-29144.86Show/hide
Query:  SSLPIYLKNPITVYLTLTTAPHMSGPMGTSMGLIVGEKLNGQNYFSWSQSIKMELEGRHKFGYLTGEISKPKPGDPQEPI--------------------
        + LP+Y KNP+T +   + + +++G +G+S G   GEKLNGQNYFSWSQSIKM LEGR++FG+LTGEI +P PGD  E +                    
Subjt:  SSLPIYLKNPITVYLTLTTAPHMSGPMGTSMGLIVGEKLNGQNYFSWSQSIKMELEGRHKFGYLTGEISKPKPGDPQEPI--------------------

Query:  ---------AHDIWDAIQKLYSKRQNASRLYTLRKQIHECKQGSMD-----------------------DCPCGGVQYYKLEEVDRVYD-----------
                 A D+WD  Q LYSKRQNASRLYTLRKQ+H CKQG++D                       D P    QY KLEE DRVYD           
Subjt:  ---------AHDIWDAIQKLYSKRQNASRLYTLRKQIHECKQGSMD-----------------------DCPCGGVQYYKLEEVDRVYD-----------

Query:  ------------------FSVRLEEDRSSAMNITTTSITDSASFSAKSSGTTKQR---------------------------RPPNNK---------PSQ
                          F VRLEEDR++AM + TT   DSA+FSA+SS     +                           RPP  K           +
Subjt:  ------------------FSVRLEEDRSSAMNITTTSITDSASFSAKSSGTTKQR---------------------------RPPNNK---------PSQ

Query:  ALVSETTSGSQPQYQENCPVDTGTSSLGVIAQ---------------------SGATDHLTGTSDNFLLYHPCAGNEKIWIVEGTLATVAGKGHISPFDG
        A +SETT  S  Q  +     T T +LG IAQ                     SGATDHLTG+S++F+ Y PCAGNEKI I +G+LA +AGKG I PFDG
Subjt:  ALVSETTSGSQPQYQENCPVDTGTSSLGVIAQ---------------------SGATDHLTGTSDNFLLYHPCAGNEKIWIVEGTLATVAGKGHISPFDG

Query:  LILQNVLHVPKISYNLLSVSKITRDLNYHMAFSPDDVIFQDLSSGRMIGTARHNTGLYFLDGDSSSRHSYRASL--------------------------
          LQNVLHVPK+SYNLLS+SKITR+L+    F P+ V FQD+SSGR IGTARH+ GLY LD D+S     R SL                          
Subjt:  LILQNVLHVPKISYNLLSVSKITRDLNYHMAFSPDDVIFQDLSSGRMIGTARHNTGLYFLDGDSSSRHSYRASL--------------------------

Query:  -------------------------------------------------------ITTSSGKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQLYTTIIT
                                                               +TTSSGKRWFVTFIDDHTRLTWV+L++DKSEV SIFQ  Y TI T
Subjt:  -------------------------------------------------------ITTSSGKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQLYTTIIT

Query:  QFNAKISILRSDNGRKFLPNTLREFLSTKG-VHQSSCAYTPQQNGVAERKNRHLLEVARPSNKSDA---------------------SRVLNFQTPLDCL
        QF+ KI+ILRSDNGR+F  + L EFL++KG VHQ+SCAYTPQQNGVAERKNRHL+EVAR    S +                     SR+L+ QTPLDCL
Subjt:  QFNAKISILRSDNGRKFLPNTLREFLSTKG-VHQSSCAYTPQQNGVAERKNRHLLEVARPSNKSDA---------------------SRVLNFQTPLDCL

Query:  KLSYPTTRLLSDVPLRVFGCSAF--------------------------------------KYFISMDITFLKDKPFFPVSPLQEESSNEETNYSNSLPE
        K SYP+TRL+S+VPLRVFGC+A+                                      KYF++MD+TF +++P+FPVS LQ E+ +EE+N +    E
Subjt:  KLSYPTTRLLSDVPLRVFGCSAF--------------------------------------KYFISMDITFLKDKPFFPVSPLQEESSNEETNYSNSLPE

Query:  PILTT------HNTVLPTRQAQGTT------------------DPDNNIVFAEDVCVENDIVDLTKLPI-ENDRDD------MTENNQENENEIMPQNFT
        P L T      H  +LPT Q    T                   P  N     D  +EN     T   + END+ D      M E N ++E E+  +   
Subjt:  PILTT------HNTVLPTRQAQGTT------------------DPDNNIVFAEDVCVENDIVDLTKLPI-ENDRDD------MTENNQENENEIMPQNFT

Query:  TKEE---LDKPGEYDATLDMPIALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTVSLNTVTIPRNIYVAMEIPEWKAT----------------------H
         + E     K  EYD +LD+PIALRKGTRS TK+P+ +++SY+NLS +FRAFT +L++  IP+NIY A+E PEWK                        H
Subjt:  TKEE---LDKPGEYDATLDMPIALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTVSLNTVTIPRNIYVAMEIPEWKAT----------------------H

Query:  KPVGCKWVFTMKYKSDGTLDKYKARLVAKGFTQTFGIDYSETFSPVAKLNTIRVLLLVAVNKDWPLYQLDEKNAFLNGDLKEEVYMSHPPGFEAPFDHCV
        K VGCKWVF++KYK+DGTLD++KARLVAKGFTQT+GIDYSETFSPVAKLNT+RVLL VAVNKDWPLYQLD KNAFLNGDL EEVYMS PPGFEA F   V
Subjt:  KPVGCKWVFTMKYKSDGTLDKYKARLVAKGFTQTFGIDYSETFSPVAKLNTIRVLLLVAVNKDWPLYQLDEKNAFLNGDLKEEVYMSHPPGFEAPFDHCV

Query:  CRLRKALYGL--------------------NQGHSDPTLFTKRSVSSKEMILLK---------------STSKNRKWRTNLIKNLGNLKYFLGMEMA---
        C+L+K+LYGL                    +QGHSD TLFTK S + K  IL+                S  K R      IK+LGNLKYFLGME+A   
Subjt:  CRLRKALYGL--------------------NQGHSDPTLFTKRSVSSKEMILLK---------------STSKNRKWRTNLIKNLGNLKYFLGMEMA---

Query:  -----------------------RPADTPIECNANLGESVDKVPVNKE---------------RPDISYAVSVVCQFMQAPYEEHVEAINMILRCLKSTS
                               RPADTPIE N  LG S D+VPV+KE               RPDIS+AVSVV QFMQAPYE+H+EA+N ILR LK+T 
Subjt:  -----------------------RPADTPIECNANLGESVDKVPVNKE---------------RPDISYAVSVVCQFMQAPYEEHVEAINMILRCLKSTS

Query:  ------RKENALNPIPTLT----GRSVIDRKSTS-------------RSKKQSVVARSSAKAKYKAMSHGICEEIWLKKVLSDIHQTGDLPMKLYCDNKA
              RK N    I   T      SVIDRKSTS             RSKKQSVVARSSA+A+Y+AMS GICEEIWL+KVLSD+HQ  + P+KL+CDNKA
Subjt:  ------RKENALNPIPTLT----GRSVIDRKSTS-------------RSKKQSVVARSSAKAKYKAMSHGICEEIWLKKVLSDIHQTGDLPMKLYCDNKA

Query:  AISIANNP-----------------EKLENGSICIPYIPSSQQVTDILTKGLLKQNFD
        AISIANNP                 E+L++GSICIPYIPSSQQ+ D+LTKGLL+ +FD
Subjt:  AISIANNP-----------------EKLENGSICIPYIPSSQQVTDILTKGLLKQNFD

A0A5D3CA05 Beta-galactosidase3.8e-29143.94Show/hide
Query:  QHGSVVPSSAKVQQQLANLQASFQQQIAALGAALGASTIFDQGPINSGIQ----------SSLPIYLKNPITVYLTLTTAPHMSGPMGTSMGLIVGEKLN
        QH S+V         L+N  +  Q  +  L   L +    DQ    S I+          + LP+Y KNP+T +   + + +++G +G+S G   GEKLN
Subjt:  QHGSVVPSSAKVQQQLANLQASFQQQIAALGAALGASTIFDQGPINSGIQ----------SSLPIYLKNPITVYLTLTTAPHMSGPMGTSMGLIVGEKLN

Query:  GQNYFSWSQSIKMELEGRHKFGYLTGEISKPKPGDPQEPI-----------------------------AHDIWDAIQKLYSKRQNASRLYTLRKQIHEC
        GQNYFSWSQSIKM LEGR++FG+LTGEI +P PGD  E +                             A D+WD  Q LYSKRQNASRLYTLRKQ+H C
Subjt:  GQNYFSWSQSIKMELEGRHKFGYLTGEISKPKPGDPQEPI-----------------------------AHDIWDAIQKLYSKRQNASRLYTLRKQIHEC

Query:  KQGSMD-----------------------DCPCGGVQYYKLEEVDRVYD-----------------------------FSVRLEEDRSSAMNITTTSITD
        KQG++D                       D P    QY KLEE DRVYD                             F VRLEEDR++AM + TT   D
Subjt:  KQGSMD-----------------------DCPCGGVQYYKLEEVDRVYD-----------------------------FSVRLEEDRSSAMNITTTSITD

Query:  SASFSAKSSGTTKQR---------------------------RPPNNK---------PSQALVSETTSGSQPQYQENCPVDTGTSSLGVIAQ--------
        SA+FSA+SS     +                           RPP  K           +A +SETT  S  Q  +     T T +LG IAQ        
Subjt:  SASFSAKSSGTTKQR---------------------------RPPNNK---------PSQALVSETTSGSQPQYQENCPVDTGTSSLGVIAQ--------

Query:  -------------SGATDHLTGTSDNFLLYHPCAGNEKIWIVEGTLATVAGKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNYHMAFSPDDVIFQ
                     SGATDHLTG+S++F+ Y PCAGNEKI I +G+LA +AGKG I PFDG  LQNVLHVPK+SYNLLS+SKITR+L+    F P+ V FQ
Subjt:  -------------SGATDHLTGTSDNFLLYHPCAGNEKIWIVEGTLATVAGKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNYHMAFSPDDVIFQ

Query:  DLSSGRMIGTARHNTGLYFLDGDSSSRHSYRASL------------------------------------------------------------------
        D+SSGR IGTARH+ GLY LD D+S     R SL                                                                  
Subjt:  DLSSGRMIGTARHNTGLYFLDGDSSSRHSYRASL------------------------------------------------------------------

Query:  ---------------ITTSSGKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQLYTTIITQFNAKISILRSDNGRKFLPNTLREFLSTKG-VHQSSCAYT
                       +TTSSGKRWFVTFIDDHTRLTWV+L++DKSEV SIFQ  Y TI TQF+ KI+ILRSDNGR+F  + L EFL++KG VHQ+SCAYT
Subjt:  ---------------ITTSSGKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQLYTTIITQFNAKISILRSDNGRKFLPNTLREFLSTKG-VHQSSCAYT

Query:  PQQNGVAERKNRHLLEVARPSNKSDA---------------------SRVLNFQTPLDCLKLSYPTTRLLSDVPLRVFGCSAF-----------------
        PQQNGVAERKNRHL+EVAR    S +                     SR+L+ QTPLDCLK SYP+TRL+S+VPLRVFGC+A+                 
Subjt:  PQQNGVAERKNRHLLEVARPSNKSDA---------------------SRVLNFQTPLDCLKLSYPTTRLLSDVPLRVFGCSAF-----------------

Query:  ---------------------KYFISMDITFLKDKPFFPVSPLQEESSNEETNYSNSLPEPILTT------HNTVLPTRQAQGTT---------------
                             KYF++MD+TF +++P+FPVS LQ E+ +EE+N +    EP L T      H  +LPT Q    T               
Subjt:  ---------------------KYFISMDITFLKDKPFFPVSPLQEESSNEETNYSNSLPEPILTT------HNTVLPTRQAQGTT---------------

Query:  ---DPDNNIVFAEDVCVENDIVDLTKLPI-ENDRDD------MTENNQENENEIMPQNFTTKEE---LDKPGEYDATLDMPIALRKGTRSYTKYPMYSFL
            P  N     D  +EN     T   + END+ D      M E N ++E E+  +    + E     K  EYD +LD+PIALRKGTRS TK+P+ +++
Subjt:  ---DPDNNIVFAEDVCVENDIVDLTKLPI-ENDRDD------MTENNQENENEIMPQNFTTKEE---LDKPGEYDATLDMPIALRKGTRSYTKYPMYSFL

Query:  SYNNLSSKFRAFTVSLNTVTIPRNIYVAMEIPEWKAT----------------------HKPVGCKWVFTMKYKSDGTLDKYKARLVAKGFTQTFGIDYS
        SY+NLS +FRAFT +L++  IP+NIY A+E PEWK                        HK VGCKWVF++KYK+DGTLD++KARLVAKGFTQT+GIDYS
Subjt:  SYNNLSSKFRAFTVSLNTVTIPRNIYVAMEIPEWKAT----------------------HKPVGCKWVFTMKYKSDGTLDKYKARLVAKGFTQTFGIDYS

Query:  ETFSPVAKLNTIRVLLLVAVNKDWPLYQLDEKNAFLNGDLKEEVYMSHPPGFEAPFDHCVCRLRKALYGL--------------------NQGHSDPTLF
        ETFSPVAKLNT+RVLL VAVNKDWPLYQLD KNAFLNGDL EEVYMS PPGFEA F   VC+L+K+LYGL                    +QGHSD TLF
Subjt:  ETFSPVAKLNTIRVLLLVAVNKDWPLYQLDEKNAFLNGDLKEEVYMSHPPGFEAPFDHCVCRLRKALYGL--------------------NQGHSDPTLF

Query:  TKRSVSSKEMILLK---------------STSKNRKWRTNLIKNLGNLKYFLGMEMA--------------------------RPADTPIECNANLGESV
        TK S + K  IL+                S  K R      IK+LGNLKYFLGME+A                          RPADTPIE N  LG S 
Subjt:  TKRSVSSKEMILLK---------------STSKNRKWRTNLIKNLGNLKYFLGMEMA--------------------------RPADTPIECNANLGESV

Query:  DKVPVNKE---------------RPDISYAVSVVCQFMQAPYEEHVEAINMILRCLKSTS------RKENALNPIPTLT----GRSVIDRKSTS------
        D+VPV+KE               RPDIS+AVSVV QFMQAPYE+H+EA+N ILR LK+T       RK N    I   T      SVIDRKSTS      
Subjt:  DKVPVNKE---------------RPDISYAVSVVCQFMQAPYEEHVEAINMILRCLKSTS------RKENALNPIPTLT----GRSVIDRKSTS------

Query:  -------RSKKQSVVARSSAKAKYKAMSHGICEEIWLKKVLSDIHQTGDLPMKLYCDNKAAISIANNP-----------------EKLENGSICIPYIPS
               RSKKQSVVARSSA+A+Y+AMS GICEEIWL+KVLSD+HQ  + P+KL+CDNKAAISIANNP                 E+L++GSICIPYIPS
Subjt:  -------RSKKQSVVARSSAKAKYKAMSHGICEEIWLKKVLSDIHQTGDLPMKLYCDNKAAISIANNP-----------------EKLENGSICIPYIPS

Query:  SQQVTDILTKGLLKQNFD
        SQQ+ D+LTKGLL+ +FD
Subjt:  SQQVTDILTKGLLKQNFD

A0A5D3CFX5 Beta-galactosidase2.9e-29143.35Show/hide
Query:  HYFPPTAVVVVGIPH------QGVENYYEQHGSVVPSSAKVQQQLANLQASFQQQ---IAALGAALGASTIFDQGPINSGIQ----------SSLPIYLK
        H  PP  V    +P       Q    +   H   + +S+  Q    NL   + +Q   + +L   L +    DQ    S I+          + LP+Y K
Subjt:  HYFPPTAVVVVGIPH------QGVENYYEQHGSVVPSSAKVQQQLANLQASFQQQ---IAALGAALGASTIFDQGPINSGIQ----------SSLPIYLK

Query:  NPITVYLTLTTAPHMSGPMGTSMGLIVGEKLNGQNYFSWSQSIKMELEGRHKFGYLTGEISKPKPGDPQEPI----------------------------
        NP+T +   + + +++G +G+S G   GEKLNGQNYFSWSQSIKM LEGR++FG+LTGEI +P PGD  E +                            
Subjt:  NPITVYLTLTTAPHMSGPMGTSMGLIVGEKLNGQNYFSWSQSIKMELEGRHKFGYLTGEISKPKPGDPQEPI----------------------------

Query:  -AHDIWDAIQKLYSKRQNASRLYTLRKQIHECKQGSMD-----------------------DCPCGGVQYYKLEEVDRVYD-------------------
         A D+WD  Q LYSKRQNASRLYTLRKQ+H CKQG++D                       D P    QY KLEE DRVYD                   
Subjt:  -AHDIWDAIQKLYSKRQNASRLYTLRKQIHECKQGSMD-----------------------DCPCGGVQYYKLEEVDRVYD-------------------

Query:  ----------FSVRLEEDRSSAMNITTTSITDSASFSAKSSGTTKQR---------------------------RPPNNK---------PSQALVSETTS
                  F VRLEEDR++AM + TT   DSA+FSA+SS     +                           RPP  K           +A +SETT 
Subjt:  ----------FSVRLEEDRSSAMNITTTSITDSASFSAKSSGTTKQR---------------------------RPPNNK---------PSQALVSETTS

Query:  GSQPQYQENCPVDTGTSSLGVIAQ---------------------SGATDHLTGTSDNFLLYHPCAGNEKIWIVEGTLATVAGKGHISPFDGLILQNVLH
         S  Q  +     T T +LG IAQ                     SGATDHLTG+S++F+ Y PCAGNEKI I +G+LA +AGKG I PFDG  LQNVLH
Subjt:  GSQPQYQENCPVDTGTSSLGVIAQ---------------------SGATDHLTGTSDNFLLYHPCAGNEKIWIVEGTLATVAGKGHISPFDGLILQNVLH

Query:  VPKISYNLLSVSKITRDLNYHMAFSPDDVIFQDLSSGRMIGTARHNTGLYFLDGDSSSRHSYRASL----------------------------------
        VPK+SYNLLS+SKITR+L+    F P+ V FQD+SSGR IGTARH+ GLY LD D+S     R SL                                  
Subjt:  VPKISYNLLSVSKITRDLNYHMAFSPDDVIFQDLSSGRMIGTARHNTGLYFLDGDSSSRHSYRASL----------------------------------

Query:  -----------------------------------------------ITTSSGKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQLYTTIITQFNAKISI
                                                       +TTSSGKRWFVTFIDDHTRLTWV+L++DKSEV SIFQ  Y TI TQF+ KI+I
Subjt:  -----------------------------------------------ITTSSGKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQLYTTIITQFNAKISI

Query:  LRSDNGRKFLPNTLREFLSTKG-VHQSSCAYTPQQNGVAERKNRHLLEVARPSNKSDA---------------------SRVLNFQTPLDCLKLSYPTTR
        LRSDNGR+F  + L EFL++KG VHQ+SCAYTPQQNGVAERKNRHL+EVAR    S +                     SR+L+ QTPLDCLK SYP+TR
Subjt:  LRSDNGRKFLPNTLREFLSTKG-VHQSSCAYTPQQNGVAERKNRHLLEVARPSNKSDA---------------------SRVLNFQTPLDCLKLSYPTTR

Query:  LLSDVPLRVFGCSAF--------------------------------------KYFISMDITFLKDKPFFPVSPLQEESSNEETNYSNSLPEPILTT---
        L+S+VPLRVFGC+A+                                      KYF++MD+TF +++P+FPVS LQ E+ +EE+N +    EP L T   
Subjt:  LLSDVPLRVFGCSAF--------------------------------------KYFISMDITFLKDKPFFPVSPLQEESSNEETNYSNSLPEPILTT---

Query:  ---HNTVLPTRQAQGTT------------------DPDNNIVFAEDVCVENDIVDLTKLPI-ENDRDD------MTENNQENENEIMPQNFTTKEE---L
           H  +LPT Q    T                   P  N     D  +EN     T   + END+ D      M E N ++E E+  +    + E    
Subjt:  ---HNTVLPTRQAQGTT------------------DPDNNIVFAEDVCVENDIVDLTKLPI-ENDRDD------MTENNQENENEIMPQNFTTKEE---L

Query:  DKPGEYDATLDMPIALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTVSLNTVTIPRNIYVAMEIPEWKAT----------------------HKPVGCKWV
         K  EYD +LD+PIALRKGTRS TK+P+ +++SY+NLS +FRAFT +L++  IP+NIY A+E PEWK                        HK VGCKWV
Subjt:  DKPGEYDATLDMPIALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTVSLNTVTIPRNIYVAMEIPEWKAT----------------------HKPVGCKWV

Query:  FTMKYKSDGTLDKYKARLVAKGFTQTFGIDYSETFSPVAKLNTIRVLLLVAVNKDWPLYQLDEKNAFLNGDLKEEVYMSHPPGFEAPFDHCVCRLRKALY
        F++KYK+DGTLD++KARLVAKGFTQT+GIDYSETFSPVAKLNT+RVLL VAVNKDWPLYQLD KNAFLNGDL EEVYMS PPGFEA F   VC+L+K+LY
Subjt:  FTMKYKSDGTLDKYKARLVAKGFTQTFGIDYSETFSPVAKLNTIRVLLLVAVNKDWPLYQLDEKNAFLNGDLKEEVYMSHPPGFEAPFDHCVCRLRKALY

Query:  GL--------------------NQGHSDPTLFTKRSVSSKEMILLK---------------STSKNRKWRTNLIKNLGNLKYFLGMEMA-----------
        GL                    +QGHSD TLFTK S + K  IL+                S  K R      IK+LGNLKYFLGME+A           
Subjt:  GL--------------------NQGHSDPTLFTKRSVSSKEMILLK---------------STSKNRKWRTNLIKNLGNLKYFLGMEMA-----------

Query:  ---------------RPADTPIECNANLGESVDKVPVNKE---------------RPDISYAVSVVCQFMQAPYEEHVEAINMILRCLKSTS------RK
                       RPADTPIE N  LG S D+VPV+KE               RPDIS+AVSVV QFMQAPYE+H+EA+N ILR LK+T       RK
Subjt:  ---------------RPADTPIECNANLGESVDKVPVNKE---------------RPDISYAVSVVCQFMQAPYEEHVEAINMILRCLKSTS------RK

Query:  ENALNPIPTLT----GRSVIDRKSTS-------------RSKKQSVVARSSAKAKYKAMSHGICEEIWLKKVLSDIHQTGDLPMKLYCDNKAAISIANNP
         N    I   T      SVIDRKSTS             RSKKQSVVARSSA+A+Y+AMS GICEEIWL+KVLSD+HQ  + P+KL+CDNKAAISIANNP
Subjt:  ENALNPIPTLT----GRSVIDRKSTS-------------RSKKQSVVARSSAKAKYKAMSHGICEEIWLKKVLSDIHQTGDLPMKLYCDNKAAISIANNP

Query:  -----------------EKLENGSICIPYIPSSQQVTDILTKGLLKQNFD
                         E+L++GSICIPYIPSSQQ+ D+LTKGLL+ +FD
Subjt:  -----------------EKLENGSICIPYIPSSQQVTDILTKGLLKQNFD

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.0e-4725.83Show/hide
Query:  TSSGKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQLYTTIITQFNAKISILRSDNGRKFLPNTLREFLSTKGV-HQSSCAYTPQQNGVAERKNRHLLEV
        T   K +FV F+D  T     +L+  KS+V S+FQ         FN K+  L  DNGR++L N +R+F   KG+ +  +  +TPQ NGV+ER  R + E 
Subjt:  TSSGKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQLYTTIITQFNAKISILRSDNGRKFLPNTLREFLSTKGV-HQSSCAYTPQQNGVAERKNRHLLEV

Query:  AR---PSNKSDASRVLNFQTPLDCLKLSYPTTRLL--SDVP-------------LRVFGCSAF-----------------------------------KY
        AR      K D S           L    P+  L+  S  P             LRVFG + +                                   K+
Subjt:  AR---PSNKSDASRVLNFQTPLDCLKLSYPTTRLL--SDVP-------------LRVFGCSAF-----------------------------------KY

Query:  FISMDI-----------------TFLKDK--------PFFPVSPLQEESSNEETNYSN-SLPEPILTTHNTVLP--TRQAQGTTDPD-----NNIVFAED
         ++ D+                  FLKD         P      +Q E  NE     N    +    + N   P  +R+   T  P+     +NI F +D
Subjt:  FISMDI-----------------TFLKDK--------PFFPVSPLQEESSNEETNYSN-SLPEPILTTHNTVLP--TRQAQGTTDPD-----NNIVFAED

Query:  VCVEND-IVDLTKLPIENDRDDMTENNQENENEIMPQNFTTKEELDKPGEYDATLD--MPIALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTVSLNTVTI
            N   ++ +K   +  RDD    ++ + N    +   T E L + G  + T +  + I  R+  R  TK P  S+   +N  +K     V LN  TI
Subjt:  VCVEND-IVDLTKLPIENDRDDMTENNQENENEIMPQNFTTKEELDKPGEYDATLD--MPIALRKGTRSYTKYPMYSFLSYNNLSSKFRAFTVSLNTVTI

Query:  PRNIYVAMEIPE------------------------WKATHKP-----VGCKWVFTMKYKSDGTLDKYKARLVAKGFTQTFGIDYSETFSPVAKLNTIRV
          ++  + +  +                        W  T +P     V  +WVF++KY   G   +YKARLVA+GFTQ + IDY ETF+PVA++++ R 
Subjt:  PRNIYVAMEIPE------------------------WKATHKP-----VGCKWVFTMKYKSDGTLDKYKARLVAKGFTQTFGIDYSETFSPVAKLNTIRV

Query:  LLLVAVNKDWPLYQLDEKNAFLNGDLKEEVYMSHPPGFEAPFDHCVCRLRKALYGLNQ--------------------------------GHSDPTLFTK
        +L + +  +  ++Q+D K AFLNG LKEE+YM  P G     D+ VC+L KA+YGL Q                                G+ +  ++  
Subjt:  LLLVAVNKDWPLYQLDEKNAFLNGDLKEEVYMSHPPGFEAPFDHCVCRLRKALYGLNQ--------------------------------GHSDPTLFTK

Query:  RSVSSKEMILLKSTSKNRKWRTNLIK----NLGNLKYFLG--------------------------MEMARPADTPIECNAN---LGESVD-KVPVNK--
          V    +     T  N   R  + K    +L  +K+F+G                          ME      TP+    N   L    D   P     
Subjt:  RSVSSKEMILLKSTSKNRKWRTNLIK----NLGNLKYFLG--------------------------MEMARPADTPIECNAN---LGESVD-KVPVNK--

Query:  ---------ERPDISYAVSVVCQFMQAPYEEHVEAINMILRCLKST-----------SRKENALNPIPTLTGRSVIDRKSTS--------------RSKK
                  RPD++ AV+++ ++      E  + +  +LR LK T           + +   +  + +    S IDRKST+               +K+
Subjt:  ---------ERPDISYAVSVVCQFMQAPYEEHVEAINMILRCLKST-----------SRKENALNPIPTLTGRSVIDRKSTS--------------RSKK

Query:  QSVVARSSAKAKYKAMSHGICEEIWLKKVLSDIHQTGDLPMKLYCDNKAAISIANNP-----------------EKLENGSICIPYIPSSQQVTDILTKG
        Q+ VA SS +A+Y A+   + E +WLK +L+ I+   + P+K+Y DN+  ISIANNP                 E+++N  IC+ YIP+  Q+ DI TK 
Subjt:  QSVVARSSAKAKYKAMSHGICEEIWLKKVLSDIHQTGDLPMKLYCDNKAAISIANNP-----------------EKLENGSICIPYIPSSQQVTDILTKG

Query:  LLKQNF
        L    F
Subjt:  LLKQNF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.5e-6027.72Show/hide
Query:  ITTSSGKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQLYTTIITQFNAKISILRSDNGRKFLPNTLREFLSTKGV-HQSSCAYTPQQNGVAERKNRHLL
        I +  G ++FVTFIDD +R  WV++L  K +V  +FQ+ +  +  +   K+  LRSDNG ++      E+ S+ G+ H+ +   TPQ NGVAER NR ++
Subjt:  ITTSSGKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQLYTTIITQFNAKISILRSDNGRKFLPNTLREFLSTKGV-HQSSCAYTPQQNGVAERKNRHLL

Query:  EVARPSNK-------------SDASRVLNFQTPLDCLKLSYP----TTRLLSDVPLRVFGCSAFKYFISMDITFLKDKPFFPVSPLQEESSNEETNYS--
        E  R   +               A  ++N ++P   L    P    T + +S   L+VFGC AF +      T L DK    +  +     +EE  Y   
Subjt:  EVARPSNK-------------SDASRVLNFQTPLDCLKLSYP----TTRLLSDVPLRVFGCSAFKYFISMDITFLKDKPFFPVSPLQEESSNEETNYS--

Query:  NSLPEPILTTHNTVL---PTRQAQGTTDPDNNIVFAEDVCVENDIVDLTKLPIENDRDDMTENNQE-----NENEIMPQNFTTKEELDKPGEYDATLDMP
        + + + ++ + + V      R A   ++   N +    V + +   + T    E+  D+++E  ++      + E + +     E   +  E    L   
Subjt:  NSLPEPILTTHNTVL---PTRQAQGTTDPDNNIVFAEDVCVENDIVDLTKLPIENDRDDMTENNQE-----NENEIMPQNFTTKEELDKPGEYDATLDMP

Query:  IALRKGTRSY--TKYPMYS----------FLSYNNLSSKFRAFTVSLNTVTIPRNIYVAMEIPEWKATHKPVGCKWVFTMKYKSDGTLDKYKARLVAKGF
           R  +R Y  T+Y + S           LS+   +   +A    + ++      Y  +E+P+ K   +P+ CKWVF +K   D  L +YKARLV KGF
Subjt:  IALRKGTRSY--TKYPMYS----------FLSYNNLSSKFRAFTVSLNTVTIPRNIYVAMEIPEWKATHKPVGCKWVFTMKYKSDGTLDKYKARLVAKGF

Query:  TQTFGIDYSETFSPVAKLNTIRVLLLVAVNKDWPLYQLDEKNAFLNGDLKEEVYMSHPPGFE-APFDHCVCRLRKALYGLNQG-----------------
         Q  GID+ E FSPV K+ +IR +L +A + D  + QLD K AFL+GDL+EE+YM  P GFE A   H VC+L K+LYGL Q                  
Subjt:  TQTFGIDYSETFSPVAKLNTIRVLLLVAVNKDWPLYQLDEKNAFLNGDLKEEVYMSHPPGFE-APFDHCVCRLRKALYGLNQG-----------------

Query:  ---HSDPTLFTKRSVSSKEMILL----------KSTSKNRKWRTNL-----IKNLGNLKYFLGMEM----------------------------ARPADT
           +SDP ++ KR   +  +ILL          K      K + +L     +K+LG  +  LGM++                            A+P  T
Subjt:  ---HSDPTLFTKRSVSSKEMILL----------KSTSKNRKWRTNL-----IKNLGNLKYFLGMEM----------------------------ARPADT

Query:  PIECNANLGE-----------SVDKVPVNK-----------ERPDISYAVSVVCQFMQAPYEEHVEAINMILRCLKSTSRKENAL-NPIPTLTGRSVID-
        P+  +  L +           ++ KVP +             RPDI++AV VV +F++ P +EH EA+  ILR L+ T+          P L G +  D 
Subjt:  PIECNANLGE-----------SVDKVPVNK-----------ERPDISYAVSVVCQFMQAPYEEHVEAINMILRCLKSTSRKENAL-NPIPTLTGRSVID-

Query:  ------RKSTS-------------RSKKQSVVARSSAKAKYKAMSHGICEEIWLKKVLSD--IHQTGDLPMKLYCDNKAAISIANNP-------------
              RKS++             +SK Q  VA S+ +A+Y A +    E IWLK+ L +  +HQ       +YCD+++AI ++ N              
Subjt:  ------RKSTS-------------RSKKQSVVARSSAKAKYKAMSHGICEEIWLKKVLSD--IHQTGDLPMKLYCDNKAAISIANNP-------------

Query:  ----EKLENGSICIPYIPSSQQVTDILTKGLLKQNFD
            E +++ S+ +  I +++   D+LTK + +  F+
Subjt:  ----EKLENGSICIPYIPSSQQVTDILTKGLLKQNFD

P92520 Uncharacterized mitochondrial protein AtMg008201.8e-1163.16Show/hide
Query:  VGCKWVFTMKYKSDGTLDKYKARLVAKGFTQTFGIDYSETFSPVAKLNTIRVLLLVA
        +GCKWVF  K  SDGTLD+ KARLVAKGF Q  GI + ET+SPV +  TIR +L VA
Subjt:  VGCKWVFTMKYKSDGTLDKYKARLVAKGFTQTFGIDYSETFSPVAKLNTIRVLLLVA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.1e-6524.87Show/hide
Query:  QYQENCPVDTGTSSLGVIAQSGATDHLTGTSDNFLLYHPCAGNEKIWIVEGTLATVAGKGHIS---PFDGLILQNVLHVPKISYNLLSVSKITRDLNYHM
        Q + N  + +  SS   +  SGAT H+T   +N  L+ P  G + + + +G+   ++  G  S       L L N+L+VP I  NL+SV ++       +
Subjt:  QYQENCPVDTGTSSLGVIAQSGATDHLTGTSDNFLLYHPCAGNEKIWIVEGTLATVAGKGHIS---PFDGLILQNVLHVPKISYNLLSVSKITRDLNYHM

Query:  AFSPDDVIFQDLSSGRMIGTARHNTGLY------------FLDGDSSSRHS--------------------YRASLITTS--------------------
         F P     +DL++G  +   +    LY            F    S + HS                    Y  S++  S                    
Subjt:  AFSPDDVIFQDLSSGRMIGTARHNTGLY------------FLDGDSSSRHS--------------------YRASLITTS--------------------

Query:  -------------------------SGKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQLYTTIITQFNAKISILRSDNGRKFLPNTLREFLSTKGV-HQ
                                    R++V F+D  TR TW++ L  KS+V   F      +  +F  +I    SDNG +F+   L E+ S  G+ H 
Subjt:  -------------------------SGKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQLYTTIITQFNAKISILRSDNGRKFLPNTLREFLSTKGV-HQ

Query:  SSCAYTPQQNGVAERKNRHLLEVAR-------------PSNKSDASRVLNFQTPLDCLKLSYPTTRLLSDVP----LRVFGCSAF---------------
        +S  +TP+ NG++ERK+RH++E                P   + A  ++N + P   L+L  P  +L    P    LRVFGC+ +               
Subjt:  SSCAYTPQQNGVAERKNRHLLEVAR-------------PSNKSDASRVLNFQTPLDCLKLSYPTTRLLSDVP----LRVFGCSAF---------------

Query:  -----------------------KYFISMDITFLKDKPFFP-------VSPLQEESSNEETNYSNSLPEPILTTHNTVLPTRQA----QGTTDPDNNIVF
                               + +IS  + F  D+  FP       +SP+QE+       +S   P   L T   VLP           T P +    
Subjt:  -----------------------KYFISMDITFLKDKPFFP-------VSPLQEESSNEETNYSNSLPEPILTTHNTVLPTRQA----QGTTDPDNNIVF

Query:  AEDVCVENDIVDLT-----------KLPIENDRDDMTE----NNQENENEIMPQNFTTKE-------ELDKPGEYDATLDMPIALRKGTRS---------
          +  V +  +D +             P +N     T+      Q + ++   QN  T E        L  P +  ++   P      + +         
Subjt:  AEDVCVENDIVDLT-----------KLPIENDRDDMTE----NNQENENEIMPQNFTTKE-------ELDKPGEYDATLDMPIALRKGTRS---------

Query:  YTKYPMYSFLSYNN--------LSSKFRA----------FTVSLNTVTIPRNIYVAMEIPEWK----------------------ATHKP-VGCKWVFTM
        +   P+   ++ NN        + ++ +A            VSL   + PR    A++   W+                       +H   VGC+W+FT 
Subjt:  YTKYPMYSFLSYNN--------LSSKFRA----------FTVSLNTVTIPRNIYVAMEIPEWK----------------------ATHKP-VGCKWVFTM

Query:  KYKSDGTLDKYKARLVAKGFTQTFGIDYSETFSPVAKLNTIRVLLLVAVNKDWPLYQLDEKNAFLNGDLKEEVYMSHPPGF-EAPFDHCVCRLRKALYGL
        KY SDG+L++YKARLVAKG+ Q  G+DY+ETFSPV K  +IR++L VAV++ WP+ QLD  NAFL G L ++VYMS PPGF +    + VC+LRKALYGL
Subjt:  KYKSDGTLDKYKARLVAKGFTQTFGIDYSETFSPVAKLNTIRVLLLVAVNKDWPLYQLDEKNAFLNGDLKEEVYMSHPPGF-EAPFDHCVCRLRKALYGL

Query:  NQGH--------------------SDPTLFTKRS---------------VSSKEMILLKSTSKNRKWRTNLIKNLGNLKYFLGME---------------
         Q                      SD +LF  +                ++  +  LL +T  N   R + +K+   L YFLG+E               
Subjt:  NQGH--------------------SDPTLFTKRS---------------VSSKEMILLKSTSKNRKWRTNLIKNLGNLKYFLGME---------------

Query:  -----------MARPADTPIECNANLG-ESVDKVPVNKE--------------RPDISYAVSVVCQFMQAPYEEHVEAINMILRCLKSTSRKENALNPIP
                    A+P  TP+  +  L   S  K+    E              RPDISYAV+ + QFM  P EEH++A+  ILR L  T      L    
Subjt:  -----------MARPADTPIECNANLG-ESVDKVPVNKE--------------RPDISYAVSVVCQFMQAPYEEHVEAINMILRCLKSTSRKENALNPIP

Query:  TLTGRSVIDRK---------STS-------------RSKKQSVVARSSAKAKYKAMSHGICEEIWLKKVLSDIHQTGDLPMKLYCDNKAAISIANNP---
        TL+  +  D           ST+              SKKQ  V RSS +A+Y+++++   E  W+  +L+++      P  +YCDN  A  +  NP   
Subjt:  TLTGRSVIDRK---------STS-------------RSKKQSVVARSSAKAKYKAMSHGICEEIWLKKVLSDIHQTGDLPMKLYCDNKAAISIANNP---

Query:  --------------EKLENGSICIPYIPSSQQVTDILTKGLLKQNF
                       ++++G++ + ++ +  Q+ D LTK L +  F
Subjt:  --------------EKLENGSICIPYIPSSQQVTDILTKGLLKQNF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE29.0e-6424.81Show/hide
Query:  ETTSGSQP-QYQENCPVDTGTSSLGVIAQSGATDHLTGTSDNFLLYHPCAGNEKIWIVEGTLATVAGKGHIS---PFDGLILQNVLHVPKISYNLLSVSK
        ++TS   P Q + N  V++  ++   +  SGAT H+T   +N   + P  G + + I +G+   +   G  S       L L  VL+VP I  NL+SV +
Subjt:  ETTSGSQP-QYQENCPVDTGTSSLGVIAQSGATDHLTGTSDNFLLYHPCAGNEKIWIVEGTLATVAGKGHIS---PFDGLILQNVLHVPKISYNLLSVSK

Query:  ITRDLNYHMAFSPDDVIFQDLSSGRMIGTARHNTGLY------------FLDGDSSSRHS----------------------------------------
        +       + F P     +DL++G  +   +    LY            F    S + HS                                        
Subjt:  ITRDLNYHMAFSPDDVIFQDLSSGRMIGTARHNTGLY------------FLDGDSSSRHS----------------------------------------

Query:  -------YRASLITTS------------------SGKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQLYTTIITQFNAKISILRSDNGRKFLPNTLREF
               +  S IT+S                     R++V F+D  TR TW++ L  KS+V   F    + +  +F  +I  L SDNG +F+   LR++
Subjt:  -------YRASLITTS------------------SGKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQLYTTIITQFNAKISILRSDNGRKFLPNTLREF

Query:  LSTKGV-HQSSCAYTPQQNGVAERKNRHLLEVAR-------------PSNKSDASRVLNFQTPLDCLKLSYPTTRLLSDVP----LRVFGCSAFKYFISM
        LS  G+ H +S  +TP+ NG++ERK+RH++E+               P   S A  ++N + P   L+L  P  +L    P    L+VFGC+ + +    
Subjt:  LSTKGV-HQSSCAYTPQQNGVAERKNRHLLEVAR-------------PSNKSDASRVLNFQTPLDCLKLSYPTTRLLSDVP----LRVFGCSAFKYFISM

Query:  DITFLKDK---------------------------------------PF----FPVSPLQEESSNEETNYSN--SLP--------EPILTTHNTVLP---
        +   L+DK                                       PF    F VS  QE+ S+   N+ +  +LP         P L  H    P   
Subjt:  DITFLKDK---------------------------------------PF----FPVSPLQEESSNEETNYSN--SLP--------EPILTTHNTVLP---

Query:  -------TRQAQGTTDPDNNI---VFAEDVCVENDIVDLTKLP------------IENDRDDMTENNQENENEIMPQNFTTKEELDKPGEYDATLDMPIA
               T Q   +  P ++I     +E     ++    T  P            + N   +    N  N+N  +PQ+  +   +  P    +  + P +
Subjt:  -------TRQAQGTTDPDNNI---VFAEDVCVENDIVDLTKLP------------IENDRDDMTENNQENENEIMPQNFTTKEELDKPGEYDATLDMPIA

Query:  LRKGT-------------RSYTKYPMYSFLSYNNLSSKFR------AFTVSLNTVTIPRNIYVAMEIPEWKATHKP-----------------------V
            T             +   + P+ +           R      ++  SL   + PR    AM+   W+                            V
Subjt:  LRKGT-------------RSYTKYPMYSFLSYNNLSSKFR------AFTVSLNTVTIPRNIYVAMEIPEWKATHKP-----------------------V

Query:  GCKWVFTMKYKSDGTLDKYKARLVAKGFTQTFGIDYSETFSPVAKLNTIRVLLLVAVNKDWPLYQLDEKNAFLNGDLKEEVYMSHPPGF-EAPFDHCVCR
        GC+W+FT K+ SDG+L++YKARLVAKG+ Q  G+DY+ETFSPV K  +IR++L VAV++ WP+ QLD  NAFL G L +EVYMS PPGF +      VCR
Subjt:  GCKWVFTMKYKSDGTLDKYKARLVAKGFTQTFGIDYSETFSPVAKLNTIRVLLLVAVNKDWPLYQLDEKNAFLNGDLKEEVYMSHPPGF-EAPFDHCVCR

Query:  LRKALYGLNQGH--------------------SDPTLFTKRS---------------VSSKEMILLKSTSKNRKWRTNLIKNLGNLKYFLGME-------
        LRKA+YGL Q                      SD +LF  +                ++  + +LLK T      R + +K   +L YFLG+E       
Subjt:  LRKALYGLNQGH--------------------SDPTLFTKRS---------------VSSKEMILLKSTSKNRKWRTNLIKNLGNLKYFLGME-------

Query:  -------------------MARPADTPIECNANLG-ESVDKVPVNKE--------------RPDISYAVSVVCQFMQAPYEEHVEAINMILRCLKSTSRK
                            A+P  TP+  +  L   S  K+P   E              RPD+SYAV+ + Q+M  P ++H  A+  +LR L  T   
Subjt:  -------------------MARPADTPIECNANLG-ESVDKVPVNKE--------------RPDISYAVSVVCQFMQAPYEEHVEAINMILRCLKSTSRK

Query:  ENALNPIPTLTGRSVIDRK---------STS-------------RSKKQSVVARSSAKAKYKAMSHGICEEIWLKKVLSDIHQTGDLPMKLYCDNKAAIS
           L    TL+  +  D           ST+              SKKQ  V RSS +A+Y+++++   E  W+  +L+++      P  +YCDN  A  
Subjt:  ENALNPIPTLTGRSVIDRK---------STS-------------RSKKQSVVARSSAKAKYKAMSHGICEEIWLKKVLSDIHQTGDLPMKLYCDNKAAIS

Query:  IANNP-----------------EKLENGSICIPYIPSSQQVTDILTKGLLK---QNF
        +  NP                  ++++G++ + ++ +  Q+ D LTK L +   QNF
Subjt:  IANNP-----------------EKLENGSICIPYIPSSQQVTDILTKGLLK---QNF

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.8e-5232.22Show/hide
Query:  SYTKYPMYSFLSYNNLSSKFRAFTVSLNTVTIPRNIYVAMEIPEW-----------KATH-----------KPVGCKWVFTMKYKSDGTLDKYKARLVAK
        S T + +  FLSY  +S  + +F V +     P     A E   W           + TH           KP+GCKWV+ +KY SDGT+++YKARLVAK
Subjt:  SYTKYPMYSFLSYNNLSSKFRAFTVSLNTVTIPRNIYVAMEIPEW-----------KATH-----------KPVGCKWVFTMKYKSDGTLDKYKARLVAK

Query:  GFTQTFGIDYSETFSPVAKLNTIRVLLLVAVNKDWPLYQLDEKNAFLNGDLKEEVYMSHPPGFEAPF-----DHCVCRLRKALYGLN-------------
        G+TQ  GID+ ETFSPV KL +++++L ++   ++ L+QLD  NAFLNGDL EE+YM  PPG+ A        + VC L+K++YGL              
Subjt:  GFTQTFGIDYSETFSPVAKLNTIRVLLLVAVNKDWPLYQLDEKNAFLNGDLKEEVYMSHPPGFEAPF-----DHCVCRLRKALYGLN-------------

Query:  -------QGHSDPTLFTKRS--------VSSKEMILLKSTS------KNRKWRTNLIKNLGNLKYFLGMEMAR--------------------------P
               Q HSD T F K +        V   ++I+  +        K++      +++LG LKYFLG+E+AR                          P
Subjt:  -------QGHSDPTLFTKRS--------VSSKEMILLKSTS------KNRKWRTNLIKNLGNLKYFLGMEMAR--------------------------P

Query:  ADTPIECNANL-----GESVDKVPVNK----------ERPDISYAVSVVCQFMQAPYEEHVEAINMILRCLKST--------SRKENALNPIPTLTGRSV
        +  P++ +        G+ VD     +           R DIS+AV+ + QF +AP   H +A+  IL  +K T        S+ E  L      + +S 
Subjt:  ADTPIECNANL-----GESVDKVPVNK----------ERPDISYAVSVVCQFMQAPYEEHVEAINMILRCLKST--------SRKENALNPIPTLTGRSV

Query:  ID-RKSTS-------------RSKKQSVVARSSAKAKYKAMSHGICEEIWLKKVLSDIHQTGDLPMKLYCDNKAAISIANN
         D R+ST+             +SKKQ VV++SSA+A+Y+A+S    E +WL +   ++      P  L+CDN AAI IA N
Subjt:  ID-RKSTS-------------RSKKQSVVARSSAKAKYKAMSHGICEEIWLKKVLSDIHQTGDLPMKLYCDNKAAISIANN

ATMG00810.1 DNA/RNA polymerases superfamily protein2.2e-0423.66Show/hide
Query:  DPTLFTKRSVSSKEMILLKSTSKNRKWRTNLIKNLGNLKYFLGMEM--------------------------ARPADTPIECNANLGESVDKVP------
        D  L T  S +   M++ + +S      T  +K+LG + YFLG+++                           +P  TP+    N   S  K P      
Subjt:  DPTLFTKRSVSSKEMILLKSTSKNRKWRTNLIKNLGNLKYFLGMEM--------------------------ARPADTPIECNANLGESVDKVP------

Query:  --------VNKERPDISYAVSVVCQFMQAPYEEHVEAINMILRCLKSTSRKENALNPIPTLTGRSVID---------RKSTS-------------RSKKQ
                +   RPDISYAV++VCQ M  P     + +  +LR +K T      ++    L  ++  D         R+ST+              +K+Q
Subjt:  --------VNKERPDISYAVSVVCQFMQAPYEEHVEAINMILRCLKSTSRKENALNPIPTLTGRSVID---------RKSTS-------------RSKKQ

Query:  SVVARSSAKAKYKAMSHGICEEIW
          V+RSS + +Y+A++    E  W
Subjt:  SVVARSSAKAKYKAMSHGICEEIW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.3e-1263.16Show/hide
Query:  VGCKWVFTMKYKSDGTLDKYKARLVAKGFTQTFGIDYSETFSPVAKLNTIRVLLLVA
        +GCKWVF  K  SDGTLD+ KARLVAKGF Q  GI + ET+SPV +  TIR +L VA
Subjt:  VGCKWVFTMKYKSDGTLDKYKARLVAKGFTQTFGIDYSETFSPVAKLNTIRVLLLVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGACTAGGCAAGATCTACCGCTACCAGTTCGGCCGTCCAAAAAGCTGTCAACCAGCACCTCCGATTCGAACAACGCTGATGGACCCTAGTGCTACTGCCACT
CAAATCGACCAACAACTTGCCGCCGTGTCCCTTAATCCGTTACTCGATTCAACGAGAGTGAGTCTCGCTCCAAGTCTCGCTCCAAGTTCGGTCACATGTAGTCTT
GCTCCAAGTCTCGCTCCAACTGCAGTCGCGTGTTCTTTCTCTCAACCAACGAGAGTGAGTCTCGCTCCAAGTCTCGCTCAAAGTGCAGTCGTGGGTTCAAGCACC
GTCGCGGGAACTTCTCAAGGACTAGCATTCCATGAGGCTGACAGCACCAACTTCAATCCACATGTTTTGGTTTGTCACTATTTCCCTCCTACAGCTGTTGTTGTC
GTTGGAATTCCTCACCAAGGAGTCGAAAACTACTATGAACAACACGGATCAGTTGTCCCTTCCTCGGCTAAAGTTCAACAACAACTTGCTAATCTTCAAGCAAGT
TTCCAACAACAAATTGCGGCTCTTGGGGCAGCCCTTGGCGCTTCAACCATCTTTGATCAAGGTCCAATAAATTCAGGTATTCAATCAAGTTTACCGATCTATCTA
AAGAATCCAATAACTGTCTATCTTACCTTAACAACTGCACCGCATATGTCTGGACCAATGGGTACCTCCATGGGGCTAATTGTTGGTGAAAAATTAAATGGTCAG
AATTATTTCTCATGGTCTCAGTCTATTAAAATGGAACTTGAAGGACGCCACAAATTTGGGTATCTAACTGGCGAGATATCGAAACCCAAACCAGGAGACCCTCAG
GAGCCTATTGCTCATGATATTTGGGATGCTATCCAGAAACTATACTCCAAGAGACAGAACGCATCACGGCTCTACACTCTGCGAAAACAAATTCATGAATGCAAA
CAGGGGTCTATGGATGACTGTCCTTGTGGAGGAGTCCAATATTATAAACTTGAAGAAGTTGACCGTGTGTATGATTTTTCAGTCCGTCTTGAGGAAGATAGATCA
AGTGCTATGAATATCACAACTACCTCCATAACTGACTCTGCCTCCTTCAGTGCTAAATCATCTGGTACTACTAAGCAACGACGACCTCCAAATAACAAACCCAGC
CAGGCTCTGGTGAGTGAGACTACTAGTGGCTCTCAGCCACAATATCAAGAGAACTGCCCAGTTGACACCGGTACTTCCTCTCTTGGGGTGATTGCACAATCAGGA
GCTACAGATCATCTAACTGGAACCTCTGACAATTTTCTCTTGTATCATCCATGTGCTGGTAACGAAAAAATCTGGATTGTTGAGGGGACACTTGCCACTGTTGCC
GGCAAGGGTCACATTTCTCCATTTGATGGTTTGATATTACAGAATGTGCTACATGTTCCTAAAATCTCTTATAATTTACTATCTGTGAGTAAGATAACCAGAGAT
CTAAACTATCATATGGCTTTCTCACCGGATGATGTTATTTTTCAGGACTTGAGCTCGGGGAGGATGATTGGCACTGCCCGACACAATACGGGGCTCTATTTCCTT
GATGGTGATAGTTCCTCTAGGCATAGTTATAGGGCTAGTTTGATTACTACTTCTTCGGGTAAGCGTTGGTTCGTTACCTTCATTGATGACCACACTCGCCTTACT
TGGGTTTTCCTTTTAACAGATAAGTCTGAGGTCTCATCCATTTTCCAACAGTTGTACACCACCATTATAACTCAGTTCAATGCCAAAATTTCTATCCTTCGGAGT
GATAATGGTCGTAAATTCCTCCCCAATACCCTTCGTGAGTTCCTATCCACTAAAGGTGTTCACCAGAGCTCGTGTGCCTATACACCTCAACAAAATGGAGTGGCT
GAAAGAAAAAACCGTCATCTCCTCGAAGTAGCTCGTCCATCTAATAAATCGGATGCCTCCCGCGTTCTCAACTTCCAAACTCCTCTTGACTGTCTCAAACTGTCT
TACCCTACCACTCGCCTTCTATCTGATGTCCCTCTCCGAGTATTCGGGTGTAGTGCATTTAAATACTTCATCTCTATGGATATCACCTTCCTGAAAGATAAACCT
TTCTTTCCCGTTAGTCCTCTTCAGGAGGAGAGTAGTAATGAAGAGACTAACTATTCTAATTCCCTTCCTGAACCCATCCTGACAACCCACAACACTGTCCTACCC
ACTAGACAGGCTCAAGGTACTACTGACCCTGATAATAATATTGTTTTTGCTGAAGATGTTTGTGTTGAAAATGATATTGTTGATCTGACAAAATTGCCGATAGAG
AATGATAGAGATGATATGACAGAAAATAACCAGGAGAATGAAAATGAGATCATGCCTCAGAATTTCACAACTAAAGAAGAACTTGACAAACCAGGAGAGTATGAT
GCCACCCTTGACATGCCCATTGCTCTGAGAAAAGGCACAAGGTCCTACACCAAGTATCCCATGTATAGCTTCCTCTCTTATAACAATCTGTCTTCTAAGTTCAGA
GCGTTTACTGTCAGCCTCAACACTGTAACTATACCGAGGAACATATATGTGGCTATGGAAATTCCTGAGTGGAAGGCTACCCATAAACCAGTTGGGTGTAAGTGG
GTGTTCACAATGAAGTATAAATCTGACGGAACTCTTGACAAGTATAAAGCCAGGCTTGTAGCAAAAGGATTCACCCAAACTTTTGGGATAGACTATTCTGAGACT
TTCTCTCCTGTTGCAAAGTTAAACACGATCAGGGTTCTTTTGTTGGTTGCTGTCAACAAAGATTGGCCTCTTTATCAGCTAGATGAGAAAAACGCATTTCTAAAC
GGTGATCTAAAAGAAGAGGTGTATATGAGCCATCCTCCTGGCTTTGAAGCCCCATTTGACCATTGTGTGTGCAGACTTAGAAAGGCTCTGTATGGGCTGAACCAG
GGACACTCTGATCCTACCTTGTTTACAAAAAGGTCCGTCTCTAGTAAGGAGATGATACTACTGAAATCAACAAGTAAAAACAGAAAATGGCGAACGAATTTAATC
AAAAACCTAGGAAATCTGAAGTACTTTCTTGGGATGGAGATGGCAAGGCCTGCTGACACTCCTATCGAGTGTAATGCAAACTTGGGAGAGTCTGTTGACAAAGTT
CCTGTTAACAAGGAAAGACCCGACATTTCCTATGCTGTCAGCGTTGTTTGCCAGTTTATGCAGGCTCCCTACGAAGAACACGTGGAAGCTATAAACATGATTCTA
AGATGTTTGAAAAGCACATCAAGAAAAGAAAATGCATTGAACCCTATACCGACTCTGACTGGGCGATCAGTGATTGACAGGAAATCTACATCCAGGAGTAAAAAG
CAAAGTGTGGTTGCTAGAAGCAGTGCAAAAGCTAAATATAAAGCTATGAGTCATGGCATTTGTGAAGAGATCTGGTTAAAGAAAGTTTTATCTGATATTCACCAA
ACCGGTGACCTACCAATGAAACTCTATTGTGATAATAAGGCTGCTATTAGCATTGCTAACAATCCAGAAAAACTGGAAAATGGAAGTATTTGTATTCCGTATATT
CCTTCTAGTCAACAAGTTACTGATATCCTCACTAAAGGGTTACTTAAGCAGAACTTTGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGACTAGGCAAGATCTACCGCTACCAGTTCGGCCGTCCAAAAAGCTGTCAACCAGCACCTCCGATTCGAACAACGCTGATGGACCCTAGTGCTACTGCCACT
CAAATCGACCAACAACTTGCCGCCGTGTCCCTTAATCCGTTACTCGATTCAACGAGAGTGAGTCTCGCTCCAAGTCTCGCTCCAAGTTCGGTCACATGTAGTCTT
GCTCCAAGTCTCGCTCCAACTGCAGTCGCGTGTTCTTTCTCTCAACCAACGAGAGTGAGTCTCGCTCCAAGTCTCGCTCAAAGTGCAGTCGTGGGTTCAAGCACC
GTCGCGGGAACTTCTCAAGGACTAGCATTCCATGAGGCTGACAGCACCAACTTCAATCCACATGTTTTGGTTTGTCACTATTTCCCTCCTACAGCTGTTGTTGTC
GTTGGAATTCCTCACCAAGGAGTCGAAAACTACTATGAACAACACGGATCAGTTGTCCCTTCCTCGGCTAAAGTTCAACAACAACTTGCTAATCTTCAAGCAAGT
TTCCAACAACAAATTGCGGCTCTTGGGGCAGCCCTTGGCGCTTCAACCATCTTTGATCAAGGTCCAATAAATTCAGGTATTCAATCAAGTTTACCGATCTATCTA
AAGAATCCAATAACTGTCTATCTTACCTTAACAACTGCACCGCATATGTCTGGACCAATGGGTACCTCCATGGGGCTAATTGTTGGTGAAAAATTAAATGGTCAG
AATTATTTCTCATGGTCTCAGTCTATTAAAATGGAACTTGAAGGACGCCACAAATTTGGGTATCTAACTGGCGAGATATCGAAACCCAAACCAGGAGACCCTCAG
GAGCCTATTGCTCATGATATTTGGGATGCTATCCAGAAACTATACTCCAAGAGACAGAACGCATCACGGCTCTACACTCTGCGAAAACAAATTCATGAATGCAAA
CAGGGGTCTATGGATGACTGTCCTTGTGGAGGAGTCCAATATTATAAACTTGAAGAAGTTGACCGTGTGTATGATTTTTCAGTCCGTCTTGAGGAAGATAGATCA
AGTGCTATGAATATCACAACTACCTCCATAACTGACTCTGCCTCCTTCAGTGCTAAATCATCTGGTACTACTAAGCAACGACGACCTCCAAATAACAAACCCAGC
CAGGCTCTGGTGAGTGAGACTACTAGTGGCTCTCAGCCACAATATCAAGAGAACTGCCCAGTTGACACCGGTACTTCCTCTCTTGGGGTGATTGCACAATCAGGA
GCTACAGATCATCTAACTGGAACCTCTGACAATTTTCTCTTGTATCATCCATGTGCTGGTAACGAAAAAATCTGGATTGTTGAGGGGACACTTGCCACTGTTGCC
GGCAAGGGTCACATTTCTCCATTTGATGGTTTGATATTACAGAATGTGCTACATGTTCCTAAAATCTCTTATAATTTACTATCTGTGAGTAAGATAACCAGAGAT
CTAAACTATCATATGGCTTTCTCACCGGATGATGTTATTTTTCAGGACTTGAGCTCGGGGAGGATGATTGGCACTGCCCGACACAATACGGGGCTCTATTTCCTT
GATGGTGATAGTTCCTCTAGGCATAGTTATAGGGCTAGTTTGATTACTACTTCTTCGGGTAAGCGTTGGTTCGTTACCTTCATTGATGACCACACTCGCCTTACT
TGGGTTTTCCTTTTAACAGATAAGTCTGAGGTCTCATCCATTTTCCAACAGTTGTACACCACCATTATAACTCAGTTCAATGCCAAAATTTCTATCCTTCGGAGT
GATAATGGTCGTAAATTCCTCCCCAATACCCTTCGTGAGTTCCTATCCACTAAAGGTGTTCACCAGAGCTCGTGTGCCTATACACCTCAACAAAATGGAGTGGCT
GAAAGAAAAAACCGTCATCTCCTCGAAGTAGCTCGTCCATCTAATAAATCGGATGCCTCCCGCGTTCTCAACTTCCAAACTCCTCTTGACTGTCTCAAACTGTCT
TACCCTACCACTCGCCTTCTATCTGATGTCCCTCTCCGAGTATTCGGGTGTAGTGCATTTAAATACTTCATCTCTATGGATATCACCTTCCTGAAAGATAAACCT
TTCTTTCCCGTTAGTCCTCTTCAGGAGGAGAGTAGTAATGAAGAGACTAACTATTCTAATTCCCTTCCTGAACCCATCCTGACAACCCACAACACTGTCCTACCC
ACTAGACAGGCTCAAGGTACTACTGACCCTGATAATAATATTGTTTTTGCTGAAGATGTTTGTGTTGAAAATGATATTGTTGATCTGACAAAATTGCCGATAGAG
AATGATAGAGATGATATGACAGAAAATAACCAGGAGAATGAAAATGAGATCATGCCTCAGAATTTCACAACTAAAGAAGAACTTGACAAACCAGGAGAGTATGAT
GCCACCCTTGACATGCCCATTGCTCTGAGAAAAGGCACAAGGTCCTACACCAAGTATCCCATGTATAGCTTCCTCTCTTATAACAATCTGTCTTCTAAGTTCAGA
GCGTTTACTGTCAGCCTCAACACTGTAACTATACCGAGGAACATATATGTGGCTATGGAAATTCCTGAGTGGAAGGCTACCCATAAACCAGTTGGGTGTAAGTGG
GTGTTCACAATGAAGTATAAATCTGACGGAACTCTTGACAAGTATAAAGCCAGGCTTGTAGCAAAAGGATTCACCCAAACTTTTGGGATAGACTATTCTGAGACT
TTCTCTCCTGTTGCAAAGTTAAACACGATCAGGGTTCTTTTGTTGGTTGCTGTCAACAAAGATTGGCCTCTTTATCAGCTAGATGAGAAAAACGCATTTCTAAAC
GGTGATCTAAAAGAAGAGGTGTATATGAGCCATCCTCCTGGCTTTGAAGCCCCATTTGACCATTGTGTGTGCAGACTTAGAAAGGCTCTGTATGGGCTGAACCAG
GGACACTCTGATCCTACCTTGTTTACAAAAAGGTCCGTCTCTAGTAAGGAGATGATACTACTGAAATCAACAAGTAAAAACAGAAAATGGCGAACGAATTTAATC
AAAAACCTAGGAAATCTGAAGTACTTTCTTGGGATGGAGATGGCAAGGCCTGCTGACACTCCTATCGAGTGTAATGCAAACTTGGGAGAGTCTGTTGACAAAGTT
CCTGTTAACAAGGAAAGACCCGACATTTCCTATGCTGTCAGCGTTGTTTGCCAGTTTATGCAGGCTCCCTACGAAGAACACGTGGAAGCTATAAACATGATTCTA
AGATGTTTGAAAAGCACATCAAGAAAAGAAAATGCATTGAACCCTATACCGACTCTGACTGGGCGATCAGTGATTGACAGGAAATCTACATCCAGGAGTAAAAAG
CAAAGTGTGGTTGCTAGAAGCAGTGCAAAAGCTAAATATAAAGCTATGAGTCATGGCATTTGTGAAGAGATCTGGTTAAAGAAAGTTTTATCTGATATTCACCAA
ACCGGTGACCTACCAATGAAACTCTATTGTGATAATAAGGCTGCTATTAGCATTGCTAACAATCCAGAAAAACTGGAAAATGGAAGTATTTGTATTCCGTATATT
CCTTCTAGTCAACAAGTTACTGATATCCTCACTAAAGGGTTACTTAAGCAGAACTTTGATTAG
Protein sequenceShow/hide protein sequence
MRLGKIYRYQFGRPKSCQPAPPIRTTLMDPSATATQIDQQLAAVSLNPLLDSTRVSLAPSLAPSSVTCSLAPSLAPTAVACSFSQPTRVSLAPSLAQSAVVGSST
VAGTSQGLAFHEADSTNFNPHVLVCHYFPPTAVVVVGIPHQGVENYYEQHGSVVPSSAKVQQQLANLQASFQQQIAALGAALGASTIFDQGPINSGIQSSLPIYL
KNPITVYLTLTTAPHMSGPMGTSMGLIVGEKLNGQNYFSWSQSIKMELEGRHKFGYLTGEISKPKPGDPQEPIAHDIWDAIQKLYSKRQNASRLYTLRKQIHECK
QGSMDDCPCGGVQYYKLEEVDRVYDFSVRLEEDRSSAMNITTTSITDSASFSAKSSGTTKQRRPPNNKPSQALVSETTSGSQPQYQENCPVDTGTSSLGVIAQSG
ATDHLTGTSDNFLLYHPCAGNEKIWIVEGTLATVAGKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNYHMAFSPDDVIFQDLSSGRMIGTARHNTGLYFL
DGDSSSRHSYRASLITTSSGKRWFVTFIDDHTRLTWVFLLTDKSEVSSIFQQLYTTIITQFNAKISILRSDNGRKFLPNTLREFLSTKGVHQSSCAYTPQQNGVA
ERKNRHLLEVARPSNKSDASRVLNFQTPLDCLKLSYPTTRLLSDVPLRVFGCSAFKYFISMDITFLKDKPFFPVSPLQEESSNEETNYSNSLPEPILTTHNTVLP
TRQAQGTTDPDNNIVFAEDVCVENDIVDLTKLPIENDRDDMTENNQENENEIMPQNFTTKEELDKPGEYDATLDMPIALRKGTRSYTKYPMYSFLSYNNLSSKFR
AFTVSLNTVTIPRNIYVAMEIPEWKATHKPVGCKWVFTMKYKSDGTLDKYKARLVAKGFTQTFGIDYSETFSPVAKLNTIRVLLLVAVNKDWPLYQLDEKNAFLN
GDLKEEVYMSHPPGFEAPFDHCVCRLRKALYGLNQGHSDPTLFTKRSVSSKEMILLKSTSKNRKWRTNLIKNLGNLKYFLGMEMARPADTPIECNANLGESVDKV
PVNKERPDISYAVSVVCQFMQAPYEEHVEAINMILRCLKSTSRKENALNPIPTLTGRSVIDRKSTSRSKKQSVVARSSAKAKYKAMSHGICEEIWLKKVLSDIHQ
TGDLPMKLYCDNKAAISIANNPEKLENGSICIPYIPSSQQVTDILTKGLLKQNFD