; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G19125 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G19125
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionBeta-galactosidase
Genome locationClcChr09:32219912..32225053
RNA-Seq ExpressionClc09G19125
SyntenyClc09G19125
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR029472 - Retrotransposon Copia-like, N-terminal
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026232.1 Beta-galactosidase [Cucumis melo var. makuwa]0.0e+0046.32Show/hide
Query:  ELPLYSENS-TVFPTLTLE--SRPMENSTGLIVGEKLNGQNYFSWFQSVKMVLDGRNKFGYLTGEIPKPKPGDPQERVWKGEDSLLRSLLIHSMEPQIGK
        ELP+YS+N  T FP       +  + +STG   GEKLNGQNYFSW QS+KM L+GR +FG+LTGEI +P PGD  ER+WKGEDSL+RS+LI+SMEPQIGK
Subjt:  ELPLYSENS-TVFPTLTLE--SRPMENSTGLIVGEKLNGQNYFSWFQSVKMVLDGRNKFGYLTGEIPKPKPGDPQERVWKGEDSLLRSLLIHSMEPQIGK

Query:  PLLYATATRDIWDVVQKLYSKRQNASRLYTLRKQIHECKQG--------------------------------IQYYKLEEVDRVYDFLAGLNSKFDAVR
        PLLYAT  +D+WD  Q LYSKRQNASRLYTLRKQ+H CKQG                                 QY KLEE DRVYDFLAGLN KFD V 
Subjt:  PLLYATATRDIWDVVQKLYSKRQNASRLYTLRKQIHECKQG--------------------------------IQYYKLEEVDRVYDFLAGLNSKFDAVR

Query:  SRIL---------EIKINTDLDGKDRSSAMNITTTSVTDSAAFSAKSSSTTEDKINVGS-----------------------------------------
         RIL         E+     L+ +DR++AM + TT   DSAAFSA+SS+   DK N  S                                         
Subjt:  SRIL---------EIKINTDLDGKDRSSAMNITTTSVTDSAAFSAKSSSTTEDKINVGS-----------------------------------------

Query:  YMVGPRMADDLQTTN--LTKLCTFSLGAIAQL----------------------TTNHLTRTSDNFLSYHSCAGNEKIWIAGEILAPIAGKGHISPFNGL
        Y+     A   Q+T+  +++  T +LGAIAQ                        T+HLT +S++F+SY  CAGNEKI IA   LAPIAGKG I PF+G 
Subjt:  YMVGPRMADDLQTTN--LTKLCTFSLGAIAQL----------------------TTNHLTRTSDNFLSYHSCAGNEKIWIAGEILAPIAGKGHISPFNGL

Query:  ILQNMLHVPKISYNLLSMSKITRDPNCRVAFSSDDVIFQVLSSRKMIGTTQHNRGLYFLDGDTSSRHSYRA-----------------------------
         LQN+LHVPK+SYNLLS+SKITR+ +C+  F  + V FQ +SS + IGT +H+RGLY LD DTS     R                              
Subjt:  ILQNMLHVPKISYNLLSMSKITRDPNCRVAFSSDDVIFQVLSSRKMIGTTQHNRGLYFLDGDTSSRHSYRA-----------------------------

Query:  RSIFCPLTNPLNLSS----------------------------LIHSDVWGPSKITTSSGKRWCVTFIDDHTRLTWVFLLIDKSKISSIFQEFYTTIETQ
        + +F  L + +++SS                            LIHSDVWGPSK+TTSSGKRW VTFIDDHTRLTWV+L+ DKS++ SIFQ FY TI+TQ
Subjt:  RSIFCPLTNPLNLSS----------------------------LIHSDVWGPSKITTSSGKRWCVTFIDDHTRLTWVFLLIDKSKISSIFQEFYTTIETQ

Query:  FNAKIVILRSDNGREFLTSTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLE----------------RDAVLTTAHLINRMSSCVLSFQTPLDCLK
        F+ KI ILRSDNGREF    L EFL++KGIVHQ+SCAYTPQQNGVAERKNRHL+E                 DA+LT AHLINRM S +L  QTPLDCLK
Subjt:  FNAKIVILRSDNGREFLTSTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLE----------------RDAVLTTAHLINRMSSCVLSFQTPLDCLK

Query:  LSYPTTRLIPDC--------------------------------------GYKCFHPNSRKYFISMDLTFLEDKPFFPVSPLQGESSNEETN--YSNPIP
         SYP+TRL+ +                                       GYKCFHP SRKYF++MD+TF E++P+FPVS LQGE+ +EE+N  +    P
Subjt:  LSYPTTRLIPDC--------------------------------------GYKCFHPNSRKYFISMDLTFLEDKPFFPVSPLQGESSNEETN--YSNPIP

Query:  KLI----LTTHDTVLPTRQ-----------KENNGPGPTDMLAPIQSSELTQAQG---TTDPNNNIVCVEYDIVDLTKLS--VENDRDDMTENNQFRRMK
         LI    +  H  +LPT Q           ++  G   +   AP+Q+ E  + QG    T P  N    E D  D+  L    E + DD TE      ++
Subjt:  KLI----LTTHDTVLPTRQ-----------KENNGPGPTDMLAPIQSSELTQAQG---TTDPNNNIVCVEYDIVDLTKLS--VENDRDDMTENNQFRRMK

Query:  MRSCLRIPQLKFFDKLGKYDATLNMPIALRNGTRSYTRYPMYSVLSYNNLSSKFRAFTASLDTITIPKNIYVAMKIPEWKAA------------------
        + +     +     KL +YD +L++PIALR GTRS T++P+ + +SY+NLS +FRAFTA+LD+  IPKNIY A++ PEWK A                  
Subjt:  MRSCLRIPQLKFFDKLGKYDATLNMPIALRNGTRSYTRYPMYSVLSYNNLSSKFRAFTASLDTITIPKNIYVAMKIPEWKAA------------------

Query:  --KGHKTVGCKWVFTMKYKSDRTLDRYKAKLVAKGFTQTFGVDYSETFSPVAKLTQSRFFCRL----------LDL----------------PPSS----
          KGHKTVGCKWVF++KYK+D TLDR+KA+LVAKGFTQT+G+DYSETFSPVAKL   R    +          LD+                PP      
Subjt:  --KGHKTVGCKWVFTMKYKSDRTLDRYKAKLVAKGFTQTFGVDYSETFSPVAKLTQSRFFCRL----------LDL----------------PPSS----

Query:  -------------STKASLR---------------------DTLIIPCLRKGPFLVRLFVLIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYI
                       K S R                      TL     + G    ++ +LIVYVDDIVL+GDD  EI++LKQ+M DEF IKDLGNLKY 
Subjt:  -------------STKASLR---------------------DTLIIPCLRKGPFLVRLFVLIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYI

Query:  LGMEVARSREGISISQQKYTIDLLKETGMTGCRPANTPIEFNAKLGESGDRVPVNKERYHCLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNR
        LGMEVARS+EGIS+SQ+KYT+DLL ETGM GCRPA+TPIEFN KLG S D+VPV+KE+Y  LVG+LIYL+HTR DI +AV++VSQF+ A Y++HMEAVNR
Subjt:  LGMEVARSREGISISQQKYTIDLLKETGMTGCRPANTPIEFNAKLGESGDRVPVNKERYHCLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNR

Query:  ILRYLKDTLGKG--------------------------KSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLLFNLHQYSDLQM
        ILRYLK+T GKG                          KSTSGYCTFVWGNLVTWRSKKQSVVA+SSAE EY+AMS GICEEIWL+K+L +LHQ  +  +
Subjt:  ILRYLKDTLGKG--------------------------KSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLLFNLHQYSDLQM

Query:  KFYCDNKAAISIAN-------------------------------------IANILTKGLLKQNFDSCVSKMGLTDIYVPS
        K +CDNKAAISIAN                                     IA++LTKGLL+ +FD CVSK+GL DIY+P+
Subjt:  KFYCDNKAAISIAN-------------------------------------IANILTKGLLKQNFDSCVSKMGLTDIYVPS

KAA0034386.1 Beta-galactosidase [Cucumis melo var. makuwa]0.0e+0046.32Show/hide
Query:  ELPLYSENS-TVFPTLTLE--SRPMENSTGLIVGEKLNGQNYFSWFQSVKMVLDGRNKFGYLTGEIPKPKPGDPQERVWKGEDSLLRSLLIHSMEPQIGK
        ELP+YS+N  T FP       +  + +STG   GEKLNGQNYFSW QS+KM L+GR +FG+LTGEI +P PGD  ER+WKGEDSL+RS+LI+SMEPQIGK
Subjt:  ELPLYSENS-TVFPTLTLE--SRPMENSTGLIVGEKLNGQNYFSWFQSVKMVLDGRNKFGYLTGEIPKPKPGDPQERVWKGEDSLLRSLLIHSMEPQIGK

Query:  PLLYATATRDIWDVVQKLYSKRQNASRLYTLRKQIHECKQG--------------------------------IQYYKLEEVDRVYDFLAGLNSKFDAVR
        PLLYAT  +D+WD  Q LYSKRQNASRLYTLRKQ+H CKQG                                 QY KLEE DRVYDFLAGLN KFD V 
Subjt:  PLLYATATRDIWDVVQKLYSKRQNASRLYTLRKQIHECKQG--------------------------------IQYYKLEEVDRVYDFLAGLNSKFDAVR

Query:  SRIL---------EIKINTDLDGKDRSSAMNITTTSVTDSAAFSAKSSSTTEDKINVGS-----------------------------------------
         RIL         E+     L+ +DR++AM + TT   DSAAFSA+SS+   DK N  S                                         
Subjt:  SRIL---------EIKINTDLDGKDRSSAMNITTTSVTDSAAFSAKSSSTTEDKINVGS-----------------------------------------

Query:  YMVGPRMADDLQTTN--LTKLCTFSLGAIAQL----------------------TTNHLTRTSDNFLSYHSCAGNEKIWIAGEILAPIAGKGHISPFNGL
        Y+     A   Q+T+  +++  T +LGAIAQ                        T+HLT +S++F+SY  CAGNEKI IA   LAPIAGKG I PF+G 
Subjt:  YMVGPRMADDLQTTN--LTKLCTFSLGAIAQL----------------------TTNHLTRTSDNFLSYHSCAGNEKIWIAGEILAPIAGKGHISPFNGL

Query:  ILQNMLHVPKISYNLLSMSKITRDPNCRVAFSSDDVIFQVLSSRKMIGTTQHNRGLYFLDGDTSSRHSYRA-----------------------------
         LQN+LHVPK+SYNLLS+SKITR+ +C+  F  + V FQ +SS + IGT +H+RGLY LD DTS     R                              
Subjt:  ILQNMLHVPKISYNLLSMSKITRDPNCRVAFSSDDVIFQVLSSRKMIGTTQHNRGLYFLDGDTSSRHSYRA-----------------------------

Query:  RSIFCPLTNPLNLSS----------------------------LIHSDVWGPSKITTSSGKRWCVTFIDDHTRLTWVFLLIDKSKISSIFQEFYTTIETQ
        + +F  L + +++SS                            LIHSDVWGPSK+TTSSGKRW VTFIDDHTRLTWV+L+ DKS++ SIFQ FY TI+TQ
Subjt:  RSIFCPLTNPLNLSS----------------------------LIHSDVWGPSKITTSSGKRWCVTFIDDHTRLTWVFLLIDKSKISSIFQEFYTTIETQ

Query:  FNAKIVILRSDNGREFLTSTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLE----------------RDAVLTTAHLINRMSSCVLSFQTPLDCLK
        F+ KI ILRSDNGREF    L EFL++KGIVHQ+SCAYTPQQNGVAERKNRHL+E                 DA+LT AHLINRM S +L  QTPLDCLK
Subjt:  FNAKIVILRSDNGREFLTSTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLE----------------RDAVLTTAHLINRMSSCVLSFQTPLDCLK

Query:  LSYPTTRLIPDC--------------------------------------GYKCFHPNSRKYFISMDLTFLEDKPFFPVSPLQGESSNEETN--YSNPIP
         SYP+TRL+ +                                       GYKCFHP SRKYF++MD+TF E++P+FPVS LQGE+ +EE+N  +    P
Subjt:  LSYPTTRLIPDC--------------------------------------GYKCFHPNSRKYFISMDLTFLEDKPFFPVSPLQGESSNEETN--YSNPIP

Query:  KLI----LTTHDTVLPTRQ-----------KENNGPGPTDMLAPIQSSELTQAQG---TTDPNNNIVCVEYDIVDLTKLS--VENDRDDMTENNQFRRMK
         LI    +  H  +LPT Q           ++  G   +   AP+Q+ E  + QG    T P  N    E D  D+  L    E + DD TE      ++
Subjt:  KLI----LTTHDTVLPTRQ-----------KENNGPGPTDMLAPIQSSELTQAQG---TTDPNNNIVCVEYDIVDLTKLS--VENDRDDMTENNQFRRMK

Query:  MRSCLRIPQLKFFDKLGKYDATLNMPIALRNGTRSYTRYPMYSVLSYNNLSSKFRAFTASLDTITIPKNIYVAMKIPEWKAA------------------
        + +     +     KL +YD +L++PIALR GTRS T++P+ + +SY+NLS +FRAFTA+LD+  IPKNIY A++ PEWK A                  
Subjt:  MRSCLRIPQLKFFDKLGKYDATLNMPIALRNGTRSYTRYPMYSVLSYNNLSSKFRAFTASLDTITIPKNIYVAMKIPEWKAA------------------

Query:  --KGHKTVGCKWVFTMKYKSDRTLDRYKAKLVAKGFTQTFGVDYSETFSPVAKLTQSRFFCRL----------LDL----------------PPSS----
          KGHKTVGCKWVF++KYK+D TLDR+KA+LVAKGFTQT+G+DYSETFSPVAKL   R    +          LD+                PP      
Subjt:  --KGHKTVGCKWVFTMKYKSDRTLDRYKAKLVAKGFTQTFGVDYSETFSPVAKLTQSRFFCRL----------LDL----------------PPSS----

Query:  -------------STKASLR---------------------DTLIIPCLRKGPFLVRLFVLIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYI
                       K S R                      TL     + G    ++ +LIVYVDDIVL+GDD  EI++LKQ+M DEF IKDLGNLKY 
Subjt:  -------------STKASLR---------------------DTLIIPCLRKGPFLVRLFVLIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYI

Query:  LGMEVARSREGISISQQKYTIDLLKETGMTGCRPANTPIEFNAKLGESGDRVPVNKERYHCLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNR
        LGMEVARS+EGIS+SQ+KYT+DLL ETGM GCRPA+TPIEFN KLG S D+VPV+KE+Y  LVG+LIYL+HTR DI +AV++VSQF+ A Y++HMEAVNR
Subjt:  LGMEVARSREGISISQQKYTIDLLKETGMTGCRPANTPIEFNAKLGESGDRVPVNKERYHCLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNR

Query:  ILRYLKDTLGKG--------------------------KSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLLFNLHQYSDLQM
        ILRYLK+T GKG                          KSTSGYCTFVWGNLVTWRSKKQSVVA+SSAE EY+AMS GICEEIWL+K+L +LHQ  +  +
Subjt:  ILRYLKDTLGKG--------------------------KSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLLFNLHQYSDLQM

Query:  KFYCDNKAAISIAN-------------------------------------IANILTKGLLKQNFDSCVSKMGLTDIYVPS
        K +CDNKAAISIAN                                     IA++LTKGLL+ +FD CVSK+GL DIY+P+
Subjt:  KFYCDNKAAISIAN-------------------------------------IANILTKGLLKQNFDSCVSKMGLTDIYVPS

KAA0052775.1 Beta-galactosidase [Cucumis melo var. makuwa]0.0e+0046.32Show/hide
Query:  ELPLYSENS-TVFPTLTLE--SRPMENSTGLIVGEKLNGQNYFSWFQSVKMVLDGRNKFGYLTGEIPKPKPGDPQERVWKGEDSLLRSLLIHSMEPQIGK
        ELP+YS+N  T FP       +  + +STG   GEKLNGQNYFSW QS+KM L+GR +FG+LTGEI +P PGD  ER+WKGEDSL+RS+LI+SMEPQIGK
Subjt:  ELPLYSENS-TVFPTLTLE--SRPMENSTGLIVGEKLNGQNYFSWFQSVKMVLDGRNKFGYLTGEIPKPKPGDPQERVWKGEDSLLRSLLIHSMEPQIGK

Query:  PLLYATATRDIWDVVQKLYSKRQNASRLYTLRKQIHECKQG--------------------------------IQYYKLEEVDRVYDFLAGLNSKFDAVR
        PLLYAT  +D+WD  Q LYSKRQNASRLYTLRKQ+H CKQG                                 QY KLEE DRVYDFLAGLN KFD V 
Subjt:  PLLYATATRDIWDVVQKLYSKRQNASRLYTLRKQIHECKQG--------------------------------IQYYKLEEVDRVYDFLAGLNSKFDAVR

Query:  SRIL---------EIKINTDLDGKDRSSAMNITTTSVTDSAAFSAKSSSTTEDKINVGS-----------------------------------------
         RIL         E+     L+ +DR++AM + TT   DSAAFSA+SS+   DK N  S                                         
Subjt:  SRIL---------EIKINTDLDGKDRSSAMNITTTSVTDSAAFSAKSSSTTEDKINVGS-----------------------------------------

Query:  YMVGPRMADDLQTTN--LTKLCTFSLGAIAQL----------------------TTNHLTRTSDNFLSYHSCAGNEKIWIAGEILAPIAGKGHISPFNGL
        Y+     A   Q+T+  +++  T +LGAIAQ                        T+HLT +S++F+SY  CAGNEKI IA   LAPIAGKG I PF+G 
Subjt:  YMVGPRMADDLQTTN--LTKLCTFSLGAIAQL----------------------TTNHLTRTSDNFLSYHSCAGNEKIWIAGEILAPIAGKGHISPFNGL

Query:  ILQNMLHVPKISYNLLSMSKITRDPNCRVAFSSDDVIFQVLSSRKMIGTTQHNRGLYFLDGDTSSRHSYRA-----------------------------
         LQN+LHVPK+SYNLLS+SKITR+ +C+  F  + V FQ +SS + IGT +H+RGLY LD DTS     R                              
Subjt:  ILQNMLHVPKISYNLLSMSKITRDPNCRVAFSSDDVIFQVLSSRKMIGTTQHNRGLYFLDGDTSSRHSYRA-----------------------------

Query:  RSIFCPLTNPLNLSS----------------------------LIHSDVWGPSKITTSSGKRWCVTFIDDHTRLTWVFLLIDKSKISSIFQEFYTTIETQ
        + +F  L + +++SS                            LIHSDVWGPSK+TTSSGKRW VTFIDDHTRLTWV+L+ DKS++ SIFQ FY TI+TQ
Subjt:  RSIFCPLTNPLNLSS----------------------------LIHSDVWGPSKITTSSGKRWCVTFIDDHTRLTWVFLLIDKSKISSIFQEFYTTIETQ

Query:  FNAKIVILRSDNGREFLTSTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLE----------------RDAVLTTAHLINRMSSCVLSFQTPLDCLK
        F+ KI ILRSDNGREF    L EFL++KGIVHQ+SCAYTPQQNGVAERKNRHL+E                 DA+LT AHLINRM S +L  QTPLDCLK
Subjt:  FNAKIVILRSDNGREFLTSTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLE----------------RDAVLTTAHLINRMSSCVLSFQTPLDCLK

Query:  LSYPTTRLIPDC--------------------------------------GYKCFHPNSRKYFISMDLTFLEDKPFFPVSPLQGESSNEETN--YSNPIP
         SYP+TRL+ +                                       GYKCFHP SRKYF++MD+TF E++P+FPVS LQGE+ +EE+N  +    P
Subjt:  LSYPTTRLIPDC--------------------------------------GYKCFHPNSRKYFISMDLTFLEDKPFFPVSPLQGESSNEETN--YSNPIP

Query:  KLI----LTTHDTVLPTRQ-----------KENNGPGPTDMLAPIQSSELTQAQG---TTDPNNNIVCVEYDIVDLTKLS--VENDRDDMTENNQFRRMK
         LI    +  H  +LPT Q           ++  G   +   AP+Q+ E  + QG    T P  N    E D  D+  L    E + DD TE      ++
Subjt:  KLI----LTTHDTVLPTRQ-----------KENNGPGPTDMLAPIQSSELTQAQG---TTDPNNNIVCVEYDIVDLTKLS--VENDRDDMTENNQFRRMK

Query:  MRSCLRIPQLKFFDKLGKYDATLNMPIALRNGTRSYTRYPMYSVLSYNNLSSKFRAFTASLDTITIPKNIYVAMKIPEWKAA------------------
        + +     +     KL +YD +L++PIALR GTRS T++P+ + +SY+NLS +FRAFTA+LD+  IPKNIY A++ PEWK A                  
Subjt:  MRSCLRIPQLKFFDKLGKYDATLNMPIALRNGTRSYTRYPMYSVLSYNNLSSKFRAFTASLDTITIPKNIYVAMKIPEWKAA------------------

Query:  --KGHKTVGCKWVFTMKYKSDRTLDRYKAKLVAKGFTQTFGVDYSETFSPVAKLTQSRFFCRL----------LDL----------------PPSS----
          KGHKTVGCKWVF++KYK+D TLDR+KA+LVAKGFTQT+G+DYSETFSPVAKL   R    +          LD+                PP      
Subjt:  --KGHKTVGCKWVFTMKYKSDRTLDRYKAKLVAKGFTQTFGVDYSETFSPVAKLTQSRFFCRL----------LDL----------------PPSS----

Query:  -------------STKASLR---------------------DTLIIPCLRKGPFLVRLFVLIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYI
                       K S R                      TL     + G    ++ +LIVYVDDIVL+GDD  EI++LKQ+M DEF IKDLGNLKY 
Subjt:  -------------STKASLR---------------------DTLIIPCLRKGPFLVRLFVLIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYI

Query:  LGMEVARSREGISISQQKYTIDLLKETGMTGCRPANTPIEFNAKLGESGDRVPVNKERYHCLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNR
        LGMEVARS+EGIS+SQ+KYT+DLL ETGM GCRPA+TPIEFN KLG S D+VPV+KE+Y  LVG+LIYL+HTR DI +AV++VSQF+ A Y++HMEAVNR
Subjt:  LGMEVARSREGISISQQKYTIDLLKETGMTGCRPANTPIEFNAKLGESGDRVPVNKERYHCLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNR

Query:  ILRYLKDTLGKG--------------------------KSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLLFNLHQYSDLQM
        ILRYLK+T GKG                          KSTSGYCTFVWGNLVTWRSKKQSVVA+SSAE EY+AMS GICEEIWL+K+L +LHQ  +  +
Subjt:  ILRYLKDTLGKG--------------------------KSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLLFNLHQYSDLQM

Query:  KFYCDNKAAISIAN-------------------------------------IANILTKGLLKQNFDSCVSKMGLTDIYVPS
        K +CDNKAAISIAN                                     IA++LTKGLL+ +FD CVSK+GL DIY+P+
Subjt:  KFYCDNKAAISIAN-------------------------------------IANILTKGLLKQNFDSCVSKMGLTDIYVPS

KAA0057250.1 Beta-galactosidase [Cucumis melo var. makuwa]0.0e+0046.32Show/hide
Query:  ELPLYSENS-TVFPTLTLE--SRPMENSTGLIVGEKLNGQNYFSWFQSVKMVLDGRNKFGYLTGEIPKPKPGDPQERVWKGEDSLLRSLLIHSMEPQIGK
        ELP+YS+N  T FP       +  + +STG   GEKLNGQNYFSW QS+KM L+GR +FG+LTGEI +P PGD  ER+WKGEDSL+RS+LI+SMEPQIGK
Subjt:  ELPLYSENS-TVFPTLTLE--SRPMENSTGLIVGEKLNGQNYFSWFQSVKMVLDGRNKFGYLTGEIPKPKPGDPQERVWKGEDSLLRSLLIHSMEPQIGK

Query:  PLLYATATRDIWDVVQKLYSKRQNASRLYTLRKQIHECKQG--------------------------------IQYYKLEEVDRVYDFLAGLNSKFDAVR
        PLLYAT  +D+WD  Q LYSKRQNASRLYTLRKQ+H CKQG                                 QY KLEE DRVYDFLAGLN KFD V 
Subjt:  PLLYATATRDIWDVVQKLYSKRQNASRLYTLRKQIHECKQG--------------------------------IQYYKLEEVDRVYDFLAGLNSKFDAVR

Query:  SRIL---------EIKINTDLDGKDRSSAMNITTTSVTDSAAFSAKSSSTTEDKINVGS-----------------------------------------
         RIL         E+     L+ +DR++AM + TT   DSAAFSA+SS+   DK N  S                                         
Subjt:  SRIL---------EIKINTDLDGKDRSSAMNITTTSVTDSAAFSAKSSSTTEDKINVGS-----------------------------------------

Query:  YMVGPRMADDLQTTN--LTKLCTFSLGAIAQL----------------------TTNHLTRTSDNFLSYHSCAGNEKIWIAGEILAPIAGKGHISPFNGL
        Y+     A   Q+T+  +++  T +LGAIAQ                        T+HLT +S++F+SY  CAGNEKI IA   LAPIAGKG I PF+G 
Subjt:  YMVGPRMADDLQTTN--LTKLCTFSLGAIAQL----------------------TTNHLTRTSDNFLSYHSCAGNEKIWIAGEILAPIAGKGHISPFNGL

Query:  ILQNMLHVPKISYNLLSMSKITRDPNCRVAFSSDDVIFQVLSSRKMIGTTQHNRGLYFLDGDTSSRHSYRA-----------------------------
         LQN+LHVPK+SYNLLS+SKITR+ +C+  F  + V FQ +SS + IGT +H+RGLY LD DTS     R                              
Subjt:  ILQNMLHVPKISYNLLSMSKITRDPNCRVAFSSDDVIFQVLSSRKMIGTTQHNRGLYFLDGDTSSRHSYRA-----------------------------

Query:  RSIFCPLTNPLNLSS----------------------------LIHSDVWGPSKITTSSGKRWCVTFIDDHTRLTWVFLLIDKSKISSIFQEFYTTIETQ
        + +F  L + +++SS                            LIHSDVWGPSK+TTSSGKRW VTFIDDHTRLTWV+L+ DKS++ SIFQ FY TI+TQ
Subjt:  RSIFCPLTNPLNLSS----------------------------LIHSDVWGPSKITTSSGKRWCVTFIDDHTRLTWVFLLIDKSKISSIFQEFYTTIETQ

Query:  FNAKIVILRSDNGREFLTSTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLE----------------RDAVLTTAHLINRMSSCVLSFQTPLDCLK
        F+ KI ILRSDNGREF    L EFL++KGIVHQ+SCAYTPQQNGVAERKNRHL+E                 DA+LT AHLINRM S +L  QTPLDCLK
Subjt:  FNAKIVILRSDNGREFLTSTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLE----------------RDAVLTTAHLINRMSSCVLSFQTPLDCLK

Query:  LSYPTTRLIPDC--------------------------------------GYKCFHPNSRKYFISMDLTFLEDKPFFPVSPLQGESSNEETN--YSNPIP
         SYP+TRL+ +                                       GYKCFHP SRKYF++MD+TF E++P+FPVS LQGE+ +EE+N  +    P
Subjt:  LSYPTTRLIPDC--------------------------------------GYKCFHPNSRKYFISMDLTFLEDKPFFPVSPLQGESSNEETN--YSNPIP

Query:  KLI----LTTHDTVLPTRQ-----------KENNGPGPTDMLAPIQSSELTQAQG---TTDPNNNIVCVEYDIVDLTKLS--VENDRDDMTENNQFRRMK
         LI    +  H  +LPT Q           ++  G   +   AP+Q+ E  + QG    T P  N    E D  D+  L    E + DD TE      ++
Subjt:  KLI----LTTHDTVLPTRQ-----------KENNGPGPTDMLAPIQSSELTQAQG---TTDPNNNIVCVEYDIVDLTKLS--VENDRDDMTENNQFRRMK

Query:  MRSCLRIPQLKFFDKLGKYDATLNMPIALRNGTRSYTRYPMYSVLSYNNLSSKFRAFTASLDTITIPKNIYVAMKIPEWKAA------------------
        + +     +     KL +YD +L++PIALR GTRS T++P+ + +SY+NLS +FRAFTA+LD+  IPKNIY A++ PEWK A                  
Subjt:  MRSCLRIPQLKFFDKLGKYDATLNMPIALRNGTRSYTRYPMYSVLSYNNLSSKFRAFTASLDTITIPKNIYVAMKIPEWKAA------------------

Query:  --KGHKTVGCKWVFTMKYKSDRTLDRYKAKLVAKGFTQTFGVDYSETFSPVAKLTQSRFFCRL----------LDL----------------PPSS----
          KGHKTVGCKWVF++KYK+D TLDR+KA+LVAKGFTQT+G+DYSETFSPVAKL   R    +          LD+                PP      
Subjt:  --KGHKTVGCKWVFTMKYKSDRTLDRYKAKLVAKGFTQTFGVDYSETFSPVAKLTQSRFFCRL----------LDL----------------PPSS----

Query:  -------------STKASLR---------------------DTLIIPCLRKGPFLVRLFVLIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYI
                       K S R                      TL     + G    ++ +LIVYVDDIVL+GDD  EI++LKQ+M DEF IKDLGNLKY 
Subjt:  -------------STKASLR---------------------DTLIIPCLRKGPFLVRLFVLIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYI

Query:  LGMEVARSREGISISQQKYTIDLLKETGMTGCRPANTPIEFNAKLGESGDRVPVNKERYHCLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNR
        LGMEVARS+EGIS+SQ+KYT+DLL ETGM GCRPA+TPIEFN KLG S D+VPV+KE+Y  LVG+LIYL+HTR DI +AV++VSQF+ A Y++HMEAVNR
Subjt:  LGMEVARSREGISISQQKYTIDLLKETGMTGCRPANTPIEFNAKLGESGDRVPVNKERYHCLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNR

Query:  ILRYLKDTLGKG--------------------------KSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLLFNLHQYSDLQM
        ILRYLK+T GKG                          KSTSGYCTFVWGNLVTWRSKKQSVVA+SSAE EY+AMS GICEEIWL+K+L +LHQ  +  +
Subjt:  ILRYLKDTLGKG--------------------------KSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLLFNLHQYSDLQM

Query:  KFYCDNKAAISIAN-------------------------------------IANILTKGLLKQNFDSCVSKMGLTDIYVPS
        K +CDNKAAISIAN                                     IA++LTKGLL+ +FD CVSK+GL DIY+P+
Subjt:  KFYCDNKAAISIAN-------------------------------------IANILTKGLLKQNFDSCVSKMGLTDIYVPS

KAA0063471.1 Beta-galactosidase [Cucumis melo var. makuwa]0.0e+0046.32Show/hide
Query:  ELPLYSENS-TVFPTLTLE--SRPMENSTGLIVGEKLNGQNYFSWFQSVKMVLDGRNKFGYLTGEIPKPKPGDPQERVWKGEDSLLRSLLIHSMEPQIGK
        ELP+YS+N  T FP       +  + +STG   GEKLNGQNYFSW QS+KM L+GR +FG+LTGEI +P PGD  ER+WKGEDSL+RS+LI+SMEPQIGK
Subjt:  ELPLYSENS-TVFPTLTLE--SRPMENSTGLIVGEKLNGQNYFSWFQSVKMVLDGRNKFGYLTGEIPKPKPGDPQERVWKGEDSLLRSLLIHSMEPQIGK

Query:  PLLYATATRDIWDVVQKLYSKRQNASRLYTLRKQIHECKQG--------------------------------IQYYKLEEVDRVYDFLAGLNSKFDAVR
        PLLYAT  +D+WD  Q LYSKRQNASRLYTLRKQ+H CKQG                                 QY KLEE DRVYDFLAGLN KFD V 
Subjt:  PLLYATATRDIWDVVQKLYSKRQNASRLYTLRKQIHECKQG--------------------------------IQYYKLEEVDRVYDFLAGLNSKFDAVR

Query:  SRIL---------EIKINTDLDGKDRSSAMNITTTSVTDSAAFSAKSSSTTEDKINVGS-----------------------------------------
         RIL         E+     L+ +DR++AM + TT   DSAAFSA+SS+   DK N  S                                         
Subjt:  SRIL---------EIKINTDLDGKDRSSAMNITTTSVTDSAAFSAKSSSTTEDKINVGS-----------------------------------------

Query:  YMVGPRMADDLQTTN--LTKLCTFSLGAIAQL----------------------TTNHLTRTSDNFLSYHSCAGNEKIWIAGEILAPIAGKGHISPFNGL
        Y+     A   Q+T+  +++  T +LGAIAQ                        T+HLT +S++F+SY  CAGNEKI IA   LAPIAGKG I PF+G 
Subjt:  YMVGPRMADDLQTTN--LTKLCTFSLGAIAQL----------------------TTNHLTRTSDNFLSYHSCAGNEKIWIAGEILAPIAGKGHISPFNGL

Query:  ILQNMLHVPKISYNLLSMSKITRDPNCRVAFSSDDVIFQVLSSRKMIGTTQHNRGLYFLDGDTSSRHSYRA-----------------------------
         LQN+LHVPK+SYNLLS+SKITR+ +C+  F  + V FQ +SS + IGT +H+RGLY LD DTS     R                              
Subjt:  ILQNMLHVPKISYNLLSMSKITRDPNCRVAFSSDDVIFQVLSSRKMIGTTQHNRGLYFLDGDTSSRHSYRA-----------------------------

Query:  RSIFCPLTNPLNLSS----------------------------LIHSDVWGPSKITTSSGKRWCVTFIDDHTRLTWVFLLIDKSKISSIFQEFYTTIETQ
        + +F  L + +++SS                            LIHSDVWGPSK+TTSSGKRW VTFIDDHTRLTWV+L+ DKS++ SIFQ FY TI+TQ
Subjt:  RSIFCPLTNPLNLSS----------------------------LIHSDVWGPSKITTSSGKRWCVTFIDDHTRLTWVFLLIDKSKISSIFQEFYTTIETQ

Query:  FNAKIVILRSDNGREFLTSTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLE----------------RDAVLTTAHLINRMSSCVLSFQTPLDCLK
        F+ KI ILRSDNGREF    L EFL++KGIVHQ+SCAYTPQQNGVAERKNRHL+E                 DA+LT AHLINRM S +L  QTPLDCLK
Subjt:  FNAKIVILRSDNGREFLTSTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLE----------------RDAVLTTAHLINRMSSCVLSFQTPLDCLK

Query:  LSYPTTRLIPDC--------------------------------------GYKCFHPNSRKYFISMDLTFLEDKPFFPVSPLQGESSNEETN--YSNPIP
         SYP+TRL+ +                                       GYKCFHP SRKYF++MD+TF E++P+FPVS LQGE+ +EE+N  +    P
Subjt:  LSYPTTRLIPDC--------------------------------------GYKCFHPNSRKYFISMDLTFLEDKPFFPVSPLQGESSNEETN--YSNPIP

Query:  KLI----LTTHDTVLPTRQ-----------KENNGPGPTDMLAPIQSSELTQAQG---TTDPNNNIVCVEYDIVDLTKLS--VENDRDDMTENNQFRRMK
         LI    +  H  +LPT Q           ++  G   +   AP+Q+ E  + QG    T P  N    E D  D+  L    E + DD TE      ++
Subjt:  KLI----LTTHDTVLPTRQ-----------KENNGPGPTDMLAPIQSSELTQAQG---TTDPNNNIVCVEYDIVDLTKLS--VENDRDDMTENNQFRRMK

Query:  MRSCLRIPQLKFFDKLGKYDATLNMPIALRNGTRSYTRYPMYSVLSYNNLSSKFRAFTASLDTITIPKNIYVAMKIPEWKAA------------------
        + +     +     KL +YD +L++PIALR GTRS T++P+ + +SY+NLS +FRAFTA+LD+  IPKNIY A++ PEWK A                  
Subjt:  MRSCLRIPQLKFFDKLGKYDATLNMPIALRNGTRSYTRYPMYSVLSYNNLSSKFRAFTASLDTITIPKNIYVAMKIPEWKAA------------------

Query:  --KGHKTVGCKWVFTMKYKSDRTLDRYKAKLVAKGFTQTFGVDYSETFSPVAKLTQSRFFCRL----------LDL----------------PPSS----
          KGHKTVGCKWVF++KYK+D TLDR+KA+LVAKGFTQT+G+DYSETFSPVAKL   R    +          LD+                PP      
Subjt:  --KGHKTVGCKWVFTMKYKSDRTLDRYKAKLVAKGFTQTFGVDYSETFSPVAKLTQSRFFCRL----------LDL----------------PPSS----

Query:  -------------STKASLR---------------------DTLIIPCLRKGPFLVRLFVLIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYI
                       K S R                      TL     + G    ++ +LIVYVDDIVL+GDD  EI++LKQ+M DEF IKDLGNLKY 
Subjt:  -------------STKASLR---------------------DTLIIPCLRKGPFLVRLFVLIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYI

Query:  LGMEVARSREGISISQQKYTIDLLKETGMTGCRPANTPIEFNAKLGESGDRVPVNKERYHCLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNR
        LGMEVARS+EGIS+SQ+KYT+DLL ETGM GCRPA+TPIEFN KLG S D+VPV+KE+Y  LVG+LIYL+HTR DI +AV++VSQF+ A Y++HMEAVNR
Subjt:  LGMEVARSREGISISQQKYTIDLLKETGMTGCRPANTPIEFNAKLGESGDRVPVNKERYHCLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNR

Query:  ILRYLKDTLGKG--------------------------KSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLLFNLHQYSDLQM
        ILRYLK+T GKG                          KSTSGYCTFVWGNLVTWRSKKQSVVA+SSAE EY+AMS GICEEIWL+K+L +LHQ  +  +
Subjt:  ILRYLKDTLGKG--------------------------KSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLLFNLHQYSDLQM

Query:  KFYCDNKAAISIAN-------------------------------------IANILTKGLLKQNFDSCVSKMGLTDIYVPS
        K +CDNKAAISIAN                                     IA++LTKGLL+ +FD CVSK+GL DIY+P+
Subjt:  KFYCDNKAAISIAN-------------------------------------IANILTKGLLKQNFDSCVSKMGLTDIYVPS

TrEMBL top hitse value%identityAlignment
A0A5A7SM64 Beta-galactosidase0.0e+0046.32Show/hide
Query:  ELPLYSENS-TVFPTLTLE--SRPMENSTGLIVGEKLNGQNYFSWFQSVKMVLDGRNKFGYLTGEIPKPKPGDPQERVWKGEDSLLRSLLIHSMEPQIGK
        ELP+YS+N  T FP       +  + +STG   GEKLNGQNYFSW QS+KM L+GR +FG+LTGEI +P PGD  ER+WKGEDSL+RS+LI+SMEPQIGK
Subjt:  ELPLYSENS-TVFPTLTLE--SRPMENSTGLIVGEKLNGQNYFSWFQSVKMVLDGRNKFGYLTGEIPKPKPGDPQERVWKGEDSLLRSLLIHSMEPQIGK

Query:  PLLYATATRDIWDVVQKLYSKRQNASRLYTLRKQIHECKQG--------------------------------IQYYKLEEVDRVYDFLAGLNSKFDAVR
        PLLYAT  +D+WD  Q LYSKRQNASRLYTLRKQ+H CKQG                                 QY KLEE DRVYDFLAGLN KFD V 
Subjt:  PLLYATATRDIWDVVQKLYSKRQNASRLYTLRKQIHECKQG--------------------------------IQYYKLEEVDRVYDFLAGLNSKFDAVR

Query:  SRIL---------EIKINTDLDGKDRSSAMNITTTSVTDSAAFSAKSSSTTEDKINVGS-----------------------------------------
         RIL         E+     L+ +DR++AM + TT   DSAAFSA+SS+   DK N  S                                         
Subjt:  SRIL---------EIKINTDLDGKDRSSAMNITTTSVTDSAAFSAKSSSTTEDKINVGS-----------------------------------------

Query:  YMVGPRMADDLQTTN--LTKLCTFSLGAIAQL----------------------TTNHLTRTSDNFLSYHSCAGNEKIWIAGEILAPIAGKGHISPFNGL
        Y+     A   Q+T+  +++  T +LGAIAQ                        T+HLT +S++F+SY  CAGNEKI IA   LAPIAGKG I PF+G 
Subjt:  YMVGPRMADDLQTTN--LTKLCTFSLGAIAQL----------------------TTNHLTRTSDNFLSYHSCAGNEKIWIAGEILAPIAGKGHISPFNGL

Query:  ILQNMLHVPKISYNLLSMSKITRDPNCRVAFSSDDVIFQVLSSRKMIGTTQHNRGLYFLDGDTSSRHSYRA-----------------------------
         LQN+LHVPK+SYNLLS+SKITR+ +C+  F  + V FQ +SS + IGT +H+RGLY LD DTS     R                              
Subjt:  ILQNMLHVPKISYNLLSMSKITRDPNCRVAFSSDDVIFQVLSSRKMIGTTQHNRGLYFLDGDTSSRHSYRA-----------------------------

Query:  RSIFCPLTNPLNLSS----------------------------LIHSDVWGPSKITTSSGKRWCVTFIDDHTRLTWVFLLIDKSKISSIFQEFYTTIETQ
        + +F  L + +++SS                            LIHSDVWGPSK+TTSSGKRW VTFIDDHTRLTWV+L+ DKS++ SIFQ FY TI+TQ
Subjt:  RSIFCPLTNPLNLSS----------------------------LIHSDVWGPSKITTSSGKRWCVTFIDDHTRLTWVFLLIDKSKISSIFQEFYTTIETQ

Query:  FNAKIVILRSDNGREFLTSTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLE----------------RDAVLTTAHLINRMSSCVLSFQTPLDCLK
        F+ KI ILRSDNGREF    L EFL++KGIVHQ+SCAYTPQQNGVAERKNRHL+E                 DA+LT AHLINRM S +L  QTPLDCLK
Subjt:  FNAKIVILRSDNGREFLTSTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLE----------------RDAVLTTAHLINRMSSCVLSFQTPLDCLK

Query:  LSYPTTRLIPDC--------------------------------------GYKCFHPNSRKYFISMDLTFLEDKPFFPVSPLQGESSNEETN--YSNPIP
         SYP+TRL+ +                                       GYKCFHP SRKYF++MD+TF E++P+FPVS LQGE+ +EE+N  +    P
Subjt:  LSYPTTRLIPDC--------------------------------------GYKCFHPNSRKYFISMDLTFLEDKPFFPVSPLQGESSNEETN--YSNPIP

Query:  KLI----LTTHDTVLPTRQ-----------KENNGPGPTDMLAPIQSSELTQAQG---TTDPNNNIVCVEYDIVDLTKLS--VENDRDDMTENNQFRRMK
         LI    +  H  +LPT Q           ++  G   +   AP+Q+ E  + QG    T P  N    E D  D+  L    E + DD TE      ++
Subjt:  KLI----LTTHDTVLPTRQ-----------KENNGPGPTDMLAPIQSSELTQAQG---TTDPNNNIVCVEYDIVDLTKLS--VENDRDDMTENNQFRRMK

Query:  MRSCLRIPQLKFFDKLGKYDATLNMPIALRNGTRSYTRYPMYSVLSYNNLSSKFRAFTASLDTITIPKNIYVAMKIPEWKAA------------------
        + +     +     KL +YD +L++PIALR GTRS T++P+ + +SY+NLS +FRAFTA+LD+  IPKNIY A++ PEWK A                  
Subjt:  MRSCLRIPQLKFFDKLGKYDATLNMPIALRNGTRSYTRYPMYSVLSYNNLSSKFRAFTASLDTITIPKNIYVAMKIPEWKAA------------------

Query:  --KGHKTVGCKWVFTMKYKSDRTLDRYKAKLVAKGFTQTFGVDYSETFSPVAKLTQSRFFCRL----------LDL----------------PPSS----
          KGHKTVGCKWVF++KYK+D TLDR+KA+LVAKGFTQT+G+DYSETFSPVAKL   R    +          LD+                PP      
Subjt:  --KGHKTVGCKWVFTMKYKSDRTLDRYKAKLVAKGFTQTFGVDYSETFSPVAKLTQSRFFCRL----------LDL----------------PPSS----

Query:  -------------STKASLR---------------------DTLIIPCLRKGPFLVRLFVLIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYI
                       K S R                      TL     + G    ++ +LIVYVDDIVL+GDD  EI++LKQ+M DEF IKDLGNLKY 
Subjt:  -------------STKASLR---------------------DTLIIPCLRKGPFLVRLFVLIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYI

Query:  LGMEVARSREGISISQQKYTIDLLKETGMTGCRPANTPIEFNAKLGESGDRVPVNKERYHCLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNR
        LGMEVARS+EGIS+SQ+KYT+DLL ETGM GCRPA+TPIEFN KLG S D+VPV+KE+Y  LVG+LIYL+HTR DI +AV++VSQF+ A Y++HMEAVNR
Subjt:  LGMEVARSREGISISQQKYTIDLLKETGMTGCRPANTPIEFNAKLGESGDRVPVNKERYHCLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNR

Query:  ILRYLKDTLGKG--------------------------KSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLLFNLHQYSDLQM
        ILRYLK+T GKG                          KSTSGYCTFVWGNLVTWRSKKQSVVA+SSAE EY+AMS GICEEIWL+K+L +LHQ  +  +
Subjt:  ILRYLKDTLGKG--------------------------KSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLLFNLHQYSDLQM

Query:  KFYCDNKAAISIAN-------------------------------------IANILTKGLLKQNFDSCVSKMGLTDIYVPS
        K +CDNKAAISIAN                                     IA++LTKGLL+ +FD CVSK+GL DIY+P+
Subjt:  KFYCDNKAAISIAN-------------------------------------IANILTKGLLKQNFDSCVSKMGLTDIYVPS

A0A5A7SWY7 Beta-galactosidase0.0e+0046.32Show/hide
Query:  ELPLYSENS-TVFPTLTLE--SRPMENSTGLIVGEKLNGQNYFSWFQSVKMVLDGRNKFGYLTGEIPKPKPGDPQERVWKGEDSLLRSLLIHSMEPQIGK
        ELP+YS+N  T FP       +  + +STG   GEKLNGQNYFSW QS+KM L+GR +FG+LTGEI +P PGD  ER+WKGEDSL+RS+LI+SMEPQIGK
Subjt:  ELPLYSENS-TVFPTLTLE--SRPMENSTGLIVGEKLNGQNYFSWFQSVKMVLDGRNKFGYLTGEIPKPKPGDPQERVWKGEDSLLRSLLIHSMEPQIGK

Query:  PLLYATATRDIWDVVQKLYSKRQNASRLYTLRKQIHECKQG--------------------------------IQYYKLEEVDRVYDFLAGLNSKFDAVR
        PLLYAT  +D+WD  Q LYSKRQNASRLYTLRKQ+H CKQG                                 QY KLEE DRVYDFLAGLN KFD V 
Subjt:  PLLYATATRDIWDVVQKLYSKRQNASRLYTLRKQIHECKQG--------------------------------IQYYKLEEVDRVYDFLAGLNSKFDAVR

Query:  SRIL---------EIKINTDLDGKDRSSAMNITTTSVTDSAAFSAKSSSTTEDKINVGS-----------------------------------------
         RIL         E+     L+ +DR++AM + TT   DSAAFSA+SS+   DK N  S                                         
Subjt:  SRIL---------EIKINTDLDGKDRSSAMNITTTSVTDSAAFSAKSSSTTEDKINVGS-----------------------------------------

Query:  YMVGPRMADDLQTTN--LTKLCTFSLGAIAQL----------------------TTNHLTRTSDNFLSYHSCAGNEKIWIAGEILAPIAGKGHISPFNGL
        Y+     A   Q+T+  +++  T +LGAIAQ                        T+HLT +S++F+SY  CAGNEKI IA   LAPIAGKG I PF+G 
Subjt:  YMVGPRMADDLQTTN--LTKLCTFSLGAIAQL----------------------TTNHLTRTSDNFLSYHSCAGNEKIWIAGEILAPIAGKGHISPFNGL

Query:  ILQNMLHVPKISYNLLSMSKITRDPNCRVAFSSDDVIFQVLSSRKMIGTTQHNRGLYFLDGDTSSRHSYRA-----------------------------
         LQN+LHVPK+SYNLLS+SKITR+ +C+  F  + V FQ +SS + IGT +H+RGLY LD DTS     R                              
Subjt:  ILQNMLHVPKISYNLLSMSKITRDPNCRVAFSSDDVIFQVLSSRKMIGTTQHNRGLYFLDGDTSSRHSYRA-----------------------------

Query:  RSIFCPLTNPLNLSS----------------------------LIHSDVWGPSKITTSSGKRWCVTFIDDHTRLTWVFLLIDKSKISSIFQEFYTTIETQ
        + +F  L + +++SS                            LIHSDVWGPSK+TTSSGKRW VTFIDDHTRLTWV+L+ DKS++ SIFQ FY TI+TQ
Subjt:  RSIFCPLTNPLNLSS----------------------------LIHSDVWGPSKITTSSGKRWCVTFIDDHTRLTWVFLLIDKSKISSIFQEFYTTIETQ

Query:  FNAKIVILRSDNGREFLTSTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLE----------------RDAVLTTAHLINRMSSCVLSFQTPLDCLK
        F+ KI ILRSDNGREF    L EFL++KGIVHQ+SCAYTPQQNGVAERKNRHL+E                 DA+LT AHLINRM S +L  QTPLDCLK
Subjt:  FNAKIVILRSDNGREFLTSTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLE----------------RDAVLTTAHLINRMSSCVLSFQTPLDCLK

Query:  LSYPTTRLIPDC--------------------------------------GYKCFHPNSRKYFISMDLTFLEDKPFFPVSPLQGESSNEETN--YSNPIP
         SYP+TRL+ +                                       GYKCFHP SRKYF++MD+TF E++P+FPVS LQGE+ +EE+N  +    P
Subjt:  LSYPTTRLIPDC--------------------------------------GYKCFHPNSRKYFISMDLTFLEDKPFFPVSPLQGESSNEETN--YSNPIP

Query:  KLI----LTTHDTVLPTRQ-----------KENNGPGPTDMLAPIQSSELTQAQG---TTDPNNNIVCVEYDIVDLTKLS--VENDRDDMTENNQFRRMK
         LI    +  H  +LPT Q           ++  G   +   AP+Q+ E  + QG    T P  N    E D  D+  L    E + DD TE      ++
Subjt:  KLI----LTTHDTVLPTRQ-----------KENNGPGPTDMLAPIQSSELTQAQG---TTDPNNNIVCVEYDIVDLTKLS--VENDRDDMTENNQFRRMK

Query:  MRSCLRIPQLKFFDKLGKYDATLNMPIALRNGTRSYTRYPMYSVLSYNNLSSKFRAFTASLDTITIPKNIYVAMKIPEWKAA------------------
        + +     +     KL +YD +L++PIALR GTRS T++P+ + +SY+NLS +FRAFTA+LD+  IPKNIY A++ PEWK A                  
Subjt:  MRSCLRIPQLKFFDKLGKYDATLNMPIALRNGTRSYTRYPMYSVLSYNNLSSKFRAFTASLDTITIPKNIYVAMKIPEWKAA------------------

Query:  --KGHKTVGCKWVFTMKYKSDRTLDRYKAKLVAKGFTQTFGVDYSETFSPVAKLTQSRFFCRL----------LDL----------------PPSS----
          KGHKTVGCKWVF++KYK+D TLDR+KA+LVAKGFTQT+G+DYSETFSPVAKL   R    +          LD+                PP      
Subjt:  --KGHKTVGCKWVFTMKYKSDRTLDRYKAKLVAKGFTQTFGVDYSETFSPVAKLTQSRFFCRL----------LDL----------------PPSS----

Query:  -------------STKASLR---------------------DTLIIPCLRKGPFLVRLFVLIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYI
                       K S R                      TL     + G    ++ +LIVYVDDIVL+GDD  EI++LKQ+M DEF IKDLGNLKY 
Subjt:  -------------STKASLR---------------------DTLIIPCLRKGPFLVRLFVLIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYI

Query:  LGMEVARSREGISISQQKYTIDLLKETGMTGCRPANTPIEFNAKLGESGDRVPVNKERYHCLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNR
        LGMEVARS+EGIS+SQ+KYT+DLL ETGM GCRPA+TPIEFN KLG S D+VPV+KE+Y  LVG+LIYL+HTR DI +AV++VSQF+ A Y++HMEAVNR
Subjt:  LGMEVARSREGISISQQKYTIDLLKETGMTGCRPANTPIEFNAKLGESGDRVPVNKERYHCLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNR

Query:  ILRYLKDTLGKG--------------------------KSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLLFNLHQYSDLQM
        ILRYLK+T GKG                          KSTSGYCTFVWGNLVTWRSKKQSVVA+SSAE EY+AMS GICEEIWL+K+L +LHQ  +  +
Subjt:  ILRYLKDTLGKG--------------------------KSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLLFNLHQYSDLQM

Query:  KFYCDNKAAISIAN-------------------------------------IANILTKGLLKQNFDSCVSKMGLTDIYVPS
        K +CDNKAAISIAN                                     IA++LTKGLL+ +FD CVSK+GL DIY+P+
Subjt:  KFYCDNKAAISIAN-------------------------------------IANILTKGLLKQNFDSCVSKMGLTDIYVPS

A0A5A7UGB2 Beta-galactosidase0.0e+0046.32Show/hide
Query:  ELPLYSENS-TVFPTLTLE--SRPMENSTGLIVGEKLNGQNYFSWFQSVKMVLDGRNKFGYLTGEIPKPKPGDPQERVWKGEDSLLRSLLIHSMEPQIGK
        ELP+YS+N  T FP       +  + +STG   GEKLNGQNYFSW QS+KM L+GR +FG+LTGEI +P PGD  ER+WKGEDSL+RS+LI+SMEPQIGK
Subjt:  ELPLYSENS-TVFPTLTLE--SRPMENSTGLIVGEKLNGQNYFSWFQSVKMVLDGRNKFGYLTGEIPKPKPGDPQERVWKGEDSLLRSLLIHSMEPQIGK

Query:  PLLYATATRDIWDVVQKLYSKRQNASRLYTLRKQIHECKQG--------------------------------IQYYKLEEVDRVYDFLAGLNSKFDAVR
        PLLYAT  +D+WD  Q LYSKRQNASRLYTLRKQ+H CKQG                                 QY KLEE DRVYDFLAGLN KFD V 
Subjt:  PLLYATATRDIWDVVQKLYSKRQNASRLYTLRKQIHECKQG--------------------------------IQYYKLEEVDRVYDFLAGLNSKFDAVR

Query:  SRIL---------EIKINTDLDGKDRSSAMNITTTSVTDSAAFSAKSSSTTEDKINVGS-----------------------------------------
         RIL         E+     L+ +DR++AM + TT   DSAAFSA+SS+   DK N  S                                         
Subjt:  SRIL---------EIKINTDLDGKDRSSAMNITTTSVTDSAAFSAKSSSTTEDKINVGS-----------------------------------------

Query:  YMVGPRMADDLQTTN--LTKLCTFSLGAIAQL----------------------TTNHLTRTSDNFLSYHSCAGNEKIWIAGEILAPIAGKGHISPFNGL
        Y+     A   Q+T+  +++  T +LGAIAQ                        T+HLT +S++F+SY  CAGNEKI IA   LAPIAGKG I PF+G 
Subjt:  YMVGPRMADDLQTTN--LTKLCTFSLGAIAQL----------------------TTNHLTRTSDNFLSYHSCAGNEKIWIAGEILAPIAGKGHISPFNGL

Query:  ILQNMLHVPKISYNLLSMSKITRDPNCRVAFSSDDVIFQVLSSRKMIGTTQHNRGLYFLDGDTSSRHSYRA-----------------------------
         LQN+LHVPK+SYNLLS+SKITR+ +C+  F  + V FQ +SS + IGT +H+RGLY LD DTS     R                              
Subjt:  ILQNMLHVPKISYNLLSMSKITRDPNCRVAFSSDDVIFQVLSSRKMIGTTQHNRGLYFLDGDTSSRHSYRA-----------------------------

Query:  RSIFCPLTNPLNLSS----------------------------LIHSDVWGPSKITTSSGKRWCVTFIDDHTRLTWVFLLIDKSKISSIFQEFYTTIETQ
        + +F  L + +++SS                            LIHSDVWGPSK+TTSSGKRW VTFIDDHTRLTWV+L+ DKS++ SIFQ FY TI+TQ
Subjt:  RSIFCPLTNPLNLSS----------------------------LIHSDVWGPSKITTSSGKRWCVTFIDDHTRLTWVFLLIDKSKISSIFQEFYTTIETQ

Query:  FNAKIVILRSDNGREFLTSTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLE----------------RDAVLTTAHLINRMSSCVLSFQTPLDCLK
        F+ KI ILRSDNGREF    L EFL++KGIVHQ+SCAYTPQQNGVAERKNRHL+E                 DA+LT AHLINRM S +L  QTPLDCLK
Subjt:  FNAKIVILRSDNGREFLTSTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLE----------------RDAVLTTAHLINRMSSCVLSFQTPLDCLK

Query:  LSYPTTRLIPDC--------------------------------------GYKCFHPNSRKYFISMDLTFLEDKPFFPVSPLQGESSNEETN--YSNPIP
         SYP+TRL+ +                                       GYKCFHP SRKYF++MD+TF E++P+FPVS LQGE+ +EE+N  +    P
Subjt:  LSYPTTRLIPDC--------------------------------------GYKCFHPNSRKYFISMDLTFLEDKPFFPVSPLQGESSNEETN--YSNPIP

Query:  KLI----LTTHDTVLPTRQ-----------KENNGPGPTDMLAPIQSSELTQAQG---TTDPNNNIVCVEYDIVDLTKLS--VENDRDDMTENNQFRRMK
         LI    +  H  +LPT Q           ++  G   +   AP+Q+ E  + QG    T P  N    E D  D+  L    E + DD TE      ++
Subjt:  KLI----LTTHDTVLPTRQ-----------KENNGPGPTDMLAPIQSSELTQAQG---TTDPNNNIVCVEYDIVDLTKLS--VENDRDDMTENNQFRRMK

Query:  MRSCLRIPQLKFFDKLGKYDATLNMPIALRNGTRSYTRYPMYSVLSYNNLSSKFRAFTASLDTITIPKNIYVAMKIPEWKAA------------------
        + +     +     KL +YD +L++PIALR GTRS T++P+ + +SY+NLS +FRAFTA+LD+  IPKNIY A++ PEWK A                  
Subjt:  MRSCLRIPQLKFFDKLGKYDATLNMPIALRNGTRSYTRYPMYSVLSYNNLSSKFRAFTASLDTITIPKNIYVAMKIPEWKAA------------------

Query:  --KGHKTVGCKWVFTMKYKSDRTLDRYKAKLVAKGFTQTFGVDYSETFSPVAKLTQSRFFCRL----------LDL----------------PPSS----
          KGHKTVGCKWVF++KYK+D TLDR+KA+LVAKGFTQT+G+DYSETFSPVAKL   R    +          LD+                PP      
Subjt:  --KGHKTVGCKWVFTMKYKSDRTLDRYKAKLVAKGFTQTFGVDYSETFSPVAKLTQSRFFCRL----------LDL----------------PPSS----

Query:  -------------STKASLR---------------------DTLIIPCLRKGPFLVRLFVLIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYI
                       K S R                      TL     + G    ++ +LIVYVDDIVL+GDD  EI++LKQ+M DEF IKDLGNLKY 
Subjt:  -------------STKASLR---------------------DTLIIPCLRKGPFLVRLFVLIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYI

Query:  LGMEVARSREGISISQQKYTIDLLKETGMTGCRPANTPIEFNAKLGESGDRVPVNKERYHCLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNR
        LGMEVARS+EGIS+SQ+KYT+DLL ETGM GCRPA+TPIEFN KLG S D+VPV+KE+Y  LVG+LIYL+HTR DI +AV++VSQF+ A Y++HMEAVNR
Subjt:  LGMEVARSREGISISQQKYTIDLLKETGMTGCRPANTPIEFNAKLGESGDRVPVNKERYHCLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNR

Query:  ILRYLKDTLGKG--------------------------KSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLLFNLHQYSDLQM
        ILRYLK+T GKG                          KSTSGYCTFVWGNLVTWRSKKQSVVA+SSAE EY+AMS GICEEIWL+K+L +LHQ  +  +
Subjt:  ILRYLKDTLGKG--------------------------KSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLLFNLHQYSDLQM

Query:  KFYCDNKAAISIAN-------------------------------------IANILTKGLLKQNFDSCVSKMGLTDIYVPS
        K +CDNKAAISIAN                                     IA++LTKGLL+ +FD CVSK+GL DIY+P+
Subjt:  KFYCDNKAAISIAN-------------------------------------IANILTKGLLKQNFDSCVSKMGLTDIYVPS

A0A5A7UQ67 Beta-galactosidase0.0e+0046.32Show/hide
Query:  ELPLYSENS-TVFPTLTLE--SRPMENSTGLIVGEKLNGQNYFSWFQSVKMVLDGRNKFGYLTGEIPKPKPGDPQERVWKGEDSLLRSLLIHSMEPQIGK
        ELP+YS+N  T FP       +  + +STG   GEKLNGQNYFSW QS+KM L+GR +FG+LTGEI +P PGD  ER+WKGEDSL+RS+LI+SMEPQIGK
Subjt:  ELPLYSENS-TVFPTLTLE--SRPMENSTGLIVGEKLNGQNYFSWFQSVKMVLDGRNKFGYLTGEIPKPKPGDPQERVWKGEDSLLRSLLIHSMEPQIGK

Query:  PLLYATATRDIWDVVQKLYSKRQNASRLYTLRKQIHECKQG--------------------------------IQYYKLEEVDRVYDFLAGLNSKFDAVR
        PLLYAT  +D+WD  Q LYSKRQNASRLYTLRKQ+H CKQG                                 QY KLEE DRVYDFLAGLN KFD V 
Subjt:  PLLYATATRDIWDVVQKLYSKRQNASRLYTLRKQIHECKQG--------------------------------IQYYKLEEVDRVYDFLAGLNSKFDAVR

Query:  SRIL---------EIKINTDLDGKDRSSAMNITTTSVTDSAAFSAKSSSTTEDKINVGS-----------------------------------------
         RIL         E+     L+ +DR++AM + TT   DSAAFSA+SS+   DK N  S                                         
Subjt:  SRIL---------EIKINTDLDGKDRSSAMNITTTSVTDSAAFSAKSSSTTEDKINVGS-----------------------------------------

Query:  YMVGPRMADDLQTTN--LTKLCTFSLGAIAQL----------------------TTNHLTRTSDNFLSYHSCAGNEKIWIAGEILAPIAGKGHISPFNGL
        Y+     A   Q+T+  +++  T +LGAIAQ                        T+HLT +S++F+SY  CAGNEKI IA   LAPIAGKG I PF+G 
Subjt:  YMVGPRMADDLQTTN--LTKLCTFSLGAIAQL----------------------TTNHLTRTSDNFLSYHSCAGNEKIWIAGEILAPIAGKGHISPFNGL

Query:  ILQNMLHVPKISYNLLSMSKITRDPNCRVAFSSDDVIFQVLSSRKMIGTTQHNRGLYFLDGDTSSRHSYRA-----------------------------
         LQN+LHVPK+SYNLLS+SKITR+ +C+  F  + V FQ +SS + IGT +H+RGLY LD DTS     R                              
Subjt:  ILQNMLHVPKISYNLLSMSKITRDPNCRVAFSSDDVIFQVLSSRKMIGTTQHNRGLYFLDGDTSSRHSYRA-----------------------------

Query:  RSIFCPLTNPLNLSS----------------------------LIHSDVWGPSKITTSSGKRWCVTFIDDHTRLTWVFLLIDKSKISSIFQEFYTTIETQ
        + +F  L + +++SS                            LIHSDVWGPSK+TTSSGKRW VTFIDDHTRLTWV+L+ DKS++ SIFQ FY TI+TQ
Subjt:  RSIFCPLTNPLNLSS----------------------------LIHSDVWGPSKITTSSGKRWCVTFIDDHTRLTWVFLLIDKSKISSIFQEFYTTIETQ

Query:  FNAKIVILRSDNGREFLTSTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLE----------------RDAVLTTAHLINRMSSCVLSFQTPLDCLK
        F+ KI ILRSDNGREF    L EFL++KGIVHQ+SCAYTPQQNGVAERKNRHL+E                 DA+LT AHLINRM S +L  QTPLDCLK
Subjt:  FNAKIVILRSDNGREFLTSTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLE----------------RDAVLTTAHLINRMSSCVLSFQTPLDCLK

Query:  LSYPTTRLIPDC--------------------------------------GYKCFHPNSRKYFISMDLTFLEDKPFFPVSPLQGESSNEETN--YSNPIP
         SYP+TRL+ +                                       GYKCFHP SRKYF++MD+TF E++P+FPVS LQGE+ +EE+N  +    P
Subjt:  LSYPTTRLIPDC--------------------------------------GYKCFHPNSRKYFISMDLTFLEDKPFFPVSPLQGESSNEETN--YSNPIP

Query:  KLI----LTTHDTVLPTRQ-----------KENNGPGPTDMLAPIQSSELTQAQG---TTDPNNNIVCVEYDIVDLTKLS--VENDRDDMTENNQFRRMK
         LI    +  H  +LPT Q           ++  G   +   AP+Q+ E  + QG    T P  N    E D  D+  L    E + DD TE      ++
Subjt:  KLI----LTTHDTVLPTRQ-----------KENNGPGPTDMLAPIQSSELTQAQG---TTDPNNNIVCVEYDIVDLTKLS--VENDRDDMTENNQFRRMK

Query:  MRSCLRIPQLKFFDKLGKYDATLNMPIALRNGTRSYTRYPMYSVLSYNNLSSKFRAFTASLDTITIPKNIYVAMKIPEWKAA------------------
        + +     +     KL +YD +L++PIALR GTRS T++P+ + +SY+NLS +FRAFTA+LD+  IPKNIY A++ PEWK A                  
Subjt:  MRSCLRIPQLKFFDKLGKYDATLNMPIALRNGTRSYTRYPMYSVLSYNNLSSKFRAFTASLDTITIPKNIYVAMKIPEWKAA------------------

Query:  --KGHKTVGCKWVFTMKYKSDRTLDRYKAKLVAKGFTQTFGVDYSETFSPVAKLTQSRFFCRL----------LDL----------------PPSS----
          KGHKTVGCKWVF++KYK+D TLDR+KA+LVAKGFTQT+G+DYSETFSPVAKL   R    +          LD+                PP      
Subjt:  --KGHKTVGCKWVFTMKYKSDRTLDRYKAKLVAKGFTQTFGVDYSETFSPVAKLTQSRFFCRL----------LDL----------------PPSS----

Query:  -------------STKASLR---------------------DTLIIPCLRKGPFLVRLFVLIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYI
                       K S R                      TL     + G    ++ +LIVYVDDIVL+GDD  EI++LKQ+M DEF IKDLGNLKY 
Subjt:  -------------STKASLR---------------------DTLIIPCLRKGPFLVRLFVLIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYI

Query:  LGMEVARSREGISISQQKYTIDLLKETGMTGCRPANTPIEFNAKLGESGDRVPVNKERYHCLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNR
        LGMEVARS+EGIS+SQ+KYT+DLL ETGM GCRPA+TPIEFN KLG S D+VPV+KE+Y  LVG+LIYL+HTR DI +AV++VSQF+ A Y++HMEAVNR
Subjt:  LGMEVARSREGISISQQKYTIDLLKETGMTGCRPANTPIEFNAKLGESGDRVPVNKERYHCLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNR

Query:  ILRYLKDTLGKG--------------------------KSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLLFNLHQYSDLQM
        ILRYLK+T GKG                          KSTSGYCTFVWGNLVTWRSKKQSVVA+SSAE EY+AMS GICEEIWL+K+L +LHQ  +  +
Subjt:  ILRYLKDTLGKG--------------------------KSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLLFNLHQYSDLQM

Query:  KFYCDNKAAISIAN-------------------------------------IANILTKGLLKQNFDSCVSKMGLTDIYVPS
        K +CDNKAAISIAN                                     IA++LTKGLL+ +FD CVSK+GL DIY+P+
Subjt:  KFYCDNKAAISIAN-------------------------------------IANILTKGLLKQNFDSCVSKMGLTDIYVPS

A0A5A7VCN2 Beta-galactosidase0.0e+0046.32Show/hide
Query:  ELPLYSENS-TVFPTLTLE--SRPMENSTGLIVGEKLNGQNYFSWFQSVKMVLDGRNKFGYLTGEIPKPKPGDPQERVWKGEDSLLRSLLIHSMEPQIGK
        ELP+YS+N  T FP       +  + +STG   GEKLNGQNYFSW QS+KM L+GR +FG+LTGEI +P PGD  ER+WKGEDSL+RS+LI+SMEPQIGK
Subjt:  ELPLYSENS-TVFPTLTLE--SRPMENSTGLIVGEKLNGQNYFSWFQSVKMVLDGRNKFGYLTGEIPKPKPGDPQERVWKGEDSLLRSLLIHSMEPQIGK

Query:  PLLYATATRDIWDVVQKLYSKRQNASRLYTLRKQIHECKQG--------------------------------IQYYKLEEVDRVYDFLAGLNSKFDAVR
        PLLYAT  +D+WD  Q LYSKRQNASRLYTLRKQ+H CKQG                                 QY KLEE DRVYDFLAGLN KFD V 
Subjt:  PLLYATATRDIWDVVQKLYSKRQNASRLYTLRKQIHECKQG--------------------------------IQYYKLEEVDRVYDFLAGLNSKFDAVR

Query:  SRIL---------EIKINTDLDGKDRSSAMNITTTSVTDSAAFSAKSSSTTEDKINVGS-----------------------------------------
         RIL         E+     L+ +DR++AM + TT   DSAAFSA+SS+   DK N  S                                         
Subjt:  SRIL---------EIKINTDLDGKDRSSAMNITTTSVTDSAAFSAKSSSTTEDKINVGS-----------------------------------------

Query:  YMVGPRMADDLQTTN--LTKLCTFSLGAIAQL----------------------TTNHLTRTSDNFLSYHSCAGNEKIWIAGEILAPIAGKGHISPFNGL
        Y+     A   Q+T+  +++  T +LGAIAQ                        T+HLT +S++F+SY  CAGNEKI IA   LAPIAGKG I PF+G 
Subjt:  YMVGPRMADDLQTTN--LTKLCTFSLGAIAQL----------------------TTNHLTRTSDNFLSYHSCAGNEKIWIAGEILAPIAGKGHISPFNGL

Query:  ILQNMLHVPKISYNLLSMSKITRDPNCRVAFSSDDVIFQVLSSRKMIGTTQHNRGLYFLDGDTSSRHSYRA-----------------------------
         LQN+LHVPK+SYNLLS+SKITR+ +C+  F  + V FQ +SS + IGT +H+RGLY LD DTS     R                              
Subjt:  ILQNMLHVPKISYNLLSMSKITRDPNCRVAFSSDDVIFQVLSSRKMIGTTQHNRGLYFLDGDTSSRHSYRA-----------------------------

Query:  RSIFCPLTNPLNLSS----------------------------LIHSDVWGPSKITTSSGKRWCVTFIDDHTRLTWVFLLIDKSKISSIFQEFYTTIETQ
        + +F  L + +++SS                            LIHSDVWGPSK+TTSSGKRW VTFIDDHTRLTWV+L+ DKS++ SIFQ FY TI+TQ
Subjt:  RSIFCPLTNPLNLSS----------------------------LIHSDVWGPSKITTSSGKRWCVTFIDDHTRLTWVFLLIDKSKISSIFQEFYTTIETQ

Query:  FNAKIVILRSDNGREFLTSTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLE----------------RDAVLTTAHLINRMSSCVLSFQTPLDCLK
        F+ KI ILRSDNGREF    L EFL++KGIVHQ+SCAYTPQQNGVAERKNRHL+E                 DA+LT AHLINRM S +L  QTPLDCLK
Subjt:  FNAKIVILRSDNGREFLTSTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLE----------------RDAVLTTAHLINRMSSCVLSFQTPLDCLK

Query:  LSYPTTRLIPDC--------------------------------------GYKCFHPNSRKYFISMDLTFLEDKPFFPVSPLQGESSNEETN--YSNPIP
         SYP+TRL+ +                                       GYKCFHP SRKYF++MD+TF E++P+FPVS LQGE+ +EE+N  +    P
Subjt:  LSYPTTRLIPDC--------------------------------------GYKCFHPNSRKYFISMDLTFLEDKPFFPVSPLQGESSNEETN--YSNPIP

Query:  KLI----LTTHDTVLPTRQ-----------KENNGPGPTDMLAPIQSSELTQAQG---TTDPNNNIVCVEYDIVDLTKLS--VENDRDDMTENNQFRRMK
         LI    +  H  +LPT Q           ++  G   +   AP+Q+ E  + QG    T P  N    E D  D+  L    E + DD TE      ++
Subjt:  KLI----LTTHDTVLPTRQ-----------KENNGPGPTDMLAPIQSSELTQAQG---TTDPNNNIVCVEYDIVDLTKLS--VENDRDDMTENNQFRRMK

Query:  MRSCLRIPQLKFFDKLGKYDATLNMPIALRNGTRSYTRYPMYSVLSYNNLSSKFRAFTASLDTITIPKNIYVAMKIPEWKAA------------------
        + +     +     KL +YD +L++PIALR GTRS T++P+ + +SY+NLS +FRAFTA+LD+  IPKNIY A++ PEWK A                  
Subjt:  MRSCLRIPQLKFFDKLGKYDATLNMPIALRNGTRSYTRYPMYSVLSYNNLSSKFRAFTASLDTITIPKNIYVAMKIPEWKAA------------------

Query:  --KGHKTVGCKWVFTMKYKSDRTLDRYKAKLVAKGFTQTFGVDYSETFSPVAKLTQSRFFCRL----------LDL----------------PPSS----
          KGHKTVGCKWVF++KYK+D TLDR+KA+LVAKGFTQT+G+DYSETFSPVAKL   R    +          LD+                PP      
Subjt:  --KGHKTVGCKWVFTMKYKSDRTLDRYKAKLVAKGFTQTFGVDYSETFSPVAKLTQSRFFCRL----------LDL----------------PPSS----

Query:  -------------STKASLR---------------------DTLIIPCLRKGPFLVRLFVLIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYI
                       K S R                      TL     + G    ++ +LIVYVDDIVL+GDD  EI++LKQ+M DEF IKDLGNLKY 
Subjt:  -------------STKASLR---------------------DTLIIPCLRKGPFLVRLFVLIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYI

Query:  LGMEVARSREGISISQQKYTIDLLKETGMTGCRPANTPIEFNAKLGESGDRVPVNKERYHCLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNR
        LGMEVARS+EGIS+SQ+KYT+DLL ETGM GCRPA+TPIEFN KLG S D+VPV+KE+Y  LVG+LIYL+HTR DI +AV++VSQF+ A Y++HMEAVNR
Subjt:  LGMEVARSREGISISQQKYTIDLLKETGMTGCRPANTPIEFNAKLGESGDRVPVNKERYHCLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNR

Query:  ILRYLKDTLGKG--------------------------KSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLLFNLHQYSDLQM
        ILRYLK+T GKG                          KSTSGYCTFVWGNLVTWRSKKQSVVA+SSAE EY+AMS GICEEIWL+K+L +LHQ  +  +
Subjt:  ILRYLKDTLGKG--------------------------KSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLLFNLHQYSDLQM

Query:  KFYCDNKAAISIAN-------------------------------------IANILTKGLLKQNFDSCVSKMGLTDIYVPS
        K +CDNKAAISIAN                                     IA++LTKGLL+ +FD CVSK+GL DIY+P+
Subjt:  KFYCDNKAAISIAN-------------------------------------IANILTKGLLKQNFDSCVSKMGLTDIYVPS

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.6e-5625.54Show/hide
Query:  LIHSDVWGPSKITTSSGKRWCVTFIDDHTRLTWVFLLIDKSKISSIFQEFYTTIETQFNAKIVILRSDNGREFLTSTLREFLSTKGIVHQSSCAYTPQQN
        ++HSDV GP    T   K + V F+D  T     +L+  KS + S+FQ+F    E  FN K+V L  DNGRE+L++ +R+F   KGI +  +  +TPQ N
Subjt:  LIHSDVWGPSKITTSSGKRWCVTFIDDHTRLTWVFLLIDKSKISSIFQEFYTTIETQFNAKIVILRSDNGREFLTSTLREFLSTKGIVHQSSCAYTPQQN

Query:  GVAERKNRHLLER----------------DAVLTTAHLINRMSSCVL--SFQTPLDCLKLSYPTTRLI------------------------------PD
        GV+ER  R + E+                +AVLT  +LINR+ S  L  S +TP +      P  + +                                
Subjt:  GVAERKNRHLLER----------------DAVLTTAHLINRMSSCVL--SFQTPLDCLKLSYPTTRLI------------------------------PD

Query:  CGYKCFHPNSRKYFISMDLTFLEDKPF------FPVSPLQGESSNEETNYSNPIPKLILTTHDTVLPTRQKENNGPGPTDMLAPIQSSELTQAQGTTDPN
         G+K +   + K+ ++ D+   E          F    L+    +E  N+ N   K+I     T  P   KE       D +  ++ S+ ++ +   + +
Subjt:  CGYKCFHPNSRKYFISMDLTFLEDKPF------FPVSPLQGESSNEETNYSNPIPKLILTTHDTVLPTRQKENNGPGPTDMLAPIQSSELTQAQGTTDPN

Query:  NNIVCVEY-------DIVDLTKLSVENDRDDMTENNQFRR-------------MKMRSCLRIPQLKFFDKLGKYDATLNMPIALRNGTRSYTRYPMYSVL
          I+  E+       D +   K S E+++  + E+ + +R              + R       LK   ++G  + T N  I + N  R   R      +
Subjt:  NNIVCVEY-------DIVDLTKLSVENDRDDMTENNQFRR-------------MKMRSCLRIPQLKFFDKLGKYDATLNMPIALRNGTRSYTRYPMYSVL

Query:  SYNNLSSKFRAFTASLDTI--TIPKN---IYVAMKIPEWKAA-----KGHK---------------TVGCKWVFTMKYKSDRTLDRYKAKLVAKGFTQTF
        SYN   +       +  TI   +P +   I        W+ A       HK                V  +WVF++KY       RYKA+LVA+GFTQ +
Subjt:  SYNNLSSKFRAFTASLDTI--TIPKN---IYVAMKIPEWKAA-----KGHK---------------TVGCKWVFTMKYKSDRTLDRYKAKLVAKGFTQTF

Query:  GVDYSETFSPVAKLTQSRFFCRL-----LDLPPSSSTKASLRDTL-------------------------------------------------------
         +DY ETF+PVA+++  RF   L     L +       A L  TL                                                       
Subjt:  GVDYSETFSPVAKLTQSRFFCRL-----LDLPPSSSTKASLRDTL-------------------------------------------------------

Query:  -IIPCLRKGPFLVRLFVLIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYILGMEVARSREGISISQQKYTIDLLKETGMTGCRPANTPI--EF
          I  L KG     ++VL+ YVDD+V++  D   +N  K+ ++++F + DL  +K+ +G+ +    + I +SQ  Y   +L +  M  C   +TP+  + 
Subjt:  -IIPCLRKGPFLVRLFVLIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYILGMEVARSREGISISQQKYTIDLLKETGMTGCRPANTPI--EF

Query:  NAKLGESGDRVPVNKERYHCLVGRLIY-LAHTRLDIYYAVNIVSQFIHALYKEHMEAVNRILRYLKDTLG----------------------------KG
        N +L  S +           L+G L+Y +  TR D+  AVNI+S++      E  + + R+LRYLK T+                               
Subjt:  NAKLGESGDRVPVNKERYHCLVGRLIY-LAHTRLDIYYAVNIVSQFIHALYKEHMEAVNRILRYLKDTLG----------------------------KG

Query:  KSTSGYCTFVWG-NLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLLFNLHQYSDLQMKFYCDNKAAISIAN
        KST+GY   ++  NL+ W +K+Q+ VA SS E EY A+   + E +WLK LL +++   +  +K Y DN+  ISIAN
Subjt:  KSTSGYCTFVWG-NLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLLFNLHQYSDLQMKFYCDNKAAISIAN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.9e-6928.82Show/hide
Query:  LNLSSLIHSDVWGPSKITTSSGKRWCVTFIDDHTRLTWVFLLIDKSKISSIFQEFYTTIETQFNAKIVILRSDNGREFLTSTLREFLSTKGIVHQSSCAY
        LN+  L++SDV GP +I +  G ++ VTFIDD +R  WV++L  K ++  +FQ+F+  +E +   K+  LRSDNG E+ +    E+ S+ GI H+ +   
Subjt:  LNLSSLIHSDVWGPSKITTSSGKRWCVTFIDDHTRLTWVFLLIDKSKISSIFQEFYTTIETQFNAKIVILRSDNGREFLTSTLREFLSTKGIVHQSSCAY

Query:  TPQQNGVAERKNRHLLER----------------DAVLTTAHLINRMSSCVLSFQTPLDCLKLSYPTTRLIPDCGYKCFHPNSRKYFISMDLTFLEDKPF
        TPQ NGVAER NR ++E+                +AV T  +LINR  S  L+F+ P         T + +     K F   +  +      T L+DK  
Subjt:  TPQQNGVAERKNRHLLER----------------DAVLTTAHLINRMSSCVLSFQTPLDCLKLSYPTTRLIPDCGYKCFHPNSRKYFISMDLTFLEDKPF

Query:  FPVSPLQGESSNEETNYS--NPIPKLILTTHD---------TVLPTRQKENNGPGPTDMLAPIQSSELTQAQGTTD------PNNNIVCVEYDIVDLTKL
          +    G+   EE  Y   +P+ K ++ + D         T     +K  NG  P  +  P  S+  T A+ TTD           V  + + +D    
Subjt:  FPVSPLQGESSNEETNYS--NPIPKLILTTHD---------TVLPTRQKENNGPGPTDMLAPIQSSELTQAQGTTD------PNNNIVCVEYDIVDLTKL

Query:  SVENDRDDMTENNQFRRMKMRSCLRIPQLKFFDKLGKYDATLNMPIALRNGTRSYTRYPMYSVLSYNNLSSKFRAFTASLDTITIPKNIYVAMKIPEWKA
         VE+      ++   RR +       P++    +  +Y +T  + I+      S     +  VLS+   +   +A    ++++      Y  +++P    
Subjt:  SVENDRDDMTENNQFRRMKMRSCLRIPQLKFFDKLGKYDATLNMPIALRNGTRSYTRYPMYSVLSYNNLSSKFRAFTASLDTITIPKNIYVAMKIPEWKA

Query:  AKGHKTVGCKWVFTMKYKSDRTLDRYKAKLVAKGFTQTFGVDYSETFSPVAKLTQSRFFCRL---LDLP-------------------------------
         KG + + CKWVF +K   D  L RYKA+LV KGF Q  G+D+ E FSPV K+T  R    L   LDL                                
Subjt:  AKGHKTVGCKWVFTMKYKSDRTLDRYKAKLVAKGFTQTFGVDYSETFSPVAKLTQSRFFCRL---LDLP-------------------------------

Query:  --------------------------PSSSTKASLRDTLIIPCLRKGPFLVRLF-VLIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYILGME
                                   S     +   T   PC+    F    F +L++YVDD+++ G D   I +LK  +   F +KDLG  + ILGM+
Subjt:  --------------------------PSSSTKASLRDTLIIPCLRKGPFLVRLF-VLIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYILGME

Query:  VARSREG--ISISQQKYTIDLLKETGMTGCRPANTPIEFNAKLGES------GDRVPVNKERYHCLVGRLIY-LAHTRLDIYYAVNIVSQFIHALYKEHM
        + R R    + +SQ+KY   +L+   M   +P +TP+  + KL +        ++  + K  Y   VG L+Y +  TR DI +AV +VS+F+    KEH 
Subjt:  VARSREG--ISISQQKYTIDLLKETGMTGCRPANTPIEFNAKLGES------GDRVPVNKERYHCLVGRLIY-LAHTRLDIYYAVNIVSQFIHALYKEHM

Query:  EAVNRILRYLKDTLG------------KG-------------KSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLL--FNLHQ
        EAV  ILRYL+ T G            KG             KS++GY     G  ++W+SK Q  VA S+ E EY A +    E IWLK+ L    LHQ
Subjt:  EAVNRILRYLKDTLG------------KG-------------KSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLL--FNLHQ

Query:  YSDLQMKFYCDNKAAISIA
            +   YCD+++AI ++
Subjt:  YSDLQMKFYCDNKAAISIA

P92519 Uncharacterized mitochondrial protein AtMg008105.4e-3033.48Show/hide
Query:  LIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYILGMEVARSREGISISQQKYTIDLLKETGMTGCRPANTPIEFNAKLGESGDRVPVNKERYH
        L++YVDDI+L+G     +N L  ++   F++KDLG + Y LG+++     G+ +SQ KY   +L   GM  C+P +TP+        S  + P +   + 
Subjt:  LIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYILGMEVARSREGISISQQKYTIDLLKETGMTGCRPANTPIEFNAKLGESGDRVPVNKERYH

Query:  CLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNRILRYLKDTLGKG--------------------------KSTSGYCTFVWGNLVTWRSKKQ
         +VG L YL  TR DI YAVNIV Q +H       + + R+LRY+K T+  G                          +ST+G+CTF+  N+++W +K+Q
Subjt:  CLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNRILRYLKDTLGKG--------------------------KSTSGYCTFVWGNLVTWRSKKQ

Query:  SVVAKSSAEVEYKAMSHGICEEIW
          V++SS E EY+A++    E  W
Subjt:  SVVAKSSAEVEYKAMSHGICEEIW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.0e-6526.74Show/hide
Query:  TNPLNLSSLIHSDVWGPSKITTSSGKRWCVTFIDDHTRLTWVFLLIDKSKISSIFQEFYTTIETQFNAKIVILRSDNGREFLTSTLREFLSTKGIVHQSS
        T PL     I+SDVW  S I +    R+ V F+D  TR TW++ L  KS++   F  F   +E +F  +I    SDNG EF+   L E+ S  GI H +S
Subjt:  TNPLNLSSLIHSDVWGPSKITTSSGKRWCVTFIDDHTRLTWVFLLIDKSKISSIFQEFYTTIETQFNAKIVILRSDNGREFLTSTLREFLSTKGIVHQSS

Query:  CAYTPQQNGVAERKNRHLLERDAVL----------------TTAHLINRMSSCVLSFQTPL-------------------------------------DC
          +TP+ NG++ERK+RH++E    L                   +LINR+ + +L  ++P                                       C
Subjt:  CAYTPQQNGVAERKNRHLLERDAVL----------------TTAHLINRMSSCVLSFQTPL-------------------------------------DC

Query:  LKLSYPTTRLIPDCGYKCFHPNSRKYFISMDLTFLEDKPFFP-------VSPLQGESSNEETNYSNPIPKLILTTHDTVLPT---RQKENNGPGPTDMLA
        + L Y  T+      Y C H  + + +IS  + F  D+  FP       +SP+Q +       +S   P   L T   VLP        +    P+   A
Subjt:  LKLSYPTTRLIPDCGYKCFHPNSRKYFISMDLTFLEDKPFFP-------VSPLQGESSNEETNYSNPIPKLILTTHDTVLPT---RQKENNGPGPTDMLA

Query:  PIQSSELTQA--------------QGTTDPNNNIVCVEYDIVDLTKLSVENDRDDMTENNQFRRMKMRSCLRIPQLKFFDKLGKYDATLNMPIALRNGTR
        P ++S+++ +              + T    N            T+     +       N+      +S     Q           A+ +          
Subjt:  PIQSSELTQA--------------QGTTDPNNNIVCVEYDIVDLTKLSVENDRDDMTENNQFRRMKMRSCLRIPQLKFFDKLGKYDATLNMPIALRNGTR

Query:  SYTRYPMYSVLSYNN--------LSSKFRA----------FTASLDTITIPKNIYVAMKIPEWKAAKG--------------------HKT-VGCKWVFT
         +   P+  +++ NN        + ++ +A             SL   + P+    A+K   W+ A G                    H T VGC+W+FT
Subjt:  SYTRYPMYSVLSYNN--------LSSKFRA----------FTASLDTITIPKNIYVAMKIPEWKAAKG--------------------HKT-VGCKWVFT

Query:  MKYKSDRTLDRYKAKLVAKGFTQTFGVDYSETFSPVAKLTQSRFF----------CRLLDL-----------------PPSSSTK---------------
         KY SD +L+RYKA+LVAKG+ Q  G+DY+ETFSPV K T  R             R LD+                 PP    K               
Subjt:  MKYKSDRTLDRYKAKLVAKGFTQTFGVDYSETFSPVAKLTQSRFF----------CRLLDL-----------------PPSSSTK---------------

Query:  ---------ASLRDTLIIPCLRKGPFLVRLFVL---------IVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYILGMEVARSREGISISQQKY
                   LR+ L+            LFVL         +VYVDDI+++G+D   ++     +   F++KD   L Y LG+E  R   G+ +SQ++Y
Subjt:  ---------ASLRDTLIIPCLRKGPFLVRLFVL---------IVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYILGMEVARSREGISISQQKY

Query:  TIDLLKETGMTGCRPANTPIEFNAKLGESGDRVPVNKERYHCLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNRILRYLKDT------LGKGK
         +DLL  T M   +P  TP+  + KL         +   Y  +VG L YLA TR DI YAVN +SQF+H   +EH++A+ RILRYL  T      L KG 
Subjt:  TIDLLKETGMTGCRPANTPIEFNAKLGESGDRVPVNKERYHCLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNRILRYLKDT------LGKGK

Query:  --------------------STSGYCTFVWGNLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLLFNLHQYSDLQMKFYCDNKAA
                            ST+GY  ++  + ++W SKKQ  V +SS E EY+++++   E  W+  LL  L          YCDN  A
Subjt:  --------------------STSGYCTFVWGNLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLLFNLHQYSDLQMKFYCDNKAA

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE28.8e-6526.99Show/hide
Query:  IHSDVWGPSKITTSSGKRWCVTFIDDHTRLTWVFLLIDKSKISSIFQEFYTTIETQFNAKIVILRSDNGREFLTSTLREFLSTKGIVHQSSCAYTPQQNG
        I+SDVW  S I +    R+ V F+D  TR TW++ L  KS++   F  F + +E +F  +I  L SDNG EF+   LR++LS  GI H +S  +TP+ NG
Subjt:  IHSDVWGPSKITTSSGKRWCVTFIDDHTRLTWVFLLIDKSKISSIFQEFYTTIETQFNAKIVILRSDNGREFLTSTLREFLSTKGIVHQSSCAYTPQQNG

Query:  VAERKNRHLLERDAVL----------------TTAHLINRMSSCVLSFQTPLDCL--------KLS------YPTTR-------------------LIPD
        ++ERK+RH++E    L                   +LINR+ + +L  Q+P   L        KL       YP  R                    +  
Subjt:  VAERKNRHLLERDAVL----------------TTAHLINRMSSCVLSFQTPLDCL--------KLS------YPTTR-------------------LIPD

Query:  CGYKCFHPNSRKYFISMDLTFLED-KPF----FPVSPLQGESSNEETNY------------------------------SNPIPKLILTTHDTVLPTRQ-
          Y C H  + + + S  + F E   PF    F VS  Q + S+   N+                              S+P P        + LP+   
Subjt:  CGYKCFHPNSRKYFISMDLTFLED-KPF----FPVSPLQGESSNEETNY------------------------------SNPIPKLILTTHDTVLPTRQ-

Query:  ----------KENNGPGPT---------DMLAPIQSSELTQAQGTTDPNNNIVCVEYDIVD--LTKLSVENDRDDMTENNQFRRMKMRSCLRIPQLKFFD
                    +NGP PT         +  +PI ++    +     PN N    +  I    +   S      +   ++      +   L  P +   +
Subjt:  ----------KENNGPGPT---------DMLAPIQSSELTQAQGTTDPNNNIVCVEYDIVD--LTKLSVENDRDDMTENNQFRRMKMRSCLRIPQLKFFD

Query:  KLGKYDATLNMPIALRNGTRSYTRYPMYSVLSYNNLSSKFRAFTASLDTITIPKNIYVAMKIPEWKAAKGHK---------------------TVGCKWV
             + T +M    ++G R                 ++  ++  SL   + P+    AMK   W+ A G +                      VGC+W+
Subjt:  KLGKYDATLNMPIALRNGTRSYTRYPMYSVLSYNNLSSKFRAFTASLDTITIPKNIYVAMKIPEWKAAKGHK---------------------TVGCKWV

Query:  FTMKYKSDRTLDRYKAKLVAKGFTQTFGVDYSETFSPVAKLTQSRFF----------CRLLDL-----------------PPSSSTK-------------
        FT K+ SD +L+RYKA+LVAKG+ Q  G+DY+ETFSPV K T  R             R LD+                 PP    K             
Subjt:  FTMKYKSDRTLDRYKAKLVAKGFTQTFGVDYSETFSPVAKLTQSRFF----------CRLLDL-----------------PPSSSTK-------------

Query:  -----------ASLRDTLIIPCLRKGPFLVRLFVL---------IVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYILGMEVARSREGISISQQ
                     LR  L+            LFVL         +VYVDDI+++G+DT  +      +   F++K+  +L Y LG+E  R  +G+ +SQ+
Subjt:  -----------ASLRDTLIIPCLRKGPFLVRLFVL---------IVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYILGMEVARSREGISISQQ

Query:  KYTIDLLKETGMTGCRPANTPIEFNAKLG-ESGDRVPVNKERYHCLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNRILRYLKDT------LG
        +YT+DLL  T M   +P  TP+  + KL   SG ++P +   Y  +VG L YLA TR D+ YAVN +SQ++H    +H  A+ R+LRYL  T      L 
Subjt:  KYTIDLLKETGMTGCRPANTPIEFNAKLG-ESGDRVPVNKERYHCLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNRILRYLKDT------LG

Query:  KGK--------------------STSGYCTFVWGNLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLLFNLHQYSDLQMKFYCDNKAA
        KG                     ST+GY  ++  + ++W SKKQ  V +SS E EY+++++   E  W+  LL  L          YCDN  A
Subjt:  KGK--------------------STSGYCTFVWGNLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLLFNLHQYSDLQMKFYCDNKAA

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).6.6e-0724.75Show/hide
Query:  NYFSWFQSVKMVLDGRNKFGYLTGEIPKPKPGDPQERVWKGEDSLLRSLLIHSMEPQIGKPLLYATATRDIWDVVQKLYSKRQNASRLYTLRKQIHECKQ
        NY +W    +  L    KFG++ G +PKP P  P  + W+  ++++   L++SM  ++ + ++YA     +W+ +++++    +  ++Y LR+++   +Q
Subjt:  NYFSWFQSVKMVLDGRNKFGYLTGEIPKPKPGDPQERVWKGEDSLLRSLLIHSMEPQIGKPLLYATATRDIWDVVQKLYSKRQNASRLYTLRKQIHECKQ

Query:  G
        G
Subjt:  G

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 89.4e-6232.44Show/hide
Query:  SYTRYPMYSVLSYNNLSSKFRAFTASLDTITIPKNIYVAMKIPEWKAA--------------------KGHKTVGCKWVFTMKYKSDRTLDRYKAKLVAK
        S T + +   LSY  +S  + +F   +     P     A +   W  A                       K +GCKWV+ +KY SD T++RYKA+LVAK
Subjt:  SYTRYPMYSVLSYNNLSSKFRAFTASLDTITIPKNIYVAMKIPEWKAA--------------------KGHKTVGCKWVFTMKYKSDRTLDRYKAKLVAK

Query:  GFTQTFGVDYSETFSPVAKLTQSRFFCRL---------------------------LDLPPSSSTKASLRDTL---IIPCLRKGPFLVR-----------
        G+TQ  G+D+ ETFSPV KLT  +    +                           + LPP  + +    D+L    +  L+K  + ++           
Subjt:  GFTQTFGVDYSETFSPVAKLTQSRFFCRL---------------------------LDLPPSSSTKASLRDTL---IIPCLRKGPFLVR-----------

Query:  ------------------------LFV-LIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYILGMEVARSREGISISQQKYTIDLLKETGMTGC
                                LF+ ++VYVDDI++  ++   ++ELK ++   F ++DLG LKY LG+E+ARS  GI+I Q+KY +DLL ETG+ GC
Subjt:  ------------------------LFV-LIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYILGMEVARSREGISISQQKYTIDLLKETGMTGC

Query:  RPANTPIEFNAKLGESGDRVPVNKERYHCLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNRILRYLKDTLGKG--------------------
        +P++ P++ +           V+ + Y  L+GRL+YL  TRLDI +AVN +SQF  A    H +AV +IL Y+K T+G+G                    
Subjt:  RPANTPIEFNAKLGESGDRVPVNKERYHCLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNRILRYLKDTLGKG--------------------

Query:  ------KSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLLFNLHQYSDLQMKFYCDNKAAISIANIA
              +ST+GYC F+  +L++W+SKKQ VV+KSSAE EY+A+S    E +WL +    L          +CDN AAI IA  A
Subjt:  ------KSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLLFNLHQYSDLQMKFYCDNKAAISIANIA

ATMG00810.1 DNA/RNA polymerases superfamily protein3.8e-3133.48Show/hide
Query:  LIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYILGMEVARSREGISISQQKYTIDLLKETGMTGCRPANTPIEFNAKLGESGDRVPVNKERYH
        L++YVDDI+L+G     +N L  ++   F++KDLG + Y LG+++     G+ +SQ KY   +L   GM  C+P +TP+        S  + P +   + 
Subjt:  LIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYILGMEVARSREGISISQQKYTIDLLKETGMTGCRPANTPIEFNAKLGESGDRVPVNKERYH

Query:  CLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNRILRYLKDTLGKG--------------------------KSTSGYCTFVWGNLVTWRSKKQ
         +VG L YL  TR DI YAVNIV Q +H       + + R+LRY+K T+  G                          +ST+G+CTF+  N+++W +K+Q
Subjt:  CLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNRILRYLKDTLGKG--------------------------KSTSGYCTFVWGNLVTWRSKKQ

Query:  SVVAKSSAEVEYKAMSHGICEEIW
          V++SS E EY+A++    E  W
Subjt:  SVVAKSSAEVEYKAMSHGICEEIW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)4.9e-1037.84Show/hide
Query:  NNLSSKFRAFTASLDTITIPKNIYVAMKIPEW--------KAAKGHKT------------VGCKWVFTMKYKSDRTLDRYKAKLVAKGFTQTFGVDYSET
        N L+ K+ + T +      PK++  A+K P W         A   +KT            +GCKWVF  K  SD TLDR KA+LVAKGF Q  G+ + ET
Subjt:  NNLSSKFRAFTASLDTITIPKNIYVAMKIPEW--------KAAKGHKT------------VGCKWVFTMKYKSDRTLDRYKAKLVAKGFTQTFGVDYSET

Query:  FSPVAKLTQSR
        +SPV +    R
Subjt:  FSPVAKLTQSR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCAGAACATGGCGACGAAACCCTAGCCATCATTGATGACAACCAAACCCTACATGAGTCTGAAACCCTAGATCTACCGACAGAACCCGCCGCTGCTGCAACCGC
CACCATCCAAGCTACCGTCGACCGCTATATCCTATTTCTAAACCCTAGCGCCGTTGCCGCAGCCATCCAAAATGCCCTAGATCGGTACCCTTCGGCCAGTTCGACGGCCT
CTGGACAGTCCACTCGGGTTCCTGATGTCGAATCTCTCTTCTCCGACCAGCACACAATCGATCGGCGTTTTCTGACGAGTTTGACGGCTGCTGGACCTTCAACCGAGCAC
CAGTCGTCGAAACCTTCTTCCCCGATACAGATTTGCAGCCTCCAGCGAACAAAGGTTGTCGCCTCATCTGGAGTGGCGGCTGCAGGTTGCCAACCCATAGGTTCATTTTT
CGGCACCTACCCATTAACTCCGACCACTGTCACTGCTGCTGGAACATCTACATATGCTCCAAACACTTACTTTCAGGCCGAGAATAGGGAACCGTCAATAGGCTTATCTT
CGGTCATGATTCAACAACAATTTGCGAACCTTCAGGCAAGTTTTCAACAACAAATCGCTATCCTTGGGGTAGCGCTTGGAACCTCTACCACCATCAATCAGAGTGTAGGA
GGTTCGGGTGTCCAGTTAGAACTACCGCTGTATTCTGAGAATTCGACTGTTTTCCCAACCTTAACTCTTGAGTCTAGACCTATGGAAAACTCCACAGGATTAATTGTTGG
TGAAAAGTTAAATGGCCAGAATTATTTCTCATGGTTTCAGTCCGTTAAAATGGTTCTTGATGGACGCAATAAATTTGGGTATCTAACTGGTGAGATACCGAAACCCAAAC
CAGGAGACCCTCAGGAGCGCGTTTGGAAAGGAGAGGATTCTTTACTCCGTTCTCTGTTGATTCATAGCATGGAACCTCAAATTGGAAAACCTCTGTTGTATGCGACTGCT
ACTCGAGATATTTGGGATGTTGTTCAGAAACTTTACTCCAAGAGACAGAACGCATCACGGTTGTATACTCTGCGAAAACAAATTCACGAATGCAAACAGGGAATCCAATA
TTATAAACTTGAAGAGGTTGACCGTGTGTATGATTTTTTAGCAGGTCTAAATTCCAAGTTTGATGCTGTACGAAGTCGAATACTGGAGATAAAGATCAATACCGACCTTG
ATGGAAAAGACAGATCAAGTGCTATGAATATCACAACTACCTCTGTAACTGACTCTGCCGCGTTCAGTGCTAAATCATCTAGCACTACTGAAGACAAGATCAATGTTGGA
AGTTACATGGTCGGCCCCCGAATGGCAGATGACCTCCAAACAACAAACCTAACCAAGCTCTGTACCTTCTCTCTTGGGGCGATTGCGCAATTAACTACAAACCATCTAAC
TAGAACTTCTGACAATTTCCTTTCGTATCACTCGTGTGCTGGTAATGAAAAAATCTGGATTGCTGGCGAGATCCTTGCCCCTATTGCCGGCAAGGGTCACATTTCTCCAT
TTAATGGTTTGATATTACAGAATATGCTGCATGTTCCTAAAATTTCTTATAATTTACTGTCTATGAGTAAGATAACCAGAGATCCAAACTGTCGTGTGGCTTTCTCATCG
GATGATGTTATTTTTCAGGTCTTGAGCTCAAGAAAGATGATTGGCACTACCCAGCACAATAGGGGACTCTATTTCCTTGATGGTGATACTTCTTCTAGGCATAGTTATAG
GGCTAGGTCTATTTTCTGTCCCCTTACAAACCCTCTCAACCTTTCATCTCTTATTCACAGTGATGTTTGGGGTCCCTCAAAGATTACCACTTCTTCAGGTAAGCGTTGGT
GCGTTACCTTCATTGATGACCACACTCGCCTTACTTGGGTTTTTCTTCTAATAGATAAATCTAAGATCTCATCCATTTTCCAAGAATTTTACACTACTATCGAAACTCAG
TTCAATGCCAAAATTGTCATCCTTCGAAGTGACAATGGTCGTGAATTCCTCACCAGTACCCTTCGTGAGTTCCTATCCACTAAAGGTATTGTTCACCAGAGCTCGTGTGC
CTATACACCCCAACAAAATGGAGTGGCTGAAAGAAAAAACCGTCATCTCCTAGAAAGAGATGCAGTCTTGACTACAGCTCATCTTATAAATCGGATGTCTTCCTGCGTTC
TCAGCTTCCAAACTCCTCTTGATTGTCTCAAACTGTCTTACCCTACCACTCGCCTTATACCTGATTGTGGTTATAAATGCTTCCATCCTAATTCTCGGAAATACTTCATC
TCTATGGATCTCACCTTCCTGGAAGATAAACCTTTCTTTCCCGTTAGTCCTCTTCAGGGGGAGAGTAGTAATGAAGAGACTAACTATTCTAATCCCATTCCTAAACTCAT
CCTGACAACCCACGACACTGTCCTACCCACCAGACAGAAAGAAAATAATGGCCCTGGCCCTACTGACATGCTGGCTCCAATCCAATCTTCTGAACTAACACAGGCACAAG
GTACTACTGACCCTAATAATAATATTGTTTGTGTTGAATATGATATTGTTGATTTGACAAAACTATCAGTAGAGAATGATAGAGATGATATGACAGAAAATAATCAGTTC
AGGAGAATGAAAATGAGATCATGCCTCAGAATTCCACAATTGAAGTTCTTCGATAAACTAGGAAAGTATGATGCCACCCTTAACATGCCCATTGCTCTGAGAAATGGCAC
AAGGTCCTACACTAGGTATCCCATGTATAGCGTCCTTTCTTATAACAATCTATCTTCTAAGTTCAGAGCGTTTACTGCCAGCCTTGACACTATAACTATACCGAAAAACA
TATATGTGGCTATGAAAATTCCTGAATGGAAGGCTGCTAAAGGGCATAAAACAGTTGGGTGTAAGTGGGTGTTCACAATGAAGTATAAATCTGACAGAACTCTTGACAGG
TATAAAGCCAAGCTTGTAGCAAAAGGGTTCACTCAGACTTTTGGGGTAGACTATTCTGAGACTTTTTCTCCTGTTGCAAAGTTAACACAATCAAGATTCTTCTGTCGGTT
GCTAGATTTACCACCTTCGTCAAGTACCAAGGCTTCACTTAGGGACACTCTGATCATACCTTGTTTACGAAAGGGTCCGTTTCTAGTAAGGTTGTTTGTTCTAATTGTGT
ATGTTGATGACATTGTCCTATCAGGGGATGATACTAATGAAATCAACGAGTTAAAACAGAAAATGGTGGACGAATTTGCAATCAAAGACCTAGGAAACCTGAAGTACATT
CTTGGGATGGAGGTAGCAAGGTCAAGAGAAGGAATCTCAATTTCACAACAAAAATACACAATTGATCTCCTGAAAGAAACAGGTATGACTGGATGCAGGCCTGCTAACAC
TCCTATCGAGTTTAATGCAAAATTGGGAGAGTCTGGTGACAGAGTTCCTGTTAACAAGGAAAGGTACCATTGTCTAGTCGGAAGACTAATTTACTTAGCGCACACTAGAC
TCGACATTTACTATGCTGTCAACATTGTGAGCCAGTTTATTCATGCTCTCTACAAAGAACACATGGAAGCTGTAAACAGGATTCTAAGATATTTGAAAGACACACTAGGA
AAAGGGAAATCTACATCTGGGTACTGTACCTTTGTTTGGGGTAATCTTGTAACGTGGAGGAGTAAAAAGCAAAGTGTGGTTGCTAAAAGCAGTGCAGAAGTTGAATACAA
AGCTATGAGTCATGGCATCTGTGAAGAAATCTGGTTAAAGAAACTTTTATTTAATCTTCACCAATATAGTGACCTACAAATGAAATTCTATTGTGATAATAAGGCTGCTA
TTAGCATTGCTAACATTGCTAATATCCTCACTAAAGGGTTACTTAAGCAGAACTTTGACTCGTGTGTTAGCAAGATGGGTCTCACTGACATTTACGTCCCATCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCATCAGAACATGGCGACGAAACCCTAGCCATCATTGATGACAACCAAACCCTACATGAGTCTGAAACCCTAGATCTACCGACAGAACCCGCCGCTGCTGCAACCGC
CACCATCCAAGCTACCGTCGACCGCTATATCCTATTTCTAAACCCTAGCGCCGTTGCCGCAGCCATCCAAAATGCCCTAGATCGGTACCCTTCGGCCAGTTCGACGGCCT
CTGGACAGTCCACTCGGGTTCCTGATGTCGAATCTCTCTTCTCCGACCAGCACACAATCGATCGGCGTTTTCTGACGAGTTTGACGGCTGCTGGACCTTCAACCGAGCAC
CAGTCGTCGAAACCTTCTTCCCCGATACAGATTTGCAGCCTCCAGCGAACAAAGGTTGTCGCCTCATCTGGAGTGGCGGCTGCAGGTTGCCAACCCATAGGTTCATTTTT
CGGCACCTACCCATTAACTCCGACCACTGTCACTGCTGCTGGAACATCTACATATGCTCCAAACACTTACTTTCAGGCCGAGAATAGGGAACCGTCAATAGGCTTATCTT
CGGTCATGATTCAACAACAATTTGCGAACCTTCAGGCAAGTTTTCAACAACAAATCGCTATCCTTGGGGTAGCGCTTGGAACCTCTACCACCATCAATCAGAGTGTAGGA
GGTTCGGGTGTCCAGTTAGAACTACCGCTGTATTCTGAGAATTCGACTGTTTTCCCAACCTTAACTCTTGAGTCTAGACCTATGGAAAACTCCACAGGATTAATTGTTGG
TGAAAAGTTAAATGGCCAGAATTATTTCTCATGGTTTCAGTCCGTTAAAATGGTTCTTGATGGACGCAATAAATTTGGGTATCTAACTGGTGAGATACCGAAACCCAAAC
CAGGAGACCCTCAGGAGCGCGTTTGGAAAGGAGAGGATTCTTTACTCCGTTCTCTGTTGATTCATAGCATGGAACCTCAAATTGGAAAACCTCTGTTGTATGCGACTGCT
ACTCGAGATATTTGGGATGTTGTTCAGAAACTTTACTCCAAGAGACAGAACGCATCACGGTTGTATACTCTGCGAAAACAAATTCACGAATGCAAACAGGGAATCCAATA
TTATAAACTTGAAGAGGTTGACCGTGTGTATGATTTTTTAGCAGGTCTAAATTCCAAGTTTGATGCTGTACGAAGTCGAATACTGGAGATAAAGATCAATACCGACCTTG
ATGGAAAAGACAGATCAAGTGCTATGAATATCACAACTACCTCTGTAACTGACTCTGCCGCGTTCAGTGCTAAATCATCTAGCACTACTGAAGACAAGATCAATGTTGGA
AGTTACATGGTCGGCCCCCGAATGGCAGATGACCTCCAAACAACAAACCTAACCAAGCTCTGTACCTTCTCTCTTGGGGCGATTGCGCAATTAACTACAAACCATCTAAC
TAGAACTTCTGACAATTTCCTTTCGTATCACTCGTGTGCTGGTAATGAAAAAATCTGGATTGCTGGCGAGATCCTTGCCCCTATTGCCGGCAAGGGTCACATTTCTCCAT
TTAATGGTTTGATATTACAGAATATGCTGCATGTTCCTAAAATTTCTTATAATTTACTGTCTATGAGTAAGATAACCAGAGATCCAAACTGTCGTGTGGCTTTCTCATCG
GATGATGTTATTTTTCAGGTCTTGAGCTCAAGAAAGATGATTGGCACTACCCAGCACAATAGGGGACTCTATTTCCTTGATGGTGATACTTCTTCTAGGCATAGTTATAG
GGCTAGGTCTATTTTCTGTCCCCTTACAAACCCTCTCAACCTTTCATCTCTTATTCACAGTGATGTTTGGGGTCCCTCAAAGATTACCACTTCTTCAGGTAAGCGTTGGT
GCGTTACCTTCATTGATGACCACACTCGCCTTACTTGGGTTTTTCTTCTAATAGATAAATCTAAGATCTCATCCATTTTCCAAGAATTTTACACTACTATCGAAACTCAG
TTCAATGCCAAAATTGTCATCCTTCGAAGTGACAATGGTCGTGAATTCCTCACCAGTACCCTTCGTGAGTTCCTATCCACTAAAGGTATTGTTCACCAGAGCTCGTGTGC
CTATACACCCCAACAAAATGGAGTGGCTGAAAGAAAAAACCGTCATCTCCTAGAAAGAGATGCAGTCTTGACTACAGCTCATCTTATAAATCGGATGTCTTCCTGCGTTC
TCAGCTTCCAAACTCCTCTTGATTGTCTCAAACTGTCTTACCCTACCACTCGCCTTATACCTGATTGTGGTTATAAATGCTTCCATCCTAATTCTCGGAAATACTTCATC
TCTATGGATCTCACCTTCCTGGAAGATAAACCTTTCTTTCCCGTTAGTCCTCTTCAGGGGGAGAGTAGTAATGAAGAGACTAACTATTCTAATCCCATTCCTAAACTCAT
CCTGACAACCCACGACACTGTCCTACCCACCAGACAGAAAGAAAATAATGGCCCTGGCCCTACTGACATGCTGGCTCCAATCCAATCTTCTGAACTAACACAGGCACAAG
GTACTACTGACCCTAATAATAATATTGTTTGTGTTGAATATGATATTGTTGATTTGACAAAACTATCAGTAGAGAATGATAGAGATGATATGACAGAAAATAATCAGTTC
AGGAGAATGAAAATGAGATCATGCCTCAGAATTCCACAATTGAAGTTCTTCGATAAACTAGGAAAGTATGATGCCACCCTTAACATGCCCATTGCTCTGAGAAATGGCAC
AAGGTCCTACACTAGGTATCCCATGTATAGCGTCCTTTCTTATAACAATCTATCTTCTAAGTTCAGAGCGTTTACTGCCAGCCTTGACACTATAACTATACCGAAAAACA
TATATGTGGCTATGAAAATTCCTGAATGGAAGGCTGCTAAAGGGCATAAAACAGTTGGGTGTAAGTGGGTGTTCACAATGAAGTATAAATCTGACAGAACTCTTGACAGG
TATAAAGCCAAGCTTGTAGCAAAAGGGTTCACTCAGACTTTTGGGGTAGACTATTCTGAGACTTTTTCTCCTGTTGCAAAGTTAACACAATCAAGATTCTTCTGTCGGTT
GCTAGATTTACCACCTTCGTCAAGTACCAAGGCTTCACTTAGGGACACTCTGATCATACCTTGTTTACGAAAGGGTCCGTTTCTAGTAAGGTTGTTTGTTCTAATTGTGT
ATGTTGATGACATTGTCCTATCAGGGGATGATACTAATGAAATCAACGAGTTAAAACAGAAAATGGTGGACGAATTTGCAATCAAAGACCTAGGAAACCTGAAGTACATT
CTTGGGATGGAGGTAGCAAGGTCAAGAGAAGGAATCTCAATTTCACAACAAAAATACACAATTGATCTCCTGAAAGAAACAGGTATGACTGGATGCAGGCCTGCTAACAC
TCCTATCGAGTTTAATGCAAAATTGGGAGAGTCTGGTGACAGAGTTCCTGTTAACAAGGAAAGGTACCATTGTCTAGTCGGAAGACTAATTTACTTAGCGCACACTAGAC
TCGACATTTACTATGCTGTCAACATTGTGAGCCAGTTTATTCATGCTCTCTACAAAGAACACATGGAAGCTGTAAACAGGATTCTAAGATATTTGAAAGACACACTAGGA
AAAGGGAAATCTACATCTGGGTACTGTACCTTTGTTTGGGGTAATCTTGTAACGTGGAGGAGTAAAAAGCAAAGTGTGGTTGCTAAAAGCAGTGCAGAAGTTGAATACAA
AGCTATGAGTCATGGCATCTGTGAAGAAATCTGGTTAAAGAAACTTTTATTTAATCTTCACCAATATAGTGACCTACAAATGAAATTCTATTGTGATAATAAGGCTGCTA
TTAGCATTGCTAACATTGCTAATATCCTCACTAAAGGGTTACTTAAGCAGAACTTTGACTCGTGTGTTAGCAAGATGGGTCTCACTGACATTTACGTCCCATCTTGA
Protein sequenceShow/hide protein sequence
MASEHGDETLAIIDDNQTLHESETLDLPTEPAAAATATIQATVDRYILFLNPSAVAAAIQNALDRYPSASSTASGQSTRVPDVESLFSDQHTIDRRFLTSLTAAGPSTEH
QSSKPSSPIQICSLQRTKVVASSGVAAAGCQPIGSFFGTYPLTPTTVTAAGTSTYAPNTYFQAENREPSIGLSSVMIQQQFANLQASFQQQIAILGVALGTSTTINQSVG
GSGVQLELPLYSENSTVFPTLTLESRPMENSTGLIVGEKLNGQNYFSWFQSVKMVLDGRNKFGYLTGEIPKPKPGDPQERVWKGEDSLLRSLLIHSMEPQIGKPLLYATA
TRDIWDVVQKLYSKRQNASRLYTLRKQIHECKQGIQYYKLEEVDRVYDFLAGLNSKFDAVRSRILEIKINTDLDGKDRSSAMNITTTSVTDSAAFSAKSSSTTEDKINVG
SYMVGPRMADDLQTTNLTKLCTFSLGAIAQLTTNHLTRTSDNFLSYHSCAGNEKIWIAGEILAPIAGKGHISPFNGLILQNMLHVPKISYNLLSMSKITRDPNCRVAFSS
DDVIFQVLSSRKMIGTTQHNRGLYFLDGDTSSRHSYRARSIFCPLTNPLNLSSLIHSDVWGPSKITTSSGKRWCVTFIDDHTRLTWVFLLIDKSKISSIFQEFYTTIETQ
FNAKIVILRSDNGREFLTSTLREFLSTKGIVHQSSCAYTPQQNGVAERKNRHLLERDAVLTTAHLINRMSSCVLSFQTPLDCLKLSYPTTRLIPDCGYKCFHPNSRKYFI
SMDLTFLEDKPFFPVSPLQGESSNEETNYSNPIPKLILTTHDTVLPTRQKENNGPGPTDMLAPIQSSELTQAQGTTDPNNNIVCVEYDIVDLTKLSVENDRDDMTENNQF
RRMKMRSCLRIPQLKFFDKLGKYDATLNMPIALRNGTRSYTRYPMYSVLSYNNLSSKFRAFTASLDTITIPKNIYVAMKIPEWKAAKGHKTVGCKWVFTMKYKSDRTLDR
YKAKLVAKGFTQTFGVDYSETFSPVAKLTQSRFFCRLLDLPPSSSTKASLRDTLIIPCLRKGPFLVRLFVLIVYVDDIVLSGDDTNEINELKQKMVDEFAIKDLGNLKYI
LGMEVARSREGISISQQKYTIDLLKETGMTGCRPANTPIEFNAKLGESGDRVPVNKERYHCLVGRLIYLAHTRLDIYYAVNIVSQFIHALYKEHMEAVNRILRYLKDTLG
KGKSTSGYCTFVWGNLVTWRSKKQSVVAKSSAEVEYKAMSHGICEEIWLKKLLFNLHQYSDLQMKFYCDNKAAISIANIANILTKGLLKQNFDSCVSKMGLTDIYVPS