; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C04G069735 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C04G069735
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionBeta-galactosidase
Genome locationCla97Chr04:6996257..7001741
RNA-Seq ExpressionCla97C04G069735
SyntenyCla97C04G069735
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031941.1 Beta-galactosidase [Cucumis melo var. makuwa]1.1e-18245.15Show/hide
Query:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL
        GR++FG+LTGE  +P  GD  E LWKGEDSL+RS+LI+SMEPQIGKPLLY ATA+D+WD  Q LYSKRQN SRLYTLRKQ+H CKQG++D+T+YFNKLSL
Subjt:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL

Query:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGRRPILTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK
        +WQEMD+C E VWD P    QY KLEE DRVYD L GLN KFD +  RILG+RP+ ++ME C EVRLEED+ NAM +  +  IDSAAFSA SS    DK 
Subjt:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGRRPILTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK

Query:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------
        N K  P                                          E  P +             T +LGA+AQSG+   L L++             
Subjt:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------

Query:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV
        TDHLTG+S++F+SY PCA NEKIRIADG+LA +AGKG I PF+G  L NVLHVPK+SYNLLS+ ++          LP            TIG +     
Subjt:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV

Query:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----
        + +LD      ++  V  L  +      D                   SFP+     T  F+  +SD WGPS+  TSSGK WFVTFIDDHTRLTW     
Subjt:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----

Query:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL
                   FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNGV ERKNRHL++VARSLMLSTSL SYLW D +LTAAHL
Subjt:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL

Query:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI
        INRMPSR+L+ QTPLD LK SYP+  L                        +      CVFVGYPLHQ  Y                           + 
Subjt:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI

Query:  VCAENVCVEN----DIVDLTKLSVKN-------------------------------------------DRDDMTE-------NNQVVKSDVVSTTVQEN
        +  ENV  E+    + V+ T ++V +                                            RD   E       NN + ++D     V EN
Subjt:  VCAENVCVEN----DIVDLTKLSVKN-------------------------------------------DRDDMTE-------NNQVVKSDVVSTTVQEN

Query:  --------ENEVIPQNSTTEEE---LDKLEKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVFEEM
                E EV  + S  E E     KL++YD +LD+PIALRKGTRSCT++P  +++SY+NLS +FRAFTA+LD+  IPKNIY A+E PEWK AV EEM
Subjt:  --------ENEVIPQNSTTEEE---LDKLEKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVFEEM

Query:  RAFEKNHT
        +A EKN T
Subjt:  RAFEKNHT

KAA0061447.1 Beta-galactosidase [Cucumis melo var. makuwa]7.1e-18546.3Show/hide
Query:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL
        GR++FG+LTGE  +P  GD  E LWKGEDSL+RS+LI+SMEPQIGKPLLY ATA+D+WD  Q LYSKRQN SRLYTLRKQ+H CKQG++D+T+YFNKLSL
Subjt:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL

Query:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGRRPILTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK
        +WQEMD+C E VWD P    QY KLEE DRVYD L GLN KFD +  RILG+RP+ ++ME C EVRLEED+ NAM +  +  IDSAAFSA SS    DK 
Subjt:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGRRPILTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK

Query:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------
        N K  P                                          E  P +             T +LGA+AQSG+   L L++             
Subjt:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------

Query:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV
        TDHLTG+S++F+SY PCA NEKIRIADG+LA +AGKG I PF+G  L NVLHVPK+SYNLLS+ ++          LP            TIG +     
Subjt:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV

Query:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----
        + +LD      ++  V  L  +      D                   SFP+     T  F+  +SD WGPS+  TSSGK WFVTFIDDHTRLTW     
Subjt:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----

Query:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL
                   FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNGV ERKNRHL++VARSLMLSTSL SYLW D +LTAAHL
Subjt:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL

Query:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI
        INRMPSR+L+ QTPLD LK SYP+  L                        +      CVFVGYPLHQ  Y                           + 
Subjt:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI

Query:  VCAENVCVEN----DIVDLTKLSVK--NDRDDMTENNQV---------VKSDVVSTTVQ------------------ENENEVIPQNSTTEEE---LDKL
        +  ENV  E+    + ++ T ++V   +    +   NQV         ++ +V S T Q                  ++E EV  + S  E E     KL
Subjt:  VCAENVCVEN----DIVDLTKLSVK--NDRDDMTENNQV---------VKSDVVSTTVQ------------------ENENEVIPQNSTTEEE---LDKL

Query:  EKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVFEEMRAFEKNHT
        ++YD +LD+PIALRKGTRSCT++P  +++SY+NLS +FRAFTA+LD+  IPKNIY A+E PEWK AV EEM+A EKN T
Subjt:  EKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVFEEMRAFEKNHT

TYK13763.1 Beta-galactosidase [Cucumis melo var. makuwa]1.1e-18246.19Show/hide
Query:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL
        GRH+FG+LTGE  +P  GD  E LWKGEDS +RS+LI+SMEPQIGKPLLY ATA+D+WD  Q LYSKRQN SRLYTLRKQ+H CKQG++D+T+YFNKLSL
Subjt:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL

Query:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGRRPILTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK
        +WQEMD+C E VWD P    QY KLEE DRVYD L GLN KFD +  RILG+R + ++ME C EVRLEED+ NAM +  +  IDSAAFSA SS    DK 
Subjt:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGRRPILTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK

Query:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------
        N K  P                                          E  P +             T +LGA+AQSG+   L L++             
Subjt:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------

Query:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV
        TDHLTG+S++F+SY PCA NEKIRIADG+LA +AGKG I PF+G  L NVLHVPK+SYNLLS+ ++          LP            TIG +     
Subjt:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV

Query:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----
        + +LD      ++  V  L  +      D                   SFP+     T  F+  +SD WGPS+  TSSGK WFVTFIDDHTRLTW     
Subjt:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----

Query:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL
                   FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNGV ERKNRHL++VARSLMLSTSL SYLW D +LTAAHL
Subjt:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL

Query:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYNNIVCAENVCVEN----DIVDLTKLSVKN
        INRMPSR+L+ QTPLD LK SYP+  L                        +      CVFVGYPLH          ENV  E+    + V+ T ++V +
Subjt:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYNNIVCAENVCVEN----DIVDLTKLSVKN

Query:  -------------------------------------------DRDDMTE----NNQVVKSDVVSTTVQEN--------ENEVIPQNSTTEEE---LDKL
                                                    RD   E    NN + ++D     V EN        E EV  + S  E E     KL
Subjt:  -------------------------------------------DRDDMTE----NNQVVKSDVVSTTVQEN--------ENEVIPQNSTTEEE---LDKL

Query:  EKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVFEEMRAFEKNHT
        ++YD +LD+PIALRKGTRSCT++P  +++SY+NLS +FRAFTASLD+  I KNIY A+E PEW  AV EEM+A EKN T
Subjt:  EKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVFEEMRAFEKNHT

TYK31050.1 Beta-galactosidase [Cucumis melo var. makuwa]3.3e-18245.04Show/hide
Query:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL
        GR++FG+LTGEI +P  GD  E LWKGEDSL+RS+LI+SMEPQIGKPLLY  TA+D+WD  Q LYSKRQN SRLYTLRKQ+H CKQG++D+T+YFNKLSL
Subjt:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL

Query:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGRRPILTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK
        +WQEMD+C E VWD P    QY KLEE DRVYD L GLN KFD +  RILG+RP+ ++ME C EVRLEED+ NAM +  +  IDSAAFSA SS    DK 
Subjt:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGRRPILTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK

Query:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------
        N K  P                                          E  P +             T +LGA+AQSG+   L L++             
Subjt:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------

Query:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV
        TDHLTG+S++F+SY PCA NEKIRIADG+LA +AGKG I PF+G  L NVLHVPK+SYNLLS+ ++          LP            TIG +     
Subjt:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV

Query:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----
        + +LD      ++  V  L  +      D                   SFP+     T  F+  +SD WGPS+  TSSGK WFVTFIDDHTRLTW     
Subjt:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----

Query:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL
                   FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNGV ERKNRHL++VARSLMLSTSL SYLW D +LTAAHL
Subjt:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL

Query:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI
        INRMPSR+L+ QTPLD LK SYP+  L                        +      CVFVGYPLHQ  Y                           + 
Subjt:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI

Query:  VCAENVCVEN----DIVDLTKLSVKN-------------------------------------------DRDDMTE-------NNQVVKSDVVSTTVQEN
        +  ENV  E+    + V+ T ++V +                                            RD   E       NN + ++D       EN
Subjt:  VCAENVCVEN----DIVDLTKLSVKN-------------------------------------------DRDDMTE-------NNQVVKSDVVSTTVQEN

Query:  --------ENEVIPQNSTTEEE---LDKLEKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVFEEM
                E EV  + S  E E     KL++YD +LD+PIALRKGTRSCT++P  +++SY+NLS +FRAFTA+LD+  IPKNIY A+E PEWK AV EEM
Subjt:  --------ENEVIPQNSTTEEE---LDKLEKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVFEEM

Query:  RAFEKNHT
        +A EKN T
Subjt:  RAFEKNHT

TYK31717.1 Beta-galactosidase [Cucumis melo var. makuwa]3.3e-18245.04Show/hide
Query:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL
        GR++FG+LTGEI +P  GD  E LWKGEDSL+RS+LI+SMEPQIGKPLLY  TA+D+WD  Q LYSKRQN SRLYTLRKQ+H CKQG++D+T+YFNKLSL
Subjt:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL

Query:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGRRPILTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK
        +WQEMD+C E VWD P    QY KLEE DRVYD L GLN KFD +  RILG+RP+ ++ME C EVRLEED+ NAM +  +  IDSAAFSA SS    DK 
Subjt:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGRRPILTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK

Query:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------
        N K  P                                          E  P +             T +LGA+AQSG+   L L++             
Subjt:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------

Query:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV
        TDHLTG+S++F+SY PCA NEKIRIADG+LA +AGKG I PF+G  L NVLHVPK+SYNLLS+ ++          LP            TIG +     
Subjt:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV

Query:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----
        + +LD      ++  V  L  +      D                   SFP+     T  F+  +SD WGPS+  TSSGK WFVTFIDDHTRLTW     
Subjt:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----

Query:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL
                   FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNGV ERKNRHL++VARSLMLSTSL SYLW D +LTAAHL
Subjt:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL

Query:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI
        INRMPSR+L+ QTPLD LK SYP+  L                        +      CVFVGYPLHQ  Y                           + 
Subjt:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI

Query:  VCAENVCVEN----DIVDLTKLSVKN-------------------------------------------DRDDMTE-------NNQVVKSDVVSTTVQEN
        +  ENV  E+    + V+ T ++V +                                            RD   E       NN + ++D       EN
Subjt:  VCAENVCVEN----DIVDLTKLSVKN-------------------------------------------DRDDMTE-------NNQVVKSDVVSTTVQEN

Query:  --------ENEVIPQNSTTEEE---LDKLEKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVFEEM
                E EV  + S  E E     KL++YD +LD+PIALRKGTRSCT++P  +++SY+NLS +FRAFTA+LD+  IPKNIY A+E PEWK AV EEM
Subjt:  --------ENEVIPQNSTTEEE---LDKLEKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVFEEM

Query:  RAFEKNHT
        +A EKN T
Subjt:  RAFEKNHT

TrEMBL top hitse value%identityAlignment
A0A5A7SQW1 Beta-galactosidase5.5e-18345.15Show/hide
Query:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL
        GR++FG+LTGE  +P  GD  E LWKGEDSL+RS+LI+SMEPQIGKPLLY ATA+D+WD  Q LYSKRQN SRLYTLRKQ+H CKQG++D+T+YFNKLSL
Subjt:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL

Query:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGRRPILTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK
        +WQEMD+C E VWD P    QY KLEE DRVYD L GLN KFD +  RILG+RP+ ++ME C EVRLEED+ NAM +  +  IDSAAFSA SS    DK 
Subjt:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGRRPILTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK

Query:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------
        N K  P                                          E  P +             T +LGA+AQSG+   L L++             
Subjt:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------

Query:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV
        TDHLTG+S++F+SY PCA NEKIRIADG+LA +AGKG I PF+G  L NVLHVPK+SYNLLS+ ++          LP            TIG +     
Subjt:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV

Query:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----
        + +LD      ++  V  L  +      D                   SFP+     T  F+  +SD WGPS+  TSSGK WFVTFIDDHTRLTW     
Subjt:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----

Query:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL
                   FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNGV ERKNRHL++VARSLMLSTSL SYLW D +LTAAHL
Subjt:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL

Query:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI
        INRMPSR+L+ QTPLD LK SYP+  L                        +      CVFVGYPLHQ  Y                           + 
Subjt:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI

Query:  VCAENVCVEN----DIVDLTKLSVKN-------------------------------------------DRDDMTE-------NNQVVKSDVVSTTVQEN
        +  ENV  E+    + V+ T ++V +                                            RD   E       NN + ++D     V EN
Subjt:  VCAENVCVEN----DIVDLTKLSVKN-------------------------------------------DRDDMTE-------NNQVVKSDVVSTTVQEN

Query:  --------ENEVIPQNSTTEEE---LDKLEKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVFEEM
                E EV  + S  E E     KL++YD +LD+PIALRKGTRSCT++P  +++SY+NLS +FRAFTA+LD+  IPKNIY A+E PEWK AV EEM
Subjt:  --------ENEVIPQNSTTEEE---LDKLEKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVFEEM

Query:  RAFEKNHT
        +A EKN T
Subjt:  RAFEKNHT

A0A5A7V3J5 Beta-galactosidase3.4e-18546.3Show/hide
Query:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL
        GR++FG+LTGE  +P  GD  E LWKGEDSL+RS+LI+SMEPQIGKPLLY ATA+D+WD  Q LYSKRQN SRLYTLRKQ+H CKQG++D+T+YFNKLSL
Subjt:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL

Query:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGRRPILTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK
        +WQEMD+C E VWD P    QY KLEE DRVYD L GLN KFD +  RILG+RP+ ++ME C EVRLEED+ NAM +  +  IDSAAFSA SS    DK 
Subjt:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGRRPILTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK

Query:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------
        N K  P                                          E  P +             T +LGA+AQSG+   L L++             
Subjt:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------

Query:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV
        TDHLTG+S++F+SY PCA NEKIRIADG+LA +AGKG I PF+G  L NVLHVPK+SYNLLS+ ++          LP            TIG +     
Subjt:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV

Query:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----
        + +LD      ++  V  L  +      D                   SFP+     T  F+  +SD WGPS+  TSSGK WFVTFIDDHTRLTW     
Subjt:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----

Query:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL
                   FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNGV ERKNRHL++VARSLMLSTSL SYLW D +LTAAHL
Subjt:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL

Query:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI
        INRMPSR+L+ QTPLD LK SYP+  L                        +      CVFVGYPLHQ  Y                           + 
Subjt:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI

Query:  VCAENVCVEN----DIVDLTKLSVK--NDRDDMTENNQV---------VKSDVVSTTVQ------------------ENENEVIPQNSTTEEE---LDKL
        +  ENV  E+    + ++ T ++V   +    +   NQV         ++ +V S T Q                  ++E EV  + S  E E     KL
Subjt:  VCAENVCVEN----DIVDLTKLSVK--NDRDDMTENNQV---------VKSDVVSTTVQ------------------ENENEVIPQNSTTEEE---LDKL

Query:  EKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVFEEMRAFEKNHT
        ++YD +LD+PIALRKGTRSCT++P  +++SY+NLS +FRAFTA+LD+  IPKNIY A+E PEWK AV EEM+A EKN T
Subjt:  EKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVFEEMRAFEKNHT

A0A5D3CPE0 Beta-galactosidase5.5e-18346.19Show/hide
Query:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL
        GRH+FG+LTGE  +P  GD  E LWKGEDS +RS+LI+SMEPQIGKPLLY ATA+D+WD  Q LYSKRQN SRLYTLRKQ+H CKQG++D+T+YFNKLSL
Subjt:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL

Query:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGRRPILTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK
        +WQEMD+C E VWD P    QY KLEE DRVYD L GLN KFD +  RILG+R + ++ME C EVRLEED+ NAM +  +  IDSAAFSA SS    DK 
Subjt:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGRRPILTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK

Query:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------
        N K  P                                          E  P +             T +LGA+AQSG+   L L++             
Subjt:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------

Query:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV
        TDHLTG+S++F+SY PCA NEKIRIADG+LA +AGKG I PF+G  L NVLHVPK+SYNLLS+ ++          LP            TIG +     
Subjt:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV

Query:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----
        + +LD      ++  V  L  +      D                   SFP+     T  F+  +SD WGPS+  TSSGK WFVTFIDDHTRLTW     
Subjt:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----

Query:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL
                   FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNGV ERKNRHL++VARSLMLSTSL SYLW D +LTAAHL
Subjt:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL

Query:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYNNIVCAENVCVEN----DIVDLTKLSVKN
        INRMPSR+L+ QTPLD LK SYP+  L                        +      CVFVGYPLH          ENV  E+    + V+ T ++V +
Subjt:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYNNIVCAENVCVEN----DIVDLTKLSVKN

Query:  -------------------------------------------DRDDMTE----NNQVVKSDVVSTTVQEN--------ENEVIPQNSTTEEE---LDKL
                                                    RD   E    NN + ++D     V EN        E EV  + S  E E     KL
Subjt:  -------------------------------------------DRDDMTE----NNQVVKSDVVSTTVQEN--------ENEVIPQNSTTEEE---LDKL

Query:  EKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVFEEMRAFEKNHT
        ++YD +LD+PIALRKGTRSCT++P  +++SY+NLS +FRAFTASLD+  I KNIY A+E PEW  AV EEM+A EKN T
Subjt:  EKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVFEEMRAFEKNHT

A0A5D3E603 Beta-galactosidase1.6e-18245.04Show/hide
Query:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL
        GR++FG+LTGEI +P  GD  E LWKGEDSL+RS+LI+SMEPQIGKPLLY  TA+D+WD  Q LYSKRQN SRLYTLRKQ+H CKQG++D+T+YFNKLSL
Subjt:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL

Query:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGRRPILTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK
        +WQEMD+C E VWD P    QY KLEE DRVYD L GLN KFD +  RILG+RP+ ++ME C EVRLEED+ NAM +  +  IDSAAFSA SS    DK 
Subjt:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGRRPILTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK

Query:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------
        N K  P                                          E  P +             T +LGA+AQSG+   L L++             
Subjt:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------

Query:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV
        TDHLTG+S++F+SY PCA NEKIRIADG+LA +AGKG I PF+G  L NVLHVPK+SYNLLS+ ++          LP            TIG +     
Subjt:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV

Query:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----
        + +LD      ++  V  L  +      D                   SFP+     T  F+  +SD WGPS+  TSSGK WFVTFIDDHTRLTW     
Subjt:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----

Query:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL
                   FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNGV ERKNRHL++VARSLMLSTSL SYLW D +LTAAHL
Subjt:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL

Query:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI
        INRMPSR+L+ QTPLD LK SYP+  L                        +      CVFVGYPLHQ  Y                           + 
Subjt:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI

Query:  VCAENVCVEN----DIVDLTKLSVKN-------------------------------------------DRDDMTE-------NNQVVKSDVVSTTVQEN
        +  ENV  E+    + V+ T ++V +                                            RD   E       NN + ++D       EN
Subjt:  VCAENVCVEN----DIVDLTKLSVKN-------------------------------------------DRDDMTE-------NNQVVKSDVVSTTVQEN

Query:  --------ENEVIPQNSTTEEE---LDKLEKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVFEEM
                E EV  + S  E E     KL++YD +LD+PIALRKGTRSCT++P  +++SY+NLS +FRAFTA+LD+  IPKNIY A+E PEWK AV EEM
Subjt:  --------ENEVIPQNSTTEEE---LDKLEKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVFEEM

Query:  RAFEKNHT
        +A EKN T
Subjt:  RAFEKNHT

A0A5D3E6F8 Beta-galactosidase1.6e-18245.04Show/hide
Query:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL
        GR++FG+LTGEI +P  GD  E LWKGEDSL+RS+LI+SMEPQIGKPLLY  TA+D+WD  Q LYSKRQN SRLYTLRKQ+H CKQG++D+T+YFNKLSL
Subjt:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL

Query:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGRRPILTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK
        +WQEMD+C E VWD P    QY KLEE DRVYD L GLN KFD +  RILG+RP+ ++ME C EVRLEED+ NAM +  +  IDSAAFSA SS    DK 
Subjt:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGRRPILTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK

Query:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------
        N K  P                                          E  P +             T +LGA+AQSG+   L L++             
Subjt:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------

Query:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV
        TDHLTG+S++F+SY PCA NEKIRIADG+LA +AGKG I PF+G  L NVLHVPK+SYNLLS+ ++          LP            TIG +     
Subjt:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV

Query:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----
        + +LD      ++  V  L  +      D                   SFP+     T  F+  +SD WGPS+  TSSGK WFVTFIDDHTRLTW     
Subjt:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----

Query:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL
                   FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNGV ERKNRHL++VARSLMLSTSL SYLW D +LTAAHL
Subjt:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL

Query:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI
        INRMPSR+L+ QTPLD LK SYP+  L                        +      CVFVGYPLHQ  Y                           + 
Subjt:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI

Query:  VCAENVCVEN----DIVDLTKLSVKN-------------------------------------------DRDDMTE-------NNQVVKSDVVSTTVQEN
        +  ENV  E+    + V+ T ++V +                                            RD   E       NN + ++D       EN
Subjt:  VCAENVCVEN----DIVDLTKLSVKN-------------------------------------------DRDDMTE-------NNQVVKSDVVSTTVQEN

Query:  --------ENEVIPQNSTTEEE---LDKLEKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVFEEM
                E EV  + S  E E     KL++YD +LD+PIALRKGTRSCT++P  +++SY+NLS +FRAFTA+LD+  IPKNIY A+E PEWK AV EEM
Subjt:  --------ENEVIPQNSTTEEE---LDKLEKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVFEEM

Query:  RAFEKNHT
        +A EKN T
Subjt:  RAFEKNHT

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.7e-1729.77Show/hide
Query:  YSDAWGPSRTATSSGKHWFVTFIDDHTR--LTW--------------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGV
        +SD  GP    T   K++FV F+D  T   +T+              F    E  FN K+  L  DNGRE+L+N +R+F   KGI +  +  +TPQ NGV
Subjt:  YSDAWGPSRTATSSGKHWFVTFIDDHTR--LTW--------------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGV

Query:  VERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHLINRMPSRVL--NFQTP----------LDYLKLSYPTIALYLMSLS--------KCVFVGYPLHQ
         ER  R + + AR+++    L    W + VLTA +LINR+PSR L  + +TP          L +L++   T+ +++ +          K +FVGY  + 
Subjt:  VERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHLINRMPSRVL--NFQTP----------LDYLKLSYPTIALYLMSLS--------KCVFVGYPLHQ

Query:  CDYNNIVCAENVCVENDIVDLTKLSVKNDRDDMTENNQVVKSDVV-STTVQENENEVIPQNS
            + V  + +   + +VD T +           N++ VK + V     +E+EN+  P +S
Subjt:  CDYNNIVCAENVCVENDIVDLTKLSVKNDRDDMTENNQVVKSDVV-STTVQENENEVIPQNS

P0C2J5 Transposon Ty2-LR2 Gag-Pol polyprotein1.7e-0823.7Show/hide
Query:  FSTFHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTWFY------------------TTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSC
        +  F   ++D +GP      S   +F++F D+ TR  W Y                    I+ QFNA++ +++ D G E+   TL +F + +GI    + 
Subjt:  FSTFHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTWFY------------------TTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSC

Query:  AYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHLINRMPS-------------------RVLNFQTP---------------------L
            + +GV ER NR LL   R+L+  + L ++LW   V  +  + N + S                    +L F  P                     L
Subjt:  AYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHLINRMPS-------------------RVLNFQTP---------------------L

Query:  DYLKLSYPTIALYLMSLSKCVFVGYPLHQCDYNNIVCAENVCVENDIVDLTKLSVKNDRDDMTENNQ-VVKSDVVSTTVQENENEVIPQNSTTEEELDKL
           + SY  I +YL SL K V         D  N V  +N   + D  D   L+  +D + +T +NQ  ++ +    +  +N+       S  E   D L
Subjt:  DYLKLSYPTIALYLMSLSKCVFVGYPLHQCDYNNIVCAENVCVENDIVDLTKLSVKNDRDDMTENNQ-VVKSDVVSTTVQENENEVIPQNSTTEEELDKL

Query:  -EKYDATLDMPIALRKGTRSCTRYPKN---SFLSYNNLSSKFRAFT
           + +    P+ L K      R PK        YN L S  R+ T
Subjt:  -EKYDATLDMPIALRKGTRSCTRYPKN---SFLSYNNLSSKFRAFT

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.4e-2037.33Show/hide
Query:  YSDAWGPSRTATSSGKHWFVTFIDDHTRLTW----------------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGV
        YSD  GP    +  G  +FVTFIDD +R  W                F+  +E +   K+  LRSDNG E+ +    E+ S  GI H+ +   TPQ NGV
Subjt:  YSDAWGPSRTATSSGKHWFVTFIDDHTRLTW----------------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGV

Query:  VERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHLINRMPSRVLNFQTP
         ER NR +++  RS++    L    W + V TA +LINR PS  L F+ P
Subjt:  VERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHLINRMPSRVLNFQTP

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.0e-2222.37Show/hide
Query:  PRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSY-------FNKLSLIWQEMDI
        PR+ +P    WK +D L+ S ++ ++   +   +    TA  IW+ ++++Y+   +   +  LR Q+ +  +G+  I  Y       F++L+L+ + MD 
Subjt:  PRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSY-------FNKLSLIWQEMDI

Query:  CCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGRRPILTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTT-----GDKKNR
                          E+++RV   L  L  ++  +  +I  +    T+ E    +   E K+ A++      I + A S  ++ TT     G++ NR
Subjt:  CCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGRRPILTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTT-----GDKKNR

Query:  ----------KP---------PPEN-----------CPVNIGTSSLGAMAQSGIS------PP------------------------LSLLTTDHLTGTS
                  KP         P  N           C V   ++   +  Q  +S      PP                        L    T H+T   
Subjt:  ----------KP---------PPEN-----------CPVNIGTSSLGAMAQSGIS------PP------------------------LSLLTTDHLTGTS

Query:  DNFLSYHPCADNEKIRIADGT---LAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLALPSTIG----------DSISLIVILLLDKT--------
        +N   + P    + + +ADG+   ++H       +    L LHN+L+VP I  NL+SV RL   + +             ++  V LL  KT        
Subjt:  DNFLSYHPCADNEKIRIADGT---LAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLALPSTIG----------DSISLIVILLLDKT--------

Query:  ----------------------------------------IGLVCYLPIFQLLKMTD---------PFSFPTLQIFSTFHSYYSDAWGPSRTATSSGKHW
                                                  L    P  + L  +D         PFS  T+         YSD W  S   +     +
Subjt:  ----------------------------------------IGLVCYLPIFQLLKMTD---------PFSFPTLQIFSTFHSYYSDAWGPSRTATSSGKHW

Query:  FVTFIDDHTRLTWFY----------------TTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLS
        +V F+D  TR TW Y                  +E +F  +I    SDNG EF+   L E+ S  GI H +S  +TP+ NG+ ERK+RH+++   +L+  
Subjt:  FVTFIDDHTRLTWFY----------------TTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLS

Query:  TSLLSYLWEDVVLTAAHLINRMPSRVLNFQTPL--------DYLKL------SYPTIALYLM-----SLSKCVFVGYPLHQCDY
         S+    W      A +LINR+P+ +L  ++P         +Y KL       YP +  Y          +CVF+GY L Q  Y
Subjt:  TSLLSYLWEDVVLTAAHLINRMPSRVLNFQTPL--------DYLKL------SYPTIALYLM-----SLSKCVFVGYPLHQCDY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.8e-2122.24Show/hide
Query:  GYLTGEIPKP--RIG-------DPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFN
        G+L G  P P   IG       +P    W+ +D L+ S ++ ++   +   +    TA  IW+ ++++Y+   N S  +  +          +   + F+
Subjt:  GYLTGEIPKP--RIG-------DPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFN

Query:  KLSLIWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNS--KFDAMWSRILGRRPILTVMEACSEVRLEEDKMNAMNITASSVID----------
        +L+L+ + MD       D     V     ++   V D +   ++      +  R++ R   L  + +   V +  + +   N   +   +          
Subjt:  KLSLIWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNS--KFDAMWSRILGRRPILTVMEACSEVRLEEDKMNAMNITASSVID----------

Query:  ----SAAFSANSSGTTGDKKNRKP-------------PPENCPVNIGTSSLGAMAQS-------------GISPP-------LSLLTTDHLTGTSDNFLS
            S ++  +SSG+  D +  KP               + CP      S     QS              ++ P       L    T H+T   +N   
Subjt:  ----SAAFSANSSGTTGDKKNRKP-------------PPENCPVNIGTSSLGAMAQS-------------GISPP-------LSLLTTDHLTGTSDNFLS

Query:  YHPCADNEKIRIADGT---LAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA------------------------------------LPSTIGD
        + P    + + IADG+   + H       +    L L+ VL+VP I  NL+SV RL                                      P     
Subjt:  YHPCADNEKIRIADGT---LAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA------------------------------------LPSTIGD

Query:  SISLIV-------------------ILLLDKTI---GLVCYLPIFQLLKMTD---------PFSFPTLQIFSTFHSYYSDAWGPSRTATSSGKHWFVTFI
        ++S+                     + +L+  I    L    P  +LL  +D         PFS  T+         YSD W  S   +     ++V F+
Subjt:  SISLIV-------------------ILLLDKTI---GLVCYLPIFQLLKMTD---------PFSFPTLQIFSTFHSYYSDAWGPSRTATSSGKHWFVTFI

Query:  DDHTRLTWFY----------------TTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLS
        D  TR TW Y                + +E +F  +I  L SDNG EF+   LR++LS  GI H +S  +TP+ NG+ ERK+RH++++  +L+   S+  
Subjt:  DDHTRLTWFY----------------TTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLS

Query:  YLWEDVVLTAAHLINRMPSRVLNFQTPL--------DYLKLS------YPTIALY----LMSLSK-CVFVGYPLHQCDY
          W      A +LINR+P+ +L  Q+P         +Y KL       YP +  Y    L   SK C F+GY L Q  Y
Subjt:  YLWEDVVLTAAHLINRMPSRVLNFQTPL--------DYLKLS------YPTIALY----LMSLSK-CVFVGYPLHQCDY

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTCCGTAACCGCCGTTGGGTCTGTCGTCCACGAAGTTGTCGACAGGTATGTTCGGTCACTCAACCCACGCGCAGCCGCCATCTTCTTCCAAAATTTTGGA
CCGACAGCACGATCTCCAGCCACCTCAACAGCCCAGGTTGCTTTTTGCACCCAGACCACCGTGCCTCCATTAGGTTTTCACGCCACCGCCACCCAAACCTCTCAC
CGTGGGCCGACTGTTCTTCCCGTCACTGGATCTTCTTCGTGGGTGAGTTTCGGGAACCAGTCGCGGTCAGATCTGGACACCGGCGGGGCTCATTACATCCTTGAA
GTTGTTGTTGCTCTTCACGCCACCAACGACGGTGCTTATTCCAGACGACCGACCACCACTGCCGCTGCCAAATCCGAAGGTCTGGTTTCAAATCCGACAGCTGTT
GCTACTGCTGGAACTAGTGGGTGTTCTTCAACCTCGGTCCAACAACAACTTACCAATCTTCAAGCAAGTTTTCAACAACAAATCGCCGCTCTTGCGTGTAGGAAG
TTCAAGTATTCCGTCAGACTTACTACTGGATCCAGAGATCCGGTAACTACTTTCCCTACCATATCGACTGTACCGTATTTGCCTGGACCAATGGGGCGTCATAAG
TTCGGATATTTGACCGGTGAAATACCTAAACCCAGAATCGGAGATCCTCAAGAACACCTCTGGAAGGGAGAAGACTCATTACTCAGATCTCTGTTAATTCATAGT
ATGGAACCCCAAATCGGAAAGCCCCTGTTGTATGTGGCGACCGCTCAGGATATTTGGGATGTTGTTCAACAACTTTATTCCAAGAGACAGAATGAATCACGGCTC
TACACTCTGCGAAAACAAATCCACGAATGCAAACAGGGATCTATGGATATCACGTCATATTTTAATAAATTATCTCTTATTTGGCAGGAGATGGACATATGTTGT
GAACTTGTCTGGGACTGCCCTTGTGGAGGAGTCCAATATTATAAACTTGAAGAGATTGACCGTGTATATGATCTTTTAGTAGGCCTCAATTCCAAGTTTGATGCT
ATGTGGAGCCGAATATTGGGACGAAGACCAATACTGACCGTAATGGAAGCTTGTTCAGAAGTACGTCTGGAGGAAGATAAAATGAATGCCATGAATATCACGGCT
TCCTCCGTAATTGATTCCGCCGCCTTCAGTGCTAACTCATCTGGTACTACTGGAGACAAGAAGAACAGGAAACCACCTCCAGAGAACTGCCCAGTCAATATCGGT
ACTTCCTCTTTAGGGGCGATGGCGCAATCAGGTATTTCTCCACCCTTGAGTCTCCTCACTACAGACCATCTAACTGGAACTTCTGATAACTTCCTCTCTTATCAT
CCATGTGCCGATAATGAAAAGATTCGAATTGCTGACGGGACACTTGCCCATGTTGCCGGCAAGGGTCACATTTCTCCATTCAATGGTTTGGTATTACATAATGTG
TTGCATGTTCCTAAAATCTCCTATAATTTATTGTCTGTGAGACGATTGGCACTGCCTAGCACAATAGGAGACTCTATTTCCTTAATAGTGATACTTCTTCTAGAC
AAGACTATAGGGCTAGTTTGTTATCTTCCTATTTTTCAACTTCTGAAAATGACTGATCCCTTTTCCTTCCCAACCTTACAAATCTTCTCAACCTTTCACTCTTAT
TATAGTGATGCCTGGGGTCCCTCACGAACTGCCACTTCTTCGGGTAAACATTGGTTTGTTACCTTCATTGATGACCACACTCGCCTTACTTGGTTTTACACCACC
ATTGAAACTCAGTTTAATGCCAAAATTGCAATCCTTCGGAGTGACAATGGTCGTGAATTCCTCACTAATACTCTTCGTGAGTTCTTATCCATTAAAGGTATCGTT
CATCAGAGTTCGTGTGCCTATACACCCCAACAAAATGGAGTGGTTGAAAGAAAAAACCGTCATCTCCTAAAAGTTGCTCGGTCTCTCATGCTATCAACCTCTCTT
CTGTCTTACCTATGGGAGGATGTAGTCTTGACTGCCGCTCATCTTATAAATCGGATGCCTTCTCGGGTTCTTAACTTCCAAACTCCTCTTGACTACCTCAAACTG
TCTTACCCTACCATCGCCTTATACTTGATGTCCCTCTCCAAATGTGTCTTCGTTGGGTATCCTCTCCACCAATGCGATTATAATAATATTGTTTGTGCTGAAAAT
GTTTGTGTTGAAAATGATATTGTTGATCTGACAAAACTGTCGGTAAAAAATGATAGAGATGATATGACAGAAAATAACCAGGTTGTTAAGAGTGATGTAGTGTCC
ACAACAGTTCAGGAGAATGAAAATGAGGTCATACCTCAGAATTCCACAACTGAAGAAGAACTCGACAAACTAGAGAAGTATGATGCCACCCTTGACATGCCCATT
GCTCTAAGAAAGGGCACGAGGTCCTGCACCAGGTATCCCAAGAATAGCTTCCTCTCGTATAACAATCTATCTTCTAAGTTCAGAGCGTTTACTGCTAGCCTTGAC
ACTGTAACTATACCGAAGAACATATATGTGGCTATGGAAATTCCTGAGTGGAAGGCTGCTGTTTTTGAAGAAATGAGAGCTTTTGAGAAAAATCATACATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTCCGTAACCGCCGTTGGGTCTGTCGTCCACGAAGTTGTCGACAGGTATGTTCGGTCACTCAACCCACGCGCAGCCGCCATCTTCTTCCAAAATTTTGGA
CCGACAGCACGATCTCCAGCCACCTCAACAGCCCAGGTTGCTTTTTGCACCCAGACCACCGTGCCTCCATTAGGTTTTCACGCCACCGCCACCCAAACCTCTCAC
CGTGGGCCGACTGTTCTTCCCGTCACTGGATCTTCTTCGTGGGTGAGTTTCGGGAACCAGTCGCGGTCAGATCTGGACACCGGCGGGGCTCATTACATCCTTGAA
GTTGTTGTTGCTCTTCACGCCACCAACGACGGTGCTTATTCCAGACGACCGACCACCACTGCCGCTGCCAAATCCGAAGGTCTGGTTTCAAATCCGACAGCTGTT
GCTACTGCTGGAACTAGTGGGTGTTCTTCAACCTCGGTCCAACAACAACTTACCAATCTTCAAGCAAGTTTTCAACAACAAATCGCCGCTCTTGCGTGTAGGAAG
TTCAAGTATTCCGTCAGACTTACTACTGGATCCAGAGATCCGGTAACTACTTTCCCTACCATATCGACTGTACCGTATTTGCCTGGACCAATGGGGCGTCATAAG
TTCGGATATTTGACCGGTGAAATACCTAAACCCAGAATCGGAGATCCTCAAGAACACCTCTGGAAGGGAGAAGACTCATTACTCAGATCTCTGTTAATTCATAGT
ATGGAACCCCAAATCGGAAAGCCCCTGTTGTATGTGGCGACCGCTCAGGATATTTGGGATGTTGTTCAACAACTTTATTCCAAGAGACAGAATGAATCACGGCTC
TACACTCTGCGAAAACAAATCCACGAATGCAAACAGGGATCTATGGATATCACGTCATATTTTAATAAATTATCTCTTATTTGGCAGGAGATGGACATATGTTGT
GAACTTGTCTGGGACTGCCCTTGTGGAGGAGTCCAATATTATAAACTTGAAGAGATTGACCGTGTATATGATCTTTTAGTAGGCCTCAATTCCAAGTTTGATGCT
ATGTGGAGCCGAATATTGGGACGAAGACCAATACTGACCGTAATGGAAGCTTGTTCAGAAGTACGTCTGGAGGAAGATAAAATGAATGCCATGAATATCACGGCT
TCCTCCGTAATTGATTCCGCCGCCTTCAGTGCTAACTCATCTGGTACTACTGGAGACAAGAAGAACAGGAAACCACCTCCAGAGAACTGCCCAGTCAATATCGGT
ACTTCCTCTTTAGGGGCGATGGCGCAATCAGGTATTTCTCCACCCTTGAGTCTCCTCACTACAGACCATCTAACTGGAACTTCTGATAACTTCCTCTCTTATCAT
CCATGTGCCGATAATGAAAAGATTCGAATTGCTGACGGGACACTTGCCCATGTTGCCGGCAAGGGTCACATTTCTCCATTCAATGGTTTGGTATTACATAATGTG
TTGCATGTTCCTAAAATCTCCTATAATTTATTGTCTGTGAGACGATTGGCACTGCCTAGCACAATAGGAGACTCTATTTCCTTAATAGTGATACTTCTTCTAGAC
AAGACTATAGGGCTAGTTTGTTATCTTCCTATTTTTCAACTTCTGAAAATGACTGATCCCTTTTCCTTCCCAACCTTACAAATCTTCTCAACCTTTCACTCTTAT
TATAGTGATGCCTGGGGTCCCTCACGAACTGCCACTTCTTCGGGTAAACATTGGTTTGTTACCTTCATTGATGACCACACTCGCCTTACTTGGTTTTACACCACC
ATTGAAACTCAGTTTAATGCCAAAATTGCAATCCTTCGGAGTGACAATGGTCGTGAATTCCTCACTAATACTCTTCGTGAGTTCTTATCCATTAAAGGTATCGTT
CATCAGAGTTCGTGTGCCTATACACCCCAACAAAATGGAGTGGTTGAAAGAAAAAACCGTCATCTCCTAAAAGTTGCTCGGTCTCTCATGCTATCAACCTCTCTT
CTGTCTTACCTATGGGAGGATGTAGTCTTGACTGCCGCTCATCTTATAAATCGGATGCCTTCTCGGGTTCTTAACTTCCAAACTCCTCTTGACTACCTCAAACTG
TCTTACCCTACCATCGCCTTATACTTGATGTCCCTCTCCAAATGTGTCTTCGTTGGGTATCCTCTCCACCAATGCGATTATAATAATATTGTTTGTGCTGAAAAT
GTTTGTGTTGAAAATGATATTGTTGATCTGACAAAACTGTCGGTAAAAAATGATAGAGATGATATGACAGAAAATAACCAGGTTGTTAAGAGTGATGTAGTGTCC
ACAACAGTTCAGGAGAATGAAAATGAGGTCATACCTCAGAATTCCACAACTGAAGAAGAACTCGACAAACTAGAGAAGTATGATGCCACCCTTGACATGCCCATT
GCTCTAAGAAAGGGCACGAGGTCCTGCACCAGGTATCCCAAGAATAGCTTCCTCTCGTATAACAATCTATCTTCTAAGTTCAGAGCGTTTACTGCTAGCCTTGAC
ACTGTAACTATACCGAAGAACATATATGTGGCTATGGAAATTCCTGAGTGGAAGGCTGCTGTTTTTGAAGAAATGAGAGCTTTTGAGAAAAATCATACATGA
Protein sequenceShow/hide protein sequence
MESVTAVGSVVHEVVDRYVRSLNPRAAAIFFQNFGPTARSPATSTAQVAFCTQTTVPPLGFHATATQTSHRGPTVLPVTGSSSWVSFGNQSRSDLDTGGAHYILE
VVVALHATNDGAYSRRPTTTAAAKSEGLVSNPTAVATAGTSGCSSTSVQQQLTNLQASFQQQIAALACRKFKYSVRLTTGSRDPVTTFPTISTVPYLPGPMGRHK
FGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSLIWQEMDICC
ELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGRRPILTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKKNRKPPPENCPVNIG
TSSLGAMAQSGISPPLSLLTTDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLALPSTIGDSISLIVILLLD
KTIGLVCYLPIFQLLKMTDPFSFPTLQIFSTFHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTWFYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIV
HQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHLINRMPSRVLNFQTPLDYLKLSYPTIALYLMSLSKCVFVGYPLHQCDYNNIVCAEN
VCVENDIVDLTKLSVKNDRDDMTENNQVVKSDVVSTTVQENENEVIPQNSTTEEELDKLEKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLD
TVTIPKNIYVAMEIPEWKAAVFEEMRAFEKNHT