; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G02125 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G02125
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionBeta-galactosidase
Genome locationClcChr04:6827345..6832853
RNA-Seq ExpressionClc04G02125
SyntenyClc04G02125
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031941.1 Beta-galactosidase [Cucumis melo var. makuwa]1.0e-18345.26Show/hide
Query:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL
        GR++FG+LTGE  +P  GD  E LWKGEDSL+RS+LI+SMEPQIGKPLLY ATA+D+WD  Q LYSKRQN SRLYTLRKQ+H CKQG++D+T+YFNKLSL
Subjt:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL

Query:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGQRPIPTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK
        +WQEMD+C E VWD P    QY KLEE DRVYD L GLN KFD +  RILGQRP+P++ME C EVRLEED+ NAM +  +  IDSAAFSA SS    DK 
Subjt:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGQRPIPTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK

Query:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------
        N K  P                                          E  P +             T +LGA+AQSG+   L L++             
Subjt:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------

Query:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV
        TDHLTG+S++F+SY PCA NEKIRIADG+LA +AGKG I PF+G  L NVLHVPK+SYNLLS+ ++          LP            TIG +     
Subjt:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV

Query:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----
        + +LD      ++  V  L  +      D                   SFP+     T  F+  +SD WGPS+  TSSGK WFVTFIDDHTRLTW     
Subjt:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----

Query:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL
                   FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNGV ERKNRHL++VARSLMLSTSL SYLW D +LTAAHL
Subjt:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL

Query:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI
        INRMPSR+L+ QTPLD LK SYP+  L                        +      CVFVGYPLHQ  Y                           + 
Subjt:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI

Query:  VCAENVCVEN----DIVDLTKLSVKN-------------------------------------------DRDDMTE-------NNQVVKSDVVSTTVQEN
        +  ENV  E+    + V+ T ++V +                                            RD   E       NN + ++D     V EN
Subjt:  VCAENVCVEN----DIVDLTKLSVKN-------------------------------------------DRDDMTE-------NNQVVKSDVVSTTVQEN

Query:  --------ENEVIPQNSTTEEE---LDKPEKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVLEEM
                E EV  + S  E E     K ++YD +LD+PIALRKGTRSCT++P  +++SY+NLS +FRAFTA+LD+  IPKNIY A+E PEWK AV+EEM
Subjt:  --------ENEVIPQNSTTEEE---LDKPEKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVLEEM

Query:  RAFEKNHT
        +A EKN T
Subjt:  RAFEKNHT

KAA0061447.1 Beta-galactosidase [Cucumis melo var. makuwa]6.4e-18646.42Show/hide
Query:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL
        GR++FG+LTGE  +P  GD  E LWKGEDSL+RS+LI+SMEPQIGKPLLY ATA+D+WD  Q LYSKRQN SRLYTLRKQ+H CKQG++D+T+YFNKLSL
Subjt:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL

Query:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGQRPIPTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK
        +WQEMD+C E VWD P    QY KLEE DRVYD L GLN KFD +  RILGQRP+P++ME C EVRLEED+ NAM +  +  IDSAAFSA SS    DK 
Subjt:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGQRPIPTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK

Query:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------
        N K  P                                          E  P +             T +LGA+AQSG+   L L++             
Subjt:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------

Query:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV
        TDHLTG+S++F+SY PCA NEKIRIADG+LA +AGKG I PF+G  L NVLHVPK+SYNLLS+ ++          LP            TIG +     
Subjt:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV

Query:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----
        + +LD      ++  V  L  +      D                   SFP+     T  F+  +SD WGPS+  TSSGK WFVTFIDDHTRLTW     
Subjt:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----

Query:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL
                   FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNGV ERKNRHL++VARSLMLSTSL SYLW D +LTAAHL
Subjt:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL

Query:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI
        INRMPSR+L+ QTPLD LK SYP+  L                        +      CVFVGYPLHQ  Y                           + 
Subjt:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI

Query:  VCAENVCVEN----DIVDLTKLSVK--NDRDDMTENNQV---------VKSDVVSTTVQ------------------ENENEVIPQNSTTEEE---LDKP
        +  ENV  E+    + ++ T ++V   +    +   NQV         ++ +V S T Q                  ++E EV  + S  E E     K 
Subjt:  VCAENVCVEN----DIVDLTKLSVK--NDRDDMTENNQV---------VKSDVVSTTVQ------------------ENENEVIPQNSTTEEE---LDKP

Query:  EKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVLEEMRAFEKNHT
        ++YD +LD+PIALRKGTRSCT++P  +++SY+NLS +FRAFTA+LD+  IPKNIY A+E PEWK AV+EEM+A EKN T
Subjt:  EKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVLEEMRAFEKNHT

TYK13763.1 Beta-galactosidase [Cucumis melo var. makuwa]1.0e-18346.3Show/hide
Query:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL
        GRH+FG+LTGE  +P  GD  E LWKGEDS +RS+LI+SMEPQIGKPLLY ATA+D+WD  Q LYSKRQN SRLYTLRKQ+H CKQG++D+T+YFNKLSL
Subjt:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL

Query:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGQRPIPTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK
        +WQEMD+C E VWD P    QY KLEE DRVYD L GLN KFD +  RILGQR +P++ME C EVRLEED+ NAM +  +  IDSAAFSA SS    DK 
Subjt:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGQRPIPTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK

Query:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------
        N K  P                                          E  P +             T +LGA+AQSG+   L L++             
Subjt:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------

Query:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV
        TDHLTG+S++F+SY PCA NEKIRIADG+LA +AGKG I PF+G  L NVLHVPK+SYNLLS+ ++          LP            TIG +     
Subjt:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV

Query:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----
        + +LD      ++  V  L  +      D                   SFP+     T  F+  +SD WGPS+  TSSGK WFVTFIDDHTRLTW     
Subjt:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----

Query:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL
                   FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNGV ERKNRHL++VARSLMLSTSL SYLW D +LTAAHL
Subjt:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL

Query:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYNNIVCAENVCVEN----DIVDLTKLSVKN
        INRMPSR+L+ QTPLD LK SYP+  L                        +      CVFVGYPLH          ENV  E+    + V+ T ++V +
Subjt:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYNNIVCAENVCVEN----DIVDLTKLSVKN

Query:  -------------------------------------------DRDDMTE----NNQVVKSDVVSTTVQEN--------ENEVIPQNSTTEEE---LDKP
                                                    RD   E    NN + ++D     V EN        E EV  + S  E E     K 
Subjt:  -------------------------------------------DRDDMTE----NNQVVKSDVVSTTVQEN--------ENEVIPQNSTTEEE---LDKP

Query:  EKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVLEEMRAFEKNHT
        ++YD +LD+PIALRKGTRSCT++P  +++SY+NLS +FRAFTASLD+  I KNIY A+E PEW  AV+EEM+A EKN T
Subjt:  EKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVLEEMRAFEKNHT

TYK31050.1 Beta-galactosidase [Cucumis melo var. makuwa]3.0e-18345.15Show/hide
Query:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL
        GR++FG+LTGEI +P  GD  E LWKGEDSL+RS+LI+SMEPQIGKPLLY  TA+D+WD  Q LYSKRQN SRLYTLRKQ+H CKQG++D+T+YFNKLSL
Subjt:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL

Query:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGQRPIPTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK
        +WQEMD+C E VWD P    QY KLEE DRVYD L GLN KFD +  RILGQRP+P++ME C EVRLEED+ NAM +  +  IDSAAFSA SS    DK 
Subjt:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGQRPIPTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK

Query:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------
        N K  P                                          E  P +             T +LGA+AQSG+   L L++             
Subjt:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------

Query:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV
        TDHLTG+S++F+SY PCA NEKIRIADG+LA +AGKG I PF+G  L NVLHVPK+SYNLLS+ ++          LP            TIG +     
Subjt:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV

Query:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----
        + +LD      ++  V  L  +      D                   SFP+     T  F+  +SD WGPS+  TSSGK WFVTFIDDHTRLTW     
Subjt:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----

Query:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL
                   FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNGV ERKNRHL++VARSLMLSTSL SYLW D +LTAAHL
Subjt:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL

Query:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI
        INRMPSR+L+ QTPLD LK SYP+  L                        +      CVFVGYPLHQ  Y                           + 
Subjt:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI

Query:  VCAENVCVEN----DIVDLTKLSVKN-------------------------------------------DRDDMTE-------NNQVVKSDVVSTTVQEN
        +  ENV  E+    + V+ T ++V +                                            RD   E       NN + ++D       EN
Subjt:  VCAENVCVEN----DIVDLTKLSVKN-------------------------------------------DRDDMTE-------NNQVVKSDVVSTTVQEN

Query:  --------ENEVIPQNSTTEEE---LDKPEKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVLEEM
                E EV  + S  E E     K ++YD +LD+PIALRKGTRSCT++P  +++SY+NLS +FRAFTA+LD+  IPKNIY A+E PEWK AV+EEM
Subjt:  --------ENEVIPQNSTTEEE---LDKPEKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVLEEM

Query:  RAFEKNHT
        +A EKN T
Subjt:  RAFEKNHT

TYK31717.1 Beta-galactosidase [Cucumis melo var. makuwa]3.0e-18345.15Show/hide
Query:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL
        GR++FG+LTGEI +P  GD  E LWKGEDSL+RS+LI+SMEPQIGKPLLY  TA+D+WD  Q LYSKRQN SRLYTLRKQ+H CKQG++D+T+YFNKLSL
Subjt:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL

Query:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGQRPIPTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK
        +WQEMD+C E VWD P    QY KLEE DRVYD L GLN KFD +  RILGQRP+P++ME C EVRLEED+ NAM +  +  IDSAAFSA SS    DK 
Subjt:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGQRPIPTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK

Query:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------
        N K  P                                          E  P +             T +LGA+AQSG+   L L++             
Subjt:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------

Query:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV
        TDHLTG+S++F+SY PCA NEKIRIADG+LA +AGKG I PF+G  L NVLHVPK+SYNLLS+ ++          LP            TIG +     
Subjt:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV

Query:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----
        + +LD      ++  V  L  +      D                   SFP+     T  F+  +SD WGPS+  TSSGK WFVTFIDDHTRLTW     
Subjt:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----

Query:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL
                   FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNGV ERKNRHL++VARSLMLSTSL SYLW D +LTAAHL
Subjt:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL

Query:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI
        INRMPSR+L+ QTPLD LK SYP+  L                        +      CVFVGYPLHQ  Y                           + 
Subjt:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI

Query:  VCAENVCVEN----DIVDLTKLSVKN-------------------------------------------DRDDMTE-------NNQVVKSDVVSTTVQEN
        +  ENV  E+    + V+ T ++V +                                            RD   E       NN + ++D       EN
Subjt:  VCAENVCVEN----DIVDLTKLSVKN-------------------------------------------DRDDMTE-------NNQVVKSDVVSTTVQEN

Query:  --------ENEVIPQNSTTEEE---LDKPEKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVLEEM
                E EV  + S  E E     K ++YD +LD+PIALRKGTRSCT++P  +++SY+NLS +FRAFTA+LD+  IPKNIY A+E PEWK AV+EEM
Subjt:  --------ENEVIPQNSTTEEE---LDKPEKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVLEEM

Query:  RAFEKNHT
        +A EKN T
Subjt:  RAFEKNHT

TrEMBL top hitse value%identityAlignment
A0A5A7SQW1 Beta-galactosidase5.0e-18445.26Show/hide
Query:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL
        GR++FG+LTGE  +P  GD  E LWKGEDSL+RS+LI+SMEPQIGKPLLY ATA+D+WD  Q LYSKRQN SRLYTLRKQ+H CKQG++D+T+YFNKLSL
Subjt:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL

Query:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGQRPIPTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK
        +WQEMD+C E VWD P    QY KLEE DRVYD L GLN KFD +  RILGQRP+P++ME C EVRLEED+ NAM +  +  IDSAAFSA SS    DK 
Subjt:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGQRPIPTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK

Query:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------
        N K  P                                          E  P +             T +LGA+AQSG+   L L++             
Subjt:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------

Query:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV
        TDHLTG+S++F+SY PCA NEKIRIADG+LA +AGKG I PF+G  L NVLHVPK+SYNLLS+ ++          LP            TIG +     
Subjt:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV

Query:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----
        + +LD      ++  V  L  +      D                   SFP+     T  F+  +SD WGPS+  TSSGK WFVTFIDDHTRLTW     
Subjt:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----

Query:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL
                   FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNGV ERKNRHL++VARSLMLSTSL SYLW D +LTAAHL
Subjt:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL

Query:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI
        INRMPSR+L+ QTPLD LK SYP+  L                        +      CVFVGYPLHQ  Y                           + 
Subjt:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI

Query:  VCAENVCVEN----DIVDLTKLSVKN-------------------------------------------DRDDMTE-------NNQVVKSDVVSTTVQEN
        +  ENV  E+    + V+ T ++V +                                            RD   E       NN + ++D     V EN
Subjt:  VCAENVCVEN----DIVDLTKLSVKN-------------------------------------------DRDDMTE-------NNQVVKSDVVSTTVQEN

Query:  --------ENEVIPQNSTTEEE---LDKPEKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVLEEM
                E EV  + S  E E     K ++YD +LD+PIALRKGTRSCT++P  +++SY+NLS +FRAFTA+LD+  IPKNIY A+E PEWK AV+EEM
Subjt:  --------ENEVIPQNSTTEEE---LDKPEKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVLEEM

Query:  RAFEKNHT
        +A EKN T
Subjt:  RAFEKNHT

A0A5A7V3J5 Beta-galactosidase3.1e-18646.42Show/hide
Query:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL
        GR++FG+LTGE  +P  GD  E LWKGEDSL+RS+LI+SMEPQIGKPLLY ATA+D+WD  Q LYSKRQN SRLYTLRKQ+H CKQG++D+T+YFNKLSL
Subjt:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL

Query:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGQRPIPTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK
        +WQEMD+C E VWD P    QY KLEE DRVYD L GLN KFD +  RILGQRP+P++ME C EVRLEED+ NAM +  +  IDSAAFSA SS    DK 
Subjt:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGQRPIPTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK

Query:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------
        N K  P                                          E  P +             T +LGA+AQSG+   L L++             
Subjt:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------

Query:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV
        TDHLTG+S++F+SY PCA NEKIRIADG+LA +AGKG I PF+G  L NVLHVPK+SYNLLS+ ++          LP            TIG +     
Subjt:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV

Query:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----
        + +LD      ++  V  L  +      D                   SFP+     T  F+  +SD WGPS+  TSSGK WFVTFIDDHTRLTW     
Subjt:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----

Query:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL
                   FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNGV ERKNRHL++VARSLMLSTSL SYLW D +LTAAHL
Subjt:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL

Query:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI
        INRMPSR+L+ QTPLD LK SYP+  L                        +      CVFVGYPLHQ  Y                           + 
Subjt:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI

Query:  VCAENVCVEN----DIVDLTKLSVK--NDRDDMTENNQV---------VKSDVVSTTVQ------------------ENENEVIPQNSTTEEE---LDKP
        +  ENV  E+    + ++ T ++V   +    +   NQV         ++ +V S T Q                  ++E EV  + S  E E     K 
Subjt:  VCAENVCVEN----DIVDLTKLSVK--NDRDDMTENNQV---------VKSDVVSTTVQ------------------ENENEVIPQNSTTEEE---LDKP

Query:  EKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVLEEMRAFEKNHT
        ++YD +LD+PIALRKGTRSCT++P  +++SY+NLS +FRAFTA+LD+  IPKNIY A+E PEWK AV+EEM+A EKN T
Subjt:  EKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVLEEMRAFEKNHT

A0A5D3CPE0 Beta-galactosidase5.0e-18446.3Show/hide
Query:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL
        GRH+FG+LTGE  +P  GD  E LWKGEDS +RS+LI+SMEPQIGKPLLY ATA+D+WD  Q LYSKRQN SRLYTLRKQ+H CKQG++D+T+YFNKLSL
Subjt:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL

Query:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGQRPIPTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK
        +WQEMD+C E VWD P    QY KLEE DRVYD L GLN KFD +  RILGQR +P++ME C EVRLEED+ NAM +  +  IDSAAFSA SS    DK 
Subjt:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGQRPIPTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK

Query:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------
        N K  P                                          E  P +             T +LGA+AQSG+   L L++             
Subjt:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------

Query:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV
        TDHLTG+S++F+SY PCA NEKIRIADG+LA +AGKG I PF+G  L NVLHVPK+SYNLLS+ ++          LP            TIG +     
Subjt:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV

Query:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----
        + +LD      ++  V  L  +      D                   SFP+     T  F+  +SD WGPS+  TSSGK WFVTFIDDHTRLTW     
Subjt:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----

Query:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL
                   FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNGV ERKNRHL++VARSLMLSTSL SYLW D +LTAAHL
Subjt:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL

Query:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYNNIVCAENVCVEN----DIVDLTKLSVKN
        INRMPSR+L+ QTPLD LK SYP+  L                        +      CVFVGYPLH          ENV  E+    + V+ T ++V +
Subjt:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYNNIVCAENVCVEN----DIVDLTKLSVKN

Query:  -------------------------------------------DRDDMTE----NNQVVKSDVVSTTVQEN--------ENEVIPQNSTTEEE---LDKP
                                                    RD   E    NN + ++D     V EN        E EV  + S  E E     K 
Subjt:  -------------------------------------------DRDDMTE----NNQVVKSDVVSTTVQEN--------ENEVIPQNSTTEEE---LDKP

Query:  EKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVLEEMRAFEKNHT
        ++YD +LD+PIALRKGTRSCT++P  +++SY+NLS +FRAFTASLD+  I KNIY A+E PEW  AV+EEM+A EKN T
Subjt:  EKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVLEEMRAFEKNHT

A0A5D3E603 Beta-galactosidase1.4e-18345.15Show/hide
Query:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL
        GR++FG+LTGEI +P  GD  E LWKGEDSL+RS+LI+SMEPQIGKPLLY  TA+D+WD  Q LYSKRQN SRLYTLRKQ+H CKQG++D+T+YFNKLSL
Subjt:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL

Query:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGQRPIPTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK
        +WQEMD+C E VWD P    QY KLEE DRVYD L GLN KFD +  RILGQRP+P++ME C EVRLEED+ NAM +  +  IDSAAFSA SS    DK 
Subjt:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGQRPIPTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK

Query:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------
        N K  P                                          E  P +             T +LGA+AQSG+   L L++             
Subjt:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------

Query:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV
        TDHLTG+S++F+SY PCA NEKIRIADG+LA +AGKG I PF+G  L NVLHVPK+SYNLLS+ ++          LP            TIG +     
Subjt:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV

Query:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----
        + +LD      ++  V  L  +      D                   SFP+     T  F+  +SD WGPS+  TSSGK WFVTFIDDHTRLTW     
Subjt:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----

Query:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL
                   FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNGV ERKNRHL++VARSLMLSTSL SYLW D +LTAAHL
Subjt:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL

Query:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI
        INRMPSR+L+ QTPLD LK SYP+  L                        +      CVFVGYPLHQ  Y                           + 
Subjt:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI

Query:  VCAENVCVEN----DIVDLTKLSVKN-------------------------------------------DRDDMTE-------NNQVVKSDVVSTTVQEN
        +  ENV  E+    + V+ T ++V +                                            RD   E       NN + ++D       EN
Subjt:  VCAENVCVEN----DIVDLTKLSVKN-------------------------------------------DRDDMTE-------NNQVVKSDVVSTTVQEN

Query:  --------ENEVIPQNSTTEEE---LDKPEKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVLEEM
                E EV  + S  E E     K ++YD +LD+PIALRKGTRSCT++P  +++SY+NLS +FRAFTA+LD+  IPKNIY A+E PEWK AV+EEM
Subjt:  --------ENEVIPQNSTTEEE---LDKPEKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVLEEM

Query:  RAFEKNHT
        +A EKN T
Subjt:  RAFEKNHT

A0A5D3E6F8 Beta-galactosidase1.4e-18345.15Show/hide
Query:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL
        GR++FG+LTGEI +P  GD  E LWKGEDSL+RS+LI+SMEPQIGKPLLY  TA+D+WD  Q LYSKRQN SRLYTLRKQ+H CKQG++D+T+YFNKLSL
Subjt:  GRHKFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSL

Query:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGQRPIPTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK
        +WQEMD+C E VWD P    QY KLEE DRVYD L GLN KFD +  RILGQRP+P++ME C EVRLEED+ NAM +  +  IDSAAFSA SS    DK 
Subjt:  IWQEMDICCELVWDCPCGGVQYYKLEEIDRVYDLLVGLNSKFDAMWSRILGQRPIPTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKK

Query:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------
        N K  P                                          E  P +             T +LGA+AQSG+   L L++             
Subjt:  NRKPPP------------------------------------------ENCPVNIG-----------TSSLGAMAQSGISPPLSLLT-------------

Query:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV
        TDHLTG+S++F+SY PCA NEKIRIADG+LA +AGKG I PF+G  L NVLHVPK+SYNLLS+ ++          LP            TIG +     
Subjt:  TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA---------LPS-----------TIGDSISLIV

Query:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----
        + +LD      ++  V  L  +      D                   SFP+     T  F+  +SD WGPS+  TSSGK WFVTFIDDHTRLTW     
Subjt:  ILLLD-----KTIGLVCYLPIFQLLKMTD-----------------PFSFPTLQIFST--FHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTW-----

Query:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL
                   FY TI+TQF+ KIAILRSDNGREF  + L EFL+ KGIVHQ+SCAYTPQQNGV ERKNRHL++VARSLMLSTSL SYLW D +LTAAHL
Subjt:  -----------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHL

Query:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI
        INRMPSR+L+ QTPLD LK SYP+  L                        +      CVFVGYPLHQ  Y                           + 
Subjt:  INRMPSRVLNFQTPLDYLKLSYPTIAL------------------------YLMSLSKCVFVGYPLHQCDYN--------------------------NI

Query:  VCAENVCVEN----DIVDLTKLSVKN-------------------------------------------DRDDMTE-------NNQVVKSDVVSTTVQEN
        +  ENV  E+    + V+ T ++V +                                            RD   E       NN + ++D       EN
Subjt:  VCAENVCVEN----DIVDLTKLSVKN-------------------------------------------DRDDMTE-------NNQVVKSDVVSTTVQEN

Query:  --------ENEVIPQNSTTEEE---LDKPEKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVLEEM
                E EV  + S  E E     K ++YD +LD+PIALRKGTRSCT++P  +++SY+NLS +FRAFTA+LD+  IPKNIY A+E PEWK AV+EEM
Subjt:  --------ENEVIPQNSTTEEE---LDKPEKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVLEEM

Query:  RAFEKNHT
        +A EKN T
Subjt:  RAFEKNHT

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.7e-1729.77Show/hide
Query:  YSDAWGPSRTATSSGKHWFVTFIDDHTR--LTW--------------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGV
        +SD  GP    T   K++FV F+D  T   +T+              F    E  FN K+  L  DNGRE+L+N +R+F   KGI +  +  +TPQ NGV
Subjt:  YSDAWGPSRTATSSGKHWFVTFIDDHTR--LTW--------------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGV

Query:  VERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHLINRMPSRVL--NFQTP----------LDYLKLSYPTIALYLMSLS--------KCVFVGYPLHQ
         ER  R + + AR+++    L    W + VLTA +LINR+PSR L  + +TP          L +L++   T+ +++ +          K +FVGY  + 
Subjt:  VERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHLINRMPSRVL--NFQTP----------LDYLKLSYPTIALYLMSLS--------KCVFVGYPLHQ

Query:  CDYNNIVCAENVCVENDIVDLTKLSVKNDRDDMTENNQVVKSDVV-STTVQENENEVIPQNS
            + V  + +   + +VD T +           N++ VK + V     +E+EN+  P +S
Subjt:  CDYNNIVCAENVCVENDIVDLTKLSVKNDRDDMTENNQVVKSDVV-STTVQENENEVIPQNS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.4e-2037.33Show/hide
Query:  YSDAWGPSRTATSSGKHWFVTFIDDHTRLTW----------------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGV
        YSD  GP    +  G  +FVTFIDD +R  W                F+  +E +   K+  LRSDNG E+ +    E+ S  GI H+ +   TPQ NGV
Subjt:  YSDAWGPSRTATSSGKHWFVTFIDDHTRLTW----------------FYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGV

Query:  VERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHLINRMPSRVLNFQTP
         ER NR +++  RS++    L    W + V TA +LINR PS  L F+ P
Subjt:  VERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHLINRMPSRVLNFQTP

Q12491 Transposon Ty2-B Gag-Pol polyprotein1.8e-1027.15Show/hide
Query:  FSTFHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTWFY------------------TTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSC
        +  F   ++D +GP      S   +F++F D+ TR  W Y                    I+ QFNA++ +++ D G E+   TL +F + +GI    + 
Subjt:  FSTFHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTWFY------------------TTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSC

Query:  AYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHLINRMPS
            + +GV ER NR LL   R+L+  + L ++LW   V  +  + N + S
Subjt:  AYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDVVLTAAHLINRMPS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.2e-1925.26Show/hide
Query:  TDHLTGTSDNFLSYHPCADNEKIRIADGT---LAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLALPSTIG----------DSISLIVILLLDKT
        T H+T   +N   + P    + + +ADG+   ++H       +    L LHN+L+VP I  NL+SV RL   + +             ++  V LL  KT
Subjt:  TDHLTGTSDNFLSYHPCADNEKIRIADGT---LAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLALPSTIG----------DSISLIVILLLDKT

Query:  ------------------------------------------------IGLVCYLPIFQLLKMTD---------PFSFPTLQIFSTFHSYYSDAWGPSRT
                                                          L    P  + L  +D         PFS  T+         YSD W  S  
Subjt:  ------------------------------------------------IGLVCYLPIFQLLKMTD---------PFSFPTLQIFSTFHSYYSDAWGPSRT

Query:  ATSSGKHWFVTFIDDHTRLTWFY----------------TTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLK
         +     ++V F+D  TR TW Y                  +E +F  +I    SDNG EF+   L E+ S  GI H +S  +TP+ NG+ ERK+RH+++
Subjt:  ATSSGKHWFVTFIDDHTRLTWFY----------------TTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLK

Query:  VARSLMLSTSLLSYLWEDVVLTAAHLINRMPSRVLNFQTPL--------DYLKL------SYPTIALYLM-----SLSKCVFVGYPLHQCDY
           +L+   S+    W      A +LINR+P+ +L  ++P         +Y KL       YP +  Y          +CVF+GY L Q  Y
Subjt:  VARSLMLSTSLLSYLWEDVVLTAAHLINRMPSRVLNFQTPL--------DYLKL------SYPTIALYLM-----SLSKCVFVGYPLHQCDY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.2e-2122.29Show/hide
Query:  GYLTGEIPKP--RIG-------DPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFN
        G+L G  P P   IG       +P    W+ +D L+ S ++ ++   +   +    TA  IW+ ++++Y+   N S  +  +          +   + F+
Subjt:  GYLTGEIPKP--RIG-------DPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFN

Query:  KLSLIWQEMD------ICCELVWDCPCGGVQYYKLEE----IDRVYDLLVGLNSKFDAMWSRILGQRPIPTVMEACSEVRLEEDKMNAMNITASSVIDSA
        +L+L+ + MD         E + D     +     ++    +  +++ L+   SK  A+ S  +       V    +     ++         ++   S 
Subjt:  KLSLIWQEMD------ICCELVWDCPCGGVQYYKLEE----IDRVYDLLVGLNSKFDAMWSRILGQRPIPTVMEACSEVRLEEDKMNAMNITASSVIDSA

Query:  AFSANSSGTTGDKKNRKP-------------PPENCPVNIGTSSLGAMAQS-------------GISPP-------LSLLTTDHLTGTSDNFLSYHPCAD
        ++  +SSG+  D +  KP               + CP      S     QS              ++ P       L    T H+T   +N   + P   
Subjt:  AFSANSSGTTGDKKNRKP-------------PPENCPVNIGTSSLGAMAQS-------------GISPP-------LSLLTTDHLTGTSDNFLSYHPCAD

Query:  NEKIRIADGT---LAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA------------------------------------LPSTIGDSISLIV
         + + IADG+   + H       +    L L+ VL+VP I  NL+SV RL                                      P     ++S+  
Subjt:  NEKIRIADGT---LAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLA------------------------------------LPSTIGDSISLIV

Query:  -------------------ILLLDKTI---GLVCYLPIFQLLKMTD---------PFSFPTLQIFSTFHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRL
                           + +L+  I    L    P  +LL  +D         PFS  T+         YSD W  S   +     ++V F+D  TR 
Subjt:  -------------------ILLLDKTI---GLVCYLPIFQLLKMTD---------PFSFPTLQIFSTFHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRL

Query:  TWFY----------------TTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDV
        TW Y                + +E +F  +I  L SDNG EF+   LR++LS  GI H +S  +TP+ NG+ ERK+RH++++  +L+   S+    W   
Subjt:  TWFY----------------TTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLMLSTSLLSYLWEDV

Query:  VLTAAHLINRMPSRVLNFQTPL--------DYLKLS------YPTIALY----LMSLSK-CVFVGYPLHQCDY
           A +LINR+P+ +L  Q+P         +Y KL       YP +  Y    L   SK C F+GY L Q  Y
Subjt:  VLTAAHLINRMPSRVLNFQTPL--------DYLKLS------YPTIALY----LMSLSK-CVFVGYPLHQCDY

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.6e-1727.68Show/hide
Query:  KFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSLIWQ
        KFG++ G +PKP    P    W+  ++++   L++SM  ++ + ++Y  TA  +W+ +++++     + ++Y LR+++   +QG   +  YF KLS +W 
Subjt:  KFGYLTGEIPKPRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSLIWQ

Query:  EMDICCELVWDCPCGGV------QYYKLEEIDRVYDLLVG--LNSKFDAMWSRILGQRPIPTVMEACSEVRLEEDKM
        E+     +  +C CGG       +  +  E ++ Y+ L+G  LN  F+A+ ++I+ Q+P P++ EA + V+  E  M
Subjt:  EMDICCELVWDCPCGGV------QYYKLEEIDRVYDLLVG--LNSKFDAMWSRILGQRPIPTVMEACSEVRLEEDKM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTCCGTAACCGCCGTTGGGTCTGTCGTCCACGAAGTTGTCGACAGGTATGTTCGGTCACTCAACCCACGCGCAGCCGCCATCTTCTTCCAAAATTTTGGACCGAC
AGCACGATCTCCAGCCACCTCAACAGCCCAGGTTGCTTTTTGCACCCAGACCACCGTGCCTCCATTAGGTTTTCACGCCACCGCCACCCAAACCTCTCACCGTGGGCCGA
CTGTTCTTCCCGTCACTGGATCTTCTTCGTGGGTGAGTTTCGGGAACCAGTCGCGGTCAGATCTGGACACCGGCGGGGCTCATTACATCCTTGAAGTTGTTGTTGCTCTT
CACGCCACCAACGACGGTGCTTATTCCAGACGGCCGACCACCACTGCCGCTGCCAAATCCGAAGGTCTGGTTTCAAATCCGACAGCTGTTGCTACTGCTGGAACTAGTGG
GTGTTCTTCAACCTCGGTCCAACAACAACTTACCAATCTTCAAGCAAGTTTTCAACAACAAATCGCCGCTCTTGCGTGTAGGAAGTTCAAGTATTCCGTCAGACTTACTA
CTGGATCCAGAGATCCGGTAACTACTTTCCCTACCATATCGACTGTACCGTATTTGCCTGGACCAATGGGGCGTCATAAGTTCGGATATTTGACCGGTGAAATACCTAAA
CCCAGAATCGGAGATCCTCAAGAACACCTCTGGAAGGGAGAAGACTCATTACTCAGATCTCTGTTAATTCATAGTATGGAACCCCAAATCGGAAAGCCCCTGTTGTATGT
GGCGACCGCTCAGGATATTTGGGATGTTGTTCAACAACTTTATTCCAAGAGACAGAATGAATCACGGCTCTACACTCTGCGAAAACAAATCCACGAATGCAAACAGGGAT
CTATGGATATCACGTCATATTTTAATAAATTATCTCTTATTTGGCAGGAGATGGACATATGTTGTGAACTTGTCTGGGACTGCCCTTGTGGAGGAGTCCAATATTATAAA
CTTGAAGAGATTGACCGTGTATATGATCTTTTAGTAGGCCTCAATTCCAAGTTTGATGCTATGTGGAGCCGAATATTGGGACAAAGACCAATACCGACCGTAATGGAAGC
TTGTTCAGAAGTACGTCTGGAGGAAGATAAAATGAATGCCATGAATATCACGGCTTCCTCCGTAATTGATTCCGCCGCCTTCAGTGCTAACTCATCTGGTACTACTGGAG
ACAAGAAGAACAGGAAACCACCTCCAGAGAACTGCCCAGTCAATATCGGTACTTCCTCTTTAGGGGCGATGGCGCAATCAGGTATTTCTCCACCCTTGAGTCTCCTCACT
ACAGACCATCTAACTGGAACTTCTGATAACTTCCTCTCTTATCATCCATGTGCCGATAATGAAAAGATTCGAATTGCTGACGGGACACTTGCCCATGTTGCCGGCAAGGG
TCACATTTCTCCATTCAATGGTTTGGTATTACATAATGTGTTGCATGTTCCTAAAATCTCCTATAATTTATTGTCTGTGAGACGATTGGCACTGCCTAGCACAATAGGAG
ACTCTATTTCCTTAATAGTGATACTTCTTCTAGACAAGACTATAGGGCTAGTTTGTTATCTTCCTATTTTTCAACTTCTGAAAATGACTGATCCCTTTTCCTTCCCAACC
TTACAAATCTTCTCAACCTTTCACTCTTATTATAGTGATGCCTGGGGTCCCTCACGAACTGCCACTTCTTCGGGTAAACATTGGTTCGTTACCTTCATTGATGACCACAC
TCGCCTTACTTGGTTTTACACCACCATTGAAACTCAGTTTAATGCCAAAATTGCAATCCTTCGGAGTGACAATGGTCGTGAATTCCTCACTAATACTCTTCGTGAGTTCT
TATCCATTAAAGGTATCGTTCACCAGAGTTCGTGTGCCTATACACCCCAACAAAATGGAGTGGTTGAAAGAAAAAACCGTCATCTCCTAAAAGTTGCTCGGTCTCTCATG
CTATCAACCTCTCTTCTGTCTTACCTATGGGAGGATGTAGTCTTGACTGCCGCTCATCTTATAAATCGGATGCCTTCTCGGGTTCTTAACTTCCAAACTCCTCTTGACTA
CCTCAAACTGTCTTACCCTACCATCGCCTTATACTTGATGTCCCTCTCCAAATGTGTCTTCGTTGGGTATCCTCTCCACCAATGCGATTATAATAATATTGTTTGTGCTG
AAAATGTTTGTGTTGAAAATGATATTGTTGATCTGACAAAACTGTCGGTAAAAAATGATAGAGATGATATGACAGAAAATAACCAGGTTGTTAAGAGTGATGTAGTGTCC
ACAACAGTTCAGGAGAATGAAAATGAGGTCATACCTCAGAATTCCACAACTGAAGAAGAACTCGACAAACCAGAGAAGTATGATGCCACCCTTGACATGCCCATTGCTCT
AAGAAAGGGCACGAGGTCCTGCACCAGGTATCCCAAGAATAGCTTCCTCTCGTATAACAATCTATCTTCTAAGTTCAGAGCGTTTACTGCTAGCCTTGACACTGTAACTA
TACCGAAGAACATATATGTGGCTATGGAAATTCCTGAGTGGAAGGCTGCTGTTTTGGAAGAAATGAGAGCTTTTGAGAAAAATCATACATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTCCGTAACCGCCGTTGGGTCTGTCGTCCACGAAGTTGTCGACAGGTATGTTCGGTCACTCAACCCACGCGCAGCCGCCATCTTCTTCCAAAATTTTGGACCGAC
AGCACGATCTCCAGCCACCTCAACAGCCCAGGTTGCTTTTTGCACCCAGACCACCGTGCCTCCATTAGGTTTTCACGCCACCGCCACCCAAACCTCTCACCGTGGGCCGA
CTGTTCTTCCCGTCACTGGATCTTCTTCGTGGGTGAGTTTCGGGAACCAGTCGCGGTCAGATCTGGACACCGGCGGGGCTCATTACATCCTTGAAGTTGTTGTTGCTCTT
CACGCCACCAACGACGGTGCTTATTCCAGACGGCCGACCACCACTGCCGCTGCCAAATCCGAAGGTCTGGTTTCAAATCCGACAGCTGTTGCTACTGCTGGAACTAGTGG
GTGTTCTTCAACCTCGGTCCAACAACAACTTACCAATCTTCAAGCAAGTTTTCAACAACAAATCGCCGCTCTTGCGTGTAGGAAGTTCAAGTATTCCGTCAGACTTACTA
CTGGATCCAGAGATCCGGTAACTACTTTCCCTACCATATCGACTGTACCGTATTTGCCTGGACCAATGGGGCGTCATAAGTTCGGATATTTGACCGGTGAAATACCTAAA
CCCAGAATCGGAGATCCTCAAGAACACCTCTGGAAGGGAGAAGACTCATTACTCAGATCTCTGTTAATTCATAGTATGGAACCCCAAATCGGAAAGCCCCTGTTGTATGT
GGCGACCGCTCAGGATATTTGGGATGTTGTTCAACAACTTTATTCCAAGAGACAGAATGAATCACGGCTCTACACTCTGCGAAAACAAATCCACGAATGCAAACAGGGAT
CTATGGATATCACGTCATATTTTAATAAATTATCTCTTATTTGGCAGGAGATGGACATATGTTGTGAACTTGTCTGGGACTGCCCTTGTGGAGGAGTCCAATATTATAAA
CTTGAAGAGATTGACCGTGTATATGATCTTTTAGTAGGCCTCAATTCCAAGTTTGATGCTATGTGGAGCCGAATATTGGGACAAAGACCAATACCGACCGTAATGGAAGC
TTGTTCAGAAGTACGTCTGGAGGAAGATAAAATGAATGCCATGAATATCACGGCTTCCTCCGTAATTGATTCCGCCGCCTTCAGTGCTAACTCATCTGGTACTACTGGAG
ACAAGAAGAACAGGAAACCACCTCCAGAGAACTGCCCAGTCAATATCGGTACTTCCTCTTTAGGGGCGATGGCGCAATCAGGTATTTCTCCACCCTTGAGTCTCCTCACT
ACAGACCATCTAACTGGAACTTCTGATAACTTCCTCTCTTATCATCCATGTGCCGATAATGAAAAGATTCGAATTGCTGACGGGACACTTGCCCATGTTGCCGGCAAGGG
TCACATTTCTCCATTCAATGGTTTGGTATTACATAATGTGTTGCATGTTCCTAAAATCTCCTATAATTTATTGTCTGTGAGACGATTGGCACTGCCTAGCACAATAGGAG
ACTCTATTTCCTTAATAGTGATACTTCTTCTAGACAAGACTATAGGGCTAGTTTGTTATCTTCCTATTTTTCAACTTCTGAAAATGACTGATCCCTTTTCCTTCCCAACC
TTACAAATCTTCTCAACCTTTCACTCTTATTATAGTGATGCCTGGGGTCCCTCACGAACTGCCACTTCTTCGGGTAAACATTGGTTCGTTACCTTCATTGATGACCACAC
TCGCCTTACTTGGTTTTACACCACCATTGAAACTCAGTTTAATGCCAAAATTGCAATCCTTCGGAGTGACAATGGTCGTGAATTCCTCACTAATACTCTTCGTGAGTTCT
TATCCATTAAAGGTATCGTTCACCAGAGTTCGTGTGCCTATACACCCCAACAAAATGGAGTGGTTGAAAGAAAAAACCGTCATCTCCTAAAAGTTGCTCGGTCTCTCATG
CTATCAACCTCTCTTCTGTCTTACCTATGGGAGGATGTAGTCTTGACTGCCGCTCATCTTATAAATCGGATGCCTTCTCGGGTTCTTAACTTCCAAACTCCTCTTGACTA
CCTCAAACTGTCTTACCCTACCATCGCCTTATACTTGATGTCCCTCTCCAAATGTGTCTTCGTTGGGTATCCTCTCCACCAATGCGATTATAATAATATTGTTTGTGCTG
AAAATGTTTGTGTTGAAAATGATATTGTTGATCTGACAAAACTGTCGGTAAAAAATGATAGAGATGATATGACAGAAAATAACCAGGTTGTTAAGAGTGATGTAGTGTCC
ACAACAGTTCAGGAGAATGAAAATGAGGTCATACCTCAGAATTCCACAACTGAAGAAGAACTCGACAAACCAGAGAAGTATGATGCCACCCTTGACATGCCCATTGCTCT
AAGAAAGGGCACGAGGTCCTGCACCAGGTATCCCAAGAATAGCTTCCTCTCGTATAACAATCTATCTTCTAAGTTCAGAGCGTTTACTGCTAGCCTTGACACTGTAACTA
TACCGAAGAACATATATGTGGCTATGGAAATTCCTGAGTGGAAGGCTGCTGTTTTGGAAGAAATGAGAGCTTTTGAGAAAAATCATACATGA
Protein sequenceShow/hide protein sequence
MESVTAVGSVVHEVVDRYVRSLNPRAAAIFFQNFGPTARSPATSTAQVAFCTQTTVPPLGFHATATQTSHRGPTVLPVTGSSSWVSFGNQSRSDLDTGGAHYILEVVVAL
HATNDGAYSRRPTTTAAAKSEGLVSNPTAVATAGTSGCSSTSVQQQLTNLQASFQQQIAALACRKFKYSVRLTTGSRDPVTTFPTISTVPYLPGPMGRHKFGYLTGEIPK
PRIGDPQEHLWKGEDSLLRSLLIHSMEPQIGKPLLYVATAQDIWDVVQQLYSKRQNESRLYTLRKQIHECKQGSMDITSYFNKLSLIWQEMDICCELVWDCPCGGVQYYK
LEEIDRVYDLLVGLNSKFDAMWSRILGQRPIPTVMEACSEVRLEEDKMNAMNITASSVIDSAAFSANSSGTTGDKKNRKPPPENCPVNIGTSSLGAMAQSGISPPLSLLT
TDHLTGTSDNFLSYHPCADNEKIRIADGTLAHVAGKGHISPFNGLVLHNVLHVPKISYNLLSVRRLALPSTIGDSISLIVILLLDKTIGLVCYLPIFQLLKMTDPFSFPT
LQIFSTFHSYYSDAWGPSRTATSSGKHWFVTFIDDHTRLTWFYTTIETQFNAKIAILRSDNGREFLTNTLREFLSIKGIVHQSSCAYTPQQNGVVERKNRHLLKVARSLM
LSTSLLSYLWEDVVLTAAHLINRMPSRVLNFQTPLDYLKLSYPTIALYLMSLSKCVFVGYPLHQCDYNNIVCAENVCVENDIVDLTKLSVKNDRDDMTENNQVVKSDVVS
TTVQENENEVIPQNSTTEEELDKPEKYDATLDMPIALRKGTRSCTRYPKNSFLSYNNLSSKFRAFTASLDTVTIPKNIYVAMEIPEWKAAVLEEMRAFEKNHT