; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc06G08950 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc06G08950
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionBeta-galactosidase
Genome locationClcChr06:11275782..11288346
RNA-Seq ExpressionClc06G08950
SyntenyClc06G08950
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR021717 - Nucleoporin Nup120/160
IPR029472 - Retrotransposon Copia-like, N-terminal
IPR036322 - WD40-repeat-containing domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026232.1 Beta-galactosidase [Cucumis melo var. makuwa]0.0e+0046.27Show/hide
Query:  GQSAVNSSVPSDLPLYPVHPVTTFPTITPAPYLSGPIGNSIELTVGEKLNGQNYFSWSQSVKMILEGRHKFKYLTGEIPKPRPGDPHERIWKGEDSLLRS
        G+S+ +S  P++LP+Y  +PVT+FP  + + Y++G +G+S     GEKLNGQNYFSWSQS+KM LEGR++F +LTGEI +P PGD  ER+WKGEDSL+RS
Subjt:  GQSAVNSSVPSDLPLYPVHPVTTFPTITPAPYLSGPIGNSIELTVGEKLNGQNYFSWSQSVKMILEGRHKFKYLTGEIPKPRPGDPHERIWKGEDSLLRS

Query:  LLIHSMEPQIGKPLLYAATARDIWDAVQKLYSKRQNASRLYTLRKQIYECKQ-----------------------ELVWDCPCGGVQYYKLEEVDQVYDF
        +LI+SMEPQIGKPLLYA TA+D+WD  Q LYSKRQNASRLYTLRKQ++ CKQ                       E VWD P    QY KLEE D+VYDF
Subjt:  LLIHSMEPQIGKPLLYAATARDIWDAVQKLYSKRQNASRLYTLRKQIYECKQ-----------------------ELVWDCPCGGVQYYKLEEVDQVYDF

Query:  LAGLNSKFDAVRSRILGQKPTPTLMEVCSEVRLEEDRTSAMNTVGTSITDSTAFSAKLASTTGDKQNGKPPPVCEHCKKPWHTKDQCWKLHGRPLNGKRR
        LAGLN KFD V  RILGQ+P P+LMEVC EVRLEEDRT+AM  + T   DS AFSA+ ++   DK NGK  PVCEHCKK WHTKDQCWKLHGRP  GK+R
Subjt:  LAGLNSKFDAVRSRILGQKPTPTLMEVCSEVRLEEDRTSAMNTVGTSITDSTAFSAKLASTTGDKQNGKPPPVCEHCKKPWHTKDQCWKLHGRPLNGKRR

Query:  PPNNKPN--QALVSETTSDSQPQRQENYPADNGTASLGEIAHSGISPSLSLLSITGQKPWILDSGATDHLTGTSDNFLSYHPCVSNEKIRIADGTLAPVA
          N K N  +A +SETT  S  Q  +   +   T +LG IA SG+  SL L+S+ G+ PWILDSGATDHLTG+S++F+SY PC  NEKIRIADG+LAP+A
Subjt:  PPNNKPN--QALVSETTSDSQPQRQENYPADNGTASLGEIAHSGISPSLSLLSITGQKPWILDSGATDHLTGTSDNFLSYHPCVSNEKIRIADGTLAPVA

Query:  GKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNCRVAFSPDDVLFQDLSLGKTIGTAQHKR-----------DSISLMVILP--------------
        GKG I PFDG  LQNVLHVPK+SYNLLS+SKITR+L+C+  F P+ V FQD+S G+TIGTA+H R            S+S + +L               
Subjt:  GKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNCRVAFSPDDVLFQDLSLGKTIGTAQHKR-----------DSISLMVILP--------------

Query:  --LG------------------TVLGLVC-----------------YLP--------------------------------------------------I
          LG                   V  L C                 Y P                                                  I
Subjt:  --LG------------------TVLGLVC-----------------YLP--------------------------------------------------I

Query:  FQ-------------LLKLTSNNGRELLTNTLREFLSTKGIVHQSLCAYTPQQNGVAERKNRHILEVARSRF---QPPFHLTYGA---AAHLINRMPSRV
        FQ             +  L S+NGRE   + L EFL++KGIVHQ+ CAYTPQQNGVAERKNRH++EVARS       P +L   A   AAHLINRMPSR+
Subjt:  FQ-------------LLKLTSNNGRELLTNTLREFLSTKGIVHQSLCAYTPQQNGVAERKNRHILEVARSRF---QPPFHLTYGA---AAHLINRMPSRV

Query:  LHFQTPLDYLKLSYPTTRLIPDVPLRVFGCTAFV--LASVQTRLSS----------PLEPRNVSSLDIPSTSVVINASIPLL------------------
        LH QTPLD LK SYP+TRL+ +VPLRVFGCTA+V      QT+ +           PL          PS    +   +                     
Subjt:  LHFQTPLDYLKLSYPTTRLIPDVPLRVFGCTAFV--LASVQTRLSS----------PLEPRNVSSLDIPSTSVVINASIPLL------------------

Query:  -GNTSSLWISPS---------WKINLSFPQVPWITYYRKNLRKEIAAPTN-IPAPVQSSELTQAQGTTDPDNNAICAENVCVENDIVDLTKLSVEDGKDD
          N +  ++ P+           I L   QVPW TYYR+NLRKE+ +PT+  PAPVQ+ E  + QG  +P     C  N   END  D+  L      ++
Subjt:  -GNTSSLWISPS---------WKINLSFPQVPWITYYRKNLRKEIAAPTN-IPAPVQSSELTQAQGTTDPDNNAICAENVCVENDIVDLTKLSVEDGKDD

Query:  MTENNQVAESDV-VSTIVEENEKGIIPQNPTTGEKLDKPGECDTNLDLPIALRKGTRSCTKYPMYSFLSYNNLSSKSRAFIVSLDTVTIPKNIYVTMEIP
        M E N   E++V + T  +E E+G          KLD   E D +LD+PIALRKGTRSCTK+P+ +++SY+NLS + RAF  +LD+  IPKNIY  +E P
Subjt:  MTENNQVAESDV-VSTIVEENEKGIIPQNPTTGEKLDKPGECDTNLDLPIALRKGTRSCTKYPMYSFLSYNNLSSKSRAFIVSLDTVTIPKNIYVTMEIP

Query:  EWKAAIMEEMRALEKNHTWELVALPKGHKPVGYKWVFTTKYKFDGTLDRYKARLVA--------------------------------------------
        EWK A+MEEM+ALEKN TWE+ ALPKGHK VG KWVF+ KYK DGTLDR+KARLVA                                            
Subjt:  EWKAAIMEEMRALEKNHTWELVALPKGHKPVGYKWVFTTKYKFDGTLDRYKARLVA--------------------------------------------

Query:  ---------------------------------------------------------------------KRSVSGKIAVLIVYVDDIVLSGDDTAKINKL
                                                                             K S +GKIA+LIVYVDDIVL+GDD  +I++L
Subjt:  ---------------------------------------------------------------------KRSVSGKIAVLIVYVDDIVLSGDDTAKINKL

Query:  KQKMADEFEIKDLGNLKYFLGMEVARSKKGISVSQWKYTLDLMKET------------------------------------------------------
        KQ+M DEFEIKDLGNLKYFLGMEVARSK+GISVSQ KYTLDL+ ET                                                      
Subjt:  KQKMADEFEIKDLGNLKYFLGMEVARSKKGISVSQWKYTLDLMKET------------------------------------------------------

Query:  ---------------------------------------------------GSAIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSGAEAEYRAMSHGICE
                                                           GS IDRKSTSGYCTFVWGNLVTWRSKKQSVVARS AEAEYRAMS GICE
Subjt:  ---------------------------------------------------GSAIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSGAEAEYRAMSHGICE

Query:  EIWLKKVLSDLHQSSELPMKLYCDTKAAISIANNSVQHDRTKHVEIDRHFLKEKLENGSICIPYIPSSQQVVDVLTKGLLKQSFDACVSKMGLIDIYVPS
        EIWL+KVLSDLHQ  E P+KL+CD KAAISIANN VQHDRTKHVEIDRHF+KE+L++GSICIPYIPSSQQ+ DVLTKGLL+  FD CVSK+GLIDIY+P+
Subjt:  EIWLKKVLSDLHQSSELPMKLYCDTKAAISIANNSVQHDRTKHVEIDRHFLKEKLENGSICIPYIPSSQQVVDVLTKGLLKQSFDACVSKMGLIDIYVPS

KAA0034386.1 Beta-galactosidase [Cucumis melo var. makuwa]0.0e+0046.27Show/hide
Query:  GQSAVNSSVPSDLPLYPVHPVTTFPTITPAPYLSGPIGNSIELTVGEKLNGQNYFSWSQSVKMILEGRHKFKYLTGEIPKPRPGDPHERIWKGEDSLLRS
        G+S+ +S  P++LP+Y  +PVT+FP  + + Y++G +G+S     GEKLNGQNYFSWSQS+KM LEGR++F +LTGEI +P PGD  ER+WKGEDSL+RS
Subjt:  GQSAVNSSVPSDLPLYPVHPVTTFPTITPAPYLSGPIGNSIELTVGEKLNGQNYFSWSQSVKMILEGRHKFKYLTGEIPKPRPGDPHERIWKGEDSLLRS

Query:  LLIHSMEPQIGKPLLYAATARDIWDAVQKLYSKRQNASRLYTLRKQIYECKQ-----------------------ELVWDCPCGGVQYYKLEEVDQVYDF
        +LI+SMEPQIGKPLLYA TA+D+WD  Q LYSKRQNASRLYTLRKQ++ CKQ                       E VWD P    QY KLEE D+VYDF
Subjt:  LLIHSMEPQIGKPLLYAATARDIWDAVQKLYSKRQNASRLYTLRKQIYECKQ-----------------------ELVWDCPCGGVQYYKLEEVDQVYDF

Query:  LAGLNSKFDAVRSRILGQKPTPTLMEVCSEVRLEEDRTSAMNTVGTSITDSTAFSAKLASTTGDKQNGKPPPVCEHCKKPWHTKDQCWKLHGRPLNGKRR
        LAGLN KFD V  RILGQ+P P+LMEVC EVRLEEDRT+AM  + T   DS AFSA+ ++   DK NGK  PVCEHCKK WHTKDQCWKLHGRP  GK+R
Subjt:  LAGLNSKFDAVRSRILGQKPTPTLMEVCSEVRLEEDRTSAMNTVGTSITDSTAFSAKLASTTGDKQNGKPPPVCEHCKKPWHTKDQCWKLHGRPLNGKRR

Query:  PPNNKPN--QALVSETTSDSQPQRQENYPADNGTASLGEIAHSGISPSLSLLSITGQKPWILDSGATDHLTGTSDNFLSYHPCVSNEKIRIADGTLAPVA
          N K N  +A +SETT  S  Q  +   +   T +LG IA SG+  SL L+S+ G+ PWILDSGATDHLTG+S++F+SY PC  NEKIRIADG+LAP+A
Subjt:  PPNNKPN--QALVSETTSDSQPQRQENYPADNGTASLGEIAHSGISPSLSLLSITGQKPWILDSGATDHLTGTSDNFLSYHPCVSNEKIRIADGTLAPVA

Query:  GKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNCRVAFSPDDVLFQDLSLGKTIGTAQHKR-----------DSISLMVILP--------------
        GKG I PFDG  LQNVLHVPK+SYNLLS+SKITR+L+C+  F P+ V FQD+S G+TIGTA+H R            S+S + +L               
Subjt:  GKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNCRVAFSPDDVLFQDLSLGKTIGTAQHKR-----------DSISLMVILP--------------

Query:  --LG------------------TVLGLVC-----------------YLP--------------------------------------------------I
          LG                   V  L C                 Y P                                                  I
Subjt:  --LG------------------TVLGLVC-----------------YLP--------------------------------------------------I

Query:  FQ-------------LLKLTSNNGRELLTNTLREFLSTKGIVHQSLCAYTPQQNGVAERKNRHILEVARSRF---QPPFHLTYGA---AAHLINRMPSRV
        FQ             +  L S+NGRE   + L EFL++KGIVHQ+ CAYTPQQNGVAERKNRH++EVARS       P +L   A   AAHLINRMPSR+
Subjt:  FQ-------------LLKLTSNNGRELLTNTLREFLSTKGIVHQSLCAYTPQQNGVAERKNRHILEVARSRF---QPPFHLTYGA---AAHLINRMPSRV

Query:  LHFQTPLDYLKLSYPTTRLIPDVPLRVFGCTAFV--LASVQTRLSS----------PLEPRNVSSLDIPSTSVVINASIPLL------------------
        LH QTPLD LK SYP+TRL+ +VPLRVFGCTA+V      QT+ +           PL          PS    +   +                     
Subjt:  LHFQTPLDYLKLSYPTTRLIPDVPLRVFGCTAFV--LASVQTRLSS----------PLEPRNVSSLDIPSTSVVINASIPLL------------------

Query:  -GNTSSLWISPS---------WKINLSFPQVPWITYYRKNLRKEIAAPTN-IPAPVQSSELTQAQGTTDPDNNAICAENVCVENDIVDLTKLSVEDGKDD
          N +  ++ P+           I L   QVPW TYYR+NLRKE+ +PT+  PAPVQ+ E  + QG  +P     C  N   END  D+  L      ++
Subjt:  -GNTSSLWISPS---------WKINLSFPQVPWITYYRKNLRKEIAAPTN-IPAPVQSSELTQAQGTTDPDNNAICAENVCVENDIVDLTKLSVEDGKDD

Query:  MTENNQVAESDV-VSTIVEENEKGIIPQNPTTGEKLDKPGECDTNLDLPIALRKGTRSCTKYPMYSFLSYNNLSSKSRAFIVSLDTVTIPKNIYVTMEIP
        M E N   E++V + T  +E E+G          KLD   E D +LD+PIALRKGTRSCTK+P+ +++SY+NLS + RAF  +LD+  IPKNIY  +E P
Subjt:  MTENNQVAESDV-VSTIVEENEKGIIPQNPTTGEKLDKPGECDTNLDLPIALRKGTRSCTKYPMYSFLSYNNLSSKSRAFIVSLDTVTIPKNIYVTMEIP

Query:  EWKAAIMEEMRALEKNHTWELVALPKGHKPVGYKWVFTTKYKFDGTLDRYKARLVA--------------------------------------------
        EWK A+MEEM+ALEKN TWE+ ALPKGHK VG KWVF+ KYK DGTLDR+KARLVA                                            
Subjt:  EWKAAIMEEMRALEKNHTWELVALPKGHKPVGYKWVFTTKYKFDGTLDRYKARLVA--------------------------------------------

Query:  ---------------------------------------------------------------------KRSVSGKIAVLIVYVDDIVLSGDDTAKINKL
                                                                             K S +GKIA+LIVYVDDIVL+GDD  +I++L
Subjt:  ---------------------------------------------------------------------KRSVSGKIAVLIVYVDDIVLSGDDTAKINKL

Query:  KQKMADEFEIKDLGNLKYFLGMEVARSKKGISVSQWKYTLDLMKET------------------------------------------------------
        KQ+M DEFEIKDLGNLKYFLGMEVARSK+GISVSQ KYTLDL+ ET                                                      
Subjt:  KQKMADEFEIKDLGNLKYFLGMEVARSKKGISVSQWKYTLDLMKET------------------------------------------------------

Query:  ---------------------------------------------------GSAIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSGAEAEYRAMSHGICE
                                                           GS IDRKSTSGYCTFVWGNLVTWRSKKQSVVARS AEAEYRAMS GICE
Subjt:  ---------------------------------------------------GSAIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSGAEAEYRAMSHGICE

Query:  EIWLKKVLSDLHQSSELPMKLYCDTKAAISIANNSVQHDRTKHVEIDRHFLKEKLENGSICIPYIPSSQQVVDVLTKGLLKQSFDACVSKMGLIDIYVPS
        EIWL+KVLSDLHQ  E P+KL+CD KAAISIANN VQHDRTKHVEIDRHF+KE+L++GSICIPYIPSSQQ+ DVLTKGLL+  FD CVSK+GLIDIY+P+
Subjt:  EIWLKKVLSDLHQSSELPMKLYCDTKAAISIANNSVQHDRTKHVEIDRHFLKEKLENGSICIPYIPSSQQVVDVLTKGLLKQSFDACVSKMGLIDIYVPS

KAA0052775.1 Beta-galactosidase [Cucumis melo var. makuwa]0.0e+0046.27Show/hide
Query:  GQSAVNSSVPSDLPLYPVHPVTTFPTITPAPYLSGPIGNSIELTVGEKLNGQNYFSWSQSVKMILEGRHKFKYLTGEIPKPRPGDPHERIWKGEDSLLRS
        G+S+ +S  P++LP+Y  +PVT+FP  + + Y++G +G+S     GEKLNGQNYFSWSQS+KM LEGR++F +LTGEI +P PGD  ER+WKGEDSL+RS
Subjt:  GQSAVNSSVPSDLPLYPVHPVTTFPTITPAPYLSGPIGNSIELTVGEKLNGQNYFSWSQSVKMILEGRHKFKYLTGEIPKPRPGDPHERIWKGEDSLLRS

Query:  LLIHSMEPQIGKPLLYAATARDIWDAVQKLYSKRQNASRLYTLRKQIYECKQ-----------------------ELVWDCPCGGVQYYKLEEVDQVYDF
        +LI+SMEPQIGKPLLYA TA+D+WD  Q LYSKRQNASRLYTLRKQ++ CKQ                       E VWD P    QY KLEE D+VYDF
Subjt:  LLIHSMEPQIGKPLLYAATARDIWDAVQKLYSKRQNASRLYTLRKQIYECKQ-----------------------ELVWDCPCGGVQYYKLEEVDQVYDF

Query:  LAGLNSKFDAVRSRILGQKPTPTLMEVCSEVRLEEDRTSAMNTVGTSITDSTAFSAKLASTTGDKQNGKPPPVCEHCKKPWHTKDQCWKLHGRPLNGKRR
        LAGLN KFD V  RILGQ+P P+LMEVC EVRLEEDRT+AM  + T   DS AFSA+ ++   DK NGK  PVCEHCKK WHTKDQCWKLHGRP  GK+R
Subjt:  LAGLNSKFDAVRSRILGQKPTPTLMEVCSEVRLEEDRTSAMNTVGTSITDSTAFSAKLASTTGDKQNGKPPPVCEHCKKPWHTKDQCWKLHGRPLNGKRR

Query:  PPNNKPN--QALVSETTSDSQPQRQENYPADNGTASLGEIAHSGISPSLSLLSITGQKPWILDSGATDHLTGTSDNFLSYHPCVSNEKIRIADGTLAPVA
          N K N  +A +SETT  S  Q  +   +   T +LG IA SG+  SL L+S+ G+ PWILDSGATDHLTG+S++F+SY PC  NEKIRIADG+LAP+A
Subjt:  PPNNKPN--QALVSETTSDSQPQRQENYPADNGTASLGEIAHSGISPSLSLLSITGQKPWILDSGATDHLTGTSDNFLSYHPCVSNEKIRIADGTLAPVA

Query:  GKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNCRVAFSPDDVLFQDLSLGKTIGTAQHKR-----------DSISLMVILP--------------
        GKG I PFDG  LQNVLHVPK+SYNLLS+SKITR+L+C+  F P+ V FQD+S G+TIGTA+H R            S+S + +L               
Subjt:  GKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNCRVAFSPDDVLFQDLSLGKTIGTAQHKR-----------DSISLMVILP--------------

Query:  --LG------------------TVLGLVC-----------------YLP--------------------------------------------------I
          LG                   V  L C                 Y P                                                  I
Subjt:  --LG------------------TVLGLVC-----------------YLP--------------------------------------------------I

Query:  FQ-------------LLKLTSNNGRELLTNTLREFLSTKGIVHQSLCAYTPQQNGVAERKNRHILEVARSRF---QPPFHLTYGA---AAHLINRMPSRV
        FQ             +  L S+NGRE   + L EFL++KGIVHQ+ CAYTPQQNGVAERKNRH++EVARS       P +L   A   AAHLINRMPSR+
Subjt:  FQ-------------LLKLTSNNGRELLTNTLREFLSTKGIVHQSLCAYTPQQNGVAERKNRHILEVARSRF---QPPFHLTYGA---AAHLINRMPSRV

Query:  LHFQTPLDYLKLSYPTTRLIPDVPLRVFGCTAFV--LASVQTRLSS----------PLEPRNVSSLDIPSTSVVINASIPLL------------------
        LH QTPLD LK SYP+TRL+ +VPLRVFGCTA+V      QT+ +           PL          PS    +   +                     
Subjt:  LHFQTPLDYLKLSYPTTRLIPDVPLRVFGCTAFV--LASVQTRLSS----------PLEPRNVSSLDIPSTSVVINASIPLL------------------

Query:  -GNTSSLWISPS---------WKINLSFPQVPWITYYRKNLRKEIAAPTN-IPAPVQSSELTQAQGTTDPDNNAICAENVCVENDIVDLTKLSVEDGKDD
          N +  ++ P+           I L   QVPW TYYR+NLRKE+ +PT+  PAPVQ+ E  + QG  +P     C  N   END  D+  L      ++
Subjt:  -GNTSSLWISPS---------WKINLSFPQVPWITYYRKNLRKEIAAPTN-IPAPVQSSELTQAQGTTDPDNNAICAENVCVENDIVDLTKLSVEDGKDD

Query:  MTENNQVAESDV-VSTIVEENEKGIIPQNPTTGEKLDKPGECDTNLDLPIALRKGTRSCTKYPMYSFLSYNNLSSKSRAFIVSLDTVTIPKNIYVTMEIP
        M E N   E++V + T  +E E+G          KLD   E D +LD+PIALRKGTRSCTK+P+ +++SY+NLS + RAF  +LD+  IPKNIY  +E P
Subjt:  MTENNQVAESDV-VSTIVEENEKGIIPQNPTTGEKLDKPGECDTNLDLPIALRKGTRSCTKYPMYSFLSYNNLSSKSRAFIVSLDTVTIPKNIYVTMEIP

Query:  EWKAAIMEEMRALEKNHTWELVALPKGHKPVGYKWVFTTKYKFDGTLDRYKARLVA--------------------------------------------
        EWK A+MEEM+ALEKN TWE+ ALPKGHK VG KWVF+ KYK DGTLDR+KARLVA                                            
Subjt:  EWKAAIMEEMRALEKNHTWELVALPKGHKPVGYKWVFTTKYKFDGTLDRYKARLVA--------------------------------------------

Query:  ---------------------------------------------------------------------KRSVSGKIAVLIVYVDDIVLSGDDTAKINKL
                                                                             K S +GKIA+LIVYVDDIVL+GDD  +I++L
Subjt:  ---------------------------------------------------------------------KRSVSGKIAVLIVYVDDIVLSGDDTAKINKL

Query:  KQKMADEFEIKDLGNLKYFLGMEVARSKKGISVSQWKYTLDLMKET------------------------------------------------------
        KQ+M DEFEIKDLGNLKYFLGMEVARSK+GISVSQ KYTLDL+ ET                                                      
Subjt:  KQKMADEFEIKDLGNLKYFLGMEVARSKKGISVSQWKYTLDLMKET------------------------------------------------------

Query:  ---------------------------------------------------GSAIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSGAEAEYRAMSHGICE
                                                           GS IDRKSTSGYCTFVWGNLVTWRSKKQSVVARS AEAEYRAMS GICE
Subjt:  ---------------------------------------------------GSAIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSGAEAEYRAMSHGICE

Query:  EIWLKKVLSDLHQSSELPMKLYCDTKAAISIANNSVQHDRTKHVEIDRHFLKEKLENGSICIPYIPSSQQVVDVLTKGLLKQSFDACVSKMGLIDIYVPS
        EIWL+KVLSDLHQ  E P+KL+CD KAAISIANN VQHDRTKHVEIDRHF+KE+L++GSICIPYIPSSQQ+ DVLTKGLL+  FD CVSK+GLIDIY+P+
Subjt:  EIWLKKVLSDLHQSSELPMKLYCDTKAAISIANNSVQHDRTKHVEIDRHFLKEKLENGSICIPYIPSSQQVVDVLTKGLLKQSFDACVSKMGLIDIYVPS

TYJ97179.1 Beta-galactosidase [Cucumis melo var. makuwa]0.0e+0047.17Show/hide
Query:  GQSAVNSSVPSDLPLYPVHPVTTFPTITPAPYLSGPIGNSIELTVGEKLNGQNYFSWSQSVKMILEGRHKFKYLTGEIPKPRPGDPHERIWKGEDSLLRS
        G+S+ +S  P++LP+Y  +PVT+FP  + + Y++G +G+S     GEKLNGQNYFSWSQS+KM LEGR++F +LTGEI +P PGD  ER+WKGEDSL+RS
Subjt:  GQSAVNSSVPSDLPLYPVHPVTTFPTITPAPYLSGPIGNSIELTVGEKLNGQNYFSWSQSVKMILEGRHKFKYLTGEIPKPRPGDPHERIWKGEDSLLRS

Query:  LLIHSMEPQIGKPLLYAATARDIWDAVQKLYSKRQNASRLYTLRKQIYECKQ-----------------------ELVWDCPCGGVQYYKLEEVDQVYDF
        +LI+SMEPQIGKPLLYA TA+D+WD  Q LYSKRQNASRLYTLRKQ++ CKQ                       E VWD P    QY KLEE D+VYDF
Subjt:  LLIHSMEPQIGKPLLYAATARDIWDAVQKLYSKRQNASRLYTLRKQIYECKQ-----------------------ELVWDCPCGGVQYYKLEEVDQVYDF

Query:  LAGLNSKFDAVRSRILGQKPTPTLMEVCSEVRLEEDRTSAMNTVGTSITDSTAFSAKLASTTGDKQNGKPPPVCEHCKKPWHTKDQCWKLHGRPLNGKRR
        LAGLN KFD V  RILGQ+P P+LMEVC EVRLEEDRT+AM  + T   DS AFSA+ ++   DK NGK  PVCEHCKK WHTKDQCWKLHGRP  GK+R
Subjt:  LAGLNSKFDAVRSRILGQKPTPTLMEVCSEVRLEEDRTSAMNTVGTSITDSTAFSAKLASTTGDKQNGKPPPVCEHCKKPWHTKDQCWKLHGRPLNGKRR

Query:  PPNNKPN--QALVSETTSDSQPQRQENYPADNGTASLGEIAHSGISPSLSLLSITGQKPWILDSGATDHLTGTSDNFLSYHPCVSNEKIRIADGTLAPVA
          N K N  +A +SETT  S  Q  +   +   T +LG IA SG+  SL L+S+ G+ PWILDSGATDHLTG+S++F+SY PC  NEKIRIADG+LAP+A
Subjt:  PPNNKPN--QALVSETTSDSQPQRQENYPADNGTASLGEIAHSGISPSLSLLSITGQKPWILDSGATDHLTGTSDNFLSYHPCVSNEKIRIADGTLAPVA

Query:  GKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNCRVAFSPDDVLFQDLSLGKTIGTAQHKR-----------DSISLMVILP--------------
        GKG I PFDG  LQNVLHVPK+SYNLLS+SKITR+L+C+  F P+ V FQD+S G+TIGTA+H R            S+S + +L               
Subjt:  GKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNCRVAFSPDDVLFQDLSLGKTIGTAQHKR-----------DSISLMVILP--------------

Query:  --LG------------------TVLGLVC-----------------YLP--------------------------------------------------I
          LG                   V  L C                 Y P                                                  I
Subjt:  --LG------------------TVLGLVC-----------------YLP--------------------------------------------------I

Query:  FQ-------------LLKLTSNNGRELLTNTLREFLSTKGIVHQSLCAYTPQQNGVAERKNRHILEVARSRF---QPPFHLTYGA---AAHLINRMPSRV
        FQ             +  L S+NGRE   + L EFL++KGIVHQ+ CAYTPQQNGVAERKNRH++EVARS       P +L   A   AAHLINRMPSR+
Subjt:  FQ-------------LLKLTSNNGRELLTNTLREFLSTKGIVHQSLCAYTPQQNGVAERKNRHILEVARSRF---QPPFHLTYGA---AAHLINRMPSRV

Query:  LHFQTPLDYLKLSYPTTRLIPDVPLRVFGCTAFV--LASVQTRLSS----------PLEPRNVSSLDIPSTSVVINASIPLL------------------
        LH QTPLD LK SYP+TRL+ +VPLRVFGCTA+V      QT+ +           PL          PS    +   +                     
Subjt:  LHFQTPLDYLKLSYPTTRLIPDVPLRVFGCTAFV--LASVQTRLSS----------PLEPRNVSSLDIPSTSVVINASIPLL------------------

Query:  -GNTSSLWISPS---------WKINLSFPQVPWITYYRKNLRKEIAAPTN-IPAPVQSSELTQAQGTTDPDNNAICAENVCVENDIVDLTKLSVEDGKDD
          N +  ++ P+           I L   QVPW TYYR+NLRKE+ +PT+  PAPVQ+ E  + QG  +P     C  N   END  D+  L      ++
Subjt:  -GNTSSLWISPS---------WKINLSFPQVPWITYYRKNLRKEIAAPTN-IPAPVQSSELTQAQGTTDPDNNAICAENVCVENDIVDLTKLSVEDGKDD

Query:  MTENNQVAESDV-VSTIVEENEKGIIPQNPTTGEKLDKPGECDTNLDLPIALRKGTRSCTKYPMYSFLSYNNLSSKSRAFIVSLDTVTIPKNIYVTMEIP
        M E N   E++V + T  +E E+G          KLD   E D +LD+PIALRKGTRSCTK+P+ +++SY+NLS + RAF  +LD+  IPKNIY  +E P
Subjt:  MTENNQVAESDV-VSTIVEENEKGIIPQNPTTGEKLDKPGECDTNLDLPIALRKGTRSCTKYPMYSFLSYNNLSSKSRAFIVSLDTVTIPKNIYVTMEIP

Query:  EWKAAIMEEMRALEKNHTWELVALPKGHKPVGYKWVFTTKYKFDGTLDRYKARLVA--------------------------------------------
        EWK A+MEEM+ALEKN TWE+ ALPKGHK VG KWVF+ KYK DGTLDR+KARLVA                                            
Subjt:  EWKAAIMEEMRALEKNHTWELVALPKGHKPVGYKWVFTTKYKFDGTLDRYKARLVA--------------------------------------------

Query:  ---------------------------------------------------------------------KRSVSGKIAVLIVYVDDIVLSGDDTAKINKL
                                                                             K S +GKIA+LIVYVDDIVL+GDD  +I++L
Subjt:  ---------------------------------------------------------------------KRSVSGKIAVLIVYVDDIVLSGDDTAKINKL

Query:  KQKMADEFEIKDLGNLKYFLGMEVARSKKGISVSQWKYTLDLMKETG-----------------------------------------------------
        KQ+M DEFEIKDLGNLKYFLGMEVARSK+GISVSQ KYTLDL+ ETG                                                     
Subjt:  KQKMADEFEIKDLGNLKYFLGMEVARSKKGISVSQWKYTLDLMKETG-----------------------------------------------------

Query:  -----------------SAIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSGAEAEYRAMSHGICEEIWLKKVLSDLHQSSELPMKLYCDTKAAISIANNS
                         + I RKSTSGYCTFVWGNLVTWRSKKQSVVARS AEAEYRAMS GICEEIWL+KVLSDLHQ  E P+KL+CD KAAISIANN 
Subjt:  -----------------SAIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSGAEAEYRAMSHGICEEIWLKKVLSDLHQSSELPMKLYCDTKAAISIANNS

Query:  VQHDRTKHVEIDRHFLKEKLENGSICIPYIPSSQQVVDVLTKGLLKQSFDACVSKMGLIDIYVPS
        VQHDRTKHVEIDRHF+KE+L++GSICIPYIPSSQQ+ DVLTKGLL+  FD CVSK+G IDIY+P+
Subjt:  VQHDRTKHVEIDRHFLKEKLENGSICIPYIPSSQQVVDVLTKGLLKQSFDACVSKMGLIDIYVPS

TYK31050.1 Beta-galactosidase [Cucumis melo var. makuwa]0.0e+0047.74Show/hide
Query:  GQSAVNSSVPSDLPLYPVHPVTTFPTITPAPYLSGPIGNSIELTVGEKLNGQNYFSWSQSVKMILEGRHKFKYLTGEIPKPRPGDPHERIWKGEDSLLRS
        G+S+ +S  P++LP+Y  +PVT+FP  + + Y++G +G+S     GEKLNGQNYFSWSQS+KM LEGR++F +LTGEI +P PGD  ER+WKGEDSL+RS
Subjt:  GQSAVNSSVPSDLPLYPVHPVTTFPTITPAPYLSGPIGNSIELTVGEKLNGQNYFSWSQSVKMILEGRHKFKYLTGEIPKPRPGDPHERIWKGEDSLLRS

Query:  LLIHSMEPQIGKPLLYAATARDIWDAVQKLYSKRQNASRLYTLRKQIYECKQ-----------------------ELVWDCPCGGVQYYKLEEVDQVYDF
        +LI+SMEPQIGKPLLYA TA+D+WD  Q LYSKRQNASRLYTLRKQ++ CKQ                       E VWD P    QY KLEE D+VYDF
Subjt:  LLIHSMEPQIGKPLLYAATARDIWDAVQKLYSKRQNASRLYTLRKQIYECKQ-----------------------ELVWDCPCGGVQYYKLEEVDQVYDF

Query:  LAGLNSKFDAVRSRILGQKPTPTLMEVCSEVRLEEDRTSAMNTVGTSITDSTAFSAKLASTTGDKQNGKPPPVCEHCKKPWHTKDQCWKLHGRPLNGKRR
        LAGLN KFD V  RILGQ+P P+LMEVC EVRLEEDRT+AM  + T   DS AFSA+ ++   DK NGK  PVCEHCKK WHTKDQCWKLHGRP  GK+R
Subjt:  LAGLNSKFDAVRSRILGQKPTPTLMEVCSEVRLEEDRTSAMNTVGTSITDSTAFSAKLASTTGDKQNGKPPPVCEHCKKPWHTKDQCWKLHGRPLNGKRR

Query:  PPNNKPN--QALVSETTSDSQPQRQENYPADNGTASLGEIAHSGISPSLSLLSITGQKPWILDSGATDHLTGTSDNFLSYHPCVSNEKIRIADGTLAPVA
          N K N  +A +SETT  S  Q  +   +   T +LG IA SG+  SL L+S+ G+ PWILDSGATDHLTG+S++F+SY PC  NEKIRIADG+LAP+A
Subjt:  PPNNKPN--QALVSETTSDSQPQRQENYPADNGTASLGEIAHSGISPSLSLLSITGQKPWILDSGATDHLTGTSDNFLSYHPCVSNEKIRIADGTLAPVA

Query:  GKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNCRVAFSPDDVLFQDLSLGKTIGTAQHKR-----------DSISLMVIL-----------PLGT
        GKG I PFDG  LQNVLHVPK+SYNLLS+SKITR+L+C+  F P+ V FQD+S G+TIGTA+H R            S+S + +L            + +
Subjt:  GKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNCRVAFSPDDVLFQDLSLGKTIGTAQHKR-----------DSISLMVIL-----------PLGT

Query:  VLGLVC--------------YLP--------------------------------------------------IFQ-------------LLKLTSNNGRE
        +   VC              Y P                                                  IFQ             +  L S+NGRE
Subjt:  VLGLVC--------------YLP--------------------------------------------------IFQ-------------LLKLTSNNGRE

Query:  LLTNTLREFLSTKGIVHQSLCAYTPQQNGVAERKNRHILEVARSRF---QPPFHLTYGA---AAHLINRMPSRVLHFQTPLDYLKLSYPTTRLIPDVPLR
           + L EFL++KGIVHQ+ CAYTPQQNGVAERKNRH++EVARS       P +L   A   AAHLINRMPSR+LH QTPLD LK SYP+TRL+ +VPLR
Subjt:  LLTNTLREFLSTKGIVHQSLCAYTPQQNGVAERKNRHILEVARSRF---QPPFHLTYGA---AAHLINRMPSRVLHFQTPLDYLKLSYPTTRLIPDVPLR

Query:  VFGCTAFV--LASVQTRLSS----------PLEPRNVSSLDIPSTSVVINASIPLL-------------------GNTSSLWISPS---------WKINL
        VFGCTA+V      QT+ +           PL          PS    +   +                       N +  ++ P+           I L
Subjt:  VFGCTAFV--LASVQTRLSS----------PLEPRNVSSLDIPSTSVVINASIPLL-------------------GNTSSLWISPS---------WKINL

Query:  SFPQVPWITYYRKNLRKEIAAPTN-IPAPVQSSELTQAQGTTDPDNNAICAENVCVENDIVDLTKLSVEDGKDDMTENNQVAESDV-VSTIVEENEKGII
           QVPW TYYR+NLRKE+ +PT+  PAPVQ+ E  + QG  +P     C  N   END  D+  L      ++M E N   E++V + T  +E E+G  
Subjt:  SFPQVPWITYYRKNLRKEIAAPTN-IPAPVQSSELTQAQGTTDPDNNAICAENVCVENDIVDLTKLSVEDGKDDMTENNQVAESDV-VSTIVEENEKGII

Query:  PQNPTTGEKLDKPGECDTNLDLPIALRKGTRSCTKYPMYSFLSYNNLSSKSRAFIVSLDTVTIPKNIYVTMEIPEWKAAIMEEMRALEKNHTWELVALPK
                KLD   E D +LD+PIALRKGTRSCTK+P+ +++SY+NLS + RAF  +LD+  IPKNIY  +E PEWK A+MEEM+ALEKN TWE+ ALPK
Subjt:  PQNPTTGEKLDKPGECDTNLDLPIALRKGTRSCTKYPMYSFLSYNNLSSKSRAFIVSLDTVTIPKNIYVTMEIPEWKAAIMEEMRALEKNHTWELVALPK

Query:  GHKPVGYKWVFTTKYKFDGTLDRYKARLVA----------------------------------------------------------------------
        GHK VG KWVF+ KYK DGTLDR+KARLVA                                                                      
Subjt:  GHKPVGYKWVFTTKYKFDGTLDRYKARLVA----------------------------------------------------------------------

Query:  -------------------------------------------KRSVSGKIAVLIVYVDDIVLSGDDTAKINKLKQKMADEFEIKDLGNLKYFLGMEVAR
                                                   K S +GKIA+LIVYVDDIVL+GDD  +I++LKQ+M DEFEIKDLGNLKYFLGMEVAR
Subjt:  -------------------------------------------KRSVSGKIAVLIVYVDDIVLSGDDTAKINKLKQKMADEFEIKDLGNLKYFLGMEVAR

Query:  SKKGISVSQWKYTLDLMKETG----------------------------------------------------------------------SAIDRKSTS
        SK+GISVSQ KYTLDL+ ETG                                                                      + I RKSTS
Subjt:  SKKGISVSQWKYTLDLMKETG----------------------------------------------------------------------SAIDRKSTS

Query:  GYCTFVWGNLVTWRSKKQSVVARSGAEAEYRAMSHGICEEIWLKKVLSDLHQSSELPMKLYCDTKAAISIANNSVQHDRTKHVEIDRHFLKEKLENGSIC
        GYCTFVWGNLVTWRSKKQSVVARS AEAEYRAMS GICEEIWL+KVLSDLHQ  E P+KL+CD KAAISIA N VQHDRTKHVEIDRHF+KE+L++GSIC
Subjt:  GYCTFVWGNLVTWRSKKQSVVARSGAEAEYRAMSHGICEEIWLKKVLSDLHQSSELPMKLYCDTKAAISIANNSVQHDRTKHVEIDRHFLKEKLENGSIC

Query:  IPYIPSSQQVVDVLTKGLLKQSFDACVSKMGLIDIYVPS
        IPYIPSSQQ+ DVL KGLL+  FD CVSK+GLIDIY+P+
Subjt:  IPYIPSSQQVVDVLTKGLLKQSFDACVSKMGLIDIYVPS

TrEMBL top hitse value%identityAlignment
A0A5A7SM64 Beta-galactosidase0.0e+0046.27Show/hide
Query:  GQSAVNSSVPSDLPLYPVHPVTTFPTITPAPYLSGPIGNSIELTVGEKLNGQNYFSWSQSVKMILEGRHKFKYLTGEIPKPRPGDPHERIWKGEDSLLRS
        G+S+ +S  P++LP+Y  +PVT+FP  + + Y++G +G+S     GEKLNGQNYFSWSQS+KM LEGR++F +LTGEI +P PGD  ER+WKGEDSL+RS
Subjt:  GQSAVNSSVPSDLPLYPVHPVTTFPTITPAPYLSGPIGNSIELTVGEKLNGQNYFSWSQSVKMILEGRHKFKYLTGEIPKPRPGDPHERIWKGEDSLLRS

Query:  LLIHSMEPQIGKPLLYAATARDIWDAVQKLYSKRQNASRLYTLRKQIYECKQ-----------------------ELVWDCPCGGVQYYKLEEVDQVYDF
        +LI+SMEPQIGKPLLYA TA+D+WD  Q LYSKRQNASRLYTLRKQ++ CKQ                       E VWD P    QY KLEE D+VYDF
Subjt:  LLIHSMEPQIGKPLLYAATARDIWDAVQKLYSKRQNASRLYTLRKQIYECKQ-----------------------ELVWDCPCGGVQYYKLEEVDQVYDF

Query:  LAGLNSKFDAVRSRILGQKPTPTLMEVCSEVRLEEDRTSAMNTVGTSITDSTAFSAKLASTTGDKQNGKPPPVCEHCKKPWHTKDQCWKLHGRPLNGKRR
        LAGLN KFD V  RILGQ+P P+LMEVC EVRLEEDRT+AM  + T   DS AFSA+ ++   DK NGK  PVCEHCKK WHTKDQCWKLHGRP  GK+R
Subjt:  LAGLNSKFDAVRSRILGQKPTPTLMEVCSEVRLEEDRTSAMNTVGTSITDSTAFSAKLASTTGDKQNGKPPPVCEHCKKPWHTKDQCWKLHGRPLNGKRR

Query:  PPNNKPN--QALVSETTSDSQPQRQENYPADNGTASLGEIAHSGISPSLSLLSITGQKPWILDSGATDHLTGTSDNFLSYHPCVSNEKIRIADGTLAPVA
          N K N  +A +SETT  S  Q  +   +   T +LG IA SG+  SL L+S+ G+ PWILDSGATDHLTG+S++F+SY PC  NEKIRIADG+LAP+A
Subjt:  PPNNKPN--QALVSETTSDSQPQRQENYPADNGTASLGEIAHSGISPSLSLLSITGQKPWILDSGATDHLTGTSDNFLSYHPCVSNEKIRIADGTLAPVA

Query:  GKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNCRVAFSPDDVLFQDLSLGKTIGTAQHKR-----------DSISLMVILP--------------
        GKG I PFDG  LQNVLHVPK+SYNLLS+SKITR+L+C+  F P+ V FQD+S G+TIGTA+H R            S+S + +L               
Subjt:  GKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNCRVAFSPDDVLFQDLSLGKTIGTAQHKR-----------DSISLMVILP--------------

Query:  --LG------------------TVLGLVC-----------------YLP--------------------------------------------------I
          LG                   V  L C                 Y P                                                  I
Subjt:  --LG------------------TVLGLVC-----------------YLP--------------------------------------------------I

Query:  FQ-------------LLKLTSNNGRELLTNTLREFLSTKGIVHQSLCAYTPQQNGVAERKNRHILEVARSRF---QPPFHLTYGA---AAHLINRMPSRV
        FQ             +  L S+NGRE   + L EFL++KGIVHQ+ CAYTPQQNGVAERKNRH++EVARS       P +L   A   AAHLINRMPSR+
Subjt:  FQ-------------LLKLTSNNGRELLTNTLREFLSTKGIVHQSLCAYTPQQNGVAERKNRHILEVARSRF---QPPFHLTYGA---AAHLINRMPSRV

Query:  LHFQTPLDYLKLSYPTTRLIPDVPLRVFGCTAFV--LASVQTRLSS----------PLEPRNVSSLDIPSTSVVINASIPLL------------------
        LH QTPLD LK SYP+TRL+ +VPLRVFGCTA+V      QT+ +           PL          PS    +   +                     
Subjt:  LHFQTPLDYLKLSYPTTRLIPDVPLRVFGCTAFV--LASVQTRLSS----------PLEPRNVSSLDIPSTSVVINASIPLL------------------

Query:  -GNTSSLWISPS---------WKINLSFPQVPWITYYRKNLRKEIAAPTN-IPAPVQSSELTQAQGTTDPDNNAICAENVCVENDIVDLTKLSVEDGKDD
          N +  ++ P+           I L   QVPW TYYR+NLRKE+ +PT+  PAPVQ+ E  + QG  +P     C  N   END  D+  L      ++
Subjt:  -GNTSSLWISPS---------WKINLSFPQVPWITYYRKNLRKEIAAPTN-IPAPVQSSELTQAQGTTDPDNNAICAENVCVENDIVDLTKLSVEDGKDD

Query:  MTENNQVAESDV-VSTIVEENEKGIIPQNPTTGEKLDKPGECDTNLDLPIALRKGTRSCTKYPMYSFLSYNNLSSKSRAFIVSLDTVTIPKNIYVTMEIP
        M E N   E++V + T  +E E+G          KLD   E D +LD+PIALRKGTRSCTK+P+ +++SY+NLS + RAF  +LD+  IPKNIY  +E P
Subjt:  MTENNQVAESDV-VSTIVEENEKGIIPQNPTTGEKLDKPGECDTNLDLPIALRKGTRSCTKYPMYSFLSYNNLSSKSRAFIVSLDTVTIPKNIYVTMEIP

Query:  EWKAAIMEEMRALEKNHTWELVALPKGHKPVGYKWVFTTKYKFDGTLDRYKARLVA--------------------------------------------
        EWK A+MEEM+ALEKN TWE+ ALPKGHK VG KWVF+ KYK DGTLDR+KARLVA                                            
Subjt:  EWKAAIMEEMRALEKNHTWELVALPKGHKPVGYKWVFTTKYKFDGTLDRYKARLVA--------------------------------------------

Query:  ---------------------------------------------------------------------KRSVSGKIAVLIVYVDDIVLSGDDTAKINKL
                                                                             K S +GKIA+LIVYVDDIVL+GDD  +I++L
Subjt:  ---------------------------------------------------------------------KRSVSGKIAVLIVYVDDIVLSGDDTAKINKL

Query:  KQKMADEFEIKDLGNLKYFLGMEVARSKKGISVSQWKYTLDLMKET------------------------------------------------------
        KQ+M DEFEIKDLGNLKYFLGMEVARSK+GISVSQ KYTLDL+ ET                                                      
Subjt:  KQKMADEFEIKDLGNLKYFLGMEVARSKKGISVSQWKYTLDLMKET------------------------------------------------------

Query:  ---------------------------------------------------GSAIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSGAEAEYRAMSHGICE
                                                           GS IDRKSTSGYCTFVWGNLVTWRSKKQSVVARS AEAEYRAMS GICE
Subjt:  ---------------------------------------------------GSAIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSGAEAEYRAMSHGICE

Query:  EIWLKKVLSDLHQSSELPMKLYCDTKAAISIANNSVQHDRTKHVEIDRHFLKEKLENGSICIPYIPSSQQVVDVLTKGLLKQSFDACVSKMGLIDIYVPS
        EIWL+KVLSDLHQ  E P+KL+CD KAAISIANN VQHDRTKHVEIDRHF+KE+L++GSICIPYIPSSQQ+ DVLTKGLL+  FD CVSK+GLIDIY+P+
Subjt:  EIWLKKVLSDLHQSSELPMKLYCDTKAAISIANNSVQHDRTKHVEIDRHFLKEKLENGSICIPYIPSSQQVVDVLTKGLLKQSFDACVSKMGLIDIYVPS

A0A5A7UGB2 Beta-galactosidase0.0e+0046.27Show/hide
Query:  GQSAVNSSVPSDLPLYPVHPVTTFPTITPAPYLSGPIGNSIELTVGEKLNGQNYFSWSQSVKMILEGRHKFKYLTGEIPKPRPGDPHERIWKGEDSLLRS
        G+S+ +S  P++LP+Y  +PVT+FP  + + Y++G +G+S     GEKLNGQNYFSWSQS+KM LEGR++F +LTGEI +P PGD  ER+WKGEDSL+RS
Subjt:  GQSAVNSSVPSDLPLYPVHPVTTFPTITPAPYLSGPIGNSIELTVGEKLNGQNYFSWSQSVKMILEGRHKFKYLTGEIPKPRPGDPHERIWKGEDSLLRS

Query:  LLIHSMEPQIGKPLLYAATARDIWDAVQKLYSKRQNASRLYTLRKQIYECKQ-----------------------ELVWDCPCGGVQYYKLEEVDQVYDF
        +LI+SMEPQIGKPLLYA TA+D+WD  Q LYSKRQNASRLYTLRKQ++ CKQ                       E VWD P    QY KLEE D+VYDF
Subjt:  LLIHSMEPQIGKPLLYAATARDIWDAVQKLYSKRQNASRLYTLRKQIYECKQ-----------------------ELVWDCPCGGVQYYKLEEVDQVYDF

Query:  LAGLNSKFDAVRSRILGQKPTPTLMEVCSEVRLEEDRTSAMNTVGTSITDSTAFSAKLASTTGDKQNGKPPPVCEHCKKPWHTKDQCWKLHGRPLNGKRR
        LAGLN KFD V  RILGQ+P P+LMEVC EVRLEEDRT+AM  + T   DS AFSA+ ++   DK NGK  PVCEHCKK WHTKDQCWKLHGRP  GK+R
Subjt:  LAGLNSKFDAVRSRILGQKPTPTLMEVCSEVRLEEDRTSAMNTVGTSITDSTAFSAKLASTTGDKQNGKPPPVCEHCKKPWHTKDQCWKLHGRPLNGKRR

Query:  PPNNKPN--QALVSETTSDSQPQRQENYPADNGTASLGEIAHSGISPSLSLLSITGQKPWILDSGATDHLTGTSDNFLSYHPCVSNEKIRIADGTLAPVA
          N K N  +A +SETT  S  Q  +   +   T +LG IA SG+  SL L+S+ G+ PWILDSGATDHLTG+S++F+SY PC  NEKIRIADG+LAP+A
Subjt:  PPNNKPN--QALVSETTSDSQPQRQENYPADNGTASLGEIAHSGISPSLSLLSITGQKPWILDSGATDHLTGTSDNFLSYHPCVSNEKIRIADGTLAPVA

Query:  GKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNCRVAFSPDDVLFQDLSLGKTIGTAQHKR-----------DSISLMVILP--------------
        GKG I PFDG  LQNVLHVPK+SYNLLS+SKITR+L+C+  F P+ V FQD+S G+TIGTA+H R            S+S + +L               
Subjt:  GKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNCRVAFSPDDVLFQDLSLGKTIGTAQHKR-----------DSISLMVILP--------------

Query:  --LG------------------TVLGLVC-----------------YLP--------------------------------------------------I
          LG                   V  L C                 Y P                                                  I
Subjt:  --LG------------------TVLGLVC-----------------YLP--------------------------------------------------I

Query:  FQ-------------LLKLTSNNGRELLTNTLREFLSTKGIVHQSLCAYTPQQNGVAERKNRHILEVARSRF---QPPFHLTYGA---AAHLINRMPSRV
        FQ             +  L S+NGRE   + L EFL++KGIVHQ+ CAYTPQQNGVAERKNRH++EVARS       P +L   A   AAHLINRMPSR+
Subjt:  FQ-------------LLKLTSNNGRELLTNTLREFLSTKGIVHQSLCAYTPQQNGVAERKNRHILEVARSRF---QPPFHLTYGA---AAHLINRMPSRV

Query:  LHFQTPLDYLKLSYPTTRLIPDVPLRVFGCTAFV--LASVQTRLSS----------PLEPRNVSSLDIPSTSVVINASIPLL------------------
        LH QTPLD LK SYP+TRL+ +VPLRVFGCTA+V      QT+ +           PL          PS    +   +                     
Subjt:  LHFQTPLDYLKLSYPTTRLIPDVPLRVFGCTAFV--LASVQTRLSS----------PLEPRNVSSLDIPSTSVVINASIPLL------------------

Query:  -GNTSSLWISPS---------WKINLSFPQVPWITYYRKNLRKEIAAPTN-IPAPVQSSELTQAQGTTDPDNNAICAENVCVENDIVDLTKLSVEDGKDD
          N +  ++ P+           I L   QVPW TYYR+NLRKE+ +PT+  PAPVQ+ E  + QG  +P     C  N   END  D+  L      ++
Subjt:  -GNTSSLWISPS---------WKINLSFPQVPWITYYRKNLRKEIAAPTN-IPAPVQSSELTQAQGTTDPDNNAICAENVCVENDIVDLTKLSVEDGKDD

Query:  MTENNQVAESDV-VSTIVEENEKGIIPQNPTTGEKLDKPGECDTNLDLPIALRKGTRSCTKYPMYSFLSYNNLSSKSRAFIVSLDTVTIPKNIYVTMEIP
        M E N   E++V + T  +E E+G          KLD   E D +LD+PIALRKGTRSCTK+P+ +++SY+NLS + RAF  +LD+  IPKNIY  +E P
Subjt:  MTENNQVAESDV-VSTIVEENEKGIIPQNPTTGEKLDKPGECDTNLDLPIALRKGTRSCTKYPMYSFLSYNNLSSKSRAFIVSLDTVTIPKNIYVTMEIP

Query:  EWKAAIMEEMRALEKNHTWELVALPKGHKPVGYKWVFTTKYKFDGTLDRYKARLVA--------------------------------------------
        EWK A+MEEM+ALEKN TWE+ ALPKGHK VG KWVF+ KYK DGTLDR+KARLVA                                            
Subjt:  EWKAAIMEEMRALEKNHTWELVALPKGHKPVGYKWVFTTKYKFDGTLDRYKARLVA--------------------------------------------

Query:  ---------------------------------------------------------------------KRSVSGKIAVLIVYVDDIVLSGDDTAKINKL
                                                                             K S +GKIA+LIVYVDDIVL+GDD  +I++L
Subjt:  ---------------------------------------------------------------------KRSVSGKIAVLIVYVDDIVLSGDDTAKINKL

Query:  KQKMADEFEIKDLGNLKYFLGMEVARSKKGISVSQWKYTLDLMKET------------------------------------------------------
        KQ+M DEFEIKDLGNLKYFLGMEVARSK+GISVSQ KYTLDL+ ET                                                      
Subjt:  KQKMADEFEIKDLGNLKYFLGMEVARSKKGISVSQWKYTLDLMKET------------------------------------------------------

Query:  ---------------------------------------------------GSAIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSGAEAEYRAMSHGICE
                                                           GS IDRKSTSGYCTFVWGNLVTWRSKKQSVVARS AEAEYRAMS GICE
Subjt:  ---------------------------------------------------GSAIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSGAEAEYRAMSHGICE

Query:  EIWLKKVLSDLHQSSELPMKLYCDTKAAISIANNSVQHDRTKHVEIDRHFLKEKLENGSICIPYIPSSQQVVDVLTKGLLKQSFDACVSKMGLIDIYVPS
        EIWL+KVLSDLHQ  E P+KL+CD KAAISIANN VQHDRTKHVEIDRHF+KE+L++GSICIPYIPSSQQ+ DVLTKGLL+  FD CVSK+GLIDIY+P+
Subjt:  EIWLKKVLSDLHQSSELPMKLYCDTKAAISIANNSVQHDRTKHVEIDRHFLKEKLENGSICIPYIPSSQQVVDVLTKGLLKQSFDACVSKMGLIDIYVPS

A0A5A7UQ67 Beta-galactosidase0.0e+0046.27Show/hide
Query:  GQSAVNSSVPSDLPLYPVHPVTTFPTITPAPYLSGPIGNSIELTVGEKLNGQNYFSWSQSVKMILEGRHKFKYLTGEIPKPRPGDPHERIWKGEDSLLRS
        G+S+ +S  P++LP+Y  +PVT+FP  + + Y++G +G+S     GEKLNGQNYFSWSQS+KM LEGR++F +LTGEI +P PGD  ER+WKGEDSL+RS
Subjt:  GQSAVNSSVPSDLPLYPVHPVTTFPTITPAPYLSGPIGNSIELTVGEKLNGQNYFSWSQSVKMILEGRHKFKYLTGEIPKPRPGDPHERIWKGEDSLLRS

Query:  LLIHSMEPQIGKPLLYAATARDIWDAVQKLYSKRQNASRLYTLRKQIYECKQ-----------------------ELVWDCPCGGVQYYKLEEVDQVYDF
        +LI+SMEPQIGKPLLYA TA+D+WD  Q LYSKRQNASRLYTLRKQ++ CKQ                       E VWD P    QY KLEE D+VYDF
Subjt:  LLIHSMEPQIGKPLLYAATARDIWDAVQKLYSKRQNASRLYTLRKQIYECKQ-----------------------ELVWDCPCGGVQYYKLEEVDQVYDF

Query:  LAGLNSKFDAVRSRILGQKPTPTLMEVCSEVRLEEDRTSAMNTVGTSITDSTAFSAKLASTTGDKQNGKPPPVCEHCKKPWHTKDQCWKLHGRPLNGKRR
        LAGLN KFD V  RILGQ+P P+LMEVC EVRLEEDRT+AM  + T   DS AFSA+ ++   DK NGK  PVCEHCKK WHTKDQCWKLHGRP  GK+R
Subjt:  LAGLNSKFDAVRSRILGQKPTPTLMEVCSEVRLEEDRTSAMNTVGTSITDSTAFSAKLASTTGDKQNGKPPPVCEHCKKPWHTKDQCWKLHGRPLNGKRR

Query:  PPNNKPN--QALVSETTSDSQPQRQENYPADNGTASLGEIAHSGISPSLSLLSITGQKPWILDSGATDHLTGTSDNFLSYHPCVSNEKIRIADGTLAPVA
          N K N  +A +SETT  S  Q  +   +   T +LG IA SG+  SL L+S+ G+ PWILDSGATDHLTG+S++F+SY PC  NEKIRIADG+LAP+A
Subjt:  PPNNKPN--QALVSETTSDSQPQRQENYPADNGTASLGEIAHSGISPSLSLLSITGQKPWILDSGATDHLTGTSDNFLSYHPCVSNEKIRIADGTLAPVA

Query:  GKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNCRVAFSPDDVLFQDLSLGKTIGTAQHKR-----------DSISLMVILP--------------
        GKG I PFDG  LQNVLHVPK+SYNLLS+SKITR+L+C+  F P+ V FQD+S G+TIGTA+H R            S+S + +L               
Subjt:  GKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNCRVAFSPDDVLFQDLSLGKTIGTAQHKR-----------DSISLMVILP--------------

Query:  --LG------------------TVLGLVC-----------------YLP--------------------------------------------------I
          LG                   V  L C                 Y P                                                  I
Subjt:  --LG------------------TVLGLVC-----------------YLP--------------------------------------------------I

Query:  FQ-------------LLKLTSNNGRELLTNTLREFLSTKGIVHQSLCAYTPQQNGVAERKNRHILEVARSRF---QPPFHLTYGA---AAHLINRMPSRV
        FQ             +  L S+NGRE   + L EFL++KGIVHQ+ CAYTPQQNGVAERKNRH++EVARS       P +L   A   AAHLINRMPSR+
Subjt:  FQ-------------LLKLTSNNGRELLTNTLREFLSTKGIVHQSLCAYTPQQNGVAERKNRHILEVARSRF---QPPFHLTYGA---AAHLINRMPSRV

Query:  LHFQTPLDYLKLSYPTTRLIPDVPLRVFGCTAFV--LASVQTRLSS----------PLEPRNVSSLDIPSTSVVINASIPLL------------------
        LH QTPLD LK SYP+TRL+ +VPLRVFGCTA+V      QT+ +           PL          PS    +   +                     
Subjt:  LHFQTPLDYLKLSYPTTRLIPDVPLRVFGCTAFV--LASVQTRLSS----------PLEPRNVSSLDIPSTSVVINASIPLL------------------

Query:  -GNTSSLWISPS---------WKINLSFPQVPWITYYRKNLRKEIAAPTN-IPAPVQSSELTQAQGTTDPDNNAICAENVCVENDIVDLTKLSVEDGKDD
          N +  ++ P+           I L   QVPW TYYR+NLRKE+ +PT+  PAPVQ+ E  + QG  +P     C  N   END  D+  L      ++
Subjt:  -GNTSSLWISPS---------WKINLSFPQVPWITYYRKNLRKEIAAPTN-IPAPVQSSELTQAQGTTDPDNNAICAENVCVENDIVDLTKLSVEDGKDD

Query:  MTENNQVAESDV-VSTIVEENEKGIIPQNPTTGEKLDKPGECDTNLDLPIALRKGTRSCTKYPMYSFLSYNNLSSKSRAFIVSLDTVTIPKNIYVTMEIP
        M E N   E++V + T  +E E+G          KLD   E D +LD+PIALRKGTRSCTK+P+ +++SY+NLS + RAF  +LD+  IPKNIY  +E P
Subjt:  MTENNQVAESDV-VSTIVEENEKGIIPQNPTTGEKLDKPGECDTNLDLPIALRKGTRSCTKYPMYSFLSYNNLSSKSRAFIVSLDTVTIPKNIYVTMEIP

Query:  EWKAAIMEEMRALEKNHTWELVALPKGHKPVGYKWVFTTKYKFDGTLDRYKARLVA--------------------------------------------
        EWK A+MEEM+ALEKN TWE+ ALPKGHK VG KWVF+ KYK DGTLDR+KARLVA                                            
Subjt:  EWKAAIMEEMRALEKNHTWELVALPKGHKPVGYKWVFTTKYKFDGTLDRYKARLVA--------------------------------------------

Query:  ---------------------------------------------------------------------KRSVSGKIAVLIVYVDDIVLSGDDTAKINKL
                                                                             K S +GKIA+LIVYVDDIVL+GDD  +I++L
Subjt:  ---------------------------------------------------------------------KRSVSGKIAVLIVYVDDIVLSGDDTAKINKL

Query:  KQKMADEFEIKDLGNLKYFLGMEVARSKKGISVSQWKYTLDLMKET------------------------------------------------------
        KQ+M DEFEIKDLGNLKYFLGMEVARSK+GISVSQ KYTLDL+ ET                                                      
Subjt:  KQKMADEFEIKDLGNLKYFLGMEVARSKKGISVSQWKYTLDLMKET------------------------------------------------------

Query:  ---------------------------------------------------GSAIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSGAEAEYRAMSHGICE
                                                           GS IDRKSTSGYCTFVWGNLVTWRSKKQSVVARS AEAEYRAMS GICE
Subjt:  ---------------------------------------------------GSAIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSGAEAEYRAMSHGICE

Query:  EIWLKKVLSDLHQSSELPMKLYCDTKAAISIANNSVQHDRTKHVEIDRHFLKEKLENGSICIPYIPSSQQVVDVLTKGLLKQSFDACVSKMGLIDIYVPS
        EIWL+KVLSDLHQ  E P+KL+CD KAAISIANN VQHDRTKHVEIDRHF+KE+L++GSICIPYIPSSQQ+ DVLTKGLL+  FD CVSK+GLIDIY+P+
Subjt:  EIWLKKVLSDLHQSSELPMKLYCDTKAAISIANNSVQHDRTKHVEIDRHFLKEKLENGSICIPYIPSSQQVVDVLTKGLLKQSFDACVSKMGLIDIYVPS

A0A5D3BDV4 Beta-galactosidase0.0e+0047.17Show/hide
Query:  GQSAVNSSVPSDLPLYPVHPVTTFPTITPAPYLSGPIGNSIELTVGEKLNGQNYFSWSQSVKMILEGRHKFKYLTGEIPKPRPGDPHERIWKGEDSLLRS
        G+S+ +S  P++LP+Y  +PVT+FP  + + Y++G +G+S     GEKLNGQNYFSWSQS+KM LEGR++F +LTGEI +P PGD  ER+WKGEDSL+RS
Subjt:  GQSAVNSSVPSDLPLYPVHPVTTFPTITPAPYLSGPIGNSIELTVGEKLNGQNYFSWSQSVKMILEGRHKFKYLTGEIPKPRPGDPHERIWKGEDSLLRS

Query:  LLIHSMEPQIGKPLLYAATARDIWDAVQKLYSKRQNASRLYTLRKQIYECKQ-----------------------ELVWDCPCGGVQYYKLEEVDQVYDF
        +LI+SMEPQIGKPLLYA TA+D+WD  Q LYSKRQNASRLYTLRKQ++ CKQ                       E VWD P    QY KLEE D+VYDF
Subjt:  LLIHSMEPQIGKPLLYAATARDIWDAVQKLYSKRQNASRLYTLRKQIYECKQ-----------------------ELVWDCPCGGVQYYKLEEVDQVYDF

Query:  LAGLNSKFDAVRSRILGQKPTPTLMEVCSEVRLEEDRTSAMNTVGTSITDSTAFSAKLASTTGDKQNGKPPPVCEHCKKPWHTKDQCWKLHGRPLNGKRR
        LAGLN KFD V  RILGQ+P P+LMEVC EVRLEEDRT+AM  + T   DS AFSA+ ++   DK NGK  PVCEHCKK WHTKDQCWKLHGRP  GK+R
Subjt:  LAGLNSKFDAVRSRILGQKPTPTLMEVCSEVRLEEDRTSAMNTVGTSITDSTAFSAKLASTTGDKQNGKPPPVCEHCKKPWHTKDQCWKLHGRPLNGKRR

Query:  PPNNKPN--QALVSETTSDSQPQRQENYPADNGTASLGEIAHSGISPSLSLLSITGQKPWILDSGATDHLTGTSDNFLSYHPCVSNEKIRIADGTLAPVA
          N K N  +A +SETT  S  Q  +   +   T +LG IA SG+  SL L+S+ G+ PWILDSGATDHLTG+S++F+SY PC  NEKIRIADG+LAP+A
Subjt:  PPNNKPN--QALVSETTSDSQPQRQENYPADNGTASLGEIAHSGISPSLSLLSITGQKPWILDSGATDHLTGTSDNFLSYHPCVSNEKIRIADGTLAPVA

Query:  GKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNCRVAFSPDDVLFQDLSLGKTIGTAQHKR-----------DSISLMVILP--------------
        GKG I PFDG  LQNVLHVPK+SYNLLS+SKITR+L+C+  F P+ V FQD+S G+TIGTA+H R            S+S + +L               
Subjt:  GKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNCRVAFSPDDVLFQDLSLGKTIGTAQHKR-----------DSISLMVILP--------------

Query:  --LG------------------TVLGLVC-----------------YLP--------------------------------------------------I
          LG                   V  L C                 Y P                                                  I
Subjt:  --LG------------------TVLGLVC-----------------YLP--------------------------------------------------I

Query:  FQ-------------LLKLTSNNGRELLTNTLREFLSTKGIVHQSLCAYTPQQNGVAERKNRHILEVARSRF---QPPFHLTYGA---AAHLINRMPSRV
        FQ             +  L S+NGRE   + L EFL++KGIVHQ+ CAYTPQQNGVAERKNRH++EVARS       P +L   A   AAHLINRMPSR+
Subjt:  FQ-------------LLKLTSNNGRELLTNTLREFLSTKGIVHQSLCAYTPQQNGVAERKNRHILEVARSRF---QPPFHLTYGA---AAHLINRMPSRV

Query:  LHFQTPLDYLKLSYPTTRLIPDVPLRVFGCTAFV--LASVQTRLSS----------PLEPRNVSSLDIPSTSVVINASIPLL------------------
        LH QTPLD LK SYP+TRL+ +VPLRVFGCTA+V      QT+ +           PL          PS    +   +                     
Subjt:  LHFQTPLDYLKLSYPTTRLIPDVPLRVFGCTAFV--LASVQTRLSS----------PLEPRNVSSLDIPSTSVVINASIPLL------------------

Query:  -GNTSSLWISPS---------WKINLSFPQVPWITYYRKNLRKEIAAPTN-IPAPVQSSELTQAQGTTDPDNNAICAENVCVENDIVDLTKLSVEDGKDD
          N +  ++ P+           I L   QVPW TYYR+NLRKE+ +PT+  PAPVQ+ E  + QG  +P     C  N   END  D+  L      ++
Subjt:  -GNTSSLWISPS---------WKINLSFPQVPWITYYRKNLRKEIAAPTN-IPAPVQSSELTQAQGTTDPDNNAICAENVCVENDIVDLTKLSVEDGKDD

Query:  MTENNQVAESDV-VSTIVEENEKGIIPQNPTTGEKLDKPGECDTNLDLPIALRKGTRSCTKYPMYSFLSYNNLSSKSRAFIVSLDTVTIPKNIYVTMEIP
        M E N   E++V + T  +E E+G          KLD   E D +LD+PIALRKGTRSCTK+P+ +++SY+NLS + RAF  +LD+  IPKNIY  +E P
Subjt:  MTENNQVAESDV-VSTIVEENEKGIIPQNPTTGEKLDKPGECDTNLDLPIALRKGTRSCTKYPMYSFLSYNNLSSKSRAFIVSLDTVTIPKNIYVTMEIP

Query:  EWKAAIMEEMRALEKNHTWELVALPKGHKPVGYKWVFTTKYKFDGTLDRYKARLVA--------------------------------------------
        EWK A+MEEM+ALEKN TWE+ ALPKGHK VG KWVF+ KYK DGTLDR+KARLVA                                            
Subjt:  EWKAAIMEEMRALEKNHTWELVALPKGHKPVGYKWVFTTKYKFDGTLDRYKARLVA--------------------------------------------

Query:  ---------------------------------------------------------------------KRSVSGKIAVLIVYVDDIVLSGDDTAKINKL
                                                                             K S +GKIA+LIVYVDDIVL+GDD  +I++L
Subjt:  ---------------------------------------------------------------------KRSVSGKIAVLIVYVDDIVLSGDDTAKINKL

Query:  KQKMADEFEIKDLGNLKYFLGMEVARSKKGISVSQWKYTLDLMKETG-----------------------------------------------------
        KQ+M DEFEIKDLGNLKYFLGMEVARSK+GISVSQ KYTLDL+ ETG                                                     
Subjt:  KQKMADEFEIKDLGNLKYFLGMEVARSKKGISVSQWKYTLDLMKETG-----------------------------------------------------

Query:  -----------------SAIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSGAEAEYRAMSHGICEEIWLKKVLSDLHQSSELPMKLYCDTKAAISIANNS
                         + I RKSTSGYCTFVWGNLVTWRSKKQSVVARS AEAEYRAMS GICEEIWL+KVLSDLHQ  E P+KL+CD KAAISIANN 
Subjt:  -----------------SAIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSGAEAEYRAMSHGICEEIWLKKVLSDLHQSSELPMKLYCDTKAAISIANNS

Query:  VQHDRTKHVEIDRHFLKEKLENGSICIPYIPSSQQVVDVLTKGLLKQSFDACVSKMGLIDIYVPS
        VQHDRTKHVEIDRHF+KE+L++GSICIPYIPSSQQ+ DVLTKGLL+  FD CVSK+G IDIY+P+
Subjt:  VQHDRTKHVEIDRHFLKEKLENGSICIPYIPSSQQVVDVLTKGLLKQSFDACVSKMGLIDIYVPS

A0A5D3E603 Beta-galactosidase0.0e+0047.74Show/hide
Query:  GQSAVNSSVPSDLPLYPVHPVTTFPTITPAPYLSGPIGNSIELTVGEKLNGQNYFSWSQSVKMILEGRHKFKYLTGEIPKPRPGDPHERIWKGEDSLLRS
        G+S+ +S  P++LP+Y  +PVT+FP  + + Y++G +G+S     GEKLNGQNYFSWSQS+KM LEGR++F +LTGEI +P PGD  ER+WKGEDSL+RS
Subjt:  GQSAVNSSVPSDLPLYPVHPVTTFPTITPAPYLSGPIGNSIELTVGEKLNGQNYFSWSQSVKMILEGRHKFKYLTGEIPKPRPGDPHERIWKGEDSLLRS

Query:  LLIHSMEPQIGKPLLYAATARDIWDAVQKLYSKRQNASRLYTLRKQIYECKQ-----------------------ELVWDCPCGGVQYYKLEEVDQVYDF
        +LI+SMEPQIGKPLLYA TA+D+WD  Q LYSKRQNASRLYTLRKQ++ CKQ                       E VWD P    QY KLEE D+VYDF
Subjt:  LLIHSMEPQIGKPLLYAATARDIWDAVQKLYSKRQNASRLYTLRKQIYECKQ-----------------------ELVWDCPCGGVQYYKLEEVDQVYDF

Query:  LAGLNSKFDAVRSRILGQKPTPTLMEVCSEVRLEEDRTSAMNTVGTSITDSTAFSAKLASTTGDKQNGKPPPVCEHCKKPWHTKDQCWKLHGRPLNGKRR
        LAGLN KFD V  RILGQ+P P+LMEVC EVRLEEDRT+AM  + T   DS AFSA+ ++   DK NGK  PVCEHCKK WHTKDQCWKLHGRP  GK+R
Subjt:  LAGLNSKFDAVRSRILGQKPTPTLMEVCSEVRLEEDRTSAMNTVGTSITDSTAFSAKLASTTGDKQNGKPPPVCEHCKKPWHTKDQCWKLHGRPLNGKRR

Query:  PPNNKPN--QALVSETTSDSQPQRQENYPADNGTASLGEIAHSGISPSLSLLSITGQKPWILDSGATDHLTGTSDNFLSYHPCVSNEKIRIADGTLAPVA
          N K N  +A +SETT  S  Q  +   +   T +LG IA SG+  SL L+S+ G+ PWILDSGATDHLTG+S++F+SY PC  NEKIRIADG+LAP+A
Subjt:  PPNNKPN--QALVSETTSDSQPQRQENYPADNGTASLGEIAHSGISPSLSLLSITGQKPWILDSGATDHLTGTSDNFLSYHPCVSNEKIRIADGTLAPVA

Query:  GKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNCRVAFSPDDVLFQDLSLGKTIGTAQHKR-----------DSISLMVIL-----------PLGT
        GKG I PFDG  LQNVLHVPK+SYNLLS+SKITR+L+C+  F P+ V FQD+S G+TIGTA+H R            S+S + +L            + +
Subjt:  GKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNCRVAFSPDDVLFQDLSLGKTIGTAQHKR-----------DSISLMVIL-----------PLGT

Query:  VLGLVC--------------YLP--------------------------------------------------IFQ-------------LLKLTSNNGRE
        +   VC              Y P                                                  IFQ             +  L S+NGRE
Subjt:  VLGLVC--------------YLP--------------------------------------------------IFQ-------------LLKLTSNNGRE

Query:  LLTNTLREFLSTKGIVHQSLCAYTPQQNGVAERKNRHILEVARSRF---QPPFHLTYGA---AAHLINRMPSRVLHFQTPLDYLKLSYPTTRLIPDVPLR
           + L EFL++KGIVHQ+ CAYTPQQNGVAERKNRH++EVARS       P +L   A   AAHLINRMPSR+LH QTPLD LK SYP+TRL+ +VPLR
Subjt:  LLTNTLREFLSTKGIVHQSLCAYTPQQNGVAERKNRHILEVARSRF---QPPFHLTYGA---AAHLINRMPSRVLHFQTPLDYLKLSYPTTRLIPDVPLR

Query:  VFGCTAFV--LASVQTRLSS----------PLEPRNVSSLDIPSTSVVINASIPLL-------------------GNTSSLWISPS---------WKINL
        VFGCTA+V      QT+ +           PL          PS    +   +                       N +  ++ P+           I L
Subjt:  VFGCTAFV--LASVQTRLSS----------PLEPRNVSSLDIPSTSVVINASIPLL-------------------GNTSSLWISPS---------WKINL

Query:  SFPQVPWITYYRKNLRKEIAAPTN-IPAPVQSSELTQAQGTTDPDNNAICAENVCVENDIVDLTKLSVEDGKDDMTENNQVAESDV-VSTIVEENEKGII
           QVPW TYYR+NLRKE+ +PT+  PAPVQ+ E  + QG  +P     C  N   END  D+  L      ++M E N   E++V + T  +E E+G  
Subjt:  SFPQVPWITYYRKNLRKEIAAPTN-IPAPVQSSELTQAQGTTDPDNNAICAENVCVENDIVDLTKLSVEDGKDDMTENNQVAESDV-VSTIVEENEKGII

Query:  PQNPTTGEKLDKPGECDTNLDLPIALRKGTRSCTKYPMYSFLSYNNLSSKSRAFIVSLDTVTIPKNIYVTMEIPEWKAAIMEEMRALEKNHTWELVALPK
                KLD   E D +LD+PIALRKGTRSCTK+P+ +++SY+NLS + RAF  +LD+  IPKNIY  +E PEWK A+MEEM+ALEKN TWE+ ALPK
Subjt:  PQNPTTGEKLDKPGECDTNLDLPIALRKGTRSCTKYPMYSFLSYNNLSSKSRAFIVSLDTVTIPKNIYVTMEIPEWKAAIMEEMRALEKNHTWELVALPK

Query:  GHKPVGYKWVFTTKYKFDGTLDRYKARLVA----------------------------------------------------------------------
        GHK VG KWVF+ KYK DGTLDR+KARLVA                                                                      
Subjt:  GHKPVGYKWVFTTKYKFDGTLDRYKARLVA----------------------------------------------------------------------

Query:  -------------------------------------------KRSVSGKIAVLIVYVDDIVLSGDDTAKINKLKQKMADEFEIKDLGNLKYFLGMEVAR
                                                   K S +GKIA+LIVYVDDIVL+GDD  +I++LKQ+M DEFEIKDLGNLKYFLGMEVAR
Subjt:  -------------------------------------------KRSVSGKIAVLIVYVDDIVLSGDDTAKINKLKQKMADEFEIKDLGNLKYFLGMEVAR

Query:  SKKGISVSQWKYTLDLMKETG----------------------------------------------------------------------SAIDRKSTS
        SK+GISVSQ KYTLDL+ ETG                                                                      + I RKSTS
Subjt:  SKKGISVSQWKYTLDLMKETG----------------------------------------------------------------------SAIDRKSTS

Query:  GYCTFVWGNLVTWRSKKQSVVARSGAEAEYRAMSHGICEEIWLKKVLSDLHQSSELPMKLYCDTKAAISIANNSVQHDRTKHVEIDRHFLKEKLENGSIC
        GYCTFVWGNLVTWRSKKQSVVARS AEAEYRAMS GICEEIWL+KVLSDLHQ  E P+KL+CD KAAISIA N VQHDRTKHVEIDRHF+KE+L++GSIC
Subjt:  GYCTFVWGNLVTWRSKKQSVVARSGAEAEYRAMSHGICEEIWLKKVLSDLHQSSELPMKLYCDTKAAISIANNSVQHDRTKHVEIDRHFLKEKLENGSIC

Query:  IPYIPSSQQVVDVLTKGLLKQSFDACVSKMGLIDIYVPS
        IPYIPSSQQ+ DVL KGLL+  FD CVSK+GLIDIY+P+
Subjt:  IPYIPSSQQVVDVLTKGLLKQSFDACVSKMGLIDIYVPS

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.9e-3322.64Show/hide
Query:  QLLKLTSNNGRELLTNTLREFLSTKGIVHQSLCAYTPQQNGVAERKNRHILEVARS---------RFQPPFHLTYGAAAHLINRMPSRVL--HFQTPLDY
        +++ L  +NGRE L+N +R+F   KGI +     +TPQ NGV+ER  R I E AR+          F     LT   A +LINR+PSR L    +TP + 
Subjt:  QLLKLTSNNGRELLTNTLREFLSTKGIVHQSLCAYTPQQNGVAERKNRHILEVARS---------RFQPPFHLTYGAAAHLINRMPSRVL--HFQTPLDY

Query:  LKLSYPTTRLIPDVPLRVFGCTAFV-LASVQTRLSSP--------LEPRNVSSLDIPSTSVVINASIPLLGNTSSLWISPSWKINLSFPQVPWITYYRKN
             P  +      LRVFG T +V + + Q +             EP      D  +   ++   + +  + +++  S + K    F +    +   KN
Subjt:  LKLSYPTTRLIPDVPLRVFGCTAFV-LASVQTRLSSP--------LEPRNVSSLDIPSTSVVINASIPLLGNTSSLWISPSWKINLSFPQVPWITYYRKN

Query:  LRKEIAAPTNIPAPVQSSELTQAQGTTDP----------DNNAICA---ENVCVENDIVDLTKLSVEDGKDDMTENNQVAESDVVSTIVEENEKGIIPQN
           +         P +S E    Q   D           D+  I      N   E D +   K S E  K  + E+ +    D ++       KG    N
Subjt:  LRKEIAAPTNIPAPVQSSELTQAQGTTDP----------DNNAICA---ENVCVENDIVDLTKLSVEDGKDDMTENNQVAESDVVSTIVEENEKGIIPQN

Query:  PT----TGEKLDKPG--ECDTNLDLPIALRKGTRSCTKYPMYSFLSYNNLSSKSRAFIVSLDTV--TIPKN---IYVTMEIPEWKAAIMEEMRALEKNHT
         +    T E L + G      N  + I  R+  R  TK      +SYN   +     +++  T+   +P +   I    +   W+ AI  E+ A + N+T
Subjt:  PT----TGEKLDKPG--ECDTNLDLPIALRKGTRSCTKYPMYSFLSYNNLSSKSRAFIVSLDTV--TIPKN---IYVTMEIPEWKAAIMEEMRALEKNHT

Query:  WELVALPKGHKPVGYKWVFTTKYKFDGTLDRYKARLVAK-------------------------------------------------------------
        W +   P+    V  +WVF+ KY   G   RYKARLVA+                                                             
Subjt:  WELVALPKGHKPVGYKWVFTTKYKFDGTLDRYKARLVAK-------------------------------------------------------------

Query:  -----------------------------------------RSVSGKIAVL-----------IVYVDDIVLSGDDTAKINKLKQKMADEFEIKDLGNLKY
                                                  SV   I +L           ++YVDD+V++  D  ++N  K+ + ++F + DL  +K+
Subjt:  -----------------------------------------RSVSGKIAVL-----------IVYVDDIVLSGDDTAKINKLKQKMADEFEIKDLGNLKY

Query:  FLGMEVARSKKGISVSQWKYTLDLMKE-------------------------------------------------------------------------
        F+G+ +   +  I +SQ  Y   ++ +                                                                         
Subjt:  FLGMEVARSKKGISVSQWKYTLDLMKE-------------------------------------------------------------------------

Query:  ----------------------------------TGSAIDRKSTSGYCTFVWG-NLVTWRSKKQSVVARSGAEAEYRAMSHGICEEIWLKKVLSDLHQSS
                                           GS IDRKST+GY   ++  NL+ W +K+Q+ VA S  EAEY A+   + E +WLK +L+ ++   
Subjt:  ----------------------------------TGSAIDRKSTSGYCTFVWG-NLVTWRSKKQSVVARSGAEAEYRAMSHGICEEIWLKKVLSDLHQSS

Query:  ELPMKLYCDTKAAISIANNSVQHDRTKHVEIDRHFLKEKLENGSICIPYIPSSQQVVDVLTKGLLKQSFDACVSKMGLI
        E P+K+Y D +  ISIANN   H R KH++I  HF +E+++N  IC+ YIP+  Q+ D+ TK L    F     K+GL+
Subjt:  ELPMKLYCDTKAAISIANNSVQHDRTKHVEIDRHFLKEKLENGSICIPYIPSSQQVVDVLTKGLLKQSFDACVSKMGLI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.5e-3721.15Show/hide
Query:  KLNGQNYFS-WSQSVK--MILEGRHKFKYLTGEIPKPRPGDPHERIWKGEDSLLRSLLIHSMEPQIGKPLLYAATARDIWDAVQKLYSKRQNASRLYTLR
        K NG N FS W + ++  +I +G HK      ++   +P       W   D    S +   +   +   ++   TAR IW  ++ LY  +   ++LY L+
Subjt:  KLNGQNYFS-WSQSVK--MILEGRHKFKYLTGEIPKPRPGDPHERIWKGEDSLLRSLLIHSMEPQIGKPLLYAATARDIWDAVQKLYSKRQNASRLYTLR

Query:  KQIYECKQE--------------LVWDCPCGGVQYYKLEEVDQVYDFLAGLNSKFDAVRSRILGQKPTPTLMEVCSEVRLEED-RTSAMNTVGTSITDST
        KQ+Y                   L+      GV   K+EE D+    L  L S +D + + IL  K T  L +V S + L E  R    N     IT+  
Subjt:  KQIYECKQE--------------LVWDCPCGGVQYYKLEEVDQVYDFLAGLNSKFDAVRSRILGQKPTPTLMEVCSEVRLEED-RTSAMNTVGTSITDST

Query:  AFSAKLASTT-------GDKQNGKPPPV--CEHCKKPWHTKDQCWKLHGRPLNGKRRPPNNKPNQALVSETTSDSQPQRQENYPADNGTASLGEIAHSGI
          S + +S         G  +N     V  C +C +P H K  C      P  GK              ET+             +N    L       I
Subjt:  AFSAKLASTT-------GDKQNGKPPPV--CEHCKKPWHTKDQCWKLHGRPLNGKRRPPNNKPNQALVSETTSDSQPQRQENYPADNGTASLGEIAHSGI

Query:  SPSLSLLSITG-QKPWILDSGATDHLTGTSDNFLSYHPCVSNE--KIRIADGTLAPVAGKGHISPFDG----LILQNVLHVPKISYNLLSVSKITRD---
        +     + ++G +  W++D+ A+ H T   D F  Y   V+ +   +++ + + + +AG G I         L+L++V HVP +  NL+S   + RD   
Subjt:  SPSLSLLSITG-QKPWILDSGATDHLTGTSDNFLSYHPCVSNE--KIRIADGTLAPVAGKGHISPFDG----LILQNVLHVPKISYNLLSVSKITRD---

Query:  ----------------------------LNCRVAFSPDDVLFQDLSL----------------------------GKTIGTA-------QHK--------
                                     N  +     +    ++S+                            G T+          QH+        
Subjt:  ----------------------------LNCRVAFSPDDVLFQDLSL----------------------------GKTIGTA-------QHK--------

Query:  ----------RDSISLMVILPLGTVLGLVCYL-------------------PIFQ-------------LLKLTSNNGRELLTNTLREFLSTKGIVHQSLC
                   D    M I  +G     V ++                    +FQ             L +L S+NG E  +    E+ S+ GI H+   
Subjt:  ----------RDSISLMVILPLGTVLGLVCYL-------------------PIFQ-------------LLKLTSNNGRELLTNTLREFLSTKGIVHQSLC

Query:  AYTPQQNGVAERKNRHILEVARSRFQPP------FHLTYGAAAHLINRMPSRVLHFQTPLDYLKLSYPTTRLIPDVPLRVFGCTAFVLASVQTRLSSPLE
          TPQ NGVAER NR I+E  RS  +        +      A +LINR PS  L F+ P         T + +    L+VFGC AF  A V     + L+
Subjt:  AYTPQQNGVAERKNRHILEVARSRFQPP------FHLTYGAAAHLINRMPSRVLHFQTPLDYLKLSYPTTRLIPDVPLRVFGCTAFVLASVQTRLSSPLE

Query:  PRNVSSLDIPSTSVVINASIPLLGNTSSLWISPSWKINLSFPQVPWITYYRKNLRKEIAAPTNIPAPVQSSELTQAQGTTDPDNNAICAENVCVENDIVD
         +++  + I                            +  F    W    +K +R             + SE+  A   ++   N I    V + +   +
Subjt:  PRNVSSLDIPSTSVVINASIPLLGNTSSLWISPSWKINLSFPQVPWITYYRKNLRKEIAAPTNIPAPVQSSELTQAQGTTDPDNNAICAENVCVENDIVD

Query:  LTKLSVEDGKDDMTENNQ-----VAESDVVSTIVEENEKGIIPQNPTTGEKLDKPGECDTNLDLPIALRKGTRSCTKYPMYSFLSYNNLSSKSRAFIVSL
         T  S E   D+++E  +     + + + +   VEE E      +PT GE+  +P            LR+  R   +   Y    Y  +S          
Subjt:  LTKLSVEDGKDDMTENNQ-----VAESDVVSTIVEENEKGIIPQNPTTGEKLDKPGECDTNLDLPIALRKGTRSCTKYPMYSFLSYNNLSSKSRAFIVSL

Query:  DTVTIPKNIYVTMEIPE---WKAAIMEEMRALEKNHTWELVALPKGHKPVGYKWVFTTKYKFDGTLDRYKARLVAKR-----------------------
             P+++   +  PE      A+ EEM +L+KN T++LV LPKG +P+  KWVF  K   D  L RYKARLV K                        
Subjt:  DTVTIPKNIYVTMEIPE---WKAAIMEEMRALEKNHTWELVALPKGHKPVGYKWVFTTKYKFDGTLDRYKARLVAKR-----------------------

Query:  ---------------------------------------SVSGK----------------------------------------------------IAVL
                                                V+GK                                                      +L
Subjt:  ---------------------------------------SVSGK----------------------------------------------------IAVL

Query:  IVYVDDIVLSGDDTAKINKLKQKMADEFEIKDLGNLKYFLGMEVARSK--KGISVSQWKYT----------------------LDLMKE-----------
        ++YVDD+++ G D   I KLK  ++  F++KDLG  +  LGM++ R +  + + +SQ KY                       L L K+           
Subjt:  IVYVDDIVLSGDDTAKINKLKQKMADEFEIKDLGNLKYFLGMEVARSK--KGISVSQWKYT----------------------LDLMKE-----------

Query:  ------------------------------------------------------------------------------TGSAIDRKSTSGYCTFVWGNLV
                                                                                       G   +RKS++GY     G  +
Subjt:  ------------------------------------------------------------------------------TGSAIDRKSTSGYCTFVWGNLV

Query:  TWRSKKQSVVARSGAEAEYRAMSHGICEEIWLKKVLSD--LHQSSELPMKLYCDTKAAISIANNSVQHDRTKHVEIDRHFLKEKLENGSICIPYIPSSQQ
        +W+SK Q  VA S  EAEY A +    E IWLK+ L +  LHQ   +   +YCD+++AI ++ NS+ H RTKH+++  H+++E +++ S+ +  I +++ 
Subjt:  TWRSKKQSVVARSGAEAEYRAMSHGICEEIWLKKVLSD--LHQSSELPMKLYCDTKAAISIANNSVQHDRTKHVEIDRHFLKEKLENGSICIPYIPSSQQ

Query:  VVDVLTKGLLKQSFDACVSKMGL
          D+LTK + +  F+ C   +G+
Subjt:  VVDVLTKGLLKQSFDACVSKMGL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.4e-5121.23Show/hide
Query:  KLNGQNYFSWSQSVKMILEGRHKFKYLTGEIPKPRPG---------DPHERIWKGEDSLLRSLLIHSMEPQIGKPLLYAATARDIWDAVQKLYSKRQ--N
        KL   NY  WS+ V  + +G     +L G    P            +P    WK +D L+ S ++ ++   +   +  A TA  IW+ ++K+Y+     +
Subjt:  KLNGQNYFSWSQSVKMILEGRHKFKYLTGEIPKPRPG---------DPHERIWKGEDSLLRSLLIHSMEPQIGKPLLYAATARDIWDAVQKLYSKRQ--N

Query:  ASRLYTLRKQ-------IYECKQELVWDCPCGGVQYYKLEEVDQVYDFLAGLNSKFDAVRSRILGQKPTPTLMEVCSEVRLEEDRTSAMNTVGTSITDST
         ++L T  KQ       I +  Q LV       +    ++  +QV   L  L  ++  V  +I  +   PTL E+   +   E +  A+++       + 
Subjt:  ASRLYTLRKQ-------IYECKQELVWDCPCGGVQYYKLEEVDQVYDFLAGLNSKFDAVRSRILGQKPTPTLMEVCSEVRLEEDRTSAMNTVGTSITDST

Query:  AFSAKLASTTGDKQNGKPPPVCE-----HCKKPWHTKDQCWKLHGRPLNGKRRPPNNKPNQALVSETTSDSQPQRQENYPADNGTASLGEIAHSGISPSL
        A S +  +TT +  NG      +     +  KPW    Q    +  P N + +P   K     V   ++    Q Q    + N               +L
Subjt:  AFSAKLASTTGDKQNGKPPPVCE-----HCKKPWHTKDQCWKLHGRPLNGKRRPPNNKPNQALVSETTSDSQPQRQENYPADNGTASLGEIAHSGISPSL

Query:  SLLSITGQKPWILDSGATDHLTGTSDNFLSYHPCVSNEKIRIADGTLAPVAGKGHIS---PFDGLILQNVLHVPKISYNLLSVSKITRDLNCRVAFSPDD
        +L S      W+LDSGAT H+T   +N   + P    + + +ADG+  P++  G  S       L L N+L+VP I  NL+SV ++       V F P  
Subjt:  SLLSITGQKPWILDSGATDHLTGTSDNFLSYHPCVSNEKIRIADGTLAPVAGKGHIS---PFDGLILQNVLHVPKISYNLLSVSKITRDLNCRVAFSPDD

Query:  VLFQDLSLG-------------------------------KTIGTAQHKR-------------DSISLMVI-----------------------------
           +DL+ G                               K   ++ H R              + SL V+                             
Subjt:  VLFQDLSLG-------------------------------KTIGTAQHKR-------------DSISLMVI-----------------------------

Query:  -LPLGTVLGLVCYLPIF-----------------------------------------------QLLKLTSNNGRELLTNTLREFLSTKGIVHQSLCAYT
          PL  +   V   PI                                                ++    S+NG E +   L E+ S  GI H +   +T
Subjt:  -LPLGTVLGLVCYLPIF-----------------------------------------------QLLKLTSNNGRELLTNTLREFLSTKGIVHQSLCAYT

Query:  PQQNGVAERKNRHILEVARSRFQ----PPFHLTY--GAAAHLINRMPSRVLHFQTPLDYLKLSYPTTRLIPDVPLRVFGCTAFVLASVQTRLSSPLEPRN
        P+ NG++ERK+RHI+E   +       P  +  Y    A +LINR+P+ +L  ++P   L  + P         LRVFGC  +            L P N
Subjt:  PQQNGVAERKNRHILEVARSRFQ----PPFHLTY--GAAAHLINRMPSRVLHFQTPLDYLKLSYPTTRLIPDVPLRVFGCTAFVLASVQTRLSSPLEPRN

Query:  VSSLDIPSTSVVI-------NASIPLLGNTSSLWISPSWKINLS-FPQVPWITYYR--KNLRKEIAA--------PTNIPAPVQSSELTQAQGTTDPDNN
           LD  S   V        +A + L   TS L+IS   + + + FP   ++      +  R+E +         PT  P     S        T P + 
Subjt:  VSSLDIPSTSVVI-------NASIPLLGNTSSLWISPSWKINLS-FPQVPWITYYR--KNLRKEIAA--------PTNIPAPVQSSELTQAQGTTDPDNN

Query:  AICAENVCVENDIVDLTKLSVEDGKDDMT---ENNQVAESDVVSTIVEENEKGIIPQNPTTGEKLDKPGECDTNLDLPIALRKGTRSCTKY---------
        +    N  V +  +D +  S      + T   +N     +    T  + +      QN  T E    P +   +L  P      + S T           
Subjt:  AICAENVCVENDIVDLTKLSVEDGKDDMT---ENNQVAESDVVSTIVEENEKGIIPQNPTTGEKLDKPGECDTNLDLPIALRKGTRSCTKY---------

Query:  ----------PMYSFLSYNN--------LSSKSRAFI----------VSLDTVTIPKNIYVTMEIPEWKAAIMEEMRALEKNHTWELVALPKGHKP-VGY
                  P+   ++ NN        + ++++A I          VSL   + P+     ++   W+ A+  E+ A   NHTW+LV  P  H   VG 
Subjt:  ----------PMYSFLSYNN--------LSSKSRAFI----------VSLDTVTIPKNIYVTMEIPEWKAAIMEEMRALEKNHTWELVALPKGHKP-VGY

Query:  KWVFTTKYKFDGTLDRYKARLVAK----------------------------------------------------------------------------
        +W+FT KY  DG+L+RYKARLVAK                                                                            
Subjt:  KWVFTTKYKFDGTLDRYKARLVAK----------------------------------------------------------------------------

Query:  ---------------------------RSVS---------GK-IAVLIVYVDDIVLSGDDTAKINKLKQKMADEFEIKDLGNLKYFLGMEVARSKKGISV
                                    SVS         GK I  ++VYVDDI+++G+D   ++     ++  F +KD   L YFLG+E  R   G+ +
Subjt:  ---------------------------RSVS---------GK-IAVLIVYVDDIVLSGDDTAKINKLKQKMADEFEIKDLGNLKYFLGMEVARSKKGISV

Query:  SQWKYTLDLMKET---------------------------------------------------------------------------------------
        SQ +Y LDL+  T                                                                                       
Subjt:  SQWKYTLDLMKET---------------------------------------------------------------------------------------

Query:  ------------------GSAIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSGAEAEYRAMSHGICEEIWLKKVLSDLHQSSELPMKLYCDTKAAISIAN
                          G   D  ST+GY  ++  + ++W SKKQ  V RS  EAEYR++++   E  W+  +L++L      P  +YCD   A  +  
Subjt:  ------------------GSAIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSGAEAEYRAMSHGICEEIWLKKVLSDLHQSSELPMKLYCDTKAAISIAN

Query:  NSVQHDRTKHVEIDRHFLKEKLENGSICIPYIPSSQQVVDVLTKGLLKQSFDACVSKMGL
        N V H R KH+ ID HF++ ++++G++ + ++ +  Q+ D LTK L + +F    SK+G+
Subjt:  NSVQHDRTKHVEIDRHFLKEKLENGSICIPYIPSSQQVVDVLTKGLLKQSFDACVSKMGL

Q9C811 Nuclear pore complex protein NUP1606.2e-6840.83Show/hide
Query:  RRMPLAGMETSIMATDS-VEFIDLSLPSSTSTQATITSVASQ----ATTSLTEDHASCVVVRDPPTYLIWRIRKILPHVVELLEFSTANEFRRVGLRIAF
        RR P AGME  +    + V++I++S+PS + + ++I + +S+        L+ED+AS  V+ +P    +WRI K  P+ +ELL+ S  + F   GLR  F
Subjt:  RRMPLAGMETSIMATDS-VEFIDLSLPSSTSTQATITSVASQ----ATTSLTEDHASCVVVRDPPTYLIWRIRKILPHVVELLEFSTANEFRRVGLRIAF

Query:  PETLHPTVFVCKNEISSSTRYPFLIYAVTSSGVAYCLKLRNISSYVSYSVIPLDEVVEFNLQTHPDNKPVTCVSAISGCLVIGRNDGSVSCYKLGSLDQH
         +TL P  FV  +E   S R  + +Y++T SGV Y LKL N  +Y S SV PLD ++  +++ + +   VT V+A  G + +GR+DG VSC++     Q 
Subjt:  PETLHPTVFVCKNEISSSTRYPFLIYAVTSSGVAYCLKLRNISSYVSYSVIPLDEVVEFNLQTHPDNKPVTCVSAISGCLVIGRNDGSVSCYKLGSLDQH

Query:  TPGFEHELRDDSGFNRLLGFISRVKLAGAVQDMVISEVCGRQFLFVLHSDGVLRVWDLSCHSRVLNHSMNIATLAEARFVRLWVGQASADMSTIPLAILY
        + GF  ELRDD+GF RL        +  AVQD+ ISEV GR +L VLH+DG LRVWD+  +SRVL  S+    L     VRLW+G+A  D   IPLA+LY
Subjt:  TPGFEHELRDDSGFNRLLGFISRVKLAGAVQDMVISEVCGRQFLFVLHSDGVLRVWDLSCHSRVLNHSMNIATLAEARFVRLWVGQASADMSTIPLAILY

Query:  KHTEDLNLEMIYVYSLRFSWGDRVGLLMEPSTHNIPLNQGGCLDVKLASDKIWVLKSNGL
        + + + ++++I VY L FS  + + L ++    NIPL +G   DV+  SDKIW LK+N L
Subjt:  KHTEDLNLEMIYVYSLRFSWGDRVGLLMEPSTHNIPLNQGGCLDVKLASDKIWVLKSNGL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.1e-4819.84Show/hide
Query:  KLNGQNYFSWSQSVKMILEGRHKFKYLTGEIPKPRPG---------DPHERIWKGEDSLLRSLLIHSMEPQIGKPLLYAATARDIWDAVQKLYSKRQNAS
        KL   NY  WS+ V  + +G     +L G  P P            +P    W+ +D L+ S ++ ++   +   +  A TA  IW+ ++K+Y+      
Subjt:  KLNGQNYFSWSQSVKMILEGRHKFKYLTGEIPKPRPG---------DPHERIWKGEDSLLRSLLIHSMEPQIGKPLLYAATARDIWDAVQKLYSKRQNAS

Query:  RLYTLRKQIYECKQELVWDCPCGGVQYYKLEEVDQVYDFLAGLNSKFDAVRSRILGQKPTPTLMEVCSEVRLEEDRTSAMNT------VGTSITDSTAFS
           T  + I    Q  +   P        ++  +QV   L  L   +  V  +I  +   P+L E+   +   E +  A+N+          +T     +
Subjt:  RLYTLRKQIYECKQELVWDCPCGGVQYYKLEEVDQVYDFLAGLNSKFDAVRSRILGQKPTPTLMEVCSEVRLEEDRTSAMNT------VGTSITDSTAFS

Query:  AKLASTTGDKQNGKPPPVCEHCKKPWHTKDQCWKLHGRPLNGKRRPPNNKPNQAL-------VSETTSDSQPQRQENYPADNGTASLGEIAHSGISPSLS
         +  +  GD +N              + +   W    +P +   R  N +P   L       V   ++   PQ  +     N   S           +L+
Subjt:  AKLASTTGDKQNGKPPPVCEHCKKPWHTKDQCWKLHGRPLNGKRRPPNNKPNQAL-------VSETTSDSQPQRQENYPADNGTASLGEIAHSGISPSLS

Query:  LLSITGQKPWILDSGATDHLTGTSDNFLSYHPCVSNEKIRIADGTLAPVAGKGHIS---PFDGLILQNVLHVPKISYNLLSVSKITRDLNCRVAFSPDDV
        + S      W+LDSGAT H+T   +N   + P    + + IADG+  P+   G  S       L L  VL+VP I  NL+SV ++       V F P   
Subjt:  LLSITGQKPWILDSGATDHLTGTSDNFLSYHPCVSNEKIRIADGTLAPVAGKGHIS---PFDGLILQNVLHVPKISYNLLSVSKITRDLNCRVAFSPDDV

Query:  LFQDLSLG-------------------------------KTIGTAQHKRDSISLMVIL------------------------------------------
          +DL+ G                               K   ++ H R     + IL                                          
Subjt:  LFQDLSLG-------------------------------KTIGTAQHKRDSISLMVIL------------------------------------------

Query:  -PLGTVLGLVCYLPIFQL---------------------LK--------------------------LTSNNGRELLTNTLREFLSTKGIVHQSLCAYTP
         PL  +   V   PI  +                     LK                          L S+NG E +   LR++LS  GI H +   +TP
Subjt:  -PLGTVLGLVCYLPIFQL---------------------LK--------------------------LTSNNGRELLTNTLREFLSTKGIVHQSLCAYTP

Query:  QQNGVAERKNRHILEVARSRFQ----PPFHLTY--GAAAHLINRMPSRVLHFQTPLDYLKLSYPTTRLIPDVPLRVFGCTAFVLASVQTR----------
        + NG++ERK+RHI+E+  +       P  +  Y    A +LINR+P+ +L  Q+P   L    P         L+VFGC  +       R          
Subjt:  QQNGVAERKNRHILEVARSRFQ----PPFHLTY--GAAAHLINRMPSRVLHFQTPLDYLKLSYPTTRLIPDVPLRVFGCTAFVLASVQTR----------

Query:  --LSSPLEPRNVSSLDIPSTSVVINASIP-----LLGNTSSLWIS----------PSWKINLSFPQVPWITYYRKNLRKEI---AAPTNIPAPVQSSELT
          +   L       L IP+  +  +  +         +T++  +S          P+W  + + P  P +      L   +     P + P+P+ +++++
Subjt:  --LSSPLEPRNVSSLDIPSTSVVINASIP-----LLGNTSSLWIS----------PSWKINLSFPQVPWITYYRKNLRKEI---AAPTNIPAPVQSSELT

Query:  QAQGTTDPDNNAICAENVCVENDIVDLTKLSVEDGKDDMTENNQVAESDVVSTIVEE-----------NEKGIIPQNPTTGEKLDKPGECDTNLDLPIAL
         +   +   ++   +E     +           +G     + +Q   S+  S I+             N+   +PQ+P +   +  P    T++  P + 
Subjt:  QAQGTTDPDNNAICAENVCVENDIVDLTKLSVEDGKDDMTENNQVAESDVVSTIVEE-----------NEKGIIPQNPTTGEKLDKPGECDTNLDLPIAL

Query:  RKGTRSCTKYP----------MYSFLSYNNLSSKSRA------------FIVSLDTVTIPKNIYVTMEIPEWKAAIMEEMRALEKNHTWELVALPKGHKP
           + S    P          + +    N  S  +RA            +  SL   + P+     M+   W+ A+  E+ A   NHTW+LV  P     
Subjt:  RKGTRSCTKYP----------MYSFLSYNNLSSKSRA------------FIVSLDTVTIPKNIYVTMEIPEWKAAIMEEMRALEKNHTWELVALPKGHKP

Query:  -VGYKWVFTTKYKFDGTLDRYKARLVAK-----------------------RSVSG--------------------------------------------
         VG +W+FT K+  DG+L+RYKARLVAK                       R V G                                            
Subjt:  -VGYKWVFTTKYKFDGTLDRYKARLVAK-----------------------RSVSG--------------------------------------------

Query:  ----------------------------------------------KIAVLIVYVDDIVLSGDDTAKINKLKQKMADEFEIKDLGNLKYFLGMEVARSKK
                                                       I  ++VYVDDI+++G+DT  +      ++  F +K+  +L YFLG+E  R  +
Subjt:  ----------------------------------------------KIAVLIVYVDDIVLSGDDTAKINKLKQKMADEFEIKDLGNLKYFLGMEVARSKK

Query:  GISVSQWKYTLDLMKET-----------------------------------------------------------------------------------
        G+ +SQ +YTLDL+  T                                                                                   
Subjt:  GISVSQWKYTLDLMKET-----------------------------------------------------------------------------------

Query:  ----------------------GSAIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSGAEAEYRAMSHGICEEIWLKKVLSDLHQSSELPMKLYCDTKAAI
                              G   D  ST+GY  ++  + ++W SKKQ  V RS  EAEYR++++   E  W+  +L++L      P  +YCD   A 
Subjt:  ----------------------GSAIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSGAEAEYRAMSHGICEEIWLKKVLSDLHQSSELPMKLYCDTKAAI

Query:  SIANNSVQHDRTKHVEIDRHFLKEKLENGSICIPYIPSSQQVVDVLTKGLLKQSFDACVSKMGLIDI
         +  N V H R KH+ +D HF++ ++++G++ + ++ +  Q+ D LTK L + +F     K+G+I +
Subjt:  SIANNSVQHDRTKHVEIDRHFLKEKLENGSICIPYIPSSQQVVDVLTKGLLKQSFDACVSKMGLIDI

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.8e-1429.03Show/hide
Query:  NYFSWSQSVKMILEGRHKFKYLTGEIPKPRPGDPHERIWKGEDSLLRSLLIHSMEPQIGKPLLYAATARDIWDAVQKLY------SKRQNASRLYTLRK-
        NY +W    +  L    KF ++ G +PKP P  P  + W+  ++++   L++SM  ++ + ++YA TA  +W+ +++++         Q   RL TLR+ 
Subjt:  NYFSWSQSVKMILEGRHKFKYLTGEIPKPRPGDPHERIWKGEDSLLRSLLIHSMEPQIGKPLLYAATARDIWDAVQKLY------SKRQNASRLYTLRK-

Query:  ----QIYECKQELVW----------DCPCGGV------QYYKLEEVDQVYDFLAG--LNSKFDAVRSRILGQKPTPTLMEVCSEVR
            + Y  K   VW          +C CGG       +  +  E +Q Y+FL G  LN  F+AV ++I+ QKP P+L E  + V+
Subjt:  ----QIYECKQELVW----------DCPCGGV------QYYKLEEVDQVYDFLAG--LNSKFDAVRSRILGQKPTPTLMEVCSEVR

AT1G33410.1 SUPPRESSOR OF AUXIN RESISTANCE14.4e-6940.83Show/hide
Query:  RRMPLAGMETSIMATDS-VEFIDLSLPSSTSTQATITSVASQ----ATTSLTEDHASCVVVRDPPTYLIWRIRKILPHVVELLEFSTANEFRRVGLRIAF
        RR P AGME  +    + V++I++S+PS + + ++I + +S+        L+ED+AS  V+ +P    +WRI K  P+ +ELL+ S  + F   GLR  F
Subjt:  RRMPLAGMETSIMATDS-VEFIDLSLPSSTSTQATITSVASQ----ATTSLTEDHASCVVVRDPPTYLIWRIRKILPHVVELLEFSTANEFRRVGLRIAF

Query:  PETLHPTVFVCKNEISSSTRYPFLIYAVTSSGVAYCLKLRNISSYVSYSVIPLDEVVEFNLQTHPDNKPVTCVSAISGCLVIGRNDGSVSCYKLGSLDQH
         +TL P  FV  +E   S R  + +Y++T SGV Y LKL N  +Y S SV PLD ++  +++ + +   VT V+A  G + +GR+DG VSC++     Q 
Subjt:  PETLHPTVFVCKNEISSSTRYPFLIYAVTSSGVAYCLKLRNISSYVSYSVIPLDEVVEFNLQTHPDNKPVTCVSAISGCLVIGRNDGSVSCYKLGSLDQH

Query:  TPGFEHELRDDSGFNRLLGFISRVKLAGAVQDMVISEVCGRQFLFVLHSDGVLRVWDLSCHSRVLNHSMNIATLAEARFVRLWVGQASADMSTIPLAILY
        + GF  ELRDD+GF RL        +  AVQD+ ISEV GR +L VLH+DG LRVWD+  +SRVL  S+    L     VRLW+G+A  D   IPLA+LY
Subjt:  TPGFEHELRDDSGFNRLLGFISRVKLAGAVQDMVISEVCGRQFLFVLHSDGVLRVWDLSCHSRVLNHSMNIATLAEARFVRLWVGQASADMSTIPLAILY

Query:  KHTEDLNLEMIYVYSLRFSWGDRVGLLMEPSTHNIPLNQGGCLDVKLASDKIWVLKSNGL
        + + + ++++I VY L FS  + + L ++    NIPL +G   DV+  SDKIW LK+N L
Subjt:  KHTEDLNLEMIYVYSLRFSWGDRVGLLMEPSTHNIPLNQGGCLDVKLASDKIWVLKSNGL

AT1G33410.2 SUPPRESSOR OF AUXIN RESISTANCE11.6e-7141.32Show/hide
Query:  RRMPLAGMETSIMATDS-VEFIDLSLPSSTSTQATITSVASQ----ATTSLTEDHASCVVVRDPPTYLIWRIRKILPHVVELLEFSTANEFRRVGLRIAF
        RR P AGME  +    + V++I++S+PS + + ++I + +S+        L+ED+AS  V+ +P    +WRI K  P+ +ELL+ S  + F   GLR  F
Subjt:  RRMPLAGMETSIMATDS-VEFIDLSLPSSTSTQATITSVASQ----ATTSLTEDHASCVVVRDPPTYLIWRIRKILPHVVELLEFSTANEFRRVGLRIAF

Query:  PETLHPTVFVCKNEISSSTRYPFLIYAVTSSGVAYCLKLRNISSYVSYSVIPLDEVVEFNLQTHPDNKPVTCVSAISGCLVIGRNDGSVSCYKLGSLDQH
         +TL P  FV  +E   S R  + +Y++T SGV Y LKL N  +Y S SV PLD ++  +++ + +   VT V+A  G + +GR+DG VSC++     Q 
Subjt:  PETLHPTVFVCKNEISSSTRYPFLIYAVTSSGVAYCLKLRNISSYVSYSVIPLDEVVEFNLQTHPDNKPVTCVSAISGCLVIGRNDGSVSCYKLGSLDQH

Query:  TPGFEHELRDDSGFNRLLGFI---SRVKLAGAVQDMVISEVCGRQFLFVLHSDGVLRVWDLSCHSRVLNHSMNIATLAEARFVRLWVGQASADMSTIPLA
        + GF  ELRDD+GF RL GF+    R  +  AVQD+ ISEV GR +L VLH+DG LRVWD+  +SRVL  S+    L     VRLW+G+A  D   IPLA
Subjt:  TPGFEHELRDDSGFNRLLGFI---SRVKLAGAVQDMVISEVCGRQFLFVLHSDGVLRVWDLSCHSRVLNHSMNIATLAEARFVRLWVGQASADMSTIPLA

Query:  ILYKHTEDLNLEMIYVYSLRFSWGDRVGLLMEPSTHNIPLNQGGCLDVKLASDKIWVLKSNGL
        +LY+ + + ++++I VY L FS  + + L ++    NIPL +G   DV+  SDKIW LK+N L
Subjt:  ILYKHTEDLNLEMIYVYSLRFSWGDRVGLLMEPSTHNIPLNQGGCLDVKLASDKIWVLKSNGL

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.6e-3925.75Show/hide
Query:  SCTKYPMYSFLSYNNLSSKSRAFIVSLDTVTIPKNIYVTMEIPEWKAAIMEEMRALEKNHTWELVALPKGHKPVGYKWVFTTKYKFDGTLDRYKARLVA-
        S T + +  FLSY  +S    +F+V +     P       E   W  A+ +E+ A+E  HTWE+  LP   KP+G KWV+  KY  DGT++RYKARLVA 
Subjt:  SCTKYPMYSFLSYNNLSSKSRAFIVSLDTVTIPKNIYVTMEIPEWKAAIMEEMRALEKNHTWELVALPKGHKPVGYKWVFTTKYKFDGTLDRYKARLVA-

Query:  -------------------------------------------------------------------------------KRSVSG----------KIAV-
                                                                                       K+S+ G          K +V 
Subjt:  -------------------------------------------------------------------------------KRSVSG----------KIAV-

Query:  --------------------------LIVYVDDIVLSGDDTAKINKLKQKMADEFEIKDLGNLKYFLGMEVARSKKGISVSQWKYTLDLMKETG------
                                  ++VYVDDI++  ++ A +++LK ++   F+++DLG LKYFLG+E+ARS  GI++ Q KY LDL+ ETG      
Subjt:  --------------------------LIVYVDDIVLSGDDTAKINKLKQKMADEFEIKDLGNLKYFLGMEVARSKKGISVSQWKYTLDLMKETG------

Query:  -------------------------------------SAID-----------------------------------------------------------
                                             + +D                                                           
Subjt:  -------------------------------------SAID-----------------------------------------------------------

Query:  ---RKSTSGYCTFVWGNLVTWRSKKQSVVARSGAEAEYRAMSHGICEEIWLKKVLSDLHQSSELPMKLYCDTKAAISIANNSVQHDRTKHVEIDRHFLKE
           R+ST+GYC F+  +L++W+SKKQ VV++S AEAEYRA+S    E +WL +   +L      P  L+CD  AAI IA N+V H+RTKH+E D H ++E
Subjt:  ---RKSTSGYCTFVWGNLVTWRSKKQSVVARSGAEAEYRAMSHGICEEIWLKKVLSDLHQSSELPMKLYCDTKAAISIANNSVQHDRTKHVEIDRHFLKE

Query:  K
        +
Subjt:  K

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)8.4e-1251.47Show/hide
Query:  PKNIYVTMEIPEWKAAIMEEMRALEKNHTWELVALPKGHKPVGYKWVFTTKYKFDGTLDRYKARLVAK
        PK++   ++ P W  A+ EE+ AL +N TW LV  P     +G KWVF TK   DGTLDR KARLVAK
Subjt:  PKNIYVTMEIPEWKAAIMEEMRALEKNHTWELVALPKGHKPVGYKWVFTTKYKFDGTLDRYKARLVAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGCGTAGGATGCCGTTGGCCGGAATGGAAACCTCTATCATGGCTACCGACTCTGTAGAATTCATTGATCTTTCTCTTCCTTCTTCCACATCAACACAGGCCACCAT
TACTTCCGTTGCTTCACAAGCTACCACCTCATTGACTGAGGACCATGCCTCCTGCGTTGTCGTACGAGACCCTCCAACCTATCTCATCTGGAGGATTCGGAAGATTCTAC
CTCATGTAGTCGAGTTGCTTGAGTTTTCAACTGCCAACGAGTTTCGTAGGGTTGGTCTGCGAATTGCATTTCCCGAAACACTTCACCCGACCGTGTTTGTTTGCAAAAAT
GAGATTAGTTCGTCTACGAGATATCCTTTCTTGATCTACGCGGTGACTTCGTCCGGTGTCGCTTATTGTTTGAAGCTGCGAAATATTTCCTCATACGTATCCTACTCTGT
CATTCCGCTCGATGAGGTTGTGGAGTTTAATTTGCAAACCCATCCGGACAACAAACCAGTAACCTGCGTATCTGCTATATCTGGATGTCTCGTGATTGGGAGAAATGATG
GATCAGTTTCATGTTACAAACTTGGGTCGCTTGACCAACATACCCCAGGCTTTGAACATGAACTACGGGATGATTCAGGATTCAATCGTCTATTGGGTTTCATTTCAAGG
GTCAAGTTGGCAGGAGCTGTGCAAGATATGGTAATATCTGAGGTCTGTGGAAGACAATTTCTGTTTGTTCTTCATTCTGACGGAGTCTTACGAGTGTGGGATCTTTCATG
TCATAGTAGGGTACTTAATCACTCCATGAATATTGCAACTTTGGCAGAAGCAAGGTTTGTGAGGTTGTGGGTGGGTCAAGCTAGTGCTGATATGAGTACAATTCCTTTGG
CGATTCTCTATAAACACACGGAGGATCTTAACTTGGAAATGATCTACGTGTATAGCTTACGTTTTAGTTGGGGAGATAGGGTTGGTTTATTGATGGAGCCTTCAACACAC
AATATTCCATTGAATCAGGGTGGATGTCTTGATGTGAAACTTGCATCTGATAAGATATGGGTCTTAAAGAGCAACGGATTGATCTTTCTCGACTTGTTAGCCTCTGATGG
TCATTTATCCGCCGCCACCGTCGCCGCTGCTGCCACCACCGAGTCTGCCGTCCAAGCCGCCGCCGCCTTTTTTCAGAGTCTACTACCACAGTCGCAAAACCCTAGCGCCG
CGCCTCTTCAGATTCATCTGCCGCCGATTGGGAACTTTAGATCTACTACTCAGATCGACCAACAAGCCGCCGGCCATCCCGATCTGCGTTTCCCGTCGTTTGTCTCCGTT
GCACAGCCGCACTTCGTGGAGTTAAGTTTCGCGTCTAACCTCGATCTGCACTTCCCTCCGTCCGTTGCTGCTACACAGCCACAGTTTGTGGAGCCGAGATTCGAACACCC
TCAGACCATTGCTGCCGGCGCCGTTATTCCTCCGACCACCGCGGCTACCACTAGATCTGGTGGTTTTTACTCGGCTCCGACTGCTGTTGCCGGATCTAGTGGTTTTTCTT
CAACGCTTTCTCTTCAACAACTCGCCGATCTTCAAGCAAATTTTCAGCAACAAATCGCCGCTCTTGGGGCCACACTGAGTGCTTCTACCCATATAGGTCAAAGTGCAGTA
AATTCAAGTGTCCCGTCAGATTTACCGTTGTATCCAGTGCATCCGGTAACTACATTCCCAACCATTACACCTGCACCTTATTTGTCTGGTCCAATAGGTAACTCCATCGA
GCTAACTGTTGGTGAAAAATTGAATGGCCAAAATTACTTCTCCTGGTCTCAGTCGGTAAAAATGATTCTTGAAGGGCGTCACAAATTCAAGTATTTAACTGGCGAGATAC
CGAAACCCAGACCAGGAGACCCGCACGAGCGCATTTGGAAAGGAGAGGACTCCCTACTCCGGTCACTATTAATTCACAGTATGGAACCTCAAATTGGAAAACCTTTACTG
TATGCTGCCACAGCTCGAGATATTTGGGATGCCGTTCAAAAATTATACTCCAAAAGACAGAACGCCTCTCGACTTTACACACTGCGGAAACAAATCTACGAATGTAAACA
AGAACTTGTCTGGGATTGTCCGTGTGGGGGAGTTCAATATTACAAGCTTGAGGAAGTCGACCAAGTATATGATTTCTTAGCCGGTTTAAACTCCAAATTCGATGCTGTGC
GAAGTCGAATATTGGGACAGAAACCAACACCGACCTTGATGGAAGTTTGTTCAGAAGTTCGGCTTGAGGAAGATAGAACGAGTGCTATGAATACTGTGGGTACCTCCATT
ACTGACTCAACTGCGTTCAGTGCAAAATTAGCTAGTACAACTGGAGACAAGCAGAACGGGAAACCTCCTCCAGTTTGTGAACACTGTAAGAAACCATGGCACACGAAAGA
TCAGTGTTGGAAGCTACATGGTCGGCCTCTGAATGGTAAACGCCGACCTCCGAACAACAAACCCAACCAGGCCTTGGTAAGTGAAACTACAAGTGACTCGCAACCACAAC
GTCAAGAGAACTACCCAGCTGACAATGGTACCGCTTCTCTGGGGGAAATTGCACATTCAGGTATCTCTCCATCCTTGAGTTTACTTAGTATTACTGGCCAGAAACCCTGG
ATTCTTGATTCAGGAGCTACAGACCATTTGACTGGAACTTCTGACAATTTCCTCTCTTATCATCCATGTGTTAGTAATGAAAAAATTCGAATTGCTGATGGGACACTTGC
CCCTGTTGCCGGCAAGGGTCACATTTCTCCTTTCGATGGTTTAATATTACAGAATGTTTTGCATGTCCCCAAAATCTCGTATAATCTACTGTCTGTGAGTAAGATAACCA
GAGATTTAAATTGTCGTGTAGCTTTCTCACCCGATGATGTTCTTTTTCAGGACTTGAGCTTGGGGAAGACGATTGGCACTGCCCAGCACAAAAGGGACTCTATTTCCTTA
ATGGTGATACTTCCTCTAGGCACAGTTCTAGGGCTAGTCTGCTATCTTCCTATTTTTCAACTTCTGAAACTGACTAGTAACAATGGTCGTGAATTACTTACTAATACCCT
TCGCGAGTTCTTATCCACTAAAGGTATCGTTCACCAGAGCTTATGTGCCTATACGCCCCAACAAAATGGAGTGGCTGAAAGGAAAAACCGTCATATCCTCGAAGTTGCTC
GGTCTCGCTTTCAGCCTCCCTTCCATCTTACCTATGGGGCTGCAGCTCATCTTATAAATCGGATGCCTTCTCGTGTTCTCCACTTCCAAACCCCTCTTGACTACCTCAAA
TTGTCTTATCCTACAACTCGCCTTATACCTGATGTCCCTCTCCGAGTGTTTGGGTGTACAGCATTTGTCCTAGCTTCAGTCCAAACCAGACTAAGTTCTCCCCTCGAGCC
CAGAAATGTGTCTTCGTTGGATATCCCTTCCACCAGCGTGGTTATAAATGCTTCCATCCCACTTCTCGGAAATACTTCATCTCTATGGATATCACCTTCCTGGAAGATCA
ACCTTTCTTTCCCACAAGTCCCATGGATAACGTACTACAGGAAGAATCTTAGAAAGGAAATAGCGGCCCCTACTAATATTCCGGCTCCGGTCCAATCATCCGAACTGACA
CAAGCCCAAGGTACCACTGATCCTGATAATAATGCTATTTGCGCTGAAAATGTTTGTGTTGAAAACGATATTGTTGACCTGACAAAACTTTCGGTAGAAGATGGTAAAGA
TGATATGACAGAAAATAACCAGGTTGCTGAAAGTGATGTAGTGTCAACAATAGTTGAGGAGAATGAAAAAGGGATAATACCTCAAAACCCTACAACTGGAGAAAAGCTTG
ACAAGCCAGGAGAGTGTGATACCAATCTAGACCTGCCCATTGCTTTAAGAAAGGGCACAAGATCTTGCACCAAGTATCCTATGTATAGCTTCCTCTCTTATAATAATTTG
TCTTCTAAGTCCAGAGCATTTATTGTTAGCCTTGATACTGTAACCATACCGAAAAACATATATGTGACTATGGAAATCCCTGAGTGGAAGGCTGCTATTATGGAAGAAAT
GAGAGCTCTTGAGAAAAATCATACTTGGGAACTTGTTGCTCTTCCGAAAGGGCATAAACCAGTTGGATATAAATGGGTATTCACAACAAAGTATAAATTTGACGGAACTC
TTGACAGATATAAAGCCAGGCTTGTAGCAAAAAGGTCTGTCTCGGGTAAGATTGCCGTTCTGATTGTATACGTTGATGACATCGTCTTATCTGGAGATGATACTGCTAAA
ATCAACAAGTTAAAACAGAAAATGGCAGACGAATTTGAAATCAAAGACCTAGGAAATCTGAAGTATTTTCTTGGGATGGAGGTGGCAAGATCAAAAAAAGGAATCTCAGT
TTCTCAATGGAAATACACTCTTGATTTAATGAAAGAAACAGGATCAGCAATTGACAGAAAATCTACATCTGGGTATTGCACCTTTGTTTGGGGTAATCTTGTAACTTGGA
GAAGTAAGAAGCAAAGTGTGGTTGCTAGAAGCGGTGCTGAAGCAGAATACAGAGCTATGAGTCATGGAATCTGTGAAGAAATTTGGTTGAAGAAAGTTCTATCTGATCTT
CACCAAAGCAGTGAACTACCTATGAAACTCTATTGTGACACTAAGGCTGCTATTAGTATTGCAAATAATTCGGTTCAGCACGATAGAACAAAACATGTGGAGATCGATAG
ACACTTTTTGAAGGAGAAATTGGAAAACGGAAGCATTTGCATTCCTTATATTCCTTCCAGCCAACAAGTTGTAGATGTTCTTACCAAGGGGTTGCTTAAGCAGAGCTTTG
ATGCATGTGTAAGCAAGATGGGTCTAATTGACATTTACGTTCCATCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGCGTAGGATGCCGTTGGCCGGAATGGAAACCTCTATCATGGCTACCGACTCTGTAGAATTCATTGATCTTTCTCTTCCTTCTTCCACATCAACACAGGCCACCAT
TACTTCCGTTGCTTCACAAGCTACCACCTCATTGACTGAGGACCATGCCTCCTGCGTTGTCGTACGAGACCCTCCAACCTATCTCATCTGGAGGATTCGGAAGATTCTAC
CTCATGTAGTCGAGTTGCTTGAGTTTTCAACTGCCAACGAGTTTCGTAGGGTTGGTCTGCGAATTGCATTTCCCGAAACACTTCACCCGACCGTGTTTGTTTGCAAAAAT
GAGATTAGTTCGTCTACGAGATATCCTTTCTTGATCTACGCGGTGACTTCGTCCGGTGTCGCTTATTGTTTGAAGCTGCGAAATATTTCCTCATACGTATCCTACTCTGT
CATTCCGCTCGATGAGGTTGTGGAGTTTAATTTGCAAACCCATCCGGACAACAAACCAGTAACCTGCGTATCTGCTATATCTGGATGTCTCGTGATTGGGAGAAATGATG
GATCAGTTTCATGTTACAAACTTGGGTCGCTTGACCAACATACCCCAGGCTTTGAACATGAACTACGGGATGATTCAGGATTCAATCGTCTATTGGGTTTCATTTCAAGG
GTCAAGTTGGCAGGAGCTGTGCAAGATATGGTAATATCTGAGGTCTGTGGAAGACAATTTCTGTTTGTTCTTCATTCTGACGGAGTCTTACGAGTGTGGGATCTTTCATG
TCATAGTAGGGTACTTAATCACTCCATGAATATTGCAACTTTGGCAGAAGCAAGGTTTGTGAGGTTGTGGGTGGGTCAAGCTAGTGCTGATATGAGTACAATTCCTTTGG
CGATTCTCTATAAACACACGGAGGATCTTAACTTGGAAATGATCTACGTGTATAGCTTACGTTTTAGTTGGGGAGATAGGGTTGGTTTATTGATGGAGCCTTCAACACAC
AATATTCCATTGAATCAGGGTGGATGTCTTGATGTGAAACTTGCATCTGATAAGATATGGGTCTTAAAGAGCAACGGATTGATCTTTCTCGACTTGTTAGCCTCTGATGG
TCATTTATCCGCCGCCACCGTCGCCGCTGCTGCCACCACCGAGTCTGCCGTCCAAGCCGCCGCCGCCTTTTTTCAGAGTCTACTACCACAGTCGCAAAACCCTAGCGCCG
CGCCTCTTCAGATTCATCTGCCGCCGATTGGGAACTTTAGATCTACTACTCAGATCGACCAACAAGCCGCCGGCCATCCCGATCTGCGTTTCCCGTCGTTTGTCTCCGTT
GCACAGCCGCACTTCGTGGAGTTAAGTTTCGCGTCTAACCTCGATCTGCACTTCCCTCCGTCCGTTGCTGCTACACAGCCACAGTTTGTGGAGCCGAGATTCGAACACCC
TCAGACCATTGCTGCCGGCGCCGTTATTCCTCCGACCACCGCGGCTACCACTAGATCTGGTGGTTTTTACTCGGCTCCGACTGCTGTTGCCGGATCTAGTGGTTTTTCTT
CAACGCTTTCTCTTCAACAACTCGCCGATCTTCAAGCAAATTTTCAGCAACAAATCGCCGCTCTTGGGGCCACACTGAGTGCTTCTACCCATATAGGTCAAAGTGCAGTA
AATTCAAGTGTCCCGTCAGATTTACCGTTGTATCCAGTGCATCCGGTAACTACATTCCCAACCATTACACCTGCACCTTATTTGTCTGGTCCAATAGGTAACTCCATCGA
GCTAACTGTTGGTGAAAAATTGAATGGCCAAAATTACTTCTCCTGGTCTCAGTCGGTAAAAATGATTCTTGAAGGGCGTCACAAATTCAAGTATTTAACTGGCGAGATAC
CGAAACCCAGACCAGGAGACCCGCACGAGCGCATTTGGAAAGGAGAGGACTCCCTACTCCGGTCACTATTAATTCACAGTATGGAACCTCAAATTGGAAAACCTTTACTG
TATGCTGCCACAGCTCGAGATATTTGGGATGCCGTTCAAAAATTATACTCCAAAAGACAGAACGCCTCTCGACTTTACACACTGCGGAAACAAATCTACGAATGTAAACA
AGAACTTGTCTGGGATTGTCCGTGTGGGGGAGTTCAATATTACAAGCTTGAGGAAGTCGACCAAGTATATGATTTCTTAGCCGGTTTAAACTCCAAATTCGATGCTGTGC
GAAGTCGAATATTGGGACAGAAACCAACACCGACCTTGATGGAAGTTTGTTCAGAAGTTCGGCTTGAGGAAGATAGAACGAGTGCTATGAATACTGTGGGTACCTCCATT
ACTGACTCAACTGCGTTCAGTGCAAAATTAGCTAGTACAACTGGAGACAAGCAGAACGGGAAACCTCCTCCAGTTTGTGAACACTGTAAGAAACCATGGCACACGAAAGA
TCAGTGTTGGAAGCTACATGGTCGGCCTCTGAATGGTAAACGCCGACCTCCGAACAACAAACCCAACCAGGCCTTGGTAAGTGAAACTACAAGTGACTCGCAACCACAAC
GTCAAGAGAACTACCCAGCTGACAATGGTACCGCTTCTCTGGGGGAAATTGCACATTCAGGTATCTCTCCATCCTTGAGTTTACTTAGTATTACTGGCCAGAAACCCTGG
ATTCTTGATTCAGGAGCTACAGACCATTTGACTGGAACTTCTGACAATTTCCTCTCTTATCATCCATGTGTTAGTAATGAAAAAATTCGAATTGCTGATGGGACACTTGC
CCCTGTTGCCGGCAAGGGTCACATTTCTCCTTTCGATGGTTTAATATTACAGAATGTTTTGCATGTCCCCAAAATCTCGTATAATCTACTGTCTGTGAGTAAGATAACCA
GAGATTTAAATTGTCGTGTAGCTTTCTCACCCGATGATGTTCTTTTTCAGGACTTGAGCTTGGGGAAGACGATTGGCACTGCCCAGCACAAAAGGGACTCTATTTCCTTA
ATGGTGATACTTCCTCTAGGCACAGTTCTAGGGCTAGTCTGCTATCTTCCTATTTTTCAACTTCTGAAACTGACTAGTAACAATGGTCGTGAATTACTTACTAATACCCT
TCGCGAGTTCTTATCCACTAAAGGTATCGTTCACCAGAGCTTATGTGCCTATACGCCCCAACAAAATGGAGTGGCTGAAAGGAAAAACCGTCATATCCTCGAAGTTGCTC
GGTCTCGCTTTCAGCCTCCCTTCCATCTTACCTATGGGGCTGCAGCTCATCTTATAAATCGGATGCCTTCTCGTGTTCTCCACTTCCAAACCCCTCTTGACTACCTCAAA
TTGTCTTATCCTACAACTCGCCTTATACCTGATGTCCCTCTCCGAGTGTTTGGGTGTACAGCATTTGTCCTAGCTTCAGTCCAAACCAGACTAAGTTCTCCCCTCGAGCC
CAGAAATGTGTCTTCGTTGGATATCCCTTCCACCAGCGTGGTTATAAATGCTTCCATCCCACTTCTCGGAAATACTTCATCTCTATGGATATCACCTTCCTGGAAGATCA
ACCTTTCTTTCCCACAAGTCCCATGGATAACGTACTACAGGAAGAATCTTAGAAAGGAAATAGCGGCCCCTACTAATATTCCGGCTCCGGTCCAATCATCCGAACTGACA
CAAGCCCAAGGTACCACTGATCCTGATAATAATGCTATTTGCGCTGAAAATGTTTGTGTTGAAAACGATATTGTTGACCTGACAAAACTTTCGGTAGAAGATGGTAAAGA
TGATATGACAGAAAATAACCAGGTTGCTGAAAGTGATGTAGTGTCAACAATAGTTGAGGAGAATGAAAAAGGGATAATACCTCAAAACCCTACAACTGGAGAAAAGCTTG
ACAAGCCAGGAGAGTGTGATACCAATCTAGACCTGCCCATTGCTTTAAGAAAGGGCACAAGATCTTGCACCAAGTATCCTATGTATAGCTTCCTCTCTTATAATAATTTG
TCTTCTAAGTCCAGAGCATTTATTGTTAGCCTTGATACTGTAACCATACCGAAAAACATATATGTGACTATGGAAATCCCTGAGTGGAAGGCTGCTATTATGGAAGAAAT
GAGAGCTCTTGAGAAAAATCATACTTGGGAACTTGTTGCTCTTCCGAAAGGGCATAAACCAGTTGGATATAAATGGGTATTCACAACAAAGTATAAATTTGACGGAACTC
TTGACAGATATAAAGCCAGGCTTGTAGCAAAAAGGTCTGTCTCGGGTAAGATTGCCGTTCTGATTGTATACGTTGATGACATCGTCTTATCTGGAGATGATACTGCTAAA
ATCAACAAGTTAAAACAGAAAATGGCAGACGAATTTGAAATCAAAGACCTAGGAAATCTGAAGTATTTTCTTGGGATGGAGGTGGCAAGATCAAAAAAAGGAATCTCAGT
TTCTCAATGGAAATACACTCTTGATTTAATGAAAGAAACAGGATCAGCAATTGACAGAAAATCTACATCTGGGTATTGCACCTTTGTTTGGGGTAATCTTGTAACTTGGA
GAAGTAAGAAGCAAAGTGTGGTTGCTAGAAGCGGTGCTGAAGCAGAATACAGAGCTATGAGTCATGGAATCTGTGAAGAAATTTGGTTGAAGAAAGTTCTATCTGATCTT
CACCAAAGCAGTGAACTACCTATGAAACTCTATTGTGACACTAAGGCTGCTATTAGTATTGCAAATAATTCGGTTCAGCACGATAGAACAAAACATGTGGAGATCGATAG
ACACTTTTTGAAGGAGAAATTGGAAAACGGAAGCATTTGCATTCCTTATATTCCTTCCAGCCAACAAGTTGTAGATGTTCTTACCAAGGGGTTGCTTAAGCAGAGCTTTG
ATGCATGTGTAAGCAAGATGGGTCTAATTGACATTTACGTTCCATCTTGA
Protein sequenceShow/hide protein sequence
MGRRMPLAGMETSIMATDSVEFIDLSLPSSTSTQATITSVASQATTSLTEDHASCVVVRDPPTYLIWRIRKILPHVVELLEFSTANEFRRVGLRIAFPETLHPTVFVCKN
EISSSTRYPFLIYAVTSSGVAYCLKLRNISSYVSYSVIPLDEVVEFNLQTHPDNKPVTCVSAISGCLVIGRNDGSVSCYKLGSLDQHTPGFEHELRDDSGFNRLLGFISR
VKLAGAVQDMVISEVCGRQFLFVLHSDGVLRVWDLSCHSRVLNHSMNIATLAEARFVRLWVGQASADMSTIPLAILYKHTEDLNLEMIYVYSLRFSWGDRVGLLMEPSTH
NIPLNQGGCLDVKLASDKIWVLKSNGLIFLDLLASDGHLSAATVAAAATTESAVQAAAAFFQSLLPQSQNPSAAPLQIHLPPIGNFRSTTQIDQQAAGHPDLRFPSFVSV
AQPHFVELSFASNLDLHFPPSVAATQPQFVEPRFEHPQTIAAGAVIPPTTAATTRSGGFYSAPTAVAGSSGFSSTLSLQQLADLQANFQQQIAALGATLSASTHIGQSAV
NSSVPSDLPLYPVHPVTTFPTITPAPYLSGPIGNSIELTVGEKLNGQNYFSWSQSVKMILEGRHKFKYLTGEIPKPRPGDPHERIWKGEDSLLRSLLIHSMEPQIGKPLL
YAATARDIWDAVQKLYSKRQNASRLYTLRKQIYECKQELVWDCPCGGVQYYKLEEVDQVYDFLAGLNSKFDAVRSRILGQKPTPTLMEVCSEVRLEEDRTSAMNTVGTSI
TDSTAFSAKLASTTGDKQNGKPPPVCEHCKKPWHTKDQCWKLHGRPLNGKRRPPNNKPNQALVSETTSDSQPQRQENYPADNGTASLGEIAHSGISPSLSLLSITGQKPW
ILDSGATDHLTGTSDNFLSYHPCVSNEKIRIADGTLAPVAGKGHISPFDGLILQNVLHVPKISYNLLSVSKITRDLNCRVAFSPDDVLFQDLSLGKTIGTAQHKRDSISL
MVILPLGTVLGLVCYLPIFQLLKLTSNNGRELLTNTLREFLSTKGIVHQSLCAYTPQQNGVAERKNRHILEVARSRFQPPFHLTYGAAAHLINRMPSRVLHFQTPLDYLK
LSYPTTRLIPDVPLRVFGCTAFVLASVQTRLSSPLEPRNVSSLDIPSTSVVINASIPLLGNTSSLWISPSWKINLSFPQVPWITYYRKNLRKEIAAPTNIPAPVQSSELT
QAQGTTDPDNNAICAENVCVENDIVDLTKLSVEDGKDDMTENNQVAESDVVSTIVEENEKGIIPQNPTTGEKLDKPGECDTNLDLPIALRKGTRSCTKYPMYSFLSYNNL
SSKSRAFIVSLDTVTIPKNIYVTMEIPEWKAAIMEEMRALEKNHTWELVALPKGHKPVGYKWVFTTKYKFDGTLDRYKARLVAKRSVSGKIAVLIVYVDDIVLSGDDTAK
INKLKQKMADEFEIKDLGNLKYFLGMEVARSKKGISVSQWKYTLDLMKETGSAIDRKSTSGYCTFVWGNLVTWRSKKQSVVARSGAEAEYRAMSHGICEEIWLKKVLSDL
HQSSELPMKLYCDTKAAISIANNSVQHDRTKHVEIDRHFLKEKLENGSICIPYIPSSQQVVDVLTKGLLKQSFDACVSKMGLIDIYVPS