CuGenDBv2

Spg021770 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg021770
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Beta-galactosidase
Genome location: scaffold2:3820983..3842542
RNA-Seq Expression: Spg021770
Synteny: Spg021770
Gene Ontology terms: NA
InterPro domains:
  IPR017853 - Glycoside hydrolase superfamily
  IPR031330 - Glycoside hydrolase 35, catalytic domain
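
The genome location field above uses a `seqid:start..end` layout. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not say), the string can be split and the locus span computed as follows. The helper names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("scaffold2:3820983..3842542")
print(seqid, span_length(start, end))  # scaffold2 21560
```

Under that assumption the locus spans 21,560 bp on scaffold2, which is the genomic extent of the gene model rather than the length of the spliced CDS.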


Homology
GenBank top hits    e value    % identity    Alignment
KAA0025702.1 FG-GAP repeat-containing protein [Cucumis melo var. makuwa]    1.1e-84    48.92
Query:  AMAMGVIDRHSRLGQPVTQIL-VVTSGWSVMCFDHNLNELWETNLQEDFPH--------------TLKHGDSGLVIIGGEWKC-SHIWNAARFVFSPSMV
        AMA GVIDRH R GQPVTQ+L VVTSGWSVMCFDHNLN+LWETNLQEDFPH              TLKHGDSGLVI+GG  +  SH              
Subjt:  AMAMGVIDRHSRLGQPVTQIL-VVTSGWSVMCFDHNLNELWETNLQEDFPH--------------TLKHGDSGLVIIGGEWKC-SHIWNAARFVFSPSMV

Query:  NIKKFPNGFNHMIFMDPFEEIGVAEKSVEQHRSSATEKEIYAILQFMYLLVDLVYYDGAGR-------------------------MSNYKLDVHSLNAQ
                    IFMDPFEEIG+AEK+ EQHR SATEKE       + L      Y  AGR                           NYKLDVHSLNA+
Subjt:  NIKKFPNGFNHMIFMDPFEEIGVAEKSVEQHRSSATEKEIYAILQFMYLLVDLVYYDGAGR-------------------------MSNYKLDVHSLNAQ

Query:  HPGEFERREFKESILGVMPH-----------------------------------HLPEENHPPGKDSSKKIPKIIGTA---AGSAKTKKPLPYVPTITN
        HPGEFE REF+ESILGVMPH                                   H PEENHPPGKDSSK+IPKIIGTA   AG+AKTKKPLPYVPTITN
Subjt:  HPGEFERREFKESILGVMPH-----------------------------------HLPEENHPPGKDSSKKIPKIIGTA---AGSAKTKKPLPYVPTITN

Query:  YTELWWLPNVAVVHQTQSFNPELEVVDLFNAIERHLLREMEVEESRRASADSMATCQKLKGDLLTVFNVLPKDMQQGGIEALHLASGRTVCKLHLQEGGL
        YT+LWWLPNV V HQ +                                                            GIEALHLASGRT+CKLHLQEGGL
Subjt:  YTELWWLPNVAVVHQTQSFNPELEVVDLFNAIERHLLREMEVEESRRASADSMATCQKLKGDLLTVFNVLPKDMQQGGIEALHLASGRTVCKLHLQEGGL

Query:  HADINGDGAPDHVQA
        HADINGDG  DHVQA
Subjt:  HADINGDGAPDHVQA

XP_004149977.2 uncharacterized protein LOC101223217 isoform X2 [Cucumis sativus]    1.5e-84    48.43
Query:  AMAMGVIDRHSRLGQPVTQIL-VVTSGWSVMCFDHNLNELWETNLQEDFPH--------------TLKHGDSGLVIIGGEWKC-SHIWNAARFVFSPSMV
        AMA GVIDRH R GQPVTQ+L VVTSGWSV+CFDHNLN+LWE NLQEDFPH              TLKHGDSGL+I+GG  +  SH              
Subjt:  AMAMGVIDRHSRLGQPVTQIL-VVTSGWSVMCFDHNLNELWETNLQEDFPH--------------TLKHGDSGLVIIGGEWKC-SHIWNAARFVFSPSMV

Query:  NIKKFPNGFNHMIFMDPFEEIGVAEKSVEQHRSSATEKEIYAILQFMYLLVDLVYYDGAGR-------------------------MSNYKLDVHSLNAQ
                    IFMDPFEEIG+AEK+ EQHR SATEKE       + L     +Y  AGR                           NYKLDVHSLNA+
Subjt:  NIKKFPNGFNHMIFMDPFEEIGVAEKSVEQHRSSATEKEIYAILQFMYLLVDLVYYDGAGR-------------------------MSNYKLDVHSLNAQ

Query:  HPGEFERREFKESILGVMPH-----------------------------------HLPEENHPPGKDSSKKIPKIIGTA---AGSAKTKKPLPYVPTITN
        HPGEFE REF+ESILGVMPH                                   H PEENHPPGKDSSK+IPKIIGTA   AGSAKTKKPLPYVPTITN
Subjt:  HPGEFERREFKESILGVMPH-----------------------------------HLPEENHPPGKDSSKKIPKIIGTA---AGSAKTKKPLPYVPTITN

Query:  YTELWWLPNVAVVHQTQSFNPELEVVDLFNAIERHLLREMEVEESRRASADSMATCQKLKGDLLTVFNVLPKDMQQGGIEALHLASGRTVCKLHLQEGGL
        YT+LWWLPNV V HQ +                                                            GIEALHLASGRT+CKLHLQEGGL
Subjt:  YTELWWLPNVAVVHQTQSFNPELEVVDLFNAIERHLLREMEVEESRRASADSMATCQKLKGDLLTVFNVLPKDMQQGGIEALHLASGRTVCKLHLQEGGL

Query:  HADINGDGAPDHVQA
        HADINGDG  DHVQA
Subjt:  HADINGDGAPDHVQA

XP_008440781.1 PREDICTED: uncharacterized protein LOC103485097 [Cucumis melo]    1.1e-84    48.92
Query:  AMAMGVIDRHSRLGQPVTQIL-VVTSGWSVMCFDHNLNELWETNLQEDFPH--------------TLKHGDSGLVIIGGEWKC-SHIWNAARFVFSPSMV
        AMA GVIDRH R GQPVTQ+L VVTSGWSVMCFDHNLN+LWETNLQEDFPH              TLKHGDSGLVI+GG  +  SH              
Subjt:  AMAMGVIDRHSRLGQPVTQIL-VVTSGWSVMCFDHNLNELWETNLQEDFPH--------------TLKHGDSGLVIIGGEWKC-SHIWNAARFVFSPSMV

Query:  NIKKFPNGFNHMIFMDPFEEIGVAEKSVEQHRSSATEKEIYAILQFMYLLVDLVYYDGAGR-------------------------MSNYKLDVHSLNAQ
                    IFMDPFEEIG+AEK+ EQHR SATEKE       + L      Y  AGR                           NYKLDVHSLNA+
Subjt:  NIKKFPNGFNHMIFMDPFEEIGVAEKSVEQHRSSATEKEIYAILQFMYLLVDLVYYDGAGR-------------------------MSNYKLDVHSLNAQ

Query:  HPGEFERREFKESILGVMPH-----------------------------------HLPEENHPPGKDSSKKIPKIIGTA---AGSAKTKKPLPYVPTITN
        HPGEFE REF+ESILGVMPH                                   H PEENHPPGKDSSK+IPKIIGTA   AG+AKTKKPLPYVPTITN
Subjt:  HPGEFERREFKESILGVMPH-----------------------------------HLPEENHPPGKDSSKKIPKIIGTA---AGSAKTKKPLPYVPTITN

Query:  YTELWWLPNVAVVHQTQSFNPELEVVDLFNAIERHLLREMEVEESRRASADSMATCQKLKGDLLTVFNVLPKDMQQGGIEALHLASGRTVCKLHLQEGGL
        YT+LWWLPNV V HQ +                                                            GIEALHLASGRT+CKLHLQEGGL
Subjt:  YTELWWLPNVAVVHQTQSFNPELEVVDLFNAIERHLLREMEVEESRRASADSMATCQKLKGDLLTVFNVLPKDMQQGGIEALHLASGRTVCKLHLQEGGL

Query:  HADINGDGAPDHVQA
        HADINGDG  DHVQA
Subjt:  HADINGDGAPDHVQA

XP_031743490.1 uncharacterized protein LOC101223217 isoform X1 [Cucumis sativus]    1.9e-84    48.32
Query:  AMAMGVIDRHSRLGQPVTQIL-VVTSGWSVMCFDHNLNELWETNLQEDFPH--------------TLKHGDSGLVIIGGEWKC-SHIWNAARFVFSPSMV
        AMA GVIDRH R GQPVTQ+L VVTSGWSV+CFDHNLN+LWE NLQEDFPH              TLKHGDSGL+I+GG  +  SH              
Subjt:  AMAMGVIDRHSRLGQPVTQIL-VVTSGWSVMCFDHNLNELWETNLQEDFPH--------------TLKHGDSGLVIIGGEWKC-SHIWNAARFVFSPSMV

Query:  NIKKFPNGFNHMIFMDPFEEIGVAEKSVEQHRSSATEKEIYAILQFMYLLVDLVYYDGAGR--------------------------MSNYKLDVHSLNA
                    IFMDPFEEIG+AEK+ EQHR SATEKE       + L     +Y  AGR                            NYKLDVHSLNA
Subjt:  NIKKFPNGFNHMIFMDPFEEIGVAEKSVEQHRSSATEKEIYAILQFMYLLVDLVYYDGAGR--------------------------MSNYKLDVHSLNA

Query:  QHPGEFERREFKESILGVMPH-----------------------------------HLPEENHPPGKDSSKKIPKIIGTA---AGSAKTKKPLPYVPTIT
        +HPGEFE REF+ESILGVMPH                                   H PEENHPPGKDSSK+IPKIIGTA   AGSAKTKKPLPYVPTIT
Subjt:  QHPGEFERREFKESILGVMPH-----------------------------------HLPEENHPPGKDSSKKIPKIIGTA---AGSAKTKKPLPYVPTIT

Query:  NYTELWWLPNVAVVHQTQSFNPELEVVDLFNAIERHLLREMEVEESRRASADSMATCQKLKGDLLTVFNVLPKDMQQGGIEALHLASGRTVCKLHLQEGG
        NYT+LWWLPNV V HQ +                                                            GIEALHLASGRT+CKLHLQEGG
Subjt:  NYTELWWLPNVAVVHQTQSFNPELEVVDLFNAIERHLLREMEVEESRRASADSMATCQKLKGDLLTVFNVLPKDMQQGGIEALHLASGRTVCKLHLQEGG

Query:  LHADINGDGAPDHVQA
        LHADINGDG  DHVQA
Subjt:  LHADINGDGAPDHVQA

XP_038882888.1 uncharacterized protein LOC120074000 [Benincasa hispida]    1.5e-84    48.67
Query:  AMAMGVIDRHSRLGQPVTQIL-VVTSGWSVMCFDHNLNELWETNLQEDFPH--------------TLKHGDSGLVIIGGEWKC-SHIWNAARFVFSPSMV
        AMA GVIDRH R GQPVTQ+L VVTSGWSVMCFDHNLN+LWE NLQEDFPH              TLKHGDSGLVI+GG  +  SH              
Subjt:  AMAMGVIDRHSRLGQPVTQIL-VVTSGWSVMCFDHNLNELWETNLQEDFPH--------------TLKHGDSGLVIIGGEWKC-SHIWNAARFVFSPSMV

Query:  NIKKFPNGFNHMIFMDPFEEIGVAEKSVEQHRSSATEKEIYAILQFMYLLVDLVYYDGAGR-------------------------MSNYKLDVHSLNAQ
                    IFMDPFEEIG+AEK+ EQHR SATEKE       + L     +Y  AGR                           NYKLDVHSLNA+
Subjt:  NIKKFPNGFNHMIFMDPFEEIGVAEKSVEQHRSSATEKEIYAILQFMYLLVDLVYYDGAGR-------------------------MSNYKLDVHSLNAQ

Query:  HPGEFERREFKESILGVMPH-----------------------------------HLPEENHPPGKDSSKKIPKIIGTA---AGSAKTKKPLPYVPTITN
        HPGEFE REF+ESILGVMPH                                   H PEENHPPGKDSSK+IPKIIGTA   AGS KTKKPLPYVPTITN
Subjt:  HPGEFERREFKESILGVMPH-----------------------------------HLPEENHPPGKDSSKKIPKIIGTA---AGSAKTKKPLPYVPTITN

Query:  YTELWWLPNVAVVHQTQSFNPELEVVDLFNAIERHLLREMEVEESRRASADSMATCQKLKGDLLTVFNVLPKDMQQGGIEALHLASGRTVCKLHLQEGGL
        YT+LWWLPNV V HQ +                                                            GIEALHLASGRT+CKLHLQEGGL
Subjt:  YTELWWLPNVAVVHQTQSFNPELEVVDLFNAIERHLLREMEVEESRRASADSMATCQKLKGDLLTVFNVLPKDMQQGGIEALHLASGRTVCKLHLQEGGL

Query:  HADINGDGAPDHVQA
        HADINGDG  DHVQA
Subjt:  HADINGDGAPDHVQA

TrEMBL top hits    e value    % identity    Alignment
A0A0A0KJ99 Uncharacterized protein    7.1e-85    48.43
Query:  AMAMGVIDRHSRLGQPVTQIL-VVTSGWSVMCFDHNLNELWETNLQEDFPH--------------TLKHGDSGLVIIGGEWKC-SHIWNAARFVFSPSMV
        AMA GVIDRH R GQPVTQ+L VVTSGWSV+CFDHNLN+LWE NLQEDFPH              TLKHGDSGL+I+GG  +  SH              
Subjt:  AMAMGVIDRHSRLGQPVTQIL-VVTSGWSVMCFDHNLNELWETNLQEDFPH--------------TLKHGDSGLVIIGGEWKC-SHIWNAARFVFSPSMV

Query:  NIKKFPNGFNHMIFMDPFEEIGVAEKSVEQHRSSATEKEIYAILQFMYLLVDLVYYDGAGR-------------------------MSNYKLDVHSLNAQ
                    IFMDPFEEIG+AEK+ EQHR SATEKE       + L     +Y  AGR                           NYKLDVHSLNA+
Subjt:  NIKKFPNGFNHMIFMDPFEEIGVAEKSVEQHRSSATEKEIYAILQFMYLLVDLVYYDGAGR-------------------------MSNYKLDVHSLNAQ

Query:  HPGEFERREFKESILGVMPH-----------------------------------HLPEENHPPGKDSSKKIPKIIGTA---AGSAKTKKPLPYVPTITN
        HPGEFE REF+ESILGVMPH                                   H PEENHPPGKDSSK+IPKIIGTA   AGSAKTKKPLPYVPTITN
Subjt:  HPGEFERREFKESILGVMPH-----------------------------------HLPEENHPPGKDSSKKIPKIIGTA---AGSAKTKKPLPYVPTITN

Query:  YTELWWLPNVAVVHQTQSFNPELEVVDLFNAIERHLLREMEVEESRRASADSMATCQKLKGDLLTVFNVLPKDMQQGGIEALHLASGRTVCKLHLQEGGL
        YT+LWWLPNV V HQ +                                                            GIEALHLASGRT+CKLHLQEGGL
Subjt:  YTELWWLPNVAVVHQTQSFNPELEVVDLFNAIERHLLREMEVEESRRASADSMATCQKLKGDLLTVFNVLPKDMQQGGIEALHLASGRTVCKLHLQEGGL

Query:  HADINGDGAPDHVQA
        HADINGDG  DHVQA
Subjt:  HADINGDGAPDHVQA

A0A1S3B1X7 uncharacterized protein LOC103485097    5.4e-85    48.92
Query:  AMAMGVIDRHSRLGQPVTQIL-VVTSGWSVMCFDHNLNELWETNLQEDFPH--------------TLKHGDSGLVIIGGEWKC-SHIWNAARFVFSPSMV
        AMA GVIDRH R GQPVTQ+L VVTSGWSVMCFDHNLN+LWETNLQEDFPH              TLKHGDSGLVI+GG  +  SH              
Subjt:  AMAMGVIDRHSRLGQPVTQIL-VVTSGWSVMCFDHNLNELWETNLQEDFPH--------------TLKHGDSGLVIIGGEWKC-SHIWNAARFVFSPSMV

Query:  NIKKFPNGFNHMIFMDPFEEIGVAEKSVEQHRSSATEKEIYAILQFMYLLVDLVYYDGAGR-------------------------MSNYKLDVHSLNAQ
                    IFMDPFEEIG+AEK+ EQHR SATEKE       + L      Y  AGR                           NYKLDVHSLNA+
Subjt:  NIKKFPNGFNHMIFMDPFEEIGVAEKSVEQHRSSATEKEIYAILQFMYLLVDLVYYDGAGR-------------------------MSNYKLDVHSLNAQ

Query:  HPGEFERREFKESILGVMPH-----------------------------------HLPEENHPPGKDSSKKIPKIIGTA---AGSAKTKKPLPYVPTITN
        HPGEFE REF+ESILGVMPH                                   H PEENHPPGKDSSK+IPKIIGTA   AG+AKTKKPLPYVPTITN
Subjt:  HPGEFERREFKESILGVMPH-----------------------------------HLPEENHPPGKDSSKKIPKIIGTA---AGSAKTKKPLPYVPTITN

Query:  YTELWWLPNVAVVHQTQSFNPELEVVDLFNAIERHLLREMEVEESRRASADSMATCQKLKGDLLTVFNVLPKDMQQGGIEALHLASGRTVCKLHLQEGGL
        YT+LWWLPNV V HQ +                                                            GIEALHLASGRT+CKLHLQEGGL
Subjt:  YTELWWLPNVAVVHQTQSFNPELEVVDLFNAIERHLLREMEVEESRRASADSMATCQKLKGDLLTVFNVLPKDMQQGGIEALHLASGRTVCKLHLQEGGL

Query:  HADINGDGAPDHVQA
        HADINGDG  DHVQA
Subjt:  HADINGDGAPDHVQA

A0A5A7SKN7 FG-GAP repeat-containing protein    5.4e-85    48.92
Query:  AMAMGVIDRHSRLGQPVTQIL-VVTSGWSVMCFDHNLNELWETNLQEDFPH--------------TLKHGDSGLVIIGGEWKC-SHIWNAARFVFSPSMV
        AMA GVIDRH R GQPVTQ+L VVTSGWSVMCFDHNLN+LWETNLQEDFPH              TLKHGDSGLVI+GG  +  SH              
Subjt:  AMAMGVIDRHSRLGQPVTQIL-VVTSGWSVMCFDHNLNELWETNLQEDFPH--------------TLKHGDSGLVIIGGEWKC-SHIWNAARFVFSPSMV

Query:  NIKKFPNGFNHMIFMDPFEEIGVAEKSVEQHRSSATEKEIYAILQFMYLLVDLVYYDGAGR-------------------------MSNYKLDVHSLNAQ
                    IFMDPFEEIG+AEK+ EQHR SATEKE       + L      Y  AGR                           NYKLDVHSLNA+
Subjt:  NIKKFPNGFNHMIFMDPFEEIGVAEKSVEQHRSSATEKEIYAILQFMYLLVDLVYYDGAGR-------------------------MSNYKLDVHSLNAQ

Query:  HPGEFERREFKESILGVMPH-----------------------------------HLPEENHPPGKDSSKKIPKIIGTA---AGSAKTKKPLPYVPTITN
        HPGEFE REF+ESILGVMPH                                   H PEENHPPGKDSSK+IPKIIGTA   AG+AKTKKPLPYVPTITN
Subjt:  HPGEFERREFKESILGVMPH-----------------------------------HLPEENHPPGKDSSKKIPKIIGTA---AGSAKTKKPLPYVPTITN

Query:  YTELWWLPNVAVVHQTQSFNPELEVVDLFNAIERHLLREMEVEESRRASADSMATCQKLKGDLLTVFNVLPKDMQQGGIEALHLASGRTVCKLHLQEGGL
        YT+LWWLPNV V HQ +                                                            GIEALHLASGRT+CKLHLQEGGL
Subjt:  YTELWWLPNVAVVHQTQSFNPELEVVDLFNAIERHLLREMEVEESRRASADSMATCQKLKGDLLTVFNVLPKDMQQGGIEALHLASGRTVCKLHLQEGGL

Query:  HADINGDGAPDHVQA
        HADINGDG  DHVQA
Subjt:  HADINGDGAPDHVQA

A0A6J1GFL4 uncharacterized protein LOC111453482    2.3e-83    48.79
Query:  AMAMGVIDRHSRLGQPVTQIL-VVTSGWSVMCFDHNLNELWETNLQEDFPH--------------TLKHGDSGLVIIGGEWKCSHIWNAARFVFSPSMVN
        AMA GVIDRH R GQPVTQ+L VVTSGWSVMCFDHNL  LWETNLQEDFPH              TLKHGDSGLVI+GG           R    P    
Subjt:  AMAMGVIDRHSRLGQPVTQIL-VVTSGWSVMCFDHNLNELWETNLQEDFPH--------------TLKHGDSGLVIIGGEWKCSHIWNAARFVFSPSMVN

Query:  IKKFPNGFNHMIFMDPFEEIGVAEKSVEQHRSSATEKEIYAILQFMYLLVDLVYYDGAGR-------------------------MSNYKLDVHSLNAQH
                   IFMDPFEEIG+AEK+ EQHR SATEKE       + L     +Y  AGR                           NYKLDVHSLNA+ 
Subjt:  IKKFPNGFNHMIFMDPFEEIGVAEKSVEQHRSSATEKEIYAILQFMYLLVDLVYYDGAGR-------------------------MSNYKLDVHSLNAQH

Query:  PGEFERREFKESILGVMPH-----------------------------------HLPEENHPPGKDSSKKIPKIIGTA---AGSAKTKKPLPYVPTITNY
        PGEFE REF+ESILGVMPH                                   H PEENHPPGKDSSK+IPKIIG+A   AGSAKTKKPLPYVPTITNY
Subjt:  PGEFERREFKESILGVMPH-----------------------------------HLPEENHPPGKDSSKKIPKIIGTA---AGSAKTKKPLPYVPTITNY

Query:  TELWWLPNVAVVHQTQSFNPELEVVDLFNAIERHLLREMEVEESRRASADSMATCQKLKGDLLTVFNVLPKDMQQGGIEALHLASGRTVCKLHLQEGGLH
        T+LWWLPNV V HQ +                                                            GIEALHLASGRTVCKLHLQEGGLH
Subjt:  TELWWLPNVAVVHQTQSFNPELEVVDLFNAIERHLLREMEVEESRRASADSMATCQKLKGDLLTVFNVLPKDMQQGGIEALHLASGRTVCKLHLQEGGLH

Query:  ADINGDGAPDHVQA
        ADINGDG  DHVQA
Subjt:  ADINGDGAPDHVQA

A0A6J1IJG7 uncharacterized protein LOC111477993    3.9e-83    48.55
Query:  AMAMGVIDRHSRLGQPVTQIL-VVTSGWSVMCFDHNLNELWETNLQEDFPH--------------TLKHGDSGLVIIGGEWKCSHIWNAARFVFSPSMVN
        AMA GVIDRH R GQPVTQ+L VVTSGWSVMCFDHNL +LWE NLQEDFPH              TLKHGDSGLVI+GG           R    P    
Subjt:  AMAMGVIDRHSRLGQPVTQIL-VVTSGWSVMCFDHNLNELWETNLQEDFPH--------------TLKHGDSGLVIIGGEWKCSHIWNAARFVFSPSMVN

Query:  IKKFPNGFNHMIFMDPFEEIGVAEKSVEQHRSSATEKEIYAILQFMYLLVDLVYYDGAGR-------------------------MSNYKLDVHSLNAQH
                   IFMDPFEEIG+AEK+ EQHR SATEKE       + L     +Y  AGR                           NYKLDVHSLNA+ 
Subjt:  IKKFPNGFNHMIFMDPFEEIGVAEKSVEQHRSSATEKEIYAILQFMYLLVDLVYYDGAGR-------------------------MSNYKLDVHSLNAQH

Query:  PGEFERREFKESILGVMPH-----------------------------------HLPEENHPPGKDSSKKIPKIIGTA---AGSAKTKKPLPYVPTITNY
        PGEFE REF+ESILGVMPH                                   H PEENHPPGKDSSK+IPKIIG+A   AGSAKTKKPLPYVPTITNY
Subjt:  PGEFERREFKESILGVMPH-----------------------------------HLPEENHPPGKDSSKKIPKIIGTA---AGSAKTKKPLPYVPTITNY

Query:  TELWWLPNVAVVHQTQSFNPELEVVDLFNAIERHLLREMEVEESRRASADSMATCQKLKGDLLTVFNVLPKDMQQGGIEALHLASGRTVCKLHLQEGGLH
        T+LWWLPNV V HQ +                                                            GIEALHLASGRTVCKLHLQEGGLH
Subjt:  TELWWLPNVAVVHQTQSFNPELEVVDLFNAIERHLLREMEVEESRRASADSMATCQKLKGDLLTVFNVLPKDMQQGGIEALHLASGRTVCKLHLQEGGLH

Query:  ADINGDGAPDHVQA
        ADINGDG  DHVQA
Subjt:  ADINGDGAPDHVQA

SwissProt top hits    e value    % identity    Alignment
P23780 Beta-galactosidase    2.9e-11    29.91
Query:  GFSAWLLTKKRAPRLRSSDPGYL----QW-----------------------IENEFGSY-GDDQAYLHHLVTLARGYLGNEVIRYTTDGGTRETFEKGT
        G  AWLL +K++  LRSSDP YL    +W                       +ENE+GSY   D  YL  LV   R +LGN+VI +TTDG + +  + GT
Subjt:  GFSAWLLTKKRAPRLRSSDPGYL----QW-----------------------IENEFGSY-GDDQAYLHHLVTLARGYLGNEVIRYTTDGGTRETFEKGT

Query:  IRGVAV---FSEG--------IQRPWKISTSFCMAHF-----------------------------------------GANFEFYNGANTGDNVLDYKPD
        ++ +     F  G        +QR ++       + F                                         G NF ++NGANT      Y+P 
Subjt:  IRGVAV---FSEG--------IQRPWKISTSFCMAHF-----------------------------------------GANFEFYNGANTGDNVLDYKPD

Query:  LTSYDYDAPIKEYGDVDNAKYGAM
         TSYDYDAP+ E GD+   KY A+
Subjt:  LTSYDYDAPIKEYGDVDNAKYGAM

Q0DGD7 Beta-galactosidase 8    2.5e-31    37.96
Query:  GFSAWLLTKKRAPRLRSSDPGYL----QW-----------------------IENEFGSYGDDQAYLHHLVTLARGYLGNEVIRYTTDGGTRETFEKGTI
        GF  WLLT +    LRSSD  YL    +W                       IENEFGS+GDD+ YLH+LV +AR YLGN+++ YTTDGG     + GTI
Subjt:  GFSAWLLTKKRAPRLRSSDPGYL----QW-----------------------IENEFGSYGDDQAYLHHLVTLARGYLGNEVIRYTTDGGTRETFEKGTI

Query:  RGVAVFS----EGIQRPWKI----------------------------------------------------STSFCMAHFGANFEFYNGANTGDNVLDY
            VF+    +    PW I                                                    S    MAH G NF FYNGANTG N  DY
Subjt:  RGVAVFS----EGIQRPWKI----------------------------------------------------STSFCMAHFGANFEFYNGANTGDNVLDY

Query:  KPDLTSYDYDAPIKEYGDVDNAKYGAMAMGVIDRHSRLGQPVTQI
        K DLTSYDYDAPI+EYGDV NAKY A+   +   H   G P+ Q+
Subjt:  KPDLTSYDYDAPIKEYGDVDNAKYGAMAMGVIDRHSRLGQPVTQI

Q6UWU2 Beta-galactosidase-1-like protein    2.1e-09    28.17
Query:  GFSAWLLTKKRAPRLRSSDPGYL----QW-----------------------IENEFGSY-GDDQAYLHHLVTLARGYLGNEVIRYTTDG----------
        G  +WLL K     LR+SDP +L     W                       +ENE+GSY   D +Y+ HL  L R  LG +++ +TTDG          
Subjt:  GFSAWLLTKKRAPRLRSSDPGYL----QW-----------------------IENEFGSY-GDDQAYLHHLVTLARGYLGNEVIRYTTDG----------

Query:  GTRETFEKG--------------------------------------TIRGVAVFSEGIQRPWKI--STSFCMAHFGANFEFYNGANTGDNVLDYKPDLT
        G   T + G                                      + R V+  ++G++   K+  S +  M H G NF ++NGA+     L   P  T
Subjt:  GTRETFEKG--------------------------------------TIRGVAVFSEGIQRPWKI--STSFCMAHFGANFEFYNGANTGDNVLDYKPDLT

Query:  SYDYDAPIKEYGD
        SYDYDAPI E GD
Subjt:  SYDYDAPIKEYGD

Q93Z24 Beta-galactosidase 17    2.7e-33    38.52
Query:  GFSAWLLTKKRAPRLRSSDPGYLQ-----W----------------------IENEFGSYGDDQAYLHHLVTLARGYLGNEVIRYTTDGGTRETFEKGTI
        GF AWLL  K   +LR+SDP YL+     W                      IENE+GSYG+D+AYL  LV++ARG+LG+++I YTTDGGT+ET +KGT+
Subjt:  GFSAWLLTKKRAPRLRSSDPGYLQ-----W----------------------IENEFGSYGDDQAYLHHLVTLARGYLGNEVIRYTTDGGTRETFEKGTI

Query:  RGVAV-----FSEGIQRPWKI----------------------------------------------------STSFCMAHFGANFEFYNGANTGDNVLD
            V     FS G   PW I                                                    S    M H G NF FYNGANTG    D
Subjt:  RGVAV-----FSEGIQRPWKI----------------------------------------------------STSFCMAHFGANFEFYNGANTGDNVLD

Query:  YKPDLTSYDYDAPIKEYGDVDNAKYGAMAMGVIDRHSRLGQPVT
        YKPDLTSYDYDAPIKE GD+DN K+ A+   VI +++    P++
Subjt:  YKPDLTSYDYDAPIKEYGDVDNAKYGAMAMGVIDRHSRLGQPVT

Q95LV1 Beta-galactosidase-1-like protein    3.2e-10    28.64
Query:  GFSAWLLTKKRAPRLRSSDPGYL----QW-----------------------IENEFGSYGD-DQAYLHHLVTLARGYLGNEVIRYTTDG----------
        G  +WLL K    RLR+SDP +L     W                       +ENE+GSYG  D +Y+ HL  L R  LG +++ +TTDG          
Subjt:  GFSAWLLTKKRAPRLRSSDPGYL----QW-----------------------IENEFGSYGD-DQAYLHHLVTLARGYLGNEVIRYTTDG----------

Query:  GTRETFEKG--------------------------------------TIRGVAVFSEGIQRPWKI--STSFCMAHFGANFEFYNGANTGDNVLDYKPDLT
        G   T + G                                      + R V+  ++G++   K+  S +  M H G NF ++NGA+     L      T
Subjt:  GTRETFEKG--------------------------------------TIRGVAVFSEGIQRPWKI--STSFCMAHFGANFEFYNGANTGDNVLDYKPDLT

Query:  SYDYDAPIKEYGD
        SYDYDAPI E GD
Subjt:  SYDYDAPIKEYGD

Arabidopsis top hits    e value    % identity    Alignment
AT1G72990.1 beta-galactosidase 17    1.9e-34    38.52
Query:  GFSAWLLTKKRAPRLRSSDPGYLQ-----W----------------------IENEFGSYGDDQAYLHHLVTLARGYLGNEVIRYTTDGGTRETFEKGTI
        GF AWLL  K   +LR+SDP YL+     W                      IENE+GSYG+D+AYL  LV++ARG+LG+++I YTTDGGT+ET +KGT+
Subjt:  GFSAWLLTKKRAPRLRSSDPGYLQ-----W----------------------IENEFGSYGDDQAYLHHLVTLARGYLGNEVIRYTTDGGTRETFEKGTI

Query:  RGVAV-----FSEGIQRPWKI----------------------------------------------------STSFCMAHFGANFEFYNGANTGDNVLD
            V     FS G   PW I                                                    S    M H G NF FYNGANTG    D
Subjt:  RGVAV-----FSEGIQRPWKI----------------------------------------------------STSFCMAHFGANFEFYNGANTGDNVLD

Query:  YKPDLTSYDYDAPIKEYGDVDNAKYGAMAMGVIDRHSRLGQPVT
        YKPDLTSYDYDAPIKE GD+DN K+ A+   VI +++    P++
Subjt:  YKPDLTSYDYDAPIKEYGDVDNAKYGAMAMGVIDRHSRLGQPVT

AT1G72990.2 beta-galactosidase 17    1.9e-34    38.52
Query:  GFSAWLLTKKRAPRLRSSDPGYLQ-----W----------------------IENEFGSYGDDQAYLHHLVTLARGYLGNEVIRYTTDGGTRETFEKGTI
        GF AWLL  K   +LR+SDP YL+     W                      IENE+GSYG+D+AYL  LV++ARG+LG+++I YTTDGGT+ET +KGT+
Subjt:  GFSAWLLTKKRAPRLRSSDPGYLQ-----W----------------------IENEFGSYGDDQAYLHHLVTLARGYLGNEVIRYTTDGGTRETFEKGTI

Query:  RGVAV-----FSEGIQRPWKI----------------------------------------------------STSFCMAHFGANFEFYNGANTGDNVLD
            V     FS G   PW I                                                    S    M H G NF FYNGANTG    D
Subjt:  RGVAV-----FSEGIQRPWKI----------------------------------------------------STSFCMAHFGANFEFYNGANTGDNVLD

Query:  YKPDLTSYDYDAPIKEYGDVDNAKYGAMAMGVIDRHSRLGQPVT
        YKPDLTSYDYDAPIKE GD+DN K+ A+   VI +++    P++
Subjt:  YKPDLTSYDYDAPIKEYGDVDNAKYGAMAMGVIDRHSRLGQPVT

AT1G73930.1 unknown protein    2.1e-09    52.7
Query:  LWWLPNVAV-VHQTQSFNPELEVVDLFNAIERHLLREMEVEESRRASADSMATCQKLKGDLLTVFNVLPKDMQQ
        LW L  +   + Q  S   ELE VD FNAIE+H+  E++  ES    ADS  T QKLK DL  VF+VLPKDMQQ
Subjt:  LWWLPNVAV-VHQTQSFNPELEVVDLFNAIERHLLREMEVEESRRASADSMATCQKLKGDLLTVFNVLPKDMQQ

AT1G73930.2 unknown protein    2.1e-09    52.7
Query:  LWWLPNVAV-VHQTQSFNPELEVVDLFNAIERHLLREMEVEESRRASADSMATCQKLKGDLLTVFNVLPKDMQQ
        LW L  +   + Q  S   ELE VD FNAIE+H+  E++  ES    ADS  T QKLK DL  VF+VLPKDMQQ
Subjt:  LWWLPNVAV-VHQTQSFNPELEVVDLFNAIERHLLREMEVEESRRASADSMATCQKLKGDLLTVFNVLPKDMQQ

AT3G51050.1 FG-GAP repeat-containing protein    1.0e-67    40.44
Query:  AMAMGVIDRHSRLGQPVTQ-ILVVTSGWSVMCFDHNLNELWETNLQEDFPH--------------TLKHGDSGLVIIGGEWKCSHIWNAARFVFSPSMVN
        AMA GVIDR+ + G P  Q ++VVTSGWSV+CFDHNL +LWETNLQEDFPH              TLKHGD+GLVI+GG           R    P    
Subjt:  AMAMGVIDRHSRLGQPVTQ-ILVVTSGWSVMCFDHNLNELWETNLQEDFPH--------------TLKHGDSGLVIIGGEWKCSHIWNAARFVFSPSMVN

Query:  IKKFPNGFNHMIFMDPFEEIGVAEKSVEQHRSSATEKE--------------IYA------ILQFMYLLVDL-VYYDGAGRM---SNYKLDVHSLNAQHP
               +NH   MDPFEE+G+  ++ +QHR SATE +              +YA      +L++     D+  +   A ++    NYKLDVH+LN++HP
Subjt:  IKKFPNGFNHMIFMDPFEEIGVAEKSVEQHRSSATEKE--------------IYA------ILQFMYLLVDL-VYYDGAGRM---SNYKLDVHSLNAQHP

Query:  GEFERREFKESILGVMPH------------------------------------HLPEENHPPGKDSSKKIPKIIGTA---AGSAKTKKPLPYVPTITNY
        GEFE REF+ESIL VMPH                                    H PEE+ P GKD S+KIPK+IG A   AGSAK KK + Y+PTITNY
Subjt:  GEFERREFKESILGVMPH------------------------------------HLPEENHPPGKDSSKKIPKIIGTA---AGSAKTKKPLPYVPTITNY

Query:  TELWWLPNVAVVHQTQSFNPELEVVDLFNAIERHLLREMEVEESRRASADSMATCQKLKGDLLTVFNVLPKDMQQGGIEALHLASGRTVCKLHLQEGGLH
        T+LWW+PNV V HQ +                                                            GIEA+HL +GRT+CKL L EGGLH
Subjt:  TELWWLPNVAVVHQTQSFNPELEVVDLFNAIERHLLREMEVEESRRASADSMATCQKLKGDLLTVFNVLPKDMQQGGIEALHLASGRTVCKLHLQEGGLH

Query:  ADINGDGAPDHVQ
        ADINGDG  DHVQ
Subjt:  ADINGDGAPDHVQ


Sequences
CDS sequence
ATGGGGAGACTTTCCCTGATTGTGGCTGCTGCAAATGTTTTAGTAGGACAATCAACCTCTCCTCAAAGAGGTACAATTACCCAGTTTTTTCTTTTCACGAACATCGGGAA
TGTGATGACATTGATGGAGACTGTAAAATTCTGTTTGCAGATTCGGGAACTGAAACCGAAGCTTCAAGAAGATAACTCAGAGAGCAATCTTTCGATGGAGGAAGAGATGG
TGGTGCCGGCCGGTTCTGAGAATTCTCTGATTGAACAAGTAAAGCCGGAAATTGCCGATCAGTTCTCTGTTCCTCCGGCGAGTGAATCCCAAGACTTCAATTACGAGAGC
TTCAACAACAATGGCGGAGTAGGGGAAGAGGCGCCGACGGAAGAGGTGTCATTGTTCCGCGATTTCAAAGATGGGTCATCCGATAGCGATTCGAGCGCAATTTTGAACGA
AGATTACAGCCCCACGGCGGCCATTTATTCACCGGGGTGCTGCAGAATCTCAAACTCAGTTATGAAACTCTCCATCAGGACAATCAAGCTCTCCTCAAAGAGCGAAGCTT
CAAAGAGCAATCTTTCGATGGAGGAAGAGATGGTGGTGCCGGCCGATTCTGAGAATTCTCTGATTGAACAAGTAAAGCCGGAAATTGCCGATCAGTTCTCTGTTCCTCCG
GCGAGTGAATCCCAAGACTTCAATTACGAGAGCTTCAACAACAATGGCGGAGAAGGGGAAGAGGCGTCATTGTTCCGCGATTTCAAAGATGGGTCATCCGATAGCGATTC
GAGCGCAATTTTGAACGAAGATTACAGCCCCACGGCGGCCATTTATTCACCGGGGCTGCTGCAGAACCACCACCATCACCACTTCATGACGGCGGCATCTCCTTCTCCGT
CCGCCGCCGTGAAACTGAACTGCGTAACGACGACGCTGAGTTACTTGCAGTATCAGAAGGGGTATCGACAAACCCAGATGTTTCCGAAAATGGAGGAGCATAATTTCTTC
AGCGGAGAGGAGGCTTGTAGTTCCATATCTTCTAAAGTTCCTCGACAACCTTACAAATATTTATGCGAGGTTCAACCGCAAGAGACTAAAAGGCCGTACTGGGGAAGAAG
ACTGCCGGATAGCACTCTCTACTCTGTACCATTTTATGGTATCCTTGTAATTAAGTTGAAATCTGTACACGACATAGTGGGATTTAGGGGTTTTTCTGCTTGGTTGCTTA
CCAAAAAGCGAGCTCCTAGATTAAGATCCTCAGATCCTGGTTACCTCCAATGGATTGAGAATGAATTTGGCTCATATGGGGATGATCAAGCTTACCTTCACCATTTGGTT
ACGCTGGCAAGAGGCTACCTTGGGAATGAAGTAATACGGTATACTACAGATGGAGGTACTAGGGAAACTTTTGAGAAAGGAACCATTCGGGGGGTTGCTGTCTTTTCGGA
AGGAATTCAACGCCCCTGGAAAATCTCCACCTCTTTCTGCATGGCTCATTTTGGAGCAAATTTCGAATTTTATAATGGAGCGAATACCGGTGATAATGTGTTGGATTACA
AGCCTGATCTTACTTCTTATGATTATGATGCACCAATTAAGGAATATGGTGACGTTGACAATGCTAAATATGGAGCCATGGCTATGGGAGTTATTGATCGGCATTCCAGA
CTGGGGCAACCTGTGACGCAGATTCTTGTTGTTACATCCGGTTGGTCTGTTATGTGTTTTGATCACAATCTCAACGAGTTATGGGAAACAAATCTGCAGGAGGATTTTCC
ACATACCCTAAAGCATGGTGATTCAGGATTGGTTATCATTGGTGGAGAATGGAAATGCAGCCACATTTGGAATGCTGCCAGATTTGTGTTTTCTCCCTCCATGGTCAATA
TCAAGAAATTTCCCAATGGTTTCAATCATATGATTTTTATGGATCCCTTTGAAGAAATTGGAGTTGCAGAAAAGAGCGTTGAGCAACATAGAAGTAGTGCTACAGAAAAG
GAGATTTACGCCATTTTGCAATTTATGTATTTGCTGGTTGATCTGGTGTACTATGATGGAGCTGGAAGAATGAGCAATTACAAGCTTGATGTTCACTCTCTGAATGCTCA
ACATCCTGGAGAGTTTGAGCGCAGGGAATTTAAGGAATCAATCCTTGGAGTTATGCCACACCACTTGCCTGAGGAAAACCATCCTCCAGGGAAGGACTCAAGTAAAAAGA
TTCCTAAAATTATTGGTACTGCTGCAGGTTCGGCAAAAACTAAGAAGCCTCTTCCATATGTTCCTACCATAACTAACTATACCGAGCTTTGGTGGCTTCCTAATGTTGCC
GTGGTACACCAAACGCAGAGCTTCAATCCCGAGTTGGAAGTTGTCGATTTATTTAATGCTATTGAAAGACATCTTCTAAGAGAAATGGAGGTAGAGGAATCCAGAAGGGC
TTCCGCAGACTCTATGGCAACTTGCCAGAAACTGAAGGGTGATCTGCTCACTGTTTTTAATGTGCTTCCCAAGGACATGCAGCAGGGAGGCATCGAAGCTCTGCATTTGG
CATCTGGTCGCACTGTTTGCAAGCTACATCTTCAAGAAGGTGGTCTTCATGCTGATATTAATGGAGATGGAGCCCCTGATCATGTTCAGGCTGGGTATCAACTATTGTTC
TTGACAGTTACATCTTTACTAAAATTTGACAAAATTATTGGCCTATAG
mRNA sequence
ATGGGGAGACTTTCCCTGATTGTGGCTGCTGCAAATGTTTTAGTAGGACAATCAACCTCTCCTCAAAGAGGTACAATTACCCAGTTTTTTCTTTTCACGAACATCGGGAA
TGTGATGACATTGATGGAGACTGTAAAATTCTGTTTGCAGATTCGGGAACTGAAACCGAAGCTTCAAGAAGATAACTCAGAGAGCAATCTTTCGATGGAGGAAGAGATGG
TGGTGCCGGCCGGTTCTGAGAATTCTCTGATTGAACAAGTAAAGCCGGAAATTGCCGATCAGTTCTCTGTTCCTCCGGCGAGTGAATCCCAAGACTTCAATTACGAGAGC
TTCAACAACAATGGCGGAGTAGGGGAAGAGGCGCCGACGGAAGAGGTGTCATTGTTCCGCGATTTCAAAGATGGGTCATCCGATAGCGATTCGAGCGCAATTTTGAACGA
AGATTACAGCCCCACGGCGGCCATTTATTCACCGGGGTGCTGCAGAATCTCAAACTCAGTTATGAAACTCTCCATCAGGACAATCAAGCTCTCCTCAAAGAGCGAAGCTT
CAAAGAGCAATCTTTCGATGGAGGAAGAGATGGTGGTGCCGGCCGATTCTGAGAATTCTCTGATTGAACAAGTAAAGCCGGAAATTGCCGATCAGTTCTCTGTTCCTCCG
GCGAGTGAATCCCAAGACTTCAATTACGAGAGCTTCAACAACAATGGCGGAGAAGGGGAAGAGGCGTCATTGTTCCGCGATTTCAAAGATGGGTCATCCGATAGCGATTC
GAGCGCAATTTTGAACGAAGATTACAGCCCCACGGCGGCCATTTATTCACCGGGGCTGCTGCAGAACCACCACCATCACCACTTCATGACGGCGGCATCTCCTTCTCCGT
CCGCCGCCGTGAAACTGAACTGCGTAACGACGACGCTGAGTTACTTGCAGTATCAGAAGGGGTATCGACAAACCCAGATGTTTCCGAAAATGGAGGAGCATAATTTCTTC
AGCGGAGAGGAGGCTTGTAGTTCCATATCTTCTAAAGTTCCTCGACAACCTTACAAATATTTATGCGAGGTTCAACCGCAAGAGACTAAAAGGCCGTACTGGGGAAGAAG
ACTGCCGGATAGCACTCTCTACTCTGTACCATTTTATGGTATCCTTGTAATTAAGTTGAAATCTGTACACGACATAGTGGGATTTAGGGGTTTTTCTGCTTGGTTGCTTA
CCAAAAAGCGAGCTCCTAGATTAAGATCCTCAGATCCTGGTTACCTCCAATGGATTGAGAATGAATTTGGCTCATATGGGGATGATCAAGCTTACCTTCACCATTTGGTT
ACGCTGGCAAGAGGCTACCTTGGGAATGAAGTAATACGGTATACTACAGATGGAGGTACTAGGGAAACTTTTGAGAAAGGAACCATTCGGGGGGTTGCTGTCTTTTCGGA
AGGAATTCAACGCCCCTGGAAAATCTCCACCTCTTTCTGCATGGCTCATTTTGGAGCAAATTTCGAATTTTATAATGGAGCGAATACCGGTGATAATGTGTTGGATTACA
AGCCTGATCTTACTTCTTATGATTATGATGCACCAATTAAGGAATATGGTGACGTTGACAATGCTAAATATGGAGCCATGGCTATGGGAGTTATTGATCGGCATTCCAGA
CTGGGGCAACCTGTGACGCAGATTCTTGTTGTTACATCCGGTTGGTCTGTTATGTGTTTTGATCACAATCTCAACGAGTTATGGGAAACAAATCTGCAGGAGGATTTTCC
ACATACCCTAAAGCATGGTGATTCAGGATTGGTTATCATTGGTGGAGAATGGAAATGCAGCCACATTTGGAATGCTGCCAGATTTGTGTTTTCTCCCTCCATGGTCAATA
TCAAGAAATTTCCCAATGGTTTCAATCATATGATTTTTATGGATCCCTTTGAAGAAATTGGAGTTGCAGAAAAGAGCGTTGAGCAACATAGAAGTAGTGCTACAGAAAAG
GAGATTTACGCCATTTTGCAATTTATGTATTTGCTGGTTGATCTGGTGTACTATGATGGAGCTGGAAGAATGAGCAATTACAAGCTTGATGTTCACTCTCTGAATGCTCA
ACATCCTGGAGAGTTTGAGCGCAGGGAATTTAAGGAATCAATCCTTGGAGTTATGCCACACCACTTGCCTGAGGAAAACCATCCTCCAGGGAAGGACTCAAGTAAAAAGA
TTCCTAAAATTATTGGTACTGCTGCAGGTTCGGCAAAAACTAAGAAGCCTCTTCCATATGTTCCTACCATAACTAACTATACCGAGCTTTGGTGGCTTCCTAATGTTGCC
GTGGTACACCAAACGCAGAGCTTCAATCCCGAGTTGGAAGTTGTCGATTTATTTAATGCTATTGAAAGACATCTTCTAAGAGAAATGGAGGTAGAGGAATCCAGAAGGGC
TTCCGCAGACTCTATGGCAACTTGCCAGAAACTGAAGGGTGATCTGCTCACTGTTTTTAATGTGCTTCCCAAGGACATGCAGCAGGGAGGCATCGAAGCTCTGCATTTGG
CATCTGGTCGCACTGTTTGCAAGCTACATCTTCAAGAAGGTGGTCTTCATGCTGATATTAATGGAGATGGAGCCCCTGATCATGTTCAGGCTGGGTATCAACTATTGTTC
TTGACAGTTACATCTTTACTAAAATTTGACAAAATTATTGGCCTATAG
Protein sequence
MGRLSLIVAAANVLVGQSTSPQRGTITQFFLFTNIGNVMTLMETVKFCLQIRELKPKLQEDNSESNLSMEEEMVVPAGSENSLIEQVKPEIADQFSVPPASESQDFNYES
FNNNGGVGEEAPTEEVSLFRDFKDGSSDSDSSAILNEDYSPTAAIYSPGCCRISNSVMKLSIRTIKLSSKSEASKSNLSMEEEMVVPADSENSLIEQVKPEIADQFSVPP
ASESQDFNYESFNNNGGEGEEASLFRDFKDGSSDSDSSAILNEDYSPTAAIYSPGLLQNHHHHHFMTAASPSPSAAVKLNCVTTTLSYLQYQKGYRQTQMFPKMEEHNFF
SGEEACSSISSKVPRQPYKYLCEVQPQETKRPYWGRRLPDSTLYSVPFYGILVIKLKSVHDIVGFRGFSAWLLTKKRAPRLRSSDPGYLQWIENEFGSYGDDQAYLHHLV
TLARGYLGNEVIRYTTDGGTRETFEKGTIRGVAVFSEGIQRPWKISTSFCMAHFGANFEFYNGANTGDNVLDYKPDLTSYDYDAPIKEYGDVDNAKYGAMAMGVIDRHSR
LGQPVTQILVVTSGWSVMCFDHNLNELWETNLQEDFPHTLKHGDSGLVIIGGEWKCSHIWNAARFVFSPSMVNIKKFPNGFNHMIFMDPFEEIGVAEKSVEQHRSSATEK
EIYAILQFMYLLVDLVYYDGAGRMSNYKLDVHSLNAQHPGEFERREFKESILGVMPHHLPEENHPPGKDSSKKIPKIIGTAAGSAKTKKPLPYVPTITNYTELWWLPNVA
VVHQTQSFNPELEVVDLFNAIERHLLREMEVEESRRASADSMATCQKLKGDLLTVFNVLPKDMQQGGIEALHLASGRTVCKLHLQEGGLHADINGDGAPDHVQAGYQLLF
LTVTSLLKFDKIIGL
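
As a quick consistency check between the CDS and protein sequences listed above, the CDS can be translated with the standard genetic code. This is a minimal sketch that assumes the standard nuclear code (NCBI translation table 1) applies to this gene. The names `CODON_TABLE` and `translate` are hypothetical, and the two fragments are the first 30 and last 21 nucleotides of the CDS.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; '*' marks a stop, 'X' an unknown codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    usable = len(cds) - len(cds) % 3    # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


print(translate("ATGGGGAGACTTTCCCTGATTGTGGCTGCT"))  # MGRLSLIVAA
print(translate("GACAAAATTATTGGCCTATAG"))           # DKIIGL*
```

Both fragments match the start (`MGRLSLIVAA`) and end (`DKIIGL`) of the protein sequence shown above. Passing the full CDS, joined across its lines, should reproduce the whole protein followed by a terminal `*`.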