; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi02G027210 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi02G027210
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionBEL1-like homeodomain protein 7
Genome locationchr02:33526591..33540176
RNA-Seq ExpressionLsi02G027210
SyntenyLsi02G027210
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR001296 - Glycosyl transferase, family 1
IPR001356 - Homeobox domain
IPR006563 - POX domain
IPR008422 - Homeobox KN domain
IPR009057 - Homeobox-like domain superfamily
IPR019378 - GDP-fucose protein O-fucosyltransferase
IPR041693 - Glycosyl-transferase family 4_5


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8646445.1 hypothetical protein Csa_016224 [Cucumis sativus]0.0e+0076.87Show/hide
Query:  PFPIFTSLQSRREVGERWPLMILALVSISTAMVFFMRTSFDSCSGNGNRRFVEENGIDSQIRSSQIERKAPNPLDFMKSKLVLLVSHELSLSGGPLLLME
        PFP  T  +      +RWPLMILALVSISTAMVFFMRT+FDSC+GN NRRFVEE GI S+I S Q ERKAPNPLDFMKSKLVLLVSHELSLSGGPLLLME
Subjt:  PFPIFTSLQSRREVGERWPLMILALVSISTAMVFFMRTSFDSCSGNGNRRFVEENGIDSQIRSSQIERKAPNPLDFMKSKLVLLVSHELSLSGGPLLLME

Query:  LAFLLRGVGTQVVWITNQKPSEPDEVVYSLERKMLDRGVQIFILKVLSAKGQEAVETALKAHLVVLNTAVAGKWLDAILKENVPRVLPKVLWWIHEMRGN
        LAFLLRGVGTQVVWITNQKP EPDEVVYSLERKMLDRGVQ     VLSAK QEAVETALKAHLVVLNTAVAGKWLDA+LKENVPRVLPKVLWWIHEMRGN
Subjt:  LAFLLRGVGTQVVWITNQKPSEPDEVVYSLERKMLDRGVQIFILKVLSAKGQEAVETALKAHLVVLNTAVAGKWLDAILKENVPRVLPKVLWWIHEMRGN

Query:  YFKVEYVKHLPFVAGAMIDSHTTAEYWKNRTRERLGIKMPETYVVHLGNSKDLMEVAENNVAKRVLREHIRESLGVRNEDILFAIINSVSRGKGQDLFLR
        YFKVEYVKHLPFVAGAMIDSHTTAEYWKNRT +RLGI+MPETYVVHLGNSKDLMEVAENNVAKRVLREHIRESLGVRNEDILFAIINSVSRGKGQDLFLR
Subjt:  YFKVEYVKHLPFVAGAMIDSHTTAEYWKNRTRERLGIKMPETYVVHLGNSKDLMEVAENNVAKRVLREHIRESLGVRNEDILFAIINSVSRGKGQDLFLR

Query:  AFHQSLQMIQDKKLRVPRIHAVVVGSDMSAQTKFETELRNFVNENKIQDRVHFVNKTLSVAPYLASIDVLVQNSQGRGECFGRITIEAMAFQLPVLGTAA
        AFHQSLQMIQDKKLRVPRIHAVVVGSDM+A TKFETELRNFVNENKIQDRVHFVNKTLSVAPYLASIDVLVQNSQGRGECFGRITIEAMAFQLPVLGTAA
Subjt:  AFHQSLQMIQDKKLRVPRIHAVVVGSDMSAQTKFETELRNFVNENKIQDRVHFVNKTLSVAPYLASIDVLVQNSQGRGECFGRITIEAMAFQLPVLGTAA

Query:  GGTMEIVVNGTTGLLHPAGKEGVTPLAHNIVKLATHVERRLTIGKKGYERVRQMFLEQHMGQRIAFVAGNYVAQILSVTTINLRPKERRTTSFPCDSIFS
        GGTMEIVVNGTTGLLHPAGKEGVTPLAHNIVKLATHVERRLT+                                                         
Subjt:  GGTMEIVVNGTTGLLHPAGKEGVTPLAHNIVKLATHVERRLTIGKKGYERVRQMFLEQHMGQRIAFVAGNYVAQILSVTTINLRPKERRTTSFPCDSIFS

Query:  NTKSPTRSVLVDGEEEGFAGSEMFVVSVKPFFSVLLVTLSLSLAVLLLSPPSLLFSNTSFSFTNERGSVACRSPLPIIP--PRRWSVLLLFVKSNIYLQF
                                   VKPFFSVLLVTLSLS AVLLLSPPS  FSNTSFSFTN RG     S   I+   P +W           +L+ 
Subjt:  NTKSPTRSVLVDGEEEGFAGSEMFVVSVKPFFSVLLVTLSLSLAVLLLSPPSLLFSNTSFSFTNERGSVACRSPLPIIP--PRRWSVLLLFVKSNIYLQF

Query:  QLKNFGIRLHGYMECSKDSRVEALQVVASRLFVEMLQLCPLIAMDIYAWIAM----LCDGVGIARLLNATLVLPKFEVAAYWNESS-------------Q
         L       +GY                             I +D Y  +      LCDGVGIARLLNATLVLPKFEVAAYWNESS             Q
Subjt:  QLKNFGIRLHGYMECSKDSRVEALQVVASRLFVEMLQLCPLIAMDIYAWIAM----LCDGVGIARLLNATLVLPKFEVAAYWNESS-------------Q

Query:  MNGYVKVVKMLPPEFASKEPYHVDCSKRKGQFDYIESVLPSLLEHRYISITPAMSQRRDRYPQYAKAALCQACYNGLRLAKSVEEKAKELLEAIPKPFLS
        MNGYVKV K LPPEFASKEPYHVDCSKRKGQFDYIESVLPSLLEH YISITPAMSQRRDRYPQYAKAALCQ CYNGLRLAKSVE+KA+ELLEAIPKPFLS
Subjt:  MNGYVKVVKMLPPEFASKEPYHVDCSKRKGQFDYIESVLPSLLEHRYISITPAMSQRRDRYPQYAKAALCQACYNGLRLAKSVEEKAKELLEAIPKPFLS

Query:  LHLRFEPDMVAYSQCEYKGLSPASLEAIEATRGDRKPWTGELAGIWRKRGKCPLTPRETALILQALHIPTNTNIYLAAGDGLMELEGFTSVYTNVVTKSS
        LHLRFEPDMVAYSQCEYKGLSP SLEAIEATRGDRKPWTG+LA IWRKRGKCPLTPRETALI QALHIPTNTNIYLAAGDGLMELEGFTSVYTNVVTKSS
Subjt:  LHLRFEPDMVAYSQCEYKGLSPASLEAIEATRGDRKPWTGELAGIWRKRGKCPLTPRETALILQALHIPTNTNIYLAAGDGLMELEGFTSVYTNVVTKSS

Query:  FLSSDDFSSMHGNTKAALDYYVSINSDSYVATFFGNMDKMVAAMRAFNGKQRTLFLSRRAFADFTYKGLKGKELNQALWKTHRDDFAMGRGSALSDCFCE
        FLS++DFSSMHGNTKAALDYYVSINSD YVATFFGNMDKMVAAMRAFNGKQ+TLFLSRRAFA+FTYKGL+GKEL+QALWKTHRDDF+MGRGSALSDCFCE
Subjt:  FLSSDDFSSMHGNTKAALDYYVSINSDSYVATFFGNMDKMVAAMRAFNGKQRTLFLSRRAFADFTYKGLKGKELNQALWKTHRDDFAMGRGSALSDCFCE

Query:  FKL
        FKL
Subjt:  FKL

KAG6593438.1 O-fucosyltransferase 13, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0078.44Show/hide
Query:  ERWPLMILALVSISTAMVFFMRTSFDSCSGNGNRRFVEENGIDSQIRSSQIERKAPNPLDFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTQVVWI
        +RWPLMILALVSISTA VF MRT+FDSCSG+GNR FVEE GIDSQIRSSQIERKAPNPLDFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTQV WI
Subjt:  ERWPLMILALVSISTAMVFFMRTSFDSCSGNGNRRFVEENGIDSQIRSSQIERKAPNPLDFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTQVVWI

Query:  TNQKPSEPDEVVYSLERKMLDRGVQIFILKVLSAKGQEAVETALKAHLVVLNTAVAGKWLDAILKENVPRVLPKVLWWIHEMRGNYFKVEYVKHLPFVAG
        TNQ  SEPDEVVYSLERKMLDRGVQ     VLSAKGQEA++TALKAHLVVLNTAVAGKWLDA+LKENVPRVLPKVLWWIHEMRGNYFKVEYVKHLPFVAG
Subjt:  TNQKPSEPDEVVYSLERKMLDRGVQIFILKVLSAKGQEAVETALKAHLVVLNTAVAGKWLDAILKENVPRVLPKVLWWIHEMRGNYFKVEYVKHLPFVAG

Query:  AMIDSHTTAEYWKNRTRERLGIKMPETYVVHLGNSKDLMEVAENNVAKRVLREHIRESLGVRNEDILFAIINSVSRGKGQDLFLRAFHQSLQMIQDKKLR
        AMIDSHTTAEYW+NRTRERLGIKMPETYVVHLGNSKDLMEVAENNVAKRVLREHIRESLGVRNEDILFAIINSVSRGKGQDLFLRAFHQSL+MIQDKKLR
Subjt:  AMIDSHTTAEYWKNRTRERLGIKMPETYVVHLGNSKDLMEVAENNVAKRVLREHIRESLGVRNEDILFAIINSVSRGKGQDLFLRAFHQSLQMIQDKKLR

Query:  VPRIHAVVVGSDMSAQTKFETELRNFVNENKIQDRVHFVNKTLSVAPYLASIDVLVQNSQGRGECFGRITIEAMAFQLPVLGTAAGGTMEIVVNGTTGLL
        VPRIHAVVVGSDMSA TKFETELRNFV +NKIQDRVHFVNKTLSVAPYLASIDVLVQNSQGRGECFGRITIEAMAFQLPVLGTAAGGTMEIVVNGTTGLL
Subjt:  VPRIHAVVVGSDMSAQTKFETELRNFVNENKIQDRVHFVNKTLSVAPYLASIDVLVQNSQGRGECFGRITIEAMAFQLPVLGTAAGGTMEIVVNGTTGLL

Query:  HPAGKEGVTPLAHNIVKLATHVERRLTIGKKGYERVRQMFLEQHMGQRIAFVAGNYVAQILSVTTINLRPKERRTTSFPCDSIFSNTKSPTRSVLVDGEE
        HPAGKEGVTPLA +IVKLATHVERRLTIGKKGYER                                                               EE
Subjt:  HPAGKEGVTPLAHNIVKLATHVERRLTIGKKGYERVRQMFLEQHMGQRIAFVAGNYVAQILSVTTINLRPKERRTTSFPCDSIFSNTKSPTRSVLVDGEE

Query:  EGFAGSEMFVVSVKPFFSVLLVTLSLSLAVLLLSPPSLLFSNTSFSFTNERGSVACRSPLPIIP--PRRWSVLLLFVKSNIYLQFQLKNFGIRLHGYMEC
        EGF GSEMF+VSVKPFFSVL+V L L+LAVLLLSP S LF   S S T++RG     S   I    P +W           +LQ  L       +GY   
Subjt:  EGFAGSEMFVVSVKPFFSVLLVTLSLSLAVLLLSPPSLLFSNTSFSFTNERGSVACRSPLPIIP--PRRWSVLLLFVKSNIYLQFQLKNFGIRLHGYMEC

Query:  SKDSRVEALQVVASRLFVEMLQLCPLIAMDIYAWIAM----LCDGVGIARLLNATLVLPKFEVAAYWNES-------------SQMNGYVKVVKMLPPEF
                                  I +D Y  +      LCDGVG+ARLLNATLVLPKFEVAAYWNES             SQMNGYVKVVK LPPEF
Subjt:  SKDSRVEALQVVASRLFVEMLQLCPLIAMDIYAWIAM----LCDGVGIARLLNATLVLPKFEVAAYWNES-------------SQMNGYVKVVKMLPPEF

Query:  ASKEPYHVDCSKRKGQFDYIESVLPSLLEHRYISITPAMSQRRDRYPQYAKAALCQACYNGLRLAKSVEEKAKELLEAIPKPFLSLHLRFEPDMVAYSQC
        ASKEP+HVDCSKRKGQFDYIESVLPSLLEH YISITPAMSQRRDRYPQYAKAALCQACYN LRLA+ VEEKAK+LLEAIPKPFLSLHLRFEPDMVAYSQC
Subjt:  ASKEPYHVDCSKRKGQFDYIESVLPSLLEHRYISITPAMSQRRDRYPQYAKAALCQACYNGLRLAKSVEEKAKELLEAIPKPFLSLHLRFEPDMVAYSQC

Query:  EYKGLSPASLEAIEATRGDRKPWTGELAGIWRKRGKCPLTPRETALILQALHIPTNTNIYLAAGDGLMELEGFTSVYTNVVTKSSFLSSDDFSSMHGNTK
        EY+GLS ASLEAIEA R DRKPWTGELA IWRKRGKCPLTPRETALI QALHIPTNTNIYLAAGDGLME+EGFTSVYTNVVTKSSFLSSDDFS+MHGNTK
Subjt:  EYKGLSPASLEAIEATRGDRKPWTGELAGIWRKRGKCPLTPRETALILQALHIPTNTNIYLAAGDGLMELEGFTSVYTNVVTKSSFLSSDDFSSMHGNTK

Query:  AALDYYVSINSDSYVATFFGNMDKMVAAMRAFNGKQRTLFLSRRAFADFTYKGLKGKELNQALWKTHRDDFAMGRGSALSDCFCEFKL
        AALDYYVSINSDSYVATFFGNMDKMVAAMRAFNGKQRTLFLSRRAFA+FTY GLKGKELNQALWK HRDDF MGRGSALSDCFCEFKL
Subjt:  AALDYYVSINSDSYVATFFGNMDKMVAAMRAFNGKQRTLFLSRRAFADFTYKGLKGKELNQALWKTHRDDFAMGRGSALSDCFCEFKL

XP_004136085.1 BEL1-like homeodomain protein 7 [Cucumis sativus]1.4e-26687.92Show/hide
Query:  MTYFPGLSNQRDSMLGDQKLGSSCQE-QPLCSGNLMMFMNQDPSCRDYSEIFSGVSSNYVETVGDRSNVEMAFIPPVVGLLDDSNFQCQGVSLSLSTHSP
        MTYFPGLSNQRDSMLGD KLGSSCQE QPLCSGNLMMFMNQDPSCRDYSEI SGVSSNYVETVGDRSNVEMAFIPPVVG+LD+SNFQCQGVSLSLSTHSP
Subjt:  MTYFPGLSNQRDSMLGDQKLGSSCQE-QPLCSGNLMMFMNQDPSCRDYSEIFSGVSSNYVETVGDRSNVEMAFIPPVVGLLDDSNFQCQGVSLSLSTHSP

Query:  SVCSMSSFPHQYQTPAMPSSFINAPPSIFEKRPNLKPCISDAKNGMYIAVGSGYSTLNSVYIEAAQQLLDEMVSIREALKELKSKKLKASNGLGVDTCRE
        SV SMSSFPHQYQTPAM SSFINAPPSI EKR N KPCISD KNGMY+A+GSGYS LNSVYIEAAQQLLDEMVSIREALKELKSKKLKASNGLGVD+CRE
Subjt:  SVCSMSSFPHQYQTPAMPSSFINAPPSIFEKRPNLKPCISDAKNGMYIAVGSGYSTLNSVYIEAAQQLLDEMVSIREALKELKSKKLKASNGLGVDTCRE

Query:  NGGGSNDLAGEMCCDARESSVANPSGDLSPTERQDLKNKTSKLLSLLGEVDRRYKQYYQQLQYLASSFDMVAGRGAAEFYTALAHQTISCHFRRLRDAIN
        N GGSNDL GEMC +ARE+S+ANP  DLSP+ERQDLKNK SKLLSLLGEVDRRYKQYYQQLQ LASSFDMVAGRGAAEFYT+L HQTISCHFRRL+DAIN
Subjt:  NGGGSNDLAGEMCCDARESSVANPSGDLSPTERQDLKNKTSKLLSLLGEVDRRYKQYYQQLQYLASSFDMVAGRGAAEFYTALAHQTISCHFRRLRDAIN

Query:  AQIEVTRRTLGEQDALNSGQEGIPRLRFVDQHLRQQRALQQLGVTPHSWRPQRGLPESSVSILRAWLFEHFLHPYPKDSEKLKLARQTGLTRNQVMSLIL
        AQIEVTRR LGEQD+L+SGQEGIPRLRFVDQHLRQQRALQ LGVTPHSWRPQRGLPESSVSILRAWLFEHFLHPYPKDSEKLKLARQTGLTRNQ      
Subjt:  AQIEVTRRTLGEQDALNSGQEGIPRLRFVDQHLRQQRALQQLGVTPHSWRPQRGLPESSVSILRAWLFEHFLHPYPKDSEKLKLARQTGLTRNQVMSLIL

Query:  NRKLDFLPSKLKRLLANLNDIRVANWFINARVRLWKPMVEEMYKEEFGDSNIDIKSSPENASKASWNNSLFSEDRGGDELHDSTKSAANNGERVLYRPCA
                              VANWFINARVRLWKPMVEEMYKEEFGDSNIDIKSSPENASKASWNNSLFSEDRGGDELHDSTKS ANNGERVLYR CA
Subjt:  NRKLDFLPSKLKRLLANLNDIRVANWFINARVRLWKPMVEEMYKEEFGDSNIDIKSSPENASKASWNNSLFSEDRGGDELHDSTKSAANNGERVLYRPCA

Query:  TDDMINSSNFSMGGGDVSLALELKHCEGDEFGMFGSNN
        TDDMINSSNF+MGGGDVSLALELKHCEGDEFGMFG+NN
Subjt:  TDDMINSSNFSMGGGDVSLALELKHCEGDEFGMFGSNN

XP_008461100.1 PREDICTED: BEL1-like homeodomain protein 7 [Cucumis melo]7.9e-26788.1Show/hide
Query:  MTYFPGLSNQRDSMLGDQKLGSSCQE-QPLCSGNLMMFMNQDPSCRDYSEIFSGVSSNYVETVGDRSNVEMAFIPPVVGLLDDSNFQCQGVSLSLSTHSP
        MTYFPGLSNQRDSMLGD KLG SCQE QPLCSGNLMMFMNQDPSCRDYSEI SGVSSNYVETVGDRSNVEMAFIPPVVG+LD+SNFQCQGVSLSLSTHSP
Subjt:  MTYFPGLSNQRDSMLGDQKLGSSCQE-QPLCSGNLMMFMNQDPSCRDYSEIFSGVSSNYVETVGDRSNVEMAFIPPVVGLLDDSNFQCQGVSLSLSTHSP

Query:  SVCSMSSFPHQYQTPAMPSSFINAPPSIFEKRPNLKPCISDAKNGMYIAVGSGYSTLNSVYIEAAQQLLDEMVSIREALKELKSKKLKASNGLGVDTCRE
        SV SMSSFPHQYQTPAM SSFINAPPSI EKRPN K CISD KNGMY+A+GSGYS LNSVYIEAAQQLLDEMVSIREALKELKSKKLKASNGLGVD+CRE
Subjt:  SVCSMSSFPHQYQTPAMPSSFINAPPSIFEKRPNLKPCISDAKNGMYIAVGSGYSTLNSVYIEAAQQLLDEMVSIREALKELKSKKLKASNGLGVDTCRE

Query:  NGGGSNDLAGEMCCDARESSVANPSGDLSPTERQDLKNKTSKLLSLLGEVDRRYKQYYQQLQYLASSFDMVAGRGAAEFYTALAHQTISCHFRRLRDAIN
        N GGSNDL GEMC +ARE S+ANPS DLSP+ERQDLKNK SKLLSLLGEVDRRYKQYYQQLQ LASSFDMVAGRGAAEFYTAL HQTISCHFRRLRDAIN
Subjt:  NGGGSNDLAGEMCCDARESSVANPSGDLSPTERQDLKNKTSKLLSLLGEVDRRYKQYYQQLQYLASSFDMVAGRGAAEFYTALAHQTISCHFRRLRDAIN

Query:  AQIEVTRRTLGEQDALNSGQEGIPRLRFVDQHLRQQRALQQLGVTPHSWRPQRGLPESSVSILRAWLFEHFLHPYPKDSEKLKLARQTGLTRNQVMSLIL
        AQIEVTRRTLGEQDAL+SGQEGIPRLRFVDQHLRQQRALQ LGVTPHSWRPQRGLPESSVSILRAWLFEHFLHPYPKDSEKLKLARQTGLTRNQ      
Subjt:  AQIEVTRRTLGEQDALNSGQEGIPRLRFVDQHLRQQRALQQLGVTPHSWRPQRGLPESSVSILRAWLFEHFLHPYPKDSEKLKLARQTGLTRNQVMSLIL

Query:  NRKLDFLPSKLKRLLANLNDIRVANWFINARVRLWKPMVEEMYKEEFGDSNIDIKSSPENASKASWNNSLFSEDRGGDELHDSTKSAANNGERVLYRPCA
                              VANWFINARVRLWKPMVEEMYKEEFGDSNIDIKSSPENASKASWNNSLFSEDRGGD+LHDSTKS ANNGER+LYR CA
Subjt:  NRKLDFLPSKLKRLLANLNDIRVANWFINARVRLWKPMVEEMYKEEFGDSNIDIKSSPENASKASWNNSLFSEDRGGDELHDSTKSAANNGERVLYRPCA

Query:  TDDMINSSNFSMGGGDVSLALELKHCEGDEFGMFGSNN
        TD+MINSSNF+MGGGDVSLALELKHCEGDEFGMFG+NN
Subjt:  TDDMINSSNFSMGGGDVSLALELKHCEGDEFGMFGSNN

XP_038896117.1 BEL1-like homeodomain protein 7 [Benincasa hispida]1.3e-27290.19Show/hide
Query:  MTYFPGLSNQRDSMLGDQKLGSSCQEQPLCSGNLMMFMNQDPSCRDYSEIFSGVSSNYVETVGD-RSNVEMAFIPPVVGLLDDSNFQCQGVSLSLSTHSP
        MTYFPGLSNQRDSMLGDQKLGSSCQEQPLCSGNLMMFMNQDPSCRDYSEIFSGVSSNYVETVGD RSN EMAFIPPVVG+LDDSNFQCQGVSLSLSTH+P
Subjt:  MTYFPGLSNQRDSMLGDQKLGSSCQEQPLCSGNLMMFMNQDPSCRDYSEIFSGVSSNYVETVGD-RSNVEMAFIPPVVGLLDDSNFQCQGVSLSLSTHSP

Query:  SVCSMSSFPHQYQTPAMPSSFINAPPSIFEKRPNLKPCISDAKNGMYIAVGSGYSTLNSVYIEAAQQLLDEMVSIREALKELKSKKLKASNGLGVDTCRE
        SV SMSSFPHQYQ PAM SSFIN PPSIFEKRPNLKP   DAKNGMYIAVGSGYS LNSVYIEAAQQLLDEMVSI+EALKELKSKKLKASNGLGVD+CRE
Subjt:  SVCSMSSFPHQYQTPAMPSSFINAPPSIFEKRPNLKPCISDAKNGMYIAVGSGYSTLNSVYIEAAQQLLDEMVSIREALKELKSKKLKASNGLGVDTCRE

Query:  NGGGSNDLAGEMCCDARESSVANPSGDLSPTERQDLKNKTSKLLSLLGEVDRRYKQYYQQLQYLASSFDMVAGRGAAEFYTALAHQTISCHFRRLRDAIN
        N GGSNDL GEMCC+ +ESSVANPSGDLSPTERQDLKNK SKLLSLLGEVDRRYKQYYQQLQYLASSFDMVAGRGAAEFYTALAHQTISCHFRRLRDAIN
Subjt:  NGGGSNDLAGEMCCDARESSVANPSGDLSPTERQDLKNKTSKLLSLLGEVDRRYKQYYQQLQYLASSFDMVAGRGAAEFYTALAHQTISCHFRRLRDAIN

Query:  AQIEVTRRTLGEQDALNSGQEGIPRLRFVDQHLRQQRALQQLGVTPHSWRPQRGLPESSVSILRAWLFEHFLHPYPKDSEKLKLARQTGLTRNQVMSLIL
        AQIEVTRRTLGEQDAL+SGQEGIPRLRFVDQHLRQQRALQQLGVTPHSWRPQRGLPESSVSILRAWLFEHFLHPYPKDSEKLKLARQTGLTRNQ      
Subjt:  AQIEVTRRTLGEQDALNSGQEGIPRLRFVDQHLRQQRALQQLGVTPHSWRPQRGLPESSVSILRAWLFEHFLHPYPKDSEKLKLARQTGLTRNQVMSLIL

Query:  NRKLDFLPSKLKRLLANLNDIRVANWFINARVRLWKPMVEEMYKEEFGDSNIDIKSSPENASKASWNNSLFSEDRGGDELHDSTKSAANNGERVLYRPCA
                              VANWFINARVRLWKPMVEEMYKEEFGDSNIDIKSSPENASKASWNNSLFSEDRG  ELHDSTKSAANN ERVLYRPCA
Subjt:  NRKLDFLPSKLKRLLANLNDIRVANWFINARVRLWKPMVEEMYKEEFGDSNIDIKSSPENASKASWNNSLFSEDRGGDELHDSTKSAANNGERVLYRPCA

Query:  TDDMINSSNFSMGGGDVSLALELKHCEGDEFGMFGSNNRM
        TDDMINSSNFS+GGGDVSLALELKHCEGDEFGMFGSNNRM
Subjt:  TDDMINSSNFSMGGGDVSLALELKHCEGDEFGMFGSNNRM

TrEMBL top hitse value%identityAlignment
A0A0A0K705 Homeobox domain-containing protein6.5e-26787.92Show/hide
Query:  MTYFPGLSNQRDSMLGDQKLGSSCQE-QPLCSGNLMMFMNQDPSCRDYSEIFSGVSSNYVETVGDRSNVEMAFIPPVVGLLDDSNFQCQGVSLSLSTHSP
        MTYFPGLSNQRDSMLGD KLGSSCQE QPLCSGNLMMFMNQDPSCRDYSEI SGVSSNYVETVGDRSNVEMAFIPPVVG+LD+SNFQCQGVSLSLSTHSP
Subjt:  MTYFPGLSNQRDSMLGDQKLGSSCQE-QPLCSGNLMMFMNQDPSCRDYSEIFSGVSSNYVETVGDRSNVEMAFIPPVVGLLDDSNFQCQGVSLSLSTHSP

Query:  SVCSMSSFPHQYQTPAMPSSFINAPPSIFEKRPNLKPCISDAKNGMYIAVGSGYSTLNSVYIEAAQQLLDEMVSIREALKELKSKKLKASNGLGVDTCRE
        SV SMSSFPHQYQTPAM SSFINAPPSI EKR N KPCISD KNGMY+A+GSGYS LNSVYIEAAQQLLDEMVSIREALKELKSKKLKASNGLGVD+CRE
Subjt:  SVCSMSSFPHQYQTPAMPSSFINAPPSIFEKRPNLKPCISDAKNGMYIAVGSGYSTLNSVYIEAAQQLLDEMVSIREALKELKSKKLKASNGLGVDTCRE

Query:  NGGGSNDLAGEMCCDARESSVANPSGDLSPTERQDLKNKTSKLLSLLGEVDRRYKQYYQQLQYLASSFDMVAGRGAAEFYTALAHQTISCHFRRLRDAIN
        N GGSNDL GEMC +ARE+S+ANP  DLSP+ERQDLKNK SKLLSLLGEVDRRYKQYYQQLQ LASSFDMVAGRGAAEFYT+L HQTISCHFRRL+DAIN
Subjt:  NGGGSNDLAGEMCCDARESSVANPSGDLSPTERQDLKNKTSKLLSLLGEVDRRYKQYYQQLQYLASSFDMVAGRGAAEFYTALAHQTISCHFRRLRDAIN

Query:  AQIEVTRRTLGEQDALNSGQEGIPRLRFVDQHLRQQRALQQLGVTPHSWRPQRGLPESSVSILRAWLFEHFLHPYPKDSEKLKLARQTGLTRNQVMSLIL
        AQIEVTRR LGEQD+L+SGQEGIPRLRFVDQHLRQQRALQ LGVTPHSWRPQRGLPESSVSILRAWLFEHFLHPYPKDSEKLKLARQTGLTRNQ      
Subjt:  AQIEVTRRTLGEQDALNSGQEGIPRLRFVDQHLRQQRALQQLGVTPHSWRPQRGLPESSVSILRAWLFEHFLHPYPKDSEKLKLARQTGLTRNQVMSLIL

Query:  NRKLDFLPSKLKRLLANLNDIRVANWFINARVRLWKPMVEEMYKEEFGDSNIDIKSSPENASKASWNNSLFSEDRGGDELHDSTKSAANNGERVLYRPCA
                              VANWFINARVRLWKPMVEEMYKEEFGDSNIDIKSSPENASKASWNNSLFSEDRGGDELHDSTKS ANNGERVLYR CA
Subjt:  NRKLDFLPSKLKRLLANLNDIRVANWFINARVRLWKPMVEEMYKEEFGDSNIDIKSSPENASKASWNNSLFSEDRGGDELHDSTKSAANNGERVLYRPCA

Query:  TDDMINSSNFSMGGGDVSLALELKHCEGDEFGMFGSNN
        TDDMINSSNF+MGGGDVSLALELKHCEGDEFGMFG+NN
Subjt:  TDDMINSSNFSMGGGDVSLALELKHCEGDEFGMFGSNN

A0A1S3CDG8 BEL1-like homeodomain protein 73.8e-26788.1Show/hide
Query:  MTYFPGLSNQRDSMLGDQKLGSSCQE-QPLCSGNLMMFMNQDPSCRDYSEIFSGVSSNYVETVGDRSNVEMAFIPPVVGLLDDSNFQCQGVSLSLSTHSP
        MTYFPGLSNQRDSMLGD KLG SCQE QPLCSGNLMMFMNQDPSCRDYSEI SGVSSNYVETVGDRSNVEMAFIPPVVG+LD+SNFQCQGVSLSLSTHSP
Subjt:  MTYFPGLSNQRDSMLGDQKLGSSCQE-QPLCSGNLMMFMNQDPSCRDYSEIFSGVSSNYVETVGDRSNVEMAFIPPVVGLLDDSNFQCQGVSLSLSTHSP

Query:  SVCSMSSFPHQYQTPAMPSSFINAPPSIFEKRPNLKPCISDAKNGMYIAVGSGYSTLNSVYIEAAQQLLDEMVSIREALKELKSKKLKASNGLGVDTCRE
        SV SMSSFPHQYQTPAM SSFINAPPSI EKRPN K CISD KNGMY+A+GSGYS LNSVYIEAAQQLLDEMVSIREALKELKSKKLKASNGLGVD+CRE
Subjt:  SVCSMSSFPHQYQTPAMPSSFINAPPSIFEKRPNLKPCISDAKNGMYIAVGSGYSTLNSVYIEAAQQLLDEMVSIREALKELKSKKLKASNGLGVDTCRE

Query:  NGGGSNDLAGEMCCDARESSVANPSGDLSPTERQDLKNKTSKLLSLLGEVDRRYKQYYQQLQYLASSFDMVAGRGAAEFYTALAHQTISCHFRRLRDAIN
        N GGSNDL GEMC +ARE S+ANPS DLSP+ERQDLKNK SKLLSLLGEVDRRYKQYYQQLQ LASSFDMVAGRGAAEFYTAL HQTISCHFRRLRDAIN
Subjt:  NGGGSNDLAGEMCCDARESSVANPSGDLSPTERQDLKNKTSKLLSLLGEVDRRYKQYYQQLQYLASSFDMVAGRGAAEFYTALAHQTISCHFRRLRDAIN

Query:  AQIEVTRRTLGEQDALNSGQEGIPRLRFVDQHLRQQRALQQLGVTPHSWRPQRGLPESSVSILRAWLFEHFLHPYPKDSEKLKLARQTGLTRNQVMSLIL
        AQIEVTRRTLGEQDAL+SGQEGIPRLRFVDQHLRQQRALQ LGVTPHSWRPQRGLPESSVSILRAWLFEHFLHPYPKDSEKLKLARQTGLTRNQ      
Subjt:  AQIEVTRRTLGEQDALNSGQEGIPRLRFVDQHLRQQRALQQLGVTPHSWRPQRGLPESSVSILRAWLFEHFLHPYPKDSEKLKLARQTGLTRNQVMSLIL

Query:  NRKLDFLPSKLKRLLANLNDIRVANWFINARVRLWKPMVEEMYKEEFGDSNIDIKSSPENASKASWNNSLFSEDRGGDELHDSTKSAANNGERVLYRPCA
                              VANWFINARVRLWKPMVEEMYKEEFGDSNIDIKSSPENASKASWNNSLFSEDRGGD+LHDSTKS ANNGER+LYR CA
Subjt:  NRKLDFLPSKLKRLLANLNDIRVANWFINARVRLWKPMVEEMYKEEFGDSNIDIKSSPENASKASWNNSLFSEDRGGDELHDSTKSAANNGERVLYRPCA

Query:  TDDMINSSNFSMGGGDVSLALELKHCEGDEFGMFGSNN
        TD+MINSSNF+MGGGDVSLALELKHCEGDEFGMFG+NN
Subjt:  TDDMINSSNFSMGGGDVSLALELKHCEGDEFGMFGSNN

A0A5A7US73 BEL1-like homeodomain protein 71.7e-25988Show/hide
Query:  MLGDQKLGSSCQE-QPLCSGNLMMFMNQDPSCRDYSEIFSGVSSNYVETVGDRSNVEMAFIPPVVGLLDDSNFQCQGVSLSLSTHSPSVCSMSSFPHQYQ
        MLGDQKLG SCQE QPLCSGNLMMFMNQDPSCRDYSEI SGVSSNYVETVGDRSNVEMAFIPPVVG+LD+SNFQCQGVSLSLSTHSPSV SMSSFPHQYQ
Subjt:  MLGDQKLGSSCQE-QPLCSGNLMMFMNQDPSCRDYSEIFSGVSSNYVETVGDRSNVEMAFIPPVVGLLDDSNFQCQGVSLSLSTHSPSVCSMSSFPHQYQ

Query:  TPAMPSSFINAPPSIFEKRPNLKPCISDAKNGMYIAVGSGYSTLNSVYIEAAQQLLDEMVSIREALKELKSKKLKASNGLGVDTCRENGGGSNDLAGEMC
        TPAM SSFINAPPSI EKRPN K CISD KNGMY+A+GSGYS LNSVYIEAAQQLLDEMVSIREALKELKSKKLKASNGLGVD+CREN GGSNDL GEMC
Subjt:  TPAMPSSFINAPPSIFEKRPNLKPCISDAKNGMYIAVGSGYSTLNSVYIEAAQQLLDEMVSIREALKELKSKKLKASNGLGVDTCRENGGGSNDLAGEMC

Query:  CDARESSVANPSGDLSPTERQDLKNKTSKLLSLLGEVDRRYKQYYQQLQYLASSFDMVAGRGAAEFYTALAHQTISCHFRRLRDAINAQIEVTRRTLGEQ
         +ARE S+ANPS DLSP+ERQDLKNK SKLLSLLGEVDRRYKQYYQQLQ LASSFDMVAGRGAAEFYTAL HQTISCHFRRLRDAINAQIEVTRRTLGEQ
Subjt:  CDARESSVANPSGDLSPTERQDLKNKTSKLLSLLGEVDRRYKQYYQQLQYLASSFDMVAGRGAAEFYTALAHQTISCHFRRLRDAINAQIEVTRRTLGEQ

Query:  DALNSGQEGIPRLRFVDQHLRQQRALQQLGVTPHSWRPQRGLPESSVSILRAWLFEHFLHPYPKDSEKLKLARQTGLTRNQVMSLILNRKLDFLPSKLKR
        DAL+SGQEGIPRLRFVDQHLRQQRALQ LGVTPHSWRPQRGLPESSVSILRAWLFEHFLHPYPKDSEKLKLARQTGLTRNQ                   
Subjt:  DALNSGQEGIPRLRFVDQHLRQQRALQQLGVTPHSWRPQRGLPESSVSILRAWLFEHFLHPYPKDSEKLKLARQTGLTRNQVMSLILNRKLDFLPSKLKR

Query:  LLANLNDIRVANWFINARVRLWKPMVEEMYKEEFGDSNIDIKSSPENASKASWNNSLFSEDRGGDELHDSTKSAANNGERVLYRPCATDDMINSSNFSMG
                 VANWFINARVRLWKPMVEEMYKEEFGDSNIDIKSSPENASKASWNNSLFSEDRGGD+LHDSTKS ANNGER+LYR CATD+MINSSNF+MG
Subjt:  LLANLNDIRVANWFINARVRLWKPMVEEMYKEEFGDSNIDIKSSPENASKASWNNSLFSEDRGGDELHDSTKSAANNGERVLYRPCATDDMINSSNFSMG

Query:  GGDVSLALELKHCEGDEFGMFGSNN
        GGDVSLALELKHCEGDEFGMFG+NN
Subjt:  GGDVSLALELKHCEGDEFGMFGSNN

A0A5A7USC2 Lipopolysaccharide core biosynthesis mannosyltransferase lpsB2.9e-23892.51Show/hide
Query:  PFPIFTSLQSRREVGERWPLMILALVSISTAMVFFMRTSFDSCSGNGNRRFVEENGIDSQIRSSQIERKAPNPLDFMKSKLVLLVSHELSLSGGPLLLME
        PFP  T  +      +RWPLMILALVSISTAMVFFMRT+FDSCSGN NRRFVEE GI S+I SSQ+ERKAPNPLDFMKSKLVLLVSHELSLSGGPLLLME
Subjt:  PFPIFTSLQSRREVGERWPLMILALVSISTAMVFFMRTSFDSCSGNGNRRFVEENGIDSQIRSSQIERKAPNPLDFMKSKLVLLVSHELSLSGGPLLLME

Query:  LAFLLRGVGTQVVWITNQKPSEPDEVVYSLERKMLDRGVQIFILKVLSAKGQEAVETALKAHLVVLNTAVAGKWLDAILKENVPRVLPKVLWWIHEMRGN
        LAFLLRGVGTQVVWITNQKP EPDEVVYSLERKMLDRGVQ     VLSAK QEAVETALKAHLVVLNTAVAGKWLDA+LKENVPRVLPKVLWWIHEMRGN
Subjt:  LAFLLRGVGTQVVWITNQKPSEPDEVVYSLERKMLDRGVQIFILKVLSAKGQEAVETALKAHLVVLNTAVAGKWLDAILKENVPRVLPKVLWWIHEMRGN

Query:  YFKVEYVKHLPFVAGAMIDSHTTAEYWKNRTRERLGIKMPETYVVHLGNSKDLMEVAENNVAKRVLREHIRESLGVRNEDILFAIINSVSRGKGQDLFLR
        YFKVEYVKHLPFVAGAMIDSHTTAEYWKNRT +RLGI+MPETYVVHLGNSKDLMEVAENNVAKRVLREHIRESLGVRNEDILFAIINSVSRGKGQDLFLR
Subjt:  YFKVEYVKHLPFVAGAMIDSHTTAEYWKNRTRERLGIKMPETYVVHLGNSKDLMEVAENNVAKRVLREHIRESLGVRNEDILFAIINSVSRGKGQDLFLR

Query:  AFHQSLQMIQDKKLRVPRIHAVVVGSDMSAQTKFETELRNFVNENKIQDRVHFVNKTLSVAPYLASIDVLVQNSQGRGECFGRITIEAMAFQLPVLGTAA
        AFHQSLQMIQDKKLRVPRIHAVVVGSDMSA TKFETELRNFVNENKIQDRVHFVNKTLSVAPYLASIDVLVQNSQGRGECFGRITIEAMAFQLPVLGTAA
Subjt:  AFHQSLQMIQDKKLRVPRIHAVVVGSDMSAQTKFETELRNFVNENKIQDRVHFVNKTLSVAPYLASIDVLVQNSQGRGECFGRITIEAMAFQLPVLGTAA

Query:  GGTMEIVVNGTTGLLHPAGKEGVTPLAHNIVKLATHVERRLTIGKKGYERVRQMFLEQHMGQRIAFV
        GGTMEIVVNGTTGLLHPAGKEG+TPLAHNIVKLATHVERRLT GKKGYERVRQMFLEQHM QRIA V
Subjt:  GGTMEIVVNGTTGLLHPAGKEGVTPLAHNIVKLATHVERRLTIGKKGYERVRQMFLEQHMGQRIAFV

A0A6J1GMZ6 uncharacterized protein LOC111455876 isoform X14.9e-23895.16Show/hide
Query:  RWPLMILALVSISTAMVFFMRTSFDSCSGNGNRRFVEENGIDSQIRSSQIERK----APNPLDFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTQV
        RWPLMILALVSISTAMVFFMRT+FDSCSGN NRRFVEENGIDSQIRSSQIERK     PNPLDFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGT+V
Subjt:  RWPLMILALVSISTAMVFFMRTSFDSCSGNGNRRFVEENGIDSQIRSSQIERK----APNPLDFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTQV

Query:  VWITNQKPSEPDEVVYSLERKMLDRGVQIFILKVLSAKGQEAVETALKAHLVVLNTAVAGKWLDAILKENVPRVLPKVLWWIHEMRGNYFKVEYVKHLPF
        VWITNQKPSEPDEV YSLERKMLDRGVQ     VLSAKGQEAVETALKA LVVLNTAVAGKWLDA+LKENVPRVLPKVLWWIHEMRGNYFKVEYVKHLPF
Subjt:  VWITNQKPSEPDEVVYSLERKMLDRGVQIFILKVLSAKGQEAVETALKAHLVVLNTAVAGKWLDAILKENVPRVLPKVLWWIHEMRGNYFKVEYVKHLPF

Query:  VAGAMIDSHTTAEYWKNRTRERLGIKMPETYVVHLGNSKDLMEVAENNVAKRVLREHIRESLGVRNEDILFAIINSVSRGKGQDLFLRAFHQSLQMIQDK
        VAGAMIDSHTTAEYWKNRTRERLGIKMPETYVVHLGNSKDLMEVAENNVAKRVLREHIRESLGVRNEDILFAIINSVSRGKGQDLFLRAFHQSLQMIQDK
Subjt:  VAGAMIDSHTTAEYWKNRTRERLGIKMPETYVVHLGNSKDLMEVAENNVAKRVLREHIRESLGVRNEDILFAIINSVSRGKGQDLFLRAFHQSLQMIQDK

Query:  KLRVPRIHAVVVGSDMSAQTKFETELRNFVNENKIQDRVHFVNKTLSVAPYLASIDVLVQNSQGRGECFGRITIEAMAFQLPVLGTAAGGTMEIVVNGTT
        KL+VPRIHAVVVGSDMSAQTKFETELRNFVNENKIQDRVHFVNKTLSV PYLASIDVLVQNSQGRGECFGRITIEAMAFQLPVLGTAAGGTMEIVVNGTT
Subjt:  KLRVPRIHAVVVGSDMSAQTKFETELRNFVNENKIQDRVHFVNKTLSVAPYLASIDVLVQNSQGRGECFGRITIEAMAFQLPVLGTAAGGTMEIVVNGTT

Query:  GLLHPAGKEGVTPLAHNIVKLATHVERRLTIGKKGYERVRQMFLEQHMGQRIAFV
        GLLHPAGKEGVTPLA NIVKLAT VERRLTIGKKGYERVRQMFLEQHM QRIA V
Subjt:  GLLHPAGKEGVTPLAHNIVKLATHVERRLTIGKKGYERVRQMFLEQHMGQRIAFV

SwissProt top hitse value%identityAlignment
O65685 BEL1-like homeodomain protein 62.7e-7646.15Show/hide
Query:  NSVYIEAAQQLLDEMVSIREALKELKSKKLKASNGLGVDTCRENGGGSNDLAGEMCCDARESSVANPSGDLSPTERQDLKNKTSKLLSLLGEVDRRYKQY
        NS Y++AAQQLLDE V++++ALK+ +++                 G  N+   +    + + S  NP  D+S +ERQ++++K +KLLS+L EVDRRYKQY
Subjt:  NSVYIEAAQQLLDEMVSIREALKELKSKKLKASNGLGVDTCRENGGGSNDLAGEMCCDARESSVANPSGDLSPTERQDLKNKTSKLLSLLGEVDRRYKQY

Query:  YQQLQYLASSFDMVAGRGAAEFYTALAHQTISCHFRRLRDAINAQIEVTRRTLGEQDALNSGQEG--IPRLRFVDQHLRQQRALQQLGVTPHSWRPQRGL
        YQQ+Q + SSFD++AG GAA+ YTALA QTIS HFR LRDAI+ QI V R+ LGEQ   + G+    I RL++VDQHLRQQR   Q    P +WRPQRGL
Subjt:  YQQLQYLASSFDMVAGRGAAEFYTALAHQTISCHFRRLRDAINAQIEVTRRTLGEQDALNSGQEG--IPRLRFVDQHLRQQRALQQLGVTPHSWRPQRGL

Query:  PESSVSILRAWLFEHFLHPYPKDSEKLKLARQTGLTRNQVMSLILNRKLDFLPSKLKRLLANLNDIRVANWFINARVRLWKPMVEEMYKEEFGDSNIDIK
        PE+SV ILRAWLFEHFLHPYPKDS+K+ LARQTGL+R Q                            V+NWFINARVRLWKPMVEE+YKEEF ++  D  
Subjt:  PESSVSILRAWLFEHFLHPYPKDSEKLKLARQTGLTRNQVMSLILNRKLDFLPSKLKRLLANLNDIRVANWFINARVRLWKPMVEEMYKEEFGDSNIDIK

Query:  SSPENASKASWNNSLFS--EDRGGDELHDSTKSAANNGERVLYRPCATDDMINSSNF-------------SMGGGDVSLALELKHCEGDE
        SS EN  K S    + +  EDR  +   D TK    +G     R       ++   F              +G GDVSL L L++ +G +
Subjt:  SSPENASKASWNNSLFS--EDRGGDELHDSTKSAANNGERVLYRPCATDDMINSSNF-------------SMGGGDVSLALELKHCEGDE

Q8RY81 O-fucosyltransferase 135.9e-14873.49Show/hide
Query:  LCDGVGIARLLNATLVLPKFEVAAYWNESS-------------QMNGYVKVVKMLPPEFASKEPYHVDCSKRKGQFDYIESVLPSLLEHRYISITPAMSQ
        LCDGVGIARLLNATLVLPKFEVAAYWNESS             +M+GY++VVK LP + ASKEP+ VDCSKRKGQFDYIESVLP LLEH YIS TPAMSQ
Subjt:  LCDGVGIARLLNATLVLPKFEVAAYWNESS-------------QMNGYVKVVKMLPPEFASKEPYHVDCSKRKGQFDYIESVLPSLLEHRYISITPAMSQ

Query:  RRDRYPQYAKAALCQACYNGLRLAKSVEEKAKELLEAIPKPFLSLHLRFEPDMVAYSQCEYKGLSPASLEAIEATRGDRKPWTGELAGIWRKRGKCPLTP
        RRDRYP+YA+A LCQACY+ + L  S+E+KA EL +AIPKPFLSLHLRFEPDMVAYSQCEY  LSP+S+ AIEA R DRKPWTGELA  WRKRGKCPLTP
Subjt:  RRDRYPQYAKAALCQACYNGLRLAKSVEEKAKELLEAIPKPFLSLHLRFEPDMVAYSQCEYKGLSPASLEAIEATRGDRKPWTGELAGIWRKRGKCPLTP

Query:  RETALILQALHIPTNTNIYLAAGDGLMELEGFTSVYTNVVTKSSFLSSDDFSSMHGNTKAALDYYVSINSDSYVATFFGNMDKMVAAMRAFNGKQRTLFL
         ET L+LQ+L+IPT+TNIYLAAGDGLME+EGFTSVYTNV TKS  L+ +DF+ MHGNTKAALDY+VSINSD+YVAT+FGNMDK+VAAMR +     TLFL
Subjt:  RETALILQALHIPTNTNIYLAAGDGLMELEGFTSVYTNVVTKSSFLSSDDFSSMHGNTKAALDYYVSINSDSYVATFFGNMDKMVAAMRAFNGKQRTLFL

Query:  SRRAFADFTYKGLKGKELNQALWKTHRDDFAMGRGSALSDCFCEFKL
        SR+AFA+ T +GL+G EL +ALW+ H+ DFA+GRG AL DCFCEF+L
Subjt:  SRRAFADFTYKGLKGKELNQALWKTHRDDFAMGRGSALSDCFCEFKL

Q9FWS9 BEL1-like homeodomain protein 31.8e-8040.32Show/hide
Query:  MFMNQDPSCRDYSEIFSGVSSNYVETVGDRSNVEMAFIPP---------VVGLLDDSNFQCQGVSLSLSTHSPSVCSMSSFPHQYQTPAMPSSFINAPPS
        +++N+    +  +   S  S + + + GD  N EM FIPP         V    +D +F   G+SLSL     S  S+S F + YQ  +   S+ N  PS
Subjt:  MFMNQDPSCRDYSEIFSGVSSNYVETVGDRSNVEMAFIPP---------VVGLLDDSNFQCQGVSLSLSTHSPSVCSMSSFPHQYQTPAMPSSFINAPPS

Query:  IFEKRPNLKPCI---------------SDAKNGM-------YIAVGSGYSTLNSVYIEAAQQLLDEMVSIREALKELKSKKLKASNGLGVDTCRENGGGS
                   +               ++  NG+       Y   G   S L S Y++  QQLLDE+VS+R+ LK L +KK+K   G      ++   GS
Subjt:  IFEKRPNLKPCI---------------SDAKNGM-------YIAVGSGYSTLNSVYIEAAQQLLDEMVSIREALKELKSKKLKASNGLGVDTCRENGGGS

Query:  NDLAGEMCCDARESSVANPSGDLSPTERQDLKNKTSKLLSLLGEVDRRYKQYYQQLQYLASSFDMVAGRGAAEFYTALAHQTISCHFRRLRDAINAQIEV
        +D   E           + S +LSP+ERQ+L++K SKLL+++ EVD+RY QY+ Q++ LASSF+MV G GAA+ YT++A   IS HFR LRDAI  QI+V
Subjt:  NDLAGEMCCDARESSVANPSGDLSPTERQDLKNKTSKLLSLLGEVDRRYKQYYQQLQYLASSFDMVAGRGAAEFYTALAHQTISCHFRRLRDAINAQIEV

Query:  TRRTLGEQDALNSGQEGIPRLRFVDQHLRQQRAL-QQLGVTPHSWRPQRGLPESSVSILRAWLFEHFLHPYPKDSEKLKLARQTGLTRNQVMSLILNRKL
         R  LGE++  +   E IPRLR++DQ LRQQRAL QQLG+   +WRPQRGLPE+SVSILRAWLFEHFLHPYPK+SEK+ L++QTGL++NQ          
Subjt:  TRRTLGEQDALNSGQEGIPRLRFVDQHLRQQRAL-QQLGVTPHSWRPQRGLPESSVSILRAWLFEHFLHPYPKDSEKLKLARQTGLTRNQVMSLILNRKL

Query:  DFLPSKLKRLLANLNDIRVANWFINARVRLWKPMVEEMYKEEFGDSNIDIKSSPENASKASWNNSLFSEDRGGDELHDSTKSAANNGERVLYRPCATDDM
                          VANWFINARVRLWKPM+EEMYKEEFG+S   + +S ++  K    + L  ED    +     ++  NN   + Y   A  ++
Subjt:  DFLPSKLKRLLANLNDIRVANWFINARVRLWKPMVEEMYKEEFGDSNIDIKSSPENASKASWNNSLFSEDRGGDELHDSTKSAANNGERVLYRPCATDDM

Query:  I
        +
Subjt:  I

Q9FXG8 BEL1-like homeodomain protein 106.4e-7841.25Show/hide
Query:  MFMNQDPSCRDYSEIFSGVSSNYVETVGDRSNVEMAFIPP-----------VVGLLDDSNFQCQGVSLSLSTHSPSVCSMSSFP-HQYQ---TPAMPSSF
        +F+N     +  S   S  ++++  T GD    EM FIPP            V   D S    QG+SLSL T      S++ F  HQYQ   T   PS  
Subjt:  MFMNQDPSCRDYSEIFSGVSSNYVETVGDRSNVEMAFIPP-----------VVGLLDDSNFQCQGVSLSLSTHSPSVCSMSSFP-HQYQ---TPAMPSSF

Query:  I--NAPPSIFEKRPNLKPCI----SDAKNGMYIAVGSGY---------------STLNSVYIEAAQQLLDEMVSIREALKELKSKKLKASNGLGVDTCRE
        +   +P  + E     K  I    SD  +G     G+G+               S L S Y++ AQ LLDE+VS+++ L ++  KK+K  N     +   
Subjt:  I--NAPPSIFEKRPNLKPCI----SDAKNGMYIAVGSGY---------------STLNSVYIEAAQQLLDEMVSIREALKELKSKKLKASNGLGVDTCRE

Query:  NGGGSNDLAGEMCCDARESSVANPSGDLSPTERQDLKNKTSKLLSLLGEVDRRYKQYYQQLQYLASSFDMVAGRGAAEFYTALAHQTISCHFRRLRDAIN
         GGG     GE+  D+   S+     +LS  ER++L+NK +KLL+++ EVD+RY QYY Q++ LASSF++VAG G+A+ YT++A   IS HFR LRDAI 
Subjt:  NGGGSNDLAGEMCCDARESSVANPSGDLSPTERQDLKNKTSKLLSLLGEVDRRYKQYYQQLQYLASSFDMVAGRGAAEFYTALAHQTISCHFRRLRDAIN

Query:  AQIEVTRRTLGEQ--DALNSGQ-EGIPRLRFVDQHLRQQRAL-QQLGVTPHSWRPQRGLPESSVSILRAWLFEHFLHPYPKDSEKLKLARQTGLTRNQVM
         QI++ R  LGE+  ++L+  Q E IPRLR++DQ LRQQRAL QQLG+   +WRPQRGLPE+SVS+LRAWLFEHFLHPYPK+SEK+ LA+QTGL++NQ  
Subjt:  AQIEVTRRTLGEQ--DALNSGQ-EGIPRLRFVDQHLRQQRAL-QQLGVTPHSWRPQRGLPESSVSILRAWLFEHFLHPYPKDSEKLKLARQTGLTRNQVM

Query:  SLILNRKLDFLPSKLKRLLANLNDIRVANWFINARVRLWKPMVEEMYKEEFGDSN--IDIKSSPENASKASWNNSLFSEDRGGDELHDSTKSAANNGERV
                                  VANWFINARVRLWKPM+EEMYKEEFGD +  +  KSS E        NS   ED    +     +   NN   +
Subjt:  SLILNRKLDFLPSKLKRLLANLNDIRVANWFINARVRLWKPMVEEMYKEEFGDSN--IDIKSSPENASKASWNNSLFSEDRGGDELHDSTKSAANNGERV

Query:  LYRPCATDDMINSS
         Y    T +++ SS
Subjt:  LYRPCATDDMINSS

Q9SIW1 BEL1-like homeodomain protein 71.6e-7345.01Show/hide
Query:  NSVYIEAAQQLLDEMVSIREALKELKSKKLKASNGLGVDTCRENGGGSNDLAGEMCCDARESSVANPSGDLSPTERQDLKNKTSKLLSLLGEVDRRYKQY
        NS Y++AAQ+LLDE V++++ALK+ + +                        G+   + +E ++   + ++   ERQ+L++K SKLLS+L EVDR YKQY
Subjt:  NSVYIEAAQQLLDEMVSIREALKELKSKKLKASNGLGVDTCRENGGGSNDLAGEMCCDARESSVANPSGDLSPTERQDLKNKTSKLLSLLGEVDRRYKQY

Query:  YQQLQYLASSFDMVAGRGAAEFYTALAHQTISCHFRRLRDAINAQIEVTRRTL-GEQDALNSGQEGIPRLRFVDQHLRQQRALQQLGV-TPHSWRPQRGL
        Y Q+Q + SSFD++AG GAA+ YTALA QTIS HFR LRDAI+ QI V R++L GEQD  +    GI RLR VDQ +RQQRALQ+LGV  PH+WRPQRGL
Subjt:  YQQLQYLASSFDMVAGRGAAEFYTALAHQTISCHFRRLRDAINAQIEVTRRTL-GEQDALNSGQEGIPRLRFVDQHLRQQRALQQLGV-TPHSWRPQRGL

Query:  PESSVSILRAWLFEHFLHPYPKDSEKLKLARQTGLTRNQVMSLILNRKLDFLPSKLKRLLANLNDIRVANWFINARVRLWKPMVEEMYKEEFGDS--NID
        P+SSV +LRAWLFEHFLHPYPKDS+K+ LARQTGL+R Q                            V+NWFINARVRLWKPMVEEMYKEEF D+    D
Subjt:  PESSVSILRAWLFEHFLHPYPKDSEKLKLARQTGLTRNQVMSLILNRKLDFLPSKLKRLLANLNDIRVANWFINARVRLWKPMVEEMYKEEFGDS--NID

Query:  IKSSPENASKASWNNSLFSEDRGGDELHDSTKSAANNGERVLY---RPCATDDMINSSNFSMGGGDVSLALELKHCEGDEFGMFGSNNRMN
           S EN  + +    L +E    +       S++     V +   R     DM  + N   GG  ++L ++     GD     G +N  N
Subjt:  IKSSPENASKASWNNSLFSEDRGGDELHDSTKSAANNGERVLY---RPCATDDMINSSNFSMGGGDVSLALELKHCEGDEFGMFGSNNRMN

Arabidopsis top hitse value%identityAlignment
AT1G19710.1 UDP-Glycosyltransferase superfamily protein1.6e-18874.2Show/hide
Query:  ERWPLMILALVSISTAMVFFMRTSFDSCSGNGNRRFVE-ENGIDSQIRSSQIERKAPNPLDFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTQVVW
        +RWPLMIL ++S+ST  +  +R++FDSCS +G R   E E+  D +I+S      + NPL+FMKSKLVLLVSHELSLSGGPLLLMELAFLLRGV ++VVW
Subjt:  ERWPLMILALVSISTAMVFFMRTSFDSCSGNGNRRFVE-ENGIDSQIRSSQIERKAPNPLDFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTQVVW

Query:  ITNQKPSEPDEVVYSLERKMLDRGVQIFILKVLSAKGQEAVETALKAHLVVLNTAVAGKWLDAILKENVPRVLPKVLWWIHEMRGNYFKVEYVKHLPFVA
        ITNQKP E DEV+  LE KMLDRGVQ     V+SAK Q+A++TALK+ LVVLNTAVAGKWLDA+LK+NVP+VLPKVLWWIHEMRG+YFK + VKHLPFVA
Subjt:  ITNQKPSEPDEVVYSLERKMLDRGVQIFILKVLSAKGQEAVETALKAHLVVLNTAVAGKWLDAILKENVPRVLPKVLWWIHEMRGNYFKVEYVKHLPFVA

Query:  GAMIDSHTTAEYWKNRTRERLGIKMPETYVVHLGNSKDLMEVAENNVAKRVLREHIRESLGVRNEDILFAIINSVSRGKGQDLFLRAFHQSLQMIQD-KK
        GAMIDSH TAEYWKNRT +RLGIKMP+TYVVHLGNSK+LMEVAE++ AK VLRE +RESLGVRNEDILF IINSVSRGKGQDLFLRAFH+SL++I++ KK
Subjt:  GAMIDSHTTAEYWKNRTRERLGIKMPETYVVHLGNSKDLMEVAENNVAKRVLREHIRESLGVRNEDILFAIINSVSRGKGQDLFLRAFHQSLQMIQD-KK

Query:  LRVPRIHAVVVGSDMSAQTKFETELRNFVNENKIQDRVHFVNKTLSVAPYLASIDVLVQNSQGRGECFGRITIEAMAFQLPVLGTAAGGTMEIVVNGTTG
        L VP +HAVVVGSDMSAQTKFETELRNFV E K+Q  VHFVNKT+ VAPYLA+IDVLVQNSQ RGECFGRITIEAMAF+LPVLGTAAGGTMEIVVN TTG
Subjt:  LRVPRIHAVVVGSDMSAQTKFETELRNFVNENKIQDRVHFVNKTLSVAPYLASIDVLVQNSQGRGECFGRITIEAMAFQLPVLGTAAGGTMEIVVNGTTG

Query:  LLHPAGKEGVTPLAHNIVKLATHVERRLTIGKKGYERVRQMFLEQHMGQRIAFVAGNYV--AQILSVTT
        LLH  GK+GV PLA NIVKLAT+V+ R T+GKKGYERV++MFLE HM  RIA V    +  A+I S TT
Subjt:  LLHPAGKEGVTPLAHNIVKLATHVERRLTIGKKGYERVRQMFLEQHMGQRIAFVAGNYV--AQILSVTT

AT1G52630.1 O-fucosyltransferase family protein4.2e-14973.49Show/hide
Query:  LCDGVGIARLLNATLVLPKFEVAAYWNESS-------------QMNGYVKVVKMLPPEFASKEPYHVDCSKRKGQFDYIESVLPSLLEHRYISITPAMSQ
        LCDGVGIARLLNATLVLPKFEVAAYWNESS             +M+GY++VVK LP + ASKEP+ VDCSKRKGQFDYIESVLP LLEH YIS TPAMSQ
Subjt:  LCDGVGIARLLNATLVLPKFEVAAYWNESS-------------QMNGYVKVVKMLPPEFASKEPYHVDCSKRKGQFDYIESVLPSLLEHRYISITPAMSQ

Query:  RRDRYPQYAKAALCQACYNGLRLAKSVEEKAKELLEAIPKPFLSLHLRFEPDMVAYSQCEYKGLSPASLEAIEATRGDRKPWTGELAGIWRKRGKCPLTP
        RRDRYP+YA+A LCQACY+ + L  S+E+KA EL +AIPKPFLSLHLRFEPDMVAYSQCEY  LSP+S+ AIEA R DRKPWTGELA  WRKRGKCPLTP
Subjt:  RRDRYPQYAKAALCQACYNGLRLAKSVEEKAKELLEAIPKPFLSLHLRFEPDMVAYSQCEYKGLSPASLEAIEATRGDRKPWTGELAGIWRKRGKCPLTP

Query:  RETALILQALHIPTNTNIYLAAGDGLMELEGFTSVYTNVVTKSSFLSSDDFSSMHGNTKAALDYYVSINSDSYVATFFGNMDKMVAAMRAFNGKQRTLFL
         ET L+LQ+L+IPT+TNIYLAAGDGLME+EGFTSVYTNV TKS  L+ +DF+ MHGNTKAALDY+VSINSD+YVAT+FGNMDK+VAAMR +     TLFL
Subjt:  RETALILQALHIPTNTNIYLAAGDGLMELEGFTSVYTNVVTKSSFLSSDDFSSMHGNTKAALDYYVSINSDSYVATFFGNMDKMVAAMRAFNGKQRTLFL

Query:  SRRAFADFTYKGLKGKELNQALWKTHRDDFAMGRGSALSDCFCEFKL
        SR+AFA+ T +GL+G EL +ALW+ H+ DFA+GRG AL DCFCEF+L
Subjt:  SRRAFADFTYKGLKGKELNQALWKTHRDDFAMGRGSALSDCFCEFKL

AT1G52630.2 O-fucosyltransferase family protein4.2e-14973.49Show/hide
Query:  LCDGVGIARLLNATLVLPKFEVAAYWNESS-------------QMNGYVKVVKMLPPEFASKEPYHVDCSKRKGQFDYIESVLPSLLEHRYISITPAMSQ
        LCDGVGIARLLNATLVLPKFEVAAYWNESS             +M+GY++VVK LP + ASKEP+ VDCSKRKGQFDYIESVLP LLEH YIS TPAMSQ
Subjt:  LCDGVGIARLLNATLVLPKFEVAAYWNESS-------------QMNGYVKVVKMLPPEFASKEPYHVDCSKRKGQFDYIESVLPSLLEHRYISITPAMSQ

Query:  RRDRYPQYAKAALCQACYNGLRLAKSVEEKAKELLEAIPKPFLSLHLRFEPDMVAYSQCEYKGLSPASLEAIEATRGDRKPWTGELAGIWRKRGKCPLTP
        RRDRYP+YA+A LCQACY+ + L  S+E+KA EL +AIPKPFLSLHLRFEPDMVAYSQCEY  LSP+S+ AIEA R DRKPWTGELA  WRKRGKCPLTP
Subjt:  RRDRYPQYAKAALCQACYNGLRLAKSVEEKAKELLEAIPKPFLSLHLRFEPDMVAYSQCEYKGLSPASLEAIEATRGDRKPWTGELAGIWRKRGKCPLTP

Query:  RETALILQALHIPTNTNIYLAAGDGLMELEGFTSVYTNVVTKSSFLSSDDFSSMHGNTKAALDYYVSINSDSYVATFFGNMDKMVAAMRAFNGKQRTLFL
         ET L+LQ+L+IPT+TNIYLAAGDGLME+EGFTSVYTNV TKS  L+ +DF+ MHGNTKAALDY+VSINSD+YVAT+FGNMDK+VAAMR +     TLFL
Subjt:  RETALILQALHIPTNTNIYLAAGDGLMELEGFTSVYTNVVTKSSFLSSDDFSSMHGNTKAALDYYVSINSDSYVATFFGNMDKMVAAMRAFNGKQRTLFL

Query:  SRRAFADFTYKGLKGKELNQALWKTHRDDFAMGRGSALSDCFCEFKL
        SR+AFA+ T +GL+G EL +ALW+ H+ DFA+GRG AL DCFCEF+L
Subjt:  SRRAFADFTYKGLKGKELNQALWKTHRDDFAMGRGSALSDCFCEFKL

AT1G75410.1 BEL1-like homeodomain 31.3e-8140.32Show/hide
Query:  MFMNQDPSCRDYSEIFSGVSSNYVETVGDRSNVEMAFIPP---------VVGLLDDSNFQCQGVSLSLSTHSPSVCSMSSFPHQYQTPAMPSSFINAPPS
        +++N+    +  +   S  S + + + GD  N EM FIPP         V    +D +F   G+SLSL     S  S+S F + YQ  +   S+ N  PS
Subjt:  MFMNQDPSCRDYSEIFSGVSSNYVETVGDRSNVEMAFIPP---------VVGLLDDSNFQCQGVSLSLSTHSPSVCSMSSFPHQYQTPAMPSSFINAPPS

Query:  IFEKRPNLKPCI---------------SDAKNGM-------YIAVGSGYSTLNSVYIEAAQQLLDEMVSIREALKELKSKKLKASNGLGVDTCRENGGGS
                   +               ++  NG+       Y   G   S L S Y++  QQLLDE+VS+R+ LK L +KK+K   G      ++   GS
Subjt:  IFEKRPNLKPCI---------------SDAKNGM-------YIAVGSGYSTLNSVYIEAAQQLLDEMVSIREALKELKSKKLKASNGLGVDTCRENGGGS

Query:  NDLAGEMCCDARESSVANPSGDLSPTERQDLKNKTSKLLSLLGEVDRRYKQYYQQLQYLASSFDMVAGRGAAEFYTALAHQTISCHFRRLRDAINAQIEV
        +D   E           + S +LSP+ERQ+L++K SKLL+++ EVD+RY QY+ Q++ LASSF+MV G GAA+ YT++A   IS HFR LRDAI  QI+V
Subjt:  NDLAGEMCCDARESSVANPSGDLSPTERQDLKNKTSKLLSLLGEVDRRYKQYYQQLQYLASSFDMVAGRGAAEFYTALAHQTISCHFRRLRDAINAQIEV

Query:  TRRTLGEQDALNSGQEGIPRLRFVDQHLRQQRAL-QQLGVTPHSWRPQRGLPESSVSILRAWLFEHFLHPYPKDSEKLKLARQTGLTRNQVMSLILNRKL
         R  LGE++  +   E IPRLR++DQ LRQQRAL QQLG+   +WRPQRGLPE+SVSILRAWLFEHFLHPYPK+SEK+ L++QTGL++NQ          
Subjt:  TRRTLGEQDALNSGQEGIPRLRFVDQHLRQQRAL-QQLGVTPHSWRPQRGLPESSVSILRAWLFEHFLHPYPKDSEKLKLARQTGLTRNQVMSLILNRKL

Query:  DFLPSKLKRLLANLNDIRVANWFINARVRLWKPMVEEMYKEEFGDSNIDIKSSPENASKASWNNSLFSEDRGGDELHDSTKSAANNGERVLYRPCATDDM
                          VANWFINARVRLWKPM+EEMYKEEFG+S   + +S ++  K    + L  ED    +     ++  NN   + Y   A  ++
Subjt:  DFLPSKLKRLLANLNDIRVANWFINARVRLWKPMVEEMYKEEFGDSNIDIKSSPENASKASWNNSLFSEDRGGDELHDSTKSAANNGERVLYRPCATDDM

Query:  I
        +
Subjt:  I

AT1G75420.1 UDP-Glycosyltransferase superfamily protein8.1e-19376.33Show/hide
Query:  ERWPLMILALVSISTAMVFFMRTSFDSCSGNGNRRFVEENGIDSQIRSSQIERKAPNPLDFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTQVVWI
        +RW LM+L  +S+ST  +  +R+SF++CS +   +FVEE   +S     Q      NPLDFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVG  VVWI
Subjt:  ERWPLMILALVSISTAMVFFMRTSFDSCSGNGNRRFVEENGIDSQIRSSQIERKAPNPLDFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTQVVWI

Query:  TNQKPSEPDEVVYSLERKMLDRGVQIFILKVLSAKGQEAVETALKAHLVVLNTAVAGKWLDAILKENVPRVLPKVLWWIHEMRGNYFKVEYVKHLPFVAG
        TNQKP E DEVVYSLE KMLDRGVQ     V+SAKGQ+AV+T+LKA L+VLNTAVAGKWLDA+LKENV +VLPK+LWWIHEMRG+YF  + VKHLPFVAG
Subjt:  TNQKPSEPDEVVYSLERKMLDRGVQIFILKVLSAKGQEAVETALKAHLVVLNTAVAGKWLDAILKENVPRVLPKVLWWIHEMRGNYFKVEYVKHLPFVAG

Query:  AMIDSHTTAEYWKNRTRERLGIKMPETYVVHLGNSKDLMEVAENNVAKRVLREHIRESLGVRNEDILFAIINSVSRGKGQDLFLRAFHQSLQMIQDKKLR
        AMIDSH TA YWKNRT+ RLGIKMP+TYVVHLGNSK+LMEVAE++VAKRVLREH+RESLGVRNED+LF IINSVSRGKGQDLFLRAFH+SL+ I++KKL+
Subjt:  AMIDSHTTAEYWKNRTRERLGIKMPETYVVHLGNSKDLMEVAENNVAKRVLREHIRESLGVRNEDILFAIINSVSRGKGQDLFLRAFHQSLQMIQDKKLR

Query:  VPRIHAVVVGSDMSAQTKFETELRNFVNENKIQDRVHFVNKTLSVAPYLASIDVLVQNSQGRGECFGRITIEAMAFQLPVLGTAAGGTMEIVVNGTTGLL
        VP +HAVVVGSDMS QTKFETELRNFV E K+++ VHFVNKTL+VAPY+A+IDVLVQNSQ RGECFGRITIEAMAF+LPVLGTAAGGTMEIVVNGTTGLL
Subjt:  VPRIHAVVVGSDMSAQTKFETELRNFVNENKIQDRVHFVNKTLSVAPYLASIDVLVQNSQGRGECFGRITIEAMAFQLPVLGTAAGGTMEIVVNGTTGLL

Query:  HPAGKEGVTPLAHNIVKLATHVERRLTIGKKGYERVRQMFLEQHMGQRIAFV
        H AGKEGV PLA NIVKLAT VE RL +GK GYERV++MFLE HM  RIA V
Subjt:  HPAGKEGVTPLAHNIVKLATHVERRLTIGKKGYERVRQMFLEQHMGQRIAFV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTGGTGTTGGAGTCGTTGTTCCAATTATGAGGAAGAGAAGAGAAGAAGAGAATACTCTGCTGCTAAAACTTGAGGGTTTTCAAATGACATATTTTCCTGGTTTAAG
TAATCAAAGAGACTCCATGTTAGGTGATCAGAAACTTGGTTCTTCTTGCCAAGAACAGCCTCTCTGCTCTGGGAATTTGATGATGTTCATGAATCAAGACCCTTCTTGCA
GAGATTATTCAGAGATTTTTTCTGGGGTTTCTTCAAACTATGTTGAAACTGTGGGAGATAGAAGTAATGTTGAAATGGCTTTCATTCCACCAGTTGTTGGACTTTTAGAT
GACTCAAATTTCCAATGTCAAGGTGTTTCACTTAGCCTTTCCACACACTCCCCCTCTGTTTGTTCTATGTCTTCATTTCCTCACCAATATCAGACCCCTGCTATGCCTTC
TTCATTCATCAATGCACCTCCTTCAATCTTCGAGAAGAGGCCGAATCTGAAACCTTGTATATCTGATGCGAAAAATGGCATGTATATAGCGGTGGGTTCTGGATATTCGA
CGTTGAATTCGGTATATATCGAAGCTGCACAGCAGTTGCTCGACGAAATGGTTAGCATCCGCGAAGCTTTGAAAGAGCTAAAGTCTAAGAAACTCAAAGCTTCAAATGGT
TTGGGTGTTGATACCTGCAGAGAAAATGGTGGGGGATCTAATGATCTGGCTGGGGAGATGTGTTGCGATGCGAGGGAGTCGTCGGTCGCCAATCCGTCGGGTGATCTATC
GCCTACCGAACGTCAGGACTTAAAGAACAAAACCTCAAAGCTTTTGTCCTTGTTGGGTGAGGTGGATCGAAGATACAAGCAATATTACCAACAGCTGCAATATTTGGCGT
CGTCTTTTGACATGGTAGCAGGCCGAGGGGCTGCAGAGTTCTACACTGCACTCGCACACCAAACTATTTCGTGCCACTTTCGCCGCTTGCGAGATGCAATCAATGCACAA
ATTGAGGTCACCAGGAGAACCTTGGGGGAGCAAGATGCTTTAAATTCTGGTCAAGAGGGCATTCCCCGTCTCCGCTTTGTCGACCAGCACCTCAGGCAACAGAGAGCCCT
TCAGCAGCTCGGTGTCACACCACATAGTTGGAGGCCTCAACGAGGGCTTCCCGAAAGCTCCGTGTCGATTCTTCGCGCTTGGCTATTCGAACACTTCCTCCATCCATATC
CCAAGGATTCAGAGAAACTTAAGCTAGCAAGACAAACTGGCCTGACTAGAAACCAAGTAATGTCGTTAATCCTCAATCGAAAACTCGACTTTTTGCCCTCTAAACTCAAA
CGTCTGCTTGCAAATTTGAACGATATTCGTGTTGCAAACTGGTTTATAAATGCGCGTGTTCGGCTATGGAAACCAATGGTAGAGGAGATGTACAAAGAAGAATTTGGAGA
CTCCAACATTGATATCAAATCCTCACCAGAAAATGCATCCAAAGCATCATGGAACAATTCGCTATTCTCCGAAGATAGGGGAGGAGACGAGTTGCACGACAGCACGAAAT
CTGCAGCAAATAACGGCGAGCGAGTCCTCTACCGCCCGTGTGCTACCGACGACATGATCAACTCAAGTAACTTCTCAATGGGAGGAGGAGATGTTTCACTTGCACTGGAG
TTGAAGCATTGTGAAGGTGATGAATTTGGGATGTTTGGGAGTAACAATAGAATGAACAAAGATGAAAAATCAAACCCTAAGAATCGCATTGCCAACTACACTAAGCTAAG
CACACTTTTTCAGTTATTAGTGGCGGAGCCATTGTTCTTTGGATTTTCCTCACTTCTTCTTCTTCCTTTTCCTATTTTCACTTCACTGCAATCTCGTCGCGAAGTGGGGG
AGAGATGGCCGTTGATGATCTTAGCATTGGTTTCAATTTCCACGGCAATGGTTTTCTTCATGAGGACCTCGTTCGATTCCTGTAGTGGTAATGGGAATAGACGTTTTGTG
GAAGAAAATGGTATCGACTCACAGATTCGTTCCTCCCAAATCGAGAGAAAAGCTCCGAATCCTCTTGATTTCATGAAGTCCAAGCTCGTCCTCTTGGTCTCACATGAGCT
CTCTCTTTCTGGTGGGCCTCTACTACTCATGGAGCTTGCATTTTTGTTGAGAGGCGTCGGTACTCAAGTTGTATGGATTACTAATCAAAAGCCATCGGAGCCTGATGAAG
TAGTGTACAGTTTGGAGCGCAAGATGTTGGATCGAGGAGTGCAGATTTTCATTCTCAAGGTTTTGTCTGCTAAGGGACAAGAAGCTGTTGAAACTGCTCTAAAAGCTCAT
TTGGTTGTTCTAAATACTGCTGTTGCCGGGAAATGGTTGGATGCTATCCTTAAGGAGAATGTTCCCCGTGTCCTTCCCAAGGTTCTGTGGTGGATTCATGAGATGCGGGG
GAATTATTTCAAAGTTGAGTATGTGAAGCACCTCCCTTTTGTTGCAGGTGCCATGATTGATTCACATACAACTGCAGAATACTGGAAAAACAGAACACGGGAACGATTGG
GGATTAAAATGCCTGAAACTTATGTCGTGCATCTTGGAAATAGTAAAGACCTTATGGAAGTGGCTGAAAATAATGTGGCCAAGAGGGTTCTTCGAGAGCATATTCGTGAG
TCCCTTGGAGTTCGGAATGAAGATATATTGTTTGCAATTATAAATAGTGTTTCACGCGGGAAAGGTCAGGATTTATTTCTCCGAGCCTTTCATCAGAGCCTGCAAATGAT
CCAAGATAAAAAGCTGCGGGTACCAAGGATACATGCAGTGGTAGTTGGCAGTGACATGAGTGCTCAAACAAAGTTTGAAACAGAACTGCGCAACTTTGTAAATGAGAATA
AAATTCAGGATCGTGTTCATTTTGTCAACAAAACCTTGTCTGTGGCTCCTTATCTAGCTTCTATTGATGTTCTTGTTCAAAACTCTCAGGGTAGGGGAGAATGCTTTGGA
AGGATAACGATTGAAGCCATGGCGTTTCAGCTACCTGTGCTGGGCACGGCTGCTGGAGGAACAATGGAGATCGTAGTGAACGGGACGACGGGTTTGCTGCATCCTGCAGG
GAAAGAAGGCGTAACTCCACTGGCACATAACATCGTAAAGTTAGCAACGCACGTCGAGCGAAGGCTAACCATTGGAAAGAAAGGATATGAGAGGGTGAGGCAAATGTTCC
TGGAACAGCACATGGGGCAAAGAATTGCTTTTGTAGCAGGAAATTATGTCGCACAGATCTTGTCGGTGACGACGATCAATCTGAGGCCAAAAGAGCGAAGAACAACGTCC
TTTCCTTGTGATTCGATTTTTTCAAATACGAAATCTCCGACCAGATCCGTTCTTGTGGACGGTGAGGAAGAAGGCTTTGCTGGATCTGAAATGTTTGTTGTCTCAGTGAA
GCCATTTTTTAGCGTTCTTCTTGTCACTCTTTCTTTGTCCTTGGCGGTGCTGCTCTTGTCGCCGCCCTCTTTGCTTTTCTCCAACACTTCATTCTCATTTACTAATGAAC
GGGGCTCTGTCGCCTGTCGCTCCCCTTTACCTATCATTCCTCCACGGCGGTGGTCTGTTTTGCTTCTTTTTGTGAAATCGAACATATACCTACAGTTCCAGCTGAAAAAC
TTTGGAATAAGGCTACACGGATATATGGAGTGTTCGAAGGATAGTAGAGTGGAGGCCTTGCAAGTGGTGGCTTCAAGACTTTTTGTTGAAATGCTGCAGCTCTGCCCGCT
GATAGCAATGGATATATACGCGTGGATTGCTATGTTGTGTGATGGAGTTGGCATAGCACGGCTGTTGAACGCAACCCTGGTTTTGCCAAAGTTTGAAGTGGCTGCATATT
GGAATGAATCAAGTCAGATGAATGGTTATGTAAAAGTTGTCAAAATGTTACCCCCAGAGTTTGCGTCCAAAGAGCCATATCATGTGGATTGTAGCAAACGCAAAGGACAG
TTTGACTATATTGAAAGTGTCCTTCCGTCCCTATTGGAACATCGTTATATTTCAATAACTCCAGCAATGAGCCAGAGAAGGGACAGGTATCCCCAGTATGCAAAAGCTGC
CCTTTGTCAAGCTTGTTACAATGGTTTGCGCCTAGCTAAATCGGTGGAGGAGAAAGCCAAGGAGCTCCTAGAAGCCATACCAAAACCCTTTCTCTCACTTCATCTTCGTT
TTGAGCCAGATATGGTGGCTTACAGCCAATGTGAATATAAAGGGCTTTCTCCTGCTTCCCTAGAAGCAATCGAAGCGACTCGGGGAGATAGAAAACCATGGACTGGAGAG
TTGGCTGGAATATGGAGAAAGAGAGGGAAATGTCCTCTCACACCCCGTGAGACAGCCCTCATACTTCAAGCACTCCATATTCCAACTAACACAAACATTTACTTGGCTGC
TGGTGATGGTTTGATGGAACTAGAAGGCTTTACATCAGTTTATACCAATGTAGTAACTAAGTCTAGTTTCCTCAGTAGCGACGATTTCTCAAGCATGCATGGGAACACAA
AAGCTGCACTGGATTATTATGTGTCTATTAATAGTGACTCATATGTTGCCACATTCTTTGGAAATATGGACAAGATGGTTGCAGCAATGCGAGCGTTCAACGGAAAGCAG
AGGACATTGTTTTTAAGTAGACGAGCTTTTGCAGATTTCACTTACAAGGGCTTGAAGGGGAAGGAGCTGAACCAAGCCTTGTGGAAGACTCATAGAGATGATTTTGCCAT
GGGGAGGGGATCCGCATTATCCGACTGTTTTTGTGAGTTCAAACTATAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTGGTGTTGGAGTCGTTGTTCCAATTATGAGGAAGAGAAGAGAAGAAGAGAATACTCTGCTGCTAAAACTTGAGGGTTTTCAAATGACATATTTTCCTGGTTTAAG
TAATCAAAGAGACTCCATGTTAGGTGATCAGAAACTTGGTTCTTCTTGCCAAGAACAGCCTCTCTGCTCTGGGAATTTGATGATGTTCATGAATCAAGACCCTTCTTGCA
GAGATTATTCAGAGATTTTTTCTGGGGTTTCTTCAAACTATGTTGAAACTGTGGGAGATAGAAGTAATGTTGAAATGGCTTTCATTCCACCAGTTGTTGGACTTTTAGAT
GACTCAAATTTCCAATGTCAAGGTGTTTCACTTAGCCTTTCCACACACTCCCCCTCTGTTTGTTCTATGTCTTCATTTCCTCACCAATATCAGACCCCTGCTATGCCTTC
TTCATTCATCAATGCACCTCCTTCAATCTTCGAGAAGAGGCCGAATCTGAAACCTTGTATATCTGATGCGAAAAATGGCATGTATATAGCGGTGGGTTCTGGATATTCGA
CGTTGAATTCGGTATATATCGAAGCTGCACAGCAGTTGCTCGACGAAATGGTTAGCATCCGCGAAGCTTTGAAAGAGCTAAAGTCTAAGAAACTCAAAGCTTCAAATGGT
TTGGGTGTTGATACCTGCAGAGAAAATGGTGGGGGATCTAATGATCTGGCTGGGGAGATGTGTTGCGATGCGAGGGAGTCGTCGGTCGCCAATCCGTCGGGTGATCTATC
GCCTACCGAACGTCAGGACTTAAAGAACAAAACCTCAAAGCTTTTGTCCTTGTTGGGTGAGGTGGATCGAAGATACAAGCAATATTACCAACAGCTGCAATATTTGGCGT
CGTCTTTTGACATGGTAGCAGGCCGAGGGGCTGCAGAGTTCTACACTGCACTCGCACACCAAACTATTTCGTGCCACTTTCGCCGCTTGCGAGATGCAATCAATGCACAA
ATTGAGGTCACCAGGAGAACCTTGGGGGAGCAAGATGCTTTAAATTCTGGTCAAGAGGGCATTCCCCGTCTCCGCTTTGTCGACCAGCACCTCAGGCAACAGAGAGCCCT
TCAGCAGCTCGGTGTCACACCACATAGTTGGAGGCCTCAACGAGGGCTTCCCGAAAGCTCCGTGTCGATTCTTCGCGCTTGGCTATTCGAACACTTCCTCCATCCATATC
CCAAGGATTCAGAGAAACTTAAGCTAGCAAGACAAACTGGCCTGACTAGAAACCAAGTAATGTCGTTAATCCTCAATCGAAAACTCGACTTTTTGCCCTCTAAACTCAAA
CGTCTGCTTGCAAATTTGAACGATATTCGTGTTGCAAACTGGTTTATAAATGCGCGTGTTCGGCTATGGAAACCAATGGTAGAGGAGATGTACAAAGAAGAATTTGGAGA
CTCCAACATTGATATCAAATCCTCACCAGAAAATGCATCCAAAGCATCATGGAACAATTCGCTATTCTCCGAAGATAGGGGAGGAGACGAGTTGCACGACAGCACGAAAT
CTGCAGCAAATAACGGCGAGCGAGTCCTCTACCGCCCGTGTGCTACCGACGACATGATCAACTCAAGTAACTTCTCAATGGGAGGAGGAGATGTTTCACTTGCACTGGAG
TTGAAGCATTGTGAAGGTGATGAATTTGGGATGTTTGGGAGTAACAATAGAATGAACAAAGATGAAAAATCAAACCCTAAGAATCGCATTGCCAACTACACTAAGCTAAG
CACACTTTTTCAGTTATTAGTGGCGGAGCCATTGTTCTTTGGATTTTCCTCACTTCTTCTTCTTCCTTTTCCTATTTTCACTTCACTGCAATCTCGTCGCGAAGTGGGGG
AGAGATGGCCGTTGATGATCTTAGCATTGGTTTCAATTTCCACGGCAATGGTTTTCTTCATGAGGACCTCGTTCGATTCCTGTAGTGGTAATGGGAATAGACGTTTTGTG
GAAGAAAATGGTATCGACTCACAGATTCGTTCCTCCCAAATCGAGAGAAAAGCTCCGAATCCTCTTGATTTCATGAAGTCCAAGCTCGTCCTCTTGGTCTCACATGAGCT
CTCTCTTTCTGGTGGGCCTCTACTACTCATGGAGCTTGCATTTTTGTTGAGAGGCGTCGGTACTCAAGTTGTATGGATTACTAATCAAAAGCCATCGGAGCCTGATGAAG
TAGTGTACAGTTTGGAGCGCAAGATGTTGGATCGAGGAGTGCAGATTTTCATTCTCAAGGTTTTGTCTGCTAAGGGACAAGAAGCTGTTGAAACTGCTCTAAAAGCTCAT
TTGGTTGTTCTAAATACTGCTGTTGCCGGGAAATGGTTGGATGCTATCCTTAAGGAGAATGTTCCCCGTGTCCTTCCCAAGGTTCTGTGGTGGATTCATGAGATGCGGGG
GAATTATTTCAAAGTTGAGTATGTGAAGCACCTCCCTTTTGTTGCAGGTGCCATGATTGATTCACATACAACTGCAGAATACTGGAAAAACAGAACACGGGAACGATTGG
GGATTAAAATGCCTGAAACTTATGTCGTGCATCTTGGAAATAGTAAAGACCTTATGGAAGTGGCTGAAAATAATGTGGCCAAGAGGGTTCTTCGAGAGCATATTCGTGAG
TCCCTTGGAGTTCGGAATGAAGATATATTGTTTGCAATTATAAATAGTGTTTCACGCGGGAAAGGTCAGGATTTATTTCTCCGAGCCTTTCATCAGAGCCTGCAAATGAT
CCAAGATAAAAAGCTGCGGGTACCAAGGATACATGCAGTGGTAGTTGGCAGTGACATGAGTGCTCAAACAAAGTTTGAAACAGAACTGCGCAACTTTGTAAATGAGAATA
AAATTCAGGATCGTGTTCATTTTGTCAACAAAACCTTGTCTGTGGCTCCTTATCTAGCTTCTATTGATGTTCTTGTTCAAAACTCTCAGGGTAGGGGAGAATGCTTTGGA
AGGATAACGATTGAAGCCATGGCGTTTCAGCTACCTGTGCTGGGCACGGCTGCTGGAGGAACAATGGAGATCGTAGTGAACGGGACGACGGGTTTGCTGCATCCTGCAGG
GAAAGAAGGCGTAACTCCACTGGCACATAACATCGTAAAGTTAGCAACGCACGTCGAGCGAAGGCTAACCATTGGAAAGAAAGGATATGAGAGGGTGAGGCAAATGTTCC
TGGAACAGCACATGGGGCAAAGAATTGCTTTTGTAGCAGGAAATTATGTCGCACAGATCTTGTCGGTGACGACGATCAATCTGAGGCCAAAAGAGCGAAGAACAACGTCC
TTTCCTTGTGATTCGATTTTTTCAAATACGAAATCTCCGACCAGATCCGTTCTTGTGGACGGTGAGGAAGAAGGCTTTGCTGGATCTGAAATGTTTGTTGTCTCAGTGAA
GCCATTTTTTAGCGTTCTTCTTGTCACTCTTTCTTTGTCCTTGGCGGTGCTGCTCTTGTCGCCGCCCTCTTTGCTTTTCTCCAACACTTCATTCTCATTTACTAATGAAC
GGGGCTCTGTCGCCTGTCGCTCCCCTTTACCTATCATTCCTCCACGGCGGTGGTCTGTTTTGCTTCTTTTTGTGAAATCGAACATATACCTACAGTTCCAGCTGAAAAAC
TTTGGAATAAGGCTACACGGATATATGGAGTGTTCGAAGGATAGTAGAGTGGAGGCCTTGCAAGTGGTGGCTTCAAGACTTTTTGTTGAAATGCTGCAGCTCTGCCCGCT
GATAGCAATGGATATATACGCGTGGATTGCTATGTTGTGTGATGGAGTTGGCATAGCACGGCTGTTGAACGCAACCCTGGTTTTGCCAAAGTTTGAAGTGGCTGCATATT
GGAATGAATCAAGTCAGATGAATGGTTATGTAAAAGTTGTCAAAATGTTACCCCCAGAGTTTGCGTCCAAAGAGCCATATCATGTGGATTGTAGCAAACGCAAAGGACAG
TTTGACTATATTGAAAGTGTCCTTCCGTCCCTATTGGAACATCGTTATATTTCAATAACTCCAGCAATGAGCCAGAGAAGGGACAGGTATCCCCAGTATGCAAAAGCTGC
CCTTTGTCAAGCTTGTTACAATGGTTTGCGCCTAGCTAAATCGGTGGAGGAGAAAGCCAAGGAGCTCCTAGAAGCCATACCAAAACCCTTTCTCTCACTTCATCTTCGTT
TTGAGCCAGATATGGTGGCTTACAGCCAATGTGAATATAAAGGGCTTTCTCCTGCTTCCCTAGAAGCAATCGAAGCGACTCGGGGAGATAGAAAACCATGGACTGGAGAG
TTGGCTGGAATATGGAGAAAGAGAGGGAAATGTCCTCTCACACCCCGTGAGACAGCCCTCATACTTCAAGCACTCCATATTCCAACTAACACAAACATTTACTTGGCTGC
TGGTGATGGTTTGATGGAACTAGAAGGCTTTACATCAGTTTATACCAATGTAGTAACTAAGTCTAGTTTCCTCAGTAGCGACGATTTCTCAAGCATGCATGGGAACACAA
AAGCTGCACTGGATTATTATGTGTCTATTAATAGTGACTCATATGTTGCCACATTCTTTGGAAATATGGACAAGATGGTTGCAGCAATGCGAGCGTTCAACGGAAAGCAG
AGGACATTGTTTTTAAGTAGACGAGCTTTTGCAGATTTCACTTACAAGGGCTTGAAGGGGAAGGAGCTGAACCAAGCCTTGTGGAAGACTCATAGAGATGATTTTGCCAT
GGGGAGGGGATCCGCATTATCCGACTGTTTTTGTGAGTTCAAACTATAATTTCAAGAAACTCAGATTTGTCATTTTTGTATGTAACCACTTTCCCCTTCAAATGCTTTAG
TAAATCTACAATCTTCAAATGTTGAGAATAGATTCTTGCTTACCTGTTAAAATTGTAACCCATCTTTTGATAAAGTAGATCAGGGTCATTTTGATTACATACATTGAACA
AGCATATGTTATTGCATACCAAATATGGAATTTCTATGTTATTTGTAAACAATATTTTCCCTCATCAGGGTTAGCTATGGCTAGGCCTTTTTTTTTTTCTTCTTTGTGAT
TTCATCGATTGGTTATCTTAATCTTGGAAACCATAGATCATCA
Protein sequenceShow/hide protein sequence
MVGVGVVVPIMRKRREEENTLLLKLEGFQMTYFPGLSNQRDSMLGDQKLGSSCQEQPLCSGNLMMFMNQDPSCRDYSEIFSGVSSNYVETVGDRSNVEMAFIPPVVGLLD
DSNFQCQGVSLSLSTHSPSVCSMSSFPHQYQTPAMPSSFINAPPSIFEKRPNLKPCISDAKNGMYIAVGSGYSTLNSVYIEAAQQLLDEMVSIREALKELKSKKLKASNG
LGVDTCRENGGGSNDLAGEMCCDARESSVANPSGDLSPTERQDLKNKTSKLLSLLGEVDRRYKQYYQQLQYLASSFDMVAGRGAAEFYTALAHQTISCHFRRLRDAINAQ
IEVTRRTLGEQDALNSGQEGIPRLRFVDQHLRQQRALQQLGVTPHSWRPQRGLPESSVSILRAWLFEHFLHPYPKDSEKLKLARQTGLTRNQVMSLILNRKLDFLPSKLK
RLLANLNDIRVANWFINARVRLWKPMVEEMYKEEFGDSNIDIKSSPENASKASWNNSLFSEDRGGDELHDSTKSAANNGERVLYRPCATDDMINSSNFSMGGGDVSLALE
LKHCEGDEFGMFGSNNRMNKDEKSNPKNRIANYTKLSTLFQLLVAEPLFFGFSSLLLLPFPIFTSLQSRREVGERWPLMILALVSISTAMVFFMRTSFDSCSGNGNRRFV
EENGIDSQIRSSQIERKAPNPLDFMKSKLVLLVSHELSLSGGPLLLMELAFLLRGVGTQVVWITNQKPSEPDEVVYSLERKMLDRGVQIFILKVLSAKGQEAVETALKAH
LVVLNTAVAGKWLDAILKENVPRVLPKVLWWIHEMRGNYFKVEYVKHLPFVAGAMIDSHTTAEYWKNRTRERLGIKMPETYVVHLGNSKDLMEVAENNVAKRVLREHIRE
SLGVRNEDILFAIINSVSRGKGQDLFLRAFHQSLQMIQDKKLRVPRIHAVVVGSDMSAQTKFETELRNFVNENKIQDRVHFVNKTLSVAPYLASIDVLVQNSQGRGECFG
RITIEAMAFQLPVLGTAAGGTMEIVVNGTTGLLHPAGKEGVTPLAHNIVKLATHVERRLTIGKKGYERVRQMFLEQHMGQRIAFVAGNYVAQILSVTTINLRPKERRTTS
FPCDSIFSNTKSPTRSVLVDGEEEGFAGSEMFVVSVKPFFSVLLVTLSLSLAVLLLSPPSLLFSNTSFSFTNERGSVACRSPLPIIPPRRWSVLLLFVKSNIYLQFQLKN
FGIRLHGYMECSKDSRVEALQVVASRLFVEMLQLCPLIAMDIYAWIAMLCDGVGIARLLNATLVLPKFEVAAYWNESSQMNGYVKVVKMLPPEFASKEPYHVDCSKRKGQ
FDYIESVLPSLLEHRYISITPAMSQRRDRYPQYAKAALCQACYNGLRLAKSVEEKAKELLEAIPKPFLSLHLRFEPDMVAYSQCEYKGLSPASLEAIEATRGDRKPWTGE
LAGIWRKRGKCPLTPRETALILQALHIPTNTNIYLAAGDGLMELEGFTSVYTNVVTKSSFLSSDDFSSMHGNTKAALDYYVSINSDSYVATFFGNMDKMVAAMRAFNGKQ
RTLFLSRRAFADFTYKGLKGKELNQALWKTHRDDFAMGRGSALSDCFCEFKL