CuGenDBv2

Sgr020084 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr020084
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X1
Genome location: tig00153447:211622..224950
RNA-Seq Expression: Sgr020084
Synteny: Sgr020084
Gene Ontology terms: GO:0003677 - DNA binding (molecular function)
InterPro domains: IPR001356 - Homeobox domain
    IPR009057 - Homeobox-like domain superfamily
    IPR039276 - Protein SAWADEE HOMEODOMAIN HOMOLOG 1/2


Homology
GenBank top hits (e value, % identity, alignment)
KAE8650545.1 hypothetical protein Csa_011086 [Cucumis sativus] (e value: 4.5e-106; identity: 55.94%)
Query:  MGRPPSNGGPAFRFTAAEVFLCFLLSSSSSLFSFFFFYIVNDASVMHVRCWSSNFSGSAQLQCLLVAEMEAILQAHNNTMPVREVLVALAEKFSESVERK
        MGRPPSNGGPAFRFTA+E                                               VAEMEAILQ HNNTMP REVLVALA+KFSESVERK
Subjt:  MGRPPSNGGPAFRFTAAEVFLCFLLSSSSSLFSFFFFYIVNDASVMHVRCWSSNFSGSAQLQCLLVAEMEAILQAHNNTMPVREVLVALAEKFSESVERK

Query:  GKIAVQMKQVWNWFQNRRYAIRAKSSKAPGKLAVSPIVQIESTPVRNVPQTIVVPAPAPVGSAKGAPENPYRSLK--------------------LNLGG
        GKIAVQMKQVWNWFQNRRYAIRAK+SKAPGKLAVSP+VQIESTPVRNVPQT+VVPAPAPVGSAKGAPENP    +                    +  G 
Subjt:  GKIAVQMKQVWNWFQNRRYAIRAKSSKAPGKLAVSPIVQIESTPVRNVPQTIVVPAPAPVGSAKGAPENPYRSLK--------------------LNLGG

Query:  MEVLVRFAGFGSRRMSGLISEGTSDLVLYLVNHQNVW-QFFQAISSYAFRRVKSRHFTLMPMCLI-----HKEEDMMYEVVVAVSHELRTLAFDVGKSWE
         EVLVRF+GFGS                     ++ W    + I   +     S    ++P  LI      KE+ + ++  V  +   R         + 
Subjt:  MEVLVRFAGFGSRRMSGLISEGTSDLVLYLVNHQNVW-QFFQAISSYAFRRVKSRHFTLMPMCLI-----HKEEDMMYEVVVAVSHELRTLAFDVGKSWE

Query:  FRVLLMAAAEIVQLRKICRRPETDYRLQQLHAVNEAASVEPPKSGMDSLLLSGQRLNFEATQKPLSKDATMVIPNANTNIDVHAQTSTQEARNTETSSAP
         R     + EIVQLRKICRRPETDYRLQQLHAVNEAAS+EP KSGMDS+LLSGQR+NFE +Q PLSKDA +VIPNAN +I+ HAQTSTQEARNTET++AP
Subjt:  FRVLLMAAAEIVQLRKICRRPETDYRLQQLHAVNEAASVEPPKSGMDSLLLSGQRLNFEATQKPLSKDATMVIPNANTNIDVHAQTSTQEARNTETSSAP

Query:  TTFNSGNPAGSSAFSSGIVTSSVSGGLGDNVSDGKLLS
        TTFNS N AGSSAFSSGIVT++VS G  DNVSDGKLLS
Subjt:  TTFNSGNPAGSSAFSSGIVTSSVSGGLGDNVSDGKLLS

TYK11257.1 protein SAWADEE HOMEODOMAIN-like protein 2 isoform X1 [Cucumis melo var. makuwa] (e value: 1.5e-101; identity: 53.67%)
Query:  MGRPPSNGGPAFRFTAAEVFLCFLLSSSSSLFSFFFFYIVNDASVMHVRCWSSNFSGSAQLQCLLVAEMEAILQAHNNTMPVREVLVALAEKFSESVERK
        MGRPPSNGGPAFRFTA+E                                               VAEME ILQ HNNTMP REVLVALA+KFSESVERK
Subjt:  MGRPPSNGGPAFRFTAAEVFLCFLLSSSSSLFSFFFFYIVNDASVMHVRCWSSNFSGSAQLQCLLVAEMEAILQAHNNTMPVREVLVALAEKFSESVERK

Query:  GKIAVQMKQVWNWFQNRRYAIRAKSSKAPGKLAVSPIVQIESTPVRNVPQTIVVPAPAPVGSAKGAPENPYRSLK--------------------LNLGG
        GKIAVQMKQVWNWFQNRRYAIRAK+SKAPGKLAVSP+VQIESTPVRNVPQT+VVPAP PVG+AK APENP    +                    +  G 
Subjt:  GKIAVQMKQVWNWFQNRRYAIRAKSSKAPGKLAVSPIVQIESTPVRNVPQTIVVPAPAPVGSAKGAPENPYRSLK--------------------LNLGG

Query:  MEVLVRFAGFGSRRMSGLISEGTSDLVLYLVNHQNVW-QFFQAISSYAFRRVKSRHFTLMPMCLI-----HKEEDMMYEVVVAVSHELRTLAFDVGKSWE
         EVLVRF+GFGS                     ++ W    + I   +     S    ++P  LI      KE+ + ++  V  +   R         + 
Subjt:  MEVLVRFAGFGSRRMSGLISEGTSDLVLYLVNHQNVW-QFFQAISSYAFRRVKSRHFTLMPMCLI-----HKEEDMMYEVVVAVSHELRTLAFDVGKSWE

Query:  FRVLLMAAAEIVQLRKICRRPETDYRLQQLHAVNEAASVEPPKSGMDSLLLSGQRLNFEATQKPLSKDATMVIPNANTNIDVHAQTSTQEARNTETS---
         R     + EIVQLRKICRRPETDYRLQQLHAVNEAAS+EP KSGMDS+LLSGQR+NFE  Q PLSKDA +VIPNAN + + HAQTSTQEARNTET+   
Subjt:  FRVLLMAAAEIVQLRKICRRPETDYRLQQLHAVNEAASVEPPKSGMDSLLLSGQRLNFEATQKPLSKDATMVIPNANTNIDVHAQTSTQEARNTETS---

Query:  --------SAPTTFNSGNPAGSSAFSSGIVTSSVSGGLGDNVSDGKLLS
                +APTTFNS N AGSSAFSSGIVT++VSGG  DNVSDGKLLS
Subjt:  --------SAPTTFNSGNPAGSSAFSSGIVTSSVSGGLGDNVSDGKLLS

XP_008456010.1 PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X1 [Cucumis melo] (e value: 5.1e-102; identity: 53.9%)
Query:  MGRPPSNGGPAFRFTAAEVFLCFLLSSSSSLFSFFFFYIVNDASVMHVRCWSSNFSGSAQLQCLLVAEMEAILQAHNNTMPVREVLVALAEKFSESVERK
        MGRPPSNGGPAFRFTA+E                                               VAEME ILQ HNNTMP REVLVALA+KFSESVERK
Subjt:  MGRPPSNGGPAFRFTAAEVFLCFLLSSSSSLFSFFFFYIVNDASVMHVRCWSSNFSGSAQLQCLLVAEMEAILQAHNNTMPVREVLVALAEKFSESVERK

Query:  GKIAVQMKQVWNWFQNRRYAIRAKSSKAPGKLAVSPIVQIESTPVRNVPQTIVVPAPAPVGSAKGAPENPYRSLK--------------------LNLGG
        GKIAVQMKQVWNWFQNRRYAIRAK+SKAPGKLAVSP+VQIESTPVRNVPQT+VVPAP PVG+AK APENP    +                    +  G 
Subjt:  GKIAVQMKQVWNWFQNRRYAIRAKSSKAPGKLAVSPIVQIESTPVRNVPQTIVVPAPAPVGSAKGAPENPYRSLK--------------------LNLGG

Query:  MEVLVRFAGFGSRRMSGLISEGTSDLVLYLVNHQNVW-QFFQAISSYAFRRVKSRHFTLMPMCLI-----HKEEDMMYEVVVAVSHELRTLAFDVGKSWE
         EVLVRF+GFGS                     ++ W    + I   +     S    ++P  LI      KE+ + ++  V  +   R         + 
Subjt:  MEVLVRFAGFGSRRMSGLISEGTSDLVLYLVNHQNVW-QFFQAISSYAFRRVKSRHFTLMPMCLI-----HKEEDMMYEVVVAVSHELRTLAFDVGKSWE

Query:  FRVLLMAAAEIVQLRKICRRPETDYRLQQLHAVNEAASVEPPKSGMDSLLLSGQRLNFEATQKPLSKDATMVIPNANTNIDVHAQTSTQEARNTETS---
         R     + EIVQLRKICRRPETDYRLQQLHAVNEAAS+EP KSGMDS+LLSGQR+NFE  Q PLSKDA +VIPNAN +I+ HAQTSTQEARNTET+   
Subjt:  FRVLLMAAAEIVQLRKICRRPETDYRLQQLHAVNEAASVEPPKSGMDSLLLSGQRLNFEATQKPLSKDATMVIPNANTNIDVHAQTSTQEARNTETS---

Query:  --------SAPTTFNSGNPAGSSAFSSGIVTSSVSGGLGDNVSDGKLLS
                +APTTFNS N AGSSAFSSGIVT++VSGG  DNVSDGKLLS
Subjt:  --------SAPTTFNSGNPAGSSAFSSGIVTSSVSGGLGDNVSDGKLLS

XP_022142790.1 protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X1 [Momordica charantia] (e value: 1.8e-107; identity: 56.85%)
Query:  MGRPPSNGGPAFRFTAAEVFLCFLLSSSSSLFSFFFFYIVNDASVMHVRCWSSNFSGSAQLQCLLVAEMEAILQAHNNTMPVREVLVALAEKFSESVERK
        MGRPPSNGGPAFRFTAAE                                               VAEMEAILQ HNNTMP REVLVALAEKFSES+ERK
Subjt:  MGRPPSNGGPAFRFTAAEVFLCFLLSSSSSLFSFFFFYIVNDASVMHVRCWSSNFSGSAQLQCLLVAEMEAILQAHNNTMPVREVLVALAEKFSESVERK

Query:  GKIAVQMKQVWNWFQNRRYAIRAKSSKAPGKLAVSPIVQIESTPVRNVPQTIVVPAPAPVGSAKGAPENPYRSLK--------------------LNLGG
        GKIAVQMKQVWNWFQNRRYAIRAKS+KAPGKLAVSPIVQIESTPVRNVPQ+IVVPAPAPVGS KGAP+NP    +                    +  G 
Subjt:  GKIAVQMKQVWNWFQNRRYAIRAKSSKAPGKLAVSPIVQIESTPVRNVPQTIVVPAPAPVGSAKGAPENPYRSLK--------------------LNLGG

Query:  MEVLVRFAGFGSRRMSGLISEGTSDLVLYLVNHQNVW-QFFQAISSYAFRRVKSRHFTLMPMCLI-----HKEEDMMYEVVVAVSHELRTLAFDVGKSWE
         EVLVRFAGFGS                     ++ W    + I   +     S    ++P  LI      KE+ + ++  V  +   R         + 
Subjt:  MEVLVRFAGFGSRRMSGLISEGTSDLVLYLVNHQNVW-QFFQAISSYAFRRVKSRHFTLMPMCLI-----HKEEDMMYEVVVAVSHELRTLAFDVGKSWE

Query:  FRVLLMAAAEIVQLRKICRRPETDYRLQQLHAVNEAASVEPPKSGMDSLLLSGQRLNFEATQKPLSKDATMVIPNANTNIDVHAQTSTQEARNTETSSAP
         R     + EIVQLRKICRRPETDYRLQQLHAVNEAAS+EPPKSGMDS+LLSG RLNFE TQKPL KDATMV PNAN N++V AQT TQE RN ETSS P
Subjt:  FRVLLMAAAEIVQLRKICRRPETDYRLQQLHAVNEAASVEPPKSGMDSLLLSGQRLNFEATQKPLSKDATMVIPNANTNIDVHAQTSTQEARNTETSSAP

Query:  TTFNSGNPAGSSAFSSGIVTSSVSGGLGDNVSDGKLLS
         +FNSGNPAGSSAF SGI T+SVSGGLGDNVSDGKLLS
Subjt:  TTFNSGNPAGSSAFSSGIVTSSVSGGLGDNVSDGKLLS

XP_038878066.1 protein SAWADEE HOMEODOMAIN HOMOLOG 2-like isoform X1 [Benincasa hispida] (e value: 5.9e-106; identity: 55.46%)
Query:  MGRPPSNGGPAFRFTAAEVFLCFLLSSSSSLFSFFFFYIVNDASVMHVRCWSSNFSGSAQLQCLLVAEMEAILQAHNNTMPVREVLVALAEKFSESVERK
        MGRPPSNGGPAFRFTA+E                                               VAEMEAILQ HNNTMP REVLVALAEKFSESVERK
Subjt:  MGRPPSNGGPAFRFTAAEVFLCFLLSSSSSLFSFFFFYIVNDASVMHVRCWSSNFSGSAQLQCLLVAEMEAILQAHNNTMPVREVLVALAEKFSESVERK

Query:  GKIAVQMKQVWNWFQNRRYAIRAKSSKAPGKLAVSPIVQIESTPVRNVPQTIVVPAPAPVGSAKGAPENPYRSLK--------------------LNLGG
        GKIAVQMKQVWNWFQNRRYAIRAK++KAPGKLAVSPIVQIESTPVRNVPQTIVVPAP PVGSAKGAPENP    +                    +  G 
Subjt:  GKIAVQMKQVWNWFQNRRYAIRAKSSKAPGKLAVSPIVQIESTPVRNVPQTIVVPAPAPVGSAKGAPENPYRSLK--------------------LNLGG

Query:  MEVLVRFAGFGSRRMSGLISEGTSDLVLYLVNHQNVW-QFFQAISSYAFRRVKSRHFTLMPMCLI-----HKEEDMMYEVVVAVSHELRTLAFDVGKSWE
         EVLVRF+GFGS                     ++ W    + I   +     S    ++P  LI      KE+ + ++  V  +   R         + 
Subjt:  MEVLVRFAGFGSRRMSGLISEGTSDLVLYLVNHQNVW-QFFQAISSYAFRRVKSRHFTLMPMCLI-----HKEEDMMYEVVVAVSHELRTLAFDVGKSWE

Query:  FRVLLMAAAEIVQLRKICRRPETDYRLQQLHAVNEAASVEPPKSGMDSLLLSGQRLNFEATQKPLSKDATMVIPNANTNIDVHAQTSTQEARNTETS---
         R     + EIVQLRKICRRPETDYRLQQLHAVNEAAS+EP KS MDS+LLSGQR+NFE TQKPL+KD T+VIPNAN NI+VHAQT+TQEARNTET+   
Subjt:  FRVLLMAAAEIVQLRKICRRPETDYRLQQLHAVNEAASVEPPKSGMDSLLLSGQRLNFEATQKPLSKDATMVIPNANTNIDVHAQTSTQEARNTETS---

Query:  --------SAPTTFNSGNPAGSSAFSSGIVTSSVSGGLGDNVSDGKLLS
                SAPTTFNSGNPAG SAFS GIVT++VSGG  DNVSDGKLLS
Subjt:  --------SAPTTFNSGNPAGSSAFSSGIVTSSVSGGLGDNVSDGKLLS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LC67 SAWADEE domain-containing protein (e value: 2.0e-99; identity: 54.57%)
Query:  MGRPPSNGGPAFRFTAAEVFLCFLLSSSSSLFSFFFFYIVNDASVMHVRCWSSNFSGSAQLQCLLVAEMEAILQAHNNTMPVREVLVALAEKFSESVERK
        MGRPPSNGGPAFRFTA+E                                               VAEMEAILQ HNNTMP REVLVALA+KFSESVERK
Subjt:  MGRPPSNGGPAFRFTAAEVFLCFLLSSSSSLFSFFFFYIVNDASVMHVRCWSSNFSGSAQLQCLLVAEMEAILQAHNNTMPVREVLVALAEKFSESVERK

Query:  GKIAVQMKQVWNWFQNRRYAIRAKSSKAPGKLAVSPIVQIESTPVRNVPQTIVVPAPAPVGSAKGAPENPYRSLK--------------------LNLGG
        GKIAVQMK      QNRRYAIRAK+SKAPGKLAVSP+VQIESTPVRNVPQT+VVPAPAPVGSAKGAPENP    +                    +  G 
Subjt:  GKIAVQMKQVWNWFQNRRYAIRAKSSKAPGKLAVSPIVQIESTPVRNVPQTIVVPAPAPVGSAKGAPENPYRSLK--------------------LNLGG

Query:  MEVLVRFAGFGSRRMSGLISEGTSDLVLYLVNHQNVW-QFFQAISSYAFRRVKSRHFTLMPMCLI-----HKEEDMMYEVVVAVSHELRTLAFDVGKSWE
         EVLVRF+GFGS                     ++ W    + I   +     S    ++P  LI      KE+ + ++  V  +   R         + 
Subjt:  MEVLVRFAGFGSRRMSGLISEGTSDLVLYLVNHQNVW-QFFQAISSYAFRRVKSRHFTLMPMCLI-----HKEEDMMYEVVVAVSHELRTLAFDVGKSWE

Query:  FRVLLMAAAEIVQLRKICRRPETDYRLQQLHAVNEAASVEPPKSGMDSLLLSGQRLNFEATQKPLSKDATMVIPNANTNIDVHAQTSTQEARNTETSSAP
         R     + EIVQLRKICRRPETDYRLQQLHAVNEAAS+EP KSGMDS+LLSGQR+NFE +Q PLSKDA +VIPNAN +I+ HAQTSTQEARNTET++AP
Subjt:  FRVLLMAAAEIVQLRKICRRPETDYRLQQLHAVNEAASVEPPKSGMDSLLLSGQRLNFEATQKPLSKDATMVIPNANTNIDVHAQTSTQEARNTETSSAP

Query:  TTFNSGNPAGSSAFSSGIVTSSVSGGLGDNVSDGKLLS
        TTFNS N AGSSAFSSGIVT++VS G  DNVSDGKLLS
Subjt:  TTFNSGNPAGSSAFSSGIVTSSVSGGLGDNVSDGKLLS

A0A1S3C274 protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X1 (e value: 2.5e-102; identity: 53.9%)
Query:  MGRPPSNGGPAFRFTAAEVFLCFLLSSSSSLFSFFFFYIVNDASVMHVRCWSSNFSGSAQLQCLLVAEMEAILQAHNNTMPVREVLVALAEKFSESVERK
        MGRPPSNGGPAFRFTA+E                                               VAEME ILQ HNNTMP REVLVALA+KFSESVERK
Subjt:  MGRPPSNGGPAFRFTAAEVFLCFLLSSSSSLFSFFFFYIVNDASVMHVRCWSSNFSGSAQLQCLLVAEMEAILQAHNNTMPVREVLVALAEKFSESVERK

Query:  GKIAVQMKQVWNWFQNRRYAIRAKSSKAPGKLAVSPIVQIESTPVRNVPQTIVVPAPAPVGSAKGAPENPYRSLK--------------------LNLGG
        GKIAVQMKQVWNWFQNRRYAIRAK+SKAPGKLAVSP+VQIESTPVRNVPQT+VVPAP PVG+AK APENP    +                    +  G 
Subjt:  GKIAVQMKQVWNWFQNRRYAIRAKSSKAPGKLAVSPIVQIESTPVRNVPQTIVVPAPAPVGSAKGAPENPYRSLK--------------------LNLGG

Query:  MEVLVRFAGFGSRRMSGLISEGTSDLVLYLVNHQNVW-QFFQAISSYAFRRVKSRHFTLMPMCLI-----HKEEDMMYEVVVAVSHELRTLAFDVGKSWE
         EVLVRF+GFGS                     ++ W    + I   +     S    ++P  LI      KE+ + ++  V  +   R         + 
Subjt:  MEVLVRFAGFGSRRMSGLISEGTSDLVLYLVNHQNVW-QFFQAISSYAFRRVKSRHFTLMPMCLI-----HKEEDMMYEVVVAVSHELRTLAFDVGKSWE

Query:  FRVLLMAAAEIVQLRKICRRPETDYRLQQLHAVNEAASVEPPKSGMDSLLLSGQRLNFEATQKPLSKDATMVIPNANTNIDVHAQTSTQEARNTETS---
         R     + EIVQLRKICRRPETDYRLQQLHAVNEAAS+EP KSGMDS+LLSGQR+NFE  Q PLSKDA +VIPNAN +I+ HAQTSTQEARNTET+   
Subjt:  FRVLLMAAAEIVQLRKICRRPETDYRLQQLHAVNEAASVEPPKSGMDSLLLSGQRLNFEATQKPLSKDATMVIPNANTNIDVHAQTSTQEARNTETS---

Query:  --------SAPTTFNSGNPAGSSAFSSGIVTSSVSGGLGDNVSDGKLLS
                +APTTFNS N AGSSAFSSGIVT++VSGG  DNVSDGKLLS
Subjt:  --------SAPTTFNSGNPAGSSAFSSGIVTSSVSGGLGDNVSDGKLLS

A0A5A7UUV7 SAWADEE HOMEODOMAIN-like protein 2 isoform X1 (e value: 2.5e-102; identity: 53.9%)
Query:  MGRPPSNGGPAFRFTAAEVFLCFLLSSSSSLFSFFFFYIVNDASVMHVRCWSSNFSGSAQLQCLLVAEMEAILQAHNNTMPVREVLVALAEKFSESVERK
        MGRPPSNGGPAFRFTA+E                                               VAEME ILQ HNNTMP REVLVALA+KFSESVERK
Subjt:  MGRPPSNGGPAFRFTAAEVFLCFLLSSSSSLFSFFFFYIVNDASVMHVRCWSSNFSGSAQLQCLLVAEMEAILQAHNNTMPVREVLVALAEKFSESVERK

Query:  GKIAVQMKQVWNWFQNRRYAIRAKSSKAPGKLAVSPIVQIESTPVRNVPQTIVVPAPAPVGSAKGAPENPYRSLK--------------------LNLGG
        GKIAVQMKQVWNWFQNRRYAIRAK+SKAPGKLAVSP+VQIESTPVRNVPQT+VVPAP PVG+AK APENP    +                    +  G 
Subjt:  GKIAVQMKQVWNWFQNRRYAIRAKSSKAPGKLAVSPIVQIESTPVRNVPQTIVVPAPAPVGSAKGAPENPYRSLK--------------------LNLGG

Query:  MEVLVRFAGFGSRRMSGLISEGTSDLVLYLVNHQNVW-QFFQAISSYAFRRVKSRHFTLMPMCLI-----HKEEDMMYEVVVAVSHELRTLAFDVGKSWE
         EVLVRF+GFGS                     ++ W    + I   +     S    ++P  LI      KE+ + ++  V  +   R         + 
Subjt:  MEVLVRFAGFGSRRMSGLISEGTSDLVLYLVNHQNVW-QFFQAISSYAFRRVKSRHFTLMPMCLI-----HKEEDMMYEVVVAVSHELRTLAFDVGKSWE

Query:  FRVLLMAAAEIVQLRKICRRPETDYRLQQLHAVNEAASVEPPKSGMDSLLLSGQRLNFEATQKPLSKDATMVIPNANTNIDVHAQTSTQEARNTETS---
         R     + EIVQLRKICRRPETDYRLQQLHAVNEAAS+EP KSGMDS+LLSGQR+NFE  Q PLSKDA +VIPNAN +I+ HAQTSTQEARNTET+   
Subjt:  FRVLLMAAAEIVQLRKICRRPETDYRLQQLHAVNEAASVEPPKSGMDSLLLSGQRLNFEATQKPLSKDATMVIPNANTNIDVHAQTSTQEARNTETS---

Query:  --------SAPTTFNSGNPAGSSAFSSGIVTSSVSGGLGDNVSDGKLLS
                +APTTFNS N AGSSAFSSGIVT++VSGG  DNVSDGKLLS
Subjt:  --------SAPTTFNSGNPAGSSAFSSGIVTSSVSGGLGDNVSDGKLLS

A0A5D3CH38 Protein SAWADEE HOMEODOMAIN-like protein 2 isoform X1 (e value: 7.2e-102; identity: 53.67%)
Query:  MGRPPSNGGPAFRFTAAEVFLCFLLSSSSSLFSFFFFYIVNDASVMHVRCWSSNFSGSAQLQCLLVAEMEAILQAHNNTMPVREVLVALAEKFSESVERK
        MGRPPSNGGPAFRFTA+E                                               VAEME ILQ HNNTMP REVLVALA+KFSESVERK
Subjt:  MGRPPSNGGPAFRFTAAEVFLCFLLSSSSSLFSFFFFYIVNDASVMHVRCWSSNFSGSAQLQCLLVAEMEAILQAHNNTMPVREVLVALAEKFSESVERK

Query:  GKIAVQMKQVWNWFQNRRYAIRAKSSKAPGKLAVSPIVQIESTPVRNVPQTIVVPAPAPVGSAKGAPENPYRSLK--------------------LNLGG
        GKIAVQMKQVWNWFQNRRYAIRAK+SKAPGKLAVSP+VQIESTPVRNVPQT+VVPAP PVG+AK APENP    +                    +  G 
Subjt:  GKIAVQMKQVWNWFQNRRYAIRAKSSKAPGKLAVSPIVQIESTPVRNVPQTIVVPAPAPVGSAKGAPENPYRSLK--------------------LNLGG

Query:  MEVLVRFAGFGSRRMSGLISEGTSDLVLYLVNHQNVW-QFFQAISSYAFRRVKSRHFTLMPMCLI-----HKEEDMMYEVVVAVSHELRTLAFDVGKSWE
         EVLVRF+GFGS                     ++ W    + I   +     S    ++P  LI      KE+ + ++  V  +   R         + 
Subjt:  MEVLVRFAGFGSRRMSGLISEGTSDLVLYLVNHQNVW-QFFQAISSYAFRRVKSRHFTLMPMCLI-----HKEEDMMYEVVVAVSHELRTLAFDVGKSWE

Query:  FRVLLMAAAEIVQLRKICRRPETDYRLQQLHAVNEAASVEPPKSGMDSLLLSGQRLNFEATQKPLSKDATMVIPNANTNIDVHAQTSTQEARNTETS---
         R     + EIVQLRKICRRPETDYRLQQLHAVNEAAS+EP KSGMDS+LLSGQR+NFE  Q PLSKDA +VIPNAN + + HAQTSTQEARNTET+   
Subjt:  FRVLLMAAAEIVQLRKICRRPETDYRLQQLHAVNEAASVEPPKSGMDSLLLSGQRLNFEATQKPLSKDATMVIPNANTNIDVHAQTSTQEARNTETS---

Query:  --------SAPTTFNSGNPAGSSAFSSGIVTSSVSGGLGDNVSDGKLLS
                +APTTFNS N AGSSAFSSGIVT++VSGG  DNVSDGKLLS
Subjt:  --------SAPTTFNSGNPAGSSAFSSGIVTSSVSGGLGDNVSDGKLLS

A0A6J1CLX5 protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X1 (e value: 8.8e-108; identity: 56.85%)
Query:  MGRPPSNGGPAFRFTAAEVFLCFLLSSSSSLFSFFFFYIVNDASVMHVRCWSSNFSGSAQLQCLLVAEMEAILQAHNNTMPVREVLVALAEKFSESVERK
        MGRPPSNGGPAFRFTAAE                                               VAEMEAILQ HNNTMP REVLVALAEKFSES+ERK
Subjt:  MGRPPSNGGPAFRFTAAEVFLCFLLSSSSSLFSFFFFYIVNDASVMHVRCWSSNFSGSAQLQCLLVAEMEAILQAHNNTMPVREVLVALAEKFSESVERK

Query:  GKIAVQMKQVWNWFQNRRYAIRAKSSKAPGKLAVSPIVQIESTPVRNVPQTIVVPAPAPVGSAKGAPENPYRSLK--------------------LNLGG
        GKIAVQMKQVWNWFQNRRYAIRAKS+KAPGKLAVSPIVQIESTPVRNVPQ+IVVPAPAPVGS KGAP+NP    +                    +  G 
Subjt:  GKIAVQMKQVWNWFQNRRYAIRAKSSKAPGKLAVSPIVQIESTPVRNVPQTIVVPAPAPVGSAKGAPENPYRSLK--------------------LNLGG

Query:  MEVLVRFAGFGSRRMSGLISEGTSDLVLYLVNHQNVW-QFFQAISSYAFRRVKSRHFTLMPMCLI-----HKEEDMMYEVVVAVSHELRTLAFDVGKSWE
         EVLVRFAGFGS                     ++ W    + I   +     S    ++P  LI      KE+ + ++  V  +   R         + 
Subjt:  MEVLVRFAGFGSRRMSGLISEGTSDLVLYLVNHQNVW-QFFQAISSYAFRRVKSRHFTLMPMCLI-----HKEEDMMYEVVVAVSHELRTLAFDVGKSWE

Query:  FRVLLMAAAEIVQLRKICRRPETDYRLQQLHAVNEAASVEPPKSGMDSLLLSGQRLNFEATQKPLSKDATMVIPNANTNIDVHAQTSTQEARNTETSSAP
         R     + EIVQLRKICRRPETDYRLQQLHAVNEAAS+EPPKSGMDS+LLSG RLNFE TQKPL KDATMV PNAN N++V AQT TQE RN ETSS P
Subjt:  FRVLLMAAAEIVQLRKICRRPETDYRLQQLHAVNEAASVEPPKSGMDSLLLSGQRLNFEATQKPLSKDATMVIPNANTNIDVHAQTSTQEARNTETSSAP

Query:  TTFNSGNPAGSSAFSSGIVTSSVSGGLGDNVSDGKLLS
         +FNSGNPAGSSAF SGI T+SVSGGLGDNVSDGKLLS
Subjt:  TTFNSGNPAGSSAFSSGIVTSSVSGGLGDNVSDGKLLS

SwissProt top hits (e value, % identity, alignment)
Q8RWJ7 Protein SAWADEE HOMEODOMAIN HOMOLOG 2 (e value: 4.3e-35; identity: 34.44%)
Query:  MGRPPSNGGPAFRFTAAEVFLCFLLSSSSSLFSFFFFYIVNDASVMHVRCWSSNFSGSAQLQCLLVAEMEAILQAHNNTMPVREVLVALAEKFSESVERK
        MGRPPSNGGPAFRF   E                                               V EMEAIL  HN  MP R +L ALA+KFSES ERK
Subjt:  MGRPPSNGGPAFRFTAAEVFLCFLLSSSSSLFSFFFFYIVNDASVMHVRCWSSNFSGSAQLQCLLVAEMEAILQAHNNTMPVREVLVALAEKFSESVERK

Query:  GKIAVQMKQVWNWFQNRRYAIRAKSSKAPGKLAVSPIVQIE-STPVRNVPQTIVVP------------APAPVGS------------------AKGAPEN
        GK+ VQ KQ+WNWFQNRRYA+RA+ +KAPGKL VS + +++    +R+V Q + VP             PAP GS                  AK A + 
Subjt:  GKIAVQMKQVWNWFQNRRYAIRAKSSKAPGKLAVSPIVQIE-STPVRNVPQTIVVP------------APAPVGS------------------AKGAPEN

Query:  PYRSLK-------LNLGGMEVLVRFAGFGSR-----------RMSGLISEGT-------SDLVLYLVNHQNVWQFFQAISSYAFRRVKSRHFTLMPMCLI
         +  ++       L +G  EV VRFAGF              R   L  E +        DLVL     ++   +F AI   A RR   RH      C  
Subjt:  PYRSLK-------LNLGGMEVLVRFAGFGSR-----------RMSGLISEGT-------SDLVLYLVNHQNVWQFFQAISSYAFRRVKSRHFTLMPMCLI

Query:  HKEEDMMYEVVVAVSHELRTLAFDVGKSWEFRVLLMAAAEIVQLRKICRRPETDYRLQQLH-AVNEAASVEPPK------SGMDSLLLSGQRLNFEATQK
                  +V  SH+                    + EIV LRKICRRPETDYRLQQLH AVN+ A+    +      +    L L G  +   A   
Subjt:  HKEEDMMYEVVVAVSHELRTLAFDVGKSWEFRVLLMAAAEIVQLRKICRRPETDYRLQQLH-AVNEAASVEPPK------SGMDSLLLSGQRLNFEATQK

Query:  PLSKD-------ATMVIPNAN
        P SKD       AT+V P++N
Subjt:  PLSKD-------ATMVIPNAN

Arabidopsis top hits (e value, % identity, alignment)
AT3G18380.1 sequence-specific DNA binding transcription factors; sequence-specific DNA binding (e value: 3.1e-36; identity: 34.44%)
Query:  MGRPPSNGGPAFRFTAAEVFLCFLLSSSSSLFSFFFFYIVNDASVMHVRCWSSNFSGSAQLQCLLVAEMEAILQAHNNTMPVREVLVALAEKFSESVERK
        MGRPPSNGGPAFRF   E                                               V EMEAIL  HN  MP R +L ALA+KFSES ERK
Subjt:  MGRPPSNGGPAFRFTAAEVFLCFLLSSSSSLFSFFFFYIVNDASVMHVRCWSSNFSGSAQLQCLLVAEMEAILQAHNNTMPVREVLVALAEKFSESVERK

Query:  GKIAVQMKQVWNWFQNRRYAIRAKSSKAPGKLAVSPIVQIE-STPVRNVPQTIVVP------------APAPVGS------------------AKGAPEN
        GK+ VQ KQ+WNWFQNRRYA+RA+ +KAPGKL VS + +++    +R+V Q + VP             PAP GS                  AK A + 
Subjt:  GKIAVQMKQVWNWFQNRRYAIRAKSSKAPGKLAVSPIVQIE-STPVRNVPQTIVVP------------APAPVGS------------------AKGAPEN

Query:  PYRSLK-------LNLGGMEVLVRFAGFGSR-----------RMSGLISEGT-------SDLVLYLVNHQNVWQFFQAISSYAFRRVKSRHFTLMPMCLI
         +  ++       L +G  EV VRFAGF              R   L  E +        DLVL     ++   +F AI   A RR   RH      C  
Subjt:  PYRSLK-------LNLGGMEVLVRFAGFGSR-----------RMSGLISEGT-------SDLVLYLVNHQNVWQFFQAISSYAFRRVKSRHFTLMPMCLI

Query:  HKEEDMMYEVVVAVSHELRTLAFDVGKSWEFRVLLMAAAEIVQLRKICRRPETDYRLQQLH-AVNEAASVEPPK------SGMDSLLLSGQRLNFEATQK
                  +V  SH+                    + EIV LRKICRRPETDYRLQQLH AVN+ A+    +      +    L L G  +   A   
Subjt:  HKEEDMMYEVVVAVSHELRTLAFDVGKSWEFRVLLMAAAEIVQLRKICRRPETDYRLQQLH-AVNEAASVEPPK------SGMDSLLLSGQRLNFEATQK

Query:  PLSKD-------ATMVIPNAN
        P SKD       AT+V P++N
Subjt:  PLSKD-------ATMVIPNAN

AT3G18380.2 sequence-specific DNA binding transcription factors; sequence-specific DNA binding (e value: 2.3e-36; identity: 34.44%)
Query:  MGRPPSNGGPAFRFTAAEVFLCFLLSSSSSLFSFFFFYIVNDASVMHVRCWSSNFSGSAQLQCLLVAEMEAILQAHNNTMPVREVLVALAEKFSESVERK
        MGRPPSNGGPAFRF   E                                               V EMEAIL  HN  MP R +L ALA+KFSES ERK
Subjt:  MGRPPSNGGPAFRFTAAEVFLCFLLSSSSSLFSFFFFYIVNDASVMHVRCWSSNFSGSAQLQCLLVAEMEAILQAHNNTMPVREVLVALAEKFSESVERK

Query:  GKIAVQMKQVWNWFQNRRYAIRAKSSKAPGKLAVSPIVQIE-STPVRNVPQTIVVP------------APAPVGS------------------AKGAPEN
        GK+ VQ KQ+WNWFQNRRYA+RA+ +KAPGKL VS + +++    +R+V Q + VP             PAP GS                  AK A + 
Subjt:  GKIAVQMKQVWNWFQNRRYAIRAKSSKAPGKLAVSPIVQIE-STPVRNVPQTIVVP------------APAPVGS------------------AKGAPEN

Query:  PYRSLK-------LNLGGMEVLVRFAGFGSR-----------RMSGLISEGT-------SDLVLYLVNHQNVWQFFQAISSYAFRRVKSRHFTLMPMCLI
         +  ++       L +G  EV VRFAGF              R   L  E +        DLVL     ++   +F AI   A RR   RH      C  
Subjt:  PYRSLK-------LNLGGMEVLVRFAGFGSR-----------RMSGLISEGT-------SDLVLYLVNHQNVWQFFQAISSYAFRRVKSRHFTLMPMCLI

Query:  HKEEDMMYEVVVAVSHELRTLAFDVGKSWEFRVLLMAAAEIVQLRKICRRPETDYRLQQLH-AVNEAASVEPPK------SGMDSLLLSGQRLNFEATQK
                  +V  SH+                   +  EIV LRKICRRPETDYRLQQLH AVN+ A+    +      +    L L G  +   A   
Subjt:  HKEEDMMYEVVVAVSHELRTLAFDVGKSWEFRVLLMAAAEIVQLRKICRRPETDYRLQQLH-AVNEAASVEPPK------SGMDSLLLSGQRLNFEATQK

Query:  PLSKD-------ATMVIPNAN
        P SKD       AT+V P++N
Subjt:  PLSKD-------ATMVIPNAN

AT3G18380.3 sequence-specific DNA binding transcription factors; sequence-specific DNA binding (e value: 3.1e-36; identity: 33.97%)
Query:  MGRPPSNGGPAFRFTAAEVFLCFLLSSSSSLFSFFFFYIVNDASVMHVRCWSSNFSGSAQLQCLLVAEMEAILQAHNNTMPVREVLVALAEKFSESVERK
        MGRPPSNGGPAFRF   E                                               V EMEAIL  HN  MP R +L ALA+KFSES ERK
Subjt:  MGRPPSNGGPAFRFTAAEVFLCFLLSSSSSLFSFFFFYIVNDASVMHVRCWSSNFSGSAQLQCLLVAEMEAILQAHNNTMPVREVLVALAEKFSESVERK

Query:  GKIAVQMKQVWNWFQNRRYAIRAKSSKAPGKLAVSPIVQIE-STPVRNVPQTIVV--------------PAPAPVGSAKGAPENPYRSLK----------
        GK+ VQ KQ+WNWFQNRRYA+RA+ +KAPGKL VS + +++    +R+V Q + V              PAP+  G  +   +N Y   +          
Subjt:  GKIAVQMKQVWNWFQNRRYAIRAKSSKAPGKLAVSPIVQIE-STPVRNVPQTIVV--------------PAPAPVGSAKGAPENPYRSLK----------

Query:  ----------LNLGGMEVLVRFAGFGSR-----------RMSGLISEGT-------SDLVLYLVNHQNVWQFFQAISSYAFRRVKSRHFTLMPMCLIHKE
                  L +G  EV VRFAGF              R   L  E +        DLVL     ++   +F AI   A RR   RH      C     
Subjt:  ----------LNLGGMEVLVRFAGFGSR-----------RMSGLISEGT-------SDLVLYLVNHQNVWQFFQAISSYAFRRVKSRHFTLMPMCLIHKE

Query:  EDMMYEVVVAVSHELRTLAFDVGKSWEFRVLLMAAAEIVQLRKICRRPETDYRLQQLH-AVNEAASVEPPK------SGMDSLLLSGQRLNFEATQKPLS
               +V  SH+                   +  EIV LRKICRRPETDYRLQQLH AVN+ A+    +      +    L L G  +   A   P S
Subjt:  EDMMYEVVVAVSHELRTLAFDVGKSWEFRVLLMAAAEIVQLRKICRRPETDYRLQQLH-AVNEAASVEPPK------SGMDSLLLSGQRLNFEATQKPLS

Query:  KD-------ATMVIPNAN
        KD       AT+V P++N
Subjt:  KD-------ATMVIPNAN


Sequences
CDS sequence
ATGGCTAGGCTGGCGATGGCGACGCATTATCAGGGGCTTTCCTGTAATTATATATTTATTCAGGGAGATTCCGGTAAAATAGCGAAAGAATTTTCGATCTCAGACGTCTA
TAAATACCATCCTCCTCTCCGGTTTCTGGCCTCATCCTTCAATTCGGTCAGCGAGACGATTGCCACAGCGAACGATAATCCCATCAGCAATCAGACTCACCACAGGAAAT
GGGGAAAAGAAAGTCAAGAGCGAAGCCCCCACCTAAGAAGAGAATGGACAAGCTCGATACTGTTTTCAGCTGCCCCTTCTGCAATCATGGGACCAGTGTGGAATGTCGCA
TGGTATTTGTATGGTTTTAGTGATATGAAGAACCTGATTGGGGAAGCCTCCTGTAGGATTTGCCAAGAGGGTTTAGCACAACCATTACAGAATAGATGGATGTATGGAAG
ATTTGATCTGCAGTTCCACCTTCCATCATCCCTAGATTTTTGTTCTGTTGTTCTCTCTTGCTTCTGTGACAGCGACGGGGGATTTCAGAGAAAACAGCGAAAACTTGAGC
TTATGGGTCGGCCTCCGAGCAATGGAGGTCCTGCCTTCCGCTTCACTGCTGCCGAGGTTTTCCTCTGCTTCCTCCTCTCTTCTTCTTCATCGTTGTTTTCTTTTTTTTTT
TTTTATATTGTAAATGATGCGTCTGTGATGCATGTGCGATGTTGGAGTTCGAATTTCTCTGGGTCGGCTCAATTGCAATGTCTATTGGTCGCAGAGATGGAAGCTATATT
GCAAGCACACAATAATACAATGCCAGTTCGGGAAGTTCTTGTTGCCCTTGCTGAGAAGTTCAGTGAATCAGTAGAACGGAAAGGGAAGATTGCTGTGCAAATGAAGCAAG
TTTGGAATTGGTTCCAGAATAGACGATATGCTATCAGAGCAAAGTCAAGCAAGGCTCCTGGAAAGTTAGCTGTCTCTCCAATTGTCCAAATCGAATCAACTCCAGTGAGA
AATGTGCCTCAAACCATCGTTGTTCCTGCTCCCGCACCAGTAGGCTCTGCGAAGGGTGCTCCAGAAAATCCTTATCGGAGTTTGAAGCTAAATCTGGGAGGGATGGAAGT
ACTAGTTAGATTTGCTGGTTTTGGATCGAGGAGGATGAGTGGGTTAATATCCGAAGGAACATCAGACCTCGTTCTCTACCTTGTGAATCATCAGAATGTGTGGCAGTTCT
TCCAGGCGATCTCATCTTATGCTTTCAGGAGGGTAAAGAGCAGGCACTTTACTTTGATGCCCATGTGCTTGATACACAAAGAAGAAGACATGATGTACGAGGTTGTCGTT
GCAGTCTCACACGAGCTGCGGACTTTAGCTTTCGATGTGGGGAAAAGTTGGGAATTTAGAGTTTTGTTAATGGCTGCTGCTGAAATTGTTCAGTTGAGAAAGATTTGCCG
TCGGCCCGAGACTGATTACAGGTTGCAACAGCTTCATGCTGTAAATGAAGCAGCATCCGTTGAGCCCCCAAAGTCTGGCATGGATTCTTTACTGCTCAGCGGCCAAAGGC
TAAATTTCGAGGCAACACAAAAGCCACTCAGCAAGGACGCAACCATGGTTATACCAAATGCAAACACAAATATAGACGTCCATGCCCAAACAAGTACTCAGGAAGCAAGA
AATACCGAAACTAGCAGTGCTCCAACCACGTTCAACTCTGGTAATCCCGCAGGTAGCTCTGCATTCTCGAGTGGTATCGTGACGAGCTCTGTTTCTGGTGGGTTGGGGGA
CAATGTGTCTGATGGGAAGTTACTTAGTTGA
mRNA sequence
ATGGCTAGGCTGGCGATGGCGACGCATTATCAGGGGCTTTCCTGTAATTATATATTTATTCAGGGAGATTCCGGTAAAATAGCGAAAGAATTTTCGATCTCAGACGTCTA
TAAATACCATCCTCCTCTCCGGTTTCTGGCCTCATCCTTCAATTCGGTCAGCGAGACGATTGCCACAGCGAACGATAATCCCATCAGCAATCAGACTCACCACAGGAAAT
GGGGAAAAGAAAGTCAAGAGCGAAGCCCCCACCTAAGAAGAGAATGGACAAGCTCGATACTGTTTTCAGCTGCCCCTTCTGCAATCATGGGACCAGTGTGGAATGTCGCA
TGGTATTTGTATGGTTTTAGTGATATGAAGAACCTGATTGGGGAAGCCTCCTGTAGGATTTGCCAAGAGGGTTTAGCACAACCATTACAGAATAGATGGATGTATGGAAG
ATTTGATCTGCAGTTCCACCTTCCATCATCCCTAGATTTTTGTTCTGTTGTTCTCTCTTGCTTCTGTGACAGCGACGGGGGATTTCAGAGAAAACAGCGAAAACTTGAGC
TTATGGGTCGGCCTCCGAGCAATGGAGGTCCTGCCTTCCGCTTCACTGCTGCCGAGGTTTTCCTCTGCTTCCTCCTCTCTTCTTCTTCATCGTTGTTTTCTTTTTTTTTT
TTTTATATTGTAAATGATGCGTCTGTGATGCATGTGCGATGTTGGAGTTCGAATTTCTCTGGGTCGGCTCAATTGCAATGTCTATTGGTCGCAGAGATGGAAGCTATATT
GCAAGCACACAATAATACAATGCCAGTTCGGGAAGTTCTTGTTGCCCTTGCTGAGAAGTTCAGTGAATCAGTAGAACGGAAAGGGAAGATTGCTGTGCAAATGAAGCAAG
TTTGGAATTGGTTCCAGAATAGACGATATGCTATCAGAGCAAAGTCAAGCAAGGCTCCTGGAAAGTTAGCTGTCTCTCCAATTGTCCAAATCGAATCAACTCCAGTGAGA
AATGTGCCTCAAACCATCGTTGTTCCTGCTCCCGCACCAGTAGGCTCTGCGAAGGGTGCTCCAGAAAATCCTTATCGGAGTTTGAAGCTAAATCTGGGAGGGATGGAAGT
ACTAGTTAGATTTGCTGGTTTTGGATCGAGGAGGATGAGTGGGTTAATATCCGAAGGAACATCAGACCTCGTTCTCTACCTTGTGAATCATCAGAATGTGTGGCAGTTCT
TCCAGGCGATCTCATCTTATGCTTTCAGGAGGGTAAAGAGCAGGCACTTTACTTTGATGCCCATGTGCTTGATACACAAAGAAGAAGACATGATGTACGAGGTTGTCGTT
GCAGTCTCACACGAGCTGCGGACTTTAGCTTTCGATGTGGGGAAAAGTTGGGAATTTAGAGTTTTGTTAATGGCTGCTGCTGAAATTGTTCAGTTGAGAAAGATTTGCCG
TCGGCCCGAGACTGATTACAGGTTGCAACAGCTTCATGCTGTAAATGAAGCAGCATCCGTTGAGCCCCCAAAGTCTGGCATGGATTCTTTACTGCTCAGCGGCCAAAGGC
TAAATTTCGAGGCAACACAAAAGCCACTCAGCAAGGACGCAACCATGGTTATACCAAATGCAAACACAAATATAGACGTCCATGCCCAAACAAGTACTCAGGAAGCAAGA
AATACCGAAACTAGCAGTGCTCCAACCACGTTCAACTCTGGTAATCCCGCAGGTAGCTCTGCATTCTCGAGTGGTATCGTGACGAGCTCTGTTTCTGGTGGGTTGGGGGA
CAATGTGTCTGATGGGAAGTTACTTAGTTGA
Protein sequence
MARLAMATHYQGLSCNYIFIQGDSGKIAKEFSISDVYKYHPPLRFLASSFNSVSETIATANDNPISNQTHHRKWGKESQERSPHLRREWTSSILFSAAPSAIMGPVWNVA
WYLYGFSDMKNLIGEASCRICQEGLAQPLQNRWMYGRFDLQFHLPSSLDFCSVVLSCFCDSDGGFQRKQRKLELMGRPPSNGGPAFRFTAAEVFLCFLLSSSSSLFSFFF
FYIVNDASVMHVRCWSSNFSGSAQLQCLLVAEMEAILQAHNNTMPVREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKSSKAPGKLAVSPIVQIESTPVR
NVPQTIVVPAPAPVGSAKGAPENPYRSLKLNLGGMEVLVRFAGFGSRRMSGLISEGTSDLVLYLVNHQNVWQFFQAISSYAFRRVKSRHFTLMPMCLIHKEEDMMYEVVV
AVSHELRTLAFDVGKSWEFRVLLMAAAEIVQLRKICRRPETDYRLQQLHAVNEAASVEPPKSGMDSLLLSGQRLNFEATQKPLSKDATMVIPNANTNIDVHAQTSTQEAR
NTETSSAPTTFNSGNPAGSSAFSSGIVTSSVSGGLGDNVSDGKLLS