; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009031 (gene) of Snake gourd v1 genome

Gene IDTan0009031
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionMYB domain class transcription factor
Genome locationLG05:69407233..69408827
RNA-Seq ExpressionTan0009031
SyntenyTan0009031
Gene Ontology termsGO:0010119 - regulation of stomatal movement (biological process)
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152489.2 transcription factor MYB86 [Cucumis sativus]7.3e-16882.29Show/hide
Query:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
Subjt:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDKEFTQIPPVQGL---SGGLS----TQGPTFLLGGADYYDGGMTAAPIRDHFLN-KQA
        DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFED+EF QIPPVQG+    GG+S     QGP FLLGG DYYDGG+T  PIRDH +N KQA
Subjt:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDKEFTQIPPVQGL---SGGLS----TQGPTFLLGGADYYDGGMTAAPIRDHFLN-KQA

Query:  --FDSLCYFEFQTGL--ESCGY-INNNNSSFETQYQSNVQNFSDTNSNFGFSSVPSLTNSDHGSLSGTEFSDNSGSNMSNYGGFYMNNNNN---GVDNST
           DSLC+FEFQTG    SC Y  NNNN++FETQYQ+NVQ       +FGF+SVPSLTNSDHGSLSGTEFS+NSGSN+SNYGGFYMNNNNN    VDNST
Subjt:  --FDSLCYFEFQTGL--ESCGY-INNNNSSFETQYQSNVQNFSDTNSNFGFSSVPSLTNSDHGSLSGTEFSDNSGSNMSNYGGFYMNNNNN---GVDNST

Query:  FCSWENE-NKLESFFQIHVNNNNNNNNHNNGIKSEELKTVT--ASSVLDGQLIQSRSSVDFSSYPLTSLSQHSTGGNFDAFHHL
        FCSWENE NKLES+FQI VNNNNNNNN+NNGIKSEELK V    SS+ DGQLIQSRSS+DFSSYPL SLSQH T  NF  FHHL
Subjt:  FCSWENE-NKLESFFQIHVNNNNNNNNHNNGIKSEELKTVT--ASSVLDGQLIQSRSSVDFSSYPLTSLSQHSTGGNFDAFHHL

XP_008438768.1 PREDICTED: transcription factor MYB86-like [Cucumis melo]2.9e-16481.3Show/hide
Query:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
Subjt:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDKEFTQIPPVQGL---SGGLST-----QGPTFLLGGADYYDGGMTAAP--IRDHFL-N
        DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFED+EF QIPPVQG+    GG+S+     QGP FLLGG DYYDGG+T     IRDH + N
Subjt:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDKEFTQIPPVQGL---SGGLST-----QGPTFLLGGADYYDGGMTAAP--IRDHFL-N

Query:  KQAF-DSLCYFEFQTGLESCGY--INNNNSSFE-TQYQSNVQNFSDTNSNFGFSSVPSLTNSDHGSLSGTEFSDNSGSNMSNYGGFYMNNNNN---GVDN
        KQA  DSLC++EFQTGL+SC Y   NNNN++FE TQYQ+N+Q       +FGF+SVPSLTNSDHGSLSGTEFS+NSGSN+SNYGGFYMNNNNN    VDN
Subjt:  KQAF-DSLCYFEFQTGLESCGY--INNNNSSFE-TQYQSNVQNFSDTNSNFGFSSVPSLTNSDHGSLSGTEFSDNSGSNMSNYGGFYMNNNNN---GVDN

Query:  STFCSWENE-NKLESFFQIHVNNNNNNNNHNNGIKSEELKTVT--ASSVLDGQLIQSRSSVDFSSYPLTSLSQHSTGGNFDAFHH
        STFCSWEN+ NKLES+FQI VNNNNNNNN NNGIKSEELK VT   SS+ DGQLIQSRSS+DFSSYPL SLSQH T  NF  FHH
Subjt:  STFCSWENE-NKLESFFQIHVNNNNNNNNHNNGIKSEELKTVT--ASSVLDGQLIQSRSSVDFSSYPLTSLSQHSTGGNFDAFHH

XP_023001524.1 transcription factor MYB61-like [Cucurbita maxima]1.9e-16080.7Show/hide
Query:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQE+DLIISLH+VLGNRWAQIAAQLP RT
Subjt:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDKEFTQIPPVQGLSGGLS---TQGPTFLLGGADYYDGGMTAAPIRDHFLNKQAFDSLC
        DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINI EIKEKKIFEDKEFT+IPPV+GL  G+S    QGP FLLGG DYYDGG+TAAP RDHF+NKQ  DSLC
Subjt:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDKEFTQIPPVQGLSGGLS---TQGPTFLLGGADYYDGGMTAAPIRDHFLNKQAFDSLC

Query:  YFEFQTGLESCGYINNNNSSFETQYQSNVQNFSDTNSNFGFSSVPSLTNSDHGSLSGTEFSDNSGSNMSNYGGFYMNN--NNNGVDNSTFCSWENENKLE
        YFEFQTG +               Y S+VQN SDTNSNFGFSS+PSLTNSDHGSLSGTEFSDNSGSNMSNYGGFYMNN  NNNGVDNSTFCSWE+ENK +
Subjt:  YFEFQTGLESCGYINNNNSSFETQYQSNVQNFSDTNSNFGFSSVPSLTNSDHGSLSGTEFSDNSGSNMSNYGGFYMNN--NNNGVDNSTFCSWENENKLE

Query:  SFFQIHVNN--NNNNNNH-NNGIKSEELKTVTASSVLDGQLIQSRSSVDFSSYPLTSLSQHSTGGNFDAFHHL
         FFQ HVNN  NNNNNNH NNGIKSEEL    +SSVLDGQLIQ         YPL SLSQ     NFDAF+HL
Subjt:  SFFQIHVNN--NNNNNNH-NNGIKSEELKTVTASSVLDGQLIQSRSSVDFSSYPLTSLSQHSTGGNFDAFHHL

XP_023519924.1 transcription factor MYB86-like [Cucurbita pepo subsp. pepo]2.8e-15980.21Show/hide
Query:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQE+DLIISLHEVLGNRWAQIAAQLPGRT
Subjt:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDKEFTQIPPVQGLSGGLS---TQGPTFLLGGADYYDGGMTAAPIRDHFLNKQAFDSLC
        DNEIKNFWNSSLKKKL+KQGIDPNTHKPIINI EIKEKKIFEDKEF +IPPV+GL  G+S    QGP FLLGG DYYDGG+TAAPIRDHF+NKQ  DSLC
Subjt:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDKEFTQIPPVQGLSGGLS---TQGPTFLLGGADYYDGGMTAAPIRDHFLNKQAFDSLC

Query:  YFEFQTGLESCGYINNNNSSFETQYQSNVQNFSDTNSNFGFSSVPSLTNSDHGSLSGTEFSDNSGSNMSNYGGFYMNN--NNNGVDNSTFCSWENENKLE
        YFEFQTG +                     N SDTNSNFGFSS+PSLTNSDHGSLSGTEFSDNSGSNMSNYGGFYMNN  NNNGVDNSTFCSWE+ENK +
Subjt:  YFEFQTGLESCGYINNNNSSFETQYQSNVQNFSDTNSNFGFSSVPSLTNSDHGSLSGTEFSDNSGSNMSNYGGFYMNN--NNNGVDNSTFCSWENENKLE

Query:  SFFQIHVNNN--NNNNNH--NNGIKSEELKTVTASSVLDGQLIQSRSSVDFSSYPLTSLSQHSTGGNFDAFHHL
         FFQ HVNNN  NNNNNH  NNGIKSEEL    +SSVLDGQLIQ        SYPLTSLSQ     NFDAF+HL
Subjt:  SFFQIHVNNN--NNNNNH--NNGIKSEELKTVTASSVLDGQLIQSRSSVDFSSYPLTSLSQHSTGGNFDAFHHL

XP_038895207.1 transcription factor MYB86-like [Benincasa hispida]1.9e-17184.7Show/hide
Query:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
Subjt:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDKEFTQIPPVQGLSGG----LSTQGPTFLLGGADYYDGGMTAAPIRDHFLNKQAF-DS
        DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIF+DKEF  IPPVQGLSGG    ++ QGP FLL GADYYD G+T AP+RD+F+NKQAF DS
Subjt:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDKEFTQIPPVQGLSGG----LSTQGPTFLLGGADYYDGGMTAAPIRDHFLNKQAF-DS

Query:  LCYFEFQTGLESCGYINNNNSSFETQYQSNVQNFSDTNSNFGFSSVPSLTNSDHGSLSGTEFSDNSGSNMSNYGGFYM-----NNNNNGVDNSTFCSWEN
        LCYFEFQTGLESC Y  NNN++FETQYQ           NFGF+SVPSLTNSDHGSLSGTEFSDNSGSNMSNYGGFYM     NNNNNGVDNSTFCSWEN
Subjt:  LCYFEFQTGLESCGYINNNNSSFETQYQSNVQNFSDTNSNFGFSSVPSLTNSDHGSLSGTEFSDNSGSNMSNYGGFYM-----NNNNNGVDNSTFCSWEN

Query:  E-NKLESFFQIHVNNNNNNNNHNNGIKSE-ELKTVT--ASSVLDGQLIQSRSSVDFSSYPLTSLSQHSTGGNFDAFHHL
        E NKLES+FQI VNNNNNNNN NNGIKSE ELK VT  +SSV+DGQLIQSRSS+DFSSYPL SLSQH +  NF AFHHL
Subjt:  E-NKLESFFQIHVNNNNNNNNHNNGIKSE-ELKTVT--ASSVLDGQLIQSRSSVDFSSYPLTSLSQHSTGGNFDAFHHL

TrEMBL top hitse value%identityAlignment
A0A0A0LRA3 Uncharacterized protein3.6e-16882.29Show/hide
Query:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
Subjt:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDKEFTQIPPVQGL---SGGLS----TQGPTFLLGGADYYDGGMTAAPIRDHFLN-KQA
        DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFED+EF QIPPVQG+    GG+S     QGP FLLGG DYYDGG+T  PIRDH +N KQA
Subjt:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDKEFTQIPPVQGL---SGGLS----TQGPTFLLGGADYYDGGMTAAPIRDHFLN-KQA

Query:  --FDSLCYFEFQTGL--ESCGY-INNNNSSFETQYQSNVQNFSDTNSNFGFSSVPSLTNSDHGSLSGTEFSDNSGSNMSNYGGFYMNNNNN---GVDNST
           DSLC+FEFQTG    SC Y  NNNN++FETQYQ+NVQ       +FGF+SVPSLTNSDHGSLSGTEFS+NSGSN+SNYGGFYMNNNNN    VDNST
Subjt:  --FDSLCYFEFQTGL--ESCGY-INNNNSSFETQYQSNVQNFSDTNSNFGFSSVPSLTNSDHGSLSGTEFSDNSGSNMSNYGGFYMNNNNN---GVDNST

Query:  FCSWENE-NKLESFFQIHVNNNNNNNNHNNGIKSEELKTVT--ASSVLDGQLIQSRSSVDFSSYPLTSLSQHSTGGNFDAFHHL
        FCSWENE NKLES+FQI VNNNNNNNN+NNGIKSEELK V    SS+ DGQLIQSRSS+DFSSYPL SLSQH T  NF  FHHL
Subjt:  FCSWENE-NKLESFFQIHVNNNNNNNNHNNGIKSEELKTVT--ASSVLDGQLIQSRSSVDFSSYPLTSLSQHSTGGNFDAFHHL

A0A1S3AXV0 transcription factor MYB86-like1.4e-16481.3Show/hide
Query:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
Subjt:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDKEFTQIPPVQGL---SGGLST-----QGPTFLLGGADYYDGGMTAAP--IRDHFL-N
        DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFED+EF QIPPVQG+    GG+S+     QGP FLLGG DYYDGG+T     IRDH + N
Subjt:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDKEFTQIPPVQGL---SGGLST-----QGPTFLLGGADYYDGGMTAAP--IRDHFL-N

Query:  KQAF-DSLCYFEFQTGLESCGY--INNNNSSFE-TQYQSNVQNFSDTNSNFGFSSVPSLTNSDHGSLSGTEFSDNSGSNMSNYGGFYMNNNNN---GVDN
        KQA  DSLC++EFQTGL+SC Y   NNNN++FE TQYQ+N+Q       +FGF+SVPSLTNSDHGSLSGTEFS+NSGSN+SNYGGFYMNNNNN    VDN
Subjt:  KQAF-DSLCYFEFQTGLESCGY--INNNNSSFE-TQYQSNVQNFSDTNSNFGFSSVPSLTNSDHGSLSGTEFSDNSGSNMSNYGGFYMNNNNN---GVDN

Query:  STFCSWENE-NKLESFFQIHVNNNNNNNNHNNGIKSEELKTVT--ASSVLDGQLIQSRSSVDFSSYPLTSLSQHSTGGNFDAFHH
        STFCSWEN+ NKLES+FQI VNNNNNNNN NNGIKSEELK VT   SS+ DGQLIQSRSS+DFSSYPL SLSQH T  NF  FHH
Subjt:  STFCSWENE-NKLESFFQIHVNNNNNNNNHNNGIKSEELKTVT--ASSVLDGQLIQSRSSVDFSSYPLTSLSQHSTGGNFDAFHH

A0A6J1GSH4 transcription factor MYB86-like4.4e-15879.67Show/hide
Query:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
Subjt:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDKEFTQIPPVQGLSGGLS---TQGPTFLLGGADYYDGGMTAAPIRDHFLNKQAFDSLC
        DNEIKNFWNSSLKKKLMKQGIDPNTHKP+  I+EIKEKKIFEDKEFTQIPPV GLSGG+S   TQ P FL+GGA+YYDGGMTA P RDHFLNKQA DSLC
Subjt:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDKEFTQIPPVQGLSGGLS---TQGPTFLLGGADYYDGGMTAAPIRDHFLNKQAFDSLC

Query:  YFEFQTGLESCGYINNNNSSFETQYQSNVQNFSDTNSNFGFSSVPSLTNSDHGSLSGTEFSDNSGSNMSNYGGFYMNNNNNG-VDNSTFCSWENENKLES
        +FEFQTGL+                           SNFGFSSVPSLTNSDHGSLSGTEFSDNS       GGFYMNNNNN  VDNST CSWENENKLES
Subjt:  YFEFQTGLESCGYINNNNSSFETQYQSNVQNFSDTNSNFGFSSVPSLTNSDHGSLSGTEFSDNSGSNMSNYGGFYMNNNNNG-VDNSTFCSWENENKLES

Query:  FFQIHVNNNNNNNNHNNGIKSEELKTVTASSVLDGQLIQSRSSVDFSSYPLTSLSQHSTGGNFDAFHHL
        +FQ+HV      +NH+N +KSEELKTVT SS++DGQLIQSRSSVDFSSYPLTSL+Q   GGNFD FHHL
Subjt:  FFQIHVNNNNNNNNHNNGIKSEELKTVTASSVLDGQLIQSRSSVDFSSYPLTSLSQHSTGGNFDAFHHL

A0A6J1K3F5 transcription factor MYB86-like1.1e-15880.27Show/hide
Query:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
Subjt:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDKEFTQIPPVQGLSGGLS---TQGPTFLLGGADYYDGGMTAAPIRDHFLNKQAFDSLC
        DNEIKNFWNSSLKKKLMKQGIDPNTHKP+  I+EIKEKKIFEDKEFTQIPPV GLSGGLS   TQ P FL+GGA+YYDGGMTA P RDHFLNKQA DSLC
Subjt:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDKEFTQIPPVQGLSGGLS---TQGPTFLLGGADYYDGGMTAAPIRDHFLNKQAFDSLC

Query:  YFEFQTGLESCGYINNNNSSFETQYQSNVQNFSDTNSNFGFSSVPSLTNSDHGSLSGTEFSDNSGSNMSNYGGFYMNNNNNG--VDNSTFCSWENENKLE
        +FEFQTGL+                           SNFGFSSVPSLTNSDHGSLSGTEFSDNS       GGFYMNNNNN   VDNSTFCSWENENK E
Subjt:  YFEFQTGLESCGYINNNNSSFETQYQSNVQNFSDTNSNFGFSSVPSLTNSDHGSLSGTEFSDNSGSNMSNYGGFYMNNNNNG--VDNSTFCSWENENKLE

Query:  SFFQIHVNNNNNNNNHNNGIKSEELKTVTASSVLDGQLIQSRSSVDFSSYPLTSLSQHSTGGNFDAFHHL
        SFFQIHV      ++H+N +KSEELKTVT SSV+DGQLIQSRSSVDFSSYPLTSL+Q   GGNFD FHHL
Subjt:  SFFQIHVNNNNNNNNHNNGIKSEELKTVTASSVLDGQLIQSRSSVDFSSYPLTSLSQHSTGGNFDAFHHL

A0A6J1KMZ3 transcription factor MYB61-like9.4e-16180.7Show/hide
Query:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQE+DLIISLH+VLGNRWAQIAAQLP RT
Subjt:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDKEFTQIPPVQGLSGGLS---TQGPTFLLGGADYYDGGMTAAPIRDHFLNKQAFDSLC
        DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINI EIKEKKIFEDKEFT+IPPV+GL  G+S    QGP FLLGG DYYDGG+TAAP RDHF+NKQ  DSLC
Subjt:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDKEFTQIPPVQGLSGGLS---TQGPTFLLGGADYYDGGMTAAPIRDHFLNKQAFDSLC

Query:  YFEFQTGLESCGYINNNNSSFETQYQSNVQNFSDTNSNFGFSSVPSLTNSDHGSLSGTEFSDNSGSNMSNYGGFYMNN--NNNGVDNSTFCSWENENKLE
        YFEFQTG +               Y S+VQN SDTNSNFGFSS+PSLTNSDHGSLSGTEFSDNSGSNMSNYGGFYMNN  NNNGVDNSTFCSWE+ENK +
Subjt:  YFEFQTGLESCGYINNNNSSFETQYQSNVQNFSDTNSNFGFSSVPSLTNSDHGSLSGTEFSDNSGSNMSNYGGFYMNN--NNNGVDNSTFCSWENENKLE

Query:  SFFQIHVNN--NNNNNNH-NNGIKSEELKTVTASSVLDGQLIQSRSSVDFSSYPLTSLSQHSTGGNFDAFHHL
         FFQ HVNN  NNNNNNH NNGIKSEEL    +SSVLDGQLIQ         YPL SLSQ     NFDAF+HL
Subjt:  SFFQIHVNN--NNNNNNH-NNGIKSEELKTVTASSVLDGQLIQSRSSVDFSSYPLTSLSQHSTGGNFDAFHHL

SwissProt top hitse value%identityAlignment
P20027 Myb-related protein Hv334.6e-5665.81Show/hide
Query:  KLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRTDNEIKNFWNS
        K+RKGLWSPEEDEKL+N+I R GVGCWSSVP+LA L RCGKSCRLRWINYLRPDLKRG FSQQEED I++LH++LGNRW+QIA+ LPGRTDNEIKNFWNS
Subjt:  KLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRTDNEIKNFWNS

Query:  SLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDKEFTQIPPV-QGLSGGLSTQGP
         +KKKL +QGIDP THKP+ +           D E     P+   + G L  + P
Subjt:  SLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDKEFTQIPPV-QGLSGGLSTQGP

P80073 Myb-related protein Pp21.2e-5168.22Show/hide
Query:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGR  CC K  LR+G W+ EED+KL ++IT  G+ CW ++PKLAGL RCGKSCRLRW NYLRPDLKRG+FS+ EE+LI+ LH  LGNRW++IAAQLPGRT
Subjt:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPI
        DNEIKN+WN+ LKK+L  QG+DPNTH P+
Subjt:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPI

Q8LPH6 Transcription factor MYB869.6e-6247.51Show/hide
Query:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGRHSCC KQKLRKGLWSPEEDEKL NYITR G GCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRG FSQ EE LII LH  LGNRW+QIA +LPGRT
Subjt:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDK-EFTQIPPVQGLSGGLSTQGPTFLLGGADYYDGGMTAAPIRDHFLNKQAFDSLCYF
        DNEIKNFWNS LKKKL ++GIDP THKP+I   E++   + + K   +++    G    L  Q          ++    T    ++         + C+ 
Subjt:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDK-EFTQIPPVQGLSGGLSTQGPTFLLGGADYYDGGMTAAPIRDHFLNKQAFDSLCYF

Query:  EFQTGLESCGYINNNNS-SFETQYQSNVQNFSDTNSNFG----FSSVPSLTNSDHGSLSGTEFSDNSGSNMSNYGGFYMNNNNNGVD----NSTFCSWEN
           T   S   ++  +S S  +       NFS   +N+      ++VPS +NS + +  G ++++ S +         MNNNN  VD    +    SW +
Subjt:  EFQTGLESCGYINNNNS-SFETQYQSNVQNFSDTNSNFG----FSSVPSLTNSDHGSLSGTEFSDNSGSNMSNYGGFYMNNNNNGVD----NSTFCSWEN

Query:  E
        E
Subjt:  E

Q8VZQ2 Transcription factor MYB611.2e-5947.34Show/hide
Query:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGRHSCC KQKLRKGLWSPEEDEKL  +IT  G GCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRG FS +EE+LI+ LH VLGNRW+QIA++LPGRT
Subjt:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDKEFTQIPPVQGLSGGLSTQGPTFLLGGADYYDGGMTAAPIRDHFLNKQAFDSLCYFE
        DNEIKN WNSS+KKKL ++GIDPNTHKPI  ++   +K    DK  T                      G D+     ++A  +D FL + + D   YF 
Subjt:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDKEFTQIPPVQGLSGGLSTQGPTFLLGGADYYDGGMTAAPIRDHFLNKQAFDSLCYFE

Query:  FQTGLESCGYIN-NNNSSFETQYQSNV-----QNFSDTNSNFGFSSVPSLTN------SDHGSLSGTEFSDNSGSNMSNYGGFYMNNNN--NGVDNSTFC
        FQ        +N N+N        S++       FS  N        P           D+ S S     D+      N+  F  NNNN  N  DN  F 
Subjt:  FQTGLESCGYIN-NNNSSFETQYQSNV-----QNFSDTNSNFGFSSVPSLTN------SDHGSLSGTEFSDNSGSNMSNYGGFYMNNNN--NGVDNSTFC

Query:  SWENENKLESFFQIHVNNN
        SW   N   S  Q+  N+N
Subjt:  SWENENKLESFFQIHVNNN

Q9SPG3 Transcription factor MYB262.4e-5275.4Show/hide
Query:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAG---------LQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQ
        MG HSCC KQK+++GLWSPEEDEKL NYI  +G GCWSSVPK AG         LQRCGKSCRLRWINYLRPDLKRG FS QE  LII LH +LGNRWAQ
Subjt:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAG---------LQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQ

Query:  IAAQLPGRTDNEIKNFWNSSLKKKLM
        IA  LPGRTDNE+KNFWNSS+KKKLM
Subjt:  IAAQLPGRTDNEIKNFWNSSLKKKLM

Arabidopsis top hitse value%identityAlignment
AT1G09540.1 myb domain protein 618.3e-6147.34Show/hide
Query:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGRHSCC KQKLRKGLWSPEEDEKL  +IT  G GCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRG FS +EE+LI+ LH VLGNRW+QIA++LPGRT
Subjt:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDKEFTQIPPVQGLSGGLSTQGPTFLLGGADYYDGGMTAAPIRDHFLNKQAFDSLCYFE
        DNEIKN WNSS+KKKL ++GIDPNTHKPI  ++   +K    DK  T                      G D+     ++A  +D FL + + D   YF 
Subjt:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDKEFTQIPPVQGLSGGLSTQGPTFLLGGADYYDGGMTAAPIRDHFLNKQAFDSLCYFE

Query:  FQTGLESCGYIN-NNNSSFETQYQSNV-----QNFSDTNSNFGFSSVPSLTN------SDHGSLSGTEFSDNSGSNMSNYGGFYMNNNN--NGVDNSTFC
        FQ        +N N+N        S++       FS  N        P           D+ S S     D+      N+  F  NNNN  N  DN  F 
Subjt:  FQTGLESCGYIN-NNNSSFETQYQSNV-----QNFSDTNSNFGFSSVPSLTN------SDHGSLSGTEFSDNSGSNMSNYGGFYMNNNN--NGVDNSTFC

Query:  SWENENKLESFFQIHVNNN
        SW   N   S  Q+  N+N
Subjt:  SWENENKLESFFQIHVNNN

AT4G01680.1 myb domain protein 555.2e-6381.95Show/hide
Query:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGRHSCC KQKLRKGLWSPEEDEKL  YIT++G GCWSSVPK AGLQRCGKSCRLRWINYLRPDLKRG FSQ EE+LII LH VLGNRW+QIAAQLPGRT
Subjt:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQ
        DNEIKN WNS LKKKL  +GIDP THK +  I+
Subjt:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQ

AT4G01680.2 myb domain protein 552.4e-6075.17Show/hide
Query:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPK------------LAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNR
        MGRHSCC KQKLRKGLWSPEEDEKL  YIT++G GCWSSVPK            L GLQRCGKSCRLRWINYLRPDLKRG FSQ EE+LII LH VLGNR
Subjt:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPK------------LAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNR

Query:  WAQIAAQLPGRTDNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQ
        W+QIAAQLPGRTDNEIKN WNS LKKKL  +GIDP THK +  I+
Subjt:  WAQIAAQLPGRTDNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQ

AT4G01680.3 myb domain protein 555.2e-6381.95Show/hide
Query:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGRHSCC KQKLRKGLWSPEEDEKL  YIT++G GCWSSVPK AGLQRCGKSCRLRWINYLRPDLKRG FSQ EE+LII LH VLGNRW+QIAAQLPGRT
Subjt:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQ
        DNEIKN WNS LKKKL  +GIDP THK +  I+
Subjt:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQ

AT5G26660.1 myb domain protein 866.8e-6347.51Show/hide
Query:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGRHSCC KQKLRKGLWSPEEDEKL NYITR G GCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRG FSQ EE LII LH  LGNRW+QIA +LPGRT
Subjt:  MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDK-EFTQIPPVQGLSGGLSTQGPTFLLGGADYYDGGMTAAPIRDHFLNKQAFDSLCYF
        DNEIKNFWNS LKKKL ++GIDP THKP+I   E++   + + K   +++    G    L  Q          ++    T    ++         + C+ 
Subjt:  DNEIKNFWNSSLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDK-EFTQIPPVQGLSGGLSTQGPTFLLGGADYYDGGMTAAPIRDHFLNKQAFDSLCYF

Query:  EFQTGLESCGYINNNNS-SFETQYQSNVQNFSDTNSNFG----FSSVPSLTNSDHGSLSGTEFSDNSGSNMSNYGGFYMNNNNNGVD----NSTFCSWEN
           T   S   ++  +S S  +       NFS   +N+      ++VPS +NS + +  G ++++ S +         MNNNN  VD    +    SW +
Subjt:  EFQTGLESCGYINNNNS-SFETQYQSNVQNFSDTNSNFG----FSSVPSLTNSDHGSLSGTEFSDNSGSNMSNYGGFYMNNNNNGVD----NSTFCSWEN

Query:  E
        E
Subjt:  E


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGACGCCATTCTTGCTGTCTGAAGCAGAAACTAAGGAAAGGCCTTTGGTCGCCTGAGGAAGACGAAAAGCTCTTCAATTACATAACAAGATTTGGCGTTGGCTGTTG
GAGTTCTGTCCCAAAACTTGCTGGTTTGCAGAGATGTGGGAAGAGTTGTAGGCTGAGATGGATAAATTACTTGAGGCCTGATTTGAAGAGAGGAATGTTCTCTCAACAAG
AAGAAGATCTCATTATCAGTCTTCACGAAGTTTTGGGCAATAGATGGGCACAAATTGCGGCTCAATTACCGGGAAGAACGGATAATGAAATAAAGAATTTTTGGAATTCG
AGTTTGAAGAAGAAGCTAATGAAGCAAGGAATTGACCCAAATACTCACAAGCCAATAATCAACATTCAAGAGATTAAAGAGAAGAAGATCTTCGAAGACAAGGAATTTAC
CCAAATCCCACCGGTTCAAGGACTTTCCGGCGGCCTTTCCACTCAAGGTCCAACATTTCTCCTCGGCGGCGCCGATTACTACGACGGTGGAATGACGGCGGCGCCGATTC
GAGACCATTTCTTGAACAAACAAGCCTTCGATTCACTCTGTTATTTCGAATTCCAAACAGGGCTTGAATCCTGTGGCTACATCAACAACAACAACTCGAGTTTCGAAACT
CAGTACCAATCGAATGTGCAGAATTTCTCCGACACGAATTCGAATTTCGGGTTCAGTTCAGTGCCGAGTTTGACAAATTCCGACCATGGGAGCTTGTCGGGGACAGAATT
TTCAGACAATTCGGGGTCGAACATGAGCAACTATGGAGGATTCTACATGAACAACAACAATAATGGAGTGGATAACTCGACATTTTGTTCTTGGGAAAATGAGAATAAAT
TGGAGAGTTTTTTCCAGATTCATGTGAATAATAATAATAATAATAATAATCATAATAACGGGATAAAGTCCGAGGAATTGAAGACAGTAACAGCAAGTTCAGTGCTTGAT
GGGCAGCTAATTCAGAGTCGAAGCTCAGTAGATTTCAGTAGCTATCCATTAACGTCCCTGTCACAACACAGTACTGGAGGCAATTTTGATGCTTTCCACCATTTGTGA
mRNA sequenceShow/hide mRNA sequence
CACATAAATTTCTCTCACCACAATGGGACGCCATTCTTGCTGTCTGAAGCAGAAACTAAGGAAAGGCCTTTGGTCGCCTGAGGAAGACGAAAAGCTCTTCAATTACATAA
CAAGATTTGGCGTTGGCTGTTGGAGTTCTGTCCCAAAACTTGCTGGTTTGCAGAGATGTGGGAAGAGTTGTAGGCTGAGATGGATAAATTACTTGAGGCCTGATTTGAAG
AGAGGAATGTTCTCTCAACAAGAAGAAGATCTCATTATCAGTCTTCACGAAGTTTTGGGCAATAGATGGGCACAAATTGCGGCTCAATTACCGGGAAGAACGGATAATGA
AATAAAGAATTTTTGGAATTCGAGTTTGAAGAAGAAGCTAATGAAGCAAGGAATTGACCCAAATACTCACAAGCCAATAATCAACATTCAAGAGATTAAAGAGAAGAAGA
TCTTCGAAGACAAGGAATTTACCCAAATCCCACCGGTTCAAGGACTTTCCGGCGGCCTTTCCACTCAAGGTCCAACATTTCTCCTCGGCGGCGCCGATTACTACGACGGT
GGAATGACGGCGGCGCCGATTCGAGACCATTTCTTGAACAAACAAGCCTTCGATTCACTCTGTTATTTCGAATTCCAAACAGGGCTTGAATCCTGTGGCTACATCAACAA
CAACAACTCGAGTTTCGAAACTCAGTACCAATCGAATGTGCAGAATTTCTCCGACACGAATTCGAATTTCGGGTTCAGTTCAGTGCCGAGTTTGACAAATTCCGACCATG
GGAGCTTGTCGGGGACAGAATTTTCAGACAATTCGGGGTCGAACATGAGCAACTATGGAGGATTCTACATGAACAACAACAATAATGGAGTGGATAACTCGACATTTTGT
TCTTGGGAAAATGAGAATAAATTGGAGAGTTTTTTCCAGATTCATGTGAATAATAATAATAATAATAATAATCATAATAACGGGATAAAGTCCGAGGAATTGAAGACAGT
AACAGCAAGTTCAGTGCTTGATGGGCAGCTAATTCAGAGTCGAAGCTCAGTAGATTTCAGTAGCTATCCATTAACGTCCCTGTCACAACACAGTACTGGAGGCAATTTTG
ATGCTTTCCACCATTTGTGAACAAAATGTAAATTCAATCTTTTTTAATTTTTTTTATTTTTTTTCTCTCTCTTTTTTTTAAAAAATATACACAGATGTTGATAGTAATAG
GGGGGTTTCAAAAAAAAAAAAAAAAAAAAAAGGTTTGAGGTGAAAATATTTCCCTTTCTTTCCCTTTTTTGTTTTTGTATTTCTCTTCTATTTTTCCCCTTTTTTTGTAA
TATTTGTTTTATGATGAGTTGAAATAATGTGAAGAGTGACTCCTTG
Protein sequenceShow/hide protein sequence
MGRHSCCLKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRTDNEIKNFWNS
SLKKKLMKQGIDPNTHKPIINIQEIKEKKIFEDKEFTQIPPVQGLSGGLSTQGPTFLLGGADYYDGGMTAAPIRDHFLNKQAFDSLCYFEFQTGLESCGYINNNNSSFET
QYQSNVQNFSDTNSNFGFSSVPSLTNSDHGSLSGTEFSDNSGSNMSNYGGFYMNNNNNGVDNSTFCSWENENKLESFFQIHVNNNNNNNNHNNGIKSEELKTVTASSVLD
GQLIQSRSSVDFSSYPLTSLSQHSTGGNFDAFHHL