CuGenDBv2

Moc01g20480 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g20480
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: transcription factor MYB1
Genome location: chr1:14323216..14330657
RNA-Seq Expression: Moc01g20480
Synteny: Moc01g20480
Gene Ontology terms:
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR011016 - Zinc finger, RING-CH-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR017930 - Myb domain
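
The genome location above uses a chromosome:start..end notation. As a minimal sketch of how such a string could be split into its parts, assuming 1-based inclusive coordinates (an assumption, not stated on this page) and using the hypothetical helper name parse_location:

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end).

    parse_location is a hypothetical helper, not part of any database API.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr1:14323216..14330657")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 7442 bp on chr1, which is longer than the CDS listed further down and would be consistent with the presence of introns.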


Homology
GenBank top hits | e value | % identity | Alignment
KAG6595367.1 Transcription factor MYB1, partial [Cucurbita argyrosperma subsp. sororia] | 9.9e-190 | 82.3
Query:  TAVAEVDATAAE-GGVDACDAVTSVADCGSGDDAVPIVGEGEASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCNQL
        +A AEVDAT AE GGVDACDAV SVADC SGDDAVP+VGEGE++GGRGGKDRVKGPWSPEEDAILSRLV KFGARNWSLIARGIAGRSGKSCRLRWCNQL
Subjt:  TAVAEVDATAAE-GGVDACDAVTSVADCGSGDDAVPIVGEGEASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCNQL

Query:  DPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGNMVDDASLEKTKGSSEETLSCGDVNSFKSLEARD
        DPSVKRKPFTDEEDRIIVAAHA+HGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGN VDDASL KTKGSSEETLSCGDVNSFKSLE +D
Subjt:  DPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGNMVDDASLEKTKGSSEETLSCGDVNSFKSLEARD

Query:  ACSREHMDDQFEDKVPIAIEGQFSHEVK--EQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGHG
         CSREH+DDQFEDKVP+AIEGQFSHEVK  EQPTLFRPVARVSAFSVYNP DGQESLRAFLRPVPMQGPL+QAS PD+EASKLLEGVY DRSVPHQCGHG
Subjt:  ACSREHMDDQFEDKVPIAIEGQFSHEVK--EQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGHG

Query:  CCESHNQGSPVDSLLGPEFVDFSDPPPSFPSFELAAIATDISNLAWLKSGLENGSEKLLEFTGSVLSVVITDERSWKLFLMTVRWKGM---VLLPQICHL
        CCES+NQGS +DSLLGPEFVDFS+PPPSF SFELAAIATDISNLAWLKSGLENGS + +   G     +  DE  W LFLMTV WKG    +    +  +
Subjt:  CCESHNQGSPVDSLLGPEFVDFSDPPPSFPSFELAAIATDISNLAWLKSGLENGSEKLLEFTGSVLSVVITDERSWKLFLMTVRWKGM---VLLPQICHL

Query:  G----LESSSDECGIGIE
        G      S  + CG+ IE
Subjt:  G----LESSSDECGIGIE

XP_004147900.1 transcription factor MYB1 [Cucumis sativus] | 1.2e-187 | 90.28
Query:  MEGTAVAEVDATAAE-GGVDACDAVTSVADCGSGDDAVPIVGEGEASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWC
        MEGTA AE+ ATA E GGVDACDAV SVADCGSGDDA+P+VGEGEA+GGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWC
Subjt:  MEGTAVAEVDATAAE-GGVDACDAVTSVADCGSGDDAVPIVGEGEASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWC

Query:  NQLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGNMVDDASLEKTKGSSEETLSCGDVNSFKSLE
        NQLDPSVKRKPFTDEEDRIIVAAHA+HGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGN+VDDASLEKTKGSSEETLSCGDVNSFKS++
Subjt:  NQLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGNMVDDASLEKTKGSSEETLSCGDVNSFKSLE

Query:  ARDACSREHMDDQFEDKVPIAIEGQFSHEVKEQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGH
         +DACSRE +DDQ+EDKVPI +EGQF+HEV EQPTLFRPVARVSAFSVYNPLDGQ SLR FLRPVPMQGPL+Q SKPDVEASK LEGVYGDRSVPHQCGH
Subjt:  ARDACSREHMDDQFEDKVPIAIEGQFSHEVKEQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGH

Query:  GCCESHNQGSPVDSLLGPEFVDFSDPPPSFPSFELAAIATDISNLAWLKSGLENGSEKLL
        GCC+SHNQGSP++SLLGPEFVDFS+PPPSFPSFELAAIATDISNLAWLKSGLENGS + +
Subjt:  GCCESHNQGSPVDSLLGPEFVDFSDPPPSFPSFELAAIATDISNLAWLKSGLENGSEKLL

XP_008448879.1 PREDICTED: transcription factor MYB86 [Cucumis melo] | 2.1e-187 | 90
Query:  MEGTAVAEVDATAAE-GGVDACDAVTSVADCGSGDDAVPIVGEGEASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWC
        MEGTA  E+ A A E GGVDACDAV SVADCGSGDDA+P+VGEGEA+GGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWC
Subjt:  MEGTAVAEVDATAAE-GGVDACDAVTSVADCGSGDDAVPIVGEGEASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWC

Query:  NQLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGNMVDDASLEKTKGSSEETLSCGDVNSFKSLE
        NQLDPSVKRKPFTDEEDRIIVAAHA+HGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGN+VDDASLEKTKGSSEETLSCGDVNSFKS++
Subjt:  NQLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGNMVDDASLEKTKGSSEETLSCGDVNSFKSLE

Query:  ARDACSREHMDDQFEDKVPIAIEGQFSHEVKEQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGH
         +D CSREH+DDQ+EDKVPI +EGQFSHEV EQPTLFRPVARVSAFSVYNPLDGQ SLR FLRPVPMQGPL+Q SKPDVEASK LEGVYGDRSVPHQCGH
Subjt:  ARDACSREHMDDQFEDKVPIAIEGQFSHEVKEQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGH

Query:  GCCESHNQGSPVDSLLGPEFVDFSDPPPSFPSFELAAIATDISNLAWLKSGLENGSEKLL
        GCC+SHNQGSP++SLLGPEFVDFS+PPPSFPSFELAAIATDISNLAWLKSGLENGS + +
Subjt:  GCCESHNQGSPVDSLLGPEFVDFSDPPPSFPSFELAAIATDISNLAWLKSGLENGSEKLL

XP_022151564.1 transcription factor MYB1 [Momordica charantia] | 5.0e-202 | 98.89
Query:  MEGTAVAEVDATAAEGGVDACDAVTSVADCGSGDDAVPIVGEGEASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCN
        MEGTAVAEVDATAAEGGVDACDAVTSVADCGSGDDAVPIVGEGEASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCN
Subjt:  MEGTAVAEVDATAAEGGVDACDAVTSVADCGSGDDAVPIVGEGEASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCN

Query:  QLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGNMVDDASLEKTKGSSEETLSCGDVNSFKSLEA
        QLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGNMVDDASLEKTKGSSEETLSCGDVNSFKSLEA
Subjt:  QLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGNMVDDASLEKTKGSSEETLSCGDVNSFKSLEA

Query:  RDACSREHMDDQFEDKVPIAIEGQFSHEVKEQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGHG
        RDACSREHMDDQFEDKVPIAIEGQFSHEVKEQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGHG
Subjt:  RDACSREHMDDQFEDKVPIAIEGQFSHEVKEQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGHG

Query:  CCESHNQGSPVDSLLGPEFVDFSDPPPSFPSFELAAIATDISNLAWLKSGLENGSEKLL
        CCESHNQGSPVDSLLGPEFVDFSDPPPSFPSFELAAIATDISNLAWLKSGLENGS + +
Subjt:  CCESHNQGSPVDSLLGPEFVDFSDPPPSFPSFELAAIATDISNLAWLKSGLENGSEKLL

XP_038883582.1 transcription factor MYB1 [Benincasa hispida] | 1.5e-190 | 91.67
Query:  MEGTAVAEVDATAAE-GGVDACDAVTSVADCGSGDDAVPIVGEGEASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWC
        MEG A AE+ ATAAE GGVDACDAV SVADCGSGDDAVP+VGEGEA+GGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWC
Subjt:  MEGTAVAEVDATAAE-GGVDACDAVTSVADCGSGDDAVPIVGEGEASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWC

Query:  NQLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGNMVDDASLEKTKGSSEETLSCGDVNSFKSLE
        NQLDPSVKRKPFTDEEDRII+AAHA+HGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELE+IKLESGN+VDDASLEKTKGSSEETLSCGDVNSFKSL+
Subjt:  NQLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGNMVDDASLEKTKGSSEETLSCGDVNSFKSLE

Query:  ARDACSREHMDDQFEDKVPIAIEGQFSHEVKEQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGH
         +DACSREH+DDQ+EDKVPIAIEGQF+HEVKEQPTLFRPVARVSAFSVYNPLDGQESLR FLRPVPMQGPL+Q SKPDVEASK LEGVYGDRSVPHQCGH
Subjt:  ARDACSREHMDDQFEDKVPIAIEGQFSHEVKEQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGH

Query:  GCCESHNQGSPVDSLLGPEFVDFSDPPPSFPSFELAAIATDISNLAWLKSGLENGSEKLL
        GCC+SHNQGSP++SLLGPEFVDFS+PPPSFPSFELAAIATDISNLAWLKSGLENGS + +
Subjt:  GCCESHNQGSPVDSLLGPEFVDFSDPPPSFPSFELAAIATDISNLAWLKSGLENGSEKLL

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KXF8 Uncharacterized protein | 5.8e-188 | 90.28
Query:  MEGTAVAEVDATAAE-GGVDACDAVTSVADCGSGDDAVPIVGEGEASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWC
        MEGTA AE+ ATA E GGVDACDAV SVADCGSGDDA+P+VGEGEA+GGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWC
Subjt:  MEGTAVAEVDATAAE-GGVDACDAVTSVADCGSGDDAVPIVGEGEASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWC

Query:  NQLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGNMVDDASLEKTKGSSEETLSCGDVNSFKSLE
        NQLDPSVKRKPFTDEEDRIIVAAHA+HGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGN+VDDASLEKTKGSSEETLSCGDVNSFKS++
Subjt:  NQLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGNMVDDASLEKTKGSSEETLSCGDVNSFKSLE

Query:  ARDACSREHMDDQFEDKVPIAIEGQFSHEVKEQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGH
         +DACSRE +DDQ+EDKVPI +EGQF+HEV EQPTLFRPVARVSAFSVYNPLDGQ SLR FLRPVPMQGPL+Q SKPDVEASK LEGVYGDRSVPHQCGH
Subjt:  ARDACSREHMDDQFEDKVPIAIEGQFSHEVKEQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGH

Query:  GCCESHNQGSPVDSLLGPEFVDFSDPPPSFPSFELAAIATDISNLAWLKSGLENGSEKLL
        GCC+SHNQGSP++SLLGPEFVDFS+PPPSFPSFELAAIATDISNLAWLKSGLENGS + +
Subjt:  GCCESHNQGSPVDSLLGPEFVDFSDPPPSFPSFELAAIATDISNLAWLKSGLENGSEKLL

A0A1S3BKQ6 transcription factor MYB86 | 1.0e-187 | 90
Query:  MEGTAVAEVDATAAE-GGVDACDAVTSVADCGSGDDAVPIVGEGEASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWC
        MEGTA  E+ A A E GGVDACDAV SVADCGSGDDA+P+VGEGEA+GGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWC
Subjt:  MEGTAVAEVDATAAE-GGVDACDAVTSVADCGSGDDAVPIVGEGEASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWC

Query:  NQLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGNMVDDASLEKTKGSSEETLSCGDVNSFKSLE
        NQLDPSVKRKPFTDEEDRIIVAAHA+HGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGN+VDDASLEKTKGSSEETLSCGDVNSFKS++
Subjt:  NQLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGNMVDDASLEKTKGSSEETLSCGDVNSFKSLE

Query:  ARDACSREHMDDQFEDKVPIAIEGQFSHEVKEQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGH
         +D CSREH+DDQ+EDKVPI +EGQFSHEV EQPTLFRPVARVSAFSVYNPLDGQ SLR FLRPVPMQGPL+Q SKPDVEASK LEGVYGDRSVPHQCGH
Subjt:  ARDACSREHMDDQFEDKVPIAIEGQFSHEVKEQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGH

Query:  GCCESHNQGSPVDSLLGPEFVDFSDPPPSFPSFELAAIATDISNLAWLKSGLENGSEKLL
        GCC+SHNQGSP++SLLGPEFVDFS+PPPSFPSFELAAIATDISNLAWLKSGLENGS + +
Subjt:  GCCESHNQGSPVDSLLGPEFVDFSDPPPSFPSFELAAIATDISNLAWLKSGLENGSEKLL

A0A6J1DDF1 transcription factor MYB1 | 2.4e-202 | 98.89
Query:  MEGTAVAEVDATAAEGGVDACDAVTSVADCGSGDDAVPIVGEGEASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCN
        MEGTAVAEVDATAAEGGVDACDAVTSVADCGSGDDAVPIVGEGEASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCN
Subjt:  MEGTAVAEVDATAAEGGVDACDAVTSVADCGSGDDAVPIVGEGEASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCN

Query:  QLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGNMVDDASLEKTKGSSEETLSCGDVNSFKSLEA
        QLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGNMVDDASLEKTKGSSEETLSCGDVNSFKSLEA
Subjt:  QLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGNMVDDASLEKTKGSSEETLSCGDVNSFKSLEA

Query:  RDACSREHMDDQFEDKVPIAIEGQFSHEVKEQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGHG
        RDACSREHMDDQFEDKVPIAIEGQFSHEVKEQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGHG
Subjt:  RDACSREHMDDQFEDKVPIAIEGQFSHEVKEQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGHG

Query:  CCESHNQGSPVDSLLGPEFVDFSDPPPSFPSFELAAIATDISNLAWLKSGLENGSEKLL
        CCESHNQGSPVDSLLGPEFVDFSDPPPSFPSFELAAIATDISNLAWLKSGLENGS + +
Subjt:  CCESHNQGSPVDSLLGPEFVDFSDPPPSFPSFELAAIATDISNLAWLKSGLENGSEKLL

A0A6J1HNF3 transcription factor MYB1-like | 1.3e-184 | 91.92
Query:  AVAEVDATAAE-GGVDAC-DAVTSVADCGSGDDAVPIVGEGEASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCNQL
        A AEVDAT AE GGVDAC DAV SVADC SGDDAVP+VGEGE++GGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCNQL
Subjt:  AVAEVDATAAE-GGVDAC-DAVTSVADCGSGDDAVPIVGEGEASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCNQL

Query:  DPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGNMVDDASLEKTKGSSEETLSCGDVNSFKSLEARD
        DPSVKRKPFTDEEDRIIVAAHA+HGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGN VDDASLEKTKGSSEETLSCGDVNSFKSLE +D
Subjt:  DPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGNMVDDASLEKTKGSSEETLSCGDVNSFKSLEARD

Query:  ACSREHMDDQFEDKVPIAIEGQFSHEVK--EQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGHG
         CSREH+DDQFEDKVPIAIEGQFSHEVK  EQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPL+QAS PDVEASKLLEGVY DRSVPHQCGHG
Subjt:  ACSREHMDDQFEDKVPIAIEGQFSHEVK--EQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGHG

Query:  CCESHNQGSPVDSLLGPEFVDFSDPPPSFPSFELAAIATDISNLAWLKSGLENGSEKLL
        CCES+NQGS +DSLLGPEFVDFS+PPPSF SFELAAIATDISNLAWLKSGLENGS + +
Subjt:  CCESHNQGSPVDSLLGPEFVDFSDPPPSFPSFELAAIATDISNLAWLKSGLENGSEKLL

A0A6J1IM36 transcription factor MYB1-like isoform X1 | 7.4e-183 | 88.67
Query:  MEGTAVAEVDATAAE---GGVDACDAVTSVADCGSGDDAVPIVGEGEASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLR
        ME  A  EV A+ AE   GGV+ CDAV SVADCGSGD+A+P+VGEGE +GGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLR
Subjt:  MEGTAVAEVDATAAE---GGVDACDAVTSVADCGSGDDAVPIVGEGEASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLR

Query:  WCNQLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGNMVDDASLEKTKGSSEETLSCGDVNSFKS
        WCNQLDPSVKRKPFTDEEDRIIVAAHA+HGNKWAAIARLL GRTDNAIKNHWNSTLRRRCTELERIKLESGN+VDDASLEKTKGSSEETLSCGDVNSFKS
Subjt:  WCNQLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGNMVDDASLEKTKGSSEETLSCGDVNSFKS

Query:  LEARDACSREHMDDQFEDKVPIAIEGQFSHEVKEQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQC
         E +DA SREHMDDQFEDKVPIA EGQF+HEVKEQPTL+RPVARVSAFSVYNPLD Q SLRAF+RPVPMQGPL+QASKP+VEASKLLEGVYGDRSVPHQC
Subjt:  LEARDACSREHMDDQFEDKVPIAIEGQFSHEVKEQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQC

Query:  GHGCCESHNQGSPVDSLLGPEFVDFSDPPPSFPSFELAAIATDISNLAWLKSGLENGSEKLL
        GHGCC+SHNQGSP+DSLLGPEFVDFS+PP SFPSFELAAIATDISNLAWLKSGLENGS + +
Subjt:  GHGCCESHNQGSPVDSLLGPEFVDFSDPPPSFPSFELAAIATDISNLAWLKSGLENGSEKLL

SwissProt top hits | e value | % identity | Alignment
O04192 Transcription factor MYB25 | 3.5e-65 | 42.24
Query:  DAVTSVADCGSGDDAVPIVGEGE------ASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCNQLDPSVKRKPFTDEE
        + ++S   C S ++A+    E E      +    GGK +VKGPW PE+D  L+RLV   G RNW+LI+RGI GRSGKSCRLRWCNQLDP +KRKPF+DEE
Subjt:  DAVTSVADCGSGDDAVPIVGEGE------ASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCNQLDPSVKRKPFTDEE

Query:  DRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGNMVDDASLEKTKGSSEETLSCGDVNSFKSLEAR--DACSREHMDDQF
        + +I++A A+ GNKW+ IA+LLPGRTDNAIKNHWNS LRR+  E  +I L   N      L                  + S+  R  +A  +EH+    
Subjt:  DRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGNMVDDASLEKTKGSSEETLSCGDVNSFKSLEAR--DACSREHMDDQF

Query:  EDKVPIAIEGQFSHEVKEQPT-------LFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGHGCCESHN
        E++  +  + +   E KE P        ++RPVAR+ AFSV  P         ++   P +GPLVQAS+PD  A K L+ +  D  +P +CGHGCC +H 
Subjt:  EDKVPIAIEGQFSHEVKEQPT-------LFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGHGCCESHN

Query:  QGSPV--DSLLGPEFVDFSDPPPSFPSFELAAIATDISNLAWLKSGLE
          + +   S+LG EFVD+ +   +    EL +I+ D++N AW++SG E
Subjt:  QGSPV--DSLLGPEFVDFSDPPPSFPSFELAAIATDISNLAWLKSGLE

O23160 Transcription factor MYB73 | 7.9e-41 | 68.18
Query:  RGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCNQLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNH
        R   +R+KGPWSPEED +L RLV K G RNWSLI++ I GRSGKSCRLRWCNQL P V+ + F+ EED  I+ AHA  GNKWA I+RLL GRTDNAIKNH
Subjt:  RGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCNQLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNH

Query:  WNSTLRRRCT
        WNSTL+R+C+
Subjt:  WNSTLRRRCT

Q42575 Transcription factor MYB1 | 7.6e-84 | 52.66
Query:  GDDAVPIVGEGEASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCNQLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAA
        G+DA   VG    + GRG +DRVKGPWS EED +LS LV + GARNWS IAR I GRSGKSCRLRWCNQL+P++ R  FT+ ED+ I+AAHAIHGNKWA 
Subjt:  GDDAVPIVGEGEASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCNQLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAA

Query:  IARLLPGRTDNAIKNHWNSTLRRRCTELERIK-LESGNM-VDDASLEKTK--GSSEETLS----CGDVNSFKSLEARDA-CSREHMDDQFEDKVPIAIEG
        IA+LLPGRTDNAIKNHWNS LRRR  + E+ K + +G++ VDD+  ++T    SSEETLS    C       S E ++A  S E  ++Q  +K     EG
Subjt:  IARLLPGRTDNAIKNHWNSTLRRRCTELERIK-LESGNM-VDDASLEKTK--GSSEETLS----CGDVNSFKSLEARDA-CSREHMDDQFEDKVPIAIEG

Query:  QFSHEVKEQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGHGCCESHNQGS-PVDSLLGPEFVDF
            + K+ PTLFRPV R+S+F+  N ++G  S      P       +Q+SK D    +LLEG Y +R VP  CG GCC ++  GS   +SLLGPEFVD+
Subjt:  QFSHEVKEQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGHGCCESHNQGS-PVDSLLGPEFVDF

Query:  SDPPPSFPSFELAAIATDISNLAWLKSGLENGSEKLLE
         D  P+FPS ELAAIAT+I +LAWL+SGLE+ S +++E
Subjt:  SDPPPSFPSFELAAIATDISNLAWLKSGLENGSEKLLE

Q9FDW1 Transcription factor MYB44 | 1.0e-40 | 70.48
Query:  DRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCNQLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNST
        DR+KGPWSPEED  L RLV K+G RNW++I++ I GRSGKSCRLRWCNQL P V+ +PF+ EED  I  AHA  GNKWA IARLL GRTDNA+KNHWNST
Subjt:  DRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCNQLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNST

Query:  LRRRC
        L+R+C
Subjt:  LRRRC

Q9SN12 Transcription factor MYB77 | 6.7e-40 | 69.81
Query:  DRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCNQLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNST
        DRVKGPWS EED  L R+V K+G RNWS I++ I GRSGKSCRLRWCNQL P V+ +PF+ EED  IV A A  GNKWA IARLL GRTDNA+KNHWNST
Subjt:  DRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCNQLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNST

Query:  LRRRCT
        L+R+C+
Subjt:  LRRRCT

Arabidopsis top hits | e value | % identity | Alignment
AT2G23290.1 myb domain protein 70 | 5.6e-42 | 71.7
Query:  DRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCNQLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNST
        DR+KGPWSPEED +L  LV K G RNWSLI++ I GRSGKSCRLRWCNQL P V+ + FT EED  I+ AHA  GNKWA IARLL GRTDNAIKNHWNST
Subjt:  DRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCNQLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNST

Query:  LRRRCT
        L+R+C+
Subjt:  LRRRCT

AT2G39880.1 myb domain protein 25 | 2.5e-66 | 42.24
Query:  DAVTSVADCGSGDDAVPIVGEGE------ASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCNQLDPSVKRKPFTDEE
        + ++S   C S ++A+    E E      +    GGK +VKGPW PE+D  L+RLV   G RNW+LI+RGI GRSGKSCRLRWCNQLDP +KRKPF+DEE
Subjt:  DAVTSVADCGSGDDAVPIVGEGE------ASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCNQLDPSVKRKPFTDEE

Query:  DRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGNMVDDASLEKTKGSSEETLSCGDVNSFKSLEAR--DACSREHMDDQF
        + +I++A A+ GNKW+ IA+LLPGRTDNAIKNHWNS LRR+  E  +I L   N      L                  + S+  R  +A  +EH+    
Subjt:  DRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGNMVDDASLEKTKGSSEETLSCGDVNSFKSLEAR--DACSREHMDDQF

Query:  EDKVPIAIEGQFSHEVKEQPT-------LFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGHGCCESHN
        E++  +  + +   E KE P        ++RPVAR+ AFSV  P         ++   P +GPLVQAS+PD  A K L+ +  D  +P +CGHGCC +H 
Subjt:  EDKVPIAIEGQFSHEVKEQPT-------LFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGHGCCESHN

Query:  QGSPV--DSLLGPEFVDFSDPPPSFPSFELAAIATDISNLAWLKSGLE
          + +   S+LG EFVD+ +   +    EL +I+ D++N AW++SG E
Subjt:  QGSPV--DSLLGPEFVDFSDPPPSFPSFELAAIATDISNLAWLKSGLE

AT3G09230.1 myb domain protein 1 | 5.4e-85 | 52.66
Query:  GDDAVPIVGEGEASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCNQLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAA
        G+DA   VG    + GRG +DRVKGPWS EED +LS LV + GARNWS IAR I GRSGKSCRLRWCNQL+P++ R  FT+ ED+ I+AAHAIHGNKWA 
Subjt:  GDDAVPIVGEGEASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCNQLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAA

Query:  IARLLPGRTDNAIKNHWNSTLRRRCTELERIK-LESGNM-VDDASLEKTK--GSSEETLS----CGDVNSFKSLEARDA-CSREHMDDQFEDKVPIAIEG
        IA+LLPGRTDNAIKNHWNS LRRR  + E+ K + +G++ VDD+  ++T    SSEETLS    C       S E ++A  S E  ++Q  +K     EG
Subjt:  IARLLPGRTDNAIKNHWNSTLRRRCTELERIK-LESGNM-VDDASLEKTK--GSSEETLS----CGDVNSFKSLEARDA-CSREHMDDQFEDKVPIAIEG

Query:  QFSHEVKEQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGHGCCESHNQGS-PVDSLLGPEFVDF
            + K+ PTLFRPV R+S+F+  N ++G  S      P       +Q+SK D    +LLEG Y +R VP  CG GCC ++  GS   +SLLGPEFVD+
Subjt:  QFSHEVKEQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGHGCCESHNQGS-PVDSLLGPEFVDF

Query:  SDPPPSFPSFELAAIATDISNLAWLKSGLENGSEKLLE
         D  P+FPS ELAAIAT+I +LAWL+SGLE+ S +++E
Subjt:  SDPPPSFPSFELAAIATDISNLAWLKSGLENGSEKLLE

AT3G55730.1 myb domain protein 109 | 2.1e-81 | 49.11
Query:  VADCGSGDDAVPIVGEGEASGGRGG-KDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCNQLDPSVKRKPFTDEEDRIIVAAHAI
        +A+  +GD +    G G   GG GG + +VKGPWS EEDA+L++LV K G RNWSLIARGI GRSGKSCRLRWCNQLDP +KRKPF+DEEDR+I++AHA+
Subjt:  VADCGSGDDAVPIVGEGEASGGRGG-KDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCNQLDPSVKRKPFTDEEDRIIVAAHAI

Query:  HGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGNMVDDASLEK-------TKGSSEETLSCGDVNSF--KSLEARDACSREHMDDQFEDK
        HGNKWA IA+LL GRTDNAIKNHWNSTLRR+  +L        N V  AS++           SS++ L  GD+NS   K  +  D    E  ++  E +
Subjt:  HGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGNMVDDASLEK-------TKGSSEETLSCGDVNSF--KSLEARDACSREHMDDQFEDK

Query:  VPIAIEGQFSHEVKEQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGHGCCESHNQGS-PVDSLL
                    V  +  +FRPVARV AFS+YNP   +   R +   VP +GPL+QA+KPD  A K L+ +  +  +P +CGHGC     +     +S+L
Subjt:  VPIAIEGQFSHEVKEQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGHGCCESHNQGS-PVDSLL

Query:  GPEFVDFSDPPPSFPSFELAAIATDISNLAWLKSGLEN
        GPEFVD+ +P   F + EL +IATD++N+AW+KSGL+N
Subjt:  GPEFVDFSDPPPSFPSFELAAIATDISNLAWLKSGLEN

AT4G37260.1 myb domain protein 73 | 5.6e-42 | 68.18
Query:  RGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCNQLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNH
        R   +R+KGPWSPEED +L RLV K G RNWSLI++ I GRSGKSCRLRWCNQL P V+ + F+ EED  I+ AHA  GNKWA I+RLL GRTDNAIKNH
Subjt:  RGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCNQLDPSVKRKPFTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNH

Query:  WNSTLRRRCT
        WNSTL+R+C+
Subjt:  WNSTLRRRCT


Sequences
CDS sequence
ATGGAGGGCACGGCCGTCGCCGAGGTCGATGCTACCGCCGCTGAAGGAGGAGTAGATGCATGTGATGCCGTTACTTCCGTTGCCGACTGCGGGAGCGGGGACGACGCAGT
GCCGATCGTCGGTGAGGGAGAAGCAAGTGGTGGCCGTGGCGGCAAAGATAGAGTGAAGGGGCCTTGGTCTCCTGAAGAGGATGCGATACTCAGCCGCCTCGTGAGCAAGT
TCGGGGCGAGGAATTGGAGCTTAATTGCTCGGGGAATCGCTGGGCGATCCGGAAAGTCTTGCCGCCTTAGGTGGTGTAATCAACTCGACCCTTCCGTGAAACGCAAGCCA
TTCACCGATGAGGAGGACAGGATCATTGTAGCAGCCCATGCCATACATGGAAATAAGTGGGCAGCAATTGCTAGACTTTTACCTGGGAGAACAGATAATGCTATAAAGAA
TCATTGGAATTCCACTTTACGGCGGCGATGCACAGAGCTTGAAAGAATCAAGTTAGAATCTGGGAACATGGTGGATGATGCTAGTTTAGAAAAAACCAAAGGATCATCTG
AAGAAACCCTTTCATGTGGAGATGTCAATTCCTTTAAATCCTTGGAAGCGAGAGACGCCTGCTCACGGGAGCATATGGATGACCAATTTGAAGACAAAGTACCTATTGCT
ATTGAGGGTCAATTTAGTCATGAAGTCAAAGAACAGCCTACTCTTTTCAGGCCGGTGGCTCGAGTAAGTGCTTTTAGTGTATACAACCCTTTGGATGGGCAAGAGTCTTT
GAGGGCATTTTTACGACCTGTTCCGATGCAAGGACCATTAGTTCAAGCATCAAAACCAGACGTTGAAGCTAGCAAATTGCTTGAAGGTGTCTATGGTGATCGATCAGTGC
CCCATCAATGTGGCCATGGTTGTTGTGAGAGTCATAACCAGGGATCTCCTGTAGACTCTTTGTTAGGGCCAGAATTTGTAGACTTCTCAGATCCTCCACCATCCTTTCCC
AGTTTTGAATTAGCTGCAATTGCAACTGACATAAGTAACCTTGCTTGGCTTAAGAGTGGATTGGAGAATGGCAGTGAGAAACTTTTGGAGTTTACTGGATCTGTACTTAG
TGTTGTGATAACTGATGAACGTTCTTGGAAGCTGTTCCTGATGACTGTTCGATGGAAAGGTATGGTTCTCTTACCTCAGATTTGCCATTTGGGTTTGGAGAGCAGTAGCG
ATGAGTGTGGAATTGGAATTGAATTGGGATGTTCTTGTAAGAACGATTTGGGTGCTGCCCACAAGCATTGTGCTGAGGCTTGGTTCAAAATCAGAGGAAACAAGACGTGT
GAGATATGTCATTCAGTGGTAAGCAATGTCATTGGAGCACATGAAGTTGAACCAGTAGAGCAACTCAGTGAGTCGAACAATGCAACGGTATCGACAACCACCGTTGCGAC
GTCGGTGACCGGTGGTCCCAATGGTGGCCGGAGCTTTTGGCAAGTCCACCGTGTGCTGAATTTTCTTCTGGCTTGTATGAGCATGAAGCAGAGTTGTGGTGAGTGGGTGG
TTGGAGCTCCATTTACAGATATCTCCGAACTCTTACTACCAAGGTAA
mRNA sequence
ATGGAGGGCACGGCCGTCGCCGAGGTCGATGCTACCGCCGCTGAAGGAGGAGTAGATGCATGTGATGCCGTTACTTCCGTTGCCGACTGCGGGAGCGGGGACGACGCAGT
GCCGATCGTCGGTGAGGGAGAAGCAAGTGGTGGCCGTGGCGGCAAAGATAGAGTGAAGGGGCCTTGGTCTCCTGAAGAGGATGCGATACTCAGCCGCCTCGTGAGCAAGT
TCGGGGCGAGGAATTGGAGCTTAATTGCTCGGGGAATCGCTGGGCGATCCGGAAAGTCTTGCCGCCTTAGGTGGTGTAATCAACTCGACCCTTCCGTGAAACGCAAGCCA
TTCACCGATGAGGAGGACAGGATCATTGTAGCAGCCCATGCCATACATGGAAATAAGTGGGCAGCAATTGCTAGACTTTTACCTGGGAGAACAGATAATGCTATAAAGAA
TCATTGGAATTCCACTTTACGGCGGCGATGCACAGAGCTTGAAAGAATCAAGTTAGAATCTGGGAACATGGTGGATGATGCTAGTTTAGAAAAAACCAAAGGATCATCTG
AAGAAACCCTTTCATGTGGAGATGTCAATTCCTTTAAATCCTTGGAAGCGAGAGACGCCTGCTCACGGGAGCATATGGATGACCAATTTGAAGACAAAGTACCTATTGCT
ATTGAGGGTCAATTTAGTCATGAAGTCAAAGAACAGCCTACTCTTTTCAGGCCGGTGGCTCGAGTAAGTGCTTTTAGTGTATACAACCCTTTGGATGGGCAAGAGTCTTT
GAGGGCATTTTTACGACCTGTTCCGATGCAAGGACCATTAGTTCAAGCATCAAAACCAGACGTTGAAGCTAGCAAATTGCTTGAAGGTGTCTATGGTGATCGATCAGTGC
CCCATCAATGTGGCCATGGTTGTTGTGAGAGTCATAACCAGGGATCTCCTGTAGACTCTTTGTTAGGGCCAGAATTTGTAGACTTCTCAGATCCTCCACCATCCTTTCCC
AGTTTTGAATTAGCTGCAATTGCAACTGACATAAGTAACCTTGCTTGGCTTAAGAGTGGATTGGAGAATGGCAGTGAGAAACTTTTGGAGTTTACTGGATCTGTACTTAG
TGTTGTGATAACTGATGAACGTTCTTGGAAGCTGTTCCTGATGACTGTTCGATGGAAAGGTATGGTTCTCTTACCTCAGATTTGCCATTTGGGTTTGGAGAGCAGTAGCG
ATGAGTGTGGAATTGGAATTGAATTGGGATGTTCTTGTAAGAACGATTTGGGTGCTGCCCACAAGCATTGTGCTGAGGCTTGGTTCAAAATCAGAGGAAACAAGACGTGT
GAGATATGTCATTCAGTGGTAAGCAATGTCATTGGAGCACATGAAGTTGAACCAGTAGAGCAACTCAGTGAGTCGAACAATGCAACGGTATCGACAACCACCGTTGCGAC
GTCGGTGACCGGTGGTCCCAATGGTGGCCGGAGCTTTTGGCAAGTCCACCGTGTGCTGAATTTTCTTCTGGCTTGTATGAGCATGAAGCAGAGTTGTGGTGAGTGGGTGG
TTGGAGCTCCATTTACAGATATCTCCGAACTCTTACTACCAAGGTAA
Protein sequence
MEGTAVAEVDATAAEGGVDACDAVTSVADCGSGDDAVPIVGEGEASGGRGGKDRVKGPWSPEEDAILSRLVSKFGARNWSLIARGIAGRSGKSCRLRWCNQLDPSVKRKP
FTDEEDRIIVAAHAIHGNKWAAIARLLPGRTDNAIKNHWNSTLRRRCTELERIKLESGNMVDDASLEKTKGSSEETLSCGDVNSFKSLEARDACSREHMDDQFEDKVPIA
IEGQFSHEVKEQPTLFRPVARVSAFSVYNPLDGQESLRAFLRPVPMQGPLVQASKPDVEASKLLEGVYGDRSVPHQCGHGCCESHNQGSPVDSLLGPEFVDFSDPPPSFP
SFELAAIATDISNLAWLKSGLENGSEKLLEFTGSVLSVVITDERSWKLFLMTVRWKGMVLLPQICHLGLESSSDECGIGIELGCSCKNDLGAAHKHCAEAWFKIRGNKTC
EICHSVVSNVIGAHEVEPVEQLSESNNATVSTTTVATSVTGGPNGGRSFWQVHRVLNFLLACMSMKQSCGEWVVGAPFTDISELLLPR
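
The protein sequence above corresponds to the translation of the CDS. As a minimal sketch of how that correspondence could be spot-checked, assuming the standard genetic code (an assumption, since this page does not name the translation table) and using the hypothetical names CODON_TABLE and translate, the first 30 and last 21 nucleotides of the CDS can be translated and compared with the ends of the protein sequence:

```python
from itertools import product

# Standard genetic code, written in the conventional TCAG ordering
# (third base varies fastest). Assumption: the CDS uses this table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 21 nucleotides copied from the CDS sequence above.
cds_start = "ATGGAGGGCACGGCCGTCGCCGAGGTCGAT"
cds_end = "GAACTCTTACTACCAAGGTAA"

print(translate(cds_start))  # MEGTAVAEVD
print(translate(cds_end))    # ELLLPR*
```

Both fragments agree with the start (MEGTAVAEVD) and end (ELLLPR) of the listed protein sequence, with the final TAA read as the stop codon.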