; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g01980 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g01980
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptiontranscription factor MYB119-like
Genome locationchr3:1484110..1486274
RNA-Seq ExpressionMoc03g01980
SyntenyMoc03g01980
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583375.1 Transcription factor MYB119, partial [Cucurbita argyrosperma subsp. sororia]1.3e-14167.29Show/hide
Query:  PSGPPLAAIDRFLYGHQLLHSCDDVDVI-GGFPPDQSYCGGEW---------------------RRVEEEEEEEEIMYSWGRSGANCNYELMMENEGKII
        PSGPPLAAIDRFLY H    +C +  ++ GG  P++  CGGEW                       +EEEEEE+  MY WGR        +   +EGKI 
Subjt:  PSGPPLAAIDRFLYGHQLLHSCDDVDVI-GGFPPDQSYCGGEW---------------------RRVEEEEEEEEIMYSWGRSGANCNYELMMENEGKII

Query:  SKGKKFKKGSSANLIKGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIAKSIPGR
        SK KK KK SSA LIKGQWTEEEDRKLIRLVKQ+GVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEE ILVETHA+VGNRWAEIAKSIPGR
Subjt:  SKGKKFKKGSSANLIKGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIAKSIPGR

Query:  TENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIKSKHNIPA-PATATI---VSDDPSSHFNHFFPESSDSTYNLSPAIISSPTYDDELLFMQ
        TENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIK+K+N  A PAT+T+   VS+DPSSHFNHFF ESSDST NLS AIISSP YDDELLFMQ
Subjt:  TENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIKSKHNIPA-PATATI---VSDDPSSHFNHFFPESSDSTYNLSPAIISSPTYDDELLFMQ

Query:  NFFSNSTNP-PPAHDAPMTVVDNQPAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSCHLYSDLYLSYLLNGAANSSDGD-ELQNMMEMAEMEAAAA
        NFFSNS    P   D    VV NQ  AEF S+DSE K EK KV           +    S HLYSD+Y+SYLLNG  NSS GD E+QN +EMAE++   A
Subjt:  NFFSNSTNP-PPAHDAPMTVVDNQPAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSCHLYSDLYLSYLLNGAANSSDGD-ELQNMMEMAEMEAAAA

Query:  AGKGQWQSSQEGKRELDLIEMLSFHCYS
        AG+GQW++SQ+GKRE+DL+EM+SFHCYS
Subjt:  AGKGQWQSSQEGKRELDLIEMLSFHCYS

KAG7019144.1 Transcription factor MYB98, partial [Cucurbita argyrosperma subsp. argyrosperma]1.2e-14267.6Show/hide
Query:  PPSGPPLAAIDRFLYGHQLLHSCDDVDVI-GGFPPDQSYCGGEW---------------------RRVEEEEEEEEIMYSWGRSGANCNYELMMENEGKI
        PPSGPPLAAIDRFLY H    +C +  ++ GG  P++  CGGEW                       +EEEEEE+  MY WGR        +   +EGKI
Subjt:  PPSGPPLAAIDRFLYGHQLLHSCDDVDVI-GGFPPDQSYCGGEW---------------------RRVEEEEEEEEIMYSWGRSGANCNYELMMENEGKI

Query:  ISKGKKFKKGSSANLIKGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIAKSIPG
         SK KK KK SSA LIKGQWTEEEDRKLIRLVKQ+GVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEE ILVETHA+VGNRWAEIAKSIPG
Subjt:  ISKGKKFKKGSSANLIKGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIAKSIPG

Query:  RTENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIKSKHNIPA-PATATI---VSDDPSSHFNHFFPESSDSTYNLSPAIISSPTYDDELLFM
        RTENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIK+K+N  A PAT+T+   VS+DPSSHFNHFF ESSDST NLS AIISSP YDDELLFM
Subjt:  RTENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIKSKHNIPA-PATATI---VSDDPSSHFNHFFPESSDSTYNLSPAIISSPTYDDELLFM

Query:  QNFFSNSTNP-PPAHDAPMTVVDNQPAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSCHLYSDLYLSYLLNGAANSSDGD-ELQNMMEMAEMEAAA
        QNFFSNS    P   D    VV NQ  AEF S+DSE K EK KV           +    S HLYSD+Y+SYLLNG  NSS GD E+QN +EMAE++   
Subjt:  QNFFSNSTNP-PPAHDAPMTVVDNQPAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSCHLYSDLYLSYLLNGAANSSDGD-ELQNMMEMAEMEAAA

Query:  AAGKGQWQSSQEGKRELDLIEMLSFHCYS
        AAG+GQW++SQEGKRE+DL+EM+SFHCYS
Subjt:  AAGKGQWQSSQEGKRELDLIEMLSFHCYS

XP_008457563.1 PREDICTED: protein ODORANT1 [Cucumis melo]2.4e-14368.36Show/hide
Query:  PPSGPPLAAIDRFLYGHQLLHSCDDVDVIGGFPPD--QSYCGGEW------RRVE----------EEEEEEEIMYSWGR-SGANCNY---ELMME----N
        PPS PPLAAIDRFLY H +  +C    +IGG  P+      GGEW      RRVE          EEEEEEEIMY WGR +  NC     E+M+     N
Subjt:  PPSGPPLAAIDRFLYGHQLLHSCDDVDVIGGFPPD--QSYCGGEW------RRVE----------EEEEEEEIMYSWGR-SGANCNY---ELMME----N

Query:  EGKII-SKGKKFKKGSSANLIKGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIA
        E K+  SK KKFKK SSA+LIKGQWTEEEDRKL RLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHA+VGNRWAEIA
Subjt:  EGKII-SKGKKFKKGSSANLIKGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIA

Query:  KSIPGRTENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIKSKHNIPAPAT--ATIVSDDPSSHFNHFFPESSDSTYNLSPAIISSPTYDDEL
        KSIPGRTENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIKSK+N  A  T  A+  SDDPSSHFNHFF ESSDST NLS AIISSPTYDDEL
Subjt:  KSIPGRTENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIKSKHNIPAPAT--ATIVSDDPSSHFNHFFPESSDSTYNLSPAIISSPTYDDEL

Query:  LFMQNFFSNSTN--PPPAHDAPMTVVDNQPAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSCHLYSDLYLSYLLNGAANSS--DGDELQNMMEMAE
        LFMQNFFSNS++  P P+ D   T+  NQ   EF S+DSE K E+ K+ DE  ++ R       S HLYSD+YLSYLLNG  N++   G E+QN   MAE
Subjt:  LFMQNFFSNSTN--PPPAHDAPMTVVDNQPAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSCHLYSDLYLSYLLNGAANSS--DGDELQNMMEMAE

Query:  MEAAAAAGKGQWQSSQEGKRELDLIEMLSFHCY
        ++  AAAG+G W++SQ+GKRE+DL+EMLSFHCY
Subjt:  MEAAAAAGKGQWQSSQEGKRELDLIEMLSFHCY

XP_022150911.1 transcription factor MYB119-like [Momordica charantia]6.1e-23299.75Show/hide
Query:  MQPPSGPPLAAIDRFLYGHQLLHSCDDVDVIGGFPPDQSYCGGEWRRVEEEEEEEEIMYSWGRSGANCNYELMMENEGKIISKGKKFKKGSSANLIKGQW
        MQPPSGPPLAAIDRFLYGHQLLHSCDDVDVIGGFPPDQSYCGGEWRRV EEEEEEEIMYSWGRSGANCNYELMMENEGKIISKGKKFKKGSSANLIKGQW
Subjt:  MQPPSGPPLAAIDRFLYGHQLLHSCDDVDVIGGFPPDQSYCGGEWRRVEEEEEEEEIMYSWGRSGANCNYELMMENEGKIISKGKKFKKGSSANLIKGQW

Query:  TEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIAKSIPGRTENAIKNHWNATKRRQNSR
        TEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIAKSIPGRTENAIKNHWNATKRRQNSR
Subjt:  TEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIAKSIPGRTENAIKNHWNATKRRQNSR

Query:  RKNKRPNSQNGKPHSSILQDYIKSKHNIPAPATATIVSDDPSSHFNHFFPESSDSTYNLSPAIISSPTYDDELLFMQNFFSNSTNPPPAHDAPMTVVDNQ
        RKNKRPNSQNGKPHSSILQDYIKSKHNIPAPATATIVSDDPSSHFNHFFPESSDSTYNLSPAIISSPTYDDELLFMQNFFSNSTNPPPAHDAPMTVVDNQ
Subjt:  RKNKRPNSQNGKPHSSILQDYIKSKHNIPAPATATIVSDDPSSHFNHFFPESSDSTYNLSPAIISSPTYDDELLFMQNFFSNSTNPPPAHDAPMTVVDNQ

Query:  PAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSCHLYSDLYLSYLLNGAANSSDGDELQNMMEMAEMEAAAAAGKGQWQSSQEGKRELDLIEMLSFH
        PAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSCHLYSDLYLSYLLNGAANSSDGDELQNMMEMAEMEAAAAAGKGQWQSSQEGKRELDLIEMLSFH
Subjt:  PAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSCHLYSDLYLSYLLNGAANSSDGDELQNMMEMAEMEAAAAAGKGQWQSSQEGKRELDLIEMLSFH

Query:  CYS
        CYS
Subjt:  CYS

XP_022970380.1 transcription factor MYB64-like [Cucurbita maxima]9.3e-14066.36Show/hide
Query:  PPSGPPLAAIDRFLYGHQLLHSCDDVDVIGGFPPDQSYCGGEW---------------------RRVEEEEEEEEIMYSWG-RSGANCNYEL-----MME
        PPSGPPLAAIDRFLY H +  +C +  ++GG    +S CGGEW                       +EEEEEE E MY WG R    C   L        
Subjt:  PPSGPPLAAIDRFLYGHQLLHSCDDVDVIGGFPPDQSYCGGEW---------------------RRVEEEEEEEEIMYSWG-RSGANCNYEL-----MME

Query:  NEGKIISKGKKFKKGSSANLIKGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIA
        +EGKI SK KK KK SSA LIKGQWTEEEDRKL RLVKQ+GVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEE ILVETHA+VGNRWAEIA
Subjt:  NEGKIISKGKKFKKGSSANLIKGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIA

Query:  KSIPGRTENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIKSKHNIPA-PATATI---VSDDPSSHFNHFFPESSDSTYNLSPAIISSPTYDD
        KSIPGRTENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIKSK N  A  AT+++   VS+DPSSHFNHFF ESSDST NLS AIISSP YDD
Subjt:  KSIPGRTENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIKSKHNIPA-PATATI---VSDDPSSHFNHFFPESSDSTYNLSPAIISSPTYDD

Query:  ELLFMQNFFSNSTN-PPPAHDAPMTVVDNQPAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSCHLYSDLYLSYLLNGAANSSDGD-ELQNMMEMAE
        ELLFMQNFFSNS    P   D    V  NQ  AEF S+DSE K+E  K+           +    S HLYSD+Y+SYLLNG  NS  GD E+QN +EMAE
Subjt:  ELLFMQNFFSNSTN-PPPAHDAPMTVVDNQPAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSCHLYSDLYLSYLLNGAANSSDGD-ELQNMMEMAE

Query:  MEAAAAAGKGQWQSSQEGKRELDLIEMLSFHCYS
        ++   AAG+GQW++SQEGKRE+DL+EMLSFHCYS
Subjt:  MEAAAAAGKGQWQSSQEGKRELDLIEMLSFHCYS

TrEMBL top hitse value%identityAlignment
A0A0A0LVH8 Uncharacterized protein2.2e-13466.14Show/hide
Query:  PPSG-PPLAAIDRFLYGHQLL-HSCDDVDVIGGFPPD-----QSYCGGEW------RRV-----------EEEEEEEEIMYSWGRSGAN---CNYELMM-
        PPS  PPLAAIDRFLY H +  ++C +  +IGG  P+         GGEW      RRV           EEEEEEEE+ Y WGR   N     +E MM 
Subjt:  PPSG-PPLAAIDRFLYGHQLL-HSCDDVDVIGGFPPD-----QSYCGGEW------RRV-----------EEEEEEEEIMYSWGRSGAN---CNYELMM-

Query:  ----ENEGKII-SKGKKFKKGSSANLIKGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGN
             NE KI  SK KKFKK SSANLIKGQWTEEED    RLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHA+VGN
Subjt:  ----ENEGKII-SKGKKFKKGSSANLIKGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGN

Query:  RWAEIAKSIPGRTENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIKSKHNIPAPAT--ATIVSDDPSSHFNHFFPESSDSTYNLSPAIISSP
        RWAEIAKSIPGRTENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIKSK+N  A  T   +  SDDPSSHFNHFF ESSDST NLS AIISSP
Subjt:  RWAEIAKSIPGRTENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIKSKHNIPAPAT--ATIVSDDPSSHFNHFFPESSDSTYNLSPAIISSP

Query:  TYDDELLFMQNFFSNST---NPPPAHDAPMTVVDNQPAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSCHLYSDLYLSYLLNGAANSS--DGDELQ
        TYDDELLFMQNFFSNS+   + P A D     + NQ   EF S+DSE K E+ K GD+        N AT S HL SD+YLSYLLNG  N+S   G E+Q
Subjt:  TYDDELLFMQNFFSNST---NPPPAHDAPMTVVDNQPAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSCHLYSDLYLSYLLNGAANSS--DGDELQ

Query:  NMMEMAEMEAAAAAGKGQWQSSQEGKRELDLIEMLSFHCY
        N   MAE++  AAAG+G W++SQ+ K+E+DL+EMLSFHCY
Subjt:  NMMEMAEMEAAAAAGKGQWQSSQEGKRELDLIEMLSFHCY

A0A1S3C5S6 protein ODORANT11.2e-14368.36Show/hide
Query:  PPSGPPLAAIDRFLYGHQLLHSCDDVDVIGGFPPD--QSYCGGEW------RRVE----------EEEEEEEIMYSWGR-SGANCNY---ELMME----N
        PPS PPLAAIDRFLY H +  +C    +IGG  P+      GGEW      RRVE          EEEEEEEIMY WGR +  NC     E+M+     N
Subjt:  PPSGPPLAAIDRFLYGHQLLHSCDDVDVIGGFPPD--QSYCGGEW------RRVE----------EEEEEEEIMYSWGR-SGANCNY---ELMME----N

Query:  EGKII-SKGKKFKKGSSANLIKGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIA
        E K+  SK KKFKK SSA+LIKGQWTEEEDRKL RLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHA+VGNRWAEIA
Subjt:  EGKII-SKGKKFKKGSSANLIKGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIA

Query:  KSIPGRTENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIKSKHNIPAPAT--ATIVSDDPSSHFNHFFPESSDSTYNLSPAIISSPTYDDEL
        KSIPGRTENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIKSK+N  A  T  A+  SDDPSSHFNHFF ESSDST NLS AIISSPTYDDEL
Subjt:  KSIPGRTENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIKSKHNIPAPAT--ATIVSDDPSSHFNHFFPESSDSTYNLSPAIISSPTYDDEL

Query:  LFMQNFFSNSTN--PPPAHDAPMTVVDNQPAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSCHLYSDLYLSYLLNGAANSS--DGDELQNMMEMAE
        LFMQNFFSNS++  P P+ D   T+  NQ   EF S+DSE K E+ K+ DE  ++ R       S HLYSD+YLSYLLNG  N++   G E+QN   MAE
Subjt:  LFMQNFFSNSTN--PPPAHDAPMTVVDNQPAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSCHLYSDLYLSYLLNGAANSS--DGDELQNMMEMAE

Query:  MEAAAAAGKGQWQSSQEGKRELDLIEMLSFHCY
        ++  AAAG+G W++SQ+GKRE+DL+EMLSFHCY
Subjt:  MEAAAAAGKGQWQSSQEGKRELDLIEMLSFHCY

A0A6J1DCW7 transcription factor MYB119-like3.0e-23299.75Show/hide
Query:  MQPPSGPPLAAIDRFLYGHQLLHSCDDVDVIGGFPPDQSYCGGEWRRVEEEEEEEEIMYSWGRSGANCNYELMMENEGKIISKGKKFKKGSSANLIKGQW
        MQPPSGPPLAAIDRFLYGHQLLHSCDDVDVIGGFPPDQSYCGGEWRRV EEEEEEEIMYSWGRSGANCNYELMMENEGKIISKGKKFKKGSSANLIKGQW
Subjt:  MQPPSGPPLAAIDRFLYGHQLLHSCDDVDVIGGFPPDQSYCGGEWRRVEEEEEEEEIMYSWGRSGANCNYELMMENEGKIISKGKKFKKGSSANLIKGQW

Query:  TEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIAKSIPGRTENAIKNHWNATKRRQNSR
        TEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIAKSIPGRTENAIKNHWNATKRRQNSR
Subjt:  TEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIAKSIPGRTENAIKNHWNATKRRQNSR

Query:  RKNKRPNSQNGKPHSSILQDYIKSKHNIPAPATATIVSDDPSSHFNHFFPESSDSTYNLSPAIISSPTYDDELLFMQNFFSNSTNPPPAHDAPMTVVDNQ
        RKNKRPNSQNGKPHSSILQDYIKSKHNIPAPATATIVSDDPSSHFNHFFPESSDSTYNLSPAIISSPTYDDELLFMQNFFSNSTNPPPAHDAPMTVVDNQ
Subjt:  RKNKRPNSQNGKPHSSILQDYIKSKHNIPAPATATIVSDDPSSHFNHFFPESSDSTYNLSPAIISSPTYDDELLFMQNFFSNSTNPPPAHDAPMTVVDNQ

Query:  PAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSCHLYSDLYLSYLLNGAANSSDGDELQNMMEMAEMEAAAAAGKGQWQSSQEGKRELDLIEMLSFH
        PAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSCHLYSDLYLSYLLNGAANSSDGDELQNMMEMAEMEAAAAAGKGQWQSSQEGKRELDLIEMLSFH
Subjt:  PAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSCHLYSDLYLSYLLNGAANSSDGDELQNMMEMAEMEAAAAAGKGQWQSSQEGKRELDLIEMLSFH

Query:  CYS
        CYS
Subjt:  CYS

A0A6J1HLU2 transcription factor MYB64-like1.1e-13865.66Show/hide
Query:  PPSGPPLAAIDRFLYGHQLLHSCDDVDVIGGFPPDQSYCGGEW------------------------RRVEEEEEEEEIMYSWGRSGANCNYELMMENEG
        PPSGPPLAAIDRFLY H +  +C +  ++GG    +S CGGEW                          +EE EEE+  MY WGR        +   +E 
Subjt:  PPSGPPLAAIDRFLYGHQLLHSCDDVDVIGGFPPDQSYCGGEW------------------------RRVEEEEEEEEIMYSWGRSGANCNYELMMENEG

Query:  KIISKGKKFKKGSSANLIKGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIAKSI
        KI SK KK KK SSA LIKGQWTEEEDRKLIRLVKQ+GVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEE ILVETHA+VGNRWAEIAKSI
Subjt:  KIISKGKKFKKGSSANLIKGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIAKSI

Query:  PGRTENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIKSKHN-IPAPATATI---VSDDPSSHFNHFFPESSDSTYNLSPAIISSPTYDDELL
        PGRTENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIK+K+N    PAT+T+   VS+DPSSHFNHFF ESSDST NLS AIISSP YDDELL
Subjt:  PGRTENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIKSKHN-IPAPATATI---VSDDPSSHFNHFFPESSDSTYNLSPAIISSPTYDDELL

Query:  FMQNFFSNSTNP-PPAHDAPMTVVDNQPAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSCHLYSDLYLSYLLNGAANSSDGD-ELQNMMEMAEMEA
        FMQNFFSNS    P   D    VV NQ  AEF S+DSE K EK KV           +    S HLYSD+Y+SYLLNG  NS   D E+   +EMAE++ 
Subjt:  FMQNFFSNSTNP-PPAHDAPMTVVDNQPAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSCHLYSDLYLSYLLNGAANSSDGD-ELQNMMEMAEMEA

Query:  AAAAGKGQWQSSQEGKRELDLIEMLSFHCYS
          AAG+GQW++S+EGKRE+DLIEM+SFHCYS
Subjt:  AAAAGKGQWQSSQEGKRELDLIEMLSFHCYS

A0A6J1I2P9 transcription factor MYB64-like4.5e-14066.36Show/hide
Query:  PPSGPPLAAIDRFLYGHQLLHSCDDVDVIGGFPPDQSYCGGEW---------------------RRVEEEEEEEEIMYSWG-RSGANCNYEL-----MME
        PPSGPPLAAIDRFLY H +  +C +  ++GG    +S CGGEW                       +EEEEEE E MY WG R    C   L        
Subjt:  PPSGPPLAAIDRFLYGHQLLHSCDDVDVIGGFPPDQSYCGGEW---------------------RRVEEEEEEEEIMYSWG-RSGANCNYEL-----MME

Query:  NEGKIISKGKKFKKGSSANLIKGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIA
        +EGKI SK KK KK SSA LIKGQWTEEEDRKL RLVKQ+GVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEE ILVETHA+VGNRWAEIA
Subjt:  NEGKIISKGKKFKKGSSANLIKGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIA

Query:  KSIPGRTENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIKSKHNIPA-PATATI---VSDDPSSHFNHFFPESSDSTYNLSPAIISSPTYDD
        KSIPGRTENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIKSK N  A  AT+++   VS+DPSSHFNHFF ESSDST NLS AIISSP YDD
Subjt:  KSIPGRTENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIKSKHNIPA-PATATI---VSDDPSSHFNHFFPESSDSTYNLSPAIISSPTYDD

Query:  ELLFMQNFFSNSTN-PPPAHDAPMTVVDNQPAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSCHLYSDLYLSYLLNGAANSSDGD-ELQNMMEMAE
        ELLFMQNFFSNS    P   D    V  NQ  AEF S+DSE K+E  K+           +    S HLYSD+Y+SYLLNG  NS  GD E+QN +EMAE
Subjt:  ELLFMQNFFSNSTN-PPPAHDAPMTVVDNQPAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSCHLYSDLYLSYLLNGAANSSDGD-ELQNMMEMAE

Query:  MEAAAAAGKGQWQSSQEGKRELDLIEMLSFHCYS
        ++   AAG+GQW++SQEGKRE+DL+EMLSFHCYS
Subjt:  MEAAAAAGKGQWQSSQEGKRELDLIEMLSFHCYS

SwissProt top hitse value%identityAlignment
Q1PDP9 Transcription factor MYB1151.2e-3657.81Show/hide
Query:  KGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIAKSIPGRTENAIKNHWNATKRR
        KGQWT  ED  L+R+VK  G + W  IA+  +GR GKQCRERWHNHLRP+IKK  WSEEE++IL+E H  VGN+W EIAK +PGR+EN +KNHWNATKRR
Subjt:  KGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIAKSIPGRTENAIKNHWNATKRR

Query:  QNSRRKNKRPNSQNGKPHSSILQDYIKS
         +S R  +   S    P ++ L++YI+S
Subjt:  QNSRRKNKRPNSQNGKPHSSILQDYIKS

Q9FIM4 Transcription factor MYB1191.0e-6440.28Show/hide
Query:  SGPPLAAIDRFLYGHQLLHSCDDVDVIGGFPPDQSYCGGEWRRVEEEEEEEEIMYSWGR------SGANCNYELMMENEGKIISKG------KKFKKGSS
        + PPL A++RFLYG +    C           DQ     +       + +E   +   R      +G N N        G+++++          KK SS
Subjt:  SGPPLAAIDRFLYGHQLLHSCDDVDVIGGFPPDQSYCGGEWRRVEEEEEEEEIMYSWGR------SGANCNYELMMENEGKIISKG------KKFKKGSS

Query:  ANLIKGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIAKSIPGRTENAIKNHWNA
         NLIKGQWT EEDRKLIRLV+QHG RKWA I+EKLEGRAGKQCRERWHNHLRPDIKK+ WSEEEER+LVE+H ++GN+WAEIAK IPGRTEN+IKNHWNA
Subjt:  ANLIKGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIAKSIPGRTENAIKNHWNA

Query:  TKRRQNSRRKNKRP-----NSQNGKPHSS---ILQDYIKS--KHNI----PAPATATIVSDDPSSHFNHFFPESSDSTYNLSPAIISSPTYDDELLFMQN
        TKRRQNS+RK+KR      N ++  P +    ILQDYIKS  ++NI            +S   + + +  + +   ++     +I+  P YD+EL + QN
Subjt:  TKRRQNSRRKNKRP-----NSQNGKPHSS---ILQDYIKS--KHNI----PAPATATIVSDDPSSHFNHFFPESSDSTYNLSPAIISSPTYDDELLFMQN

Query:  FFSNSTNPPPAHDAPMTVVDNQPAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSC---HLYSDLYLSYLLNGAANSSDGDELQNMMEMAEMEAAAA
         F+N  +P    +  ++   ++   +  S    +K     + D V  +H +  T T      HL SD+YLSYLLNG  +S       +            
Subjt:  FFSNSTNPPPAHDAPMTVVDNQPAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSC---HLYSDLYLSYLLNGAANSSDGDELQNMMEMAEMEAAAA

Query:  AGKGQW----QSSQEGKRELDLIEMLS
         G  ++     +S   +RE+DLIEMLS
Subjt:  AGKGQW----QSSQEGKRELDLIEMLS

Q9FY60 Transcription factor MYB642.0e-6040.81Show/hide
Query:  SGPPLAAIDRFLYGHQLLHSCDDVDVIGGFPPDQSYCGGEWRRVEEEEEEEEIMYSWGRSGANCNY--ELMMENEGKIISKGKKFKKGSSANLIKGQWTE
        + PPL A++RFL G +    C          P         R +E   E +E M    R   N     E++++   K  +     KK    N+IKGQWT 
Subjt:  SGPPLAAIDRFLYGHQLLHSCDDVDVIGGFPPDQSYCGGEWRRVEEEEEEEEIMYSWGRSGANCNY--ELMMENEGKIISKGKKFKKGSSANLIKGQWTE

Query:  EEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIAKSIPGRTENAIKNHWNATKRRQNSRRK
        +EDRKLI+LV QHG RKWA I+EKLEGRAGKQCRERWHNHLRPDIKK+SWSEEEER+LVE H ++GN+WAEIAK I GRTEN+IKNHWNATKRRQNS+RK
Subjt:  EEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIAKSIPGRTENAIKNHWNATKRRQNSRRK

Query:  NKRPNSQNGKPHSS------------ILQDYIKSKHNIPAPATATIVSDD-----PSSHFNHFFPESSDSTYNLSPAIISSPTYDDELLFMQNFFSNST-
        +KR  S+N   +S             IL+DYIK+  N        I++        +S+++ F  E S S      +++  P YD+EL+F++N F N + 
Subjt:  NKRPNSQNGKPHSS------------ILQDYIKSKHNIPAPATATIVSDD-----PSSHFNHFFPESSDSTYNLSPAIISSPTYDDELLFMQNFFSNST-

Query:  -NPPPAHDAPMTVVDNQPAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSCHLYSDLYLSYLLNGAANSSDGDELQNMMEMAEMEAAAAAGKGQ---
         N   +    +T    Q ++    I++      L        L         S HL SD+YLS LLNG A+SS      + +          AG+ +   
Subjt:  -NPPPAHDAPMTVVDNQPAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSCHLYSDLYLSYLLNGAANSSDGDELQNMMEMAEMEAAAAAGKGQ---

Query:  -WQSSQEGKRELDLIEMLS
           +S   +RE+DLIEMLS
Subjt:  -WQSSQEGKRELDLIEMLS

Q9LVW4 Transcription factor MYB1181.3e-4356.79Show/hide
Query:  NYELMMENEGKIISKGKKF------KKGSSANLIKGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVE
        N ++M++ E +I  K K+F      K    A++IKGQWT EED+ L++LV  HG +KW+QIA+ L+GR GKQCRERWHNHLRPDIKK+ W+EEE+ IL++
Subjt:  NYELMMENEGKIISKGKKF------KKGSSANLIKGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVE

Query:  THAKVGNRWAEIAKSIPGRTENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIKS
         H ++GNRWAEIA+ +PGRTEN IKNHWNATKRRQ+SRR  K  +  +    S+ LQ+YI+S
Subjt:  THAKVGNRWAEIAKSIPGRTENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIKS

Q9S7L2 Transcription factor MYB981.3e-4663.64Show/hide
Query:  SKGKKFKKGSSANLIKGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIAKSIPGR
        S    +K+   + L+KGQWT EEDR LI+LV+++G+RKW+ IA+ L GR GKQCRERWHNHLRPDIKKE+WSEEE+R+L+E H ++GN+WAEIAK +PGR
Subjt:  SKGKKFKKGSSANLIKGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIAKSIPGR

Query:  TENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIKS
        TEN+IKNHWNATKRRQ S+RK      ++  P  S+LQDYIKS
Subjt:  TENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIKS

Arabidopsis top hitse value%identityAlignment
AT3G27785.1 myb domain protein 1189.3e-4556.79Show/hide
Query:  NYELMMENEGKIISKGKKF------KKGSSANLIKGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVE
        N ++M++ E +I  K K+F      K    A++IKGQWT EED+ L++LV  HG +KW+QIA+ L+GR GKQCRERWHNHLRPDIKK+ W+EEE+ IL++
Subjt:  NYELMMENEGKIISKGKKF------KKGSSANLIKGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVE

Query:  THAKVGNRWAEIAKSIPGRTENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIKS
         H ++GNRWAEIA+ +PGRTEN IKNHWNATKRRQ+SRR  K  +  +    S+ LQ+YI+S
Subjt:  THAKVGNRWAEIAKSIPGRTENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIKS

AT4G18770.1 myb domain protein 988.9e-4863.64Show/hide
Query:  SKGKKFKKGSSANLIKGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIAKSIPGR
        S    +K+   + L+KGQWT EEDR LI+LV+++G+RKW+ IA+ L GR GKQCRERWHNHLRPDIKKE+WSEEE+R+L+E H ++GN+WAEIAK +PGR
Subjt:  SKGKKFKKGSSANLIKGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIAKSIPGR

Query:  TENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIKS
        TEN+IKNHWNATKRRQ S+RK      ++  P  S+LQDYIKS
Subjt:  TENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQDYIKS

AT5G11050.1 myb domain protein 641.4e-6140.81Show/hide
Query:  SGPPLAAIDRFLYGHQLLHSCDDVDVIGGFPPDQSYCGGEWRRVEEEEEEEEIMYSWGRSGANCNY--ELMMENEGKIISKGKKFKKGSSANLIKGQWTE
        + PPL A++RFL G +    C          P         R +E   E +E M    R   N     E++++   K  +     KK    N+IKGQWT 
Subjt:  SGPPLAAIDRFLYGHQLLHSCDDVDVIGGFPPDQSYCGGEWRRVEEEEEEEEIMYSWGRSGANCNY--ELMMENEGKIISKGKKFKKGSSANLIKGQWTE

Query:  EEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIAKSIPGRTENAIKNHWNATKRRQNSRRK
        +EDRKLI+LV QHG RKWA I+EKLEGRAGKQCRERWHNHLRPDIKK+SWSEEEER+LVE H ++GN+WAEIAK I GRTEN+IKNHWNATKRRQNS+RK
Subjt:  EEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIAKSIPGRTENAIKNHWNATKRRQNSRRK

Query:  NKRPNSQNGKPHSS------------ILQDYIKSKHNIPAPATATIVSDD-----PSSHFNHFFPESSDSTYNLSPAIISSPTYDDELLFMQNFFSNST-
        +KR  S+N   +S             IL+DYIK+  N        I++        +S+++ F  E S S      +++  P YD+EL+F++N F N + 
Subjt:  NKRPNSQNGKPHSS------------ILQDYIKSKHNIPAPATATIVSDD-----PSSHFNHFFPESSDSTYNLSPAIISSPTYDDELLFMQNFFSNST-

Query:  -NPPPAHDAPMTVVDNQPAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSCHLYSDLYLSYLLNGAANSSDGDELQNMMEMAEMEAAAAAGKGQ---
         N   +    +T    Q ++    I++      L        L         S HL SD+YLS LLNG A+SS      + +          AG+ +   
Subjt:  -NPPPAHDAPMTVVDNQPAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSCHLYSDLYLSYLLNGAANSSDGDELQNMMEMAEMEAAAAAGKGQ---

Query:  -WQSSQEGKRELDLIEMLS
           +S   +RE+DLIEMLS
Subjt:  -WQSSQEGKRELDLIEMLS

AT5G40360.1 myb domain protein 1158.4e-3857.81Show/hide
Query:  KGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIAKSIPGRTENAIKNHWNATKRR
        KGQWT  ED  L+R+VK  G + W  IA+  +GR GKQCRERWHNHLRP+IKK  WSEEE++IL+E H  VGN+W EIAK +PGR+EN +KNHWNATKRR
Subjt:  KGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIAKSIPGRTENAIKNHWNATKRR

Query:  QNSRRKNKRPNSQNGKPHSSILQDYIKS
         +S R  +   S    P ++ L++YI+S
Subjt:  QNSRRKNKRPNSQNGKPHSSILQDYIKS

AT5G58850.1 myb domain protein 1197.3e-6640.28Show/hide
Query:  SGPPLAAIDRFLYGHQLLHSCDDVDVIGGFPPDQSYCGGEWRRVEEEEEEEEIMYSWGR------SGANCNYELMMENEGKIISKG------KKFKKGSS
        + PPL A++RFLYG +    C           DQ     +       + +E   +   R      +G N N        G+++++          KK SS
Subjt:  SGPPLAAIDRFLYGHQLLHSCDDVDVIGGFPPDQSYCGGEWRRVEEEEEEEEIMYSWGR------SGANCNYELMMENEGKIISKG------KKFKKGSS

Query:  ANLIKGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIAKSIPGRTENAIKNHWNA
         NLIKGQWT EEDRKLIRLV+QHG RKWA I+EKLEGRAGKQCRERWHNHLRPDIKK+ WSEEEER+LVE+H ++GN+WAEIAK IPGRTEN+IKNHWNA
Subjt:  ANLIKGQWTEEEDRKLIRLVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIAKSIPGRTENAIKNHWNA

Query:  TKRRQNSRRKNKRP-----NSQNGKPHSS---ILQDYIKS--KHNI----PAPATATIVSDDPSSHFNHFFPESSDSTYNLSPAIISSPTYDDELLFMQN
        TKRRQNS+RK+KR      N ++  P +    ILQDYIKS  ++NI            +S   + + +  + +   ++     +I+  P YD+EL + QN
Subjt:  TKRRQNSRRKNKRP-----NSQNGKPHSS---ILQDYIKS--KHNI----PAPATATIVSDDPSSHFNHFFPESSDSTYNLSPAIISSPTYDDELLFMQN

Query:  FFSNSTNPPPAHDAPMTVVDNQPAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSC---HLYSDLYLSYLLNGAANSSDGDELQNMMEMAEMEAAAA
         F+N  +P    +  ++   ++   +  S    +K     + D V  +H +  T T      HL SD+YLSYLLNG  +S       +            
Subjt:  FFSNSTNPPPAHDAPMTVVDNQPAAEFVSIDSELKMEKLKVGDEVDDLHRRSNTATGSC---HLYSDLYLSYLLNGAANSSDGDELQNMMEMAEMEAAAA

Query:  AGKGQW----QSSQEGKRELDLIEMLS
         G  ++     +S   +RE+DLIEMLS
Subjt:  AGKGQW----QSSQEGKRELDLIEMLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGCCGCCGTCGGGACCGCCGCTGGCCGCCATAGATCGGTTTCTCTACGGCCACCAATTATTACACAGCTGCGATGACGTGGACGTGATCGGAGGGTTCCCGCCGGA
TCAAAGTTACTGCGGCGGCGAGTGGCGGCGCGTGGAGGAGGAAGAAGAAGAAGAAGAAATTATGTACAGTTGGGGCAGAAGTGGGGCAAATTGTAATTATGAGCTGATGA
TGGAAAATGAAGGCAAAATTATTTCCAAAGGGAAGAAATTTAAGAAGGGCTCTTCTGCAAATTTGATTAAAGGCCAGTGGACTGAGGAAGAAGACAGGAAATTAATAAGA
TTGGTGAAGCAACATGGGGTGAGAAAATGGGCACAGATAGCCGAGAAGTTGGAGGGAAGAGCCGGGAAGCAATGCCGCGAGAGATGGCACAATCATTTGCGGCCTGATAT
TAAGAAGGAAAGTTGGAGCGAAGAGGAAGAGAGAATACTAGTGGAGACTCATGCAAAGGTTGGGAACCGATGGGCAGAGATAGCAAAAAGCATTCCGGGAAGAACAGAAA
ACGCCATAAAAAACCATTGGAATGCCACAAAAAGAAGGCAGAATTCAAGGAGAAAGAACAAACGCCCCAACTCCCAAAATGGCAAGCCTCACTCTTCTATTCTTCAAGAC
TACATCAAAAGCAAGCATAACATTCCCGCTCCAGCCACCGCAACCATCGTCTCCGACGATCCTTCGTCCCATTTCAACCATTTCTTCCCTGAATCCTCTGACTCCACCTA
CAACCTCTCCCCGGCGATCATCTCGTCGCCTACGTACGACGACGAGCTTCTCTTCATGCAGAACTTCTTTTCCAACTCGACCAATCCACCACCCGCCCACGATGCGCCAA
TGACCGTTGTCGATAATCAACCTGCGGCGGAGTTTGTCTCCATCGACTCCGAACTGAAAATGGAGAAGCTGAAGGTTGGCGACGAAGTAGATGATCTTCATCGTCGGAGC
AACACTGCAACCGGCAGTTGCCATCTGTATTCTGACCTATATTTGTCATATCTTTTGAACGGGGCGGCGAACTCCAGCGACGGCGATGAGCTTCAGAACATGATGGAGAT
GGCGGAGATGGAGGCGGCGGCCGCGGCGGGGAAAGGCCAATGGCAGAGCTCTCAAGAAGGGAAAAGGGAACTGGATTTGATAGAAATGCTGTCTTTCCATTGTTACTCTT
GA
mRNA sequenceShow/hide mRNA sequence
ATGCAGCCGCCGTCGGGACCGCCGCTGGCCGCCATAGATCGGTTTCTCTACGGCCACCAATTATTACACAGCTGCGATGACGTGGACGTGATCGGAGGGTTCCCGCCGGA
TCAAAGTTACTGCGGCGGCGAGTGGCGGCGCGTGGAGGAGGAAGAAGAAGAAGAAGAAATTATGTACAGTTGGGGCAGAAGTGGGGCAAATTGTAATTATGAGCTGATGA
TGGAAAATGAAGGCAAAATTATTTCCAAAGGGAAGAAATTTAAGAAGGGCTCTTCTGCAAATTTGATTAAAGGCCAGTGGACTGAGGAAGAAGACAGGAAATTAATAAGA
TTGGTGAAGCAACATGGGGTGAGAAAATGGGCACAGATAGCCGAGAAGTTGGAGGGAAGAGCCGGGAAGCAATGCCGCGAGAGATGGCACAATCATTTGCGGCCTGATAT
TAAGAAGGAAAGTTGGAGCGAAGAGGAAGAGAGAATACTAGTGGAGACTCATGCAAAGGTTGGGAACCGATGGGCAGAGATAGCAAAAAGCATTCCGGGAAGAACAGAAA
ACGCCATAAAAAACCATTGGAATGCCACAAAAAGAAGGCAGAATTCAAGGAGAAAGAACAAACGCCCCAACTCCCAAAATGGCAAGCCTCACTCTTCTATTCTTCAAGAC
TACATCAAAAGCAAGCATAACATTCCCGCTCCAGCCACCGCAACCATCGTCTCCGACGATCCTTCGTCCCATTTCAACCATTTCTTCCCTGAATCCTCTGACTCCACCTA
CAACCTCTCCCCGGCGATCATCTCGTCGCCTACGTACGACGACGAGCTTCTCTTCATGCAGAACTTCTTTTCCAACTCGACCAATCCACCACCCGCCCACGATGCGCCAA
TGACCGTTGTCGATAATCAACCTGCGGCGGAGTTTGTCTCCATCGACTCCGAACTGAAAATGGAGAAGCTGAAGGTTGGCGACGAAGTAGATGATCTTCATCGTCGGAGC
AACACTGCAACCGGCAGTTGCCATCTGTATTCTGACCTATATTTGTCATATCTTTTGAACGGGGCGGCGAACTCCAGCGACGGCGATGAGCTTCAGAACATGATGGAGAT
GGCGGAGATGGAGGCGGCGGCCGCGGCGGGGAAAGGCCAATGGCAGAGCTCTCAAGAAGGGAAAAGGGAACTGGATTTGATAGAAATGCTGTCTTTCCATTGTTACTCTT
GA
Protein sequenceShow/hide protein sequence
MQPPSGPPLAAIDRFLYGHQLLHSCDDVDVIGGFPPDQSYCGGEWRRVEEEEEEEEIMYSWGRSGANCNYELMMENEGKIISKGKKFKKGSSANLIKGQWTEEEDRKLIR
LVKQHGVRKWAQIAEKLEGRAGKQCRERWHNHLRPDIKKESWSEEEERILVETHAKVGNRWAEIAKSIPGRTENAIKNHWNATKRRQNSRRKNKRPNSQNGKPHSSILQD
YIKSKHNIPAPATATIVSDDPSSHFNHFFPESSDSTYNLSPAIISSPTYDDELLFMQNFFSNSTNPPPAHDAPMTVVDNQPAAEFVSIDSELKMEKLKVGDEVDDLHRRS
NTATGSCHLYSDLYLSYLLNGAANSSDGDELQNMMEMAEMEAAAAAGKGQWQSSQEGKRELDLIEMLSFHCYS