CuGenDBv2

Sgr021091 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr021091
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: protein rough sheath 2 homolog
Genome location: tig00153640:457829..458947
RNA-Seq Expression: Sgr021091
Synteny: Sgr021091
Gene Ontology terms: NA
InterPro domains:
    IPR001005 - SANT/Myb domain
    IPR009057 - Homeobox-like domain superfamily
    IPR017930 - Myb domain
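
The genome location above uses a `contig:start..end` layout. A minimal sketch of splitting it into its parts follows; the names `Location` and `parse_location` are hypothetical helpers, not part of CuGenDB, and the length calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval as written on the gene page (hypothetical helper)."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# contig name, a colon, then start..end as plain integers
_LOCATION_RE = re.compile(r"^(?P<contig>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into a Location."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match["contig"], int(match["start"]), int(match["end"]))


loc = parse_location("tig00153640:457829..458947")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the interval spans 1119 positions.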


Homology
GenBank top hits (e value, % identity, alignment)
KAG6581166.1 Transcription factor AS1, partial [Cucurbita argyrosperma subsp. sororia] (e value: 6.8e-177; % identity: 86.32)
Query:  MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW
        MKDRQRWQPEEDALLR YVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSP+EQ+LVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW
Subjt:  MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW

Query:  EVFKEKQLKQLQKA---TQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILETFAEKYVQPKLY-GAFQSPPLANVMPRIPVPDADPVLSLGSVSSTTSSS
        EVF+EKQLKQLQK    TQRRDYQNSDGNIP+ GVSSPEKA+QGPYDHILETFAEKYVQPK+Y  AFQSPP     PR+ VP+ADPVLSLGSVSSTTSS 
Subjt:  EVFKEKQLKQLQKA---TQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILETFAEKYVQPKLY-GAFQSPPLANVMPRIPVPDADPVLSLGSVSSTTSSS

Query:  TVIPLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEPAALDSEVNRLLPVQQIGTLVQYCKELEEGRQSWVQHKKEATWRLNRLEQQLESEKARKKREKM
        TV+PLWMNVNSTSTASSST+STTPSPSVSLTLSPSEPA L+SEVNR+LPVQQ+G LVQYCK+LEEGRQ+WVQHKKEATWRLNRLEQQLESEKARKKREKM
Subjt:  TVIPLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEPAALDSEVNRLLPVQQIGTLVQYCKELEEGRQSWVQHKKEATWRLNRLEQQLESEKARKKREKM

Query:  EEMEAKIRSLREEEMAYLGRIERDYSEQLNALQRDAESKEAKLLEAWCNKHEKLAKLVEQIGVQGHGFVA----SKEIIH
        EEMEAKI+ LREEEMAYLGRIE DY EQL+ALQRDA+SKEAKL+E+WC+KH KLAKLVE+ GVQGHG V     SK+I+H
Subjt:  EEMEAKIRSLREEEMAYLGRIERDYSEQLNALQRDAESKEAKLLEAWCNKHEKLAKLVEQIGVQGHGFVA----SKEIIH

KAG7036747.1 Transcription factor AS1, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.5e-179; % identity: 87.79)
Query:  MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW
        MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQ+LVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW
Subjt:  MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW

Query:  EVFKEKQLKQLQKA---TQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILETFAEKYVQPKLY-GAFQS----PPLANVMPRIPVPDADPVLSLGSVSST
        EVFKEKQLK+LQKA   TQRRDYQNSDGN+ I+GVSSPEKALQGPYDHILETFAEKYVQPKLY  AFQS    PPLANVMPR+PVPDADPVLSLGSV+ST
Subjt:  EVFKEKQLKQLQKA---TQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILETFAEKYVQPKLY-GAFQS----PPLANVMPRIPVPDADPVLSLGSVSST

Query:  TSSSTVIPLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEPAALDSEVNRLLPVQQIGTLVQYCKELEEGRQSWVQHKKEATWRLNRLEQQLESEKARKK
        TSSSTV+PLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEP  LDSE+NRLLPVQQ+G LV YCKELEEGRQ W+QHKKEATWRLNRLEQQLESEKARKK
Subjt:  TSSSTVIPLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEPAALDSEVNRLLPVQQIGTLVQYCKELEEGRQSWVQHKKEATWRLNRLEQQLESEKARKK

Query:  REKMEEMEAKIRSLREEEMAYLGRIERDYSEQLNALQRDAESKEAKLLEAWCNKHEKLAKLVEQIGVQGHGFVA-----SKEIIH
        REKMEEMEAKIR LREEEMAYLGRIE DY EQ++A++RDAE+KEAKLLEAWC+KH KL KLVEQIG QG G V      SK++IH
Subjt:  REKMEEMEAKIRSLREEEMAYLGRIERDYSEQLNALQRDAESKEAKLLEAWCNKHEKLAKLVEQIGVQGHGFVA-----SKEIIH

XP_022949284.1 protein rough sheath 2 homolog [Cucurbita moschata] (e value: 3.6e-178; % identity: 87.27)
Query:  MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW
        MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPL+RDPKSCLERWKNYLKPGLKKGSLSPEEQ+LVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW
Subjt:  MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW

Query:  EVFKEKQLKQLQKA---TQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILETFAEKYVQPKLY-GAFQS----PPLANVMPRIPVPDADPVLSLGSVSST
        EVFKEKQLK+LQKA   TQRRDYQNSDGN+ I+GVSSPEKALQGPYDHILETFAEKYVQPKLY  AFQS    PPLANVMPR+PVPDADPVLSLGSV+ST
Subjt:  EVFKEKQLKQLQKA---TQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILETFAEKYVQPKLY-GAFQS----PPLANVMPRIPVPDADPVLSLGSVSST

Query:  TSSSTVIPLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEPAALDSEVNRLLPVQQIGTLVQYCKELEEGRQSWVQHKKEATWRLNRLEQQLESEKARKK
        TSSSTV+PLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEP  LDSE+NRLLPVQQ+G LV YCKELEEGRQ W+QHKKEATWRL RLEQQLESEKARKK
Subjt:  TSSSTVIPLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEPAALDSEVNRLLPVQQIGTLVQYCKELEEGRQSWVQHKKEATWRLNRLEQQLESEKARKK

Query:  REKMEEMEAKIRSLREEEMAYLGRIERDYSEQLNALQRDAESKEAKLLEAWCNKHEKLAKLVEQIGVQGHGFVA-----SKEIIH
        REKMEEMEAKIR LREEEMAYLGRIE DY EQ++A++RDAE+KEAKLLEAWC+KH KL KLVEQIG QG G V      SK++IH
Subjt:  REKMEEMEAKIRSLREEEMAYLGRIERDYSEQLNALQRDAESKEAKLLEAWCNKHEKLAKLVEQIGVQGHGFVA-----SKEIIH

XP_022997764.1 protein rough sheath 2 homolog [Cucurbita maxima] (e value: 9.4e-179; % identity: 87.5)
Query:  MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW
        MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQ+LVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW
Subjt:  MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW

Query:  EVFKEKQLKQLQKA---TQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILETFAEKYVQPKLYG-AFQS----PPLANVMPRIPVPDADPVLSLGSVSST
        EVFKEKQLK+LQKA   TQRRDYQNSDGN+ I+GVSSPEKALQGPYDHILETFAEKYVQPKLY  AFQS    PPLAN+MPR+PVPDADPVLSLGSV+ST
Subjt:  EVFKEKQLKQLQKA---TQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILETFAEKYVQPKLYG-AFQS----PPLANVMPRIPVPDADPVLSLGSVSST

Query:  TSSSTVIPLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEPAALDSEVNRLLPVQQIGTLVQYCKELEEGRQSWVQHKKEATWRLNRLEQQLESEKARKK
        TSSS V+PLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEP  LDSE+NRLLPVQQ+G LV YCKELEEGRQ W+QHKKEATWRLNRLEQQLESEKARKK
Subjt:  TSSSTVIPLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEPAALDSEVNRLLPVQQIGTLVQYCKELEEGRQSWVQHKKEATWRLNRLEQQLESEKARKK

Query:  REKMEEMEAKIRSLREEEMAYLGRIERDYSEQLNALQRDAESKEAKLLEAWCNKHEKLAKLVEQIGVQGHGFVA----SKEIIH
        REKMEEMEAKIR LREEEMAYLGRIE DY EQ++A++RDAE+KEAKLLEAWC+KH KL KLVEQIG QG G V     SK++IH
Subjt:  REKMEEMEAKIRSLREEEMAYLGRIERDYSEQLNALQRDAESKEAKLLEAWCNKHEKLAKLVEQIGVQGHGFVA----SKEIIH

XP_023525837.1 protein rough sheath 2 homolog [Cucurbita pepo subsp. pepo] (e value: 3.2e-179; % identity: 87.56)
Query:  MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW
        MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLS EEQ+LVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW
Subjt:  MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW

Query:  EVFKEKQLKQLQKA---TQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILETFAEKYVQPKLY-GAFQS-----PPLANVMPRIPVPDADPVLSLGSVSS
        EVFKEKQLK+LQKA   TQRRDYQNSDGN+ I+GVSSPEKALQGPYDHILETFAEKYVQPKLY  AFQS     PPLANVMPR+PVPDADPVLSLGSV+S
Subjt:  EVFKEKQLKQLQKA---TQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILETFAEKYVQPKLY-GAFQS-----PPLANVMPRIPVPDADPVLSLGSVSS

Query:  TTSSSTVIPLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEPAALDSEVNRLLPVQQIGTLVQYCKELEEGRQSWVQHKKEATWRLNRLEQQLESEKARK
        TTSSSTV+PLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEP  LDSE+NRLLPVQQ+G LV YCKELEEGRQSW+QHKKEATWRLNRLEQQLESEKARK
Subjt:  TTSSSTVIPLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEPAALDSEVNRLLPVQQIGTLVQYCKELEEGRQSWVQHKKEATWRLNRLEQQLESEKARK

Query:  KREKMEEMEAKIRSLREEEMAYLGRIERDYSEQLNALQRDAESKEAKLLEAWCNKHEKLAKLVEQIGVQGHGFVA-----SKEIIH
        KREKMEEMEAKIR LREEEMAYLGRIE DY EQ++A++RDAE+KEAKLLEAWC+KH KL KLVEQIG QG G V      SK++IH
Subjt:  KREKMEEMEAKIRSLREEEMAYLGRIERDYSEQLNALQRDAESKEAKLLEAWCNKHEKLAKLVEQIGVQGHGFVA-----SKEIIH

TrEMBL top hits (e value, % identity, alignment)
A0A5D3CNU9 Transcription factor AS1-like (e value: 1.5e-153; % identity: 80.21)
Query:  MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW
        MKDRQRWQPEEDALLRAYVKQYGPKEWNLIS RM  PLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRT KRLGKWW
Subjt:  MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW

Query:  EVFKEKQLKQLQKATQRRDYQNSDGNIPIA-GVSSPEKALQGPYDHILETFAEKYVQPKLYGAFQSPPLANVMPRIPVPDADPVLSLGSVSSTTSSSTVI
        EVFKEKQLKQL KA       + D N+PI+  VSSPEKALQGPYDHILETFAEKYVQPKLY              IP+PDADP+LSLGSV+STTSSST++
Subjt:  EVFKEKQLKQLQKATQRRDYQNSDGNIPIA-GVSSPEKALQGPYDHILETFAEKYVQPKLYGAFQSPPLANVMPRIPVPDADPVLSLGSVSSTTSSSTVI

Query:  PLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEPAALDSEVNRLLPVQQIGTLVQYCKELEEGRQSWVQHKKEATWRLNRLEQQLESEKARKKREKMEEM
        PLWMNVNSTSTASSST STTPSPSVSLTLSPSEP  L+SEVNR      IG LVQYCKE+EEGRQSWVQHKKEA+WRLNRLEQQLESEKARKKREKMEEM
Subjt:  PLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEPAALDSEVNRLLPVQQIGTLVQYCKELEEGRQSWVQHKKEATWRLNRLEQQLESEKARKKREKMEEM

Query:  EAKIRSLREEEMAYLGRIERDYSEQLNALQRDAESKEAKLLEAWCNKHEKLAKLVEQIGVQGHGFV-ASKEIIH
        EAKI+ LREEE  YLG IERDY EQLNALQR+A+ KEAKL+E WCNKH KL KLVE+ G  GHG +  SK+I+H
Subjt:  EAKIRSLREEEMAYLGRIERDYSEQLNALQRDAESKEAKLLEAWCNKHEKLAKLVEQIGVQGHGFV-ASKEIIH

A0A6J1FAF4 protein rough sheath 2 homolog (e value: 1.8e-175; % identity: 86.05)
Query:  MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW
        MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKP LKKGSLSP+EQ+LVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW
Subjt:  MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW

Query:  EVFKEKQLKQLQKA---TQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILETFAEKYVQPKLY-GAFQSPPLANVMPRIPVPDADPVLSLGSVSSTTSSS
        EVF+EKQLKQLQK    TQRRDYQNSDGNIP+ GVSSPEKA+QGPYDHILETFAEKYVQPK+Y  AFQSPP     PR+ VP+ADPVLSLGSVSSTTSS 
Subjt:  EVFKEKQLKQLQKA---TQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILETFAEKYVQPKLY-GAFQSPPLANVMPRIPVPDADPVLSLGSVSSTTSSS

Query:  TVIPLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEPAALDSEVNRLLPVQQIGTLVQYCKELEEGRQSWVQHKKEATWRLNRLEQQLESEKARKKREKM
        TV+PLWMNVNSTSTASSST+STTPSPSVSLTLSPSEPA L+SEVNR+LPVQQ+G LVQYCK+LEEGRQ+WVQHKKEATWRLNRLEQQLESEKARKKREKM
Subjt:  TVIPLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEPAALDSEVNRLLPVQQIGTLVQYCKELEEGRQSWVQHKKEATWRLNRLEQQLESEKARKKREKM

Query:  EEMEAKIRSLREEEMAYLGRIERDYSEQLNALQRDAESKEAKLLEAWCNKHEKLAKLVEQIGVQGHGFVA----SKEIIH
        EEMEAKI+ LREEEMAYLGRIE DY EQL+ALQRDA+SKEAKL+E+WC+KH KLAKLVE+ GVQGHG V     S +I+H
Subjt:  EEMEAKIRSLREEEMAYLGRIERDYSEQLNALQRDAESKEAKLLEAWCNKHEKLAKLVEQIGVQGHGFVA----SKEIIH

A0A6J1GCE3 protein rough sheath 2 homolog (e value: 1.7e-178; % identity: 87.27)
Query:  MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW
        MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPL+RDPKSCLERWKNYLKPGLKKGSLSPEEQ+LVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW
Subjt:  MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW

Query:  EVFKEKQLKQLQKA---TQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILETFAEKYVQPKLY-GAFQS----PPLANVMPRIPVPDADPVLSLGSVSST
        EVFKEKQLK+LQKA   TQRRDYQNSDGN+ I+GVSSPEKALQGPYDHILETFAEKYVQPKLY  AFQS    PPLANVMPR+PVPDADPVLSLGSV+ST
Subjt:  EVFKEKQLKQLQKA---TQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILETFAEKYVQPKLY-GAFQS----PPLANVMPRIPVPDADPVLSLGSVSST

Query:  TSSSTVIPLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEPAALDSEVNRLLPVQQIGTLVQYCKELEEGRQSWVQHKKEATWRLNRLEQQLESEKARKK
        TSSSTV+PLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEP  LDSE+NRLLPVQQ+G LV YCKELEEGRQ W+QHKKEATWRL RLEQQLESEKARKK
Subjt:  TSSSTVIPLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEPAALDSEVNRLLPVQQIGTLVQYCKELEEGRQSWVQHKKEATWRLNRLEQQLESEKARKK

Query:  REKMEEMEAKIRSLREEEMAYLGRIERDYSEQLNALQRDAESKEAKLLEAWCNKHEKLAKLVEQIGVQGHGFVA-----SKEIIH
        REKMEEMEAKIR LREEEMAYLGRIE DY EQ++A++RDAE+KEAKLLEAWC+KH KL KLVEQIG QG G V      SK++IH
Subjt:  REKMEEMEAKIRSLREEEMAYLGRIERDYSEQLNALQRDAESKEAKLLEAWCNKHEKLAKLVEQIGVQGHGFVA-----SKEIIH

A0A6J1IZB3 protein rough sheath 2 homolog (e value: 5.2e-175; % identity: 85.79)
Query:  MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW
        MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSP+EQ+LVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW
Subjt:  MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW

Query:  EVFKEKQLKQLQKA---TQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILETFAEKYVQPKLY-GAFQSPPLANVMPRIPVPDADPVLSLGSVSSTTSSS
        EVFKEKQLKQLQK    TQR DYQNSDGNIPI GVSSPEKA+QGPYDHILETFAEKYVQPK+Y  AFQSPP     PR+ VP+ADPVLSLGSVSSTTSS 
Subjt:  EVFKEKQLKQLQKA---TQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILETFAEKYVQPKLY-GAFQSPPLANVMPRIPVPDADPVLSLGSVSSTTSSS

Query:  TVIPLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEPAALDSEVNRLLPVQQIGTLVQYCKELEEGRQSWVQHKKEATWRLNRLEQQLESEKARKKREKM
        TV+PLWMN NSTSTASSST STTPSPSVSLTLSPSEPA  ++EVNR+LPVQQ+G LVQYCK+LEEGRQSWVQHKKEATWRLNRLE+QLESEKARKKREKM
Subjt:  TVIPLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEPAALDSEVNRLLPVQQIGTLVQYCKELEEGRQSWVQHKKEATWRLNRLEQQLESEKARKKREKM

Query:  EEMEAKIRSLREEEMAYLGRIERDYSEQLNALQRDAESKEAKLLEAWCNKHEKLAKLVEQIGVQGHGFVA----SKEIIH
        EEMEAKI+ LREEE AYLGRIE DY EQL+ALQRDA+SKEAKL+E+WC+KH KLAKLVE+ GVQGHG V     SK+I+H
Subjt:  EEMEAKIRSLREEEMAYLGRIERDYSEQLNALQRDAESKEAKLLEAWCNKHEKLAKLVEQIGVQGHGFVA----SKEIIH

A0A6J1KCH4 protein rough sheath 2 homolog (e value: 4.6e-179; % identity: 87.5)
Query:  MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW
        MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQ+LVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW
Subjt:  MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW

Query:  EVFKEKQLKQLQKA---TQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILETFAEKYVQPKLYG-AFQS----PPLANVMPRIPVPDADPVLSLGSVSST
        EVFKEKQLK+LQKA   TQRRDYQNSDGN+ I+GVSSPEKALQGPYDHILETFAEKYVQPKLY  AFQS    PPLAN+MPR+PVPDADPVLSLGSV+ST
Subjt:  EVFKEKQLKQLQKA---TQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILETFAEKYVQPKLYG-AFQS----PPLANVMPRIPVPDADPVLSLGSVSST

Query:  TSSSTVIPLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEPAALDSEVNRLLPVQQIGTLVQYCKELEEGRQSWVQHKKEATWRLNRLEQQLESEKARKK
        TSSS V+PLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEP  LDSE+NRLLPVQQ+G LV YCKELEEGRQ W+QHKKEATWRLNRLEQQLESEKARKK
Subjt:  TSSSTVIPLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEPAALDSEVNRLLPVQQIGTLVQYCKELEEGRQSWVQHKKEATWRLNRLEQQLESEKARKK

Query:  REKMEEMEAKIRSLREEEMAYLGRIERDYSEQLNALQRDAESKEAKLLEAWCNKHEKLAKLVEQIGVQGHGFVA----SKEIIH
        REKMEEMEAKIR LREEEMAYLGRIE DY EQ++A++RDAE+KEAKLLEAWC+KH KL KLVEQIG QG G V     SK++IH
Subjt:  REKMEEMEAKIRSLREEEMAYLGRIERDYSEQLNALQRDAESKEAKLLEAWCNKHEKLAKLVEQIGVQGHGFVA----SKEIIH

SwissProt top hits (e value, % identity, alignment)
O80931 Transcription factor AS1 (e value: 3.0e-79; % identity: 46.8)
Query:  MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW
        MK+RQRW  EEDALLRAYV+Q+GP+EW+L+S+RM KPL+RD KSCLERWKNYLKPG+KKGSL+ EEQ LVI LQ K+GNKWKKIAAEVPGRTAKRLGKWW
Subjt:  MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW

Query:  EVFKEKQLKQLQKATQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILETFAEKYVQPKLYGAFQSPPLANVMPRIPVPDADPVLS------LGSVSSTTS
        EVFKEKQ ++ +++ +R                  E   +  YD ILE+FAEK V+ +          +NV+P      A  V++      L S      
Subjt:  EVFKEKQLKQLQKATQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILETFAEKYVQPKLYGAFQSPPLANVMPRIPVPDADPVLS------LGSVSSTTS

Query:  SSTVIPLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEPAALDSE----------------------VNRLLPVQQ-------IGTLVQYCKELEEGRQS
         + VIP W+     +T+++  +     PSV+LTLSPS  AA   +                      +  ++P          +  LV+ C+ELEEG ++
Subjt:  SSTVIPLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEPAALDSE----------------------VNRLLPVQQ-------IGTLVQYCKELEEGRQS

Query:  WVQHKKEATWRLNRLEQQLESEKARKKREKMEEMEAKIRSLREEEMAYLGRIERDYSEQLNALQRDAESKEAKLLEAWCNKHEKLAKLVEQ
        W  HKKEA WRL RLE QLESEK  ++REKMEE+EAK+++LREE+   + +IE +Y EQL  L+RDAE+K+ KL + W ++H +L K +EQ
Subjt:  WVQHKKEATWRLNRLEQQLESEKARKKREKMEEMEAKIRSLREEEMAYLGRIERDYSEQLNALQRDAESKEAKLLEAWCNKHEKLAKLVEQ

P81392 Myb-related protein 306 (e value: 8.4e-21; % identity: 33.2)
Query:  WQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWWEVFKEK
        W PEED +L +Y++++GP  W  I    G  L R  KSC  RW NYL+PG+K+G  +  E+ ++I LQA  GN+W  IA+ +P RT   +  +W    +K
Subjt:  WQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWWEVFKEK

Query:  QLKQLQKATQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILET--------FAEKYVQPKLYGAFQSPPLANVMPRIPVPDADPVLSLGSVSSTTSSSTV
        +L++LQ      + +  DGN   + V S +   +G ++  L+T          +     K   +   P L+ V    P P         + SS  + + +
Subjt:  QLKQLQKATQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILET--------FAEKYVQPKLYGAFQSPPLANVMPRIPVPDADPVLSLGSVSSTTSSSTV

Query:  IPLW-----MNVNSTSTASSSTSSTTP--SPSVSL-TLSPSEPA
        +  W     +N +STS A SS S+TT    PSV L T SPSE A
Subjt:  IPLW-----MNVNSTSTASSSTSSTTP--SPSVSL-TLSPSEPA

Q24JK1 Transcription factor MYB96 (e value: 5.1e-18; % identity: 31.35)
Query:  WQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWWEVFKEK
        W PEED +L +Y++++GP  W  +    G  L R  KSC  RW NYL+PG+K+G+ +  E+  ++ LQA  GN+W  IA+ +P RT   +  +W    +K
Subjt:  WQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWWEVFKEK

Query:  QLKQLQKATQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILETFAEKYVQPKLYGAFQSPPLANVMPRIPVPDADPVLSLGSVSSTTSSSTVIP----LW
        +LK++          N  G     GVSS   + Q  +    +   E+ +Q  +  A Q+                  LSL   SST SSS+ +P      
Subjt:  QLKQLQKATQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILETFAEKYVQPKLYGAFQSPPLANVMPRIPVPDADPVLSLGSVSSTTSSSTVIP----LW

Query:  MNVNSTSTA---------SSSTSSTTPSPSVSLTLSPSEPAALDSE-VNRLL
         N+ + S+A         SSS+S+TT + S +    PS   A  +E + RLL
Subjt:  MNVNSTSTA---------SSSTSSTTPSPSVSLTLSPSEPAALDSE-VNRLL

Q94IB1 Protein rough sheath 2 homolog (e value: 8.8e-79; % identity: 46.69)
Query:  MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW
        M++RQRW+PEEDA+L AYV+QYGP+EW+L+SQRM +PLHRD KSCLERWKNYL+PG+KKGSL+ +EQ LVI LQAK+GNKWKKIAAEVPGRTAKRLGKWW
Subjt:  MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW

Query:  EVFKEKQLKQLQKATQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILETFAEKYVQPKLYGAFQSPPLANVMPRIPVPDADPVLSLGSVSSTTSSSTVIP
        EVFKEKQ ++L+   +RR     DG+              G YD +LE FA+K V         +P                               ++P
Subjt:  EVFKEKQLKQLQKATQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILETFAEKYVQPKLYGAFQSPPLANVMPRIPVPDADPVLSLGSVSSTTSSSTVIP

Query:  LWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEPAAL-----DSEVNRLLPVQQIGTLVQYCKELEEGRQSWVQHKKEATWRLNRLEQQLESEKARKKREK
         WM  +S+ ++SSS S T    S ++  +P+ P          EV        +  L++ C+E+EEG+++W  H+KEA WR+ R+E QLE+E+A ++RE 
Subjt:  LWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEPAAL-----DSEVNRLLPVQQIGTLVQYCKELEEGRQSWVQHKKEATWRLNRLEQQLESEKARKKREK

Query:  MEEMEAKIRSLREEEMAYLGRIERDYSEQLNALQRDAESKEAKLLEAWCNKHEKLAKLVEQI
         EE EAK+R+LREE+ A + R+E +Y E++  L+RDAE+KE K+ E W  KH +LAK ++Q+
Subjt:  MEEMEAKIRSLREEEMAYLGRIERDYSEQLNALQRDAESKEAKLLEAWCNKHEKLAKLVEQI

Q9S7B2 Protein rough sheath 2 (e value: 1.7e-77; % identity: 48.03)
Query:  MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW
        MK+RQRW+PEEDA+LRAYV+QYGP+EW+L+SQRM   L RD KSCLERWKNYL+PG+KKGSL+ EEQ LVI LQAK+GNKWKKIAAEVPGRTAKRLGKWW
Subjt:  MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW

Query:  EVFKEKQLKQLQKATQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILETFAEKYV--QPKLYGAFQSPPL--ANVMPRIPVPDADPVLSLG---------
        EVFKEKQ ++L+           D   P    S  E+   G Y+ +LE FAEK V  +P+   A  SP L  A V+P     +A P  +           
Subjt:  EVFKEKQLKQLQKATQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILETFAEKYV--QPKLYGAFQSPPL--ANVMPRIPVPDADPVLSLG---------

Query:  ----SVSSTTSSSTVIP------LWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEPAALDSEVNRLLPVQQIGTLVQYCKELEEGRQSWVQHKKEATWRL
            SV+ + +S+ V P       WM       A+ +     PSPS     +P   A +D         Q +  L + C+ELEEGR++W  H++EA WRL
Subjt:  ----SVSSTTSSSTVIP------LWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEPAALDSEVNRLLPVQQIGTLVQYCKELEEGRQSWVQHKKEATWRL

Query:  NRLEQQLESEKARKKREKMEEMEAKIRSLREEEMAYLGRIERDYSEQLNALQRDAESKEAKLLEAWCNKHEKLAKLVEQIG
         R+EQQLE E+  ++RE  EE EAK+R++R E+ A   R+ERD+ E++  L+RDA+ KE K+ E W  KH ++AK VEQ+G
Subjt:  NRLEQQLESEKARKKREKMEEMEAKIRSLREEEMAYLGRIERDYSEQLNALQRDAESKEAKLLEAWCNKHEKLAKLVEQIG

Arabidopsis top hits (e value, % identity, alignment)
AT1G25340.1 myb domain protein 116 (e value: 1.7e-21; % identity: 44.76)
Query:  WQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWWEVFKEK
        W  EED LL  Y+   G   WNL+++  G  L R  KSC  RW NYLKP +K+G+L+P+EQ L++ L +K+GN+W KI+  +PGRT   +  +W    +K
Subjt:  WQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWWEVFKEK

Query:  QLKQL
        Q +QL
Subjt:  QLKQL

AT1G68320.1 myb domain protein 62 (e value: 1.5e-20; % identity: 43.52)
Query:  RQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWWEVF
        R  W  EED LL  Y+   G   WN +++  G  L R  KSC  RW NYLKP +++G+L+P+EQ L++ L +K+GN+W KIA  +PGRT   +  +W   
Subjt:  RQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWWEVF

Query:  KEKQLKQL
         +KQ +QL
Subjt:  KEKQLKQL

AT2G37630.1 myb-like HTH transcriptional regulator family protein (e value: 2.2e-80; % identity: 46.8)
Query:  MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW
        MK+RQRW  EEDALLRAYV+Q+GP+EW+L+S+RM KPL+RD KSCLERWKNYLKPG+KKGSL+ EEQ LVI LQ K+GNKWKKIAAEVPGRTAKRLGKWW
Subjt:  MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWW

Query:  EVFKEKQLKQLQKATQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILETFAEKYVQPKLYGAFQSPPLANVMPRIPVPDADPVLS------LGSVSSTTS
        EVFKEKQ ++ +++ +R                  E   +  YD ILE+FAEK V+ +          +NV+P      A  V++      L S      
Subjt:  EVFKEKQLKQLQKATQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILETFAEKYVQPKLYGAFQSPPLANVMPRIPVPDADPVLS------LGSVSSTTS

Query:  SSTVIPLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEPAALDSE----------------------VNRLLPVQQ-------IGTLVQYCKELEEGRQS
         + VIP W+     +T+++  +     PSV+LTLSPS  AA   +                      +  ++P          +  LV+ C+ELEEG ++
Subjt:  SSTVIPLWMNVNSTSTASSSTSSTTPSPSVSLTLSPSEPAALDSE----------------------VNRLLPVQQ-------IGTLVQYCKELEEGRQS

Query:  WVQHKKEATWRLNRLEQQLESEKARKKREKMEEMEAKIRSLREEEMAYLGRIERDYSEQLNALQRDAESKEAKLLEAWCNKHEKLAKLVEQ
        W  HKKEA WRL RLE QLESEK  ++REKMEE+EAK+++LREE+   + +IE +Y EQL  L+RDAE+K+ KL + W ++H +L K +EQ
Subjt:  WVQHKKEATWRLNRLEQQLESEKARKKREKMEEMEAKIRSLREEEMAYLGRIERDYSEQLNALQRDAESKEAKLLEAWCNKHEKLAKLVEQ

AT4G13480.1 myb domain protein 79 (e value: 8.6e-21; % identity: 43.69)
Query:  WQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWWEVFKEK
        W  EED LL  YV+ +G   WN +S+  G  L R+ KSC  RW NYL+P LK+G ++P E+S+++ L AK+GN+W  IA  +PGRT   +  +W    +K
Subjt:  WQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWWEVFKEK

Query:  QLK
        + K
Subjt:  QLK

AT5G52600.1 myb domain protein 82 (e value: 1.7e-21; % identity: 45.19)
Query:  RQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWWEVF
        R  W+PEED +L++YV+ +G   W  IS+R G  L R  KSC  RWKNYL+P +K+GS+SP+EQ L+I +    GN+W  IA  +PGRT   +  +W   
Subjt:  RQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWWEVF

Query:  KEKQ
          K+
Subjt:  KEKQ


Sequences
CDS sequence
ATGAAGGACCGCCAGCGCTGGCAGCCGGAAGAAGACGCACTCCTTCGAGCTTACGTCAAGCAATATGGCCCTAAAGAATGGAATCTCATCTCTCAGCGCATGGGGAAGCC
TCTCCATCGGGATCCCAAATCCTGCCTCGAGCGCTGGAAGAATTACCTCAAACCTGGCTTGAAGAAGGGCTCGCTCTCTCCCGAAGAGCAGAGCCTGGTTATCTCTCTCC
AGGCCAAGTACGGCAACAAGTGGAAGAAGATCGCCGCCGAGGTCCCCGGCCGTACGGCCAAGCGCCTTGGCAAGTGGTGGGAGGTATTTAAAGAGAAGCAGCTCAAGCAG
TTGCAGAAGGCGACGCAGAGGCGAGATTACCAAAATTCCGACGGGAATATTCCGATTGCTGGTGTTTCTTCGCCGGAGAAGGCGCTGCAAGGCCCCTACGACCACATCTT
GGAGACCTTCGCCGAGAAGTACGTCCAGCCGAAGCTGTACGGTGCGTTTCAATCTCCGCCGCTCGCCAACGTGATGCCTCGCATTCCGGTCCCCGACGCCGATCCGGTCC
TATCGCTCGGCTCCGTCAGCTCTACGACGTCGTCTTCCACCGTCATTCCTCTGTGGATGAACGTGAACTCCACCAGCACGGCTTCGTCTTCGACCTCGTCGACCACGCCC
TCCCCGTCGGTGAGCCTCACGCTATCCCCCTCCGAACCGGCCGCTCTCGACTCGGAAGTGAACCGTCTCTTACCGGTTCAGCAGATCGGCACTCTAGTCCAGTACTGCAA
GGAGCTGGAAGAAGGCCGGCAGAGCTGGGTACAGCATAAGAAGGAGGCGACATGGCGACTGAATCGATTAGAGCAGCAGCTGGAGTCGGAGAAAGCGAGGAAGAAAAGAG
AGAAAATGGAGGAAATGGAAGCGAAGATACGGAGCTTGCGGGAGGAAGAAATGGCGTACCTGGGGAGAATCGAGAGGGATTACAGTGAGCAGCTGAATGCGCTGCAGAGA
GACGCAGAGAGCAAAGAGGCGAAGCTGCTGGAAGCCTGGTGCAACAAGCATGAGAAATTGGCTAAGCTTGTGGAGCAAATTGGTGTCCAGGGACATGGATTCGTTGCTTC
GAAGGAGATAATCCATTGA
mRNA sequence
ATGAAGGACCGCCAGCGCTGGCAGCCGGAAGAAGACGCACTCCTTCGAGCTTACGTCAAGCAATATGGCCCTAAAGAATGGAATCTCATCTCTCAGCGCATGGGGAAGCC
TCTCCATCGGGATCCCAAATCCTGCCTCGAGCGCTGGAAGAATTACCTCAAACCTGGCTTGAAGAAGGGCTCGCTCTCTCCCGAAGAGCAGAGCCTGGTTATCTCTCTCC
AGGCCAAGTACGGCAACAAGTGGAAGAAGATCGCCGCCGAGGTCCCCGGCCGTACGGCCAAGCGCCTTGGCAAGTGGTGGGAGGTATTTAAAGAGAAGCAGCTCAAGCAG
TTGCAGAAGGCGACGCAGAGGCGAGATTACCAAAATTCCGACGGGAATATTCCGATTGCTGGTGTTTCTTCGCCGGAGAAGGCGCTGCAAGGCCCCTACGACCACATCTT
GGAGACCTTCGCCGAGAAGTACGTCCAGCCGAAGCTGTACGGTGCGTTTCAATCTCCGCCGCTCGCCAACGTGATGCCTCGCATTCCGGTCCCCGACGCCGATCCGGTCC
TATCGCTCGGCTCCGTCAGCTCTACGACGTCGTCTTCCACCGTCATTCCTCTGTGGATGAACGTGAACTCCACCAGCACGGCTTCGTCTTCGACCTCGTCGACCACGCCC
TCCCCGTCGGTGAGCCTCACGCTATCCCCCTCCGAACCGGCCGCTCTCGACTCGGAAGTGAACCGTCTCTTACCGGTTCAGCAGATCGGCACTCTAGTCCAGTACTGCAA
GGAGCTGGAAGAAGGCCGGCAGAGCTGGGTACAGCATAAGAAGGAGGCGACATGGCGACTGAATCGATTAGAGCAGCAGCTGGAGTCGGAGAAAGCGAGGAAGAAAAGAG
AGAAAATGGAGGAAATGGAAGCGAAGATACGGAGCTTGCGGGAGGAAGAAATGGCGTACCTGGGGAGAATCGAGAGGGATTACAGTGAGCAGCTGAATGCGCTGCAGAGA
GACGCAGAGAGCAAAGAGGCGAAGCTGCTGGAAGCCTGGTGCAACAAGCATGAGAAATTGGCTAAGCTTGTGGAGCAAATTGGTGTCCAGGGACATGGATTCGTTGCTTC
GAAGGAGATAATCCATTGA
Protein sequence
MKDRQRWQPEEDALLRAYVKQYGPKEWNLISQRMGKPLHRDPKSCLERWKNYLKPGLKKGSLSPEEQSLVISLQAKYGNKWKKIAAEVPGRTAKRLGKWWEVFKEKQLKQ
LQKATQRRDYQNSDGNIPIAGVSSPEKALQGPYDHILETFAEKYVQPKLYGAFQSPPLANVMPRIPVPDADPVLSLGSVSSTTSSSTVIPLWMNVNSTSTASSSTSSTTP
SPSVSLTLSPSEPAALDSEVNRLLPVQQIGTLVQYCKELEEGRQSWVQHKKEATWRLNRLEQQLESEKARKKREKMEEMEAKIRSLREEEMAYLGRIERDYSEQLNALQR
DAESKEAKLLEAWCNKHEKLAKLVEQIGVQGHGFVASKEIIH
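
The CDS and protein sequences above can be cross-checked by translating the nucleotides. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1); `translate` is a hypothetical helper written for this illustration, and the two test strings are the first 30 and the last 18 nucleotides of the CDS shown above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order
# with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence read in frame from its first base.

    Whitespace and line breaks are ignored; unknown codons become 'X';
    a trailing partial codon is dropped.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# Start and end of the CDS listed above.
print(translate("ATGAAGGACCGCCAGCGCTGGCAGCCGGAA"))  # MKDRQRWQPE
print(translate("AAGGAGATAATCCATTGA"))              # KEIIH
```

Both results agree with the start (`MKDRQRWQPE`) and end (`KEIIH`) of the protein sequence listed on this page.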