; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg11757 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg11757
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionHTH myb-type domain-containing protein
Genome locationCarg_Chr02:3979497..3983919
RNA-Seq ExpressionCarg11757
SyntenyCarg11757
Gene Ontology termsNA
InterPro domainsIPR009057 - Homeobox-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605382.1 Telomeric repeat-binding factor 1, partial [Cucurbita argyrosperma subsp. sororia]1.5e-30899.82Show/hide
Query:  MDKDVCRWIIEFILRSSMNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLA
        MDKDVCRWIIEFILRSSMNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLA
Subjt:  MDKDVCRWIIEFILRSSMNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLA

Query:  VEGMKNNGKYFDTVSRIWRGRLMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRN
        VEGMKNNGKYFDTVSRIWRGR+MELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRN
Subjt:  VEGMKNNGKYFDTVSRIWRGRLMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRN

Query:  EMQSVQLEQAIDKTAVASEDVGGSGGIELPSQTENHARPEHQGSVPVLTRAETKRKDVLDMNQDSGVNNNSKRSSTVEMNTERVQGLPTETTEGQESIEK
        EMQSVQLEQAIDKTAVASEDVGGSGGIELPSQTENHARPEHQGSVPVLTRAETKRKDVLDMNQDSGVNNNSKRSSTVEMNTERVQGLPTETTEGQESIEK
Subjt:  EMQSVQLEQAIDKTAVASEDVGGSGGIELPSQTENHARPEHQGSVPVLTRAETKRKDVLDMNQDSGVNNNSKRSSTVEMNTERVQGLPTETTEGQESIEK

Query:  EVTVLQDRSPNCRKNLKTSILPRCKSLASHKRVRGGAKIDHLEDLENDSSSGKSTCLQTPEFERVREALKTSSLELQALVKDPLPDALRIAESVAQDLAK
        EVTVLQDRSPNCRKNLKTSILPRCKSLASHKRVRGGAKIDHLEDLENDSSSGKSTCLQTPEFERVREALKTSSLELQALVKDPLPDALRIAESVAQDLAK
Subjt:  EVTVLQDRSPNCRKNLKTSILPRCKSLASHKRVRGGAKIDHLEDLENDSSSGKSTCLQTPEFERVREALKTSSLELQALVKDPLPDALRIAESVAQDLAK

Query:  KNKTPEHSLEDQNDAVAANPAINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKYEENR
        KNKTPEHSLEDQNDAVAANPAINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKYEENR
Subjt:  KNKTPEHSLEDQNDAVAANPAINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKYEENR

Query:  LVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYREIFDDRTE
        LVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYREIFDDRTE
Subjt:  LVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYREIFDDRTE

KAG7035336.1 hypothetical protein SDJN02_02131, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+00100Show/hide
Query:  MDKDVCRWIIEFILRSSMNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLA
        MDKDVCRWIIEFILRSSMNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLA
Subjt:  MDKDVCRWIIEFILRSSMNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLA

Query:  VEGMKNNGKYFDTVSRIWRGRLMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRN
        VEGMKNNGKYFDTVSRIWRGRLMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRN
Subjt:  VEGMKNNGKYFDTVSRIWRGRLMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRN

Query:  EMQSVQLEQAIDKTAVASEDVGGSGGIELPSQTENHARPEHQGSVPVLTRAETKRKDVLDMNQDSGVNNNSKRSSTVEMNTERVQGLPTETTEGQESIEK
        EMQSVQLEQAIDKTAVASEDVGGSGGIELPSQTENHARPEHQGSVPVLTRAETKRKDVLDMNQDSGVNNNSKRSSTVEMNTERVQGLPTETTEGQESIEK
Subjt:  EMQSVQLEQAIDKTAVASEDVGGSGGIELPSQTENHARPEHQGSVPVLTRAETKRKDVLDMNQDSGVNNNSKRSSTVEMNTERVQGLPTETTEGQESIEK

Query:  EVTVLQDRSPNCRKNLKTSILPRCKSLASHKRVRGGAKIDHLEDLENDSSSGKSTCLQTPEFERVREALKTSSLELQALVKDPLPDALRIAESVAQDLAK
        EVTVLQDRSPNCRKNLKTSILPRCKSLASHKRVRGGAKIDHLEDLENDSSSGKSTCLQTPEFERVREALKTSSLELQALVKDPLPDALRIAESVAQDLAK
Subjt:  EVTVLQDRSPNCRKNLKTSILPRCKSLASHKRVRGGAKIDHLEDLENDSSSGKSTCLQTPEFERVREALKTSSLELQALVKDPLPDALRIAESVAQDLAK

Query:  KNKTPEHSLEDQNDAVAANPAINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKYEENR
        KNKTPEHSLEDQNDAVAANPAINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKYEENR
Subjt:  KNKTPEHSLEDQNDAVAANPAINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKYEENR

Query:  LVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYREIFDDRTEPNEQADVCSSSNGLPHFCSTNLVKVNIFSPFSFMNTAEIVLWS
        LVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYREIFDDRTEPNEQADVCSSSNGLPHFCSTNLVKVNIFSPFSFMNTAEIVLWS
Subjt:  LVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYREIFDDRTEPNEQADVCSSSNGLPHFCSTNLVKVNIFSPFSFMNTAEIVLWS

XP_022947187.1 uncharacterized protein LOC111451133 [Cucurbita moschata]5.9e-29297.92Show/hide
Query:  MNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLAVEGMKNNGKYFDTVSRI
        MNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLAVEGMKNNGKYFDTVSRI
Subjt:  MNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLAVEGMKNNGKYFDTVSRI

Query:  WRGRLMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRNEMQSVQLEQAIDKTAVA
        WRGR+MELKRSGRSELVSRE EEWKDEVEAALWDK VWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRNEMQSVQLEQAIDKTAV 
Subjt:  WRGRLMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRNEMQSVQLEQAIDKTAVA

Query:  SEDVGGSGGIELPSQTENHARPEHQGSVPVLTRAETKRKDVLDMNQDSGVNNNSKRSSTVEMNTERVQGLPTETTEGQESIEKEVTVLQDRSPNCRKNLK
        SEDVGGSGGIE PS+TENHARPEHQGSVPVLTRAETKR DVLDMNQDSGVN+NSKRSSTVEMNTERVQ LPTETTEGQESIEKEVTVLQDRSPNCRKNLK
Subjt:  SEDVGGSGGIELPSQTENHARPEHQGSVPVLTRAETKRKDVLDMNQDSGVNNNSKRSSTVEMNTERVQGLPTETTEGQESIEKEVTVLQDRSPNCRKNLK

Query:  TSILPRCKSLASHKRVRGGAKIDHLEDLENDSSSGKSTCLQTPEFERVREALKTSSLELQALVKDPLPDALRIAESVAQDLAKKNKTPEHSLEDQNDAVA
        TSILPRCKSLASHKRVRGGAKIDHLE LENDSSSGKSTCLQTPEF+RVREALKTSSLELQALVKDPLPDALRIAESVAQDLAKKNKTPEHSLEDQNDAVA
Subjt:  TSILPRCKSLASHKRVRGGAKIDHLEDLENDSSSGKSTCLQTPEFERVREALKTSSLELQALVKDPLPDALRIAESVAQDLAKKNKTPEHSLEDQNDAVA

Query:  ANPAINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKYEENRLVWRRKCKRWSLLEEDT
        ANPAINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKYEENRLVWRRKCKRWSLLEEDT
Subjt:  ANPAINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKYEENRLVWRRKCKRWSLLEEDT

Query:  LRTAVQRFGKGNWKLILNSYREIFDDRTE
        LRTAVQRFGKGNWKLILNSYREIFDDRTE
Subjt:  LRTAVQRFGKGNWKLILNSYREIFDDRTE

XP_023007149.1 uncharacterized protein LOC111499729 isoform X1 [Cucurbita maxima]6.1e-29795.79Show/hide
Query:  MDKDVCRWIIEFILRSSMNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLA
        MDKDVCRWIIEFILRSSMND LLKRTLA+MPFPDNDFRLKKTVLLRAIESERSEA+VTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLA
Subjt:  MDKDVCRWIIEFILRSSMNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLA

Query:  VEGMKNNGKYFDTVSRIWRGRLMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRN
        VEGMKNNGKYFDTVSRIWRGR+MELKRSGRSELVSRELEEWKD+VEAALWDKTVWKKLVNMN+RYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRN
Subjt:  VEGMKNNGKYFDTVSRIWRGRLMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRN

Query:  EMQSVQLEQAIDKTAVASEDVGGSGGIELPSQTENHARPEHQGSVPVLTRAETKRKDVLDMNQDSGVNNNSKRSSTVEMNTERVQGLPTETTEGQESIEK
        EMQSVQLEQAI KTAV SEDVGGSGGIELPSQTENHA+ EHQGSVPVLTRAETKR DVLD+NQDSGVN+NSKRSSTVEMNTERVQ LPTETTEG+ES+EK
Subjt:  EMQSVQLEQAIDKTAVASEDVGGSGGIELPSQTENHARPEHQGSVPVLTRAETKRKDVLDMNQDSGVNNNSKRSSTVEMNTERVQGLPTETTEGQESIEK

Query:  EVTVLQDRSPNCRKNLKTSILPRCKSLASHKRVRGGAKIDHLEDLENDSSSGKSTCLQTPEFERVREALKTSSLELQALVKDPLPDALRIAESVAQDLAK
        EVTVLQD SPNCRKNLKTS+LPRCKSLASHKRVRGGAKI HLEDLENDSSSGKSTCLQTPEF+RVREA KTSSLELQALVKDPLPDALRIAESVAQDLAK
Subjt:  EVTVLQDRSPNCRKNLKTSILPRCKSLASHKRVRGGAKIDHLEDLENDSSSGKSTCLQTPEFERVREALKTSSLELQALVKDPLPDALRIAESVAQDLAK

Query:  KNKTPEHSLEDQNDAVAANPAINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKYEENR
        KNKTPEHSLEDQNDAVAANPAINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKYEE R
Subjt:  KNKTPEHSLEDQNDAVAANPAINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKYEENR

Query:  LVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYREIFDDRTE
        LVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYREIFD+RTE
Subjt:  LVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYREIFDDRTE

XP_023532585.1 uncharacterized protein LOC111794704 [Cucurbita pepo subsp. pepo]2.9e-29996.7Show/hide
Query:  MDKDVCRWIIEFILRSSMNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLA
        MDKDVCRWIIEFILRSSMNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLA
Subjt:  MDKDVCRWIIEFILRSSMNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLA

Query:  VEGMKNNGKYFDTVSRIWRGRLMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRN
        VEGMKNNGKYFDTVSRIWRGR+MELK+SGRSELVSRELEEWKD+VE ALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRN
Subjt:  VEGMKNNGKYFDTVSRIWRGRLMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRN

Query:  EMQSVQLEQAIDKTAVASEDVGGSGGIELPSQTENHARPEHQGSVPVLTRAETKRKDVLDMNQDSGVNNNSKRSSTVEMNTERVQGLPTETTEGQESIEK
        EM SVQLEQAIDKTAV SEDVGGSGGIELPS+TENHARP+HQGSVPVLTRAETKR DVLDMNQDSGVN++SKRSSTVEMNTERVQ LPTETTEGQESIEK
Subjt:  EMQSVQLEQAIDKTAVASEDVGGSGGIELPSQTENHARPEHQGSVPVLTRAETKRKDVLDMNQDSGVNNNSKRSSTVEMNTERVQGLPTETTEGQESIEK

Query:  EVTVLQDRSPNCRKNLKTSILPRCKSLASHKRVRGGAKIDHLEDLENDSSSGKSTCLQTPEFERVREALKTSSLELQALVKDPLPDALRIAESVAQDLAK
        EVTVLQD SPNCRKNLKTS+LPRCKSLASHKRVRGGAKIDHLEDLENDSSSGKSTCLQTPEF+RVREALKTSSLELQALVKDPLPDALRIA+SVAQDLAK
Subjt:  EVTVLQDRSPNCRKNLKTSILPRCKSLASHKRVRGGAKIDHLEDLENDSSSGKSTCLQTPEFERVREALKTSSLELQALVKDPLPDALRIAESVAQDLAK

Query:  KNKTPEHSLEDQNDAVAANPAINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKYEENR
        KNKTPEHSLEDQNDAVAANPAINK NMPLQSMNTAFNNP HGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKYEENR
Subjt:  KNKTPEHSLEDQNDAVAANPAINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKYEENR

Query:  LVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYREIFDDRTE
        LVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYREIFDDRTE
Subjt:  LVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYREIFDDRTE

TrEMBL top hitse value%identityAlignment
A0A6J1FVK5 uncharacterized protein LOC111448860 isoform X13.3e-20870.18Show/hide
Query:  MDKDVCRWIIEFILRSSMNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLA
        M++D+CRWI EFILRSSM+DHLLKR LAV+P  D DFRLKKTVLLRAIESE SEAV+TEK+L IFEMIEQL+K EGL +M+SMK+AYCAVAVECTVKYL 
Subjt:  MDKDVCRWIIEFILRSSMNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLA

Query:  VEGMKNNGKYFDTVSRIWRGRLMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRN
        VEG+  +G+YFD V RIWRGR+       ++ELVS E + WKDEVEA+L D  + KKLV+MNTRY+ALKLIGDYLGE+W  +GPSFL+LSASL+D + RN
Subjt:  VEGMKNNGKYFDTVSRIWRGRLMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRN

Query:  EMQSVQLEQAIDKTAVASEDVGGSGGIELPSQTENHARPEHQGSVPVLTRAETKRKDVLDMNQDSGVNNNSKRSSTVEMNTERVQGLPTETTEGQESIEK
        EM S+QLEQ  +  A+ S DVGGSGGIELPSQ EN  R E QGS  VL++ E++R D+L  +QD G N+ SK+S+   MNTERVQ L TET EGQES EK
Subjt:  EMQSVQLEQAIDKTAVASEDVGGSGGIELPSQTENHARPEHQGSVPVLTRAETKRKDVLDMNQDSGVNNNSKRSSTVEMNTERVQGLPTETTEGQESIEK

Query:  EVTVLQDRSPNCRKNLKTSILPRCKSLASHKRVRGGAKIDHLEDL----ENDSSSGKSTCLQTPEFERVREALKTSSLELQALVKDPLPDALRIAESVAQ
        EV VLQ+ S +CR+ LKTS+LPR KSLA H+RVRGG KI HLEDL    E++SSS +  CL+TPE  RVREALKTSSLELQA+VKDPLPDALRIAESVAQ
Subjt:  EVTVLQDRSPNCRKNLKTSILPRCKSLASHKRVRGGAKIDHLEDL----ENDSSSGKSTCLQTPEFERVREALKTSSLELQALVKDPLPDALRIAESVAQ

Query:  DLAKKNKTPEHSLEDQNDAVAANPAINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKY
        DLA+KNKT E+SLED+NDA   NP INK+ +PLQ M+    +P HGH+T+ PRPSIMERNS+ACTYEWNDSID  PE + ASRLHL SPKRK ISPLKKY
Subjt:  DLAKKNKTPEHSLEDQNDAVAANPAINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKY

Query:  EENRLVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYREIFDDRTE
        EE + V RR+CK+WSLLEEDTLRTAVQRFGKGNWKLILNSYR+IFD+RTE
Subjt:  EENRLVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYREIFDDRTE

A0A6J1G5R7 uncharacterized protein LOC1114511332.8e-29297.92Show/hide
Query:  MNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLAVEGMKNNGKYFDTVSRI
        MNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLAVEGMKNNGKYFDTVSRI
Subjt:  MNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLAVEGMKNNGKYFDTVSRI

Query:  WRGRLMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRNEMQSVQLEQAIDKTAVA
        WRGR+MELKRSGRSELVSRE EEWKDEVEAALWDK VWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRNEMQSVQLEQAIDKTAV 
Subjt:  WRGRLMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRNEMQSVQLEQAIDKTAVA

Query:  SEDVGGSGGIELPSQTENHARPEHQGSVPVLTRAETKRKDVLDMNQDSGVNNNSKRSSTVEMNTERVQGLPTETTEGQESIEKEVTVLQDRSPNCRKNLK
        SEDVGGSGGIE PS+TENHARPEHQGSVPVLTRAETKR DVLDMNQDSGVN+NSKRSSTVEMNTERVQ LPTETTEGQESIEKEVTVLQDRSPNCRKNLK
Subjt:  SEDVGGSGGIELPSQTENHARPEHQGSVPVLTRAETKRKDVLDMNQDSGVNNNSKRSSTVEMNTERVQGLPTETTEGQESIEKEVTVLQDRSPNCRKNLK

Query:  TSILPRCKSLASHKRVRGGAKIDHLEDLENDSSSGKSTCLQTPEFERVREALKTSSLELQALVKDPLPDALRIAESVAQDLAKKNKTPEHSLEDQNDAVA
        TSILPRCKSLASHKRVRGGAKIDHLE LENDSSSGKSTCLQTPEF+RVREALKTSSLELQALVKDPLPDALRIAESVAQDLAKKNKTPEHSLEDQNDAVA
Subjt:  TSILPRCKSLASHKRVRGGAKIDHLEDLENDSSSGKSTCLQTPEFERVREALKTSSLELQALVKDPLPDALRIAESVAQDLAKKNKTPEHSLEDQNDAVA

Query:  ANPAINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKYEENRLVWRRKCKRWSLLEEDT
        ANPAINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKYEENRLVWRRKCKRWSLLEEDT
Subjt:  ANPAINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKYEENRLVWRRKCKRWSLLEEDT

Query:  LRTAVQRFGKGNWKLILNSYREIFDDRTE
        LRTAVQRFGKGNWKLILNSYREIFDDRTE
Subjt:  LRTAVQRFGKGNWKLILNSYREIFDDRTE

A0A6J1JF51 uncharacterized protein LOC111484457 isoform X11.7e-20468.73Show/hide
Query:  MDKDVCRWIIEFILRSSMNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLA
        M++D+CRWI EFILRSSM+DHLLKR LAV+P  + DFRLKKTVLLRAIESE SEAV+T+K+L IFEMIEQL+K EGL +M+SMK+AYCAVAVECTVKYL 
Subjt:  MDKDVCRWIIEFILRSSMNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLA

Query:  VEGMKNNGKYFDTVSRIWRGRLMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRN
        VEG+  +G+YFD V RIWRG +       ++ELVS E + WKDEVEA+L D  + KKLV+MNTRY+ALKLIGDYLGE+W  +GPSFL+LSASL+D + RN
Subjt:  VEGMKNNGKYFDTVSRIWRGRLMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRN

Query:  EMQSVQLEQAIDKTAVASEDVGGSGGIELPSQTENHARPEHQGSVPVLTRAETKRKDVLDMNQDSGVNNNSKRSSTVEMNTERVQGLPTETTEGQESIEK
        EM S+QLE+  +  A+ S DVGGSGGIELPSQ EN  + E QGS  VL++ E++R D+L  +QD G N+ SK+S+   MNTERVQ L TET EGQES EK
Subjt:  EMQSVQLEQAIDKTAVASEDVGGSGGIELPSQTENHARPEHQGSVPVLTRAETKRKDVLDMNQDSGVNNNSKRSSTVEMNTERVQGLPTETTEGQESIEK

Query:  EVTVLQDRSPNCRKNLKTSILPRCKSLASHKRVRGGAKIDHLEDL----ENDSSSGKSTCLQTPEFERVREALKTSSLELQALVKDPLPDALRIAESVAQ
        EV VLQ+ S +CR+ LKTS+LPRCKSLA H+RV GG KI HLEDL    E++SSS +  CL+TPE  RVREALKTSSLELQA+ KDPLPDALRIAESVA 
Subjt:  EVTVLQDRSPNCRKNLKTSILPRCKSLASHKRVRGGAKIDHLEDL----ENDSSSGKSTCLQTPEFERVREALKTSSLELQALVKDPLPDALRIAESVAQ

Query:  DLAKKNKTPEHSLEDQNDAVAANPAINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKY
        DLA+KNKT E+SLED+NDA   NP INK+ +PLQ M+    +P HGH+T+ PRPSIMERNS+ACTYEWNDSID  PE + ASRLHL SPKRK +SPLKKY
Subjt:  DLAKKNKTPEHSLEDQNDAVAANPAINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKY

Query:  EENRLVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYREIFDDRTE
        EE + V RR+CK+WSLLEEDTLRTAVQRFGKGNWKLILNSYR+IFD+RTE
Subjt:  EENRLVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYREIFDDRTE

A0A6J1L270 uncharacterized protein LOC111499729 isoform X21.8e-28695.65Show/hide
Query:  MNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLAVEGMKNNGKYFDTVSRI
        MND LLKRTLA+MPFPDNDFRLKKTVLLRAIESERSEA+VTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLAVEGMKNNGKYFDTVSRI
Subjt:  MNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLAVEGMKNNGKYFDTVSRI

Query:  WRGRLMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRNEMQSVQLEQAIDKTAVA
        WRGR+MELKRSGRSELVSRELEEWKD+VEAALWDKTVWKKLVNMN+RYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRNEMQSVQLEQAI KTAV 
Subjt:  WRGRLMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRNEMQSVQLEQAIDKTAVA

Query:  SEDVGGSGGIELPSQTENHARPEHQGSVPVLTRAETKRKDVLDMNQDSGVNNNSKRSSTVEMNTERVQGLPTETTEGQESIEKEVTVLQDRSPNCRKNLK
        SEDVGGSGGIELPSQTENHA+ EHQGSVPVLTRAETKR DVLD+NQDSGVN+NSKRSSTVEMNTERVQ LPTETTEG+ES+EKEVTVLQD SPNCRKNLK
Subjt:  SEDVGGSGGIELPSQTENHARPEHQGSVPVLTRAETKRKDVLDMNQDSGVNNNSKRSSTVEMNTERVQGLPTETTEGQESIEKEVTVLQDRSPNCRKNLK

Query:  TSILPRCKSLASHKRVRGGAKIDHLEDLENDSSSGKSTCLQTPEFERVREALKTSSLELQALVKDPLPDALRIAESVAQDLAKKNKTPEHSLEDQNDAVA
        TS+LPRCKSLASHKRVRGGAKI HLEDLENDSSSGKSTCLQTPEF+RVREA KTSSLELQALVKDPLPDALRIAESVAQDLAKKNKTPEHSLEDQNDAVA
Subjt:  TSILPRCKSLASHKRVRGGAKIDHLEDLENDSSSGKSTCLQTPEFERVREALKTSSLELQALVKDPLPDALRIAESVAQDLAKKNKTPEHSLEDQNDAVA

Query:  ANPAINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKYEENRLVWRRKCKRWSLLEEDT
        ANPAINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKYEE RLVWRRKCKRWSLLEEDT
Subjt:  ANPAINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKYEENRLVWRRKCKRWSLLEEDT

Query:  LRTAVQRFGKGNWKLILNSYREIFDDRTE
        LRTAVQRFGKGNWKLILNSYREIFD+RTE
Subjt:  LRTAVQRFGKGNWKLILNSYREIFDDRTE

A0A6J1L461 uncharacterized protein LOC111499729 isoform X12.9e-29795.79Show/hide
Query:  MDKDVCRWIIEFILRSSMNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLA
        MDKDVCRWIIEFILRSSMND LLKRTLA+MPFPDNDFRLKKTVLLRAIESERSEA+VTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLA
Subjt:  MDKDVCRWIIEFILRSSMNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLA

Query:  VEGMKNNGKYFDTVSRIWRGRLMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRN
        VEGMKNNGKYFDTVSRIWRGR+MELKRSGRSELVSRELEEWKD+VEAALWDKTVWKKLVNMN+RYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRN
Subjt:  VEGMKNNGKYFDTVSRIWRGRLMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRN

Query:  EMQSVQLEQAIDKTAVASEDVGGSGGIELPSQTENHARPEHQGSVPVLTRAETKRKDVLDMNQDSGVNNNSKRSSTVEMNTERVQGLPTETTEGQESIEK
        EMQSVQLEQAI KTAV SEDVGGSGGIELPSQTENHA+ EHQGSVPVLTRAETKR DVLD+NQDSGVN+NSKRSSTVEMNTERVQ LPTETTEG+ES+EK
Subjt:  EMQSVQLEQAIDKTAVASEDVGGSGGIELPSQTENHARPEHQGSVPVLTRAETKRKDVLDMNQDSGVNNNSKRSSTVEMNTERVQGLPTETTEGQESIEK

Query:  EVTVLQDRSPNCRKNLKTSILPRCKSLASHKRVRGGAKIDHLEDLENDSSSGKSTCLQTPEFERVREALKTSSLELQALVKDPLPDALRIAESVAQDLAK
        EVTVLQD SPNCRKNLKTS+LPRCKSLASHKRVRGGAKI HLEDLENDSSSGKSTCLQTPEF+RVREA KTSSLELQALVKDPLPDALRIAESVAQDLAK
Subjt:  EVTVLQDRSPNCRKNLKTSILPRCKSLASHKRVRGGAKIDHLEDLENDSSSGKSTCLQTPEFERVREALKTSSLELQALVKDPLPDALRIAESVAQDLAK

Query:  KNKTPEHSLEDQNDAVAANPAINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKYEENR
        KNKTPEHSLEDQNDAVAANPAINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKYEE R
Subjt:  KNKTPEHSLEDQNDAVAANPAINKDNMPLQSMNTAFNNPRHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKYEENR

Query:  LVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYREIFDDRTE
        LVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYREIFD+RTE
Subjt:  LVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYREIFDDRTE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G06910.1 TRF-like 72.8e-1024.19Show/hide
Query:  KTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLAVEGMKNNGKYFDTVSRIWRGRLMELKRSGRSELVSRELEE
        K  +L  I  E  + +V EK L   E + ++   EG  +  S+  AYC VAVECTVK LA E       Y + +  IW GR+M L     S LV+ +L +
Subjt:  KTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLAVEGMKNNGKYFDTVSRIWRGRLMELKRSGRSELVSRELEE

Query:  WKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASL-MDNRTRNEMQSVQLEQAIDKTAVASEDVGGSGGIELPSQTENHARP
            +  A  D    K L++ +TR +AL  +   +           L+L+ +L ++N   +E   ++  +  ++T    E   G G              
Subjt:  WKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASL-MDNRTRNEMQSVQLEQAIDKTAVASEDVGGSGGIELPSQTENHARP

Query:  EHQGSVPVLTRAETKRKDVLDMNQDSGVNNNSKRSSTVEMNTERVQGLPTETTEGQESIEKEVTVLQDRSPNCRKNLKTSILPRCKSLASHKRVRGGAKI
                                    N NS+ S  +E              + QES+                 L T +          +   GG+K 
Subjt:  EHQGSVPVLTRAETKRKDVLDMNQDSGVNNNSKRSSTVEMNTERVQGLPTETTEGQESIEKEVTVLQDRSPNCRKNLKTSILPRCKSLASHKRVRGGAKI

Query:  DHLEDLENDSSSGKSTCLQTPEFERVREALKTSSLELQALVKDPLPDALRIAESVAQDLAKKNKTPEHSLEDQNDAVAANPAINKDNMPLQSMNTAFNNP
         ++    N         + +   +R    L+ S +EL   ++   P                N   E   E +ND   ANP+         + N A    
Subjt:  DHLEDLENDSSSGKSTCLQTPEFERVREALKTSSLELQALVKDPLPDALRIAESVAQDLAKKNKTPEHSLEDQNDAVAANPAINKDNMPLQSMNTAFNNP

Query:  RHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPER--NRASRLHLPSPKRKVISPLKKYEENRLVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLI
                PRPS+ME  S+A TYEWNDSID S     +   R++    KR V+SPLK+   +    R K   WS  E   +    +++G  NWK I
Subjt:  RHGHQTIVPRPSIMERNSSACTYEWNDSIDGSPER--NRASRLHLPSPKRKVISPLKKYEENRLVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLI

AT1G15720.1 TRF-like 51.5e-2229.74Show/hide
Query:  RWIIEFILRSSMNDHLLKRTL--AVMPFPDND-FRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLAVEG
        +W+ EF LR  +N  +    L  A+ P   +D  +LK T +LR I +   +  V E +L + E++E+L   E   +M S+KSAYC  AVECT++++    
Subjt:  RWIIEFILRSSMNDHLLKRTL--AVMPFPDND-FRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLAVEG

Query:  MKNNGKYFDTVSRIWRGRLMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRNEMQ
          ++G + D + RIWR R+  LK    S+LV+REL +W+ ++  A  +  +++K+   N RY A+  +   L E W +LG S LE  A     + ++   
Subjt:  MKNNGKYFDTVSRIWRGRLMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRNEMQ

Query:  SVQLEQAIDKTAVASEDVGGSGGIELPSQT--ENHARPEHQGSV---PVLTRAETKR-----KDVLD--MNQDSGVNNNSKRSSTVEMNTERVQGLPTET
        + +     +K    + DV   GG+E P      N    EH+ S+    +L   E K      + ++D    QD   NN  + S  V    +       E 
Subjt:  SVQLEQAIDKTAVASEDVGGSGGIELPSQT--ENHARPEHQGSV---PVLTRAETKR-----KDVLD--MNQDSGVNNNSKRSSTVEMNTERVQGLPTET

Query:  TEGQES
        +EG  S
Subjt:  TEGQES

AT5G58340.1 myb-like HTH transcriptional regulator family protein4.4e-1931.35Show/hide
Query:  RWIIEFILRSSMNDHLLK-RTLAVMPFPDND--FRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLAVEG
        +W+ EF L    N        ++ +   D+    +LK + +LR I +      + E +L + E++E+L   +   +MDS KSAYC  A ECT++++    
Subjt:  RWIIEFILRSSMNDHLLK-RTLAVMPFPDND--FRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLAVEG

Query:  MKNNGKYFDTVSRIWRGRLMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLE
          ++G + D + RIW  R+  LK SG S+LV+ +L +W+ +++ AL D  +++++   N RY A+  +   L E W +LG S LE
Subjt:  MKNNGKYFDTVSRIWRGRLMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLE

AT5G58340.2 myb-like HTH transcriptional regulator family protein4.4e-1931.35Show/hide
Query:  RWIIEFILRSSMNDHLLK-RTLAVMPFPDND--FRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLAVEG
        +W+ EF L    N        ++ +   D+    +LK + +LR I +      + E +L + E++E+L   +   +MDS KSAYC  A ECT++++    
Subjt:  RWIIEFILRSSMNDHLLK-RTLAVMPFPDND--FRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLAVEG

Query:  MKNNGKYFDTVSRIWRGRLMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLE
          ++G + D + RIW  R+  LK SG S+LV+ +L +W+ +++ AL D  +++++   N RY A+  +   L E W +LG S LE
Subjt:  MKNNGKYFDTVSRIWRGRLMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACAAGGACGTTTGCCGATGGATTATCGAGTTCATACTCCGGAGTTCAATGAACGATCACTTGTTGAAGAGAACTCTTGCGGTTATGCCCTTTCCGGACAACGATTT
CCGTCTGAAGAAGACGGTTCTATTACGGGCTATTGAGAGCGAAAGATCAGAAGCTGTAGTAACAGAGAAAGTACTCGCGATTTTCGAGATGATTGAGCAGTTGGACAAGA
CTGAAGGCCTTGCAATGATGGATTCAATGAAATCTGCTTATTGTGCCGTGGCAGTGGAGTGCACGGTCAAGTACTTGGCGGTGGAAGGCATGAAGAACAATGGAAAGTAT
TTTGATACAGTGAGTAGGATTTGGAGAGGGAGATTGATGGAATTGAAGAGGTCGGGAAGGAGTGAATTGGTTTCACGTGAATTGGAAGAATGGAAGGATGAAGTAGAAGC
GGCACTCTGGGATAAGACTGTCTGGAAGAAACTGGTGAATATGAATACTAGATATGAAGCTTTGAAACTGATAGGAGATTATCTGGGTGAGGCTTGGGGTGTTCTTGGTC
CCTCATTCCTAGAATTATCTGCTTCTTTGATGGATAACAGGACGAGGAATGAAATGCAGTCTGTTCAATTGGAGCAGGCAATAGACAAAACCGCTGTTGCGAGTGAAGAT
GTTGGTGGGAGTGGTGGGATTGAATTGCCCTCCCAAACTGAAAACCATGCGAGACCGGAGCATCAGGGGAGCGTGCCAGTCCTTACCCGAGCTGAAACAAAGAGAAAAGA
TGTGCTAGATATGAATCAGGATTCGGGTGTCAACAACAATTCTAAACGATCGTCCACTGTTGAAATGAACACTGAGAGAGTTCAAGGGTTGCCTACAGAAACAACGGAAG
GGCAAGAATCAATAGAGAAGGAAGTTACAGTTTTGCAAGATCGAAGTCCAAACTGCCGCAAAAATTTGAAGACTTCTATCTTGCCAAGATGCAAATCCCTTGCATCCCAT
AAGCGTGTTAGAGGAGGGGCTAAAATTGATCATCTTGAGGACTTGGAGAATGACAGTTCATCTGGAAAGTCTACTTGCCTACAAACTCCAGAATTTGAGCGGGTACGAGA
AGCGCTTAAAACCAGCTCTTTAGAGTTGCAAGCATTGGTGAAAGATCCACTTCCTGATGCATTACGCATTGCGGAATCTGTAGCTCAAGATCTTGCTAAAAAGAATAAAA
CTCCTGAGCATTCTTTGGAGGACCAAAACGATGCAGTTGCTGCGAATCCAGCCATTAACAAGGATAATATGCCTCTTCAATCCATGAATACAGCTTTCAACAATCCACGC
CATGGACATCAGACAATTGTTCCACGACCTAGTATAATGGAACGCAACAGTTCAGCTTGTACATATGAGTGGAATGATTCAATAGATGGTTCTCCGGAAAGAAATCGCGC
TAGCCGACTTCACCTACCTAGTCCCAAGAGAAAGGTCATTTCTCCCTTGAAGAAGTATGAAGAAAACAGACTGGTTTGGAGAAGGAAGTGTAAGAGGTGGAGCTTGCTTG
AAGAAGATACCTTAAGAACTGCTGTGCAAAGGTTTGGTAAAGGAAATTGGAAGCTCATCTTAAATAGCTATCGTGAAATATTTGATGATAGAACAGAGCCGAATGAGCAA
GCTGATGTTTGTTCCAGTAGCAATGGTTTGCCACACTTCTGCTCAACAAATTTGGTGAAGGTAAACATATTTTCTCCTTTTTCATTTATGAACACTGCTGAAATTGTGCT
CTGGTCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACAAGGACGTTTGCCGATGGATTATCGAGTTCATACTCCGGAGTTCAATGAACGATCACTTGTTGAAGAGAACTCTTGCGGTTATGCCCTTTCCGGACAACGATTT
CCGTCTGAAGAAGACGGTTCTATTACGGGCTATTGAGAGCGAAAGATCAGAAGCTGTAGTAACAGAGAAAGTACTCGCGATTTTCGAGATGATTGAGCAGTTGGACAAGA
CTGAAGGCCTTGCAATGATGGATTCAATGAAATCTGCTTATTGTGCCGTGGCAGTGGAGTGCACGGTCAAGTACTTGGCGGTGGAAGGCATGAAGAACAATGGAAAGTAT
TTTGATACAGTGAGTAGGATTTGGAGAGGGAGATTGATGGAATTGAAGAGGTCGGGAAGGAGTGAATTGGTTTCACGTGAATTGGAAGAATGGAAGGATGAAGTAGAAGC
GGCACTCTGGGATAAGACTGTCTGGAAGAAACTGGTGAATATGAATACTAGATATGAAGCTTTGAAACTGATAGGAGATTATCTGGGTGAGGCTTGGGGTGTTCTTGGTC
CCTCATTCCTAGAATTATCTGCTTCTTTGATGGATAACAGGACGAGGAATGAAATGCAGTCTGTTCAATTGGAGCAGGCAATAGACAAAACCGCTGTTGCGAGTGAAGAT
GTTGGTGGGAGTGGTGGGATTGAATTGCCCTCCCAAACTGAAAACCATGCGAGACCGGAGCATCAGGGGAGCGTGCCAGTCCTTACCCGAGCTGAAACAAAGAGAAAAGA
TGTGCTAGATATGAATCAGGATTCGGGTGTCAACAACAATTCTAAACGATCGTCCACTGTTGAAATGAACACTGAGAGAGTTCAAGGGTTGCCTACAGAAACAACGGAAG
GGCAAGAATCAATAGAGAAGGAAGTTACAGTTTTGCAAGATCGAAGTCCAAACTGCCGCAAAAATTTGAAGACTTCTATCTTGCCAAGATGCAAATCCCTTGCATCCCAT
AAGCGTGTTAGAGGAGGGGCTAAAATTGATCATCTTGAGGACTTGGAGAATGACAGTTCATCTGGAAAGTCTACTTGCCTACAAACTCCAGAATTTGAGCGGGTACGAGA
AGCGCTTAAAACCAGCTCTTTAGAGTTGCAAGCATTGGTGAAAGATCCACTTCCTGATGCATTACGCATTGCGGAATCTGTAGCTCAAGATCTTGCTAAAAAGAATAAAA
CTCCTGAGCATTCTTTGGAGGACCAAAACGATGCAGTTGCTGCGAATCCAGCCATTAACAAGGATAATATGCCTCTTCAATCCATGAATACAGCTTTCAACAATCCACGC
CATGGACATCAGACAATTGTTCCACGACCTAGTATAATGGAACGCAACAGTTCAGCTTGTACATATGAGTGGAATGATTCAATAGATGGTTCTCCGGAAAGAAATCGCGC
TAGCCGACTTCACCTACCTAGTCCCAAGAGAAAGGTCATTTCTCCCTTGAAGAAGTATGAAGAAAACAGACTGGTTTGGAGAAGGAAGTGTAAGAGGTGGAGCTTGCTTG
AAGAAGATACCTTAAGAACTGCTGTGCAAAGGTTTGGTAAAGGAAATTGGAAGCTCATCTTAAATAGCTATCGTGAAATATTTGATGATAGAACAGAGCCGAATGAGCAA
GCTGATGTTTGTTCCAGTAGCAATGGTTTGCCACACTTCTGCTCAACAAATTTGGTGAAGGTAAACATATTTTCTCCTTTTTCATTTATGAACACTGCTGAAATTGTGCT
CTGGTCCTGATTTCTATTCATTGCTGCATCTGCATTCATCTTCATTAGTAGTTGTCCACACATCTGAAAACTTTCATTAATACGACGACGACGAAGTTTTCGTTGAAGAC
TTAGGCTTAGATTAGAGAGTTTCTCAAAACAAATTTCAGATACCAGGATAGACTCAGTATGTCATTTAGCTCATTCAAGCCAGAAAAAAAAAAAAAAAAAAAAAAAACCT
GGTTGCCCGACATGAACACAACATGTTTCATTCCAACAAAGCTTTCAGGCACCTCATGAACATAACAACAAGGCTTCATATGCCAAAACAAACATTTCAACGAAACCCAT
TTGAAGAATCCCCCTAAGAATGTAAAAGAAAAAAGTTGGTCCTTAAATACAAATCAAATGAATGCTATTGAAGTGGAGGAGAAACTTCTCTAAATAATCGGAGGTCGATC
TATAAACAATAAACTAAATAGCATGAATCATCAACAAGTCAGTTGGGTCAAGCGTCAGGAGCCAGTTTTGCATCAAAAAGTTGTACTGCTCTCACTTGCAGAGATTGAGG
GGACCTCAGATTTCACAAGTGACGATCCTTTACTACTGATGTCTAGACCCTGTAATCGGCCGAGAGGGTCGCTCTGCAGAACCTCCGAGAGAGAATGAGATTCTGATGGA
TGCATAGGGTGGCTAGACATGTTAGATGGGGAATGGCTATATGGTGGCAGTTGCATATTCTGAGGACCTGCTGGTTGCTGTGAAAGTTGAACGAAAGACGATGGGGCATA
TGCCATGTGATGCATTCCCAAGTTAAAAGACTCAGAAGGGCTCATCATTTCCCCGGTAGCGAATTTAAGCCTCTCGACTTCCTTCTTTAATGCATCGTTTAGAGCTGCAT
ACAGTTCAGGACAATTTCAAACAAGACTGTGATAGTAACATATGTTCAAAACTAAGTACTGAAGTATAATCCTCACCATCACGCAATTGAGCTTGTTGCTCCATAGCCTG
TAACCTAAGCTTAAGCTCTGTATTCTCGTTACTGAGACCCCTTGTATCTCGCTGTGATGACGAGTCTTAATTAGATCATAGTTCTAAGTCCATTGTCAGTTAACATGAAT
ATGATGAAGGGATAAAATTGCGAAACGATATCAAAATTTGTAATCAGTTAGAAAATAGCAACTCTCCCCTTCCAACAAAGAATATAAGGTTCAATCAGAGTGAAAAATTG
TGGAAAAATGCAAGCTTAGGTTATATTGAAAGACAAAAACCTGGAACAAAGTTAATTGGGCTGATAGAGTAGTTGCTTCTGTTTGAAGGGTCTGCACCTTGCGCTCCAAC
TCTTGTATATACCGTGCCTTCCTCTCTTTTGAGCGAGCAGCAGACTGTCGATTTGCTAATATCCTACACATATCAGCAAGAACATGAGAAAGAGGTTCCCAGGAAATCCA
AAACCTT
Protein sequenceShow/hide protein sequence
MDKDVCRWIIEFILRSSMNDHLLKRTLAVMPFPDNDFRLKKTVLLRAIESERSEAVVTEKVLAIFEMIEQLDKTEGLAMMDSMKSAYCAVAVECTVKYLAVEGMKNNGKY
FDTVSRIWRGRLMELKRSGRSELVSRELEEWKDEVEAALWDKTVWKKLVNMNTRYEALKLIGDYLGEAWGVLGPSFLELSASLMDNRTRNEMQSVQLEQAIDKTAVASED
VGGSGGIELPSQTENHARPEHQGSVPVLTRAETKRKDVLDMNQDSGVNNNSKRSSTVEMNTERVQGLPTETTEGQESIEKEVTVLQDRSPNCRKNLKTSILPRCKSLASH
KRVRGGAKIDHLEDLENDSSSGKSTCLQTPEFERVREALKTSSLELQALVKDPLPDALRIAESVAQDLAKKNKTPEHSLEDQNDAVAANPAINKDNMPLQSMNTAFNNPR
HGHQTIVPRPSIMERNSSACTYEWNDSIDGSPERNRASRLHLPSPKRKVISPLKKYEENRLVWRRKCKRWSLLEEDTLRTAVQRFGKGNWKLILNSYREIFDDRTEPNEQ
ADVCSSSNGLPHFCSTNLVKVNIFSPFSFMNTAEIVLWS