; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh15G012600 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh15G012600
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionUnknown protein
Genome locationCmo_Chr15:8649652..8652314
RNA-Seq ExpressionCmoCh15G012600
SyntenyCmoCh15G012600
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579477.1 hypothetical protein SDJN03_23925, partial [Cucurbita argyrosperma subsp. sororia]1.8e-25998.93Show/hide
Query:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA
        MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA
Subjt:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA

Query:  FKKFPLPSYPKPFCSRSFSRKVILRTFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSGN
        FKKFPLPSYPKPFCSRSFSRKVILRTFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSGN
Subjt:  FKKFPLPSYPKPFCSRSFSRKVILRTFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSGN

Query:  SESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ
        SESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTT FREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHL+Q
Subjt:  SESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ

Query:  GKKQKHAHKPKRFENGVEFEPLDLKKRFADIVVGRQHFGSISRKENQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKFT
        GKKQKHAH+PKRFENGVEFEPLDLKKRFADIVVGRQHFGSISRKE+QREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKFT
Subjt:  GKKQKHAHKPKRFENGVEFEPLDLKKRFADIVVGRQHFGSISRKENQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKFT

Query:  EDWINGDAGEAMATGWESPEGRVLYIKDMEKAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS
        EDWINGDAGEAMATGWESPEGRVLYIKDME AGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS
Subjt:  EDWINGDAGEAMATGWESPEGRVLYIKDMEKAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS

KAG7016946.1 hypothetical protein SDJN02_22057, partial [Cucurbita argyrosperma subsp. argyrosperma]5.4e-25698.29Show/hide
Query:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA
        MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA
Subjt:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA

Query:  FKKFPLPSYPKPFCSRSFSRKVILRTFWKKQDFVDVNT-RRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSG
        FKKFPLPSY KPFCSRSFSRKVILR FWKKQDFVDVNT RRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSG
Subjt:  FKKFPLPSYPKPFCSRSFSRKVILRTFWKKQDFVDVNT-RRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSG

Query:  NSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLV
        NSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTT FREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLV
Subjt:  NSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLV

Query:  QGKKQKHAHKPKRFENGVEFEPLDLKKRFADIVVGRQHFGSISRKENQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKF
        QGKKQKHAH+PKRFENGVEFEPLDLKKRFADIVVG QHFGSISRKE+QREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKF
Subjt:  QGKKQKHAHKPKRFENGVEFEPLDLKKRFADIVVGRQHFGSISRKENQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKF

Query:  TEDWINGDAGEAMATGWESPEGRVLYIKDMEKAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS
        TEDWINGDAGEAMATGWESPEGRVLYIKDME AGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS
Subjt:  TEDWINGDAGEAMATGWESPEGRVLYIKDMEKAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS

XP_022922340.1 uncharacterized protein LOC111430353 [Cucurbita moschata]2.9e-262100Show/hide
Query:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA
        MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA
Subjt:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA

Query:  FKKFPLPSYPKPFCSRSFSRKVILRTFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSGN
        FKKFPLPSYPKPFCSRSFSRKVILRTFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSGN
Subjt:  FKKFPLPSYPKPFCSRSFSRKVILRTFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSGN

Query:  SESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ
        SESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ
Subjt:  SESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ

Query:  GKKQKHAHKPKRFENGVEFEPLDLKKRFADIVVGRQHFGSISRKENQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKFT
        GKKQKHAHKPKRFENGVEFEPLDLKKRFADIVVGRQHFGSISRKENQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKFT
Subjt:  GKKQKHAHKPKRFENGVEFEPLDLKKRFADIVVGRQHFGSISRKENQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKFT

Query:  EDWINGDAGEAMATGWESPEGRVLYIKDMEKAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS
        EDWINGDAGEAMATGWESPEGRVLYIKDMEKAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS
Subjt:  EDWINGDAGEAMATGWESPEGRVLYIKDMEKAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS

XP_022969906.1 uncharacterized protein LOC111468962 [Cucurbita maxima]1.7e-24695.28Show/hide
Query:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA
        MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDS+LTKRFLPRTASRKIALSTISTLQRASDAVVRA
Subjt:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA

Query:  FKKFPLPSYPKPFCSRSFSRKVILRTFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSGN
        FKKFPLPSY KPFC+RSFSRKVILR FWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSE IPSSSSGN
Subjt:  FKKFPLPSYPKPFCSRSFSRKVILRTFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSGN

Query:  SESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ
        SESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTT FREDIVK WPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ
Subjt:  SESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ

Query:  GKKQKHAHKPKRFENGVEFEPLDLKKRFADIVVGRQHFGSISRKENQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKFT
        GKK  HA K KRFENGVEFEPLDLKKRFADIVVG QHF  ISRKE+QREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGAD DQAQVLKFT
Subjt:  GKKQKHAHKPKRFENGVEFEPLDLKKRFADIVVGRQHFGSISRKENQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKFT

Query:  EDWINGDAGEAMATGWESPEGRVLYIKDMEKAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS
        EDWINGDAGEAM TGWE+PEGRVLYIKDME AGKWRS+ GEKEELAAEFEAEVW+SLFDELLIDLS
Subjt:  EDWINGDAGEAMATGWESPEGRVLYIKDMEKAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS

XP_023551213.1 uncharacterized protein LOC111809098 [Cucurbita pepo subsp. pepo]6.8e-25196.35Show/hide
Query:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA
        MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDS+LTKRFLPRTASRKIALSTISTLQRASDAVVRA
Subjt:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA

Query:  FKKFPLPSYPKPFCSRSFSRKVILRTFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSGN
        FKKFPLPSY KPFCSRSFSRKVILR FWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSGN
Subjt:  FKKFPLPSYPKPFCSRSFSRKVILRTFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSGN

Query:  SESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ
        SESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAA PTTPTTT FREDIVK WPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ
Subjt:  SESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ

Query:  GKKQKHAHKPKRFENGVEFEPLDLKKRFADIVVGRQHFGSISRKENQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKFT
        GKKQKHA +PKRFENGVEFEPLDL KRFADIVVGRQHF SISRKE+QREQKAFELLKLVKST TS ENLLLDFFHEKLEENDA ARTGADFDQAQVLKFT
Subjt:  GKKQKHAHKPKRFENGVEFEPLDLKKRFADIVVGRQHFGSISRKENQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKFT

Query:  EDWINGDAGEAMATGWESPEGRVLYIKDMEKAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS
        EDWINGD GEAMATGWESPEGRVLYIKDME AGKWRS+AGEKEELAAEFEAEVWMSLFDELLIDLS
Subjt:  EDWINGDAGEAMATGWESPEGRVLYIKDMEKAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS

TrEMBL top hitse value%identityAlignment
A0A0A0KP06 Uncharacterized protein1.2e-17972.33Show/hide
Query:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA
        MAS DSS W++IS PP +KP S +LKDYLLDDFSSCSSNGFRSFPRRQCC+TTVRFLLEIDLKVKDS++TKRFLPRT SRKIALSTISTLQRASDAV+RA
Subjt:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA

Query:  FKKFPLPSYPKPFCSRSFSRKVILRTFWKKQDFVDVN-TRRCKSFQEFLDEKEPPLS------RSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMI
        FK+FPLPS  K F  RS SRK+I + F KK D VD N  +R KSF+EFLDEKEPP S       SDSAVCTA+ V GRNSISSCSNSISWTESEFTSE+I
Subjt:  FKKFPLPSYPKPFCSRSFSRKVILRTFWKKQDFVDVN-TRRCKSFQEFLDEKEPPLS------RSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMI

Query:  PSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFN
        PSS SGNSESCSENDAVK DKDSPGNLIGKRDGVTFGKDSMEETTTAP++    T +  D+RED VK W NEEEKEQ SPVSVLDFPFEDEDQD  SSFN
Subjt:  PSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFN

Query:  CNLHLVQGKKQKHA-HKPKRFENGVEFEPLDLKKRFADI-VVGRQ-HFGSISRKENQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGAD
        CN+HL++GKKQK    K KR E G E EP+DLKKRF +I V+G Q HF  I++KE+Q E+KA E LKL+KSTT STENLLLDFFH+KL+E++A + T +D
Subjt:  CNLHLVQGKKQKHA-HKPKRFENGVEFEPLDLKKRFADI-VVGRQ-HFGSISRKENQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGAD

Query:  FDQAQVLKFTEDWINGDAGEAMATG-WESPEGRVLYIKDMEKAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS
        FDQ Q+LKF +DWI+G+AGE    G WE PE R  YIKDME   KWRS  G+KEEL AEFE EVW+SL ++LLIDLS
Subjt:  FDQAQVLKFTEDWINGDAGEAMATG-WESPEGRVLYIKDMEKAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS

A0A1S3ATL0 uncharacterized protein LOC1034827061.0e-17270.77Show/hide
Query:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA
        MAS DSS W++IS PP +KP S +LKDYLLDDFSSCSSNGFRSFPRRQCC+TTVRFLLEIDLKVKDS+ TK+FLPRT+SRKIALSTISTLQRASDAV+RA
Subjt:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA

Query:  FKKFPLPSYPKPFCSRSFSRKVILRTFWKKQDFVDVNTRRCKSFQEFLDEKEPPLS------RSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIP
        FK+FPLPS  K F  RS SRK+I + F KK D VD N RR KSF+EFLDEKEPP S       SDSAVCTA+ V GRNSISSCSNSISWTESEFTSE+IP
Subjt:  FKKFPLPSYPKPFCSRSFSRKVILRTFWKKQDFVDVNTRRCKSFQEFLDEKEPPLS------RSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIP

Query:  SSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWP-NEEEKEQLSPVSVLDFPFEDEDQDTPSSFN
        SS SGNSESCSEN AVK DKDSP NLIGKRDGVTFGKDSMEET T PSA         ++RED VK W  NEEEKEQ SPVSVLDFPFEDEDQD  SS N
Subjt:  SSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWP-NEEEKEQLSPVSVLDFPFEDEDQDTPSSFN

Query:  CNLHLVQGKKQKHA-HKPKRFENGVEFEPLDLKKRFADIVV---GRQHFGSISR-KENQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTG
        CN+HL++GKKQK    K KR E G E EP+DLKKRF +I V    + HF  I++ KE+Q E+KA E LKL+KSTT STENLLLDFFH+KL+E++A + T 
Subjt:  CNLHLVQGKKQKHA-HKPKRFENGVEFEPLDLKKRFADIVV---GRQHFGSISR-KENQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTG

Query:  ADFDQAQVLKFTEDWINGDAGEAMATG-WESPEGRVLYIKDMEKAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS
        +DFDQ Q+L+F +DW++G+AGE    G WE PE R  YIKDME A KWRS  G+KEEL AEFEAEVW+SL D+LLIDLS
Subjt:  ADFDQAQVLKFTEDWINGDAGEAMATG-WESPEGRVLYIKDMEKAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS

A0A5A7TN51 Uncharacterized protein2.1e-17370.98Show/hide
Query:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA
        MAS DSS W++IS PP +KP S +LKDYLLDDFSSCSSNGFRSFPRRQCC+TTVRFLLEIDLKVKDS+ TK+FLPRT+SRKIALSTISTLQRASDAV+RA
Subjt:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA

Query:  FKKFPLPSYPKPFCSRSFSRKVILRTFWKKQDFVDVNTRRCKSFQEFLDEKEPPLS------RSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIP
        FK+FPLPS  K F  RS SRK+I + F KK D VD N RR KSF+EFLDEKEPP S       SDSAVCTA+ V GRNSISSCSNSISWTESEFTSE+IP
Subjt:  FKKFPLPSYPKPFCSRSFSRKVILRTFWKKQDFVDVNTRRCKSFQEFLDEKEPPLS------RSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIP

Query:  SSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWP-NEEEKEQLSPVSVLDFPFEDEDQDTPSSFN
        SS SGNSESCSEN AVK DKDSP NLIGKRDGVTFGKDSMEET T PSA         ++RED VK W  NEEEKEQ SPVSVLDFPFEDEDQD  SSFN
Subjt:  SSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWP-NEEEKEQLSPVSVLDFPFEDEDQDTPSSFN

Query:  CNLHLVQGKKQKHA-HKPKRFENGVEFEPLDLKKRFADIVV---GRQHFGSISR-KENQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTG
        CN+HL++GKKQK    K KR E G E EP+DLKKRF +I V    + HF  I++ KE+Q E+KA E LKL+KSTT STENLLLDFFH+KL+E++A + T 
Subjt:  CNLHLVQGKKQKHA-HKPKRFENGVEFEPLDLKKRFADIVV---GRQHFGSISR-KENQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTG

Query:  ADFDQAQVLKFTEDWINGDAGEAMATG-WESPEGRVLYIKDMEKAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS
        +DFDQ Q+L+F +DW++G+AGE    G WE PE R  YIKDME A KWRS  G+KEEL AEFEAEVW+SL D+LLIDLS
Subjt:  ADFDQAQVLKFTEDWINGDAGEAMATG-WESPEGRVLYIKDMEKAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS

A0A6J1E8H2 uncharacterized protein LOC1114303531.4e-262100Show/hide
Query:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA
        MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA
Subjt:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA

Query:  FKKFPLPSYPKPFCSRSFSRKVILRTFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSGN
        FKKFPLPSYPKPFCSRSFSRKVILRTFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSGN
Subjt:  FKKFPLPSYPKPFCSRSFSRKVILRTFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSGN

Query:  SESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ
        SESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ
Subjt:  SESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ

Query:  GKKQKHAHKPKRFENGVEFEPLDLKKRFADIVVGRQHFGSISRKENQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKFT
        GKKQKHAHKPKRFENGVEFEPLDLKKRFADIVVGRQHFGSISRKENQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKFT
Subjt:  GKKQKHAHKPKRFENGVEFEPLDLKKRFADIVVGRQHFGSISRKENQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKFT

Query:  EDWINGDAGEAMATGWESPEGRVLYIKDMEKAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS
        EDWINGDAGEAMATGWESPEGRVLYIKDMEKAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS
Subjt:  EDWINGDAGEAMATGWESPEGRVLYIKDMEKAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS

A0A6J1HZ34 uncharacterized protein LOC1114689628.4e-24795.28Show/hide
Query:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA
        MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDS+LTKRFLPRTASRKIALSTISTLQRASDAVVRA
Subjt:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA

Query:  FKKFPLPSYPKPFCSRSFSRKVILRTFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSGN
        FKKFPLPSY KPFC+RSFSRKVILR FWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSE IPSSSSGN
Subjt:  FKKFPLPSYPKPFCSRSFSRKVILRTFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSGN

Query:  SESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ
        SESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTT FREDIVK WPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ
Subjt:  SESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ

Query:  GKKQKHAHKPKRFENGVEFEPLDLKKRFADIVVGRQHFGSISRKENQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKFT
        GKK  HA K KRFENGVEFEPLDLKKRFADIVVG QHF  ISRKE+QREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGAD DQAQVLKFT
Subjt:  GKKQKHAHKPKRFENGVEFEPLDLKKRFADIVVGRQHFGSISRKENQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKFT

Query:  EDWINGDAGEAMATGWESPEGRVLYIKDMEKAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS
        EDWINGDAGEAM TGWE+PEGRVLYIKDME AGKWRS+ GEKEELAAEFEAEVW+SLFDELLIDLS
Subjt:  EDWINGDAGEAMATGWESPEGRVLYIKDMEKAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G00770.1 unknown protein4.7e-1626.1Show/hide
Query:  SFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKF---PLPSYPKPFCSRSF
        S MLKD LL+D +SCSSNGF+S PRR                           P    RK           A  AV+ A K      + S P     RS 
Subjt:  SFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKF---PLPSYPKPFCSRSF

Query:  SRKVILRTFWKKQDFVD-------VNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSGNSESCSENDAVK
        SR++  +   + Q  +        V     K   E +   EP    + +   T  T     S +SCS   SW++ +FTSE +PSS   N E C E  +VK
Subjt:  SRKVILRTFWKKQDFVD-------VNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSGNSESCSENDAVK

Query:  VDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQGKKQKHAHKPK
               NL         G+DS      A +   P         E++      + EKE  SPVSV +   E+ D+ + SSF+  L  V+  KQK     +
Subjt:  VDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQGKKQKHAHKPK

Query:  RFENGVEFEPLDLKK----RFADIVVGRQ----------HFGSISRK-----ENQREQKAFELLKLVK---STTTSTENLLLDFFHEKLEENDAIARTGA
        RFE+     P +L +      A  + G Q          +  ++ R+      ++ E+KA +L   VK   +     E+L++D+F ++L +         
Subjt:  RFENGVEFEPLDLKK----RFADIVVGRQ----------HFGSISRK-----ENQREQKAFELLKLVK---STTTSTENLLLDFFHEKLEENDAIARTGA

Query:  DFDQAQVLKFTEDWINGDAGEAMATGWESPEGRVLYIKDMEKAGKW--RSLAGEKEELAAEFEAEVWMSLFDELLIDLS
         F+  Q++   + W+ G     +  G  S + R    +++E+   W  + +  E E +  + E E++  L DE L  LS
Subjt:  DFDQAQVLKFTEDWINGDAGEAMATGWESPEGRVLYIKDMEKAGKW--RSLAGEKEELAAEFEAEVWMSLFDELLIDLS

AT4G11780.1 unknown protein4.4e-3030.97Show/hide
Query:  MLKDYLLDDFSSCSSNGFRSFPRRQ--CCATTVRFLLEIDLK----------VKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPS---
        +L+DYLLDD SSCSSNGF+SFPRRQ    ++TVR LL+ ++K           K   LT+R    T    I+      + +AS A +   K  P PS   
Subjt:  MLKDYLLDDFSSCSSNGFRSFPRRQ--CCATTVRFLLEIDLK----------VKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPS---

Query:  YPKPFCSRSFSRKVILRTFWKK------------QDFVDVNTRRCKSFQEFLDEKEPPLSR--------SDSAVCTAVTVVGRNSISSCSNSISWTESEF
          +   SRSFS++++  +FW+K                ++   R  +++E LD++    S+        + S    A+TVV    IS  S+S     SEF
Subjt:  YPKPFCSRSFSRKVILRTFWKK------------QDFVDVNTRRCKSFQEFLDEKEPPLSR--------SDSAVCTAVTVVGRNSISSCSNSISWTESEF

Query:  ----TSEMIPSSSSG-NSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWPNEEEKEQLSPVSVLDFPFED
            +SE++ SSSS  +S S  E++ V  + D+  +  GK  G     DS++      S+           R++ V      EEKEQLSPVS+L+ PF+D
Subjt:  ----TSEMIPSSSSG-NSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWPNEEEKEQLSPVSVLDFPFED

Query:  EDQDTPSSFNCNLHLVQGKKQKHAHKPKRFENGVEFEPLDLKKRFADIVVGRQHFG--SISRKENQREQKAFELLKLVKSTTTST---------ENLLLD
        +D+D   +   + +      +K A K +R    V  EPLDL KR    V  ++ +   ++  +E++ E +A  L  LVK     T         +NLLLD
Subjt:  EDQDTPSSFNCNLHLVQGKKQKHAHKPKRFENGVEFEPLDLKKRFADIVVGRQHFG--SISRKENQREQKAFELLKLVKSTTTST---------ENLLLD

Query:  FFHEKLEENDAIARTGADFDQAQVLKFTEDWINGDAGEAMATGWESPEGRVLYIKDMEKAGKWRSLAG-EKEELAAEFEAEVWMSLFDELLIDL
        +  E     D I       ++  ++K  EDW+ G   E M   WE    R +Y+K+M    KW  + G E+E +  E     + S  DE + DL
Subjt:  FFHEKLEENDAIARTGADFDQAQVLKFTEDWINGDAGEAMATGWESPEGRVLYIKDMEKAGKWRSLAG-EKEELAAEFEAEVWMSLFDELLIDL

AT4G23020.1 unknown protein9.1e-2828.8Show/hide
Query:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKI--ALSTISTLQRASDAVV
        M SI SS   +       KP   +L+D+LLDD SSCSSNGF+SFPR             ++ +++ S +         +R+I   L+    + +AS A++
Subjt:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKI--ALSTISTLQRASDAVV

Query:  RAFKKFPLPSYPKPFC-SRSFSRKVILRTFWKKQDFVDVNT----------------RRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSI
         A K  P PS  K     R   + +  R+FWKK    ++N                 +RC+SF EFL E +  LS     +       G  ++S      
Subjt:  RAFKKFPLPSYPKPFC-SRSFSRKVILRTFWKKQDFVDVNT----------------RRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSI

Query:  SWTESEFTSEMIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWPNEEEKEQLSPVSVLDFPF
                       + G+S S S  D+ +V + S G ++    G   G    + +            +  D  E+        EEKEQLSP+S+LD PF
Subjt:  SWTESEFTSEMIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWPNEEEKEQLSPVSVLDFPF

Query:  EDEDQDTPSSFNCNLHLVQGKKQKHAHKPKRFENGVEFEPLDLKKRFADIVVGRQHFGS--ISRKENQREQKAFELLKLVKS----------TTTSTENL
        +D+    PS      H  +  ++K   K +R E+ V  EP+DL+KR  +    RQ + S  I  +E+Q E +A  L  LVKS           +   +N+
Subjt:  EDEDQDTPSSFNCNLHLVQGKKQKHAHKPKRFENGVEFEPLDLKKRFADIVVGRQHFGS--ISRKENQREQKAFELLKLVKS----------TTTSTENL

Query:  LLDFFHEKLEENDAIARTGADFDQAQVLKFTEDWI--NGDAGEAMATGWESPEGRVLYIKDMEKAGKWRSLAG-EKEELAAEFEAEVWMSLFDELLIDLS
        LLDFF    E N+   R     D+ ++++  E+W+    D    M   W+  E R +Y+K+M    KW  + G EKE +  E       SL DEL+ D+S
Subjt:  LLDFFHEKLEENDAIARTGADFDQAQVLKFTEDWI--NGDAGEAMATGWESPEGRVLYIKDMEKAGKWRSLAG-EKEELAAEFEAEVWMSLFDELLIDLS

AT4G23020.2 unknown protein7.7e-2729.19Show/hide
Query:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKI--ALSTISTLQRASDAVV
        M SI SS   +       KP   +L+D+LLDD SSCSSNGF+SFPR             ++ +++ S +         +R+I   L+    + +AS A++
Subjt:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKI--ALSTISTLQRASDAVV

Query:  RAFKKFPLPSYPKPFC-SRSFSRKVILRTFWKKQDFVDVNT----------------RRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSIS----SC
         A K  P PS  K     R   + +  R+FWKK    ++N                 +RC+SF EFL E +  LS     +       G  ++S      
Subjt:  RAFKKFPLPSYPKPFC-SRSFSRKVILRTFWKKQDFVDVNT----------------RRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSIS----SC

Query:  SNSISWTESEFT---SEMIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWPNEEEKEQLSPV
        S+S S  +SE T   S +I    SG+      +D   ++ ++   L G   G      S+                        +K     EEKEQLSP+
Subjt:  SNSISWTESEFT---SEMIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWPNEEEKEQLSPV

Query:  SVLDFPFEDEDQDTPSSFNCNLHLVQGKKQKHAHKPKRFENGVEFEPLDLKKRFADIVVGRQHFGS--ISRKENQREQKAFELLKLVKS----------T
        S+LD PF+D+    PS      H  +  ++K   K +R E+ V  EP+DL+KR  +    RQ + S  I  +E+Q E +A  L  LVKS           
Subjt:  SVLDFPFEDEDQDTPSSFNCNLHLVQGKKQKHAHKPKRFENGVEFEPLDLKKRFADIVVGRQHFGS--ISRKENQREQKAFELLKLVKS----------T

Query:  TTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKFTEDWI--NGDAGEAMATGWESPEGRVLYIKDMEKAGKWRSLAG-EKEELAAEFEAEVWMSLFD
        +   +N+LLDFF    E N+   R     D+ ++++  E+W+    D    M   W+  E R +Y+K+M    KW  + G EKE +  E       SL D
Subjt:  TTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKFTEDWI--NGDAGEAMATGWESPEGRVLYIKDMEKAGKWRSLAG-EKEELAAEFEAEVWMSLFD

Query:  ELLIDLS
        EL+ D+S
Subjt:  ELLIDLS

AT5G03670.1 unknown protein9.5e-0927.48Show/hide
Query:  EEEKEQLSPVSVLDFPFEDEDQD-------TPSSFNCNLHLVQGKKQKHAHKPKRFENGVEFEPLDLKKRFADIVVGRQHFGSISRKENQREQKAFELLK
        EEEKEQ SPVSVLD PF+D+D+D        PSSF      VQ  K     K  RFE     +P++L+KR +D            ++  + E++  E +K
Subjt:  EEEKEQLSPVSVLDFPFEDEDQD-------TPSSFNCNLHLVQGKKQKHAHKPKRFENGVEFEPLDLKKRFADIVVGRQHFGSISRKENQREQKAFELLK

Query:  LVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKFTEDWINGDAGEAMAT--------GWESPEGRVLYIK-----DMEKAGKWRSL-AGEKE
         +      T+ +L  +F E +E  + +    +D    ++       I+G+A  A+           W   E   + +        E+ G WRS    +  
Subjt:  LVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKFTEDWINGDAGEAMAT--------GWESPEGRVLYIK-----DMEKAGKWRSL-AGEKE

Query:  ELAAEFEAEVWMSLFDELLIDL
        E   + E E++  L +EL  D+
Subjt:  ELAAEFEAEVWMSLFDELLIDL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCCATTGATTCTTCCGCTTGGAGCATCATCTCCAAGCCTCCGCCGGAAAAACCTACCTCCTTCATGCTCAAGGATTATCTTCTCGACGATTTCAGTTCC
TGCTCCTCCAATGGCTTCCGATCCTTTCCTCGCCGCCAATGCTGCGCCACAACCGTCCGATTTCTTCTCGAAATCGATCTCAAAGTTAAAGATTCTGCCCTAACT
AAAAGATTCCTTCCTCGAACTGCCTCCCGAAAAATCGCGCTCTCCACGATCTCCACTTTGCAGAGAGCCTCCGATGCCGTTGTCAGAGCCTTCAAGAAATTCCCC
TTGCCTTCTTACCCGAAGCCGTTTTGTTCGAGGAGTTTTTCACGGAAGGTGATTCTGCGAACGTTCTGGAAGAAACAGGATTTTGTGGATGTGAACACCAGACGG
TGTAAATCGTTTCAGGAATTTCTCGATGAGAAAGAACCGCCGTTGTCTCGCTCCGATTCCGCTGTGTGCACCGCCGTGACTGTCGTCGGAAGAAACTCGATTAGT
AGCTGTAGTAATAGTATCAGTTGGACGGAGAGCGAATTTACATCGGAGATGATTCCGTCGTCTTCGAGCGGTAACTCCGAGAGTTGCAGCGAAAACGACGCCGTT
AAAGTCGATAAGGATTCGCCTGGTAATCTCATTGGCAAAAGAGATGGCGTAACATTCGGTAAAGATTCCATGGAGGAAACAACCACCGCCCCTTCCGCCGCCACC
CCCACCACCCCTACCACCACCGATTTCCGAGAGGATATCGTTAAGCCATGGCCAAATGAAGAAGAAAAGGAACAGTTGAGTCCAGTTTCAGTGTTGGATTTTCCT
TTTGAAGATGAAGATCAAGACACCCCCTCGTCTTTCAACTGCAACCTTCACCTCGTCCAAGGTAAGAAGCAAAAACATGCACATAAGCCGAAGCGATTCGAGAAT
GGAGTTGAATTTGAACCTCTAGACTTAAAGAAACGATTTGCAGACATTGTTGTTGGTCGTCAACATTTCGGCTCAATCTCGAGAAAAGAAAACCAAAGGGAACAG
AAAGCATTTGAGCTTCTAAAGCTCGTCAAATCAACTACGACATCGACAGAGAATCTGCTTCTCGATTTCTTCCACGAGAAGCTCGAAGAAAACGACGCAATTGCA
AGAACAGGAGCTGATTTTGATCAAGCACAGGTCTTGAAATTCACTGAAGATTGGATCAATGGGGATGCCGGAGAAGCCATGGCGACGGGATGGGAGTCGCCGGAG
GGACGGGTTTTGTACATTAAGGACATGGAGAAGGCCGGAAAATGGAGAAGTTTGGCCGGAGAAAAGGAAGAGTTGGCGGCGGAGTTTGAAGCTGAGGTTTGGATG
TCTTTGTTTGATGAGCTATTAATCGACCTCTCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCCATTGATTCTTCCGCTTGGAGCATCATCTCCAAGCCTCCGCCGGAAAAACCTACCTCCTTCATGCTCAAGGATTATCTTCTCGACGATTTCAGTTCC
TGCTCCTCCAATGGCTTCCGATCCTTTCCTCGCCGCCAATGCTGCGCCACAACCGTCCGATTTCTTCTCGAAATCGATCTCAAAGTTAAAGATTCTGCCCTAACT
AAAAGATTCCTTCCTCGAACTGCCTCCCGAAAAATCGCGCTCTCCACGATCTCCACTTTGCAGAGAGCCTCCGATGCCGTTGTCAGAGCCTTCAAGAAATTCCCC
TTGCCTTCTTACCCGAAGCCGTTTTGTTCGAGGAGTTTTTCACGGAAGGTGATTCTGCGAACGTTCTGGAAGAAACAGGATTTTGTGGATGTGAACACCAGACGG
TGTAAATCGTTTCAGGAATTTCTCGATGAGAAAGAACCGCCGTTGTCTCGCTCCGATTCCGCTGTGTGCACCGCCGTGACTGTCGTCGGAAGAAACTCGATTAGT
AGCTGTAGTAATAGTATCAGTTGGACGGAGAGCGAATTTACATCGGAGATGATTCCGTCGTCTTCGAGCGGTAACTCCGAGAGTTGCAGCGAAAACGACGCCGTT
AAAGTCGATAAGGATTCGCCTGGTAATCTCATTGGCAAAAGAGATGGCGTAACATTCGGTAAAGATTCCATGGAGGAAACAACCACCGCCCCTTCCGCCGCCACC
CCCACCACCCCTACCACCACCGATTTCCGAGAGGATATCGTTAAGCCATGGCCAAATGAAGAAGAAAAGGAACAGTTGAGTCCAGTTTCAGTGTTGGATTTTCCT
TTTGAAGATGAAGATCAAGACACCCCCTCGTCTTTCAACTGCAACCTTCACCTCGTCCAAGGTAAGAAGCAAAAACATGCACATAAGCCGAAGCGATTCGAGAAT
GGAGTTGAATTTGAACCTCTAGACTTAAAGAAACGATTTGCAGACATTGTTGTTGGTCGTCAACATTTCGGCTCAATCTCGAGAAAAGAAAACCAAAGGGAACAG
AAAGCATTTGAGCTTCTAAAGCTCGTCAAATCAACTACGACATCGACAGAGAATCTGCTTCTCGATTTCTTCCACGAGAAGCTCGAAGAAAACGACGCAATTGCA
AGAACAGGAGCTGATTTTGATCAAGCACAGGTCTTGAAATTCACTGAAGATTGGATCAATGGGGATGCCGGAGAAGCCATGGCGACGGGATGGGAGTCGCCGGAG
GGACGGGTTTTGTACATTAAGGACATGGAGAAGGCCGGAAAATGGAGAAGTTTGGCCGGAGAAAAGGAAGAGTTGGCGGCGGAGTTTGAAGCTGAGGTTTGGATG
TCTTTGTTTGATGAGCTATTAATCGACCTCTCCTAGCTTCTTGGATTATTGTTGTTCCCGTTTTTTTGGAGCTAATTAGCAGCCAAATCTACTTGTCTAAGTAGA
AGAAAGGATAAAGAATGGAATTTTGAATTGGATTATGGCGGCGTTGCCATTGTAACAGGTGGCAAGTAGGCTTGGCTGCTGACTA
Protein sequenceShow/hide protein sequence
MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFP
LPSYPKPFCSRSFSRKVILRTFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSGNSESCSENDAV
KVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTDFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQGKKQKHAHKPKRFEN
GVEFEPLDLKKRFADIVVGRQHFGSISRKENQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKFTEDWINGDAGEAMATGWESPE
GRVLYIKDMEKAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS