CuGenDBv2

Carg16658 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg16658
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Unknown protein
Genome location: Carg_Chr15:8511199..8513738
RNA-Seq Expression: Carg16658
Synteny: Carg16658
Gene Ontology terms: NA
InterPro domains: NA
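
The genome location string above can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location):
    """Split a 'chromosome:start..end' string into its components.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chromosome = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chromosome, start, end, end - start + 1


chromosome, start, end, span = parse_location("Carg_Chr15:8511199..8513738")
print(chromosome, start, end, span)
```

Under the inclusive-coordinate assumption, the span for this gene comes out at 2540 bp.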


Homology
GenBank top hits | e value | % identity (alignment follows each hit)
KAG6579477.1 hypothetical protein SDJN03_23925, partial [Cucurbita argyrosperma subsp. sororia] | 4.0e-259 | 98.93
Query:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA
        MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA
Subjt:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA

Query:  FKKFPLPSYRKPFCSRSFSRKVILRAFWKKQDFVDVNTRRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSG
        FKKFPLPSY KPFCSRSFSRKVILR FWKKQDFVDVNT RRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSG
Subjt:  FKKFPLPSYRKPFCSRSFSRKVILRAFWKKQDFVDVNTRRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSG

Query:  NSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLV
        NSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHL+
Subjt:  NSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLV

Query:  QGKKQKHAHRPKRFENGVEFEPLDLKKRFADIVVGHQHFGSISRKEHQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKF
        QGKKQKHAHRPKRFENGVEFEPLDLKKRFADIVVG QHFGSISRKEHQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKF
Subjt:  QGKKQKHAHRPKRFENGVEFEPLDLKKRFADIVVGHQHFGSISRKEHQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKF

Query:  TEDWINGDAGEAMATGWESPEGRVLYIKDMEIAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS
        TEDWINGDAGEAMATGWESPEGRVLYIKDMEIAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS
Subjt:  TEDWINGDAGEAMATGWESPEGRVLYIKDMEIAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS

KAG7016946.1 hypothetical protein SDJN02_22057, partial [Cucurbita argyrosperma subsp. argyrosperma] | 2.0e-263 | 100
Query:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA
        MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA
Subjt:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA

Query:  FKKFPLPSYRKPFCSRSFSRKVILRAFWKKQDFVDVNTRRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSG
        FKKFPLPSYRKPFCSRSFSRKVILRAFWKKQDFVDVNTRRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSG
Subjt:  FKKFPLPSYRKPFCSRSFSRKVILRAFWKKQDFVDVNTRRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSG

Query:  NSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLV
        NSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLV
Subjt:  NSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLV

Query:  QGKKQKHAHRPKRFENGVEFEPLDLKKRFADIVVGHQHFGSISRKEHQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKF
        QGKKQKHAHRPKRFENGVEFEPLDLKKRFADIVVGHQHFGSISRKEHQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKF
Subjt:  QGKKQKHAHRPKRFENGVEFEPLDLKKRFADIVVGHQHFGSISRKEHQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKF

Query:  TEDWINGDAGEAMATGWESPEGRVLYIKDMEIAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS
        TEDWINGDAGEAMATGWESPEGRVLYIKDMEIAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS
Subjt:  TEDWINGDAGEAMATGWESPEGRVLYIKDMEIAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS

XP_022922340.1 uncharacterized protein LOC111430353 [Cucurbita moschata] | 1.8e-256 | 98.29
Query:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA
        MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA
Subjt:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA

Query:  FKKFPLPSYRKPFCSRSFSRKVILRAFWKKQDFVDVNTRRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSG
        FKKFPLPSY KPFCSRSFSRKVILR FWKKQDFVDVNT RRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSG
Subjt:  FKKFPLPSYRKPFCSRSFSRKVILRAFWKKQDFVDVNTRRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSG

Query:  NSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLV
        NSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTT FREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLV
Subjt:  NSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLV

Query:  QGKKQKHAHRPKRFENGVEFEPLDLKKRFADIVVGHQHFGSISRKEHQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKF
        QGKKQKHAH+PKRFENGVEFEPLDLKKRFADIVVG QHFGSISRKE+QREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKF
Subjt:  QGKKQKHAHRPKRFENGVEFEPLDLKKRFADIVVGHQHFGSISRKEHQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKF

Query:  TEDWINGDAGEAMATGWESPEGRVLYIKDMEIAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS
        TEDWINGDAGEAMATGWESPEGRVLYIKDME AGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS
Subjt:  TEDWINGDAGEAMATGWESPEGRVLYIKDMEIAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS

XP_022969906.1 uncharacterized protein LOC111468962 [Cucurbita maxima] | 4.9e-249 | 95.93
Query:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA
        MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDS+LTKRFLPRTASRKIALSTISTLQRASDAVVRA
Subjt:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA

Query:  FKKFPLPSYRKPFCSRSFSRKVILRAFWKKQDFVDVNTRRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSG
        FKKFPLPSYRKPFC+RSFSRKVILRAFWKKQDFVDVNT RRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSE IPSSSSG
Subjt:  FKKFPLPSYRKPFCSRSFSRKVILRAFWKKQDFVDVNTRRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSG

Query:  NSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLV
        NSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVK WPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLV
Subjt:  NSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLV

Query:  QGKKQKHAHRPKRFENGVEFEPLDLKKRFADIVVGHQHFGSISRKEHQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKF
        QGKK  HA + KRFENGVEFEPLDLKKRFADIVVG QHF  ISRKEHQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGAD DQAQVLKF
Subjt:  QGKKQKHAHRPKRFENGVEFEPLDLKKRFADIVVGHQHFGSISRKEHQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKF

Query:  TEDWINGDAGEAMATGWESPEGRVLYIKDMEIAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS
        TEDWINGDAGEAM TGWE+PEGRVLYIKDMEIAGKWRS+ GEKEELAAEFEAEVW+SLFDELLIDLS
Subjt:  TEDWINGDAGEAMATGWESPEGRVLYIKDMEIAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS

XP_023551213.1 uncharacterized protein LOC111809098 [Cucurbita pepo subsp. pepo] | 4.3e-253 | 97
Query:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA
        MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDS+LTKRFLPRTASRKIALSTISTLQRASDAVVRA
Subjt:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA

Query:  FKKFPLPSYRKPFCSRSFSRKVILRAFWKKQDFVDVNTRRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSG
        FKKFPLPSYRKPFCSRSFSRKVILRAFWKKQDFVDVNT RRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSG
Subjt:  FKKFPLPSYRKPFCSRSFSRKVILRAFWKKQDFVDVNTRRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSG

Query:  NSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLV
        NSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAA PTTPTTTGFREDIVK WPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLV
Subjt:  NSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLV

Query:  QGKKQKHAHRPKRFENGVEFEPLDLKKRFADIVVGHQHFGSISRKEHQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKF
        QGKKQKHA +PKRFENGVEFEPLDL KRFADIVVG QHF SISRKEHQREQKAFELLKLVKST TS ENLLLDFFHEKLEENDA ARTGADFDQAQVLKF
Subjt:  QGKKQKHAHRPKRFENGVEFEPLDLKKRFADIVVGHQHFGSISRKEHQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKF

Query:  TEDWINGDAGEAMATGWESPEGRVLYIKDMEIAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS
        TEDWINGD GEAMATGWESPEGRVLYIKDMEIAGKWRS+AGEKEELAAEFEAEVWMSLFDELLIDLS
Subjt:  TEDWINGDAGEAMATGWESPEGRVLYIKDMEIAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS

TrEMBL top hits | e value | % identity (alignment follows each hit)
A0A0A0KP06 Uncharacterized protein | 5.0e-183 | 72.54
Query:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA
        MAS DSS W++IS PP +KP S +LKDYLLDDFSSCSSNGFRSFPRRQCC+TTVRFLLEIDLKVKDS++TKRFLPRT SRKIALSTISTLQRASDAV+RA
Subjt:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA

Query:  FKKFPLPSYRKPFCSRSFSRKVILRAFWKKQDFVDVNTRRRCKSFQEFLDEKEPPLS------RSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMI
        FK+FPLPS RK F  RS SRK+I +AF KK D VD N  +R KSF+EFLDEKEPP S       SDSAVCTA+ V GRNSISSCSNSISWTESEFTSE+I
Subjt:  FKKFPLPSYRKPFCSRSFSRKVILRAFWKKQDFVDVNTRRRCKSFQEFLDEKEPPLS------RSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMI

Query:  PSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFN
        PSS SGNSESCSENDAVK DKDSPGNLIGKRDGVTFGKDSMEETTTAP++    T +   +RED VK W NEEEKEQ SPVSVLDFPFEDEDQD  SSFN
Subjt:  PSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFN

Query:  CNLHLVQGKKQKHA-HRPKRFENGVEFEPLDLKKRFADI-VVGHQ-HFGSISRKEHQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGAD
        CN+HL++GKKQK    + KR E G E EP+DLKKRF +I V+G Q HF  I++KEHQ E+KA E LKL+KSTT STENLLLDFFH+KL+E++A + T +D
Subjt:  CNLHLVQGKKQKHA-HRPKRFENGVEFEPLDLKKRFADI-VVGHQ-HFGSISRKEHQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGAD

Query:  FDQAQVLKFTEDWINGDAGEAMATG-WESPEGRVLYIKDMEIAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS
        FDQ Q+LKF +DWI+G+AGE    G WE PE R  YIKDME+  KWRS  G+KEEL AEFE EVW+SL ++LLIDLS
Subjt:  FDQAQVLKFTEDWINGDAGEAMATG-WESPEGRVLYIKDMEIAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS

A0A1S3ATL0 uncharacterized protein LOC103482706 | 5.5e-174 | 71.25
Query:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA
        MAS DSS W++IS PP +KP S +LKDYLLDDFSSCSSNGFRSFPRRQCC+TTVRFLLEIDLKVKDS+ TK+FLPRT+SRKIALSTISTLQRASDAV+RA
Subjt:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA

Query:  FKKFPLPSYRKPFCSRSFSRKVILRAFWKKQDFVDVNTRRRCKSFQEFLDEKEPPLS------RSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMI
        FK+FPLPS RK F  RS SRK+I +AF KK D VD N  RR KSF+EFLDEKEPP S       SDSAVCTA+ V GRNSISSCSNSISWTESEFTSE+I
Subjt:  FKKFPLPSYRKPFCSRSFSRKVILRAFWKKQDFVDVNTRRRCKSFQEFLDEKEPPLS------RSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMI

Query:  PSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWP-NEEEKEQLSPVSVLDFPFEDEDQDTPSSF
        PSS SGNSESCSEN AVK DKDSP NLIGKRDGVTFGKDSMEET T PSA          +RED VK W  NEEEKEQ SPVSVLDFPFEDEDQD  SS 
Subjt:  PSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWP-NEEEKEQLSPVSVLDFPFEDEDQDTPSSF

Query:  NCNLHLVQGKKQKHA-HRPKRFENGVEFEPLDLKKRFADI--VVGHQ-HFGSISR-KEHQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIART
        NCN+HL++GKKQK    + KR E G E EP+DLKKRF +I  +  HQ HF  I++ KEHQ E+KA E LKL+KSTT STENLLLDFFH+KL+E++A + T
Subjt:  NCNLHLVQGKKQKHA-HRPKRFENGVEFEPLDLKKRFADI--VVGHQ-HFGSISR-KEHQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIART

Query:  GADFDQAQVLKFTEDWINGDAGEAMATG-WESPEGRVLYIKDMEIAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS
         +DFDQ Q+L+F +DW++G+AGE    G WE PE R  YIKDME+A KWRS  G+KEEL AEFEAEVW+SL D+LLIDLS
Subjt:  GADFDQAQVLKFTEDWINGDAGEAMATG-WESPEGRVLYIKDMEIAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS

A0A5A7TN51 Uncharacterized protein | 1.1e-174 | 71.46
Query:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA
        MAS DSS W++IS PP +KP S +LKDYLLDDFSSCSSNGFRSFPRRQCC+TTVRFLLEIDLKVKDS+ TK+FLPRT+SRKIALSTISTLQRASDAV+RA
Subjt:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA

Query:  FKKFPLPSYRKPFCSRSFSRKVILRAFWKKQDFVDVNTRRRCKSFQEFLDEKEPPLS------RSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMI
        FK+FPLPS RK F  RS SRK+I +AF KK D VD N  RR KSF+EFLDEKEPP S       SDSAVCTA+ V GRNSISSCSNSISWTESEFTSE+I
Subjt:  FKKFPLPSYRKPFCSRSFSRKVILRAFWKKQDFVDVNTRRRCKSFQEFLDEKEPPLS------RSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMI

Query:  PSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWP-NEEEKEQLSPVSVLDFPFEDEDQDTPSSF
        PSS SGNSESCSEN AVK DKDSP NLIGKRDGVTFGKDSMEET T PSA          +RED VK W  NEEEKEQ SPVSVLDFPFEDEDQD  SSF
Subjt:  PSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWP-NEEEKEQLSPVSVLDFPFEDEDQDTPSSF

Query:  NCNLHLVQGKKQKHA-HRPKRFENGVEFEPLDLKKRFADI--VVGHQ-HFGSISR-KEHQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIART
        NCN+HL++GKKQK    + KR E G E EP+DLKKRF +I  +  HQ HF  I++ KEHQ E+KA E LKL+KSTT STENLLLDFFH+KL+E++A + T
Subjt:  NCNLHLVQGKKQKHA-HRPKRFENGVEFEPLDLKKRFADI--VVGHQ-HFGSISR-KEHQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIART

Query:  GADFDQAQVLKFTEDWINGDAGEAMATG-WESPEGRVLYIKDMEIAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS
         +DFDQ Q+L+F +DW++G+AGE    G WE PE R  YIKDME+A KWRS  G+KEEL AEFEAEVW+SL D+LLIDLS
Subjt:  GADFDQAQVLKFTEDWINGDAGEAMATG-WESPEGRVLYIKDMEIAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS

A0A6J1E8H2 uncharacterized protein LOC111430353 | 8.9e-257 | 98.29
Query:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA
        MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA
Subjt:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA

Query:  FKKFPLPSYRKPFCSRSFSRKVILRAFWKKQDFVDVNTRRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSG
        FKKFPLPSY KPFCSRSFSRKVILR FWKKQDFVDVNT RRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSG
Subjt:  FKKFPLPSYRKPFCSRSFSRKVILRAFWKKQDFVDVNTRRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSG

Query:  NSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLV
        NSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTT FREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLV
Subjt:  NSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLV

Query:  QGKKQKHAHRPKRFENGVEFEPLDLKKRFADIVVGHQHFGSISRKEHQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKF
        QGKKQKHAH+PKRFENGVEFEPLDLKKRFADIVVG QHFGSISRKE+QREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKF
Subjt:  QGKKQKHAHRPKRFENGVEFEPLDLKKRFADIVVGHQHFGSISRKEHQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKF

Query:  TEDWINGDAGEAMATGWESPEGRVLYIKDMEIAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS
        TEDWINGDAGEAMATGWESPEGRVLYIKDME AGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS
Subjt:  TEDWINGDAGEAMATGWESPEGRVLYIKDMEIAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS

A0A6J1HZ34 uncharacterized protein LOC111468962 | 2.4e-249 | 95.93
Query:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA
        MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDS+LTKRFLPRTASRKIALSTISTLQRASDAVVRA
Subjt:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRA

Query:  FKKFPLPSYRKPFCSRSFSRKVILRAFWKKQDFVDVNTRRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSG
        FKKFPLPSYRKPFC+RSFSRKVILRAFWKKQDFVDVNT RRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSE IPSSSSG
Subjt:  FKKFPLPSYRKPFCSRSFSRKVILRAFWKKQDFVDVNTRRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSG

Query:  NSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLV
        NSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVK WPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLV
Subjt:  NSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLV

Query:  QGKKQKHAHRPKRFENGVEFEPLDLKKRFADIVVGHQHFGSISRKEHQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKF
        QGKK  HA + KRFENGVEFEPLDLKKRFADIVVG QHF  ISRKEHQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGAD DQAQVLKF
Subjt:  QGKKQKHAHRPKRFENGVEFEPLDLKKRFADIVVGHQHFGSISRKEHQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKF

Query:  TEDWINGDAGEAMATGWESPEGRVLYIKDMEIAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS
        TEDWINGDAGEAM TGWE+PEGRVLYIKDMEIAGKWRS+ GEKEELAAEFEAEVW+SLFDELLIDLS
Subjt:  TEDWINGDAGEAMATGWESPEGRVLYIKDMEIAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS

SwissProt top hits | e value | % identity (alignment follows each hit)
No hits found

Arabidopsis top hits | e value | % identity (alignment follows each hit)
AT4G00770.1 unknown protein | 6.8e-15 | 25.89
Query:  SFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKF---PLPSYRKPFCSRSF
        S MLKD LL+D +SCSSNGF+S PRR                           P    RK           A  AV+ A K      + S       RS 
Subjt:  SFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKF---PLPSYRKPFCSRSF

Query:  SRKVILRAFWKKQDFV------DVNTRRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSGNSESCSENDAVK
        SR++  +   + Q  +      D+      K   E +   EP    + +   T  T     S +SCS   SW++ +FTSE +PSS   N E C E  +VK
Subjt:  SRKVILRAFWKKQDFV------DVNTRRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSGNSESCSENDAVK

Query:  VDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQGKKQKHAHRPK
         +    G                E++ TA   A     T  G  E++      + EKE  SPVSV +   E+ D+ + SSF+  L  V+  KQK     +
Subjt:  VDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQGKKQKHAHRPK

Query:  RFENGVEFEPLDLKK----RFADIVVGHQ----------HFGSISRKEHQR-----EQKAFELLKLVK---STTTSTENLLLDFFHEKLEENDAIARTGA
        RFE+     P +L +      A  + G Q          +  ++ R+         E+KA +L   VK   +     E+L++D+F ++L +         
Subjt:  RFENGVEFEPLDLKK----RFADIVVGHQ----------HFGSISRKEHQR-----EQKAFELLKLVK---STTTSTENLLLDFFHEKLEENDAIARTGA

Query:  DFDQAQVLKFTEDWINGDAGEAMATGWESPEGRVLYIKDMEIAGKW--RSLAGEKEELAAEFEAEVWMSLFDELLIDLS
         F+  Q++   + W+ G     +  G  S + R    +++E    W  + +  E E +  + E E++  L DE L  LS
Subjt:  DFDQAQVLKFTEDWINGDAGEAMATGWESPEGRVLYIKDMEIAGKW--RSLAGEKEELAAEFEAEVWMSLFDELLIDLS

AT4G11780.1 unknown protein | 3.4e-30 | 31.25
Query:  MLKDYLLDDFSSCSSNGFRSFPRRQ--CCATTVRFLLEIDLK----------VKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPS---
        +L+DYLLDD SSCSSNGF+SFPRRQ    ++TVR LL+ ++K           K   LT+R    T    I+      + +AS A +   K  P PS   
Subjt:  MLKDYLLDDFSSCSSNGFRSFPRRQ--CCATTVRFLLEIDLK----------VKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPS---

Query:  YRKPFCSRSFSRKVILRAFWKKQDFVDVNTRR-------------RCKSFQEFLDEKEPPLSR--------SDSAVCTAVTVVGRNSISSCSNSISWTES
         ++   SRSFS++++  +FW+K   V   +RR             R  +++E LD++    S+        + S    A+TVV    IS  S+S     S
Subjt:  YRKPFCSRSFSRKVILRAFWKKQDFVDVNTRR-------------RCKSFQEFLDEKEPPLSR--------SDSAVCTAVTVVGRNSISSCSNSISWTES

Query:  EF----TSEMIPSSSSG-NSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWPNEEEKEQLSPVSVLDFPF
        EF    +SE++ SSSS  +S S  E++ V  + D+  +  GK  G     DS++      S+           R++ V      EEKEQLSPVS+L+ PF
Subjt:  EF----TSEMIPSSSSG-NSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWPNEEEKEQLSPVSVLDFPF

Query:  EDEDQDTPSSFNCNLHLVQGKKQKHAHRPKRFENGVEFEPLDLKKRFADIVVGHQHFG--SISRKEHQREQKAFELLKLVKSTTTST---------ENLL
        +D+D+D   +   + +      +K A + +R    V  EPLDL KR    V   + +   ++  +E + E +A  L  LVK     T         +NLL
Subjt:  EDEDQDTPSSFNCNLHLVQGKKQKHAHRPKRFENGVEFEPLDLKKRFADIVVGHQHFG--SISRKEHQREQKAFELLKLVKSTTTST---------ENLL

Query:  LDFFHEKLEENDAIARTGADFDQAQVLKFTEDWINGDAGEAMATGWESPEGRVLYIKDMEIAGKWRSLAG-EKEELAAEFEAEVWMSLFDELLIDL
        LD+  E     D I       ++  ++K  EDW+ G   E M   WE    R +Y+K+M    KW  + G E+E +  E     + S  DE + DL
Subjt:  LDFFHEKLEENDAIARTGADFDQAQVLKFTEDWINGDAGEAMATGWESPEGRVLYIKDMEIAGKWRSLAG-EKEELAAEFEAEVWMSLFDELLIDL

AT4G23020.1 unknown protein | 8.5e-26 | 28.6
Query:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKI--ALSTISTLQRASDAVV
        M SI SS   +       KP   +L+D+LLDD SSCSSNGF+SFPR             ++ +++ S +         +R+I   L+    + +AS A++
Subjt:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKI--ALSTISTLQRASDAVV

Query:  RAFKKFPLP-SYRKPFCSRSFSRKVILRAFWKK----QDFVDVNTR-----------RRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSI
         A K  P P S +     R   + +  R+FWKK    +  VDV  +           +RC+SF EFL E +  LS     +       G  ++S      
Subjt:  RAFKKFPLP-SYRKPFCSRSFSRKVILRAFWKK----QDFVDVNTR-----------RRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSI

Query:  SWTESEFTSEMIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWPNEEEKEQLSPVSVLDFPF
                       + G+S S S  D+ +V + S G ++     V    D +    +  S+    T                 EEKEQLSP+S+LD PF
Subjt:  SWTESEFTSEMIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWPNEEEKEQLSPVSVLDFPF

Query:  EDEDQDTPSSFNCNLHLVQGKKQKHAHRPKRFENGVEFEPLDLKKRFADIVVGHQHFGS--ISRKEHQREQKAFELLKLVKS----------TTTSTENL
        +D+    PS      H  +  ++K   + +R E+ V  EP+DL+KR  +     Q + S  I  +E Q E +A  L  LVKS           +   +N+
Subjt:  EDEDQDTPSSFNCNLHLVQGKKQKHAHRPKRFENGVEFEPLDLKKRFADIVVGHQHFGS--ISRKEHQREQKAFELLKLVKS----------TTTSTENL

Query:  LLDFFHEKLEENDAIARTGADFDQAQVLKFTEDWI--NGDAGEAMATGWESPEGRVLYIKDMEIAGKWRSLAG-EKEELAAEFEAEVWMSLFDELLIDLS
        LLDFF    E N+   R     D+ ++++  E+W+    D    M   W+  E R +Y+K+M    KW  + G EKE +  E       SL DEL+ D+S
Subjt:  LLDFFHEKLEENDAIARTGADFDQAQVLKFTEDWI--NGDAGEAMATGWESPEGRVLYIKDMEIAGKWRSLAG-EKEELAAEFEAEVWMSLFDELLIDLS

AT4G23020.2 unknown protein | 1.1e-25 | 28.99
Query:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKI--ALSTISTLQRASDAVV
        M SI SS   +       KP   +L+D+LLDD SSCSSNGF+SFPR             ++ +++ S +         +R+I   L+    + +AS A++
Subjt:  MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKI--ALSTISTLQRASDAVV

Query:  RAFKKFPLP-SYRKPFCSRSFSRKVILRAFWKK----QDFVDVNTR-----------RRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSIS----SC
         A K  P P S +     R   + +  R+FWKK    +  VDV  +           +RC+SF EFL E +  LS     +       G  ++S      
Subjt:  RAFKKFPLP-SYRKPFCSRSFSRKVILRAFWKK----QDFVDVNTR-----------RRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSIS----SC

Query:  SNSISWTESEFT---SEMIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWPNEEEKEQLSPV
        S+S S  +SE T   S +I    SG+      +D   ++ ++   L G   G      S+                        +K     EEKEQLSP+
Subjt:  SNSISWTESEFT---SEMIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWPNEEEKEQLSPV

Query:  SVLDFPFEDEDQDTPSSFNCNLHLVQGKKQKHAHRPKRFENGVEFEPLDLKKRFADIVVGHQHFGS--ISRKEHQREQKAFELLKLVKS----------T
        S+LD PF+D+    PS      H  +  ++K   + +R E+ V  EP+DL+KR  +     Q + S  I  +E Q E +A  L  LVKS           
Subjt:  SVLDFPFEDEDQDTPSSFNCNLHLVQGKKQKHAHRPKRFENGVEFEPLDLKKRFADIVVGHQHFGS--ISRKEHQREQKAFELLKLVKS----------T

Query:  TTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKFTEDWI--NGDAGEAMATGWESPEGRVLYIKDMEIAGKWRSLAG-EKEELAAEFEAEVWMSLFD
        +   +N+LLDFF    E N+   R     D+ ++++  E+W+    D    M   W+  E R +Y+K+M    KW  + G EKE +  E       SL D
Subjt:  TTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKFTEDWI--NGDAGEAMATGWESPEGRVLYIKDMEIAGKWRSLAG-EKEELAAEFEAEVWMSLFD

Query:  ELLIDLS
        EL+ D+S
Subjt:  ELLIDLS

AT5G03670.1 unknown protein | 2.8e-08 | 27.03
Query:  EEEKEQLSPVSVLDFPFEDEDQD-------TPSSFNCNLHLVQGKKQKHAHRPKRFENGVEFEPLDLKKRFADIVVGHQHFGSISRKEHQREQKAFELLK
        EEEKEQ SPVSVLD PF+D+D+D        PSSF      VQ  K     +  RFE     +P++L+KR +D            ++  + E++  E +K
Subjt:  EEEKEQLSPVSVLDFPFEDEDQD-------TPSSFNCNLHLVQGKKQKHAHRPKRFENGVEFEPLDLKKRFADIVVGHQHFGSISRKEHQREQKAFELLK

Query:  LVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKFTEDWINGDAGEAMAT--------GWESPEGRVLYIK-----DMEIAGKWRSL-AGEKE
         +      T+ +L  +F E +E  + +    +D    ++       I+G+A  A+           W   E   + +        E  G WRS    +  
Subjt:  LVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKFTEDWINGDAGEAMAT--------GWESPEGRVLYIK-----DMEIAGKWRSL-AGEKE

Query:  ELAAEFEAEVWMSLFDELLIDL
        E   + E E++  L +EL  D+
Subjt:  ELAAEFEAEVWMSLFDELLIDL
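
The % identity values listed with each hit can be related to the alignment blocks through the middle (match) line, where a letter marks an identical column and a space or `+` marks a non-identical one. The sketch below is an illustration under that reading of the match line; the function name `percent_identity` is hypothetical and is not taken from the database or from any BLAST library. For the first GenBank hit, the match line carries 462 letters over 467 columns, which is consistent with the listed 98.93.

```python
def percent_identity(query_line, match_line):
    """Percent identity from one query line and its match line.

    Letters in the match line count as identical columns; spaces and
    '+' count as non-identical. The denominator is the alignment
    length, taken from the query line (gaps included).
    """
    columns = len(query_line)
    if columns == 0:
        raise ValueError("empty alignment")
    # Pad in case trailing spaces were stripped from the match line.
    padded = match_line.ljust(columns)[:columns]
    identical = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identical / columns


# A 15-column excerpt from the second block of the first GenBank hit.
query = "FKKFPLPSYRKPFCS"
match = "FKKFPLPSY KPFCS"
print(round(percent_identity(query, match), 2))
```

To score a whole hit, concatenate the query segments and the match segments of all its blocks before calling the function, since the listed value refers to the full alignment rather than to a single block.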


Sequences
CDS sequence
ATGGCGTCCATTGATTCTTCCGCTTGGAGCATCATCTCCAAGCCTCCGCCGGAAAAACCTACCTCCTTCATGCTCAAGGATTATCTTCTCGACGATTTCAGTTCCTGCTC
CTCCAATGGCTTCCGATCCTTTCCTCGCCGCCAATGCTGCGCCACAACCGTCCGATTTCTACTCGAAATCGATCTCAAAGTTAAAGATTCTGCCCTAACTAAAAGATTCC
TTCCTCGAACTGCCTCCCGAAAAATCGCGCTCTCCACGATCTCCACTTTGCAGAGAGCCTCCGATGCCGTTGTCAGAGCCTTCAAGAAATTCCCCTTGCCTTCTTACCGG
AAGCCGTTTTGTTCGAGGAGTTTTTCACGGAAGGTGATTCTGCGAGCGTTCTGGAAGAAACAGGATTTTGTGGATGTGAACACCAGACGGCGGTGTAAATCGTTTCAGGA
ATTTCTCGATGAGAAAGAACCGCCGTTGTCTCGCTCCGATTCCGCTGTGTGCACCGCCGTGACTGTCGTCGGAAGAAACTCGATTAGTAGCTGTAGTAATAGTATCAGTT
GGACGGAGAGCGAATTTACATCGGAGATGATTCCGTCGTCTTCGAGCGGTAACTCCGAGAGTTGCAGCGAAAACGACGCCGTTAAAGTCGATAAGGATTCACCTGGTAAT
CTCATTGGCAAAAGAGATGGCGTAACATTCGGTAAAGATTCCATGGAGGAAACAACCACCGCCCCTTCCGCCGCCACCCCCACCACCCCTACCACCACCGGTTTCCGAGA
GGATATCGTTAAGCCATGGCCAAATGAAGAAGAAAAGGAACAGTTGAGTCCAGTTTCAGTGTTGGATTTTCCTTTTGAAGATGAAGATCAAGACACCCCCTCGTCTTTCA
ACTGCAATCTTCACCTCGTCCAAGGTAAGAAGCAAAAACATGCACATAGGCCGAAGCGATTCGAGAATGGAGTTGAATTTGAACCTCTAGACTTAAAGAAACGATTTGCA
GACATTGTTGTTGGTCATCAACATTTCGGCTCAATCTCGAGAAAAGAACACCAAAGGGAACAGAAAGCATTTGAGCTTCTAAAGCTCGTCAAATCAACTACGACATCGAC
AGAGAATCTGCTTCTCGATTTCTTCCACGAGAAGCTAGAAGAAAACGACGCAATTGCAAGAACAGGAGCTGATTTTGATCAAGCACAGGTCTTGAAATTCACTGAAGATT
GGATCAATGGGGATGCCGGAGAAGCCATGGCGACGGGATGGGAGTCGCCGGAGGGACGGGTTTTGTACATTAAGGACATGGAGATTGCCGGAAAATGGAGAAGTTTGGCC
GGAGAAAAGGAAGAGTTGGCGGCGGAGTTTGAAGCTGAGGTTTGGATGTCTTTGTTTGATGAGCTATTAATCGACCTCTCCTAG
mRNA sequence
ATGGCGTCCATTGATTCTTCCGCTTGGAGCATCATCTCCAAGCCTCCGCCGGAAAAACCTACCTCCTTCATGCTCAAGGATTATCTTCTCGACGATTTCAGTTCCTGCTC
CTCCAATGGCTTCCGATCCTTTCCTCGCCGCCAATGCTGCGCCACAACCGTCCGATTTCTACTCGAAATCGATCTCAAAGTTAAAGATTCTGCCCTAACTAAAAGATTCC
TTCCTCGAACTGCCTCCCGAAAAATCGCGCTCTCCACGATCTCCACTTTGCAGAGAGCCTCCGATGCCGTTGTCAGAGCCTTCAAGAAATTCCCCTTGCCTTCTTACCGG
AAGCCGTTTTGTTCGAGGAGTTTTTCACGGAAGGTGATTCTGCGAGCGTTCTGGAAGAAACAGGATTTTGTGGATGTGAACACCAGACGGCGGTGTAAATCGTTTCAGGA
ATTTCTCGATGAGAAAGAACCGCCGTTGTCTCGCTCCGATTCCGCTGTGTGCACCGCCGTGACTGTCGTCGGAAGAAACTCGATTAGTAGCTGTAGTAATAGTATCAGTT
GGACGGAGAGCGAATTTACATCGGAGATGATTCCGTCGTCTTCGAGCGGTAACTCCGAGAGTTGCAGCGAAAACGACGCCGTTAAAGTCGATAAGGATTCACCTGGTAAT
CTCATTGGCAAAAGAGATGGCGTAACATTCGGTAAAGATTCCATGGAGGAAACAACCACCGCCCCTTCCGCCGCCACCCCCACCACCCCTACCACCACCGGTTTCCGAGA
GGATATCGTTAAGCCATGGCCAAATGAAGAAGAAAAGGAACAGTTGAGTCCAGTTTCAGTGTTGGATTTTCCTTTTGAAGATGAAGATCAAGACACCCCCTCGTCTTTCA
ACTGCAATCTTCACCTCGTCCAAGGTAAGAAGCAAAAACATGCACATAGGCCGAAGCGATTCGAGAATGGAGTTGAATTTGAACCTCTAGACTTAAAGAAACGATTTGCA
GACATTGTTGTTGGTCATCAACATTTCGGCTCAATCTCGAGAAAAGAACACCAAAGGGAACAGAAAGCATTTGAGCTTCTAAAGCTCGTCAAATCAACTACGACATCGAC
AGAGAATCTGCTTCTCGATTTCTTCCACGAGAAGCTAGAAGAAAACGACGCAATTGCAAGAACAGGAGCTGATTTTGATCAAGCACAGGTCTTGAAATTCACTGAAGATT
GGATCAATGGGGATGCCGGAGAAGCCATGGCGACGGGATGGGAGTCGCCGGAGGGACGGGTTTTGTACATTAAGGACATGGAGATTGCCGGAAAATGGAGAAGTTTGGCC
GGAGAAAAGGAAGAGTTGGCGGCGGAGTTTGAAGCTGAGGTTTGGATGTCTTTGTTTGATGAGCTATTAATCGACCTCTCCTAG
Protein sequence
MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPSYR
KPFCSRSFSRKVILRAFWKKQDFVDVNTRRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSGNSESCSENDAVKVDKDSPGN
LIGKRDGVTFGKDSMEETTTAPSAATPTTPTTTGFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQGKKQKHAHRPKRFENGVEFEPLDLKKRFA
DIVVGHQHFGSISRKEHQREQKAFELLKLVKSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKFTEDWINGDAGEAMATGWESPEGRVLYIKDMEIAGKWRSLA
GEKEELAAEFEAEVWMSLFDELLIDLS
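
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code and reading frame 1; the names `CODON_TABLE` and `translate` are hypothetical, and the snippet uses only the Python standard library.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    sequence = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(sequence) - 2, 3):
        amino_acid = CODON_TABLE[sequence[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons and last four codons of the CDS listed above.
print(translate("ATGGCGTCCATTGATTCTTCC"))
print(translate("GACCTCTCCTAG"))
```

Passing the full CDS, with its line breaks left in, should reproduce the listed protein sequence if the standard-code assumption holds for this gene.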