; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038026 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038026
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUnknown protein
Genome locationchr2:11737970..11742769
RNA-Seq ExpressionLag0038026
SyntenyLag0038026
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607806.1 hypothetical protein SDJN03_01148, partial [Cucurbita argyrosperma subsp. sororia]1.2e-24880.19Show/hide
Query:  SSCSLILRTILLVSIVGFLLPSQSLALKSPFAPRDVLPLLPRQVSWPILNYLNNAADLLPTFVGAVSSPENTIQWQGACFDKNTAWLEFHNKSGTEFGGG
        SSCS I RT LLVSIVGFLLPSQSLALKS F PRDVLPLLP QVSWPILNYLNNAADLLP FVG+VSSPEN+IQWQG+CF KN+AWLEFHNKSG+E+GGG
Subjt:  SSCSLILRTILLVSIVGFLLPSQSLALKSPFAPRDVLPLLPRQVSWPILNYLNNAADLLPTFVGAVSSPENTIQWQGACFDKNTAWLEFHNKSGTEFGGG

Query:  TLHIKVSSAHSWTCMDLYIFATSYRVTWDYYFLSREHTLEFAEWEGREEYEYVKKAGVSIFLMQSGVLKTLQALLDVLPLFSNSDYGEQANIKFLESHMS
        TLHIK + AHSWTCMD YIFAT YRVTWDYYFL+REHTLE AEWEG+EE+EYVKKAGVSIFL+Q+GVLKT QAL DVLPLFS +DYGE++NIKFLE+HM 
Subjt:  TLHIKVSSAHSWTCMDLYIFATSYRVTWDYYFLSREHTLEFAEWEGREEYEYVKKAGVSIFLMQSGVLKTLQALLDVLPLFSNSDYGEQANIKFLESHMS

Query:  TTFKERPHPWTTGVDVDDIQSGDFLALSKIRGPWGGFETLQKWVTGSYAGHSAVCLRDSEGKLWVAESGRGNGVEDDIIDVLPWDQWWDYELNNDSSNPT
         TF+ERP PWTT V+VDDIQSGDFLALSKIRGPWG FETLQKWVTGSYAGHSAVCLRD EGKLWVAESGRGNGVEDDIIDVLPWD+WWDYELNNDSSNP 
Subjt:  TTFKERPHPWTTGVDVDDIQSGDFLALSKIRGPWGGFETLQKWVTGSYAGHSAVCLRDSEGKLWVAESGRGNGVEDDIIDVLPWDQWWDYELNNDSSNPT

Query:  LHFFPCILICGQNLMRL------------------------------------LHGVASAMTIWKQVKPTLAANLWNEALNIRLGTKGLDLPEILVEVER
        + F P        L                                        H VASAMTIWKQV+PTLAANLWNEALNIRLGTK LDLPEILVEVE+
Subjt:  LHFFPCILICGQNLMRL------------------------------------LHGVASAMTIWKQVKPTLAANLWNEALNIRLGTKGLDLPEILVEVER

Query:  QGSSFGELLAIPEHDDWIYADGKSASCVAFVLEMYKAAGLFGPLASSIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGDKVKLPYCQILGKYRMDLPGYN
        QGSSFGELLAIPE DDWIYADGKSASCVAFVLEMYKAAGLFGPLA+SIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGD VKLPYCQILGKYRMDLPGYN
Subjt:  QGSSFGELLAIPEHDDWIYADGKSASCVAFVLEMYKAAGLFGPLASSIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGDKVKLPYCQILGKYRMDLPGYN

Query:  SIDPYLRMNERCPSLPTEYYRPKGC
        SIDPY  MNERCPSLPTEYYRP  C
Subjt:  SIDPYLRMNERCPSLPTEYYRPKGC

XP_022940158.1 uncharacterized protein LOC111445868 [Cucurbita moschata]1.4e-24980.38Show/hide
Query:  SSCSLILRTILLVSIVGFLLPSQSLALKSPFAPRDVLPLLPRQVSWPILNYLNNAADLLPTFVGAVSSPENTIQWQGACFDKNTAWLEFHNKSGTEFGGG
        SSCS I RT LLVSIVGFLLPSQSLALKS F PRDVLPLLP QVSWPILNYLNNAADLLP FVG+VSSPEN+IQWQG+CF KN+AWLEFHNKSG+E+GGG
Subjt:  SSCSLILRTILLVSIVGFLLPSQSLALKSPFAPRDVLPLLPRQVSWPILNYLNNAADLLPTFVGAVSSPENTIQWQGACFDKNTAWLEFHNKSGTEFGGG

Query:  TLHIKVSSAHSWTCMDLYIFATSYRVTWDYYFLSREHTLEFAEWEGREEYEYVKKAGVSIFLMQSGVLKTLQALLDVLPLFSNSDYGEQANIKFLESHMS
        TLHIK + AHSWTCMD YIFAT YRVTWDYYFL+REHTLE AEWEG+EE+EYVKKAGVSIFL+Q+GVLKT QAL DVLPLFS +DYGE++NIKFLE+HM 
Subjt:  TLHIKVSSAHSWTCMDLYIFATSYRVTWDYYFLSREHTLEFAEWEGREEYEYVKKAGVSIFLMQSGVLKTLQALLDVLPLFSNSDYGEQANIKFLESHMS

Query:  TTFKERPHPWTTGVDVDDIQSGDFLALSKIRGPWGGFETLQKWVTGSYAGHSAVCLRDSEGKLWVAESGRGNGVEDDIIDVLPWDQWWDYELNNDSSNPT
         TF+ERPHPWTT V+VDDIQSGDFLALSKIRGPWG FETLQKWVTGSYAGHSAVCLRD EGKLWVAESGRGNGVEDDIIDVLPWD+WWDYELNNDSSNP 
Subjt:  TTFKERPHPWTTGVDVDDIQSGDFLALSKIRGPWGGFETLQKWVTGSYAGHSAVCLRDSEGKLWVAESGRGNGVEDDIIDVLPWDQWWDYELNNDSSNPT

Query:  LHFFPCILICGQNLMRL------------------------------------LHGVASAMTIWKQVKPTLAANLWNEALNIRLGTKGLDLPEILVEVER
        + F P        L                                        H VASAMTIWKQV+PTLAANLWNEALNIRLGTK LDLPEILVEVE+
Subjt:  LHFFPCILICGQNLMRL------------------------------------LHGVASAMTIWKQVKPTLAANLWNEALNIRLGTKGLDLPEILVEVER

Query:  QGSSFGELLAIPEHDDWIYADGKSASCVAFVLEMYKAAGLFGPLASSIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGDKVKLPYCQILGKYRMDLPGYN
        QGSSFGELLAIPE DDWIYADGKSASCVAFVLEMYKAAGLFGPLA+SIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGD VKLPYCQILGKYRMDLPGYN
Subjt:  QGSSFGELLAIPEHDDWIYADGKSASCVAFVLEMYKAAGLFGPLASSIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGDKVKLPYCQILGKYRMDLPGYN

Query:  SIDPYLRMNERCPSLPTEYYRPKGC
        SIDPY  MNERCPSLPTEYYRP  C
Subjt:  SIDPYLRMNERCPSLPTEYYRPKGC

XP_022981430.1 uncharacterized protein LOC111480552 [Cucurbita maxima]8.9e-24979.92Show/hide
Query:  SSQSSCSLILRTILLVSIVGFLLPSQSLALKSPFAPRDVLPLLPRQVSWPILNYLNNAADLLPTFVGAVSSPENTIQWQGACFDKNTAWLEFHNKSGTEF
        S  SSCS I RT LLVSIVGFLLPSQSLALKS F PRDVLPLLP QVSWPILNYLNNAADLLP FVG+VSSPEN+IQWQG+CF  N+AWLEFHNKSG+E+
Subjt:  SSQSSCSLILRTILLVSIVGFLLPSQSLALKSPFAPRDVLPLLPRQVSWPILNYLNNAADLLPTFVGAVSSPENTIQWQGACFDKNTAWLEFHNKSGTEF

Query:  GGGTLHIKVSSAHSWTCMDLYIFATSYRVTWDYYFLSREHTLEFAEWEGREEYEYVKKAGVSIFLMQSGVLKTLQALLDVLPLFSNSDYGEQANIKFLES
        GGGTLHIK + AHSWTCMD YIFAT YRVTWDYYFL+REHTLE AEWEG+EE+EYVKKAGVSIFL+Q+GVLKT QAL DVLPLFS +DYGE++NIKFLE+
Subjt:  GGGTLHIKVSSAHSWTCMDLYIFATSYRVTWDYYFLSREHTLEFAEWEGREEYEYVKKAGVSIFLMQSGVLKTLQALLDVLPLFSNSDYGEQANIKFLES

Query:  HMSTTFKERPHPWTTGVDVDDIQSGDFLALSKIRGPWGGFETLQKWVTGSYAGHSAVCLRDSEGKLWVAESGRGNGVEDDIIDVLPWDQWWDYELNNDSS
        HM  TFKERP PWTT V+VDDIQSGDFLALSKIRGPWG FETLQKWVTGSYAGHSAVCLRD EGKLWVAESGRGNGVEDDIIDVLPWD+WWDYELNNDSS
Subjt:  HMSTTFKERPHPWTTGVDVDDIQSGDFLALSKIRGPWGGFETLQKWVTGSYAGHSAVCLRDSEGKLWVAESGRGNGVEDDIIDVLPWDQWWDYELNNDSS

Query:  NPTLHFFPCILICGQNLMRLL------------------------------------HGVASAMTIWKQVKPTLAANLWNEALNIRLGTKGLDLPEILVE
        NP + F P        L                                        H VASAMTIWKQV+PTLAANLWNEALNIRLGTK LDLPEILVE
Subjt:  NPTLHFFPCILICGQNLMRLL------------------------------------HGVASAMTIWKQVKPTLAANLWNEALNIRLGTKGLDLPEILVE

Query:  VERQGSSFGELLAIPEHDDWIYADGKSASCVAFVLEMYKAAGLFGPLASSIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGDKVKLPYCQILGKYRMDLP
        VE+QGSSFGELLAIPE DDWIYADGKSASCVAFVLEMYKAAGLFGPLA+SIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGD VKLPYCQILGKYRMDLP
Subjt:  VERQGSSFGELLAIPEHDDWIYADGKSASCVAFVLEMYKAAGLFGPLASSIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGDKVKLPYCQILGKYRMDLP

Query:  GYNSIDPYLRMNERCPSLPTEYYRPKGC
        GYNSIDPY  MNERCPSLPTEYYRP  C
Subjt:  GYNSIDPYLRMNERCPSLPTEYYRPKGC

XP_023523736.1 uncharacterized protein LOC111787879 [Cucurbita pepo subsp. pepo]4.7e-25080.11Show/hide
Query:  SSQSSCSLILRTILLVSIVGFLLPSQSLALKSPFAPRDVLPLLPRQVSWPILNYLNNAADLLPTFVGAVSSPENTIQWQGACFDKNTAWLEFHNKSGTEF
        S  SSCS I RT LLVSIVGFLLPSQSLALKS F PRDV PLLPRQVSWPILNYLNNAADLLP FVG+VSSPEN+IQWQG+CF KN+AWLEFHNKSG+E+
Subjt:  SSQSSCSLILRTILLVSIVGFLLPSQSLALKSPFAPRDVLPLLPRQVSWPILNYLNNAADLLPTFVGAVSSPENTIQWQGACFDKNTAWLEFHNKSGTEF

Query:  GGGTLHIKVSSAHSWTCMDLYIFATSYRVTWDYYFLSREHTLEFAEWEGREEYEYVKKAGVSIFLMQSGVLKTLQALLDVLPLFSNSDYGEQANIKFLES
        GGGTLHIK + AHSWTCMD YIFAT YRVTWDYYFL+REHTLE AEWEG+EE+EYVKKAGVSIFL+Q+GVLKT QAL DVLPLFS +DYGE++NIKFLE+
Subjt:  GGGTLHIKVSSAHSWTCMDLYIFATSYRVTWDYYFLSREHTLEFAEWEGREEYEYVKKAGVSIFLMQSGVLKTLQALLDVLPLFSNSDYGEQANIKFLES

Query:  HMSTTFKERPHPWTTGVDVDDIQSGDFLALSKIRGPWGGFETLQKWVTGSYAGHSAVCLRDSEGKLWVAESGRGNGVEDDIIDVLPWDQWWDYELNNDSS
        HM  TF+ERPHPWTT V+VDDIQSGDFLALSKIRGPWG FETLQKWVTGSYAGHSAVCLRD EGKLWVAESGRGNGVEDDIIDVLPWD+WWDYELNNDSS
Subjt:  HMSTTFKERPHPWTTGVDVDDIQSGDFLALSKIRGPWGGFETLQKWVTGSYAGHSAVCLRDSEGKLWVAESGRGNGVEDDIIDVLPWDQWWDYELNNDSS

Query:  NPTLHFFPCILICGQNLMRL------------------------------------LHGVASAMTIWKQVKPTLAANLWNEALNIRLGTKGLDLPEILVE
        NP + F P        L                                        H VASAMTIWKQV+PTLAANLWNEALNIRLGTK LDLPEILVE
Subjt:  NPTLHFFPCILICGQNLMRL------------------------------------LHGVASAMTIWKQVKPTLAANLWNEALNIRLGTKGLDLPEILVE

Query:  VERQGSSFGELLAIPEHDDWIYADGKSASCVAFVLEMYKAAGLFGPLASSIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGDKVKLPYCQILGKYRMDLP
        VE+QGSSFGELLAIPE DDWIYADGKSASCVAFVLEMYKAAGLFGPLA+SIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGD VKLPYCQILGKYRMDLP
Subjt:  VERQGSSFGELLAIPEHDDWIYADGKSASCVAFVLEMYKAAGLFGPLASSIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGDKVKLPYCQILGKYRMDLP

Query:  GYNSIDPYLRMNERCPSLPTEYYRPKGC
        GYNSIDPY  MNERCPSLPTEYYRP  C
Subjt:  GYNSIDPYLRMNERCPSLPTEYYRPKGC

XP_038899518.1 uncharacterized protein LOC120086795 [Benincasa hispida]2.6e-24879.59Show/hide
Query:  ASSSSSSSQSSCSLILRTILLVSIVGFLLPSQSLALKSPFAPRDVLPLLPRQVSWPILNYLNNAADLLPTFVGAVSSPENTIQWQGACFDKNTAWLEFHN
        ASSSS SS+S CS I RTILLVSIVGFLL SQ  ALK PF+P D+LPLLPRQVSWPILNYLNNAADLLPTFVGAV+SPEN+IQWQGACF KNTAWLEFHN
Subjt:  ASSSSSSSQSSCSLILRTILLVSIVGFLLPSQSLALKSPFAPRDVLPLLPRQVSWPILNYLNNAADLLPTFVGAVSSPENTIQWQGACFDKNTAWLEFHN

Query:  KSGTEFGGGTLHIKVSSAHSWTCMDLYIFATSYRVTWDYYFLSREHTLEFAEWEGREEYEYVKKAGVSIFLMQSGVLKTLQALLDVLPLFSNSDYGEQAN
        KSG+EFGGGTLHIK S AHSWTCMDLYIFAT YR+TWDYYFLSREHTLE AEWEG+EE+EYVK+AGVSIFL+Q+GVLKTLQALLDVLPLF+ SD+GEQ+N
Subjt:  KSGTEFGGGTLHIKVSSAHSWTCMDLYIFATSYRVTWDYYFLSREHTLEFAEWEGREEYEYVKKAGVSIFLMQSGVLKTLQALLDVLPLFSNSDYGEQAN

Query:  IKFLESHMSTTFKERPHPWTTGVDVDDIQSGDFLALSKIRGPWGGFETLQKWVTGSYAGHSAVCLRDSEGKLWVAESGRGNGVEDDIIDVLPWDQWWDYE
        IKFLE+HM TTF+ER HPWTT VDVDDI SGDFLALSKIRGPWG FETLQKWVTGSYAGHSAVCLRDSEGKLWVAESGRGNGVEDDIID+LPWD+WWDY 
Subjt:  IKFLESHMSTTFKERPHPWTTGVDVDDIQSGDFLALSKIRGPWGGFETLQKWVTGSYAGHSAVCLRDSEGKLWVAESGRGNGVEDDIIDVLPWDQWWDYE

Query:  LNNDSSNPTLHFFP-----------------CILICGQ---------NLMRLLHG----------VASAMTIWKQVKPTLAANLWNEALNIRLGTKGLDL
        LNNDSSNP + F P                    + G+         + +   HG          VAS MTIW +V+PTLAANLWNEALNIRLGTKGLDL
Subjt:  LNNDSSNPTLHFFP-----------------CILICGQ---------NLMRLLHG----------VASAMTIWKQVKPTLAANLWNEALNIRLGTKGLDL

Query:  PEILVEVERQGSSFGELLAIPEHDDWIYADGKSASCVAFVLEMYKAAGLFGPLASSIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGDKVKLPYCQILGK
        PEILVEVE+QGSSFGELLAIPE DDWIYADGKSASCVAF+LEMYK AGLFG LASSIQVTEFT+KDAYTLNFYENNSSRLPKWC+GGD VKLPYCQILGK
Subjt:  PEILVEVERQGSSFGELLAIPEHDDWIYADGKSASCVAFVLEMYKAAGLFGPLASSIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGDKVKLPYCQILGK

Query:  YRMDLPGYNSIDPYLRMNERCPSLPTEYYRPKGC
        YRMDL GYNSIDPY  MNERCPSLPTEY+RPK C
Subjt:  YRMDLPGYNSIDPYLRMNERCPSLPTEYYRPKGC

TrEMBL top hitse value%identityAlignment
A0A1S3BYV4 uncharacterized protein LOC1034949413.4e-24678.24Show/hide
Query:  SSSSSSSQSSCSLILRTILLVSIVGFLLPSQSLALKSPFAPRDVLPLLPRQVSWPILNYLNNAADLLPTFVGAVSSPENTIQWQGACFDKNTAWLEFHNK
        +SSSSSSQS  S I RTILLVS+VGFLLPSQ  ALKSPF+P D+LPLLPRQVSWP+LNYLNNAADLLPTFVGAV+SP+N+IQWQGACF KNTAW+EFHNK
Subjt:  SSSSSSSQSSCSLILRTILLVSIVGFLLPSQSLALKSPFAPRDVLPLLPRQVSWPILNYLNNAADLLPTFVGAVSSPENTIQWQGACFDKNTAWLEFHNK

Query:  SGTEFGGGTLHIKVSSAHSWTCMDLYIFATSYRVTWDYYFLSREHTLEFAEWEGREEYEYVKKAGVSIFLMQSGVLKTLQALLDVLPLFSNSDYGEQANI
        SG+EFGGGTLHIK S AHSWTCMDLYIFAT YR+TWDYYFLSREHTLE AEWEG+EE+EYVKKAGVSIFL+Q+GVLKTLQAL+DVLPLF+ SD+GEQ+NI
Subjt:  SGTEFGGGTLHIKVSSAHSWTCMDLYIFATSYRVTWDYYFLSREHTLEFAEWEGREEYEYVKKAGVSIFLMQSGVLKTLQALLDVLPLFSNSDYGEQANI

Query:  KFLESHMSTTFKERPHPWTTGVDVDDIQSGDFLALSKIRGPWGGFETLQKWVTGSYAGHSAVCLRDSEGKLWVAESGRGNGVEDDIIDVLPWDQWWDYEL
        KFLE+HM TTF+ER HPWTT +DVDDI SGDFLALSKIRGPWG FETLQKWVTGSYAGHSAV LRDSEG+LWVAESGRG GVEDDIID+LPWD+WWDY L
Subjt:  KFLESHMSTTFKERPHPWTTGVDVDDIQSGDFLALSKIRGPWGGFETLQKWVTGSYAGHSAVCLRDSEGKLWVAESGRGNGVEDDIIDVLPWDQWWDYEL

Query:  NNDSSNPTLHFFP-----------------CILICGQ---------NLMRLLHG----------VASAMTIWKQVKPTLAANLWNEALNIRLGTKGLDLP
        NNDSSNP + F P                    + G+         + +   HG          VAS MTIW +V+PTLAANLWNEALNIRLGTKGLDLP
Subjt:  NNDSSNPTLHFFP-----------------CILICGQ---------NLMRLLHG----------VASAMTIWKQVKPTLAANLWNEALNIRLGTKGLDLP

Query:  EILVEVERQGSSFGELLAIPEHDDWIYADGKSASCVAFVLEMYKAAGLFGPLASSIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGDKVKLPYCQILGKY
        EILVEVE+QGSSFGELLAIPE DDWIYADGKSASCVAF+L+MYK AGLFGPLASSIQVTEFT+KDAYTLNFYENNS+RLPKWCHG D VKLPYCQILGKY
Subjt:  EILVEVERQGSSFGELLAIPEHDDWIYADGKSASCVAFVLEMYKAAGLFGPLASSIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGDKVKLPYCQILGKY

Query:  RMDLPGYNSIDPYLRMNERCPSLPTEYYRPKGC
        RMDL GYNSIDPY  MNERCPSLPTEY+RPK C
Subjt:  RMDLPGYNSIDPYLRMNERCPSLPTEYYRPKGC

A0A6J1F7G1 uncharacterized protein LOC111443053 isoform X43.5e-24377.11Show/hide
Query:  SSSSSSSQSSCSLILRTILLVSIVGFLLPSQSLALKSPFAPRDVLPLLPRQVSWPILNYLNNAADLLPTFVGAVSSPENTIQWQGACFDKNTAWLEFHNK
        S ++SSSQS CSLI RT+LLVS++ FL PSQ  ALKSPF+P D+L LLPRQVSWPILNYLNNA DLLPTFVGAVSSP+ +IQWQGACFDKNTAWLEFHN 
Subjt:  SSSSSSSQSSCSLILRTILLVSIVGFLLPSQSLALKSPFAPRDVLPLLPRQVSWPILNYLNNAADLLPTFVGAVSSPENTIQWQGACFDKNTAWLEFHNK

Query:  SGTEFGGGTLHIKVSSAHSWTCMDLYIFATSYRVTWDYYFLSREHTLEFAEWEGREEYEYVKKAGVSIFLMQSGVLKTLQALLDVLPLFSNSDYGEQANI
        SG+EFGGGTLHIK S+AHSWTCMDLYIFAT YRVTWDYYF SREHTLEF EWEG++E+EYVK+AGVSIFL+Q+GVL+TLQAL DVLPLF+NSD+GEQ+NI
Subjt:  SGTEFGGGTLHIKVSSAHSWTCMDLYIFATSYRVTWDYYFLSREHTLEFAEWEGREEYEYVKKAGVSIFLMQSGVLKTLQALLDVLPLFSNSDYGEQANI

Query:  KFLESHMSTTFKERPHPWTTGVDVDDIQSGDFLALSKIRGPWGGFETLQKWVTGSYAGHSAVCLRDSEGKLWVAESGRGNGVEDDIIDVLPWDQWWDYEL
        KFLE+HM  TFKER  PWTT V+VDDI SGDFLALSK RGPWG FETLQKWVTGS+AGHSAVCLRDSEG LWVAESGRG GVEDDIIDVLPWD+WW+YEL
Subjt:  KFLESHMSTTFKERPHPWTTGVDVDDIQSGDFLALSKIRGPWGGFETLQKWVTGSYAGHSAVCLRDSEGKLWVAESGRGNGVEDDIIDVLPWDQWWDYEL

Query:  NNDSSNPTLHFFPCILICGQNLMRL------------------------------------LHGVASAMTIWKQVKPTLAANLWNEALNIRLGTKGLDLP
        NNDSSNP + F P        L                                        H VASAMTIW QV+PTLAANLWNEALNIRLGTKGLDLP
Subjt:  NNDSSNPTLHFFPCILICGQNLMRL------------------------------------LHGVASAMTIWKQVKPTLAANLWNEALNIRLGTKGLDLP

Query:  EILVEVERQGSSFGELLAIPEHDDWIYADGKSASCVAFVLEMYKAAGLFGPLASSIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGDKVKLPYCQILGKY
        EILVEVE+QGSSFGELLAIPE DDWIYADGKSASCVAF+LEMYKAAGLFGPL+SSIQVTEFT+KDAYTLNF+ENNSSRLPKWC+GGD VKLPYCQILGKY
Subjt:  EILVEVERQGSSFGELLAIPEHDDWIYADGKSASCVAFVLEMYKAAGLFGPLASSIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGDKVKLPYCQILGKY

Query:  RMDLPGYNSIDPYLRMNERCPSLPTEYYRPKGC
        RM LPGYNSI+PY  MNERCPSLPT+YYRPK C
Subjt:  RMDLPGYNSIDPYLRMNERCPSLPTEYYRPKGC

A0A6J1FNI4 uncharacterized protein LOC1114458686.6e-25080.38Show/hide
Query:  SSCSLILRTILLVSIVGFLLPSQSLALKSPFAPRDVLPLLPRQVSWPILNYLNNAADLLPTFVGAVSSPENTIQWQGACFDKNTAWLEFHNKSGTEFGGG
        SSCS I RT LLVSIVGFLLPSQSLALKS F PRDVLPLLP QVSWPILNYLNNAADLLP FVG+VSSPEN+IQWQG+CF KN+AWLEFHNKSG+E+GGG
Subjt:  SSCSLILRTILLVSIVGFLLPSQSLALKSPFAPRDVLPLLPRQVSWPILNYLNNAADLLPTFVGAVSSPENTIQWQGACFDKNTAWLEFHNKSGTEFGGG

Query:  TLHIKVSSAHSWTCMDLYIFATSYRVTWDYYFLSREHTLEFAEWEGREEYEYVKKAGVSIFLMQSGVLKTLQALLDVLPLFSNSDYGEQANIKFLESHMS
        TLHIK + AHSWTCMD YIFAT YRVTWDYYFL+REHTLE AEWEG+EE+EYVKKAGVSIFL+Q+GVLKT QAL DVLPLFS +DYGE++NIKFLE+HM 
Subjt:  TLHIKVSSAHSWTCMDLYIFATSYRVTWDYYFLSREHTLEFAEWEGREEYEYVKKAGVSIFLMQSGVLKTLQALLDVLPLFSNSDYGEQANIKFLESHMS

Query:  TTFKERPHPWTTGVDVDDIQSGDFLALSKIRGPWGGFETLQKWVTGSYAGHSAVCLRDSEGKLWVAESGRGNGVEDDIIDVLPWDQWWDYELNNDSSNPT
         TF+ERPHPWTT V+VDDIQSGDFLALSKIRGPWG FETLQKWVTGSYAGHSAVCLRD EGKLWVAESGRGNGVEDDIIDVLPWD+WWDYELNNDSSNP 
Subjt:  TTFKERPHPWTTGVDVDDIQSGDFLALSKIRGPWGGFETLQKWVTGSYAGHSAVCLRDSEGKLWVAESGRGNGVEDDIIDVLPWDQWWDYELNNDSSNPT

Query:  LHFFPCILICGQNLMRL------------------------------------LHGVASAMTIWKQVKPTLAANLWNEALNIRLGTKGLDLPEILVEVER
        + F P        L                                        H VASAMTIWKQV+PTLAANLWNEALNIRLGTK LDLPEILVEVE+
Subjt:  LHFFPCILICGQNLMRL------------------------------------LHGVASAMTIWKQVKPTLAANLWNEALNIRLGTKGLDLPEILVEVER

Query:  QGSSFGELLAIPEHDDWIYADGKSASCVAFVLEMYKAAGLFGPLASSIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGDKVKLPYCQILGKYRMDLPGYN
        QGSSFGELLAIPE DDWIYADGKSASCVAFVLEMYKAAGLFGPLA+SIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGD VKLPYCQILGKYRMDLPGYN
Subjt:  QGSSFGELLAIPEHDDWIYADGKSASCVAFVLEMYKAAGLFGPLASSIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGDKVKLPYCQILGKYRMDLPGYN

Query:  SIDPYLRMNERCPSLPTEYYRPKGC
        SIDPY  MNERCPSLPTEYYRP  C
Subjt:  SIDPYLRMNERCPSLPTEYYRPKGC

A0A6J1IFU0 uncharacterized protein LOC111476862 isoform X13.9e-24276.55Show/hide
Query:  SSSSSSSQSSCSLILRTILLVSIVGFLLPSQSLALKSPFAPRDVLPLLPRQVSWPILNYLNNAADLLPTFVGAVSSPENTIQWQGACFDKNTAWLEFHNK
        S ++SSSQS CSLI RT+LLVS+V FL P Q  ALKSPF+P D+LPLLPRQVSWPILNYLNNA DLLPTFVGAVSSP+ +IQWQGACFDKNTAWLEFHN 
Subjt:  SSSSSSSQSSCSLILRTILLVSIVGFLLPSQSLALKSPFAPRDVLPLLPRQVSWPILNYLNNAADLLPTFVGAVSSPENTIQWQGACFDKNTAWLEFHNK

Query:  SGTEFGGGTLHIKVSSAHSWTCMDLYIFATSYRVTWDYYFLSREHTLEFAEWEGREEYEYVKKAGVSIFLMQSGVLKTLQALLDVLPLFSNSDYGEQANI
        SG+EFGGGTLHIK S+AHSWTCMDLYIFAT YRVTWDYYF SREHTLEF EWEG++E+EYV++AGV+IFL+Q+GVL+TLQAL DVLPLF+NSD+GEQ+NI
Subjt:  SGTEFGGGTLHIKVSSAHSWTCMDLYIFATSYRVTWDYYFLSREHTLEFAEWEGREEYEYVKKAGVSIFLMQSGVLKTLQALLDVLPLFSNSDYGEQANI

Query:  KFLESHMSTTFKERPHPWTTGVDVDDIQSGDFLALSKIRGPWGGFETLQKWVTGSYAGHSAVCLRDSEGKLWVAESGRGNGVEDDIIDVLPWDQWWDYEL
        KFLE+HM TTFKE   PWTT V+VDDI SGDFLALSK RGPWG FETLQKWVTGS+AGHSAVCLRDSEG LWVAESGRG GVEDDIIDVLPWD+WW+YEL
Subjt:  KFLESHMSTTFKERPHPWTTGVDVDDIQSGDFLALSKIRGPWGGFETLQKWVTGSYAGHSAVCLRDSEGKLWVAESGRGNGVEDDIIDVLPWDQWWDYEL

Query:  NNDSSNPTLHFFPCILICGQNLMRL------------------------------------LHGVASAMTIWKQVKPTLAANLWNEALNIRLGTKGLDLP
        NNDSSNP + F P        L                                        H VASAMTIW QV+PTLAANLWNEALNIRLGTKGLDLP
Subjt:  NNDSSNPTLHFFPCILICGQNLMRL------------------------------------LHGVASAMTIWKQVKPTLAANLWNEALNIRLGTKGLDLP

Query:  EILVEVERQGSSFGELLAIPEHDDWIYADGKSASCVAFVLEMYKAAGLFGPLASSIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGDKVKLPYCQILGKY
        EILVEVE+QGSSFGELLAIPE DDW YADGKSASCVAF+LEMYKAAGLFGPL+S+IQVTEFT+KDAYTLNF+ENNSSRLPKWC+GGD VKLPYCQILGKY
Subjt:  EILVEVERQGSSFGELLAIPEHDDWIYADGKSASCVAFVLEMYKAAGLFGPLASSIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGDKVKLPYCQILGKY

Query:  RMDLPGYNSIDPYLRMNERCPSLPTEYYRPKGC
        RM LPGYNSI+PY  MNERCPSLPT+YYRPK C
Subjt:  RMDLPGYNSIDPYLRMNERCPSLPTEYYRPKGC

A0A6J1IZH3 uncharacterized protein LOC1114805524.3e-24979.92Show/hide
Query:  SSQSSCSLILRTILLVSIVGFLLPSQSLALKSPFAPRDVLPLLPRQVSWPILNYLNNAADLLPTFVGAVSSPENTIQWQGACFDKNTAWLEFHNKSGTEF
        S  SSCS I RT LLVSIVGFLLPSQSLALKS F PRDVLPLLP QVSWPILNYLNNAADLLP FVG+VSSPEN+IQWQG+CF  N+AWLEFHNKSG+E+
Subjt:  SSQSSCSLILRTILLVSIVGFLLPSQSLALKSPFAPRDVLPLLPRQVSWPILNYLNNAADLLPTFVGAVSSPENTIQWQGACFDKNTAWLEFHNKSGTEF

Query:  GGGTLHIKVSSAHSWTCMDLYIFATSYRVTWDYYFLSREHTLEFAEWEGREEYEYVKKAGVSIFLMQSGVLKTLQALLDVLPLFSNSDYGEQANIKFLES
        GGGTLHIK + AHSWTCMD YIFAT YRVTWDYYFL+REHTLE AEWEG+EE+EYVKKAGVSIFL+Q+GVLKT QAL DVLPLFS +DYGE++NIKFLE+
Subjt:  GGGTLHIKVSSAHSWTCMDLYIFATSYRVTWDYYFLSREHTLEFAEWEGREEYEYVKKAGVSIFLMQSGVLKTLQALLDVLPLFSNSDYGEQANIKFLES

Query:  HMSTTFKERPHPWTTGVDVDDIQSGDFLALSKIRGPWGGFETLQKWVTGSYAGHSAVCLRDSEGKLWVAESGRGNGVEDDIIDVLPWDQWWDYELNNDSS
        HM  TFKERP PWTT V+VDDIQSGDFLALSKIRGPWG FETLQKWVTGSYAGHSAVCLRD EGKLWVAESGRGNGVEDDIIDVLPWD+WWDYELNNDSS
Subjt:  HMSTTFKERPHPWTTGVDVDDIQSGDFLALSKIRGPWGGFETLQKWVTGSYAGHSAVCLRDSEGKLWVAESGRGNGVEDDIIDVLPWDQWWDYELNNDSS

Query:  NPTLHFFPCILICGQNLMRLL------------------------------------HGVASAMTIWKQVKPTLAANLWNEALNIRLGTKGLDLPEILVE
        NP + F P        L                                        H VASAMTIWKQV+PTLAANLWNEALNIRLGTK LDLPEILVE
Subjt:  NPTLHFFPCILICGQNLMRLL------------------------------------HGVASAMTIWKQVKPTLAANLWNEALNIRLGTKGLDLPEILVE

Query:  VERQGSSFGELLAIPEHDDWIYADGKSASCVAFVLEMYKAAGLFGPLASSIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGDKVKLPYCQILGKYRMDLP
        VE+QGSSFGELLAIPE DDWIYADGKSASCVAFVLEMYKAAGLFGPLA+SIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGD VKLPYCQILGKYRMDLP
Subjt:  VERQGSSFGELLAIPEHDDWIYADGKSASCVAFVLEMYKAAGLFGPLASSIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGDKVKLPYCQILGKYRMDLP

Query:  GYNSIDPYLRMNERCPSLPTEYYRPKGC
        GYNSIDPY  MNERCPSLPTEYYRP  C
Subjt:  GYNSIDPYLRMNERCPSLPTEYYRPKGC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G70160.1 unknown protein3.3e-16153.98Show/hide
Query:  ALKSPFAPRDVLPLLPRQVSWPILNYLNNAADLLPTFVGAVSSPENTIQWQGACFDKNTAWLEF--HNKSGTEFGGGTLHIKVSSAHSWTCMDLYIFATS
        +LK PF+  DVLP+LPRQVSWP+LN  +NA DLLP F+G+V+    +I+W+GACF+ N A L+    ++     GGG LH+K S AHS TCMDLY+FAT 
Subjt:  ALKSPFAPRDVLPLLPRQVSWPILNYLNNAADLLPTFVGAVSSPENTIQWQGACFDKNTAWLEF--HNKSGTEFGGGTLHIKVSSAHSWTCMDLYIFATS

Query:  YRVTWDYYFLSREHTLEFAEWEGREEYEYVKKAGVSIFLMQSGVLKTLQALLDVLPLFSNSDYGEQANIKFLESHMSTTFKERPHPWTTGVDVDDIQSGD
        YR+TWDYYF +R+HTL F  WE + E EYVK+ GVS+FLM SG+L TL +L+DVLPLFSN+ +G+ AN+ FL  HM  TF++R  PW + ++ +D+ SGD
Subjt:  YRVTWDYYFLSREHTLEFAEWEGREEYEYVKKAGVSIFLMQSGVLKTLQALLDVLPLFSNSDYGEQANIKFLESHMSTTFKERPHPWTTGVDVDDIQSGD

Query:  FLALSKIRGPWGGFETLQKWVTGSYAGHSAVCLRDSEGKLWVAESGRGNGVEDDIIDVLPWDQWWDYELNNDSSNPTLHFFPC-----------------
        FLA+SKIRG WGGFETL+KWVTG++AGH+AVCL+D  G LWV ESG  N   ++II V+PWD+WW+  L  D+SNP +   P                  
Subjt:  FLALSKIRGPWGGFETLQKWVTGSYAGHSAVCLRDSEGKLWVAESGRGNGVEDDIIDVLPWDQWWDYELNNDSSNPTLHFFPC-----------------

Query:  -----------------ILICGQNLMRLL--HGVASAMTIWKQVKPTLAANLWNEALNIRLGTKGLDLPEILVEVERQGSSFGELLAIPEHDDWIYADGK
                         I   G N    L  H V S M++W +V+P  AAN+WNEALN RLGT+ LDL  IL E  R+G SF ELL IPE D+W+Y+DGK
Subjt:  -----------------ILICGQNLMRLL--HGVASAMTIWKQVKPTLAANLWNEALNIRLGTKGLDLPEILVEVERQGSSFGELLAIPEHDDWIYADGK

Query:  SASCVAFVLEMYKAAGLFGPLASSIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGDKVKLPYCQILGKYRMDLPGYNSIDPYLRMNERCPSLPTEYYRPK
        S +CVAF+L MYKAAG+F PLA  IQVTEFT++DAYTL  +E+N +RLP WC+  ++ KL +CQILG+YRM+LPGYN+I PY  MN+ CPSLP  Y RP 
Subjt:  SASCVAFVLEMYKAAGLFGPLASSIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGDKVKLPYCQILGKYRMDLPGYNSIDPYLRMNERCPSLPTEYYRPK

Query:  GC
         C
Subjt:  GC

AT4G27020.1 unknown protein1.3e-19258.52Show/hide
Query:  SSQSSCSLILRTILLVSIVGFLLPSQSLALKSPFAPRDVLPLLPRQVSWPILNYLNNAADLLPTFVGAVSSPENTIQWQGACFDKNTAWLEFHNKSGTEF
        ++ SS  +   T LLV ++ F       + K PF PRD+LPL PRQVSWP++N LN A DLLPTF+G+ S   + ++W+GACF +N AWLE +NKSG+EF
Subjt:  SSQSSCSLILRTILLVSIVGFLLPSQSLALKSPFAPRDVLPLLPRQVSWPILNYLNNAADLLPTFVGAVSSPENTIQWQGACFDKNTAWLEFHNKSGTEF

Query:  GGGTLHIKVSSAHSWTCMDLYIFATSYRVTWDYYFLSREHTLEFAEWEGREEYEYVKKAGVSIFLMQSGVLKTLQALLDVLPLFSNSDYGEQANIKFLES
        GGGTLHIKV  AHSWTCMD+Y+F T YRVTWD+YF SREHT+EF EWEG+ EYEYVK+ GVSIFLM++G+L TL+AL DV PLF+N+ +GE +NI FL++
Subjt:  GGGTLHIKVSSAHSWTCMDLYIFATSYRVTWDYYFLSREHTLEFAEWEGREEYEYVKKAGVSIFLMQSGVLKTLQALLDVLPLFSNSDYGEQANIKFLES

Query:  HMSTTFKERPHPWTTGVDVDDIQSGDFLALSKIRGPWGGFETLQKWVTGSYAGHSAVCLRDSEGKLWVAESGRGNGVEDDIIDVLPWDQWWDYELNNDSS
        HM   F  RP PW T +  D+I SGD LA+SKIRG WGGFETL+KWV+G+YAGH+AVCLRDSEGKLWV ESG  N   +D+I +LPW++WW++E   D S
Subjt:  HMSTTFKERPHPWTTGVDVDDIQSGDFLALSKIRGPWGGFETLQKWVTGSYAGHSAVCLRDSEGKLWVAESGRGNGVEDDIIDVLPWDQWWDYELNNDSS

Query:  NPTLHFFPC--------ILICGQNLMRLLHG----------------------------VASAMTIWKQVKPTLAANLWNEALNIRLGTKGLDLPEILVE
        NP +   P          +       R + G                            VAS MT+W +++P  AAN+WNEALN RLGT+GLDLP++LVE
Subjt:  NPTLHFFPC--------ILICGQNLMRLLHG----------------------------VASAMTIWKQVKPTLAANLWNEALNIRLGTKGLDLPEILVE

Query:  VERQGSSFGELLAIPEHDDWIYADGKSASCVAFVLEMYKAAGLFGPLASSIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGDKVKLPYCQILGKYRMDLP
        VE++GSSF ELLA+PE DDWIY+DGKS SC+AF+LEMYK AGLF P++SSIQVTEFT+KDAY L F+E+N+SR PKWC+  D VKLPYCQILGKYRM+LP
Subjt:  VERQGSSFGELLAIPEHDDWIYADGKSASCVAFVLEMYKAAGLFGPLASSIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGDKVKLPYCQILGKYRMDLP

Query:  GYNSIDPYLRMNERCPSLPTEYYRPKGC
        GYN+++PY  MNE CPSLP +Y+RPK C
Subjt:  GYNSIDPYLRMNERCPSLPTEYYRPKGC

AT5G54870.1 unknown protein2.4e-19961.75Show/hide
Query:  MAASSSSSSSQSSCSLILRTILLVSIVGFLLPSQSLALKSPFAPRDVLPLLPRQVSWPILNYLNNAADLLPTFVGAVSSPENTIQWQGACFDKNTAWLEF
        MA+SSSSSSS S     L +IL + +V  L+ S + ++KSPF PRD+LP LPRQVSWPILN L  AADLLPTF+G  SS  +++ W+GACF +NTAWLEF
Subjt:  MAASSSSSSSQSSCSLILRTILLVSIVGFLLPSQSLALKSPFAPRDVLPLLPRQVSWPILNYLNNAADLLPTFVGAVSSPENTIQWQGACFDKNTAWLEF

Query:  HNKSGTEFGGGTLHIKVSSAHSWTCMDLYIFATSYRVTWDYYFLSREHTLEFAEWEGREEYEYVKKAGVSIFLMQSGVLKTLQALLDVLPLFSNSDYGEQ
        HNKSG+EFGGGTLHIK   AHSWTCMDLY+FAT YRVTW +YF+SR+HT+EF EW+G+ EYEYVK  GVSIFLM +G+L TLQAL DV PLFSN+ +GE 
Subjt:  HNKSGTEFGGGTLHIKVSSAHSWTCMDLYIFATSYRVTWDYYFLSREHTLEFAEWEGREEYEYVKKAGVSIFLMQSGVLKTLQALLDVLPLFSNSDYGEQ

Query:  ANIKFLESHMSTTFKERPHPWTTGVDVDDIQSGDFLALSKIRGPWGGFETLQKWVTGSYAGHSAVCLRDSEGKLWVAESGRGNGVEDDIIDVLPWDQWWD
        +N+ FLE HM   F+ RP PW T V  D IQSGD LA+SKIRG WGGFETL+KWV+G+YAGHSAV LRDSEGKLWV ESG  N   +D+I +LPW++WW 
Subjt:  ANIKFLESHMSTTFKERPHPWTTGVDVDDIQSGDFLALSKIRGPWGGFETLQKWVTGSYAGHSAVCLRDSEGKLWVAESGRGNGVEDDIIDVLPWDQWWD

Query:  YELNNDSSNPTLHFFPC----------------------------------ILICGQNLMRLL--HGVASAMTIWKQVKPTLAANLWNEALNIRLGTKGL
        +E   D SNP +   P                                   I    +N    L  H VAS MT+W Q++P  AAN+WNEALN RLGT+GL
Subjt:  YELNNDSSNPTLHFFPC----------------------------------ILICGQNLMRLL--HGVASAMTIWKQVKPTLAANLWNEALNIRLGTKGL

Query:  DLPEILVEVERQGSSFGELLAIPEHDDWIYADGKSASCVAFVLEMYKAAGLFGPLASSIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGDKVKLPYCQIL
        DL ++LVEVE++GSSF +LLA+PE DDWIY+DGKS SC+AF+LEMYK AGLFGPLASSIQVTEFT+KDAY LNF+ENN+SRLP WC+  D VKLPYCQIL
Subjt:  DLPEILVEVERQGSSFGELLAIPEHDDWIYADGKSASCVAFVLEMYKAAGLFGPLASSIQVTEFTLKDAYTLNFYENNSSRLPKWCHGGDKVKLPYCQIL

Query:  GKYRMDLPGYNSIDPYLRMNERCPSLPTEYYRPKGC
        GKYRM+LPGYN+++PY  MNE+CP+LP +Y RP  C
Subjt:  GKYRMDLPGYNSIDPYLRMNERCPSLPTEYYRPKGC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCTTCTTCTTCTTCTTCTTCGTCTCAATCTTCTTGTTCTTTGATTCTAAGAACTATTCTTCTTGTTTCTATTGTGGGTTTTCTTCTTCCATCGCAATCCCTGGC
CCTGAAATCGCCGTTTGCCCCTCGCGACGTGCTTCCACTACTACCCAGACAGGTCTCATGGCCGATTCTCAACTATCTCAACAACGCCGCCGATCTTCTTCCCACTTTCG
TCGGTGCTGTCTCGTCTCCTGAGAACACTATTCAGTGGCAAGGGGCTTGCTTCGATAAGAACACCGCTTGGCTGGAATTTCATAACAAGTCTGGCACTGAATTTGGCGGT
GGTACGCTTCATATAAAGGTTAGCAGTGCACATAGTTGGACGTGTATGGATCTTTATATCTTTGCAACTTCATACCGCGTGACGTGGGATTACTATTTCCTGTCCAGAGA
ACATACTCTTGAGTTTGCAGAATGGGAAGGAAGGGAAGAATATGAATATGTTAAGAAGGCAGGGGTTTCGATTTTCCTCATGCAATCTGGAGTGTTGAAAACCCTTCAAG
CATTGTTGGATGTCCTCCCTTTGTTTTCAAATTCCGACTACGGCGAGCAAGCTAATATTAAATTTCTTGAGAGTCACATGAGCACTACCTTTAAGGAACGTCCTCATCCA
TGGACTACGGGTGTTGATGTTGATGATATTCAATCTGGAGATTTTTTAGCACTTTCCAAAATCCGTGGTCCATGGGGTGGTTTTGAGACTCTACAGAAGTGGGTTACTGG
ATCATATGCTGGTCATTCTGCTGTTTGCTTAAGGGACTCTGAAGGAAAGTTGTGGGTTGCTGAATCTGGGCGTGGAAATGGAGTGGAGGATGATATTATTGATGTTTTAC
CATGGGATCAGTGGTGGGACTACGAACTCAACAACGATTCATCCAATCCCACATTGCATTTCTTCCCTTGCATCCTGATTTGCGGGCAAAATTTAATGAGACTGCTGCAT
GGAGTTGCTTCTGCTATGACAATATGGAAACAAGTGAAGCCAACCCTTGCGGCAAACTTGTGGAATGAAGCTTTGAACATACGACTTGGAACAAAGGGCCTTGACCTTCC
CGAGATTTTAGTTGAAGTTGAAAGGCAAGGATCATCTTTTGGCGAACTACTGGCTATTCCTGAACATGATGACTGGATCTATGCTGACGGGAAATCAGCCTCTTGTGTTG
CTTTTGTCCTTGAAATGTACAAGGCGGCTGGGCTTTTTGGTCCACTTGCTAGCTCAATTCAAGTTACAGAGTTCACTCTGAAAGATGCATACACTCTCAACTTTTATGAG
AACAATTCAAGCCGCCTCCCAAAATGGTGTCATGGTGGGGACAAAGTCAAGCTTCCATATTGTCAGATACTTGGAAAATATCGTATGGATTTACCTGGATATAATAGCAT
CGATCCCTATCTACGTATGAATGAAAGGTGCCCTTCTCTTCCAACCGAGTACTACCGTCCAAAGGGTTGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTGCTTCTTCTTCTTCTTCTTCGTCTCAATCTTCTTGTTCTTTGATTCTAAGAACTATTCTTCTTGTTTCTATTGTGGGTTTTCTTCTTCCATCGCAATCCCTGGC
CCTGAAATCGCCGTTTGCCCCTCGCGACGTGCTTCCACTACTACCCAGACAGGTCTCATGGCCGATTCTCAACTATCTCAACAACGCCGCCGATCTTCTTCCCACTTTCG
TCGGTGCTGTCTCGTCTCCTGAGAACACTATTCAGTGGCAAGGGGCTTGCTTCGATAAGAACACCGCTTGGCTGGAATTTCATAACAAGTCTGGCACTGAATTTGGCGGT
GGTACGCTTCATATAAAGGTTAGCAGTGCACATAGTTGGACGTGTATGGATCTTTATATCTTTGCAACTTCATACCGCGTGACGTGGGATTACTATTTCCTGTCCAGAGA
ACATACTCTTGAGTTTGCAGAATGGGAAGGAAGGGAAGAATATGAATATGTTAAGAAGGCAGGGGTTTCGATTTTCCTCATGCAATCTGGAGTGTTGAAAACCCTTCAAG
CATTGTTGGATGTCCTCCCTTTGTTTTCAAATTCCGACTACGGCGAGCAAGCTAATATTAAATTTCTTGAGAGTCACATGAGCACTACCTTTAAGGAACGTCCTCATCCA
TGGACTACGGGTGTTGATGTTGATGATATTCAATCTGGAGATTTTTTAGCACTTTCCAAAATCCGTGGTCCATGGGGTGGTTTTGAGACTCTACAGAAGTGGGTTACTGG
ATCATATGCTGGTCATTCTGCTGTTTGCTTAAGGGACTCTGAAGGAAAGTTGTGGGTTGCTGAATCTGGGCGTGGAAATGGAGTGGAGGATGATATTATTGATGTTTTAC
CATGGGATCAGTGGTGGGACTACGAACTCAACAACGATTCATCCAATCCCACATTGCATTTCTTCCCTTGCATCCTGATTTGCGGGCAAAATTTAATGAGACTGCTGCAT
GGAGTTGCTTCTGCTATGACAATATGGAAACAAGTGAAGCCAACCCTTGCGGCAAACTTGTGGAATGAAGCTTTGAACATACGACTTGGAACAAAGGGCCTTGACCTTCC
CGAGATTTTAGTTGAAGTTGAAAGGCAAGGATCATCTTTTGGCGAACTACTGGCTATTCCTGAACATGATGACTGGATCTATGCTGACGGGAAATCAGCCTCTTGTGTTG
CTTTTGTCCTTGAAATGTACAAGGCGGCTGGGCTTTTTGGTCCACTTGCTAGCTCAATTCAAGTTACAGAGTTCACTCTGAAAGATGCATACACTCTCAACTTTTATGAG
AACAATTCAAGCCGCCTCCCAAAATGGTGTCATGGTGGGGACAAAGTCAAGCTTCCATATTGTCAGATACTTGGAAAATATCGTATGGATTTACCTGGATATAATAGCAT
CGATCCCTATCTACGTATGAATGAAAGGTGCCCTTCTCTTCCAACCGAGTACTACCGTCCAAAGGGTTGCTAG
Protein sequenceShow/hide protein sequence
MAASSSSSSSQSSCSLILRTILLVSIVGFLLPSQSLALKSPFAPRDVLPLLPRQVSWPILNYLNNAADLLPTFVGAVSSPENTIQWQGACFDKNTAWLEFHNKSGTEFGG
GTLHIKVSSAHSWTCMDLYIFATSYRVTWDYYFLSREHTLEFAEWEGREEYEYVKKAGVSIFLMQSGVLKTLQALLDVLPLFSNSDYGEQANIKFLESHMSTTFKERPHP
WTTGVDVDDIQSGDFLALSKIRGPWGGFETLQKWVTGSYAGHSAVCLRDSEGKLWVAESGRGNGVEDDIIDVLPWDQWWDYELNNDSSNPTLHFFPCILICGQNLMRLLH
GVASAMTIWKQVKPTLAANLWNEALNIRLGTKGLDLPEILVEVERQGSSFGELLAIPEHDDWIYADGKSASCVAFVLEMYKAAGLFGPLASSIQVTEFTLKDAYTLNFYE
NNSSRLPKWCHGGDKVKLPYCQILGKYRMDLPGYNSIDPYLRMNERCPSLPTEYYRPKGC