CuGenDBv2

CSPI03G38850 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI03G38850
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Plant protein of unknown function (DUF247)
Genome location: Chr3:33460244..33461584
RNA-Seq Expression: CSPI03G38850
Synteny: CSPI03G38850
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR004158 - Protein of unknown function DUF247, plant
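The genome location above can be split into its parts and turned into a span length. The following is a minimal sketch, not part of the database record: the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr3:33460244..33461584")
print(chrom, span_length(start, end))  # Chr3 1341
```

Under that assumption the span is 1341 bp, which equals the length of a coding sequence for a 446-residue protein plus a stop codon (446 × 3 + 3).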


Homology
GenBank top hits | E-value | % identity
KAA0053776.1 UPF0481 protein [Cucumis melo var. makuwa] | 2.3e-107 | 72.32
Query:  MHKPNDS---VEVSYMAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAA
        MH+PN+S    EVSY+ +LIQNKLQ+LP +TEECCIYRVSKRLVNI+PT+YEPQLISIGPFHHGRE LK MEQFKL+FL RYLSRLSR+ LSFEVVVKAA
Subjt:  MHKPNDS---VEVSYMAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAA

Query:  LEWETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIA--SEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTV
        LEWETKARKCYEDC ISMNSHDFVHMLLVDGCF+VEFL+A   E LQTQTTSRVDPLVS+AMN+NLYHDLI+LENQLPFFV+                  
Subjt:  LEWETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIA--SEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTV

Query:  LVNIVHNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMG
                 Q NF+KH+ +IP NI S   K+I HL+DFLGFYY P T D INQGN+R LFLPPSTTELYEAGVILEKA+TT+D YNIMG
Subjt:  LVNIVHNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMG

KAE8651212.1 hypothetical protein Csa_001883 [Cucumis sativus] | 3.1e-229 | 90.81
Query:  MHKPNDSVEVSYMAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEW
        MHKPNDSVEVSY+AALIQNKLQNLPCVTEECCIYRVSKRLVNIYP+VYEPQLISIGPFHHGREHLKLMEQFKLQFLLR                      
Subjt:  MHKPNDSVEVSYMAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEW

Query:  ETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIV
                      MNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMN+NLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIV
Subjt:  ETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIV

Query:  HNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIH
        HNFFQANFMKHYCKIPQNIFSPT+KNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIH
Subjt:  HNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIH

Query:  DLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQW
        DLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQW
Subjt:  DLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQW

Query:  MAILKRDYFNTPWTITSFIFAVIFALITLLQTTFTIYHTIDKDRHV
        MAILKRDYFNTPWTITSFIFAVIFALITLLQTTFTIYHT DKDRHV
Subjt:  MAILKRDYFNTPWTITSFIFAVIFALITLLQTTFTIYHTIDKDRHV

XP_004147504.1 UPF0481 protein At3g47200 [Cucumis sativus] | 1.0e-187 | 99.1
Query:  MNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNFFQANFMKHYCK
        MNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMN+NLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNFFQANFMKHYCK
Subjt:  MNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNFFQANFMKHYCK

Query:  IPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIHDLFEITMRNLLAFE
        IPQNIFSPT+KNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIHDLFEITMRNLLAFE
Subjt:  IPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIHDLFEITMRNLLAFE

Query:  NFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQWMAILKRDYFNTPWT
        NFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQWMAILKRDYFNTPWT
Subjt:  NFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQWMAILKRDYFNTPWT

Query:  ITSFIFAVIFALITLLQTTFTIYHTIDKDRHV
        ITSFIFAVIFALITLLQTTFTIYHT DKDRHV
Subjt:  ITSFIFAVIFALITLLQTTFTIYHTIDKDRHV

XP_008443397.1 PREDICTED: LOW QUALITY PROTEIN: UPF0481 protein At3g47200-like [Cucumis melo] | 9.9e-199 | 80.94
Query:  MHKPNDS---VEVSYMAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAA
        MH+PN+S    EVSY+ +LIQNKLQ+LP +TEECCIYRVSKRLVNI+PT+YEPQLISIGPFHHGRE LK MEQFKL+FL RYLSRLSR+ LSFEVVVKAA
Subjt:  MHKPNDS---VEVSYMAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAA

Query:  LEWETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIA--SEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNN-DDSFT
        LEWETKARKCYEDC ISMNSHDFVHMLLVDGCF+VEFL+A   E LQTQTTSRVDPLVS+AMN+NLYHDLI+LENQLPFFV+QGL  FI +PNN DD F 
Subjt:  LEWETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIA--SEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNN-DDSFT

Query:  VLVNIVHNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKI
        VLVNIVHNFFQ NF+KH+ +IP NI S   K+I HL+DFLGFYY P T D INQGN+R LFLPPSTTELYEAGVILEKA+TT+D YNIMG SFEGGVLKI
Subjt:  VLVNIVHNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKI

Query:  PPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCD
        PPFEIHDLFEITMRNLLAFENFQGGS SESSAIHYI FLGALISKEKDSSLLMKKGILSNLIGGSD EVSNMFNNIGKGV FRGHF YDSTSRNLRKHCD
Subjt:  PPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCD

Query:  AKSNQWMAILKRDYFNTPWTITSFIFAVIFALITLLQTTFTIYHTI
        A+SN+WMAILKRDY NTPW I S +   I  LITLL+T   +YH +
Subjt:  AKSNQWMAILKRDYFNTPWTITSFIFAVIFALITLLQTTFTIYHTI

XP_038904513.1 UPF0481 protein At3g47200-like [Benincasa hispida] | 3.8e-150 | 64.41
Query:  MHKPNDSVEV---SY-MAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKA
        MH+PN++ E+   SY MA LIQ +L+ LP VTEECCI+RVSKRL+NI+ T YEPQLISIGPFHHGR+ LK MEQFKLQFL R+++R++R+ LS++ VV+ 
Subjt:  MHKPNDSVEV---SY-MAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKA

Query:  ALE-WETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIA----SEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDD
        AL  WET+AR CYED A +MNSHDFV M+LVDGCF+VEFL++      Q Q+ +TSRVDPLV KAMN+NLYHDLI+LENQLPFFVLQ L   I     D+
Subjt:  ALE-WETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIA----SEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDD

Query:  SFTVLVNIVHNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGV
        SFT LV+I+H FF  NFMKH C+ PQN   P ++NIRHLV FL FYYSPT  DII   N++ L LPPS TEL+EAGVILEK  T     NI+ ++F+ GV
Subjt:  SFTVLVNIVHNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGV

Query:  LKIPPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRK
        LKIPPFEIH LFEI MRNL+AFENFQG + ++S AIHY+LFLGALIS+EKDSSLLMKKGI++NLIGGSDEEVSNMFNNIGKGV F+GHF Y+  S++L K
Subjt:  LKIPPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRK

Query:  HCDAKSNQWMAILKRDYFNTPWTITSFIFAVIFALITLLQTTFT
        HC  + N+WMA L+RDY NTPW   S + A+       LQT F+
Subjt:  HCDAKSNQWMAILKRDYFNTPWTITSFIFAVIFALITLLQTTFT

TrEMBL top hits | E-value | % identity
A0A0A0LC32 Uncharacterized protein | 4.3e-248 | 96.64
Query:  MHKPNDSVEVSYMAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEW
        MHKPNDSVEVSY+AALIQNKLQNLPCVTEECCIYRVSKRLVNIYP+VYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSF          
Subjt:  MHKPNDSVEVSYMAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEW

Query:  ETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIV
        ETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMN+NLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIV
Subjt:  ETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIV

Query:  HNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIH
        HNFFQANFMKHYCKIPQNIFSPT+KNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIH
Subjt:  HNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIH

Query:  DLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQW
        DLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQW
Subjt:  DLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQW

Query:  MAILKRDYFNTPWTITSFIFAVIFALITLLQTTFTIYHTIDKDRHV
        MAILKRDYFNTPWTITSFIFAVIFALITLLQTTFTIYHT DKDRHV
Subjt:  MAILKRDYFNTPWTITSFIFAVIFALITLLQTTFTIYHTIDKDRHV

A0A1S3B8P8 LOW QUALITY PROTEIN: UPF0481 protein At3g47200-like | 4.8e-199 | 80.94
Query:  MHKPNDS---VEVSYMAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAA
        MH+PN+S    EVSY+ +LIQNKLQ+LP +TEECCIYRVSKRLVNI+PT+YEPQLISIGPFHHGRE LK MEQFKL+FL RYLSRLSR+ LSFEVVVKAA
Subjt:  MHKPNDS---VEVSYMAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAA

Query:  LEWETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIA--SEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNN-DDSFT
        LEWETKARKCYEDC ISMNSHDFVHMLLVDGCF+VEFL+A   E LQTQTTSRVDPLVS+AMN+NLYHDLI+LENQLPFFV+QGL  FI +PNN DD F 
Subjt:  LEWETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIA--SEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNN-DDSFT

Query:  VLVNIVHNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKI
        VLVNIVHNFFQ NF+KH+ +IP NI S   K+I HL+DFLGFYY P T D INQGN+R LFLPPSTTELYEAGVILEKA+TT+D YNIMG SFEGGVLKI
Subjt:  VLVNIVHNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKI

Query:  PPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCD
        PPFEIHDLFEITMRNLLAFENFQGGS SESSAIHYI FLGALISKEKDSSLLMKKGILSNLIGGSD EVSNMFNNIGKGV FRGHF YDSTSRNLRKHCD
Subjt:  PPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCD

Query:  AKSNQWMAILKRDYFNTPWTITSFIFAVIFALITLLQTTFTIYHTI
        A+SN+WMAILKRDY NTPW I S +   I  LITLL+T   +YH +
Subjt:  AKSNQWMAILKRDYFNTPWTITSFIFAVIFALITLLQTTFTIYHTI

A0A5D3DPP4 UPF0481 protein | 1.1e-107 | 72.32
Query:  MHKPNDS---VEVSYMAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAA
        MH+PN+S    EVSY+ +LIQNKLQ+LP +TEECCIYRVSKRLVNI+PT+YEPQLISIGPFHHGRE LK MEQFKL+FL RYLSRLSR+ LSFEVVVKAA
Subjt:  MHKPNDS---VEVSYMAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAA

Query:  LEWETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIA--SEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTV
        LEWETKARKCYEDC ISMNSHDFVHMLLVDGCF+VEFL+A   E LQTQTTSRVDPLVS+AMN+NLYHDLI+LENQLPFFV+                  
Subjt:  LEWETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIA--SEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTV

Query:  LVNIVHNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMG
                 Q NF+KH+ +IP NI S   K+I HL+DFLGFYY P T D INQGN+R LFLPPSTTELYEAGVILEKA+TT+D YNIMG
Subjt:  LVNIVHNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMG

A0A6J1DXD6 UPF0481 protein At3g47200-like isoform X2 | 4.2e-86 | 43.79
Query:  MHKPNDSVEVSYMAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEW
        MH  N    V  + + I+  LQ LP + EEC I+RV +RL+      Y PQ+ISIGPFHHGR+ L  MEQ KL+FL RYL R +      EV V     W
Subjt:  MHKPNDSVEVSYMAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEW

Query:  ETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIV
        ET AR CY +  I+M+S +FV M+LVDGCF+VE ++   ++ ++T +R DPL+  AM  +LY DLI+LENQLPFFVLQGL    D+ + +   + L  + 
Subjt:  ETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIV

Query:  HNFFQAN--FMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGND-----RLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLK
        H F+           ++P  +   T K + HLVDFL FYY+P    +    +      +    PP+ TEL+EAG++ +KA+      +IM ISF+  VL+
Subjt:  HNFFQAN--FMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGND-----RLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLK

Query:  IPPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHF-CYDSTSRNLRKH
        IPP EI D+FE  +RNL+AFE +         AI Y LFL  LIS+E+D SLL+K  I++N IGG+++EVS +FN++ K V  RG   C++  +  L +H
Subjt:  IPPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHF-CYDSTSRNLRKH

Query:  CDAKSNQWMAILKRDYFNTPWTITSFIFAVIFALITLLQTTFT
        C A+ N+ MA L+RDYFNTPW   SF+ A    L+T LQT F+
Subjt:  CDAKSNQWMAILKRDYFNTPWTITSFIFAVIFALITLLQTTFT

A0A6J1E120 UPF0481 protein At3g47200-like isoform X1 | 4.2e-86 | 43.79
Query:  MHKPNDSVEVSYMAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEW
        MH  N    V  + + I+  LQ LP + EEC I+RV +RL+      Y PQ+ISIGPFHHGR+ L  MEQ KL+FL RYL R +      EV V     W
Subjt:  MHKPNDSVEVSYMAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEW

Query:  ETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIV
        ET AR CY +  I+M+S +FV M+LVDGCF+VE ++   ++ ++T +R DPL+  AM  +LY DLI+LENQLPFFVLQGL    D+ + +   + L  + 
Subjt:  ETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIV

Query:  HNFFQAN--FMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGND-----RLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLK
        H F+           ++P  +   T K + HLVDFL FYY+P    +    +      +    PP+ TEL+EAG++ +KA+      +IM ISF+  VL+
Subjt:  HNFFQAN--FMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGND-----RLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLK

Query:  IPPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHF-CYDSTSRNLRKH
        IPP EI D+FE  +RNL+AFE +         AI Y LFL  LIS+E+D SLL+K  I++N IGG+++EVS +FN++ K V  RG   C++  +  L +H
Subjt:  IPPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHF-CYDSTSRNLRKH

Query:  CDAKSNQWMAILKRDYFNTPWTITSFIFAVIFALITLLQTTFT
        C A+ N+ MA L+RDYFNTPW   SF+ A    L+T LQT F+
Subjt:  CDAKSNQWMAILKRDYFNTPWTITSFIFAVIFALITLLQTTFT

SwissProt top hits | E-value | % identity
P0C897 Putative UPF0481 protein At3g02645 | 7.5e-16 | 21.56
Query:  IYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEWETKARKCYEDCAISMNSHDFVHMLLVDGCFVV
        I+ V K L+  +P  Y P  +SIGP+H  +  L  ME++KL    +   R       F  +V+     E K R CY    I  N    + ++ VD  F++
Subjt:  IYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEWETKARKCYEDCAISMNSHDFVHMLLVDGCFVV

Query:  EFLIASEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYF-------------------------------------------------
        EF      L+  +  +V+ L+++  +  +  D++++ENQ+P FVL+  L F                                                 
Subjt:  EFLIASEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYF-------------------------------------------------

Query:  ----------------------IDEPNNDDSFTVLVNIVHNF---FQANFMKHYCKIPQNIFS--PTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFL
                               DE   + +   +  I H F   F +       + P  I S  P    ++   D+L F           Q +  +L +
Subjt:  ----------------------IDEPNNDDSFTVLVNIVHNF---FQANFMKHYCKIPQNIFS--PTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFL

Query:  P----------PSTTELYEAGVILEKAITTNDHYNIMGISFE--GGVLKIPPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSS
                   PS ++L++AGV  +       H NI  ++F+   G   +P   +    E  +RNL+A+E     ++       Y   +  +I  E+D  
Subjt:  P----------PSTTELYEAGVILEKAITTNDHYNIMGISFE--GGVLKIPPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSS

Query:  LLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQWMAILKRDYFNTPWTITSFIFAVIFALITLLQ
        LL ++G+L + +  SD+E + M+N + K VR       D T  ++ ++   +    +  L   Y    W I +F+ AV+  ++  LQ
Subjt:  LLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQWMAILKRDYFNTPWTITSFIFAVIFALITLLQ

Q9SD53 UPF0481 protein At3g47200 | 8.5e-36 | 28.01
Query:  EECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEWETKARKCYEDCAISMNSHDFVHMLLVDG
        E CCI+RV +  V + P  Y+P+++SIGP+H+G +HL++++Q K + L  +L    ++ +   V+VKA ++ E K RK Y +       HD + M+++DG
Subjt:  EECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEWETKARKCYEDCAISMNSHDFVHMLLVDG

Query:  CFV-VEFLIASEQLQTQTTSRVDPLVS-KAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNFFQANFMKH--YCKIPQNIFSPTQ
        CF+ + FLI S  ++       DP+ S   +  ++  DL+LLENQ+PFFVLQ L        + D    L  I  +FF+    K   Y +  +N      
Subjt:  CFV-VEFLIASEQLQTQTTSRVDPLVS-KAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNFFQANFMKH--YCKIPQNIFSPTQ

Query:  KNIRHLVDFLGFYYSPTTTDI-----------INQG--------NDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIHDLFEI
           +HL+D +   + P T++            +++G        + + + L  S   L   G+      +  D  +I+ +  +   L+IP          
Subjt:  KNIRHLVDFLGFYYSPTTTDI-----------INQG--------NDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIHDLFEI

Query:  TMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCY-DSTSRNLRKHCDAKSNQWMAIL
           N +AFE F   S++E +   YI+F+G L++ E+D + L    ++     GS+ EVS  F  I K V F     Y ++  + + ++     N   A  
Subjt:  TMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCY-DSTSRNLRKHCDAKSNQWMAIL

Query:  KRDYFNTPWTITSFIFAVIFALITLLQTTFTI
        +  +F +PWT  S    +   L+T+LQ+T  I
Subjt:  KRDYFNTPWTITSFIFAVIFALITLLQTTFTI

Arabidopsis top hits | E-value | % identity
AT3G50150.1 Plant protein of unknown function (DUF247) | 2.1e-42 | 31.62
Query:  EECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEWETKARKCYEDCAISMNSHDFVHMLLVDG
        ++ CIYRV   L       Y PQ +SIGP+HHG+ HL+ ME+ K + +   ++R      + E+ + A  E E +AR CY+      NS++F  ML++DG
Subjt:  EECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEWETKARKCYEDCAISMNSHDFVHMLLVDG

Query:  CFVVEFLIASEQ-LQTQTTSRVDPLVSK-AMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNFFQANFMKHYCKIPQNIFSPTQKN
        CFV+E    + Q  Q    +R DP+ +K  +  ++  D+I+LENQLP FVL  LL    +    +   ++  +   FF+                 +Q+ 
Subjt:  CFVVEFLIASEQ-LQTQTTSRVDPLVSK-AMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNFFQANFMKHYCKIPQNIFSPTQKN

Query:  IRHLVDFLGFYYSPT-------TTDIINQGN--------DRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIHDLFEITMRNLL
           L D  G +           +++  NQG         ++   L    TEL  AGV   +  T      +  I F+ G LKIP   IHD  +    NL+
Subjt:  IRHLVDFLGFYYSPT-------TTDIINQGN--------DRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIHDLFEITMRNLL

Query:  AFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCY-DSTSRNLRKHCDAKSNQWMAILKRDYFN
        AFE  Q  + S ++   YI+F+  LI+  +D S L   GI+ + + GSD EV+++FN + K V F     Y    SR + ++   K N   A L++ YFN
Subjt:  AFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCY-DSTSRNLRKHCDAKSNQWMAILKRDYFN

Query:  TPWTITSFIFAVIFALITLLQTTFTIY
         PW   SF  AVI   +T  Q+ F +Y
Subjt:  TPWTITSFIFAVIFALITLLQTTFTIY

AT3G50160.1 Plant protein of unknown function (DUF247) | 8.7e-44 | 33.97
Query:  EECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEWETKARKCYEDCAISMNSHDFVHMLLVDG
        +  CIYRV   L       Y PQ++SIGP+HHG +HL  ME+ K + +   ++R        E+ + A  E E KAR CY+   I+MN ++F+ ML++DG
Subjt:  EECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEWETKARKCYEDCAISMNSHDFVHMLLVDG

Query:  CFVVE-FLIASEQLQTQTTSRVDPLVS-KAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNFFQANFMKHYCKIPQNIFSPTQKN
         F++E F   SE  Q    +  DP+   + +  ++  D+++LENQLP+ VL+GLL  +  P+  D   V V +   FFQ         +P      T++ 
Subjt:  CFVVE-FLIASEQLQTQTTSRVDPLVS-KAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNFFQANFMKHYCKIPQNIFSPTQKN

Query:  IRHLVDFL--GFYYSPTTTD----IINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIHDLFEITMRNLLAFENFQGGS
          H +D L  G   S  T+D    ++N+   +L+      TEL  AGV   +  T     +   I F+ G LKIP   IHD  +    NL+AFE  Q   
Subjt:  IRHLVDFL--GFYYSPTTTD----IINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIHDLFEITMRNLLAFENFQGGS

Query:  ASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCY-DSTSRNLRKHCDAKSNQWMAILKRDYFNTPWTITSFI
         S      YI+F+  LI+  +D S L   GI+ N + GSD EVS++FN +GK V F  +  Y  + +  +  +   K N   A L+  YFN PW   SFI
Subjt:  ASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCY-DSTSRNLRKHCDAKSNQWMAILKRDYFNTPWTITSFI

Query:  FAVIFALITLLQTTFTIY
         AV   + T  Q+ F ++
Subjt:  FAVIFALITLLQTTFTIY

AT3G50170.1 Plant protein of unknown function (DUF247) | 6.9e-41 | 31.44
Query:  DSVEVSYMAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEWETKAR
        DS  +S    L Q    +   +  + CIYRV   L       Y PQ +S+GP+HHG++ L+ ME+ K + L + L RL +R    E+   A  E E KAR
Subjt:  DSVEVSYMAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEWETKAR

Query:  KCYEDCAISMNSHDFVHMLLVDGCFVVEFLIASEQLQTQT-TSRVDPLVS-KAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNF
         CYE   IS++ ++F  ML++DGCFV+E    + +  T+   +R DP+ + + +  ++  D+I+LENQLP FVL  LL    +    +   ++ ++   F
Subjt:  KCYEDCAISMNSHDFVHMLLVDGCFVVEFLIASEQLQTQT-TSRVDPLVS-KAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNF

Query:  F-----------QANFMKHYCKIPQNIFSPTQKNIRHLVD-----FLGFYYSPTTTDIINQ--GNDRLL-----FLPPSTTELYEAGVILEKAITTNDHY
        F           + +  K    + +++ +   K   H +D      L    +P T  ++ +   N R++      L    TEL EAGV   K  T     
Subjt:  F-----------QANFMKHYCKIPQNIFSPTQKNIRHLVD-----FLGFYYSPTTTDIINQ--GNDRLL-----FLPPSTTELYEAGVILEKAITTNDHY

Query:  NIMGISFEGGVLKIPPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHF
            I F+ G L+IP   IHD  +    NL+AFE  Q    S +    YI+F+  LI+  +D S L   GI+ + + GSD EV+++FN + + V F    
Subjt:  NIMGISFEGGVLKIPPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHF

Query:  CYDS-TSRNLRKHCDAKSNQWMAILKRDYFNTPWTITSFIFAVIFALITLLQTTFTIY
         + S  S ++ ++ + K N   A L   YFN PW   SF  AVI  L+TL Q+ + +Y
Subjt:  CYDS-TSRNLRKHCDAKSNQWMAILKRDYFNTPWTITSFIFAVIFALITLLQTTFTIY

AT4G31980.1 unknown protein | 2.4e-57 | 33.57
Query:  IQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEWETKARKCYEDCAISMN
        I+ KL  L  ++ +CCIY+V  +L  + P  Y P+L+S GP H G+E L+ ME  K ++LL ++ R +    S E +V+ A  WE  AR CY +  + ++
Subjt:  IQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEWETKARKCYEDCAISMN

Query:  SHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMNV-NLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNFFQANFMKHYCKI
        S +FV ML+VDG F+VE L+ S   + +  +  D +   +M + ++  D+IL+ENQLPFFV++ +   +       + +++        Q +F     +I
Subjt:  SHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMNV-NLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNFFQANFMKHYCKI

Query:  PQNIFSPTQKNIRHLVDFLGFYYSP--------TTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIHDLFEITM
            F    +   H VD L   Y P        TT  + N          P  TEL+ AGV  + A T++    ++ ISF  GVLKIP   + DL E   
Subjt:  PQNIFSPTQKNIRHLVDFLGFYYSP--------TTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIHDLFEITM

Query:  RNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQWMAILKRD
        +N++ FE  +    S  + + YI+ LG  I    D+ LL+  GI+ N +G S  +VSN+FN+I K V +   F +   S NL+ +C+   N+W AIL+RD
Subjt:  RNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQWMAILKRD

Query:  YFNTPWTITSFIFAVIFALITLLQTTFTI
        YF+ PW + S   A++  L+T +Q+  +I
Subjt:  YFNTPWTITSFIFAVIFALITLLQTTFTI

AT5G11290.1 Plant protein of unknown function (DUF247) | 2.1e-45 | 34.05
Query:  MEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEWETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSK-AMNVNLYHDLI
        ME  KL++L  ++ R +   LS E +V+ A  WE +AR CY +  + ++S ++V ML+VD  F+VE L+ S+         +D +  K  M V++ HD++
Subjt:  MEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEWETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSK-AMNVNLYHDLI

Query:  LLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEA
        LLENQLP+FV++G+   +    + +    L  I+HN    +F K +  IP    S +   I H VD L   + P        G+ R++    S  E+  A
Subjt:  LLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEA

Query:  GVILEKAITTNDHYNIMGISFEGGVLKIPPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNM
        GV L+ A   +++   + ISF  GVL IP  +I+D+ E   RN++ FE        ++  IHY+ FL   I    D+ L +  GI+ N  G + E+VS +
Subjt:  GVILEKAITTNDHYNIMGISFEGGVLKIPPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNM

Query:  FNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQWMAILKRDYFNTPWTITSFIFAVIFALITLLQTTFTI
        FN+I K   + G F Y +   NL+ HC+A  N+W A L+RDYF+ PW+  S + A +  L+T +Q   +I
Subjt:  FNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQWMAILKRDYFNTPWTITSFIFAVIFALITLLQTTFTI


Sequences
CDS sequence
ATGCATAAACCCAATGATAGCGTTGAAGTTAGCTACATGGCTGCATTAATCCAGAACAAGCTCCAAAACCTACCTTGTGTTACTGAAGAATGTTGCATCTATCGTGTTTC
CAAACGACTTGTTAACATTTACCCTACTGTCTATGAACCTCAACTCATTTCCATCGGTCCTTTTCATCATGGTCGAGAACATTTGAAGCTAATGGAACAATTTAAACTAC
AGTTTCTCCTTCGGTATCTTTCCCGTCTTTCTAGACGGCCTTTGTCGTTTGAGGTCGTTGTTAAAGCCGCTTTGGAATGGGAGACCAAAGCTCGCAAGTGTTATGAAGAT
TGTGCCATCAGCATGAATAGCCACGACTTTGTCCATATGCTGCTTGTGGATGGTTGTTTCGTGGTCGAGTTTTTGATAGCTAGTGAACAACTTCAAACTCAAACTACATC
AAGAGTTGATCCTTTAGTGTCCAAAGCTATGAACGTAAATCTATATCATGACTTGATCTTGCTCGAGAACCAGCTTCCTTTTTTCGTTCTTCAAGGTCTTCTTTACTTTA
TTGATGAACCGAACAATGACGATAGCTTTACGGTCTTAGTAAATATTGTACATAATTTTTTTCAAGCTAACTTCATGAAACATTATTGTAAGATTCCTCAAAATATCTTC
TCCCCCACTCAAAAAAATATAAGGCATTTGGTAGATTTCTTAGGCTTTTACTATTCCCCGACCACTACAGACATTATTAATCAAGGCAACGACAGATTGTTATTTCTTCC
TCCATCTACGACTGAGCTGTATGAAGCTGGAGTGATCTTGGAGAAAGCAATAACTACAAATGATCATTACAACATTATGGGCATAAGTTTTGAAGGCGGGGTTCTTAAAA
TCCCACCTTTTGAAATTCATGATCTCTTTGAAATCACCATGAGGAATTTGTTGGCATTTGAGAATTTTCAAGGTGGAAGTGCGAGTGAAAGCTCAGCAATTCATTATATT
TTGTTTTTAGGGGCTTTAATAAGTAAAGAGAAAGATTCAAGTTTACTTATGAAGAAGGGAATCCTAAGTAATCTAATTGGAGGTAGTGATGAAGAAGTTTCCAATATGTT
CAATAACATTGGTAAAGGTGTGAGATTTCGAGGACATTTTTGCTATGATAGTACAAGCAGAAATTTACGTAAACATTGCGACGCAAAAAGTAACCAATGGATGGCTATAT
TGAAGCGTGATTATTTCAATACGCCATGGACTATTACTTCTTTCATTTTTGCCGTCATATTCGCCCTCATCACTCTACTCCAAACGACATTCACCATATACCACACTATA
GATAAAGATCGTCATGTGTAG
mRNA sequence
ATGCATAAACCCAATGATAGCGTTGAAGTTAGCTACATGGCTGCATTAATCCAGAACAAGCTCCAAAACCTACCTTGTGTTACTGAAGAATGTTGCATCTATCGTGTTTC
CAAACGACTTGTTAACATTTACCCTACTGTCTATGAACCTCAACTCATTTCCATCGGTCCTTTTCATCATGGTCGAGAACATTTGAAGCTAATGGAACAATTTAAACTAC
AGTTTCTCCTTCGGTATCTTTCCCGTCTTTCTAGACGGCCTTTGTCGTTTGAGGTCGTTGTTAAAGCCGCTTTGGAATGGGAGACCAAAGCTCGCAAGTGTTATGAAGAT
TGTGCCATCAGCATGAATAGCCACGACTTTGTCCATATGCTGCTTGTGGATGGTTGTTTCGTGGTCGAGTTTTTGATAGCTAGTGAACAACTTCAAACTCAAACTACATC
AAGAGTTGATCCTTTAGTGTCCAAAGCTATGAACGTAAATCTATATCATGACTTGATCTTGCTCGAGAACCAGCTTCCTTTTTTCGTTCTTCAAGGTCTTCTTTACTTTA
TTGATGAACCGAACAATGACGATAGCTTTACGGTCTTAGTAAATATTGTACATAATTTTTTTCAAGCTAACTTCATGAAACATTATTGTAAGATTCCTCAAAATATCTTC
TCCCCCACTCAAAAAAATATAAGGCATTTGGTAGATTTCTTAGGCTTTTACTATTCCCCGACCACTACAGACATTATTAATCAAGGCAACGACAGATTGTTATTTCTTCC
TCCATCTACGACTGAGCTGTATGAAGCTGGAGTGATCTTGGAGAAAGCAATAACTACAAATGATCATTACAACATTATGGGCATAAGTTTTGAAGGCGGGGTTCTTAAAA
TCCCACCTTTTGAAATTCATGATCTCTTTGAAATCACCATGAGGAATTTGTTGGCATTTGAGAATTTTCAAGGTGGAAGTGCGAGTGAAAGCTCAGCAATTCATTATATT
TTGTTTTTAGGGGCTTTAATAAGTAAAGAGAAAGATTCAAGTTTACTTATGAAGAAGGGAATCCTAAGTAATCTAATTGGAGGTAGTGATGAAGAAGTTTCCAATATGTT
CAATAACATTGGTAAAGGTGTGAGATTTCGAGGACATTTTTGCTATGATAGTACAAGCAGAAATTTACGTAAACATTGCGACGCAAAAAGTAACCAATGGATGGCTATAT
TGAAGCGTGATTATTTCAATACGCCATGGACTATTACTTCTTTCATTTTTGCCGTCATATTCGCCCTCATCACTCTACTCCAAACGACATTCACCATATACCACACTATA
GATAAAGATCGTCATGTGTAG
Protein sequence
MHKPNDSVEVSYMAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEWETKARKCYED
CAISMNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNFFQANFMKHYCKIPQNIF
SPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYI
LFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQWMAILKRDYFNTPWTITSFIFAVIFALITLLQTTFTIYHTI
DKDRHV
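
The protein sequence above should be the conceptual translation of the CDS sequence. The sketch below shows one way to check this using only the Python standard library. It assumes the standard nuclear genetic code, and the function name `translate` is a hypothetical helper rather than anything provided by the database. Only the first and last 21 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 nt and last 21 nt of the CDS sequence listed on this page.
print(translate("ATGCATAAACCCAATGATAGC"))  # MHKPNDS
print(translate("GATAAAGATCGTCATGTGTAG"))  # DKDRHV
```

Both fragments reproduce the corresponding ends of the protein sequence, and the final codon TAG is read as a stop. Passing the full CDS, joined into a single string without line breaks, would allow the same comparison against the whole protein.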