; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy3G036370 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy3G036370
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionPlant protein of unknown function (DUF247)
Genome locationGy14Chr3:34540289..34541629
RNA-Seq ExpressionCsGy3G036370
SyntenyCsGy3G036370
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0053776.1 UPF0481 protein [Cucumis melo var. makuwa]2.64e-13572.66Show/hide
Query:  MHKPNDS---VEVSYIAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAA
        MH+PN+S    EVSYI +LIQNKLQ+LP +TEECCIYRVSKRLVNI+PT+YEPQLISIGPFHHGRE LK MEQFKL+FL RYLSRLSR+ LSFEVVVKAA
Subjt:  MHKPNDS---VEVSYIAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAA

Query:  LEWETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIA--SEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTV
        LEWETKARKCYEDC ISMNSHDFVHMLLVDGCF+VEFL+A   E LQTQTTSRVDPLVS+AMN+NLYHDLI+LENQLPFFV+Q                 
Subjt:  LEWETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIA--SEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTV

Query:  LVNIVHNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMG
                   NF+KH+ +IP NI S   K+I HL+DFLGFYY P T DI NQGN+R LFLPPSTTELYEAGVILEKA+TT+D YNIMG
Subjt:  LVNIVHNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMG

KAE8651212.1 hypothetical protein Csa_001883 [Cucumis sativus]5.15e-29290.36Show/hide
Query:  MHKPNDSVEVSYIAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEW
        MHKPNDSVEVSYIAALIQNKLQNLPCVTEECCIYRVSKRLVNIYP+VYEPQLISIGPFHHGREHLKLMEQFKLQFLLR                      
Subjt:  MHKPNDSVEVSYIAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEW

Query:  ETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIV
                      MNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMN+NLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIV
Subjt:  ETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIV

Query:  HNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIH
        HNFFQANFMKHYCKIPQNIFSPT+KNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIH
Subjt:  HNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIH

Query:  DLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQW
        DLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQW
Subjt:  DLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQW

Query:  MAILKRDYFNTPWTITSFIFAVIFSLITLLQTAFTIYHAIDKDRHV
        MAILKRDYFNTPWTITSFIFAVIF+LITLLQT FTIYH  DKDRHV
Subjt:  MAILKRDYFNTPWTITSFIFAVIFSLITLLQTAFTIYHAIDKDRHV

XP_004147504.1 UPF0481 protein At3g47200 [Cucumis sativus]7.20e-23898.19Show/hide
Query:  MNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNFFQANFMKHYCK
        MNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMN+NLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNFFQANFMKHYCK
Subjt:  MNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNFFQANFMKHYCK

Query:  IPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIHDLFEITMRNLLAFE
        IPQNIFSPT+KNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIHDLFEITMRNLLAFE
Subjt:  IPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIHDLFEITMRNLLAFE

Query:  NFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQWMAILKRDYFNTPWT
        NFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQWMAILKRDYFNTPWT
Subjt:  NFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQWMAILKRDYFNTPWT

Query:  ITSFIFAVIFSLITLLQTAFTIYHAIDKDRHV
        ITSFIFAVIF+LITLLQT FTIYH  DKDRHV
Subjt:  ITSFIFAVIFSLITLLQTAFTIYHAIDKDRHV

XP_008443397.1 PREDICTED: LOW QUALITY PROTEIN: UPF0481 protein At3g47200-like [Cucumis melo]1.98e-25381.53Show/hide
Query:  MHKPNDS---VEVSYIAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAA
        MH+PN+S    EVSYI +LIQNKLQ+LP +TEECCIYRVSKRLVNI+PT+YEPQLISIGPFHHGRE LK MEQFKL+FL RYLSRLSR+ LSFEVVVKAA
Subjt:  MHKPNDS---VEVSYIAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAA

Query:  LEWETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIA--SEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNN-DDSFT
        LEWETKARKCYEDC ISMNSHDFVHMLLVDGCF+VEFL+A   E LQTQTTSRVDPLVS+AMN+NLYHDLI+LENQLPFFV+QGL  FI +PNN DD F 
Subjt:  LEWETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIA--SEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNN-DDSFT

Query:  VLVNIVHNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKI
        VLVNIVHNFFQ NF+KH+ +IP NI S   K+I HL+DFLGFYY P T DI NQGN+R LFLPPSTTELYEAGVILEKA+TT+D YNIMG SFEGGVLKI
Subjt:  VLVNIVHNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKI

Query:  PPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCD
        PPFEIHDLFEITMRNLLAFENFQGGS SESSAIHYI FLGALISKEKDSSLLMKKGILSNLIGGSD EVSNMFNNIGKGV FRGHF YDSTSRNLRKHCD
Subjt:  PPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCD

Query:  AKSNQWMAILKRDYFNTPWTITSFIFAVIFSLITLLQTAFTIYH
        A+SN+WMAILKRDY NTPW I S +   I +LITLL+T   +YH
Subjt:  AKSNQWMAILKRDYFNTPWTITSFIFAVIFSLITLLQTAFTIYH

XP_038904513.1 UPF0481 protein At3g47200-like [Benincasa hispida]1.85e-18864.19Show/hide
Query:  MHKPNDSVEV---SY-IAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKA
        MH+PN++ E+   SY +A LIQ +L+ LP VTEECCI+RVSKRL+NI+ T YEPQLISIGPFHHGR+ LK MEQFKLQFL R+++R++R+ LS++ VV+ 
Subjt:  MHKPNDSVEV---SY-IAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKA

Query:  ALE-WETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIA----SEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDD
        AL  WET+AR CYED A +MNSHDFV M+LVDGCF+VEFL++      Q Q+ +TSRVDPLV KAMN+NLYHDLI+LENQLPFFVLQ L   I     D+
Subjt:  ALE-WETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIA----SEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDD

Query:  SFTVLVNIVHNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGV
        SFT LV+I+H FF  NFMKH C+ PQN   P ++NIRHLV FL FYYSPT  DII   N++ L LPPS TEL+EAGVILEK  T N    I+ ++F+ GV
Subjt:  SFTVLVNIVHNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGV

Query:  LKIPPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRK
        LKIPPFEIH LFEI MRNL+AFENFQG + ++S AIHY+LFLGALIS+EKDSSLLMKKGI++NLIGGSDEEVSNMFNNIGKGV F+GHF Y+  S++L K
Subjt:  LKIPPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRK

Query:  HCDAKSNQWMAILKRDYFNTPWTITSFIFAVIFSLITLLQTAFT
        HC  + N+WMA L+RDY NTPW   S + A+  +    LQT F+
Subjt:  HCDAKSNQWMAILKRDYFNTPWTITSFIFAVIFSLITLLQTAFT

TrEMBL top hitse value%identityAlignment
A0A0A0LC32 Uncharacterized protein0.096.19Show/hide
Query:  MHKPNDSVEVSYIAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEW
        MHKPNDSVEVSYIAALIQNKLQNLPCVTEECCIYRVSKRLVNIYP+VYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFE         
Subjt:  MHKPNDSVEVSYIAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEW

Query:  ETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIV
         TKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMN+NLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIV
Subjt:  ETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIV

Query:  HNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIH
        HNFFQANFMKHYCKIPQNIFSPT+KNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIH
Subjt:  HNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIH

Query:  DLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQW
        DLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQW
Subjt:  DLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQW

Query:  MAILKRDYFNTPWTITSFIFAVIFSLITLLQTAFTIYHAIDKDRHV
        MAILKRDYFNTPWTITSFIFAVIF+LITLLQT FTIYH  DKDRHV
Subjt:  MAILKRDYFNTPWTITSFIFAVIFSLITLLQTAFTIYHAIDKDRHV

A0A1S3B8P8 LOW QUALITY PROTEIN: UPF0481 protein At3g47200-like9.58e-25481.53Show/hide
Query:  MHKPNDS---VEVSYIAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAA
        MH+PN+S    EVSYI +LIQNKLQ+LP +TEECCIYRVSKRLVNI+PT+YEPQLISIGPFHHGRE LK MEQFKL+FL RYLSRLSR+ LSFEVVVKAA
Subjt:  MHKPNDS---VEVSYIAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAA

Query:  LEWETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIA--SEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNN-DDSFT
        LEWETKARKCYEDC ISMNSHDFVHMLLVDGCF+VEFL+A   E LQTQTTSRVDPLVS+AMN+NLYHDLI+LENQLPFFV+QGL  FI +PNN DD F 
Subjt:  LEWETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIA--SEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNN-DDSFT

Query:  VLVNIVHNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKI
        VLVNIVHNFFQ NF+KH+ +IP NI S   K+I HL+DFLGFYY P T DI NQGN+R LFLPPSTTELYEAGVILEKA+TT+D YNIMG SFEGGVLKI
Subjt:  VLVNIVHNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKI

Query:  PPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCD
        PPFEIHDLFEITMRNLLAFENFQGGS SESSAIHYI FLGALISKEKDSSLLMKKGILSNLIGGSD EVSNMFNNIGKGV FRGHF YDSTSRNLRKHCD
Subjt:  PPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCD

Query:  AKSNQWMAILKRDYFNTPWTITSFIFAVIFSLITLLQTAFTIYH
        A+SN+WMAILKRDY NTPW I S +   I +LITLL+T   +YH
Subjt:  AKSNQWMAILKRDYFNTPWTITSFIFAVIFSLITLLQTAFTIYH

A0A5D3DPP4 UPF0481 protein1.28e-13572.66Show/hide
Query:  MHKPNDS---VEVSYIAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAA
        MH+PN+S    EVSYI +LIQNKLQ+LP +TEECCIYRVSKRLVNI+PT+YEPQLISIGPFHHGRE LK MEQFKL+FL RYLSRLSR+ LSFEVVVKAA
Subjt:  MHKPNDS---VEVSYIAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAA

Query:  LEWETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIA--SEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTV
        LEWETKARKCYEDC ISMNSHDFVHMLLVDGCF+VEFL+A   E LQTQTTSRVDPLVS+AMN+NLYHDLI+LENQLPFFV+Q                 
Subjt:  LEWETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIA--SEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTV

Query:  LVNIVHNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMG
                   NF+KH+ +IP NI S   K+I HL+DFLGFYY P T DI NQGN+R LFLPPSTTELYEAGVILEKA+TT+D YNIMG
Subjt:  LVNIVHNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMG

A0A6J1DXD6 UPF0481 protein At3g47200-like isoform X21.79e-10543.79Show/hide
Query:  MHKPNDSVEVSYIAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEW
        MH  N    V  + + I+  LQ LP + EEC I+RV +RL+      Y PQ+ISIGPFHHGR+ L  MEQ KL+FL RYL R +      EV V     W
Subjt:  MHKPNDSVEVSYIAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEW

Query:  ETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIV
        ET AR CY +  I+M+S +FV M+LVDGCF+VE ++   ++ ++T +R DPL+  AM  +LY DLI+LENQLPFFVLQGL    D+ + +   + L  + 
Subjt:  ETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIV

Query:  HNFFQAN--FMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLF-----LPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLK
        H F+           ++P  +   T K + HLVDFL FYY+P    +    +   +       PP+ TEL+EAG++ +KA+      +IM ISF+  VL+
Subjt:  HNFFQAN--FMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLF-----LPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLK

Query:  IPPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHF-CYDSTSRNLRKH
        IPP EI D+FE  +RNL+AFE +         AI Y LFL  LIS+E+D SLL+K  I++N IGG+++EVS +FN++ K V  RG   C++  +  L +H
Subjt:  IPPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHF-CYDSTSRNLRKH

Query:  CDAKSNQWMAILKRDYFNTPWTITSFIFAVIFSLITLLQTAFT
        C A+ N+ MA L+RDYFNTPW   SF+ A    L+T LQT F+
Subjt:  CDAKSNQWMAILKRDYFNTPWTITSFIFAVIFSLITLLQTAFT

A0A6J1DYL4 UPF0481 protein At3g47200-like isoform X39.03e-10643.79Show/hide
Query:  MHKPNDSVEVSYIAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEW
        MH  N    V  + + I+  LQ LP + EEC I+RV +RL+      Y PQ+ISIGPFHHGR+ L  MEQ KL+FL RYL R +      EV V     W
Subjt:  MHKPNDSVEVSYIAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEW

Query:  ETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIV
        ET AR CY +  I+M+S +FV M+LVDGCF+VE ++   ++ ++T +R DPL+  AM  +LY DLI+LENQLPFFVLQGL    D+ + +   + L  + 
Subjt:  ETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIV

Query:  HNFFQAN--FMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLF-----LPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLK
        H F+           ++P  +   T K + HLVDFL FYY+P    +    +   +       PP+ TEL+EAG++ +KA+      +IM ISF+  VL+
Subjt:  HNFFQAN--FMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLF-----LPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLK

Query:  IPPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHF-CYDSTSRNLRKH
        IPP EI D+FE  +RNL+AFE +         AI Y LFL  LIS+E+D SLL+K  I++N IGG+++EVS +FN++ K V  RG   C++  +  L +H
Subjt:  IPPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHF-CYDSTSRNLRKH

Query:  CDAKSNQWMAILKRDYFNTPWTITSFIFAVIFSLITLLQTAFT
        C A+ N+ MA L+RDYFNTPW   SF+ A    L+T LQT F+
Subjt:  CDAKSNQWMAILKRDYFNTPWTITSFIFAVIFSLITLLQTAFT

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026459.8e-1621.56Show/hide
Query:  IYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEWETKARKCYEDCAISMNSHDFVHMLLVDGCFVV
        I+ V K L+  +P  Y P  +SIGP+H  +  L  ME++KL    +   R       F  +V+     E K R CY    I  N    + ++ VD  F++
Subjt:  IYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEWETKARKCYEDCAISMNSHDFVHMLLVDGCFVV

Query:  EFLIASEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYF-------------------------------------------------
        EF      L+  +  +V+ L+++  +  +  D++++ENQ+P FVL+  L F                                                 
Subjt:  EFLIASEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYF-------------------------------------------------

Query:  ----------------------IDEPNNDDSFTVLVNIVHNF---FQANFMKHYCKIPQNIFS--PTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFL
                               DE   + +   +  I H F   F +       + P  I S  P    ++   D+L F           Q +  +L +
Subjt:  ----------------------IDEPNNDDSFTVLVNIVHNF---FQANFMKHYCKIPQNIFS--PTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFL

Query:  P----------PSTTELYEAGVILEKAITTNDHYNIMGISFE--GGVLKIPPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSS
                   PS ++L++AGV  +       H NI  ++F+   G   +P   +    E  +RNL+A+E     ++       Y   +  +I  E+D  
Subjt:  P----------PSTTELYEAGVILEKAITTNDHYNIMGISFE--GGVLKIPPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSS

Query:  LLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQWMAILKRDYFNTPWTITSFIFAVIFSLITLLQ
        LL ++G+L + +  SD+E + M+N + K VR       D T  ++ ++   +    +  L   Y    W I +F+ AV+  ++  LQ
Subjt:  LLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQWMAILKRDYFNTPWTITSFIFAVIFSLITLLQ

Q9SD53 UPF0481 protein At3g472002.5e-3527.78Show/hide
Query:  EECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEWETKARKCYEDCAISMNSHDFVHMLLVDG
        E CCI+RV +  V + P  Y+P+++SIGP+H+G +HL++++Q K + L  +L    ++ +   V+VKA ++ E K RK Y +       HD + M+++DG
Subjt:  EECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEWETKARKCYEDCAISMNSHDFVHMLLVDG

Query:  CFV-VEFLIASEQLQTQTTSRVDPLVS-KAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNFFQANFMKH--YCKIPQNIFSPTQ
        CF+ + FLI S  ++       DP+ S   +  ++  DL+LLENQ+PFFVLQ L        + D    L  I  +FF+    K   Y +  +N      
Subjt:  CFV-VEFLIASEQLQTQTTSRVDPLVS-KAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNFFQANFMKH--YCKIPQNIFSPTQ

Query:  KNIRHLVDFLGFYYSPTTTDI-----------INQG--------NDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIHDLFEI
           +HL+D +   + P T++            +++G        + + + L  S   L   G+      +  D  +I+ +  +   L+IP          
Subjt:  KNIRHLVDFLGFYYSPTTTDI-----------INQG--------NDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIHDLFEI

Query:  TMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCY-DSTSRNLRKHCDAKSNQWMAIL
           N +AFE F   S++E +   YI+F+G L++ E+D + L    ++     GS+ EVS  F  I K V F     Y ++  + + ++     N   A  
Subjt:  TMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCY-DSTSRNLRKHCDAKSNQWMAIL

Query:  KRDYFNTPWTITSFIFAVIFSLITLLQTAFTI
        +  +F +PWT  S    +   L+T+LQ+   I
Subjt:  KRDYFNTPWTITSFIFAVIFSLITLLQTAFTI

Arabidopsis top hitse value%identityAlignment
AT3G50150.1 Plant protein of unknown function (DUF247)1.6e-4231.62Show/hide
Query:  EECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEWETKARKCYEDCAISMNSHDFVHMLLVDG
        ++ CIYRV   L       Y PQ +SIGP+HHG+ HL+ ME+ K + +   ++R      + E+ + A  E E +AR CY+      NS++F  ML++DG
Subjt:  EECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEWETKARKCYEDCAISMNSHDFVHMLLVDG

Query:  CFVVEFLIASEQ-LQTQTTSRVDPLVSK-AMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNFFQANFMKHYCKIPQNIFSPTQKN
        CFV+E    + Q  Q    +R DP+ +K  +  ++  D+I+LENQLP FVL  LL    +    +   ++  +   FF+                 +Q+ 
Subjt:  CFVVEFLIASEQ-LQTQTTSRVDPLVSK-AMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNFFQANFMKHYCKIPQNIFSPTQKN

Query:  IRHLVDFLGFYYSPT-------TTDIINQGN--------DRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIHDLFEITMRNLL
           L D  G +           +++  NQG         ++   L    TEL  AGV   +  T      +  I F+ G LKIP   IHD  +    NL+
Subjt:  IRHLVDFLGFYYSPT-------TTDIINQGN--------DRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIHDLFEITMRNLL

Query:  AFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCY-DSTSRNLRKHCDAKSNQWMAILKRDYFN
        AFE  Q  + S ++   YI+F+  LI+  +D S L   GI+ + + GSD EV+++FN + K V F     Y    SR + ++   K N   A L++ YFN
Subjt:  AFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCY-DSTSRNLRKHCDAKSNQWMAILKRDYFN

Query:  TPWTITSFIFAVIFSLITLLQTAFTIY
         PW   SF  AVI   +T  Q+ F +Y
Subjt:  TPWTITSFIFAVIFSLITLLQTAFTIY

AT3G50160.1 Plant protein of unknown function (DUF247)8.7e-4433.97Show/hide
Query:  EECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEWETKARKCYEDCAISMNSHDFVHMLLVDG
        +  CIYRV   L       Y PQ++SIGP+HHG +HL  ME+ K + +   ++R        E+ + A  E E KAR CY+   I+MN ++F+ ML++DG
Subjt:  EECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEWETKARKCYEDCAISMNSHDFVHMLLVDG

Query:  CFVVE-FLIASEQLQTQTTSRVDPLVS-KAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNFFQANFMKHYCKIPQNIFSPTQKN
         F++E F   SE  Q    +  DP+   + +  ++  D+++LENQLP+ VL+GLL  +  P+  D   V V +   FFQ         +P      T++ 
Subjt:  CFVVE-FLIASEQLQTQTTSRVDPLVS-KAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNFFQANFMKHYCKIPQNIFSPTQKN

Query:  IRHLVDFL--GFYYSPTTTD----IINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIHDLFEITMRNLLAFENFQGGS
          H +D L  G   S  T+D    ++N+   +L+      TEL  AGV   +  T     +   I F+ G LKIP   IHD  +    NL+AFE  Q   
Subjt:  IRHLVDFL--GFYYSPTTTD----IINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIHDLFEITMRNLLAFENFQGGS

Query:  ASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCY-DSTSRNLRKHCDAKSNQWMAILKRDYFNTPWTITSFI
         S      YI+F+  LI+  +D S L   GI+ N + GSD EVS++FN +GK V F  +  Y  + +  +  +   K N   A L+  YFN PW   SFI
Subjt:  ASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCY-DSTSRNLRKHCDAKSNQWMAILKRDYFNTPWTITSFI

Query:  FAVIFSLITLLQTAFTIY
         AV   + T  Q+ F ++
Subjt:  FAVIFSLITLLQTAFTIY

AT3G50170.1 Plant protein of unknown function (DUF247)9.0e-4130.9Show/hide
Query:  KPNDSVEVSYIAALIQNKLQNL-----PCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAA
        +P ++   S++ + I++KL+         +  + CIYRV   L       Y PQ +S+GP+HHG++ L+ ME+ K + L + L RL +R    E+   A 
Subjt:  KPNDSVEVSYIAALIQNKLQNL-----PCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAA

Query:  LEWETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIASEQLQTQT-TSRVDPLVS-KAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTV
         E E KAR CYE   IS++ ++F  ML++DGCFV+E    + +  T+   +R DP+ + + +  ++  D+I+LENQLP FVL  LL    +    +   +
Subjt:  LEWETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIASEQLQTQT-TSRVDPLVS-KAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTV

Query:  LVNIVHNFF-----------QANFMKHYCKIPQNIFSPTQKNIRHLVD-----FLGFYYSPTTTDIINQ--GNDRLL-----FLPPSTTELYEAGVILEK
        + ++   FF           + +  K    + +++ +   K   H +D      L    +P T  ++ +   N R++      L    TEL EAGV   K
Subjt:  LVNIVHNFF-----------QANFMKHYCKIPQNIFSPTQKNIRHLVD-----FLGFYYSPTTTDIINQ--GNDRLL-----FLPPSTTELYEAGVILEK

Query:  AITTNDHYNIMGISFEGGVLKIPPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGK
          T         I F+ G L+IP   IHD  +    NL+AFE  Q    S +    YI+F+  LI+  +D S L   GI+ + + GSD EV+++FN + +
Subjt:  AITTNDHYNIMGISFEGGVLKIPPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGK

Query:  GVRFRGHFCYDS-TSRNLRKHCDAKSNQWMAILKRDYFNTPWTITSFIFAVIFSLITLLQTAFTIY
         V F     + S  S ++ ++ + K N   A L   YFN PW   SF  AVI  L+TL Q+ + +Y
Subjt:  GVRFRGHFCYDS-TSRNLRKHCDAKSNQWMAILKRDYFNTPWTITSFIFAVIFSLITLLQTAFTIY

AT4G31980.1 unknown protein2.4e-5733.57Show/hide
Query:  IQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEWETKARKCYEDCAISMN
        I+ KL  L  ++ +CCIY+V  +L  + P  Y P+L+S GP H G+E L+ ME  K ++LL ++ R +    S E +V+ A  WE  AR CY +  + ++
Subjt:  IQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEWETKARKCYEDCAISMN

Query:  SHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMNV-NLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNFFQANFMKHYCKI
        S +FV ML+VDG F+VE L+ S   + +  +  D +   +M + ++  D+IL+ENQLPFFV++ +   +       + +++        Q +F     +I
Subjt:  SHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMNV-NLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNFFQANFMKHYCKI

Query:  PQNIFSPTQKNIRHLVDFLGFYYSP--------TTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIHDLFEITM
            F    +   H VD L   Y P        TT  + N          P  TEL+ AGV  + A T++    ++ ISF  GVLKIP   + DL E   
Subjt:  PQNIFSPTQKNIRHLVDFLGFYYSP--------TTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIHDLFEITM

Query:  RNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQWMAILKRD
        +N++ FE  +    S  + + YI+ LG  I    D+ LL+  GI+ N +G S  +VSN+FN+I K V +   F +   S NL+ +C+   N+W AIL+RD
Subjt:  RNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQWMAILKRD

Query:  YFNTPWTITSFIFAVIFSLITLLQTAFTI
        YF+ PW + S   A++  L+T +Q+  +I
Subjt:  YFNTPWTITSFIFAVIFSLITLLQTAFTI

AT5G11290.1 Plant protein of unknown function (DUF247)2.1e-4534.05Show/hide
Query:  MEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEWETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSK-AMNVNLYHDLI
        ME  KL++L  ++ R +   LS E +V+ A  WE +AR CY +  + ++S ++V ML+VD  F+VE L+ S+         +D +  K  M V++ HD++
Subjt:  MEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEWETKARKCYEDCAISMNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSK-AMNVNLYHDLI

Query:  LLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEA
        LLENQLP+FV++G+   +    + +    L  I+HN    +F K +  IP    S +   I H VD L   + P        G+ R++    S  E+  A
Subjt:  LLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNFFQANFMKHYCKIPQNIFSPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEA

Query:  GVILEKAITTNDHYNIMGISFEGGVLKIPPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNM
        GV L+ A   +++   + ISF  GVL IP  +I+D+ E   RN++ FE        ++  IHY+ FL   I    D+ L +  GI+ N  G + E+VS +
Subjt:  GVILEKAITTNDHYNIMGISFEGGVLKIPPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYILFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNM

Query:  FNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQWMAILKRDYFNTPWTITSFIFAVIFSLITLLQTAFTI
        FN+I K   + G F Y +   NL+ HC+A  N+W A L+RDYF+ PW+  S + A +  L+T +Q   +I
Subjt:  FNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQWMAILKRDYFNTPWTITSFIFAVIFSLITLLQTAFTI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATAAACCCAATGATAGCGTTGAAGTTAGCTACATCGCTGCATTAATCCAGAACAAGCTCCAAAACCTACCTTGTGTTACTGAAGAATGTTGCATCTATCGTGTTTC
CAAACGACTTGTTAACATTTACCCTACTGTCTATGAACCTCAACTCATTTCCATCGGTCCTTTTCATCATGGTCGAGAACATTTGAAGCTAATGGAACAATTTAAACTAC
AGTTTCTCCTTCGGTATCTTTCCCGTCTTTCTAGACGGCCTTTGTCGTTTGAGGTCGTTGTTAAAGCCGCTTTGGAATGGGAGACCAAAGCTCGCAAGTGTTATGAAGAT
TGTGCCATCAGCATGAATAGCCACGACTTTGTCCATATGCTGCTTGTGGATGGTTGTTTCGTGGTCGAGTTTTTGATAGCTAGTGAACAACTTCAAACTCAAACTACATC
AAGAGTTGATCCTTTAGTGTCCAAAGCTATGAACGTAAATCTATATCATGACTTGATCTTGCTCGAGAACCAGCTTCCTTTTTTCGTTCTTCAAGGTCTTCTTTACTTTA
TTGATGAACCGAACAATGACGATAGCTTTACGGTCTTAGTAAATATTGTACATAATTTTTTTCAAGCTAACTTCATGAAACATTATTGTAAGATTCCTCAAAATATCTTT
TCCCCCACTCAAAAAAATATAAGGCATTTGGTAGATTTCTTAGGCTTTTACTATTCCCCGACCACTACAGACATTATTAATCAAGGCAACGACAGATTGTTATTTCTTCC
TCCATCTACGACTGAGCTGTATGAAGCTGGAGTGATCTTGGAGAAAGCAATAACTACAAATGATCATTACAACATTATGGGCATAAGTTTTGAAGGCGGGGTTCTTAAAA
TCCCACCTTTTGAAATTCATGATCTCTTTGAAATCACCATGAGGAATTTGTTGGCATTTGAGAATTTTCAAGGTGGAAGTGCGAGTGAAAGCTCAGCAATTCATTATATT
TTGTTTTTAGGGGCTTTAATAAGTAAAGAGAAAGATTCAAGTTTACTTATGAAGAAGGGAATCCTAAGTAATCTTATTGGAGGTAGTGATGAAGAAGTTTCCAATATGTT
CAATAACATTGGTAAAGGTGTGAGATTTCGAGGACATTTTTGCTATGATAGTACAAGCAGAAATTTACGTAAACATTGCGACGCAAAAAGTAACCAATGGATGGCTATAT
TGAAGCGTGATTATTTCAATACGCCATGGACTATTACTTCTTTCATTTTTGCCGTCATATTCTCCCTCATCACTCTACTCCAAACGGCATTCACCATATACCACGCTATA
GATAAAGATCGTCATGTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCATAAACCCAATGATAGCGTTGAAGTTAGCTACATCGCTGCATTAATCCAGAACAAGCTCCAAAACCTACCTTGTGTTACTGAAGAATGTTGCATCTATCGTGTTTC
CAAACGACTTGTTAACATTTACCCTACTGTCTATGAACCTCAACTCATTTCCATCGGTCCTTTTCATCATGGTCGAGAACATTTGAAGCTAATGGAACAATTTAAACTAC
AGTTTCTCCTTCGGTATCTTTCCCGTCTTTCTAGACGGCCTTTGTCGTTTGAGGTCGTTGTTAAAGCCGCTTTGGAATGGGAGACCAAAGCTCGCAAGTGTTATGAAGAT
TGTGCCATCAGCATGAATAGCCACGACTTTGTCCATATGCTGCTTGTGGATGGTTGTTTCGTGGTCGAGTTTTTGATAGCTAGTGAACAACTTCAAACTCAAACTACATC
AAGAGTTGATCCTTTAGTGTCCAAAGCTATGAACGTAAATCTATATCATGACTTGATCTTGCTCGAGAACCAGCTTCCTTTTTTCGTTCTTCAAGGTCTTCTTTACTTTA
TTGATGAACCGAACAATGACGATAGCTTTACGGTCTTAGTAAATATTGTACATAATTTTTTTCAAGCTAACTTCATGAAACATTATTGTAAGATTCCTCAAAATATCTTT
TCCCCCACTCAAAAAAATATAAGGCATTTGGTAGATTTCTTAGGCTTTTACTATTCCCCGACCACTACAGACATTATTAATCAAGGCAACGACAGATTGTTATTTCTTCC
TCCATCTACGACTGAGCTGTATGAAGCTGGAGTGATCTTGGAGAAAGCAATAACTACAAATGATCATTACAACATTATGGGCATAAGTTTTGAAGGCGGGGTTCTTAAAA
TCCCACCTTTTGAAATTCATGATCTCTTTGAAATCACCATGAGGAATTTGTTGGCATTTGAGAATTTTCAAGGTGGAAGTGCGAGTGAAAGCTCAGCAATTCATTATATT
TTGTTTTTAGGGGCTTTAATAAGTAAAGAGAAAGATTCAAGTTTACTTATGAAGAAGGGAATCCTAAGTAATCTTATTGGAGGTAGTGATGAAGAAGTTTCCAATATGTT
CAATAACATTGGTAAAGGTGTGAGATTTCGAGGACATTTTTGCTATGATAGTACAAGCAGAAATTTACGTAAACATTGCGACGCAAAAAGTAACCAATGGATGGCTATAT
TGAAGCGTGATTATTTCAATACGCCATGGACTATTACTTCTTTCATTTTTGCCGTCATATTCTCCCTCATCACTCTACTCCAAACGGCATTCACCATATACCACGCTATA
GATAAAGATCGTCATGTGTAG
Protein sequenceShow/hide protein sequence
MHKPNDSVEVSYIAALIQNKLQNLPCVTEECCIYRVSKRLVNIYPTVYEPQLISIGPFHHGREHLKLMEQFKLQFLLRYLSRLSRRPLSFEVVVKAALEWETKARKCYED
CAISMNSHDFVHMLLVDGCFVVEFLIASEQLQTQTTSRVDPLVSKAMNVNLYHDLILLENQLPFFVLQGLLYFIDEPNNDDSFTVLVNIVHNFFQANFMKHYCKIPQNIF
SPTQKNIRHLVDFLGFYYSPTTTDIINQGNDRLLFLPPSTTELYEAGVILEKAITTNDHYNIMGISFEGGVLKIPPFEIHDLFEITMRNLLAFENFQGGSASESSAIHYI
LFLGALISKEKDSSLLMKKGILSNLIGGSDEEVSNMFNNIGKGVRFRGHFCYDSTSRNLRKHCDAKSNQWMAILKRDYFNTPWTITSFIFAVIFSLITLLQTAFTIYHAI
DKDRHV