; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0000745 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0000745
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionPlant protein of unknown function (DUF247)
Genome locationchr04:26743859..26745192
RNA-Seq ExpressionIVF0000745
SyntenyIVF0000745
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0053776.1 UPF0481 protein [Cucumis melo var. makuwa]9.32e-16686.52Show/hide
Query:  MHQPNNSSVAAEVSYITSLIQNKLQSLPHITEECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSRLSRQLLSFE------
        MHQPNNSSVAAEVSYITSLIQNKLQSLPHITEECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSRLSRQLLSFE      
Subjt:  MHQPNNSSVAAEVSYITSLIQNKLQSLPHITEECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSRLSRQLLSFE------

Query:  ----TKARKCYEDCVISMNSHDFVHMLLVDGCFMVEFLVAIYGEHLQTQTTSRVDPLVSQAMNINLYHDLIMLENQLPFFVIQGLFCFIYQPNNYDDCFM
            TKARKCYEDCVISMNSHDFVHMLLVDGCFMVEFLVAIYGEHLQTQTTSRVDPLVSQAMNINLYHDLIMLENQLPFFVIQ                 
Subjt:  ----TKARKCYEDCVISMNSHDFVHMLLVDGCFMVEFLVAIYGEHLQTQTTSRVDPLVSQAMNINLYHDLIMLENQLPFFVIQGLFCFIYQPNNYDDCFM

Query:  VLVNIVHNFFQVNFIKHHREIPPNILSAPNKHIGHLLDFLGFYYPPVTKDINQGNNRSLFLPPSTTELYEAGVILEKAVTTS
                   VNFIKHHREIPPNILSAPNKHIGHLLDFLGFYYPPVTKDINQGNNRSLFLPPSTTELYEAGVILEKAVTTS
Subjt:  VLVNIVHNFFQVNFIKHHREIPPNILSAPNKHIGHLLDFLGFYYPPVTKDINQGNNRSLFLPPSTTELYEAGVILEKAVTTS

KAE8651212.1 hypothetical protein Csa_001883 [Cucumis sativus]1.07e-21373.5Show/hide
Query:  MHQPNNSSVAAEVSYITSLIQNKLQSLPHITEECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSRLSRQLLSFETKARKC
        MH+PN+S    EVSYI +LIQNKLQ+LP +TEECCIYRVSKRLVNI+P++YEPQLISIGPFHHGRE LK MEQFKL+FL R                   
Subjt:  MHQPNNSSVAAEVSYITSLIQNKLQSLPHITEECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSRLSRQLLSFETKARKC

Query:  YEDCVISMNSHDFVHMLLVDGCFMVEFLVAIYGEHLQTQTTSRVDPLVSQAMNINLYHDLIMLENQLPFFVIQGLFCFIYQPNNYDDCFMVLVNIVHNFF
               MNSHDFVHMLLVDGCF+VEFL+A   E LQTQTTSRVDPLVS+AMNINLYHDLI+LENQLPFFV+QGL  FI +PNN DD F VLVNIVHNFF
Subjt:  YEDCVISMNSHDFVHMLLVDGCFMVEFLVAIYGEHLQTQTTSRVDPLVSQAMNINLYHDLIMLENQLPFFVIQGLFCFIYQPNNYDDCFMVLVNIVHNFF

Query:  QVNFIKHHREIPPNILSAPNKHIGHLLDFLGFYYPPVTKDI-NQGNNRSLFLPPSTTELYEAGVILEKAVTTS------------GVLKIPPFEIHDLFE
        Q NF+KH+ +IP NI S   K+I HL+DFLGFYY P T DI NQGN+R LFLPPSTTELYEAGVILEKA+TT+            GVLKIPPFEIHDLFE
Subjt:  QVNFIKHHREIPPNILSAPNKHIGHLLDFLGFYYPPVTKDI-NQGNNRSLFLPPSTTELYEAGVILEKAVTTS------------GVLKIPPFEIHDLFE

Query:  ITMRNLLAFENFQGGSGSESSAIHYIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNIGKGVTFRGHFYYDSTSRNLRKHCDARSNRWMAIL
        ITMRNLLAFENFQGGS SESSAIHYI FLGALISKEKDSSLLMKKGILSNLIGGSD EVSNMFNNIGKGV FRGHF YDSTSRNLRKHCDA+SN+WMAIL
Subjt:  ITMRNLLAFENFQGGSGSESSAIHYIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNIGKGVTFRGHFYYDSTSRNLRKHCDARSNRWMAIL

Query:  KRDYLNTPWAIVSLVCVTIVTLITLLKTIVFMYH
        KRDY NTPW I S +   I  LITLL+T   +YH
Subjt:  KRDYLNTPWAIVSLVCVTIVTLITLLKTIVFMYH

XP_004147504.1 UPF0481 protein At3g47200 [Cucumis sativus]1.05e-17078.29Show/hide
Query:  MNSHDFVHMLLVDGCFMVEFLVAIYGEHLQTQTTSRVDPLVSQAMNINLYHDLIMLENQLPFFVIQGLFCFIYQPNNYDDCFMVLVNIVHNFFQVNFIKH
        MNSHDFVHMLLVDGCF+VEFL+A   E LQTQTTSRVDPLVS+AMNINLYHDLI+LENQLPFFV+QGL  FI +PNN DD F VLVNIVHNFFQ NF+KH
Subjt:  MNSHDFVHMLLVDGCFMVEFLVAIYGEHLQTQTTSRVDPLVSQAMNINLYHDLIMLENQLPFFVIQGLFCFIYQPNNYDDCFMVLVNIVHNFFQVNFIKH

Query:  HREIPPNILSAPNKHIGHLLDFLGFYYPPVTKDI-NQGNNRSLFLPPSTTELYEAGVILEKAVTTS------------GVLKIPPFEIHDLFEITMRNLL
        + +IP NI S   K+I HL+DFLGFYY P T DI NQGN+R LFLPPSTTELYEAGVILEKA+TT+            GVLKIPPFEIHDLFEITMRNLL
Subjt:  HREIPPNILSAPNKHIGHLLDFLGFYYPPVTKDI-NQGNNRSLFLPPSTTELYEAGVILEKAVTTS------------GVLKIPPFEIHDLFEITMRNLL

Query:  AFENFQGGSGSESSAIHYIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNIGKGVTFRGHFYYDSTSRNLRKHCDARSNRWMAILKRDYLNT
        AFENFQGGS SESSAIHYI FLGALISKEKDSSLLMKKGILSNLIGGSD EVSNMFNNIGKGV FRGHF YDSTSRNLRKHCDA+SN+WMAILKRDY NT
Subjt:  AFENFQGGSGSESSAIHYIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNIGKGVTFRGHFYYDSTSRNLRKHCDARSNRWMAILKRDYLNT

Query:  PWAIVSLVCVTIVTLITLLKTIVFMYH
        PW I S +   I  LITLL+T   +YH
Subjt:  PWAIVSLVCVTIVTLITLLKTIVFMYH

XP_008443397.1 PREDICTED: LOW QUALITY PROTEIN: UPF0481 protein At3g47200-like [Cucumis melo]7.39e-30795.49Show/hide
Query:  MHQPNNSSVAAEVSYITSLIQNKLQSLPHITEECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSRLSRQLLSFE------
        MHQPNNSSVAAEVSYITSLIQNKLQSLPHITEECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSRLSRQLLSFE      
Subjt:  MHQPNNSSVAAEVSYITSLIQNKLQSLPHITEECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSRLSRQLLSFE------

Query:  ----TKARKCYEDCVISMNSHDFVHMLLVDGCFMVEFLVAIYGEHLQTQTTSRVDPLVSQAMNINLYHDLIMLENQLPFFVIQGLFCFIYQPNNYDDCFM
            TKARKCYEDCVISMNSHDFVHMLLVDGCFMVEFLVAIYGEHLQTQTTSRVDPLVSQAMNINLYHDLIMLENQLPFFVIQGLFCFIYQPNNYDDCFM
Subjt:  ----TKARKCYEDCVISMNSHDFVHMLLVDGCFMVEFLVAIYGEHLQTQTTSRVDPLVSQAMNINLYHDLIMLENQLPFFVIQGLFCFIYQPNNYDDCFM

Query:  VLVNIVHNFFQVNFIKHHREIPPNILSAPNKHIGHLLDFLGFYYPPVTKDINQGNNRSLFLPPSTTELYEAGVILEKAVTTS----------GVLKIPPF
        VLVNIVHNFFQVNFIKHHREIPPNILSAPNKHIGHLLDFLGFYYPPVTKDINQGNNRSLFLPPSTTELYEAGVILEKAVTTS          GVLKIPPF
Subjt:  VLVNIVHNFFQVNFIKHHREIPPNILSAPNKHIGHLLDFLGFYYPPVTKDINQGNNRSLFLPPSTTELYEAGVILEKAVTTS----------GVLKIPPF

Query:  EIHDLFEITMRNLLAFENFQGGSGSESSAIHYIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNIGKGVTFRGHFYYDSTSRNLRKHCDARS
        EIHDLFEITMRNLLAFENFQGGSGSESSAIHYIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNIGKGVTFRGHFYYDSTSRNLRKHCDARS
Subjt:  EIHDLFEITMRNLLAFENFQGGSGSESSAIHYIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNIGKGVTFRGHFYYDSTSRNLRKHCDARS

Query:  NRWMAILKRDYLNTPWAIVSLVCVTIVTLITLLKTIVFMYHFV
        NRWMAILKRDYLNTPWAIVSLVCVTIVTLITLLKTIVFMYHFV
Subjt:  NRWMAILKRDYLNTPWAIVSLVCVTIVTLITLLKTIVFMYHFV

XP_038904513.1 UPF0481 protein At3g47200-like [Benincasa hispida]5.41e-18463.7Show/hide
Query:  MHQPNNSSVAAEVSYITSLIQNKLQSLPHITEECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSRLSRQLLSF-------
        MHQPNN+ + A    + +LIQ +L+ LP +TEECCI+RVSKRL+NIH T YEPQLISIGPFHHGR+DLKPMEQFKL+FL R+++R++RQ LS+       
Subjt:  MHQPNNSSVAAEVSYITSLIQNKLQSLPHITEECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSRLSRQLLSF-------

Query:  ----ETKARKCYEDCVISMNSHDFVHMLLVDGCFMVEFLVAIYGEHLQTQTTS--RVDPLVSQAMNINLYHDLIMLENQLPFFVIQGLFCFIYQPNNYDD
            ET+AR CYED   +MNSHDFV M+LVDGCF+VEFLV++YG   QTQ+TS  RVDPLV +AMNINLYHDLIMLENQLPFFV+Q LF  I +    D+
Subjt:  ----ETKARKCYEDCVISMNSHDFVHMLLVDGCFMVEFLVAIYGEHLQTQTTS--RVDPLVSQAMNINLYHDLIMLENQLPFFVIQGLFCFIYQPNNYDD

Query:  CFMVLVNIVHNFFQVNFIKHHREIPPNILSAPNKHIGHLLDFLGFYYPPVTKDINQ-GNNRSLFLPPSTTELYEAGVILEKAVTTS--------GVLKIP
         F  LV+I+H FF  NF+KH+ E P N    P ++I HL+ FL FYY P   DI +  NN+SL LPPS TEL+EAGVILEK  T +        GVLKIP
Subjt:  CFMVLVNIVHNFFQVNFIKHHREIPPNILSAPNKHIGHLLDFLGFYYPPVTKDINQ-GNNRSLFLPPSTTELYEAGVILEKAVTTS--------GVLKIP

Query:  PFEIHDLFEITMRNLLAFENFQGGSGSESSAIHYIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNIGKGVTFRGHFYYDSTSRNLRKHCDA
        PFEIH LFEI MRNL+AFENFQG +G++S AIHY+ FLGALIS+EKDSSLLMKKGI++NLIGGSD EVSNMFNNIGKGVTF+GHFYY+  S++L KHC  
Subjt:  PFEIHDLFEITMRNLLAFENFQGGSGSESSAIHYIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNIGKGVTFRGHFYYDSTSRNLRKHCDA

Query:  RSNRWMAILKRDYLNTPWAIVSLVCVTIVTLITLLKTI
        R NRWMA L+RDY NTPWA +SL+    VT    L+TI
Subjt:  RSNRWMAILKRDYLNTPWAIVSLVCVTIVTLITLLKTI

TrEMBL top hitse value%identityAlignment
A0A0A0LC32 Uncharacterized protein4.4e-18678.8Show/hide
Query:  MHQPNNSSVAAEVSYITSLIQNKLQSLPHITEECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSRLSRQLLSFETKARKC
        MH+PN+S    EVSYI +LIQNKLQ+LP +TEECCIYRVSKRLVNI+P++YEPQLISIGPFHHGRE LK MEQFKL+FL RYLSRLSR+ LSFETKARKC
Subjt:  MHQPNNSSVAAEVSYITSLIQNKLQSLPHITEECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSRLSRQLLSFETKARKC

Query:  YEDCVISMNSHDFVHMLLVDGCFMVEFLVAIYGEHLQTQTTSRVDPLVSQAMNINLYHDLIMLENQLPFFVIQGLFCFIYQPNNYDDCFMVLVNIVHNFF
        YEDC ISMNSHDFVHMLLVDGCF+VEFL+A   E LQTQTTSRVDPLVS+AMNINLYHDLI+LENQLPFFV+QGL  FI +PNN DD F VLVNIVHNFF
Subjt:  YEDCVISMNSHDFVHMLLVDGCFMVEFLVAIYGEHLQTQTTSRVDPLVSQAMNINLYHDLIMLENQLPFFVIQGLFCFIYQPNNYDDCFMVLVNIVHNFF

Query:  QVNFIKHHREIPPNILSAPNKHIGHLLDFLGFYYPPVTKD-INQGNNRSLFLPPSTTELYEAGVILEKAVTTS------------GVLKIPPFEIHDLFE
        Q NF+KH+ +IP NI S   K+I HL+DFLGFYY P T D INQGN+R LFLPPSTTELYEAGVILEKA+TT+            GVLKIPPFEIHDLFE
Subjt:  QVNFIKHHREIPPNILSAPNKHIGHLLDFLGFYYPPVTKD-INQGNNRSLFLPPSTTELYEAGVILEKAVTTS------------GVLKIPPFEIHDLFE

Query:  ITMRNLLAFENFQGGSGSESSAIHYIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNIGKGVTFRGHFYYDSTSRNLRKHCDARSNRWMAIL
        ITMRNLLAFENFQGGS SESSAIHYI FLGALISKEKDSSLLMKKGILSNLIGGSD EVSNMFNNIGKGV FRGHF YDSTSRNLRKHCDA+SN+WMAIL
Subjt:  ITMRNLLAFENFQGGSGSESSAIHYIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNIGKGVTFRGHFYYDSTSRNLRKHCDARSNRWMAIL

Query:  KRDYLNTPWAIVSLVCVTIVTLITLLKTIVFMYH
        KRDY NTPW I S +   I  LITLL+T   +YH
Subjt:  KRDYLNTPWAIVSLVCVTIVTLITLLKTIVFMYH

A0A1S3B8P8 LOW QUALITY PROTEIN: UPF0481 protein At3g47200-like1.5e-23995.49Show/hide
Query:  MHQPNNSSVAAEVSYITSLIQNKLQSLPHITEECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSRLSRQLLSF-------
        MHQPNNSSVAAEVSYITSLIQNKLQSLPHITEECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSRLSRQLLSF       
Subjt:  MHQPNNSSVAAEVSYITSLIQNKLQSLPHITEECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSRLSRQLLSF-------

Query:  ---ETKARKCYEDCVISMNSHDFVHMLLVDGCFMVEFLVAIYGEHLQTQTTSRVDPLVSQAMNINLYHDLIMLENQLPFFVIQGLFCFIYQPNNYDDCFM
           ETKARKCYEDCVISMNSHDFVHMLLVDGCFMVEFLVAIYGEHLQTQTTSRVDPLVSQAMNINLYHDLIMLENQLPFFVIQGLFCFIYQPNNYDDCFM
Subjt:  ---ETKARKCYEDCVISMNSHDFVHMLLVDGCFMVEFLVAIYGEHLQTQTTSRVDPLVSQAMNINLYHDLIMLENQLPFFVIQGLFCFIYQPNNYDDCFM

Query:  VLVNIVHNFFQVNFIKHHREIPPNILSAPNKHIGHLLDFLGFYYPPVTKDINQGNNRSLFLPPSTTELYEAGVILEKAVTTS----------GVLKIPPF
        VLVNIVHNFFQVNFIKHHREIPPNILSAPNKHIGHLLDFLGFYYPPVTKDINQGNNRSLFLPPSTTELYEAGVILEKAVTTS          GVLKIPPF
Subjt:  VLVNIVHNFFQVNFIKHHREIPPNILSAPNKHIGHLLDFLGFYYPPVTKDINQGNNRSLFLPPSTTELYEAGVILEKAVTTS----------GVLKIPPF

Query:  EIHDLFEITMRNLLAFENFQGGSGSESSAIHYIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNIGKGVTFRGHFYYDSTSRNLRKHCDARS
        EIHDLFEITMRNLLAFENFQGGSGSESSAIHYIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNIGKGVTFRGHFYYDSTSRNLRKHCDARS
Subjt:  EIHDLFEITMRNLLAFENFQGGSGSESSAIHYIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNIGKGVTFRGHFYYDSTSRNLRKHCDARS

Query:  NRWMAILKRDYLNTPWAIVSLVCVTIVTLITLLKTIVFMYHFV
        NRWMAILKRDYLNTPWAIVSLVCVTIVTLITLLKTIVFMYHFV
Subjt:  NRWMAILKRDYLNTPWAIVSLVCVTIVTLITLLKTIVFMYHFV

A0A5D3DPP4 UPF0481 protein2.0e-13086.52Show/hide
Query:  MHQPNNSSVAAEVSYITSLIQNKLQSLPHITEECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSRLSRQLLSF-------
        MHQPNNSSVAAEVSYITSLIQNKLQSLPHITEECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSRLSRQLLSF       
Subjt:  MHQPNNSSVAAEVSYITSLIQNKLQSLPHITEECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSRLSRQLLSF-------

Query:  ---ETKARKCYEDCVISMNSHDFVHMLLVDGCFMVEFLVAIYGEHLQTQTTSRVDPLVSQAMNINLYHDLIMLENQLPFFVIQGLFCFIYQPNNYDDCFM
           ETKARKCYEDCVISMNSHDFVHMLLVDGCFMVEFLVAIYGEHLQTQTTSRVDPLVSQAMNINLYHDLIMLENQLPFFVI                  
Subjt:  ---ETKARKCYEDCVISMNSHDFVHMLLVDGCFMVEFLVAIYGEHLQTQTTSRVDPLVSQAMNINLYHDLIMLENQLPFFVIQGLFCFIYQPNNYDDCFM

Query:  VLVNIVHNFFQVNFIKHHREIPPNILSAPNKHIGHLLDFLGFYYPPVTKDINQGNNRSLFLPPSTTELYEAGVILEKAVTTS
                  QVNFIKHHREIPPNILSAPNKHIGHLLDFLGFYYPPVTKDINQGNNRSLFLPPSTTELYEAGVILEKAVTTS
Subjt:  VLVNIVHNFFQVNFIKHHREIPPNILSAPNKHIGHLLDFLGFYYPPVTKDINQGNNRSLFLPPSTTELYEAGVILEKAVTTS

A0A6J1DXD6 UPF0481 protein At3g47200-like isoform X28.0e-7941.5Show/hide
Query:  HQPNNSSVAAEVSYITSLIQNKLQSLPHITEECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSRLSRQL-------LSFE
        ++P N+     V  + S I+  LQ LP + EEC I+RV +RL+  +   Y PQ+ISIGPFHHGR+DL PMEQ KL+FL RYL R +  +        S+E
Subjt:  HQPNNSSVAAEVSYITSLIQNKLQSLPHITEECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSRLSRQL-------LSFE

Query:  TKARKCYEDCVISMNSHDFVHMLLVDGCFMVEFLVAIYGEHLQTQTTSRVDPLVSQAMNINLYHDLIMLENQLPFFVIQGLFCFIYQPNNYD-DCFMVLV
        T AR CY +  I+M+S +FV M+LVDGCF+VE ++ +    + ++T +R DPL+  AM  +LY DLIMLENQLPFFV+QGLF      + +  +  +  +
Subjt:  TKARKCYEDCVISMNSHDFVHMLLVDGCFMVEFLVAIYGEHLQTQTTSRVDPLVSQAMNINLYHDLIMLENQLPFFVIQGLFCFIYQPNNYD-DCFMVLV

Query:  NIVHNFFQVNFIKHHR--EIPPNILSAPNKHIGHLLDFLGFYYPPVTKDINQGNN------RSLFLPPSTTELYEAGVILEKAVTT---------SGVLK
         + H F+    +   R  E+P  ++ + +K + HL+DFL FYY P    ++  ++      +    PP+ TEL+EAG++ +KA+             VL+
Subjt:  NIVHNFFQVNFIKHHR--EIPPNILSAPNKHIGHLLDFLGFYYPPVTKDINQGNN------RSLFLPPSTTELYEAGVILEKAVTT---------SGVLK

Query:  IPPFEIHDLFEITMRNLLAFENFQGGSGSESSAIHYIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNIGKGVTFRGHF-YYDSTSRNLRKH
        IPP EI D+FE  +RNL+AFE +         AI Y  FL  LIS+E+D SLL+K  I++N IGG++ EVS +FN++ K V  RG    ++  +  L +H
Subjt:  IPPFEIHDLFEITMRNLLAFENFQGGSGSESSAIHYIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNIGKGVTFRGHF-YYDSTSRNLRKH

Query:  CDARSNRWMAILKRDYLNTPWAIVSLVCVTIVTLITLLKTI
        C AR N+ MA L+RDY NTPWA +S V    + L+T L+T+
Subjt:  CDARSNRWMAILKRDYLNTPWAIVSLVCVTIVTLITLLKTI

A0A6J1E120 UPF0481 protein At3g47200-like isoform X18.0e-7941.5Show/hide
Query:  HQPNNSSVAAEVSYITSLIQNKLQSLPHITEECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSRLSRQL-------LSFE
        ++P N+     V  + S I+  LQ LP + EEC I+RV +RL+  +   Y PQ+ISIGPFHHGR+DL PMEQ KL+FL RYL R +  +        S+E
Subjt:  HQPNNSSVAAEVSYITSLIQNKLQSLPHITEECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSRLSRQL-------LSFE

Query:  TKARKCYEDCVISMNSHDFVHMLLVDGCFMVEFLVAIYGEHLQTQTTSRVDPLVSQAMNINLYHDLIMLENQLPFFVIQGLFCFIYQPNNYD-DCFMVLV
        T AR CY +  I+M+S +FV M+LVDGCF+VE ++ +    + ++T +R DPL+  AM  +LY DLIMLENQLPFFV+QGLF      + +  +  +  +
Subjt:  TKARKCYEDCVISMNSHDFVHMLLVDGCFMVEFLVAIYGEHLQTQTTSRVDPLVSQAMNINLYHDLIMLENQLPFFVIQGLFCFIYQPNNYD-DCFMVLV

Query:  NIVHNFFQVNFIKHHR--EIPPNILSAPNKHIGHLLDFLGFYYPPVTKDINQGNN------RSLFLPPSTTELYEAGVILEKAVTT---------SGVLK
         + H F+    +   R  E+P  ++ + +K + HL+DFL FYY P    ++  ++      +    PP+ TEL+EAG++ +KA+             VL+
Subjt:  NIVHNFFQVNFIKHHR--EIPPNILSAPNKHIGHLLDFLGFYYPPVTKDINQGNN------RSLFLPPSTTELYEAGVILEKAVTT---------SGVLK

Query:  IPPFEIHDLFEITMRNLLAFENFQGGSGSESSAIHYIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNIGKGVTFRGHF-YYDSTSRNLRKH
        IPP EI D+FE  +RNL+AFE +         AI Y  FL  LIS+E+D SLL+K  I++N IGG++ EVS +FN++ K V  RG    ++  +  L +H
Subjt:  IPPFEIHDLFEITMRNLLAFENFQGGSGSESSAIHYIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNIGKGVTFRGHF-YYDSTSRNLRKH

Query:  CDARSNRWMAILKRDYLNTPWAIVSLVCVTIVTLITLLKTI
        C AR N+ MA L+RDY NTPWA +S V    + L+T L+T+
Subjt:  CDARSNRWMAILKRDYLNTPWAIVSLVCVTIVTLITLLKTI

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026453.3e-1328.78Show/hide
Query:  IYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFL---------FRYLSRLSRQLLSFETKARKCYEDCVISMNSHDFVHMLLVDGCFMVE
        I+ V K L+  HP  Y P  +SIGP+H  + +L  ME++KL            FR+   L  +L S E K R CY    I  N    + ++ VD  F++E
Subjt:  IYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFL---------FRYLSRLSRQLLSFETKARKCYEDCVISMNSHDFVHMLLVDGCFMVE

Query:  FLVAIYGEHLQTQTTSRVDPLVSQAMNINLYHDLIMLENQLPFFVIQGLFCF-IYQPNNYDDCFMVLVNIVHNFFQVNFIKHHREIPPNILSAPNKHIGH
        F        L+  +  +V+ L+++  +  +  D++M+ENQ+P FV++    F +    + DD  + ++  +        IK   +    IL A  +   H
Subjt:  FLVAIYGEHLQTQTTSRVDPLVSQAMNINLYHDLIMLENQLPFFVIQGLFCF-IYQPNNYDDCFMVLVNIVHNFFQVNFIKHHREIPPNILSAPNKHIGH

Query:  LLDFL
        +LDFL
Subjt:  LLDFL

Q9SD53 UPF0481 protein At3g472002.9e-2524.55Show/hide
Query:  EECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSR----------LSRQLLSFETKARKCYEDCVISMNSHDFVHMLLVDG
        E CCI+RV +  V ++P  Y+P+++SIGP+H+G + L+ ++Q K + L  +L            L + ++  E K RK Y + +     HD + M+++DG
Subjt:  EECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSR----------LSRQLLSFETKARKCYEDCVISMNSHDFVHMLLVDG

Query:  CFMVEFLVAIYGEHLQTQTTSRVDPLVSQAMNI-NLYHDLIMLENQLPFFVIQGLFCFIYQPNNYDDCFMVLVNIVHNFF------QVNFIKHHREIPPN
        CF++   + + G    ++     DP+ S    + ++  DL++LENQ+PFFV+Q L+       + D     L  I  +FF      + ++ + HR     
Subjt:  CFMVEFLVAIYGEHLQTQTTSRVDPLVSQAMNI-NLYHDLIMLENQLPFFVIQGLFCFIYQPNNYDDCFMVLVNIVHNFF------QVNFIKHHREIPPN

Query:  ILSAPNKHIGHLLDFLGFYYPPVTKDINQGNNRSLFLP---------PSTTELYEAGVILEKAVTTSGV---------------------LKIPPFEIHD
             N    HLLD +   + P T + ++ ++  + +          PS        ++  K +   G+                     L+IP      
Subjt:  ILSAPNKHIGHLLDFLGFYYPPVTKDINQGNNRSLFLP---------PSTTELYEAGVILEKAVTTSGV---------------------LKIPPFEIHD

Query:  LFEITMRNLLAFENFQGGSGSESSAIHYIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNIGKGVTFR-GHFYYDSTSRNLRKHCDARSNRW
               N +AFE F   S +E +   YI F+G L++ E+D + L    ++     GS+ EVS  F  I K V F     Y ++  + + ++     N  
Subjt:  LFEITMRNLLAFENFQGGSGSESSAIHYIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNIGKGVTFR-GHFYYDSTSRNLRKHCDARSNRW

Query:  MAILKRDYLNTPWAIVSLVCVTIVTLITLLKTIVFMYHFV
         A  +  +  +PW  +S   V  V L+T+L++ V +  ++
Subjt:  MAILKRDYLNTPWAIVSLVCVTIVTLITLLKTIVFMYHFV

Arabidopsis top hitse value%identityAlignment
AT3G50120.1 Plant protein of unknown function (DUF247)1.1e-3229.63Show/hide
Query:  CIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSRLSRQLLSF-------ETKARKCYEDCVISMNSHDFVHMLLVDGCFMVE-
        CIYRV   L       Y PQ +S+GP+HHG++ L+ M++ K + + R L R ++ +  +       E KAR CYE   +S++S++F+ ML++DGCF++E 
Subjt:  CIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSRLSRQLLSF-------ETKARKCYEDCVISMNSHDFVHMLLVDGCFMVE-

Query:  FLVAIYGEHLQTQTTSRVDPLVSQAMNI-NLYHDLIMLENQLPFFVIQGLFCFIYQPNNYDDCFMVLVNIVHNFF------QVNFIKHHREIPPNILSA-
        F  A+ G        +R DP+ +   ++ ++  D++MLENQLP FV+  L        N      ++  +   FF           K  +    N L+  
Subjt:  FLVAIYGEHLQTQTTSRVDPLVSQAMNI-NLYHDLIMLENQLPFFVIQGLFCFIYQPNNYDDCFMVLVNIVHNFF------QVNFIKHHREIPPNILSA-

Query:  ----PNKHIG--HLLDFLGFYY--------PPVT-----KDINQGNNRSLFLPPSTTELYEAGVILEKAVT--------TSGVLKIPPFEIHDLFEITMR
            P   +G  H LD              P +T     ++    + R   L    TEL EAG+   +  T         +G L+IP   IHD  +    
Subjt:  ----PNKHIG--HLLDFLGFYY--------PPVT-----KDINQGNNRSLFLPPSTTELYEAGVILEKAVT--------TSGVLKIPPFEIHDLFEITMR

Query:  NLLAFENFQGGSGSESSAIHYIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNIGKGVTF-RGHFYYDSTSRNLRKHCDARSNRWMAILKRD
        NL+AFE     S ++ ++  YI F+  LI   +D S L   GI+ + + GSD EV+++FN + + V F     Y    S  + ++ D + N W A LK  
Subjt:  NLLAFENFQGGSGSESSAIHYIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNIGKGVTF-RGHFYYDSTSRNLRKHCDARSNRWMAILKRD

Query:  YLNTPWAIVSLVCVTIVTLITLLKTIVFMYHF
        Y N PWAIVS     I+ ++T  ++   +Y +
Subjt:  YLNTPWAIVSLVCVTIVTLITLLKTIVFMYHF

AT3G50150.1 Plant protein of unknown function (DUF247)5.9e-3430.33Show/hide
Query:  EECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSRLSRQLLSF-------ETKARKCYEDCVISMNSHDFVHMLLVDGCFM
        ++ CIYRV   L       Y PQ +SIGP+HHG+  L+PME+ K + +   ++R    +  +       E +AR CY+  +   NS++F  ML++DGCF+
Subjt:  EECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSRLSRQLLSF-------ETKARKCYEDCVISMNSHDFVHMLLVDGCFM

Query:  VE-FLVAIYGEHLQTQTTSRVDPL-VSQAMNINLYHDLIMLENQLPFFVIQGLFCF-IYQPNNYDDCFMVLVNIVHNFFQVNFI--KHHREIPPNILSAP
        +E F   I G   Q    +R DP+   + +  ++  D+IMLENQLP FV+  L       PN       V V         + +  K  R +     S  
Subjt:  VE-FLVAIYGEHLQTQTTSRVDPL-VSQAMNINLYHDLIMLENQLPFFVIQGLFCF-IYQPNNYDDCFMVLVNIVHNFFQVNFI--KHHREIPPNILSAP

Query:  NKHIG--HLLDFLGFYYPPVTKDINQGN--------NRSLFLPPSTTELYEAGVILEKAVT--------TSGVLKIPPFEIHDLFEITMRNLLAFENFQG
            G  H LD         ++  NQG          +   L    TEL  AGV   +  T         +G LKIP   IHD  +    NL+AFE  Q 
Subjt:  NKHIG--HLLDFLGFYYPPVTKDINQGN--------NRSLFLPPSTTELYEAGVILEKAVT--------TSGVLKIPPFEIHDLFEITMRNLLAFENFQG

Query:  GSGSESSAIHYIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNIGKGVTF-RGHFYYDSTSRNLRKHCDARSNRWMAILKRDYLNTPWAIVS
         + S ++   YI F+  LI+  +D S L   GI+ + + GSD EV+++FN + K V F     Y    SR + ++   + N   A L++ Y N PWA  S
Subjt:  GSGSESSAIHYIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNIGKGVTF-RGHFYYDSTSRNLRKHCDARSNRWMAILKRDYLNTPWAIVS

Query:  LVCVTIVTLITLLKTIVFMYHF
             I+  +T  ++   +Y +
Subjt:  LVCVTIVTLITLLKTIVFMYHF

AT3G50160.1 Plant protein of unknown function (DUF247)1.6e-3431.31Show/hide
Query:  EECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSRLSRQLLSF-------ETKARKCYEDCVISMNSHDFVHMLLVDGCFM
        +  CIYRV   L       Y PQ++SIGP+HHG + L PME+ K + +   ++R    +  +       E KAR CY+   I+MN ++F+ ML++DG F+
Subjt:  EECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSRLSRQLLSF-------ETKARKCYEDCVISMNSHDFVHMLLVDGCFM

Query:  VEFLVAIYGEHLQTQTTSRVDPLVS-QAMNINLYHDLIMLENQLPFFVIQGLFCFIYQPNNYDDCFMVLVNIVHNFFQVNFIKHHREIPPNILSAPNKHI
        +E       E  Q    +  DP+   + +  ++  D++MLENQLP+ V++GL     Q    D    V V +   FFQ         + P       +  
Subjt:  VEFLVAIYGEHLQTQTTSRVDPLVS-QAMNINLYHDLIMLENQLPFFVIQGLFCFIYQPNNYDDCFMVLVNIVHNFFQVNFIKHHREIPPNILSAPNKHI

Query:  GHLLDFL--GFYYPPVT--KDINQGNNRSLFLPPSTTELYEAGVILEKAVT--------TSGVLKIPPFEIHDLFEITMRNLLAFENFQGGSGSESSAIH
         H LD L  G      T  +D++  N +   L    TEL  AGV   +  T         +G LKIP   IHD  +    NL+AFE  Q    S      
Subjt:  GHLLDFL--GFYYPPVT--KDINQGNNRSLFLPPSTTELYEAGVILEKAVT--------TSGVLKIPPFEIHDLFEITMRNLLAFENFQGGSGSESSAIH

Query:  YIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNIGKGVTF-RGHFYYDSTSRNLRKHCDARSNRWMAILKRDYLNTPWAIVSLVCVTIVTLI
        YI F+  LI+  +D S L   GI+ N + GSD EVS++FN +GK V F     Y  + +  +  +   + N   A L+  Y N PWA  S +    + + 
Subjt:  YIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNIGKGVTF-RGHFYYDSTSRNLRKHCDARSNRWMAILKRDYLNTPWAIVSLVCVTIVTLI

Query:  TLLKTI--VFMY
        T  ++   VF Y
Subjt:  TLLKTI--VFMY

AT4G31980.1 unknown protein1.8e-5433.09Show/hide
Query:  IQNKLQSLPHITEECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSR-------LSRQLLSFETKARKCYEDCVISMNSHD
        I+ KL  L  ++ +CCIY+V  +L  ++P  Y P+L+S GP H G+E+L+ ME  K ++L  ++ R       L R   ++E  AR CY + V  ++S +
Subjt:  IQNKLQSLPHITEECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSR-------LSRQLLSFETKARKCYEDCVISMNSHD

Query:  FVHMLLVDGCFMVEFLVAIYGEHLQTQTTSRVDPLVSQAMNI-NLYHDLIMLENQLPFFVIQGLFCFIYQPNNYDDCFMVLVNIVHNFFQVNFIKHHREI
        FV ML+VDG F+VE L+  +   L+ +     D +   +M I ++  D+I++ENQLPFFV++ +F  +   N Y      ++ +    F     +   E 
Subjt:  FVHMLLVDGCFMVEFLVAIYGEHLQTQTTSRVDPLVSQAMNI-NLYHDLIMLENQLPFFVIQGLFCFIYQPNNYDDCFMVLVNIVHNFFQVNFIKHHREI

Query:  PPNILSAPNKHIGHLLDFLGFYYPPVTKDINQGNNRSLFLPPSTTELYEAGVILEKAVTTS---------GVLKIPPFEIHDLFEITMRNLLAFENFQGG
            ++ P     H +D L   Y P      +     +   P  TEL+ AGV  + A T+S         GVLKIP   + DL E   +N++ FE  +  
Subjt:  PPNILSAPNKHIGHLLDFLGFYYPPVTKDINQGNNRSLFLPPSTTELYEAGVILEKAVTTS---------GVLKIPPFEIHDLFEITMRNLLAFENFQGG

Query:  SGSESSAIHYIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNIGKGVTFRGHFYYDSTSRNLRKHCDARSNRWMAILKRDYLNTPWAIVSLV
          S  + + YI  LG  I    D+ LL+  GI+ N +G S V+VSN+FN+I K V +   FY+   S NL+ +C+   NRW AIL+RDY + PWA+ S+ 
Subjt:  SGSESSAIHYIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNIGKGVTFRGHFYYDSTSRNLRKHCDARSNRWMAILKRDYLNTPWAIVSLV

Query:  CVTIVTLITLLKTI
           ++ L+T ++++
Subjt:  CVTIVTLITLLKTI

AT5G11290.1 Plant protein of unknown function (DUF247)1.2e-3931.96Show/hide
Query:  MEQFKLKFLFRYLSR-------LSRQLLSFETKARKCYEDCVISMNSHDFVHMLLVDGCFMVEFLVAIYGEHLQTQTTSRVDPLV-SQAMNINLYHDLIM
        ME  KL++L  ++ R       L R   ++E +AR CY + V  ++S ++V ML+VD  F+VE L+    +  +      +D +   Q M +++ HD+++
Subjt:  MEQFKLKFLFRYLSR-------LSRQLLSFETKARKCYEDCVISMNSHDFVHMLLVDGCFMVEFLVAIYGEHLQTQTTSRVDPLV-SQAMNINLYHDLIM

Query:  LENQLPFFVIQGLFCFIYQPNNYDDCFMVLVNIVHNFFQVNFIKHHREIPPNILSAPNKHIGHLLDFLGFYYPPVTKDINQGNNRSLFLPPSTTELYEAG
        LENQLP+FV++G+F  ++   +Y      L  I+HN F+    K    IP    S  +  I H +D L   + P+      G+ R +    S  E+  AG
Subjt:  LENQLPFFVIQGLFCFIYQPNNYDDCFMVLVNIVHNFFQVNFIKHHREIPPNILSAPNKHIGHLLDFLGFYYPPVTKDINQGNNRSLFLPPSTTELYEAG

Query:  VILEKAVT---------TSGVLKIPPFEIHDLFEITMRNLLAFENFQGGSGSESSAIHYIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNI
        V L+ A            +GVL IP  +I+D+ E   RN++ FE        ++  IHY+ FL   I    D+ L +  GI+ N  G ++ +VS +FN+I
Subjt:  VILEKAVT---------TSGVLKIPPFEIHDLFEITMRNLLAFENFQGGSGSESSAIHYIWFLGALISKEKDSSLLMKKGILSNLIGGSDVEVSNMFNNI

Query:  GKGVTFRGHFYYDSTSRNLRKHCDARSNRWMAILKRDYLNTPWAIVSLVCVTIVTLITLLKTI
         K  ++ G FYY +   NL+ HC+A  N+W A L+RDY + PW+  S+V   ++ L+T ++ I
Subjt:  GKGVTFRGHFYYDSTSRNLRKHCDARSNRWMAILKRDYLNTPWAIVSLVCVTIVTLITLLKTI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATCAACCCAATAATAGCAGTGTAGCTGCTGAAGTTAGCTACATCACTTCACTAATCCAAAACAAGCTCCAAAGCCTACCTCATATTACTGAAGAATGTTGCATCTA
TCGTGTTTCCAAACGACTTGTTAACATTCATCCTACCATGTATGAGCCCCAACTTATTTCCATCGGTCCTTTCCATCATGGTCGAGAAGATTTGAAGCCAATGGAACAAT
TCAAATTAAAGTTTCTCTTTCGGTATCTTTCACGTCTTTCTAGACAACTTTTATCGTTTGAGACCAAAGCTCGCAAGTGCTACGAAGATTGTGTCATCAGCATGAATAGC
CATGACTTTGTCCATATGCTGCTTGTGGATGGTTGTTTCATGGTCGAGTTTTTGGTAGCTATTTATGGTGAACACCTTCAAACTCAAACTACATCAAGAGTCGATCCTTT
AGTGTCCCAAGCTATGAACATAAATCTATATCATGACTTGATCATGCTCGAGAACCAGCTCCCTTTCTTCGTTATTCAAGGTCTTTTTTGCTTTATTTATCAACCAAACA
ATTATGACGATTGCTTCATGGTCTTAGTAAATATTGTACATAATTTTTTTCAAGTTAATTTCATAAAACATCATCGTGAGATTCCTCCAAATATCTTGTCCGCCCCTAAC
AAACATATAGGGCATCTGTTAGATTTCTTAGGCTTTTACTATCCCCCGGTCACTAAAGACATTAATCAAGGCAATAATAGATCGTTATTTCTTCCTCCATCTACGACTGA
GCTGTATGAAGCTGGAGTCATCTTGGAGAAAGCAGTAACAACGAGTGGGGTTCTTAAAATCCCACCTTTTGAAATTCATGATCTCTTTGAAATCACTATGAGGAATTTGT
TGGCGTTTGAGAATTTTCAAGGTGGAAGTGGGAGTGAAAGCTCTGCAATTCATTATATTTGGTTTTTAGGGGCTTTAATAAGTAAAGAGAAAGATTCAAGTTTACTTATG
AAGAAGGGAATCTTAAGTAATCTTATTGGAGGAAGTGATGTAGAAGTTTCCAATATGTTCAATAATATTGGTAAAGGTGTGACATTTCGAGGACATTTTTACTATGATAG
TACAAGCAGAAATTTACGTAAACACTGCGACGCAAGAAGTAACCGATGGATGGCTATATTGAAGCGTGATTATTTAAATACGCCATGGGCTATTGTTTCTTTAGTTTGTG
TCACCATTGTCACTCTGATCACTTTGTTGAAAACCATAGTGTTCATGTATCACTTTGTATAA
mRNA sequenceShow/hide mRNA sequence
ATGCATCAACCCAATAATAGCAGTGTAGCTGCTGAAGTTAGCTACATCACTTCACTAATCCAAAACAAGCTCCAAAGCCTACCTCATATTACTGAAGAATGTTGCATCTA
TCGTGTTTCCAAACGACTTGTTAACATTCATCCTACCATGTATGAGCCCCAACTTATTTCCATCGGTCCTTTCCATCATGGTCGAGAAGATTTGAAGCCAATGGAACAAT
TCAAATTAAAGTTTCTCTTTCGGTATCTTTCACGTCTTTCTAGACAACTTTTATCGTTTGAGACCAAAGCTCGCAAGTGCTACGAAGATTGTGTCATCAGCATGAATAGC
CATGACTTTGTCCATATGCTGCTTGTGGATGGTTGTTTCATGGTCGAGTTTTTGGTAGCTATTTATGGTGAACACCTTCAAACTCAAACTACATCAAGAGTCGATCCTTT
AGTGTCCCAAGCTATGAACATAAATCTATATCATGACTTGATCATGCTCGAGAACCAGCTCCCTTTCTTCGTTATTCAAGGTCTTTTTTGCTTTATTTATCAACCAAACA
ATTATGACGATTGCTTCATGGTCTTAGTAAATATTGTACATAATTTTTTTCAAGTTAATTTCATAAAACATCATCGTGAGATTCCTCCAAATATCTTGTCCGCCCCTAAC
AAACATATAGGGCATCTGTTAGATTTCTTAGGCTTTTACTATCCCCCGGTCACTAAAGACATTAATCAAGGCAATAATAGATCGTTATTTCTTCCTCCATCTACGACTGA
GCTGTATGAAGCTGGAGTCATCTTGGAGAAAGCAGTAACAACGAGTGGGGTTCTTAAAATCCCACCTTTTGAAATTCATGATCTCTTTGAAATCACTATGAGGAATTTGT
TGGCGTTTGAGAATTTTCAAGGTGGAAGTGGGAGTGAAAGCTCTGCAATTCATTATATTTGGTTTTTAGGGGCTTTAATAAGTAAAGAGAAAGATTCAAGTTTACTTATG
AAGAAGGGAATCTTAAGTAATCTTATTGGAGGAAGTGATGTAGAAGTTTCCAATATGTTCAATAATATTGGTAAAGGTGTGACATTTCGAGGACATTTTTACTATGATAG
TACAAGCAGAAATTTACGTAAACACTGCGACGCAAGAAGTAACCGATGGATGGCTATATTGAAGCGTGATTATTTAAATACGCCATGGGCTATTGTTTCTTTAGTTTGTG
TCACCATTGTCACTCTGATCACTTTGTTGAAAACCATAGTGTTCATGTATCACTTTGTATAA
Protein sequenceShow/hide protein sequence
MHQPNNSSVAAEVSYITSLIQNKLQSLPHITEECCIYRVSKRLVNIHPTMYEPQLISIGPFHHGREDLKPMEQFKLKFLFRYLSRLSRQLLSFETKARKCYEDCVISMNS
HDFVHMLLVDGCFMVEFLVAIYGEHLQTQTTSRVDPLVSQAMNINLYHDLIMLENQLPFFVIQGLFCFIYQPNNYDDCFMVLVNIVHNFFQVNFIKHHREIPPNILSAPN
KHIGHLLDFLGFYYPPVTKDINQGNNRSLFLPPSTTELYEAGVILEKAVTTSGVLKIPPFEIHDLFEITMRNLLAFENFQGGSGSESSAIHYIWFLGALISKEKDSSLLM
KKGILSNLIGGSDVEVSNMFNNIGKGVTFRGHFYYDSTSRNLRKHCDARSNRWMAILKRDYLNTPWAIVSLVCVTIVTLITLLKTIVFMYHFV