; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0010531 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0010531
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionUPF0481 protein
Genome locationchr03:26038466..26039906
RNA-Seq ExpressionPay0010531
SyntenyPay0010531
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0064937.1 UPF0481 protein [Cucumis melo var. makuwa]6.5e-24088.96Show/hide
Query:  MIQANHDITYKVEEPITGDADQELCVNVVMFIKNILEQVPQVNPKICIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRIS
        MIQANHDITYKVEEPITGDADQELCVNVVMFIKNILEQVPQVNPKICIYRISKEVRELNDRAYAPQFISIGPFHYHTR DLIANEYYKYQGFNNFLRRIS
Subjt:  MIQANHDITYKVEEPITGDADQELCVNVVMFIKNILEQVPQVNPKICIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRIS

Query:  INNEQIESMEGSQVKINTVKFLVEKCHGLMKEAWNCYADQINMMEEEFVGMMLVDACFIVEFLILLRGRVSNIQRIISKIDSKFYQGALYEILGDLIKLE
        INNEQIESMEGSQVKINTVKFLVEKCHGLMKEAWNCYADQINMMEEEFVGMMLVDACFIVEFLILLRGRVSNIQRIISKIDSKFYQGALYEILGDLIKLE
Subjt:  INNEQIESMEGSQVKINTVKFLVEKCHGLMKEAWNCYADQINMMEEEFVGMMLVDACFIVEFLILLRGRVSNIQRIISKIDSKFYQGALYEILGDLIKLE

Query:  NQVPFFLLQSLFDLITKDDLPLVGTHERASLMDLTCFAFKIFVVAGYHLNYMYDKTTPKHLVDFSSIFFMSAPTYVEDNHKYVHTKDRWCISPSVTTLWE
        NQVPFFLLQSLFDLITKDDLPLVGTHERASLMDLTCFAFKIFVVAGYH NYMYDKTTPKHLVDFSSIFFMS PTYVEDNHKYVHTKDRWCISPSVTTLWE
Subjt:  NQVPFFLLQSLFDLITKDDLPLVGTHERASLMDLTCFAFKIFVVAGYHLNYMYDKTTPKHLVDFSSIFFMSAPTYVEDNHKYVHTKDRWCISPSVTTLWE

Query:  AGVTIKPSYVESCLTSISFENGVLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNMYAVEYVSFLHDLMKEEQDVHLLVKT--------------------
        AGVTI+PSYVESCLTSISFENGVLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNMYAVEYVSFLHDLMKEEQDVHLLVK                     
Subjt:  AGVTIKPSYVESCLTSISFENGVLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNMYAVEYVSFLHDLMKEEQDVHLLVKT--------------------

Query:  ---------------------------ENRAKRYWNKAKVTLRHDYFTTPWTTISVIAATFLILLTLLQTIFSAISAFPG
                                    NRAKRYWNKAKVTLRHDYFTTPWTTISVIAATFLILLTLLQTIFSAISAFPG
Subjt:  ---------------------------ENRAKRYWNKAKVTLRHDYFTTPWTTISVIAATFLILLTLLQTIFSAISAFPG

KAA0064941.1 UPF0481 protein [Cucumis melo var. makuwa]2.6e-10347.72Show/hide
Query:  MIQANHDITYKVEEPITGDADQELCVNVVMFIKNILEQVPQVNPKI-CIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRI
        MI+ N +I+++V E I+ D DQ++   VV+ I  +L+ +  VN K   IY++ KE+RE+ND+AY PQFISIGPFHY TR DLIANE+YK QGF NFLR I
Subjt:  MIQANHDITYKVEEPITGDADQELCVNVVMFIKNILEQVPQVNPKI-CIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRI

Query:  SINNEQIESMEGSQVKINTVKFLVEKCHGLMKEAWNCYADQINMMEEEFVGMMLVDACFIVEFLILLRGRVSNIQRIISKIDSKFYQGALYEILGDLIKL
        +IN +Q+ES+E  QVKI++ K LVEK H  MKEAWNCYA+ I M EEEF+ MMLVDACFIVEF +L  G      R+     S FY G  YEIL DLIKL
Subjt:  SINNEQIESMEGSQVKINTVKFLVEKCHGLMKEAWNCYADQINMMEEEFVGMMLVDACFIVEFLILLRGRVSNIQRIISKIDSKFYQGALYEILGDLIKL

Query:  ENQVPFFLLQSLFDLITK--DDLPLVGTHERA--SLMDLTCFAFKIF-VVAGYHLNYMYDKTTPKHLVDFSSIFFMSAPTYVED----------------
        ENQVPFFLLQ+LFDLI K  DD   +  +E    SL+DLT  A K F  V  Y +N +Y K  P+H++D  S +F+  P    D                
Subjt:  ENQVPFFLLQSLFDLITK--DDLPLVGTHERA--SLMDLTCFAFKIF-VVAGYHLNYMYDKTTPKHLVDFSSIFFMSAPTYVED----------------

Query:  ----------NHKYVHTKDRWCISPSVTTLWEAGVTIKPSYVES-CLTSISFENGVLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNMYAVEYVSFLHDL
                  +HK + +K R    P++T L+EAGVTIK +  E+ C  +ISF+NGVL IP I +  TFE++IRN++AF+ +PAGN+  YA++YV+FL DL
Subjt:  ----------NHKYVHTKDRWCISPSVTTLWEAGVTIKPSYVES-CLTSISFENGVLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNMYAVEYVSFLHDL

Query:  MKEEQDVHLLVKT-----------------------------------ENRAKRY-----WNKAKVTLRHDYFTTPWTTISVIAATFLILLTLLQTIFSA
        +  E+D+ LLVK                                     ++A R      WNKAK +L+H+YF TPW  IS  AA FLI+LT+LQTIFSA
Subjt:  MKEEQDVHLLVKT-----------------------------------ENRAKRY-----WNKAKVTLRHDYFTTPWTTISVIAATFLILLTLLQTIFSA

Query:  ISAFP
        ISAFP
Subjt:  ISAFP

XP_008445182.1 PREDICTED: UPF0481 protein At3g47200-like [Cucumis melo]4.7e-9746.32Show/hide
Query:  QELCVNVVMFIKNILEQVPQVNPKI-CIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRISINNEQIESMEGSQVKINTVK
        +++C NVV+ I  IL  +P++NPK   IY++SKE+RE+ND+AYAPQFISIGPFH+ TR+DLIANE+YK QGFNNFL RI+ N EQIES           K
Subjt:  QELCVNVVMFIKNILEQVPQVNPKI-CIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRISINNEQIESMEGSQVKINTVK

Query:  FLVEKCHGLMKEAWNCYADQINMMEEEFVGMMLVDACFIVEFLILLRG---------RVSNIQRIISKIDSKFYQGALYEILGDLIKLENQVPFFLLQSL
          V+KCHG +KEAWNCYA+ INM EEEFV MMLVDACFI+EF ILL               I +I   +D  FY+G  +EIL DLIKLENQVPFFLLQ+L
Subjt:  FLVEKCHGLMKEAWNCYADQINMMEEEFVGMMLVDACFIVEFLILLRG---------RVSNIQRIISKIDSKFYQGALYEILGDLIKLENQVPFFLLQSL

Query:  FDLITKDDLPLVGTHERASLMDLTCFAFKIFVVAG-YHLNYMYDKTTPKHLVDFSSIFFMSAPTYVEDNHKYVHTKDR---------------------W
        FDL+ K D+P+       SL+D+T      F   G Y +N +Y K  PKHL+DF S +F   P    D+H      +R                     W
Subjt:  FDLITKDDLPLVGTHERASLMDLTCFAFKIFVVAG-YHLNYMYDKTTPKHLVDFSSIFFMSAPTYVEDNHKYVHTKDR---------------------W

Query:  --------------CIS-------------------PSVTTLWEAGVTIKPSYVES-CLTSISFENGVLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNM
                      C S                   PS+T L EAGVTIK +  E  C  +I F+NGVL IP I++  TFE++IRN+IAFD YPAGN+ M
Subjt:  --------------CIS-------------------PSVTTLWEAGVTIKPSYVES-CLTSISFENGVLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNM

Query:  YAVEYVSFLHDLMKEEQDVHLLVK---------------TE--------------------NRAKRY-----WNKAKVTLRHDYFTTPWTTISVIAATFL
        YA+ YV FL DL+  EQD HLL K               TE                    N+A R      WN AK +L+H+YF TPW  IS  AATFL
Subjt:  YAVEYVSFLHDLMKEEQDVHLLVK---------------TE--------------------NRAKRY-----WNKAKVTLRHDYFTTPWTTISVIAATFL

Query:  ILLTLLQTIFSAISAF
        I+LT+LQTIFSAIS F
Subjt:  ILLTLLQTIFSAISAF

XP_031737062.1 UPF0481 protein At3g47200 [Cucumis sativus]1.8e-10449.89Show/hide
Query:  EPITGDADQELCVNVVMFIKNILEQVPQVNPKI-CIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRISINNEQIESMEGS
        E ++ + DQ +  NVV+ I  +LE +P+VNPK   IY++ KE+RE+ND+AY PQFISIGPFHY TR DLIANE+YK QGF NFL RISI N  I+ +E +
Subjt:  EPITGDADQELCVNVVMFIKNILEQVPQVNPKI-CIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRISINNEQIESMEGS

Query:  QVKINTVKFLVEKCHGLMKEAWNCYADQINMMEEEFVGMMLVDACFIVEFLILLRGRVSNIQRIISKIDSKFYQGALYEILGDLIKLENQVPFFLLQSLF
        QV I++ K LVEK H  +KEAWNCYA+ I M +EEF+ MMLVDACFIVEF +L  G      ++ +   S FY G  YEIL DLIKLENQVPFFLLQ+LF
Subjt:  QVKINTVKFLVEKCHGLMKEAWNCYADQINMMEEEFVGMMLVDACFIVEFLILLRGRVSNIQRIISKIDSKFYQGALYEILGDLIKLENQVPFFLLQSLF

Query:  DLITKDDL---PLVGTHERA--SLMDLTCFAFKIF-VVAGYHLNYMYDKTTPKHLVDFSSIFFMSAPTYVEDNHKYVHTK--DRWCISPSVTT-LWEAGV
        DL+ KD +    ++G ++ +  SL+DLT    K F  V  Y +N +Y K  PKHL+DF S +F+  P     N K+ H K   +W +SP  TT L EAGV
Subjt:  DLITKDDL---PLVGTHERA--SLMDLTCFAFKIF-VVAGYHLNYMYDKTTPKHLVDFSSIFFMSAPTYVEDNHKYVHTK--DRWCISPSVTT-LWEAGV

Query:  TIKPSYVES--CLTSISFENGVLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNMYAVEYVSFLHDLMKEEQDVHLLVKT---------------------
        TIK +  ++  C  +ISF+NGVL IP I +  TFE++IRN++AF+ +PAGN+  YA++YV+FL DL+  E+D+ LLVK                      
Subjt:  TIKPSYVES--CLTSISFENGVLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNMYAVEYVSFLHDLMKEEQDVHLLVKT---------------------

Query:  --------------ENRAKRY-----WNKAKVTLRHDYFTTPWTTISVIAATFLILLTLLQTIFSAISAFP
                       ++A R      WNKAK +L+H YF TPW  IS  AATFLI+LT+LQTIFSAISAFP
Subjt:  --------------ENRAKRY-----WNKAKVTLRHDYFTTPWTTISVIAATFLILLTLLQTIFSAISAFP

XP_038884451.1 UPF0481 protein At3g47200-like [Benincasa hispida]9.2e-10149.02Show/hide
Query:  MIQANHDITYKVEEPITGDADQELCVNVVMFIKNILEQVPQVNPKI-CIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRI
        M +AN D +Y V E I  + DQ+LC NVV+ I+  LE++P VN     IY++ K +RE+ND+AY PQFISIGPFHY TR +LIANE+YK QGF NFLRR+
Subjt:  MIQANHDITYKVEEPITGDADQELCVNVVMFIKNILEQVPQVNPKI-CIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRI

Query:  SINNEQIESMEGS-QVKINT--VKFLVEKCHGLMKEAWNCYADQINMMEEEFVGMMLVDACFIVEFLIL-----LRGRVSNIQRIISKIDSKFYQGALYE
        + N  +I+S+E S  VKI T  +K LVEK    +KEA NCYA+ INM EEEF  MMLVDACFIVEFLIL     L  R     +I   +D  FY G  ++
Subjt:  SINNEQIESMEGS-QVKINT--VKFLVEKCHGLMKEAWNCYADQINMMEEEFVGMMLVDACFIVEFLIL-----LRGRVSNIQRIISKIDSKFYQGALYE

Query:  ILGDLIKLENQVPFFLLQSLFDLITK-DDLPLVGTHERASLMDLTCFAFKIF-VVAGYHLNYMYDKTTPKHLVDFSSIFFMSAP----TYVEDNHKYVHT
        IL DLIKLENQVPF LLQ+LFDLI K DD P++     +SL+ LT  A K F +V+ Y ++ +Y K  PKHL+DF S +F+  P       +D  K +  
Subjt:  ILGDLIKLENQVPFFLLQSLFDLITK-DDLPLVGTHERASLMDLTCFAFKIF-VVAGYHLNYMYDKTTPKHLVDFSSIFFMSAP----TYVEDNHKYVHT

Query:  KD-----------------------RWCIS-PSVTTLWEAGVTIKPSYVES-CLTSISFENGVLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNMYAVEY
        K+                       +W +S PS T L EAG+TIK +  E+ CLT+ISF+NGVL IP I +  TFEI++RNLIAFDHYPAGN+  Y ++Y
Subjt:  KD-----------------------RWCIS-PSVTTLWEAGVTIKPSYVES-CLTSISFENGVLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNMYAVEY

Query:  VSFLHDLMKEEQDVHLLVKTE-----------------NRAKRY-----------------------WNKAKVTLRHDYFTTPWTTISVIAATFLILLTL
        V FL DL+  E+DVHLLVK                   N   ++                       WNKAK +L+ +YF TPW  IS IAATFLILLTL
Subjt:  VSFLHDLMKEEQDVHLLVKTE-----------------NRAKRY-----------------------WNKAKVTLRHDYFTTPWTTISVIAATFLILLTL

Query:  LQTIFSAISAFP
        LQTIFSAISAFP
Subjt:  LQTIFSAISAFP

TrEMBL top hitse value%identityAlignment
A0A0A0LQ57 Uncharacterized protein8.7e-10549.89Show/hide
Query:  EPITGDADQELCVNVVMFIKNILEQVPQVNPKI-CIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRISINNEQIESMEGS
        E ++ + DQ +  NVV+ I  +LE +P+VNPK   IY++ KE+RE+ND+AY PQFISIGPFHY TR DLIANE+YK QGF NFL RISI N  I+ +E +
Subjt:  EPITGDADQELCVNVVMFIKNILEQVPQVNPKI-CIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRISINNEQIESMEGS

Query:  QVKINTVKFLVEKCHGLMKEAWNCYADQINMMEEEFVGMMLVDACFIVEFLILLRGRVSNIQRIISKIDSKFYQGALYEILGDLIKLENQVPFFLLQSLF
        QV I++ K LVEK H  +KEAWNCYA+ I M +EEF+ MMLVDACFIVEF +L  G      ++ +   S FY G  YEIL DLIKLENQVPFFLLQ+LF
Subjt:  QVKINTVKFLVEKCHGLMKEAWNCYADQINMMEEEFVGMMLVDACFIVEFLILLRGRVSNIQRIISKIDSKFYQGALYEILGDLIKLENQVPFFLLQSLF

Query:  DLITKDDL---PLVGTHERA--SLMDLTCFAFKIF-VVAGYHLNYMYDKTTPKHLVDFSSIFFMSAPTYVEDNHKYVHTK--DRWCISPSVTT-LWEAGV
        DL+ KD +    ++G ++ +  SL+DLT    K F  V  Y +N +Y K  PKHL+DF S +F+  P     N K+ H K   +W +SP  TT L EAGV
Subjt:  DLITKDDL---PLVGTHERA--SLMDLTCFAFKIF-VVAGYHLNYMYDKTTPKHLVDFSSIFFMSAPTYVEDNHKYVHTK--DRWCISPSVTT-LWEAGV

Query:  TIKPSYVES--CLTSISFENGVLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNMYAVEYVSFLHDLMKEEQDVHLLVKT---------------------
        TIK +  ++  C  +ISF+NGVL IP I +  TFE++IRN++AF+ +PAGN+  YA++YV+FL DL+  E+D+ LLVK                      
Subjt:  TIKPSYVES--CLTSISFENGVLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNMYAVEYVSFLHDLMKEEQDVHLLVKT---------------------

Query:  --------------ENRAKRY-----WNKAKVTLRHDYFTTPWTTISVIAATFLILLTLLQTIFSAISAFP
                       ++A R      WNKAK +L+H YF TPW  IS  AATFLI+LT+LQTIFSAISAFP
Subjt:  --------------ENRAKRY-----WNKAKVTLRHDYFTTPWTTISVIAATFLILLTLLQTIFSAISAFP

A0A1S4DW65 UPF0481 protein At3g47200-like2.3e-9746.32Show/hide
Query:  QELCVNVVMFIKNILEQVPQVNPKI-CIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRISINNEQIESMEGSQVKINTVK
        +++C NVV+ I  IL  +P++NPK   IY++SKE+RE+ND+AYAPQFISIGPFH+ TR+DLIANE+YK QGFNNFL RI+ N EQIES           K
Subjt:  QELCVNVVMFIKNILEQVPQVNPKI-CIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRISINNEQIESMEGSQVKINTVK

Query:  FLVEKCHGLMKEAWNCYADQINMMEEEFVGMMLVDACFIVEFLILLRG---------RVSNIQRIISKIDSKFYQGALYEILGDLIKLENQVPFFLLQSL
          V+KCHG +KEAWNCYA+ INM EEEFV MMLVDACFI+EF ILL               I +I   +D  FY+G  +EIL DLIKLENQVPFFLLQ+L
Subjt:  FLVEKCHGLMKEAWNCYADQINMMEEEFVGMMLVDACFIVEFLILLRG---------RVSNIQRIISKIDSKFYQGALYEILGDLIKLENQVPFFLLQSL

Query:  FDLITKDDLPLVGTHERASLMDLTCFAFKIFVVAG-YHLNYMYDKTTPKHLVDFSSIFFMSAPTYVEDNHKYVHTKDR---------------------W
        FDL+ K D+P+       SL+D+T      F   G Y +N +Y K  PKHL+DF S +F   P    D+H      +R                     W
Subjt:  FDLITKDDLPLVGTHERASLMDLTCFAFKIFVVAG-YHLNYMYDKTTPKHLVDFSSIFFMSAPTYVEDNHKYVHTKDR---------------------W

Query:  --------------CIS-------------------PSVTTLWEAGVTIKPSYVES-CLTSISFENGVLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNM
                      C S                   PS+T L EAGVTIK +  E  C  +I F+NGVL IP I++  TFE++IRN+IAFD YPAGN+ M
Subjt:  --------------CIS-------------------PSVTTLWEAGVTIKPSYVES-CLTSISFENGVLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNM

Query:  YAVEYVSFLHDLMKEEQDVHLLVK---------------TE--------------------NRAKRY-----WNKAKVTLRHDYFTTPWTTISVIAATFL
        YA+ YV FL DL+  EQD HLL K               TE                    N+A R      WN AK +L+H+YF TPW  IS  AATFL
Subjt:  YAVEYVSFLHDLMKEEQDVHLLVK---------------TE--------------------NRAKRY-----WNKAKVTLRHDYFTTPWTTISVIAATFL

Query:  ILLTLLQTIFSAISAF
        I+LT+LQTIFSAIS F
Subjt:  ILLTLLQTIFSAISAF

A0A5A7V9V0 UPF0481 protein1.1e-8846.34Show/hide
Query:  DQELCVNVVMFIKNILEQVPQVNPKICIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRISINNEQIESMEGSQVKINTVK
        DQ+LC NVV+ I  +L+Q+PQVN +  IY+ISKE+ E+N +AY PQ ISIGP H+ T +DL+AN+ YK QGF NFLRRI+INN+QI SME   ++  T+ 
Subjt:  DQELCVNVVMFIKNILEQVPQVNPKICIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRISINNEQIESMEGSQVKINTVK

Query:  FLVEKCHGLMKEAWNCY-ADQINMME-EEFVGMMLVDACFIVEFLILL------RGRVSNIQRIISKIDSKFYQGALYEILGDLIKLENQVPFFLLQSLF
         LVEK H  +KEA NCY +  IN ++ + FV MMLVDACFIVEFLIL        G+   IQ     ID  FYQG    IL DLIKLENQVPFFLLQ LF
Subjt:  FLVEKCHGLMKEAWNCY-ADQINMME-EEFVGMMLVDACFIVEFLILL------RGRVSNIQRIISKIDSKFYQGALYEILGDLIKLENQVPFFLLQSLF

Query:  DLITKDDLPLVGTHERASLMDLTCFAFKIFVVAGYHLNYMYDKTTPKHLVDFSSIFFM-SAPTYVEDNH----KYVHTKDRWCISPSVTTLWEAGVTIKP
        DLI K D+ ++     +S  DLT  A K  +V  Y +N   +   PKH VD  + +F+ SA   V + H      +  K+RW I PS+T L EAGVTIK 
Subjt:  DLITKDDLPLVGTHERASLMDLTCFAFKIFVVAGYHLNYMYDKTTPKHLVDFSSIFFM-SAPTYVEDNH----KYVHTKDRWCISPSVTTLWEAGVTIKP

Query:  SYVESCLTSISFENGVLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNMYAVEYVSFLHDLMKEEQDVHLLVK--------------------------TE
        +     LT I+F+NGVL IP +++   FE+++RN++AF+   A   N Y ++YV F+ DL+  E+DV LLV+                          T 
Subjt:  SYVESCLTSISFENGVLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNMYAVEYVSFLHDLMKEEQDVHLLVK--------------------------TE

Query:  NRAKRY--------------WNKAKVTLRHDYFTTPWTTISVIAATFLILLTLLQTIFSAISAF
         ++  +              WN+AK +L+H+YF TPW  IS  AAT LILLTLLQTIF+AI+ F
Subjt:  NRAKRY--------------WNKAKVTLRHDYFTTPWTTISVIAATFLILLTLLQTIFSAISAF

A0A5A7VD32 UPF0481 protein3.2e-24088.96Show/hide
Query:  MIQANHDITYKVEEPITGDADQELCVNVVMFIKNILEQVPQVNPKICIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRIS
        MIQANHDITYKVEEPITGDADQELCVNVVMFIKNILEQVPQVNPKICIYRISKEVRELNDRAYAPQFISIGPFHYHTR DLIANEYYKYQGFNNFLRRIS
Subjt:  MIQANHDITYKVEEPITGDADQELCVNVVMFIKNILEQVPQVNPKICIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRIS

Query:  INNEQIESMEGSQVKINTVKFLVEKCHGLMKEAWNCYADQINMMEEEFVGMMLVDACFIVEFLILLRGRVSNIQRIISKIDSKFYQGALYEILGDLIKLE
        INNEQIESMEGSQVKINTVKFLVEKCHGLMKEAWNCYADQINMMEEEFVGMMLVDACFIVEFLILLRGRVSNIQRIISKIDSKFYQGALYEILGDLIKLE
Subjt:  INNEQIESMEGSQVKINTVKFLVEKCHGLMKEAWNCYADQINMMEEEFVGMMLVDACFIVEFLILLRGRVSNIQRIISKIDSKFYQGALYEILGDLIKLE

Query:  NQVPFFLLQSLFDLITKDDLPLVGTHERASLMDLTCFAFKIFVVAGYHLNYMYDKTTPKHLVDFSSIFFMSAPTYVEDNHKYVHTKDRWCISPSVTTLWE
        NQVPFFLLQSLFDLITKDDLPLVGTHERASLMDLTCFAFKIFVVAGYH NYMYDKTTPKHLVDFSSIFFMS PTYVEDNHKYVHTKDRWCISPSVTTLWE
Subjt:  NQVPFFLLQSLFDLITKDDLPLVGTHERASLMDLTCFAFKIFVVAGYHLNYMYDKTTPKHLVDFSSIFFMSAPTYVEDNHKYVHTKDRWCISPSVTTLWE

Query:  AGVTIKPSYVESCLTSISFENGVLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNMYAVEYVSFLHDLMKEEQDVHLLVKT--------------------
        AGVTI+PSYVESCLTSISFENGVLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNMYAVEYVSFLHDLMKEEQDVHLLVK                     
Subjt:  AGVTIKPSYVESCLTSISFENGVLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNMYAVEYVSFLHDLMKEEQDVHLLVKT--------------------

Query:  ---------------------------ENRAKRYWNKAKVTLRHDYFTTPWTTISVIAATFLILLTLLQTIFSAISAFPG
                                    NRAKRYWNKAKVTLRHDYFTTPWTTISVIAATFLILLTLLQTIFSAISAFPG
Subjt:  ---------------------------ENRAKRYWNKAKVTLRHDYFTTPWTTISVIAATFLILLTLLQTIFSAISAFPG

A0A5A7VGS6 UPF0481 protein1.3e-10347.72Show/hide
Query:  MIQANHDITYKVEEPITGDADQELCVNVVMFIKNILEQVPQVNPKI-CIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRI
        MI+ N +I+++V E I+ D DQ++   VV+ I  +L+ +  VN K   IY++ KE+RE+ND+AY PQFISIGPFHY TR DLIANE+YK QGF NFLR I
Subjt:  MIQANHDITYKVEEPITGDADQELCVNVVMFIKNILEQVPQVNPKI-CIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRI

Query:  SINNEQIESMEGSQVKINTVKFLVEKCHGLMKEAWNCYADQINMMEEEFVGMMLVDACFIVEFLILLRGRVSNIQRIISKIDSKFYQGALYEILGDLIKL
        +IN +Q+ES+E  QVKI++ K LVEK H  MKEAWNCYA+ I M EEEF+ MMLVDACFIVEF +L  G      R+     S FY G  YEIL DLIKL
Subjt:  SINNEQIESMEGSQVKINTVKFLVEKCHGLMKEAWNCYADQINMMEEEFVGMMLVDACFIVEFLILLRGRVSNIQRIISKIDSKFYQGALYEILGDLIKL

Query:  ENQVPFFLLQSLFDLITK--DDLPLVGTHERA--SLMDLTCFAFKIF-VVAGYHLNYMYDKTTPKHLVDFSSIFFMSAPTYVED----------------
        ENQVPFFLLQ+LFDLI K  DD   +  +E    SL+DLT  A K F  V  Y +N +Y K  P+H++D  S +F+  P    D                
Subjt:  ENQVPFFLLQSLFDLITK--DDLPLVGTHERA--SLMDLTCFAFKIF-VVAGYHLNYMYDKTTPKHLVDFSSIFFMSAPTYVED----------------

Query:  ----------NHKYVHTKDRWCISPSVTTLWEAGVTIKPSYVES-CLTSISFENGVLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNMYAVEYVSFLHDL
                  +HK + +K R    P++T L+EAGVTIK +  E+ C  +ISF+NGVL IP I +  TFE++IRN++AF+ +PAGN+  YA++YV+FL DL
Subjt:  ----------NHKYVHTKDRWCISPSVTTLWEAGVTIKPSYVES-CLTSISFENGVLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNMYAVEYVSFLHDL

Query:  MKEEQDVHLLVKT-----------------------------------ENRAKRY-----WNKAKVTLRHDYFTTPWTTISVIAATFLILLTLLQTIFSA
        +  E+D+ LLVK                                     ++A R      WNKAK +L+H+YF TPW  IS  AA FLI+LT+LQTIFSA
Subjt:  MKEEQDVHLLVKT-----------------------------------ENRAKRY-----WNKAKVTLRHDYFTTPWTTISVIAATFLILLTLLQTIFSA

Query:  ISAFP
        ISAFP
Subjt:  ISAFP

SwissProt top hitse value%identityAlignment
Q9SD53 UPF0481 protein At3g472006.1e-1524Show/hide
Query:  CIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRISINNEQIESMEGSQVKINTVK--FLVEKCHGLMKEAWNCYADQINMM
        CI+R+ +    LN +AY P+ +SIGP+HY  +H                L+ I  +  ++  +   + K   V+   LV+    L  +    Y++++   
Subjt:  CIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRISINNEQIESMEGSQVKINTVK--FLVEKCHGLMKEAWNCYADQINMM

Query:  EEEFVGMMLVDACFIVEFLILLRGRVSNIQRIISKIDSKFYQGALYEILGDLIKLENQVPFFLLQSLFDLITKDDLPLVGTHERASLMDLTCFAFKIF--
          + + MM++D CFI+   +++ G +   +  I  I        L  I  DL+ LENQVPFF+LQ+L+          VG+    S  DL   AF  F  
Subjt:  EEEFVGMMLVDACFIVEFLILLRGRVSNIQRIISKIDSKFYQGALYEILGDLIKLENQVPFFLLQSLFDLITKDDLPLVGTHERASLMDLTCFAFKIF--

Query:  ---VVAGY---HLNYMYDKTTPKHLVD-FSSIFFMSAPTYVEDNHKYVHTKDRWCISPSVTTLWEAGVTIKPSYVESCLTSISF------ENGVLN----
               Y   H NY       KHL+D     F  +     + +  +V  +     S +V ++    V +  S     L  I F      E+ +LN    
Subjt:  ---VVAGY---HLNYMYDKTTPKHLVD-FSSIFFMSAPTYVEDNHKYVHTKDRWCISPSVTTLWEAGVTIKPSYVESCLTSISF------ENGVLN----

Query:  -----IPHINMGKTFEIMIRNLIAFDHYPAGNKNMYAVEYVSFLHDLMKEEQDVHLL--------------------VKTENR-----------------
             IP +           N +AF+ +   + N     Y+ F+  L+  E+DV  L                     KT ++                 
Subjt:  -----IPHINMGKTFEIMIRNLIAFDHYPAGNKNMYAVEYVSFLHDLMKEEQDVHLL--------------------VKTENR-----------------

Query:  ----AKRYWNKAKVTLRHDYFTTPWTTISVIAATFLILLTLLQTIFSAIS
             K+++N      RH +F +PWT +S  A  F+ILLT+LQ+  + +S
Subjt:  ----AKRYWNKAKVTLRHDYFTTPWTTISVIAATFLILLTLLQTIFSAIS

Arabidopsis top hitse value%identityAlignment
AT3G50130.1 Plant protein of unknown function (DUF247)2.5e-2424.34Show/hide
Query:  KICIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRISINNEQIESMEGSQVKINTVKFLVEKCHGLMKEAWNCYADQINMM
        K+CIYR+ + ++E N ++Y PQ +S+GPFH+  +H L+  + +K++  N  + R                  + ++  ++    L   A  CY   I++ 
Subjt:  KICIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRISINNEQIESMEGSQVKINTVKFLVEKCHGLMKEAWNCYADQINMM

Query:  EEEFVGMMLVDACFIVEFLILLRGRVSNIQRI-ISKIDSKF-YQGALYEILGDLIKLENQVPFFLLQSLFD--------------LITKDDLPLVGTHER
          +F  M+++D CF++E   L RG       +   + D  F  +G+++ I  D++ LENQ+P F+L  L +              L  +   PL+ T E 
Subjt:  EEEFVGMMLVDACFIVEFLILLRGRVSNIQRI-ISKIDSKF-YQGALYEILGDLIKLENQVPFFLLQSLFD--------------LITKDDLPLVGTHER

Query:  ASLMDLTCFAFKIFVVAGYHLNYMYDKTTPK-HLVDFSSIFFMSAPTYVEDN--------HKYVHTKDRWCISPSVTTLWEAGVTIKPSYVESCLTSISF
         +  D +    K F       N + DK   + H +D      +   +  E             V  K +  +   VT L EAG+  +    +     I F
Subjt:  ASLMDLTCFAFKIFVVAGYHLNYMYDKTTPK-HLVDFSSIFFMSAPTYVEDN--------HKYVHTKDRWCISPSVTTLWEAGVTIKPSYVESCLTSISF

Query:  ENGVLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNMYAVEYVSFLHDLMKEEQDVHLL----------------------------------------VK
        +NG L IP + +    + +  NLIAF+     + N     Y+ F+ +L+   +DV  L                                         K
Subjt:  ENGVLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNMYAVEYVSFLHDLMKEEQDVHLL----------------------------------------VK

Query:  TENRAKRYWNKAKVTLRHDYFTTPWTTISVIAATFLILLTLLQTIFSAISAF
         +    R WN  K  L+H YF  PW   S  AA  L++LTL Q+ F+A   F
Subjt:  TENRAKRYWNKAKVTLRHDYFTTPWTTISVIAATFLILLTLLQTIFSAISAF

AT3G50140.1 Plant protein of unknown function (DUF247)2.1e-2625Show/hide
Query:  VMFIKNILEQVPQVN-----PKICIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRISINNEQIESMEGSQVKINTVKFLV
        V++IK+ +EQV +        KICIYR+   +++ +  +Y PQ +S+GP+H+   H L   +Y+K++  N  ++R         + +G ++ I+ +K L 
Subjt:  VMFIKNILEQVPQVN-----PKICIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRISINNEQIESMEGSQVKINTVKFLV

Query:  EKCHGLMKEAWNCYADQINMMEEEFVGMMLVDACFIVEFLILLRGRVSNIQRI-ISKIDSKF-YQGALYEILGDLIKLENQVPFFLLQSLFDLITKDDLP
        E+          CY   I +   +F  M+++D CF+++   L RG      ++   + D  F  +G+++ I  D++ LENQ+P F+L  L +L       
Subjt:  EKCHGLMKEAWNCYADQINMMEEEFVGMMLVDACFIVEFLILLRGRVSNIQRI-ISKIDSKF-YQGALYEILGDLIKLENQVPFFLLQSLFDLITKDDLP

Query:  LVGTHERASLMDLTCFAFKIFVVAGY---------------HLNYMYDKTTPK-HLVDFSSIFFMS--APTYVEDNHKYVHTKDRWCISP----------
         +GT  +  L+      F   ++  Y                 N + DK   + H +D   +F  S   P+   D      ++ RW   P          
Subjt:  LVGTHERASLMDLTCFAFKIFVVAGY---------------HLNYMYDKTTPK-HLVDFSSIFFMS--APTYVEDNHKYVHTKDRWCISP----------

Query:  --SVTTLWEAGVTIKPSYVESCLTSISFENGVLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNMYAVEYVSFLHDLMKEEQDVH----------------
           VT L EAG+  K    +     I F+NG L IP + +    + +  NLIA++     + N     Y+ F+ +L+   +D+                 
Subjt:  --SVTTLWEAGVTIKPSYVESCLTSISFENGVLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNMYAVEYVSFLHDLMKEEQDVH----------------

Query:  --------------------LLVKTENRAKRY----WNKAKVTLRHDYFTTPWTTISVIAATFLILLTLLQTIFSAISAF
                             L +  N+  RY    WN  K TL+H YF+ PW   S  AA  L+LLTL Q+ F++   F
Subjt:  --------------------LLVKTENRAKRY----WNKAKVTLRHDYFTTPWTTISVIAATFLILLTLLQTIFSAISAF

AT3G50170.1 Plant protein of unknown function (DUF247)1.9e-2725.9Show/hide
Query:  QANHDITYKVEEPITGDADQELCVNVVMFIKNILEQVPQ-----VNPKICIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLR
        ++  ++  +  E  TGD       + V+ I++ LEQ  +     +  K+CIYR+   ++E + ++Y PQ +S+GP+H H +  L   E +K++  N  L+
Subjt:  QANHDITYKVEEPITGDADQELCVNVVMFIKNILEQVPQ-----VNPKICIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLR

Query:  RISINNEQIESMEGSQVKINTVKFLVEKCHGLMKEAWNCYADQINMMEEEFVGMMLVDACFIVEFLILLRGRVSNIQRI-ISKIDSKF-YQGALYEILGD
        R+    ++IE      +  N ++ L EK          CY   I++   EF  M+++D CF++E   L RG V     I  ++ D  F  +G ++ I  D
Subjt:  RISINNEQIESMEGSQVKINTVKFLVEKCHGLMKEAWNCYADQINMMEEEFVGMMLVDACFIVEFLILLRGRVSNIQRI-ISKIDSKF-YQGALYEILGD

Query:  LIKLENQVPFFLLQSLFDL--------------ITKDDLPLVGTHERASLMDLTCFAFKIFVVAGYHLNYMYDKTTPKHLVDFSSIFFMSAPT-------
        +I LENQ+P F+L  L +L                K   PL+ T E  +  D +    K+       L+ + DK     L  F      S+PT       
Subjt:  LIKLENQVPFFLLQSLFDL--------------ITKDDLPLVGTHERASLMDLTCFAFKIFVVAGYHLNYMYDKTTPKHLVDFSSIFFMSAPT-------

Query:  -YVEDNHKYVHTKDRWCISPSVTTLWEAGVTIKPSYVESCLTSISFENGVLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNMYAVEYVSFLHDLMKEEQD
          +  N + V  + +  +   VT L EAGV  +    +     I F+NG L IP + +    + +  NLIAF+     + N +   Y+ F+ +L+   +D
Subjt:  -YVEDNHKYVHTKDRWCISPSVTTLWEAGVTIKPSYVESCLTSISFENGVLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNMYAVEYVSFLHDLMKEEQD

Query:  VHL------------------------------------LVKTENRAKRY----WNKAKVTLRHDYFTTPWTTISVIAATFLILLTLLQTIFSAISAF
        V                                      L +      RY    WN  K TL H YF  PW   S  AA  L+LLTL Q+ ++  + +
Subjt:  VHL------------------------------------LVKTENRAKRY----WNKAKVTLRHDYFTTPWTTISVIAATFLILLTLLQTIFSAISAF

AT4G31980.1 unknown protein4.2e-3528.25Show/hide
Query:  IKNILEQVPQVNPKICIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRISINNEQIESMEGSQVKINTVKFLVEKCHGLMK
        IK  L  +  ++ K CIY++  ++R LN  AY P+ +S GP H   + +L A E  KY+   +F+ R    N  +E              LV       +
Subjt:  IKNILEQVPQVNPKICIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRISINNEQIESMEGSQVKINTVKFLVEKCHGLMK

Query:  EAWNCYADQINMMEEEFVGMMLVDACFIVEFLILLRGRVSNIQRIISKIDSKFYQGALY-EILGDLIKLENQVPFFLLQSLFDLITKDDLPLVGTHERAS
         A +CYA+ + +  +EFV M++VD  F+VE  +LLR   S+  R+  + D  F    +  ++  D+I +ENQ+PFF+++ +F       L L+  +++ +
Subjt:  EAWNCYADQINMMEEEFVGMMLVDACFIVEFLILLRGRVSNIQRIISKIDSKFYQGALY-EILGDLIKLENQVPFFLLQSLFDLITKDDLPLVGTHERAS

Query:  LMDLTCFAFKIFVVAGYHLNYMYDK-------TTPKHLVD-FSSIFFMSAPTYVEDNHKYVHTKDRWCISPSVTTLWEAGVTIKPSYVESCLTSISFENG
                  I  +A  H +Y   +       T P+H VD   S +    P  +E      +T  +   +P  T L  AGV  KP+   SCL  ISF +G
Subjt:  LMDLTCFAFKIFVVAGYHLNYMYDK-------TTPKHLVD-FSSIFFMSAPTYVEDNHKYVHTKDRWCISPSVTTLWEAGVTIKPSYVESCLTSISFENG

Query:  VLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNMYAVEYVSFLHDLMKEEQDVHLLVKT-----------------------------------ENRAKRY
        VL IP I +    E + +N+I F+     NKN   ++Y+  L   +K   D  LL+ +                                       + Y
Subjt:  VLNIPHINMGKTFEIMIRNLIAFDHYPAGNKNMYAVEYVSFLHDLMKEEQDVHLLVKT-----------------------------------ENRAKRY

Query:  ----WNKAKVTLRHDYFTTPWTTISVIAATFLILLTLLQTIFSAIS
            WN+ K  LR DYF  PW   SV AA  L+LLT +Q++ S ++
Subjt:  ----WNKAKVTLRHDYFTTPWTTISVIAATFLILLTLLQTIFSAIS

AT5G22540.1 Plant protein of unknown function (DUF247)6.7e-2525.58Show/hide
Query:  CIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRISINNEQIESMEGSQVKINTVKFLVEKCHGLMKEAWNCYADQINMMEE
        CI RI + +  +N +AY P+ +SIGP+H+   H  +  ++ +      FL+      E+   +    VK       V    G+++     Y++ + +  E
Subjt:  CIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRISINNEQIESMEGSQVKINTVKFLVEKCHGLMKEAWNCYADQINMMEE

Query:  EFVGMMLVDACFIVEFLILLRGRVSNIQRIISKIDSKFYQ--GALYEILGDLIKLENQVPFFLLQSLFDLITKDDLPLVGTHERASLMDLTCFAFKIFVV
          V MM++D CFI+    ++ G+V       + +D   ++    L  I  DL+ LENQVP+ LLQ+LF+           T +  +   L   AF+ F  
Subjt:  EFVGMMLVDACFIVEFLILLRGRVSNIQRIISKIDSKFYQ--GALYEILGDLIKLENQVPFFLLQSLFDLITKDDLPLVGTHERASLMDLTCFAFKIFVV

Query:  AGYHLNYMYDK---TTPKHLVDFSSIFFMSAPTY--VEDNHKYVHTKDRWCIS--PSVTTLWEAGVTIKPSYVESCLTSISFENGVLNIPHINMGKTFEI
        +       ++K      KHL+D     F+  P+   ++D+       D   +    S   L   G+  KP      +  IS+ NGVL+IP + M      
Subjt:  AGYHLNYMYDK---TTPKHLVDFSSIFFMSAPTY--VEDNHKYVHTKDRWCIS--PSVTTLWEAGVTIKPSYVESCLTSISFENGVLNIPHINMGKTFEI

Query:  MIRNLIAFDHYPAGNKNMYAVEYVSFLHDLMKEEQDVHLLVK---------TENRAKRYWNK--------------AKV-----------------TLRH
        +  N +AF+   A + N +   YV+F+  L+ EE D   L +         TE+   R++ +              AKV                    H
Subjt:  MIRNLIAFDHYPAGNKNMYAVEYVSFLHDLMKEEQDVHLLVK---------TENRAKRYWNK--------------AKV-----------------TLRH

Query:  DYFTTPWTTISVIAATFLILLTLLQTIFSAISAF
         +F +PWT  S  AA  L+L   LQ  F+A S F
Subjt:  DYFTTPWTTISVIAATFLILLTLLQTIFSAISAF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATACAAGCGAACCACGACATAACTTACAAAGTGGAAGAACCAATAACTGGTGATGCTGATCAAGAGCTTTGTGTTAATGTTGTGATGTTTATTAAAAATATTTTAGA
ACAAGTGCCTCAAGTTAATCCAAAAATCTGCATCTATCGAATTTCCAAAGAGGTACGCGAGTTGAATGATAGAGCGTATGCTCCTCAATTCATTTCCATAGGCCCTTTTC
ATTATCACACTCGACATGATTTGATAGCCAATGAATACTATAAGTATCAAGGTTTTAATAACTTTCTACGTCGTATAAGTATTAATAATGAGCAGATTGAATCAATGGAA
GGAAGTCAGGTTAAAATTAATACAGTAAAGTTTCTTGTGGAAAAATGTCATGGGTTAATGAAAGAAGCTTGGAATTGCTACGCAGATCAAATAAATATGATGGAGGAAGA
GTTTGTTGGAATGATGCTTGTGGATGCTTGTTTCATAGTCGAGTTTCTTATACTGCTTCGTGGCCGTGTATCCAATATACAAAGAATAATATCAAAAATAGATTCTAAAT
TCTACCAAGGAGCACTATACGAAATACTTGGTGACTTGATAAAGTTGGAAAATCAAGTTCCTTTTTTTCTTCTTCAAAGTCTATTTGACCTGATAACAAAGGATGATTTA
CCCTTGGTTGGGACTCATGAGAGAGCCTCCTTGATGGATCTTACATGCTTTGCTTTTAAGATTTTTGTTGTGGCGGGGTATCATCTTAATTATATGTATGATAAAACGAC
ACCAAAGCACTTGGTTGATTTCTCTAGTATCTTTTTCATGTCGGCACCCACTTATGTTGAAGACAACCATAAATATGTGCACACTAAAGATCGGTGGTGCATTTCTCCAT
CCGTAACTACGCTCTGGGAGGCTGGTGTCACCATCAAACCAAGTTACGTAGAATCATGTTTGACGAGCATAAGCTTCGAAAACGGGGTTTTGAATATCCCACATATAAAT
ATGGGAAAAACCTTCGAAATTATGATACGAAACCTTATAGCATTTGATCATTACCCTGCAGGAAATAAGAACATGTATGCAGTCGAATATGTGTCATTTCTACATGATTT
GATGAAGGAAGAGCAAGATGTACATTTACTTGTGAAGACGGAGAACAGGGCAAAACGTTATTGGAATAAGGCGAAAGTTACATTGAGACATGACTATTTCACTACACCAT
GGACTACTATCTCTGTCATTGCTGCAACTTTCCTCATTCTTCTCACCCTCCTTCAAACCATATTCTCTGCTATATCGGCATTTCCTGGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGATACAAGCGAACCACGACATAACTTACAAAGTGGAAGAACCAATAACTGGTGATGCTGATCAAGAGCTTTGTGTTAATGTTGTGATGTTTATTAAAAATATTTTAGA
ACAAGTGCCTCAAGTTAATCCAAAAATCTGCATCTATCGAATTTCCAAAGAGGTACGCGAGTTGAATGATAGAGCGTATGCTCCTCAATTCATTTCCATAGGCCCTTTTC
ATTATCACACTCGACATGATTTGATAGCCAATGAATACTATAAGTATCAAGGTTTTAATAACTTTCTACGTCGTATAAGTATTAATAATGAGCAGATTGAATCAATGGAA
GGAAGTCAGGTTAAAATTAATACAGTAAAGTTTCTTGTGGAAAAATGTCATGGGTTAATGAAAGAAGCTTGGAATTGCTACGCAGATCAAATAAATATGATGGAGGAAGA
GTTTGTTGGAATGATGCTTGTGGATGCTTGTTTCATAGTCGAGTTTCTTATACTGCTTCGTGGCCGTGTATCCAATATACAAAGAATAATATCAAAAATAGATTCTAAAT
TCTACCAAGGAGCACTATACGAAATACTTGGTGACTTGATAAAGTTGGAAAATCAAGTTCCTTTTTTTCTTCTTCAAAGTCTATTTGACCTGATAACAAAGGATGATTTA
CCCTTGGTTGGGACTCATGAGAGAGCCTCCTTGATGGATCTTACATGCTTTGCTTTTAAGATTTTTGTTGTGGCGGGGTATCATCTTAATTATATGTATGATAAAACGAC
ACCAAAGCACTTGGTTGATTTCTCTAGTATCTTTTTCATGTCGGCACCCACTTATGTTGAAGACAACCATAAATATGTGCACACTAAAGATCGGTGGTGCATTTCTCCAT
CCGTAACTACGCTCTGGGAGGCTGGTGTCACCATCAAACCAAGTTACGTAGAATCATGTTTGACGAGCATAAGCTTCGAAAACGGGGTTTTGAATATCCCACATATAAAT
ATGGGAAAAACCTTCGAAATTATGATACGAAACCTTATAGCATTTGATCATTACCCTGCAGGAAATAAGAACATGTATGCAGTCGAATATGTGTCATTTCTACATGATTT
GATGAAGGAAGAGCAAGATGTACATTTACTTGTGAAGACGGAGAACAGGGCAAAACGTTATTGGAATAAGGCGAAAGTTACATTGAGACATGACTATTTCACTACACCAT
GGACTACTATCTCTGTCATTGCTGCAACTTTCCTCATTCTTCTCACCCTCCTTCAAACCATATTCTCTGCTATATCGGCATTTCCTGGCTAG
Protein sequenceShow/hide protein sequence
MIQANHDITYKVEEPITGDADQELCVNVVMFIKNILEQVPQVNPKICIYRISKEVRELNDRAYAPQFISIGPFHYHTRHDLIANEYYKYQGFNNFLRRISINNEQIESME
GSQVKINTVKFLVEKCHGLMKEAWNCYADQINMMEEEFVGMMLVDACFIVEFLILLRGRVSNIQRIISKIDSKFYQGALYEILGDLIKLENQVPFFLLQSLFDLITKDDL
PLVGTHERASLMDLTCFAFKIFVVAGYHLNYMYDKTTPKHLVDFSSIFFMSAPTYVEDNHKYVHTKDRWCISPSVTTLWEAGVTIKPSYVESCLTSISFENGVLNIPHIN
MGKTFEIMIRNLIAFDHYPAGNKNMYAVEYVSFLHDLMKEEQDVHLLVKTENRAKRYWNKAKVTLRHDYFTTPWTTISVIAATFLILLTLLQTIFSAISAFPG