; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0021802 (gene) of Chayote v1 genome

Gene IDSed0021802
OrganismSechium edule (Chayote v1)
DescriptionPlant protein of unknown function (DUF247)
Genome locationLG11:32432444..32438018
RNA-Seq ExpressionSed0021802
SyntenySed0021802
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0064942.1 UPF0481 protein [Cucumis melo var. makuwa]1.8e-9648.96Show/hide
Query:  ECCIHRVPKQLLNMNRNAYEPRNISIGPFH-HDKQNFSTTEELKLRFFNSYRCRV-----------GLSVKDIVEKARDWERTARWYYSEPINMNSEEFV
        +C I+RVPKQL  MN  AY P+ ISIGPFH H  +N    E+ KL+ F +Y  RV             SV+D+V++A+ W   AR  Y+E INMN E+F+
Subjt:  ECCIHRVPKQLLNMNRNAYEPRNISIGPFH-HDKQNFSTTEELKLRFFNSYRCRV-----------GLSVKDIVEKARDWERTARWYYSEPINMNSEEFV

Query:  KMMVLDGCFIVEFMIM-------KCPYLFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNLFDLIPS-KSNDTSFRSVVHSFLSECTLPIFRGE
        KMM++DGCFIVEF I+           LFP+ EN V  SFYK    ++  +LI LENQLPFFVLQ+LFDLIP  K     F+ + + +L+   L  +   
Subjt:  KMMVLDGCFIVEFMIM-------KCPYLFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNLFDLIPS-KSNDTSFRSVVHSFLSECTLPIFRGE

Query:  IRLCDNILNQSPRHLLEFLSSYFVPK------VEKHINESW---LPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGILKIHPLVIDDHFEICARNLLA
             +IL+  P+H ++FLS Y VP+       + +  E W   +PPS T++CEAG+  KK + N  C++  I FE+GIL+I PL IDD+FE   RNLLA
Subjt:  IRLCDNILNQSPRHLLEFLSSYFVPK------VEKHINESW---LPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGILKIHPLVIDDHFEICARNLLA

Query:  FSMFISCQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQ-ECNYFSHISKDLKQHCDRKWNKWMASLRHN
        F  F      VKN     I Y  F+D LISTEKDVNLLV+E II+N IGG+D+E++QLFN++CK ++     NYF+  SK L+ HCDR+WNK  ASL+HN
Subjt:  FSMFISCQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQ-ECNYFSHISKDLKQHCDRKWNKWMASLRHN

Query:  YFNTPWALISFLAATFLIILTVLQTIFTAISTF
        YFNTPWA IS  AATFL++LT+LQTIF+AIS F
Subjt:  YFNTPWALISFLAATFLIILTVLQTIFTAISTF

XP_008445188.1 PREDICTED: UPF0481 protein At3g47200-like [Cucumis melo]5.5e-9847.9Show/hide
Query:  DHVILCIEENLQKMAP-FVPECCIHRVPKQLLNMNRNAYEPRNISIGPFH-HDKQNFSTTEELKLRFFNSYRCRV-----------GLSVKDIVEKARDW
        D+V++ IE+ L ++ P    +C I+RVPKQL  MN  AY P+ ISIGPFH H  +N    E+ KL+ F +Y  RV             SV+D+V++A+ W
Subjt:  DHVILCIEENLQKMAP-FVPECCIHRVPKQLLNMNRNAYEPRNISIGPFH-HDKQNFSTTEELKLRFFNSYRCRV-----------GLSVKDIVEKARDW

Query:  ERTARWYYSEPINMNSEEFVKMMVLDGCFIVEFMIM-------KCPYLFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNLFDLIPS-KSNDTS
           AR  Y+E INMN E+F+KMM++DGCFIVEF I+           LFP+ EN V  SFYK    ++  +LI LENQLPFFVLQ+LFDLIP  K     
Subjt:  ERTARWYYSEPINMNSEEFVKMMVLDGCFIVEFMIM-------KCPYLFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNLFDLIPS-KSNDTS

Query:  FRSVVHSFLSECTLPIFRGEIRLCDNILNQSPRHLLEFLSSYFVPK------VEKHINESW---LPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGIL
        F+ + + +L+   L  +        +IL+  P+H ++FLS Y VP+       + +  E W   +PPS T++CEAG+  KK + N  C++  I FE+GIL
Subjt:  FRSVVHSFLSECTLPIFRGEIRLCDNILNQSPRHLLEFLSSYFVPK------VEKHINESW---LPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGIL

Query:  KIHPLVIDDHFEICARNLLAFSMFISCQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQ-ECNYFSHISK
        +I PL IDD+FE   RNLLAF  F      VKN     I Y  F+D LI TEKDVNLLV+E II+N IGG+D+E++QLFN++CK ++     NYF+  SK
Subjt:  KIHPLVIDDHFEICARNLLAFSMFISCQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQ-ECNYFSHISK

Query:  DLKQHCDRKWNKWMASLRHNYFNTPWALISFLAATFLIILTVLQTIFTAISTF
         L+ HCDR+WNK  ASL+HNYFNTPWA IS  AATFL++LT+LQTIF+AIS F
Subjt:  DLKQHCDRKWNKWMASLRHNYFNTPWALISFLAATFLIILTVLQTIFTAISTF

XP_022131634.1 UPF0481 protein At3g47200-like [Momordica charantia]1.1e-9345.83Show/hide
Query:  HVILCIEENLQKM--APFVPECCIHRVPKQLLNMNRNAYEPRNISIGPFHHDKQ-NFSTTEELKLRFFNSYRCRVGLSVKDIVEKARDWERTARWYYSEP
        HV++ IEE  +++   P  PEC I+RVPK+LLNMNR AY P+ ISIGPFHH  Q N   T++ KL+  +SY  RV ++V+ +V+  ++WE  AR  Y EP
Subjt:  HVILCIEENLQKM--APFVPECCIHRVPKQLLNMNRNAYEPRNISIGPFHHDKQ-NFSTTEELKLRFFNSYRCRVGLSVKDIVEKARDWERTARWYYSEP

Query:  INMNSEEFVKMMVLDGCFIVEFMIMKCPYLFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNLFDLIPS--KSNDTSFRSVVHSFLSECTLPIF
        I MN+++FV M++LDGCF+V F+I+     +  +EN   SSFY+ +  ++  ++ MLENQLPFFVLQ L+DLIP   +  + S   ++ +F S       
Subjt:  INMNSEEFVKMMVLDGCFIVEFMIMKCPYLFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNLFDLIPS--KSNDTSFRSVVHSFLSECTLPIF

Query:  RGEIRLCDNILNQSPRHLLEFLSSYFVP----KVEKHINESWLPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGILKIHPLVIDDHFEICARNLLAFS
             +  ++   + +HL++ LS YF+P    K +   +E  + P  T+LCEAG+  KK  G E   +  ISF++G+L+I PL IDDHFE   RNL+AF 
Subjt:  RGEIRLCDNILNQSPRHLLEFLSSYFVP----KVEKHINESWLPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGILKIHPLVIDDHFEICARNLLAFS

Query:  MFISCQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQ-ECNYFSHISKDLKQHCDRKWNKWMASLRHNYF
         + +      N     I Y  FLD +ISTEKDV LLVE GII+N IGG+DKE+++LFND+ K ++I    +Y +HI+K L  HC + W +  A+L+ +YF
Subjt:  MFISCQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQ-ECNYFSHISKDLKQHCDRKWNKWMASLRHNYF

Query:  NTPWALISFLAATFLIILTVLQTIFTAISTFK
        N+PWA IS +AAT++IILT+LQTIFTAISTFK
Subjt:  NTPWALISFLAATFLIILTVLQTIFTAISTFK

XP_022158990.1 UPF0481 protein At3g47200-like isoform X2 [Momordica charantia]1.6e-8945.8Show/hide
Query:  MDHVILCIEENLQKMAPFVPECCIHRVPKQLLNMNRNAYEPRNISIGPFHHDKQNFSTTEELKLRFFNSYRCRVGLSVKDIVEKARDWERTARWYYSEPI
        +D V   I++ LQ++ P   EC IHRVP++LL  N  AY P+ ISIGPFHH +Q+    E+ KLRF + Y  R    ++  V   R WE TAR  Y+EPI
Subjt:  MDHVILCIEENLQKMAPFVPECCIHRVPKQLLNMNRNAYEPRNISIGPFHHDKQNFSTTEELKLRFFNSYRCRVGLSVKDIVEKARDWERTARWYYSEPI

Query:  NMNSEEFVKMMVLDGCFIVEFMIMKCPYLFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNLFDLIPSKSNDTSFRSVVHSF------LSECTL
        NM+S+EFVKMM++DGCFIVE M+M C  +  E E +     +  +  ++  +LIMLENQLPFFVLQ LFD   S     SF  + H F      +   TL
Subjt:  NMNSEEFVKMMVLDGCFIVEFMIMKCPYLFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNLFDLIPSKSNDTSFRSVVHSF------LSECTL

Query:  PIFRGEIRLCDNILNQSPRHLLEFLSSYFVP----------KVEKHINESWLPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGILKIHPLVIDDHFEI
         +  G +     I      HL++FLS Y+ P           +     +   PP+ T+L EAGIVFKK    +  IM+ ISF+D +L+I PL I D FE 
Subjt:  PIFRGEIRLCDNILNQSPRHLLEFLSSYFVP----------KVEKHINESWLPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGILKIHPLVIDDHFEI

Query:  CARNLLAFSMFISCQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQ-ECNYFSHISKDLKQHCDRKWNKW
          RNL+AF       E   N  + AI YF FL+ LIS E+DV+LLV+  II N IGGN++E++ LFND+CK++ ++ +CN F+HI++ L +HC  +WNK 
Subjt:  CARNLLAFSMFISCQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQ-ECNYFSHISKDLKQHCDRKWNKW

Query:  MASLRHNYFNTPWALISFLAATFLIILTVLQTIFTAISTFK
        MASLR +YFNTPWA ISF+AA FLI+LT LQT+F+A+S  K
Subjt:  MASLRHNYFNTPWALISFLAATFLIILTVLQTIFTAISTFK

XP_031736550.1 UPF0481 protein At3g47200-like [Cucumis sativus]1.4e-9848.89Show/hide
Query:  DHVILCIEENLQKM-APFVPECCIHRVPKQLLNMNRNAYEPRNISIGPF-HHDKQNFSTTEELKLRFFNSYRCRVG----------LSVKDIVEKARDWE
        D+V++ IE+ L ++ +    +C I+RVPKQL  MN  AY P+ ISIGPF +H  +N    E+ KL+ FN++  RV            S+ D+V+KA+ W 
Subjt:  DHVILCIEENLQKM-APFVPECCIHRVPKQLLNMNRNAYEPRNISIGPF-HHDKQNFSTTEELKLRFFNSYRCRVG----------LSVKDIVEKARDWE

Query:  RTARWYYSEPINMNSEEFVKMMVLDGCFIVEFMIM-----KCPY--LFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNLFDLIPS-KSNDTSF
        + AR  Y+E INMN E+F+KMM++DGCFIVEF I+     K P+  LFP+ EN V  SFYK    ++  +LI LENQLPFFVLQ+LFDLIP    N   F
Subjt:  RTARWYYSEPINMNSEEFVKMMVLDGCFIVEFMIM-----KCPY--LFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNLFDLIPS-KSNDTSF

Query:  RSVVHSFLSECTLPIFRGEIRLCDNILNQSPRHLLEFLSSYFVPKVE-KHINES-----W---LPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGILK
        + + + +L+   L  +        +IL+  P+H ++FLS YFVP    +H  ES     W   +PPS T+LCEAG+  KK E N  C+M  I FE+GIL+
Subjt:  RSVVHSFLSECTLPIFRGEIRLCDNILNQSPRHLLEFLSSYFVPKVE-KHINES-----W---LPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGILK

Query:  IHPLVIDDHFEICARNLLAFSMFISCQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQ-ECNYFSHISKD
        I PL IDD+FE   RNLLAF  F      V+      I Y  F+D LISTEKDVNLLV+E II+N IGG+D+E++QLFN++CK ++     NYF++IS+ 
Subjt:  IHPLVIDDHFEICARNLLAFSMFISCQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQ-ECNYFSHISKD

Query:  LKQHCDRKWNKWMASLRHNYFNTPWALISFLAATFLIILTVLQTIFTAISTF
        L++HCDR WNK  ASL+HNYFNTPWA ISF AAT L++LT+LQT+F+AIS F
Subjt:  LKQHCDRKWNKWMASLRHNYFNTPWALISFLAATFLIILTVLQTIFTAISTF

TrEMBL top hitse value%identityAlignment
A0A1S3BBL9 UPF0481 protein At3g47200-like2.6e-9847.9Show/hide
Query:  DHVILCIEENLQKMAP-FVPECCIHRVPKQLLNMNRNAYEPRNISIGPFH-HDKQNFSTTEELKLRFFNSYRCRV-----------GLSVKDIVEKARDW
        D+V++ IE+ L ++ P    +C I+RVPKQL  MN  AY P+ ISIGPFH H  +N    E+ KL+ F +Y  RV             SV+D+V++A+ W
Subjt:  DHVILCIEENLQKMAP-FVPECCIHRVPKQLLNMNRNAYEPRNISIGPFH-HDKQNFSTTEELKLRFFNSYRCRV-----------GLSVKDIVEKARDW

Query:  ERTARWYYSEPINMNSEEFVKMMVLDGCFIVEFMIM-------KCPYLFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNLFDLIPS-KSNDTS
           AR  Y+E INMN E+F+KMM++DGCFIVEF I+           LFP+ EN V  SFYK    ++  +LI LENQLPFFVLQ+LFDLIP  K     
Subjt:  ERTARWYYSEPINMNSEEFVKMMVLDGCFIVEFMIM-------KCPYLFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNLFDLIPS-KSNDTS

Query:  FRSVVHSFLSECTLPIFRGEIRLCDNILNQSPRHLLEFLSSYFVPK------VEKHINESW---LPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGIL
        F+ + + +L+   L  +        +IL+  P+H ++FLS Y VP+       + +  E W   +PPS T++CEAG+  KK + N  C++  I FE+GIL
Subjt:  FRSVVHSFLSECTLPIFRGEIRLCDNILNQSPRHLLEFLSSYFVPK------VEKHINESW---LPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGIL

Query:  KIHPLVIDDHFEICARNLLAFSMFISCQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQ-ECNYFSHISK
        +I PL IDD+FE   RNLLAF  F      VKN     I Y  F+D LI TEKDVNLLV+E II+N IGG+D+E++QLFN++CK ++     NYF+  SK
Subjt:  KIHPLVIDDHFEICARNLLAFSMFISCQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQ-ECNYFSHISK

Query:  DLKQHCDRKWNKWMASLRHNYFNTPWALISFLAATFLIILTVLQTIFTAISTF
         L+ HCDR+WNK  ASL+HNYFNTPWA IS  AATFL++LT+LQTIF+AIS F
Subjt:  DLKQHCDRKWNKWMASLRHNYFNTPWALISFLAATFLIILTVLQTIFTAISTF

A0A5A7V9C4 UPF0481 protein8.5e-9748.96Show/hide
Query:  ECCIHRVPKQLLNMNRNAYEPRNISIGPFH-HDKQNFSTTEELKLRFFNSYRCRV-----------GLSVKDIVEKARDWERTARWYYSEPINMNSEEFV
        +C I+RVPKQL  MN  AY P+ ISIGPFH H  +N    E+ KL+ F +Y  RV             SV+D+V++A+ W   AR  Y+E INMN E+F+
Subjt:  ECCIHRVPKQLLNMNRNAYEPRNISIGPFH-HDKQNFSTTEELKLRFFNSYRCRV-----------GLSVKDIVEKARDWERTARWYYSEPINMNSEEFV

Query:  KMMVLDGCFIVEFMIM-------KCPYLFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNLFDLIPS-KSNDTSFRSVVHSFLSECTLPIFRGE
        KMM++DGCFIVEF I+           LFP+ EN V  SFYK    ++  +LI LENQLPFFVLQ+LFDLIP  K     F+ + + +L+   L  +   
Subjt:  KMMVLDGCFIVEFMIM-------KCPYLFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNLFDLIPS-KSNDTSFRSVVHSFLSECTLPIFRGE

Query:  IRLCDNILNQSPRHLLEFLSSYFVPK------VEKHINESW---LPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGILKIHPLVIDDHFEICARNLLA
             +IL+  P+H ++FLS Y VP+       + +  E W   +PPS T++CEAG+  KK + N  C++  I FE+GIL+I PL IDD+FE   RNLLA
Subjt:  IRLCDNILNQSPRHLLEFLSSYFVPK------VEKHINESW---LPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGILKIHPLVIDDHFEICARNLLA

Query:  FSMFISCQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQ-ECNYFSHISKDLKQHCDRKWNKWMASLRHN
        F  F      VKN     I Y  F+D LISTEKDVNLLV+E II+N IGG+D+E++QLFN++CK ++     NYF+  SK L+ HCDR+WNK  ASL+HN
Subjt:  FSMFISCQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQ-ECNYFSHISKDLKQHCDRKWNKWMASLRHN

Query:  YFNTPWALISFLAATFLIILTVLQTIFTAISTF
        YFNTPWA IS  AATFL++LT+LQTIF+AIS F
Subjt:  YFNTPWALISFLAATFLIILTVLQTIFTAISTF

A0A6J1BQT6 UPF0481 protein At3g47200-like5.2e-9445.83Show/hide
Query:  HVILCIEENLQKM--APFVPECCIHRVPKQLLNMNRNAYEPRNISIGPFHHDKQ-NFSTTEELKLRFFNSYRCRVGLSVKDIVEKARDWERTARWYYSEP
        HV++ IEE  +++   P  PEC I+RVPK+LLNMNR AY P+ ISIGPFHH  Q N   T++ KL+  +SY  RV ++V+ +V+  ++WE  AR  Y EP
Subjt:  HVILCIEENLQKM--APFVPECCIHRVPKQLLNMNRNAYEPRNISIGPFHHDKQ-NFSTTEELKLRFFNSYRCRVGLSVKDIVEKARDWERTARWYYSEP

Query:  INMNSEEFVKMMVLDGCFIVEFMIMKCPYLFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNLFDLIPS--KSNDTSFRSVVHSFLSECTLPIF
        I MN+++FV M++LDGCF+V F+I+     +  +EN   SSFY+ +  ++  ++ MLENQLPFFVLQ L+DLIP   +  + S   ++ +F S       
Subjt:  INMNSEEFVKMMVLDGCFIVEFMIMKCPYLFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNLFDLIPS--KSNDTSFRSVVHSFLSECTLPIF

Query:  RGEIRLCDNILNQSPRHLLEFLSSYFVP----KVEKHINESWLPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGILKIHPLVIDDHFEICARNLLAFS
             +  ++   + +HL++ LS YF+P    K +   +E  + P  T+LCEAG+  KK  G E   +  ISF++G+L+I PL IDDHFE   RNL+AF 
Subjt:  RGEIRLCDNILNQSPRHLLEFLSSYFVP----KVEKHINESWLPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGILKIHPLVIDDHFEICARNLLAFS

Query:  MFISCQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQ-ECNYFSHISKDLKQHCDRKWNKWMASLRHNYF
         + +      N     I Y  FLD +ISTEKDV LLVE GII+N IGG+DKE+++LFND+ K ++I    +Y +HI+K L  HC + W +  A+L+ +YF
Subjt:  MFISCQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQ-ECNYFSHISKDLKQHCDRKWNKWMASLRHNYF

Query:  NTPWALISFLAATFLIILTVLQTIFTAISTFK
        N+PWA IS +AAT++IILT+LQTIFTAISTFK
Subjt:  NTPWALISFLAATFLIILTVLQTIFTAISTFK

A0A6J1DXD6 UPF0481 protein At3g47200-like isoform X27.7e-9045.8Show/hide
Query:  MDHVILCIEENLQKMAPFVPECCIHRVPKQLLNMNRNAYEPRNISIGPFHHDKQNFSTTEELKLRFFNSYRCRVGLSVKDIVEKARDWERTARWYYSEPI
        +D V   I++ LQ++ P   EC IHRVP++LL  N  AY P+ ISIGPFHH +Q+    E+ KLRF + Y  R    ++  V   R WE TAR  Y+EPI
Subjt:  MDHVILCIEENLQKMAPFVPECCIHRVPKQLLNMNRNAYEPRNISIGPFHHDKQNFSTTEELKLRFFNSYRCRVGLSVKDIVEKARDWERTARWYYSEPI

Query:  NMNSEEFVKMMVLDGCFIVEFMIMKCPYLFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNLFDLIPSKSNDTSFRSVVHSF------LSECTL
        NM+S+EFVKMM++DGCFIVE M+M C  +  E E +     +  +  ++  +LIMLENQLPFFVLQ LFD   S     SF  + H F      +   TL
Subjt:  NMNSEEFVKMMVLDGCFIVEFMIMKCPYLFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNLFDLIPSKSNDTSFRSVVHSF------LSECTL

Query:  PIFRGEIRLCDNILNQSPRHLLEFLSSYFVP----------KVEKHINESWLPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGILKIHPLVIDDHFEI
         +  G +     I      HL++FLS Y+ P           +     +   PP+ T+L EAGIVFKK    +  IM+ ISF+D +L+I PL I D FE 
Subjt:  PIFRGEIRLCDNILNQSPRHLLEFLSSYFVP----------KVEKHINESWLPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGILKIHPLVIDDHFEI

Query:  CARNLLAFSMFISCQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQ-ECNYFSHISKDLKQHCDRKWNKW
          RNL+AF       E   N  + AI YF FL+ LIS E+DV+LLV+  II N IGGN++E++ LFND+CK++ ++ +CN F+HI++ L +HC  +WNK 
Subjt:  CARNLLAFSMFISCQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQ-ECNYFSHISKDLKQHCDRKWNKW

Query:  MASLRHNYFNTPWALISFLAATFLIILTVLQTIFTAISTFK
        MASLR +YFNTPWA ISF+AA FLI+LT LQT+F+A+S  K
Subjt:  MASLRHNYFNTPWALISFLAATFLIILTVLQTIFTAISTFK

A0A6J1E120 UPF0481 protein At3g47200-like isoform X17.7e-9045.8Show/hide
Query:  MDHVILCIEENLQKMAPFVPECCIHRVPKQLLNMNRNAYEPRNISIGPFHHDKQNFSTTEELKLRFFNSYRCRVGLSVKDIVEKARDWERTARWYYSEPI
        +D V   I++ LQ++ P   EC IHRVP++LL  N  AY P+ ISIGPFHH +Q+    E+ KLRF + Y  R    ++  V   R WE TAR  Y+EPI
Subjt:  MDHVILCIEENLQKMAPFVPECCIHRVPKQLLNMNRNAYEPRNISIGPFHHDKQNFSTTEELKLRFFNSYRCRVGLSVKDIVEKARDWERTARWYYSEPI

Query:  NMNSEEFVKMMVLDGCFIVEFMIMKCPYLFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNLFDLIPSKSNDTSFRSVVHSF------LSECTL
        NM+S+EFVKMM++DGCFIVE M+M C  +  E E +     +  +  ++  +LIMLENQLPFFVLQ LFD   S     SF  + H F      +   TL
Subjt:  NMNSEEFVKMMVLDGCFIVEFMIMKCPYLFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNLFDLIPSKSNDTSFRSVVHSF------LSECTL

Query:  PIFRGEIRLCDNILNQSPRHLLEFLSSYFVP----------KVEKHINESWLPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGILKIHPLVIDDHFEI
         +  G +     I      HL++FLS Y+ P           +     +   PP+ T+L EAGIVFKK    +  IM+ ISF+D +L+I PL I D FE 
Subjt:  PIFRGEIRLCDNILNQSPRHLLEFLSSYFVP----------KVEKHINESWLPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGILKIHPLVIDDHFEI

Query:  CARNLLAFSMFISCQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQ-ECNYFSHISKDLKQHCDRKWNKW
          RNL+AF       E   N  + AI YF FL+ LIS E+DV+LLV+  II N IGGN++E++ LFND+CK++ ++ +CN F+HI++ L +HC  +WNK 
Subjt:  CARNLLAFSMFISCQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQ-ECNYFSHISKDLKQHCDRKWNKW

Query:  MASLRHNYFNTPWALISFLAATFLIILTVLQTIFTAISTFK
        MASLR +YFNTPWA ISF+AA FLI+LT LQT+F+A+S  K
Subjt:  MASLRHNYFNTPWALISFLAATFLIILTVLQTIFTAISTFK

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026451.2e-2022.33Show/hide
Query:  IHRVPKQLLNMNRNAYEPRNISIGPFHHDKQNFSTTEELKLRFFNSYRCRV-GLSVKDIVEKARDWERTARWYYSEPINMNSEEFVKMMVLDGCFIVEFM
        I  VPK L+  + ++Y P  +SIGP+H  K      E  KL      R +       D+VEK +  E   R  Y + I  N E  + +M +D  F++EF+
Subjt:  IHRVPKQLLNMNRNAYEPRNISIGPFHHDKQNFSTTEELKLRFFNSYRCRV-GLSVKDIVEKARDWERTARWYYSEPINMNSEEFVKMMVLDGCFIVEFM

Query:  IMKCPYLFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNL--FDLIPSKSNDTSFRSVVHSFLSECTLPIFRGEIRLCDNILN---QSPRHLLE
         +   Y F     KV +   +V    +  +++M+ENQ+P FVL+    F L  ++S D    SV+     + +  + + +    D IL    Q   H+L+
Subjt:  IMKCPYLFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNL--FDLIPSKSNDTSFRSVVHSFLSECTLPIFRGEIRLCDNILN---QSPRHLLE

Query:  FLSSYFVPKVE----------------------------KH-----------------------------------------------------------
        FL    VP++E                            KH                                                           
Subjt:  FLSSYFVPKVE----------------------------KH-----------------------------------------------------------

Query:  ----INESWLPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGILKIHPLVIDDHFEICARNLLAFSMFISCQENVKNIKENAIHYFQFLDELISTEKDV
            + E    PS + L +AG+ FK      I  + + S   G   +  + +D + E   RNL+A+    +    V         Y + ++ +I +E+DV
Subjt:  ----INESWLPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGILKIHPLVIDDHFEICARNLLAFSMFISCQENVKNIKENAIHYFQFLDELISTEKDV

Query:  NLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQECNYFSHISKDLKQHCDRKWNKWMASLRHNYFNTPWALISFLAATFLIILTVLQTIFTAISTF
         LL E+G++++ +  +D+E A+++N + K++ + +  +     +D+ ++   +W   +  L   Y    W +++FLAA  L++L  LQ      S+F
Subjt:  NLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQECNYFSHISKDLKQHCDRKWNKWMASLRHNYFNTPWALISFLAATFLIILTVLQTIFTAISTF

Q9SD53 UPF0481 protein At3g472009.2e-3227.65Show/hide
Query:  CCIHRVPKQLLNMNRNAYEPRNISIGPFHHDKQNFSTTEELK---LRFFNSYRCRVGLSVKDIVEKARDWERTARWYYSEPINMNSEEFVKMMVLDGCFI
        CCI RVP+  + +N  AY+P+ +SIGP+H+ +++    ++ K   L+ F     +  +    +V+   D E   R  YSE +     + + MMVLDGCFI
Subjt:  CCIHRVPKQLLNMNRNAYEPRNISIGPFHHDKQNFSTTEELK---LRFFNSYRCRVGLSVKDIVEKARDWERTARWYYSEPINMNSEEFVKMMVLDGCFI

Query:  -VEFMIMKCPYLFPENENKVHSSFYKVIELNMSV--ELIMLENQLPFFVLQNLFDLIPSK---SNDTSFRSVVHSFLSECTLPIFRGEIRLCDNILNQSP
         + F+IM        N        + +  L  S+  +L++LENQ+PFFVLQ L+  + SK   S+D + R   H F +    PI + E    +   N   
Subjt:  -VEFMIMKCPYLFPENENKVHSSFYKVIELNMSV--ELIMLENQLPFFVLQNLFDLIPSK---SNDTSFRSVVHSFLSECTLPIFRGEIRLCDNILNQSP

Query:  RHLLEFLSSYFVP-------------KVEKHINESWLPP-----------SATQLCEAGIVFKKVEGNEICIMEYISFEDGILKIHPLVIDDHFEICARN
        +HLL+ +   F+P             +V+ H  +S   P           SA +L   GI F+     E  I+  +  +   L+I  L  D        N
Subjt:  RHLLEFLSSYFVP-------------KVEKHINESWLPP-----------SATQLCEAGIVFKKVEGNEICIMEYISFEDGILKIHPLVIDDHFEICARN

Query:  LLAFSMFISCQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQ-ECNYFSHISKDLKQHCDRKWNKWMASL
         +AF  F +   N          Y  F+  L++ E+DV  L  + +I+    G++ E+++ F  I K++  + + +Y +++ K + ++  + +N   A  
Subjt:  LLAFSMFISCQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQ-ECNYFSHISKDLKQHCDRKWNKWMASL

Query:  RHNYFNTPWALISFLAATFLIILTVLQTIFTAIS
        RH +F +PW  +S  A  F+I+LT+LQ+    +S
Subjt:  RHNYFNTPWALISFLAATFLIILTVLQTIFTAIS

Arabidopsis top hitse value%identityAlignment
AT2G36430.1 Plant protein of unknown function (DUF247)6.1e-4731.04Show/hide
Query:  PECCIHRVPKQLLNMNRNAYEPRNISIGPFHHDKQNFSTTEELKLRFFNSYRCRV-GLSVKDIVEKARDWERTARWYYSEPINMNSEEFVKMMVLDGCFI
        P C I RVP+ +++ N   YEPR +SIGP+H  +      EE K R+ N    R   L+++D ++  ++ E  AR  YSE I+M+SEEF +MMVLDGCF+
Subjt:  PECCIHRVPKQLLNMNRNAYEPRNISIGPFHHDKQNFSTTEELKLRFFNSYRCRV-GLSVKDIVEKARDWERTARWYYSEPINMNSEEFVKMMVLDGCFI

Query:  VEFMIMKCPYLFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNLFDLI-PSKSNDT--SFRSVVHSFLSECTLPIFRGEIRLCDNILNQSPRHL
        +E +  K   L P   N    +   V+      + + LENQ+PFFVL+ LF+L      N+T  S +S+  +F +     + R E  L         +HL
Subjt:  VEFMIMKCPYLFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNLFDLI-PSKSNDT--SFRSVVHSFLSECTLPIFRGEIRLCDNILNQSPRHL

Query:  LEFLSSYFVPKVEKHINESWLP----------PSATQLCEAGIVFKKVEGNEICIMEYISFEDGILKIHPLVIDDHFEICARNLLAFSM-FISCQENVKN
        L+ L S F+P+ E H   +  P           S ++L  AGI  ++++  E  ++  + F  G +++  + +DD       N +A+    ++C  +   
Subjt:  LEFLSSYFVPKVEKHINESWLP----------PSATQLCEAGIVFKKVEGNEICIMEYISFEDGILKIHPLVIDDHFEICARNLLAFSM-FISCQENVKN

Query:  IKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNIT--IQECNYFSHISKDLKQHCDRKWNKWMASLRHNYFNTPWALISFL
               Y   LD L +T KDV  L ++ II N   G D E+A+  N + +++   I +C Y   + +++ ++    W+   A+ +  YFN+PW+ +S L
Subjt:  IKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNIT--IQECNYFSHISKDLKQHCDRKWNKWMASLRHNYFNTPWALISFL

Query:  AATFLIILTVLQTIFTAISTFK
        AA  L++L+V+QTI+T    ++
Subjt:  AATFLIILTVLQTIFTAISTFK

AT3G50120.1 Plant protein of unknown function (DUF247)8.0e-4729.55Show/hide
Query:  CIHRVPKQLLNMNRNAYEPRNISIGPFHHDKQNFSTTEELKLRFFNSYRCRVGLSVKDIVEKARDWERTARWYYSEPINMNSEEFVKMMVLDGCFIVEFM
        CI+RVP  L   +  +Y P+ +S+GP+HH K+   + +  K R  N    R    +K  ++  R+ E  AR  Y  P++++S EF++M+VLDGCF++E  
Subjt:  CIHRVPKQLLNMNRNAYEPRNISIGPFHHDKQNFSTTEELKLRFFNSYRCRVGLSVKDIVEKARDWERTARWYYSEPINMNSEEFVKMMVLDGCFIVEFM

Query:  ------IMKCPYLFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNLFDLIPSKSNDTSFRSVVHSFLSECTLPIFR-----GEIRL--------
                +  Y   +    +  S + +       +++MLENQLP FVL  L +L     N T   + +     +  +P        G+ +L        
Subjt:  ------IMKCPYLFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNLFDLIPSKSNDTSFRSVVHSFLSECTLPIFR-----GEIRL--------

Query:  -CDNILNQSPRHLLEFLSSYFV---PKVEKHI-NESW-------------LPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGILKIHPLVIDDHFEIC
          D   +    H L+      +   PK E  +  + W             L    T+L EAGI F++ + +    M+   F++G L+I  L+I D  +  
Subjt:  -CDNILNQSPRHLLEFLSSYFV---PKVEKHI-NESW-------------LPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGILKIHPLVIDDHFEIC

Query:  ARNLLAFSMFISCQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQ-ECNYFSHISKDLKQHCDRKWNKWM
          NL+AF     C  +  N   +   Y  F+D LI + +DV+ L   GII + + G+D E+A LFN +C+ +    E +Y S +S ++ ++ D KWN W 
Subjt:  ARNLLAFSMFISCQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQ-ECNYFSHISKDLKQHCDRKWNKWM

Query:  ASLRHNYFNTPWALISFLAATFLIILTVLQTIFTAISTFK
        A+L+H YFN PWA++SF AA  L++LT  Q+ +   + +K
Subjt:  ASLRHNYFNTPWALISFLAATFLIILTVLQTIFTAISTFK

AT3G50150.1 Plant protein of unknown function (DUF247)5.2e-4631.07Show/hide
Query:  CIHRVPKQLLNMNRNAYEPRNISIGPFHHDKQNFSTTEELKLRFFNSYRCRVGLSVKDIVEKARDWERTARWYYSEPINM-NSEEFVKMMVLDGCFIVEF
        CI+RVP  L   ++ +Y P+ +SIGP+HH K +    E  K R  N    R   +++  ++  ++ E  AR  Y  PI+M NS EF +M+VLDGCF++E 
Subjt:  CIHRVPKQLLNMNRNAYEPRNISIGPFHHDKQNFSTTEELKLRFFNSYRCRVGLSVKDIVEKARDWERTARWYYSEPINM-NSEEFVKMMVLDGCFIVEF

Query:  M---IMKCPYLFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNLFDLIPSKSNDTSFRSVVHSFLSECTLP----IFRGEIRL-----CDNILN
            I     +     + V +   + +  ++  ++IMLENQLP FVL  L  L     N T   + V     +  +P    + + E  L      D + +
Subjt:  M---IMKCPYLFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNLFDLIPSKSNDTSFRSVVHSFLSECTLP----IFRGEIRL-----CDNILN

Query:  QSPRHLLEFLSSYFVPKVEKH------------INESWLPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGILKIHPLVIDDHFEICARNLLAFSMFIS
            H L+      +   E                +  L    T+L  AG+ F + E  ++  +E   F++G LKI  L+I D  +    NL+AF     
Subjt:  QSPRHLLEFLSSYFVPKVEKH------------INESWLPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGILKIHPLVIDDHFEICARNLLAFSMFIS

Query:  CQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQ-ECNYFSHISKDLKQHCDRKWNKWMASLRHNYFNTPW
        C     N   N   Y  F+D LI++ +DV+ L  +GII + + G+D E+A LFN +CK +    +  Y S +S+++ ++  RKWN   A+LR  YFN PW
Subjt:  CQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQ-ECNYFSHISKDLKQHCDRKWNKWMASLRHNYFNTPW

Query:  ALISFLAATFLIILTVLQTIFTAISTFK
        A  SF AA  L+ LT  Q+ F   + +K
Subjt:  ALISFLAATFLIILTVLQTIFTAISTFK

AT3G50170.1 Plant protein of unknown function (DUF247)2.8e-4430.89Show/hide
Query:  CIHRVPKQLLNMNRNAYEPRNISIGPFHHDKQNFSTTEELKLRFFNSYRCRVGLSVKDIVEKARDWERTARWYYSEPINMNSEEFVKMMVLDGCFIVEFM
        CI+RVP  L   ++ +Y P+ +S+GP+HH K+     E  K R  N    R+   ++      R+ E  AR  Y  PI+++  EF +M+VLDGCF++E +
Subjt:  CIHRVPKQLLNMNRNAYEPRNISIGPFHHDKQNFSTTEELKLRFFNSYRCRVGLSVKDIVEKARDWERTARWYYSEPINMNSEEFVKMMVLDGCFIVEFM

Query:  IMKCPYLFPENENKVHSSFYKVIELNMSV--ELIMLENQLPFFVLQNLFDLIPSKSNDTSFRSVVHSFLSECTLPI------------------------
               F E     +   + +  L  S+  ++IMLENQLP FVL  L +L     N T   + V     +  +P                         
Subjt:  IMKCPYLFPENENKVHSSFYKVIELNMSV--ELIMLENQLPFFVLQNLFDLIPSKSNDTSFRSVVHSFLSECTLPI------------------------

Query:  FRGEIRLCD----NILNQSP----RHLLEFLSSYFVPKVEKHINESWLPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGILKIHPLVIDDHFEICARN
         +GE+   D    ++L  SP    R LL+ L+      V+K   +  L    T+L EAG+ F+K + +    +E   F++G L+I  L+I D  +    N
Subjt:  FRGEIRLCD----NILNQSP----RHLLEFLSSYFVPKVEKHINESWLPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGILKIHPLVIDDHFEICARN

Query:  LLAFSMFISCQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQ-ECNYFSHISKDLKQHCDRKWNKWMASL
        L+AF     C     N   +   Y  F+D LI++ +DV+ L   GII + + G+D E+A LFN +C+ +    + ++ S +S D+ ++ +RKWN   A+L
Subjt:  LLAFSMFISCQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQ-ECNYFSHISKDLKQHCDRKWNKWMASL

Query:  RHNYFNTPWALISFLAATFLIILTVLQTIFTAISTFK
         H YFN PWA  SF AA  L++LT+ Q+ +   + +K
Subjt:  RHNYFNTPWALISFLAATFLIILTVLQTIFTAISTFK

AT4G31980.1 unknown protein1.5e-6133.18Show/hide
Query:  DHVILCIEENLQKMAPFVPECCIHRVPKQLLNMNRNAYEPRNISIGPFHHDKQNFSTTEELKLRFFNSYRCRVGLSVKDIVEKARDWERTARWYYSEPIN
        D ++  I+  L  ++    +CCI++VP +L  +N +AY PR +S GP H  K+     E+ K R+  S+  R   S++D+V  AR WE+ AR  Y+E + 
Subjt:  DHVILCIEENLQKMAPFVPECCIHRVPKQLLNMNRNAYEPRNISIGPFHHDKQNFSTTEELKLRFFNSYRCRVGLSVKDIVEKARDWERTARWYYSEPIN

Query:  MNSEEFVKMMVLDGCFIVEFMIMK-CPYLFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNLFDLIPSKSNDTSFRSVVHSFLSECTLPIFRGE
        ++S+EFV+M+V+DG F+VE ++    P L  EN+    +S   ++  ++  ++I++ENQLPFFV++ +F L+ +     +  S++   L++     F   
Subjt:  MNSEEFVKMMVLDGCFIVEFMIMK-CPYLFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNLFDLIPSKSNDTSFRSVVHSFLSECTLPIFRGE

Query:  IRLCDNILNQSPRHLLEFLSSYFVP----KVEKHINESWLPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGILKIHPLVIDDHFEICARNLLAFSMFI
         R+ D      P H ++ L S ++P    K+E    +    P AT+L  AG+ FK  E +  C+++ ISF DG+LKI  +V+DD  E   +N++ F    
Subjt:  IRLCDNILNQSPRHLLEFLSSYFVP----KVEKHINESWLPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGILKIHPLVIDDHFEICARNLLAFSMFI

Query:  SCQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQECNYFSHISKDLKQHCDRKWNKWMASLRHNYFNTPW
           E  +   +N + Y   L   I +  D +LL+  GII+N + GN  +++ LFN I K +      YFS +S++L+ +C+  WN+W A LR +YF+ PW
Subjt:  SCQENVKNIKENAIHYFQFLDELISTEKDVNLLVEEGIILNGIGGNDKEIAQLFNDICKNITIQECNYFSHISKDLKQHCDRKWNKWMASLRHNYFNTPW

Query:  ALISFLAATFLIILTVLQTIFTAIS
        A+ S  AA  L++LT +Q++ + ++
Subjt:  ALISFLAATFLIILTVLQTIFTAIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCACGTCATACTATGCATCGAAGAAAATCTACAGAAAATGGCTCCATTTGTTCCGGAATGTTGCATCCATCGAGTTCCGAAACAACTGCTAAACATGAATCGTAA
TGCATATGAACCTAGAAACATTTCAATTGGCCCTTTTCATCATGATAAACAAAATTTCAGTACTACAGAAGAGCTCAAGCTTCGTTTTTTTAATAGTTATCGATGTCGCG
TGGGCCTAAGTGTAAAGGACATTGTGGAAAAGGCTCGAGATTGGGAGAGAACAGCTCGTTGGTACTACTCAGAACCTATAAACATGAACAGCGAAGAGTTTGTGAAAATG
ATGGTTTTAGATGGTTGTTTCATAGTGGAGTTCATGATTATGAAGTGCCCGTACTTATTTCCTGAAAATGAAAACAAGGTACATTCTTCCTTTTACAAAGTTATAGAATT
GAATATGAGTGTGGAGTTGATAATGCTTGAGAATCAACTTCCTTTTTTTGTCCTTCAAAACCTATTTGACCTTATTCCATCCAAATCAAATGACACATCTTTTAGATCCG
TTGTACACTCATTTCTCAGTGAGTGTACATTGCCAATATTTAGGGGTGAGATCCGACTTTGTGATAATATCTTGAATCAAAGTCCGCGCCATTTATTGGAATTCTTAAGC
TCTTATTTTGTCCCCAAGGTTGAGAAACATATTAATGAAAGCTGGCTACCTCCAAGTGCAACCCAGCTCTGCGAGGCTGGTATTGTCTTTAAGAAAGTAGAAGGAAATGA
AATATGTATTATGGAGTACATAAGTTTTGAAGATGGGATTTTGAAGATTCACCCTTTAGTAATTGATGATCACTTTGAAATATGTGCAAGAAACTTATTGGCATTTTCCA
TGTTTATAAGTTGTCAAGAAAATGTTAAGAATATCAAGGAGAATGCCATTCATTACTTTCAGTTTCTAGATGAGCTGATAAGTACTGAGAAAGATGTGAACTTACTTGTC
GAGGAAGGAATCATATTAAATGGTATTGGCGGCAACGACAAAGAAATTGCGCAACTGTTTAATGATATTTGTAAGAACATCACGATACAAGAGTGTAATTACTTCAGTCA
TATTTCAAAGGATTTGAAGCAACATTGTGATAGAAAATGGAACAAGTGGATGGCTTCATTGAGACACAACTATTTTAACACACCATGGGCTCTTATCTCCTTCTTGGCAG
CTACCTTCCTTATTATACTAACTGTCCTACAAACCATATTTACTGCTATATCCACTTTCAAGTAA
mRNA sequenceShow/hide mRNA sequence
TTATTATATATCTTATTTTAATTATTACCCTTTTTTTTCCGTTCGCCATAGCATTTGGTCTGCAGATCTCCACTTCTCTCCTTTCATCGCCTTGTTCTCCGTTTCTTTCT
ACTCCATTTTAGGCTCTCCACATCACATTTTAAGATAACAACTATTGAGATAGGAGACTCAAACTCATAACCTTTCAATCTGTGGATATATATTGATGTCACTGAGCTAT
TGCTTTTATAAGTTTCAATGGATCACGTCATACTATGCATCGAAGAAAATCTACAGAAAATGGCTCCATTTGTTCCGGAATGTTGCATCCATCGAGTTCCGAAACAACTG
CTAAACATGAATCGTAATGCATATGAACCTAGAAACATTTCAATTGGCCCTTTTCATCATGATAAACAAAATTTCAGTACTACAGAAGAGCTCAAGCTTCGTTTTTTTAA
TAGTTATCGATGTCGCGTGGGCCTAAGTGTAAAGGACATTGTGGAAAAGGCTCGAGATTGGGAGAGAACAGCTCGTTGGTACTACTCAGAACCTATAAACATGAACAGCG
AAGAGTTTGTGAAAATGATGGTTTTAGATGGTTGTTTCATAGTGGAGTTCATGATTATGAAGTGCCCGTACTTATTTCCTGAAAATGAAAACAAGGTACATTCTTCCTTT
TACAAAGTTATAGAATTGAATATGAGTGTGGAGTTGATAATGCTTGAGAATCAACTTCCTTTTTTTGTCCTTCAAAACCTATTTGACCTTATTCCATCCAAATCAAATGA
CACATCTTTTAGATCCGTTGTACACTCATTTCTCAGTGAGTGTACATTGCCAATATTTAGGGGTGAGATCCGACTTTGTGATAATATCTTGAATCAAAGTCCGCGCCATT
TATTGGAATTCTTAAGCTCTTATTTTGTCCCCAAGGTTGAGAAACATATTAATGAAAGCTGGCTACCTCCAAGTGCAACCCAGCTCTGCGAGGCTGGTATTGTCTTTAAG
AAAGTAGAAGGAAATGAAATATGTATTATGGAGTACATAAGTTTTGAAGATGGGATTTTGAAGATTCACCCTTTAGTAATTGATGATCACTTTGAAATATGTGCAAGAAA
CTTATTGGCATTTTCCATGTTTATAAGTTGTCAAGAAAATGTTAAGAATATCAAGGAGAATGCCATTCATTACTTTCAGTTTCTAGATGAGCTGATAAGTACTGAGAAAG
ATGTGAACTTACTTGTCGAGGAAGGAATCATATTAAATGGTATTGGCGGCAACGACAAAGAAATTGCGCAACTGTTTAATGATATTTGTAAGAACATCACGATACAAGAG
TGTAATTACTTCAGTCATATTTCAAAGGATTTGAAGCAACATTGTGATAGAAAATGGAACAAGTGGATGGCTTCATTGAGACACAACTATTTTAACACACCATGGGCTCT
TATCTCCTTCTTGGCAGCTACCTTCCTTATTATACTAACTGTCCTACAAACCATATTTACTGCTATATCCACTTTCAAGTAAAGCCCATCCCCATTTCATAAGTCTTGGT
TTTAATTTTGTTTTGTTGCTCTTTGTTGAACATTTAGGGTTTAATTCTGAACTCTTGTAGTTAGGGTGTGCTCTTTTGTGGCGTGTTGCTTCTTTTTAATTATCTATATT
GTAATAATATGTGCTCTTTTATAATTCCTGTGGTAAAAATATTCATTTTTGTTGCTCTTTGTGG
Protein sequenceShow/hide protein sequence
MDHVILCIEENLQKMAPFVPECCIHRVPKQLLNMNRNAYEPRNISIGPFHHDKQNFSTTEELKLRFFNSYRCRVGLSVKDIVEKARDWERTARWYYSEPINMNSEEFVKM
MVLDGCFIVEFMIMKCPYLFPENENKVHSSFYKVIELNMSVELIMLENQLPFFVLQNLFDLIPSKSNDTSFRSVVHSFLSECTLPIFRGEIRLCDNILNQSPRHLLEFLS
SYFVPKVEKHINESWLPPSATQLCEAGIVFKKVEGNEICIMEYISFEDGILKIHPLVIDDHFEICARNLLAFSMFISCQENVKNIKENAIHYFQFLDELISTEKDVNLLV
EEGIILNGIGGNDKEIAQLFNDICKNITIQECNYFSHISKDLKQHCDRKWNKWMASLRHNYFNTPWALISFLAATFLIILTVLQTIFTAISTFK