; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0003333 (gene) of Chayote v1 genome

Gene IDSed0003333
OrganismSechium edule (Chayote v1)
DescriptionPlant protein of unknown function (DUF247)
Genome locationLG08:27199440..27200786
RNA-Seq ExpressionSed0003333
SyntenySed0003333
Gene Ontology termsNA
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8651212.1 hypothetical protein Csa_001883 [Cucumis sativus]2.6e-9550.97Show/hide
Query:  IANSIENKLQQLHQFTEECSIFRVSKRLINAHHTAYQPQAISIGLFHHGQDDLKAMEQLKLRFLASYLHRIKREIRDVVQINMEKKEFVGMMVVDGCFLV
        IA  I+NKLQ L   TEEC I+RVSKRL+N + + Y+PQ ISIG FHHG++ LK MEQ KL+FL                + M   +FV M++VDGCF+V
Subjt:  IANSIENKLQQLHQFTEECSIFRVSKRLINAHHTAYQPQAISIGLFHHGQDDLKAMEQLKLRFLASYLHRIKREIRDVVQINMEKKEFVGMMVVDGCFLV

Query:  EFLLVNSG-QFVLTSKESSLKFRALNTNLYHDLIMLENQLPFFVLQELFSLIIPSIANVSFAY---VIHKSFTNMFMKHR-DLPQGI---SNKNINHLLD
        EFL+ +   Q   TS+   L  +A+N NLYHDLI+LENQLPFFVLQ L   I     + SF     ++H  F   FMKH   +PQ I   + KNI HL+D
Subjt:  EFLLVNSG-QFVLTSKESSLKFRALNTNLYHDLIMLENQLPFFVLQELFSLIIPSIANVSFAY---VIHKSFTNMFMKHR-DLPQGI---SNKNINHLLD

Query:  FLSFYFNPPIIYQSPLPNETTRGGRQNKWLVLPPSATQLCEAGVKFERA---TKEKSMMGITFEAGVLKIPPFDVNDIFEIYLRNLMVFESFKVGSQFPR
        FL FY+       SP   +    G  ++ L LPPS T+L EAGV  E+A       ++MGI+FE GVLKIPPF+++D+FEI +RNL+ FE+F+ GS    
Subjt:  FLSFYFNPPIIYQSPLPNETTRGGRQNKWLVLPPSATQLCEAGVKFERA---TKEKSMMGITFEAGVLKIPPFDVNDIFEIYLRNLMVFESFKVGSQFPR

Query:  YIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEFYYFKDISEALRDHCKRRWNRWMASLKREYFNTPWTLVSFIAASI
          IHY+LFLGALIS EKDSSLL K+GI++NLIGGSD E+S +FN+IGKGV     F Y    S  LR HC  + N+WMA LKR+YFNTPWT+ SFI A I
Subjt:  YIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEFYYFKDISEALRDHCKRRWNRWMASLKREYFNTPWTLVSFIAASI

Query:  FIILTFLQTLFS
        F ++T LQT F+
Subjt:  FIILTFLQTLFS

XP_008443397.1 PREDICTED: LOW QUALITY PROTEIN: UPF0481 protein At3g47200-like [Cucumis melo]2.5e-9849.88Show/hide
Query:  IANSIENKLQQLHQFTEECSIFRVSKRLINAHHTAYQPQAISIGLFHHGQDDLKAMEQLKLRFLASYLHRIKREI---------------------RDVV
        I + I+NKLQ L   TEEC I+RVSKRL+N H T Y+PQ ISIG FHHG++DLK MEQ KL+FL  YL R+ R++                      D V
Subjt:  IANSIENKLQQLHQFTEECSIFRVSKRLINAHHTAYQPQAISIGLFHHGQDDLKAMEQLKLRFLASYLHRIKREI---------------------RDVV

Query:  QINMEKKEFVGMMVVDGCFLVEFLLVNSGQFV---LTSKESSLKFRALNTNLYHDLIMLENQLPFFVLQELFSLII-PSIANVSF---AYVIHKSFTNMF
         I+M   +FV M++VDGCF+VEFL+   G+ +    TS+   L  +A+N NLYHDLIMLENQLPFFV+Q LF  I  P+  +  F     ++H  F   F
Subjt:  QINMEKKEFVGMMVVDGCFLVEFLLVNSGQFV---LTSKESSLKFRALNTNLYHDLIMLENQLPFFVLQELFSLII-PSIANVSF---AYVIHKSFTNMF

Query:  MK-HRDLPQGI---SNKNINHLLDFLSFYFNPPIIYQSPLPNETTRGGRQNKWLVLPPSATQLCEAGVKFERA--TKEKSMMGITFEAGVLKIPPFDVND
        +K HR++P  I    NK+I HLLDFL FY+        P+  +  +G   N+ L LPPS T+L EAGV  E+A  T + ++MG +FE GVLKIPPF+++D
Subjt:  MK-HRDLPQGI---SNKNINHLLDFLSFYFNPPIIYQSPLPNETTRGGRQNKWLVLPPSATQLCEAGVKFERA--TKEKSMMGITFEAGVLKIPPFDVND

Query:  IFEIYLRNLMVFESFKVGSQFPRYIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEFYYFKDISEALRDHCKRRWNRW
        +FEI +RNL+ FE+F+ GS      IHY+ FLGALIS EKDSSLL K+GI++NLIGGSDVE+S +FN+IGKGV     FYY    S  LR HC  R NRW
Subjt:  IFEIYLRNLMVFESFKVGSQFPRYIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEFYYFKDISEALRDHCKRRWNRW

Query:  MASLKREYFNTPWTLVSFIAASIFIILTFLQTL
        MA LKR+Y NTPW +VS +  +I  ++T L+T+
Subjt:  MASLKREYFNTPWTLVSFIAASIFIILTFLQTL

XP_022132066.1 UPF0481 protein At3g47200-like [Momordica charantia]4.1e-8542.86Show/hide
Query:  MNQHNIDPYIDINVDVLRIDIAN-----SIENKLQQLHQFTEECSIFRVSKRLINAHHTAYQPQAISIGLFHHGQDDLKAMEQLKLRFLASYLHRIKREI
        M   +I+ Y D+N  +  +++       S++N L++LH  +EECSI+RVSKRL N +  AY PQAISIG FHHGQ +  AMEQLKLRFL +YL R+   I
Subjt:  MNQHNIDPYIDINVDVLRIDIAN-----SIENKLQQLHQFTEECSIFRVSKRLINAHHTAYQPQAISIGLFHHGQDDLKAMEQLKLRFLASYLHRIKREI

Query:  RDVVQ----------------INMEKKEFVGMMVVDGCFLVEFLLVNSGQFVLTSKESSLK-FRALNTNLYHDLIMLENQLPFFVLQELFSLIIPSIANV
         D  +                I+M+   FV MM+VDG FLVEF+ ++     +T    +   F+A++ ++Y DLI+LENQLPFF+L+ L      S   V
Subjt:  RDVVQ----------------INMEKKEFVGMMVVDGCFLVEFLLVNSGQFVLTSKESSLK-FRALNTNLYHDLIMLENQLPFFVLQELFSLIIPSIANV

Query:  SFAYVIHKSFTNMFMKHRDL-PQGISNKNINHLLDFLSFYFNPPIIYQSPLPNETTRGGRQNKWLVLPPSATQLCEAGVKFERATKEKSM-MGITFEAGV
         F      +F   +   R+L    +  K  NHL+DFLSFY+  P +      N+  +  ++      PP+AT+L EAGV+F++AT++K + M I F+ GV
Subjt:  SFAYVIHKSFTNMFMKHRDL-PQGISNKNINHLLDFLSFYFNPPIIYQSPLPNETTRGGRQNKWLVLPPSATQLCEAGVKFERATKEKSM-MGITFEAGV

Query:  LKIPPFDVNDIFEIYLRNLMVFESFKVGSQFPRYIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEFYYFKDISEALR
        L IP  +++D FE Y+RNL+ +E + +G    R +I YV FL  LISTE+D SLL K GIITN IGG++ ++S LFND+ K ++I  +FYY+ DIS  L 
Subjt:  LKIPPFDVNDIFEIYLRNLMVFESFKVGSQFPRYIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEFYYFKDISEALR

Query:  DHCKRRWNRWMASLKREYFNTPWTLVSFIAASIFIILTFLQTLFSSIS
         +C+  W+R MASL+R+YFNTPW  +SF+AA+  ++LT +Q ++S+IS
Subjt:  DHCKRRWNRWMASLKREYFNTPWTLVSFIAASIFIILTFLQTLFSSIS

XP_022158992.1 UPF0481 protein At3g47200-like isoform X3 [Momordica charantia]9.2e-8546.21Show/hide
Query:  NVDVLRIDIANSIENKLQQLHQFTEECSIFRVSKRLINAHHTAYQPQAISIGLFHHGQDDLKAMEQLKLRFLASYLHRIKREIRDVV-------------
        NVD    ++ +SI+  LQ+L    EEC+I RV +RL+  +  AY PQ ISIG FHHG+ DL  MEQ KLRFL  YL R    I   V             
Subjt:  NVDVLRIDIANSIENKLQQLHQFTEECSIFRVSKRLINAHHTAYQPQAISIGLFHHGQDDLKAMEQLKLRFLASYLHRIKREIRDVV-------------

Query:  ---QINMEKKEFVGMMVVDGCFLVEFLLV--NSGQFVLTSKESSLKFRALNTNLYHDLIMLENQLPFFVLQELFSLIIPSIANVSFAYVIHKSFT-NMFM
            INM+  EFV MM+VDGCF+VE +++    G    T +   L F A+ T+LY DLIMLENQLPFFVLQ LF       A +SF  + H  +T    +
Subjt:  ---QINMEKKEFVGMMVVDGCFLVEFLLV--NSGQFVLTSKESSLKFRALNTNLYHDLIMLENQLPFFVLQELFSLIIPSIANVSFAYVIHKSFT-NMFM

Query:  KHR--DLPQG--ISNKNINHLLDFLSFYFNPPIIYQSPLPNETTRGGRQNKWLVLPPSATQLCEAGVKFERATKEKSMMGITFEAGVLKIPPFDVNDIFE
        K R  +LP G  IS   +NHL+DFLSFY+ P     S   +         K    PP+ T+L EAG+ F++A + K +M I+F+  VL+IPP ++ D+FE
Subjt:  KHR--DLPQG--ISNKNINHLLDFLSFYFNPPIIYQSPLPNETTRGGRQNKWLVLPPSATQLCEAGVKFERATKEKSMMGITFEAGVLKIPPFDVNDIFE

Query:  IYLRNLMVFESFKVGSQFPRYIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEFYYFKDISEALRDHCKRRWNRWMAS
         Y+RNLM FE +       +Y I Y LFL  LIS E+D SLL K  IITN IGG++ E+S LFND+ K V +  +   F  I+EAL +HC  RWN+ MAS
Subjt:  IYLRNLMVFESFKVGSQFPRYIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEFYYFKDISEALRDHCKRRWNRWMAS

Query:  LKREYFNTPWTLVSFIAASIFIILTFLQTLFSSIS
        L+R+YFNTPW  +SF+AA+  I+LTFLQTLFS++S
Subjt:  LKREYFNTPWTLVSFIAASIFIILTFLQTLFSSIS

XP_038904513.1 UPF0481 protein At3g47200-like [Benincasa hispida]4.8e-10251.5Show/hide
Query:  DIANSIENKLQQLHQFTEECSIFRVSKRLINAHHTAYQPQAISIGLFHHGQDDLKAMEQLKLRFLASYLHRIKRE---IRDVVQI---------------
        ++A  I+ +L++L   TEEC I RVSKRL+N H TAY+PQ ISIG FHHG+ DLK MEQ KL+FL  ++ RI R+    +DVV+                
Subjt:  DIANSIENKLQQLHQFTEECSIFRVSKRLINAHHTAYQPQAISIGLFHHGQDDLKAMEQLKLRFLASYLHRIKRE---IRDVVQI---------------

Query:  ---NMEKKEFVGMMVVDGCFLVEFLLVNSG-----QFVLTSKESSLKFRALNTNLYHDLIMLENQLPFFVLQELFSLIIPSIAN-VSFAYVIHKSFTNMF
           NM   +FV MM+VDGCF+VEFL+   G     Q   TS+   L F+A+N NLYHDLIMLENQLPFFVLQ+LF LII    N  +   ++HK F + F
Subjt:  ---NMEKKEFVGMMVVDGCFLVEFLLVNSG-----QFVLTSKESSLKFRALNTNLYHDLIMLENQLPFFVLQELFSLIIPSIAN-VSFAYVIHKSFTNMF

Query:  MKHR-DLPQGISNK-NINHLLDFLSFYFNPPIIYQSPLPNETTRGGRQNKWLVLPPSATQLCEAGVKFERATKEKSMMGITFEAGVLKIPPFDVNDIFEI
        MKH  + PQ    K NI HL+ FL FY++P         N        NK L+LPPS T+L EAGV  E+ + + +++ +TF+ GVLKIPPF+++ +FEI
Subjt:  MKHR-DLPQGISNK-NINHLLDFLSFYFNPPIIYQSPLPNETTRGGRQNKWLVLPPSATQLCEAGVKFERATKEKSMMGITFEAGVLKIPPFDVNDIFEI

Query:  YLRNLMVFESFKVGSQFPRYIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEFYYFKDISEALRDHCKRRWNRWMASL
        Y+RNLM FE+F+  +    Y IHYVLFLGALIS EKDSSLL K+GIITNLIGGSD E+S +FN+IGKGV     FYY +D+S+ L  HCK R NRWMASL
Subjt:  YLRNLMVFESFKVGSQFPRYIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEFYYFKDISEALRDHCKRRWNRWMASL

Query:  KREYFNTPWTLVSFIAASIFIILTFLQTLFSSI
        +R+Y NTPW  +S +AA       FLQT+FS I
Subjt:  KREYFNTPWTLVSFIAASIFIILTFLQTLFSSI

TrEMBL top hitse value%identityAlignment
A0A0A0LC32 Uncharacterized protein3.5e-9850.95Show/hide
Query:  IANSIENKLQQLHQFTEECSIFRVSKRLINAHHTAYQPQAISIGLFHHGQDDLKAMEQLKLRFLASYLHRIK----------REIRDVVQINMEKKEFVG
        IA  I+NKLQ L   TEEC I+RVSKRL+N + + Y+PQ ISIG FHHG++ LK MEQ KL+FL  YL R+           R+  +   I+M   +FV 
Subjt:  IANSIENKLQQLHQFTEECSIFRVSKRLINAHHTAYQPQAISIGLFHHGQDDLKAMEQLKLRFLASYLHRIK----------REIRDVVQINMEKKEFVG

Query:  MMVVDGCFLVEFLLVNSG-QFVLTSKESSLKFRALNTNLYHDLIMLENQLPFFVLQELFSLIIPSIANVSFAY---VIHKSFTNMFMKHR-DLPQGI---
        M++VDGCF+VEFL+ +   Q   TS+   L  +A+N NLYHDLI+LENQLPFFVLQ L   I     + SF     ++H  F   FMKH   +PQ I   
Subjt:  MMVVDGCFLVEFLLVNSG-QFVLTSKESSLKFRALNTNLYHDLIMLENQLPFFVLQELFSLIIPSIANVSFAY---VIHKSFTNMFMKHR-DLPQGI---

Query:  SNKNINHLLDFLSFYFNPPIIYQSPLPNETTRGGRQNKWLVLPPSATQLCEAGVKFERA---TKEKSMMGITFEAGVLKIPPFDVNDIFEIYLRNLMVFE
        + KNI HL+DFL FY+       SP   +    G  ++ L LPPS T+L EAGV  E+A       ++MGI+FE GVLKIPPF+++D+FEI +RNL+ FE
Subjt:  SNKNINHLLDFLSFYFNPPIIYQSPLPNETTRGGRQNKWLVLPPSATQLCEAGVKFERA---TKEKSMMGITFEAGVLKIPPFDVNDIFEIYLRNLMVFE

Query:  SFKVGSQFPRYIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEFYYFKDISEALRDHCKRRWNRWMASLKREYFNTPW
        +F+ GS      IHY+LFLGALIS EKDSSLL K+GI++NLIGGSD E+S +FN+IGKGV     F Y    S  LR HC  + N+WMA LKR+YFNTPW
Subjt:  SFKVGSQFPRYIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEFYYFKDISEALRDHCKRRWNRWMASLKREYFNTPW

Query:  TLVSFIAASIFIILTFLQTLFS
        T+ SFI A IF ++T LQT F+
Subjt:  TLVSFIAASIFIILTFLQTLFS

A0A1S3B8P8 LOW QUALITY PROTEIN: UPF0481 protein At3g47200-like1.2e-9849.88Show/hide
Query:  IANSIENKLQQLHQFTEECSIFRVSKRLINAHHTAYQPQAISIGLFHHGQDDLKAMEQLKLRFLASYLHRIKREI---------------------RDVV
        I + I+NKLQ L   TEEC I+RVSKRL+N H T Y+PQ ISIG FHHG++DLK MEQ KL+FL  YL R+ R++                      D V
Subjt:  IANSIENKLQQLHQFTEECSIFRVSKRLINAHHTAYQPQAISIGLFHHGQDDLKAMEQLKLRFLASYLHRIKREI---------------------RDVV

Query:  QINMEKKEFVGMMVVDGCFLVEFLLVNSGQFV---LTSKESSLKFRALNTNLYHDLIMLENQLPFFVLQELFSLII-PSIANVSF---AYVIHKSFTNMF
         I+M   +FV M++VDGCF+VEFL+   G+ +    TS+   L  +A+N NLYHDLIMLENQLPFFV+Q LF  I  P+  +  F     ++H  F   F
Subjt:  QINMEKKEFVGMMVVDGCFLVEFLLVNSGQFV---LTSKESSLKFRALNTNLYHDLIMLENQLPFFVLQELFSLII-PSIANVSF---AYVIHKSFTNMF

Query:  MK-HRDLPQGI---SNKNINHLLDFLSFYFNPPIIYQSPLPNETTRGGRQNKWLVLPPSATQLCEAGVKFERA--TKEKSMMGITFEAGVLKIPPFDVND
        +K HR++P  I    NK+I HLLDFL FY+        P+  +  +G   N+ L LPPS T+L EAGV  E+A  T + ++MG +FE GVLKIPPF+++D
Subjt:  MK-HRDLPQGI---SNKNINHLLDFLSFYFNPPIIYQSPLPNETTRGGRQNKWLVLPPSATQLCEAGVKFERA--TKEKSMMGITFEAGVLKIPPFDVND

Query:  IFEIYLRNLMVFESFKVGSQFPRYIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEFYYFKDISEALRDHCKRRWNRW
        +FEI +RNL+ FE+F+ GS      IHY+ FLGALIS EKDSSLL K+GI++NLIGGSDVE+S +FN+IGKGV     FYY    S  LR HC  R NRW
Subjt:  IFEIYLRNLMVFESFKVGSQFPRYIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEFYYFKDISEALRDHCKRRWNRW

Query:  MASLKREYFNTPWTLVSFIAASIFIILTFLQTL
        MA LKR+Y NTPW +VS +  +I  ++T L+T+
Subjt:  MASLKREYFNTPWTLVSFIAASIFIILTFLQTL

A0A6J1BR71 UPF0481 protein At3g47200-like2.0e-8542.86Show/hide
Query:  MNQHNIDPYIDINVDVLRIDIAN-----SIENKLQQLHQFTEECSIFRVSKRLINAHHTAYQPQAISIGLFHHGQDDLKAMEQLKLRFLASYLHRIKREI
        M   +I+ Y D+N  +  +++       S++N L++LH  +EECSI+RVSKRL N +  AY PQAISIG FHHGQ +  AMEQLKLRFL +YL R+   I
Subjt:  MNQHNIDPYIDINVDVLRIDIAN-----SIENKLQQLHQFTEECSIFRVSKRLINAHHTAYQPQAISIGLFHHGQDDLKAMEQLKLRFLASYLHRIKREI

Query:  RDVVQ----------------INMEKKEFVGMMVVDGCFLVEFLLVNSGQFVLTSKESSLK-FRALNTNLYHDLIMLENQLPFFVLQELFSLIIPSIANV
         D  +                I+M+   FV MM+VDG FLVEF+ ++     +T    +   F+A++ ++Y DLI+LENQLPFF+L+ L      S   V
Subjt:  RDVVQ----------------INMEKKEFVGMMVVDGCFLVEFLLVNSGQFVLTSKESSLK-FRALNTNLYHDLIMLENQLPFFVLQELFSLIIPSIANV

Query:  SFAYVIHKSFTNMFMKHRDL-PQGISNKNINHLLDFLSFYFNPPIIYQSPLPNETTRGGRQNKWLVLPPSATQLCEAGVKFERATKEKSM-MGITFEAGV
         F      +F   +   R+L    +  K  NHL+DFLSFY+  P +      N+  +  ++      PP+AT+L EAGV+F++AT++K + M I F+ GV
Subjt:  SFAYVIHKSFTNMFMKHRDL-PQGISNKNINHLLDFLSFYFNPPIIYQSPLPNETTRGGRQNKWLVLPPSATQLCEAGVKFERATKEKSM-MGITFEAGV

Query:  LKIPPFDVNDIFEIYLRNLMVFESFKVGSQFPRYIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEFYYFKDISEALR
        L IP  +++D FE Y+RNL+ +E + +G    R +I YV FL  LISTE+D SLL K GIITN IGG++ ++S LFND+ K ++I  +FYY+ DIS  L 
Subjt:  LKIPPFDVNDIFEIYLRNLMVFESFKVGSQFPRYIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEFYYFKDISEALR

Query:  DHCKRRWNRWMASLKREYFNTPWTLVSFIAASIFIILTFLQTLFSSIS
         +C+  W+R MASL+R+YFNTPW  +SF+AA+  ++LT +Q ++S+IS
Subjt:  DHCKRRWNRWMASLKREYFNTPWTLVSFIAASIFIILTFLQTLFSSIS

A0A6J1DXD6 UPF0481 protein At3g47200-like isoform X25.8e-8545.74Show/hide
Query:  NIDPYIDI---NVDVLRIDIANSIENKLQQLHQFTEECSIFRVSKRLINAHHTAYQPQAISIGLFHHGQDDLKAMEQLKLRFLASYLHRIKREIRDVV--
        N  PY ++   NVD    ++ +SI+  LQ+L    EEC+I RV +RL+  +  AY PQ ISIG FHHG+ DL  MEQ KLRFL  YL R    I   V  
Subjt:  NIDPYIDI---NVDVLRIDIANSIENKLQQLHQFTEECSIFRVSKRLINAHHTAYQPQAISIGLFHHGQDDLKAMEQLKLRFLASYLHRIKREIRDVV--

Query:  --------------QINMEKKEFVGMMVVDGCFLVEFLLV--NSGQFVLTSKESSLKFRALNTNLYHDLIMLENQLPFFVLQELFSLIIPSIANVSFAYV
                       INM+  EFV MM+VDGCF+VE +++    G    T +   L F A+ T+LY DLIMLENQLPFFVLQ LF       A +SF  +
Subjt:  --------------QINMEKKEFVGMMVVDGCFLVEFLLV--NSGQFVLTSKESSLKFRALNTNLYHDLIMLENQLPFFVLQELFSLIIPSIANVSFAYV

Query:  IHKSFT-NMFMKHR--DLPQG--ISNKNINHLLDFLSFYFNPPIIYQSPLPNETTRGGRQNKWLVLPPSATQLCEAGVKFERATKEKSMMGITFEAGVLK
         H  +T    +K R  +LP G  IS   +NHL+DFLSFY+ P     S   +         K    PP+ T+L EAG+ F++A + K +M I+F+  VL+
Subjt:  IHKSFT-NMFMKHR--DLPQG--ISNKNINHLLDFLSFYFNPPIIYQSPLPNETTRGGRQNKWLVLPPSATQLCEAGVKFERATKEKSMMGITFEAGVLK

Query:  IPPFDVNDIFEIYLRNLMVFESFKVGSQFPRYIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEFYYFKDISEALRDH
        IPP ++ D+FE Y+RNLM FE +       +Y I Y LFL  LIS E+D SLL K  IITN IGG++ E+S LFND+ K V +  +   F  I+EAL +H
Subjt:  IPPFDVNDIFEIYLRNLMVFESFKVGSQFPRYIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEFYYFKDISEALRDH

Query:  CKRRWNRWMASLKREYFNTPWTLVSFIAASIFIILTFLQTLFSSIS
        C  RWN+ MASL+R+YFNTPW  +SF+AA+  I+LTFLQTLFS++S
Subjt:  CKRRWNRWMASLKREYFNTPWTLVSFIAASIFIILTFLQTLFSSIS

A0A6J1DYL4 UPF0481 protein At3g47200-like isoform X34.4e-8546.21Show/hide
Query:  NVDVLRIDIANSIENKLQQLHQFTEECSIFRVSKRLINAHHTAYQPQAISIGLFHHGQDDLKAMEQLKLRFLASYLHRIKREIRDVV-------------
        NVD    ++ +SI+  LQ+L    EEC+I RV +RL+  +  AY PQ ISIG FHHG+ DL  MEQ KLRFL  YL R    I   V             
Subjt:  NVDVLRIDIANSIENKLQQLHQFTEECSIFRVSKRLINAHHTAYQPQAISIGLFHHGQDDLKAMEQLKLRFLASYLHRIKREIRDVV-------------

Query:  ---QINMEKKEFVGMMVVDGCFLVEFLLV--NSGQFVLTSKESSLKFRALNTNLYHDLIMLENQLPFFVLQELFSLIIPSIANVSFAYVIHKSFT-NMFM
            INM+  EFV MM+VDGCF+VE +++    G    T +   L F A+ T+LY DLIMLENQLPFFVLQ LF       A +SF  + H  +T    +
Subjt:  ---QINMEKKEFVGMMVVDGCFLVEFLLV--NSGQFVLTSKESSLKFRALNTNLYHDLIMLENQLPFFVLQELFSLIIPSIANVSFAYVIHKSFT-NMFM

Query:  KHR--DLPQG--ISNKNINHLLDFLSFYFNPPIIYQSPLPNETTRGGRQNKWLVLPPSATQLCEAGVKFERATKEKSMMGITFEAGVLKIPPFDVNDIFE
        K R  +LP G  IS   +NHL+DFLSFY+ P     S   +         K    PP+ T+L EAG+ F++A + K +M I+F+  VL+IPP ++ D+FE
Subjt:  KHR--DLPQG--ISNKNINHLLDFLSFYFNPPIIYQSPLPNETTRGGRQNKWLVLPPSATQLCEAGVKFERATKEKSMMGITFEAGVLKIPPFDVNDIFE

Query:  IYLRNLMVFESFKVGSQFPRYIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEFYYFKDISEALRDHCKRRWNRWMAS
         Y+RNLM FE +       +Y I Y LFL  LIS E+D SLL K  IITN IGG++ E+S LFND+ K V +  +   F  I+EAL +HC  RWN+ MAS
Subjt:  IYLRNLMVFESFKVGSQFPRYIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEFYYFKDISEALRDHCKRRWNRWMAS

Query:  LKREYFNTPWTLVSFIAASIFIILTFLQTLFSSIS
        L+R+YFNTPW  +SF+AA+  I+LTFLQTLFS++S
Subjt:  LKREYFNTPWTLVSFIAASIFIILTFLQTLFSSIS

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026453.2e-0827.4Show/hide
Query:  IDIANSIENKLQQLHQFTEECSIFRVSKRLINAHHTAYQPQAISIGLFHHGQDDLKAMEQLKL-------------RF--LASYLHRIKREIRDVVQ--I
        I++  S++ +L++        SIF V K L+ +H  +Y P  +SIG +H  + +L  ME+ KL             RF  L   L  ++ +IR      I
Subjt:  IDIANSIENKLQQLHQFTEECSIFRVSKRLINAHHTAYQPQAISIGLFHHGQDDLKAMEQLKL-------------RF--LASYLHRIKREIRDVVQ--I

Query:  NMEKKEFVGMMVVDGCFLVEFLLVNSGQFVLTSKESSLKFRALNTNLYHDLIMLENQLPFFVLQELFSLIIPSIAN-----VSFAYVIHKSFTNMFMK-H
            +  + +M VD  FL+EFL + S +     K  +L  R  +  +  D++M+ENQ+P FVL++     + S  +     +S    + K  + + +K  
Subjt:  NMEKKEFVGMMVVDGCFLVEFLLVNSGQFVLTSKESSLKFRALNTNLYHDLIMLENQLPFFVLQELFSLIIPSIAN-----VSFAYVIHKSFTNMFMK-H

Query:  RDLPQGISNKNINHLLDFL
         D       +  NH+LDFL
Subjt:  RDLPQGISNKNINHLLDFL

Q9SD53 UPF0481 protein At3g472003.4e-3428.44Show/hide
Query:  EECSIFRVSKRLINAHHTAYQPQAISIGLFHHGQDDLKAMEQLKLRFLASYLHRIKREIRDVVQ-------INMEKK-------------EFVGMMVVDG
        E C IFRV +  +  +  AY+P+ +SIG +H+G+  L+ ++Q K R L  +L   K+  +DV +       +++E K             + + MMV+DG
Subjt:  EECSIFRVSKRLINAHHTAYQPQAISIGLFHHGQDDLKAMEQLKLRFLASYLHRIKREIRDVVQ-------INMEKK-------------EFVGMMVVDG

Query:  CFLVEFLLVNSGQFVLTSKESSLKFRALNTNLYHDLIMLENQLPFFVLQELFSLIIPSIANVS--FAYVIHKSFTNMFMKHRDLPQGISNKNINHLLDFL
        CF++   L+ SG   L S++       L +++  DL++LENQ+PFFVLQ L+   + S   VS     +    F N   K     +   N    HLLD +
Subjt:  CFLVEFLLVNSGQFVLTSKESSLKFRALNTNLYHDLIMLENQLPFFVLQELFSLIIPSIANVS--FAYVIHKSFTNMFMKHRDLPQGISNKNINHLLDFL

Query:  SFYFNPPII----YQSPLPNETTRGGR-------QNKWLVLPPSATQLCEAGVKFE-RATKEKSMMGITFEAGVLKIPPFDVNDIFEIYLRNLMVFESFK
           F P         SP        G+        +K + L  SA +L   G+KF  R +KE S++ +  +   L+IP    +     +  N + FE F 
Subjt:  SFYFNPPII----YQSPLPNETTRGGR-------QNKWLVLPPSATQLCEAGVKFE-RATKEKSMMGITFEAGVLKIPPFDVNDIFEIYLRNLMVFESFK

Query:  VGSQFPRYIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEFYYFKDISEALRDHCKRRWNRWMASLKREYFNTPWTLV
          S     I  Y++F+G L++ E+D + L  + +I     GS+ E+S  F  I K V    +  Y  ++ + + ++ K+ +N   A  +  +F +PWT +
Subjt:  VGSQFPRYIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEFYYFKDISEALRDHCKRRWNRWMASLKREYFNTPWTLV

Query:  SFIAASIFIILTFLQTLFSSIS
        S  A    I+LT LQ+  + +S
Subjt:  SFIAASIFIILTFLQTLFSSIS

Arabidopsis top hitse value%identityAlignment
AT2G36430.1 Plant protein of unknown function (DUF247)2.1e-4228.68Show/hide
Query:  CSIFRVSKRLINAHHTAYQPQAISIGLFHHGQDDLKAMEQLKLRFLASYLHRIK-----------REIRDVVQ------INMEKKEFVGMMVVDGCFLVE
        CSIFRV + +I+ +   Y+P+ +SIG +H GQ  LK +E+ K R+L   L R +           + + +V +      I+M+ +EF  MMV+DGCFL+E
Subjt:  CSIFRVSKRLINAHHTAYQPQAISIGLFHHGQDDLKAMEQLKLRFLASYLHRIK-----------REIRDVVQ------INMEKKEFVGMMVVDGCFLVE

Query:  FLLVNSGQFVLTSKESSLKFRALNTNLYHDLIMLENQLPFFVLQELFSLI---IPSIANVSFAYVIHKSFTNMFMKHRDLPQGISNKNINHLLDFLSFYF
             +        +  +    +    Y D + LENQ+PFFVL+ LF+L      +  N S   +    F NM  +  +           HLLD L   F
Subjt:  FLLVNSGQFVLTSKESSLKFRALNTNLYHDLIMLENQLPFFVLQELFSLI---IPSIANVSFAYVIHKSFTNMFMKHRDLPQGISNKNINHLLDFLSFYF

Query:  NPPIIYQSPLPNETTRGGRQNKWLVLPPSATQLCEAGVKFERATKEKSMMGITFEAGVLKIPPFDVNDIFEIYLRNLMVFESFKVGSQFPRYIIHYVLFL
         P     +P     T  G++     +  S ++L  AG+K       +S + + F  G +++P   V+D    +L N + +E   V      +   Y   L
Subjt:  NPPIIYQSPLPNETTRGGRQNKWLVLPPSATQLCEAGVKFERATKEKSMMGITFEAGVLKIPPFDVNDIFEIYLRNLMVFESFKVGSQFPRYIIHYVLFL

Query:  GALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEFYYFKDISEALRDHCKRRWNRWMASLKREYFNTPWTLVSFIAASIFIILTFLQT
          L +T KD   L  + II N   G+D E++   N +G+ V       Y KD+ E + ++ K  W+   A+ K  YFN+PW+ VS +AA + ++L+ +QT
Subjt:  GALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEFYYFKDISEALRDHCKRRWNRWMASLKREYFNTPWTLVSFIAASIFIILTFLQT

Query:  LFSSISSF
        +++   ++
Subjt:  LFSSISSF

AT3G44710.1 Plant protein of unknown function (DUF247)9.0e-3828.63Show/hide
Query:  EECSIFRVSKRLIN-AHHTAYQPQAISIGLFHHGQDDLKAMEQLKLRFLASYLHRIKR-----------------EIRDVVQINM--EKKEFVGMMVVDG
        ++C IF++S +  N  +  AY+P+ +S+G +HHG+ +L+ +E+ KLRFL  ++   KR                 +IRD    ++  + K+ + MMV+DG
Subjt:  EECSIFRVSKRLIN-AHHTAYQPQAISIGLFHHGQDDLKAMEQLKLRFLASYLHRIKR-----------------EIRDVVQINM--EKKEFVGMMVVDG

Query:  CFLVEFLLVNSGQFVLTSKESSLKFRA--LNTNLYHDLIMLENQLPFFVLQELF-------SLIIPSIANVSFAYVIHKSFTNMFMKHR--------DLP
        CF++   LV +G    +  E+   F    +   + +DLI+LENQ+PFF+LQ +F       S  +  I    F Y + KS T  ++KH+        DL 
Subjt:  CFLVEFLLVNSGQFVLTSKESSLKFRA--LNTNLYHDLIMLENQLPFFVLQELF-------SLIIPSIANVSFAYVIHKSFTNMFMKHR--------DLP

Query:  QGI-------SNKNINHLLDFLSF---------------------------YFNPPIIYQSPLPNETTRGGRQNKWLVLPPSATQLCEAGVKFERATKEK
        + I         K  NHLLD +                                  I     L  E +  GR +  +VL  SA +L   G+KF+   K +
Subjt:  QGI-------SNKNINHLLDFLSF---------------------------YFNPPIIYQSPLPNETTRGGRQNKWLVLPPSATQLCEAGVKFERATKEK

Query:  SMMGITFEAGVLKIPPFDVNDIFEIYLRNLMVFESFKVGSQFPRYIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEF
        ++M I  +  +L+IPP  ++D     L N + FE +       +++  YV F+G L+ +E D+  L++ GI+ N  G  D E+S  F  +GK V    + 
Subjt:  SMMGITFEAGVLKIPPFDVNDIFEIYLRNLMVFESFKVGSQFPRYIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEF

Query:  YYFKDISEALRDHCKRRWNRWMASLKREYFNTPWTLVSFIAASIFIILTFLQTLFSSISSF
         Y   I E +  +    W+   A  K  +F++PWT +S  AA   +ILT +Q  F++   F
Subjt:  YYFKDISEALRDHCKRRWNRWMASLKREYFNTPWTLVSFIAASIFIILTFLQTLFSSISSF

AT4G31980.1 unknown protein4.7e-5532.23Show/hide
Query:  YFKMNQHNIDPYIDINVDVLRIDIANSIENKLQQLHQFTEECSIFRVSKRLINAHHTAYQPQAISIGLFHHGQDDLKAMEQLKLRFLASYLHRIKREIRD
        Y +MNQ+  D  +D            SI+ KL  L   + +C I++V  +L   +  AY P+ +S G  H G+++L+AME  K R+L S++ R    + D
Subjt:  YFKMNQHNIDPYIDINVDVLRIDIANSIENKLQQLHQFTEECSIFRVSKRLINAHHTAYQPQAISIGLFHHGQDDLKAMEQLKLRFLASYLHRIKREIRD

Query:  VVQ----------------INMEKKEFVGMMVVDGCFLVEFLLVNSGQFVLTSKESSLKFRALNTNLYHDLIMLENQLPFFVLQELFSLII-------PS
        +V+                + +   EFV M+VVDG FLVE LL +    +    +       + T++  D+I++ENQLPFFV++E+F L++       PS
Subjt:  VVQ----------------INMEKKEFVGMMVVDGCFLVEFLLVNSGQFVLTSKESSLKFRALNTNLYHDLIMLENQLPFFVLQELFSLII-------PS

Query:  IANVS---FAYVIHKSFTNMFMKHRDLPQGISNKNINHLLDFLSFYFNPPIIYQSPLPNETTRGGRQNKWLVLPPSATQLCEAGVKFERATKEKSMMGIT
        I  ++   F+Y + +     F+   +           H +D L   + P    Q P+  E T     N      P AT+L  AGV+F+ A     ++ I+
Subjt:  IANVS---FAYVIHKSFTNMFMKHRDLPQGISNKNINHLLDFLSFYFNPPIIYQSPLPNETTRGGRQNKWLVLPPSATQLCEAGVKFERATKEKSMMGIT

Query:  FEAGVLKIPPFDVNDIFEIYLRNLMVFESFKVGSQFPRYIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEFYYFKDI
        F  GVLKIP   V+D+ E   +N++ FE  +  +   +  + Y++ LG  I +  D+ LL   GII N +G S V++S LFN I K V I++  +YF  +
Subjt:  FEAGVLKIPPFDVNDIFEIYLRNLMVFESFKVGSQFPRYIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEFYYFKDI

Query:  SEALRDHCKRRWNRWMASLKREYFNTPWTLVSFIAASIFIILTFLQTLFSSIS
        SE L+ +C   WNRW A L+R+YF+ PW + S  AA + ++LTF+Q++ S ++
Subjt:  SEALRDHCKRRWNRWMASLKREYFNTPWTLVSFIAASIFIILTFLQTLFSSIS

AT5G11290.1 Plant protein of unknown function (DUF247)3.9e-4130.11Show/hide
Query:  MEQLKLRFLASYLHRIKREIRDVVQ----------------INMEKKEFVGMMVVDGCFLVEFLLVNSGQFVLTSKESSLKFRALNTNLYHDLIMLENQL
        ME  KLR+L S++ R    + D+V+                + +   E+V M++VD  FLVE LL +         +     + +  ++ HD+++LENQL
Subjt:  MEQLKLRFLASYLHRIKREIRDVVQ----------------INMEKKEFVGMMVVDGCFLVEFLLVNSGQFVLTSKESSLKFRALNTNLYHDLIMLENQL

Query:  PFFVLQELFSLI-------IPSIANVSFAYVIHKSFTNMFMKHRDLPQGISNKNINHLLDFLSFYFNPPIIYQSPLPNETTRGGRQNKWLVLPPSATQLC
        P+FV++ +F L+       +P +       +IH  F   +M      + IS+  I H +D L    + P++   P        G   + +    SA ++ 
Subjt:  PFFVLQELFSLI-------IPSIANVSFAYVIHKSFTNMFMKHRDLPQGISNKNINHLLDFLSFYFNPPIIYQSPLPNETTRGGRQNKWLVLPPSATQLC

Query:  EAGVKFERATKEKSMMGITFEAGVLKIPPFDVNDIFEIYLRNLMVFESFKVGSQFPRYIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILF
         AGVK + A      + I+F  GVL IP   +NDI E   RN+++FE      +   Y IHY+ FL   I +  D+ L    GII N  G ++ ++S LF
Subjt:  EAGVKFERATKEKSMMGITFEAGVLKIPPFDVNDIFEIYLRNLMVFESFKVGSQFPRYIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILF

Query:  NDIGKGVDIHEEFYYFKDISEALRDHCKRRWNRWMASLKREYFNTPWTLVSFIAASIFIILTFLQTLFSSIS
        N I K  +     +Y+K +   L+ HC   WN+W A+L+R+YF+ PW+  S +AA + ++LTF+Q + S ++
Subjt:  NDIGKGVDIHEEFYYFKDISEALRDHCKRRWNRWMASLKREYFNTPWTLVSFIAASIFIILTFLQTLFSSIS

AT5G22550.2 Plant protein of unknown function (DUF247)4.5e-3727.53Show/hide
Query:  CSIFRVSKRLINAHHTAYQPQAISIGLFHHGQD--DLKAMEQLKLRFLASYLHRIK-----------------REIRDVVQINME--KKEFVGMMVVDGC
        C I+R+   L   +  AY P+ +SIG +HH  D   LK +E+ K R+L  ++ + K                 ++IRD    N+E  +++ + +M++DGC
Subjt:  CSIFRVSKRLINAHHTAYQPQAISIGLFHHGQD--DLKAMEQLKLRFLASYLHRIK-----------------REIRDVVQINME--KKEFVGMMVVDGC

Query:  FLVEFLLVNSGQFVLTS-KESSLKFRALNTNLYHDLIMLENQLPFFVLQELF--SLIIPSIANVSFAYVIHKSFTNMFMKHRDLPQGISNKNINHLLDFL
        F++   LV S +   T+ K+   K R +   L  DL++LENQ+P F+L+ L   S + PS    S   +  K F     K     +  +N    HLLD +
Subjt:  FLVEFLLVNSGQFVLTS-KESSLKFRALNTNLYHDLIMLENQLPFFVLQELF--SLIIPSIANVSFAYVIHKSFTNMFMKHRDLPQGISNKNINHLLDFL

Query:  SFYFNPPIIYQSPLPNETTR--------GGRQ----------------------------------NKWLVLPPSATQLCEAGVKFERATKEKSMMGITF
           F P     +P P+ T R        G R+                                    +L L  SA +L   G+KF R    ++ + I+F
Subjt:  SFYFNPPIIYQSPLPNETTR--------GGRQ----------------------------------NKWLVLPPSATQLCEAGVKFERATKEKSMMGITF

Query:  EAGVLKIPPFDVNDIFEIYLRNLMVFESFKVGSQFPRYIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEFYYFKDIS
        ++G+++IP    +D     L N + FE F +       I  +V+F+G LI+TE D++ L ++GI+ N  G  + E+S+ F +IGK +       +  ++ 
Subjt:  EAGVLKIPPFDVNDIFEIYLRNLMVFESFKVGSQFPRYIIHYVLFLGALISTEKDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEFYYFKDIS

Query:  EALRDHCKRRWNRWMASLKREYFNTPWTLVSFIAASIFIILTFLQTLFSSISSF
        E + ++  + ++   A  K  +FNTPWT +S  AA + ++LT  Q  F++ + F
Subjt:  EALRDHCKRRWNRWMASLKREYFNTPWTLVSFIAASIFIILTFLQTLFSSISSF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTATTTCAAAATGAATCAACACAACATAGATCCTTACATTGACATCAACGTTGATGTACTACGCATTGATATAGCGAATTCTATTGAAAACAAGCTTCAACAACT
TCATCAATTTACCGAAGAATGTAGCATTTTTCGTGTTTCGAAACGGCTTATAAACGCTCACCACACGGCCTATCAACCTCAAGCGATTTCCATTGGCCTTTTTCACCATG
GTCAAGATGATTTGAAGGCGATGGAACAATTAAAACTCCGATTTCTCGCAAGCTATCTACATCGCATAAAACGAGAAATCAGGGACGTCGTTCAAATCAACATGGAGAAA
AAGGAGTTTGTAGGTATGATGGTCGTGGATGGTTGTTTCTTAGTGGAGTTTTTATTAGTTAATTCCGGTCAATTTGTTCTAACTAGCAAGGAAAGTTCTTTAAAATTCCG
AGCCTTGAACACTAATCTATACCATGACTTAATCATGCTTGAGAATCAACTTCCTTTTTTTGTTCTTCAAGAACTTTTTAGCTTAATTATTCCATCCATTGCTAACGTCT
CCTTTGCATATGTTATACACAAGTCTTTTACAAATATGTTCATGAAGCATCGTGATCTTCCTCAAGGTATTTCCAACAAAAATATAAACCACTTGCTCGATTTCTTAAGC
TTTTACTTCAATCCTCCAATTATTTATCAATCGCCTCTCCCAAACGAAACAACCCGTGGTGGGCGTCAAAATAAATGGTTGGTTCTTCCCCCATCTGCAACTCAGCTTTG
TGAGGCCGGAGTCAAATTCGAGAGAGCAACAAAAGAAAAAAGCATGATGGGCATAACCTTTGAAGCGGGTGTTCTGAAGATCCCACCTTTTGATGTTAACGATATCTTCG
AAATTTACTTGCGAAATTTGATGGTGTTCGAGAGTTTCAAGGTCGGGAGTCAATTTCCAAGGTATATAATCCATTATGTTTTGTTTCTAGGAGCGTTAATAAGCACAGAG
AAAGATTCGAGTTTACTTGCAAAGGAAGGAATAATAACCAACCTAATTGGTGGTAGCGATGTAGAAATTTCAATACTTTTTAATGATATAGGTAAAGGTGTGGACATCCA
TGAAGAATTTTATTACTTCAAAGATATAAGCGAAGCTTTACGTGATCATTGTAAGAGACGATGGAATCGATGGATGGCTTCACTCAAACGCGAATATTTCAATACGCCAT
GGACGCTTGTCTCCTTCATTGCTGCCTCTATTTTTATTATCCTCACTTTTCTGCAAACCCTATTTTCTAGTATATCGTCCTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTTATTTCAAAATGAATCAACACAACATAGATCCTTACATTGACATCAACGTTGATGTACTACGCATTGATATAGCGAATTCTATTGAAAACAAGCTTCAACAACT
TCATCAATTTACCGAAGAATGTAGCATTTTTCGTGTTTCGAAACGGCTTATAAACGCTCACCACACGGCCTATCAACCTCAAGCGATTTCCATTGGCCTTTTTCACCATG
GTCAAGATGATTTGAAGGCGATGGAACAATTAAAACTCCGATTTCTCGCAAGCTATCTACATCGCATAAAACGAGAAATCAGGGACGTCGTTCAAATCAACATGGAGAAA
AAGGAGTTTGTAGGTATGATGGTCGTGGATGGTTGTTTCTTAGTGGAGTTTTTATTAGTTAATTCCGGTCAATTTGTTCTAACTAGCAAGGAAAGTTCTTTAAAATTCCG
AGCCTTGAACACTAATCTATACCATGACTTAATCATGCTTGAGAATCAACTTCCTTTTTTTGTTCTTCAAGAACTTTTTAGCTTAATTATTCCATCCATTGCTAACGTCT
CCTTTGCATATGTTATACACAAGTCTTTTACAAATATGTTCATGAAGCATCGTGATCTTCCTCAAGGTATTTCCAACAAAAATATAAACCACTTGCTCGATTTCTTAAGC
TTTTACTTCAATCCTCCAATTATTTATCAATCGCCTCTCCCAAACGAAACAACCCGTGGTGGGCGTCAAAATAAATGGTTGGTTCTTCCCCCATCTGCAACTCAGCTTTG
TGAGGCCGGAGTCAAATTCGAGAGAGCAACAAAAGAAAAAAGCATGATGGGCATAACCTTTGAAGCGGGTGTTCTGAAGATCCCACCTTTTGATGTTAACGATATCTTCG
AAATTTACTTGCGAAATTTGATGGTGTTCGAGAGTTTCAAGGTCGGGAGTCAATTTCCAAGGTATATAATCCATTATGTTTTGTTTCTAGGAGCGTTAATAAGCACAGAG
AAAGATTCGAGTTTACTTGCAAAGGAAGGAATAATAACCAACCTAATTGGTGGTAGCGATGTAGAAATTTCAATACTTTTTAATGATATAGGTAAAGGTGTGGACATCCA
TGAAGAATTTTATTACTTCAAAGATATAAGCGAAGCTTTACGTGATCATTGTAAGAGACGATGGAATCGATGGATGGCTTCACTCAAACGCGAATATTTCAATACGCCAT
GGACGCTTGTCTCCTTCATTGCTGCCTCTATTTTTATTATCCTCACTTTTCTGCAAACCCTATTTTCTAGTATATCGTCCTTTTGA
Protein sequenceShow/hide protein sequence
MVYFKMNQHNIDPYIDINVDVLRIDIANSIENKLQQLHQFTEECSIFRVSKRLINAHHTAYQPQAISIGLFHHGQDDLKAMEQLKLRFLASYLHRIKREIRDVVQINMEK
KEFVGMMVVDGCFLVEFLLVNSGQFVLTSKESSLKFRALNTNLYHDLIMLENQLPFFVLQELFSLIIPSIANVSFAYVIHKSFTNMFMKHRDLPQGISNKNINHLLDFLS
FYFNPPIIYQSPLPNETTRGGRQNKWLVLPPSATQLCEAGVKFERATKEKSMMGITFEAGVLKIPPFDVNDIFEIYLRNLMVFESFKVGSQFPRYIIHYVLFLGALISTE
KDSSLLAKEGIITNLIGGSDVEISILFNDIGKGVDIHEEFYYFKDISEALRDHCKRRWNRWMASLKREYFNTPWTLVSFIAASIFIILTFLQTLFSSISSF