; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg007312 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg007312
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionPlant protein of unknown function (DUF247)
Genome locationscaffold7:30285121..30290091
RNA-Seq ExpressionSpg007312
SyntenySpg007312
Gene Ontology termsNA
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022131632.1 UPF0481 protein At3g47200-like [Momordica charantia]2.7e-7545.13Show/hide
Query:  DEVKRQSRDEI-TVFIEGKLQGVPRTVPTYSLYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSYQRRVNKSIQDIVEKTRSWETIA
        +EV+    + I  V IE +L+ +       S+YRV KRL +I+  A+ P+ ISIGPFHH ++ L   E+LK RFL  Y RRV   ++   +  R WET A
Subjt:  DEVKRQSRDEI-TVFIEGKLQGVPRTVPTYSLYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSYQRRVNKSIQDIVEKTRSWETIA

Query:  RKCYSEPINMNIDDFVGMMVLDGCFIVEFMIINYATSG----------------PNGPADQ-----KLQRWHKTQHFVPEKSENCPRDRCLPPNATALRE
        RKCY+EPI+M  +DFV MM++DGCF+VEF I+ Y  SG                P  PA +      L  ++     V     N  +    PP AT L+E
Subjt:  RKCYSEPINMNIDDFVGMMVLDGCFIVEFMIINYATSG----------------PNGPADQ-----KLQRWHKTQHFVPEKSENCPRDRCLPPNATALRE

Query:  AGVVFKKAKRDERHIMDIRFKDGVLEIPSFGIEDDFETYVRNLMAFEQFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLV
        AGV F+KA  D +HI DI F+DGVL+IP F I   FET VRNL+AFE +             H   D+K            YF FLD+LIS+EKDV LLV
Subjt:  AGVVFKKAKRDERHIMDIRFKDGVLEIPSFGIEDDFETYVRNLMAFEQFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLV

Query:  KEGIIINSIGGSDEEISKLFNDLCKYITV-DEFNHFSNISEDLKKHCEKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQTSFRA
        K GII N+IGG+ E+I+KLFND+ KY  +   F ++S IS DL+K+C+  W RW ASL+ +YFN+PW  ISFLAATF+ILLT++QT + A
Subjt:  KEGIIINSIGGSDEEISKLFNDLCKYITV-DEFNHFSNISEDLKKHCEKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQTSFRA

XP_022132066.1 UPF0481 protein At3g47200-like [Momordica charantia]4.2e-7640.88Show/hide
Query:  MDCGHVEAY---HNIDEVKRQSRDEITVFIEGKLQGVPRTVPTYSLYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSYQRRVNKSI
        M+  H+E Y     IDEV+ + +  +T+ ++  L+ +       S+YRV KRL +IN +AY P+ ISIGPFHH Q+     E+LK RFL +Y RRV   I
Subjt:  MDCGHVEAY---HNIDEVKRQSRDEITVFIEGKLQGVPRTVPTYSLYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSYQRRVNKSI

Query:  QDIVEKTRSWETIARKCYSEPINMNIDDFVGMMVLDGCFIVEFMIINY---------------------------------------------ATSGPNG
        +D  E  + WET ARKCY+E I+M  D+FV MM++DG F+VEF+ ++Y                                             ++S P  
Subjt:  QDIVEKTRSWETIARKCYSEPINMNIDDFVGMMVLDGCFIVEFMIINY---------------------------------------------ATSGPNG

Query:  PADQKLQRWH--------------KTQHFVP-------------EKSENCPRDRCLPPNATALREAGVVFKKAKRDERHIMDIRFKDGVLEIPSFGIEDD
               RW+              K  H V              +  +     R  PP AT L EAGV F+KA  D+R IMDIRFKDGVL IP   I D 
Subjt:  PADQKLQRWH--------------KTQHFVP-------------EKSENCPRDRCLPPNATALREAGVVFKKAKRDERHIMDIRFKDGVLEIPSFGIEDD

Query:  FETYVRNLMAFEQFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLVKEGIIINSIGGSDEEISKLFNDLCKYITVD-EFNH
        FETYVRNL+A+E + + ++E                     R    Y  FLDELIS+E+DVSLLVK GII N+IGG++E++SKLFNDLCK I +  +F +
Subjt:  FETYVRNLMAFEQFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLVKEGIIINSIGGSDEEISKLFNDLCKYITVD-EFNH

Query:  FSNISEDLKKHCEKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQTSFRA
        +++IS DL K+CE  W R MASLR +YFNTPWAFISFLAATF++LLT +Q  + A
Subjt:  FSNISEDLKKHCEKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQTSFRA

XP_022158989.1 UPF0481 protein At3g47200-like isoform X1 [Momordica charantia]1.6e-7242.44Show/hide
Query:  DEITVFIEGKLQGVPRTVPTYSLYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSYQRRVNKSIQDIVEKTRSWETIARKCYSEPIN
        DE+   I+  LQ +P      +++RVP+RLL  N  AY+P++ISIGPFHH +Q L   E+ K RFL  Y RR N  I+  V   RSWET AR CY+EPIN
Subjt:  DEITVFIEGKLQGVPRTVPTYSLYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSYQRRVNKSIQDIVEKTRSWETIARKCYSEPIN

Query:  MNIDDFVGMMVLDGCFIVEFMII------------------------------------------------------------NYATSGP-NGPADQKLQ
        M+ D+FV MM++DGCFIVE M++                                                             + T GP   P   +L 
Subjt:  MNIDDFVGMMVLDGCFIVEFMII------------------------------------------------------------NYATSGP-NGPADQKLQ

Query:  R-----WHKTQHFV----------------PEKSENCPRDRC-LPPNATALREAGVVFKKAKRDERHIMDIRFKDGVLEIPSFGIEDDFETYVRNLMAFE
               HK  H V                   S    R +C  PP  T L EAG+VFKKA R  +HIMDI FKD VL+IP   I D FETYVRNLMAFE
Subjt:  R-----WHKTQHFV----------------PEKSENCPRDRC-LPPNATALREAGVVFKKAKRDERHIMDIRFKDGVLEIPSFGIEDDFETYVRNLMAFE

Query:  QFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLVKEGIIINSIGGSDEEISKLFNDLCKYITV-DEFNHFSNISEDLKKHC
        Q+   +N+ K                    YA  YF FL+ LIS E+DVSLLVK  II N IGG+++E+S LFNDLCK + V  + N F++I+E L +HC
Subjt:  QFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLVKEGIIINSIGGSDEEISKLFNDLCKYITV-DEFNHFSNISEDLKKHC

Query:  EKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQTSFRA
          RW + MASLR +YFNTPWAFISF+AA F+ILLT LQT F A
Subjt:  EKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQTSFRA

XP_022158990.1 UPF0481 protein At3g47200-like isoform X2 [Momordica charantia]1.6e-7242.44Show/hide
Query:  DEITVFIEGKLQGVPRTVPTYSLYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSYQRRVNKSIQDIVEKTRSWETIARKCYSEPIN
        DE+   I+  LQ +P      +++RVP+RLL  N  AY+P++ISIGPFHH +Q L   E+ K RFL  Y RR N  I+  V   RSWET AR CY+EPIN
Subjt:  DEITVFIEGKLQGVPRTVPTYSLYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSYQRRVNKSIQDIVEKTRSWETIARKCYSEPIN

Query:  MNIDDFVGMMVLDGCFIVEFMII------------------------------------------------------------NYATSGP-NGPADQKLQ
        M+ D+FV MM++DGCFIVE M++                                                             + T GP   P   +L 
Subjt:  MNIDDFVGMMVLDGCFIVEFMII------------------------------------------------------------NYATSGP-NGPADQKLQ

Query:  R-----WHKTQHFV----------------PEKSENCPRDRC-LPPNATALREAGVVFKKAKRDERHIMDIRFKDGVLEIPSFGIEDDFETYVRNLMAFE
               HK  H V                   S    R +C  PP  T L EAG+VFKKA R  +HIMDI FKD VL+IP   I D FETYVRNLMAFE
Subjt:  R-----WHKTQHFV----------------PEKSENCPRDRC-LPPNATALREAGVVFKKAKRDERHIMDIRFKDGVLEIPSFGIEDDFETYVRNLMAFE

Query:  QFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLVKEGIIINSIGGSDEEISKLFNDLCKYITV-DEFNHFSNISEDLKKHC
        Q+   +N+ K                    YA  YF FL+ LIS E+DVSLLVK  II N IGG+++E+S LFNDLCK + V  + N F++I+E L +HC
Subjt:  QFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLVKEGIIINSIGGSDEEISKLFNDLCKYITV-DEFNHFSNISEDLKKHC

Query:  EKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQTSFRA
          RW + MASLR +YFNTPWAFISF+AA F+ILLT LQT F A
Subjt:  EKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQTSFRA

XP_022158992.1 UPF0481 protein At3g47200-like isoform X3 [Momordica charantia]1.6e-7242.44Show/hide
Query:  DEITVFIEGKLQGVPRTVPTYSLYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSYQRRVNKSIQDIVEKTRSWETIARKCYSEPIN
        DE+   I+  LQ +P      +++RVP+RLL  N  AY+P++ISIGPFHH +Q L   E+ K RFL  Y RR N  I+  V   RSWET AR CY+EPIN
Subjt:  DEITVFIEGKLQGVPRTVPTYSLYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSYQRRVNKSIQDIVEKTRSWETIARKCYSEPIN

Query:  MNIDDFVGMMVLDGCFIVEFMII------------------------------------------------------------NYATSGP-NGPADQKLQ
        M+ D+FV MM++DGCFIVE M++                                                             + T GP   P   +L 
Subjt:  MNIDDFVGMMVLDGCFIVEFMII------------------------------------------------------------NYATSGP-NGPADQKLQ

Query:  R-----WHKTQHFV----------------PEKSENCPRDRC-LPPNATALREAGVVFKKAKRDERHIMDIRFKDGVLEIPSFGIEDDFETYVRNLMAFE
               HK  H V                   S    R +C  PP  T L EAG+VFKKA R  +HIMDI FKD VL+IP   I D FETYVRNLMAFE
Subjt:  R-----WHKTQHFV----------------PEKSENCPRDRC-LPPNATALREAGVVFKKAKRDERHIMDIRFKDGVLEIPSFGIEDDFETYVRNLMAFE

Query:  QFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLVKEGIIINSIGGSDEEISKLFNDLCKYITV-DEFNHFSNISEDLKKHC
        Q+   +N+ K                    YA  YF FL+ LIS E+DVSLLVK  II N IGG+++E+S LFNDLCK + V  + N F++I+E L +HC
Subjt:  QFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLVKEGIIINSIGGSDEEISKLFNDLCKYITV-DEFNHFSNISEDLKKHC

Query:  EKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQTSFRA
          RW + MASLR +YFNTPWAFISF+AA F+ILLT LQT F A
Subjt:  EKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQTSFRA

TrEMBL top hitse value%identityAlignment
A0A6J1BR71 UPF0481 protein At3g47200-like2.0e-7640.88Show/hide
Query:  MDCGHVEAY---HNIDEVKRQSRDEITVFIEGKLQGVPRTVPTYSLYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSYQRRVNKSI
        M+  H+E Y     IDEV+ + +  +T+ ++  L+ +       S+YRV KRL +IN +AY P+ ISIGPFHH Q+     E+LK RFL +Y RRV   I
Subjt:  MDCGHVEAY---HNIDEVKRQSRDEITVFIEGKLQGVPRTVPTYSLYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSYQRRVNKSI

Query:  QDIVEKTRSWETIARKCYSEPINMNIDDFVGMMVLDGCFIVEFMIINY---------------------------------------------ATSGPNG
        +D  E  + WET ARKCY+E I+M  D+FV MM++DG F+VEF+ ++Y                                             ++S P  
Subjt:  QDIVEKTRSWETIARKCYSEPINMNIDDFVGMMVLDGCFIVEFMIINY---------------------------------------------ATSGPNG

Query:  PADQKLQRWH--------------KTQHFVP-------------EKSENCPRDRCLPPNATALREAGVVFKKAKRDERHIMDIRFKDGVLEIPSFGIEDD
               RW+              K  H V              +  +     R  PP AT L EAGV F+KA  D+R IMDIRFKDGVL IP   I D 
Subjt:  PADQKLQRWH--------------KTQHFVP-------------EKSENCPRDRCLPPNATALREAGVVFKKAKRDERHIMDIRFKDGVLEIPSFGIEDD

Query:  FETYVRNLMAFEQFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLVKEGIIINSIGGSDEEISKLFNDLCKYITVD-EFNH
        FETYVRNL+A+E + + ++E                     R    Y  FLDELIS+E+DVSLLVK GII N+IGG++E++SKLFNDLCK I +  +F +
Subjt:  FETYVRNLMAFEQFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLVKEGIIINSIGGSDEEISKLFNDLCKYITVD-EFNH

Query:  FSNISEDLKKHCEKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQTSFRA
        +++IS DL K+CE  W R MASLR +YFNTPWAFISFLAATF++LLT +Q  + A
Subjt:  FSNISEDLKKHCEKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQTSFRA

A0A6J1BRK2 UPF0481 protein At3g47200-like1.3e-7545.13Show/hide
Query:  DEVKRQSRDEI-TVFIEGKLQGVPRTVPTYSLYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSYQRRVNKSIQDIVEKTRSWETIA
        +EV+    + I  V IE +L+ +       S+YRV KRL +I+  A+ P+ ISIGPFHH ++ L   E+LK RFL  Y RRV   ++   +  R WET A
Subjt:  DEVKRQSRDEI-TVFIEGKLQGVPRTVPTYSLYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSYQRRVNKSIQDIVEKTRSWETIA

Query:  RKCYSEPINMNIDDFVGMMVLDGCFIVEFMIINYATSG----------------PNGPADQ-----KLQRWHKTQHFVPEKSENCPRDRCLPPNATALRE
        RKCY+EPI+M  +DFV MM++DGCF+VEF I+ Y  SG                P  PA +      L  ++     V     N  +    PP AT L+E
Subjt:  RKCYSEPINMNIDDFVGMMVLDGCFIVEFMIINYATSG----------------PNGPADQ-----KLQRWHKTQHFVPEKSENCPRDRCLPPNATALRE

Query:  AGVVFKKAKRDERHIMDIRFKDGVLEIPSFGIEDDFETYVRNLMAFEQFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLV
        AGV F+KA  D +HI DI F+DGVL+IP F I   FET VRNL+AFE +             H   D+K            YF FLD+LIS+EKDV LLV
Subjt:  AGVVFKKAKRDERHIMDIRFKDGVLEIPSFGIEDDFETYVRNLMAFEQFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLV

Query:  KEGIIINSIGGSDEEISKLFNDLCKYITV-DEFNHFSNISEDLKKHCEKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQTSFRA
        K GII N+IGG+ E+I+KLFND+ KY  +   F ++S IS DL+K+C+  W RW ASL+ +YFN+PW  ISFLAATF+ILLT++QT + A
Subjt:  KEGIIINSIGGSDEEISKLFNDLCKYITV-DEFNHFSNISEDLKKHCEKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQTSFRA

A0A6J1DXD6 UPF0481 protein At3g47200-like isoform X27.9e-7342.44Show/hide
Query:  DEITVFIEGKLQGVPRTVPTYSLYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSYQRRVNKSIQDIVEKTRSWETIARKCYSEPIN
        DE+   I+  LQ +P      +++RVP+RLL  N  AY+P++ISIGPFHH +Q L   E+ K RFL  Y RR N  I+  V   RSWET AR CY+EPIN
Subjt:  DEITVFIEGKLQGVPRTVPTYSLYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSYQRRVNKSIQDIVEKTRSWETIARKCYSEPIN

Query:  MNIDDFVGMMVLDGCFIVEFMII------------------------------------------------------------NYATSGP-NGPADQKLQ
        M+ D+FV MM++DGCFIVE M++                                                             + T GP   P   +L 
Subjt:  MNIDDFVGMMVLDGCFIVEFMII------------------------------------------------------------NYATSGP-NGPADQKLQ

Query:  R-----WHKTQHFV----------------PEKSENCPRDRC-LPPNATALREAGVVFKKAKRDERHIMDIRFKDGVLEIPSFGIEDDFETYVRNLMAFE
               HK  H V                   S    R +C  PP  T L EAG+VFKKA R  +HIMDI FKD VL+IP   I D FETYVRNLMAFE
Subjt:  R-----WHKTQHFV----------------PEKSENCPRDRC-LPPNATALREAGVVFKKAKRDERHIMDIRFKDGVLEIPSFGIEDDFETYVRNLMAFE

Query:  QFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLVKEGIIINSIGGSDEEISKLFNDLCKYITV-DEFNHFSNISEDLKKHC
        Q+   +N+ K                    YA  YF FL+ LIS E+DVSLLVK  II N IGG+++E+S LFNDLCK + V  + N F++I+E L +HC
Subjt:  QFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLVKEGIIINSIGGSDEEISKLFNDLCKYITV-DEFNHFSNISEDLKKHC

Query:  EKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQTSFRA
          RW + MASLR +YFNTPWAFISF+AA F+ILLT LQT F A
Subjt:  EKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQTSFRA

A0A6J1DYL4 UPF0481 protein At3g47200-like isoform X37.9e-7342.44Show/hide
Query:  DEITVFIEGKLQGVPRTVPTYSLYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSYQRRVNKSIQDIVEKTRSWETIARKCYSEPIN
        DE+   I+  LQ +P      +++RVP+RLL  N  AY+P++ISIGPFHH +Q L   E+ K RFL  Y RR N  I+  V   RSWET AR CY+EPIN
Subjt:  DEITVFIEGKLQGVPRTVPTYSLYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSYQRRVNKSIQDIVEKTRSWETIARKCYSEPIN

Query:  MNIDDFVGMMVLDGCFIVEFMII------------------------------------------------------------NYATSGP-NGPADQKLQ
        M+ D+FV MM++DGCFIVE M++                                                             + T GP   P   +L 
Subjt:  MNIDDFVGMMVLDGCFIVEFMII------------------------------------------------------------NYATSGP-NGPADQKLQ

Query:  R-----WHKTQHFV----------------PEKSENCPRDRC-LPPNATALREAGVVFKKAKRDERHIMDIRFKDGVLEIPSFGIEDDFETYVRNLMAFE
               HK  H V                   S    R +C  PP  T L EAG+VFKKA R  +HIMDI FKD VL+IP   I D FETYVRNLMAFE
Subjt:  R-----WHKTQHFV----------------PEKSENCPRDRC-LPPNATALREAGVVFKKAKRDERHIMDIRFKDGVLEIPSFGIEDDFETYVRNLMAFE

Query:  QFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLVKEGIIINSIGGSDEEISKLFNDLCKYITV-DEFNHFSNISEDLKKHC
        Q+   +N+ K                    YA  YF FL+ LIS E+DVSLLVK  II N IGG+++E+S LFNDLCK + V  + N F++I+E L +HC
Subjt:  QFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLVKEGIIINSIGGSDEEISKLFNDLCKYITV-DEFNHFSNISEDLKKHC

Query:  EKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQTSFRA
          RW + MASLR +YFNTPWAFISF+AA F+ILLT LQT F A
Subjt:  EKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQTSFRA

A0A6J1E120 UPF0481 protein At3g47200-like isoform X17.9e-7342.44Show/hide
Query:  DEITVFIEGKLQGVPRTVPTYSLYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSYQRRVNKSIQDIVEKTRSWETIARKCYSEPIN
        DE+   I+  LQ +P      +++RVP+RLL  N  AY+P++ISIGPFHH +Q L   E+ K RFL  Y RR N  I+  V   RSWET AR CY+EPIN
Subjt:  DEITVFIEGKLQGVPRTVPTYSLYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSYQRRVNKSIQDIVEKTRSWETIARKCYSEPIN

Query:  MNIDDFVGMMVLDGCFIVEFMII------------------------------------------------------------NYATSGP-NGPADQKLQ
        M+ D+FV MM++DGCFIVE M++                                                             + T GP   P   +L 
Subjt:  MNIDDFVGMMVLDGCFIVEFMII------------------------------------------------------------NYATSGP-NGPADQKLQ

Query:  R-----WHKTQHFV----------------PEKSENCPRDRC-LPPNATALREAGVVFKKAKRDERHIMDIRFKDGVLEIPSFGIEDDFETYVRNLMAFE
               HK  H V                   S    R +C  PP  T L EAG+VFKKA R  +HIMDI FKD VL+IP   I D FETYVRNLMAFE
Subjt:  R-----WHKTQHFV----------------PEKSENCPRDRC-LPPNATALREAGVVFKKAKRDERHIMDIRFKDGVLEIPSFGIEDDFETYVRNLMAFE

Query:  QFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLVKEGIIINSIGGSDEEISKLFNDLCKYITV-DEFNHFSNISEDLKKHC
        Q+   +N+ K                    YA  YF FL+ LIS E+DVSLLVK  II N IGG+++E+S LFNDLCK + V  + N F++I+E L +HC
Subjt:  QFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLVKEGIIINSIGGSDEEISKLFNDLCKYITV-DEFNHFSNISEDLKKHC

Query:  EKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQTSFRA
          RW + MASLR +YFNTPWAFISF+AA F+ILLT LQT F A
Subjt:  EKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQTSFRA

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026455.4e-1024.48Show/hide
Query:  PNATALREAGVVFKKAKRDERHIMDIRFKDGVLEIPSFGIEDDFETYVRNLMAFEQFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISS
        P+ + L +AGV FK         +      G   +P   ++ + ET +RNL+A+E                       +  S     T Y E ++ +I S
Subjt:  PNATALREAGVVFKKAKRDERHIMDIRFKDGVLEIPSFGIEDDFETYVRNLMAFEQFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISS

Query:  EKDVSLLVKEGIIINSIGGSDEEISKLFNDLCKYITVDEFNHFSNISEDLKKHCEKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQ
        E+DV LL ++G++++ +  SD+E ++++N + K + + +        ED+ ++   RW   +  L   Y    W  ++FLAA  +++L  LQ
Subjt:  EKDVSLLVKEGIIINSIGGSDEEISKLFNDLCKYITVDEFNHFSNISEDLKKHCEKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQ

Q9SD53 UPF0481 protein At3g472002.7e-2524.48Show/hide
Query:  LYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSY-QRRVNKSIQD--IVEKTRSWETIARKCYSEPINMNIDDFVGMMVLDGCFI--
        ++RVP+  + +NP AY P+V+SIGP+H+ ++ L+  ++ K R L  +      K +++  +V+     E   RK YSE +     D + MMVLDGCFI  
Subjt:  LYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSY-QRRVNKSIQD--IVEKTRSWETIARKCYSEPINMNIDDFVGMMVLDGCFI--

Query:  ------------------------------------VEFMIINYATSGP----------------NGPADQKLQRW-----HKTQH--------FVPEKS
                                            V F ++     G                   P D++   W     +K +H        F+P  S
Subjt:  ------------------------------------VEFMIINYATSGP----------------NGPADQKLQRW-----HKTQH--------FVPEKS

Query:  E-------------------NCPR--DRCLP--PNATALREAGVVFKKAKRDERHIMDIRFKDGVLEIPSFGIEDDFETYVRNLMAFEQFLVE-NNEMKS
        E                   N P    + +P   +A  LR  G+ F+  +  E  I+++R K   L+IP    +    ++  N +AFEQF  + +NE+  
Subjt:  E-------------------NCPR--DRCLP--PNATALREAGVVFKKAKRDERHIMDIRFKDGVLEIPSFGIEDDFETYVRNLMAFEQFLVE-NNEMKS

Query:  EDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLVKEGIIINSIGGSDEEISKLFNDLCKYITVD-EFNHFSNISEDLKKHCEKRWTRWMASL
                             T Y  F+  L+++E+DV+ L  + +II +  GS+ E+S+ F  + K +  + + ++ +N+ + + ++ +K +    A  
Subjt:  EDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLVKEGIIINSIGGSDEEISKLFNDLCKYITVD-EFNHFSNISEDLKKHCEKRWTRWMASL

Query:  RHNYFNTPWAFISFLAATFIILLTLLQTS
        RH +F +PW F+S  A  F+ILLT+LQ++
Subjt:  RHNYFNTPWAFISFLAATFIILLTLLQTS

Arabidopsis top hitse value%identityAlignment
AT3G50120.1 Plant protein of unknown function (DUF247)2.3e-4027.59Show/hide
Query:  AYHNIDEVKRQSRDEITVFIEGKLQGVPRTVPT-----YSLYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSYQRRVNKSIQDIVE
        A + I E  + SRD+  + I  KL+   R   T       +YRVP  L + +  +Y P+ +S+GP+HH ++ L+  +  K+R ++   +R N+ I+  ++
Subjt:  AYHNIDEVKRQSRDEITVFIEGKLQGVPRTVPT-----YSLYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSYQRRVNKSIQDIVE

Query:  KTRSWETIARKCYSEPINMNIDDFVGMMVLDGCFIVE--------FMIINYATSGP---------------------------NGPADQKLQRWHKT---
          R  E  AR CY  P++++ ++F+ M+VLDGCF++E        F  + YA + P                           N   + +L   ++T   
Subjt:  KTRSWETIARKCYSEPINMNIDDFVGMMVLDGCFIVE--------FMIINYATSGP---------------------------NGPADQKLQRWHKT---

Query:  -----QHFVP-------------EKSEN-CPRDRCLPPNA-------------------------------------------------TALREAGVVFK
             + F P              K EN   RD+   P A                                                 T L+EAG+ F+
Subjt:  -----QHFVP-------------EKSEN-CPRDRCLPPNA-------------------------------------------------TALREAGVVFK

Query:  KAKRDERHIMDIRFKDGVLEIPSFGIEDDFETYVRNLMAFEQFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLVKEGIII
        + K D     D++FK+G LEIP   I D  ++   NL+AFE             QCH +    I         T Y  F+D LI S +DVS L   GII 
Subjt:  KAKRDERHIMDIRFKDGVLEIPSFGIEDDFETYVRNLMAFEQFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLVKEGIII

Query:  NSIGGSDEEISKLFNDLCKYITVD-EFNHFSNISEDLKKHCEKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQTSF
        + + GSD E++ LFN LC+ +  D E ++ S +S ++ ++ + +W  W A+L+H YFN PWA +SF AA  +++LT  Q+ +
Subjt:  NSIGGSDEEISKLFNDLCKYITVD-EFNHFSNISEDLKKHCEKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQTSF

AT3G50150.1 Plant protein of unknown function (DUF247)1.8e-3727.39Show/hide
Query:  HVEAYHNIDEVK------RQSRDEITVFIEGKLQGVPRTVPTYS-----LYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSYQRRV
        HVE Y    +++      R++R+E  + I+ K++       T S     +YRVP  L + +  +Y+P+ +SIGP+HH +  L+  E  K+R ++    R 
Subjt:  HVEAYHNIDEVK------RQSRDEITVFIEGKLQGVPRTVPTYS-----LYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSYQRRV

Query:  NKSIQDIVEKTRSWETIARKCYSEPINM-NIDDFVGMMVLDGCFIVE--------FMIINYATSGP----------------------------------
          +I+  ++  +  E  AR CY  PI+M N ++F  M+VLDGCF++E        F  I YA + P                                  
Subjt:  NKSIQDIVEKTRSWETIARKCYSEPINM-NIDDFVGMMVLDGCFIVE--------FMIINYATSGP----------------------------------

Query:  NGPADQ-----------------------KLQRWHKTQHFVPEKSEN-------------------------------CPRDRCLPPNATALREAGVVFK
         G  +Q                       K +R   +Q    E  +N                                 + + L    T LR AGV F 
Subjt:  NGPADQ-----------------------KLQRWHKTQHFVPEKSEN-------------------------------CPRDRCLPPNATALREAGVVFK

Query:  KAKRDERHIMDIRFKDGVLEIPSFGIEDDFETYVRNLMAFEQFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLVKEGIII
          +++   + DI FK+G L+IP   I D  ++   NL+AFE             QCH +    I         T Y  F+D LI+S +DVS L  +GII 
Subjt:  KAKRDERHIMDIRFKDGVLEIPSFGIEDDFETYVRNLMAFEQFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLVKEGIII

Query:  NSIGGSDEEISKLFNDLCKYITVD-EFNHFSNISEDLKKHCEKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQTSF
        + + GSD E++ LFN LCK +  D +  + S +S ++ ++  ++W    A+LR  YFN PWA+ SF AA  ++ LT  Q+ F
Subjt:  NSIGGSDEEISKLFNDLCKYITVD-EFNHFSNISEDLKKHCEKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQTSF

AT3G50160.1 Plant protein of unknown function (DUF247)1.9e-3929.39Show/hide
Query:  GHVEAYHNIDEVKRQSRDEITVFIEGKLQGVPRTVPTYSL--------------------YRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKF
        G +E    I+E  R+++ E  V IE K +   R +   SL                    YRVP  L + +  +Y+P+++SIGP+HH  + L   E  K+
Subjt:  GHVEAYHNIDEVKRQSRDEITVFIEGKLQGVPRTVPTYSL--------------------YRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKF

Query:  RFLSSYQRRVNKSIQDIVEKTRSWETIARKCYSEPINMNIDDFVGMMVLDGCFIVE--------FMIINYATSGPNGPA------DQKLQR---------
        R ++    R    I+  ++  +  E  AR CY  PINMN ++F+ M+VLDG FI+E        F  I YA   PN P        Q ++R         
Subjt:  RFLSSYQRRVNKSIQDIVEKTRSWETIARKCYSEPINMNIDDFVGMMVLDGCFIVE--------FMIINYATSGPNGPA------DQKLQR---------

Query:  -WH-----------------KTQHFVPEKSENCPRDRCLPPN-------------------------------------ATALREAGVVFKKAKRDERHI
         W                    Q F P      P    L                                         T LR AGV F   +++  H 
Subjt:  -WH-----------------KTQHFVPEKSENCPRDRCLPPN-------------------------------------ATALREAGVVFKKAKRDERHI

Query:  MDIRFKDGVLEIPSFGIEDDFETYVRNLMAFEQFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLVKEGIIINSIGGSDEE
         DI FK+G L+IP   I D  ++   NL+AFE             QCH +  +KI         T Y  F+D LI+S +DVS L   GII N + GSD E
Subjt:  MDIRFKDGVLEIPSFGIEDDFETYVRNLMAFEQFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLVKEGIIINSIGGSDEE

Query:  ISKLFNDLCKYITVD-EFNHFSNISEDLKKHCEKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQTSF
        +S LFN L K +  D    + S ++ ++  +  ++W    A+LRH YFN PWA+ SF+AA  +++ T  Q+ F
Subjt:  ISKLFNDLCKYITVD-EFNHFSNISEDLKKHCEKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQTSF

AT3G50170.1 Plant protein of unknown function (DUF247)6.0e-4129.36Show/hide
Query:  DEVKRQSRDEITVFIEGKLQGVPRTVPTYSLYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSYQRRVNKSIQDIVEKTRSWETIAR
        D++++  RD+ T  I GKL           +YRVP  L + +  +Y P+ +S+GP+HH ++ L+  E  K+R L+   +R+ + I+      R  E  AR
Subjt:  DEVKRQSRDEITVFIEGKLQGVPRTVPTYSLYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSYQRRVNKSIQDIVEKTRSWETIAR

Query:  KCYSEPINMNIDDFVGMMVLDGCFIVE--------FMIINYATSGP------------------------------------------------------
         CY  PI+++ ++F  M+VLDGCF++E        F  I YA + P                                                      
Subjt:  KCYSEPINMNIDDFVGMMVLDGCFIVE--------FMIINYATSGP------------------------------------------------------

Query:  --------NGPADQKLQRW-HKTQHFVPEKSE-NC--------------------------------PRDRCLPPNATALREAGVVFKKAKRDERHIMDI
                  P   KL  W  K+   + +K E +C                                 R + L    T LREAGV F+K K D     DI
Subjt:  --------NGPADQKLQRW-HKTQHFVPEKSE-NC--------------------------------PRDRCLPPNATALREAGVVFKKAKRDERHIMDI

Query:  RFKDGVLEIPSFGIEDDFETYVRNLMAFEQFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLVKEGIIINSIGGSDEEISK
         FK+G LEIP   I D  ++   NL+AFE             QCH E    I         T Y  F+D LI+S +DVS L   GII + + GSD E++ 
Subjt:  RFKDGVLEIPSFGIEDDFETYVRNLMAFEQFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLVKEGIIINSIGGSDEEISK

Query:  LFNDLCKYITVD-EFNHFSNISEDLKKHCEKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQTSF
        LFN LC+ +  D + +H S +S D+ ++  ++W    A+L H YFN PWA+ SF AA  ++LLTL Q+ +
Subjt:  LFNDLCKYITVD-EFNHFSNISEDLKKHCEKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQTSF

AT4G31980.1 unknown protein8.7e-4829.4Show/hide
Query:  CGHVEAYHNIDEVKRQSRDE---ITVFIEGKLQGVPRTVPTYSLYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSYQRRVNKSIQD
        C  V+ +      +R +++E   +   I+ KL  +        +Y+VP +L  +NP AY PR++S GP H  ++ L+  E+ K+R+L S+  R N S++D
Subjt:  CGHVEAYHNIDEVKRQSRDE---ITVFIEGKLQGVPRTVPTYSLYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLSSYQRRVNKSIQD

Query:  IVEKTRSWETIARKCYSEPINMNIDDFVGMMVLDGCFIVE-----------------------------------------------FMIINYATSGPNG
        +V   R+WE  AR CY+E + ++ D+FV M+V+DG F+VE                                                +++NY   G   
Subjt:  IVEKTRSWETIARKCYSEPINMNIDDFVGMMVLDGCFIVE-----------------------------------------------FMIINYATSGPNG

Query:  PADQKLQRWH---------------KTQHFV-----------PEKSENCPRDRCLPPNATALREAGVVFKKAKRDERHIMDIRFKDGVLEIPSFGIEDDF
        P+  +L + H               + +HFV           P K E         P AT L  AGV FK A+     ++DI F DGVL+IP+  ++D  
Subjt:  PADQKLQRWH---------------KTQHFV-----------PEKSENCPRDRCLPPNATALREAGVVFKKAKRDERHIMDIRFKDGVLEIPSFGIEDDF

Query:  ETYVRNLMAFEQFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLVKEGIIINSIGGSDEEISKLFNDLCKYITVDEFNHFS
        E+  +N++ FEQ    N                           DY   L   I S  D  LL+  GII+N +G S  ++S LFN + K +  D   +FS
Subjt:  ETYVRNLMAFEQFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLVKEGIIINSIGGSDEEISKLFNDLCKYITVDEFNHFS

Query:  NISEDLKKHCEKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQT
         +SE+L+ +C   W RW A LR +YF+ PWA  S  AA  ++LLT +Q+
Subjt:  NISEDLKKHCEKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCTTCTTCTTCACCATCATCACCATTCCACCCAATCTGCATTTGCCGTCAGCCACCACCGGCCGTCGACGACTCCATCGAAGAGCCCATGGCTCTGGTGAGAAGCT
GGGAGAAGGAGGGCGAATCAACGGCACCGTCAGAGAAGAAACTAGAGAGAGAGACGTCGGCGGCTTGGTAGAGAGAGCACAAGAGAGAGACATCGAAAGAGATTTGCAAC
TCAGATCTGCAGCCCACAGAAGAAGAGAGACGTCGGCAACCGGATCCGTTTTAGATCTACAACCCACAGAAGAAGAGAGACGCCGGCAGCGGATCCGTTTCAGATCTGCA
ACCCACGACTCAATCCGTTTCAGATTTGGAATGGGGAGATTTTTAAAGTCTCTTTGCATTTCAGTTCAAATGGATTGCGGTCATGTTGAAGCATATCACAATATTGATGA
GGTTAAACGACAATCTCGGGATGAGATCACGGTATTCATCGAAGGAAAGCTTCAAGGAGTGCCTCGCACCGTTCCAACATATAGCCTCTATCGGGTTCCCAAACGGCTAC
TTGACATTAACCCTATAGCTTATGTGCCTCGAGTCATTTCAATTGGTCCTTTTCACCATCATCAACAAATTTTGAAGGACACAGAAGAGCTCAAGTTTCGATTTTTAAGT
AGCTATCAACGTCGCGTAAATAAGAGCATTCAGGACATTGTGGAAAAGACTCGAAGTTGGGAGACAATAGCCCGTAAATGCTACTCAGAGCCCATAAACATGAACATTGA
TGACTTTGTGGGAATGATGGTTTTAGATGGTTGTTTCATAGTGGAGTTCATGATAATAAATTACGCTACCAGTGGACCTAATGGACCTGCAGATCAGAAGCTCCAACGAT
GGCATAAAACCCAACACTTTGTTCCCGAGAAGAGTGAAAATTGTCCACGCGATAGGTGCCTTCCCCCAAATGCAACCGCTCTTCGTGAGGCTGGTGTCGTCTTCAAGAAA
GCAAAACGAGACGAAAGACACATTATGGACATAAGGTTCAAAGATGGGGTCCTAGAAATTCCGTCTTTCGGAATTGAAGATGATTTTGAAACATATGTACGAAACTTAAT
GGCATTTGAGCAGTTTTTGGTGGAGAATAATGAGATGAAAAGTGAAGATCAGTGTCACAAGGAGCTTGATGAGAAGATTAGCAAAGAGAGCATTAAAAGGTATGCAACGG
ATTATTTTGAGTTCCTAGATGAGTTGATATCATCAGAGAAAGATGTGAGTTTACTTGTGAAGGAAGGAATCATAATCAATAGTATTGGTGGCAGTGATGAAGAAATTTCA
AAATTGTTTAATGACCTTTGCAAGTATATCACCGTAGATGAATTTAACCACTTCTCCAATATTAGCGAAGATCTAAAAAAGCACTGTGAAAAACGATGGACCAGGTGGAT
GGCTTCATTGAGACACAACTATTTTAATACGCCATGGGCTTTTATTTCCTTCTTAGCTGCAACCTTCATTATTTTGCTCACTCTCCTACAAACGAGTTTTAGGGCATAA
mRNA sequenceShow/hide mRNA sequence
ATGTTCTTCTTCTTCACCATCATCACCATTCCACCCAATCTGCATTTGCCGTCAGCCACCACCGGCCGTCGACGACTCCATCGAAGAGCCCATGGCTCTGGTGAGAAGCT
GGGAGAAGGAGGGCGAATCAACGGCACCGTCAGAGAAGAAACTAGAGAGAGAGACGTCGGCGGCTTGGTAGAGAGAGCACAAGAGAGAGACATCGAAAGAGATTTGCAAC
TCAGATCTGCAGCCCACAGAAGAAGAGAGACGTCGGCAACCGGATCCGTTTTAGATCTACAACCCACAGAAGAAGAGAGACGCCGGCAGCGGATCCGTTTCAGATCTGCA
ACCCACGACTCAATCCGTTTCAGATTTGGAATGGGGAGATTTTTAAAGTCTCTTTGCATTTCAGTTCAAATGGATTGCGGTCATGTTGAAGCATATCACAATATTGATGA
GGTTAAACGACAATCTCGGGATGAGATCACGGTATTCATCGAAGGAAAGCTTCAAGGAGTGCCTCGCACCGTTCCAACATATAGCCTCTATCGGGTTCCCAAACGGCTAC
TTGACATTAACCCTATAGCTTATGTGCCTCGAGTCATTTCAATTGGTCCTTTTCACCATCATCAACAAATTTTGAAGGACACAGAAGAGCTCAAGTTTCGATTTTTAAGT
AGCTATCAACGTCGCGTAAATAAGAGCATTCAGGACATTGTGGAAAAGACTCGAAGTTGGGAGACAATAGCCCGTAAATGCTACTCAGAGCCCATAAACATGAACATTGA
TGACTTTGTGGGAATGATGGTTTTAGATGGTTGTTTCATAGTGGAGTTCATGATAATAAATTACGCTACCAGTGGACCTAATGGACCTGCAGATCAGAAGCTCCAACGAT
GGCATAAAACCCAACACTTTGTTCCCGAGAAGAGTGAAAATTGTCCACGCGATAGGTGCCTTCCCCCAAATGCAACCGCTCTTCGTGAGGCTGGTGTCGTCTTCAAGAAA
GCAAAACGAGACGAAAGACACATTATGGACATAAGGTTCAAAGATGGGGTCCTAGAAATTCCGTCTTTCGGAATTGAAGATGATTTTGAAACATATGTACGAAACTTAAT
GGCATTTGAGCAGTTTTTGGTGGAGAATAATGAGATGAAAAGTGAAGATCAGTGTCACAAGGAGCTTGATGAGAAGATTAGCAAAGAGAGCATTAAAAGGTATGCAACGG
ATTATTTTGAGTTCCTAGATGAGTTGATATCATCAGAGAAAGATGTGAGTTTACTTGTGAAGGAAGGAATCATAATCAATAGTATTGGTGGCAGTGATGAAGAAATTTCA
AAATTGTTTAATGACCTTTGCAAGTATATCACCGTAGATGAATTTAACCACTTCTCCAATATTAGCGAAGATCTAAAAAAGCACTGTGAAAAACGATGGACCAGGTGGAT
GGCTTCATTGAGACACAACTATTTTAATACGCCATGGGCTTTTATTTCCTTCTTAGCTGCAACCTTCATTATTTTGCTCACTCTCCTACAAACGAGTTTTAGGGCATAA
Protein sequenceShow/hide protein sequence
MFFFFTIITIPPNLHLPSATTGRRRLHRRAHGSGEKLGEGGRINGTVREETRERDVGGLVERAQERDIERDLQLRSAAHRRRETSATGSVLDLQPTEEERRRQRIRFRSA
THDSIRFRFGMGRFLKSLCISVQMDCGHVEAYHNIDEVKRQSRDEITVFIEGKLQGVPRTVPTYSLYRVPKRLLDINPIAYVPRVISIGPFHHHQQILKDTEELKFRFLS
SYQRRVNKSIQDIVEKTRSWETIARKCYSEPINMNIDDFVGMMVLDGCFIVEFMIINYATSGPNGPADQKLQRWHKTQHFVPEKSENCPRDRCLPPNATALREAGVVFKK
AKRDERHIMDIRFKDGVLEIPSFGIEDDFETYVRNLMAFEQFLVENNEMKSEDQCHKELDEKISKESIKRYATDYFEFLDELISSEKDVSLLVKEGIIINSIGGSDEEIS
KLFNDLCKYITVDEFNHFSNISEDLKKHCEKRWTRWMASLRHNYFNTPWAFISFLAATFIILLTLLQTSFRA