; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0020650 (gene) of Chayote v1 genome

Gene IDSed0020650
OrganismSechium edule (Chayote v1)
DescriptionPlant protein of unknown function (DUF247)
Genome locationLG11:32406910..32412369
RNA-Seq ExpressionSed0020650
SyntenySed0020650
Gene Ontology termsNA
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008445188.1 PREDICTED: UPF0481 protein At3g47200-like [Cucumis melo]4.9e-9146.33Show/hide
Query:  DHVALVIEDNLQKMAP-FVPECSIHRVPKALLNMNHNAYVPRDISIGPFH-HDKQKFKTTEELKLRFFNSYRCRV-----------GRSAQDIVRRARGW
        D+V + IE  L ++ P    +CSI+RVPK L  MN  AY P+ ISIGPFH H  +     E+ KL+ F +Y  RV            +S +D+V+RA+ W
Subjt:  DHVALVIEDNLQKMAP-FVPECSIHRVPKALLNMNHNAYVPRDISIGPFH-HDKQKFKTTEELKLRFFNSYRCRV-----------GRSAQDIVRRARGW

Query:  ERKAREYYSEPINMNSEDFVIMMVLDGCFIVEFMIKD-RTYS------FPPNENKVHSSFYKAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSLTNI
          +AR  Y+E INMN EDF+ MM++DGCFIVEF I D   Y+      FP  EN V  SFYK    ++  +LI LENQLPFFVLQ+LFDLIP      N 
Subjt:  ERKAREYYSEPINMNSEDFVIMMVLDGCFIVEFMIKD-RTYS------FPPNENKVHSSFYKAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSLTNI

Query:  EVFISVVHLFAGGCVWPNKSGIQIYNNNLNKNPRHLLDFLSSYFVPMDEITGEHH---------CLPPSATELGKAGIVFKKVEGDQICIMDISFKDGIL
           ++  +L  G       S I      L+  P+H +DFLS Y VP  +   +            +PPS TE+ +AG+  KK + +  C+++I F++GIL
Subjt:  EVFISVVHLFAGGCVWPNKSGIQIYNNNLNKNPRHLLDFLSSYFVPMDEITGEHH---------CLPPSATELGKAGIVFKKVEGDQICIMDISFKDGIL

Query:  NIPPLEIDDKFESCARNLLAFAQFGWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITVRRN-NYFSHISNDL
         IPPL IDD FE   RNLLAF  F  E         I Y  F+D LI TEKDVNLL KE IIIN IGGS  E+SQLFN++CK ++   N NYF+  S  L
Subjt:  NIPPLEIDDKFESCARNLLAFAQFGWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITVRRN-NYFSHISNDL

Query:  KQHCDRKRNKWMASLRQNYFHTPWTLISFLAATFLIILTLLQTIFTGIS
        + HCDR+ NK  ASL+ NYF+TPW  IS  AATFL++LT+LQTIF+ IS
Subjt:  KQHCDRKRNKWMASLRQNYFHTPWTLISFLAATFLIILTLLQTIFTGIS

XP_022158990.1 UPF0481 protein At3g47200-like isoform X2 [Momordica charantia]4.2e-9046Show/hide
Query:  MDHVALVIEDNLQKMAPFVPECSIHRVPKALLNMNHNAYVPRDISIGPFHHDKQKFKTTEELKLRFFNSYRCRVGRSAQDIVRRARGWERKAREYYSEPI
        +D V   I+  LQ++ P   EC+IHRVP+ LL  N  AY+P+ ISIGPFHH +Q     E+ KLRF + Y  R     +  V   R WE  AR  Y+EPI
Subjt:  MDHVALVIEDNLQKMAPFVPECSIHRVPKALLNMNHNAYVPRDISIGPFHHDKQKFKTTEELKLRFFNSYRCRVGRSAQDIVRRARGWERKAREYYSEPI

Query:  NMNSEDFVIMMVLDGCFIVEFMIKDRTYSFPPNENKVHSSFYKAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSLTNIEVFISVVHLF--AGGCVWP
        NM+S++FV MM++DGCFIVE M+          E +     + A+  ++  +LIMLENQLPFFVLQ LFD    ++ L+    F+ + H+F   G  + P
Subjt:  NMNSEDFVIMMVLDGCFIVEFMIKDRTYSFPPNENKVHSSFYKAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSLTNIEVFISVVHLF--AGGCVWP

Query:  N----KSGIQIYNNNLNKNPRHLLDFLSSYFVPMD-EITGEHHCL---------PPSATELGKAGIVFKKVEGDQICIMDISFKDGILNIPPLEIDDKFE
               G+ I  + +N    HL+DFLS Y+ P    ++   H L         PP+ TEL +AGIVFKK    +  IMDISFKD +L IPPLEI D FE
Subjt:  N----KSGIQIYNNNLNKNPRHLLDFLSSYFVPMD-EITGEHHCL---------PPSATELGKAGIVFKKVEGDQICIMDISFKDGILNIPPLEIDDKFE

Query:  SCARNLLAFAQFGWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITVRRN-NYFSHISNDLKQHCDRKRNKWM
        +  RNL+AF Q+    +GK    AI YF+FL+ LI+ E+DV+LL K  II NCIGG+  E+S LFND+CK++ VR + N F+HI+  L +HC  + NK M
Subjt:  SCARNLLAFAQFGWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITVRRN-NYFSHISNDLKQHCDRKRNKWM

Query:  ASLRQNYFHTPWTLISFLAATFLIILTLLQTIFTGIS
        ASLR++YF+TPW  ISF+AA FLI+LT LQT+F+ +S
Subjt:  ASLRQNYFHTPWTLISFLAATFLIILTLLQTIFTGIS

XP_022158992.1 UPF0481 protein At3g47200-like isoform X3 [Momordica charantia]4.2e-9046Show/hide
Query:  MDHVALVIEDNLQKMAPFVPECSIHRVPKALLNMNHNAYVPRDISIGPFHHDKQKFKTTEELKLRFFNSYRCRVGRSAQDIVRRARGWERKAREYYSEPI
        +D V   I+  LQ++ P   EC+IHRVP+ LL  N  AY+P+ ISIGPFHH +Q     E+ KLRF + Y  R     +  V   R WE  AR  Y+EPI
Subjt:  MDHVALVIEDNLQKMAPFVPECSIHRVPKALLNMNHNAYVPRDISIGPFHHDKQKFKTTEELKLRFFNSYRCRVGRSAQDIVRRARGWERKAREYYSEPI

Query:  NMNSEDFVIMMVLDGCFIVEFMIKDRTYSFPPNENKVHSSFYKAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSLTNIEVFISVVHLF--AGGCVWP
        NM+S++FV MM++DGCFIVE M+          E +     + A+  ++  +LIMLENQLPFFVLQ LFD    ++ L+    F+ + H+F   G  + P
Subjt:  NMNSEDFVIMMVLDGCFIVEFMIKDRTYSFPPNENKVHSSFYKAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSLTNIEVFISVVHLF--AGGCVWP

Query:  N----KSGIQIYNNNLNKNPRHLLDFLSSYFVPMD-EITGEHHCL---------PPSATELGKAGIVFKKVEGDQICIMDISFKDGILNIPPLEIDDKFE
               G+ I  + +N    HL+DFLS Y+ P    ++   H L         PP+ TEL +AGIVFKK    +  IMDISFKD +L IPPLEI D FE
Subjt:  N----KSGIQIYNNNLNKNPRHLLDFLSSYFVPMD-EITGEHHCL---------PPSATELGKAGIVFKKVEGDQICIMDISFKDGILNIPPLEIDDKFE

Query:  SCARNLLAFAQFGWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITVRRN-NYFSHISNDLKQHCDRKRNKWM
        +  RNL+AF Q+    +GK    AI YF+FL+ LI+ E+DV+LL K  II NCIGG+  E+S LFND+CK++ VR + N F+HI+  L +HC  + NK M
Subjt:  SCARNLLAFAQFGWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITVRRN-NYFSHISNDLKQHCDRKRNKWM

Query:  ASLRQNYFHTPWTLISFLAATFLIILTLLQTIFTGIS
        ASLR++YF+TPW  ISF+AA FLI+LT LQT+F+ +S
Subjt:  ASLRQNYFHTPWTLISFLAATFLIILTLLQTIFTGIS

XP_023547064.1 UPF0481 protein At3g47200-like [Cucurbita pepo subsp. pepo]4.9e-9145.99Show/hide
Query:  IEDNLQKMAPFVPECSIHRVPKALLNMNHNAYVPRDISIGPFHHDKQKFKTTEELKLRFFNSYRCRVGR---SAQDIVRRARGWERKAREYYSEPINMNS
        I++ + K+ P   +CSI RVPK L NMNH AY P+ ISIGPFHH ++    TE  KLR   ++  R+G    S + + +  + W ++ R  Y EPINMN 
Subjt:  IEDNLQKMAPFVPECSIHRVPKALLNMNHNAYVPRDISIGPFHHDKQKFKTTEELKLRFFNSYRCRVGR---SAQDIVRRARGWERKAREYYSEPINMNS

Query:  EDFVIMMVLDGCFIVEFMIKDRTYSFP----PNENKVHSSFYKAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSLTNIE-VFISVVHLFAGGCVWPN
        ++FV MMV+DGCF+VEF+I+     +P       N +  +F++     +  +LIMLENQ+PFF+L+ LF LIP  +SL+  E ++IS        C   N
Subjt:  EDFVIMMVLDGCFIVEFMIKDRTYSFP----PNENKVHSSFYKAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSLTNIE-VFISVVHLFAGGCVWPN

Query:  KSGIQIYNNNLNKNPRHLLDFLSSYFVPMDEITGEHH---CLPPSATELGKAGIVFKKVEGDQICIMDISFKDGILNIPPLEIDDKFESCARNLLAFAQF
         S         +  P+HL+DFLS +FV    +   ++     PPS TEL +AG+  KK E + I +MDI FK+ IL IPPL IDD FE   RNL+AF  F
Subjt:  KSGIQIYNNNLNKNPRHLLDFLSSYFVPMDEITGEHH---CLPPSATELGKAGIVFKKVEGDQICIMDISFKDGILNIPPLEIDDKFESCARNLLAFAQF

Query:  GWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITVRRNNYFSHISNDLKQHCDRKRNKWMASLRQNYFHTPWT
              +N+ N I Y  F+D+LI+TEKDVNLL K GIIIN IGGS  E+S+LFN++CK +    ++  ++ISN L++HC+R+ NK  ASL+ NYF+TPW 
Subjt:  GWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITVRRNNYFSHISNDLKQHCDRKRNKWMASLRQNYFHTPWT

Query:  LISFLAATFLIILTLLQTIFTGIS
        ++SF AATFLIILTL QTIF+G+S
Subjt:  LISFLAATFLIILTLLQTIFTGIS

XP_031736550.1 UPF0481 protein At3g47200-like [Cucumis sativus]7.6e-9246.88Show/hide
Query:  DHVALVIEDNLQKM-APFVPECSIHRVPKALLNMNHNAYVPRDISIGPF-HHDKQKFKTTEELKLRFFNSYRCRVG----------RSAQDIVRRARGWE
        D+V + IE  L ++ +    +CSI+RVPK L  MN  AY P+ ISIGPF +H  +     E+ KL+ FN++  RV           RS  D+V++A+ W 
Subjt:  DHVALVIEDNLQKM-APFVPECSIHRVPKALLNMNHNAYVPRDISIGPF-HHDKQKFKTTEELKLRFFNSYRCRVG----------RSAQDIVRRARGWE

Query:  RKAREYYSEPINMNSEDFVIMMVLDGCFIVEFMIKD-------RTYSFPPNENKVHSSFYKAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSLTNIE
        ++AR  Y+E INMN EDF+ MM++DGCFIVEF I D           FP  EN V  SFYK    ++  +LI LENQLPFFVLQ+LFDLIP  +   N  
Subjt:  RKAREYYSEPINMNSEDFVIMMVLDGCFIVEFMIKD-------RTYSFPPNENKVHSSFYKAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSLTNIE

Query:  VFISVVHLFAGGCVWPNKSGIQIYNNNLNKNPRHLLDFLSSYFVPMDEITGEHH---------CLPPSATELGKAGIVFKKVEGDQICIMDISFKDGILN
          ++  +L  G       S I      L+  P+H +DFLS YFVP      +            +PPS TEL +AG+  KK E  + C+M+I F++GIL 
Subjt:  VFISVVHLFAGGCVWPNKSGIQIYNNNLNKNPRHLLDFLSSYFVPMDEITGEHH---------CLPPSATELGKAGIVFKKVEGDQICIMDISFKDGILN

Query:  IPPLEIDDKFESCARNLLAFAQFGWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITVRRN-NYFSHISNDLK
        IPPL IDD FE   RNLLAF  F  E    N    I Y  F+D LI+TEKDVNLL KE IIIN IGGS  E+SQLFN++CK ++   N NYF++IS  L+
Subjt:  IPPLEIDDKFESCARNLLAFAQFGWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITVRRN-NYFSHISNDLK

Query:  QHCDRKRNKWMASLRQNYFHTPWTLISFLAATFLIILTLLQTIFTGIS
        +HCDR  NK  ASL+ NYF+TPW  ISF AAT L++LT+LQT+F+ IS
Subjt:  QHCDRKRNKWMASLRQNYFHTPWTLISFLAATFLIILTLLQTIFTGIS

TrEMBL top hitse value%identityAlignment
A0A1S3BBL9 UPF0481 protein At3g47200-like2.4e-9146.33Show/hide
Query:  DHVALVIEDNLQKMAP-FVPECSIHRVPKALLNMNHNAYVPRDISIGPFH-HDKQKFKTTEELKLRFFNSYRCRV-----------GRSAQDIVRRARGW
        D+V + IE  L ++ P    +CSI+RVPK L  MN  AY P+ ISIGPFH H  +     E+ KL+ F +Y  RV            +S +D+V+RA+ W
Subjt:  DHVALVIEDNLQKMAP-FVPECSIHRVPKALLNMNHNAYVPRDISIGPFH-HDKQKFKTTEELKLRFFNSYRCRV-----------GRSAQDIVRRARGW

Query:  ERKAREYYSEPINMNSEDFVIMMVLDGCFIVEFMIKD-RTYS------FPPNENKVHSSFYKAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSLTNI
          +AR  Y+E INMN EDF+ MM++DGCFIVEF I D   Y+      FP  EN V  SFYK    ++  +LI LENQLPFFVLQ+LFDLIP      N 
Subjt:  ERKAREYYSEPINMNSEDFVIMMVLDGCFIVEFMIKD-RTYS------FPPNENKVHSSFYKAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSLTNI

Query:  EVFISVVHLFAGGCVWPNKSGIQIYNNNLNKNPRHLLDFLSSYFVPMDEITGEHH---------CLPPSATELGKAGIVFKKVEGDQICIMDISFKDGIL
           ++  +L  G       S I      L+  P+H +DFLS Y VP  +   +            +PPS TE+ +AG+  KK + +  C+++I F++GIL
Subjt:  EVFISVVHLFAGGCVWPNKSGIQIYNNNLNKNPRHLLDFLSSYFVPMDEITGEHH---------CLPPSATELGKAGIVFKKVEGDQICIMDISFKDGIL

Query:  NIPPLEIDDKFESCARNLLAFAQFGWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITVRRN-NYFSHISNDL
         IPPL IDD FE   RNLLAF  F  E         I Y  F+D LI TEKDVNLL KE IIIN IGGS  E+SQLFN++CK ++   N NYF+  S  L
Subjt:  NIPPLEIDDKFESCARNLLAFAQFGWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITVRRN-NYFSHISNDL

Query:  KQHCDRKRNKWMASLRQNYFHTPWTLISFLAATFLIILTLLQTIFTGIS
        + HCDR+ NK  ASL+ NYF+TPW  IS  AATFL++LT+LQTIF+ IS
Subjt:  KQHCDRKRNKWMASLRQNYFHTPWTLISFLAATFLIILTLLQTIFTGIS

A0A5A7V9C4 UPF0481 protein5.9e-9047.09Show/hide
Query:  ECSIHRVPKALLNMNHNAYVPRDISIGPFH-HDKQKFKTTEELKLRFFNSYRCRV-----------GRSAQDIVRRARGWERKAREYYSEPINMNSEDFV
        +CSI+RVPK L  MN  AY P+ ISIGPFH H  +     E+ KL+ F +Y  RV            +S +D+V+RA+ W  +AR  Y+E INMN EDF+
Subjt:  ECSIHRVPKALLNMNHNAYVPRDISIGPFH-HDKQKFKTTEELKLRFFNSYRCRV-----------GRSAQDIVRRARGWERKAREYYSEPINMNSEDFV

Query:  IMMVLDGCFIVEFMIKD-RTYS------FPPNENKVHSSFYKAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSLTNIEVFISVVHLFAGGCVWPNKS
         MM++DGCFIVEF I D   Y+      FP  EN V  SFYK    ++  +LI LENQLPFFVLQ+LFDLIP      N    ++  +L  G       S
Subjt:  IMMVLDGCFIVEFMIKD-RTYS------FPPNENKVHSSFYKAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSLTNIEVFISVVHLFAGGCVWPNKS

Query:  GIQIYNNNLNKNPRHLLDFLSSYFVPMDEITGEHH---------CLPPSATELGKAGIVFKKVEGDQICIMDISFKDGILNIPPLEIDDKFESCARNLLA
         I      L+  P+H +DFLS Y VP  +   +            +PPS TE+ +AG+  KK + +  C+++I F++GIL IPPL IDD FE   RNLLA
Subjt:  GIQIYNNNLNKNPRHLLDFLSSYFVPMDEITGEHH---------CLPPSATELGKAGIVFKKVEGDQICIMDISFKDGILNIPPLEIDDKFESCARNLLA

Query:  FAQFGWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITVRRN-NYFSHISNDLKQHCDRKRNKWMASLRQNYF
        F  F  E         I Y  F+D LI+TEKDVNLL KE IIIN IGGS  E+SQLFN++CK ++   N NYF+  S  L+ HCDR+ NK  ASL+ NYF
Subjt:  FAQFGWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITVRRN-NYFSHISNDLKQHCDRKRNKWMASLRQNYF

Query:  HTPWTLISFLAATFLIILTLLQTIFTGIS
        +TPW  IS  AATFL++LT+LQTIF+ IS
Subjt:  HTPWTLISFLAATFLIILTLLQTIFTGIS

A0A6J1DXD6 UPF0481 protein At3g47200-like isoform X22.0e-9046Show/hide
Query:  MDHVALVIEDNLQKMAPFVPECSIHRVPKALLNMNHNAYVPRDISIGPFHHDKQKFKTTEELKLRFFNSYRCRVGRSAQDIVRRARGWERKAREYYSEPI
        +D V   I+  LQ++ P   EC+IHRVP+ LL  N  AY+P+ ISIGPFHH +Q     E+ KLRF + Y  R     +  V   R WE  AR  Y+EPI
Subjt:  MDHVALVIEDNLQKMAPFVPECSIHRVPKALLNMNHNAYVPRDISIGPFHHDKQKFKTTEELKLRFFNSYRCRVGRSAQDIVRRARGWERKAREYYSEPI

Query:  NMNSEDFVIMMVLDGCFIVEFMIKDRTYSFPPNENKVHSSFYKAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSLTNIEVFISVVHLF--AGGCVWP
        NM+S++FV MM++DGCFIVE M+          E +     + A+  ++  +LIMLENQLPFFVLQ LFD    ++ L+    F+ + H+F   G  + P
Subjt:  NMNSEDFVIMMVLDGCFIVEFMIKDRTYSFPPNENKVHSSFYKAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSLTNIEVFISVVHLF--AGGCVWP

Query:  N----KSGIQIYNNNLNKNPRHLLDFLSSYFVPMD-EITGEHHCL---------PPSATELGKAGIVFKKVEGDQICIMDISFKDGILNIPPLEIDDKFE
               G+ I  + +N    HL+DFLS Y+ P    ++   H L         PP+ TEL +AGIVFKK    +  IMDISFKD +L IPPLEI D FE
Subjt:  N----KSGIQIYNNNLNKNPRHLLDFLSSYFVPMD-EITGEHHCL---------PPSATELGKAGIVFKKVEGDQICIMDISFKDGILNIPPLEIDDKFE

Query:  SCARNLLAFAQFGWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITVRRN-NYFSHISNDLKQHCDRKRNKWM
        +  RNL+AF Q+    +GK    AI YF+FL+ LI+ E+DV+LL K  II NCIGG+  E+S LFND+CK++ VR + N F+HI+  L +HC  + NK M
Subjt:  SCARNLLAFAQFGWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITVRRN-NYFSHISNDLKQHCDRKRNKWM

Query:  ASLRQNYFHTPWTLISFLAATFLIILTLLQTIFTGIS
        ASLR++YF+TPW  ISF+AA FLI+LT LQT+F+ +S
Subjt:  ASLRQNYFHTPWTLISFLAATFLIILTLLQTIFTGIS

A0A6J1DYL4 UPF0481 protein At3g47200-like isoform X32.0e-9046Show/hide
Query:  MDHVALVIEDNLQKMAPFVPECSIHRVPKALLNMNHNAYVPRDISIGPFHHDKQKFKTTEELKLRFFNSYRCRVGRSAQDIVRRARGWERKAREYYSEPI
        +D V   I+  LQ++ P   EC+IHRVP+ LL  N  AY+P+ ISIGPFHH +Q     E+ KLRF + Y  R     +  V   R WE  AR  Y+EPI
Subjt:  MDHVALVIEDNLQKMAPFVPECSIHRVPKALLNMNHNAYVPRDISIGPFHHDKQKFKTTEELKLRFFNSYRCRVGRSAQDIVRRARGWERKAREYYSEPI

Query:  NMNSEDFVIMMVLDGCFIVEFMIKDRTYSFPPNENKVHSSFYKAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSLTNIEVFISVVHLF--AGGCVWP
        NM+S++FV MM++DGCFIVE M+          E +     + A+  ++  +LIMLENQLPFFVLQ LFD    ++ L+    F+ + H+F   G  + P
Subjt:  NMNSEDFVIMMVLDGCFIVEFMIKDRTYSFPPNENKVHSSFYKAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSLTNIEVFISVVHLF--AGGCVWP

Query:  N----KSGIQIYNNNLNKNPRHLLDFLSSYFVPMD-EITGEHHCL---------PPSATELGKAGIVFKKVEGDQICIMDISFKDGILNIPPLEIDDKFE
               G+ I  + +N    HL+DFLS Y+ P    ++   H L         PP+ TEL +AGIVFKK    +  IMDISFKD +L IPPLEI D FE
Subjt:  N----KSGIQIYNNNLNKNPRHLLDFLSSYFVPMD-EITGEHHCL---------PPSATELGKAGIVFKKVEGDQICIMDISFKDGILNIPPLEIDDKFE

Query:  SCARNLLAFAQFGWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITVRRN-NYFSHISNDLKQHCDRKRNKWM
        +  RNL+AF Q+    +GK    AI YF+FL+ LI+ E+DV+LL K  II NCIGG+  E+S LFND+CK++ VR + N F+HI+  L +HC  + NK M
Subjt:  SCARNLLAFAQFGWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITVRRN-NYFSHISNDLKQHCDRKRNKWM

Query:  ASLRQNYFHTPWTLISFLAATFLIILTLLQTIFTGIS
        ASLR++YF+TPW  ISF+AA FLI+LT LQT+F+ +S
Subjt:  ASLRQNYFHTPWTLISFLAATFLIILTLLQTIFTGIS

A0A6J1E120 UPF0481 protein At3g47200-like isoform X12.0e-9046Show/hide
Query:  MDHVALVIEDNLQKMAPFVPECSIHRVPKALLNMNHNAYVPRDISIGPFHHDKQKFKTTEELKLRFFNSYRCRVGRSAQDIVRRARGWERKAREYYSEPI
        +D V   I+  LQ++ P   EC+IHRVP+ LL  N  AY+P+ ISIGPFHH +Q     E+ KLRF + Y  R     +  V   R WE  AR  Y+EPI
Subjt:  MDHVALVIEDNLQKMAPFVPECSIHRVPKALLNMNHNAYVPRDISIGPFHHDKQKFKTTEELKLRFFNSYRCRVGRSAQDIVRRARGWERKAREYYSEPI

Query:  NMNSEDFVIMMVLDGCFIVEFMIKDRTYSFPPNENKVHSSFYKAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSLTNIEVFISVVHLF--AGGCVWP
        NM+S++FV MM++DGCFIVE M+          E +     + A+  ++  +LIMLENQLPFFVLQ LFD    ++ L+    F+ + H+F   G  + P
Subjt:  NMNSEDFVIMMVLDGCFIVEFMIKDRTYSFPPNENKVHSSFYKAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSLTNIEVFISVVHLF--AGGCVWP

Query:  N----KSGIQIYNNNLNKNPRHLLDFLSSYFVPMD-EITGEHHCL---------PPSATELGKAGIVFKKVEGDQICIMDISFKDGILNIPPLEIDDKFE
               G+ I  + +N    HL+DFLS Y+ P    ++   H L         PP+ TEL +AGIVFKK    +  IMDISFKD +L IPPLEI D FE
Subjt:  N----KSGIQIYNNNLNKNPRHLLDFLSSYFVPMD-EITGEHHCL---------PPSATELGKAGIVFKKVEGDQICIMDISFKDGILNIPPLEIDDKFE

Query:  SCARNLLAFAQFGWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITVRRN-NYFSHISNDLKQHCDRKRNKWM
        +  RNL+AF Q+    +GK    AI YF+FL+ LI+ E+DV+LL K  II NCIGG+  E+S LFND+CK++ VR + N F+HI+  L +HC  + NK M
Subjt:  SCARNLLAFAQFGWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITVRRN-NYFSHISNDLKQHCDRKRNKWM

Query:  ASLRQNYFHTPWTLISFLAATFLIILTLLQTIFTGIS
        ASLR++YF+TPW  ISF+AA FLI+LT LQT+F+ +S
Subjt:  ASLRQNYFHTPWTLISFLAATFLIILTLLQTIFTGIS

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026455.1e-1428.77Show/hide
Query:  SIHRVPKALLNMNHNAYVPRDISIGPFH------HDKQKFKTTEELKLR-FFNSYRCRVGRSAQDIVRRARGWERKAREYYSEPINMNSEDFVIMMVLDG
        SI  VPKAL+  + ++Y P  +SIGP+H      H+ +++K     K+R  +NS+R        D+V + +  E K R  Y + I  N E  + +M +D 
Subjt:  SIHRVPKALLNMNHNAYVPRDISIGPFH------HDKQKFKTTEELKLR-FFNSYRCRVGRSAQDIVRRARGWERKAREYYSEPINMNSEDFVIMMVLDG

Query:  CFIVEFMIKDRTYSFPPNENKVHSSFYKAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSLTNIEVFISVVHLFAGGCVWPNKSGIQIYNNNLNK---
         F++EF+   + YSF     KV +   +     +  +++M+ENQ+P FVL+   +    +S+ +  ++ +SV+    G C   +   I+  ++ + K   
Subjt:  CFIVEFMIKDRTYSFPPNENKVHSSFYKAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSLTNIEVFISVVHLFAGGCVWPNKSGIQIYNNNLNK---

Query:  -NPRHLLDFLSSYFVPMDE
            H+LDFL    VP  E
Subjt:  -NPRHLLDFLSSYFVPMDE

Q9SD53 UPF0481 protein At3g472001.3e-3328.01Show/hide
Query:  CSIHRVPKALLNMNHNAYVPRDISIGPFHHDKQKFKTTEELKLRFFNSYRCRVGRSAQD---IVRRARGWERKAREYYSEPINMNSEDFVIMMVLDGCFI
        C I RVP++ + +N  AY P+ +SIGP+H+ ++  +  ++ K R    +     +   +   +V+     E K R+ YSE +     D + MMVLDGCFI
Subjt:  CSIHRVPKALLNMNHNAYVPRDISIGPFHHDKQKFKTTEELKLRFFNSYRCRVGRSAQD---IVRRARGWERKAREYYSEPINMNSEDFVIMMVLDGCFI

Query:  -VEFMIKDRTYSFPPNENKVHSSFYKAIECNMSVELIMLENQLPFFVLQNLF--DLIPYQSSLTNIEVFISVVHLFAGGCVWPNKSGIQIYNNNLNKNPR
         + F+I         +E+ + S  +  +  ++  +L++LENQ+PFFVLQ L+    I   S L  I       H F       +K G   +  + N   +
Subjt:  -VEFMIKDRTYSFPPNENKVHSSFYKAIECNMSVELIMLENQLPFFVLQNLF--DLIPYQSSLTNIEVFISVVHLFAGGCVWPNKSGIQIYNNNLNKNPR

Query:  HLLDFLSSYFVP----MDEITGEH------------------HCLP--PSATELGKAGIVFKKVEGDQICIMDISFKDGILNIPPLEIDDKFESCARNLL
        HLLD +   F+P     D+ +  H                    +P   SA  L   GI F+     +  I+++  K   L IP L  D    S   N +
Subjt:  HLLDFLSSYFVP----MDEITGEH------------------HCLP--PSATELGKAGIVFKKVEGDQICIMDISFKDGILNIPPLEIDDKFESCARNLL

Query:  AFAQFGWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITVR-RNNYFSHISNDLKQHCDRKRNKWMASLRQNY
        AF QF    +  N++    Y +F+  L+  E+DV  L  + +II    GS  E+S+ F  I K++      +Y +++   + ++  +  N   A  R  +
Subjt:  AFAQFGWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITVR-RNNYFSHISNDLKQHCDRKRNKWMASLRQNY

Query:  FHTPWTLISFLAATFLIILTLLQTIFTGISNL
        F +PWT +S  A  F+I+LT+LQ+    +S L
Subjt:  FHTPWTLISFLAATFLIILTLLQTIFTGISNL

Arabidopsis top hitse value%identityAlignment
AT2G36430.1 Plant protein of unknown function (DUF247)1.4e-4329.33Show/hide
Query:  HVALVIEDNLQKMAPFVPECSIHRVPKALLNMNHNAYVPRDISIGPFHHDKQKFKTTEELKLRFFNSYRCRV-GRSAQDIVRRARGWERKAREYYSEPIN
        H  L     L   A   P CSI RVP+++++ N   Y PR +SIGP+H  + + K  EE K R+ N    R    + +D ++  +  E  ARE YSE I+
Subjt:  HVALVIEDNLQKMAPFVPECSIHRVPKALLNMNHNAYVPRDISIGPFHHDKQKFKTTEELKLRFFNSYRCRV-GRSAQDIVRRARGWERKAREYYSEPIN

Query:  MNSEDFVIMMVLDGCFIVEFMIK-DRTYSFPPNENKVHSS-----FYKAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSLTNIEVFISVVHLFAGGC
        M+SE+F  MMVLDGCF++E   K +    F PN+  V  +     FY+   C        LENQ+PFFVL+ LF+L    +         S+   F    
Subjt:  MNSEDFVIMMVLDGCFIVEFMIK-DRTYSFPPNENKVHSS-----FYKAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSLTNIEVFISVVHLFAGGC

Query:  VWPNKSGIQIYNNNLNKNPRHLLDFLSSYFVPMDEITGEHHCLP----------PSATELGKAGIVFKKVEGDQICIMDISFKDGILNIPPLEIDDKFES
        +   +  +  +        +HLLD L S F+P  E+       P           S ++L +AGI  ++++ D    + + F+ G + +P + +DD   S
Subjt:  VWPNKSGIQIYNNNLNKNPRHLLDFLSSYFVPMDEITGEHHCLP----------PSATELGKAGIVFKKVEGDQICIMDISFKDGILNIPPLEIDDKFES

Query:  CARNLLAFAQFGWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITVRRNN-YFSHISNDLKQHCDRKRNKWMA
           N +A+ Q     +    ++   Y   LD L  T KDV  L  + II N   G+  E+++  N + +++       Y   +  ++ ++     +   A
Subjt:  CARNLLAFAQFGWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITVRRNN-YFSHISNDLKQHCDRKRNKWMA

Query:  SLRQNYFHTPWTLISFLAATFLIILTLLQTIFT
        + +  YF++PW+ +S LAA  L++L+++QTI+T
Subjt:  SLRQNYFHTPWTLISFLAATFLIILTLLQTIFT

AT3G50120.1 Plant protein of unknown function (DUF247)2.4e-4329.63Show/hide
Query:  IHRVPKALLNMNHNAYVPRDISIGPFHHDKQKFKTTEELKLRFFNSYRCRVGRSAQDIVRRARGWERKAREYYSEPINMNSEDFVIMMVLDGCFIVEFMI
        I+RVP  L   ++ +Y P+ +S+GP+HH K++ ++ +  K R  N    R  +  +  +   R  E KAR  Y  P++++S +F+ M+VLDGCF++E + 
Subjt:  IHRVPKALLNMNHNAYVPRDISIGPFHHDKQKFKTTEELKLRFFNSYRCRVGRSAQDIVRRARGWERKAREYYSEPINMNSEDFVIMMVLDGCFIVEFMI

Query:  KDRTYSFPPNENKVHSSFYKAIECNMSV--ELIMLENQLPFFVLQNLFDLIPYQSSLTNIEVFISVVHLFAGGCVWP-----NKSGIQIYNNNLNKNPR-
        +     F       +   +       S+  +++MLENQLP FVL  L +L   Q    N    ++ + +     + P      KSG     N+L ++   
Subjt:  KDRTYSFPPNENKVHSSFYKAIECNMSV--ELIMLENQLPFFVLQNLFDLIPYQSSLTNIEVFISVVHLFAGGCVWP-----NKSGIQIYNNNLNKNPR-

Query:  ---------HLLD-----FLSSYFVPMDEITGEH----------------HCLPPSATELGKAGIVFKKVEGDQICIMDISFKDGILNIPPLEIDDKFES
                 H LD      L S   P   +T +                 HC+    TEL +AGI F++ + D+    D+ FK+G L IP L I D  +S
Subjt:  ---------HLLD-----FLSSYFVPMDEITGEH----------------HCLPPSATELGKAGIVFKKVEGDQICIMDISFKDGILNIPPLEIDDKFES

Query:  CARNLLAFAQFGWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITV-RRNNYFSHISNDLKQHCDRKRNKWMA
           NL+AF Q   + +     +   Y +F+D LI + +DV+ L   GII + + GS  E++ LFN +C+ +     ++Y S +S ++ ++ D K N W A
Subjt:  CARNLLAFAQFGWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITV-RRNNYFSHISNDLKQHCDRKRNKWMA

Query:  SLRQNYFHTPWTLISFLAATFLIILTLLQTIF
        +L+  YF+ PW ++SF AA  L++LT  Q+ +
Subjt:  SLRQNYFHTPWTLISFLAATFLIILTLLQTIF

AT3G50150.1 Plant protein of unknown function (DUF247)2.0e-4531.21Show/hide
Query:  IHRVPKALLNMNHNAYVPRDISIGPFHHDKQKFKTTEELKLRFFNSYRCRVGRSAQDIVRRARGWERKAREYYSEPINM-NSEDFVIMMVLDGCFIVEFM
        I+RVP  L   +  +Y+P+ +SIGP+HH K   +  E  K R  N    R   + +  +   +  E +AR  Y  PI+M NS +F  M+VLDGCF++E +
Subjt:  IHRVPKALLNMNHNAYVPRDISIGPFHHDKQKFKTTEELKLRFFNSYRCRVGRSAQDIVRRARGWERKAREYYSEPINM-NSEDFVIMMVLDGCFIVEFM

Query:  IKDRTYSFPPNENKVHSSFY--KAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSLTNIEVFISVVHLFAGGCVWPNKSGIQIYNNNLNKNPR-----
         K     F       +   +  + +  ++  ++IMLENQLP FVL  L  L   Q+   N    ++ V +     + P    +     +L+   +     
Subjt:  IKDRTYSFPPNENKVHSSFY--KAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSLTNIEVFISVVHLFAGGCVWPNKSGIQIYNNNLNKNPR-----

Query:  -----HLLDFLSSYFVPMDEITGEH----------------HCLPPSATELGKAGIVFKKVEGDQICIMDISFKDGILNIPPLEIDDKFESCARNLLAFA
             H LD      +   E T +                 HC+    TEL  AG+ F + E  Q  + DI FK+G L IP L I D  +S   NL+AF 
Subjt:  -----HLLDFLSSYFVPMDEITGEH----------------HCLPPSATELGKAGIVFKKVEGDQICIMDISFKDGILNIPPLEIDDKFESCARNLLAFA

Query:  QFGWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITV-RRNNYFSHISNDLKQHCDRKRNKWMASLRQNYFHT
        Q   + +     N   Y +F+D LI + +DV+ L  +GII + + GS  E++ LFN +CK +    ++ Y S +S ++ ++  RK N   A+LRQ YF+ 
Subjt:  QFGWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITV-RRNNYFSHISNDLKQHCDRKRNKWMASLRQNYFHT

Query:  PWTLISFLAATFLIILTLLQTIF
        PW   SF AA  L+ LT  Q+ F
Subjt:  PWTLISFLAATFLIILTLLQTIF

AT3G50170.1 Plant protein of unknown function (DUF247)9.1e-4330.84Show/hide
Query:  IHRVPKALLNMNHNAYVPRDISIGPFHHDKQKFKTTEELKLRFFNSYRCRVGRSAQDIVRRARGWERKAREYYSEPINMNSEDFVIMMVLDGCFIVEFM-
        I+RVP  L   +  +Y P+ +S+GP+HH K++ +  E  K R  N    R+ +  +      R  E KAR  Y  PI+++  +F  M+VLDGCF++E   
Subjt:  IHRVPKALLNMNHNAYVPRDISIGPFHHDKQKFKTTEELKLRFFNSYRCRVGRSAQDIVRRARGWERKAREYYSEPINMNSEDFVIMMVLDGCFIVEFM-

Query:  --IKDRTYSFPPNENKVHSSFYKAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSLTNIEVFISVVH---LFAGG-----------CVWPNKS-----
          ++  T       + V +   + +  ++  ++IMLENQLP FVL  L +L     + T I   ++V     L   G             W  KS     
Subjt:  --IKDRTYSFPPNENKVHSSFYKAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSLTNIEVFISVVH---LFAGG-----------CVWPNKS-----

Query:  ------GIQIYNNNL-----NKNPRHLLDFLSSYFVPMDEITGE-HHCLPPSATELGKAGIVFKKVEGDQICIMDISFKDGILNIPPLEIDDKFESCARN
               + ++  +L       N R LL  L+     +D+   +  HC+    TEL +AG+ F+K + D+    DI FK+G L IP L I D  +S   N
Subjt:  ------GIQIYNNNL-----NKNPRHLLDFLSSYFVPMDEITGE-HHCLPPSATELGKAGIVFKKVEGDQICIMDISFKDGILNIPPLEIDDKFESCARN

Query:  LLAFAQFGWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITV-RRNNYFSHISNDLKQHCDRKRNKWMASLRQ
        L+AF Q   E +     +   Y +F+D LI + +DV+ L   GII + + GS  E++ LFN +C+ +    ++++ S +S D+ ++ +RK N   A+L  
Subjt:  LLAFAQFGWEENGKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITV-RRNNYFSHISNDLKQHCDRKRNKWMASLRQ

Query:  NYFHTPWTLISFLAATFLIILTLLQTIF
         YF+ PW   SF AA  L++LTL Q+ +
Subjt:  NYFHTPWTLISFLAATFLIILTLLQTIF

AT4G31980.1 unknown protein9.3e-6433.82Show/hide
Query:  IEDNLQKMAPFVPECSIHRVPKALLNMNHNAYVPRDISIGPFHHDKQKFKTTEELKLRFFNSYRCRVGRSAQDIVRRARGWERKAREYYSEPINMNSEDF
        I+  L  ++    +C I++VP  L  +N +AY PR +S GP H  K++ +  E+ K R+  S+  R   S +D+VR AR WE+ AR  Y+E + ++S++F
Subjt:  IEDNLQKMAPFVPECSIHRVPKALLNMNHNAYVPRDISIGPFHHDKQKFKTTEELKLRFFNSYRCRVGRSAQDIVRRARGWERKAREYYSEPINMNSEDF

Query:  VIMMVLDGCFIVEFMIKDRTYSFPPNENKVHSSFYKAIECNMSVELIMLENQLPFFVLQNLFDLI--PYQSSLTNIEVFISVVHLFAGGCVWPNKSGIQI
        V M+V+DG F+VE +++          +++  +    +  ++  ++I++ENQLPFFV++ +F L+   YQ    +I + ++  H         +    +I
Subjt:  VIMMVLDGCFIVEFMIKDRTYSFPPNENKVHSSFYKAIECNMSVELIMLENQLPFFVLQNLFDLI--PYQSSLTNIEVFISVVHLFAGGCVWPNKSGIQI

Query:  YNNNLNKNPRHLLDFLSSYFVPMDEITGEHHCL----PPSATELGKAGIVFKKVEGDQICIMDISFKDGILNIPPLEIDDKFESCARNLLAFAQFGWEEN
         +      P H +D L S ++P   I  E+  +     P ATEL  AG+ FK  E    C++DISF DG+L IP + +DD  ES  +N++     G+E+ 
Subjt:  YNNNLNKNPRHLLDFLSSYFVPMDEITGEHHCL----PPSATELGKAGIVFKKVEGDQICIMDISFKDGILNIPPLEIDDKFESCARNLLAFAQFGWEEN

Query:  GKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITVRRNNYFSHISNDLKQHCDRKRNKWMASLRQNYFHTPWTLISFL
          +  N +DY M L   I +  D +LL   GII+N +G S +++S LFN I K +   R  YFS +S +L+ +C+   N+W A LR++YFH PW + S  
Subjt:  GKNKVNAIDYFMFLDELITTEKDVNLLAKEGIIINCIGGSQIEISQLFNDICKNITVRRNNYFSHISNDLKQHCDRKRNKWMASLRQNYFHTPWTLISFL

Query:  AATFLIILTLLQTI
        AA  L++LT +Q++
Subjt:  AATFLIILTLLQTI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCACGTCGCACTAGTCATCGAAGATAATCTACAGAAAATGGCTCCATTTGTTCCAGAATGTAGCATCCATCGAGTTCCAAAAGCACTGCTCAACATGAATCACAA
TGCATATGTACCAAGAGACATTTCAATTGGCCCTTTTCATCATGATAAACAAAAATTCAAAACTACAGAAGAGCTCAAGCTTCGTTTTTTTAACAGTTATCGATGTCGCG
TAGGCAGGAGTGCTCAGGACATTGTAAGAAGGGCTCGAGGTTGGGAGAGAAAAGCTCGTGAGTACTACTCAGAACCTATAAACATGAACAGTGAAGATTTTGTGATAATG
ATGGTTTTAGATGGTTGTTTCATAGTGGAGTTCATGATTAAGGATCGCACATACTCATTTCCTCCAAATGAAAACAAGGTACACTCCTCCTTCTACAAAGCTATAGAATG
CAATATGAGTGTGGAGTTGATAATGCTTGAGAATCAACTTCCTTTTTTTGTCCTTCAAAACCTATTTGACCTTATTCCATACCAATCATCCCTGACCAATATTGAGGTTT
TTATATCCGTTGTACACTTATTTGCCGGTGGGTGTGTGTGGCCAAATAAGAGTGGAATCCAAATTTATAATAATAATTTAAATAAAAATCCACGCCACTTATTGGATTTC
TTAAGCTCTTATTTTGTCCCCATGGATGAGATAACTGGTGAACACCACTGCCTACCACCAAGTGCAACCGAGCTCGGGAAGGCTGGTATTGTCTTTAAGAAAGTAGAAGG
AGATCAAATATGTATTATGGACATCAGTTTCAAAGATGGGATTTTGAACATTCCACCTTTAGAAATTGATGATAAATTTGAAAGTTGTGCTAGAAACTTATTGGCATTTG
CACAGTTTGGTTGGGAGGAAAATGGTAAGAACAAGGTGAATGCAATTGATTACTTTATGTTTTTAGATGAGCTCATAACAACGGAGAAAGATGTGAACTTACTTGCGAAG
GAAGGAATCATAATAAATTGTATTGGTGGTAGCCAAATAGAAATTTCGCAACTGTTTAATGATATTTGTAAGAATATCACTGTACGTCGTAATAATTACTTCAGTCATAT
TTCAAACGATTTGAAACAACATTGTGATAGAAAACGGAACAAATGGATGGCTTCATTGAGACAAAACTATTTTCACACGCCATGGACTCTTATCTCCTTCTTGGCTGCAA
CCTTCCTTATTATACTAACTTTACTACAAACCATATTTACTGGTATATCCAATCTCAAGTAA
mRNA sequenceShow/hide mRNA sequence
GAAACTTGTTGGGTCTTTTATATAAATCTTGCAACTGCCTCTCTTACCCTAAAATCATATTTCCAAATTGCTGTTTTTTCTTCTCTCCATCTCAATGGATCACGTCGCAC
TAGTCATCGAAGATAATCTACAGAAAATGGCTCCATTTGTTCCAGAATGTAGCATCCATCGAGTTCCAAAAGCACTGCTCAACATGAATCACAATGCATATGTACCAAGA
GACATTTCAATTGGCCCTTTTCATCATGATAAACAAAAATTCAAAACTACAGAAGAGCTCAAGCTTCGTTTTTTTAACAGTTATCGATGTCGCGTAGGCAGGAGTGCTCA
GGACATTGTAAGAAGGGCTCGAGGTTGGGAGAGAAAAGCTCGTGAGTACTACTCAGAACCTATAAACATGAACAGTGAAGATTTTGTGATAATGATGGTTTTAGATGGTT
GTTTCATAGTGGAGTTCATGATTAAGGATCGCACATACTCATTTCCTCCAAATGAAAACAAGGTACACTCCTCCTTCTACAAAGCTATAGAATGCAATATGAGTGTGGAG
TTGATAATGCTTGAGAATCAACTTCCTTTTTTTGTCCTTCAAAACCTATTTGACCTTATTCCATACCAATCATCCCTGACCAATATTGAGGTTTTTATATCCGTTGTACA
CTTATTTGCCGGTGGGTGTGTGTGGCCAAATAAGAGTGGAATCCAAATTTATAATAATAATTTAAATAAAAATCCACGCCACTTATTGGATTTCTTAAGCTCTTATTTTG
TCCCCATGGATGAGATAACTGGTGAACACCACTGCCTACCACCAAGTGCAACCGAGCTCGGGAAGGCTGGTATTGTCTTTAAGAAAGTAGAAGGAGATCAAATATGTATT
ATGGACATCAGTTTCAAAGATGGGATTTTGAACATTCCACCTTTAGAAATTGATGATAAATTTGAAAGTTGTGCTAGAAACTTATTGGCATTTGCACAGTTTGGTTGGGA
GGAAAATGGTAAGAACAAGGTGAATGCAATTGATTACTTTATGTTTTTAGATGAGCTCATAACAACGGAGAAAGATGTGAACTTACTTGCGAAGGAAGGAATCATAATAA
ATTGTATTGGTGGTAGCCAAATAGAAATTTCGCAACTGTTTAATGATATTTGTAAGAATATCACTGTACGTCGTAATAATTACTTCAGTCATATTTCAAACGATTTGAAA
CAACATTGTGATAGAAAACGGAACAAATGGATGGCTTCATTGAGACAAAACTATTTTCACACGCCATGGACTCTTATCTCCTTCTTGGCTGCAACCTTCCTTATTATACT
AACTTTACTACAAACCATATTTACTGGTATATCCAATCTCAAGTAAAGCCCATCCCCATCCCCATTTCTTAAGTCTTGGTTTTCATTTGGTTTTGTTGCTCTTTGTTGAA
TATTTAGGGTTTCATTACCTAGGGTGTGCTCTTTTGTGGTGTGTGTTGCTTCTTTTTAATTATCTATATTGTAATATGTGTTCTTTTATAATTCTTGTGGTAAAAATATT
CATGTTTGTTGCTCTTCGTGGATGTATGCATAGTTCAGATATATTGTACTTTGTACACACATTTTCTATTTCTA
Protein sequenceShow/hide protein sequence
MDHVALVIEDNLQKMAPFVPECSIHRVPKALLNMNHNAYVPRDISIGPFHHDKQKFKTTEELKLRFFNSYRCRVGRSAQDIVRRARGWERKAREYYSEPINMNSEDFVIM
MVLDGCFIVEFMIKDRTYSFPPNENKVHSSFYKAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSLTNIEVFISVVHLFAGGCVWPNKSGIQIYNNNLNKNPRHLLDF
LSSYFVPMDEITGEHHCLPPSATELGKAGIVFKKVEGDQICIMDISFKDGILNIPPLEIDDKFESCARNLLAFAQFGWEENGKNKVNAIDYFMFLDELITTEKDVNLLAK
EGIIINCIGGSQIEISQLFNDICKNITVRRNNYFSHISNDLKQHCDRKRNKWMASLRQNYFHTPWTLISFLAATFLIILTLLQTIFTGISNLK