; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr014556 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr014556
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPlant protein of unknown function (DUF247)
Genome locationtig00000729:537476..538842
RNA-Seq ExpressionSgr014556
SyntenySgr014556
Gene Ontology termsNA
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022131634.1 UPF0481 protein At3g47200-like [Momordica charantia]4.2e-9846.31Show/hide
Query:  NIEKNDDIEQCQEDDMILAKSMKKMLQEM-PSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGRE-HLKAMEWHKLQGLDIYLRRINMNVEAAIE
        NI  N+++++     +++  S+++M + + P  I P+CSIYRV KRL+ +N  AY PQV+SIGP HH  + +L   + HKLQ LD YL R+ M VEA ++
Subjt:  NIEKNDDIEQCQEDDMILAKSMKKMLQEM-PSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGRE-HLKAMEWHKLQGLDIYLRRINMNVEAAIE

Query:  IARGWERRARSCYAEPTNYMNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYVAIGSDLYHDFTMLENQLPLFVLESLFDRLSLESDGK---
        I + WE RARSCY EP   MN + FV M+L+DGCF++ F++L  +  ++T ENG D SFY A+ SD+Y D TMLENQLP FVL+ L+D +  + + K   
Subjt:  IARGWERRARSCYAEPTNYMNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYVAIGSDLYHDFTMLENQLPLFVLESLFDRLSLESDGK---

Query:  --------------------------AKHLVDFLISYCVPTDQMSKKGWKKNFLHPPTLTALHEAGVSILKATDKP-LMDISFTDGVLKIPPFKIYDNFE
                                   KHLVD L  Y +P     ++  K  +L  P +T L EAGV+I K  +   LMDISF +GVL+IPP  I D+FE
Subjt:  --------------------------AKHLVDFLISYCVPTDQMSKKGWKKNFLHPPTLTALHEAGVSILKATDKP-LMDISFTDGVLKIPPFKIYDNFE

Query:  TYVRNMMAFEQFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPDGDCYYCDMTNALHGYCKKRWPKWKATL
        T VRN+MAFE +   +   Y   Y  FLD +ISTEKD  LLV+A I+ N+IGGSD E+++LFN+L K + +P G  Y   +T  LH +CKK WP+ KATL
Subjt:  TYVRNMMAFEQFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPDGDCYYCDMTNALHGYCKKRWPKWKATL

Query:  RREYFKTPWTLISVVAATFIILLTLLQTLFSALS
        +R+YF +PW  IS+VAAT+II+LTLLQT+F+A+S
Subjt:  RREYFKTPWTLISVVAATFIILLTLLQTLFSALS

XP_022132066.1 UPF0481 protein At3g47200-like [Momordica charantia]2.8e-9446.24Show/hide
Query:  MEPEYVVTHQQNNGPYNIEKNDDIEQCQEDDMILAKSMKKMLQEMPSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQGLDI
        ME +++ T+  N      +K D++E  Q    I   SMK ML+++   I+ +CSIYRVSKRL  IN +AY PQ +SIGP HHG++   AME  KL+ LD 
Subjt:  MEPEYVVTHQQNNGPYNIEKNDDIEQCQEDDMILAKSMKKMLQEMPSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQGLDI

Query:  YLRRINMNVEAAIEIARGWERRARSCYAEPTNYMNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYVAIGSDLYHDFTMLENQLPLFVLESL
        YLRR+ M +E A EIA+GWE RAR CYAE  + M  ++FVKMMLVDG F++EF+ + H++    T+  L+ + + AI  D+Y D  +LENQLP F+LE L
Subjt:  YLRRINMNVEAAIEIARGWERRARSCYAEPTNYMNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYVAIGSDLYHDFTMLENQLPLFVLESL

Query:  FDRLS---------------------LESD----GKAKHLVDFL-ISYCVPTDQMSKKGWKKNFLH-PPTLTALHEAGVSILKATD--KPLMDISFTDGV
         D+ S                     L SD     K  HLVDFL   Y +PT        K N    PPT T L EAGV   KAT+  + +MDI F DGV
Subjt:  FDRLS---------------------LESD----GKAKHLVDFL-ISYCVPTDQMSKKGWKKNFLH-PPTLTALHEAGVSILKATD--KPLMDISFTDGV

Query:  LKIPPFKIYDNFETYVRNMMAFEQFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPDGDCYYCDMTNALHG
        L IP  +I+D FETYVRN++A+E +H+      +  Y+ FLD LISTE+D SLLVKA I+TNNIGG++ +++KLFN+LCK+I +     YY D++  LH 
Subjt:  LKIPPFKIYDNFETYVRNMMAFEQFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPDGDCYYCDMTNALHG

Query:  YCKKRWPKWKATLRREYFKTPWTLISVVAATFIILLTLLQTLFSALSITNTK
        YC+  W +  A+LRR+YF TPW  IS +AATF++LLT +Q ++SA+S   +K
Subjt:  YCKKRWPKWKATLRREYFKTPWTLISVVAATFIILLTLLQTLFSALSITNTK

XP_022158989.1 UPF0481 protein At3g47200-like isoform X1 [Momordica charantia]5.4e-9847.56Show/hide
Query:  QNNGPYNIEKNDDIEQCQEDDMILAKSMKKMLQEMPSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQGLDIYLRRINMNVE
        QNN PYN     ++++       +  S+KKMLQE+P  +  +C+I+RV +RL+  N  AYMPQ++SIGP HHGR+ L  ME HKL+ LD YLRR N  +E
Subjt:  QNNGPYNIEKNDDIEQCQEDDMILAKSMKKMLQEMPSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQGLDIYLRRINMNVE

Query:  AAIEIARGWERRARSCYAEPTNYMNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYVAIGSDLYHDFTMLENQLPLFVLESLFDRLSLE---
          + I R WE  AR+CYAEP N M+ ++FVKMMLVDGCF++E M++        TE   DP  + A+ +DLY D  MLENQLP FVL+ LFD+ SLE   
Subjt:  AAIEIARGWERRARSCYAEPTNYMNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYVAIGSDLYHDFTMLENQLPLFVLESLFDRLSLE---

Query:  -----------------------------SDGKAKHLVDFLISYCVPT------DQMSKKGWKKNFLHPPTLTALHEAGVSILKA-TDKPLMDISFTDGV
                                     S  K  HLVDFL  Y  P          S    +K    PPT+T L EAG+   KA   K +MDISF D V
Subjt:  -----------------------------SDGKAKHLVDFLISYCVPT------DQMSKKGWKKNFLHPPTLTALHEAGVSILKA-TDKPLMDISFTDGV

Query:  LKIPPFKIYDNFETYVRNMMAFEQFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPDGDC-YYCDMTNALH
        L+IPP +I D FETYVRN+MAFEQ+H   +  Y   Y  FL+GLIS E+D SLLVKA I+TN IGG++ E++ LFN+LCK++++  GDC  +  +  ALH
Subjt:  LKIPPFKIYDNFETYVRNMMAFEQFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPDGDC-YYCDMTNALH

Query:  GYCKKRWPKWKATLRREYFKTPWTLISVVAATFIILLTLLQTLFSALSIT
         +C  RW K  A+LRR+YF TPW  IS VAA F+ILLT LQTLFSA+S++
Subjt:  GYCKKRWPKWKATLRREYFKTPWTLISVVAATFIILLTLLQTLFSALSIT

XP_022158990.1 UPF0481 protein At3g47200-like isoform X2 [Momordica charantia]5.4e-9847.56Show/hide
Query:  QNNGPYNIEKNDDIEQCQEDDMILAKSMKKMLQEMPSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQGLDIYLRRINMNVE
        QNN PYN     ++++       +  S+KKMLQE+P  +  +C+I+RV +RL+  N  AYMPQ++SIGP HHGR+ L  ME HKL+ LD YLRR N  +E
Subjt:  QNNGPYNIEKNDDIEQCQEDDMILAKSMKKMLQEMPSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQGLDIYLRRINMNVE

Query:  AAIEIARGWERRARSCYAEPTNYMNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYVAIGSDLYHDFTMLENQLPLFVLESLFDRLSLE---
          + I R WE  AR+CYAEP N M+ ++FVKMMLVDGCF++E M++        TE   DP  + A+ +DLY D  MLENQLP FVL+ LFD+ SLE   
Subjt:  AAIEIARGWERRARSCYAEPTNYMNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYVAIGSDLYHDFTMLENQLPLFVLESLFDRLSLE---

Query:  -----------------------------SDGKAKHLVDFLISYCVPT------DQMSKKGWKKNFLHPPTLTALHEAGVSILKA-TDKPLMDISFTDGV
                                     S  K  HLVDFL  Y  P          S    +K    PPT+T L EAG+   KA   K +MDISF D V
Subjt:  -----------------------------SDGKAKHLVDFLISYCVPT------DQMSKKGWKKNFLHPPTLTALHEAGVSILKA-TDKPLMDISFTDGV

Query:  LKIPPFKIYDNFETYVRNMMAFEQFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPDGDC-YYCDMTNALH
        L+IPP +I D FETYVRN+MAFEQ+H   +  Y   Y  FL+GLIS E+D SLLVKA I+TN IGG++ E++ LFN+LCK++++  GDC  +  +  ALH
Subjt:  LKIPPFKIYDNFETYVRNMMAFEQFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPDGDC-YYCDMTNALH

Query:  GYCKKRWPKWKATLRREYFKTPWTLISVVAATFIILLTLLQTLFSALSIT
         +C  RW K  A+LRR+YF TPW  IS VAA F+ILLT LQTLFSA+S++
Subjt:  GYCKKRWPKWKATLRREYFKTPWTLISVVAATFIILLTLLQTLFSALSIT

XP_022158992.1 UPF0481 protein At3g47200-like isoform X3 [Momordica charantia]1.0e-9648.71Show/hide
Query:  LAKSMKKMLQEMPSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQGLDIYLRRINMNVEAAIEIARGWERRARSCYAEPTNY
        +  S+KKMLQE+P  +  +C+I+RV +RL+  N  AYMPQ++SIGP HHGR+ L  ME HKL+ LD YLRR N  +E  + I R WE  AR+CYAEP N 
Subjt:  LAKSMKKMLQEMPSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQGLDIYLRRINMNVEAAIEIARGWERRARSCYAEPTNY

Query:  MNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYVAIGSDLYHDFTMLENQLPLFVLESLFDRLSLE--------------------------
        M+ ++FVKMMLVDGCF++E M++        TE   DP  + A+ +DLY D  MLENQLP FVL+ LFD+ SLE                          
Subjt:  MNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYVAIGSDLYHDFTMLENQLPLFVLESLFDRLSLE--------------------------

Query:  ------SDGKAKHLVDFLISYCVPT------DQMSKKGWKKNFLHPPTLTALHEAGVSILKA-TDKPLMDISFTDGVLKIPPFKIYDNFETYVRNMMAFE
              S  K  HLVDFL  Y  P          S    +K    PPT+T L EAG+   KA   K +MDISF D VL+IPP +I D FETYVRN+MAFE
Subjt:  ------SDGKAKHLVDFLISYCVPT------DQMSKKGWKKNFLHPPTLTALHEAGVSILKA-TDKPLMDISFTDGVLKIPPFKIYDNFETYVRNMMAFE

Query:  QFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPDGDC-YYCDMTNALHGYCKKRWPKWKATLRREYFKTPW
        Q+H   +  Y   Y  FL+GLIS E+D SLLVKA I+TN IGG++ E++ LFN+LCK++++  GDC  +  +  ALH +C  RW K  A+LRR+YF TPW
Subjt:  QFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPDGDC-YYCDMTNALHGYCKKRWPKWKATLRREYFKTPW

Query:  TLISVVAATFIILLTLLQTLFSALSIT
          IS VAA F+ILLT LQTLFSA+S++
Subjt:  TLISVVAATFIILLTLLQTLFSALSIT

TrEMBL top hitse value%identityAlignment
A0A6J1BQT6 UPF0481 protein At3g47200-like2.0e-9846.31Show/hide
Query:  NIEKNDDIEQCQEDDMILAKSMKKMLQEM-PSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGRE-HLKAMEWHKLQGLDIYLRRINMNVEAAIE
        NI  N+++++     +++  S+++M + + P  I P+CSIYRV KRL+ +N  AY PQV+SIGP HH  + +L   + HKLQ LD YL R+ M VEA ++
Subjt:  NIEKNDDIEQCQEDDMILAKSMKKMLQEM-PSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGRE-HLKAMEWHKLQGLDIYLRRINMNVEAAIE

Query:  IARGWERRARSCYAEPTNYMNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYVAIGSDLYHDFTMLENQLPLFVLESLFDRLSLESDGK---
        I + WE RARSCY EP   MN + FV M+L+DGCF++ F++L  +  ++T ENG D SFY A+ SD+Y D TMLENQLP FVL+ L+D +  + + K   
Subjt:  IARGWERRARSCYAEPTNYMNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYVAIGSDLYHDFTMLENQLPLFVLESLFDRLSLESDGK---

Query:  --------------------------AKHLVDFLISYCVPTDQMSKKGWKKNFLHPPTLTALHEAGVSILKATDKP-LMDISFTDGVLKIPPFKIYDNFE
                                   KHLVD L  Y +P     ++  K  +L  P +T L EAGV+I K  +   LMDISF +GVL+IPP  I D+FE
Subjt:  --------------------------AKHLVDFLISYCVPTDQMSKKGWKKNFLHPPTLTALHEAGVSILKATDKP-LMDISFTDGVLKIPPFKIYDNFE

Query:  TYVRNMMAFEQFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPDGDCYYCDMTNALHGYCKKRWPKWKATL
        T VRN+MAFE +   +   Y   Y  FLD +ISTEKD  LLV+A I+ N+IGGSD E+++LFN+L K + +P G  Y   +T  LH +CKK WP+ KATL
Subjt:  TYVRNMMAFEQFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPDGDCYYCDMTNALHGYCKKRWPKWKATL

Query:  RREYFKTPWTLISVVAATFIILLTLLQTLFSALS
        +R+YF +PW  IS+VAAT+II+LTLLQT+F+A+S
Subjt:  RREYFKTPWTLISVVAATFIILLTLLQTLFSALS

A0A6J1BR71 UPF0481 protein At3g47200-like1.4e-9446.24Show/hide
Query:  MEPEYVVTHQQNNGPYNIEKNDDIEQCQEDDMILAKSMKKMLQEMPSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQGLDI
        ME +++ T+  N      +K D++E  Q    I   SMK ML+++   I+ +CSIYRVSKRL  IN +AY PQ +SIGP HHG++   AME  KL+ LD 
Subjt:  MEPEYVVTHQQNNGPYNIEKNDDIEQCQEDDMILAKSMKKMLQEMPSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQGLDI

Query:  YLRRINMNVEAAIEIARGWERRARSCYAEPTNYMNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYVAIGSDLYHDFTMLENQLPLFVLESL
        YLRR+ M +E A EIA+GWE RAR CYAE  + M  ++FVKMMLVDG F++EF+ + H++    T+  L+ + + AI  D+Y D  +LENQLP F+LE L
Subjt:  YLRRINMNVEAAIEIARGWERRARSCYAEPTNYMNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYVAIGSDLYHDFTMLENQLPLFVLESL

Query:  FDRLS---------------------LESD----GKAKHLVDFL-ISYCVPTDQMSKKGWKKNFLH-PPTLTALHEAGVSILKATD--KPLMDISFTDGV
         D+ S                     L SD     K  HLVDFL   Y +PT        K N    PPT T L EAGV   KAT+  + +MDI F DGV
Subjt:  FDRLS---------------------LESD----GKAKHLVDFL-ISYCVPTDQMSKKGWKKNFLH-PPTLTALHEAGVSILKATD--KPLMDISFTDGV

Query:  LKIPPFKIYDNFETYVRNMMAFEQFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPDGDCYYCDMTNALHG
        L IP  +I+D FETYVRN++A+E +H+      +  Y+ FLD LISTE+D SLLVKA I+TNNIGG++ +++KLFN+LCK+I +     YY D++  LH 
Subjt:  LKIPPFKIYDNFETYVRNMMAFEQFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPDGDCYYCDMTNALHG

Query:  YCKKRWPKWKATLRREYFKTPWTLISVVAATFIILLTLLQTLFSALSITNTK
        YC+  W +  A+LRR+YF TPW  IS +AATF++LLT +Q ++SA+S   +K
Subjt:  YCKKRWPKWKATLRREYFKTPWTLISVVAATFIILLTLLQTLFSALSITNTK

A0A6J1DXD6 UPF0481 protein At3g47200-like isoform X22.6e-9847.56Show/hide
Query:  QNNGPYNIEKNDDIEQCQEDDMILAKSMKKMLQEMPSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQGLDIYLRRINMNVE
        QNN PYN     ++++       +  S+KKMLQE+P  +  +C+I+RV +RL+  N  AYMPQ++SIGP HHGR+ L  ME HKL+ LD YLRR N  +E
Subjt:  QNNGPYNIEKNDDIEQCQEDDMILAKSMKKMLQEMPSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQGLDIYLRRINMNVE

Query:  AAIEIARGWERRARSCYAEPTNYMNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYVAIGSDLYHDFTMLENQLPLFVLESLFDRLSLE---
          + I R WE  AR+CYAEP N M+ ++FVKMMLVDGCF++E M++        TE   DP  + A+ +DLY D  MLENQLP FVL+ LFD+ SLE   
Subjt:  AAIEIARGWERRARSCYAEPTNYMNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYVAIGSDLYHDFTMLENQLPLFVLESLFDRLSLE---

Query:  -----------------------------SDGKAKHLVDFLISYCVPT------DQMSKKGWKKNFLHPPTLTALHEAGVSILKA-TDKPLMDISFTDGV
                                     S  K  HLVDFL  Y  P          S    +K    PPT+T L EAG+   KA   K +MDISF D V
Subjt:  -----------------------------SDGKAKHLVDFLISYCVPT------DQMSKKGWKKNFLHPPTLTALHEAGVSILKA-TDKPLMDISFTDGV

Query:  LKIPPFKIYDNFETYVRNMMAFEQFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPDGDC-YYCDMTNALH
        L+IPP +I D FETYVRN+MAFEQ+H   +  Y   Y  FL+GLIS E+D SLLVKA I+TN IGG++ E++ LFN+LCK++++  GDC  +  +  ALH
Subjt:  LKIPPFKIYDNFETYVRNMMAFEQFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPDGDC-YYCDMTNALH

Query:  GYCKKRWPKWKATLRREYFKTPWTLISVVAATFIILLTLLQTLFSALSIT
         +C  RW K  A+LRR+YF TPW  IS VAA F+ILLT LQTLFSA+S++
Subjt:  GYCKKRWPKWKATLRREYFKTPWTLISVVAATFIILLTLLQTLFSALSIT

A0A6J1DYL4 UPF0481 protein At3g47200-like isoform X35.0e-9748.71Show/hide
Query:  LAKSMKKMLQEMPSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQGLDIYLRRINMNVEAAIEIARGWERRARSCYAEPTNY
        +  S+KKMLQE+P  +  +C+I+RV +RL+  N  AYMPQ++SIGP HHGR+ L  ME HKL+ LD YLRR N  +E  + I R WE  AR+CYAEP N 
Subjt:  LAKSMKKMLQEMPSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQGLDIYLRRINMNVEAAIEIARGWERRARSCYAEPTNY

Query:  MNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYVAIGSDLYHDFTMLENQLPLFVLESLFDRLSLE--------------------------
        M+ ++FVKMMLVDGCF++E M++        TE   DP  + A+ +DLY D  MLENQLP FVL+ LFD+ SLE                          
Subjt:  MNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYVAIGSDLYHDFTMLENQLPLFVLESLFDRLSLE--------------------------

Query:  ------SDGKAKHLVDFLISYCVPT------DQMSKKGWKKNFLHPPTLTALHEAGVSILKA-TDKPLMDISFTDGVLKIPPFKIYDNFETYVRNMMAFE
              S  K  HLVDFL  Y  P          S    +K    PPT+T L EAG+   KA   K +MDISF D VL+IPP +I D FETYVRN+MAFE
Subjt:  ------SDGKAKHLVDFLISYCVPT------DQMSKKGWKKNFLHPPTLTALHEAGVSILKA-TDKPLMDISFTDGVLKIPPFKIYDNFETYVRNMMAFE

Query:  QFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPDGDC-YYCDMTNALHGYCKKRWPKWKATLRREYFKTPW
        Q+H   +  Y   Y  FL+GLIS E+D SLLVKA I+TN IGG++ E++ LFN+LCK++++  GDC  +  +  ALH +C  RW K  A+LRR+YF TPW
Subjt:  QFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPDGDC-YYCDMTNALHGYCKKRWPKWKATLRREYFKTPW

Query:  TLISVVAATFIILLTLLQTLFSALSIT
          IS VAA F+ILLT LQTLFSA+S++
Subjt:  TLISVVAATFIILLTLLQTLFSALSIT

A0A6J1E120 UPF0481 protein At3g47200-like isoform X12.6e-9847.56Show/hide
Query:  QNNGPYNIEKNDDIEQCQEDDMILAKSMKKMLQEMPSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQGLDIYLRRINMNVE
        QNN PYN     ++++       +  S+KKMLQE+P  +  +C+I+RV +RL+  N  AYMPQ++SIGP HHGR+ L  ME HKL+ LD YLRR N  +E
Subjt:  QNNGPYNIEKNDDIEQCQEDDMILAKSMKKMLQEMPSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQGLDIYLRRINMNVE

Query:  AAIEIARGWERRARSCYAEPTNYMNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYVAIGSDLYHDFTMLENQLPLFVLESLFDRLSLE---
          + I R WE  AR+CYAEP N M+ ++FVKMMLVDGCF++E M++        TE   DP  + A+ +DLY D  MLENQLP FVL+ LFD+ SLE   
Subjt:  AAIEIARGWERRARSCYAEPTNYMNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYVAIGSDLYHDFTMLENQLPLFVLESLFDRLSLE---

Query:  -----------------------------SDGKAKHLVDFLISYCVPT------DQMSKKGWKKNFLHPPTLTALHEAGVSILKA-TDKPLMDISFTDGV
                                     S  K  HLVDFL  Y  P          S    +K    PPT+T L EAG+   KA   K +MDISF D V
Subjt:  -----------------------------SDGKAKHLVDFLISYCVPT------DQMSKKGWKKNFLHPPTLTALHEAGVSILKA-TDKPLMDISFTDGV

Query:  LKIPPFKIYDNFETYVRNMMAFEQFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPDGDC-YYCDMTNALH
        L+IPP +I D FETYVRN+MAFEQ+H   +  Y   Y  FL+GLIS E+D SLLVKA I+TN IGG++ E++ LFN+LCK++++  GDC  +  +  ALH
Subjt:  LKIPPFKIYDNFETYVRNMMAFEQFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPDGDC-YYCDMTNALH

Query:  GYCKKRWPKWKATLRREYFKTPWTLISVVAATFIILLTLLQTLFSALSIT
         +C  RW K  A+LRR+YF TPW  IS VAA F+ILLT LQTLFSA+S++
Subjt:  GYCKKRWPKWKATLRREYFKTPWTLISVVAATFIILLTLLQTLFSALSIT

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026452.9e-0929.88Show/hide
Query:  SIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQ-GLDIYLRRINMNVEAAIEIARGWERRARSCYAEPTNYMNKNDFVKMMLVDGCFMIE
        SI+ V K L+  +  +Y P  VSIGP H  +  L  ME +KL     I  +  +      +E  +  E + R+CY +   + N    + +M VD  F+IE
Subjt:  SIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQ-GLDIYLRRINMNVEAAIEIARGWERRARSCYAEPTNYMNKNDFVKMMLVDGCFMIE

Query:  FMLLAHHETHQTTENGLDPSFYVAIGSDLYHDFTMLENQLPLFVLESLFDRLSLESDGKAKHLV
        F+ +      +T  N +         +++  D  M+ENQ+PLFVL    +   LES   A  L+
Subjt:  FMLLAHHETHQTTENGLDPSFYVAIGSDLYHDFTMLENQLPLFVLESLFDRLSLESDGKAKHLV

Q9SD53 UPF0481 protein At3g472001.5e-3727.29Show/hide
Query:  LAKSMKKMLQEMPSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQGLDIYLRRI-------NMNVEAAIEIARGWERRARSC
        L+   K+ +  + S+    C I+RV +  VA+N  AY P+VVSIGP H+G +HL+ ++ HK + L ++L          N+ V+A +++    E + R  
Subjt:  LAKSMKKMLQEMPSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQGLDIYLRRI-------NMNVEAAIEIARGWERRARSC

Query:  YAEPTNYMNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYVA-IGSDLYHDFTMLENQLPLFVLESLF--DRLSLESD--------------
        Y+E       +D + MM++DGCF++   L+         E   DP F +  + S +  D  +LENQ+P FVL++L+   ++ + SD              
Subjt:  YAEPTNYMNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYVA-IGSDLYHDFTMLENQLPLFVLESLF--DRLSLESD--------------

Query:  -----------GKAKHLVDFLISYCVPTDQMSKKGWKKNF---LH---------------PPTLTA--LHEAGVS--ILKATDKPLMDISFTDGVLKIPP
                    KAKHL+D +    +P    S K    +    LH               P  L+A  L   G+   + ++ +  ++++      L+IP 
Subjt:  -----------GKAKHLVDFLISYCVPTDQMSKKGWKKNF---LH---------------PPTLTA--LHEAGVS--ILKATDKPLMDISFTDGVLKIPP

Query:  FKIYDNFETYVRNMMAFEQFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPDGDCYYCDMTNALHGYCKKR
         +      ++  N +AFEQF+  S    +  YI F+  L++ E+D + L   +++  N  GS+ E+++ F  + K++V      Y  ++   ++ Y KK 
Subjt:  FKIYDNFETYVRNMMAFEQFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPDGDCYYCDMTNALHGYCKKR

Query:  WPKWKATLRREYFKTPWTLISVVAATFIILLTLLQTLFSALSITNTK
        +    A  R  +F++PWT +S  A  F+ILLT+LQ+  + LS  N K
Subjt:  WPKWKATLRREYFKTPWTLISVVAATFIILLTLLQTLFSALSITNTK

Arabidopsis top hitse value%identityAlignment
AT3G50120.1 Plant protein of unknown function (DUF247)4.1e-5129.93Show/hide
Query:  IEKNDDIEQCQEDDMILAKSMKKMLQEMPSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQGLDIYLRRINMNVEAAIEIAR
        I   D +EQ   DD               +++     IYRV   L   ++ +Y PQ VS+GP HHG++ L++M+ HK + ++  L+R N  ++  I+  R
Subjt:  IEKNDDIEQCQEDDMILAKSMKKMLQEMPSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQGLDIYLRRINMNVEAAIEIAR

Query:  GWERRARSCYAEPTNYMNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYVAIGS--DLYHDFTMLENQLPLFVLESL---------------
          E +AR+CY  P + ++ N+F++M+++DGCF++E    A     +      DP F +  GS   +  D  MLENQLPLFVL  L               
Subjt:  GWERRARSCYAEPTNYMNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYVAIGS--DLYHDFTMLENQLPLFVLESL---------------

Query:  ------FDRL-----SLESDGKAK--------------------HLVD-----FLISYCVPTDQMSKKGWKKN----------FLHPPTLTALHEAGVSI
              FD L      L   G++K                    H +D      L S   P  ++++K W +N           +H   +T L EAG+  
Subjt:  ------FDRL-----SLESDGKAK--------------------HLVD-----FLISYCVPTDQMSKKGWKKN----------FLHPPTLTALHEAGVSI

Query:  LKATDKPLMDISFTDGVLKIPPFKIYDNFETYVRNMMAFEQFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIV
         +       D+ F +G L+IP   I+D  ++   N++AFEQ H I     + +YI F+D LI + +D S L    I+ + + GSD E+A LFN LC+E+V
Subjt:  LKATDKPLMDISFTDGVLKIPPFKIYDNFETYVRNMMAFEQFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIV

Query:  LPDGDCYYCDMTNALHGYCKKRWPKWKATLRREYFKTPWTLISVVAATFIILLTLLQTLFS
            D Y   ++  ++ Y   +W  W+ATL+ +YF  PW ++S  AA  +++LT  Q+ ++
Subjt:  LPDGDCYYCDMTNALHGYCKKRWPKWKATLRREYFKTPWTLISVVAATFIILLTLLQTLFS

AT3G50140.1 Plant protein of unknown function (DUF247)1.2e-5028.98Show/hide
Query:  KNDDIEQCQEDDMILAK-SMKKMLQEMPSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQGLDIYLRRINMNVEAAIEIARG
        +N   E+ +E+ +I  K  M++++++  ++      IYRV   L   +  +Y PQ VS+GP HHG EHL+ M++HK + +++ ++R    +E  I+  + 
Subjt:  KNDDIEQCQEDDMILAK-SMKKMLQEMPSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQGLDIYLRRINMNVEAAIEIARG

Query:  WERRARSCYAEPTNYMNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYVAIGS--DLYHDFTMLENQLPLFVLESLFD--------------
         E RAR+CY  P   ++ N F +M+++DGCF+++    A+    +   +  DP F +  GS   +  D  MLENQLPLFVL  L +              
Subjt:  WERRARSCYAEPTNYMNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYVAIGS--DLYHDFTMLENQLPLFVLESLFD--------------

Query:  -------------------RLSLESDGK-----------AKHLVD-----FLISYCVPTDQMSKKGW----------KKNFLHPPTLTALHEAGVSILKA
                             S E++ K             H +D      L     P  ++S+  W          ++  LH   +T L EAG+   + 
Subjt:  -------------------RLSLESDGK-----------AKHLVD-----FLISYCVPTDQMSKKGW----------KKNFLHPPTLTALHEAGVSILKA

Query:  TDKPLMDISFTDGVLKIPPFKIYDNFETYVRNMMAFEQFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPD
              DI F +G L+IP   I+D  ++   N++A+EQ H I     + +YI F+D LI + +D   L   +I+ + + G+D E+A +FN LC+E+    
Subjt:  TDKPLMDISFTDGVLKIPPFKIYDNFETYVRNMMAFEQFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPD

Query:  GDCYYCDMTNALHGYCKKRWPKWKATLRREYFKTPWTLISVVAATFIILLTLLQTLFSA
         + Y  +++N +  Y  ++W   KATL+ +YF  PW   S  AA  ++LLTL Q+ F++
Subjt:  GDCYYCDMTNALHGYCKKRWPKWKATLRREYFKTPWTLISVVAATFIILLTLLQTLFSA

AT3G50150.1 Plant protein of unknown function (DUF247)3.0e-5431.43Show/hide
Query:  PYNIE-KNDDIEQCQEDDMILAK-SMKKMLQEMPSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQGLDIYLRRINMNVEAA
        P  IE K +   + +E+ +I  K  M+K L    ++      IYRV   L   +  +Y+PQ VSIGP HHG+ HL+ ME HK + +++ + R   N+E  
Subjt:  PYNIE-KNDDIEQCQEDDMILAK-SMKKMLQEMPSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQGLDIYLRRINMNVEAA

Query:  IEIARGWERRARSCYAEPTNYMNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYV-AIGSDLYHDFTMLENQLPLFVLESLF----------
        I+  +  E  AR+CY  P +  N N+F +M+++DGCF++E          +      DP F    +   +  D  MLENQLPLFVL+ L           
Subjt:  IEIARGWERRARSCYAEPTNYMNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYV-AIGSDLYHDFTMLENQLPLFVLESLF----------

Query:  -----------------------DRLSLESDGKAKHLVDFLISYCV------------------PTDQMSKKGWKKNFLHPPTLTALHEAGVSILKATDK
                                  SL+S  K+  L D    +C+                  P + MS    ++  +H   +T L  AGV+ ++    
Subjt:  -----------------------DRLSLESDGKAKHLVDFLISYCV------------------PTDQMSKKGWKKNFLHPPTLTALHEAGVSILKATDK

Query:  PLMDISFTDGVLKIPPFKIYDNFETYVRNMMAFEQFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPDGDC
         L DI F +G LKIP   I+D  ++   N++AFEQ H  S    + +YI F+D LI++ +D S L    I+ + + GSD E+A LFN LCKE++    D 
Subjt:  PLMDISFTDGVLKIPPFKIYDNFETYVRNMMAFEQFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPDGDC

Query:  YYCDMTNALHGYCKKRWPKWKATLRREYFKTPWTLISVVAATFIILLTLLQTLFS
        Y   ++  ++ Y  ++W   KATLR++YF  PW   S  AA  ++ LT  Q+ F+
Subjt:  YYCDMTNALHGYCKKRWPKWKATLRREYFKTPWTLISVVAATFIILLTLLQTLFS

AT3G50160.1 Plant protein of unknown function (DUF247)8.2e-5231Show/hide
Query:  EKNDDIEQCQEDDMILAKSMKKMLQEMPSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQGLDIYLRRINMNVEAAIEIARG
        +KN+  ++ +E  +I      K L +  ++   +  IYRV   L   +  +YMPQ+VSIGP HHG +HL  ME HK + +++ + R   ++E  I+  + 
Subjt:  EKNDDIEQCQEDDMILAKSMKKMLQEMPSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQGLDIYLRRINMNVEAAIEIARG

Query:  WERRARSCYAEPTNYMNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYV-AIGSDLYHDFTMLENQLPLFVLESL--------FDRLSLE--
         E +AR+CY  P N MN+N+F++M+++DG F+IE          +      DP F +  +   +  D  MLENQLP  VL+ L         D+++++  
Subjt:  WERRARSCYAEPTNYMNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYV-AIGSDLYHDFTMLENQLPLFVLESL--------FDRLSLE--

Query:  --------------SDGKAKHLVDFLISYCVPTDQMSKKGWKKNFLHPPTL----TALHEAGVSILKATDKPLMDISFTDGVLKIPPFKIYDNFETYVRN
                      ++    H +D L    + +   S +        P  L    T L  AGV  ++       DI F +G LKIP   I+D  ++   N
Subjt:  --------------SDGKAKHLVDFLISYCVPTDQMSKKGWKKNFLHPPTL----TALHEAGVSILKATDKPLMDISFTDGVLKIPPFKIYDNFETYVRN

Query:  MMAFEQFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPDGDCYYCDMTNALHGYCKKRWPKWKATLRREYF
        ++AFEQ H+ S    + +YI F+D LI++ +D S L    I+ N + GSD E++ LFN L KE++    D Y   +T  ++ Y +++W   KATLR +YF
Subjt:  MMAFEQFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPDGDCYYCDMTNALHGYCKKRWPKWKATLRREYF

Query:  KTPWTLISVVAATFIILLTLLQTLFSALS
          PW   S +AA  +++ T  Q+ F+  +
Subjt:  KTPWTLISVVAATFIILLTLLQTLFSALS

AT4G31980.1 unknown protein2.0e-5834.29Show/hide
Query:  QEDDMILAKSMKKMLQEMPSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQGLDIYLRRINMNVEAAIEIARGWERRARSCY
        Q +   L  S+K  L  + SS++  C IY+V  +L  +N  AY P++VS GPLH G+E L+AME  K + L  ++ R N ++E  + +AR WE+ ARSCY
Subjt:  QEDDMILAKSMKKMLQEMPSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQGLDIYLRRINMNVEAAIEIARGWERRARSCY

Query:  AEPTNYMNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYVAIGSDLYHDFTMLENQLPLFVLESLF-----------------------DRL
        AE    ++ ++FV+M++VDG F++E +L +H+   +   + +  +  +   +D+  D  ++ENQLP FV++ +F                         L
Subjt:  AEPTNYMNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYVAIGSDLYHDFTMLENQLPLFVLESLF-----------------------DRL

Query:  SLESDGK----AKHLVDFLISYCVPTDQMSKKGWKKNFLHPPTLTALHEAGVSILKA-TDKPLMDISFTDGVLKIPPFKIYDNFETYVRNMMAFEQFHLI
        S   D K     +H VD L S  +P   +  +       + P  T LH AGV    A T   L+DISF DGVLKIP   + D  E+  +N++ FEQ    
Subjt:  SLESDGK----AKHLVDFLISYCVPTDQMSKKGWKKNFLHPPTLTALHEAGVSILKA-TDKPLMDISFTDGVLKIPPFKIYDNFETYVRNMMAFEQFHLI

Query:  SRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPDGDCYYCDMTNALHGYCKKRWPKWKATLRREYFKTPWTLISVV
        ++     +YI  L   I +  DA LL+ + I+ N +G S  +++ LFN++ KE++  D   Y+  ++  L  YC   W +WKA LRR+YF  PW + SV 
Subjt:  SRPGYVANYIAFLDGLISTEKDASLLVKAEILTNNIGGSDGEIAKLFNNLCKEIVLPDGDCYYCDMTNALHGYCKKRWPKWKATLRREYFKTPWTLISVV

Query:  AATFIILLTLLQTLFSALSI
        AA  ++LLT +Q++ S L++
Subjt:  AATFIILLTLLQTLFSALSI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCCCGAATATGTTGTAACGCATCAACAAAACAACGGACCTTACAATATCGAAAAAAATGATGACATTGAACAGTGTCAGGAAGATGATATGATCCTCGCGAAATC
CATGAAGAAAATGCTCCAGGAAATGCCTTCTTCCATCACCCCAGATTGCAGCATCTATCGAGTTTCCAAACGACTAGTCGCCATTAATCATGTAGCGTATATGCCTCAAG
TCGTTTCCATCGGCCCTCTTCACCATGGTCGAGAGCATTTGAAGGCCATGGAATGGCATAAGCTTCAGGGTCTCGATATTTACCTACGCCGTATAAATATGAATGTTGAG
GCTGCCATTGAAATCGCTAGGGGTTGGGAGAGAAGAGCTCGTAGTTGCTATGCAGAACCTACAAATTACATGAACAAAAACGACTTTGTGAAAATGATGCTTGTGGATGG
CTGTTTCATGATAGAGTTTATGTTACTAGCTCATCACGAAACCCACCAAACTACAGAAAACGGGTTAGATCCTTCATTCTATGTAGCTATAGGGTCTGATTTATATCATG
ACTTTACAATGCTTGAGAATCAACTTCCTCTTTTTGTTCTTGAGTCTCTATTTGACAGACTTTCACTCGAAAGTGACGGAAAAGCAAAACACTTGGTCGATTTCTTAATC
TCCTACTGCGTCCCCACTGACCAGATGAGCAAAAAGGGTTGGAAAAAGAATTTTCTGCATCCCCCAACTTTAACCGCGCTCCATGAGGCTGGTGTTAGCATCTTGAAAGC
AACAGACAAACCCTTGATGGACATAAGCTTCACAGATGGGGTTCTGAAAATACCACCTTTCAAAATTTACGATAACTTCGAAACCTATGTGCGAAACATGATGGCTTTTG
AGCAGTTCCACTTAATTTCTAGACCGGGGTATGTAGCCAATTATATTGCATTTCTAGATGGCTTGATAAGCACAGAGAAAGACGCGAGTTTACTTGTGAAGGCGGAAATC
CTAACCAACAATATTGGTGGCAGTGACGGAGAAATTGCAAAACTGTTTAACAATCTATGTAAAGAGATAGTCCTTCCAGATGGTGACTGTTACTACTGCGATATGACCAA
CGCTTTACATGGCTATTGCAAGAAACGGTGGCCCAAGTGGAAAGCTACACTGAGACGTGAGTATTTCAAGACGCCATGGACTTTAATCTCCGTCGTAGCTGCAACCTTCA
TCATTCTCCTCACGCTCCTGCAAACCCTATTTTCTGCTTTATCGATTACCAATACCAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGCCCGAATATGTTGTAACGCATCAACAAAACAACGGACCTTACAATATCGAAAAAAATGATGACATTGAACAGTGTCAGGAAGATGATATGATCCTCGCGAAATC
CATGAAGAAAATGCTCCAGGAAATGCCTTCTTCCATCACCCCAGATTGCAGCATCTATCGAGTTTCCAAACGACTAGTCGCCATTAATCATGTAGCGTATATGCCTCAAG
TCGTTTCCATCGGCCCTCTTCACCATGGTCGAGAGCATTTGAAGGCCATGGAATGGCATAAGCTTCAGGGTCTCGATATTTACCTACGCCGTATAAATATGAATGTTGAG
GCTGCCATTGAAATCGCTAGGGGTTGGGAGAGAAGAGCTCGTAGTTGCTATGCAGAACCTACAAATTACATGAACAAAAACGACTTTGTGAAAATGATGCTTGTGGATGG
CTGTTTCATGATAGAGTTTATGTTACTAGCTCATCACGAAACCCACCAAACTACAGAAAACGGGTTAGATCCTTCATTCTATGTAGCTATAGGGTCTGATTTATATCATG
ACTTTACAATGCTTGAGAATCAACTTCCTCTTTTTGTTCTTGAGTCTCTATTTGACAGACTTTCACTCGAAAGTGACGGAAAAGCAAAACACTTGGTCGATTTCTTAATC
TCCTACTGCGTCCCCACTGACCAGATGAGCAAAAAGGGTTGGAAAAAGAATTTTCTGCATCCCCCAACTTTAACCGCGCTCCATGAGGCTGGTGTTAGCATCTTGAAAGC
AACAGACAAACCCTTGATGGACATAAGCTTCACAGATGGGGTTCTGAAAATACCACCTTTCAAAATTTACGATAACTTCGAAACCTATGTGCGAAACATGATGGCTTTTG
AGCAGTTCCACTTAATTTCTAGACCGGGGTATGTAGCCAATTATATTGCATTTCTAGATGGCTTGATAAGCACAGAGAAAGACGCGAGTTTACTTGTGAAGGCGGAAATC
CTAACCAACAATATTGGTGGCAGTGACGGAGAAATTGCAAAACTGTTTAACAATCTATGTAAAGAGATAGTCCTTCCAGATGGTGACTGTTACTACTGCGATATGACCAA
CGCTTTACATGGCTATTGCAAGAAACGGTGGCCCAAGTGGAAAGCTACACTGAGACGTGAGTATTTCAAGACGCCATGGACTTTAATCTCCGTCGTAGCTGCAACCTTCA
TCATTCTCCTCACGCTCCTGCAAACCCTATTTTCTGCTTTATCGATTACCAATACCAAGTAA
Protein sequenceShow/hide protein sequence
MEPEYVVTHQQNNGPYNIEKNDDIEQCQEDDMILAKSMKKMLQEMPSSITPDCSIYRVSKRLVAINHVAYMPQVVSIGPLHHGREHLKAMEWHKLQGLDIYLRRINMNVE
AAIEIARGWERRARSCYAEPTNYMNKNDFVKMMLVDGCFMIEFMLLAHHETHQTTENGLDPSFYVAIGSDLYHDFTMLENQLPLFVLESLFDRLSLESDGKAKHLVDFLI
SYCVPTDQMSKKGWKKNFLHPPTLTALHEAGVSILKATDKPLMDISFTDGVLKIPPFKIYDNFETYVRNMMAFEQFHLISRPGYVANYIAFLDGLISTEKDASLLVKAEI
LTNNIGGSDGEIAKLFNNLCKEIVLPDGDCYYCDMTNALHGYCKKRWPKWKATLRREYFKTPWTLISVVAATFIILLTLLQTLFSALSITNTK