CuGenDBv2

Sed0023720 (gene) of Chayote v1 genome

Gene ID: Sed0023720
Organism: Sechium edule (Chayote v1)
Description: Plant protein of unknown function (DUF247)
Genome location: LG11:32421762..32426215
RNA-Seq Expression: Sed0023720
Synteny: Sed0023720
Gene Ontology terms: NA
InterPro domains: IPR004158 - Protein of unknown function DUF247, plant


Homology
GenBank top hits    e value    % identity    Alignment
XP_008445188.1 PREDICTED: UPF0481 protein At3g47200-like [Cucumis melo]    1.3e-91    44.81
Query:  DHVALVIEDNLQKMAP-FVPECSIHRVPKALLNMNRNAYVPRDISIGPFH-HDKQKFKTTEELKVRFFDSYRCRV-----------GMSAQDIVRRARGW
        D+V + IE  L ++ P    +CSI+RVPK L  MN  AY P+ ISIGPFH H  +     E+ K++ F +Y  RV             S +D+V+RA+ W
Subjt:  DHVALVIEDNLQKMAP-FVPECSIHRVPKALLNMNRNAYVPRDISIGPFH-HDKQKFKTTEELKVRFFDSYRCRV-----------GMSAQDIVRRARGW

Query:  ERKARQYYSEPTNMNNDEFVKMLVLDGCFIVEFMIKD-------HRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIP-YQSSENT
          +AR  Y+E  NMN+++F+KM+++DGCFIVEF I D       H   FP  +N V  SFY+    ++  +LI LENQLPFFVLQ+LFDLIP ++ + N 
Subjt:  ERKARQYYSEPTNMNNDEFVKMLVLDGCFIVEFMIKD-------HRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIP-YQSSENT

Query:  IESF-ISVVDLFARECVWPNKSGIRLYNNNLNQNPRHLLDFLSSYFVPKDAINVEHKES--------LPPSATQLREAGIVFKKVEGDQICTMDISFKDG
         +      + +   E   P        ++ L+  P+H +DFLS Y VP+     + K +        +PPS T++ EAG+  KK + +  C ++I F++G
Subjt:  IESF-ISVVDLFARECVWPNKSGIRLYNNNLNQNPRHLLDFLSSYFVPKDAINVEHKES--------LPPSATQLREAGIVFKKVEGDQICTMDISFKDG

Query:  ILNIPTLEIDDKFERCARNVLAFAQFSWEENGANKVNAIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITVRHN-NYFSHISK
        IL IP L IDD FE   RN+LAF  F  E         I Y  F+D LI TEKD NLL KE IIIN IGGS  E+SQLFN++CK ++   N NYF+  SK
Subjt:  ILNIPTLEIDDKFERCARNVLAFAQFSWEENGANKVNAIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITVRHN-NYFSHISK

Query:  DLKQHCDRKWNKWMASLRHNYFNTPWALISFLAATFLIILTLLQTIFTMIPTF
         L+ HCDR+WNK  ASL+HNYFNTPWA IS  AATFL++LT+LQTIF+ I  F
Subjt:  DLKQHCDRKWNKWMASLRHNYFNTPWALISFLAATFLIILTLLQTIFTMIPTF

XP_022131634.1 UPF0481 protein At3g47200-like [Momordica charantia]    4.8e-94    46.51
Query:  HVALVIEDNLQKM--APFVPECSIHRVPKALLNMNRNAYVPRDISIGPFHHDKQ-KFKTTEELKVRFFDSYRCRVGMSAQDIVRRARGWERKARQYYSEP
        HV + IE+  +++   P  PECSI+RVPK LLNMNR AY P+ ISIGPFHH  Q     T++ K++  DSY  RV M+ + +V+  + WE +AR  Y EP
Subjt:  HVALVIEDNLQKM--APFVPECSIHRVPKALLNMNRNAYVPRDISIGPFHHDKQ-KFKTTEELKVRFFDSYRCRVGMSAQDIVRRARGWERKARQYYSEP

Query:  TNMNNDEFVKMLVLDGCFIVEFMIKDHRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSENTIESFISVVDLFARECVWPN
          MNND+FV ML+LDGCF+V F+I D+ + +  ++N   SSFY A+  ++  ++ MLENQLPFFVLQ L+DLIP +  E    S I +++ F    +  N
Subjt:  TNMNNDEFVKMLVLDGCFIVEFMIKDHRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSENTIESFISVVDLFARECVWPN

Query:  KSGIRLYNNNLNQNPRHLLDFLSSYFVPKDAINVEHKES---LPPSATQLREAGIVFKKVEGDQICTMDISFKDGILNIPTLEIDDKFERCARNVLAFAQ
           I  + +    N +HL+D LS YF+P      +H +    + P  T+L EAG+  KK + +  C MDISFK+G+L IP L+IDD FE   RN++AF  
Subjt:  KSGIRLYNNNLNQNPRHLLDFLSSYFVPKDAINVEHKES---LPPSATQLREAGIVFKKVEGDQICTMDISFKDGILNIPTLEIDDKFERCARNVLAFAQ

Query:  FSWEENGANKVN-AIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITVRHN-NYFSHISKDLKQHCDRKWNKWMASLRHNYFNT
        +      AN     I Y +FLD +I+TEKD  LL + GIIINSIGGS  E+S+LFND+ K +++    +Y +HI+K L  HC + W +  A+L+ +YFN+
Subjt:  FSWEENGANKVN-AIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITVRHN-NYFSHISKDLKQHCDRKWNKWMASLRHNYFNT

Query:  PWALISFLAATFLIILTLLQTIFTMIPTFK
        PWA IS +AAT++IILTLLQTIFT I TFK
Subjt:  PWALISFLAATFLIILTLLQTIFTMIPTFK

XP_022961913.1 UPF0481 protein At3g47200-like [Cucurbita moschata]    1.2e-92    46.46
Query:  HVALVIEDNLQKMAPFVPECSIHRVPKALLNMNRNAYVPRDISIGPFHHDKQKFKTTEELKVRFFDSYRCRVGM---SAQDIVRRARGWERKARQYYSEP
        +V L I++ + K+ P   ECSI RVPK L NMN  AY P+ ISIGPFHH ++    TE  K+R   ++  R+G    S + + +  + W ++ R  Y EP
Subjt:  HVALVIEDNLQKMAPFVPECSIHRVPKALLNMNRNAYVPRDISIGPFHHDKQKFKTTEELKVRFFDSYRCRVGM---SAQDIVRRARGWERKARQYYSEP

Query:  TNMNNDEFVKMLVLDGCFIVEFMIKDHRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSENTIESFISVVDLFARECVWPN
         NMN+ EFV M+++DGCF+VEF+I++H     PN  ++ +   R+IE  +  +LIMLENQ+PFF+L+ LF LIP      T  SF  +  +F R+ +  N
Subjt:  TNMNNDEFVKMLVLDGCFIVEFMIKDHRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSENTIESFISVVDLFARECVWPN

Query:  KSGIRLYNNNLNQNPRHLLDFLSSYFVPKDAINVEHKES--LPPSATQLREAGIVFKKVEGDQICTMDISFKDGILNIPTLEIDDKFERCARNVLAFAQF
         S     +N  +  P+HL+DFLS +FV K ++   +  S   PP+ T+L EAG+  KK + D I  MDI F++ IL IP L IDD FE   RN++AF  F
Subjt:  KSGIRLYNNNLNQNPRHLLDFLSSYFVPKDAINVEHKES--LPPSATQLREAGIVFKKVEGDQICTMDISFKDGILNIPTLEIDDKFERCARNVLAFAQF

Query:  SWEENGANKVNAIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITVRHNNYFSHISKDLKQHCDRKWNKWMASLRHNYFNTPWA
               NK N I Y +F+D+LI+TEKD NLL K GIIIN+IGGS  E+S+LFN++CK +    +   ++IS  L++HC+R+WNK  ASL+HNYFNTPWA
Subjt:  SWEENGANKVNAIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITVRHNNYFSHISKDLKQHCDRKWNKWMASLRHNYFNTPWA

Query:  LISFLAATFLIILTLLQTIFTMIP
        ++SF AAT LIILTLLQTIF++ P
Subjt:  LISFLAATFLIILTLLQTIFTMIP

XP_023547064.1 UPF0481 protein At3g47200-like [Cucurbita pepo subsp. pepo]    1.7e-91    45.9
Query:  IEDNLQKMAPFVPECSIHRVPKALLNMNRNAYVPRDISIGPFHHDKQKFKTTEELKVRFFDSYRCRVGM---SAQDIVRRARGWERKARQYYSEPTNMNN
        I++ + K+ P   +CSI RVPK L NMN  AY P+ ISIGPFHH ++    TE  K+R   ++  R+G    S + + +  + W ++ R  Y EP NMN+
Subjt:  IEDNLQKMAPFVPECSIHRVPKALLNMNRNAYVPRDISIGPFHHDKQKFKTTEELKVRFFDSYRCRVGM---SAQDIVRRARGWERKARQYYSEPTNMNN

Query:  DEFVKMLVLDGCFIVEFMIKDHRDRFP----PNKNEVHSSFY-RAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSENTIE-SFISVVDLFARECVWP
         EFV M+V+DGCF+VEF+I+ H + +P       N +  +F+ R+IE  +  +LIMLENQ+PFF+L+ LF LIP  +S +  E  +IS        C   
Subjt:  DEFVKMLVLDGCFIVEFMIKDHRDRFP----PNKNEVHSSFY-RAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSENTIE-SFISVVDLFARECVWP

Query:  NKSGIRLYNNNLNQNPRHLLDFLSSYFVPKDAINVEHKES--LPPSATQLREAGIVFKKVEGDQICTMDISFKDGILNIPTLEIDDKFERCARNVLAFAQ
        N S  +         P+HL+DFLS +FV K ++   +  S   PPS T+L EAG+  KK E + I  MDI FK+ IL IP L IDD FE   RN++AF  
Subjt:  NKSGIRLYNNNLNQNPRHLLDFLSSYFVPKDAINVEHKES--LPPSATQLREAGIVFKKVEGDQICTMDISFKDGILNIPTLEIDDKFERCARNVLAFAQ

Query:  FSWEENGANKVNAIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITVRHNNYFSHISKDLKQHCDRKWNKWMASLRHNYFNTPW
        F       N+ N I Y  F+D+LI+TEKD NLL K GIIIN+IGGS  E+S+LFN++CK +    ++  ++IS  L++HC+R+WNK  ASL+HNYFNTPW
Subjt:  FSWEENGANKVNAIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITVRHNNYFSHISKDLKQHCDRKWNKWMASLRHNYFNTPW

Query:  ALISFLAATFLIILTLLQTIFTMIPTF
        A++SF AATFLIILTL QTIF+ +  F
Subjt:  ALISFLAATFLIILTLLQTIFTMIPTF

XP_031736550.1 UPF0481 protein At3g47200-like [Cucumis sativus]    1.3e-91    44.91
Query:  DHVALVIEDNLQKM-APFVPECSIHRVPKALLNMNRNAYVPRDISIGPF-HHDKQKFKTTEELKVRFFDSYRCRVG----------MSAQDIVRRARGWE
        D+V + IE  L ++ +    +CSI+RVPK L  MN  AY P+ ISIGPF +H  +     E+ K++ F+++  RV            S  D+V++A+ W 
Subjt:  DHVALVIEDNLQKM-APFVPECSIHRVPKALLNMNRNAYVPRDISIGPF-HHDKQKFKTTEELKVRFFDSYRCRVG----------MSAQDIVRRARGWE

Query:  RKARQYYSEPTNMNNDEFVKMLVLDGCFIVEFMIKD-------HRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSENTIE
        ++AR  Y+E  NMN+++F+KM+++DGCFIVEF I D       H   FP  +N V  SFY+    ++  +LI LENQLPFFVLQ+LFDLIP  +      
Subjt:  RKARQYYSEPTNMNNDEFVKMLVLDGCFIVEFMIKD-------HRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSENTIE

Query:  SFISVVDLFARECVWPNKSGIRLY--NNNLNQNPRHLLDFLSSYFVPKDAINVEHKES--------LPPSATQLREAGIVFKKVEGDQICTMDISFKDGI
          ++   L        N   +  Y  ++ L+  P+H +DFLS YFVP      + + S        +PPS T+L EAG+  KK E  + C M+I F++GI
Subjt:  SFISVVDLFARECVWPNKSGIRLY--NNNLNQNPRHLLDFLSSYFVPKDAINVEHKES--------LPPSATQLREAGIVFKKVEGDQICTMDISFKDGI

Query:  LNIPTLEIDDKFERCARNVLAFAQFSWEENGANKVNAIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITVRHN-NYFSHISKD
        L IP L IDD FE   RN+LAF  F  E    N    I Y  F+D LI+TEKD NLL KE IIIN IGGS  E+SQLFN++CK ++   N NYF++IS+ 
Subjt:  LNIPTLEIDDKFERCARNVLAFAQFSWEENGANKVNAIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITVRHN-NYFSHISKD

Query:  LKQHCDRKWNKWMASLRHNYFNTPWALISFLAATFLIILTLLQTIFTMIPTF
        L++HCDR WNK  ASL+HNYFNTPWA ISF AAT L++LT+LQT+F+ I  F
Subjt:  LKQHCDRKWNKWMASLRHNYFNTPWALISFLAATFLIILTLLQTIFTMIPTF

TrEMBL top hits    e value    % identity    Alignment
A0A1S3BBL9 UPF0481 protein At3g47200-like    6.3e-92    44.81
Query:  DHVALVIEDNLQKMAP-FVPECSIHRVPKALLNMNRNAYVPRDISIGPFH-HDKQKFKTTEELKVRFFDSYRCRV-----------GMSAQDIVRRARGW
        D+V + IE  L ++ P    +CSI+RVPK L  MN  AY P+ ISIGPFH H  +     E+ K++ F +Y  RV             S +D+V+RA+ W
Subjt:  DHVALVIEDNLQKMAP-FVPECSIHRVPKALLNMNRNAYVPRDISIGPFH-HDKQKFKTTEELKVRFFDSYRCRV-----------GMSAQDIVRRARGW

Query:  ERKARQYYSEPTNMNNDEFVKMLVLDGCFIVEFMIKD-------HRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIP-YQSSENT
          +AR  Y+E  NMN+++F+KM+++DGCFIVEF I D       H   FP  +N V  SFY+    ++  +LI LENQLPFFVLQ+LFDLIP ++ + N 
Subjt:  ERKARQYYSEPTNMNNDEFVKMLVLDGCFIVEFMIKD-------HRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIP-YQSSENT

Query:  IESF-ISVVDLFARECVWPNKSGIRLYNNNLNQNPRHLLDFLSSYFVPKDAINVEHKES--------LPPSATQLREAGIVFKKVEGDQICTMDISFKDG
         +      + +   E   P        ++ L+  P+H +DFLS Y VP+     + K +        +PPS T++ EAG+  KK + +  C ++I F++G
Subjt:  IESF-ISVVDLFARECVWPNKSGIRLYNNNLNQNPRHLLDFLSSYFVPKDAINVEHKES--------LPPSATQLREAGIVFKKVEGDQICTMDISFKDG

Query:  ILNIPTLEIDDKFERCARNVLAFAQFSWEENGANKVNAIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITVRHN-NYFSHISK
        IL IP L IDD FE   RN+LAF  F  E         I Y  F+D LI TEKD NLL KE IIIN IGGS  E+SQLFN++CK ++   N NYF+  SK
Subjt:  ILNIPTLEIDDKFERCARNVLAFAQFSWEENGANKVNAIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITVRHN-NYFSHISK

Query:  DLKQHCDRKWNKWMASLRHNYFNTPWALISFLAATFLIILTLLQTIFTMIPTF
         L+ HCDR+WNK  ASL+HNYFNTPWA IS  AATFL++LT+LQTIF+ I  F
Subjt:  DLKQHCDRKWNKWMASLRHNYFNTPWALISFLAATFLIILTLLQTIFTMIPTF

A0A5A7V9C4 UPF0481 protein    1.2e-90    45.5
Query:  ECSIHRVPKALLNMNRNAYVPRDISIGPFH-HDKQKFKTTEELKVRFFDSYRCRV-----------GMSAQDIVRRARGWERKARQYYSEPTNMNNDEFV
        +CSI+RVPK L  MN  AY P+ ISIGPFH H  +     E+ K++ F +Y  RV             S +D+V+RA+ W  +AR  Y+E  NMN+++F+
Subjt:  ECSIHRVPKALLNMNRNAYVPRDISIGPFH-HDKQKFKTTEELKVRFFDSYRCRV-----------GMSAQDIVRRARGWERKARQYYSEPTNMNNDEFV

Query:  KMLVLDGCFIVEFMIKD-------HRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIP-YQSSENTIESF-ISVVDLFARECVWPN
        KM+++DGCFIVEF I D       H   FP  +N V  SFY+    ++  +LI LENQLPFFVLQ+LFDLIP ++ + N  +      + +   E   P 
Subjt:  KMLVLDGCFIVEFMIKD-------HRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIP-YQSSENTIESF-ISVVDLFARECVWPN

Query:  KSGIRLYNNNLNQNPRHLLDFLSSYFVPKDAINVEHKES--------LPPSATQLREAGIVFKKVEGDQICTMDISFKDGILNIPTLEIDDKFERCARNV
               ++ L+  P+H +DFLS Y VP+     + K +        +PPS T++ EAG+  KK + +  C ++I F++GIL IP L IDD FE   RN+
Subjt:  KSGIRLYNNNLNQNPRHLLDFLSSYFVPKDAINVEHKES--------LPPSATQLREAGIVFKKVEGDQICTMDISFKDGILNIPTLEIDDKFERCARNV

Query:  LAFAQFSWEENGANKVNAIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITVRHN-NYFSHISKDLKQHCDRKWNKWMASLRHN
        LAF  F  E         I Y  F+D LI+TEKD NLL KE IIIN IGGS  E+SQLFN++CK ++   N NYF+  SK L+ HCDR+WNK  ASL+HN
Subjt:  LAFAQFSWEENGANKVNAIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITVRHN-NYFSHISKDLKQHCDRKWNKWMASLRHN

Query:  YFNTPWALISFLAATFLIILTLLQTIFTMIPTF
        YFNTPWA IS  AATFL++LT+LQTIF+ I  F
Subjt:  YFNTPWALISFLAATFLIILTLLQTIFTMIPTF

A0A6J1BQT6 UPF0481 protein At3g47200-like    2.3e-94    46.51
Query:  HVALVIEDNLQKM--APFVPECSIHRVPKALLNMNRNAYVPRDISIGPFHHDKQ-KFKTTEELKVRFFDSYRCRVGMSAQDIVRRARGWERKARQYYSEP
        HV + IE+  +++   P  PECSI+RVPK LLNMNR AY P+ ISIGPFHH  Q     T++ K++  DSY  RV M+ + +V+  + WE +AR  Y EP
Subjt:  HVALVIEDNLQKM--APFVPECSIHRVPKALLNMNRNAYVPRDISIGPFHHDKQ-KFKTTEELKVRFFDSYRCRVGMSAQDIVRRARGWERKARQYYSEP

Query:  TNMNNDEFVKMLVLDGCFIVEFMIKDHRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSENTIESFISVVDLFARECVWPN
          MNND+FV ML+LDGCF+V F+I D+ + +  ++N   SSFY A+  ++  ++ MLENQLPFFVLQ L+DLIP +  E    S I +++ F    +  N
Subjt:  TNMNNDEFVKMLVLDGCFIVEFMIKDHRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSENTIESFISVVDLFARECVWPN

Query:  KSGIRLYNNNLNQNPRHLLDFLSSYFVPKDAINVEHKES---LPPSATQLREAGIVFKKVEGDQICTMDISFKDGILNIPTLEIDDKFERCARNVLAFAQ
           I  + +    N +HL+D LS YF+P      +H +    + P  T+L EAG+  KK + +  C MDISFK+G+L IP L+IDD FE   RN++AF  
Subjt:  KSGIRLYNNNLNQNPRHLLDFLSSYFVPKDAINVEHKES---LPPSATQLREAGIVFKKVEGDQICTMDISFKDGILNIPTLEIDDKFERCARNVLAFAQ

Query:  FSWEENGANKVN-AIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITVRHN-NYFSHISKDLKQHCDRKWNKWMASLRHNYFNT
        +      AN     I Y +FLD +I+TEKD  LL + GIIINSIGGS  E+S+LFND+ K +++    +Y +HI+K L  HC + W +  A+L+ +YFN+
Subjt:  FSWEENGANKVN-AIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITVRHN-NYFSHISKDLKQHCDRKWNKWMASLRHNYFNT

Query:  PWALISFLAATFLIILTLLQTIFTMIPTFK
        PWA IS +AAT++IILTLLQTIFT I TFK
Subjt:  PWALISFLAATFLIILTLLQTIFTMIPTFK

A0A6J1BR71 UPF0481 protein At3g47200-like    1.2e-90    44.06
Query:  HVALVIEDNLQKMAPFVPECSIHRVPKALLNMNRNAYVPRDISIGPFHHDKQKFKTTEELKVRFFDSYRCRVGMSAQDIVRRARGWERKARQYYSEPTNM
        HV + +++ L+K+ P   ECSI+RV K L N+N  AY P+ ISIGPFHH +++F   E+LK+RF D+Y  RVGM  +D    A+GWE +AR+ Y+E  +M
Subjt:  HVALVIEDNLQKMAPFVPECSIHRVPKALLNMNRNAYVPRDISIGPFHHDKQKFKTTEELKVRFFDSYRCRVGMSAQDIVRRARGWERKARQYYSEPTNM

Query:  NNDEFVKMLVLDGCFIVEFMIKDHRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSENTIESFISVVDLFARECVWPNKSG
         +D FVKM+++DG F+VEF I+ H       +  ++ + ++AI  ++  +LI+LENQLPFF+L+ L D    + S +T   F+     F   C W   + 
Subjt:  NNDEFVKMLVLDGCFIVEFMIKDHRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSENTIESFISVVDLFARECVWPNKSG

Query:  IRLYNNNLNQNPRHLLDFLSSYFV------PKDAINVEHKESLPPSATQLREAGIVFKKVEGDQICTMDISFKDGILNIPTLEIDDKFERCARNVLAFAQ
          + +  L + P HL+DFLS Y+         D +    +ES PP+AT+L EAG+ F+K   D+   MDI FKDG+L+IP LEI D FE   RN+LA+  
Subjt:  IRLYNNNLNQNPRHLLDFLSSYFV------PKDAINVEHKESLPPSATQLREAGIVFKKVEGDQICTMDISFKDGILNIPTLEIDDKFERCARNVLAFAQ

Query:  FSWEENGANKVNAIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITVRHN-NYFSHISKDLKQHCDRKWNKWMASLRHNYFNTP
        +     G ++   I Y  FLDELI+TE+D +LL K GII N+IGG+  ++S+LFND+CK+I +  +  Y++ IS DL ++C+  W++ MASLR +YFNTP
Subjt:  FSWEENGANKVNAIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITVRHN-NYFSHISKDLKQHCDRKWNKWMASLRHNYFNTP

Query:  WALISFLAATFLIILTLLQTIFTMIPTFK
        WA ISFLAATFL++LT +Q I++ I   K
Subjt:  WALISFLAATFLIILTLLQTIFTMIPTFK

A0A6J1HD72 UPF0481 protein At3g47200-like    5.7e-93    46.46
Query:  HVALVIEDNLQKMAPFVPECSIHRVPKALLNMNRNAYVPRDISIGPFHHDKQKFKTTEELKVRFFDSYRCRVGM---SAQDIVRRARGWERKARQYYSEP
        +V L I++ + K+ P   ECSI RVPK L NMN  AY P+ ISIGPFHH ++    TE  K+R   ++  R+G    S + + +  + W ++ R  Y EP
Subjt:  HVALVIEDNLQKMAPFVPECSIHRVPKALLNMNRNAYVPRDISIGPFHHDKQKFKTTEELKVRFFDSYRCRVGM---SAQDIVRRARGWERKARQYYSEP

Query:  TNMNNDEFVKMLVLDGCFIVEFMIKDHRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSENTIESFISVVDLFARECVWPN
         NMN+ EFV M+++DGCF+VEF+I++H     PN  ++ +   R+IE  +  +LIMLENQ+PFF+L+ LF LIP      T  SF  +  +F R+ +  N
Subjt:  TNMNNDEFVKMLVLDGCFIVEFMIKDHRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSENTIESFISVVDLFARECVWPN

Query:  KSGIRLYNNNLNQNPRHLLDFLSSYFVPKDAINVEHKES--LPPSATQLREAGIVFKKVEGDQICTMDISFKDGILNIPTLEIDDKFERCARNVLAFAQF
         S     +N  +  P+HL+DFLS +FV K ++   +  S   PP+ T+L EAG+  KK + D I  MDI F++ IL IP L IDD FE   RN++AF  F
Subjt:  KSGIRLYNNNLNQNPRHLLDFLSSYFVPKDAINVEHKES--LPPSATQLREAGIVFKKVEGDQICTMDISFKDGILNIPTLEIDDKFERCARNVLAFAQF

Query:  SWEENGANKVNAIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITVRHNNYFSHISKDLKQHCDRKWNKWMASLRHNYFNTPWA
               NK N I Y +F+D+LI+TEKD NLL K GIIIN+IGGS  E+S+LFN++CK +    +   ++IS  L++HC+R+WNK  ASL+HNYFNTPWA
Subjt:  SWEENGANKVNAIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITVRHNNYFSHISKDLKQHCDRKWNKWMASLRHNYFNTPWA

Query:  LISFLAATFLIILTLLQTIFTMIP
        ++SF AAT LIILTLLQTIF++ P
Subjt:  LISFLAATFLIILTLLQTIFTMIP

SwissProt top hits    e value    % identity    Alignment
P0C897 Putative UPF0481 protein At3g02645    8.1e-20    20.52
Query:  SIHRVPKALLNMNRNAYVPRDISIGPFH------HDKQKFKTTEELKVR-FFDSYRCRVGMSAQDIVRRARGWERKARQYYSEPTNMNNDEFVKMLVLDG
        SI  VPKAL+  + ++Y P  +SIGP+H      H+ +++K     K+R  ++S+R        D+V + +  E K R  Y +    N +  + ++ +D 
Subjt:  SIHRVPKALLNMNRNAYVPRDISIGPFH------HDKQKFKTTEELKVR-FFDSYRCRVGMSAQDIVRRARGWERKARQYYSEPTNMNNDEFVKMLVLDG

Query:  CFIVEFMIKDHRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSENTIESFISVVDLFARECVWPNKSGIRLYNNNL----N
         F++EF +K +  R      +V +   R     +  +++M+ENQ+P FVL+   +    +S+E+  +  +SV+    ++    +   I+  ++ +     
Subjt:  CFIVEFMIKDHRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSENTIESFISVVDLFARECVWPNKSGIRLYNNNL----N

Query:  QNPRHLLDFLSSYFVPK---------------------------DAINVEHK-------------------ESLP-------------------------
        Q   H+LDFL    VP+                           D I  + K                    +LP                         
Subjt:  QNPRHLLDFLSSYFVPK---------------------------DAINVEHK-------------------ESLP-------------------------

Query:  -------------------PSATQLREAGIVFKKVEGDQICTMDISFKDGILNIPTLEIDDKFERCARNVLAFAQFSWEENGANKVNAIDYFMFLDELIT
                           PS + L +AG+ FK      I T+      G   +P + +D   E   RN++A+       N +  +    Y   ++ +I 
Subjt:  -------------------PSATQLREAGIVFKKVEGDQICTMDISFKDGILNIPTLEIDDKFERCARNVLAFAQFSWEENGANKVNAIDYFMFLDELIT

Query:  TEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITVRHNNYFSHISKDLKQHCDRKWNKWMASLRHNYFNTPWALISFLAATFLIILTLLQTIFTMIP
        +E+D  LL ++G++++ +   Q E ++++N + K++ +    +     +D+ ++   +W   +  L   Y    W +++FLAA  L++L  LQ    +  
Subjt:  TEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITVRHNNYFSHISKDLKQHCDRKWNKWMASLRHNYFNTPWALISFLAATFLIILTLLQTIFTMIP

Query:  TF
        +F
Subjt:  TF

Q9SD53 UPF0481 protein At3g47200    2.2e-33    27.04
Query:  CSIHRVPKALLNMNRNAYVPRDISIGPFHHDKQKFKTTEELKVR----FFDSYRCRVGMSAQDIVRRARGWERKARQYYSEPTNMNNDEFVKMLVLDGCF
        C I RVP++ + +N  AY P+ +SIGP+H+ ++  +  ++ K R    F D  + +  +    +V+     E K R+ YSE     +D  + M+VLDGCF
Subjt:  CSIHRVPKALLNMNRNAYVPRDISIGPFHHDKQKFKTTEELKVR----FFDSYRCRVGMSAQDIVRRARGWERKARQYYSEPTNMNNDEFVKMLVLDGCF

Query:  I-VEFMIKDHRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSENTIESFISVVDL-FARECVWPNKSGIRLYNNNLNQNPR
        I + F+I         +++ + S  +  +  ++  +L++LENQ+PFFVLQ L     Y  S+  + S ++ +   F +  +  +K G   +  + N   +
Subjt:  I-VEFMIKDHRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSENTIESFISVVDL-FARECVWPNKSGIRLYNNNLNQNPR

Query:  HLLDFLSSYFVPKDA---------INVEHKES--------------LPPSATQLREAGIVFKKVEGDQICTMDISFKDGILNIPTLEIDDKFERCARNVL
        HLLD +   F+P  +         + V+  E               L  SA +LR  GI F+     +   +++  K   L IP L  D        N +
Subjt:  HLLDFLSSYFVPKDA---------INVEHKES--------------LPPSATQLREAGIVFKKVEGDQICTMDISFKDGILNIPTLEIDDKFERCARNVL

Query:  AFAQFSWEENGANKVNAIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITVR-HNNYFSHISKDLKQHCDRKWNKWMASLRHNY
        AF QF    + +N++    Y +F+  L+  E+D   L  + +II +  GS  E+S+ F  I K++      +Y +++ K + ++  + +N   A  RH +
Subjt:  AFAQFSWEENGANKVNAIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITVR-HNNYFSHISKDLKQHCDRKWNKWMASLRHNY

Query:  FNTPWALISFLAATFLIILTLLQTIFTMI
        F +PW  +S  A  F+I+LT+LQ+   ++
Subjt:  FNTPWALISFLAATFLIILTLLQTIFTMI

Arabidopsis top hits    e value    % identity    Alignment
AT2G36430.1 Plant protein of unknown function (DUF247)    7.2e-48    29.16
Query:  HVALVIEDNLQKMAPFVPECSIHRVPKALLNMNRNAYVPRDISIGPFHHDKQKFKTTEELKVRFFDSYRCRV-GMSAQDIVRRARGWERKARQYYSEPTN
        H  L     L   A   P CSI RVP+++++ N   Y PR +SIGP+H  + + K  EE K R+ +    R   ++ +D ++  +  E  AR+ YSE  +
Subjt:  HVALVIEDNLQKMAPFVPECSIHRVPKALLNMNRNAYVPRDISIGPFHHDKQKFKTTEELKVRFFDSYRCRV-GMSAQDIVRRARGWERKARQYYSEPTN

Query:  MNNDEFVKMLVLDGCFIVEFMIK-DHRDRFPPNKNEVHSS-----FYRAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSENTIESFISVVDLFAREC
        M+++EF +M+VLDGCF++E   K ++   F PN   V  +     FYR   C        LENQ+PFFVL+ LF+L    +   T  S  S+   F    
Subjt:  MNNDEFVKMLVLDGCFIVEFMIK-DHRDRFPPNKNEVHSS-----FYRAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSENTIESFISVVDLFAREC

Query:  VWPNKSGIRLYNNNLNQNPRHLLDFLSSYFVPKDAINV-----EHKESLPP----SATQLREAGIVFKKVEGDQICTMDISFKDGILNIPTLEIDDKFER
        +   +  +  +        +HLLD L S F+P+  ++        KE +P     S ++LR AGI  ++++ D    + + F+ G + +P + +DD    
Subjt:  VWPNKSGIRLYNNNLNQNPRHLLDFLSSYFVPKDAINV-----EHKESLPP----SATQLREAGIVFKKVEGDQICTMDISFKDGILNIPTLEIDDKFER

Query:  CARNVLAFAQFSWEENGANKVNAIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITVRHNN-YFSHISKDLKQHCDRKWNKWMA
           N +A+ Q     + A  ++   Y   LD L  T KD   L  + II N   G+  E+++  N + +++       Y   + +++ ++    W+   A
Subjt:  CARNVLAFAQFSWEENGANKVNAIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITVRHNN-YFSHISKDLKQHCDRKWNKWMA

Query:  SLRHNYFNTPWALISFLAATFLIILTLLQTIFTMIPTFK
        + +  YFN+PW+ +S LAA  L++L+++QTI+T+   ++
Subjt:  SLRHNYFNTPWALISFLAATFLIILTLLQTIFTMIPTFK

AT3G50120.1 Plant protein of unknown function (DUF247)    1.0e-46    29.13
Query:  IHRVPKALLNMNRNAYVPRDISIGPFHHDKQKFKTTEELKVRFFDSYRCRVGMSAQDIVRRARGWERKARQYYSEPTNMNNDEFVKMLVLDGCFIVEFM-
        I+RVP  L   +  +Y P+ +S+GP+HH K++ ++ +  K R  +    R     +  +   R  E KAR  Y  P +++++EF++MLVLDGCF++E   
Subjt:  IHRVPKALLNMNRNAYVPRDISIGPFHHDKQKFKTTEELKVRFFDSYRCRVGMSAQDIVRRARGWERKARQYYSEPTNMNNDEFVKMLVLDGCFIVEFM-

Query:  --IKDHRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSENTIESFISVVDLFARECVWP-----NKSGIRLYNNNL-----
          ++   +      + V +   R    ++  +++MLENQLP FVL  L +L   Q         ++ + +   + + P      KSG     N+L     
Subjt:  --IKDHRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSENTIESFISVVDLFARECVWP-----NKSGIRLYNNNL-----

Query:  -----NQNPRHLLDFLSSYFV---PKDAINVEHK-------------ESLPPSATQLREAGIVFKKVEGDQICTMDISFKDGILNIPTLEIDDKFERCAR
             +    H LD      +   PK    +  K             + L    T+L+EAGI F++ + D+    D+ FK+G L IP L I D  +    
Subjt:  -----NQNPRHLLDFLSSYFV---PKDAINVEHK-------------ESLPPSATQLREAGIVFKKVEGDQICTMDISFKDGILNIPTLEIDDKFERCAR

Query:  NVLAFAQFSWEENGANKVNAIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITV-RHNNYFSHISKDLKQHCDRKWNKWMASLR
        N++AF Q   + +     +   Y +F+D LI + +D + L   GII + + GS  E++ LFN +C+ +     ++Y S +S ++ ++ D KWN W A+L+
Subjt:  NVLAFAQFSWEENGANKVNAIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITV-RHNNYFSHISKDLKQHCDRKWNKWMASLR

Query:  HNYFNTPWALISFLAATFLIILTLLQTIFTMIPTFK
        H YFN PWA++SF AA  L++LT  Q+ + +   +K
Subjt:  HNYFNTPWALISFLAATFLIILTLLQTIFTMIPTFK

AT3G50150.1 Plant protein of unknown function (DUF247)    2.1e-47    30.68
Query:  IHRVPKALLNMNRNAYVPRDISIGPFHHDKQKFKTTEELKVRFFDSYRCRVGMSAQDIVRRARGWERKARQYYSEPTNM-NNDEFVKMLVLDGCFIVEFM
        I+RVP  L   ++ +Y+P+ +SIGP+HH K   +  E  K R  +    R   + +  +   +  E +AR  Y  P +M N++EF +MLVLDGCF++E  
Subjt:  IHRVPKALLNMNRNAYVPRDISIGPFHHDKQKFKTTEELKVRFFDSYRCRVGMSAQDIVRRARGWERKARQYYSEPTNM-NNDEFVKMLVLDGCFIVEFM

Query:  ---IKDHRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSENTIESFISVVDLFARECVWPNKSGIRLYNNNLNQNPR----
           I+  +       + V +   R +  ++  ++IMLENQLP FVL  L  L   Q+        ++ V +   + + P    +     +L+   +    
Subjt:  ---IKDHRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSENTIESFISVVDLFARECVWPNKSGIRLYNNNLNQNPR----

Query:  ------HLLDFLSSYFVP-----------KDAINVEHKESLPPSATQLREAGIVFKKVEGDQICTMDISFKDGILNIPTLEIDDKFERCARNVLAFAQFS
              H LD      +            +D   VE ++ L    T+LR AG+ F + E  Q+   DI FK+G L IP L I D  +    N++AF Q  
Subjt:  ------HLLDFLSSYFVP-----------KDAINVEHKESLPPSATQLREAGIVFKKVEGDQICTMDISFKDGILNIPTLEIDDKFERCARNVLAFAQFS

Query:  WEENGANKVNAIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITV-RHNNYFSHISKDLKQHCDRKWNKWMASLRHNYFNTPWA
         + +     N   Y +F+D LI + +D + L  +GII + + GS  E++ LFN +CK +     + Y S +S+++ ++  RKWN   A+LR  YFN PWA
Subjt:  WEENGANKVNAIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITV-RHNNYFSHISKDLKQHCDRKWNKWMASLRHNYFNTPWA

Query:  LISFLAATFLIILTLLQTIFTMIPTFK
          SF AA  L+ LT  Q+ F +   +K
Subjt:  LISFLAATFLIILTLLQTIFTMIPTFK

AT3G50170.1 Plant protein of unknown function (DUF247)    5.2e-46    30.11
Query:  IHRVPKALLNMNRNAYVPRDISIGPFHHDKQKFKTTEELKVRFFDSYRCRVGMSAQDIVRRARGWERKARQYYSEPTNMNNDEFVKMLVLDGCFIVEFM-
        I+RVP  L   ++ +Y P+ +S+GP+HH K++ +  E  K R  +    R+    +      R  E KAR  Y  P +++ +EF +MLVLDGCF++E   
Subjt:  IHRVPKALLNMNRNAYVPRDISIGPFHHDKQKFKTTEELKVRFFDSYRCRVGMSAQDIVRRARGWERKARQYYSEPTNMNNDEFVKMLVLDGCFIVEFM-

Query:  --IKDHRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSENTIESFISVVDL--------------FARECVWPNKS-----
          ++   +      + V +   R +  ++  ++IMLENQLP FVL  L +L     ++  I + ++V                  ++   W  KS     
Subjt:  --IKDHRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSENTIESFISVVDL--------------FARECVWPNKS-----

Query:  ------GIRLYNNNLNQ-----NPRHLLDFLSSYFVPKDAINVEHKESLPPSATQLREAGIVFKKVEGDQICTMDISFKDGILNIPTLEIDDKFERCARN
               + ++  +L Q     N R LL  L+        +  + ++ L    T+LREAG+ F+K + D+    DI FK+G L IP L I D  +    N
Subjt:  ------GIRLYNNNLNQ-----NPRHLLDFLSSYFVPKDAINVEHKESLPPSATQLREAGIVFKKVEGDQICTMDISFKDGILNIPTLEIDDKFERCARN

Query:  VLAFAQFSWEENGANKVNAIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITV-RHNNYFSHISKDLKQHCDRKWNKWMASLRH
        ++AF Q   E +     +   Y +F+D LI + +D + L   GII + + GS  E++ LFN +C+ +     +++ S +S D+ ++ +RKWN   A+L H
Subjt:  VLAFAQFSWEENGANKVNAIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITV-RHNNYFSHISKDLKQHCDRKWNKWMASLRH

Query:  NYFNTPWALISFLAATFLIILTLLQTIFTMIPTFK
         YFN PWA  SF AA  L++LTL Q+ + +   +K
Subjt:  NYFNTPWALISFLAATFLIILTLLQTIFTMIPTFK

AT4G31980.1 unknown protein    5.1e-70    34.86
Query:  IEDNLQKMAPFVPECSIHRVPKALLNMNRNAYVPRDISIGPFHHDKQKFKTTEELKVRFFDSYRCRVGMSAQDIVRRARGWERKARQYYSEPTNMNNDEF
        I+  L  ++    +C I++VP  L  +N +AY PR +S GP H  K++ +  E+ K R+  S+  R   S +D+VR AR WE+ AR  Y+E   +++DEF
Subjt:  IEDNLQKMAPFVPECSIHRVPKALLNMNRNAYVPRDISIGPFHHDKQKFKTTEELKVRFFDSYRCRVGMSAQDIVRRARGWERKARQYYSEPTNMNNDEF

Query:  VKMLVLDGCFIVEFMIKDHRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSENTIESFISVVDLFARECVWPNKSGIRLYN
        V+MLV+DG F+VE +++ H  R     + +  +    +  ++  ++I++ENQLPFFV++ +F L+     + T     S++ L  R   +      R+ +
Subjt:  VKMLVLDGCFIVEFMIKDHRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSENTIESFISVVDLFARECVWPNKSGIRLYN

Query:  NNLNQNPRHLLDFLSSYFVPKDAINVEH---KESLPPSATQLREAGIVFKKVEGDQICTMDISFKDGILNIPTLEIDDKFERCARNVLAFAQFSWEENGA
              P H +D L S ++P+  I +E+   K    P AT+L  AG+ FK  E    C +DISF DG+L IPT+ +DD  E   +N++ F     E+   
Subjt:  NNLNQNPRHLLDFLSSYFVPKDAINVEH---KESLPPSATQLREAGIVFKKVEGDQICTMDISFKDGILNIPTLEIDDKFERCARNVLAFAQFSWEENGA

Query:  NKVNAIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITVRHNNYFSHISKDLKQHCDRKWNKWMASLRHNYFNTPWALISFLAA
        +  N +DY M L   I +  DA+LL   GII+N +G S +++S LFN I K +      YFS +S++L+ +C+  WN+W A LR +YF+ PWA+ S  AA
Subjt:  NKVNAIDYFMFLDELITTEKDANLLAKEGIIINSIGGSQIEISQLFNDICKNITVRHNNYFSHISKDLKQHCDRKWNKWMASLRHNYFNTPWALISFLAA

Query:  TFLIILTLLQTIFTMI
          L++LT +Q++ +++
Subjt:  TFLIILTLLQTIFTMI


Sequences
CDS sequence
ATGGATCACGTCGCACTAGTCATCGAAGATAATCTACAGAAAATGGCTCCATTTGTTCCAGAATGTAGCATCCATCGAGTTCCAAAAGCACTGCTCAACATGAATCGCAA
TGCATATGTACCAAGAGACATTTCAATTGGCCCTTTTCATCATGATAAACAAAAATTCAAAACTACAGAAGAGCTCAAGGTTCGTTTTTTTGACAGTTATCGATGTCGCG
TAGGCATGAGTGCTCAGGACATTGTAAGAAGGGCTCGAGGTTGGGAGAGAAAAGCTCGTCAGTACTACTCAGAACCTACAAACATGAACAATGATGAGTTTGTGAAAATG
CTGGTTTTAGATGGTTGTTTCATAGTAGAGTTCATGATTAAGGATCACCGAGACAGATTTCCTCCAAATAAAAACGAGGTACACTCCTCCTTCTACAGAGCTATAGAATG
CAATATGAGTGTGGAGTTGATAATGCTTGAGAATCAACTTCCTTTTTTTGTCCTTCAAAACCTATTCGACCTTATTCCATACCAATCATCCGAGAACACTATTGAGTCCT
TTATATCGGTTGTAGACTTATTTGCCCGTGAGTGTGTGTGGCCAAATAAAAGTGGGATCCGACTTTATAATAATAATTTGAATCAAAATCCACGCCACTTATTGGATTTC
TTAAGCTCTTATTTTGTCCCCAAGGATGCGATAAATGTTGAACACAAAGAAAGCCTACCTCCAAGTGCAACCCAGCTCAGGGAGGCTGGTATTGTCTTTAAGAAAGTAGA
AGGAGATCAAATATGTACTATGGACATAAGTTTCAAAGATGGGATTTTGAACATTCCAACTTTAGAAATTGATGATAAATTTGAAAGATGTGCTAGAAATGTATTGGCAT
TTGCACAGTTTAGTTGGGAGGAAAATGGTGCTAACAAGGTGAATGCAATTGATTACTTTATGTTCTTAGATGAGCTCATAACAACGGAGAAAGATGCGAACTTACTTGCG
AAGGAAGGGATCATAATAAACAGTATTGGCGGTAGCCAAATAGAAATTTCGCAACTGTTTAATGATATTTGTAAGAATATCACAGTACGTCATAATAATTACTTCAGTCA
TATTTCAAAGGATTTGAAGCAACATTGTGATAGAAAATGGAACAAGTGGATGGCTTCATTGAGACACAACTATTTTAACACGCCATGGGCTCTTATCTCCTTCTTGGCAG
CTACCTTCCTTATTATACTAACTTTACTACAAACCATATTTACTATGATACCCACTTTCAAGTAA
mRNA sequence
ATGGATCACGTCGCACTAGTCATCGAAGATAATCTACAGAAAATGGCTCCATTTGTTCCAGAATGTAGCATCCATCGAGTTCCAAAAGCACTGCTCAACATGAATCGCAA
TGCATATGTACCAAGAGACATTTCAATTGGCCCTTTTCATCATGATAAACAAAAATTCAAAACTACAGAAGAGCTCAAGGTTCGTTTTTTTGACAGTTATCGATGTCGCG
TAGGCATGAGTGCTCAGGACATTGTAAGAAGGGCTCGAGGTTGGGAGAGAAAAGCTCGTCAGTACTACTCAGAACCTACAAACATGAACAATGATGAGTTTGTGAAAATG
CTGGTTTTAGATGGTTGTTTCATAGTAGAGTTCATGATTAAGGATCACCGAGACAGATTTCCTCCAAATAAAAACGAGGTACACTCCTCCTTCTACAGAGCTATAGAATG
CAATATGAGTGTGGAGTTGATAATGCTTGAGAATCAACTTCCTTTTTTTGTCCTTCAAAACCTATTCGACCTTATTCCATACCAATCATCCGAGAACACTATTGAGTCCT
TTATATCGGTTGTAGACTTATTTGCCCGTGAGTGTGTGTGGCCAAATAAAAGTGGGATCCGACTTTATAATAATAATTTGAATCAAAATCCACGCCACTTATTGGATTTC
TTAAGCTCTTATTTTGTCCCCAAGGATGCGATAAATGTTGAACACAAAGAAAGCCTACCTCCAAGTGCAACCCAGCTCAGGGAGGCTGGTATTGTCTTTAAGAAAGTAGA
AGGAGATCAAATATGTACTATGGACATAAGTTTCAAAGATGGGATTTTGAACATTCCAACTTTAGAAATTGATGATAAATTTGAAAGATGTGCTAGAAATGTATTGGCAT
TTGCACAGTTTAGTTGGGAGGAAAATGGTGCTAACAAGGTGAATGCAATTGATTACTTTATGTTCTTAGATGAGCTCATAACAACGGAGAAAGATGCGAACTTACTTGCG
AAGGAAGGGATCATAATAAACAGTATTGGCGGTAGCCAAATAGAAATTTCGCAACTGTTTAATGATATTTGTAAGAATATCACAGTACGTCATAATAATTACTTCAGTCA
TATTTCAAAGGATTTGAAGCAACATTGTGATAGAAAATGGAACAAGTGGATGGCTTCATTGAGACACAACTATTTTAACACGCCATGGGCTCTTATCTCCTTCTTGGCAG
CTACCTTCCTTATTATACTAACTTTACTACAAACCATATTTACTATGATACCCACTTTCAAGTAA
Protein sequence
MDHVALVIEDNLQKMAPFVPECSIHRVPKALLNMNRNAYVPRDISIGPFHHDKQKFKTTEELKVRFFDSYRCRVGMSAQDIVRRARGWERKARQYYSEPTNMNNDEFVKM
LVLDGCFIVEFMIKDHRDRFPPNKNEVHSSFYRAIECNMSVELIMLENQLPFFVLQNLFDLIPYQSSENTIESFISVVDLFARECVWPNKSGIRLYNNNLNQNPRHLLDF
LSSYFVPKDAINVEHKESLPPSATQLREAGIVFKKVEGDQICTMDISFKDGILNIPTLEIDDKFERCARNVLAFAQFSWEENGANKVNAIDYFMFLDELITTEKDANLLA
KEGIIINSIGGSQIEISQLFNDICKNITVRHNNYFSHISKDLKQHCDRKWNKWMASLRHNYFNTPWALISFLAATFLIILTLLQTIFTMIPTFK
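
The protein sequence listed above is expected to be the conceptual translation of the CDS sequence. As a quick consistency check, the following minimal Python sketch translates a CDS using the standard genetic code (NCBI translation table 1). The helper name `translate_cds` is hypothetical and not part of CuGenDB; the sketch assumes the CDS starts in frame and stops at the first stop codon.

```python
# Hypothetical helper (not part of CuGenDB): translate a CDS with the
# standard genetic code, stopping at the first in-frame stop codon.
BASES = "TCAG"
# Amino acids in TCAG codon order (first, second, third base); '*' is a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate_cds(cds):
    """Return the protein encoded by `cds`, read in frame from position 0."""
    # Remove line breaks and spaces so a wrapped sequence can be pasted as is.
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        # Codons with ambiguous or unknown bases are reported as 'X'.
        residue = CODON_TABLE.get(seq[i:i + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 21 nucleotides of the CDS sequence listed above.
print(translate_cds("ATGGATCACGTCGCACTAGTC"))  # MDHVALV
```

Passing the full CDS (with or without its line breaks) to `translate_cds` and comparing the result with the protein sequence is one way to confirm the two records agree.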