; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G06070 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G06070
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPlant protein of unknown function (DUF247)
Genome locationChr3:5216297..5219145
RNA-Seq ExpressionCSPI03G06070
SyntenyCSPI03G06070
Gene Ontology termsNA
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0064943.1 UPF0481 protein [Cucumis melo var. makuwa]1.3e-12053.97Show/hide
Query:  YVPQLISIGPFHHGTPKDLIANEKYKLHGFINFLRRINIKSTVEGGISETARSLKDLLKTGTLKVLVEKAHDWVKEALNCYAT-PINMEEKD-FIIMMVV
        Y+PQLISIGP HHGT  DL+AN++YKL GFINFLRRINI        ++   S++D+L+TGTL  LVEKAH WVKEA NCY + PIN  + D F+IMM+V
Subjt:  YVPQLISIGPFHHGTPKDLIANEKYKLHGFINFLRRINIKSTVEGGISETARSLKDLLKTGTLKVLVEKAHDWVKEALNCYAT-PINMEEKD-FIIMMVV

Query:  DACFIDEFFILKVDESHPICKFDLIQENVDISFHGGIEVDITDDMIKLENQVPFFLLEHLFEEIPKENVPIR-SKFREEKEFPIYSRDDKNVPIRS-KFR
        DACFI EF ILK D  HP  KF  IQ+N+DISF+ G+++ I  D+IKLENQVPFFLL++LF+ IPK ++ +  S FR+             + +R+ KFR
Subjt:  DACFIDEFFILKVDESHPICKFDLIQENVDISFHGGIEVDITDDMIKLENQVPFFLLEHLFEEIPKENVPIR-SKFREEKEFPIYSRDDKNVPIRS-KFR

Query:  EEKELPISRDDENVPISFKDLAHWALKSGLNMQNNSDNNNCLSFLCHIPFSGCWQKLDGENNDQTKNNVCLTLFKPLCHLWNKIACKEKKSEKGSSNIPP
          +   I+         FK+  H+               + L+F   +P +G  QK++ ++           +FK                E+ +  IPP
Subjt:  EEKELPISRDDENVPISFKDLAHWALKSGLNMQNNSDNNNCLSFLCHIPFSGCWQKLDGENNDQTKNNVCLTLFKPLCHLWNKIACKEKKSEKGSSNIPP

Query:  SIPELSEAGVSIKKAENGKYMRNITFKNGVLEIPPLHIFDNFKLMLRNMVAFEQYTAGNKNKYVTQYVLFLDNLISTEKDLHLLVKAGVIINNIGGSDKE
        SI EL EAGV+IKKAE  K++ +ITFKNGVL IPPLHI+D F+L+LRNMVAFEQ +A   NKYV QYVLF+D+LISTEKD+ LLV+AGVIIN IGGSDKE
Subjt:  SIPELSEAGVSIKKAENGKYMRNITFKNGVLEIPPLHIFDNFKLMLRNMVAFEQYTAGNKNKYVTQYVLFLDNLISTEKDLHLLVKAGVIINNIGGSDKE

Query:  VSDLLNNLPKFVTQSSSPHYDSISKALCEHCNGCWNKAKASLKHNYFNTPWAFISFFAATFLILLT-VQTIFSGITTF
        V+DL NN  KF+T   S ++DSI K LCEHCNG WN+AKASLKHNYFNTPWAFISFFAAT LILLT +QTIF+ ITTF
Subjt:  VSDLLNNLPKFVTQSSSPHYDSISKALCEHCNGCWNKAKASLKHNYFNTPWAFISFFAATFLILLT-VQTIFSGITTF

XP_008445187.1 PREDICTED: UPF0481 protein At3g47200-like [Cucumis melo]3.7e-12053.77Show/hide
Query:  YVPQLISIGPFHHGTPKDLIANEKYKLHGFINFLRRINIKSTVEGGISETARSLKDLLKTGTLKVLVEKAHDWVKEALNCYAT-PINMEEKD-FIIMMVV
        Y+PQLISIGP HHGT  DL+AN++YKL GFINFLRRINI        ++   S++D+L+TGTL  LVEKAH WV+EA NCY + PIN  + D F+IMM+V
Subjt:  YVPQLISIGPFHHGTPKDLIANEKYKLHGFINFLRRINIKSTVEGGISETARSLKDLLKTGTLKVLVEKAHDWVKEALNCYAT-PINMEEKD-FIIMMVV

Query:  DACFIDEFFILKVDESHPICKFDLIQENVDISFHGGIEVDITDDMIKLENQVPFFLLEHLFEEIPKENVPIR-SKFREEKEFPIYSRDDKNVPIRS-KFR
        DACFI EF ILK D  HP  KF  IQ+N+DISF+ G+++ I  D+IKLENQVPFFLL++LF+ IPK ++ +  S FR+             + +R+ KFR
Subjt:  DACFIDEFFILKVDESHPICKFDLIQENVDISFHGGIEVDITDDMIKLENQVPFFLLEHLFEEIPKENVPIR-SKFREEKEFPIYSRDDKNVPIRS-KFR

Query:  EEKELPISRDDENVPISFKDLAHWALKSGLNMQNNSDNNNCLSFLCHIPFSGCWQKLDGENNDQTKNNVCLTLFKPLCHLWNKIACKEKKSEKGSSNIPP
          +   I+         FK+  H+               + L+F   +P +G  QK++ ++           +FK                E+ +  IPP
Subjt:  EEKELPISRDDENVPISFKDLAHWALKSGLNMQNNSDNNNCLSFLCHIPFSGCWQKLDGENNDQTKNNVCLTLFKPLCHLWNKIACKEKKSEKGSSNIPP

Query:  SIPELSEAGVSIKKAENGKYMRNITFKNGVLEIPPLHIFDNFKLMLRNMVAFEQYTAGNKNKYVTQYVLFLDNLISTEKDLHLLVKAGVIINNIGGSDKE
        SI EL EAGV+IKKAE  K++ +ITFKNGVL IPPLHI+D F+L+LRNMVAFEQ +A   NKYV QYVLF+D+LISTEKD+ LLV+AGVIIN IGGSDKE
Subjt:  SIPELSEAGVSIKKAENGKYMRNITFKNGVLEIPPLHIFDNFKLMLRNMVAFEQYTAGNKNKYVTQYVLFLDNLISTEKDLHLLVKAGVIINNIGGSDKE

Query:  VSDLLNNLPKFVTQSSSPHYDSISKALCEHCNGCWNKAKASLKHNYFNTPWAFISFFAATFLILLT-VQTIFSGITTF
        V+DL NN  KF+T   S ++DSI K LCEHCNG WN+AKASLKHNYFNTPWAFISFFAAT LILLT +QTIF+ ITTF
Subjt:  VSDLLNNLPKFVTQSSSPHYDSISKALCEHCNGCWNKAKASLKHNYFNTPWAFISFFAATFLILLT-VQTIFSGITTF

XP_031739483.1 UPF0481 protein At3g47200-like [Cucumis sativus]4.4e-12197.36Show/hide
Query:  TYVPQLISIGPFHHGTPKDLIANEKYKLHGFINFLRRINIKSTVEGGISETARSLKDLLKTGTLKVLVEKAHDWVKEALNCYATPINMEEKDFIIMMVVD
        TYVPQLISIGPFHHGTPKDLIANEKYKLHGFINFLRRINIKSTVEGGISETARSLKDLLKTGTLKVLVEKAHDWVKEALNCYATPINMEEKDFIIMMVVD
Subjt:  TYVPQLISIGPFHHGTPKDLIANEKYKLHGFINFLRRINIKSTVEGGISETARSLKDLLKTGTLKVLVEKAHDWVKEALNCYATPINMEEKDFIIMMVVD

Query:  ACFIDEFFILKVDESHPICKFDLIQENVDISFHGGIEVDITDDMIKLENQVPFFLLEHLFEEIPKENVPIRSKFREEKEFPIYSRDDKNVPIRSKFREEK
        ACFI EFFILKVDESHPICKFDLIQENVDISFHGGIEVDITDDMIKLENQVPFFLLE LFEEIPKENVPIRSKFREEKEFPI SRDDKNVP+RSKFREEK
Subjt:  ACFIDEFFILKVDESHPICKFDLIQENVDISFHGGIEVDITDDMIKLENQVPFFLLEHLFEEIPKENVPIRSKFREEKEFPIYSRDDKNVPIRSKFREEK

Query:  ELPISRDDENVPISFKDLAHWALKSGL
        ELPISRDDENVPIS KDLAHWALKS L
Subjt:  ELPISRDDENVPISFKDLAHWALKSGL

XP_038889346.1 UPF0481 protein At3g47200-like [Benincasa hispida]5.7e-12956.21Show/hide
Query:  YVPQLISIGPFHHGTPKDLIANEKYKLHGFINFLRRIN---IKSTVEGGISETARSLKDLLKTGTLKVLVEKAHDWVKEALNCYATPINMEEKDFIIMMV
        Y PQLISIGP HHGT KDLIANE+YKLHGF+NFLRR+N   I+S    G  +T           TL VLVEKAH WVKEA NCYATPI M+E++F+ MM+
Subjt:  YVPQLISIGPFHHGTPKDLIANEKYKLHGFINFLRRIN---IKSTVEGGISETARSLKDLLKTGTLKVLVEKAHDWVKEALNCYATPINMEEKDFIIMMV

Query:  VDACFIDEFFILKVDESHPICKFDLIQENVDISFHGGIEVDITDDMIKLENQVPFFLLEHLFEEIPKENVPIRSKFREEKEFPIYSRDDKNVPIRSKFRE
        VDACFI EFFILK D+ HP CKF  IQENVDISF+ GIE+DI DD+IKLENQVPFFLL  LF+ IPK NVP+ S F+++K +                  
Subjt:  VDACFIDEFFILKVDESHPICKFDLIQENVDISFHGGIEVDITDDMIKLENQVPFFLLEHLFEEIPKENVPIRSKFREEKEFPIYSRDDKNVPIRSKFRE

Query:  EKELPISRDDENVPISFKDLAHWALKSGLNMQNNSDNNNCLSFLCHIPFSGCWQKLDGENNDQTKNNVCLTLFKPLCHLWNKIACKEKKSEKGSSNIPPS
                           L H  LK GL  +++  + N      H+P               TKN                      KSEK  S IPPS
Subjt:  EKELPISRDDENVPISFKDLAHWALKSGLNMQNNSDNNNCLSFLCHIPFSGCWQKLDGENNDQTKNNVCLTLFKPLCHLWNKIACKEKKSEKGSSNIPPS

Query:  IPELSEAGVSIKKAENGKYMRNITFKNGVLEIPPLHIFDNFKLMLRNMVAFEQYTAGNKNKYVTQYVLFLDNLISTEKDLHLLVKAGVIINNIGGSDKEV
        I EL EAGV+IKKA+N KYMR+ITFKNGVLEIPPLHI+D+F+LMLRNMVAFEQ+ A N+NKYVTQYVLFLD+LISTEKD+HLL+KAG+IINNIGG  KEV
Subjt:  IPELSEAGVSIKKAENGKYMRNITFKNGVLEIPPLHIFDNFKLMLRNMVAFEQYTAGNKNKYVTQYVLFLDNLISTEKDLHLLVKAGVIINNIGGSDKEV

Query:  SDLLNNLPKFVTQSSSPHYDSISKALCEHCNGCWNKAKASLKHNYFNTPWAFISFFAATFLILLTV-QTIFSGIT
        SDL NNL KFVT+SSS H+D IS+AL +HCN  WNK +ASLKH+YFNTPWA++SF AA  +I L + QT F+ ++
Subjt:  SDLLNNLPKFVTQSSSPHYDSISKALCEHCNGCWNKAKASLKHNYFNTPWAFISFFAATFLILLTV-QTIFSGIT

XP_038890800.1 UPF0481 protein At3g47200-like [Benincasa hispida]4.8e-12855.58Show/hide
Query:  YVPQLISIGPFHHGTPKDLIANEKYKLHGFINFLRRINIKSTVEGGISETARSLKDLLKTGTLKVLVEKAHDWVKEALNCYATPINMEEKD-FIIMMVVD
        Y PQLISIGP H GT KDLIANE YKL GFINFLRRIN         +    S++DLLKTG + +LVEKAH WVK+A NCYATP NM + D F++MM+VD
Subjt:  YVPQLISIGPFHHGTPKDLIANEKYKLHGFINFLRRINIKSTVEGGISETARSLKDLLKTGTLKVLVEKAHDWVKEALNCYATPINMEEKD-FIIMMVVD

Query:  ACFIDEFFILKVDESHPICKFDLIQENVDISFHGGIEVDITDDMIKLENQVPFFLLEHLFEEIPKENVPIRSKFREEKEFPIYSRDDKNVPIRSKFREEK
        ACF+ EF ILK D  HP CKF  IQ+NVDISF+ GI++ I  D+IKLENQVPFFLL +LF  IPK +VP+                              
Subjt:  ACFIDEFFILKVDESHPICKFDLIQENVDISFHGGIEVDITDDMIKLENQVPFFLLEHLFEEIPKENVPIRSKFREEKEFPIYSRDDKNVPIRSKFREEK

Query:  ELPISRDDENVPISFKDLAHWALKSGLNMQNNSDNN-------NCLSF-LCHIPFSGCWQKLDGENNDQTKNNVCLTLFKPLCHLWNKIACKEKKSEKGS
                  +  SF DL H ALK  L  +   D         + LSF    +P      K  G  N +              H W              
Subjt:  ELPISRDDENVPISFKDLAHWALKSGLNMQNNSDNN-------NCLSF-LCHIPFSGCWQKLDGENNDQTKNNVCLTLFKPLCHLWNKIACKEKKSEKGS

Query:  SNIPPSIPELSEAGVSIKKAENGKYMRNITFKNGVLEIPPLHIFDNFKLMLRNMVAFEQYTAGNKNKYVTQYVLFLDNLISTEKDLHLLVKAGVIINNIG
          IPPSI EL EAGV+IKKAEN KY+ NITFKNGVLEIPPLHI+D F+LMLRNMVAFEQ +AG KNKYV QYVLF+D+LISTEKD+ LLV+AGVIIN IG
Subjt:  SNIPPSIPELSEAGVSIKKAENGKYMRNITFKNGVLEIPPLHIFDNFKLMLRNMVAFEQYTAGNKNKYVTQYVLFLDNLISTEKDLHLLVKAGVIINNIG

Query:  GSDKEVSDLLNNLPKFVTQSSSPHYDSISKALCEHCNGCWNKAKASLKHNYFNTPWAFISFFAATFLILLTV-QTIFSGITTFP
        GSDKEVSDL NNL KF+T   S H+D I K LC+HCNG WNKAKASLKHNYFNTPWAFIS FAA+ LILLT+ QTIFS I+ FP
Subjt:  GSDKEVSDLLNNLPKFVTQSSSPHYDSISKALCEHCNGCWNKAKASLKHNYFNTPWAFISFFAATFLILLTV-QTIFSGITTFP

TrEMBL top hitse value%identityAlignment
A0A0A0L379 Uncharacterized protein2.1e-12197.36Show/hide
Query:  TYVPQLISIGPFHHGTPKDLIANEKYKLHGFINFLRRINIKSTVEGGISETARSLKDLLKTGTLKVLVEKAHDWVKEALNCYATPINMEEKDFIIMMVVD
        TYVPQLISIGPFHHGTPKDLIANEKYKLHGFINFLRRINIKSTVEGGISETARSLKDLLKTGTLKVLVEKAHDWVKEALNCYATPINMEEKDFIIMMVVD
Subjt:  TYVPQLISIGPFHHGTPKDLIANEKYKLHGFINFLRRINIKSTVEGGISETARSLKDLLKTGTLKVLVEKAHDWVKEALNCYATPINMEEKDFIIMMVVD

Query:  ACFIDEFFILKVDESHPICKFDLIQENVDISFHGGIEVDITDDMIKLENQVPFFLLEHLFEEIPKENVPIRSKFREEKEFPIYSRDDKNVPIRSKFREEK
        ACFI EFFILKVDESHPICKFDLIQENVDISFHGGIEVDITDDMIKLENQVPFFLLE LFEEIPKENVPIRSKFREEKEFPI SRDDKNVP+RSKFREEK
Subjt:  ACFIDEFFILKVDESHPICKFDLIQENVDISFHGGIEVDITDDMIKLENQVPFFLLEHLFEEIPKENVPIRSKFREEKEFPIYSRDDKNVPIRSKFREEK

Query:  ELPISRDDENVPISFKDLAHWALKSGL
        ELPISRDDENVPIS KDLAHWALKS L
Subjt:  ELPISRDDENVPISFKDLAHWALKSGL

A0A1S3BCY5 UPF0481 protein At3g47200-like1.8e-12053.77Show/hide
Query:  YVPQLISIGPFHHGTPKDLIANEKYKLHGFINFLRRINIKSTVEGGISETARSLKDLLKTGTLKVLVEKAHDWVKEALNCYAT-PINMEEKD-FIIMMVV
        Y+PQLISIGP HHGT  DL+AN++YKL GFINFLRRINI        ++   S++D+L+TGTL  LVEKAH WV+EA NCY + PIN  + D F+IMM+V
Subjt:  YVPQLISIGPFHHGTPKDLIANEKYKLHGFINFLRRINIKSTVEGGISETARSLKDLLKTGTLKVLVEKAHDWVKEALNCYAT-PINMEEKD-FIIMMVV

Query:  DACFIDEFFILKVDESHPICKFDLIQENVDISFHGGIEVDITDDMIKLENQVPFFLLEHLFEEIPKENVPIR-SKFREEKEFPIYSRDDKNVPIRS-KFR
        DACFI EF ILK D  HP  KF  IQ+N+DISF+ G+++ I  D+IKLENQVPFFLL++LF+ IPK ++ +  S FR+             + +R+ KFR
Subjt:  DACFIDEFFILKVDESHPICKFDLIQENVDISFHGGIEVDITDDMIKLENQVPFFLLEHLFEEIPKENVPIR-SKFREEKEFPIYSRDDKNVPIRS-KFR

Query:  EEKELPISRDDENVPISFKDLAHWALKSGLNMQNNSDNNNCLSFLCHIPFSGCWQKLDGENNDQTKNNVCLTLFKPLCHLWNKIACKEKKSEKGSSNIPP
          +   I+         FK+  H+               + L+F   +P +G  QK++ ++           +FK                E+ +  IPP
Subjt:  EEKELPISRDDENVPISFKDLAHWALKSGLNMQNNSDNNNCLSFLCHIPFSGCWQKLDGENNDQTKNNVCLTLFKPLCHLWNKIACKEKKSEKGSSNIPP

Query:  SIPELSEAGVSIKKAENGKYMRNITFKNGVLEIPPLHIFDNFKLMLRNMVAFEQYTAGNKNKYVTQYVLFLDNLISTEKDLHLLVKAGVIINNIGGSDKE
        SI EL EAGV+IKKAE  K++ +ITFKNGVL IPPLHI+D F+L+LRNMVAFEQ +A   NKYV QYVLF+D+LISTEKD+ LLV+AGVIIN IGGSDKE
Subjt:  SIPELSEAGVSIKKAENGKYMRNITFKNGVLEIPPLHIFDNFKLMLRNMVAFEQYTAGNKNKYVTQYVLFLDNLISTEKDLHLLVKAGVIINNIGGSDKE

Query:  VSDLLNNLPKFVTQSSSPHYDSISKALCEHCNGCWNKAKASLKHNYFNTPWAFISFFAATFLILLT-VQTIFSGITTF
        V+DL NN  KF+T   S ++DSI K LCEHCNG WN+AKASLKHNYFNTPWAFISFFAAT LILLT +QTIF+ ITTF
Subjt:  VSDLLNNLPKFVTQSSSPHYDSISKALCEHCNGCWNKAKASLKHNYFNTPWAFISFFAATFLILLT-VQTIFSGITTF

A0A1S3BD29 LOW QUALITY PROTEIN: UPF0481 protein At3g47200-like4.0e-11250.4Show/hide
Query:  QQLRD--DATYVPQLISIGPFHH-GTPKDLIANEKYKLHGFINFLRRINIKSTV--EGGISETARSLKDLLKTGTLKVLVEKAHDWVKEALNCYATPIN-
        +QLR+     Y PQLISIGPFHH     D  A E+YKL   +NFLRRIN  +    E    + +RSL DLLK GTLKVLVEK H W+ E  NCY+ PI+ 
Subjt:  QQLRD--DATYVPQLISIGPFHH-GTPKDLIANEKYKLHGFINFLRRINIKSTV--EGGISETARSLKDLLKTGTLKVLVEKAHDWVKEALNCYATPIN-

Query:  MEEKDFIIMMVVDACFIDEFFILKVDESHP-ICKFDLIQENVD-ISFHGGIEVDITDDMIKLENQVPFFLLEHLFEEIPKENVPIR-SKFREEKEFPIYS
        M++ +F+IMM++DACFI E FI + D S+P   KF  IQ+NVD +  +     DI++D+IKLENQVPFFLL+H+F  IP+ + P+          +  + 
Subjt:  MEEKDFIIMMVVDACFIDEFFILKVDESHP-ICKFDLIQENVD-ISFHGGIEVDITDDMIKLENQVPFFLLEHLFEEIPKENVPIR-SKFREEKEFPIYS

Query:  RDDKNVPIRSKFREEKELPISRDDENVPISFKDLAHWALKSGLNMQNNSDNN-NCLSFLCHIPFSGCWQKLDGENNDQTKNNVCLTLFKPLCHLWNKIAC
           K   +     E K L        +P    D      +   N +N   NN N LSF   +     WQ  D  N  +   N CL               
Subjt:  RDDKNVPIRSKFREEKELPISRDDENVPISFKDLAHWALKSGLNMQNNSDNN-NCLSFLCHIPFSGCWQKLDGENNDQTKNNVCLTLFKPLCHLWNKIAC

Query:  KEKKSEKGSSNIPPSIPELSEAGVSIKKAENGKYMRNITFKNGVLEIPPLHIFDNFKLMLRNMVAFEQYTAGNKNKYVTQYVLFLDNLISTEKDLHLLVK
                     PSI EL E+GV+I+KA+N KY+ NITFKNGVL+IP LHI+D F+LM RN++AFEQ+ AGN+N Y TQY+LF+D+LISTEKD+ LLV 
Subjt:  KEKKSEKGSSNIPPSIPELSEAGVSIKKAENGKYMRNITFKNGVLEIPPLHIFDNFKLMLRNMVAFEQYTAGNKNKYVTQYVLFLDNLISTEKDLHLLVK

Query:  AGVIINNIGGSDKEVSDLLNNLPKFVTQSSSPH-YDSISKALCEHCNGCWNKAKASLKHNYFNTPWAFISFFAATFLILLT-VQTIFSGITTFP
        +GVIINNIGGSDKEVS+L NNL KFV Q  SP+ ++ ISKAL +HCNG WNKAKASLKHNYFNTPWAFISFFAA+FL+LLT +QTIFSGI+ FP
Subjt:  AGVIINNIGGSDKEVSDLLNNLPKFVTQSSSPH-YDSISKALCEHCNGCWNKAKASLKHNYFNTPWAFISFFAATFLILLT-VQTIFSGITTFP

A0A5A7V9V0 UPF0481 protein6.2e-12153.97Show/hide
Query:  YVPQLISIGPFHHGTPKDLIANEKYKLHGFINFLRRINIKSTVEGGISETARSLKDLLKTGTLKVLVEKAHDWVKEALNCYAT-PINMEEKD-FIIMMVV
        Y+PQLISIGP HHGT  DL+AN++YKL GFINFLRRINI        ++   S++D+L+TGTL  LVEKAH WVKEA NCY + PIN  + D F+IMM+V
Subjt:  YVPQLISIGPFHHGTPKDLIANEKYKLHGFINFLRRINIKSTVEGGISETARSLKDLLKTGTLKVLVEKAHDWVKEALNCYAT-PINMEEKD-FIIMMVV

Query:  DACFIDEFFILKVDESHPICKFDLIQENVDISFHGGIEVDITDDMIKLENQVPFFLLEHLFEEIPKENVPIR-SKFREEKEFPIYSRDDKNVPIRS-KFR
        DACFI EF ILK D  HP  KF  IQ+N+DISF+ G+++ I  D+IKLENQVPFFLL++LF+ IPK ++ +  S FR+             + +R+ KFR
Subjt:  DACFIDEFFILKVDESHPICKFDLIQENVDISFHGGIEVDITDDMIKLENQVPFFLLEHLFEEIPKENVPIR-SKFREEKEFPIYSRDDKNVPIRS-KFR

Query:  EEKELPISRDDENVPISFKDLAHWALKSGLNMQNNSDNNNCLSFLCHIPFSGCWQKLDGENNDQTKNNVCLTLFKPLCHLWNKIACKEKKSEKGSSNIPP
          +   I+         FK+  H+               + L+F   +P +G  QK++ ++           +FK                E+ +  IPP
Subjt:  EEKELPISRDDENVPISFKDLAHWALKSGLNMQNNSDNNNCLSFLCHIPFSGCWQKLDGENNDQTKNNVCLTLFKPLCHLWNKIACKEKKSEKGSSNIPP

Query:  SIPELSEAGVSIKKAENGKYMRNITFKNGVLEIPPLHIFDNFKLMLRNMVAFEQYTAGNKNKYVTQYVLFLDNLISTEKDLHLLVKAGVIINNIGGSDKE
        SI EL EAGV+IKKAE  K++ +ITFKNGVL IPPLHI+D F+L+LRNMVAFEQ +A   NKYV QYVLF+D+LISTEKD+ LLV+AGVIIN IGGSDKE
Subjt:  SIPELSEAGVSIKKAENGKYMRNITFKNGVLEIPPLHIFDNFKLMLRNMVAFEQYTAGNKNKYVTQYVLFLDNLISTEKDLHLLVKAGVIINNIGGSDKE

Query:  VSDLLNNLPKFVTQSSSPHYDSISKALCEHCNGCWNKAKASLKHNYFNTPWAFISFFAATFLILLT-VQTIFSGITTF
        V+DL NN  KF+T   S ++DSI K LCEHCNG WN+AKASLKHNYFNTPWAFISFFAAT LILLT +QTIF+ ITTF
Subjt:  VSDLLNNLPKFVTQSSSPHYDSISKALCEHCNGCWNKAKASLKHNYFNTPWAFISFFAATFLILLT-VQTIFSGITTF

A0A5A7VF39 UPF0481 protein1.6e-11350.81Show/hide
Query:  QQLRD--DATYVPQLISIGPFHH-GTPKDLIANEKYKLHGFINFLRRINIKSTV--EGGISETARSLKDLLKTGTLKVLVEKAHDWVKEALNCYATPIN-
        +QLR+     Y PQLISIGPFHH     D  A E+YKL   +NFLRRIN  +    E    + +RSL DLLK GTLKVLVEK H W+ E  NCY+ PI+ 
Subjt:  QQLRD--DATYVPQLISIGPFHH-GTPKDLIANEKYKLHGFINFLRRINIKSTV--EGGISETARSLKDLLKTGTLKVLVEKAHDWVKEALNCYATPIN-

Query:  MEEKDFIIMMVVDACFIDEFFILKVDESHP-ICKFDLIQENVD-ISFHGGIEVDITDDMIKLENQVPFFLLEHLFEEIPKENVPIRSKFREEKEFPIYSR
        M++ +F+IMM++DACFI E FI + D S+P   KF  IQ+NVD +  +     DI++D+IKLENQVPFFLL+H+F  IP+ + P+   F       +Y  
Subjt:  MEEKDFIIMMVVDACFIDEFFILKVDESHP-ICKFDLIQENVD-ISFHGGIEVDITDDMIKLENQVPFFLLEHLFEEIPKENVPIRSKFREEKEFPIYSR

Query:  DDKNVPIRSKFREEKELPISRDDENVPISFKDLAHWALKSGLNMQNNSD--NNNCLSFLCHIPFSGCWQKLDGENNDQTKNNVCLTLFKPL--CHLWNKI
              I S                VP   K L H+     +      D   N               Q++    N +  N   L+ F  L  C LW   
Subjt:  DDKNVPIRSKFREEKELPISRDDENVPISFKDLAHWALKSGLNMQNNSD--NNNCLSFLCHIPFSGCWQKLDGENNDQTKNNVCLTLFKPL--CHLWNKI

Query:  ACKEKKSEKGSSNIPPSIPELSEAGVSIKKAENGKYMRNITFKNGVLEIPPLHIFDNFKLMLRNMVAFEQYTAGNKNKYVTQYVLFLDNLISTEKDLHLL
          K+K  ++  S  PPSI EL E+GV+I+KA+N KY+ NITFKNGVL+IP LHI+D F+LM RN++AFEQ+ AGN+N Y TQY+LF+D+LISTEKD+ LL
Subjt:  ACKEKKSEKGSSNIPPSIPELSEAGVSIKKAENGKYMRNITFKNGVLEIPPLHIFDNFKLMLRNMVAFEQYTAGNKNKYVTQYVLFLDNLISTEKDLHLL

Query:  VKAGVIINNIGGSDKEVSDLLNNLPKFVTQSSSPH-YDSISKALCEHCNGCWNKAKASLKHNYFNTPWAFISFFAATFLILLT-VQTIFSGITTFP
        V +GVIINNIGGSDKEVS+L NNL KFV Q  SP+ ++ ISKAL +HCNG WNKAKASLKHNYFNTPWAFISFFAA+FL+LLT +QTIFSGI+ FP
Subjt:  VKAGVIINNIGGSDKEVSDLLNNLPKFVTQSSSPH-YDSISKALCEHCNGCWNKAKASLKHNYFNTPWAFISFFAATFLILLT-VQTIFSGITTFP

SwissProt top hitse value%identityAlignment
Q9SD53 UPF0481 protein At3g472001.4e-1333.33Show/hide
Query:  NITFKNGVLEIPPLHIFDNF-KLMLRNMVAFEQYTAGNKNKYVTQYVLFLDNLISTEKDLHLLVKAGVIINNIGGSDKEVSDLLNNLPK-FVTQSSSPHY
        N+  K   L+IP L  FD F      N VAFEQ+   + N+ +T Y++F+  L++ E+D+  L    +II N  GS+ EVS+    + K  V +  + + 
Subjt:  NITFKNGVLEIPPLHIFDNF-KLMLRNMVAFEQYTAGNKNKYVTQYVLFLDNLISTEKDLHLLVKAGVIINNIGGSDKEVSDLLNNLPK-FVTQSSSPHY

Query:  DSISKALCEHCNGCWNKAKASLKHNYFNTPWAFISFFAATFLILLTV
        +++ K + E+    +N   A  +H +F +PW F+S  A  F+ILLT+
Subjt:  DSISKALCEHCNGCWNKAKASLKHNYFNTPWAFISFFAATFLILLTV

Arabidopsis top hitse value%identityAlignment
AT3G50120.1 Plant protein of unknown function (DUF247)8.5e-3825.94Show/hide
Query:  FAVQQKPKKPKKPRAISVWQKIVSDINDQ-----------------QLRDDATYVPQLISIGPFHHGTPKDLIANEKYKLHGFINFLRRINIKSTVEGGI
        + +++ PK  +    IS+  K+     D                  Q  D+ +Y PQ +S+GP+HHG  K L + +++K       L+R N     +G  
Subjt:  FAVQQKPKKPKKPRAISVWQKIVSDINDQ-----------------QLRDDATYVPQLISIGPFHHGTPKDLIANEKYKLHGFINFLRRINIKSTVEGGI

Query:  SETARSLKDLLKTGTLKVLVEKAHDWVKEALNCYATPINMEEKDFIIMMVVDACFIDEFFILKVDESHPICKFDLIQENVDISFHGGIEVDITDDMIKLE
                       +K+ ++   +  ++A  CY  P+++   +FI M+V+D CF+ E F   V+      +    + +   +  G +   I  DM+ LE
Subjt:  SETARSLKDLLKTGTLKVLVEKAHDWVKEALNCYATPINMEEKDFIIMMVVDACFIDEFFILKVDESHPICKFDLIQENVDISFHGGIEVDITDDMIKLE

Query:  NQVPFFLLEHLFE-EIPKEN-VPIRSKFREEKEFPIYSRDDKNVPIRSKFREEKELPISRDDENVPISFKDLAHWALKSGLNMQNNSDNNNCLSFLCHIP
        NQ+P F+L  L E ++   N   + ++       P+   D+   P+    + + E  ++RD    P  F D+                  +CL       
Subjt:  NQVPFFLLEHLFE-EIPKEN-VPIRSKFREEKEFPIYSRDDKNVPIRSKFREEKELPISRDDENVPISFKDLAHWALKSGLNMQNNSDNNNCLSFLCHIP

Query:  FSGCWQKLDGENNDQTKNNVCLTLFKPLCHLWNKIACKEKK-SEKGSSNIPPSIPELSEAGVSIKKAENGKYMRNITFKNGVLEIPPLHIFDNFKLMLRN
                     D  + ++  +  KP   L  K   +  + ++K    +   + EL EAG+  ++ +  ++  ++ FKNG LEIP L I D  K +  N
Subjt:  FSGCWQKLDGENNDQTKNNVCLTLFKPLCHLWNKIACKEKK-SEKGSSNIPPSIPELSEAGVSIKKAENGKYMRNITFKNGVLEIPPLHIFDNFKLMLRN

Query:  MVAFEQYTAGNKNKYVTQYVLFLDNLISTEKDLHLLVKAGVIINNIGGSDKEVSDLLNNLPKFVTQSSSPHYDS-ISKALCEHCNGCWNKAKASLKHNYF
        ++AFEQ    + N  +T Y++F+DNLI + +D+  L   G+I + + GSD EV+DL N L + V   +   Y S +S  +  + +  WN  +A+LKH YF
Subjt:  MVAFEQYTAGNKNKYVTQYVLFLDNLISTEKDLHLLVKAGVIINNIGGSDKEVSDLLNNLPKFVTQSSSPHYDS-ISKALCEHCNGCWNKAKASLKHNYF

Query:  NTPWAFISFFAATFLILLTVQTIFSGITTFPK
        N PWA +SF AA  L++LT    F  +  + K
Subjt:  NTPWAFISFFAATFLILLTVQTIFSGITTFPK

AT3G50130.1 Plant protein of unknown function (DUF247)2.1e-3626.63Show/hide
Query:  WKCMKQSTKFAVQQKPKKPKKPRAISVWQKIVSDINDQQLRDDAT----------------------YVPQLISIGPFHHGTPKDLIANEKYKLHGFINF
        W  +    +  +Q++ +KP++ R    W   + D  +Q LR+DAT                      Y PQ +S+GPFHHG  K L+  +++K       
Subjt:  WKCMKQSTKFAVQQKPKKPKKPRAISVWQKIVSDINDQQLRDDAT----------------------YVPQLISIGPFHHGTPKDLIANEKYKLHGFINF

Query:  LRRINIKSTVEGGISETARSLKDLLKTGTLKVLVEKAHDWVKEALNCYATPINMEEKDFIIMMVVDACFIDEFFILKVDESHPICKFDLIQENVDISFHG
         R +N+           AR+  D      +++ ++   +    A  CY  PI++    F  M+V+D CF+ E F    DE      +D    N  +    
Subjt:  LRRINIKSTVEGGISETARSLKDLLKTGTLKVLVEKAHDWVKEALNCYATPINMEEKDFIIMMVVDACFIDEFFILKVDESHPICKFDLIQENVDISFHG

Query:  GIEVDITDDMIKLENQVPFFLLEHLFE-EIPKEN-VPIRSKFREEKEFPIYSRDDKNVPIRSKFREEKELPISRDDENV-------PISFKDLAHWALKS
        G    I  DM+ LENQ+P F+L  L E ++ K +   + S+       P+   D+               P+++ D+++       PI+ KD        
Subjt:  GIEVDITDDMIKLENQVPFFLLEHLFE-EIPKEN-VPIRSKFREEKEFPIYSRDDKNVPIRSKFREEKELPISRDDENV-------PISFKDLAHWALKS

Query:  GLNMQNNSDNNNCLSFLCHIPFSGCWQKLDGENNDQTKNNVCLTLFKPLCHLWN-KIACKEKKSEKGSSNIPPSIPELSEAGVSIKKAENGKYMRNITFK
                   +CL                    D  + N+      P   L   + + + + ++K    +   + EL EAG+  +  +  ++  +I FK
Subjt:  GLNMQNNSDNNNCLSFLCHIPFSGCWQKLDGENNDQTKNNVCLTLFKPLCHLWN-KIACKEKKSEKGSSNIPPSIPELSEAGVSIKKAENGKYMRNITFK

Query:  NGVLEIPPLHIFDNFKLMLRNMVAFEQYTAGNKNKYVTQYVLFLDNLISTEKDLHLLVKAGVIINNIGGSDKEVSDLLNNLPKFVT-QSSSPHYDSISKA
        NG LEIP L I D  K +  N++AFEQ    + N  +T Y++F+DNLI + +D+  L   G+I + + G+D EV+DL N L + V     + +   +S  
Subjt:  NGVLEIPPLHIFDNFKLMLRNMVAFEQYTAGNKNKYVTQYVLFLDNLISTEKDLHLLVKAGVIINNIGGSDKEVSDLLNNLPKFVT-QSSSPHYDSISKA

Query:  LCEHCNGCWNKAKASLKHNYFNTPWAFISFFAATFLILLTV-QTIFSGITTF
        +  + +  WN  KA LKH YFN PWA+ SFFAA  L++LT+ Q+ F+    F
Subjt:  LCEHCNGCWNKAKASLKHNYFNTPWAFISFFAATFLILLTV-QTIFSGITTF

AT3G50140.1 Plant protein of unknown function (DUF247)1.9e-3425.83Show/hide
Query:  VQQKPKKPKKPRAISVWQKIVSDINDQQLR--DDATYVPQLISIGPFHHGTPKDLIANEKYKLHGFINFLRRINIKSTVEGGISETARSLKDLLKTGTLK
        ++ K ++  +  A + W KI        L+  D  +Y PQ +S+GP+HHG   + +    Y     +N +    +K T +G                 ++
Subjt:  VQQKPKKPKKPRAISVWQKIVSDINDQQLR--DDATYVPQLISIGPFHHGTPKDLIANEKYKLHGFINFLRRINIKSTVEGGISETARSLKDLLKTGTLK

Query:  VLVEKAHDWVKEALNCYATPINMEEKDFIIMMVVDACFIDEFFILKVDESHPICKFDLIQENVDISFHGGIEVDITDDMIKLENQVPFFLLEHLFEEIPK
        + ++   +  + A  CY  PI +    F  M+V+D CF+ + F     E      +D    N  +    G    I  DM+ LENQ+P F+L  L E    
Subjt:  VLVEKAHDWVKEALNCYATPINMEEKDFIIMMVVDACFIDEFFILKVDESHPICKFDLIQENVDISFHGGIEVDITDDMIKLENQVPFFLLEHLFEEIPK

Query:  ENVPIRSKFREEKEFPIYSRDDKNVPIRSKFREEKELPISRDDENVPISFKDLAHWALKSGLNMQNNSDNNNCLSFLCHIPFSGCWQKLDGENNDQTKNN
                                + + ++++      ++       + F +       S   ++N+ +NNN   F   I      +K +    D  + +
Subjt:  ENVPIRSKFREEKEFPIYSRDDKNVPIRSKFREEKELPISRDDENVPISFKDLAHWALKSGLNMQNNSDNNNCLSFLCHIPFSGCWQKLDGENNDQTKNN

Query:  VCLTLFKPLCHL----WNKIACKEKKSEKGSSNIPPSIPELSEAGVSIKKAENGKYMRNITFKNGVLEIPPLHIFDNFKLMLRNMVAFEQYTAGNKNKYV
        +     KP   L    W++   K   ++K    +   + EL EAG+  K+ ++ ++  +I FKNG LEIP L I D  K +  N++A+EQ    + N  +
Subjt:  VCLTLFKPLCHL----WNKIACKEKKSEKGSSNIPPSIPELSEAGVSIKKAENGKYMRNITFKNGVLEIPPLHIFDNFKLMLRNMVAFEQYTAGNKNKYV

Query:  TQYVLFLDNLISTEKDLHLLVKAGVIINNIGGSDKEVSDLLNNLPKFVT-QSSSPHYDSISKALCEHCNGCWNKAKASLKHNYFNTPWAFISFFAATFLI
        T Y++F+DNLI + +D+  L    +I + + G+D EV+D+ N L + V     + +   +S  +  + N  WN  KA+LKH YF+ PWA+ SFFAA  L+
Subjt:  TQYVLFLDNLISTEKDLHLLVKAGVIINNIGGSDKEVSDLLNNLPKFVT-QSSSPHYDSISKALCEHCNGCWNKAKASLKHNYFNTPWAFISFFAATFLI

Query:  LLTVQTIFSGITTFP
        LLT+   F   T++P
Subjt:  LLTVQTIFSGITTFP

AT3G50160.1 Plant protein of unknown function (DUF247)1.1e-3429.04Show/hide
Query:  QLRDDATYVPQLISIGPFHHGTPKDLIANEKYKLHGFINFLRRINIKSTVEGGISETARSLKDL-LKTGTLKVLVEKAHDWVKEALNCYATPINMEEKDF
        Q  D  +Y+PQ++SIGP+HHG  K L+  E++K        R +N+           AR+  D+ +    +K L EKA         CY  PINM   +F
Subjt:  QLRDDATYVPQLISIGPFHHGTPKDLIANEKYKLHGFINFLRRINIKSTVEGGISETARSLKDL-LKTGTLKVLVEKAHDWVKEALNCYATPINMEEKDF

Query:  IIMMVVDACFIDEFFILKVDESHPICKFDLIQENVDISFHGGIEVDITDDMIKLENQVPFFLLEHLF-----EEIPKENVPIRSKFREEKEFPIYSRDDK
        I M+V+D  FI E F    +    I        N  +    G+   I  DM+ LENQ+P+ +L+ L      + + K NV +   F +    P+      
Subjt:  IIMMVVDACFIDEFFILKVDESHPICKFDLIQENVDISFHGGIEVDITDDMIKLENQVPFFLLEHLF-----EEIPKENVPIRSKFREEKEFPIYSRDDK

Query:  NVPIRSKFREEKELPISRDDENVPISFKDLAHWALKSGLNMQNNSDNNNCLSFLCHIPFSGCWQKLDGENNDQTKNNVCLTLFKPLCHLWNKIACKEKKS
         +P R    EE  L                                  +CL  L      G  Q     + D +  N                       
Subjt:  NVPIRSKFREEKELPISRDDENVPISFKDLAHWALKSGLNMQNNSDNNNCLSFLCHIPFSGCWQKLDGENNDQTKNNVCLTLFKPLCHLWNKIACKEKKS

Query:  EKGSSNIPPSIPELSEAGVSIKKAENGKYMRNITFKNGVLEIPPLHIFDNFKLMLRNMVAFEQYTAGNKNKYVTQYVLFLDNLISTEKDLHLLVKAGVII
         K    +   + EL  AGV   + E G +  +I FKNG L+IP L I D  K +  N++AFEQ      +K +T Y++F+DNLI++ +D+  L   G+I 
Subjt:  EKGSSNIPPSIPELSEAGVSIKKAENGKYMRNITFKNGVLEIPPLHIFDNFKLMLRNMVAFEQYTAGNKNKYVTQYVLFLDNLISTEKDLHLLVKAGVII

Query:  NNIGGSDKEVSDLLNNLPKFVTQSSSPHY-DSISKALCEHCNGCWNKAKASLKHNYFNTPWAFISFFAATFLILLTVQTIFSGITTFPK
        N + GSD EVSDL N L K V    +  Y  +++  +  +    WN  KA+L+H YFN PWA+ SF AA  L++ T    F  +  + K
Subjt:  NNIGGSDKEVSDLLNNLPKFVTQSSSPHY-DSISKALCEHCNGCWNKAKASLKHNYFNTPWAFISFFAATFLILLTVQTIFSGITTFPK

AT3G50170.1 Plant protein of unknown function (DUF247)1.8e-4028.95Show/hide
Query:  QLRDDATYVPQLISIGPFHHGTPKDLIANEKYKLHGFINFLRRINIKSTVEGGISETARSLKDLLKTGTLKVLVEKAHDWVKEALNCYATPINMEEKDFI
        Q  D  +Y PQ +S+GP+HHG  K L   E++K       L+R  +K  +E             + T  ++ L EKA         CY  PI++   +F 
Subjt:  QLRDDATYVPQLISIGPFHHGTPKDLIANEKYKLHGFINFLRRINIKSTVEGGISETARSLKDLLKTGTLKVLVEKAHDWVKEALNCYATPINMEEKDFI

Query:  IMMVVDACFIDEFFILKVDESHPICKFDLIQENVDISFHGGIEVDITDDMIKLENQVPFFLLEHLFE-EIPKENVPIRSKFREEKEFPIYSRDDKNVPIR
         M+V+D CF+ E F   V+    I        N  +    G+   I  DMI LENQ+P F+L+ L E ++  +N          K F      D  +P  
Subjt:  IMMVVDACFIDEFFILKVDESHPICKFDLIQENVDISFHGGIEVDITDDMIKLENQVPFFLLEHLFE-EIPKENVPIRSKFREEKEFPIYSRDDKNVPIR

Query:  SKFREEKELPISRDDENVPISFKDLAHWALKSGLNMQNNSDNNNCLSFLCHIPFSGCWQKLDGENNDQTKNNVCLTLFKPLCHLWNKIACKEKKSEKGSS
                  +++ D++       L +W L+  L+   +    +CL              L       T++            L  ++    +  +K   
Subjt:  SKFREEKELPISRDDENVPISFKDLAHWALKSGLNMQNNSDNNNCLSFLCHIPFSGCWQKLDGENNDQTKNNVCLTLFKPLCHLWNKIACKEKKSEKGSS

Query:  NIPPSIPELSEAGVSIKKAENGKYMRNITFKNGVLEIPPLHIFDNFKLMLRNMVAFEQYTAGNKNKYVTQYVLFLDNLISTEKDLHLLVKAGVIINNIGG
         +   + EL EAGV  +K +  ++  +I FKNG LEIP L I D  K +  N++AFEQ    + N ++T Y++F+DNLI++ +D+  L   G+I + + G
Subjt:  NIPPSIPELSEAGVSIKKAENGKYMRNITFKNGVLEIPPLHIFDNFKLMLRNMVAFEQYTAGNKNKYVTQYVLFLDNLISTEKDLHLLVKAGVIINNIGG

Query:  SDKEVSDLLNNL-PKFVTQSSSPHYDSISKALCEHCNGCWNKAKASLKHNYFNTPWAFISFFAATFLILLTVQTIFSGITTFPKQHT
        SD EV+DL N L  + V      H   +S  +  + N  WN  KA+L H YFN PWA+ SF AA  L+LLT+   F  +  + K ++
Subjt:  SDKEVSDLLNNL-PKFVTQSSSPHYDSISKALCEHCNGCWNKAKASLKHNYFNTPWAFISFFAATFLILLTVQTIFSGITTFPKQHT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGAGAGAAATGGAAATGCATGAAACAAAGCACAAAATTCGCAGTGCAGCAGAAACCCAAAAAGCCAAAGAAACCCAGAGCAATATCTGTGTGGCAAAAAATTGT
TAGTGATATTAATGATCAACAGCTTCGGGATGATGCTACTTATGTTCCTCAACTAATTTCCATTGGCCCTTTTCACCATGGCACTCCAAAAGATTTGATAGCCAATGAAA
AATATAAGCTTCATGGTTTTATTAACTTTCTACGTCGTATCAATATTAAATCAACAGTGGAGGGTGGCATTTCTGAAACTGCCAGATCATTGAAGGACCTGCTGAAAACT
GGAACATTGAAAGTTCTCGTGGAAAAAGCTCATGATTGGGTGAAAGAAGCTTTGAATTGCTATGCAACACCCATAAACATGGAAGAAAAGGATTTTATTATAATGATGGT
TGTGGATGCTTGTTTCATAGACGAGTTTTTTATACTAAAAGTTGATGAAAGCCATCCAATTTGTAAGTTCGATCTAATTCAAGAGAATGTAGATATTTCATTCCACGGAG
GCATAGAGGTAGATATAACTGATGACATGATCAAGTTGGAAAATCAAGTTCCTTTTTTTCTTCTTGAACACCTATTTGAGGAAATACCCAAAGAAAATGTCCCCATCCGC
TCCAAATTTAGAGAAGAAAAAGAATTCCCCATCTATTCTAGAGATGATAAAAATGTCCCCATCCGCTCCAAATTTAGAGAAGAAAAAGAACTCCCCATCTCTAGAGATGA
TGAAAATGTCCCTATCTCCTTTAAAGATCTTGCACATTGGGCTCTTAAGTCTGGGTTGAATATGCAGAACAATAGTGACAATAATAATTGTTTAAGTTTCTTGTGTCACA
TCCCATTTTCAGGATGTTGGCAAAAGCTGGATGGGGAAAATAATGATCAGACGAAGAATAATGTTTGCTTAACATTATTTAAACCATTGTGTCATCTTTGGAATAAGATA
GCTTGTAAGGAAAAAAAAAGTGAGAAAGGAAGCTCCAATATTCCTCCATCCATACCTGAGCTCTCGGAAGCTGGTGTCTCCATCAAGAAGGCAGAAAATGGCAAATATAT
GAGGAACATAACCTTCAAAAACGGGGTTTTGGAAATCCCACCTTTACATATTTTTGATAACTTCAAACTTATGTTGCGAAACATGGTAGCATTTGAGCAATATACTGCAG
GAAATAAGAACAAGTATGTAACTCAATATGTGTTATTTCTAGATAATTTGATAAGTACAGAGAAAGACTTGCATTTACTTGTGAAGGCCGGAGTCATAATCAATAATATT
GGTGGAAGTGATAAAGAAGTTTCAGATCTGCTTAACAATCTCCCTAAATTTGTCACACAATCAAGTTCTCCCCACTATGACAGTATTAGCAAAGCTTTGTGCGAACATTG
CAATGGATGTTGGAACAAGGCAAAAGCCTCACTGAAACATAACTATTTCAACACACCATGGGCCTTCATTTCCTTTTTTGCTGCAACTTTCCTTATTCTTCTCACAGTCC
AAACTATTTTCTCTGGTATCACCACATTTCCTAAGCAGCATACGGCCTAG
mRNA sequenceShow/hide mRNA sequence
ACTCAATCTTCTTCCCACATTTTTGCCTCTTTTCTTTATCTTCCCCTCGTTTGTTTGATCTTCCACTCACCATCTCCTTCCATCAAATTTTTCACACAAGCTAAGATTTG
AAAATGGGAAGAGAGAAATGGAAATGCATGAAACAAAGCACAAAATTCGCAGTGCAGCAGAAACCCAAAAAGCCAAAGAAACCCAGAGCAATATCTGTGTGGCAAAAAAT
TGTTAGTGATATTAATGATCAACAGCTTCGGGATGATGCTACTTATGTTCCTCAACTAATTTCCATTGGCCCTTTTCACCATGGCACTCCAAAAGATTTGATAGCCAATG
AAAAATATAAGCTTCATGGTTTTATTAACTTTCTACGTCGTATCAATATTAAATCAACAGTGGAGGGTGGCATTTCTGAAACTGCCAGATCATTGAAGGACCTGCTGAAA
ACTGGAACATTGAAAGTTCTCGTGGAAAAAGCTCATGATTGGGTGAAAGAAGCTTTGAATTGCTATGCAACACCCATAAACATGGAAGAAAAGGATTTTATTATAATGAT
GGTTGTGGATGCTTGTTTCATAGACGAGTTTTTTATACTAAAAGTTGATGAAAGCCATCCAATTTGTAAGTTCGATCTAATTCAAGAGAATGTAGATATTTCATTCCACG
GAGGCATAGAGGTAGATATAACTGATGACATGATCAAGTTGGAAAATCAAGTTCCTTTTTTTCTTCTTGAACACCTATTTGAGGAAATACCCAAAGAAAATGTCCCCATC
CGCTCCAAATTTAGAGAAGAAAAAGAATTCCCCATCTATTCTAGAGATGATAAAAATGTCCCCATCCGCTCCAAATTTAGAGAAGAAAAAGAACTCCCCATCTCTAGAGA
TGATGAAAATGTCCCTATCTCCTTTAAAGATCTTGCACATTGGGCTCTTAAGTCTGGGTTGAATATGCAGAACAATAGTGACAATAATAATTGTTTAAGTTTCTTGTGTC
ACATCCCATTTTCAGGATGTTGGCAAAAGCTGGATGGGGAAAATAATGATCAGACGAAGAATAATGTTTGCTTAACATTATTTAAACCATTGTGTCATCTTTGGAATAAG
ATAGCTTGTAAGGAAAAAAAAAGTGAGAAAGGAAGCTCCAATATTCCTCCATCCATACCTGAGCTCTCGGAAGCTGGTGTCTCCATCAAGAAGGCAGAAAATGGCAAATA
TATGAGGAACATAACCTTCAAAAACGGGGTTTTGGAAATCCCACCTTTACATATTTTTGATAACTTCAAACTTATGTTGCGAAACATGGTAGCATTTGAGCAATATACTG
CAGGAAATAAGAACAAGTATGTAACTCAATATGTGTTATTTCTAGATAATTTGATAAGTACAGAGAAAGACTTGCATTTACTTGTGAAGGCCGGAGTCATAATCAATAAT
ATTGGTGGAAGTGATAAAGAAGTTTCAGATCTGCTTAACAATCTCCCTAAATTTGTCACACAATCAAGTTCTCCCCACTATGACAGTATTAGCAAAGCTTTGTGCGAACA
TTGCAATGGATGTTGGAACAAGGCAAAAGCCTCACTGAAACATAACTATTTCAACACACCATGGGCCTTCATTTCCTTTTTTGCTGCAACTTTCCTTATTCTTCTCACAG
TCCAAACTATTTTCTCTGGTATCACCACATTTCCTAAGCAGCATACGGCCTAG
Protein sequenceShow/hide protein sequence
MGREKWKCMKQSTKFAVQQKPKKPKKPRAISVWQKIVSDINDQQLRDDATYVPQLISIGPFHHGTPKDLIANEKYKLHGFINFLRRINIKSTVEGGISETARSLKDLLKT
GTLKVLVEKAHDWVKEALNCYATPINMEEKDFIIMMVVDACFIDEFFILKVDESHPICKFDLIQENVDISFHGGIEVDITDDMIKLENQVPFFLLEHLFEEIPKENVPIR
SKFREEKEFPIYSRDDKNVPIRSKFREEKELPISRDDENVPISFKDLAHWALKSGLNMQNNSDNNNCLSFLCHIPFSGCWQKLDGENNDQTKNNVCLTLFKPLCHLWNKI
ACKEKKSEKGSSNIPPSIPELSEAGVSIKKAENGKYMRNITFKNGVLEIPPLHIFDNFKLMLRNMVAFEQYTAGNKNKYVTQYVLFLDNLISTEKDLHLLVKAGVIINNI
GGSDKEVSDLLNNLPKFVTQSSSPHYDSISKALCEHCNGCWNKAKASLKHNYFNTPWAFISFFAATFLILLTVQTIFSGITTFPKQHTA