; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0017853 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0017853
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionUPF0503 protein At3g09070, chloroplastic
Genome locationchr08:22632667..22634550
RNA-Seq ExpressionPI0017853
SyntenyPI0017853
Gene Ontology termsGO:0005886 - plasma membrane (cellular component)
InterPro domainsIPR008004 - Protein OCTOPUS-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580491.1 Protein OCTOPUS, partial [Cucurbita argyrosperma subsp. sororia]1.1e-24274.33Show/hide
Query:  MNPSTDEQPL-----PPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSSASSSSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSF
        MNPST EQP      PPPLPP PHRPSA CPRHPQE FTAFCPLCLCERLSLLDSSS +SSSS+RKPHSTAASALKAIFRP PPNRPSSFFPELRRTKSF
Subjt:  MNPSTDEQPL-----PPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSSASSSSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSF

Query:  SASKNEAFSTVFEPQRKSCDVRLRNTLCSLISQDASSSSKVLAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIENSVQEIV-EEEI
        SASKN+AFS VFEPQRKSCDVR+RN  CSL S DAS+SS ++AP A EI  E+K LED  SSC+E  P GDG+IRVS QPNV D VIEN VQEIV EEEI
Subjt:  SASKNEAFSTVFEPQRKSCDVRLRNTLCSLISQDASSSSKVLAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIENSVQEIV-EEEI

Query:  QVELGSESVQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDP
        QVE+G E VQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNG GSTTLPVEKPIGRHFR+TQSEIADYG+GRRSCDIDP
Subjt:  QVELGSESVQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDP

Query:  RFSLDAGRMSFDDPRYSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGG------------------LLRPGS
        RFSLDAGRMS DDPRYSFDEPRASWDGYLISRT PRMPTMLSVVEDAPI+VFR+D QIPVEDSINSTNEEENIPGG                  L R  S
Subjt:  RFSLDAGRMSFDDPRYSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGG------------------LLRPGS

Query:  TIQTLLLV----DERASTEIRIPIHCEMTIPVVRG--CCLSSRDGKSE------------------------RRVEEVQRV---GEGLEIWGLINRRGGN
          +T   V    DE  S+     +    T  V  G    +  RD  S                          R EE ++    G+G  IWGLINRRGGN
Subjt:  TIQTLLLV----DERASTEIRIPIHCEMTIPVVRG--CCLSSRDGKSE------------------------RRVEEVQRV---GEGLEIWGLINRRGGN

Query:  KDEEEDRESSRPNGVERSYSESWPELRGDRNGDVKTGGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTNV
        KDEEED+ESSRPNG+ERSYS SWPELRG+RN DVK GGFNPKMFRSNSSVSWRS+SMMGGSFSSSRKSNAE+NGNGKKK EE  PVLERN+SARHSPTN+
Subjt:  KDEEEDRESSRPNGVERSYSESWPELRGDRNGDVKTGGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTNV

Query:  DNGLLRFYLTPLRGSRKGGSGKVKPSQAQSIARSVLRLY
        DNGLLRFYLT LRGSR+GGSGKVKP+QAQSIARSVLRLY
Subjt:  DNGLLRFYLTPLRGSRKGGSGKVKPSQAQSIARSVLRLY

XP_004137734.1 protein OCTOPUS [Cucumis sativus]3.3e-27181.77Show/hide
Query:  MNPSTDEQPLPPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSSA--SSSSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSFSAS
        MNPSTD QPLPPPLPP PHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSSA  SSSSSSRKPHSTA+SAL+A+FRPPPPNRPSSFFPELRRTKSFSAS
Subjt:  MNPSTDEQPLPPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSSA--SSSSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSFSAS

Query:  KNEAFST-VFEPQRKSCDVRLRNTLCSLISQDASSSSKVLAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIENSVQEIVEEEIQVE
        KNEAFST +FEPQRKSCDVRLRNTLCSLISQDASSSSK+LAPAA EI VE+KNLEDP SS VE IPD DGDIRVSGQPNVGDFVIENSV+EIVEEEIQVE
Subjt:  KNEAFST-VFEPQRKSCDVRLRNTLCSLISQDASSSSKVLAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIENSVQEIVEEEIQVE

Query:  LGSESVQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFS
        LGSESVQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFS
Subjt:  LGSESVQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFS

Query:  LDAGRMSFDDPRYSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGG------------------LLRPGSTIQ
        LDAGRMSFDDPRYSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGG                  L R  S  +
Subjt:  LDAGRMSFDDPRYSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGG------------------LLRPGSTIQ

Query:  TLLLV----DERASTEIRIPIHCEMT---------IPVVRGCCLSSRD-------------GKSERRVEEVQRV--GEGLEIWGLINRRGGNKDEEEDRE
        T   V    D+  S+     +    T         IP       S RD             G + R+ E  +    G+G +IWGLINRRGGNKDEEEDRE
Subjt:  TLLLV----DERASTEIRIPIHCEMT---------IPVVRGCCLSSRD-------------GKSERRVEEVQRV--GEGLEIWGLINRRGGNKDEEEDRE

Query:  SSRPNGVERSYSESWPELRGDRNGDVKTGGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTNVDNGLLRFY
        SSRPNGVERSYSESWPELRGDRNGDVK GGFNPKMFRSNSSVSWRSASM+GGSFSSSRKSNAESNGNGKKKKEE QPVLERN+SAR SPTNVDNGLLRFY
Subjt:  SSRPNGVERSYSESWPELRGDRNGDVKTGGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTNVDNGLLRFY

Query:  LTPLRGSRKGGSGKVKPSQAQSIARSVLRLY
        LTPLRGSR+G SGKVKPSQAQSIARSVLRLY
Subjt:  LTPLRGSRKGGSGKVKPSQAQSIARSVLRLY

XP_008442467.1 PREDICTED: UPF0503 protein At3g09070, chloroplastic [Cucumis melo]1.3e-27882.94Show/hide
Query:  MNPSTDEQPLPPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSSA----SSSSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSFS
        MNPSTD QPLPPPLPP PHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSSA    SSSSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSFS
Subjt:  MNPSTDEQPLPPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSSA----SSSSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSFS

Query:  ASKNEAFSTVFEPQRKSCDVRLRNTLCSLISQDASSSSKVLAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIENSVQEIVEEEIQV
        ASKNEAFSTVFEPQRKSCDVRLRNTLCSLISQDASSSSKVLAPAAPEIVVE+KNLEDP SS VE IPDGDGDIRVSGQPNVGDFVIENSVQEIVEEEIQV
Subjt:  ASKNEAFSTVFEPQRKSCDVRLRNTLCSLISQDASSSSKVLAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIENSVQEIVEEEIQV

Query:  ELGSESVQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRF
        EL SESV LQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRF
Subjt:  ELGSESVQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRF

Query:  SLDAGRMSFDDPRYSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGG------------------LLRPGSTI
        SLDAGRMSFDDPRYSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGG                  L R  S  
Subjt:  SLDAGRMSFDDPRYSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGG------------------LLRPGSTI

Query:  QTLLLVDERASTEIRIPIHCEMTIPVVRGCC------LSSRDGKSE--------------------RRVEEVQRV---GEGLEIWGLINRRGGNKDEEED
        +T   V      +++  +      P     C      L  RD  S                      R EE ++    G+G +IWGLINRRGGNKDEEED
Subjt:  QTLLLVDERASTEIRIPIHCEMTIPVVRGCC------LSSRDGKSE--------------------RRVEEVQRV---GEGLEIWGLINRRGGNKDEEED

Query:  RESSRPNGVERSYSESWPELRGDRNGDVKTGGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTNVDNGLLR
        RESSRPNGVERSYSESWPELRGDRNGDVK GGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERN+SARHSPTNVDNGLLR
Subjt:  RESSRPNGVERSYSESWPELRGDRNGDVKTGGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTNVDNGLLR

Query:  FYLTPLRGSRKGGSGKVKPSQAQSIARSVLRLY
        FYLTPLRGSR+GGSGKVKPSQAQSIARSVLRLY
Subjt:  FYLTPLRGSRKGGSGKVKPSQAQSIARSVLRLY

XP_022934120.1 UPF0503 protein At3g09070, chloroplastic-like [Cucurbita moschata]7.7e-24474.65Show/hide
Query:  MNPSTDEQPL-PPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSSASSSSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSFSASK
        MNPST EQP  PPPLPP PHRPSA CPRHPQE FTAFCPLCLCERLSLLDSSS +SSSS+RKPHSTAASALKAIFRP PPNRPSSFFPELRRTKSFSASK
Subjt:  MNPSTDEQPL-PPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSSASSSSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSFSASK

Query:  NEAFSTVFEPQRKSCDVRLRNTLCSLISQDASSSSKVLAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIENSVQEIV-EEEIQVEL
        N+AFS VFEPQRKSCDVR+RN  CSL S DAS+SS ++AP A EI  E+K LED  SSC+E  P GDG+I+VS +PNV D VIEN VQEIV EEEI+VE+
Subjt:  NEAFSTVFEPQRKSCDVRLRNTLCSLISQDASSSSKVLAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIENSVQEIV-EEEIQVEL

Query:  GSESVQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFSL
        G E VQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFR+TQSEIADYG+GRRSCDIDPRFSL
Subjt:  GSESVQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFSL

Query:  DAGRMSFDDPRYSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGG------------------LLRPGSTIQT
        DAGRMSFDDPRYSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPI+VFR+D QIPVEDSINSTNEEENIPGG                  L R  S  +T
Subjt:  DAGRMSFDDPRYSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGG------------------LLRPGSTIQT

Query:  LLLV----DERASTEIRIPIHCEMTIPVVRG--CCLSSRDGKSE------------------------RRVEEVQRV---GEGLEIWGLINRRGGNKDEE
           V    DE  S+     +    T  V  G    +  RD  S                          R EE ++    G+G  IWGLINRRGGNKDEE
Subjt:  LLLV----DERASTEIRIPIHCEMTIPVVRG--CCLSSRDGKSE------------------------RRVEEVQRV---GEGLEIWGLINRRGGNKDEE

Query:  EDRESSRPNGVERSYSESWPELRGDRNGDVKTGGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTNVDNGL
        ED+ESSRPNG+ERSYS SWPELRG+RN DVK GGFNPKMFRSNSSVSWRS+SMMGGSFSSSRKSNAE+NGNG+KK EE  PVLERN+SARHSPTN+DNGL
Subjt:  EDRESSRPNGVERSYSESWPELRGDRNGDVKTGGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTNVDNGL

Query:  LRFYLTPLRGSRKGGSGKVKPSQAQSIARSVLRLY
        LRFYLT LRGSR+GGSGKVKP+QAQSIARSVLRLY
Subjt:  LRFYLTPLRGSRKGGSGKVKPSQAQSIARSVLRLY

XP_038903136.1 protein OCTOPUS-like [Benincasa hispida]9.6e-26379.12Show/hide
Query:  MNPSTDEQ----PLPPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSS-SASSSSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSF
        MNPSTDEQ    PLPPPLPP PHRPSA CPRHPQEHFTAFCPLCLCERLS+LDSS SA+SSSSSRKPHSTAASAL+AIFRP PPNRPSSFFPELRR KSF
Subjt:  MNPSTDEQ----PLPPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSS-SASSSSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSF

Query:  SASKNEAFSTVFEPQRKSCDVRLRNTLCSLISQDASSSSKVLAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIENSVQEIV---EE
        SASKNEAFS VFEPQRKSCDVRLRNTLCSL SQDAS+SS++LA   PEI++ESKNLEDP SSC+E  PDGDG+IRVSGQPNV D VIEN VQEIV   EE
Subjt:  SASKNEAFSTVFEPQRKSCDVRLRNTLCSLISQDASSSSKVLAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIENSVQEIV---EE

Query:  EIQVELGSESVQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDI
        EIQVELGSESVQLQEEFKTMKDHIDLDSHTKKP+GRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDI
Subjt:  EIQVELGSESVQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDI

Query:  DPRFSLDAGRMSFDDPRYSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGG------------------LLRP
        DPRFSLDAGRMSFDDPRYSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNE+ENIPGG                  L R 
Subjt:  DPRFSLDAGRMSFDDPRYSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGG------------------LLRP

Query:  GSTIQTLLLV----DERASTEIRIPIH------C---EMTIP-------VVRGCCLSSRD--------GKSERRVEEVQRVGEGLEIWGLINRRGGNKDE
         S  +T   V    D+  S+     +       C   ++ IP        +R  C  S D        G  +   ++ +  G+G +IWGLINRRGGNKDE
Subjt:  GSTIQTLLLV----DERASTEIRIPIH------C---EMTIP-------VVRGCCLSSRD--------GKSERRVEEVQRVGEGLEIWGLINRRGGNKDE

Query:  EEDRESSRPNGVERSYSESWPELRGDRNGDVKTGGFNPKMFRSNSSVSWRSASMMG-GSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTNVDN
        EEDRESSRPNGVERSYSESWPELRGDRNGDVK GGFNPKMFRSNSSVSWRS+SMMG G FSSSRKSNAESNGNGKKKKEEPQPVLERN+SARHSPTNVDN
Subjt:  EEDRESSRPNGVERSYSESWPELRGDRNGDVKTGGFNPKMFRSNSSVSWRSASMMG-GSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTNVDN

Query:  GLLRFYLTPLRGSRKGGSGKVKPSQAQSIARSVLRLY
        GLLRFYLTPLRGSR+GGSGKVKP+QAQSIARSVLRLY
Subjt:  GLLRFYLTPLRGSRKGGSGKVKPSQAQSIARSVLRLY

TrEMBL top hitse value%identityAlignment
A0A0A0LAA9 Uncharacterized protein1.6e-27181.77Show/hide
Query:  MNPSTDEQPLPPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSSA--SSSSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSFSAS
        MNPSTD QPLPPPLPP PHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSSA  SSSSSSRKPHSTA+SAL+A+FRPPPPNRPSSFFPELRRTKSFSAS
Subjt:  MNPSTDEQPLPPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSSA--SSSSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSFSAS

Query:  KNEAFST-VFEPQRKSCDVRLRNTLCSLISQDASSSSKVLAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIENSVQEIVEEEIQVE
        KNEAFST +FEPQRKSCDVRLRNTLCSLISQDASSSSK+LAPAA EI VE+KNLEDP SS VE IPD DGDIRVSGQPNVGDFVIENSV+EIVEEEIQVE
Subjt:  KNEAFST-VFEPQRKSCDVRLRNTLCSLISQDASSSSKVLAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIENSVQEIVEEEIQVE

Query:  LGSESVQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFS
        LGSESVQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFS
Subjt:  LGSESVQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFS

Query:  LDAGRMSFDDPRYSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGG------------------LLRPGSTIQ
        LDAGRMSFDDPRYSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGG                  L R  S  +
Subjt:  LDAGRMSFDDPRYSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGG------------------LLRPGSTIQ

Query:  TLLLV----DERASTEIRIPIHCEMT---------IPVVRGCCLSSRD-------------GKSERRVEEVQRV--GEGLEIWGLINRRGGNKDEEEDRE
        T   V    D+  S+     +    T         IP       S RD             G + R+ E  +    G+G +IWGLINRRGGNKDEEEDRE
Subjt:  TLLLV----DERASTEIRIPIHCEMT---------IPVVRGCCLSSRD-------------GKSERRVEEVQRV--GEGLEIWGLINRRGGNKDEEEDRE

Query:  SSRPNGVERSYSESWPELRGDRNGDVKTGGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTNVDNGLLRFY
        SSRPNGVERSYSESWPELRGDRNGDVK GGFNPKMFRSNSSVSWRSASM+GGSFSSSRKSNAESNGNGKKKKEE QPVLERN+SAR SPTNVDNGLLRFY
Subjt:  SSRPNGVERSYSESWPELRGDRNGDVKTGGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTNVDNGLLRFY

Query:  LTPLRGSRKGGSGKVKPSQAQSIARSVLRLY
        LTPLRGSR+G SGKVKPSQAQSIARSVLRLY
Subjt:  LTPLRGSRKGGSGKVKPSQAQSIARSVLRLY

A0A1S3B6I0 UPF0503 protein At3g09070, chloroplastic6.1e-27982.94Show/hide
Query:  MNPSTDEQPLPPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSSA----SSSSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSFS
        MNPSTD QPLPPPLPP PHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSSA    SSSSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSFS
Subjt:  MNPSTDEQPLPPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSSA----SSSSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSFS

Query:  ASKNEAFSTVFEPQRKSCDVRLRNTLCSLISQDASSSSKVLAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIENSVQEIVEEEIQV
        ASKNEAFSTVFEPQRKSCDVRLRNTLCSLISQDASSSSKVLAPAAPEIVVE+KNLEDP SS VE IPDGDGDIRVSGQPNVGDFVIENSVQEIVEEEIQV
Subjt:  ASKNEAFSTVFEPQRKSCDVRLRNTLCSLISQDASSSSKVLAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIENSVQEIVEEEIQV

Query:  ELGSESVQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRF
        EL SESV LQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRF
Subjt:  ELGSESVQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRF

Query:  SLDAGRMSFDDPRYSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGG------------------LLRPGSTI
        SLDAGRMSFDDPRYSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGG                  L R  S  
Subjt:  SLDAGRMSFDDPRYSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGG------------------LLRPGSTI

Query:  QTLLLVDERASTEIRIPIHCEMTIPVVRGCC------LSSRDGKSE--------------------RRVEEVQRV---GEGLEIWGLINRRGGNKDEEED
        +T   V      +++  +      P     C      L  RD  S                      R EE ++    G+G +IWGLINRRGGNKDEEED
Subjt:  QTLLLVDERASTEIRIPIHCEMTIPVVRGCC------LSSRDGKSE--------------------RRVEEVQRV---GEGLEIWGLINRRGGNKDEEED

Query:  RESSRPNGVERSYSESWPELRGDRNGDVKTGGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTNVDNGLLR
        RESSRPNGVERSYSESWPELRGDRNGDVK GGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERN+SARHSPTNVDNGLLR
Subjt:  RESSRPNGVERSYSESWPELRGDRNGDVKTGGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTNVDNGLLR

Query:  FYLTPLRGSRKGGSGKVKPSQAQSIARSVLRLY
        FYLTPLRGSR+GGSGKVKPSQAQSIARSVLRLY
Subjt:  FYLTPLRGSRKGGSGKVKPSQAQSIARSVLRLY

A0A5A7TLQ7 UPF0503 protein6.1e-27982.94Show/hide
Query:  MNPSTDEQPLPPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSSA----SSSSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSFS
        MNPSTD QPLPPPLPP PHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSSA    SSSSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSFS
Subjt:  MNPSTDEQPLPPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSSA----SSSSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSFS

Query:  ASKNEAFSTVFEPQRKSCDVRLRNTLCSLISQDASSSSKVLAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIENSVQEIVEEEIQV
        ASKNEAFSTVFEPQRKSCDVRLRNTLCSLISQDASSSSKVLAPAAPEIVVE+KNLEDP SS VE IPDGDGDIRVSGQPNVGDFVIENSVQEIVEEEIQV
Subjt:  ASKNEAFSTVFEPQRKSCDVRLRNTLCSLISQDASSSSKVLAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIENSVQEIVEEEIQV

Query:  ELGSESVQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRF
        EL SESV LQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRF
Subjt:  ELGSESVQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRF

Query:  SLDAGRMSFDDPRYSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGG------------------LLRPGSTI
        SLDAGRMSFDDPRYSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGG                  L R  S  
Subjt:  SLDAGRMSFDDPRYSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGG------------------LLRPGSTI

Query:  QTLLLVDERASTEIRIPIHCEMTIPVVRGCC------LSSRDGKSE--------------------RRVEEVQRV---GEGLEIWGLINRRGGNKDEEED
        +T   V      +++  +      P     C      L  RD  S                      R EE ++    G+G +IWGLINRRGGNKDEEED
Subjt:  QTLLLVDERASTEIRIPIHCEMTIPVVRGCC------LSSRDGKSE--------------------RRVEEVQRV---GEGLEIWGLINRRGGNKDEEED

Query:  RESSRPNGVERSYSESWPELRGDRNGDVKTGGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTNVDNGLLR
        RESSRPNGVERSYSESWPELRGDRNGDVK GGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERN+SARHSPTNVDNGLLR
Subjt:  RESSRPNGVERSYSESWPELRGDRNGDVKTGGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTNVDNGLLR

Query:  FYLTPLRGSRKGGSGKVKPSQAQSIARSVLRLY
        FYLTPLRGSR+GGSGKVKPSQAQSIARSVLRLY
Subjt:  FYLTPLRGSRKGGSGKVKPSQAQSIARSVLRLY

A0A6J1F6S2 UPF0503 protein At3g09070, chloroplastic-like3.7e-24474.65Show/hide
Query:  MNPSTDEQPL-PPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSSASSSSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSFSASK
        MNPST EQP  PPPLPP PHRPSA CPRHPQE FTAFCPLCLCERLSLLDSSS +SSSS+RKPHSTAASALKAIFRP PPNRPSSFFPELRRTKSFSASK
Subjt:  MNPSTDEQPL-PPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSSASSSSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSFSASK

Query:  NEAFSTVFEPQRKSCDVRLRNTLCSLISQDASSSSKVLAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIENSVQEIV-EEEIQVEL
        N+AFS VFEPQRKSCDVR+RN  CSL S DAS+SS ++AP A EI  E+K LED  SSC+E  P GDG+I+VS +PNV D VIEN VQEIV EEEI+VE+
Subjt:  NEAFSTVFEPQRKSCDVRLRNTLCSLISQDASSSSKVLAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIENSVQEIV-EEEIQVEL

Query:  GSESVQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFSL
        G E VQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFR+TQSEIADYG+GRRSCDIDPRFSL
Subjt:  GSESVQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFSL

Query:  DAGRMSFDDPRYSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGG------------------LLRPGSTIQT
        DAGRMSFDDPRYSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPI+VFR+D QIPVEDSINSTNEEENIPGG                  L R  S  +T
Subjt:  DAGRMSFDDPRYSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGG------------------LLRPGSTIQT

Query:  LLLV----DERASTEIRIPIHCEMTIPVVRG--CCLSSRDGKSE------------------------RRVEEVQRV---GEGLEIWGLINRRGGNKDEE
           V    DE  S+     +    T  V  G    +  RD  S                          R EE ++    G+G  IWGLINRRGGNKDEE
Subjt:  LLLV----DERASTEIRIPIHCEMTIPVVRG--CCLSSRDGKSE------------------------RRVEEVQRV---GEGLEIWGLINRRGGNKDEE

Query:  EDRESSRPNGVERSYSESWPELRGDRNGDVKTGGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTNVDNGL
        ED+ESSRPNG+ERSYS SWPELRG+RN DVK GGFNPKMFRSNSSVSWRS+SMMGGSFSSSRKSNAE+NGNG+KK EE  PVLERN+SARHSPTN+DNGL
Subjt:  EDRESSRPNGVERSYSESWPELRGDRNGDVKTGGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTNVDNGL

Query:  LRFYLTPLRGSRKGGSGKVKPSQAQSIARSVLRLY
        LRFYLT LRGSR+GGSGKVKP+QAQSIARSVLRLY
Subjt:  LRFYLTPLRGSRKGGSGKVKPSQAQSIARSVLRLY

A0A6J1J5H7 UPF0503 protein At3g09070, chloroplastic-like3.8e-24174.21Show/hide
Query:  MNPSTDEQPL-PPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSS-ASSSSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSFSAS
        MNPST EQP  PPPLPP PHRPSA CPRHPQE FTAFCPLCLCERLSLLDSSS  SSSSS+RKPHSTAASALKAIFRP PPNRPSSFFPELRRTKSFSAS
Subjt:  MNPSTDEQPL-PPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSS-ASSSSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSFSAS

Query:  KNEAFSTVFEPQRKSCDVRLRNTLCSLISQDASSSSKVLAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIENSVQEIVEE-EIQVE
        KN+AFS VFEPQRKSCDVR+RN  CSL S DAS+SS ++AP+A EI  E+K LED  SSC+EH P G  +IRVS QPNV D VIEN  QEIVEE EIQVE
Subjt:  KNEAFSTVFEPQRKSCDVRLRNTLCSLISQDASSSSKVLAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIENSVQEIVEE-EIQVE

Query:  LGSESVQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFS
        +G E+VQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGST LPVEKPIGRHFR+TQSEIADYG+GRRSCDIDPRFS
Subjt:  LGSESVQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFS

Query:  LDAGRMSFDDPRYSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGG------------------LLRPGSTIQ
        LDAGRMSFDDPRYSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPI+VFR+D QIPVEDSINSTNEEENIPGG                  L R  S  +
Subjt:  LDAGRMSFDDPRYSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGG------------------LLRPGSTIQ

Query:  TLLLV----DERASTEIRIPIHCEMTIPVVRG--CCLSSRDGKSE------------------------RRVEEVQRV---GEGLEIWGLINRRGGNKDE
        T   V    DE  S+     +    T  V  G    +  RD  S                          R EE ++    G+G  IWGLINRRGGNKDE
Subjt:  TLLLV----DERASTEIRIPIHCEMTIPVVRG--CCLSSRDGKSE------------------------RRVEEVQRV---GEGLEIWGLINRRGGNKDE

Query:  EEDRESSRPNGVERSYSESWPELRGDRNGDVKTGGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTNVDNG
        EED+ESSRPNG+ERS S SWPELRG+RN D+K GGFNPKMFRSNSSVSWRS+SMMGGSFSSSRKSN E+NGNGKKK EE  PVLER++SARHSPTN+DNG
Subjt:  EEDRESSRPNGVERSYSESWPELRGDRNGDVKTGGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTNVDNG

Query:  LLRFYLTPLRGSRKGGSGKVKPSQAQSIARSVLRLY
        LLRFYLT LRGSR+GGSGKVKP+QAQSIARSVLRLY
Subjt:  LLRFYLTPLRGSRKGGSGKVKPSQAQSIARSVLRLY

SwissProt top hitse value%identityAlignment
Q9LFB9 Protein OCTOPUS-like2.0e-8542.19Show/hide
Query:  MNPSTDEQP------LPPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSSA-SSSSSSRKPHSTAASALKAIFRPPPPNRPSS---------
        MN S D+ P      L PP    PHR S +C  HP+E F+ FCP CLC+RLS+LD ++A   SSSSRKP S +A +LKA+F+P      +S         
Subjt:  MNPSTDEQP------LPPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSSA-SSSSSSRKPHSTAASALKAIFRPPPPNRPSS---------

Query:  FFPELRRTKSFSASKNEAFSTVFEPQRKSCDVRLRNTLCSLISQDASSSSKVLAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIEN
        FFPELRRTKSFSA  NE FS  FEPQR+SCDVRLR+   +L   +A+S  K+    A E  V    LE    + +E   +       +G+ + G+ V E 
Subjt:  FFPELRRTKSFSASKNEAFSTVFEPQRKSCDVRLRNTLCSLISQDASSSSKVLAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIEN

Query:  SVQEIVEEEIQVELGSESVQLQEEFKTMKDHIDLDSHTKKPSGR---GSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIA
        S  EI EEE             EE K MKD++DL S TKKPS +   GSF+SAASVFSKKLQKW+ KQK KK RNG G          GR     QSEI 
Subjt:  SVQEIVEEEIQVELGSESVQLQEEFKTMKDHIDLDSHTKKPSGR---GSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIA

Query:  DYGFGRRSCDIDPRFSLDA-------GRMSFDDPRYSFDEPRASWDGYLISRT----FPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEEN----
          G GRRS D DPRFSLDA       GR+S DD RYS DEPRASWDG+LI RT     P  P+MLSVVE+AP++  RSD QIP   SI   + + +    
Subjt:  DYGFGRRSCDIDPRFSLDA-------GRMSFDDPRYSFDEPRASWDGYLISRT----FPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEEN----

Query:  IPGG-------LLRPGSTIQTLLL----VDERASTEIRIPIHCEMTIPVVRGCCLSSRDGKSERRVEEVQRVGEGLEIWGLINRRGGNKDEEEDRESSRP
        IPGG          P S+ +   L       +  TE+        +   +    + + + K  +  ++  R      I G I R+G  KD+EE+   SR 
Subjt:  IPGG-------LLRPGSTIQTLLL----VDERASTEIRIPIHCEMTIPVVRGCCLSSRDGKSERRVEEVQRVGEGLEIWGLINRRGGNKDEEEDRESSRP

Query:  NG---VERSYSESWPELRGDRNGDVKTGGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTNVDNGLLRFYL
        N    VERS SESWPE+   RNG+    G  PKM RSNS+VSWRS+   GGS                           RN+S+R+S  + +NG+LRFYL
Subjt:  NG---VERSYSESWPELRGDRNGDVKTGGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTNVDNGLLRFYL

Query:  TPLRGSRK------------GGSGKVKP-----SQAQSIARSVLRLY
        TP+R S K            GG G  K      S   SIAR V+RLY
Subjt:  TPLRGSRK------------GGSGKVKP-----SQAQSIARSVLRLY

Q9SS80 Protein OCTOPUS1.6e-11144.01Show/hide
Query:  MNPSTDEQP------LPPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLD-SSSASSSSSSRKPHSTAASALKAIFRPPPPNR------------
        MNP+TD          PPP PP PHR S +C RHP+E FT FCP CLCERLS+LD +++  SSSSS+KP + +A+ALKA+F+P   N             
Subjt:  MNPSTDEQP------LPPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLD-SSSASSSSSSRKPHSTAASALKAIFRPPPPNR------------

Query:  PSSFFPELRRTKSFSASK-NEAFSTVFEPQRKSCDVRLRNTLCSLISQDASSS--SKVLAPAAPEIVVESK-----------NLEDPGSSCVEHIPDGDG
           FFPELRRTKSFSASK NE FS VFEPQR+SCDVRLR++L +L SQD   +  S V      EI VE +           N E    S  E + + + 
Subjt:  PSSFFPELRRTKSFSASK-NEAFSTVFEPQRKSCDVRLRNTLCSLISQDASSS--SKVLAPAAPEIVVESK-----------NLEDPGSSCVEHIPDGDG

Query:  DIRVSGQPNVGDFVIENSVQEI----------VEEEIQVELGSESVQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNG
        +  V      GDF I N   E+          V EEI+  +       +EE K +KD+IDLDS TKKPS R SFWSAASVFSKKLQKWR  QK KK+RNG
Subjt:  DIRVSGQPNVGDFVIENSVQEI----------VEEEIQVELGSESVQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNG

Query:  G----GSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFSLDA--------------GRMSFDDPRYSFDEPRASWDGYLISRTF-------PRMP
        G    GS  LPVEKPIGR  R+TQSEIADYG+GRRSCD DPRFSLDA              GR+S DDPRYSFDEPRASWDG LI RT        P  P
Subjt:  G----GSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFSLDA--------------GRMSFDDPRYSFDEPRASWDGYLISRTF-------PRMP

Query:  TMLSVVEDAP----IHVFRSDTQIPVEDS------INSTNEEEN---IPGG------------------LLRPGSTIQTLLLVDERASTEIRIPIHCEMT
        +MLSVVEDAP     HV R+D Q PVE+       +N TN   +   IPGG                  L R  S+++           E ++ +   ++
Subjt:  TMLSVVEDAP----IHVFRSDTQIPVEDS------INSTNEEEN---IPGG------------------LLRPGSTIQTLLLVDERASTEIRIPIHCEMT

Query:  IPVVRG----------------------CCLSSRDGKSERRVEEVQRVGEGLEIWGLINRRGGNKDEEEDRESS----RPNG--VERSYSESWPELRGDR
        I    G                        +  R   S    ++ +R G+   I GLI R+  NK EEE+ E      R NG  VERS SESWPEL   R
Subjt:  IPVVRG----------------------CCLSSRDGKSERRVEEVQRVGEGLEIWGLINRRGGNKDEEEDRESS----RPNG--VERSYSESWPELRGDR

Query:  NGDVKTGGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTNVDNGLLRFYLTPLRGSRK---------GGSG
        NG    GG  P+M RSNS+VSWRS+   GG   S+RK N                +  RN+S+R+SP N +NG+L+FYL  ++ SR+         GG G
Subjt:  NGDVKTGGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTNVDNGLLRFYLTPLRGSRK---------GGSG

Query:  KVKPSQAQSIARSVLRLY
            S   SIARSV+RLY
Subjt:  KVKPSQAQSIARSVLRLY

Arabidopsis top hitse value%identityAlignment
AT2G38070.1 Protein of unknown function (DUF740)1.0e-10044.18Show/hide
Query:  PPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSSASS---SSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSFSASKNEAFST-V
        PPP PP PHRPS +C RHP E FT FCP CL +RLS+LD +  ++   +SSS+KP S++A ALKAIF+  P +   SFFPELRRTKSFSASK EAFS   
Subjt:  PPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSSASS---SSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSFSASKNEAFST-V

Query:  FEPQRKSCDVRLRNTLCSLISQDASSSSKV---LAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIE------NSVQEIVEEEIQVE
        FEPQR+SCDVR+RNTL SL  +DA  +S+    L+    EI     +LE   S     + + + +I  S Q N  D   E      + + EIVEEE +  
Subjt:  FEPQRKSCDVRLRNTLCSLISQDASSSSKV---LAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIE------NSVQEIVEEEIQVE

Query:  LGSESVQLQEEFKTMKDHIDLDSHTKKPSGR------GSFWSAASVFSKKLQKWRDKQKEKKQRN---GGGSTTLPVEKPIGRHFRETQSEIADYGFGRR
           E  +  E+F TM+ +      T K + R      GSFWSAASVFSKKLQKWR KQK KK R    G GS+ LPVEK IGR  R+TQSEIA+YG+GRR
Subjt:  LGSESVQLQEEFKTMKDHIDLDSHTKKPSGR------GSFWSAASVFSKKLQKWRDKQKEKKQRN---GGGSTTLPVEKPIGRHFRETQSEIADYGFGRR

Query:  SCDIDP-------RFSLDAGRMSFDDPRYSFDEPRASWDGYLISRTFP--RMPTMLSVVEDAPI--HVFRSDTQIPVEDS--INSTNEEENIPGG-----
        SCD DP       RFSLDAGR+S DDPRYSF+EPRASWDGYLI R     RMP+MLSVVED+P+  HV RSDT IPVE S  ++    +E +PGG     
Subjt:  SCDIDP-------RFSLDAGRMSFDDPRYSFDEPRASWDGYLISRTFP--RMPTMLSVVEDAPI--HVFRSDTQIPVEDS--INSTNEEENIPGG-----

Query:  --------------LLRPGST----IQTLLLVDERASTEIRIPIHCEMTIPVVRGCCLSSRD----------GKSERRVEEVQRVGEGLEIWGLINRRGG
                      L R  ST       +  +DE   T+ R           +R  C S  +          G  E   +  ++      I+GL++R+ G
Subjt:  --------------LLRPGST----IQTLLLVDERASTEIRIPIHCEMTIPVVRGCCLSSRD----------GKSERRVEEVQRVGEGLEIWGLINRRGG

Query:  NKDEEEDRESSRPNGVERSYSESWPELRGDRNGDVKTGGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTN
        NK EEE+R S    GV+R++S SW       N + +  GF+PKM RSNSSVSWRS+   GG     ++++ +   +GKKK                  + 
Subjt:  NKDEEEDRESSRPNGVERSYSESWPELRGDRNGDVKTGGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTN

Query:  VDNGLLRFYLTPLRGSRKGGSGKVKPS
         +NG+L+FYLTP +G R+G      P+
Subjt:  VDNGLLRFYLTPLRGSRKGGSGKVKPS

AT3G09070.1 Protein of unknown function (DUF740)1.1e-11244.01Show/hide
Query:  MNPSTDEQP------LPPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLD-SSSASSSSSSRKPHSTAASALKAIFRPPPPNR------------
        MNP+TD          PPP PP PHR S +C RHP+E FT FCP CLCERLS+LD +++  SSSSS+KP + +A+ALKA+F+P   N             
Subjt:  MNPSTDEQP------LPPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLD-SSSASSSSSSRKPHSTAASALKAIFRPPPPNR------------

Query:  PSSFFPELRRTKSFSASK-NEAFSTVFEPQRKSCDVRLRNTLCSLISQDASSS--SKVLAPAAPEIVVESK-----------NLEDPGSSCVEHIPDGDG
           FFPELRRTKSFSASK NE FS VFEPQR+SCDVRLR++L +L SQD   +  S V      EI VE +           N E    S  E + + + 
Subjt:  PSSFFPELRRTKSFSASK-NEAFSTVFEPQRKSCDVRLRNTLCSLISQDASSS--SKVLAPAAPEIVVESK-----------NLEDPGSSCVEHIPDGDG

Query:  DIRVSGQPNVGDFVIENSVQEI----------VEEEIQVELGSESVQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNG
        +  V      GDF I N   E+          V EEI+  +       +EE K +KD+IDLDS TKKPS R SFWSAASVFSKKLQKWR  QK KK+RNG
Subjt:  DIRVSGQPNVGDFVIENSVQEI----------VEEEIQVELGSESVQLQEEFKTMKDHIDLDSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNG

Query:  G----GSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFSLDA--------------GRMSFDDPRYSFDEPRASWDGYLISRTF-------PRMP
        G    GS  LPVEKPIGR  R+TQSEIADYG+GRRSCD DPRFSLDA              GR+S DDPRYSFDEPRASWDG LI RT        P  P
Subjt:  G----GSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFSLDA--------------GRMSFDDPRYSFDEPRASWDGYLISRTF-------PRMP

Query:  TMLSVVEDAP----IHVFRSDTQIPVEDS------INSTNEEEN---IPGG------------------LLRPGSTIQTLLLVDERASTEIRIPIHCEMT
        +MLSVVEDAP     HV R+D Q PVE+       +N TN   +   IPGG                  L R  S+++           E ++ +   ++
Subjt:  TMLSVVEDAP----IHVFRSDTQIPVEDS------INSTNEEEN---IPGG------------------LLRPGSTIQTLLLVDERASTEIRIPIHCEMT

Query:  IPVVRG----------------------CCLSSRDGKSERRVEEVQRVGEGLEIWGLINRRGGNKDEEEDRESS----RPNG--VERSYSESWPELRGDR
        I    G                        +  R   S    ++ +R G+   I GLI R+  NK EEE+ E      R NG  VERS SESWPEL   R
Subjt:  IPVVRG----------------------CCLSSRDGKSERRVEEVQRVGEGLEIWGLINRRGGNKDEEEDRESS----RPNG--VERSYSESWPELRGDR

Query:  NGDVKTGGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTNVDNGLLRFYLTPLRGSRK---------GGSG
        NG    GG  P+M RSNS+VSWRS+   GG   S+RK N                +  RN+S+R+SP N +NG+L+FYL  ++ SR+         GG G
Subjt:  NGDVKTGGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTNVDNGLLRFYLTPLRGSRK---------GGSG

Query:  KVKPSQAQSIARSVLRLY
            S   SIARSV+RLY
Subjt:  KVKPSQAQSIARSVLRLY

AT3G46990.1 Protein of unknown function (DUF740)1.9e-1925.52Show/hide
Query:  RPSATCPRHPQEHFTA-FCPLCLCERLSLLDSSSASSSSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSFSASKNEAFSTVFEPQRKSCDVR-
        R S++C RHP    T+ FC  CL ERL  +++ S+S ++                             PELRR +S+S  +N + S   +P+R+SCDVR 
Subjt:  RPSATCPRHPQEHFTA-FCPLCLCERLSLLDSSSASSSSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSFSASKNEAFSTVFEPQRKSCDVR-

Query:  LRNTLCSLISQDASS--SSKVLAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIENSVQEIVEEEIQVELGSESVQLQEEFKTMKDH
          ++L  L   D      S +  P  P++  E +  E+      E   DG+ DI+            E   ++IVEE                 KTMK+ 
Subjt:  LRNTLCSLISQDASS--SSKVLAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIENSVQEIVEEEIQVELGSESVQLQEEFKTMKDH

Query:  IDLD--SHTKKPSGRGSFWSAASVFSKKLQKWR-DKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFSLDAGRMSFDDPRYSF
        IDLD  +  KK +G+      ASV S++L+ +  +K+ ++K                      + S  A    GR S D+DPR S D GR+       SF
Subjt:  IDLD--SHTKKPSGRGSFWSAASVFSKKLQKWR-DKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFSLDAGRMSFDDPRYSF

Query:  DEPRASWDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGGLLRPGSTI-----------------QTLLLVDE-RASTEIRI
        ++PR+SWDG LI +++ ++ T+ +V EDA         +  VE+      E+E  PGG ++  +                   Q LL VDE R  +  ++
Subjt:  DEPRASWDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGGLLRPGSTI-----------------QTLLLVDE-RASTEIRI

Query:  P-----------------------------IHCEMTIPVVRG--CCLSSRDGKSERRVE---EVQRVGEGLEIWGLINRRGGNKDEEEDRE--SSRPNGV
                                      +  E    V +G  C  +  +GK +  VE     ++  +G  IWGLI R+   K+E +  +      N V
Subjt:  P-----------------------------IHCEMTIPVVRG--CCLSSRDGKSERRVE---EVQRVGEGLEIWGLINRRGGNKDEEEDRE--SSRPNGV

Query:  ERSYSESWPELRGDRNGDVKTGGFNPKMFRSNSSVSWRSA---------------------SMMGGSFSSSRKSNAESNG--NGKKKKEEPQPVLERNQS
        E S +ES  +LR    G+    G + K+ +S S  + +S                       +  GS +S        +G  NG + K+    +L+RN +
Subjt:  ERSYSESWPELRGDRNGDVKTGGFNPKMFRSNSSVSWRSA---------------------SMMGGSFSSSRKSNAESNG--NGKKKKEEPQPVLERNQS

Query:  -ARHSPTNVDNGLLRFYLTPLRGSRKGGSGK
            S  N++  + RFYL+P++  +   SGK
Subjt:  -ARHSPTNVDNGLLRFYLTPLRGSRKGGSGK

AT5G01170.1 Protein of unknown function (DUF740)1.4e-8642.19Show/hide
Query:  MNPSTDEQP------LPPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSSA-SSSSSSRKPHSTAASALKAIFRPPPPNRPSS---------
        MN S D+ P      L PP    PHR S +C  HP+E F+ FCP CLC+RLS+LD ++A   SSSSRKP S +A +LKA+F+P      +S         
Subjt:  MNPSTDEQP------LPPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSSA-SSSSSSRKPHSTAASALKAIFRPPPPNRPSS---------

Query:  FFPELRRTKSFSASKNEAFSTVFEPQRKSCDVRLRNTLCSLISQDASSSSKVLAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIEN
        FFPELRRTKSFSA  NE FS  FEPQR+SCDVRLR+   +L   +A+S  K+    A E  V    LE    + +E   +       +G+ + G+ V E 
Subjt:  FFPELRRTKSFSASKNEAFSTVFEPQRKSCDVRLRNTLCSLISQDASSSSKVLAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIEN

Query:  SVQEIVEEEIQVELGSESVQLQEEFKTMKDHIDLDSHTKKPSGR---GSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIA
        S  EI EEE             EE K MKD++DL S TKKPS +   GSF+SAASVFSKKLQKW+ KQK KK RNG G          GR     QSEI 
Subjt:  SVQEIVEEEIQVELGSESVQLQEEFKTMKDHIDLDSHTKKPSGR---GSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIA

Query:  DYGFGRRSCDIDPRFSLDA-------GRMSFDDPRYSFDEPRASWDGYLISRT----FPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEEN----
          G GRRS D DPRFSLDA       GR+S DD RYS DEPRASWDG+LI RT     P  P+MLSVVE+AP++  RSD QIP   SI   + + +    
Subjt:  DYGFGRRSCDIDPRFSLDA-------GRMSFDDPRYSFDEPRASWDGYLISRT----FPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEEN----

Query:  IPGG-------LLRPGSTIQTLLL----VDERASTEIRIPIHCEMTIPVVRGCCLSSRDGKSERRVEEVQRVGEGLEIWGLINRRGGNKDEEEDRESSRP
        IPGG          P S+ +   L       +  TE+        +   +    + + + K  +  ++  R      I G I R+G  KD+EE+   SR 
Subjt:  IPGG-------LLRPGSTIQTLLL----VDERASTEIRIPIHCEMTIPVVRGCCLSSRDGKSERRVEEVQRVGEGLEIWGLINRRGGNKDEEEDRESSRP

Query:  NG---VERSYSESWPELRGDRNGDVKTGGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTNVDNGLLRFYL
        N    VERS SESWPE+   RNG+    G  PKM RSNS+VSWRS+   GGS                           RN+S+R+S  + +NG+LRFYL
Subjt:  NG---VERSYSESWPELRGDRNGDVKTGGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTNVDNGLLRFYL

Query:  TPLRGSRK------------GGSGKVKP-----SQAQSIARSVLRLY
        TP+R S K            GG G  K      S   SIAR V+RLY
Subjt:  TPLRGSRK------------GGSGKVKP-----SQAQSIARSVLRLY

AT5G58930.1 Protein of unknown function (DUF740)5.8e-2426.68Show/hide
Query:  RPSATCPRHP-QEHFTAFCPLCLCERLSLLDSSSASSSSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSFSASKNEAFSTVFEPQRKSCDVRL
        R SA C RHP  +  T FC  CL ERLS +++ S+S S+S+                            ELRR +S+S  ++ + S + +P+R+SCDVR 
Subjt:  RPSATCPRHP-QEHFTAFCPLCLCERLSLLDSSSASSSSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSFSASKNEAFSTVFEPQRKSCDVRL

Query:  RNTLCSLISQDASSSSKVLAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIENSVQEIVEEEIQVELGSESVQLQEEFKTMKDHIDL
         +        D    S +  P  P+++ + +  +D G                               +++VEEEI+            E KTMK+ IDL
Subjt:  RNTLCSLISQDASSSSKVLAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIENSVQEIVEEEIQVELGSESVQLQEEFKTMKDHIDL

Query:  DSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFSLDAGRMSFDDPRYSFDEPRAS
        +S  ++    G      SVFS+ L+K+  K   K   +G                            GRRSCD+DPR SLDAGR+       SFDEPRAS
Subjt:  DSHTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFSLDAGRMSFDDPRYSFDEPRAS

Query:  WDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGGLLR--------------PGSTIQTLLLVDE-------RASTEIRIPIH
        WDG LI +T+P++  + SV ED    V  S  +I  E       +E+N PGG  +                S+   LL VDE       + S E     H
Subjt:  WDGYLISRTFPRMPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGGLLR--------------PGSTIQTLLLVDE-------RASTEIRIPIH

Query:  CEMTIPVVR-------------------------GCCLSSRDGKSE--RRVEEVQRVGEGLEIWGLINRRGGNKDEEEDRESSRP---NGVERSYSESWP
            +   R                         GC  +    K +     +  +   +G   WGLI R+      E   E S     N +E S +ES  
Subjt:  CEMTIPVVR-------------------------GCCLSSRDGKSE--RRVEEVQRVGEGLEIWGLINRRGGNKDEEEDRESSRP---NGVERSYSESWP

Query:  ELR----GDRNGDVKTGGFNPKMFRSNSSVS-------WRSASMMGG-----------------SFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHS
        +LR    G+ NGDV     + K+ RS S  +        R AS++ G                    + R+S  E       + +    +   ++   +S
Subjt:  ELR----GDRNGDVKTGGFNPKMFRSNSSVS-------WRSASMMGG-----------------SFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHS

Query:  PTNVDNGLLRFYLTPLRGSRKGGSGK
        P N+ NG++RFYLTPL       SGK
Subjt:  PTNVDNGLLRFYLTPLRGSRKGGSGK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCCCTCCACCGACGAACAACCACTTCCTCCTCCCTTGCCGCCGTTTCCTCACCGTCCTTCCGCCACCTGCCCACGTCACCCACAAGAGCATTTCACTGCTTTCTG
TCCTTTATGTCTCTGTGAGCGTCTCTCTCTTCTTGATTCTTCCTCTGCTTCTTCTTCTTCTTCTTCCCGGAAACCCCATTCCACCGCTGCCTCTGCTCTTAAAGCCATTT
TCAGACCTCCCCCTCCTAATCGACCCTCTTCCTTTTTTCCTGAGCTTCGTCGGACTAAATCCTTCTCTGCTTCTAAGAACGAAGCCTTCTCTACCGTCTTTGAACCACAG
AGGAAATCTTGTGACGTTCGCCTTCGTAACACTCTCTGCTCTCTCATTTCACAGGACGCTTCTTCCTCTTCTAAGGTTCTTGCTCCTGCTGCTCCTGAAATTGTTGTTGA
AAGTAAGAATTTGGAGGACCCCGGTTCGTCCTGCGTTGAACACATACCGGACGGTGACGGAGATATTAGGGTTTCTGGACAACCCAATGTGGGGGATTTTGTGATTGAAA
ACAGTGTGCAGGAGATTGTTGAAGAAGAAATTCAGGTTGAATTAGGGTCAGAATCGGTGCAACTACAAGAGGAGTTCAAAACCATGAAGGATCATATAGATCTCGATTCA
CACACGAAGAAGCCATCCGGAAGAGGAAGCTTTTGGTCGGCTGCTTCGGTCTTCAGTAAGAAGCTGCAGAAATGGAGGGATAAACAGAAGGAGAAGAAGCAGAGAAACGG
CGGTGGATCTACGACATTGCCGGTGGAGAAGCCAATCGGACGTCATTTCAGAGAAACCCAGTCGGAAATTGCTGATTATGGATTTGGCCGACGCTCTTGCGATATTGATC
CGAGATTCTCTCTCGATGCTGGGCGAATGTCCTTCGACGATCCCCGTTATTCCTTCGACGAACCTAGGGCATCTTGGGACGGGTATTTGATCAGTAGAACGTTCCCAAGA
ATGCCCACCATGCTTTCCGTCGTTGAAGATGCCCCTATCCATGTTTTCCGTTCCGATACCCAAATTCCCGTCGAAGACTCCATAAATTCAACCAATGAAGAAGAAAATAT
CCCCGGGGGTCTTCTCAGACCCGGGAGTACTATTCAGACTCTTCTTCTCGTCGACGAAAGAGCCTCGACCGAGATTCGAATTCCAATTCATTGCGAGATGACTATTCCGG
TCGTTCGAGGATGCTGCCTCAGTAGTCGGGACGGCAAATCGGAAAGAAGAGTCGAAGAAGTCCAAAGGGTGGGGGAAGGGTTGGAAATCTGGGGATTGATTAACCGGCGA
GGTGGAAATAAAGATGAGGAGGAAGATAGAGAGAGTAGTAGACCCAATGGCGTGGAGCGGTCGTATTCGGAGTCGTGGCCAGAGCTGCGTGGGGATCGAAATGGGGATGT
CAAAACAGGAGGATTCAATCCCAAAATGTTTAGGAGTAACAGCAGTGTTAGTTGGAGGAGTGCAAGTATGATGGGTGGATCTTTCAGTAGTTCAAGGAAAAGCAATGCAG
AATCTAATGGTAATGGGAAGAAGAAGAAAGAGGAGCCACAGCCAGTGTTGGAGAGGAATCAAAGTGCACGACATTCTCCGACGAACGTCGATAATGGACTTCTTCGATTC
TATTTGACGCCATTGCGGGGCAGCCGGAAAGGCGGGTCGGGGAAGGTGAAACCAAGTCAAGCTCAGTCCATTGCTAGAAGTGTTCTTCGACTGTATTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATCCCTCCACCGACGAACAACCACTTCCTCCTCCCTTGCCGCCGTTTCCTCACCGTCCTTCCGCCACCTGCCCACGTCACCCACAAGAGCATTTCACTGCTTTCTG
TCCTTTATGTCTCTGTGAGCGTCTCTCTCTTCTTGATTCTTCCTCTGCTTCTTCTTCTTCTTCTTCCCGGAAACCCCATTCCACCGCTGCCTCTGCTCTTAAAGCCATTT
TCAGACCTCCCCCTCCTAATCGACCCTCTTCCTTTTTTCCTGAGCTTCGTCGGACTAAATCCTTCTCTGCTTCTAAGAACGAAGCCTTCTCTACCGTCTTTGAACCACAG
AGGAAATCTTGTGACGTTCGCCTTCGTAACACTCTCTGCTCTCTCATTTCACAGGACGCTTCTTCCTCTTCTAAGGTTCTTGCTCCTGCTGCTCCTGAAATTGTTGTTGA
AAGTAAGAATTTGGAGGACCCCGGTTCGTCCTGCGTTGAACACATACCGGACGGTGACGGAGATATTAGGGTTTCTGGACAACCCAATGTGGGGGATTTTGTGATTGAAA
ACAGTGTGCAGGAGATTGTTGAAGAAGAAATTCAGGTTGAATTAGGGTCAGAATCGGTGCAACTACAAGAGGAGTTCAAAACCATGAAGGATCATATAGATCTCGATTCA
CACACGAAGAAGCCATCCGGAAGAGGAAGCTTTTGGTCGGCTGCTTCGGTCTTCAGTAAGAAGCTGCAGAAATGGAGGGATAAACAGAAGGAGAAGAAGCAGAGAAACGG
CGGTGGATCTACGACATTGCCGGTGGAGAAGCCAATCGGACGTCATTTCAGAGAAACCCAGTCGGAAATTGCTGATTATGGATTTGGCCGACGCTCTTGCGATATTGATC
CGAGATTCTCTCTCGATGCTGGGCGAATGTCCTTCGACGATCCCCGTTATTCCTTCGACGAACCTAGGGCATCTTGGGACGGGTATTTGATCAGTAGAACGTTCCCAAGA
ATGCCCACCATGCTTTCCGTCGTTGAAGATGCCCCTATCCATGTTTTCCGTTCCGATACCCAAATTCCCGTCGAAGACTCCATAAATTCAACCAATGAAGAAGAAAATAT
CCCCGGGGGTCTTCTCAGACCCGGGAGTACTATTCAGACTCTTCTTCTCGTCGACGAAAGAGCCTCGACCGAGATTCGAATTCCAATTCATTGCGAGATGACTATTCCGG
TCGTTCGAGGATGCTGCCTCAGTAGTCGGGACGGCAAATCGGAAAGAAGAGTCGAAGAAGTCCAAAGGGTGGGGGAAGGGTTGGAAATCTGGGGATTGATTAACCGGCGA
GGTGGAAATAAAGATGAGGAGGAAGATAGAGAGAGTAGTAGACCCAATGGCGTGGAGCGGTCGTATTCGGAGTCGTGGCCAGAGCTGCGTGGGGATCGAAATGGGGATGT
CAAAACAGGAGGATTCAATCCCAAAATGTTTAGGAGTAACAGCAGTGTTAGTTGGAGGAGTGCAAGTATGATGGGTGGATCTTTCAGTAGTTCAAGGAAAAGCAATGCAG
AATCTAATGGTAATGGGAAGAAGAAGAAAGAGGAGCCACAGCCAGTGTTGGAGAGGAATCAAAGTGCACGACATTCTCCGACGAACGTCGATAATGGACTTCTTCGATTC
TATTTGACGCCATTGCGGGGCAGCCGGAAAGGCGGGTCGGGGAAGGTGAAACCAAGTCAAGCTCAGTCCATTGCTAGAAGTGTTCTTCGACTGTATTAA
Protein sequenceShow/hide protein sequence
MNPSTDEQPLPPPLPPFPHRPSATCPRHPQEHFTAFCPLCLCERLSLLDSSSASSSSSSRKPHSTAASALKAIFRPPPPNRPSSFFPELRRTKSFSASKNEAFSTVFEPQ
RKSCDVRLRNTLCSLISQDASSSSKVLAPAAPEIVVESKNLEDPGSSCVEHIPDGDGDIRVSGQPNVGDFVIENSVQEIVEEEIQVELGSESVQLQEEFKTMKDHIDLDS
HTKKPSGRGSFWSAASVFSKKLQKWRDKQKEKKQRNGGGSTTLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFSLDAGRMSFDDPRYSFDEPRASWDGYLISRTFPR
MPTMLSVVEDAPIHVFRSDTQIPVEDSINSTNEEENIPGGLLRPGSTIQTLLLVDERASTEIRIPIHCEMTIPVVRGCCLSSRDGKSERRVEEVQRVGEGLEIWGLINRR
GGNKDEEEDRESSRPNGVERSYSESWPELRGDRNGDVKTGGFNPKMFRSNSSVSWRSASMMGGSFSSSRKSNAESNGNGKKKKEEPQPVLERNQSARHSPTNVDNGLLRF
YLTPLRGSRKGGSGKVKPSQAQSIARSVLRLY