; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr003996 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr003996
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionarmadillo repeat-containing protein LFR-like
Genome locationtig00002539:60211..67961
RNA-Seq ExpressionSgr003996
SyntenySgr003996
Gene Ontology termsGO:0006338 - chromatin remodeling (biological process)
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0048366 - leaf development (biological process)
GO:0048653 - anther development (biological process)
GO:0005654 - nucleoplasm (cellular component)
GO:0016514 - SWI/SNF complex (cellular component)
GO:0035060 - brahma complex (cellular component)
GO:0031491 - nucleosome binding (molecular function)
InterPro domainsIPR006502 - Protein of unknown function PDDEXK-like
IPR016024 - Armadillo-type fold
IPR021906 - SWI/SNF-like complex subunit BAF250/Osa


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138190.1 uncharacterized protein LOC111009423 [Momordica charantia]9.3e-18786.42Show/hide
Query:  MKIQPIDIDIQTVREQVRTDSAKPLFKSRLRRLFDRPFPSVLRISAVEKPILGEPAQFSSKDGG-------GTEFEPSSVCLDKMVQNFIEDSNEKQPPV
        MKIQPIDID +T REQ+RTDSAKP+FKSRLRRLFDRPFPSVLRISAVEKPI+GEPAQFSSKDGG       GTEFEP+SVCLDKMVQNFIEDSNEKQP  
Subjt:  MKIQPIDIDIQTVREQVRTDSAKPLFKSRLRRLFDRPFPSVLRISAVEKPILGEPAQFSSKDGG-------GTEFEPSSVCLDKMVQNFIEDSNEKQPPV

Query:  VKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSICK
        VKYGRNRCNCFN NSNDSSDDEFDVFGGFGESITSGSSGGDACD+LKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSIC+
Subjt:  VKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSICK

Query:  SKWEKSPSFPAGEYEYVDVIVDGERLLVDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS
        SKW+KSPSFPAGEYEYVDV VDGERLL+DIDFRSEFEIARSTGTYKAILQTLP VFVGKSDRL QIVSIVSEAARQSLKKKGMHFPPWRKAEY LAKWLS
Subjt:  SKWEKSPSFPAGEYEYVDVIVDGERLLVDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS

Query:  PPTRNADSLNNASPEL-QP-EIKAQIQNDDPLVTDTDCGEFELIFG--EESAPTESVPGDTES-SSPTSSSGENKQPRATASPWQPPAIKPKSIDKGAKL
        PPTR  DS   A PE  QP EI+ QI ND  LVTDTDCGEFELIFG  EESAP+ +VPGD ES SS +SSSG+NK+P  TASPWQPPAIKPKS+DKGAK 
Subjt:  PPTRNADSLNNASPEL-QP-EIKAQIQNDDPLVTDTDCGEFELIFG--EESAPTESVPGDTES-SSPTSSSGENKQPRATASPWQPPAIKPKSIDKGAKL

Query:  PIVTG
         IVTG
Subjt:  PIVTG

XP_022937081.1 uncharacterized protein LOC111443488 [Cucurbita moschata]1.6e-18686.26Show/hide
Query:  MKIQPIDIDIQTVREQVRTDSAKPLFKSRLRRLFDRPFPSVLRISAVEKPILGEPAQFSSKDGGGTEFEPSSVCLDKMVQNFIEDSNEKQPPVVKYGRNR
        MKIQPIDID++TVREQVRTD+AKPLFK RLRRLFDRPFPSVLR S VEKPI+ EPAQFS     GTEFEPSSVCLDKMVQNFIE+SNEKQP  VKYGRN 
Subjt:  MKIQPIDIDIQTVREQVRTDSAKPLFKSRLRRLFDRPFPSVLRISAVEKPILGEPAQFSSKDGGGTEFEPSSVCLDKMVQNFIEDSNEKQPPVVKYGRNR

Query:  CNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSICKSKWEKSP
        CNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNK+HKRKDDLRRIVTDGLSSLGYNSSICKSKWEKSP
Subjt:  CNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSICKSKWEKSP

Query:  SFPAGEYEYVDVIVDGERLLVDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSPPTRNAD
        SFPAGEYEY+DVIVDGERLLVDIDF+SEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS P R AD
Subjt:  SFPAGEYEYVDVIVDGERLLVDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSPPTRNAD

Query:  SLNNASPELQPEIKAQIQNDDPLVTDTDCGEFELIFGEESAPTESVPGDTESSSPTSSSGENKQPRATASPWQPPAIKPKSIDKGAKLPIVTG
        S+ N+ PE +PE+     ++DPLVTDTDCGEFELIFGEES+P+ESV GD  S S TS S + K P  TASPWQPPA+KPKSIDKGAK  IVTG
Subjt:  SLNNASPELQPEIKAQIQNDDPLVTDTDCGEFELIFGEESAPTESVPGDTESSSPTSSSGENKQPRATASPWQPPAIKPKSIDKGAKLPIVTG

XP_022974410.1 uncharacterized protein LOC111473084 [Cucurbita maxima]1.6e-18686.51Show/hide
Query:  MKIQPIDIDIQTVREQVRTDSAKPLFKSRLRRLFDRPFPSVLRISAVEKPILGEPAQFSSKDGGGTEFEPSSVCLDKMVQNFIEDSNEKQPPVVKYGRNR
        MKIQPIDID++TVREQVRTD+AKPLFKSRLRRLFDRPFPSVLR S VEKPI+GEPAQFS     GTEFEPSSVCLDKMVQNFIE+S+EKQP  VKYGRN 
Subjt:  MKIQPIDIDIQTVREQVRTDSAKPLFKSRLRRLFDRPFPSVLRISAVEKPILGEPAQFSSKDGGGTEFEPSSVCLDKMVQNFIEDSNEKQPPVVKYGRNR

Query:  CNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSICKSKWEKSP
        CNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDA DILKSLIPCASVTERNLLADASKIVEKHNK+HKRKDDLRRIVTDGLSSLGYNSSICKSKWEKSP
Subjt:  CNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSICKSKWEKSP

Query:  SFPAGEYEYVDVIVDGERLLVDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSPPTRNAD
        SFPAGEYEY+DVIVDGERLLVDIDF+SEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS P R AD
Subjt:  SFPAGEYEYVDVIVDGERLLVDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSPPTRNAD

Query:  SLNNASPELQPEIKAQIQNDDPLVTDTDCGEFELIFGEESAPTESVPGDTESSSPTSSSGENKQPRATASPWQPPAIKPKSIDKGAKLPIVTG
        S+ N+ PE +PE+     ++DPLVTDTDCGEFELIFGEESAP ESV GD  S S TS S + K P  TASPWQPPA+KPKSIDKGAK  IVTG
Subjt:  SLNNASPELQPEIKAQIQNDDPLVTDTDCGEFELIFGEESAPTESVPGDTESSSPTSSSGENKQPRATASPWQPPAIKPKSIDKGAKLPIVTG

XP_023539817.1 uncharacterized protein LOC111800384 [Cucurbita pepo subsp. pepo]4.2e-18786.26Show/hide
Query:  MKIQPIDIDIQTVREQVRTDSAKPLFKSRLRRLFDRPFPSVLRISAVEKPILGEPAQFSSKDGGGTEFEPSSVCLDKMVQNFIEDSNEKQPPVVKYGRNR
        MKIQPIDID++TVREQVRTD+AKPLFKSRLRRLFDRPFPSVLR S VEKPI+GEPAQFS     GTEFEPSSVCLDKMVQNFIE+SNEKQP  VKYGRNR
Subjt:  MKIQPIDIDIQTVREQVRTDSAKPLFKSRLRRLFDRPFPSVLRISAVEKPILGEPAQFSSKDGGGTEFEPSSVCLDKMVQNFIEDSNEKQPPVVKYGRNR

Query:  CNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSICKSKWEKSP
        CNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNK+HKRKDDLRRIVTDGLSSLGYNSSICKSKWEKSP
Subjt:  CNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSICKSKWEKSP

Query:  SFPAGEYEYVDVIVDGERLLVDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSPPTRNAD
        SFPAGEYEY+DVIVDGERLL+DIDF+SEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS P R AD
Subjt:  SFPAGEYEYVDVIVDGERLLVDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSPPTRNAD

Query:  SLNNASPELQPEIKAQIQNDDPLVTDTDCGEFELIFGEESAPTESVPGDTESSSPTSSSGENKQPRATASPWQPPAIKPKSIDKGAKLPIVTG
         + N+ PE +PE+     ++DPLVTDTDCGEFELIFGEESAP+ES   D  S S TS S + K P  TASPWQPPA+KPKSIDKGAK  IVTG
Subjt:  SLNNASPELQPEIKAQIQNDDPLVTDTDCGEFELIFGEESAPTESVPGDTESSSPTSSSGENKQPRATASPWQPPAIKPKSIDKGAKLPIVTG

XP_038898340.1 uncharacterized protein LOC120086017 [Benincasa hispida]1.3e-18887.88Show/hide
Query:  MKIQPIDIDIQTVREQVRTDSAKPLFKSRLRRLFDRPFPSVLRISAVEKPIL-GEPAQFSSKD-GGGTEFEPSSVCLDKMVQNFIEDSNEKQPPVVKYGR
        MKIQPIDID+QTVREQVRT+SAKP+FKSRLRRLFDRPFPSVLRI+AVEKPI+ GEPAQFSSKD GGGTE EPSSVCLDKMVQNFIE++NEKQP  VKYGR
Subjt:  MKIQPIDIDIQTVREQVRTDSAKPLFKSRLRRLFDRPFPSVLRISAVEKPIL-GEPAQFSSKD-GGGTEFEPSSVCLDKMVQNFIEDSNEKQPPVVKYGR

Query:  NRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSICKSKWEK
        NRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILK LIPC SVTERNLLADASKIVEKHNKIHKRKDDLRRIVTD LSSLGYNSSICKSKWEK
Subjt:  NRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSICKSKWEK

Query:  SPSFPAGEYEYVDVIVDGERLLVDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSPPTRN
        SPSFPAGEYEYVDVI+DGERLLVDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVS+AARQSLKKKGMHFPPWRKAEYMLAKWLS PTR 
Subjt:  SPSFPAGEYEYVDVIVDGERLLVDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSPPTRN

Query:  ADSLNNASPELQP-EIKAQIQNDDPLVTDTDCGEFELIFGEESAPTESVPGDTESSSPTSSSGENKQPRATASPWQPPAIKPKSIDKGAKLPIVTG
        ADSL  A+P+ +P E K  IQN DPLVTDTDCGEFELIFGEES P+ S+ GD+E     S SGENK P  TA PWQPPAIKPKSIDKGAK  IVTG
Subjt:  ADSLNNASPELQP-EIKAQIQNDDPLVTDTDCGEFELIFGEESAPTESVPGDTESSSPTSSSGENKQPRATASPWQPPAIKPKSIDKGAKLPIVTG

TrEMBL top hitse value%identityAlignment
A0A0A0L8I2 Uncharacterized protein3.0e-18384.29Show/hide
Query:  MKIQPIDIDIQTVREQVRTDSAKPLFKSRLRRLFDRPFPSVLRISAVEKPIL-GEPAQFSSKD---GGGTEFEPSSVCLDKMVQNFIEDSNEKQPPVVKY
        MKIQPIDID+QTVREQVRT+SAKP+FKSRLRRLFDRPFPSVLRISAVEKPI+ GE AQFSSKD   GGGTE EPSSVCLDKMVQNFIE++NE+QP  VKY
Subjt:  MKIQPIDIDIQTVREQVRTDSAKPLFKSRLRRLFDRPFPSVLRISAVEKPIL-GEPAQFSSKD---GGGTEFEPSSVCLDKMVQNFIEDSNEKQPPVVKY

Query:  GRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSICKSKW
        GRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILK LIPC SVTERNLLADASKIVEKHNKIHKRKDDLR+IVTD LS LGYNSSICKSKW
Subjt:  GRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSICKSKW

Query:  EKSPSFPAGEYEYVDVIVDGERLLVDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSPPT
        EKSPSFPAGEYEYVDVI+DGERLL+DIDFRSEFEIARSTG YK ILQTLPY+FVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS PT
Subjt:  EKSPSFPAGEYEYVDVIVDGERLLVDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSPPT

Query:  RNADSLNNASPELQP-EIKAQIQNDDPLVTDTDCGEFELIFGEESAPTE---SVPGDTESSSPTSSSGENKQPRATASPWQPPAIKPKSIDKGAKLPIVT
        R ADS++NASP+ +P E K+ I  +DPLVT TDCGEFELIFGEES+      S+ GDTE     S +GENK P  +A PWQPPAIKPKSIDKGAK  IVT
Subjt:  RNADSLNNASPELQP-EIKAQIQNDDPLVTDTDCGEFELIFGEESAPTE---SVPGDTESSSPTSSSGENKQPRATASPWQPPAIKPKSIDKGAKLPIVT

Query:  G
        G
Subjt:  G

A0A5A7U4P5 Uncharacterized protein7.4e-18284.63Show/hide
Query:  MKIQPIDIDIQTVREQVRTDSAKPLFKSRLRRLFDRPFPSVLRISAVEKPI-LGEPAQFSSKD--GGGTEFEPSSVCLDKMVQNFIEDSNEKQPPVVKYG
        MKIQPIDID+QT REQVRT+SAKP+FKSRLRRLFDRPFPSVLRISAVEKPI +GE AQFSSKD  GGGT+ EPSSVCLDKMVQNFIE++NEKQP  VKYG
Subjt:  MKIQPIDIDIQTVREQVRTDSAKPLFKSRLRRLFDRPFPSVLRISAVEKPI-LGEPAQFSSKD--GGGTEFEPSSVCLDKMVQNFIEDSNEKQPPVVKYG

Query:  RNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSICKSKWE
        RNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILK LIPC SVTERNLLADASKIVEKHNKIHKRKDDLR+IVTD LSSLGYNSSICKSKWE
Subjt:  RNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSICKSKWE

Query:  KSPSFPAGEYEYVDVIVDGERLLVDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSPPTR
        KSPSFPAGEYEYVDVI+DGERLL+DIDFRSEFEIARSTGTYK ILQTLPY+FVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS PTR
Subjt:  KSPSFPAGEYEYVDVIVDGERLLVDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSPPTR

Query:  NADSLNNASPELQP-EIKAQIQNDDPLVTDTDCGEFELIFGEESAPTESVPGDTESSSPTSSSGENKQPRATASPWQPPAIKPKSIDKGAKLPIVTG
         ADSL+NASP+ +P E K+ I  +DPLVT TDCGEFELIFGEES P    P  + S    S   ENK P  +A  WQPPAIKPKSIDKGAK  IVTG
Subjt:  NADSLNNASPELQP-EIKAQIQNDDPLVTDTDCGEFELIFGEESAPTESVPGDTESSSPTSSSGENKQPRATASPWQPPAIKPKSIDKGAKLPIVTG

A0A6J1C8R5 uncharacterized protein LOC1110094234.5e-18786.42Show/hide
Query:  MKIQPIDIDIQTVREQVRTDSAKPLFKSRLRRLFDRPFPSVLRISAVEKPILGEPAQFSSKDGG-------GTEFEPSSVCLDKMVQNFIEDSNEKQPPV
        MKIQPIDID +T REQ+RTDSAKP+FKSRLRRLFDRPFPSVLRISAVEKPI+GEPAQFSSKDGG       GTEFEP+SVCLDKMVQNFIEDSNEKQP  
Subjt:  MKIQPIDIDIQTVREQVRTDSAKPLFKSRLRRLFDRPFPSVLRISAVEKPILGEPAQFSSKDGG-------GTEFEPSSVCLDKMVQNFIEDSNEKQPPV

Query:  VKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSICK
        VKYGRNRCNCFN NSNDSSDDEFDVFGGFGESITSGSSGGDACD+LKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSIC+
Subjt:  VKYGRNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSICK

Query:  SKWEKSPSFPAGEYEYVDVIVDGERLLVDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS
        SKW+KSPSFPAGEYEYVDV VDGERLL+DIDFRSEFEIARSTGTYKAILQTLP VFVGKSDRL QIVSIVSEAARQSLKKKGMHFPPWRKAEY LAKWLS
Subjt:  SKWEKSPSFPAGEYEYVDVIVDGERLLVDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS

Query:  PPTRNADSLNNASPEL-QP-EIKAQIQNDDPLVTDTDCGEFELIFG--EESAPTESVPGDTES-SSPTSSSGENKQPRATASPWQPPAIKPKSIDKGAKL
        PPTR  DS   A PE  QP EI+ QI ND  LVTDTDCGEFELIFG  EESAP+ +VPGD ES SS +SSSG+NK+P  TASPWQPPAIKPKS+DKGAK 
Subjt:  PPTRNADSLNNASPEL-QP-EIKAQIQNDDPLVTDTDCGEFELIFG--EESAPTESVPGDTES-SSPTSSSGENKQPRATASPWQPPAIKPKSIDKGAKL

Query:  PIVTG
         IVTG
Subjt:  PIVTG

A0A6J1F9C5 uncharacterized protein LOC1114434887.7e-18786.26Show/hide
Query:  MKIQPIDIDIQTVREQVRTDSAKPLFKSRLRRLFDRPFPSVLRISAVEKPILGEPAQFSSKDGGGTEFEPSSVCLDKMVQNFIEDSNEKQPPVVKYGRNR
        MKIQPIDID++TVREQVRTD+AKPLFK RLRRLFDRPFPSVLR S VEKPI+ EPAQFS     GTEFEPSSVCLDKMVQNFIE+SNEKQP  VKYGRN 
Subjt:  MKIQPIDIDIQTVREQVRTDSAKPLFKSRLRRLFDRPFPSVLRISAVEKPILGEPAQFSSKDGGGTEFEPSSVCLDKMVQNFIEDSNEKQPPVVKYGRNR

Query:  CNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSICKSKWEKSP
        CNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNK+HKRKDDLRRIVTDGLSSLGYNSSICKSKWEKSP
Subjt:  CNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSICKSKWEKSP

Query:  SFPAGEYEYVDVIVDGERLLVDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSPPTRNAD
        SFPAGEYEY+DVIVDGERLLVDIDF+SEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS P R AD
Subjt:  SFPAGEYEYVDVIVDGERLLVDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSPPTRNAD

Query:  SLNNASPELQPEIKAQIQNDDPLVTDTDCGEFELIFGEESAPTESVPGDTESSSPTSSSGENKQPRATASPWQPPAIKPKSIDKGAKLPIVTG
        S+ N+ PE +PE+     ++DPLVTDTDCGEFELIFGEES+P+ESV GD  S S TS S + K P  TASPWQPPA+KPKSIDKGAK  IVTG
Subjt:  SLNNASPELQPEIKAQIQNDDPLVTDTDCGEFELIFGEESAPTESVPGDTESSSPTSSSGENKQPRATASPWQPPAIKPKSIDKGAKLPIVTG

A0A6J1IDW1 uncharacterized protein LOC1114730847.7e-18786.51Show/hide
Query:  MKIQPIDIDIQTVREQVRTDSAKPLFKSRLRRLFDRPFPSVLRISAVEKPILGEPAQFSSKDGGGTEFEPSSVCLDKMVQNFIEDSNEKQPPVVKYGRNR
        MKIQPIDID++TVREQVRTD+AKPLFKSRLRRLFDRPFPSVLR S VEKPI+GEPAQFS     GTEFEPSSVCLDKMVQNFIE+S+EKQP  VKYGRN 
Subjt:  MKIQPIDIDIQTVREQVRTDSAKPLFKSRLRRLFDRPFPSVLRISAVEKPILGEPAQFSSKDGGGTEFEPSSVCLDKMVQNFIEDSNEKQPPVVKYGRNR

Query:  CNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSICKSKWEKSP
        CNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDA DILKSLIPCASVTERNLLADASKIVEKHNK+HKRKDDLRRIVTDGLSSLGYNSSICKSKWEKSP
Subjt:  CNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSICKSKWEKSP

Query:  SFPAGEYEYVDVIVDGERLLVDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSPPTRNAD
        SFPAGEYEY+DVIVDGERLLVDIDF+SEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLS P R AD
Subjt:  SFPAGEYEYVDVIVDGERLLVDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSPPTRNAD

Query:  SLNNASPELQPEIKAQIQNDDPLVTDTDCGEFELIFGEESAPTESVPGDTESSSPTSSSGENKQPRATASPWQPPAIKPKSIDKGAKLPIVTG
        S+ N+ PE +PE+     ++DPLVTDTDCGEFELIFGEESAP ESV GD  S S TS S + K P  TASPWQPPA+KPKSIDKGAK  IVTG
Subjt:  SLNNASPELQPEIKAQIQNDDPLVTDTDCGEFELIFGEESAPTESVPGDTESSSPTSSSGENKQPRATASPWQPPAIKPKSIDKGAKLPIVTG

SwissProt top hitse value%identityAlignment
Q6YTW6 Armadillo repeat-containing protein LFR4.3e-11857.35Show/hide
Query:  GGNVGGTSAPPAKRGRPFGS-----ASSIAAAAAAAETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTPL
        G + GG  + PAKRGRPFGS     A++ AAAAA  +  AP+AL+GPSL V T+ +DQNNKRIVLALQSGLKSE+ WALN LT+LSFKEKDD+R+D+TPL
Subjt:  GGNVGGTSAPPAKRGRPFGS-----ASSIAAAAAAAETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTPL

Query:  AKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNE-FEALGSNGLRPGSSVSE-ATCHALKPSPRHWWLDEDGLFNLDDEGRAERQQC
        AK+PGLLDALLQVIDDWRDIA+P+D  K PRVRTLG N++++GFG+E  E + S+   P    ++ A     K     +  DE+GLFN+DDEGR E+QQC
Subjt:  AKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNE-FEALGSNGLRPGSSVSE-ATCHALKPSPRHWWLDEDGLFNLDDEGRAERQQC

Query:  AVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITGLSDERDVLCHQHRMFRLKFKVLNTITFVWIRLKALILIKPSYIRITEKGAVEAIMGMLG
        AV+ASNIIRNFSFMPENE +M QHRH LETVFQC+E+  T   ++ +++ +          VL+        L+     KPS+I+ITEK AV+AIMGML 
Subjt:  AVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITGLSDERDVLCHQHRMFRLKFKVLNTITFVWIRLKALILIKPSYIRITEKGAVEAIMGMLG

Query:  SAVKVWHCAAAELLGRLIINLIMSLSFFPLLPSRPYEH-------PALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEYA-GSS
        S+++VWHCAAAEL+GRLIIN        P +P + Y+        PA+DAQAAA+ ALYN+ EVNMD R+KLASERWA+DRLLKV+KTPHPVPE    +S
Subjt:  SAVKVWHCAAAELLGRLIINLIMSLSFFPLLPSRPYEH-------PALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKTPHPVPEYA-GSS

Query:  YDIGSLVSEPQTGLY
          + SLVSEPQ  ++
Subjt:  YDIGSLVSEPQTGLY

Q9LS90 Armadillo repeat-containing protein LFR6.8e-13262.26Show/hide
Query:  MQKRDQNKLGGNVGGTSAPPAKRGRPFGSAS----SIAAAAAAAETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDD
        MQKR+  K GGN GG+S PPAKRGRPFGS S    + AAAAAAA+ ++PSALLGPSL VH SF +QNN+RIVLALQSGLKSE+TWALNTLTLLSFKEK+D
Subjt:  MQKRDQNKLGGNVGGTSAPPAKRGRPFGSAS----SIAAAAAAAETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDD

Query:  MRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVSEATCHAL--KPSPRH----WWLDEDGLFN
        +R+D  PLAKI GLLDALL +IDDWRDIALP+DL +  RVRTLG N+SVTGFGNE++AL S    PGS +  +   AL  K + +H    WW++EDGLFN
Subjt:  MRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVSEATCHAL--KPSPRH----WWLDEDGLFN

Query:  LDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITGLSDERDVLCHQHRMFRLKFKVLNTITFV--WIRLKALILIKPSYIRI
        LDDEGR+E+Q CA++ASN+IRNFSFMP+NEV+MAQHRH LETVFQCI +H+T   DE            L    L TI  +   + L+    +K SYI I
Subjt:  LDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITGLSDERDVLCHQHRMFRLKFKVLNTITFV--WIRLKALILIKPSYIRI

Query:  TEKGAVEAIMGMLGSAVKVWHCAAAELLGRLIINLIMSLSFFPLLPS------RPYEHPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIK
         EK AV+A++G+L S+VK W+CAAAELLGRLIIN        PL+P             A+DAQAAAVGALYNLVEVNMDCR+KLASERWA+DRLLKVIK
Subjt:  TEKGAVEAIMGMLGSAVKVWHCAAAELLGRLIINLIMSLSFFPLLPS------RPYEHPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIK

Query:  TPHPVPEYA-GSSYDIGSLVSEPQ
        TPHPVPE    ++  + +LVSEPQ
Subjt:  TPHPVPEYA-GSSYDIGSLVSEPQ

Arabidopsis top hitse value%identityAlignment
AT2G38820.1 Protein of unknown function (DUF506)6.2e-7247.17Show/hide
Query:  MKIQPIDIDIQTVREQVRTDSAKPLFKSRLRRLFDRPFPSVLRISAVEKPILGEPAQFSSKDGGGTEFEPSSVCLDKMVQNFIEDSN--EKQPPVVKYGR
        MKIQPID +     E    ++ + + KSRL+RLF+R F +      V +   G   +     G   +FEPSSVCL KMV NF+ED+N  EKQ    + GR
Subjt:  MKIQPIDIDIQTVREQVRTDSAKPLFKSRLRRLFDRPFPSVLRISAVEKPILGEPAQFSSKDGGGTEFEPSSVCLDKMVQNFIEDSN--EKQPPVVKYGR

Query:  NRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSICKSKWEK
        +RCNCF+G+  +SSDDE +             S G+AC+ILKSL+ C S+  RNLL D +KI E                        Y++++CKS+WEK
Subjt:  NRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSICKSKWEK

Query:  SPSFPAGEYEYVDVIVDGERLLVDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSPPTRN
        SPS PAGEYEYVDVI+ GERLL+DIDF+S+FEIAR+T TYK++LQTLPY+FVGK+DRL +I+ ++ +AA+QSLKKKG+H PPWR+AEY+ +KWLS   R 
Subjt:  SPSFPAGEYEYVDVIVDGERLLVDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSPPTRN

Query:  ADSLNNASPELQPEIKAQ
          + N    +   E+ A+
Subjt:  ADSLNNASPELQPEIKAQ

AT2G38820.2 Protein of unknown function (DUF506)7.6e-7849.06Show/hide
Query:  MKIQPIDIDIQTVREQVRTDSAKPLFKSRLRRLFDRPFPSVLRISAVEKPILGEPAQFSSKDGGGTEFEPSSVCLDKMVQNFIEDSN--EKQPPVVKYGR
        MKIQPID +     E    ++ + + KSRL+RLF+R F +      V +   G   +     G   +FEPSSVCL KMV NF+ED+N  EKQ    + GR
Subjt:  MKIQPIDIDIQTVREQVRTDSAKPLFKSRLRRLFDRPFPSVLRISAVEKPILGEPAQFSSKDGGGTEFEPSSVCLDKMVQNFIEDSN--EKQPPVVKYGR

Query:  NRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSICKSKWEK
        +RCNCF+G+  +SSDDE +             S G+AC+ILKSL+ C S+  RNLL D +KI E       +     + V +GL SLGY++++CKS+WEK
Subjt:  NRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSICKSKWEK

Query:  SPSFPAGEYEYVDVIVDGERLLVDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSPPTRN
        SPS PAGEYEYVDVI+ GERLL+DIDF+S+FEIAR+T TYK++LQTLPY+FVGK+DRL +I+ ++ +AA+QSLKKKG+H PPWR+AEY+ +KWLS   R 
Subjt:  SPSFPAGEYEYVDVIVDGERLLVDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSPPTRN

Query:  ADSLNNASPELQPEIKAQ
          + N    +   E+ A+
Subjt:  ADSLNNASPELQPEIKAQ

AT3G22970.1 Protein of unknown function (DUF506)1.2e-9956.87Show/hide
Query:  MKIQPIDIDIQTVREQVRTDSAKPLFKSRLRRLFDRPFPSVLRIS---AVEKPILGEPAQFSSKDGGGTEFEPSSVCLDKMVQNFIEDSNEKQPPVVKYG
        MKIQPIDID      +  + + KP+ KSRL+RLFDRPF +VLR S     EKP +    +     G  TEFEPSSVCL KMVQNFIE++NEKQ    K G
Subjt:  MKIQPIDIDIQTVREQVRTDSAKPLFKSRLRRLFDRPFPSVLRIS---AVEKPILGEPAQFSSKDGGGTEFEPSSVCLDKMVQNFIEDSNEKQPPVVKYG

Query:  RNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSICKSKWE
        RNRCNCFNGN++ SSDDE D+FGG          G DA D LKSLIPC +V ERNLLADA+KIV+K NK  KRKDD+++IV +GL SL YNSSICKSKW+
Subjt:  RNRCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSICKSKWE

Query:  KSPSFPAGEYEYVDVIVDGERLLVDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSPPTR
        KSPSFPAGEYEY+DVI+  ERL++D+DFRSEF+IAR T  YK +LQ+LP++FVGKSDRL QIV ++SEAA+QSLKKKGM FPPWRKAEYM +KWLS  TR
Subjt:  KSPSFPAGEYEYVDVIVDGERLLVDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSPPTR

Query:  NADSLNNASPELQPEIKAQIQNDDPLVTDTDCGEFELIFGEESAPTESVPGDTESSSPTSSSGE
         +  + + +  +     A    D  +  + D  E EL+F E+      +     SSSPT    +
Subjt:  NADSLNNASPELQPEIKAQIQNDDPLVTDTDCGEFELIFGEESAPTESVPGDTESSSPTSSSGE

AT3G22990.1 ARM repeat superfamily protein4.8e-13362.26Show/hide
Query:  MQKRDQNKLGGNVGGTSAPPAKRGRPFGSAS----SIAAAAAAAETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDD
        MQKR+  K GGN GG+S PPAKRGRPFGS S    + AAAAAAA+ ++PSALLGPSL VH SF +QNN+RIVLALQSGLKSE+TWALNTLTLLSFKEK+D
Subjt:  MQKRDQNKLGGNVGGTSAPPAKRGRPFGSAS----SIAAAAAAAETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDD

Query:  MRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVSEATCHAL--KPSPRH----WWLDEDGLFN
        +R+D  PLAKI GLLDALL +IDDWRDIALP+DL +  RVRTLG N+SVTGFGNE++AL S    PGS +  +   AL  K + +H    WW++EDGLFN
Subjt:  MRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLGANSSVTGFGNEFEALGSNGLRPGSSVSEATCHAL--KPSPRH----WWLDEDGLFN

Query:  LDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITGLSDERDVLCHQHRMFRLKFKVLNTITFV--WIRLKALILIKPSYIRI
        LDDEGR+E+Q CA++ASN+IRNFSFMP+NEV+MAQHRH LETVFQCI +H+T   DE            L    L TI  +   + L+    +K SYI I
Subjt:  LDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITGLSDERDVLCHQHRMFRLKFKVLNTITFV--WIRLKALILIKPSYIRI

Query:  TEKGAVEAIMGMLGSAVKVWHCAAAELLGRLIINLIMSLSFFPLLPS------RPYEHPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIK
         EK AV+A++G+L S+VK W+CAAAELLGRLIIN        PL+P             A+DAQAAAVGALYNLVEVNMDCR+KLASERWA+DRLLKVIK
Subjt:  TEKGAVEAIMGMLGSAVKVWHCAAAELLGRLIINLIMSLSFFPLLPS------RPYEHPALDAQAAAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIK

Query:  TPHPVPEYA-GSSYDIGSLVSEPQ
        TPHPVPE    ++  + +LVSEPQ
Subjt:  TPHPVPEYA-GSSYDIGSLVSEPQ

AT4G14620.1 Protein of unknown function (DUF506)1.8e-8756.27Show/hide
Query:  MKIQPIDIDIQTVREQVRTDSAKPLFKSRLRRLFDRPFPSVLRISAVEKPILGEPAQFSSKDG--GGTEFEPSSVCLDKMVQNFIEDSNEKQPPVVKYGR
        MKIQPI+ D+   R +    S KP+ KSRL+RL DRPF    RIS  EK ++       S DG   GTEFEPS   L KMVQN++E++N+KQ    K GR
Subjt:  MKIQPIDIDIQTVREQVRTDSAKPLFKSRLRRLFDRPFPSVLRISAVEKPILGEPAQFSSKDG--GGTEFEPSSVCLDKMVQNFIEDSNEKQPPVVKYGR

Query:  N--RCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSICKSKW
        N  RCNCFNGN ND SDDE D F                 D  KSLI C S  E++LL +A+KI+EK NK  KRKD+LR+IV D LSSLGY+SSICKSKW
Subjt:  N--RCNCFNGNSNDSSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSICKSKW

Query:  EKSPSFPAGEYEYVDVIVDGERLLVDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSPPT
        +K+ S PAGEYEY+DVIV+GERL++DIDFRSEFEIAR T  YK +LQ+LP +FVGKSDR+ QIVSIVSEA++QSLKKKGMHFPPWRKA+YM AKWLS  T
Subjt:  EKSPSFPAGEYEYVDVIVDGERLLVDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSPPT

Query:  RNADSLNNASPELQPEIKAQIQNDDPLVTDTDCGEFELIFGEE
        RN+       P +    K   +       + D  E ELIF E+
Subjt:  RNADSLNNASPELQPEIKAQIQNDDPLVTDTDCGEFELIFGEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGATCCAACCGATCGATATCGATATTCAGACGGTGAGGGAACAGGTTCGGACCGACTCGGCCAAGCCTCTGTTCAAATCGCGACTGAGGCGACTTTTCGATCGGCC
ATTTCCGAGCGTTCTGAGAATTTCTGCGGTAGAGAAGCCTATTCTTGGGGAACCAGCTCAGTTTAGCAGCAAAGATGGAGGAGGGACTGAGTTCGAGCCTAGCTCCGTTT
GCTTAGATAAGATGGTGCAAAATTTCATCGAGGACAGCAACGAGAAACAACCACCGGTCGTCAAGTATGGCCGCAATCGCTGCAATTGCTTCAACGGCAACAGTAATGAC
AGCTCCGACGACGAGTTCGATGTATTTGGGGGTTTTGGTGAATCGATCACCTCCGGATCATCTGGCGGCGATGCATGTGATATACTCAAGAGTTTAATTCCTTGCGCGAG
CGTTACGGAGAGAAACCTCTTAGCTGACGCTTCGAAGATCGTCGAGAAGCATAACAAAATTCACAAACGAAAAGACGATTTACGCAGAATCGTCACCGACGGCCTCTCAT
CTCTCGGTTACAATTCCTCCATCTGCAAGTCCAAATGGGAGAAATCCCCCTCTTTCCCAGCTGGTGAATACGAATACGTGGATGTGATTGTCGACGGAGAGAGATTGCTG
GTAGACATCGATTTTAGATCCGAATTTGAGATAGCTCGTTCGACGGGAACGTACAAGGCGATCCTCCAGACGTTGCCTTACGTCTTCGTCGGGAAATCGGATCGTCTCGG
ACAAATTGTTTCGATCGTATCGGAAGCCGCGAGACAGAGCTTGAAGAAAAAAGGCATGCACTTTCCGCCATGGAGGAAGGCCGAGTACATGCTTGCGAAATGGCTCTCTC
CTCCCACCAGAAACGCCGACTCTCTCAACAACGCTTCTCCAGAGCTCCAACCCGAAATCAAAGCTCAAATCCAAAACGACGACCCGCTGGTTACAGACACCGATTGTGGA
GAATTCGAGTTGATCTTTGGTGAGGAATCGGCGCCGACCGAGTCCGTCCCCGGCGATACTGAATCGTCGTCACCGACGTCAAGTTCCGGTGAAAATAAACAGCCGAGGGC
GACTGCTTCCCCGTGGCAACCCCCTGCGATCAAGCCAAAGAGCATAGATAAAGGAGCTAAGCTCCCGATTGTTACCGGATCGGATCGCCATAGCCAATCAGCCGAAGTTC
AGAGAGAACAAGACGTAGGCATGCAGAAGAGAGATCAGAACAAGTTGGGCGGAAATGTTGGCGGTACCTCTGCGCCTCCGGCTAAGCGAGGCCGTCCGTTCGGCAGCGCA
AGCAGCATCGCCGCTGCTGCTGCTGCCGCCGAGACGTTGGCTCCATCGGCTCTCCTAGGGCCTTCTCTTCATGTTCATACTTCCTTCGCGGATCAAAACAATAAAAGGAT
AGTGTTGGCTCTACAGAGTGGATTGAAGAGTGAATTGACGTGGGCACTGAACACTCTCACTCTGCTCTCCTTCAAAGAGAAGGATGATATGCGCAAAGACTCCACTCCTC
TGGCTAAAATTCCCGGCTTGCTCGACGCTCTTCTTCAAGTTATAGATGACTGGCGTGATATAGCACTTCCGAGGGATCTTGTAAAGAAGCCAAGGGTCAGAACTCTAGGT
GCAAATTCTTCTGTAACGGGATTTGGGAATGAATTTGAGGCATTGGGCTCAAATGGCCTGAGACCTGGTTCTTCAGTTTCAGAGGCAACATGTCATGCTCTTAAACCATC
TCCTCGACATTGGTGGCTTGATGAAGATGGTCTATTTAATCTGGATGACGAAGGACGAGCAGAAAGACAGCAGTGTGCCGTTTCTGCTTCAAATATCATCCGAAACTTCT
CTTTCATGCCAGAGAATGAAGTTATTATGGCTCAACATCGACATACTCTTGAAACAGTGTTTCAGTGTATAGAAGAACATATTACAGGACTATCTGATGAAAGGGACGTC
TTATGTCATCAGCATAGAATGTTTCGTTTGAAGTTTAAAGTGCTCAATACAATTACTTTTGTCTGGATACGTCTGAAAGCTTTAATTCTCATCAAACCATCCTACATCAG
AATAACAGAGAAAGGAGCAGTTGAAGCCATCATGGGTATGCTTGGATCTGCTGTCAAAGTTTGGCACTGTGCTGCTGCAGAATTACTTGGACGCTTGATAATAAATCTGA
TAATGAGCCTTTCCTTCTTCCCTTTGCTCCCCAGTCGACCTTATGAGCATCCAGCATTAGATGCACAAGCAGCAGCTGTTGGCGCACTGTATAACCTTGTCGAAGTTAAT
ATGGACTGCAGAATAAAGCTGGCAAGCGAGCGATGGGCGATCGATAGACTCCTTAAGGTAATCAAAACGCCTCACCCAGTTCCAGAATATGCAGGAAGCAGCTATGATAT
TGGGAGTCTTGTATCTGAGCCACAGACAGGGCTTTACTTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGATCCAACCGATCGATATCGATATTCAGACGGTGAGGGAACAGGTTCGGACCGACTCGGCCAAGCCTCTGTTCAAATCGCGACTGAGGCGACTTTTCGATCGGCC
ATTTCCGAGCGTTCTGAGAATTTCTGCGGTAGAGAAGCCTATTCTTGGGGAACCAGCTCAGTTTAGCAGCAAAGATGGAGGAGGGACTGAGTTCGAGCCTAGCTCCGTTT
GCTTAGATAAGATGGTGCAAAATTTCATCGAGGACAGCAACGAGAAACAACCACCGGTCGTCAAGTATGGCCGCAATCGCTGCAATTGCTTCAACGGCAACAGTAATGAC
AGCTCCGACGACGAGTTCGATGTATTTGGGGGTTTTGGTGAATCGATCACCTCCGGATCATCTGGCGGCGATGCATGTGATATACTCAAGAGTTTAATTCCTTGCGCGAG
CGTTACGGAGAGAAACCTCTTAGCTGACGCTTCGAAGATCGTCGAGAAGCATAACAAAATTCACAAACGAAAAGACGATTTACGCAGAATCGTCACCGACGGCCTCTCAT
CTCTCGGTTACAATTCCTCCATCTGCAAGTCCAAATGGGAGAAATCCCCCTCTTTCCCAGCTGGTGAATACGAATACGTGGATGTGATTGTCGACGGAGAGAGATTGCTG
GTAGACATCGATTTTAGATCCGAATTTGAGATAGCTCGTTCGACGGGAACGTACAAGGCGATCCTCCAGACGTTGCCTTACGTCTTCGTCGGGAAATCGGATCGTCTCGG
ACAAATTGTTTCGATCGTATCGGAAGCCGCGAGACAGAGCTTGAAGAAAAAAGGCATGCACTTTCCGCCATGGAGGAAGGCCGAGTACATGCTTGCGAAATGGCTCTCTC
CTCCCACCAGAAACGCCGACTCTCTCAACAACGCTTCTCCAGAGCTCCAACCCGAAATCAAAGCTCAAATCCAAAACGACGACCCGCTGGTTACAGACACCGATTGTGGA
GAATTCGAGTTGATCTTTGGTGAGGAATCGGCGCCGACCGAGTCCGTCCCCGGCGATACTGAATCGTCGTCACCGACGTCAAGTTCCGGTGAAAATAAACAGCCGAGGGC
GACTGCTTCCCCGTGGCAACCCCCTGCGATCAAGCCAAAGAGCATAGATAAAGGAGCTAAGCTCCCGATTGTTACCGGATCGGATCGCCATAGCCAATCAGCCGAAGTTC
AGAGAGAACAAGACGTAGGCATGCAGAAGAGAGATCAGAACAAGTTGGGCGGAAATGTTGGCGGTACCTCTGCGCCTCCGGCTAAGCGAGGCCGTCCGTTCGGCAGCGCA
AGCAGCATCGCCGCTGCTGCTGCTGCCGCCGAGACGTTGGCTCCATCGGCTCTCCTAGGGCCTTCTCTTCATGTTCATACTTCCTTCGCGGATCAAAACAATAAAAGGAT
AGTGTTGGCTCTACAGAGTGGATTGAAGAGTGAATTGACGTGGGCACTGAACACTCTCACTCTGCTCTCCTTCAAAGAGAAGGATGATATGCGCAAAGACTCCACTCCTC
TGGCTAAAATTCCCGGCTTGCTCGACGCTCTTCTTCAAGTTATAGATGACTGGCGTGATATAGCACTTCCGAGGGATCTTGTAAAGAAGCCAAGGGTCAGAACTCTAGGT
GCAAATTCTTCTGTAACGGGATTTGGGAATGAATTTGAGGCATTGGGCTCAAATGGCCTGAGACCTGGTTCTTCAGTTTCAGAGGCAACATGTCATGCTCTTAAACCATC
TCCTCGACATTGGTGGCTTGATGAAGATGGTCTATTTAATCTGGATGACGAAGGACGAGCAGAAAGACAGCAGTGTGCCGTTTCTGCTTCAAATATCATCCGAAACTTCT
CTTTCATGCCAGAGAATGAAGTTATTATGGCTCAACATCGACATACTCTTGAAACAGTGTTTCAGTGTATAGAAGAACATATTACAGGACTATCTGATGAAAGGGACGTC
TTATGTCATCAGCATAGAATGTTTCGTTTGAAGTTTAAAGTGCTCAATACAATTACTTTTGTCTGGATACGTCTGAAAGCTTTAATTCTCATCAAACCATCCTACATCAG
AATAACAGAGAAAGGAGCAGTTGAAGCCATCATGGGTATGCTTGGATCTGCTGTCAAAGTTTGGCACTGTGCTGCTGCAGAATTACTTGGACGCTTGATAATAAATCTGA
TAATGAGCCTTTCCTTCTTCCCTTTGCTCCCCAGTCGACCTTATGAGCATCCAGCATTAGATGCACAAGCAGCAGCTGTTGGCGCACTGTATAACCTTGTCGAAGTTAAT
ATGGACTGCAGAATAAAGCTGGCAAGCGAGCGATGGGCGATCGATAGACTCCTTAAGGTAATCAAAACGCCTCACCCAGTTCCAGAATATGCAGGAAGCAGCTATGATAT
TGGGAGTCTTGTATCTGAGCCACAGACAGGGCTTTACTTCTAG
Protein sequenceShow/hide protein sequence
MKIQPIDIDIQTVREQVRTDSAKPLFKSRLRRLFDRPFPSVLRISAVEKPILGEPAQFSSKDGGGTEFEPSSVCLDKMVQNFIEDSNEKQPPVVKYGRNRCNCFNGNSND
SSDDEFDVFGGFGESITSGSSGGDACDILKSLIPCASVTERNLLADASKIVEKHNKIHKRKDDLRRIVTDGLSSLGYNSSICKSKWEKSPSFPAGEYEYVDVIVDGERLL
VDIDFRSEFEIARSTGTYKAILQTLPYVFVGKSDRLGQIVSIVSEAARQSLKKKGMHFPPWRKAEYMLAKWLSPPTRNADSLNNASPELQPEIKAQIQNDDPLVTDTDCG
EFELIFGEESAPTESVPGDTESSSPTSSSGENKQPRATASPWQPPAIKPKSIDKGAKLPIVTGSDRHSQSAEVQREQDVGMQKRDQNKLGGNVGGTSAPPAKRGRPFGSA
SSIAAAAAAAETLAPSALLGPSLHVHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDSTPLAKIPGLLDALLQVIDDWRDIALPRDLVKKPRVRTLG
ANSSVTGFGNEFEALGSNGLRPGSSVSEATCHALKPSPRHWWLDEDGLFNLDDEGRAERQQCAVSASNIIRNFSFMPENEVIMAQHRHTLETVFQCIEEHITGLSDERDV
LCHQHRMFRLKFKVLNTITFVWIRLKALILIKPSYIRITEKGAVEAIMGMLGSAVKVWHCAAAELLGRLIINLIMSLSFFPLLPSRPYEHPALDAQAAAVGALYNLVEVN
MDCRIKLASERWAIDRLLKVIKTPHPVPEYAGSSYDIGSLVSEPQTGLYF