; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr018715 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr018715
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionNuclear transcription factor Y subunit B-8
Genome locationtig00153207:887704..900971
RNA-Seq ExpressionSgr018715
SyntenySgr018715
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0007005 - mitochondrion organization (biological process)
GO:0009738 - abscisic acid-activated signaling pathway (biological process)
GO:0031930 - mitochondria-nucleus signaling pathway (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005739 - mitochondrion (cellular component)
GO:0003824 - catalytic activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domainsIPR003734 - Domain of unknown function DUF155
IPR003956 - Transcription factor, NFYB/HAP3, conserved site
IPR003958 - Transcription factor CBF/NF-Y/archaeal histone domain
IPR009072 - Histone-fold
IPR036038 - Aminotransferase-like, PLP-dependent enzymes


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601043.1 hypothetical protein SDJN03_06276, partial [Cucurbita argyrosperma subsp. sororia]6.0e-18063.28Show/hide
Query:  LSSNGVVLQGSEAPPLTTFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPELLSESNKTTTKL--------------------------
        L SNGV+LQGSE PP+ TFLETHPGAYTTTR+HNNASSILFWDRHMKRLTQSVKILSNSTP+LLSESN+T  KL                          
Subjt:  LSSNGVVLQGSEAPPLTTFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPELLSESNKTTTKL--------------------------

Query:  ------ERNEGEELIITALVSVSLEKLSESDDVVDAENVKEALDVHVYVGDYVPREFGVPENGANLAVVGRGRDIAAAKYSDWVRRRKSLEKLRPPSVTE
              ERN  EEL +T LVSV+LE L ESD VVD E VKEA+ VH +V +YVPREFGVPENGANLAVVGRGRD AAAKYSDWVRRRKSLEKLRPPSVTE
Subjt:  ------ERNEGEELIITALVSVSLEKLSESDDVVDAENVKEALDVHVYVGDYVPREFGVPENGANLAVVGRGRDIAAAKYSDWVRRRKSLEKLRPPSVTE

Query:  LLLSNDGDQILEGCLTNFFVVCRKVSEHKIKADAFGSLKSF----------------------------------------------------DSLRILE
        LLLSNDGDQILEGCLTNFFVV RK +    +A    S  +                                                     +SLR++E
Subjt:  LLLSNDGDQILEGCLTNFFVVCRKVSEHKIKADAFGSLKSF----------------------------------------------------DSLRILE

Query:  HVKTICIPGIWDLLDSKTWSDISWNKKSFKDAPGMITGTI---------------------------------------------------------QDL
        HV TIC+P IWDLL+SKTW +ISWNKKSFKDAPG+IT TI                                                         QDL
Subjt:  HVKTICIPGIWDLLDSKTWSDISWNKKSFKDAPGMITGTI---------------------------------------------------------QDL

Query:  ADMAEAPTSPAGGSHESGGEQSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMAT
        ADMAE PTSP GGSHESGGEQSP TGG REQDR+LPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISF+TSEASDKCQKEKRKTINGDDLLWAMAT
Subjt:  ADMAEAPTSPAGGSHESGGEQSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMAT

Query:  LGFEDYIDPLKSYLTRYRECDAKGSSRGGDESAKRDAVGALPGQNS-QYMQPGALTYINTQ
        LGFE+YIDPLKSYLTRYRECDAKGSSRGGDESAKRDAVGA+PGQNS QYMQ GALTYINTQ
Subjt:  LGFEDYIDPLKSYLTRYRECDAKGSSRGGDESAKRDAVGALPGQNS-QYMQPGALTYINTQ

XP_008445840.1 PREDICTED: uncharacterized protein LOC103488742 [Cucumis melo]4.0e-15283.76Show/hide
Query:  MWRTIDAHLRSVRLLPNLSARSSSSSSSS---SSSLLFTSGRSLLARSSS----SLLSRHRSLT---------------RVLCFGLGIRRFGGSTCGLVV
        MWRTIDAHLRSVRLLP+LS+ SSSSSSSS   SSS LF+SGRS L RS S    S L +  S+T                V CF LGI+RF GS  G++V
Subjt:  MWRTIDAHLRSVRLLPNLSARSSSSSSSS---SSSLLFTSGRSLLARSSS----SLLSRHRSLT---------------RVLCFGLGIRRFGGSTCGLVV

Query:  LARCIASSVHTLEWNEPVSCSEVGDGGFRSVGEGISDGEADEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNKLNFIPPSSRMTNYVVLKFGDLCNVNTLG
        LARCI SS +TLEWNEPVSCSEVGDGGFRSV EGISDGE DEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNK NFIPPSSRMTNYVVLKFGDLCNVNT  
Subjt:  LARCIASSVHTLEWNEPVSCSEVGDGGFRSVGEGISDGEADEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNKLNFIPPSSRMTNYVVLKFGDLCNVNTLG

Query:  GSISGSDCCYMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGMRTIGSVLGQSIALDYYGRQ
         SISGSDCC+MVVFQYGSIVLFNVREH+VDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDG+RTIGSVLGQSIALDYYGRQ
Subjt:  GSISGSDCCYMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGMRTIGSVLGQSIALDYYGRQ

Query:  VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFER
        VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFER
Subjt:  VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFER

XP_022139336.1 uncharacterized protein LOC111010276 [Momordica charantia]1.1e-15786.17Show/hide
Query:  MWRTIDAHLRSVRLLPNLSARSSSSSSSSSSSLLFTSGRSLLARSSSSLL-----SRHRSLTRVL-------------CFGLGIRRFGGSTCGLVVLARC
        MWRTIDAHLRSVRL+P LSA    SSSSSSSSLLF +GRS L RSSSSLL     S   +L R L             C GLGIRRFG S+CGLVVLARC
Subjt:  MWRTIDAHLRSVRLLPNLSARSSSSSSSSSSSLLFTSGRSLLARSSSSLL-----SRHRSLTRVL-------------CFGLGIRRFGGSTCGLVVLARC

Query:  IASSVHTLEWNEPVSCSEVGDGGFRSVGEGISDGEADEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNKLNFIPPSSRMTNYVVLKFGDLCNVNTLGGSIS
        I SSVHTLEWNEPVSCSEVGDGGFRS+GEG+SDGE DEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNK NFIPPSSRMTNYVVLKFGDLCN NTL  SI+
Subjt:  IASSVHTLEWNEPVSCSEVGDGGFRSVGEGISDGEADEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNKLNFIPPSSRMTNYVVLKFGDLCNVNTLGGSIS

Query:  GSDCCYMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGMRTIGSVLGQSIALDYYGRQVDGM
        GSDCC+MVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDG+RTIGSVLGQSIALDYYGRQVDGM
Subjt:  GSDCCYMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGMRTIGSVLGQSIALDYYGRQVDGM

Query:  VAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFER
        VAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFER
Subjt:  VAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFER

XP_022942113.1 uncharacterized protein LOC111447285 [Cucurbita moschata]1.1e-14982.05Show/hide
Query:  MWRTIDAHLRSVRLLPNLSARSSSSSSSSSSS----LLFTSGRSLLARSSSSLLS------------------RHRSLTRVLCFGLGIRRFGGSTCGLVV
        MWRTIDAHLRSVRLLP+LS +SSSSSSSSSSS     LF SGRS  ARSSSSLLS                      L+ VLCFGL   R GGS+CG VV
Subjt:  MWRTIDAHLRSVRLLPNLSARSSSSSSSSSSS----LLFTSGRSLLARSSSSLLS------------------RHRSLTRVLCFGLGIRRFGGSTCGLVV

Query:  LARCIASSVHTLEWNEPVSCSEVGDGGFRSVGEGISDGEADEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNKLNFIPPSSRMTNYVVLKFGDLCNVNTLG
        LARCI +SV+TLEWNEPVSCSEVG+G FRS  +G SDGEADEV EDSRPSIPVRA+F STSVDLR LVDQNK NFIPPSSRMTNYVVLKFGDLC+VN+ G
Subjt:  LARCIASSVHTLEWNEPVSCSEVGDGGFRSVGEGISDGEADEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNKLNFIPPSSRMTNYVVLKFGDLCNVNTLG

Query:  GSISGSDCCYMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGMRTIGSVLGQSIALDYYGRQ
         SISGSDCC+MVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVRE PAL+TWMEGGLDYIMLQYLNIDG+RTIGSVLGQSIALDYYGRQ
Subjt:  GSISGSDCCYMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGMRTIGSVLGQSIALDYYGRQ

Query:  VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFER
        VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFER
Subjt:  VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFER

XP_038891437.1 uncharacterized protein LOC120080856 [Benincasa hispida]7.6e-15986.21Show/hide
Query:  MWRTIDAHLRSVRLLPNLSARSSSSSS-SSSSSLLFTSGRSLLARSSSSLLS------------------RHRSLTRVLCFGLGIRRFGGSTCGLVVLAR
        MWR+IDAHLRSVRLLPNLSA SSSSSS SSS+S LF+SGRS  ARSSS+ LS                      L+ V CFGLGI+RFGGS CG++VLA+
Subjt:  MWRTIDAHLRSVRLLPNLSARSSSSSS-SSSSSLLFTSGRSLLARSSSSLLS------------------RHRSLTRVLCFGLGIRRFGGSTCGLVVLAR

Query:  CIASSVHTLEWNEPVSCSEVGDGGFRSVGEGISDGEADEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNKLNFIPPSSRMTNYVVLKFGDLCNVNTLGGSI
        CI SSVHTLEWNEPV CSEVGDGGFRSVGEGISDGE DEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNK NFIPPSSRMTNYVVLKFGDLCNVNT G SI
Subjt:  CIASSVHTLEWNEPVSCSEVGDGGFRSVGEGISDGEADEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNKLNFIPPSSRMTNYVVLKFGDLCNVNTLGGSI

Query:  SGSDCCYMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGMRTIGSVLGQSIALDYYGRQVDG
        SGSDCC+MVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDG+RTIGSVLGQSIALDYYGRQVDG
Subjt:  SGSDCCYMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGMRTIGSVLGQSIALDYYGRQVDG

Query:  MVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFER
        MVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFER
Subjt:  MVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFER

TrEMBL top hitse value%identityAlignment
A0A0A0KNM3 DUF155 domain-containing protein7.0e-15083.05Show/hide
Query:  MWRTIDAHLRSVRLLPNLSARSSSSSSSSSSSLLFTSGRSLLARSSSS------------LLSRHRS-------LTRVLCFGLGIRRFGGSTCGLVVLAR
        MWRTIDAHLRSVRLLP+L    SSSSSSSSSS  F+SGRS + RS S+             LS+  +       L+ V CF LGI+R  GS  G++VLAR
Subjt:  MWRTIDAHLRSVRLLPNLSARSSSSSSSSSSSLLFTSGRSLLARSSSS------------LLSRHRS-------LTRVLCFGLGIRRFGGSTCGLVVLAR

Query:  CIASSVHTLEWNEPVSCSEVGDGGFRSVGEGISDGEADEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNKLNFIPPSSRMTNYVVLKFGDLCNVNTLGGSI
        CI SSV++LEWNEPVSCSEVGDGGFRSV EGISDGE DEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNK NFIPPSSRMTNYVVLKFGDLCNVNT G SI
Subjt:  CIASSVHTLEWNEPVSCSEVGDGGFRSVGEGISDGEADEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNKLNFIPPSSRMTNYVVLKFGDLCNVNTLGGSI

Query:  SGSDCCYMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGMRTIGSVLGQSIALDYYGRQVDG
         GSDCC+MVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDG+RTIGSVLGQSIALDYYGRQVDG
Subjt:  SGSDCCYMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGMRTIGSVLGQSIALDYYGRQVDG

Query:  MVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFER
        MVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFER
Subjt:  MVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFER

A0A1S3BEH0 uncharacterized protein LOC1034887422.0e-15283.76Show/hide
Query:  MWRTIDAHLRSVRLLPNLSARSSSSSSSS---SSSLLFTSGRSLLARSSS----SLLSRHRSLT---------------RVLCFGLGIRRFGGSTCGLVV
        MWRTIDAHLRSVRLLP+LS+ SSSSSSSS   SSS LF+SGRS L RS S    S L +  S+T                V CF LGI+RF GS  G++V
Subjt:  MWRTIDAHLRSVRLLPNLSARSSSSSSSS---SSSLLFTSGRSLLARSSS----SLLSRHRSLT---------------RVLCFGLGIRRFGGSTCGLVV

Query:  LARCIASSVHTLEWNEPVSCSEVGDGGFRSVGEGISDGEADEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNKLNFIPPSSRMTNYVVLKFGDLCNVNTLG
        LARCI SS +TLEWNEPVSCSEVGDGGFRSV EGISDGE DEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNK NFIPPSSRMTNYVVLKFGDLCNVNT  
Subjt:  LARCIASSVHTLEWNEPVSCSEVGDGGFRSVGEGISDGEADEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNKLNFIPPSSRMTNYVVLKFGDLCNVNTLG

Query:  GSISGSDCCYMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGMRTIGSVLGQSIALDYYGRQ
         SISGSDCC+MVVFQYGSIVLFNVREH+VDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDG+RTIGSVLGQSIALDYYGRQ
Subjt:  GSISGSDCCYMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGMRTIGSVLGQSIALDYYGRQ

Query:  VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFER
        VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFER
Subjt:  VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFER

A0A6J1CCC7 uncharacterized protein LOC1110102765.3e-15886.17Show/hide
Query:  MWRTIDAHLRSVRLLPNLSARSSSSSSSSSSSLLFTSGRSLLARSSSSLL-----SRHRSLTRVL-------------CFGLGIRRFGGSTCGLVVLARC
        MWRTIDAHLRSVRL+P LSA    SSSSSSSSLLF +GRS L RSSSSLL     S   +L R L             C GLGIRRFG S+CGLVVLARC
Subjt:  MWRTIDAHLRSVRLLPNLSARSSSSSSSSSSSLLFTSGRSLLARSSSSLL-----SRHRSLTRVL-------------CFGLGIRRFGGSTCGLVVLARC

Query:  IASSVHTLEWNEPVSCSEVGDGGFRSVGEGISDGEADEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNKLNFIPPSSRMTNYVVLKFGDLCNVNTLGGSIS
        I SSVHTLEWNEPVSCSEVGDGGFRS+GEG+SDGE DEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNK NFIPPSSRMTNYVVLKFGDLCN NTL  SI+
Subjt:  IASSVHTLEWNEPVSCSEVGDGGFRSVGEGISDGEADEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNKLNFIPPSSRMTNYVVLKFGDLCNVNTLGGSIS

Query:  GSDCCYMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGMRTIGSVLGQSIALDYYGRQVDGM
        GSDCC+MVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDG+RTIGSVLGQSIALDYYGRQVDGM
Subjt:  GSDCCYMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGMRTIGSVLGQSIALDYYGRQVDGM

Query:  VAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFER
        VAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFER
Subjt:  VAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFER

A0A6J1FTZ0 uncharacterized protein LOC1114472855.3e-15082.05Show/hide
Query:  MWRTIDAHLRSVRLLPNLSARSSSSSSSSSSS----LLFTSGRSLLARSSSSLLS------------------RHRSLTRVLCFGLGIRRFGGSTCGLVV
        MWRTIDAHLRSVRLLP+LS +SSSSSSSSSSS     LF SGRS  ARSSSSLLS                      L+ VLCFGL   R GGS+CG VV
Subjt:  MWRTIDAHLRSVRLLPNLSARSSSSSSSSSSS----LLFTSGRSLLARSSSSLLS------------------RHRSLTRVLCFGLGIRRFGGSTCGLVV

Query:  LARCIASSVHTLEWNEPVSCSEVGDGGFRSVGEGISDGEADEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNKLNFIPPSSRMTNYVVLKFGDLCNVNTLG
        LARCI +SV+TLEWNEPVSCSEVG+G FRS  +G SDGEADEV EDSRPSIPVRA+F STSVDLR LVDQNK NFIPPSSRMTNYVVLKFGDLC+VN+ G
Subjt:  LARCIASSVHTLEWNEPVSCSEVGDGGFRSVGEGISDGEADEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNKLNFIPPSSRMTNYVVLKFGDLCNVNTLG

Query:  GSISGSDCCYMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGMRTIGSVLGQSIALDYYGRQ
         SISGSDCC+MVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVRE PAL+TWMEGGLDYIMLQYLNIDG+RTIGSVLGQSIALDYYGRQ
Subjt:  GSISGSDCCYMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGMRTIGSVLGQSIALDYYGRQ

Query:  VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFER
        VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFER
Subjt:  VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFER

A0A7J0DZ00 Sporulation RMD1-like protein, putative5.0e-14847.65Show/hide
Query:  MW--RTIDAHLRSVRLLPNLSARSSSSSSSSSSSLLFTSGRSLL-------ARSSSSLLSRHRSLTRVLCFGLGIRRFGGSTCGLVVLARCIASSVHTLE
        MW  R+IDAH +++ LLP+LS+ SS+S+  +  +LL  S  S           S SS LS  ++L  +L       R    +   +       +S  TLE
Subjt:  MW--RTIDAHLRSVRLLPNLSARSSSSSSSSSSSLLFTSGRSLL-------ARSSSSLLSRHRSLTRVLCFGLGIRRFGGSTCGLVVLARCIASSVHTLE

Query:  WNEPVSCSEVGDGGFRSVGEGISDGEADEV--EEDSRPSIPVRAYFFSTSVDLRSLVDQNKLNFIPPSSRMTNYVVLKFGDL-CNVNTLGGSISGSDCCY
        WNEP+S SEVGD               DEV  +EDS+PSIPVRAYFFSTSVDLR+LV+QNK NFIPP+SRMTNYVVL+FGD+    N LG +ISGSDC Y
Subjt:  WNEPVSCSEVGDGGFRSVGEGISDGEADEV--EEDSRPSIPVRAYFFSTSVDLRSLVDQNKLNFIPPSSRMTNYVVLKFGDL-CNVNTLGGSISGSDCCY

Query:  MVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGMRTIGSVLGQSIALDYYGRQVDGMVAEFTD
        MVVFQYGSIVLFNVREH+VDGYLKIVEKHASGLLPEMRKDEYEVREKP LNTWM+GGLDYIMLQYLNIDG+RTIGSVLGQSIALDYY RQVDGMVAEFTD
Subjt:  MVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGMRTIGSVLGQSIALDYYGRQVDGMVAEFTD

Query:  INREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERVIAAVEENGELPILSSNGVVLQGSEAPPLTTFLETHPGAYTTTRSHNNASSILFWDRHM
        INR ME TG F M+RKKLFQLVGKANSNLADVILKLGLFER                                                  S + W    
Subjt:  INREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERVIAAVEENGELPILSSNGVVLQGSEAPPLTTFLETHPGAYTTTRSHNNASSILFWDRHM

Query:  KRLTQSVKILSNSTPELLSESNKTTTKLERNEGEELIITALVSVSLEKLSESDDVVDAENVKEALDVHVYVGDYVPREFGVPENGANLAVVGRGRDIAAA
                                                                DA+        +  + +Y+  EF + +  A+L            
Subjt:  KRLTQSVKILSNSTPELLSESNKTTTKLERNEGEELIITALVSVSLEKLSESDDVVDAENVKEALDVHVYVGDYVPREFGVPENGANLAVVGRGRDIAAA

Query:  KYSDWVRRRKSLEKLRPPSVTELLLSNDGDQILEGCLTNFFVVCRKVSEHKIKADAFGSLKSFDSLRILEHVKTICIPGIWDLLDSKTWSDISWNKKSFK
                     KL                              K  EH I              R L+ +                   +   K  F 
Subjt:  KYSDWVRRRKSLEKLRPPSVTELLLSNDGDQILEGCLTNFFVVCRKVSEHKIKADAFGSLKSFDSLRILEHVKTICIPGIWDLLDSKTWSDISWNKKSFK

Query:  DAPGMITGTIQDLADMAEAPTSPAGGSHESGGEQSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRK
        +   +I    + L  +  A    A GS ESGG+QSP    VREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISF+TSEASDKCQKEKRK
Subjt:  DAPGMITGTIQDLADMAEAPTSPAGGSHESGGEQSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRK

Query:  TINGDDLLWAMATLGFEDYIDPLKSYLTRYRECDAKGSSRGGDESAKRDAVGALPGQNSQYMQPGA
        TINGDDLLWAMATLGFEDYIDPLK+YL RYRE D KGS+R GD S+K+D VGA    N+Q  Q  A
Subjt:  TINGDDLLWAMATLGFEDYIDPLKSYLTRYRECDAKGSSRGGDESAKRDAVGALPGQNSQYMQPGA

SwissProt top hitse value%identityAlignment
P25209 Nuclear transcription factor Y subunit B2.5e-5173.15Show/hide
Query:  MAEAPTSP--AGGSHESGGEQSPNTGG-VREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMA
        MAEAP SP   GGSHESG  +    GG VREQDR+LPIANISRIMKKA+PANGKIAKDAK+TVQECVSEFISFITSEASDKCQ+EKRKTINGDDLLWAMA
Subjt:  MAEAPTSP--AGGSHESGGEQSPNTGG-VREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMA

Query:  TLGFEDYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNS
        TLGFEDYI+PLK YL +YRE   D+K +++  D S K+DA+G +   +S
Subjt:  TLGFEDYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNS

Q67XJ2 Nuclear transcription factor Y subunit B-106.2e-5570.73Show/hide
Query:  MAEAPT-SPAGGSHESGGEQSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMATL
        MAE+ T    GGSHESGG+QSP +  VREQDR+LPIANISRIMK+ LP NGKIAKDAK+T+QECVSEFISF+TSEASDKCQ+EKRKTINGDDLLWAMATL
Subjt:  MAEAPT-SPAGGSHESGGEQSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMATL

Query:  GFEDYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQYMQPGALT---YINTQ
        GFEDYIDPLK YL RYRE   D KGS +GG+ SAKRD   +   Q SQ  Q G+ +   Y N+Q
Subjt:  GFEDYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQYMQPGALT---YINTQ

Q8VYK4 Nuclear transcription factor Y subunit B-81.5e-5370.99Show/hide
Query:  MAEAPT-SPAG-GSHESGGEQSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMAT
        MAE+   SP G GSHESGG+QSP +  VREQDR+LPIANISRIMK+ LPANGKIAKDAK+ VQECVSEFISF+TSEASDKCQ+EKRKTINGDDLLWAMAT
Subjt:  MAEAPT-SPAG-GSHESGGEQSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMAT

Query:  LGFEDYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQYMQPGALTYINTQ
        LGFEDY++PLK YL RYRE   D KGS++GGD +AK+D   +  GQ SQ    G   Y N+Q
Subjt:  LGFEDYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQYMQPGALTYINTQ

Q9C565 Protein RETARDED ROOT GROWTH, mitochondrial1.1e-6259.33Show/hide
Query:  EDSRPSIPVRAYFFSTSVDLRSLVDQNKLNFIPPSSRMTNYVVLKFGDL--CNVNTLGGSISGSDCCYMVVFQYGSIVLFNVREHEVDGYLKIVEKHASG
        E++   IP++AYF STS+DL+++  +N  N +PP+SR TNY+ LKF D     + +L    S S+C +MVVFQYGS +LFNV +++VD YL IV +HASG
Subjt:  EDSRPSIPVRAYFFSTSVDLRSLVDQNKLNFIPPSSRMTNYVVLKFGDL--CNVNTLGGSISGSDCCYMVVFQYGSIVLFNVREHEVDGYLKIVEKHASG

Query:  LLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGMRTIGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADV
        LL EMRKD+Y V+EKP L   M+GG DYI+L+ L+ + +R IGSVLGQSIALDY   QV+ +V EF DINR M  TG F M RKKLFQLVGKANSN+ADV
Subjt:  LLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGMRTIGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADV

Query:  ILKLGLFER
        ILK+GLFER
Subjt:  ILKLGLFER

Q9FNB2 Protein RETARDED ROOT GROWTH-LIKE7.3e-8876.13Show/hide
Query:  VGEGISDGEADEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNKLNFIPPSSRMTNYVVLKFGDLCN-VNTLGGSISGSDCCYMVVFQYGSIVLFNVREHEV
        V E IS G    +E++++ SIPVRAYFFSTSVDLRSL++QNK NFIPP+SRMTNYVVLKFG+  +  +T  G ISGS+  YMVVF YGSIVLFNVREHEV
Subjt:  VGEGISDGEADEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNKLNFIPPSSRMTNYVVLKFGDLCN-VNTLGGSISGSDCCYMVVFQYGSIVLFNVREHEV

Query:  DGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGMRTIGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLF
        D YLK+VE+HASGLLPEMRKDEYEVRE P L+TWME G D+I LQ+LN DG+RTIG VLGQSIALDYYGRQVDGMVAEFT+INR++E TG F MKRKKLF
Subjt:  DGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGMRTIGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLF

Query:  QLVGKANSNLADVILKLGLFER
        QLVGKAN  LADVILKLGLFER
Subjt:  QLVGKANSNLADVILKLGLFER

Arabidopsis top hitse value%identityAlignment
AT1G69380.1 Protein of unknown function (DUF155)7.5e-6459.33Show/hide
Query:  EDSRPSIPVRAYFFSTSVDLRSLVDQNKLNFIPPSSRMTNYVVLKFGDL--CNVNTLGGSISGSDCCYMVVFQYGSIVLFNVREHEVDGYLKIVEKHASG
        E++   IP++AYF STS+DL+++  +N  N +PP+SR TNY+ LKF D     + +L    S S+C +MVVFQYGS +LFNV +++VD YL IV +HASG
Subjt:  EDSRPSIPVRAYFFSTSVDLRSLVDQNKLNFIPPSSRMTNYVVLKFGDL--CNVNTLGGSISGSDCCYMVVFQYGSIVLFNVREHEVDGYLKIVEKHASG

Query:  LLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGMRTIGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADV
        LL EMRKD+Y V+EKP L   M+GG DYI+L+ L+ + +R IGSVLGQSIALDY   QV+ +V EF DINR M  TG F M RKKLFQLVGKANSN+ADV
Subjt:  LLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGMRTIGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADV

Query:  ILKLGLFER
        ILK+GLFER
Subjt:  ILKLGLFER

AT2G37060.1 nuclear factor Y, subunit B81.1e-5470.99Show/hide
Query:  MAEAPT-SPAG-GSHESGGEQSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMAT
        MAE+   SP G GSHESGG+QSP +  VREQDR+LPIANISRIMK+ LPANGKIAKDAK+ VQECVSEFISF+TSEASDKCQ+EKRKTINGDDLLWAMAT
Subjt:  MAEAPT-SPAG-GSHESGGEQSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMAT

Query:  LGFEDYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQYMQPGALTYINTQ
        LGFEDY++PLK YL RYRE   D KGS++GGD +AK+D   +  GQ SQ    G   Y N+Q
Subjt:  LGFEDYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQYMQPGALTYINTQ

AT3G53340.1 nuclear factor Y, subunit B104.4e-5670.73Show/hide
Query:  MAEAPT-SPAGGSHESGGEQSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMATL
        MAE+ T    GGSHESGG+QSP +  VREQDR+LPIANISRIMK+ LP NGKIAKDAK+T+QECVSEFISF+TSEASDKCQ+EKRKTINGDDLLWAMATL
Subjt:  MAEAPT-SPAGGSHESGGEQSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMATL

Query:  GFEDYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQYMQPGALT---YINTQ
        GFEDYIDPLK YL RYRE   D KGS +GG+ SAKRD   +   Q SQ  Q G+ +   Y N+Q
Subjt:  GFEDYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQYMQPGALT---YINTQ

AT3G54970.1 D-aminoacid aminotransferase-like PLP-dependent enzymes superfamily protein3.1e-5742.94Show/hide
Query:  LSSNGVVLQGSEAPPLTTFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPELLSESNKT------------------------------
        L  NGVVL   EAPP+TTFLE+H GAYTTTR+ NN +S LFW+RHMKRL+ S++IL  S PELL  S  +                              
Subjt:  LSSNGVVLQGSEAPPLTTFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPELLSESNKT------------------------------

Query:  TTKLERNEGEELIITALVSVSLEKLSESDDVVDAENVKEALDVHVYVGDYVP-REFGVPENGANLAVVGRGRDIAAAKYSDWVRRRKSLEKLRPPSVTEL
          + ER  GEEL +T LV+ ++EKL+     +D  N  + LDV +++G Y P    GV EN A+LA+VGRGRD+AAAKYSDWVR RK LEK RPP  TEL
Subjt:  TTKLERNEGEELIITALVSVSLEKLSESDDVVDAENVKEALDVHVYVGDYVP-REFGVPENGANLAVVGRGRDIAAAKYSDWVRRRKSLEKLRPPSVTEL

Query:  LLSNDGDQILEGCLTNFFVVCRKVSEHKIKADAF-GSLKSFD--------------------------------------------------SLRILEHV
        LLSNDGD +LEGC+TNFFVVCR+V   K   + + GSL  F+                                                  SLRIL+HV
Subjt:  LLSNDGDQILEGCLTNFFVVCRKVSEHKIKADAF-GSLKSFD--------------------------------------------------SLRILEHV

Query:  KTICIP-GIWDLLDSKTWSDISWNKKSFKDAPGMITGTIQ
         TI +P G  + L      +I W +K FK+ PGMIT  I+
Subjt:  KTICIP-GIWDLLDSKTWSDISWNKKSFKDAPGMITGTIQ

AT5G13610.1 Protein of unknown function (DUF155)5.2e-8976.13Show/hide
Query:  VGEGISDGEADEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNKLNFIPPSSRMTNYVVLKFGDLCN-VNTLGGSISGSDCCYMVVFQYGSIVLFNVREHEV
        V E IS G    +E++++ SIPVRAYFFSTSVDLRSL++QNK NFIPP+SRMTNYVVLKFG+  +  +T  G ISGS+  YMVVF YGSIVLFNVREHEV
Subjt:  VGEGISDGEADEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNKLNFIPPSSRMTNYVVLKFGDLCN-VNTLGGSISGSDCCYMVVFQYGSIVLFNVREHEV

Query:  DGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGMRTIGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLF
        D YLK+VE+HASGLLPEMRKDEYEVRE P L+TWME G D+I LQ+LN DG+RTIG VLGQSIALDYYGRQVDGMVAEFT+INR++E TG F MKRKKLF
Subjt:  DGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGMRTIGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLF

Query:  QLVGKANSNLADVILKLGLFER
        QLVGKAN  LADVILKLGLFER
Subjt:  QLVGKANSNLADVILKLGLFER


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGCGCACCATTGACGCCCATCTGAGGTCCGTACGCCTCCTTCCTAATCTCTCCGCTCGTTCTTCTTCTTCTTCTTCTTCATCTTCATCCTCACTTCTGTTCACGTC
CGGCCGTTCGTTGCTTGCTCGCTCGAGTTCGAGCCTCCTCTCCCGCCACCGCAGTCTCACTCGTGTTTTGTGTTTTGGCCTTGGAATTCGGCGCTTCGGAGGATCGACTT
GTGGTTTGGTTGTGTTAGCGAGGTGCATTGCCTCTTCGGTGCACACGTTGGAGTGGAATGAACCGGTGTCGTGTTCGGAGGTCGGTGATGGAGGTTTTCGAAGTGTTGGG
GAAGGAATTAGCGACGGTGAAGCGGATGAAGTCGAGGAAGATTCTAGGCCGTCTATTCCTGTCAGAGCTTATTTCTTCTCGACTAGTGTGGATTTGAGAAGCTTGGTGGA
TCAGAATAAACTCAACTTTATCCCGCCATCATCTCGTATGACGAATTATGTAGTCCTTAAGTTCGGTGATCTTTGTAATGTGAATACTCTTGGCGGCAGCATAAGTGGAA
GTGACTGCTGTTACATGGTAGTTTTTCAGTATGGCTCCATTGTGCTATTTAATGTTCGTGAACATGAGGTTGATGGGTATCTGAAAATTGTAGAGAAACATGCATCTGGA
TTGCTGCCTGAAATGAGAAAGGATGAGTATGAGGTGAGAGAGAAGCCTGCTTTAAACACATGGATGGAAGGGGGATTGGACTACATAATGCTGCAGTACTTGAATATTGA
TGGCATGCGTACCATAGGTAGTGTTCTTGGTCAAAGCATTGCTCTTGATTACTATGGGCGACAGGTTGATGGGATGGTTGCTGAATTTACTGACATAAACCGTGAAATGG
AAGCAACTGGGAAGTTTAAAATGAAGAGGAAGAAACTATTCCAGTTGGTGGGAAAGGCAAATTCTAATCTTGCTGATGTCATTCTTAAGCTTGGACTTTTTGAGAGAGTG
ATCGCAGCAGTTGAAGAAAATGGCGAGCTTCCGATCCTGTCCAGCAATGGAGTCGTCTTGCAAGGCTCCGAAGCTCCTCCGCTCACCACCTTCCTCGAAACTCATCCTGG
CGCTTATACCACTACTCGCTCCCATAACAATGCGTCGAGCATTCTGTTTTGGGACAGGCACATGAAACGGCTGACTCAATCAGTAAAAATTCTGTCGAATTCGACTCCGG
AACTCTTGTCTGAATCGAACAAAACGACCACTAAACTGGAGAGGAATGAGGGAGAAGAATTGATAATTACAGCGCTAGTTAGTGTGAGTTTGGAAAAATTGAGTGAAAGT
GATGACGTAGTGGATGCAGAAAATGTTAAAGAGGCTCTTGATGTGCACGTGTATGTTGGTGATTATGTCCCTCGTGAATTTGGTGTCCCGGAAAATGGTGCAAATCTGGC
CGTGGTCGGCCGAGGGAGGGATATCGCTGCAGCAAAGTACTCAGATTGGGTTAGGCGTAGGAAGTCACTGGAAAAATTGAGGCCTCCTTCTGTGACTGAGCTCTTGTTGT
CAAACGATGGTGATCAAATACTTGAAGGCTGCCTGACAAACTTTTTTGTTGTTTGCCGCAAGGTGAGTGAGCATAAAATTAAAGCTGATGCGTTTGGTTCTTTGAAATCC
TTCGATAGCTTGAGAATCTTGGAGCACGTGAAAACTATTTGCATCCCTGGGATATGGGATTTGCTCGACTCGAAGACATGGAGCGATATATCGTGGAATAAGAAATCGTT
TAAGGATGCTCCTGGAATGATCACCGGCACAATCCAGGATCTCGCTGATATGGCGGAGGCTCCGACGAGTCCAGCCGGTGGAAGCCATGAGAGCGGAGGCGAGCAGAGCC
CCAATACCGGTGGCGTTCGGGAGCAGGACCGCTACCTCCCGATCGCTAACATTAGTCGGATCATGAAGAAGGCCTTGCCGGCTAATGGCAAGATCGCCAAGGATGCTAAG
GACACCGTCCAGGAATGCGTCTCTGAATTTATCAGCTTCATCACTAGCGAGGCGAGCGATAAGTGCCAGAAGGAGAAGAGAAAGACCATTAATGGTGATGATTTGCTGTG
GGCAATGGCGACGTTGGGTTTCGAGGACTATATTGATCCACTTAAGTCGTACCTAACTAGGTACAGAGAGTGTGATGCTAAAGGATCTTCTAGGGGTGGTGATGAATCTG
CTAAAAGAGATGCGGTTGGCGCCTTGCCTGGCCAAAATTCCCAGTACATGCAGCCGGGAGCATTGACCTATATTAACACCCAA
mRNA sequenceShow/hide mRNA sequence
ATGTGGCGCACCATTGACGCCCATCTGAGGTCCGTACGCCTCCTTCCTAATCTCTCCGCTCGTTCTTCTTCTTCTTCTTCTTCATCTTCATCCTCACTTCTGTTCACGTC
CGGCCGTTCGTTGCTTGCTCGCTCGAGTTCGAGCCTCCTCTCCCGCCACCGCAGTCTCACTCGTGTTTTGTGTTTTGGCCTTGGAATTCGGCGCTTCGGAGGATCGACTT
GTGGTTTGGTTGTGTTAGCGAGGTGCATTGCCTCTTCGGTGCACACGTTGGAGTGGAATGAACCGGTGTCGTGTTCGGAGGTCGGTGATGGAGGTTTTCGAAGTGTTGGG
GAAGGAATTAGCGACGGTGAAGCGGATGAAGTCGAGGAAGATTCTAGGCCGTCTATTCCTGTCAGAGCTTATTTCTTCTCGACTAGTGTGGATTTGAGAAGCTTGGTGGA
TCAGAATAAACTCAACTTTATCCCGCCATCATCTCGTATGACGAATTATGTAGTCCTTAAGTTCGGTGATCTTTGTAATGTGAATACTCTTGGCGGCAGCATAAGTGGAA
GTGACTGCTGTTACATGGTAGTTTTTCAGTATGGCTCCATTGTGCTATTTAATGTTCGTGAACATGAGGTTGATGGGTATCTGAAAATTGTAGAGAAACATGCATCTGGA
TTGCTGCCTGAAATGAGAAAGGATGAGTATGAGGTGAGAGAGAAGCCTGCTTTAAACACATGGATGGAAGGGGGATTGGACTACATAATGCTGCAGTACTTGAATATTGA
TGGCATGCGTACCATAGGTAGTGTTCTTGGTCAAAGCATTGCTCTTGATTACTATGGGCGACAGGTTGATGGGATGGTTGCTGAATTTACTGACATAAACCGTGAAATGG
AAGCAACTGGGAAGTTTAAAATGAAGAGGAAGAAACTATTCCAGTTGGTGGGAAAGGCAAATTCTAATCTTGCTGATGTCATTCTTAAGCTTGGACTTTTTGAGAGAGTG
ATCGCAGCAGTTGAAGAAAATGGCGAGCTTCCGATCCTGTCCAGCAATGGAGTCGTCTTGCAAGGCTCCGAAGCTCCTCCGCTCACCACCTTCCTCGAAACTCATCCTGG
CGCTTATACCACTACTCGCTCCCATAACAATGCGTCGAGCATTCTGTTTTGGGACAGGCACATGAAACGGCTGACTCAATCAGTAAAAATTCTGTCGAATTCGACTCCGG
AACTCTTGTCTGAATCGAACAAAACGACCACTAAACTGGAGAGGAATGAGGGAGAAGAATTGATAATTACAGCGCTAGTTAGTGTGAGTTTGGAAAAATTGAGTGAAAGT
GATGACGTAGTGGATGCAGAAAATGTTAAAGAGGCTCTTGATGTGCACGTGTATGTTGGTGATTATGTCCCTCGTGAATTTGGTGTCCCGGAAAATGGTGCAAATCTGGC
CGTGGTCGGCCGAGGGAGGGATATCGCTGCAGCAAAGTACTCAGATTGGGTTAGGCGTAGGAAGTCACTGGAAAAATTGAGGCCTCCTTCTGTGACTGAGCTCTTGTTGT
CAAACGATGGTGATCAAATACTTGAAGGCTGCCTGACAAACTTTTTTGTTGTTTGCCGCAAGGTGAGTGAGCATAAAATTAAAGCTGATGCGTTTGGTTCTTTGAAATCC
TTCGATAGCTTGAGAATCTTGGAGCACGTGAAAACTATTTGCATCCCTGGGATATGGGATTTGCTCGACTCGAAGACATGGAGCGATATATCGTGGAATAAGAAATCGTT
TAAGGATGCTCCTGGAATGATCACCGGCACAATCCAGGATCTCGCTGATATGGCGGAGGCTCCGACGAGTCCAGCCGGTGGAAGCCATGAGAGCGGAGGCGAGCAGAGCC
CCAATACCGGTGGCGTTCGGGAGCAGGACCGCTACCTCCCGATCGCTAACATTAGTCGGATCATGAAGAAGGCCTTGCCGGCTAATGGCAAGATCGCCAAGGATGCTAAG
GACACCGTCCAGGAATGCGTCTCTGAATTTATCAGCTTCATCACTAGCGAGGCGAGCGATAAGTGCCAGAAGGAGAAGAGAAAGACCATTAATGGTGATGATTTGCTGTG
GGCAATGGCGACGTTGGGTTTCGAGGACTATATTGATCCACTTAAGTCGTACCTAACTAGGTACAGAGAGTGTGATGCTAAAGGATCTTCTAGGGGTGGTGATGAATCTG
CTAAAAGAGATGCGGTTGGCGCCTTGCCTGGCCAAAATTCCCAGTACATGCAGCCGGGAGCATTGACCTATATTAACACCCAA
Protein sequenceShow/hide protein sequence
MWRTIDAHLRSVRLLPNLSARSSSSSSSSSSSLLFTSGRSLLARSSSSLLSRHRSLTRVLCFGLGIRRFGGSTCGLVVLARCIASSVHTLEWNEPVSCSEVGDGGFRSVG
EGISDGEADEVEEDSRPSIPVRAYFFSTSVDLRSLVDQNKLNFIPPSSRMTNYVVLKFGDLCNVNTLGGSISGSDCCYMVVFQYGSIVLFNVREHEVDGYLKIVEKHASG
LLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGMRTIGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERV
IAAVEENGELPILSSNGVVLQGSEAPPLTTFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPELLSESNKTTTKLERNEGEELIITALVSVSLEKLSES
DDVVDAENVKEALDVHVYVGDYVPREFGVPENGANLAVVGRGRDIAAAKYSDWVRRRKSLEKLRPPSVTELLLSNDGDQILEGCLTNFFVVCRKVSEHKIKADAFGSLKS
FDSLRILEHVKTICIPGIWDLLDSKTWSDISWNKKSFKDAPGMITGTIQDLADMAEAPTSPAGGSHESGGEQSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAK
DTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMATLGFEDYIDPLKSYLTRYRECDAKGSSRGGDESAKRDAVGALPGQNSQYMQPGALTYINTQ