; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g34600 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g34600
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionNuclear transcription factor Y subunit B-8
Genome locationchr9:26469067..26479564
RNA-Seq ExpressionMoc09g34600
SyntenyMoc09g34600
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016740 - transferase activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domainsIPR001544 - Aminotransferase class IV
IPR003956 - Transcription factor, NFYB/HAP3, conserved site
IPR003958 - Transcription factor CBF/NF-Y/archaeal histone domain
IPR009072 - Histone-fold
IPR036038 - Aminotransferase-like, PLP-dependent enzymes
IPR043132 - Branched-chain-amino-acid aminotransferase-like, C-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXC19490.1 Nuclear transcription factor Y subunit B-8 [Morus notabilis]2.6e-16959.15Show/hide
Query:  MASFRFLFSNGVVLQGSEVPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPRLLSESNKTTANPVLQSSIDSIPWEPAIRTLVD
        M++ RFLFSNGV    S+ PPV TFLET PGAYTTTRSH N S +LFW+RH+ RL  SV+ILSNS P+LL   N+        S      WE  +R LV+
Subjt:  MASFRFLFSNGVVLQGSEVPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPRLLSESNKTTANPVLQSSIDSIPWEPAIRTLVD

Query:  DSMREVLPVALKERNEGEELTITALVSVNLEKFGESDGVVDVEKVKEVLDVHVHVGNYVPREFGIPENGANLAVVGQGREFAEAKYSDWVRRRKSLEKLR
        DS+ +VLP+A++ER +GEEL+ITALV+ +L K  E  G +  E+   VLDV+VHVG Y P  FGI ENGANLAVVG+ RE AEAKYSDWVR RK LEKLR
Subjt:  DSMREVLPVALKERNEGEELTITALVSVNLEKFGESDGVVDVEKVKEVLDVHVHVGNYVPREFGIPENGANLAVVGQGREFAEAKYSDWVRRRKSLEKLR

Query:  PPSVTELLLSNDGDQILEGCLTNFFVVCRKVDNED--------------KNKRVLDFTSTNSFELQTAPIRDGVLTGVIRQVVKEACLSNGIPFREVAPT
        PPS TELLLSNDGD++LEG ++NFFVVCR+V N+D              + K + +    + FE+QTAP+RDGVL G+IRQ+V + CL+ GIPFREVAP+
Subjt:  PPSVTELLLSNDGDQILEGCLTNFFVVCRKVDNED--------------KNKRVLDFTSTNSFELQTAPIRDGVLTGVIRQVVKEACLSNGIPFREVAPT

Query:  WSSNEIWEEAFVTNSLRILEHVKAMCIPGTWDLLDSKTWSDIPWNKKSF---------------KDTPGMITSAIQVPSFHFLSILFHSRLRVPLIQDLA
        WS +EIWEEAF+TNSLRIL+HV+ + IP +WD L SK+W +I WN   F               K +PG I                   LR P   D  
Subjt:  WSSNEIWEEAFVTNSLRILEHVKAMCIPGTWDLLDSKTWSDIPWNKKSF---------------KDTPGMITSAIQVPSFHFLSILFHSRLRVPLIQDLA

Query:  H------MAEAPTSPAGG-SHESGGEQSPNT------GGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKT
        H       +  P +  GG SHESGGEQSP+       GGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKT
Subjt:  H------MAEAPTSPAGG-SHESGGEQSPNT------GGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKT

Query:  INGDDLLWAMATLGFEDYIDPLKSYLTRYRECDAKGSSRGGDESAKRDAVGALPGQNSQQYMQ
        INGDDLLWAMATLGFEDYI+PLK YL RYRE D KGS+RGG+ SAKRDAV +  GQN Q+ ++
Subjt:  INGDDLLWAMATLGFEDYIDPLKSYLTRYRECDAKGSSRGGDESAKRDAVGALPGQNSQQYMQ

GAV70263.1 CBFD_NFYB_HMF domain-containing protein/Aminotran_4 domain-containing protein, partial [Cephalotus follicularis]4.3e-17262.43Show/hide
Query:  MASFRFLFSNGVVLQGSEVPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPRLLSESNKTTANPVLQSSIDSIPWEPAIRTLVD
        MAS RFLFSNG++   S+ P ++TFL TH GAYTT+R+HNN S +L+W RH++RL  S +IL N  P L+S S            + S  WE  + +LV+
Subjt:  MASFRFLFSNGVVLQGSEVPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPRLLSESNKTTANPVLQSSIDSIPWEPAIRTLVD

Query:  DSMREVLPVALKERNEGEELTITAL-VSVNLEKFGESDGVVDVEKVKEVLDVHVHVGNYVPREFGIPENGANLAVVGQGREFAEAKYSDWVRRRKSLEKL
        +S+ +VL VALKER+ G+EL +TAL V  +LEK    DG    E+  E++DVHVHVGNYVP  FG+  NGA+LA+VG+GR  A AKYSDWVR RK LEKL
Subjt:  DSMREVLPVALKERNEGEELTITAL-VSVNLEKFGESDGVVDVEKVKEVLDVHVHVGNYVPREFGIPENGANLAVVGQGREFAEAKYSDWVRRRKSLEKL

Query:  RPPSVTELLLSNDGDQILEGCLTNFFVVCRKVDNEDKNKRVLDFTSTNSFELQTAPIRDGVLTGVIRQVVKEACLSNGIPFREVAPTWSSNEIWEEAFVT
        RPPSVTELLLSNDGDQILEGC+TNFFVVCRK DN+               E+QTAPI DGVL G+IRQ+V E CLS GIP REVAP+WS +E WEEAFVT
Subjt:  RPPSVTELLLSNDGDQILEGCLTNFFVVCRKVDNEDKNKRVLDFTSTNSFELQTAPIRDGVLTGVIRQVVKEACLSNGIPFREVAPTWSSNEIWEEAFVT

Query:  NSLRILEHVKAMCIPGTWDLLDSKTWSDIPWNKKSFKDTPGMITSAIQVPSFHFLSILFHSRLRVPLIQDLAHMAEAPTSPAGGSHESGGEQSPNT-GGV
        +SLRI++HV A+ +P +   L+SK W++I W ++ F++ PGMIT  IQ                         MA+ P SPAGGSHESGGEQSP   GGV
Subjt:  NSLRILEHVKAMCIPGTWDLLDSKTWSDIPWNKKSFKDTPGMITSAIQVPSFHFLSILFHSRLRVPLIQDLAHMAEAPTSPAGGSHESGGEQSPNT-GGV

Query:  REQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMATLGFEDYIDPLKSYLTRYRECDAKGSSRG
        REQDRYLPIANISRIMKKALP NGKIAKDAKDTVQECVSEFISFITSEASDKC KEKRKTINGDDLLWAMATLGFEDYI+PLK YL RYRE D KGS+RG
Subjt:  REQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMATLGFEDYIDPLKSYLTRYRECDAKGSSRG

Query:  GDESAKRDAVGALPGQNSQ
        GD SAKRD VGALP  N+Q
Subjt:  GDESAKRDAVGALPGQNSQ

KAG6601043.1 hypothetical protein SDJN03_06276, partial [Cucurbita argyrosperma subsp. sororia]5.0e-25377.76Show/hide
Query:  MASFRFLFSNGVVLQGSEVPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPRLLSESNKTTANPVLQSSIDSIPWEPAIRTLVD
        M SFRFLFSNGV+LQGSEVPPVATFLETHPGAYTTTR+HNNASSILFWDRHMKRLTQSVKILSNSTP+LLSESN+T    V+ S  DSIPWEPAIRTLVD
Subjt:  MASFRFLFSNGVVLQGSEVPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPRLLSESNKTTANPVLQSSIDSIPWEPAIRTLVD

Query:  DSMREVLPVALKERNEGEELTITALVSVNLEKFGESDGVVDVEKVKEVLDVHVHVGNYVPREFGIPENGANLAVVGQGREFAEAKYSDWVRRRKSLEKLR
        DSMR+VLP+AL ERN  EEL +T LVSVNLE  GESDGVVDVE+VKE + VH HV NYVPREFG+PENGANLAVVG+GR+ A AKYSDWVRRRKSLEKLR
Subjt:  DSMREVLPVALKERNEGEELTITALVSVNLEKFGESDGVVDVEKVKEVLDVHVHVGNYVPREFGIPENGANLAVVGQGREFAEAKYSDWVRRRKSLEKLR

Query:  PPSVTELLLSNDGDQILEGCLTNFFVVCRKVDNEDKNKRVLDFTSTNSFELQTAPIRDGVLTGVIRQVVKEACLSNGIPFREVAPTWSSNEIWEEAFVTN
        PPSVTELLLSNDGDQILEGCLTNFFVV RK +NE K   V D  ST+SFELQTAPI DGVLTGVIRQ+V EACLS GIPFREVAPTWSSNE+WEEAFVTN
Subjt:  PPSVTELLLSNDGDQILEGCLTNFFVVCRKVDNEDKNKRVLDFTSTNSFELQTAPIRDGVLTGVIRQVVKEACLSNGIPFREVAPTWSSNEIWEEAFVTN

Query:  SLRILEHVKAMCIPGTWDLLDSKTWSDIPWNKKSFKDTPGMITSAIQ-----------VPSFHFLSILFHSRLR-------------------------V
        SLR++EHV  +C+P  WDLL+SKTW +I WNKKSFKD PG+ITS IQ            P  ++  I    + +                          
Subjt:  SLRILEHVKAMCIPGTWDLLDSKTWSDIPWNKKSFKDTPGMITSAIQ-----------VPSFHFLSILFHSRLR-------------------------V

Query:  PLIQDLAHMAEAPTSPAGGSHESGGEQSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDL
         L QDLA MAE PTSP GGSHESGGEQSP TGG REQDR+LPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISF+TSEASDKCQKEKRKTINGDDL
Subjt:  PLIQDLAHMAEAPTSPAGGSHESGGEQSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDL

Query:  LWAMATLGFEDYIDPLKSYLTRYRECDAKGSSRGGDESAKRDAVGALPGQNSQQYMQPGAMTYINTQGQHLIIPSMQNNE
        LWAMATLGFE+YIDPLKSYLTRYRECDAKGSSRGGDESAKRDAVGA+PGQNSQQYMQ GA+TYINTQGQHLIIPSMQNNE
Subjt:  LWAMATLGFEDYIDPLKSYLTRYRECDAKGSSRGGDESAKRDAVGALPGQNSQQYMQPGAMTYINTQGQHLIIPSMQNNE

OMO71599.1 Aminotransferase, class IV [Corchorus capsularis]2.8e-17161.16Show/hide
Query:  MASFRFLFSNGVVLQGSEVPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPRLLSESNKTTANPVLQSSI---DSIPWEPAIRT
        M   RF+FSNGVVL  +E PPVATFLE+ PGAYTTTR+H N +++LFW+RH+KRL  S +IL NS P L+ + NK   NP+  S +    S  W+   R+
Subjt:  MASFRFLFSNGVVLQGSEVPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPRLLSESNKTTANPVLQSSI---DSIPWEPAIRT

Query:  LVDDSMREVLPVALKERNEGEELTITALVSVNLEKFGESDGVVDVEKVKEVLDVHVHVGNYVPREFGIPENGANLAVVGQGREFAEAKYSDWVRRRKSLE
        L+++SM +VLP+AL ER++G+EL +TALVS +LEK  E    VD   V  VLDVH H+G+YVP  FGI ENGA+LA+VG GR+ A AKYSDWVR RK L+
Subjt:  LVDDSMREVLPVALKERNEGEELTITALVSVNLEKFGESDGVVDVEKVKEVLDVHVHVGNYVPREFGIPENGANLAVVGQGREFAEAKYSDWVRRRKSLE

Query:  KLRPPSVTELLLSNDGDQILEGCLTNFFVVCRKVDNEDKNKRVLDFTSTNSFELQTAPIRDGVLTGVIRQVVKEACLSNGIPFREVAPTWSSNEIWEEAF
        K RPP VTELLLSNDGD+ILEGC+TNFFV+C++  +E +   + D+ +    E+QTAPI DGVL GVIRQ+V E CLS GIP REVAP+W  +E+WEEAF
Subjt:  KLRPPSVTELLLSNDGDQILEGCLTNFFVVCRKVDNEDKNKRVLDFTSTNSFELQTAPIRDGVLTGVIRQVVKEACLSNGIPFREVAPTWSSNEIWEEAF

Query:  VTNSLRILEHVKAMCIPGTWDLLDSKTWSDIPWNKKSFKDTPGMITSAIQVPSFHFLS-------ILFHSRLRVPLIQDLAHMAEAPTSPAGGSHESGGE
        +TNSLR+L+HV+ + +P +W+ L SK    I W +K F + PG IT  IQV S    S       +LF        + D+A  A  P SPAGGSHESGGE
Subjt:  VTNSLRILEHVKAMCIPGTWDLLDSKTWSDIPWNKKSFKDTPGMITSAIQVPSFHFLS-------ILFHSRLRVPLIQDLAHMAEAPTSPAGGSHESGGE

Query:  QSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMATLGFEDYIDPLKSYLTRYREC
        QS     VREQDRYLPIANISRIMKKALP NGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAM+TLGFE+YI+PLK YL RYRE 
Subjt:  QSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMATLGFEDYIDPLKSYLTRYREC

Query:  DAKGSSRGGDESAKRDAVGALPGQNSQQYMQPG
        D KGS+RGGD S +RDA G +  QN+Q  +QPG
Subjt:  DAKGSSRGGDESAKRDAVGALPGQNSQQYMQPG

XP_022138906.1 uncharacterized protein LOC111009973 isoform X1 [Momordica charantia]3.9e-197100Show/hide
Query:  MASFRFLFSNGVVLQGSEVPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPRLLSESNKTTANPVLQSSIDSIPWEPAIRTLVD
        MASFRFLFSNGVVLQGSEVPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPRLLSESNKTTANPVLQSSIDSIPWEPAIRTLVD
Subjt:  MASFRFLFSNGVVLQGSEVPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPRLLSESNKTTANPVLQSSIDSIPWEPAIRTLVD

Query:  DSMREVLPVALKERNEGEELTITALVSVNLEKFGESDGVVDVEKVKEVLDVHVHVGNYVPREFGIPENGANLAVVGQGREFAEAKYSDWVRRRKSLEKLR
        DSMREVLPVALKERNEGEELTITALVSVNLEKFGESDGVVDVEKVKEVLDVHVHVGNYVPREFGIPENGANLAVVGQGREFAEAKYSDWVRRRKSLEKLR
Subjt:  DSMREVLPVALKERNEGEELTITALVSVNLEKFGESDGVVDVEKVKEVLDVHVHVGNYVPREFGIPENGANLAVVGQGREFAEAKYSDWVRRRKSLEKLR

Query:  PPSVTELLLSNDGDQILEGCLTNFFVVCRKVDNEDKNKRVLDFTSTNSFELQTAPIRDGVLTGVIRQVVKEACLSNGIPFREVAPTWSSNEIWEEAFVTN
        PPSVTELLLSNDGDQILEGCLTNFFVVCRKVDNEDKNKRVLDFTSTNSFELQTAPIRDGVLTGVIRQVVKEACLSNGIPFREVAPTWSSNEIWEEAFVTN
Subjt:  PPSVTELLLSNDGDQILEGCLTNFFVVCRKVDNEDKNKRVLDFTSTNSFELQTAPIRDGVLTGVIRQVVKEACLSNGIPFREVAPTWSSNEIWEEAFVTN

Query:  SLRILEHVKAMCIPGTWDLLDSKTWSDIPWNKKSFKDTPGMITSAIQ
        SLRILEHVKAMCIPGTWDLLDSKTWSDIPWNKKSFKDTPGMITSAIQ
Subjt:  SLRILEHVKAMCIPGTWDLLDSKTWSDIPWNKKSFKDTPGMITSAIQ

TrEMBL top hitse value%identityAlignment
A0A1Q3BRC3 CBFD_NFYB_HMF domain-containing protein/Aminotran_4 domain-containing protein (Fragment)2.1e-17262.43Show/hide
Query:  MASFRFLFSNGVVLQGSEVPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPRLLSESNKTTANPVLQSSIDSIPWEPAIRTLVD
        MAS RFLFSNG++   S+ P ++TFL TH GAYTT+R+HNN S +L+W RH++RL  S +IL N  P L+S S            + S  WE  + +LV+
Subjt:  MASFRFLFSNGVVLQGSEVPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPRLLSESNKTTANPVLQSSIDSIPWEPAIRTLVD

Query:  DSMREVLPVALKERNEGEELTITAL-VSVNLEKFGESDGVVDVEKVKEVLDVHVHVGNYVPREFGIPENGANLAVVGQGREFAEAKYSDWVRRRKSLEKL
        +S+ +VL VALKER+ G+EL +TAL V  +LEK    DG    E+  E++DVHVHVGNYVP  FG+  NGA+LA+VG+GR  A AKYSDWVR RK LEKL
Subjt:  DSMREVLPVALKERNEGEELTITAL-VSVNLEKFGESDGVVDVEKVKEVLDVHVHVGNYVPREFGIPENGANLAVVGQGREFAEAKYSDWVRRRKSLEKL

Query:  RPPSVTELLLSNDGDQILEGCLTNFFVVCRKVDNEDKNKRVLDFTSTNSFELQTAPIRDGVLTGVIRQVVKEACLSNGIPFREVAPTWSSNEIWEEAFVT
        RPPSVTELLLSNDGDQILEGC+TNFFVVCRK DN+               E+QTAPI DGVL G+IRQ+V E CLS GIP REVAP+WS +E WEEAFVT
Subjt:  RPPSVTELLLSNDGDQILEGCLTNFFVVCRKVDNEDKNKRVLDFTSTNSFELQTAPIRDGVLTGVIRQVVKEACLSNGIPFREVAPTWSSNEIWEEAFVT

Query:  NSLRILEHVKAMCIPGTWDLLDSKTWSDIPWNKKSFKDTPGMITSAIQVPSFHFLSILFHSRLRVPLIQDLAHMAEAPTSPAGGSHESGGEQSPNT-GGV
        +SLRI++HV A+ +P +   L+SK W++I W ++ F++ PGMIT  IQ                         MA+ P SPAGGSHESGGEQSP   GGV
Subjt:  NSLRILEHVKAMCIPGTWDLLDSKTWSDIPWNKKSFKDTPGMITSAIQVPSFHFLSILFHSRLRVPLIQDLAHMAEAPTSPAGGSHESGGEQSPNT-GGV

Query:  REQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMATLGFEDYIDPLKSYLTRYRECDAKGSSRG
        REQDRYLPIANISRIMKKALP NGKIAKDAKDTVQECVSEFISFITSEASDKC KEKRKTINGDDLLWAMATLGFEDYI+PLK YL RYRE D KGS+RG
Subjt:  REQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMATLGFEDYIDPLKSYLTRYRECDAKGSSRG

Query:  GDESAKRDAVGALPGQNSQ
        GD SAKRD VGALP  N+Q
Subjt:  GDESAKRDAVGALPGQNSQ

A0A1R3HMK9 Aminotransferase, class IV1.3e-17161.16Show/hide
Query:  MASFRFLFSNGVVLQGSEVPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPRLLSESNKTTANPVLQSSI---DSIPWEPAIRT
        M   RF+FSNGVVL  +E PPVATFLE+ PGAYTTTR+H N +++LFW+RH+KRL  S +IL NS P L+ + NK   NP+  S +    S  W+   R+
Subjt:  MASFRFLFSNGVVLQGSEVPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPRLLSESNKTTANPVLQSSI---DSIPWEPAIRT

Query:  LVDDSMREVLPVALKERNEGEELTITALVSVNLEKFGESDGVVDVEKVKEVLDVHVHVGNYVPREFGIPENGANLAVVGQGREFAEAKYSDWVRRRKSLE
        L+++SM +VLP+AL ER++G+EL +TALVS +LEK  E    VD   V  VLDVH H+G+YVP  FGI ENGA+LA+VG GR+ A AKYSDWVR RK L+
Subjt:  LVDDSMREVLPVALKERNEGEELTITALVSVNLEKFGESDGVVDVEKVKEVLDVHVHVGNYVPREFGIPENGANLAVVGQGREFAEAKYSDWVRRRKSLE

Query:  KLRPPSVTELLLSNDGDQILEGCLTNFFVVCRKVDNEDKNKRVLDFTSTNSFELQTAPIRDGVLTGVIRQVVKEACLSNGIPFREVAPTWSSNEIWEEAF
        K RPP VTELLLSNDGD+ILEGC+TNFFV+C++  +E +   + D+ +    E+QTAPI DGVL GVIRQ+V E CLS GIP REVAP+W  +E+WEEAF
Subjt:  KLRPPSVTELLLSNDGDQILEGCLTNFFVVCRKVDNEDKNKRVLDFTSTNSFELQTAPIRDGVLTGVIRQVVKEACLSNGIPFREVAPTWSSNEIWEEAF

Query:  VTNSLRILEHVKAMCIPGTWDLLDSKTWSDIPWNKKSFKDTPGMITSAIQVPSFHFLS-------ILFHSRLRVPLIQDLAHMAEAPTSPAGGSHESGGE
        +TNSLR+L+HV+ + +P +W+ L SK    I W +K F + PG IT  IQV S    S       +LF        + D+A  A  P SPAGGSHESGGE
Subjt:  VTNSLRILEHVKAMCIPGTWDLLDSKTWSDIPWNKKSFKDTPGMITSAIQVPSFHFLS-------ILFHSRLRVPLIQDLAHMAEAPTSPAGGSHESGGE

Query:  QSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMATLGFEDYIDPLKSYLTRYREC
        QS     VREQDRYLPIANISRIMKKALP NGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAM+TLGFE+YI+PLK YL RYRE 
Subjt:  QSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMATLGFEDYIDPLKSYLTRYREC

Query:  DAKGSSRGGDESAKRDAVGALPGQNSQQYMQPG
        D KGS+RGGD S +RDA G +  QN+Q  +QPG
Subjt:  DAKGSSRGGDESAKRDAVGALPGQNSQQYMQPG

A0A6J1CB25 uncharacterized protein LOC111009973 isoform X11.9e-197100Show/hide
Query:  MASFRFLFSNGVVLQGSEVPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPRLLSESNKTTANPVLQSSIDSIPWEPAIRTLVD
        MASFRFLFSNGVVLQGSEVPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPRLLSESNKTTANPVLQSSIDSIPWEPAIRTLVD
Subjt:  MASFRFLFSNGVVLQGSEVPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPRLLSESNKTTANPVLQSSIDSIPWEPAIRTLVD

Query:  DSMREVLPVALKERNEGEELTITALVSVNLEKFGESDGVVDVEKVKEVLDVHVHVGNYVPREFGIPENGANLAVVGQGREFAEAKYSDWVRRRKSLEKLR
        DSMREVLPVALKERNEGEELTITALVSVNLEKFGESDGVVDVEKVKEVLDVHVHVGNYVPREFGIPENGANLAVVGQGREFAEAKYSDWVRRRKSLEKLR
Subjt:  DSMREVLPVALKERNEGEELTITALVSVNLEKFGESDGVVDVEKVKEVLDVHVHVGNYVPREFGIPENGANLAVVGQGREFAEAKYSDWVRRRKSLEKLR

Query:  PPSVTELLLSNDGDQILEGCLTNFFVVCRKVDNEDKNKRVLDFTSTNSFELQTAPIRDGVLTGVIRQVVKEACLSNGIPFREVAPTWSSNEIWEEAFVTN
        PPSVTELLLSNDGDQILEGCLTNFFVVCRKVDNEDKNKRVLDFTSTNSFELQTAPIRDGVLTGVIRQVVKEACLSNGIPFREVAPTWSSNEIWEEAFVTN
Subjt:  PPSVTELLLSNDGDQILEGCLTNFFVVCRKVDNEDKNKRVLDFTSTNSFELQTAPIRDGVLTGVIRQVVKEACLSNGIPFREVAPTWSSNEIWEEAFVTN

Query:  SLRILEHVKAMCIPGTWDLLDSKTWSDIPWNKKSFKDTPGMITSAIQ
        SLRILEHVKAMCIPGTWDLLDSKTWSDIPWNKKSFKDTPGMITSAIQ
Subjt:  SLRILEHVKAMCIPGTWDLLDSKTWSDIPWNKKSFKDTPGMITSAIQ

A0A6J1CCI7 uncharacterized protein LOC111009973 isoform X25.9e-16799.34Show/hide
Query:  MASFRFLFSNGVVLQGSEVPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPRLLSESNKTTANPVLQSSIDSIPWEPAIRTLVD
        MASFRFLFSNGVVLQGSEVPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPRLLSESNKTTANPVLQSSIDSIPWEPAIRTLVD
Subjt:  MASFRFLFSNGVVLQGSEVPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPRLLSESNKTTANPVLQSSIDSIPWEPAIRTLVD

Query:  DSMREVLPVALKERNEGEELTITALVSVNLEKFGESDGVVDVEKVKEVLDVHVHVGNYVPREFGIPENGANLAVVGQGREFAEAKYSDWVRRRKSLEKLR
        DSMREVLPVALKERNEGEELTITALVSVNLEKFGESDGVVDVEKVKEVLDVHVHVGNYVPREFGIPENGANLAVVGQGREFAEAKYSDWVRRRKSLEKLR
Subjt:  DSMREVLPVALKERNEGEELTITALVSVNLEKFGESDGVVDVEKVKEVLDVHVHVGNYVPREFGIPENGANLAVVGQGREFAEAKYSDWVRRRKSLEKLR

Query:  PPSVTELLLSNDGDQILEGCLTNFFVVCRKVDNEDKNKRVLDFTSTNSFELQTAPIRDGVLTGVIRQVVKEACLSNGIPFREVAPTWSSNEIWEEAFVTN
        PPSVTELLLSNDGDQILEGCLTNFFVVCRKVDNEDKNKRVLDFTSTNSFELQTAPIRDGVLTGVIRQVVKEACLSNGIPFREVAPTWSSNEIWEEAFVT+
Subjt:  PPSVTELLLSNDGDQILEGCLTNFFVVCRKVDNEDKNKRVLDFTSTNSFELQTAPIRDGVLTGVIRQVVKEACLSNGIPFREVAPTWSSNEIWEEAFVTN

Query:  SL
        +L
Subjt:  SL

W9RZQ9 Nuclear transcription factor Y subunit B-81.3e-16959.15Show/hide
Query:  MASFRFLFSNGVVLQGSEVPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPRLLSESNKTTANPVLQSSIDSIPWEPAIRTLVD
        M++ RFLFSNGV    S+ PPV TFLET PGAYTTTRSH N S +LFW+RH+ RL  SV+ILSNS P+LL   N+        S      WE  +R LV+
Subjt:  MASFRFLFSNGVVLQGSEVPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPRLLSESNKTTANPVLQSSIDSIPWEPAIRTLVD

Query:  DSMREVLPVALKERNEGEELTITALVSVNLEKFGESDGVVDVEKVKEVLDVHVHVGNYVPREFGIPENGANLAVVGQGREFAEAKYSDWVRRRKSLEKLR
        DS+ +VLP+A++ER +GEEL+ITALV+ +L K  E  G +  E+   VLDV+VHVG Y P  FGI ENGANLAVVG+ RE AEAKYSDWVR RK LEKLR
Subjt:  DSMREVLPVALKERNEGEELTITALVSVNLEKFGESDGVVDVEKVKEVLDVHVHVGNYVPREFGIPENGANLAVVGQGREFAEAKYSDWVRRRKSLEKLR

Query:  PPSVTELLLSNDGDQILEGCLTNFFVVCRKVDNED--------------KNKRVLDFTSTNSFELQTAPIRDGVLTGVIRQVVKEACLSNGIPFREVAPT
        PPS TELLLSNDGD++LEG ++NFFVVCR+V N+D              + K + +    + FE+QTAP+RDGVL G+IRQ+V + CL+ GIPFREVAP+
Subjt:  PPSVTELLLSNDGDQILEGCLTNFFVVCRKVDNED--------------KNKRVLDFTSTNSFELQTAPIRDGVLTGVIRQVVKEACLSNGIPFREVAPT

Query:  WSSNEIWEEAFVTNSLRILEHVKAMCIPGTWDLLDSKTWSDIPWNKKSF---------------KDTPGMITSAIQVPSFHFLSILFHSRLRVPLIQDLA
        WS +EIWEEAF+TNSLRIL+HV+ + IP +WD L SK+W +I WN   F               K +PG I                   LR P   D  
Subjt:  WSSNEIWEEAFVTNSLRILEHVKAMCIPGTWDLLDSKTWSDIPWNKKSF---------------KDTPGMITSAIQVPSFHFLSILFHSRLRVPLIQDLA

Query:  H------MAEAPTSPAGG-SHESGGEQSPNT------GGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKT
        H       +  P +  GG SHESGGEQSP+       GGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKT
Subjt:  H------MAEAPTSPAGG-SHESGGEQSPNT------GGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKT

Query:  INGDDLLWAMATLGFEDYIDPLKSYLTRYRECDAKGSSRGGDESAKRDAVGALPGQNSQQYMQ
        INGDDLLWAMATLGFEDYI+PLK YL RYRE D KGS+RGG+ SAKRDAV +  GQN Q+ ++
Subjt:  INGDDLLWAMATLGFEDYIDPLKSYLTRYRECDAKGSSRGGDESAKRDAVGALPGQNSQQYMQ

SwissProt top hitse value%identityAlignment
P25209 Nuclear transcription factor Y subunit B2.3e-5164.8Show/hide
Query:  MAEAPTSP--AGGSHESGGEQSPNTGG-VREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMA
        MAEAP SP   GGSHESG  +    GG VREQDR+LPIANISRIMKKA+PANGKIAKDAK+TVQECVSEFISFITSEASDKCQ+EKRKTINGDDLLWAMA
Subjt:  MAEAPTSP--AGGSHESGGEQSPNTGG-VREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMA

Query:  TLGFEDYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNS--QQYMQPGAMTYINTQGQHLIIPSMQNNE
        TLGFEDYI+PLK YL +YRE   D+K +++  D S K+DA+G +   +S  +   Q GA      QG   + P   N +
Subjt:  TLGFEDYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNS--QQYMQPGAMTYINTQGQHLIIPSMQNNE

Q60EQ4 Nuclear transcription factor Y subunit B-35.1e-5161.96Show/hide
Query:  MAEAPTSP--AGGSHES--------GGEQSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGD
        MA+ P SP   GGSHES        GG      GGVREQDR+LPIANISRIMKKA+PANGKIAKDAK+TVQECVSEFISFITSEASDKCQ+EKRKTINGD
Subjt:  MAEAPTSP--AGGSHES--------GGEQSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGD

Query:  DLLWAMATLGFEDYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQQYMQPGAMTYINTQGQHLIIPSMQNNE
        DLLWAMATLGFEDYI+PLK YL +YRE   D+K +++ GD S K+D +G+  G +S          Y   QG   + P   N +
Subjt:  DLLWAMATLGFEDYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQQYMQPGAMTYINTQGQHLIIPSMQNNE

Q67XJ2 Nuclear transcription factor Y subunit B-102.6e-5568.79Show/hide
Query:  MAEAPT-SPAGGSHESGGEQSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMATL
        MAE+ T    GGSHESGG+QSP +  VREQDR+LPIANISRIMK+ LP NGKIAKDAK+T+QECVSEFISF+TSEASDKCQ+EKRKTINGDDLLWAMATL
Subjt:  MAEAPT-SPAGGSHESGGEQSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMATL

Query:  GFEDYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQ--QYMQPGAMT---YINTQGQHLII
        GFEDYIDPLK YL RYRE   D KGS +GG+ SAKRD     P Q SQ  Q  Q G+ +   Y N+QG ++++
Subjt:  GFEDYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQ--QYMQPGAMT---YINTQGQHLII

Q8VYK4 Nuclear transcription factor Y subunit B-81.7e-5469.01Show/hide
Query:  MAEAPT-SPAG-GSHESGGEQSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMAT
        MAE+   SP G GSHESGG+QSP +  VREQDR+LPIANISRIMK+ LPANGKIAKDAK+ VQECVSEFISF+TSEASDKCQ+EKRKTINGDDLLWAMAT
Subjt:  MAEAPT-SPAG-GSHESGGEQSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMAT

Query:  LGFEDYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQQYMQPGAMTYINTQG-QHLIIP
        LGFEDY++PLK YL RYRE   D KGS++GGD +AK+D   +  GQ SQ   Q     Y N+Q  QH+++P
Subjt:  LGFEDYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQQYMQPGAMTYINTQG-QHLIIP

Q9SLG0 Nuclear transcription factor Y subunit B-16.7e-5175Show/hide
Query:  MAEAPTSPAGGSHESGGEQSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMATLG
        MA+ P+SPAG   ESG       G VREQDRYLPIANISRIMKKALP NGKI KDAKDTVQECVSEFISFITSEASDKCQKEKRKT+NGDDLLWAMATLG
Subjt:  MAEAPTSPAGGSHESGGEQSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMATLG

Query:  FEDYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQ
        FEDY++PLK YL RYRE   D KGS + GD S  RDA G + G+
Subjt:  FEDYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQ

Arabidopsis top hitse value%identityAlignment
AT2G37060.1 nuclear factor Y, subunit B81.2e-5569.01Show/hide
Query:  MAEAPT-SPAG-GSHESGGEQSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMAT
        MAE+   SP G GSHESGG+QSP +  VREQDR+LPIANISRIMK+ LPANGKIAKDAK+ VQECVSEFISF+TSEASDKCQ+EKRKTINGDDLLWAMAT
Subjt:  MAEAPT-SPAG-GSHESGGEQSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMAT

Query:  LGFEDYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQQYMQPGAMTYINTQG-QHLIIP
        LGFEDY++PLK YL RYRE   D KGS++GGD +AK+D   +  GQ SQ   Q     Y N+Q  QH+++P
Subjt:  LGFEDYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQQYMQPGAMTYINTQG-QHLIIP

AT2G37060.2 nuclear factor Y, subunit B81.2e-5569.01Show/hide
Query:  MAEAPT-SPAG-GSHESGGEQSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMAT
        MAE+   SP G GSHESGG+QSP +  VREQDR+LPIANISRIMK+ LPANGKIAKDAK+ VQECVSEFISF+TSEASDKCQ+EKRKTINGDDLLWAMAT
Subjt:  MAEAPT-SPAG-GSHESGGEQSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMAT

Query:  LGFEDYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQQYMQPGAMTYINTQG-QHLIIP
        LGFEDY++PLK YL RYRE   D KGS++GGD +AK+D   +  GQ SQ   Q     Y N+Q  QH+++P
Subjt:  LGFEDYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQQYMQPGAMTYINTQG-QHLIIP

AT3G53340.1 nuclear factor Y, subunit B101.9e-5668.79Show/hide
Query:  MAEAPT-SPAGGSHESGGEQSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMATL
        MAE+ T    GGSHESGG+QSP +  VREQDR+LPIANISRIMK+ LP NGKIAKDAK+T+QECVSEFISF+TSEASDKCQ+EKRKTINGDDLLWAMATL
Subjt:  MAEAPT-SPAGGSHESGGEQSPNTGGVREQDRYLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMATL

Query:  GFEDYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQ--QYMQPGAMT---YINTQGQHLII
        GFEDYIDPLK YL RYRE   D KGS +GG+ SAKRD     P Q SQ  Q  Q G+ +   Y N+QG ++++
Subjt:  GFEDYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQ--QYMQPGAMT---YINTQGQHLII

AT3G54970.1 D-aminoacid aminotransferase-like PLP-dependent enzymes superfamily protein2.2e-8952.53Show/hide
Query:  MASFRFLFSNGVVLQGSEVPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPRLLSESNKTTA----NPVLQSSIDSIPWEPAIR
        M++ RFL+ NGVVL   E PPV TFLE+H GAYTTTR+ NN +S LFW+RHMKRL+ S++IL  S P LL  S  +       PV  SS         I 
Subjt:  MASFRFLFSNGVVLQGSEVPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPRLLSESNKTTA----NPVLQSSIDSIPWEPAIR

Query:  TLVDDSMREVLP---VALKERNEGEELTITALVSVNLEKFGESDGVVDVEKVKEVLDVHVHVGNYVP-REFGIPENGANLAVVGQGREFAEAKYSDWVRR
          V+ SM E L    V   ER  GEEL +T LV+ N+EK       +DV    + LDV +H+G Y P    G+ EN A+LA+VG+GR+ A AKYSDWVR 
Subjt:  TLVDDSMREVLP---VALKERNEGEELTITALVSVNLEKFGESDGVVDVEKVKEVLDVHVHVGNYVP-REFGIPENGANLAVVGQGREFAEAKYSDWVRR

Query:  RKSLEKLRPPSVTELLLSNDGDQILEGCLTNFFVVCRKVDNEDKNKRVLDFTSTNSFELQTAPIRDGVLTGVIRQVVKEACLSNGIPFREVAPTWSSNEI
        RK LEK RPP  TELLLSNDGD +LEGC+TNFFVVCR+V    K+   L   S + FE+QTAPI DGVL GVIR +V E CLS GIP+RE AP+WS  E+
Subjt:  RKSLEKLRPPSVTELLLSNDGDQILEGCLTNFFVVCRKVDNEDKNKRVLDFTSTNSFELQTAPIRDGVLTGVIRQVVKEACLSNGIPFREVAPTWSSNEI

Query:  WEEAFVTNSLRILEHVKAMCIP-GTWDLLDSKTWSDIPWNKKSFKDTPGMITSAIQ
        WEEAF+T+SLRIL+HV  + +P G+ + L      +I W +K FK+ PGMIT  I+
Subjt:  WEEAFVTNSLRILEHVKAMCIP-GTWDLLDSKTWSDIPWNKKSFKDTPGMITSAIQ

AT3G54970.2 D-aminoacid aminotransferase-like PLP-dependent enzymes superfamily protein1.1e-7754.07Show/hide
Query:  MASFRFLFSNGVVLQGSEVPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPRLLSESNKTTA----NPVLQSSIDSIPWEPAIR
        M++ RFL+ NGVVL   E PPV TFLE+H GAYTTTR+ NN +S LFW+RHMKRL+ S++IL  S P LL  S  +       PV  SS         I 
Subjt:  MASFRFLFSNGVVLQGSEVPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPRLLSESNKTTA----NPVLQSSIDSIPWEPAIR

Query:  TLVDDSMREVLP---VALKERNEGEELTITALVSVNLEKFGESDGVVDVEKVKEVLDVHVHVGNYVP-REFGIPENGANLAVVGQGREFAEAKYSDWVRR
          V+ SM E L    V   ER  GEEL +T LV+ N+EK       +DV    + LDV +H+G Y P    G+ EN A+LA+VG+GR+ A AKYSDWVR 
Subjt:  TLVDDSMREVLP---VALKERNEGEELTITALVSVNLEKFGESDGVVDVEKVKEVLDVHVHVGNYVP-REFGIPENGANLAVVGQGREFAEAKYSDWVRR

Query:  RKSLEKLRPPSVTELLLSNDGDQILEGCLTNFFVVCRKVDNEDKNKRVLDFTSTNSFELQTAPIRDGVLTGVIRQVVKEACLSNGIPFREVAPTWSSNEI
        RK LEK RPP  TELLLSNDGD +LEGC+TNFFVVCR+V    K+   L   S + FE+QTAPI DGVL GVIR +V E CLS GIP+RE AP+WS  E+
Subjt:  RKSLEKLRPPSVTELLLSNDGDQILEGCLTNFFVVCRKVDNEDKNKRVLDFTSTNSFELQTAPIRDGVLTGVIRQVVKEACLSNGIPFREVAPTWSSNEI

Query:  WEEAFVT
        WEEAF+T
Subjt:  WEEAFVT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAGCTTCCGCTTCTTGTTCAGTAATGGCGTCGTGTTGCAAGGCTCCGAAGTTCCTCCAGTCGCTACCTTCCTCGAAACTCATCCTGGCGCTTATACCACT
ACTCGGTCCCATAACAATGCGTCGAGCATTCTGTTCTGGGACAGGCACATGAAAAGGCTCACTCAATCAGTTAAAATTCTGTCGAATTCGACTCCACGACTCTTG
TCTGAATCGAACAAAACGACTGCTAATCCGGTATTACAGTCGTCGATCGATTCCATTCCTTGGGAACCAGCTATTCGGACGCTTGTCGATGATTCTATGAGAGAA
GTGTTGCCGGTAGCGTTGAAGGAGAGGAATGAGGGAGAAGAATTGACAATTACAGCACTAGTTAGTGTGAATTTGGAAAAATTTGGTGAAAGTGACGGCGTAGTG
GATGTAGAGAAAGTTAAAGAGGTTCTTGATGTGCACGTGCATGTCGGTAACTACGTCCCTCGTGAATTTGGGATCCCGGAAAATGGTGCAAATCTGGCCGTGGTG
GGCCAAGGGAGGGAATTCGCTGAGGCGAAGTACTCCGATTGGGTTAGGCGTAGGAAGTCTCTGGAAAAATTGAGGCCTCCTTCTGTGACTGAGCTTCTGTTGTCA
AACGATGGTGATCAGATACTTGAAGGCTGCCTGACAAACTTTTTTGTTGTTTGCCGCAAGGTTGATAACGAAGATAAGAATAAGAGAGTTCTTGATTTCACAAGT
ACAAATTCCTTTGAACTGCAGACAGCTCCCATTAGGGACGGTGTTCTGACTGGGGTTATTCGCCAAGTAGTCAAAGAAGCTTGTTTAAGTAATGGCATTCCATTT
CGAGAAGTTGCACCTACTTGGTCAAGTAATGAAATCTGGGAAGAAGCATTTGTTACAAATAGCTTGAGAATCTTGGAGCACGTGAAAGCCATGTGCATTCCTGGC
ACGTGGGACTTGCTCGACTCGAAGACATGGAGCGATATACCGTGGAATAAGAAGTCGTTTAAGGATACTCCTGGAATGATCACAAGCGCAATCCAGGTACCCTCT
TTCCATTTTCTCTCAATTCTCTTCCATTCCAGGCTTAGGGTTCCACTGATTCAGGATCTCGCTCACATGGCGGAGGCTCCGACGAGTCCAGCCGGCGGCAGCCAC
GAGAGCGGCGGCGAGCAGAGCCCCAATACCGGTGGGGTTCGGGAGCAGGACCGATACCTCCCGATCGCTAACATTAGCCGGATCATGAAGAAGGCCTTGCCCGCT
AATGGCAAGATCGCCAAGGACGCCAAGGACACCGTCCAGGAATGCGTCTCCGAATTCATCAGCTTCATCACTAGCGAGGCGAGCGATAAGTGCCAGAAGGAGAAG
AGAAAGACCATTAATGGGGATGATTTGCTGTGGGCAATGGCGACATTGGGTTTCGAGGACTATATTGATCCGCTTAAGTCGTATCTAACTAGGTACAGAGAGTGT
GATGCTAAGGGATCTTCTAGGGGTGGTGATGAGTCTGCTAAAAGAGATGCAGTTGGGGCCTTGCCTGGCCAAAATTCCCAGCAGTACATGCAGCCGGGAGCAATG
ACCTACATTAACACCCAAGGACAGCATTTGATCATTCCTTCAATGCAGAATAATGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGAGCTTCCGCTTCTTGTTCAGTAATGGCGTCGTGTTGCAAGGCTCCGAAGTTCCTCCAGTCGCTACCTTCCTCGAAACTCATCCTGGCGCTTATACCACT
ACTCGGTCCCATAACAATGCGTCGAGCATTCTGTTCTGGGACAGGCACATGAAAAGGCTCACTCAATCAGTTAAAATTCTGTCGAATTCGACTCCACGACTCTTG
TCTGAATCGAACAAAACGACTGCTAATCCGGTATTACAGTCGTCGATCGATTCCATTCCTTGGGAACCAGCTATTCGGACGCTTGTCGATGATTCTATGAGAGAA
GTGTTGCCGGTAGCGTTGAAGGAGAGGAATGAGGGAGAAGAATTGACAATTACAGCACTAGTTAGTGTGAATTTGGAAAAATTTGGTGAAAGTGACGGCGTAGTG
GATGTAGAGAAAGTTAAAGAGGTTCTTGATGTGCACGTGCATGTCGGTAACTACGTCCCTCGTGAATTTGGGATCCCGGAAAATGGTGCAAATCTGGCCGTGGTG
GGCCAAGGGAGGGAATTCGCTGAGGCGAAGTACTCCGATTGGGTTAGGCGTAGGAAGTCTCTGGAAAAATTGAGGCCTCCTTCTGTGACTGAGCTTCTGTTGTCA
AACGATGGTGATCAGATACTTGAAGGCTGCCTGACAAACTTTTTTGTTGTTTGCCGCAAGGTTGATAACGAAGATAAGAATAAGAGAGTTCTTGATTTCACAAGT
ACAAATTCCTTTGAACTGCAGACAGCTCCCATTAGGGACGGTGTTCTGACTGGGGTTATTCGCCAAGTAGTCAAAGAAGCTTGTTTAAGTAATGGCATTCCATTT
CGAGAAGTTGCACCTACTTGGTCAAGTAATGAAATCTGGGAAGAAGCATTTGTTACAAATAGCTTGAGAATCTTGGAGCACGTGAAAGCCATGTGCATTCCTGGC
ACGTGGGACTTGCTCGACTCGAAGACATGGAGCGATATACCGTGGAATAAGAAGTCGTTTAAGGATACTCCTGGAATGATCACAAGCGCAATCCAGGTACCCTCT
TTCCATTTTCTCTCAATTCTCTTCCATTCCAGGCTTAGGGTTCCACTGATTCAGGATCTCGCTCACATGGCGGAGGCTCCGACGAGTCCAGCCGGCGGCAGCCAC
GAGAGCGGCGGCGAGCAGAGCCCCAATACCGGTGGGGTTCGGGAGCAGGACCGATACCTCCCGATCGCTAACATTAGCCGGATCATGAAGAAGGCCTTGCCCGCT
AATGGCAAGATCGCCAAGGACGCCAAGGACACCGTCCAGGAATGCGTCTCCGAATTCATCAGCTTCATCACTAGCGAGGCGAGCGATAAGTGCCAGAAGGAGAAG
AGAAAGACCATTAATGGGGATGATTTGCTGTGGGCAATGGCGACATTGGGTTTCGAGGACTATATTGATCCGCTTAAGTCGTATCTAACTAGGTACAGAGAGTGT
GATGCTAAGGGATCTTCTAGGGGTGGTGATGAGTCTGCTAAAAGAGATGCAGTTGGGGCCTTGCCTGGCCAAAATTCCCAGCAGTACATGCAGCCGGGAGCAATG
ACCTACATTAACACCCAAGGACAGCATTTGATCATTCCTTCAATGCAGAATAATGAATAG
Protein sequenceShow/hide protein sequence
MASFRFLFSNGVVLQGSEVPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPRLLSESNKTTANPVLQSSIDSIPWEPAIRTLVDDSMRE
VLPVALKERNEGEELTITALVSVNLEKFGESDGVVDVEKVKEVLDVHVHVGNYVPREFGIPENGANLAVVGQGREFAEAKYSDWVRRRKSLEKLRPPSVTELLLS
NDGDQILEGCLTNFFVVCRKVDNEDKNKRVLDFTSTNSFELQTAPIRDGVLTGVIRQVVKEACLSNGIPFREVAPTWSSNEIWEEAFVTNSLRILEHVKAMCIPG
TWDLLDSKTWSDIPWNKKSFKDTPGMITSAIQVPSFHFLSILFHSRLRVPLIQDLAHMAEAPTSPAGGSHESGGEQSPNTGGVREQDRYLPIANISRIMKKALPA
NGKIAKDAKDTVQECVSEFISFITSEASDKCQKEKRKTINGDDLLWAMATLGFEDYIDPLKSYLTRYRECDAKGSSRGGDESAKRDAVGALPGQNSQQYMQPGAM
TYINTQGQHLIIPSMQNNE