; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10014972 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10014972
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionNuclear transcription factor Y subunit B-8
Genome locationChr02:22473376..22483454
RNA-Seq ExpressionHG10014972
SyntenyHG10014972
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016740 - transferase activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domainsIPR001544 - Aminotransferase class IV
IPR003956 - Transcription factor, NFYB/HAP3, conserved site
IPR003958 - Transcription factor CBF/NF-Y/archaeal histone domain
IPR009072 - Histone-fold
IPR036038 - Aminotransferase-like, PLP-dependent enzymes
IPR043131 - Branched-chain-amino-acid aminotransferase-like, N-terminal
IPR043132 - Branched-chain-amino-acid aminotransferase-like, C-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAV70263.1 CBFD_NFYB_HMF domain-containing protein/Aminotran_4 domain-containing protein, partial [Cephalotus follicularis]1.5e-17164.46Show/hide
Query:  MTSFRFLFSNGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPLLLSESNRTINKLVKPSWIDSIPWEPAIRMLVD
        M S RFLFSNG++   S+ P ++TFL TH GAYTT+R+HNN S +L+W RH++RL  S +IL N  P L+S S   + K        S  WE  +  LV+
Subjt:  MTSFRFLFSNGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPLLLSESNRTINKLVKPSWIDSIPWEPAIRMLVD

Query:  DSMRKVLPTALNERIGGEELTITVL-VSVNLEILGESDGAVDVERVKEALDVHVYVGSYVPRQFGVPENGVNLAVVGRGRDVAAAKYSDWVRRRKSLEKL
        +S+ KVL  AL ER  G+EL +T L V  +LE L   DG    ER  E +DVHV+VG+YVP  FG+  NG +LA+VGRGR++AAAKYSDWVR RK LEKL
Subjt:  DSMRKVLPTALNERIGGEELTITVL-VSVNLEILGESDGAVDVERVKEALDVHVYVGSYVPRQFGVPENGVNLAVVGRGRDVAAAKYSDWVRRRKSLEKL

Query:  RPPSVTELLLSNDGDQILEGCVTNFFVVCRKDNNEAKETSVLDSKSTYSFELQTAPISDGVLTGVIRQLVIEACSSNGIPFREVTPTWSSNEIWEGAFVT
        RPPSVTELLLSNDGDQILEGCVTNFFVVCRKD+N+               E+QTAPISDGVL G+IRQLVIE C S GIP REV P+WS +E WE AFVT
Subjt:  RPPSVTELLLSNDGDQILEGCVTNFFVVCRKDNNEAKETSVLDSKSTYSFELQTAPISDGVLTGVIRQLVIEACSSNGIPFREVTPTWSSNEIWEGAFVT

Query:  SSLRILEHVNTICIPNRWDLLDSKTWSEISWNKKSFKDAPGMISSTILDLAHMAEPPTSPAGGSHESGGEQSPNT-GGVREQDRFLPIANISRIMKKALP
        SSLRI++HV+ I +P     L+SK W+EISW ++ F++ PGMI+  I     MA+ P SPAGGSHESGGEQSP   GGVREQDR+LPIANISRIMKKALP
Subjt:  SSLRILEHVNTICIPNRWDLLDSKTWSEISWNKKSFKDAPGMISSTILDLAHMAEPPTSPAGGSHESGGEQSPNT-GGVREQDRFLPIANISRIMKKALP

Query:  ANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDLLWAMATLGFEEYIDPLKSYLTRYRECDAKGSSRGGDESAKRDAVGALPGQNSQ
         NGKIAKDAKDTVQECVSEFISF+TSEASDKC KEKRKTINGDDLLWAMATLGFE+YI+PLK YL RYRE D KGS+RGGD SAKRD VGALP  N+Q
Subjt:  ANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDLLWAMATLGFEEYIDPLKSYLTRYRECDAKGSSRGGDESAKRDAVGALPGQNSQ

KAG6601043.1 hypothetical protein SDJN03_06276, partial [Cucurbita argyrosperma subsp. sororia]2.1e-25380.6Show/hide
Query:  MTSFRFLFSNGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPLLLSESNRTINKLVKPSWIDSIPWEPAIRMLVD
        MTSFRFLFSNGV+LQGSE PPVATFLETHPGAYTTTR+HNNASSILFWDRHMKRLTQSVKILSNSTP LLSESNRTINKLV PS  DSIPWEPAIR LVD
Subjt:  MTSFRFLFSNGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPLLLSESNRTINKLVKPSWIDSIPWEPAIRMLVD

Query:  DSMRKVLPTALNERIGGEELTITVLVSVNLEILGESDGAVDVERVKEALDVHVYVGSYVPRQFGVPENGVNLAVVGRGRDVAAAKYSDWVRRRKSLEKLR
        DSMRKVLP ALNER G EEL +TVLVSVNLE LGESDG VDVERVKEA+ VH +V +YVPR+FGVPENG NLAVVGRGRD AAAKYSDWVRRRKSLEKLR
Subjt:  DSMRKVLPTALNERIGGEELTITVLVSVNLEILGESDGAVDVERVKEALDVHVYVGSYVPRQFGVPENGVNLAVVGRGRDVAAAKYSDWVRRRKSLEKLR

Query:  PPSVTELLLSNDGDQILEGCVTNFFVVCRKDNNEAKETSVLDSKSTYSFELQTAPISDGVLTGVIRQLVIEACSSNGIPFREVTPTWSSNEIWEGAFVTS
        PPSVTELLLSNDGDQILEGC+TNFFVV RK NNEAKE SV DS ST+SFELQTAPISDGVLTGVIRQLV+EAC S GIPFREV PTWSSNE+WE AFVT+
Subjt:  PPSVTELLLSNDGDQILEGCVTNFFVVCRKDNNEAKETSVLDSKSTYSFELQTAPISDGVLTGVIRQLVIEACSSNGIPFREVTPTWSSNEIWEGAFVTS

Query:  SLRILEHVNTICIPNRWDLLDSKTWSEISWNKKSFKDAPGMISSTI------------------------------------------------------
        SLR++EHVNTIC+PN WDLL+SKTW EISWNKKSFKDAPG+I+STI                                                      
Subjt:  SLRILEHVNTICIPNRWDLLDSKTWSEISWNKKSFKDAPGMISSTI------------------------------------------------------

Query:  ---LDLAHMAEPPTSPAGGSHESGGEQSPNTGGVREQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDL
            DLA MAEPPTSP GGSHESGGEQSP TGG REQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDL
Subjt:  ---LDLAHMAEPPTSPAGGSHESGGEQSPNTGGVREQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDL

Query:  LWAMATLGFEEYIDPLKSYLTRYRECDAKGSSRGGDESAKRDAVGALPGQNSQQYMQPGALTYINTQ
        LWAMATLGFEEYIDPLKSYLTRYRECDAKGSSRGGDESAKRDAVGA+PGQNSQQYMQ GALTYINTQ
Subjt:  LWAMATLGFEEYIDPLKSYLTRYRECDAKGSSRGGDESAKRDAVGALPGQNSQQYMQPGALTYINTQ

XP_004143606.1 uncharacterized protein LOC101222891 isoform X1 [Cucumis sativus]6.1e-17690.17Show/hide
Query:  MTSFRFLFSNGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPLLLSESNRTINKLVKPSWIDSIPWEPAIRMLVD
        MTSFRFLFSNG +LQGSEAPPVATFLETH GAYTTTRS NNASSILFWDRHMKRLTQSVKILSNS+PLLLSESN+TIN+LVKPSWIDS+PWEPAIR LVD
Subjt:  MTSFRFLFSNGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPLLLSESNRTINKLVKPSWIDSIPWEPAIRMLVD

Query:  DSMRKVLPTALNERIGGEELTITVLVSVNLEILGESDGAVDVERVKEALDVHVYVGSYVPRQFGVPENGVNLAVVGRGRDVAAAKYSDWVRRRKSLEKLR
        DSMRKV+ TALNERI GEELTITV+VSVNLEILGE++  VDVERVKEALDVHVYVGSYVPR+FGVPENG NLAVVGRGRDVAAAKYSDWVRRRKSLEKLR
Subjt:  DSMRKVLPTALNERIGGEELTITVLVSVNLEILGESDGAVDVERVKEALDVHVYVGSYVPRQFGVPENGVNLAVVGRGRDVAAAKYSDWVRRRKSLEKLR

Query:  PPSVTELLLSNDGDQILEGCVTNFFVVCRKDNNEAKETSVLDSKSTYSFELQTAPISDGVLTGVIRQLVIEACSSNGIPFREVTPTWSSNEIWEGAFVTS
        PPSV+ELLLSNDGDQILEG VTNFFVVCRKDN+E+KETS LDSKS YSFELQTAP+SDGVLTGVIRQLVIEACSS GI FREV PTWSSNEIWE AF+TS
Subjt:  PPSVTELLLSNDGDQILEGCVTNFFVVCRKDNNEAKETSVLDSKSTYSFELQTAPISDGVLTGVIRQLVIEACSSNGIPFREVTPTWSSNEIWEGAFVTS

Query:  SLRILEHVNTICIPNRWDLLDSKTWSEISWNKKSFKDAPGMISSTI
        SLRILEHVNTICIP+ WDLLDSKTWSE SWNKKSFKDAPGMISSTI
Subjt:  SLRILEHVNTICIPNRWDLLDSKTWSEISWNKKSFKDAPGMISSTI

XP_008445841.1 PREDICTED: uncharacterized protein LOC103488743 [Cucumis melo]9.7e-17489.6Show/hide
Query:  MTSFRFLFSNGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPLLLSESNRTINKLVKPSWIDSIPWEPAIRMLVD
        MTSFRFLFSNG +LQGSE PPVATFLETHPGAYTTTRS NNASSILFWDRHMKRLTQSVKILSNS+PLLLSESNRTIN+LVKPSWIDS+PWEPAIR LVD
Subjt:  MTSFRFLFSNGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPLLLSESNRTINKLVKPSWIDSIPWEPAIRMLVD

Query:  DSMRKVLPTALNERIGGEELTITVLVSVNLEILGESDGAVDVERVKEALDVHVYVGSYVPRQFGVPENGVNLAVVGRGRDVAAAKYSDWVRRRKSLEKLR
        DSMRKV+ TALNE+I GEEL+ITV+VSVNLE LGE+   VDVERVKEAL VHVYVGSYVPR+FGVPENG NLAVVGRGRDVAAAKYSDWVRRRKSLEKLR
Subjt:  DSMRKVLPTALNERIGGEELTITVLVSVNLEILGESDGAVDVERVKEALDVHVYVGSYVPRQFGVPENGVNLAVVGRGRDVAAAKYSDWVRRRKSLEKLR

Query:  PPSVTELLLSNDGDQILEGCVTNFFVVCRKDNNEAKETSVLDSKSTYSFELQTAPISDGVLTGVIRQLVIEACSSNGIPFREVTPTWSSNEIWEGAFVTS
        PPSV+ELLLSNDGDQILEG VTNFFVVCRK+N+E+KETSVLDSKS YSFELQTAPISDGVLTGVIRQLVIEACSS GI FREV PTWSSNEIWE AFVTS
Subjt:  PPSVTELLLSNDGDQILEGCVTNFFVVCRKDNNEAKETSVLDSKSTYSFELQTAPISDGVLTGVIRQLVIEACSSNGIPFREVTPTWSSNEIWEGAFVTS

Query:  SLRILEHVNTICIPNRWDLLDSKTWSEISWNKKSFKDAPGMISSTI
        SLRILEHVNTICIP+ W+LLDSKTWSEISWNKKSFKD PGMISSTI
Subjt:  SLRILEHVNTICIPNRWDLLDSKTWSEISWNKKSFKDAPGMISSTI

XP_038891441.1 uncharacterized protein LOC120080860 isoform X1 [Benincasa hispida]1.8e-18089.23Show/hide
Query:  MTSFRFLFSNGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPLLLSESNRTINKLVKPSWIDSIPWEPAIRMLVD
        MTSFRFLFSNGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSN++PLLLSESNRTINK VKPSW+DSIPWEPAIR LVD
Subjt:  MTSFRFLFSNGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPLLLSESNRTINKLVKPSWIDSIPWEPAIRMLVD

Query:  DSMRKVLPTALNERIGGEELTITVLVSVNLEILGESDGAVDVERVKEALDVHVYVGSYVPRQFGVPENGVNLAVVGRGRDVAAAKYSDWVRRRKSLEKLR
        DSMRKVLPTALNERI GEELTITVL+SVNLE LGESDG VDVERV+EALDVHVYVGSYVP +FGVPENG NLAVVGRGRD+AAAKYSDWVR RKSLEKLR
Subjt:  DSMRKVLPTALNERIGGEELTITVLVSVNLEILGESDGAVDVERVKEALDVHVYVGSYVPRQFGVPENGVNLAVVGRGRDVAAAKYSDWVRRRKSLEKLR

Query:  PPSVTELLLSNDGDQILEGCVTNFFVVCRKDNNEAKETSVLDSKSTYSFELQTAPISDGVLTGVIRQLVIEACSSNGIPFREVTPTWSSNEIWEGAFVTS
        PPSVTELLLSN+GDQILEGCVTNFFVVCRKDNNEAKETSV DSKSTYSFELQTAPISDGVLTGVIRQLVIEACSSNGI FREV PTWSSNEIWE AFVTS
Subjt:  PPSVTELLLSNDGDQILEGCVTNFFVVCRKDNNEAKETSVLDSKSTYSFELQTAPISDGVLTGVIRQLVIEACSSNGIPFREVTPTWSSNEIWEGAFVTS

Query:  SLRILEHVNTICIPNRWDLLDSKTWSEISWNKKSFKDAPGMISSTI-LDLAHMAEPPTSPAG
        SLRILEHVNTICIP+ W+LL+SKTWSEISWNKKSFKDAPG+ISS +  D+   A     P G
Subjt:  SLRILEHVNTICIPNRWDLLDSKTWSEISWNKKSFKDAPGMISSTI-LDLAHMAEPPTSPAG

TrEMBL top hitse value%identityAlignment
A0A0A0KQ17 Uncharacterized protein2.9e-17690.17Show/hide
Query:  MTSFRFLFSNGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPLLLSESNRTINKLVKPSWIDSIPWEPAIRMLVD
        MTSFRFLFSNG +LQGSEAPPVATFLETH GAYTTTRS NNASSILFWDRHMKRLTQSVKILSNS+PLLLSESN+TIN+LVKPSWIDS+PWEPAIR LVD
Subjt:  MTSFRFLFSNGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPLLLSESNRTINKLVKPSWIDSIPWEPAIRMLVD

Query:  DSMRKVLPTALNERIGGEELTITVLVSVNLEILGESDGAVDVERVKEALDVHVYVGSYVPRQFGVPENGVNLAVVGRGRDVAAAKYSDWVRRRKSLEKLR
        DSMRKV+ TALNERI GEELTITV+VSVNLEILGE++  VDVERVKEALDVHVYVGSYVPR+FGVPENG NLAVVGRGRDVAAAKYSDWVRRRKSLEKLR
Subjt:  DSMRKVLPTALNERIGGEELTITVLVSVNLEILGESDGAVDVERVKEALDVHVYVGSYVPRQFGVPENGVNLAVVGRGRDVAAAKYSDWVRRRKSLEKLR

Query:  PPSVTELLLSNDGDQILEGCVTNFFVVCRKDNNEAKETSVLDSKSTYSFELQTAPISDGVLTGVIRQLVIEACSSNGIPFREVTPTWSSNEIWEGAFVTS
        PPSV+ELLLSNDGDQILEG VTNFFVVCRKDN+E+KETS LDSKS YSFELQTAP+SDGVLTGVIRQLVIEACSS GI FREV PTWSSNEIWE AF+TS
Subjt:  PPSVTELLLSNDGDQILEGCVTNFFVVCRKDNNEAKETSVLDSKSTYSFELQTAPISDGVLTGVIRQLVIEACSSNGIPFREVTPTWSSNEIWEGAFVTS

Query:  SLRILEHVNTICIPNRWDLLDSKTWSEISWNKKSFKDAPGMISSTI
        SLRILEHVNTICIP+ WDLLDSKTWSE SWNKKSFKDAPGMISSTI
Subjt:  SLRILEHVNTICIPNRWDLLDSKTWSEISWNKKSFKDAPGMISSTI

A0A1Q3BRC3 CBFD_NFYB_HMF domain-containing protein/Aminotran_4 domain-containing protein (Fragment)7.5e-17264.46Show/hide
Query:  MTSFRFLFSNGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPLLLSESNRTINKLVKPSWIDSIPWEPAIRMLVD
        M S RFLFSNG++   S+ P ++TFL TH GAYTT+R+HNN S +L+W RH++RL  S +IL N  P L+S S   + K        S  WE  +  LV+
Subjt:  MTSFRFLFSNGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPLLLSESNRTINKLVKPSWIDSIPWEPAIRMLVD

Query:  DSMRKVLPTALNERIGGEELTITVL-VSVNLEILGESDGAVDVERVKEALDVHVYVGSYVPRQFGVPENGVNLAVVGRGRDVAAAKYSDWVRRRKSLEKL
        +S+ KVL  AL ER  G+EL +T L V  +LE L   DG    ER  E +DVHV+VG+YVP  FG+  NG +LA+VGRGR++AAAKYSDWVR RK LEKL
Subjt:  DSMRKVLPTALNERIGGEELTITVL-VSVNLEILGESDGAVDVERVKEALDVHVYVGSYVPRQFGVPENGVNLAVVGRGRDVAAAKYSDWVRRRKSLEKL

Query:  RPPSVTELLLSNDGDQILEGCVTNFFVVCRKDNNEAKETSVLDSKSTYSFELQTAPISDGVLTGVIRQLVIEACSSNGIPFREVTPTWSSNEIWEGAFVT
        RPPSVTELLLSNDGDQILEGCVTNFFVVCRKD+N+               E+QTAPISDGVL G+IRQLVIE C S GIP REV P+WS +E WE AFVT
Subjt:  RPPSVTELLLSNDGDQILEGCVTNFFVVCRKDNNEAKETSVLDSKSTYSFELQTAPISDGVLTGVIRQLVIEACSSNGIPFREVTPTWSSNEIWEGAFVT

Query:  SSLRILEHVNTICIPNRWDLLDSKTWSEISWNKKSFKDAPGMISSTILDLAHMAEPPTSPAGGSHESGGEQSPNT-GGVREQDRFLPIANISRIMKKALP
        SSLRI++HV+ I +P     L+SK W+EISW ++ F++ PGMI+  I     MA+ P SPAGGSHESGGEQSP   GGVREQDR+LPIANISRIMKKALP
Subjt:  SSLRILEHVNTICIPNRWDLLDSKTWSEISWNKKSFKDAPGMISSTILDLAHMAEPPTSPAGGSHESGGEQSPNT-GGVREQDRFLPIANISRIMKKALP

Query:  ANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDLLWAMATLGFEEYIDPLKSYLTRYRECDAKGSSRGGDESAKRDAVGALPGQNSQ
         NGKIAKDAKDTVQECVSEFISF+TSEASDKC KEKRKTINGDDLLWAMATLGFE+YI+PLK YL RYRE D KGS+RGGD SAKRD VGALP  N+Q
Subjt:  ANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDLLWAMATLGFEEYIDPLKSYLTRYRECDAKGSSRGGDESAKRDAVGALPGQNSQ

A0A1S3BDM6 uncharacterized protein LOC1034887434.7e-17489.6Show/hide
Query:  MTSFRFLFSNGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPLLLSESNRTINKLVKPSWIDSIPWEPAIRMLVD
        MTSFRFLFSNG +LQGSE PPVATFLETHPGAYTTTRS NNASSILFWDRHMKRLTQSVKILSNS+PLLLSESNRTIN+LVKPSWIDS+PWEPAIR LVD
Subjt:  MTSFRFLFSNGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPLLLSESNRTINKLVKPSWIDSIPWEPAIRMLVD

Query:  DSMRKVLPTALNERIGGEELTITVLVSVNLEILGESDGAVDVERVKEALDVHVYVGSYVPRQFGVPENGVNLAVVGRGRDVAAAKYSDWVRRRKSLEKLR
        DSMRKV+ TALNE+I GEEL+ITV+VSVNLE LGE+   VDVERVKEAL VHVYVGSYVPR+FGVPENG NLAVVGRGRDVAAAKYSDWVRRRKSLEKLR
Subjt:  DSMRKVLPTALNERIGGEELTITVLVSVNLEILGESDGAVDVERVKEALDVHVYVGSYVPRQFGVPENGVNLAVVGRGRDVAAAKYSDWVRRRKSLEKLR

Query:  PPSVTELLLSNDGDQILEGCVTNFFVVCRKDNNEAKETSVLDSKSTYSFELQTAPISDGVLTGVIRQLVIEACSSNGIPFREVTPTWSSNEIWEGAFVTS
        PPSV+ELLLSNDGDQILEG VTNFFVVCRK+N+E+KETSVLDSKS YSFELQTAPISDGVLTGVIRQLVIEACSS GI FREV PTWSSNEIWE AFVTS
Subjt:  PPSVTELLLSNDGDQILEGCVTNFFVVCRKDNNEAKETSVLDSKSTYSFELQTAPISDGVLTGVIRQLVIEACSSNGIPFREVTPTWSSNEIWEGAFVTS

Query:  SLRILEHVNTICIPNRWDLLDSKTWSEISWNKKSFKDAPGMISSTI
        SLRILEHVNTICIP+ W+LLDSKTWSEISWNKKSFKD PGMISSTI
Subjt:  SLRILEHVNTICIPNRWDLLDSKTWSEISWNKKSFKDAPGMISSTI

A0A5A7SXQ4 Class IV aminotransferase4.7e-17489.6Show/hide
Query:  MTSFRFLFSNGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPLLLSESNRTINKLVKPSWIDSIPWEPAIRMLVD
        MTSFRFLFSNG +LQGSE PPVATFLETHPGAYTTTRS NNASSILFWDRHMKRLTQSVKILSNS+PLLLSESNRTIN+LVKPSWIDS+PWEPAIR LVD
Subjt:  MTSFRFLFSNGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPLLLSESNRTINKLVKPSWIDSIPWEPAIRMLVD

Query:  DSMRKVLPTALNERIGGEELTITVLVSVNLEILGESDGAVDVERVKEALDVHVYVGSYVPRQFGVPENGVNLAVVGRGRDVAAAKYSDWVRRRKSLEKLR
        DSMRKV+ TALNE+I GEEL+ITV+VSVNLE LGE+   VDVERVKEAL VHVYVGSYVPR+FGVPENG NLAVVGRGRDVAAAKYSDWVRRRKSLEKLR
Subjt:  DSMRKVLPTALNERIGGEELTITVLVSVNLEILGESDGAVDVERVKEALDVHVYVGSYVPRQFGVPENGVNLAVVGRGRDVAAAKYSDWVRRRKSLEKLR

Query:  PPSVTELLLSNDGDQILEGCVTNFFVVCRKDNNEAKETSVLDSKSTYSFELQTAPISDGVLTGVIRQLVIEACSSNGIPFREVTPTWSSNEIWEGAFVTS
        PPSV+ELLLSNDGDQILEG VTNFFVVCRK+N+E+KETSVLDSKS YSFELQTAPISDGVLTGVIRQLVIEACSS GI FREV PTWSSNEIWE AFVTS
Subjt:  PPSVTELLLSNDGDQILEGCVTNFFVVCRKDNNEAKETSVLDSKSTYSFELQTAPISDGVLTGVIRQLVIEACSSNGIPFREVTPTWSSNEIWEGAFVTS

Query:  SLRILEHVNTICIPNRWDLLDSKTWSEISWNKKSFKDAPGMISSTI
        SLRILEHVNTICIP+ W+LLDSKTWSEISWNKKSFKD PGMISSTI
Subjt:  SLRILEHVNTICIPNRWDLLDSKTWSEISWNKKSFKDAPGMISSTI

A0A6J1KA79 uncharacterized protein LOC111493655 isoform X11.8e-17087.57Show/hide
Query:  MTSFRFLFSNGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPLLLSESNRTINKLVKPSWIDSIPWEPAIRMLVD
        MTSFRFLFSNGV+LQGSEAPPVATFLETHPGAYTTTR+HNNASSILFWDRHMKRLTQSVKILSNSTP LLSESNRTINKLV PS IDSIPWEPAIR LVD
Subjt:  MTSFRFLFSNGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPLLLSESNRTINKLVKPSWIDSIPWEPAIRMLVD

Query:  DSMRKVLPTALNERIGGEELTITVLVSVNLEILGESDGAVDVERVKEALDVHVYVGSYVPRQFGVPENGVNLAVVGRGRDVAAAKYSDWVRRRKSLEKLR
        DSMRKVLP ALNER G EEL +TVLVSVNLE LGESDG VDVERVKEA+ VH +VG+YVPR+FGVPENG NLAVVGRGRD AAAKYSDWVR RKSLEKLR
Subjt:  DSMRKVLPTALNERIGGEELTITVLVSVNLEILGESDGAVDVERVKEALDVHVYVGSYVPRQFGVPENGVNLAVVGRGRDVAAAKYSDWVRRRKSLEKLR

Query:  PPSVTELLLSNDGDQILEGCVTNFFVVCRKDNNEAKETSVLDSKSTYSFELQTAPISDGVLTGVIRQLVIEACSSNGIPFREVTPTWSSNEIWEGAFVTS
        PPSVTELLLSNDGDQILEGC+TNFFVVCRK N+EAKE SV DS ST+SFELQTAPISDGVLTGVIRQLV+EAC S GIPFREV PTWSSNE+WE AFVT+
Subjt:  PPSVTELLLSNDGDQILEGCVTNFFVVCRKDNNEAKETSVLDSKSTYSFELQTAPISDGVLTGVIRQLVIEACSSNGIPFREVTPTWSSNEIWEGAFVTS

Query:  SLRILEHVNTICIPNRWDLLDSKTWSEISWNKKSFKDAPGMISSTI
        SLR++EHVNTIC+PN WDLL+SKTW EISWNKKSFKDAPGMI+STI
Subjt:  SLRILEHVNTICIPNRWDLLDSKTWSEISWNKKSFKDAPGMISSTI

SwissProt top hitse value%identityAlignment
P25209 Nuclear transcription factor Y subunit B1.1e-5071.81Show/hide
Query:  MAEPPTSP--AGGSHESGGEQSPNTGG-VREQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDLLWAMA
        MAE P SP   GGSHESG  +    GG VREQDRFLPIANISRIMKKA+PANGKIAKDAK+TVQECVSEFISF+TSEASDKCQ+EKRKTINGDDLLWAMA
Subjt:  MAEPPTSP--AGGSHESGGEQSPNTGG-VREQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDLLWAMA

Query:  TLGFEEYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNS
        TLGFE+YI+PLK YL +YRE   D+K +++  D S K+DA+G +   +S
Subjt:  TLGFEEYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNS

Q60EQ4 Nuclear transcription factor Y subunit B-32.4e-5069.23Show/hide
Query:  MAEPPTSP--AGGSHES--------GGEQSPNTGGVREQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGD
        MA+ P SP   GGSHES        GG      GGVREQDRFLPIANISRIMKKA+PANGKIAKDAK+TVQECVSEFISF+TSEASDKCQ+EKRKTINGD
Subjt:  MAEPPTSP--AGGSHES--------GGEQSPNTGGVREQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGD

Query:  DLLWAMATLGFEEYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNS
        DLLWAMATLGFE+YI+PLK YL +YRE   D+K +++ GD S K+D +G+  G +S
Subjt:  DLLWAMATLGFEEYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNS

Q67XJ2 Nuclear transcription factor Y subunit B-101.4e-5375.68Show/hide
Query:  MAEPPT-SPAGGSHESGGEQSPNTGGVREQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDLLWAMATL
        MAE  T    GGSHESGG+QSP +  VREQDRFLPIANISRIMK+ LP NGKIAKDAK+T+QECVSEFISFVTSEASDKCQ+EKRKTINGDDLLWAMATL
Subjt:  MAEPPT-SPAGGSHESGGEQSPNTGGVREQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDLLWAMATL

Query:  GFEEYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQ
        GFE+YIDPLK YL RYRE   D KGS +GG+ SAKRD   +   Q SQ
Subjt:  GFEEYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQ

Q8VYK4 Nuclear transcription factor Y subunit B-84.0e-5376.76Show/hide
Query:  SPAG-GSHESGGEQSPNTGGVREQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDLLWAMATLGFEEYI
        SP G GSHESGG+QSP +  VREQDRFLPIANISRIMK+ LPANGKIAKDAK+ VQECVSEFISFVTSEASDKCQ+EKRKTINGDDLLWAMATLGFE+Y+
Subjt:  SPAG-GSHESGGEQSPNTGGVREQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDLLWAMATLGFEEYI

Query:  DPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQ
        +PLK YL RYRE   D KGS++GGD +AK+D   +  GQ SQ
Subjt:  DPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQ

Q9SLG0 Nuclear transcription factor Y subunit B-11.2e-4972.92Show/hide
Query:  MAEPPTSPAGGSHESGGEQSPNTGGVREQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDLLWAMATLG
        MA+ P+SPAG   ESG       G VREQDR+LPIANISRIMKKALP NGKI KDAKDTVQECVSEFISF+TSEASDKCQKEKRKT+NGDDLLWAMATLG
Subjt:  MAEPPTSPAGGSHESGGEQSPNTGGVREQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDLLWAMATLG

Query:  FEEYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQ
        FE+Y++PLK YL RYRE   D KGS + GD S  RDA G + G+
Subjt:  FEEYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQ

Arabidopsis top hitse value%identityAlignment
AT2G37060.1 nuclear factor Y, subunit B82.8e-5476.76Show/hide
Query:  SPAG-GSHESGGEQSPNTGGVREQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDLLWAMATLGFEEYI
        SP G GSHESGG+QSP +  VREQDRFLPIANISRIMK+ LPANGKIAKDAK+ VQECVSEFISFVTSEASDKCQ+EKRKTINGDDLLWAMATLGFE+Y+
Subjt:  SPAG-GSHESGGEQSPNTGGVREQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDLLWAMATLGFEEYI

Query:  DPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQ
        +PLK YL RYRE   D KGS++GGD +AK+D   +  GQ SQ
Subjt:  DPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQ

AT2G37060.2 nuclear factor Y, subunit B82.8e-5476.76Show/hide
Query:  SPAG-GSHESGGEQSPNTGGVREQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDLLWAMATLGFEEYI
        SP G GSHESGG+QSP +  VREQDRFLPIANISRIMK+ LPANGKIAKDAK+ VQECVSEFISFVTSEASDKCQ+EKRKTINGDDLLWAMATLGFE+Y+
Subjt:  SPAG-GSHESGGEQSPNTGGVREQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDLLWAMATLGFEEYI

Query:  DPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQ
        +PLK YL RYRE   D KGS++GGD +AK+D   +  GQ SQ
Subjt:  DPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQ

AT3G53340.1 nuclear factor Y, subunit B109.7e-5575.68Show/hide
Query:  MAEPPT-SPAGGSHESGGEQSPNTGGVREQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDLLWAMATL
        MAE  T    GGSHESGG+QSP +  VREQDRFLPIANISRIMK+ LP NGKIAKDAK+T+QECVSEFISFVTSEASDKCQ+EKRKTINGDDLLWAMATL
Subjt:  MAEPPT-SPAGGSHESGGEQSPNTGGVREQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDLLWAMATL

Query:  GFEEYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQ
        GFE+YIDPLK YL RYRE   D KGS +GG+ SAKRD   +   Q SQ
Subjt:  GFEEYIDPLKSYLTRYREC--DAKGSSRGGDESAKRDAVGALPGQNSQ

AT3G54970.1 D-aminoacid aminotransferase-like PLP-dependent enzymes superfamily protein8.4e-9153.28Show/hide
Query:  MTSFRFLFSNGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPLLLSESNRTINKLVKPSWIDSIPWEPAIRMLVD
        M++ RFL+ NGVVL   EAPPV TFLE+H GAYTTTR+ NN +S LFW+RHMKRL+ S++IL  S P LL  S  +        W++      +I   V+
Subjt:  MTSFRFLFSNGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPLLLSESNRTINKLVKPSWIDSIPWEPAIRMLVD

Query:  DSMRKVLPTAL---NERIGGEELTITVLVSVNLEILGESDGAVDVERVKEALDVHVYVGSYVP-RQFGVPENGVNLAVVGRGRDVAAAKYSDWVRRRKSL
         SM + L + +   +ER+ GEEL +TVLV+ N+E L      +DV    + LDV +++G+Y P    GV EN  +LA+VGRGRDVAAAKYSDWVR RK L
Subjt:  DSMRKVLPTAL---NERIGGEELTITVLVSVNLEILGESDGAVDVERVKEALDVHVYVGSYVP-RQFGVPENGVNLAVVGRGRDVAAAKYSDWVRRRKSL

Query:  EKLRPPSVTELLLSNDGDQILEGCVTNFFVVCRKDNNEAKETSVLDSKSTYSFELQTAPISDGVLTGVIRQLVIEACSSNGIPFREVTPTWSSNEIWEGA
        EK RPP  TELLLSNDGD +LEGC+TNFFVVCR+     K +  L   S   FE+QTAPI+DGVL GVIR LVIE C S GIP+RE  P+WS  E+WE A
Subjt:  EKLRPPSVTELLLSNDGDQILEGCVTNFFVVCRKDNNEAKETSVLDSKSTYSFELQTAPISDGVLTGVIRQLVIEACSSNGIPFREVTPTWSSNEIWEGA

Query:  FVTSSLRILEHVNTICIP-NRWDLLDSKTWSEISWNKKSFKDAPGMISSTI
        F+TSSLRIL+HV TI +P    + L      EI W +K FK+ PGMI+  I
Subjt:  FVTSSLRILEHVNTICIP-NRWDLLDSKTWSEISWNKKSFKDAPGMISSTI

AT3G54970.2 D-aminoacid aminotransferase-like PLP-dependent enzymes superfamily protein4.3e-7953.95Show/hide
Query:  MTSFRFLFSNGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPLLLSESNRTINKLVKPSWIDSIPWEPAIRMLVD
        M++ RFL+ NGVVL   EAPPV TFLE+H GAYTTTR+ NN +S LFW+RHMKRL+ S++IL  S P LL  S  +        W++      +I   V+
Subjt:  MTSFRFLFSNGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPLLLSESNRTINKLVKPSWIDSIPWEPAIRMLVD

Query:  DSMRKVLPTAL---NERIGGEELTITVLVSVNLEILGESDGAVDVERVKEALDVHVYVGSYVP-RQFGVPENGVNLAVVGRGRDVAAAKYSDWVRRRKSL
         SM + L + +   +ER+ GEEL +TVLV+ N+E L      +DV    + LDV +++G+Y P    GV EN  +LA+VGRGRDVAAAKYSDWVR RK L
Subjt:  DSMRKVLPTAL---NERIGGEELTITVLVSVNLEILGESDGAVDVERVKEALDVHVYVGSYVP-RQFGVPENGVNLAVVGRGRDVAAAKYSDWVRRRKSL

Query:  EKLRPPSVTELLLSNDGDQILEGCVTNFFVVCRKDNNEAKETSVLDSKSTYSFELQTAPISDGVLTGVIRQLVIEACSSNGIPFREVTPTWSSNEIWEGA
        EK RPP  TELLLSNDGD +LEGC+TNFFVVCR+     K +  L   S   FE+QTAPI+DGVL GVIR LVIE C S GIP+RE  P+WS  E+WE A
Subjt:  EKLRPPSVTELLLSNDGDQILEGCVTNFFVVCRKDNNEAKETSVLDSKSTYSFELQTAPISDGVLTGVIRQLVIEACSSNGIPFREVTPTWSSNEIWEGA

Query:  FVTS
        F+T+
Subjt:  FVTS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAAGCTTCCGATTCTTGTTTAGCAATGGCGTCGTATTGCAAGGCTCCGAAGCTCCTCCGGTCGCCACCTTCCTCGAAACTCATCCTGGCGCTTATACAACTACTCG
ATCACATAACAATGCGTCGAGCATTCTGTTTTGGGATAGGCACATGAAAAGATTGACTCAATCGGTGAAAATTCTGTCGAATTCAACTCCACTACTCTTGTCTGAATCGA
ACAGAACAATTAATAAACTGGTAAAACCGTCGTGGATAGATTCAATTCCTTGGGAACCAGCTATCCGGATGCTTGTTGATGATTCAATGAGAAAAGTGTTGCCAACAGCA
TTGAATGAGAGAATTGGGGGAGAAGAATTGACAATTACAGTGCTTGTAAGTGTGAATTTGGAAATTTTGGGTGAAAGTGACGGTGCTGTGGATGTAGAAAGAGTCAAAGA
GGCTCTTGATGTGCACGTGTATGTTGGTAGTTATGTTCCTCGTCAATTTGGTGTCCCAGAAAATGGTGTGAATCTGGCTGTGGTGGGTCGAGGGAGGGATGTTGCTGCGG
CGAAGTACTCAGATTGGGTTAGGCGTAGGAAGTCTCTGGAAAAATTGAGGCCTCCTTCTGTGACTGAGCTTTTGTTGTCAAATGATGGTGATCAGATACTTGAAGGCTGC
GTGACAAACTTTTTTGTTGTTTGTCGCAAGGATAATAATGAAGCTAAAGAAACAAGCGTTCTTGATTCCAAAAGTACATATTCCTTTGAACTGCAGACAGCTCCCATTAG
TGATGGTGTTCTGACTGGAGTTATTCGTCAATTAGTTATCGAAGCTTGTTCGAGCAACGGCATTCCATTTCGAGAAGTTACACCTACTTGGTCAAGTAATGAAATATGGG
AAGGAGCATTTGTTACAAGTAGCTTGAGAATCTTGGAGCACGTGAATACTATTTGCATTCCTAATAGATGGGACTTGCTCGACTCGAAAACATGGAGTGAGATATCATGG
AACAAGAAGTCATTTAAGGATGCTCCTGGAATGATCTCAAGCACAATCCTGGATCTCGCTCATATGGCGGAGCCTCCTACCAGTCCGGCCGGCGGTAGCCACGAAAGCGG
TGGTGAGCAGAGCCCTAACACAGGTGGCGTTCGTGAACAGGACCGATTCCTTCCGATCGCTAATATTAGTCGGATCATGAAGAAAGCCTTACCTGCTAATGGCAAGATCG
CTAAAGACGCTAAAGATACCGTCCAGGAATGCGTCTCTGAATTCATTAGCTTCGTTACTAGCGAGGCGAGTGATAAGTGCCAGAAGGAGAAGAGGAAGACTATTAATGGA
GATGATTTACTTTGGGCAATGGCGACGTTGGGATTTGAGGAATATATTGATCCGCTTAAGTCGTACCTTACTAGATACAGAGAGTGTGATGCAAAAGGATCTTCTAGGGG
TGGTGACGAATCTGCTAAAAGAGATGCAGTTGGAGCCTTGCCTGGTCAAAATTCCCAGCAATACATGCAGCCGGGAGCATTGACCTACATTAACACTCAAGTAATAATCT
CTCTTCTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGACAAGCTTCCGATTCTTGTTTAGCAATGGCGTCGTATTGCAAGGCTCCGAAGCTCCTCCGGTCGCCACCTTCCTCGAAACTCATCCTGGCGCTTATACAACTACTCG
ATCACATAACAATGCGTCGAGCATTCTGTTTTGGGATAGGCACATGAAAAGATTGACTCAATCGGTGAAAATTCTGTCGAATTCAACTCCACTACTCTTGTCTGAATCGA
ACAGAACAATTAATAAACTGGTAAAACCGTCGTGGATAGATTCAATTCCTTGGGAACCAGCTATCCGGATGCTTGTTGATGATTCAATGAGAAAAGTGTTGCCAACAGCA
TTGAATGAGAGAATTGGGGGAGAAGAATTGACAATTACAGTGCTTGTAAGTGTGAATTTGGAAATTTTGGGTGAAAGTGACGGTGCTGTGGATGTAGAAAGAGTCAAAGA
GGCTCTTGATGTGCACGTGTATGTTGGTAGTTATGTTCCTCGTCAATTTGGTGTCCCAGAAAATGGTGTGAATCTGGCTGTGGTGGGTCGAGGGAGGGATGTTGCTGCGG
CGAAGTACTCAGATTGGGTTAGGCGTAGGAAGTCTCTGGAAAAATTGAGGCCTCCTTCTGTGACTGAGCTTTTGTTGTCAAATGATGGTGATCAGATACTTGAAGGCTGC
GTGACAAACTTTTTTGTTGTTTGTCGCAAGGATAATAATGAAGCTAAAGAAACAAGCGTTCTTGATTCCAAAAGTACATATTCCTTTGAACTGCAGACAGCTCCCATTAG
TGATGGTGTTCTGACTGGAGTTATTCGTCAATTAGTTATCGAAGCTTGTTCGAGCAACGGCATTCCATTTCGAGAAGTTACACCTACTTGGTCAAGTAATGAAATATGGG
AAGGAGCATTTGTTACAAGTAGCTTGAGAATCTTGGAGCACGTGAATACTATTTGCATTCCTAATAGATGGGACTTGCTCGACTCGAAAACATGGAGTGAGATATCATGG
AACAAGAAGTCATTTAAGGATGCTCCTGGAATGATCTCAAGCACAATCCTGGATCTCGCTCATATGGCGGAGCCTCCTACCAGTCCGGCCGGCGGTAGCCACGAAAGCGG
TGGTGAGCAGAGCCCTAACACAGGTGGCGTTCGTGAACAGGACCGATTCCTTCCGATCGCTAATATTAGTCGGATCATGAAGAAAGCCTTACCTGCTAATGGCAAGATCG
CTAAAGACGCTAAAGATACCGTCCAGGAATGCGTCTCTGAATTCATTAGCTTCGTTACTAGCGAGGCGAGTGATAAGTGCCAGAAGGAGAAGAGGAAGACTATTAATGGA
GATGATTTACTTTGGGCAATGGCGACGTTGGGATTTGAGGAATATATTGATCCGCTTAAGTCGTACCTTACTAGATACAGAGAGTGTGATGCAAAAGGATCTTCTAGGGG
TGGTGACGAATCTGCTAAAAGAGATGCAGTTGGAGCCTTGCCTGGTCAAAATTCCCAGCAATACATGCAGCCGGGAGCATTGACCTACATTAACACTCAAGTAATAATCT
CTCTTCTCTAA
Protein sequenceShow/hide protein sequence
MTSFRFLFSNGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLTQSVKILSNSTPLLLSESNRTINKLVKPSWIDSIPWEPAIRMLVDDSMRKVLPTA
LNERIGGEELTITVLVSVNLEILGESDGAVDVERVKEALDVHVYVGSYVPRQFGVPENGVNLAVVGRGRDVAAAKYSDWVRRRKSLEKLRPPSVTELLLSNDGDQILEGC
VTNFFVVCRKDNNEAKETSVLDSKSTYSFELQTAPISDGVLTGVIRQLVIEACSSNGIPFREVTPTWSSNEIWEGAFVTSSLRILEHVNTICIPNRWDLLDSKTWSEISW
NKKSFKDAPGMISSTILDLAHMAEPPTSPAGGSHESGGEQSPNTGGVREQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTING
DDLLWAMATLGFEEYIDPLKSYLTRYRECDAKGSSRGGDESAKRDAVGALPGQNSQQYMQPGALTYINTQVIISLL