; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G14680 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G14680
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionNuclear transcription factor Y subunit B-8
Genome locationClcChr02:27176419..27197029
RNA-Seq ExpressionClc02G14680
SyntenyClc02G14680
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0007005 - mitochondrion organization (biological process)
GO:0009738 - abscisic acid-activated signaling pathway (biological process)
GO:0031930 - mitochondria-nucleus signaling pathway (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005739 - mitochondrion (cellular component)
GO:0003824 - catalytic activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domainsIPR001544 - Aminotransferase class IV
IPR003734 - Domain of unknown function DUF155
IPR003956 - Transcription factor, NFYB/HAP3, conserved site
IPR003958 - Transcription factor CBF/NF-Y/archaeal histone domain
IPR009072 - Histone-fold
IPR036038 - Aminotransferase-like, PLP-dependent enzymes
IPR043132 - Branched-chain-amino-acid aminotransferase-like, C-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601043.1 hypothetical protein SDJN03_06276, partial [Cucurbita argyrosperma subsp. sororia]6.0e-25282.61Show/hide
Query:  LRGNGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLSQSVKILSNSSPLLLSESNRTINKLVKPSWIDSVPWEPAIRTLVDDSMRKV
        L  NGV+LQGSE PPVATFLETHPGAYTTTR+HNNASSILFWDRHMKRL+QSVKILSNS+P LLSESNRTINKLV PS  DS+PWEPAIRTLVDDSMRKV
Subjt:  LRGNGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLSQSVKILSNSSPLLLSESNRTINKLVKPSWIDSVPWEPAIRTLVDDSMRKV

Query:  LPTALNERLGGEELTITVLVSVNLENLGESDGVVDVERLKEALDVHVHVGSYVPREFGVPENGANLAVVGRGRDVAAAKYSDWVRRRKSLEKLRPPSVTE
        LP ALNER G EEL +TVLVSVNLENLGESDGVVDVER+KEA+ VH HV +YVPREFGVPENGANLAVVGRGRD AAAKYSDWVRRRKSLEKLRPPSVTE
Subjt:  LPTALNERLGGEELTITVLVSVNLENLGESDGVVDVERLKEALDVHVHVGSYVPREFGVPENGANLAVVGRGRDVAAAKYSDWVRRRKSLEKLRPPSVTE

Query:  LLLSNDGDHILEGCVTNFFVVCRKNNNEAKETSVLDSARTYSFELQTAPINDGVLTGVIRQLVIEACSSNGIPFREVAPTWSSNEIWEEAFVTSSLRIVE
        LLLSNDGD ILEGC+TNFFVV RK NNEAKE SV DSA T+SFELQTAPI+DGVLTGVIRQLV+EAC S GIPFREVAPTWSSNE+WEEAFVT+SLR++E
Subjt:  LLLSNDGDHILEGCVTNFFVVCRKNNNEAKETSVLDSARTYSFELQTAPINDGVLTGVIRQLVIEACSSNGIPFREVAPTWSSNEIWEEAFVTSSLRIVE

Query:  HVNTICIPSIWDLLDSKTWSDISWNKKSFKDAPGMISSTIQKEIMEKAVAEAFPIA--------RQIQKFLLFPSSSSSSSSSSS-----------FQDL
        HVNTIC+P+IWDLL+SKTW +ISWNKKSFKDAPG+I+STIQKEIM+K V EAFPI         ++ +K    P + S SS SS+            QDL
Subjt:  HVNTICIPSIWDLLDSKTWSDISWNKKSFKDAPGMISSTIQKEIMEKAVAEAFPIA--------RQIQKFLLFPSSSSSSSSSSS-----------FQDL

Query:  PHMAEPPTSPAGGSHESGGEQSPNTGGVREQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDLLWAMAT
          MAEPPTSP GGSHESGGEQSP TGG REQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDLLWAMAT
Subjt:  PHMAEPPTSPAGGSHESGGEQSPNTGGVREQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDLLWAMAT

Query:  LGFEEYIDPLKSYLTRYRELECDAKGSSRGGDESAKRDPVGALPGQNSQVYL
        LGFEEYIDPLKSYLTRYR  ECDAKGSSRGGDESAKRD VGA+PGQNSQ Y+
Subjt:  LGFEEYIDPLKSYLTRYRELECDAKGSSRGGDESAKRDPVGALPGQNSQVYL

XP_004143605.3 uncharacterized protein LOC101222647 [Cucumis sativus]3.8e-20692.93Show/hide
Query:  MWRTIDAHLRSVRLLPNLSAHSSSSSSSASSLFSSGRSFLARS-SSTLLSPVPKPHSITLSKTLASRSTINCLSSVSCFGLGIQRFGGSSCGVLVLAKCI
        MWRTIDAHLRSVRLLP+LS  SSSSSSS+SS FSSGRSF+ RS S+T  SP PKPHSITLSKTLA    IN LSSVSCF LGIQR  GS+ GVLVLA+CI
Subjt:  MWRTIDAHLRSVRLLPNLSAHSSSSSSSASSLFSSGRSFLARS-SSTLLSPVPKPHSITLSKTLASRSTINCLSSVSCFGLGIQRFGGSSCGVLVLAKCI

Query:  TSSVHTLECNEPVSCSEVGDGGFRSVGEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRRLVDQNKRNFIPPSSRMTNYVVLKFGDLCNVNTHGASISG
        TSSV++LE NEPVSCSEVGDGGFRSV EGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLR LVDQNKRNFIPPSSRMTNYVVLKFGDLCNVNTHGASI G
Subjt:  TSSVHTLECNEPVSCSEVGDGGFRSVGEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRRLVDQNKRNFIPPSSRMTNYVVLKFGDLCNVNTHGASISG

Query:  SDCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMV
        SDCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMV
Subjt:  SDCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMV

Query:  AEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSD
        AEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSD
Subjt:  AEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSD

Query:  FLEWLIIALI
        FLEWLIIALI
Subjt:  FLEWLIIALI

XP_008445840.1 PREDICTED: uncharacterized protein LOC103488742 [Cucumis melo]1.4e-20892.53Show/hide
Query:  MWRTIDAHLRSVRLLPNLSAHSSSSSS-----SASSLFSSGRSFLARS-SSTLLSPVPKPHSITLSKTLASRSTINCLSSVSCFGLGIQRFGGSSCGVLV
        MWRTIDAHLRSVRLLP+LS+HSSSSSS     S+SSLFSSGRSFL RS S+TL+SP+PKPHSITLSKTLA    IN  SSVSCF LGIQRF GS+ GVLV
Subjt:  MWRTIDAHLRSVRLLPNLSAHSSSSSS-----SASSLFSSGRSFLARS-SSTLLSPVPKPHSITLSKTLASRSTINCLSSVSCFGLGIQRFGGSSCGVLV

Query:  LAKCITSSVHTLECNEPVSCSEVGDGGFRSVGEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRRLVDQNKRNFIPPSSRMTNYVVLKFGDLCNVNTHG
        LA+CITSS +TLE NEPVSCSEVGDGGFRSV EGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLR LVDQNKRNFIPPSSRMTNYVVLKFGDLCNVNTH 
Subjt:  LAKCITSSVHTLECNEPVSCSEVGDGGFRSVGEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRRLVDQNKRNFIPPSSRMTNYVVLKFGDLCNVNTHG

Query:  ASISGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQ
        ASISGSDCCFMVVFQYGSIVLFNVREH+VDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQ
Subjt:  ASISGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQ

Query:  VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQ
        VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQ
Subjt:  VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQ

Query:  NRKSDFLEWLIIALI
        NRKSDFLEWLIIALI
Subjt:  NRKSDFLEWLIIALI

XP_022139336.1 uncharacterized protein LOC111010276 [Momordica charantia]1.5e-20791.44Show/hide
Query:  MWRTIDAHLRSVRLLPNLSAHSSSSSSSASSLFSSGRSFLARSSSTLLSPVPKPHSITLSKTLASRSTINCLSSVSCFGLGIQRFGGSSCGVLVLAKCIT
        MWRTIDAHLRSVRL+P LSA+SSSSSSS S LF++GRSFL RSSS+LLSPVP+ HSITL +TL+SR+ +NC SS  C GLGI+RFG SSCG++VLA+CIT
Subjt:  MWRTIDAHLRSVRLLPNLSAHSSSSSSSASSLFSSGRSFLARSSSTLLSPVPKPHSITLSKTLASRSTINCLSSVSCFGLGIQRFGGSSCGVLVLAKCIT

Query:  SSVHTLECNEPVSCSEVGDGGFRSVGEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRRLVDQNKRNFIPPSSRMTNYVVLKFGDLCNVNTHGASISGS
        SSVHTLE NEPVSCSEVGDGGFRS+GEG+SDGEGDEVEEDSRPSIPVRAYFFSTSVDLR LVDQNK NFIPPSSRMTNYVVLKFGDLCN NT  ASI+GS
Subjt:  SSVHTLECNEPVSCSEVGDGGFRSVGEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRRLVDQNKRNFIPPSSRMTNYVVLKFGDLCNVNTHGASISGS

Query:  DCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVA
        DCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVA
Subjt:  DCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVA

Query:  EFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDF
        EFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDF
Subjt:  EFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDF

Query:  LEWLIIALI
        LEWLIIALI
Subjt:  LEWLIIALI

XP_038891437.1 uncharacterized protein LOC120080856 [Benincasa hispida]6.0e-22096.36Show/hide
Query:  MWRTIDAHLRSVRLLPNLSAHSSSS---SSSASSLFSSGRSFLARSSSTLLSPVPKPHSITLSKTLASRSTINCLSSVSCFGLGIQRFGGSSCGVLVLAK
        MWR+IDAHLRSVRLLPNLSAHSSSS   SSSASSLFSSGRSF ARSSST LSPVPKPHS+TL KTL  R+TINCLSSVSCFGLGIQRFGGS+CGVLVLAK
Subjt:  MWRTIDAHLRSVRLLPNLSAHSSSS---SSSASSLFSSGRSFLARSSSTLLSPVPKPHSITLSKTLASRSTINCLSSVSCFGLGIQRFGGSSCGVLVLAK

Query:  CITSSVHTLECNEPVSCSEVGDGGFRSVGEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRRLVDQNKRNFIPPSSRMTNYVVLKFGDLCNVNTHGASI
        CITSSVHTLE NEPV CSEVGDGGFRSVGEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLR LVDQNKRNFIPPSSRMTNYVVLKFGDLCNVNTHGASI
Subjt:  CITSSVHTLECNEPVSCSEVGDGGFRSVGEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRRLVDQNKRNFIPPSSRMTNYVVLKFGDLCNVNTHGASI

Query:  SGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDG
        SGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDG
Subjt:  SGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDG

Query:  MVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRK
        MVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRK
Subjt:  MVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRK

Query:  SDFLEWLIIALI
        SDFLEWLIIALI
Subjt:  SDFLEWLIIALI

TrEMBL top hitse value%identityAlignment
A0A0A0KNM3 DUF155 domain-containing protein1.8e-20692.93Show/hide
Query:  MWRTIDAHLRSVRLLPNLSAHSSSSSSSASSLFSSGRSFLARS-SSTLLSPVPKPHSITLSKTLASRSTINCLSSVSCFGLGIQRFGGSSCGVLVLAKCI
        MWRTIDAHLRSVRLLP+LS  SSSSSSS+SS FSSGRSF+ RS S+T  SP PKPHSITLSKTLA    IN LSSVSCF LGIQR  GS+ GVLVLA+CI
Subjt:  MWRTIDAHLRSVRLLPNLSAHSSSSSSSASSLFSSGRSFLARS-SSTLLSPVPKPHSITLSKTLASRSTINCLSSVSCFGLGIQRFGGSSCGVLVLAKCI

Query:  TSSVHTLECNEPVSCSEVGDGGFRSVGEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRRLVDQNKRNFIPPSSRMTNYVVLKFGDLCNVNTHGASISG
        TSSV++LE NEPVSCSEVGDGGFRSV EGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLR LVDQNKRNFIPPSSRMTNYVVLKFGDLCNVNTHGASI G
Subjt:  TSSVHTLECNEPVSCSEVGDGGFRSVGEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRRLVDQNKRNFIPPSSRMTNYVVLKFGDLCNVNTHGASISG

Query:  SDCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMV
        SDCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMV
Subjt:  SDCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMV

Query:  AEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSD
        AEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSD
Subjt:  AEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSD

Query:  FLEWLIIALI
        FLEWLIIALI
Subjt:  FLEWLIIALI

A0A1S3BEH0 uncharacterized protein LOC1034887426.7e-20992.53Show/hide
Query:  MWRTIDAHLRSVRLLPNLSAHSSSSSS-----SASSLFSSGRSFLARS-SSTLLSPVPKPHSITLSKTLASRSTINCLSSVSCFGLGIQRFGGSSCGVLV
        MWRTIDAHLRSVRLLP+LS+HSSSSSS     S+SSLFSSGRSFL RS S+TL+SP+PKPHSITLSKTLA    IN  SSVSCF LGIQRF GS+ GVLV
Subjt:  MWRTIDAHLRSVRLLPNLSAHSSSSSS-----SASSLFSSGRSFLARS-SSTLLSPVPKPHSITLSKTLASRSTINCLSSVSCFGLGIQRFGGSSCGVLV

Query:  LAKCITSSVHTLECNEPVSCSEVGDGGFRSVGEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRRLVDQNKRNFIPPSSRMTNYVVLKFGDLCNVNTHG
        LA+CITSS +TLE NEPVSCSEVGDGGFRSV EGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLR LVDQNKRNFIPPSSRMTNYVVLKFGDLCNVNTH 
Subjt:  LAKCITSSVHTLECNEPVSCSEVGDGGFRSVGEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRRLVDQNKRNFIPPSSRMTNYVVLKFGDLCNVNTHG

Query:  ASISGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQ
        ASISGSDCCFMVVFQYGSIVLFNVREH+VDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQ
Subjt:  ASISGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQ

Query:  VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQ
        VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQ
Subjt:  VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQ

Query:  NRKSDFLEWLIIALI
        NRKSDFLEWLIIALI
Subjt:  NRKSDFLEWLIIALI

A0A6J1CCC7 uncharacterized protein LOC1110102767.5e-20891.44Show/hide
Query:  MWRTIDAHLRSVRLLPNLSAHSSSSSSSASSLFSSGRSFLARSSSTLLSPVPKPHSITLSKTLASRSTINCLSSVSCFGLGIQRFGGSSCGVLVLAKCIT
        MWRTIDAHLRSVRL+P LSA+SSSSSSS S LF++GRSFL RSSS+LLSPVP+ HSITL +TL+SR+ +NC SS  C GLGI+RFG SSCG++VLA+CIT
Subjt:  MWRTIDAHLRSVRLLPNLSAHSSSSSSSASSLFSSGRSFLARSSSTLLSPVPKPHSITLSKTLASRSTINCLSSVSCFGLGIQRFGGSSCGVLVLAKCIT

Query:  SSVHTLECNEPVSCSEVGDGGFRSVGEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRRLVDQNKRNFIPPSSRMTNYVVLKFGDLCNVNTHGASISGS
        SSVHTLE NEPVSCSEVGDGGFRS+GEG+SDGEGDEVEEDSRPSIPVRAYFFSTSVDLR LVDQNK NFIPPSSRMTNYVVLKFGDLCN NT  ASI+GS
Subjt:  SSVHTLECNEPVSCSEVGDGGFRSVGEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRRLVDQNKRNFIPPSSRMTNYVVLKFGDLCNVNTHGASISGS

Query:  DCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVA
        DCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVA
Subjt:  DCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVA

Query:  EFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDF
        EFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDF
Subjt:  EFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDF

Query:  LEWLIIALI
        LEWLIIALI
Subjt:  LEWLIIALI

A0A6J1FTZ0 uncharacterized protein LOC1114472854.7e-20288.92Show/hide
Query:  MWRTIDAHLRSVRLLPNLSAHSSSSSSSASS------LFSSGRSFLARSSSTLLSPVPKPHSITLSKTLASRSTINCLSSVSCFGLGIQRFGGSSCGVLV
        MWRTIDAHLRSVRLLP+LS  SSSSSSS+SS      LF+SGRSF ARSSS+LLSPVPKPH ITLSK LASR+  NCLSSV CFGL   R GGSSCG +V
Subjt:  MWRTIDAHLRSVRLLPNLSAHSSSSSSSASS------LFSSGRSFLARSSSTLLSPVPKPHSITLSKTLASRSTINCLSSVSCFGLGIQRFGGSSCGVLV

Query:  LAKCITSSVHTLECNEPVSCSEVGDGGFRSVGEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRRLVDQNKRNFIPPSSRMTNYVVLKFGDLCNVNTHG
        LA+CIT+SV+TLE NEPVSCSEVG+G FRS  +G SDGE DEV EDSRPSIPVRA+F STSVDLRRLVDQNK NFIPPSSRMTNYVVLKFGDLC+VN++G
Subjt:  LAKCITSSVHTLECNEPVSCSEVGDGGFRSVGEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRRLVDQNKRNFIPPSSRMTNYVVLKFGDLCNVNTHG

Query:  ASISGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQ
        ASISGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVRE PAL+TWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQ
Subjt:  ASISGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQ

Query:  VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQ
        VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWK+AKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQ
Subjt:  VDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQ

Query:  NRKSDFLEWLIIALI
        NRKSDFLEWLIIALI
Subjt:  NRKSDFLEWLIIALI

A0A6J1GZZ1 uncharacterized protein LOC1114587772.4e-19889.49Show/hide
Query:  MWRTIDAHLRSVRLLPNLSAHSSSSSSSASSLFSSGRSFLARSSSTLLSPVPKPHSITLSKTLASRSTINCLSSVSCFGLGIQRFGGSSCGVLVLAKCIT
        MWR IDAHLRSVRLLPNL A+SSS     S LF SGRS LARSSST LSPVPKPHSITLSKTL   +TINCLSSVSC G+GI+RFGGSSCGV+VLA+CIT
Subjt:  MWRTIDAHLRSVRLLPNLSAHSSSSSSSASSLFSSGRSFLARSSSTLLSPVPKPHSITLSKTLASRSTINCLSSVSCFGLGIQRFGGSSCGVLVLAKCIT

Query:  SSVHTLECNEPVSCSEVGDGGFRSVGEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRRLVDQNKRNFIPPSSRMTNYVVLKFGDLCNVNTHGASISGS
        SSVHTLE NEPVSCSE        VGEGI +GE DEVEEDSRPSIPVRAYF STSVDLR LVDQNKRNFIPPSSRMTNYVVLKFGDLCNVNT GAS+SGS
Subjt:  SSVHTLECNEPVSCSEVGDGGFRSVGEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRRLVDQNKRNFIPPSSRMTNYVVLKFGDLCNVNTHGASISGS

Query:  DCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVA
        D  +MVVFQYGSIVLFN+RE EVDGYLKIVEKHASGLLPEMRKDEYEVREK ALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVA
Subjt:  DCCFMVVFQYGSIVLFNVREHEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVA

Query:  EFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDF
        EFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDF
Subjt:  EFTDINREMEATGKFKMKRKKLFQLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDF

Query:  LEWLIIALI
        LEWLIIALI
Subjt:  LEWLIIALI

SwissProt top hitse value%identityAlignment
Q60EQ4 Nuclear transcription factor Y subunit B-36.6e-5269.87Show/hide
Query:  MAEPPTSP--AGGSHES--------GGEQSPNTGGVREQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGD
        MA+ P SP   GGSHES        GG      GGVREQDRFLPIANISRIMKKA+PANGKIAKDAK+TVQECVSEFISF+TSEASDKCQ+EKRKTINGD
Subjt:  MAEPPTSP--AGGSHES--------GGEQSPNTGGVREQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGD

Query:  DLLWAMATLGFEEYIDPLKSYLTRYRELECDAKGSSRGGDESAKRDPVGALPGQNS
        DLLWAMATLGFE+YI+PLK YL +YRE+E D+K +++ GD S K+D +G+  G +S
Subjt:  DLLWAMATLGFEEYIDPLKSYLTRYRELECDAKGSSRGGDESAKRDPVGALPGQNS

Q67XJ2 Nuclear transcription factor Y subunit B-101.3e-5570.48Show/hide
Query:  MAEPPT-SPAGGSHESGGEQSPNTGGVREQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDLLWAMATL
        MAE  T    GGSHESGG+QSP +  VREQDRFLPIANISRIMK+ LP NGKIAKDAK+T+QECVSEFISFVTSEASDKCQ+EKRKTINGDDLLWAMATL
Subjt:  MAEPPT-SPAGGSHESGGEQSPNTGGVREQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDLLWAMATL

Query:  GFEEYIDPLKSYLTRYRELECDAKGSSRGGDESAKRDPVGALPGQNSQVYLLFLNSLFTENFYSSS
        GFE+YIDPLK YL RYRE+E D KGS +GG+ SAKRD     P Q SQ   +     F++  Y +S
Subjt:  GFEEYIDPLKSYLTRYRELECDAKGSSRGGDESAKRDPVGALPGQNSQVYLLFLNSLFTENFYSSS

Q8VYK4 Nuclear transcription factor Y subunit B-84.9e-5576.92Show/hide
Query:  SPAG-GSHESGGEQSPNTGGVREQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDLLWAMATLGFEEYI
        SP G GSHESGG+QSP +  VREQDRFLPIANISRIMK+ LPANGKIAKDAK+ VQECVSEFISFVTSEASDKCQ+EKRKTINGDDLLWAMATLGFE+Y+
Subjt:  SPAG-GSHESGGEQSPNTGGVREQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDLLWAMATLGFEEYI

Query:  DPLKSYLTRYRELECDAKGSSRGGDESAKRDPVGALPGQNSQV
        +PLK YL RYRE+E D KGS++GGD +AK+D   +  GQ SQ+
Subjt:  DPLKSYLTRYRELECDAKGSSRGGDESAKRDPVGALPGQNSQV

Q9C565 Protein RETARDED ROOT GROWTH, mitochondrial1.4e-9161.51Show/hide
Query:  GDEVEEDSRPSIPVRAYFFSTSVDLRRLVDQNKRNFIPPSSRMTNYVVLKFGDL--CNVNTHGASISGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVE
        G E EE +   IP++AYF STS+DL+ +  +N  N +PP+SR TNY+ LKF D     + +     S S+C FMVVFQYGS +LFNV +++VD YL IV 
Subjt:  GDEVEEDSRPSIPVRAYFFSTSVDLRRLVDQNKRNFIPPSSRMTNYVVLKFGDL--CNVNTHGASISGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVE

Query:  KHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANS
        +HASGLL EMRKD+Y V+EKP L   M+GG DYI+L+ L+ + IR IGSVLGQSIALDY   QV+ +V EF DINR M  TG F M RKKLFQLVGKANS
Subjt:  KHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANS

Query:  NLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFLEWLIIALI
        N+ADVILK+GLFERS+IAW++A+YAQI+EYLR+E+E++QRF  LD+KLKF+EHNI FLQE++QNR+SD LEW II L+
Subjt:  NLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFLEWLIIALI

Q9FNB2 Protein RETARDED ROOT GROWTH-LIKE2.5e-12077.62Show/hide
Query:  VGEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRRLVDQNKRNFIPPSSRMTNYVVLKFGDLCN-VNTHGASISGSDCCFMVVFQYGSIVLFNVREHEV
        V E IS G    +E++++ SIPVRAYFFSTSVDLR L++QNK+NFIPP+SRMTNYVVLKFG+  +  +T    ISGS+  +MVVF YGSIVLFNVREHEV
Subjt:  VGEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRRLVDQNKRNFIPPSSRMTNYVVLKFGDLCN-VNTHGASISGSDCCFMVVFQYGSIVLFNVREHEV

Query:  DGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLF
        D YLK+VE+HASGLLPEMRKDEYEVRE P L+TWME G D+I LQ+LN DGIRTIG VLGQSIALDYYGRQVDGMVAEFT+INR++E TG F MKRKKLF
Subjt:  DGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLF

Query:  QLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFLEWLIIALI
        QLVGKAN  LADVILKLGLFERSDIAWKDAKY QIWE+LRDEFELTQ FA+LD+KLKFVEHN+RFLQEILQNRKS  LEWLII LI
Subjt:  QLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFLEWLIIALI

Arabidopsis top hitse value%identityAlignment
AT1G69380.1 Protein of unknown function (DUF155)1.0e-9261.51Show/hide
Query:  GDEVEEDSRPSIPVRAYFFSTSVDLRRLVDQNKRNFIPPSSRMTNYVVLKFGDL--CNVNTHGASISGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVE
        G E EE +   IP++AYF STS+DL+ +  +N  N +PP+SR TNY+ LKF D     + +     S S+C FMVVFQYGS +LFNV +++VD YL IV 
Subjt:  GDEVEEDSRPSIPVRAYFFSTSVDLRRLVDQNKRNFIPPSSRMTNYVVLKFGDL--CNVNTHGASISGSDCCFMVVFQYGSIVLFNVREHEVDGYLKIVE

Query:  KHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANS
        +HASGLL EMRKD+Y V+EKP L   M+GG DYI+L+ L+ + IR IGSVLGQSIALDY   QV+ +V EF DINR M  TG F M RKKLFQLVGKANS
Subjt:  KHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKANS

Query:  NLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFLEWLIIALI
        N+ADVILK+GLFERS+IAW++A+YAQI+EYLR+E+E++QRF  LD+KLKF+EHNI FLQE++QNR+SD LEW II L+
Subjt:  NLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFLEWLIIALI

AT3G53340.1 nuclear factor Y, subunit B109.1e-5770.48Show/hide
Query:  MAEPPT-SPAGGSHESGGEQSPNTGGVREQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDLLWAMATL
        MAE  T    GGSHESGG+QSP +  VREQDRFLPIANISRIMK+ LP NGKIAKDAK+T+QECVSEFISFVTSEASDKCQ+EKRKTINGDDLLWAMATL
Subjt:  MAEPPT-SPAGGSHESGGEQSPNTGGVREQDRFLPIANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDLLWAMATL

Query:  GFEEYIDPLKSYLTRYRELECDAKGSSRGGDESAKRDPVGALPGQNSQVYLLFLNSLFTENFYSSS
        GFE+YIDPLK YL RYRE+E D KGS +GG+ SAKRD     P Q SQ   +     F++  Y +S
Subjt:  GFEEYIDPLKSYLTRYRELECDAKGSSRGGDESAKRDPVGALPGQNSQVYLLFLNSLFTENFYSSS

AT3G54970.1 D-aminoacid aminotransferase-like PLP-dependent enzymes superfamily protein3.4e-9654.75Show/hide
Query:  NGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLSQSVKILSNSSPLLLSESNRTINKLVKPSWIDSVPWEPAIRTLVDDSMRKVLPT
        NGVVL   EAPPV TFLE+H GAYTTTR+ NN +S LFW+RHMKRLS S++IL  S+P LL  S  +        W++      +I   V+ SM + L +
Subjt:  NGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLSQSVKILSNSSPLLLSESNRTINKLVKPSWIDSVPWEPAIRTLVDDSMRKVLPT

Query:  AL---NERLGGEELTITVLVSVNLENLGESDGVVDVERLKEALDVHVHVGSYVP-REFGVPENGANLAVVGRGRDVAAAKYSDWVRRRKSLEKLRPPSVT
         +   +ERL GEEL +TVLV+ N+E L      +DV    + LDV +H+G+Y P    GV EN A+LA+VGRGRDVAAAKYSDWVR RK LEK RPP  T
Subjt:  AL---NERLGGEELTITVLVSVNLENLGESDGVVDVERLKEALDVHVHVGSYVP-REFGVPENGANLAVVGRGRDVAAAKYSDWVRRRKSLEKLRPPSVT

Query:  ELLLSNDGDHILEGCVTNFFVVCRKNNNEAKETSVLDSARTYSFELQTAPINDGVLTGVIRQLVIEACSSNGIPFREVAPTWSSNEIWEEAFVTSSLRIV
        ELLLSNDGDH+LEGC+TNFFVVCR+     K +  L       FE+QTAPI DGVL GVIR LVIE C S GIP+RE AP+WS  E+WEEAF+TSSLRI+
Subjt:  ELLLSNDGDHILEGCVTNFFVVCRKNNNEAKETSVLDSARTYSFELQTAPINDGVLTGVIRQLVIEACSSNGIPFREVAPTWSSNEIWEEAFVTSSLRIV

Query:  EHVNTICIP--SIWDLLDSKTWSDISWNKKSFKDAPGMISSTIQKEIMEKAVAEAFPI
        +HV TI +P  S+  L  +K   +I W +K FK+ PGMI+  I+K IME+ + E FP+
Subjt:  EHVNTICIP--SIWDLLDSKTWSDISWNKKSFKDAPGMISSTIQKEIMEKAVAEAFPI

AT3G54970.2 D-aminoacid aminotransferase-like PLP-dependent enzymes superfamily protein1.2e-8056.27Show/hide
Query:  NGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLSQSVKILSNSSPLLLSESNRTINKLVKPSWIDSVPWEPAIRTLVDDSMRKVLPT
        NGVVL   EAPPV TFLE+H GAYTTTR+ NN +S LFW+RHMKRLS S++IL  S+P LL  S  +        W++      +I   V+ SM + L +
Subjt:  NGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLSQSVKILSNSSPLLLSESNRTINKLVKPSWIDSVPWEPAIRTLVDDSMRKVLPT

Query:  AL---NERLGGEELTITVLVSVNLENLGESDGVVDVERLKEALDVHVHVGSYVP-REFGVPENGANLAVVGRGRDVAAAKYSDWVRRRKSLEKLRPPSVT
         +   +ERL GEEL +TVLV+ N+E L      +DV    + LDV +H+G+Y P    GV EN A+LA+VGRGRDVAAAKYSDWVR RK LEK RPP  T
Subjt:  AL---NERLGGEELTITVLVSVNLENLGESDGVVDVERLKEALDVHVHVGSYVP-REFGVPENGANLAVVGRGRDVAAAKYSDWVRRRKSLEKLRPPSVT

Query:  ELLLSNDGDHILEGCVTNFFVVCRKNNNEAKETSVLDSARTYSFELQTAPINDGVLTGVIRQLVIEACSSNGIPFREVAPTWSSNEIWEEAFVTS
        ELLLSNDGDH+LEGC+TNFFVVCR+     K +  L       FE+QTAPI DGVL GVIR LVIE C S GIP+RE AP+WS  E+WEEAF+T+
Subjt:  ELLLSNDGDHILEGCVTNFFVVCRKNNNEAKETSVLDSARTYSFELQTAPINDGVLTGVIRQLVIEACSSNGIPFREVAPTWSSNEIWEEAFVTS

AT5G13610.1 Protein of unknown function (DUF155)1.8e-12177.62Show/hide
Query:  VGEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRRLVDQNKRNFIPPSSRMTNYVVLKFGDLCN-VNTHGASISGSDCCFMVVFQYGSIVLFNVREHEV
        V E IS G    +E++++ SIPVRAYFFSTSVDLR L++QNK+NFIPP+SRMTNYVVLKFG+  +  +T    ISGS+  +MVVF YGSIVLFNVREHEV
Subjt:  VGEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRRLVDQNKRNFIPPSSRMTNYVVLKFGDLCN-VNTHGASISGSDCCFMVVFQYGSIVLFNVREHEV

Query:  DGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLF
        D YLK+VE+HASGLLPEMRKDEYEVRE P L+TWME G D+I LQ+LN DGIRTIG VLGQSIALDYYGRQVDGMVAEFT+INR++E TG F MKRKKLF
Subjt:  DGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLF

Query:  QLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFLEWLIIALI
        QLVGKAN  LADVILKLGLFERSDIAWKDAKY QIWE+LRDEFELTQ FA+LD+KLKFVEHN+RFLQEILQNRKS  LEWLII LI
Subjt:  QLVGKANSNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFLEWLIIALI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGCGCACCATTGACGCCCATCTGAGGTCCGTACGCCTCCTTCCGAATCTCTCTGCTCATTCTTCTTCTTCTTCATCTTCAGCTTCTTCTCTCTTTTCCTCCGGCCG
TTCATTTCTCGCTCGCTCGAGTTCGACTCTCCTCTCGCCTGTACCCAAGCCTCACTCGATTACTCTCTCCAAAACCCTAGCGTCTCGTTCTACCATTAATTGTTTATCGA
GTGTTTCGTGTTTTGGCCTTGGAATTCAACGCTTCGGAGGATCGAGTTGCGGTGTGTTGGTGTTGGCGAAATGCATTACCTCTTCAGTGCACACGTTGGAGTGTAATGAA
CCAGTGTCGTGTTCGGAGGTTGGAGACGGTGGTTTTCGGAGTGTTGGGGAAGGAATTAGTGACGGTGAAGGGGATGAAGTCGAGGAGGATTCTAGACCGTCTATTCCTGT
CAGAGCTTATTTCTTCTCCACTAGTGTGGATTTGAGAAGATTGGTGGATCAGAATAAACGTAACTTTATCCCGCCATCATCTCGTATGACGAATTATGTAGTCCTTAAGT
TTGGGGATCTTTGTAATGTGAACACTCATGGCGCCAGCATAAGTGGAAGTGATTGCTGTTTCATGGTAGTTTTTCAGTATGGCTCAATTGTGCTGTTTAACGTTCGTGAA
CATGAGGTTGATGGGTATTTGAAAATTGTAGAGAAACATGCATCCGGATTGCTGCCTGAAATGAGAAAGGATGAGTATGAGGTTAGAGAGAAGCCTGCCTTAAATACATG
GATGGAAGGGGGCTTGGACTACATAATGCTGCAGTACTTGAATATTGATGGCATACGTACCATAGGTAGCGTTCTTGGTCAGAGTATTGCTCTTGATTACTATGGGCGAC
AGGTTGATGGGATGGTTGCGGAATTCACTGACATCAACCGTGAAATGGAAGCAACTGGGAAGTTTAAAATGAAGAGGAAGAAGCTTTTCCAGTTGGTGGGAAAGGCAAAT
TCAAATCTTGCTGATGTCATTCTCAAGCTTGGACTTTTTGAGAGATCAGACATTGCATGGAAAGATGCAAAATATGCTCAAATATGGGAATATCTCAGAGATGAGTTTGA
GTTAACACAGAGATTCGCAAGTCTGGATTTCAAATTGAAGTTTGTGGAGCATAATATTCGCTTCCTACAAGAGATTCTGCAAAACAGGAAATCAGATTTTTTGGAATGGC
TGATCATTGCATTGATTGTTGGAACTGGACAGTTTTGGCTTCCTGTATCGGGTAATAACAATTCAATCGACGGGGAAGAGATAAATCGAAGCAGGGTCGGGGATACCCTC
CCTGTCACCGGATGGGTATCCGGAGAATGTCTCCGATTTGACTGGGGTTTTGACTCGATCGGAGCGGAGCTCCGCGGCAATGGCGTCGTATTGCAAGGCTCCGAAGCTCC
TCCGGTCGCCACCTTCCTCGAAACTCATCCTGGCGCTTATACAACAACTCGGTCACATAATAATGCATCGAGCATTCTGTTTTGGGATAGGCACATGAAAAGGCTGAGTC
AGTCGGTGAAAATTCTGTCGAATTCAAGTCCACTACTCTTGTCTGAATCTAACAGAACAATCAATAAACTGGTAAAACCGTCGTGGATAGATTCTGTTCCCTGGGAACCA
GCTATCCGGACGCTTGTTGATGATTCAATGAGAAAAGTGTTGCCAACAGCATTGAATGAGAGACTTGGGGGAGAAGAATTGACAATTACAGTGCTCGTAAGTGTGAATTT
GGAAAATTTGGGTGAAAGTGACGGTGTTGTGGATGTAGAAAGACTTAAAGAGGCCCTTGATGTGCACGTGCATGTTGGTAGTTATGTTCCTCGTGAATTTGGTGTCCCGG
AAAATGGTGCGAATCTGGCTGTGGTGGGTCGAGGGAGGGATGTTGCTGCGGCAAAGTACTCAGATTGGGTTAGGCGTAGGAAGTCTCTGGAAAAATTGAGGCCTCCATCT
GTGACCGAGCTTTTGTTGTCAAATGATGGTGATCATATACTTGAAGGCTGCGTGACAAACTTTTTTGTTGTTTGCCGCAAGAATAATAATGAAGCCAAAGAGACAAGCGT
TCTTGATTCCGCAAGAACATATTCCTTTGAACTGCAGACAGCTCCCATTAATGATGGTGTTCTGACTGGAGTTATTCGTCAATTAGTCATTGAAGCTTGTTCAAGCAACG
GCATCCCATTTCGAGAAGTTGCACCTACTTGGTCAAGTAATGAAATATGGGAAGAAGCATTTGTTACAAGTAGCTTGAGAATCGTGGAGCATGTGAATACTATTTGCATC
CCTAGCATATGGGACTTGCTCGACTCGAAAACGTGGAGTGATATATCATGGAACAAGAAGTCGTTTAAGGATGCTCCTGGAATGATCTCAAGCACAATCCAGAAAGAGAT
AATGGAGAAAGCTGTTGCAGAAGCATTCCCGATTGCGCGCCAAATACAAAAATTTCTCCTCTTTCCCTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCCTTTCAGGATC
TCCCTCATATGGCGGAGCCTCCCACCAGTCCGGCCGGCGGCAGCCACGAAAGCGGCGGTGAGCAGAGCCCTAACACAGGTGGCGTTCGTGAACAGGACCGATTTCTTCCG
ATCGCTAATATTAGTCGGATCATGAAGAAAGCCTTACCTGCTAATGGCAAGATCGCTAAAGACGCTAAAGATACCGTCCAGGAATGCGTCTCTGAATTCATTAGCTTCGT
TACTAGCGAGGCGAGTGATAAGTGTCAGAAGGAGAAGAGGAAGACTATTAATGGAGATGACTTGCTTTGGGCAATGGCGACGTTGGGATTTGAGGAATATATTGATCCGC
TTAAGTCGTACCTTACTAGATACAGAGAGTTGGAGTGTGATGCTAAAGGATCTTCTAGGGGTGGTGATGAATCTGCTAAAAGAGATCCAGTTGGAGCCTTGCCTGGTCAA
AATTCCCAGGTTTATTTACTTTTTTTAAATTCTTTGTTTACAGAGAATTTCTATTCATCGTCATAG
mRNA sequenceShow/hide mRNA sequence
ATGTGGCGCACCATTGACGCCCATCTGAGGTCCGTACGCCTCCTTCCGAATCTCTCTGCTCATTCTTCTTCTTCTTCATCTTCAGCTTCTTCTCTCTTTTCCTCCGGCCG
TTCATTTCTCGCTCGCTCGAGTTCGACTCTCCTCTCGCCTGTACCCAAGCCTCACTCGATTACTCTCTCCAAAACCCTAGCGTCTCGTTCTACCATTAATTGTTTATCGA
GTGTTTCGTGTTTTGGCCTTGGAATTCAACGCTTCGGAGGATCGAGTTGCGGTGTGTTGGTGTTGGCGAAATGCATTACCTCTTCAGTGCACACGTTGGAGTGTAATGAA
CCAGTGTCGTGTTCGGAGGTTGGAGACGGTGGTTTTCGGAGTGTTGGGGAAGGAATTAGTGACGGTGAAGGGGATGAAGTCGAGGAGGATTCTAGACCGTCTATTCCTGT
CAGAGCTTATTTCTTCTCCACTAGTGTGGATTTGAGAAGATTGGTGGATCAGAATAAACGTAACTTTATCCCGCCATCATCTCGTATGACGAATTATGTAGTCCTTAAGT
TTGGGGATCTTTGTAATGTGAACACTCATGGCGCCAGCATAAGTGGAAGTGATTGCTGTTTCATGGTAGTTTTTCAGTATGGCTCAATTGTGCTGTTTAACGTTCGTGAA
CATGAGGTTGATGGGTATTTGAAAATTGTAGAGAAACATGCATCCGGATTGCTGCCTGAAATGAGAAAGGATGAGTATGAGGTTAGAGAGAAGCCTGCCTTAAATACATG
GATGGAAGGGGGCTTGGACTACATAATGCTGCAGTACTTGAATATTGATGGCATACGTACCATAGGTAGCGTTCTTGGTCAGAGTATTGCTCTTGATTACTATGGGCGAC
AGGTTGATGGGATGGTTGCGGAATTCACTGACATCAACCGTGAAATGGAAGCAACTGGGAAGTTTAAAATGAAGAGGAAGAAGCTTTTCCAGTTGGTGGGAAAGGCAAAT
TCAAATCTTGCTGATGTCATTCTCAAGCTTGGACTTTTTGAGAGATCAGACATTGCATGGAAAGATGCAAAATATGCTCAAATATGGGAATATCTCAGAGATGAGTTTGA
GTTAACACAGAGATTCGCAAGTCTGGATTTCAAATTGAAGTTTGTGGAGCATAATATTCGCTTCCTACAAGAGATTCTGCAAAACAGGAAATCAGATTTTTTGGAATGGC
TGATCATTGCATTGATTGTTGGAACTGGACAGTTTTGGCTTCCTGTATCGGGTAATAACAATTCAATCGACGGGGAAGAGATAAATCGAAGCAGGGTCGGGGATACCCTC
CCTGTCACCGGATGGGTATCCGGAGAATGTCTCCGATTTGACTGGGGTTTTGACTCGATCGGAGCGGAGCTCCGCGGCAATGGCGTCGTATTGCAAGGCTCCGAAGCTCC
TCCGGTCGCCACCTTCCTCGAAACTCATCCTGGCGCTTATACAACAACTCGGTCACATAATAATGCATCGAGCATTCTGTTTTGGGATAGGCACATGAAAAGGCTGAGTC
AGTCGGTGAAAATTCTGTCGAATTCAAGTCCACTACTCTTGTCTGAATCTAACAGAACAATCAATAAACTGGTAAAACCGTCGTGGATAGATTCTGTTCCCTGGGAACCA
GCTATCCGGACGCTTGTTGATGATTCAATGAGAAAAGTGTTGCCAACAGCATTGAATGAGAGACTTGGGGGAGAAGAATTGACAATTACAGTGCTCGTAAGTGTGAATTT
GGAAAATTTGGGTGAAAGTGACGGTGTTGTGGATGTAGAAAGACTTAAAGAGGCCCTTGATGTGCACGTGCATGTTGGTAGTTATGTTCCTCGTGAATTTGGTGTCCCGG
AAAATGGTGCGAATCTGGCTGTGGTGGGTCGAGGGAGGGATGTTGCTGCGGCAAAGTACTCAGATTGGGTTAGGCGTAGGAAGTCTCTGGAAAAATTGAGGCCTCCATCT
GTGACCGAGCTTTTGTTGTCAAATGATGGTGATCATATACTTGAAGGCTGCGTGACAAACTTTTTTGTTGTTTGCCGCAAGAATAATAATGAAGCCAAAGAGACAAGCGT
TCTTGATTCCGCAAGAACATATTCCTTTGAACTGCAGACAGCTCCCATTAATGATGGTGTTCTGACTGGAGTTATTCGTCAATTAGTCATTGAAGCTTGTTCAAGCAACG
GCATCCCATTTCGAGAAGTTGCACCTACTTGGTCAAGTAATGAAATATGGGAAGAAGCATTTGTTACAAGTAGCTTGAGAATCGTGGAGCATGTGAATACTATTTGCATC
CCTAGCATATGGGACTTGCTCGACTCGAAAACGTGGAGTGATATATCATGGAACAAGAAGTCGTTTAAGGATGCTCCTGGAATGATCTCAAGCACAATCCAGAAAGAGAT
AATGGAGAAAGCTGTTGCAGAAGCATTCCCGATTGCGCGCCAAATACAAAAATTTCTCCTCTTTCCCTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCCTTTCAGGATC
TCCCTCATATGGCGGAGCCTCCCACCAGTCCGGCCGGCGGCAGCCACGAAAGCGGCGGTGAGCAGAGCCCTAACACAGGTGGCGTTCGTGAACAGGACCGATTTCTTCCG
ATCGCTAATATTAGTCGGATCATGAAGAAAGCCTTACCTGCTAATGGCAAGATCGCTAAAGACGCTAAAGATACCGTCCAGGAATGCGTCTCTGAATTCATTAGCTTCGT
TACTAGCGAGGCGAGTGATAAGTGTCAGAAGGAGAAGAGGAAGACTATTAATGGAGATGACTTGCTTTGGGCAATGGCGACGTTGGGATTTGAGGAATATATTGATCCGC
TTAAGTCGTACCTTACTAGATACAGAGAGTTGGAGTGTGATGCTAAAGGATCTTCTAGGGGTGGTGATGAATCTGCTAAAAGAGATCCAGTTGGAGCCTTGCCTGGTCAA
AATTCCCAGGTTTATTTACTTTTTTTAAATTCTTTGTTTACAGAGAATTTCTATTCATCGTCATAGATACATGTCTTGATTATCCTTTTTTCTTTGTCATGTAATCATAC
GTTGGCTTTTGGTGTAAATATGTTCAGCAATACATGCAGCCGGGAGCATTAACCTACATAAACACTCAAGTAATAATCTCTCTTCTCTAAACTTGTCAACATACATGCTG
TCCAGGAATACCCTCTTTTTCATTGCTAATAATTGGATTGAATTTTCGTTTATTATTCTCTTCTTGTTATTTCCAGTATGGTCTTTAATTGTTGAATGTAGCCCTCACAT
ATAGATACATCTCCTACCTGTTTTTTTTAGTACAAAGTTAGCTTGTCAAGAAACTTGTAGGATTTCATTCTACGTGGTAGGTGGCGAACATGGAGACTGAATCCATTCCC
CCAAAGTCCATTTTCGTTTTTCCAAATGTCTAATGTCAGCAGGCCAAAACCAACCATTGGTGGTTGATATTTAAATGATACACTTTTAATGTTTATATGAAGATATATTA
TTCTACCTTTCAAATATCCCATAGTCAAGAAAAAATTCTAGACATTTTTTTTTTCTGTTGGAGGAATTTCTAGACATTCTAACGTGTCCTACAGTTTCATTGTTCAGAAA
AGAAGATTTCATGGTCTCAAAAAAATTTCTGTAACTGAAGAATGTAACAGTGTTCTAAAATTTCGAATGTGATAGTTATTGGTTTATCTTTATCTATTTATTTTGAAATA
ATGCTTAAATACAACGTTTTATCGTTCTTTCATAGTTCTGCCTTCTTGGTTTTCTAGGATGAAAAGAAAATGAATTGCATTAGTTTTTTTGCTACCTCTCATAATAGGGT
CTTGTGTGAGGAGGATATGCATGTTTACCTGTTGCATCGATTCACTGCAGTCTATAGAACTAAAAATAATTTGATTTGTACTATTCTTCTTGCTATTTGTTCTAGTATAG
GAGGATATGCCTGATTACCTGTCTCATAGATTGTAGTTGCTATAAAATAGCTACGTAGTTGGTCAAATCATAATTTCTTCTTTATGCAATGATATTATATCTTTTAAGAA
GGCCATGAAGGCTTTTATGTTATGGAGGTGGTGTAAATTTGTTTCTACGATGATGGTTAAATGTTTAGTTTTGAACAATCTGGCCTTAGTGTTTTTTAACTTCTTTTCTT
TTTTCTTTTCCTTTTTTTTTTTCCTCATAAAAAAAAAAAGTTATTTTGTTATAATATTTGGCTTTCAATTTAGCAAATTGAAAATACAAACAGAATATGGGTTAAACCAG
TCGGAAGGATTAAATCCCTAAATTCATTCCAAATATATTAATAGAGTTATAGAGCTATACGAAAGAGCTCTAATTGTTAGTAACGCCAGCAACTATTGGCAATCACAGAT
AGTTTGAATGTTGCAAGATTTAAAATAGACCTCTACAAGGATGATAAATTATGAGGAGCAGCTACCACGCTGAGGGCGTTACTCACATGAACATCAAATTCATAGTGGAA
AGACATTTGTTTTAATCACTATCATGCATCTAATTATCATGGGGTGATAACTTGAACTCCTATCAACCTAGAGTTCTCCCTTTCCCCATGAAAGCAAAGGCTCCCCCATA
AGTCTTCCAAACTTCCATTCTAACAGGCTTATATTTGGGCACAAGACTACAGAGGATTGGAAATGATATATGCCAGGGTTGCTAAAACATTATTAAGTGAAGAAGGCGTA
CTTTTGTATCAATTAGTTTGTGGCAGGCCAGATTTCCATACTCAGTTCTGCTTCTGCAACCAGCCTTATGGAACTTTTCTTTCCTTACCCACTTTCAATTACATACCAAG
GAATCAAAGAAGTTGCTTGGCTAATCAATTCTTGATTTGATGTCTCATTGATGCAGTGATTATTAATTTATGATGACATAAGCTTCCTTTGCTAACTTTGAGCTTGACGG
TTTGAAGAACCAGGGTAACAGTTATGACTTATGAGTACAAATCCTCATGGCAGCATACATGAACGAATGTCTTATGAGCTCCCTTTTTTTTGGTACTAAAATTGAGGGGT
ATGCACGGTAGCCTCTTGTTAAAATAGGTGGAAGTCTGTAGATCGTCAGTATCGAAATGGGCTAGAATGAAAAGAAGTGTTGAAATATGAAATTTTAGTAAGCTTTTGAT
TTGTAGCATTTTTACGCTCATTTAAACTTCAACAAGATCTTCAAATCGAGTAACAGGTTTAGATTGTCAATTTCATCCTTAATTTGATTGATATCTATACCCTTCCCAAC
TATCTGCTGTGGAACAAAATGTTGTCCTTACACTAATAGAGTTTAGGATTATGAAATGTGGTGGCTAGTTTTAAATACTAAGGAATGCTGGCTGAGGTGTCTGCGTGAGG
GTGACTCAAATGTTTATTTTAATTTGGCTGTTCGATTATTATGCTTATTGGTGCATTATTTTTTAGCAATAAACAAAGGGCTTTTATTATCAGGGGTTATATCTCGCTCT
TTTCCCTCTCCTATACTTGCTGTATTCACACAATTTTTCCCTTTAAAGAATGTGTGGTAGAAATTGCTTTGTGTTCTGCAATTTTGTATGGTCTTTTTAATATCATTTTA
TTCTTGAAAAAATCGTGTTCTATTTGAGAAGCGCCATGGGTTTGCTGAGATCATTAAGGGAATGAATGTCTCATTCTGGTTTTTAAAATATGATTTGTACAAGTTTATGC
AATTAGTTTCTTTTAAGATTTTTTGTGGAATGAGCCTCGGTGCAATTGAGACATGCATGAGATTTTGCATGTAAAAATTCATAATATTTTGTAGGGGCTAGAATGTGTTG
TCTGATAAACGGCCTTTAGACGGTCTCAAATGAAAATAATTTTGTACTTCTATAACAAATGTACAGCATTGGTATCTATTTACCTATCTATATACATGTATGTCTTAAAA
GAAAAAAAGAAAGAATCCTTCGTTTTATAGAGGTTAGTTGTCTGATGAATATTTTGGCTGACTTGGTGATATTTTCTTCCTGCATCTTATGTTTTGCAGGGACAGCATAT
GATCATTCCTTCAATGCAGAATAATGAATAGGAG
Protein sequenceShow/hide protein sequence
MWRTIDAHLRSVRLLPNLSAHSSSSSSSASSLFSSGRSFLARSSSTLLSPVPKPHSITLSKTLASRSTINCLSSVSCFGLGIQRFGGSSCGVLVLAKCITSSVHTLECNE
PVSCSEVGDGGFRSVGEGISDGEGDEVEEDSRPSIPVRAYFFSTSVDLRRLVDQNKRNFIPPSSRMTNYVVLKFGDLCNVNTHGASISGSDCCFMVVFQYGSIVLFNVRE
HEVDGYLKIVEKHASGLLPEMRKDEYEVREKPALNTWMEGGLDYIMLQYLNIDGIRTIGSVLGQSIALDYYGRQVDGMVAEFTDINREMEATGKFKMKRKKLFQLVGKAN
SNLADVILKLGLFERSDIAWKDAKYAQIWEYLRDEFELTQRFASLDFKLKFVEHNIRFLQEILQNRKSDFLEWLIIALIVGTGQFWLPVSGNNNSIDGEEINRSRVGDTL
PVTGWVSGECLRFDWGFDSIGAELRGNGVVLQGSEAPPVATFLETHPGAYTTTRSHNNASSILFWDRHMKRLSQSVKILSNSSPLLLSESNRTINKLVKPSWIDSVPWEP
AIRTLVDDSMRKVLPTALNERLGGEELTITVLVSVNLENLGESDGVVDVERLKEALDVHVHVGSYVPREFGVPENGANLAVVGRGRDVAAAKYSDWVRRRKSLEKLRPPS
VTELLLSNDGDHILEGCVTNFFVVCRKNNNEAKETSVLDSARTYSFELQTAPINDGVLTGVIRQLVIEACSSNGIPFREVAPTWSSNEIWEEAFVTSSLRIVEHVNTICI
PSIWDLLDSKTWSDISWNKKSFKDAPGMISSTIQKEIMEKAVAEAFPIARQIQKFLLFPSSSSSSSSSSSFQDLPHMAEPPTSPAGGSHESGGEQSPNTGGVREQDRFLP
IANISRIMKKALPANGKIAKDAKDTVQECVSEFISFVTSEASDKCQKEKRKTINGDDLLWAMATLGFEEYIDPLKSYLTRYRELECDAKGSSRGGDESAKRDPVGALPGQ
NSQVYLLFLNSLFTENFYSSS