; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10022284 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10022284
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionTranscription initiation factor IIF subunit beta
Genome locationChr05:22593575..22606626
RNA-Seq ExpressionHG10022284
SyntenyHG10022284
Gene Ontology termsGO:0006367 - transcription initiation from RNA polymerase II promoter (biological process)
GO:0009451 - RNA modification (biological process)
GO:0042742 - defense response to bacterium (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0012505 - endomembrane system (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR011039 - Transcription Factor IIF, Rap30/Rap74, interaction
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily
IPR040450 - TFIIF beta subunit, HTH domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN81117.1 hypothetical protein VITISV_005903 [Vitis vinifera]1.6e-20262.02Show/hide
Query:  IETNKADRAMWLLKCPQAVTRALSNSTDAPS-RPVAKVIVSVDPLQSNDDDDSSSTE-------------FTMELAGTDSGNALRSYSLNMSTDFIPMSV
        ++T++++RA+WL+KCP  V+R+LS+S+ + S RPVAKVIVS+DPL +NDDDDS                 FTMELAGT+SGNA + YS++MS DFIPM+V
Subjt:  IETNKADRAMWLLKCPQAVTRALSNSTDAPS-RPVAKVIVSVDPLQSNDDDDSSSTE-------------FTMELAGTDSGNALRSYSLNMSTDFIPMSV

Query:  FSESSQGKFTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTGGHMRPMPGMDVLSFGAAEKKKVVTKGSETKRLRKERGELEKII
        FSESSQG+  VEGKILNKFDMKPH++N++ YGKLCRERT K MTKSRQIQVID+  G HMRPMPGM + S                              
Subjt:  FSESSQGKFTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTGGHMRPMPGMDVLSFGAAEKKKVVTKGSETKRLRKERGELEKII

Query:  FRLFERQAYWTSKQLIQETDQPEQYMKEILKDLCVYNNKGVHQGTYELKPEYK-------------ESCWFQVQMDNFLGGFTEDSTTFNQDIQRCPFLR
                               Q++K++LKDLCVYNNKG +QGTYELKPEYK              + W    +D   G  T DS    QDI RCPFLR
Subjt:  FRLFERQAYWTSKQLIQETDQPEQYMKEILKDLCVYNNKGVHQGTYELKPEYK-------------ESCWFQVQMDNFLGGFTEDSTTFNQDIQRCPFLR

Query:  NINDPTNFSFSSSMAFPIPVRGAKGPFFEDGPNFDMAFRLFHGRDGVVPLSGR-SMHLESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSEKW
        NIN+PTNFSF+S MAFP+PVRGAKGP FEDGPNFDMAFR+FHGRDGVVPLSGR  +  E+ E + AP QFNPLAAKAATISLS FG GGPFSFDSFS KW
Subjt:  NINDPTNFSFSSSMAFPIPVRGAKGPFFEDGPNFDMAFRLFHGRDGVVPLSGR-SMHLESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSEKW

Query:  KNQKKKFESSRR-----GGNSQHEAAGNEWLQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGL
        KNQK++ +SS++      G+S+HEA  NEWL+ GNCPIAKSYRAVS VIP+VAKALQPPPGMKFKCPPA+VAARAALA+TA  KNLRPQPLPAK+L IG+
Subjt:  KNQKKKFESSRR-----GGNSQHEAAGNEWLQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGL

Query:  LGMAANVPLGIWRDHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLKAVASKKLTLQDS----LAESTLLPVVTMK
        LGMAANVPLGIWR+HTEKFSPSWFAAVHAAVPFIAMLRKS+LMPK+AMAFTIAAS+LGQVIGSRAER R+K VAS++L L ++     ++ T + VV ++
Subjt:  LGMAANVPLGIWRDHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLKAVASKKLTLQDS----LAESTLLPVVTMK

Query:  DGHCGDIESWNPVTTLQVAGPPSP
         GHCG    W+PV  LQVAGP SP
Subjt:  DGHCGDIESWNPVTTLQVAGPPSP

KAG6602326.1 General transcription factor IIF subunit 2, partial [Cucurbita argyrosperma subsp. sororia]3.6e-30880.14Show/hide
Query:  MDVDGKSIERCSSRKLQSNNNIVVGTDFIETNKADRAMWLLKCPQAVTRALSNSTDAPSRPVAKVIVSVDPLQSNDDDDSSSTEFTMELAGTDSGNALRS
        MD D K I+R SSRKL SN++ V GTDF+ETNKADRAMWLLKCPQ VTRALSNS D PSRPVAKVIVSVDP+QSNDDDDSSSTEFTMELA TDSG ALR+
Subjt:  MDVDGKSIERCSSRKLQSNNNIVVGTDFIETNKADRAMWLLKCPQAVTRALSNSTDAPSRPVAKVIVSVDPLQSNDDDDSSSTEFTMELAGTDSGNALRS

Query:  YSLNMSTDFIPMSVFSESSQGKFTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTGGHMRPMPGMDVLSFGAAEKKKVVTKGSET
        YSLNMSTDFIPMSVFSES+QGKFTVEGKILNKFDMKPHDQNL+ YGKLCRERTHKSMTKSRQIQVIDHVTGGHMRPMPGMDVLSFGAAEKKKVV+KGSE 
Subjt:  YSLNMSTDFIPMSVFSESSQGKFTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTGGHMRPMPGMDVLSFGAAEKKKVVTKGSET

Query:  KRLRKERGELEKIIFRLFERQAYWTSKQLIQETDQPEQYMKEILKDLCVYNNKGVHQGTYELKPEYKESC------------------------------
        KRLRKERGELEKIIFRLFERQ YWTSKQLIQETDQPEQYMKEILKDLCVYNNKGVHQGTYELKPEYKE+C                              
Subjt:  KRLRKERGELEKIIFRLFERQAYWTSKQLIQETDQPEQYMKEILKDLCVYNNKGVHQGTYELKPEYKESC------------------------------

Query:  ----------------------------------------WFQVQMDNFLGGFTEDSTTFNQDIQRCPFLRNINDPTNFSFSSSMAFPIPVRGAKGPFFE
                                                  +V+MDNFLGGFTE       DIQRCPFLRNIN+PT+FSFSSSMAFP+PVRGAKGP FE
Subjt:  ----------------------------------------WFQVQMDNFLGGFTEDSTTFNQDIQRCPFLRNINDPTNFSFSSSMAFPIPVRGAKGPFFE

Query:  DGPNFDMAFRLFHGRDGVVPLSGRSMHLESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSEKWKNQKKKFESSR-----RGGNSQHEAAGNEW
        DGPNFDMAFRLFHGRDGVVPLSGRSMH ESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFS+KW+NQKKKFESS+     +GGNS HEA  NEW
Subjt:  DGPNFDMAFRLFHGRDGVVPLSGRSMHLESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSEKWKNQKKKFESSR-----RGGNSQHEAAGNEW

Query:  LQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWRDHTEKFSPSWFAAVHAA
        LQ+GNCPIAKSYRAVS+VIPLVAKALQPPPGMKF+CPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWR+HTEKFSPSWFAAVHAA
Subjt:  LQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWRDHTEKFSPSWFAAVHAA

Query:  VPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLKAVASKKLTLQDSLAESTLLPVVTMKDGHCGDIESWNPVTTLQVAGPPSPTKVPC
        VPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAER RLKAVAS+KLTLQDS+++ST  PVVT+K+GHCGDIESWNPVTTLQVAGPPSPT +PC
Subjt:  VPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLKAVASKKLTLQDSLAESTLLPVVTMKDGHCGDIESWNPVTTLQVAGPPSPTKVPC

PQP99380.1 hypothetical protein Pyn_11934 [Prunus yedoensis var. nudiflora]4.4e-22167.09Show/hide
Query:  SNNNIVVGTDFIETNKADRAMWLLKCPQAVTRAL------------SNSTDAPSRPVAKVIVSVDPLQSNDDDDSSSTEFTMELAGTDSGNALRSYSLNM
        SNNN   G   +ET +A+RA+WL+KCP  V ++L            S S    SRPVAKV++S+DPL SN  DDSSS +FTMELA T+SGN  + YSL+M
Subjt:  SNNNIVVGTDFIETNKADRAMWLLKCPQAVTRAL------------SNSTDAPSRPVAKVIVSVDPLQSNDDDDSSSTEFTMELAGTDSGNALRSYSLNM

Query:  STDFIPMSVFSESSQGKFTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTGGHMRPMPGM-DVLSFGAAEKKKVVTKGSETKRLR
        S DFIPMSVFSE SQGK +VEGKILNKFDMKPH++NLE YGKLCRERT K M K+R+IQVID+  G HMRPMPGM   ++ G +EKKK   KG   ++ +
Subjt:  STDFIPMSVFSESSQGKFTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTGGHMRPMPGM-DVLSFGAAEKKKVVTKGSETKRLR

Query:  KERGELEKIIFRLFERQAYWTSKQLIQETDQPEQYMKEILKDLCVYNNKGVHQGTYELKPEYKESCWFQVQMDNFLGGFTEDSTTFNQDIQRCPFLRNIN
        ++RGE+E I+F+LFERQ  WT +QLIQETDQPEQ++K+ILK+LCVYNNKG +QGT           +  + MD+F     ED      DI RCPFLRNIN
Subjt:  KERGELEKIIFRLFERQAYWTSKQLIQETDQPEQYMKEILKDLCVYNNKGVHQGTYELKPEYKESCWFQVQMDNFLGGFTEDSTTFNQDIQRCPFLRNIN

Query:  DPTNFSFSSSMAFPIPVRGAKGPFFEDGPNFDMAFRLFHGRDGVVPLSGRS-MHLESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSEKWKNQ
        +PTNFSF+SS+AFP+P RG KGP FEDGPNFDMAFRLFHG DGVVPLSGRS +H   VE +PAPS FNPLAAKAATISLSSF  GGPFSFD+FSEKWKNQ
Subjt:  DPTNFSFSSSMAFPIPVRGAKGPFFEDGPNFDMAFRLFHGRDGVVPLSGRS-MHLESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSEKWKNQ

Query:  KKKFESSR-----RGGNSQHEAAGNEWLQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGM
        K+K  SS+     +GGNS HEA  NEWLQ G+CPIAKSYRAVS VIPLVAKA QPPPGMK KCPPA+VAARAALA+TAFAKNLRPQPLP KVL IG LGM
Subjt:  KKKFESSR-----RGGNSQHEAAGNEWLQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGM

Query:  AANVPLGIWRDHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLKAVASKKLTLQDS-LAESTLLPVVTMKDGHCGD
        AANVPLG+WR+HT+KFSPSWFAAVHAAVPFIA+LRKS+LMPKSAMAFTIAASVLGQVIGSRAER RLKAV +KKL+L ++    ++ L V+  K GHC D
Subjt:  AANVPLGIWRDHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLKAVASKKLTLQDS-LAESTLLPVVTMKDGHCGD

Query:  IESWNPVTTLQVAGPPSPTKVPC
        IE WN V  LQV  P S T V C
Subjt:  IESWNPVTTLQVAGPPSPTKVPC

RXH94290.1 hypothetical protein DVH24_023974 [Malus domestica]4.1e-18765.1Show/hide
Query:  LNMSTDFIPMSVFSESSQGKFTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTGGHMRPMPGMDV--LSFGAAEKKKVVTKGSET
        +++  D +PMSVFSESSQG  +VEG+IL KFDMKPH +NLE YGKLCRERT K M KSR+IQV+++  G HMRP PGM +  +S GA+EKKK+ TKGS+ 
Subjt:  LNMSTDFIPMSVFSESSQGKFTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTGGHMRPMPGMDV--LSFGAAEKKKVVTKGSET

Query:  KRLRKERGELEKIIFRLFERQAYWTSKQLIQETDQPEQYMKEILKDLCVYNNKGVHQGTYELKPEYKESCWFQVQMDNFLG--GFTEDSTTFNQDIQRCP
        KR RK+RGE+E I+F LFE++   T +QLIQET+QPEQ++K+ILKDLC+YNNKG +QGT           +  ++MD+F      ++D      DI +CP
Subjt:  KRLRKERGELEKIIFRLFERQAYWTSKQLIQETDQPEQYMKEILKDLCVYNNKGVHQGTYELKPEYKESCWFQVQMDNFLG--GFTEDSTTFNQDIQRCP

Query:  FLRNINDPTNFSFSSSMAFPIPVRGAKGPFFEDGPNFDMAFRLFHGRDGVVPLSGRS-MHLESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFS
        FLRNIN+PTNFSF+SS+AFP+P RGAKGP FEDGPNFDMAFRLFHG DGVVPLSG+S +H + VE +PAP  FNPLAAKAATISLSSFG GGPFSFD+F 
Subjt:  FLRNINDPTNFSFSSSMAFPIPVRGAKGPFFEDGPNFDMAFRLFHGRDGVVPLSGRS-MHLESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFS

Query:  EKWKNQKKKFESSR-----RGGNSQHEAAGNEWLQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARAALAKTAFAKNLRPQPLPAKVLA
        +KWKNQ+KK  SS+     +GGNS HEA GNEWLQ GNCPIAKSYRAVS VIPLVAKA QPPPGMK KCPPA+VAARAAL++TAFAKNLRPQPLPAKVL 
Subjt:  EKWKNQKKKFESSR-----RGGNSQHEAAGNEWLQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARAALAKTAFAKNLRPQPLPAKVLA

Query:  IGLLGMAANVPLGIWRDHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLK-AVASKKLTLQDSLA-----------
        IG +GMAANVPLGIWR+HT+KFS SWFAAVHAAVPFIA+LRKS+LMPKSAMAFTIAASVLGQVIGSRAER RLK AVASK   L ++ A           
Subjt:  IGLLGMAANVPLGIWRDHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLK-AVASKKLTLQDSLA-----------

Query:  ---------------ESTLLPVVTMKDGHCGDIESWNPVTTLQVAGPPSPTKV
                       +++ L V+  K GHC  IE WN V +LQVA P S   V
Subjt:  ---------------ESTLLPVVTMKDGHCGDIESWNPVTTLQVAGPPSPTKV

XP_038891089.1 uncharacterized protein LOC120080490 [Benincasa hispida]2.1e-18394Show/hide
Query:  MDNFLGGFTEDSTTFNQDIQRCPFLRNINDPTNFSFSSSMAFPIPVRGAKGPFFEDGPNFDMAFRLFHGRDGVVPLSGRSMHLESVELKPAPSQFNPLAA
        MDNFLGGFTE STTFNQDIQRCPFLRNIN+PTNFSFSSSMAFPIPVRGAKGP FEDGPNFDMAFRLFHG+DGVVPLSGRSMH ESVELKP PSQFNPLAA
Subjt:  MDNFLGGFTEDSTTFNQDIQRCPFLRNINDPTNFSFSSSMAFPIPVRGAKGPFFEDGPNFDMAFRLFHGRDGVVPLSGRSMHLESVELKPAPSQFNPLAA

Query:  KAATISLSSFGPGGPFSFDSFSEKWKNQKKKFESSRR-----GGNSQHEAAGNEWLQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARA
        KAATISLSSFGPGGPFSFDSFSEKWKNQKKKFESS +     GGNSQHEA GNEWLQ+GNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARA
Subjt:  KAATISLSSFGPGGPFSFDSFSEKWKNQKKKFESSRR-----GGNSQHEAAGNEWLQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARA

Query:  ALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWRDHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLKAVAS
        ALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWR+HTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAER RLKAVAS
Subjt:  ALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWRDHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLKAVAS

Query:  KKLTLQDSLAESTLLPVVTMKDGHCGDIESWNPVTTLQVAGPPSPTKVPC
        KKLTLQDSLAEST+LP+VT+K+GHCGDIESWNPVTTLQVAGPPSPTKVPC
Subjt:  KKLTLQDSLAESTLLPVVTMKDGHCGDIESWNPVTTLQVAGPPSPTKVPC

TrEMBL top hitse value%identityAlignment
A0A1S3CPS7 uncharacterized protein LOC1035033073.3e-18293.71Show/hide
Query:  MDNFLGGFTEDSTTFNQDIQRCPFLRNINDPTNFSFSSSMAFPIPVRGAKGPFFEDGPNFDMAFRLFHGRDGVVPLSGRSMHLESVELKPAPSQFNPLAA
        MD+FLGGFTEDSTTFNQDIQRCPFLRNIN+PTNFSFSSSMAFPIPVRGAKGP FEDGPNFDMAFRLFHGRDGVVPLSGRSMH  SVELKPAPSQFNPLAA
Subjt:  MDNFLGGFTEDSTTFNQDIQRCPFLRNINDPTNFSFSSSMAFPIPVRGAKGPFFEDGPNFDMAFRLFHGRDGVVPLSGRSMHLESVELKPAPSQFNPLAA

Query:  KAATISLSSFGPGGPFSFDSFSEKWKNQKKKFESSRR-----GGNSQHEAAGNEWLQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARA
        KAATISLSSFGPGGPFSF SFSEKWKNQKKKFESS++     GGNSQHEA GNEWLQ+GNCPIAKSYRAVSSVIPLVAKALQPPPGMKF+CPPAVVAARA
Subjt:  KAATISLSSFGPGGPFSFDSFSEKWKNQKKKFESSRR-----GGNSQHEAAGNEWLQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARA

Query:  ALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWRDHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLKAVAS
        ALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWR+HTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAER RLKAVAS
Subjt:  ALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWRDHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLKAVAS

Query:  KKLTLQDSLAESTLLPVVTMKDGHCGDIESWNPVTTLQVAGPPSPTKVPC
        KKLTLQDSLAESTLLPVV MK+GHCGDIESWNPVTTLQVAGP SP KVPC
Subjt:  KKLTLQDSLAESTLLPVVTMKDGHCGDIESWNPVTTLQVAGPPSPTKVPC

A0A314Y4Q1 TFIIF_beta domain-containing protein2.1e-22167.09Show/hide
Query:  SNNNIVVGTDFIETNKADRAMWLLKCPQAVTRAL------------SNSTDAPSRPVAKVIVSVDPLQSNDDDDSSSTEFTMELAGTDSGNALRSYSLNM
        SNNN   G   +ET +A+RA+WL+KCP  V ++L            S S    SRPVAKV++S+DPL SN  DDSSS +FTMELA T+SGN  + YSL+M
Subjt:  SNNNIVVGTDFIETNKADRAMWLLKCPQAVTRAL------------SNSTDAPSRPVAKVIVSVDPLQSNDDDDSSSTEFTMELAGTDSGNALRSYSLNM

Query:  STDFIPMSVFSESSQGKFTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTGGHMRPMPGM-DVLSFGAAEKKKVVTKGSETKRLR
        S DFIPMSVFSE SQGK +VEGKILNKFDMKPH++NLE YGKLCRERT K M K+R+IQVID+  G HMRPMPGM   ++ G +EKKK   KG   ++ +
Subjt:  STDFIPMSVFSESSQGKFTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTGGHMRPMPGM-DVLSFGAAEKKKVVTKGSETKRLR

Query:  KERGELEKIIFRLFERQAYWTSKQLIQETDQPEQYMKEILKDLCVYNNKGVHQGTYELKPEYKESCWFQVQMDNFLGGFTEDSTTFNQDIQRCPFLRNIN
        ++RGE+E I+F+LFERQ  WT +QLIQETDQPEQ++K+ILK+LCVYNNKG +QGT           +  + MD+F     ED      DI RCPFLRNIN
Subjt:  KERGELEKIIFRLFERQAYWTSKQLIQETDQPEQYMKEILKDLCVYNNKGVHQGTYELKPEYKESCWFQVQMDNFLGGFTEDSTTFNQDIQRCPFLRNIN

Query:  DPTNFSFSSSMAFPIPVRGAKGPFFEDGPNFDMAFRLFHGRDGVVPLSGRS-MHLESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSEKWKNQ
        +PTNFSF+SS+AFP+P RG KGP FEDGPNFDMAFRLFHG DGVVPLSGRS +H   VE +PAPS FNPLAAKAATISLSSF  GGPFSFD+FSEKWKNQ
Subjt:  DPTNFSFSSSMAFPIPVRGAKGPFFEDGPNFDMAFRLFHGRDGVVPLSGRS-MHLESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSEKWKNQ

Query:  KKKFESSR-----RGGNSQHEAAGNEWLQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGM
        K+K  SS+     +GGNS HEA  NEWLQ G+CPIAKSYRAVS VIPLVAKA QPPPGMK KCPPA+VAARAALA+TAFAKNLRPQPLP KVL IG LGM
Subjt:  KKKFESSR-----RGGNSQHEAAGNEWLQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGM

Query:  AANVPLGIWRDHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLKAVASKKLTLQDS-LAESTLLPVVTMKDGHCGD
        AANVPLG+WR+HT+KFSPSWFAAVHAAVPFIA+LRKS+LMPKSAMAFTIAASVLGQVIGSRAER RLKAV +KKL+L ++    ++ L V+  K GHC D
Subjt:  AANVPLGIWRDHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLKAVASKKLTLQDS-LAESTLLPVVTMKDGHCGD

Query:  IESWNPVTTLQVAGPPSPTKVPC
        IE WN V  LQV  P S T V C
Subjt:  IESWNPVTTLQVAGPPSPTKVPC

A0A498JFH5 Uncharacterized protein2.0e-18765.1Show/hide
Query:  LNMSTDFIPMSVFSESSQGKFTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTGGHMRPMPGMDV--LSFGAAEKKKVVTKGSET
        +++  D +PMSVFSESSQG  +VEG+IL KFDMKPH +NLE YGKLCRERT K M KSR+IQV+++  G HMRP PGM +  +S GA+EKKK+ TKGS+ 
Subjt:  LNMSTDFIPMSVFSESSQGKFTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTGGHMRPMPGMDV--LSFGAAEKKKVVTKGSET

Query:  KRLRKERGELEKIIFRLFERQAYWTSKQLIQETDQPEQYMKEILKDLCVYNNKGVHQGTYELKPEYKESCWFQVQMDNFLG--GFTEDSTTFNQDIQRCP
        KR RK+RGE+E I+F LFE++   T +QLIQET+QPEQ++K+ILKDLC+YNNKG +QGT           +  ++MD+F      ++D      DI +CP
Subjt:  KRLRKERGELEKIIFRLFERQAYWTSKQLIQETDQPEQYMKEILKDLCVYNNKGVHQGTYELKPEYKESCWFQVQMDNFLG--GFTEDSTTFNQDIQRCP

Query:  FLRNINDPTNFSFSSSMAFPIPVRGAKGPFFEDGPNFDMAFRLFHGRDGVVPLSGRS-MHLESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFS
        FLRNIN+PTNFSF+SS+AFP+P RGAKGP FEDGPNFDMAFRLFHG DGVVPLSG+S +H + VE +PAP  FNPLAAKAATISLSSFG GGPFSFD+F 
Subjt:  FLRNINDPTNFSFSSSMAFPIPVRGAKGPFFEDGPNFDMAFRLFHGRDGVVPLSGRS-MHLESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFS

Query:  EKWKNQKKKFESSR-----RGGNSQHEAAGNEWLQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARAALAKTAFAKNLRPQPLPAKVLA
        +KWKNQ+KK  SS+     +GGNS HEA GNEWLQ GNCPIAKSYRAVS VIPLVAKA QPPPGMK KCPPA+VAARAAL++TAFAKNLRPQPLPAKVL 
Subjt:  EKWKNQKKKFESSR-----RGGNSQHEAAGNEWLQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARAALAKTAFAKNLRPQPLPAKVLA

Query:  IGLLGMAANVPLGIWRDHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLK-AVASKKLTLQDSLA-----------
        IG +GMAANVPLGIWR+HT+KFS SWFAAVHAAVPFIA+LRKS+LMPKSAMAFTIAASVLGQVIGSRAER RLK AVASK   L ++ A           
Subjt:  IGLLGMAANVPLGIWRDHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLK-AVASKKLTLQDSLA-----------

Query:  ---------------ESTLLPVVTMKDGHCGDIESWNPVTTLQVAGPPSPTKV
                       +++ L V+  K GHC  IE WN V +LQVA P S   V
Subjt:  ---------------ESTLLPVVTMKDGHCGDIESWNPVTTLQVAGPPSPTKV

A0A5A7TBC4 Uncharacterized protein3.3e-18293.71Show/hide
Query:  MDNFLGGFTEDSTTFNQDIQRCPFLRNINDPTNFSFSSSMAFPIPVRGAKGPFFEDGPNFDMAFRLFHGRDGVVPLSGRSMHLESVELKPAPSQFNPLAA
        MD+FLGGFTEDSTTFNQDIQRCPFLRNIN+PTNFSFSSSMAFPIPVRGAKGP FEDGPNFDMAFRLFHGRDGVVPLSGRSMH  SVELKPAPSQFNPLAA
Subjt:  MDNFLGGFTEDSTTFNQDIQRCPFLRNINDPTNFSFSSSMAFPIPVRGAKGPFFEDGPNFDMAFRLFHGRDGVVPLSGRSMHLESVELKPAPSQFNPLAA

Query:  KAATISLSSFGPGGPFSFDSFSEKWKNQKKKFESSRR-----GGNSQHEAAGNEWLQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARA
        KAATISLSSFGPGGPFSF SFSEKWKNQKKKFESS++     GGNSQHEA GNEWLQ+GNCPIAKSYRAVSSVIPLVAKALQPPPGMKF+CPPAVVAARA
Subjt:  KAATISLSSFGPGGPFSFDSFSEKWKNQKKKFESSRR-----GGNSQHEAAGNEWLQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARA

Query:  ALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWRDHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLKAVAS
        ALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWR+HTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAER RLKAVAS
Subjt:  ALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWRDHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLKAVAS

Query:  KKLTLQDSLAESTLLPVVTMKDGHCGDIESWNPVTTLQVAGPPSPTKVPC
        KKLTLQDSLAESTLLPVV MK+GHCGDIESWNPVTTLQVAGP SP KVPC
Subjt:  KKLTLQDSLAESTLLPVVTMKDGHCGDIESWNPVTTLQVAGPPSPTKVPC

A5BGJ2 TFIIF_beta domain-containing protein7.6e-20362.02Show/hide
Query:  IETNKADRAMWLLKCPQAVTRALSNSTDAPS-RPVAKVIVSVDPLQSNDDDDSSSTE-------------FTMELAGTDSGNALRSYSLNMSTDFIPMSV
        ++T++++RA+WL+KCP  V+R+LS+S+ + S RPVAKVIVS+DPL +NDDDDS                 FTMELAGT+SGNA + YS++MS DFIPM+V
Subjt:  IETNKADRAMWLLKCPQAVTRALSNSTDAPS-RPVAKVIVSVDPLQSNDDDDSSSTE-------------FTMELAGTDSGNALRSYSLNMSTDFIPMSV

Query:  FSESSQGKFTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTGGHMRPMPGMDVLSFGAAEKKKVVTKGSETKRLRKERGELEKII
        FSESSQG+  VEGKILNKFDMKPH++N++ YGKLCRERT K MTKSRQIQVID+  G HMRPMPGM + S                              
Subjt:  FSESSQGKFTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTGGHMRPMPGMDVLSFGAAEKKKVVTKGSETKRLRKERGELEKII

Query:  FRLFERQAYWTSKQLIQETDQPEQYMKEILKDLCVYNNKGVHQGTYELKPEYK-------------ESCWFQVQMDNFLGGFTEDSTTFNQDIQRCPFLR
                               Q++K++LKDLCVYNNKG +QGTYELKPEYK              + W    +D   G  T DS    QDI RCPFLR
Subjt:  FRLFERQAYWTSKQLIQETDQPEQYMKEILKDLCVYNNKGVHQGTYELKPEYK-------------ESCWFQVQMDNFLGGFTEDSTTFNQDIQRCPFLR

Query:  NINDPTNFSFSSSMAFPIPVRGAKGPFFEDGPNFDMAFRLFHGRDGVVPLSGR-SMHLESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSEKW
        NIN+PTNFSF+S MAFP+PVRGAKGP FEDGPNFDMAFR+FHGRDGVVPLSGR  +  E+ E + AP QFNPLAAKAATISLS FG GGPFSFDSFS KW
Subjt:  NINDPTNFSFSSSMAFPIPVRGAKGPFFEDGPNFDMAFRLFHGRDGVVPLSGR-SMHLESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSEKW

Query:  KNQKKKFESSRR-----GGNSQHEAAGNEWLQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGL
        KNQK++ +SS++      G+S+HEA  NEWL+ GNCPIAKSYRAVS VIP+VAKALQPPPGMKFKCPPA+VAARAALA+TA  KNLRPQPLPAK+L IG+
Subjt:  KNQKKKFESSRR-----GGNSQHEAAGNEWLQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGL

Query:  LGMAANVPLGIWRDHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLKAVASKKLTLQDS----LAESTLLPVVTMK
        LGMAANVPLGIWR+HTEKFSPSWFAAVHAAVPFIAMLRKS+LMPK+AMAFTIAAS+LGQVIGSRAER R+K VAS++L L ++     ++ T + VV ++
Subjt:  LGMAANVPLGIWRDHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLKAVASKKLTLQDS----LAESTLLPVVTMK

Query:  DGHCGDIESWNPVTTLQVAGPPSP
         GHCG    W+PV  LQVAGP SP
Subjt:  DGHCGDIESWNPVTTLQVAGPPSP

SwissProt top hitse value%identityAlignment
P13984 General transcription factor IIF subunit 29.7e-1427.35Show/hide
Query:  KADRAMWLLKCPQAVTRALSNSTDAPSRPVAKVIVSVDPLQSNDDDDSSSTEFTM--ELAGT-DSGNALRSYSLNMSTDFI-------PMSVFSESSQGK
        K +  +WL+K P+ +++  + ++             V  L+       +   FT+  +LA   D G    S S      F+        ++VF+ESS  K
Subjt:  KADRAMWLLKCPQAVTRALSNSTDAPSRPVAKVIVSVDPLQSNDDDDSSSTEFTM--ELAGT-DSGNALRSYSLNMSTDFI-------PMSVFSESSQGK

Query:  FTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTGGHMRPMPGMDVLSFGAAEKKKVVTKGSETKRLRKERGELEKIIFRLFERQA
         ++EG ++ + + +P     E Y +L R +  +S    R  Q +D V   + +P+       +    ++K   K  + KR R ++  +  ++F  FE+  
Subjt:  FTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTGGHMRPMPGMDVLSFGAAEKKKVVTKGSETKRLRKERGELEKIIFRLFERQA

Query:  YWTSKQLIQETDQPEQYMKEILKDLCVYNNKGVHQGTYELKPEYK
        Y+  K L+  T QP  Y+KEILK++ V N KG+H+ T+ELKPEY+
Subjt:  YWTSKQLIQETDQPEQYMKEILKDLCVYNNKGVHQGTYELKPEYK

P41900 General transcription factor IIF subunit 29.7e-1427.78Show/hide
Query:  IETNKADRAMWLLKCPQAVT----RALSNSTDAPSR------PVAKVIVSVDP-LQSNDDDDSSSTEFTMELAGTDSGNALRSYSLNMSTDFIPMSVFS-
        ++ + A R +WL+K P+ +     +A +N      R        A+V +S+ P + + D ++   TE  ++++   +   L  +S    +D    S  S 
Subjt:  IETNKADRAMWLLKCPQAVT----RALSNSTDAPSR------PVAKVIVSVDP-LQSNDDDDSSSTEFTMELAGTDSGNALRSYSLNMSTDFIPMSVFS-

Query:  -ESSQGKFTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTGGHMRPMPGMDVLSFGAAEKKKVVTKGSETKRLRKERGELEKIIF
         +    K  +EG+I+ K + +P   N   Y KL  E   K+    R++Q ID +   + +P+      +    E+KK     +E K+ R ++  +  ++F
Subjt:  -ESSQGKFTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTGGHMRPMPGMDVLSFGAAEKKKVVTKGSETKRLRKERGELEKIIF

Query:  RLFERQAYWTSKQLIQETDQPEQYMKEILKDLCVYNNKGVHQGTYELKPEYK
          FE+  Y+  K L++ T+QP  Y+KEILKD+C YN K  H+  +ELK EY+
Subjt:  RLFERQAYWTSKQLIQETDQPEQYMKEILKDLCVYNNKGVHQGTYELKPEYK

Q01750 General transcription factor IIF subunit 22.0e-1428.57Show/hide
Query:  KADRAMWLLKCPQAVTRALSNSTDAPSRPVAKVIVSVDPLQSNDDDDSSSTEFTM--ELAGT-DSGNALRSYSLNMSTDFI-------PMSVFSESSQGK
        K +  +WL+K P+ +++  +    AP R        V  L+   +   +   FT+  +LA   D G    S S      F+        ++VF+ESS  K
Subjt:  KADRAMWLLKCPQAVTRALSNSTDAPSRPVAKVIVSVDPLQSNDDDDSSSTEFTM--ELAGT-DSGNALRSYSLNMSTDFI-------PMSVFSESSQGK

Query:  FTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTGGHMRPMPGMDVLSFGAAEKKKVVTKGSETKRLRKERGELEKIIFRLFERQA
         ++EG ++ + + +P     E Y KL R +  +S    R  Q  D V   + +P+       +    ++K   K  + KR R ++  +  ++F  FE+  
Subjt:  FTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTGGHMRPMPGMDVLSFGAAEKKKVVTKGSETKRLRKERGELEKIIFRLFERQA

Query:  YWTSKQLIQETDQPEQYMKEILKDLCVYNNKGVHQGTYELKPEYK
        Y+  K L+  T QP  Y+KEILK++ + N KG+H+ T+ELKPEY+
Subjt:  YWTSKQLIQETDQPEQYMKEILKDLCVYNNKGVHQGTYELKPEYK

Q2T9L9 General transcription factor IIF subunit 22.0e-1428.16Show/hide
Query:  KADRAMWLLKCPQAVTRALSNSTDAPSRPVAKVIVSVDPLQSNDDDDSSSTEFTM--ELAGT-DSGNALRSYSLNMSTDFI-------PMSVFSESSQGK
        K +  +WL+K P+ +++  +    AP R        V  L+   +   +   FT+  +LA   D G    S S      F+        ++VF+ESS  K
Subjt:  KADRAMWLLKCPQAVTRALSNSTDAPSRPVAKVIVSVDPLQSNDDDDSSSTEFTM--ELAGT-DSGNALRSYSLNMSTDFI-------PMSVFSESSQGK

Query:  FTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTGGHMRPMPGMDVLSFGAAEKKKVVTKGSETKRLRKERGELEKIIFRLFERQA
         ++EG ++ + + +P     E Y +L R +  +S    R  Q +D V   + +P+       +    ++K   K  + KR R ++  +  ++F  FE+  
Subjt:  FTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTGGHMRPMPGMDVLSFGAAEKKKVVTKGSETKRLRKERGELEKIIFRLFERQA

Query:  YWTSKQLIQETDQPEQYMKEILKDLCVYNNKGVHQGTYELKPEYK
        Y+  K L+  T QP  Y+K+ILK++ V N KG+H+ T+ELKPEY+
Subjt:  YWTSKQLIQETDQPEQYMKEILKDLCVYNNKGVHQGTYELKPEYK

Q8R0A0 General transcription factor IIF subunit 23.3e-1427.35Show/hide
Query:  KADRAMWLLKCPQAVTRALSNSTDAPSRPVAKVIVSVDPLQSNDDDDSSSTEFTM--ELAGT-DSGNALRSYSLNMSTDFI-------PMSVFSESSQGK
        K +  +WL+K P+ +++  + ++             V  L+   +   +   FT+  +LA   D G    S S      F+        ++VF+ESS  K
Subjt:  KADRAMWLLKCPQAVTRALSNSTDAPSRPVAKVIVSVDPLQSNDDDDSSSTEFTM--ELAGT-DSGNALRSYSLNMSTDFI-------PMSVFSESSQGK

Query:  FTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTGGHMRPMPGMDVLSFGAAEKKKVVTKGSETKRLRKERGELEKIIFRLFERQA
         ++EG ++ + + +P     E Y KL R +  +S    R  Q +D V   + +P+       +    ++K   K  + KR R ++  +  ++F  FE+  
Subjt:  FTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTGGHMRPMPGMDVLSFGAAEKKKVVTKGSETKRLRKERGELEKIIFRLFERQA

Query:  YWTSKQLIQETDQPEQYMKEILKDLCVYNNKGVHQGTYELKPEYK
        Y+  K L+  T QP  Y+KEILK++ + N KG+H+ T+ELKPEY+
Subjt:  YWTSKQLIQETDQPEQYMKEILKDLCVYNNKGVHQGTYELKPEYK

Arabidopsis top hitse value%identityAlignment
AT4G25030.1 unknown protein3.2e-12167.63Show/hide
Query:  MDNFLGGFTEDSTTFNQDIQRCPFLRNINDPTNFSFSSSMAFPIPVRGAKGPFFEDGPNFDMAFRLFHGRDGVVPLSGRSMHLESVELKPAPSQFNPLAA
        MDN  G   E+      +I RCPFLRNIN+PTN SFSSS+ FPIP R  KGP FEDGPNFD AFRLFHG+DGVVPLS  +    +   KP P  F+PLAA
Subjt:  MDNFLGGFTEDSTTFNQDIQRCPFLRNINDPTNFSFSSSMAFPIPVRGAKGPFFEDGPNFDMAFRLFHGRDGVVPLSGRSMHLESVELKPAPSQFNPLAA

Query:  KAATISLSSFGPGGPFSFDSFSEKWKNQKKKFESSR-RGGNSQHEAAGNEWLQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARAALAK
        KAATISLSSFG GGPF FD+FS+ +KNQKKK +SS+ +GGN  HEA G+EWL+ GNCPIAKSYRAVS V PLVAK LQPPPGMKFKCP A+V ARAA++K
Subjt:  KAATISLSSFGPGGPFSFDSFSEKWKNQKKKFESSR-RGGNSQHEAAGNEWLQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARAALAK

Query:  TAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWRDHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLKAVASKKLT
        T FAKNLRPQPLPAKVL IG+LGMA NVPLG+WR+HTEKFS SWF A+HAAVPFI +LRKS+LMPK+AM FTIAASVLGQVIGSRAER RLK+VA KKLT
Subjt:  TAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWRDHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLKAVASKKLT

Query:  LQ----DSLAESTLLPVVTMKDGHCGD--IESWNPVTTLQVAGPPS
        L+     S+    +       DG CGD  +  WNP+  L VA P S
Subjt:  LQ----DSLAESTLLPVVTMKDGHCGD--IESWNPVTTLQVAGPPS

AT4G25030.2 unknown protein3.2e-12167.63Show/hide
Query:  MDNFLGGFTEDSTTFNQDIQRCPFLRNINDPTNFSFSSSMAFPIPVRGAKGPFFEDGPNFDMAFRLFHGRDGVVPLSGRSMHLESVELKPAPSQFNPLAA
        MDN  G   E+      +I RCPFLRNIN+PTN SFSSS+ FPIP R  KGP FEDGPNFD AFRLFHG+DGVVPLS  +    +   KP P  F+PLAA
Subjt:  MDNFLGGFTEDSTTFNQDIQRCPFLRNINDPTNFSFSSSMAFPIPVRGAKGPFFEDGPNFDMAFRLFHGRDGVVPLSGRSMHLESVELKPAPSQFNPLAA

Query:  KAATISLSSFGPGGPFSFDSFSEKWKNQKKKFESSR-RGGNSQHEAAGNEWLQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARAALAK
        KAATISLSSFG GGPF FD+FS+ +KNQKKK +SS+ +GGN  HEA G+EWL+ GNCPIAKSYRAVS V PLVAK LQPPPGMKFKCP A+V ARAA++K
Subjt:  KAATISLSSFGPGGPFSFDSFSEKWKNQKKKFESSR-RGGNSQHEAAGNEWLQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARAALAK

Query:  TAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWRDHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLKAVASKKLT
        T FAKNLRPQPLPAKVL IG+LGMA NVPLG+WR+HTEKFS SWF A+HAAVPFI +LRKS+LMPK+AM FTIAASVLGQVIGSRAER RLK+VA KKLT
Subjt:  TAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWRDHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLKAVASKKLT

Query:  LQ----DSLAESTLLPVVTMKDGHCGD--IESWNPVTTLQVAGPPS
        L+     S+    +       DG CGD  +  WNP+  L VA P S
Subjt:  LQ----DSLAESTLLPVVTMKDGHCGD--IESWNPVTTLQVAGPPS

AT5G45410.1 unknown protein9.7e-11066.12Show/hide
Query:  IQRCPFLRNINDPTNFSFSSSMAFPIPVRGAKGPFFEDGPNFDMAFRLFHGRDGVVPLSGRSMHLESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSF
        IQ+CPFLRNIN PTN SF SS++FPIPV+G KGP FEDGP FD AF+LFHG+DG+VPLSG +   E  E      QFNPLA K ATISLS+FGPGGPF F
Subjt:  IQRCPFLRNINDPTNFSFSSSMAFPIPVRGAKGPFFEDGPNFDMAFRLFHGRDGVVPLSGRSMHLESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSF

Query:  DSFSEKWKNQKKKFESSRR---GGNSQHEAAGNEWLQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARAALAKTAFAKNLRPQPLPAKV
          FSEKWK Q+KK + S+    G +S+HEA G+EWL+ GNCPIAKS+RA S V+PL++KAL  PPGMK++CP  +VAARAAL+KTA  K+LRPQPLP K+
Subjt:  DSFSEKWKNQKKKFESSRR---GGNSQHEAAGNEWLQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARAALAKTAFAKNLRPQPLPAKV

Query:  LAIGLLGMAANVPLGIWRDHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLKAVASKKLTLQDSLAESTLLPVVT-
        LAI L+GMAANVPLG+WR+HT+KFSP+WF AVHAAVPFIAMLRKS+LMPK+AMA TI AS+LGQVIGSRAER RLKAVA K + +   ++     P  + 
Subjt:  LAIGLLGMAANVPLGIWRDHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLKAVASKKLTLQDSLAESTLLPVVT-

Query:  MKDGHCG
        +  GHCG
Subjt:  MKDGHCG

AT5G45410.2 unknown protein9.7e-11066.12Show/hide
Query:  IQRCPFLRNINDPTNFSFSSSMAFPIPVRGAKGPFFEDGPNFDMAFRLFHGRDGVVPLSGRSMHLESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSF
        IQ+CPFLRNIN PTN SF SS++FPIPV+G KGP FEDGP FD AF+LFHG+DG+VPLSG +   E  E      QFNPLA K ATISLS+FGPGGPF F
Subjt:  IQRCPFLRNINDPTNFSFSSSMAFPIPVRGAKGPFFEDGPNFDMAFRLFHGRDGVVPLSGRSMHLESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSF

Query:  DSFSEKWKNQKKKFESSRR---GGNSQHEAAGNEWLQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARAALAKTAFAKNLRPQPLPAKV
          FSEKWK Q+KK + S+    G +S+HEA G+EWL+ GNCPIAKS+RA S V+PL++KAL  PPGMK++CP  +VAARAAL+KTA  K+LRPQPLP K+
Subjt:  DSFSEKWKNQKKKFESSRR---GGNSQHEAAGNEWLQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARAALAKTAFAKNLRPQPLPAKV

Query:  LAIGLLGMAANVPLGIWRDHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLKAVASKKLTLQDSLAESTLLPVVT-
        LAI L+GMAANVPLG+WR+HT+KFSP+WF AVHAAVPFIAMLRKS+LMPK+AMA TI AS+LGQVIGSRAER RLKAVA K + +   ++     P  + 
Subjt:  LAIGLLGMAANVPLGIWRDHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLKAVASKKLTLQDSLAESTLLPVVT-

Query:  MKDGHCG
        +  GHCG
Subjt:  MKDGHCG

AT5G45410.3 unknown protein9.7e-11066.12Show/hide
Query:  IQRCPFLRNINDPTNFSFSSSMAFPIPVRGAKGPFFEDGPNFDMAFRLFHGRDGVVPLSGRSMHLESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSF
        IQ+CPFLRNIN PTN SF SS++FPIPV+G KGP FEDGP FD AF+LFHG+DG+VPLSG +   E  E      QFNPLA K ATISLS+FGPGGPF F
Subjt:  IQRCPFLRNINDPTNFSFSSSMAFPIPVRGAKGPFFEDGPNFDMAFRLFHGRDGVVPLSGRSMHLESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSF

Query:  DSFSEKWKNQKKKFESSRR---GGNSQHEAAGNEWLQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARAALAKTAFAKNLRPQPLPAKV
          FSEKWK Q+KK + S+    G +S+HEA G+EWL+ GNCPIAKS+RA S V+PL++KAL  PPGMK++CP  +VAARAAL+KTA  K+LRPQPLP K+
Subjt:  DSFSEKWKNQKKKFESSRR---GGNSQHEAAGNEWLQIGNCPIAKSYRAVSSVIPLVAKALQPPPGMKFKCPPAVVAARAALAKTAFAKNLRPQPLPAKV

Query:  LAIGLLGMAANVPLGIWRDHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLKAVASKKLTLQDSLAESTLLPVVT-
        LAI L+GMAANVPLG+WR+HT+KFSP+WF AVHAAVPFIAMLRKS+LMPK+AMA TI AS+LGQVIGSRAER RLKAVA K + +   ++     P  + 
Subjt:  LAIGLLGMAANVPLGIWRDHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAASVLGQVIGSRAERPRLKAVASKKLTLQDSLAESTLLPVVT-

Query:  MKDGHCG
        +  GHCG
Subjt:  MKDGHCG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGTCGACGGCAAATCCATCGAGCGTTGCAGTTCCAGAAAACTCCAAAGCAATAACAACATCGTTGTTGGAACCGACTTCATAGAGACTAACAAGGCGGACAGAGC
AATGTGGCTCTTGAAATGCCCGCAAGCTGTAACTCGCGCCCTTTCTAATTCGACCGATGCCCCTTCCCGTCCGGTCGCTAAGGTCATCGTTTCTGTTGACCCGCTGCAAT
CCAATGACGATGACGATTCTTCCTCTACTGAGTTCACTATGGAATTGGCTGGCACTGACTCTGGAAATGCATTAAGGTCCTACTCCTTGAATATGTCCACAGACTTCATT
CCAATGTCTGTATTTTCCGAGTCATCACAAGGGAAGTTTACGGTCGAAGGGAAAATACTTAACAAATTTGACATGAAGCCACATGATCAGAATCTTGAGAGGTACGGGAA
ACTCTGTCGTGAAAGAACTCACAAGTCTATGACCAAAAGCAGACAGATTCAGGTGATTGATCATGTCACAGGAGGACATATGAGGCCTATGCCAGGGATGGACGTGTTGT
CATTTGGAGCTGCAGAGAAGAAGAAAGTAGTGACCAAGGGATCGGAAACAAAGAGGTTGAGAAAGGAGCGTGGTGAATTGGAGAAGATTATTTTTAGGCTTTTTGAAAGA
CAGGCATATTGGACGTCAAAACAGTTGATTCAAGAAACTGACCAACCTGAACAATATATGAAGGAAATACTTAAAGACCTCTGTGTCTACAACAACAAAGGTGTTCACCA
AGGAACATATGAGCTGAAGCCAGAATACAAGGAATCATGTTGGTTTCAAGTACAAATGGATAATTTCCTTGGAGGCTTTACAGAGGACTCTACAACTTTTAATCAGGATA
TTCAAAGATGTCCATTCTTAAGAAATATTAATGATCCGACAAATTTCTCATTTTCCTCGTCCATGGCATTCCCAATACCTGTACGTGGAGCTAAAGGGCCATTTTTTGAA
GATGGTCCTAATTTTGATATGGCATTTAGGCTTTTCCATGGACGGGATGGAGTTGTTCCACTCTCTGGAAGATCTATGCATCTGGAGAGTGTAGAACTCAAACCAGCCCC
ATCACAGTTCAATCCTTTAGCTGCAAAGGCAGCCACTATTAGCCTCTCATCCTTCGGACCTGGAGGCCCCTTCAGCTTTGATTCATTTTCAGAGAAGTGGAAAAATCAGA
AGAAGAAATTTGAGTCATCCAGAAGAGGTGGAAATTCACAGCATGAAGCAGCGGGCAATGAGTGGTTGCAAATTGGGAACTGCCCAATTGCGAAGTCCTACCGTGCTGTT
AGCAGTGTCATACCGCTTGTTGCAAAAGCTCTTCAACCCCCACCAGGCATGAAATTTAAATGCCCACCAGCAGTAGTTGCAGCAAGAGCAGCTCTAGCAAAGACTGCATT
TGCAAAGAATCTCCGCCCCCAACCGTTGCCAGCAAAAGTACTCGCAATTGGTCTACTAGGCATGGCAGCAAACGTACCTTTAGGAATATGGAGAGACCACACTGAAAAAT
TCTCACCCTCTTGGTTTGCTGCCGTTCATGCAGCTGTTCCATTTATAGCTATGCTAAGGAAATCTATCTTAATGCCCAAATCAGCCATGGCATTTACAATTGCAGCATCA
GTTCTGGGGCAGGTCATCGGCTCGAGAGCAGAGCGACCCCGACTCAAGGCAGTGGCTTCGAAAAAACTAACCCTCCAAGATTCTCTCGCTGAATCCACCCTTTTGCCGGT
GGTTACCATGAAGGATGGTCATTGTGGCGACATAGAAAGCTGGAATCCAGTTACCACTCTTCAAGTAGCAGGTCCACCTTCACCAACCAAGGTGCCCTGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACGTCGACGGCAAATCCATCGAGCGTTGCAGTTCCAGAAAACTCCAAAGCAATAACAACATCGTTGTTGGAACCGACTTCATAGAGACTAACAAGGCGGACAGAGC
AATGTGGCTCTTGAAATGCCCGCAAGCTGTAACTCGCGCCCTTTCTAATTCGACCGATGCCCCTTCCCGTCCGGTCGCTAAGGTCATCGTTTCTGTTGACCCGCTGCAAT
CCAATGACGATGACGATTCTTCCTCTACTGAGTTCACTATGGAATTGGCTGGCACTGACTCTGGAAATGCATTAAGGTCCTACTCCTTGAATATGTCCACAGACTTCATT
CCAATGTCTGTATTTTCCGAGTCATCACAAGGGAAGTTTACGGTCGAAGGGAAAATACTTAACAAATTTGACATGAAGCCACATGATCAGAATCTTGAGAGGTACGGGAA
ACTCTGTCGTGAAAGAACTCACAAGTCTATGACCAAAAGCAGACAGATTCAGGTGATTGATCATGTCACAGGAGGACATATGAGGCCTATGCCAGGGATGGACGTGTTGT
CATTTGGAGCTGCAGAGAAGAAGAAAGTAGTGACCAAGGGATCGGAAACAAAGAGGTTGAGAAAGGAGCGTGGTGAATTGGAGAAGATTATTTTTAGGCTTTTTGAAAGA
CAGGCATATTGGACGTCAAAACAGTTGATTCAAGAAACTGACCAACCTGAACAATATATGAAGGAAATACTTAAAGACCTCTGTGTCTACAACAACAAAGGTGTTCACCA
AGGAACATATGAGCTGAAGCCAGAATACAAGGAATCATGTTGGTTTCAAGTACAAATGGATAATTTCCTTGGAGGCTTTACAGAGGACTCTACAACTTTTAATCAGGATA
TTCAAAGATGTCCATTCTTAAGAAATATTAATGATCCGACAAATTTCTCATTTTCCTCGTCCATGGCATTCCCAATACCTGTACGTGGAGCTAAAGGGCCATTTTTTGAA
GATGGTCCTAATTTTGATATGGCATTTAGGCTTTTCCATGGACGGGATGGAGTTGTTCCACTCTCTGGAAGATCTATGCATCTGGAGAGTGTAGAACTCAAACCAGCCCC
ATCACAGTTCAATCCTTTAGCTGCAAAGGCAGCCACTATTAGCCTCTCATCCTTCGGACCTGGAGGCCCCTTCAGCTTTGATTCATTTTCAGAGAAGTGGAAAAATCAGA
AGAAGAAATTTGAGTCATCCAGAAGAGGTGGAAATTCACAGCATGAAGCAGCGGGCAATGAGTGGTTGCAAATTGGGAACTGCCCAATTGCGAAGTCCTACCGTGCTGTT
AGCAGTGTCATACCGCTTGTTGCAAAAGCTCTTCAACCCCCACCAGGCATGAAATTTAAATGCCCACCAGCAGTAGTTGCAGCAAGAGCAGCTCTAGCAAAGACTGCATT
TGCAAAGAATCTCCGCCCCCAACCGTTGCCAGCAAAAGTACTCGCAATTGGTCTACTAGGCATGGCAGCAAACGTACCTTTAGGAATATGGAGAGACCACACTGAAAAAT
TCTCACCCTCTTGGTTTGCTGCCGTTCATGCAGCTGTTCCATTTATAGCTATGCTAAGGAAATCTATCTTAATGCCCAAATCAGCCATGGCATTTACAATTGCAGCATCA
GTTCTGGGGCAGGTCATCGGCTCGAGAGCAGAGCGACCCCGACTCAAGGCAGTGGCTTCGAAAAAACTAACCCTCCAAGATTCTCTCGCTGAATCCACCCTTTTGCCGGT
GGTTACCATGAAGGATGGTCATTGTGGCGACATAGAAAGCTGGAATCCAGTTACCACTCTTCAAGTAGCAGGTCCACCTTCACCAACCAAGGTGCCCTGCTGA
Protein sequenceShow/hide protein sequence
MDVDGKSIERCSSRKLQSNNNIVVGTDFIETNKADRAMWLLKCPQAVTRALSNSTDAPSRPVAKVIVSVDPLQSNDDDDSSSTEFTMELAGTDSGNALRSYSLNMSTDFI
PMSVFSESSQGKFTVEGKILNKFDMKPHDQNLERYGKLCRERTHKSMTKSRQIQVIDHVTGGHMRPMPGMDVLSFGAAEKKKVVTKGSETKRLRKERGELEKIIFRLFER
QAYWTSKQLIQETDQPEQYMKEILKDLCVYNNKGVHQGTYELKPEYKESCWFQVQMDNFLGGFTEDSTTFNQDIQRCPFLRNINDPTNFSFSSSMAFPIPVRGAKGPFFE
DGPNFDMAFRLFHGRDGVVPLSGRSMHLESVELKPAPSQFNPLAAKAATISLSSFGPGGPFSFDSFSEKWKNQKKKFESSRRGGNSQHEAAGNEWLQIGNCPIAKSYRAV
SSVIPLVAKALQPPPGMKFKCPPAVVAARAALAKTAFAKNLRPQPLPAKVLAIGLLGMAANVPLGIWRDHTEKFSPSWFAAVHAAVPFIAMLRKSILMPKSAMAFTIAAS
VLGQVIGSRAERPRLKAVASKKLTLQDSLAESTLLPVVTMKDGHCGDIESWNPVTTLQVAGPPSPTKVPC