CuGenDBv2

Sgr029820 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr029820
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: transcription initiation factor TFIID subunit 4b
Genome location: tig00153533:1496888..1514457
RNA-Seq Expression: Sgr029820
Synteny: Sgr029820
Gene Ontology terms:
GO:0006352 - DNA-templated transcription, initiation (biological process)
GO:0006413 - translational initiation (biological process)
GO:0005669 - transcription factor TFIID complex (cellular component)
GO:0005840 - ribosome (cellular component)
GO:0003735 - structural constituent of ribosome (molecular function)
GO:0003743 - translation initiation factor activity (molecular function)
GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domains:
IPR001047 - Ribosomal protein S8e
IPR007900 - Transcription initiation factor TFIID component TAF4, C-terminal
IPR009072 - Histone-fold
IPR018283 - Ribosomal protein S8e, conserved site
IPR022003 - RST domain
IPR022309 - Ribosomal protein S8e/ribosomal biogenesis NSA2
IPR045144 - Transcription initiation factor TFIID component TAF4
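
The genome location above follows a contig:start..end pattern. As a minimal sketch, the snippet below parses such a string and computes the length of the gene region. It assumes the coordinates are 1-based and inclusive, which this record does not state; `Location` and `parse_location` are hypothetical helper names introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; hypothetical helper type, not part of CuGenDB."""
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into a Location."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in: {text!r}")
    return Location(contig, start, end)


loc = parse_location("tig00153533:1496888..1514457")
print(loc.contig, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the region for Sgr029820 spans 17570 bp on contig tig00153533.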


Homology
GenBank top hits (e value, % identity, alignment)
XP_022153297.1 transcription initiation factor TFIID subunit 4b [Momordica charantia] (e value: 0.0e+00; % identity: 91.17)
Query:  KDETMHSGAAVEAFQAALNRDIEGDAPAASQTSESDAAFPRGSNSTSGLSLQASSQNENTESHVHQDQNFLLKQEQHSSLMEQERCTSVPENQQQHNATL
        +DETMHSGAAVEAFQAALNRDIEGD PA SQTSESDAAFPRGSNS SGLSLQASSQNENTESHV QDQNFLLKQEQHSSLME ERCTS PENQQ+HNA L
Subjt:  KDETMHSGAAVEAFQAALNRDIEGDAPAASQTSESDAAFPRGSNSTSGLSLQASSQNENTESHVHQDQNFLLKQEQHSSLMEQERCTSVPENQQQHNATL

Query:  LQASKNQPQADHEQGEAEQASVQFSQTAGLQVSDKAPMLVNDSNRMQNRDNESQYLKLQKMSNQQAMVSEQASNPINRSKQVPFASLMPVLMPQLDKDRA
        LQ+SKNQPQADHEQGE EQ  VQFSQTAGLQ S+KAP+LVNDS+RM NRDNESQYLKLQKMSNQQAMVSEQASN +NRSKQVPFASLMPVLMPQLDKDRA
Subjt:  LQASKNQPQADHEQGEAEQASVQFSQTAGLQVSDKAPMLVNDSNRMQNRDNESQYLKLQKMSNQQAMVSEQASNPINRSKQVPFASLMPVLMPQLDKDRA

Query:  MQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPSIGPGAPNFSDARPFAQLHQKGMNSSAVQSYIPSSASQGRSSSS
        MQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQ+QPPPSVRQLSPRMPS+GPGA NFSD RPFAQLHQKGMNS  VQSYIPS ASQGR SS 
Subjt:  MQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPSIGPGAPNFSDARPFAQLHQKGMNSSAVQSYIPSSASQGRSSSS

Query:  YPAIDKNMQSLREVEQRTDGNGNQLTSSSSGTIQERERSSVPVPGLEKQQLHFQQKPFAMYGNSGNYHPYTGSNINPSLLSLKPQPHEGQVKQIPQQGPN
        YP +DKNMQSLREVEQRTDGNGNQLTSSS+GTIQERERSSVPVPGLEKQQLHFQQK FAMYG SG+YH YTGSNIN S LSLKPQPHEGQVKQI QQ PN
Subjt:  YPAIDKNMQSLREVEQRTDGNGNQLTSSSSGTIQERERSSVPVPGLEKQQLHFQQKPFAMYGNSGNYHPYTGSNINPSLLSLKPQPHEGQVKQIPQQGPN

Query:  FDRQVTINDSKRVQGGTVPHLHNNITSQQNPVNWKSSTSKEQNAGPLSSMSYIKQEPSDQVAEQNKTQLSNLQGLSSISSMQAEQVNTTPGIVKDSFEKQ
        FDRQVTINDSKRVQGG VPHLHNN+TSQQNPV+WKS+TSKEQN GPLSSMSYIKQEPSDQVAEQNKTQLSNLQGLSSISS+QAEQVNTTPGI KDSFEKQ
Subjt:  FDRQVTINDSKRVQGGTVPHLHNNITSQQNPVNWKSSTSKEQNAGPLSSMSYIKQEPSDQVAEQNKTQLSNLQGLSSISSMQAEQVNTTPGIVKDSFEKQ

Query:  SSKMAFPTANNVMPPTSTNAANSISSDAISLHDSSAMLSSQVPSTATPGMHNRAPQKKVTIGQKKPLDALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA
        SSKM FPT+NNV+PPTSTN ANSISSDA+S+HDSSAML SQVPS  TPG+ NR  QKK T+GQKKPL+ALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA
Subjt:  SSKMAFPTANNVMPPTSTNAANSISSDAISLHDSSAMLSSQVPSTATPGMHNRAPQKKVTIGQKKPLDALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA

Query:  VSGVNIREEEEQLFSGAKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDVERCLSLCVEERLRGIISNLIRLSKQRVDTEKPRHR
        VSGVNIREEEEQLFS AKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDVE+CLSLCVEERLRGIISNLIRLSKQRVD+EKPRHR
Subjt:  VSGVNIREEEEQLFSGAKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDVERCLSLCVEERLRGIISNLIRLSKQRVDTEKPRHR

Query:  TLITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGSGAAGDKEKDEGRMKSVKANKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK
        TLITSDVRQQI LVNQKAREEWEKKQAEEEKLRKLNDPEDGSG AGDKEKDE R+KSVK NKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK
Subjt:  TLITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGSGAAGDKEKDEGRMKSVKANKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK

Query:  REGGMDSASGSQSGKDAVRKSSSAAGRHGKDNQEADRKGTTRKFGRNQV
        REGG+DSASGSQSGKDA RKSSSAAGRHGKDNQEADRKGT+RKFGRNQ+
Subjt:  REGGMDSASGSQSGKDAVRKSSSAAGRHGKDNQEADRKGTTRKFGRNQV

XP_022952261.1 transcription initiation factor TFIID subunit 4b isoform X1 [Cucurbita moschata] (e value: 0.0e+00; % identity: 88.72)
Query:  KDETMHSGAAVEAFQAALNRDIEGDAPAASQTSESDAAFPRGSNSTSGLSLQASSQNENTESHVHQDQNFLLKQEQHSSLMEQERCTSVPENQQQHNATL
        +DETMHSGAAVEAFQAALNRDIEGD PA SQ  ESDAAFPRGSN  S LSLQA SQNE TESH  QDQNF  KQEQHSSLME ERC+S+PENQQQHNA  
Subjt:  KDETMHSGAAVEAFQAALNRDIEGDAPAASQTSESDAAFPRGSNSTSGLSLQASSQNENTESHVHQDQNFLLKQEQHSSLMEQERCTSVPENQQQHNATL

Query:  LQASKNQPQADHEQGEAEQASVQFSQTAGLQVSDKAPMLVNDSNRMQNRDNESQYLKLQKMSNQQAMVSEQASNPINRSKQVPFASLMPVLMPQLDKDRA
        LQASKNQPQAD EQG+ EQ S QFSQTAGLQ S+KAP+LVNDSNRMQNRDNESQYLKLQKMSNQQ MVSEQA+NP+NRSKQVPFASLMPVLMPQLDKDRA
Subjt:  LQASKNQPQADHEQGEAEQASVQFSQTAGLQVSDKAPMLVNDSNRMQNRDNESQYLKLQKMSNQQAMVSEQASNPINRSKQVPFASLMPVLMPQLDKDRA

Query:  MQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPSIGPGAPNFSDARPFAQLHQKGMNSSAVQSYIPSSASQGRSSSS
        MQLQTLFN+LKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPS+GPGAPNFSD RPF+QLHQKGMNS AVQSYIPSSASQGRSSS 
Subjt:  MQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPSIGPGAPNFSDARPFAQLHQKGMNSSAVQSYIPSSASQGRSSSS

Query:  YPAIDKNMQSLREVEQRTDGNGNQLTSSSSGTIQERERSSVPVPGLEKQQLHFQQKPFAMYGNSGNYHPYTGSNINPSLLSLKPQPHEGQVKQIPQQGPN
        YPA+DKNMQSLREVEQRTDGN NQLTSSSSGTIQERERSS+PVPGLEKQQLHFQQK F MYGNSGNYHPYTGSNIN S LSLKPQPHEGQ KQI QQ PN
Subjt:  YPAIDKNMQSLREVEQRTDGNGNQLTSSSSGTIQERERSSVPVPGLEKQQLHFQQKPFAMYGNSGNYHPYTGSNINPSLLSLKPQPHEGQVKQIPQQGPN

Query:  FDRQVTINDSKRVQGGTVPHLHNNITSQQNPVNWKSSTSKEQNAGPLSSMSYIKQEPSDQVAEQNKTQLSNLQGLSSISSMQAEQVNTTPGIVKDSFEKQ
        FDRQVTIND KRVQGG V HL NN+TSQ +P  WKSSTSKEQN GPLSSMSYIKQEPSDQV+EQNKTQ SNLQGLSSI SMQAEQVNTTPGI KD FEKQ
Subjt:  FDRQVTINDSKRVQGGTVPHLHNNITSQQNPVNWKSSTSKEQNAGPLSSMSYIKQEPSDQVAEQNKTQLSNLQGLSSISSMQAEQVNTTPGIVKDSFEKQ

Query:  SSKMAFPTANNVMPPTSTNAANSISSDAISLHDSSAMLSSQVPSTATPGMHNRAPQKKVTIGQKKPLDALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA
        +SK+AFPT+NNVMP +STNAAN ISSDA SLH+SSA+LSSQVPS  TPGM NRAPQKK  +GQKKPL+ALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA
Subjt:  SSKMAFPTANNVMPPTSTNAANSISSDAISLHDSSAMLSSQVPSTATPGMHNRAPQKKVTIGQKKPLDALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA

Query:  VSGVNIREEEEQLFSGAKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDVERCLSLCVEERLRGIISNLIRLSKQRVDTEKPRHR
        VSGVNIREEEEQLFS AKEDSRASEASR+VVQEEEERL+LQKAPLQKKL+EIMAK GLK MSNDVE+CLSL VEERLRGIISNLIRLSKQRVDTEKPRHR
Subjt:  VSGVNIREEEEQLFSGAKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDVERCLSLCVEERLRGIISNLIRLSKQRVDTEKPRHR

Query:  TLITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGSGAAGDKEKDEGRMKSVKANKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK
        T ITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGS  AGDK+KDEGRMKSVK NKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK
Subjt:  TLITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGSGAAGDKEKDEGRMKSVKANKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK

Query:  REGGMDSASGSQSGKDAVRKSSSAAGRHGKDNQEADRKGTTRKFGRNQVIA
        REGGMDSASGSQSGKD VRKSSSAAGRHGKDNQE DRKGT++KFGRNQ+ A
Subjt:  REGGMDSASGSQSGKDAVRKSSSAAGRHGKDNQEADRKGTTRKFGRNQVIA

XP_022952264.1 transcription initiation factor TFIID subunit 4b isoform X2 [Cucurbita moschata] (e value: 0.0e+00; % identity: 88.6)
Query:  KDETMHSGAAVEAFQAALNRDIEGDAPAASQTSESDAAFPRGSNSTSGLSLQASSQNENTESHVHQDQNFLLKQEQHSSLMEQERCTSVPENQQQHNATL
        +DETMHSGAAVEAFQAALNRDIEGD PA SQ  ESDAAFPRGSN  S LSLQA SQNE TESH  QDQNF  KQEQHSSLME ERC+S+PENQQQHNA  
Subjt:  KDETMHSGAAVEAFQAALNRDIEGDAPAASQTSESDAAFPRGSNSTSGLSLQASSQNENTESHVHQDQNFLLKQEQHSSLMEQERCTSVPENQQQHNATL

Query:  LQASKNQPQADHEQGEAEQASVQFSQTAGLQVSDKAPMLVNDSNRMQNRDNESQYLKLQKMSNQQAMVSEQASNPINRSKQVPFASLMPVLMPQLDKDRA
        LQASKNQPQAD EQG+ EQ S QFSQTAGLQ S+KAP+LVNDSNRMQNRDNESQYLKLQKMSNQQ MVSEQA+NP+NRSKQVPFASLMPVLMPQLDKDRA
Subjt:  LQASKNQPQADHEQGEAEQASVQFSQTAGLQVSDKAPMLVNDSNRMQNRDNESQYLKLQKMSNQQAMVSEQASNPINRSKQVPFASLMPVLMPQLDKDRA

Query:  MQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPSIGPGAPNFSDARPFAQLHQKGMNSSAVQSYIPSSASQGRSSSS
        MQLQTLFN+LKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPS+GPGAPNFSD RPF+QLHQKGMNS AVQSYIPSSASQGRSSS 
Subjt:  MQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPSIGPGAPNFSDARPFAQLHQKGMNSSAVQSYIPSSASQGRSSSS

Query:  YPAIDKNMQSLREVEQRTDGNGNQLTSSSSGTIQERERSSVPVPGLEKQQLHFQQKPFAMYGNSGNYHPYTGSNINPSLLSLKPQPHEGQVKQIPQQGPN
        YPA+DKNMQSLREVEQRTDGN NQLTSSSSGTIQERERSS+PVPGLEKQQLHFQQK F MYGNSGNYHPYTGSNIN S LSLKPQPHEGQ KQI QQ PN
Subjt:  YPAIDKNMQSLREVEQRTDGNGNQLTSSSSGTIQERERSSVPVPGLEKQQLHFQQKPFAMYGNSGNYHPYTGSNINPSLLSLKPQPHEGQVKQIPQQGPN

Query:  FDRQVTINDSKRVQGGTVPHLHNNITSQQNPVNWKSSTSKEQNAGPLSSMSYIKQEPSDQVAEQNKTQLSNLQGLSSISSMQAEQVNTTPGIVKDSFEKQ
        FDRQVTIND KRVQGG V HL NN+TSQ +P  WKSSTSKEQN GPLSSMSYIKQEPSDQV+EQNKTQ SNLQGLSSI SMQAEQVNTTPGI KD FEKQ
Subjt:  FDRQVTINDSKRVQGGTVPHLHNNITSQQNPVNWKSSTSKEQNAGPLSSMSYIKQEPSDQVAEQNKTQLSNLQGLSSISSMQAEQVNTTPGIVKDSFEKQ

Query:  SSKMAFPTANNVMPPTSTNAANSISSDAISLHDSSAMLSSQVPSTATPGMHNRAPQKKVTIGQKKPLDALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA
        +SK+AFPT+NNVMP +STNAAN ISSDA SLH+SSA+LSSQVPS  TPGM NRAPQKK  +GQKKPL+ALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA
Subjt:  SSKMAFPTANNVMPPTSTNAANSISSDAISLHDSSAMLSSQVPSTATPGMHNRAPQKKVTIGQKKPLDALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA

Query:  VSGVNIREEEEQLFSGAKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDVERCLSLCVEERLRGIISNLIRLSKQRVDTEKPRHR
        VSGVNIREEEEQLFS AKEDSRASEASR+VVQEEEERL+LQKAPLQKKL+EIMAK GLK MSNDVE+CLSL VEERLRGIISNLIRLSKQRVDTEKPRHR
Subjt:  VSGVNIREEEEQLFSGAKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDVERCLSLCVEERLRGIISNLIRLSKQRVDTEKPRHR

Query:  TLITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGSGAAGDKEKDEGRMKSVKANKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK
        T ITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGS  AGDK+KDEGRMKSV  NKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK
Subjt:  TLITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGSGAAGDKEKDEGRMKSVKANKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK

Query:  REGGMDSASGSQSGKDAVRKSSSAAGRHGKDNQEADRKGTTRKFGRNQVIA
        REGGMDSASGSQSGKD VRKSSSAAGRHGKDNQE DRKGT++KFGRNQ+ A
Subjt:  REGGMDSASGSQSGKDAVRKSSSAAGRHGKDNQEADRKGTTRKFGRNQVIA

XP_023554163.1 transcription initiation factor TFIID subunit 4b isoform X1 [Cucurbita pepo subsp. pepo] (e value: 0.0e+00; % identity: 88.48)
Query:  KDETMHSGAAVEAFQAALNRDIEGDAPAASQTSESDAAFPRGSNSTSGLSLQASSQNENTESHVHQDQNFLLKQEQHSSLMEQERCTSVPENQQQHNATL
        +DETMHSGAAVEAFQAALNRDIEGD PA SQ  ESDAAFPRGSN  S LSLQA SQNE TESH  QDQNF  KQEQHSSLME ERC+S+PENQQQHNA  
Subjt:  KDETMHSGAAVEAFQAALNRDIEGDAPAASQTSESDAAFPRGSNSTSGLSLQASSQNENTESHVHQDQNFLLKQEQHSSLMEQERCTSVPENQQQHNATL

Query:  LQASKNQPQADHEQGEAEQASVQFSQTAGLQVSDKAPMLVNDSNRMQNRDNESQYLKLQKMSNQQAMVSEQASNPINRSKQVPFASLMPVLMPQLDKDRA
        LQASKNQPQAD EQG+AEQ S QFSQTAGLQ S+KAP+LVNDSNRMQNRDNESQYLKLQKMSNQQ MVSEQA+NP+NRSKQVPFASLMPVLMPQLDKDRA
Subjt:  LQASKNQPQADHEQGEAEQASVQFSQTAGLQVSDKAPMLVNDSNRMQNRDNESQYLKLQKMSNQQAMVSEQASNPINRSKQVPFASLMPVLMPQLDKDRA

Query:  MQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPSIGPGAPNFSDARPFAQLHQKGMNSSAVQSYIPSSASQGRSSSS
        MQLQTLFN+LKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPS+GPGAPNFSD RPF+QLHQKGMNS AVQSY+PSSASQGR SS 
Subjt:  MQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPSIGPGAPNFSDARPFAQLHQKGMNSSAVQSYIPSSASQGRSSSS

Query:  YPAIDKNMQSLREVEQRTDGNGNQLTSSSSGTIQERERSSVPVPGLEKQQLHFQQKPFAMYGNSGNYHPYTGSNINPSLLSLKPQPHEGQVKQIPQQGPN
        YPA+DKNMQSLREVEQRTDGN NQLTSSSSGTIQERERSS+PVPGLEKQQLHFQQK F MYGNSGNYHPYTGSNIN S LSLKPQPHEGQ KQI QQ PN
Subjt:  YPAIDKNMQSLREVEQRTDGNGNQLTSSSSGTIQERERSSVPVPGLEKQQLHFQQKPFAMYGNSGNYHPYTGSNINPSLLSLKPQPHEGQVKQIPQQGPN

Query:  FDRQVTINDSKRVQGGTVPHLHNNITSQQNPVNWKSSTSKEQNAGPLSSMSYIKQEPSDQVAEQNKTQLSNLQGLSSISSMQAEQVNTTPGIVKDSFEKQ
        FDRQV+IND KRVQGG V HL NN+TSQ +P  WKSSTSKEQN GPLSSMSYIKQEPSDQV+EQNKTQ SNLQGLSSI SMQAEQVNTTPGI KD FEKQ
Subjt:  FDRQVTINDSKRVQGGTVPHLHNNITSQQNPVNWKSSTSKEQNAGPLSSMSYIKQEPSDQVAEQNKTQLSNLQGLSSISSMQAEQVNTTPGIVKDSFEKQ

Query:  SSKMAFPTANNVMPPTSTNAANSISSDAISLHDSSAMLSSQVPSTATPGMHNRAPQKKVTIGQKKPLDALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA
        +SK+AFPT+NNVMP +STNAAN ISSDA SLH+SSA+LSSQVPS  TPGM NRAPQKK  +GQKKPL+ALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA
Subjt:  SSKMAFPTANNVMPPTSTNAANSISSDAISLHDSSAMLSSQVPSTATPGMHNRAPQKKVTIGQKKPLDALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA

Query:  VSGVNIREEEEQLFSGAKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDVERCLSLCVEERLRGIISNLIRLSKQRVDTEKPRHR
        VSGVNIREEEEQLFS AKEDSRASEASR+VVQEEEERL+LQKAPLQKKL+EIMAK GLK MSNDVE+CLSL VEERLRGIISNLIRLSKQRVDTEKPRHR
Subjt:  VSGVNIREEEEQLFSGAKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDVERCLSLCVEERLRGIISNLIRLSKQRVDTEKPRHR

Query:  TLITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGSGAAGDKEKDEGRMKSVKANKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK
        T ITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGS  AGDK+KDEGRMKSVK NKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK
Subjt:  TLITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGSGAAGDKEKDEGRMKSVKANKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK

Query:  REGGMDSASGSQSGKDAVRKSSSAAGRHGKDNQEADRKGTTRKFGRNQVIA
        REGGMDSASGSQSGKD VRKSSSAAGRHGKDNQE DRKGT++KFGRNQ+ A
Subjt:  REGGMDSASGSQSGKDAVRKSSSAAGRHGKDNQEADRKGTTRKFGRNQVIA

XP_023554166.1 transcription initiation factor TFIID subunit 4b isoform X2 [Cucurbita pepo subsp. pepo] (e value: 0.0e+00; % identity: 88.37)
Query:  KDETMHSGAAVEAFQAALNRDIEGDAPAASQTSESDAAFPRGSNSTSGLSLQASSQNENTESHVHQDQNFLLKQEQHSSLMEQERCTSVPENQQQHNATL
        +DETMHSGAAVEAFQAALNRDIEGD PA SQ  ESDAAFPRGSN  S LSLQA SQNE TESH  QDQNF  KQEQHSSLME ERC+S+PENQQQHNA  
Subjt:  KDETMHSGAAVEAFQAALNRDIEGDAPAASQTSESDAAFPRGSNSTSGLSLQASSQNENTESHVHQDQNFLLKQEQHSSLMEQERCTSVPENQQQHNATL

Query:  LQASKNQPQADHEQGEAEQASVQFSQTAGLQVSDKAPMLVNDSNRMQNRDNESQYLKLQKMSNQQAMVSEQASNPINRSKQVPFASLMPVLMPQLDKDRA
        LQASKNQPQAD EQG+AEQ S QFSQTAGLQ S+KAP+LVNDSNRMQNRDNESQYLKLQKMSNQQ MVSEQA+NP+NRSKQVPFASLMPVLMPQLDKDRA
Subjt:  LQASKNQPQADHEQGEAEQASVQFSQTAGLQVSDKAPMLVNDSNRMQNRDNESQYLKLQKMSNQQAMVSEQASNPINRSKQVPFASLMPVLMPQLDKDRA

Query:  MQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPSIGPGAPNFSDARPFAQLHQKGMNSSAVQSYIPSSASQGRSSSS
        MQLQTLFN+LKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPS+GPGAPNFSD RPF+QLHQKGMNS AVQSY+PSSASQGR SS 
Subjt:  MQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPSIGPGAPNFSDARPFAQLHQKGMNSSAVQSYIPSSASQGRSSSS

Query:  YPAIDKNMQSLREVEQRTDGNGNQLTSSSSGTIQERERSSVPVPGLEKQQLHFQQKPFAMYGNSGNYHPYTGSNINPSLLSLKPQPHEGQVKQIPQQGPN
        YPA+DKNMQSLREVEQRTDGN NQLTSSSSGTIQERERSS+PVPGLEKQQLHFQQK F MYGNSGNYHPYTGSNIN S LSLKPQPHEGQ KQI QQ PN
Subjt:  YPAIDKNMQSLREVEQRTDGNGNQLTSSSSGTIQERERSSVPVPGLEKQQLHFQQKPFAMYGNSGNYHPYTGSNINPSLLSLKPQPHEGQVKQIPQQGPN

Query:  FDRQVTINDSKRVQGGTVPHLHNNITSQQNPVNWKSSTSKEQNAGPLSSMSYIKQEPSDQVAEQNKTQLSNLQGLSSISSMQAEQVNTTPGIVKDSFEKQ
        FDRQV+IND KRVQGG V HL NN+TSQ +P  WKSSTSKEQN GPLSSMSYIKQEPSDQV+EQNKTQ SNLQGLSSI SMQAEQVNTTPGI KD FEKQ
Subjt:  FDRQVTINDSKRVQGGTVPHLHNNITSQQNPVNWKSSTSKEQNAGPLSSMSYIKQEPSDQVAEQNKTQLSNLQGLSSISSMQAEQVNTTPGIVKDSFEKQ

Query:  SSKMAFPTANNVMPPTSTNAANSISSDAISLHDSSAMLSSQVPSTATPGMHNRAPQKKVTIGQKKPLDALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA
        +SK+AFPT+NNVMP +STNAAN ISSDA SLH+SSA+LSSQVPS  TPGM NRAPQKK  +GQKKPL+ALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA
Subjt:  SSKMAFPTANNVMPPTSTNAANSISSDAISLHDSSAMLSSQVPSTATPGMHNRAPQKKVTIGQKKPLDALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA

Query:  VSGVNIREEEEQLFSGAKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDVERCLSLCVEERLRGIISNLIRLSKQRVDTEKPRHR
        VSGVNIREEEEQLFS AKEDSRASEASR+VVQEEEERL+LQKAPLQKKL+EIMAK GLK MSNDVE+CLSL VEERLRGIISNLIRLSKQRVDTEKPRHR
Subjt:  VSGVNIREEEEQLFSGAKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDVERCLSLCVEERLRGIISNLIRLSKQRVDTEKPRHR

Query:  TLITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGSGAAGDKEKDEGRMKSVKANKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK
        T ITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGS  AGDK+KDEGRMKSV  NKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK
Subjt:  TLITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGSGAAGDKEKDEGRMKSVKANKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK

Query:  REGGMDSASGSQSGKDAVRKSSSAAGRHGKDNQEADRKGTTRKFGRNQVIA
        REGGMDSASGSQSGKD VRKSSSAAGRHGKDNQE DRKGT++KFGRNQ+ A
Subjt:  REGGMDSASGSQSGKDAVRKSSSAAGRHGKDNQEADRKGTTRKFGRNQVIA

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DH47 transcription initiation factor TFIID subunit 4b (e value: 0.0e+00; % identity: 91.17)
Query:  KDETMHSGAAVEAFQAALNRDIEGDAPAASQTSESDAAFPRGSNSTSGLSLQASSQNENTESHVHQDQNFLLKQEQHSSLMEQERCTSVPENQQQHNATL
        +DETMHSGAAVEAFQAALNRDIEGD PA SQTSESDAAFPRGSNS SGLSLQASSQNENTESHV QDQNFLLKQEQHSSLME ERCTS PENQQ+HNA L
Subjt:  KDETMHSGAAVEAFQAALNRDIEGDAPAASQTSESDAAFPRGSNSTSGLSLQASSQNENTESHVHQDQNFLLKQEQHSSLMEQERCTSVPENQQQHNATL

Query:  LQASKNQPQADHEQGEAEQASVQFSQTAGLQVSDKAPMLVNDSNRMQNRDNESQYLKLQKMSNQQAMVSEQASNPINRSKQVPFASLMPVLMPQLDKDRA
        LQ+SKNQPQADHEQGE EQ  VQFSQTAGLQ S+KAP+LVNDS+RM NRDNESQYLKLQKMSNQQAMVSEQASN +NRSKQVPFASLMPVLMPQLDKDRA
Subjt:  LQASKNQPQADHEQGEAEQASVQFSQTAGLQVSDKAPMLVNDSNRMQNRDNESQYLKLQKMSNQQAMVSEQASNPINRSKQVPFASLMPVLMPQLDKDRA

Query:  MQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPSIGPGAPNFSDARPFAQLHQKGMNSSAVQSYIPSSASQGRSSSS
        MQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQ+QPPPSVRQLSPRMPS+GPGA NFSD RPFAQLHQKGMNS  VQSYIPS ASQGR SS 
Subjt:  MQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPSIGPGAPNFSDARPFAQLHQKGMNSSAVQSYIPSSASQGRSSSS

Query:  YPAIDKNMQSLREVEQRTDGNGNQLTSSSSGTIQERERSSVPVPGLEKQQLHFQQKPFAMYGNSGNYHPYTGSNINPSLLSLKPQPHEGQVKQIPQQGPN
        YP +DKNMQSLREVEQRTDGNGNQLTSSS+GTIQERERSSVPVPGLEKQQLHFQQK FAMYG SG+YH YTGSNIN S LSLKPQPHEGQVKQI QQ PN
Subjt:  YPAIDKNMQSLREVEQRTDGNGNQLTSSSSGTIQERERSSVPVPGLEKQQLHFQQKPFAMYGNSGNYHPYTGSNINPSLLSLKPQPHEGQVKQIPQQGPN

Query:  FDRQVTINDSKRVQGGTVPHLHNNITSQQNPVNWKSSTSKEQNAGPLSSMSYIKQEPSDQVAEQNKTQLSNLQGLSSISSMQAEQVNTTPGIVKDSFEKQ
        FDRQVTINDSKRVQGG VPHLHNN+TSQQNPV+WKS+TSKEQN GPLSSMSYIKQEPSDQVAEQNKTQLSNLQGLSSISS+QAEQVNTTPGI KDSFEKQ
Subjt:  FDRQVTINDSKRVQGGTVPHLHNNITSQQNPVNWKSSTSKEQNAGPLSSMSYIKQEPSDQVAEQNKTQLSNLQGLSSISSMQAEQVNTTPGIVKDSFEKQ

Query:  SSKMAFPTANNVMPPTSTNAANSISSDAISLHDSSAMLSSQVPSTATPGMHNRAPQKKVTIGQKKPLDALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA
        SSKM FPT+NNV+PPTSTN ANSISSDA+S+HDSSAML SQVPS  TPG+ NR  QKK T+GQKKPL+ALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA
Subjt:  SSKMAFPTANNVMPPTSTNAANSISSDAISLHDSSAMLSSQVPSTATPGMHNRAPQKKVTIGQKKPLDALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA

Query:  VSGVNIREEEEQLFSGAKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDVERCLSLCVEERLRGIISNLIRLSKQRVDTEKPRHR
        VSGVNIREEEEQLFS AKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDVE+CLSLCVEERLRGIISNLIRLSKQRVD+EKPRHR
Subjt:  VSGVNIREEEEQLFSGAKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDVERCLSLCVEERLRGIISNLIRLSKQRVDTEKPRHR

Query:  TLITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGSGAAGDKEKDEGRMKSVKANKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK
        TLITSDVRQQI LVNQKAREEWEKKQAEEEKLRKLNDPEDGSG AGDKEKDE R+KSVK NKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK
Subjt:  TLITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGSGAAGDKEKDEGRMKSVKANKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK

Query:  REGGMDSASGSQSGKDAVRKSSSAAGRHGKDNQEADRKGTTRKFGRNQV
        REGG+DSASGSQSGKDA RKSSSAAGRHGKDNQEADRKGT+RKFGRNQ+
Subjt:  REGGMDSASGSQSGKDAVRKSSSAAGRHGKDNQEADRKGTTRKFGRNQV

A0A6J1GJW9 transcription initiation factor TFIID subunit 4b isoform X1 (e value: 0.0e+00; % identity: 88.72)
Query:  KDETMHSGAAVEAFQAALNRDIEGDAPAASQTSESDAAFPRGSNSTSGLSLQASSQNENTESHVHQDQNFLLKQEQHSSLMEQERCTSVPENQQQHNATL
        +DETMHSGAAVEAFQAALNRDIEGD PA SQ  ESDAAFPRGSN  S LSLQA SQNE TESH  QDQNF  KQEQHSSLME ERC+S+PENQQQHNA  
Subjt:  KDETMHSGAAVEAFQAALNRDIEGDAPAASQTSESDAAFPRGSNSTSGLSLQASSQNENTESHVHQDQNFLLKQEQHSSLMEQERCTSVPENQQQHNATL

Query:  LQASKNQPQADHEQGEAEQASVQFSQTAGLQVSDKAPMLVNDSNRMQNRDNESQYLKLQKMSNQQAMVSEQASNPINRSKQVPFASLMPVLMPQLDKDRA
        LQASKNQPQAD EQG+ EQ S QFSQTAGLQ S+KAP+LVNDSNRMQNRDNESQYLKLQKMSNQQ MVSEQA+NP+NRSKQVPFASLMPVLMPQLDKDRA
Subjt:  LQASKNQPQADHEQGEAEQASVQFSQTAGLQVSDKAPMLVNDSNRMQNRDNESQYLKLQKMSNQQAMVSEQASNPINRSKQVPFASLMPVLMPQLDKDRA

Query:  MQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPSIGPGAPNFSDARPFAQLHQKGMNSSAVQSYIPSSASQGRSSSS
        MQLQTLFN+LKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPS+GPGAPNFSD RPF+QLHQKGMNS AVQSYIPSSASQGRSSS 
Subjt:  MQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPSIGPGAPNFSDARPFAQLHQKGMNSSAVQSYIPSSASQGRSSSS

Query:  YPAIDKNMQSLREVEQRTDGNGNQLTSSSSGTIQERERSSVPVPGLEKQQLHFQQKPFAMYGNSGNYHPYTGSNINPSLLSLKPQPHEGQVKQIPQQGPN
        YPA+DKNMQSLREVEQRTDGN NQLTSSSSGTIQERERSS+PVPGLEKQQLHFQQK F MYGNSGNYHPYTGSNIN S LSLKPQPHEGQ KQI QQ PN
Subjt:  YPAIDKNMQSLREVEQRTDGNGNQLTSSSSGTIQERERSSVPVPGLEKQQLHFQQKPFAMYGNSGNYHPYTGSNINPSLLSLKPQPHEGQVKQIPQQGPN

Query:  FDRQVTINDSKRVQGGTVPHLHNNITSQQNPVNWKSSTSKEQNAGPLSSMSYIKQEPSDQVAEQNKTQLSNLQGLSSISSMQAEQVNTTPGIVKDSFEKQ
        FDRQVTIND KRVQGG V HL NN+TSQ +P  WKSSTSKEQN GPLSSMSYIKQEPSDQV+EQNKTQ SNLQGLSSI SMQAEQVNTTPGI KD FEKQ
Subjt:  FDRQVTINDSKRVQGGTVPHLHNNITSQQNPVNWKSSTSKEQNAGPLSSMSYIKQEPSDQVAEQNKTQLSNLQGLSSISSMQAEQVNTTPGIVKDSFEKQ

Query:  SSKMAFPTANNVMPPTSTNAANSISSDAISLHDSSAMLSSQVPSTATPGMHNRAPQKKVTIGQKKPLDALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA
        +SK+AFPT+NNVMP +STNAAN ISSDA SLH+SSA+LSSQVPS  TPGM NRAPQKK  +GQKKPL+ALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA
Subjt:  SSKMAFPTANNVMPPTSTNAANSISSDAISLHDSSAMLSSQVPSTATPGMHNRAPQKKVTIGQKKPLDALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA

Query:  VSGVNIREEEEQLFSGAKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDVERCLSLCVEERLRGIISNLIRLSKQRVDTEKPRHR
        VSGVNIREEEEQLFS AKEDSRASEASR+VVQEEEERL+LQKAPLQKKL+EIMAK GLK MSNDVE+CLSL VEERLRGIISNLIRLSKQRVDTEKPRHR
Subjt:  VSGVNIREEEEQLFSGAKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDVERCLSLCVEERLRGIISNLIRLSKQRVDTEKPRHR

Query:  TLITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGSGAAGDKEKDEGRMKSVKANKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK
        T ITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGS  AGDK+KDEGRMKSVK NKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK
Subjt:  TLITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGSGAAGDKEKDEGRMKSVKANKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK

Query:  REGGMDSASGSQSGKDAVRKSSSAAGRHGKDNQEADRKGTTRKFGRNQVIA
        REGGMDSASGSQSGKD VRKSSSAAGRHGKDNQE DRKGT++KFGRNQ+ A
Subjt:  REGGMDSASGSQSGKDAVRKSSSAAGRHGKDNQEADRKGTTRKFGRNQVIA

A0A6J1GL68 transcription initiation factor TFIID subunit 4b isoform X2 (e value: 0.0e+00; % identity: 88.6)
Query:  KDETMHSGAAVEAFQAALNRDIEGDAPAASQTSESDAAFPRGSNSTSGLSLQASSQNENTESHVHQDQNFLLKQEQHSSLMEQERCTSVPENQQQHNATL
        +DETMHSGAAVEAFQAALNRDIEGD PA SQ  ESDAAFPRGSN  S LSLQA SQNE TESH  QDQNF  KQEQHSSLME ERC+S+PENQQQHNA  
Subjt:  KDETMHSGAAVEAFQAALNRDIEGDAPAASQTSESDAAFPRGSNSTSGLSLQASSQNENTESHVHQDQNFLLKQEQHSSLMEQERCTSVPENQQQHNATL

Query:  LQASKNQPQADHEQGEAEQASVQFSQTAGLQVSDKAPMLVNDSNRMQNRDNESQYLKLQKMSNQQAMVSEQASNPINRSKQVPFASLMPVLMPQLDKDRA
        LQASKNQPQAD EQG+ EQ S QFSQTAGLQ S+KAP+LVNDSNRMQNRDNESQYLKLQKMSNQQ MVSEQA+NP+NRSKQVPFASLMPVLMPQLDKDRA
Subjt:  LQASKNQPQADHEQGEAEQASVQFSQTAGLQVSDKAPMLVNDSNRMQNRDNESQYLKLQKMSNQQAMVSEQASNPINRSKQVPFASLMPVLMPQLDKDRA

Query:  MQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPSIGPGAPNFSDARPFAQLHQKGMNSSAVQSYIPSSASQGRSSSS
        MQLQTLFN+LKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPS+GPGAPNFSD RPF+QLHQKGMNS AVQSYIPSSASQGRSSS 
Subjt:  MQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPSIGPGAPNFSDARPFAQLHQKGMNSSAVQSYIPSSASQGRSSSS

Query:  YPAIDKNMQSLREVEQRTDGNGNQLTSSSSGTIQERERSSVPVPGLEKQQLHFQQKPFAMYGNSGNYHPYTGSNINPSLLSLKPQPHEGQVKQIPQQGPN
        YPA+DKNMQSLREVEQRTDGN NQLTSSSSGTIQERERSS+PVPGLEKQQLHFQQK F MYGNSGNYHPYTGSNIN S LSLKPQPHEGQ KQI QQ PN
Subjt:  YPAIDKNMQSLREVEQRTDGNGNQLTSSSSGTIQERERSSVPVPGLEKQQLHFQQKPFAMYGNSGNYHPYTGSNINPSLLSLKPQPHEGQVKQIPQQGPN

Query:  FDRQVTINDSKRVQGGTVPHLHNNITSQQNPVNWKSSTSKEQNAGPLSSMSYIKQEPSDQVAEQNKTQLSNLQGLSSISSMQAEQVNTTPGIVKDSFEKQ
        FDRQVTIND KRVQGG V HL NN+TSQ +P  WKSSTSKEQN GPLSSMSYIKQEPSDQV+EQNKTQ SNLQGLSSI SMQAEQVNTTPGI KD FEKQ
Subjt:  FDRQVTINDSKRVQGGTVPHLHNNITSQQNPVNWKSSTSKEQNAGPLSSMSYIKQEPSDQVAEQNKTQLSNLQGLSSISSMQAEQVNTTPGIVKDSFEKQ

Query:  SSKMAFPTANNVMPPTSTNAANSISSDAISLHDSSAMLSSQVPSTATPGMHNRAPQKKVTIGQKKPLDALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA
        +SK+AFPT+NNVMP +STNAAN ISSDA SLH+SSA+LSSQVPS  TPGM NRAPQKK  +GQKKPL+ALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA
Subjt:  SSKMAFPTANNVMPPTSTNAANSISSDAISLHDSSAMLSSQVPSTATPGMHNRAPQKKVTIGQKKPLDALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA

Query:  VSGVNIREEEEQLFSGAKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDVERCLSLCVEERLRGIISNLIRLSKQRVDTEKPRHR
        VSGVNIREEEEQLFS AKEDSRASEASR+VVQEEEERL+LQKAPLQKKL+EIMAK GLK MSNDVE+CLSL VEERLRGIISNLIRLSKQRVDTEKPRHR
Subjt:  VSGVNIREEEEQLFSGAKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDVERCLSLCVEERLRGIISNLIRLSKQRVDTEKPRHR

Query:  TLITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGSGAAGDKEKDEGRMKSVKANKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK
        T ITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGS  AGDK+KDEGRMKSV  NKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK
Subjt:  TLITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGSGAAGDKEKDEGRMKSVKANKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK

Query:  REGGMDSASGSQSGKDAVRKSSSAAGRHGKDNQEADRKGTTRKFGRNQVIA
        REGGMDSASGSQSGKD VRKSSSAAGRHGKDNQE DRKGT++KFGRNQ+ A
Subjt:  REGGMDSASGSQSGKDAVRKSSSAAGRHGKDNQEADRKGTTRKFGRNQVIA

A0A6J1I3C1 transcription initiation factor TFIID subunit 4b isoform X2 (e value: 0.0e+00; % identity: 87.66)
Query:  KDETMHSGAAVEAFQAALNRDIEGDAPAASQTSESDAAFPRGSNSTSGLSLQASSQNENTESHVHQDQNFLLKQEQHSSLMEQERCTSVPENQQQHNATL
        +DETMHSGAAVEAFQAALNRDIEGD PA S+   SDAAFPRGSN  S L LQA SQNE TESH  QDQNF  KQEQHSSLME ERC+S+PENQQQHNA  
Subjt:  KDETMHSGAAVEAFQAALNRDIEGDAPAASQTSESDAAFPRGSNSTSGLSLQASSQNENTESHVHQDQNFLLKQEQHSSLMEQERCTSVPENQQQHNATL

Query:  LQASKNQPQADHEQGEAEQASVQFSQTAGLQVSDKAPMLVNDSNRMQNRDNESQYLKLQKMSNQQAMVSEQASNPINRSKQVPFASLMPVLMPQLDKDRA
        LQASKNQPQ+D EQG+AEQ S QFSQT GLQ S+KAP+LVNDSNRMQNRDNESQYLKLQKMSNQQ MVSEQA+NP+NRSKQVPFASLMPVLMPQLDKDRA
Subjt:  LQASKNQPQADHEQGEAEQASVQFSQTAGLQVSDKAPMLVNDSNRMQNRDNESQYLKLQKMSNQQAMVSEQASNPINRSKQVPFASLMPVLMPQLDKDRA

Query:  MQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPSIGPGAPNFSDARPFAQLHQKGMNSSAVQSYIPSSASQGRSSSS
        MQLQTLFN+LKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPS+GPGAPNFSD RPF+QLHQKGMNS AVQSYIPSSASQGRSSS 
Subjt:  MQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPSIGPGAPNFSDARPFAQLHQKGMNSSAVQSYIPSSASQGRSSSS

Query:  YPAIDKNMQSLREVEQRTDGNGNQLTSSSSGTIQERERSSVPVPGLEKQQLHFQQKPFAMYGNSGNYHPYTGSNINPSLLSLKPQPHEGQVKQIPQQGPN
        YPA+DKNMQSLREVEQRTDGN NQLTSSSSGTIQERERSS+PVPGLEKQQLHFQQK F MYGNSGNYHPYTGSNIN S LSLKPQPHEGQVKQI QQ  N
Subjt:  YPAIDKNMQSLREVEQRTDGNGNQLTSSSSGTIQERERSSVPVPGLEKQQLHFQQKPFAMYGNSGNYHPYTGSNINPSLLSLKPQPHEGQVKQIPQQGPN

Query:  FDRQVTINDSKRVQGGTVPHLHNNITSQQNPVNWKSSTSKEQNAGPLSSMSYIKQEPSDQVAEQNKTQLSNLQGLSSISSMQAEQVNTTPGIVKDSFEKQ
        FDRQVTIND KRVQGG V HL NN+TSQ +P  WKSSTSKEQ  GPLSSMSYIKQEPSDQV+EQNKTQ SNLQGLSSI SMQ+EQVNTTPGI KD FEKQ
Subjt:  FDRQVTINDSKRVQGGTVPHLHNNITSQQNPVNWKSSTSKEQNAGPLSSMSYIKQEPSDQVAEQNKTQLSNLQGLSSISSMQAEQVNTTPGIVKDSFEKQ

Query:  SSKMAFPTANNVMPPTSTNAANSISSDAISLHDSSAMLSSQVPSTATPGMHNRAPQKKVTIGQKKPLDALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA
        +SK+ FPT+NNVMP +STNAAN ISSDA SL +SSA+LSSQVPS  TPGM NRAPQKK  +GQKK L+ALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA
Subjt:  SSKMAFPTANNVMPPTSTNAANSISSDAISLHDSSAMLSSQVPSTATPGMHNRAPQKKVTIGQKKPLDALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA

Query:  VSGVNIREEEEQLFSGAKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDVERCLSLCVEERLRGIISNLIRLSKQRVDTEKPRHR
        VSGVNIREEEEQLFS AKEDSRASEASR+VVQEEEERL+LQKAPLQKKL+EIMAK GLK MSNDVE+CLSL VEERLRGIISNLIRLSKQRVDTEKPRHR
Subjt:  VSGVNIREEEEQLFSGAKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDVERCLSLCVEERLRGIISNLIRLSKQRVDTEKPRHR

Query:  TLITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGSGAAGDKEKDEGRMKSVKANKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK
        T ITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGS  AGDK+KDEGRMKSV  NKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK
Subjt:  TLITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGSGAAGDKEKDEGRMKSVKANKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK

Query:  REGGMDSASGSQSGKDAVRKSSSAAGRHGKDNQEADRKGTTRKFGRNQVIA
        REGGMDSASGSQSGKDAVRKSSSAAGRHGKDNQE DRKGT++KFGRNQ+ A
Subjt:  REGGMDSASGSQSGKDAVRKSSSAAGRHGKDNQEADRKGTTRKFGRNQVIA

A0A6J1I4L6 transcription initiation factor TFIID subunit 4b isoform X1 (e value: 0.0e+00; % identity: 87.78)
Query:  KDETMHSGAAVEAFQAALNRDIEGDAPAASQTSESDAAFPRGSNSTSGLSLQASSQNENTESHVHQDQNFLLKQEQHSSLMEQERCTSVPENQQQHNATL
        +DETMHSGAAVEAFQAALNRDIEGD PA S+   SDAAFPRGSN  S L LQA SQNE TESH  QDQNF  KQEQHSSLME ERC+S+PENQQQHNA  
Subjt:  KDETMHSGAAVEAFQAALNRDIEGDAPAASQTSESDAAFPRGSNSTSGLSLQASSQNENTESHVHQDQNFLLKQEQHSSLMEQERCTSVPENQQQHNATL

Query:  LQASKNQPQADHEQGEAEQASVQFSQTAGLQVSDKAPMLVNDSNRMQNRDNESQYLKLQKMSNQQAMVSEQASNPINRSKQVPFASLMPVLMPQLDKDRA
        LQASKNQPQ+D EQG+AEQ S QFSQT GLQ S+KAP+LVNDSNRMQNRDNESQYLKLQKMSNQQ MVSEQA+NP+NRSKQVPFASLMPVLMPQLDKDRA
Subjt:  LQASKNQPQADHEQGEAEQASVQFSQTAGLQVSDKAPMLVNDSNRMQNRDNESQYLKLQKMSNQQAMVSEQASNPINRSKQVPFASLMPVLMPQLDKDRA

Query:  MQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPSIGPGAPNFSDARPFAQLHQKGMNSSAVQSYIPSSASQGRSSSS
        MQLQTLFN+LKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPS+GPGAPNFSD RPF+QLHQKGMNS AVQSYIPSSASQGRSSS 
Subjt:  MQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPSIGPGAPNFSDARPFAQLHQKGMNSSAVQSYIPSSASQGRSSSS

Query:  YPAIDKNMQSLREVEQRTDGNGNQLTSSSSGTIQERERSSVPVPGLEKQQLHFQQKPFAMYGNSGNYHPYTGSNINPSLLSLKPQPHEGQVKQIPQQGPN
        YPA+DKNMQSLREVEQRTDGN NQLTSSSSGTIQERERSS+PVPGLEKQQLHFQQK F MYGNSGNYHPYTGSNIN S LSLKPQPHEGQVKQI QQ  N
Subjt:  YPAIDKNMQSLREVEQRTDGNGNQLTSSSSGTIQERERSSVPVPGLEKQQLHFQQKPFAMYGNSGNYHPYTGSNINPSLLSLKPQPHEGQVKQIPQQGPN

Query:  FDRQVTINDSKRVQGGTVPHLHNNITSQQNPVNWKSSTSKEQNAGPLSSMSYIKQEPSDQVAEQNKTQLSNLQGLSSISSMQAEQVNTTPGIVKDSFEKQ
        FDRQVTIND KRVQGG V HL NN+TSQ +P  WKSSTSKEQ  GPLSSMSYIKQEPSDQV+EQNKTQ SNLQGLSSI SMQ+EQVNTTPGI KD FEKQ
Subjt:  FDRQVTINDSKRVQGGTVPHLHNNITSQQNPVNWKSSTSKEQNAGPLSSMSYIKQEPSDQVAEQNKTQLSNLQGLSSISSMQAEQVNTTPGIVKDSFEKQ

Query:  SSKMAFPTANNVMPPTSTNAANSISSDAISLHDSSAMLSSQVPSTATPGMHNRAPQKKVTIGQKKPLDALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA
        +SK+ FPT+NNVMP +STNAAN ISSDA SL +SSA+LSSQVPS  TPGM NRAPQKK  +GQKK L+ALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA
Subjt:  SSKMAFPTANNVMPPTSTNAANSISSDAISLHDSSAMLSSQVPSTATPGMHNRAPQKKVTIGQKKPLDALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTA

Query:  VSGVNIREEEEQLFSGAKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDVERCLSLCVEERLRGIISNLIRLSKQRVDTEKPRHR
        VSGVNIREEEEQLFS AKEDSRASEASR+VVQEEEERL+LQKAPLQKKL+EIMAK GLK MSNDVE+CLSL VEERLRGIISNLIRLSKQRVDTEKPRHR
Subjt:  VSGVNIREEEEQLFSGAKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDVERCLSLCVEERLRGIISNLIRLSKQRVDTEKPRHR

Query:  TLITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGSGAAGDKEKDEGRMKSVKANKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK
        T ITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGS  AGDK+KDEGRMKSVK NKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK
Subjt:  TLITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGSGAAGDKEKDEGRMKSVKANKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQK

Query:  REGGMDSASGSQSGKDAVRKSSSAAGRHGKDNQEADRKGTTRKFGRNQVIA
        REGGMDSASGSQSGKDAVRKSSSAAGRHGKDNQE DRKGT++KFGRNQ+ A
Subjt:  REGGMDSASGSQSGKDAVRKSSSAAGRHGKDNQEADRKGTTRKFGRNQVIA

SwissProt top hits (e value, % identity, alignment)
F4K4L7 Transcription initiation factor TFIID subunit 4b (e value: 2.4e-153; % identity: 46.11)
Query:  KDETMHSGAAVEAFQAALNRDIEGDAPAASQTSESDAAFPRGSNSTSGLSLQASSQNENTESHVHQDQNFLLKQEQHS--SLMEQERCTSVPENQQQHN-
        +DE+MHSGA V+AFQAALNRDIEG     S T+        G+N        +S Q  +T  +   D N  + Q QHS  S   +E+  S  ENQ QH+ 
Subjt:  KDETMHSGAAVEAFQAALNRDIEGDAPAASQTSESDAAFPRGSNSTSGLSLQASSQNENTESHVHQDQNFLLKQEQHS--SLMEQERCTSVPENQQQHN-

Query:  --ATLLQASKNQPQADHEQGEAEQASVQFSQTAGLQVSDKAPMLVNDSNRMQNRDNESQYLKLQKMSNQQAM-----VSEQASNPINRS-KQVPFASLMP
          A       NQPQ  H  G+  +   Q  Q+ GL +S+K P   N+S+R  N+++ESQY+KLQKMS+QQA      V+    NPINR+ KQVPFA+L+P
Subjt:  --ATLLQASKNQPQADHEQGEAEQASVQFSQTAGLQVSDKAPMLVNDSNRMQNRDNESQYLKLQKMSNQQAM-----VSEQASNPINRS-KQVPFASLMP

Query:  VLMPQLDKDRAMQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPSIGPGAPNFSDARPFAQLHQKGMNSSAVQSYIP
         LM QLDKDRA+QL+TL+ +LK+NE+ K+ F R M+ +VGDQMLR+AV                                  ++L Q   N   +    P
Subjt:  VLMPQLDKDRAMQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPSIGPGAPNFSDARPFAQLHQKGMNSSAVQSYIP

Query:  SSASQGRSSSSYPAIDKNMQSLREVEQRTDGNGNQLTSSSSGTIQERERSSVPVPGLEKQQLHFQQKP---FAMYGNSGNYHPYTGSNINPSLLSLKPQP
        S+    + S S P         R V        NQL SS+SGT+     SSVPV GL K   H  Q P   F MY  SG++H + G N N S  +L+P  
Subjt:  SSASQGRSSSSYPAIDKNMQSLREVEQRTDGNGNQLTSSSSGTIQERERSSVPVPGLEKQQLHFQQKP---FAMYGNSGNYHPYTGSNINPSLLSLKPQP

Query:  HEGQVKQI-------------PQQG-------PNFDRQVTINDSKRVQGGTVPHLHNNITSQQNPVNWKSSTSKEQNAGPLSSMSYIKQEPSDQVAEQNK
        H+  ++ +             P Q        P F+R  ++ND  RVQGG   H  N+           SS       G  SS+S++KQE  DQ  E+N 
Subjt:  HEGQVKQI-------------PQQG-------PNFDRQVTINDSKRVQGGTVPHLHNNITSQQNPVNWKSSTSKEQNAGPLSSMSYIKQEPSDQVAEQNK

Query:  TQLSNLQGLSSISSMQAEQVNTTPGIVKDSFEKQSSKMAFPTANNVMPPTSTNAANSISSDAISLHDSSAMLSSQVP-STATPGMHNRAPQKKVTIGQKK
                 +S++S              +  EK+SS+M   T NN+ P      A+S+S    +  D+S  ++S+ P  T+  G + R P KK ++GQKK
Subjt:  TQLSNLQGLSSISSMQAEQVNTTPGIVKDSFEKQSSKMAFPTANNVMPPTSTNAANSISSDAISLHDSSAMLSSQVP-STATPGMHNRAPQKKVTIGQKK

Query:  PLDALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTAVSGVNIREEEEQLFSGAKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDV
        PL+ LGSSPP  SKKQKV+G   DQSIEQLNDVTAVSGVN+REEEEQLFSGAKED R SEASRRVV EEEERLILQK PLQ+KL EIMAK GLK +SNDV
Subjt:  PLDALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTAVSGVNIREEEEQLFSGAKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDV

Query:  ERCLSLCVEERLRGIISNLIRLSKQRVDTEKPRHRTLITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGSGAA-GDKEKDEGRMKSVKANKEE
        ERCLSLCVEER+RG++S++IRLSKQRVD EK RHRT ITSD+R QI  +NQK +EEWEKKQAE EKL+K ++ E+G G    +K+K++ R K VK NKE+
Subjt:  ERCLSLCVEERLRGIISNLIRLSKQRVDTEKPRHRTLITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGSGAA-GDKEKDEGRMKSVKANKEE

Query:  DDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQKREGGMDSASGSQSGKDAVRKSSSAAGRHGKDNQEADRK
        DDKMRTTAANVAARAAVGGDD   KWQLMAE ARQK        S S++GKD  +K++S  G++ KD Q+  R+
Subjt:  DDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQKREGGMDSASGSQSGKDAVRKSSSAAGRHGKDNQEADRK

O81361 40S ribosomal protein S8    1.4e-100    88.43
Query:  ISRDSMHKRRATGGKKKAWRKKRKYELGRQPANTKLSSDKSVRRVRVRGGNVKWRAFRLDTGNYSWGSEAVTRKTRILDVVYNASNNELVRTQTLVKSAI
        ISRDSMHKRRATGGKKKAWRKKRKYELGRQPANTKLSS+K+VRR+RVRGGNVKWRA RLDTGN+SWGSEAVTRKTR+LDVVYNASNNELVRTQTLVKSAI
Subjt:  ISRDSMHKRRATGGKKKAWRKKRKYELGRQPANTKLSSDKSVRRVRVRGGNVKWRAFRLDTGNYSWGSEAVTRKTRILDVVYNASNNELVRTQTLVKSAI

Query:  VQVDAAPFKQWYLQHYGVDIGRKKKTLASAKKE--EEGDAATEEVKKSNHVQRKLEKRQQDRKLDAHIEEQFSSGRLMACISSRPGQCGRADGYILEGKE
        VQVDAAPFKQWYLQHYGV+IGRKKKT A+AKK+  EEG+A TEE KKSNHV  KLEKRQQ R LDAHIEEQF  G+L+ACISSRPGQCG+ADG ILEGKE
Subjt:  VQVDAAPFKQWYLQHYGVDIGRKKKTLASAKKE--EEGDAATEEVKKSNHVQRKLEKRQQDRKLDAHIEEQFSSGRLMACISSRPGQCGRADGYILEGKE

Query:  LEFYMKKLQRKKGKGA
        LEFYMKKLQRKKGKGA
Subjt:  LEFYMKKLQRKKGKGA

P49199 40S ribosomal protein S8    4.3e-102    90.32
Query:  ISRDSMHKRRATGGKKKAWRKKRKYELGRQPANTKLSSDKSVRRVRVRGGNVKWRAFRLDTGNYSWGSEAVTRKTRILDVVYNASNNELVRTQTLVKSAI
        ISRDSMHKRRATGGK+KAWRKKRKYELGRQPANTKLSS+K+VRRVRVRGGNVKWRA RLDTGNYSWGSEAVTRKTRILDVVYNASNNELVRTQTLVKSAI
Subjt:  ISRDSMHKRRATGGKKKAWRKKRKYELGRQPANTKLSSDKSVRRVRVRGGNVKWRAFRLDTGNYSWGSEAVTRKTRILDVVYNASNNELVRTQTLVKSAI

Query:  VQVDAAPFKQWYLQHYGVDIGRKKKTLASAKKEEEG---DAATEEVKKSNHVQRKLEKRQQDRKLDAHIEEQFSSGRLMACISSRPGQCGRADGYILEGK
        VQVDAAPFKQWYL HYGVDIGRKKK  A AKK+ EG   +A TEE KKSNHV RKLEKRQQ R LDAHIEEQF SGRL+ACISSRPGQCGRADGYILEGK
Subjt:  VQVDAAPFKQWYLQHYGVDIGRKKKTLASAKKEEEG---DAATEEVKKSNHVQRKLEKRQQDRKLDAHIEEQFSSGRLMACISSRPGQCGRADGYILEGK

Query:  ELEFYMKKLQRKKGKGA
        ELEFYMKKLQRKKGKGA
Subjt:  ELEFYMKKLQRKKGKGA

Q08069 40S ribosomal protein S8    1.4e-100    88.02
Query:  ISRDSMHKRRATGGKKKAWRKKRKYELGRQPANTKLSSDKSVRRVRVRGGNVKWRAFRLDTGNYSWGSEAVTRKTRILDVVYNASNNELVRTQTLVKSAI
        ISRDSMHKRRATGGK+KAWRKKRKYELGRQPANTKLSS+K+VRRVRVRGGNVKWRA RLDTGNYSWGSEAVTRKTRILDVVYNASNNELVRTQTLVKSAI
Subjt:  ISRDSMHKRRATGGKKKAWRKKRKYELGRQPANTKLSSDKSVRRVRVRGGNVKWRAFRLDTGNYSWGSEAVTRKTRILDVVYNASNNELVRTQTLVKSAI

Query:  VQVDAAPFKQWYLQHYGVDIGRKKKTLASAKKEEEG---DAATEEVKKSNHVQRKLEKRQQDRKLDAHIEEQFSSGRLMACISSRPGQCGRADGYILEGK
        VQVDAAPFKQWYL HYGVDIGRKKKT A+ K   EG   +AA EE KKSNHV RKLEKR++ R LD HIEEQF SGRL+ACISSRPGQCGRADGYILEGK
Subjt:  VQVDAAPFKQWYLQHYGVDIGRKKKTLASAKKEEEG---DAATEEVKKSNHVQRKLEKRQQDRKLDAHIEEQFSSGRLMACISSRPGQCGRADGYILEGK

Query:  ELEFYMKKLQRKKGKGA
        ELEFYMKKLQRKKGK A
Subjt:  ELEFYMKKLQRKKGKGA

Q93VG5 40S ribosomal protein S8-1    1.2e-96    81.74
Query:  ISRDSMHKRRATGGKKKAWRKKRKYELGRQPANTKLSSDKSVRRVRVRGGNVKWRAFRLDTGNYSWGSEAVTRKTRILDVVYNASNNELVRTQTLVKSAI
        ISRDS+HKRRATGGK+K WRKKRKYE+GRQPANTKLSS+K+VRR+RVRGGNVKWRA RLDTGNYSWGSEA TRKTR+LDVVYNASNNELVRT+TLVKSAI
Subjt:  ISRDSMHKRRATGGKKKAWRKKRKYELGRQPANTKLSSDKSVRRVRVRGGNVKWRAFRLDTGNYSWGSEAVTRKTRILDVVYNASNNELVRTQTLVKSAI

Query:  VQVDAAPFKQWYLQHYGVDIGRKKKTLASAKKE-EEGD----AATEEVKKSNHVQRKLEKRQQDRKLDAHIEEQFSSGRLMACISSRPGQCGRADGYILE
        VQVDAAPFKQWYL HYGV++GRKKK+ +S KK+ EEG+    AA EEVKKSNH+ RK+  RQ+ R LD+HIE+QF+SGRL+ACISSRPGQCGRADGYILE
Subjt:  VQVDAAPFKQWYLQHYGVDIGRKKKTLASAKKE-EEGD----AATEEVKKSNHVQRKLEKRQQDRKLDAHIEEQFSSGRLMACISSRPGQCGRADGYILE

Query:  GKELEFYMKKLQRKKGKGA
        GKELEFYMKK+Q+KKGKGA
Subjt:  GKELEFYMKKLQRKKGKGA

Arabidopsis top hits    e value    %identity    Alignment
AT1G27720.1 TBP-associated factor 4B    1.6e-91    37.95
Query:  VNDSNRMQNRDNESQYLKLQKMSNQQAMVSEQASNPINRSKQVPFASLMPVLMPQLDKD--RAMQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAV
        +N++   ++   + +Y+KLQKMS+++    E+  +P+N + ++  A +  +L   +D    +      L  KLKR E+  ++F+R +R +VGDQ++R  +
Subjt:  VNDSNRMQNRDNESQYLKLQKMSNQQAMVSEQASNPINRSKQVPFASLMPVLMPQLDKD--RAMQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAV

Query:  CQVQAQPPPSVRQLSPRMPSIGPGAPNFSDARPFAQLHQKGMNSSAVQSYIPSSASQGRSSSSYPAIDKNMQSLREVEQRTDGNGNQLTSSSSGTIQERE
         Q               +P + PG  N     P    H K            S +++  +  S P         REV      + NQL+S++SGT+    
Subjt:  CQVQAQPPPSVRQLSPRMPSIGPGAPNFSDARPFAQLHQKGMNSSAVQSYIPSSASQGRSSSSYPAIDKNMQSLREVEQRTDGNGNQLTSSSSGTIQERE

Query:  RSSVPVPGLEK---QQLHFQQKPFAMYGNSGNYHPYTGSNINPSLLSLKPQPHEGQVKQIPQQ------------------GPNFDRQVTINDSKRVQGG
         SS  V GL K   Q +      F M   SG+ +PY G+N+     S + +  + Q ++  Q                    P F+R   +N   RVQ G
Subjt:  RSSVPVPGLEK---QQLHFQQKPFAMYGNSGNYHPYTGSNINPSLLSLKPQPHEGQVKQIPQQ------------------GPNFDRQVTINDSKRVQGG

Query:  TVPHLHNNITSQQNPVNWKSSTSKEQNAGPLSSMSYIKQEPSDQVAEQ-NKTQLSNLQGLSSISSMQAEQVNTTPGIVKDSFEKQSSKMAFPTANNVMPP
         +     N +       W+ S +K+   GP SS+ +++ +  DQ  EQ +K +    QG   ++++  +Q N  P    D  EKQSSKM           
Subjt:  TVPHLHNNITSQQNPVNWKSSTSKEQNAGPLSSMSYIKQEPSDQVAEQ-NKTQLSNLQGLSSISSMQAEQVNTTPGIVKDSFEKQSSKMAFPTANNVMPP

Query:  TSTNAANSISSDAISLHDSSAMLSSQVPSTATPGMHN-RAPQKKVTIGQKKPLDALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTAVSGVNIREEEEQLF
        TST +A+S+     +  DSS M++   PS   P + N     K  ++GQKKPL+ALGSS P S KKQK+ G  SD+SIE+ NDVTAVSG+N+REEE+QL 
Subjt:  TSTNAANSISSDAISLHDSSAMLSSQVPSTATPGMHN-RAPQKKVTIGQKKPLDALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTAVSGVNIREEEEQLF

Query:  -SGAKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDVERCLSLCVEERLRGIISNLIRLSKQRVDTEKPRHRTLITSDVRQQIML
         SG K++ R S+A RR+V  EEER +LQK PLQ+KL EIM K GLK + +DVERCLSLCVEER+RG++ N+IR+SKQR D EK R+RT ITSD+R++I  
Subjt:  -SGAKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDVERCLSLCVEERLRGIISNLIRLSKQRVDTEKPRHRTLITSDVRQQIML

Query:  VNQKAREEWEKKQAEEEKLRKLNDPEDGSGAAGDKEKDEGRMKSVKANKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQKREGGMDSASGSQS
        +NQK +EEWEKK + EEK ++            D EK++ R   VKANK+++DK R  AANVA RAAVGGDD  SKW+LMAE ARQ+   G    S   S
Subjt:  VNQKAREEWEKKQAEEEKLRKLNDPEDGSGAAGDKEKDEGRMKSVKANKEEDDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQKREGGMDSASGSQS

Query:  G
        G
Subjt:  G

AT5G20290.1 Ribosomal protein S8e family protein    8.6e-98    81.74
Query:  ISRDSMHKRRATGGKKKAWRKKRKYELGRQPANTKLSSDKSVRRVRVRGGNVKWRAFRLDTGNYSWGSEAVTRKTRILDVVYNASNNELVRTQTLVKSAI
        ISRDS+HKRRATGGK+K WRKKRKYE+GRQPANTKLSS+K+VRR+RVRGGNVKWRA RLDTGNYSWGSEA TRKTR+LDVVYNASNNELVRT+TLVKSAI
Subjt:  ISRDSMHKRRATGGKKKAWRKKRKYELGRQPANTKLSSDKSVRRVRVRGGNVKWRAFRLDTGNYSWGSEAVTRKTRILDVVYNASNNELVRTQTLVKSAI

Query:  VQVDAAPFKQWYLQHYGVDIGRKKKTLASAKKE-EEGD----AATEEVKKSNHVQRKLEKRQQDRKLDAHIEEQFSSGRLMACISSRPGQCGRADGYILE
        VQVDAAPFKQWYL HYGV++GRKKK+ +S KK+ EEG+    AA EEVKKSNH+ RK+  RQ+ R LD+HIE+QF+SGRL+ACISSRPGQCGRADGYILE
Subjt:  VQVDAAPFKQWYLQHYGVDIGRKKKTLASAKKE-EEGD----AATEEVKKSNHVQRKLEKRQQDRKLDAHIEEQFSSGRLMACISSRPGQCGRADGYILE

Query:  GKELEFYMKKLQRKKGKGA
        GKELEFYMKK+Q+KKGKGA
Subjt:  GKELEFYMKKLQRKKGKGA

AT5G43130.1 TBP-associated factor 4    1.5e-155    46.11
Query:  KDETMHSGAAVEAFQAALNRDIEGDAPAASQTSESDAAFPRGSNSTSGLSLQASSQNENTESHVHQDQNFLLKQEQHS--SLMEQERCTSVPENQQQHN-
        +DE+MHSGA V+AFQAALNRDIEG     S T+        G+N        +S Q  +T  +   D N  + Q QHS  S   +E+  S  ENQ QH+ 
Subjt:  KDETMHSGAAVEAFQAALNRDIEGDAPAASQTSESDAAFPRGSNSTSGLSLQASSQNENTESHVHQDQNFLLKQEQHS--SLMEQERCTSVPENQQQHN-

Query:  --ATLLQASKNQPQADHEQGEAEQASVQFSQTAGLQVSDKAPMLVNDSNRMQNRDNESQYLKLQKMSNQQAM-----VSEQASNPINRS-KQVPFASLMP
          A       NQPQ  H  G+  +   Q  Q+ GL +S+K P   N+S+R  N+++ESQY+KLQKMS+QQA      V+    NPINR+ KQVPFA+L+P
Subjt:  --ATLLQASKNQPQADHEQGEAEQASVQFSQTAGLQVSDKAPMLVNDSNRMQNRDNESQYLKLQKMSNQQAM-----VSEQASNPINRS-KQVPFASLMP

Query:  VLMPQLDKDRAMQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPSIGPGAPNFSDARPFAQLHQKGMNSSAVQSYIP
         LM QLDKDRA+QL+TL+ +LK+NE+ K+ F R M+ +VGDQMLR+AV                                  ++L Q   N   +    P
Subjt:  VLMPQLDKDRAMQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPSIGPGAPNFSDARPFAQLHQKGMNSSAVQSYIP

Query:  SSASQGRSSSSYPAIDKNMQSLREVEQRTDGNGNQLTSSSSGTIQERERSSVPVPGLEKQQLHFQQKP---FAMYGNSGNYHPYTGSNINPSLLSLKPQP
        S+    + S S P         R V        NQL SS+SGT+     SSVPV GL K   H  Q P   F MY  SG++H + G N N S  +L+P  
Subjt:  SSASQGRSSSSYPAIDKNMQSLREVEQRTDGNGNQLTSSSSGTIQERERSSVPVPGLEKQQLHFQQKP---FAMYGNSGNYHPYTGSNINPSLLSLKPQP

Query:  HEGQVKQI-------------PQQG-------PNFDRQVTINDSKRVQGGTVPHLHNNITSQQNPVNWKSSTSKEQNAGPLSSMSYIKQEPSDQVAEQNK
        H+  ++ +             P Q        P F+R  ++ND  RVQGG   H  N+           SS       G  SS+S++KQE  DQ  E+N 
Subjt:  HEGQVKQI-------------PQQG-------PNFDRQVTINDSKRVQGGTVPHLHNNITSQQNPVNWKSSTSKEQNAGPLSSMSYIKQEPSDQVAEQNK

Query:  TQLSNLQGLSSISSMQAEQVNTTPGIVKDSFEKQSSKMAFPTANNVMPPTSTNAANSISSDAISLHDSSAMLSSQVP-STATPGMHNRAPQKKVTIGQKK
                 +S++S              +  EK+SS+M   T NN+ P      A+S+S    +  D+S  ++S+ P  T+  G + R P KK ++GQKK
Subjt:  TQLSNLQGLSSISSMQAEQVNTTPGIVKDSFEKQSSKMAFPTANNVMPPTSTNAANSISSDAISLHDSSAMLSSQVP-STATPGMHNRAPQKKVTIGQKK

Query:  PLDALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTAVSGVNIREEEEQLFSGAKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDV
        PL+ LGSSPP  SKKQKV+G   DQSIEQLNDVTAVSGVN+REEEEQLFSGAKED R SEASRRVV EEEERLILQK PLQ+KL EIMAK GLK +SNDV
Subjt:  PLDALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTAVSGVNIREEEEQLFSGAKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDV

Query:  ERCLSLCVEERLRGIISNLIRLSKQRVDTEKPRHRTLITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGSGAA-GDKEKDEGRMKSVKANKEE
        ERCLSLCVEER+RG++S++IRLSKQRVD EK RHRT ITSD+R QI  +NQK +EEWEKKQAE EKL+K ++ E+G G    +K+K++ R K VK NKE+
Subjt:  ERCLSLCVEERLRGIISNLIRLSKQRVDTEKPRHRTLITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGSGAA-GDKEKDEGRMKSVKANKEE

Query:  DDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQKREGGMDSASGSQSGKDAVRKSSSAAGRHGKDNQEADRK--GT-TRKFGRNQ
        DDKMRTTAANVAARAAVGGDD   KWQLMAE ARQK        S S++GKD  +K++S  G++ KD Q+  R+  GT  R+ G+NQ
Subjt:  DDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQKREGGMDSASGSQSGKDAVRKSSSAAGRHGKDNQEADRK--GT-TRKFGRNQ

AT5G43130.2 TBP-associated factor 4    1.7e-154    46.11
Query:  KDETMHSGAAVEAFQAALNRDIEGDAPAASQTSESDAAFPRGSNSTSGLSLQASSQNENTESHVHQDQNFLLKQEQHS--SLMEQERCTSVPENQQQHN-
        +DE+MHSGA V+AFQAALNRDIEG     S T+        G+N        +S Q  +T  +   D N  + Q QHS  S   +E+  S  ENQ QH+ 
Subjt:  KDETMHSGAAVEAFQAALNRDIEGDAPAASQTSESDAAFPRGSNSTSGLSLQASSQNENTESHVHQDQNFLLKQEQHS--SLMEQERCTSVPENQQQHN-

Query:  --ATLLQASKNQPQADHEQGEAEQASVQFSQTAGLQVSDKAPMLVNDSNRMQNRDNESQYLKLQKMSNQQAM-----VSEQASNPINRS-KQVPFASLMP
          A       NQPQ  H  G+  +   Q  Q+ GL +S+K P   N+S+R  N+++ESQY+KLQKMS+QQA      V+    NPINR+ KQVPFA+L+P
Subjt:  --ATLLQASKNQPQADHEQGEAEQASVQFSQTAGLQVSDKAPMLVNDSNRMQNRDNESQYLKLQKMSNQQAM-----VSEQASNPINRS-KQVPFASLMP

Query:  VLMPQLDKDRAMQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPSIGPGAPNFSDARPFAQLHQKGMNSSAVQSYIP
         LM QLDKDRA+QL+TL+ +LK+NE+ K+ F R M+ +VGDQMLR+AV                                  ++L Q   N   +    P
Subjt:  VLMPQLDKDRAMQLQTLFNKLKRNEMNKDDFIRLMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPSIGPGAPNFSDARPFAQLHQKGMNSSAVQSYIP

Query:  SSASQGRSSSSYPAIDKNMQSLREVEQRTDGNGNQLTSSSSGTIQERERSSVPVPGLEKQQLHFQQKP---FAMYGNSGNYHPYTGSNINPSLLSLKPQP
        S+    + S S P         R V        NQL SS+SGT+     SSVPV GL K   H  Q P   F MY  SG++H + G N N S  +L+P  
Subjt:  SSASQGRSSSSYPAIDKNMQSLREVEQRTDGNGNQLTSSSSGTIQERERSSVPVPGLEKQQLHFQQKP---FAMYGNSGNYHPYTGSNINPSLLSLKPQP

Query:  HEGQVKQI-------------PQQG-------PNFDRQVTINDSKRVQGGTVPHLHNNITSQQNPVNWKSSTSKEQNAGPLSSMSYIKQEPSDQVAEQNK
        H+  ++ +             P Q        P F+R  ++ND  RVQGG   H  N+           SS       G  SS+S++KQE  DQ  E+N 
Subjt:  HEGQVKQI-------------PQQG-------PNFDRQVTINDSKRVQGGTVPHLHNNITSQQNPVNWKSSTSKEQNAGPLSSMSYIKQEPSDQVAEQNK

Query:  TQLSNLQGLSSISSMQAEQVNTTPGIVKDSFEKQSSKMAFPTANNVMPPTSTNAANSISSDAISLHDSSAMLSSQVP-STATPGMHNRAPQKKVTIGQKK
                 +S++S              +  EK+SS+M   T NN+ P      A+S+S    +  D+S  ++S+ P  T+  G + R P KK ++GQKK
Subjt:  TQLSNLQGLSSISSMQAEQVNTTPGIVKDSFEKQSSKMAFPTANNVMPPTSTNAANSISSDAISLHDSSAMLSSQVP-STATPGMHNRAPQKKVTIGQKK

Query:  PLDALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTAVSGVNIREEEEQLFSGAKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDV
        PL+ LGSSPP  SKKQKV+G   DQSIEQLNDVTAVSGVN+REEEEQLFSGAKED R SEASRRVV EEEERLILQK PLQ+KL EIMAK GLK +SNDV
Subjt:  PLDALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTAVSGVNIREEEEQLFSGAKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMSNDV

Query:  ERCLSLCVEERLRGIISNLIRLSKQRVDTEKPRHRTLITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGSGAA-GDKEKDEGRMKSVKANKEE
        ERCLSLCVEER+RG++S++IRLSKQRVD EK RHRT ITSD+R QI  +NQK +EEWEKKQAE EKL+K ++ E+G G    +K+K++ R K VK NKE+
Subjt:  ERCLSLCVEERLRGIISNLIRLSKQRVDTEKPRHRTLITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGSGAA-GDKEKDEGRMKSVKANKEE

Query:  DDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQKREGGMDSASGSQSGKDAVRKSSSAAGRHGKDNQEADRK
        DDKMRTTAANVAARAAVGGDD   KWQLMAE ARQK        S S++GKD  +K++S  G++ KD Q+  R+
Subjt:  DDKMRTTAANVAARAAVGGDDMLSKWQLMAEQARQKREGGMDSASGSQSGKDAVRKSSSAAGRHGKDNQEADRK

AT5G59240.1 Ribosomal protein S8e family protein    2.1e-96    84.58
Query:  ISRDSMHKRRATGGKKKAWRKKRKYELGRQPANTKLSSDKSVRRVRVRGGNVKWRAFRLDTGNYSWGSEAVTRKTRILDVVYNASNNELVRTQTLVKSAI
        ISRDS+HKRRATGGK+K WRKKRKYELGRQPANTKLSS+K+VRR+RVRGGNVKWRA RLDTGN+SWGSEAVTRKTRILDV YNASNNELVRTQTLVKSAI
Subjt:  ISRDSMHKRRATGGKKKAWRKKRKYELGRQPANTKLSSDKSVRRVRVRGGNVKWRAFRLDTGNYSWGSEAVTRKTRILDVVYNASNNELVRTQTLVKSAI

Query:  VQVDAAPFKQWYLQHYGVDIGRKKKTLASAKKEEEGDAATEEVKKSNHVQRKLEKRQQDRKLDAHIEEQFSSGRLMACISSRPGQCGRADGYILEGKELE
        VQVDAAPFKQ YLQHYGVDIGRKKK  A           TEEVKKSNHVQRKLE RQ+ R LD+H+EEQFSSGRL+ACI+SRPGQCGRADGYILEGKELE
Subjt:  VQVDAAPFKQWYLQHYGVDIGRKKKTLASAKKEEEGDAATEEVKKSNHVQRKLEKRQQDRKLDAHIEEQFSSGRLMACISSRPGQCGRADGYILEGKELE

Query:  FYMKKLQRKKGKGA
        FYMKKLQ+KKGK A
Subjt:  FYMKKLQRKKGKGA


Sequences
CDS sequence
ATGGATGGAAAATTTGATTGGGATGTGCAGAGAAAAATGATATCAAGAACCTCTTCTGCTGGCTGTTCTTCTCGCAGCATTTATTACTGTGGAACAGCTGAAGGAGTCCC
TTTCAAATGGGAAACACAGCCAGGAACTCCCAAGGATGCTCCCCCTCAAGAAAAAATGCTTCCACCCCTCAGCCCTCCGCCGGCTGTTCTCAGCCTCCGATTACCGAAAC
CGGGCGTCGTCGAACAACCGAAACCCCGGCCCCGGATGAAGCTTAGGTTTTGGAGGAAGAGCAAGAAAAGTAGGGACAGTGAGAGACCTGCCCAAGCAACAACTTTAGAC
TATAGCAATGGACATTTTTTCAGTCACTCTAATAAGTTGGAAACTTTCTCATTTCTTAGCTCTGACTGTGAGTTTATGGCGTCTCCTCGGGATTCAATGTCGTTGTCTTC
GTCTTCGTCTTCGTCTTCTTCCTCGTTGCCATTGTCTTTGCTCGAGTCGTTAAATAATTTTGGCTCCGTCGTGGGTGACGTGACGCCATTGATGCTTCTTCTTCTTCTCC
TTCTGCAAACAGCAAAGGCCGTTCGATTTCTGCAACACAGGGAAGCGTCTCCAGACTGCCTTCGCCTGCGCCACCGAAGTCGTATCATTGGATTTCCTGTTGGCAAGGAT
GAAACAATGCATTCAGGTGCTGCTGTGGAGGCTTTTCAGGCTGCCTTGAATCGAGACATAGAAGGTGATGCTCCGGCTGCTTCCCAGACTTCTGAATCAGATGCAGCTTT
CCCTCGAGGGAGCAATAGCACTTCTGGCCTCTCGCTACAAGCTTCTAGCCAAAATGAAAATACTGAATCTCATGTACACCAAGATCAAAATTTTCTTCTGAAGCAAGAAC
AACATTCATCTCTGATGGAACAAGAGCGATGTACATCAGTACCTGAAAATCAGCAGCAACATAATGCTACCCTCCTCCAAGCTTCAAAGAACCAACCCCAAGCAGATCAT
GAACAAGGGGAGGCAGAACAAGCTTCTGTTCAATTTTCTCAAACAGCAGGGCTACAAGTTTCGGACAAAGCTCCAATGCTTGTGAATGACTCAAACAGAATGCAAAATCG
GGACAATGAATCTCAATACCTGAAGCTACAAAAGATGAGTAATCAGCAGGCAATGGTCTCAGAGCAGGCAAGCAACCCAATAAATCGTAGTAAACAGGTACCATTTGCCT
CGTTGATGCCTGTACTGATGCCTCAGCTTGATAAAGACAGAGCCATGCAGCTTCAGACTTTATTTAACAAATTGAAGAGGAATGAGATGAATAAAGATGATTTTATCCGG
CTCATGAGGGGTGTTGTTGGCGATCAGATGCTCAGATTAGCAGTTTGTCAAGTGCAAGCGCAGCCTCCACCTTCAGTAAGGCAACTGTCTCCTAGAATGCCATCCATTGG
TCCTGGTGCACCAAATTTCTCTGATGCGCGACCATTTGCACAACTTCATCAGAAAGGCATGAATTCTTCTGCAGTTCAATCATACATCCCCTCTTCAGCATCCCAAGGGC
GGAGTAGTTCAAGCTATCCTGCCATAGACAAGAATATGCAATCTTTACGAGAAGTAGAACAGCGAACTGATGGTAATGGAAATCAATTGACTTCTTCCAGTAGTGGCACC
ATTCAGGAAAGGGAACGCTCCTCAGTTCCTGTACCAGGACTCGAGAAGCAGCAGTTACACTTCCAACAGAAACCTTTTGCCATGTATGGAAACAGTGGTAATTATCACCC
ATACACAGGGTCGAACATAAATCCCTCCTTATTGTCTCTTAAACCCCAACCTCACGAGGGCCAAGTGAAGCAAATTCCGCAGCAGGGTCCCAATTTTGACAGGCAAGTTA
CCATAAATGATTCCAAGCGAGTGCAGGGTGGAACTGTTCCGCACTTGCATAACAACATAACTTCACAGCAAAATCCAGTTAACTGGAAATCCTCAACAAGTAAAGAGCAG
AACGCTGGCCCTTTGTCATCAATGAGTTATATAAAACAAGAACCTTCTGATCAGGTTGCTGAGCAGAACAAAACCCAACTCTCAAATTTGCAGGGATTGTCTTCTATTTC
TAGCATGCAGGCTGAACAAGTTAACACAACCCCAGGAATTGTGAAGGACTCTTTTGAAAAACAATCTTCCAAAATGGCTTTTCCTACAGCTAACAATGTTATGCCTCCAA
CATCTACCAATGCGGCAAATTCAATTTCTTCTGATGCAATATCTCTACATGACTCTAGTGCTATGTTAAGCTCTCAGGTTCCTTCTACAGCTACTCCTGGAATGCATAAT
AGGGCACCTCAAAAAAAAGTAACCATTGGCCAGAAGAAGCCCCTTGATGCCCTTGGTTCTTCACCACCCCTGTCAAGTAAGAAGCAAAAAGTATCTGGGGCATTTTCAGA
TCAAAGTATTGAACAACTTAACGATGTCACTGCAGTCAGTGGAGTTAATATTCGGGAAGAAGAAGAACAGCTATTTTCTGGTGCAAAGGAGGATAGTCGAGCTTCAGAAG
CATCACGAAGGGTTGTACAAGAAGAAGAAGAGAGGCTGATATTGCAGAAAGCTCCCCTGCAGAAAAAGCTGGTGGAAATCATGGCAAAATGTGGTTTGAAGGGTATGAGC
AATGATGTTGAGCGATGTCTCTCGTTGTGTGTGGAGGAAAGATTACGTGGGATTATTAGTAATCTAATCAGGCTGTCAAAGCAGCGAGTGGACACCGAGAAACCAAGGCA
CCGAACACTTATTACCTCAGATGTTCGACAACAAATCATGTTAGTTAACCAAAAGGCCAGAGAAGAATGGGAGAAGAAACAGGCTGAAGAAGAAAAACTCCGGAAGCTTA
ATGATCCTGAGGATGGTTCTGGGGCTGCTGGCGACAAGGAGAAAGATGAAGGTCGGATGAAATCAGTTAAGGCAAACAAAGAGGAGGATGATAAAATGCGGACAACAGCA
GCAAATGTTGCTGCCCGTGCTGCTGTTGGAGGAGATGACATGCTATCAAAATGGCAACTTATGGCTGAGCAGGCGCGACAGAAACGCGAAGGTGGAATGGATTCAGCTTC
TGGTTCTCAGTCGGGGAAAGATGCAGTCCGTAAGTCTTCATCAGCAGCAGGAAGACATGGGAAGGACAACCAAGAAGCCGATAGAAAAGGAACCACCAGGAAATTTGGAA
GGAACCAAGTTATTGCAATCAAACTAGGGTGGCTCGTAGTATTTCTGTCAAGGATGTCATTGCTGTGCTGCAAAGAGAACCCCAGATGTCCAGATCCACCACAATATATC
ATTTGTGCAACCGAATTCGTTCTGAAGCCACTAGTGAATAGAATCGACCATAATGATGGAAACTGCATCCTGCAGATGTGCGCTTGTGGGGGTTTTGTTTTTCAGTTTGA
AATGAATCCTAGACTCCTTTTAACGCCAAGTGCTCTCGTCTCACCAAGATGGGTATGTATCTCTCGTGATTCTATGCACAAGAGGCGTGCCACTGGAGGCAAGAAGAAGG
CTTGGAGGAAGAAGAGAAAGTATGAGCTCGGAAGGCAACCTGCTAACACTAAGTTATCCAGTGACAAGTCAGTCAGGAGGGTTAGAGTTCGTGGGGGTAATGTGAAGTGG
CGCGCTTTTAGGCTTGATACTGGAAACTACTCTTGGGGTAGTGAAGCTGTGACTAGAAAGACCCGTATTCTTGATGTGGTGTACAATGCATCTAACAACGAGCTTGTTCG
TACTCAAACTTTGGTGAAGAGTGCCATCGTTCAAGTTGATGCAGCACCATTTAAGCAGTGGTATCTTCAGCATTATGGAGTGGATATCGGACGGAAGAAGAAGACCTTGG
CATCTGCTAAGAAGGAAGAGGAAGGTGATGCTGCCACTGAGGAGGTGAAGAAGAGTAACCATGTCCAGCGCAAGCTAGAGAAACGTCAGCAGGACCGTAAACTTGACGCA
CATATTGAAGAACAATTCAGCAGCGGTCGTTTGATGGCTTGTATTTCATCAAGACCCGGTCAATGTGGCCGAGCAGATGGATATATATTGGAGGGCAAAGAGCTTGAGTT
CTATATGAAGAAGCTCCAAAGAAAGAAGGGAAAAGGAGCTGACGATGAGCTGTTGCAGCTGAGAGAAACAGCTATTTTGGAACTTCAGTTTTGCTGGTATTAG
mRNA sequence
ATGGATGGAAAATTTGATTGGGATGTGCAGAGAAAAATGATATCAAGAACCTCTTCTGCTGGCTGTTCTTCTCGCAGCATTTATTACTGTGGAACAGCTGAAGGAGTCCC
TTTCAAATGGGAAACACAGCCAGGAACTCCCAAGGATGCTCCCCCTCAAGAAAAAATGCTTCCACCCCTCAGCCCTCCGCCGGCTGTTCTCAGCCTCCGATTACCGAAAC
CGGGCGTCGTCGAACAACCGAAACCCCGGCCCCGGATGAAGCTTAGGTTTTGGAGGAAGAGCAAGAAAAGTAGGGACAGTGAGAGACCTGCCCAAGCAACAACTTTAGAC
TATAGCAATGGACATTTTTTCAGTCACTCTAATAAGTTGGAAACTTTCTCATTTCTTAGCTCTGACTGTGAGTTTATGGCGTCTCCTCGGGATTCAATGTCGTTGTCTTC
GTCTTCGTCTTCGTCTTCTTCCTCGTTGCCATTGTCTTTGCTCGAGTCGTTAAATAATTTTGGCTCCGTCGTGGGTGACGTGACGCCATTGATGCTTCTTCTTCTTCTCC
TTCTGCAAACAGCAAAGGCCGTTCGATTTCTGCAACACAGGGAAGCGTCTCCAGACTGCCTTCGCCTGCGCCACCGAAGTCGTATCATTGGATTTCCTGTTGGCAAGGAT
GAAACAATGCATTCAGGTGCTGCTGTGGAGGCTTTTCAGGCTGCCTTGAATCGAGACATAGAAGGTGATGCTCCGGCTGCTTCCCAGACTTCTGAATCAGATGCAGCTTT
CCCTCGAGGGAGCAATAGCACTTCTGGCCTCTCGCTACAAGCTTCTAGCCAAAATGAAAATACTGAATCTCATGTACACCAAGATCAAAATTTTCTTCTGAAGCAAGAAC
AACATTCATCTCTGATGGAACAAGAGCGATGTACATCAGTACCTGAAAATCAGCAGCAACATAATGCTACCCTCCTCCAAGCTTCAAAGAACCAACCCCAAGCAGATCAT
GAACAAGGGGAGGCAGAACAAGCTTCTGTTCAATTTTCTCAAACAGCAGGGCTACAAGTTTCGGACAAAGCTCCAATGCTTGTGAATGACTCAAACAGAATGCAAAATCG
GGACAATGAATCTCAATACCTGAAGCTACAAAAGATGAGTAATCAGCAGGCAATGGTCTCAGAGCAGGCAAGCAACCCAATAAATCGTAGTAAACAGGTACCATTTGCCT
CGTTGATGCCTGTACTGATGCCTCAGCTTGATAAAGACAGAGCCATGCAGCTTCAGACTTTATTTAACAAATTGAAGAGGAATGAGATGAATAAAGATGATTTTATCCGG
CTCATGAGGGGTGTTGTTGGCGATCAGATGCTCAGATTAGCAGTTTGTCAAGTGCAAGCGCAGCCTCCACCTTCAGTAAGGCAACTGTCTCCTAGAATGCCATCCATTGG
TCCTGGTGCACCAAATTTCTCTGATGCGCGACCATTTGCACAACTTCATCAGAAAGGCATGAATTCTTCTGCAGTTCAATCATACATCCCCTCTTCAGCATCCCAAGGGC
GGAGTAGTTCAAGCTATCCTGCCATAGACAAGAATATGCAATCTTTACGAGAAGTAGAACAGCGAACTGATGGTAATGGAAATCAATTGACTTCTTCCAGTAGTGGCACC
ATTCAGGAAAGGGAACGCTCCTCAGTTCCTGTACCAGGACTCGAGAAGCAGCAGTTACACTTCCAACAGAAACCTTTTGCCATGTATGGAAACAGTGGTAATTATCACCC
ATACACAGGGTCGAACATAAATCCCTCCTTATTGTCTCTTAAACCCCAACCTCACGAGGGCCAAGTGAAGCAAATTCCGCAGCAGGGTCCCAATTTTGACAGGCAAGTTA
CCATAAATGATTCCAAGCGAGTGCAGGGTGGAACTGTTCCGCACTTGCATAACAACATAACTTCACAGCAAAATCCAGTTAACTGGAAATCCTCAACAAGTAAAGAGCAG
AACGCTGGCCCTTTGTCATCAATGAGTTATATAAAACAAGAACCTTCTGATCAGGTTGCTGAGCAGAACAAAACCCAACTCTCAAATTTGCAGGGATTGTCTTCTATTTC
TAGCATGCAGGCTGAACAAGTTAACACAACCCCAGGAATTGTGAAGGACTCTTTTGAAAAACAATCTTCCAAAATGGCTTTTCCTACAGCTAACAATGTTATGCCTCCAA
CATCTACCAATGCGGCAAATTCAATTTCTTCTGATGCAATATCTCTACATGACTCTAGTGCTATGTTAAGCTCTCAGGTTCCTTCTACAGCTACTCCTGGAATGCATAAT
AGGGCACCTCAAAAAAAAGTAACCATTGGCCAGAAGAAGCCCCTTGATGCCCTTGGTTCTTCACCACCCCTGTCAAGTAAGAAGCAAAAAGTATCTGGGGCATTTTCAGA
TCAAAGTATTGAACAACTTAACGATGTCACTGCAGTCAGTGGAGTTAATATTCGGGAAGAAGAAGAACAGCTATTTTCTGGTGCAAAGGAGGATAGTCGAGCTTCAGAAG
CATCACGAAGGGTTGTACAAGAAGAAGAAGAGAGGCTGATATTGCAGAAAGCTCCCCTGCAGAAAAAGCTGGTGGAAATCATGGCAAAATGTGGTTTGAAGGGTATGAGC
AATGATGTTGAGCGATGTCTCTCGTTGTGTGTGGAGGAAAGATTACGTGGGATTATTAGTAATCTAATCAGGCTGTCAAAGCAGCGAGTGGACACCGAGAAACCAAGGCA
CCGAACACTTATTACCTCAGATGTTCGACAACAAATCATGTTAGTTAACCAAAAGGCCAGAGAAGAATGGGAGAAGAAACAGGCTGAAGAAGAAAAACTCCGGAAGCTTA
ATGATCCTGAGGATGGTTCTGGGGCTGCTGGCGACAAGGAGAAAGATGAAGGTCGGATGAAATCAGTTAAGGCAAACAAAGAGGAGGATGATAAAATGCGGACAACAGCA
GCAAATGTTGCTGCCCGTGCTGCTGTTGGAGGAGATGACATGCTATCAAAATGGCAACTTATGGCTGAGCAGGCGCGACAGAAACGCGAAGGTGGAATGGATTCAGCTTC
TGGTTCTCAGTCGGGGAAAGATGCAGTCCGTAAGTCTTCATCAGCAGCAGGAAGACATGGGAAGGACAACCAAGAAGCCGATAGAAAAGGAACCACCAGGAAATTTGGAA
GGAACCAAGTTATTGCAATCAAACTAGGGTGGCTCGTAGTATTTCTGTCAAGGATGTCATTGCTGTGCTGCAAAGAGAACCCCAGATGTCCAGATCCACCACAATATATC
ATTTGTGCAACCGAATTCGTTCTGAAGCCACTAGTGAATAGAATCGACCATAATGATGGAAACTGCATCCTGCAGATGTGCGCTTGTGGGGGTTTTGTTTTTCAGTTTGA
AATGAATCCTAGACTCCTTTTAACGCCAAGTGCTCTCGTCTCACCAAGATGGGTATGTATCTCTCGTGATTCTATGCACAAGAGGCGTGCCACTGGAGGCAAGAAGAAGG
CTTGGAGGAAGAAGAGAAAGTATGAGCTCGGAAGGCAACCTGCTAACACTAAGTTATCCAGTGACAAGTCAGTCAGGAGGGTTAGAGTTCGTGGGGGTAATGTGAAGTGG
CGCGCTTTTAGGCTTGATACTGGAAACTACTCTTGGGGTAGTGAAGCTGTGACTAGAAAGACCCGTATTCTTGATGTGGTGTACAATGCATCTAACAACGAGCTTGTTCG
TACTCAAACTTTGGTGAAGAGTGCCATCGTTCAAGTTGATGCAGCACCATTTAAGCAGTGGTATCTTCAGCATTATGGAGTGGATATCGGACGGAAGAAGAAGACCTTGG
CATCTGCTAAGAAGGAAGAGGAAGGTGATGCTGCCACTGAGGAGGTGAAGAAGAGTAACCATGTCCAGCGCAAGCTAGAGAAACGTCAGCAGGACCGTAAACTTGACGCA
CATATTGAAGAACAATTCAGCAGCGGTCGTTTGATGGCTTGTATTTCATCAAGACCCGGTCAATGTGGCCGAGCAGATGGATATATATTGGAGGGCAAAGAGCTTGAGTT
CTATATGAAGAAGCTCCAAAGAAAGAAGGGAAAAGGAGCTGACGATGAGCTGTTGCAGCTGAGAGAAACAGCTATTTTGGAACTTCAGTTTTGCTGGTATTAG
Protein sequence
MDGKFDWDVQRKMISRTSSAGCSSRSIYYCGTAEGVPFKWETQPGTPKDAPPQEKMLPPLSPPPAVLSLRLPKPGVVEQPKPRPRMKLRFWRKSKKSRDSERPAQATTLD
YSNGHFFSHSNKLETFSFLSSDCEFMASPRDSMSLSSSSSSSSSSLPLSLLESLNNFGSVVGDVTPLMLLLLLLLQTAKAVRFLQHREASPDCLRLRHRSRIIGFPVGKD
ETMHSGAAVEAFQAALNRDIEGDAPAASQTSESDAAFPRGSNSTSGLSLQASSQNENTESHVHQDQNFLLKQEQHSSLMEQERCTSVPENQQQHNATLLQASKNQPQADH
EQGEAEQASVQFSQTAGLQVSDKAPMLVNDSNRMQNRDNESQYLKLQKMSNQQAMVSEQASNPINRSKQVPFASLMPVLMPQLDKDRAMQLQTLFNKLKRNEMNKDDFIR
LMRGVVGDQMLRLAVCQVQAQPPPSVRQLSPRMPSIGPGAPNFSDARPFAQLHQKGMNSSAVQSYIPSSASQGRSSSSYPAIDKNMQSLREVEQRTDGNGNQLTSSSSGT
IQERERSSVPVPGLEKQQLHFQQKPFAMYGNSGNYHPYTGSNINPSLLSLKPQPHEGQVKQIPQQGPNFDRQVTINDSKRVQGGTVPHLHNNITSQQNPVNWKSSTSKEQ
NAGPLSSMSYIKQEPSDQVAEQNKTQLSNLQGLSSISSMQAEQVNTTPGIVKDSFEKQSSKMAFPTANNVMPPTSTNAANSISSDAISLHDSSAMLSSQVPSTATPGMHN
RAPQKKVTIGQKKPLDALGSSPPLSSKKQKVSGAFSDQSIEQLNDVTAVSGVNIREEEEQLFSGAKEDSRASEASRRVVQEEEERLILQKAPLQKKLVEIMAKCGLKGMS
NDVERCLSLCVEERLRGIISNLIRLSKQRVDTEKPRHRTLITSDVRQQIMLVNQKAREEWEKKQAEEEKLRKLNDPEDGSGAAGDKEKDEGRMKSVKANKEEDDKMRTTA
ANVAARAAVGGDDMLSKWQLMAEQARQKREGGMDSASGSQSGKDAVRKSSSAAGRHGKDNQEADRKGTTRKFGRNQVIAIKLGWLVVFLSRMSLLCCKENPRCPDPPQYI
ICATEFVLKPLVNRIDHNDGNCILQMCACGGFVFQFEMNPRLLLTPSALVSPRWVCISRDSMHKRRATGGKKKAWRKKRKYELGRQPANTKLSSDKSVRRVRVRGGNVKW
RAFRLDTGNYSWGSEAVTRKTRILDVVYNASNNELVRTQTLVKSAIVQVDAAPFKQWYLQHYGVDIGRKKKTLASAKKEEEGDAATEEVKKSNHVQRKLEKRQQDRKLDA
HIEEQFSSGRLMACISSRPGQCGRADGYILEGKELEFYMKKLQRKKGKGADDELLQLRETAILELQFCWY
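
The protein sequence above should correspond to a translation of the listed CDS. The following Python sketch shows one way to check that, assuming the standard nuclear genetic code (NCBI translation table 1). The function name `translate` and the constants are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence
    lines can be pasted in directly. Unrecognized codons become 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGGATGGAAAATTTGATTGG"))
```

Passing the full CDS text to `translate` and comparing the result with the protein sequence is a quick consistency check on the record.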