; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0013857 (gene) of Chayote v1 genome

Gene IDSed0013857
OrganismSechium edule (Chayote v1)
Descriptiongeneral transcription factor 3C polypeptide 3
Genome locationLG06:45942182..45960871
RNA-Seq ExpressionSed0013857
SyntenySed0013857
Gene Ontology termsGO:0006383 - transcription by RNA polymerase III (biological process)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily
IPR019734 - Tetratricopeptide repeat
IPR039340 - Transcription factor Tfc4/TFIIIC-102/Sfc4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606714.1 General transcription factor 3C polypeptide 3, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0085.8Show/hide
Query:  MEEEGNKISDNEEVPGCLVRGNELVETEVEDREDREEEEEEEEEEEEEEEEEEEEEEEEEEEEVEGEGEEEEDGYIFKFKAGENPFDFVEGTNFSIQPYK
        MEEEGN ISDNEEVPGC VRG  +VETEVEDRE+ EEEEEEEEE+EEE E+E E++             EEEDGYIFKFKAGENPFDFVEGT+FSIQPYK
Subjt:  MEEEGNKISDNEEVPGCLVRGNELVETEVEDREDREEEEEEEEEEEEEEEEEEEEEEEEEEEEVEGEGEEEEDGYIFKFKAGENPFDFVEGTNFSIQPYK

Query:  KFERLEYEALAEKKRKALAAGQSERSAKRGRVEDISGASFDEIMEAMNYGSKRKRKELKKRGRRKGSKKKLNRDVTKLLGEATLCYAQGQYEKAISLLSQ
        KFERLEYEALAEKKRKALA+GQSERSAKRGRVEDISGASF+EIMEAMNYGS+RKR++ KKRGRRKGSK+KLN DVTKLLG+ATLCYAQGQYEKAIS+LSQ
Subjt:  KFERLEYEALAEKKRKALAAGQSERSAKRGRVEDISGASFDEIMEAMNYGSKRKRKELKKRGRRKGSKKKLNRDVTKLLGEATLCYAQGQYEKAISLLSQ

Query:  VVLQAPDVPDSYHTLGLIYNAIDDDVRAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPEDINLLFYHASIYLDRGDCQKAAETY
        VVLQAPDVPDSYHTLGL+YNAI DDV+AMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEP+DINLLFY AS+YL+RGDCQKAAETY
Subjt:  VVLQAPDVPDSYHTLGLIYNAIDDDVRAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPEDINLLFYHASIYLDRGDCQKAAETY

Query:  DQIHQKCQGNVEALMKGAKLYQKCGHLERAICILEDYIKGHQAGADLDVVDLLASLYMGSKEFCKALERIEHADEVYCAGKDLPLNLTAKAGICHAHLGN
        DQIHQ    NVEALM GAKLYQKCGHLERAICILE+YIKGH   ADLDVVDLLASLYMGSKEF KALE IEHAD VYCA  +LPLNLTAKAGICH HLGN
Subjt:  DQIHQKCQGNVEALMKGAKLYQKCGHLERAICILEDYIKGHQAGADLDVVDLLASLYMGSKEFCKALERIEHADEVYCAGKDLPLNLTAKAGICHAHLGN

Query:  MEKAECLFANLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYLMSEKANAGGNVGILYLKIAQCYLSTNERAEAIIFFYKVIQHLEDNVNARLTLASLL
         EKAECLFANL R+A  + SNLMIEVADSLL+LKHY+LALKYYLMSE+ NAGGNVGILYLKIAQCY STNERAEAI+FFYKV+QHLEDN+NARLTLASLL
Subjt:  MEKAECLFANLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYLMSEKANAGGNVGILYLKIAQCYLSTNERAEAIIFFYKVIQHLEDNVNARLTLASLL

Query:  LEEGREEEVISLLSPPKDSNSISSSSSKCKAWWLNERVKFKLCHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKILDIRET
        LEE REEE ISLLSPPKDSNS SSSSSKCK WWLNERVK KLCHI++TKGMLENF+EAI PLV +SLYIETL EKIKVNKKKL K+VLLERVK+LD R+T
Subjt:  LEEGREEEVISLLSPPKDSNSISSSSSKCKAWWLNERVKFKLCHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKILDIRET

Query:  GGLFRRFRPVALRSDLSKASRARKLLLKRERIKEEKKAKALASGVNLNYDDLDDEATLRMHQESPLPNLLKDEECHILIFDLCKALASLGRCSEALEIIS
        GGLFR FRPVA +SDLSKASRA+KLL KRERI+EEKKA+ALA+GVNLNYDD DDE  LR+ +ESPLPNLLKDEE H LI DLCKALASLGRCSEALEIIS
Subjt:  GGLFRRFRPVALRSDLSKASRARKLLLKRERIKEEKKAKALASGVNLNYDDLDDEATLRMHQESPLPNLLKDEECHILIFDLCKALASLGRCSEALEIIS

Query:  LTLKLAFNSLSLERKEELQLLGAQLAFSATDTKHVFNFAKHVVKQYPYSNSAWNCYYKVSARMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISH
        LTLKLAFNSLS+ERKEELQLLGAQLAFS+TDTKH FNFAKHVVKQYPYSNSAWNCYYKVS+RMTNRDSRHCKLLNSMQ KYKDCAPPY+IAGHQFTAISH
Subjt:  LTLKLAFNSLSLERKEELQLLGAQLAFSATDTKHVFNFAKHVVKQYPYSNSAWNCYYKVSARMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISH

Query:  HQDAARKYLEAYKILPDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDNSQEALYNIARAYHHVGLVTMAVTYYEKVLATYQKDCLIP
        HQDAARKYLEAYK+LPDSPL+NLCVG++LINLTLGFRLQNKHQC+AQ L FLYKNLKLCDNSQEALYNIARAYHH+GLVT+AVTYYEKVLATYQKDC IP
Subjt:  HQDAARKYLEAYKILPDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDNSQEALYNIARAYHHVGLVTMAVTYYEKVLATYQKDCLIP

Query:  ELFGENR-SVKDKRSVHCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        ELFGEN+   K + SV+CDLRREAAYNLHLIYKESGALDLARQVLKD+CTF
Subjt:  ELFGENR-SVKDKRSVHCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

KAG7036429.1 General transcription factor 3C polypeptide 3, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0085.7Show/hide
Query:  MEEEGNKISDNEEVPGCLVRGNELVETEVEDREDREEEEEEEEEEEEEEEEEEEEEEEEEEEEVEGEGEEEEDGYIFKFKAGENPFDFVEGTNFSIQPYK
        MEEEGN ISDNEEVPGC VRG  +VETEVEDRE+ EEEEEEEE+EEE E+E E++              EEEDGYIFKFKAGENPFDFVEGT+FSIQPYK
Subjt:  MEEEGNKISDNEEVPGCLVRGNELVETEVEDREDREEEEEEEEEEEEEEEEEEEEEEEEEEEEVEGEGEEEEDGYIFKFKAGENPFDFVEGTNFSIQPYK

Query:  KFERLEYEALAEKKRKALAAGQSERSAKRGRVEDISGASFDEIMEAMNYGSKRKRKELKKRGRRKGSKKKLNRDVTKLLGEATLCYAQGQYEKAISLLSQ
        KFERLEYEALAEKKRKALA+GQSERSAKRGRVEDISGASF+EIMEAMNYGS+RKR++ KKRGRRKGSK+KLN DVTKLLG+ATLCYAQGQYEKAIS+LSQ
Subjt:  KFERLEYEALAEKKRKALAAGQSERSAKRGRVEDISGASFDEIMEAMNYGSKRKRKELKKRGRRKGSKKKLNRDVTKLLGEATLCYAQGQYEKAISLLSQ

Query:  VVLQAPDVPDSYHTLGLIYNAIDDDVRAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPEDINLLFYHASIYLDRGDCQKAAETY
        VVLQAPDVPDSYHTLGL+YNAI DDV+AMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEP+DINLLFY AS+YL+RGDCQKAAETY
Subjt:  VVLQAPDVPDSYHTLGLIYNAIDDDVRAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPEDINLLFYHASIYLDRGDCQKAAETY

Query:  DQIHQKCQGNVEALMKGAKLYQKCGHLERAICILEDYIKGHQAGADLDVVDLLASLYMGSKEFCKALERIEHADEVYCAGKDLPLNLTAKAGICHAHLGN
        DQIHQ    NVEALM GAKLYQKCGHLERAICILE+YIKGH   ADLDVVDLLASLYMGSKEF KALE IEHAD VYCA  +LPLNLTAKAGICH HLGN
Subjt:  DQIHQKCQGNVEALMKGAKLYQKCGHLERAICILEDYIKGHQAGADLDVVDLLASLYMGSKEFCKALERIEHADEVYCAGKDLPLNLTAKAGICHAHLGN

Query:  MEKAECLFANLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYLMSEKANAGGNVGILYLKIAQCYLSTNERAEAIIFFYKVIQHLEDNVNARLTLASLL
         EKAECLFANL R+A  + SNLMIEVADSLL+LKHY+LALKYYLMSE+ NAGGNVGILYLKIAQCY STNERAEAI+FFYKV+QHLEDN+NARLTLASLL
Subjt:  MEKAECLFANLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYLMSEKANAGGNVGILYLKIAQCYLSTNERAEAIIFFYKVIQHLEDNVNARLTLASLL

Query:  LEEGREEEVISLLSPPKDSNSISSSSSKCKAWWLNERVKFKLCHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKILDIRET
        LEE REEE ISLLSPPKDSNS SSSSSKCK WWLNERVK KLCHI++TKGMLENF+EAI PLV +SLYIETL EKIKVNKKKL K+VLLERVK+LD R+T
Subjt:  LEEGREEEVISLLSPPKDSNSISSSSSKCKAWWLNERVKFKLCHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKILDIRET

Query:  GGLFRRFRPVALRSDLSKASRARKLLLKRERIKEEKKAKALASGVNLNYDDLDDEATLRMHQESPLPNLLKDEECHILIFDLCKALASLGRCSEALEIIS
        GGLFR FRPVA +SDLSKASRA+KLL KRERI+EEKKA+ALA+GVNLNYDD DDE  LR+ +ESPLPNLLKDEE H LI DLCKALASLGRCSEALEIIS
Subjt:  GGLFRRFRPVALRSDLSKASRARKLLLKRERIKEEKKAKALASGVNLNYDDLDDEATLRMHQESPLPNLLKDEECHILIFDLCKALASLGRCSEALEIIS

Query:  LTLKLAFNSLSLERKEELQLLGAQLAFSATDTKHVFNFAKHVVKQYPYSNSAWNCYYKVSARMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISH
        LTLKLAFNSLS+ERKEELQLLGAQLAFS+TDTKH FNFAKHVVKQYPYSNSAWNCYYKVS+RMTNRDSRHCKLLNSMQ KYKDCAPPY+IAGHQFTAISH
Subjt:  LTLKLAFNSLSLERKEELQLLGAQLAFSATDTKHVFNFAKHVVKQYPYSNSAWNCYYKVSARMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISH

Query:  HQDAARKYLEAYKILPDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDNSQEALYNIARAYHHVGLVTMAVTYYEKVLATYQKDCLIP
        HQDAARKYLEAYK+LPDSPL+NLCVG++LINLTLGFRLQNKHQC+AQ L FLYKNLKLCDNSQEALYNIARAYHH+GLVT+AVTYYEKVLATYQKDC IP
Subjt:  HQDAARKYLEAYKILPDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDNSQEALYNIARAYHHVGLVTMAVTYYEKVLATYQKDCLIP

Query:  ELFGENR-SVKDKRSVHCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        ELFGEN+   K + SV+CDLRREAAYNLHLIYKESGALDLARQVLKD+CTF
Subjt:  ELFGENR-SVKDKRSVHCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

XP_022948712.1 general transcription factor 3C polypeptide 3 [Cucurbita moschata]0.0e+0086Show/hide
Query:  MEEEGNKISDNEEVPGCLVRGNELVETEVEDREDREEEEEEEEEEEEEEEEEEEEEEEEEEEEVEGEGEEEEDGYIFKFKAGENPFDFVEGTNFSIQPYK
        MEEEGN ISDNEEVPGC VRG  +VETEVEDRE+ EEEEEEEEEE+EEE E+E E++            EEEDGYIFKFKAGENPFDFVEGT+FSIQPYK
Subjt:  MEEEGNKISDNEEVPGCLVRGNELVETEVEDREDREEEEEEEEEEEEEEEEEEEEEEEEEEEEVEGEGEEEEDGYIFKFKAGENPFDFVEGTNFSIQPYK

Query:  KFERLEYEALAEKKRKALAAGQSERSAKRGRVEDISGASFDEIMEAMNYGSKRKRKELKKRGRRKGSKKKLNRDVTKLLGEATLCYAQGQYEKAISLLSQ
        KFERLEYEALAEKKRKALA+GQSERSAKRGRVEDISGASF+EIMEAMNYGS+RKR++ KKRGRRKGSK+KLN DVTKLLG+ATLCYAQGQYEKAIS+LSQ
Subjt:  KFERLEYEALAEKKRKALAAGQSERSAKRGRVEDISGASFDEIMEAMNYGSKRKRKELKKRGRRKGSKKKLNRDVTKLLGEATLCYAQGQYEKAISLLSQ

Query:  VVLQAPDVPDSYHTLGLIYNAIDDDVRAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPEDINLLFYHASIYLDRGDCQKAAETY
        VVLQAPDVPDSYHTLGL+YNAI DDV+AMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEP+DINLLFY AS+YL+RGDCQKAAETY
Subjt:  VVLQAPDVPDSYHTLGLIYNAIDDDVRAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPEDINLLFYHASIYLDRGDCQKAAETY

Query:  DQIHQKCQGNVEALMKGAKLYQKCGHLERAICILEDYIKGHQAGADLDVVDLLASLYMGSKEFCKALERIEHADEVYCAGKDLPLNLTAKAGICHAHLGN
        DQIHQ    NVEALM GAKLYQKCGHLERAICILE+YIKGH   ADLDVVDLLASLYMGSKEF KALE IEHAD VYCA  +LPLNLTAKAGICH HLGN
Subjt:  DQIHQKCQGNVEALMKGAKLYQKCGHLERAICILEDYIKGHQAGADLDVVDLLASLYMGSKEFCKALERIEHADEVYCAGKDLPLNLTAKAGICHAHLGN

Query:  MEKAECLFANLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYLMSEKANAGGNVGILYLKIAQCYLSTNERAEAIIFFYKVIQHLEDNVNARLTLASLL
         EKAECLFANL R+A  + SNLMIEVADSLL+LKHY+LALKYYLMSE+ NAGGNVGILYLKIAQCY STNERAEAI+FFYKV+QHLEDN+NARLTLASLL
Subjt:  MEKAECLFANLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYLMSEKANAGGNVGILYLKIAQCYLSTNERAEAIIFFYKVIQHLEDNVNARLTLASLL

Query:  LEEGREEEVISLLSPPKDSNSISSSSSKCKAWWLNERVKFKLCHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKILDIRET
        LEE REEE ISLLSPPKDSNS SSSSSKCK WWLNERVK KLCHI++TKGMLENF+EAI PLV +SLYIETL EKIKVNKKKL K+VLLERVK+LD R+T
Subjt:  LEEGREEEVISLLSPPKDSNSISSSSSKCKAWWLNERVKFKLCHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKILDIRET

Query:  GGLFRRFRPVALRSDLSKASRARKLLLKRERIKEEKKAKALASGVNLNYDDLDDEATLRMHQESPLPNLLKDEECHILIFDLCKALASLGRCSEALEIIS
        GGLFR FRPVA +SDLSKASRA+KLL KRERI+EEKKA+ALA+GVNLNYDD DDE  LR+ +ESPLPNLLKDEE H LI DLCKALASLGRCSEALEIIS
Subjt:  GGLFRRFRPVALRSDLSKASRARKLLLKRERIKEEKKAKALASGVNLNYDDLDDEATLRMHQESPLPNLLKDEECHILIFDLCKALASLGRCSEALEIIS

Query:  LTLKLAFNSLSLERKEELQLLGAQLAFSATDTKHVFNFAKHVVKQYPYSNSAWNCYYKVSARMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISH
        LTLKLAFNSLS+ERKEELQLLGAQLAFS+TDTKH FNFAKHVVKQYPYSNSAWNCYYKVS+RMTNRDSRHCKLLNSMQ KYKDCAPPY+IAGHQFTAISH
Subjt:  LTLKLAFNSLSLERKEELQLLGAQLAFSATDTKHVFNFAKHVVKQYPYSNSAWNCYYKVSARMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISH

Query:  HQDAARKYLEAYKILPDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDNSQEALYNIARAYHHVGLVTMAVTYYEKVLATYQKDCLIP
        HQDAARKYLEAYK+LPDSPL+NLCVG++LINLTLGFRLQNKHQC+AQ L FLYKNLKLCDNSQEALYNIARAYHH+GLVT+AVTYYEKVLATYQKDC IP
Subjt:  HQDAARKYLEAYKILPDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDNSQEALYNIARAYHHVGLVTMAVTYYEKVLATYQKDCLIP

Query:  ELFGENRSVKDKRSVHCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        ELFGEN+  K + SV+CDLRREAAYNLHLIYKESGALDLARQVLKD+CTF
Subjt:  ELFGENRSVKDKRSVHCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

XP_022997988.1 general transcription factor 3C polypeptide 3 [Cucurbita maxima]0.0e+0086.11Show/hide
Query:  MEEEGNKISDNEEVPGCLVRGNELVETEVEDREDREEEEEEEEEEEEEEEEEEEEEEEEEEEEVEGEGEEEEDGYIFKFKAGENPFDFVEGTNFSIQPYK
        MEEEGN ISDNEEVPGC VRG  +VETEVEDRE+ EEEEEEEEEEE+EEE E+E E++           EEEDGYIFKFKAGENPFDFVEGT+FSIQPYK
Subjt:  MEEEGNKISDNEEVPGCLVRGNELVETEVEDREDREEEEEEEEEEEEEEEEEEEEEEEEEEEEVEGEGEEEEDGYIFKFKAGENPFDFVEGTNFSIQPYK

Query:  KFERLEYEALAEKKRKALAAGQSERSAKRGRVEDISGASFDEIMEAMNYGSKRKRKELKKRGRRKGSKKKLNRDVTKLLGEATLCYAQGQYEKAISLLSQ
        KFERLEYEALAEKKRKALA GQSERSAKRGRVEDISGASF+EIMEAMNYGS+RKR++ KKRGRRKGSK+KLN DVTKLLG+ATLCYAQGQYEKAIS+LSQ
Subjt:  KFERLEYEALAEKKRKALAAGQSERSAKRGRVEDISGASFDEIMEAMNYGSKRKRKELKKRGRRKGSKKKLNRDVTKLLGEATLCYAQGQYEKAISLLSQ

Query:  VVLQAPDVPDSYHTLGLIYNAIDDDVRAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPEDINLLFYHASIYLDRGDCQKAAETY
        VVLQAPDVPDSYHTLGL+YNAI DDV+AMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEP+DINLLFY AS+YL+RGDCQKAAETY
Subjt:  VVLQAPDVPDSYHTLGLIYNAIDDDVRAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPEDINLLFYHASIYLDRGDCQKAAETY

Query:  DQIHQKCQGNVEALMKGAKLYQKCGHLERAICILEDYIKGHQAGADLDVVDLLASLYMGSKEFCKALERIEHADEVYCAGKDLPLNLTAKAGICHAHLGN
        DQIHQ    NVEALM GAKLYQKCGHLERAICILE+YIKGH   ADLDVVDLLASLYMGSKEF KALE IEHAD VYCA  +LPLNLTAKAGICH HLGN
Subjt:  DQIHQKCQGNVEALMKGAKLYQKCGHLERAICILEDYIKGHQAGADLDVVDLLASLYMGSKEFCKALERIEHADEVYCAGKDLPLNLTAKAGICHAHLGN

Query:  MEKAECLFANLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYLMSEKANAGGNVGILYLKIAQCYLSTNERAEAIIFFYKVIQHLEDNVNARLTLASLL
         EKAECLFANL R+A  + SNLMIEVADSLL+LKHY+LALKYYLMSE+ NAGGNVGILYLKIAQCY STNERAEAI+FFYKV+QHLEDN+NARLTLASLL
Subjt:  MEKAECLFANLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYLMSEKANAGGNVGILYLKIAQCYLSTNERAEAIIFFYKVIQHLEDNVNARLTLASLL

Query:  LEEGREEEVISLLSPPKDSNSISSSSSKCKAWWLNERVKFKLCHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKILDIRET
        LEE REEE ISLLSPPKDSNS SSSSSKCK WWLNERVK KLCHI++TKGMLENF+EAI PLV +SLYIETL EKIKVNKKKL K+VLLERVK+LD R+T
Subjt:  LEEGREEEVISLLSPPKDSNSISSSSSKCKAWWLNERVKFKLCHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKILDIRET

Query:  GGLFRRFRPVALRSDLSKASRARKLLLKRERIKEEKKAKALASGVNLNYDDLDDEATLRMHQESPLPNLLKDEECHILIFDLCKALASLGRCSEALEIIS
        GGLFR FRPVA +SDLSKASRA+KLL KRERI+EEKKA+ALA+GVNLNYDD DDE  LR+ +ESPLPNLLKDEE H LI DLCKALASLGRCSEALEIIS
Subjt:  GGLFRRFRPVALRSDLSKASRARKLLLKRERIKEEKKAKALASGVNLNYDDLDDEATLRMHQESPLPNLLKDEECHILIFDLCKALASLGRCSEALEIIS

Query:  LTLKLAFNSLSLERKEELQLLGAQLAFSATDTKHVFNFAKHVVKQYPYSNSAWNCYYKVSARMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISH
        LTLKLAFNSLS+ERKEELQLLGAQLAFS+TDTKH FNFAKHVVKQYPYSNSAWNCYYKVS+RMTNRDSRHCKLLNSMQ KYKDCAPPY+IAGHQFTAISH
Subjt:  LTLKLAFNSLSLERKEELQLLGAQLAFSATDTKHVFNFAKHVVKQYPYSNSAWNCYYKVSARMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISH

Query:  HQDAARKYLEAYKILPDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDNSQEALYNIARAYHHVGLVTMAVTYYEKVLATYQKDCLIP
        HQDAARKYLEAYK+LPDSPL+NLCVG++LINLTLGFRLQNKHQC+AQ L FLYKNLKLCDNSQEALYNIARAYHH+GLVT+AVTYYEKVLATYQKDC IP
Subjt:  HQDAARKYLEAYKILPDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDNSQEALYNIARAYHHVGLVTMAVTYYEKVLATYQKDCLIP

Query:  ELFGENRSVKDKRSVHCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        ELFGEN+  K + SV+CDLRREAAYNLHLIYKESGALDLARQVLKD+CTF
Subjt:  ELFGENRSVKDKRSVHCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

XP_023525239.1 general transcription factor 3C polypeptide 3 [Cucurbita pepo subsp. pepo]0.0e+0085.68Show/hide
Query:  MEEEGNKISDNEEVPGCLVRGNELVETEVEDREDREEEEEEEEEEEEEEEEEEEEEEEEEEEEVEGEGEEEEDGYIFKFKAGENPFDFVEGTNFSIQPYK
        MEEEGN ISDNEEVPGC VRG  +VETEVEDRE+ EEEEEEEE+EEE E+E E++              EEEDGYIFKFKAGENPFDFVEGT+FSIQPYK
Subjt:  MEEEGNKISDNEEVPGCLVRGNELVETEVEDREDREEEEEEEEEEEEEEEEEEEEEEEEEEEEVEGEGEEEEDGYIFKFKAGENPFDFVEGTNFSIQPYK

Query:  KFERLEYEALAEKKRKALAAGQSERSAKRGRVEDISGASFDEIMEAMNYGSKRKRKELKKRGRRKGSKKKLNRDVTKLLGEATLCYAQGQYEKAISLLSQ
        KFERLEYEALAEKKRKALA GQSERSAKRGRVEDISGASF+EIMEAMNYGS+RKR++ KKRGRRKGSK+KLN DVTKLLG+ATLCYAQGQYEKAIS+LSQ
Subjt:  KFERLEYEALAEKKRKALAAGQSERSAKRGRVEDISGASFDEIMEAMNYGSKRKRKELKKRGRRKGSKKKLNRDVTKLLGEATLCYAQGQYEKAISLLSQ

Query:  VVLQAPDVPDSYHTLGLIYNAIDDDVRAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPEDINLLFYHASIYLDRGDCQKAAETY
        VVLQAPDVPDSYHTLGL+YNAI DDV+AMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEP+DINLLFY AS+YL+RGDCQKAAETY
Subjt:  VVLQAPDVPDSYHTLGLIYNAIDDDVRAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPEDINLLFYHASIYLDRGDCQKAAETY

Query:  DQIHQKCQGNVEALMKGAKLYQKCGHLERAICILEDYIKGHQAGADLDVVDLLASLYMGSKEFCKALERIEHADEVYCAGKDLPLNLTAKAGICHAHLGN
        DQIHQ    NVEALM GAKLYQ+CGHLERAICILE+YIKGH   ADLDVVDLLASLYMGSKEF KALE IEHAD VYCA  +LPLNLTAKAGICH HLGN
Subjt:  DQIHQKCQGNVEALMKGAKLYQKCGHLERAICILEDYIKGHQAGADLDVVDLLASLYMGSKEFCKALERIEHADEVYCAGKDLPLNLTAKAGICHAHLGN

Query:  MEKAECLFANLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYLMSEKANAGGNVGILYLKIAQCYLSTNERAEAIIFFYKVIQHLEDNVNARLTLASLL
         EKAECLFANL R+A  + SNLMIEVADSLL+LKHY+LALKYYLMSE+ NAGGNVGILYLKIAQCY STNERAEAI+FFYKV+QHLEDN+NARLTLASLL
Subjt:  MEKAECLFANLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYLMSEKANAGGNVGILYLKIAQCYLSTNERAEAIIFFYKVIQHLEDNVNARLTLASLL

Query:  LEEGREEEVISLLSPPKDSNSISSSSSKCKAWWLNERVKFKLCHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKILDIRET
        LEE REEE ISLLSPPKDSNS SSSSSKCK WWLNERVK KLCHI++TKGMLENF++AI PLV +SLYIETL EKIKVNKKKL K+VLLERVK+LD R+T
Subjt:  LEEGREEEVISLLSPPKDSNSISSSSSKCKAWWLNERVKFKLCHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKILDIRET

Query:  GGLFRRFRPVALRSDLSKASRARKLLLKRERIKEEKKAKALASGVNLNYDDLDDEATLRMHQESPLPNLLKDEECHILIFDLCKALASLGRCSEALEIIS
        GGLFR FRPVA +SDLSKASRA+KLL KRERIKEEKKA+ALA+GVNLNYDD DDE  LR+ +ESPLPNLLKDEE H LI DLCKALASLGRCSEALEIIS
Subjt:  GGLFRRFRPVALRSDLSKASRARKLLLKRERIKEEKKAKALASGVNLNYDDLDDEATLRMHQESPLPNLLKDEECHILIFDLCKALASLGRCSEALEIIS

Query:  LTLKLAFNSLSLERKEELQLLGAQLAFSATDTKHVFNFAKHVVKQYPYSNSAWNCYYKVSARMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISH
        LTLKLAFNSLS+ERKEELQLLGAQLAFS+TDTKH FNFAKHVVKQYPYSNSAWNCYYKVS+RMTNRDSRHCKLLNSMQ KYKDCAPPY+IAGHQFTAISH
Subjt:  LTLKLAFNSLSLERKEELQLLGAQLAFSATDTKHVFNFAKHVVKQYPYSNSAWNCYYKVSARMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISH

Query:  HQDAARKYLEAYKILPDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDNSQEALYNIARAYHHVGLVTMAVTYYEKVLATYQKDCLIP
        HQDAARKYLEAYK+LPDSPL+NLCVG++LINLTLGFRLQNKHQC+AQ L FLYKNLKLCDNSQEALYNIARAYHH+GLVT+AVTYYEKVLATYQKDC IP
Subjt:  HQDAARKYLEAYKILPDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDNSQEALYNIARAYHHVGLVTMAVTYYEKVLATYQKDCLIP

Query:  ELFGENRSVKDKRSVHCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        ELFGEN+  K + SV+CDLRREAAYNLHLIYKESGALDLARQVLKD+CTF
Subjt:  ELFGENRSVKDKRSVHCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

TrEMBL top hitse value%identityAlignment
A0A1S3BHB9 general transcription factor 3C polypeptide 3 isoform X10.0e+0082.58Show/hide
Query:  MEEEGNKISDNEEVPGCL--VRGNEL-VETEVEDREDREEEEEEEEEEEEEEEEEEEEEEEEEEEEVEGEGEEEEDGYIFKFKAGENPFDFVEGTNFSIQ
        ME+EGN+ISD+EEVPG +  V G E  VET V DR           EEEEEEEE EEE E+E E+++     EEEDGY FKFKAGENPFDFVEGT+FS+Q
Subjt:  MEEEGNKISDNEEVPGCL--VRGNEL-VETEVEDREDREEEEEEEEEEEEEEEEEEEEEEEEEEEEVEGEGEEEEDGYIFKFKAGENPFDFVEGTNFSIQ

Query:  PYKKFERLEYEALAEKKRKALAAGQSERSAKRGRVEDISGASFDEIMEAMNYGSKRKRKELKKRGRRKGSKKKLNRDVTKLLGEATLCYAQGQYEKAISL
        PYKKFERLEYEALAEKKRKALA GQSER+AKRGRVED++GASFDEI+EAMNYGS+RK KE KKRGRRKGSKKKLNRDVTKLLG+ATLCYAQGQ+EKAISL
Subjt:  PYKKFERLEYEALAEKKRKALAAGQSERSAKRGRVEDISGASFDEIMEAMNYGSKRKRKELKKRGRRKGSKKKLNRDVTKLLGEATLCYAQGQYEKAISL

Query:  LSQVVLQAPDVPDSYHTLGLIYNAIDDDVRAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPEDINLLFYHASIYLDRGDCQKAA
        L QVVLQAPD+PDSYHTLGL+YNAI DDV+AMGFYMLAAHLMP+DSSLWKLLFSWSI+RGDIDQASYCLSKAIKAEP+DINLLF+ AS+YL+RGDC+KAA
Subjt:  LSQVVLQAPDVPDSYHTLGLIYNAIDDDVRAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPEDINLLFYHASIYLDRGDCQKAA

Query:  ETYDQIHQKCQGNVEALMKGAKLYQKCGHLERAICILEDYIKGHQAGADLDVVDLLASLYMGSKEFCKALERIEHADEVYCAGKDLPLNLTAKAGICHAH
        ETYDQIHQ+C GNVEALM GAKLYQKCGHLERAICILEDYIK H + ADLDVVDLLASLYMGSKEF KALE IEHAD VYCAG +LPLNLTAKAGICHAH
Subjt:  ETYDQIHQKCQGNVEALMKGAKLYQKCGHLERAICILEDYIKGHQAGADLDVVDLLASLYMGSKEFCKALERIEHADEVYCAGKDLPLNLTAKAGICHAH

Query:  LGNMEKAECLFANLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYLMSEKANAGGNVGILYLKIAQCYLSTNERAEAIIFFYKVIQHLEDNVNARLTLA
        LGN+EKAECLFANL R+  YDHSNLMIEVADSLL+LKHY  ALKYYLMSE+ NAG N+GILY K+A+CYLSTNE+ +AI+FFYKV+QH+EDN+NARLTLA
Subjt:  LGNMEKAECLFANLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYLMSEKANAGGNVGILYLKIAQCYLSTNERAEAIIFFYKVIQHLEDNVNARLTLA

Query:  SLLLEEGREEEVISLLSPPKDSNSISSSSSKCKAWWLNERVKFKLCHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKILDI
        SLLLEE R+EE ISLLSPPKDSN  SSSS K K WWLNE+VK KLCHIY+T+G+LENF+E I PLV +SLYIETLQEKIKVNKKKL ++VLLERVK+LD 
Subjt:  SLLLEEGREEEVISLLSPPKDSNSISSSSSKCKAWWLNERVKFKLCHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKILDI

Query:  RETGGLFRRFRPVALRSDLSKASRARKLLLKRERIKEEKKAKALASGVNLNYDDLDDEATLRMHQESPLPNLLKDEECHILIFDLCKALASLGRCSEALE
        RETG LFR FRPVA +SDL+KASRA++LL KR+RIKEEKKAK LA+GVN++YDDLDDE  LRMH+ESPLPNLLK+EE HILI DLCKALASLGRCSEALE
Subjt:  RETGGLFRRFRPVALRSDLSKASRARKLLLKRERIKEEKKAKALASGVNLNYDDLDDEATLRMHQESPLPNLLKDEECHILIFDLCKALASLGRCSEALE

Query:  IISLTLKLAFNSLSLERKEELQLLGAQLAFSATDTKHVFNFAKHVVKQYPYSNSAWNCYYKVSARMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTA
        IISLTLKLAFNSLS ERKEELQLLGAQLAFS+T T H FNFAKHVVKQYPYS SAWNCYYKV++ +TNRDSRHCKLLNSMQAKYKDCAPPY+IAGHQFT 
Subjt:  IISLTLKLAFNSLSLERKEELQLLGAQLAFSATDTKHVFNFAKHVVKQYPYSNSAWNCYYKVSARMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTA

Query:  ISHHQDAARKYLEAYKILPDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDNSQEALYNIARAYHHVGLVTMAVTYYEKVLATYQKDC
        ISHHQDAARKYLEAYKI+PDSPL+NLCVGSSLINL LGFRLQNKHQC+AQ L FLYKNLKLCDN+QEALYNIARAYHH+GLVT+AVTYYEKVLATYQKDC
Subjt:  ISHHQDAARKYLEAYKILPDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDNSQEALYNIARAYHHVGLVTMAVTYYEKVLATYQKDC

Query:  LIPELFGENRSVKDKRSVHCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
         IPELFGENR++K + SV+CDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
Subjt:  LIPELFGENRSVKDKRSVHCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

A0A6J1DEF9 general transcription factor 3C polypeptide 3 isoform X20.0e+0083.23Show/hide
Query:  MEEEGNKISDNEEVPGCLVRGNELVETEVEDREDRE-EEEEEEEEEEEEEEEEEEEEEEEEEEEVEGEGE---EEEDGYIFKFKAGENPFDFVEGTNFSI
        ME+EGN+ISDN+EVPGC V     VE EV+  ++ E E  EEEEEEEEE+EEEEEEEEEEEEEEVE EGE   EEEDGYIFKFKAGENPFDFVEGT+FSI
Subjt:  MEEEGNKISDNEEVPGCLVRGNELVETEVEDREDRE-EEEEEEEEEEEEEEEEEEEEEEEEEEEVEGEGE---EEEDGYIFKFKAGENPFDFVEGTNFSI

Query:  QPYKKFERLEYEALAEKKRKALAAGQSERSAKRGRVEDISGASFDEIMEAMNYGSKRKRKELKKRGRRKGSKKKLNRDVTKLLGEATLCYAQGQYEKAIS
        QPYKKFERLEYEALAEKKRKALA  QSER  KRGR+EDI GASF+EIMEAMNYGS+RK KE K+RGRRKGSKKK+NR++TKLLG+ATLCYAQGQYEKAIS
Subjt:  QPYKKFERLEYEALAEKKRKALAAGQSERSAKRGRVEDISGASFDEIMEAMNYGSKRKRKELKKRGRRKGSKKKLNRDVTKLLGEATLCYAQGQYEKAIS

Query:  LLSQVVLQAPDVPDSYHTLGLIYNAIDDDVRAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPEDINLLFYHASIYLDRGDCQKA
        +L QVVLQAPD+PDSYHTLGL+YNAI DDV+AMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEP+DINLLF+ AS+YL+RGDCQKA
Subjt:  LLSQVVLQAPDVPDSYHTLGLIYNAIDDDVRAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPEDINLLFYHASIYLDRGDCQKA

Query:  AETYDQIHQKCQGNVEALMKGAKLYQKCGHLERAICILEDYIKGHQAGADLDVVDLLASLYMGSKEFCKALERIEHADEVYCAGKDLPLNLTAKAGICHA
        AETYDQIHQKC GNVEALM GAKLYQKCGH ERAICILEDYIKGH   ADLDVVDLLASLYMGSKEF KALE IEHAD+VYCAG ++PLNL  KAGICH 
Subjt:  AETYDQIHQKCQGNVEALMKGAKLYQKCGHLERAICILEDYIKGHQAGADLDVVDLLASLYMGSKEFCKALERIEHADEVYCAGKDLPLNLTAKAGICHA

Query:  HLGNMEKAECLFANLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYLMSEKANAGGNVGILYLKIAQCYLSTNERAEAIIFFYKVIQHLEDNVNARLTL
        HLGN+EKAE LFANLGR  A DHS+ +IE ADSLL+LKH++LALKYYLMSE+ NAGG +GI+YLKIAQCYLSTNERAEAI+FFYKV+Q LEDN+NARLTL
Subjt:  HLGNMEKAECLFANLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYLMSEKANAGGNVGILYLKIAQCYLSTNERAEAIIFFYKVIQHLEDNVNARLTL

Query:  ASLLLEEGREEEVISLLSPPKDSNSISSSSSKCKAWWLNERVKFKLCHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKILD
        ASLLLEE REEE ISLLSPPKDSNS SSSSSK K WWLNE+VK KLC+IY+TKGMLENF+E I  LV +SLYIETL+EKIKVNKKKL ++VLLERVK+LD
Subjt:  ASLLLEEGREEEVISLLSPPKDSNSISSSSSKCKAWWLNERVKFKLCHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKILD

Query:  IRETGGLFRRFRPVALRSDLSKASRARKLLLKRERIKEEKKAKALASGVNLNYDDLDDEATLRMHQESPLPNLLKDEECHILIFDLCKALASLGRCSEAL
         RETG LFR FRPVA +SDLSKASRA++LL KRERIKEEKKA+ALA+GVN++YDD+DDE  LR+H+ESPLPNLLKDEE H LI DLCKALASLGRCSEAL
Subjt:  IRETGGLFRRFRPVALRSDLSKASRARKLLLKRERIKEEKKAKALASGVNLNYDDLDDEATLRMHQESPLPNLLKDEECHILIFDLCKALASLGRCSEAL

Query:  EIISLTLKLAFNSLSLERKEELQLLGAQLAFSATDTKHVFNFAKHVVKQYPYSNSAWNCYYKVSARMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFT
        EIISLTLKLAFNSLS+ERKEELQLLGAQLAFS+TDTKH FNFAKHVVKQYPYSNSAWNCYYKVS+RMT+RDSRHCKLLNS+QAKYKDCAPP++IAGHQF 
Subjt:  EIISLTLKLAFNSLSLERKEELQLLGAQLAFSATDTKHVFNFAKHVVKQYPYSNSAWNCYYKVSARMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFT

Query:  AISHHQDAARKYLEAYKILPDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDNSQEALYNIARAYHHVGLVTMAVTYYEKVLATYQKD
        AISHHQ+AA+KYLEAYK+LPDSPL+NLCVG++LINL LG RLQNKHQC+AQ L FLY NLKLCDNSQEALYNIARAYHH+GLVT+AVTYYEKVLATYQKD
Subjt:  AISHHQDAARKYLEAYKILPDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDNSQEALYNIARAYHHVGLVTMAVTYYEKVLATYQKD

Query:  CLIPELFGENRSVKDKRSVHCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        C IP++FGENR++K ++SV+CDLRREAAYNLHLIYK+SGALDLARQVLKDHCTF
Subjt:  CLIPELFGENRSVKDKRSVHCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

A0A6J1DGK2 general transcription factor 3C polypeptide 3 isoform X10.0e+0083.14Show/hide
Query:  MEEEGNKISDNEEVPGCLVRGNELVETEVEDREDRE-EEEEEEEEEEEEEEEEEEEEEEEEEEEVEGEGE---EEEDGYIFKFKAGENPFDFVEGTNFSI
        ME+EGN+ISDN+EVPGC V     VE EV+  ++ E E  EEEEEEEEE+EEEEEEEEEEEEEEVE EGE   EEEDGYIFKFKAGENPFDFVEGT+FSI
Subjt:  MEEEGNKISDNEEVPGCLVRGNELVETEVEDREDRE-EEEEEEEEEEEEEEEEEEEEEEEEEEEVEGEGE---EEEDGYIFKFKAGENPFDFVEGTNFSI

Query:  QPYKKFERLEYEALAEKKRKALAAGQSERSAKRGRVEDISGASFDEIMEAMNYGSKRKRKELKKRGRRKGSKKKLNRDVTKLLGEATLCYAQGQYEKAIS
        QPYKKFERLEYEALAEKKRKALA  QSER  KRGR+EDI GASF+EIMEAMNYGS+RK KE K+RGRRKGSKKK+NR++TKLLG+ATLCYAQGQYEKAIS
Subjt:  QPYKKFERLEYEALAEKKRKALAAGQSERSAKRGRVEDISGASFDEIMEAMNYGSKRKRKELKKRGRRKGSKKKLNRDVTKLLGEATLCYAQGQYEKAIS

Query:  LLSQVVLQAPDVPDSYHTLGLIYNAIDDDVRAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPEDINLLFYHASIYLDRGDCQKA
        +L QVVLQAPD+PDSYHTLGL+YNAI DDV+AMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEP+DINLLF+ AS+YL+RGDCQKA
Subjt:  LLSQVVLQAPDVPDSYHTLGLIYNAIDDDVRAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPEDINLLFYHASIYLDRGDCQKA

Query:  AETYDQIHQKCQGNVEALMKGAKLYQKCGHLERAICILEDYIKGHQAGADLDVVDLLASLYMGSKEFCKALERIEHADEVYCAGKDLPLNLTAKAGICHA
        AETYDQIHQKC GNVEALM GAKLYQKCGH ERAICILEDYIKGH   ADLDVVDLLASLYMGSKEF KALE IEHAD+VYCAG ++PLNL  KAGICH 
Subjt:  AETYDQIHQKCQGNVEALMKGAKLYQKCGHLERAICILEDYIKGHQAGADLDVVDLLASLYMGSKEFCKALERIEHADEVYCAGKDLPLNLTAKAGICHA

Query:  HLGNMEKAECLFANLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYLMSEKANAGGNV-GILYLKIAQCYLSTNERAEAIIFFYKVIQHLEDNVNARLT
        HLGN+EKAE LFANLGR  A DHS+ +IE ADSLL+LKH++LALKYYLMSE+ NAGG + GI+YLKIAQCYLSTNERAEAI+FFYKV+Q LEDN+NARLT
Subjt:  HLGNMEKAECLFANLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYLMSEKANAGGNV-GILYLKIAQCYLSTNERAEAIIFFYKVIQHLEDNVNARLT

Query:  LASLLLEEGREEEVISLLSPPKDSNSISSSSSKCKAWWLNERVKFKLCHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKIL
        LASLLLEE REEE ISLLSPPKDSNS SSSSSK K WWLNE+VK KLC+IY+TKGMLENF+E I  LV +SLYIETL+EKIKVNKKKL ++VLLERVK+L
Subjt:  LASLLLEEGREEEVISLLSPPKDSNSISSSSSKCKAWWLNERVKFKLCHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKIL

Query:  DIRETGGLFRRFRPVALRSDLSKASRARKLLLKRERIKEEKKAKALASGVNLNYDDLDDEATLRMHQESPLPNLLKDEECHILIFDLCKALASLGRCSEA
        D RETG LFR FRPVA +SDLSKASRA++LL KRERIKEEKKA+ALA+GVN++YDD+DDE  LR+H+ESPLPNLLKDEE H LI DLCKALASLGRCSEA
Subjt:  DIRETGGLFRRFRPVALRSDLSKASRARKLLLKRERIKEEKKAKALASGVNLNYDDLDDEATLRMHQESPLPNLLKDEECHILIFDLCKALASLGRCSEA

Query:  LEIISLTLKLAFNSLSLERKEELQLLGAQLAFSATDTKHVFNFAKHVVKQYPYSNSAWNCYYKVSARMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQF
        LEIISLTLKLAFNSLS+ERKEELQLLGAQLAFS+TDTKH FNFAKHVVKQYPYSNSAWNCYYKVS+RMT+RDSRHCKLLNS+QAKYKDCAPP++IAGHQF
Subjt:  LEIISLTLKLAFNSLSLERKEELQLLGAQLAFSATDTKHVFNFAKHVVKQYPYSNSAWNCYYKVSARMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQF

Query:  TAISHHQDAARKYLEAYKILPDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDNSQEALYNIARAYHHVGLVTMAVTYYEKVLATYQK
         AISHHQ+AA+KYLEAYK+LPDSPL+NLCVG++LINL LG RLQNKHQC+AQ L FLY NLKLCDNSQEALYNIARAYHH+GLVT+AVTYYEKVLATYQK
Subjt:  TAISHHQDAARKYLEAYKILPDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDNSQEALYNIARAYHHVGLVTMAVTYYEKVLATYQK

Query:  DCLIPELFGENRSVKDKRSVHCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        DC IP++FGENR++K ++SV+CDLRREAAYNLHLIYK+SGALDLARQVLKDHCTF
Subjt:  DCLIPELFGENRSVKDKRSVHCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

A0A6J1GA07 general transcription factor 3C polypeptide 30.0e+0086Show/hide
Query:  MEEEGNKISDNEEVPGCLVRGNELVETEVEDREDREEEEEEEEEEEEEEEEEEEEEEEEEEEEVEGEGEEEEDGYIFKFKAGENPFDFVEGTNFSIQPYK
        MEEEGN ISDNEEVPGC VRG  +VETEVEDRE+ EEEEEEEEEE+EEE E+E E++            EEEDGYIFKFKAGENPFDFVEGT+FSIQPYK
Subjt:  MEEEGNKISDNEEVPGCLVRGNELVETEVEDREDREEEEEEEEEEEEEEEEEEEEEEEEEEEEVEGEGEEEEDGYIFKFKAGENPFDFVEGTNFSIQPYK

Query:  KFERLEYEALAEKKRKALAAGQSERSAKRGRVEDISGASFDEIMEAMNYGSKRKRKELKKRGRRKGSKKKLNRDVTKLLGEATLCYAQGQYEKAISLLSQ
        KFERLEYEALAEKKRKALA+GQSERSAKRGRVEDISGASF+EIMEAMNYGS+RKR++ KKRGRRKGSK+KLN DVTKLLG+ATLCYAQGQYEKAIS+LSQ
Subjt:  KFERLEYEALAEKKRKALAAGQSERSAKRGRVEDISGASFDEIMEAMNYGSKRKRKELKKRGRRKGSKKKLNRDVTKLLGEATLCYAQGQYEKAISLLSQ

Query:  VVLQAPDVPDSYHTLGLIYNAIDDDVRAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPEDINLLFYHASIYLDRGDCQKAAETY
        VVLQAPDVPDSYHTLGL+YNAI DDV+AMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEP+DINLLFY AS+YL+RGDCQKAAETY
Subjt:  VVLQAPDVPDSYHTLGLIYNAIDDDVRAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPEDINLLFYHASIYLDRGDCQKAAETY

Query:  DQIHQKCQGNVEALMKGAKLYQKCGHLERAICILEDYIKGHQAGADLDVVDLLASLYMGSKEFCKALERIEHADEVYCAGKDLPLNLTAKAGICHAHLGN
        DQIHQ    NVEALM GAKLYQKCGHLERAICILE+YIKGH   ADLDVVDLLASLYMGSKEF KALE IEHAD VYCA  +LPLNLTAKAGICH HLGN
Subjt:  DQIHQKCQGNVEALMKGAKLYQKCGHLERAICILEDYIKGHQAGADLDVVDLLASLYMGSKEFCKALERIEHADEVYCAGKDLPLNLTAKAGICHAHLGN

Query:  MEKAECLFANLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYLMSEKANAGGNVGILYLKIAQCYLSTNERAEAIIFFYKVIQHLEDNVNARLTLASLL
         EKAECLFANL R+A  + SNLMIEVADSLL+LKHY+LALKYYLMSE+ NAGGNVGILYLKIAQCY STNERAEAI+FFYKV+QHLEDN+NARLTLASLL
Subjt:  MEKAECLFANLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYLMSEKANAGGNVGILYLKIAQCYLSTNERAEAIIFFYKVIQHLEDNVNARLTLASLL

Query:  LEEGREEEVISLLSPPKDSNSISSSSSKCKAWWLNERVKFKLCHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKILDIRET
        LEE REEE ISLLSPPKDSNS SSSSSKCK WWLNERVK KLCHI++TKGMLENF+EAI PLV +SLYIETL EKIKVNKKKL K+VLLERVK+LD R+T
Subjt:  LEEGREEEVISLLSPPKDSNSISSSSSKCKAWWLNERVKFKLCHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKILDIRET

Query:  GGLFRRFRPVALRSDLSKASRARKLLLKRERIKEEKKAKALASGVNLNYDDLDDEATLRMHQESPLPNLLKDEECHILIFDLCKALASLGRCSEALEIIS
        GGLFR FRPVA +SDLSKASRA+KLL KRERI+EEKKA+ALA+GVNLNYDD DDE  LR+ +ESPLPNLLKDEE H LI DLCKALASLGRCSEALEIIS
Subjt:  GGLFRRFRPVALRSDLSKASRARKLLLKRERIKEEKKAKALASGVNLNYDDLDDEATLRMHQESPLPNLLKDEECHILIFDLCKALASLGRCSEALEIIS

Query:  LTLKLAFNSLSLERKEELQLLGAQLAFSATDTKHVFNFAKHVVKQYPYSNSAWNCYYKVSARMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISH
        LTLKLAFNSLS+ERKEELQLLGAQLAFS+TDTKH FNFAKHVVKQYPYSNSAWNCYYKVS+RMTNRDSRHCKLLNSMQ KYKDCAPPY+IAGHQFTAISH
Subjt:  LTLKLAFNSLSLERKEELQLLGAQLAFSATDTKHVFNFAKHVVKQYPYSNSAWNCYYKVSARMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISH

Query:  HQDAARKYLEAYKILPDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDNSQEALYNIARAYHHVGLVTMAVTYYEKVLATYQKDCLIP
        HQDAARKYLEAYK+LPDSPL+NLCVG++LINLTLGFRLQNKHQC+AQ L FLYKNLKLCDNSQEALYNIARAYHH+GLVT+AVTYYEKVLATYQKDC IP
Subjt:  HQDAARKYLEAYKILPDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDNSQEALYNIARAYHHVGLVTMAVTYYEKVLATYQKDCLIP

Query:  ELFGENRSVKDKRSVHCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        ELFGEN+  K + SV+CDLRREAAYNLHLIYKESGALDLARQVLKD+CTF
Subjt:  ELFGENRSVKDKRSVHCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

A0A6J1KFI7 general transcription factor 3C polypeptide 30.0e+0086.11Show/hide
Query:  MEEEGNKISDNEEVPGCLVRGNELVETEVEDREDREEEEEEEEEEEEEEEEEEEEEEEEEEEEVEGEGEEEEDGYIFKFKAGENPFDFVEGTNFSIQPYK
        MEEEGN ISDNEEVPGC VRG  +VETEVEDRE+ EEEEEEEEEEE+EEE E+E E++           EEEDGYIFKFKAGENPFDFVEGT+FSIQPYK
Subjt:  MEEEGNKISDNEEVPGCLVRGNELVETEVEDREDREEEEEEEEEEEEEEEEEEEEEEEEEEEEVEGEGEEEEDGYIFKFKAGENPFDFVEGTNFSIQPYK

Query:  KFERLEYEALAEKKRKALAAGQSERSAKRGRVEDISGASFDEIMEAMNYGSKRKRKELKKRGRRKGSKKKLNRDVTKLLGEATLCYAQGQYEKAISLLSQ
        KFERLEYEALAEKKRKALA GQSERSAKRGRVEDISGASF+EIMEAMNYGS+RKR++ KKRGRRKGSK+KLN DVTKLLG+ATLCYAQGQYEKAIS+LSQ
Subjt:  KFERLEYEALAEKKRKALAAGQSERSAKRGRVEDISGASFDEIMEAMNYGSKRKRKELKKRGRRKGSKKKLNRDVTKLLGEATLCYAQGQYEKAISLLSQ

Query:  VVLQAPDVPDSYHTLGLIYNAIDDDVRAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPEDINLLFYHASIYLDRGDCQKAAETY
        VVLQAPDVPDSYHTLGL+YNAI DDV+AMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEP+DINLLFY AS+YL+RGDCQKAAETY
Subjt:  VVLQAPDVPDSYHTLGLIYNAIDDDVRAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPEDINLLFYHASIYLDRGDCQKAAETY

Query:  DQIHQKCQGNVEALMKGAKLYQKCGHLERAICILEDYIKGHQAGADLDVVDLLASLYMGSKEFCKALERIEHADEVYCAGKDLPLNLTAKAGICHAHLGN
        DQIHQ    NVEALM GAKLYQKCGHLERAICILE+YIKGH   ADLDVVDLLASLYMGSKEF KALE IEHAD VYCA  +LPLNLTAKAGICH HLGN
Subjt:  DQIHQKCQGNVEALMKGAKLYQKCGHLERAICILEDYIKGHQAGADLDVVDLLASLYMGSKEFCKALERIEHADEVYCAGKDLPLNLTAKAGICHAHLGN

Query:  MEKAECLFANLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYLMSEKANAGGNVGILYLKIAQCYLSTNERAEAIIFFYKVIQHLEDNVNARLTLASLL
         EKAECLFANL R+A  + SNLMIEVADSLL+LKHY+LALKYYLMSE+ NAGGNVGILYLKIAQCY STNERAEAI+FFYKV+QHLEDN+NARLTLASLL
Subjt:  MEKAECLFANLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYLMSEKANAGGNVGILYLKIAQCYLSTNERAEAIIFFYKVIQHLEDNVNARLTLASLL

Query:  LEEGREEEVISLLSPPKDSNSISSSSSKCKAWWLNERVKFKLCHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKILDIRET
        LEE REEE ISLLSPPKDSNS SSSSSKCK WWLNERVK KLCHI++TKGMLENF+EAI PLV +SLYIETL EKIKVNKKKL K+VLLERVK+LD R+T
Subjt:  LEEGREEEVISLLSPPKDSNSISSSSSKCKAWWLNERVKFKLCHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKILDIRET

Query:  GGLFRRFRPVALRSDLSKASRARKLLLKRERIKEEKKAKALASGVNLNYDDLDDEATLRMHQESPLPNLLKDEECHILIFDLCKALASLGRCSEALEIIS
        GGLFR FRPVA +SDLSKASRA+KLL KRERI+EEKKA+ALA+GVNLNYDD DDE  LR+ +ESPLPNLLKDEE H LI DLCKALASLGRCSEALEIIS
Subjt:  GGLFRRFRPVALRSDLSKASRARKLLLKRERIKEEKKAKALASGVNLNYDDLDDEATLRMHQESPLPNLLKDEECHILIFDLCKALASLGRCSEALEIIS

Query:  LTLKLAFNSLSLERKEELQLLGAQLAFSATDTKHVFNFAKHVVKQYPYSNSAWNCYYKVSARMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISH
        LTLKLAFNSLS+ERKEELQLLGAQLAFS+TDTKH FNFAKHVVKQYPYSNSAWNCYYKVS+RMTNRDSRHCKLLNSMQ KYKDCAPPY+IAGHQFTAISH
Subjt:  LTLKLAFNSLSLERKEELQLLGAQLAFSATDTKHVFNFAKHVVKQYPYSNSAWNCYYKVSARMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISH

Query:  HQDAARKYLEAYKILPDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDNSQEALYNIARAYHHVGLVTMAVTYYEKVLATYQKDCLIP
        HQDAARKYLEAYK+LPDSPL+NLCVG++LINLTLGFRLQNKHQC+AQ L FLYKNLKLCDNSQEALYNIARAYHH+GLVT+AVTYYEKVLATYQKDC IP
Subjt:  HQDAARKYLEAYKILPDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDNSQEALYNIARAYHHVGLVTMAVTYYEKVLATYQKDCLIP

Query:  ELFGENRSVKDKRSVHCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        ELFGEN+  K + SV+CDLRREAAYNLHLIYKESGALDLARQVLKD+CTF
Subjt:  ELFGENRSVKDKRSVHCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

SwissProt top hitse value%identityAlignment
O74458 Transcription factor tau subunit sfc41.7e-2522.12Show/hide
Query:  ERLEYEALAEKKRKALAAGQSERSAKRGRVEDISGASFDEIMEAMNYGSKRKRKELKKRGRRKGSKKKLNRDVTKLLGEATLCYAQ-GQYEKAISLLSQV
        E  +YE   ++   +  A ++E        +DI+   ++E ++A+  G ++ RK  K RGR   +    + +V ++L  A   +AQ G +++A  L  ++
Subjt:  ERLEYEALAEKKRKALAAGQSERSAKRGRVEDISGASFDEIMEAMNYGSKRKRKELKKRGRRKGSKKKLNRDVTKLLGEATLCYAQ-GQYEKAISLLSQV

Query:  VLQAPDVPDSYHTLGLIY----NAIDDDVRAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEP---EDINLLFYHASIY-LDRGDC
        V    +V  ++  LG  +    N   +  + +  +M AAHL P+D  LW      S      DQA YC ++A+ A+P    ++    ++ S+   + G  
Subjt:  VLQAPDVPDSYHTLGLIY----NAIDDDVRAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEP---EDINLLFYHASIY-LDRGDC

Query:  QKAAETYDQIHQKCQGNVEALMKGAKLYQKCGHLERAIC----ILEDYIKGHQAGA------DLDVVDLLASLYMGSKEFCKALERIEHADEVYCAGKD-
        +KAAE +  + Q    N   L   A++Y K  H  R I     I   Y   + A        DL  ++L A L +   ++   +  I      +   K  
Subjt:  QKAAETYDQIHQKCQGNVEALMKGAKLYQKCGHLERAIC----ILEDYIKGHQAGA------DLDVVDLLASLYMGSKEFCKALERIEHADEVYCAGKD-

Query:  -----------------------------------LPLNLTAKAGICHAHLGNMEKAECLFA---NLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYL
                                           LP     K GI     G + +AE  F+   NL  D A+    ++ ++A + + ++  DLAL+Y++
Subjt:  -----------------------------------LPLNLTAKAGICHAHLGNMEKAECLFA---NLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYL

Query:  MSEKANAGGNVGILYLKIAQCYLSTNER------AEAIIFF----------YKVIQHLEDNVNARLTLASLLLEEGREEEVISLLSPPKDSNSISSSSSK
        +        N+G+ Y  +  CYL   E        EAI+               I  L+DN +A L + + + E+ R    I+ L   +  N     +  
Subjt:  MSEKANAGGNVGILYLKIAQCYLSTNER------AEAIIFF----------YKVIQHLEDNVNARLTLASLLLEEGREEEVISLLSPPKDSNSISSSSSK

Query:  CKAWWLNERVK----FKLCHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKILDIRETGGLFRRFRPVALRSDLSKASRARK
         + +  N++V      K   I ++K     F       + ++   +    K+ + ++ L K+  +       +     L   F  +       K +RAR 
Subjt:  CKAWWLNERVK----FKLCHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKILDIRETGGLFRRFRPVALRSDLSKASRARK

Query:  LLLKRERIKEEKKAKALASGVNLNYDDLDDEATLRMHQESPLPNLLKD--------EECHILIFDLCKALASLGRCSEALEIISLTLKLAFNSLSLERKE
         LL R R +       L S +N     L+D  T   + +  L  +L+         +  + L  +    L  +G   +A ++++  +          +++
Subjt:  LLLKRERIKEEKKAKALASGVNLNYDDLDDEATLRMHQESPLPNLLKD--------EECHILIFDLCKALASLGRCSEALEIISLTLKLAFNSLSLERKE

Query:  ELQLLGAQLAFSATDTKHVFNFAKHVVKQYPYSNSAWNCY-------YKVSARMTNRDS-----RHCKLLNSMQ------------------AKYKDCAP
         L+      +  A D +      + V   + +    +  +       Y+ S    +  +     R  KL++ +                   A       
Subjt:  ELQLLGAQLAFSATDTKHVFNFAKHVVKQYPYSNSAWNCY-------YKVSARMTNRDS-----RHCKLLNSMQ------------------AKYKDCAP

Query:  PYLIA--GHQFTAISHHQDAARKYLEAYKILPDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDN-----SQEALYNIARAYHHVGLV
        P L+   GH          A   Y  A+ I PD P+ NL +G + ++  +     N+H  I Q  TFLY+   L  N      QEALYN+ +AYH +GL 
Subjt:  PYLIA--GHQFTAISHHQDAARKYLEAYKILPDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDN-----SQEALYNIARAYHHVGLV

Query:  TMAVTYYEKVLATYQKDCLIPELFGENRSVKDKR-SVHCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
          AV YYE VL       L P   G+  +  +   S   D   EAAYNL LIY  SG + LA Q+   +  F
Subjt:  TMAVTYYEKVLATYQKDCLIPELFGENRSVKDKR-SVHCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

P33339 Transcription factor tau 131 kDa subunit1.5e-0519.89Show/hide
Query:  EEEEEEEEEEEEEEEEEEVEGEGEEEEDGYIFKFKAGENPFDFVEGTNFSIQPYKKFERLEYEALAEKKRKALAAGQSERSAKRGRV-EDISGASFDEIM
        ++E++ +  E E  +  +V  E EE   G I  +K        ++   +  +      +L  +    ++  +L A  S+     G + ED      + I 
Subjt:  EEEEEEEEEEEEEEEEEEVEGEGEEEEDGYIFKFKAGENPFDFVEGTNFSIQPYKKFERLEYEALAEKKRKALAAGQSERSAKRGRV-EDISGASFDEIM

Query:  EAMNYGSKRKRKELK-KRGRRKGSKKKLNRDVTKLLGEATLCYAQGQYEKAISLLSQVVLQAPDVPDSYHTLGLIYNAIDDDVRAMGFYMLAAHLMPRDS
        EA N+  K+K+K  K K   R+  ++ L+ +V +LL +A   + +   + A  L ++V+ +      +Y TLG IY            + LAAHL   D 
Subjt:  EAMNYGSKRKRKELK-KRGRRKGSKKKLNRDVTKLLGEATLCYAQGQYEKAISLLSQVVLQAPDVPDSYHTLGLIYNAIDDDVRAMGFYMLAAHLMPRDS

Query:  SLWKLLFSWSIDRGDIDQASYCLSKAIKAEPEDINLLFYHASIYLDRGDCQKAAETYDQIHQKCQGNVEALMKGA-------------KLYQKC--GHLE
          WK++   S D   + QA YC S+ I   P +   ++  + +Y   G   +A + + +++     +   L + A             +LY K    ++E
Subjt:  SLWKLLFSWSIDRGDIDQASYCLSKAIKAEPEDINLLFYHASIYLDRGDCQKAAETYDQIHQKCQGNVEALMKGA-------------KLYQKC--GHLE

Query:  RAICIL------------EDYIKGHQAGA------------------------------DLDVVDLLASLYM--------GSKEFCKALERIEHA-----
        R   IL            E   +G  A                                D   +++LA L++        G K   K    I+       
Subjt:  RAICIL------------EDYIKGHQAGA------------------------------DLDVVDLLASLYM--------GSKEFCKALERIEHA-----

Query:  --------------------DEVYCAGKD----LPLNLTAKAGICHAHLGNMEKAECLFANLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYLMSEKA
                            D +  A K+    +P+++  + G+   +  N+ +A   F  L  +   D ++L  E A +L   + Y  A+ ++      
Subjt:  --------------------DEVYCAGKD----LPLNLTAKAGICHAHLGNMEKAECLFANLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYLMSEKA

Query:  NAGGNVGILYLKIAQCYLSTNERAEAIIFFYKVIQHLEDNVNARLTLASLL------------------LEEGREEEVISLLSPPKDSNSISSSSSKCKA
               + +  +A+CY        A  F+   I+   D+++ R++LA +                   + + + +E +  +S  K SN  S  SSK   
Subjt:  NAGGNVGILYLKIAQCYLSTNERAEAIIFFYKVIQHLEDNVNARLTLASLL------------------LEEGREEEVISLLSPPKDSNSISSSSSKCKA

Query:  WWLNERVKFKLCHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKILDIRETGGLFRRFRPVALRSDLSKASRARK---LLLK
          L E  KF+    ++ K       E       + +  + + +  K+ K +L+  +   +   + I     L   F  V    +    SR+RK   +L +
Subjt:  WWLNERVKFKLCHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKILDIRETGGLFRRFRPVALRSDLSKASRARK---LLLK

Query:  RERIKEE-----KKAKALASGVNLNYDDL-DDEATLRMHQESPLPNLLKDEECHILIFDLCKALASLGRCSEALEIISLTLKLAFNSLSLERKEELQLLG
         ++   E     ++   LA G ++    L ++  TL    E      L  E+   L  +L   +A      + L ++    ++       ER + ++ + 
Subjt:  RERIKEE-----KKAKALASGVNLNYDDL-DDEATLRMHQESPLPNLLKDEECHILIFDLCKALASLGRCSEALEIISLTLKLAFNSLSLERKEELQLLG

Query:  AQLAFSATDTKHV----------FNFAKHVVKQYPYS----NSAWNCYYKV--------------SARMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQ
          +     D + +          F F + V++ + YS     S+ N                   S R     +    + N         + PYL   + 
Subjt:  AQLAFSATDTKHV----------FNFAKHVVKQYPYS----NSAWNCYYKV--------------SARMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQ

Query:  FTAISHHQDAARKYLEAYKIL-------PDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDN------SQEALYNIARAYHHVGLVTM
            S     +R +L A + L       PD P+VNL +G S I+  +      +H  I   L +LY+  K+  +       QEA YN+ RA+H +GLV++
Subjt:  FTAISHHQDAARKYLEAYKIL-------PDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDN------SQEALYNIARAYHHVGLVTM

Query:  AVTYYEKVLATYQKDCLIPELFGENRSVKDKRSVHCDLRREAAYNLHLIYKESGALDLARQVLKDH
        A+ YY +VL  Y                         L++ AAYN  +IY++SG ++LA  +++ +
Subjt:  AVTYYEKVLATYQKDCLIPELFGENRSVKDKRSVHCDLRREAAYNLHLIYKESGALDLARQVLKDH

Q9Y5Q9 General transcription factor 3C polypeptide 38.1e-4423.04Show/hide
Query:  VEDREDREEEEEEEEEEEEEEEEEEEEEEEEEEEEVEGEGEEEEDGYIFKFKAGE---NPFDFVEGTNFSIQPYKKFERL--EYEALAEKKRKALAAGQS
        +E +   EE E   EE +  E++  +E+ +   EE   + E      I   K+ +   N  +  +G   S+  +K F  +  E E   E++ +     + 
Subjt:  VEDREDREEEEEEEEEEEEEEEEEEEEEEEEEEEEVEGEGEEEEDGYIFKFKAGE---NPFDFVEGTNFSIQPYKKFERL--EYEALAEKKRKALAAGQS

Query:  ERSAKRGRVEDISGASFDEIMEAMNYGSKRKRKELKKRGRRKGSKKKLNRDVTKLLGEATLCYAQGQYEKAISLLSQVVLQAPDVPDSYHTLGLIYNAID
        E + ++    D+       ++E +        +E KK  + K  + KL R +  L+GEA + +A+G+ E+AI +  +++ QAP   + + TL +IY    
Subjt:  ERSAKRGRVEDISGASFDEIMEAMNYGSKRKRKELKKRGRRKGSKKKLNRDVTKLLGEATLCYAQGQYEKAISLLSQVVLQAPDVPDSYHTLGLIYNAID

Query:  DDVRAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPEDINLLFYHASIYLDRGDCQKAAETYDQIHQKCQGN-----VEALMKGA
        D  +++ F ++AAHL P D+  W  L   S+++ +I QA +C +KA+K EP ++  L+  +S+Y   GD + A + Y +I      +     ++     A
Subjt:  DDVRAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPEDINLLFYHASIYLDRGDCQKAAETYDQIHQKCQGN-----VEALMKGA

Query:  KLYQKCGHLERAICILEDYIKGHQAGADLDVVDLLASLYMGSKEFCKALERI---------------------EHADEVYCAGKD-LPLNLTAKAGICHA
        K Y +   +  AI I+++    HQ    ++ V++ A LY+ +K++ KALE I                     +  + V C   D +P+++T K  +C  
Subjt:  KLYQKCGHLERAICILEDYIKGHQAGADLDVVDLLASLYMGSKEFCKALERI---------------------EHADEVYCAGKD-LPLNLTAKAGICHA

Query:  HLGNMEKAECLFANLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYLMSEKANAGGNVGILYLKIAQCYLSTNERAEAIIFFYKVIQHLEDNVNARLTL
        HL  +E    L   L      D  +L ++VA++ L +  Y+ AL   L +   +   N+ +++L+ A+C  +      A   + KV+     +++AR++L
Subjt:  HLGNMEKAECLFANLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYLMSEKANAGGNVGILYLKIAQCYLSTNERAEAIIFFYKVIQHLEDNVNARLTL

Query:  ASLLLEEGREEEVISLLSPPKDSNSISSSSSKCKAWWLNERVKFKL--CHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKI
        ++L  + G+ E+ +  L P  D ++++  ++  +     + +K  L    +  ++G +  +++ +  ++   L                  KV + R ++
Subjt:  ASLLLEEGREEEVISLLSPPKDSNSISSSSSKCKAWWLNERVKFKL--CHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKI

Query:  LDIRETGGLFRRFRPVALRSDLSKASRARKLLLKRERIKEEKKAKALASGVNLNYDDLDDEATLRMHQESPLPNLLKDEECHILIFDLCKALASLGRCSE
          I                   SK+      L+K  R K       ++   +    + D +A   +     L ++L  ++   L+     +L  L R  E
Subjt:  LDIRETGGLFRRFRPVALRSDLSKASRARKLLLKRERIKEEKKAKALASGVNLNYDDLDDEATLRMHQESPLPNLLKDEECHILIFDLCKALASLGRCSE

Query:  ALEIISLTLKLAFNSLSLERKEELQLLGAQLAFSATDTKHVFNFAKHVVKQYPYSNSAWNCYYKVSARMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQ
        A  ++  +L+        ++++EL+  G   A    + +  +N+ + +V +       WN + +V+  M ++D RH +    +  K  +     ++ GH 
Subjt:  ALEIISLTLKLAFNSLSLERKEELQLLGAQLAFSATDTKHVFNFAKHVVKQYPYSNSAWNCYYKVSARMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQ

Query:  FTAISHHQDAARKYLEAYKILPDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDNSQEALYNIARAYHHVGLVTMAVTYYEKVLATYQ
               + A  +Y++A++  PD PL + C+G + I++     +  +H  I Q  +FL + L L    QE+ YN+ R  H +GL+ +A+ YY+K L    
Subjt:  FTAISHHQDAARKYLEAYKILPDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDNSQEALYNIARAYHHVGLVTMAVTYYEKVLATYQ

Query:  KDCLIPELFGENRSVKDKRSVHCDLRREAAYNLHLIYKESGALDLARQVLKDHCT
            +P L  E   +        DLRR+ AYNL LIY+ SG   +A+ +L  +C+
Subjt:  KDCLIPELFGENRSVKDKRSVHCDLRREAAYNLHLIYKESGALDLARQVLKDHCT

Arabidopsis top hitse value%identityAlignment
AT1G17680.1 tetratricopeptide repeat (TPR)-containing protein2.2e-17741.28Show/hide
Query:  EEGNKISDNEEVPGCLVRGNELV--ETEVEDREDREEEEEEEEEEEEEEEEEEEEEEEEEEEEVEGEGEEEEDGYIFKFKAGENPFDFVEGTNFSIQPYK
        +EGN IS+ EE P  +    +++  +T  +D++   +EE   ++++++ ++++E +E EEE++               F+AG  P               
Subjt:  EEGNKISDNEEVPGCLVRGNELV--ETEVEDREDREEEEEEEEEEEEEEEEEEEEEEEEEEEEVEGEGEEEEDGYIFKFKAGENPFDFVEGTNFSIQPYK

Query:  KFERLEYEALAEKKRKALAAGQSERSAKRGRVEDISGASFDEIMEAMNYGSKRKRKELKKRGRRKGSKKKLNRDVTKLLGEATLCYAQGQYEKAISLLSQ
         FER EYEALAE+KRKALA  Q   S        + G      ME M+ G +RK ++ KK+GRR GSKK++  D+ K   EA   +A G+  +A+ +L +
Subjt:  KFERLEYEALAEKKRKALAAGQSERSAKRGRVEDISGASFDEIMEAMNYGSKRKRKELKKRGRRKGSKKKLNRDVTKLLGEATLCYAQGQYEKAISLLSQ

Query:  VVLQAPDVPDSYHTLGLIYNAI-DDDVRAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPEDINLLFYHASIYLDRGDCQKAAET
        V+ QAP    +Y+ L  +   +   +  +     +AA++    S  WKLL+    ++ +I  A    SKAI+A+P+DI L + +A I L+ G  ++AAET
Subjt:  VVLQAPDVPDSYHTLGLIYNAI-DDDVRAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPEDINLLFYHASIYLDRGDCQKAAET

Query:  YDQIHQKCQGNVEALMKGAKLYQKCGHLERAICILEDYIKGHQAGADLDVVDLLASLYMGSKEFCKALERIEHADEVYCAGKDLPLNLTAKAGICHAHLG
        ++QI ++C   +EAL  G + + K G  ERA  ILED+IK H +    DV+DLLAS++M      +AL+ I    ++Y  GK+L  +L  +  ICH HL 
Subjt:  YDQIHQKCQGNVEALMKGAKLYQKCGHLERAICILEDYIKGHQAGADLDVVDLLASLYMGSKEFCKALERIEHADEVYCAGKDLPLNLTAKAGICHAHLG

Query:  NMEKAECLFANLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYLMSEKANAGGNVGILYLKIAQCYLSTNERAEAIIFFYKVIQHLEDNVNARLTLASL
         ME+AE + + L ++A  +H  L+  +AD L  + ++  ALKYY+ +      GN   L++KIA+CY+S  ER +AI+F+YK +  L D V+ R+TLASL
Subjt:  NMEKAECLFANLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYLMSEKANAGGNVGILYLKIAQCYLSTNERAEAIIFFYKVIQHLEDNVNARLTLASL

Query:  LLEEGREEEVISLLSPPKDSNSISSSSSKCKAWWLNERVKFKLCHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKILDIRE
        LLE+G+ +E + +LSPP++ +     ++K KAWW N +++  LC IY ++GMLE+F      LVL+ ++  T+       K K  + VL E  +    R 
Subjt:  LLEEGREEEVISLLSPPKDSNSISSSSSKCKAWWLNERVKFKLCHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKILDIRE

Query:  TGGLFRRFRPVALRSDLSKASRARKLLLKRERIKEEKKAKALASGVNLNYDDLDDEATLRMHQESPLPNLLKDEECHILIFDLCKALASLGRCSEALEII
             R  +   LR    K  + R  L +  RI+E    KA       + +D+  E+            ++KDEE H L  DLCKALASL R  EALEI+
Subjt:  TGGLFRRFRPVALRSDLSKASRARKLLLKRERIKEEKKAKALASGVNLNYDDLDDEATLRMHQESPLPNLLKDEECHILIFDLCKALASLGRCSEALEII

Query:  SLTLKLAFNSLSLERKEELQLLGAQLAFSATDTKHVFNFAKHVVKQYPYSNSAWNCYYKVSARMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAIS
        +L  +L    L +E K+ELQ LGA+++    D K  F+  + V++Q+PY  +AWNCYY V +R+  R S   K ++ +++KY+DC PP LIAGH FT  S
Subjt:  SLTLKLAFNSLSLERKEELQLLGAQLAFSATDTKHVFNFAKHVVKQYPYSNSAWNCYYKVSARMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAIS

Query:  HHQDAARKYLEAYKILPDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDNSQEALYNIARAYHHVGLVTMAVTYYEKVLATYQKDCLI
         HQDAAR+YLEAYK++P+SPL+NLCVG++LINL LGFRL+N+H+C+AQ   FLY NL++C NSQEALYN+ARAY HVGLVT+A +YYEKVLA Y+KD  +
Subjt:  HHQDAARKYLEAYKILPDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDNSQEALYNIARAYHHVGLVTMAVTYYEKVLATYQKDCLI

Query:  PELFGENRSVKDKRS-VHCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        P+L  E+  V ++R  V+CDLR+EAA+NLHLIYK SGA DLARQVLKDHCTF
Subjt:  PELFGENRSVKDKRS-VHCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

AT1G17680.2 tetratricopeptide repeat (TPR)-containing protein2.2e-17741.28Show/hide
Query:  EEGNKISDNEEVPGCLVRGNELV--ETEVEDREDREEEEEEEEEEEEEEEEEEEEEEEEEEEEVEGEGEEEEDGYIFKFKAGENPFDFVEGTNFSIQPYK
        +EGN IS+ EE P  +    +++  +T  +D++   +EE   ++++++ ++++E +E EEE++               F+AG  P               
Subjt:  EEGNKISDNEEVPGCLVRGNELV--ETEVEDREDREEEEEEEEEEEEEEEEEEEEEEEEEEEEVEGEGEEEEDGYIFKFKAGENPFDFVEGTNFSIQPYK

Query:  KFERLEYEALAEKKRKALAAGQSERSAKRGRVEDISGASFDEIMEAMNYGSKRKRKELKKRGRRKGSKKKLNRDVTKLLGEATLCYAQGQYEKAISLLSQ
         FER EYEALAE+KRKALA  Q   S        + G      ME M+ G +RK ++ KK+GRR GSKK++  D+ K   EA   +A G+  +A+ +L +
Subjt:  KFERLEYEALAEKKRKALAAGQSERSAKRGRVEDISGASFDEIMEAMNYGSKRKRKELKKRGRRKGSKKKLNRDVTKLLGEATLCYAQGQYEKAISLLSQ

Query:  VVLQAPDVPDSYHTLGLIYNAI-DDDVRAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPEDINLLFYHASIYLDRGDCQKAAET
        V+ QAP    +Y+ L  +   +   +  +     +AA++    S  WKLL+    ++ +I  A    SKAI+A+P+DI L + +A I L+ G  ++AAET
Subjt:  VVLQAPDVPDSYHTLGLIYNAI-DDDVRAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPEDINLLFYHASIYLDRGDCQKAAET

Query:  YDQIHQKCQGNVEALMKGAKLYQKCGHLERAICILEDYIKGHQAGADLDVVDLLASLYMGSKEFCKALERIEHADEVYCAGKDLPLNLTAKAGICHAHLG
        ++QI ++C   +EAL  G + + K G  ERA  ILED+IK H +    DV+DLLAS++M      +AL+ I    ++Y  GK+L  +L  +  ICH HL 
Subjt:  YDQIHQKCQGNVEALMKGAKLYQKCGHLERAICILEDYIKGHQAGADLDVVDLLASLYMGSKEFCKALERIEHADEVYCAGKDLPLNLTAKAGICHAHLG

Query:  NMEKAECLFANLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYLMSEKANAGGNVGILYLKIAQCYLSTNERAEAIIFFYKVIQHLEDNVNARLTLASL
         ME+AE + + L ++A  +H  L+  +AD L  + ++  ALKYY+ +      GN   L++KIA+CY+S  ER +AI+F+YK +  L D V+ R+TLASL
Subjt:  NMEKAECLFANLGRDAAYDHSNLMIEVADSLLTLKHYDLALKYYLMSEKANAGGNVGILYLKIAQCYLSTNERAEAIIFFYKVIQHLEDNVNARLTLASL

Query:  LLEEGREEEVISLLSPPKDSNSISSSSSKCKAWWLNERVKFKLCHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKILDIRE
        LLE+G+ +E + +LSPP++ +     ++K KAWW N +++  LC IY ++GMLE+F      LVL+ ++  T+       K K  + VL E  +    R 
Subjt:  LLEEGREEEVISLLSPPKDSNSISSSSSKCKAWWLNERVKFKLCHIYKTKGMLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKILDIRE

Query:  TGGLFRRFRPVALRSDLSKASRARKLLLKRERIKEEKKAKALASGVNLNYDDLDDEATLRMHQESPLPNLLKDEECHILIFDLCKALASLGRCSEALEII
             R  +   LR    K  + R  L +  RI+E    KA       + +D+  E+            ++KDEE H L  DLCKALASL R  EALEI+
Subjt:  TGGLFRRFRPVALRSDLSKASRARKLLLKRERIKEEKKAKALASGVNLNYDDLDDEATLRMHQESPLPNLLKDEECHILIFDLCKALASLGRCSEALEII

Query:  SLTLKLAFNSLSLERKEELQLLGAQLAFSATDTKHVFNFAKHVVKQYPYSNSAWNCYYKVSARMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAIS
        +L  +L    L +E K+ELQ LGA+++    D K  F+  + V++Q+PY  +AWNCYY V +R+  R S   K ++ +++KY+DC PP LIAGH FT  S
Subjt:  SLTLKLAFNSLSLERKEELQLLGAQLAFSATDTKHVFNFAKHVVKQYPYSNSAWNCYYKVSARMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAIS

Query:  HHQDAARKYLEAYKILPDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDNSQEALYNIARAYHHVGLVTMAVTYYEKVLATYQKDCLI
         HQDAAR+YLEAYK++P+SPL+NLCVG++LINL LGFRL+N+H+C+AQ   FLY NL++C NSQEALYN+ARAY HVGLVT+A +YYEKVLA Y+KD  +
Subjt:  HHQDAARKYLEAYKILPDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDNSQEALYNIARAYHHVGLVTMAVTYYEKVLATYQKDCLI

Query:  PELFGENRSVKDKRS-VHCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        P+L  E+  V ++R  V+CDLR+EAA+NLHLIYK SGA DLARQVLKDHCTF
Subjt:  PELFGENRSVKDKRS-VHCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAAGAAGGGAACAAAATTTCTGACAATGAAGAGGTTCCTGGTTGTCTTGTGCGAGGAAACGAACTTGTAGAAACAGAAGTAGAAGATAGAGAGGATAGAGAGGA
GGAGGAGGAGGAAGAGGAAGAGGAAGAGGAAGAGGAAGAGGAAGAGGAAGAAGAAGAAGAAGAAGAAGAAGAGGAAGAAGTGGAGGGTGAAGGAGAAGAAGAAGAAGATG
GTTACATTTTCAAATTTAAGGCTGGAGAAAATCCATTTGATTTTGTTGAAGGGACAAATTTTAGCATCCAACCATATAAAAAATTTGAGCGCCTTGAATACGAAGCTCTT
GCCGAGAAAAAGCGAAAAGCTCTTGCAGCTGGTCAGAGCGAGAGATCTGCAAAGAGAGGCAGGGTAGAGGATATTTCTGGTGCAAGCTTTGATGAAATTATGGAAGCTAT
GAATTATGGATCTAAGAGAAAGCGAAAGGAGCTTAAAAAAAGAGGTAGACGGAAAGGATCAAAGAAAAAACTTAATCGTGATGTTACAAAGTTGCTTGGTGAAGCAACTT
TATGCTATGCTCAAGGCCAGTATGAGAAGGCTATATCTCTATTGAGTCAAGTTGTTCTGCAAGCCCCAGATGTACCTGATTCATACCATACGTTGGGACTTATTTATAAT
GCAATTGATGATGATGTAAGAGCCATGGGATTCTACATGCTTGCTGCACATTTAATGCCAAGAGATTCATCTCTCTGGAAACTGCTATTTTCATGGTCAATTGACCGAGG
TGACATTGATCAAGCAAGCTACTGCCTTTCTAAAGCAATAAAAGCAGAGCCCGAAGATATTAATTTATTATTTTATCATGCATCAATCTACCTTGATCGTGGAGATTGTC
AAAAAGCTGCTGAAACATATGATCAAATTCATCAAAAATGTCAGGGAAATGTTGAAGCACTAATGAAAGGAGCAAAGCTGTACCAAAAATGCGGTCATCTTGAACGGGCA
ATTTGCATTCTTGAAGACTACATCAAAGGGCATCAAGCAGGGGCTGATTTAGATGTGGTTGATCTTCTAGCTTCTTTATACATGGGAAGTAAAGAATTCTGCAAAGCTCT
TGAGCGCATTGAGCATGCAGATGAGGTGTACTGTGCAGGAAAGGATCTACCTTTGAATTTGACTGCTAAAGCAGGAATTTGCCACGCCCATCTTGGAAATATGGAGAAGG
CAGAGTGCCTTTTTGCTAATTTGGGGCGGGATGCTGCATACGATCACTCAAATTTGATGATTGAAGTTGCAGACTCGTTGCTGACTCTTAAGCACTATGACTTGGCATTG
AAGTATTATCTGATGTCAGAAAAAGCTAATGCTGGAGGGAACGTGGGAATTCTATACCTTAAAATTGCCCAATGTTACTTATCAACTAATGAAAGAGCAGAGGCAATTAT
TTTCTTTTATAAAGTAATTCAACATCTTGAAGATAACGTTAATGCTCGATTAACTTTGGCTTCCCTCCTCCTTGAGGAAGGTAGAGAAGAAGAAGTCATTTCATTACTAT
CTCCTCCGAAAGATTCAAACTCAATTAGCTCATCTTCCAGCAAATGCAAAGCTTGGTGGCTCAATGAAAGAGTAAAGTTTAAACTTTGCCACATATACAAAACTAAAGGA
ATGCTTGAGAACTTCATTGAGGCAATCGGTCCTTTGGTTCTTCAGTCCTTATATATAGAGACTCTTCAAGAAAAGATTAAAGTGAACAAGAAGAAGCTTTCAAAGAAGGT
TTTGCTTGAAAGAGTCAAAATACTAGATATACGTGAAACTGGTGGCCTATTTCGTAGATTCAGACCTGTAGCTCTGAGATCAGATTTATCAAAGGCGTCCAGAGCAAGGA
AATTGCTTCTGAAGAGGGAAAGAATCAAGGAAGAAAAGAAGGCTAAAGCTCTGGCTTCTGGGGTCAATTTGAACTATGATGATTTAGATGATGAGGCCACGCTACGGATG
CATCAAGAATCTCCCCTGCCTAATCTTCTGAAAGATGAAGAATGTCATATTCTTATTTTCGATTTGTGCAAGGCACTGGCTTCCTTGGGAAGATGTTCTGAAGCTTTAGA
GATAATAAGTCTAACTTTAAAGTTGGCTTTTAACTCATTGTCTTTGGAGAGGAAGGAGGAACTTCAGTTACTCGGAGCTCAACTAGCATTCAGCGCAACTGATACCAAGC
ATGTATTTAATTTTGCAAAGCATGTTGTTAAACAGTACCCTTACAGCAACTCTGCTTGGAACTGCTATTATAAAGTATCTGCAAGAATGACGAACCGGGATTCAAGGCAT
TGCAAACTTCTGAATAGCATGCAAGCAAAATACAAAGATTGTGCACCGCCCTATCTCATAGCAGGACATCAGTTTACCGCCATTAGCCATCATCAAGATGCTGCAAGGAA
ATATCTTGAAGCTTATAAAATATTGCCTGATAGTCCCCTAGTTAACTTATGTGTTGGATCGTCATTAATCAACTTGACACTTGGATTTCGTCTTCAAAACAAGCATCAGT
GTATTGCACAAGCCCTGACATTCCTCTATAAGAATTTGAAGCTTTGTGATAACAGCCAGGAAGCCCTATACAACATTGCTCGAGCATATCATCACGTTGGACTTGTGACG
ATGGCGGTTACGTATTACGAAAAGGTGCTTGCAACTTACCAGAAGGATTGCCTCATCCCAGAACTTTTTGGTGAGAACCGAAGCGTTAAGGATAAAAGATCAGTCCACTG
TGACCTACGCAGAGAAGCAGCTTACAATTTGCATCTTATATACAAAGAAAGTGGAGCTCTTGATCTTGCTAGGCAAGTCCTAAAAGATCATTGCACATTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAAGAAGGGAACAAAATTTCTGACAATGAAGAGGTTCCTGGTTGTCTTGTGCGAGGAAACGAACTTGTAGAAACAGAAGTAGAAGATAGAGAGGATAGAGAGGA
GGAGGAGGAGGAAGAGGAAGAGGAAGAGGAAGAGGAAGAGGAAGAGGAAGAAGAAGAAGAAGAAGAAGAAGAGGAAGAAGTGGAGGGTGAAGGAGAAGAAGAAGAAGATG
GTTACATTTTCAAATTTAAGGCTGGAGAAAATCCATTTGATTTTGTTGAAGGGACAAATTTTAGCATCCAACCATATAAAAAATTTGAGCGCCTTGAATACGAAGCTCTT
GCCGAGAAAAAGCGAAAAGCTCTTGCAGCTGGTCAGAGCGAGAGATCTGCAAAGAGAGGCAGGGTAGAGGATATTTCTGGTGCAAGCTTTGATGAAATTATGGAAGCTAT
GAATTATGGATCTAAGAGAAAGCGAAAGGAGCTTAAAAAAAGAGGTAGACGGAAAGGATCAAAGAAAAAACTTAATCGTGATGTTACAAAGTTGCTTGGTGAAGCAACTT
TATGCTATGCTCAAGGCCAGTATGAGAAGGCTATATCTCTATTGAGTCAAGTTGTTCTGCAAGCCCCAGATGTACCTGATTCATACCATACGTTGGGACTTATTTATAAT
GCAATTGATGATGATGTAAGAGCCATGGGATTCTACATGCTTGCTGCACATTTAATGCCAAGAGATTCATCTCTCTGGAAACTGCTATTTTCATGGTCAATTGACCGAGG
TGACATTGATCAAGCAAGCTACTGCCTTTCTAAAGCAATAAAAGCAGAGCCCGAAGATATTAATTTATTATTTTATCATGCATCAATCTACCTTGATCGTGGAGATTGTC
AAAAAGCTGCTGAAACATATGATCAAATTCATCAAAAATGTCAGGGAAATGTTGAAGCACTAATGAAAGGAGCAAAGCTGTACCAAAAATGCGGTCATCTTGAACGGGCA
ATTTGCATTCTTGAAGACTACATCAAAGGGCATCAAGCAGGGGCTGATTTAGATGTGGTTGATCTTCTAGCTTCTTTATACATGGGAAGTAAAGAATTCTGCAAAGCTCT
TGAGCGCATTGAGCATGCAGATGAGGTGTACTGTGCAGGAAAGGATCTACCTTTGAATTTGACTGCTAAAGCAGGAATTTGCCACGCCCATCTTGGAAATATGGAGAAGG
CAGAGTGCCTTTTTGCTAATTTGGGGCGGGATGCTGCATACGATCACTCAAATTTGATGATTGAAGTTGCAGACTCGTTGCTGACTCTTAAGCACTATGACTTGGCATTG
AAGTATTATCTGATGTCAGAAAAAGCTAATGCTGGAGGGAACGTGGGAATTCTATACCTTAAAATTGCCCAATGTTACTTATCAACTAATGAAAGAGCAGAGGCAATTAT
TTTCTTTTATAAAGTAATTCAACATCTTGAAGATAACGTTAATGCTCGATTAACTTTGGCTTCCCTCCTCCTTGAGGAAGGTAGAGAAGAAGAAGTCATTTCATTACTAT
CTCCTCCGAAAGATTCAAACTCAATTAGCTCATCTTCCAGCAAATGCAAAGCTTGGTGGCTCAATGAAAGAGTAAAGTTTAAACTTTGCCACATATACAAAACTAAAGGA
ATGCTTGAGAACTTCATTGAGGCAATCGGTCCTTTGGTTCTTCAGTCCTTATATATAGAGACTCTTCAAGAAAAGATTAAAGTGAACAAGAAGAAGCTTTCAAAGAAGGT
TTTGCTTGAAAGAGTCAAAATACTAGATATACGTGAAACTGGTGGCCTATTTCGTAGATTCAGACCTGTAGCTCTGAGATCAGATTTATCAAAGGCGTCCAGAGCAAGGA
AATTGCTTCTGAAGAGGGAAAGAATCAAGGAAGAAAAGAAGGCTAAAGCTCTGGCTTCTGGGGTCAATTTGAACTATGATGATTTAGATGATGAGGCCACGCTACGGATG
CATCAAGAATCTCCCCTGCCTAATCTTCTGAAAGATGAAGAATGTCATATTCTTATTTTCGATTTGTGCAAGGCACTGGCTTCCTTGGGAAGATGTTCTGAAGCTTTAGA
GATAATAAGTCTAACTTTAAAGTTGGCTTTTAACTCATTGTCTTTGGAGAGGAAGGAGGAACTTCAGTTACTCGGAGCTCAACTAGCATTCAGCGCAACTGATACCAAGC
ATGTATTTAATTTTGCAAAGCATGTTGTTAAACAGTACCCTTACAGCAACTCTGCTTGGAACTGCTATTATAAAGTATCTGCAAGAATGACGAACCGGGATTCAAGGCAT
TGCAAACTTCTGAATAGCATGCAAGCAAAATACAAAGATTGTGCACCGCCCTATCTCATAGCAGGACATCAGTTTACCGCCATTAGCCATCATCAAGATGCTGCAAGGAA
ATATCTTGAAGCTTATAAAATATTGCCTGATAGTCCCCTAGTTAACTTATGTGTTGGATCGTCATTAATCAACTTGACACTTGGATTTCGTCTTCAAAACAAGCATCAGT
GTATTGCACAAGCCCTGACATTCCTCTATAAGAATTTGAAGCTTTGTGATAACAGCCAGGAAGCCCTATACAACATTGCTCGAGCATATCATCACGTTGGACTTGTGACG
ATGGCGGTTACGTATTACGAAAAGGTGCTTGCAACTTACCAGAAGGATTGCCTCATCCCAGAACTTTTTGGTGAGAACCGAAGCGTTAAGGATAAAAGATCAGTCCACTG
TGACCTACGCAGAGAAGCAGCTTACAATTTGCATCTTATATACAAAGAAAGTGGAGCTCTTGATCTTGCTAGGCAAGTCCTAAAAGATCATTGCACATTTTAA
Protein sequenceShow/hide protein sequence
MEEEGNKISDNEEVPGCLVRGNELVETEVEDREDREEEEEEEEEEEEEEEEEEEEEEEEEEEEVEGEGEEEEDGYIFKFKAGENPFDFVEGTNFSIQPYKKFERLEYEAL
AEKKRKALAAGQSERSAKRGRVEDISGASFDEIMEAMNYGSKRKRKELKKRGRRKGSKKKLNRDVTKLLGEATLCYAQGQYEKAISLLSQVVLQAPDVPDSYHTLGLIYN
AIDDDVRAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPEDINLLFYHASIYLDRGDCQKAAETYDQIHQKCQGNVEALMKGAKLYQKCGHLERA
ICILEDYIKGHQAGADLDVVDLLASLYMGSKEFCKALERIEHADEVYCAGKDLPLNLTAKAGICHAHLGNMEKAECLFANLGRDAAYDHSNLMIEVADSLLTLKHYDLAL
KYYLMSEKANAGGNVGILYLKIAQCYLSTNERAEAIIFFYKVIQHLEDNVNARLTLASLLLEEGREEEVISLLSPPKDSNSISSSSSKCKAWWLNERVKFKLCHIYKTKG
MLENFIEAIGPLVLQSLYIETLQEKIKVNKKKLSKKVLLERVKILDIRETGGLFRRFRPVALRSDLSKASRARKLLLKRERIKEEKKAKALASGVNLNYDDLDDEATLRM
HQESPLPNLLKDEECHILIFDLCKALASLGRCSEALEIISLTLKLAFNSLSLERKEELQLLGAQLAFSATDTKHVFNFAKHVVKQYPYSNSAWNCYYKVSARMTNRDSRH
CKLLNSMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYLEAYKILPDSPLVNLCVGSSLINLTLGFRLQNKHQCIAQALTFLYKNLKLCDNSQEALYNIARAYHHVGLVT
MAVTYYEKVLATYQKDCLIPELFGENRSVKDKRSVHCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF