; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10002345 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10002345
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptiongeneral transcription factor 3C polypeptide 3
Genome locationChr11:5823615..5842501
RNA-Seq ExpressionHG10002345
SyntenyHG10002345
Gene Ontology termsGO:0006383 - transcription by RNA polymerase III (biological process)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily
IPR019734 - Tetratricopeptide repeat
IPR039340 - Transcription factor Tfc4/TFIIIC-102/Sfc4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008447634.1 PREDICTED: general transcription factor 3C polypeptide 3 isoform X1 [Cucumis melo]0.0e+0091.5Show/hide
Query:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA
        MEKEG++ISD+EEVPG VM VLG EKE VETGV  R   EEEEEEEEGEEEVEDEGEDDIEEEDGY FKFKAGENPFDFVEGTDFS+QPYKKFERLEYEA
Subjt:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA

Query:  LAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLP
        LAEKKRKALANGQSERAAKRGRVED+ GASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAIS+LRQVVLQAPDLP
Subjt:  LAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLP

Query:  DSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLG
        DSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSI+RGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLG
Subjt:  DSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLG

Query:  NVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLFA
        NVEALMTGAKLYQKCGHLERAICILEDYIKEHP+EADLDVVDLLASLYMGSKEFSKALE IEHAD VYCAGNELPLNLT KAGICH HLGN+EKAECLFA
Subjt:  NVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLFA

Query:  NLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEA
        NL RET  DHSNLMIE ADSLLSLKHY+ ALKYYLMSEEVNAG NMGILY K+A+CYLST+E+ QAIVFFYKVLQH+EDNINARLTLASLLLEEARDEEA
Subjt:  NLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEA

Query:  ISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFKP
        ISLLSPPKDSNPTSSSS KLKPWWLNEKVKLKLC IY+TRG+LENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGF+P
Subjt:  ISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFKP

Query:  VAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEIISLTLKLAFNS
        VAPKSDL+KASRAKRLLQKR+RIKEEKKA+ LAAGV+V+YDDLDDEPA                        LCKALASLGRCSEALEIISLTLKLAFNS
Subjt:  VAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEIISLTLKLAFNS

Query:  LSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYL
        LS ERKEELQLLGAQLAFSST TMHGFNFAKHVVKQYPYSISAWNCYYKVAS LTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYL
Subjt:  LSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYL

Query:  EAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNI
        EAYKIMPDSPLINLCVG+SLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDN+QEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN+NI
Subjt:  EAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNI

Query:  KHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        KHQ SVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
Subjt:  KHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

XP_011652346.1 general transcription factor 3C polypeptide 3 [Cucumis sativus]0.0e+0092.36Show/hide
Query:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE-EEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYE
        MEKEGS+ISD EEVPG VM VLG EKEVVETGVE RE EEEEEEEEEEGEEEVEDEGEDDIEEEDGY FKFKAGENPFDFVEGTDFS+QPYKKFERLEYE
Subjt:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE-EEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYE

Query:  ALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDL
        ALAEKKRKALANGQSERAAKRGRVEDI GASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQG+HEKAIS+LRQVVL+APDL
Subjt:  ALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDL

Query:  PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL
        PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL
Subjt:  PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL

Query:  GNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLF
        GNVEALMTGAKLYQKCGHLERAICILEDYIK HP+EADLDVVDLLASLYMGSKEFSKALERIEHAD VYCAGNELPLNLTTKAGICH HLG++EKAECLF
Subjt:  GNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLF

Query:  ANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEE
        ANL RET  DHSNLMIE ADSL+SLKHY+ ALKYYLMSEEVNAG NMGILYLKIA+CYLST+ER QAIVFFYKVLQH+EDNINARLTLASLLLEEARD+E
Subjt:  ANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEE

Query:  AISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK
        AISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLC IY+TRG+LENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK
Subjt:  AISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK

Query:  PVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEIISLTLKLAFN
        PVAPKSDL+KASRAKRLLQKRERIKEEKKA+ALAAGV+++YDDLDDEPA                        LCKALASLGRCSEALEIISLTLKLAFN
Subjt:  PVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEIISLTLKLAFN

Query:  SLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKY
        SLS+ERKEELQLLGAQLAFSST TMHGFNFAKHVVKQYPYSISAWNCYYKVAS LTNRDSRHCKLLNSMQ+KYKDCAPPYIIAGHQFTTISHHQDAARKY
Subjt:  SLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKY

Query:  LEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQN
        LEAYKIMPDSPLINLCVG+SLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDN+QEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN+N
Subjt:  LEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQN

Query:  IKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        IKHQ SVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
Subjt:  IKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

XP_022152620.1 general transcription factor 3C polypeptide 3 isoform X2 [Momordica charantia]0.0e+0087.05Show/hide
Query:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE---------EEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK
        MEKEG++ISDN+EVPG  + V G  K + ET VE RE         EEEEEEEEEE EEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK
Subjt:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE---------EEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK

Query:  KFERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQ
        KFERLEYEALAEKKRKALAN QSER  KRGR+EDIPGASF+EI+EAMNYGSRRKLKEPK+RGRRKGSKKK+NR++TKLLGDATLCYAQGQ+EKAIS+LRQ
Subjt:  KFERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQ

Query:  VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY
        VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDC+KAAETY
Subjt:  VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY

Query:  DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGN
        DQIHQ+C+GNVEALMTGAKLYQKCGH ERAICILEDYIK HPTEADLDVVDLLASLYMGSKEFSKALE IEHAD+VYCAGNE+PLNL TKAGICHVHLGN
Subjt:  DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGN

Query:  MEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLL
        +EKAE LFANLGR+TA+DHS+ +IEAADSLLSLKH+NLALKYYLMSEEVNAGG MGI+YLKIAQCYLST+ER +AIVFFYKVLQ LEDNINARLTLASLL
Subjt:  MEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLL

Query:  LEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRET
        LEEAR+EEAISLLSPPKDSN +SSSSSK KPWWLNEKVKLKLC IY+T+GMLENFVE IF LVRESLYIETL+EKIKVNKKKLPRRVLLERVKVLDGRET
Subjt:  LEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRET

Query:  GNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEIIS
        G+LFRGF+PVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGV+V+YDD+DDEPA                        LCKALASLGRCSEALEIIS
Subjt:  GNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEIIS

Query:  LTLKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISH
        LTLKLAFNSLS+ERKEELQLLGAQLAFSSTDT HGFNFAKHVVKQYPYS SAWNCYYKV+S +T+RDSRHCKLLNS+QAKYKDCAPP+IIAGHQF  ISH
Subjt:  LTLKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISH

Query:  HQDAARKYLEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP
        HQ+AA+KYLEAYK++PDSPLINLCVGA+LINLALG RLQNKHQCVAQGLAFLY NLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP
Subjt:  HQDAARKYLEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP

Query:  ELFGENQNIKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        ++FGEN+NIKH+KSVYCDLRREAAYNLHLIYK+SGALDLARQVLKDHCTF
Subjt:  ELFGENQNIKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

XP_022152621.1 general transcription factor 3C polypeptide 3 isoform X3 [Momordica charantia]0.0e+0089.21Show/hide
Query:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE---------EEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK
        MEKEG++ISDN+EVPG  + V G  K + ET VE RE         EEEEEEEEEE EEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK
Subjt:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE---------EEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK

Query:  KFERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQ
        KFERLEYEALAEKKRKALAN QSER  KRGR+EDIPGASF+EI+EAMNYGSRRKLKEPK+RGRRKGSKKK+NR++TKLLGDATLCYAQGQ+EKAIS+LRQ
Subjt:  KFERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQ

Query:  VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY
        VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDC+KAAETY
Subjt:  VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY

Query:  DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGN
        DQIHQ+C+GNVEALMTGAKLYQKCGH ERAICILEDYIK HPTEADLDVVDLLASLYMGSKEFSKALE IEHAD+VYCAGNE+PLNL TKAGICHVHLGN
Subjt:  DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGN

Query:  MEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNM-GILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASL
        +EKAE LFANLGR+TA+DHS+ +IEAADSLLSLKH+NLALKYYLMSEEVNAGG M GI+YLKIAQCYLST+ER +AIVFFYKVLQ LEDNINARLTLASL
Subjt:  MEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNM-GILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASL

Query:  LLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRE
        LLEEAR+EEAISLLSPPKDSN +SSSSSK KPWWLNEKVKLKLC IY+T+GMLENFVE IF LVRESLYIETL+EKIKVNKKKLPRRVLLERVKVLDGRE
Subjt:  LLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRE

Query:  TGNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGA
        TG+LFRGF+PVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGV+V+YDD+DDEPALCKALASLGRCSEALEIISLTLKLAFNSLS+ERKEELQLLGA
Subjt:  TGNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGA

Query:  QLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINL
        QLAFSSTDT HGFNFAKHVVKQYPYS SAWNCYYKV+S +T+RDSRHCKLLNS+QAKYKDCAPP+IIAGHQF  ISHHQ+AA+KYLEAYK++PDSPLINL
Subjt:  QLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINL

Query:  CVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNIKHQKSVYCDLRREA
        CVGA+LINLALG RLQNKHQCVAQGLAFLY NLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP++FGEN+NIKH+KSVYCDLRREA
Subjt:  CVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNIKHQKSVYCDLRREA

Query:  AYNLHLIYKESGALDLARQVLKDHCTF
        AYNLHLIYK+SGALDLARQVLKDHCTF
Subjt:  AYNLHLIYKESGALDLARQVLKDHCTF

XP_038879313.1 general transcription factor 3C polypeptide 3 [Benincasa hispida]0.0e+0093.73Show/hide
Query:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA
        MEKEG+ +SDNEEVPGG M VLGA KEVVETGVE R  EEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA
Subjt:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA

Query:  LAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLP
        LAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKA+SILRQVVLQAPDLP
Subjt:  LAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLP

Query:  DSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLG
        DSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDI+LLFHRASLYLERGDCEKAAETYDQIHQQCLG
Subjt:  DSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLG

Query:  NVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLFA
        NVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMG+KEF KALERIEHADEVYCAG+ELPL LTTK GICH HLGNMEKAECLFA
Subjt:  NVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLFA

Query:  NLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEA
        NLG ETA+DHSNLMIE ADSLLSLKHYNLALKYYLM EEVNAGGNMGILYLKIAQCYLST+ER QAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEA
Subjt:  NLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEA

Query:  ISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFKP
        ISLLSPPKDSNPTSSSSSKLKPWW NEKVKLKLC IYKTRGMLE+FVEVIFPLVRESLYIETLQEKIKVNKKKLP+RVLLERVKVLDGRE+GNLFRGF+P
Subjt:  ISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFKP

Query:  VAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEIISLTLKLAFNS
        VAPKSDLSKASRAKRLLQKRERIKEEKKA+ALAAGVDVNYDDLDDEPA                        LCKALASLGRCSEALEIISLTLKLA NS
Subjt:  VAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEIISLTLKLAFNS

Query:  LSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYL
        LSIERKEELQLLGAQLAFSSTDT HGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFT ISHHQDAARKYL
Subjt:  LSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYL

Query:  EAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNI
        EAYKIMPDSPLINLCVG+SLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN+NI
Subjt:  EAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNI

Query:  KHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        KHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
Subjt:  KHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

TrEMBL top hitse value%identityAlignment
A0A0A0LGB1 Uncharacterized protein0.0e+0089.81Show/hide
Query:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE-EEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYE
        MEKEGS+ISD EEVPG VM VLG EKEVVETGVE RE EEEEEEEEEEGEEEVEDEGEDDIEEEDGY FKFKAGENPFDFVEGTDFS+QPYKKFERLEYE
Subjt:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE-EEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYE

Query:  ALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDL
        ALAEKKRKALANGQSERAAKRGRVEDI GASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQG+HEKAIS+LRQVVL+APDL
Subjt:  ALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDL

Query:  PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL
        PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL
Subjt:  PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL

Query:  GNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLF
        GNVEALMTGAKLYQKCGHLERAICILEDYIK HP+EADLDVVDLLASLYMGSKEFSKALERIEHAD VYCAGNELPLNLTTKAGICH HLG++EKAECLF
Subjt:  GNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLF

Query:  ANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEE
        ANL RET  DHSNLMIE ADSL+SLKHY+ ALKYYLMSEEVNAG NMGILYLKIA+CYLST+ER QAIVFFYKVLQH+EDNINARLTLASLLLEEARD+E
Subjt:  ANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEE

Query:  AISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK
        AISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLC IY+TRG+LENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK
Subjt:  AISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK

Query:  PVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEIISLTLKLAFN
        PVAPKSDL+KASRAKRLLQKRERIKEEKKA+ALAAGV+++YDDLDDEPA                        LCKALASLGRCSEALEIISLTL     
Subjt:  PVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEIISLTLKLAFN

Query:  SLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKY
                           +ST TMHGFNFAKHVVKQYPYSISAWNCYYKVAS LTNRDSRHCKLLNSMQ+KYKDCAPPYIIAGHQFTTISHHQDAARKY
Subjt:  SLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKY

Query:  LEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQN
        LEAYKIMPDSPLINLCVG+SLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDN+QEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN+N
Subjt:  LEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQN

Query:  IKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        IKHQ SVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
Subjt:  IKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

A0A1S3BHB9 general transcription factor 3C polypeptide 3 isoform X10.0e+0091.5Show/hide
Query:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA
        MEKEG++ISD+EEVPG VM VLG EKE VETGV  R   EEEEEEEEGEEEVEDEGEDDIEEEDGY FKFKAGENPFDFVEGTDFS+QPYKKFERLEYEA
Subjt:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA

Query:  LAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLP
        LAEKKRKALANGQSERAAKRGRVED+ GASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAIS+LRQVVLQAPDLP
Subjt:  LAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLP

Query:  DSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLG
        DSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSI+RGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLG
Subjt:  DSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLG

Query:  NVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLFA
        NVEALMTGAKLYQKCGHLERAICILEDYIKEHP+EADLDVVDLLASLYMGSKEFSKALE IEHAD VYCAGNELPLNLT KAGICH HLGN+EKAECLFA
Subjt:  NVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLFA

Query:  NLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEA
        NL RET  DHSNLMIE ADSLLSLKHY+ ALKYYLMSEEVNAG NMGILY K+A+CYLST+E+ QAIVFFYKVLQH+EDNINARLTLASLLLEEARDEEA
Subjt:  NLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEA

Query:  ISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFKP
        ISLLSPPKDSNPTSSSS KLKPWWLNEKVKLKLC IY+TRG+LENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGF+P
Subjt:  ISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFKP

Query:  VAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEIISLTLKLAFNS
        VAPKSDL+KASRAKRLLQKR+RIKEEKKA+ LAAGV+V+YDDLDDEPA                        LCKALASLGRCSEALEIISLTLKLAFNS
Subjt:  VAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEIISLTLKLAFNS

Query:  LSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYL
        LS ERKEELQLLGAQLAFSST TMHGFNFAKHVVKQYPYSISAWNCYYKVAS LTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYL
Subjt:  LSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYL

Query:  EAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNI
        EAYKIMPDSPLINLCVG+SLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDN+QEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN+NI
Subjt:  EAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNI

Query:  KHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        KHQ SVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
Subjt:  KHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

A0A6J1DEF9 general transcription factor 3C polypeptide 3 isoform X20.0e+0087.05Show/hide
Query:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE---------EEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK
        MEKEG++ISDN+EVPG  + V G  K + ET VE RE         EEEEEEEEEE EEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK
Subjt:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE---------EEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK

Query:  KFERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQ
        KFERLEYEALAEKKRKALAN QSER  KRGR+EDIPGASF+EI+EAMNYGSRRKLKEPK+RGRRKGSKKK+NR++TKLLGDATLCYAQGQ+EKAIS+LRQ
Subjt:  KFERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQ

Query:  VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY
        VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDC+KAAETY
Subjt:  VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY

Query:  DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGN
        DQIHQ+C+GNVEALMTGAKLYQKCGH ERAICILEDYIK HPTEADLDVVDLLASLYMGSKEFSKALE IEHAD+VYCAGNE+PLNL TKAGICHVHLGN
Subjt:  DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGN

Query:  MEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLL
        +EKAE LFANLGR+TA+DHS+ +IEAADSLLSLKH+NLALKYYLMSEEVNAGG MGI+YLKIAQCYLST+ER +AIVFFYKVLQ LEDNINARLTLASLL
Subjt:  MEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLL

Query:  LEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRET
        LEEAR+EEAISLLSPPKDSN +SSSSSK KPWWLNEKVKLKLC IY+T+GMLENFVE IF LVRESLYIETL+EKIKVNKKKLPRRVLLERVKVLDGRET
Subjt:  LEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRET

Query:  GNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEIIS
        G+LFRGF+PVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGV+V+YDD+DDEPA                        LCKALASLGRCSEALEIIS
Subjt:  GNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEIIS

Query:  LTLKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISH
        LTLKLAFNSLS+ERKEELQLLGAQLAFSSTDT HGFNFAKHVVKQYPYS SAWNCYYKV+S +T+RDSRHCKLLNS+QAKYKDCAPP+IIAGHQF  ISH
Subjt:  LTLKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISH

Query:  HQDAARKYLEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP
        HQ+AA+KYLEAYK++PDSPLINLCVGA+LINLALG RLQNKHQCVAQGLAFLY NLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP
Subjt:  HQDAARKYLEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP

Query:  ELFGENQNIKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        ++FGEN+NIKH+KSVYCDLRREAAYNLHLIYK+SGALDLARQVLKDHCTF
Subjt:  ELFGENQNIKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

A0A6J1DGK2 general transcription factor 3C polypeptide 3 isoform X10.0e+0086.96Show/hide
Query:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE---------EEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK
        MEKEG++ISDN+EVPG  + V G  K + ET VE RE         EEEEEEEEEE EEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK
Subjt:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE---------EEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK

Query:  KFERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQ
        KFERLEYEALAEKKRKALAN QSER  KRGR+EDIPGASF+EI+EAMNYGSRRKLKEPK+RGRRKGSKKK+NR++TKLLGDATLCYAQGQ+EKAIS+LRQ
Subjt:  KFERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQ

Query:  VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY
        VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDC+KAAETY
Subjt:  VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY

Query:  DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGN
        DQIHQ+C+GNVEALMTGAKLYQKCGH ERAICILEDYIK HPTEADLDVVDLLASLYMGSKEFSKALE IEHAD+VYCAGNE+PLNL TKAGICHVHLGN
Subjt:  DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGN

Query:  MEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNM-GILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASL
        +EKAE LFANLGR+TA+DHS+ +IEAADSLLSLKH+NLALKYYLMSEEVNAGG M GI+YLKIAQCYLST+ER +AIVFFYKVLQ LEDNINARLTLASL
Subjt:  MEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNM-GILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASL

Query:  LLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRE
        LLEEAR+EEAISLLSPPKDSN +SSSSSK KPWWLNEKVKLKLC IY+T+GMLENFVE IF LVRESLYIETL+EKIKVNKKKLPRRVLLERVKVLDGRE
Subjt:  LLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRE

Query:  TGNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEII
        TG+LFRGF+PVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGV+V+YDD+DDEPA                        LCKALASLGRCSEALEII
Subjt:  TGNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEII

Query:  SLTLKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTIS
        SLTLKLAFNSLS+ERKEELQLLGAQLAFSSTDT HGFNFAKHVVKQYPYS SAWNCYYKV+S +T+RDSRHCKLLNS+QAKYKDCAPP+IIAGHQF  IS
Subjt:  SLTLKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTIS

Query:  HHQDAARKYLEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPI
        HHQ+AA+KYLEAYK++PDSPLINLCVGA+LINLALG RLQNKHQCVAQGLAFLY NLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPI
Subjt:  HHQDAARKYLEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPI

Query:  PELFGENQNIKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        P++FGEN+NIKH+KSVYCDLRREAAYNLHLIYK+SGALDLARQVLKDHCTF
Subjt:  PELFGENQNIKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

A0A6J1DIA5 general transcription factor 3C polypeptide 3 isoform X30.0e+0089.21Show/hide
Query:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE---------EEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK
        MEKEG++ISDN+EVPG  + V G  K + ET VE RE         EEEEEEEEEE EEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK
Subjt:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE---------EEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK

Query:  KFERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQ
        KFERLEYEALAEKKRKALAN QSER  KRGR+EDIPGASF+EI+EAMNYGSRRKLKEPK+RGRRKGSKKK+NR++TKLLGDATLCYAQGQ+EKAIS+LRQ
Subjt:  KFERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQ

Query:  VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY
        VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDC+KAAETY
Subjt:  VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY

Query:  DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGN
        DQIHQ+C+GNVEALMTGAKLYQKCGH ERAICILEDYIK HPTEADLDVVDLLASLYMGSKEFSKALE IEHAD+VYCAGNE+PLNL TKAGICHVHLGN
Subjt:  DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGN

Query:  MEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNM-GILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASL
        +EKAE LFANLGR+TA+DHS+ +IEAADSLLSLKH+NLALKYYLMSEEVNAGG M GI+YLKIAQCYLST+ER +AIVFFYKVLQ LEDNINARLTLASL
Subjt:  MEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNM-GILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASL

Query:  LLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRE
        LLEEAR+EEAISLLSPPKDSN +SSSSSK KPWWLNEKVKLKLC IY+T+GMLENFVE IF LVRESLYIETL+EKIKVNKKKLPRRVLLERVKVLDGRE
Subjt:  LLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRE

Query:  TGNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGA
        TG+LFRGF+PVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGV+V+YDD+DDEPALCKALASLGRCSEALEIISLTLKLAFNSLS+ERKEELQLLGA
Subjt:  TGNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGA

Query:  QLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINL
        QLAFSSTDT HGFNFAKHVVKQYPYS SAWNCYYKV+S +T+RDSRHCKLLNS+QAKYKDCAPP+IIAGHQF  ISHHQ+AA+KYLEAYK++PDSPLINL
Subjt:  QLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINL

Query:  CVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNIKHQKSVYCDLRREA
        CVGA+LINLALG RLQNKHQCVAQGLAFLY NLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP++FGEN+NIKH+KSVYCDLRREA
Subjt:  CVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNIKHQKSVYCDLRREA

Query:  AYNLHLIYKESGALDLARQVLKDHCTF
        AYNLHLIYK+SGALDLARQVLKDHCTF
Subjt:  AYNLHLIYKESGALDLARQVLKDHCTF

SwissProt top hitse value%identityAlignment
O74458 Transcription factor tau subunit sfc43.3e-2622.57Show/hide
Query:  EDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQ-GQHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDD----VK
        +DI    ++E L+A+  G R+  K  K RGR   +    + +V ++L  A   +AQ G  ++A  +  ++V    ++  ++  LG  +   G+      K
Subjt:  EDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQ-GQHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDD----VK

Query:  AMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDIN----LLFHRASLYLERGDCEKAAETYDQIHQQCLGNVEALMTGAKLYQK
         +  +M AAHL PKD  LW      S      DQA YC ++A+ A+P + +     +++R+ L  E G  +KAAE +  + Q    N   L   A++Y K
Subjt:  AMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDIN----LLFHRASLYLERGDCEKAAETYDQIHQQCLGNVEALMTGAKLYQK

Query:  CGHLERAIC----ILEDYIKEHPTEA------DLDVVDLLASLYMGSKEFSKALERI----------------------------EHADEVYCAGNE---
          H  R I     I   Y  ++P         DL  ++L A L +   ++S  +  I                            E   E   A  E   
Subjt:  CGHLERAIC----ILEDYIKEHPTEA------DLDVVDLLASLYMGSKEFSKALERI----------------------------EHADEVYCAGNE---

Query:  -----LPLNLTTKAGICHVHLGNMEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLM------SEEVNAGGNMGILYLKI------AQC
             LP    TK GI  +  G + +AE  F+ +     +    ++ + A + + ++  +LAL+Y+++      ++ +    NMG+ YL++       QC
Subjt:  -----LPLNLTTKAGICHVHLGNMEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLM------SEEVNAGGNMGILYLKI------AQC

Query:  ----YLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVIFP
             +  +  T A++   ++   L+DN +A L + + + E+ R+   I+ L   +  N     +   + +  N+KV       ++ R  +    E    
Subjt:  ----YLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVIFP

Query:  LVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRE-----TGNLFRGFKPVAPKSDLSKASRAKR-LLQKRER---IKEEKKA---RALAAGVDVNY
          R+    +T + + + +K  + R+ L +   V +           L   F  +       K +RA+  LL +R R   + ++  +   R   +     Y
Subjt:  LVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRE-----TGNLFRGFKPVAPKSDLSKASRAKR-LLQKRER---IKEEKKA---RALAAGVDVNY

Query:  DDLDDEPAL--------------------CKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAW
         DLD +  L                       L  +G   +A ++++  +          +++ L+      +  + D        + V   + +    +
Subjt:  DDLDDEPAL--------------------CKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAW

Query:  NCYYKVAS-----SLTNRDSRHCKLL------------NSM-------------QAKYKDCAPPYIIA--GHQFTTISHHQDAARKYLEAYKIMPDSPLI
          +  V S     S    DS + K L            NS+              A       P ++   GH          A   Y  A+ I PD P+ 
Subjt:  NCYYKVAS-----SLTNRDSRHCKLL------------NSM-------------QAKYKDCAPPYIIA--GHQFTTISHHQDAARKYLEAYKIMPDSPLI

Query:  NLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDN-----SQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNIKHQK-SV
        NL +G + ++ A+     N+H  + QG  FLY+   L  N      QEALYN+ +AYH IGL   AV YYE VL         P   G+         S 
Subjt:  NLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDN-----SQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNIKHQK-SV

Query:  YCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
          D   EAAYNL LIY  SG + LA Q+   +  F
Subjt:  YCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

P33339 Transcription factor tau 131 kDa subunit6.9e-1619.75Show/hide
Query:  EEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMN
        ++E++ +  E E  D G+ + E+E+         ++     E  D  +    +    EY   +E+    LA             ED   A    I EA N
Subjt:  EEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMN

Query:  YGSRRKLKEPK-KRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWK
        +  ++K K  K K   R+  ++ L+ +V +LL  A   + +   + A  +  +V+ +      +Y TLG +Y   G        + LAAHL   D   WK
Subjt:  YGSRRKLKEPK-KRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWK

Query:  LLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQ-------------CLGNVEALMTGAKLYQKC--GHLERAIC
        ++   S D   + QA YC S+ I   P +   ++ R+ LY + G   +A + + +++                  + + +    +LY K    ++ER   
Subjt:  LLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQ-------------CLGNVEALMTGAKLYQKC--GHLERAIC

Query:  IL------------------EDYIKEHPTEADLD------------------------VVDLLASLYM--------GSKEFSKALERIEHA---------
        IL                  ED  ++ P E D D                         +++LA L++        G K   K    I+           
Subjt:  IL------------------EDYIKEHPTEADLD------------------------VVDLLASLYM--------GSKEFSKALERIEHA---------

Query:  ----------------DEVYCAGNE----LPLNLTTKAGICHVHLGNMEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYY---LMSEEVN
                        D +  A  E    +P+++  + G+  ++  N+ +A   F  L  ET +D ++L  EAA +L   + Y  A+ ++   L  EE  
Subjt:  ----------------DEVYCAGNE----LPLNLTTKAGICHVHLGNMEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYY---LMSEEVN

Query:  AGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLL------------------LEEARDEEAISLLSPPKDSNPTSSSSSK--LK
               ++  +A+CY   +    A  F+   ++   D+++ R++LA +                   + + + +E +  +S  K SN TS  SSK  L+
Subjt:  AGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLL------------------LEEARDEEAISLLSPPKDSNPTSSSSSK--LK

Query:  PWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPR---RVLLERVKVLD----------GRETGNLFRGFKPVAPKSD--
                K K       R  +E    +   +V +   ++  +    +N+ K        + E V +             R+   + R  K    + D  
Subjt:  PWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPR---RVLLERVKVLD----------GRETGNLFRGFKPVAPKSD--

Query:  ---LSKASRAKR-----LLQKRERIKEEKKARALA-----------AGVDVNYDDLDDEPALCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQL
           LSK +         L+++R  +    + R L+           + V   Y  ++D  ++ +    +    +  E + + +K    ++ ++  +E   
Subjt:  ---LSKASRAKR-----LLQKRERIKEEKKARALA-----------AGVDVNYDDLDDEPALCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQL

Query:  LGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRD------------------SRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQ
           +LA +    ++ F F + V++ + YS+        + SS   +                   +    + N         + PY+   +     S   
Subjt:  LGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRD------------------SRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQ

Query:  DAARKYLEAYKIM-------PDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDN------SQEALYNIARAYHHIGLVTLAVTYYEKV
          +R +L A + +       PD P++NL +G S I+ A+      +H  +  GL +LY+  K+  +       QEA YN+ RA+H IGLV++A+ YY +V
Subjt:  DAARKYLEAYKIM-------PDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDN------SQEALYNIARAYHHIGLVTLAVTYYEKV

Query:  LATYQKDCPIPELFGENQNIKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDH
        L  Y                         L++ AAYN  +IY++SG ++LA  +++ +
Subjt:  LATYQKDCPIPELFGENQNIKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDH

Q61371 Intraflagellar transport protein 88 homolog4.6e-0423.83Show/hide
Query:  ERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILE--AMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQ
        +R    AL  K     ANG  E+AA+  +      +S  E L    + Y    +L E      +  +   L      L   A +        +AI  L Q
Subjt:  ERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILE--AMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQ

Query:  VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY
        ++   P    +   LG +Y++ GD  +A  +Y  +    P +  + + L ++ ID    ++A     +A   +P  +      AS +   G+ +KA +TY
Subjt:  VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY

Query:  DQIHQQCLGNVEAL
         +IH++   NVE L
Subjt:  DQIHQQCLGNVEAL

Q9Y5Q9 General transcription factor 3C polypeptide 31.5e-5824.35Show/hide
Query:  EVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQSERAAKRGRVEDI
        E  E   E R+  E++  +E+G+   E+  +D        I   K+ +   D  EG + S    K   ++    L E +       + E   +     + 
Subjt:  EVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQSERAAKRGRVEDI

Query:  PGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLA
        P A    +LE +        +E KK  + K  + KL R +  L+G+A + +A+G+ E+AI +  +++ QAP   + + TL ++Y   GD  K++ F ++A
Subjt:  PGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLA

Query:  AHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGN-----VEALMTGAKLYQKCGHLERA
        AHL P D+  W  L   S+++ +I QA +C +KA+K EP ++  L+ R+SLY + GD + A + Y +I      +     ++     AK Y +   +  A
Subjt:  AHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGN-----VEALMTGAKLYQKCGHLERA

Query:  ICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERI---------------------EHADEVYCA-GNELPLNLTTKAGICHVHLGNMEKAECLF
        I I+++   +H     ++ V++ A LY+ +K++ KALE I                     +  + V C   + +P+++T K  +C VHL  +E    L 
Subjt:  ICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERI---------------------EHADEVYCA-GNELPLNLTTKAGICHVHLGNMEKAECLF

Query:  ANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEE
          L  +   D  +L ++ A++ L +  YN AL   L +   +   N+ +++L+ A+C  +     +A   + KV+     +++AR++L++L  +  + E+
Subjt:  ANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEE

Query:  AISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK
        A+  L P  D +  +  ++  +      K+ L    +  ++G +  +V+ +  ++   L                  +V + R +V        L    K
Subjt:  AISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK

Query:  PVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALCKA---LASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLAFSS
               L K SR K +    ++      A+A+ A +       D    L KA   L  L R  EA  ++  +L+        ++++EL+  G   A   
Subjt:  PVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALCKA---LASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLAFSS

Query:  TDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGASL
         +    +N+ + +V +       WN + +V  ++ ++D RH +    +  K  +     ++ GH        + A  +Y++A++  PD PL + C+G + 
Subjt:  TDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGASL

Query:  INLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNIKHQKSVYCDLRREAAYNLHL
        I++A    +  +H  + QG +FL + L L    QE+ YN+ R  H +GL+ LA+ YY+K L        +P L  E   +        DLRR+ AYNL L
Subjt:  INLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNIKHQKSVYCDLRREAAYNLHL

Query:  IYKESGALDLARQVLKDHCT
        IY+ SG   +A+ +L  +C+
Subjt:  IYKESGALDLARQVLKDHCT

Arabidopsis top hitse value%identityAlignment
AT1G17680.1 tetratricopeptide repeat (TPR)-containing protein2.0e-18843.06Show/hide
Query:  EGSKISDNEEVPGGV---MRVLGAEKEVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA
        EG+ IS+ EE P  +    +VLG +    +  + + EE   ++++++ ++  +DEG++  EE+D     F+AG  P                FER EYEA
Subjt:  EGSKISDNEEVPGGV---MRVLGAEKEVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA

Query:  LAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLP
        LAE+KRKALA+ Q   +        + G      +E M+ G RRK ++ KK+GRR GSKK++  D+ K   +A   +A G+  +A+ IL +V+ QAP   
Subjt:  LAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLP

Query:  DSYHTLGLVYNAIG-DDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL
         +Y+ L  V   +G  +  +     +AA++    S  WKLL+    ++ +I  A    SKAI+A+PDDI L +  A + L  G   +AAET++QI ++C 
Subjt:  DSYHTLGLVYNAIG-DDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL

Query:  GNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLF
          +EAL  G + + K G  ERA  ILED+IK H +E   DV+DLLAS++M      +AL+ I    ++Y  G EL  +L  +  ICHVHL  ME+AE + 
Subjt:  GNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLF

Query:  ANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYL--MSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARD
        + L +E  ++H  L+   AD L ++ +++ ALKYY+  +SE VN     G L++KIA+CY+S +ER QAIVF+YK L  L D ++ R+TLASLLLE+ + 
Subjt:  ANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYL--MSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARD

Query:  EEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRG
        +EA+ +LSPP++ +P    ++KLK WW N K+++ LCQIY + GMLE+F      LV + ++  T++ K K       R VL E  +    R   +    
Subjt:  EEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRG

Query:  FKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGV--DVNYDDLDDEP------ALCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLG
             PK    K  + +  L +  RI+E    +A    V  +   + + DE        LCKALASL R  EALEI++L  +L    L +E K+ELQ LG
Subjt:  FKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGV--DVNYDDLDDEP------ALCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLG

Query:  AQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLIN
        A+++  + D    F+  + V++Q+PY ++AWNCYY V S L  R S   K ++ +++KY+DC PP +IAGH FT  S HQDAAR+YLEAYK+MP+SPLIN
Subjt:  AQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLIN

Query:  LCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNI-KHQKSVYCDLRR
        LCVGA+LINLALGFRL+N+H+C+AQG AFLY NL++C NSQEALYN+ARAY H+GLVTLA +YYEKVLA Y+KD  +P+L  E+  + + +K V CDLR+
Subjt:  LCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNI-KHQKSVYCDLRR

Query:  EAAYNLHLIYKESGALDLARQVLKDHCTF
        EAA+NLHLIYK SGA DLARQVLKDHCTF
Subjt:  EAAYNLHLIYKESGALDLARQVLKDHCTF

AT1G17680.2 tetratricopeptide repeat (TPR)-containing protein2.0e-18843.06Show/hide
Query:  EGSKISDNEEVPGGV---MRVLGAEKEVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA
        EG+ IS+ EE P  +    +VLG +    +  + + EE   ++++++ ++  +DEG++  EE+D     F+AG  P                FER EYEA
Subjt:  EGSKISDNEEVPGGV---MRVLGAEKEVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA

Query:  LAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLP
        LAE+KRKALA+ Q   +        + G      +E M+ G RRK ++ KK+GRR GSKK++  D+ K   +A   +A G+  +A+ IL +V+ QAP   
Subjt:  LAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLP

Query:  DSYHTLGLVYNAIG-DDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL
         +Y+ L  V   +G  +  +     +AA++    S  WKLL+    ++ +I  A    SKAI+A+PDDI L +  A + L  G   +AAET++QI ++C 
Subjt:  DSYHTLGLVYNAIG-DDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL

Query:  GNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLF
          +EAL  G + + K G  ERA  ILED+IK H +E   DV+DLLAS++M      +AL+ I    ++Y  G EL  +L  +  ICHVHL  ME+AE + 
Subjt:  GNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLF

Query:  ANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYL--MSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARD
        + L +E  ++H  L+   AD L ++ +++ ALKYY+  +SE VN     G L++KIA+CY+S +ER QAIVF+YK L  L D ++ R+TLASLLLE+ + 
Subjt:  ANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYL--MSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARD

Query:  EEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRG
        +EA+ +LSPP++ +P    ++KLK WW N K+++ LCQIY + GMLE+F      LV + ++  T++ K K       R VL E  +    R   +    
Subjt:  EEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRG

Query:  FKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGV--DVNYDDLDDEP------ALCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLG
             PK    K  + +  L +  RI+E    +A    V  +   + + DE        LCKALASL R  EALEI++L  +L    L +E K+ELQ LG
Subjt:  FKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGV--DVNYDDLDDEP------ALCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLG

Query:  AQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLIN
        A+++  + D    F+  + V++Q+PY ++AWNCYY V S L  R S   K ++ +++KY+DC PP +IAGH FT  S HQDAAR+YLEAYK+MP+SPLIN
Subjt:  AQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLIN

Query:  LCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNI-KHQKSVYCDLRR
        LCVGA+LINLALGFRL+N+H+C+AQG AFLY NL++C NSQEALYN+ARAY H+GLVTLA +YYEKVLA Y+KD  +P+L  E+  + + +K V CDLR+
Subjt:  LCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNI-KHQKSVYCDLRR

Query:  EAAYNLHLIYKESGALDLARQVLKDHCTF
        EAA+NLHLIYK SGA DLARQVLKDHCTF
Subjt:  EAAYNLHLIYKESGALDLARQVLKDHCTF

AT3G04240.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-0523.53Show/hide
Query:  ATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLF
        A L    G   +A+   ++ V   P  PD+Y  LG VY A+G   +A+  Y  A  + P  +  +  + S   ++G +D A     +A+  +P  +    
Subjt:  ATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLF

Query:  HRASLYLERGDCEKAAETYDQI------HQQCLGNV
        +  +   + G  ++A   Y+Q       H Q + N+
Subjt:  HRASLYLERGDCEKAAETYDQI------HQQCLGNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAAGGAAGGGAGTAAAATTTCTGACAATGAAGAGGTTCCTGGTGGTGTTATGCGTGTTTTAGGAGCAGAAAAAGAGGTTGTAGAAACAGGAGTGGAGGCTAGAGA
GGAGGAGGAAGAGGAAGAGGAAGAGGAAGAAGGAGAGGAAGAAGTGGAGGATGAAGGAGAGGATGATATTGAAGAAGAAGATGGTTACATATTCAAATTTAAGGCCGGAG
AAAATCCATTTGATTTTGTTGAAGGGACAGATTTTAGCATCCAACCATATAAAAAATTTGAGCGCCTTGAATATGAAGCTCTTGCTGAGAAAAAGAGAAAAGCTCTTGCA
AATGGTCAGAGTGAGAGAGCTGCAAAGAGGGGCAGGGTAGAAGATATTCCTGGTGCAAGCTTTGATGAAATTTTGGAAGCTATGAATTATGGATCTAGGAGGAAGTTAAA
AGAGCCTAAAAAAAGAGGTAGACGGAAAGGATCAAAGAAAAAACTTAATCGTGATGTTACGAAGTTGCTTGGTGATGCAACTTTATGTTATGCTCAAGGCCAGCATGAGA
AGGCTATATCTATATTGCGTCAAGTTGTTCTGCAAGCCCCAGACTTACCTGATTCGTACCATACGCTTGGACTTGTATACAATGCAATTGGTGATGATGTAAAAGCCATG
GGATTCTACATGCTTGCTGCACATTTAATGCCAAAAGATTCATCTCTCTGGAAACTGCTATTTTCATGGTCAATTGATCGAGGTGATATTGATCAAGCAAGCTACTGTCT
TTCTAAAGCAATAAAAGCAGAGCCTGATGACATTAATTTATTATTTCATCGTGCGTCACTCTACCTTGAGCGTGGAGATTGTGAAAAAGCAGCTGAAACATATGATCAGA
TTCATCAACAATGCCTTGGAAATGTTGAAGCACTCATGACAGGAGCAAAGCTGTACCAAAAATGCGGTCATCTTGAACGTGCAATTTGCATTCTTGAGGACTACATCAAA
GAGCATCCAACTGAAGCTGATTTAGATGTGGTTGATCTTTTAGCTTCTTTATACATGGGAAGCAAAGAATTCAGCAAAGCTCTTGAGCGCATCGAGCATGCAGATGAGGT
TTACTGTGCAGGAAATGAGCTACCTTTAAACTTGACAACTAAAGCAGGAATTTGCCACGTTCACCTTGGAAATATGGAGAAAGCAGAGTGCCTCTTTGCTAATTTGGGAC
GGGAAACTGCCAATGATCACTCAAATTTGATGATTGAAGCTGCAGACTCGTTGCTGAGTCTTAAGCACTATAACTTGGCATTGAAGTATTATCTAATGTCCGAAGAAGTA
AATGCTGGAGGGAACATGGGGATTTTATACCTAAAAATTGCCCAGTGTTACTTATCAACCGATGAAAGAACACAGGCAATTGTTTTCTTTTATAAAGTGCTTCAACATCT
TGAAGATAACATTAATGCTCGATTAACTTTGGCCTCCCTCCTCCTTGAGGAAGCTAGAGATGAAGAAGCCATTTCATTACTATCTCCTCCAAAAGATTCAAACCCAACTA
GCTCATCTTCCAGCAAATTAAAACCTTGGTGGCTCAATGAGAAAGTAAAACTGAAGCTTTGCCAAATATACAAAACTAGAGGAATGCTTGAGAACTTCGTTGAGGTGATC
TTTCCTTTGGTCCGAGAGTCCTTATATATTGAGACTCTTCAAGAAAAGATTAAAGTGAACAAGAAGAAGCTTCCAAGGAGGGTTTTGCTTGAAAGAGTCAAAGTATTAGA
TGGACGTGAGACTGGTAACCTATTTCGTGGATTCAAACCTGTGGCTCCTAAATCAGATTTATCAAAAGCGTCCAGAGCAAAGAGATTGCTTCAAAAGAGGGAAAGAATCA
AGGAAGAAAAGAAGGCTAGAGCGCTGGCTGCGGGAGTCGATGTGAACTATGATGATTTAGATGATGAGCCAGCGTTGTGCAAGGCATTGGCTTCCTTGGGAAGATGTTCT
GAAGCTTTAGAGATTATAAGTCTAACTTTAAAGTTGGCTTTTAACTCGTTATCCATAGAAAGGAAGGAAGAACTCCAGTTACTTGGAGCTCAATTAGCATTCAGCTCAAC
TGATACCATGCATGGTTTCAACTTTGCAAAGCACGTTGTTAAGCAGTACCCTTATAGCATCTCTGCTTGGAACTGCTATTATAAAGTAGCTTCAAGTTTGACGAACCGGG
ATTCAAGGCATTGCAAGCTTCTGAATAGCATGCAAGCCAAATACAAAGATTGTGCACCACCCTATATCATTGCCGGGCATCAATTTACCACCATTAGCCATCATCAAGAT
GCTGCAAGGAAATATCTTGAAGCTTACAAAATCATGCCGGATAGTCCCCTGATTAACTTGTGTGTTGGAGCATCCTTAATCAACTTGGCTCTTGGATTCCGTCTTCAAAA
TAAGCATCAGTGTGTTGCACAAGGCTTGGCATTCCTCTACAAAAATTTGAAGCTTTGTGATAACAGCCAGGAAGCCTTGTACAACATAGCTCGAGCATATCATCACATCG
GACTCGTGACACTGGCGGTTACATATTACGAAAAGGTGCTTGCAACTTACCAGAAGGATTGCCCCATTCCAGAACTTTTTGGTGAGAATCAAAACATTAAGCATCAGAAA
TCTGTCTATTGTGATCTACGCAGAGAAGCAGCTTACAATTTGCATCTGATATATAAAGAGAGTGGAGCTCTTGATCTTGCCAGGCAAGTCCTAAAAGATCATTGCACATT
TTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAAGGAAGGGAGTAAAATTTCTGACAATGAAGAGGTTCCTGGTGGTGTTATGCGTGTTTTAGGAGCAGAAAAAGAGGTTGTAGAAACAGGAGTGGAGGCTAGAGA
GGAGGAGGAAGAGGAAGAGGAAGAGGAAGAAGGAGAGGAAGAAGTGGAGGATGAAGGAGAGGATGATATTGAAGAAGAAGATGGTTACATATTCAAATTTAAGGCCGGAG
AAAATCCATTTGATTTTGTTGAAGGGACAGATTTTAGCATCCAACCATATAAAAAATTTGAGCGCCTTGAATATGAAGCTCTTGCTGAGAAAAAGAGAAAAGCTCTTGCA
AATGGTCAGAGTGAGAGAGCTGCAAAGAGGGGCAGGGTAGAAGATATTCCTGGTGCAAGCTTTGATGAAATTTTGGAAGCTATGAATTATGGATCTAGGAGGAAGTTAAA
AGAGCCTAAAAAAAGAGGTAGACGGAAAGGATCAAAGAAAAAACTTAATCGTGATGTTACGAAGTTGCTTGGTGATGCAACTTTATGTTATGCTCAAGGCCAGCATGAGA
AGGCTATATCTATATTGCGTCAAGTTGTTCTGCAAGCCCCAGACTTACCTGATTCGTACCATACGCTTGGACTTGTATACAATGCAATTGGTGATGATGTAAAAGCCATG
GGATTCTACATGCTTGCTGCACATTTAATGCCAAAAGATTCATCTCTCTGGAAACTGCTATTTTCATGGTCAATTGATCGAGGTGATATTGATCAAGCAAGCTACTGTCT
TTCTAAAGCAATAAAAGCAGAGCCTGATGACATTAATTTATTATTTCATCGTGCGTCACTCTACCTTGAGCGTGGAGATTGTGAAAAAGCAGCTGAAACATATGATCAGA
TTCATCAACAATGCCTTGGAAATGTTGAAGCACTCATGACAGGAGCAAAGCTGTACCAAAAATGCGGTCATCTTGAACGTGCAATTTGCATTCTTGAGGACTACATCAAA
GAGCATCCAACTGAAGCTGATTTAGATGTGGTTGATCTTTTAGCTTCTTTATACATGGGAAGCAAAGAATTCAGCAAAGCTCTTGAGCGCATCGAGCATGCAGATGAGGT
TTACTGTGCAGGAAATGAGCTACCTTTAAACTTGACAACTAAAGCAGGAATTTGCCACGTTCACCTTGGAAATATGGAGAAAGCAGAGTGCCTCTTTGCTAATTTGGGAC
GGGAAACTGCCAATGATCACTCAAATTTGATGATTGAAGCTGCAGACTCGTTGCTGAGTCTTAAGCACTATAACTTGGCATTGAAGTATTATCTAATGTCCGAAGAAGTA
AATGCTGGAGGGAACATGGGGATTTTATACCTAAAAATTGCCCAGTGTTACTTATCAACCGATGAAAGAACACAGGCAATTGTTTTCTTTTATAAAGTGCTTCAACATCT
TGAAGATAACATTAATGCTCGATTAACTTTGGCCTCCCTCCTCCTTGAGGAAGCTAGAGATGAAGAAGCCATTTCATTACTATCTCCTCCAAAAGATTCAAACCCAACTA
GCTCATCTTCCAGCAAATTAAAACCTTGGTGGCTCAATGAGAAAGTAAAACTGAAGCTTTGCCAAATATACAAAACTAGAGGAATGCTTGAGAACTTCGTTGAGGTGATC
TTTCCTTTGGTCCGAGAGTCCTTATATATTGAGACTCTTCAAGAAAAGATTAAAGTGAACAAGAAGAAGCTTCCAAGGAGGGTTTTGCTTGAAAGAGTCAAAGTATTAGA
TGGACGTGAGACTGGTAACCTATTTCGTGGATTCAAACCTGTGGCTCCTAAATCAGATTTATCAAAAGCGTCCAGAGCAAAGAGATTGCTTCAAAAGAGGGAAAGAATCA
AGGAAGAAAAGAAGGCTAGAGCGCTGGCTGCGGGAGTCGATGTGAACTATGATGATTTAGATGATGAGCCAGCGTTGTGCAAGGCATTGGCTTCCTTGGGAAGATGTTCT
GAAGCTTTAGAGATTATAAGTCTAACTTTAAAGTTGGCTTTTAACTCGTTATCCATAGAAAGGAAGGAAGAACTCCAGTTACTTGGAGCTCAATTAGCATTCAGCTCAAC
TGATACCATGCATGGTTTCAACTTTGCAAAGCACGTTGTTAAGCAGTACCCTTATAGCATCTCTGCTTGGAACTGCTATTATAAAGTAGCTTCAAGTTTGACGAACCGGG
ATTCAAGGCATTGCAAGCTTCTGAATAGCATGCAAGCCAAATACAAAGATTGTGCACCACCCTATATCATTGCCGGGCATCAATTTACCACCATTAGCCATCATCAAGAT
GCTGCAAGGAAATATCTTGAAGCTTACAAAATCATGCCGGATAGTCCCCTGATTAACTTGTGTGTTGGAGCATCCTTAATCAACTTGGCTCTTGGATTCCGTCTTCAAAA
TAAGCATCAGTGTGTTGCACAAGGCTTGGCATTCCTCTACAAAAATTTGAAGCTTTGTGATAACAGCCAGGAAGCCTTGTACAACATAGCTCGAGCATATCATCACATCG
GACTCGTGACACTGGCGGTTACATATTACGAAAAGGTGCTTGCAACTTACCAGAAGGATTGCCCCATTCCAGAACTTTTTGGTGAGAATCAAAACATTAAGCATCAGAAA
TCTGTCTATTGTGATCTACGCAGAGAAGCAGCTTACAATTTGCATCTGATATATAAAGAGAGTGGAGCTCTTGATCTTGCCAGGCAAGTCCTAAAAGATCATTGCACATT
TTAA
Protein sequenceShow/hide protein sequence
MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALA
NGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAM
GFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIK
EHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEV
NAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEVI
FPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALCKALASLGRCS
EALEIISLTLKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQD
AARKYLEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNIKHQK
SVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF