; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg023637 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg023637
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptiongeneral transcription factor 3C polypeptide 3
Genome locationscaffold13:1386807..1399047
RNA-Seq ExpressionSpg023637
SyntenySpg023637
Gene Ontology termsGO:0006383 - transcription by RNA polymerase III (biological process)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily
IPR019734 - Tetratricopeptide repeat
IPR039340 - Transcription factor Tfc4/TFIIIC-102/Sfc4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606714.1 General transcription factor 3C polypeptide 3, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0089.99Show/hide
Query:  MEEEGNKISGNEEVPGCAV-GNLGEEEDREDKEEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAE
        MEEEGN IS NEEVPGC V G    E + ED+EEEEEEEEEEEE+EEEVEDEGEDD EEEDGY FKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAE
Subjt:  MEEEGNKISGNEEVPGCAV-GNLGEEEDREDKEEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAE

Query:  KKRKALANGQSERSAKRGRVEDISGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDSY
        KKRKALA+GQSERSAKRGRVEDISGASF+EIMEAMNYGSRRKR++PKKRGRRKGSK+KLN DVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPD+PDSY
Subjt:  KKRKALANGQSERSAKRGRVEDISGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDSY

Query:  HTLGLVYHAIGDDIKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGNVE
        HTLGLVY+AIGDD+KAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLF+RASLYLERGDCQKAAETYDQIHQ  + NVE
Subjt:  HTLGLVYHAIGDDIKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGNVE

Query:  ALMTGAKLYQKCGHLERGICILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFANLE
        ALMTGAKLYQKCGHLER ICILE+YIKGHP EADLDVVDLLAS YMGSKEFSKALE IEHAD VYCA  ELPLNLTAKAGICH+HLGN EKAECLFANL 
Subjt:  ALMTGAKLYQKCGHLERGICILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFANLE

Query:  REAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAISL
        REA  + SNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGN+GILYLKIAQCYSSTN+RA+AIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAISL
Subjt:  REAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAISL

Query:  LSPPKDSNSTSSSSCKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVAP
        LSPPKDSNSTSSSS K KPWWLNE+VKLKLCHI+RTKGMLENFVEAIF LVRESLYIETL EKIKVNKKKLPKRVLLERVKVLDGR+TG LFRGFRPVAP
Subjt:  LSPPKDSNSTSSSSCKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVAP

Query:  KSDLSKASRAKRLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLST
        KSDLSKASRAK+LLQKRERI+EEKKA+ALAAGV +NYDD DDEP LR  RESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFN+LS 
Subjt:  KSDLSKASRAKRLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLST

Query:  ERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSS--------------SMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYLEAY
        ERKEELQLLGAQLAFSSTDTKHGFNFAKHVVK YPYS SAWNCYYKVSS              SMQ KYKDCAPPY+IAGHQFTAISHHQDAARKYLEAY
Subjt:  ERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSS--------------SMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYLEAY

Query:  KLLPDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRN-VKH
        KLLPDSPLINLCVG+ALINLTLGFRLQNKHQC+AQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN++  KH
Subjt:  KLLPDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRN-VKH

Query:  QKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF
        Q  VYCDLRREAAYNLHLIYKESGA+DLARQVLKD+CTF
Subjt:  QKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF

XP_022948712.1 general transcription factor 3C polypeptide 3 [Cucurbita moschata]0.0e+0090.1Show/hide
Query:  MEEEGNKISGNEEVPGCAV-GNLGEEEDREDKEEEEEEEEEEEEE-EEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALA
        MEEEGN IS NEEVPGC V G    E + ED+EEEEEEEEEEEEE EEEVEDEGEDD EEEDGY FKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALA
Subjt:  MEEEGNKISGNEEVPGCAV-GNLGEEEDREDKEEEEEEEEEEEEE-EEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALA

Query:  EKKRKALANGQSERSAKRGRVEDISGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDS
        EKKRKALA+GQSERSAKRGRVEDISGASF+EIMEAMNYGSRRKR++PKKRGRRKGSK+KLN DVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPD+PDS
Subjt:  EKKRKALANGQSERSAKRGRVEDISGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDS

Query:  YHTLGLVYHAIGDDIKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGNV
        YHTLGLVY+AIGDD+KAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLF+RASLYLERGDCQKAAETYDQIHQ  + NV
Subjt:  YHTLGLVYHAIGDDIKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGNV

Query:  EALMTGAKLYQKCGHLERGICILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFANL
        EALMTGAKLYQKCGHLER ICILE+YIKGHP EADLDVVDLLAS YMGSKEFSKALE IEHAD VYCA  ELPLNLTAKAGICH+HLGN EKAECLFANL
Subjt:  EALMTGAKLYQKCGHLERGICILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFANL

Query:  EREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAIS
         REA  + SNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGN+GILYLKIAQCYSSTN+RA+AIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAIS
Subjt:  EREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAIS

Query:  LLSPPKDSNSTSSSSCKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVA
        LLSPPKDSNSTSSSS K KPWWLNE+VKLKLCHI+RTKGMLENFVEAIF LVRESLYIETL EKIKVNKKKLPKRVLLERVKVLDGR+TG LFRGFRPVA
Subjt:  LLSPPKDSNSTSSSSCKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVA

Query:  PKSDLSKASRAKRLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLS
        PKSDLSKASRAK+LLQKRERI+EEKKA+ALAAGV +NYDD DDEP LR  RESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFN+LS
Subjt:  PKSDLSKASRAKRLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLS

Query:  TERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSS--------------SMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYLEA
         ERKEELQLLGAQLAFSSTDTKHGFNFAKHVVK YPYS SAWNCYYKVSS              SMQ KYKDCAPPY+IAGHQFTAISHHQDAARKYLEA
Subjt:  TERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSS--------------SMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYLEA

Query:  YKLLPDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNVKH
        YKLLPDSPLINLCVG+ALINLTLGFRLQNKHQC+AQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN++ KH
Subjt:  YKLLPDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNVKH

Query:  QKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF
        Q  VYCDLRREAAYNLHLIYKESGA+DLARQVLKD+CTF
Subjt:  QKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF

XP_022997988.1 general transcription factor 3C polypeptide 3 [Cucurbita maxima]0.0e+0090.01Show/hide
Query:  MEEEGNKISGNEEVPGCAVGNL----GEEEDREDKEEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA
        MEEEGN IS NEEVPGC V        E EDRE+ EEEEEEEEEEEE+EEEVEDEGEDD EEEDGY FKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA
Subjt:  MEEEGNKISGNEEVPGCAVGNL----GEEEDREDKEEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA

Query:  LAEKKRKALANGQSERSAKRGRVEDISGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLP
        LAEKKRKALANGQSERSAKRGRVEDISGASF+EIMEAMNYGSRRKR++PKKRGRRKGSK+KLN DVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPD+P
Subjt:  LAEKKRKALANGQSERSAKRGRVEDISGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLP

Query:  DSYHTLGLVYHAIGDDIKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLG
        DSYHTLGLVY+AIGDD+KAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLF+RASLYLERGDCQKAAETYDQIHQ  + 
Subjt:  DSYHTLGLVYHAIGDDIKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLG

Query:  NVEALMTGAKLYQKCGHLERGICILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFA
        NVEALMTGAKLYQKCGHLER ICILE+YIKGHP EADLDVVDLLAS YMGSKEFSKALE IEHAD VYCA  ELPLNLTAKAGICH+HLGN EKAECLFA
Subjt:  NVEALMTGAKLYQKCGHLERGICILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFA

Query:  NLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEA
        NL REA  + SNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGN+GILYLKIAQCYSSTN+RA+AIVFFYKVLQHLEDNINARLTLASLLLEEAREEEA
Subjt:  NLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEA

Query:  ISLLSPPKDSNSTSSSSCKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRP
        ISLLSPPKDSNSTSSSS K KPWWLNE+VKLKLCHI+RTKGMLENFVEAIF LVRESLYIETL EKIKVNKKKLPKRVLLERVKVLDGR+TG LFRGFRP
Subjt:  ISLLSPPKDSNSTSSSSCKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRP

Query:  VAPKSDLSKASRAKRLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNT
        VAPKSDLSKASRAK+LLQKRERI+EEKKA+ALAAGV +NYDD DDEP LR  RESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFN+
Subjt:  VAPKSDLSKASRAKRLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNT

Query:  LSTERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSS--------------SMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYL
        LS ERKEELQLLGAQLAFSSTDTKHGFNFAKHVVK YPYS SAWNCYYKVSS              SMQ KYKDCAPPY+IAGHQFTAISHHQDAARKYL
Subjt:  LSTERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSS--------------SMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYL

Query:  EAYKLLPDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNV
        EAYKLLPDSPLINLCVG+ALINLTLGFRLQNKHQC+AQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN++ 
Subjt:  EAYKLLPDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNV

Query:  KHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF
        KHQ  VYCDLRREAAYNLHLIYKESGA+DLARQVLKD+CTF
Subjt:  KHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF

XP_023525239.1 general transcription factor 3C polypeptide 3 [Cucurbita pepo subsp. pepo]0.0e+0089.75Show/hide
Query:  MEEEGNKISGNEEVPGCAVGNLGEEEDREDKEEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEK
        MEEEGN IS NEEVPGC V      E   +  EEEEEEEEEEE+EEEVEDEGEDD EEEDGY FKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEK
Subjt:  MEEEGNKISGNEEVPGCAVGNLGEEEDREDKEEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEK

Query:  KRKALANGQSERSAKRGRVEDISGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDSYH
        KRKALANGQSERSAKRGRVEDISGASF+EIMEAMNYGSRRKR++PKKRGRRKGSK+KLN DVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPD+PDSYH
Subjt:  KRKALANGQSERSAKRGRVEDISGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDSYH

Query:  TLGLVYHAIGDDIKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGNVEA
        TLGLVY+AIGDD+KAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLF+RASLYLERGDCQKAAETYDQIHQ  + NVEA
Subjt:  TLGLVYHAIGDDIKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGNVEA

Query:  LMTGAKLYQKCGHLERGICILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFANLER
        LMTGAKLYQ+CGHLER ICILE+YIKGHP EADLDVVDLLAS YMGSKEFSKALE IEHAD VYCA  ELPLNLTAKAGICH+HLGN EKAECLFANL R
Subjt:  LMTGAKLYQKCGHLERGICILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFANLER

Query:  EAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAISLL
        EA  + SNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGN+GILYLKIAQCYSSTN+RA+AIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAISLL
Subjt:  EAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAISLL

Query:  SPPKDSNSTSSSSCKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVAPK
        SPPKDSNSTSSSS K KPWWLNE+VKLKLCHI+RTKGMLENFV+AIF LVRESLYIETL EKIKVNKKKLPKRVLLERVKVLDGR+TG LFRGFRPVAPK
Subjt:  SPPKDSNSTSSSSCKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVAPK

Query:  SDLSKASRAKRLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLSTE
        SDLSKASRAK+LLQKRERIKEEKKA+ALAAGV +NYDD DDEP LR  RESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFN+LS E
Subjt:  SDLSKASRAKRLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLSTE

Query:  RKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSS--------------SMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYLEAYK
        RKEELQLLGAQLAFSSTDTKHGFNFAKHVVK YPYS SAWNCYYKVSS              SMQ KYKDCAPPY+IAGHQFTAISHHQDAARKYLEAYK
Subjt:  RKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSS--------------SMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYLEAYK

Query:  LLPDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNVKHQK
        LLPDSPLINLCVG+ALINLTLGFRLQNKHQC+AQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN++ KHQ 
Subjt:  LLPDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNVKHQK

Query:  LVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF
         VYCDLRREAAYNLHLIYKESGA+DLARQVLKD+CTF
Subjt:  LVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF

XP_038879313.1 general transcription factor 3C polypeptide 3 [Benincasa hispida]0.0e+0089.46Show/hide
Query:  MEEEGNKISGNEEVPG--CAVGNLGEEEDREDKEEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALA
        ME+EGN +S NEEVPG   +V   G+E      E+ EEEEEEEEE EEEVEDEGEDDIEEEDGY FKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALA
Subjt:  MEEEGNKISGNEEVPG--CAVGNLGEEEDREDKEEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALA

Query:  EKKRKALANGQSERSAKRGRVEDISGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDS
        EKKRKALANGQSER+AKRGRVEDI GASFDEI+EAMNYGSRRK KEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQ+EKA+S+L QVVLQAPDLPDS
Subjt:  EKKRKALANGQSERSAKRGRVEDISGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDS

Query:  YHTLGLVYHAIGDDIKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGNV
        YHTLGLVY+AIGDD+KAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDI+LLFHRASLYLERGDC+KAAETYDQIHQQCLGNV
Subjt:  YHTLGLVYHAIGDDIKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGNV

Query:  EALMTGAKLYQKCGHLERGICILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFANL
        EALMTGAKLYQKCGHLER ICILEDYIK HP EADLDVVDLLAS YMG+KEF KALERIEHADEVYCAG ELPL LT K GICH HLGN+EKAECLFANL
Subjt:  EALMTGAKLYQKCGHLERGICILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFANL

Query:  EREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAIS
          E A D+SNLMIEVADSLLSLKHYNLALKYYLM EEVNAGGNMGILYLKIAQCY STN+RAQAIVFFYKVLQHLEDNINARLTLASLLLEEAR+EEAIS
Subjt:  EREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAIS

Query:  LLSPPKDSNSTSSSSCKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVA
        LLSPPKDSN TSSSS KLKPWW NEKVKLKLCHIY+T+GMLE+FVE IF LVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRE+G+LFRGFRPVA
Subjt:  LLSPPKDSNSTSSSSCKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVA

Query:  PKSDLSKASRAKRLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLS
        PKSDLSKASRAKRLLQKRERIKEEKKAKALAAGV+VNYDDLDDEP LR HRESPLPNLLK+EEYHNLIVDLCKALASLGRCSEALEIISLTLKLA N+LS
Subjt:  PKSDLSKASRAKRLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLS

Query:  TERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSS--------------SMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYLEA
         ERKEELQLLGAQLAFSSTDT HGFNFAKHVVK YPYSISAWNCYYKV+S              SMQAKYKDCAPPY+IAGHQFT ISHHQDAARKYLEA
Subjt:  TERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSS--------------SMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYLEA

Query:  YKLLPDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNVKH
        YK++PDSPLINLCVGS+LINL LGFRLQNKHQC+AQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRN+KH
Subjt:  YKLLPDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNVKH

Query:  QKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF
        QK VYCDLRREAAYNLHLIYKESGA+DLARQVLKDHCTF
Subjt:  QKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF

TrEMBL top hitse value%identityAlignment
A0A1S3BHB9 general transcription factor 3C polypeptide 3 isoform X10.0e+0088.47Show/hide
Query:  MEEEGNKISGNEEVPGCAVGNLGEEEDREDKEEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEK
        ME+EGN+IS +EEVPG  +  LG E++ E    + EEEEEEEE EEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFS+QPYKKFERLEYEALAEK
Subjt:  MEEEGNKISGNEEVPGCAVGNLGEEEDREDKEEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEK

Query:  KRKALANGQSERSAKRGRVEDISGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDSYH
        KRKALANGQSER+AKRGRVED++GASFDEI+EAMNYGSRRK KEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQ+EKAIS+L QVVLQAPDLPDSYH
Subjt:  KRKALANGQSERSAKRGRVEDISGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDSYH

Query:  TLGLVYHAIGDDIKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGNVEA
        TLGLVY+AIGDD+KAMGFYMLAAHLMPKDSSLWKLLFSWSI+RGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDC+KAAETYDQIHQQCLGNVEA
Subjt:  TLGLVYHAIGDDIKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGNVEA

Query:  LMTGAKLYQKCGHLERGICILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFANLER
        LMTGAKLYQKCGHLER ICILEDYIK HP+EADLDVVDLLAS YMGSKEFSKALE IEHAD VYCAG ELPLNLTAKAGICH HLGN+EKAECLFANL R
Subjt:  LMTGAKLYQKCGHLERGICILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFANLER

Query:  EAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAISLL
        E   D+SNLMIEVADSLLSLKHY+ ALKYYLMSEEVNAG NMGILY K+A+CY STN++ QAIVFFYKVLQH+EDNINARLTLASLLLEEAR+EEAISLL
Subjt:  EAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAISLL

Query:  SPPKDSNSTSSSSCKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVAPK
        SPPKDSN TSSSS KLKPWWLNEKVKLKLCHIYRT+G+LENFVE IF LVRESLYIETLQEKIKVNKKKLP+RVLLERVKVLDGRETG+LFRGFRPVAPK
Subjt:  SPPKDSNSTSSSSCKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVAPK

Query:  SDLSKASRAKRLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLSTE
        SDL+KASRAKRLLQKR+RIKEEKKAK LAAGV V+YDDLDDEP LR HRESPLPNLLK+EE+H LIVDLCKALASLGRCSEALEIISLTLKLAFN+LSTE
Subjt:  SDLSKASRAKRLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLSTE

Query:  RKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSS--------------SMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYLEAYK
        RKEELQLLGAQLAFSST T HGFNFAKHVVK YPYSISAWNCYYKV+S              SMQAKYKDCAPPY+IAGHQFT ISHHQDAARKYLEAYK
Subjt:  RKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSS--------------SMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYLEAYK

Query:  LLPDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNVKHQK
        ++PDSPLINLCVGS+LINL LGFRLQNKHQC+AQGLAFLYKNLKLCDN+QEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRN+KHQ 
Subjt:  LLPDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNVKHQK

Query:  LVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF
         VYCDLRREAAYNLHLIYKESGA+DLARQVLKDHCTF
Subjt:  LVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF

A0A6J1DEF9 general transcription factor 3C polypeptide 3 isoform X20.0e+0087.58Show/hide
Query:  MEEEGNKISGNEEVPGCAVGNLG-------------EEEDREDKEEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYK
        ME+EGN+IS N+EVPGCAVG  G             EEE+ E++E+EEEEEEEEEEEEEEVEDEGEDDIEEEDGY FKFKAGENPFDFVEGTDFSIQPYK
Subjt:  MEEEGNKISGNEEVPGCAVGNLG-------------EEEDREDKEEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYK

Query:  KFERLEYEALAEKKRKALANGQSERSAKRGRVEDISGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQ
        KFERLEYEALAEKKRKALAN QSER  KRGR+EDI GASF+EIMEAMNYGSRRK KEPK+RGRRKGSKKK+NR++TKLLGDATLCYAQGQYEKAISVL Q
Subjt:  KFERLEYEALAEKKRKALANGQSERSAKRGRVEDISGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQ

Query:  VVLQAPDLPDSYHTLGLVYHAIGDDIKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETY
        VVLQAPDLPDSYHTLGLVY+AIGDD+KAMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETY
Subjt:  VVLQAPDLPDSYHTLGLVYHAIGDDIKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETY

Query:  DQIHQQCLGNVEALMTGAKLYQKCGHLERGICILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGN
        DQIHQ+C+GNVEALMTGAKLYQKCGH ER ICILEDYIKGHP EADLDVVDLLAS YMGSKEFSKALE IEHAD+VYCAG E+PLNL  KAGICHVHLGN
Subjt:  DQIHQQCLGNVEALMTGAKLYQKCGHLERGICILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGN

Query:  IEKAECLFANLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLL
        IEKAE LFANL R+ A D+S+ +IE ADSLLSLKH+NLALKYYLMSEEVNAGG MGI+YLKIAQCY STN+RA+AIVFFYKVLQ LEDNINARLTLASLL
Subjt:  IEKAECLFANLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLL

Query:  LEEAREEEAISLLSPPKDSNSTSSSSCKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRET
        LEEAREEEAISLLSPPKDSNS+SSSS K KPWWLNEKVKLKLC+IYRTKGMLENFVE IFSLVRESLYIETL+EKIKVNKKKLP+RVLLERVKVLDGRET
Subjt:  LEEAREEEAISLLSPPKDSNSTSSSSCKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRET

Query:  GSLFRGFRPVAPKSDLSKASRAKRLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIIS
        GSLFRGFRPVAPKSDLSKASRAKRLLQKRERIKEEKKA+ALAAGV V+YDD+DDEP LR HRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIIS
Subjt:  GSLFRGFRPVAPKSDLSKASRAKRLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIIS

Query:  LTLKLAFNTLSTERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSS--------------SMQAKYKDCAPPYLIAGHQFTAISH
        LTLKLAFN+LS ERKEELQLLGAQLAFSSTDTKHGFNFAKHVVK YPYS SAWNCYYKVSS              S+QAKYKDCAPP++IAGHQF AISH
Subjt:  LTLKLAFNTLSTERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSS--------------SMQAKYKDCAPPYLIAGHQFTAISH

Query:  HQDAARKYLEAYKLLPDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP
        HQ+AA+KYLEAYKLLPDSPLINLCVG+ALINL LG RLQNKHQC+AQGLAFLY NLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP
Subjt:  HQDAARKYLEAYKLLPDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP

Query:  ELFGENRNVKHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF
        ++FGENRN+KH+K VYCDLRREAAYNLHLIYK+SGA+DLARQVLKDHCTF
Subjt:  ELFGENRNVKHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF

A0A6J1DGK2 general transcription factor 3C polypeptide 3 isoform X10.0e+0087.49Show/hide
Query:  MEEEGNKISGNEEVPGCAVGNLG-------------EEEDREDKEEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYK
        ME+EGN+IS N+EVPGCAVG  G             EEE+ E++E+EEEEEEEEEEEEEEVEDEGEDDIEEEDGY FKFKAGENPFDFVEGTDFSIQPYK
Subjt:  MEEEGNKISGNEEVPGCAVGNLG-------------EEEDREDKEEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYK

Query:  KFERLEYEALAEKKRKALANGQSERSAKRGRVEDISGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQ
        KFERLEYEALAEKKRKALAN QSER  KRGR+EDI GASF+EIMEAMNYGSRRK KEPK+RGRRKGSKKK+NR++TKLLGDATLCYAQGQYEKAISVL Q
Subjt:  KFERLEYEALAEKKRKALANGQSERSAKRGRVEDISGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQ

Query:  VVLQAPDLPDSYHTLGLVYHAIGDDIKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETY
        VVLQAPDLPDSYHTLGLVY+AIGDD+KAMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETY
Subjt:  VVLQAPDLPDSYHTLGLVYHAIGDDIKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETY

Query:  DQIHQQCLGNVEALMTGAKLYQKCGHLERGICILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGN
        DQIHQ+C+GNVEALMTGAKLYQKCGH ER ICILEDYIKGHP EADLDVVDLLAS YMGSKEFSKALE IEHAD+VYCAG E+PLNL  KAGICHVHLGN
Subjt:  DQIHQQCLGNVEALMTGAKLYQKCGHLERGICILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGN

Query:  IEKAECLFANLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNM-GILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASL
        IEKAE LFANL R+ A D+S+ +IE ADSLLSLKH+NLALKYYLMSEEVNAGG M GI+YLKIAQCY STN+RA+AIVFFYKVLQ LEDNINARLTLASL
Subjt:  IEKAECLFANLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNM-GILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASL

Query:  LLEEAREEEAISLLSPPKDSNSTSSSSCKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRE
        LLEEAREEEAISLLSPPKDSNS+SSSS K KPWWLNEKVKLKLC+IYRTKGMLENFVE IFSLVRESLYIETL+EKIKVNKKKLP+RVLLERVKVLDGRE
Subjt:  LLEEAREEEAISLLSPPKDSNSTSSSSCKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRE

Query:  TGSLFRGFRPVAPKSDLSKASRAKRLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEII
        TGSLFRGFRPVAPKSDLSKASRAKRLLQKRERIKEEKKA+ALAAGV V+YDD+DDEP LR HRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEII
Subjt:  TGSLFRGFRPVAPKSDLSKASRAKRLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEII

Query:  SLTLKLAFNTLSTERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSS--------------SMQAKYKDCAPPYLIAGHQFTAIS
        SLTLKLAFN+LS ERKEELQLLGAQLAFSSTDTKHGFNFAKHVVK YPYS SAWNCYYKVSS              S+QAKYKDCAPP++IAGHQF AIS
Subjt:  SLTLKLAFNTLSTERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSS--------------SMQAKYKDCAPPYLIAGHQFTAIS

Query:  HHQDAARKYLEAYKLLPDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPI
        HHQ+AA+KYLEAYKLLPDSPLINLCVG+ALINL LG RLQNKHQC+AQGLAFLY NLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPI
Subjt:  HHQDAARKYLEAYKLLPDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPI

Query:  PELFGENRNVKHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF
        P++FGENRN+KH+K VYCDLRREAAYNLHLIYK+SGA+DLARQVLKDHCTF
Subjt:  PELFGENRNVKHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF

A0A6J1GA07 general transcription factor 3C polypeptide 30.0e+0090.1Show/hide
Query:  MEEEGNKISGNEEVPGCAV-GNLGEEEDREDKEEEEEEEEEEEEE-EEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALA
        MEEEGN IS NEEVPGC V G    E + ED+EEEEEEEEEEEEE EEEVEDEGEDD EEEDGY FKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALA
Subjt:  MEEEGNKISGNEEVPGCAV-GNLGEEEDREDKEEEEEEEEEEEEE-EEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALA

Query:  EKKRKALANGQSERSAKRGRVEDISGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDS
        EKKRKALA+GQSERSAKRGRVEDISGASF+EIMEAMNYGSRRKR++PKKRGRRKGSK+KLN DVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPD+PDS
Subjt:  EKKRKALANGQSERSAKRGRVEDISGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDS

Query:  YHTLGLVYHAIGDDIKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGNV
        YHTLGLVY+AIGDD+KAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLF+RASLYLERGDCQKAAETYDQIHQ  + NV
Subjt:  YHTLGLVYHAIGDDIKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGNV

Query:  EALMTGAKLYQKCGHLERGICILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFANL
        EALMTGAKLYQKCGHLER ICILE+YIKGHP EADLDVVDLLAS YMGSKEFSKALE IEHAD VYCA  ELPLNLTAKAGICH+HLGN EKAECLFANL
Subjt:  EALMTGAKLYQKCGHLERGICILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFANL

Query:  EREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAIS
         REA  + SNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGN+GILYLKIAQCYSSTN+RA+AIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAIS
Subjt:  EREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAIS

Query:  LLSPPKDSNSTSSSSCKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVA
        LLSPPKDSNSTSSSS K KPWWLNE+VKLKLCHI+RTKGMLENFVEAIF LVRESLYIETL EKIKVNKKKLPKRVLLERVKVLDGR+TG LFRGFRPVA
Subjt:  LLSPPKDSNSTSSSSCKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVA

Query:  PKSDLSKASRAKRLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLS
        PKSDLSKASRAK+LLQKRERI+EEKKA+ALAAGV +NYDD DDEP LR  RESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFN+LS
Subjt:  PKSDLSKASRAKRLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLS

Query:  TERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSS--------------SMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYLEA
         ERKEELQLLGAQLAFSSTDTKHGFNFAKHVVK YPYS SAWNCYYKVSS              SMQ KYKDCAPPY+IAGHQFTAISHHQDAARKYLEA
Subjt:  TERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSS--------------SMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYLEA

Query:  YKLLPDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNVKH
        YKLLPDSPLINLCVG+ALINLTLGFRLQNKHQC+AQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN++ KH
Subjt:  YKLLPDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNVKH

Query:  QKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF
        Q  VYCDLRREAAYNLHLIYKESGA+DLARQVLKD+CTF
Subjt:  QKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF

A0A6J1KFI7 general transcription factor 3C polypeptide 30.0e+0090.01Show/hide
Query:  MEEEGNKISGNEEVPGCAVGNL----GEEEDREDKEEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA
        MEEEGN IS NEEVPGC V        E EDRE+ EEEEEEEEEEEE+EEEVEDEGEDD EEEDGY FKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA
Subjt:  MEEEGNKISGNEEVPGCAVGNL----GEEEDREDKEEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA

Query:  LAEKKRKALANGQSERSAKRGRVEDISGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLP
        LAEKKRKALANGQSERSAKRGRVEDISGASF+EIMEAMNYGSRRKR++PKKRGRRKGSK+KLN DVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPD+P
Subjt:  LAEKKRKALANGQSERSAKRGRVEDISGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLP

Query:  DSYHTLGLVYHAIGDDIKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLG
        DSYHTLGLVY+AIGDD+KAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLF+RASLYLERGDCQKAAETYDQIHQ  + 
Subjt:  DSYHTLGLVYHAIGDDIKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLG

Query:  NVEALMTGAKLYQKCGHLERGICILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFA
        NVEALMTGAKLYQKCGHLER ICILE+YIKGHP EADLDVVDLLAS YMGSKEFSKALE IEHAD VYCA  ELPLNLTAKAGICH+HLGN EKAECLFA
Subjt:  NVEALMTGAKLYQKCGHLERGICILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFA

Query:  NLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEA
        NL REA  + SNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGN+GILYLKIAQCYSSTN+RA+AIVFFYKVLQHLEDNINARLTLASLLLEEAREEEA
Subjt:  NLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEA

Query:  ISLLSPPKDSNSTSSSSCKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRP
        ISLLSPPKDSNSTSSSS K KPWWLNE+VKLKLCHI+RTKGMLENFVEAIF LVRESLYIETL EKIKVNKKKLPKRVLLERVKVLDGR+TG LFRGFRP
Subjt:  ISLLSPPKDSNSTSSSSCKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRP

Query:  VAPKSDLSKASRAKRLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNT
        VAPKSDLSKASRAK+LLQKRERI+EEKKA+ALAAGV +NYDD DDEP LR  RESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFN+
Subjt:  VAPKSDLSKASRAKRLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNT

Query:  LSTERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSS--------------SMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYL
        LS ERKEELQLLGAQLAFSSTDTKHGFNFAKHVVK YPYS SAWNCYYKVSS              SMQ KYKDCAPPY+IAGHQFTAISHHQDAARKYL
Subjt:  LSTERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSS--------------SMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYL

Query:  EAYKLLPDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNV
        EAYKLLPDSPLINLCVG+ALINLTLGFRLQNKHQC+AQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN++ 
Subjt:  EAYKLLPDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNV

Query:  KHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF
        KHQ  VYCDLRREAAYNLHLIYKESGA+DLARQVLKD+CTF
Subjt:  KHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF

SwissProt top hitse value%identityAlignment
O74458 Transcription factor tau subunit sfc41.0e-2722.72Show/hide
Query:  ANGQSERSAKRGRVE-DISGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQ-GQYEKAISVLSQVVLQAPDLPDSYHTLG
        ++ +S  S    R E DI+   ++E ++A+  G R+ RK  K RGR   +    + +V ++L  A   +AQ G +++A  +  ++V    ++  ++  LG
Subjt:  ANGQSERSAKRGRVE-DISGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQ-GQYEKAISVLSQVVLQAPDLPDSYHTLG

Query:  LVYHAIGDD----IKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDIN----LLFHRASLYLERGDCQKAAETYDQIHQQCL
          +   G+      K +  +M AAHL PKD  LW      S      DQA YC ++A+ A+P + +     +++R+ L  E G  +KAAE +  + Q   
Subjt:  LVYHAIGDD----IKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDIN----LLFHRASLYLERGDCQKAAETYDQIHQQCL

Query:  GNVEALMTGAKLYQKCGHLERGIC----ILEDYIKGHPAEA------DLDVVDLLASFYMGSKEFSKALERI----------------------------
         N   L   A++Y K  H  R I     I   Y   +PA        DL  ++L A   +   ++S  +  I                            
Subjt:  GNVEALMTGAKLYQKCGHLERGIC----ILEDYIKGHPAEA------DLDVVDLLASFYMGSKEFSKALERI----------------------------

Query:  EHADEVYCAGKE--------LPLNLTAKAGICHVHLGNIEKAECLFANLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYLM------SEEVNAGGNMG
        E   E   A +E        LP     K GI  +  G + +AE  F+ ++         ++ ++A + + ++  +LAL+Y+++      ++ +    NMG
Subjt:  EHADEVYCAGKE--------LPLNLTAKAGICHVHLGNIEKAECLFANLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYLM------SEEVNAGGNMG

Query:  ILYLKI------AQCYSS----TNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAISLLSPPKDSNSTSSSSCKLKPWWLNEKVKL----KL
        + YL++       QC  +     N    A++   ++   L+DN +A L + + + E+ R    I+ L   +  N     +   + +  N+KV      K 
Subjt:  ILYLKI------AQCYSS----TNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAISLLSPPKDSNSTSSSSCKLKPWWLNEKVKL----KL

Query:  CHIYRTKGMLENF-VEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVAPKSDLSKASRAKR-LLQKRERIKEEKKAKA
          I R+K     F +       R    ++ L++ +K  K++     L E + +        L   F  +       K +RA+  LL +R R         
Subjt:  CHIYRTKGMLENF-VEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVAPKSDLSKASRAKR-LLQKRERIKEEKKAKA

Query:  LAAG------VEVNYDDLDDEPTLR-GHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLSTERKEELQLLGAQLAFSSTDTK
                      Y DLD +  LR G+  +     +  + ++ L V+    L  +G   +A ++++  +        T +++ L+      +  + D +
Subjt:  LAAG------VEVNYDDLDDEPTLR-GHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLSTERKEELQLLGAQLAFSSTDTK

Query:  HG----------FNFAKHVVKLYPYSIS-AWNCYYKVSSSMQAKY---------------------------------KDCAPPYLIA--GHQFTAISHH
                    F F +   +L+   +S  + C      S   K+                                      P L+   GH        
Subjt:  HG----------FNFAKHVVKLYPYSIS-AWNCYYKVSSSMQAKY---------------------------------KDCAPPYLIA--GHQFTAISHH

Query:  QDAARKYLEAYKLLPDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDN-----SQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKD
          A   Y  A+ + PD P+ NL +G A ++  +     N+H  I QG  FLY+   L  N      QEALYN+ +AYH IGL   AV YYE VL      
Subjt:  QDAARKYLEAYKLLPDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDN-----SQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKD

Query:  CPIPELFGENRNVKHQKL-VYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF
           P   G+        +    D   EAAYNL LIY  SG + LA Q+   +  F
Subjt:  CPIPELFGENRNVKHQKL-VYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF

P33339 Transcription factor tau 131 kDa subunit7.2e-1319.93Show/hide
Query:  AVGNLGEEEDREDKEEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQSERSAKRG
        A G L +E+  +  E E  +  +  +E+EE      DD        +K    +  +D     D  +    +    EY   +E+    LA           
Subjt:  AVGNLGEEEDREDKEEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQSERSAKRG

Query:  RVEDISGASFDEIMEAMNYGSRRKRKEPK-KRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDSYHTLGLVYHAIGDDIKAM
          ED   A    I EA N+  ++K+K  K K   R+  ++ L+ +V +LL  A   + +   + A  + ++V+ +      +Y TLG +Y   G      
Subjt:  RVEDISGASFDEIMEAMNYGSRRKRKEPK-KRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDSYHTLGLVYHAIGDDIKAM

Query:  GFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQ-------------CLGNVEALMT
          + LAAHL   D   WK++   S D   + QA YC S+ I   P +   ++ R+ LY + G   +A + + +++                  + + +  
Subjt:  GFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQ-------------CLGNVEALMT

Query:  GAKLYQKC--GHLERGICIL------------------EDYIKGHPAEADLD------------------------VVDLLASFYM--------GSKEFS
          +LY K    ++ER   IL                  ED  +  P E D D                         +++LA  ++        G K   
Subjt:  GAKLYQKC--GHLERGICIL------------------EDYIKGHPAEADLD------------------------VVDLLASFYM--------GSKEFS

Query:  KALERIEHA-------------------------DEVYCAGKE----LPLNLTAKAGICHVHLGNIEKAECLFANLEREAAGDNSNLMIEVADSLLSLKH
        K    I+                           D +  A KE    +P+++  + G+  ++  N+ +A   F  L  E   D ++L  E A +L   + 
Subjt:  KALERIEHA-------------------------DEVYCAGKE----LPLNLTAKAGICHVHLGNIEKAECLFANLEREAAGDNSNLMIEVADSLLSLKH

Query:  YNLALKYY---LMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLL------------------LEEAREEEAISLLS
        Y  A+ ++   L  EE         ++  +A+CY        A  F+   ++   D+++ R++LA +                   + + + +E +  +S
Subjt:  YNLALKYY---LMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLL------------------LEEAREEEAISLLS

Query:  PPKDSNSTSSSSCKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLY---IETLQEKIKVNKKKLPKRVLLERVKVLDG-----------RET
          K SN TS  S                     +K +LE+     F   + + Y    E ++ + ++  K + K   +++ ++  G              
Subjt:  PPKDSNSTSSSSCKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLY---IETLQEKIKVNKKKLPKRVLLERVKVLDG-----------RET

Query:  GSLFRGFRPVAPKSDLSKASRAKRLLQKRERIKEE-----KKAKALAAGVEVNYDDL-DDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSE
          L   F  V      S++ +   +L++ ++   E     ++   LA G  V    L ++  TL    E      L  E++  L ++L   +A      +
Subjt:  GSLFRGFRPVAPKSDLSKASRAKRLLQKRERIKEE-----KKAKALAAGVEVNYDDL-DDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSE

Query:  ALEIISLTLKLAFNTLSTERKEELQLLGAQLAFSSTDTK----------HGFNFAKHVVKLYPYSI----SAWN--------------------CYYKVS
         L ++    ++       ER + ++ +   +     D +          + F F + V++++ YS+    S+ N                    C Y   
Subjt:  ALEIISLTLKLAFNTLSTERKEELQLLGAQLAFSSTDTK----------HGFNFAKHVVKLYPYSI----SAWN--------------------CYYKVS

Query:  SSMQAKYKD--------CAPPYLIAGHQFTAISHHQDAARKYLEAYKLL-------PDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLC
         + QA   +         + PYL   +     S     +R +L A + L       PD P++NL +G + I+  +      +H  I  GL +LY+  K+ 
Subjt:  SSMQAKYKD--------CAPPYLIAGHQFTAISHHQDAARKYLEAYKLL-------PDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLC

Query:  DN------SQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNVKHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDH
         +       QEA YN+ RA+H IGLV++A+ YY +VL  Y                         L++ AAYN  +IY++SG V+LA  +++ +
Subjt:  DN------SQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNVKHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDH

Q9Y5Q9 General transcription factor 3C polypeptide 32.5e-5023.35Show/hide
Query:  GNLGEEEDREDKEEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQSERSAKRGRV
        G +  EE    +EE +  E++  +E+ ++  E   D  E    +          D  EG + S    K   ++    L E +       + E   +    
Subjt:  GNLGEEEDREDKEEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQSERSAKRGRV

Query:  EDISGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDSYHTLGLVYHAIGDDIKAMGFY
         +   A    ++E +        +E KK  + K  + KL R +  L+G+A + +A+G+ E+AI +  +++ QAP   + + TL ++Y   GD  K++ F 
Subjt:  EDISGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDSYHTLGLVYHAIGDDIKAMGFY

Query:  MLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGN-----VEALMTGAKLYQKCGHL
        ++AAHL P D+  W  L   S+++ +I QA +C +KA+K EP ++  L+ R+SLY + GD + A + Y +I      +     ++     AK Y +   +
Subjt:  MLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGN-----VEALMTGAKLYQKCGHL

Query:  ERGICILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERI---------------------EHADEVYCAGKE-LPLNLTAKAGICHVHLGNIEKAE
           I I+++    H     ++ V++ A  Y+ +K++ KALE I                     +  + V C   + +P+++T K  +C VHL  +E   
Subjt:  ERGICILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERI---------------------EHADEVYCAGKE-LPLNLTAKAGICHVHLGNIEKAE

Query:  CLFANLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAR
         L   L  +   D  +L ++VA++ L +  YN AL   L +   +   N+ +++L+ A+C  +     +A   + KV+     +++AR++L++L  +  +
Subjt:  CLFANLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAR

Query:  EEEAISLLSPPKDSNSTSSSSCKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFR
         E+A+  L P  D ++ +  +   +      K+ L    +  ++G +  +V+ + +++   L       K+ +N+          +V ++   ++G    
Subjt:  EEEAISLLSPPKDSNSTSSSSCKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFR

Query:  GFRPVAPKSDLSKASRAKRLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKL
                  L K SR K +    ++      AKA+ A                      L ++L  +++ NL++    +L  L R  EA  ++  +L+ 
Subjt:  GFRPVAPKSDLSKASRAKRLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKL

Query:  AFNTLSTERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSSMQ--AKYKDCAPPYL----------IAGHQFTAISHHQDAARK
               ++++EL+  G   A    + +  +N+ + +V         WN + +V+   Q    ++ C    L          + GH        + A  +
Subjt:  AFNTLSTERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSSMQ--AKYKDCAPPYL----------IAGHQFTAISHHQDAARK

Query:  YLEAYKLLPDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENR
        Y++A++  PD PL + C+G   I++     +  +H  I QG +FL + L L    QE+ YN+ R  H +GL+ LA+ YY+K L        +P L     
Subjt:  YLEAYKLLPDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENR

Query:  NVKHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCT
         V+  +L   DLRR+ AYNL LIY+ SG   +A+ +L  +C+
Subjt:  NVKHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCT

Arabidopsis top hitse value%identityAlignment
AT1G17680.1 tetratricopeptide repeat (TPR)-containing protein5.2e-18442.8Show/hide
Query:  EEGNKISGNEEVPG---CAVGNLGEEEDREDKEEEEEEE--EEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEAL
        +EGN IS  EE P    C    LG + + +DK+   +EE   ++++++ + +DEG++  EE+D     F+AG  P                FER EYEAL
Subjt:  EEGNKISGNEEVPG---CAVGNLGEEEDREDKEEEEEEE--EEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEAL

Query:  AEKKRKALANGQSERSAKRGRVEDISGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPD
        AE+KRKALA+ Q   S        + G      ME M+ G RRK ++ KK+GRR GSKK++  D+ K   +A   +A G+  +A+ +L +V+ QAP    
Subjt:  AEKKRKALANGQSERSAKRGRVEDISGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPD

Query:  SYHTLGLVYHAIG-DDIKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLG
        +Y+ L  V   +G  +  +     +AA++    S  WKLL+    ++ +I  A    SKAI+A+PDDI L +  A + L  G  ++AAET++QI ++C  
Subjt:  SYHTLGLVYHAIG-DDIKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLG

Query:  NVEALMTGAKLYQKCGHLERGICILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFA
         +EAL  G + + K G  ER   ILED+IK H +E   DV+DLLAS +M      +AL+ I    ++Y  GKEL  +L  +  ICHVHL  +E+AE + +
Subjt:  NVEALMTGAKLYQKCGHLERGICILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFA

Query:  NLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYL--MSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREE
         L +EA  ++  L+  +AD L ++ +++ ALKYY+  +SE VN     G L++KIA+CY S  +R QAIVF+YK L  L D ++ R+TLASLLLE+ + +
Subjt:  NLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYL--MSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREE

Query:  EAISLLSPPKDSNSTSSSSCKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGF
        EA+ +LSPP++ +     + KLK WW N K+++ LC IY ++GMLE+F      LV + ++  T++ K K       + VL E  +    R         
Subjt:  EAISLLSPPKDSNSTSSSSCKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGF

Query:  RPVAPKSDLSKASRAKRLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAF
            PK    K  + +  L +  RI+E        A ++ + +D+  E             ++KDEEYH L VDLCKALASL R  EALEI++L  +L  
Subjt:  RPVAPKSDLSKASRAKRLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAF

Query:  NTLSTERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSS--------------SMQAKYKDCAPPYLIAGHQFTAISHHQDAARK
          L  E K+ELQ LGA+++  + D K  F+  + V++ +PY ++AWNCYY V S               +++KY+DC PP LIAGH FT  S HQDAAR+
Subjt:  NTLSTERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSS--------------SMQAKYKDCAPPYLIAGHQFTAISHHQDAARK

Query:  YLEAYKLLPDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENR
        YLEAYKL+P+SPLINLCVG+ALINL LGFRL+N+H+C+AQG AFLY NL++C NSQEALYN+ARAY H+GLVTLA +YYEKVLA Y+KD  +P+L  E+ 
Subjt:  YLEAYKLLPDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENR

Query:  NV-KHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF
         V + +K V CDLR+EAA+NLHLIYK SGA DLARQVLKDHCTF
Subjt:  NV-KHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF

AT1G17680.2 tetratricopeptide repeat (TPR)-containing protein5.2e-18442.8Show/hide
Query:  EEGNKISGNEEVPG---CAVGNLGEEEDREDKEEEEEEE--EEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEAL
        +EGN IS  EE P    C    LG + + +DK+   +EE   ++++++ + +DEG++  EE+D     F+AG  P                FER EYEAL
Subjt:  EEGNKISGNEEVPG---CAVGNLGEEEDREDKEEEEEEE--EEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEAL

Query:  AEKKRKALANGQSERSAKRGRVEDISGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPD
        AE+KRKALA+ Q   S        + G      ME M+ G RRK ++ KK+GRR GSKK++  D+ K   +A   +A G+  +A+ +L +V+ QAP    
Subjt:  AEKKRKALANGQSERSAKRGRVEDISGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPD

Query:  SYHTLGLVYHAIG-DDIKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLG
        +Y+ L  V   +G  +  +     +AA++    S  WKLL+    ++ +I  A    SKAI+A+PDDI L +  A + L  G  ++AAET++QI ++C  
Subjt:  SYHTLGLVYHAIG-DDIKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLG

Query:  NVEALMTGAKLYQKCGHLERGICILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFA
         +EAL  G + + K G  ER   ILED+IK H +E   DV+DLLAS +M      +AL+ I    ++Y  GKEL  +L  +  ICHVHL  +E+AE + +
Subjt:  NVEALMTGAKLYQKCGHLERGICILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFA

Query:  NLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYL--MSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREE
         L +EA  ++  L+  +AD L ++ +++ ALKYY+  +SE VN     G L++KIA+CY S  +R QAIVF+YK L  L D ++ R+TLASLLLE+ + +
Subjt:  NLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYL--MSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREE

Query:  EAISLLSPPKDSNSTSSSSCKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGF
        EA+ +LSPP++ +     + KLK WW N K+++ LC IY ++GMLE+F      LV + ++  T++ K K       + VL E  +    R         
Subjt:  EAISLLSPPKDSNSTSSSSCKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGF

Query:  RPVAPKSDLSKASRAKRLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAF
            PK    K  + +  L +  RI+E        A ++ + +D+  E             ++KDEEYH L VDLCKALASL R  EALEI++L  +L  
Subjt:  RPVAPKSDLSKASRAKRLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAF

Query:  NTLSTERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSS--------------SMQAKYKDCAPPYLIAGHQFTAISHHQDAARK
          L  E K+ELQ LGA+++  + D K  F+  + V++ +PY ++AWNCYY V S               +++KY+DC PP LIAGH FT  S HQDAAR+
Subjt:  NTLSTERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSS--------------SMQAKYKDCAPPYLIAGHQFTAISHHQDAARK

Query:  YLEAYKLLPDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENR
        YLEAYKL+P+SPLINLCVG+ALINL LGFRL+N+H+C+AQG AFLY NL++C NSQEALYN+ARAY H+GLVTLA +YYEKVLA Y+KD  +P+L  E+ 
Subjt:  YLEAYKLLPDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENR

Query:  NV-KHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF
         V + +K V CDLR+EAA+NLHLIYK SGA DLARQVLKDHCTF
Subjt:  NV-KHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAGGAAGGGAATAAAATTTCTGGCAATGAAGAGGTTCCTGGTTGTGCCGTAGGCAATTTAGGAGAGGAGGAGGATAGAGAGGATAAAGAGGAGGAGGAGGAAGA
GGAAGAGGAAGAGGAAGAAGAAGAGGAAGAAGTGGAGGATGAAGGAGAAGATGATATTGAAGAAGAAGATGGTTACACTTTCAAATTTAAGGCCGGAGAAAATCCTTTTG
ATTTTGTTGAAGGGACGGATTTTAGCATCCAACCATATAAAAAATTTGAGCGTCTTGAATATGAAGCTCTTGCTGAGAAAAAACGAAAAGCTCTTGCAAATGGTCAGAGC
GAGAGATCTGCAAAGAGGGGCAGGGTAGAAGATATTTCTGGTGCAAGCTTTGATGAAATTATGGAAGCTATGAATTATGGATCTAGGAGGAAGCGGAAGGAGCCGAAAAA
AAGAGGTAGACGGAAGGGATCAAAGAAAAAACTTAATCGTGATGTTACAAAGTTGCTTGGTGATGCAACCTTATGTTATGCTCAAGGCCAGTATGAGAAGGCTATATCTG
TGTTGAGTCAAGTTGTTCTGCAAGCCCCAGATTTACCTGATTCGTACCATACACTTGGACTTGTTTATCATGCAATTGGCGATGATATAAAAGCCATGGGATTCTACATG
CTTGCTGCACATTTAATGCCAAAAGATTCATCTCTCTGGAAACTGCTATTTTCATGGTCAATTGACCGAGGTGACATTGATCAAGCAAGTTATTGTCTTTCTAAAGCAAT
AAAAGCAGAGCCTGATGATATTAATTTATTATTTCATCGTGCATCACTCTATCTTGAGCGTGGAGATTGTCAAAAGGCAGCTGAAACATATGACCAAATTCATCAACAAT
GTCTAGGAAATGTTGAAGCACTAATGACAGGAGCGAAGCTGTACCAAAAATGCGGTCATCTGGAACGTGGAATTTGCATTCTTGAGGACTACATTAAAGGACATCCAGCT
GAAGCTGATTTAGATGTGGTTGATCTTCTAGCTTCTTTTTACATGGGAAGTAAAGAATTCAGCAAAGCTCTTGAGCGCATTGAGCATGCGGATGAGGTGTACTGTGCAGG
AAAAGAGCTACCTTTAAACTTGACAGCTAAAGCAGGAATCTGCCACGTCCATCTTGGAAATATTGAAAAGGCAGAGTGCCTCTTTGCTAATTTGGAACGGGAAGCTGCCG
GTGATAACTCAAATTTGATGATTGAAGTTGCAGACTCGTTGCTGAGTCTTAAGCACTATAACTTGGCATTGAAGTATTATCTCATGTCAGAAGAAGTAAATGCTGGAGGG
AACATGGGGATTTTATACCTAAAAATTGCCCAATGTTACTCATCAACTAATGATAGAGCACAGGCAATTGTGTTCTTTTACAAAGTGCTTCAACATCTTGAAGATAACAT
TAATGCCCGATTAACTTTGGCTTCCCTCCTCCTTGAGGAAGCTAGAGAAGAAGAAGCCATTTCATTACTATCTCCTCCGAAAGATTCAAACTCAACTAGCTCATCTTCCT
GCAAACTCAAACCTTGGTGGCTCAATGAGAAAGTAAAACTGAAGCTTTGCCACATATACAGAACTAAAGGAATGCTCGAGAACTTCGTTGAGGCAATCTTTTCTTTGGTC
CGTGAGTCCTTATATATTGAGACTCTTCAAGAAAAGATTAAAGTGAACAAGAAGAAGCTTCCAAAGAGGGTTTTGCTTGAAAGAGTCAAAGTACTAGATGGACGTGAAAC
TGGTAGCCTATTTCGTGGCTTCAGACCTGTAGCTCCTAAATCAGATTTATCAAAAGCTTCCAGAGCAAAGAGATTGCTTCAAAAGAGGGAAAGAATCAAGGAAGAAAAGA
AGGCTAAAGCTCTGGCTGCTGGAGTTGAGGTGAACTATGATGATTTGGATGATGAGCCGACGCTACGGGGACACCGAGAATCTCCCCTGCCTAATCTTCTGAAGGATGAA
GAATATCATAATCTTATTGTTGATCTGTGCAAGGCATTGGCTTCGTTGGGAAGATGTTCTGAAGCTTTAGAGATAATAAGTTTAACTTTAAAGTTGGCTTTTAACACATT
ATCCACAGAAAGGAAGGAGGAACTCCAGTTACTCGGAGCTCAATTAGCATTCAGCTCAACTGATACCAAGCATGGATTCAACTTTGCAAAGCACGTTGTTAAGCTGTATC
CTTACAGCATCTCTGCTTGGAACTGCTATTATAAAGTATCTTCAAGCATGCAAGCCAAATACAAAGATTGTGCACCGCCCTATCTCATTGCCGGGCATCAATTTACCGCC
ATTAGCCATCATCAAGATGCTGCAAGGAAATATCTTGAAGCTTATAAATTATTGCCGGATAGTCCCCTAATTAACTTGTGTGTTGGATCGGCGTTAATCAACTTGACGCT
TGGATTTCGTCTTCAAAACAAGCATCAGTGTATCGCGCAAGGCTTAGCATTCCTCTACAAAAATTTGAAGCTTTGTGATAACAGCCAGGAAGCCTTATACAACATAGCTC
GAGCGTATCATCACATTGGACTCGTGACGCTGGCGGTTACATATTATGAAAAGGTGCTCGCAACTTACCAGAAGGATTGCCCCATCCCAGAACTTTTTGGCGAGAATCGA
AACGTTAAGCATCAGAAATTAGTATACTGTGATCTACGCAGAGAAGCAGCTTACAATTTGCATCTTATATATAAAGAGAGCGGAGCTGTTGATCTTGCTAGGCAAGTCCT
AAAAGATCATTGTACATTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAGGAAGGGAATAAAATTTCTGGCAATGAAGAGGTTCCTGGTTGTGCCGTAGGCAATTTAGGAGAGGAGGAGGATAGAGAGGATAAAGAGGAGGAGGAGGAAGA
GGAAGAGGAAGAGGAAGAAGAAGAGGAAGAAGTGGAGGATGAAGGAGAAGATGATATTGAAGAAGAAGATGGTTACACTTTCAAATTTAAGGCCGGAGAAAATCCTTTTG
ATTTTGTTGAAGGGACGGATTTTAGCATCCAACCATATAAAAAATTTGAGCGTCTTGAATATGAAGCTCTTGCTGAGAAAAAACGAAAAGCTCTTGCAAATGGTCAGAGC
GAGAGATCTGCAAAGAGGGGCAGGGTAGAAGATATTTCTGGTGCAAGCTTTGATGAAATTATGGAAGCTATGAATTATGGATCTAGGAGGAAGCGGAAGGAGCCGAAAAA
AAGAGGTAGACGGAAGGGATCAAAGAAAAAACTTAATCGTGATGTTACAAAGTTGCTTGGTGATGCAACCTTATGTTATGCTCAAGGCCAGTATGAGAAGGCTATATCTG
TGTTGAGTCAAGTTGTTCTGCAAGCCCCAGATTTACCTGATTCGTACCATACACTTGGACTTGTTTATCATGCAATTGGCGATGATATAAAAGCCATGGGATTCTACATG
CTTGCTGCACATTTAATGCCAAAAGATTCATCTCTCTGGAAACTGCTATTTTCATGGTCAATTGACCGAGGTGACATTGATCAAGCAAGTTATTGTCTTTCTAAAGCAAT
AAAAGCAGAGCCTGATGATATTAATTTATTATTTCATCGTGCATCACTCTATCTTGAGCGTGGAGATTGTCAAAAGGCAGCTGAAACATATGACCAAATTCATCAACAAT
GTCTAGGAAATGTTGAAGCACTAATGACAGGAGCGAAGCTGTACCAAAAATGCGGTCATCTGGAACGTGGAATTTGCATTCTTGAGGACTACATTAAAGGACATCCAGCT
GAAGCTGATTTAGATGTGGTTGATCTTCTAGCTTCTTTTTACATGGGAAGTAAAGAATTCAGCAAAGCTCTTGAGCGCATTGAGCATGCGGATGAGGTGTACTGTGCAGG
AAAAGAGCTACCTTTAAACTTGACAGCTAAAGCAGGAATCTGCCACGTCCATCTTGGAAATATTGAAAAGGCAGAGTGCCTCTTTGCTAATTTGGAACGGGAAGCTGCCG
GTGATAACTCAAATTTGATGATTGAAGTTGCAGACTCGTTGCTGAGTCTTAAGCACTATAACTTGGCATTGAAGTATTATCTCATGTCAGAAGAAGTAAATGCTGGAGGG
AACATGGGGATTTTATACCTAAAAATTGCCCAATGTTACTCATCAACTAATGATAGAGCACAGGCAATTGTGTTCTTTTACAAAGTGCTTCAACATCTTGAAGATAACAT
TAATGCCCGATTAACTTTGGCTTCCCTCCTCCTTGAGGAAGCTAGAGAAGAAGAAGCCATTTCATTACTATCTCCTCCGAAAGATTCAAACTCAACTAGCTCATCTTCCT
GCAAACTCAAACCTTGGTGGCTCAATGAGAAAGTAAAACTGAAGCTTTGCCACATATACAGAACTAAAGGAATGCTCGAGAACTTCGTTGAGGCAATCTTTTCTTTGGTC
CGTGAGTCCTTATATATTGAGACTCTTCAAGAAAAGATTAAAGTGAACAAGAAGAAGCTTCCAAAGAGGGTTTTGCTTGAAAGAGTCAAAGTACTAGATGGACGTGAAAC
TGGTAGCCTATTTCGTGGCTTCAGACCTGTAGCTCCTAAATCAGATTTATCAAAAGCTTCCAGAGCAAAGAGATTGCTTCAAAAGAGGGAAAGAATCAAGGAAGAAAAGA
AGGCTAAAGCTCTGGCTGCTGGAGTTGAGGTGAACTATGATGATTTGGATGATGAGCCGACGCTACGGGGACACCGAGAATCTCCCCTGCCTAATCTTCTGAAGGATGAA
GAATATCATAATCTTATTGTTGATCTGTGCAAGGCATTGGCTTCGTTGGGAAGATGTTCTGAAGCTTTAGAGATAATAAGTTTAACTTTAAAGTTGGCTTTTAACACATT
ATCCACAGAAAGGAAGGAGGAACTCCAGTTACTCGGAGCTCAATTAGCATTCAGCTCAACTGATACCAAGCATGGATTCAACTTTGCAAAGCACGTTGTTAAGCTGTATC
CTTACAGCATCTCTGCTTGGAACTGCTATTATAAAGTATCTTCAAGCATGCAAGCCAAATACAAAGATTGTGCACCGCCCTATCTCATTGCCGGGCATCAATTTACCGCC
ATTAGCCATCATCAAGATGCTGCAAGGAAATATCTTGAAGCTTATAAATTATTGCCGGATAGTCCCCTAATTAACTTGTGTGTTGGATCGGCGTTAATCAACTTGACGCT
TGGATTTCGTCTTCAAAACAAGCATCAGTGTATCGCGCAAGGCTTAGCATTCCTCTACAAAAATTTGAAGCTTTGTGATAACAGCCAGGAAGCCTTATACAACATAGCTC
GAGCGTATCATCACATTGGACTCGTGACGCTGGCGGTTACATATTATGAAAAGGTGCTCGCAACTTACCAGAAGGATTGCCCCATCCCAGAACTTTTTGGCGAGAATCGA
AACGTTAAGCATCAGAAATTAGTATACTGTGATCTACGCAGAGAAGCAGCTTACAATTTGCATCTTATATATAAAGAGAGCGGAGCTGTTGATCTTGCTAGGCAAGTCCT
AAAAGATCATTGTACATTTTAA
Protein sequenceShow/hide protein sequence
MEEEGNKISGNEEVPGCAVGNLGEEEDREDKEEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQS
ERSAKRGRVEDISGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDSYHTLGLVYHAIGDDIKAMGFYM
LAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGNVEALMTGAKLYQKCGHLERGICILEDYIKGHPA
EADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFANLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGG
NMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAISLLSPPKDSNSTSSSSCKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLV
RESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVAPKSDLSKASRAKRLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDE
EYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLSTERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSSMQAKYKDCAPPYLIAGHQFTA
ISHHQDAARKYLEAYKLLPDSPLINLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENR
NVKHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF