; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0016821 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0016821
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptiongeneral transcription factor 3C polypeptide 3
Genome locationchr12:41571341..41582435
RNA-Seq ExpressionLag0016821
SyntenyLag0016821
Gene Ontology termsGO:0006383 - transcription by RNA polymerase III (biological process)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily
IPR019734 - Tetratricopeptide repeat
IPR039340 - Transcription factor Tfc4/TFIIIC-102/Sfc4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606714.1 General transcription factor 3C polypeptide 3, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0091.37Show/hide
Query:  MEEEGNKNSGNEEVPGCAV-GDLGEEEDREDREEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAE
        MEEEGN  S NEEVPGC V G    E + EDREEEEEEEEEEEE+EEEVEDEGEDD EEEDGY FKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAE
Subjt:  MEEEGNKNSGNEEVPGCAV-GDLGEEEDREDREEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAE

Query:  KKRKALANGQSERSAKRGRVEDITGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDSY
        KKRKALA+GQSERSAKRGRVEDI+GASF+EIMEAMNYGSRRKR++PKKRGRRKGSK+KLN DVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPD+PDSY
Subjt:  KKRKALANGQSERSAKRGRVEDITGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDSY

Query:  HTLGLVYHAIGDDVKAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGNVE
        HTLGLVY+AIGDDVKAMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLF+RASLYLERGDCQKAAETYDQIHQ  + NVE
Subjt:  HTLGLVYHAIGDDVKAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGNVE

Query:  ALMTGAKLYQKCGHLERGTCILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFANLE
        ALMTGAKLYQKCGHLER  CILE+YIKGHP EADLDVVDLLAS YMGSKEFSKALE IEHAD VYCA  ELPLNLTAKAGICH+HLGN EKAECLFANL 
Subjt:  ALMTGAKLYQKCGHLERGTCILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFANLE

Query:  REAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAISL
        REA  + SNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGN+GILYLKIAQCYSSTN+RA+AIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAISL
Subjt:  REAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAISL

Query:  LSPPKDSNSTSSSSSKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVAP
        LSPPKDSNSTSSSSSK KPWWLNE+VKLKLCHI+RTKGMLENFVEAIF LVRESLYIETL EKIKVNKKKLPKRVLLERVKVLDGR+TG LFRGFRPVAP
Subjt:  LSPPKDSNSTSSSSSKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVAP

Query:  KSDLSKASRAKKLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLST
        KSDLSKASRAKKLLQKRERI+EEKKA+ALAAGV +NYDD DDEP LR  RESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFN+LS 
Subjt:  KSDLSKASRAKKLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLST

Query:  ERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSRMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYLEAY
        ERKEELQLLGAQLAFSSTDTKHGFNFAKHVVK YPYS SAWNCYYKVSSRMTNRDSRHCKLLNSMQ KYKDCAPPY+IAGHQFTAISHHQDAARKYLEAY
Subjt:  ERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSRMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYLEAY

Query:  KLLPDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRN-VKH
        KLLPDSPL+NLCVG+ALINLTLGFRLQNKHQC+AQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN++  KH
Subjt:  KLLPDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRN-VKH

Query:  QKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF
        Q  VYCDLRREAAYNLHLIYKESGA+DLARQVLKD+CTF
Subjt:  QKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF

XP_022948712.1 general transcription factor 3C polypeptide 3 [Cucurbita moschata]0.0e+0091.48Show/hide
Query:  MEEEGNKNSGNEEVPGCAV-GDLGEEEDREDREEEEEEEEEEEEE-EEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALA
        MEEEGN  S NEEVPGC V G    E + EDREEEEEEEEEEEEE EEEVEDEGEDD EEEDGY FKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALA
Subjt:  MEEEGNKNSGNEEVPGCAV-GDLGEEEDREDREEEEEEEEEEEEE-EEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALA

Query:  EKKRKALANGQSERSAKRGRVEDITGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDS
        EKKRKALA+GQSERSAKRGRVEDI+GASF+EIMEAMNYGSRRKR++PKKRGRRKGSK+KLN DVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPD+PDS
Subjt:  EKKRKALANGQSERSAKRGRVEDITGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDS

Query:  YHTLGLVYHAIGDDVKAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGNV
        YHTLGLVY+AIGDDVKAMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLF+RASLYLERGDCQKAAETYDQIHQ  + NV
Subjt:  YHTLGLVYHAIGDDVKAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGNV

Query:  EALMTGAKLYQKCGHLERGTCILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFANL
        EALMTGAKLYQKCGHLER  CILE+YIKGHP EADLDVVDLLAS YMGSKEFSKALE IEHAD VYCA  ELPLNLTAKAGICH+HLGN EKAECLFANL
Subjt:  EALMTGAKLYQKCGHLERGTCILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFANL

Query:  EREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAIS
         REA  + SNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGN+GILYLKIAQCYSSTN+RA+AIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAIS
Subjt:  EREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAIS

Query:  LLSPPKDSNSTSSSSSKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVA
        LLSPPKDSNSTSSSSSK KPWWLNE+VKLKLCHI+RTKGMLENFVEAIF LVRESLYIETL EKIKVNKKKLPKRVLLERVKVLDGR+TG LFRGFRPVA
Subjt:  LLSPPKDSNSTSSSSSKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVA

Query:  PKSDLSKASRAKKLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLS
        PKSDLSKASRAKKLLQKRERI+EEKKA+ALAAGV +NYDD DDEP LR  RESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFN+LS
Subjt:  PKSDLSKASRAKKLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLS

Query:  TERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSRMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYLEA
         ERKEELQLLGAQLAFSSTDTKHGFNFAKHVVK YPYS SAWNCYYKVSSRMTNRDSRHCKLLNSMQ KYKDCAPPY+IAGHQFTAISHHQDAARKYLEA
Subjt:  TERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSRMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYLEA

Query:  YKLLPDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNVKH
        YKLLPDSPL+NLCVG+ALINLTLGFRLQNKHQC+AQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN++ KH
Subjt:  YKLLPDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNVKH

Query:  QKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF
        Q  VYCDLRREAAYNLHLIYKESGA+DLARQVLKD+CTF
Subjt:  QKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF

XP_022997988.1 general transcription factor 3C polypeptide 3 [Cucurbita maxima]0.0e+0091.49Show/hide
Query:  MEEEGNKNSGNEEVPGCAV-GDLGEEEDREDREEEEEEEEEEEEE--EEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEAL
        MEEEGN  S NEEVPGC V G    E + EDREEEEEEEEEEEEE  EEEVEDEGEDD EEEDGY FKFKAGENPFDFVEGTDFSIQPYKKFERLEYEAL
Subjt:  MEEEGNKNSGNEEVPGCAV-GDLGEEEDREDREEEEEEEEEEEEE--EEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEAL

Query:  AEKKRKALANGQSERSAKRGRVEDITGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPD
        AEKKRKALANGQSERSAKRGRVEDI+GASF+EIMEAMNYGSRRKR++PKKRGRRKGSK+KLN DVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPD+PD
Subjt:  AEKKRKALANGQSERSAKRGRVEDITGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPD

Query:  SYHTLGLVYHAIGDDVKAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGN
        SYHTLGLVY+AIGDDVKAMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLF+RASLYLERGDCQKAAETYDQIHQ  + N
Subjt:  SYHTLGLVYHAIGDDVKAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGN

Query:  VEALMTGAKLYQKCGHLERGTCILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFAN
        VEALMTGAKLYQKCGHLER  CILE+YIKGHP EADLDVVDLLAS YMGSKEFSKALE IEHAD VYCA  ELPLNLTAKAGICH+HLGN EKAECLFAN
Subjt:  VEALMTGAKLYQKCGHLERGTCILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFAN

Query:  LEREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAI
        L REA  + SNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGN+GILYLKIAQCYSSTN+RA+AIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAI
Subjt:  LEREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAI

Query:  SLLSPPKDSNSTSSSSSKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPV
        SLLSPPKDSNSTSSSSSK KPWWLNE+VKLKLCHI+RTKGMLENFVEAIF LVRESLYIETL EKIKVNKKKLPKRVLLERVKVLDGR+TG LFRGFRPV
Subjt:  SLLSPPKDSNSTSSSSSKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPV

Query:  APKSDLSKASRAKKLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTL
        APKSDLSKASRAKKLLQKRERI+EEKKA+ALAAGV +NYDD DDEP LR  RESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFN+L
Subjt:  APKSDLSKASRAKKLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTL

Query:  STERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSRMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYLE
        S ERKEELQLLGAQLAFSSTDTKHGFNFAKHVVK YPYS SAWNCYYKVSSRMTNRDSRHCKLLNSMQ KYKDCAPPY+IAGHQFTAISHHQDAARKYLE
Subjt:  STERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSRMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYLE

Query:  AYKLLPDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNVK
        AYKLLPDSPL+NLCVG+ALINLTLGFRLQNKHQC+AQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN++ K
Subjt:  AYKLLPDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNVK

Query:  HQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF
        HQ  VYCDLRREAAYNLHLIYKESGA+DLARQVLKD+CTF
Subjt:  HQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF

XP_023525239.1 general transcription factor 3C polypeptide 3 [Cucurbita pepo subsp. pepo]0.0e+0091.36Show/hide
Query:  MEEEGNKNSGNEEVPGCAV-GDLGEEEDREDREEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAE
        MEEEGN  S NEEVPGC V G    E + EDR EEEEEEEEEEE+EEEVEDEGEDD EEEDGY FKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAE
Subjt:  MEEEGNKNSGNEEVPGCAV-GDLGEEEDREDREEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAE

Query:  KKRKALANGQSERSAKRGRVEDITGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDSY
        KKRKALANGQSERSAKRGRVEDI+GASF+EIMEAMNYGSRRKR++PKKRGRRKGSK+KLN DVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPD+PDSY
Subjt:  KKRKALANGQSERSAKRGRVEDITGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDSY

Query:  HTLGLVYHAIGDDVKAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGNVE
        HTLGLVY+AIGDDVKAMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLF+RASLYLERGDCQKAAETYDQIHQ  + NVE
Subjt:  HTLGLVYHAIGDDVKAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGNVE

Query:  ALMTGAKLYQKCGHLERGTCILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFANLE
        ALMTGAKLYQ+CGHLER  CILE+YIKGHP EADLDVVDLLAS YMGSKEFSKALE IEHAD VYCA  ELPLNLTAKAGICH+HLGN EKAECLFANL 
Subjt:  ALMTGAKLYQKCGHLERGTCILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFANLE

Query:  REAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAISL
        REA  + SNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGN+GILYLKIAQCYSSTN+RA+AIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAISL
Subjt:  REAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAISL

Query:  LSPPKDSNSTSSSSSKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVAP
        LSPPKDSNSTSSSSSK KPWWLNE+VKLKLCHI+RTKGMLENFV+AIF LVRESLYIETL EKIKVNKKKLPKRVLLERVKVLDGR+TG LFRGFRPVAP
Subjt:  LSPPKDSNSTSSSSSKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVAP

Query:  KSDLSKASRAKKLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLST
        KSDLSKASRAKKLLQKRERIKEEKKA+ALAAGV +NYDD DDEP LR  RESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFN+LS 
Subjt:  KSDLSKASRAKKLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLST

Query:  ERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSRMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYLEAY
        ERKEELQLLGAQLAFSSTDTKHGFNFAKHVVK YPYS SAWNCYYKVSSRMTNRDSRHCKLLNSMQ KYKDCAPPY+IAGHQFTAISHHQDAARKYLEAY
Subjt:  ERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSRMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYLEAY

Query:  KLLPDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNVKHQ
        KLLPDSPL+NLCVG+ALINLTLGFRLQNKHQC+AQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN++ KHQ
Subjt:  KLLPDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNVKHQ

Query:  KLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF
          VYCDLRREAAYNLHLIYKESGA+DLARQVLKD+CTF
Subjt:  KLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF

XP_038879313.1 general transcription factor 3C polypeptide 3 [Benincasa hispida]0.0e+0090.52Show/hide
Query:  MEEEGNKNSGNEEVPG--CAVGDLGEEEDREDREEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALA
        ME+EGN  S NEEVPG   +V   G+E      E+ EEEEEEEEE EEEVEDEGEDDIEEEDGY FKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALA
Subjt:  MEEEGNKNSGNEEVPG--CAVGDLGEEEDREDREEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALA

Query:  EKKRKALANGQSERSAKRGRVEDITGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDS
        EKKRKALANGQSER+AKRGRVEDI GASFDEI+EAMNYGSRRK KEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQ+EKA+S+L QVVLQAPDLPDS
Subjt:  EKKRKALANGQSERSAKRGRVEDITGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDS

Query:  YHTLGLVYHAIGDDVKAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGNV
        YHTLGLVY+AIGDDVKAMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDI+LLFHRASLYLERGDC+KAAETYDQIHQQCLGNV
Subjt:  YHTLGLVYHAIGDDVKAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGNV

Query:  EALMTGAKLYQKCGHLERGTCILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFANL
        EALMTGAKLYQKCGHLER  CILEDYIK HP EADLDVVDLLAS YMG+KEF KALERIEHADEVYCAG ELPL LT K GICH HLGN+EKAECLFANL
Subjt:  EALMTGAKLYQKCGHLERGTCILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFANL

Query:  EREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAIS
          E A D+SNLMIEVADSLLSLKHYNLALKYYLM EEVNAGGNMGILYLKIAQCY STN+RAQAIVFFYKVLQHLEDNINARLTLASLLLEEAR+EEAIS
Subjt:  EREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAIS

Query:  LLSPPKDSNSTSSSSSKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVA
        LLSPPKDSN TSSSSSKLKPWW NEKVKLKLCHIY+T+GMLE+FVE IF LVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRE+G+LFRGFRPVA
Subjt:  LLSPPKDSNSTSSSSSKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVA

Query:  PKSDLSKASRAKKLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLS
        PKSDLSKASRAK+LLQKRERIKEEKKAKALAAGV+VNYDDLDDEP LR HRESPLPNLLK+EEYHNLIVDLCKALASLGRCSEALEIISLTLKLA N+LS
Subjt:  PKSDLSKASRAKKLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLS

Query:  TERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSRMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYLEA
         ERKEELQLLGAQLAFSSTDT HGFNFAKHVVK YPYSISAWNCYYKV+S +TNRDSRHCKLLNSMQAKYKDCAPPY+IAGHQFT ISHHQDAARKYLEA
Subjt:  TERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSRMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYLEA

Query:  YKLLPDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNVKH
        YK++PDSPL+NLCVGS+LINL LGFRLQNKHQC+AQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRN+KH
Subjt:  YKLLPDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNVKH

Query:  QKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF
        QK VYCDLRREAAYNLHLIYKESGA+DLARQVLKDHCTF
Subjt:  QKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF

TrEMBL top hitse value%identityAlignment
A0A1S3BHB9 general transcription factor 3C polypeptide 3 isoform X10.0e+0089.33Show/hide
Query:  MEEEGNKNSGNEEVPGCAVGDLGEEEDREDREEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEK
        ME+EGN+ S +EEVPG  +  LG E++ E    + EEEEEEEE EEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFS+QPYKKFERLEYEALAEK
Subjt:  MEEEGNKNSGNEEVPGCAVGDLGEEEDREDREEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEK

Query:  KRKALANGQSERSAKRGRVEDITGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDSYH
        KRKALANGQSER+AKRGRVED+ GASFDEI+EAMNYGSRRK KEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQ+EKAIS+L QVVLQAPDLPDSYH
Subjt:  KRKALANGQSERSAKRGRVEDITGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDSYH

Query:  TLGLVYHAIGDDVKAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGNVEA
        TLGLVY+AIGDDVKAMGFYMLAAHLMP+DSSLWKLLFSWSI+RGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDC+KAAETYDQIHQQCLGNVEA
Subjt:  TLGLVYHAIGDDVKAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGNVEA

Query:  LMTGAKLYQKCGHLERGTCILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFANLER
        LMTGAKLYQKCGHLER  CILEDYIK HP+EADLDVVDLLAS YMGSKEFSKALE IEHAD VYCAG ELPLNLTAKAGICH HLGN+EKAECLFANL R
Subjt:  LMTGAKLYQKCGHLERGTCILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFANLER

Query:  EAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAISLL
        E   D+SNLMIEVADSLLSLKHY+ ALKYYLMSEEVNAG NMGILY K+A+CY STN++ QAIVFFYKVLQH+EDNINARLTLASLLLEEAR+EEAISLL
Subjt:  EAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAISLL

Query:  SPPKDSNSTSSSSSKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVAPK
        SPPKDSN TSSSS KLKPWWLNEKVKLKLCHIYRT+G+LENFVE IF LVRESLYIETLQEKIKVNKKKLP+RVLLERVKVLDGRETG+LFRGFRPVAPK
Subjt:  SPPKDSNSTSSSSSKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVAPK

Query:  SDLSKASRAKKLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLSTE
        SDL+KASRAK+LLQKR+RIKEEKKAK LAAGV V+YDDLDDEP LR HRESPLPNLLK+EE+H LIVDLCKALASLGRCSEALEIISLTLKLAFN+LSTE
Subjt:  SDLSKASRAKKLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLSTE

Query:  RKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSRMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYLEAYK
        RKEELQLLGAQLAFSST T HGFNFAKHVVK YPYSISAWNCYYKV+S +TNRDSRHCKLLNSMQAKYKDCAPPY+IAGHQFT ISHHQDAARKYLEAYK
Subjt:  RKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSRMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYLEAYK

Query:  LLPDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNVKHQK
        ++PDSPL+NLCVGS+LINL LGFRLQNKHQC+AQGLAFLYKNLKLCDN+QEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRN+KHQ 
Subjt:  LLPDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNVKHQK

Query:  LVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF
         VYCDLRREAAYNLHLIYKESGA+DLARQVLKDHCTF
Subjt:  LVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF

A0A6J1DEF9 general transcription factor 3C polypeptide 3 isoform X20.0e+0088.84Show/hide
Query:  MEEEGNKNSGNEEVPGCAVGDLG-------------EEEDREDREEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYK
        ME+EGN+ S N+EVPGCAVG  G             EEE+ E+ E+EEEEEEEEEEEEEEVEDEGEDDIEEEDGY FKFKAGENPFDFVEGTDFSIQPYK
Subjt:  MEEEGNKNSGNEEVPGCAVGDLG-------------EEEDREDREEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYK

Query:  KFERLEYEALAEKKRKALANGQSERSAKRGRVEDITGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQ
        KFERLEYEALAEKKRKALAN QSER  KRGR+EDI GASF+EIMEAMNYGSRRK KEPK+RGRRKGSKKK+NR++TKLLGDATLCYAQGQYEKAISVL Q
Subjt:  KFERLEYEALAEKKRKALANGQSERSAKRGRVEDITGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQ

Query:  VVLQAPDLPDSYHTLGLVYHAIGDDVKAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETY
        VVLQAPDLPDSYHTLGLVY+AIGDDVKAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETY
Subjt:  VVLQAPDLPDSYHTLGLVYHAIGDDVKAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETY

Query:  DQIHQQCLGNVEALMTGAKLYQKCGHLERGTCILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGN
        DQIHQ+C+GNVEALMTGAKLYQKCGH ER  CILEDYIKGHP EADLDVVDLLAS YMGSKEFSKALE IEHAD+VYCAG E+PLNL  KAGICHVHLGN
Subjt:  DQIHQQCLGNVEALMTGAKLYQKCGHLERGTCILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGN

Query:  IEKAECLFANLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLL
        IEKAE LFANL R+ A D+S+ +IE ADSLLSLKH+NLALKYYLMSEEVNAGG MGI+YLKIAQCY STN+RA+AIVFFYKVLQ LEDNINARLTLASLL
Subjt:  IEKAECLFANLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLL

Query:  LEEAREEEAISLLSPPKDSNSTSSSSSKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRET
        LEEAREEEAISLLSPPKDSNS+SSSSSK KPWWLNEKVKLKLC+IYRTKGMLENFVE IFSLVRESLYIETL+EKIKVNKKKLP+RVLLERVKVLDGRET
Subjt:  LEEAREEEAISLLSPPKDSNSTSSSSSKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRET

Query:  GSLFRGFRPVAPKSDLSKASRAKKLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIIS
        GSLFRGFRPVAPKSDLSKASRAK+LLQKRERIKEEKKA+ALAAGV V+YDD+DDEP LR HRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIIS
Subjt:  GSLFRGFRPVAPKSDLSKASRAKKLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIIS

Query:  LTLKLAFNTLSTERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSRMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISH
        LTLKLAFN+LS ERKEELQLLGAQLAFSSTDTKHGFNFAKHVVK YPYS SAWNCYYKVSSRMT+RDSRHCKLLNS+QAKYKDCAPP++IAGHQF AISH
Subjt:  LTLKLAFNTLSTERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSRMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISH

Query:  HQDAARKYLEAYKLLPDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP
        HQ+AA+KYLEAYKLLPDSPL+NLCVG+ALINL LG RLQNKHQC+AQGLAFLY NLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP
Subjt:  HQDAARKYLEAYKLLPDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP

Query:  ELFGENRNVKHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF
        ++FGENRN+KH+K VYCDLRREAAYNLHLIYK+SGA+DLARQVLKDHCTF
Subjt:  ELFGENRNVKHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF

A0A6J1DGK2 general transcription factor 3C polypeptide 3 isoform X10.0e+0088.75Show/hide
Query:  MEEEGNKNSGNEEVPGCAVGDLG-------------EEEDREDREEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYK
        ME+EGN+ S N+EVPGCAVG  G             EEE+ E+ E+EEEEEEEEEEEEEEVEDEGEDDIEEEDGY FKFKAGENPFDFVEGTDFSIQPYK
Subjt:  MEEEGNKNSGNEEVPGCAVGDLG-------------EEEDREDREEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYK

Query:  KFERLEYEALAEKKRKALANGQSERSAKRGRVEDITGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQ
        KFERLEYEALAEKKRKALAN QSER  KRGR+EDI GASF+EIMEAMNYGSRRK KEPK+RGRRKGSKKK+NR++TKLLGDATLCYAQGQYEKAISVL Q
Subjt:  KFERLEYEALAEKKRKALANGQSERSAKRGRVEDITGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQ

Query:  VVLQAPDLPDSYHTLGLVYHAIGDDVKAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETY
        VVLQAPDLPDSYHTLGLVY+AIGDDVKAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETY
Subjt:  VVLQAPDLPDSYHTLGLVYHAIGDDVKAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETY

Query:  DQIHQQCLGNVEALMTGAKLYQKCGHLERGTCILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGN
        DQIHQ+C+GNVEALMTGAKLYQKCGH ER  CILEDYIKGHP EADLDVVDLLAS YMGSKEFSKALE IEHAD+VYCAG E+PLNL  KAGICHVHLGN
Subjt:  DQIHQQCLGNVEALMTGAKLYQKCGHLERGTCILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGN

Query:  IEKAECLFANLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNM-GILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASL
        IEKAE LFANL R+ A D+S+ +IE ADSLLSLKH+NLALKYYLMSEEVNAGG M GI+YLKIAQCY STN+RA+AIVFFYKVLQ LEDNINARLTLASL
Subjt:  IEKAECLFANLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNM-GILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASL

Query:  LLEEAREEEAISLLSPPKDSNSTSSSSSKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRE
        LLEEAREEEAISLLSPPKDSNS+SSSSSK KPWWLNEKVKLKLC+IYRTKGMLENFVE IFSLVRESLYIETL+EKIKVNKKKLP+RVLLERVKVLDGRE
Subjt:  LLEEAREEEAISLLSPPKDSNSTSSSSSKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRE

Query:  TGSLFRGFRPVAPKSDLSKASRAKKLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEII
        TGSLFRGFRPVAPKSDLSKASRAK+LLQKRERIKEEKKA+ALAAGV V+YDD+DDEP LR HRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEII
Subjt:  TGSLFRGFRPVAPKSDLSKASRAKKLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEII

Query:  SLTLKLAFNTLSTERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSRMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAIS
        SLTLKLAFN+LS ERKEELQLLGAQLAFSSTDTKHGFNFAKHVVK YPYS SAWNCYYKVSSRMT+RDSRHCKLLNS+QAKYKDCAPP++IAGHQF AIS
Subjt:  SLTLKLAFNTLSTERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSRMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAIS

Query:  HHQDAARKYLEAYKLLPDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPI
        HHQ+AA+KYLEAYKLLPDSPL+NLCVG+ALINL LG RLQNKHQC+AQGLAFLY NLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPI
Subjt:  HHQDAARKYLEAYKLLPDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPI

Query:  PELFGENRNVKHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF
        P++FGENRN+KH+K VYCDLRREAAYNLHLIYK+SGA+DLARQVLKDHCTF
Subjt:  PELFGENRNVKHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF

A0A6J1GA07 general transcription factor 3C polypeptide 30.0e+0091.48Show/hide
Query:  MEEEGNKNSGNEEVPGCAV-GDLGEEEDREDREEEEEEEEEEEEE-EEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALA
        MEEEGN  S NEEVPGC V G    E + EDREEEEEEEEEEEEE EEEVEDEGEDD EEEDGY FKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALA
Subjt:  MEEEGNKNSGNEEVPGCAV-GDLGEEEDREDREEEEEEEEEEEEE-EEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALA

Query:  EKKRKALANGQSERSAKRGRVEDITGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDS
        EKKRKALA+GQSERSAKRGRVEDI+GASF+EIMEAMNYGSRRKR++PKKRGRRKGSK+KLN DVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPD+PDS
Subjt:  EKKRKALANGQSERSAKRGRVEDITGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDS

Query:  YHTLGLVYHAIGDDVKAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGNV
        YHTLGLVY+AIGDDVKAMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLF+RASLYLERGDCQKAAETYDQIHQ  + NV
Subjt:  YHTLGLVYHAIGDDVKAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGNV

Query:  EALMTGAKLYQKCGHLERGTCILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFANL
        EALMTGAKLYQKCGHLER  CILE+YIKGHP EADLDVVDLLAS YMGSKEFSKALE IEHAD VYCA  ELPLNLTAKAGICH+HLGN EKAECLFANL
Subjt:  EALMTGAKLYQKCGHLERGTCILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFANL

Query:  EREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAIS
         REA  + SNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGN+GILYLKIAQCYSSTN+RA+AIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAIS
Subjt:  EREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAIS

Query:  LLSPPKDSNSTSSSSSKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVA
        LLSPPKDSNSTSSSSSK KPWWLNE+VKLKLCHI+RTKGMLENFVEAIF LVRESLYIETL EKIKVNKKKLPKRVLLERVKVLDGR+TG LFRGFRPVA
Subjt:  LLSPPKDSNSTSSSSSKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVA

Query:  PKSDLSKASRAKKLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLS
        PKSDLSKASRAKKLLQKRERI+EEKKA+ALAAGV +NYDD DDEP LR  RESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFN+LS
Subjt:  PKSDLSKASRAKKLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLS

Query:  TERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSRMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYLEA
         ERKEELQLLGAQLAFSSTDTKHGFNFAKHVVK YPYS SAWNCYYKVSSRMTNRDSRHCKLLNSMQ KYKDCAPPY+IAGHQFTAISHHQDAARKYLEA
Subjt:  TERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSRMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYLEA

Query:  YKLLPDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNVKH
        YKLLPDSPL+NLCVG+ALINLTLGFRLQNKHQC+AQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN++ KH
Subjt:  YKLLPDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNVKH

Query:  QKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF
        Q  VYCDLRREAAYNLHLIYKESGA+DLARQVLKD+CTF
Subjt:  QKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF

A0A6J1KFI7 general transcription factor 3C polypeptide 30.0e+0091.49Show/hide
Query:  MEEEGNKNSGNEEVPGCAV-GDLGEEEDREDREEEEEEEEEEEEE--EEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEAL
        MEEEGN  S NEEVPGC V G    E + EDREEEEEEEEEEEEE  EEEVEDEGEDD EEEDGY FKFKAGENPFDFVEGTDFSIQPYKKFERLEYEAL
Subjt:  MEEEGNKNSGNEEVPGCAV-GDLGEEEDREDREEEEEEEEEEEEE--EEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEAL

Query:  AEKKRKALANGQSERSAKRGRVEDITGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPD
        AEKKRKALANGQSERSAKRGRVEDI+GASF+EIMEAMNYGSRRKR++PKKRGRRKGSK+KLN DVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPD+PD
Subjt:  AEKKRKALANGQSERSAKRGRVEDITGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPD

Query:  SYHTLGLVYHAIGDDVKAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGN
        SYHTLGLVY+AIGDDVKAMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLF+RASLYLERGDCQKAAETYDQIHQ  + N
Subjt:  SYHTLGLVYHAIGDDVKAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGN

Query:  VEALMTGAKLYQKCGHLERGTCILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFAN
        VEALMTGAKLYQKCGHLER  CILE+YIKGHP EADLDVVDLLAS YMGSKEFSKALE IEHAD VYCA  ELPLNLTAKAGICH+HLGN EKAECLFAN
Subjt:  VEALMTGAKLYQKCGHLERGTCILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFAN

Query:  LEREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAI
        L REA  + SNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGN+GILYLKIAQCYSSTN+RA+AIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAI
Subjt:  LEREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAI

Query:  SLLSPPKDSNSTSSSSSKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPV
        SLLSPPKDSNSTSSSSSK KPWWLNE+VKLKLCHI+RTKGMLENFVEAIF LVRESLYIETL EKIKVNKKKLPKRVLLERVKVLDGR+TG LFRGFRPV
Subjt:  SLLSPPKDSNSTSSSSSKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPV

Query:  APKSDLSKASRAKKLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTL
        APKSDLSKASRAKKLLQKRERI+EEKKA+ALAAGV +NYDD DDEP LR  RESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFN+L
Subjt:  APKSDLSKASRAKKLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTL

Query:  STERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSRMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYLE
        S ERKEELQLLGAQLAFSSTDTKHGFNFAKHVVK YPYS SAWNCYYKVSSRMTNRDSRHCKLLNSMQ KYKDCAPPY+IAGHQFTAISHHQDAARKYLE
Subjt:  STERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSRMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISHHQDAARKYLE

Query:  AYKLLPDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNVK
        AYKLLPDSPL+NLCVG+ALINLTLGFRLQNKHQC+AQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN++ K
Subjt:  AYKLLPDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNVK

Query:  HQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF
        HQ  VYCDLRREAAYNLHLIYKESGA+DLARQVLKD+CTF
Subjt:  HQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF

SwissProt top hitse value%identityAlignment
O74458 Transcription factor tau subunit sfc41.0e-2722.93Show/hide
Query:  ANGQSERSAKRGRVE-DITGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQ-GQYEKAISVLSQVVLQAPDLPDSYHTLG
        ++ +S  S    R E DI    ++E ++A+  G R+ RK  K RGR   +    + +V ++L  A   +AQ G +++A  +  ++V    ++  ++  LG
Subjt:  ANGQSERSAKRGRVE-DITGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQ-GQYEKAISVLSQVVLQAPDLPDSYHTLG

Query:  LVYHAIGDD----VKAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDIN----LLFHRASLYLERGDCQKAAETYDQIHQQCL
          +   G+      K +  +M AAHL P+D  LW      S      DQA YC ++A+ A+P + +     +++R+ L  E G  +KAAE +  + Q   
Subjt:  LVYHAIGDD----VKAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDIN----LLFHRASLYLERGDCQKAAETYDQIHQQCL

Query:  GNVEALMTGAKLYQKCGHLERGTC----ILEDYIKGHPAEA------DLDVVDLLASFYMGSKEFSKALERI----------------------------
         N   L   A++Y K  H  R       I   Y   +PA        DL  ++L A   +   ++S  +  I                            
Subjt:  GNVEALMTGAKLYQKCGHLERGTC----ILEDYIKGHPAEA------DLDVVDLLASFYMGSKEFSKALERI----------------------------

Query:  EHADEVYCAGKE--------LPLNLTAKAGICHVHLGNIEKAECLFANLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYLM------SEEVNAGGNMG
        E   E   A +E        LP     K GI  +  G + +AE  F+ ++         ++ ++A + + ++  +LAL+Y+++      ++ +    NMG
Subjt:  EHADEVYCAGKE--------LPLNLTAKAGICHVHLGNIEKAECLFANLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYLM------SEEVNAGGNMG

Query:  ILYLKI------AQCYSS----TNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAISLLSPPKDSNSTSSSSSKLKPWWLNEKVKL----KL
        + YL++       QC  +     N    A++   ++   L+DN +A L + + + E+ R    I+ L   +  N     +   + +  N+KV      K 
Subjt:  ILYLKI------AQCYSS----TNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAISLLSPPKDSNSTSSSSSKLKPWWLNEKVKL----KL

Query:  CHIYRTKGMLENF-VEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVAPKSDLSKASRAKK-LLQKRERIKEEKKAKA
          I R+K     F +       R    ++ L++ +K  K++     L E + +        L   F  +       K +RA+  LL +R R         
Subjt:  CHIYRTKGMLENF-VEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVAPKSDLSKASRAKK-LLQKRERIKEEKKAKA

Query:  LAAG------VEVNYDDLDDEPTLR-GHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLSTERKEELQLLGAQLAFSSTDTK
                      Y DLD +  LR G+  +     +  + ++ L V+    L  +G   +A ++++  +        T +++ L+      +  + D +
Subjt:  LAAG------VEVNYDDLDDEPTLR-GHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLSTERKEELQLLGAQLAFSSTDTK

Query:  HG----------FNFAKHVVKLYPYSIS-AWNCYYK-VSSRMTNRDSRHCKLLNSMQ------------------AKYKDCAPPYLIA--GHQFTAISHH
                    F F +   +L+   +S  + C    V S       R  KL++ +                   A       P L+   GH        
Subjt:  HG----------FNFAKHVVKLYPYSIS-AWNCYYK-VSSRMTNRDSRHCKLLNSMQ------------------AKYKDCAPPYLIA--GHQFTAISHH

Query:  QDAARKYLEAYKLLPDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDN-----SQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKD
          A   Y  A+ + PD P+ NL +G A ++  +     N+H  I QG  FLY+   L  N      QEALYN+ +AYH IGL   AV YYE VL      
Subjt:  QDAARKYLEAYKLLPDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDN-----SQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKD

Query:  CPIPELFGENRNVKHQKL-VYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF
           P   G+        +    D   EAAYNL LIY  SG + LA Q+   +  F
Subjt:  CPIPELFGENRNVKHQKL-VYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF

P33339 Transcription factor tau 131 kDa subunit1.4e-1620.32Show/hide
Query:  EEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQSERSAKRGRVEDITGASFDEIMEAMN
        ++E++ +  E E  D G+ + E+E+         ++     E  D  +    +    EY   +E+    LA             ED   A    I EA N
Subjt:  EEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQSERSAKRGRVEDITGASFDEIMEAMN

Query:  YGSRRKRKEPK-KRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDSYHTLGLVYHAIGDDVKAMGFYMLAAHLMPRDSSLWK
        +  ++K+K  K K   R+  ++ L+ +V +LL  A   + +   + A  + ++V+ +      +Y TLG +Y   G        + LAAHL   D   WK
Subjt:  YGSRRKRKEPK-KRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDSYHTLGLVYHAIGDDVKAMGFYMLAAHLMPRDSSLWK

Query:  LLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQ-------------CLGNVEALMTGAKLYQKC--GHLERGTC
        ++   S D   + QA YC S+ I   P +   ++ R+ LY + G   +A + + +++                  + + +    +LY K    ++ER   
Subjt:  LLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQ-------------CLGNVEALMTGAKLYQKC--GHLERGTC

Query:  IL------------------EDYIKGHPAEADLD------------------------VVDLLASFYM--------GSKEFSKALERIEHA---------
        IL                  ED  +  P E D D                         +++LA  ++        G K   K    I+           
Subjt:  IL------------------EDYIKGHPAEADLD------------------------VVDLLASFYM--------GSKEFSKALERIEHA---------

Query:  ----------------DEVYCAGKE----LPLNLTAKAGICHVHLGNIEKAECLFANLEREAAGDNSNLMIEVADSLLSLKHYNLALKYY---LMSEEVN
                        D +  A KE    +P+++  + G+  ++  N+ +A   F  L  E   D ++L  E A +L   + Y  A+ ++   L  EE  
Subjt:  ----------------DEVYCAGKE----LPLNLTAKAGICHVHLGNIEKAECLFANLEREAAGDNSNLMIEVADSLLSLKHYNLALKYY---LMSEEVN

Query:  AGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLL------------------LEEAREEEAISLLSPPKDSNSTSSSSSKLKPW
               ++  +A+CY        A  F+   ++   D+++ R++LA +                   + + + +E +  +S  K SN TS  SSK    
Subjt:  AGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLL------------------LEEAREEEAISLLSPPKDSNSTSSSSSKLKPW

Query:  WLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDG-RETGSLFRGFRPVAPKSDLSKA-------SRAKK
         L E  K +    +R K       E            E ++ + ++  K + K   +++ ++  G  E          V+   D+  +       SR++K
Subjt:  WLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDG-RETGSLFRGFRPVAPKSDLSKA-------SRAKK

Query:  ---LLQKRERIKEE-----KKAKALAAGVEVNYDDL-DDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLSTERK
           +L++ ++   E     ++   LA G  V    L ++  TL    E      L  E++  L ++L   +A      + L ++    ++       ER 
Subjt:  ---LLQKRERIKEE-----KKAKALAAGVEVNYDDL-DDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLSTERK

Query:  EELQLLGAQLAFSSTDTK----------HGFNFAKHVVKLYPYSI----SAWNCYYKV--------------SSRMTNRDSRHCKLLNSMQAKYKDCAPP
        + ++ +   +     D +          + F F + V++++ YS+    S+ N                   S R     +    + N         + P
Subjt:  EELQLLGAQLAFSSTDTK----------HGFNFAKHVVKLYPYSI----SAWNCYYKV--------------SSRMTNRDSRHCKLLNSMQAKYKDCAPP

Query:  YLIAGHQFTAISHHQDAARKYLEAYKLL-------PDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDN------SQEALYNIARAYH
        YL   +     S     +R +L A + L       PD P+VNL +G + I+  +      +H  I  GL +LY+  K+  +       QEA YN+ RA+H
Subjt:  YLIAGHQFTAISHHQDAARKYLEAYKLL-------PDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDN------SQEALYNIARAYH

Query:  HIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNVKHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDH
         IGLV++A+ YY +VL  Y                         L++ AAYN  +IY++SG V+LA  +++ +
Subjt:  HIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNVKHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDH

Q9Y5Q9 General transcription factor 3C polypeptide 33.2e-5323.52Show/hide
Query:  GDLGEEEDREDREEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQSERSAKRGRV
        G +  EE    REE +  E++  +E+ ++  E   D  E    +          D  EG + S    K   ++    L E +       + E   +    
Subjt:  GDLGEEEDREDREEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQSERSAKRGRV

Query:  EDITGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDSYHTLGLVYHAIGDDVKAMGFY
         +   A    ++E +        +E KK  + K  + KL R +  L+G+A + +A+G+ E+AI +  +++ QAP   + + TL ++Y   GD  K++ F 
Subjt:  EDITGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDSYHTLGLVYHAIGDDVKAMGFY

Query:  MLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGN-----VEALMTGAKLYQKCGHL
        ++AAHL P D+  W  L   S+++ +I QA +C +KA+K EP ++  L+ R+SLY + GD + A + Y +I      +     ++     AK Y +   +
Subjt:  MLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGN-----VEALMTGAKLYQKCGHL

Query:  ERGTCILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERI---------------------EHADEVYCAGKE-LPLNLTAKAGICHVHLGNIEKAE
             I+++    H     ++ V++ A  Y+ +K++ KALE I                     +  + V C   + +P+++T K  +C VHL  +E   
Subjt:  ERGTCILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERI---------------------EHADEVYCAGKE-LPLNLTAKAGICHVHLGNIEKAE

Query:  CLFANLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAR
         L   L  +   D  +L ++VA++ L +  YN AL   L +   +   N+ +++L+ A+C  +     +A   + KV+     +++AR++L++L  +  +
Subjt:  CLFANLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAR

Query:  EEEAISLLSPPKDSNSTSSSSSKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFR
         E+A+  L P  D ++ +  ++  +      K+ L    +  ++G +  +V+ + +++   L       K+ +N+          +V ++   ++G    
Subjt:  EEEAISLLSPPKDSNSTSSSSSKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFR

Query:  GFRPVAPKSDLSKASRAKKLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKL
                  L K SR  K+    ++      AKA+ A                      L ++L  +++ NL++    +L  L R  EA  ++  +L+ 
Subjt:  GFRPVAPKSDLSKASRAKKLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKL

Query:  AFNTLSTERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSRMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISHHQDAA
               ++++EL+  G   A    + +  +N+ + +V         WN + +V+  M ++D RH +    +  K  +     ++ GH        + A 
Subjt:  AFNTLSTERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSRMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISHHQDAA

Query:  RKYLEAYKLLPDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGE
         +Y++A++  PD PL + C+G   I++     +  +H  I QG +FL + L L    QE+ YN+ R  H +GL+ LA+ YY+K L        +P L   
Subjt:  RKYLEAYKLLPDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGE

Query:  NRNVKHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCT
           V+  +L   DLRR+ AYNL LIY+ SG   +A+ +L  +C+
Subjt:  NRNVKHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCT

Arabidopsis top hitse value%identityAlignment
AT1G17680.1 tetratricopeptide repeat (TPR)-containing protein2.3e-18742.9Show/hide
Query:  EEGNKNSGNEEVPG---CAVGDLGEEEDREDREEEEEEE--EEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEAL
        +EGN  S  EE P    C    LG + + +D++   +EE   ++++++ + +DEG++  EE+D     F+AG  P                FER EYEAL
Subjt:  EEGNKNSGNEEVPG---CAVGDLGEEEDREDREEEEEEE--EEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEAL

Query:  AEKKRKALANGQSERSAKRGRVEDITGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPD
        AE+KRKALA+ Q   S        + G      ME M+ G RRK ++ KK+GRR GSKK++  D+ K   +A   +A G+  +A+ +L +V+ QAP    
Subjt:  AEKKRKALANGQSERSAKRGRVEDITGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPD

Query:  SYHTLGLVYHAIG-DDVKAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLG
        +Y+ L  V   +G  +  +     +AA++    S  WKLL+    ++ +I  A    SKAI+A+PDDI L +  A + L  G  ++AAET++QI ++C  
Subjt:  SYHTLGLVYHAIG-DDVKAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLG

Query:  NVEALMTGAKLYQKCGHLERGTCILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFA
         +EAL  G + + K G  ER   ILED+IK H +E   DV+DLLAS +M      +AL+ I    ++Y  GKEL  +L  +  ICHVHL  +E+AE + +
Subjt:  NVEALMTGAKLYQKCGHLERGTCILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFA

Query:  NLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYL--MSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREE
         L +EA  ++  L+  +AD L ++ +++ ALKYY+  +SE VN     G L++KIA+CY S  +R QAIVF+YK L  L D ++ R+TLASLLLE+ + +
Subjt:  NLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYL--MSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREE

Query:  EAISLLSPPKDSNSTSSSSSKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGF
        EA+ +LSPP++ +     ++KLK WW N K+++ LC IY ++GMLE+F      LV + ++  T++ K K       + VL E  +    R         
Subjt:  EAISLLSPPKDSNSTSSSSSKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGF

Query:  RPVAPKSDLSKASRAKKLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAF
            PK    K  + +  L +  RI+E        A ++ + +D+  E             ++KDEEYH L VDLCKALASL R  EALEI++L  +L  
Subjt:  RPVAPKSDLSKASRAKKLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAF

Query:  NTLSTERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSRMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISHHQDAARK
          L  E K+ELQ LGA+++  + D K  F+  + V++ +PY ++AWNCYY V SR+  R S   K ++ +++KY+DC PP LIAGH FT  S HQDAAR+
Subjt:  NTLSTERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSRMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISHHQDAARK

Query:  YLEAYKLLPDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENR
        YLEAYKL+P+SPL+NLCVG+ALINL LGFRL+N+H+C+AQG AFLY NL++C NSQEALYN+ARAY H+GLVTLA +YYEKVLA Y+KD  +P+L  E+ 
Subjt:  YLEAYKLLPDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENR

Query:  NV-KHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF
         V + +K V CDLR+EAA+NLHLIYK SGA DLARQVLKDHCTF
Subjt:  NV-KHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF

AT1G17680.2 tetratricopeptide repeat (TPR)-containing protein2.3e-18742.9Show/hide
Query:  EEGNKNSGNEEVPG---CAVGDLGEEEDREDREEEEEEE--EEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEAL
        +EGN  S  EE P    C    LG + + +D++   +EE   ++++++ + +DEG++  EE+D     F+AG  P                FER EYEAL
Subjt:  EEGNKNSGNEEVPG---CAVGDLGEEEDREDREEEEEEE--EEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEAL

Query:  AEKKRKALANGQSERSAKRGRVEDITGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPD
        AE+KRKALA+ Q   S        + G      ME M+ G RRK ++ KK+GRR GSKK++  D+ K   +A   +A G+  +A+ +L +V+ QAP    
Subjt:  AEKKRKALANGQSERSAKRGRVEDITGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPD

Query:  SYHTLGLVYHAIG-DDVKAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLG
        +Y+ L  V   +G  +  +     +AA++    S  WKLL+    ++ +I  A    SKAI+A+PDDI L +  A + L  G  ++AAET++QI ++C  
Subjt:  SYHTLGLVYHAIG-DDVKAMGFYMLAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLG

Query:  NVEALMTGAKLYQKCGHLERGTCILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFA
         +EAL  G + + K G  ER   ILED+IK H +E   DV+DLLAS +M      +AL+ I    ++Y  GKEL  +L  +  ICHVHL  +E+AE + +
Subjt:  NVEALMTGAKLYQKCGHLERGTCILEDYIKGHPAEADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFA

Query:  NLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYL--MSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREE
         L +EA  ++  L+  +AD L ++ +++ ALKYY+  +SE VN     G L++KIA+CY S  +R QAIVF+YK L  L D ++ R+TLASLLLE+ + +
Subjt:  NLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYL--MSEEVNAGGNMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREE

Query:  EAISLLSPPKDSNSTSSSSSKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGF
        EA+ +LSPP++ +     ++KLK WW N K+++ LC IY ++GMLE+F      LV + ++  T++ K K       + VL E  +    R         
Subjt:  EAISLLSPPKDSNSTSSSSSKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLVRESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGF

Query:  RPVAPKSDLSKASRAKKLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAF
            PK    K  + +  L +  RI+E        A ++ + +D+  E             ++KDEEYH L VDLCKALASL R  EALEI++L  +L  
Subjt:  RPVAPKSDLSKASRAKKLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAF

Query:  NTLSTERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSRMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISHHQDAARK
          L  E K+ELQ LGA+++  + D K  F+  + V++ +PY ++AWNCYY V SR+  R S   K ++ +++KY+DC PP LIAGH FT  S HQDAAR+
Subjt:  NTLSTERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSRMTNRDSRHCKLLNSMQAKYKDCAPPYLIAGHQFTAISHHQDAARK

Query:  YLEAYKLLPDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENR
        YLEAYKL+P+SPL+NLCVG+ALINL LGFRL+N+H+C+AQG AFLY NL++C NSQEALYN+ARAY H+GLVTLA +YYEKVLA Y+KD  +P+L  E+ 
Subjt:  YLEAYKLLPDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENR

Query:  NV-KHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF
         V + +K V CDLR+EAA+NLHLIYK SGA DLARQVLKDHCTF
Subjt:  NV-KHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAGGAAGGGAATAAAAATTCTGGCAATGAAGAGGTTCCTGGTTGTGCTGTAGGCGATTTAGGAGAGGAGGAGGATAGAGAGGATAGAGAGGAGGAGGAGGAAGA
GGAAGAGGAAGAGGAAGAAGAAGAGGAAGAAGTGGAGGATGAAGGAGAAGATGATATTGAAGAAGAAGATGGTTACACTTTCAAATTTAAGGCCGGAGAAAATCCTTTTG
ATTTTGTTGAAGGGACGGATTTTAGCATCCAACCATATAAAAAATTTGAGCGTCTTGAATATGAAGCTCTTGCTGAGAAAAAACGAAAAGCTCTTGCAAATGGTCAGAGC
GAGAGATCTGCAAAGAGGGGCAGGGTAGAAGATATTACTGGTGCAAGCTTTGATGAAATTATGGAAGCTATGAATTATGGATCTAGGAGGAAGCGGAAGGAGCCTAAAAA
AAGAGGTAGACGGAAGGGATCAAAGAAAAAACTTAATCGTGATGTTACAAAGTTGCTTGGTGATGCAACCTTATGTTATGCTCAAGGCCAGTATGAGAAGGCTATATCTG
TATTGAGTCAAGTTGTTCTGCAAGCCCCAGATTTACCTGATTCGTACCATACGCTTGGACTTGTTTATCATGCAATTGGCGATGATGTAAAAGCCATGGGATTCTACATG
CTTGCTGCACATTTAATGCCAAGAGATTCATCTCTCTGGAAACTGCTATTTTCATGGTCGATTGACCGAGGTGACATTGATCAAGCAAGTTACTGTCTTTCTAAAGCAAT
TAAAGCAGAGCCTGATGATATTAATTTATTATTTCATCGTGCATCACTCTATCTTGAGCGTGGAGATTGTCAAAAGGCAGCTGAAACATATGACCAAATTCATCAACAAT
GTCTAGGAAATGTTGAAGCACTAATGACAGGAGCGAAGCTGTACCAAAAATGCGGTCATCTGGAACGTGGAACTTGCATTCTTGAGGACTACATTAAAGGACATCCAGCT
GAAGCTGATTTAGATGTGGTTGATCTTCTAGCTTCTTTTTATATGGGAAGTAAAGAATTCAGCAAAGCTCTTGAGCGCATTGAGCATGCGGATGAGGTGTACTGTGCAGG
AAAAGAGCTACCTTTAAACTTGACAGCTAAAGCAGGAATCTGCCACGTCCATCTTGGAAATATTGAAAAGGCAGAGTGCCTCTTTGCTAATTTGGAACGGGAAGCTGCCG
GTGATAACTCAAATTTGATGATTGAAGTTGCAGACTCGTTGCTGAGTCTTAAGCACTATAACTTGGCATTGAAGTATTATCTCATGTCAGAAGAAGTAAATGCTGGAGGG
AACATGGGGATTTTATACCTAAAAATTGCCCAATGTTACTCATCAACTAATGATAGAGCACAGGCAATTGTGTTCTTTTACAAAGTGCTTCAACATCTTGAAGATAACAT
TAATGCCCGATTAACTTTGGCTTCCCTCCTCCTTGAGGAAGCTAGAGAAGAAGAAGCCATTTCATTACTATCTCCTCCAAAAGATTCAAACTCAACTAGCTCATCTTCCA
GCAAACTCAAACCTTGGTGGCTCAATGAGAAAGTAAAACTGAAGCTTTGCCACATATACAGAACTAAAGGAATGCTCGAGAACTTCGTTGAGGCAATCTTTTCTTTGGTC
CGTGAGTCCTTATATATTGAGACTCTTCAAGAAAAGATTAAAGTGAACAAGAAGAAGCTTCCAAAGAGAGTTTTGCTTGAAAGAGTCAAAGTACTAGATGGACGTGAAAC
TGGTAGCCTATTTCGTGGCTTCAGACCCGTAGCTCCTAAATCAGATTTATCAAAAGCTTCCAGAGCAAAGAAATTGCTTCAAAAGAGGGAAAGAATCAAGGAAGAAAAGA
AGGCCAAAGCTCTGGCTGCTGGAGTCGAGGTGAACTATGATGATTTGGATGATGAGCCAACGCTACGGGGGCACCGAGAATCTCCCCTGCCTAATCTTCTGAAGGATGAA
GAATATCATAATCTTATTGTTGATCTGTGCAAGGCATTGGCTTCGTTGGGAAGATGTTCTGAAGCTTTAGAGATAATAAGTTTAACTTTAAAGTTGGCTTTTAACACATT
ATCCACAGAAAGGAAGGAGGAACTCCAGTTACTCGGAGCTCAATTAGCATTCAGCTCAACTGATACCAAGCATGGATTCAACTTTGCAAAGCACGTTGTTAAGCTGTATC
CTTACAGCATCTCTGCTTGGAACTGCTATTATAAAGTATCTTCAAGAATGACGAACCGGGATTCAAGGCATTGCAAGCTTCTGAATAGCATGCAAGCCAAATACAAAGAT
TGTGCACCGCCCTATCTCATTGCCGGGCATCAATTTACCGCCATTAGCCATCATCAAGATGCTGCAAGGAAATATCTTGAAGCTTATAAATTATTGCCGGATAGTCCCCT
AGTTAACTTGTGTGTTGGATCGGCGTTAATCAACTTAACGCTTGGATTTCGTCTTCAAAACAAGCATCAGTGTATCGCGCAAGGCTTAGCATTCCTCTACAAAAATTTGA
AGCTTTGTGATAACAGCCAGGAAGCCTTATACAACATAGCTCGAGCGTATCATCACATTGGACTCGTGACGCTGGCGGTTACATATTACGAAAAGGTGCTCGCAACTTAC
CAGAAGGATTGCCCCATCCCAGAACTTTTTGGCGAGAATCGAAACGTTAAGCATCAGAAATTAGTCTACTGTGATCTACGCAGAGAAGCAGCTTACAATTTGCATCTTAT
ATATAAAGAGAGCGGAGCTGTTGATCTTGCTAGGCAAGTCCTAAAAGATCATTGTACATTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAGGAAGGGAATAAAAATTCTGGCAATGAAGAGGTTCCTGGTTGTGCTGTAGGCGATTTAGGAGAGGAGGAGGATAGAGAGGATAGAGAGGAGGAGGAGGAAGA
GGAAGAGGAAGAGGAAGAAGAAGAGGAAGAAGTGGAGGATGAAGGAGAAGATGATATTGAAGAAGAAGATGGTTACACTTTCAAATTTAAGGCCGGAGAAAATCCTTTTG
ATTTTGTTGAAGGGACGGATTTTAGCATCCAACCATATAAAAAATTTGAGCGTCTTGAATATGAAGCTCTTGCTGAGAAAAAACGAAAAGCTCTTGCAAATGGTCAGAGC
GAGAGATCTGCAAAGAGGGGCAGGGTAGAAGATATTACTGGTGCAAGCTTTGATGAAATTATGGAAGCTATGAATTATGGATCTAGGAGGAAGCGGAAGGAGCCTAAAAA
AAGAGGTAGACGGAAGGGATCAAAGAAAAAACTTAATCGTGATGTTACAAAGTTGCTTGGTGATGCAACCTTATGTTATGCTCAAGGCCAGTATGAGAAGGCTATATCTG
TATTGAGTCAAGTTGTTCTGCAAGCCCCAGATTTACCTGATTCGTACCATACGCTTGGACTTGTTTATCATGCAATTGGCGATGATGTAAAAGCCATGGGATTCTACATG
CTTGCTGCACATTTAATGCCAAGAGATTCATCTCTCTGGAAACTGCTATTTTCATGGTCGATTGACCGAGGTGACATTGATCAAGCAAGTTACTGTCTTTCTAAAGCAAT
TAAAGCAGAGCCTGATGATATTAATTTATTATTTCATCGTGCATCACTCTATCTTGAGCGTGGAGATTGTCAAAAGGCAGCTGAAACATATGACCAAATTCATCAACAAT
GTCTAGGAAATGTTGAAGCACTAATGACAGGAGCGAAGCTGTACCAAAAATGCGGTCATCTGGAACGTGGAACTTGCATTCTTGAGGACTACATTAAAGGACATCCAGCT
GAAGCTGATTTAGATGTGGTTGATCTTCTAGCTTCTTTTTATATGGGAAGTAAAGAATTCAGCAAAGCTCTTGAGCGCATTGAGCATGCGGATGAGGTGTACTGTGCAGG
AAAAGAGCTACCTTTAAACTTGACAGCTAAAGCAGGAATCTGCCACGTCCATCTTGGAAATATTGAAAAGGCAGAGTGCCTCTTTGCTAATTTGGAACGGGAAGCTGCCG
GTGATAACTCAAATTTGATGATTGAAGTTGCAGACTCGTTGCTGAGTCTTAAGCACTATAACTTGGCATTGAAGTATTATCTCATGTCAGAAGAAGTAAATGCTGGAGGG
AACATGGGGATTTTATACCTAAAAATTGCCCAATGTTACTCATCAACTAATGATAGAGCACAGGCAATTGTGTTCTTTTACAAAGTGCTTCAACATCTTGAAGATAACAT
TAATGCCCGATTAACTTTGGCTTCCCTCCTCCTTGAGGAAGCTAGAGAAGAAGAAGCCATTTCATTACTATCTCCTCCAAAAGATTCAAACTCAACTAGCTCATCTTCCA
GCAAACTCAAACCTTGGTGGCTCAATGAGAAAGTAAAACTGAAGCTTTGCCACATATACAGAACTAAAGGAATGCTCGAGAACTTCGTTGAGGCAATCTTTTCTTTGGTC
CGTGAGTCCTTATATATTGAGACTCTTCAAGAAAAGATTAAAGTGAACAAGAAGAAGCTTCCAAAGAGAGTTTTGCTTGAAAGAGTCAAAGTACTAGATGGACGTGAAAC
TGGTAGCCTATTTCGTGGCTTCAGACCCGTAGCTCCTAAATCAGATTTATCAAAAGCTTCCAGAGCAAAGAAATTGCTTCAAAAGAGGGAAAGAATCAAGGAAGAAAAGA
AGGCCAAAGCTCTGGCTGCTGGAGTCGAGGTGAACTATGATGATTTGGATGATGAGCCAACGCTACGGGGGCACCGAGAATCTCCCCTGCCTAATCTTCTGAAGGATGAA
GAATATCATAATCTTATTGTTGATCTGTGCAAGGCATTGGCTTCGTTGGGAAGATGTTCTGAAGCTTTAGAGATAATAAGTTTAACTTTAAAGTTGGCTTTTAACACATT
ATCCACAGAAAGGAAGGAGGAACTCCAGTTACTCGGAGCTCAATTAGCATTCAGCTCAACTGATACCAAGCATGGATTCAACTTTGCAAAGCACGTTGTTAAGCTGTATC
CTTACAGCATCTCTGCTTGGAACTGCTATTATAAAGTATCTTCAAGAATGACGAACCGGGATTCAAGGCATTGCAAGCTTCTGAATAGCATGCAAGCCAAATACAAAGAT
TGTGCACCGCCCTATCTCATTGCCGGGCATCAATTTACCGCCATTAGCCATCATCAAGATGCTGCAAGGAAATATCTTGAAGCTTATAAATTATTGCCGGATAGTCCCCT
AGTTAACTTGTGTGTTGGATCGGCGTTAATCAACTTAACGCTTGGATTTCGTCTTCAAAACAAGCATCAGTGTATCGCGCAAGGCTTAGCATTCCTCTACAAAAATTTGA
AGCTTTGTGATAACAGCCAGGAAGCCTTATACAACATAGCTCGAGCGTATCATCACATTGGACTCGTGACGCTGGCGGTTACATATTACGAAAAGGTGCTCGCAACTTAC
CAGAAGGATTGCCCCATCCCAGAACTTTTTGGCGAGAATCGAAACGTTAAGCATCAGAAATTAGTCTACTGTGATCTACGCAGAGAAGCAGCTTACAATTTGCATCTTAT
ATATAAAGAGAGCGGAGCTGTTGATCTTGCTAGGCAAGTCCTAAAAGATCATTGTACATTTTAA
Protein sequenceShow/hide protein sequence
MEEEGNKNSGNEEVPGCAVGDLGEEEDREDREEEEEEEEEEEEEEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQS
ERSAKRGRVEDITGASFDEIMEAMNYGSRRKRKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQYEKAISVLSQVVLQAPDLPDSYHTLGLVYHAIGDDVKAMGFYM
LAAHLMPRDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCQKAAETYDQIHQQCLGNVEALMTGAKLYQKCGHLERGTCILEDYIKGHPA
EADLDVVDLLASFYMGSKEFSKALERIEHADEVYCAGKELPLNLTAKAGICHVHLGNIEKAECLFANLEREAAGDNSNLMIEVADSLLSLKHYNLALKYYLMSEEVNAGG
NMGILYLKIAQCYSSTNDRAQAIVFFYKVLQHLEDNINARLTLASLLLEEAREEEAISLLSPPKDSNSTSSSSSKLKPWWLNEKVKLKLCHIYRTKGMLENFVEAIFSLV
RESLYIETLQEKIKVNKKKLPKRVLLERVKVLDGRETGSLFRGFRPVAPKSDLSKASRAKKLLQKRERIKEEKKAKALAAGVEVNYDDLDDEPTLRGHRESPLPNLLKDE
EYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNTLSTERKEELQLLGAQLAFSSTDTKHGFNFAKHVVKLYPYSISAWNCYYKVSSRMTNRDSRHCKLLNSMQAKYKD
CAPPYLIAGHQFTAISHHQDAARKYLEAYKLLPDSPLVNLCVGSALINLTLGFRLQNKHQCIAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATY
QKDCPIPELFGENRNVKHQKLVYCDLRREAAYNLHLIYKESGAVDLARQVLKDHCTF