; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G41340 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G41340
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Descriptiongeneral transcription factor 3C polypeptide 3
Genome locationChr3:35442203..35458120
RNA-Seq ExpressionCSPI03G41340
SyntenyCSPI03G41340
Gene Ontology termsGO:0006383 - transcription by RNA polymerase III (biological process)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily
IPR019734 - Tetratricopeptide repeat
IPR039340 - Transcription factor Tfc4/TFIIIC-102/Sfc4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008447634.1 PREDICTED: general transcription factor 3C polypeptide 3 isoform X1 [Cucumis melo]0.0e+0096.39Show/hide
Query:  MEKEGSRISDCEEVPGEVMHVLGTEKEVVETGVEDREGEEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYE
        MEKEG+RISD EEVPG+VMHVLG EKE VETGV DR    EEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYE
Subjt:  MEKEGSRISDCEEVPGEVMHVLGTEKEVVETGVEDREGEEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYE

Query:  ALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQVVLRAPDL
        ALAEKKRKALANGQSERAAKRGRVED++GASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQG+HEKAISLLRQVVL+APDL
Subjt:  ALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQVVLRAPDL

Query:  PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL
        PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSI+RGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL
Subjt:  PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL

Query:  GNVEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVVDLLASLYMGSKEFSKALERIEHADRVYCAGNELPLNLTTKAGICHAHLGDLEKAECLF
        GNVEALMTGAKLYQKCGHLERAICILEDYIK HPSEADLDVVDLLASLYMGSKEFSKALE IEHADRVYCAGNELPLNLT KAGICHAHLG+LEKAECLF
Subjt:  GNVEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVVDLLASLYMGSKEFSKALERIEHADRVYCAGNELPLNLTTKAGICHAHLGDLEKAECLF

Query:  ANLRRETTYVHSNLMIEVADSLMSLKHYSWALKYYLMSEEVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASLLLEEARDKE
        ANLRRETTY HSNLMIEVADSL+SLKHYSWALKYYLMSEEVNAGENMGILY K+AECYLSTNE+EQAIVFFYKVLQHVEDNINARLTLASLLLEEARD+E
Subjt:  ANLRRETTYVHSNLMIEVADSLMSLKHYSWALKYYLMSEEVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASLLLEEARDKE

Query:  AISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK
        AISLLSPPKDSNPTSSSS KLKPWWLNEKVKLKLCHIYR RGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGF+
Subjt:  AISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK

Query:  PVAPKSDLTKASRAKRLLQKRERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEIISLTLKLAFN
        PVAPKSDLTKASRAKRLLQKR+RIKEEKKAK LAAGVN+SYDDLDDEPALRMHRESPLPNLLKEEE+HILIVDLCKALASLGRCSEALEIISLTLKLAFN
Subjt:  PVAPKSDLTKASRAKRLLQKRERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEIISLTLKLAFN

Query:  SLSMERKEELQLLGAQLAFSSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPYIIAGHQFTTISHHQDAARKY
        SLS ERKEELQLLGAQLAFSSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQ+KYKDCAPPYIIAGHQFTTISHHQDAARKY
Subjt:  SLSMERKEELQLLGAQLAFSSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPYIIAGHQFTTISHHQDAARKY

Query:  LEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRN
        LEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRN
Subjt:  LEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRN

Query:  IKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        IKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
Subjt:  IKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

XP_011652346.1 general transcription factor 3C polypeptide 3 [Cucumis sativus]0.0e+0099.79Show/hide
Query:  MEKEGSRISDCEEVPGEVMHVLGTEKEVVETGVEDREGEEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYE
        MEKEGSRISDCEEVPGEVMHVLGTEKEVVETGVEDREGEEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYE
Subjt:  MEKEGSRISDCEEVPGEVMHVLGTEKEVVETGVEDREGEEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYE

Query:  ALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQVVLRAPDL
        ALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQVVLRAPDL
Subjt:  ALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQVVLRAPDL

Query:  PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL
        PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL
Subjt:  PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL

Query:  GNVEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVVDLLASLYMGSKEFSKALERIEHADRVYCAGNELPLNLTTKAGICHAHLGDLEKAECLF
        GNVEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVVDLLASLYMGSKEFSKALERIEHADRVYCAGNELPLNLTTKAGICHAHLGDLEKAECLF
Subjt:  GNVEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVVDLLASLYMGSKEFSKALERIEHADRVYCAGNELPLNLTTKAGICHAHLGDLEKAECLF

Query:  ANLRRETTYVHSNLMIEVADSLMSLKHYSWALKYYLMSEEVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASLLLEEARDKE
        ANLRRETTY HSNLMIEVADSLMSLKHYSWALKYYLMSEEVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASLLLEEARDKE
Subjt:  ANLRRETTYVHSNLMIEVADSLMSLKHYSWALKYYLMSEEVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASLLLEEARDKE

Query:  AISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK
        AISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYR RGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK
Subjt:  AISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK

Query:  PVAPKSDLTKASRAKRLLQKRERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEIISLTLKLAFN
        PVAPKSDLTKASRAKRLLQKRERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEIISLTLKLAFN
Subjt:  PVAPKSDLTKASRAKRLLQKRERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEIISLTLKLAFN

Query:  SLSMERKEELQLLGAQLAFSSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPYIIAGHQFTTISHHQDAARKY
        SLSMERKEELQLLGAQLAFSSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPYIIAGHQFTTISHHQDAARKY
Subjt:  SLSMERKEELQLLGAQLAFSSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPYIIAGHQFTTISHHQDAARKY

Query:  LEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRN
        LEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRN
Subjt:  LEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRN

Query:  IKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        IKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
Subjt:  IKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

XP_022152620.1 general transcription factor 3C polypeptide 3 isoform X2 [Momordica charantia]0.0e+0086.84Show/hide
Query:  MEKEGSRISDCEEVPGEVMHVLGTEKEVVETGVEDR--------EGEEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYK
        MEKEG+ ISD +EVPG  + V G  K + ET VE+R        E EEEEEEEEEE EEEVEDEGEDDIEEEDGY FKFKAGENPFDFVEGTDFS+QPYK
Subjt:  MEKEGSRISDCEEVPGEVMHVLGTEKEVVETGVEDR--------EGEEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYK

Query:  KFERLEYEALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQ
        KFERLEYEALAEKKRKALAN QSER  KRGR+EDI GASF+EI+EAMNYGSRRKLKEPK+RGRRKGSKKK+NR++TKLLGDATLCYAQG++EKAIS+LRQ
Subjt:  KFERLEYEALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQ

Query:  VVLRAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY
        VVL+APDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDC+KAAETY
Subjt:  VVLRAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY

Query:  DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVVDLLASLYMGSKEFSKALERIEHADRVYCAGNELPLNLTTKAGICHAHLGD
        DQIHQ+C+GNVEALMTGAKLYQKCGH ERAICILEDYIKGHP+EADLDVVDLLASLYMGSKEFSKALE IEHAD+VYCAGNE+PLNL TKAGICH HLG+
Subjt:  DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVVDLLASLYMGSKEFSKALERIEHADRVYCAGNELPLNLTTKAGICHAHLGD

Query:  LEKAECLFANLRRETTYVHSNLMIEVADSLMSLKHYSWALKYYLMSEEVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASLL
        +EKAE LFANL R+T   HS+ +IE ADSL+SLKH++ ALKYYLMSEEVNAG  MGI+YLKIA+CYLSTNER +AIVFFYKVLQ +EDNINARLTLASLL
Subjt:  LEKAECLFANLRRETTYVHSNLMIEVADSLMSLKHYSWALKYYLMSEEVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASLL

Query:  LEEARDKEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRET
        LEEAR++EAISLLSPPKDSN +SSSSSK KPWWLNEKVKLKLC+IYR +G+LENFVE IF LVRESLYIETL+EKIKVNKKKLPRRVLLERVKVLDGRET
Subjt:  LEEARDKEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRET

Query:  GNLFRGFKPVAPKSDLTKASRAKRLLQKRERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEIIS
        G+LFRGF+PVAPKSDL+KASRAKRLLQKRERIKEEKKA+ALAAGVN+SYDD+DDEPALR+HRESPLPNLLK+EEYH LIVDLCKALASLGRCSEALEIIS
Subjt:  GNLFRGFKPVAPKSDLTKASRAKRLLQKRERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEIIS

Query:  LTLKLAFNSLSMERKEELQLLGAQLAFSSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPYIIAGHQFTTISH
        LTLKLAFNSLSMERKEELQLLGAQLAFSST T HGFNFAKHVVKQYPYS SAWNCYYKV+S +T+RDSRHCKLLNS+Q+KYKDCAPP+IIAGHQF  ISH
Subjt:  LTLKLAFNSLSMERKEELQLLGAQLAFSSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPYIIAGHQFTTISH

Query:  HQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP
        HQ+AA+KYLEAYK++PDSPLINLCVG++LINLALG RLQNKHQCVAQGLAFLY NLKLCDN+QEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP
Subjt:  HQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP

Query:  ELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        ++FGENRNIKH+ SVYCDLRREAAYNLHLIYK+SGALDLARQVLKDHCTF
Subjt:  ELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

XP_022997988.1 general transcription factor 3C polypeptide 3 [Cucurbita maxima]0.0e+0088.35Show/hide
Query:  MEKEGSRISDCEEVPGEVMHVLGTEKEVVETGVEDR--EGEEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLE
        ME+EG+ ISD EEVPG    V G  K VVET VEDR  E EEEEEEEEEE EEEVEDEGEDD EEEDGY FKFKAGENPFDFVEGTDFS+QPYKKFERLE
Subjt:  MEKEGSRISDCEEVPGEVMHVLGTEKEVVETGVEDR--EGEEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLE

Query:  YEALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQVVLRAP
        YEALAEKKRKALANGQSER+AKRGRVEDISGASF+EI+EAMNYGSRRK ++PKKRGRRKGSK+KLN DVTKLLGDATLCYAQG++EKAIS+L QVVL+AP
Subjt:  YEALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQVVLRAP

Query:  DLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQ
        D+PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLF+RASLYLERGDC+KAAETYDQIHQ 
Subjt:  DLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQ

Query:  CLGNVEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVVDLLASLYMGSKEFSKALERIEHADRVYCAGNELPLNLTTKAGICHAHLGDLEKAEC
         + NVEALMTGAKLYQKCGHLERAICILE+YIKGHP EADLDVVDLLASLYMGSKEFSKALE IEHAD VYCA NELPLNLT KAGICH HLG+ EKAEC
Subjt:  CLGNVEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVVDLLASLYMGSKEFSKALERIEHADRVYCAGNELPLNLTTKAGICHAHLGDLEKAEC

Query:  LFANLRRETTYVHSNLMIEVADSLMSLKHYSWALKYYLMSEEVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASLLLEEARD
        LFANLRRE     SNLMIEVADSL+SLKHY+ ALKYYLMSEEVNAG N+GILYLKIA+CY STNER +AIVFFYKVLQH+EDNINARLTLASLLLEEAR+
Subjt:  LFANLRRETTYVHSNLMIEVADSLMSLKHYSWALKYYLMSEEVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASLLLEEARD

Query:  KEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRG
        +EAISLLSPPKDSN TSSSSSK KPWWLNE+VKLKLCHI+R +G+LENFVE IFPLVRESLYIETL EKIKVNKKKLP+RVLLERVKVLDGR+TG LFRG
Subjt:  KEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRG

Query:  FKPVAPKSDLTKASRAKRLLQKRERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEIISLTLKLA
        F+PVAPKSDL+KASRAK+LLQKRERI+EEKKA+ALAAGVNL+YDD DDEP LR+ RESPLPNLLK+EEYH LIVDLCKALASLGRCSEALEIISLTLKLA
Subjt:  FKPVAPKSDLTKASRAKRLLQKRERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEIISLTLKLA

Query:  FNSLSMERKEELQLLGAQLAFSSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPYIIAGHQFTTISHHQDAAR
        FNSLS+ERKEELQLLGAQLAFSST T HGFNFAKHVVKQYPYS SAWNCYYKV+S +TNRDSRHCKLLNSMQ+KYKDCAPPYIIAGHQFT ISHHQDAAR
Subjt:  FNSLSMERKEELQLLGAQLAFSSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPYIIAGHQFTTISHHQDAAR

Query:  KYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN
        KYLEAYK++PDSPLINLCVG++LINL LGFRLQNKHQCVAQGLAFLYKNLKLCDN+QEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN
Subjt:  KYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN

Query:  RNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        ++ KHQ SVYCDLRREAAYNLHLIYKESGALDLARQVLKD+CTF
Subjt:  RNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

XP_038879313.1 general transcription factor 3C polypeptide 3 [Benincasa hispida]0.0e+0092.99Show/hide
Query:  MEKEGSRISDCEEVPGEVMHVLGTEKEVVETGVEDREGEEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYE
        MEKEG+ +SD EEVPG  M VLG  KEVVETGVEDR   EEEEEEEEEGEEEVEDEGEDDIEEEDGY FKFKAGENPFDFVEGTDFS+QPYKKFERLEYE
Subjt:  MEKEGSRISDCEEVPGEVMHVLGTEKEVVETGVEDREGEEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYE

Query:  ALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQVVLRAPDL
        ALAEKKRKALANGQSERAAKRGRVEDI GASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQG+HEKA+S+LRQVVL+APDL
Subjt:  ALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQVVLRAPDL

Query:  PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL
        PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDI+LLFHRASLYLERGDCEKAAETYDQIHQQCL
Subjt:  PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL

Query:  GNVEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVVDLLASLYMGSKEFSKALERIEHADRVYCAGNELPLNLTTKAGICHAHLGDLEKAECLF
        GNVEALMTGAKLYQKCGHLERAICILEDYIK HP+EADLDVVDLLASLYMG+KEF KALERIEHAD VYCAG+ELPL LTTK GICHAHLG++EKAECLF
Subjt:  GNVEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVVDLLASLYMGSKEFSKALERIEHADRVYCAGNELPLNLTTKAGICHAHLGDLEKAECLF

Query:  ANLRRETTYVHSNLMIEVADSLMSLKHYSWALKYYLMSEEVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASLLLEEARDKE
        ANL  ET   HSNLMIEVADSL+SLKHY+ ALKYYLM EEVNAG NMGILYLKIA+CYLSTNER QAIVFFYKVLQH+EDNINARLTLASLLLEEARD+E
Subjt:  ANLRRETTYVHSNLMIEVADSLMSLKHYSWALKYYLMSEEVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASLLLEEARDKE

Query:  AISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK
        AISLLSPPKDSNPTSSSSSKLKPWW NEKVKLKLCHIY+ RG+LE+FVEVIFPLVRESLYIETLQEKIKVNKKKLP+RVLLERVKVLDGRE+GNLFRGF+
Subjt:  AISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK

Query:  PVAPKSDLTKASRAKRLLQKRERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEIISLTLKLAFN
        PVAPKSDL+KASRAKRLLQKRERIKEEKKAKALAAGV+++YDDLDDEPALRMHRESPLPNLLKEEEYH LIVDLCKALASLGRCSEALEIISLTLKLA N
Subjt:  PVAPKSDLTKASRAKRLLQKRERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEIISLTLKLAFN

Query:  SLSMERKEELQLLGAQLAFSSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPYIIAGHQFTTISHHQDAARKY
        SLS+ERKEELQLLGAQLAFSST T HGFNFAKHVVKQYPYSISAWNCYYKVAS LTNRDSRHCKLLNSMQ+KYKDCAPPYIIAGHQFT ISHHQDAARKY
Subjt:  SLSMERKEELQLLGAQLAFSSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPYIIAGHQFTTISHHQDAARKY

Query:  LEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRN
        LEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDN+QEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRN
Subjt:  LEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRN

Query:  IKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        IKHQ SVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
Subjt:  IKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

TrEMBL top hitse value%identityAlignment
A0A0A0LGB1 Uncharacterized protein0.0e+0097.13Show/hide
Query:  MEKEGSRISDCEEVPGEVMHVLGTEKEVVETGVEDREGEEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYE
        MEKEGSRISDCEEVPGEVMHVLGTEKEVVETGVEDREGEEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYE
Subjt:  MEKEGSRISDCEEVPGEVMHVLGTEKEVVETGVEDREGEEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYE

Query:  ALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQVVLRAPDL
        ALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQVVLRAPDL
Subjt:  ALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQVVLRAPDL

Query:  PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL
        PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL
Subjt:  PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL

Query:  GNVEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVVDLLASLYMGSKEFSKALERIEHADRVYCAGNELPLNLTTKAGICHAHLGDLEKAECLF
        GNVEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVVDLLASLYMGSKEFSKALERIEHADRVYCAGNELPLNLTTKAGICHAHLGDLEKAECLF
Subjt:  GNVEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVVDLLASLYMGSKEFSKALERIEHADRVYCAGNELPLNLTTKAGICHAHLGDLEKAECLF

Query:  ANLRRETTYVHSNLMIEVADSLMSLKHYSWALKYYLMSEEVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASLLLEEARDKE
        ANLRRETTY HSNLMIEVADSLMSLKHYSWALKYYLMSEEVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASLLLEEARDKE
Subjt:  ANLRRETTYVHSNLMIEVADSLMSLKHYSWALKYYLMSEEVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASLLLEEARDKE

Query:  AISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK
        AISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYR RGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK
Subjt:  AISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK

Query:  PVAPKSDLTKASRAKRLLQKRERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEIISLTLKLAFN
        PVAPKSDLTKASRAKRLLQKRERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEIISLTL     
Subjt:  PVAPKSDLTKASRAKRLLQKRERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEIISLTLKLAFN

Query:  SLSMERKEELQLLGAQLAFSSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPYIIAGHQFTTISHHQDAARKY
                           +STGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPYIIAGHQFTTISHHQDAARKY
Subjt:  SLSMERKEELQLLGAQLAFSSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPYIIAGHQFTTISHHQDAARKY

Query:  LEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRN
        LEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRN
Subjt:  LEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRN

Query:  IKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        IKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
Subjt:  IKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

A0A1S3BHB9 general transcription factor 3C polypeptide 3 isoform X10.0e+0096.39Show/hide
Query:  MEKEGSRISDCEEVPGEVMHVLGTEKEVVETGVEDREGEEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYE
        MEKEG+RISD EEVPG+VMHVLG EKE VETGV DR    EEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYE
Subjt:  MEKEGSRISDCEEVPGEVMHVLGTEKEVVETGVEDREGEEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYE

Query:  ALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQVVLRAPDL
        ALAEKKRKALANGQSERAAKRGRVED++GASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQG+HEKAISLLRQVVL+APDL
Subjt:  ALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQVVLRAPDL

Query:  PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL
        PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSI+RGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL
Subjt:  PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL

Query:  GNVEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVVDLLASLYMGSKEFSKALERIEHADRVYCAGNELPLNLTTKAGICHAHLGDLEKAECLF
        GNVEALMTGAKLYQKCGHLERAICILEDYIK HPSEADLDVVDLLASLYMGSKEFSKALE IEHADRVYCAGNELPLNLT KAGICHAHLG+LEKAECLF
Subjt:  GNVEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVVDLLASLYMGSKEFSKALERIEHADRVYCAGNELPLNLTTKAGICHAHLGDLEKAECLF

Query:  ANLRRETTYVHSNLMIEVADSLMSLKHYSWALKYYLMSEEVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASLLLEEARDKE
        ANLRRETTY HSNLMIEVADSL+SLKHYSWALKYYLMSEEVNAGENMGILY K+AECYLSTNE+EQAIVFFYKVLQHVEDNINARLTLASLLLEEARD+E
Subjt:  ANLRRETTYVHSNLMIEVADSLMSLKHYSWALKYYLMSEEVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASLLLEEARDKE

Query:  AISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK
        AISLLSPPKDSNPTSSSS KLKPWWLNEKVKLKLCHIYR RGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGF+
Subjt:  AISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK

Query:  PVAPKSDLTKASRAKRLLQKRERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEIISLTLKLAFN
        PVAPKSDLTKASRAKRLLQKR+RIKEEKKAK LAAGVN+SYDDLDDEPALRMHRESPLPNLLKEEE+HILIVDLCKALASLGRCSEALEIISLTLKLAFN
Subjt:  PVAPKSDLTKASRAKRLLQKRERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEIISLTLKLAFN

Query:  SLSMERKEELQLLGAQLAFSSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPYIIAGHQFTTISHHQDAARKY
        SLS ERKEELQLLGAQLAFSSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQ+KYKDCAPPYIIAGHQFTTISHHQDAARKY
Subjt:  SLSMERKEELQLLGAQLAFSSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPYIIAGHQFTTISHHQDAARKY

Query:  LEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRN
        LEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRN
Subjt:  LEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRN

Query:  IKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        IKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
Subjt:  IKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

A0A6J1DEF9 general transcription factor 3C polypeptide 3 isoform X20.0e+0086.84Show/hide
Query:  MEKEGSRISDCEEVPGEVMHVLGTEKEVVETGVEDR--------EGEEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYK
        MEKEG+ ISD +EVPG  + V G  K + ET VE+R        E EEEEEEEEEE EEEVEDEGEDDIEEEDGY FKFKAGENPFDFVEGTDFS+QPYK
Subjt:  MEKEGSRISDCEEVPGEVMHVLGTEKEVVETGVEDR--------EGEEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYK

Query:  KFERLEYEALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQ
        KFERLEYEALAEKKRKALAN QSER  KRGR+EDI GASF+EI+EAMNYGSRRKLKEPK+RGRRKGSKKK+NR++TKLLGDATLCYAQG++EKAIS+LRQ
Subjt:  KFERLEYEALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQ

Query:  VVLRAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY
        VVL+APDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDC+KAAETY
Subjt:  VVLRAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY

Query:  DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVVDLLASLYMGSKEFSKALERIEHADRVYCAGNELPLNLTTKAGICHAHLGD
        DQIHQ+C+GNVEALMTGAKLYQKCGH ERAICILEDYIKGHP+EADLDVVDLLASLYMGSKEFSKALE IEHAD+VYCAGNE+PLNL TKAGICH HLG+
Subjt:  DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVVDLLASLYMGSKEFSKALERIEHADRVYCAGNELPLNLTTKAGICHAHLGD

Query:  LEKAECLFANLRRETTYVHSNLMIEVADSLMSLKHYSWALKYYLMSEEVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASLL
        +EKAE LFANL R+T   HS+ +IE ADSL+SLKH++ ALKYYLMSEEVNAG  MGI+YLKIA+CYLSTNER +AIVFFYKVLQ +EDNINARLTLASLL
Subjt:  LEKAECLFANLRRETTYVHSNLMIEVADSLMSLKHYSWALKYYLMSEEVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASLL

Query:  LEEARDKEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRET
        LEEAR++EAISLLSPPKDSN +SSSSSK KPWWLNEKVKLKLC+IYR +G+LENFVE IF LVRESLYIETL+EKIKVNKKKLPRRVLLERVKVLDGRET
Subjt:  LEEARDKEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRET

Query:  GNLFRGFKPVAPKSDLTKASRAKRLLQKRERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEIIS
        G+LFRGF+PVAPKSDL+KASRAKRLLQKRERIKEEKKA+ALAAGVN+SYDD+DDEPALR+HRESPLPNLLK+EEYH LIVDLCKALASLGRCSEALEIIS
Subjt:  GNLFRGFKPVAPKSDLTKASRAKRLLQKRERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEIIS

Query:  LTLKLAFNSLSMERKEELQLLGAQLAFSSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPYIIAGHQFTTISH
        LTLKLAFNSLSMERKEELQLLGAQLAFSST T HGFNFAKHVVKQYPYS SAWNCYYKV+S +T+RDSRHCKLLNS+Q+KYKDCAPP+IIAGHQF  ISH
Subjt:  LTLKLAFNSLSMERKEELQLLGAQLAFSSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPYIIAGHQFTTISH

Query:  HQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP
        HQ+AA+KYLEAYK++PDSPLINLCVG++LINLALG RLQNKHQCVAQGLAFLY NLKLCDN+QEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP
Subjt:  HQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP

Query:  ELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        ++FGENRNIKH+ SVYCDLRREAAYNLHLIYK+SGALDLARQVLKDHCTF
Subjt:  ELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

A0A6J1DGK2 general transcription factor 3C polypeptide 3 isoform X10.0e+0086.75Show/hide
Query:  MEKEGSRISDCEEVPGEVMHVLGTEKEVVETGVEDR--------EGEEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYK
        MEKEG+ ISD +EVPG  + V G  K + ET VE+R        E EEEEEEEEEE EEEVEDEGEDDIEEEDGY FKFKAGENPFDFVEGTDFS+QPYK
Subjt:  MEKEGSRISDCEEVPGEVMHVLGTEKEVVETGVEDR--------EGEEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYK

Query:  KFERLEYEALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQ
        KFERLEYEALAEKKRKALAN QSER  KRGR+EDI GASF+EI+EAMNYGSRRKLKEPK+RGRRKGSKKK+NR++TKLLGDATLCYAQG++EKAIS+LRQ
Subjt:  KFERLEYEALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQ

Query:  VVLRAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY
        VVL+APDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDC+KAAETY
Subjt:  VVLRAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY

Query:  DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVVDLLASLYMGSKEFSKALERIEHADRVYCAGNELPLNLTTKAGICHAHLGD
        DQIHQ+C+GNVEALMTGAKLYQKCGH ERAICILEDYIKGHP+EADLDVVDLLASLYMGSKEFSKALE IEHAD+VYCAGNE+PLNL TKAGICH HLG+
Subjt:  DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVVDLLASLYMGSKEFSKALERIEHADRVYCAGNELPLNLTTKAGICHAHLGD

Query:  LEKAECLFANLRRETTYVHSNLMIEVADSLMSLKHYSWALKYYLMSEEVNAGENM-GILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASL
        +EKAE LFANL R+T   HS+ +IE ADSL+SLKH++ ALKYYLMSEEVNAG  M GI+YLKIA+CYLSTNER +AIVFFYKVLQ +EDNINARLTLASL
Subjt:  LEKAECLFANLRRETTYVHSNLMIEVADSLMSLKHYSWALKYYLMSEEVNAGENM-GILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASL

Query:  LLEEARDKEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRE
        LLEEAR++EAISLLSPPKDSN +SSSSSK KPWWLNEKVKLKLC+IYR +G+LENFVE IF LVRESLYIETL+EKIKVNKKKLPRRVLLERVKVLDGRE
Subjt:  LLEEARDKEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRE

Query:  TGNLFRGFKPVAPKSDLTKASRAKRLLQKRERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEII
        TG+LFRGF+PVAPKSDL+KASRAKRLLQKRERIKEEKKA+ALAAGVN+SYDD+DDEPALR+HRESPLPNLLK+EEYH LIVDLCKALASLGRCSEALEII
Subjt:  TGNLFRGFKPVAPKSDLTKASRAKRLLQKRERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEII

Query:  SLTLKLAFNSLSMERKEELQLLGAQLAFSSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPYIIAGHQFTTIS
        SLTLKLAFNSLSMERKEELQLLGAQLAFSST T HGFNFAKHVVKQYPYS SAWNCYYKV+S +T+RDSRHCKLLNS+Q+KYKDCAPP+IIAGHQF  IS
Subjt:  SLTLKLAFNSLSMERKEELQLLGAQLAFSSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPYIIAGHQFTTIS

Query:  HHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPI
        HHQ+AA+KYLEAYK++PDSPLINLCVG++LINLALG RLQNKHQCVAQGLAFLY NLKLCDN+QEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPI
Subjt:  HHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPI

Query:  PELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        P++FGENRNIKH+ SVYCDLRREAAYNLHLIYK+SGALDLARQVLKDHCTF
Subjt:  PELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

A0A6J1KFI7 general transcription factor 3C polypeptide 30.0e+0088.35Show/hide
Query:  MEKEGSRISDCEEVPGEVMHVLGTEKEVVETGVEDR--EGEEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLE
        ME+EG+ ISD EEVPG    V G  K VVET VEDR  E EEEEEEEEEE EEEVEDEGEDD EEEDGY FKFKAGENPFDFVEGTDFS+QPYKKFERLE
Subjt:  MEKEGSRISDCEEVPGEVMHVLGTEKEVVETGVEDR--EGEEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLE

Query:  YEALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQVVLRAP
        YEALAEKKRKALANGQSER+AKRGRVEDISGASF+EI+EAMNYGSRRK ++PKKRGRRKGSK+KLN DVTKLLGDATLCYAQG++EKAIS+L QVVL+AP
Subjt:  YEALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQVVLRAP

Query:  DLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQ
        D+PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLF+RASLYLERGDC+KAAETYDQIHQ 
Subjt:  DLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQ

Query:  CLGNVEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVVDLLASLYMGSKEFSKALERIEHADRVYCAGNELPLNLTTKAGICHAHLGDLEKAEC
         + NVEALMTGAKLYQKCGHLERAICILE+YIKGHP EADLDVVDLLASLYMGSKEFSKALE IEHAD VYCA NELPLNLT KAGICH HLG+ EKAEC
Subjt:  CLGNVEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVVDLLASLYMGSKEFSKALERIEHADRVYCAGNELPLNLTTKAGICHAHLGDLEKAEC

Query:  LFANLRRETTYVHSNLMIEVADSLMSLKHYSWALKYYLMSEEVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASLLLEEARD
        LFANLRRE     SNLMIEVADSL+SLKHY+ ALKYYLMSEEVNAG N+GILYLKIA+CY STNER +AIVFFYKVLQH+EDNINARLTLASLLLEEAR+
Subjt:  LFANLRRETTYVHSNLMIEVADSLMSLKHYSWALKYYLMSEEVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASLLLEEARD

Query:  KEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRG
        +EAISLLSPPKDSN TSSSSSK KPWWLNE+VKLKLCHI+R +G+LENFVE IFPLVRESLYIETL EKIKVNKKKLP+RVLLERVKVLDGR+TG LFRG
Subjt:  KEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRG

Query:  FKPVAPKSDLTKASRAKRLLQKRERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEIISLTLKLA
        F+PVAPKSDL+KASRAK+LLQKRERI+EEKKA+ALAAGVNL+YDD DDEP LR+ RESPLPNLLK+EEYH LIVDLCKALASLGRCSEALEIISLTLKLA
Subjt:  FKPVAPKSDLTKASRAKRLLQKRERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEIISLTLKLA

Query:  FNSLSMERKEELQLLGAQLAFSSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPYIIAGHQFTTISHHQDAAR
        FNSLS+ERKEELQLLGAQLAFSST T HGFNFAKHVVKQYPYS SAWNCYYKV+S +TNRDSRHCKLLNSMQ+KYKDCAPPYIIAGHQFT ISHHQDAAR
Subjt:  FNSLSMERKEELQLLGAQLAFSSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPYIIAGHQFTTISHHQDAAR

Query:  KYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN
        KYLEAYK++PDSPLINLCVG++LINL LGFRLQNKHQCVAQGLAFLYKNLKLCDN+QEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN
Subjt:  KYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN

Query:  RNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        ++ KHQ SVYCDLRREAAYNLHLIYKESGALDLARQVLKD+CTF
Subjt:  RNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

SwissProt top hitse value%identityAlignment
O74458 Transcription factor tau subunit sfc45.0e-3022.65Show/hide
Query:  EDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQ-GEHEKAISLLRQVVLRAPDLPDSYHTLGLVYNAIGDD----VK
        +DI+   ++E L+A+  G R+  K  K RGR   +    + +V ++L  A   +AQ G  ++A  L  ++V    ++  ++  LG  +   G+      K
Subjt:  EDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQ-GEHEKAISLLRQVVLRAPDLPDSYHTLGLVYNAIGDD----VK

Query:  AMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDIN----LLFHRASLYLERGDCEKAAETYDQIHQQCLGNVEALMTGAKLYQK
         +  +M AAHL PKD  LW      S      DQA YC ++A+ A+P + +     +++R+ L  E G  +KAAE +  + Q    N   L   A++Y K
Subjt:  AMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDIN----LLFHRASLYLERGDCEKAAETYDQIHQQCLGNVEALMTGAKLYQK

Query:  CGHLERAIC----ILEDYIKGHPSEA------DLDVVDLLASLYMGSKEFSKALERIEHADRVY----------------------------CAGNE---
          H  R I     I   Y   +P+        DL  ++L A L +   ++S  +  I    R +                             A  E   
Subjt:  CGHLERAIC----ILEDYIKGHPSEA------DLDVVDLLASLYMGSKEFSKALERIEHADRVY----------------------------CAGNE---

Query:  -----LPLNLTTKAGICHAHLGDLEKAECLFANLRRETTYVHSNLMIEVADSLMSLKHYSWALKYYLMSEEVNAGENMGILYLKIAECYLSTNEREQAIV
             LP    TK GI     G+L +AE  F+ ++         ++ ++A + M ++    AL+Y+++       +N+G+ Y  +  CYL   E E A  
Subjt:  -----LPLNLTTKAGICHAHLGDLEKAECLFANLRRETTYVHSNLMIEVADSLMSLKHYSWALKYYLMSEEVNAGENMGILYLKIAECYLSTNEREQAIV

Query:  FFYKVLQHVEDNINARLTLASL-LLEEARDKEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKI
            +L     N NA + LA +  L++ RD  A+ +++                             +I+  R    N  E+     +   + + +  ++
Subjt:  FFYKVLQHVEDNINARLTLASL-LLEEARDKEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKI

Query:  KVNKKKLPRRVLLERVKVLDGRETGNLFRGFKPVAPKSDLTKASRAKRLLQKRERIKE-------------------------EKKAKALAAGV--NLSY
         V  +K+P+    +R ++   +E    F  +K    +    K    ++ L+K E + E                         EKKA+A A  +     Y
Subjt:  KVNKKKLPRRVLLERVKVLDGRETGNLFRGFKPVAPKSDLTKASRAKRLLQKRERIKE-------------------------EKKAKALAAGV--NLSY

Query:  DDLDDEPALRMHR-----------ESPLPNLLKE--------EEYHILIVDLCKALASLGRCSEALEIISLTLKLAFNSLSMERKEELQ---LLGAQLAF
          L+D+    ++R           +  L  +L+         + ++ L V+    L  +G   +A ++++  +          +++ L+   L  +  A 
Subjt:  DDLDDEPALRMHR-----------ESPLPNLLKE--------EEYHILIVDLCKALASLGRCSEALEIISLTLKLAFNSLSMERKEELQ---LLGAQLAF

Query:  SSTGTM-------HGFNFAKHVVKQYPYSIS-AWNCYYKVASCLTNRD-SRHCKLLNSMQS-----------KYKD---CAPP------YIIAGHQFTTI
           G +         F F +   + +   +S  + C          +   R  KL++ + S           K  D     P        ++ GH     
Subjt:  SSTGTM-------HGFNFAKHVVKQYPYSIS-AWNCYYKVASCLTNRD-SRHCKLLNSMQS-----------KYKD---CAPP------YIIAGHQFTTI

Query:  SHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDN-----NQEALYNIARAYHHIGLVTLAVTYYEKVLATY
             A   Y  A+ I PD P+ NL +G + ++ A+     N+H  + QG  FLY+   L  N      QEALYN+ +AYH IGL   AV YYE VL   
Subjt:  SHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDN-----NQEALYNIARAYHHIGLVTLAVTYYEKVLATY

Query:  QKDCPIPELFGENRNIKHQN-SVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
              P   G+         S   D   EAAYNL LIY  SG + LA Q+   +  F
Subjt:  QKDCPIPELFGENRNIKHQN-SVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

P33339 Transcription factor tau 131 kDa subunit1.1e-1620.45Show/hide
Query:  GEEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYEALAEKKRKALANGQSERAAKRGRVEDISGASFDEILE
        G+ ++E++ +  E E  D G+ + E+E+         ++     E  D  V    +    EY   +E+    LA             ED   A    I E
Subjt:  GEEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYEALAEKKRKALANGQSERAAKRGRVEDISGASFDEILE

Query:  AMNYGSRRKLKEPK-KRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQVVLRAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSS
        A N+  ++K K  K K   R+  ++ L+ +V +LL  A   + + + + A  L  +V+ +      +Y TLG +Y   G        + LAAHL   D  
Subjt:  AMNYGSRRKLKEPK-KRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQVVLRAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSS

Query:  LWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQ-------------CLGNVEALMTGAKLYQKC--GHLER
         WK++   S D   + QA YC S+ I   P +   ++ R+ LY + G   +A + + +++                  + + +    +LY K    ++ER
Subjt:  LWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQ-------------CLGNVEALMTGAKLYQKC--GHLER

Query:  AICIL------------------EDYIKGHPSEADLD------------------------VVDLLASLYM--------GSKEFSKALERIEHA------
           IL                  ED  +  P E D D                         +++LA L++        G K   K    I+        
Subjt:  AICIL------------------EDYIKGHPSEADLD------------------------VVDLLASLYM--------GSKEFSKALERIEHA------

Query:  -------------------DRVYCAGNE----LPLNLTTKAGICHAHLGDLEKAECLFANLRRETTYVHSNLMIEVADSLMSLKHYSWALKYY---LMSE
                           D +  A  E    +P+++  + G+   +  +L +A   F  L  ET    ++L  E A +L   + Y  A+ ++   L  E
Subjt:  -------------------DRVYCAGNE----LPLNLTTKAGICHAHLGDLEKAECLFANLRRETTYVHSNLMIEVADSLMSLKHYSWALKYY---LMSE

Query:  EVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASLL------------------LEEARDKEAISLLSPPKDSNPTSSSSSK-
        E    +    ++  +A CY      E A  F+   ++   D+++ R++LA +                   + + +  E +  +S  K SN TS  SSK 
Subjt:  EVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASLL------------------LEEARDKEAISLLSPPKDSNPTSSSSSK-

Query:  -LKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFKPVAPKSDLTKASRAKRLLQ
         L+        K K       R  +E    +   +V +   ++  +    +N+ K    + +  V      E  ++F   K    KS   ++ +   +L+
Subjt:  -LKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFKPVAPKSDLTKASRAKRLLQ

Query:  KRERIKEE-----KKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEIISLTLKLAFNSLSMERKEELQLL-
        + ++   E     ++   LA G ++    L +E  + +   + L   L  E++  L ++L   +A      + L ++    ++       ER + ++ + 
Subjt:  KRERIKEE-----KKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEIISLTLKLAFNSLSMERKEELQLL-

Query:  ---------GAQLAFSSTGTMHGFNFAKHVVKQYPYSI----SAWNCYYKVA-------------SCLTNRD-SRHCKLLNSMQSKYKDCAPPYIIAGHQ
                   +LA +  G ++ F F + V++ + YS+    S+ N                   SC  N + +    + N         + PY+   + 
Subjt:  ---------GAQLAFSSTGTMHGFNFAKHVVKQYPYSI----SAWNCYYKVA-------------SCLTNRD-SRHCKLLNSMQSKYKDCAPPYIIAGHQ

Query:  FTTISHHQDAARKYLEAYKIM-------PDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDN------NQEALYNIARAYHHIGLVTL
            S     +R +L A + +       PD P++NL +G S I+ A+      +H  +  GL +LY+  K+  +       QEA YN+ RA+H IGLV++
Subjt:  FTTISHHQDAARKYLEAYKIM-------PDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDN------NQEALYNIARAYHHIGLVTL

Query:  AVTYYEKVLATYQKDCPIPELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDH
        A+ YY +VL  Y                         L++ AAYN  +IY++SG ++LA  +++ +
Subjt:  AVTYYEKVLATYQKDCPIPELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDH

Q9Y5Q9 General transcription factor 3C polypeptide 31.1e-6424.73Show/hide
Query:  KEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQVVLRAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSID
        +E KK  + K  + KL R +  L+G+A + +A+GE E+AI +  +++ +AP   + + TL ++Y   GD  K++ F ++AAHL P D+  W  L   S++
Subjt:  KEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQVVLRAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSID

Query:  RGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGN-----VEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVV
        + +I QA +C +KA+K EP ++  L+ R+SLY + GD + A + Y +I      +     ++     AK Y +   +  AI I+++    H     ++ V
Subjt:  RGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGN-----VEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVV

Query:  DLLASLYMGSKEFSKALERI---------------------EHADRVYCA-GNELPLNLTTKAGICHAHLGDLEKAECLFANLRRETTYVHSNLMIEVAD
        ++ A LY+ +K++ KALE I                     +  + V C   + +P+++T K  +C  HL  LE    L   L  +      +L ++VA+
Subjt:  DLLASLYMGSKEFSKALERI---------------------EHADRVYCA-GNELPLNLTTKAGICHAHLGDLEKAECLFANLRRETTYVHSNLMIEVAD

Query:  SLMSLKHYSWALKYYLMSEEVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASLLLEEARDKEAISLLSPPKDSNPTSSSSSK
        + + +  Y+ AL   L +   +   N+ +++L+ AEC  +    E+A   + KV+     +++AR++L++L  +  + ++A+  L P  D +  +  ++ 
Subjt:  SLMSLKHYSWALKYYLMSEEVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASLLLEEARDKEAISLLSPPKDSNPTSSSSSK

Query:  LKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFKPVAPKSDLTKASRAKRLLQK
         +     +++KL L           +   ++F   +   Y++TL   + +  K     V + R +V        L    K       L K SR K +   
Subjt:  LKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFKPVAPKSDLTKASRAKRLLQK

Query:  RERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEIISLTLKLAFNSLSMERKEELQLLGAQLAFS
         ++      AKA+ A                      L ++L ++++  L++    +L  L R  EA  ++  +L+        ++++EL+  G   A  
Subjt:  RERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEIISLTLKLAFNSLSMERKEELQLLGAQLAFS

Query:  STGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGSS
               +N+ + +V +       WN + +V   + ++D RH +    +  K  +     ++ GH        + A  +Y++A++  PD PL + C+G +
Subjt:  STGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGSS

Query:  LINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNIKHQNSVYCDLRREAAYNLH
         I++A    +  +H  + QG +FL + L L    QE+ YN+ R  H +GL+ LA+ YY+K L        +P L  E   +        DLRR+ AYNL 
Subjt:  LINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNIKHQNSVYCDLRREAAYNLH

Query:  LIYKESGALDLARQVLKDHCT
        LIY+ SG   +A+ +L  +C+
Subjt:  LIYKESGALDLARQVLKDHCT

Arabidopsis top hitse value%identityAlignment
AT1G17680.1 tetratricopeptide repeat (TPR)-containing protein1.0e-18742.13Show/hide
Query:  EGSRISDCEEVPGEVMHVLGTEKEVV--ETGVEDREGEEEEE--EEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEY
        EG+ IS+ EE P      +  +K+V+  +T  +D++   +EE   ++++ + + +DEG++  EE+D     F+AG  P                FER EY
Subjt:  EGSRISDCEEVPGEVMHVLGTEKEVV--ETGVEDREGEEEEE--EEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEY

Query:  EALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQVVLRAPD
        EALAE+KRKALA+ Q   +        + G      +E M+ G RRK ++ KK+GRR GSKK++  D+ K   +A   +A G   +A+ +L +V+ +AP 
Subjt:  EALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQVVLRAPD

Query:  LPDSYHTLGLVYNAIG-DDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQ
           +Y+ L  V   +G  +  +     +AA++    S  WKLL+    ++ +I  A    SKAI+A+PDDI L +  A + L  G   +AAET++QI ++
Subjt:  LPDSYHTLGLVYNAIG-DDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQ

Query:  CLGNVEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVVDLLASLYMGSKEFSKALERIEHADRVYCAGNELPLNLTTKAGICHAHLGDLEKAEC
        C   +EAL  G + + K G  ERA  ILED+IK H SE   DV+DLLAS++M      +AL+ I    ++Y  G EL  +L  +  ICH HL ++E+AE 
Subjt:  CLGNVEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVVDLLASLYMGSKEFSKALERIEHADRVYCAGNELPLNLTTKAGICHAHLGDLEKAEC

Query:  LFANLRRETTYVHSNLMIEVADSLMSLKHYSWALKYYL--MSEEVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASLLLEEA
        + + L +E    H  L+  +AD L ++ ++  ALKYY+  +SE VN     G L++KIA CY+S  ER+QAIVF+YK L  + D ++ R+TLASLLLE+ 
Subjt:  LFANLRRETTYVHSNLMIEVADSLMSLKHYSWALKYYL--MSEEVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASLLLEEA

Query:  RDKEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLF
        +  EA+ +LSPP++ +P    ++KLK WW N K+++ LC IY   G+LE+F      LV + ++  T++ K K       R VL E  +    R   +  
Subjt:  RDKEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLF

Query:  RGFKPVAPKSDLTKASRAKRLLQKRERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEIISLTLK
               PK    K  + +  L +  RI+E        A +    +D+  E             ++K+EEYH L VDLCKALASL R  EALEI++L  +
Subjt:  RGFKPVAPKSDLTKASRAKRLLQKRERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEIISLTLK

Query:  LAFNSLSMERKEELQLLGAQLAFSSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPYIIAGHQFTTISHHQDA
        L    L +E K+ELQ LGA+++  +      F+  + V++Q+PY ++AWNCYY V S L  R S   K ++ ++SKY+DC PP +IAGH FT  S HQDA
Subjt:  LAFNSLSMERKEELQLLGAQLAFSSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPYIIAGHQFTTISHHQDA

Query:  ARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFG
        AR+YLEAYK+MP+SPLINLCVG++LINLALGFRL+N+H+C+AQG AFLY NL++C N+QEALYN+ARAY H+GLVTLA +YYEKVLA Y+KD  +P+L  
Subjt:  ARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFG

Query:  ENRNI-KHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        E+  + + +  V CDLR+EAA+NLHLIYK SGA DLARQVLKDHCTF
Subjt:  ENRNI-KHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

AT1G17680.2 tetratricopeptide repeat (TPR)-containing protein1.0e-18742.13Show/hide
Query:  EGSRISDCEEVPGEVMHVLGTEKEVV--ETGVEDREGEEEEE--EEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEY
        EG+ IS+ EE P      +  +K+V+  +T  +D++   +EE   ++++ + + +DEG++  EE+D     F+AG  P                FER EY
Subjt:  EGSRISDCEEVPGEVMHVLGTEKEVV--ETGVEDREGEEEEE--EEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEY

Query:  EALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQVVLRAPD
        EALAE+KRKALA+ Q   +        + G      +E M+ G RRK ++ KK+GRR GSKK++  D+ K   +A   +A G   +A+ +L +V+ +AP 
Subjt:  EALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQVVLRAPD

Query:  LPDSYHTLGLVYNAIG-DDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQ
           +Y+ L  V   +G  +  +     +AA++    S  WKLL+    ++ +I  A    SKAI+A+PDDI L +  A + L  G   +AAET++QI ++
Subjt:  LPDSYHTLGLVYNAIG-DDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQ

Query:  CLGNVEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVVDLLASLYMGSKEFSKALERIEHADRVYCAGNELPLNLTTKAGICHAHLGDLEKAEC
        C   +EAL  G + + K G  ERA  ILED+IK H SE   DV+DLLAS++M      +AL+ I    ++Y  G EL  +L  +  ICH HL ++E+AE 
Subjt:  CLGNVEALMTGAKLYQKCGHLERAICILEDYIKGHPSEADLDVVDLLASLYMGSKEFSKALERIEHADRVYCAGNELPLNLTTKAGICHAHLGDLEKAEC

Query:  LFANLRRETTYVHSNLMIEVADSLMSLKHYSWALKYYL--MSEEVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASLLLEEA
        + + L +E    H  L+  +AD L ++ ++  ALKYY+  +SE VN     G L++KIA CY+S  ER+QAIVF+YK L  + D ++ R+TLASLLLE+ 
Subjt:  LFANLRRETTYVHSNLMIEVADSLMSLKHYSWALKYYL--MSEEVNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASLLLEEA

Query:  RDKEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLF
        +  EA+ +LSPP++ +P    ++KLK WW N K+++ LC IY   G+LE+F      LV + ++  T++ K K       R VL E  +    R   +  
Subjt:  RDKEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYRIRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLF

Query:  RGFKPVAPKSDLTKASRAKRLLQKRERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEIISLTLK
               PK    K  + +  L +  RI+E        A +    +D+  E             ++K+EEYH L VDLCKALASL R  EALEI++L  +
Subjt:  RGFKPVAPKSDLTKASRAKRLLQKRERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPNLLKEEEYHILIVDLCKALASLGRCSEALEIISLTLK

Query:  LAFNSLSMERKEELQLLGAQLAFSSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPYIIAGHQFTTISHHQDA
        L    L +E K+ELQ LGA+++  +      F+  + V++Q+PY ++AWNCYY V S L  R S   K ++ ++SKY+DC PP +IAGH FT  S HQDA
Subjt:  LAFNSLSMERKEELQLLGAQLAFSSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQSKYKDCAPPYIIAGHQFTTISHHQDA

Query:  ARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFG
        AR+YLEAYK+MP+SPLINLCVG++LINLALGFRL+N+H+C+AQG AFLY NL++C N+QEALYN+ARAY H+GLVTLA +YYEKVLA Y+KD  +P+L  
Subjt:  ARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFG

Query:  ENRNI-KHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        E+  + + +  V CDLR+EAA+NLHLIYK SGA DLARQVLKDHCTF
Subjt:  ENRNI-KHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

AT3G04240.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.2e-0623.53Show/hide
Query:  ATLCYAQGEHEKAISLLRQVVLRAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLF
        A L    G+  +A+   ++ V   P  PD+Y  LG VY A+G   +A+  Y  A  + P  +  +  + S   ++G +D A     +A+  +P  +    
Subjt:  ATLCYAQGEHEKAISLLRQVVLRAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLF

Query:  HRASLYLERGDCEKAAETYDQI------HQQCLGNV
        +  +   + G  ++A   Y+Q       H Q + N+
Subjt:  HRASLYLERGDCEKAAETYDQI------HQQCLGNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAAGGAGGGGAGTAGAATTTCTGACTGTGAAGAGGTTCCTGGTGAAGTTATGCATGTTTTGGGAACGGAAAAAGAGGTTGTAGAAACAGGAGTCGAGGATAGAGA
GGGGGAGGAGGAGGAGGAGGAGGAAGAGGAAGAAGGAGAGGAAGAAGTGGAGGATGAAGGAGAGGATGATATTGAAGAAGAAGATGGATACACATTCAAATTTAAGGCTG
GAGAAAATCCATTTGATTTTGTTGAAGGAACAGATTTTAGTGTCCAACCATATAAAAAATTTGAGCGCCTTGAATATGAAGCCCTTGCTGAGAAAAAGAGAAAAGCTCTT
GCAAATGGTCAAAGTGAGAGAGCTGCAAAGAGGGGCAGGGTAGAAGACATTTCTGGTGCTAGCTTTGATGAAATTTTGGAAGCTATGAATTATGGGTCTAGGAGGAAGTT
AAAGGAGCCTAAAAAAAGAGGTAGACGAAAAGGATCAAAGAAAAAACTTAATCGCGATGTTACAAAGTTGCTTGGTGATGCAACTTTATGTTATGCTCAAGGCGAGCACG
AAAAGGCTATATCTTTATTGCGCCAAGTTGTACTACGAGCCCCAGATTTACCTGATTCGTACCACACACTTGGACTTGTTTACAATGCAATTGGTGATGATGTAAAGGCC
ATGGGATTCTACATGCTTGCTGCACATTTAATGCCAAAAGATTCATCTCTCTGGAAACTGCTATTTTCATGGTCAATTGACCGTGGTGATATTGATCAAGCAAGCTATTG
TCTTTCTAAAGCAATAAAAGCAGAGCCTGATGATATTAATTTATTATTTCATCGAGCATCACTCTACCTTGAGCGTGGAGATTGTGAGAAAGCAGCTGAAACATATGATC
AGATTCATCAACAATGCCTTGGTAACGTTGAAGCACTCATGACAGGAGCAAAGCTGTACCAAAAATGTGGTCATCTTGAACGTGCAATTTGCATTCTTGAGGACTACATC
AAAGGGCATCCATCTGAAGCTGATTTAGATGTGGTTGATCTTTTAGCTTCTTTATACATGGGAAGTAAAGAATTCAGCAAAGCTCTTGAGCGCATTGAACATGCAGATAG
GGTGTACTGTGCAGGAAATGAGCTACCTTTAAACTTGACAACTAAAGCAGGAATCTGCCATGCTCATCTTGGAGATTTGGAGAAGGCAGAGTGCCTCTTTGCTAATTTGA
GACGGGAAACTACCTATGTTCACTCAAATTTGATGATTGAAGTTGCAGACTCGTTGATGAGTCTTAAGCACTATAGCTGGGCATTGAAATATTATCTGATGTCCGAAGAA
GTAAATGCTGGAGAGAACATGGGGATTTTATACCTAAAAATTGCCGAGTGTTACTTATCAACTAATGAAAGAGAACAGGCAATTGTTTTCTTTTATAAAGTGCTTCAACA
TGTTGAAGATAACATTAATGCTCGATTAACTTTGGCATCCCTCCTTCTTGAGGAAGCTAGAGACAAAGAAGCCATTTCATTACTATCTCCTCCAAAAGATTCAAACCCAA
CTAGCTCATCTTCCAGCAAATTAAAACCTTGGTGGCTCAATGAGAAAGTAAAACTGAAGCTTTGCCACATATACAGAATTAGAGGATTGCTTGAGAACTTCGTTGAGGTG
ATCTTTCCTTTGGTCCGAGAGTCCTTATATATCGAGACTCTTCAAGAAAAGATTAAAGTGAACAAGAAGAAGCTTCCAAGGAGGGTTTTGCTCGAGAGAGTCAAAGTACT
AGATGGACGTGAAACTGGCAACCTATTTCGTGGATTCAAACCTGTAGCTCCGAAATCAGATTTAACAAAAGCATCCAGAGCAAAGAGATTACTTCAAAAAAGGGAAAGAA
TCAAGGAAGAAAAGAAGGCTAAAGCACTGGCTGCAGGAGTCAATTTGAGCTATGATGATTTAGATGATGAACCAGCGCTACGGATGCACCGAGAATCCCCCCTGCCTAAC
CTTCTGAAAGAAGAAGAATACCATATTCTTATTGTCGATTTGTGCAAGGCATTGGCTTCCTTAGGAAGATGTTCTGAAGCTTTAGAGATTATAAGCCTAACTTTAAAGTT
GGCTTTTAACTCATTATCGATGGAAAGGAAGGAAGAACTCCAGTTACTTGGAGCTCAATTAGCTTTCAGCTCAACTGGTACCATGCATGGTTTCAACTTTGCAAAGCACG
TTGTTAAGCAGTACCCTTACAGCATCTCGGCTTGGAACTGCTATTATAAAGTAGCTTCATGTTTGACGAACCGGGATTCAAGGCATTGCAAGCTTCTGAATAGCATGCAA
TCCAAATACAAAGATTGTGCACCACCCTATATCATTGCGGGGCATCAGTTTACCACCATTAGCCATCATCAAGATGCTGCAAGGAAATATCTTGAAGCTTATAAAATCAT
GCCGGATAGTCCCCTAATTAACTTGTGTGTTGGATCGTCCTTGATCAACTTGGCTCTCGGATTCCGCCTTCAAAACAAGCATCAGTGTGTTGCGCAGGGCTTGGCATTCC
TCTACAAAAATTTGAAGCTTTGCGATAACAACCAGGAAGCCTTGTACAACATAGCTCGAGCATATCATCACATTGGACTGGTGACACTGGCAGTTACATATTACGAAAAG
GTGCTCGCAACTTACCAGAAGGATTGCCCCATTCCAGAACTTTTTGGTGAGAATCGAAATATTAAACATCAGAATTCAGTCTATTGTGACCTACGGAGAGAAGCAGCTTA
CAATTTGCATCTTATTTATAAAGAGAGTGGAGCTCTTGATCTTGCTAGGCAAGTCCTAAAAGATCATTGCACATTTTAA
mRNA sequenceShow/hide mRNA sequence
GTTTCAAAATTCTGTAGCCATCATAGCCCCTTCCTCCAATTTACTAGCCACTTTCACCCGCCACACCGACCGCCGCCCCCACCGTTTCCCTCACGCCACTCGCCGTTCAT
AAACGTTTATAAGCTGTTTTGAGTTCGAAGGTTGAACGAATTTGTGATTAAACTATTCTTTCCCTCCAAAATTGGATTAGCATTTACAATTTCTCCTGCCCCGTTTTCCC
ACCACCGCCACTTTCATCCATTGGGCTGCCACCACCCTTCGATCAGAATTTTTTTTTCAAGAGTTTCCAGTAACCTGCATTTGAATGGAAAAGGAGGGGAGTAGAATTTC
TGACTGTGAAGAGGTTCCTGGTGAAGTTATGCATGTTTTGGGAACGGAAAAAGAGGTTGTAGAAACAGGAGTCGAGGATAGAGAGGGGGAGGAGGAGGAGGAGGAGGAAG
AGGAAGAAGGAGAGGAAGAAGTGGAGGATGAAGGAGAGGATGATATTGAAGAAGAAGATGGATACACATTCAAATTTAAGGCTGGAGAAAATCCATTTGATTTTGTTGAA
GGAACAGATTTTAGTGTCCAACCATATAAAAAATTTGAGCGCCTTGAATATGAAGCCCTTGCTGAGAAAAAGAGAAAAGCTCTTGCAAATGGTCAAAGTGAGAGAGCTGC
AAAGAGGGGCAGGGTAGAAGACATTTCTGGTGCTAGCTTTGATGAAATTTTGGAAGCTATGAATTATGGGTCTAGGAGGAAGTTAAAGGAGCCTAAAAAAAGAGGTAGAC
GAAAAGGATCAAAGAAAAAACTTAATCGCGATGTTACAAAGTTGCTTGGTGATGCAACTTTATGTTATGCTCAAGGCGAGCACGAAAAGGCTATATCTTTATTGCGCCAA
GTTGTACTACGAGCCCCAGATTTACCTGATTCGTACCACACACTTGGACTTGTTTACAATGCAATTGGTGATGATGTAAAGGCCATGGGATTCTACATGCTTGCTGCACA
TTTAATGCCAAAAGATTCATCTCTCTGGAAACTGCTATTTTCATGGTCAATTGACCGTGGTGATATTGATCAAGCAAGCTATTGTCTTTCTAAAGCAATAAAAGCAGAGC
CTGATGATATTAATTTATTATTTCATCGAGCATCACTCTACCTTGAGCGTGGAGATTGTGAGAAAGCAGCTGAAACATATGATCAGATTCATCAACAATGCCTTGGTAAC
GTTGAAGCACTCATGACAGGAGCAAAGCTGTACCAAAAATGTGGTCATCTTGAACGTGCAATTTGCATTCTTGAGGACTACATCAAAGGGCATCCATCTGAAGCTGATTT
AGATGTGGTTGATCTTTTAGCTTCTTTATACATGGGAAGTAAAGAATTCAGCAAAGCTCTTGAGCGCATTGAACATGCAGATAGGGTGTACTGTGCAGGAAATGAGCTAC
CTTTAAACTTGACAACTAAAGCAGGAATCTGCCATGCTCATCTTGGAGATTTGGAGAAGGCAGAGTGCCTCTTTGCTAATTTGAGACGGGAAACTACCTATGTTCACTCA
AATTTGATGATTGAAGTTGCAGACTCGTTGATGAGTCTTAAGCACTATAGCTGGGCATTGAAATATTATCTGATGTCCGAAGAAGTAAATGCTGGAGAGAACATGGGGAT
TTTATACCTAAAAATTGCCGAGTGTTACTTATCAACTAATGAAAGAGAACAGGCAATTGTTTTCTTTTATAAAGTGCTTCAACATGTTGAAGATAACATTAATGCTCGAT
TAACTTTGGCATCCCTCCTTCTTGAGGAAGCTAGAGACAAAGAAGCCATTTCATTACTATCTCCTCCAAAAGATTCAAACCCAACTAGCTCATCTTCCAGCAAATTAAAA
CCTTGGTGGCTCAATGAGAAAGTAAAACTGAAGCTTTGCCACATATACAGAATTAGAGGATTGCTTGAGAACTTCGTTGAGGTGATCTTTCCTTTGGTCCGAGAGTCCTT
ATATATCGAGACTCTTCAAGAAAAGATTAAAGTGAACAAGAAGAAGCTTCCAAGGAGGGTTTTGCTCGAGAGAGTCAAAGTACTAGATGGACGTGAAACTGGCAACCTAT
TTCGTGGATTCAAACCTGTAGCTCCGAAATCAGATTTAACAAAAGCATCCAGAGCAAAGAGATTACTTCAAAAAAGGGAAAGAATCAAGGAAGAAAAGAAGGCTAAAGCA
CTGGCTGCAGGAGTCAATTTGAGCTATGATGATTTAGATGATGAACCAGCGCTACGGATGCACCGAGAATCCCCCCTGCCTAACCTTCTGAAAGAAGAAGAATACCATAT
TCTTATTGTCGATTTGTGCAAGGCATTGGCTTCCTTAGGAAGATGTTCTGAAGCTTTAGAGATTATAAGCCTAACTTTAAAGTTGGCTTTTAACTCATTATCGATGGAAA
GGAAGGAAGAACTCCAGTTACTTGGAGCTCAATTAGCTTTCAGCTCAACTGGTACCATGCATGGTTTCAACTTTGCAAAGCACGTTGTTAAGCAGTACCCTTACAGCATC
TCGGCTTGGAACTGCTATTATAAAGTAGCTTCATGTTTGACGAACCGGGATTCAAGGCATTGCAAGCTTCTGAATAGCATGCAATCCAAATACAAAGATTGTGCACCACC
CTATATCATTGCGGGGCATCAGTTTACCACCATTAGCCATCATCAAGATGCTGCAAGGAAATATCTTGAAGCTTATAAAATCATGCCGGATAGTCCCCTAATTAACTTGT
GTGTTGGATCGTCCTTGATCAACTTGGCTCTCGGATTCCGCCTTCAAAACAAGCATCAGTGTGTTGCGCAGGGCTTGGCATTCCTCTACAAAAATTTGAAGCTTTGCGAT
AACAACCAGGAAGCCTTGTACAACATAGCTCGAGCATATCATCACATTGGACTGGTGACACTGGCAGTTACATATTACGAAAAGGTGCTCGCAACTTACCAGAAGGATTG
CCCCATTCCAGAACTTTTTGGTGAGAATCGAAATATTAAACATCAGAATTCAGTCTATTGTGACCTACGGAGAGAAGCAGCTTACAATTTGCATCTTATTTATAAAGAGA
GTGGAGCTCTTGATCTTGCTAGGCAAGTCCTAAAAGATCATTGCACATTTTAAGTACAAAGCATGTTTATTAGCAACAGCATAGAATGTAACAAAAAGAAAAAGTCTTAT
TTTATTTTGATATACATTGTACAAAGATGGAGGGTGTAGAATACGGTAGCACGTACGACCGAGTTTATAAAGATTCAGTTGAGTCAGTCCATTATGTATATATAACTATC
ATATCAATGCCGGAGATTAGTTATTAATTCAGCAGAATATAGTTTTATGTTTGTATCGGTAATTGAGGTAAATTTATTTTAGCCGAGGC
Protein sequenceShow/hide protein sequence
MEKEGSRISDCEEVPGEVMHVLGTEKEVVETGVEDREGEEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYEALAEKKRKAL
ANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISLLRQVVLRAPDLPDSYHTLGLVYNAIGDDVKA
MGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYI
KGHPSEADLDVVDLLASLYMGSKEFSKALERIEHADRVYCAGNELPLNLTTKAGICHAHLGDLEKAECLFANLRRETTYVHSNLMIEVADSLMSLKHYSWALKYYLMSEE
VNAGENMGILYLKIAECYLSTNEREQAIVFFYKVLQHVEDNINARLTLASLLLEEARDKEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYRIRGLLENFVEV
IFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFKPVAPKSDLTKASRAKRLLQKRERIKEEKKAKALAAGVNLSYDDLDDEPALRMHRESPLPN
LLKEEEYHILIVDLCKALASLGRCSEALEIISLTLKLAFNSLSMERKEELQLLGAQLAFSSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQ
SKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEK
VLATYQKDCPIPELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF