; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0009402 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0009402
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Descriptiongeneral transcription factor 3C polypeptide 3 isoform X1
Genome locationchr04:16732068..16752728
RNA-Seq ExpressionIVF0009402
SyntenyIVF0009402
Gene Ontology termsGO:0006383 - transcription by RNA polymerase III (biological process)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily
IPR019734 - Tetratricopeptide repeat
IPR039340 - Transcription factor Tfc4/TFIIIC-102/Sfc4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008447634.1 PREDICTED: general transcription factor 3C polypeptide 3 isoform X1 [Cucumis melo]0.093.72Show/hide
Query:  MEKEGNRISDSEEVPGDVMHVLGVEKEVETGVVDREEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYEALAEK
        MEKEGNRISDSEEVPGDVMHVLGVEKEVETGVVDREEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYEALAEK
Subjt:  MEKEGNRISDSEEVPGDVMHVLGVEKEVETGVVDREEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYEALAEK

Query:  KRKALANGQSERAAKRGRVEDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISLLRQVVLQAPDLPDSYH
        KRKALANGQSERAAKRGRVEDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISLLRQVVLQAPDLPDSYH
Subjt:  KRKALANGQSERAAKRGRVEDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISLLRQVVLQAPDLPDSYH

Query:  TLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGNVEA
        TLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGNVEA
Subjt:  TLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGNVEA

Query:  LMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVVDLLASLYMGSKEFSKALEHIEHADRVYCAGNELPLNLTAKAGICHAHLGNLEKAECLFANLRR
        LMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVVDLLASLYMGSKEFSKALEHIEHADRVYCAGNELPLNLTAKAGICHAHLGNLEKAECLFANLRR
Subjt:  LMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVVDLLASLYMGSKEFSKALEHIEHADRVYCAGNELPLNLTAKAGICHAHLGNLEKAECLFANLRR

Query:  ETTYDHSNLMIEVADSLLSLKHYSWALKYYLMSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLASLLLEEARDEEAISLL
        ETTYDHSNLMIEVADSLLSLKHYSWALKYYLMSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLASLLLEEARDEEAISLL
Subjt:  ETTYDHSNLMIEVADSLLSLKHYSWALKYYLMSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLASLLLEEARDEEAISLL

Query:  SPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPVAPK
        SPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPVAPK
Subjt:  SPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPVAPK

Query:  SDLTKASRAKRLLQKRDRIKEEKKAKLLAAGPAVTFLIVCSFLLPMLAKIQLRMLFFFSVMSLATDASRIPLPNLLKEEEHHILIVDLCKALASLGRCSE
        SDLTKASRAKRLLQKRDRIKEEKKAKLLAAG  V++  +         +  LRM                PLPNLLKEEEHHILIVDLCKALASLGRCSE
Subjt:  SDLTKASRAKRLLQKRDRIKEEKKAKLLAAGPAVTFLIVCSFLLPMLAKIQLRMLFFFSVMSLATDASRIPLPNLLKEEEHHILIVDLCKALASLGRCSE

Query:  ALEIISLTL------------------------NSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQ
        ALEIISLTL                        +STGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQ
Subjt:  ALEIISLTL------------------------NSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQ

Query:  FTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQ
        FTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQ
Subjt:  FTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQ

Query:  KDCPIPELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        KDCPIPELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
Subjt:  KDCPIPELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

XP_011652346.1 general transcription factor 3C polypeptide 3 [Cucumis sativus]0.090.53Show/hide
Query:  MEKEGNRISDSEEVPGDVMHVLGVEKEV-ETGVVDRE----EEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYE
        MEKEG+RISD EEVPG+VMHVLG EKEV ETGV DRE    EEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYE
Subjt:  MEKEGNRISDSEEVPGDVMHVLGVEKEV-ETGVVDRE----EEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYE

Query:  ALAEKKRKALANGQSERAAKRGRVEDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISLLRQVVLQAPDL
        ALAEKKRKALANGQSERAAKRGRVED++GASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQG+HEKAISLLRQVVL+APDL
Subjt:  ALAEKKRKALANGQSERAAKRGRVEDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISLLRQVVLQAPDL

Query:  PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL
        PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSI+RGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL
Subjt:  PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL

Query:  GNVEALMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVVDLLASLYMGSKEFSKALEHIEHADRVYCAGNELPLNLTAKAGICHAHLGNLEKAECLF
        GNVEALMTGAKLYQKCGHLERAICILEDYIK HPSEADLDVVDLLASLYMGSKEFSKALE IEHADRVYCAGNELPLNLT KAGICHAHLG+LEKAECLF
Subjt:  GNVEALMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVVDLLASLYMGSKEFSKALEHIEHADRVYCAGNELPLNLTAKAGICHAHLGNLEKAECLF

Query:  ANLRRETTYDHSNLMIEVADSLLSLKHYSWALKYYLMSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLASLLLEEARDEE
        ANLRRETTYDHSNLMIEVADSL+SLKHYSWALKYYLMSEEVNAGENMGILY K+AECYLSTNE+EQAIVFFYKVLQHVEDNINARLTLASLLLEEARD+E
Subjt:  ANLRRETTYDHSNLMIEVADSLLSLKHYSWALKYYLMSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLASLLLEEARDEE

Query:  AISLLSPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFR
        AISLLSPPKDSNPTSSSS KLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGF+
Subjt:  AISLLSPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFR

Query:  PVAPKSDLTKASRAKRLLQKRDRIKEEKKAKLLAAGPAVTFLIVCSFLLPMLAKIQLRMLFFFSVMSLATDASRIPLPNLLKEEEHHILIVDLCKALASL
        PVAPKSDLTKASRAKRLLQKR+RIKEEKKAK LAAG  +++  +         +  LRM                PLPNLLKEEE+HILIVDLCKALASL
Subjt:  PVAPKSDLTKASRAKRLLQKRDRIKEEKKAKLLAAGPAVTFLIVCSFLLPMLAKIQLRMLFFFSVMSLATDASRIPLPNLLKEEEHHILIVDLCKALASL

Query:  GRCSEALEIISLTL------------------------NSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQAKYKDCAPPYI
        GRCSEALEIISLTL                        +STGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQ+KYKDCAPPYI
Subjt:  GRCSEALEIISLTL------------------------NSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQAKYKDCAPPYI

Query:  IAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKV
        IAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKV
Subjt:  IAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKV

Query:  LATYQKDCPIPELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        LATYQKDCPIPELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
Subjt:  LATYQKDCPIPELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

XP_022152620.1 general transcription factor 3C polypeptide 3 isoform X2 [Momordica charantia]0.080.76Show/hide
Query:  MEKEGNRISDSEEVPGDVMHVLGVEKEV----ETGVVDREEEEEEEEG------------EEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQ
        MEKEGN ISD++EVPG     +GVE EV    ET V +REEEEEEEE             EEEVEDEGEDDIEEEDGY FKFKAGENPFDFVEGTDFS+Q
Subjt:  MEKEGNRISDSEEVPGDVMHVLGVEKEV----ETGVVDREEEEEEEEG------------EEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQ

Query:  PYKKFERLEYEALAEKKRKALANGQSERAAKRGRVEDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISL
        PYKKFERLEYEALAEKKRKALAN QSER  KRGR+ED+ GASF+EI+EAMNYGSRRKLKEPK+RGRRKGSKKK+NR++TKLLGDATLCYAQGQ+EKAIS+
Subjt:  PYKKFERLEYEALAEKKRKALANGQSERAAKRGRVEDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISL

Query:  LRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAA
        LRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP+DSSLWKLLFSWSI+RGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDC+KAA
Subjt:  LRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAA

Query:  ETYDQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVVDLLASLYMGSKEFSKALEHIEHADRVYCAGNELPLNLTAKAGICHAH
        ETYDQIHQ+C+GNVEALMTGAKLYQKCGH ERAICILEDYIK HP+EADLDVVDLLASLYMGSKEFSKALEHIEHAD+VYCAGNE+PLNL  KAGICH H
Subjt:  ETYDQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVVDLLASLYMGSKEFSKALEHIEHADRVYCAGNELPLNLTAKAGICHAH

Query:  LGNLEKAECLFANLRRETTYDHSNLMIEVADSLLSLKHYSWALKYYLMSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLA
        LGN+EKAE LFANL R+T  DHS+ +IE ADSLLSLKH++ ALKYYLMSEEVNAG  MGI+Y K+A+CYLSTNE+ +AIVFFYKVLQ +EDNINARLTLA
Subjt:  LGNLEKAECLFANLRRETTYDHSNLMIEVADSLLSLKHYSWALKYYLMSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLA

Query:  SLLLEEARDEEAISLLSPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDG
        SLLLEEAR+EEAISLLSPPKDSN +SSSS K KPWWLNEKVKLKLC+IYRT+G+LENFVE IF LVRESLYIETL+EKIKVNKKKLPRRVLLERVKVLDG
Subjt:  SLLLEEARDEEAISLLSPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDG

Query:  RETGNLFRGFRPVAPKSDLTKASRAKRLLQKRDRIKEEKKAKLLAAGPAVTFLIVCSFLLPMLAKIQLRMLFFFSVMSLATDASRIPLPNLLKEEEHHIL
        RETG+LFRGFRPVAPKSDL+KASRAKRLLQKR+RIKEEKKA+ LAAG  V++  V         +  LR+                PLPNLLK+EE+H L
Subjt:  RETGNLFRGFRPVAPKSDLTKASRAKRLLQKRDRIKEEKKAKLLAAGPAVTFLIVCSFLLPMLAKIQLRMLFFFSVMSLATDASRIPLPNLLKEEEHHIL

Query:  IVDLCKALASLGRCSEALEIISLTL------------------------NSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQ
        IVDLCKALASLGRCSEALEIISLTL                        +ST T HGFNFAKHVVKQYPYS SAWNCYYKV+S +T+RDSRHCKLLNS+Q
Subjt:  IVDLCKALASLGRCSEALEIISLTL------------------------NSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQ

Query:  AKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGL
        AKYKDCAPP+IIAGHQF  ISHHQ+AA+KYLEAYK++PDSPLINLCVG++LINLALG RLQNKHQCVAQGLAFLY NLKLCDN+QEALYNIARAYHHIGL
Subjt:  AKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGL

Query:  VTLAVTYYEKVLATYQKDCPIPELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        VTLAVTYYEKVLATYQKDCPIP++FGENRNIKH+ SVYCDLRREAAYNLHLIYK+SGALDLARQVLKDHCTF
Subjt:  VTLAVTYYEKVLATYQKDCPIPELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

XP_022997988.1 general transcription factor 3C polypeptide 3 [Cucurbita maxima]0.082.06Show/hide
Query:  MEKEGNRISDSEEVPG-DVMHVLGVEKEVETGVVDREEEEEEEE--GEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYEAL
        ME+EGN ISD+EEVPG DV     VE EVE    + EEEEEEEE   EEEVEDEGEDD EEEDGY FKFKAGENPFDFVEGTDFS+QPYKKFERLEYEAL
Subjt:  MEKEGNRISDSEEVPG-DVMHVLGVEKEVETGVVDREEEEEEEE--GEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYEAL

Query:  AEKKRKALANGQSERAAKRGRVEDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISLLRQVVLQAPDLPD
        AEKKRKALANGQSER+AKRGRVED++GASF+EI+EAMNYGSRRK ++PKKRGRRKGSK+KLN DVTKLLGDATLCYAQGQ+EKAIS+L QVVLQAPD+PD
Subjt:  AEKKRKALANGQSERAAKRGRVEDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISLLRQVVLQAPDLPD

Query:  SYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGN
        SYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSI+RGDIDQASYCLSKAIKAEPDDINLLF+RASLYLERGDC+KAAETYDQIHQ  + N
Subjt:  SYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGN

Query:  VEALMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVVDLLASLYMGSKEFSKALEHIEHADRVYCAGNELPLNLTAKAGICHAHLGNLEKAECLFAN
        VEALMTGAKLYQKCGHLERAICILE+YIK HP EADLDVVDLLASLYMGSKEFSKALE IEHAD VYCA NELPLNLTAKAGICH HLGN EKAECLFAN
Subjt:  VEALMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVVDLLASLYMGSKEFSKALEHIEHADRVYCAGNELPLNLTAKAGICHAHLGNLEKAECLFAN

Query:  LRRETTYDHSNLMIEVADSLLSLKHYSWALKYYLMSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLASLLLEEARDEEAI
        LRRE   + SNLMIEVADSLLSLKHY+ ALKYYLMSEEVNAG N+GILY K+A+CY STNE+ +AIVFFYKVLQH+EDNINARLTLASLLLEEAR+EEAI
Subjt:  LRRETTYDHSNLMIEVADSLLSLKHYSWALKYYLMSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLASLLLEEARDEEAI

Query:  SLLSPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPV
        SLLSPPKDSN TSSSS K KPWWLNE+VKLKLCHI+RT+G+LENFVE IFPLVRESLYIETL EKIKVNKKKLP+RVLLERVKVLDGR+TG LFRGFRPV
Subjt:  SLLSPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPV

Query:  APKSDLTKASRAKRLLQKRDRIKEEKKAKLLAAGPAVTFLIVCSFLLPMLAKIQLRMLFFFSVMSLATDASRIPLPNLLKEEEHHILIVDLCKALASLGR
        APKSDL+KASRAK+LLQKR+RI+EEKKA+ LAAG  + +                     F    +       PLPNLLK+EE+H LIVDLCKALASLGR
Subjt:  APKSDLTKASRAKRLLQKRDRIKEEKKAKLLAAGPAVTFLIVCSFLLPMLAKIQLRMLFFFSVMSLATDASRIPLPNLLKEEEHHILIVDLCKALASLGR

Query:  CSEALEIISLTL------------------------NSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQAKYKDCAPPYIIA
        CSEALEIISLTL                        +ST T HGFNFAKHVVKQYPYS SAWNCYYKV+S +TNRDSRHCKLLNSMQ KYKDCAPPYIIA
Subjt:  CSEALEIISLTL------------------------NSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQAKYKDCAPPYIIA

Query:  GHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLA
        GHQFT ISHHQDAARKYLEAYK++PDSPLINLCVG++LINL LGFRLQNKHQCVAQGLAFLYKNLKLCDN+QEALYNIARAYHHIGLVTLAVTYYEKVLA
Subjt:  GHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLA

Query:  TYQKDCPIPELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        TYQKDCPIPELFGEN++ KHQ SVYCDLRREAAYNLHLIYKESGALDLARQVLKD+CTF
Subjt:  TYQKDCPIPELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

XP_038879313.1 general transcription factor 3C polypeptide 3 [Benincasa hispida]0.087.37Show/hide
Query:  MEKEGNRISDSEEVPGDVMHVLGVEKEV-ETGVVDREEEEEEEE-GEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYEALA
        MEKEGN +SD+EEVPG  M VLG  KEV ETGV DREEEEEEEE GEEEVEDEGEDDIEEEDGY FKFKAGENPFDFVEGTDFS+QPYKKFERLEYEALA
Subjt:  MEKEGNRISDSEEVPGDVMHVLGVEKEV-ETGVVDREEEEEEEE-GEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYEALA

Query:  EKKRKALANGQSERAAKRGRVEDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISLLRQVVLQAPDLPDS
        EKKRKALANGQSERAAKRGRVED+ GASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKA+S+LRQVVLQAPDLPDS
Subjt:  EKKRKALANGQSERAAKRGRVEDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISLLRQVVLQAPDLPDS

Query:  YHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGNV
        YHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSI+RGDIDQASYCLSKAIKAEPDDI+LLFHRASLYLERGDCEKAAETYDQIHQQCLGNV
Subjt:  YHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGNV

Query:  EALMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVVDLLASLYMGSKEFSKALEHIEHADRVYCAGNELPLNLTAKAGICHAHLGNLEKAECLFANL
        EALMTGAKLYQKCGHLERAICILEDYIKEHP+EADLDVVDLLASLYMG+KEF KALE IEHAD VYCAG+ELPL LT K GICHAHLGN+EKAECLFANL
Subjt:  EALMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVVDLLASLYMGSKEFSKALEHIEHADRVYCAGNELPLNLTAKAGICHAHLGNLEKAECLFANL

Query:  RRETTYDHSNLMIEVADSLLSLKHYSWALKYYLMSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLASLLLEEARDEEAIS
          ET  DHSNLMIEVADSLLSLKHY+ ALKYYLM EEVNAG NMGILY K+A+CYLSTNE+ QAIVFFYKVLQH+EDNINARLTLASLLLEEARDEEAIS
Subjt:  RRETTYDHSNLMIEVADSLLSLKHYSWALKYYLMSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLASLLLEEARDEEAIS

Query:  LLSPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPVA
        LLSPPKDSNPTSSSS KLKPWW NEKVKLKLCHIY+TRG+LE+FVEVIFPLVRESLYIETLQEKIKVNKKKLP+RVLLERVKVLDGRE+GNLFRGFRPVA
Subjt:  LLSPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPVA

Query:  PKSDLTKASRAKRLLQKRDRIKEEKKAKLLAAGPAVTFLIVCSFLLPMLAKIQLRMLFFFSVMSLATDASRIPLPNLLKEEEHHILIVDLCKALASLGRC
        PKSDL+KASRAKRLLQKR+RIKEEKKAK LAAG  V +  +         +  LRM                PLPNLLKEEE+H LIVDLCKALASLGRC
Subjt:  PKSDLTKASRAKRLLQKRDRIKEEKKAKLLAAGPAVTFLIVCSFLLPMLAKIQLRMLFFFSVMSLATDASRIPLPNLLKEEEHHILIVDLCKALASLGRC

Query:  SEALEIISLTL------------------------NSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQAKYKDCAPPYIIAG
        SEALEIISLTL                        +ST T HGFNFAKHVVKQYPYSISAWNCYYKVAS LTNRDSRHCKLLNSMQAKYKDCAPPYIIAG
Subjt:  SEALEIISLTL------------------------NSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQAKYKDCAPPYIIAG

Query:  HQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLAT
        HQFT ISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDN+QEALYNIARAYHHIGLVTLAVTYYEKVLAT
Subjt:  HQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLAT

Query:  YQKDCPIPELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        YQKDCPIPELFGENRNIKHQ SVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
Subjt:  YQKDCPIPELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

TrEMBL top hitse value%identityAlignment
A0A0A0LGB1 Uncharacterized protein0.0e+0092.96Show/hide
Query:  MEKEGNRISDSEEVPGDVMHVLGVEKE-VETGVVDR----EEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYE
        MEKEG+RISD EEVPG+VMHVLG EKE VETGV DR    EEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYE
Subjt:  MEKEGNRISDSEEVPGDVMHVLGVEKE-VETGVVDR----EEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYE

Query:  ALAEKKRKALANGQSERAAKRGRVEDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISLLRQVVLQAPDL
        ALAEKKRKALANGQSERAAKRGRVED++GASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQG+HEKAISLLRQVVL+APDL
Subjt:  ALAEKKRKALANGQSERAAKRGRVEDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISLLRQVVLQAPDL

Query:  PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL
        PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSI+RGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL
Subjt:  PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL

Query:  GNVEALMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVVDLLASLYMGSKEFSKALEHIEHADRVYCAGNELPLNLTAKAGICHAHLGNLEKAECLF
        GNVEALMTGAKLYQKCGHLERAICILEDYIK HPSEADLDVVDLLASLYMGSKEFSKALE IEHADRVYCAGNELPLNLT KAGICHAHLG+LEKAECLF
Subjt:  GNVEALMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVVDLLASLYMGSKEFSKALEHIEHADRVYCAGNELPLNLTAKAGICHAHLGNLEKAECLF

Query:  ANLRRETTYDHSNLMIEVADSLLSLKHYSWALKYYLMSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLASLLLEEARDEE
        ANLRRETTYDHSNLMIEVADSL+SLKHYSWALKYYLMSEEVNAGENMGILY K+AECYLSTNE+EQAIVFFYKVLQHVEDNINARLTLASLLLEEARD+E
Subjt:  ANLRRETTYDHSNLMIEVADSLLSLKHYSWALKYYLMSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLASLLLEEARDEE

Query:  AISLLSPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFR
        AISLLSPPKDSNPTSSSS KLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGF+
Subjt:  AISLLSPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFR

Query:  PVAPKSDLTKASRAKRLLQKRDRIKEEKKAKLLAAGPAVTFLIVCSFLLPMLAKIQLRMLFFFSVMSLATDASRIPLPNLLKEEEHHILIVDLCKALASL
        PVAPKSDLTKASRAKRLLQKR+RIKEEKKAK LAAG  +++         +  +  LRM                PLPNLLKEEE+HILIVDLCKALASL
Subjt:  PVAPKSDLTKASRAKRLLQKRDRIKEEKKAKLLAAGPAVTFLIVCSFLLPMLAKIQLRMLFFFSVMSLATDASRIPLPNLLKEEEHHILIVDLCKALASL

Query:  GRCSEALEIISLTLNSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYK
        GRCSEALEIISLTLNSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQ+KYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYK
Subjt:  GRCSEALEIISLTLNSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYK

Query:  IMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNIKHQN
        IMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNIKHQN
Subjt:  IMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNIKHQN

Query:  SVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        SVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
Subjt:  SVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

A0A1S3BHB9 general transcription factor 3C polypeptide 3 isoform X10.0e+0093.72Show/hide
Query:  MEKEGNRISDSEEVPGDVMHVLGVEKEVETGVVDREEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYEALAEK
        MEKEGNRISDSEEVPGDVMHVLGVEKEVETGVVDREEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYEALAEK
Subjt:  MEKEGNRISDSEEVPGDVMHVLGVEKEVETGVVDREEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYEALAEK

Query:  KRKALANGQSERAAKRGRVEDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISLLRQVVLQAPDLPDSYH
        KRKALANGQSERAAKRGRVEDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISLLRQVVLQAPDLPDSYH
Subjt:  KRKALANGQSERAAKRGRVEDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISLLRQVVLQAPDLPDSYH

Query:  TLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGNVEA
        TLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGNVEA
Subjt:  TLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGNVEA

Query:  LMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVVDLLASLYMGSKEFSKALEHIEHADRVYCAGNELPLNLTAKAGICHAHLGNLEKAECLFANLRR
        LMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVVDLLASLYMGSKEFSKALEHIEHADRVYCAGNELPLNLTAKAGICHAHLGNLEKAECLFANLRR
Subjt:  LMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVVDLLASLYMGSKEFSKALEHIEHADRVYCAGNELPLNLTAKAGICHAHLGNLEKAECLFANLRR

Query:  ETTYDHSNLMIEVADSLLSLKHYSWALKYYLMSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLASLLLEEARDEEAISLL
        ETTYDHSNLMIEVADSLLSLKHYSWALKYYLMSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLASLLLEEARDEEAISLL
Subjt:  ETTYDHSNLMIEVADSLLSLKHYSWALKYYLMSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLASLLLEEARDEEAISLL

Query:  SPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPVAPK
        SPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPVAPK
Subjt:  SPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPVAPK

Query:  SDLTKASRAKRLLQKRDRIKEEKKAKLLAAGPAVTFLIVCSFLLPMLAKIQLRMLFFFSVMSLATDASRIPLPNLLKEEEHHILIVDLCKALASLGRCSE
        SDLTKASRAKRLLQKRDRIKEEKKAKLLAAG  V++         +  +  LRM                PLPNLLKEEEHHILIVDLCKALASLGRCSE
Subjt:  SDLTKASRAKRLLQKRDRIKEEKKAKLLAAGPAVTFLIVCSFLLPMLAKIQLRMLFFFSVMSLATDASRIPLPNLLKEEEHHILIVDLCKALASLGRCSE

Query:  ALEIISLTL------------------------NSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQ
        ALEIISLTL                        +STGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQ
Subjt:  ALEIISLTL------------------------NSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQ

Query:  FTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQ
        FTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQ
Subjt:  FTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQ

Query:  KDCPIPELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        KDCPIPELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
Subjt:  KDCPIPELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

A0A6J1DEF9 general transcription factor 3C polypeptide 3 isoform X20.0e+0080.6Show/hide
Query:  MEKEGNRISDSEEVPGDVMHVLG-------------VEKEVETGVVDREEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYK
        MEKEGN ISD++EVPG  + V G              E+E E    + EEEEEEEE EEEVEDEGEDDIEEEDGY FKFKAGENPFDFVEGTDFS+QPYK
Subjt:  MEKEGNRISDSEEVPGDVMHVLG-------------VEKEVETGVVDREEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYK

Query:  KFERLEYEALAEKKRKALANGQSERAAKRGRVEDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISLLRQ
        KFERLEYEALAEKKRKALAN QSER  KRGR+ED+ GASF+EI+EAMNYGSRRKLKEPK+RGRRKGSKKK+NR++TKLLGDATLCYAQGQ+EKAIS+LRQ
Subjt:  KFERLEYEALAEKKRKALANGQSERAAKRGRVEDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISLLRQ

Query:  VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY
        VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP+DSSLWKLLFSWSI+RGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDC+KAAETY
Subjt:  VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY

Query:  DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVVDLLASLYMGSKEFSKALEHIEHADRVYCAGNELPLNLTAKAGICHAHLGN
        DQIHQ+C+GNVEALMTGAKLYQKCGH ERAICILEDYIK HP+EADLDVVDLLASLYMGSKEFSKALEHIEHAD+VYCAGNE+PLNL  KAGICH HLGN
Subjt:  DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVVDLLASLYMGSKEFSKALEHIEHADRVYCAGNELPLNLTAKAGICHAHLGN

Query:  LEKAECLFANLRRETTYDHSNLMIEVADSLLSLKHYSWALKYYLMSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLASLL
        +EKAE LFANL R+T  DHS+ +IE ADSLLSLKH++ ALKYYLMSEEVNAG  MGI+Y K+A+CYLSTNE+ +AIVFFYKVLQ +EDNINARLTLASLL
Subjt:  LEKAECLFANLRRETTYDHSNLMIEVADSLLSLKHYSWALKYYLMSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLASLL

Query:  LEEARDEEAISLLSPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRET
        LEEAR+EEAISLLSPPKDSN +SSSS K KPWWLNEKVKLKLC+IYRT+G+LENFVE IF LVRESLYIETL+EKIKVNKKKLPRRVLLERVKVLDGRET
Subjt:  LEEARDEEAISLLSPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRET

Query:  GNLFRGFRPVAPKSDLTKASRAKRLLQKRDRIKEEKKAKLLAAGPAVTFLIVCSFLLPMLAKIQLRMLFFFSVMSLATDASRIPLPNLLKEEEHHILIVD
        G+LFRGFRPVAPKSDL+KASRAKRLLQKR+RIKEEKKA+ LAAG  V++  V         +  LR+                PLPNLLK+EE+H LIVD
Subjt:  GNLFRGFRPVAPKSDLTKASRAKRLLQKRDRIKEEKKAKLLAAGPAVTFLIVCSFLLPMLAKIQLRMLFFFSVMSLATDASRIPLPNLLKEEEHHILIVD

Query:  LCKALASLGRCSEALEIISLTL------------------------NSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQAKY
        LCKALASLGRCSEALEIISLTL                        +ST T HGFNFAKHVVKQYPYS SAWNCYYKV+S +T+RDSRHCKLLNS+QAKY
Subjt:  LCKALASLGRCSEALEIISLTL------------------------NSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQAKY

Query:  KDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTL
        KDCAPP+IIAGHQF  ISHHQ+AA+KYLEAYK++PDSPLINLCVG++LINLALG RLQNKHQCVAQGLAFLY NLKLCDN+QEALYNIARAYHHIGLVTL
Subjt:  KDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTL

Query:  AVTYYEKVLATYQKDCPIPELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        AVTYYEKVLATYQKDCPIP++FGENRNIKH+ SVYCDLRREAAYNLHLIYK+SGALDLARQVLKDHCTF
Subjt:  AVTYYEKVLATYQKDCPIPELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

A0A6J1GA07 general transcription factor 3C polypeptide 30.0e+0082.05Show/hide
Query:  MEKEGNRISDSEEVPG-DVMHVLGVEKEVE-TGVVDREEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYEALA
        ME+EGN ISD+EEVPG DV     VE EVE     + EEEEEEEE EEEVEDEGEDD EEEDGY FKFKAGENPFDFVEGTDFS+QPYKKFERLEYEALA
Subjt:  MEKEGNRISDSEEVPG-DVMHVLGVEKEVE-TGVVDREEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYEALA

Query:  EKKRKALANGQSERAAKRGRVEDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISLLRQVVLQAPDLPDS
        EKKRKALA+GQSER+AKRGRVED++GASF+EI+EAMNYGSRRK ++PKKRGRRKGSK+KLN DVTKLLGDATLCYAQGQ+EKAIS+L QVVLQAPD+PDS
Subjt:  EKKRKALANGQSERAAKRGRVEDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISLLRQVVLQAPDLPDS

Query:  YHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGNV
        YHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSI+RGDIDQASYCLSKAIKAEPDDINLLF+RASLYLERGDC+KAAETYDQIHQ  + NV
Subjt:  YHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGNV

Query:  EALMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVVDLLASLYMGSKEFSKALEHIEHADRVYCAGNELPLNLTAKAGICHAHLGNLEKAECLFANL
        EALMTGAKLYQKCGHLERAICILE+YIK HP EADLDVVDLLASLYMGSKEFSKALE IEHAD VYCA NELPLNLTAKAGICH HLGN EKAECLFANL
Subjt:  EALMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVVDLLASLYMGSKEFSKALEHIEHADRVYCAGNELPLNLTAKAGICHAHLGNLEKAECLFANL

Query:  RRETTYDHSNLMIEVADSLLSLKHYSWALKYYLMSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLASLLLEEARDEEAIS
        RRE   + SNLMIEVADSLLSLKHY+ ALKYYLMSEEVNAG N+GILY K+A+CY STNE+ +AIVFFYKVLQH+EDNINARLTLASLLLEEAR+EEAIS
Subjt:  RRETTYDHSNLMIEVADSLLSLKHYSWALKYYLMSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLASLLLEEARDEEAIS

Query:  LLSPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPVA
        LLSPPKDSN TSSSS K KPWWLNE+VKLKLCHI+RT+G+LENFVE IFPLVRESLYIETL EKIKVNKKKLP+RVLLERVKVLDGR+TG LFRGFRPVA
Subjt:  LLSPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPVA

Query:  PKSDLTKASRAKRLLQKRDRIKEEKKAKLLAAGPAVTFLIVCSFLLPMLAKIQLRMLFFFSVMSLATDASRIPLPNLLKEEEHHILIVDLCKALASLGRC
        PKSDL+KASRAK+LLQKR+RI+EEKKA+ LAAG  + +                     F    +       PLPNLLK+EE+H LIVDLCKALASLGRC
Subjt:  PKSDLTKASRAKRLLQKRDRIKEEKKAKLLAAGPAVTFLIVCSFLLPMLAKIQLRMLFFFSVMSLATDASRIPLPNLLKEEEHHILIVDLCKALASLGRC

Query:  SEALEIISLTL------------------------NSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQAKYKDCAPPYIIAG
        SEALEIISLTL                        +ST T HGFNFAKHVVKQYPYS SAWNCYYKV+S +TNRDSRHCKLLNSMQ KYKDCAPPYIIAG
Subjt:  SEALEIISLTL------------------------NSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQAKYKDCAPPYIIAG

Query:  HQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLAT
        HQFT ISHHQDAARKYLEAYK++PDSPLINLCVG++LINL LGFRLQNKHQCVAQGLAFLYKNLKLCDN+QEALYNIARAYHHIGLVTLAVTYYEKVLAT
Subjt:  HQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLAT

Query:  YQKDCPIPELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        YQKDCPIPELFGEN++ KHQ SVYCDLRREAAYNLHLIYKESGALDLARQVLKD+CTF
Subjt:  YQKDCPIPELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

A0A6J1KFI7 general transcription factor 3C polypeptide 30.0e+0082.06Show/hide
Query:  MEKEGNRISDSEEVPG-DVMHVLGVEKEVE--TGVVDREEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYEAL
        ME+EGN ISD+EEVPG DV     VE EVE      + EEEEEEEE EEEVEDEGEDD EEEDGY FKFKAGENPFDFVEGTDFS+QPYKKFERLEYEAL
Subjt:  MEKEGNRISDSEEVPG-DVMHVLGVEKEVE--TGVVDREEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYEAL

Query:  AEKKRKALANGQSERAAKRGRVEDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISLLRQVVLQAPDLPD
        AEKKRKALANGQSER+AKRGRVED++GASF+EI+EAMNYGSRRK ++PKKRGRRKGSK+KLN DVTKLLGDATLCYAQGQ+EKAIS+L QVVLQAPD+PD
Subjt:  AEKKRKALANGQSERAAKRGRVEDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISLLRQVVLQAPDLPD

Query:  SYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGN
        SYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSI+RGDIDQASYCLSKAIKAEPDDINLLF+RASLYLERGDC+KAAETYDQIHQ  + N
Subjt:  SYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGN

Query:  VEALMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVVDLLASLYMGSKEFSKALEHIEHADRVYCAGNELPLNLTAKAGICHAHLGNLEKAECLFAN
        VEALMTGAKLYQKCGHLERAICILE+YIK HP EADLDVVDLLASLYMGSKEFSKALE IEHAD VYCA NELPLNLTAKAGICH HLGN EKAECLFAN
Subjt:  VEALMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVVDLLASLYMGSKEFSKALEHIEHADRVYCAGNELPLNLTAKAGICHAHLGNLEKAECLFAN

Query:  LRRETTYDHSNLMIEVADSLLSLKHYSWALKYYLMSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLASLLLEEARDEEAI
        LRRE   + SNLMIEVADSLLSLKHY+ ALKYYLMSEEVNAG N+GILY K+A+CY STNE+ +AIVFFYKVLQH+EDNINARLTLASLLLEEAR+EEAI
Subjt:  LRRETTYDHSNLMIEVADSLLSLKHYSWALKYYLMSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLASLLLEEARDEEAI

Query:  SLLSPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPV
        SLLSPPKDSN TSSSS K KPWWLNE+VKLKLCHI+RT+G+LENFVE IFPLVRESLYIETL EKIKVNKKKLP+RVLLERVKVLDGR+TG LFRGFRPV
Subjt:  SLLSPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPV

Query:  APKSDLTKASRAKRLLQKRDRIKEEKKAKLLAAGPAVTFLIVCSFLLPMLAKIQLRMLFFFSVMSLATDASRIPLPNLLKEEEHHILIVDLCKALASLGR
        APKSDL+KASRAK+LLQKR+RI+EEKKA+ LAAG  + +                     F    +       PLPNLLK+EE+H LIVDLCKALASLGR
Subjt:  APKSDLTKASRAKRLLQKRDRIKEEKKAKLLAAGPAVTFLIVCSFLLPMLAKIQLRMLFFFSVMSLATDASRIPLPNLLKEEEHHILIVDLCKALASLGR

Query:  CSEALEIISLTL------------------------NSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQAKYKDCAPPYIIA
        CSEALEIISLTL                        +ST T HGFNFAKHVVKQYPYS SAWNCYYKV+S +TNRDSRHCKLLNSMQ KYKDCAPPYIIA
Subjt:  CSEALEIISLTL------------------------NSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQAKYKDCAPPYIIA

Query:  GHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLA
        GHQFT ISHHQDAARKYLEAYK++PDSPLINLCVG++LINL LGFRLQNKHQCVAQGLAFLYKNLKLCDN+QEALYNIARAYHHIGLVTLAVTYYEKVLA
Subjt:  GHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLA

Query:  TYQKDCPIPELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        TYQKDCPIPELFGEN++ KHQ SVYCDLRREAAYNLHLIYKESGALDLARQVLKD+CTF
Subjt:  TYQKDCPIPELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

SwissProt top hitse value%identityAlignment
O67178 Uncharacterized protein aq_10881.2e-0423.93Show/hide
Query:  YAQGQHEKAISLLRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRAS
        +  G+ +KA  LL QV   + +       LGL+Y  +G    A      A      D +L+  L +    +G +++A +   +A+   P+ + +L++   
Subjt:  YAQGQHEKAISLLRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRAS

Query:  LYLERGDCEKAAETYDQ
        L+L +G+ EKA + +++
Subjt:  LYLERGDCEKAAETYDQ

O74458 Transcription factor tau subunit sfc41.3e-2520.9Show/hide
Query:  EDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQ-GQHEKAISLLRQVVLQAPDLPDSYHTLGLVYNAIGDD----VK
        +D+A   ++E L+A+  G R+  K  K RGR   +    + +V ++L  A   +AQ G  ++A  L  ++V    ++  ++  LG  +   G+      K
Subjt:  EDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQ-GQHEKAISLLRQVVLQAPDLPDSYHTLGLVYNAIGDD----VK

Query:  AMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDIN----LLFHRASLYLERGDCEKAAETYDQIHQQCLGNVEALMTGAKLYQK
         +  +M AAHL PKD  LW      S      DQA YC ++A+ A+P + +     +++R+ L  E G  +KAAE +  + Q    N   L   A++Y K
Subjt:  AMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDIN----LLFHRASLYLERGDCEKAAETYDQIHQQCLGNVEALMTGAKLYQK

Query:  CGHLERAIC----ILEDYIKEHPSEA------DLDVVDLLASLYMGSKEFSKALEHIEHADRVY----------------------------CAGNE---
          H  R I     I   Y  ++P+        DL  ++L A L +   ++S  +  I    R +                             A  E   
Subjt:  CGHLERAIC----ILEDYIKEHPSEA------DLDVVDLLASLYMGSKEFSKALEHIEHADRVY----------------------------CAGNE---

Query:  -----LPLNLTAKAGICHAHLGNLEKAECLFANLRRETTYDHSNLMIEVADSLLSLKHYSWALKYYLMSEEVNAGENMGILYQKVAECYLSTNEKEQAIV
             LP     K GI     G L +AE  F+ ++         ++ ++A + + ++    AL+Y+++       +N+G+ Y  +  CYL   E E A  
Subjt:  -----LPLNLTAKAGICHAHLGNLEKAECLFANLRRETTYDHSNLMIEVADSLLSLKHYSWALKYYLMSEEVNAGENMGILYQKVAECYLSTNEKEQAIV

Query:  FFYKVLQHVEDNINARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIK
            +L     N NA + LA +   +   + A+ +++                             +I+  R    N  E+     +   + + +  ++ 
Subjt:  FFYKVLQHVEDNINARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIK

Query:  VNKKKLPRRVLLERVKVLDGRETGNLFRGFRPVAPKSDLTKASRAKRLLQKRDRIKEEKKAKLLAAGPAVTFLIVCSFLLPMLAKIQLR-----------
        V  +K+P+    +R ++   +E    F  ++    +    K    ++ L+K + + E     L  A   +   +      P   K + R           
Subjt:  VNKKKLPRRVLLERVKVLDGRETGNLFRGFRPVAPKSDLTKASRAKRLLQKRDRIKEEKKAKLLAAGPAVTFLIVCSFLLPMLAKIQLR-----------

Query:  --------MLFFFSVMSLATDASRIPLPNLLKE--------EEHHILIVDLCKALASLGRCSEALEIISLTLNS----TGTMHGFNFAKHVVKQYPYSIS
                M+   +     T    + L  +L+         +  + L V+    L  +G   +A ++++  + +      T+   N    ++    Y+  
Subjt:  --------MLFFFSVMSLATDASRIPLPNLLKE--------EEHHILIVDLCKALASLGRCSEALEIISLTLNS----TGTMHGFNFAKHVVKQYPYSIS

Query:  AWNCYYKVASCLTNRDSRH--CKLLNSMQAKYKDCAPPYIIAGHQ-------------------------------FTTISHHQD---------------
               +    T    R    +L +++ ++  +C+  ++ + +Q                                 T+    D               
Subjt:  AWNCYYKVASCLTNRDSRH--CKLLNSMQAKYKDCAPPYIIAGHQ-------------------------------FTTISHHQD---------------

Query:  ----AARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDN-----NQEALYNIARAYHHIGLVTLAVTYYEKVLATYQ
            A   Y  A+ I PD P+ NL +G + ++ A+     N+H  + QG  FLY+   L  N      QEALYN+ +AYH IGL   AV YYE VL    
Subjt:  ----AARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDN-----NQEALYNIARAYHHIGLVTLAVTYYEKVLATYQ

Query:  KDCPIPELFGENRNIKHQN-SVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
             P   G+         S   D   EAAYNL LIY  SG + LA Q+   +  F
Subjt:  KDCPIPELFGENRNIKHQN-SVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

P33339 Transcription factor tau 131 kDa subunit5.4e-1620.13Show/hide
Query:  EEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYEALAEKKRKALANGQSERAAKRGRVEDVAGASFDEILEAMN
        ++E++ +  E E  D G+ + E+E+         ++     E  D  V    +    EY   +E+    LA             ED   A    I EA N
Subjt:  EEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYEALAEKKRKALANGQSERAAKRGRVEDVAGASFDEILEAMN

Query:  YGSRRKLKEPK-KRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISLLRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWK
        +  ++K K  K K   R+  ++ L+ +V +LL  A   + +   + A  L  +V+ +      +Y TLG +Y   G        + LAAHL   D   WK
Subjt:  YGSRRKLKEPK-KRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISLLRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWK

Query:  LLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQ-------------CLGNVEALMTGAKLYQKC--GHLERAIC
        ++   S +   + QA YC S+ I   P +   ++ R+ LY + G   +A + + +++                  + + +    +LY K    ++ER   
Subjt:  LLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQ-------------CLGNVEALMTGAKLYQKC--GHLERAIC

Query:  IL------------------EDYIKEHPSEADLD------------------------VVDLLASLYM--------GSKEFSKALEHIEHA---------
        IL                  ED  ++ P E D D                         +++LA L++        G K   K    I+           
Subjt:  IL------------------EDYIKEHPSEADLD------------------------VVDLLASLYM--------GSKEFSKALEHIEHA---------

Query:  ----------------DRVYCAGNE----LPLNLTAKAGICHAHLGNLEKAECLFANLRRETTYDHSNLMIEVADSLLSLKHYSWALKYY---LMSEEVN
                        D +  A  E    +P+++  + G+   +  NL +A   F  L  ET  D ++L  E A +L   + Y  A+ ++   L  EE  
Subjt:  ----------------DRVYCAGNE----LPLNLTAKAGICHAHLGNLEKAECLFANLRRETTYDHSNLMIEVADSLLSLKHYSWALKYY---LMSEEVN

Query:  AGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLASLL------------------LEEARDEEAISLLSPPKDSNPTSSSSGK--LK
          +    +++ +A CY      E A  F+   ++   D+++ R++LA +                   + + + +E +  +S  K SN TS  S K  L+
Subjt:  AGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLASLL------------------LEEARDEEAISLLSPPKDSNPTSSSSGK--LK

Query:  PWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPVAPKSDLTKASRAKRLLQKRD
                K K       R  +E    +   +V +   ++  +    +N+ K    + +  V      E  ++F   +    KS   ++ +   +L++  
Subjt:  PWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPVAPKSDLTKASRAKRLLQKRD

Query:  RIKEE-----KKAKLLAAGPAVTFLIVCSFLLPMLAKIQLRMLFFFSVMSLATDASRIPLPNLLKEEEHHILIVDLCKALASLGRCSEALEII-------
        +   E     ++   LA G +V    +    + + +  +LR L +     L  + S +      +  E  + +V+  + +    +  E ++++       
Subjt:  RIKEE-----KKAKLLAAGPAVTFLIVCSFLLPMLAKIQLRMLFFFSVMSLATDASRIPLPNLLKEEEHHILIVDLCKALASLGRCSEALEII-------

Query:  --------SLTLNSTGTMHGFNFAKHVVKQYPYSI----SAWNCYYKVA-------------SCLTNRD-SRHCKLLNSMQAKYKDCAPPYIIAGHQFTT
                 L  N  G ++ F F + V++ + YS+    S+ N                   SC  N + +    + N         + PY+   +    
Subjt:  --------SLTLNSTGTMHGFNFAKHVVKQYPYSI----SAWNCYYKVA-------------SCLTNRD-SRHCKLLNSMQAKYKDCAPPYIIAGHQFTT

Query:  ISHHQDAARKYLEAYKIM-------PDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDN------NQEALYNIARAYHHIGLVTLAVT
         S     +R +L A + +       PD P++NL +G S I+ A+      +H  +  GL +LY+  K+  +       QEA YN+ RA+H IGLV++A+ 
Subjt:  ISHHQDAARKYLEAYKIM-------PDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDN------NQEALYNIARAYHHIGLVTLAVT

Query:  YYEKVLATYQKDCPIPELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDH
        YY +VL  Y                         L++ AAYN  +IY++SG ++LA  +++ +
Subjt:  YYEKVLATYQKDCPIPELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDH

Q9Y5Q9 General transcription factor 3C polypeptide 31.3e-6224.94Show/hide
Query:  KEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISLLRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIE
        +E KK  + K  + KL R +  L+G+A + +A+G+ E+AI +  +++ QAP   + + TL ++Y   GD  K++ F ++AAHL P D+  W  L   S+E
Subjt:  KEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISLLRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIE

Query:  RGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGN-----VEALMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVV
        + +I QA +C +KA+K EP ++  L+ R+SLY + GD + A + Y +I      +     ++     AK Y +   +  AI I+++   +H     ++ V
Subjt:  RGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGN-----VEALMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVV

Query:  DLLASLYMGSKEFSKALEHI---------------------EHADRVYCA-GNELPLNLTAKAGICHAHLGNLEKAECLFANLRRETTYDHSNLMIEVAD
        ++ A LY+ +K++ KALE I                     +  + V C   + +P+++T K  +C  HL  LE    L   L  +   D  +L ++VA+
Subjt:  DLLASLYMGSKEFSKALEHI---------------------EHADRVYCA-GNELPLNLTAKAGICHAHLGNLEKAECLFANLRRETTYDHSNLMIEVAD

Query:  SLLSLKHYSWALKYYLMSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSGK
        + L +  Y+ AL   L +   +   N+ +++ + AEC  +    E+A   + KV+     +++AR++L++L  +  + E+A+  L P  D +  +  +  
Subjt:  SLLSLKHYSWALKYYLMSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSGK

Query:  LKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETG--NLFRGFRPVAPKSDLTKASRAKRLL
         +      K+ L    +  ++G +  +V+ +  ++   L       K+ +N+          +V ++   ++G  +L+           L K S      
Subjt:  LKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETG--NLFRGFRPVAPKSDLTKASRAKRLL

Query:  QKRDRIKEEKKAKLLAAGPAVTFLIVCSFLLPMLAKIQLRMLFFFSVMSLATDASRIPLPNLLKEE--EHHILIVDLCKALASLGRCSEALEI--ISLTL
          RD+I +    +         F ++ S    +L K     L   ++ SL  D SR     LL +   E++    D  K         + LE   +S  +
Subjt:  QKRDRIKEEKKAKLLAAGPAVTFLIVCSFLLPMLAKIQLRMLFFFSVMSLATDASRIPLPNLLKEE--EHHILIVDLCKALASLGRCSEALEI--ISLTL

Query:  NSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGS
                +N+ + +V +       WN + +V   + ++D RH +    +  K  +     ++ GH        + A  +Y++A++  PD PL + C+G 
Subjt:  NSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGS

Query:  SLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNIKHQNSVYCDLRREAAYNL
        + I++A    +  +H  + QG +FL + L L    QE+ YN+ R  H +GL+ LA+ YY+K L        +P L  E   +        DLRR+ AYNL
Subjt:  SLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNIKHQNSVYCDLRREAAYNL

Query:  HLIYKESGALDLARQVLKDHCT
         LIY+ SG   +A+ +L  +C+
Subjt:  HLIYKESGALDLARQVLKDHCT

Arabidopsis top hitse value%identityAlignment
AT1G17680.1 tetratricopeptide repeat (TPR)-containing protein1.1e-17840.64Show/hide
Query:  EGNRISDSEEVPGDV---MHVLGVEKEVETGVVDREEE--EEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYEALA
        EGN IS+ EE P ++     VLG +   +   ++ +EE   ++++ + + +DEG++  EE+D     F+AG  P                FER EYEALA
Subjt:  EGNRISDSEEVPGDV---MHVLGVEKEVETGVVDREEE--EEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYEALA

Query:  EKKRKALANGQSERAAKRGRVEDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISLLRQVVLQAPDLPDS
        E+KRKALA+ Q   +        V G      +E M+ G RRK ++ KK+GRR GSKK++  D+ K   +A   +A G+  +A+ +L +V+ QAP    +
Subjt:  EKKRKALANGQSERAAKRGRVEDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISLLRQVVLQAPDLPDS

Query:  YHTLGLVYNAIG-DDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGN
        Y+ L  V   +G  +  +     +AA++    S  WKLL+    E+ +I  A    SKAI+A+PDDI L +  A + L  G   +AAET++QI ++C   
Subjt:  YHTLGLVYNAIG-DDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGN

Query:  VEALMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVVDLLASLYMGSKEFSKALEHIEHADRVYCAGNELPLNLTAKAGICHAHLGNLEKAECLFAN
        +EAL  G + + K G  ERA  ILED+IK H SE   DV+DLLAS++M      +AL++I    ++Y  G EL  +L  +  ICH HL  +E+AE + + 
Subjt:  VEALMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVVDLLASLYMGSKEFSKALEHIEHADRVYCAGNELPLNLTAKAGICHAHLGNLEKAECLFAN

Query:  LRRETTYDHSNLMIEVADSLLSLKHYSWALKYYL--MSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLASLLLEEARDEE
        L +E   +H  L+  +AD L ++ ++  ALKYY+  +SE VN     G L+ K+A CY+S  E++QAIVF+YK L  + D ++ R+TLASLLLE+ + +E
Subjt:  LRRETTYDHSNLMIEVADSLLSLKHYSWALKYYL--MSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLASLLLEEARDEE

Query:  AISLLSPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFR
        A+ +LSPP++ +P    + KLK WW N K+++ LC IY + G+LE+F      LV + ++  T++ K K       R VL E  +               
Subjt:  AISLLSPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFR

Query:  PVAPKSDLTKASRAKRLLQKRDRIKEEKKAKLLAAGPAVTFLIVCSFLLPMLAKIQLRMLFFFSVMSLATDASRIPLPNLLKEEEHHILIVDLCKALASL
                           K+ R   + +A  L  GP     I  +  L    +I+ R     ++ +   D        ++K+EE+H L VDLCKALASL
Subjt:  PVAPKSDLTKASRAKRLLQKRDRIKEEKKAKLLAAGPAVTFLIVCSFLLPMLAKIQLRMLFFFSVMSLATDASRIPLPNLLKEEEHHILIVDLCKALASL

Query:  GRCSEALEIISLT---------------LNSTGT---------MHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQAKYKDCAPPYI
         R  EALEI++L                L S G             F+  + V++Q+PY ++AWNCYY V S L  R S   K ++ +++KY+DC PP +
Subjt:  GRCSEALEIISLT---------------LNSTGT---------MHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQAKYKDCAPPYI

Query:  IAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKV
        IAGH FT  S HQDAAR+YLEAYK+MP+SPLINLCVG++LINLALGFRL+N+H+C+AQG AFLY NL++C N+QEALYN+ARAY H+GLVTLA +YYEKV
Subjt:  IAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKV

Query:  LATYQKDCPIPELFGENRNI-KHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        LA Y+KD  +P+L  E+  + + +  V CDLR+EAA+NLHLIYK SGA DLARQVLKDHCTF
Subjt:  LATYQKDCPIPELFGENRNI-KHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

AT1G17680.2 tetratricopeptide repeat (TPR)-containing protein1.1e-17840.64Show/hide
Query:  EGNRISDSEEVPGDV---MHVLGVEKEVETGVVDREEE--EEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYEALA
        EGN IS+ EE P ++     VLG +   +   ++ +EE   ++++ + + +DEG++  EE+D     F+AG  P                FER EYEALA
Subjt:  EGNRISDSEEVPGDV---MHVLGVEKEVETGVVDREEE--EEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYEALA

Query:  EKKRKALANGQSERAAKRGRVEDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISLLRQVVLQAPDLPDS
        E+KRKALA+ Q   +        V G      +E M+ G RRK ++ KK+GRR GSKK++  D+ K   +A   +A G+  +A+ +L +V+ QAP    +
Subjt:  EKKRKALANGQSERAAKRGRVEDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISLLRQVVLQAPDLPDS

Query:  YHTLGLVYNAIG-DDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGN
        Y+ L  V   +G  +  +     +AA++    S  WKLL+    E+ +I  A    SKAI+A+PDDI L +  A + L  G   +AAET++QI ++C   
Subjt:  YHTLGLVYNAIG-DDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGN

Query:  VEALMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVVDLLASLYMGSKEFSKALEHIEHADRVYCAGNELPLNLTAKAGICHAHLGNLEKAECLFAN
        +EAL  G + + K G  ERA  ILED+IK H SE   DV+DLLAS++M      +AL++I    ++Y  G EL  +L  +  ICH HL  +E+AE + + 
Subjt:  VEALMTGAKLYQKCGHLERAICILEDYIKEHPSEADLDVVDLLASLYMGSKEFSKALEHIEHADRVYCAGNELPLNLTAKAGICHAHLGNLEKAECLFAN

Query:  LRRETTYDHSNLMIEVADSLLSLKHYSWALKYYL--MSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLASLLLEEARDEE
        L +E   +H  L+  +AD L ++ ++  ALKYY+  +SE VN     G L+ K+A CY+S  E++QAIVF+YK L  + D ++ R+TLASLLLE+ + +E
Subjt:  LRRETTYDHSNLMIEVADSLLSLKHYSWALKYYL--MSEEVNAGENMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLASLLLEEARDEE

Query:  AISLLSPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFR
        A+ +LSPP++ +P    + KLK WW N K+++ LC IY + G+LE+F      LV + ++  T++ K K       R VL E  +               
Subjt:  AISLLSPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLVRESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFR

Query:  PVAPKSDLTKASRAKRLLQKRDRIKEEKKAKLLAAGPAVTFLIVCSFLLPMLAKIQLRMLFFFSVMSLATDASRIPLPNLLKEEEHHILIVDLCKALASL
                           K+ R   + +A  L  GP     I  +  L    +I+ R     ++ +   D        ++K+EE+H L VDLCKALASL
Subjt:  PVAPKSDLTKASRAKRLLQKRDRIKEEKKAKLLAAGPAVTFLIVCSFLLPMLAKIQLRMLFFFSVMSLATDASRIPLPNLLKEEEHHILIVDLCKALASL

Query:  GRCSEALEIISLT---------------LNSTGT---------MHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQAKYKDCAPPYI
         R  EALEI++L                L S G             F+  + V++Q+PY ++AWNCYY V S L  R S   K ++ +++KY+DC PP +
Subjt:  GRCSEALEIISLT---------------LNSTGT---------MHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQAKYKDCAPPYI

Query:  IAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKV
        IAGH FT  S HQDAAR+YLEAYK+MP+SPLINLCVG++LINLALGFRL+N+H+C+AQG AFLY NL++C N+QEALYN+ARAY H+GLVTLA +YYEKV
Subjt:  IAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKV

Query:  LATYQKDCPIPELFGENRNI-KHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        LA Y+KD  +P+L  E+  + + +  V CDLR+EAA+NLHLIYK SGA DLARQVLKDHCTF
Subjt:  LATYQKDCPIPELFGENRNI-KHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

AT3G04240.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.7e-0624.26Show/hide
Query:  ATLCYAQGQHEKAISLLRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLF
        A L    G   +A+   ++ V   P  PD+Y  LG VY A+G   +A+  Y  A  + P  +  +  + S   E+G +D A     +A+  +P  +    
Subjt:  ATLCYAQGQHEKAISLLRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLF

Query:  HRASLYLERGDCEKAAETYDQI------HQQCLGNV
        +  +   + G  ++A   Y+Q       H Q + N+
Subjt:  HRASLYLERGDCEKAAETYDQI------HQQCLGNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAAGGAGGGGAATAGAATTTCTGACAGTGAAGAGGTTCCTGGTGATGTTATGCATGTTTTGGGAGTGGAAAAAGAGGTAGAAACAGGAGTGGTGGATAGAGAGGA
GGAGGAGGAGGAGGAGGAAGGAGAGGAAGAAGTGGAGGATGAAGGAGAGGATGATATTGAAGAAGAAGACGGTTACACATTCAAATTTAAGGCCGGAGAAAATCCATTTG
ATTTTGTTGAAGGAACCGATTTTAGTGTCCAACCATATAAAAAATTTGAGCGCCTTGAATATGAAGCCCTTGCTGAGAAAAAGAGAAAAGCTCTTGCAAATGGCCAGAGT
GAGAGAGCTGCAAAGAGGGGCAGGGTAGAAGACGTTGCTGGTGCTAGCTTTGATGAAATTTTGGAAGCTATGAATTATGGATCTAGGAGGAAGTTAAAGGAGCCTAAAAA
AAGAGGTCGACGGAAAGGATCAAAGAAAAAACTTAATCGTGATGTTACAAAGTTGCTCGGTGATGCAACTTTATGTTATGCTCAAGGCCAGCATGAGAAGGCTATATCTT
TATTACGCCAAGTTGTTCTGCAAGCCCCAGATTTACCTGATTCGTACCACACACTTGGACTTGTTTACAATGCAATTGGTGATGATGTAAAAGCCATGGGATTCTACATG
CTTGCTGCACATTTAATGCCAAAAGATTCATCTCTCTGGAAACTGCTATTTTCATGGTCAATTGAACGAGGTGATATTGATCAAGCAAGCTATTGTCTTTCTAAAGCAAT
AAAAGCAGAGCCTGATGATATTAATTTATTATTTCATCGTGCGTCACTCTACCTTGAGCGTGGAGATTGTGAGAAAGCAGCTGAAACATATGATCAGATTCATCAACAAT
GCCTTGGTAACGTTGAAGCACTCATGACAGGAGCAAAGCTGTACCAAAAATGTGGTCATCTTGAACGTGCAATTTGCATTCTTGAGGACTACATTAAAGAGCATCCATCT
GAAGCTGATTTAGATGTGGTTGATCTTTTAGCTTCTTTATACATGGGAAGTAAAGAATTCAGCAAAGCTCTTGAGCACATTGAACATGCAGATAGGGTGTACTGTGCCGG
AAATGAGCTACCTTTAAACTTGACAGCTAAAGCAGGAATCTGCCACGCTCATCTTGGAAACTTGGAGAAGGCAGAGTGCCTCTTTGCTAATTTGAGACGGGAAACTACCT
ATGATCACTCAAATTTGATGATTGAAGTTGCAGACTCGTTGCTGAGTCTTAAGCACTATAGCTGGGCATTGAAGTATTATTTGATGTCCGAAGAAGTAAATGCTGGAGAG
AACATGGGGATTTTATACCAAAAAGTTGCTGAGTGTTACTTATCAACTAATGAAAAGGAACAGGCAATTGTTTTCTTTTATAAAGTGCTTCAACATGTTGAAGATAACAT
TAATGCTCGATTAACTTTGGCCTCCCTCCTCCTTGAGGAAGCTAGAGACGAAGAAGCCATTTCATTACTATCTCCTCCAAAAGATTCAAACCCAACTAGCTCATCTTCCG
GCAAATTAAAACCTTGGTGGCTGAATGAGAAAGTAAAACTGAAGCTTTGCCACATATACAGAACTAGAGGATTGCTTGAGAACTTCGTTGAGGTGATCTTTCCTTTGGTC
CGAGAATCCTTATATATCGAGACTCTTCAAGAAAAGATTAAAGTGAACAAGAAGAAGCTTCCAAGGAGGGTTTTGCTAGAGAGAGTGAAAGTACTAGATGGACGTGAAAC
TGGCAACCTATTTCGTGGATTCAGACCTGTAGCTCCGAAATCAGATTTAACAAAAGCATCCAGAGCAAAGAGATTACTTCAAAAAAGGGATAGAATCAAGGAAGAAAAGA
AGGCTAAATTACTGGCTGCAGGACCAGCGGTAACTTTTTTGATTGTTTGCTCATTTTTATTGCCTATGCTAGCAAAAATTCAACTAAGAATGCTTTTCTTTTTCTCTGTG
ATGTCTCTAGCTACGGATGCATCGAGAATCCCCCTGCCTAACCTTCTGAAGGAAGAAGAACACCATATTCTTATTGTCGATTTGTGCAAGGCATTGGCTTCCTTAGGAAG
ATGTTCTGAAGCTTTAGAGATTATAAGTCTAACTTTAAACTCAACTGGTACCATGCATGGTTTCAACTTTGCAAAGCACGTTGTTAAGCAGTACCCTTACAGCATCTCGG
CTTGGAACTGCTATTATAAAGTAGCTTCATGTTTGACGAACCGGGATTCAAGGCATTGCAAGCTTCTGAATAGCATGCAAGCCAAATACAAAGATTGTGCACCACCCTAT
ATCATTGCCGGGCATCAATTTACCACCATTAGCCATCATCAAGATGCTGCAAGGAAATATCTTGAAGCTTATAAAATCATGCCGGATAGTCCCCTAATTAATTTGTGTGT
TGGATCGTCCTTAATCAACTTGGCTCTTGGATTCCGTCTTCAAAACAAGCATCAGTGTGTTGCGCAGGGCTTGGCATTCCTCTACAAAAATTTGAAGCTGTGCGATAACA
ACCAGGAAGCCTTGTACAACATAGCTCGAGCATATCATCACATTGGACTGGTGACACTGGCAGTTACATATTACGAAAAGGTGCTTGCAACTTACCAGAAGGATTGCCCC
ATACCAGAACTTTTTGGTGAGAATCGAAACATTAAACATCAGAATTCAGTCTATTGTGACCTACGGAGAGAAGCAGCTTACAATTTGCATCTTATTTATAAAGAGAGTGG
AGCTCTTGATCTTGCTAGGCAAGTCCTAAAAGATCATTGCACATTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAAGGAGGGGAATAGAATTTCTGACAGTGAAGAGGTTCCTGGTGATGTTATGCATGTTTTGGGAGTGGAAAAAGAGGTAGAAACAGGAGTGGTGGATAGAGAGGA
GGAGGAGGAGGAGGAGGAAGGAGAGGAAGAAGTGGAGGATGAAGGAGAGGATGATATTGAAGAAGAAGACGGTTACACATTCAAATTTAAGGCCGGAGAAAATCCATTTG
ATTTTGTTGAAGGAACCGATTTTAGTGTCCAACCATATAAAAAATTTGAGCGCCTTGAATATGAAGCCCTTGCTGAGAAAAAGAGAAAAGCTCTTGCAAATGGCCAGAGT
GAGAGAGCTGCAAAGAGGGGCAGGGTAGAAGACGTTGCTGGTGCTAGCTTTGATGAAATTTTGGAAGCTATGAATTATGGATCTAGGAGGAAGTTAAAGGAGCCTAAAAA
AAGAGGTCGACGGAAAGGATCAAAGAAAAAACTTAATCGTGATGTTACAAAGTTGCTCGGTGATGCAACTTTATGTTATGCTCAAGGCCAGCATGAGAAGGCTATATCTT
TATTACGCCAAGTTGTTCTGCAAGCCCCAGATTTACCTGATTCGTACCACACACTTGGACTTGTTTACAATGCAATTGGTGATGATGTAAAAGCCATGGGATTCTACATG
CTTGCTGCACATTTAATGCCAAAAGATTCATCTCTCTGGAAACTGCTATTTTCATGGTCAATTGAACGAGGTGATATTGATCAAGCAAGCTATTGTCTTTCTAAAGCAAT
AAAAGCAGAGCCTGATGATATTAATTTATTATTTCATCGTGCGTCACTCTACCTTGAGCGTGGAGATTGTGAGAAAGCAGCTGAAACATATGATCAGATTCATCAACAAT
GCCTTGGTAACGTTGAAGCACTCATGACAGGAGCAAAGCTGTACCAAAAATGTGGTCATCTTGAACGTGCAATTTGCATTCTTGAGGACTACATTAAAGAGCATCCATCT
GAAGCTGATTTAGATGTGGTTGATCTTTTAGCTTCTTTATACATGGGAAGTAAAGAATTCAGCAAAGCTCTTGAGCACATTGAACATGCAGATAGGGTGTACTGTGCCGG
AAATGAGCTACCTTTAAACTTGACAGCTAAAGCAGGAATCTGCCACGCTCATCTTGGAAACTTGGAGAAGGCAGAGTGCCTCTTTGCTAATTTGAGACGGGAAACTACCT
ATGATCACTCAAATTTGATGATTGAAGTTGCAGACTCGTTGCTGAGTCTTAAGCACTATAGCTGGGCATTGAAGTATTATTTGATGTCCGAAGAAGTAAATGCTGGAGAG
AACATGGGGATTTTATACCAAAAAGTTGCTGAGTGTTACTTATCAACTAATGAAAAGGAACAGGCAATTGTTTTCTTTTATAAAGTGCTTCAACATGTTGAAGATAACAT
TAATGCTCGATTAACTTTGGCCTCCCTCCTCCTTGAGGAAGCTAGAGACGAAGAAGCCATTTCATTACTATCTCCTCCAAAAGATTCAAACCCAACTAGCTCATCTTCCG
GCAAATTAAAACCTTGGTGGCTGAATGAGAAAGTAAAACTGAAGCTTTGCCACATATACAGAACTAGAGGATTGCTTGAGAACTTCGTTGAGGTGATCTTTCCTTTGGTC
CGAGAATCCTTATATATCGAGACTCTTCAAGAAAAGATTAAAGTGAACAAGAAGAAGCTTCCAAGGAGGGTTTTGCTAGAGAGAGTGAAAGTACTAGATGGACGTGAAAC
TGGCAACCTATTTCGTGGATTCAGACCTGTAGCTCCGAAATCAGATTTAACAAAAGCATCCAGAGCAAAGAGATTACTTCAAAAAAGGGATAGAATCAAGGAAGAAAAGA
AGGCTAAATTACTGGCTGCAGGACCAGCGGTAACTTTTTTGATTGTTTGCTCATTTTTATTGCCTATGCTAGCAAAAATTCAACTAAGAATGCTTTTCTTTTTCTCTGTG
ATGTCTCTAGCTACGGATGCATCGAGAATCCCCCTGCCTAACCTTCTGAAGGAAGAAGAACACCATATTCTTATTGTCGATTTGTGCAAGGCATTGGCTTCCTTAGGAAG
ATGTTCTGAAGCTTTAGAGATTATAAGTCTAACTTTAAACTCAACTGGTACCATGCATGGTTTCAACTTTGCAAAGCACGTTGTTAAGCAGTACCCTTACAGCATCTCGG
CTTGGAACTGCTATTATAAAGTAGCTTCATGTTTGACGAACCGGGATTCAAGGCATTGCAAGCTTCTGAATAGCATGCAAGCCAAATACAAAGATTGTGCACCACCCTAT
ATCATTGCCGGGCATCAATTTACCACCATTAGCCATCATCAAGATGCTGCAAGGAAATATCTTGAAGCTTATAAAATCATGCCGGATAGTCCCCTAATTAATTTGTGTGT
TGGATCGTCCTTAATCAACTTGGCTCTTGGATTCCGTCTTCAAAACAAGCATCAGTGTGTTGCGCAGGGCTTGGCATTCCTCTACAAAAATTTGAAGCTGTGCGATAACA
ACCAGGAAGCCTTGTACAACATAGCTCGAGCATATCATCACATTGGACTGGTGACACTGGCAGTTACATATTACGAAAAGGTGCTTGCAACTTACCAGAAGGATTGCCCC
ATACCAGAACTTTTTGGTGAGAATCGAAACATTAAACATCAGAATTCAGTCTATTGTGACCTACGGAGAGAAGCAGCTTACAATTTGCATCTTATTTATAAAGAGAGTGG
AGCTCTTGATCTTGCTAGGCAAGTCCTAAAAGATCATTGCACATTTTAA
Protein sequenceShow/hide protein sequence
MEKEGNRISDSEEVPGDVMHVLGVEKEVETGVVDREEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSVQPYKKFERLEYEALAEKKRKALANGQS
ERAAKRGRVEDVAGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISLLRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYM
LAAHLMPKDSSLWKLLFSWSIERGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPS
EADLDVVDLLASLYMGSKEFSKALEHIEHADRVYCAGNELPLNLTAKAGICHAHLGNLEKAECLFANLRRETTYDHSNLMIEVADSLLSLKHYSWALKYYLMSEEVNAGE
NMGILYQKVAECYLSTNEKEQAIVFFYKVLQHVEDNINARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSGKLKPWWLNEKVKLKLCHIYRTRGLLENFVEVIFPLV
RESLYIETLQEKIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPVAPKSDLTKASRAKRLLQKRDRIKEEKKAKLLAAGPAVTFLIVCSFLLPMLAKIQLRMLFFFSV
MSLATDASRIPLPNLLKEEEHHILIVDLCKALASLGRCSEALEIISLTLNSTGTMHGFNFAKHVVKQYPYSISAWNCYYKVASCLTNRDSRHCKLLNSMQAKYKDCAPPY
IIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGSSLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNNQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCP
IPELFGENRNIKHQNSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF