; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi07G005790 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi07G005790
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptiongeneral transcription factor 3C polypeptide 3
Genome locationchr07:6153686..6173853
RNA-Seq ExpressionLsi07G005790
SyntenyLsi07G005790
Gene Ontology termsGO:0006383 - transcription by RNA polymerase III (biological process)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily
IPR019734 - Tetratricopeptide repeat
IPR039340 - Transcription factor Tfc4/TFIIIC-102/Sfc4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008447634.1 PREDICTED: general transcription factor 3C polypeptide 3 isoform X1 [Cucumis melo]0.0e+0089.59Show/hide
Query:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA
        MEKEG++ISD+EEVPG VM VLG EKE VETGV  R   EEEEEEEEGEEEVEDEGEDDIEEEDGY FKFKAGENPFDFVEGTDFS+QPYKKFERLEYEA
Subjt:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA

Query:  LAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLP
        LAEKKRKALANGQSERAAKRGRVED+ GASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAIS+LRQVVLQAPDLP
Subjt:  LAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLP

Query:  DSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLG
        DSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSI+RGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLG
Subjt:  DSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLG

Query:  NVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLFA
        NVEALMTGAKLYQKCGHLERAICILEDYIKEHP+EADLDVVDLLASLYMGSKEFSKALE IEHAD VYCAGNELPLNLT KAGICH HLGN+EKAECLFA
Subjt:  NVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLFA

Query:  NLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEA
        NL RET  DHSNLMIE ADSLLSLKHY+ ALKYYLMSEEVNAG NMGILY K+A+CYLST+E+ QAIVFFYKVLQH+EDNINARLTLASLLLEEARDEEA
Subjt:  NLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEA

Query:  ISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVE------------------IKVNKKKLPRRVLLERVKVLDGRETGNLFRGFKP
        ISLLSPPKDSNPTSSSS KLKPWWLNEKVKLKLC IY+TRG+LENFVE                  IKVNKKKLPRRVLLERVKVLDGRETGNLFRGF+P
Subjt:  ISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVE------------------IKVNKKKLPRRVLLERVKVLDGRETGNLFRGFKP

Query:  VAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEIISLTLKLAFNS
        VAPKSDL+KASRAKRLLQKR+RIKEEKKA+ LAAGV+V+YDDLDDEPA                        LCKALASLGRCSEALEIISLTLKLAFNS
Subjt:  VAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEIISLTLKLAFNS

Query:  LSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYL
        LS ERKEELQLLGAQLAFSST TMHGFNFAKHVVKQYPYSISAWNCYYKVAS LTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYL
Subjt:  LSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYL

Query:  EAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNI
        EAYKIMPDSPLINLCVG+SLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDN+QEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN+NI
Subjt:  EAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNI

Query:  KHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        KHQ SVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
Subjt:  KHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

XP_011652346.1 general transcription factor 3C polypeptide 3 [Cucumis sativus]0.0e+0090.45Show/hide
Query:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE-EEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYE
        MEKEGS+ISD EEVPG VM VLG EKEVVETGVE RE EEEEEEEEEEGEEEVEDEGEDDIEEEDGY FKFKAGENPFDFVEGTDFS+QPYKKFERLEYE
Subjt:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE-EEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYE

Query:  ALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDL
        ALAEKKRKALANGQSERAAKRGRVEDI GASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQG+HEKAIS+LRQVVL+APDL
Subjt:  ALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDL

Query:  PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL
        PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL
Subjt:  PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL

Query:  GNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLF
        GNVEALMTGAKLYQKCGHLERAICILEDYIK HP+EADLDVVDLLASLYMGSKEFSKALERIEHAD VYCAGNELPLNLTTKAGICH HLG++EKAECLF
Subjt:  GNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLF

Query:  ANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEE
        ANL RET  DHSNLMIE ADSL+SLKHY+ ALKYYLMSEEVNAG NMGILYLKIA+CYLST+ER QAIVFFYKVLQH+EDNINARLTLASLLLEEARD+E
Subjt:  ANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEE

Query:  AISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVE------------------IKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK
        AISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLC IY+TRG+LENFVE                  IKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK
Subjt:  AISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVE------------------IKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK

Query:  PVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEIISLTLKLAFN
        PVAPKSDL+KASRAKRLLQKRERIKEEKKA+ALAAGV+++YDDLDDEPA                        LCKALASLGRCSEALEIISLTLKLAFN
Subjt:  PVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEIISLTLKLAFN

Query:  SLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKY
        SLS+ERKEELQLLGAQLAFSST TMHGFNFAKHVVKQYPYSISAWNCYYKVAS LTNRDSRHCKLLNSMQ+KYKDCAPPYIIAGHQFTTISHHQDAARKY
Subjt:  SLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKY

Query:  LEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQN
        LEAYKIMPDSPLINLCVG+SLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDN+QEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN+N
Subjt:  LEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQN

Query:  IKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        IKHQ SVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
Subjt:  IKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

XP_022152620.1 general transcription factor 3C polypeptide 3 isoform X2 [Momordica charantia]0.0e+0085.47Show/hide
Query:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE---------EEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK
        MEKEG++ISDN+EVPG  + V G  K + ET VE RE         EEEEEEEEEE EEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK
Subjt:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE---------EEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK

Query:  KFERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQ
        KFERLEYEALAEKKRKALAN QSER  KRGR+EDIPGASF+EI+EAMNYGSRRKLKEPK+RGRRKGSKKK+NR++TKLLGDATLCYAQGQ+EKAIS+LRQ
Subjt:  KFERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQ

Query:  VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY
        VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDC+KAAETY
Subjt:  VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY

Query:  DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGN
        DQIHQ+C+GNVEALMTGAKLYQKCGH ERAICILEDYIK HPTEADLDVVDLLASLYMGSKEFSKALE IEHAD+VYCAGNE+PLNL TKAGICHVHLGN
Subjt:  DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGN

Query:  MEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLL
        +EKAE LFANLGR+TA+DHS+ +IEAADSLLSLKH+NLALKYYLMSEEVNAGG MGI+YLKIAQCYLST+ER +AIVFFYKVLQ LEDNINARLTLASLL
Subjt:  MEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLL

Query:  LEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVE------------------IKVNKKKLPRRVLLERVKVLDGRET
        LEEAR+EEAISLLSPPKDSN +SSSSSK KPWWLNEKVKLKLC IY+T+GMLENFVE                  IKVNKKKLPRRVLLERVKVLDGRET
Subjt:  LEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVE------------------IKVNKKKLPRRVLLERVKVLDGRET

Query:  GNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEIIS
        G+LFRGF+PVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGV+V+YDD+DDEPA                        LCKALASLGRCSEALEIIS
Subjt:  GNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEIIS

Query:  LTLKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISH
        LTLKLAFNSLS+ERKEELQLLGAQLAFSSTDT HGFNFAKHVVKQYPYS SAWNCYYKV+S +T+RDSRHCKLLNS+QAKYKDCAPP+IIAGHQF  ISH
Subjt:  LTLKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISH

Query:  HQDAARKYLEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP
        HQ+AA+KYLEAYK++PDSPLINLCVGA+LINLALG RLQNKHQCVAQGLAFLY NLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP
Subjt:  HQDAARKYLEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP

Query:  ELFGENQNIKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        ++FGEN+NIKH+KSVYCDLRREAAYNLHLIYK+SGALDLARQVLKDHCTF
Subjt:  ELFGENQNIKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

XP_022152621.1 general transcription factor 3C polypeptide 3 isoform X3 [Momordica charantia]0.0e+0087.59Show/hide
Query:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE---------EEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK
        MEKEG++ISDN+EVPG  + V G  K + ET VE RE         EEEEEEEEEE EEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK
Subjt:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE---------EEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK

Query:  KFERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQ
        KFERLEYEALAEKKRKALAN QSER  KRGR+EDIPGASF+EI+EAMNYGSRRKLKEPK+RGRRKGSKKK+NR++TKLLGDATLCYAQGQ+EKAIS+LRQ
Subjt:  KFERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQ

Query:  VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY
        VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDC+KAAETY
Subjt:  VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY

Query:  DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGN
        DQIHQ+C+GNVEALMTGAKLYQKCGH ERAICILEDYIK HPTEADLDVVDLLASLYMGSKEFSKALE IEHAD+VYCAGNE+PLNL TKAGICHVHLGN
Subjt:  DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGN

Query:  MEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNM-GILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASL
        +EKAE LFANLGR+TA+DHS+ +IEAADSLLSLKH+NLALKYYLMSEEVNAGG M GI+YLKIAQCYLST+ER +AIVFFYKVLQ LEDNINARLTLASL
Subjt:  MEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNM-GILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASL

Query:  LLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVE------------------IKVNKKKLPRRVLLERVKVLDGRE
        LLEEAR+EEAISLLSPPKDSN +SSSSSK KPWWLNEKVKLKLC IY+T+GMLENFVE                  IKVNKKKLPRRVLLERVKVLDGRE
Subjt:  LLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVE------------------IKVNKKKLPRRVLLERVKVLDGRE

Query:  TGNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGA
        TG+LFRGF+PVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGV+V+YDD+DDEPALCKALASLGRCSEALEIISLTLKLAFNSLS+ERKEELQLLGA
Subjt:  TGNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGA

Query:  QLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINL
        QLAFSSTDT HGFNFAKHVVKQYPYS SAWNCYYKV+S +T+RDSRHCKLLNS+QAKYKDCAPP+IIAGHQF  ISHHQ+AA+KYLEAYK++PDSPLINL
Subjt:  QLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINL

Query:  CVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNIKHQKSVYCDLRREA
        CVGA+LINLALG RLQNKHQCVAQGLAFLY NLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP++FGEN+NIKH+KSVYCDLRREA
Subjt:  CVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNIKHQKSVYCDLRREA

Query:  AYNLHLIYKESGALDLARQVLKDHCTF
        AYNLHLIYK+SGALDLARQVLKDHCTF
Subjt:  AYNLHLIYKESGALDLARQVLKDHCTF

XP_038879313.1 general transcription factor 3C polypeptide 3 [Benincasa hispida]0.0e+0091.82Show/hide
Query:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA
        MEKEG+ +SDNEEVPGG M VLGA KEVVETGVE R  EEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA
Subjt:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA

Query:  LAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLP
        LAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKA+SILRQVVLQAPDLP
Subjt:  LAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLP

Query:  DSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLG
        DSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDI+LLFHRASLYLERGDCEKAAETYDQIHQQCLG
Subjt:  DSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLG

Query:  NVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLFA
        NVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMG+KEF KALERIEHADEVYCAG+ELPL LTTK GICH HLGNMEKAECLFA
Subjt:  NVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLFA

Query:  NLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEA
        NLG ETA+DHSNLMIE ADSLLSLKHYNLALKYYLM EEVNAGGNMGILYLKIAQCYLST+ER QAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEA
Subjt:  NLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEA

Query:  ISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVE------------------IKVNKKKLPRRVLLERVKVLDGRETGNLFRGFKP
        ISLLSPPKDSNPTSSSSSKLKPWW NEKVKLKLC IYKTRGMLE+FVE                  IKVNKKKLP+RVLLERVKVLDGRE+GNLFRGF+P
Subjt:  ISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVE------------------IKVNKKKLPRRVLLERVKVLDGRETGNLFRGFKP

Query:  VAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEIISLTLKLAFNS
        VAPKSDLSKASRAKRLLQKRERIKEEKKA+ALAAGVDVNYDDLDDEPA                        LCKALASLGRCSEALEIISLTLKLA NS
Subjt:  VAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEIISLTLKLAFNS

Query:  LSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYL
        LSIERKEELQLLGAQLAFSSTDT HGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFT ISHHQDAARKYL
Subjt:  LSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYL

Query:  EAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNI
        EAYKIMPDSPLINLCVG+SLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN+NI
Subjt:  EAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNI

Query:  KHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        KHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
Subjt:  KHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

TrEMBL top hitse value%identityAlignment
A0A0A0LGB1 Uncharacterized protein0.0e+0087.9Show/hide
Query:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE-EEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYE
        MEKEGS+ISD EEVPG VM VLG EKEVVETGVE RE EEEEEEEEEEGEEEVEDEGEDDIEEEDGY FKFKAGENPFDFVEGTDFS+QPYKKFERLEYE
Subjt:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE-EEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYE

Query:  ALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDL
        ALAEKKRKALANGQSERAAKRGRVEDI GASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQG+HEKAIS+LRQVVL+APDL
Subjt:  ALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDL

Query:  PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL
        PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL
Subjt:  PDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL

Query:  GNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLF
        GNVEALMTGAKLYQKCGHLERAICILEDYIK HP+EADLDVVDLLASLYMGSKEFSKALERIEHAD VYCAGNELPLNLTTKAGICH HLG++EKAECLF
Subjt:  GNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLF

Query:  ANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEE
        ANL RET  DHSNLMIE ADSL+SLKHY+ ALKYYLMSEEVNAG NMGILYLKIA+CYLST+ER QAIVFFYKVLQH+EDNINARLTLASLLLEEARD+E
Subjt:  ANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEE

Query:  AISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVE------------------IKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK
        AISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLC IY+TRG+LENFVE                  IKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK
Subjt:  AISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVE------------------IKVNKKKLPRRVLLERVKVLDGRETGNLFRGFK

Query:  PVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEIISLTLKLAFN
        PVAPKSDL+KASRAKRLLQKRERIKEEKKA+ALAAGV+++YDDLDDEPA                        LCKALASLGRCSEALEIISLTL     
Subjt:  PVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEIISLTLKLAFN

Query:  SLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKY
                           +ST TMHGFNFAKHVVKQYPYSISAWNCYYKVAS LTNRDSRHCKLLNSMQ+KYKDCAPPYIIAGHQFTTISHHQDAARKY
Subjt:  SLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKY

Query:  LEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQN
        LEAYKIMPDSPLINLCVG+SLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDN+QEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN+N
Subjt:  LEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQN

Query:  IKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        IKHQ SVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
Subjt:  IKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

A0A1S3BHB9 general transcription factor 3C polypeptide 3 isoform X10.0e+0089.59Show/hide
Query:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA
        MEKEG++ISD+EEVPG VM VLG EKE VETGV  R   EEEEEEEEGEEEVEDEGEDDIEEEDGY FKFKAGENPFDFVEGTDFS+QPYKKFERLEYEA
Subjt:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA

Query:  LAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLP
        LAEKKRKALANGQSERAAKRGRVED+ GASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAIS+LRQVVLQAPDLP
Subjt:  LAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLP

Query:  DSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLG
        DSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSI+RGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLG
Subjt:  DSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLG

Query:  NVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLFA
        NVEALMTGAKLYQKCGHLERAICILEDYIKEHP+EADLDVVDLLASLYMGSKEFSKALE IEHAD VYCAGNELPLNLT KAGICH HLGN+EKAECLFA
Subjt:  NVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLFA

Query:  NLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEA
        NL RET  DHSNLMIE ADSLLSLKHY+ ALKYYLMSEEVNAG NMGILY K+A+CYLST+E+ QAIVFFYKVLQH+EDNINARLTLASLLLEEARDEEA
Subjt:  NLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEA

Query:  ISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVE------------------IKVNKKKLPRRVLLERVKVLDGRETGNLFRGFKP
        ISLLSPPKDSNPTSSSS KLKPWWLNEKVKLKLC IY+TRG+LENFVE                  IKVNKKKLPRRVLLERVKVLDGRETGNLFRGF+P
Subjt:  ISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVE------------------IKVNKKKLPRRVLLERVKVLDGRETGNLFRGFKP

Query:  VAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEIISLTLKLAFNS
        VAPKSDL+KASRAKRLLQKR+RIKEEKKA+ LAAGV+V+YDDLDDEPA                        LCKALASLGRCSEALEIISLTLKLAFNS
Subjt:  VAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEIISLTLKLAFNS

Query:  LSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYL
        LS ERKEELQLLGAQLAFSST TMHGFNFAKHVVKQYPYSISAWNCYYKVAS LTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYL
Subjt:  LSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYL

Query:  EAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNI
        EAYKIMPDSPLINLCVG+SLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDN+QEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN+NI
Subjt:  EAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNI

Query:  KHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        KHQ SVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
Subjt:  KHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

A0A6J1DEF9 general transcription factor 3C polypeptide 3 isoform X20.0e+0085.47Show/hide
Query:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE---------EEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK
        MEKEG++ISDN+EVPG  + V G  K + ET VE RE         EEEEEEEEEE EEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK
Subjt:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE---------EEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK

Query:  KFERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQ
        KFERLEYEALAEKKRKALAN QSER  KRGR+EDIPGASF+EI+EAMNYGSRRKLKEPK+RGRRKGSKKK+NR++TKLLGDATLCYAQGQ+EKAIS+LRQ
Subjt:  KFERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQ

Query:  VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY
        VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDC+KAAETY
Subjt:  VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY

Query:  DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGN
        DQIHQ+C+GNVEALMTGAKLYQKCGH ERAICILEDYIK HPTEADLDVVDLLASLYMGSKEFSKALE IEHAD+VYCAGNE+PLNL TKAGICHVHLGN
Subjt:  DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGN

Query:  MEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLL
        +EKAE LFANLGR+TA+DHS+ +IEAADSLLSLKH+NLALKYYLMSEEVNAGG MGI+YLKIAQCYLST+ER +AIVFFYKVLQ LEDNINARLTLASLL
Subjt:  MEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLL

Query:  LEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVE------------------IKVNKKKLPRRVLLERVKVLDGRET
        LEEAR+EEAISLLSPPKDSN +SSSSSK KPWWLNEKVKLKLC IY+T+GMLENFVE                  IKVNKKKLPRRVLLERVKVLDGRET
Subjt:  LEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVE------------------IKVNKKKLPRRVLLERVKVLDGRET

Query:  GNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEIIS
        G+LFRGF+PVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGV+V+YDD+DDEPA                        LCKALASLGRCSEALEIIS
Subjt:  GNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEIIS

Query:  LTLKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISH
        LTLKLAFNSLS+ERKEELQLLGAQLAFSSTDT HGFNFAKHVVKQYPYS SAWNCYYKV+S +T+RDSRHCKLLNS+QAKYKDCAPP+IIAGHQF  ISH
Subjt:  LTLKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISH

Query:  HQDAARKYLEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP
        HQ+AA+KYLEAYK++PDSPLINLCVGA+LINLALG RLQNKHQCVAQGLAFLY NLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP
Subjt:  HQDAARKYLEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP

Query:  ELFGENQNIKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        ++FGEN+NIKH+KSVYCDLRREAAYNLHLIYK+SGALDLARQVLKDHCTF
Subjt:  ELFGENQNIKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

A0A6J1DGK2 general transcription factor 3C polypeptide 3 isoform X10.0e+0085.38Show/hide
Query:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE---------EEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK
        MEKEG++ISDN+EVPG  + V G  K + ET VE RE         EEEEEEEEEE EEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK
Subjt:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE---------EEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK

Query:  KFERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQ
        KFERLEYEALAEKKRKALAN QSER  KRGR+EDIPGASF+EI+EAMNYGSRRKLKEPK+RGRRKGSKKK+NR++TKLLGDATLCYAQGQ+EKAIS+LRQ
Subjt:  KFERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQ

Query:  VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY
        VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDC+KAAETY
Subjt:  VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY

Query:  DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGN
        DQIHQ+C+GNVEALMTGAKLYQKCGH ERAICILEDYIK HPTEADLDVVDLLASLYMGSKEFSKALE IEHAD+VYCAGNE+PLNL TKAGICHVHLGN
Subjt:  DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGN

Query:  MEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNM-GILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASL
        +EKAE LFANLGR+TA+DHS+ +IEAADSLLSLKH+NLALKYYLMSEEVNAGG M GI+YLKIAQCYLST+ER +AIVFFYKVLQ LEDNINARLTLASL
Subjt:  MEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNM-GILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASL

Query:  LLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVE------------------IKVNKKKLPRRVLLERVKVLDGRE
        LLEEAR+EEAISLLSPPKDSN +SSSSSK KPWWLNEKVKLKLC IY+T+GMLENFVE                  IKVNKKKLPRRVLLERVKVLDGRE
Subjt:  LLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVE------------------IKVNKKKLPRRVLLERVKVLDGRE

Query:  TGNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEII
        TG+LFRGF+PVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGV+V+YDD+DDEPA                        LCKALASLGRCSEALEII
Subjt:  TGNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPA------------------------LCKALASLGRCSEALEII

Query:  SLTLKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTIS
        SLTLKLAFNSLS+ERKEELQLLGAQLAFSSTDT HGFNFAKHVVKQYPYS SAWNCYYKV+S +T+RDSRHCKLLNS+QAKYKDCAPP+IIAGHQF  IS
Subjt:  SLTLKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTIS

Query:  HHQDAARKYLEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPI
        HHQ+AA+KYLEAYK++PDSPLINLCVGA+LINLALG RLQNKHQCVAQGLAFLY NLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPI
Subjt:  HHQDAARKYLEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPI

Query:  PELFGENQNIKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
        P++FGEN+NIKH+KSVYCDLRREAAYNLHLIYK+SGALDLARQVLKDHCTF
Subjt:  PELFGENQNIKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

A0A6J1DIA5 general transcription factor 3C polypeptide 3 isoform X30.0e+0087.59Show/hide
Query:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE---------EEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK
        MEKEG++ISDN+EVPG  + V G  K + ET VE RE         EEEEEEEEEE EEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK
Subjt:  MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEARE---------EEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYK

Query:  KFERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQ
        KFERLEYEALAEKKRKALAN QSER  KRGR+EDIPGASF+EI+EAMNYGSRRKLKEPK+RGRRKGSKKK+NR++TKLLGDATLCYAQGQ+EKAIS+LRQ
Subjt:  KFERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQ

Query:  VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY
        VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDC+KAAETY
Subjt:  VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY

Query:  DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGN
        DQIHQ+C+GNVEALMTGAKLYQKCGH ERAICILEDYIK HPTEADLDVVDLLASLYMGSKEFSKALE IEHAD+VYCAGNE+PLNL TKAGICHVHLGN
Subjt:  DQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGN

Query:  MEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNM-GILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASL
        +EKAE LFANLGR+TA+DHS+ +IEAADSLLSLKH+NLALKYYLMSEEVNAGG M GI+YLKIAQCYLST+ER +AIVFFYKVLQ LEDNINARLTLASL
Subjt:  MEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNM-GILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASL

Query:  LLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVE------------------IKVNKKKLPRRVLLERVKVLDGRE
        LLEEAR+EEAISLLSPPKDSN +SSSSSK KPWWLNEKVKLKLC IY+T+GMLENFVE                  IKVNKKKLPRRVLLERVKVLDGRE
Subjt:  LLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVE------------------IKVNKKKLPRRVLLERVKVLDGRE

Query:  TGNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGA
        TG+LFRGF+PVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGV+V+YDD+DDEPALCKALASLGRCSEALEIISLTLKLAFNSLS+ERKEELQLLGA
Subjt:  TGNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGA

Query:  QLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINL
        QLAFSSTDT HGFNFAKHVVKQYPYS SAWNCYYKV+S +T+RDSRHCKLLNS+QAKYKDCAPP+IIAGHQF  ISHHQ+AA+KYLEAYK++PDSPLINL
Subjt:  QLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINL

Query:  CVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNIKHQKSVYCDLRREA
        CVGA+LINLALG RLQNKHQCVAQGLAFLY NLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP++FGEN+NIKH+KSVYCDLRREA
Subjt:  CVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNIKHQKSVYCDLRREA

Query:  AYNLHLIYKESGALDLARQVLKDHCTF
        AYNLHLIYK+SGALDLARQVLKDHCTF
Subjt:  AYNLHLIYKESGALDLARQVLKDHCTF

SwissProt top hitse value%identityAlignment
O74458 Transcription factor tau subunit sfc43.8e-2722.69Show/hide
Query:  EDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQ-GQHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDD----VK
        +DI    ++E L+A+  G R+  K  K RGR   +    + +V ++L  A   +AQ G  ++A  +  ++V    ++  ++  LG  +   G+      K
Subjt:  EDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQ-GQHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDD----VK

Query:  AMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDIN----LLFHRASLYLERGDCEKAAETYDQIHQQCLGNVEALMTGAKLYQK
         +  +M AAHL PKD  LW      S      DQA YC ++A+ A+P + +     +++R+ L  E G  +KAAE +  + Q    N   L   A++Y K
Subjt:  AMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDIN----LLFHRASLYLERGDCEKAAETYDQIHQQCLGNVEALMTGAKLYQK

Query:  CGHLERAIC----ILEDYIKEHPTEA------DLDVVDLLASLYMGSKEFSKALERI----------------------------EHADEVYCAGNE---
          H  R I     I   Y  ++P         DL  ++L A L +   ++S  +  I                            E   E   A  E   
Subjt:  CGHLERAIC----ILEDYIKEHPTEA------DLDVVDLLASLYMGSKEFSKALERI----------------------------EHADEVYCAGNE---

Query:  -----LPLNLTTKAGICHVHLGNMEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIV
             LP    TK GI  +  G + +AE  F+ +     +    ++ + A + + ++  +LAL+Y+++        N+G+ Y  +  CYL   E   A  
Subjt:  -----LPLNLTTKAGICHVHLGNMEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIV

Query:  FFYKVLQHLEDNINARLTLASL-LLEEARD---EEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEIKVNKKKLPRRVLLER
            +L     N NA + LA +  L++ RD   E   ++    ++ N      S+      NE  +       K  G      ++ V  +K+P+    +R
Subjt:  FFYKVLQHLEDNINARLTLASL-LLEEARD---EEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEIKVNKKKLPRRVLLER

Query:  VKVLDGRETGNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKE-------------------------EKKARALAAGVD--------------------
         ++   +E    F  +K    +    K    ++ L+K E + E                         EKKARA A  +                     
Subjt:  VKVLDGRETGNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKE-------------------------EKKARALAAGVD--------------------

Query:  -----VNYDDLDDEPAL--------------------CKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQ
               Y DLD +  L                       L  +G   +A ++++  +          +++ L+      +  + D        + V   
Subjt:  -----VNYDDLDDEPAL--------------------CKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQ

Query:  YPYSISAWNCYYKVAS-----SLTNRDSRHCKLL------------NSM-------------QAKYKDCAPPYIIA--GHQFTTISHHQDAARKYLEAYK
        + +    +  +  V S     S    DS + K L            NS+              A       P ++   GH          A   Y  A+ 
Subjt:  YPYSISAWNCYYKVAS-----SLTNRDSRHCKLL------------NSM-------------QAKYKDCAPPYIIA--GHQFTTISHHQDAARKYLEAYK

Query:  IMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDN-----SQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQN
        I PD P+ NL +G + ++ A+     N+H  + QG  FLY+   L  N      QEALYN+ +AYH IGL   AV YYE VL         P   G+   
Subjt:  IMPDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDN-----SQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQN

Query:  IKHQK-SVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF
              S   D   EAAYNL LIY  SG + LA Q+   +  F
Subjt:  IKHQK-SVYCDLRREAAYNLHLIYKESGALDLARQVLKDHCTF

P33339 Transcription factor tau 131 kDa subunit6.7e-1619.58Show/hide
Query:  EEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMN
        ++E++ +  E E  D G+ + E+E+         ++     E  D  +    +    EY   +E+    LA             ED   A    I EA N
Subjt:  EEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMN

Query:  YGSRRKLKEPK-KRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWK
        +  ++K K  K K   R+  ++ L+ +V +LL  A   + +   + A  +  +V+ +      +Y TLG +Y   G        + LAAHL   D   WK
Subjt:  YGSRRKLKEPK-KRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWK

Query:  LLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQ-------------CLGNVEALMTGAKLYQKC--GHLERAIC
        ++   S D   + QA YC S+ I   P +   ++ R+ LY + G   +A + + +++                  + + +    +LY K    ++ER   
Subjt:  LLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQ-------------CLGNVEALMTGAKLYQKC--GHLERAIC

Query:  IL------------------EDYIKEHPTEADLD------------------------VVDLLASLYM--------GSKEFSKALERIEHA---------
        IL                  ED  ++ P E D D                         +++LA L++        G K   K    I+           
Subjt:  IL------------------EDYIKEHPTEADLD------------------------VVDLLASLYM--------GSKEFSKALERIEHA---------

Query:  ----------------DEVYCAGNE----LPLNLTTKAGICHVHLGNMEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYY---LMSEEVN
                        D +  A  E    +P+++  + G+  ++  N+ +A   F  L  ET +D ++L  EAA +L   + Y  A+ ++   L  EE  
Subjt:  ----------------DEVYCAGNE----LPLNLTTKAGICHVHLGNMEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYY---LMSEEVN

Query:  AGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLL------------------LEEARDEEAISLLSPPKDSNPTSSSSSK--LK
               ++  +A+CY   +    A  F+   ++   D+++ R++LA +                   + + + +E +  +S  K SN TS  SSK  L+
Subjt:  AGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLL------------------LEEARDEEAISLLSPPKDSNPTSSSSSK--LK

Query:  PWWLNEKVKLKLCQIYKTRGMLENFVEIKVNKKKLPRRVLLERVKVLDG--------------RETGNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKE
                K K       R  +E   E ++  K + +   +++ ++  G               E  ++F   K    K   S++ +   +L++ ++   
Subjt:  PWWLNEKVKLKLCQIYKTRGMLENFVEIKVNKKKLPRRVLLERVKVLDG--------------RETGNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKE

Query:  E-----KKARALAAGVDVNYDDLDDEPA----------------------LCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLAFSSTD
        E     ++   LA G  V    L +E                        L   +A      + L ++    ++       ER + ++ +   +     D
Subjt:  E-----KKARALAAGVDVNYDDLDDEPA----------------------LCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLAFSSTD

Query:  ----------TMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRD------------------SRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQD
                   ++ F F + V++ + YS+        + SS   +                   +    + N         + PY+   +     S    
Subjt:  ----------TMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRD------------------SRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQD

Query:  AARKYLEAYKIM-------PDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDN------SQEALYNIARAYHHIGLVTLAVTYYEKVL
         +R +L A + +       PD P++NL +G S I+ A+      +H  +  GL +LY+  K+  +       QEA YN+ RA+H IGLV++A+ YY +VL
Subjt:  AARKYLEAYKIM-------PDSPLINLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDN------SQEALYNIARAYHHIGLVTLAVTYYEKVL

Query:  ATYQKDCPIPELFGENQNIKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDH
          Y                         L++ AAYN  +IY++SG ++LA  +++ +
Subjt:  ATYQKDCPIPELFGENQNIKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDH

Q61371 Intraflagellar transport protein 88 homolog4.5e-0423.83Show/hide
Query:  ERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILE--AMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQ
        +R    AL  K     ANG  E+AA+  +      +S  E L    + Y    +L E      +  +   L      L   A +        +AI  L Q
Subjt:  ERLEYEALAEKKRKALANGQSERAAKRGRVEDIPGASFDEILE--AMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQ

Query:  VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY
        ++   P    +   LG +Y++ GD  +A  +Y  +    P +  + + L ++ ID    ++A     +A   +P  +      AS +   G+ +KA +TY
Subjt:  VVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETY

Query:  DQIHQQCLGNVEAL
         +IH++   NVE L
Subjt:  DQIHQQCLGNVEAL

Q9Y5Q9 General transcription factor 3C polypeptide 32.6e-6024.72Show/hide
Query:  EVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQSERAAKRGRVEDI
        E  E   E R+  E++  +E+G+   E+  +D        I   K+ +   D  EG + S    K   ++    L E +       + E   +     + 
Subjt:  EVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQSERAAKRGRVEDI

Query:  PGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLA
        P A    +LE +        +E KK  + K  + KL R +  L+G+A + +A+G+ E+AI +  +++ QAP   + + TL ++Y   GD  K++ F ++A
Subjt:  PGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLA

Query:  AHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGN-----VEALMTGAKLYQKCGHLERA
        AHL P D+  W  L   S+++ +I QA +C +KA+K EP ++  L+ R+SLY + GD + A + Y +I      +     ++     AK Y +   +  A
Subjt:  AHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGN-----VEALMTGAKLYQKCGHLERA

Query:  ICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERI---------------------EHADEVYCA-GNELPLNLTTKAGICHVHLGNMEKAECLF
        I I+++   +H     ++ V++ A LY+ +K++ KALE I                     +  + V C   + +P+++T K  +C VHL  +E    L 
Subjt:  ICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERI---------------------EHADEVYCA-GNELPLNLTTKAGICHVHLGNMEKAECLF

Query:  ANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEE
          L  +   D  +L ++ A++ L +  YN AL   L +   +   N+ +++L+ A+C  +     +A   + KV+     +++AR++L++L  +  + E+
Subjt:  ANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEE

Query:  AISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFKPVAPKSDLSKASRAKRLL
        A+  L P  D +  +  ++  +      K+ L    +  ++G +  +V+  +    +  +V + R +V        L    K       L K SR K + 
Subjt:  AISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEIKVNKKKLPRRVLLERVKVLDGRETGNLFRGFKPVAPKSDLSKASRAKRLL

Query:  QKRERIKEEKKARALAAGVDVNYDDLDDEPALCKA---LASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYP
           ++      A+A+ A +       D    L KA   L  L R  EA  ++  +L+        ++++EL+  G   A    +    +N+ + +V +  
Subjt:  QKRERIKEEKKARALAAGVDVNYDDLDDEPALCKA---LASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYP

Query:  YSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQ
             WN + +V  ++ ++D RH +    +  K  +     ++ GH        + A  +Y++A++  PD PL + C+G + I++A    +  +H  + Q
Subjt:  YSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGASLINLALGFRLQNKHQCVAQ

Query:  GLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNIKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDH
        G +FL + L L    QE+ YN+ R  H +GL+ LA+ YY+K L        +P L  E   +        DLRR+ AYNL LIY+ SG   +A+ +L  +
Subjt:  GLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNIKHQKSVYCDLRREAAYNLHLIYKESGALDLARQVLKDH

Query:  CT
        C+
Subjt:  CT

Arabidopsis top hitse value%identityAlignment
AT1G17680.1 tetratricopeptide repeat (TPR)-containing protein4.9e-18743.36Show/hide
Query:  EGSKISDNEEVPGGV---MRVLGAEKEVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA
        EG+ IS+ EE P  +    +VLG +    +  + + EE   ++++++ ++  +DEG++  EE+D     F+AG  P                FER EYEA
Subjt:  EGSKISDNEEVPGGV---MRVLGAEKEVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA

Query:  LAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLP
        LAE+KRKALA+ Q   +        + G      +E M+ G RRK ++ KK+GRR GSKK++  D+ K   +A   +A G+  +A+ IL +V+ QAP   
Subjt:  LAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLP

Query:  DSYHTLGLVYNAIG-DDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL
         +Y+ L  V   +G  +  +     +AA++    S  WKLL+    ++ +I  A    SKAI+A+PDDI L +  A + L  G   +AAET++QI ++C 
Subjt:  DSYHTLGLVYNAIG-DDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL

Query:  GNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLF
          +EAL  G + + K G  ERA  ILED+IK H +E   DV+DLLAS++M      +AL+ I    ++Y  G EL  +L  +  ICHVHL  ME+AE + 
Subjt:  GNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLF

Query:  ANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYL--MSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARD
        + L +E  ++H  L+   AD L ++ +++ ALKYY+  +SE VN     G L++KIA+CY+S +ER QAIVF+YK L  L D ++ R+TLASLLLE+ + 
Subjt:  ANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYL--MSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARD

Query:  EEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEIK---VNKKKLPRRVLLERVKVLDGRETGNLFRGFKPVAPKSDL----S
        +EA+ +LSPP++ +P    ++KLK WW N K+++ LCQIY + GMLE+F       V K    R V  +R +++      N  R     A  S L     
Subjt:  EEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEIK---VNKKKLPRRVLLERVKVLDGRETGNLFRGFKPVAPKSDL----S

Query:  KASRAKRLLQKRERIKEEKKARALAAGV--DVNYDDLDDEP------ALCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLAFSSTDTM
        K  + +  L +  RI+E    +A    V  +   + + DE        LCKALASL R  EALEI++L  +L    L +E K+ELQ LGA+++  + D  
Subjt:  KASRAKRLLQKRERIKEEKKARALAAGV--DVNYDDLDDEP------ALCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLAFSSTDTM

Query:  HGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGASLINLA
          F+  + V++Q+PY ++AWNCYY V S L  R S   K ++ +++KY+DC PP +IAGH FT  S HQDAAR+YLEAYK+MP+SPLINLCVGA+LINLA
Subjt:  HGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGASLINLA

Query:  LGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNI-KHQKSVYCDLRREAAYNLHLIYK
        LGFRL+N+H+C+AQG AFLY NL++C NSQEALYN+ARAY H+GLVTLA +YYEKVLA Y+KD  +P+L  E+  + + +K V CDLR+EAA+NLHLIYK
Subjt:  LGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNI-KHQKSVYCDLRREAAYNLHLIYK

Query:  ESGALDLARQVLKDHCTF
         SGA DLARQVLKDHCTF
Subjt:  ESGALDLARQVLKDHCTF

AT1G17680.2 tetratricopeptide repeat (TPR)-containing protein4.9e-18743.36Show/hide
Query:  EGSKISDNEEVPGGV---MRVLGAEKEVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA
        EG+ IS+ EE P  +    +VLG +    +  + + EE   ++++++ ++  +DEG++  EE+D     F+AG  P                FER EYEA
Subjt:  EGSKISDNEEVPGGV---MRVLGAEKEVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA

Query:  LAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLP
        LAE+KRKALA+ Q   +        + G      +E M+ G RRK ++ KK+GRR GSKK++  D+ K   +A   +A G+  +A+ IL +V+ QAP   
Subjt:  LAEKKRKALANGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLP

Query:  DSYHTLGLVYNAIG-DDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL
         +Y+ L  V   +G  +  +     +AA++    S  WKLL+    ++ +I  A    SKAI+A+PDDI L +  A + L  G   +AAET++QI ++C 
Subjt:  DSYHTLGLVYNAIG-DDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCL

Query:  GNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLF
          +EAL  G + + K G  ERA  ILED+IK H +E   DV+DLLAS++M      +AL+ I    ++Y  G EL  +L  +  ICHVHL  ME+AE + 
Subjt:  GNVEALMTGAKLYQKCGHLERAICILEDYIKEHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLF

Query:  ANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYL--MSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARD
        + L +E  ++H  L+   AD L ++ +++ ALKYY+  +SE VN     G L++KIA+CY+S +ER QAIVF+YK L  L D ++ R+TLASLLLE+ + 
Subjt:  ANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYL--MSEEVNAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARD

Query:  EEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEIK---VNKKKLPRRVLLERVKVLDGRETGNLFRGFKPVAPKSDL----S
        +EA+ +LSPP++ +P    ++KLK WW N K+++ LCQIY + GMLE+F       V K    R V  +R +++      N  R     A  S L     
Subjt:  EEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEIK---VNKKKLPRRVLLERVKVLDGRETGNLFRGFKPVAPKSDL----S

Query:  KASRAKRLLQKRERIKEEKKARALAAGV--DVNYDDLDDEP------ALCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLAFSSTDTM
        K  + +  L +  RI+E    +A    V  +   + + DE        LCKALASL R  EALEI++L  +L    L +E K+ELQ LGA+++  + D  
Subjt:  KASRAKRLLQKRERIKEEKKARALAAGV--DVNYDDLDDEP------ALCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLAFSSTDTM

Query:  HGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGASLINLA
          F+  + V++Q+PY ++AWNCYY V S L  R S   K ++ +++KY+DC PP +IAGH FT  S HQDAAR+YLEAYK+MP+SPLINLCVGA+LINLA
Subjt:  HGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLINLCVGASLINLA

Query:  LGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNI-KHQKSVYCDLRREAAYNLHLIYK
        LGFRL+N+H+C+AQG AFLY NL++C NSQEALYN+ARAY H+GLVTLA +YYEKVLA Y+KD  +P+L  E+  + + +K V CDLR+EAA+NLHLIYK
Subjt:  LGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNI-KHQKSVYCDLRREAAYNLHLIYK

Query:  ESGALDLARQVLKDHCTF
         SGA DLARQVLKDHCTF
Subjt:  ESGALDLARQVLKDHCTF

AT3G04240.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-0523.53Show/hide
Query:  ATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLF
        A L    G   +A+   ++ V   P  PD+Y  LG VY A+G   +A+  Y  A  + P  +  +  + S   ++G +D A     +A+  +P  +    
Subjt:  ATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLF

Query:  HRASLYLERGDCEKAAETYDQI------HQQCLGNV
        +  +   + G  ++A   Y+Q       H Q + N+
Subjt:  HRASLYLERGDCEKAAETYDQI------HQQCLGNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAAGGAAGGGAGTAAAATTTCTGACAATGAAGAGGTTCCTGGTGGTGTTATGCGTGTTTTAGGAGCAGAAAAAGAGGTTGTAGAAACAGGAGTGGAGGCTAGAGA
GGAGGAGGAAGAGGAAGAGGAAGAGGAAGAAGGAGAGGAAGAAGTGGAGGATGAAGGAGAGGATGATATTGAAGAAGAAGATGGTTACATATTCAAATTTAAGGCCGGAG
AAAATCCATTTGATTTTGTTGAAGGGACAGATTTTAGCATCCAACCATATAAAAAATTTGAGCGCCTTGAATATGAAGCTCTTGCTGAGAAAAAGAGAAAAGCTCTTGCA
AATGGTCAGAGTGAGAGAGCTGCAAAGAGGGGCAGGGTAGAAGATATTCCTGGTGCAAGCTTTGATGAAATTTTGGAAGCTATGAATTATGGATCTAGGAGGAAGTTAAA
AGAGCCTAAAAAAAGAGGTAGACGGAAAGGATCAAAGAAAAAACTTAATCGTGATGTTACGAAGTTGCTTGGTGATGCAACTTTATGTTATGCTCAAGGCCAGCATGAGA
AGGCTATATCTATATTGCGTCAAGTTGTTCTGCAAGCCCCAGACTTACCTGATTCGTACCATACGCTTGGACTTGTATACAATGCAATTGGTGATGATGTAAAAGCCATG
GGATTCTACATGCTTGCTGCACATTTAATGCCAAAAGATTCATCTCTCTGGAAACTGCTATTTTCATGGTCAATTGATCGAGGTGATATTGATCAAGCAAGCTACTGTCT
TTCTAAAGCAATAAAAGCAGAGCCTGATGACATTAATTTATTATTTCATCGTGCGTCACTCTACCTTGAGCGTGGAGATTGTGAAAAAGCAGCTGAAACATATGATCAGA
TTCATCAACAATGCCTTGGAAATGTTGAAGCACTCATGACAGGAGCAAAGCTGTACCAAAAATGCGGTCATCTTGAACGTGCAATTTGCATTCTTGAGGACTACATCAAA
GAGCATCCAACTGAAGCTGATTTAGATGTGGTTGATCTTTTAGCTTCTTTATACATGGGAAGCAAAGAATTCAGCAAAGCTCTTGAGCGCATCGAGCATGCAGATGAGGT
TTACTGTGCAGGAAATGAGCTACCTTTAAACTTGACAACTAAAGCAGGAATTTGCCACGTTCACCTTGGAAATATGGAGAAAGCAGAGTGCCTCTTTGCTAATTTGGGAC
GGGAAACTGCCAATGATCACTCAAATTTGATGATTGAAGCTGCAGACTCGTTGCTGAGTCTTAAGCACTATAACTTGGCATTGAAGTATTATCTAATGTCCGAAGAAGTA
AATGCTGGAGGGAACATGGGGATTTTATACCTAAAAATTGCCCAGTGTTACTTATCAACCGATGAAAGAACACAGGCAATTGTTTTCTTTTATAAAGTGCTTCAACATCT
TGAAGATAACATTAATGCTCGATTAACTTTGGCCTCCCTCCTCCTTGAGGAAGCTAGAGATGAAGAAGCCATTTCATTACTATCTCCTCCAAAAGATTCAAACCCAACTA
GCTCATCTTCCAGCAAATTAAAACCTTGGTGGCTCAATGAGAAAGTAAAACTGAAGCTTTGCCAAATATACAAAACTAGAGGAATGCTTGAGAACTTCGTTGAGATTAAA
GTGAACAAGAAGAAGCTTCCAAGGAGGGTTTTGCTTGAAAGAGTCAAAGTATTAGATGGACGTGAGACTGGTAACCTATTTCGTGGATTCAAACCTGTGGCTCCTAAATC
AGATTTATCAAAAGCGTCCAGAGCAAAGAGATTGCTTCAAAAGAGGGAAAGAATCAAGGAAGAAAAGAAGGCTAGAGCGCTGGCTGCGGGAGTCGATGTGAACTATGATG
ATTTAGATGATGAGCCAGCGTTGTGCAAGGCATTGGCTTCCTTGGGAAGATGTTCTGAAGCTTTAGAGATTATAAGTCTAACTTTAAAGTTGGCTTTTAACTCGTTATCC
ATAGAAAGGAAGGAAGAACTCCAGTTACTTGGAGCTCAATTAGCATTCAGCTCAACTGATACCATGCATGGTTTCAACTTTGCAAAGCACGTTGTTAAGCAGTACCCTTA
TAGCATCTCTGCTTGGAACTGCTATTATAAAGTAGCTTCAAGTTTGACGAACCGGGATTCAAGGCATTGCAAGCTTCTGAATAGCATGCAAGCCAAATACAAAGATTGTG
CACCACCCTATATCATTGCCGGGCATCAATTTACCACCATTAGCCATCATCAAGATGCTGCAAGGAAATATCTTGAAGCTTACAAAATCATGCCGGATAGTCCCCTGATT
AACTTGTGTGTTGGAGCATCCTTAATCAACTTGGCTCTTGGATTCCGTCTTCAAAATAAGCATCAGTGTGTTGCACAAGGCTTGGCATTCCTCTACAAAAATTTGAAGCT
TTGTGATAACAGCCAGGAAGCCTTGTACAACATAGCTCGAGCATATCATCACATCGGACTCGTGACACTGGCGGTTACATATTACGAAAAGGTGCTTGCAACTTACCAGA
AGGATTGCCCCATTCCAGAACTTTTTGGTGAGAATCAAAACATTAAGCATCAGAAATCTGTCTATTGTGATCTACGCAGAGAAGCAGCTTACAATTTGCATCTGATATAT
AAAGAGAGTGGAGCTCTTGATCTTGCCAGGCAAGTCCTAAAAGATCATTGCACATTTTAA
mRNA sequenceShow/hide mRNA sequence
GATAATTTAACCATTTTTTTAAAATCATCCTTCCAAACCCCTCCGTCCCATTTACCAGCCGCTGCTCCTTTCCGGTCGCCGTTCCTACCGTCAGACGGCAGCCGCTGCAG
CTTAGCACTCCGACGAAATCGCCGCCGCCCCTCCCTCTCACACTCTCTCGCCCCTCCGAGGAATTTCCTTTATTTATTTATTCTTTTTAACCATTCTAGTTTTCTTTTGT
TCTATCACGTTCCAGCCACCCCTCCACCCCGCCGCACCGGACGCCGCCGCCGTTTCCCTCACGCCACTCGCCGTTCCTAAACCTTTATAATCTTTTTTGAGTACGAAGGT
TAAATTACAAATTTGTGATAAAGACTCTTTTTTTCCCTTAAAATTGGATTAACATTTACAATTTCCCCTGCCACTCCCTCTCTTTGCCGCTGCCCACCGCCGCCGCCTCC
ATCCATTGTGCTGCAGCCACCCTCCGATCAGAACATTTTCTTTCTAGAGTTTCCAGTATCGTGCATTGGAATGGAAAAGGAAGGGAGTAAAATTTCTGACAATGAAGAGG
TTCCTGGTGGTGTTATGCGTGTTTTAGGAGCAGAAAAAGAGGTTGTAGAAACAGGAGTGGAGGCTAGAGAGGAGGAGGAAGAGGAAGAGGAAGAGGAAGAAGGAGAGGAA
GAAGTGGAGGATGAAGGAGAGGATGATATTGAAGAAGAAGATGGTTACATATTCAAATTTAAGGCCGGAGAAAATCCATTTGATTTTGTTGAAGGGACAGATTTTAGCAT
CCAACCATATAAAAAATTTGAGCGCCTTGAATATGAAGCTCTTGCTGAGAAAAAGAGAAAAGCTCTTGCAAATGGTCAGAGTGAGAGAGCTGCAAAGAGGGGCAGGGTAG
AAGATATTCCTGGTGCAAGCTTTGATGAAATTTTGGAAGCTATGAATTATGGATCTAGGAGGAAGTTAAAAGAGCCTAAAAAAAGAGGTAGACGGAAAGGATCAAAGAAA
AAACTTAATCGTGATGTTACGAAGTTGCTTGGTGATGCAACTTTATGTTATGCTCAAGGCCAGCATGAGAAGGCTATATCTATATTGCGTCAAGTTGTTCTGCAAGCCCC
AGACTTACCTGATTCGTACCATACGCTTGGACTTGTATACAATGCAATTGGTGATGATGTAAAAGCCATGGGATTCTACATGCTTGCTGCACATTTAATGCCAAAAGATT
CATCTCTCTGGAAACTGCTATTTTCATGGTCAATTGATCGAGGTGATATTGATCAAGCAAGCTACTGTCTTTCTAAAGCAATAAAAGCAGAGCCTGATGACATTAATTTA
TTATTTCATCGTGCGTCACTCTACCTTGAGCGTGGAGATTGTGAAAAAGCAGCTGAAACATATGATCAGATTCATCAACAATGCCTTGGAAATGTTGAAGCACTCATGAC
AGGAGCAAAGCTGTACCAAAAATGCGGTCATCTTGAACGTGCAATTTGCATTCTTGAGGACTACATCAAAGAGCATCCAACTGAAGCTGATTTAGATGTGGTTGATCTTT
TAGCTTCTTTATACATGGGAAGCAAAGAATTCAGCAAAGCTCTTGAGCGCATCGAGCATGCAGATGAGGTTTACTGTGCAGGAAATGAGCTACCTTTAAACTTGACAACT
AAAGCAGGAATTTGCCACGTTCACCTTGGAAATATGGAGAAAGCAGAGTGCCTCTTTGCTAATTTGGGACGGGAAACTGCCAATGATCACTCAAATTTGATGATTGAAGC
TGCAGACTCGTTGCTGAGTCTTAAGCACTATAACTTGGCATTGAAGTATTATCTAATGTCCGAAGAAGTAAATGCTGGAGGGAACATGGGGATTTTATACCTAAAAATTG
CCCAGTGTTACTTATCAACCGATGAAAGAACACAGGCAATTGTTTTCTTTTATAAAGTGCTTCAACATCTTGAAGATAACATTAATGCTCGATTAACTTTGGCCTCCCTC
CTCCTTGAGGAAGCTAGAGATGAAGAAGCCATTTCATTACTATCTCCTCCAAAAGATTCAAACCCAACTAGCTCATCTTCCAGCAAATTAAAACCTTGGTGGCTCAATGA
GAAAGTAAAACTGAAGCTTTGCCAAATATACAAAACTAGAGGAATGCTTGAGAACTTCGTTGAGATTAAAGTGAACAAGAAGAAGCTTCCAAGGAGGGTTTTGCTTGAAA
GAGTCAAAGTATTAGATGGACGTGAGACTGGTAACCTATTTCGTGGATTCAAACCTGTGGCTCCTAAATCAGATTTATCAAAAGCGTCCAGAGCAAAGAGATTGCTTCAA
AAGAGGGAAAGAATCAAGGAAGAAAAGAAGGCTAGAGCGCTGGCTGCGGGAGTCGATGTGAACTATGATGATTTAGATGATGAGCCAGCGTTGTGCAAGGCATTGGCTTC
CTTGGGAAGATGTTCTGAAGCTTTAGAGATTATAAGTCTAACTTTAAAGTTGGCTTTTAACTCGTTATCCATAGAAAGGAAGGAAGAACTCCAGTTACTTGGAGCTCAAT
TAGCATTCAGCTCAACTGATACCATGCATGGTTTCAACTTTGCAAAGCACGTTGTTAAGCAGTACCCTTATAGCATCTCTGCTTGGAACTGCTATTATAAAGTAGCTTCA
AGTTTGACGAACCGGGATTCAAGGCATTGCAAGCTTCTGAATAGCATGCAAGCCAAATACAAAGATTGTGCACCACCCTATATCATTGCCGGGCATCAATTTACCACCAT
TAGCCATCATCAAGATGCTGCAAGGAAATATCTTGAAGCTTACAAAATCATGCCGGATAGTCCCCTGATTAACTTGTGTGTTGGAGCATCCTTAATCAACTTGGCTCTTG
GATTCCGTCTTCAAAATAAGCATCAGTGTGTTGCACAAGGCTTGGCATTCCTCTACAAAAATTTGAAGCTTTGTGATAACAGCCAGGAAGCCTTGTACAACATAGCTCGA
GCATATCATCACATCGGACTCGTGACACTGGCGGTTACATATTACGAAAAGGTGCTTGCAACTTACCAGAAGGATTGCCCCATTCCAGAACTTTTTGGTGAGAATCAAAA
CATTAAGCATCAGAAATCTGTCTATTGTGATCTACGCAGAGAAGCAGCTTACAATTTGCATCTGATATATAAAGAGAGTGGAGCTCTTGATCTTGCCAGGCAAGTCCTAA
AAGATCATTGCACATTTTAAGGTACAAAGCATATTTTTTGGACAGCAACTATGTGTTAGCATGTAACAAAAAAGTAAAGTTATCTTATTTTTTTATAGATATTGTACATA
GGTTGAGGATGTAGGATATGGTAGTTGGAAGTAAAAGGATTAACTAATTGAACTCGATGGGTATATACTCGGATATATACTAAGGATTCAGTTGATTATGTATATATATA
TATATATTATATCTATCAATGATGTCACTATGC
Protein sequenceShow/hide protein sequence
MEKEGSKISDNEEVPGGVMRVLGAEKEVVETGVEAREEEEEEEEEEEGEEEVEDEGEDDIEEEDGYIFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALA
NGQSERAAKRGRVEDIPGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTLGLVYNAIGDDVKAM
GFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGNVEALMTGAKLYQKCGHLERAICILEDYIK
EHPTEADLDVVDLLASLYMGSKEFSKALERIEHADEVYCAGNELPLNLTTKAGICHVHLGNMEKAECLFANLGRETANDHSNLMIEAADSLLSLKHYNLALKYYLMSEEV
NAGGNMGILYLKIAQCYLSTDERTQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCQIYKTRGMLENFVEIK
VNKKKLPRRVLLERVKVLDGRETGNLFRGFKPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALCKALASLGRCSEALEIISLTLKLAFNSLS
IERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLI
NLCVGASLINLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENQNIKHQKSVYCDLRREAAYNLHLIY
KESGALDLARQVLKDHCTF