; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG07G013810 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG07G013810
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Descriptiongeneral transcription factor 3C polypeptide 3
Genome locationCG_Chr07:30282219..30298528
RNA-Seq ExpressionClCG07G013810
SyntenyClCG07G013810
Gene Ontology termsGO:0006383 - transcription by RNA polymerase III (biological process)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily
IPR019734 - Tetratricopeptide repeat
IPR039340 - Transcription factor Tfc4/TFIIIC-102/Sfc4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008447634.1 PREDICTED: general transcription factor 3C polypeptide 3 isoform X1 [Cucumis melo]0.0e+0093.18Show/hide
Query:  REGNKISDNEEVPGGAMCVLGAEKEAVETEVEDREEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAE
        +EGN+ISD+EEVPG  M VLG EKE VET V DR  EEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFS+QPYKKFERLEYEALAE
Subjt:  REGNKISDNEEVPGGAMCVLGAEKEAVETEVEDREEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAE

Query:  KKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPDSY
        KKRKALANGQSERAAKRGRVED++GASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAIS+LRQVVLQAPDLPDSY
Subjt:  KKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPDSY

Query:  HTLGLVYSAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHRQCLGNVE
        HTLGLVY+AIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSI+RGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIH+QCLGNVE
Subjt:  HTLGLVYSAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHRQCLGNVE

Query:  ALMTGAKLYQKCGHLERAICILEDYIKEHPAEADLDVVDLLASIYMGSKEFSKALECIEHAGEVYCAGNELPLNLTTKAGICLVHLGNMEKAECLFANLG
        ALMTGAKLYQKCGHLERAICILEDYIKEHP+EADLDVVDLLAS+YMGSKEFSKALE IEHA  VYCAGNELPLNLT KAGIC  HLGN+EKAECLFANL 
Subjt:  ALMTGAKLYQKCGHLERAICILEDYIKEHPAEADLDVVDLLASIYMGSKEFSKALECIEHAGEVYCAGNELPLNLTTKAGICLVHLGNMEKAECLFANLG

Query:  RETAYDHSNLMIEVADSLLSLKHYNVALKYYLMFEEVNAGGNMGILYLKIAQCYLSINERAQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEAISL
        RET YDHSNLMIEVADSLLSLKHY+ ALKYYLM EEVNAG NMGILY K+A+CYLS NE+ QAIVFFYKVLQH+EDNINARLTLASLLLEEARDEEAISL
Subjt:  RETAYDHSNLMIEVADSLLSLKHYNVALKYYLMFEEVNAGGNMGILYLKIAQCYLSINERAQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEAISL

Query:  LSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQEKVKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPVAP
        LSPPKDSNPTSSSS KLKPWWLNEKVKLKLCHIY+TRG++ENFVEVIFPLVRESLYIETLQEK+KVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPVAP
Subjt:  LSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQEKVKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPVAP

Query:  KSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNSLSI
        KSDL+KASRAKRLLQKR+RIKEEKKA+ LAAGV+V+YDDLDDEPALRMHRESPLPNLLKEEE+H LIVDLCKALASLGRCSEALEIISLTLKLAFNSLS 
Subjt:  KSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNSLSI

Query:  ERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAY
        ERKEELQLLGAQLAFSST TMHGFNFAKHVVKQYPYSISAWNCYYKVAS LTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAY
Subjt:  ERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAY

Query:  KIMPDSPLVNLCVGTSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNIKHQ
        KIMPDSPL+NLCVG+SL+NLALGFRLQNKHQCVAQGLAFLYKNLKLCDN+QEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNIKHQ
Subjt:  KIMPDSPLVNLCVGTSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNIKHQ

Query:  KSVYCDLRREAAYNLHLIYKESGAFDLARQVLKDNCTF
         SVYCDLRREAAYNLHLIYKESGA DLARQVLKD+CTF
Subjt:  KSVYCDLRREAAYNLHLIYKESGAFDLARQVLKDNCTF

XP_011652346.1 general transcription factor 3C polypeptide 3 [Cucumis sativus]0.0e+0093.4Show/hide
Query:  REGNKISDNEEVPGGAMCVLGAEKEAVETEVEDR--EEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEAL
        +EG++ISD EEVPG  M VLG EKE VET VEDR  EEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFS+QPYKKFERLEYEAL
Subjt:  REGNKISDNEEVPGGAMCVLGAEKEAVETEVEDR--EEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEAL

Query:  AEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPD
        AEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQG+HEKAIS+LRQVVL+APDLPD
Subjt:  AEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPD

Query:  SYHTLGLVYSAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHRQCLGN
        SYHTLGLVY+AIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIH+QCLGN
Subjt:  SYHTLGLVYSAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHRQCLGN

Query:  VEALMTGAKLYQKCGHLERAICILEDYIKEHPAEADLDVVDLLASIYMGSKEFSKALECIEHAGEVYCAGNELPLNLTTKAGICLVHLGNMEKAECLFAN
        VEALMTGAKLYQKCGHLERAICILEDYIK HP+EADLDVVDLLAS+YMGSKEFSKALE IEHA  VYCAGNELPLNLTTKAGIC  HLG++EKAECLFAN
Subjt:  VEALMTGAKLYQKCGHLERAICILEDYIKEHPAEADLDVVDLLASIYMGSKEFSKALECIEHAGEVYCAGNELPLNLTTKAGICLVHLGNMEKAECLFAN

Query:  LGRETAYDHSNLMIEVADSLLSLKHYNVALKYYLMFEEVNAGGNMGILYLKIAQCYLSINERAQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEAI
        L RET YDHSNLMIEVADSL+SLKHY+ ALKYYLM EEVNAG NMGILYLKIA+CYLS NER QAIVFFYKVLQH+EDNINARLTLASLLLEEARD+EAI
Subjt:  LGRETAYDHSNLMIEVADSLLSLKHYNVALKYYLMFEEVNAGGNMGILYLKIAQCYLSINERAQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEAI

Query:  SLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQEKVKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPV
        SLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIY+TRG++ENFVEVIFPLVRESLYIETLQEK+KVNKKKLPRRVLLERVKVLDGRETGNLFRGF+PV
Subjt:  SLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQEKVKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPV

Query:  APKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNSL
        APKSDL+KASRAKRLLQKRERIKEEKKA+ALAAGV+++YDDLDDEPALRMHRESPLPNLLKEEEYH LIVDLCKALASLGRCSEALEIISLTLKLAFNSL
Subjt:  APKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNSL

Query:  SIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLE
        S+ERKEELQLLGAQLAFSST TMHGFNFAKHVVKQYPYSISAWNCYYKVAS LTNRDSRHCKLLNSMQ+KYKDCAPPYIIAGHQFTTISHHQDAARKYLE
Subjt:  SIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLE

Query:  AYKIMPDSPLVNLCVGTSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNIK
        AYKIMPDSPL+NLCVG+SL+NLALGFRLQNKHQCVAQGLAFLYKNLKLCDN+QEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNIK
Subjt:  AYKIMPDSPLVNLCVGTSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNIK

Query:  HQKSVYCDLRREAAYNLHLIYKESGAFDLARQVLKDNCTF
        HQ SVYCDLRREAAYNLHLIYKESGA DLARQVLKD+CTF
Subjt:  HQKSVYCDLRREAAYNLHLIYKESGAFDLARQVLKDNCTF

XP_022152620.1 general transcription factor 3C polypeptide 3 isoform X2 [Momordica charantia]0.0e+0087.87Show/hide
Query:  REGNKISDNEEVPGGAMCVLGAEKEAVETEVEDR----------EEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKF
        +EGN+ISDN+EVPG A+ V G  K   ETEVE+R          EEEEEEEEEE EEEVEDEGEDDIEEEDGY FKFKAGENPFDFVEGTDFSIQPYKKF
Subjt:  REGNKISDNEEVPGGAMCVLGAEKEAVETEVEDR----------EEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKF

Query:  ERLEYEALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVV
        ERLEYEALAEKKRKALAN QSER  KRGR+EDI GASF+EI+EAMNYGSRRKLKEPK+RGRRKGSKKK+NR++TKLLGDATLCYAQGQ+EKAIS+LRQVV
Subjt:  ERLEYEALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVV

Query:  LQAPDLPDSYHTLGLVYSAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQ
        LQAPDLPDSYHTLGLVY+AIGDDVKAMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDC+KAAETYDQ
Subjt:  LQAPDLPDSYHTLGLVYSAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQ

Query:  IHRQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPAEADLDVVDLLASIYMGSKEFSKALECIEHAGEVYCAGNELPLNLTTKAGICLVHLGNME
        IH++C+GNVEALMTGAKLYQKCGH ERAICILEDYIK HP EADLDVVDLLAS+YMGSKEFSKALE IEHA +VYCAGNE+PLNL TKAGIC VHLGN+E
Subjt:  IHRQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPAEADLDVVDLLASIYMGSKEFSKALECIEHAGEVYCAGNELPLNLTTKAGICLVHLGNME

Query:  KAECLFANLGRETAYDHSNLMIEVADSLLSLKHYNVALKYYLMFEEVNAGGNMGILYLKIAQCYLSINERAQAIVFFYKVLQHLEDNINARLTLASLLLE
        KAE LFANLGR+TA DHS+ +IE ADSLLSLKH+N+ALKYYLM EEVNAGG MGI+YLKIAQCYLS NERA+AIVFFYKVLQ LEDNINARLTLASLLLE
Subjt:  KAECLFANLGRETAYDHSNLMIEVADSLLSLKHYNVALKYYLMFEEVNAGGNMGILYLKIAQCYLSINERAQAIVFFYKVLQHLEDNINARLTLASLLLE

Query:  EARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQEKVKVNKKKLPRRVLLERVKVLDGRETGN
        EAR+EEAISLLSPPKDSN +SSSSSK KPWWLNEKVKLKLC+IY+T+GM+ENFVE IF LVRESLYIETL+EK+KVNKKKLPRRVLLERVKVLDGRETG+
Subjt:  EARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQEKVKVNKKKLPRRVLLERVKVLDGRETGN

Query:  LFRGFRPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLT
        LFRGFRPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGV+V+YDD+DDEPALR+HRESPLPNLLK+EEYHNLIVDLCKALASLGRCSEALEIISLT
Subjt:  LFRGFRPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLT

Query:  LKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQ
        LKLAFNSLS+ERKEELQLLGAQLAFSSTDT HGFNFAKHVVKQYPYS SAWNCYYKV+S +T+RDSRHCKLLNS+QAKYKDCAPP+IIAGHQF  ISHHQ
Subjt:  LKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQ

Query:  DAARKYLEAYKIMPDSPLVNLCVGTSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPEL
        +AA+KYLEAYK++PDSPL+NLCVG +L+NLALG RLQNKHQCVAQGLAFLY NLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP++
Subjt:  DAARKYLEAYKIMPDSPLVNLCVGTSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPEL

Query:  FGENRNIKHQKSVYCDLRREAAYNLHLIYKESGAFDLARQVLKDNCTF
        FGENRNIKH+KSVYCDLRREAAYNLHLIYK+SGA DLARQVLKD+CTF
Subjt:  FGENRNIKHQKSVYCDLRREAAYNLHLIYKESGAFDLARQVLKDNCTF

XP_022997988.1 general transcription factor 3C polypeptide 3 [Cucurbita maxima]0.0e+0089.05Show/hide
Query:  EGNKISDNEEVPGGAMCVLGAEKEAVETEVEDR----EEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA
        EGN ISDNEEVPG   C +   K  VETEVEDR    EEEEEEEEEE EEEVEDEGEDD EEEDGY FKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA
Subjt:  EGNKISDNEEVPGGAMCVLGAEKEAVETEVEDR----EEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA

Query:  LAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLP
        LAEKKRKALANGQSER+AKRGRVEDISGASF+EI+EAMNYGSRRK ++PKKRGRRKGSK+KLN DVTKLLGDATLCYAQGQ+EKAIS+L QVVLQAPD+P
Subjt:  LAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLP

Query:  DSYHTLGLVYSAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHRQCLG
        DSYHTLGLVY+AIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLF+RASLYLERGDC+KAAETYDQIH+  + 
Subjt:  DSYHTLGLVYSAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHRQCLG

Query:  NVEALMTGAKLYQKCGHLERAICILEDYIKEHPAEADLDVVDLLASIYMGSKEFSKALECIEHAGEVYCAGNELPLNLTTKAGICLVHLGNMEKAECLFA
        NVEALMTGAKLYQKCGHLERAICILE+YIK HP EADLDVVDLLAS+YMGSKEFSKALECIEHA  VYCA NELPLNLT KAGIC +HLGN EKAECLFA
Subjt:  NVEALMTGAKLYQKCGHLERAICILEDYIKEHPAEADLDVVDLLASIYMGSKEFSKALECIEHAGEVYCAGNELPLNLTTKAGICLVHLGNMEKAECLFA

Query:  NLGRETAYDHSNLMIEVADSLLSLKHYNVALKYYLMFEEVNAGGNMGILYLKIAQCYLSINERAQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEA
        NL RE   + SNLMIEVADSLLSLKHYN+ALKYYLM EEVNAGGN+GILYLKIAQCY S NERA+AIVFFYKVLQHLEDNINARLTLASLLLEEAR+EEA
Subjt:  NLGRETAYDHSNLMIEVADSLLSLKHYNVALKYYLMFEEVNAGGNMGILYLKIAQCYLSINERAQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEA

Query:  ISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQEKVKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRP
        ISLLSPPKDSN TSSSSSK KPWWLNE+VKLKLCHI++T+GM+ENFVE IFPLVRESLYIETL EK+KVNKKKLP+RVLLERVKVLDGR+TG LFRGFRP
Subjt:  ISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQEKVKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRP

Query:  VAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNS
        VAPKSDLSKASRAK+LLQKRERI+EEKKA+ALAAGV++NYDD DDEP LR+ RESPLPNLLK+EEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNS
Subjt:  VAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNS

Query:  LSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYL
        LSIERKEELQLLGAQLAFSSTDT HGFNFAKHVVKQYPYS SAWNCYYKV+S +TNRDSRHCKLLNSMQ KYKDCAPPYIIAGHQFT ISHHQDAARKYL
Subjt:  LSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYL

Query:  EAYKIMPDSPLVNLCVGTSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNI
        EAYK++PDSPL+NLCVGT+L+NL LGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN++ 
Subjt:  EAYKIMPDSPLVNLCVGTSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNI

Query:  KHQKSVYCDLRREAAYNLHLIYKESGAFDLARQVLKDNCTF
        KHQ SVYCDLRREAAYNLHLIYKESGA DLARQVLKDNCTF
Subjt:  KHQKSVYCDLRREAAYNLHLIYKESGAFDLARQVLKDNCTF

XP_038879313.1 general transcription factor 3C polypeptide 3 [Benincasa hispida]0.0e+0095.2Show/hide
Query:  REGNKISDNEEVPGGAMCVLGAEKEAVETEVEDREEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAE
        +EGN +SDNEEVPGG M VLGA KE VET VEDR EEEEEEEEEGEEEVEDEGEDDIEEEDGY FKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAE
Subjt:  REGNKISDNEEVPGGAMCVLGAEKEAVETEVEDREEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAE

Query:  KKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPDSY
        KKRKALANGQSERAAKRGRVEDI GASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKA+SILRQVVLQAPDLPDSY
Subjt:  KKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPDSY

Query:  HTLGLVYSAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHRQCLGNVE
        HTLGLVY+AIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDI+LLFHRASLYLERGDCEKAAETYDQIH+QCLGNVE
Subjt:  HTLGLVYSAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHRQCLGNVE

Query:  ALMTGAKLYQKCGHLERAICILEDYIKEHPAEADLDVVDLLASIYMGSKEFSKALECIEHAGEVYCAGNELPLNLTTKAGICLVHLGNMEKAECLFANLG
        ALMTGAKLYQKCGHLERAICILEDYIKEHP EADLDVVDLLAS+YMG+KEF KALE IEHA EVYCAG+ELPL LTTK GIC  HLGNMEKAECLFANLG
Subjt:  ALMTGAKLYQKCGHLERAICILEDYIKEHPAEADLDVVDLLASIYMGSKEFSKALECIEHAGEVYCAGNELPLNLTTKAGICLVHLGNMEKAECLFANLG

Query:  RETAYDHSNLMIEVADSLLSLKHYNVALKYYLMFEEVNAGGNMGILYLKIAQCYLSINERAQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEAISL
         ETA DHSNLMIEVADSLLSLKHYN+ALKYYLMFEEVNAGGNMGILYLKIAQCYLS NERAQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEAISL
Subjt:  RETAYDHSNLMIEVADSLLSLKHYNVALKYYLMFEEVNAGGNMGILYLKIAQCYLSINERAQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEAISL

Query:  LSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQEKVKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPVAP
        LSPPKDSNPTSSSSSKLKPWW NEKVKLKLCHIYKTRGM+E+FVEVIFPLVRESLYIETLQEK+KVNKKKLP+RVLLERVKVLDGRE+GNLFRGFRPVAP
Subjt:  LSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQEKVKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPVAP

Query:  KSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNSLSI
        KSDLSKASRAKRLLQKRERIKEEKKA+ALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLTLKLA NSLSI
Subjt:  KSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNSLSI

Query:  ERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAY
        ERKEELQLLGAQLAFSSTDT HGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFT ISHHQDAARKYLEAY
Subjt:  ERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAY

Query:  KIMPDSPLVNLCVGTSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNIKHQ
        KIMPDSPL+NLCVG+SL+NLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNIKHQ
Subjt:  KIMPDSPLVNLCVGTSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNIKHQ

Query:  KSVYCDLRREAAYNLHLIYKESGAFDLARQVLKDNCTF
        KSVYCDLRREAAYNLHLIYKESGA DLARQVLKD+CTF
Subjt:  KSVYCDLRREAAYNLHLIYKESGAFDLARQVLKDNCTF

TrEMBL top hitse value%identityAlignment
A0A0A0LGB1 Uncharacterized protein0.0e+0090.85Show/hide
Query:  REGNKISDNEEVPGGAMCVLGAEKEAVETEVEDR--EEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEAL
        +EG++ISD EEVPG  M VLG EKE VET VEDR  EEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFS+QPYKKFERLEYEAL
Subjt:  REGNKISDNEEVPGGAMCVLGAEKEAVETEVEDR--EEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEAL

Query:  AEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPD
        AEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQG+HEKAIS+LRQVVL+APDLPD
Subjt:  AEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPD

Query:  SYHTLGLVYSAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHRQCLGN
        SYHTLGLVY+AIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIH+QCLGN
Subjt:  SYHTLGLVYSAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHRQCLGN

Query:  VEALMTGAKLYQKCGHLERAICILEDYIKEHPAEADLDVVDLLASIYMGSKEFSKALECIEHAGEVYCAGNELPLNLTTKAGICLVHLGNMEKAECLFAN
        VEALMTGAKLYQKCGHLERAICILEDYIK HP+EADLDVVDLLAS+YMGSKEFSKALE IEHA  VYCAGNELPLNLTTKAGIC  HLG++EKAECLFAN
Subjt:  VEALMTGAKLYQKCGHLERAICILEDYIKEHPAEADLDVVDLLASIYMGSKEFSKALECIEHAGEVYCAGNELPLNLTTKAGICLVHLGNMEKAECLFAN

Query:  LGRETAYDHSNLMIEVADSLLSLKHYNVALKYYLMFEEVNAGGNMGILYLKIAQCYLSINERAQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEAI
        L RET YDHSNLMIEVADSL+SLKHY+ ALKYYLM EEVNAG NMGILYLKIA+CYLS NER QAIVFFYKVLQH+EDNINARLTLASLLLEEARD+EAI
Subjt:  LGRETAYDHSNLMIEVADSLLSLKHYNVALKYYLMFEEVNAGGNMGILYLKIAQCYLSINERAQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEAI

Query:  SLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQEKVKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPV
        SLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIY+TRG++ENFVEVIFPLVRESLYIETLQEK+KVNKKKLPRRVLLERVKVLDGRETGNLFRGF+PV
Subjt:  SLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQEKVKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPV

Query:  APKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNSL
        APKSDL+KASRAKRLLQKRERIKEEKKA+ALAAGV+++YDDLDDEPALRMHRESPLPNLLKEEEYH LIVDLCKALASLGRCSEALEIISLTL       
Subjt:  APKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNSL

Query:  SIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLE
                         +ST TMHGFNFAKHVVKQYPYSISAWNCYYKVAS LTNRDSRHCKLLNSMQ+KYKDCAPPYIIAGHQFTTISHHQDAARKYLE
Subjt:  SIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLE

Query:  AYKIMPDSPLVNLCVGTSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNIK
        AYKIMPDSPL+NLCVG+SL+NLALGFRLQNKHQCVAQGLAFLYKNLKLCDN+QEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNIK
Subjt:  AYKIMPDSPLVNLCVGTSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNIK

Query:  HQKSVYCDLRREAAYNLHLIYKESGAFDLARQVLKDNCTF
        HQ SVYCDLRREAAYNLHLIYKESGA DLARQVLKD+CTF
Subjt:  HQKSVYCDLRREAAYNLHLIYKESGAFDLARQVLKDNCTF

A0A1S3BHB9 general transcription factor 3C polypeptide 3 isoform X10.0e+0093.18Show/hide
Query:  REGNKISDNEEVPGGAMCVLGAEKEAVETEVEDREEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAE
        +EGN+ISD+EEVPG  M VLG EKE VET V DR  EEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFS+QPYKKFERLEYEALAE
Subjt:  REGNKISDNEEVPGGAMCVLGAEKEAVETEVEDREEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAE

Query:  KKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPDSY
        KKRKALANGQSERAAKRGRVED++GASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAIS+LRQVVLQAPDLPDSY
Subjt:  KKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPDSY

Query:  HTLGLVYSAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHRQCLGNVE
        HTLGLVY+AIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSI+RGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIH+QCLGNVE
Subjt:  HTLGLVYSAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHRQCLGNVE

Query:  ALMTGAKLYQKCGHLERAICILEDYIKEHPAEADLDVVDLLASIYMGSKEFSKALECIEHAGEVYCAGNELPLNLTTKAGICLVHLGNMEKAECLFANLG
        ALMTGAKLYQKCGHLERAICILEDYIKEHP+EADLDVVDLLAS+YMGSKEFSKALE IEHA  VYCAGNELPLNLT KAGIC  HLGN+EKAECLFANL 
Subjt:  ALMTGAKLYQKCGHLERAICILEDYIKEHPAEADLDVVDLLASIYMGSKEFSKALECIEHAGEVYCAGNELPLNLTTKAGICLVHLGNMEKAECLFANLG

Query:  RETAYDHSNLMIEVADSLLSLKHYNVALKYYLMFEEVNAGGNMGILYLKIAQCYLSINERAQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEAISL
        RET YDHSNLMIEVADSLLSLKHY+ ALKYYLM EEVNAG NMGILY K+A+CYLS NE+ QAIVFFYKVLQH+EDNINARLTLASLLLEEARDEEAISL
Subjt:  RETAYDHSNLMIEVADSLLSLKHYNVALKYYLMFEEVNAGGNMGILYLKIAQCYLSINERAQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEAISL

Query:  LSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQEKVKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPVAP
        LSPPKDSNPTSSSS KLKPWWLNEKVKLKLCHIY+TRG++ENFVEVIFPLVRESLYIETLQEK+KVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPVAP
Subjt:  LSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQEKVKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPVAP

Query:  KSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNSLSI
        KSDL+KASRAKRLLQKR+RIKEEKKA+ LAAGV+V+YDDLDDEPALRMHRESPLPNLLKEEE+H LIVDLCKALASLGRCSEALEIISLTLKLAFNSLS 
Subjt:  KSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNSLSI

Query:  ERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAY
        ERKEELQLLGAQLAFSST TMHGFNFAKHVVKQYPYSISAWNCYYKVAS LTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAY
Subjt:  ERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAY

Query:  KIMPDSPLVNLCVGTSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNIKHQ
        KIMPDSPL+NLCVG+SL+NLALGFRLQNKHQCVAQGLAFLYKNLKLCDN+QEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNIKHQ
Subjt:  KIMPDSPLVNLCVGTSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNIKHQ

Query:  KSVYCDLRREAAYNLHLIYKESGAFDLARQVLKDNCTF
         SVYCDLRREAAYNLHLIYKESGA DLARQVLKD+CTF
Subjt:  KSVYCDLRREAAYNLHLIYKESGAFDLARQVLKDNCTF

A0A6J1DEF9 general transcription factor 3C polypeptide 3 isoform X20.0e+0087.87Show/hide
Query:  REGNKISDNEEVPGGAMCVLGAEKEAVETEVEDR----------EEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKF
        +EGN+ISDN+EVPG A+ V G  K   ETEVE+R          EEEEEEEEEE EEEVEDEGEDDIEEEDGY FKFKAGENPFDFVEGTDFSIQPYKKF
Subjt:  REGNKISDNEEVPGGAMCVLGAEKEAVETEVEDR----------EEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKF

Query:  ERLEYEALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVV
        ERLEYEALAEKKRKALAN QSER  KRGR+EDI GASF+EI+EAMNYGSRRKLKEPK+RGRRKGSKKK+NR++TKLLGDATLCYAQGQ+EKAIS+LRQVV
Subjt:  ERLEYEALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVV

Query:  LQAPDLPDSYHTLGLVYSAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQ
        LQAPDLPDSYHTLGLVY+AIGDDVKAMGFYMLAAHLMP+DSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDC+KAAETYDQ
Subjt:  LQAPDLPDSYHTLGLVYSAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQ

Query:  IHRQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPAEADLDVVDLLASIYMGSKEFSKALECIEHAGEVYCAGNELPLNLTTKAGICLVHLGNME
        IH++C+GNVEALMTGAKLYQKCGH ERAICILEDYIK HP EADLDVVDLLAS+YMGSKEFSKALE IEHA +VYCAGNE+PLNL TKAGIC VHLGN+E
Subjt:  IHRQCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPAEADLDVVDLLASIYMGSKEFSKALECIEHAGEVYCAGNELPLNLTTKAGICLVHLGNME

Query:  KAECLFANLGRETAYDHSNLMIEVADSLLSLKHYNVALKYYLMFEEVNAGGNMGILYLKIAQCYLSINERAQAIVFFYKVLQHLEDNINARLTLASLLLE
        KAE LFANLGR+TA DHS+ +IE ADSLLSLKH+N+ALKYYLM EEVNAGG MGI+YLKIAQCYLS NERA+AIVFFYKVLQ LEDNINARLTLASLLLE
Subjt:  KAECLFANLGRETAYDHSNLMIEVADSLLSLKHYNVALKYYLMFEEVNAGGNMGILYLKIAQCYLSINERAQAIVFFYKVLQHLEDNINARLTLASLLLE

Query:  EARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQEKVKVNKKKLPRRVLLERVKVLDGRETGN
        EAR+EEAISLLSPPKDSN +SSSSSK KPWWLNEKVKLKLC+IY+T+GM+ENFVE IF LVRESLYIETL+EK+KVNKKKLPRRVLLERVKVLDGRETG+
Subjt:  EARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQEKVKVNKKKLPRRVLLERVKVLDGRETGN

Query:  LFRGFRPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLT
        LFRGFRPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGV+V+YDD+DDEPALR+HRESPLPNLLK+EEYHNLIVDLCKALASLGRCSEALEIISLT
Subjt:  LFRGFRPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLT

Query:  LKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQ
        LKLAFNSLS+ERKEELQLLGAQLAFSSTDT HGFNFAKHVVKQYPYS SAWNCYYKV+S +T+RDSRHCKLLNS+QAKYKDCAPP+IIAGHQF  ISHHQ
Subjt:  LKLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQ

Query:  DAARKYLEAYKIMPDSPLVNLCVGTSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPEL
        +AA+KYLEAYK++PDSPL+NLCVG +L+NLALG RLQNKHQCVAQGLAFLY NLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIP++
Subjt:  DAARKYLEAYKIMPDSPLVNLCVGTSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPEL

Query:  FGENRNIKHQKSVYCDLRREAAYNLHLIYKESGAFDLARQVLKDNCTF
        FGENRNIKH+KSVYCDLRREAAYNLHLIYK+SGA DLARQVLKD+CTF
Subjt:  FGENRNIKHQKSVYCDLRREAAYNLHLIYKESGAFDLARQVLKDNCTF

A0A6J1GA07 general transcription factor 3C polypeptide 30.0e+0089.04Show/hide
Query:  EGNKISDNEEVPGGAMCVLGAEKEAVETEVEDR---EEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEAL
        EGN ISDNEEVPG   C +   K  VETEVEDR   EEEEEEEEEE EEEVEDEGEDD EEEDGY FKFKAGENPFDFVEGTDFSIQPYKKFERLEYEAL
Subjt:  EGNKISDNEEVPGGAMCVLGAEKEAVETEVEDR---EEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEAL

Query:  AEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPD
        AEKKRKALA+GQSER+AKRGRVEDISGASF+EI+EAMNYGSRRK ++PKKRGRRKGSK+KLN DVTKLLGDATLCYAQGQ+EKAIS+L QVVLQAPD+PD
Subjt:  AEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPD

Query:  SYHTLGLVYSAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHRQCLGN
        SYHTLGLVY+AIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLF+RASLYLERGDC+KAAETYDQIH+  + N
Subjt:  SYHTLGLVYSAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHRQCLGN

Query:  VEALMTGAKLYQKCGHLERAICILEDYIKEHPAEADLDVVDLLASIYMGSKEFSKALECIEHAGEVYCAGNELPLNLTTKAGICLVHLGNMEKAECLFAN
        VEALMTGAKLYQKCGHLERAICILE+YIK HP EADLDVVDLLAS+YMGSKEFSKALECIEHA  VYCA NELPLNLT KAGIC +HLGN EKAECLFAN
Subjt:  VEALMTGAKLYQKCGHLERAICILEDYIKEHPAEADLDVVDLLASIYMGSKEFSKALECIEHAGEVYCAGNELPLNLTTKAGICLVHLGNMEKAECLFAN

Query:  LGRETAYDHSNLMIEVADSLLSLKHYNVALKYYLMFEEVNAGGNMGILYLKIAQCYLSINERAQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEAI
        L RE   + SNLMIEVADSLLSLKHYN+ALKYYLM EEVNAGGN+GILYLKIAQCY S NERA+AIVFFYKVLQHLEDNINARLTLASLLLEEAR+EEAI
Subjt:  LGRETAYDHSNLMIEVADSLLSLKHYNVALKYYLMFEEVNAGGNMGILYLKIAQCYLSINERAQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEAI

Query:  SLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQEKVKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPV
        SLLSPPKDSN TSSSSSK KPWWLNE+VKLKLCHI++T+GM+ENFVE IFPLVRESLYIETL EK+KVNKKKLP+RVLLERVKVLDGR+TG LFRGFRPV
Subjt:  SLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQEKVKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPV

Query:  APKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNSL
        APKSDLSKASRAK+LLQKRERI+EEKKA+ALAAGV++NYDD DDEP LR+ RESPLPNLLK+EEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNSL
Subjt:  APKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNSL

Query:  SIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLE
        SIERKEELQLLGAQLAFSSTDT HGFNFAKHVVKQYPYS SAWNCYYKV+S +TNRDSRHCKLLNSMQ KYKDCAPPYIIAGHQFT ISHHQDAARKYLE
Subjt:  SIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLE

Query:  AYKIMPDSPLVNLCVGTSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNIK
        AYK++PDSPL+NLCVGT+L+NL LGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN++ K
Subjt:  AYKIMPDSPLVNLCVGTSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNIK

Query:  HQKSVYCDLRREAAYNLHLIYKESGAFDLARQVLKDNCTF
        HQ SVYCDLRREAAYNLHLIYKESGA DLARQVLKDNCTF
Subjt:  HQKSVYCDLRREAAYNLHLIYKESGAFDLARQVLKDNCTF

A0A6J1KFI7 general transcription factor 3C polypeptide 30.0e+0089.05Show/hide
Query:  EGNKISDNEEVPGGAMCVLGAEKEAVETEVEDR----EEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA
        EGN ISDNEEVPG   C +   K  VETEVEDR    EEEEEEEEEE EEEVEDEGEDD EEEDGY FKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA
Subjt:  EGNKISDNEEVPGGAMCVLGAEKEAVETEVEDR----EEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEA

Query:  LAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLP
        LAEKKRKALANGQSER+AKRGRVEDISGASF+EI+EAMNYGSRRK ++PKKRGRRKGSK+KLN DVTKLLGDATLCYAQGQ+EKAIS+L QVVLQAPD+P
Subjt:  LAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLP

Query:  DSYHTLGLVYSAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHRQCLG
        DSYHTLGLVY+AIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLF+RASLYLERGDC+KAAETYDQIH+  + 
Subjt:  DSYHTLGLVYSAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHRQCLG

Query:  NVEALMTGAKLYQKCGHLERAICILEDYIKEHPAEADLDVVDLLASIYMGSKEFSKALECIEHAGEVYCAGNELPLNLTTKAGICLVHLGNMEKAECLFA
        NVEALMTGAKLYQKCGHLERAICILE+YIK HP EADLDVVDLLAS+YMGSKEFSKALECIEHA  VYCA NELPLNLT KAGIC +HLGN EKAECLFA
Subjt:  NVEALMTGAKLYQKCGHLERAICILEDYIKEHPAEADLDVVDLLASIYMGSKEFSKALECIEHAGEVYCAGNELPLNLTTKAGICLVHLGNMEKAECLFA

Query:  NLGRETAYDHSNLMIEVADSLLSLKHYNVALKYYLMFEEVNAGGNMGILYLKIAQCYLSINERAQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEA
        NL RE   + SNLMIEVADSLLSLKHYN+ALKYYLM EEVNAGGN+GILYLKIAQCY S NERA+AIVFFYKVLQHLEDNINARLTLASLLLEEAR+EEA
Subjt:  NLGRETAYDHSNLMIEVADSLLSLKHYNVALKYYLMFEEVNAGGNMGILYLKIAQCYLSINERAQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEA

Query:  ISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQEKVKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRP
        ISLLSPPKDSN TSSSSSK KPWWLNE+VKLKLCHI++T+GM+ENFVE IFPLVRESLYIETL EK+KVNKKKLP+RVLLERVKVLDGR+TG LFRGFRP
Subjt:  ISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQEKVKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRP

Query:  VAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNS
        VAPKSDLSKASRAK+LLQKRERI+EEKKA+ALAAGV++NYDD DDEP LR+ RESPLPNLLK+EEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNS
Subjt:  VAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNS

Query:  LSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYL
        LSIERKEELQLLGAQLAFSSTDT HGFNFAKHVVKQYPYS SAWNCYYKV+S +TNRDSRHCKLLNSMQ KYKDCAPPYIIAGHQFT ISHHQDAARKYL
Subjt:  LSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYL

Query:  EAYKIMPDSPLVNLCVGTSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNI
        EAYK++PDSPL+NLCVGT+L+NL LGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGEN++ 
Subjt:  EAYKIMPDSPLVNLCVGTSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNI

Query:  KHQKSVYCDLRREAAYNLHLIYKESGAFDLARQVLKDNCTF
        KHQ SVYCDLRREAAYNLHLIYKESGA DLARQVLKDNCTF
Subjt:  KHQKSVYCDLRREAAYNLHLIYKESGAFDLARQVLKDNCTF

SwissProt top hitse value%identityAlignment
O74458 Transcription factor tau subunit sfc46.2e-2922.14Show/hide
Query:  EDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQ-GQHEKAISILRQVVLQAPDLPDSYHTLGLVYSAIGDD----VK
        +DI+   ++E L+A+  G R+  K  K RGR   +    + +V ++L  A   +AQ G  ++A  +  ++V    ++  ++  LG  +   G+      K
Subjt:  EDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQ-GQHEKAISILRQVVLQAPDLPDSYHTLGLVYSAIGDD----VK

Query:  AMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDIN----LLFHRASLYLERGDCEKAAETYDQIHRQCLGNVEALMTGAKLYQK
         +  +M AAHL PKD  LW      S      DQA YC ++A+ A+P + +     +++R+ L  E G  +KAAE +  + +    N   L   A++Y K
Subjt:  AMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDIN----LLFHRASLYLERGDCEKAAETYDQIHRQCLGNVEALMTGAKLYQK

Query:  CGHLERAIC----ILEDYIKEHPAEA------DLDVVDLLASIYMGSKEFSKALECI----------------------------EHAGEVYCAGNE---
          H  R I     I   Y  ++PA        DL  ++L A + +   ++S  +  I                            E   E   A  E   
Subjt:  CGHLERAIC----ILEDYIKEHPAEA------DLDVVDLLASIYMGSKEFSKALECI----------------------------EHAGEVYCAGNE---

Query:  -----LPLNLTTKAGICLVHLGNMEKAECLFA---NLGRETAYDHSNLMIEVADSLLSLKHYNVALKYYLMFEEVNAGGNMGILYLKIAQCYLSINERAQ
             LP    TK GI  +  G + +AE  F+   NL  + A+    ++ ++A + + ++  ++AL+Y+++        N+G+ Y  +  CYL + E   
Subjt:  -----LPLNLTTKAGICLVHLGNMEKAECLFA---NLGRETAYDHSNLMIEVADSLLSLKHYNVALKYYLMFEEVNAGGNMGILYLKIAQCYLSINERAQ

Query:  AIVFFYKVLQHLEDNINARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQE
        A      +L     N NA + LA +   +   + A+ +++                             +I++ R    N  E+     +   + + +  
Subjt:  AIVFFYKVLQHLEDNINARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQE

Query:  KVKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPVAPKSDLSKASRAKRLLQKRERIKE-------------------------EKKARALAAGVD--V
        ++ V  +K+P+    +R ++   +E    F  ++    +    K    ++ L+K E + E                         EKKARA A  +    
Subjt:  KVKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPVAPKSDLSKASRAKRLLQKRERIKE-------------------------EKKARALAAGVD--V

Query:  NYDDLDDEPALRMHR-----------ESPLPNLLKE--------EEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLAFS
         Y  L+D+    ++R           +  L  +L+         + ++ L V+    L  +G   +A ++++  +          +++ L+      +  
Subjt:  NYDDLDDEPALRMHR-----------ESPLPNLLKE--------EEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLAFS

Query:  STDTMHGFNFAKHVVKQYPYSISAWNCYYKVAS-----SLTNRDSRHCKLL------------NSM-------------QAKYKDCAPPYIIA--GHQFT
        + D        + V   + +    +  +  V S     S    DS + K L            NS+              A       P ++   GH   
Subjt:  STDTMHGFNFAKHVVKQYPYSISAWNCYYKVAS-----SLTNRDSRHCKLL------------NSM-------------QAKYKDCAPPYIIA--GHQFT

Query:  TISHHQDAARKYLEAYKIMPDSPLVNLCVGTSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDN-----SQEALYNIARAYHHIGLVTLAVTYYEKVLA
               A   Y  A+ I PD P+ NL +G + ++ A+     N+H  + QG  FLY+   L  N      QEALYN+ +AYH IGL   AV YYE VL 
Subjt:  TISHHQDAARKYLEAYKIMPDSPLVNLCVGTSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDN-----SQEALYNIARAYHHIGLVTLAVTYYEKVLA

Query:  TYQKDCPIPELFGENRNIKHQK-SVYCDLRREAAYNLHLIYKESGAFDLARQV
                P   G+         S   D   EAAYNL LIY  SG   LA Q+
Subjt:  TYQKDCPIPELFGENRNIKHQK-SVYCDLRREAAYNLHLIYKESGAFDLARQV

P33339 Transcription factor tau 131 kDa subunit7.9e-1619.79Show/hide
Query:  EEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMN
        ++E++ +  E E  D G+ + E+E+         ++     E  D  +    +    EY   +E+    LA             ED   A    I EA N
Subjt:  EEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLEYEALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMN

Query:  YGSRRKLKEPK-KRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTLGLVYSAIGDDVKAMGFYMLAAHLMPKDSSLWK
        +  ++K K  K K   R+  ++ L+ +V +LL  A   + +   + A  +  +V+ +      +Y TLG +Y   G        + LAAHL   D   WK
Subjt:  YGSRRKLKEPK-KRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTLGLVYSAIGDDVKAMGFYMLAAHLMPKDSSLWK

Query:  LLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHRQ-------------CLGNVEALMTGAKLYQKC--GHLERAIC
        ++   S D   + QA YC S+ I   P +   ++ R+ LY + G   +A + + +++                  + + +    +LY K    ++ER   
Subjt:  LLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHRQ-------------CLGNVEALMTGAKLYQKC--GHLERAIC

Query:  IL------------------EDYIKEHPAEADLD------------------------VVDLLASIYM--------GSKEFSKALECIEHA---------
        IL                  ED  ++ P E D D                         +++LA +++        G K   K    I+           
Subjt:  IL------------------EDYIKEHPAEADLD------------------------VVDLLASIYM--------GSKEFSKALECIEHA---------

Query:  ----------------GEVYCAGNE----LPLNLTTKAGICLVHLGNMEKAECLFANLGRETAYDHSNLMIEVADSLLSLKHYNVALKYY---LMFEEVN
                          +  A  E    +P+++  + G+  ++  N+ +A   F  L  ET  D ++L  E A +L   + Y  A+ ++   L  EE  
Subjt:  ----------------GEVYCAGNE----LPLNLTTKAGICLVHLGNMEKAECLFANLGRETAYDHSNLMIEVADSLLSLKHYNVALKYY---LMFEEVN

Query:  AGGNMGILYLKIAQCYLSINERAQAIVFFYKVLQHLEDNINARLTLASLL------------------LEEARDEEAISLLSPPKDSNPTSSSSSK--LK
               ++  +A+CY  I     A  F+   ++   D+++ R++LA +                   + + + +E +  +S  K SN TS  SSK  L+
Subjt:  AGGNMGILYLKIAQCYLSINERAQAIVFFYKVLQHLEDNINARLTLASLL------------------LEEARDEEAISLLSPPKDSNPTSSSSSK--LK

Query:  PWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQEKVKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPVAPKSDLSKASRAKRLLQKRE
                K K       R  IE    +   +V +   ++  +    +N+ K    + +  V      E  ++F   +    K   S++ +   +L++ +
Subjt:  PWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQEKVKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPVAPKSDLSKASRAKRLLQKRE

Query:  RIKEE-----KKARALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQL
        +   E     ++   LA G  V    L +E  + +   + L   L  E++  L ++L   +A      + L ++    ++       ER + ++ +   +
Subjt:  RIKEE-----KKARALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQL

Query:  AFSSTD----------TMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRD------------------SRHCKLLNSMQAKYKDCAPPYIIAGHQFTT
             D           ++ F F + V++ + YS+        + SS   +                   +    + N         + PY+   +    
Subjt:  AFSSTD----------TMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRD------------------SRHCKLLNSMQAKYKDCAPPYIIAGHQFTT

Query:  ISHHQDAARKYLEAYKIM-------PDSPLVNLCVGTSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDN------SQEALYNIARAYHHIGLVTLAVT
         S     +R +L A + +       PD P+VNL +G S ++ A+      +H  +  GL +LY+  K+  +       QEA YN+ RA+H IGLV++A+ 
Subjt:  ISHHQDAARKYLEAYKIM-------PDSPLVNLCVGTSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDN------SQEALYNIARAYHHIGLVTLAVT

Query:  YYEKVLATYQKDCPIPELFGENRNIKHQKSVYCDLRREAAYNLHLIYKESGAFDLARQVLK
        YY +VL  Y                         L++ AAYN  +IY++SG  +LA  +++
Subjt:  YYEKVLATYQKDCPIPELFGENRNIKHQKSVYCDLRREAAYNLHLIYKESGAFDLARQVLK

Q9Y5Q9 General transcription factor 3C polypeptide 34.4e-6724.67Show/hide
Query:  KEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTLGLVYSAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSID
        +E KK  + K  + KL R +  L+G+A + +A+G+ E+AI +  +++ QAP   + + TL ++Y   GD  K++ F ++AAHL P D+  W  L   S++
Subjt:  KEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTLGLVYSAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSID

Query:  RGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHRQCLGN-----VEALMTGAKLYQKCGHLERAICILEDYIKEHPAEADLDVV
        + +I QA +C +KA+K EP ++  L+ R+SLY + GD + A + Y +I      +     ++     AK Y +   +  AI I+++   +H     ++ V
Subjt:  RGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHRQCLGN-----VEALMTGAKLYQKCGHLERAICILEDYIKEHPAEADLDVV

Query:  DLLASIYMGSKEFSKALECI-EHAG--------------------EVYCA-GNELPLNLTTKAGICLVHLGNMEKAECLFANLGRETAYDHSNLMIEVAD
        ++ A +Y+ +K++ KALE I + +G                     V C   + +P+++T K  +CLVHL  +E    L   L  +   D  +L ++VA+
Subjt:  DLLASIYMGSKEFSKALECI-EHAG--------------------EVYCA-GNELPLNLTTKAGICLVHLGNMEKAECLFANLGRETAYDHSNLMIEVAD

Query:  SLLSLKHYNVALKYYLMFEEVNAGGNMGILYLKIAQCYLSINERAQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSSK
        + L +  YN AL   L     +   N+ +++L+ A+C  ++    +A   + KV+     +++AR++L++L  +  + E+A+  L P  D +  +  ++ 
Subjt:  SLLSLKHYNVALKYYLMFEEVNAGGNMGILYLKIAQCYLSINERAQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEAISLLSPPKDSNPTSSSSSK

Query:  LKPWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQEKVKVNKKKLPRRVLLERVKVLDGRETG--NLFRGFRPVAPKSDLSKASRAKRLL
         +      K+ L    +  ++G +  +V+ +  ++   L       KV +N+          +V ++   ++G  +L+           L K SR K + 
Subjt:  LKPWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQEKVKVNKKKLPRRVLLERVKVLDGRETG--NLFRGFRPVAPKSDLSKASRAKRLL

Query:  QKRERIKEEKKARALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLA
           ++      A+A+ A                      L ++L ++++ NL++    +L  L R  EA  ++  +L+        ++++EL+  G   A
Subjt:  QKRERIKEEKKARALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLA

Query:  FSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLVNLCVG
            +    +N+ + +V +       WN + +V  ++ ++D RH +    +  K  +     ++ GH        + A  +Y++A++  PD PL + C+G
Subjt:  FSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLVNLCVG

Query:  TSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNIKHQKSVYCDLRREAAYN
         + +++A    +  +H  + QG +FL + L L    QE+ YN+ R  H +GL+ LA+ YY+K L        +P L  E   +        DLRR+ AYN
Subjt:  TSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNIKHQKSVYCDLRREAAYN

Query:  LHLIYKESGAFDLARQVLKDNCT
        L LIY+ SG   +A+ +L   C+
Subjt:  LHLIYKESGAFDLARQVLKDNCT

Arabidopsis top hitse value%identityAlignment
AT1G17680.1 tetratricopeptide repeat (TPR)-containing protein2.7e-19242.72Show/hide
Query:  EGNKISDNEEVPGGAMC---VLGAEKEAVETEVEDREEEEEEE----EEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLE
        EGN IS+ EE P    C   VLG      +T  +D++   +EE    +++ + + +DEG++  EE+D     F+AG  P                FER E
Subjt:  EGNKISDNEEVPGGAMC---VLGAEKEAVETEVEDREEEEEEE----EEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLE

Query:  YEALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAP
        YEALAE+KRKALA+ Q   +        + G      +E M+ G RRK ++ KK+GRR GSKK++  D+ K   +A   +A G+  +A+ IL +V+ QAP
Subjt:  YEALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAP

Query:  DLPDSYHTLGLVYSAIG-DDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHR
            +Y+ L  V   +G  +  +     +AA++    S  WKLL+    ++ +I  A    SKAI+A+PDDI L +  A + L  G   +AAET++QI R
Subjt:  DLPDSYHTLGLVYSAIG-DDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHR

Query:  QCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPAEADLDVVDLLASIYMGSKEFSKALECIEHAGEVYCAGNELPLNLTTKAGICLVHLGNMEKAE
        +C   +EAL  G + + K G  ERA  ILED+IK H +E   DV+DLLAS++M      +AL+ I    ++Y  G EL  +L  +  IC VHL  ME+AE
Subjt:  QCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPAEADLDVVDLLASIYMGSKEFSKALECIEHAGEVYCAGNELPLNLTTKAGICLVHLGNMEKAE

Query:  CLFANLGRETAYDHSNLMIEVADSLLSLKHYNVALKYYL--MFEEVNAGGNMGILYLKIAQCYLSINERAQAIVFFYKVLQHLEDNINARLTLASLLLEE
         + + L +E   +H  L+  +AD L ++ +++ ALKYY+  + E VN     G L++KIA+CY+S+ ER QAIVF+YK L  L D ++ R+TLASLLLE+
Subjt:  CLFANLGRETAYDHSNLMIEVADSLLSLKHYNVALKYYL--MFEEVNAGGNMGILYLKIAQCYLSINERAQAIVFFYKVLQHLEDNINARLTLASLLLEE

Query:  ARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQEKVKVNKKKLPRRVLLERVKVLDGRETGNL
         + +EA+ +LSPP++ +P    ++KLK WW N K+++ LC IY + GM+E+F      LV + ++  T++ K K       R VL E  +    R   + 
Subjt:  ARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQEKVKVNKKKLPRRVLLERVKVLDGRETGNL

Query:  FRGFRPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLTL
                PK    K  + +  L +  RI+E        A +  + +D+  E             ++K+EEYH L VDLCKALASL R  EALEI++L  
Subjt:  FRGFRPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLTL

Query:  KLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQD
        +L    L +E K+ELQ LGA+++  + D    F+  + V++Q+PY ++AWNCYY V S L  R S   K ++ +++KY+DC PP +IAGH FT  S HQD
Subjt:  KLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQD

Query:  AARKYLEAYKIMPDSPLVNLCVGTSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELF
        AAR+YLEAYK+MP+SPL+NLCVG +L+NLALGFRL+N+H+C+AQG AFLY NL++C NSQEALYN+ARAY H+GLVTLA +YYEKVLA Y+KD  +P+L 
Subjt:  AARKYLEAYKIMPDSPLVNLCVGTSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELF

Query:  GENRNI-KHQKSVYCDLRREAAYNLHLIYKESGAFDLARQVLKDNCTF
         E+  + + +K V CDLR+EAA+NLHLIYK SGAFDLARQVLKD+CTF
Subjt:  GENRNI-KHQKSVYCDLRREAAYNLHLIYKESGAFDLARQVLKDNCTF

AT1G17680.2 tetratricopeptide repeat (TPR)-containing protein2.7e-19242.72Show/hide
Query:  EGNKISDNEEVPGGAMC---VLGAEKEAVETEVEDREEEEEEE----EEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLE
        EGN IS+ EE P    C   VLG      +T  +D++   +EE    +++ + + +DEG++  EE+D     F+AG  P                FER E
Subjt:  EGNKISDNEEVPGGAMC---VLGAEKEAVETEVEDREEEEEEE----EEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERLE

Query:  YEALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAP
        YEALAE+KRKALA+ Q   +        + G      +E M+ G RRK ++ KK+GRR GSKK++  D+ K   +A   +A G+  +A+ IL +V+ QAP
Subjt:  YEALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAP

Query:  DLPDSYHTLGLVYSAIG-DDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHR
            +Y+ L  V   +G  +  +     +AA++    S  WKLL+    ++ +I  A    SKAI+A+PDDI L +  A + L  G   +AAET++QI R
Subjt:  DLPDSYHTLGLVYSAIG-DDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHR

Query:  QCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPAEADLDVVDLLASIYMGSKEFSKALECIEHAGEVYCAGNELPLNLTTKAGICLVHLGNMEKAE
        +C   +EAL  G + + K G  ERA  ILED+IK H +E   DV+DLLAS++M      +AL+ I    ++Y  G EL  +L  +  IC VHL  ME+AE
Subjt:  QCLGNVEALMTGAKLYQKCGHLERAICILEDYIKEHPAEADLDVVDLLASIYMGSKEFSKALECIEHAGEVYCAGNELPLNLTTKAGICLVHLGNMEKAE

Query:  CLFANLGRETAYDHSNLMIEVADSLLSLKHYNVALKYYL--MFEEVNAGGNMGILYLKIAQCYLSINERAQAIVFFYKVLQHLEDNINARLTLASLLLEE
         + + L +E   +H  L+  +AD L ++ +++ ALKYY+  + E VN     G L++KIA+CY+S+ ER QAIVF+YK L  L D ++ R+TLASLLLE+
Subjt:  CLFANLGRETAYDHSNLMIEVADSLLSLKHYNVALKYYL--MFEEVNAGGNMGILYLKIAQCYLSINERAQAIVFFYKVLQHLEDNINARLTLASLLLEE

Query:  ARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQEKVKVNKKKLPRRVLLERVKVLDGRETGNL
         + +EA+ +LSPP++ +P    ++KLK WW N K+++ LC IY + GM+E+F      LV + ++  T++ K K       R VL E  +    R   + 
Subjt:  ARDEEAISLLSPPKDSNPTSSSSSKLKPWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQEKVKVNKKKLPRRVLLERVKVLDGRETGNL

Query:  FRGFRPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLTL
                PK    K  + +  L +  RI+E        A +  + +D+  E             ++K+EEYH L VDLCKALASL R  EALEI++L  
Subjt:  FRGFRPVAPKSDLSKASRAKRLLQKRERIKEEKKARALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLTL

Query:  KLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQD
        +L    L +E K+ELQ LGA+++  + D    F+  + V++Q+PY ++AWNCYY V S L  R S   K ++ +++KY+DC PP +IAGH FT  S HQD
Subjt:  KLAFNSLSIERKEELQLLGAQLAFSSTDTMHGFNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQD

Query:  AARKYLEAYKIMPDSPLVNLCVGTSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELF
        AAR+YLEAYK+MP+SPL+NLCVG +L+NLALGFRL+N+H+C+AQG AFLY NL++C NSQEALYN+ARAY H+GLVTLA +YYEKVLA Y+KD  +P+L 
Subjt:  AARKYLEAYKIMPDSPLVNLCVGTSLVNLALGFRLQNKHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELF

Query:  GENRNI-KHQKSVYCDLRREAAYNLHLIYKESGAFDLARQVLKDNCTF
         E+  + + +K V CDLR+EAA+NLHLIYK SGAFDLARQVLKD+CTF
Subjt:  GENRNI-KHQKSVYCDLRREAAYNLHLIYKESGAFDLARQVLKDNCTF

AT3G04240.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.7e-0523.53Show/hide
Query:  ATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTLGLVYSAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLF
        A L    G   +A+   ++ V   P  PD+Y  LG VY A+G   +A+  Y  A  + P  +  +  + S   ++G +D A     +A+  +P  +    
Subjt:  ATLCYAQGQHEKAISILRQVVLQAPDLPDSYHTLGLVYSAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLF

Query:  HRASLYLERGDCEKAAETYDQI------HRQCLGNV
        +  +   + G  ++A   Y+Q       H Q + N+
Subjt:  HRASLYLERGDCEKAAETYDQI------HRQCLGNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAGCGTTTCTTGTAAGCGGACTATAGCGGGCCCAAAAAGAAGTTCGTCAGAGATTATAGAGCTTCCCCTCGCCGCACTCTCCGACCGGGAAGCCGGAAAT
TGTATTGGACCTTTTGTTTTCTTTTGTTCTAAAGCCTATTTCACGTTCCAGCCATCCCTCCCTCCCATTTACCAGCCACTCCACCCCCCCGCACCGGACGCCGCC
GCTGCCGCCGCCTTCCTCACGCCACTCGCCATTCCTAAACCTTTTGAGTACGAAAAAATTGGATTAACATCTACAATTTCCCCTGCCCCTCCATCTCTTCGCCGC
TGCCTTCATCCATTGAGCTGCCGCCACCCACCGATCAGGGAAGGGAATAAAATTTCTGACAATGAAGAGGTTCCTGGTGGTGCTATGTGCGTTTTAGGAGCGGAA
AAAGAGGCTGTAGAAACAGAAGTGGAGGATAGAGAGGAGGAGGAGGAGGAGGAAGAGGAAGAAGGAGAGGAAGAAGTGGAGGATGAAGGAGAGGATGATATTGAA
GAAGAAGATGGTTACACATTCAAATTTAAGGCCGGAGAAAATCCATTTGATTTTGTTGAAGGGACAGATTTTAGCATCCAACCATATAAAAAATTTGAGCGCCTT
GAATATGAAGCTCTTGCTGAGAAAAAGAGAAAAGCTCTTGCAAATGGTCAGAGTGAGAGAGCTGCAAAGAGGGGCAGGGTAGAAGATATTTCTGGTGCAAGCTTT
GATGAAATTTTGGAAGCTATGAATTATGGATCTAGGAGGAAGTTAAAGGAGCCTAAAAAAAGAGGTAGACGGAAAGGATCAAAGAAAAAACTTAATCGTGATGTT
ACAAAATTGCTTGGTGATGCAACTTTATGTTATGCTCAAGGCCAGCATGAGAAGGCTATATCCATATTGCGTCAAGTTGTTCTGCAAGCCCCAGATTTACCTGAT
TCGTACCATACGCTTGGACTTGTTTACAGTGCAATTGGTGATGATGTGAAAGCCATGGGATTCTACATGCTTGCTGCACATTTAATGCCAAAAGATTCATCTCTC
TGGAAACTGCTATTTTCATGGTCAATTGACCGAGGTGATATTGATCAAGCAAGCTACTGTCTTTCTAAAGCAATCAAAGCAGAGCCTGATGATATTAATTTATTA
TTCCATCGTGCATCACTCTACCTTGAGCGTGGAGATTGTGAAAAAGCAGCTGAAACGTATGATCAGATTCATCGACAATGCCTTGGAAATGTTGAAGCACTCATG
ACAGGAGCAAAGCTGTACCAAAAATGCGGTCATCTTGAACGTGCAATTTGCATTCTTGAGGACTACATCAAAGAGCATCCAGCTGAAGCTGATTTAGATGTGGTT
GATCTTTTAGCTTCTATATACATGGGAAGTAAAGAATTCAGCAAAGCTCTTGAGTGCATTGAGCATGCAGGTGAGGTGTACTGTGCAGGAAATGAGCTACCTTTA
AACTTGACAACTAAAGCAGGAATCTGCCTAGTTCATCTTGGAAATATGGAGAAGGCAGAGTGCCTCTTTGCTAATTTGGGACGGGAAACTGCCTATGATCACTCA
AATTTGATGATTGAAGTTGCAGACTCATTGCTGAGTCTTAAGCATTATAACGTGGCATTGAAGTATTATCTGATGTTCGAAGAAGTAAATGCTGGAGGGAACATG
GGGATTTTATACCTTAAAATTGCCCAGTGTTACTTATCAATCAATGAAAGAGCACAGGCAATTGTTTTCTTTTATAAAGTGCTTCAACATCTTGAAGATAACATT
AATGCTCGATTAACTTTGGCCTCCCTCCTCCTTGAGGAAGCTAGAGATGAAGAAGCCATTTCATTACTATCTCCTCCAAAAGATTCAAACCCAACTAGCTCATCT
TCCAGCAAATTAAAACCTTGGTGGCTCAATGAGAAAGTAAAACTGAAGCTTTGCCACATATACAAAACTAGAGGAATGATTGAGAACTTCGTTGAGGTGATCTTT
CCTTTGGTCCGAGAGTCCTTATATATCGAGACTCTTCAAGAAAAGGTTAAAGTGAACAAGAAGAAGCTTCCAAGGAGGGTTTTGCTTGAAAGAGTCAAAGTACTA
GATGGACGTGAAACTGGTAACCTATTTCGTGGATTCAGACCTGTAGCTCCTAAATCAGATTTATCAAAAGCGTCTAGAGCAAAGAGATTGCTTCAAAAGAGGGAA
AGAATCAAGGAAGAAAAGAAGGCTAGAGCGCTGGCTGCTGGAGTCGATGTGAACTATGATGATTTAGATGATGAGCCAGCGCTACGGATGCACCGAGAATCCCCC
CTGCCTAATCTTCTGAAGGAAGAAGAATATCATAATCTTATTGTTGATTTGTGCAAGGCATTGGCTTCCTTGGGAAGATGTTCTGAAGCTTTAGAGATTATAAGT
CTAACTTTAAAGTTGGCTTTTAACTCATTATCCATAGAAAGGAAGGAAGAACTCCAGTTACTTGGAGCCCAATTAGCATTCAGCTCAACTGATACCATGCATGGT
TTCAACTTTGCAAAGCATGTTGTTAAGCAGTACCCTTACAGCATCTCTGCTTGGAACTGCTATTATAAAGTAGCTTCAAGTTTGACGAACCGGGATTCAAGGCAC
TGCAAGCTTCTGAATAGCATGCAAGCCAAATACAAAGATTGTGCACCACCCTATATCATTGCCGGGCATCAATTTACCACCATTAGCCATCATCAAGATGCTGCA
AGGAAATATCTTGAAGCTTATAAAATCATGCCGGATAGTCCCCTAGTTAACTTGTGTGTTGGAACTTCCTTGGTCAATTTGGCTCTTGGATTCCGTCTTCAAAAT
AAGCATCAGTGTGTTGCACAGGGCTTGGCATTCCTCTACAAAAATTTGAAGCTTTGTGATAATAGCCAGGAAGCCTTGTACAATATAGCTCGAGCATATCATCAC
ATTGGACTCGTGACACTGGCGGTTACATATTACGAAAAGGTGCTCGCAACGTACCAGAAGGACTGCCCCATTCCAGAACTTTTTGGTGAGAATCGAAACATTAAG
CATCAGAAATCAGTCTATTGTGACCTACGCAGAGAAGCAGCTTACAATTTGCACCTGATTTATAAAGAGAGTGGAGCTTTTGATCTTGCTAGGCAAGTCCTAAAA
GATAATTGCACATTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAAGCGTTTCTTGTAAGCGGACTATAGCGGGCCCAAAAAGAAGTTCGTCAGAGATTATAGAGCTTCCCCTCGCCGCACTCTCCGACCGGGAAGCCGGAAAT
TGTATTGGACCTTTTGTTTTCTTTTGTTCTAAAGCCTATTTCACGTTCCAGCCATCCCTCCCTCCCATTTACCAGCCACTCCACCCCCCCGCACCGGACGCCGCC
GCTGCCGCCGCCTTCCTCACGCCACTCGCCATTCCTAAACCTTTTGAGTACGAAAAAATTGGATTAACATCTACAATTTCCCCTGCCCCTCCATCTCTTCGCCGC
TGCCTTCATCCATTGAGCTGCCGCCACCCACCGATCAGGGAAGGGAATAAAATTTCTGACAATGAAGAGGTTCCTGGTGGTGCTATGTGCGTTTTAGGAGCGGAA
AAAGAGGCTGTAGAAACAGAAGTGGAGGATAGAGAGGAGGAGGAGGAGGAGGAAGAGGAAGAAGGAGAGGAAGAAGTGGAGGATGAAGGAGAGGATGATATTGAA
GAAGAAGATGGTTACACATTCAAATTTAAGGCCGGAGAAAATCCATTTGATTTTGTTGAAGGGACAGATTTTAGCATCCAACCATATAAAAAATTTGAGCGCCTT
GAATATGAAGCTCTTGCTGAGAAAAAGAGAAAAGCTCTTGCAAATGGTCAGAGTGAGAGAGCTGCAAAGAGGGGCAGGGTAGAAGATATTTCTGGTGCAAGCTTT
GATGAAATTTTGGAAGCTATGAATTATGGATCTAGGAGGAAGTTAAAGGAGCCTAAAAAAAGAGGTAGACGGAAAGGATCAAAGAAAAAACTTAATCGTGATGTT
ACAAAATTGCTTGGTGATGCAACTTTATGTTATGCTCAAGGCCAGCATGAGAAGGCTATATCCATATTGCGTCAAGTTGTTCTGCAAGCCCCAGATTTACCTGAT
TCGTACCATACGCTTGGACTTGTTTACAGTGCAATTGGTGATGATGTGAAAGCCATGGGATTCTACATGCTTGCTGCACATTTAATGCCAAAAGATTCATCTCTC
TGGAAACTGCTATTTTCATGGTCAATTGACCGAGGTGATATTGATCAAGCAAGCTACTGTCTTTCTAAAGCAATCAAAGCAGAGCCTGATGATATTAATTTATTA
TTCCATCGTGCATCACTCTACCTTGAGCGTGGAGATTGTGAAAAAGCAGCTGAAACGTATGATCAGATTCATCGACAATGCCTTGGAAATGTTGAAGCACTCATG
ACAGGAGCAAAGCTGTACCAAAAATGCGGTCATCTTGAACGTGCAATTTGCATTCTTGAGGACTACATCAAAGAGCATCCAGCTGAAGCTGATTTAGATGTGGTT
GATCTTTTAGCTTCTATATACATGGGAAGTAAAGAATTCAGCAAAGCTCTTGAGTGCATTGAGCATGCAGGTGAGGTGTACTGTGCAGGAAATGAGCTACCTTTA
AACTTGACAACTAAAGCAGGAATCTGCCTAGTTCATCTTGGAAATATGGAGAAGGCAGAGTGCCTCTTTGCTAATTTGGGACGGGAAACTGCCTATGATCACTCA
AATTTGATGATTGAAGTTGCAGACTCATTGCTGAGTCTTAAGCATTATAACGTGGCATTGAAGTATTATCTGATGTTCGAAGAAGTAAATGCTGGAGGGAACATG
GGGATTTTATACCTTAAAATTGCCCAGTGTTACTTATCAATCAATGAAAGAGCACAGGCAATTGTTTTCTTTTATAAAGTGCTTCAACATCTTGAAGATAACATT
AATGCTCGATTAACTTTGGCCTCCCTCCTCCTTGAGGAAGCTAGAGATGAAGAAGCCATTTCATTACTATCTCCTCCAAAAGATTCAAACCCAACTAGCTCATCT
TCCAGCAAATTAAAACCTTGGTGGCTCAATGAGAAAGTAAAACTGAAGCTTTGCCACATATACAAAACTAGAGGAATGATTGAGAACTTCGTTGAGGTGATCTTT
CCTTTGGTCCGAGAGTCCTTATATATCGAGACTCTTCAAGAAAAGGTTAAAGTGAACAAGAAGAAGCTTCCAAGGAGGGTTTTGCTTGAAAGAGTCAAAGTACTA
GATGGACGTGAAACTGGTAACCTATTTCGTGGATTCAGACCTGTAGCTCCTAAATCAGATTTATCAAAAGCGTCTAGAGCAAAGAGATTGCTTCAAAAGAGGGAA
AGAATCAAGGAAGAAAAGAAGGCTAGAGCGCTGGCTGCTGGAGTCGATGTGAACTATGATGATTTAGATGATGAGCCAGCGCTACGGATGCACCGAGAATCCCCC
CTGCCTAATCTTCTGAAGGAAGAAGAATATCATAATCTTATTGTTGATTTGTGCAAGGCATTGGCTTCCTTGGGAAGATGTTCTGAAGCTTTAGAGATTATAAGT
CTAACTTTAAAGTTGGCTTTTAACTCATTATCCATAGAAAGGAAGGAAGAACTCCAGTTACTTGGAGCCCAATTAGCATTCAGCTCAACTGATACCATGCATGGT
TTCAACTTTGCAAAGCATGTTGTTAAGCAGTACCCTTACAGCATCTCTGCTTGGAACTGCTATTATAAAGTAGCTTCAAGTTTGACGAACCGGGATTCAAGGCAC
TGCAAGCTTCTGAATAGCATGCAAGCCAAATACAAAGATTGTGCACCACCCTATATCATTGCCGGGCATCAATTTACCACCATTAGCCATCATCAAGATGCTGCA
AGGAAATATCTTGAAGCTTATAAAATCATGCCGGATAGTCCCCTAGTTAACTTGTGTGTTGGAACTTCCTTGGTCAATTTGGCTCTTGGATTCCGTCTTCAAAAT
AAGCATCAGTGTGTTGCACAGGGCTTGGCATTCCTCTACAAAAATTTGAAGCTTTGTGATAATAGCCAGGAAGCCTTGTACAATATAGCTCGAGCATATCATCAC
ATTGGACTCGTGACACTGGCGGTTACATATTACGAAAAGGTGCTCGCAACGTACCAGAAGGACTGCCCCATTCCAGAACTTTTTGGTGAGAATCGAAACATTAAG
CATCAGAAATCAGTCTATTGTGACCTACGCAGAGAAGCAGCTTACAATTTGCACCTGATTTATAAAGAGAGTGGAGCTTTTGATCTTGCTAGGCAAGTCCTAAAA
GATAATTGCACATTTTAAGTACAAAGCACATTTTTTGCACACCAACTATATGTTAATTAGCAACAAGAAATAAAGTTATCTTATTTTGATAGACATTGTACATAG
GTTGAGGATGTAGAATATGTTAGTTGGAAGTGAAAGGATTCACTAGAACCCGATGGGTATTTTTATGACCGAGTTTATAAGGATTCAGTTTATTATGTATATATA
TCTATCAATCAATGATGTTACTCTG
Protein sequenceShow/hide protein sequence
MKSVSCKRTIAGPKRSSSEIIELPLAALSDREAGNCIGPFVFFCSKAYFTFQPSLPPIYQPLHPPAPDAAAAAAFLTPLAIPKPFEYEKIGLTSTISPAPPSLRR
CLHPLSCRHPPIREGNKISDNEEVPGGAMCVLGAEKEAVETEVEDREEEEEEEEEEGEEEVEDEGEDDIEEEDGYTFKFKAGENPFDFVEGTDFSIQPYKKFERL
EYEALAEKKRKALANGQSERAAKRGRVEDISGASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGQHEKAISILRQVVLQAPDLPD
SYHTLGLVYSAIGDDVKAMGFYMLAAHLMPKDSSLWKLLFSWSIDRGDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHRQCLGNVEALM
TGAKLYQKCGHLERAICILEDYIKEHPAEADLDVVDLLASIYMGSKEFSKALECIEHAGEVYCAGNELPLNLTTKAGICLVHLGNMEKAECLFANLGRETAYDHS
NLMIEVADSLLSLKHYNVALKYYLMFEEVNAGGNMGILYLKIAQCYLSINERAQAIVFFYKVLQHLEDNINARLTLASLLLEEARDEEAISLLSPPKDSNPTSSS
SSKLKPWWLNEKVKLKLCHIYKTRGMIENFVEVIFPLVRESLYIETLQEKVKVNKKKLPRRVLLERVKVLDGRETGNLFRGFRPVAPKSDLSKASRAKRLLQKRE
RIKEEKKARALAAGVDVNYDDLDDEPALRMHRESPLPNLLKEEEYHNLIVDLCKALASLGRCSEALEIISLTLKLAFNSLSIERKEELQLLGAQLAFSSTDTMHG
FNFAKHVVKQYPYSISAWNCYYKVASSLTNRDSRHCKLLNSMQAKYKDCAPPYIIAGHQFTTISHHQDAARKYLEAYKIMPDSPLVNLCVGTSLVNLALGFRLQN
KHQCVAQGLAFLYKNLKLCDNSQEALYNIARAYHHIGLVTLAVTYYEKVLATYQKDCPIPELFGENRNIKHQKSVYCDLRREAAYNLHLIYKESGAFDLARQVLK
DNCTF