; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi02G028270 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi02G028270
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptionpre-mRNA-processing protein 40C
Genome locationchr02:34482084..34493987
RNA-Seq ExpressionLsi02G028270
SyntenyLsi02G028270
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003712 - transcription coregulator activity (molecular function)
GO:0070063 - RNA polymerase binding (molecular function)
InterPro domainsIPR001202 - WW domain
IPR002713 - FF domain
IPR012870 - Protein of unknown function DUF1666
IPR036020 - WW domain superfamily
IPR036517 - FF domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK03994.1 pre-mRNA-processing protein 40C [Cucumis melo var. makuwa]1.2e-26385.49Show/hide
Query:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPATSPVLPSPSFHVHQLLPVTPMVPGPPGMSPSPPVVSTSPAALFPPNDSASTIPGPHMHATPNSINP
        MSSASTVSQSVSLPAPPTSNS ANGSSIPNLIP+TSPV P+PSFH+HQL PV PMVPGPPGMSPS P+VST PA LFPP DSASTIPGP+MHA  N I+P
Subjt:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPATSPVLPSPSFHVHQLLPVTPMVPGPPGMSPSPPVVSTSPAALFPPNDSASTIPGPHMHATPNSINP

Query:  SARPQICGSYPSLTPVVSPPNAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAIAVPPVHGSQLSGNSLIQTDS
        SARPQICGSYPSLTPVVSPP+A+WFQPPQLGAMPRPPF+PYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAI VP  HG+QL GNSLIQTDS
Subjt:  SARPQICGSYPSLTPVVSPPNAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAIAVPPVHGSQLSGNSLIQTDS

Query:  NHPQLGMPYSVVKLISLYNDSQKHAQGVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGE---LLLQ----QLSNLSGTDWVLVTM
        NHP+L              DSQKH Q VG SENISL KHSEDWTAHKTEAGIIYYYNALTGESTYEKP GF+GE   L+ Q     +SNLSGTDWVLVTM
Subjt:  NHPQLGMPYSVVKLISLYNDSQKHAQGVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGE---LLLQ----QLSNLSGTDWVLVTM

Query:  GDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNASTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGAPVAS
        GDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKE SAPLPNNNA TDLGTSS SINTPAINTGGREATPLRTVGI GSSSALDLIKKKLQDSG PVAS
Subjt:  GDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNASTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGAPVAS

Query:  SPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKG---
        SPISA TVAQSDVNL RDADATVKALQTENNKDKPKDAN DGN+SDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFK    
Subjt:  SPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKG---

Query:  ----------QNMTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNER
                     TRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGND RFEALDRKDRENLLNER
Subjt:  ----------QNMTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNER

XP_008462197.1 PREDICTED: pre-mRNA-processing protein 40C [Cucumis melo]1.2e-26385.49Show/hide
Query:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPATSPVLPSPSFHVHQLLPVTPMVPGPPGMSPSPPVVSTSPAALFPPNDSASTIPGPHMHATPNSINP
        MSSASTVSQSVSLPAPPTSNS ANGSSIPNLIP+TSPV P+PSFH+HQL PV PMVPGPPGMSPS P+VST PA LFPP DSASTIPGP+MHA  N I+P
Subjt:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPATSPVLPSPSFHVHQLLPVTPMVPGPPGMSPSPPVVSTSPAALFPPNDSASTIPGPHMHATPNSINP

Query:  SARPQICGSYPSLTPVVSPPNAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAIAVPPVHGSQLSGNSLIQTDS
        SARPQICGSYPSLTPVVSPP+A+WFQPPQLGAMPRPPF+PYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAI VP  HG+QL GNSLIQTDS
Subjt:  SARPQICGSYPSLTPVVSPPNAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAIAVPPVHGSQLSGNSLIQTDS

Query:  NHPQLGMPYSVVKLISLYNDSQKHAQGVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGE---LLLQ----QLSNLSGTDWVLVTM
        NHP+L              DSQKH Q VG SENISL KHSEDWTAHKTEAGIIYYYNALTGESTYEKP GF+GE   L+ Q     +SNLSGTDWVLVTM
Subjt:  NHPQLGMPYSVVKLISLYNDSQKHAQGVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGE---LLLQ----QLSNLSGTDWVLVTM

Query:  GDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNASTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGAPVAS
        GDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKE SAPLPNNNA TDLGTSS SINTPAINTGGREATPLRTVGI GSSSALDLIKKKLQDSG PVAS
Subjt:  GDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNASTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGAPVAS

Query:  SPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKG---
        SPISA TVAQSDVNL RDADATVKALQTENNKDKPKDAN DGN+SDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFK    
Subjt:  SPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKG---

Query:  ----------QNMTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNER
                     TRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGND RFEALDRKDRENLLNER
Subjt:  ----------QNMTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNER

XP_011659583.1 pre-mRNA-processing protein 40C [Cucumis sativus]3.1e-26485.84Show/hide
Query:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPATSPVLPSPSFHVHQLLPVTPMVPGPPGMSPSPPVVSTSPAALFPPNDSASTIPGPHMHATPNSINP
        MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIP+TSPV P+PSFH+HQL  V PMVPGPPGMSPS P+VST PA LFPP DSASTIPGP+MHA  N I+P
Subjt:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPATSPVLPSPSFHVHQLLPVTPMVPGPPGMSPSPPVVSTSPAALFPPNDSASTIPGPHMHATPNSINP

Query:  SARPQICGSYPSLTPVVSPPNAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAIAVPPVHGSQLSGNSLIQTDS
        SARPQICGSYPSLTPVVSPP+A+WFQPPQLGAMPRPPFLPYS SYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAS I+VP  HG+QL GN+LIQTDS
Subjt:  SARPQICGSYPSLTPVVSPPNAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAIAVPPVHGSQLSGNSLIQTDS

Query:  NHPQLGMPYSVVKLISLYNDSQKHAQGVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGE---LLLQ----QLSNLSGTDWVLVTM
        NHP+L              DS KHAQGVG SENISL KHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGF+GE   L+ Q     +SNLSGTDWVLVTM
Subjt:  NHPQLGMPYSVVKLISLYNDSQKHAQGVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGE---LLLQ----QLSNLSGTDWVLVTM

Query:  GDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNASTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGAPVAS
        GDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKE SAPLPNNNASTDLGTSS SINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSG PVAS
Subjt:  GDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNASTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGAPVAS

Query:  SPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKG---
        SPISAPTVAQSDVNL RDADATVKALQTE NKDKPKDAN DGN+SDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFK    
Subjt:  SPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKG---

Query:  ----------QNMTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNER
                     TRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGND RFEALDRKDRENLLNER
Subjt:  ----------QNMTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNER

XP_023547625.1 pre-mRNA-processing protein 40C [Cucurbita pepo subsp. pepo]3.0e-25481.93Show/hide
Query:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPATSPVLPSPSFHVHQLLPVTPMVPGPPGMSPSPPVVSTSPAALFPPNDS--ASTIPGPHMHATPNSI
        MSSASTVSQS+SLPAPPTSNSAANGSSIPNLIPATSPV P+ SFH+HQL P TPMVPGPPGMSPS PV       +FPP+DS  +STIPGP+MHA PNSI
Subjt:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPATSPVLPSPSFHVHQLLPVTPMVPGPPGMSPSPPVVSTSPAALFPPNDS--ASTIPGPHMHATPNSI

Query:  NPSARPQICGSYPSLTPVVSPPNAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAIAVPPVHGSQLSGNSLIQT
        N S RPQICGSYPSL PVVSPP+AIWFQPPQLG MPRPPFLPY ASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQV+SA AVP  HG+ L+GNSLIQT
Subjt:  NPSARPQICGSYPSLTPVVSPPNAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAIAVPPVHGSQLSGNSLIQT

Query:  DSNHPQLGMPYSVVKLISLYNDSQKHAQGVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGE---LLLQ----QLSNLSGTDWVLV
        D NHP+L              D+QKHAQG+G SE+ISL+KHSE+WTAHKTEAGI+YYYNALTGESTYEKPSGFKGE   L++Q     +SNLSGTDWVLV
Subjt:  DSNHPQLGMPYSVVKLISLYNDSQKHAQGVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGE---LLLQ----QLSNLSGTDWVLV

Query:  TMGDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNASTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGAPV
        TMGDGKKYYYNNKTKISSWQIPNEV+ELRQQNDEKTKEHS PLPNNNA T+ G+S IS+NTPAINTGGREA PLRTVG+SG SSALDLIKKKLQ+SG PV
Subjt:  TMGDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNASTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGAPV

Query:  ASSPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKGQ
        ASSPISAPT+AQSDVNL RDADA VKALQTEN+KDKPKDANGDGN+SDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFK  
Subjt:  ASSPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKGQ

Query:  N-------------MTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNER
                       TRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLL+ER
Subjt:  N-------------MTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNER

XP_038900162.1 pre-mRNA-processing protein 40C [Benincasa hispida]2.9e-27087.56Show/hide
Query:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPATSPVLPSPSFHVHQLLPVTPMVPGPPGMSPSPPVVSTSPAALFPPNDSASTIPGPHMHATPNSINP
        MSSASTVSQSVSLPAPPTSNS ANGSSIPNLIPA       PSFH HQLLP TPMVPGPPGMSPS PVVST+PAALFPPNDSASTIPGPHMHATPNSINP
Subjt:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPATSPVLPSPSFHVHQLLPVTPMVPGPPGMSPSPPVVSTSPAALFPPNDSASTIPGPHMHATPNSINP

Query:  SARPQICGSYPSLTPVVSPPNAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAIAVPPVHGSQLSGNSLIQTDS
        S RPQICGSYPSLTPVVSPP+AIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAIAV   HG+QLSGNSLIQTDS
Subjt:  SARPQICGSYPSLTPVVSPPNAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAIAVPPVHGSQLSGNSLIQTDS

Query:  NHPQLGMPYSVVKLISLYNDSQKHAQGVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGE---LLLQ----QLSNLSGTDWVLVTM
        NHPQL              DSQKHAQGVG SENI LTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGE   ++ Q     +SNLSGTDWVLVTM
Subjt:  NHPQLGMPYSVVKLISLYNDSQKHAQGVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGE---LLLQ----QLSNLSGTDWVLVTM

Query:  GDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNASTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGAPVAS
        GDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNA TDLGTSSISINTPAINTGGREATPLR VGISGSSSALDLIKKKLQDSG PVAS
Subjt:  GDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNASTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGAPVAS

Query:  SPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKG---
        SPISAPTVAQ DVNLLRDADATVKALQTENNKDKPKDA+GDGN+SDSSSDSEDVD+GPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFK    
Subjt:  SPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKG---

Query:  ----------QNMTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNER
                     TRAEEERKEKRAAQKAA+EGFKQLLD ASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNER
Subjt:  ----------QNMTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNER

TrEMBL top hitse value%identityAlignment
A0A0A0K978 Uncharacterized protein1.5e-26485.84Show/hide
Query:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPATSPVLPSPSFHVHQLLPVTPMVPGPPGMSPSPPVVSTSPAALFPPNDSASTIPGPHMHATPNSINP
        MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIP+TSPV P+PSFH+HQL  V PMVPGPPGMSPS P+VST PA LFPP DSASTIPGP+MHA  N I+P
Subjt:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPATSPVLPSPSFHVHQLLPVTPMVPGPPGMSPSPPVVSTSPAALFPPNDSASTIPGPHMHATPNSINP

Query:  SARPQICGSYPSLTPVVSPPNAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAIAVPPVHGSQLSGNSLIQTDS
        SARPQICGSYPSLTPVVSPP+A+WFQPPQLGAMPRPPFLPYS SYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAS I+VP  HG+QL GN+LIQTDS
Subjt:  SARPQICGSYPSLTPVVSPPNAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAIAVPPVHGSQLSGNSLIQTDS

Query:  NHPQLGMPYSVVKLISLYNDSQKHAQGVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGE---LLLQ----QLSNLSGTDWVLVTM
        NHP+L              DS KHAQGVG SENISL KHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGF+GE   L+ Q     +SNLSGTDWVLVTM
Subjt:  NHPQLGMPYSVVKLISLYNDSQKHAQGVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGE---LLLQ----QLSNLSGTDWVLVTM

Query:  GDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNASTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGAPVAS
        GDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKE SAPLPNNNASTDLGTSS SINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSG PVAS
Subjt:  GDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNASTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGAPVAS

Query:  SPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKG---
        SPISAPTVAQSDVNL RDADATVKALQTE NKDKPKDAN DGN+SDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFK    
Subjt:  SPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKG---

Query:  ----------QNMTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNER
                     TRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGND RFEALDRKDRENLLNER
Subjt:  ----------QNMTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNER

A0A1S3CHX0 pre-mRNA-processing protein 40C5.8e-26485.49Show/hide
Query:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPATSPVLPSPSFHVHQLLPVTPMVPGPPGMSPSPPVVSTSPAALFPPNDSASTIPGPHMHATPNSINP
        MSSASTVSQSVSLPAPPTSNS ANGSSIPNLIP+TSPV P+PSFH+HQL PV PMVPGPPGMSPS P+VST PA LFPP DSASTIPGP+MHA  N I+P
Subjt:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPATSPVLPSPSFHVHQLLPVTPMVPGPPGMSPSPPVVSTSPAALFPPNDSASTIPGPHMHATPNSINP

Query:  SARPQICGSYPSLTPVVSPPNAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAIAVPPVHGSQLSGNSLIQTDS
        SARPQICGSYPSLTPVVSPP+A+WFQPPQLGAMPRPPF+PYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAI VP  HG+QL GNSLIQTDS
Subjt:  SARPQICGSYPSLTPVVSPPNAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAIAVPPVHGSQLSGNSLIQTDS

Query:  NHPQLGMPYSVVKLISLYNDSQKHAQGVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGE---LLLQ----QLSNLSGTDWVLVTM
        NHP+L              DSQKH Q VG SENISL KHSEDWTAHKTEAGIIYYYNALTGESTYEKP GF+GE   L+ Q     +SNLSGTDWVLVTM
Subjt:  NHPQLGMPYSVVKLISLYNDSQKHAQGVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGE---LLLQ----QLSNLSGTDWVLVTM

Query:  GDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNASTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGAPVAS
        GDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKE SAPLPNNNA TDLGTSS SINTPAINTGGREATPLRTVGI GSSSALDLIKKKLQDSG PVAS
Subjt:  GDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNASTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGAPVAS

Query:  SPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKG---
        SPISA TVAQSDVNL RDADATVKALQTENNKDKPKDAN DGN+SDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFK    
Subjt:  SPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKG---

Query:  ----------QNMTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNER
                     TRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGND RFEALDRKDRENLLNER
Subjt:  ----------QNMTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNER

A0A5A7V0S2 Pre-mRNA-processing protein 40C5.8e-26485.49Show/hide
Query:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPATSPVLPSPSFHVHQLLPVTPMVPGPPGMSPSPPVVSTSPAALFPPNDSASTIPGPHMHATPNSINP
        MSSASTVSQSVSLPAPPTSNS ANGSSIPNLIP+TSPV P+PSFH+HQL PV PMVPGPPGMSPS P+VST PA LFPP DSASTIPGP+MHA  N I+P
Subjt:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPATSPVLPSPSFHVHQLLPVTPMVPGPPGMSPSPPVVSTSPAALFPPNDSASTIPGPHMHATPNSINP

Query:  SARPQICGSYPSLTPVVSPPNAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAIAVPPVHGSQLSGNSLIQTDS
        SARPQICGSYPSLTPVVSPP+A+WFQPPQLGAMPRPPF+PYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAI VP  HG+QL GNSLIQTDS
Subjt:  SARPQICGSYPSLTPVVSPPNAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAIAVPPVHGSQLSGNSLIQTDS

Query:  NHPQLGMPYSVVKLISLYNDSQKHAQGVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGE---LLLQ----QLSNLSGTDWVLVTM
        NHP+L              DSQKH Q VG SENISL KHSEDWTAHKTEAGIIYYYNALTGESTYEKP GF+GE   L+ Q     +SNLSGTDWVLVTM
Subjt:  NHPQLGMPYSVVKLISLYNDSQKHAQGVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGE---LLLQ----QLSNLSGTDWVLVTM

Query:  GDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNASTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGAPVAS
        GDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKE SAPLPNNNA TDLGTSS SINTPAINTGGREATPLRTVGI GSSSALDLIKKKLQDSG PVAS
Subjt:  GDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNASTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGAPVAS

Query:  SPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKG---
        SPISA TVAQSDVNL RDADATVKALQTENNKDKPKDAN DGN+SDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFK    
Subjt:  SPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKG---

Query:  ----------QNMTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNER
                     TRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGND RFEALDRKDRENLLNER
Subjt:  ----------QNMTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNER

A0A5D3BYD0 Pre-mRNA-processing protein 40C5.8e-26485.49Show/hide
Query:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPATSPVLPSPSFHVHQLLPVTPMVPGPPGMSPSPPVVSTSPAALFPPNDSASTIPGPHMHATPNSINP
        MSSASTVSQSVSLPAPPTSNS ANGSSIPNLIP+TSPV P+PSFH+HQL PV PMVPGPPGMSPS P+VST PA LFPP DSASTIPGP+MHA  N I+P
Subjt:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPATSPVLPSPSFHVHQLLPVTPMVPGPPGMSPSPPVVSTSPAALFPPNDSASTIPGPHMHATPNSINP

Query:  SARPQICGSYPSLTPVVSPPNAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAIAVPPVHGSQLSGNSLIQTDS
        SARPQICGSYPSLTPVVSPP+A+WFQPPQLGAMPRPPF+PYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAI VP  HG+QL GNSLIQTDS
Subjt:  SARPQICGSYPSLTPVVSPPNAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAIAVPPVHGSQLSGNSLIQTDS

Query:  NHPQLGMPYSVVKLISLYNDSQKHAQGVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGE---LLLQ----QLSNLSGTDWVLVTM
        NHP+L              DSQKH Q VG SENISL KHSEDWTAHKTEAGIIYYYNALTGESTYEKP GF+GE   L+ Q     +SNLSGTDWVLVTM
Subjt:  NHPQLGMPYSVVKLISLYNDSQKHAQGVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGE---LLLQ----QLSNLSGTDWVLVTM

Query:  GDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNASTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGAPVAS
        GDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKE SAPLPNNNA TDLGTSS SINTPAINTGGREATPLRTVGI GSSSALDLIKKKLQDSG PVAS
Subjt:  GDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNASTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGAPVAS

Query:  SPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKG---
        SPISA TVAQSDVNL RDADATVKALQTENNKDKPKDAN DGN+SDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFK    
Subjt:  SPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKG---

Query:  ----------QNMTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNER
                     TRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGND RFEALDRKDRENLLNER
Subjt:  ----------QNMTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNER

A0A6J1GNF1 pre-mRNA-processing protein 40C3.2e-25481.76Show/hide
Query:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPATSPVLPSPSFHVHQLLPVTPMVPGPPGMSPSPPVVSTSPAALFPPNDS--ASTIPGPHMHATPNSI
        MSSASTVSQS+SLPAPPTSNSAANGSSIPNLIPATSPV P+ SFH+HQL P TPMVPGPPGMSPS PV       +FPP+DS  +STIPGP+MHA PNSI
Subjt:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPATSPVLPSPSFHVHQLLPVTPMVPGPPGMSPSPPVVSTSPAALFPPNDS--ASTIPGPHMHATPNSI

Query:  NPSARPQICGSYPSLTPVVSPPNAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAIAVPPVHGSQLSGNSLIQT
        N S RPQICGSYPSL PVVSPP+AIWFQPPQLG MPRPPFLPY ASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQV+SA AVP  HG+ L+GNSLIQT
Subjt:  NPSARPQICGSYPSLTPVVSPPNAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAIAVPPVHGSQLSGNSLIQT

Query:  DSNHPQLGMPYSVVKLISLYNDSQKHAQGVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGE---LLLQ----QLSNLSGTDWVLV
        D NHP+L              D+QKHAQG+G SE+ISL+KHSE+WTAHKTEAGI+YYYNALTGESTYEKPSGFKGE   L++Q     +SNLSGTDWVLV
Subjt:  DSNHPQLGMPYSVVKLISLYNDSQKHAQGVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGE---LLLQ----QLSNLSGTDWVLV

Query:  TMGDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNASTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGAPV
        TMGDGKKYYYNNKTKISSWQIPNEV+ELRQQNDEKTKEHSAPLPNNNA T+ G+S IS+NTPAINTGGREA PLRTVG+SG SSALDLIKKKLQ+SG PV
Subjt:  TMGDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNASTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGAPV

Query:  ASSPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKGQ
        ASSPIS PT+AQSDVNL RDADA VKALQTEN+KDKPKDANGDGN+SDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFK  
Subjt:  ASSPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKGQ

Query:  N-------------MTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNER
                       TRAEEERKEKRAAQKAAIEGFKQLLD ASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLL+ER
Subjt:  N-------------MTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNER

SwissProt top hitse value%identityAlignment
B6EUA9 Pre-mRNA-processing protein 40A1.1e-1426.77Show/hide
Query:  PPNAIWFQPPQLG----------AMPRPPFLPYSASYHGPLPFPARGMPLPS-VPLPDPQPPGVTPVQV-----ASAIAVPPVHGSQLSGNSLIQTDSNH
        PPN +  QPPQ              P  P    S+S    +P+      L S    P P  P +T         +S     P    Q    SL+Q +S  
Subjt:  PPNAIWFQPPQLG----------AMPRPPFLPYSASYHGPLPFPARGMPLPS-VPLPDPQPPGVTPVQV-----ASAIAVPPVHGSQLSGNSLIQTDSNH

Query:  PQLGM-------PYSVVKLISLYNDSQKHAQ----GVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGELLLQQLSNLSGTDWVLV
           G+       P  V +  SL +  Q+  Q     V         + + DW  H +  G  YYYN  T +S +EKP     EL+       + T W   
Subjt:  PQLGM-------PYSVVKLISLYNDSQKHAQ----GVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGELLLQQLSNLSGTDWVLV

Query:  TMGDGKKYYYNNKTKISSWQIPNEVSELRQQ---NDEKT---KEHSAPLPNNNA-STDLGTSSISINTPAINTG--GREATPLRTVGISGSSSALDLIKK
        T  +GKKYYYN  TK S W IP ++   R+Q     EKT   +  S PL ++ A S+DL  S+++   P+ ++   G  ++P++  G++   +    +  
Subjt:  TMGDGKKYYYNNKTKISSWQIPNEVSELRQQ---NDEKT---KEHSAPLPNNNA-STDLGTSSISINTPAINTG--GREATPLRTVGISGSSSALDLIKK

Query:  KLQDSGAPVASSPISAPTVAQSDVNLLRDADATVKALQTENN--KDKPKDANGDGNLSDS--SSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKE
            SG   A S   A T+   +++  R AD +      +NN  ++K    NG  NLS +   ++ E+     T ++    FK +L+   V     W++ 
Subjt:  KLQDSGAPVASSPISAPTVAQSDVNLLRDADATVKALQTENN--KDKPKDANGDGNLSDS--SSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKE

Query:  LPKIVFDPRFKGQNM-------------TRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGNDPRFEALDR-KDRENLLN
        L +IV D R+                   R + E +E+R  QK A E F ++L+   E++  +  +      + ND RF+A+DR +DRE+L +
Subjt:  LPKIVFDPRFKGQNM-------------TRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGNDPRFEALDR-KDRENLLN

O14776 Transcription elongation regulator 12.4e-1725.32Show/hide
Query:  ASTVSQSVSLPAPPTSNSAANGSSIPNLIPATSPVLPSPSFHVHQLLPVTPMVPGPPGMSPSPPVVSTSPAALFPPNDSASTIPGPHMHATPNSINPSAR
        AST + S   PA  TS S++  SS  +     + V  + S       P T        +S + P VS S  A  P      T+P PH    P ++ P + 
Subjt:  ASTVSQSVSLPAPPTSNSAANGSSIPNLIPATSPVLPSPSFHVHQLLPVTPMVPGPPGMSPSPPVVSTSPAALFPPNDSASTIPGPHMHATPNSINPSAR

Query:  PQICGSYPSLTPVVSPPNAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAIAVPPVHGSQLSGNSLIQTDSNHP
        PQ   + P+  PV+ PP                        +  PLP    GMP+P         PGV  +Q+ S   V             + T     
Subjt:  PQICGSYPSLTPVVSPPNAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAIAVPPVHGSQLSGNSLIQTDSNHP

Query:  QLGMPYSVVKLISLYNDSQKHAQGVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGELLLQQ------------------------
          GM   +V +I       + A    P+     T  SE WT +KT  G  YYYN  T EST+EKP   K +  L++                        
Subjt:  QLGMPYSVVKLISLYNDSQKHAQGVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGELLLQQ------------------------

Query:  ------------------------------LSNLSGTDWVLVTMGDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNASTDLGTSSIS
                                       + + GT W +V  GD + ++YN  T++S W  P+++                          +G + + 
Subjt:  ------------------------------LSNLSGTDWVLVTMGDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNASTDLGTSSIS

Query:  --INTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGAPVASSPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSSDSEDV
          I  P    G  E   LR      + + L + K +        + S I        ++N     D  VKA + + + +K  D+  +  +      + + 
Subjt:  --INTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGAPVASSPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSSDSEDV

Query:  DSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKGQN------------MTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFK
           P  E  + QFK+ML ERGV+ FS W+KEL KIVFDPR+   N             TRAEEER+EK+     A E FK++++ A    +   ++  F 
Subjt:  DSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKGQN------------MTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFK

Query:  KKWGNDPRFEALDR-KDRENLLNE
         K   D RF+A+++ KDRE L NE
Subjt:  KKWGNDPRFEALDR-KDRENLLNE

Q3B807 Transcription elongation regulator 1-like protein7.6e-1130.35Show/hide
Query:  IKKKLQDSGAPVASSPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSSDSEDVDSGPTN---------EQLIIQFKEMLKERGVA
        +K+ ++D   P     + A     SD +   D     ++++T+ N+ +  +  G         D+  VD GP           E+ +  F++ML ERGV+
Subjt:  IKKKLQDSGAPVASSPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSSDSEDVDSGPTN---------EQLIIQFKEMLKERGVA

Query:  PFSKWDKELPKIVFDPRFKGQN------------MTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGNDPRFEALD-RKDRENLLN
         FS W+KEL KIVFDPR+   N             TR +EE KE+++    A E FK+LL+ +   +   T+++ F +K G D RF  +  RKD+E+  N
Subjt:  PFSKWDKELPKIVFDPRFKGQN------------MTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGNDPRFEALD-RKDRENLLN

Query:  E
        +
Subjt:  E

Q8CGF7 Transcription elongation regulator 13.8e-1824.96Show/hide
Query:  SSASTVSQSVSLPAPPTSNSA-ANGSSIPNLIPATSPVLPSPSFHVHQLL--PVTPMVPGPPGMSPSPPVVSTSPAALFPPNDSASTIPGPHMHATPNSI
        + A   +Q+V  P P TS+ A A  +S P   P+++    + +  V Q +  P T        +S + P VS S  A  P      T+P PH    P ++
Subjt:  SSASTVSQSVSLPAPPTSNSA-ANGSSIPNLIPATSPVLPSPSFHVHQLL--PVTPMVPGPPGMSPSPPVVSTSPAALFPPNDSASTIPGPHMHATPNSI

Query:  NPSARPQICGSYPSLTPVVSPPNAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAIAVPPVHGSQLSGNSLIQT
         P + PQ   + P+  PV+ PP                        +  PLP    GMP+P         PGV  +Q+ S   V             + T
Subjt:  NPSARPQICGSYPSLTPVVSPPNAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAIAVPPVHGSQLSGNSLIQT

Query:  DSNHPQLGMPYSVVKLISLYNDSQKHAQGVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGELLLQQ-------------------
               GM   +V +I       + A    P+     T  SE WT +KT  G  YYYN  T EST+EKP   K +  L +                   
Subjt:  DSNHPQLGMPYSVVKLISLYNDSQKHAQGVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGELLLQQ-------------------

Query:  -----------------------------------LSNLSGTDWVLVTMGDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNASTDLG
                                            + + GT W +V  GD + ++YN  T++S W  P+++                          +G
Subjt:  -----------------------------------LSNLSGTDWVLVTMGDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNASTDLG

Query:  TSSIS--INTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGAPVASSPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSS
         + +   I  P    G  +   LR      + + L + K +        + S I        ++N     D  +KA + + + +K  D+  +  +     
Subjt:  TSSIS--INTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGAPVASSPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSS

Query:  DSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKGQN------------MTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTS
         + +    P  E  + QFK+ML ERGV+ FS W+KEL KIVFDPR+   N             TRAEEER+EK+     A E FK++++ A    +   +
Subjt:  DSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKGQN------------MTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTS

Query:  YQTFKKKWGNDPRFEALDR-KDRENLLNE
        +  F  K   D RF+A+++ KDRE L NE
Subjt:  YQTFKKKWGNDPRFEALDR-KDRENLLNE

Q9LT25 Pre-mRNA-processing protein 40C2.0e-10444.43Show/hide
Query:  TMSSAST--VSQSVSLPAPPTSNSAANGSSIPNLIPATSPVLPSPSFHVHQLLPVTPMVPGPPGMSPSPPVVSTSPAALFPPNDSASTIPGPHMHATPNS
        +MS AST  VSQSV         + A  SS  N IP  SP+L +  F         P    PPG+  SPP         FP ++  ST P P M A P  
Subjt:  TMSSAST--VSQSVSLPAPPTSNSAANGSSIPNLIPATSPVLPSPSFHVHQLLPVTPMVPGPPGMSPSPPVVSTSPAALFPPNDSASTIPGPHMHATPNS

Query:  INPSARPQICGSYPSLTPVVSPPNAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPV-QVASAIAVPPVHGSQLSGNSLI
        +NP   P +   YP    +   P  +W QPP +G +PR PFL +  ++ G  PFP RG+  P++P     P G +P+  V +  A+P             
Subjt:  INPSARPQICGSYPSLTPVVSPPNAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPV-QVASAIAVPPVHGSQLSGNSLI

Query:  QTDSNHPQLGMPYSVVKLISLYNDSQKHAQGVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGE-------LLLQQLSNLSGTDWV
              P +       +L  +  D +  +Q VG           + WTAHK+EAG++YYYN++TG+STYEKP GF GE        +   + +L GTDW 
Subjt:  QTDSNHPQLGMPYSVVKLISLYNDSQKHAQGVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGE-------LLLQQLSNLSGTDWV

Query:  LVTMGDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNASTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGA
        LV+  DGKKYYYNNKTK+SSWQIP EV +  ++ +E+  E  A +P+ +  T+ G+   S++ PAI+ GGR+A  L+T      SSALDL+KKKL DSG 
Subjt:  LVTMGDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNASTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGA

Query:  PVASSPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFK
        PV+S+         S+ N  +  + T    ++ N+  K KDA G G LSDSSSDSED DSGP+ E+   QFKEMLKERG+APFSKW+KELPKI+FDPRFK
Subjt:  PVASSPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFK

Query:  G-------------QNMTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNER
                         TRAEEER+EKRAA KAAIEGF+QLLD AS DID  T Y+ FKKKWGND RFEA++RK+RE LLNER
Subjt:  G-------------QNMTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNER

Arabidopsis top hitse value%identityAlignment
AT1G73850.1 Protein of unknown function (DUF1666)1.5e-3030.87Show/hide
Query:  IPEEEEEEEGE---EGEEGEEVGEAEPEWRD-------VEAEGRQ---WWGGFGAVYDDYCKRMFFFDRMSLRSGPDSTSQRSASKKSASPLRCLSLKRI
        + EEEEEE G+   E        ++  EWR+            R+    W  +  V+  Y + M F  R+S +   ++ S +S   +  S    +  K  
Subjt:  IPEEEEEEEGE---EGEEGEEVGEAEPEWRD-------VEAEGRQ---WWGGFGAVYDDYCKRMFFFDRMSLRSGPDSTSQRSASKKSASPLRCLSLKRI

Query:  EEPEDEMEDVDPSLTLIDSNHHI--ETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLT--------AQLFQQFQVLLQRFIENEPFQQALRPT
             + +   P       N ++  E+AYVA ICL+WEAL   Y         + + STT  +          A  F+ F +LLQR++ENEP++   RP 
Subjt:  EEPEDEMEDVDPSLTLIDSNHHI--ETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLT--------AQLFQQFQVLLQRFIENEPFQQALRPT

Query:  LYARTRRTFPKMLHVPNIQASDPNGVQEQESD----FLILAPDLLLIIEASIFTFHRFLKMDK-KTSNSASLSFRNHTQ----DAALLARVRSSLDKKKT
        +YAR R   PK+L VP  Q  +    +E E++      I +   L+I+E  I TF  FL+ DK K       +F   ++    D  L+  ++    KKKT
Subjt:  LYARTRRTFPKMLHVPNIQASDPNGVQEQESD----FLILAPDLLLIIEASIFTFHRFLKMDK-KTSNSASLSFRNHTQ----DAALLARVRSSLDKKKT

Query:  KLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNG--KLWRDPSPLLFP
        KLKE+R+  +  ++K      E+M++L G++D+K++SR+++M+   +E L WCEEKM+K+ +  G   L RD +PL FP
Subjt:  KLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNG--KLWRDPSPLLFP

AT3G19840.1 pre-mRNA-processing protein 40C1.4e-10544.43Show/hide
Query:  TMSSAST--VSQSVSLPAPPTSNSAANGSSIPNLIPATSPVLPSPSFHVHQLLPVTPMVPGPPGMSPSPPVVSTSPAALFPPNDSASTIPGPHMHATPNS
        +MS AST  VSQSV         + A  SS  N IP  SP+L +  F         P    PPG+  SPP         FP ++  ST P P M A P  
Subjt:  TMSSAST--VSQSVSLPAPPTSNSAANGSSIPNLIPATSPVLPSPSFHVHQLLPVTPMVPGPPGMSPSPPVVSTSPAALFPPNDSASTIPGPHMHATPNS

Query:  INPSARPQICGSYPSLTPVVSPPNAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPV-QVASAIAVPPVHGSQLSGNSLI
        +NP   P +   YP    +   P  +W QPP +G +PR PFL +  ++ G  PFP RG+  P++P     P G +P+  V +  A+P             
Subjt:  INPSARPQICGSYPSLTPVVSPPNAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPV-QVASAIAVPPVHGSQLSGNSLI

Query:  QTDSNHPQLGMPYSVVKLISLYNDSQKHAQGVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGE-------LLLQQLSNLSGTDWV
              P +       +L  +  D +  +Q VG           + WTAHK+EAG++YYYN++TG+STYEKP GF GE        +   + +L GTDW 
Subjt:  QTDSNHPQLGMPYSVVKLISLYNDSQKHAQGVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGE-------LLLQQLSNLSGTDWV

Query:  LVTMGDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNASTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGA
        LV+  DGKKYYYNNKTK+SSWQIP EV +  ++ +E+  E  A +P+ +  T+ G+   S++ PAI+ GGR+A  L+T      SSALDL+KKKL DSG 
Subjt:  LVTMGDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNASTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGA

Query:  PVASSPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFK
        PV+S+         S+ N  +  + T    ++ N+  K KDA G G LSDSSSDSED DSGP+ E+   QFKEMLKERG+APFSKW+KELPKI+FDPRFK
Subjt:  PVASSPISAPTVAQSDVNLLRDADATVKALQTENNKDKPKDANGDGNLSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFK

Query:  G-------------QNMTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNER
                         TRAEEER+EKRAA KAAIEGF+QLLD AS DID  T Y+ FKKKWGND RFEA++RK+RE LLNER
Subjt:  G-------------QNMTRAEEERKEKRAAQKAAIEGFKQLLDSASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNER

AT3G20260.1 Protein of unknown function (DUF1666)1.1e-11055.98Show/hide
Query:  EAEEEEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEE-----GEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYDDYCKRMFFFDRMS--------
        E E++++DFI  EVKRRLKELRRNSFMVLIPEEEEEEE      E+ ++GE+  +   EWRDV AEG QWWGGF AVY+ YC+RM FFDR+S        
Subjt:  EAEEEEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEEE-----GEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYDDYCKRMFFFDRMS--------

Query:  --LRSGPDSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIDSNHHIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQ
          +   P + S RSASKK +SP RCLSLK+ + PE+++E + P+  + D    +ETAYVA +CL+WEALHCQYTQL+HLISCQP+  T  YN TAQLFQQ
Subjt:  --LRSGPDSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIDSNHHIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQ

Query:  FQVLLQRFIENEPFQQALRPTLYARTRRTFPKMLHVPNIQASDPNGVQEQESDFLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASLSF----RNHTQD
        F VLLQR+IENEPF+Q  R  LYAR R   PK+L  P IQ SD   + E+++ F++LA DL+ +IE+SI TF+ FLKMDKK  N     F     NH   
Subjt:  FQVLLQRFIENEPFQQALRPTLYARTRRTFPKMLHVPNIQASDPNGVQEQESDFLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASLSF----RNHTQD

Query:  AALLARVRSSLDKKKTKLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFP
           L  V+SS+DKK+ K KE+ KK++G ++K+WPQT+E +QLLF  +DIK+ +R+++MS+ +KEQLLWCEEKM KL+ S GKL R PSP+LFP
Subjt:  AALLARVRSSLDKKKTKLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFP

AT5G39785.1 Protein of unknown function (DUF1666)2.8e-2928.47Show/hide
Query:  QEEDGTDSKCIAEAEAEEEEEDF--------IMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYD---D
        ++ D + S   +E E EE+   F        ++E++K  +K+++    +  I EEEEE+     ++  ++ E    WR  E +  +     G V+     
Subjt:  QEEDGTDSKCIAEAEAEEEEEDF--------IMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYD---D

Query:  YCKRMFFFDRMSLR----------SGPDSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLI-DSNHHIETAYVAHICLSWEALHCQYTQLNHL
        Y +RM   D +S +            P   +    S  S +    +    I   + +  +++P +  + +    +E  YV  +CLSWE LH QY +   L
Subjt:  YCKRMFFFDRMSLR----------SGPDSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLI-DSNHHIETAYVAHICLSWEALHCQYTQLNHL

Query:  ISCQPQNSTTHYNLTAQLFQQFQVLLQRFIENEPFQQALRPTLYARTRRTFPKMLHVPNIQ----ASDPNGVQ---EQESDFLILAPDLLLIIEASIFTF
        +      S   YN  A  FQQFQVLLQRF+ENEPF++  R   Y + R     +L +P I+        NG +   E+ +D +I +  L+ I+E +I  F
Subjt:  ISCQPQNSTTHYNLTAQLFQQFQVLLQRFIENEPFQQALRPTLYARTRRTFPKMLHVPNIQ----ASDPNGVQ---EQESDFLILAPDLLLIIEASIFTF

Query:  HRFLKMDKKTSNSASLSFRNHTQ----------DAALLARVRSSLDKKKTKLKEVRKKS-----RGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRT
         RF++ DK TS+      R  +Q          D  + A V+S L  K+ +L++V K       R  K K    T + +   F  VD+K+++R++ MS+ 
Subjt:  HRFLKMDKKTSNSASLSFRNHTQ----------DAALLARVRSSLDKKKTKLKEVRKKS-----RGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRT

Query:  TKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFP
        T++ L+WC  K+ K++  N +L  DPS  LFP
Subjt:  TKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFP

AT5G39785.2 Protein of unknown function (DUF1666)2.7e-2728.18Show/hide
Query:  QEEDGTDSKCIAEAEAEEEEEDF--------IMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYD---D
        ++ D + S   +E E EE+   F        ++E++K  +K+++    +  I EEEEE+     ++  ++ E    WR  E +  +     G V+     
Subjt:  QEEDGTDSKCIAEAEAEEEEEDF--------IMEEVKRRLKELRRNSFMVLIPEEEEEEEGEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYD---D

Query:  YCKRMFFFDRMSLR----------SGPDSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLI-DSNHHIETAYVAHICLSWEALHCQYTQLNHL
        Y +RM   D +S +            P   +    S  S +    +    I   + +  +++P +  + +    +E  YV  +CLSWE LH QY +   L
Subjt:  YCKRMFFFDRMSLR----------SGPDSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLI-DSNHHIETAYVAHICLSWEALHCQYTQLNHL

Query:  ISCQPQNSTTHYNLTAQLFQQFQVLLQRFIENEPFQQALRPTLYARTRRTFPKMLHVPNIQ----ASDPNGVQ---EQESDFLILAPDLLLIIEASIFTF
        +      S   YN  A  FQQFQVLLQRF+ENEPF++  R   Y + R     +L +P I+        NG +   E+ +D +I +  L+ I+E +I  F
Subjt:  ISCQPQNSTTHYNLTAQLFQQFQVLLQRFIENEPFQQALRPTLYARTRRTFPKMLHVPNIQ----ASDPNGVQ---EQESDFLILAPDLLLIIEASIFTF

Query:  HRFLKMDKKTSNSASLSFRNHTQ----------DAALLARVRSSLDK-KKTKLKEVRKKS-----RGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSR
         RF++ DK TS+      R  +Q          D  + A V+S L    + +L++V K       R  K K    T + +   F  VD+K+++R++ MS+
Subjt:  HRFLKMDKKTSNSASLSFRNHTQ----------DAALLARVRSSLDK-KKTKLKEVRKKS-----RGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSR

Query:  TTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFP
         T++ L+WC  K+ K++  N +L  DPS  LFP
Subjt:  TTKEQLLWCEEKMNKLDVSNGKLWRDPSPLLFP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGTCAAACTTGTGTGGTACGGATTGCGTTTTGGTTTCTATGGGTGATGGTAAAAAGTTTTACTACAACAACAAGACGATGGTATGCTATTTTCCCCCGCCCCCCTT
CAAAATTCCCCCTGGATGGGGAACAGAGCTCAGATTCAGCTTTCTTATATGTTTGGAAAGAATTGACGATGCTAAGATGATTCCATTAAAAAACATAGAAATTGTGTCAA
TCCTGACCCTCACTCTGAATCAGGTAAGCAGTCGCCAAATCCCAAATGAAGATGAGAAAACAAAAGAACATTTAGCTCCTTTGCGAAATAATAAAGCATTCACTGATCTA
GGATCTTCTCCTATCAATATCAATACTCCTGCCATCAACACAGGTGGTTGTGAAGCCACGCCCCTCAGAATGGTAGGAATATCAGGGTCATCTTCTGCCCTGGATTTGAT
CAAGAAAAAATTACAAGACTCTGGAACTTCTGTAGCTTCCTCGCCTATTTCAGCTCCAGCAATAGCTCAATTAGATGTAAATCTACCGAGAGATGTTGATGTTGCAGTTA
AGGCACTGCAGCTAGAGAACAACAAGGATAAACCGAAAGATGCTAATGGTGATGGAAATGTATCCGACTCCTTCTTGGACTCTGAGGATGTAGAAAGTGGGCCAACTAAT
GAGCAATTAATTATCCAGTTTAAGGTATCATCTAACTTCCCTTTTATTGAGGATCAATATCATAATCTTTTTGATAGTATGATAATACTAGATGGAGTTTTTCTCCTTTT
CTTGTCGGAATTTTCTCTTGCTAGTATTCTGAATATCAAGTTCATTCTGCATTTGAGTCTTTCCTATGCATTGCAGGAGGAGGATGGCACAGATTCTAAGTGTATTGCAG
AAGCGGAAGCTGAAGAGGAGGAGGAGGATTTCATTATGGAGGAGGTAAAAAGGAGATTGAAAGAGCTGAGGAGGAACAGTTTCATGGTGTTGATTCCAGAGGAAGAAGAA
GAAGAAGAAGGAGAAGAAGGAGAAGAAGGAGAAGAAGTAGGCGAAGCCGAGCCCGAGTGGAGAGACGTGGAAGCAGAAGGCCGACAATGGTGGGGAGGGTTTGGTGCTGT
TTATGATGATTACTGCAAGAGGATGTTTTTCTTTGATCGGATGAGCCTTCGATCTGGTCCCGATTCAACCTCCCAAAGATCTGCATCGAAAAAGAGTGCATCTCCTCTTC
GATGTCTTTCTCTGAAGAGGATCGAAGAACCTGAAGATGAAATGGAAGATGTTGACCCATCATTGACTCTGATTGACTCCAATCACCACATAGAAACAGCCTATGTTGCT
CACATTTGCTTGTCCTGGGAGGCCCTTCACTGTCAGTACACTCAACTTAACCACTTAATATCATGCCAACCCCAAAACTCTACTACTCATTATAATCTTACTGCTCAGCT
CTTTCAGCAATTTCAAGTCCTCTTGCAAAGGTTTATTGAAAATGAACCCTTCCAACAAGCTCTCAGGCCTACACTTTATGCCCGAACCCGTCGAACTTTTCCTAAAATGT
TGCATGTTCCTAACATACAAGCTTCAGATCCAAACGGGGTGCAGGAACAAGAATCTGATTTCCTCATCCTCGCTCCTGATCTGCTGCTCATTATTGAGGCTTCAATCTTT
ACTTTCCACCGCTTCCTGAAGATGGACAAGAAAACCTCAAACTCTGCTTCTTTGTCGTTTCGGAACCACACACAGGATGCCGCTCTGCTTGCTCGTGTTCGGTCTTCTCT
CGACAAGAAGAAGACGAAGCTGAAAGAGGTTAGGAAGAAGAGTAGAGGGTGGAAACAGAAAACGTGGCCTCAAACGTATGAAGACATGCAATTACTTTTTGGAATCGTGG
ACATTAAAATCATATCAAGGCTTGTTAAGATGTCGAGGACTACTAAAGAACAGCTGCTCTGGTGCGAGGAGAAAATGAACAAGTTAGATGTGTCTAATGGAAAATTGTGG
AGAGATCCGTCTCCTCTTCTTTTCCCATATAAGTATGGTAATGGACCCTACTACTACTACTACGATGAATCTGAATTTTTGGGGTTTTCGAACTCCTTGTGTTTTAGTGT
TCCTTATGCGGGTCCTTTCAACTATCTAATCTTTCTGTTGCAAACTCACGTCACTATGCAAGAGATCAATGTCATCGTGAAATGCAGAGACTTGACTTCGCCAAAAAAAG
AAAAAAGGAAAAAAAAAAGAAAGAAAAACAGGCGTTGCAAAAACGCGAACAGGCGAGTGACGACCGGCGAACAGGCATTGCAAGAACGTGCGAAGATCGGCGGTAGATCG
TGGACTGCAAACACATGCTTCTTTCTGGGTTTTCTTCGTGGTTACCACACCTGTTACTGGTTAGCTTCGTTTCATTTCACCATGTCTTCAGCATCAACTGTCTCCCAATC
TGTATCACTTCCTGCTCCGCCTACTTCCAATTCTGCTGCTAATGGTTCTTCAATTCCCAATTTGATCCCTGCAACTTCACCAGTTCTTCCTTCCCCATCTTTCCATGTTC
ACCAACTACTACCTGTAACTCCGATGGTACCTGGTCCACCGGGAATGTCGCCGTCGCCGCCAGTTGTGTCGACAAGTCCGGCAGCTCTGTTTCCACCAAACGATTCTGCT
TCCACTATCCCGGGACCCCATATGCATGCAACTCCTAACTCAATTAATCCTTCTGCTCGTCCACAAATTTGTGGTTCATATCCTTCTCTAACTCCTGTTGTTTCTCCGCC
TAATGCGATCTGGTTTCAGCCTCCTCAGTTGGGAGCCATGCCCAGGCCTCCCTTTCTGCCATACTCTGCTTCTTATCATGGTCCTCTTCCTTTTCCTGCCCGTGGAATGC
CCCTTCCCTCTGTCCCATTGCCCGATCCTCAACCTCCCGGTGTTACCCCTGTTCAAGTTGCATCTGCGATTGCTGTGCCACCTGTTCATGGAAGTCAGCTGAGTGGCAAT
TCATTGATTCAGACAGACTCAAATCATCCTCAACTTGGTATGCCGTATTCTGTTGTTAAATTAATATCTTTATATAATGATAGCCAGAAACACGCTCAAGGTGTAGGCCC
ATCTGAGAACATCTCTTTAACTAAGCACTCGGAGGATTGGACTGCCCACAAGACTGAGGCAGGAATAATCTATTACTATAATGCCTTGACAGGAGAATCGACCTATGAAA
AACCTTCGGGTTTCAAAGGGGAGTTACTTCTACAACAGCTGTCAAACTTGTCTGGTACGGATTGGGTTTTGGTTACTATGGGTGATGGTAAAAAGTACTACTACAACAAC
AAGACGAAGATTAGCAGTTGGCAAATTCCAAATGAAGTATCTGAATTGAGGCAACAGAATGATGAAAAAACAAAAGAACATTCAGCTCCTTTGCCAAATAATAATGCATC
GACTGATCTAGGAACTTCCTCTATCAGTATCAATACTCCTGCCATTAATACAGGTGGCCGTGAAGCCACACCCCTTAGAACGGTAGGAATATCAGGGTCATCTTCCGCTC
TGGATTTGATCAAGAAAAAATTGCAAGACTCTGGAGCTCCGGTAGCTTCCTCACCTATTTCAGCTCCAACAGTAGCTCAATCAGATGTAAATCTATTGAGAGATGCTGAT
GCTACAGTTAAGGCACTGCAGACTGAGAACAACAAGGATAAGCCAAAAGATGCTAATGGTGATGGAAATTTATCAGACTCCTCCTCGGACTCTGAGGATGTAGATAGTGG
GCCAACTAATGAGCAGTTAATTATCCAGTTTAAGGAAATGCTTAAGGAGCGAGGAGTGGCACCATTCTCTAAATGGGACAAGGAATTGCCGAAGATAGTTTTTGATCCCC
GTTTTAAGGGCCAGAACATGACCCGTGCTGAGGAGGAACGCAAGGAAAAAAGAGCTGCTCAGAAGGCTGCAATAGAGGGATTTAAACAGTTATTGGATAGCGCATCTGAG
GATATTGATCACACCACTAGTTATCAAACATTCAAAAAGAAATGGGGAAATGACCCGCGGTTTGAAGCTTTGGATCGTAAGGATCGGGAGAATTTATTGAACGAAAGGTG
CCATTTAATGAGTAACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTGTCAAACTTGTGTGGTACGGATTGCGTTTTGGTTTCTATGGGTGATGGTAAAAAGTTTTACTACAACAACAAGACGATGGTATGCTATTTTCCCCCGCCCCCCTT
CAAAATTCCCCCTGGATGGGGAACAGAGCTCAGATTCAGCTTTCTTATATGTTTGGAAAGAATTGACGATGCTAAGATGATTCCATTAAAAAACATAGAAATTGTGTCAA
TCCTGACCCTCACTCTGAATCAGGTAAGCAGTCGCCAAATCCCAAATGAAGATGAGAAAACAAAAGAACATTTAGCTCCTTTGCGAAATAATAAAGCATTCACTGATCTA
GGATCTTCTCCTATCAATATCAATACTCCTGCCATCAACACAGGTGGTTGTGAAGCCACGCCCCTCAGAATGGTAGGAATATCAGGGTCATCTTCTGCCCTGGATTTGAT
CAAGAAAAAATTACAAGACTCTGGAACTTCTGTAGCTTCCTCGCCTATTTCAGCTCCAGCAATAGCTCAATTAGATGTAAATCTACCGAGAGATGTTGATGTTGCAGTTA
AGGCACTGCAGCTAGAGAACAACAAGGATAAACCGAAAGATGCTAATGGTGATGGAAATGTATCCGACTCCTTCTTGGACTCTGAGGATGTAGAAAGTGGGCCAACTAAT
GAGCAATTAATTATCCAGTTTAAGGTATCATCTAACTTCCCTTTTATTGAGGATCAATATCATAATCTTTTTGATAGTATGATAATACTAGATGGAGTTTTTCTCCTTTT
CTTGTCGGAATTTTCTCTTGCTAGTATTCTGAATATCAAGTTCATTCTGCATTTGAGTCTTTCCTATGCATTGCAGGAGGAGGATGGCACAGATTCTAAGTGTATTGCAG
AAGCGGAAGCTGAAGAGGAGGAGGAGGATTTCATTATGGAGGAGGTAAAAAGGAGATTGAAAGAGCTGAGGAGGAACAGTTTCATGGTGTTGATTCCAGAGGAAGAAGAA
GAAGAAGAAGGAGAAGAAGGAGAAGAAGGAGAAGAAGTAGGCGAAGCCGAGCCCGAGTGGAGAGACGTGGAAGCAGAAGGCCGACAATGGTGGGGAGGGTTTGGTGCTGT
TTATGATGATTACTGCAAGAGGATGTTTTTCTTTGATCGGATGAGCCTTCGATCTGGTCCCGATTCAACCTCCCAAAGATCTGCATCGAAAAAGAGTGCATCTCCTCTTC
GATGTCTTTCTCTGAAGAGGATCGAAGAACCTGAAGATGAAATGGAAGATGTTGACCCATCATTGACTCTGATTGACTCCAATCACCACATAGAAACAGCCTATGTTGCT
CACATTTGCTTGTCCTGGGAGGCCCTTCACTGTCAGTACACTCAACTTAACCACTTAATATCATGCCAACCCCAAAACTCTACTACTCATTATAATCTTACTGCTCAGCT
CTTTCAGCAATTTCAAGTCCTCTTGCAAAGGTTTATTGAAAATGAACCCTTCCAACAAGCTCTCAGGCCTACACTTTATGCCCGAACCCGTCGAACTTTTCCTAAAATGT
TGCATGTTCCTAACATACAAGCTTCAGATCCAAACGGGGTGCAGGAACAAGAATCTGATTTCCTCATCCTCGCTCCTGATCTGCTGCTCATTATTGAGGCTTCAATCTTT
ACTTTCCACCGCTTCCTGAAGATGGACAAGAAAACCTCAAACTCTGCTTCTTTGTCGTTTCGGAACCACACACAGGATGCCGCTCTGCTTGCTCGTGTTCGGTCTTCTCT
CGACAAGAAGAAGACGAAGCTGAAAGAGGTTAGGAAGAAGAGTAGAGGGTGGAAACAGAAAACGTGGCCTCAAACGTATGAAGACATGCAATTACTTTTTGGAATCGTGG
ACATTAAAATCATATCAAGGCTTGTTAAGATGTCGAGGACTACTAAAGAACAGCTGCTCTGGTGCGAGGAGAAAATGAACAAGTTAGATGTGTCTAATGGAAAATTGTGG
AGAGATCCGTCTCCTCTTCTTTTCCCATATAAGTATGGTAATGGACCCTACTACTACTACTACGATGAATCTGAATTTTTGGGGTTTTCGAACTCCTTGTGTTTTAGTGT
TCCTTATGCGGGTCCTTTCAACTATCTAATCTTTCTGTTGCAAACTCACGTCACTATGCAAGAGATCAATGTCATCGTGAAATGCAGAGACTTGACTTCGCCAAAAAAAG
AAAAAAGGAAAAAAAAAAGAAAGAAAAACAGGCGTTGCAAAAACGCGAACAGGCGAGTGACGACCGGCGAACAGGCATTGCAAGAACGTGCGAAGATCGGCGGTAGATCG
TGGACTGCAAACACATGCTTCTTTCTGGGTTTTCTTCGTGGTTACCACACCTGTTACTGGTTAGCTTCGTTTCATTTCACCATGTCTTCAGCATCAACTGTCTCCCAATC
TGTATCACTTCCTGCTCCGCCTACTTCCAATTCTGCTGCTAATGGTTCTTCAATTCCCAATTTGATCCCTGCAACTTCACCAGTTCTTCCTTCCCCATCTTTCCATGTTC
ACCAACTACTACCTGTAACTCCGATGGTACCTGGTCCACCGGGAATGTCGCCGTCGCCGCCAGTTGTGTCGACAAGTCCGGCAGCTCTGTTTCCACCAAACGATTCTGCT
TCCACTATCCCGGGACCCCATATGCATGCAACTCCTAACTCAATTAATCCTTCTGCTCGTCCACAAATTTGTGGTTCATATCCTTCTCTAACTCCTGTTGTTTCTCCGCC
TAATGCGATCTGGTTTCAGCCTCCTCAGTTGGGAGCCATGCCCAGGCCTCCCTTTCTGCCATACTCTGCTTCTTATCATGGTCCTCTTCCTTTTCCTGCCCGTGGAATGC
CCCTTCCCTCTGTCCCATTGCCCGATCCTCAACCTCCCGGTGTTACCCCTGTTCAAGTTGCATCTGCGATTGCTGTGCCACCTGTTCATGGAAGTCAGCTGAGTGGCAAT
TCATTGATTCAGACAGACTCAAATCATCCTCAACTTGGTATGCCGTATTCTGTTGTTAAATTAATATCTTTATATAATGATAGCCAGAAACACGCTCAAGGTGTAGGCCC
ATCTGAGAACATCTCTTTAACTAAGCACTCGGAGGATTGGACTGCCCACAAGACTGAGGCAGGAATAATCTATTACTATAATGCCTTGACAGGAGAATCGACCTATGAAA
AACCTTCGGGTTTCAAAGGGGAGTTACTTCTACAACAGCTGTCAAACTTGTCTGGTACGGATTGGGTTTTGGTTACTATGGGTGATGGTAAAAAGTACTACTACAACAAC
AAGACGAAGATTAGCAGTTGGCAAATTCCAAATGAAGTATCTGAATTGAGGCAACAGAATGATGAAAAAACAAAAGAACATTCAGCTCCTTTGCCAAATAATAATGCATC
GACTGATCTAGGAACTTCCTCTATCAGTATCAATACTCCTGCCATTAATACAGGTGGCCGTGAAGCCACACCCCTTAGAACGGTAGGAATATCAGGGTCATCTTCCGCTC
TGGATTTGATCAAGAAAAAATTGCAAGACTCTGGAGCTCCGGTAGCTTCCTCACCTATTTCAGCTCCAACAGTAGCTCAATCAGATGTAAATCTATTGAGAGATGCTGAT
GCTACAGTTAAGGCACTGCAGACTGAGAACAACAAGGATAAGCCAAAAGATGCTAATGGTGATGGAAATTTATCAGACTCCTCCTCGGACTCTGAGGATGTAGATAGTGG
GCCAACTAATGAGCAGTTAATTATCCAGTTTAAGGAAATGCTTAAGGAGCGAGGAGTGGCACCATTCTCTAAATGGGACAAGGAATTGCCGAAGATAGTTTTTGATCCCC
GTTTTAAGGGCCAGAACATGACCCGTGCTGAGGAGGAACGCAAGGAAAAAAGAGCTGCTCAGAAGGCTGCAATAGAGGGATTTAAACAGTTATTGGATAGCGCATCTGAG
GATATTGATCACACCACTAGTTATCAAACATTCAAAAAGAAATGGGGAAATGACCCGCGGTTTGAAGCTTTGGATCGTAAGGATCGGGAGAATTTATTGAACGAAAGGTG
CCATTTAATGAGTAACTAG
Protein sequenceShow/hide protein sequence
MVSNLCGTDCVLVSMGDGKKFYYNNKTMVCYFPPPPFKIPPGWGTELRFSFLICLERIDDAKMIPLKNIEIVSILTLTLNQVSSRQIPNEDEKTKEHLAPLRNNKAFTDL
GSSPININTPAINTGGCEATPLRMVGISGSSSALDLIKKKLQDSGTSVASSPISAPAIAQLDVNLPRDVDVAVKALQLENNKDKPKDANGDGNVSDSFLDSEDVESGPTN
EQLIIQFKVSSNFPFIEDQYHNLFDSMIILDGVFLLFLSEFSLASILNIKFILHLSLSYALQEEDGTDSKCIAEAEAEEEEEDFIMEEVKRRLKELRRNSFMVLIPEEEE
EEEGEEGEEGEEVGEAEPEWRDVEAEGRQWWGGFGAVYDDYCKRMFFFDRMSLRSGPDSTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIDSNHHIETAYVA
HICLSWEALHCQYTQLNHLISCQPQNSTTHYNLTAQLFQQFQVLLQRFIENEPFQQALRPTLYARTRRTFPKMLHVPNIQASDPNGVQEQESDFLILAPDLLLIIEASIF
TFHRFLKMDKKTSNSASLSFRNHTQDAALLARVRSSLDKKKTKLKEVRKKSRGWKQKTWPQTYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLW
RDPSPLLFPYKYGNGPYYYYYDESEFLGFSNSLCFSVPYAGPFNYLIFLLQTHVTMQEINVIVKCRDLTSPKKEKRKKKRKKNRRCKNANRRVTTGEQALQERAKIGGRS
WTANTCFFLGFLRGYHTCYWLASFHFTMSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPATSPVLPSPSFHVHQLLPVTPMVPGPPGMSPSPPVVSTSPAALFPPNDSA
STIPGPHMHATPNSINPSARPQICGSYPSLTPVVSPPNAIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVASAIAVPPVHGSQLSGN
SLIQTDSNHPQLGMPYSVVKLISLYNDSQKHAQGVGPSENISLTKHSEDWTAHKTEAGIIYYYNALTGESTYEKPSGFKGELLLQQLSNLSGTDWVLVTMGDGKKYYYNN
KTKISSWQIPNEVSELRQQNDEKTKEHSAPLPNNNASTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGAPVASSPISAPTVAQSDVNLLRDAD
ATVKALQTENNKDKPKDANGDGNLSDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKGQNMTRAEEERKEKRAAQKAAIEGFKQLLDSASE
DIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNERCHLMSN