; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G01780 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G01780
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionpre-mRNA-processing protein 40C
Genome locationClcChr09:1476415..1502495
RNA-Seq ExpressionClc09G01780
SyntenyClc09G01780
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003712 - transcription coregulator activity (molecular function)
GO:0070063 - RNA polymerase binding (molecular function)
InterPro domainsIPR001202 - WW domain
IPR002713 - FF domain
IPR012870 - Protein of unknown function DUF1666
IPR036020 - WW domain superfamily
IPR036517 - FF domain superfamily
IPR045148 - Transcription elongation regulator 1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008462197.1 PREDICTED: pre-mRNA-processing protein 40C [Cucumis melo]0.0e+0089.36Show/hide
Query:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPVTSPVLPSPSFHVHQLLPVTPMVPGPPGMTPAPPVVSTSPAALFPPNDSASTILGPHMHATPSSINP
        MSSASTVSQSVSLPAPPTSNS ANGSSIPNLIP TSPV P+PSFH+HQL PV PMVPGPPGM+P+ P+VST PA LFPP DSASTI GP+MHA  + I+P
Subjt:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPVTSPVLPSPSFHVHQLLPVTPMVPGPPGMTPAPPVVSTSPAALFPPNDSASTILGPHMHATPSSINP

Query:  SARPQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIAVPAGHGNQLSGNSLIQTDS
        SARPQI GSYPSLTPVVSPPH +WFQPPQLGAMPRPPF+PYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVA AI VP+GHGNQL GNSLIQTDS
Subjt:  SARPQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIAVPAGHGNQLSGNSLIQTDS

Query:  NHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKEEPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI
        NHP+LDSQKH Q VG SENISL KHSEDWTAHKTEAG+IYYYNALTGESTYEKP GF+ E ENL++Q TSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI
Subjt:  NHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKEEPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI

Query:  SSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQSDVN
        SSWQIPNEVSELRQQNDEK KE SAPLPNNNALTDLGTSS SINTPAINTGGREATPLRTVGI GSSSALDLIKKKLQDSGTPVASSPISA T+AQSDVN
Subjt:  SSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQSDVN

Query:  LLRDADATVKALQPENNKGKPKDTDGDGNISDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF-----------------KT
        L RDADATVKALQ ENNK KPKD + DGN+SDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF                 KT
Subjt:  LLRDADATVKALQPENNKGKPKDTDGDGNISDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF-----------------KT

Query:  RAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGDINV
        RAEEERKEKRAAQKAAIEGFKQLLD ASEDIDHTTSYQTFKKKWGND RFEALDRKDRENLLNERVLCLKKAA EKAQALWAASTTSFKSMLQER DINV
Subjt:  RAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGDINV

Query:  NSRWFRVKDSLRDDPRYRSVKHEEREMLFNEYISELKAIEEEKQHESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIK
        NSRWFRVKDSLR+DPRYRSVKHEEREMLFNEYISELKA EEEKQ ESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIK
Subjt:  NSRWFRVKDSLRDDPRYRSVKHEEREMLFNEYISELKAIEEEKQHESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIK

Query:  DPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFRSLLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYSKVPRKEREA
        DPQASWTESKVKLEKDPQGRASNPDLDSS+TEKLFREHVKMLQERCANEFR+LLSEAFTAEVVAQVSEDGKTVL+SWTMAKRILKPDPRY KVPRKEREA
Subjt:  DPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFRSLLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYSKVPRKEREA

Query:  VWRRYADDTLRKQKLANDHKGEKHNDFKSRATIDAGKFPSKPRIHE
        +WRRYADDT+RKQKLANDHKGEK+ND+K+RAT DAGKFPSKPRIH+
Subjt:  VWRRYADDTLRKQKLANDHKGEKHNDFKSRATIDAGKFPSKPRIHE

XP_011659583.1 pre-mRNA-processing protein 40C [Cucumis sativus]0.0e+0089.24Show/hide
Query:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPVTSPVLPSPSFHVHQLLPVTPMVPGPPGMTPAPPVVSTSPAALFPPNDSASTILGPHMHATPSSINP
        MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIP TSPV P+PSFH+HQL  V PMVPGPPGM+P+ P+VST PA LFPP DSASTI GP+MHA  + I+P
Subjt:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPVTSPVLPSPSFHVHQLLPVTPMVPGPPGMTPAPPVVSTSPAALFPPNDSASTILGPHMHATPSSINP

Query:  SARPQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIAVPAGHGNQLSGNSLIQTDS
        SARPQI GSYPSLTPVVSPPH +WFQPPQLGAMPRPPFLPYS SYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVA  I+VP+GHGNQL GN+LIQTDS
Subjt:  SARPQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIAVPAGHGNQLSGNSLIQTDS

Query:  NHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKEEPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI
        NHP+LDS KHAQGVG SENISL KHSEDWTAHKTEAG+IYYYNALTGESTYEKPSGF+ E ENLM+Q TSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI
Subjt:  NHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKEEPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI

Query:  SSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQSDVN
        SSWQIPNEVSELRQQNDEK KE SAPLPNNNA TDLGTSS SINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPT+AQSDVN
Subjt:  SSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQSDVN

Query:  LLRDADATVKALQPENNKGKPKDTDGDGNISDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF-----------------KT
        L RDADATVKALQ E NK KPKD + DGN+SDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF                 KT
Subjt:  LLRDADATVKALQPENNKGKPKDTDGDGNISDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF-----------------KT

Query:  RAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGDINV
        RAEEERKEKRAAQKAAIEGFKQLLD ASEDIDHTTSYQTFKKKWGND RFEALDRKDRENLLNERVLCLKKAA EKAQALWAASTTSFKSMLQER DIN+
Subjt:  RAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGDINV

Query:  NSRWFRVKDSLRDDPRYRSVKHEEREMLFNEYISELKAIEEEKQHESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIK
        NSRWFRVKDSLR+DPRYRSVKHEEREMLFNEYISELKA EEEKQ ESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIK
Subjt:  NSRWFRVKDSLRDDPRYRSVKHEEREMLFNEYISELKAIEEEKQHESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIK

Query:  DPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFRSLLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYSKVPRKEREA
        DPQASWTESKVKLEKDPQGRASN DLDSS+TEKLFREHVKMLQERCANEFR+LLSE+FTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRY KVPRKEREA
Subjt:  DPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFRSLLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYSKVPRKEREA

Query:  VWRRYADDTLRKQKLANDHKGEKHNDFKSRATIDAGKFPSKPRIHE
        +WRRYADDT+RKQKLANDHKGEK+ND+K+RAT DAGKFPSKPRIH+
Subjt:  VWRRYADDTLRKQKLANDHKGEKHNDFKSRATIDAGKFPSKPRIHE

XP_022953473.1 pre-mRNA-processing protein 40C [Cucurbita moschata]0.0e+0086.79Show/hide
Query:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPVTSPVLPSPSFHVHQLLPVTPMVPGPPGMTPAPPVVSTSPAALFPPNDS--ASTILGPHMHATPSSI
        MSSASTVSQS+SLPAPPTSNSAANGSSIPNLIP TSPV P+ SFH+HQL P TPMVPGPPGM+P+ PV       +FPP+DS  +STI GP+MHA P+SI
Subjt:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPVTSPVLPSPSFHVHQLLPVTPMVPGPPGMTPAPPVVSTSPAALFPPNDS--ASTILGPHMHATPSSI

Query:  NPSARPQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIAVPAGHGNQLSGNSLIQT
        N S RPQI GSYPSL PVVSPPH IWFQPPQLG MPRPPFLPY ASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQV+ A AVP+ HGN L+GNSLIQT
Subjt:  NPSARPQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIAVPAGHGNQLSGNSLIQT

Query:  DSNHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKEEPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKT
        D NHP+LD+QKHAQG+GQSE+ISL+KHSE+WTAHKTEAG++YYYNALTGESTYEKPSGFK EP+NLM QPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKT
Subjt:  DSNHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKEEPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKT

Query:  KISSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQSD
        KISSWQIPNEV+ELRQQNDEK KEHSAPLPNNNALT+ G+S IS+NTPAINTGGREA PLRTVG+SG SSALDLIKKKLQ+SGTPVASSPIS PTIAQSD
Subjt:  KISSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQSD

Query:  VNLLRDADATVKALQPENNKGKPKDTDGDGNISDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF-----------------
        VNL RDADA VKALQ EN+K KPKD +GDGN+SDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF                 
Subjt:  VNLLRDADATVKALQPENNKGKPKDTDGDGNISDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF-----------------

Query:  KTRAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGDI
        KTRAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLL+ERVLCLKKAA EKAQALWAASTTSFKSMLQERGDI
Subjt:  KTRAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGDI

Query:  NVNSRWFRVKDSLRDDPRYRSVKHEEREMLFNEYISELKAIEEEKQHESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVES
        NVNSRW RVKDSLRDDPRYRSVKHE+REMLFNEYISELKA+EEEKQ ESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVES
Subjt:  NVNSRWFRVKDSLRDDPRYRSVKHEEREMLFNEYISELKAIEEEKQHESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVES

Query:  IKDPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFRSLLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYSKVPRKER
        IKDPQASW+ESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFR+LLSEAFTAEVV+QVSEDGKTVLNSWTMAKR LKPDPRYSK+PRKER
Subjt:  IKDPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFRSLLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYSKVPRKER

Query:  EAVWRRYADDTLRKQKLANDHKGEKHNDFKSRATIDAGKFPSKPRIHE
        EA+WRRYADDTLRKQK AND K EKH++ KSR+T  AGK PSKPRIHE
Subjt:  EAVWRRYADDTLRKQKLANDHKGEKHNDFKSRATIDAGKFPSKPRIHE

XP_023547625.1 pre-mRNA-processing protein 40C [Cucurbita pepo subsp. pepo]0.0e+0086.67Show/hide
Query:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPVTSPVLPSPSFHVHQLLPVTPMVPGPPGMTPAPPVVSTSPAALFPPNDS--ASTILGPHMHATPSSI
        MSSASTVSQS+SLPAPPTSNSAANGSSIPNLIP TSPV P+ SFH+HQL P TPMVPGPPGM+P+ PV       +FPP+DS  +STI GP+MHA P+SI
Subjt:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPVTSPVLPSPSFHVHQLLPVTPMVPGPPGMTPAPPVVSTSPAALFPPNDS--ASTILGPHMHATPSSI

Query:  NPSARPQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIAVPAGHGNQLSGNSLIQT
        N S RPQI GSYPSL PVVSPPH IWFQPPQLG MPRPPFLPY ASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQV+ A AVP+ HGN L+GNSLIQT
Subjt:  NPSARPQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIAVPAGHGNQLSGNSLIQT

Query:  DSNHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKEEPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKT
        D NHP+LD+QKHAQG+GQSE+ISL+KHSE+WTAHKTEAG++YYYNALTGESTYEKPSGFK EP+NLM QPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKT
Subjt:  DSNHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKEEPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKT

Query:  KISSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQSD
        KISSWQIPNEV+ELRQQNDEK KEHS PLPNNNALT+ G+S IS+NTPAINTGGREA PLRTVG+SG SSALDLIKKKLQ+SGTPVASSPISAPTIAQSD
Subjt:  KISSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQSD

Query:  VNLLRDADATVKALQPENNKGKPKDTDGDGNISDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF-----------------
        VNL RDADA VKALQ EN+K KPKD +GDGN+SDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF                 
Subjt:  VNLLRDADATVKALQPENNKGKPKDTDGDGNISDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF-----------------

Query:  KTRAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGDI
        KTRAEEERKEKRAAQKAAIEGFKQLLD ASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLL+ERVLCLKKAA EKAQALWAASTTSFKSMLQERGDI
Subjt:  KTRAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGDI

Query:  NVNSRWFRVKDSLRDDPRYRSVKHEEREMLFNEYISELKAIEEEKQHESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVES
        NVNSRW RVKDSLRDDPRYRSVKHE+REMLFNEYISELKA+EEEKQ ESKA+KEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVES
Subjt:  NVNSRWFRVKDSLRDDPRYRSVKHEEREMLFNEYISELKAIEEEKQHESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVES

Query:  IKDPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFRSLLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYSKVPRKER
        IKDPQASW+ESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFR+LLSEAFTAEVV+QVSEDGKTVLNSWTMAKR LKPDPRYSK+PRKER
Subjt:  IKDPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFRSLLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYSKVPRKER

Query:  EAVWRRYADDTLRKQKLANDHKGEKHNDFKSRATIDAGKFPSKPRIHE
        EA+WRRYADDTLRKQK ANDHK EKH++ KSR+T  AGK PSKPRIHE
Subjt:  EAVWRRYADDTLRKQKLANDHKGEKHNDFKSRATIDAGKFPSKPRIHE

XP_038900162.1 pre-mRNA-processing protein 40C [Benincasa hispida]0.0e+0092.67Show/hide
Query:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPVTSPVLPSPSFHVHQLLPVTPMVPGPPGMTPAPPVVSTSPAALFPPNDSASTILGPHMHATPSSINP
        MSSASTVSQSVSLPAPPTSNS ANGSSIPNLI       P+PSFH HQLLP TPMVPGPPGM+P+ PVVST+PAALFPPNDSASTI GPHMHATP+SINP
Subjt:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPVTSPVLPSPSFHVHQLLPVTPMVPGPPGMTPAPPVVSTSPAALFPPNDSASTILGPHMHATPSSINP

Query:  SARPQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIAVPAGHGNQLSGNSLIQTDS
        S RPQI GSYPSLTPVVSPPH IWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVA AIAV +GHGNQLSGNSLIQTDS
Subjt:  SARPQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIAVPAGHGNQLSGNSLIQTDS

Query:  NHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKEEPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI
        NHPQLDSQKHAQGVGQSENI LTKHSEDWTAHKTEAG+IYYYNALTGESTYEKPSGFK EPEN+M+QPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI
Subjt:  NHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKEEPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI

Query:  SSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQSDVN
        SSWQIPNEVSELRQQNDEK KEHSAPLPNNNALTDLGTSSISINTPAINTGGREATPLR VGISGSSSALDLIKKKLQDSGTPVASSPISAPT+AQ DVN
Subjt:  SSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQSDVN

Query:  LLRDADATVKALQPENNKGKPKDTDGDGNISDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF-----------------KT
        LLRDADATVKALQ ENNK KPKD DGDGN+SDSSSDSEDVD+GPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF                 KT
Subjt:  LLRDADATVKALQPENNKGKPKDTDGDGNISDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF-----------------KT

Query:  RAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGDINV
        RAEEERKEKRAAQKAA+EGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNERVL LKKAA EKAQALWAASTTSFKSMLQERGDINV
Subjt:  RAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGDINV

Query:  NSRWFRVKDSLRDDPRYRSVKHEEREMLFNEYISELKAIEEEKQHESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIK
        NSRWFRVKDSLRDDPRYRSVKHEEREMLFNEYISELKA+EEEKQ ESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIK
Subjt:  NSRWFRVKDSLRDDPRYRSVKHEEREMLFNEYISELKAIEEEKQHESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIK

Query:  DPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFRSLLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYSKVPRKEREA
        DPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFR+LLSEAFTAEVVAQ+SEDGKTVLNSWTMAKRILKPDPRYSKVPRKEREA
Subjt:  DPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFRSLLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYSKVPRKEREA

Query:  VWRRYADDTLRKQKLANDHKGEKHNDFKSRATIDAGKFPSKPRIHE
        +WRRYADDTLRKQKLANDHKGEKHNDFKSRATIDAGKFPSKPRIHE
Subjt:  VWRRYADDTLRKQKLANDHKGEKHNDFKSRATIDAGKFPSKPRIHE

TrEMBL top hitse value%identityAlignment
A0A0A0K978 Uncharacterized protein0.0e+0089.24Show/hide
Query:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPVTSPVLPSPSFHVHQLLPVTPMVPGPPGMTPAPPVVSTSPAALFPPNDSASTILGPHMHATPSSINP
        MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIP TSPV P+PSFH+HQL  V PMVPGPPGM+P+ P+VST PA LFPP DSASTI GP+MHA  + I+P
Subjt:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPVTSPVLPSPSFHVHQLLPVTPMVPGPPGMTPAPPVVSTSPAALFPPNDSASTILGPHMHATPSSINP

Query:  SARPQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIAVPAGHGNQLSGNSLIQTDS
        SARPQI GSYPSLTPVVSPPH +WFQPPQLGAMPRPPFLPYS SYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVA  I+VP+GHGNQL GN+LIQTDS
Subjt:  SARPQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIAVPAGHGNQLSGNSLIQTDS

Query:  NHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKEEPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI
        NHP+LDS KHAQGVG SENISL KHSEDWTAHKTEAG+IYYYNALTGESTYEKPSGF+ E ENLM+Q TSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI
Subjt:  NHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKEEPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI

Query:  SSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQSDVN
        SSWQIPNEVSELRQQNDEK KE SAPLPNNNA TDLGTSS SINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPT+AQSDVN
Subjt:  SSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQSDVN

Query:  LLRDADATVKALQPENNKGKPKDTDGDGNISDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF-----------------KT
        L RDADATVKALQ E NK KPKD + DGN+SDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF                 KT
Subjt:  LLRDADATVKALQPENNKGKPKDTDGDGNISDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF-----------------KT

Query:  RAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGDINV
        RAEEERKEKRAAQKAAIEGFKQLLD ASEDIDHTTSYQTFKKKWGND RFEALDRKDRENLLNERVLCLKKAA EKAQALWAASTTSFKSMLQER DIN+
Subjt:  RAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGDINV

Query:  NSRWFRVKDSLRDDPRYRSVKHEEREMLFNEYISELKAIEEEKQHESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIK
        NSRWFRVKDSLR+DPRYRSVKHEEREMLFNEYISELKA EEEKQ ESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIK
Subjt:  NSRWFRVKDSLRDDPRYRSVKHEEREMLFNEYISELKAIEEEKQHESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIK

Query:  DPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFRSLLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYSKVPRKEREA
        DPQASWTESKVKLEKDPQGRASN DLDSS+TEKLFREHVKMLQERCANEFR+LLSE+FTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRY KVPRKEREA
Subjt:  DPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFRSLLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYSKVPRKEREA

Query:  VWRRYADDTLRKQKLANDHKGEKHNDFKSRATIDAGKFPSKPRIHE
        +WRRYADDT+RKQKLANDHKGEK+ND+K+RAT DAGKFPSKPRIH+
Subjt:  VWRRYADDTLRKQKLANDHKGEKHNDFKSRATIDAGKFPSKPRIHE

A0A1S3CHX0 pre-mRNA-processing protein 40C0.0e+0089.36Show/hide
Query:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPVTSPVLPSPSFHVHQLLPVTPMVPGPPGMTPAPPVVSTSPAALFPPNDSASTILGPHMHATPSSINP
        MSSASTVSQSVSLPAPPTSNS ANGSSIPNLIP TSPV P+PSFH+HQL PV PMVPGPPGM+P+ P+VST PA LFPP DSASTI GP+MHA  + I+P
Subjt:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPVTSPVLPSPSFHVHQLLPVTPMVPGPPGMTPAPPVVSTSPAALFPPNDSASTILGPHMHATPSSINP

Query:  SARPQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIAVPAGHGNQLSGNSLIQTDS
        SARPQI GSYPSLTPVVSPPH +WFQPPQLGAMPRPPF+PYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVA AI VP+GHGNQL GNSLIQTDS
Subjt:  SARPQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIAVPAGHGNQLSGNSLIQTDS

Query:  NHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKEEPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI
        NHP+LDSQKH Q VG SENISL KHSEDWTAHKTEAG+IYYYNALTGESTYEKP GF+ E ENL++Q TSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI
Subjt:  NHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKEEPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI

Query:  SSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQSDVN
        SSWQIPNEVSELRQQNDEK KE SAPLPNNNALTDLGTSS SINTPAINTGGREATPLRTVGI GSSSALDLIKKKLQDSGTPVASSPISA T+AQSDVN
Subjt:  SSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQSDVN

Query:  LLRDADATVKALQPENNKGKPKDTDGDGNISDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF-----------------KT
        L RDADATVKALQ ENNK KPKD + DGN+SDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF                 KT
Subjt:  LLRDADATVKALQPENNKGKPKDTDGDGNISDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF-----------------KT

Query:  RAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGDINV
        RAEEERKEKRAAQKAAIEGFKQLLD ASEDIDHTTSYQTFKKKWGND RFEALDRKDRENLLNERVLCLKKAA EKAQALWAASTTSFKSMLQER DINV
Subjt:  RAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGDINV

Query:  NSRWFRVKDSLRDDPRYRSVKHEEREMLFNEYISELKAIEEEKQHESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIK
        NSRWFRVKDSLR+DPRYRSVKHEEREMLFNEYISELKA EEEKQ ESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIK
Subjt:  NSRWFRVKDSLRDDPRYRSVKHEEREMLFNEYISELKAIEEEKQHESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIK

Query:  DPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFRSLLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYSKVPRKEREA
        DPQASWTESKVKLEKDPQGRASNPDLDSS+TEKLFREHVKMLQERCANEFR+LLSEAFTAEVVAQVSEDGKTVL+SWTMAKRILKPDPRY KVPRKEREA
Subjt:  DPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFRSLLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYSKVPRKEREA

Query:  VWRRYADDTLRKQKLANDHKGEKHNDFKSRATIDAGKFPSKPRIHE
        +WRRYADDT+RKQKLANDHKGEK+ND+K+RAT DAGKFPSKPRIH+
Subjt:  VWRRYADDTLRKQKLANDHKGEKHNDFKSRATIDAGKFPSKPRIHE

A0A5A7V0S2 Pre-mRNA-processing protein 40C0.0e+0089.36Show/hide
Query:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPVTSPVLPSPSFHVHQLLPVTPMVPGPPGMTPAPPVVSTSPAALFPPNDSASTILGPHMHATPSSINP
        MSSASTVSQSVSLPAPPTSNS ANGSSIPNLIP TSPV P+PSFH+HQL PV PMVPGPPGM+P+ P+VST PA LFPP DSASTI GP+MHA  + I+P
Subjt:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPVTSPVLPSPSFHVHQLLPVTPMVPGPPGMTPAPPVVSTSPAALFPPNDSASTILGPHMHATPSSINP

Query:  SARPQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIAVPAGHGNQLSGNSLIQTDS
        SARPQI GSYPSLTPVVSPPH +WFQPPQLGAMPRPPF+PYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVA AI VP+GHGNQL GNSLIQTDS
Subjt:  SARPQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIAVPAGHGNQLSGNSLIQTDS

Query:  NHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKEEPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI
        NHP+LDSQKH Q VG SENISL KHSEDWTAHKTEAG+IYYYNALTGESTYEKP GF+ E ENL++Q TSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI
Subjt:  NHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKEEPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKI

Query:  SSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQSDVN
        SSWQIPNEVSELRQQNDEK KE SAPLPNNNALTDLGTSS SINTPAINTGGREATPLRTVGI GSSSALDLIKKKLQDSGTPVASSPISA T+AQSDVN
Subjt:  SSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQSDVN

Query:  LLRDADATVKALQPENNKGKPKDTDGDGNISDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF-----------------KT
        L RDADATVKALQ ENNK KPKD + DGN+SDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF                 KT
Subjt:  LLRDADATVKALQPENNKGKPKDTDGDGNISDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF-----------------KT

Query:  RAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGDINV
        RAEEERKEKRAAQKAAIEGFKQLLD ASEDIDHTTSYQTFKKKWGND RFEALDRKDRENLLNERVLCLKKAA EKAQALWAASTTSFKSMLQER DINV
Subjt:  RAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGDINV

Query:  NSRWFRVKDSLRDDPRYRSVKHEEREMLFNEYISELKAIEEEKQHESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIK
        NSRWFRVKDSLR+DPRYRSVKHEEREMLFNEYISELKA EEEKQ ESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIK
Subjt:  NSRWFRVKDSLRDDPRYRSVKHEEREMLFNEYISELKAIEEEKQHESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIK

Query:  DPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFRSLLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYSKVPRKEREA
        DPQASWTESKVKLEKDPQGRASNPDLDSS+TEKLFREHVKMLQERCANEFR+LLSEAFTAEVVAQVSEDGKTVL+SWTMAKRILKPDPRY KVPRKEREA
Subjt:  DPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFRSLLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYSKVPRKEREA

Query:  VWRRYADDTLRKQKLANDHKGEKHNDFKSRATIDAGKFPSKPRIHE
        +WRRYADDT+RKQKLANDHKGEK+ND+K+RAT DAGKFPSKPRIH+
Subjt:  VWRRYADDTLRKQKLANDHKGEKHNDFKSRATIDAGKFPSKPRIHE

A0A6J1GNF1 pre-mRNA-processing protein 40C0.0e+0086.79Show/hide
Query:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPVTSPVLPSPSFHVHQLLPVTPMVPGPPGMTPAPPVVSTSPAALFPPNDS--ASTILGPHMHATPSSI
        MSSASTVSQS+SLPAPPTSNSAANGSSIPNLIP TSPV P+ SFH+HQL P TPMVPGPPGM+P+ PV       +FPP+DS  +STI GP+MHA P+SI
Subjt:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPVTSPVLPSPSFHVHQLLPVTPMVPGPPGMTPAPPVVSTSPAALFPPNDS--ASTILGPHMHATPSSI

Query:  NPSARPQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIAVPAGHGNQLSGNSLIQT
        N S RPQI GSYPSL PVVSPPH IWFQPPQLG MPRPPFLPY ASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQV+ A AVP+ HGN L+GNSLIQT
Subjt:  NPSARPQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIAVPAGHGNQLSGNSLIQT

Query:  DSNHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKEEPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKT
        D NHP+LD+QKHAQG+GQSE+ISL+KHSE+WTAHKTEAG++YYYNALTGESTYEKPSGFK EP+NLM QPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKT
Subjt:  DSNHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKEEPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKT

Query:  KISSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQSD
        KISSWQIPNEV+ELRQQNDEK KEHSAPLPNNNALT+ G+S IS+NTPAINTGGREA PLRTVG+SG SSALDLIKKKLQ+SGTPVASSPIS PTIAQSD
Subjt:  KISSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQSD

Query:  VNLLRDADATVKALQPENNKGKPKDTDGDGNISDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF-----------------
        VNL RDADA VKALQ EN+K KPKD +GDGN+SDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF                 
Subjt:  VNLLRDADATVKALQPENNKGKPKDTDGDGNISDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF-----------------

Query:  KTRAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGDI
        KTRAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLL+ERVLCLKKAA EKAQALWAASTTSFKSMLQERGDI
Subjt:  KTRAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGDI

Query:  NVNSRWFRVKDSLRDDPRYRSVKHEEREMLFNEYISELKAIEEEKQHESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVES
        NVNSRW RVKDSLRDDPRYRSVKHE+REMLFNEYISELKA+EEEKQ ESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVES
Subjt:  NVNSRWFRVKDSLRDDPRYRSVKHEEREMLFNEYISELKAIEEEKQHESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVES

Query:  IKDPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFRSLLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYSKVPRKER
        IKDPQASW+ESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFR+LLSEAFTAEVV+QVSEDGKTVLNSWTMAKR LKPDPRYSK+PRKER
Subjt:  IKDPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFRSLLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYSKVPRKER

Query:  EAVWRRYADDTLRKQKLANDHKGEKHNDFKSRATIDAGKFPSKPRIHE
        EA+WRRYADDTLRKQK AND K EKH++ KSR+T  AGK PSKPRIHE
Subjt:  EAVWRRYADDTLRKQKLANDHKGEKHNDFKSRATIDAGKFPSKPRIHE

A0A6J1JUS6 pre-mRNA-processing protein 40C0.0e+0085.97Show/hide
Query:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPVTSPVLPSPSFHVHQLLPVTPMVPGPPGMTPAPPVVSTSPAALFPPNDS--ASTILGPHMHATPSSI
        MSSASTVSQS+SLPAPPTSNSAANGSSIPNLIP T PV P+ SFH+HQL P TPMVPGPPGM+P+ PV       +FPP+DS  +STI GP+MHA P+SI
Subjt:  MSSASTVSQSVSLPAPPTSNSAANGSSIPNLIPVTSPVLPSPSFHVHQLLPVTPMVPGPPGMTPAPPVVSTSPAALFPPNDS--ASTILGPHMHATPSSI

Query:  NPSARPQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIAVPAGHGNQLSGNSLIQT
        N S RPQI GSYPSL PVVSPPH IWFQPPQLG MPRPPFLPY ASYHGPLPFPARGM LPSVPLPDPQPPGVTPVQV+ A AVP+ HG+ L+GNSLIQT
Subjt:  NPSARPQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIAVPAGHGNQLSGNSLIQT

Query:  DSNHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKEEPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKT
        D NHP+LD+ KHAQG+GQSE+ISL+KHSE+WTAHKTE+G++YYYNALTGESTYEKPSGFK EP+NLM QPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKT
Subjt:  DSNHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKEEPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKT

Query:  KISSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQSD
        KISSWQIPNEV+ELRQQNDEK KEHSAPLPNN+ALT+ G+S IS+NTPAINTGGREA PLRTVG+SG SSALDLIKKKLQ+SGTPVASSPISAPTIAQSD
Subjt:  KISSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQSD

Query:  VNLLRDADATVKALQPENNKGKPKDTDGDGNISDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF-----------------
        VNL RDADA VKALQ EN+K KPKD +GDGN+SDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF                 
Subjt:  VNLLRDADATVKALQPENNKGKPKDTDGDGNISDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF-----------------

Query:  KTRAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGDI
        KTRAEEERKEKRAAQKAAIEGFKQLLD ASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLL+ERVLCLKKAA EKAQALWAASTTSFKSMLQERGDI
Subjt:  KTRAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGDI

Query:  NVNSRWFRVKDSLRDDPRYRSVKHEEREMLFNEYISELKAIEEEKQHESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVES
        NVNSRW RVKDSLRDDPRYRSVKHE+REMLFNEYISELKA+EEEKQ ESKA+KEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVES
Subjt:  NVNSRWFRVKDSLRDDPRYRSVKHEEREMLFNEYISELKAIEEEKQHESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVES

Query:  IKDPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFRSLLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYSKVPRKER
        IKDPQASW+ESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFR+LLSEAFTAEVV+Q SEDGKTVLNSWTMAKR LKPDPRYSK+PRKER
Subjt:  IKDPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFRSLLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYSKVPRKER

Query:  EAVWRRYADDTLRKQKLANDHKGEKHNDFKSRATIDAGKFPSKPRIHE
        EA+WRRYADDTLRKQK ANDHK EKH++ KSR+T  AGK PSKPRIHE
Subjt:  EAVWRRYADDTLRKQKLANDHKGEKHNDFKSRATIDAGKFPSKPRIHE

SwissProt top hitse value%identityAlignment
B6EUA9 Pre-mRNA-processing protein 40A1.2e-1524.77Show/hide
Query:  PMVPGPPGMTPAPPVVST-SPAALFPPNDSASTILGPHMHATPSSINPSARPQIFGSYP-------SLTPVVSPPH-------GIWFQPPQLGAMPRPPF
        PMVPG  G    P       P    PPN  +           P    P  + Q+F   P       S +  VS P+             PQ  A P   F
Subjt:  PMVPGPPGMTPAPPVVST-SPAALFPPNDSASTILGPHMHATPSSINPSARPQIFGSYP-------SLTPVVSPPH-------GIWFQPPQLGAMPRPPF

Query:  LPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIA----VPAGHGNQLSGNSLIQTDSNHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKT
              +  P  F     P    P    QP     V   P  A    VP      L  + + QT    P   S        QS        + DW  H +
Subjt:  LPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIA----VPAGHGNQLSGNSLIQTDSNHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKT

Query:  EAGVIYYYNALTGESTYEKPSGFKEEPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKISSWQIP------NEVSELRQQNDEKPKEHSAPLP
          G  YYYN  T +S +EKP          +   T +  ++ S T W   T  +GKKYYYN  TK S W IP       E ++L  +     +  S PL 
Subjt:  EAGVIYYYNALTGESTYEKPSGFKEEPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKISSWQIP------NEVSELRQQNDEKPKEHSAPLP

Query:  NNNA-LTDLGTSSISINTPAINTG--GREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQSDVNLLRDADATVKALQPENNKGKPKD--
        ++ A  +DL  S+++   P+ ++   G  ++P++  G++   +    +      SG   A S   A TI   +++  R AD +      +NN+ + K+  
Subjt:  NNNA-LTDLGTSSISINTPAINTG--GREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQSDVNLLRDADATVKALQPENNKGKPKD--

Query:  TDGDGNISDS--SSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKT-----------------RAEEERKEKRAAQKAAIEGFK
         +G  N+S +   ++ E+     T ++    FK +L+   V     W++ L +IV D R+                   R + E +E+R  QK A E F 
Subjt:  TDGDGNISDS--SSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKT-----------------RAEEERKEKRAAQKAAIEGFK

Query:  QLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDR-KDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGDINVNSRWFRVKDSLRDDPRYRSV
        ++L+   E++  +  +      + ND RF+A+DR +DRE+L +  ++ L++   EKA          ++  L+    I   ++W +++D L DD R   +
Subjt:  QLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDR-KDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGDINVNSRWFRVKDSLRDDPRYRSV

Query:  KHEEREMLFNEYISELKAIEEEKQHESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDPQGR
        +  +R + F EYI +L              ++E+E+LK  E+E  +R ER+ ++  R  L    +E VA+        I   +  W +  ++L+  PQ +
Subjt:  KHEEREMLFNEYISELKAIEEEKQHESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDPQGR

Query:  ASNPDLDSSDTEKLFREHVKMLQERCANEFRSLLSEAFTAEVVAQVS----EDGKTVLN
        A   +   S  + LF +  + L E+  +E +S + +A  +  ++ VS    ED K+ ++
Subjt:  ASNPDLDSSDTEKLFREHVKMLQERCANEFRSLLSEAFTAEVVAQVS----EDGKTVLN

F4JCC1 Pre-mRNA-processing protein 40B4.5e-1523.82Show/hide
Query:  FPPNDSASTILGP--HMHATPSSINPSARPQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMP---LPSVPLPDPQPPGV
        +PP   A  +L    H H  PS +N                V   P G    PP L + P       ++ +   +  P  GMP    P      P     
Subjt:  FPPNDSASTILGP--HMHATPSSINPSARPQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMP---LPSVPLPDPQPPGV

Query:  TPVQVAPAIAVPAGHGNQLSGNSLIQTDSNHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKEEPENLMSQPTSV
           +V P +  P+ H       S+I T +    ++           + +   K   DW  H +  G  Y++N  T +ST+EKP       E   ++    
Subjt:  TPVQVAPAIAVPAGHGNQLSGNSLIQTDSNHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKEEPENLMSQPTSV

Query:  SMSNLSGTDWVLVTMGDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALD
               TDW   +  DG+KYYYN  TK S+W +P E+  +R+Q +       A +   +A   +  S +   +   +T      P +T   S     L 
Subjt:  SMSNLSGTDWVLVTMGDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALD

Query:  LIKKKLQDSGTPVASSPI-------------------------SAPTIAQSDVNLLRDADATVKALQPENNKGKPKDTDGDGNISDSSS----DSEDVDS
        L     Q +  P +SSP+                         S P    S   L+   + +V      ++        G G+    S     +SE V+S
Subjt:  LIKKKLQDSGTPVASSPI-------------------------SAPTIAQSDVNLLRDADATVKALQPENNKGKPKDTDGDGNISDSSS----DSEDVDS

Query:  GPTNEQL-------------IIQFKEMLKERGVAPFSKWDKELPKIVFDPRF--------------------KTRAEEERKEKRAAQKAAIEGFKQLLDR
            +Q+             +  FK +LK   V     W++ + +I+ D R+                    K  AEEER    A QK   E FK++L+ 
Subjt:  GPTNEQL-------------IIQFKEMLKERGVAPFSKWDKELPKIVFDPRF--------------------KTRAEEERKEKRAAQKAAIEGFKQLLDR

Query:  ASEDIDHTTSYQTFKKKWGNDPRFEALDR-KDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGDINVNSRWFRVKDSLRDDPRYRSVKHEER
          E +  +T +      + +D RF+AL+R KDR N+  + V  LK+    KA      +   +K  L+    I  NS+W +V+D L  D R   ++  ++
Subjt:  ASEDIDHTTSYQTFKKKWGNDPRFEALDR-KDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGDINVNSRWFRVKDSLRDDPRYRSVKHEER

Query:  EMLFNEYISELKAIEEEKQHESKARKEEQEKLKERERE
          +F EY+ +L+  EEEK+   K +KEE +K++ + R+
Subjt:  EMLFNEYISELKAIEEEKQHESKARKEEQEKLKERERE

O14776 Transcription elongation regulator 13.7e-4126.68Show/hide
Query:  ASTVSQSVSLPAPPTSNSAANGSSIPNLIPVTSPVLPSPSFHVHQLLPVTPMVPGPPGMTPAPPVVSTSPAALFPPNDSASTILGPHMHATPSSINPSAR
        AST + S   PA  TS S++  SS  +     + V  + S       P T        ++ A P VS S  A  P      T+  PH    P ++ P + 
Subjt:  ASTVSQSVSLPAPPTSNSAANGSSIPNLIPVTSPVLPSPSFHVHQLLPVTPMVPGPPGMTPAPPVVSTSPAALFPPNDSASTILGPHMHATPSSINPSAR

Query:  PQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIAVPAGHGNQLSGNSLIQTDSNHP
        PQ   + P+  PV+ PP    F+ P  G                P+P P   M +  V  P  +    T   V P +A P                  HP
Subjt:  PQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIAVPAGHGNQLSGNSLIQTDSNHP

Query:  QLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKE--------------------------------------------
        Q+        +  +  +S      +WT +KT  G  YYYN  T EST+EKP   KE                                            
Subjt:  QLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKE--------------------------------------------

Query:  ---EPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSIS--INTPAINTGGRE
           E E    +   V+ + + GT W +V  GD + ++YN  T++S W  P+++                          +G + +   I  P    G  E
Subjt:  ---EPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSIS--INTPAINTGGRE

Query:  ATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQSDVNLLRDADATVKALQPENNKGKPKDTDGDGNISDSSSDSEDVDSGPTNEQLIIQFK
           LR      + + L + K +        + S I        ++N     D  VKA + + +  K  D++ +  +      + +    P  E  + QFK
Subjt:  ATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQSDVNLLRDADATVKALQPENNKGKPKDTDGDGNISDSSSDSEDVDSGPTNEQLIIQFK

Query:  EMLKERGVAPFSKWDKELPKIVFDPRF----------------KTRAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDR
        +ML ERGV+ FS W+KEL KIVFDPR+                KTRAEEER+EK+     A E FK++++ A    +   ++  F  K   D RF+A+++
Subjt:  EMLKERGVAPFSKWDKELPKIVFDPRF----------------KTRAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDR

Query:  -KDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGDINVNSRWFRVKDSLRDDPRYRSVKHEE-REMLFNEYISEL-KAIEEEKQHESKARKE
         KDRE L NE V   +K   E ++       + F  +L     ++  SRW +VKD +  DPRY++V     RE LF +YI ++ K ++ EK+ E + +  
Subjt:  -KDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGDINVNSRWFRVKDSLRDDPRYRSVKHEE-REMLFNEYISEL-KAIEEEKQHESKARKE

Query:  EQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFRS
         +  L+ERERE +K +  + +E++R R + +++EA+ +F+ALL + ++    SW++++  L KD +   S   L+  + EKLF EH++ L ++    FR 
Subjt:  EQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFRS

Query:  LLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYSKVPRKERE
        LL E               T+ ++W   K+I+K DPR  K    +R+
Subjt:  LLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYSKVPRKERE

Q8CGF7 Transcription elongation regulator 19.7e-4226.71Show/hide
Query:  SSASTVSQSVSLPAPPTSNSA-ANGSSIPNLIPVTSPVLPSPSFHVHQLL--PVTPMVPGPPGMTPAPPVVSTSPAALFPPNDSASTILGPHMHATPSSI
        + A   +Q+V  P P TS+ A A  +S P   P ++    + +  V Q +  P T        ++ A P VS S  A  P      T+  PH    P ++
Subjt:  SSASTVSQSVSLPAPPTSNSA-ANGSSIPNLIPVTSPVLPSPSFHVHQLL--PVTPMVPGPPGMTPAPPVVSTSPAALFPPNDSASTILGPHMHATPSSI

Query:  NPSARPQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIAVPAGHGNQLSGNSLIQT
         P + PQ   + P+  PV+ PP    F+ P  G                P+P P   M +  V  P  +    T   V P +A P               
Subjt:  NPSARPQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIAVPAGHGNQLSGNSLIQT

Query:  DSNHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKE---------------------------------------
           HPQ+        +  +  +S      +WT +KT  G  YYYN  T EST+EKP   KE                                       
Subjt:  DSNHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKE---------------------------------------

Query:  --------EPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSISINTPAINTG
                E E    +   V+ + + GT W +V  GD + ++YN  T++S W  P+++   R   D+  +E     P+   L D+      +  PA    
Subjt:  --------EPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSISINTPAINTG

Query:  GREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQSDVNLLRDADATVKALQPENNKGKPKDTDGDGNISDSSSDSEDVDSGPTNEQLII
            T L       S SA+   ++ +++                        + D  +KA + + +  K  D++ +  +      + +    P  E  + 
Subjt:  GREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQSDVNLLRDADATVKALQPENNKGKPKDTDGDGNISDSSSDSEDVDSGPTNEQLII

Query:  QFKEMLKERGVAPFSKWDKELPKIVFDPRF----------------KTRAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEA
        QFK+ML ERGV+ FS W+KEL KIVFDPR+                KTRAEEER+EK+     A E FK++++ A    +   ++  F  K   D RF+A
Subjt:  QFKEMLKERGVAPFSKWDKELPKIVFDPRF----------------KTRAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEA

Query:  LDR-KDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGDINVNSRWFRVKDSLRDDPRYRSVKHEE-REMLFNEYISEL-KAIEEEKQHESKA
        +++ KDRE L NE V   +K   E ++       + F  +L     ++  SRW +VKD +  DPRY++V     RE LF +YI ++ K ++ EK+ E + 
Subjt:  LDR-KDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGDINVNSRWFRVKDSLRDDPRYRSVKHEE-REMLFNEYISEL-KAIEEEKQHESKA

Query:  RKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANE
        +   +  L+ERERE +K +  + +E++R R + +++EA+ +F+ALL + ++    SW++++  L KD +   S   L+  + EKLF EH++ L ++    
Subjt:  RKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVESIKDPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANE

Query:  FRSLLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYSKVPRKERE
        FR LL E               T+ ++W   K+I+K DPR  K    +R+
Subjt:  FRSLLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYSKVPRKERE

Q9LT25 Pre-mRNA-processing protein 40C2.3e-19751.32Show/hide
Query:  TMSSAST--VSQSVSLPAPPTSNSAANGSSIPNLIPVTSPVLPSPSFHVHQLLPVTPMVPGPPGMTPAPPVVSTSPAALFPPNDSASTILGPHMHATPSS
        +MS AST  VSQSV         + A  SS  N IP  SP+L +  F             G PG T APP + TSP A FP ++  ST   P M A P+ 
Subjt:  TMSSAST--VSQSVSLPAPPTSNSAANGSSIPNLIPVTSPVLPSPSFHVHQLLPVTPMVPGPPGMTPAPPVVSTSPAALFPPNDSASTILGPHMHATPSS

Query:  INPSARPQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIAVPAGHGNQLSGNSLIQ
        +NP   P ++  Y SL      P G+W QPP +G +PR PFL +  ++ G  PFP RG+  P++P     P G +P+     +    G    +S     +
Subjt:  INPSARPQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIAVPAGHGNQLSGNSLIQ

Query:  TDSNHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKEEPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNK
          S    +D +  +Q VG           + WTAHK+EAGV+YYYN++TG+STYEKP GF  EP+ +  QP  VSM +L GTDW LV+  DGKKYYYNNK
Subjt:  TDSNHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKEEPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNK

Query:  TKISSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQS
        TK+SSWQIP EV +  ++ +E+  E  A +P+ + LT+ G+   S++ PAI+ GGR+A  L+T      SSALDL+KKKL DSG PV+S+         S
Subjt:  TKISSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQS

Query:  DVNLLRDADATVKALQPENNKGKPKDTDGDGNISDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF----------------
        + N  +  + T    +  N+ GK KD  G G +SDSSSDSED DSGP+ E+   QFKEMLKERG+APFSKW+KELPKI+FDPRF                
Subjt:  DVNLLRDADATVKALQPENNKGKPKDTDGDGNISDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF----------------

Query:  -KTRAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGD
         KTRAEEER+EKRAA KAAIEGF+QLLD AS DID  T Y+ FKKKWGND RFEA++RK+RE LLNERVL LK++A++KAQ + AA+ + FK+ML+ER +
Subjt:  -KTRAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGD

Query:  INVNSRWFRVKDSLRDDPRYRSVKHEEREMLFNEYISELKAIEEEKQHESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVE
        I++NS W +VKDSLR++PRYRSV HE+RE+ + EYI+ELKA +    HE KAR +E++KL+ERERE RKRKERE QE+ERVR K+R+KEA +S+QALLVE
Subjt:  INVNSRWFRVKDSLRDDPRYRSVKHEEREMLFNEYISELKAIEEEKQHESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVE

Query:  SIKDPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFRSLLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYSKVPRKE
         I+DP+ASWTESK  LE+DPQ RASNPDL+ +D EKLFR+HVK L ERC ++F++LL+EA ++E     +EDGKT LNSW+ AK++LKPD RYSK+PR++
Subjt:  SIKDPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFRSLLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYSKVPRKE

Query:  REAVWRRYADDTLRKQKLANDHKGEKHNDFKS
        RE VWRRY +D  RKQ+  N ++ EK  D+K+
Subjt:  REAVWRRYADDTLRKQKLANDHKGEKHNDFKS

Arabidopsis top hitse value%identityAlignment
AT1G73850.1 Protein of unknown function (DUF1666)2.1e-2829.13Show/hide
Query:  IENSKDKLKDANGDANVSDSEDVE-----------------------SGPTNEQSNIQFKEE--------DGADPKGIAEAEEDEEDFIMEEVKRRLKEL
        IE   D ++D   D    D ED E                        GP+  QS + F ++             K   +  +    F M E+K   +EL
Subjt:  IENSKDKLKDANGDANVSDSEDVE-----------------------SGPTNEQSNIQFKEE--------DGADPKGIAEAEEDEEDFIMEEVKRRLKEL

Query:  RRNSFMVLIPEEEEEGGGGGEEDEEEEVGEGEPEWRD-------VEAEGRQ---WWGGFGAVYDDYCKRMLFFDRMSIQSGPESTS-------QRSASKK
                  EEEE G   GE        +   EWR+            R+    W  +  V+  Y + M F  R+S Q   E+ S        RS S++
Subjt:  RRNSFMVLIPEEEEEGGGGGEEDEEEEVGEGEPEWRD-------VEAEGRQ---WWGGFGAVYDDYCKRMLFFDRMSIQSGPESTS-------QRSASKK

Query:  SASPLRCLSLKRIEEPEDEMEDVDPSLTLIGSSHHIETAYVAHICLSWEALHCQYTQLNHLISCQPQ--NSTIHYNLTAQLFQQFQVLLQRFIENEPFQQ
            L     K+ ++         P+  +      +E+AYVA ICL+WEAL   Y       S   +  N        A  F+ F +LLQR++ENEP++ 
Subjt:  SASPLRCLSLKRIEEPEDEMEDVDPSLTLIGSSHHIETAYVAHICLSWEALHCQYTQLNHLISCQPQ--NSTIHYNLTAQLFQQFQVLLQRFIENEPFQQ

Query:  DFRPAIYARTRQTFPKMLHVPNIQASDPNGVQEQESD----LLILAPDLLLIIEASIFTFHRFLKMDKK--TSNSASSFRNHTQ----DATLLARVRSSL
          RP IYAR R   PK+L VP  Q  +    +E E++      I +   L+I+E  I TF  FL+ DK+        +F   ++    D TL+  ++   
Subjt:  DFRPAIYARTRQTFPKMLHVPNIQASDPNGVQEQESD----LLILAPDLLLIIEASIFTFHRFLKMDKK--TSNSASSFRNHTQ----DATLLARVRSSL

Query:  DKKKTKLKEVRKKSRGWKQKTWPQSYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNG--KLQRDPSPLLFP
         KKKTKLKE+R+  +  ++K      E+M++L G++D+K++SR+++M+   +E L WCEEKM+K+ +  G   LQRD +PL FP
Subjt:  DKKKTKLKEVRKKSRGWKQKTWPQSYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNG--KLQRDPSPLLFP

AT3G19840.1 pre-mRNA-processing protein 40C1.7e-19851.32Show/hide
Query:  TMSSAST--VSQSVSLPAPPTSNSAANGSSIPNLIPVTSPVLPSPSFHVHQLLPVTPMVPGPPGMTPAPPVVSTSPAALFPPNDSASTILGPHMHATPSS
        +MS AST  VSQSV         + A  SS  N IP  SP+L +  F             G PG T APP + TSP A FP ++  ST   P M A P+ 
Subjt:  TMSSAST--VSQSVSLPAPPTSNSAANGSSIPNLIPVTSPVLPSPSFHVHQLLPVTPMVPGPPGMTPAPPVVSTSPAALFPPNDSASTILGPHMHATPSS

Query:  INPSARPQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIAVPAGHGNQLSGNSLIQ
        +NP   P ++  Y SL      P G+W QPP +G +PR PFL +  ++ G  PFP RG+  P++P     P G +P+     +    G    +S     +
Subjt:  INPSARPQIFGSYPSLTPVVSPPHGIWFQPPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIAVPAGHGNQLSGNSLIQ

Query:  TDSNHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKEEPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNK
          S    +D +  +Q VG           + WTAHK+EAGV+YYYN++TG+STYEKP GF  EP+ +  QP  VSM +L GTDW LV+  DGKKYYYNNK
Subjt:  TDSNHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKTEAGVIYYYNALTGESTYEKPSGFKEEPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNK

Query:  TKISSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQS
        TK+SSWQIP EV +  ++ +E+  E  A +P+ + LT+ G+   S++ PAI+ GGR+A  L+T      SSALDL+KKKL DSG PV+S+         S
Subjt:  TKISSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSISINTPAINTGGREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQS

Query:  DVNLLRDADATVKALQPENNKGKPKDTDGDGNISDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF----------------
        + N  +  + T    +  N+ GK KD  G G +SDSSSDSED DSGP+ E+   QFKEMLKERG+APFSKW+KELPKI+FDPRF                
Subjt:  DVNLLRDADATVKALQPENNKGKPKDTDGDGNISDSSSDSEDVDSGPTNEQLIIQFKEMLKERGVAPFSKWDKELPKIVFDPRF----------------

Query:  -KTRAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGD
         KTRAEEER+EKRAA KAAIEGF+QLLD AS DID  T Y+ FKKKWGND RFEA++RK+RE LLNERVL LK++A++KAQ + AA+ + FK+ML+ER +
Subjt:  -KTRAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNERVLCLKKAADEKAQALWAASTTSFKSMLQERGD

Query:  INVNSRWFRVKDSLRDDPRYRSVKHEEREMLFNEYISELKAIEEEKQHESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVE
        I++NS W +VKDSLR++PRYRSV HE+RE+ + EYI+ELKA +    HE KAR +E++KL+ERERE RKRKERE QE+ERVR K+R+KEA +S+QALLVE
Subjt:  INVNSRWFRVKDSLRDDPRYRSVKHEEREMLFNEYISELKAIEEEKQHESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQALLVE

Query:  SIKDPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFRSLLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYSKVPRKE
         I+DP+ASWTESK  LE+DPQ RASNPDL+ +D EKLFR+HVK L ERC ++F++LL+EA ++E     +EDGKT LNSW+ AK++LKPD RYSK+PR++
Subjt:  SIKDPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFRSLLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYSKVPRKE

Query:  REAVWRRYADDTLRKQKLANDHKGEKHNDFKS
        RE VWRRY +D  RKQ+  N ++ EK  D+K+
Subjt:  REAVWRRYADDTLRKQKLANDHKGEKHNDFKS

AT3G20260.1 Protein of unknown function (DUF1666)2.2e-11054.39Show/hide
Query:  EQSNIQFKEEDGADPKGIAEAEEDEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEGGGGGEEDEEEEVGEGE--PEWRDVEAEGRQWWGGFGAVYDDYCK
        EQ+  +  E    D     E E+D++DFI  EVKRRLKELRRNSFMVLIPEEEEE       DE+++ GE +   EWRDV AEG QWWGGF AVY+ YC+
Subjt:  EQSNIQFKEEDGADPKGIAEAEEDEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEGGGGGEEDEEEEVGEGE--PEWRDVEAEGRQWWGGFGAVYDDYCK

Query:  RMLFFDRMSIQS----------GPESTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIGSSHHIETAYVAHICLSWEALHCQYTQLNHLISCQ
        RMLFFDR+S Q            P + S RSASKK +SP RCLSLK+ + PE+++E + P+  +      +ETAYVA +CL+WEALHCQYTQL+HLISCQ
Subjt:  RMLFFDRMSIQS----------GPESTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIGSSHHIETAYVAHICLSWEALHCQYTQLNHLISCQ

Query:  PQNSTIHYNLTAQLFQQFQVLLQRFIENEPFQQDFRPAIYARTRQTFPKMLHVPNIQASDPNGVQEQESDLLILAPDLLLIIEASIFTFHRFLKMDKKTS
        P+  T  YN TAQLFQQF VLLQR+IENEPF+Q  R  +YAR R   PK+L  P IQ SD   + E+++  ++LA DL+ +IE+SI TF+ FLKMDKK  
Subjt:  PQNSTIHYNLTAQLFQQFQVLLQRFIENEPFQQDFRPAIYARTRQTFPKMLHVPNIQASDPNGVQEQESDLLILAPDLLLIIEASIFTFHRFLKMDKKTS

Query:  NS-----ASSFRNHTQDATLLARVRSSLDKKKTKLKEVRKKSRGWKQKTWPQSYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKL
        N       +   NH    T L  V+SS+DKK+ K KE+ KK++G ++K+WPQ++E +QLLF  +DIK+ +R+++MS+ +KEQLLWCEEKM KL+ S GKL
Subjt:  NS-----ASSFRNHTQDATLLARVRSSLDKKKTKLKEVRKKSRGWKQKTWPQSYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKL

Query:  QRDPSPLLFP
        QR PSP+LFP
Subjt:  QRDPSPLLFP

AT5G39785.1 Protein of unknown function (DUF1666)1.9e-2928.63Show/hide
Query:  DANVSDSEDVESGPTNEQSNIQFKEEDGADPKGIAEAEEDEEDF--------IMEEVKRRLKELRRNSFMVLIPEEEEEGGGGGEEDEEEEVGEGEPEWR
        D  +SDS+  E+     Q+        G+D +   E EED   F        ++E++K  +K+++    +  I EEEE      E+D+  ++ E    WR
Subjt:  DANVSDSEDVESGPTNEQSNIQFKEEDGADPKGIAEAEEDEEDF--------IMEEVKRRLKELRRNSFMVLIPEEEEEGGGGGEEDEEEEVGEGEPEWR

Query:  DVEAEGRQWWGGFGAVYD---DYCKRMLFFDRMSIQ----------SGPESTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIGS-SHHIETA
          E +  +     G V+     Y +RM   D +S Q            P+  +    S  S +    +    I   + +  +++P +  +      +E  
Subjt:  DVEAEGRQWWGGFGAVYD---DYCKRMLFFDRMSIQ----------SGPESTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIGS-SHHIETA

Query:  YVAHICLSWEALHCQYTQLNHLISCQPQNSTIHYNLTAQLFQQFQVLLQRFIENEPFQQDFRPAIYARTRQTFPKMLHVPNIQ----ASDPNGVQ---EQ
        YV  +CLSWE LH QY +   L+      S   YN  A  FQQFQVLLQRF+ENEPF++  R   Y + R     +L +P I+        NG +   E+
Subjt:  YVAHICLSWEALHCQYTQLNHLISCQPQNSTIHYNLTAQLFQQFQVLLQRFIENEPFQQDFRPAIYARTRQTFPKMLHVPNIQ----ASDPNGVQ---EQ

Query:  ESDLLILAPDLLLIIEASIFTFHRFLKMDKKTSN-----------SASSFRNHTQDATLLARVRSSLDKKKTKLKEVRKKS----RGWKQKTWPQSYEDM
         +D +I +  L+ I+E +I  F RF++ DK TS+                   ++D  + A V+S L  K+ +L++V K      R +++     S ED 
Subjt:  ESDLLILAPDLLLIIEASIFTFHRFLKMDKKTSN-----------SASSFRNHTQDATLLARVRSSLDKKKTKLKEVRKKS----RGWKQKTWPQSYEDM

Query:  QL-LFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLQRDPSPLLFP
         L  F  VD+K+++R++ MS+ T++ L+WC  K+ K++  N +L  DPS  LFP
Subjt:  QL-LFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLQRDPSPLLFP

AT5G39785.2 Protein of unknown function (DUF1666)1.8e-2728.57Show/hide
Query:  DANVSDSEDVESGPTNEQSNIQFKEEDGADPKGIAEAEEDEEDF--------IMEEVKRRLKELRRNSFMVLIPEEEEEGGGGGEEDEEEEVGEGEPEWR
        D  +SDS+  E+     Q+        G+D +   E EED   F        ++E++K  +K+++    +  I EEEE      E+D+  ++ E    WR
Subjt:  DANVSDSEDVESGPTNEQSNIQFKEEDGADPKGIAEAEEDEEDF--------IMEEVKRRLKELRRNSFMVLIPEEEEEGGGGGEEDEEEEVGEGEPEWR

Query:  DVEAEGRQWWGGFGAVYD---DYCKRMLFFDRMSIQ----------SGPESTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIGS-SHHIETA
          E +  +     G V+     Y +RM   D +S Q            P+  +    S  S +    +    I   + +  +++P +  +      +E  
Subjt:  DVEAEGRQWWGGFGAVYD---DYCKRMLFFDRMSIQ----------SGPESTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIGS-SHHIETA

Query:  YVAHICLSWEALHCQYTQLNHLISCQPQNSTIHYNLTAQLFQQFQVLLQRFIENEPFQQDFRPAIYARTRQTFPKMLHVPNIQ----ASDPNGVQ---EQ
        YV  +CLSWE LH QY +   L+      S   YN  A  FQQFQVLLQRF+ENEPF++  R   Y + R     +L +P I+        NG +   E+
Subjt:  YVAHICLSWEALHCQYTQLNHLISCQPQNSTIHYNLTAQLFQQFQVLLQRFIENEPFQQDFRPAIYARTRQTFPKMLHVPNIQ----ASDPNGVQ---EQ

Query:  ESDLLILAPDLLLIIEASIFTFHRFLKMDKKTSN-----------SASSFRNHTQDATLLARVRSSLDKKKTK-----LKEVRKKSRGWKQKTWPQSYED
         +D +I +  L+ I+E +I  F RF++ DK TS+                   ++D  + A V+S L     K     LK  R   R +++     S ED
Subjt:  ESDLLILAPDLLLIIEASIFTFHRFLKMDKKTSN-----------SASSFRNHTQDATLLARVRSSLDKKKTK-----LKEVRKKSRGWKQKTWPQSYED

Query:  MQL-LFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLQRDPSPLLFP
          L  F  VD+K+++R++ MS+ T++ L+WC  K+ K++  N +L  DPS  LFP
Subjt:  MQL-LFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLQRDPSPLLFP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGATGAATTGACGGACGGAGATGGATACTTCGACCACAAACACGCGGGAGATCCTCGCAGCTTTATGGGTGTTCTTCCCTTAGTTTATACTAGTTCTCATTTTGG
CGTTCGCATGACTTTTGGGCTAATTTTGTTAAACTTGTGTGGTACGGATTGCGTTTTGGTTTCTATGGGTGATGGTAAAAAGTTTTACTACAGCAACAAAACGAAGGTAT
GCTATTTCCTCCCGCCCCCTTCAAAATTCTTCTCCGGATGGGGAACAGAGCTCAGATTCAAGCAACAAGAAAAAGAAGCAGAACTTGTATCTAACTTGTATTTCTTTGTT
TTGTATTTGTTTGCGACTCCTTTGCCGAAAAATAAAGCATTGATTGATCTAGAAACTTCTCCTATAAGTATCAATACTCCTGCCATTGACACAGGTGGTCGTGAAGCCAC
ACCCCTCAGAATGGTAGGAATATCGGGGTCATCTTCTGCCCTGAATTTGATCAAGAAAAAATTGCAAGACTCTGGAACTGCTGTAGCTTCCTCGCCTATTTCAGCTCCAA
CAATAGCTCAATTAAATGTAAATCTACATAAAGATGTTGATGTTGCAGTTAAGGCACTGCGTATAGAGAACAGCAAGGATAAATTGAAAGATGCTAATGGTGATGCAAAT
GTATCTGACTCTGAGGATGTAGAAAGTGGACCAACTAATGAGCAATCAAATATCCAGTTTAAGGAGGAGGATGGCGCAGATCCTAAGGGTATAGCAGAAGCTGAAGAGGA
TGAGGAGGATTTTATAATGGAGGAGGTAAAGAGGAGATTGAAAGAGCTGAGGAGGAACAGTTTCATGGTGTTGATTCCAGAGGAAGAAGAAGAAGGAGGAGGAGGAGGAG
AAGAAGATGAAGAAGAAGAAGTAGGCGAAGGGGAGCCTGAGTGGAGAGACGTGGAAGCAGAAGGTCGACAATGGTGGGGAGGGTTTGGTGCTGTTTATGATGATTACTGC
AAGAGGATGCTTTTCTTTGATCGGATGAGCATTCAATCTGGTCCTGAATCAACCTCCCAAAGATCTGCATCGAAAAAGAGTGCATCTCCTCTTCGATGTCTTTCTCTGAA
GAGGATCGAAGAACCTGAAGATGAGATGGAGGATGTTGACCCTTCATTGACTCTGATTGGCTCCAGTCACCACATAGAAACAGCCTATGTTGCTCACATTTGCTTGTCCT
GGGAAGCCCTTCACTGTCAGTACACTCAACTTAACCACTTAATATCATGCCAACCCCAAAACTCTACTATTCATTATAATCTTACTGCTCAGCTCTTTCAGCAATTTCAA
GTCCTCTTGCAAAGGTTTATTGAAAATGAACCTTTCCAACAAGATTTCAGGCCTGCAATTTATGCCCGAACCCGTCAAACTTTTCCTAAAATGTTGCATGTTCCTAACAT
ACAAGCTTCAGATCCAAACGGGGTGCAGGAACAAGAATCCGACCTCCTAATCCTCGCTCCTGATCTGCTGCTCATTATTGAGGCTTCAATCTTTACTTTCCACCGTTTCC
TGAAGATGGACAAGAAAACCTCAAATTCTGCTTCATCATTTCGGAACCACACCCAGGATGCCACTCTGCTTGCTCGTGTTCGATCTTCTCTTGACAAGAAGAAGACGAAG
CTGAAAGAGGTTAGGAAGAAGAGTAGAGGGTGGAAACAGAAAACGTGGCCTCAATCTTATGAAGACATGCAATTACTCTTTGGAATCGTGGACATTAAAATCATATCAAG
GCTTGTTAAGATGTCGAGGACTACTAAAGAACAGCTGCTTTGGTGCGAGGAGAAAATGAACAAGTTAGATGTGTCTAATGGAAAATTGCAGAGAGATCCGTCTCCTCTTC
TTTTCCCATATAAGTATGGTAATGGACCCTACTACTACTATGATGAATCTGAATTTTTGGGGTTTTCGAACTCCTTGTGTTTTAGTGTTCCTTATGCGGGTCCCTTCAAC
TATCTTATCTTTCTGTTGCAAATTCACATCACAATGGACTTGAAAATCGCCAAAAAAATTAAAGAGAAAAAAAAGAAAGAAAAACAAACGAATCCTCTCAAGACGACCAC
GAACAGGCGTTGCAAAAACGCGAATAGGCGAGTGACGACCGGCGAAAGCCATTGCAAAAACTTGCGAAGATCGACGGTAGATCGTGAACTGCAAACATATGCTTCTTTCT
GGGTTTTCTTCGTGGTTGCAGGTTACCGCACCTGTTACTGGTTAGCTTCGTTTCATTTCACCATGTCTTCAGCATCAACTGTCTCCCAATCTGTGTCACTTCCTGCTCCG
CCTACTTCCAATTCTGCTGCTAATGGTTCTTCAATTCCCAATTTGATCCCTGTGACTTCACCAGTTCTTCCCTCCCCGTCTTTCCACGTTCACCAACTACTGCCTGTAAC
TCCTATGGTACCTGGTCCACCGGGAATGACGCCGGCGCCGCCAGTTGTGTCCACAAGTCCGGCAGCTCTGTTTCCACCAAACGATTCTGCTTCCACTATCCTGGGACCCC
ATATGCATGCAACTCCTAGCTCAATTAATCCTTCTGCTCGTCCACAAATCTTTGGTTCCTATCCTTCTCTAACTCCTGTTGTTTCTCCGCCTCATGGGATCTGGTTTCAG
CCTCCTCAGTTGGGAGCCATGCCCAGGCCTCCCTTTCTGCCATACTCTGCTTCTTATCACGGCCCTCTTCCTTTTCCTGCCCGCGGAATGCCCCTTCCCTCTGTCCCATT
GCCCGATCCTCAACCTCCCGGTGTTACCCCTGTTCAAGTTGCACCTGCTATTGCTGTGCCAGCTGGTCATGGAAATCAGCTTAGTGGCAATTCATTGATTCAGACAGACT
CAAATCATCCTCAACTTGATAGCCAGAAACACGCTCAAGGTGTAGGCCAATCTGAGAACATTTCTTTAACTAAGCACTCCGAGGATTGGACTGCCCACAAGACTGAGGCA
GGAGTAATCTATTACTATAATGCCTTGACGGGAGAATCTACCTATGAAAAGCCTTCGGGTTTCAAAGAGGAGCCTGAAAATCTCATGTCGCAGCCAACATCAGTTTCAAT
GTCAAACTTGTCTGGTACAGATTGGGTTTTGGTTACTATGGGTGATGGTAAAAAGTACTACTATAACAACAAGACGAAGATTAGCAGTTGGCAAATCCCCAATGAAGTAT
CTGAATTGAGGCAACAAAATGATGAAAAACCAAAAGAACATTCCGCTCCGCTGCCAAATAATAATGCATTGACTGATCTAGGAACTTCCTCTATCAGTATCAATACTCCT
GCCATTAATACAGGTGGCCGTGAAGCCACGCCTCTTAGAACGGTAGGAATATCAGGGTCATCTTCTGCTCTGGATTTGATCAAGAAAAAATTGCAAGACTCTGGAACTCC
GGTTGCTTCTTCGCCTATTTCAGCTCCAACAATAGCTCAATCAGATGTAAATCTGCTGAGAGATGCTGATGCTACAGTTAAGGCACTGCAGCCTGAGAACAACAAGGGTA
AGCCAAAAGATACTGATGGTGATGGAAATATATCGGACTCCTCCTCGGACTCTGAGGATGTAGACAGTGGGCCAACTAATGAGCAGTTAATTATCCAGTTTAAGGAAATG
CTTAAGGAGCGAGGAGTGGCACCATTCTCTAAATGGGACAAGGAATTGCCGAAGATAGTTTTTGATCCCCGCTTTAAGACACGTGCTGAGGAGGAACGCAAGGAAAAAAG
AGCTGCTCAGAAGGCTGCAATAGAGGGATTTAAACAGTTACTGGATAGGGCATCTGAGGATATCGATCACACGACTAGTTATCAAACATTCAAAAAGAAATGGGGGAATG
ACCCGCGGTTTGAAGCTTTGGATCGTAAGGATCGGGAGAATTTATTGAACGAAAGGGTCCTTTGTCTGAAGAAGGCTGCCGATGAAAAGGCTCAAGCTTTATGGGCTGCT
TCCACCACTAGTTTCAAGTCCATGCTGCAGGAGAGAGGAGATATCAACGTCAATTCCCGTTGGTTCAGGGTAAAAGATAGTCTACGGGACGATCCAAGATACAGATCTGT
TAAGCATGAGGAGCGTGAGATGTTATTCAATGAGTACATATCTGAACTTAAGGCTATTGAGGAGGAAAAGCAGCACGAATCAAAAGCTAGAAAGGAGGAGCAGGAGAAAC
TGAAGGAAAGGGAGAGAGAATGGCGGAAACGGAAGGAAAGAGAAGAGCAAGAAATGGAAAGGGTCCGTCTAAAAGTACGAAAGAAAGAGGCAGTTGCATCTTTTCAAGCA
TTGCTTGTTGAATCGATCAAAGACCCCCAGGCCTCTTGGACTGAATCAAAAGTTAAATTGGAGAAGGATCCACAAGGGCGTGCATCCAATCCTGATTTAGATTCATCTGA
CACAGAAAAGTTATTCAGAGAACATGTAAAGATGCTGCAAGAGCGGTGTGCAAACGAGTTCAGAAGTCTATTATCTGAAGCCTTTACAGCCGAGGTGGTTGCTCAGGTGT
CTGAAGATGGTAAAACAGTTCTTAATTCATGGACGATGGCTAAACGAATTTTGAAGCCCGATCCCAGATATAGTAAAGTCCCAAGGAAGGAAAGGGAGGCAGTTTGGCGT
CGTTATGCTGATGATACATTGCGGAAGCAGAAGTTGGCAAATGATCATAAAGGAGAAAAACATAACGACTTTAAGAGTAGAGCAACCATTGACGCTGGAAAATTTCCGTC
TAAACCAAGAATCCATGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGATGAATTGACGGACGGAGATGGATACTTCGACCACAAACACGCGGGAGATCCTCGCAGCTTTATGGGTGTTCTTCCCTTAGTTTATACTAGTTCTCATTTTGG
CGTTCGCATGACTTTTGGGCTAATTTTGTTAAACTTGTGTGGTACGGATTGCGTTTTGGTTTCTATGGGTGATGGTAAAAAGTTTTACTACAGCAACAAAACGAAGGTAT
GCTATTTCCTCCCGCCCCCTTCAAAATTCTTCTCCGGATGGGGAACAGAGCTCAGATTCAAGCAACAAGAAAAAGAAGCAGAACTTGTATCTAACTTGTATTTCTTTGTT
TTGTATTTGTTTGCGACTCCTTTGCCGAAAAATAAAGCATTGATTGATCTAGAAACTTCTCCTATAAGTATCAATACTCCTGCCATTGACACAGGTGGTCGTGAAGCCAC
ACCCCTCAGAATGGTAGGAATATCGGGGTCATCTTCTGCCCTGAATTTGATCAAGAAAAAATTGCAAGACTCTGGAACTGCTGTAGCTTCCTCGCCTATTTCAGCTCCAA
CAATAGCTCAATTAAATGTAAATCTACATAAAGATGTTGATGTTGCAGTTAAGGCACTGCGTATAGAGAACAGCAAGGATAAATTGAAAGATGCTAATGGTGATGCAAAT
GTATCTGACTCTGAGGATGTAGAAAGTGGACCAACTAATGAGCAATCAAATATCCAGTTTAAGGAGGAGGATGGCGCAGATCCTAAGGGTATAGCAGAAGCTGAAGAGGA
TGAGGAGGATTTTATAATGGAGGAGGTAAAGAGGAGATTGAAAGAGCTGAGGAGGAACAGTTTCATGGTGTTGATTCCAGAGGAAGAAGAAGAAGGAGGAGGAGGAGGAG
AAGAAGATGAAGAAGAAGAAGTAGGCGAAGGGGAGCCTGAGTGGAGAGACGTGGAAGCAGAAGGTCGACAATGGTGGGGAGGGTTTGGTGCTGTTTATGATGATTACTGC
AAGAGGATGCTTTTCTTTGATCGGATGAGCATTCAATCTGGTCCTGAATCAACCTCCCAAAGATCTGCATCGAAAAAGAGTGCATCTCCTCTTCGATGTCTTTCTCTGAA
GAGGATCGAAGAACCTGAAGATGAGATGGAGGATGTTGACCCTTCATTGACTCTGATTGGCTCCAGTCACCACATAGAAACAGCCTATGTTGCTCACATTTGCTTGTCCT
GGGAAGCCCTTCACTGTCAGTACACTCAACTTAACCACTTAATATCATGCCAACCCCAAAACTCTACTATTCATTATAATCTTACTGCTCAGCTCTTTCAGCAATTTCAA
GTCCTCTTGCAAAGGTTTATTGAAAATGAACCTTTCCAACAAGATTTCAGGCCTGCAATTTATGCCCGAACCCGTCAAACTTTTCCTAAAATGTTGCATGTTCCTAACAT
ACAAGCTTCAGATCCAAACGGGGTGCAGGAACAAGAATCCGACCTCCTAATCCTCGCTCCTGATCTGCTGCTCATTATTGAGGCTTCAATCTTTACTTTCCACCGTTTCC
TGAAGATGGACAAGAAAACCTCAAATTCTGCTTCATCATTTCGGAACCACACCCAGGATGCCACTCTGCTTGCTCGTGTTCGATCTTCTCTTGACAAGAAGAAGACGAAG
CTGAAAGAGGTTAGGAAGAAGAGTAGAGGGTGGAAACAGAAAACGTGGCCTCAATCTTATGAAGACATGCAATTACTCTTTGGAATCGTGGACATTAAAATCATATCAAG
GCTTGTTAAGATGTCGAGGACTACTAAAGAACAGCTGCTTTGGTGCGAGGAGAAAATGAACAAGTTAGATGTGTCTAATGGAAAATTGCAGAGAGATCCGTCTCCTCTTC
TTTTCCCATATAAGTATGGTAATGGACCCTACTACTACTATGATGAATCTGAATTTTTGGGGTTTTCGAACTCCTTGTGTTTTAGTGTTCCTTATGCGGGTCCCTTCAAC
TATCTTATCTTTCTGTTGCAAATTCACATCACAATGGACTTGAAAATCGCCAAAAAAATTAAAGAGAAAAAAAAGAAAGAAAAACAAACGAATCCTCTCAAGACGACCAC
GAACAGGCGTTGCAAAAACGCGAATAGGCGAGTGACGACCGGCGAAAGCCATTGCAAAAACTTGCGAAGATCGACGGTAGATCGTGAACTGCAAACATATGCTTCTTTCT
GGGTTTTCTTCGTGGTTGCAGGTTACCGCACCTGTTACTGGTTAGCTTCGTTTCATTTCACCATGTCTTCAGCATCAACTGTCTCCCAATCTGTGTCACTTCCTGCTCCG
CCTACTTCCAATTCTGCTGCTAATGGTTCTTCAATTCCCAATTTGATCCCTGTGACTTCACCAGTTCTTCCCTCCCCGTCTTTCCACGTTCACCAACTACTGCCTGTAAC
TCCTATGGTACCTGGTCCACCGGGAATGACGCCGGCGCCGCCAGTTGTGTCCACAAGTCCGGCAGCTCTGTTTCCACCAAACGATTCTGCTTCCACTATCCTGGGACCCC
ATATGCATGCAACTCCTAGCTCAATTAATCCTTCTGCTCGTCCACAAATCTTTGGTTCCTATCCTTCTCTAACTCCTGTTGTTTCTCCGCCTCATGGGATCTGGTTTCAG
CCTCCTCAGTTGGGAGCCATGCCCAGGCCTCCCTTTCTGCCATACTCTGCTTCTTATCACGGCCCTCTTCCTTTTCCTGCCCGCGGAATGCCCCTTCCCTCTGTCCCATT
GCCCGATCCTCAACCTCCCGGTGTTACCCCTGTTCAAGTTGCACCTGCTATTGCTGTGCCAGCTGGTCATGGAAATCAGCTTAGTGGCAATTCATTGATTCAGACAGACT
CAAATCATCCTCAACTTGATAGCCAGAAACACGCTCAAGGTGTAGGCCAATCTGAGAACATTTCTTTAACTAAGCACTCCGAGGATTGGACTGCCCACAAGACTGAGGCA
GGAGTAATCTATTACTATAATGCCTTGACGGGAGAATCTACCTATGAAAAGCCTTCGGGTTTCAAAGAGGAGCCTGAAAATCTCATGTCGCAGCCAACATCAGTTTCAAT
GTCAAACTTGTCTGGTACAGATTGGGTTTTGGTTACTATGGGTGATGGTAAAAAGTACTACTATAACAACAAGACGAAGATTAGCAGTTGGCAAATCCCCAATGAAGTAT
CTGAATTGAGGCAACAAAATGATGAAAAACCAAAAGAACATTCCGCTCCGCTGCCAAATAATAATGCATTGACTGATCTAGGAACTTCCTCTATCAGTATCAATACTCCT
GCCATTAATACAGGTGGCCGTGAAGCCACGCCTCTTAGAACGGTAGGAATATCAGGGTCATCTTCTGCTCTGGATTTGATCAAGAAAAAATTGCAAGACTCTGGAACTCC
GGTTGCTTCTTCGCCTATTTCAGCTCCAACAATAGCTCAATCAGATGTAAATCTGCTGAGAGATGCTGATGCTACAGTTAAGGCACTGCAGCCTGAGAACAACAAGGGTA
AGCCAAAAGATACTGATGGTGATGGAAATATATCGGACTCCTCCTCGGACTCTGAGGATGTAGACAGTGGGCCAACTAATGAGCAGTTAATTATCCAGTTTAAGGAAATG
CTTAAGGAGCGAGGAGTGGCACCATTCTCTAAATGGGACAAGGAATTGCCGAAGATAGTTTTTGATCCCCGCTTTAAGACACGTGCTGAGGAGGAACGCAAGGAAAAAAG
AGCTGCTCAGAAGGCTGCAATAGAGGGATTTAAACAGTTACTGGATAGGGCATCTGAGGATATCGATCACACGACTAGTTATCAAACATTCAAAAAGAAATGGGGGAATG
ACCCGCGGTTTGAAGCTTTGGATCGTAAGGATCGGGAGAATTTATTGAACGAAAGGGTCCTTTGTCTGAAGAAGGCTGCCGATGAAAAGGCTCAAGCTTTATGGGCTGCT
TCCACCACTAGTTTCAAGTCCATGCTGCAGGAGAGAGGAGATATCAACGTCAATTCCCGTTGGTTCAGGGTAAAAGATAGTCTACGGGACGATCCAAGATACAGATCTGT
TAAGCATGAGGAGCGTGAGATGTTATTCAATGAGTACATATCTGAACTTAAGGCTATTGAGGAGGAAAAGCAGCACGAATCAAAAGCTAGAAAGGAGGAGCAGGAGAAAC
TGAAGGAAAGGGAGAGAGAATGGCGGAAACGGAAGGAAAGAGAAGAGCAAGAAATGGAAAGGGTCCGTCTAAAAGTACGAAAGAAAGAGGCAGTTGCATCTTTTCAAGCA
TTGCTTGTTGAATCGATCAAAGACCCCCAGGCCTCTTGGACTGAATCAAAAGTTAAATTGGAGAAGGATCCACAAGGGCGTGCATCCAATCCTGATTTAGATTCATCTGA
CACAGAAAAGTTATTCAGAGAACATGTAAAGATGCTGCAAGAGCGGTGTGCAAACGAGTTCAGAAGTCTATTATCTGAAGCCTTTACAGCCGAGGTGGTTGCTCAGGTGT
CTGAAGATGGTAAAACAGTTCTTAATTCATGGACGATGGCTAAACGAATTTTGAAGCCCGATCCCAGATATAGTAAAGTCCCAAGGAAGGAAAGGGAGGCAGTTTGGCGT
CGTTATGCTGATGATACATTGCGGAAGCAGAAGTTGGCAAATGATCATAAAGGAGAAAAACATAACGACTTTAAGAGTAGAGCAACCATTGACGCTGGAAAATTTCCGTC
TAAACCAAGAATCCATGAATGAAGAAAGTTGAAGCGTTTTTTCATCTCATTCAGTCTTTTTTTTTTTTTTTTTTT
Protein sequenceShow/hide protein sequence
MKDELTDGDGYFDHKHAGDPRSFMGVLPLVYTSSHFGVRMTFGLILLNLCGTDCVLVSMGDGKKFYYSNKTKVCYFLPPPSKFFSGWGTELRFKQQEKEAELVSNLYFFV
LYLFATPLPKNKALIDLETSPISINTPAIDTGGREATPLRMVGISGSSSALNLIKKKLQDSGTAVASSPISAPTIAQLNVNLHKDVDVAVKALRIENSKDKLKDANGDAN
VSDSEDVESGPTNEQSNIQFKEEDGADPKGIAEAEEDEEDFIMEEVKRRLKELRRNSFMVLIPEEEEEGGGGGEEDEEEEVGEGEPEWRDVEAEGRQWWGGFGAVYDDYC
KRMLFFDRMSIQSGPESTSQRSASKKSASPLRCLSLKRIEEPEDEMEDVDPSLTLIGSSHHIETAYVAHICLSWEALHCQYTQLNHLISCQPQNSTIHYNLTAQLFQQFQ
VLLQRFIENEPFQQDFRPAIYARTRQTFPKMLHVPNIQASDPNGVQEQESDLLILAPDLLLIIEASIFTFHRFLKMDKKTSNSASSFRNHTQDATLLARVRSSLDKKKTK
LKEVRKKSRGWKQKTWPQSYEDMQLLFGIVDIKIISRLVKMSRTTKEQLLWCEEKMNKLDVSNGKLQRDPSPLLFPYKYGNGPYYYYDESEFLGFSNSLCFSVPYAGPFN
YLIFLLQIHITMDLKIAKKIKEKKKKEKQTNPLKTTTNRRCKNANRRVTTGESHCKNLRRSTVDRELQTYASFWVFFVVAGYRTCYWLASFHFTMSSASTVSQSVSLPAP
PTSNSAANGSSIPNLIPVTSPVLPSPSFHVHQLLPVTPMVPGPPGMTPAPPVVSTSPAALFPPNDSASTILGPHMHATPSSINPSARPQIFGSYPSLTPVVSPPHGIWFQ
PPQLGAMPRPPFLPYSASYHGPLPFPARGMPLPSVPLPDPQPPGVTPVQVAPAIAVPAGHGNQLSGNSLIQTDSNHPQLDSQKHAQGVGQSENISLTKHSEDWTAHKTEA
GVIYYYNALTGESTYEKPSGFKEEPENLMSQPTSVSMSNLSGTDWVLVTMGDGKKYYYNNKTKISSWQIPNEVSELRQQNDEKPKEHSAPLPNNNALTDLGTSSISINTP
AINTGGREATPLRTVGISGSSSALDLIKKKLQDSGTPVASSPISAPTIAQSDVNLLRDADATVKALQPENNKGKPKDTDGDGNISDSSSDSEDVDSGPTNEQLIIQFKEM
LKERGVAPFSKWDKELPKIVFDPRFKTRAEEERKEKRAAQKAAIEGFKQLLDRASEDIDHTTSYQTFKKKWGNDPRFEALDRKDRENLLNERVLCLKKAADEKAQALWAA
STTSFKSMLQERGDINVNSRWFRVKDSLRDDPRYRSVKHEEREMLFNEYISELKAIEEEKQHESKARKEEQEKLKEREREWRKRKEREEQEMERVRLKVRKKEAVASFQA
LLVESIKDPQASWTESKVKLEKDPQGRASNPDLDSSDTEKLFREHVKMLQERCANEFRSLLSEAFTAEVVAQVSEDGKTVLNSWTMAKRILKPDPRYSKVPRKEREAVWR
RYADDTLRKQKLANDHKGEKHNDFKSRATIDAGKFPSKPRIHE