CuGenDBv2

ClCG05G018200 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG05G018200
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: TFIIS N-terminal domain-containing protein
Genome location: CG_Chr05:30504288..30507380
RNA-Seq Expression: ClCG05G018200
Synteny: ClCG05G018200
Gene Ontology terms: GO:0005634 - nucleus (cellular component)
InterPro domains: IPR017923 - Transcription factor IIS, N-terminal
                  IPR035441 - TFIIS/LEDGF domain superfamily
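
The genome location above uses a `chromosome:start..end` form. A minimal sketch of splitting it into its parts and deriving the span length follows; the function name `parse_location` is hypothetical, and the length assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(location: str):
    """Split 'chrom:start..end' into (chrom, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so the
    length is end - start + 1. The record itself does not say so.
    """
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return chrom, start, end, end - start + 1

chrom, start, end, length = parse_location("CG_Chr05:30504288..30507380")
print(chrom, start, end, length)
```

Under that inclusive-coordinate assumption the gene spans 3093 bp on CG_Chr05.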


Homology
GenBank top hits | e value | % identity | Alignment
KAA0063293.1 uncharacterized protein E6C27_scaffold205G001160 [Cucumis melo var. makuwa] | 0.0e+00 | 89.37
Query:  MMTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESII
        MMTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESII
Subjt:  MMTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESII

Query:  VLLQALKKLHITAEKSISSGILFTVKSLYENTDHGKSRFGKELSVLLDRWMQEINDKDLLHDAEKVGVHFDEENSNLARVAGRSSASGASVSVE-SSDGK
        VLLQAL+KLHITAEKSISSGILFTVK L+E+TDHGKSRFGKELSVLLDRWMQEINDKDLL DAE   VHFDEE  NL   AGRSS SGASVS E SSDG+
Subjt:  VLLQALKKLHITAEKSISSGILFTVKSLYENTDHGKSRFGKELSVLLDRWMQEINDKDLLHDAEKVGVHFDEENSNLARVAGRSSASGASVSVE-SSDGK

Query:  QTAEPVRDKVLSCGSSDALHPDEIEHSKVQSPRNELNSHSNSGNSVVKDRSPDLTTNSAVMLVPTEDVLKKEETSLCSVGGGTSVSVACSFPAAAREGSD
        QTAEP+ DK+LS GS DALHPD+IE SKVQSPRNEL+SHS SGNSVVKDRSPDLTTNSAVML P+EDVLKK+ETSLCSVGGG  VSV CSFP AAREGSD
Subjt:  QTAEPVRDKVLSCGSSDALHPDEIEHSKVQSPRNELNSHSNSGNSVVKDRSPDLTTNSAVMLVPTEDVLKKEETSLCSVGGGTSVSVACSFPAAAREGSD

Query:  NEQLAGGSKKLNESPELENQVSKIDGSCGRSCVTEKSDNSLHSPMQDPGTVLEGFDAANGEESAKEAPAEQDNDGLENAGVRQRSSSLDSERVSTLDSAS
        NEQLA G KK NESPELENQ +KIDGS GRSCVTEKSDNS HSPMQDPGTVLEGFDAA GEESAKEAPA+QDNDGL++AG  QRSSSLDSE+V TL+SAS
Subjt:  NEQLAGGSKKLNESPELENQVSKIDGSCGRSCVTEKSDNSLHSPMQDPGTVLEGFDAANGEESAKEAPAEQDNDGLENAGVRQRSSSLDSERVSTLDSAS

Query:  GISDKKTNYASISVFKPAGLDAERYGNTLRDLSMNGSRIGKLEERGASFSRMEDFGRVNGDRQRRRKEDDGGMTKSEFSKPKLNLKTS----NGSDMELE
        G+S+KKTNY S+ VFKP G+DA+RY +TLRD SMNGS IGK EERG SFSRMEDFG +  DRQRRRKEDDGGM  S FSKPKLN KTS    N SDMEL+
Subjt:  GISDKKTNYASISVFKPAGLDAERYGNTLRDLSMNGSRIGKLEERGASFSRMEDFGRVNGDRQRRRKEDDGGMTKSEFSKPKLNLKTS----NGSDMELE

Query:  YGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPDTMAEKQDLPADLQGREVQSAKSHVAESYSDAETCLIHPDNLDTQPENLNEME
        YGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPDTM EKQDLPADLQ REVQSAKSHVAESYSDAETCL HPDNLDTQPEN+NEME
Subjt:  YGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPDTMAEKQDLPADLQGREVQSAKSHVAESYSDAETCLIHPDNLDTQPENLNEME

Query:  SSMVTEAARGADASTEKGFCEFDLNQDVFNDDAEQLATPVSLPVSIISVSRPAASSGLPLTPLQFEGALGWRGSAATSAFRPASPRKVPDSDRTLSSGGN
        SSMVTEAARGADASTEKGFCE DLNQDVFNDDAEQ+ATPVS+PVS+ISVSRPAASSGLPLTPLQFEGALGWRGSAATSAFRPASPRKVPDSDRT SSGGN
Subjt:  SSMVTEAARGADASTEKGFCEFDLNQDVFNDDAEQLATPVSLPVSIISVSRPAASSGLPLTPLQFEGALGWRGSAATSAFRPASPRKVPDSDRTLSSGGN

Query:  SDSSKQRQDFLDIDLNVAETGEETRKQNLGSSFPQSGEFLVESGQRRSGGLKLDLNCVGDDVDTPASDLRMEGLFNNQNSYSASPACSSSSMQPLVRNID
        SDSSKQRQDFLDIDLNVAETGEETRKQNLGSSFPQ GEFLVESG RRSGGLKLDLNCVGDDV+ PASDLRM+GLFNNQNSYSASPACSSSSMQPLVRNID
Subjt:  SDSSKQRQDFLDIDLNVAETGEETRKQNLGSSFPQSGEFLVESGQRRSGGLKLDLNCVGDDVDTPASDLRMEGLFNNQNSYSASPACSSSSMQPLVRNID

Query:  LNDRPYVQGDAPDQGPGKYCQNATAYGRSNSDASVISIMGTRVEVSRKDFPFHASSLPNGRTVEPAGMGATLARTGDILGMSSAVSYHQTPFIGYNGLTP
        LNDRP+VQGDAPDQ PGKY QNA+AYG  NSDASVISIMGT+VEVSRKDFPFHASSLPNGRTVEP GMGATLARTGDILGMSSAVSYHQTPFIGYNGLTP
Subjt:  LNDRPYVQGDAPDQGPGKYCQNATAYGRSNSDASVISIMGTRVEVSRKDFPFHASSLPNGRTVEPAGMGATLARTGDILGMSSAVSYHQTPFIGYNGLTP

Query:  GPTISFSTMYEPSGSMPYMVDSRGAAVMPQFMGPMSAVPPSSYSHPPFIMGMADAQLTPNGIAHSRPKFDLNSGLSDSGGLKQLLFPSHLRPMEEQLRQP
        GPTISFSTMYEP GSMPYMVDSRGAAVMPQFMGPMSAVPPSSYSHPPFIMGM +AQLTPNGIAHSRPKFDLNSGLSDSGGLKQLLFP HLR +EEQLRQP
Subjt:  GPTISFSTMYEPSGSMPYMVDSRGAAVMPQFMGPMSAVPPSSYSHPPFIMGMADAQLTPNGIAHSRPKFDLNSGLSDSGGLKQLLFPSHLRPMEEQLRQP

Query:  SSSGVGAKRKEPDCPDSGWEGYLLSYKHQQPPWKQ
        SSSGVG KRKEPD PD GWE Y LSYKHQQPPWKQ
Subjt:  SSSGVGAKRKEPDCPDSGWEGYLLSYKHQQPPWKQ

TYK31482.1 uncharacterized protein E5676_scaffold455G007980 [Cucumis melo var. makuwa] | 0.0e+00 | 89.37
Query:  MMTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESII
        MMTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESII
Subjt:  MMTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESII

Query:  VLLQALKKLHITAEKSISSGILFTVKSLYENTDHGKSRFGKELSVLLDRWMQEINDKDLLHDAEKVGVHFDEENSNLARVAGRSSASGASVSVE-SSDGK
        VLLQAL+KLHITAEKSISSGILFTVK L+E+TDHGKSRFGKELSVLLDRWMQEINDKDLL DAE   VHFDEE  NL   AGRSS SGASVS E SSDG+
Subjt:  VLLQALKKLHITAEKSISSGILFTVKSLYENTDHGKSRFGKELSVLLDRWMQEINDKDLLHDAEKVGVHFDEENSNLARVAGRSSASGASVSVE-SSDGK

Query:  QTAEPVRDKVLSCGSSDALHPDEIEHSKVQSPRNELNSHSNSGNSVVKDRSPDLTTNSAVMLVPTEDVLKKEETSLCSVGGGTSVSVACSFPAAAREGSD
        QTAEP+ DK+LS GS DALHPD+IE SKVQSPRNEL+SHS SGNSVVKDRSPDLTTNSAVML P+EDVLKK+ETSLCSVGGG  VSV CSFP AAREGSD
Subjt:  QTAEPVRDKVLSCGSSDALHPDEIEHSKVQSPRNELNSHSNSGNSVVKDRSPDLTTNSAVMLVPTEDVLKKEETSLCSVGGGTSVSVACSFPAAAREGSD

Query:  NEQLAGGSKKLNESPELENQVSKIDGSCGRSCVTEKSDNSLHSPMQDPGTVLEGFDAANGEESAKEAPAEQDNDGLENAGVRQRSSSLDSERVSTLDSAS
        NEQLA G KK NESPELENQ +KIDGS GRSCVTEKSDNS HSPMQDPGTVLEGFDAA GEESAKEAPA+QDNDGL++AG  QRSSSLDSE+VSTL+SAS
Subjt:  NEQLAGGSKKLNESPELENQVSKIDGSCGRSCVTEKSDNSLHSPMQDPGTVLEGFDAANGEESAKEAPAEQDNDGLENAGVRQRSSSLDSERVSTLDSAS

Query:  GISDKKTNYASISVFKPAGLDAERYGNTLRDLSMNGSRIGKLEERGASFSRMEDFGRVNGDRQRRRKEDDGGMTKSEFSKPKLNLKTS----NGSDMELE
        G+S+KKTNY S+ VFKP G+DA+RY +TLRD SMNGS IGK EERG SFSRMEDFG +  DRQRRRKEDDGGM  S FSKPKLN KTS    N SDMEL+
Subjt:  GISDKKTNYASISVFKPAGLDAERYGNTLRDLSMNGSRIGKLEERGASFSRMEDFGRVNGDRQRRRKEDDGGMTKSEFSKPKLNLKTS----NGSDMELE

Query:  YGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPDTMAEKQDLPADLQGREVQSAKSHVAESYSDAETCLIHPDNLDTQPENLNEME
        YGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPDTM EKQDLPADLQ REVQSAKSHVAESYSDAETCL HPDNLDTQPEN+NEME
Subjt:  YGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPDTMAEKQDLPADLQGREVQSAKSHVAESYSDAETCLIHPDNLDTQPENLNEME

Query:  SSMVTEAARGADASTEKGFCEFDLNQDVFNDDAEQLATPVSLPVSIISVSRPAASSGLPLTPLQFEGALGWRGSAATSAFRPASPRKVPDSDRTLSSGGN
        SSMVTEAARGADASTEKGFCE DLNQDVFNDDAEQ+ATPVS+PVS+ISVSRPAASSGLPLTPLQFEGALGWRGSAATSAFRPASPRKVPDSDRT SSGGN
Subjt:  SSMVTEAARGADASTEKGFCEFDLNQDVFNDDAEQLATPVSLPVSIISVSRPAASSGLPLTPLQFEGALGWRGSAATSAFRPASPRKVPDSDRTLSSGGN

Query:  SDSSKQRQDFLDIDLNVAETGEETRKQNLGSSFPQSGEFLVESGQRRSGGLKLDLNCVGDDVDTPASDLRMEGLFNNQNSYSASPACSSSSMQPLVRNID
        SDSSKQRQDFLDIDLNVAETGEETRKQNLGSSFPQ GEFLVESG RRSGGLKLDLNCVGDDV+ PASDLRM+GLFNNQNSYSASPACSSSSMQPLVRNID
Subjt:  SDSSKQRQDFLDIDLNVAETGEETRKQNLGSSFPQSGEFLVESGQRRSGGLKLDLNCVGDDVDTPASDLRMEGLFNNQNSYSASPACSSSSMQPLVRNID

Query:  LNDRPYVQGDAPDQGPGKYCQNATAYGRSNSDASVISIMGTRVEVSRKDFPFHASSLPNGRTVEPAGMGATLARTGDILGMSSAVSYHQTPFIGYNGLTP
        LNDRP+VQGDAPDQ PGKY QNA+AYG  NSDASVISIMGT+VEVSRKDFPFHASSLPNGRTVEP GMGATLARTGDILGMSSAVSYHQTPFIGYNGLTP
Subjt:  LNDRPYVQGDAPDQGPGKYCQNATAYGRSNSDASVISIMGTRVEVSRKDFPFHASSLPNGRTVEPAGMGATLARTGDILGMSSAVSYHQTPFIGYNGLTP

Query:  GPTISFSTMYEPSGSMPYMVDSRGAAVMPQFMGPMSAVPPSSYSHPPFIMGMADAQLTPNGIAHSRPKFDLNSGLSDSGGLKQLLFPSHLRPMEEQLRQP
        GPTISFSTMYEP GSMPYMVDSRGAAVMPQFMGPMSAVPPSSYSHPPFIMGM +AQLTPNGIAHSRPKFDLNSGLSDSGGLKQLLFP HLR +EEQLRQP
Subjt:  GPTISFSTMYEPSGSMPYMVDSRGAAVMPQFMGPMSAVPPSSYSHPPFIMGMADAQLTPNGIAHSRPKFDLNSGLSDSGGLKQLLFPSHLRPMEEQLRQP

Query:  SSSGVGAKRKEPDCPDSGWEGYLLSYKHQQPPWKQ
        SSSGVG KRKEP+ PD GWE Y LSYKHQQPPWKQ
Subjt:  SSSGVGAKRKEPDCPDSGWEGYLLSYKHQQPPWKQ

XP_011652453.1 uncharacterized protein LOC101221601 [Cucumis sativus] | 0.0e+00 | 88.5
Query:  MMTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESII
        MMTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESII
Subjt:  MMTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESII

Query:  VLLQALKKLHITAEKSISSGILFTVKSLYENTDHGKSRFGKELSVLLDRWMQEINDKDLLHDAEKVGVHFDEENSNLARVAGRSSASGASVSVE-SSDGK
        VLLQAL+KLHITAEKSISSGILFTVK L+E+TDHGKSRFGKELSVLLDRWMQEINDKDLL DAE + VHFDEE  NL   AGRSS SGASVS E SSDG+
Subjt:  VLLQALKKLHITAEKSISSGILFTVKSLYENTDHGKSRFGKELSVLLDRWMQEINDKDLLHDAEKVGVHFDEENSNLARVAGRSSASGASVSVE-SSDGK

Query:  QTAEPVRDKVLSCGSSDALHPDEIEHSKVQSPRNELNSHSNSGNSVVKDRSPDLTTNSAVMLVPTEDVLKKEETSLCSVGGGTSVSVACSFPAAAREGSD
        QTAEPV DK+LS G+ DAL+PD+IE SKVQSP NELNSHS SGNSVVKDRSPDLT NS VML P+EDVLKK+ETSLCSVGGG  +SV CSFP A REG+D
Subjt:  QTAEPVRDKVLSCGSSDALHPDEIEHSKVQSPRNELNSHSNSGNSVVKDRSPDLTTNSAVMLVPTEDVLKKEETSLCSVGGGTSVSVACSFPAAAREGSD

Query:  NEQLAGGSKKLNESPELENQVSKIDGSCGRSCVTEKSDNSLHSPMQDPGTVLEGFDAANGEESAKEAPAEQDNDGLENAGVRQRSSSLDSERVSTLDSAS
        NEQLA G KK +ES E ENQV+KIDGS GRSCVTEKSD S HSPMQDPGTVLEGFDAA GEESAKEAPA+QDNDGL++AG  QRSSSLDSERVSTL+SAS
Subjt:  NEQLAGGSKKLNESPELENQVSKIDGSCGRSCVTEKSDNSLHSPMQDPGTVLEGFDAANGEESAKEAPAEQDNDGLENAGVRQRSSSLDSERVSTLDSAS

Query:  GISDKKTNYASISVFKPAGLDAERYGNTLRDLSMNGSRIGKLEERGASFSRMEDFGRVNGDRQRRRKEDDGGMTKSEFSKPKLNLKTS----NGSDMELE
        G+SDKKTNY S+ VFKP G DA+RY +T RDLSMNGS IGKLE+RG SFSRMEDFG +  DRQRRRKEDD GM  S FSKPKLN KTS    N SDMEL+
Subjt:  GISDKKTNYASISVFKPAGLDAERYGNTLRDLSMNGSRIGKLEERGASFSRMEDFGRVNGDRQRRRKEDDGGMTKSEFSKPKLNLKTS----NGSDMELE

Query:  YGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPDTMAEKQDLPADLQGREVQSAKSHVAESYSDAETCLIHPDNLDTQPENLNEME
        YGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPD+M EKQDLPADLQ REVQSAKSH+AESYSDAETCL  PDNLDTQPENLNEME
Subjt:  YGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPDTMAEKQDLPADLQGREVQSAKSHVAESYSDAETCLIHPDNLDTQPENLNEME

Query:  SSMVTEAARGADASTEKGFCEFDLNQDVFNDDAEQLATPVSLPVSIISVSRPAASSGLPLTPLQFEGALGWRGSAATSAFRPASPRKVPDSDRTLSSGGN
        SSMVTEAARGADAST K +CE DLNQDVFNDDAEQ+ATPVS+PVS+ISVSRPAASSGLPLTPLQFEGALGWRGSAATSAFRPASPRKVPDSDRT SSGGN
Subjt:  SSMVTEAARGADASTEKGFCEFDLNQDVFNDDAEQLATPVSLPVSIISVSRPAASSGLPLTPLQFEGALGWRGSAATSAFRPASPRKVPDSDRTLSSGGN

Query:  SDSSKQRQDFLDIDLNVAETGEETRKQNLGSSFPQSGEFLVESGQRRSGGLKLDLNCVGDDVDTPASDLRMEGLFNNQNSYSASPACSSSSMQPLVRNID
        SDSSKQRQDFLDIDLNVAETGEETRKQNLGSSFPQ GEFLVESG RRSGGLKLDLNCVGDDVD PASDLR+EGLFNNQNSYSASPACSSSSMQPLVRNID
Subjt:  SDSSKQRQDFLDIDLNVAETGEETRKQNLGSSFPQSGEFLVESGQRRSGGLKLDLNCVGDDVDTPASDLRMEGLFNNQNSYSASPACSSSSMQPLVRNID

Query:  LNDRPYVQGDAPDQGPGKYCQNATAYGRSNSDASVISIMGTRVEVSRKDFPFHASSLPNGRTVEPAGMGATLARTGDILGMSSAVSYHQTPFIGYNGLTP
        LNDRPYVQGDAPDQGPGKY QNA+AYGR NSDASVISIMGT+VEVSRKDFPFHAS LPNGRTVEP GMGATLARTGDILGMSSAVSYHQTPFIGYNGLTP
Subjt:  LNDRPYVQGDAPDQGPGKYCQNATAYGRSNSDASVISIMGTRVEVSRKDFPFHASSLPNGRTVEPAGMGATLARTGDILGMSSAVSYHQTPFIGYNGLTP

Query:  GPTISFSTMYEPSGSMPYMVDSRGAAVMPQFMGPMSAVPPSSYSHPPFIMGMADAQLTPNGIAHSRPKFDLNSGLSDSGGLKQLLFPSHLRPMEEQLRQP
        GPTISFSTMYEP GSMPYMVDSRGAAVMPQFMGPMSAVPPSSYSHPPFI  MADAQLTPNGIAHSRPKFDLNSGLSDSGGLKQLLFP HLR +EEQLRQP
Subjt:  GPTISFSTMYEPSGSMPYMVDSRGAAVMPQFMGPMSAVPPSSYSHPPFIMGMADAQLTPNGIAHSRPKFDLNSGLSDSGGLKQLLFPSHLRPMEEQLRQP

Query:  SSSGVGAKRKEPDCPDSGWEGYLLSYKHQQPPWKQ
        SSSGVG KRKEPD PD GWE Y LSYKHQQPPWKQ
Subjt:  SSSGVGAKRKEPDCPDSGWEGYLLSYKHQQPPWKQ

XP_016903548.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103503867 [Cucumis melo] | 0.0e+00 | 89.28
Query:  MMTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESII
        MMTLEDFFTLTEIK GLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESII
Subjt:  MMTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESII

Query:  VLLQALKKLHITAEKSISSGILFTVKSLYENTDHGKSRFGKELSVLLDRWMQEINDKDLLHDAEKVGVHFDEENSNLARVAGRSSASGASVSVE-SSDGK
        VLLQAL+KLHITAEKSISSGILFTVK L+E+TDHGKSRFGKELSVLLDRWMQEINDKDLL DAE   VHFDEE  NL   AGRSS SGASVS E SSDG+
Subjt:  VLLQALKKLHITAEKSISSGILFTVKSLYENTDHGKSRFGKELSVLLDRWMQEINDKDLLHDAEKVGVHFDEENSNLARVAGRSSASGASVSVE-SSDGK

Query:  QTAEPVRDKVLSCGSSDALHPDEIEHSKVQSPRNELNSHSNSGNSVVKDRSPDLTTNSAVMLVPTEDVLKKEETSLCSVGGGTSVSVACSFPAAAREGSD
        QTAEP+ DK+LS GS DALHPD+IE SKVQSPRNEL+SHS SGNSVVKDRSPDLTTNSAVML P+EDVLKK+ETSLCSVGGG  VSV CSFP AAREGSD
Subjt:  QTAEPVRDKVLSCGSSDALHPDEIEHSKVQSPRNELNSHSNSGNSVVKDRSPDLTTNSAVMLVPTEDVLKKEETSLCSVGGGTSVSVACSFPAAAREGSD

Query:  NEQLAGGSKKLNESPELENQVSKIDGSCGRSCVTEKSDNSLHSPMQDPGTVLEGFDAANGEESAKEAPAEQDNDGLENAGVRQRSSSLDSERVSTLDSAS
        NEQLA G KK NESPELENQ +KIDGS GRSCVTEKSDNS HSPMQDPGTVLEGFDAA GEESAKEAPA+QDNDGL++AG  QRSSSLDSE+VSTL+SAS
Subjt:  NEQLAGGSKKLNESPELENQVSKIDGSCGRSCVTEKSDNSLHSPMQDPGTVLEGFDAANGEESAKEAPAEQDNDGLENAGVRQRSSSLDSERVSTLDSAS

Query:  GISDKKTNYASISVFKPAGLDAERYGNTLRDLSMNGSRIGKLEERGASFSRMEDFGRVNGDRQRRRKEDDGGMTKSEFSKPKLNLKTS----NGSDMELE
        G+S+KKTNY S+ VFKP G+DA+RY +TLRD SMNGS IGK EERG SFSRMEDFG +  DRQRRRKEDDGGM  S FSKPKLN KTS    N SDMEL+
Subjt:  GISDKKTNYASISVFKPAGLDAERYGNTLRDLSMNGSRIGKLEERGASFSRMEDFGRVNGDRQRRRKEDDGGMTKSEFSKPKLNLKTS----NGSDMELE

Query:  YGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPDTMAEKQDLPADLQGREVQSAKSHVAESYSDAETCLIHPDNLDTQPENLNEME
        YGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPDTM EKQDLPADLQ REVQSAKSHVAESYSDAETCL HPDNLDTQPEN+NEME
Subjt:  YGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPDTMAEKQDLPADLQGREVQSAKSHVAESYSDAETCLIHPDNLDTQPENLNEME

Query:  SSMVTEAARGADASTEKGFCEFDLNQDVFNDDAEQLATPVSLPVSIISVSRPAASSGLPLTPLQFEGALGWRGSAATSAFRPASPRKVPDSDRTLSSGGN
        SSMVTEAARGADASTEKGFCE DLNQDVFNDDAEQ+ATPVS+PVS+ISVSRPAASSGLPLTPLQFEGALGWRGSAATSAFRPASPRKVPDSDRT SSGGN
Subjt:  SSMVTEAARGADASTEKGFCEFDLNQDVFNDDAEQLATPVSLPVSIISVSRPAASSGLPLTPLQFEGALGWRGSAATSAFRPASPRKVPDSDRTLSSGGN

Query:  SDSSKQRQDFLDIDLNVAETGEETRKQNLGSSFPQSGEFLVESGQRRSGGLKLDLNCVGDDVDTPASDLRMEGLFNNQNSYSASPACSSSSMQPLVRNID
        SDSSKQRQDFLDIDLNVAETGEETRKQNLGSSFPQ GEFLVESG RRSGGLKLDLNCVGDDV+ PASDLRM+GLFNNQNSYSASPACSSSSMQPLVRNID
Subjt:  SDSSKQRQDFLDIDLNVAETGEETRKQNLGSSFPQSGEFLVESGQRRSGGLKLDLNCVGDDVDTPASDLRMEGLFNNQNSYSASPACSSSSMQPLVRNID

Query:  LNDRPYVQGDAPDQGPGKYCQNATAYGRSNSDASVISIMGTRVEVSRKDFPFHASSLPNGRTVEPAGMGATLARTGDILGMSSAVSYHQTPFIGYNGLTP
        LNDRP+VQGDAPDQ PGKY QNA+AYG  NSDASVISIMGT+VEVSRKDFPFHASSLPNGRTVEP GMGATLARTGDILGMSSAVSYHQTPFIGYNGLTP
Subjt:  LNDRPYVQGDAPDQGPGKYCQNATAYGRSNSDASVISIMGTRVEVSRKDFPFHASSLPNGRTVEPAGMGATLARTGDILGMSSAVSYHQTPFIGYNGLTP

Query:  GPTISFSTMYEPSGSMPYMVDSRGAAVMPQFMGPMSAVPPSSYSHPPFIMGMADAQLTPNGIAHSRPKFDLNSGLSDSGGLKQLLFPSHLRPMEEQLRQP
        GPTISFSTMYEP GSMPYMVDSRGAAVMPQFMGPMSAVPPSSYSHPPFIMGM +AQLTPNGIAHSRPKFDLNSGLSDSGGLKQLLFP HLR +EEQLRQP
Subjt:  GPTISFSTMYEPSGSMPYMVDSRGAAVMPQFMGPMSAVPPSSYSHPPFIMGMADAQLTPNGIAHSRPKFDLNSGLSDSGGLKQLLFPSHLRPMEEQLRQP

Query:  SSSGVGAKRKEPDCPDSGWEGYLLSYKHQQPPWKQ
        SSSGVG KRKEP+ PD GWE Y LSYKHQQPPWKQ
Subjt:  SSSGVGAKRKEPDCPDSGWEGYLLSYKHQQPPWKQ

XP_038899939.1 uncharacterized protein LOC120087121 [Benincasa hispida] | 0.0e+00 | 91.6
Query:  MMTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESII
        MMTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDT+DSTVEESII
Subjt:  MMTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESII

Query:  VLLQALKKLHITAEKSISSGILFTVKSLYENTDHGKSRFGKELSVLLDRWMQEINDKDLLHDAEKVGVHFDEENSNLARVAGRSSASGASVSVE-SSDGK
        VLLQALKKLHITAEKSISSGILFT+K LYENTDHGKSRFGKELSVLLDRWMQEINDKDLL DAE + VH+DEENSNLAR AGRSSASGASVS E SSDGK
Subjt:  VLLQALKKLHITAEKSISSGILFTVKSLYENTDHGKSRFGKELSVLLDRWMQEINDKDLLHDAEKVGVHFDEENSNLARVAGRSSASGASVSVE-SSDGK

Query:  QTAEPVRDKVLSCGSSDALHPDEIEHSKVQSPRNELNSHSNSGNSVVKDRSPDLTTNSAVMLVPTEDVLKKEETSLCSVGGGTSVSVACSFPAAAREGSD
        QTAEPV DK+LS GSSDALH D+IE SKVQSPRNELNSHS SGNSVV+DRSPDL TN AVML P EDVLKK+ETSLCSVGGGTSVSVA    +AAREGSD
Subjt:  QTAEPVRDKVLSCGSSDALHPDEIEHSKVQSPRNELNSHSNSGNSVVKDRSPDLTTNSAVMLVPTEDVLKKEETSLCSVGGGTSVSVACSFPAAAREGSD

Query:  NEQLAGGSKKLNESPELENQVSKIDGSCGRSCVTEKSDNSLHSPMQDPGTVLEGFDAANGEESAKEAPAEQDNDGLENAGVRQRSSSLDSERVSTLDSAS
        NEQLA G KK NESPELEN V+KIDGS GRSCVTEKSDNS HSPMQDPG+VLEGFDAANGEESAKEAPA+QDNDGL+NAG RQRSSSLDSERVSTLDSAS
Subjt:  NEQLAGGSKKLNESPELENQVSKIDGSCGRSCVTEKSDNSLHSPMQDPGTVLEGFDAANGEESAKEAPAEQDNDGLENAGVRQRSSSLDSERVSTLDSAS

Query:  GISDKKTNYASISVFKPAGLDAERYGNTLRDLSMNGSRIGKLEERGASFSRMEDFGRVNGDRQRRRKEDDGGMTKSEFSKPKLNLKTS----NGSDMELE
        GISDKKTNYAS++VFKPAGLD +RY +TLRDLSMNGS IGK EERGASFSRMEDFG+V GDRQRRRKEDDGG+T S FSKPKLN KTS    N SDMELE
Subjt:  GISDKKTNYASISVFKPAGLDAERYGNTLRDLSMNGSRIGKLEERGASFSRMEDFGRVNGDRQRRRKEDDGGMTKSEFSKPKLNLKTS----NGSDMELE

Query:  YGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPDTMAEKQDLPADLQGREVQSAKSHVAESYSDAETCLIHPDNLDTQPENLNEME
        YGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPD+++EKQDLPADLQGREVQSAKSHVAESYSDAETCL HPDNLDTQPENLNEME
Subjt:  YGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPDTMAEKQDLPADLQGREVQSAKSHVAESYSDAETCLIHPDNLDTQPENLNEME

Query:  SSMVTEAARGADASTEKGFCEFDLNQDVFNDDAEQLATPVSLPVSIISVSRPAASSGLPLTPLQFEGALGWRGSAATSAFRPASPRKVPDSDRTLSSGGN
        SSMVTEAARGA+ASTEKGFCEFDLNQDVFNDDAEQLATPVSLPVS+ISVSRPAASSGLP+TPLQFEGALGWRGSAATSAFRPASPRKVPDSDRTLSSGGN
Subjt:  SSMVTEAARGADASTEKGFCEFDLNQDVFNDDAEQLATPVSLPVSIISVSRPAASSGLPLTPLQFEGALGWRGSAATSAFRPASPRKVPDSDRTLSSGGN

Query:  SDSSKQRQDFLDIDLNVAETGEETRKQNLGSSFPQSGEFLVESGQRRSGGLKLDLNCVGDDVDTPASDLRMEGLFNNQNSYSASPACSSSSMQPLVRNID
        SDSSKQRQDFLDIDLNVAETGEETRKQ +GSSFPQSGEFLVESG RRSGGLKLDLNCVGDDVD PASDLR+EGLFNNQNSYSASPACSSSSMQPLVRNID
Subjt:  SDSSKQRQDFLDIDLNVAETGEETRKQNLGSSFPQSGEFLVESGQRRSGGLKLDLNCVGDDVDTPASDLRMEGLFNNQNSYSASPACSSSSMQPLVRNID

Query:  LNDRPYVQGDAPDQGPGKYCQNATAYGRSNSDASVISIMGTRVEVSRKDFPFHASSLPNGRTVEPAGMGATLARTGDILGMSSAVSYHQTPFIGYNGLTP
        LNDRPYVQGDAPDQGPGKY QNATAYGR NSDASVISIMGTRVEV RKDFPFHASSLPNGRTVEP GMGATLARTGDILGM+SAVS+HQTPFIGYNGLTP
Subjt:  LNDRPYVQGDAPDQGPGKYCQNATAYGRSNSDASVISIMGTRVEVSRKDFPFHASSLPNGRTVEPAGMGATLARTGDILGMSSAVSYHQTPFIGYNGLTP

Query:  GPTISFSTMYEPSGSMPYMVDSRGAAVMPQFMGPMSAVPPSSYSHPPFIMGMADAQLTPNG-IAHSRPKFDLNSGLSDSGGLKQLLFPSHLRPMEEQLRQ
        GPTISFSTMYEPSGSMPYMVDSRG AVMPQ+MGPMSAVPPSSYSHPPFIMGMADAQLTPNG +AHSRPKFDLNSGLSDSGGLKQLLFP HLR +EEQLRQ
Subjt:  GPTISFSTMYEPSGSMPYMVDSRGAAVMPQFMGPMSAVPPSSYSHPPFIMGMADAQLTPNG-IAHSRPKFDLNSGLSDSGGLKQLLFPSHLRPMEEQLRQ

Query:  PSSSGVGAKRKEPDCPDSGWEGYLLSYKHQQPPWKQ
        PSSSGVG+KRKEPD PD GWEGYLLSYKHQQPPWKQ
Subjt:  PSSSGVGAKRKEPDCPDSGWEGYLLSYKHQQPPWKQ
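
In the alignment blocks above, the unlabeled middle line shows a letter where query and subject carry the same residue, `+` for a similar residue, and a blank for a mismatch or gap. A rough sketch of estimating percent identity from those lines follows. The function name `percent_identity` and the 8-character label width are assumptions based on the layout shown here, and the result need not match the % identity column exactly, since the database may count alignment columns differently.

```python
def percent_identity(blocks, prefix_len=8):
    """Estimate percent identity from (query_line, midline) pairs.

    Assumptions: each line starts with an 8-character label such as
    'Query:  ', letters in the midline mark identical residues, and
    gap characters ('-') in the query still count as alignment columns.
    """
    identical = 0
    columns = 0
    for query_line, midline in blocks:
        query = query_line[prefix_len:]
        # Trailing mismatch columns may have been stripped as whitespace,
        # so pad the midline back out to the query width.
        mid = midline[prefix_len:].ljust(len(query))[:len(query)]
        identical += sum(1 for ch in mid if ch.isalpha())
        columns += len(query)
    if columns == 0:
        raise ValueError("no alignment columns found")
    return 100.0 * identical / columns

# Toy input, not taken from the record: 6 identical columns out of 9.
toy_blocks = [
    ("Query:  ACDEFG", "        AC+E G"),
    ("Query:  HIK", "        HI"),
]
print(round(percent_identity(toy_blocks), 2))
```

Passing every `Query:` line of one hit together with its midline gives a single estimate for that hit.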

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LGI6 TFIIS N-terminal domain-containing protein | 0.0e+00 | 88.5
Query:  MMTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESII
        MMTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESII
Subjt:  MMTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESII

Query:  VLLQALKKLHITAEKSISSGILFTVKSLYENTDHGKSRFGKELSVLLDRWMQEINDKDLLHDAEKVGVHFDEENSNLARVAGRSSASGASVSVE-SSDGK
        VLLQAL+KLHITAEKSISSGILFTVK L+E+TDHGKSRFGKELSVLLDRWMQEINDKDLL DAE + VHFDEE  NL   AGRSS SGASVS E SSDG+
Subjt:  VLLQALKKLHITAEKSISSGILFTVKSLYENTDHGKSRFGKELSVLLDRWMQEINDKDLLHDAEKVGVHFDEENSNLARVAGRSSASGASVSVE-SSDGK

Query:  QTAEPVRDKVLSCGSSDALHPDEIEHSKVQSPRNELNSHSNSGNSVVKDRSPDLTTNSAVMLVPTEDVLKKEETSLCSVGGGTSVSVACSFPAAAREGSD
        QTAEPV DK+LS G+ DAL+PD+IE SKVQSP NELNSHS SGNSVVKDRSPDLT NS VML P+EDVLKK+ETSLCSVGGG  +SV CSFP A REG+D
Subjt:  QTAEPVRDKVLSCGSSDALHPDEIEHSKVQSPRNELNSHSNSGNSVVKDRSPDLTTNSAVMLVPTEDVLKKEETSLCSVGGGTSVSVACSFPAAAREGSD

Query:  NEQLAGGSKKLNESPELENQVSKIDGSCGRSCVTEKSDNSLHSPMQDPGTVLEGFDAANGEESAKEAPAEQDNDGLENAGVRQRSSSLDSERVSTLDSAS
        NEQLA G KK +ES E ENQV+KIDGS GRSCVTEKSD S HSPMQDPGTVLEGFDAA GEESAKEAPA+QDNDGL++AG  QRSSSLDSERVSTL+SAS
Subjt:  NEQLAGGSKKLNESPELENQVSKIDGSCGRSCVTEKSDNSLHSPMQDPGTVLEGFDAANGEESAKEAPAEQDNDGLENAGVRQRSSSLDSERVSTLDSAS

Query:  GISDKKTNYASISVFKPAGLDAERYGNTLRDLSMNGSRIGKLEERGASFSRMEDFGRVNGDRQRRRKEDDGGMTKSEFSKPKLNLKTS----NGSDMELE
        G+SDKKTNY S+ VFKP G DA+RY +T RDLSMNGS IGKLE+RG SFSRMEDFG +  DRQRRRKEDD GM  S FSKPKLN KTS    N SDMEL+
Subjt:  GISDKKTNYASISVFKPAGLDAERYGNTLRDLSMNGSRIGKLEERGASFSRMEDFGRVNGDRQRRRKEDDGGMTKSEFSKPKLNLKTS----NGSDMELE

Query:  YGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPDTMAEKQDLPADLQGREVQSAKSHVAESYSDAETCLIHPDNLDTQPENLNEME
        YGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPD+M EKQDLPADLQ REVQSAKSH+AESYSDAETCL  PDNLDTQPENLNEME
Subjt:  YGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPDTMAEKQDLPADLQGREVQSAKSHVAESYSDAETCLIHPDNLDTQPENLNEME

Query:  SSMVTEAARGADASTEKGFCEFDLNQDVFNDDAEQLATPVSLPVSIISVSRPAASSGLPLTPLQFEGALGWRGSAATSAFRPASPRKVPDSDRTLSSGGN
        SSMVTEAARGADAST K +CE DLNQDVFNDDAEQ+ATPVS+PVS+ISVSRPAASSGLPLTPLQFEGALGWRGSAATSAFRPASPRKVPDSDRT SSGGN
Subjt:  SSMVTEAARGADASTEKGFCEFDLNQDVFNDDAEQLATPVSLPVSIISVSRPAASSGLPLTPLQFEGALGWRGSAATSAFRPASPRKVPDSDRTLSSGGN

Query:  SDSSKQRQDFLDIDLNVAETGEETRKQNLGSSFPQSGEFLVESGQRRSGGLKLDLNCVGDDVDTPASDLRMEGLFNNQNSYSASPACSSSSMQPLVRNID
        SDSSKQRQDFLDIDLNVAETGEETRKQNLGSSFPQ GEFLVESG RRSGGLKLDLNCVGDDVD PASDLR+EGLFNNQNSYSASPACSSSSMQPLVRNID
Subjt:  SDSSKQRQDFLDIDLNVAETGEETRKQNLGSSFPQSGEFLVESGQRRSGGLKLDLNCVGDDVDTPASDLRMEGLFNNQNSYSASPACSSSSMQPLVRNID

Query:  LNDRPYVQGDAPDQGPGKYCQNATAYGRSNSDASVISIMGTRVEVSRKDFPFHASSLPNGRTVEPAGMGATLARTGDILGMSSAVSYHQTPFIGYNGLTP
        LNDRPYVQGDAPDQGPGKY QNA+AYGR NSDASVISIMGT+VEVSRKDFPFHAS LPNGRTVEP GMGATLARTGDILGMSSAVSYHQTPFIGYNGLTP
Subjt:  LNDRPYVQGDAPDQGPGKYCQNATAYGRSNSDASVISIMGTRVEVSRKDFPFHASSLPNGRTVEPAGMGATLARTGDILGMSSAVSYHQTPFIGYNGLTP

Query:  GPTISFSTMYEPSGSMPYMVDSRGAAVMPQFMGPMSAVPPSSYSHPPFIMGMADAQLTPNGIAHSRPKFDLNSGLSDSGGLKQLLFPSHLRPMEEQLRQP
        GPTISFSTMYEP GSMPYMVDSRGAAVMPQFMGPMSAVPPSSYSHPPFI  MADAQLTPNGIAHSRPKFDLNSGLSDSGGLKQLLFP HLR +EEQLRQP
Subjt:  GPTISFSTMYEPSGSMPYMVDSRGAAVMPQFMGPMSAVPPSSYSHPPFIMGMADAQLTPNGIAHSRPKFDLNSGLSDSGGLKQLLFPSHLRPMEEQLRQP

Query:  SSSGVGAKRKEPDCPDSGWEGYLLSYKHQQPPWKQ
        SSSGVG KRKEPD PD GWE Y LSYKHQQPPWKQ
Subjt:  SSSGVGAKRKEPDCPDSGWEGYLLSYKHQQPPWKQ

A0A1S4E5P4 LOW QUALITY PROTEIN: uncharacterized protein LOC103503867 | 0.0e+00 | 89.28
Query:  MMTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESII
        MMTLEDFFTLTEIK GLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESII
Subjt:  MMTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESII

Query:  VLLQALKKLHITAEKSISSGILFTVKSLYENTDHGKSRFGKELSVLLDRWMQEINDKDLLHDAEKVGVHFDEENSNLARVAGRSSASGASVSVE-SSDGK
        VLLQAL+KLHITAEKSISSGILFTVK L+E+TDHGKSRFGKELSVLLDRWMQEINDKDLL DAE   VHFDEE  NL   AGRSS SGASVS E SSDG+
Subjt:  VLLQALKKLHITAEKSISSGILFTVKSLYENTDHGKSRFGKELSVLLDRWMQEINDKDLLHDAEKVGVHFDEENSNLARVAGRSSASGASVSVE-SSDGK

Query:  QTAEPVRDKVLSCGSSDALHPDEIEHSKVQSPRNELNSHSNSGNSVVKDRSPDLTTNSAVMLVPTEDVLKKEETSLCSVGGGTSVSVACSFPAAAREGSD
        QTAEP+ DK+LS GS DALHPD+IE SKVQSPRNEL+SHS SGNSVVKDRSPDLTTNSAVML P+EDVLKK+ETSLCSVGGG  VSV CSFP AAREGSD
Subjt:  QTAEPVRDKVLSCGSSDALHPDEIEHSKVQSPRNELNSHSNSGNSVVKDRSPDLTTNSAVMLVPTEDVLKKEETSLCSVGGGTSVSVACSFPAAAREGSD

Query:  NEQLAGGSKKLNESPELENQVSKIDGSCGRSCVTEKSDNSLHSPMQDPGTVLEGFDAANGEESAKEAPAEQDNDGLENAGVRQRSSSLDSERVSTLDSAS
        NEQLA G KK NESPELENQ +KIDGS GRSCVTEKSDNS HSPMQDPGTVLEGFDAA GEESAKEAPA+QDNDGL++AG  QRSSSLDSE+VSTL+SAS
Subjt:  NEQLAGGSKKLNESPELENQVSKIDGSCGRSCVTEKSDNSLHSPMQDPGTVLEGFDAANGEESAKEAPAEQDNDGLENAGVRQRSSSLDSERVSTLDSAS

Query:  GISDKKTNYASISVFKPAGLDAERYGNTLRDLSMNGSRIGKLEERGASFSRMEDFGRVNGDRQRRRKEDDGGMTKSEFSKPKLNLKTS----NGSDMELE
        G+S+KKTNY S+ VFKP G+DA+RY +TLRD SMNGS IGK EERG SFSRMEDFG +  DRQRRRKEDDGGM  S FSKPKLN KTS    N SDMEL+
Subjt:  GISDKKTNYASISVFKPAGLDAERYGNTLRDLSMNGSRIGKLEERGASFSRMEDFGRVNGDRQRRRKEDDGGMTKSEFSKPKLNLKTS----NGSDMELE

Query:  YGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPDTMAEKQDLPADLQGREVQSAKSHVAESYSDAETCLIHPDNLDTQPENLNEME
        YGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPDTM EKQDLPADLQ REVQSAKSHVAESYSDAETCL HPDNLDTQPEN+NEME
Subjt:  YGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPDTMAEKQDLPADLQGREVQSAKSHVAESYSDAETCLIHPDNLDTQPENLNEME

Query:  SSMVTEAARGADASTEKGFCEFDLNQDVFNDDAEQLATPVSLPVSIISVSRPAASSGLPLTPLQFEGALGWRGSAATSAFRPASPRKVPDSDRTLSSGGN
        SSMVTEAARGADASTEKGFCE DLNQDVFNDDAEQ+ATPVS+PVS+ISVSRPAASSGLPLTPLQFEGALGWRGSAATSAFRPASPRKVPDSDRT SSGGN
Subjt:  SSMVTEAARGADASTEKGFCEFDLNQDVFNDDAEQLATPVSLPVSIISVSRPAASSGLPLTPLQFEGALGWRGSAATSAFRPASPRKVPDSDRTLSSGGN

Query:  SDSSKQRQDFLDIDLNVAETGEETRKQNLGSSFPQSGEFLVESGQRRSGGLKLDLNCVGDDVDTPASDLRMEGLFNNQNSYSASPACSSSSMQPLVRNID
        SDSSKQRQDFLDIDLNVAETGEETRKQNLGSSFPQ GEFLVESG RRSGGLKLDLNCVGDDV+ PASDLRM+GLFNNQNSYSASPACSSSSMQPLVRNID
Subjt:  SDSSKQRQDFLDIDLNVAETGEETRKQNLGSSFPQSGEFLVESGQRRSGGLKLDLNCVGDDVDTPASDLRMEGLFNNQNSYSASPACSSSSMQPLVRNID

Query:  LNDRPYVQGDAPDQGPGKYCQNATAYGRSNSDASVISIMGTRVEVSRKDFPFHASSLPNGRTVEPAGMGATLARTGDILGMSSAVSYHQTPFIGYNGLTP
        LNDRP+VQGDAPDQ PGKY QNA+AYG  NSDASVISIMGT+VEVSRKDFPFHASSLPNGRTVEP GMGATLARTGDILGMSSAVSYHQTPFIGYNGLTP
Subjt:  LNDRPYVQGDAPDQGPGKYCQNATAYGRSNSDASVISIMGTRVEVSRKDFPFHASSLPNGRTVEPAGMGATLARTGDILGMSSAVSYHQTPFIGYNGLTP

Query:  GPTISFSTMYEPSGSMPYMVDSRGAAVMPQFMGPMSAVPPSSYSHPPFIMGMADAQLTPNGIAHSRPKFDLNSGLSDSGGLKQLLFPSHLRPMEEQLRQP
        GPTISFSTMYEP GSMPYMVDSRGAAVMPQFMGPMSAVPPSSYSHPPFIMGM +AQLTPNGIAHSRPKFDLNSGLSDSGGLKQLLFP HLR +EEQLRQP
Subjt:  GPTISFSTMYEPSGSMPYMVDSRGAAVMPQFMGPMSAVPPSSYSHPPFIMGMADAQLTPNGIAHSRPKFDLNSGLSDSGGLKQLLFPSHLRPMEEQLRQP

Query:  SSSGVGAKRKEPDCPDSGWEGYLLSYKHQQPPWKQ
        SSSGVG KRKEP+ PD GWE Y LSYKHQQPPWKQ
Subjt:  SSSGVGAKRKEPDCPDSGWEGYLLSYKHQQPPWKQ

A0A5A7V5D2 TFIIS N-terminal domain-containing protein | 0.0e+00 | 89.37
Query:  MMTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESII
        MMTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESII
Subjt:  MMTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESII

Query:  VLLQALKKLHITAEKSISSGILFTVKSLYENTDHGKSRFGKELSVLLDRWMQEINDKDLLHDAEKVGVHFDEENSNLARVAGRSSASGASVSVE-SSDGK
        VLLQAL+KLHITAEKSISSGILFTVK L+E+TDHGKSRFGKELSVLLDRWMQEINDKDLL DAE   VHFDEE  NL   AGRSS SGASVS E SSDG+
Subjt:  VLLQALKKLHITAEKSISSGILFTVKSLYENTDHGKSRFGKELSVLLDRWMQEINDKDLLHDAEKVGVHFDEENSNLARVAGRSSASGASVSVE-SSDGK

Query:  QTAEPVRDKVLSCGSSDALHPDEIEHSKVQSPRNELNSHSNSGNSVVKDRSPDLTTNSAVMLVPTEDVLKKEETSLCSVGGGTSVSVACSFPAAAREGSD
        QTAEP+ DK+LS GS DALHPD+IE SKVQSPRNEL+SHS SGNSVVKDRSPDLTTNSAVML P+EDVLKK+ETSLCSVGGG  VSV CSFP AAREGSD
Subjt:  QTAEPVRDKVLSCGSSDALHPDEIEHSKVQSPRNELNSHSNSGNSVVKDRSPDLTTNSAVMLVPTEDVLKKEETSLCSVGGGTSVSVACSFPAAAREGSD

Query:  NEQLAGGSKKLNESPELENQVSKIDGSCGRSCVTEKSDNSLHSPMQDPGTVLEGFDAANGEESAKEAPAEQDNDGLENAGVRQRSSSLDSERVSTLDSAS
        NEQLA G KK NESPELENQ +KIDGS GRSCVTEKSDNS HSPMQDPGTVLEGFDAA GEESAKEAPA+QDNDGL++AG  QRSSSLDSE+V TL+SAS
Subjt:  NEQLAGGSKKLNESPELENQVSKIDGSCGRSCVTEKSDNSLHSPMQDPGTVLEGFDAANGEESAKEAPAEQDNDGLENAGVRQRSSSLDSERVSTLDSAS

Query:  GISDKKTNYASISVFKPAGLDAERYGNTLRDLSMNGSRIGKLEERGASFSRMEDFGRVNGDRQRRRKEDDGGMTKSEFSKPKLNLKTS----NGSDMELE
        G+S+KKTNY S+ VFKP G+DA+RY +TLRD SMNGS IGK EERG SFSRMEDFG +  DRQRRRKEDDGGM  S FSKPKLN KTS    N SDMEL+
Subjt:  GISDKKTNYASISVFKPAGLDAERYGNTLRDLSMNGSRIGKLEERGASFSRMEDFGRVNGDRQRRRKEDDGGMTKSEFSKPKLNLKTS----NGSDMELE

Query:  YGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPDTMAEKQDLPADLQGREVQSAKSHVAESYSDAETCLIHPDNLDTQPENLNEME
        YGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPDTM EKQDLPADLQ REVQSAKSHVAESYSDAETCL HPDNLDTQPEN+NEME
Subjt:  YGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPDTMAEKQDLPADLQGREVQSAKSHVAESYSDAETCLIHPDNLDTQPENLNEME

Query:  SSMVTEAARGADASTEKGFCEFDLNQDVFNDDAEQLATPVSLPVSIISVSRPAASSGLPLTPLQFEGALGWRGSAATSAFRPASPRKVPDSDRTLSSGGN
        SSMVTEAARGADASTEKGFCE DLNQDVFNDDAEQ+ATPVS+PVS+ISVSRPAASSGLPLTPLQFEGALGWRGSAATSAFRPASPRKVPDSDRT SSGGN
Subjt:  SSMVTEAARGADASTEKGFCEFDLNQDVFNDDAEQLATPVSLPVSIISVSRPAASSGLPLTPLQFEGALGWRGSAATSAFRPASPRKVPDSDRTLSSGGN

Query:  SDSSKQRQDFLDIDLNVAETGEETRKQNLGSSFPQSGEFLVESGQRRSGGLKLDLNCVGDDVDTPASDLRMEGLFNNQNSYSASPACSSSSMQPLVRNID
        SDSSKQRQDFLDIDLNVAETGEETRKQNLGSSFPQ GEFLVESG RRSGGLKLDLNCVGDDV+ PASDLRM+GLFNNQNSYSASPACSSSSMQPLVRNID
Subjt:  SDSSKQRQDFLDIDLNVAETGEETRKQNLGSSFPQSGEFLVESGQRRSGGLKLDLNCVGDDVDTPASDLRMEGLFNNQNSYSASPACSSSSMQPLVRNID

Query:  LNDRPYVQGDAPDQGPGKYCQNATAYGRSNSDASVISIMGTRVEVSRKDFPFHASSLPNGRTVEPAGMGATLARTGDILGMSSAVSYHQTPFIGYNGLTP
        LNDRP+VQGDAPDQ PGKY QNA+AYG  NSDASVISIMGT+VEVSRKDFPFHASSLPNGRTVEP GMGATLARTGDILGMSSAVSYHQTPFIGYNGLTP
Subjt:  LNDRPYVQGDAPDQGPGKYCQNATAYGRSNSDASVISIMGTRVEVSRKDFPFHASSLPNGRTVEPAGMGATLARTGDILGMSSAVSYHQTPFIGYNGLTP

Query:  GPTISFSTMYEPSGSMPYMVDSRGAAVMPQFMGPMSAVPPSSYSHPPFIMGMADAQLTPNGIAHSRPKFDLNSGLSDSGGLKQLLFPSHLRPMEEQLRQP
        GPTISFSTMYEP GSMPYMVDSRGAAVMPQFMGPMSAVPPSSYSHPPFIMGM +AQLTPNGIAHSRPKFDLNSGLSDSGGLKQLLFP HLR +EEQLRQP
Subjt:  GPTISFSTMYEPSGSMPYMVDSRGAAVMPQFMGPMSAVPPSSYSHPPFIMGMADAQLTPNGIAHSRPKFDLNSGLSDSGGLKQLLFPSHLRPMEEQLRQP

Query:  SSSGVGAKRKEPDCPDSGWEGYLLSYKHQQPPWKQ
        SSSGVG KRKEPD PD GWE Y LSYKHQQPPWKQ
Subjt:  SSSGVGAKRKEPDCPDSGWEGYLLSYKHQQPPWKQ

A0A5D3E6E1 TFIIS N-terminal domain-containing protein | 0.0e+00 | 89.37
Query:  MMTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESII
        MMTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESII
Subjt:  MMTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESII

Query:  VLLQALKKLHITAEKSISSGILFTVKSLYENTDHGKSRFGKELSVLLDRWMQEINDKDLLHDAEKVGVHFDEENSNLARVAGRSSASGASVSVE-SSDGK
        VLLQAL+KLHITAEKSISSGILFTVK L+E+TDHGKSRFGKELSVLLDRWMQEINDKDLL DAE   VHFDEE  NL   AGRSS SGASVS E SSDG+
Subjt:  VLLQALKKLHITAEKSISSGILFTVKSLYENTDHGKSRFGKELSVLLDRWMQEINDKDLLHDAEKVGVHFDEENSNLARVAGRSSASGASVSVE-SSDGK

Query:  QTAEPVRDKVLSCGSSDALHPDEIEHSKVQSPRNELNSHSNSGNSVVKDRSPDLTTNSAVMLVPTEDVLKKEETSLCSVGGGTSVSVACSFPAAAREGSD
        QTAEP+ DK+LS GS DALHPD+IE SKVQSPRNEL+SHS SGNSVVKDRSPDLTTNSAVML P+EDVLKK+ETSLCSVGGG  VSV CSFP AAREGSD
Subjt:  QTAEPVRDKVLSCGSSDALHPDEIEHSKVQSPRNELNSHSNSGNSVVKDRSPDLTTNSAVMLVPTEDVLKKEETSLCSVGGGTSVSVACSFPAAAREGSD

Query:  NEQLAGGSKKLNESPELENQVSKIDGSCGRSCVTEKSDNSLHSPMQDPGTVLEGFDAANGEESAKEAPAEQDNDGLENAGVRQRSSSLDSERVSTLDSAS
        NEQLA G KK NESPELENQ +KIDGS GRSCVTEKSDNS HSPMQDPGTVLEGFDAA GEESAKEAPA+QDNDGL++AG  QRSSSLDSE+VSTL+SAS
Subjt:  NEQLAGGSKKLNESPELENQVSKIDGSCGRSCVTEKSDNSLHSPMQDPGTVLEGFDAANGEESAKEAPAEQDNDGLENAGVRQRSSSLDSERVSTLDSAS

Query:  GISDKKTNYASISVFKPAGLDAERYGNTLRDLSMNGSRIGKLEERGASFSRMEDFGRVNGDRQRRRKEDDGGMTKSEFSKPKLNLKTS----NGSDMELE
        G+S+KKTNY S+ VFKP G+DA+RY +TLRD SMNGS IGK EERG SFSRMEDFG +  DRQRRRKEDDGGM  S FSKPKLN KTS    N SDMEL+
Subjt:  GISDKKTNYASISVFKPAGLDAERYGNTLRDLSMNGSRIGKLEERGASFSRMEDFGRVNGDRQRRRKEDDGGMTKSEFSKPKLNLKTS----NGSDMELE

Query:  YGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPDTMAEKQDLPADLQGREVQSAKSHVAESYSDAETCLIHPDNLDTQPENLNEME
        YGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPDTM EKQDLPADLQ REVQSAKSHVAESYSDAETCL HPDNLDTQPEN+NEME
Subjt:  YGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPDTMAEKQDLPADLQGREVQSAKSHVAESYSDAETCLIHPDNLDTQPENLNEME

Query:  SSMVTEAARGADASTEKGFCEFDLNQDVFNDDAEQLATPVSLPVSIISVSRPAASSGLPLTPLQFEGALGWRGSAATSAFRPASPRKVPDSDRTLSSGGN
        SSMVTEAARGADASTEKGFCE DLNQDVFNDDAEQ+ATPVS+PVS+ISVSRPAASSGLPLTPLQFEGALGWRGSAATSAFRPASPRKVPDSDRT SSGGN
Subjt:  SSMVTEAARGADASTEKGFCEFDLNQDVFNDDAEQLATPVSLPVSIISVSRPAASSGLPLTPLQFEGALGWRGSAATSAFRPASPRKVPDSDRTLSSGGN

Query:  SDSSKQRQDFLDIDLNVAETGEETRKQNLGSSFPQSGEFLVESGQRRSGGLKLDLNCVGDDVDTPASDLRMEGLFNNQNSYSASPACSSSSMQPLVRNID
        SDSSKQRQDFLDIDLNVAETGEETRKQNLGSSFPQ GEFLVESG RRSGGLKLDLNCVGDDV+ PASDLRM+GLFNNQNSYSASPACSSSSMQPLVRNID
Subjt:  SDSSKQRQDFLDIDLNVAETGEETRKQNLGSSFPQSGEFLVESGQRRSGGLKLDLNCVGDDVDTPASDLRMEGLFNNQNSYSASPACSSSSMQPLVRNID

Query:  LNDRPYVQGDAPDQGPGKYCQNATAYGRSNSDASVISIMGTRVEVSRKDFPFHASSLPNGRTVEPAGMGATLARTGDILGMSSAVSYHQTPFIGYNGLTP
        LNDRP+VQGDAPDQ PGKY QNA+AYG  NSDASVISIMGT+VEVSRKDFPFHASSLPNGRTVEP GMGATLARTGDILGMSSAVSYHQTPFIGYNGLTP
Subjt:  LNDRPYVQGDAPDQGPGKYCQNATAYGRSNSDASVISIMGTRVEVSRKDFPFHASSLPNGRTVEPAGMGATLARTGDILGMSSAVSYHQTPFIGYNGLTP

Query:  GPTISFSTMYEPSGSMPYMVDSRGAAVMPQFMGPMSAVPPSSYSHPPFIMGMADAQLTPNGIAHSRPKFDLNSGLSDSGGLKQLLFPSHLRPMEEQLRQP
        GPTISFSTMYEP GSMPYMVDSRGAAVMPQFMGPMSAVPPSSYSHPPFIMGM +AQLTPNGIAHSRPKFDLNSGLSDSGGLKQLLFP HLR +EEQLRQP
Subjt:  GPTISFSTMYEPSGSMPYMVDSRGAAVMPQFMGPMSAVPPSSYSHPPFIMGMADAQLTPNGIAHSRPKFDLNSGLSDSGGLKQLLFPSHLRPMEEQLRQP

Query:  SSSGVGAKRKEPDCPDSGWEGYLLSYKHQQPPWKQ
        SSSGVG KRKEP+ PD GWE Y LSYKHQQPPWKQ
Subjt:  SSSGVGAKRKEPDCPDSGWEGYLLSYKHQQPPWKQ

A0A6J1BYC5 uncharacterized protein LOC111006388 (e value: 0.0e+00; % identity: 87.7)
Query:  MMTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESII
        MMTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGL+FIQRWLKDAQ+FSNDTNDSTVEESII
Subjt:  MMTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESII

Query:  VLLQALKKLHITAEKSISSGILFTVKSLYENTDHGKSRFGKELSVLLDRWMQEINDKDLLHDAEKVGVHFDEENSNLARVAGRSSASGASVSVE-SSDGK
        VLLQALKKLHITAEKSISSGILFTVK LYENTDH KSRFGKELS LLDRWMQEINDK LL D E VG+HFDEENS++A   GRSSASG SVS E +SDGK
Subjt:  VLLQALKKLHITAEKSISSGILFTVKSLYENTDHGKSRFGKELSVLLDRWMQEINDKDLLHDAEKVGVHFDEENSNLARVAGRSSASGASVSVE-SSDGK

Query:  QTAEPVRDKVLSCGSSDALHPDEIEHSKVQSPRNELNSHSNSGNSVVKDRSPDLTTNSAVMLVPTEDVLKKEETSLCSVGGGTSVSVACSFPAAAREGSD
        Q  EPVR+K+ S GS DALHPD+ E SKVQSPRNEL+S   SGNSVVKDRSPDL +NSAVMLVPTEDV KKEET LCSVGGGTS SVACS P  AREGSD
Subjt:  QTAEPVRDKVLSCGSSDALHPDEIEHSKVQSPRNELNSHSNSGNSVVKDRSPDLTTNSAVMLVPTEDVLKKEETSLCSVGGGTSVSVACSFPAAAREGSD

Query:  NEQLAGGSKKLNESPELENQVSKIDGSCGRSCVTEKSDNSLHSPMQDPGTVLEGFDAANGEESAKEAPAEQDNDGLE---NAGVRQRSSSLDSERVSTLD
         EQL  GSKKLNE PE+ENQV KIDGS GRSCVTEKSD S HSPMQD GT LEGFDAANGEESAKEAPA+QDNDGL+   NAGV +RSSSLDSERVSTLD
Subjt:  NEQLAGGSKKLNESPELENQVSKIDGSCGRSCVTEKSDNSLHSPMQDPGTVLEGFDAANGEESAKEAPAEQDNDGLE---NAGVRQRSSSLDSERVSTLD

Query:  SASGISDKKTNYASISVFKPAGLDAERYGNTLRDLSMNGSRIGKLEERGASFSRMEDFGRVNGDRQRRRKEDDGGMTKSEFSKPKLNLKTSN----GSDM
        S SGISDKK NY+S   FK AG + ERY N LRDLSMNGS +GKLE+ GASFSRMEDFG+VNGDRQRRRKEDD  MT SEFSKPKLN KTSN     SDM
Subjt:  SASGISDKKTNYASISVFKPAGLDAERYGNTLRDLSMNGSRIGKLEERGASFSRMEDFGRVNGDRQRRRKEDDGGMTKSEFSKPKLNLKTSN----GSDM

Query:  ELEYGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPDTMAEKQDLPADLQGREVQSAKSHVAESYSDAETCLIHPDNLDTQPENLN
        ELEYGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIR LGKPDTM EKQDLP DL GRE+QSAKSHVAESYSDAETCL HPDNLDTQPEN+N
Subjt:  ELEYGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPDTMAEKQDLPADLQGREVQSAKSHVAESYSDAETCLIHPDNLDTQPENLN

Query:  EMESSMVTEAARGADASTEKGFCEFDLNQDVFNDDAEQLATPVSLPVSIISVSRPAASSGLPLTPLQFEGALGWRGSAATSAFRPASPRKVPDSDRTLSS
        EMESS+VTEAARGA+ STEKGFCEFDLNQ+VFNDD EQLATPVSLPVS+ISVSRPAASSGLPLTPLQFEG LGWRGSAATSAFRPASPRKVPDSDRTLSS
Subjt:  EMESSMVTEAARGADASTEKGFCEFDLNQDVFNDDAEQLATPVSLPVSIISVSRPAASSGLPLTPLQFEGALGWRGSAATSAFRPASPRKVPDSDRTLSS

Query:  GGNSDSSKQRQDFLDIDLNVAETGEETRKQNLGSSFPQSGEFLVESGQRRSGGLKLDLNCVGDDVDTPASDLRMEGLFNNQNSYSASPACSSSSMQPLVR
        GGNSDSSKQRQDFLDIDLNVAETG+ETRKQNLGSSFP SGEFLVESGQRRSGGLKLDLNC GDDVD PASDLRMEG FNNQNSYSASPACSSSSMQPLVR
Subjt:  GGNSDSSKQRQDFLDIDLNVAETGEETRKQNLGSSFPQSGEFLVESGQRRSGGLKLDLNCVGDDVDTPASDLRMEGLFNNQNSYSASPACSSSSMQPLVR

Query:  NIDLNDRPYVQGDAPDQGPGKYCQNATAYGRSNSDASVISIMGTRVEVSRKDFPFHASSLPNGRTVEPAGMGATLARTGDILGMSSAVSYHQTPFIGYNG
        NIDLN+RPYVQGDA DQGPGKYCQNA+AYG  ++DASVISIMGTRVEVSRKDF  HASSLPNGR VEPAGMGATLARTGDILGM+SAVSYHQTPFIGYNG
Subjt:  NIDLNDRPYVQGDAPDQGPGKYCQNATAYGRSNSDASVISIMGTRVEVSRKDFPFHASSLPNGRTVEPAGMGATLARTGDILGMSSAVSYHQTPFIGYNG

Query:  LTPGPTISFSTMYEPSGSMPYMVDSRGAAVMPQFMGPMSAVPPSSYSHPPFIMGMADAQLTPNGIAHSRPKFDLNSGLSDSGGLKQLLFPSHLRPMEEQL
        LTPGPTISFSTMYEPSGS+PYMVDSRGAAVMPQ MGPMSAVPPSSY+HPPFIMGM DAQLTPNG AHSRPKFDLNSGL DSGGLKQLLFP HLR MEEQL
Subjt:  LTPGPTISFSTMYEPSGSMPYMVDSRGAAVMPQFMGPMSAVPPSSYSHPPFIMGMADAQLTPNGIAHSRPKFDLNSGLSDSGGLKQLLFPSHLRPMEEQL

Query:  R---QPSSSGVGAKRKEPDCPDSGWEGYLLSYKHQQPPWKQ
        R   QPSSSGVG KRKEPDCPD GWEGYLLSYKHQQPPWKQ
Subjt:  R---QPSSSGVGAKRKEPDCPDSGWEGYLLSYKHQQPPWKQ

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G48050.1 BAH domain; TFIIS helical bundle-like domain (e value: 2.2e-18; % identity: 27.03)
Query:  DRQRRRKEDDGGMTKSEFSKPKLNLKTSNGSDMELEYGIVDALEVARQVAQEV--------EREVVEYREPSCSSSSDKVSDGGIRQLGKPDTMAEKQDL
        D  ++ ++    ++     K + ++  S+G   ++     DA+ + R + + V        +++ V+  +  C +S   + D     L    T  + + +
Subjt:  DRQRRRKEDDGGMTKSEFSKPKLNLKTSNGSDMELEYGIVDALEVARQVAQEV--------EREVVEYREPSCSSSSDKVSDGGIRQLGKPDTMAEKQDL

Query:  PADLQGREVQSAKSHVAE----SYSDAETCLIHPDNLDTQPENLNEMESSMVTEAARGADA--------STEKGFCEFDLNQDVFNDDAEQ---------
          +L+  EV+   S +      S  +AE  L  P+   T   + +  E+   T AAR A +        S      EFDLN+    DDA+          
Subjt:  PADLQGREVQSAKSHVAE----SYSDAETCLIHPDNLDTQPENLNEMESSMVTEAARGADA--------STEKGFCEFDLNQDVFNDDAEQ---------

Query:  -LATPVSL-----------PVSI---ISVSRPAASSGLPLTP----LQFEGALGWRGSAATSAFRPASPRKVPDSDRTLSSGGNSDSS----KQRQDFLD
           TP  L           PVS     S++  AA+ G P  P    L+ +GA+GWRGSAATSAFRPA PRK  D   ++++   SD+S    KQ + FLD
Subjt:  -LATPVSL-----------PVSI---ISVSRPAASSGLPLTP----LQFEGALGWRGSAATSAFRPASPRKVPDSDRTLSSGGNSDSS----KQRQDFLD

Query:  IDLNVAE-----------TGEETR-KQNLGSSFPQSGEFLVESG-QRRSGGLKLDLNCVGDDVDTPASDLRMEGLFNNQNSYSASPACSSSSMQPLVRNI
         DLNV +           +G  T    ++ +SF Q    ++ S     SGGL LDLN V D  D  +  +      ++       P+          R+ 
Subjt:  IDLNVAE-----------TGEETR-KQNLGSSFPQSGEFLVESG-QRRSGGLKLDLNCVGDDVDTPASDLRMEGLFNNQNSYSASPACSSSSMQPLVRNI

Query:  DLNDRPYVQGDAPDQGPGKYCQNATAYGRSNSDASVISIMGTRVEVSRKDFP----FHASSLP---NGRTVEPAGMGAT------LART-----------
        DLND P V  DA  +      Q++ +   S    S I + G  +      FP    + A S+P     R  +P  M AT      L  T           
Subjt:  DLNDRPYVQGDAPDQGPGKYCQNATAYGRSNSDASVISIMGTRVEVSRKDFP----FHASSLP---NGRTVEPAGMGAT------LART-----------

Query:  -GDILGMSSAVSYHQTPFIGYNGLTPGPTISFSTMYEPSGSMPYM-VDSRGAAVMP----QFMGPMSAVPPSSYSHPPFIMGM----ADAQLTPNGIAHS
         G +L  S A+ +  T F  Y     G +   ++   P  S  +M   S G A  P    Q +GP   V PS+Y   P+I+G+    ++  +  NG    
Subjt:  -GDILGMSSAVSYHQTPFIGYNGLTPGPTISFSTMYEPSGSMPYM-VDSRGAAVMP----QFMGPMSAVPPSSYSHPPFIMGM----ADAQLTPNGIAHS

Query:  RPKFDLNSGLS-------DSGGL--KQLLFPSHLRPMEEQLRQPSSSGVGAKRKEPDCPDSGWEGY
        R   DLNSG         D   L  +QL   + L   E+Q R    SG   KRKE   P+ GW+GY
Subjt:  RPKFDLNSGLS-------DSGGL--KQLLFPSHLRPMEEQLRQPSSSGVGAKRKEPDCPDSGWEGY

AT3G48050.1 BAH domain; TFIIS helical bundle-like domain (e value: 5.0e-10; % identity: 28.86)
Query:  KNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQK------FSNDTNDSTVEESIIVLLQALK
        K GL     VE+L+ +M  E++   K +    R  A +AG +AAT+  DCL  F+QL GL     WL++  K       S   +D  V++ ++VLL+AL 
Subjt:  KNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQK------FSNDTNDSTVEESIIVLLQALK

Query:  KLHITAEKSISSGILFTVKSLYENTDHGKSRFGKELSVLLDRWMQEINDKDLLHDAEKVGVHFDEENSNLARVAGRSSASGASVSVESSDGKQTAEPVRD
        KL +      +  I    KS+     H  S  GK+   L+D W + +  +         GV +    S+     GR S   A  +  SS     ++ V  
Subjt:  KLHITAEKSISSGILFTVKSLYENTDHGKSRFGKELSVLLDRWMQEINDKDLLHDAEKVGVHFDEENSNLARVAGRSSASGASVSVESSDGKQTAEPVRD

Query:  K
        K
Subjt:  K

AT3G48050.2 BAH domain; TFIIS helical bundle-like domain (e value: 2.2e-18; % identity: 27.03)
Query:  DRQRRRKEDDGGMTKSEFSKPKLNLKTSNGSDMELEYGIVDALEVARQVAQEV--------EREVVEYREPSCSSSSDKVSDGGIRQLGKPDTMAEKQDL
        D  ++ ++    ++     K + ++  S+G   ++     DA+ + R + + V        +++ V+  +  C +S   + D     L    T  + + +
Subjt:  DRQRRRKEDDGGMTKSEFSKPKLNLKTSNGSDMELEYGIVDALEVARQVAQEV--------EREVVEYREPSCSSSSDKVSDGGIRQLGKPDTMAEKQDL

Query:  PADLQGREVQSAKSHVAE----SYSDAETCLIHPDNLDTQPENLNEMESSMVTEAARGADA--------STEKGFCEFDLNQDVFNDDAEQ---------
          +L+  EV+   S +      S  +AE  L  P+   T   + +  E+   T AAR A +        S      EFDLN+    DDA+          
Subjt:  PADLQGREVQSAKSHVAE----SYSDAETCLIHPDNLDTQPENLNEMESSMVTEAARGADA--------STEKGFCEFDLNQDVFNDDAEQ---------

Query:  -LATPVSL-----------PVSI---ISVSRPAASSGLPLTP----LQFEGALGWRGSAATSAFRPASPRKVPDSDRTLSSGGNSDSS----KQRQDFLD
           TP  L           PVS     S++  AA+ G P  P    L+ +GA+GWRGSAATSAFRPA PRK  D   ++++   SD+S    KQ + FLD
Subjt:  -LATPVSL-----------PVSI---ISVSRPAASSGLPLTP----LQFEGALGWRGSAATSAFRPASPRKVPDSDRTLSSGGNSDSS----KQRQDFLD

Query:  IDLNVAE-----------TGEETR-KQNLGSSFPQSGEFLVESG-QRRSGGLKLDLNCVGDDVDTPASDLRMEGLFNNQNSYSASPACSSSSMQPLVRNI
         DLNV +           +G  T    ++ +SF Q    ++ S     SGGL LDLN V D  D  +  +      ++       P+          R+ 
Subjt:  IDLNVAE-----------TGEETR-KQNLGSSFPQSGEFLVESG-QRRSGGLKLDLNCVGDDVDTPASDLRMEGLFNNQNSYSASPACSSSSMQPLVRNI

Query:  DLNDRPYVQGDAPDQGPGKYCQNATAYGRSNSDASVISIMGTRVEVSRKDFP----FHASSLP---NGRTVEPAGMGAT------LART-----------
        DLND P V  DA  +      Q++ +   S    S I + G  +      FP    + A S+P     R  +P  M AT      L  T           
Subjt:  DLNDRPYVQGDAPDQGPGKYCQNATAYGRSNSDASVISIMGTRVEVSRKDFP----FHASSLP---NGRTVEPAGMGAT------LART-----------

Query:  -GDILGMSSAVSYHQTPFIGYNGLTPGPTISFSTMYEPSGSMPYM-VDSRGAAVMP----QFMGPMSAVPPSSYSHPPFIMGM----ADAQLTPNGIAHS
         G +L  S A+ +  T F  Y     G +   ++   P  S  +M   S G A  P    Q +GP   V PS+Y   P+I+G+    ++  +  NG    
Subjt:  -GDILGMSSAVSYHQTPFIGYNGLTPGPTISFSTMYEPSGSMPYM-VDSRGAAVMP----QFMGPMSAVPPSSYSHPPFIMGM----ADAQLTPNGIAHS

Query:  RPKFDLNSGLS-------DSGGL--KQLLFPSHLRPMEEQLRQPSSSGVGAKRKEPDCPDSGWEGY
        R   DLNSG         D   L  +QL   + L   E+Q R    SG   KRKE   P+ GW+GY
Subjt:  RPKFDLNSGLS-------DSGGL--KQLLFPSHLRPMEEQLRQPSSSGVGAKRKEPDCPDSGWEGY

AT3G48050.2 BAH domain; TFIIS helical bundle-like domain (e value: 5.0e-10; % identity: 28.86)
Query:  KNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQK------FSNDTNDSTVEESIIVLLQALK
        K GL     VE+L+ +M  E++   K +    R  A +AG +AAT+  DCL  F+QL GL     WL++  K       S   +D  V++ ++VLL+AL 
Subjt:  KNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQK------FSNDTNDSTVEESIIVLLQALK

Query:  KLHITAEKSISSGILFTVKSLYENTDHGKSRFGKELSVLLDRWMQEINDKDLLHDAEKVGVHFDEENSNLARVAGRSSASGASVSVESSDGKQTAEPVRD
        KL +      +  I    KS+     H  S  GK+   L+D W + +  +         GV +    S+     GR S   A  +  SS     ++ V  
Subjt:  KLHITAEKSISSGILFTVKSLYENTDHGKSRFGKELSVLLDRWMQEINDKDLLHDAEKVGVHFDEENSNLARVAGRSSASGASVSVESSDGKQTAEPVRD

Query:  K
        K
Subjt:  K

AT3G48060.1 BAH domain; TFIIS helical bundle-like domain (e value: 4.1e-20; % identity: 29.86)
Query:  ESSMVTEAARGADASTEKGFCEFDLNQDVFNDDAEQ----------LATPVSL-PVSIISVSRPAASSGLPLT----------------PLQFEGALGWR
        ++S V+ AA  +  S      EFDLN+    DDA+           + TP  L PV+ +       SSG+P +                 L+++GA+GWR
Subjt:  ESSMVTEAARGADASTEKGFCEFDLNQDVFNDDAEQ----------LATPVSL-PVSIISVSRPAASSGLPLT----------------PLQFEGALGWR

Query:  GSAATSAFRPASPRKVPDSDRTLSSGGNSDSS----KQRQDFLDIDLNVAETG--EETRKQNLGSSFPQSGEFLVESGQRRSG--GLKLDLNCVGDDVDT
        GSAATSAFRPA PRK  D   ++++   SD+S    KQ + FLD DLNV +    E+   Q  G+    +        Q RSG  G  LD +  G D++ 
Subjt:  GSAATSAFRPASPRKVPDSDRTLSSGGNSDSS----KQRQDFLDIDLNVAETG--EETRKQNLGSSFPQSGEFLVESGQRRSG--GLKLDLNCVGDDVDT

Query:  PASDLRMEGLFNNQNSY--SASPACSSSSMQPLV------RNIDLNDRPYVQGDAPDQGPGKYCQNATAYGRSNSDASVISIMGTRVEVSRKDFP----F
           DL       + NSY  ++S    SS  Q  +      R+ DLND P V  DA  +      Q++ +   S    S I + G  +      FP    +
Subjt:  PASDLRMEGLFNNQNSY--SASPACSSSSMQPLV------RNIDLNDRPYVQGDAPDQGPGKYCQNATAYGRSNSDASVISIMGTRVEVSRKDFP----F

Query:  HASSLP---NGRTVEPAGMGAT------LART------------GDILGMSSAVSYHQTPFIGYNGLTPGPTISFSTMYEPSGSMPYM-VDSRGAAVMP-
         A S+P     R  +P  M AT      L  T            G +L  S A+ +  T F  Y     G +   +    P  S  +M   S G A  P 
Subjt:  HASSLP---NGRTVEPAGMGAT------LART------------GDILGMSSAVSYHQTPFIGYNGLTPGPTISFSTMYEPSGSMPYM-VDSRGAAVMP-

Query:  ---QFMGPMSAVPPSSYSHPPFIMGM----ADAQLTPNGIAHSRPKFDLNSGLS-------DSGGL--KQLLFPSHLRPMEEQLRQPSSSGVGAKRKEPD
           Q +GP   V PS+Y   P+I+G+    ++  +  N     R   DLNSG         D   L  +QL   + +   E+Q R    SG   KRKE  
Subjt:  ---QFMGPMSAVPPSSYSHPPFIMGM----ADAQLTPNGIAHSRPKFDLNSGLS-------DSGGL--KQLLFPSHLRPMEEQLRQPSSSGVGAKRKEPD

Query:  CPDSGWEGY
         P+ GW+GY
Subjt:  CPDSGWEGY

AT4G24200.1 Transcription elongation factor (TFIIS) family protein (e value: 3.1e-105; % identity: 33.33)
Query:  MTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESIIV
        MTLEDFFTLTEIK+GLT   RVEEL++VMQ  KD  +KN  DA R W AVA  IAAT+N+DCLD+F+ LDGL ++  WL +AQ   ND+ D +VEESI+ 
Subjt:  MTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESIIV

Query:  LLQALKKLHITAEKSISSGILFTVKSLYENTDHGKSRFGKELSVLLDRWMQEINDKDLLHDAEKVG-VHFDEENSNLARVAGRSSASGASVSVESSDGKQ
        LL+A++ L + + K +SSG+   VK L    DHG SR   +   L   W  + +     HD+E    +H DE     A +      S  ++    S  ++
Subjt:  LLQALKKLHITAEKSISSGILFTVKSLYENTDHGKSRFGKELSVLLDRWMQEINDKDLLHDAEKVG-VHFDEENSNLARVAGRSSASGASVSVESSDGKQ

Query:  TAEPVRDKVLSCGSSDALHPDEIEHSKVQSPRNELNSHSNS--GNSVVKDRSPDLTTNSAVMLVPTEDVLKKEETSLCSVGG----GTSVSVACSFPAAA
            + D+ L  G S+   PD+ +   +Q+ +    S+ NS   NS+ +  +   T +  +M    E +  KE++S+    G    GT  + + S     
Subjt:  TAEPVRDKVLSCGSSDALHPDEIEHSKVQSPRNELNSHSNS--GNSVVKDRSPDLTTNSAVMLVPTEDVLKKEETSLCSVGG----GTSVSVACSFPAAA

Query:  REGSDNEQLAGGSKKLNESPELENQVSKIDGSCGRSC----VTEKSDNSLHSPMQDPGTVLEGFDAANGEESAKEAPAEQDNDGLENAGVRQRSSSLDSE
        R+ ++    A    ++ +  ++ N   +  G  G S     V   SD+ + S   +  ++L         +S+ ++     N      G    S + +S+
Subjt:  REGSDNEQLAGGSKKLNESPELENQVSKIDGSCGRSC----VTEKSDNSLHSPMQDPGTVLEGFDAANGEESAKEAPAEQDNDGLENAGVRQRSSSLDSE

Query:  RVSTLDSASGISDKKTNYASISVFKPAGLDAERYGNTLRDLSMNGSRIGKLEERGASFSRMEDFGRVNGDRQRRRKEDDGGMTKSEFSKPKLNLKTSNGS
        +VS+L   + ++D + +  S  +    G   +   + L  L+ N  +    ++ G S  +     RV   ++R+++     MT S+       L   + +
Subjt:  RVSTLDSASGISDKKTNYASISVFKPAGLDAERYGNTLRDLSMNGSRIGKLEERGASFSRMEDFGRVNGDRQRRRKEDDGGMTKSEFSKPKLNLKTSNGS

Query:  DMELEYGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPDTMAEKQDLPADLQGREVQSAKSHVAESYSDAETCLIHPDNLDTQPEN
          +++ GI+DALEVA +VAQEV RE V+  EPS SSS +   + G  Q G     +   D+      + +   ++H  E     +  L+  D  D +PE+
Subjt:  DMELEYGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPDTMAEKQDLPADLQGREVQSAKSHVAESYSDAETCLIHPDNLDTQPEN

Query:  LNEMESSMVTEAARGADASTEKGFCEFDLNQDVFNDDAEQLATPVSLPVSIISVSRPAASSGLP-LTPLQFEGALGWRGSAATSAFRPASPRKVPDSD--
         +  E  + T A   ++   EK  C FDLNQD+  D+ + + +  S   + +SVS   +SS +P   P   E +L  +GSAATS F  A P KVP  D  
Subjt:  LNEMESSMVTEAARGADASTEKGFCEFDLNQDVFNDDAEQLATPVSLPVSIISVSRPAASSGLP-LTPLQFEGALGWRGSAATSAFRPASPRKVPDSD--

Query:  --RTLSSGGNSDSSKQRQDFLDIDLNVAETGEET-------RKQNLGSSFPQSGEFLVESGQRRSGGLKLDLNCVG-DDVDTPASDLRME-GLFNNQN-S
          + +S G              IDLNVAE G++        ++    SS  + GE   E+  R S    LDLNC+  DD   P S+ +ME  LF + N  
Subjt:  --RTLSSGGNSDSSKQRQDFLDIDLNVAETGEET-------RKQNLGSSFPQSGEFLVESGQRRSGGLKLDLNCVG-DDVDTPASDLRME-GLFNNQN-S

Query:  YSASPACSSSSMQPLVR--NIDLNDRPYVQGDAPDQGP--GKYCQNATAYGRSNSDASVISIMGTRVEVSRKD-FPFHASSLPNGRTVEPAGMGATLART
         SASP  SSS  Q   +  N DLNDRP    D+ DQGP  G++  +  +YG    +   ISI+GT+VE  RKD  P  AS L NG+++EPA  G  + RT
Subjt:  YSASPACSSSSMQPLVR--NIDLNDRPYVQGDAPDQGP--GKYCQNATAYGRSNSDASVISIMGTRVEVSRKD-FPFHASSLPNGRTVEPAGMGATLART

Query:  GDILGMSSAVSYHQTPFIGYNGLTPGPTISFST-MYEPSGSMPYMVDSRGAAV-MPQFMGPMSAV-PPSSYSHPPFIMGMADAQLTPNGIAHSRPKFDLN
        G+ LG++  VS+   P  GYNGLT  P +S S+ MY P  ++PYMVDSRG  V MPQ +G    V PP    H    M +A    + NG    RP FD N
Subjt:  GDILGMSSAVSYHQTPFIGYNGLTPGPTISFST-MYEPSGSMPYMVDSRGAAV-MPQFMGPMSAV-PPSSYSHPPFIMGMADAQLTPNGIAHSRPKFDLN

Query:  SGLS------DSGGLKQLLFPSHLRPMEEQLR---QPSSS---GVGAKRKEPDCPDSGWEGYLLSYKHQQPPWK
        SG        +S  L+Q L PS    M E      +PSSS    +G KRKE   P+  WE          PPW+
Subjt:  SGLS------DSGGLKQLLFPSHLRPMEEQLR---QPSSS---GVGAKRKEPDCPDSGWEGYLLSYKHQQPPWK


Sequences
CDS sequence
ATGATGACACTTGAAGACTTCTTCACCTTGACCGAAATAAAAAATGGGCTTACAGCCCCATGTAGAGTTGAAGAGTTGATCAATGTTATGCAAAAGGAGAAGGATTGTTT
TGTGAAGAATGTTAGTGATGCAACTAGGCACTGGGCTGCTGTTGCAGGTGCTATTGCTGCTACGGAGAATAAAGATTGTCTTGATCTTTTTATCCAATTAGATGGACTAA
GCTTCATTCAAAGATGGCTTAAGGATGCTCAGAAGTTCAGTAATGATACAAATGATAGCACCGTGGAAGAGTCTATCATTGTTTTGTTGCAAGCACTTAAAAAGCTACAT
ATAACTGCTGAGAAATCTATTTCTTCTGGGATTTTGTTTACTGTTAAGAGTCTTTATGAAAATACTGACCATGGCAAATCTAGGTTTGGGAAAGAATTGAGTGTACTTCT
TGATAGGTGGATGCAGGAGATAAATGATAAAGATTTGCTGCATGATGCAGAAAAGGTTGGTGTTCATTTTGATGAAGAAAATTCAAATCTTGCTCGTGTAGCAGGAAGGT
CATCTGCTTCAGGTGCATCTGTTTCGGTAGAATCAAGTGATGGAAAACAAACAGCAGAACCTGTCAGGGACAAGGTATTATCCTGTGGAAGCTCAGATGCACTTCACCCA
GATGAGATCGAACATTCAAAGGTTCAATCTCCAAGAAATGAGCTCAACTCTCATTCAAATTCTGGAAATTCAGTTGTGAAAGATAGATCTCCAGATTTAACAACAAACTC
TGCTGTTATGTTGGTCCCAACCGAGGATGTTTTGAAGAAGGAAGAAACATCTCTTTGTTCTGTTGGAGGTGGAACCTCAGTTAGTGTGGCTTGTAGCTTTCCAGCTGCTG
CAAGAGAAGGTAGCGATAATGAACAATTAGCTGGTGGTTCAAAGAAGTTGAATGAGTCACCAGAGCTTGAAAACCAAGTGAGTAAGATTGATGGCTCTTGTGGTAGGTCA
TGTGTGACAGAGAAATCAGATAACTCTTTGCATTCTCCCATGCAAGATCCTGGAACTGTTTTGGAAGGTTTCGATGCTGCAAATGGTGAAGAGTCTGCTAAAGAAGCTCC
AGCTGAACAAGATAATGATGGTCTTGAAAATGCTGGTGTCCGTCAGCGTAGTTCTAGTCTAGATAGTGAGAGAGTTTCCACATTAGATTCAGCAAGTGGGATATCTGATA
AAAAGACGAATTATGCTAGCATATCTGTGTTCAAACCTGCAGGCTTAGATGCTGAGCGCTATGGAAATACTCTGCGGGATTTGTCTATGAATGGGAGTCGAATAGGAAAA
CTTGAGGAACGTGGGGCTTCTTTTTCGAGGATGGAAGACTTTGGTCGAGTTAATGGAGACAGACAACGTCGGAGAAAGGAAGATGACGGTGGAATGACTAAGTCTGAATT
TTCCAAACCGAAATTAAACCTCAAAACTTCAAATGGGTCAGACATGGAACTTGAGTATGGTATAGTTGATGCTCTGGAGGTTGCTCGGCAAGTAGCTCAAGAAGTAGAAA
GAGAAGTGGTGGAATATAGAGAGCCATCCTGCAGCTCTTCTTCTGATAAAGTTTCTGATGGTGGAATCAGGCAGCTGGGTAAACCAGACACCATGGCTGAAAAACAGGAC
CTGCCAGCTGATCTCCAAGGAAGGGAGGTCCAATCTGCAAAAAGTCATGTTGCTGAATCATATTCTGATGCGGAGACATGCTTAATCCATCCAGATAATTTAGATACTCA
ACCAGAAAATTTAAATGAAATGGAATCCTCCATGGTCACTGAAGCAGCTCGAGGGGCAGATGCAAGTACAGAAAAAGGGTTTTGTGAATTTGATCTAAATCAAGACGTAT
TTAATGATGATGCAGAGCAGTTAGCAACTCCAGTGTCTTTACCGGTGTCTATTATCTCTGTTTCTAGACCAGCTGCTTCTTCTGGCTTGCCTCTAACACCTTTGCAGTTT
GAAGGGGCGCTTGGATGGAGAGGCTCTGCAGCCACTAGTGCTTTCCGGCCTGCTTCCCCACGTAAAGTTCCTGATAGTGATAGAACTCTTTCTAGTGGGGGAAATTCTGA
TAGTTCAAAGCAGAGGCAGGATTTTCTTGACATTGATTTGAATGTGGCCGAGACTGGAGAAGAAACCAGGAAACAAAACCTAGGATCATCTTTCCCACAATCCGGAGAGT
TTTTAGTTGAAAGTGGACAAAGAAGATCCGGGGGACTAAAGCTGGACCTTAACTGTGTTGGTGACGATGTTGATACTCCAGCATCAGATTTGAGAATGGAGGGACTCTTC
AACAACCAGAATAGCTACAGTGCTTCTCCTGCCTGCTCTTCATCATCGATGCAACCTTTGGTAAGGAATATTGATTTGAATGATAGGCCATACGTTCAAGGTGATGCTCC
AGATCAAGGTCCTGGTAAGTATTGTCAAAATGCAACTGCTTATGGACGGAGTAACTCAGATGCTTCTGTTATTTCCATTATGGGTACAAGGGTGGAAGTTAGTAGAAAGG
ATTTTCCTTTTCATGCTTCATCGTTGCCAAATGGCAGAACTGTTGAGCCTGCTGGAATGGGTGCTACTTTGGCAAGGACAGGAGATATACTAGGGATGAGCTCTGCGGTT
TCTTACCACCAAACTCCTTTTATTGGTTACAATGGATTGACACCAGGGCCGACCATTTCATTCTCAACCATGTATGAACCTAGTGGTTCAATGCCTTACATGGTTGATTC
AAGAGGAGCTGCTGTTATGCCTCAATTTATGGGCCCTATGTCAGCTGTTCCGCCTTCCTCTTACTCTCACCCGCCGTTTATCATGGGGATGGCAGATGCACAGCTGACTC
CCAATGGCATTGCTCACTCGCGTCCTAAGTTTGATTTGAATTCTGGATTAAGTGATTCTGGGGGTTTAAAGCAGCTCCTATTCCCAAGCCATCTCCGACCCATGGAAGAG
CAGTTGAGACAACCCTCAAGTTCTGGAGTTGGTGCGAAAAGGAAAGAACCAGATTGCCCTGATAGTGGATGGGAAGGGTATCTGCTAAGTTATAAACATCAACAGCCTCC
ATGGAAACAGTAA
mRNA sequence
ATGATGACACTTGAAGACTTCTTCACCTTGACCGAAATAAAAAATGGGCTTACAGCCCCATGTAGAGTTGAAGAGTTGATCAATGTTATGCAAAAGGAGAAGGATTGTTT
TGTGAAGAATGTTAGTGATGCAACTAGGCACTGGGCTGCTGTTGCAGGTGCTATTGCTGCTACGGAGAATAAAGATTGTCTTGATCTTTTTATCCAATTAGATGGACTAA
GCTTCATTCAAAGATGGCTTAAGGATGCTCAGAAGTTCAGTAATGATACAAATGATAGCACCGTGGAAGAGTCTATCATTGTTTTGTTGCAAGCACTTAAAAAGCTACAT
ATAACTGCTGAGAAATCTATTTCTTCTGGGATTTTGTTTACTGTTAAGAGTCTTTATGAAAATACTGACCATGGCAAATCTAGGTTTGGGAAAGAATTGAGTGTACTTCT
TGATAGGTGGATGCAGGAGATAAATGATAAAGATTTGCTGCATGATGCAGAAAAGGTTGGTGTTCATTTTGATGAAGAAAATTCAAATCTTGCTCGTGTAGCAGGAAGGT
CATCTGCTTCAGGTGCATCTGTTTCGGTAGAATCAAGTGATGGAAAACAAACAGCAGAACCTGTCAGGGACAAGGTATTATCCTGTGGAAGCTCAGATGCACTTCACCCA
GATGAGATCGAACATTCAAAGGTTCAATCTCCAAGAAATGAGCTCAACTCTCATTCAAATTCTGGAAATTCAGTTGTGAAAGATAGATCTCCAGATTTAACAACAAACTC
TGCTGTTATGTTGGTCCCAACCGAGGATGTTTTGAAGAAGGAAGAAACATCTCTTTGTTCTGTTGGAGGTGGAACCTCAGTTAGTGTGGCTTGTAGCTTTCCAGCTGCTG
CAAGAGAAGGTAGCGATAATGAACAATTAGCTGGTGGTTCAAAGAAGTTGAATGAGTCACCAGAGCTTGAAAACCAAGTGAGTAAGATTGATGGCTCTTGTGGTAGGTCA
TGTGTGACAGAGAAATCAGATAACTCTTTGCATTCTCCCATGCAAGATCCTGGAACTGTTTTGGAAGGTTTCGATGCTGCAAATGGTGAAGAGTCTGCTAAAGAAGCTCC
AGCTGAACAAGATAATGATGGTCTTGAAAATGCTGGTGTCCGTCAGCGTAGTTCTAGTCTAGATAGTGAGAGAGTTTCCACATTAGATTCAGCAAGTGGGATATCTGATA
AAAAGACGAATTATGCTAGCATATCTGTGTTCAAACCTGCAGGCTTAGATGCTGAGCGCTATGGAAATACTCTGCGGGATTTGTCTATGAATGGGAGTCGAATAGGAAAA
CTTGAGGAACGTGGGGCTTCTTTTTCGAGGATGGAAGACTTTGGTCGAGTTAATGGAGACAGACAACGTCGGAGAAAGGAAGATGACGGTGGAATGACTAAGTCTGAATT
TTCCAAACCGAAATTAAACCTCAAAACTTCAAATGGGTCAGACATGGAACTTGAGTATGGTATAGTTGATGCTCTGGAGGTTGCTCGGCAAGTAGCTCAAGAAGTAGAAA
GAGAAGTGGTGGAATATAGAGAGCCATCCTGCAGCTCTTCTTCTGATAAAGTTTCTGATGGTGGAATCAGGCAGCTGGGTAAACCAGACACCATGGCTGAAAAACAGGAC
CTGCCAGCTGATCTCCAAGGAAGGGAGGTCCAATCTGCAAAAAGTCATGTTGCTGAATCATATTCTGATGCGGAGACATGCTTAATCCATCCAGATAATTTAGATACTCA
ACCAGAAAATTTAAATGAAATGGAATCCTCCATGGTCACTGAAGCAGCTCGAGGGGCAGATGCAAGTACAGAAAAAGGGTTTTGTGAATTTGATCTAAATCAAGACGTAT
TTAATGATGATGCAGAGCAGTTAGCAACTCCAGTGTCTTTACCGGTGTCTATTATCTCTGTTTCTAGACCAGCTGCTTCTTCTGGCTTGCCTCTAACACCTTTGCAGTTT
GAAGGGGCGCTTGGATGGAGAGGCTCTGCAGCCACTAGTGCTTTCCGGCCTGCTTCCCCACGTAAAGTTCCTGATAGTGATAGAACTCTTTCTAGTGGGGGAAATTCTGA
TAGTTCAAAGCAGAGGCAGGATTTTCTTGACATTGATTTGAATGTGGCCGAGACTGGAGAAGAAACCAGGAAACAAAACCTAGGATCATCTTTCCCACAATCCGGAGAGT
TTTTAGTTGAAAGTGGACAAAGAAGATCCGGGGGACTAAAGCTGGACCTTAACTGTGTTGGTGACGATGTTGATACTCCAGCATCAGATTTGAGAATGGAGGGACTCTTC
AACAACCAGAATAGCTACAGTGCTTCTCCTGCCTGCTCTTCATCATCGATGCAACCTTTGGTAAGGAATATTGATTTGAATGATAGGCCATACGTTCAAGGTGATGCTCC
AGATCAAGGTCCTGGTAAGTATTGTCAAAATGCAACTGCTTATGGACGGAGTAACTCAGATGCTTCTGTTATTTCCATTATGGGTACAAGGGTGGAAGTTAGTAGAAAGG
ATTTTCCTTTTCATGCTTCATCGTTGCCAAATGGCAGAACTGTTGAGCCTGCTGGAATGGGTGCTACTTTGGCAAGGACAGGAGATATACTAGGGATGAGCTCTGCGGTT
TCTTACCACCAAACTCCTTTTATTGGTTACAATGGATTGACACCAGGGCCGACCATTTCATTCTCAACCATGTATGAACCTAGTGGTTCAATGCCTTACATGGTTGATTC
AAGAGGAGCTGCTGTTATGCCTCAATTTATGGGCCCTATGTCAGCTGTTCCGCCTTCCTCTTACTCTCACCCGCCGTTTATCATGGGGATGGCAGATGCACAGCTGACTC
CCAATGGCATTGCTCACTCGCGTCCTAAGTTTGATTTGAATTCTGGATTAAGTGATTCTGGGGGTTTAAAGCAGCTCCTATTCCCAAGCCATCTCCGACCCATGGAAGAG
CAGTTGAGACAACCCTCAAGTTCTGGAGTTGGTGCGAAAAGGAAAGAACCAGATTGCCCTGATAGTGGATGGGAAGGGTATCTGCTAAGTTATAAACATCAACAGCCTCC
ATGGAAACAGTAA
Protein sequence
MMTLEDFFTLTEIKNGLTAPCRVEELINVMQKEKDCFVKNVSDATRHWAAVAGAIAATENKDCLDLFIQLDGLSFIQRWLKDAQKFSNDTNDSTVEESIIVLLQALKKLH
ITAEKSISSGILFTVKSLYENTDHGKSRFGKELSVLLDRWMQEINDKDLLHDAEKVGVHFDEENSNLARVAGRSSASGASVSVESSDGKQTAEPVRDKVLSCGSSDALHP
DEIEHSKVQSPRNELNSHSNSGNSVVKDRSPDLTTNSAVMLVPTEDVLKKEETSLCSVGGGTSVSVACSFPAAAREGSDNEQLAGGSKKLNESPELENQVSKIDGSCGRS
CVTEKSDNSLHSPMQDPGTVLEGFDAANGEESAKEAPAEQDNDGLENAGVRQRSSSLDSERVSTLDSASGISDKKTNYASISVFKPAGLDAERYGNTLRDLSMNGSRIGK
LEERGASFSRMEDFGRVNGDRQRRRKEDDGGMTKSEFSKPKLNLKTSNGSDMELEYGIVDALEVARQVAQEVEREVVEYREPSCSSSSDKVSDGGIRQLGKPDTMAEKQD
LPADLQGREVQSAKSHVAESYSDAETCLIHPDNLDTQPENLNEMESSMVTEAARGADASTEKGFCEFDLNQDVFNDDAEQLATPVSLPVSIISVSRPAASSGLPLTPLQF
EGALGWRGSAATSAFRPASPRKVPDSDRTLSSGGNSDSSKQRQDFLDIDLNVAETGEETRKQNLGSSFPQSGEFLVESGQRRSGGLKLDLNCVGDDVDTPASDLRMEGLF
NNQNSYSASPACSSSSMQPLVRNIDLNDRPYVQGDAPDQGPGKYCQNATAYGRSNSDASVISIMGTRVEVSRKDFPFHASSLPNGRTVEPAGMGATLARTGDILGMSSAV
SYHQTPFIGYNGLTPGPTISFSTMYEPSGSMPYMVDSRGAAVMPQFMGPMSAVPPSSYSHPPFIMGMADAQLTPNGIAHSRPKFDLNSGLSDSGGLKQLLFPSHLRPMEE
QLRQPSSSGVGAKRKEPDCPDSGWEGYLLSYKHQQPPWKQ