CuGenDBv2

Lsi09G013050 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi09G013050
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: CRC domain-containing protein TSO1
Genome location: chr09:20917935..20928611
RNA-Seq Expression: Lsi09G013050
Synteny: Lsi09G013050
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005839 - proteasome core complex (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0004298 - threonine-type endopeptidase activity (molecular function)
InterPro domains:
IPR001353 - Proteasome, subunit alpha/beta
IPR005172 - CRC domain
IPR023333 - Proteasome B-type subunit
IPR029055 - Nucleophile aminohydrolases, N-terminal
IPR033467 - Tesmin/TSO1-like CXC domain
IPR035206 - Proteasome subunit beta 2
IPR044522 - CRC domain-containing protein TSO1-like
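
The genome location in the record above uses a `chromosome:start..end` string. As a minimal sketch of how such a string could be split into its parts, the hypothetical helper `parse_location` below is not part of CuGenDB. The span length it reports assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re

def parse_location(loc):
    """Split 'chr:start..end' into (chromosome, start, end, span_length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = m.group(1)
    start, end = int(m.group(2)), int(m.group(3))
    return chrom, start, end, end - start + 1

# Location string copied from the record above
chrom, start, end, length = parse_location("chr09:20917935..20928611")
print(chrom, start, end, length)
```

Under that inclusive-coordinate assumption, the gene spans 10,677 bp on chr09.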


Homology
GenBank top hits (e value, % identity, alignment)
KAA0057436.1 CRC domain-containing protein TSO1 [Cucumis melo var. makuwa] (e value: 1.7e-273; % identity: 82.35)
Query:  DSTPQKKPTKLPP-LSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPP-HHFSPLKSPKRPNFSNPSRSPPADAKNE-GAVIIMDQLL
        DSTP+KKP KLPP LSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPP HHFSPLKSPKRPNFSNPSRSPP  +KNE GAVIIMDQLL
Subjt:  DSTPQKKPTKLPP-LSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPP-HHFSPLKSPKRPNFSNPSRSPPADAKNE-GAVIIMDQLL

Query:  EIFVPEIPPAGDGTTPTIQPQETVNSVAGEVLIFRSPISSEAMEEAINGAEDEALFRNDSVLDRIEENKPLSNLQSGNMRRRCLDFEMAGKPNTAVVAVE
        EIFVPEIP A   +TP I PQETV+S AGEV++FR PIS+EA E+     EDEALF+ND VLD IEENK LSNLQSGNMRRRCLDFEMAG PN+A V   
Subjt:  EIFVPEIPPAGDGTTPTIQPQETVNSVAGEVLIFRSPISSEAMEEAINGAEDEALFRNDSVLDRIEENKPLSNLQSGNMRRRCLDFEMAGKPNTAVVAVE

Query:  ATADASIP---ASSSSPFRCTLPGIGLHLNALAATLKHSDSENLCSDRQPSLPSSSAPIFVPNSTRDQPLIASSTPESEKPPQEPANLTGEELNQINPKK
        A  DASIP   +SSSS  R TLPGIGLHLN+LAATLK+SDS+NLCSD QPSLPSSSA IF  NSTRDQ L+ASSTP+SE PPQE ANLTGEELNQINPKK
Subjt:  ATADASIP---ASSSSPFRCTLPGIGLHLNALAATLKHSDSENLCSDRQPSLPSSSAPIFVPNSTRDQPLIASSTPESEKPPQEPANLTGEELNQINPKK

Query:  NWKLMENAGIGACKRCNCKKSRCLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESRNP
        NWKLMENAGIGACKRCNCKKSRCLKL                                YCECFAAGVYCIEPCSCQ CFNKPIHEA+VLETRRQIESRNP
Subjt:  NWKLMENAGIGACKRCNCKKSRCLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESRNP

Query:  LAFAPKVILNCDSVSELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQKHAEVQSD
        LAFAPKVI+N DSVSELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIG E EQEEEGREHCQK  ++QSD
Subjt:  LAFAPKVILNCDSVSELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQKHAEVQSD

Query:  EDHQNPSNAAPSTPLGPCRSLIPFPFQLKRRLPSFLNDESSSRLSVRFKLEKHGITQTEPKFEKTPCEDVMPETLRNGCPSSTGVKSVSPNSKRVTLPPP
        ED QNPSNAAPSTPLGP RSLIPFPFQLKRRLPSFLNDESSSRLSVRFKLEKHGITQTEPKFEKTPCEDV PETL NGCPS T VKSVSPNSKRVTLPPP
Subjt:  EDHQNPSNAAPSTPLGPCRSLIPFPFQLKRRLPSFLNDESSSRLSVRFKLEKHGITQTEPKFEKTPCEDVMPETLRNGCPSSTGVKSVSPNSKRVTLPPP

Query:  QGDFRPLPSTRI
        Q DFRPLPSTRI
Subjt:  QGDFRPLPSTRI
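
In BLAST-style pairwise blocks like the one above, the unlabeled middle line conventionally repeats the residue letter where the two sequences are identical, shows `+` for a conservative substitution, and is blank elsewhere. A rough sketch of tallying one block under that convention follows. The function name `midline_stats` is hypothetical, and per-block counts are not expected to reproduce the overall % identity the page reports for the full alignment.

```python
def midline_stats(query, midline):
    """Count identities and positives in one alignment block.

    Assumes the BLAST convention: a letter on the midline marks an
    identical position, '+' marks a conservative substitution, and a
    space marks a mismatch or gap.
    """
    # Trailing blanks are often stripped from the midline; pad it back
    midline = midline.ljust(len(query))
    identities = sum(1 for c in midline if c.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(query)

# Final block of the alignment above (query line and its midline)
ident, pos, cols = midline_stats("QGDFRPLPSTRI", "Q DFRPLPSTRI")
print(ident, pos, cols)
```

For that final block, 11 of the 12 aligned columns are identical.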

KAG6593881.1 CRC domain-containing protein TSO1, partial [Cucurbita argyrosperma subsp. sororia] (e value: 4.6e-226; % identity: 71.87)
Query:  MDSTPQKKPTKL-----PPLSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPPHHFSPLKSPKRPNFSNPSRSPPADAKNEGAVIIMD
        MDSTP+KKPT       PPL KFEDSPVFNFINSLSPIKP+KSIHITQTFNSISFPSLPVFT          SPKRPNF NPSRS P  + NEG VIIMD
Subjt:  MDSTPQKKPTKL-----PPLSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPPHHFSPLKSPKRPNFSNPSRSPPADAKNEGAVIIMD

Query:  QLLEIFVPEIPPAGDGTTPTIQPQETVNSVAGEVLIFRSPISSEAMEEAINGAEDEALFRNDSVLDRIEENKPLSNLQSGNMRRRCLDFEMAGKPNTAVV
        QLLEIFVPEIP A DG+TP +QP E+VN  AGEV      ISSEAME   + AEDE     DS       NKPL NLQSGNM RRCLDFEMAG PNTA  
Subjt:  QLLEIFVPEIPPAGDGTTPTIQPQETVNSVAGEVLIFRSPISSEAMEEAINGAEDEALFRNDSVLDRIEENKPLSNLQSGNMRRRCLDFEMAGKPNTAVV

Query:  AVEATADASI--PASSSSPFRCTLPGIGLHLNALAATLKHSDSENLCSDRQPSLPSSSAPIFVPNSTRDQPLIASSTPESEKPPQEPANLTGEELNQINP
        A  A  D SI   +SSSS FRCTLPGIGLHLNA+AATLKH   ENL      S PSSSA I  PNS +DQPL+ASS  E E P  EPANL GE LN INP
Subjt:  AVEATADASI--PASSSSPFRCTLPGIGLHLNALAATLKHSDSENLCSDRQPSLPSSSAPIFVPNSTRDQPLIASSTPESEKPPQEPANLTGEELNQINP

Query:  KKNWKLMENAGIGACKRCNCKKSRCLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESR
        KKN  LMENA IGACKRCNCKKS+CLKL                                YCECFAAGVYCI+PCSCQ CFN+PIHEA+VLETRRQIESR
Subjt:  KKNWKLMENAGIGACKRCNCKKSRCLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESR

Query:  NPLAFAPKVILNCDSVSELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQKHAEVQ
        NPLAFAPKVI+N DS SELGDDSNKTPASARHKRGCNCKKS CLKKYCECYQGGVGCSINCRCEGCKN FGRKDESA+I  +T+QEEEGREHC K AEVQ
Subjt:  NPLAFAPKVILNCDSVSELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQKHAEVQ

Query:  SDEDHQNPSNAAPSTPLGPCRSLIPFPFQLKRRLPSFLNDESSSRLSVRFKLEKHGITQTEPKFEKTPCEDVMPETLRNGCPSSTG-VKSVSPNSKRVTL
         DED QNP++A PSTPLG CRSLIP PFQLKR+LPSF++DESS RLSVRFKLEK GI QTE KFEKTP  DVMPET+ NGC SSTG VKSVSPNSKR+TL
Subjt:  SDEDHQNPSNAAPSTPLGPCRSLIPFPFQLKRRLPSFLNDESSSRLSVRFKLEKHGITQTEPKFEKTPCEDVMPETLRNGCPSSTG-VKSVSPNSKRVTL

Query:  PPPQGDFRPLPSTRI
        PPPQGDFRPLPSTRI
Subjt:  PPPQGDFRPLPSTRI

XP_004145421.1 CRC domain-containing protein TSO1 [Cucumis sativus] (e value: 4.1e-267; % identity: 80.42)
Query:  DSTPQKKPTKLPP-LSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPP-HHFSPLKSPKRPNFSNPSRSPPAD-AKNE-GAVIIMDQL
        DSTP+KKPTKLPP LSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPP HHFSPLKSPKRPNFSNPSRSPP   +KNE GAVIIMDQL
Subjt:  DSTPQKKPTKLPP-LSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPP-HHFSPLKSPKRPNFSNPSRSPPAD-AKNE-GAVIIMDQL

Query:  LEIFVPEIPPAGDGTTPTIQPQETVNSVAGEVLIFRSPISSEAMEEAINGAEDEALFRNDSVLDRIEENKPLSNLQSGNMRRRCLDFEMAGKPNTAVVAV
        LEIFVPEIP A   +TP I P ETV+S AGEV+IF  PIS+EAMEE     EDEALF+ND  L  IEENKPLSNL SGNMRRRCLDFEMAG P +A V  
Subjt:  LEIFVPEIPPAGDGTTPTIQPQETVNSVAGEVLIFRSPISSEAMEEAINGAEDEALFRNDSVLDRIEENKPLSNLQSGNMRRRCLDFEMAGKPNTAVVAV

Query:  EATADASIPA---SSSSPFRCTLPGIGLHLNALAATLKHSDSENLCSDRQPSLPSSSAPIFVPNSTRDQPLIASSTPESEKPPQEPANLTGEELNQINPK
          T DASIPA   SSSS FR TLP IGLHLNALAATLKHSDS+N+CSD QPS PSSSAPIF  NST DQ L+ASSTPESE PPQE ANLTGEE++  NPK
Subjt:  EATADASIPA---SSSSPFRCTLPGIGLHLNALAATLKHSDSENLCSDRQPSLPSSSAPIFVPNSTRDQPLIASSTPESEKPPQEPANLTGEELNQINPK

Query:  KNWKLMENAGIGACKRCNCKKSRCLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESRN
         NW+ MENAG+GACKRCNCKKSRCLKL                                YCECFAAGVYCIEPCSCQ CFNKPIHEA+VLETRRQIESRN
Subjt:  KNWKLMENAGIGACKRCNCKKSRCLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESRN

Query:  PLAFAPKVILNCDSVSELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQKHAEVQS
        PLAFAPKVI+NCD +SEL DDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESAL+G E EQEEEGREHCQK  +V S
Subjt:  PLAFAPKVILNCDSVSELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQKHAEVQS

Query:  DEDHQNPSNAAPSTPLGPCRSLIPFPFQLKRRLPSFLNDESSSRLSVRFKLEKHGITQTEPKFEKTPCEDVMPETLRNGCPSSTGVKSVSPNSKRVTLPP
        DED QNPSNAAPSTPLGP RSLIPFPFQLKRRLPSFLNDESSSRLSVRFKLEKHGITQTEPKFEKTPCEDV PETL NGCPS TGVKSVSPNSKR+TLP 
Subjt:  DEDHQNPSNAAPSTPLGPCRSLIPFPFQLKRRLPSFLNDESSSRLSVRFKLEKHGITQTEPKFEKTPCEDVMPETLRNGCPSSTGVKSVSPNSKRVTLPP

Query:  PQGDFRPLPSTRI
        PQ DFRPLPSTRI
Subjt:  PQGDFRPLPSTRI

XP_008449927.1 PREDICTED: CRC domain-containing protein TSO1, partial [Cucumis melo] (e value: 3.7e-268; % identity: 82.44)
Query:  LSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPP-HHFSPLKSPKRPNFSNPSRSPPADAKNE-GAVIIMDQLLEIFVPEIPPAGDGT
        LSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPP HHFSPLKSPKRPNFSNPSRSPP  +KNE GAVIIMDQLLEIFVPEIP A   +
Subjt:  LSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPP-HHFSPLKSPKRPNFSNPSRSPPADAKNE-GAVIIMDQLLEIFVPEIPPAGDGT

Query:  TPTIQPQETVNSVAGEVLIFRSPISSEAMEEAINGAEDEALFRNDSVLDRIEENKPLSNLQSGNMRRRCLDFEMAGKPNTAVVAVEATADASIP---ASS
        TP I PQETV+S AGEV++FR PIS+EA E+     EDEALF+ND VLD IEENK LSNLQSGNMRRRCLDFEMAG PN+A V   A  DASIP   +SS
Subjt:  TPTIQPQETVNSVAGEVLIFRSPISSEAMEEAINGAEDEALFRNDSVLDRIEENKPLSNLQSGNMRRRCLDFEMAGKPNTAVVAVEATADASIP---ASS

Query:  SSPFRCTLPGIGLHLNALAATLKHSDSENLCSDRQPSLPSSSAPIFVPNSTRDQPLIASSTPESEKPPQEPANLTGEELNQINPKKNWKLMENAGIGACK
        SS  R TLPGIGLHLN+LAATLK+SDS+NLCSD QPSLPSSSA IF  NSTRDQ L+ASSTP+SE PPQE ANLTGEELNQINPKKNWKLMENAGIGACK
Subjt:  SSPFRCTLPGIGLHLNALAATLKHSDSENLCSDRQPSLPSSSAPIFVPNSTRDQPLIASSTPESEKPPQEPANLTGEELNQINPKKNWKLMENAGIGACK

Query:  RCNCKKSRCLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESRNPLAFAPKVILNCDSV
        RCNCKKSRCLKL                                YCECFAAGVYCIEPCSCQ CFNKPIHEA+VLETRRQIESRNPLAFAPKVI+N DSV
Subjt:  RCNCKKSRCLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESRNPLAFAPKVILNCDSV

Query:  SELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQKHAEVQSDEDHQNPSNAAPSTP
        SELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIG E EQEEEGREHCQK  ++QSDED QNPSNAAPSTP
Subjt:  SELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQKHAEVQSDEDHQNPSNAAPSTP

Query:  LGPCRSLIPFPFQLKRRLPSFLNDESSSRLSVRFKLEKHGITQTEPKFEKTPCEDVMPETLRNGCPSSTGVKSVSPNSKRVTLPPPQGDFRPLPSTRI
        LGP RSLIPFPFQLKRRLPSFLNDESSSRLSVRFKLEKHGITQTEPKFEKTPCEDV PETL NGCPS T VKSVSPNSKRVTLPPPQ DFRPLPSTRI
Subjt:  LGPCRSLIPFPFQLKRRLPSFLNDESSSRLSVRFKLEKHGITQTEPKFEKTPCEDVMPETLRNGCPSSTGVKSVSPNSKRVTLPPPQGDFRPLPSTRI

XP_038906811.1 protein tesmin/TSO1-like CXC 2 [Benincasa hispida] (e value: 5.1e-302; % identity: 87.97)
Query:  DSTPQKKPTKLPPLSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPPHHFSPLKSPKRPNFSNPSRSPPADAKNEGAVIIMDQLLEIF
        DSTPQKKPTKLPPLSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPPH+FSPLKSPKRPNFSNPSRSPPA +KNEGAVIIMDQLLEIF
Subjt:  DSTPQKKPTKLPPLSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPPHHFSPLKSPKRPNFSNPSRSPPADAKNEGAVIIMDQLLEIF

Query:  VPEIPPAGDGTTPTIQPQETVNSVAGEVLIFRSPISSEAMEEAINGAEDEALFRNDSVLDRIEENKPLSNLQSGNMRRRCLDFEMAGKPNTAVVAVEATA
        VPEIPP+GDG+TP IQP ETV+S AGEVLIFR PISSEAM+ AI+G EDE L +NDSVL+RIEENK LSNLQSGNMRRRCLDFEMAGKPN A  AVEAT 
Subjt:  VPEIPPAGDGTTPTIQPQETVNSVAGEVLIFRSPISSEAMEEAINGAEDEALFRNDSVLDRIEENKPLSNLQSGNMRRRCLDFEMAGKPNTAVVAVEATA

Query:  DASIPASSSSPFRCTLPGIGLHLNALAATLKHSDSENLCSDRQPSLPSSSAPIFVPNSTRDQPLIASSTPESEKPPQEPANLTGEELNQINPKKNWKLME
        DASIP SSSS FRCTLPGIGLHLNALAAT KHSDSENLCS RQPSLPSSSAPIF PNS RDQPL+ASSTPESE PPQEPANLTGEELNQINPKKNWKLME
Subjt:  DASIPASSSSPFRCTLPGIGLHLNALAATLKHSDSENLCSDRQPSLPSSSAPIFVPNSTRDQPLIASSTPESEKPPQEPANLTGEELNQINPKKNWKLME

Query:  NAGIGACKRCNCKKSRCLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESRNPLAFAPK
        NA IGACKRCNCKKS+CLKL                                YCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESRNPLAFAPK
Subjt:  NAGIGACKRCNCKKSRCLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESRNPLAFAPK

Query:  VILNCDSVSELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQKHAEVQSDEDHQNP
        VILNCDSVSELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQK+AEVQSD+DH NP
Subjt:  VILNCDSVSELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQKHAEVQSDEDHQNP

Query:  SNAAPSTPLGPCRSLIPFPFQLKRRLPSFLNDESSSR-LSVRFKLEKHGITQTEPKFEKTPCEDVMPETLRNGCPSSTGVKSVSPNSKRVTLPPPQGDFR
        SNAAPSTPLGP RSLIPFPF LKRR PSFLNDESSSR LSVRFKLEKHGITQTEPKFEKTPCEDVMPETL NGCPSSTGVKSVSPNSKRVTLPPPQGDFR
Subjt:  SNAAPSTPLGPCRSLIPFPFQLKRRLPSFLNDESSSR-LSVRFKLEKHGITQTEPKFEKTPCEDVMPETLRNGCPSSTGVKSVSPNSKRVTLPPPQGDFR

Query:  PLPSTRI
        PLPSTRI
Subjt:  PLPSTRI

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LNU4 CRC domain-containing protein (e value: 2.0e-267; % identity: 80.42)
Query:  DSTPQKKPTKLPP-LSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPP-HHFSPLKSPKRPNFSNPSRSPPAD-AKNE-GAVIIMDQL
        DSTP+KKPTKLPP LSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPP HHFSPLKSPKRPNFSNPSRSPP   +KNE GAVIIMDQL
Subjt:  DSTPQKKPTKLPP-LSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPP-HHFSPLKSPKRPNFSNPSRSPPAD-AKNE-GAVIIMDQL

Query:  LEIFVPEIPPAGDGTTPTIQPQETVNSVAGEVLIFRSPISSEAMEEAINGAEDEALFRNDSVLDRIEENKPLSNLQSGNMRRRCLDFEMAGKPNTAVVAV
        LEIFVPEIP A   +TP I P ETV+S AGEV+IF  PIS+EAMEE     EDEALF+ND  L  IEENKPLSNL SGNMRRRCLDFEMAG P +A V  
Subjt:  LEIFVPEIPPAGDGTTPTIQPQETVNSVAGEVLIFRSPISSEAMEEAINGAEDEALFRNDSVLDRIEENKPLSNLQSGNMRRRCLDFEMAGKPNTAVVAV

Query:  EATADASIPA---SSSSPFRCTLPGIGLHLNALAATLKHSDSENLCSDRQPSLPSSSAPIFVPNSTRDQPLIASSTPESEKPPQEPANLTGEELNQINPK
          T DASIPA   SSSS FR TLP IGLHLNALAATLKHSDS+N+CSD QPS PSSSAPIF  NST DQ L+ASSTPESE PPQE ANLTGEE++  NPK
Subjt:  EATADASIPA---SSSSPFRCTLPGIGLHLNALAATLKHSDSENLCSDRQPSLPSSSAPIFVPNSTRDQPLIASSTPESEKPPQEPANLTGEELNQINPK

Query:  KNWKLMENAGIGACKRCNCKKSRCLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESRN
         NW+ MENAG+GACKRCNCKKSRCLKL                                YCECFAAGVYCIEPCSCQ CFNKPIHEA+VLETRRQIESRN
Subjt:  KNWKLMENAGIGACKRCNCKKSRCLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESRN

Query:  PLAFAPKVILNCDSVSELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQKHAEVQS
        PLAFAPKVI+NCD +SEL DDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESAL+G E EQEEEGREHCQK  +V S
Subjt:  PLAFAPKVILNCDSVSELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQKHAEVQS

Query:  DEDHQNPSNAAPSTPLGPCRSLIPFPFQLKRRLPSFLNDESSSRLSVRFKLEKHGITQTEPKFEKTPCEDVMPETLRNGCPSSTGVKSVSPNSKRVTLPP
        DED QNPSNAAPSTPLGP RSLIPFPFQLKRRLPSFLNDESSSRLSVRFKLEKHGITQTEPKFEKTPCEDV PETL NGCPS TGVKSVSPNSKR+TLP 
Subjt:  DEDHQNPSNAAPSTPLGPCRSLIPFPFQLKRRLPSFLNDESSSRLSVRFKLEKHGITQTEPKFEKTPCEDVMPETLRNGCPSSTGVKSVSPNSKRVTLPP

Query:  PQGDFRPLPSTRI
        PQ DFRPLPSTRI
Subjt:  PQGDFRPLPSTRI

A0A1S3BP37 CRC domain-containing protein TSO1 (e value: 1.8e-268; % identity: 82.44)
Query:  LSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPP-HHFSPLKSPKRPNFSNPSRSPPADAKNE-GAVIIMDQLLEIFVPEIPPAGDGT
        LSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPP HHFSPLKSPKRPNFSNPSRSPP  +KNE GAVIIMDQLLEIFVPEIP A   +
Subjt:  LSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPP-HHFSPLKSPKRPNFSNPSRSPPADAKNE-GAVIIMDQLLEIFVPEIPPAGDGT

Query:  TPTIQPQETVNSVAGEVLIFRSPISSEAMEEAINGAEDEALFRNDSVLDRIEENKPLSNLQSGNMRRRCLDFEMAGKPNTAVVAVEATADASIP---ASS
        TP I PQETV+S AGEV++FR PIS+EA E+     EDEALF+ND VLD IEENK LSNLQSGNMRRRCLDFEMAG PN+A V   A  DASIP   +SS
Subjt:  TPTIQPQETVNSVAGEVLIFRSPISSEAMEEAINGAEDEALFRNDSVLDRIEENKPLSNLQSGNMRRRCLDFEMAGKPNTAVVAVEATADASIP---ASS

Query:  SSPFRCTLPGIGLHLNALAATLKHSDSENLCSDRQPSLPSSSAPIFVPNSTRDQPLIASSTPESEKPPQEPANLTGEELNQINPKKNWKLMENAGIGACK
        SS  R TLPGIGLHLN+LAATLK+SDS+NLCSD QPSLPSSSA IF  NSTRDQ L+ASSTP+SE PPQE ANLTGEELNQINPKKNWKLMENAGIGACK
Subjt:  SSPFRCTLPGIGLHLNALAATLKHSDSENLCSDRQPSLPSSSAPIFVPNSTRDQPLIASSTPESEKPPQEPANLTGEELNQINPKKNWKLMENAGIGACK

Query:  RCNCKKSRCLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESRNPLAFAPKVILNCDSV
        RCNCKKSRCLKL                                YCECFAAGVYCIEPCSCQ CFNKPIHEA+VLETRRQIESRNPLAFAPKVI+N DSV
Subjt:  RCNCKKSRCLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESRNPLAFAPKVILNCDSV

Query:  SELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQKHAEVQSDEDHQNPSNAAPSTP
        SELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIG E EQEEEGREHCQK  ++QSDED QNPSNAAPSTP
Subjt:  SELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQKHAEVQSDEDHQNPSNAAPSTP

Query:  LGPCRSLIPFPFQLKRRLPSFLNDESSSRLSVRFKLEKHGITQTEPKFEKTPCEDVMPETLRNGCPSSTGVKSVSPNSKRVTLPPPQGDFRPLPSTRI
        LGP RSLIPFPFQLKRRLPSFLNDESSSRLSVRFKLEKHGITQTEPKFEKTPCEDV PETL NGCPS T VKSVSPNSKRVTLPPPQ DFRPLPSTRI
Subjt:  LGPCRSLIPFPFQLKRRLPSFLNDESSSRLSVRFKLEKHGITQTEPKFEKTPCEDVMPETLRNGCPSSTGVKSVSPNSKRVTLPPPQGDFRPLPSTRI

A0A5D3E4C7 CRC domain-containing protein TSO1 (e value: 8.3e-274; % identity: 82.35)
Query:  DSTPQKKPTKLPP-LSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPP-HHFSPLKSPKRPNFSNPSRSPPADAKNE-GAVIIMDQLL
        DSTP+KKP KLPP LSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPP HHFSPLKSPKRPNFSNPSRSPP  +KNE GAVIIMDQLL
Subjt:  DSTPQKKPTKLPP-LSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPP-HHFSPLKSPKRPNFSNPSRSPPADAKNE-GAVIIMDQLL

Query:  EIFVPEIPPAGDGTTPTIQPQETVNSVAGEVLIFRSPISSEAMEEAINGAEDEALFRNDSVLDRIEENKPLSNLQSGNMRRRCLDFEMAGKPNTAVVAVE
        EIFVPEIP A   +TP I PQETV+S AGEV++FR PIS+EA E+     EDEALF+ND VLD IEENK LSNLQSGNMRRRCLDFEMAG PN+A V   
Subjt:  EIFVPEIPPAGDGTTPTIQPQETVNSVAGEVLIFRSPISSEAMEEAINGAEDEALFRNDSVLDRIEENKPLSNLQSGNMRRRCLDFEMAGKPNTAVVAVE

Query:  ATADASIP---ASSSSPFRCTLPGIGLHLNALAATLKHSDSENLCSDRQPSLPSSSAPIFVPNSTRDQPLIASSTPESEKPPQEPANLTGEELNQINPKK
        A  DASIP   +SSSS  R TLPGIGLHLN+LAATLK+SDS+NLCSD QPSLPSSSA IF  NSTRDQ L+ASSTP+SE PPQE ANLTGEELNQINPKK
Subjt:  ATADASIP---ASSSSPFRCTLPGIGLHLNALAATLKHSDSENLCSDRQPSLPSSSAPIFVPNSTRDQPLIASSTPESEKPPQEPANLTGEELNQINPKK

Query:  NWKLMENAGIGACKRCNCKKSRCLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESRNP
        NWKLMENAGIGACKRCNCKKSRCLKL                                YCECFAAGVYCIEPCSCQ CFNKPIHEA+VLETRRQIESRNP
Subjt:  NWKLMENAGIGACKRCNCKKSRCLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESRNP

Query:  LAFAPKVILNCDSVSELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQKHAEVQSD
        LAFAPKVI+N DSVSELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIG E EQEEEGREHCQK  ++QSD
Subjt:  LAFAPKVILNCDSVSELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQKHAEVQSD

Query:  EDHQNPSNAAPSTPLGPCRSLIPFPFQLKRRLPSFLNDESSSRLSVRFKLEKHGITQTEPKFEKTPCEDVMPETLRNGCPSSTGVKSVSPNSKRVTLPPP
        ED QNPSNAAPSTPLGP RSLIPFPFQLKRRLPSFLNDESSSRLSVRFKLEKHGITQTEPKFEKTPCEDV PETL NGCPS T VKSVSPNSKRVTLPPP
Subjt:  EDHQNPSNAAPSTPLGPCRSLIPFPFQLKRRLPSFLNDESSSRLSVRFKLEKHGITQTEPKFEKTPCEDVMPETLRNGCPSSTGVKSVSPNSKRVTLPPP

Query:  QGDFRPLPSTRI
        Q DFRPLPSTRI
Subjt:  QGDFRPLPSTRI

A0A6J1ER31 CRC domain-containing protein TSO1-like (e value: 1.6e-224; % identity: 71.2)
Query:  MDSTPQKKPTKL-----PPLSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPPHHFSPLKSPKRPNFSNPSRSPPADAKNEGAVIIMD
        MDSTP+KKPT       PPL KFEDSPVFNFINSLSPIKP+KSIHITQTFNSISFPSLPVFT          SPKRPNF NPSRS P  + NEG VIIMD
Subjt:  MDSTPQKKPTKL-----PPLSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPPHHFSPLKSPKRPNFSNPSRSPPADAKNEGAVIIMD

Query:  QLLEIFVPEIPPAGDGTTPTIQPQETVNSVAGEVLIFRSPISSEAMEEAINGAEDEALFRNDSVLDRIEENKPLSNLQSGNMRRRCLDFEMAGKPNTAVV
        QLLEIFVPEIP A DG+TP +QP ETVN  AGEV      ISSEA+E   + AEDE     DS       NKPL NLQSGNM RRCLDFEMAG PNTA  
Subjt:  QLLEIFVPEIPPAGDGTTPTIQPQETVNSVAGEVLIFRSPISSEAMEEAINGAEDEALFRNDSVLDRIEENKPLSNLQSGNMRRRCLDFEMAGKPNTAVV

Query:  AVEATADASI-----PASSSSPFRCTLPGIGLHLNALAATLKHSDSENLCSDRQPSLPSSSAPIFVPNSTRDQPLIASSTPESEKPPQEPANLTGEELNQ
        A  A  D SI      +SSSS FRCTLPGIGLHLNA+AATLKH   ENL      S PSSSA I  PNS +DQPL+ASS  E E P  EPANL GE LN 
Subjt:  AVEATADASI-----PASSSSPFRCTLPGIGLHLNALAATLKHSDSENLCSDRQPSLPSSSAPIFVPNSTRDQPLIASSTPESEKPPQEPANLTGEELNQ

Query:  INPKKNWKLMENAGIGACKRCNCKKSRCLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQI
        INPKKN  LMENA IGACKRCNCKKS+CLKL                                YCECFAAGVYCI+PCSCQ CFN+PIHEA+VLETRRQI
Subjt:  INPKKNWKLMENAGIGACKRCNCKKSRCLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQI

Query:  ESRNPLAFAPKVILNCDSVSELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQKHA
        ESRNPLAFAPKVI+N DS SELGDDSNKTPASARHKRGCNCKKS CLKKYCECYQGGVGCSINCRCEGCKN FGRKDESA+I  +T+QEEEGREHC K A
Subjt:  ESRNPLAFAPKVILNCDSVSELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQKHA

Query:  EVQSDEDHQNPSNAAPSTPLGPCRSLIPFPFQLKRRLPSFLNDESSSRLSVRFKLEKHGITQTEPKFEKTPCEDVMPETLRNGCPSSTG-VKSVSPNSKR
        EVQ DED QNP +A PSTPLG CRSLIP PFQLKR+LPSF++DESS RLSVRFKLEK GI QTE +FEKTP  DVM ET+ NGC SSTG VKSVSPNSKR
Subjt:  EVQSDEDHQNPSNAAPSTPLGPCRSLIPFPFQLKRRLPSFLNDESSSRLSVRFKLEKHGITQTEPKFEKTPCEDVMPETLRNGCPSSTG-VKSVSPNSKR

Query:  VTLPPPQGDFRPLPSTRI
        +TLPPPQGDFRPLPSTRI
Subjt:  VTLPPPQGDFRPLPSTRI

A0A6J1KE48 protein tesmin/TSO1-like CXC 2 (e value: 4.2e-225; % identity: 71.82)
Query:  MDSTPQKKPT----KLPPLSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPPHHFSPLKSPKRPNFSNPSRSPPADAKNEGAVIIMDQ
        MDSTP+KKPT      PPL KFEDSPVFNFINSLSPIKP+KSIHITQTFNSISFPSLPVFT          SPKRPNF NPSRS P  + NEGAVIIMDQ
Subjt:  MDSTPQKKPT----KLPPLSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPPHHFSPLKSPKRPNFSNPSRSPPADAKNEGAVIIMDQ

Query:  LLEIFVPEIPPAGDGTTPTIQPQETVNSVAGEVLIFRSPISSEAMEEAINGAEDEALFRNDSVLDRIEENKPLSNLQSGNMRRRCLDFEMAGKPNTAVVA
        LLEIFVPEIP A    TP +QP ETVN  AGEV      ISSEAME   + AEDE     DS       NKPL NLQSGNM RRCLDFEMAG PNTA  A
Subjt:  LLEIFVPEIPPAGDGTTPTIQPQETVNSVAGEVLIFRSPISSEAMEEAINGAEDEALFRNDSVLDRIEENKPLSNLQSGNMRRRCLDFEMAGKPNTAVVA

Query:  VEATADASI--PASSSSPFRCTLPGIGLHLNALAATLKHSDSENLCSDRQPSLPSSSAPIFVPNSTRDQPLIASSTPESEKPPQEPANLTGEELNQINPK
          A  D SI   +SSSS FRCTLPGIGLHLNA+AATLKH   ENL      S PSSSA I  PNS +DQPL+ SS  E E P  EPANL GE LN INPK
Subjt:  VEATADASI--PASSSSPFRCTLPGIGLHLNALAATLKHSDSENLCSDRQPSLPSSSAPIFVPNSTRDQPLIASSTPESEKPPQEPANLTGEELNQINPK

Query:  KNWKLMENAGIGACKRCNCKKSRCLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESRN
        KN  LMENA IGACKRCNCKKS+CLKL                                YCECFAAGVYCI+PCSCQ CFN+PIHEA+VLETRRQIESRN
Subjt:  KNWKLMENAGIGACKRCNCKKSRCLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESRN

Query:  PLAFAPKVILNCDSVSELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQKHAEVQS
        PLAFAPKVI+N DS SELGDDSNKTPASARHKRGCNCKKS CLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESA+I  +T+QEEEGREHC K+AEVQ 
Subjt:  PLAFAPKVILNCDSVSELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQKHAEVQS

Query:  DEDHQNPSNAAPSTPLGPCRSLIPFPFQLKRRLPSFLNDESSSRLSVRFKLEKHGITQTEPKFEKTPCEDVMPETLRNGCPSSTG-VKSVSPNSKRVTLP
        DED QN ++A PSTPLG CRSLIP PFQLKR+LPSF++DESS RLSVRFKLEK GI QTE KFEKTP  DVMPET+ NGC SSTG VKSVSPNSKR+TLP
Subjt:  DEDHQNPSNAAPSTPLGPCRSLIPFPFQLKRRLPSFLNDESSSRLSVRFKLEKHGITQTEPKFEKTPCEDVMPETLRNGCPSSTG-VKSVSPNSKRVTLP

Query:  PPQGDFRPLPSTRI
        PPQGDFRPLPSTRI
Subjt:  PPQGDFRPLPSTRI

SwissProt top hits (e value, % identity, alignment)
F4JIF5 Protein tesmin/TSO1-like CXC 2 (e value: 8.2e-93; % identity: 40.32)
Query:  TPQKKPTKL-PPLSK--FEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLP-VFTSP----PHHFSPLKSPKRPNFSNPSRS-------------PP
        TPQK  T++  P+SK  FEDSPVFN+INSLSPI+PV+SI     F+S++F S P VFTSP     H  S        + S+P+ S              P
Subjt:  TPQKKPTKL-PPLSK--FEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLP-VFTSP----PHHFSPLKSPKRPNFSNPSRS-------------PP

Query:  ADAKNEGAVIIMDQLLE-----------------------IFVPEIPPAGD-------------------GTTPTIQPQETVNSVAGEVLIFRSPISSEA
        A+ ++   + I D + E                         VP  P  G+                   G T T    E++ + A E+LIF SP +SEA
Subjt:  ADAKNEGAVIIMDQLLE-----------------------IFVPEIPPAGD-------------------GTTPTIQPQETVNSVAGEVLIFRSPISSEA

Query:  -----MEEAINGAEDEALFRN---DSVLDRIEENKPLS---------NLQSGNMRRRCLDFEMAGKPNTAVVAVEATADASIPASSSSPFRCTLPGIGLH
             M+ A N    EA FRN      +      +P S         +L    +RRRCLDFEM G         + T+  +  A+  S  RC +P IGLH
Subjt:  -----MEEAINGAEDEALFRN---DSVLDRIEENKPLS---------NLQSGNMRRRCLDFEMAGKPNTAVVAVEATADASIPASSSSPFRCTLPGIGLH

Query:  LNALAATLKHSDSENLCSDRQPSLPSSSAPIFVPNSTRDQPLIASSTPESEKPPQE------PANLTGEELNQINPKKNWKLMENAGIGACKRCNCKKSR
        LNA+  + K       C        S SA I V    R    +  S  ++E   +E      P     +ELN  +PKK    +++    +CKRCNCKKS+
Subjt:  LNALAATLKHSDSENLCSDRQPSLPSSSAPIFVPNSTRDQPLIASSTPESEKPPQE------PANLTGEELNQINPKKNWKLMENAGIGACKRCNCKKSR

Query:  CLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESRNPLAFAPKVILNCDSVSELGDDSN
        CLKL                                YCECFAAGVYCIEPCSC  CFNKPIHE VVL TR+QIESRNPLAFAPKVI N DSV E GDD++
Subjt:  CLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESRNPLAFAPKVILNCDSVSELGDDSN

Query:  KTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQKHAEVQSDED-----HQNPSNAAPSTPLGP
        KTPASARHKRGCNCKKS CLKKYCECYQGGVGCSINCRCEGCKNAFGRKD S+ I +E EQEEE  E  +K    +S ++      ++ S+A P+TP   
Subjt:  KTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQKHAEVQSDED-----HQNPSNAAPSTPLGP

Query:  CR-SLIPFPF-QLKRRLP---SFLNDESSSRLSVRFKLEKHGITQTEPKFEKT------PCEDVMPETLRNGCPSSTGVKSVSPNSKRVTLP
         R  L+  PF   K R+P   S L   SSS +     L K  I+ ++ + EK+         + MPE L +       +KSVSPN KRV+ P
Subjt:  CR-SLIPFPF-QLKRRLP---SFLNDESSSRLSVRFKLEKHGITQTEPKFEKT------PCEDVMPETLRNGCPSSTGVKSVSPNSKRVTLP

O23714 Proteasome subunit beta type-2-A (e value: 1.1e-94; % identity: 89.06)
Query:  IVGNGFAIVAADSSAVHSILVHKSDEDKIMVLDSHKLVAASGEPGDRVQFTEYIQKNVALYQFRNGIPLTTAAAANFTRGELATALRKNPYSVNILLAGY
        +VGNGFAIVAAD+SAVHSIL+HK++EDKIMVLDSHKLVAASGEPGDRVQFTEY+QKNV+LY+FRNGIPLTTAAAANFTRGELATALRKNPYSVNIL+AGY
Subjt:  IVGNGFAIVAADSSAVHSILVHKSDEDKIMVLDSHKLVAASGEPGDRVQFTEYIQKNVALYQFRNGIPLTTAAAANFTRGELATALRKNPYSVNILLAGY

Query:  DKETGPSLYYIDYIATLHKVEKGAFGYGSYFSLSMMDRHYHSGMSVEEAIDLVDKCIIEIRSRLVVAPPNFVIKIVDKDGAREVAWRQSIKD
        D E+G SLYYIDYIATLHKV+KGAFGYGSYFSLS MDRHY S MSVEEAI+LVDKCI+EIRSRLVVAPPNFVIKIVDKDGAR+ AWRQS+KD
Subjt:  DKETGPSLYYIDYIATLHKVEKGAFGYGSYFSLSMMDRHYHSGMSVEEAIDLVDKCIIEIRSRLVVAPPNFVIKIVDKDGAREVAWRQSIKD

O24633 Proteasome subunit beta type-2-B (e value: 3.1e-92; % identity: 87.5)
Query:  IVGNGFAIVAADSSAVHSILVHKSDEDKIMVLDSHKLVAASGEPGDRVQFTEYIQKNVALYQFRNGIPLTTAAAANFTRGELATALRKNPYSVNILLAGY
        +VGNGFAIVAAD+SAVHSIL+HK+ EDKIM LDSHKLVAASGEPGDRVQFTEY+QKNV+LYQFRNGIPL+TAAAANFTRGELATALRKNPYSVNIL+AGY
Subjt:  IVGNGFAIVAADSSAVHSILVHKSDEDKIMVLDSHKLVAASGEPGDRVQFTEYIQKNVALYQFRNGIPLTTAAAANFTRGELATALRKNPYSVNILLAGY

Query:  DKETGPSLYYIDYIATLHKVEKGAFGYGSYFSLSMMDRHYHSGMSVEEAIDLVDKCIIEIRSRLVVAPPNFVIKIVDKDGAREVAWRQSIKD
        DKE G SLYYIDYIATLHKV+KGAFGYGSYFSLS MDRHY S MSVEEAI+LVDKCI+EIRSRLV+APPNFVIKIVDKDGARE  WR S  D
Subjt:  DKETGPSLYYIDYIATLHKVEKGAFGYGSYFSLSMMDRHYHSGMSVEEAIDLVDKCIIEIRSRLVVAPPNFVIKIVDKDGAREVAWRQSIKD

Q8L548 Protein tesmin/TSO1-like CXC 3 (e value: 5.0e-82; % identity: 39.25)
Query:  TPQKKPTKL-PPLSKF--EDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLP-VFTSP---PHHFSPLKSPK-------------------RPNFSNP
        TP+K  T++  P+SK   EDSPVF++I +LSPIK +K I IT   +S+++ S P VFTSP    H  S  +S K                     ++ N 
Subjt:  TPQKKPTKL-PPLSKF--EDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLP-VFTSP---PHHFSPLKSPK-------------------RPNFSNP

Query:  SRSPPA--DAKNEGAVIIMDQLLEIFVPEIPPAGDGTTPTIQPQETVNSVAGEVLIFRSPISSEAM----------EEAING---AEDEALFRNDSVLDR
          +P    D K+ G      + L++ +  +    D  TP     ET+ +   E LI+ SP  SEA           E  + G   A   A+   D V + 
Subjt:  SRSPPA--DAKNEGAVIIMDQLLEIFVPEIPPAGDGTTPTIQPQETVNSVAGEVLIFRSPISSEAM----------EEAING---AEDEALFRNDSVLDR

Query:  IEENKPLSNLQSGNMRRRCLDFEMAGKPNTAVVAVEATADASIPASSSSPFRCTLPGIGLHLNALAATLKHSDSEN---LCSDRQPSLPSSSAPIFVPNS
         E    LS L  G +RRRCLDFE+ G               ++  SSSS   C +P IGLHLN +A + K  +  N      + +  + SS  P+   +S
Subjt:  IEENKPLSNLQSGNMRRRCLDFEMAGKPNTAVVAVEATADASIPASSSSPFRCTLPGIGLHLNALAATLKHSDSEN---LCSDRQPSLPSSSAPIFVPNS

Query:  TRDQPLIASSTPESEKPPQ-EPANLTGEELNQINPKKNWKLMENAGIG--ACKRCNCKKSRCLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCE
          D      S  +S +  +  P +L   +L  I+PKK  +  E +G G  +CKRCNCKKS+CLKL                                YCE
Subjt:  TRDQPLIASSTPESEKPPQ-EPANLTGEELNQINPKKNWKLMENAGIG--ACKRCNCKKSRCLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCE

Query:  CFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESRNPLAFAPKVILNCDSVSELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRC
        CFAAG YCIEPCSC  CFNKPIH+ VVL TR+QIESRNPLAFAPKVI N DS+ E+G+D++KTPASARHKRGCNCKKS CLKKYCECYQGGVGCSINCRC
Subjt:  CFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESRNPLAFAPKVILNCDSVSELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRC

Query:  EGCKNAFGRKDESALIGIETEQEEEGREHCQKHAEVQSDEDHQNPSNAAPSTPLGPCRSLIPFPFQLKRRL----PSFLNDESSSRLSVRFKLEKHGITQ
        EGCKNAFGRKD S L   + E E  G    +K    Q + +   P+ A PSTP+   + L   P     RL      F +    S  S  + + K  ++ 
Subjt:  EGCKNAFGRKDESALIGIETEQEEEGREHCQKHAEVQSDEDHQNPSNAAPSTPLGPCRSLIPFPFQLKRRL----PSFLNDESSSRLSVRFKLEKHGITQ

Query:  TEPKFEKTPCEDV--MPETLRNGCPSSTGVKSVSPNSKRVTL
              +T  ED+  M E L +     + + ++SPNSKRV+L
Subjt:  TEPKFEKTPCEDV--MPETLRNGCPSSTGVKSVSPNSKRVTL

Q9LUI3 CRC domain-containing protein TSO1 (e value: 1.6e-101; % identity: 40.8)
Query:  QKKPTK----LPPLSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPPHHFSPLKSPKRPNFSNPSRSPPADA------KNEGAVIIMD
        QK PT       P SKFEDSPVFN+I++LSPI+ VKSI   QTF+S+SF S P   + PH  S  +S      ++  RS   ++      K E  V +++
Subjt:  QKKPTK----LPPLSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPPHHFSPLKSPKRPNFSNPSRSPPADA------KNEGAVIIMD

Query:  QL---------------LEIFVPEI--------------------------PPAGDG------TTPTIQPQETVN------------SVAGEVLIFRSPI
         L                   +P+I                          PP GD        T  +Q    V             S A E+L+FRSP 
Subjt:  QL---------------LEIFVPEI--------------------------PPAGDG------TTPTIQPQETVN------------SVAGEVLIFRSPI

Query:  SSEA---MEEAINGAED------EALFRNDSVLD-----RIEENKPLS---------NLQSGNMRRRCLDFEMAGKPNTAVV-AVEATADASIPASSSSP
         SEA   + + I+ +E       ++  R D   D        EN+PL+         NL  G MRRRCLDFEM GK    +V   ++  D ++   SSS 
Subjt:  SSEA---MEEAINGAED------EALFRNDSVLD-----RIEENKPLS---------NLQSGNMRRRCLDFEMAGKPNTAVV-AVEATADASIPASSSSP

Query:  FRCTLPGIGLHLNALAATLKHSD-----SENLCSDRQPSLPSSSAPIFVPNSTRDQPLIASSTPESEKPPQEPANLTGEELNQINPKKNWKLMENAGIG-
          C +PGIGLHLNA+A + K S+       ++  + Q S   S+ PI     ++D     S   E+E   + P  L   ELN  + KK  +  E AG G 
Subjt:  FRCTLPGIGLHLNALAATLKHSD-----SENLCSDRQPSLPSSSAPIFVPNSTRDQPLIASSTPESEKPPQEPANLTGEELNQINPKKNWKLMENAGIG-

Query:  ACKRCNCKKSRCLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESRNPLAFAPKVILNC
        +CKRCNCKKS+CLKL                                YCECFAAGVYCIEPCSC  CFNKPIHE  VL TR+QIESRNPLAFAPKVI N 
Subjt:  ACKRCNCKKSRCLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESRNPLAFAPKVILNC

Query:  DSVSELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQKHAEVQ-----SDEDHQNP
        DS+ E  DD++KTPASARHKRGCNCKKS C+KKYCECYQGGVGCS+NCRCEGC N FGRKD S L+ +E++ EE    + ++ A++Q     S E  QNP
Subjt:  DSVSELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQKHAEVQ-----SDEDHQNP

Query:  SNAAPSTPLGPCRSLIPF-PFQLKRRLPS---FLNDESSSRLSVRFKLEKHGITQTEPKFEKTPCED---VMPETLRNGCPSSTGVKSVSPNSKRVTLPP
        S+  PSTPL P R L+   PF  K RLP    FL   SSS       L +   +Q E K  +T  ED   +MPE L N       +K++SPNSKRV+ P 
Subjt:  SNAAPSTPLGPCRSLIPF-PFQLKRRLPS---FLNDESSSRLSVRFKLEKHGITQTEPKFEKTPCED---VMPETLRNGCPSSTGVKSVSPNSKRVTLPP

Query:  P
        P
Subjt:  P

Arabidopsis top hits (e value, % identity, alignment)
AT3G22630.1 20S proteasome beta subunit D1 (e value: 8.1e-96; % identity: 89.06)
Query:  IVGNGFAIVAADSSAVHSILVHKSDEDKIMVLDSHKLVAASGEPGDRVQFTEYIQKNVALYQFRNGIPLTTAAAANFTRGELATALRKNPYSVNILLAGY
        +VGNGFAIVAAD+SAVHSIL+HK++EDKIMVLDSHKLVAASGEPGDRVQFTEY+QKNV+LY+FRNGIPLTTAAAANFTRGELATALRKNPYSVNIL+AGY
Subjt:  IVGNGFAIVAADSSAVHSILVHKSDEDKIMVLDSHKLVAASGEPGDRVQFTEYIQKNVALYQFRNGIPLTTAAAANFTRGELATALRKNPYSVNILLAGY

Query:  DKETGPSLYYIDYIATLHKVEKGAFGYGSYFSLSMMDRHYHSGMSVEEAIDLVDKCIIEIRSRLVVAPPNFVIKIVDKDGAREVAWRQSIKD
        D E+G SLYYIDYIATLHKV+KGAFGYGSYFSLS MDRHY S MSVEEAI+LVDKCI+EIRSRLVVAPPNFVIKIVDKDGAR+ AWRQS+KD
Subjt:  DKETGPSLYYIDYIATLHKVEKGAFGYGSYFSLSMMDRHYHSGMSVEEAIDLVDKCIIEIRSRLVVAPPNFVIKIVDKDGAREVAWRQSIKD

AT3G22780.1 Tesmin/TSO1-like CXC domain-containing protein (e value: 1.2e-102; % identity: 40.8)
Query:  QKKPTK----LPPLSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPPHHFSPLKSPKRPNFSNPSRSPPADA------KNEGAVIIMD
        QK PT       P SKFEDSPVFN+I++LSPI+ VKSI   QTF+S+SF S P   + PH  S  +S      ++  RS   ++      K E  V +++
Subjt:  QKKPTK----LPPLSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPPHHFSPLKSPKRPNFSNPSRSPPADA------KNEGAVIIMD

Query:  QL---------------LEIFVPEI--------------------------PPAGDG------TTPTIQPQETVN------------SVAGEVLIFRSPI
         L                   +P+I                          PP GD        T  +Q    V             S A E+L+FRSP 
Subjt:  QL---------------LEIFVPEI--------------------------PPAGDG------TTPTIQPQETVN------------SVAGEVLIFRSPI

Query:  SSEA---MEEAINGAED------EALFRNDSVLD-----RIEENKPLS---------NLQSGNMRRRCLDFEMAGKPNTAVV-AVEATADASIPASSSSP
         SEA   + + I+ +E       ++  R D   D        EN+PL+         NL  G MRRRCLDFEM GK    +V   ++  D ++   SSS 
Subjt:  SSEA---MEEAINGAED------EALFRNDSVLD-----RIEENKPLS---------NLQSGNMRRRCLDFEMAGKPNTAVV-AVEATADASIPASSSSP

Query:  FRCTLPGIGLHLNALAATLKHSD-----SENLCSDRQPSLPSSSAPIFVPNSTRDQPLIASSTPESEKPPQEPANLTGEELNQINPKKNWKLMENAGIG-
          C +PGIGLHLNA+A + K S+       ++  + Q S   S+ PI     ++D     S   E+E   + P  L   ELN  + KK  +  E AG G 
Subjt:  FRCTLPGIGLHLNALAATLKHSD-----SENLCSDRQPSLPSSSAPIFVPNSTRDQPLIASSTPESEKPPQEPANLTGEELNQINPKKNWKLMENAGIG-

Query:  ACKRCNCKKSRCLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESRNPLAFAPKVILNC
        +CKRCNCKKS+CLKL                                YCECFAAGVYCIEPCSC  CFNKPIHE  VL TR+QIESRNPLAFAPKVI N 
Subjt:  ACKRCNCKKSRCLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESRNPLAFAPKVILNC

Query:  DSVSELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQKHAEVQ-----SDEDHQNP
        DS+ E  DD++KTPASARHKRGCNCKKS C+KKYCECYQGGVGCS+NCRCEGC N FGRKD S L+ +E++ EE    + ++ A++Q     S E  QNP
Subjt:  DSVSELGDDSNKTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQKHAEVQ-----SDEDHQNP

Query:  SNAAPSTPLGPCRSLIPF-PFQLKRRLPS---FLNDESSSRLSVRFKLEKHGITQTEPKFEKTPCED---VMPETLRNGCPSSTGVKSVSPNSKRVTLPP
        S+  PSTPL P R L+   PF  K RLP    FL   SSS       L +   +Q E K  +T  ED   +MPE L N       +K++SPNSKRV+ P 
Subjt:  SNAAPSTPLGPCRSLIPF-PFQLKRRLPS---FLNDESSSRLSVRFKLEKHGITQTEPKFEKTPCED---VMPETLRNGCPSSTGVKSVSPNSKRVTLPP

Query:  P
        P
Subjt:  P

AT4G14770.1 TESMIN/TSO1-like CXC 2    5.8e-94    40.32
Query:  TPQKKPTKL-PPLSK--FEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLP-VFTSP----PHHFSPLKSPKRPNFSNPSRS-------------PP
        TPQK  T++  P+SK  FEDSPVFN+INSLSPI+PV+SI     F+S++F S P VFTSP     H  S        + S+P+ S              P
Subjt:  TPQKKPTKL-PPLSK--FEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLP-VFTSP----PHHFSPLKSPKRPNFSNPSRS-------------PP

Query:  ADAKNEGAVIIMDQLLE-----------------------IFVPEIPPAGD-------------------GTTPTIQPQETVNSVAGEVLIFRSPISSEA
        A+ ++   + I D + E                         VP  P  G+                   G T T    E++ + A E+LIF SP +SEA
Subjt:  ADAKNEGAVIIMDQLLE-----------------------IFVPEIPPAGD-------------------GTTPTIQPQETVNSVAGEVLIFRSPISSEA

Query:  -----MEEAINGAEDEALFRN---DSVLDRIEENKPLS---------NLQSGNMRRRCLDFEMAGKPNTAVVAVEATADASIPASSSSPFRCTLPGIGLH
             M+ A N    EA FRN      +      +P S         +L    +RRRCLDFEM G         + T+  +  A+  S  RC +P IGLH
Subjt:  -----MEEAINGAEDEALFRN---DSVLDRIEENKPLS---------NLQSGNMRRRCLDFEMAGKPNTAVVAVEATADASIPASSSSPFRCTLPGIGLH

Query:  LNALAATLKHSDSENLCSDRQPSLPSSSAPIFVPNSTRDQPLIASSTPESEKPPQE------PANLTGEELNQINPKKNWKLMENAGIGACKRCNCKKSR
        LNA+  + K       C        S SA I V    R    +  S  ++E   +E      P     +ELN  +PKK    +++    +CKRCNCKKS+
Subjt:  LNALAATLKHSDSENLCSDRQPSLPSSSAPIFVPNSTRDQPLIASSTPESEKPPQE------PANLTGEELNQINPKKNWKLMENAGIGACKRCNCKKSR

Query:  CLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESRNPLAFAPKVILNCDSVSELGDDSN
        CLKL                                YCECFAAGVYCIEPCSC  CFNKPIHE VVL TR+QIESRNPLAFAPKVI N DSV E GDD++
Subjt:  CLKLERCQEFTVATVYALRGVLITSKFQSKLVIFFRYCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESRNPLAFAPKVILNCDSVSELGDDSN

Query:  KTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQKHAEVQSDED-----HQNPSNAAPSTPLGP
        KTPASARHKRGCNCKKS CLKKYCECYQGGVGCSINCRCEGCKNAFGRKD S+ I +E EQEEE  E  +K    +S ++      ++ S+A P+TP   
Subjt:  KTPASARHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQKHAEVQSDED-----HQNPSNAAPSTPLGP

Query:  CR-SLIPFPF-QLKRRLP---SFLNDESSSRLSVRFKLEKHGITQTEPKFEKT------PCEDVMPETLRNGCPSSTGVKSVSPNSKRVTLP
         R  L+  PF   K R+P   S L   SSS +     L K  I+ ++ + EK+         + MPE L +       +KSVSPN KRV+ P
Subjt:  CR-SLIPFPF-QLKRRLP---SFLNDESSSRLSVRFKLEKHGITQTEPKFEKT------PCEDVMPETLRNGCPSSTGVKSVSPNSKRVTLP

AT4G14800.1 20S proteasome beta subunit D2    2.2e-93    87.5
Query:  IVGNGFAIVAADSSAVHSILVHKSDEDKIMVLDSHKLVAASGEPGDRVQFTEYIQKNVALYQFRNGIPLTTAAAANFTRGELATALRKNPYSVNILLAGY
        +VGNGFAIVAAD+SAVHSIL+HK+ EDKIM LDSHKLVAASGEPGDRVQFTEY+QKNV+LYQFRNGIPL+TAAAANFTRGELATALRKNPYSVNIL+AGY
Subjt:  IVGNGFAIVAADSSAVHSILVHKSDEDKIMVLDSHKLVAASGEPGDRVQFTEYIQKNVALYQFRNGIPLTTAAAANFTRGELATALRKNPYSVNILLAGY

Query:  DKETGPSLYYIDYIATLHKVEKGAFGYGSYFSLSMMDRHYHSGMSVEEAIDLVDKCIIEIRSRLVVAPPNFVIKIVDKDGAREVAWRQSIKD
        DKE G SLYYIDYIATLHKV+KGAFGYGSYFSLS MDRHY S MSVEEAI+LVDKCI+EIRSRLV+APPNFVIKIVDKDGARE  WR S  D
Subjt:  DKETGPSLYYIDYIATLHKVEKGAFGYGSYFSLSMMDRHYHSGMSVEEAIDLVDKCIIEIRSRLVVAPPNFVIKIVDKDGAREVAWRQSIKD

AT4G14800.2 20S proteasome beta subunit D2    3.0e-90    80.77
Query:  IVGNGFAIVAADSSAVHSILVHKSDEDKIMVLDSHKLVAASGEPGDRVQFTEYIQKNVALYQFRNGIPLTTAAAANFTRGELATALRK------------
        +VGNGFAIVAAD+SAVHSIL+HK+ EDKIM LDSHKLVAASGEPGDRVQFTEY+QKNV+LYQFRNGIPL+TAAAANFTRGELATALRK            
Subjt:  IVGNGFAIVAADSSAVHSILVHKSDEDKIMVLDSHKLVAASGEPGDRVQFTEYIQKNVALYQFRNGIPLTTAAAANFTRGELATALRK------------

Query:  ----NPYSVNILLAGYDKETGPSLYYIDYIATLHKVEKGAFGYGSYFSLSMMDRHYHSGMSVEEAIDLVDKCIIEIRSRLVVAPPNFVIKIVDKDGAREV
            NPYSVNIL+AGYDKE G SLYYIDYIATLHKV+KGAFGYGSYFSLS MDRHY S MSVEEAI+LVDKCI+EIRSRLV+APPNFVIKIVDKDGARE 
Subjt:  ----NPYSVNILLAGYDKETGPSLYYIDYIATLHKVEKGAFGYGSYFSLSMMDRHYHSGMSVEEAIDLVDKCIIEIRSRLVVAPPNFVIKIVDKDGAREV

Query:  AWRQSIKD
         WR S  D
Subjt:  AWRQSIKD


Sequences
CDS sequence
ATGGATTCCACTCCTCAGAAAAAACCCACCAAACTCCCTCCTCTCTCCAAATTTGAGGATTCACCAGTTTTCAACTTCATCAACAGTCTCTCTCCCATAAAACCTGTTAA
ATCCATTCACATTACTCAAACATTCAACTCCATCTCTTTCCCTTCCCTTCCTGTTTTCACTTCCCCACCACACCATTTTAGTCCTCTAAAGTCCCCAAAAAGGCCGAATT
TTTCGAACCCATCGAGATCCCCGCCTGCAGATGCTAAAAATGAAGGGGCGGTTATAATAATGGATCAGTTACTGGAGATTTTCGTGCCGGAAATTCCTCCCGCCGGCGAC
GGTACAACTCCAACAATTCAGCCACAGGAAACGGTCAATTCCGTCGCCGGCGAGGTTCTGATATTTCGTTCTCCGATTAGCTCTGAAGCTATGGAGGAGGCGATTAATGG
AGCGGAAGATGAGGCATTGTTTCGAAACGACAGCGTTTTAGATAGAATAGAGGAGAACAAGCCTCTCTCTAACCTACAGAGCGGGAACATGCGGAGGCGATGCCTAGATT
TCGAAATGGCGGGAAAACCCAACACGGCGGTGGTTGCGGTGGAGGCGACCGCCGATGCTTCAATCCCCGCTTCTTCTTCTTCTCCATTTCGATGTACTCTGCCTGGTATC
GGTTTGCACTTGAATGCTCTTGCAGCGACTTTGAAGCACTCCGATTCAGAGAATCTGTGTTCCGATAGGCAGCCGAGTCTTCCCAGCTCCTCTGCCCCCATTTTTGTCCC
AAATTCCACTCGAGATCAACCCTTGATAGCTTCATCCACTCCTGAAAGTGAAAAACCTCCTCAGGAACCTGCGAATCTGACTGGAGAAGAACTCAATCAGATCAACCCTA
AAAAGAATTGGAAATTGATGGAAAATGCTGGAATTGGGGCTTGTAAACGTTGTAACTGTAAGAAATCTAGGTGCCTGAAGCTTGAAAGATGCCAAGAGTTCACTGTTGCA
ACAGTTTATGCACTTAGAGGAGTGTTGATAACATCAAAATTTCAATCTAAACTAGTAATTTTCTTCAGATATTGTGAATGCTTTGCTGCTGGCGTGTACTGCATTGAGCC
ATGTTCATGTCAAGGCTGCTTCAACAAACCGATACATGAAGCCGTGGTTCTTGAGACTCGCAGACAGATTGAATCTCGCAATCCACTTGCGTTTGCTCCTAAAGTGATCC
TGAACTGCGATTCTGTTTCTGAACTTGGGGACGATTCAAACAAAACTCCCGCTTCGGCACGACATAAACGAGGATGCAACTGTAAGAAATCAGGTTGCTTGAAAAAATAC
TGCGAATGTTATCAGGGTGGCGTTGGATGCTCCATCAACTGCAGATGTGAAGGCTGTAAAAATGCATTTGGGAGGAAAGATGAATCAGCTCTAATAGGAATTGAAACTGA
ACAAGAGGAAGAAGGAAGAGAACATTGCCAAAAACATGCAGAAGTACAGAGCGATGAAGATCATCAGAACCCAAGCAATGCTGCTCCCTCGACACCACTAGGACCATGCA
GATCATTGATTCCATTTCCATTCCAATTGAAGAGGAGACTTCCATCTTTCCTCAACGACGAATCCTCCTCTAGATTGAGCGTTCGATTCAAACTGGAGAAGCATGGCATT
ACTCAAACAGAACCCAAGTTTGAGAAAACACCCTGTGAGGATGTGATGCCAGAAACACTTCGTAATGGTTGCCCTTCTAGCACAGGTGTTAAGAGTGTTTCTCCCAACAG
TAAGAGGGTTACTTTACCCCCACCGCAGGGCGATTTCAGGCCGCTGCCCTCGACTAGAATTGAATTTACAAGACGACGTGTTGCTCAAGTCGCTGCCGGTGGCACTTTGT
TAGAGCATACGATCGTGGGTAATGGTTTCGCCATTGTTGCCGCTGATTCTTCGGCGGTACACAGCATTTTGGTCCATAAATCCGACGAGGACAAGATTATGGTCCTCGAC
TCTCACAAGCTCGTCGCTGCTAGCGGCGAGCCTGGTGACAGGGTTCAATTCACCGAATACATCCAGAAGAATGTGGCGTTGTATCAGTTCCGAAATGGGATCCCATTGAC
AACTGCTGCTGCTGCTAATTTTACTCGAGGCGAACTCGCCACTGCATTGAGAAAGAATCCATACTCTGTAAATATCCTCCTGGCTGGCTATGATAAGGAGACTGGCCCGT
CGCTTTACTACATCGATTACATTGCGACGCTTCACAAGGTTGAGAAAGGTGCTTTTGGTTATGGATCCTACTTTTCACTTTCTATGATGGATAGGCATTACCATAGCGGC
ATGTCAGTCGAGGAAGCAATCGACTTGGTCGATAAATGCATCATCGAGATACGTTCAAGGCTGGTAGTGGCTCCACCGAACTTTGTGATCAAGATCGTGGACAAGGATGG
GGCAAGAGAGGTTGCATGGCGTCAATCCATTAAAGATACTGGAGCTCTTCCAGTTTGA
mRNA sequence
ATCTACCTTACCCTTACACTCATATTTTCCCGCTCTCTCTTCTCTTCAACTTCCCATCTTCATTTTCAACTCCTCAACCCCTTCTTCTTCTCCACACACTCAAATGACGA
AACCTCTGCTTTTCTGAACTTCACTCAACAATGGATTCCACTCCTCAGAAAAAACCCACCAAACTCCCTCCTCTCTCCAAATTTGAGGATTCACCAGTTTTCAACTTCAT
CAACAGTCTCTCTCCCATAAAACCTGTTAAATCCATTCACATTACTCAAACATTCAACTCCATCTCTTTCCCTTCCCTTCCTGTTTTCACTTCCCCACCACACCATTTTA
GTCCTCTAAAGTCCCCAAAAAGGCCGAATTTTTCGAACCCATCGAGATCCCCGCCTGCAGATGCTAAAAATGAAGGGGCGGTTATAATAATGGATCAGTTACTGGAGATT
TTCGTGCCGGAAATTCCTCCCGCCGGCGACGGTACAACTCCAACAATTCAGCCACAGGAAACGGTCAATTCCGTCGCCGGCGAGGTTCTGATATTTCGTTCTCCGATTAG
CTCTGAAGCTATGGAGGAGGCGATTAATGGAGCGGAAGATGAGGCATTGTTTCGAAACGACAGCGTTTTAGATAGAATAGAGGAGAACAAGCCTCTCTCTAACCTACAGA
GCGGGAACATGCGGAGGCGATGCCTAGATTTCGAAATGGCGGGAAAACCCAACACGGCGGTGGTTGCGGTGGAGGCGACCGCCGATGCTTCAATCCCCGCTTCTTCTTCT
TCTCCATTTCGATGTACTCTGCCTGGTATCGGTTTGCACTTGAATGCTCTTGCAGCGACTTTGAAGCACTCCGATTCAGAGAATCTGTGTTCCGATAGGCAGCCGAGTCT
TCCCAGCTCCTCTGCCCCCATTTTTGTCCCAAATTCCACTCGAGATCAACCCTTGATAGCTTCATCCACTCCTGAAAGTGAAAAACCTCCTCAGGAACCTGCGAATCTGA
CTGGAGAAGAACTCAATCAGATCAACCCTAAAAAGAATTGGAAATTGATGGAAAATGCTGGAATTGGGGCTTGTAAACGTTGTAACTGTAAGAAATCTAGGTGCCTGAAG
CTTGAAAGATGCCAAGAGTTCACTGTTGCAACAGTTTATGCACTTAGAGGAGTGTTGATAACATCAAAATTTCAATCTAAACTAGTAATTTTCTTCAGATATTGTGAATG
CTTTGCTGCTGGCGTGTACTGCATTGAGCCATGTTCATGTCAAGGCTGCTTCAACAAACCGATACATGAAGCCGTGGTTCTTGAGACTCGCAGACAGATTGAATCTCGCA
ATCCACTTGCGTTTGCTCCTAAAGTGATCCTGAACTGCGATTCTGTTTCTGAACTTGGGGACGATTCAAACAAAACTCCCGCTTCGGCACGACATAAACGAGGATGCAAC
TGTAAGAAATCAGGTTGCTTGAAAAAATACTGCGAATGTTATCAGGGTGGCGTTGGATGCTCCATCAACTGCAGATGTGAAGGCTGTAAAAATGCATTTGGGAGGAAAGA
TGAATCAGCTCTAATAGGAATTGAAACTGAACAAGAGGAAGAAGGAAGAGAACATTGCCAAAAACATGCAGAAGTACAGAGCGATGAAGATCATCAGAACCCAAGCAATG
CTGCTCCCTCGACACCACTAGGACCATGCAGATCATTGATTCCATTTCCATTCCAATTGAAGAGGAGACTTCCATCTTTCCTCAACGACGAATCCTCCTCTAGATTGAGC
GTTCGATTCAAACTGGAGAAGCATGGCATTACTCAAACAGAACCCAAGTTTGAGAAAACACCCTGTGAGGATGTGATGCCAGAAACACTTCGTAATGGTTGCCCTTCTAG
CACAGGTGTTAAGAGTGTTTCTCCCAACAGTAAGAGGGTTACTTTACCCCCACCGCAGGGCGATTTCAGGCCGCTGCCCTCGACTAGAATTGAATTTACAAGACGACGTG
TTGCTCAAGTCGCTGCCGGTGGCACTTTGTTAGAGCATACGATCGTGGGTAATGGTTTCGCCATTGTTGCCGCTGATTCTTCGGCGGTACACAGCATTTTGGTCCATAAA
TCCGACGAGGACAAGATTATGGTCCTCGACTCTCACAAGCTCGTCGCTGCTAGCGGCGAGCCTGGTGACAGGGTTCAATTCACCGAATACATCCAGAAGAATGTGGCGTT
GTATCAGTTCCGAAATGGGATCCCATTGACAACTGCTGCTGCTGCTAATTTTACTCGAGGCGAACTCGCCACTGCATTGAGAAAGAATCCATACTCTGTAAATATCCTCC
TGGCTGGCTATGATAAGGAGACTGGCCCGTCGCTTTACTACATCGATTACATTGCGACGCTTCACAAGGTTGAGAAAGGTGCTTTTGGTTATGGATCCTACTTTTCACTT
TCTATGATGGATAGGCATTACCATAGCGGCATGTCAGTCGAGGAAGCAATCGACTTGGTCGATAAATGCATCATCGAGATACGTTCAAGGCTGGTAGTGGCTCCACCGAA
CTTTGTGATCAAGATCGTGGACAAGGATGGGGCAAGAGAGGTTGCATGGCGTCAATCCATTAAAGATACTGGAGCTCTTCCAGTTTGAGGAGTTTGAGATTGTTGAACTT
GGGTTTTATTTTGACAATTTTTTTTCTCTGTTTTGTATACAAATGTTATGTCCTTACAAGACCTGGTTCATTAGTCTTGATTGGAAGTGCTTATGCTGCCACTTGAAGCA
GGTTTAAATTTAGGTAAATGAATTTGAAAATTTGAAGTTATGATTCATCGTCTACTATGGAACTACAACAATTTCTATTTTTG
Protein sequence
MDSTPQKKPTKLPPLSKFEDSPVFNFINSLSPIKPVKSIHITQTFNSISFPSLPVFTSPPHHFSPLKSPKRPNFSNPSRSPPADAKNEGAVIIMDQLLEIFVPEIPPAGD
GTTPTIQPQETVNSVAGEVLIFRSPISSEAMEEAINGAEDEALFRNDSVLDRIEENKPLSNLQSGNMRRRCLDFEMAGKPNTAVVAVEATADASIPASSSSPFRCTLPGI
GLHLNALAATLKHSDSENLCSDRQPSLPSSSAPIFVPNSTRDQPLIASSTPESEKPPQEPANLTGEELNQINPKKNWKLMENAGIGACKRCNCKKSRCLKLERCQEFTVA
TVYALRGVLITSKFQSKLVIFFRYCECFAAGVYCIEPCSCQGCFNKPIHEAVVLETRRQIESRNPLAFAPKVILNCDSVSELGDDSNKTPASARHKRGCNCKKSGCLKKY
CECYQGGVGCSINCRCEGCKNAFGRKDESALIGIETEQEEEGREHCQKHAEVQSDEDHQNPSNAAPSTPLGPCRSLIPFPFQLKRRLPSFLNDESSSRLSVRFKLEKHGI
TQTEPKFEKTPCEDVMPETLRNGCPSSTGVKSVSPNSKRVTLPPPQGDFRPLPSTRIEFTRRRVAQVAAGGTLLEHTIVGNGFAIVAADSSAVHSILVHKSDEDKIMVLD
SHKLVAASGEPGDRVQFTEYIQKNVALYQFRNGIPLTTAAAANFTRGELATALRKNPYSVNILLAGYDKETGPSLYYIDYIATLHKVEKGAFGYGSYFSLSMMDRHYHSG
MSVEEAIDLVDKCIIEIRSRLVVAPPNFVIKIVDKDGAREVAWRQSIKDTGALPV