CuGenDBv2

CSPI07G19570 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI07G19570
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: box C/D snoRNA protein 1-like
Genome location: Chr7:17159288..17162900
RNA-Seq Expression: CSPI07G19570
Synteny: CSPI07G19570
Gene Ontology terms: NA
InterPro domains: IPR007529 - Zinc finger, HIT-type
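
The genome location in this record follows a `chromosome:start..end` pattern. As a minimal sketch (the function name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption the record does not state), the string could be split and the locus span computed like this:

```python
import re

def parse_location(location):
    """Split a string such as 'Chr7:17159288..17162900' into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("Chr7:17159288..17162900")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 3,613 bp on Chr7.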


Homology
GenBank top hits | e value | % identity (alignment below each hit)
KAA0058659.1 box C/D snoRNA protein 1-like [Cucumis melo var. makuwa] | 1.7e-213 | 92.94
Query:  VAVSTSSNQQGSSLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAY
        +AVSTSSN QGSSLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAY
Subjt:  VAVSTSSNQQGSSLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAY

Query:  FRLPFHLKSLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRPSPWKTQLQKFYEQLDCLKF
        FRLPFHLKSLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNST+IVLVDHEVNENSKLSTIL NHLRPSPWKTQLQKF EQLDCLK 
Subjt:  FRLPFHLKSLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRPSPWKTQLQKFYEQLDCLKF

Query:  FVRTYPKGATSSFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPNALKNDLASHEGVCFRVEEIEEDENSCNPQVLDLMK
        FVRTYPKGA S F ELDSTLPIRQLFSNLAFVEYPVIYVVLP QTPNFEVVKTANP SRNLEG NAL+NDLASH GVCFRVEEIE+DENSCNPQVLDLMK
Subjt:  FVRTYPKGATSSFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPNALKNDLASHEGVCFRVEEIEEDENSCNPQVLDLMK

Query:  VSTSSPHCKVSPRNLHGATHSYSTGLVGKQEVGNSPKSSSQAREPGVVKELEFDFEQDLIDAYSNIMAQINPDDFLDWDGDFSKEVEMEGSGELLGDAFT
        VSTSSPHCKVSPRN           LVGKQEVGNSPKSSSQARE GVVKELEFDFEQDLIDAYSNIMAQINPDDFLDW+GDFSKEVEMEGSGELLGDAFT
Subjt:  VSTSSPHCKVSPRNLHGATHSYSTGLVGKQEVGNSPKSSSQAREPGVVKELEFDFEQDLIDAYSNIMAQINPDDFLDWDGDFSKEVEMEGSGELLGDAFT

Query:  VEELEEGEIME
         EELEEGEIME
Subjt:  VEELEEGEIME
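
In these BLAST-style blocks, the unlabeled middle line marks each aligned position: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. As a rough sketch (the helper name `identity_from_midline` is hypothetical, and this is not necessarily how the database computes its reported % identity), identities in a single block could be counted like this:

```python
def identity_from_midline(midline):
    """Return (identical positions, aligned positions) for one midline string.

    Letters mark identical residues; '+' and spaces are not identities.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return identical, len(midline)

# Final block of the hit above, with the 8-column label prefix removed.
query = "VEELEEGEIME"
midline = " EELEEGEIME"

# Pad the midline to the query length in case trailing spaces were stripped.
identical, aligned = identity_from_midline(midline.ljust(len(query)))
print(f"{identical}/{aligned} = {100 * identical / aligned:.2f}%")
```

For this 11-residue block the sketch prints `10/11 = 90.91%`. The percentage reported for a hit covers the whole alignment, so a full calculation would sum identities and aligned positions over every block of that hit.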

XP_008461252.1 PREDICTED: box C/D snoRNA protein 1-like [Cucumis melo] | 4.4e-217 | 93.06
Query:  MAEEDATVAVSTSSNQQGSSLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKL
        MAEEDAT+AVSTSSN QGSSLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKL
Subjt:  MAEEDATVAVSTSSNQQGSSLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKL

Query:  CPYTHAYFRLPFHLKSLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRPSPWKTQLQKFYE
        CPYTHAYFRLPFHLKSLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNST+IVLVDHEVNENSKLSTIL NHLRPSPWKTQLQKF E
Subjt:  CPYTHAYFRLPFHLKSLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRPSPWKTQLQKFYE

Query:  QLDCLKFFVRTYPKGATSSFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPNALKNDLASHEGVCFRVEEIEEDENSCNP
        QLDCLK FVRTYPKGA S F ELDSTLPIRQLFSNLAFVEYPVIYVVLP QTPNFEVVKTANP SRNLEG NAL+NDLASH GVCFRVEEIE+DENSCNP
Subjt:  QLDCLKFFVRTYPKGATSSFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPNALKNDLASHEGVCFRVEEIEEDENSCNP

Query:  QVLDLMKVSTSSPHCKVSPRNLHGATHSYSTGLVGKQEVGNSPKSSSQAREPGVVKELEFDFEQDLIDAYSNIMAQINPDDFLDWDGDFSKEVEMEGSGE
        QVLDLMKVSTSSPHCKVSPRN           LVGKQEVGNSPKSSSQARE GVVKELEFDFEQDLIDAYSNIMAQINPDDFLDW+GDFSKEVEMEGSGE
Subjt:  QVLDLMKVSTSSPHCKVSPRNLHGATHSYSTGLVGKQEVGNSPKSSSQAREPGVVKELEFDFEQDLIDAYSNIMAQINPDDFLDWDGDFSKEVEMEGSGE

Query:  LLGDAFTVEELEEGEIME
        LLGDAFT EELEEGEIME
Subjt:  LLGDAFTVEELEEGEIME

XP_022959527.1 box C/D snoRNA protein 1-like [Cucurbita moschata] | 1.4e-194 | 81.8
Query:  MAEED-----ATVAVSTSSNQQGSSLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQR
        MAE D     A  A STS N++GSSLC+EC SNPSKYKCPACS+RSCSL+CVN HKRRSGCTGKRKQTQFVP+SQFNDS+LLSDYNLLEEVKRMAESAQR
Subjt:  MAEED-----ATVAVSTSSNQQGSSLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQR

Query:  LRKKLCPYTHAYFRLPFHLKSLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRPSPWKTQL
        LRKKLCPYTH Y+RLPFHLKSLR AAS+RRTKIMFLPTGMTKRE NQTRYDKREKTIFWT+EWR NSTD+VLVDH VNEN+ LST+LENHL+PSPWK Q+
Subjt:  LRKKLCPYTHAYFRLPFHLKSLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRPSPWKTQL

Query:  QKFYEQLDCLKFFVRTYPKGATSSFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPNALKNDLASHEGVCFRVEEIEEDE
        QKF EQLD LKFFVRTYPKGA + F ELDS +PIRQLFSNL FVEYPVIYV LPSQTPNFEVVKTANPVS N EG N  KNDLAS EGV FRVEEIE+D+
Subjt:  QKFYEQLDCLKFFVRTYPKGATSSFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPNALKNDLASHEGVCFRVEEIEEDE

Query:  NSCNPQVLDLMKVSTSSPHCKVSPRNLHGATHSYSTGLVGKQEVGNSPKSSSQAREPGVVKELEFDFEQDLIDAYSNIMAQINPDDFLDWDGDFSKEVEM
        NS N QVLDLMK S SSPHC+V P+N+ GATH+YST L+GK EVGNSP SSSQA+E GV KELEFDFEQDL+D YSNIMAQINPDDFLDWD DFSK VEM
Subjt:  NSCNPQVLDLMKVSTSSPHCKVSPRNLHGATHSYSTGLVGKQEVGNSPKSSSQAREPGVVKELEFDFEQDLIDAYSNIMAQINPDDFLDWDGDFSKEVEM

Query:  EGSGELLGDAFTVEELEEGEIME
        EGSG+LLGD FTV+ELEEGEIME
Subjt:  EGSGELLGDAFTVEELEEGEIME

XP_031745217.1 box C/D snoRNA protein 1 [Cucumis sativus] | 1.3e-237 | 99.52
Query:  MAEEDATVAVSTSSNQQGSSLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKL
        MAE DATVAVSTSSNQQGSSLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKL
Subjt:  MAEEDATVAVSTSSNQQGSSLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKL

Query:  CPYTHAYFRLPFHLKSLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRPSPWKTQLQKFYE
        CPYTHAYFRLPFHLKSLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRP PWKTQLQKFYE
Subjt:  CPYTHAYFRLPFHLKSLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRPSPWKTQLQKFYE

Query:  QLDCLKFFVRTYPKGATSSFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPNALKNDLASHEGVCFRVEEIEEDENSCNP
        QLDCLKFFVRTYPKGATSSFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPNALKNDLASHEGVCFRVEEIEEDENSCNP
Subjt:  QLDCLKFFVRTYPKGATSSFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPNALKNDLASHEGVCFRVEEIEEDENSCNP

Query:  QVLDLMKVSTSSPHCKVSPRNLHGATHSYSTGLVGKQEVGNSPKSSSQAREPGVVKELEFDFEQDLIDAYSNIMAQINPDDFLDWDGDFSKEVEMEGSGE
        QVLDLMKVSTSSPHCKVSPRNLHGATHSYSTGLVGKQEVGNSPKSSSQAREPGVVKELEFDFEQDLIDAYSNIMAQINPDDFLDWDGDFSKEVEMEGSGE
Subjt:  QVLDLMKVSTSSPHCKVSPRNLHGATHSYSTGLVGKQEVGNSPKSSSQAREPGVVKELEFDFEQDLIDAYSNIMAQINPDDFLDWDGDFSKEVEMEGSGE

Query:  LLGDAFTVEELEEGEIME
        LLGDAFTVEELEEGEIME
Subjt:  LLGDAFTVEELEEGEIME

XP_038900096.1 box C/D snoRNA protein 1-like [Benincasa hispida] | 5.4e-207 | 88.28
Query:  MAEEDATVAVSTSSNQQGSSLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKL
        MAE DAT   STSSN+Q SSLC+ECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQR RKKL
Subjt:  MAEEDATVAVSTSSNQQGSSLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKL

Query:  CPYTHAYFRLPFHLKSLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRPSPWKTQLQKFYE
        CPYTHAYFRLPFHLKSLR AAS+RRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNS DIVLVDH VNENSKLSTILENHL+PSPWK QL+KF E
Subjt:  CPYTHAYFRLPFHLKSLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRPSPWKTQLQKFYE

Query:  QLDCLKFFVRTYPKGATSSFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPNALKNDLASHEGVCFRVEEIEEDENSCNP
        QLD LKFFVRTYPKGATS F ELDS LPIRQLFSNL FVEYPVIYV LPSQTPNFEVVKTANP+SRN EG NA KN+LASHEGV FRVEEIE+D+NS NP
Subjt:  QLDCLKFFVRTYPKGATSSFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPNALKNDLASHEGVCFRVEEIEEDENSCNP

Query:  QVLDLMKVSTSSPHCKVSPRNLHGATHSYSTGLVGKQEVGNSPKSSSQAREPGVVKELEFDFEQDLIDAYSNIMAQINPDDFLDWDGDFSKEVEMEGSGE
        QVLDLM+VST SP C+V P+NLH ATH+YS  L+GKQE GNSP SSSQA+E GVVKE EFDFEQDLIDAYSNIMAQINPDDFLDW+GDFSK VEMEGSGE
Subjt:  QVLDLMKVSTSSPHCKVSPRNLHGATHSYSTGLVGKQEVGNSPKSSSQAREPGVVKELEFDFEQDLIDAYSNIMAQINPDDFLDWDGDFSKEVEMEGSGE

Query:  LLGDAFTVEELEEGEIME
        LLGDAFTVEELEEGEIME
Subjt:  LLGDAFTVEELEEGEIME

TrEMBL top hits | e value | % identity (alignment below each hit)
A0A0A0K6N1 HIT-type domain-containing protein | 6.4e-238 | 99.52
Query:  MAEEDATVAVSTSSNQQGSSLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKL
        MAE DATVAVSTSSNQQGSSLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKL
Subjt:  MAEEDATVAVSTSSNQQGSSLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKL

Query:  CPYTHAYFRLPFHLKSLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRPSPWKTQLQKFYE
        CPYTHAYFRLPFHLKSLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRP PWKTQLQKFYE
Subjt:  CPYTHAYFRLPFHLKSLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRPSPWKTQLQKFYE

Query:  QLDCLKFFVRTYPKGATSSFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPNALKNDLASHEGVCFRVEEIEEDENSCNP
        QLDCLKFFVRTYPKGATSSFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPNALKNDLASHEGVCFRVEEIEEDENSCNP
Subjt:  QLDCLKFFVRTYPKGATSSFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPNALKNDLASHEGVCFRVEEIEEDENSCNP

Query:  QVLDLMKVSTSSPHCKVSPRNLHGATHSYSTGLVGKQEVGNSPKSSSQAREPGVVKELEFDFEQDLIDAYSNIMAQINPDDFLDWDGDFSKEVEMEGSGE
        QVLDLMKVSTSSPHCKVSPRNLHGATHSYSTGLVGKQEVGNSPKSSSQAREPGVVKELEFDFEQDLIDAYSNIMAQINPDDFLDWDGDFSKEVEMEGSGE
Subjt:  QVLDLMKVSTSSPHCKVSPRNLHGATHSYSTGLVGKQEVGNSPKSSSQAREPGVVKELEFDFEQDLIDAYSNIMAQINPDDFLDWDGDFSKEVEMEGSGE

Query:  LLGDAFTVEELEEGEIME
        LLGDAFTVEELEEGEIME
Subjt:  LLGDAFTVEELEEGEIME

A0A1S3CEQ2 box C/D snoRNA protein 1-like | 2.1e-217 | 93.06
Query:  MAEEDATVAVSTSSNQQGSSLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKL
        MAEEDAT+AVSTSSN QGSSLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKL
Subjt:  MAEEDATVAVSTSSNQQGSSLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKL

Query:  CPYTHAYFRLPFHLKSLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRPSPWKTQLQKFYE
        CPYTHAYFRLPFHLKSLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNST+IVLVDHEVNENSKLSTIL NHLRPSPWKTQLQKF E
Subjt:  CPYTHAYFRLPFHLKSLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRPSPWKTQLQKFYE

Query:  QLDCLKFFVRTYPKGATSSFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPNALKNDLASHEGVCFRVEEIEEDENSCNP
        QLDCLK FVRTYPKGA S F ELDSTLPIRQLFSNLAFVEYPVIYVVLP QTPNFEVVKTANP SRNLEG NAL+NDLASH GVCFRVEEIE+DENSCNP
Subjt:  QLDCLKFFVRTYPKGATSSFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPNALKNDLASHEGVCFRVEEIEEDENSCNP

Query:  QVLDLMKVSTSSPHCKVSPRNLHGATHSYSTGLVGKQEVGNSPKSSSQAREPGVVKELEFDFEQDLIDAYSNIMAQINPDDFLDWDGDFSKEVEMEGSGE
        QVLDLMKVSTSSPHCKVSPRN           LVGKQEVGNSPKSSSQARE GVVKELEFDFEQDLIDAYSNIMAQINPDDFLDW+GDFSKEVEMEGSGE
Subjt:  QVLDLMKVSTSSPHCKVSPRNLHGATHSYSTGLVGKQEVGNSPKSSSQAREPGVVKELEFDFEQDLIDAYSNIMAQINPDDFLDWDGDFSKEVEMEGSGE

Query:  LLGDAFTVEELEEGEIME
        LLGDAFT EELEEGEIME
Subjt:  LLGDAFTVEELEEGEIME

A0A5D3CJA8 Box C/D snoRNA protein 1-like | 8.4e-214 | 92.94
Query:  VAVSTSSNQQGSSLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAY
        +AVSTSSN QGSSLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAY
Subjt:  VAVSTSSNQQGSSLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAY

Query:  FRLPFHLKSLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRPSPWKTQLQKFYEQLDCLKF
        FRLPFHLKSLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNST+IVLVDHEVNENSKLSTIL NHLRPSPWKTQLQKF EQLDCLK 
Subjt:  FRLPFHLKSLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRPSPWKTQLQKFYEQLDCLKF

Query:  FVRTYPKGATSSFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPNALKNDLASHEGVCFRVEEIEEDENSCNPQVLDLMK
        FVRTYPKGA S F ELDSTLPIRQLFSNLAFVEYPVIYVVLP QTPNFEVVKTANP SRNLEG NAL+NDLASH GVCFRVEEIE+DENSCNPQVLDLMK
Subjt:  FVRTYPKGATSSFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPNALKNDLASHEGVCFRVEEIEEDENSCNPQVLDLMK

Query:  VSTSSPHCKVSPRNLHGATHSYSTGLVGKQEVGNSPKSSSQAREPGVVKELEFDFEQDLIDAYSNIMAQINPDDFLDWDGDFSKEVEMEGSGELLGDAFT
        VSTSSPHCKVSPRN           LVGKQEVGNSPKSSSQARE GVVKELEFDFEQDLIDAYSNIMAQINPDDFLDW+GDFSKEVEMEGSGELLGDAFT
Subjt:  VSTSSPHCKVSPRNLHGATHSYSTGLVGKQEVGNSPKSSSQAREPGVVKELEFDFEQDLIDAYSNIMAQINPDDFLDWDGDFSKEVEMEGSGELLGDAFT

Query:  VEELEEGEIME
         EELEEGEIME
Subjt:  VEELEEGEIME

A0A6J1H4S8 box C/D snoRNA protein 1-like | 6.7e-195 | 81.8
Query:  MAEED-----ATVAVSTSSNQQGSSLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQR
        MAE D     A  A STS N++GSSLC+EC SNPSKYKCPACS+RSCSL+CVN HKRRSGCTGKRKQTQFVP+SQFNDS+LLSDYNLLEEVKRMAESAQR
Subjt:  MAEED-----ATVAVSTSSNQQGSSLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQR

Query:  LRKKLCPYTHAYFRLPFHLKSLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRPSPWKTQL
        LRKKLCPYTH Y+RLPFHLKSLR AAS+RRTKIMFLPTGMTKRE NQTRYDKREKTIFWT+EWR NSTD+VLVDH VNEN+ LST+LENHL+PSPWK Q+
Subjt:  LRKKLCPYTHAYFRLPFHLKSLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRPSPWKTQL

Query:  QKFYEQLDCLKFFVRTYPKGATSSFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPNALKNDLASHEGVCFRVEEIEEDE
        QKF EQLD LKFFVRTYPKGA + F ELDS +PIRQLFSNL FVEYPVIYV LPSQTPNFEVVKTANPVS N EG N  KNDLAS EGV FRVEEIE+D+
Subjt:  QKFYEQLDCLKFFVRTYPKGATSSFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPNALKNDLASHEGVCFRVEEIEEDE

Query:  NSCNPQVLDLMKVSTSSPHCKVSPRNLHGATHSYSTGLVGKQEVGNSPKSSSQAREPGVVKELEFDFEQDLIDAYSNIMAQINPDDFLDWDGDFSKEVEM
        NS N QVLDLMK S SSPHC+V P+N+ GATH+YST L+GK EVGNSP SSSQA+E GV KELEFDFEQDL+D YSNIMAQINPDDFLDWD DFSK VEM
Subjt:  NSCNPQVLDLMKVSTSSPHCKVSPRNLHGATHSYSTGLVGKQEVGNSPKSSSQAREPGVVKELEFDFEQDLIDAYSNIMAQINPDDFLDWDGDFSKEVEM

Query:  EGSGELLGDAFTVEELEEGEIME
        EGSG+LLGD FTV+ELEEGEIME
Subjt:  EGSGELLGDAFTVEELEEGEIME

A0A6J1KUW8 box C/D snoRNA protein 1 | 7.4e-194 | 81.32
Query:  MAEED-----ATVAVSTSSNQQGSSLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQR
        MAE D     A  A STS N++GSSLCEEC SNPSKYKCPACS+RSCSL+CVN HKRRSGCTGKRKQTQFVP+SQFNDS+LLSDYNLLEEVKRMAESAQR
Subjt:  MAEED-----ATVAVSTSSNQQGSSLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQR

Query:  LRKKLCPYTHAYFRLPFHLKSLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRPSPWKTQL
        LRKKLCPYTH Y+RLPFHLKSLR AAS+RRTKIMFLPTGMTKRE NQTRYDKREKTIFWT+EWR NSTD+VLVDH VNEN+ LST+LENHL+PSPWK Q+
Subjt:  LRKKLCPYTHAYFRLPFHLKSLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRPSPWKTQL

Query:  QKFYEQLDCLKFFVRTYPKGATSSFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPNALKNDLASHEGVCFRVEEIEEDE
        QKF EQLD LKFFVRTYPKGA   F ELDS +PIRQLFSNL FVEYPVIYV LPSQTPNFEV+KTANPVS N EG N  KNDL S EGV FRVEEIE+D+
Subjt:  QKFYEQLDCLKFFVRTYPKGATSSFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPNALKNDLASHEGVCFRVEEIEEDE

Query:  NSCNPQVLDLMKVSTSSPHCKVSPRNLHGATHSYSTGLVGKQEVGNSPKSSSQAREPGVVKELEFDFEQDLIDAYSNIMAQINPDDFLDWDGDFSKEVEM
        NS N QVLDLMK S SSP+C+V P+N+ GATH+YST L+GK EVGNSP SSSQA+E GV+KELEFDFEQDL+D YSNIMAQINPDDFLDWD DFSK VEM
Subjt:  NSCNPQVLDLMKVSTSSPHCKVSPRNLHGATHSYSTGLVGKQEVGNSPKSSSQAREPGVVKELEFDFEQDLIDAYSNIMAQINPDDFLDWDGDFSKEVEM

Query:  EGSGELLGDAFTVEELEEGEIME
        EGSG+LLGD FTV+ELEEGEIME
Subjt:  EGSGELLGDAFTVEELEEGEIME

SwissProt top hits | e value | % identity (alignment below each hit)
O74906 Putative box C/D snoRNA protein SPCC613.07 | 1.0e-14 | 34.38
Query:  SSNQQGSSLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAYFRLPF
        S N+ G  +C  C+ N SKY+CP C  R C L C   HKR + C+G+R    FVP S+  +  L SD+N L  V+R+    +    ++   ++   R   
Subjt:  SSNQQGSSLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAYFRLPF

Query:  HLK-SLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEW--RFNSTDIVLVDHEVNENSKLSTILENHLRPSPWKTQLQKFYEQ
         LK SL  A  N    I F P    KR  N+T YDK+   I W++EW    +ST   L D E +EN   + I  +H    P +   +K  E+
Subjt:  HLK-SLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEW--RFNSTDIVLVDHEVNENSKLSTILENHLRPSPWKTQLQKFYEQ

P38772 Box C/D snoRNA protein 1 | 2.4e-08 | 22.68
Query:  LCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGK----RKQTQFVPLSQFND------SILLSDYNLLEEVKRMA-----ESAQRLRKKLCP---
        LC  C     KYKCP C +++CSL C   HK R  C+G+    ++      L Q +D      + +  DYN L ++KRM      ++  + ++ L P   
Subjt:  LCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGK----RKQTQFVPLSQFND------SILLSDYNLLEEVKRMA-----ESAQRLRKKLCP---

Query:  YTHAYFRLPFHLKSLRAAAS------NRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRPSPW-----
        +   + +  + +      ++       R    + LP GM +   N++++DK      W++EW      I+    E  E  +L   + + ++ + +     
Subjt:  YTHAYFRLPFHLKSLRAAAS------NRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRPSPW-----

Query:  -KTQLQK---FYE------------------------QLDCLKFFVRTYPKGAT-----SSFCELD-STLPIRQLFSNLAFVEYPVIYVVL
         K   QK   FY                         Q   LKF+ +T+P   T         EL      I +L  N   +E+P I+V +
Subjt:  -KTQLQK---FYE------------------------QLDCLKFFVRTYPKGAT-----SSFCELD-STLPIRQLFSNLAFVEYPVIYVVL

Q3UFB2 Box C/D snoRNA protein 1 | 1.6e-20 | 28.77
Query:  AEEDATVAVSTSSNQQ-GS------SLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQ
        A EDA V      N + GS      S CE C +  +KY+CP C   SCSL CV  HK    C+G R +T +V L QF +  LLSDY  LE+V R A+   
Subjt:  AEEDATVAVSTSSNQQ-GS------SLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQ

Query:  RLRKKLCPYTHAYFRLPFHLKSLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRP---SPW
        R      P    Y    F +K+    A  +   +  LP G +KR+ N T +D R++   W ++ +F  +    ++  V ++  ++ IL+ ++ P    P 
Subjt:  RLRKKLCPYTHAYFRLPFHLKSLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRP---SPW

Query:  KTQLQKFYEQLDC-LKFFVRTYPKGATSSFCELDSTLPIRQLFSNL---AFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPN
          Q  K Y Q    ++  +R   +    +        P + L  NL     +EYP ++VVL   + + ++++  +  ++ L   N
Subjt:  KTQLQKFYEQLDC-LKFFVRTYPKGATSSFCELDSTLPIRQLFSNL---AFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPN

Q5RF97 Box C/D snoRNA protein 1 | 1.2e-23 | 28.63
Query:  SLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAYFRLPF---HLKS
        S CE C +  +KY+CP C   SCSL CV  HK    C G R +T ++ + QF +  LLSDY  LE+V R A+   R          A+ + P    H+  
Subjt:  SLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAYFRLPF---HLKS

Query:  LRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRP---SPWKTQLQKFY--EQLDCLKFFVRT
        ++  A  +   +  LP G TKR+ N T +DK+++   W ++ +F  +    ++  V ++  ++ IL+ ++ P    P   Q  K Y   Q          
Subjt:  LRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRP---SPWKTQLQKFY--EQLDCLKFFVRT

Query:  YPKGATSSFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVS-RNLEGPN
        Y +     + ELD    +     N   +EYP ++VVL     + +V++     S +NL   N
Subjt:  YPKGATSSFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVS-RNLEGPN

Q9NWK9 Box C/D snoRNA protein 1 | 1.7e-22 | 28.69
Query:  SLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAYFRLPFHLKSLRA
        S CE C +  +KY+CP C   SCSL CV  HK    C G R +T ++ + QF +  LLSDY  LE+V R A+   R      P ++ Y      +  ++ 
Subjt:  SLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAYFRLPFHLKSLRA

Query:  AASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRP---SPWKTQLQKFY--EQLDCLKFFVRTYPK
         A  +   +  LP G TKR+ N T +DK+++   W ++ +F  +    ++  V ++  ++ IL+ ++ P    P   Q  K Y   Q          Y +
Subjt:  AASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRP---SPWKTQLQKFY--EQLDCLKFFVRTYPK

Query:  GATSSFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVV
             + ELD    +     N   +EYP ++VVL     + +V+
Subjt:  GATSSFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVV

Arabidopsis top hits | e value | % identity (alignment below each hit)
AT1G04945.1 HIT-type Zinc finger family protein | 3.6e-92 | 48.01
Query:  SLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAYFRLPFHLKSLRA
        S+CEECK NP KYKCP CSIRSC+L CV AHK+R+GCTGKRK T  VPLS+F+D++LLSDYN+LEE KR+AESA R R +LC   H  ++LP+ LKSL++
Subjt:  SLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAYFRLPFHLKSLRA

Query:  AASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRPSPWKTQLQKFYE-QLDCLKFFVRTYPKGATS
        AA +RRTK+ +LP+GM KRENNQ+RYD R K I WT+EWRF+STD++LVDH V E+  L ++++NHL+P PW  +L+ F +  LD LK F+R YPKGA +
Subjt:  AASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRPSPWKTQLQKFYE-QLDCLKFFVRTYPKGATS

Query:  SFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPN-ALKNDLASHEGVCFRVEEIEEDE-NSCNPQVLDLMKVSTSSPHCK
         F ELD   P+R+  + +  +EYPVI+V LPSQ+  F+V+K  N        PN +L +      G+ FR EEIEED+ +S  P+VL LMK    +P  +
Subjt:  SFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPN-ALKNDLASHEGVCFRVEEIEEDE-NSCNPQVLDLMKVSTSSPHCK

Query:  VSPRNLHGATHSYSTGLVGKQEVGNSPKSSSQAREPGVVKELEFDFEQDLIDAYSNIMAQINPDDFLDWDGDFSKEVEMEGSGELLGDAFTVEELEEGEI
        VS +       S + G VG     N    +++  + G    +E +FEQ LID YS++ A++NPD     DG                      +LEEGEI
Subjt:  VSPRNLHGATHSYSTGLVGKQEVGNSPKSSSQAREPGVVKELEFDFEQDLIDAYSNIMAQINPDDFLDWDGDFSKEVEMEGSGELLGDAFTVEELEEGEI

Query:  ME
        +E
Subjt:  ME

AT1G04945.2 HIT-type Zinc finger family protein | 4.1e-96 | 48.28
Query:  SLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAYFRLPFHLKSLRA
        S+CEECK NP KYKCP CSIRSC+L CV AHK+R+GCTGKRK T  VPLS+F+D++LLSDYN+LEE KR+AESA R R +LC   H  ++LP+ LKSL++
Subjt:  SLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAYFRLPFHLKSLRA

Query:  AASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRPSPWKTQLQKFYE-QLDCLKFFVRTYPKGATS
        AA +RRTK+ +LP+GM KRENNQ+RYD R K I WT+EWRF+STD++LVDH V E+  L ++++NHL+P PW  +L+ F +  LD LK F+R YPKGA +
Subjt:  AASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRPSPWKTQLQKFYE-QLDCLKFFVRTYPKGATS

Query:  SFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPN-ALKNDLASHEGVCFRVEEIEEDE-NSCNPQVLDLMKVSTSSPHCK
         F ELD   P+R+  + +  +EYPVI+V LPSQ+  F+V+K  N        PN +L +      G+ FR EEIEED+ +S  P+VL LMK    +P  +
Subjt:  SFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPN-ALKNDLASHEGVCFRVEEIEEDE-NSCNPQVLDLMKVSTSSPHCK

Query:  VSPRNLHGATHSYSTGLVGKQEVGNSPKSSSQAREPGVVKELEFDFEQDLIDAYSNIMAQINPDDFLDWDGDFSKEVEMEGSGEL--LGDAFTVE--ELE
        VS +       S + G VG     N    +++  + G    +E +FEQ LID YS++ A++NP D+ +++ +F+K ++ + +  L  L   F  +  +LE
Subjt:  VSPRNLHGATHSYSTGLVGKQEVGNSPKSSSQAREPGVVKELEFDFEQDLIDAYSNIMAQINPDDFLDWDGDFSKEVEMEGSGEL--LGDAFTVE--ELE

Query:  EGEIME
        EGEI+E
Subjt:  EGEIME

AT1G04945.3 HIT-type Zinc finger family protein | 4.1e-96 | 48.28
Query:  SLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAYFRLPFHLKSLRA
        S+CEECK NP KYKCP CSIRSC+L CV AHK+R+GCTGKRK T  VPLS+F+D++LLSDYN+LEE KR+AESA R R +LC   H  ++LP+ LKSL++
Subjt:  SLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAYFRLPFHLKSLRA

Query:  AASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRPSPWKTQLQKFYE-QLDCLKFFVRTYPKGATS
        AA +RRTK+ +LP+GM KRENNQ+RYD R K I WT+EWRF+STD++LVDH V E+  L ++++NHL+P PW  +L+ F +  LD LK F+R YPKGA +
Subjt:  AASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRPSPWKTQLQKFYE-QLDCLKFFVRTYPKGATS

Query:  SFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPN-ALKNDLASHEGVCFRVEEIEEDE-NSCNPQVLDLMKVSTSSPHCK
         F ELD   P+R+  + +  +EYPVI+V LPSQ+  F+V+K  N        PN +L +      G+ FR EEIEED+ +S  P+VL LMK    +P  +
Subjt:  SFCELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPN-ALKNDLASHEGVCFRVEEIEEDE-NSCNPQVLDLMKVSTSSPHCK

Query:  VSPRNLHGATHSYSTGLVGKQEVGNSPKSSSQAREPGVVKELEFDFEQDLIDAYSNIMAQINPDDFLDWDGDFSKEVEMEGSGEL--LGDAFTVE--ELE
        VS +       S + G VG     N    +++  + G    +E +FEQ LID YS++ A++NP D+ +++ +F+K ++ + +  L  L   F  +  +LE
Subjt:  VSPRNLHGATHSYSTGLVGKQEVGNSPKSSSQAREPGVVKELEFDFEQDLIDAYSNIMAQINPDDFLDWDGDFSKEVEMEGSGEL--LGDAFTVE--ELE

Query:  EGEIME
        EGEI+E
Subjt:  EGEIME


Sequences
CDS sequence
ATGGCGGAAGAGGATGCAACTGTGGCAGTTTCCACAAGCTCCAACCAGCAAGGATCATCACTTTGCGAAGAGTGTAAATCAAACCCATCAAAGTACAAATGCCCCGCGTG
CTCTATCCGTTCTTGTAGCCTCAATTGCGTCAATGCCCACAAGCGCCGCAGTGGCTGTACTGGCAAGAGAAAGCAGACCCAATTCGTCCCTCTTTCTCAATTCAATGATA
GTATCCTTCTTTCTGATTATAATTTGTTGGAGGAAGTTAAGAGAATGGCTGAATCAGCTCAAAGACTTAGAAAGAAATTGTGCCCTTACACTCATGCTTACTTTCGACTA
CCGTTTCATCTTAAAAGTTTGCGTGCTGCTGCTTCAAATAGGAGAACAAAAATTATGTTTCTCCCCACCGGAATGACGAAAAGGGAGAACAATCAAACTCGATATGATAA
GAGGGAAAAAACAATCTTCTGGACAATGGAATGGCGGTTTAACTCCACTGACATTGTTTTAGTTGACCATGAAGTTAATGAAAACTCTAAGCTTTCTACCATTCTTGAAA
ACCATCTACGACCAAGTCCATGGAAAACTCAACTTCAGAAGTTCTATGAGCAGCTGGATTGCCTCAAATTTTTTGTCCGTACATACCCCAAGGGAGCTACATCGTCTTTT
TGTGAGCTGGACTCGACATTGCCAATAAGACAACTGTTTTCAAATTTGGCTTTTGTGGAATACCCTGTTATATATGTTGTTTTACCCTCTCAAACTCCTAATTTTGAAGT
AGTTAAAACTGCCAATCCAGTGAGTCGTAATCTAGAAGGTCCAAATGCTCTGAAAAATGATCTTGCTAGCCATGAAGGCGTTTGCTTCAGAGTGGAAGAAATAGAAGAAG
ACGAAAATTCCTGCAATCCTCAGGTTCTTGATCTGATGAAAGTATCAACTTCAAGCCCACATTGCAAAGTCAGCCCCCGAAACCTACATGGAGCAACGCATAGTTACTCT
ACAGGTTTGGTGGGGAAGCAGGAAGTTGGGAATAGCCCCAAGTCAAGCTCCCAGGCCAGGGAGCCAGGGGTAGTGAAAGAGTTGGAGTTTGATTTTGAGCAAGATCTGAT
AGATGCATACTCAAACATCATGGCACAAATTAATCCAGATGATTTTCTTGATTGGGATGGAGACTTTTCCAAGGAAGTGGAAATGGAAGGAAGCGGTGAACTTCTTGGGG
ATGCGTTCACGGTTGAAGAACTCGAGGAAGGAGAGATTATGGAATAA
mRNA sequence
AAAAAAACTCTCTTGGTTTTTCCTCTTTCTTTTCATTCTTCCTTCACATTCCCCCGCGCCACGTTTACAGCCGCCACTGTTTCTTCTTCAACTCCGGCGATTCTCCATAG
ACCCACGATACCCTCCGCCGCCAGCACTCATCATTGCGACTCCGACCAGCGACCCGTGCCGCACCTCCACTATTAACGACTGTTTGCTGCTCTTCTACACCCTTGCAGAC
GGAGACCCACGGCGCACGACGGCGGAACATCCTCCAGCACCGAAACCTCGAGCTCCACATTTTGAATTCTTCTGTTGATAACTTCGGGCTGATCCAAAAAGATTTAACGG
CGAAATTGGGGCGAAACAATATAAACCGTATTCAAATTACGAAACCTAGTCGTCGTTGCTGAGGAAATTTTGTTGTCCAAGATGGCGGAAGAGGATGCAACTGTGGCAGT
TTCCACAAGCTCCAACCAGCAAGGATCATCACTTTGCGAAGAGTGTAAATCAAACCCATCAAAGTACAAATGCCCCGCGTGCTCTATCCGTTCTTGTAGCCTCAATTGCG
TCAATGCCCACAAGCGCCGCAGTGGCTGTACTGGCAAGAGAAAGCAGACCCAATTCGTCCCTCTTTCTCAATTCAATGATAGTATCCTTCTTTCTGATTATAATTTGTTG
GAGGAAGTTAAGAGAATGGCTGAATCAGCTCAAAGACTTAGAAAGAAATTGTGCCCTTACACTCATGCTTACTTTCGACTACCGTTTCATCTTAAAAGTTTGCGTGCTGC
TGCTTCAAATAGGAGAACAAAAATTATGTTTCTCCCCACCGGAATGACGAAAAGGGAGAACAATCAAACTCGATATGATAAGAGGGAAAAAACAATCTTCTGGACAATGG
AATGGCGGTTTAACTCCACTGACATTGTTTTAGTTGACCATGAAGTTAATGAAAACTCTAAGCTTTCTACCATTCTTGAAAACCATCTACGACCAAGTCCATGGAAAACT
CAACTTCAGAAGTTCTATGAGCAGCTGGATTGCCTCAAATTTTTTGTCCGTACATACCCCAAGGGAGCTACATCGTCTTTTTGTGAGCTGGACTCGACATTGCCAATAAG
ACAACTGTTTTCAAATTTGGCTTTTGTGGAATACCCTGTTATATATGTTGTTTTACCCTCTCAAACTCCTAATTTTGAAGTAGTTAAAACTGCCAATCCAGTGAGTCGTA
ATCTAGAAGGTCCAAATGCTCTGAAAAATGATCTTGCTAGCCATGAAGGCGTTTGCTTCAGAGTGGAAGAAATAGAAGAAGACGAAAATTCCTGCAATCCTCAGGTTCTT
GATCTGATGAAAGTATCAACTTCAAGCCCACATTGCAAAGTCAGCCCCCGAAACCTACATGGAGCAACGCATAGTTACTCTACAGGTTTGGTGGGGAAGCAGGAAGTTGG
GAATAGCCCCAAGTCAAGCTCCCAGGCCAGGGAGCCAGGGGTAGTGAAAGAGTTGGAGTTTGATTTTGAGCAAGATCTGATAGATGCATACTCAAACATCATGGCACAAA
TTAATCCAGATGATTTTCTTGATTGGGATGGAGACTTTTCCAAGGAAGTGGAAATGGAAGGAAGCGGTGAACTTCTTGGGGATGCGTTCACGGTTGAAGAACTCGAGGAA
GGAGAGATTATGGAATAATTTTGGTCATTTTTGTTGCACTTCATCTGTCAGTCTACAAGCGGGAACCAATTTTTGCTTGATGCAGAGCTTTCCAGTTCCTACTTCACTAC
AGGTAAATCTTGATATCATTTTACTTTATTGATTTGCCCCTCTCTCCAAGTAGCATCCTGGATTTTGGCCTTTGCTAGATTGGAACATTCTGGAATCGTCTGTTAGCTGA
ATCTTCTCGTTGTTTACACTTGTTGCTAATCTTGATTTGATTATATCAGTGAGTTGTTTGAGAACATATATCTTACAGAAGTTTATACGATCTGTTAAG
Protein sequence
MAEEDATVAVSTSSNQQGSSLCEECKSNPSKYKCPACSIRSCSLNCVNAHKRRSGCTGKRKQTQFVPLSQFNDSILLSDYNLLEEVKRMAESAQRLRKKLCPYTHAYFRL
PFHLKSLRAAASNRRTKIMFLPTGMTKRENNQTRYDKREKTIFWTMEWRFNSTDIVLVDHEVNENSKLSTILENHLRPSPWKTQLQKFYEQLDCLKFFVRTYPKGATSSF
CELDSTLPIRQLFSNLAFVEYPVIYVVLPSQTPNFEVVKTANPVSRNLEGPNALKNDLASHEGVCFRVEEIEEDENSCNPQVLDLMKVSTSSPHCKVSPRNLHGATHSYS
TGLVGKQEVGNSPKSSSQAREPGVVKELEFDFEQDLIDAYSNIMAQINPDDFLDWDGDFSKEVEMEGSGELLGDAFTVEELEEGEIME
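
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the names `CODON_TABLE` and `translate` are illustrative and not part of the database. It is exercised only on the first 24 and last 15 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering
# (assumption: NCBI translation table 1 applies to this gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First eight codons and last five codons of the CDS above.
print(translate("ATGGCGGAAGAGGATGCAACTGTG"))
print(translate("GAGATTATGGAATAA"))
```

The two calls print `MAEEDATV` and `EIME*`, which agree with the start and end of the protein sequence above. To check the full record, join the CDS lines into one string before calling `translate`, because the line breaks do not fall on codon boundaries. Codons containing characters other than A, C, G, or T raise a `KeyError` in this sketch.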