CuGenDBv2

CmoCh04G006820 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh04G006820
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Serine/arginine repetitive matrix-like protein
Genome location: Cmo_Chr04:3387448..3388773
RNA-Seq Expression: CmoCh04G006820
Synteny: CmoCh04G006820
Gene Ontology terms: GO:0003677 - DNA binding (molecular function)
InterPro domains: NA
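
The genome location in the record above uses a chromosome:start..end notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this explicitly), the span can be parsed and measured as follows. The names `parse_location` and `span_length` are hypothetical helpers for illustration, not part of CuGenDB.

```python
import re


def parse_location(loc):
    """Split 'chrom:start..end' into (chrom, start, end)."""
    m = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(loc):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    _, start, end = parse_location(loc)
    return end - start + 1


if __name__ == "__main__":
    # Location string copied from the record above.
    print(span_length("Cmo_Chr04:3387448..3388773"))  # 1326
```

Under the inclusive-coordinate assumption the gene spans 1326 bp; with 0-based half-open coordinates the `+ 1` would be dropped.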


Homology
GenBank top hits (with e value, % identity and alignment)
KAG6600467.1 hypothetical protein SDJN03_05700, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.9e-235; % identity: 98.19)
Query:  MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNLSGLITKPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLNGKL
        MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNL+GLITKPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLNGKL
Subjt:  MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNLSGLITKPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLNGKL

Query:  VPFQVSSVKPSVNVLKSMRCVSPPETAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGSAKTENHKTTSPSYASEANSKALRYLLH
        VPFQVSSVKPSVNVLKSMRCVS PETAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGS KTENHKTTSPSYASEANSKALRYLLH
Subjt:  VPFQVSSVKPSVNVLKSMRCVSPPETAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGSAKTENHKTTSPSYASEANSKALRYLLH

Query:  RRSKSSLSSSFDSSLSLPLLKDSDSESVSLSSSRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNTSNPPRMRQVKPRPKSEMNPRSTMDHHP
        RRSKSSLSSSF+SSL+LPLLKDSDSESVSLSSSRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPN SNPPRMRQVKPRPKSEMNPRSTMDHHP
Subjt:  RRSKSSLSSSFDSSLSLPLLKDSDSESVSLSSSRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNTSNPPRMRQVKPRPKSEMNPRSTMDHHP

Query:  TATRVGRSPMRPAPGESTRLAVRGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPKLKNRGTERSYSANVRITPVLHVPVCSSLRGSSKSVSVFGFGQ
        TATRVGRSPMRPAPGESTRLA+RGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPKLKNRGTERSYSANVRITPVL+VPVCSSLRGSSKSVSVFGFGQ
Subjt:  TATRVGRSPMRPAPGESTRLAVRGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPKLKNRGTERSYSANVRITPVLHVPVCSSLRGSSKSVSVFGFGQ

Query:  LFSSTGTSGSSRSYQSSSSSSSTNRTTTRRIMETDGGGKLH
        LFSSTGTSGSSRSYQSSSSSSSTNRTTTRRIMETDGGGKLH
Subjt:  LFSSTGTSGSSRSYQSSSSSSSTNRTTTRRIMETDGGGKLH

XP_022942648.1 uncharacterized protein LOC111447620 [Cucurbita moschata] (e value: 2.5e-239; % identity: 100)
Query:  MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNLSGLITKPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLNGKL
        MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNLSGLITKPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLNGKL
Subjt:  MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNLSGLITKPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLNGKL

Query:  VPFQVSSVKPSVNVLKSMRCVSPPETAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGSAKTENHKTTSPSYASEANSKALRYLLH
        VPFQVSSVKPSVNVLKSMRCVSPPETAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGSAKTENHKTTSPSYASEANSKALRYLLH
Subjt:  VPFQVSSVKPSVNVLKSMRCVSPPETAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGSAKTENHKTTSPSYASEANSKALRYLLH

Query:  RRSKSSLSSSFDSSLSLPLLKDSDSESVSLSSSRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNTSNPPRMRQVKPRPKSEMNPRSTMDHHP
        RRSKSSLSSSFDSSLSLPLLKDSDSESVSLSSSRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNTSNPPRMRQVKPRPKSEMNPRSTMDHHP
Subjt:  RRSKSSLSSSFDSSLSLPLLKDSDSESVSLSSSRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNTSNPPRMRQVKPRPKSEMNPRSTMDHHP

Query:  TATRVGRSPMRPAPGESTRLAVRGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPKLKNRGTERSYSANVRITPVLHVPVCSSLRGSSKSVSVFGFGQ
        TATRVGRSPMRPAPGESTRLAVRGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPKLKNRGTERSYSANVRITPVLHVPVCSSLRGSSKSVSVFGFGQ
Subjt:  TATRVGRSPMRPAPGESTRLAVRGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPKLKNRGTERSYSANVRITPVLHVPVCSSLRGSSKSVSVFGFGQ

Query:  LFSSTGTSGSSRSYQSSSSSSSTNRTTTRRIMETDGGGKLH
        LFSSTGTSGSSRSYQSSSSSSSTNRTTTRRIMETDGGGKLH
Subjt:  LFSSTGTSGSSRSYQSSSSSSSTNRTTTRRIMETDGGGKLH

XP_022980895.1 uncharacterized protein LOC111480206 [Cucurbita maxima] (e value: 3.1e-229; % identity: 96.37)
Query:  MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNLSGLITKPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLNGKL
        MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNL+GLITKPAADPAGESEIRDSDPE+VPVSEFEFRLQDPVALMLPADELFLNGKL
Subjt:  MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNLSGLITKPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLNGKL

Query:  VPFQVSSVKPSVNVLKSMRCVSPPETAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGSAKTENHKTTSPSYASEANSKALRYLLH
        VPFQVSSVKPSVNVLKSMRCVS PETAAQ RR+VEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGSAKTENHKTTSPSYASEANSKALRYLLH
Subjt:  VPFQVSSVKPSVNVLKSMRCVSPPETAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGSAKTENHKTTSPSYASEANSKALRYLLH

Query:  RRSKSSLSSSFDSSLSLPLLKDSDSESVSLSSSRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNTSNPPRMRQVKPRPKSEMNPRSTMDHHP
        RRSKSSL SSFDSSL+LPLLKDSDSESVSLSSSRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPN SNPPRMRQVKPRPKSEMNPRSTMDHHP
Subjt:  RRSKSSLSSSFDSSLSLPLLKDSDSESVSLSSSRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNTSNPPRMRQVKPRPKSEMNPRSTMDHHP

Query:  TATRVGRSPMRPAPGESTRLAVRGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPKLKNRGTERSYSANVRITPVLHVPVCSSLRGSSKSVSVFGFGQ
        TATRVGRSPMRPAPGESTRLA+RGVSVDSPR+NS GKIVFHNLERSSSSPSSFNGGPKLKNRGTERSYSANVRITPVL+VPVCSSLRGSSKSVSVFGFGQ
Subjt:  TATRVGRSPMRPAPGESTRLAVRGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPKLKNRGTERSYSANVRITPVLHVPVCSSLRGSSKSVSVFGFGQ

Query:  LFSSTGTSGSSRSYQSSSSSSSTNRTTTRRIMETDGGGKLH
        LFS TGTSGSSRSYQ SS SSSTNRTTTRRI+ETDGGGKLH
Subjt:  LFSSTGTSGSSRSYQSSSSSSSTNRTTTRRIMETDGGGKLH

XP_023549710.1 uncharacterized protein LOC111808127 [Cucurbita pepo subsp. pepo] (e value: 1.2e-233; % identity: 97.51)
Query:  MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNLSGLITKPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLNGKL
        MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNL+GL+TKPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLNGKL
Subjt:  MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNLSGLITKPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLNGKL

Query:  VPFQVSSVKPSVNVLKSMRCVSPPETAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGSAKTENHKTTSPSYASEANSKALRYLLH
        VPF+VSSVKPSVNVLKSMRCVS PETAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGSAKTENHKTTSPSYASEANSKALRYLLH
Subjt:  VPFQVSSVKPSVNVLKSMRCVSPPETAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGSAKTENHKTTSPSYASEANSKALRYLLH

Query:  RRSKSSLSSSFDSSLSLPLLKDSDSESVSLSSSRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNTSNPPRMRQVKPRPKSEMNPRSTMDHHP
        RRSKSSLSSSFDSSL+LPLLKDSDSESVSLSSSRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPN SNPPRMRQVKPRPKSEMNPRSTMDHH 
Subjt:  RRSKSSLSSSFDSSLSLPLLKDSDSESVSLSSSRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNTSNPPRMRQVKPRPKSEMNPRSTMDHHP

Query:  TATRVGRSPMRPAPGESTRLAVRGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPKLKNRGTERSYSANVRITPVLHVPVCSSLRGSSKSVSVFGFGQ
        TATRVGRSPMRPAPGESTRLA+RGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPKLKNRGTERSYSANVRITPVL+VPVCSSLRGSSKSVSVFGFGQ
Subjt:  TATRVGRSPMRPAPGESTRLAVRGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPKLKNRGTERSYSANVRITPVLHVPVCSSLRGSSKSVSVFGFGQ

Query:  LFSSTGTSGSSRSYQSSSSSSSTNRTTTRRIMETDGGGKLH
        LFS+TGTSGSSRSYQSSSSSSSTNRTTTRRI+ETDGGGKLH
Subjt:  LFSSTGTSGSSRSYQSSSSSSSTNRTTTRRIMETDGGGKLH

XP_023551901.1 uncharacterized protein LOC111809733 [Cucurbita pepo subsp. pepo] (e value: 1.6e-188; % identity: 81.52)
Query:  MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSR---DDSPPSTNLSGLIT--KPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELF
        MASACVN++GMSPENFLDCSSAPCHSYGWLSPR+SFSR   DDS PS+NL+  I+  KP ADPAG+SEIRD DPELVPVSEFEF L+DPVALMLPADELF
Subjt:  MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSR---DDSPPSTNLSGLIT--KPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELF

Query:  LNGKLVPFQVSSVKPSVNVLKSMRCVSPPETAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGS------AKTENHKTT---SPSY
        L+GKLVP QVSSVKPSVN LKS RCVS PET  Q+RR VE EC+TDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNG+      AK ENHKTT   S SY
Subjt:  LNGKLVPFQVSSVKPSVNVLKSMRCVSPPETAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGS------AKTENHKTT---SPSY

Query:  ASEANSKALRYLLHRRSKSSLSSSFDSSLSLPLLKDSDSESVSLSSSRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNTSNPPRMRQVKPRP
         SEANSKAL+Y LHR SKSSL+SSFDSSLSLPLLKDSDSESVSLSSSRVSLSSSSSGHE EDLHRL LD ENKPN NPISLHRNPN +NPPRMR VKPRP
Subjt:  ASEANSKALRYLLHRRSKSSLSSSFDSSLSLPLLKDSDSESVSLSSSRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNTSNPPRMRQVKPRP

Query:  KSEMNPR--STMDHHPTATRVGRSPMRPAPGE--STRL-AVRGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPKLKNRGTERSYSANVRITPVLHVP
        KSE NPR  ST+DHHPTATRVGRSPMR  PGE  S+RL  +RGVSVDSPR+NS GKIVFHNLERSSSSPS+FNGGPK KNRG ERSYSANVR+TPVL+VP
Subjt:  KSEMNPR--STMDHHPTATRVGRSPMRPAPGE--STRL-AVRGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPKLKNRGTERSYSANVRITPVLHVP

Query:  VCSSLRGSSKSVSVFGFGQLFSSTGTSGSSRSYQSSSSSSSTNRTTTRRIMETDGGGKLH
        VCSSLRGSSKS SVFGFG LF+ TG     RS+Q+SSSSSSTNRTTTRRI ETDGGGKLH
Subjt:  VCSSLRGSSKSVSVFGFGQLFSSTGTSGSSRSYQSSSSSSSTNRTTTRRIMETDGGGKLH

TrEMBL top hits (with e value, % identity and alignment)
A0A5D3CZJ6 Putative serine-rich protein (e value: 5.2e-182; % identity: 79.49)
Query:  MASACVNNIGMSPENFLDCSSA-PCHSYGWLSPRISFSRDDSPPSTNLSGLI--TKPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLN
        MASACVNN+G+S ENFLDCSS+ PCHSYGWL PR+SFSRDDSPPS NL G +  TKPA   AGESE RD DPELVPVSEFEFRLQDPV+LMLPADELF +
Subjt:  MASACVNNIGMSPENFLDCSSA-PCHSYGWLSPRISFSRDDSPPSTNLSGLI--TKPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLN

Query:  GKLVPFQVSSVKPSVNVLKSMRCVSPPETAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSS------------NGSAKTENHKTT---
        GKLVP QVSS KPSVN LKS RCVS PET  QSRR VE ECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSS            NGSAK ENHKTT   
Subjt:  GKLVPFQVSSVKPSVNVLKSMRCVSPPETAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSS------------NGSAKTENHKTT---

Query:  SPSYASEANSKALRYLLHRRSKSSLSSSFDSSLSLPLLKDSDSESVSLSSSRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNTSNPPRMRQV
        S SY SEANSKAL+Y LHR SKSSLSSS DSSLSLPLLKDSDSESVSLSSSRVSLSSSSSGHE EDLHRL LD ENKPNTNPISLHRNPN +NPPRMR V
Subjt:  SPSYASEANSKALRYLLHRRSKSSLSSSFDSSLSLPLLKDSDSESVSLSSSRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNTSNPPRMRQV

Query:  KPRPKSEMNPR--STMDH-HPTATRVGRSPMRPAPGE----STRLAVRGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPKLKNRGTERSYSANVRIT
        KPRPKSE NPR  ST DH HP+ATRVGRSP+R  PGE    S+RL +RGVSVDSPR+NS GKIVFHNLERSSSSPSSFNGGPK KNRG ERSYSANVR+T
Subjt:  KPRPKSEMNPR--STMDH-HPTATRVGRSPMRPAPGE----STRLAVRGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPKLKNRGTERSYSANVRIT

Query:  PVLHVPVCSSLRGSSKSVSVFGFGQLFSSTGTSGSSRSYQ--SSSSSSSTNRTTTRRIMETDGGGKLH
        PVL+VPVCSSLRGSSKSVSVFGFG L   TG  GSSR++Q  SS+SSSS+NRTTTRRI +T+GGGK+H
Subjt:  PVLHVPVCSSLRGSSKSVSVFGFGQLFSSTGTSGSSRSYQ--SSSSSSSTNRTTTRRIMETDGGGKLH

A0A6J1ETX7 homeobox protein prospero-like (e value: 9.8e-189; % identity: 81.74)
Query:  MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSR---DDSPPSTNLSGLIT--KPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELF
        MASACVN++GMSPENFLDCSSAPCHSYGWLSPR+SFSR   DDS PS+NL+  I+  KP ADPA +SEIRD DPELVPVSEFEF LQDPVALMLPADELF
Subjt:  MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSR---DDSPPSTNLSGLIT--KPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELF

Query:  LNGKLVPFQVSSVKPSVNVLKSMRCVSPPETAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSS------SNGSAKTENHKTT---SPSY
        L+GKLVP QVSSVKPSVN LKS RCVS PET  Q+RR VE EC+TDPYLFSPKAPRCSSRWRELLGLKKLYQSS      SNGSAK ENHKTT   S SY
Subjt:  LNGKLVPFQVSSVKPSVNVLKSMRCVSPPETAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSS------SNGSAKTENHKTT---SPSY

Query:  ASEANSKALRYLLHRRSKSSLSSSFDSSLSLPLLKDSDSESVSLSSSRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNTSNPPRMRQVKPRP
         SEANSKAL+Y LHR SKSSL+SSFDSSLSLPLLKDSDSESVSLSSSRVSLSSSSSGHE EDLHRL LD ENKPN NPISLHRNPN +NPPRMR VKPRP
Subjt:  ASEANSKALRYLLHRRSKSSLSSSFDSSLSLPLLKDSDSESVSLSSSRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNTSNPPRMRQVKPRP

Query:  KSEMNPR--STMDHHPTATRVGRSPMRPAPGE--STRL-AVRGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPKLKNRGTERSYSANVRITPVLHVP
        KSE NPR  ST+DHHPTATRVGRSPMR  PG+  S+RL  +RGVSVDSPR+NS GKIVFHNLERSSSSPS+FNGGPK KNRG ERSYSANVR+TPVL+VP
Subjt:  KSEMNPR--STMDHHPTATRVGRSPMRPAPGE--STRL-AVRGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPKLKNRGTERSYSANVRITPVLHVP

Query:  VCSSLRGSSKSVSVFGFGQLFSSTGTSGSSRSYQSSSSSSSTNRTTTRRIMETDGGGKLH
        VCSSLRGSSKS SVFGFG LF+ TG     RS+QSSSSSSSTNRTTTRRI ETDGGGKLH
Subjt:  VCSSLRGSSKSVSVFGFGQLFSSTGTSGSSRSYQSSSSSSSTNRTTTRRIMETDGGGKLH

A0A6J1FVC0 uncharacterized protein LOC111447620 (e value: 1.2e-239; % identity: 100)
Query:  MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNLSGLITKPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLNGKL
        MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNLSGLITKPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLNGKL
Subjt:  MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNLSGLITKPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLNGKL

Query:  VPFQVSSVKPSVNVLKSMRCVSPPETAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGSAKTENHKTTSPSYASEANSKALRYLLH
        VPFQVSSVKPSVNVLKSMRCVSPPETAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGSAKTENHKTTSPSYASEANSKALRYLLH
Subjt:  VPFQVSSVKPSVNVLKSMRCVSPPETAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGSAKTENHKTTSPSYASEANSKALRYLLH

Query:  RRSKSSLSSSFDSSLSLPLLKDSDSESVSLSSSRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNTSNPPRMRQVKPRPKSEMNPRSTMDHHP
        RRSKSSLSSSFDSSLSLPLLKDSDSESVSLSSSRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNTSNPPRMRQVKPRPKSEMNPRSTMDHHP
Subjt:  RRSKSSLSSSFDSSLSLPLLKDSDSESVSLSSSRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNTSNPPRMRQVKPRPKSEMNPRSTMDHHP

Query:  TATRVGRSPMRPAPGESTRLAVRGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPKLKNRGTERSYSANVRITPVLHVPVCSSLRGSSKSVSVFGFGQ
        TATRVGRSPMRPAPGESTRLAVRGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPKLKNRGTERSYSANVRITPVLHVPVCSSLRGSSKSVSVFGFGQ
Subjt:  TATRVGRSPMRPAPGESTRLAVRGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPKLKNRGTERSYSANVRITPVLHVPVCSSLRGSSKSVSVFGFGQ

Query:  LFSSTGTSGSSRSYQSSSSSSSTNRTTTRRIMETDGGGKLH
        LFSSTGTSGSSRSYQSSSSSSSTNRTTTRRIMETDGGGKLH
Subjt:  LFSSTGTSGSSRSYQSSSSSSSTNRTTTRRIMETDGGGKLH

A0A6J1IXV4 uncharacterized protein LOC111480206 (e value: 1.5e-229; % identity: 96.37)
Query:  MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNLSGLITKPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLNGKL
        MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNL+GLITKPAADPAGESEIRDSDPE+VPVSEFEFRLQDPVALMLPADELFLNGKL
Subjt:  MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNLSGLITKPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLNGKL

Query:  VPFQVSSVKPSVNVLKSMRCVSPPETAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGSAKTENHKTTSPSYASEANSKALRYLLH
        VPFQVSSVKPSVNVLKSMRCVS PETAAQ RR+VEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGSAKTENHKTTSPSYASEANSKALRYLLH
Subjt:  VPFQVSSVKPSVNVLKSMRCVSPPETAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGSAKTENHKTTSPSYASEANSKALRYLLH

Query:  RRSKSSLSSSFDSSLSLPLLKDSDSESVSLSSSRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNTSNPPRMRQVKPRPKSEMNPRSTMDHHP
        RRSKSSL SSFDSSL+LPLLKDSDSESVSLSSSRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPN SNPPRMRQVKPRPKSEMNPRSTMDHHP
Subjt:  RRSKSSLSSSFDSSLSLPLLKDSDSESVSLSSSRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNTSNPPRMRQVKPRPKSEMNPRSTMDHHP

Query:  TATRVGRSPMRPAPGESTRLAVRGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPKLKNRGTERSYSANVRITPVLHVPVCSSLRGSSKSVSVFGFGQ
        TATRVGRSPMRPAPGESTRLA+RGVSVDSPR+NS GKIVFHNLERSSSSPSSFNGGPKLKNRGTERSYSANVRITPVL+VPVCSSLRGSSKSVSVFGFGQ
Subjt:  TATRVGRSPMRPAPGESTRLAVRGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPKLKNRGTERSYSANVRITPVLHVPVCSSLRGSSKSVSVFGFGQ

Query:  LFSSTGTSGSSRSYQSSSSSSSTNRTTTRRIMETDGGGKLH
        LFS TGTSGSSRSYQ SS SSSTNRTTTRRI+ETDGGGKLH
Subjt:  LFSSTGTSGSSRSYQSSSSSSSTNRTTTRRIMETDGGGKLH

A0A6J1JAD5 uncharacterized serine-rich protein C215.13 (e value: 4.9e-188; % identity: 81.3)
Query:  MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSR---DDSPPSTNLSGLIT--KPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELF
        MASACVN++GMSPENFLDCSSAPCHSYGWLSPR+SFSR   DDS PS+NL+  I+  KP ADPAG+SEIRD DPELVPVSEFEF LQDPVALMLPADELF
Subjt:  MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSR---DDSPPSTNLSGLIT--KPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELF

Query:  LNGKLVPFQVSSVKPSVNVLKSMRCVSPPETAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGS------AKTENHKTT---SPSY
        L+GKLVP QVSSVKPSVN LKS RCVS PE+A Q+RR VE EC+TDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNG+      AK ENHKTT   S SY
Subjt:  LNGKLVPFQVSSVKPSVNVLKSMRCVSPPETAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGS------AKTENHKTT---SPSY

Query:  ASEANSKALRYLLHRRSKSSLSSSFDSSLSLPLLKDSDSESVSLSSSRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNTSNPPRMRQVKPRP
         SEANSKAL+Y LHR SKSSL+SSFDSSLSLPLLKDSDSESVSLSSSRVSLSSSSSGHE EDLHRL LD EN PN NPISLHRNPN +NPPRMR VKPRP
Subjt:  ASEANSKALRYLLHRRSKSSLSSSFDSSLSLPLLKDSDSESVSLSSSRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNTSNPPRMRQVKPRP

Query:  KSEMNPR--STMDHHPTATRVGRSPMRPAPGE--STRL-AVRGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPKLKNRGTERSYSANVRITPVLHVP
        KSE NPR  ST+DHHPTA RVGRSPMR  PGE  S+RL  +RGVSVDSPR+NS GKIVFHNLERSSSSPS+FNGGPK KNRG ERSYSANVR+TPVL+VP
Subjt:  KSEMNPR--STMDHHPTATRVGRSPMRPAPGE--STRL-AVRGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPKLKNRGTERSYSANVRITPVLHVP

Query:  VCSSLRGSSKSVSVFGFGQLFSSTGTSGSSRSYQSSSSSSSTNRTTTRRIMETDGGGKLH
        VCSSLRGSSKS SVFGFG LF+ TG     RS+QSSSSSSSTNRTTTRRI ETDGGGK+H
Subjt:  VCSSLRGSSKSVSVFGFGQLFSSTGTSGSSRSYQSSSSSSSTNRTTTRRIMETDGGGKLH

SwissProt top hits (with e value, % identity and alignment)
No hits found

Arabidopsis top hits (with e value, % identity and alignment)
AT1G56020.1 unknown protein (e value: 5.2e-65; % identity: 45.02)
Query:  MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNLSGLITKPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLNGKL
        MASACV + G+SPE F         SYGW SPR+S +RDD+  S++    + K  +DP    EI+D      PV +FEF L+DPV  ML ADELF +GKL
Subjt:  MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNLSGLITKPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLNGKL

Query:  VPFQVSSVKPSVNVLKSMRCVSPPE-----TAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGSAKTENHKTTSPSYASEANSKAL
        VP + S  K +     +    +  E        +S R +E E S    LFSPKAPRC++RWRELLGLK+L     N   + E+ K +S S ++   + + 
Subjt:  VPFQVSSVKPSVNVLKSMRCVSPPE-----TAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGSAKTENHKTTSPSYASEANSKAL

Query:  RYLLHRRSKSSLSSSFDSSLSLPLLKDSD-SESVSLSSSRVSL-SSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNTS-NPPRMRQVKPRPKSEMNP
        +  LHR SKSS      ++ S PL K+SD SES+S++SSR+SL SSSSS HE++DL RL LD + KP+ NP +  R  + + N PR+R  KPR       
Subjt:  RYLLHRRSKSSLSSSFDSSLSLPLLKDSD-SESVSLSSSRVSL-SSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNTS-NPPRMRQVKPRPKSEMNP

Query:  RSTMDHHPTATRVGRSPMRPAPGESTRLAVRGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPKLK-NRGTERSYSANVRITPVLHVPVCSSLRGSSK
            +H P+   V  S    A  ES  L    V+ DSPR+N+ GKIVFH LERSSSSP SF GGP++K + G  RSYSANVRITPVL+VPVCS   G   
Subjt:  RSTMDHHPTATRVGRSPMRPAPGESTRLAVRGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPKLK-NRGTERSYSANVRITPVLHVPVCSSLRGSSK

Query:  SVSVFGFGQLFSSTGTSGSS-------RSYQSSSSSSSTNRT
              FGQLFSS+ +S SS          QS+ S +  NRT
Subjt:  SVSVFGFGQLFSSTGTSGSS-------RSYQSSSSSSSTNRT

AT1G79060.1 unknown protein (e value: 2.2e-71; % identity: 47.78)
Query:  MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNLSGLITKPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLNGKL
        MAS CVNN+ +S +           +YG  +PR SFSRDD   S   SG +          SEI   +   V   +FEFRL++    MLPADELF +GKL
Subjt:  MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNLSGLITKPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLNGKL

Query:  VPFQVSSVKPSVNVLKSMRCVSPPETAAQSRR----EVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGSAKTENHKTTSPSYASEANSKALR
        V  Q                    E   + RR    E+E     D   FSPKAPRCSSRWR+LLGLK+  Q+SS  SA T    TT+P     +++ +L+
Subjt:  VPFQVSSVKPSVNVLKSMRCVSPPETAAQSRR----EVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGSAKTENHKTTSPSYASEANSKALR

Query:  YLLHRRSKSSLSSSFDSSL--SLPLLKDSDSESVSLSSSRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNTSNP-PRMRQVKPRPKSEMNPR
          LHR S+SS SSS D+SL  SLPLLKDSDSESVS+SSSR+SLSSSSSGH+ EDL RL LD E +PN N I    +  T+NP    R + P P     PR
Subjt:  YLLHRRSKSSLSSSFDSSL--SLPLLKDSDSESVSLSSSRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNTSNP-PRMRQVKPRPKSEMNPR

Query:  STMDHHPTA----TRVGRSPMRPAPGESTRLAVRGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPK-LKNRGTERSYSANVRITPVLHVPVCSSLRG
          + +H T+     RVGRSPMR + GE++ +  RGVSVDSPR+NS GKIVF NLERSSSSPSSFNGG    ++RG ERSYS+NVR+TPVL+VPVC S+RG
Subjt:  STMDHHPTA----TRVGRSPMRPAPGESTRLAVRGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPK-LKNRGTERSYSANVRITPVLHVPVCSSLRG

Query:  SSKSVSVFGFGQLFSSTGTSGSSRSYQSSSSSS-------STNRTTTRRI
         S       FGQ FSS+ +S SS++ ++ ++++       S  R +T RI
Subjt:  SSKSVSVFGFGQLFSSTGTSGSSRSYQSSSSSS-------STNRTTTRRI

AT3G05980.1 unknown protein (e value: 1.7e-04; % identity: 30.52)
Query:  LSPRISFSRD--DSPPSTNLSGLITKPAADPAGESEIRDSDPELVPVSEFEFRLQDPVA--LMLPADELFLNGKLVPFQVSSVKPSVNVLKSMRCVSPPE
        L PRISFS D  D      ++ ++ K       E  ++ S    V VS+FEF   + V+   ML ADELF  GKL+PF    VK S   LK++   +  E
Subjt:  LSPRISFSRD--DSPPSTNLSGLITKPAADPAGESEIRDSDPELVPVSEFEFRLQDPVA--LMLPADELFLNGKLVPFQVSSVKPSVNVLKSMRCVSPPE

Query:  TAAQSRR-EVEAE--------------CSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNG------SAKTENHKTTSPSYASEANSKALRYLLHRRSK
          A+ R+ EV+ +                 DP   SP+ P+C+  W+ELL LKK    SS+       S+ + +  T+S S   +A  +  +    +R K
Subjt:  TAAQSRR-EVEAE--------------CSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNG------SAKTENHKTTSPSYASEANSKALRYLLHRRSK

Query:  SSLSSSFDSSLSL
          L  +  +S+ +
Subjt:  SSLSSSFDSSLSL

AT3G12970.1 unknown protein (e value: 4.8e-55; % identity: 41.32)
Query:  MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNLSGLITKPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLNGKL
        MAS CV N+G SP            S+ W S ++S +R+  P +             PA E+E         PV +FEF L+DPV  ML ADELF +GKL
Subjt:  MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNLSGLITKPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLNGKL

Query:  VPFQVSSV-----KPSVNVLKSMRCVSPPETAAQSRREVEAECS--TDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGSAKTENHKTTSPSYASEANSK
        VP + S V     KP  +V+          TA +  R +E E S   DPYLFSP+APRC+ RWRELLGLK+L ++    SA + +  ++S   +    + 
Subjt:  VPFQVSSV-----KPSVNVLKSMRCVSPPETAAQSRREVEAECS--TDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGSAKTENHKTTSPSYASEANSK

Query:  ALRYLLHRRSKSSLSSSFDSSLSLPLLKDSD---SESVSLSSSRVSL-SSSSSGHELEDLHRLPLDWENKPNT-NPISL-----HRNPNTSNPPRMRQVK
        + R+ L+R SKS+           P  KDSD   S S S+SSSR+SL SSSSSGHEL+DL RL LD +NKP T NP +      H +    N PR    K
Subjt:  ALRYLLHRRSKSSLSSSFDSSLSLPLLKDSD---SESVSLSSSRVSL-SSSSSGHELEDLHRLPLDWENKPNT-NPISL-----HRNPNTSNPPRMRQVK

Query:  PRPKSEMNPRSTMDHHPTATRVGRSPMRPAPGESTRLAVRGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPKLK-NRGTERSYSANVRITPVLHVPV
        PR  ++++  +                      S    V  V+ DSPR+N+ GKIVFH LERSSSSP +F GGP++K + G  RS+SANVRITPVL+VPV
Subjt:  PRPKSEMNPRSTMDHHPTATRVGRSPMRPAPGESTRLAVRGVSVDSPRINSCGKIVFHNLERSSSSPSSFNGGPKLK-NRGTERSYSANVRITPVLHVPV

Query:  CSSLRGSSKSVSVFGFGQLFSSTGTSGSSRSYQSSS---SSSSTNRTTTRRIMET
         SSLR   KS  +F FGQLF+S+ ++ SS S  + +   S++  NRT   R+  T
Subjt:  CSSLRGSSKSVSVFGFGQLFSSTGTSGSSRSYQSSS---SSSSTNRTTTRRIMET

AT5G19340.1 unknown protein (e value: 6.6e-04; % identity: 29.41)
Query:  PRISFSRDDSPPSTNLSGLITKPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLNGKLVPF-QVS--------SVKPSVNVL---KSMR
        PRISFS D S   ++   +   P  +     E  + D   V   +FEF  ++  A ML ADELF  GKL+PF QV         ++KP V V    +  +
Subjt:  PRISFSRDDSPPSTNLSGLITKPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLNGKLVPF-QVS--------SVKPSVNVL---KSMR

Query:  CVSPPETAAQSRREVEAE------------CSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGSAKTENHKTTSPSYASEANSKA
         V+         +E                   DP   SP+ P+C+  W+ELL LKK  Q ++  +  +    + SPS +S + S +
Subjt:  CVSPPETAAQSRREVEAE------------CSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGSAKTENHKTTSPSYASEANSKA
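
In the alignments above, the unlabeled middle line appears to follow the usual BLAST-style convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. As a hedged sketch under that assumption, the % identity of an aligned block can be estimated by counting letters in the middle line and dividing by the number of aligned columns. The function name `percent_identity` is hypothetical, and the page's own values may have been computed by a different tool.

```python
def percent_identity(query, midline):
    """Percent identity of one aligned block.

    Assumes BLAST-style midline semantics: letters are identities,
    '+' is a conservative substitution, ' ' is a mismatch or gap.
    """
    if len(midline) > len(query):
        raise ValueError("midline is longer than the query line")
    # Trailing spaces are often lost in copied text, so pad them back.
    midline = midline.ljust(len(query))
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(query)


if __name__ == "__main__":
    # A ten-residue stretch from the first GenBank alignment above.
    print(percent_identity("SPPSTNLSGL", "SPPSTNL+GL"))  # 90.0
```

For a full hit, the query and middle lines of every block would be concatenated before calling the function, so that the denominator covers the whole alignment.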


Sequences
CDS sequence
ATGGCTTCCGCATGCGTTAACAACATCGGAATGTCGCCGGAGAATTTCCTCGATTGTTCTTCTGCTCCTTGCCATTCTTACGGCTGGCTCAGCCCTCGCATCTCCTTCAG
CCGCGACGACTCCCCGCCTTCTACTAACCTCTCCGGACTTATAACTAAGCCTGCCGCTGACCCGGCCGGAGAGTCTGAGATTCGAGATTCGGATCCTGAACTAGTGCCGG
TCAGTGAGTTCGAGTTCCGCCTTCAAGATCCCGTCGCCTTGATGCTCCCGGCGGATGAGCTGTTTCTGAATGGAAAACTCGTGCCATTTCAGGTCTCGTCTGTTAAACCC
TCGGTCAACGTTTTGAAGTCGATGAGGTGCGTTTCGCCGCCGGAGACTGCGGCTCAGTCCCGCCGGGAAGTTGAGGCAGAATGCAGTACGGATCCATATCTGTTCTCTCC
CAAGGCGCCGAGATGTTCCAGTCGATGGAGAGAGCTTTTAGGGCTCAAGAAACTGTACCAGAGCAGCAGCAATGGCAGCGCCAAAACCGAAAATCACAAAACGACGTCGC
CGTCTTACGCCTCGGAAGCCAATTCTAAGGCGCTGAGGTATTTACTTCACCGGCGTTCGAAATCATCGTTATCGTCTTCGTTCGATTCGTCACTGAGCCTTCCATTGTTG
AAGGACTCCGATAGCGAGTCTGTTTCGCTATCTTCTTCCCGCGTATCTCTTTCCTCGTCTTCCTCAGGTCACGAACTCGAAGATCTTCACAGACTCCCGCTGGATTGGGA
AAATAAGCCAAACACGAATCCGATTTCTCTCCATCGGAACCCTAACACTAGCAATCCACCGCGAATGAGACAGGTGAAACCTCGGCCTAAATCGGAGATGAATCCAAGAT
CAACAATGGATCATCATCCGACGGCAACGAGAGTAGGTAGAAGCCCAATGCGGCCTGCGCCGGGAGAATCCACTAGATTAGCTGTCCGAGGAGTATCCGTAGACAGTCCA
CGAATTAACTCCTGTGGTAAAATCGTGTTCCACAACTTGGAGAGAAGCTCGAGCAGTCCAAGCAGCTTCAATGGCGGACCAAAACTGAAGAACAGAGGAACGGAAAGGTC
ATATTCAGCGAACGTGAGAATAACTCCAGTCCTTCACGTTCCAGTCTGTTCTTCCCTGAGAGGATCCTCAAAATCCGTCTCTGTGTTCGGATTCGGTCAACTATTCTCCA
GTACCGGCACCAGCGGAAGCAGTAGAAGCTACCAGAGTAGTAGCAGTAGTAGTAGCACTAACCGGACGACGACAAGGCGGATCATGGAAACAGACGGCGGAGGAAAACTC
CATTAA
mRNA sequence
ATGGCTTCCGCATGCGTTAACAACATCGGAATGTCGCCGGAGAATTTCCTCGATTGTTCTTCTGCTCCTTGCCATTCTTACGGCTGGCTCAGCCCTCGCATCTCCTTCAG
CCGCGACGACTCCCCGCCTTCTACTAACCTCTCCGGACTTATAACTAAGCCTGCCGCTGACCCGGCCGGAGAGTCTGAGATTCGAGATTCGGATCCTGAACTAGTGCCGG
TCAGTGAGTTCGAGTTCCGCCTTCAAGATCCCGTCGCCTTGATGCTCCCGGCGGATGAGCTGTTTCTGAATGGAAAACTCGTGCCATTTCAGGTCTCGTCTGTTAAACCC
TCGGTCAACGTTTTGAAGTCGATGAGGTGCGTTTCGCCGCCGGAGACTGCGGCTCAGTCCCGCCGGGAAGTTGAGGCAGAATGCAGTACGGATCCATATCTGTTCTCTCC
CAAGGCGCCGAGATGTTCCAGTCGATGGAGAGAGCTTTTAGGGCTCAAGAAACTGTACCAGAGCAGCAGCAATGGCAGCGCCAAAACCGAAAATCACAAAACGACGTCGC
CGTCTTACGCCTCGGAAGCCAATTCTAAGGCGCTGAGGTATTTACTTCACCGGCGTTCGAAATCATCGTTATCGTCTTCGTTCGATTCGTCACTGAGCCTTCCATTGTTG
AAGGACTCCGATAGCGAGTCTGTTTCGCTATCTTCTTCCCGCGTATCTCTTTCCTCGTCTTCCTCAGGTCACGAACTCGAAGATCTTCACAGACTCCCGCTGGATTGGGA
AAATAAGCCAAACACGAATCCGATTTCTCTCCATCGGAACCCTAACACTAGCAATCCACCGCGAATGAGACAGGTGAAACCTCGGCCTAAATCGGAGATGAATCCAAGAT
CAACAATGGATCATCATCCGACGGCAACGAGAGTAGGTAGAAGCCCAATGCGGCCTGCGCCGGGAGAATCCACTAGATTAGCTGTCCGAGGAGTATCCGTAGACAGTCCA
CGAATTAACTCCTGTGGTAAAATCGTGTTCCACAACTTGGAGAGAAGCTCGAGCAGTCCAAGCAGCTTCAATGGCGGACCAAAACTGAAGAACAGAGGAACGGAAAGGTC
ATATTCAGCGAACGTGAGAATAACTCCAGTCCTTCACGTTCCAGTCTGTTCTTCCCTGAGAGGATCCTCAAAATCCGTCTCTGTGTTCGGATTCGGTCAACTATTCTCCA
GTACCGGCACCAGCGGAAGCAGTAGAAGCTACCAGAGTAGTAGCAGTAGTAGTAGCACTAACCGGACGACGACAAGGCGGATCATGGAAACAGACGGCGGAGGAAAACTC
CATTAA
Protein sequence
MASACVNNIGMSPENFLDCSSAPCHSYGWLSPRISFSRDDSPPSTNLSGLITKPAADPAGESEIRDSDPELVPVSEFEFRLQDPVALMLPADELFLNGKLVPFQVSSVKP
SVNVLKSMRCVSPPETAAQSRREVEAECSTDPYLFSPKAPRCSSRWRELLGLKKLYQSSSNGSAKTENHKTTSPSYASEANSKALRYLLHRRSKSSLSSSFDSSLSLPLL
KDSDSESVSLSSSRVSLSSSSSGHELEDLHRLPLDWENKPNTNPISLHRNPNTSNPPRMRQVKPRPKSEMNPRSTMDHHPTATRVGRSPMRPAPGESTRLAVRGVSVDSP
RINSCGKIVFHNLERSSSSPSSFNGGPKLKNRGTERSYSANVRITPVLHVPVCSSLRGSSKSVSVFGFGQLFSSTGTSGSSRSYQSSSSSSSTNRTTTRRIMETDGGGKL
H
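
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), which the page does not state explicitly; `translate` is a hypothetical helper, and codons containing ambiguous bases are rendered as `X`.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS string; whitespace and line breaks are ignored."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


if __name__ == "__main__":
    # First ten codons and last three codons of the CDS above.
    print(translate("ATGGCTTCCGCATGCGTTAACAACATCGGA"))  # MASACVNNIG
    print(translate("CTCCATTAA"))                       # LH*
```

Translating the full CDS and removing the trailing `*` (the stop codon) should reproduce the protein sequence listed on the page if the standard-code assumption holds.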