; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr023722 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr023722
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionArm domain-containing protein
Genome locationtig00000892:5984960..5994991
RNA-Seq ExpressionSgr023722
SyntenySgr023722
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR000225 - Armadillo
IPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAV68442.1 Arm domain-containing protein, partial [Cephalotus follicularis]7.0e-30757.56Show/hide
Query:  ERESETWN------HKKQILIFQLSQRLIHGDLHSRIEAARDLRKLARKSSPKSRSKLGASSLIQPLVCMLLSPILMPVKPLYLPSSILLPATNGSGTSM
        E E  TWN      HK Q LI +LS +LI+GDL ++IEAARD+RK+ RKSS K+RSK  A+ +IQPLV ML SP L                        
Subjt:  ERESETWN------HKKQILIFQLSQRLIHGDLHSRIEAARDLRKLARKSSPKSRSKLGASSLIQPLVCMLLSPILMPVKPLYLPSSILLPATNGSGTSM

Query:  FGNSPPPPPPNFYWEEARFSEFWIISWFSLSRFMTCLTAEHTVPSSHTIPVISFIIRNKIKIVAAGAIPPLLELLKFQNLSLRELATAAILTLSAAASNK
                       +AR S        SL   +                      RNK+KIV AGAIPPL+ELLKFQ+ S RELA AAILTLSAAA NK
Subjt:  FGNSPPPPPPNFYWEEARFSEFWIISWFSLSRFMTCLTAEHTVPSSHTIPVISFIIRNKIKIVAAGAIPPLLELLKFQNLSLRELATAAILTLSAAASNK

Query:  PVILSAGAASLLVQILISGSVQAKVDAVTALYYLSACTESEDCSTILDPRA--------------------------IISKSDEGRTAISNSDGGILTLV
          I+++GAA LLV IL SGSVQ KVDAVT L+ LS  T  E+   IL+ +A                          I+S S+EGR AI++SDGGILTLV
Subjt:  PVILSAGAASLLVQILISGSVQAKVDAVTALYYLSACTESEDCSTILDPRA--------------------------IISKSDEGRTAISNSDGGILTLV

Query:  QTVEDGSLVSTEHAVGVLLSLCQSCRETYRKPILKEGAIPGLLRLTVEGTAEAQERARRLLDLLRDSPQEKRMSSTDLERIVYKSLPRSMEVSLGKWVEI
        +T+EDGSLVSTEHAVG LLSLCQS R+ YR  ILKEGAIPGLL LTV+GT EAQERAR LLDLLRD+P+EK+++++ LE+IVY    R            
Subjt:  QTVEDGSLVSTEHAVGVLLSLCQSCRETYRKPILKEGAIPGLLRLTVEGTAEAQERARRLLDLLRDSPQEKRMSSTDLERIVYKSLPRSMEVSLGKWVEI

Query:  SLLNLRIWFGYSLLCYFCIPSTSPAAHSSHLTLIRSQTHLHQLQRREKQRDGVEFLESSSCLRQVRVRELLSNSFAPDQCPCGIEAFFGLFCVDLIVGSV
                          +     AA ++   L        +L     Q   V++                 + F+P Q P      + L     +    
Subjt:  SLLNLRIWFGYSLLCYFCIPSTSPAAHSSHLTLIRSQTHLHQLQRREKQRDGVEFLESSSCLRQVRVRELLSNSFAPDQCPCGIEAFFGLFCVDLIVGSV

Query:  GSLVSGKPILSFHFFCCTLLSGSIWKLMVGKMLGRSSLYRTGSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTSTS
               P L+ H    T     +       MLGRS L RTGSFRPENLGQNALA+IGNLCFTLFV+GVLIFTIIAATY+PEDPLFHPSTKITTFLTSTS
Subjt:  GSLVSGKPILSFHFFCCTLLSGSIWKLMVGKMLGRSSLYRTGSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTSTS

Query:  NATFKTDSTVMKTGEDFMAANQTAFATFLNETDIVKIIDAENSALGTHTEATSPECNGNVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSND
        NATFK+D TV+KTGEDFMA+NQTAFATF+N TD+      EN +  +  +++S  C  +VD PIDC+DPEVFHLMM+ TIE FKDIHFY           
Subjt:  NATFKTDSTVMKTGEDFMAANQTAFATFLNETDIVKIIDAENSALGTHTEATSPECNGNVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSND

Query:  STCDMAWRFRPKEGKTAAFYKDYRRFVITRSTNCTLSIVSIGDYHTGVNARKRKKNPKHGFEKKMEQLEQAIALPVVGEVVNDSLPVVESEGSFSHGKYL
                 RPKEGK AAFYKDYRRFVI RS NCT S+VSIGDYH+G+NARK+KKN K GFEK   + +  + LPVVGE VND+LPVVESE SFS GKYL
Subjt:  STCDMAWRFRPKEGKTAAFYKDYRRFVITRSTNCTLSIVSIGDYHTGVNARKRKKNPKHGFEKKMEQLEQAIALPVVGEVVNDSLPVVESEGSFSHGKYL

Query:  VYEMGGDKCKSMNHYLWSFLCALGEAQYLNRTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKDHLSLLLVDDFR
        +Y  GGD+CKSMNHYLWSFLCALGEAQYLNRTLVMDL +CLSSIYTSS QDEEGKDFRFYFDFEHL ESAS+LD+ QFW DW KWQKKD LSL LV+DFR
Subjt:  VYEMGGDKCKSMNHYLWSFLCALGEAQYLNRTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKDHLSLLLVDDFR

Query:  VTPMKLRDVKDALILRKFGSAEPDNYWYRVCEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTL
        VTPMKL +VKDALI+RKFGS EPDNYWYRVCEGETESVV+RPWHL+WKSRR+MDIVS+IA+RLNWDYDSVHI RGEKAKNK+LWPNL ADTSPD LLSTL
Subjt:  VTPMKLRDVKDALILRKFGSAEPDNYWYRVCEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTL

Query:  QDKVENGRNLYIATNEPNTAFFDPLKDKYSTHFLDEYKDLWDKDSEWYTETMNLNNGV
         +K+E+GRNLY+ATNEP  +FFDPLKDKYSTHFLDEYKDLW+++SEWY ETM LN G+
Subjt:  QDKVENGRNLYIATNEPNTAFFDPLKDKYSTHFLDEYKDLWDKDSEWYTETMNLNNGV

KAG6604922.1 hypothetical protein SDJN03_02239, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0055.68Show/hide
Query:  KRSAVESERGDSGGGGNGGEGRDWTTSILLFVLWAALMYYVFNLAPNQTSSMDIYFLKKLLNLKNDDGFKMNEVLVSLWYIMGLWPLVYGMLLLPSGRSS
        K S VE+ERG SGGGG GGEGRDWTTSILLF  WA LM+YVFNLAPN+T SMDIY LKKLLNLK+DDGFKMNEVLVSLWYIMGLWPLVYGMLLLPSGRSS
Subjt:  KRSAVESERGDSGGGGNGGEGRDWTTSILLFVLWAALMYYVFNLAPNQTSSMDIYFLKKLLNLKNDDGFKMNEVLVSLWYIMGLWPLVYGMLLLPSGRSS

Query:  DSNVPVWPFLGLSFFLGAYSLLPYFVLWKPPPPPVEEAELKRWPLKFLESKLTAGITFAAGLGILFYTGLAGGTVWKEFYQYFRESRFIHIMSIDFMLLS
        +SNVPVWPFL LSFFLGAY+LLPYFVL KPPPPPVEE ELKRWPL FLESK T+GITFAAGLGILFY GLAG + WKEFYQYFRESRFIH MSIDFMLLS
Subjt:  DSNVPVWPFLGLSFFLGAYSLLPYFVLWKPPPPPVEEAELKRWPLKFLESKLTAGITFAAGLGILFYTGLAGGTVWKEFYQYFRESRFIHIMSIDFMLLS

Query:  SFAPFWVYNDMTARKWYNQGSWLFRFRWCRSWVLPCMSSYDRYQRRHPFPSTPLHLNQNNLVSKVNQKKELGCKGGKEKLSESVGIVKRMKQEEREEQDE
        SFAPFWVYNDM ARKWY++GSWL                        PF   P                                               
Subjt:  SFAPFWVYNDMTARKWYNQGSWLFRFRWCRSWVLPCMSSYDRYQRRHPFPSTPLHLNQNNLVSKVNQKKELGCKGGKEKLSESVGIVKRMKQEEREEQDE

Query:  EIGFERESETWNHKKQILIFQLSQRLIHGDLHSRIEAARDLRKLARKSSPKSRSKLGASSLIQPLVCMLLSPILMPVKPLYLPSSILLPATNGSGTSMFG
                                                                     + P + ++L P               LP T         
Subjt:  EIGFERESETWNHKKQILIFQLSQRLIHGDLHSRIEAARDLRKLARKSSPKSRSKLGASSLIQPLVCMLLSPILMPVKPLYLPSSILLPATNGSGTSMFG

Query:  NSPPPPPPNFYWEEARFSEFWIISWFSLSRFMTCLTAEHTVPSSHTIPVISFIIRNKIKIVAAGAIPPLLELLKFQNLSLRELATAAILTLSAAASNKPV
                                           T   TVP +                                                 AA+++P 
Subjt:  NSPPPPPPNFYWEEARFSEFWIISWFSLSRFMTCLTAEHTVPSSHTIPVISFIIRNKIKIVAAGAIPPLLELLKFQNLSLRELATAAILTLSAAASNKPV

Query:  ILSAGAASLLVQILISGSVQAKVDAVTALYYLSACTESEDCSTILDPRAIISKSDEGRTAISNSDGGILTLVQTVEDGSLVSTEHAVGVLLSLCQSCRET
            G A                                                                                             
Subjt:  ILSAGAASLLVQILISGSVQAKVDAVTALYYLSACTESEDCSTILDPRAIISKSDEGRTAISNSDGGILTLVQTVEDGSLVSTEHAVGVLLSLCQSCRET

Query:  YRKPILKEGAIPGLLRLTVEGTAEAQERARRLLDLLRDSPQEKRMSSTDLERIVYKSLPRSMEVSLGKWVEISLLNLRIWFGYSLLCYFCIPSTSPAAHS
            ILK GAIPGLL  TV                         +SST                       +  L+ RI                     
Subjt:  YRKPILKEGAIPGLLRLTVEGTAEAQERARRLLDLLRDSPQEKRMSSTDLERIVYKSLPRSMEVSLGKWVEISLLNLRIWFGYSLLCYFCIPSTSPAAHS

Query:  SHLTLIRSQTHLHQLQRREKQRDGVEFLESSSCLRQVRVRELLSNSFAPDQCPCGIEAFFGLFCVDLIVGSVGSLVSGKPILSFHFFCCTLLSGSIWKLM
                         R  Q  GV+ LESSS L QV                       G F   LIV  +G                        KLM
Subjt:  SHLTLIRSQTHLHQLQRREKQRDGVEFLESSSCLRQVRVRELLSNSFAPDQCPCGIEAFFGLFCVDLIVGSVGSLVSGKPILSFHFFCCTLLSGSIWKLM

Query:  VGKMLGRSSLYRTGSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTSTSNATFKTDSTVMKTGEDFMAANQTAFATF
        +GKMLGRSSLYR+GSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTS SNATFKTDSTVMKTGEDFMAANQTAFATF
Subjt:  VGKMLGRSSLYRTGSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTSTSNATFKTDSTVMKTGEDFMAANQTAFATF

Query:  LNETDIVKIIDAENSALGTHTEATSPECNGNVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSNDSTCDMAWRFRPKEGKTAAFYKDYRRFVI
        LNETDIVKIIDAENSALG+ TE  SPECN NVDDPIDCRDPEVFH+MMETTIERFKDIHFYRFGKPVRGSNDSTCDMAWRFRPKEGK A+FYKDYRRFVI
Subjt:  LNETDIVKIIDAENSALGTHTEATSPECNGNVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSNDSTCDMAWRFRPKEGKTAAFYKDYRRFVI

Query:  TRSTNCTLSIVSIGDYHTGVNARKRKKNPKHGFEKKMEQLEQAIALPVVGEVVNDSLPVVESEGSFSHGKYLVYEMGGDKCKSMNHYLWSFLCALGEAQY
        TRS NCTLSIVSIGDYH+GVNARK+KKNPK+ FEKKMEQLE A++LPVVGEVVNDSLPVVESEGSFS GKYLVYEM GDKCKSMNHYLWSFLCALGEAQY
Subjt:  TRSTNCTLSIVSIGDYHTGVNARKRKKNPKHGFEKKMEQLEQAIALPVVGEVVNDSLPVVESEGSFSHGKYLVYEMGGDKCKSMNHYLWSFLCALGEAQY

Query:  LNRTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKDHLSLLLVDDFRVTPMKLRDVKDALILRKFGSAEPDNYWY
        LNRTLVMDLKICLSSIYTSS QDEEGKDFRFYFDFEHLK+SA+ILDQGQFWSDWEKWQKKD L L LV+DFRVTPMKL DVKDALI RKFGSAEPDNYWY
Subjt:  LNRTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKDHLSLLLVDDFRVTPMKLRDVKDALILRKFGSAEPDNYWY

Query:  RVCEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTLQDKVENGRNLYIATNEPNTAFFDPLKDK
        RVCEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVH+VRGEKAKNKELWPNL  DTSPDTLLSTLQ+K+E GRNLYIATNEPNT +FDPLKDK
Subjt:  RVCEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTLQDKVENGRNLYIATNEPNTAFFDPLKDK

Query:  YSTHFLDEYKDLWDKDSEWYTETMNLNNG
        +STHFLDEYKDLW KDSEWYTETMNLNNG
Subjt:  YSTHFLDEYKDLWDKDSEWYTETMNLNNG

XP_004139993.2 uncharacterized protein LOC101217823 isoform X1 [Cucumis sativus]4.1e-29993.53Show/hide
Query:  LLSGSIWKLMVGKMLGRSSLYRTGSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTSTSNATFKTDSTVMKTGEDFM
        LL   IW LMVGKMLGRSSLYR GSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTS SNATFKTDSTVMKTGEDFM
Subjt:  LLSGSIWKLMVGKMLGRSSLYRTGSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTSTSNATFKTDSTVMKTGEDFM

Query:  AANQTAFATFLNETDIVKIIDAENSALGTHTEATSPECNGNVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSNDSTCDMAWRFRPKEGKTAA
        AANQTAFATFLNETDIVKIIDAENSALGT TE  S ECN NV+DPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSNDSTCDMAWRFRPKEGK AA
Subjt:  AANQTAFATFLNETDIVKIIDAENSALGTHTEATSPECNGNVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSNDSTCDMAWRFRPKEGKTAA

Query:  FYKDYRRFVITRSTNCTLSIVSIGDYHTGVNARKRKKNPKHGFEKKMEQLEQAI-ALPVVGEVVNDSLPVVESEGSFSHGKYLVYEMGGDKCKSMNHYLW
        FYKDYRRFVITRS NC+LSI+SIGDYHTGVNARKRKKNPKH FEKKMEQLEQA+ +LPVVGEVVNDSLPVVESEGSFS GKYL+YEMGGDKCKSMNHYLW
Subjt:  FYKDYRRFVITRSTNCTLSIVSIGDYHTGVNARKRKKNPKHGFEKKMEQLEQAI-ALPVVGEVVNDSLPVVESEGSFSHGKYLVYEMGGDKCKSMNHYLW

Query:  SFLCALGEAQYLNRTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKDHLSLLLVDDFRVTPMKLRDVKDALILRK
        SFLCALGEAQYLNRTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKD L L LV+D RVTPMKL DVKDALILRK
Subjt:  SFLCALGEAQYLNRTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKDHLSLLLVDDFRVTPMKLRDVKDALILRK

Query:  FGSAEPDNYWYRVCEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTLQDKVENGRNLYIATNEP
        FGSAEPDNYWYRVCEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKA+NKELWPNLAADTSPDTLLSTLQDK+E+GRNLYIATNEP
Subjt:  FGSAEPDNYWYRVCEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTLQDKVENGRNLYIATNEP

Query:  NTAFFDPLKDKYSTHFLDEYKDLWDKDSEWYTETMNLNNGV
        NT +FDPLKDKYSTHFL+EYKDLWDK SEWYTETMNLNNGV
Subjt:  NTAFFDPLKDKYSTHFLDEYKDLWDKDSEWYTETMNLNNGV

XP_008448130.1 PREDICTED: uncharacterized protein LOC103490419 [Cucumis melo]4.5e-29894.55Show/hide
Query:  MVGKMLGRSSLYRTGSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTSTSNATFKTDSTVMKTGEDFMAANQTAFAT
        MVGKMLGRSSLYR GSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTS SNATFKTDSTV+KTGEDFMAANQTAFAT
Subjt:  MVGKMLGRSSLYRTGSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTSTSNATFKTDSTVMKTGEDFMAANQTAFAT

Query:  FLNETDIVKIIDAENSALGTHTEATSPECNGNVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSNDSTCDMAWRFRPKEGKTAAFYKDYRRFV
        FLNETDIVKIIDAENSALGT TE  SPECN NVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSNDSTCDMAWRFRPKEGK AAFYKDYRRFV
Subjt:  FLNETDIVKIIDAENSALGTHTEATSPECNGNVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSNDSTCDMAWRFRPKEGKTAAFYKDYRRFV

Query:  ITRSTNCTLSIVSIGDYHTGVNARKRKKNPKHGFEKKMEQLEQAI-ALPVVGEVVNDSLPVVESEGSFSHGKYLVYEMGGDKCKSMNHYLWSFLCALGEA
        ITRS NC+LSI+SIGDYHTGVNARKRKKNPKH FEKKMEQLEQA+ +LPVVGEVVNDSLPVVESEGSFS GKYL+YEMGGDKCKSMNHYLWSFLCALGEA
Subjt:  ITRSTNCTLSIVSIGDYHTGVNARKRKKNPKHGFEKKMEQLEQAI-ALPVVGEVVNDSLPVVESEGSFSHGKYLVYEMGGDKCKSMNHYLWSFLCALGEA

Query:  QYLNRTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKDHLSLLLVDDFRVTPMKLRDVKDALILRKFGSAEPDNY
        QYLNRTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKD L L LV+DFRVTPM L DVKDALILRKFGSAEPDNY
Subjt:  QYLNRTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKDHLSLLLVDDFRVTPMKLRDVKDALILRKFGSAEPDNY

Query:  WYRVCEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTLQDKVENGRNLYIATNEPNTAFFDPLK
        WYRVCEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTLQDK+E+GRNLYIATNEPNT +FDPLK
Subjt:  WYRVCEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTLQDKVENGRNLYIATNEPNTAFFDPLK

Query:  DKYSTHFLDEYKDLWDKDSEWYTETMNLNNGV
        DKYSTHFL+EYKDLWDK SEWYTETMNLNNGV
Subjt:  DKYSTHFLDEYKDLWDKDSEWYTETMNLNNGV

XP_038900970.1 uncharacterized protein LOC120088020 [Benincasa hispida]4.4e-30195.3Show/hide
Query:  MVGKMLGRSSLYRTGSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTSTSNATFKTDSTVMKTGEDFMAANQTAFAT
        MVGKMLGRSSLYRTGSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTS SNATFKTDSTVMKTGEDFMAANQTAFAT
Subjt:  MVGKMLGRSSLYRTGSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTSTSNATFKTDSTVMKTGEDFMAANQTAFAT

Query:  FLNETDIVKIIDAENSALGTHTEATSPECNGNVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSNDSTCDMAWRFRPKEGKTAAFYKDYRRFV
        FLNETDIVKIIDAENSALGT +E T+PECN NVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSNDSTCDMAWRFRPKEGK AAFYKDYRRFV
Subjt:  FLNETDIVKIIDAENSALGTHTEATSPECNGNVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSNDSTCDMAWRFRPKEGKTAAFYKDYRRFV

Query:  ITRSTNCTLSIVSIGDYHTGVNARKRKKNPKHGFEKKMEQLEQAI-ALPVVGEVVNDSLPVVESEGSFSHGKYLVYEMGGDKCKSMNHYLWSFLCALGEA
        ITRS NCTLSI+SIGDYHTGVNARKRKKNPKH FEKKMEQLEQA+ +LPVVGEVVNDSLPVVESEGSFSHGKYL+YEM GDKCKSMNHYLWSFLCALGEA
Subjt:  ITRSTNCTLSIVSIGDYHTGVNARKRKKNPKHGFEKKMEQLEQAI-ALPVVGEVVNDSLPVVESEGSFSHGKYLVYEMGGDKCKSMNHYLWSFLCALGEA

Query:  QYLNRTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKDHLSLLLVDDFRVTPMKLRDVKDALILRKFGSAEPDNY
        QYLNRTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKDHL L LV+DFRVTPMKL DVKDALILRKFGSAEPDNY
Subjt:  QYLNRTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKDHLSLLLVDDFRVTPMKLRDVKDALILRKFGSAEPDNY

Query:  WYRVCEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTLQDKVENGRNLYIATNEPNTAFFDPLK
        WYRVCEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTLQDK+E GRNLYIATNEPNT +FDPLK
Subjt:  WYRVCEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTLQDKVENGRNLYIATNEPNTAFFDPLK

Query:  DKYSTHFLDEYKDLWDKDSEWYTETMNLNNGV
        DKYSTHFL+EYKDLWDKDSEWYTETM LNNGV
Subjt:  DKYSTHFLDEYKDLWDKDSEWYTETMNLNNGV

TrEMBL top hitse value%identityAlignment
A0A0A0KDZ2 Uncharacterized protein9.2e-29794.17Show/hide
Query:  MVGKMLGRSSLYRTGSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTSTSNATFKTDSTVMKTGEDFMAANQTAFAT
        MVGKMLGRSSLYR GSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTS SNATFKTDSTVMKTGEDFMAANQTAFAT
Subjt:  MVGKMLGRSSLYRTGSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTSTSNATFKTDSTVMKTGEDFMAANQTAFAT

Query:  FLNETDIVKIIDAENSALGTHTEATSPECNGNVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSNDSTCDMAWRFRPKEGKTAAFYKDYRRFV
        FLNETDIVKIIDAENSALGT TE  S ECN NV+DPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSNDSTCDMAWRFRPKEGK AAFYKDYRRFV
Subjt:  FLNETDIVKIIDAENSALGTHTEATSPECNGNVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSNDSTCDMAWRFRPKEGKTAAFYKDYRRFV

Query:  ITRSTNCTLSIVSIGDYHTGVNARKRKKNPKHGFEKKMEQLEQAI-ALPVVGEVVNDSLPVVESEGSFSHGKYLVYEMGGDKCKSMNHYLWSFLCALGEA
        ITRS NC+LSI+SIGDYHTGVNARKRKKNPKH FEKKMEQLEQA+ +LPVVGEVVNDSLPVVESEGSFS GKYL+YEMGGDKCKSMNHYLWSFLCALGEA
Subjt:  ITRSTNCTLSIVSIGDYHTGVNARKRKKNPKHGFEKKMEQLEQAI-ALPVVGEVVNDSLPVVESEGSFSHGKYLVYEMGGDKCKSMNHYLWSFLCALGEA

Query:  QYLNRTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKDHLSLLLVDDFRVTPMKLRDVKDALILRKFGSAEPDNY
        QYLNRTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKD L L LV+D RVTPMKL DVKDALILRKFGSAEPDNY
Subjt:  QYLNRTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKDHLSLLLVDDFRVTPMKLRDVKDALILRKFGSAEPDNY

Query:  WYRVCEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTLQDKVENGRNLYIATNEPNTAFFDPLK
        WYRVCEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKA+NKELWPNLAADTSPDTLLSTLQDK+E+GRNLYIATNEPNT +FDPLK
Subjt:  WYRVCEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTLQDKVENGRNLYIATNEPNTAFFDPLK

Query:  DKYSTHFLDEYKDLWDKDSEWYTETMNLNNGV
        DKYSTHFL+EYKDLWDK SEWYTETMNLNNGV
Subjt:  DKYSTHFLDEYKDLWDKDSEWYTETMNLNNGV

A0A1Q3BKT3 Arm domain-containing protein (Fragment)3.4e-30757.56Show/hide
Query:  ERESETWN------HKKQILIFQLSQRLIHGDLHSRIEAARDLRKLARKSSPKSRSKLGASSLIQPLVCMLLSPILMPVKPLYLPSSILLPATNGSGTSM
        E E  TWN      HK Q LI +LS +LI+GDL ++IEAARD+RK+ RKSS K+RSK  A+ +IQPLV ML SP L                        
Subjt:  ERESETWN------HKKQILIFQLSQRLIHGDLHSRIEAARDLRKLARKSSPKSRSKLGASSLIQPLVCMLLSPILMPVKPLYLPSSILLPATNGSGTSM

Query:  FGNSPPPPPPNFYWEEARFSEFWIISWFSLSRFMTCLTAEHTVPSSHTIPVISFIIRNKIKIVAAGAIPPLLELLKFQNLSLRELATAAILTLSAAASNK
                       +AR S        SL   +                      RNK+KIV AGAIPPL+ELLKFQ+ S RELA AAILTLSAAA NK
Subjt:  FGNSPPPPPPNFYWEEARFSEFWIISWFSLSRFMTCLTAEHTVPSSHTIPVISFIIRNKIKIVAAGAIPPLLELLKFQNLSLRELATAAILTLSAAASNK

Query:  PVILSAGAASLLVQILISGSVQAKVDAVTALYYLSACTESEDCSTILDPRA--------------------------IISKSDEGRTAISNSDGGILTLV
          I+++GAA LLV IL SGSVQ KVDAVT L+ LS  T  E+   IL+ +A                          I+S S+EGR AI++SDGGILTLV
Subjt:  PVILSAGAASLLVQILISGSVQAKVDAVTALYYLSACTESEDCSTILDPRA--------------------------IISKSDEGRTAISNSDGGILTLV

Query:  QTVEDGSLVSTEHAVGVLLSLCQSCRETYRKPILKEGAIPGLLRLTVEGTAEAQERARRLLDLLRDSPQEKRMSSTDLERIVYKSLPRSMEVSLGKWVEI
        +T+EDGSLVSTEHAVG LLSLCQS R+ YR  ILKEGAIPGLL LTV+GT EAQERAR LLDLLRD+P+EK+++++ LE+IVY    R            
Subjt:  QTVEDGSLVSTEHAVGVLLSLCQSCRETYRKPILKEGAIPGLLRLTVEGTAEAQERARRLLDLLRDSPQEKRMSSTDLERIVYKSLPRSMEVSLGKWVEI

Query:  SLLNLRIWFGYSLLCYFCIPSTSPAAHSSHLTLIRSQTHLHQLQRREKQRDGVEFLESSSCLRQVRVRELLSNSFAPDQCPCGIEAFFGLFCVDLIVGSV
                          +     AA ++   L        +L     Q   V++                 + F+P Q P      + L     +    
Subjt:  SLLNLRIWFGYSLLCYFCIPSTSPAAHSSHLTLIRSQTHLHQLQRREKQRDGVEFLESSSCLRQVRVRELLSNSFAPDQCPCGIEAFFGLFCVDLIVGSV

Query:  GSLVSGKPILSFHFFCCTLLSGSIWKLMVGKMLGRSSLYRTGSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTSTS
               P L+ H    T     +       MLGRS L RTGSFRPENLGQNALA+IGNLCFTLFV+GVLIFTIIAATY+PEDPLFHPSTKITTFLTSTS
Subjt:  GSLVSGKPILSFHFFCCTLLSGSIWKLMVGKMLGRSSLYRTGSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTSTS

Query:  NATFKTDSTVMKTGEDFMAANQTAFATFLNETDIVKIIDAENSALGTHTEATSPECNGNVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSND
        NATFK+D TV+KTGEDFMA+NQTAFATF+N TD+      EN +  +  +++S  C  +VD PIDC+DPEVFHLMM+ TIE FKDIHFY           
Subjt:  NATFKTDSTVMKTGEDFMAANQTAFATFLNETDIVKIIDAENSALGTHTEATSPECNGNVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSND

Query:  STCDMAWRFRPKEGKTAAFYKDYRRFVITRSTNCTLSIVSIGDYHTGVNARKRKKNPKHGFEKKMEQLEQAIALPVVGEVVNDSLPVVESEGSFSHGKYL
                 RPKEGK AAFYKDYRRFVI RS NCT S+VSIGDYH+G+NARK+KKN K GFEK   + +  + LPVVGE VND+LPVVESE SFS GKYL
Subjt:  STCDMAWRFRPKEGKTAAFYKDYRRFVITRSTNCTLSIVSIGDYHTGVNARKRKKNPKHGFEKKMEQLEQAIALPVVGEVVNDSLPVVESEGSFSHGKYL

Query:  VYEMGGDKCKSMNHYLWSFLCALGEAQYLNRTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKDHLSLLLVDDFR
        +Y  GGD+CKSMNHYLWSFLCALGEAQYLNRTLVMDL +CLSSIYTSS QDEEGKDFRFYFDFEHL ESAS+LD+ QFW DW KWQKKD LSL LV+DFR
Subjt:  VYEMGGDKCKSMNHYLWSFLCALGEAQYLNRTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKDHLSLLLVDDFR

Query:  VTPMKLRDVKDALILRKFGSAEPDNYWYRVCEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTL
        VTPMKL +VKDALI+RKFGS EPDNYWYRVCEGETESVV+RPWHL+WKSRR+MDIVS+IA+RLNWDYDSVHI RGEKAKNK+LWPNL ADTSPD LLSTL
Subjt:  VTPMKLRDVKDALILRKFGSAEPDNYWYRVCEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTL

Query:  QDKVENGRNLYIATNEPNTAFFDPLKDKYSTHFLDEYKDLWDKDSEWYTETMNLNNGV
         +K+E+GRNLY+ATNEP  +FFDPLKDKYSTHFLDEYKDLW+++SEWY ETM LN G+
Subjt:  QDKVENGRNLYIATNEPNTAFFDPLKDKYSTHFLDEYKDLWDKDSEWYTETMNLNNGV

A0A1S3BJV6 uncharacterized protein LOC1034904192.2e-29894.55Show/hide
Query:  MVGKMLGRSSLYRTGSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTSTSNATFKTDSTVMKTGEDFMAANQTAFAT
        MVGKMLGRSSLYR GSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTS SNATFKTDSTV+KTGEDFMAANQTAFAT
Subjt:  MVGKMLGRSSLYRTGSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTSTSNATFKTDSTVMKTGEDFMAANQTAFAT

Query:  FLNETDIVKIIDAENSALGTHTEATSPECNGNVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSNDSTCDMAWRFRPKEGKTAAFYKDYRRFV
        FLNETDIVKIIDAENSALGT TE  SPECN NVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSNDSTCDMAWRFRPKEGK AAFYKDYRRFV
Subjt:  FLNETDIVKIIDAENSALGTHTEATSPECNGNVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSNDSTCDMAWRFRPKEGKTAAFYKDYRRFV

Query:  ITRSTNCTLSIVSIGDYHTGVNARKRKKNPKHGFEKKMEQLEQAI-ALPVVGEVVNDSLPVVESEGSFSHGKYLVYEMGGDKCKSMNHYLWSFLCALGEA
        ITRS NC+LSI+SIGDYHTGVNARKRKKNPKH FEKKMEQLEQA+ +LPVVGEVVNDSLPVVESEGSFS GKYL+YEMGGDKCKSMNHYLWSFLCALGEA
Subjt:  ITRSTNCTLSIVSIGDYHTGVNARKRKKNPKHGFEKKMEQLEQAI-ALPVVGEVVNDSLPVVESEGSFSHGKYLVYEMGGDKCKSMNHYLWSFLCALGEA

Query:  QYLNRTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKDHLSLLLVDDFRVTPMKLRDVKDALILRKFGSAEPDNY
        QYLNRTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKD L L LV+DFRVTPM L DVKDALILRKFGSAEPDNY
Subjt:  QYLNRTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKDHLSLLLVDDFRVTPMKLRDVKDALILRKFGSAEPDNY

Query:  WYRVCEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTLQDKVENGRNLYIATNEPNTAFFDPLK
        WYRVCEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTLQDK+E+GRNLYIATNEPNT +FDPLK
Subjt:  WYRVCEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTLQDKVENGRNLYIATNEPNTAFFDPLK

Query:  DKYSTHFLDEYKDLWDKDSEWYTETMNLNNGV
        DKYSTHFL+EYKDLWDK SEWYTETMNLNNGV
Subjt:  DKYSTHFLDEYKDLWDKDSEWYTETMNLNNGV

A0A5A7V2W3 Uncharacterized protein1.2e-29694.7Show/hide
Query:  MLGRSSLYRTGSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTSTSNATFKTDSTVMKTGEDFMAANQTAFATFLNE
        MLGRSSLYR GSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTS SNATFKTDSTV+KTGEDFMAANQTAFATFLNE
Subjt:  MLGRSSLYRTGSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTSTSNATFKTDSTVMKTGEDFMAANQTAFATFLNE

Query:  TDIVKIIDAENSALGTHTEATSPECNGNVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSNDSTCDMAWRFRPKEGKTAAFYKDYRRFVITRS
        TDIVKIIDAENSALGT TE  SPECN NVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSNDSTCDMAWRFRPKEGK AAFYKDYRRFVITRS
Subjt:  TDIVKIIDAENSALGTHTEATSPECNGNVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSNDSTCDMAWRFRPKEGKTAAFYKDYRRFVITRS

Query:  TNCTLSIVSIGDYHTGVNARKRKKNPKHGFEKKMEQLEQAI-ALPVVGEVVNDSLPVVESEGSFSHGKYLVYEMGGDKCKSMNHYLWSFLCALGEAQYLN
         NC+LSI+SIGDYHTGVNARKRKKNPKH FEKKMEQLEQA+ +LPVVGEVVNDSLPVVESEGSFS GKYL+YEMGGDKCKSMNHYLWSFLCALGEAQYLN
Subjt:  TNCTLSIVSIGDYHTGVNARKRKKNPKHGFEKKMEQLEQAI-ALPVVGEVVNDSLPVVESEGSFSHGKYLVYEMGGDKCKSMNHYLWSFLCALGEAQYLN

Query:  RTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKDHLSLLLVDDFRVTPMKLRDVKDALILRKFGSAEPDNYWYRV
        RTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKD L L LV+DFRVTPMKL DVKDALILRKFGSAEPDNYWYRV
Subjt:  RTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKDHLSLLLVDDFRVTPMKLRDVKDALILRKFGSAEPDNYWYRV

Query:  CEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTLQDKVENGRNLYIATNEPNTAFFDPLKDKYS
        CEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTLQDK+E+GRNLYIATNEPNT +FDPLKDKYS
Subjt:  CEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTLQDKVENGRNLYIATNEPNTAFFDPLKDKYS

Query:  THFLDEYKDLWDKDSEWYTETMNLNNGV
        THFL+EYKDLWDK SEWYTETMNLNNGV
Subjt:  THFLDEYKDLWDKDSEWYTETMNLNNGV

A0A6J1CFX8 uncharacterized protein LOC1110110692.9e-29893.97Show/hide
Query:  MVGKMLGRSSLYRTGSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTSTSNATFKTDSTVMKTGEDFMAANQTAFAT
        MVGKMLGRSSL+RTGSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTS SNATFKTDSTVMKTGEDFMAANQTAFAT
Subjt:  MVGKMLGRSSLYRTGSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTSTSNATFKTDSTVMKTGEDFMAANQTAFAT

Query:  FLNETDIVKIIDAENSALGTHTEATSPECNGNVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSNDSTCDMAWRFRPKEGKTAAFYKDYRRFV
        FLNETDIVKIIDAENSALG   EA SPECNGNVDDPIDCRDPEVFHLMME TIE FKDIHFYRFGKPVRGSNDSTCDMAWRFRPKEGK AAFYKDYRRFV
Subjt:  FLNETDIVKIIDAENSALGTHTEATSPECNGNVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSNDSTCDMAWRFRPKEGKTAAFYKDYRRFV

Query:  ITRSTNCTLSIVSIGDYHTGVNARKRKKNPKHGFEKKMEQLEQAIALPVVGEVVNDSLPVVESEGSFSHGKYLVYEMGGDKCKSMNHYLWSFLCALGEAQ
        ITRS NCTLSI+SIGDYHTGVNARKRKKNPKH FEKKMEQLEQA+ALPVVGE VNDSLP+VESEGSFSHGKYLVYEM GDKCKSMNHYLWSFLCALGEAQ
Subjt:  ITRSTNCTLSIVSIGDYHTGVNARKRKKNPKHGFEKKMEQLEQAIALPVVGEVVNDSLPVVESEGSFSHGKYLVYEMGGDKCKSMNHYLWSFLCALGEAQ

Query:  YLNRTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKDHLSLLLVDDFRVTPMKLRDVKDALILRKFGSAEPDNYW
        YLNRTLVMDLKICLSSIYTSSNQDE+GKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKD L L LV+DFR+TPMKL+DV+DALILRKFGSAEPDNYW
Subjt:  YLNRTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKDHLSLLLVDDFRVTPMKLRDVKDALILRKFGSAEPDNYW

Query:  YRVCEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTLQDKVENGRNLYIATNEPNTAFFDPLKD
        YRVCEGETESVVKRPW+LIWKSRR+M+IVSSIASRLNWDYDSVHIVRGEKAKN+ELWPNLAADTSPDTLLSTLQDK+E+GRNLYIATNEPNTAFFDPLKD
Subjt:  YRVCEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTLQDKVENGRNLYIATNEPNTAFFDPLKD

Query:  KYSTHFLDEYKDLWDKDSEWYTETMNLNNGV
        KYSTHFLDEYKDLWDK SEWYTETM LNNGV
Subjt:  KYSTHFLDEYKDLWDKDSEWYTETMNLNNGV

SwissProt top hitse value%identityAlignment
O22193 U-box domain-containing protein 43.8e-1334.31Show/hide
Query:  NKIKIVAAGAIPPLLELLKFQNLSLRELATAAILTLSAAASNKPVILSAGAASLLVQILISGSVQAKVDAVTALYYLSACTESEDCST----------IL
        NK  I  AGAI PL+ +L+  +   +E + A + +LS    NK  I  +GA   LV +L +G+ + K DA TAL+ LS   E++              ++
Subjt:  NKIKIVAAGAIPPLLELLKFQNLSLRELATAAILTLSAAASNKPVILSAGAASLLVQILISGSVQAKVDAVTALYYLSACTESEDCST----------IL

Query:  DPRA-IISKS----------DEGRTAISNSDGGILTLVQTVEDGSLVSTEHAVGVLLSLCQSCRETYRKPILKEGAIPGLLRLTVEGTAEAQERARRLLD
        DP A ++ K+           EGR AI   +GGI  LV+ VE GS    E+A   LL L  +    +   +L+EGA+P L+ L+  GT  A+E+A+ LL 
Subjt:  DPRA-IISKS----------DEGRTAISNSDGGILTLVQTVEDGSLVSTEHAVGVLLSLCQSCRETYRKPILKEGAIPGLLRLTVEGTAEAQERARRLLD

Query:  LLRD
          R+
Subjt:  LLRD

O48700 U-box domain-containing protein 63.0e-1028.94Show/hide
Query:  RNKIKIVAAGAIPPLLELLKFQNLSLRELATAAILTLSAAASNKPVILSAGAASLLVQILISGS-VQAKVDAVTALYYLSACTESEDCSTILDPRAI---
        RNK  ++ +G IP L +++       +  ATA  L LS     KPVI S+ A S  V +L+  +  Q K+DA+ ALY LS  T S +  T+L    I   
Subjt:  RNKIKIVAAGAIPPLLELLKFQNLSLRELATAAILTLSAAASNKPVILSAGAASLLVQILISGS-VQAKVDAVTALYYLSACTESEDCSTILDPRAI---

Query:  ---------------------ISKSDEGRTAISNSDGGILTLVQTVEDGSLVSTEHAVGVLLSLCQSCRETYRKPILKEGAIPGLLRLTVEGTAEAQERA
                             ++ S EG+  +  + G I TL   ++ G  V  E AV  L+ LC    E+  + +L+EG IP L+ ++V G+   ++++
Subjt:  ---------------------ISKSDEGRTAISNSDGGILTLVQTVEDGSLVSTEHAVGVLLSLCQSCRETYRKPILKEGAIPGLLRLTVEGTAEAQERA

Query:  RRLLDLLRDSPQEKRMSSTDLERIVYKSLPRSMEV
        ++LL L R+  + +   S + E    K++   M +
Subjt:  RRLLDLLRDSPQEKRMSSTDLERIVYKSLPRSMEV

Q5XEZ8 U-box domain-containing protein 23.2e-1234.6Show/hide
Query:  NKIKIVAAGAIPPLLELLKFQNLSLRELATAAIL-TLSAAASNKPVILSAGAASLLVQILISGSVQAKVDAVTALYYLSACTESED----------CSTI
        NK  I  +GAI PL+ +LK   L   +  +AA L +LS     K  I  AGA   LV +L SGS+  K DA TAL+ LS   E++              +
Subjt:  NKIKIVAAGAIPPLLELLKFQNLSLRELATAAIL-TLSAAASNKPVILSAGAASLLVQILISGSVQAKVDAVTALYYLSACTESED----------CSTI

Query:  LDP------RAIISKSD-----EGRTAISNSDGGILTLVQTVEDGSLVSTEHAVGVLLSLCQSCRETYRKPILKEGAIPGLLRLTVEGTAEAQERARRLL
        +DP      +A++  ++     EG+ AI   +GGI  LV+ VE GS    E+A   LL LC    + +   +++EG IP L+ LT  GTA  +E+A+ LL
Subjt:  LDP------RAIISKSD-----EGRTAISNSDGGILTLVQTVEDGSLVSTEHAVGVLLSLCQSCRETYRKPILKEGAIPGLLRLTVEGTAEAQERARRLL

Query:  DLLRDSPQEKR
           +   Q  +
Subjt:  DLLRDSPQEKR

Q8GWV5 U-box domain-containing protein 32.2e-1334.07Show/hide
Query:  LTAEHTVPSSHTIPVISFIIRNKIKIVAAGAIPPLLELLKFQNLSLRELATAAILTLSAAASNKPVILSAGAA-SLLVQILISGSVQAKVDAVTALYYLS
        LT EH V +   + +      NK  IV  GAI PL+ +L   N   +E + A++ +LS    N+  I  + AA   LV +L  G+ + K DA +AL+ LS
Subjt:  LTAEHTVPSSHTIPVISFIIRNKIKIVAAGAIPPLLELLKFQNLSLRELATAAILTLSAAASNKPVILSAGAA-SLLVQILISGSVQAKVDAVTALYYLS

Query:  ACTESED----------CSTILDP-----------RAIISKSDEGRTAISNSDGGILTLVQTVEDGSLVSTEHAVGVLLSLCQSCRETYRKPILKEGAIP
           +++              +LDP            A +S   EGR AI   +GGI  LV+TV+ GS    E+A  VLL LC +  + +   +L+EGAIP
Subjt:  ACTESED----------CSTILDP-----------RAIISKSDEGRTAISNSDGGILTLVQTVEDGSLVSTEHAVGVLLSLCQSCRETYRKPILKEGAIP

Query:  GLLRLTVEGTAEAQERARRLLDLLRD
         L+ L+  GT  A+E+A++LL   R+
Subjt:  GLLRLTVEGTAEAQERARRLLDLLRD

Q8VZ40 U-box domain-containing protein 146.1e-1132.16Show/hide
Query:  TAEHTVPSSHTIPVISFIIRNKIKIVAAGAIPPLLELLKFQNLSLRELATAAILTLSAAASNKPVILSAGAASLLVQILISGSVQAKVDAVTALYYLSAC
        T EH+V +   +  +S    NK  IV AGAI  ++E+LK  ++  RE A A + +LS    NK  I +AGA   L+ +L  G+ + K DA TA++ L   
Subjt:  TAEHTVPSSHTIPVISFIIRNKIKIVAAGAIPPLLELLKFQNLSLRELATAAILTLSAAASNKPVILSAGAASLLVQILISGSVQAKVDAVTALYYLSAC

Query:  TESEDCST---ILDP-------------------RAIISKSDEGRTAISNSDGGILTLVQTVEDGSLVSTEHAVGVLLSLCQSCRETYRKPILKE-GAIP
          ++  +    I+DP                    AI+S + EG+TAI+ ++  I  LV+ +  GS  + E+A  +L  LC    E  R  + +E GA  
Subjt:  TESEDCST---ILDP-------------------RAIISKSDEGRTAISNSDGGILTLVQTVEDGSLVSTEHAVGVLLSLCQSCRETYRKPILKE-GAIP

Query:  GLLRLTVEGTAEAQERARRLLDLLRDS
         L  LT  GT  A+ +A  LL+L++ +
Subjt:  GLLRLTVEGTAEAQERARRLLDLLRDS

Arabidopsis top hitse value%identityAlignment
AT2G04280.1 unknown protein6.7e-22371.08Show/hide
Query:  MLGRSSLYRTGSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTSTSNATFKTDSTVMKTGEDFMAANQTAFATFLNE
        M GRS++ R G FR ENLGQNAL LIGN+ F+LFV GVLIFTIIAATYEPEDPLFHPS KITTFLTSTSNAT ++D +V+KTGEDFM ANQTAFA F+N 
Subjt:  MLGRSSLYRTGSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTSTSNATFKTDSTVMKTGEDFMAANQTAFATFLNE

Query:  TDIVKIIDAENSALGTHTEATSPECNGNVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSND-STCDMAWRFRPKEGKTAAFYKDYRRFVITR
         D    ++A  +   T  E    EC  +V+ PIDC+D +VFHLMM  TI++FKDIHFY+FGKPV G    ++CDMAWR+RP++GK+AAFYKDYRRFV+ +
Subjt:  TDIVKIIDAENSALGTHTEATSPECNGNVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSND-STCDMAWRFRPKEGKTAAFYKDYRRFVITR

Query:  STNCTLSIVSIGDYHTGVNARKRKKNPKHGFEKKMEQLEQAIALPVVGEVVNDSLPVVESEGSFSHGKYLVYEMGGDKCKSMNHYLWSFLCALGEAQYLN
        S NC++S+V IG+YH+G+NARKRKKN K GFEK   + +   +LPVVGE+VNDSLP+VES+  F  GKYLVY  GGD+CKSMNH+LWSFLCALGEAQYLN
Subjt:  STNCTLSIVSIGDYHTGVNARKRKKNPKHGFEKKMEQLEQAIALPVVGEVVNDSLPVVESEGSFSHGKYLVYEMGGDKCKSMNHYLWSFLCALGEAQYLN

Query:  RTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKK--DHLSLLLVDDFRVTPMKLRDVKDALILRKFGSAEPDNYWY
        RTLVMDL +CLSSIYTSS Q+EEGKDFRFYFDFEHLKE+AS+LD+ QFW+ W K +KK  + L+L LV+DFRVTPMKL  VKD LI+RKFGS EPDNYWY
Subjt:  RTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKK--DHLSLLLVDDFRVTPMKLRDVKDALILRKFGSAEPDNYWY

Query:  RVCEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTLQDKVENGRNLYIATNEPNTAFFDPLKDK
        RVCEG+ ESVVKRPWHL+WKSRRLM+IVS+IASRLNWDYD+VHI RGEKA+NKE+WPNL ADTSP  LLSTLQDKVE GR+LYIATNE   +FF+PLKDK
Subjt:  RVCEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTLQDKVENGRNLYIATNEPNTAFFDPLKDK

Query:  YSTHFLDEYKDLWDKDSEWYTETMNLNNG
        Y+THFL +YKDLWD+ SEWY+ET  LN G
Subjt:  YSTHFLDEYKDLWDKDSEWYTETMNLNNG

AT2G04360.1 unknown protein7.2e-8451.92Show/hide
Query:  MIANVNLISCNFS-LPSLPLRISKLGTAQQIQTATPRNHAPKLSPSSFRIRSPPISTSGEIQARVKW--VCSKNVGVRREVRLQETQLGKRSAVESERGD
        M+ +++LISCNFS LP L      L  + + QT T  +    L  S        I+   + + +V +  +C  +     E    + Q             
Subjt:  MIANVNLISCNFS-LPSLPLRISKLGTAQQIQTATPRNHAPKLSPSSFRIRSPPISTSGEIQARVKW--VCSKNVGVRREVRLQETQLGKRSAVESERGD

Query:  SGGGGNGGEGRDWTTSILLFVLWAALMYYVFNLAPNQTSSMDIYFLKKLLNLKNDDGFKMNEVLVSLWYIMGLWPLVYGMLLLPSGRSSDSNVPVWPFLG
                EGRDW++SILLF LW AL+YY FNLAP+QT + D+YFLKKLLNLK DDGF+MN++LV LWYIMGLWPLVY MLLLP+G    S  P WPF+ 
Subjt:  SGGGGNGGEGRDWTTSILLFVLWAALMYYVFNLAPNQTSSMDIYFLKKLLNLKNDDGFKMNEVLVSLWYIMGLWPLVYGMLLLPSGRSSDSNVPVWPFLG

Query:  LSFFLGAYSLLPYFVLWKPPPPPVEEAELKRWPLKFLESKLTAGITFAAGLGILFYTGLAGGTVWKEFYQYFRESRFIHIMSIDFMLLSSFAPFWVYNDM
        LSFF G Y+LLPYF LW PP PPV E EL++WPL  LESK+TAG+T  AGLGI+ Y+ +     W EFYQYFRES+FIH+ S+DF LLS+FAPFWVYNDM
Subjt:  LSFFLGAYSLLPYFVLWKPPPPPVEEAELKRWPLKFLESKLTAGITFAAGLGILFYTGLAGGTVWKEFYQYFRESRFIHIMSIDFMLLSSFAPFWVYNDM

Query:  TARKWYNQGSWL
        T RKW+++GSWL
Subjt:  TARKWYNQGSWL

AT4G08810.1 calcium ion binding8.7e-13846.69Show/hide
Query:  ENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTSTSNATFKTDSTVMKTGEDFMAANQTAFATFLNETDIVKIIDAENSALG
        E + QN + LI N+CF++FV  VLIFT+IA TY+P DP    +  +T  LT T NATFK D +++KTGED  ++  ++  +   E      I+   + +G
Subjt:  ENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTSTSNATFKTDSTVMKTGEDFMAANQTAFATFLNETDIVKIIDAENSALG

Query:  THTEATSPECNGNVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSNDSTCDMAWRFRPKEGKTAAFYKDYRRFVITRSTNCTLSIVSIGDYHT
          T   S +C+ ++   ++C DP V   +    ++ FK I F  +  PV GS    CD++WRFR K+ K+   Y+D+RRF      NCT  +     +H+
Subjt:  THTEATSPECNGNVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSNDSTCDMAWRFRPKEGKTAAFYKDYRRFVITRSTNCTLSIVSIGDYHT

Query:  GVNARK-RKKNPKHGFEKKMEQLEQAIALPVVGEVVNDSLPVVESEGSFSHGKYLVYEMGGDKCKSMNHYLWSFLCALGEAQYLNRTLVMDLKICLSSIY
        GVNAR+ R   P      +    E           +ND++P + S+ SF  GKYL Y  GGD CK MN Y+WSFLC LGEA YLNRT VMDL +CLSS Y
Subjt:  GVNARK-RKKNPKHGFEKKMEQLEQAIALPVVGEVVNDSLPVVESEGSFSHGKYLVYEMGGDKCKSMNHYLWSFLCALGEAQYLNRTLVMDLKICLSSIY

Query:  TSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQK--KDHLSLLLVDDFRVTPMKLRDVKDALILRKFGSAEPDNYWYRVCEGETESVVKRPW
        +S  +DEEGKDFR+YFDFEHLKE+ASI+++G+F  DW+KW +  K  + +  V   RV+P++L   K  +I R+F + EP+NYWYRVCEG+    V+RPW
Subjt:  TSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQK--KDHLSLLLVDDFRVTPMKLRDVKDALILRKFGSAEPDNYWYRVCEGETESVVKRPW

Query:  HLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTLQDKVENGRNLYIATNEPNTAFFDPLKDKYSTHFLDEYKDLWDK
        H +WKS+RLM+IVS I+ +++WD+D+VH+VRGEKAKNK+LWP+L ADT PD +L+ L+  V+  RNLY+ATNEP   +FD L+ +Y  H LD+Y  LW  
Subjt:  HLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTLQDKVENGRNLYIATNEPNTAFFDPLKDKYSTHFLDEYKDLWDK

Query:  DSEWYTETMNLNNG
         SEWY ET  LNNG
Subjt:  DSEWYTETMNLNNG

AT4G12700.1 unknown protein9.7e-22269.89Show/hide
Query:  MLGRSSLYRTGSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTSTSNATFKTDSTVMKTGEDFMAANQTAFATFLNE
        M GRS+  RTG FRPENLGQNA++LIG++ F++ V+GV++FTIIAATYEPEDPLFHPS KITTFLTS SNAT K+D +++KTGEDFMAANQTAF  F+N 
Subjt:  MLGRSSLYRTGSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTSTSNATFKTDSTVMKTGEDFMAANQTAFATFLNE

Query:  TDIVKIIDAENSALGTHTEATSPECNGNVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKP--VRGSNDSTCDMAWRFRPKEGKTAAFYKDYRRFVIT
         D+     +EN + G        +C+ N+  PIDC+DPEVFHLMM+ T+E+FKD HFY+FGKP  V GS+ S+CDMAWR+RPK+GK AAFYKDYRRFVI 
Subjt:  TDIVKIIDAENSALGTHTEATSPECNGNVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKP--VRGSNDSTCDMAWRFRPKEGKTAAFYKDYRRFVIT

Query:  RSTNCTLSIVSIGDYHTGVNARKRKKNPKHGFEKKMEQLEQAIALPVVGEVVNDSLPVVESEGSFSHGKYLVYEMGGDKCKSMNHYLWSFLCALGEAQYL
        +S NC++S++ IG+YH+GVNARKRK   + GF           ALPVVGE VNDSLPVVESE  F  G YLVY  GGD+CKSMNH+LWSFLCALGEAQYL
Subjt:  RSTNCTLSIVSIGDYHTGVNARKRKKNPKHGFEKKMEQLEQAIALPVVGEVVNDSLPVVESEGSFSHGKYLVYEMGGDKCKSMNHYLWSFLCALGEAQYL

Query:  NRTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKDHLSLLLVDDFRVTPMKLRDVKDALILRKFGSAEPDNYWYR
        NRTLVMDL +CLSS+YT S Q+EEGKDFRFYFDFEHLKE+AS+LDQ QFW+DW KW KK+ L L LV+DFRVTPMKL DVKD LI+RKFG+ EPDNYWYR
Subjt:  NRTLVMDLKICLSSIYTSSNQDEEGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKDHLSLLLVDDFRVTPMKLRDVKDALILRKFGSAEPDNYWYR

Query:  VCEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTLQDKVENGRNLYIATNEPNTAFFDPLKDKY
        VCEGETESVV+RPW+L+WKS+RLM+IVS+IASRLNWDYD++HI RG+KA+NKE+WPNL  DTSP ++LSTLQDK+E GRNLYIATNEP  +FF+PLKDKY
Subjt:  VCEGETESVVKRPWHLIWKSRRLMDIVSSIASRLNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTLQDKVENGRNLYIATNEPNTAFFDPLKDKY

Query:  STHFLDEYKDLWDKDSEWYTETMNLNNG
          HFLDE+KDLWD+ SEWY+ET  LN G
Subjt:  STHFLDEYKDLWDKDSEWYTETMNLNNG

AT4G12710.1 ARM repeat superfamily protein1.7e-6444.44Show/hide
Query:  QDEEIGFERESETWNHKKQILIFQLSQRLIHGDLHSRIEAARDLRKLARKSSPKS--RSKLGASSLIQPLVCMLLSPILMPVKPLYLPSSILLPATNGSG
        +++E   E  + TW   K++LI  LS++L+HGDL  RIEAA+++RKL RKS  KS  RSKL  + +I PLV ML S            S++         
Subjt:  QDEEIGFERESETWNHKKQILIFQLSQRLIHGDLHSRIEAARDLRKLARKSSPKS--RSKLGASSLIQPLVCMLLSPILMPVKPLYLPSSILLPATNGSG

Query:  TSMFGNSPPPPPPNFYWEEARFSEFWIISWFSLSRFMTCLTAEHTVPSSHTIPVISFIIRNKIKIVAAGAIPPLLELLKFQNLSLRELATAAILTLSAAA
                          +AR +        SL   +                      RNKI+IV AGA+PPL+++LK  N SLRELATAAILTLSAA 
Subjt:  TSMFGNSPPPPPPNFYWEEARFSEFWIISWFSLSRFMTCLTAEHTVPSSHTIPVISFIIRNKIKIVAAGAIPPLLELLKFQNLSLRELATAAILTLSAAA

Query:  SNKPVILSAGAASLLVQILISGSVQAKVDAVTALYYLSACTESEDCSTILDPRA---------------------------IISKSDEGRTAISNSDGGI
        +NK +I+S+G   LL+Q+L SG+VQ KVDAVTAL+ LSAC E    + ILD +A                           I+S S++GR AI++ + GI
Subjt:  SNKPVILSAGAASLLVQILISGSVQAKVDAVTALYYLSACTESEDCSTILDPRA---------------------------IISKSDEGRTAISNSDGGI

Query:  LTLVQTVEDGSLVSTEHAVGVLLSLCQSCRETYRKPILKEGAIPGLLRLTVEGTAEAQERARRLLDLLRDSPQEKRMSSTDLERIVY
        LTLV+TVEDGS +S EHAVG LLSLC+S R+ YRK ILKEGAIPGLL  TV+GT+++++RAR LLDLLR++P+EK M+   LE+IVY
Subjt:  LTLVQTVEDGSLVSTEHAVGVLLSLCQSCRETYRKPILKEGAIPGLLRLTVEGTAEAQERARRLLDLLRDSPQEKRMSSTDLERIVY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAGCTAATGTCAATCTTATCTCCTGCAACTTCTCGTTACCATCTCTTCCTTTAAGAATCTCCAAGCTTGGAACTGCCCAGCAAATTCAAACCGCAACTCCCAGAAA
TCACGCCCCCAAATTATCACCTTCAAGTTTTCGAATTCGGTCTCCCCCAATATCAACATCCGGGGAGATTCAAGCTCGAGTAAAATGGGTCTGCTCCAAAAATGTCGGGG
TGCGTCGGGAAGTCAGACTGCAGGAGACCCAGTTGGGGAAAAGGAGCGCAGTAGAAAGTGAGCGTGGTGATAGCGGCGGCGGCGGAAATGGTGGGGAGGGTAGAGACTGG
ACGACCTCGATTTTACTGTTTGTTTTGTGGGCTGCTCTTATGTATTATGTTTTCAATCTCGCTCCAAATCAGACTTCGTCAATGGACATTTACTTCCTGAAGAAACTTCT
GAACTTGAAAAACGACGATGGTTTCAAAATGAATGAAGTGCTTGTGTCTCTTTGGTATATTATGGGCTTGTGGCCCCTTGTGTATGGCATGCTGCTGCTTCCTTCTGGTC
GAAGCTCAGATAGCAATGTTCCTGTCTGGCCTTTCCTAGGACTGTCGTTCTTTCTGGGTGCTTACAGTCTTCTTCCATATTTTGTACTTTGGAAGCCACCACCACCTCCT
GTCGAAGAAGCCGAGCTCAAGAGATGGCCTTTGAAGTTTCTCGAGTCAAAATTAACTGCTGGGATAACATTTGCTGCAGGACTGGGGATATTATTTTATACTGGATTAGC
TGGTGGGACTGTGTGGAAGGAATTTTACCAGTACTTTAGAGAGAGCAGATTTATCCATATCATGAGCATCGATTTCATGCTACTATCTTCTTTTGCTCCATTTTGGGTTT
ACAACGACATGACTGCGCGAAAATGGTATAACCAAGGTTCTTGGCTCTTCCGCTTTCGTTGGTGCCGTTCTTGGGTCCTGCCTTGTATGTCATCCTACGACCGTTACCAA
CGACGACACCCCTTCCCCTCAACCCCGCTGCATCTGAACCAAAATAATCTTGTTTCTAAAGTGAACCAGAAAAAGGAACTAGGCTGCAAAGGCGGAAAAGAAAAACTTTC
AGAGAGTGTGGGCATCGTCAAGAGGATGAAGCAGGAGGAGAGAGAAGAGCAGGACGAGGAGATTGGGTTTGAGAGAGAATCAGAGACATGGAACCACAAAAAGCAAATTC
TAATCTTTCAGCTTTCTCAGAGACTCATCCATGGCGACCTCCATTCCCGAATCGAAGCTGCCAGAGATTTGAGGAAGCTTGCTCGGAAGTCTTCGCCCAAGTCCCGCTCC
AAGCTTGGAGCTTCTAGTCTGATTCAGCCACTTGTTTGCATGCTTCTCTCTCCTATCTTGATGCCCGTGAAGCCTCTCTACTTGCCCTCCTCAATCTTGCTTCCCGCAAC
GAACGGTTCAGGCACTAGTATGTTTGGAAACTCCCCCCCCCCCCCCCCCCCTAATTTTTACTGGGAAGAAGCGAGGTTCTCTGAGTTCTGGATCATAAGCTGGTTCTCTC
TTAGTCGTTTTATGACATGTCTTACAGCTGAACATACCGTTCCATCTTCTCACACCATTCCTGTTATTTCTTTTATTATCAGGAACAAGATAAAAATAGTTGCAGCTGGT
GCCATTCCCCCTCTTCTGGAGCTGCTCAAATTCCAAAACCTTAGTTTGAGGGAATTAGCCACTGCAGCAATCCTGACCTTGTCAGCTGCTGCATCCAATAAACCGGTTAT
CTTGTCTGCTGGTGCAGCATCTCTTCTGGTTCAGATTCTTATTTCTGGAAGTGTTCAAGCTAAAGTTGATGCAGTGACAGCTCTATACTATTTATCCGCCTGCACTGAAA
GTGAGGACTGCTCTACGATACTTGATCCAAGAGCGATCATTTCCAAGTCTGATGAAGGGCGAACTGCAATCTCCAATTCGGATGGTGGAATACTAACTTTAGTACAGACA
GTTGAAGATGGATCTCTTGTGAGCACCGAGCATGCGGTGGGAGTTCTACTCTCCCTGTGCCAGAGTTGCCGAGAGACATACCGCAAACCCATTTTAAAAGAAGGTGCAAT
CCCTGGCCTTTTGAGACTGACGGTAGAGGGCACAGCTGAAGCTCAAGAGAGAGCTCGCAGACTTTTGGATTTGCTAAGAGATTCTCCCCAAGAAAAGAGAATGAGTTCCA
CAGATCTGGAGAGAATAGTTTACAAATCGCTGCCGAGGTCGATGGAGGTAAGTCTTGGGAAATGGGTTGAGATTTCACTTCTAAATCTTAGGATTTGGTTTGGCTATTCT
CTGCTGTGCTACTTCTGTATTCCAAGCACTTCCCCTGCTGCTCACAGCTCGCATCTCACTCTTATTCGCTCGCAGACACATCTCCACCAACTTCAAAGGAGAGAGAAACA
ACGAGATGGGGTTGAATTCCTCGAGTCGTCTTCTTGCTTGAGGCAGGTAAGAGTTAGAGAACTTCTTTCTAATTCCTTTGCTCCAGATCAATGTCCTTGTGGAATTGAAG
CTTTTTTTGGGTTGTTTTGTGTGGATTTGATTGTTGGTTCTGTTGGGTCATTGGTCTCGGGGAAACCCATCTTGTCTTTCCATTTTTTTTGTTGCACATTGCTGTCGGGA
TCTATTTGGAAGTTGATGGTTGGAAAGATGTTGGGTCGGTCTTCTCTTTACAGAACTGGAAGCTTTCGACCAGAGAATCTGGGTCAAAATGCGCTTGCCCTGATTGGGAA
CCTTTGTTTCACTTTGTTCGTGGTTGGCGTTTTGATTTTTACGATAATTGCCGCCACGTACGAACCTGAGGACCCTCTTTTTCACCCATCGACCAAGATCACAACCTTCC
TCACATCTACCTCCAATGCCACTTTTAAAACTGATAGCACTGTGATGAAGACTGGGGAAGATTTCATGGCTGCCAACCAAACTGCATTTGCAACCTTTCTCAATGAAACT
GATATTGTCAAAATCATTGATGCCGAAAACTCTGCTTTGGGTACTCATACCGAAGCAACCTCGCCCGAGTGCAATGGCAATGTGGATGACCCCATCGATTGCCGCGACCC
CGAGGTTTTCCATTTGATGATGGAGACCACCATTGAACGATTCAAGGACATTCATTTTTACCGGTTTGGGAAACCAGTTCGTGGGTCTAATGACAGCACCTGTGATATGG
CGTGGCGGTTTCGGCCCAAGGAAGGGAAGACTGCTGCTTTTTACAAGGATTATAGGAGGTTTGTGATTACTAGATCTACGAATTGCACTCTTAGTATTGTTAGCATAGGT
GACTACCATACTGGTGTGAATGCAAGGAAGAGGAAAAAGAATCCAAAACATGGTTTTGAGAAGAAAATGGAGCAGCTGGAGCAGGCGATTGCTTTGCCTGTTGTTGGGGA
GGTTGTGAATGATTCTCTCCCGGTGGTCGAGTCTGAAGGCTCATTTAGTCATGGGAAGTACTTGGTTTATGAGATGGGTGGAGATAAATGCAAGAGCATGAACCATTACT
TGTGGAGTTTCTTGTGTGCTTTAGGTGAAGCTCAGTATTTGAACCGCACATTAGTTATGGATTTGAAAATTTGTCTGTCATCGATTTATACTTCATCGAATCAAGATGAG
GAAGGAAAAGATTTCAGGTTTTATTTTGATTTCGAGCATCTGAAAGAGTCTGCGTCCATCTTGGACCAGGGTCAGTTTTGGTCTGATTGGGAGAAATGGCAAAAGAAAGA
CCATTTAAGCCTCTTGCTTGTCGATGACTTTCGGGTCACACCAATGAAACTTAGAGATGTAAAGGATGCCTTGATATTGAGAAAGTTTGGATCTGCAGAGCCAGATAATT
ATTGGTATAGAGTCTGTGAAGGAGAAACTGAATCCGTAGTTAAGCGACCATGGCATTTGATATGGAAATCAAGACGGCTGATGGATATAGTATCTTCAATTGCATCGAGA
TTGAACTGGGATTATGACTCAGTCCACATAGTGAGAGGCGAGAAAGCAAAGAACAAGGAGCTCTGGCCAAATCTTGCTGCTGATACTTCACCTGATACGCTTCTATCAAC
ATTGCAAGACAAGGTTGAAAACGGAAGAAACCTTTACATTGCTACCAATGAACCAAACACGGCATTCTTCGACCCATTGAAAGACAAATACTCCACTCACTTTCTCGACG
AATACAAGGATCTTTGGGACAAAGACAGTGAATGGTACACTGAAACAATGAACCTTAACAATGGGGTTCAGTTGAATTTGATGGATATATGA
mRNA sequenceShow/hide mRNA sequence
ATGATAGCTAATGTCAATCTTATCTCCTGCAACTTCTCGTTACCATCTCTTCCTTTAAGAATCTCCAAGCTTGGAACTGCCCAGCAAATTCAAACCGCAACTCCCAGAAA
TCACGCCCCCAAATTATCACCTTCAAGTTTTCGAATTCGGTCTCCCCCAATATCAACATCCGGGGAGATTCAAGCTCGAGTAAAATGGGTCTGCTCCAAAAATGTCGGGG
TGCGTCGGGAAGTCAGACTGCAGGAGACCCAGTTGGGGAAAAGGAGCGCAGTAGAAAGTGAGCGTGGTGATAGCGGCGGCGGCGGAAATGGTGGGGAGGGTAGAGACTGG
ACGACCTCGATTTTACTGTTTGTTTTGTGGGCTGCTCTTATGTATTATGTTTTCAATCTCGCTCCAAATCAGACTTCGTCAATGGACATTTACTTCCTGAAGAAACTTCT
GAACTTGAAAAACGACGATGGTTTCAAAATGAATGAAGTGCTTGTGTCTCTTTGGTATATTATGGGCTTGTGGCCCCTTGTGTATGGCATGCTGCTGCTTCCTTCTGGTC
GAAGCTCAGATAGCAATGTTCCTGTCTGGCCTTTCCTAGGACTGTCGTTCTTTCTGGGTGCTTACAGTCTTCTTCCATATTTTGTACTTTGGAAGCCACCACCACCTCCT
GTCGAAGAAGCCGAGCTCAAGAGATGGCCTTTGAAGTTTCTCGAGTCAAAATTAACTGCTGGGATAACATTTGCTGCAGGACTGGGGATATTATTTTATACTGGATTAGC
TGGTGGGACTGTGTGGAAGGAATTTTACCAGTACTTTAGAGAGAGCAGATTTATCCATATCATGAGCATCGATTTCATGCTACTATCTTCTTTTGCTCCATTTTGGGTTT
ACAACGACATGACTGCGCGAAAATGGTATAACCAAGGTTCTTGGCTCTTCCGCTTTCGTTGGTGCCGTTCTTGGGTCCTGCCTTGTATGTCATCCTACGACCGTTACCAA
CGACGACACCCCTTCCCCTCAACCCCGCTGCATCTGAACCAAAATAATCTTGTTTCTAAAGTGAACCAGAAAAAGGAACTAGGCTGCAAAGGCGGAAAAGAAAAACTTTC
AGAGAGTGTGGGCATCGTCAAGAGGATGAAGCAGGAGGAGAGAGAAGAGCAGGACGAGGAGATTGGGTTTGAGAGAGAATCAGAGACATGGAACCACAAAAAGCAAATTC
TAATCTTTCAGCTTTCTCAGAGACTCATCCATGGCGACCTCCATTCCCGAATCGAAGCTGCCAGAGATTTGAGGAAGCTTGCTCGGAAGTCTTCGCCCAAGTCCCGCTCC
AAGCTTGGAGCTTCTAGTCTGATTCAGCCACTTGTTTGCATGCTTCTCTCTCCTATCTTGATGCCCGTGAAGCCTCTCTACTTGCCCTCCTCAATCTTGCTTCCCGCAAC
GAACGGTTCAGGCACTAGTATGTTTGGAAACTCCCCCCCCCCCCCCCCCCCTAATTTTTACTGGGAAGAAGCGAGGTTCTCTGAGTTCTGGATCATAAGCTGGTTCTCTC
TTAGTCGTTTTATGACATGTCTTACAGCTGAACATACCGTTCCATCTTCTCACACCATTCCTGTTATTTCTTTTATTATCAGGAACAAGATAAAAATAGTTGCAGCTGGT
GCCATTCCCCCTCTTCTGGAGCTGCTCAAATTCCAAAACCTTAGTTTGAGGGAATTAGCCACTGCAGCAATCCTGACCTTGTCAGCTGCTGCATCCAATAAACCGGTTAT
CTTGTCTGCTGGTGCAGCATCTCTTCTGGTTCAGATTCTTATTTCTGGAAGTGTTCAAGCTAAAGTTGATGCAGTGACAGCTCTATACTATTTATCCGCCTGCACTGAAA
GTGAGGACTGCTCTACGATACTTGATCCAAGAGCGATCATTTCCAAGTCTGATGAAGGGCGAACTGCAATCTCCAATTCGGATGGTGGAATACTAACTTTAGTACAGACA
GTTGAAGATGGATCTCTTGTGAGCACCGAGCATGCGGTGGGAGTTCTACTCTCCCTGTGCCAGAGTTGCCGAGAGACATACCGCAAACCCATTTTAAAAGAAGGTGCAAT
CCCTGGCCTTTTGAGACTGACGGTAGAGGGCACAGCTGAAGCTCAAGAGAGAGCTCGCAGACTTTTGGATTTGCTAAGAGATTCTCCCCAAGAAAAGAGAATGAGTTCCA
CAGATCTGGAGAGAATAGTTTACAAATCGCTGCCGAGGTCGATGGAGGTAAGTCTTGGGAAATGGGTTGAGATTTCACTTCTAAATCTTAGGATTTGGTTTGGCTATTCT
CTGCTGTGCTACTTCTGTATTCCAAGCACTTCCCCTGCTGCTCACAGCTCGCATCTCACTCTTATTCGCTCGCAGACACATCTCCACCAACTTCAAAGGAGAGAGAAACA
ACGAGATGGGGTTGAATTCCTCGAGTCGTCTTCTTGCTTGAGGCAGGTAAGAGTTAGAGAACTTCTTTCTAATTCCTTTGCTCCAGATCAATGTCCTTGTGGAATTGAAG
CTTTTTTTGGGTTGTTTTGTGTGGATTTGATTGTTGGTTCTGTTGGGTCATTGGTCTCGGGGAAACCCATCTTGTCTTTCCATTTTTTTTGTTGCACATTGCTGTCGGGA
TCTATTTGGAAGTTGATGGTTGGAAAGATGTTGGGTCGGTCTTCTCTTTACAGAACTGGAAGCTTTCGACCAGAGAATCTGGGTCAAAATGCGCTTGCCCTGATTGGGAA
CCTTTGTTTCACTTTGTTCGTGGTTGGCGTTTTGATTTTTACGATAATTGCCGCCACGTACGAACCTGAGGACCCTCTTTTTCACCCATCGACCAAGATCACAACCTTCC
TCACATCTACCTCCAATGCCACTTTTAAAACTGATAGCACTGTGATGAAGACTGGGGAAGATTTCATGGCTGCCAACCAAACTGCATTTGCAACCTTTCTCAATGAAACT
GATATTGTCAAAATCATTGATGCCGAAAACTCTGCTTTGGGTACTCATACCGAAGCAACCTCGCCCGAGTGCAATGGCAATGTGGATGACCCCATCGATTGCCGCGACCC
CGAGGTTTTCCATTTGATGATGGAGACCACCATTGAACGATTCAAGGACATTCATTTTTACCGGTTTGGGAAACCAGTTCGTGGGTCTAATGACAGCACCTGTGATATGG
CGTGGCGGTTTCGGCCCAAGGAAGGGAAGACTGCTGCTTTTTACAAGGATTATAGGAGGTTTGTGATTACTAGATCTACGAATTGCACTCTTAGTATTGTTAGCATAGGT
GACTACCATACTGGTGTGAATGCAAGGAAGAGGAAAAAGAATCCAAAACATGGTTTTGAGAAGAAAATGGAGCAGCTGGAGCAGGCGATTGCTTTGCCTGTTGTTGGGGA
GGTTGTGAATGATTCTCTCCCGGTGGTCGAGTCTGAAGGCTCATTTAGTCATGGGAAGTACTTGGTTTATGAGATGGGTGGAGATAAATGCAAGAGCATGAACCATTACT
TGTGGAGTTTCTTGTGTGCTTTAGGTGAAGCTCAGTATTTGAACCGCACATTAGTTATGGATTTGAAAATTTGTCTGTCATCGATTTATACTTCATCGAATCAAGATGAG
GAAGGAAAAGATTTCAGGTTTTATTTTGATTTCGAGCATCTGAAAGAGTCTGCGTCCATCTTGGACCAGGGTCAGTTTTGGTCTGATTGGGAGAAATGGCAAAAGAAAGA
CCATTTAAGCCTCTTGCTTGTCGATGACTTTCGGGTCACACCAATGAAACTTAGAGATGTAAAGGATGCCTTGATATTGAGAAAGTTTGGATCTGCAGAGCCAGATAATT
ATTGGTATAGAGTCTGTGAAGGAGAAACTGAATCCGTAGTTAAGCGACCATGGCATTTGATATGGAAATCAAGACGGCTGATGGATATAGTATCTTCAATTGCATCGAGA
TTGAACTGGGATTATGACTCAGTCCACATAGTGAGAGGCGAGAAAGCAAAGAACAAGGAGCTCTGGCCAAATCTTGCTGCTGATACTTCACCTGATACGCTTCTATCAAC
ATTGCAAGACAAGGTTGAAAACGGAAGAAACCTTTACATTGCTACCAATGAACCAAACACGGCATTCTTCGACCCATTGAAAGACAAATACTCCACTCACTTTCTCGACG
AATACAAGGATCTTTGGGACAAAGACAGTGAATGGTACACTGAAACAATGAACCTTAACAATGGGGTTCAGTTGAATTTGATGGATATATGA
Protein sequenceShow/hide protein sequence
MIANVNLISCNFSLPSLPLRISKLGTAQQIQTATPRNHAPKLSPSSFRIRSPPISTSGEIQARVKWVCSKNVGVRREVRLQETQLGKRSAVESERGDSGGGGNGGEGRDW
TTSILLFVLWAALMYYVFNLAPNQTSSMDIYFLKKLLNLKNDDGFKMNEVLVSLWYIMGLWPLVYGMLLLPSGRSSDSNVPVWPFLGLSFFLGAYSLLPYFVLWKPPPPP
VEEAELKRWPLKFLESKLTAGITFAAGLGILFYTGLAGGTVWKEFYQYFRESRFIHIMSIDFMLLSSFAPFWVYNDMTARKWYNQGSWLFRFRWCRSWVLPCMSSYDRYQ
RRHPFPSTPLHLNQNNLVSKVNQKKELGCKGGKEKLSESVGIVKRMKQEEREEQDEEIGFERESETWNHKKQILIFQLSQRLIHGDLHSRIEAARDLRKLARKSSPKSRS
KLGASSLIQPLVCMLLSPILMPVKPLYLPSSILLPATNGSGTSMFGNSPPPPPPNFYWEEARFSEFWIISWFSLSRFMTCLTAEHTVPSSHTIPVISFIIRNKIKIVAAG
AIPPLLELLKFQNLSLRELATAAILTLSAAASNKPVILSAGAASLLVQILISGSVQAKVDAVTALYYLSACTESEDCSTILDPRAIISKSDEGRTAISNSDGGILTLVQT
VEDGSLVSTEHAVGVLLSLCQSCRETYRKPILKEGAIPGLLRLTVEGTAEAQERARRLLDLLRDSPQEKRMSSTDLERIVYKSLPRSMEVSLGKWVEISLLNLRIWFGYS
LLCYFCIPSTSPAAHSSHLTLIRSQTHLHQLQRREKQRDGVEFLESSSCLRQVRVRELLSNSFAPDQCPCGIEAFFGLFCVDLIVGSVGSLVSGKPILSFHFFCCTLLSG
SIWKLMVGKMLGRSSLYRTGSFRPENLGQNALALIGNLCFTLFVVGVLIFTIIAATYEPEDPLFHPSTKITTFLTSTSNATFKTDSTVMKTGEDFMAANQTAFATFLNET
DIVKIIDAENSALGTHTEATSPECNGNVDDPIDCRDPEVFHLMMETTIERFKDIHFYRFGKPVRGSNDSTCDMAWRFRPKEGKTAAFYKDYRRFVITRSTNCTLSIVSIG
DYHTGVNARKRKKNPKHGFEKKMEQLEQAIALPVVGEVVNDSLPVVESEGSFSHGKYLVYEMGGDKCKSMNHYLWSFLCALGEAQYLNRTLVMDLKICLSSIYTSSNQDE
EGKDFRFYFDFEHLKESASILDQGQFWSDWEKWQKKDHLSLLLVDDFRVTPMKLRDVKDALILRKFGSAEPDNYWYRVCEGETESVVKRPWHLIWKSRRLMDIVSSIASR
LNWDYDSVHIVRGEKAKNKELWPNLAADTSPDTLLSTLQDKVENGRNLYIATNEPNTAFFDPLKDKYSTHFLDEYKDLWDKDSEWYTETMNLNNGVQLNLMDI