; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr022568 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr022568
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionUPF0496 protein 4
Genome locationtig00000289:1061893..1064917
RNA-Seq ExpressionSgr022568
SyntenySgr022568
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR008511 - Protein BYPASS-related


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578881.1 Protein ROH1, partial [Cucurbita argyrosperma subsp. sororia]6.4e-20586.28Show/hide
Query:  MPATDYQGSSAACTNIGRPVQSVRRDQLYAMDGSPTSQEQDLDCFQRQVTERFLDLASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHKSQISRPPL
        MPATDYQGSSAACTNIGR VQS+RRDQLYAMDGSPTSQE DLD FQRQV ERF+DLAS G D+LLSLSWVQKLL+SFLCCQEEFK VL +HKSQISRPPL
Subjt:  MPATDYQGSSAACTNIGRPVQSVRRDQLYAMDGSPTSQEQDLDCFQRQVTERFLDLASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASALAHRNRSFGRNNASKDHRSL
        DRLVADYSER+VKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASA+AHRNRSFGRNNASKD RSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASALAHRNRSFGRNNASKDHRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR
        GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSL RNFPWA PILQLHDRIVEESKKR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR

Query:  DRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQDGRIYPAPLWYVATMKDMVVQVLNSLRCQVMFLVS
        DRRNSCGLLKEI QIEKCTRLM+DLAD AQFPLAE+KEAELR+RVQEL+TVCNTLRTGLDSLERQ GR YPAPLW VATM+DMVVQ+L+           
Subjt:  DRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQDGRIYPAPLWYVATMKDMVVQVLNSLRCQVMFLVS

Query:  EHSKLLLTELKDLLRSWMLTLDG--KQLTT
                  +DL  SWML+L+G  K+LTT
Subjt:  EHSKLLLTELKDLLRSWMLTLDG--KQLTT

KAG6602188.1 Protein ROH1, partial [Cucurbita argyrosperma subsp. sororia]7.8e-19580.94Show/hide
Query:  MPATDYQGSSAACTNIGRPVQSVRRDQLYAMDGSPTSQEQDLDCFQRQVTERFLDLASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHKSQISRPPL
        MPATDYQGSSAA          VRRDQLYAM  SPTS E DLD FQRQV +RFL+LAS G+DDLLSLSWVQKLL+SFLCCQEEFK++L S KS IS+ PL
Subjt:  MPATDYQGSSAACTNIGRPVQSVRRDQLYAMDGSPTSQEQDLDCFQRQVTERFLDLASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASALAHRNRSFGRNNASKDHRSL
        DRLVADYSERSVKALDVCNAIRDG+EQLRQWQKLLEIV+SALDNCNHQKTLGEGQFRRAKKAL DLAICMLDEKDSH S LAHRNRSFGRNN +KD RSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASALAHRNRSFGRNNASKDHRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR
        GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPR+FPWAAPILQLHDRIVEES+KR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR

Query:  DRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQDGRIYPAPLWYVATMKDMVVQVLNSLRCQVMFLVS
        DRRNSCGLLKEI QIEKCTRLM+DLADAAQFPLAE+KEAELRQRVQEL+ VCNTLRTGLDS+ERQDGRIYPA LW+V TMKDMVV           F  S
Subjt:  DRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQDGRIYPAPLWYVATMKDMVVQVLNSLRCQVMFLVS

Query:  EHSKLLLTELKDLLRSWMLTLDGKQLTT----CASHKVTPHTASRL
            L+ +   D LR  MLTL+G+ LTT    C SHKVTP+ A R+
Subjt:  EHSKLLLTELKDLLRSWMLTLDGKQLTT----CASHKVTPHTASRL

KAG7016412.1 hypothetical protein SDJN02_21521, partial [Cucurbita argyrosperma subsp. argyrosperma]8.6e-19472.04Show/hide
Query:  MPATDYQGSSAACTNIGRPVQSVRRDQLYAMDGSPTSQEQDLDCFQRQVTERFLDLASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHKSQISRPPL
        MPATDYQGSSAACTNIGR VQS+RRDQLYAMDGSPTSQE DLD FQRQV ERF+DLAS G D+LLSLSWVQKLL+SFLCCQEEFK VL +HKSQISRPPL
Subjt:  MPATDYQGSSAACTNIGRPVQSVRRDQLYAMDGSPTSQEQDLDCFQRQVTERFLDLASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASALAHRNRSFGRNNASKDHRSL
        DRLVADYSER+VKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASA+AHRNRSFGRNNASKD RSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASALAHRNRSFGRNNASKDHRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR
        GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSL RNFPWA PILQLHDRIVEESKKR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR

Query:  DRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQ-----------------------------------
        DRRNSCGLLKEI QIEKCTRLM+DLAD AQFPLAE+KEAELR+RVQEL+TVCNTLRTGLDSLERQ                                   
Subjt:  DRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQ-----------------------------------

Query:  --------------------------------------------------DGRIYPAPLWYVATMKDMVVQVLNSLRCQVMFLVSEHSKLLLTELKDLLR
                                                           GR YPAPLW VATM+DMVVQ+L+                     +DL  
Subjt:  --------------------------------------------------DGRIYPAPLWYVATMKDMVVQVLNSLRCQVMFLVSEHSKLLLTELKDLLR

Query:  SWMLTLDG--KQLTT
        SWML+L+G  K+LTT
Subjt:  SWMLTLDG--KQLTT

XP_022939341.1 uncharacterized protein LOC111445285 isoform X1 [Cucurbita moschata]4.7e-19291.89Show/hide
Query:  MPATDYQGSSAACTNIGRPVQSVRRDQLYAMDGSPTSQEQDLDCFQRQVTERFLDLASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHKSQISRPPL
        MPATDYQGSSAACTNIGR VQS+RRDQLYAMDGSPTSQEQDLD FQRQV ERF+DLAS G D+LLSLSWVQKLL+SFLCCQEEFK VL +HKSQIS+PPL
Subjt:  MPATDYQGSSAACTNIGRPVQSVRRDQLYAMDGSPTSQEQDLDCFQRQVTERFLDLASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASALAHRNRSFGRNNASKDHRSL
        DRLVADYSER+VKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKAL+DLAICMLDEKDSHASA+AHRNRSFGRNNASKD RSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASALAHRNRSFGRNNASKDHRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR
        GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSL RNFPWA PILQLHDRIVEESKKR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR

Query:  DRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQDGRIY
        DRRNSCGLLKEI QIEKCTRLM+DLAD AQFPLAE+KEAELR+RVQEL+TVCNTLRTGLDSLERQ   ++
Subjt:  DRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQDGRIY

XP_022992890.1 uncharacterized protein LOC111489091 isoform X1 [Cucurbita maxima]3.6e-19292.16Show/hide
Query:  MPATDYQGSSAACTNIGRPVQSVRRDQLYAMDGSPTSQEQDLDCFQRQVTERFLDLASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHKSQISRPPL
        MPATDYQGSSAACTNIGR VQS+RRDQLYAMDGSPTSQEQDLD FQRQV ERF+DLAS G D+LLSLSWVQKLL+SFLCCQEEFK VL +HKSQIS+PPL
Subjt:  MPATDYQGSSAACTNIGRPVQSVRRDQLYAMDGSPTSQEQDLDCFQRQVTERFLDLASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASALAHRNRSFGRNNASKDHRSL
        DRLVADYSER+VKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASA+AHRNRSFGRNNASKD RSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASALAHRNRSFGRNNASKDHRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR
        GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSL RNFPWA PILQLHDRIVEESKKR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR

Query:  DRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQDGRIY
        DRRNSCGLLKEI QIEKCTRLM+DLAD AQFPLAE+KEAELR+RVQEL+TVCNTLRTGLDSLERQ   ++
Subjt:  DRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQDGRIY

TrEMBL top hitse value%identityAlignment
A0A5A7UI93 UPF0496 protein 46.3e-19091.35Show/hide
Query:  MPATDYQGSSAACTNIGRPVQSVRRDQLYAMDGSPTSQEQDLDCFQRQVTERFLDLASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHKSQISRPPL
        MPATDYQGSSAA TNIGRPVQ +RRDQLYAMDGSPTS EQDLD FQ+QV +RFLDLAS G DDLLSLSWV KLLNSFL CQE+FK+VL SHKSQISRPPL
Subjt:  MPATDYQGSSAACTNIGRPVQSVRRDQLYAMDGSPTSQEQDLDCFQRQVTERFLDLASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASALAHRNRSFGRNNASKDHRSL
        DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCN+QKTLGEGQFRRAKKALIDLAICMLDEKDSH SALAHRNRSFGRNNASKD RSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASALAHRNRSFGRNNASKDHRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR
        GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELL TNGLAVP+FTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWA+PILQLHDRIVEESKKR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR

Query:  DRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQDGRIY
        DRRNSCGLLKEI QIEKCTRLM+DLAD+AQFPLAE+KEAELRQRVQEL+TVC+TLRTGLDSLERQ   ++
Subjt:  DRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQDGRIY

A0A6J1BW97 uncharacterized protein LOC1110062929.6e-19191.89Show/hide
Query:  MPATDYQGSSAACTNIGRPVQSVRRDQLYAMDGSPTSQEQDLDCFQRQVTERFLDLASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHKSQISRPPL
        MPATDYQGSSAA TNIGRP QS+RRDQL+AMDGSPTS EQDLD FQRQV +RFLDLAS GSDDLLSLSW+ KLLNSFL CQEEFKIVL SHKSQISRPPL
Subjt:  MPATDYQGSSAACTNIGRPVQSVRRDQLYAMDGSPTSQEQDLDCFQRQVTERFLDLASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASALAHRNRSFGRNNASKDHRSL
        DR+VADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQ+TLGEGQFRRAKKALIDLAICMLDEKDSH SALAHRNRSFGRNN SK+ RSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASALAHRNRSFGRNNASKDHRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR
        GHFRSLSWSVSRSWSAARQLQSIGN+LAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPR+FPWAAPILQLHDRIVEESKKR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR

Query:  DRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQDGRIY
        DRRNSCGLLKEI QIEKCTRLM+DLADA QFPLAEDKEAELRQRVQELSTVC+TLRTGLDSLERQ   ++
Subjt:  DRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQDGRIY

A0A6J1FFL7 uncharacterized protein LOC111445285 isoform X12.3e-19291.89Show/hide
Query:  MPATDYQGSSAACTNIGRPVQSVRRDQLYAMDGSPTSQEQDLDCFQRQVTERFLDLASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHKSQISRPPL
        MPATDYQGSSAACTNIGR VQS+RRDQLYAMDGSPTSQEQDLD FQRQV ERF+DLAS G D+LLSLSWVQKLL+SFLCCQEEFK VL +HKSQIS+PPL
Subjt:  MPATDYQGSSAACTNIGRPVQSVRRDQLYAMDGSPTSQEQDLDCFQRQVTERFLDLASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASALAHRNRSFGRNNASKDHRSL
        DRLVADYSER+VKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKAL+DLAICMLDEKDSHASA+AHRNRSFGRNNASKD RSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASALAHRNRSFGRNNASKDHRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR
        GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSL RNFPWA PILQLHDRIVEESKKR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR

Query:  DRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQDGRIY
        DRRNSCGLLKEI QIEKCTRLM+DLAD AQFPLAE+KEAELR+RVQEL+TVCNTLRTGLDSLERQ   ++
Subjt:  DRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQDGRIY

A0A6J1JUS7 uncharacterized protein LOC111489091 isoform X23.0e-19293.42Show/hide
Query:  MPATDYQGSSAACTNIGRPVQSVRRDQLYAMDGSPTSQEQDLDCFQRQVTERFLDLASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHKSQISRPPL
        MPATDYQGSSAACTNIGR VQS+RRDQLYAMDGSPTSQEQDLD FQRQV ERF+DLAS G D+LLSLSWVQKLL+SFLCCQEEFK VL +HKSQIS+PPL
Subjt:  MPATDYQGSSAACTNIGRPVQSVRRDQLYAMDGSPTSQEQDLDCFQRQVTERFLDLASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASALAHRNRSFGRNNASKDHRSL
        DRLVADYSER+VKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASA+AHRNRSFGRNNASKD RSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASALAHRNRSFGRNNASKDHRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR
        GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSL RNFPWA PILQLHDRIVEESKKR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR

Query:  DRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQ
        DRRNSCGLLKEI QIEKCTRLM+DLAD AQFPLAE+KEAELR+RVQEL+TVCNTLRTGLDSLERQ
Subjt:  DRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQ

A0A6J1JWZ2 uncharacterized protein LOC111489091 isoform X11.8e-19292.16Show/hide
Query:  MPATDYQGSSAACTNIGRPVQSVRRDQLYAMDGSPTSQEQDLDCFQRQVTERFLDLASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHKSQISRPPL
        MPATDYQGSSAACTNIGR VQS+RRDQLYAMDGSPTSQEQDLD FQRQV ERF+DLAS G D+LLSLSWVQKLL+SFLCCQEEFK VL +HKSQIS+PPL
Subjt:  MPATDYQGSSAACTNIGRPVQSVRRDQLYAMDGSPTSQEQDLDCFQRQVTERFLDLASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASALAHRNRSFGRNNASKDHRSL
        DRLVADYSER+VKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASA+AHRNRSFGRNNASKD RSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASALAHRNRSFGRNNASKDHRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR
        GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSL RNFPWA PILQLHDRIVEESKKR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR

Query:  DRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQDGRIY
        DRRNSCGLLKEI QIEKCTRLM+DLAD AQFPLAE+KEAELR+RVQEL+TVCNTLRTGLDSLERQ   ++
Subjt:  DRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQDGRIY

SwissProt top hitse value%identityAlignment
A2Z9A6 UPF0496 protein 49.8e-0722.53Show/hide
Query:  LASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHKSQISRPPL---DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLG
        L    + D+L+LSW++  ++    C  E    + +  + +  P     D+ V  Y   SVK LD+C A+   + +L Q Q LL+  L  L   +      
Subjt:  LASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHKSQISRPPL---DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLG

Query:  EGQFRRAKKALIDLAICMLDEKDSHASALAHRNRSFGRNNASKDHRSLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVL
        + Q +RA+ +L                               ++   L   R       R  S +  LQ +  NL+  K    +    L   ++ +  V 
Subjt:  EGQFRRAKKALIDLAICMLDEKDSHASALAHRNRSFGRNNASKDHRSLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVL

Query:  LFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKRDRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRV
        +FV    VA +    + L V   +P  F W+     LH  + EE  ++    S   +KE++++E C + +  LA  +Q    E++ A L   V
Subjt:  LFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKRDRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRV

Q337C0 UPF0496 protein 44.4e-0722.87Show/hide
Query:  LASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHKSQISRPPL---DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLG
        L    + D+L+LSW++  ++    C  E    + +  + +  P     D+ V  Y   SVK LD+C A+   + +L Q Q LL+  L  L   +      
Subjt:  LASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHKSQISRPPL---DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLG

Query:  EGQFRRAKKALIDLAICMLDEKDSHASALAHRNRSFGRNNASKDHRSLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVL
        + Q +RA+ +L                               ++   L   R      +R  S +  LQ +  NL+  K         L   ++ +  V 
Subjt:  EGQFRRAKKALIDLAICMLDEKDSHASALAHRNRSFGRNNASKDHRSLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVL

Query:  LFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKRDRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRV
        +FV    VA +    + L V   +P  F W+     LH  + EE  ++    S   +KE++++E C R +  LA  +Q    E++ A L   V
Subjt:  LFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKRDRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRV

Q9CAK4 Protein ROH11.3e-6234.65Show/hide
Query:  PATDYQGSSAACTNIGRPVQSVRRDQLYAMDGSPTSQEQDLDCFQRQVTERFLDLAS----------------AGSDDLLSLSWVQKLLNSFLCCQEEFK
        PA D QGS      +GR   S+RR+Q   +D +   +++DL+ FQ+ + +RF +L S                A ++ ++S++W++KL++ FLCC+ EFK
Subjt:  PATDYQGSSAACTNIGRPVQSVRRDQLYAMDGSPTSQEQDLDCFQRQVTERFLDLAS----------------AGSDDLLSLSWVQKLLNSFLCCQEEFK

Query:  IVLFSHK--SQISRPPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICM-LDEKDS------
         +L   +  +QIS+PP DRLV +  +RS+KALD+C A+ +GI+ +R +Q+L EI ++AL+    Q+ LG+G  RRAK+AL +L + + L++K++      
Subjt:  IVLFSHK--SQISRPPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICM-LDEKDS------

Query:  ---HASALAHRNRSFGRNN-----ASKDHRSLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDR
             +    R+ SFGR +     ASK   ++G  +S SW+V R+WSAA+Q+ ++  NL  P+  E     GL  P+F M+ V++FVMW L AA+PCQ+R
Subjt:  ---HASALAHRNRSFGRNN-----ASKDHRSLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDR

Query:  -GLQVHFSL-PRNFPWAAPILQLHDRIVEESKKRDRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQD
         GL  H  + P++  WA  ++ +H++I +E KK++++ S GL++E+ ++EK    + + AD   +P  +D       +V E++ +C  +   L  L++Q 
Subjt:  -GLQVHFSL-PRNFPWAAPILQLHDRIVEESKKRDRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQD

Query:  GRIY
          ++
Subjt:  GRIY

Arabidopsis top hitse value%identityAlignment
AT1G18740.1 Protein of unknown function (DUF793)1.0e-12862.27Show/hide
Query:  MPATDYQGSSAACTNIGRPVQSVRRDQL---YAMDGS-----PTSQEQDLDCFQRQVTERFLDLASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHK
        MPATD+QGS       GR + S+RRDQ+     + GS     P++ E +LD FQRQV E+F+DL +A S+DLLSL W+ KLL+SFLCCQEEF+ ++F+H+
Subjt:  MPATDYQGSSAACTNIGRPVQSVRRDQL---YAMDGS-----PTSQEQDLDCFQRQVTERFLDLASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHK

Query:  SQISRPPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASA-LAHRNRSFGRN
        SQIS+ P+DRL++DY ERS+KALDVCNAIRDGIEQ+RQW+KL +IV+SALD+    + +GEGQ RRAKKALIDLAI MLDEKD  +   LAHRNRSFGR 
Subjt:  SQISRPPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASA-LAHRNRSFGRN

Query:  NASKDHRSLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHD
          S  HRS+GHFRSLSWSVSRSWSA++QLQ++ +NLA P+  +++A+NGLAVP++TM  VLLFVMW LVAAIPCQDRGLQV+F +PR+F WAAP++ LHD
Subjt:  NASKDHRSLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHD

Query:  RIVEESKKRDRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQDGRIY
        +IVEESK+RDR+N CGLLKEI +IEK +RLM++L D+  FPL +DKE E++QRV EL  V   LR GLD  ER+   ++
Subjt:  RIVEESKKRDRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQDGRIY

AT1G43630.1 Protein of unknown function (DUF793)4.8e-11858.45Show/hide
Query:  MPATDYQGSSAACTNIGRPVQSVRRDQLYAMD----GSPTSQEQDLDCFQRQVTERFLDL-ASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHKSQI
        MP T+Y        + GR   S+RRDQ + MD      P + E +LD FQRQV E+F+DL ASA   ++LSL W+ KLL+SFLCCQE+F++++F+HK Q+
Subjt:  MPATDYQGSSAACTNIGRPVQSVRRDQLYAMD----GSPTSQEQDLDCFQRQVTERFLDL-ASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHKSQI

Query:  SRPPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASALAHRNRSFGRNNASK
         + P+DRL+ +Y ERSVKALDVCNAIRDGIEQ+RQWQKL+EIV+SALD   +Q+ LGEG+  RAKKALIDLAI MLDEKD   S+  HRNRSF RN   K
Subjt:  SRPPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASALAHRNRSFGRNNASK

Query:  DH-RSLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIV
        DH + +G+ RSLSWSVSRSWSA+RQLQ IGNNLA P+A++++ATNGLA+ ++TM  +LLFV W LVAAIPCQDRGL VHF  PR+F WA P++ LHD+I+
Subjt:  DH-RSLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIV

Query:  EESKKRD-RRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDK-EAELRQRVQELSTVCNTLRTGLDSLERQ
        +ESKKRD ++  CGLL+EI QIE+ +R++SDL D+  F L ++K   E+++RVQEL  VC  ++ GLD  +R+
Subjt:  EESKKRD-RRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDK-EAELRQRVQELSTVCNTLRTGLDSLERQ

AT1G63930.1 from the Czech 'roh' meaning 'corner'9.0e-6434.65Show/hide
Query:  PATDYQGSSAACTNIGRPVQSVRRDQLYAMDGSPTSQEQDLDCFQRQVTERFLDLAS----------------AGSDDLLSLSWVQKLLNSFLCCQEEFK
        PA D QGS      +GR   S+RR+Q   +D +   +++DL+ FQ+ + +RF +L S                A ++ ++S++W++KL++ FLCC+ EFK
Subjt:  PATDYQGSSAACTNIGRPVQSVRRDQLYAMDGSPTSQEQDLDCFQRQVTERFLDLAS----------------AGSDDLLSLSWVQKLLNSFLCCQEEFK

Query:  IVLFSHK--SQISRPPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICM-LDEKDS------
         +L   +  +QIS+PP DRLV +  +RS+KALD+C A+ +GI+ +R +Q+L EI ++AL+    Q+ LG+G  RRAK+AL +L + + L++K++      
Subjt:  IVLFSHK--SQISRPPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICM-LDEKDS------

Query:  ---HASALAHRNRSFGRNN-----ASKDHRSLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDR
             +    R+ SFGR +     ASK   ++G  +S SW+V R+WSAA+Q+ ++  NL  P+  E     GL  P+F M+ V++FVMW L AA+PCQ+R
Subjt:  ---HASALAHRNRSFGRNN-----ASKDHRSLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDR

Query:  -GLQVHFSL-PRNFPWAAPILQLHDRIVEESKKRDRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQD
         GL  H  + P++  WA  ++ +H++I +E KK++++ S GL++E+ ++EK    + + AD   +P  +D       +V E++ +C  +   L  L++Q 
Subjt:  -GLQVHFSL-PRNFPWAAPILQLHDRIVEESKKRDRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQD

Query:  GRIY
          ++
Subjt:  GRIY

AT1G74450.1 Protein of unknown function (DUF793)1.1e-12759.42Show/hide
Query:  MPATDYQGSSAACTNIGRPVQSVRRD------QLYAMDGSPTSQEQDLDCFQRQVTERFLDLASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHKSQ
        MPAT+YQ S       GR   ++RRD      +   +    T  E +L  FQR+V ERF+DL ++  +DLLSL WV KLL+SFL CQEEF+ ++ +H+S 
Subjt:  MPATDYQGSSAACTNIGRPVQSVRRD------QLYAMDGSPTSQEQDLDCFQRQVTERFLDLASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHKSQ

Query:  ISRPPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDN----CNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASALA--HRNRSF
        I++PP+DRLV+DY ERSVKALDVCNAIRDG+EQ+RQWQKL+EIV+ A +N     + ++ LGEGQFRRA+K LI+LAI MLDEKDS +S+++  HRNRSF
Subjt:  ISRPPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDN----CNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASALA--HRNRSF

Query:  GRNNASKDHRSLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQ
        GRN     HR++GHFRSLSWSVSRSWSA++QLQ+IGNNLA P+A+++ ATNGL VP++TM  VLLFVMWALVAAIPCQDRGLQVHF++PRN+ W   ++ 
Subjt:  GRNNASKDHRSLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQ

Query:  LHDRIVEESKKRDRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQDGRIY
        LHDRI+EESKKR+R+N+CGLLKEI Q EK +RLM++L D+ QFPL+E+KE E+R+RV+EL  +   L+ GLD  ER+   ++
Subjt:  LHDRIVEESKKRDRRNSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQDGRIY

AT4G23530.1 Protein of unknown function (DUF793)3.4e-5533.92Show/hide
Query:  ATDYQGSSAACTNIGRPVQSVRRDQLYAMDGSPTSQEQDLDCFQRQVTERFLDL--------------ASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVL
        AT++QGS  +         S+RR+Q+ +MD +   + ++L+ FQ+ V ERF DL               S  SD +LS+ W+Q LL+ F+ C+ EFK VL
Subjt:  ATDYQGSSAACTNIGRPVQSVRRDQLYAMDGSPTSQEQDLDCFQRQVTERFLDL--------------ASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVL

Query:  FSHKSQISR-PPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASALAHRNRS
         +  +QIS+ P L+R++ +  +R +KALD+CNA+ +GI+ +RQ ++  EI ++AL     Q+ L +G  RRAK+AL  L I +        +A   R+R+
Subjt:  FSHKSQISR-PPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASALAHRNRS

Query:  FGRNNASKDHRSLGHFRSLSWS----------------VSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQ
         G +  S   R+     S SWS                VS++WSA++Q+Q++  NL  P+  E    +G A+P++ M+ V++ VMW LVAA+PCQ   + 
Subjt:  FGRNNASKDHRSLGHFRSLSWS----------------VSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQ

Query:  V-HFSLPRNFPWAAPILQLHDRIVEESKKRDRR-NSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQDGRI
        V    LP++  WA+  + + +RI EE K++++R    GL++E+Q++EK    + + A+  +FP  E++E E+ ++V E+  +C  +  GL+ L+RQ  ++
Subjt:  V-HFSLPRNFPWAAPILQLHDRIVEESKKRDRR-NSCGLLKEIQQIEKCTRLMSDLADAAQFPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQDGRI

Query:  Y
        +
Subjt:  Y


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCGCAACAGATTACCAGGGTTCGTCGGCTGCTTGTACCAACATTGGGCGTCCGGTTCAGAGTGTCCGGCGGGATCAGCTTTACGCGATGGATGGTTCTCCGACGTC
TCAGGAGCAGGATCTCGACTGCTTTCAGAGGCAAGTTACCGAACGGTTTCTGGACCTTGCATCGGCTGGTTCCGATGATCTGCTTTCCCTATCGTGGGTTCAGAAGCTTC
TAAATTCCTTCCTATGCTGCCAGGAGGAGTTTAAAATCGTTCTCTTTAGCCACAAATCTCAAATCTCTAGACCCCCATTGGACCGCTTGGTTGCTGACTATTCGGAGCGT
AGCGTTAAAGCACTTGATGTGTGTAATGCGATTCGTGATGGAATTGAGCAGCTGAGGCAATGGCAGAAACTTTTGGAGATTGTTCTCAGTGCATTGGACAATTGTAATCA
CCAGAAGACTCTCGGGGAGGGTCAATTCCGCCGCGCCAAGAAGGCCCTCATCGATTTGGCTATTTGTATGCTTGATGAGAAAGATTCACATGCCTCTGCCCTTGCCCACC
GCAACCGTTCTTTTGGACGCAATAATGCCTCGAAAGATCACCGGTCCTTGGGCCACTTTCGATCGCTGTCTTGGAGCGTTTCTCGATCGTGGTCAGCCGCGAGGCAGCTG
CAATCAATTGGAAATAATTTGGCGGCCCCTAAAGCGACTGAGCTTCTGGCTACTAATGGCCTTGCAGTTCCTATTTTTACCATGAACATGGTGTTATTGTTCGTAATGTG
GGCGCTGGTGGCCGCTATTCCTTGTCAGGACCGGGGTTTGCAGGTTCATTTCTCTTTGCCCCGGAATTTTCCGTGGGCGGCTCCAATCCTTCAACTGCATGATCGAATTG
TGGAGGAGTCCAAGAAGCGAGATCGAAGAAATTCCTGTGGGCTGTTGAAGGAGATTCAACAGATTGAAAAGTGCACGCGTCTCATGAGTGATTTGGCCGATGCAGCTCAG
TTCCCATTGGCAGAGGATAAAGAAGCGGAGCTGAGACAGAGAGTGCAAGAGCTATCAACGGTTTGTAATACCCTGAGGACTGGATTGGACTCATTGGAGCGGCAGGATGG
AAGAATCTATCCTGCACCACTATGGTATGTGGCTACGATGAAGGATATGGTGGTCCAGGTTCTCAACTCTTTGAGGTGCCAAGTCATGTTCCTTGTCAGTGAACATTCAA
AATTGTTACTGACAGAATTGAAGGATCTGCTACGTTCGTGGATGCTGACCCTTGACGGAAAGCAGTTGACAACTTGTGCAAGTCATAAAGTCACTCCACACACTGCGTCG
AGACTTGGAGCTTTCAGTCGCTTTGTTCGTCATCAACAATCGAGTACAACCCTCATGGACAGATCACTGCTCCTTATCTCTTCAATCACAAAAAACTCTCTTCCCATTCA
GATGCTTTGTGTTGCTGCCTCTGCCTTGCTTTGCCCTCACCCTCATCACCTCCCTCTCTCTCTCTCTCTCTCTCTCTCTCTCATTACCGAACTTCACCGAGGTCGCCCCT
CTCTTGTAGAGTTTGACTTCTACCGTGCACTTCATCTAATCTTCGCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCCGCAACAGATTACCAGGGTTCGTCGGCTGCTTGTACCAACATTGGGCGTCCGGTTCAGAGTGTCCGGCGGGATCAGCTTTACGCGATGGATGGTTCTCCGACGTC
TCAGGAGCAGGATCTCGACTGCTTTCAGAGGCAAGTTACCGAACGGTTTCTGGACCTTGCATCGGCTGGTTCCGATGATCTGCTTTCCCTATCGTGGGTTCAGAAGCTTC
TAAATTCCTTCCTATGCTGCCAGGAGGAGTTTAAAATCGTTCTCTTTAGCCACAAATCTCAAATCTCTAGACCCCCATTGGACCGCTTGGTTGCTGACTATTCGGAGCGT
AGCGTTAAAGCACTTGATGTGTGTAATGCGATTCGTGATGGAATTGAGCAGCTGAGGCAATGGCAGAAACTTTTGGAGATTGTTCTCAGTGCATTGGACAATTGTAATCA
CCAGAAGACTCTCGGGGAGGGTCAATTCCGCCGCGCCAAGAAGGCCCTCATCGATTTGGCTATTTGTATGCTTGATGAGAAAGATTCACATGCCTCTGCCCTTGCCCACC
GCAACCGTTCTTTTGGACGCAATAATGCCTCGAAAGATCACCGGTCCTTGGGCCACTTTCGATCGCTGTCTTGGAGCGTTTCTCGATCGTGGTCAGCCGCGAGGCAGCTG
CAATCAATTGGAAATAATTTGGCGGCCCCTAAAGCGACTGAGCTTCTGGCTACTAATGGCCTTGCAGTTCCTATTTTTACCATGAACATGGTGTTATTGTTCGTAATGTG
GGCGCTGGTGGCCGCTATTCCTTGTCAGGACCGGGGTTTGCAGGTTCATTTCTCTTTGCCCCGGAATTTTCCGTGGGCGGCTCCAATCCTTCAACTGCATGATCGAATTG
TGGAGGAGTCCAAGAAGCGAGATCGAAGAAATTCCTGTGGGCTGTTGAAGGAGATTCAACAGATTGAAAAGTGCACGCGTCTCATGAGTGATTTGGCCGATGCAGCTCAG
TTCCCATTGGCAGAGGATAAAGAAGCGGAGCTGAGACAGAGAGTGCAAGAGCTATCAACGGTTTGTAATACCCTGAGGACTGGATTGGACTCATTGGAGCGGCAGGATGG
AAGAATCTATCCTGCACCACTATGGTATGTGGCTACGATGAAGGATATGGTGGTCCAGGTTCTCAACTCTTTGAGGTGCCAAGTCATGTTCCTTGTCAGTGAACATTCAA
AATTGTTACTGACAGAATTGAAGGATCTGCTACGTTCGTGGATGCTGACCCTTGACGGAAAGCAGTTGACAACTTGTGCAAGTCATAAAGTCACTCCACACACTGCGTCG
AGACTTGGAGCTTTCAGTCGCTTTGTTCGTCATCAACAATCGAGTACAACCCTCATGGACAGATCACTGCTCCTTATCTCTTCAATCACAAAAAACTCTCTTCCCATTCA
GATGCTTTGTGTTGCTGCCTCTGCCTTGCTTTGCCCTCACCCTCATCACCTCCCTCTCTCTCTCTCTCTCTCTCTCTCTCTCATTACCGAACTTCACCGAGGTCGCCCCT
CTCTTGTAGAGTTTGACTTCTACCGTGCACTTCATCTAATCTTCGCCTGA
Protein sequenceShow/hide protein sequence
MPATDYQGSSAACTNIGRPVQSVRRDQLYAMDGSPTSQEQDLDCFQRQVTERFLDLASAGSDDLLSLSWVQKLLNSFLCCQEEFKIVLFSHKSQISRPPLDRLVADYSER
SVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHASALAHRNRSFGRNNASKDHRSLGHFRSLSWSVSRSWSAARQL
QSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKRDRRNSCGLLKEIQQIEKCTRLMSDLADAAQ
FPLAEDKEAELRQRVQELSTVCNTLRTGLDSLERQDGRIYPAPLWYVATMKDMVVQVLNSLRCQVMFLVSEHSKLLLTELKDLLRSWMLTLDGKQLTTCASHKVTPHTAS
RLGAFSRFVRHQQSSTTLMDRSLLLISSITKNSLPIQMLCVAASALLCPHPHHLPLSLSLSLSLITELHRGRPSLVEFDFYRALHLIFA