CuGenDBv2

ClCG08G007060 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG08G007060
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: UPF0496 protein 4
Genome location: CG_Chr08:19462712..19465803
RNA-Seq Expression: ClCG08G007060
Synteny: ClCG08G007060
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR008511 - Protein BYPASS-related


Homology
GenBank top hits | e value | % identity (alignment shown below each hit)
KAG6578881.1 Protein ROH1, partial [Cucurbita argyrosperma subsp. sororia] | 1.3e-205 | 92.58
Query:  MPATDYQGSSAAFTNIGRPVQSIRRDQLYAMDGSPTSHEQDLDSFQRQVTDRFMDLASVGPEELLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQISRPPL
        MPATDYQGSSAA TNIGR VQSIRRDQLYAMDGSPTS E DLDSFQRQV +RFMDLASVGP+ELLSLSWV KLL+SFLCCQEEFK VL++HKSQISRPPL
Subjt:  MPATDYQGSSAAFTNIGRPVQSIRRDQLYAMDGSPTSHEQDLDSFQRQVTDRFMDLASVGPEELLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
        DRLVADYSER+VKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSH SA+AHRNRSFGRNNASKDPRSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR
        GHFRSLSWSVSRSWSAARQLQSIGNNL APKATELL TNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSL RNFPWA PILQLHDRIVEESKKR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR

Query:  DRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQGGRIYPATFWSVATRRDMVVQVLHFE
        DRRNSCGLLKEI+QIEKCTRLMNDLAD+AQFPLAEEKEAELR+RVQELTTVC+TLRTGLDSLERQGGR YPA  WSVAT  DMVVQ+LHFE
Subjt:  DRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQGGRIYPATFWSVATRRDMVVQVLHFE

KAG7016412.1 hypothetical protein SDJN02_21521, partial [Cucurbita argyrosperma subsp. argyrosperma] | 1.8e-194 | 76.05
Query:  MPATDYQGSSAAFTNIGRPVQSIRRDQLYAMDGSPTSHEQDLDSFQRQVTDRFMDLASVGPEELLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQISRPPL
        MPATDYQGSSAA TNIGR VQSIRRDQLYAMDGSPTS E DLDSFQRQV +RFMDLASVGP+ELLSLSWV KLL+SFLCCQEEFK VL++HKSQISRPPL
Subjt:  MPATDYQGSSAAFTNIGRPVQSIRRDQLYAMDGSPTSHEQDLDSFQRQVTDRFMDLASVGPEELLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
        DRLVADYSER+VKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSH SA+AHRNRSFGRNNASKDPRSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR
        GHFRSLSWSVSRSWSAARQLQSIGNNL APKATELL TNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSL RNFPWA PILQLHDRIVEESKKR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR

Query:  DRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQ-----------------------------------
        DRRNSCGLLKEI+QIEKCTRLMNDLAD+AQFPLAEEKEAELR+RVQELTTVC+TLRTGLDSLERQ                                   
Subjt:  DRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQ-----------------------------------

Query:  --------------------------------------------------GGRIYPATFWSVATRRDMVVQVLHFE
                                                          GGR YPA  WSVAT  DMVVQ+LHFE
Subjt:  --------------------------------------------------GGRIYPATFWSVATRRDMVVQVLHFE

XP_004140965.1 uncharacterized protein LOC101202838 [Cucumis sativus] | 1.9e-196 | 94.59
Query:  MPATDYQGSSAAFTNIGRPVQSIRRDQLYAMDGSPTSHEQDLDSFQRQVTDRFMDLASVGPEELLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQISRPPL
        MPATDYQGSSAAFTNIGRPVQ IRRDQLYAMDGSPTS EQDLDSFQRQV DRF+DLASVG ++LLSLSWVHKLLNSFL CQE+FKLVL+SHKSQISRPPL
Subjt:  MPATDYQGSSAAFTNIGRPVQSIRRDQLYAMDGSPTSHEQDLDSFQRQVTDRFMDLASVGPEELLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
        DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCN QKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR
        GHFRSLSWSVSRSWSAARQLQSIGNNL APKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWA+ ILQLHDRIVEESKKR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR

Query:  DRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQGGRIY
        DRRNSCGLLKEINQIEKC RLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQ   ++
Subjt:  DRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQGGRIY

XP_008441509.1 PREDICTED: UPF0496 protein 4 [Cucumis melo] | 5.9e-198 | 94.59
Query:  MPATDYQGSSAAFTNIGRPVQSIRRDQLYAMDGSPTSHEQDLDSFQRQVTDRFMDLASVGPEELLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQISRPPL
        MPATDYQGSSAAFTNIGRPVQ IRRDQLYAMDGSPTS EQDLD FQ+QV DRF+DLASVGP++LLSLSWVHKLLNSFL CQE+FKLVL+SHKSQISRPPL
Subjt:  MPATDYQGSSAAFTNIGRPVQSIRRDQLYAMDGSPTSHEQDLDSFQRQVTDRFMDLASVGPEELLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
        DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCN+QKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR
        GHFRSLSWSVSRSWSAARQLQSIGNNL APKATELLTTNGLAVP+FTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWA+PILQLHDRIVEESKKR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR

Query:  DRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQGGRIY
        DRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQ   ++
Subjt:  DRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQGGRIY

XP_038885144.1 UPF0496 protein 4-like [Benincasa hispida] | 3.5e-198 | 96.7
Query:  MPATDYQGSSAAFTNIGRPVQSIRRDQLYAMDGSPTSHEQDLDSFQRQVTDRFMDLASVGPEELLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQISRPPL
        MPATDYQGSSA+FTNIGRPVQSIRRDQLYAMDGSPTSHEQDLDSFQRQV DRF+DLASVGPEELLSLSWV KLLNSFLCCQEEFKLVL+SHKSQISRPPL
Subjt:  MPATDYQGSSAAFTNIGRPVQSIRRDQLYAMDGSPTSHEQDLDSFQRQVTDRFMDLASVGPEELLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
        DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQK LGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR
        GHFRSLSWSVSRSWSAARQLQSIGNNL APKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPR FPWAA ILQLHDRIVEESKKR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR

Query:  DRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLER
        DRRNSCGLLKEINQIEKCTRL+ND ADSAQFPLAEEKEAELRQRVQELTTVCDTL+TGLDSLER
Subjt:  DRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLER
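In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. As a minimal sketch, assuming that convention holds for this page, the identity of a single block could be tallied as below. The function name `midline_identity` is hypothetical and is not part of CuGenDB; the reported % identity covers the whole alignment, so a per-block figure will generally differ from it.

```python
def midline_identity(query_segment, midline):
    """Count identical positions in one BLAST alignment block.

    query_segment: residues from a 'Query:' line (label removed).
    midline: the unlabeled line beneath it (leading label padding removed).
    Returns (identical, aligned_length, percent_identity).
    """
    aligned = len(query_segment)
    # Scraped text often loses trailing blanks, so pad or trim the midline
    # to the query length before counting.
    midline = midline.ljust(aligned)[:aligned]
    # Letters are identities; '+' and blanks are not counted.
    identical = sum(1 for ch in midline if ch.isalpha())
    return identical, aligned, 100.0 * identical / aligned


# First 21 columns of the first GenBank hit (KAG6578881.1) shown above.
identical, aligned, pct = midline_identity(
    "MPATDYQGSSAAFTNIGRPVQ",
    "MPATDYQGSSAA TNIGR VQ",
)
print(identical, aligned, round(pct, 2))
```

Summing `identical` and `aligned` over every block of a hit before dividing would give a figure comparable to the % identity column.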

TrEMBL top hits | e value | % identity (alignment shown below each hit)
A0A0A0KED5 Uncharacterized protein | 9.2e-197 | 94.59
Query:  MPATDYQGSSAAFTNIGRPVQSIRRDQLYAMDGSPTSHEQDLDSFQRQVTDRFMDLASVGPEELLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQISRPPL
        MPATDYQGSSAAFTNIGRPVQ IRRDQLYAMDGSPTS EQDLDSFQRQV DRF+DLASVG ++LLSLSWVHKLLNSFL CQE+FKLVL+SHKSQISRPPL
Subjt:  MPATDYQGSSAAFTNIGRPVQSIRRDQLYAMDGSPTSHEQDLDSFQRQVTDRFMDLASVGPEELLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
        DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCN QKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR
        GHFRSLSWSVSRSWSAARQLQSIGNNL APKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWA+ ILQLHDRIVEESKKR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR

Query:  DRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQGGRIY
        DRRNSCGLLKEINQIEKC RLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQ   ++
Subjt:  DRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQGGRIY

A0A1S3B3M9 UPF0496 protein 4 | 2.8e-198 | 94.59
Query:  MPATDYQGSSAAFTNIGRPVQSIRRDQLYAMDGSPTSHEQDLDSFQRQVTDRFMDLASVGPEELLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQISRPPL
        MPATDYQGSSAAFTNIGRPVQ IRRDQLYAMDGSPTS EQDLD FQ+QV DRF+DLASVGP++LLSLSWVHKLLNSFL CQE+FKLVL+SHKSQISRPPL
Subjt:  MPATDYQGSSAAFTNIGRPVQSIRRDQLYAMDGSPTSHEQDLDSFQRQVTDRFMDLASVGPEELLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
        DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCN+QKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR
        GHFRSLSWSVSRSWSAARQLQSIGNNL APKATELLTTNGLAVP+FTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWA+PILQLHDRIVEESKKR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR

Query:  DRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQGGRIY
        DRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQ   ++
Subjt:  DRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQGGRIY

A0A5A7UI93 UPF0496 protein 4 | 2.8e-198 | 94.59
Query:  MPATDYQGSSAAFTNIGRPVQSIRRDQLYAMDGSPTSHEQDLDSFQRQVTDRFMDLASVGPEELLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQISRPPL
        MPATDYQGSSAAFTNIGRPVQ IRRDQLYAMDGSPTS EQDLD FQ+QV DRF+DLASVGP++LLSLSWVHKLLNSFL CQE+FKLVL+SHKSQISRPPL
Subjt:  MPATDYQGSSAAFTNIGRPVQSIRRDQLYAMDGSPTSHEQDLDSFQRQVTDRFMDLASVGPEELLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
        DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCN+QKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR
        GHFRSLSWSVSRSWSAARQLQSIGNNL APKATELLTTNGLAVP+FTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWA+PILQLHDRIVEESKKR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR

Query:  DRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQGGRIY
        DRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQ   ++
Subjt:  DRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQGGRIY

A0A6J1FFL7 uncharacterized protein LOC111445285 isoform X1 | 9.5e-194 | 88.24
Query:  MPATDYQGSSAAFTNIGRPVQSIRRDQLYAMDGSPTSHEQDLDSFQRQVTDRFMDLASVGPEELLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQISRPPL
        MPATDYQGSSAA TNIGR VQSIRRDQLYAMDGSPTS EQDLDSFQRQV +RFMDLASVGP+ELLSLSWV KLL+SFLCCQEEFK VL++HKSQIS+PPL
Subjt:  MPATDYQGSSAAFTNIGRPVQSIRRDQLYAMDGSPTSHEQDLDSFQRQVTDRFMDLASVGPEELLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
        DRLVADYSER+VKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKAL+DLAICMLDEKDSH SA+AHRNRSFGRNNASKDPRSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR
        GHFRSLSWSVSRSWSAARQLQSIGNNL APKATELL TNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSL RNFPWA PILQLHDRIVEESKKR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR

Query:  DRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQGGRIYPATFWSVATRRDMVVQVLHFE
        DRRNSCGLLKEI+QIEKCTRLMNDLAD+AQFPLAEEKEAELR+RVQELTTVC+TLRTGLDSLERQ   ++     S     D + +  H E
Subjt:  DRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQGGRIYPATFWSVATRRDMVVQVLHFE

A0A6J1JWZ2 uncharacterized protein LOC111489091 isoform X1 | 1.6e-193 | 92.43
Query:  MPATDYQGSSAAFTNIGRPVQSIRRDQLYAMDGSPTSHEQDLDSFQRQVTDRFMDLASVGPEELLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQISRPPL
        MPATDYQGSSAA TNIGR VQSIRRDQLYAMDGSPTS EQDLDSFQRQV +RFMDLASVGP+ELLSLSWV KLL+SFLCCQEEFK VL++HKSQIS+PPL
Subjt:  MPATDYQGSSAAFTNIGRPVQSIRRDQLYAMDGSPTSHEQDLDSFQRQVTDRFMDLASVGPEELLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
        DRLVADYSER+VKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSH SA+AHRNRSFGRNNASKDPRSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR
        GHFRSLSWSVSRSWSAARQLQSIGNNL APKATELL TNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSL RNFPWA PILQLHDRIVEESKKR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKR

Query:  DRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQGGRIY
        DRRNSCGLLKEI+QIEKCTRLMNDLAD+AQFPLAEEKEAELR+RVQELTTVC+TLRTGLDSLERQ   ++
Subjt:  DRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQGGRIY

SwissProt top hits | e value | % identity (alignment shown below each hit)
A2Z9A6 UPF0496 protein 4 | 5.8e-07 | 22.82
Query:  MDLASVGPE---ELLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQISRPPL---DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNH
        + L  + PE   ++L+LSW+   ++    C  E    + +  + +  P     D+ V  Y   SVK LD+C A+   + +L Q Q LL+  L  L   + 
Subjt:  MDLASVGPE---ELLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQISRPPL---DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNH

Query:  QKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFT
             + Q +RA+ +L                      R +      + PR +              S +  LQ +  NL   K    +    L   ++ 
Subjt:  QKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFT

Query:  MNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKRDRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRV
        +  V +FV    VA +    + L V   +P  F W+     LH  + EE  ++    S   +KE+ ++E C + ++ LA ++Q    EE+ A L   V
Subjt:  MNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKRDRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRV

Q337C0 UPF0496 protein 4 | 5.8e-07 | 22.48
Query:  MDLASVGPE---ELLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQISRPPL---DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNH
        + L  + PE   ++L+LSW+   ++    C  E    + +  + +  P     D+ V  Y   SVK LD+C A+   + +L Q Q LL+  L  L   + 
Subjt:  MDLASVGPE---ELLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQISRPPL---DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNH

Query:  QKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFT
             + Q +RA+ +L +                                        +    +R  S +  LQ +  NL   K         L   ++ 
Subjt:  QKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFT

Query:  MNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKRDRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRV
        +  V +FV    VA +    + L V   +P  F W+     LH  + EE  ++    S   +KE+ ++E C R ++ LA ++Q    EE+ A L   V
Subjt:  MNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKRDRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRV

Q9CAK4 Protein ROH1 | 7.5e-63 | 34.6
Query:  PATDYQGSSAAFTNIGRPVQSIRRDQLYAMDGSPTSHEQDLDSFQRQVTDRFMDLAS----------------VGPEELLSLSWVHKLLNSFLCCQEEFK
        PA D QGS      +GR   SIRR+Q   +D +    ++DL+ FQ+ + DRF +L S                   E+++S++W+ KL++ FLCC+ EFK
Subjt:  PATDYQGSSAAFTNIGRPVQSIRRDQLYAMDGSPTSHEQDLDSFQRQVTDRFMDLAS----------------VGPEELLSLSWVHKLLNSFLCCQEEFK

Query:  LVLLSHK--SQISRPPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICM-LDEKDSHT----
         +LL  +  +QIS+PP DRLV +  +RS+KALD+C A+ +GI+ +R +Q+L EI ++AL+    Q+ LG+G  RRAK+AL +L + + L++K++ +    
Subjt:  LVLLSHK--SQISRPPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICM-LDEKDSHT----

Query:  -----SALAHRNRSFGRNN-----ASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDR
             +    R+ SFGR +     ASK   ++G  +S SW+V R+WSAA+Q+ ++  NL  P+  E     GL  P+F M+ V++FVMW L AA+PCQ+R
Subjt:  -----SALAHRNRSFGRNN-----ASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDR

Query:  -GLQVHFSL-PRNFPWAAPILQLHDRIVEESKKRDRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQG
         GL  H  + P++  WA  ++ +H++I +E KK++++ S GL++E+ ++EK    + + AD   +P  ++       +V E+  +C  +   L  L++Q 
Subjt:  -GLQVHFSL-PRNFPWAAPILQLHDRIVEESKKRDRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQG

Query:  GRIYPATFWSVATRRDMVVQVL
          +    F  +   R  +++VL
Subjt:  GRIYPATFWSVATRRDMVVQVL

Arabidopsis top hits | e value | % identity (alignment shown below each hit)
AT1G18740.1 Protein of unknown function (DUF793) | 4.9e-126 | 60.69
Query:  MPATDYQGSSAAFTNIGRPVQSIRRDQL---YAMDGSPTSHEQ-----DLDSFQRQVTDRFMDLASVGPEELLSLSWVHKLLNSFLCCQEEFKLVLLSHK
        MPATD+QGS       GR + S+RRDQ+     + GS + HE      +LDSFQRQV ++F+DL +    +LLSL W+ KLL+SFLCCQEEF+ ++ +H+
Subjt:  MPATDYQGSSAAFTNIGRPVQSIRRDQL---YAMDGSPTSHEQ-----DLDSFQRQVTDRFMDLASVGPEELLSLSWVHKLLNSFLCCQEEFKLVLLSHK

Query:  SQISRPPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSA-LAHRNRSFGRN
        SQIS+ P+DRL++DY ERS+KALDVCNAIRDGIEQ+RQW+KL +IV+SALD+    + +GEGQ RRAKKALIDLAI MLDEKD  +   LAHRNRSFGR 
Subjt:  SQISRPPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSA-LAHRNRSFGRN

Query:  NASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHD
          S   RS+GHFRSLSWSVSRSWSA++QLQ++ +NL  P+  +++ +NGLAVP++TM  VLLFVMW LVAAIPCQDRGLQV+F +PR+F WAAP++ LHD
Subjt:  NASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHD

Query:  RIVEESKKRDRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQGGRIY
        +IVEESK+RDR+N CGLLKEI++IEK +RLMN+L DS  FPL ++KE E++QRV EL  V + LR GLD  ER+   ++
Subjt:  RIVEESKKRDRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQGGRIY

AT1G43630.1 Protein of unknown function (DUF793) | 9.3e-117 | 57.53
Query:  MPATDYQGSSAAFTNIGRPVQSIRRDQLYAMD----GSPTSHEQDLDSFQRQVTDRFMDL-ASVGPEELLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQI
        MP T+Y        + GR   S+RRDQ + MD      P + E +LDSFQRQV ++F+DL AS    E+LSL W+ KLL+SFLCCQE+F++++ +HK Q+
Subjt:  MPATDYQGSSAAFTNIGRPVQSIRRDQLYAMD----GSPTSHEQDLDSFQRQVTDRFMDL-ASVGPEELLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQI

Query:  SRPPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASK
         + P+DRL+ +Y ERSVKALDVCNAIRDGIEQ+RQWQKL+EIV+SALD   +Q+ LGEG+  RAKKALIDLAI MLDEKDS  +   HRNRSF RN    
Subjt:  SRPPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASK

Query:  DPRSLGHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVE
          + +G+ RSLSWSVSRSWSA+RQLQ IGNNL  P+A++++ TNGLA+ ++TM  +LLFV W LVAAIPCQDRGL VHF  PR+F WA P++ LHD+I++
Subjt:  DPRSLGHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVE

Query:  ESKKRD-RRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEK-EAELRQRVQELTTVCDTLRTGLDSLERQ
        ESKKRD ++  CGLL+EINQIE+ +R+++DL DS  F L +EK   E+++RVQEL  VC+ ++ GLD  +R+
Subjt:  ESKKRD-RRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEK-EAELRQRVQELTTVCDTLRTGLDSLERQ

AT1G63930.1 from the Czech 'roh' meaning 'corner' | 5.3e-64 | 34.6
Query:  PATDYQGSSAAFTNIGRPVQSIRRDQLYAMDGSPTSHEQDLDSFQRQVTDRFMDLAS----------------VGPEELLSLSWVHKLLNSFLCCQEEFK
        PA D QGS      +GR   SIRR+Q   +D +    ++DL+ FQ+ + DRF +L S                   E+++S++W+ KL++ FLCC+ EFK
Subjt:  PATDYQGSSAAFTNIGRPVQSIRRDQLYAMDGSPTSHEQDLDSFQRQVTDRFMDLAS----------------VGPEELLSLSWVHKLLNSFLCCQEEFK

Query:  LVLLSHK--SQISRPPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICM-LDEKDSHT----
         +LL  +  +QIS+PP DRLV +  +RS+KALD+C A+ +GI+ +R +Q+L EI ++AL+    Q+ LG+G  RRAK+AL +L + + L++K++ +    
Subjt:  LVLLSHK--SQISRPPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICM-LDEKDSHT----

Query:  -----SALAHRNRSFGRNN-----ASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDR
             +    R+ SFGR +     ASK   ++G  +S SW+V R+WSAA+Q+ ++  NL  P+  E     GL  P+F M+ V++FVMW L AA+PCQ+R
Subjt:  -----SALAHRNRSFGRNN-----ASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDR

Query:  -GLQVHFSL-PRNFPWAAPILQLHDRIVEESKKRDRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQG
         GL  H  + P++  WA  ++ +H++I +E KK++++ S GL++E+ ++EK    + + AD   +P  ++       +V E+  +C  +   L  L++Q 
Subjt:  -GLQVHFSL-PRNFPWAAPILQLHDRIVEESKKRDRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQG

Query:  GRIYPATFWSVATRRDMVVQVL
          +    F  +   R  +++VL
Subjt:  GRIYPATFWSVATRRDMVVQVL

AT1G74450.1 Protein of unknown function (DUF793) | 4.4e-127 | 59.42
Query:  MPATDYQGSSAAFTNIGRPVQSIRRD------QLYAMDGSPTSHEQDLDSFQRQVTDRFMDLASVGPEELLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQ
        MPAT+YQ S       GR   ++RRD      +   +    T  E +L SFQR+V +RF+DL +   E+LLSL WV KLL+SFL CQEEF+ ++++H+S 
Subjt:  MPATDYQGSSAAFTNIGRPVQSIRRD------QLYAMDGSPTSHEQDLDSFQRQVTDRFMDLASVGPEELLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQ

Query:  ISRPPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDN----CNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALA--HRNRSF
        I++PP+DRLV+DY ERSVKALDVCNAIRDG+EQ+RQWQKL+EIV+ A +N     + ++ LGEGQFRRA+K LI+LAI MLDEKDS +S+++  HRNRSF
Subjt:  ISRPPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDN----CNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALA--HRNRSF

Query:  GRNNASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQ
        GRN      R++GHFRSLSWSVSRSWSA++QLQ+IGNNL  P+A+++  TNGL VP++TM  VLLFVMWALVAAIPCQDRGLQVHF++PRN+ W   ++ 
Subjt:  GRNNASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQ

Query:  LHDRIVEESKKRDRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQGGRIY
        LHDRI+EESKKR+R+N+CGLLKEI+Q EK +RLMN+L DS QFPL+EEKE E+R+RV+EL  + + L+ GLD  ER+   ++
Subjt:  LHDRIVEESKKRDRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQGGRIY

AT4G11300.1 Protein of unknown function (DUF793) | 5.0e-54 | 34.91
Query:  PATDYQGSSAAFTNIGRPVQSIRRDQLYAMDGSPTSHEQDLDSFQRQVTDRFMDL--ASVGPEE--LLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQISR
        PAT++Q S  +        +  RR+Q+ +M+ +    +++L+ FQ+ V +RF +L   S  PE   +LS+ W+ KLL+ F+  + EF  VL S+ SQIS+
Subjt:  PATDYQGSSAAFTNIGRPVQSIRRDQLYAMDGSPTSHEQDLDSFQRQVTDRFMDL--ASVGPEE--LLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQISR

Query:  PPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNR-------SFGR
        PPLD+LV +  +R VKALD+C A+ +G++ +RQ Q+  EI ++AL     Q  L +G  RRAK+AL  L   +  +K+S +S      R       SFGR
Subjt:  PPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNR-------SFGR

Query:  NNASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQ-DRGLQVHFSLPRNFPWAAPILQL
         +      S G     +  VS++WSAA+Q+Q++  NLVAP+        G A P++ M+ V++ VMW LV A+PCQ   GL VH  LP+N  WA   + +
Subjt:  NNASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQ-DRGLQVHFSLPRNFPWAAPILQL

Query:  HDRIVEESKKRDRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQGGRIY
         +R+ EE K+++ R   GL++E+ ++E+    + + ++  +F   E+  AE    V E+  +C  +  GL+ L+R+   ++
Subjt:  HDRIVEESKKRDRRNSCGLLKEINQIEKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQGGRIY


Sequences
CDS sequence
ATGCCAGCTACAGATTATCAGGGTTCGTCGGCTGCTTTTACCAACATCGGCCGTCCAGTTCAGAGTATTCGACGAGATCAGCTTTACGCTATGGATGGTTCTCCC
ACGTCTCACGAGCAGGATCTTGATTCCTTTCAGAGGCAAGTTACTGACCGTTTCATGGACCTTGCATCGGTTGGTCCTGAGGAGTTGCTTTCCCTATCATGGGTT
CATAAGCTTTTGAATTCCTTCTTGTGCTGTCAGGAAGAGTTTAAACTCGTTCTCCTTAGCCACAAATCTCAAATCTCTAGACCCCCTTTGGACCGTTTGGTTGCT
GATTATTCGGAGAGAAGCGTTAAGGCGCTTGATGTGTGTAATGCGATTCGTGATGGGATTGAGCAACTCCGGCAATGGCAGAAGCTTTTGGAGATTGTTCTTAGT
GCGTTGGATAATTGTAATCATCAGAAGACTCTTGGAGAGGGTCAATTTCGTCGCGCCAAGAAGGCTCTTATTGATTTGGCTATTTGTATGTTGGATGAGAAAGAT
TCGCATACCTCTGCTCTTGCTCACCGCAACCGTTCTTTCGGACGGAACAATGCCTCGAAGGATCCACGGTCATTGGGCCACTTTCGGTCGCTCTCGTGGAGCGTT
TCACGGTCGTGGTCGGCCGCGAGGCAGCTGCAATCGATTGGTAACAATTTAGTGGCCCCTAAAGCGACTGAGCTTTTGACTACCAATGGGCTTGCAGTTCCTATC
TTTACCATGAACATGGTGTTATTGTTTGTAATGTGGGCGCTTGTGGCGGCTATTCCTTGCCAGGATCGTGGCTTGCAGGTTCATTTCTCTTTGCCTCGAAATTTC
CCATGGGCAGCTCCAATCCTTCAACTACATGATCGAATTGTGGAGGAGTCCAAGAAGCGAGATCGAAGAAATTCCTGTGGGCTGTTGAAGGAGATTAATCAGATT
GAAAAGTGCACGCGTCTCATGAATGATTTGGCAGATTCAGCTCAGTTCCCATTGGCAGAGGAGAAAGAAGCAGAGCTGAGACAGAGAGTACAAGAGCTAACCACA
GTTTGTGATACTCTTAGGACTGGATTGGACTCGTTGGAGCGGCAGGGTGGGAGAATCTATCCTGCAACATTCTGGTCTGTGGCTACAAGGAGGGATATGGTGGTC
CAGGTTTTACACTTTGAGAGATCTGATCTTGATTTGGTCTTCTACAGGATCTGCAAAGTTCGAGGAATGCTGACCCTTGAATGA
mRNA sequence
ATGCCAGCTACAGATTATCAGGGTTCGTCGGCTGCTTTTACCAACATCGGCCGTCCAGTTCAGAGTATTCGACGAGATCAGCTTTACGCTATGGATGGTTCTCCC
ACGTCTCACGAGCAGGATCTTGATTCCTTTCAGAGGCAAGTTACTGACCGTTTCATGGACCTTGCATCGGTTGGTCCTGAGGAGTTGCTTTCCCTATCATGGGTT
CATAAGCTTTTGAATTCCTTCTTGTGCTGTCAGGAAGAGTTTAAACTCGTTCTCCTTAGCCACAAATCTCAAATCTCTAGACCCCCTTTGGACCGTTTGGTTGCT
GATTATTCGGAGAGAAGCGTTAAGGCGCTTGATGTGTGTAATGCGATTCGTGATGGGATTGAGCAACTCCGGCAATGGCAGAAGCTTTTGGAGATTGTTCTTAGT
GCGTTGGATAATTGTAATCATCAGAAGACTCTTGGAGAGGGTCAATTTCGTCGCGCCAAGAAGGCTCTTATTGATTTGGCTATTTGTATGTTGGATGAGAAAGAT
TCGCATACCTCTGCTCTTGCTCACCGCAACCGTTCTTTCGGACGGAACAATGCCTCGAAGGATCCACGGTCATTGGGCCACTTTCGGTCGCTCTCGTGGAGCGTT
TCACGGTCGTGGTCGGCCGCGAGGCAGCTGCAATCGATTGGTAACAATTTAGTGGCCCCTAAAGCGACTGAGCTTTTGACTACCAATGGGCTTGCAGTTCCTATC
TTTACCATGAACATGGTGTTATTGTTTGTAATGTGGGCGCTTGTGGCGGCTATTCCTTGCCAGGATCGTGGCTTGCAGGTTCATTTCTCTTTGCCTCGAAATTTC
CCATGGGCAGCTCCAATCCTTCAACTACATGATCGAATTGTGGAGGAGTCCAAGAAGCGAGATCGAAGAAATTCCTGTGGGCTGTTGAAGGAGATTAATCAGATT
GAAAAGTGCACGCGTCTCATGAATGATTTGGCAGATTCAGCTCAGTTCCCATTGGCAGAGGAGAAAGAAGCAGAGCTGAGACAGAGAGTACAAGAGCTAACCACA
GTTTGTGATACTCTTAGGACTGGATTGGACTCGTTGGAGCGGCAGGGTGGGAGAATCTATCCTGCAACATTCTGGTCTGTGGCTACAAGGAGGGATATGGTGGTC
CAGGTTTTACACTTTGAGAGATCTGATCTTGATTTGGTCTTCTACAGGATCTGCAAAGTTCGAGGAATGCTGACCCTTGAATGAAAGAGAAAGAGTTTGACAACT
TGGTCAGCATAATGGTAAAGATTTATTCTTACATTTTGGTCAAGGTCTAAATTTGACTTGGAAAAATGATCCTAACCAACGTATTTGGACTTTATCCTCTTCGAT
TTGAAGGGTGATATCCCTGCCGGGTTTGACTGTCTCCTTTCTCCACAACACCTTTTTTTACAGTGCTAATCATAAAGTTACTCCATACTTTGCGCTGAGAGTCTG
AGACTTGGAGTTTCCAGTCGCTTTGTTTGTCATCAAGGATGAAGGTAACTACTTGTCTGCACACTACAGTTGGTAGTTGAAATTCATGTTTTTTGGTTGTTGGGA
TCATGGAAGGCTGATTTTCCATTCCTTGTTTGGTTCTTCTTTGCTTTAGTCATGATGATTCTCTGTTTTATTTGGAGCTTCCATTTCTTTTCAGTCTCCCAAAAG
CTCTCTTCCCATTCAAATGTTTTGTGTTTCTATCTCTCTGCCTTGCTTTGCTTTATGTTTCTAGGATGTATGTGTGTACTGTATCTATTCATCTGAGGGTGTTTG
GCCCACAACTCGAG
Protein sequence
MPATDYQGSSAAFTNIGRPVQSIRRDQLYAMDGSPTSHEQDLDSFQRQVTDRFMDLASVGPEELLSLSWVHKLLNSFLCCQEEFKLVLLSHKSQISRPPLDRLVA
DYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSLGHFRSLSWSV
SRSWSAARQLQSIGNNLVAPKATELLTTNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESKKRDRRNSCGLLKEINQI
EKCTRLMNDLADSAQFPLAEEKEAELRQRVQELTTVCDTLRTGLDSLERQGGRIYPATFWSVATRRDMVVQVLHFERSDLDLVFYRICKVRGMLTLE
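
The CDS and protein sequences listed above are related by translation under the standard genetic code. The following is a minimal sketch of that translation, assuming the standard (NCBI table 1) code applies to this nuclear gene. The names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB. The two short inputs are the first and last codons of the CDS shown above.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove line breaks and spaces so a multi-line dump can be pasted as-is.
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Codons containing ambiguous bases (for example N) become 'X'.
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGCCAGCTACAGATTATCAGGGT"))  # start of the CDS
print(translate("ATGCTGACCCTTGAATGA"))        # end of the CDS, including the stop codon
```

Passing the full CDS text to `translate` should reproduce the protein sequence listed above, which makes for a quick consistency check between the two records.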