; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg008339 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg008339
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUPF0496 protein 4
Genome locationscaffold10:36733439..36736656
RNA-Seq ExpressionSpg008339
SyntenySpg008339
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR008511 - Protein BYPASS-related


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7016412.1 hypothetical protein SDJN02_21521, partial [Cucurbita argyrosperma subsp. argyrosperma]2.6e-19479.74Show/hide
Query:  MPATDYQGSSAS---------FVRRDQLYAMDGSPTSHEQDLDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPL
        MPATDYQGSSA+          +RRDQLYAMDGSPTS E DLD FQRQVA+RF+DLASVG D+LLSLSWVQKLL+SFLCCQEEFK VLI+ KSQISRPPL
Subjt:  MPATDYQGSSAS---------FVRRDQLYAMDGSPTSHEQDLDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
        DRLVADYSER+VKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSH SA+AHRNRSFGRNNASKDPRSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR
        GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSL RNFPWA PILQLHDRIVEES+KR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR

Query:  DRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLERQ-----------------------------------
        DRRNSCGLLKEIHQIEKCTRLMNDLAD AQFPLAEEKEAELR+RVQEL+TVC+TLRTGLDSLERQ                                   
Subjt:  DRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLERQ-----------------------------------

Query:  ------------GITLNRIDGDSEHVRLGFSWFLGIRERTEDENMKTRPIGWEN
                     I LNRI G SEHV L FSWFLGIRER EDENMK R  G  N
Subjt:  ------------GITLNRIDGDSEHVRLGFSWFLGIRERTEDENMKTRPIGWEN

XP_004140965.1 uncharacterized protein LOC101202838 [Cucumis sativus]5.8e-18692.6Show/hide
Query:  MPATDYQGSSASF---------VRRDQLYAMDGSPTSHEQDLDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPL
        MPATDYQGSSA+F         +RRDQLYAMDGSPTS EQDLD FQRQVADRFLDLASVG+DDLLSLSWV KLLNSFL CQE+FKLVLIS KSQISRPPL
Subjt:  MPATDYQGSSASF---------VRRDQLYAMDGSPTSHEQDLDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
        DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCN QKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR
        GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELL TNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWA+ ILQLHDRIVEES+KR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR

Query:  DRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLERQ
        DRRNSCGLLKEI+QIEKC RLMNDLAD+AQFPLAEEKEAELRQRVQEL+TVCDTLRTGLDSLERQ
Subjt:  DRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLERQ

XP_008441509.1 PREDICTED: UPF0496 protein 4 [Cucumis melo]1.2e-18692.6Show/hide
Query:  MPATDYQGSSASF---------VRRDQLYAMDGSPTSHEQDLDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPL
        MPATDYQGSSA+F         +RRDQLYAMDGSPTS EQDLD FQ+QVADRFLDLASVG DDLLSLSWV KLLNSFL CQE+FKLVLIS KSQISRPPL
Subjt:  MPATDYQGSSASF---------VRRDQLYAMDGSPTSHEQDLDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
        DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCN+QKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR
        GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELL TNGLAVP+FTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWA+PILQLHDRIVEES+KR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR

Query:  DRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLERQ
        DRRNSCGLLKEI+QIEKCTRLMNDLAD+AQFPLAEEKEAELRQRVQEL+TVCDTLRTGLDSLERQ
Subjt:  DRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLERQ

XP_022133831.1 uncharacterized protein LOC111006292 [Momordica charantia]8.1e-18889.47Show/hide
Query:  MPATDYQGSSASF---------VRRDQLYAMDGSPTSHEQDLDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPL
        MPATDYQGSSA+F         +RRDQL+AMDGSPTSHEQDLDGFQRQVADRFLDLASVG+DDLLSLSW+ KLLNSFL CQEEFK+VL+S KSQISRPPL
Subjt:  MPATDYQGSSASF---------VRRDQLYAMDGSPTSHEQDLDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
        DR+VADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQ+TLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNN SK+ RSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR
        GHFRSLSWSVSRSWSAARQLQSIGN+LAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPR+FPWAAPILQLHDRIVEES+KR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR

Query:  DRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLERQ------GITLNRIDG
        DRRNSCGLLKEIHQIEKCTRLMNDLADA QFPLAE+KEAELRQRVQELSTVCDTLRTGLDSLERQ       I  NR +G
Subjt:  DRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLERQ------GITLNRIDG

XP_038885144.1 UPF0496 protein 4-like [Benincasa hispida]5.8e-18692.31Show/hide
Query:  MPATDYQGSSASF---------VRRDQLYAMDGSPTSHEQDLDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPL
        MPATDYQGSSASF         +RRDQLYAMDGSPTSHEQDLD FQRQVADRFLDLASVG ++LLSLSWV KLLNSFLCCQEEFKLVLIS KSQISRPPL
Subjt:  MPATDYQGSSASF---------VRRDQLYAMDGSPTSHEQDLDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
        DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQK LGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR
        GHFRSLSWSVSRSWSAARQLQSIGNNL APKATELL TNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPR FPWAA ILQLHDRIVEES+KR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR

Query:  DRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLER
        DRRNSCGLLKEI+QIEKCTRL+ND AD+AQFPLAEEKEAELRQRVQEL+TVCDTL+TGLDSLER
Subjt:  DRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLER

TrEMBL top hitse value%identityAlignment
A0A0A0KED5 Uncharacterized protein2.8e-18692.6Show/hide
Query:  MPATDYQGSSASF---------VRRDQLYAMDGSPTSHEQDLDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPL
        MPATDYQGSSA+F         +RRDQLYAMDGSPTS EQDLD FQRQVADRFLDLASVG+DDLLSLSWV KLLNSFL CQE+FKLVLIS KSQISRPPL
Subjt:  MPATDYQGSSASF---------VRRDQLYAMDGSPTSHEQDLDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
        DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCN QKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR
        GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELL TNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWA+ ILQLHDRIVEES+KR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR

Query:  DRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLERQ
        DRRNSCGLLKEI+QIEKC RLMNDLAD+AQFPLAEEKEAELRQRVQEL+TVCDTLRTGLDSLERQ
Subjt:  DRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLERQ

A0A1S3B3M9 UPF0496 protein 45.7e-18792.6Show/hide
Query:  MPATDYQGSSASF---------VRRDQLYAMDGSPTSHEQDLDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPL
        MPATDYQGSSA+F         +RRDQLYAMDGSPTS EQDLD FQ+QVADRFLDLASVG DDLLSLSWV KLLNSFL CQE+FKLVLIS KSQISRPPL
Subjt:  MPATDYQGSSASF---------VRRDQLYAMDGSPTSHEQDLDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
        DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCN+QKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR
        GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELL TNGLAVP+FTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWA+PILQLHDRIVEES+KR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR

Query:  DRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLERQ
        DRRNSCGLLKEI+QIEKCTRLMNDLAD+AQFPLAEEKEAELRQRVQEL+TVCDTLRTGLDSLERQ
Subjt:  DRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLERQ

A0A5A7UI93 UPF0496 protein 45.7e-18792.6Show/hide
Query:  MPATDYQGSSASF---------VRRDQLYAMDGSPTSHEQDLDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPL
        MPATDYQGSSA+F         +RRDQLYAMDGSPTS EQDLD FQ+QVADRFLDLASVG DDLLSLSWV KLLNSFL CQE+FKLVLIS KSQISRPPL
Subjt:  MPATDYQGSSASF---------VRRDQLYAMDGSPTSHEQDLDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
        DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCN+QKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR
        GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELL TNGLAVP+FTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWA+PILQLHDRIVEES+KR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR

Query:  DRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLERQ
        DRRNSCGLLKEI+QIEKCTRLMNDLAD+AQFPLAEEKEAELRQRVQEL+TVCDTLRTGLDSLERQ
Subjt:  DRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLERQ

A0A6J1BW97 uncharacterized protein LOC1110062923.9e-18889.47Show/hide
Query:  MPATDYQGSSASF---------VRRDQLYAMDGSPTSHEQDLDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPL
        MPATDYQGSSA+F         +RRDQL+AMDGSPTSHEQDLDGFQRQVADRFLDLASVG+DDLLSLSW+ KLLNSFL CQEEFK+VL+S KSQISRPPL
Subjt:  MPATDYQGSSASF---------VRRDQLYAMDGSPTSHEQDLDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
        DR+VADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQ+TLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNN SK+ RSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR
        GHFRSLSWSVSRSWSAARQLQSIGN+LAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPR+FPWAAPILQLHDRIVEES+KR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR

Query:  DRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLERQ------GITLNRIDG
        DRRNSCGLLKEIHQIEKCTRLMNDLADA QFPLAE+KEAELRQRVQELSTVCDTLRTGLDSLERQ       I  NR +G
Subjt:  DRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLERQ------GITLNRIDG

A0A6J1JWZ2 uncharacterized protein LOC111489091 isoform X14.1e-18590.96Show/hide
Query:  MPATDYQGSSAS---------FVRRDQLYAMDGSPTSHEQDLDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPL
        MPATDYQGSSA+          +RRDQLYAMDGSPTS EQDLD FQRQVA+RF+DLASVG D+LLSLSWVQKLL+SFLCCQEEFK VLI+ KSQIS+PPL
Subjt:  MPATDYQGSSAS---------FVRRDQLYAMDGSPTSHEQDLDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL
        DRLVADYSER+VKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSH SA+AHRNRSFGRNNASKDPRSL
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR
        GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSL RNFPWA PILQLHDRIVEES+KR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR

Query:  DRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLERQ
        DRRNSCGLLKEIHQIEKCTRLMNDLAD AQFPLAEEKEAELR+RVQEL+TVC+TLRTGLDSLERQ
Subjt:  DRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLERQ

SwissProt top hitse value%identityAlignment
A2Z9A6 UPF0496 protein 45.9e-0822.8Show/hide
Query:  LDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPL---DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIV
        L  ++  +A     L    A D+L+LSW++  ++    C  E    + +  + +  P     D+ V  Y   SVK LD+C A+   + +L Q Q LL+  
Subjt:  LDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPL---DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIV

Query:  LSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLAT
        L  L   +      + Q +RA+ +L                      R +      + PR +              S +  LQ +  NL+  K    +  
Subjt:  LSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLAT

Query:  NGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKRDRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKE
          L   ++ +  V +FV    VA +    + L V   +P  F W+     LH  + EE  ++    S   +KE+ ++E C + ++ LA  +Q    EE+ 
Subjt:  NGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKRDRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKE

Query:  AELRQRV
        A L   V
Subjt:  AELRQRV

Q337C0 UPF0496 protein 45.9e-0822.48Show/hide
Query:  LDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPL---DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIV
        L  ++  +A     L    A D+L+LSW++  ++    C  E    + +  + +  P     D+ V  Y   SVK LD+C A+   + +L Q Q LL+  
Subjt:  LDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPL---DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIV

Query:  LSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLAT
        L  L   +      + Q +RA+ +L +                                        +    +R  S +  LQ +  NL+  K       
Subjt:  LSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLAT

Query:  NGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKRDRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKE
          L   ++ +  V +FV    VA +    + L V   +P  F W+     LH  + EE  ++    S   +KE+ ++E C R ++ LA  +Q    EE+ 
Subjt:  NGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKRDRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKE

Query:  AELRQRV
        A L   V
Subjt:  AELRQRV

Q9CAK4 Protein ROH12.7e-6135.46Show/hide
Query:  PATDYQGS--SASFVRRDQLYAMDGSPTSHEQDLDGFQRQVADRFLDL---------------ASVGA-DDLLSLSWVQKLLNSFLCCQEEFKLVLISQK
        PA D QGS      +RR+Q   +D +    ++DL+ FQ+ +ADRF +L               ASV A + ++S++W++KL++ FLCC+ EFK +L+  +
Subjt:  PATDYQGS--SASFVRRDQLYAMDGSPTSHEQDLDGFQRQVADRFLDL---------------ASVGA-DDLLSLSWVQKLLNSFLCCQEEFKLVLISQK

Query:  --SQISRPPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICM-LDEKDSHT---------SA
          +QIS+PP DRLV +  +RS+KALD+C A+ +GI+ +R +Q+L EI ++AL+    Q+ LG+G  RRAK+AL +L + + L++K++ +         + 
Subjt:  --SQISRPPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICM-LDEKDSHT---------SA

Query:  LAHRNRSFGRNN-----ASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDR-GLQVHF
           R+ SFGR +     ASK   ++G  +S SW+V R+WSAA+Q+ ++  NL  P+  E     GL  P+F M+ V++FVMW L AA+PCQ+R GL  H 
Subjt:  LAHRNRSFGRNN-----ASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDR-GLQVHF

Query:  SL-PRNFPWAAPILQLHDRIVEESRKRDRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLERQ
         + P++  WA  ++ +H++I +E +K++++ S GL++E+ ++EK    + + AD   +P  ++       +V E++ +C  +   L  L++Q
Subjt:  SL-PRNFPWAAPILQLHDRIVEESRKRDRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLERQ

Q9LMM6 Protein BPS1, chloroplastic4.0e-0420.31Show/hide
Query:  LDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSA
        L+ F+  +A     L      D+L++SW+++ + S        K ++   +  +S    D+ V  Y + SVK LD+CNA    + +L Q   LL+  L  
Subjt:  LDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSA

Query:  LDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGL
        L+  N  + L + Q               LD    H                SK+PR + + R++             L S+   L  PK         L
Subjt:  LDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGL

Query:  AVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKRDRRNSCGLLKEIHQIEK-CTRLMNDLADAAQFPLAEEKEAE
           ++ + +  L++     AA     + L ++ ++    PWA   +++ + +  E +     +   +LKE+  +     +L   +   +  P++ +    
Subjt:  AVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKRDRRNSCGLLKEIHQIEK-CTRLMNDLADAAQFPLAEEKEAE

Query:  LRQRVQELSTVCDTLRTGLD
        L+  V ELS   D +   +D
Subjt:  LRQRVQELSTVCDTLRTGLD

Arabidopsis top hitse value%identityAlignment
AT1G18740.1 Protein of unknown function (DUF793)1.1e-12361.96Show/hide
Query:  MPATDYQGS---SASFVRRDQL---YAMDGSPTSHEQ-----DLDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRP
        MPATD+QGS   S   +RRDQ+     + GS + HE      +LD FQRQVA++F+DL +  ++DLLSL W+ KLL+SFLCCQEEF+ ++ + +SQIS+ 
Subjt:  MPATDYQGS---SASFVRRDQL---YAMDGSPTSHEQ-----DLDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRP

Query:  PLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSA-LAHRNRSFGRNNASKDP
        P+DRL++DY ERS+KALDVCNAIRDGIEQ+RQW+KL +IV+SALD+    + +GEGQ RRAKKALIDLAI MLDEKD  +   LAHRNRSFGR   S   
Subjt:  PLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSA-LAHRNRSFGRNNASKDP

Query:  RSLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEES
        RS+GHFRSLSWSVSRSWSA++QLQ++ +NLA P+  +++A+NGLAVP++TM  VLLFVMW LVAAIPCQDRGLQV+F +PR+F WAAP++ LHD+IVEES
Subjt:  RSLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEES

Query:  RKRDRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLERQ
        ++RDR+N CGLLKEI +IEK +RLMN+L D+  FPL ++KE E++QRV EL  V + LR GLD  ER+
Subjt:  RKRDRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLERQ

AT1G43630.1 Protein of unknown function (DUF793)3.7e-11457.69Show/hide
Query:  MPATDYQ-GSSASFVRRDQLYAMD----GSPTSHEQDLDGFQRQVADRFLDL-ASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPLDRL
        MP T+Y  G S   +RRDQ + MD      P + E +LD FQRQVA++F+DL AS    ++LSL W+ KLL+SFLCCQE+F++++ + K Q+ + P+DRL
Subjt:  MPATDYQ-GSSASFVRRDQLYAMD----GSPTSHEQDLDGFQRQVADRFLDL-ASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPLDRL

Query:  VADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSLGHF
        + +Y ERSVKALDVCNAIRDGIEQ+RQWQKL+EIV+SALD   +Q+ LGEG+  RAKKALIDLAI MLDEKDS  +   HRNRSF RN      + +G+ 
Subjt:  VADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSLGHF

Query:  RSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKRD-R
        RSLSWSVSRSWSA+RQLQ IGNNLA P+A++++ATNGLA+ ++TM  +LLFV W LVAAIPCQDRGL VHF  PR+F WA P++ LHD+I++ES+KRD +
Subjt:  RSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKRD-R

Query:  RNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEK-EAELRQRVQELSTVCDTLRTGLDSLERQ
        +  CGLL+EI+QIE+ +R+++DL D+  F L +EK   E+++RVQEL  VC+ ++ GLD  +R+
Subjt:  RNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEK-EAELRQRVQELSTVCDTLRTGLDSLERQ

AT1G63930.1 from the Czech 'roh' meaning 'corner'1.9e-6235.46Show/hide
Query:  PATDYQGS--SASFVRRDQLYAMDGSPTSHEQDLDGFQRQVADRFLDL---------------ASVGA-DDLLSLSWVQKLLNSFLCCQEEFKLVLISQK
        PA D QGS      +RR+Q   +D +    ++DL+ FQ+ +ADRF +L               ASV A + ++S++W++KL++ FLCC+ EFK +L+  +
Subjt:  PATDYQGS--SASFVRRDQLYAMDGSPTSHEQDLDGFQRQVADRFLDL---------------ASVGA-DDLLSLSWVQKLLNSFLCCQEEFKLVLISQK

Query:  --SQISRPPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICM-LDEKDSHT---------SA
          +QIS+PP DRLV +  +RS+KALD+C A+ +GI+ +R +Q+L EI ++AL+    Q+ LG+G  RRAK+AL +L + + L++K++ +         + 
Subjt:  --SQISRPPLDRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICM-LDEKDSHT---------SA

Query:  LAHRNRSFGRNN-----ASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDR-GLQVHF
           R+ SFGR +     ASK   ++G  +S SW+V R+WSAA+Q+ ++  NL  P+  E     GL  P+F M+ V++FVMW L AA+PCQ+R GL  H 
Subjt:  LAHRNRSFGRNN-----ASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDR-GLQVHF

Query:  SL-PRNFPWAAPILQLHDRIVEESRKRDRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLERQ
         + P++  WA  ++ +H++I +E +K++++ S GL++E+ ++EK    + + AD   +P  ++       +V E++ +C  +   L  L++Q
Subjt:  SL-PRNFPWAAPILQLHDRIVEESRKRDRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLERQ

AT1G74450.1 Protein of unknown function (DUF793)8.5e-12761.19Show/hide
Query:  MPATDYQ---GSSASFVRRD------QLYAMDGSPTSHEQDLDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPL
        MPAT+YQ   G S   +RRD      +   +    T  E +L  FQR+VA+RF+DL +   +DLLSL WV KLL+SFL CQEEF+ ++I+ +S I++PP+
Subjt:  MPATDYQ---GSSASFVRRD------QLYAMDGSPTSHEQDLDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPL

Query:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDN----CNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALA--HRNRSFGRNNAS
        DRLV+DY ERSVKALDVCNAIRDG+EQ+RQWQKL+EIV+ A +N     + ++ LGEGQFRRA+K LI+LAI MLDEKDS +S+++  HRNRSFGRN   
Subjt:  DRLVADYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDN----CNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALA--HRNRSFGRNNAS

Query:  KDPRSLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIV
           R++GHFRSLSWSVSRSWSA++QLQ+IGNNLA P+A+++ ATNGL VP++TM  VLLFVMWALVAAIPCQDRGLQVHF++PRN+ W   ++ LHDRI+
Subjt:  KDPRSLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIV

Query:  EESRKRDRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLERQ
        EES+KR+R+N+CGLLKEIHQ EK +RLMN+L D+ QFPL+EEKE E+R+RV+EL  + + L+ GLD  ER+
Subjt:  EESRKRDRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLERQ

AT4G11300.1 Protein of unknown function (DUF793)9.6e-5435.6Show/hide
Query:  PATDYQGSSAS-FVRRDQLYAMDGSPTSHEQDLDGFQRQVADRFLDLA----SVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPLDRLVA
        PAT++Q S  S   RR+Q+ +M+ +    +++L+ FQ+ VA+RF +L     S  +  +LS+ W++KLL+ F+  + EF  VL S  SQIS+PPLD+LV 
Subjt:  PATDYQGSSAS-FVRRDQLYAMDGSPTSHEQDLDGFQRQVADRFLDLA----SVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPLDRLVA

Query:  DYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNR-------SFGRNNASKDPR
        +  +R VKALD+C A+ +G++ +RQ Q+  EI ++AL     Q  L +G  RRAK+AL  L   +  +K+S +S      R       SFGR +      
Subjt:  DYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNR-------SFGRNNASKDPR

Query:  SLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQ-DRGLQVHFSLPRNFPWAAPILQLHDRIVEES
        S G     +  VS++WSAA+Q+Q++  NL AP+        G A P++ M+ V++ VMW LV A+PCQ   GL VH  LP+N  WA   + + +R+ EE 
Subjt:  SLGHFRSLSWSVSRSWSAARQLQSIGNNLAAPKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQ-DRGLQVHFSLPRNFPWAAPILQLHDRIVEES

Query:  RKRDRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLERQ
        ++++ R   GL++E+ ++E+    + + ++  +F   E+  AE    V E+  +C  +  GL+ L+R+
Subjt:  RKRDRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEAELRQRVQELSTVCDTLRTGLDSLERQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAGCTACAGATTATCAGGGTTCGTCGGCTTCTTTTGTTCGGCGAGATCAGCTTTACGCCATGGATGGTTCTCCGACCTCTCACGAGCAGGATCTTGACGGATTTCA
GAGGCAAGTTGCTGACCGGTTCTTGGACCTTGCGTCGGTTGGTGCTGATGATTTGCTTTCCCTTTCTTGGGTTCAGAAGCTTTTGAATTCCTTCCTGTGCTGTCAGGAGG
AGTTTAAGCTCGTTCTCATCAGCCAAAAATCTCAAATCTCTAGACCCCCATTGGACCGATTGGTTGCTGATTATTCGGAGCGCAGCGTTAAGGCGCTTGATGTGTGTAAT
GCGATTCGTGATGGGATTGAGCAGCTCCGCCAATGGCAGAAGCTCTTGGAGATTGTTCTCAGTGCGTTGGACAATTGTAATCATCAGAAGACTCTTGGGGAGGGCCAATT
CCGTCGCGCCAAGAAGGCCCTCATCGATTTGGCTATTTGTATGCTTGATGAGAAGGATTCGCATACCTCTGCTCTTGCCCACCGCAACCGTTCCTTCGGACGGAACAATG
CGTCCAAGGATCCACGGTCCTTGGGCCACTTTCGATCGCTCTCGTGGAGCGTTTCGAGATCGTGGTCGGCTGCGAGGCAGCTGCAATCAATTGGTAATAATTTGGCGGCC
CCTAAGGCGACTGAGCTTTTGGCTACTAATGGGCTTGCAGTTCCTATTTTTACCATGAACATGGTGTTATTGTTTGTAATGTGGGCGCTGGTGGCGGCTATTCCTTGCCA
GGATAGGGGTTTGCAGGTTCATTTCTCGTTGCCCCGGAATTTCCCATGGGCCGCTCCAATCCTTCAACTGCACGATCGAATTGTGGAGGAGTCCAGGAAGCGAGATCGAA
GGAATTCCTGTGGGCTGTTGAAGGAGATTCATCAGATTGAAAAGTGCACGCGTCTCATGAATGATTTGGCCGATGCAGCTCAGTTCCCATTAGCAGAGGAGAAAGAAGCA
GAGCTGAGGCAGAGAGTGCAAGAGCTATCGACAGTTTGTGATACTCTGAGGACTGGATTGGACTCATTGGAGCGGCAGGGAATCACTCTGAATAGAATTGATGGAGATTC
AGAACATGTTCGACTTGGGTTTTCATGGTTTCTAGGAATAAGAGAAAGAACAGAGGATGAAAATATGAAGACTCGCCCCATAGGATGGGAGAATCTATCCTGCACCACTC
TGGTCTGTGGCTACAATGAAGGATATGGTGGTCCAGGATCTGCGAATGCTAGGCATAAAGTTACCACTCCATACGCTGCATTTAGAGTCGGAGACTTTGGAGCTTTCACT
CACTTTGTTCGTCATCAAGGCTCGAGGATCCCTTCTGATAAAGTTGACATTTTAGCGAATCCCACTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCAGCTACAGATTATCAGGGTTCGTCGGCTTCTTTTGTTCGGCGAGATCAGCTTTACGCCATGGATGGTTCTCCGACCTCTCACGAGCAGGATCTTGACGGATTTCA
GAGGCAAGTTGCTGACCGGTTCTTGGACCTTGCGTCGGTTGGTGCTGATGATTTGCTTTCCCTTTCTTGGGTTCAGAAGCTTTTGAATTCCTTCCTGTGCTGTCAGGAGG
AGTTTAAGCTCGTTCTCATCAGCCAAAAATCTCAAATCTCTAGACCCCCATTGGACCGATTGGTTGCTGATTATTCGGAGCGCAGCGTTAAGGCGCTTGATGTGTGTAAT
GCGATTCGTGATGGGATTGAGCAGCTCCGCCAATGGCAGAAGCTCTTGGAGATTGTTCTCAGTGCGTTGGACAATTGTAATCATCAGAAGACTCTTGGGGAGGGCCAATT
CCGTCGCGCCAAGAAGGCCCTCATCGATTTGGCTATTTGTATGCTTGATGAGAAGGATTCGCATACCTCTGCTCTTGCCCACCGCAACCGTTCCTTCGGACGGAACAATG
CGTCCAAGGATCCACGGTCCTTGGGCCACTTTCGATCGCTCTCGTGGAGCGTTTCGAGATCGTGGTCGGCTGCGAGGCAGCTGCAATCAATTGGTAATAATTTGGCGGCC
CCTAAGGCGACTGAGCTTTTGGCTACTAATGGGCTTGCAGTTCCTATTTTTACCATGAACATGGTGTTATTGTTTGTAATGTGGGCGCTGGTGGCGGCTATTCCTTGCCA
GGATAGGGGTTTGCAGGTTCATTTCTCGTTGCCCCGGAATTTCCCATGGGCCGCTCCAATCCTTCAACTGCACGATCGAATTGTGGAGGAGTCCAGGAAGCGAGATCGAA
GGAATTCCTGTGGGCTGTTGAAGGAGATTCATCAGATTGAAAAGTGCACGCGTCTCATGAATGATTTGGCCGATGCAGCTCAGTTCCCATTAGCAGAGGAGAAAGAAGCA
GAGCTGAGGCAGAGAGTGCAAGAGCTATCGACAGTTTGTGATACTCTGAGGACTGGATTGGACTCATTGGAGCGGCAGGGAATCACTCTGAATAGAATTGATGGAGATTC
AGAACATGTTCGACTTGGGTTTTCATGGTTTCTAGGAATAAGAGAAAGAACAGAGGATGAAAATATGAAGACTCGCCCCATAGGATGGGAGAATCTATCCTGCACCACTC
TGGTCTGTGGCTACAATGAAGGATATGGTGGTCCAGGATCTGCGAATGCTAGGCATAAAGTTACCACTCCATACGCTGCATTTAGAGTCGGAGACTTTGGAGCTTTCACT
CACTTTGTTCGTCATCAAGGCTCGAGGATCCCTTCTGATAAAGTTGACATTTTAGCGAATCCCACTTAG
Protein sequenceShow/hide protein sequence
MPATDYQGSSASFVRRDQLYAMDGSPTSHEQDLDGFQRQVADRFLDLASVGADDLLSLSWVQKLLNSFLCCQEEFKLVLISQKSQISRPPLDRLVADYSERSVKALDVCN
AIRDGIEQLRQWQKLLEIVLSALDNCNHQKTLGEGQFRRAKKALIDLAICMLDEKDSHTSALAHRNRSFGRNNASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNNLAA
PKATELLATNGLAVPIFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKRDRRNSCGLLKEIHQIEKCTRLMNDLADAAQFPLAEEKEA
ELRQRVQELSTVCDTLRTGLDSLERQGITLNRIDGDSEHVRLGFSWFLGIRERTEDENMKTRPIGWENLSCTTLVCGYNEGYGGPGSANARHKVTTPYAAFRVGDFGAFT
HFVRHQGSRIPSDKVDILANPT