CuGenDBv2

Sed0014497 (gene) of Chayote v1 genome

Gene ID: Sed0014497
Organism: Sechium edule (Chayote v1)
Description: Protein of unknown function (DUF793)
Genome location: LG03:2980915..2982968
RNA-Seq Expression: Sed0014497
Synteny: Sed0014497
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR008511 - Protein BYPASS-related
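
The genome location above is given as a sequence name followed by a coordinate range. A minimal sketch of how such a string could be split and the span length computed is shown here. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function names are illustrative and not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split 'SEQID:START..END' into (seqid, start, end).

    Assumption: the range uses 1-based, inclusive coordinates.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if start > end:
        raise ValueError("start coordinate must not exceed end coordinate")
    return seqid, start, end


def span_length(start, end):
    """Length of an inclusive coordinate range in base pairs."""
    return end - start + 1


seqid, start, end = parse_location("LG03:2980915..2982968")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene span on LG03 is 2054 bp.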


Homology
GenBank top hits | e value | % identity | Alignment
XP_008441509.1 PREDICTED: UPF0496 protein 4 [Cucumis melo] | e value: 2.6e-192 | % identity: 88.75
Query:  MPATDYQGSSAAW--------GVRRDQLFAMDASSTSHEQDLDGFQRQVADRFSDLASVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKPPL
        MPATDYQGSSAA+        G+RRDQL+AMD S TS EQDLD FQ+QVADRF DLASV  DDLLSLSWV KLLNSFL CQE+FKL+L+SHKSQIS+PPL
Subjt:  MPATDYQGSSAAW--------GVRRDQLFAMDASSTSHEQDLDGFQRQVADRFSDLASVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKPPL

Query:  DRLVSDYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSAFAHRNRSFGRNNASKDPRSL
        DRLV+DYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCN+QK+LGEGQFRRAKKALIDLAICMLDEKDSHTSA AHRNRSFGRNNASKDPRSL
Subjt:  DRLVSDYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSAFAHRNRSFGRNNASKDPRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR
        GHFRSLSWSVSRSWSAARQLQSIGN+LAAPKATEL+ TNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWA+PILQLHDRIVEES+KR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR

Query:  DRRNSCGLLKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNRTEGLGSLGRANPSE
        DRRNSCGLLKEI+QIEKCTRLMND  D+ QFPL EEKEAELRQRVQELTTVC TLRTGLDSLERQVREVFHRIVR+RTEGL SLGRAN SE
Subjt:  DRRNSCGLLKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNRTEGLGSLGRANPSE

XP_022133831.1 uncharacterized protein LOC111006292 [Momordica charantia] | e value: 4.3e-195 | % identity: 89.26
Query:  MPATDYQGSSAAW--------GVRRDQLFAMDASSTSHEQDLDGFQRQVADRFSDLASVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKPPL
        MPATDYQGSSAA+         +RRDQLFAMD S TSHEQDLDGFQRQVADRF DLASV +DDLLSLSW+ KLLNSFL CQEEFK++LVSHKSQIS+PPL
Subjt:  MPATDYQGSSAAW--------GVRRDQLFAMDASSTSHEQDLDGFQRQVADRFSDLASVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKPPL

Query:  DRLVSDYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSAFAHRNRSFGRNNASKDPRSL
        DR+V+DYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQ++LGEGQFRRAKKALIDLAICMLDEKDSHTSA AHRNRSFGRNN SK+ RSL
Subjt:  DRLVSDYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSAFAHRNRSFGRNNASKDPRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR
        GHFRSLSWSVSRSWSAARQLQSIGNSLAAPKATEL+ATNGLAVP+FTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPR+FPWAAPILQLHDRIVEES+KR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR

Query:  DRRNSCGLLKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNRTEGLGSLGRANPSE
        DRRNSCGLLKEIHQIEKCTRLMND  DAVQFPL E+KEAELRQRVQEL+TVC TLRTGLDSLERQVREVFHRIVRNRTEGL SLGRAN SE
Subjt:  DRRNSCGLLKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNRTEGLGSLGRANPSE

XP_022963767.1 UPF0496 protein 4-like [Cucurbita moschata] | e value: 1.8e-193 | % identity: 90.08
Query:  MPATDYQGSSAAWGVRRDQLFAMDASSTSHEQDLDGFQRQVADRFSDLASVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKPPLDRLVSDYS
        MPATDYQGSSAAWGVRRDQL+AM  S TSHE DLDGFQRQVADRF +LASV ADDLLSLSWVQKLL+SFLCCQEEFKL+L S KS ISK PLDRLV+DYS
Subjt:  MPATDYQGSSAAWGVRRDQLFAMDASSTSHEQDLDGFQRQVADRFSDLASVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKPPLDRLVSDYS

Query:  ERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSAFAHRNRSFGRNNASKDPRSLGHFRSLSW
        ERSVKALDVCNAIRDG+EQLRQWQKLLEIV+SALDNCNHQK+LGEGQFRRAKKAL DLAICMLDEKDSHTS  AHRNRSFGRNN +KD RSLGHFRSLSW
Subjt:  ERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSAFAHRNRSFGRNNASKDPRSLGHFRSLSW

Query:  SVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKRDRRNSCGL
        SVSRSWSAARQLQSIGN+LAAPKATEL+ATNGLAVP+FTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPR+FPWAAPILQLHDRIVEESRKRDRRNSCGL
Subjt:  SVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKRDRRNSCGL

Query:  LKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNRTEGLGSLGRANPSE
        LKEIHQIEKCTRLMND  DA QFPL EEKEAELRQRVQELT VC+TLRTGLDS+ERQVREVFHRIVR+RTEGL SLGRAN SE
Subjt:  LKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNRTEGLGSLGRANPSE

XP_022990383.1 UPF0496 protein 4-like [Cucurbita maxima] | e value: 2.4e-193 | % identity: 89.82
Query:  MPATDYQGSSAAWGVRRDQLFAMDASSTSHEQDLDGFQRQVADRFSDLASVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKPPLDRLVSDYS
        MPATDYQGSSAAWGVRRDQL+AM+ S T HEQDLDGFQRQVADRF +LASV ADDLLSLSWVQKLL+SFLCCQEEFKL+L S KS ISK PLDRLV+DYS
Subjt:  MPATDYQGSSAAWGVRRDQLFAMDASSTSHEQDLDGFQRQVADRFSDLASVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKPPLDRLVSDYS

Query:  ERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSAFAHRNRSFGRNNASKDPRSLGHFRSLSW
        ERSVKALDVCNAIRDG+EQLRQWQKLLEIV+SALDNCNHQK+LGEGQFRRAKKAL DLAICMLDEKDSHTS  AHRNRSFGRNN +KD RSLGHFRSLSW
Subjt:  ERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSAFAHRNRSFGRNNASKDPRSLGHFRSLSW

Query:  SVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKRDRRNSCGL
        SVSRSWSAARQLQSIGN+LAAPKATEL+ATNGLAVP+FTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPR+FPWAAPILQLHDRIVEESRKRDRRNSCGL
Subjt:  SVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKRDRRNSCGL

Query:  LKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNRTEGLGSLGRANPSE
        LKEIHQIEKC RLMND  DA QFPL EEKEAELRQRVQELT VC+TLRTGLDS+ERQVREVFHRIVR+RTEGL SLGRAN SE
Subjt:  LKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNRTEGLGSLGRANPSE

XP_023528071.1 UPF0496 protein 4-like [Cucurbita pepo subsp. pepo] | e value: 4.8e-194 | % identity: 90.34
Query:  MPATDYQGSSAAWGVRRDQLFAMDASSTSHEQDLDGFQRQVADRFSDLASVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKPPLDRLVSDYS
        MPATDYQGSSAAWGVRRDQL+AM  S TSHEQDLDGFQRQVADRF +LASV ADDLLSLSWVQKLL+SFLCCQEEFKL+L S KS ISK PLDRLV+DYS
Subjt:  MPATDYQGSSAAWGVRRDQLFAMDASSTSHEQDLDGFQRQVADRFSDLASVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKPPLDRLVSDYS

Query:  ERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSAFAHRNRSFGRNNASKDPRSLGHFRSLSW
        ERSVKALDVCNAIRDG+EQLRQWQKLLEIV+SALDNCNHQK+LGEGQFRRAKKAL DLAICMLDEKDSHTS  AHRNRSFGRNN +KDPRSLGHFRSLSW
Subjt:  ERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSAFAHRNRSFGRNNASKDPRSLGHFRSLSW

Query:  SVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKRDRRNSCGL
        SVSRSWSAARQLQSIGN+LAAPKATEL+ATNGLAVP+FTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPR+FPWAA ILQLHDRIVEESRKRDRRNSCGL
Subjt:  SVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKRDRRNSCGL

Query:  LKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNRTEGLGSLGRANPSE
        LKEIHQIEKCTRLMND  DA QFPL EEKEAELRQRVQELT VC+TLRTGLDS+ERQVREVFHRIVR+RTEGL SLGRAN SE
Subjt:  LKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNRTEGLGSLGRANPSE

TrEMBL top hits | e value | % identity | Alignment
A0A1S3B3M9 UPF0496 protein 4 | e value: 1.3e-192 | % identity: 88.75
Query:  MPATDYQGSSAAW--------GVRRDQLFAMDASSTSHEQDLDGFQRQVADRFSDLASVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKPPL
        MPATDYQGSSAA+        G+RRDQL+AMD S TS EQDLD FQ+QVADRF DLASV  DDLLSLSWV KLLNSFL CQE+FKL+L+SHKSQIS+PPL
Subjt:  MPATDYQGSSAAW--------GVRRDQLFAMDASSTSHEQDLDGFQRQVADRFSDLASVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKPPL

Query:  DRLVSDYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSAFAHRNRSFGRNNASKDPRSL
        DRLV+DYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCN+QK+LGEGQFRRAKKALIDLAICMLDEKDSHTSA AHRNRSFGRNNASKDPRSL
Subjt:  DRLVSDYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSAFAHRNRSFGRNNASKDPRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR
        GHFRSLSWSVSRSWSAARQLQSIGN+LAAPKATEL+ TNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWA+PILQLHDRIVEES+KR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR

Query:  DRRNSCGLLKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNRTEGLGSLGRANPSE
        DRRNSCGLLKEI+QIEKCTRLMND  D+ QFPL EEKEAELRQRVQELTTVC TLRTGLDSLERQVREVFHRIVR+RTEGL SLGRAN SE
Subjt:  DRRNSCGLLKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNRTEGLGSLGRANPSE

A0A5A7UI93 UPF0496 protein 4 | e value: 1.3e-192 | % identity: 88.75
Query:  MPATDYQGSSAAW--------GVRRDQLFAMDASSTSHEQDLDGFQRQVADRFSDLASVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKPPL
        MPATDYQGSSAA+        G+RRDQL+AMD S TS EQDLD FQ+QVADRF DLASV  DDLLSLSWV KLLNSFL CQE+FKL+L+SHKSQIS+PPL
Subjt:  MPATDYQGSSAAW--------GVRRDQLFAMDASSTSHEQDLDGFQRQVADRFSDLASVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKPPL

Query:  DRLVSDYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSAFAHRNRSFGRNNASKDPRSL
        DRLV+DYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCN+QK+LGEGQFRRAKKALIDLAICMLDEKDSHTSA AHRNRSFGRNNASKDPRSL
Subjt:  DRLVSDYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSAFAHRNRSFGRNNASKDPRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR
        GHFRSLSWSVSRSWSAARQLQSIGN+LAAPKATEL+ TNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWA+PILQLHDRIVEES+KR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR

Query:  DRRNSCGLLKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNRTEGLGSLGRANPSE
        DRRNSCGLLKEI+QIEKCTRLMND  D+ QFPL EEKEAELRQRVQELTTVC TLRTGLDSLERQVREVFHRIVR+RTEGL SLGRAN SE
Subjt:  DRRNSCGLLKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNRTEGLGSLGRANPSE

A0A6J1BW97 uncharacterized protein LOC111006292 | e value: 2.1e-195 | % identity: 89.26
Query:  MPATDYQGSSAAW--------GVRRDQLFAMDASSTSHEQDLDGFQRQVADRFSDLASVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKPPL
        MPATDYQGSSAA+         +RRDQLFAMD S TSHEQDLDGFQRQVADRF DLASV +DDLLSLSW+ KLLNSFL CQEEFK++LVSHKSQIS+PPL
Subjt:  MPATDYQGSSAAW--------GVRRDQLFAMDASSTSHEQDLDGFQRQVADRFSDLASVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKPPL

Query:  DRLVSDYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSAFAHRNRSFGRNNASKDPRSL
        DR+V+DYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQ++LGEGQFRRAKKALIDLAICMLDEKDSHTSA AHRNRSFGRNN SK+ RSL
Subjt:  DRLVSDYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSAFAHRNRSFGRNNASKDPRSL

Query:  GHFRSLSWSVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR
        GHFRSLSWSVSRSWSAARQLQSIGNSLAAPKATEL+ATNGLAVP+FTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPR+FPWAAPILQLHDRIVEES+KR
Subjt:  GHFRSLSWSVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKR

Query:  DRRNSCGLLKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNRTEGLGSLGRANPSE
        DRRNSCGLLKEIHQIEKCTRLMND  DAVQFPL E+KEAELRQRVQEL+TVC TLRTGLDSLERQVREVFHRIVRNRTEGL SLGRAN SE
Subjt:  DRRNSCGLLKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNRTEGLGSLGRANPSE

A0A6J1HG41 UPF0496 protein 4-like | e value: 8.8e-194 | % identity: 90.08
Query:  MPATDYQGSSAAWGVRRDQLFAMDASSTSHEQDLDGFQRQVADRFSDLASVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKPPLDRLVSDYS
        MPATDYQGSSAAWGVRRDQL+AM  S TSHE DLDGFQRQVADRF +LASV ADDLLSLSWVQKLL+SFLCCQEEFKL+L S KS ISK PLDRLV+DYS
Subjt:  MPATDYQGSSAAWGVRRDQLFAMDASSTSHEQDLDGFQRQVADRFSDLASVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKPPLDRLVSDYS

Query:  ERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSAFAHRNRSFGRNNASKDPRSLGHFRSLSW
        ERSVKALDVCNAIRDG+EQLRQWQKLLEIV+SALDNCNHQK+LGEGQFRRAKKAL DLAICMLDEKDSHTS  AHRNRSFGRNN +KD RSLGHFRSLSW
Subjt:  ERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSAFAHRNRSFGRNNASKDPRSLGHFRSLSW

Query:  SVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKRDRRNSCGL
        SVSRSWSAARQLQSIGN+LAAPKATEL+ATNGLAVP+FTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPR+FPWAAPILQLHDRIVEESRKRDRRNSCGL
Subjt:  SVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKRDRRNSCGL

Query:  LKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNRTEGLGSLGRANPSE
        LKEIHQIEKCTRLMND  DA QFPL EEKEAELRQRVQELT VC+TLRTGLDS+ERQVREVFHRIVR+RTEGL SLGRAN SE
Subjt:  LKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNRTEGLGSLGRANPSE

A0A6J1JT46 UPF0496 protein 4-like | e value: 1.2e-193 | % identity: 89.82
Query:  MPATDYQGSSAAWGVRRDQLFAMDASSTSHEQDLDGFQRQVADRFSDLASVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKPPLDRLVSDYS
        MPATDYQGSSAAWGVRRDQL+AM+ S T HEQDLDGFQRQVADRF +LASV ADDLLSLSWVQKLL+SFLCCQEEFKL+L S KS ISK PLDRLV+DYS
Subjt:  MPATDYQGSSAAWGVRRDQLFAMDASSTSHEQDLDGFQRQVADRFSDLASVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKPPLDRLVSDYS

Query:  ERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSAFAHRNRSFGRNNASKDPRSLGHFRSLSW
        ERSVKALDVCNAIRDG+EQLRQWQKLLEIV+SALDNCNHQK+LGEGQFRRAKKAL DLAICMLDEKDSHTS  AHRNRSFGRNN +KD RSLGHFRSLSW
Subjt:  ERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSAFAHRNRSFGRNNASKDPRSLGHFRSLSW

Query:  SVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKRDRRNSCGL
        SVSRSWSAARQLQSIGN+LAAPKATEL+ATNGLAVP+FTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPR+FPWAAPILQLHDRIVEESRKRDRRNSCGL
Subjt:  SVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKRDRRNSCGL

Query:  LKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNRTEGLGSLGRANPSE
        LKEIHQIEKC RLMND  DA QFPL EEKEAELRQRVQELT VC+TLRTGLDS+ERQVREVFHRIVR+RTEGL SLGRAN SE
Subjt:  LKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNRTEGLGSLGRANPSE

SwissProt top hits | e value | % identity | Alignment
Q9CAK4 Protein ROH1 | e value: 6.5e-69 | % identity: 36.96
Query:  PATDYQGS-SAAWGVRRDQLFAMDASSTSHEQDLDGFQRQVADRFSDL---------------ASVSA-DDLLSLSWVQKLLNSFLCCQEEFKLLLVSHK
        PA D QGS      +RR+Q   +D ++   ++DL+ FQ+ +ADRF++L               ASV+A + ++S++W++KL++ FLCC+ EFK +L+  +
Subjt:  PATDYQGS-SAAWGVRRDQLFAMDASSTSHEQDLDGFQRQVADRFSDL---------------ASVSA-DDLLSLSWVQKLLNSFLCCQEEFKLLLVSHK

Query:  --SQISKPPLDRLVSDYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICM-LDEKDSHT---------SA
          +QISKPP DRLV +  +RS+KALD+C A+ +GI+ +R +Q+L EI ++AL+    Q+ LG+G  RRAK+AL +L + + L++K++ +         + 
Subjt:  --SQISKPPLDRLVSDYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICM-LDEKDSHT---------SA

Query:  FAHRNRSFGRNN-----ASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDR-GLQVHF
           R+ SFGR +     ASK   ++G  +S SW+V R+WSAA+Q+ ++  +L  P+  E     GL  P+F M+ V++FVMW L AA+PCQ+R GL  H 
Subjt:  FAHRNRSFGRNN-----ASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDR-GLQVHF

Query:  SL-PRNFPWAAPILQLHDRIVEESRKRDRRNSCGLLKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRI
         + P++  WA  ++ +H++I +E +K++++ S GL++E+ ++EK    + +F D   +P  ++       +V E+  +C  +   L  L++Q+REVFHRI
Subjt:  SL-PRNFPWAAPILQLHDRIVEESRKRDRRNSCGLLKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRI

Query:  VRNRTEGLGSLGRA
        VR+R E L  L +A
Subjt:  VRNRTEGLGSLGRA

Q9LMM6 Protein BPS1, chloroplastic | e value: 1.6e-06 | % identity: 20.46
Query:  LDGFQRQVADRFSDLASVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKPPLDRLVSDYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSA
        L+ F+  +A   S L      D+L++SW+++ + S        K L+   +  +S    D+ V  Y + SVK LD+CNA    + +L Q   LL+  L  
Subjt:  LDGFQRQVADRFSDLASVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKPPLDRLVSDYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSA

Query:  LDNCNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSAFAHRNRSFGRNNASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNSLAAPKATELMATNGL
        L+  N  ++L + Q               LD    H                SK+PR + + R++             L S+  +L  PK         L
Subjt:  LDNCNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSAFAHRNRSFGRNNASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNSLAAPKATELMATNGL

Query:  AVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKRDRRNSCGLLKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAEL
           ++ + +  L++     AA     + L ++ ++    PWA   +++ + +  E +     +   +LKE+  +    + +        +P +++   + 
Subjt:  AVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKRDRRNSCGLLKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAEL

Query:  RQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNR---TEGLGSLG
           +Q L    + L  G+D + ++V   F  ++  R    E L S+G
Subjt:  RQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNR---TEGLGSLG

Arabidopsis top hits | e value | % identity | Alignment
AT1G18740.1 Protein of unknown function (DUF793) | e value: 8.6e-133 | % identity: 63.31
Query:  MPATDYQGS--SAAWGVRRDQL---FAMDASSTSHEQ-----DLDGFQRQVADRFSDLASVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKP
        MPATD+QGS   +   +RRDQ+     +  SS+ HE      +LD FQRQVA++F DL + S++DLLSL W+ KLL+SFLCCQEEF+ ++ +H+SQISK 
Subjt:  MPATDYQGS--SAAWGVRRDQL---FAMDASSTSHEQ-----DLDGFQRQVADRFSDLASVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKP

Query:  PLDRLVSDYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSA-FAHRNRSFGRNNASKDP
        P+DRL+SDY ERS+KALDVCNAIRDGIEQ+RQW+KL +IV+SALD+    + +GEGQ RRAKKALIDLAI MLDEKD  +    AHRNRSFGR   S   
Subjt:  PLDRLVSDYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSA-FAHRNRSFGRNNASKDP

Query:  RSLGHFRSLSWSVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEES
        RS+GHFRSLSWSVSRSWSA++QLQ++ ++LA P+  +++A+NGLAVPV+TM  VLLFVMW LVAAIPCQDRGLQV+F +PR+F WAAP++ LHD+IVEES
Subjt:  RSLGHFRSLSWSVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEES

Query:  RKRDRRNSCGLLKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNRTEGLGSL
        ++RDR+N CGLLKEI +IEK +RLMN+ +D++ FPL ++KE E++QRV EL  V   LR GLD  ER+VREVFHRIVR+RTE L SL
Subjt:  RKRDRRNSCGLLKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNRTEGLGSL

AT1G43630.1 Protein of unknown function (DUF793) | e value: 2.1e-123 | % identity: 58.21
Query:  MPATDYQGSSAAWGVRRDQLFAMDASSTSH----EQDLDGFQRQVADRFSDL-ASVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKPPLDRL
        MP T+Y    +   +RRDQ   MD +S S     E +LD FQRQVA++F DL AS    ++LSL W+ KLL+SFLCCQE+F++++ +HK Q+ K P+DRL
Subjt:  MPATDYQGSSAAWGVRRDQLFAMDASSTSH----EQDLDGFQRQVADRFSDL-ASVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKPPLDRL

Query:  VSDYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSAFAHRNRSFGRNNASKDPRSLGHF
        + +Y ERSVKALDVCNAIRDGIEQ+RQWQKL+EIV+SALD   +Q+ LGEG+  RAKKALIDLAI MLDEKDS  +   HRNRSF RN      + +G+ 
Subjt:  VSDYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSAFAHRNRSFGRNNASKDPRSLGHF

Query:  RSLSWSVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKRD-R
        RSLSWSVSRSWSA+RQLQ IGN+LA P+A+++MATNGLA+ V+TM  +LLFV W LVAAIPCQDRGL VHF  PR+F WA P++ LHD+I++ES+KRD +
Subjt:  RSLSWSVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKRD-R

Query:  RNSCGLLKEIHQIEKCTRLMNDFVDAVQFPLVEEK-EAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNRTEGLGSLGRANPSE
        +  CGLL+EI+QIE+ +R+++D +D+  F L +EK   E+++RVQEL  VC  ++ GLD  +R+VR+VFH+IVR RTE L SLG+    E
Subjt:  RNSCGLLKEIHQIEKCTRLMNDFVDAVQFPLVEEK-EAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNRTEGLGSLGRANPSE

AT1G63930.1 from the Czech 'roh' meaning 'corner' | e value: 4.6e-70 | % identity: 36.96
Query:  PATDYQGS-SAAWGVRRDQLFAMDASSTSHEQDLDGFQRQVADRFSDL---------------ASVSA-DDLLSLSWVQKLLNSFLCCQEEFKLLLVSHK
        PA D QGS      +RR+Q   +D ++   ++DL+ FQ+ +ADRF++L               ASV+A + ++S++W++KL++ FLCC+ EFK +L+  +
Subjt:  PATDYQGS-SAAWGVRRDQLFAMDASSTSHEQDLDGFQRQVADRFSDL---------------ASVSA-DDLLSLSWVQKLLNSFLCCQEEFKLLLVSHK

Query:  --SQISKPPLDRLVSDYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICM-LDEKDSHT---------SA
          +QISKPP DRLV +  +RS+KALD+C A+ +GI+ +R +Q+L EI ++AL+    Q+ LG+G  RRAK+AL +L + + L++K++ +         + 
Subjt:  --SQISKPPLDRLVSDYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICM-LDEKDSHT---------SA

Query:  FAHRNRSFGRNN-----ASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDR-GLQVHF
           R+ SFGR +     ASK   ++G  +S SW+V R+WSAA+Q+ ++  +L  P+  E     GL  P+F M+ V++FVMW L AA+PCQ+R GL  H 
Subjt:  FAHRNRSFGRNN-----ASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDR-GLQVHF

Query:  SL-PRNFPWAAPILQLHDRIVEESRKRDRRNSCGLLKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRI
         + P++  WA  ++ +H++I +E +K++++ S GL++E+ ++EK    + +F D   +P  ++       +V E+  +C  +   L  L++Q+REVFHRI
Subjt:  SL-PRNFPWAAPILQLHDRIVEESRKRDRRNSCGLLKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRI

Query:  VRNRTEGLGSLGRA
        VR+R E L  L +A
Subjt:  VRNRTEGLGSLGRA

AT1G74450.1 Protein of unknown function (DUF793) | e value: 3.4e-137 | % identity: 62.47
Query:  MPATDYQGS--SAAWGVRRD------QLFAMDASSTSHEQDLDGFQRQVADRFSDLASVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKPPL
        MPAT+YQ S   +   +RRD      +   +    T  E +L  FQR+VA+RF DL + S +DLLSL WV KLL+SFL CQEEF+ ++++H+S I+KPP+
Subjt:  MPATDYQGS--SAAWGVRRD------QLFAMDASSTSHEQDLDGFQRQVADRFSDLASVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKPPL

Query:  DRLVSDYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDN----CNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSAFA--HRNRSFGRNNAS
        DRLVSDY ERSVKALDVCNAIRDG+EQ+RQWQKL+EIV+ A +N     + ++ LGEGQFRRA+K LI+LAI MLDEKDS +S+ +  HRNRSFGRN   
Subjt:  DRLVSDYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDN----CNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSAFA--HRNRSFGRNNAS

Query:  KDPRSLGHFRSLSWSVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIV
           R++GHFRSLSWSVSRSWSA++QLQ+IGN+LA P+A+++ ATNGL VPV+TM  VLLFVMWALVAAIPCQDRGLQVHF++PRN+ W   ++ LHDRI+
Subjt:  KDPRSLGHFRSLSWSVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIV

Query:  EESRKRDRRNSCGLLKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNRTEGLGSLGRANPSE
        EES+KR+R+N+CGLLKEIHQ EK +RLMN+ VD+VQFPL EEKE E+R+RV+EL  +   L+ GLD  ER+VREVFHRIVR+RTEGL ++G+ + SE
Subjt:  EESRKRDRRNSCGLLKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNRTEGLGSLGRANPSE

AT4G11300.1 Protein of unknown function (DUF793) | e value: 5.1e-61 | % identity: 36.65
Query:  PATDYQGSSAAWGVRRDQLFAMDASSTSHEQDLDGFQRQVADRFSDLA----SVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKPPLDRLVS
        PAT++Q S  +   RR+Q+ +M+ +    +++L+ FQ+ VA+RF++L     S  +  +LS+ W++KLL+ F+  + EF  +L S+ SQISKPPLD+LV 
Subjt:  PATDYQGSSAAWGVRRDQLFAMDASSTSHEQDLDGFQRQVADRFSDLA----SVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKPPLDRLVS

Query:  DYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSAFAHRNR-------SFGRNNASKDPR
        +  +R VKALD+C A+ +G++ +RQ Q+  EI ++AL     Q  L +G  RRAK+AL  L   +  +K+S +S      R       SFGR +      
Subjt:  DYSERSVKALDVCNAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSAFAHRNR-------SFGRNNASKDPR

Query:  SLGHFRSLSWSVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQ-DRGLQVHFSLPRNFPWAAPILQLHDRIVEES
        S G     +  VS++WSAA+Q+Q++  +L AP+        G A P++ M+ V++ VMW LV A+PCQ   GL VH  LP+N  WA   + + +R+ EE 
Subjt:  SLGHFRSLSWSVSRSWSAARQLQSIGNSLAAPKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQ-DRGLQVHFSLPRNFPWAAPILQLHDRIVEES

Query:  RKRDRRNSCGLLKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNRTE
        ++++ R   GL++E+ ++E+    + +F +  +F   E+  AE    V E+  +C  +  GL+ L+R+VREVFHR+V++R+E
Subjt:  RKRDRRNSCGLLKEIHQIEKCTRLMNDFVDAVQFPLVEEKEAELRQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNRTE
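
In the alignment blocks above, the unlabelled line between Query and Subjt follows the usual BLAST convention: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. A minimal sketch of how identity and positives could be counted from such a midline follows. It assumes the midline has the same length as the aligned query block, with spaces preserved; the function name is illustrative. The % identity values listed per hit are computed over the whole alignment, so a single block will generally give a different figure.

```python
def midline_stats(midline):
    """Count identities and positives in one BLAST-style midline block.

    Assumption: `midline` spans the full aligned block, spaces included.
    """
    length = len(midline)
    if length == 0:
        raise ValueError("midline must not be empty")
    identical = sum(1 for ch in midline if ch.isalpha())
    positives = identical + midline.count("+")
    return {
        "length": length,
        "identical": identical,
        "positives": positives,
        "percent_identity": 100.0 * identical / length,
    }


# Final block of the Q9CAK4 (Protein ROH1) alignment:
# query   VRNRTEGLGSLGRA
# midline VR+R E L  L +A
stats = midline_stats("VR+R E L  L +A")
print(stats)
```

For this 14-column block the sketch counts 7 identical positions and 9 positives.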


Sequences
CDS sequence
ATGCCAGCCACAGATTATCAGGGTTCGTCGGCTGCTTGGGGTGTAAGGCGGGATCAACTTTTCGCCATGGATGCTTCTTCGACCTCTCACGAGCAGGATCTTGATGGATT
TCAGAGGCAAGTTGCCGATCGCTTCTCCGATCTTGCTTCGGTTAGTGCTGATGATTTACTTTCTTTATCTTGGGTTCAGAAGCTTTTGAATTCCTTCCTGTGCTGTCAGG
AAGAGTTTAAGCTCCTTCTTGTTAGCCATAAATCTCAAATCTCTAAACCCCCATTGGACCGATTGGTTTCCGATTACTCGGAGCGCAGCGTCAAGGCGCTTGATGTGTGT
AATGCGATTCGTGATGGGATTGAGCAGCTCCGGCAATGGCAGAAGCTCTTGGAGATTGTTCTTAGTGCATTGGACAATTGTAATCATCAGAAATCTCTTGGGGAGGGCCA
ATTCCGTCGAGCCAAGAAGGCCCTTATCGATTTGGCTATTTGTATGCTTGATGAGAAAGATTCACATACCTCTGCTTTTGCACACCGCAACCGATCTTTCGGCCGTAACA
ATGCCTCCAAAGATCCGCGGTCTTTGGGACACTTTCGATCTCTGTCGTGGAGTGTTTCGCGGTCATGGTCGGCTGCCCGGCAGCTGCAATCGATTGGCAATAGTTTGGCT
GCTCCTAAAGCTACTGAGCTTATGGCTACTAATGGGCTTGCTGTTCCTGTTTTTACCATGAACATGGTGTTATTGTTTGTAATGTGGGCATTGGTGGCGGCTATTCCTTG
CCAGGATCGGGGTTTGCAGGTTCATTTTTCGCTACCCCGGAACTTCCCATGGGCGGCTCCAATTCTTCAACTTCATGATCGAATTGTGGAGGAGTCCAGGAAGCGAGATC
GAAGGAATTCATGTGGGTTGTTGAAGGAGATTCATCAGATTGAAAAGTGCACACGGCTCATGAATGATTTCGTGGATGCGGTTCAGTTCCCATTGGTAGAGGAGAAAGAA
GCAGAGCTGAGACAAAGAGTGCAAGAACTAACCACGGTTTGTAGTACTCTGAGGACTGGATTGGATTCATTGGAGCGGCAGGTGAGGGAAGTTTTTCACAGAATTGTTCG
CAATAGGACGGAAGGGCTTGGCTCTTTGGGACGAGCCAATCCCTCTGAATAG
mRNA sequence
GAGGTCCTGAATTCTCCCAATAAAAGGGCGTTCTCTAGGTTTTCTTTTTCTAAACCATTTTTCATCTAAATTTCAATTCAGTCGCCATTTTTTTTTTCTCTCGTTCGATC
CTTCATCATCCCACTCTAATCTCTGTTCTTCGATTTCAGCGCTTCAATTCGGATCTCGCGTTGTAATTTCTGTTGTTGATTGGGTGATTGGTCGAATCGAATCGAATTGA
TTTGGGGGAAATCCATTCTGTATCGATCGAGTTATTATTGAAATAATTCGATTGGGGAATTTGGGGATTAGGGTTTGGGGTTCGTGTTATGTGAAAGGAGAATCTCTGTG
CGAGGATGCCAGCCACAGATTATCAGGGTTCGTCGGCTGCTTGGGGTGTAAGGCGGGATCAACTTTTCGCCATGGATGCTTCTTCGACCTCTCACGAGCAGGATCTTGAT
GGATTTCAGAGGCAAGTTGCCGATCGCTTCTCCGATCTTGCTTCGGTTAGTGCTGATGATTTACTTTCTTTATCTTGGGTTCAGAAGCTTTTGAATTCCTTCCTGTGCTG
TCAGGAAGAGTTTAAGCTCCTTCTTGTTAGCCATAAATCTCAAATCTCTAAACCCCCATTGGACCGATTGGTTTCCGATTACTCGGAGCGCAGCGTCAAGGCGCTTGATG
TGTGTAATGCGATTCGTGATGGGATTGAGCAGCTCCGGCAATGGCAGAAGCTCTTGGAGATTGTTCTTAGTGCATTGGACAATTGTAATCATCAGAAATCTCTTGGGGAG
GGCCAATTCCGTCGAGCCAAGAAGGCCCTTATCGATTTGGCTATTTGTATGCTTGATGAGAAAGATTCACATACCTCTGCTTTTGCACACCGCAACCGATCTTTCGGCCG
TAACAATGCCTCCAAAGATCCGCGGTCTTTGGGACACTTTCGATCTCTGTCGTGGAGTGTTTCGCGGTCATGGTCGGCTGCCCGGCAGCTGCAATCGATTGGCAATAGTT
TGGCTGCTCCTAAAGCTACTGAGCTTATGGCTACTAATGGGCTTGCTGTTCCTGTTTTTACCATGAACATGGTGTTATTGTTTGTAATGTGGGCATTGGTGGCGGCTATT
CCTTGCCAGGATCGGGGTTTGCAGGTTCATTTTTCGCTACCCCGGAACTTCCCATGGGCGGCTCCAATTCTTCAACTTCATGATCGAATTGTGGAGGAGTCCAGGAAGCG
AGATCGAAGGAATTCATGTGGGTTGTTGAAGGAGATTCATCAGATTGAAAAGTGCACACGGCTCATGAATGATTTCGTGGATGCGGTTCAGTTCCCATTGGTAGAGGAGA
AAGAAGCAGAGCTGAGACAAAGAGTGCAAGAACTAACCACGGTTTGTAGTACTCTGAGGACTGGATTGGATTCATTGGAGCGGCAGGTGAGGGAAGTTTTTCACAGAATT
GTTCGCAATAGGACGGAAGGGCTTGGCTCTTTGGGACGAGCCAATCCCTCTGAATAGAATTGTTGGAGTTTCAGAAACATGTTCGTGTTCGGTTTTCATGGTTTCTCTAG
GATCAAGGAAGAACAGAGGATGATAATATGAAGACGAGAGTGACAGTTATAAAAATGGCGAGGGGGCTCTTTGTATTTATTTACTCAAGCGTATATTGTGTAAAGAAATT
TGGATTTGAAAATACTGGTATGGATCTTCTCCATCGGGTAGTGTCTTGGTAATGTCATATTGTTTCAATGCCATATTTATCAGCCGTGGCTGAGATATATTGTTCATTGT
TACACAGTCGTCTGTTGGTCCATGTTTTATACGAGTTCTGCTGCAAAAGATTGCATTACTGTCAAAAGTGTCACATTTTTTTTTATTCGTGTTCTCTCATTGTTCTTTCT
TCAAGATTTTGTGAATTTTTTTTTTGAGTTCCTGCACTCTAGAAAGCAAATCCAAATGATCATTTCCTGTTGTTTCAATGAGCTTCTCTGTAAAATCCTCTTTCCATCCC
ATTATAGTTGGGATCTCTTACTTGTTTTATGAGATGGGAACCTCCTCTTTTCAAGTGCGAAAGTTTTATTCGAG
Protein sequence
MPATDYQGSSAAWGVRRDQLFAMDASSTSHEQDLDGFQRQVADRFSDLASVSADDLLSLSWVQKLLNSFLCCQEEFKLLLVSHKSQISKPPLDRLVSDYSERSVKALDVC
NAIRDGIEQLRQWQKLLEIVLSALDNCNHQKSLGEGQFRRAKKALIDLAICMLDEKDSHTSAFAHRNRSFGRNNASKDPRSLGHFRSLSWSVSRSWSAARQLQSIGNSLA
APKATELMATNGLAVPVFTMNMVLLFVMWALVAAIPCQDRGLQVHFSLPRNFPWAAPILQLHDRIVEESRKRDRRNSCGLLKEIHQIEKCTRLMNDFVDAVQFPLVEEKE
AELRQRVQELTTVCSTLRTGLDSLERQVREVFHRIVRNRTEGLGSLGRANPSE
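
The CDS and protein sequences above can be cross-checked by translating the CDS. A minimal sketch follows, assuming the standard genetic code and a CDS that starts in frame; the `translate` helper is illustrative and not taken from this page. The two test strings are the first 24 and last 12 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a nucleotide string in frame 1; '*' marks a stop codon.

    Codons containing characters other than A, C, G, T become 'X';
    a trailing partial codon is ignored.
    """
    seq = "".join(cds.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


print(translate("ATGCCAGCCACAGATTATCAGGGT"))  # start of the CDS
print(translate("CCCTCTGAATAG"))              # end of the CDS
```

The start of the CDS translates to MPATDYQG and the end to PSE followed by a stop codon, which agrees with the first and last residues of the protein sequence shown above.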