; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh06G016980 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh06G016980
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionProtein of unknown function (DUF3755)
Genome locationCmo_Chr06:11697206..11711566
RNA-Seq ExpressionCmoCh06G016980
SyntenyCmoCh06G016980
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR001005 - SANT/Myb domain
IPR022228 - Protein of unknown function DUF3755
IPR040283 - Transmembrane protein DDB_G0292058-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597688.1 hypothetical protein SDJN03_10868, partial [Cucurbita argyrosperma subsp. sororia]8.8e-25387.43Show/hide
Query:  MMGSRNGVKGFIIPLIWMFISSSWIFPETVGQKISSSSSLIQYGRDFVQRNDGLEALEEEDNTVRVDPLNHFKMYRGGYNITNKHYWSSTIFTGAAGYGI
        MMGSRNGVKGFIIPLIWMFISSSWIFPETVGQKISSSSSLIQYGRDFVQRNDGLEALEEEDNTVRVDPLNHFKMYRGGYNITNKHYWSSTIFTGAAGYGI
Subjt:  MMGSRNGVKGFIIPLIWMFISSSWIFPETVGQKISSSSSLIQYGRDFVQRNDGLEALEEEDNTVRVDPLNHFKMYRGGYNITNKHYWSSTIFTGAAGYGI

Query:  GVAWLVCGIAYGGFFAATLCRGKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIVGSSRFDREAKDVVKIIIETANGASNTIQNTTSAMKD
        GVAWLVCGIAYGGFFAATLC GKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIVGSSRFDREAKDVVKIIIETANGASNTIQNTTSAMKD
Subjt:  GVAWLVCGIAYGGFFAATLCRGKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIVGSSRFDREAKDVVKIIIETANGASNTIQNTTSAMKD

Query:  MIHNLEASKGIGTHEEAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISLNLGAVIGLS-----------GEFV---WFLKNIKKT
        MIHNLEASKGIGTHEEAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISLNLGAVIGLS             F+   WFL  +   
Subjt:  MIHNLEASKGIGTHEEAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISLNLGAVIGLS-----------GEFV---WFLKNIKKT

Query:  TKSVH-----FSIDSCRALEMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNTQISVSYPDTGL--------------------------
           ++     FSIDSCRALEMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNTQIS+SYPDTGL                          
Subjt:  TKSVH-----FSIDSCRALEMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNTQISVSYPDTGL--------------------------

Query:  ----VLKLLTCSDESHGGCENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVLIWT
            VLKLLTCSDESHGGCENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVLIWT
Subjt:  ----VLKLLTCSDESHGGCENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVLIWT

Query:  ISANVEHKLHPSADASVLPNSSTTPKMMETSTL
        ISANVEHKLHPSADASVLPNSST+PKMME STL
Subjt:  ISANVEHKLHPSADASVLPNSSTTPKMMETSTL

KAG7029132.1 hypothetical protein SDJN02_10317 [Cucurbita argyrosperma subsp. argyrosperma]1.0e-25387.13Show/hide
Query:  PVKMMGSRNGVKGFIIPLIWMFISSSWIFPETVGQKISSSSSLIQYGRDFVQRNDGLEALEEEDNTVRVDPLNHFKMYRGGYNITNKHYWSSTIFTGAAG
        P++MMGSRNGVKGFIIPLIWMFISSSWIFPETVGQKISSSSSLIQYGRDFVQRNDGLEALEEEDNTVRVDPLNHFKMYRGGYNITNKHYWSSTIFTGAAG
Subjt:  PVKMMGSRNGVKGFIIPLIWMFISSSWIFPETVGQKISSSSSLIQYGRDFVQRNDGLEALEEEDNTVRVDPLNHFKMYRGGYNITNKHYWSSTIFTGAAG

Query:  YGIGVAWLVCGIAYGGFFAATLCRGKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIVGSSRFDREAKDVVKIIIETANGASNTIQNTTSA
        YGIGVAWLVCGIAYGGFFAATLC GKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIVGSSRFDREAKDVVKIIIETANGASNTIQNTTSA
Subjt:  YGIGVAWLVCGIAYGGFFAATLCRGKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIVGSSRFDREAKDVVKIIIETANGASNTIQNTTSA

Query:  MKDMIHNLEASKGIGTHEEAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISLNLGAVIGLS-----------GEFV---WFLKNI
        MKDMIHNLEASKGIGTHEEAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISLNLGAVIGLS             F+   WFL  +
Subjt:  MKDMIHNLEASKGIGTHEEAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISLNLGAVIGLS-----------GEFV---WFLKNI

Query:  KKTTKSVH-----FSIDSCRALEMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNTQISVSYPDTGL-----------------------
              ++     FSIDSCRALEMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNTQIS+SYPDTGL                       
Subjt:  KKTTKSVH-----FSIDSCRALEMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNTQISVSYPDTGL-----------------------

Query:  -------VLKLLTCSDESHGGCENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVL
               VLKLLTCSDESHGGCENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVL
Subjt:  -------VLKLLTCSDESHGGCENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVL

Query:  IWTISANVEHKLHPSADASVLPNSSTTPKMMETSTL
        IWTISANVEHKLHPSADASVLPNSST+PKMME STL
Subjt:  IWTISANVEHKLHPSADASVLPNSSTTPKMMETSTL

XP_022932736.1 uncharacterized protein LOC111439196 isoform X1 [Cucurbita moschata]5.5e-25588.18Show/hide
Query:  MMGSRNGVKGFIIPLIWMFISSSWIFPETVGQKISSSSSLIQYGRDFVQRNDGLEALEEEDNTVRVDPLNHFKMYRGGYNITNKHYWSSTIFTGAAGYGI
        MMGSRNGVKGFIIPLIWMFISSSWIFPETVGQKISSSSSLIQYGRDFVQRNDGLEALEEEDNTVRVDPLNHFKMYRGGYNITNKHYWSSTIFTGAAGYGI
Subjt:  MMGSRNGVKGFIIPLIWMFISSSWIFPETVGQKISSSSSLIQYGRDFVQRNDGLEALEEEDNTVRVDPLNHFKMYRGGYNITNKHYWSSTIFTGAAGYGI

Query:  GVAWLVCGIAYGGFFAATLCRGKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIVGSSRFDREAKDVVKIIIETANGASNTIQNTTSAMKD
        GVAWLVCGIAYGGFFAATLCRGKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIVGSSRFDREAKDVVKIIIETANGASNTIQNTTSAMKD
Subjt:  GVAWLVCGIAYGGFFAATLCRGKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIVGSSRFDREAKDVVKIIIETANGASNTIQNTTSAMKD

Query:  MIHNLEASKGIGTHEEAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISLNLGAVIGLS-----------GEFV---WFLKNIKKT
        MIHNLEASKGIGTHEEAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISLNLGAVIGLS             F+   WFL  +   
Subjt:  MIHNLEASKGIGTHEEAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISLNLGAVIGLS-----------GEFV---WFLKNIKKT

Query:  TKSVH-----FSIDSCRALEMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNTQISVSYPDTGL--------------------------
           ++     FSIDSCRALEMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNTQISVSYPDTGL                          
Subjt:  TKSVH-----FSIDSCRALEMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNTQISVSYPDTGL--------------------------

Query:  ----VLKLLTCSDESHGGCENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVLIWT
            VLKLLTCSDESHGGCENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVLIWT
Subjt:  ----VLKLLTCSDESHGGCENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVLIWT

Query:  ISANVEHKLHPSADASVLPNSSTTPKMMETSTL
        ISANVEHKLHPSADASVLPNSSTTPKMMETSTL
Subjt:  ISANVEHKLHPSADASVLPNSSTTPKMMETSTL

XP_022932737.1 uncharacterized protein LOC111439196 isoform X2 [Cucurbita moschata]2.9e-25691.05Show/hide
Query:  MMGSRNGVKGFIIPLIWMFISSSWIFPETVGQKISSSSSLIQYGRDFVQRNDGLEALEEEDNTVRVDPLNHFKMYRGGYNITNKHYWSSTIFTGAAGYGI
        MMGSRNGVKGFIIPLIWMFISSSWIFPETVGQKISSSSSLIQYGRDFVQRNDGLEALEEEDNTVRVDPLNHFKMYRGGYNITNKHYWSSTIFTGAAGYGI
Subjt:  MMGSRNGVKGFIIPLIWMFISSSWIFPETVGQKISSSSSLIQYGRDFVQRNDGLEALEEEDNTVRVDPLNHFKMYRGGYNITNKHYWSSTIFTGAAGYGI

Query:  GVAWLVCGIAYGGFFAATLCRGKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIVGSSRFDREAKDVVKIIIETANGASNTIQNTTSAMKD
        GVAWLVCGIAYGGFFAATLCRGKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIVGSSRFDREAKDVVKIIIETANGASNTIQNTTSAMKD
Subjt:  GVAWLVCGIAYGGFFAATLCRGKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIVGSSRFDREAKDVVKIIIETANGASNTIQNTTSAMKD

Query:  MIHNLEASKGIGTHEEAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISLNLGAVIGLSGEFVWFLKNIKKTTKSVHFSIDSCRAL
        MIHNLEASKGIGTHEEAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISLNLGAVIGLS   V+ +  + +   S  FSIDSCRAL
Subjt:  MIHNLEASKGIGTHEEAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISLNLGAVIGLSGEFVWFLKNIKKTTKSVHFSIDSCRAL

Query:  EMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNTQISVSYPDTGL------------------------------VLKLLTCSDESHGGC
        EMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNTQISVSYPDTGL                              VLKLLTCSDESHGGC
Subjt:  EMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNTQISVSYPDTGL------------------------------VLKLLTCSDESHGGC

Query:  ENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVLIWTISANVEHKLHPSADASVLP
        ENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVLIWTISANVEHKLHPSADASVLP
Subjt:  ENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVLIWTISANVEHKLHPSADASVLP

Query:  NSSTTPKMMETSTL
        NSSTTPKMMETSTL
Subjt:  NSSTTPKMMETSTL

XP_023540591.1 uncharacterized protein LOC111800908 [Cucurbita pepo subsp. pepo]9.1e-25086.87Show/hide
Query:  MMGSRNGVKGFIIPLIWMFISSSWIFPETVGQKISSSSSLIQYGRDFVQRNDGLEALEEEDNTVRVDPLNHFKMYRGGYNITNKHYWSSTIFTGAAGYGI
        MMGSRNGVKGFIIPLIWMFISSSWIFPETVGQKISSSSSLIQYGRD VQRNDGLEALEEEDNTVRVDPLNHFKMYRGGYNITNKHYWSSTIFTGAAGYGI
Subjt:  MMGSRNGVKGFIIPLIWMFISSSWIFPETVGQKISSSSSLIQYGRDFVQRNDGLEALEEEDNTVRVDPLNHFKMYRGGYNITNKHYWSSTIFTGAAGYGI

Query:  GVAWLVCGIAYGGFFAATLCRGKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIVGSSRFDREAKDVVKIIIETANGASNTIQNTTSAMKD
        GVAWLVCGIAYGGFFAATLC GKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIVGSSRFDREAKDVVKIIIETANGASNTIQNTTSAMKD
Subjt:  GVAWLVCGIAYGGFFAATLCRGKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIVGSSRFDREAKDVVKIIIETANGASNTIQNTTSAMKD

Query:  MIHNLEASKGIGTHEEAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISLNLGAVIGLS-----------GEFV---WFLKNIKKT
        MIHNLEASKGIGTHEEAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISLNLGAVIGLS             F+   WFL  +   
Subjt:  MIHNLEASKGIGTHEEAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISLNLGAVIGLS-----------GEFV---WFLKNIKKT

Query:  TKSVH-----FSIDSCRALEMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNTQISVSYPDTGL--------------------------
           ++     FSIDSCRALEMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNT I VSYPD GL                          
Subjt:  TKSVH-----FSIDSCRALEMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNTQISVSYPDTGL--------------------------

Query:  ----VLKLLTCSDESHGGCENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVLIWT
            VLKLLTCSDES+GGCENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVLIWT
Subjt:  ----VLKLLTCSDESHGGCENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVLIWT

Query:  ISANVEHKLHPSADASVLPNSSTTPKMMETSTL
        ISANVEHKLHPSADASVLPNSSTTPKMME STL
Subjt:  ISANVEHKLHPSADASVLPNSSTTPKMMETSTL

TrEMBL top hitse value%identityAlignment
A0A0A0KYT1 Uncharacterized protein3.8e-19369.36Show/hide
Query:  MMGSRNGVKGFI-IPLIWMFISSSWIFPETVGQKISSSSSLIQYGRDFVQRNDGLEALEEEDNTVRVDPLNHFKMYRGGYNITNKHYWSSTIFTGAAGYG
        MMGS+NGVK  + I L+++ +SSSWIFPET+GQ+ISSS+SL+Q GRDFV++NDGLEA++E D+TVRVDPLNHF  YRGGYNITNKHYWSST+FTGA GYG
Subjt:  MMGSRNGVKGFI-IPLIWMFISSSWIFPETVGQKISSSSSLIQYGRDFVQRNDGLEALEEEDNTVRVDPLNHFKMYRGGYNITNKHYWSSTIFTGAAGYG

Query:  IGVAWLVCGIAYGGFFAATL-CRGKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIVGSSRFDREAKDVVKIIIETANGASNTIQNTTSAM
        IGV WLVCGIAYGGF  ATL C GK R K KLKKM H G +FYLWTILLA FFTILA+VGCG+VI GS+RFD+EAK+VVKIIIETANGASNTIQNTTSAM
Subjt:  IGVAWLVCGIAYGGFFAATL-CRGKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIVGSSRFDREAKDVVKIIIETANGASNTIQNTTSAM

Query:  KDMIHNLEASKGIGTHE-EAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISLNLGAVIGLSGEFVWFLKNIKKTTKSV-------
        KDMI NLEASK  G++  +  + TLTSTSH LDAQAANIQ QANKNR LIHKGLNI+YIVTMVT+SLNLGAVI +S   +  L+ +      +       
Subjt:  KDMIHNLEASKGIGTHE-EAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISLNLGAVIGLSGEFVWFLKNIKKTTKSV-------

Query:  ------------HFSIDSCRALEMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNTQISVSYPDTGL-----------------------
                    +FS D+C+ALEMFQENPNNNSLSSILPCEQLLTAKS LTDVSSEIYDLVNQVNTQI++SYPD  L                       
Subjt:  ------------HFSIDSCRALEMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNTQISVSYPDTGL-----------------------

Query:  -------VLKLLTCSDESHGGCENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVL
               VLKLLTC+DE++GGCEN QFMSN EYKTVEAYTNS+QDF NVYPGMESLV CQTVKDAFSKILEHHC+PLE YA MVW GLVFVS+VM+CLVL
Subjt:  -------VLKLLTCSDESHGGCENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVL

Query:  IWTISANVEHKLHPSADASVLPNSSTTPKMME
        IWTI AN++ KLH   D SV PNSS TPK ME
Subjt:  IWTISANVEHKLHPSADASVLPNSSTTPKMME

A0A6J1EX81 uncharacterized protein LOC111439196 isoform X12.7e-25588.18Show/hide
Query:  MMGSRNGVKGFIIPLIWMFISSSWIFPETVGQKISSSSSLIQYGRDFVQRNDGLEALEEEDNTVRVDPLNHFKMYRGGYNITNKHYWSSTIFTGAAGYGI
        MMGSRNGVKGFIIPLIWMFISSSWIFPETVGQKISSSSSLIQYGRDFVQRNDGLEALEEEDNTVRVDPLNHFKMYRGGYNITNKHYWSSTIFTGAAGYGI
Subjt:  MMGSRNGVKGFIIPLIWMFISSSWIFPETVGQKISSSSSLIQYGRDFVQRNDGLEALEEEDNTVRVDPLNHFKMYRGGYNITNKHYWSSTIFTGAAGYGI

Query:  GVAWLVCGIAYGGFFAATLCRGKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIVGSSRFDREAKDVVKIIIETANGASNTIQNTTSAMKD
        GVAWLVCGIAYGGFFAATLCRGKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIVGSSRFDREAKDVVKIIIETANGASNTIQNTTSAMKD
Subjt:  GVAWLVCGIAYGGFFAATLCRGKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIVGSSRFDREAKDVVKIIIETANGASNTIQNTTSAMKD

Query:  MIHNLEASKGIGTHEEAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISLNLGAVIGLS-----------GEFV---WFLKNIKKT
        MIHNLEASKGIGTHEEAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISLNLGAVIGLS             F+   WFL  +   
Subjt:  MIHNLEASKGIGTHEEAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISLNLGAVIGLS-----------GEFV---WFLKNIKKT

Query:  TKSVH-----FSIDSCRALEMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNTQISVSYPDTGL--------------------------
           ++     FSIDSCRALEMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNTQISVSYPDTGL                          
Subjt:  TKSVH-----FSIDSCRALEMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNTQISVSYPDTGL--------------------------

Query:  ----VLKLLTCSDESHGGCENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVLIWT
            VLKLLTCSDESHGGCENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVLIWT
Subjt:  ----VLKLLTCSDESHGGCENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVLIWT

Query:  ISANVEHKLHPSADASVLPNSSTTPKMMETSTL
        ISANVEHKLHPSADASVLPNSSTTPKMMETSTL
Subjt:  ISANVEHKLHPSADASVLPNSSTTPKMMETSTL

A0A6J1EXL4 uncharacterized protein LOC111439196 isoform X21.4e-25691.05Show/hide
Query:  MMGSRNGVKGFIIPLIWMFISSSWIFPETVGQKISSSSSLIQYGRDFVQRNDGLEALEEEDNTVRVDPLNHFKMYRGGYNITNKHYWSSTIFTGAAGYGI
        MMGSRNGVKGFIIPLIWMFISSSWIFPETVGQKISSSSSLIQYGRDFVQRNDGLEALEEEDNTVRVDPLNHFKMYRGGYNITNKHYWSSTIFTGAAGYGI
Subjt:  MMGSRNGVKGFIIPLIWMFISSSWIFPETVGQKISSSSSLIQYGRDFVQRNDGLEALEEEDNTVRVDPLNHFKMYRGGYNITNKHYWSSTIFTGAAGYGI

Query:  GVAWLVCGIAYGGFFAATLCRGKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIVGSSRFDREAKDVVKIIIETANGASNTIQNTTSAMKD
        GVAWLVCGIAYGGFFAATLCRGKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIVGSSRFDREAKDVVKIIIETANGASNTIQNTTSAMKD
Subjt:  GVAWLVCGIAYGGFFAATLCRGKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIVGSSRFDREAKDVVKIIIETANGASNTIQNTTSAMKD

Query:  MIHNLEASKGIGTHEEAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISLNLGAVIGLSGEFVWFLKNIKKTTKSVHFSIDSCRAL
        MIHNLEASKGIGTHEEAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISLNLGAVIGLS   V+ +  + +   S  FSIDSCRAL
Subjt:  MIHNLEASKGIGTHEEAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISLNLGAVIGLSGEFVWFLKNIKKTTKSVHFSIDSCRAL

Query:  EMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNTQISVSYPDTGL------------------------------VLKLLTCSDESHGGC
        EMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNTQISVSYPDTGL                              VLKLLTCSDESHGGC
Subjt:  EMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNTQISVSYPDTGL------------------------------VLKLLTCSDESHGGC

Query:  ENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVLIWTISANVEHKLHPSADASVLP
        ENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVLIWTISANVEHKLHPSADASVLP
Subjt:  ENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVLIWTISANVEHKLHPSADASVLP

Query:  NSSTTPKMMETSTL
        NSSTTPKMMETSTL
Subjt:  NSSTTPKMMETSTL

A0A6J1I7I1 uncharacterized protein LOC111470709 isoform X22.3e-24687.55Show/hide
Query:  MMGSRNGVKGFIIPLIWMFISSSWIFPETVGQKISSSSSLIQYGRDFVQRNDGLEALEEEDNTVRVDPLNHFKMYRGGYNITNKHYWSSTIFTGAAGYGI
        MMGSRNGVKGFIIPLIWMFISSSWIFPETVGQKISSSSSLIQYGRD VQRNDGLEALEEEDNTVRVDPLNHFKMYRGGYNITNKHYWSSTIFTGA GYGI
Subjt:  MMGSRNGVKGFIIPLIWMFISSSWIFPETVGQKISSSSSLIQYGRDFVQRNDGLEALEEEDNTVRVDPLNHFKMYRGGYNITNKHYWSSTIFTGAAGYGI

Query:  GVAWLVCGIAYGGFFAATLCRGKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIVGSSRFDREAKDVVKIIIETANGASNTIQNTTSAMKD
        GVAWLVCGIAYGGF+A TLC GKRR KGKLKK+SHCG KFYLWT LLATFFTILA+VGCGLVIVGSSRFDREAKDVVKIIIETANGA NTIQNTTSAMKD
Subjt:  GVAWLVCGIAYGGFFAATLCRGKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIVGSSRFDREAKDVVKIIIETANGASNTIQNTTSAMKD

Query:  MIHNLEASKGIGTHEEAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISLNLGAVIGLSGEFVWFLKNIKKTTKSVHFSIDSCRAL
        MIHNLEASKGIGTHEEAAATTLTSTSH LDAQAANIQCQANKNRRLIHKGLNIMYI TMVTISLNLGAVIGLS   V+ +  + +   S  FS+DSCRAL
Subjt:  MIHNLEASKGIGTHEEAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISLNLGAVIGLSGEFVWFLKNIKKTTKSVHFSIDSCRAL

Query:  EMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNTQISVSYPDTGL------------------------------VLKLLTCSDESHGGC
        EMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNTQISVSYPD GL                              VLKLLTCSDES+GGC
Subjt:  EMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNTQISVSYPDTGL------------------------------VLKLLTCSDESHGGC

Query:  ENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVLIWTISANVEHKLHPSADASVLP
        ENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVLIWTISANVEHKLHPSADASVLP
Subjt:  ENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVLIWTISANVEHKLHPSADASVLP

Query:  NSSTTPKMMETSTL
        NSSTTP+MME STL
Subjt:  NSSTTPKMMETSTL

A0A6J1IAE6 uncharacterized protein LOC111470709 isoform X14.2e-24584.8Show/hide
Query:  MMGSRNGVKGFIIPLIWMFISSSWIFPETVGQKISSSSSLIQYGRDFVQRNDGLEALEEEDNTVRVDPLNHFKMYRGGYNITNKHYWSSTIFTGAAGYGI
        MMGSRNGVKGFIIPLIWMFISSSWIFPETVGQKISSSSSLIQYGRD VQRNDGLEALEEEDNTVRVDPLNHFKMYRGGYNITNKHYWSSTIFTGA GYGI
Subjt:  MMGSRNGVKGFIIPLIWMFISSSWIFPETVGQKISSSSSLIQYGRDFVQRNDGLEALEEEDNTVRVDPLNHFKMYRGGYNITNKHYWSSTIFTGAAGYGI

Query:  GVAWLVCGIAYGGFFAATLCRGKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIVGSSRFDREAKDVVKIIIETANGASNTIQNTTSAMKD
        GVAWLVCGIAYGGF+A TLC GKRR KGKLKK+SHCG KFYLWT LLATFFTILA+VGCGLVIVGSSRFDREAKDVVKIIIETANGA NTIQNTTSAMKD
Subjt:  GVAWLVCGIAYGGFFAATLCRGKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIVGSSRFDREAKDVVKIIIETANGASNTIQNTTSAMKD

Query:  MIHNLEASKGIGTHEEAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISLNLGAVIGLS-----------GEFV---WFLKNIKKT
        MIHNLEASKGIGTHEEAAATTLTSTSH LDAQAANIQCQANKNRRLIHKGLNIMYI TMVTISLNLGAVIGLS             F+   WFL  +   
Subjt:  MIHNLEASKGIGTHEEAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISLNLGAVIGLS-----------GEFV---WFLKNIKKT

Query:  TKSVH-----FSIDSCRALEMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNTQISVSYPDTGL--------------------------
           ++     FS+DSCRALEMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNTQISVSYPD GL                          
Subjt:  TKSVH-----FSIDSCRALEMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNTQISVSYPDTGL--------------------------

Query:  ----VLKLLTCSDESHGGCENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVLIWT
            VLKLLTCSDES+GGCENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVLIWT
Subjt:  ----VLKLLTCSDESHGGCENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVLIWT

Query:  ISANVEHKLHPSADASVLPNSSTTPKMMETSTL
        ISANVEHKLHPSADASVLPNSSTTP+MME STL
Subjt:  ISANVEHKLHPSADASVLPNSSTTPKMMETSTL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G07565.1 Protein of unknown function (DUF3755)8.1e-4740.49Show/hide
Query:  ANPSGNHQEAGQPSSSFDGGNPSNGNSTPV----------PAADSSSSALAMKHNPGISTDWTSDEQLTLEEGLKKYAGESSVIRYAKIAMQLPNKTVRD
        ANPSGN+QE    +      + +  N   V           AAD+S +  A++HNPGISTDWT +EQ  LE+ L KYA E SV RYAKIAM++ +KTVRD
Subjt:  ANPSGNHQEAGQPSSSFDGGNPSNGNSTPV----------PAADSSSSALAMKHNPGISTDWTSDEQLTLEEGLKKYAGESSVIRYAKIAMQLPNKTVRD

Query:  VALRCRWM---------------------NERVSDPSMK-SAQVATRPNVSPYGMPMIPMDNDDGVSYKGFVSTHVFLHTTMCFGLLQLPLFILNMMDEG
        VALRCRWM                      E+ +D S K S+ +   PN   Y  PM+P+D DDG+SYK                               
Subjt:  VALRCRWM---------------------NERVSDPSMK-SAQVATRPNVSPYGMPMIPMDNDDGVSYKGFVSTHVFLHTTMCFGLLQLPLFILNMMDEG

Query:  VNFISVITCRSQAMKVASIGGTTGELLEQNAHAMNQISSNLASFQIQDNISLFCQTRDNILKIMSDLNEMPEVMKQMPPLPVKM
                         +IGG +G+LLEQNA   NQ+S+N ++FQ+ +N+++ C+ RDNIL I++DLN+MPEVMKQMPPLPVK+
Subjt:  VNFISVITCRSQAMKVASIGGTTGELLEQNAHAMNQISSNLASFQIQDNISLFCQTRDNILKIMSDLNEMPEVMKQMPPLPVKM

AT3G07565.2 Protein of unknown function (DUF3755)4.9e-2843.02Show/hide
Query:  ANPSGNHQEAGQPSSSFDGGNPSNGNSTPV----------PAADSSSSALAMKHNPGISTDWTSDEQLTLEEGLKKYAGESSVIRYAKIAMQLPNKTVRD
        ANPSGN+QE    +      + +  N   V           AAD+S +  A++HNPGISTDWT +EQ  LE+ L KYA E SV RYAKIAM++ +KTVRD
Subjt:  ANPSGNHQEAGQPSSSFDGGNPSNGNSTPV----------PAADSSSSALAMKHNPGISTDWTSDEQLTLEEGLKKYAGESSVIRYAKIAMQLPNKTVRD

Query:  VALRCRWM---------------------NERVSDPSMK-SAQVATRPNVSPYGMPMIPMDNDDGVSYKGFVSTHVFLH
        VALRCRWM                      E+ +D S K S+ +   PN   Y  PM+P+D DDG+SYKG      F H
Subjt:  VALRCRWM---------------------NERVSDPSMK-SAQVATRPNVSPYGMPMIPMDNDDGVSYKGFVSTHVFLH

AT3G07565.3 Protein of unknown function (DUF3755)1.1e-4640.35Show/hide
Query:  ANPSGNHQEAGQPSSSFDGGNPSNGNSTPV----------PAADSSSSALAMKHNPGISTDWTSDEQLTLEEGLKKYAGESSVIRYAKIAMQLPNKTVRD
        ANPSGN+QE    +      + +  N   V           AAD+S +  A++HNPGISTDWT +EQ  LE+ L KYA E SV RYAKIAM++ +KTVRD
Subjt:  ANPSGNHQEAGQPSSSFDGGNPSNGNSTPV----------PAADSSSSALAMKHNPGISTDWTSDEQLTLEEGLKKYAGESSVIRYAKIAMQLPNKTVRD

Query:  VALRCRWM----------------------NERVSDPSMK-SAQVATRPNVSPYGMPMIPMDNDDGVSYKGFVSTHVFLHTTMCFGLLQLPLFILNMMDE
        VALRCRWM                       E+ +D S K S+ +   PN   Y  PM+P+D DDG+SYK                              
Subjt:  VALRCRWM----------------------NERVSDPSMK-SAQVATRPNVSPYGMPMIPMDNDDGVSYKGFVSTHVFLHTTMCFGLLQLPLFILNMMDE

Query:  GVNFISVITCRSQAMKVASIGGTTGELLEQNAHAMNQISSNLASFQIQDNISLFCQTRDNILKIMSDLNEMPEVMKQMPPLPVKM
                          +IGG +G+LLEQNA   NQ+S+N ++FQ+ +N+++ C+ RDNIL I++DLN+MPEVMKQMPPLPVK+
Subjt:  GVNFISVITCRSQAMKVASIGGTTGELLEQNAHAMNQISSNLASFQIQDNISLFCQTRDNILKIMSDLNEMPEVMKQMPPLPVKM

AT3G07565.4 Protein of unknown function (DUF3755)1.7e-4439.12Show/hide
Query:  ANPSGNHQEAGQPSSSFDGGNPSNGNSTPV----------PAADSSSSALAMKHNPGISTDWTSDEQLTLEEGLKKYAGESSVIRYAKIAMQLPNKTVRD
        ANPSGN+QE    +      + +  N   V           AAD+S +  A++HNPGISTDWT +EQ  LE+ L KYA E SV RYAKIAM++ +KTVRD
Subjt:  ANPSGNHQEAGQPSSSFDGGNPSNGNSTPV----------PAADSSSSALAMKHNPGISTDWTSDEQLTLEEGLKKYAGESSVIRYAKIAMQLPNKTVRD

Query:  VALRCRWM----------------------NERVSDPSMK-SAQVATRPNVSPYGMPMIPMDNDDGVSYKGFVSTHVFLHTTMCFGLLQLPLFILNMMDE
        VALRCRWM                       E+ +D S K S+ +   PN   Y  PM+P+D DDG+SYK                              
Subjt:  VALRCRWM----------------------NERVSDPSMK-SAQVATRPNVSPYGMPMIPMDNDDGVSYKGFVSTHVFLHTTMCFGLLQLPLFILNMMDE

Query:  GVNFISVITCRSQAMKVASIGGTTGELLEQNAHAMNQISSNLASFQI---------QDNISLFCQTRDNILKIMSDLNEMPEVMKQMPPLPVKM
                          +IGG +G+LLEQNA   NQ+S+N ++FQ+          +N+++ C+ RDNIL I++DLN+MPEVMKQMPPLPVK+
Subjt:  GVNFISVITCRSQAMKVASIGGTTGELLEQNAHAMNQISSNLASFQI---------QDNISLFCQTRDNILKIMSDLNEMPEVMKQMPPLPVKM

AT5G67550.1 unknown protein1.9e-3224.22Show/hide
Query:  RVDPLNHFKMYRGGYNITNKHYWSSTIFTGAAGYGIGVAWLVCGIAYGGFFAATLCRGKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIV
        R DPLN F+ Y GG+N+ NKHYW++T FTG  GY +    ++ GI  G + A +    KRR     ++     D++YL   LL   F  L++V  G+VI 
Subjt:  RVDPLNHFKMYRGGYNITNKHYWSSTIFTGAAGYGIGVAWLVCGIAYGGFFAATLCRGKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIV

Query:  GSSRFDREAKDVVKIIIETANGASNTIQNTTSAMKDMIHNLEASKGIGTHEEAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISL
         + R     +++ + I +     +  I+    ++  + + L       TH       L  T+H+L   +  IQ   +   R I   + I Y+  ++  S 
Subjt:  GSSRFDREAKDVVKIIIETANGASNTIQNTTSAMKDMIHNLEASKGIGTHEEAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISL

Query:  NL----------------GAVIGLSGEFVWFLKNIKKTTKSVHFSI-----DSCRALEMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVN
        NL                G ++ +     W +  +        F I     D C A   F +NP N++L+++ PC   L +   L ++S  I++ + Q+N
Subjt:  NL----------------GAVIGLSGEFVWFLKNIKKTTKSVHFSI-----DSCRALEMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVN

Query:  TQISVSY---------------PDTGL------------------------------VLKLLTCSD-ESHGGCE-NRQFMSNSEYKTVEAYTNSVQDFFN
        ++++ S                P++G+                              +L   TC D +    C    +F+  + Y  V AY+NS Q   +
Subjt:  TQISVSY---------------PDTGL------------------------------VLKLLTCSD-ESHGGCE-NRQFMSNSEYKTVEAYTNSVQDFFN

Query:  VYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVLIWTISANVE-------HKLHPSADASV
        + P  ++L  C  VKD  S I+ + C P  +    +WA ++ +S++M+ LVL++   A  E         +HP++ A +
Subjt:  VYPGMESLVNCQTVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVLIWTISANVE-------HKLHPSADASV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTAACCCATCTGGGAACCATCAAGAAGCTGGCCAACCCTCGTCTTCCTTCGATGGAGGGAACCCCAGCAATGGTAATTCCACCCCAGTGCCTGCAGCGGATAGTTC
CAGCTCCGCTCTTGCGATGAAGCACAATCCTGGTATCTCCACGGATTGGACATCTGATGAGCAGCTCACACTGGAAGAAGGCCTTAAGAAATATGCCGGAGAGTCTAGTG
TTATTCGTTATGCAAAGATTGCTATGCAGCTACCAAATAAGACTGTACGAGATGTTGCCTTGCGTTGCAGATGGATGAATGAAAGAGTATCTGACCCTTCAATGAAGTCA
GCACAGGTTGCAACTAGGCCTAATGTGTCTCCTTATGGAATGCCTATGATTCCCATGGACAATGATGACGGCGTCTCATATAAAGGTTTTGTTTCTACGCATGTCTTCCT
TCACACCACTATGTGTTTTGGGCTGTTGCAGCTCCCACTCTTCATTCTCAACATGATGGATGAGGGAGTAAATTTCATATCTGTTATTACTTGTAGGTCACAAGCAATGA
AAGTTGCTTCTATTGGTGGTACAACTGGAGAGCTTCTTGAACAGAATGCACATGCCATGAATCAAATATCTTCTAATCTTGCATCTTTTCAGATACAAGATAATATCAGT
CTCTTCTGCCAAACGCGAGATAACATCCTCAAAATAATGAGCGACTTGAACGAAATGCCCGAAGTAATGAAGCAGATGCCGCCTCTTCCGGTGAAGATGATGGGATCAAG
AAATGGAGTTAAAGGGTTCATAATTCCTCTGATTTGGATGTTCATCAGTTCATCCTGGATTTTCCCGGAAACTGTGGGACAGAAAATTTCATCCTCCAGCTCTCTAATAC
AATATGGGAGAGATTTTGTGCAGAGAAATGATGGGTTGGAAGCACTTGAGGAAGAAGATAATACAGTTCGAGTTGATCCTTTGAACCATTTCAAGATGTATAGAGGTGGA
TATAACATCACCAACAAACACTATTGGAGTTCAACAATATTCACAGGAGCTGCAGGGTATGGGATTGGAGTGGCCTGGCTTGTGTGTGGAATAGCATATGGAGGATTTTT
TGCAGCAACTTTGTGTCGTGGCAAGAGAAGGAGCAAGGGAAAGTTGAAGAAAATGTCACATTGTGGGGACAAATTTTACCTATGGACCATTCTTTTGGCTACTTTCTTCA
CAATTTTGGCTCTGGTGGGATGTGGATTGGTGATTGTAGGCAGCAGTAGATTTGATAGAGAGGCAAAGGATGTGGTGAAGATCATAATAGAGACAGCCAATGGGGCATCA
AACACTATACAAAACACAACTTCAGCTATGAAAGATATGATTCATAACTTAGAAGCTTCTAAAGGAATTGGAACTCATGAGGAGGCAGCTGCTACAACTTTGACTTCTAC
CTCTCACAAACTTGATGCTCAAGCTGCAAATATTCAGTGCCAAGCCAACAAGAACAGGCGCTTGATCCACAAGGGCCTCAACATAATGTACATAGTGACTATGGTTACAA
TTTCCTTGAACCTGGGAGCTGTAATTGGTTTGTCTGGTGAGTTTGTGTGGTTTCTTAAAAACATCAAAAAGACAACTAAATCTGTTCATTTCTCAATAGACAGTTGCAGA
GCTCTTGAAATGTTTCAAGAAAATCCCAACAACAACAGCCTCAGTTCGATTCTCCCATGTGAGCAACTGCTGACAGCCAAATCAGCTCTAACTGATGTAAGCTCAGAGAT
TTATGATCTTGTTAATCAGGTCAACACACAAATTTCAGTATCATATCCAGACACTGGTTTGGTGCTAAAGCTACTCACATGTTCTGATGAAAGCCATGGAGGGTGTGAGA
ATAGGCAGTTCATGTCGAACTCAGAGTACAAAACGGTGGAGGCTTACACAAACTCAGTACAAGATTTCTTCAATGTGTATCCAGGCATGGAAAGCCTTGTGAACTGCCAA
ACAGTGAAGGATGCGTTTTCAAAGATCTTAGAACACCATTGCAGGCCTTTGGAGAGCTATGCCAACATGGTTTGGGCAGGCCTTGTGTTTGTCTCAGTTGTTATGCTGTG
TTTGGTTTTGATTTGGACAATTTCAGCAAATGTTGAGCATAAACTTCACCCTTCAGCAGATGCCTCAGTGCTGCCCAATTCTTCAACTACTCCAAAGATGATGGAGACGT
CTACTCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTAACCCATCTGGGAACCATCAAGAAGCTGGCCAACCCTCGTCTTCCTTCGATGGAGGGAACCCCAGCAATGGTAATTCCACCCCAGTGCCTGCAGCGGATAGTTC
CAGCTCCGCTCTTGCGATGAAGCACAATCCTGGTATCTCCACGGATTGGACATCTGATGAGCAGCTCACACTGGAAGAAGGCCTTAAGAAATATGCCGGAGAGTCTAGTG
TTATTCGTTATGCAAAGATTGCTATGCAGCTACCAAATAAGACTGTACGAGATGTTGCCTTGCGTTGCAGATGGATGAATGAAAGAGTATCTGACCCTTCAATGAAGTCA
GCACAGGTTGCAACTAGGCCTAATGTGTCTCCTTATGGAATGCCTATGATTCCCATGGACAATGATGACGGCGTCTCATATAAAGGTTTTGTTTCTACGCATGTCTTCCT
TCACACCACTATGTGTTTTGGGCTGTTGCAGCTCCCACTCTTCATTCTCAACATGATGGATGAGGGAGTAAATTTCATATCTGTTATTACTTGTAGGTCACAAGCAATGA
AAGTTGCTTCTATTGGTGGTACAACTGGAGAGCTTCTTGAACAGAATGCACATGCCATGAATCAAATATCTTCTAATCTTGCATCTTTTCAGATACAAGATAATATCAGT
CTCTTCTGCCAAACGCGAGATAACATCCTCAAAATAATGAGCGACTTGAACGAAATGCCCGAAGTAATGAAGCAGATGCCGCCTCTTCCGGTGAAGATGATGGGATCAAG
AAATGGAGTTAAAGGGTTCATAATTCCTCTGATTTGGATGTTCATCAGTTCATCCTGGATTTTCCCGGAAACTGTGGGACAGAAAATTTCATCCTCCAGCTCTCTAATAC
AATATGGGAGAGATTTTGTGCAGAGAAATGATGGGTTGGAAGCACTTGAGGAAGAAGATAATACAGTTCGAGTTGATCCTTTGAACCATTTCAAGATGTATAGAGGTGGA
TATAACATCACCAACAAACACTATTGGAGTTCAACAATATTCACAGGAGCTGCAGGGTATGGGATTGGAGTGGCCTGGCTTGTGTGTGGAATAGCATATGGAGGATTTTT
TGCAGCAACTTTGTGTCGTGGCAAGAGAAGGAGCAAGGGAAAGTTGAAGAAAATGTCACATTGTGGGGACAAATTTTACCTATGGACCATTCTTTTGGCTACTTTCTTCA
CAATTTTGGCTCTGGTGGGATGTGGATTGGTGATTGTAGGCAGCAGTAGATTTGATAGAGAGGCAAAGGATGTGGTGAAGATCATAATAGAGACAGCCAATGGGGCATCA
AACACTATACAAAACACAACTTCAGCTATGAAAGATATGATTCATAACTTAGAAGCTTCTAAAGGAATTGGAACTCATGAGGAGGCAGCTGCTACAACTTTGACTTCTAC
CTCTCACAAACTTGATGCTCAAGCTGCAAATATTCAGTGCCAAGCCAACAAGAACAGGCGCTTGATCCACAAGGGCCTCAACATAATGTACATAGTGACTATGGTTACAA
TTTCCTTGAACCTGGGAGCTGTAATTGGTTTGTCTGGTGAGTTTGTGTGGTTTCTTAAAAACATCAAAAAGACAACTAAATCTGTTCATTTCTCAATAGACAGTTGCAGA
GCTCTTGAAATGTTTCAAGAAAATCCCAACAACAACAGCCTCAGTTCGATTCTCCCATGTGAGCAACTGCTGACAGCCAAATCAGCTCTAACTGATGTAAGCTCAGAGAT
TTATGATCTTGTTAATCAGGTCAACACACAAATTTCAGTATCATATCCAGACACTGGTTTGGTGCTAAAGCTACTCACATGTTCTGATGAAAGCCATGGAGGGTGTGAGA
ATAGGCAGTTCATGTCGAACTCAGAGTACAAAACGGTGGAGGCTTACACAAACTCAGTACAAGATTTCTTCAATGTGTATCCAGGCATGGAAAGCCTTGTGAACTGCCAA
ACAGTGAAGGATGCGTTTTCAAAGATCTTAGAACACCATTGCAGGCCTTTGGAGAGCTATGCCAACATGGTTTGGGCAGGCCTTGTGTTTGTCTCAGTTGTTATGCTGTG
TTTGGTTTTGATTTGGACAATTTCAGCAAATGTTGAGCATAAACTTCACCCTTCAGCAGATGCCTCAGTGCTGCCCAATTCTTCAACTACTCCAAAGATGATGGAGACGT
CTACTCTTTGAGATCTTAGCTCGTGTTCAAAATTAGGATAGAATCAATCGGTGATCCGAAGAATCTCGCGTAAGCGAGTTCTCGGTATGTCTAGGTTAGACCCGACACGT
ATTTTTCTTTGCACGTAGGCTAAATTATACAAAATATTGCTTGTGTATATAAATCATTTCAGTTTTGTCTATGATTTGACTGTGATTGATGTTCCTATTATGTTCATATA
AACTAATATGATCCGTCCTGATTAGGGGCA
Protein sequenceShow/hide protein sequence
MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPAADSSSSALAMKHNPGISTDWTSDEQLTLEEGLKKYAGESSVIRYAKIAMQLPNKTVRDVALRCRWMNERVSDPSMKS
AQVATRPNVSPYGMPMIPMDNDDGVSYKGFVSTHVFLHTTMCFGLLQLPLFILNMMDEGVNFISVITCRSQAMKVASIGGTTGELLEQNAHAMNQISSNLASFQIQDNIS
LFCQTRDNILKIMSDLNEMPEVMKQMPPLPVKMMGSRNGVKGFIIPLIWMFISSSWIFPETVGQKISSSSSLIQYGRDFVQRNDGLEALEEEDNTVRVDPLNHFKMYRGG
YNITNKHYWSSTIFTGAAGYGIGVAWLVCGIAYGGFFAATLCRGKRRSKGKLKKMSHCGDKFYLWTILLATFFTILALVGCGLVIVGSSRFDREAKDVVKIIIETANGAS
NTIQNTTSAMKDMIHNLEASKGIGTHEEAAATTLTSTSHKLDAQAANIQCQANKNRRLIHKGLNIMYIVTMVTISLNLGAVIGLSGEFVWFLKNIKKTTKSVHFSIDSCR
ALEMFQENPNNNSLSSILPCEQLLTAKSALTDVSSEIYDLVNQVNTQISVSYPDTGLVLKLLTCSDESHGGCENRQFMSNSEYKTVEAYTNSVQDFFNVYPGMESLVNCQ
TVKDAFSKILEHHCRPLESYANMVWAGLVFVSVVMLCLVLIWTISANVEHKLHPSADASVLPNSSTTPKMMETSTL