CuGenDBv2

Sgr019221 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr019221
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: U-box domain-containing protein 7
Genome location: tig00153293:802185..804067
RNA-Seq Expression: Sgr019221
Synteny: Sgr019221
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR000225 - Armadillo; IPR011989 - Armadillo-like helical; IPR016024 - Armadillo-type fold


Homology
GenBank top hits | e value | % identity | Alignment
KAA0052624.1 U-box domain-containing protein 7 [Cucumis melo var. makuwa] | e value: 9.4e-155 | % identity: 77.17
Query:  MNPMSP----ASSSSSPQTPIWYFT------------------------------ETTVRSNEILQVADADWDGSGGNGGGVDESAAELQRTVKKLHFGS
        MNP+SP    +SSSSS QTPIW ++                                T  SNE++QVADA WD +    GG   SAA L+ TVKKLHFGS
Subjt:  MNPMSP----ASSSSSPQTPIWYFT------------------------------ETTVRSNEILQVADADWDGSGGNGGGVDESAAELQRTVKKLHFGS

Query:  WEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAASDAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESAKHDFARLLLSLS
        WEEKEIAA++IEKMSKED++ KKLMV+L+V+PALVSM ASDAVGRPEVAVKALLELAKGS  +NKAL+VEAGILHKLPSNIQ MDESAKHDFARLLLSLS
Subjt:  WEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAASDAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESAKHDFARLLLSLS

Query:  SLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSSSKGPSDRALAALGNLVVTSQGKKAMESSPMVP
        SLINSHFTIALQ+NE  IPFLV+ILDSTSN ETQKCCLETLYNISTVLENVGPLVSNGVVH LLK+SSSKG SDRALAALGNLVVTSQGK+ MESS MVP
Subjt:  SLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSSSKGPSDRALAALGNLVVTSQGKKAMESSPMVP

Query:  DSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSGPQTGRIVIGSPVNQRMFRK
        DSLI+IMTWED+PKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKM+PHSGPQTGRIVIGSPVNQR  ++
Subjt:  DSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSGPQTGRIVIGSPVNQRMFRK

Query:  GEK
        G K
Subjt:  GEK

XP_008439745.1 PREDICTED: U-box domain-containing protein 7 [Cucumis melo] | e value: 1.6e-154 | % identity: 76.79
Query:  MNPMSP------ASSSSSPQTPIWYFT------------------------------ETTVRSNEILQVADADWDGSGGNGGGVDESAAELQRTVKKLHF
        MNP+SP      +SSSSS QTPIW ++                                T  SNE++QVADA WD +    GG   SAA L+ TVKKLHF
Subjt:  MNPMSP------ASSSSSPQTPIWYFT------------------------------ETTVRSNEILQVADADWDGSGGNGGGVDESAAELQRTVKKLHF

Query:  GSWEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAASDAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESAKHDFARLLLS
        GSWEEKEIAA++IEKMSKED++ KKLMV+L+V+PALVSM ASDAVGRPEVAVKALLELAKGS  +NKAL+VEAGILHKLPSNIQ MDESAKHDFARLLLS
Subjt:  GSWEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAASDAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESAKHDFARLLLS

Query:  LSSLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSSSKGPSDRALAALGNLVVTSQGKKAMESSPM
        LSSLINSHFTIALQ+NE  IPFLV+ILDSTSN ETQKCCLETLYNISTVLENVGPLVSNGVVH LLK+SSSKG SDRALAALGNLVVTSQGK+ MESS M
Subjt:  LSSLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSSSKGPSDRALAALGNLVVTSQGKKAMESSPM

Query:  VPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSGPQTGRIVIGSPVNQRMF
        VPDSLI+IMTWED+PKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKM+PHSGPQTGRIVIGSPVNQR  
Subjt:  VPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSGPQTGRIVIGSPVNQRMF

Query:  RKGEK
        ++G K
Subjt:  RKGEK

XP_022142084.1 U-box domain-containing protein 7 [Momordica charantia] | e value: 2.9e-156 | % identity: 77.28
Query:  MNPMSPASSSSSPQTPIWYFT-----------------------------------ETTVRSNEILQVADADWDGSGGNGGGVDESAAELQRTVKKLHFG
        MNPMS  SSSSSPQTPIW ++                                   + TV SNE++Q AD +WDG           AA LQRTVKKLHFG
Subjt:  MNPMSPASSSSSPQTPIWYFT-----------------------------------ETTVRSNEILQVADADWDGSGGNGGGVDESAAELQRTVKKLHFG

Query:  SWEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAASDAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESAKHDFARLLLSL
        SWEEKEIAAKVIEKMSK+D+K KKLMVEL+V+PALVSM ASDAVGRPE+AVK LLELAKGS  +NKAL+VEAGILHKLP NIQ MDESAKHDFARLLLSL
Subjt:  SWEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAASDAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESAKHDFARLLLSL

Query:  SSLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKL-SSSKGPSDRALAALGNLVVTSQGKKAMESSPM
        SSLINSHFTIALQ+NENAIPF+V+ILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKL SSSKG SDRALA+LGNLVVTSQGKKAME SPM
Subjt:  SSLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKL-SSSKGPSDRALAALGNLVVTSQGKKAMESSPM

Query:  VPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSGPQTGRIVIGSPVNQRMF
        VP SLIEIMTWEDKPKS+ELSAYILMMLAHQSSEQREKMA+SGIVAVLLEVALLGSPLAQKRA KLLQWFKNEKQAKM+PHSGPQTGR+VIGSPVNQR  
Subjt:  VPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSGPQTGRIVIGSPVNQRMF

Query:  RKGEK
        ++G K
Subjt:  RKGEK

XP_023517336.1 uncharacterized protein LOC111781124 [Cucurbita pepo subsp. pepo] | e value: 3.2e-155 | % identity: 77.36
Query:  MNPMSP-ASSSSSPQTPIWYFT----------------------------------ETTVRSNEILQVADADWDGSGGNGGGVDESAAELQRTVKKLHFG
        MNPMS  +SSSSSPQTPIW ++                                  + T  SNE+LQVA ADW+ SGG  GG DESAA L+RTVKKLHFG
Subjt:  MNPMSP-ASSSSSPQTPIWYFT----------------------------------ETTVRSNEILQVADADWDGSGGNGGGVDESAAELQRTVKKLHFG

Query:  SWEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAASDAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESAKHDFARLLLSL
        +WEEKEIAA VI KMSKED+K KKL+VEL V+PALVSM ASDA+GRPEVAVKALLELAKGS  +NKAL+VEAGILHKLPS+I+ MDESAKHDFARLLLSL
Subjt:  SWEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAASDAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESAKHDFARLLLSL

Query:  SSLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSSSKGPSDRALAALGNLVVTSQGKKAMESSPMV
        SSLINSHFT ALQ+NENAIPF+VEILDSTSN ETQKCC+ETL+NISTVLENVGPLVSNGVVHTLLK+SSSK  S RALAALGNLVVTSQGKKAMESSPMV
Subjt:  SSLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSSSKGPSDRALAALGNLVVTSQGKKAMESSPMV

Query:  PDSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSGPQTGRIVIGSPVNQRMFR
        PD+LIEIMTWEDKPKSIE SAYILM+LAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRA KLLQWFKNEKQ K++PHSGPQTGRIVIGSPVNQR  +
Subjt:  PDSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSGPQTGRIVIGSPVNQRMFR

Query:  KG
        +G
Subjt:  KG

XP_038881164.1 U-box domain-containing protein 7 [Benincasa hispida] | e value: 5.0e-156 | % identity: 78.87
Query:  MNPMSPASSSSS---------PQTPIWYFTETTVR----------------------------SNEILQVADADWDGSGGNGGGVDES-AAELQRTVKKL
        MNP+SP  SSSS         PQTPIW ++   +R                            S E+LQVAD     +GG G G D S AA L+ TVKKL
Subjt:  MNPMSPASSSSS---------PQTPIWYFTETTVR----------------------------SNEILQVADADWDGSGGNGGGVDES-AAELQRTVKKL

Query:  HFGSWEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAASDAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESAKHDFARLL
        HFGSWEEKEIAAK+IEKMSKED K KKLMVEL+VIPALVSM ASDAVGRPEVAVK LLELAKGS  +NKAL+VEAGILHKLPSNIQ MDESAKHDFARLL
Subjt:  HFGSWEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAASDAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESAKHDFARLL

Query:  LSLSSLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSSSKGPSDRALAALGNLVVTSQGKKAMESS
        LSLSSLINSHFT ALQ+NE+ IPFLVEILDSTSN ETQKCCLETLYNISTVLENVGPLVSNGVVHTLLK+SSSKG SDRALAALGNLVVTSQGKKAMESS
Subjt:  LSLSSLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSSSKGPSDRALAALGNLVVTSQGKKAMESS

Query:  PMVPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSGPQTGRIVIGSPVNQR
        PMVPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKM+PHSGPQTGRIVIGSPVNQR
Subjt:  PMVPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSGPQTGRIVIGSPVNQR

Query:  MFRKGEK
          R+G K
Subjt:  MFRKGEK

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KHV3 Uncharacterized protein | e value: 3.0e-154 | % identity: 76.67
Query:  MNPMSP----ASSSSSPQTPIWYFT------------------------------ETTVRSNEILQVADADWDGSGGNGGGVDESAAELQRTVKKLHFGS
        MNP+SP    +SSSSSPQTPIW ++                                T   NE++QVA A WD +    GG D SAA L+ TVKKLHFGS
Subjt:  MNPMSP----ASSSSSPQTPIWYFT------------------------------ETTVRSNEILQVADADWDGSGGNGGGVDESAAELQRTVKKLHFGS

Query:  WEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAASDAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESAKHDFARLLLSLS
        WEEKE+AAK+IEKMSKED++ K LMV+L+V+PALV M ASDAVGRPEVAVKALLELAKGS  +NKAL+VEAGILHKLPSNIQ MDESAKHDFARLLLSLS
Subjt:  WEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAASDAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESAKHDFARLLLSLS

Query:  SLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSSSKGPSDRALAALGNLVVTSQGKKAMESSPMVP
        SLINSHFTIALQ+NE  IPFLV+ILDSTSN ETQKCCLETLYNISTVLENVGPLVSNGVVH LLK+SSSKG SDRALAALGNLVVTSQGK+ MESS MVP
Subjt:  SLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSSSKGPSDRALAALGNLVVTSQGKKAMESSPMVP

Query:  DSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSGPQTGRIVIGSPVNQRMFRK
        DSLI+IMTWEDKPKS ELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKM+PHSGPQTGRIVIGSPVNQR  ++
Subjt:  DSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSGPQTGRIVIGSPVNQRMFRK

Query:  GEK
        G K
Subjt:  GEK

A0A1S3B073 U-box domain-containing protein 7 | e value: 7.8e-155 | % identity: 76.79
Query:  MNPMSP------ASSSSSPQTPIWYFT------------------------------ETTVRSNEILQVADADWDGSGGNGGGVDESAAELQRTVKKLHF
        MNP+SP      +SSSSS QTPIW ++                                T  SNE++QVADA WD +    GG   SAA L+ TVKKLHF
Subjt:  MNPMSP------ASSSSSPQTPIWYFT------------------------------ETTVRSNEILQVADADWDGSGGNGGGVDESAAELQRTVKKLHF

Query:  GSWEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAASDAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESAKHDFARLLLS
        GSWEEKEIAA++IEKMSKED++ KKLMV+L+V+PALVSM ASDAVGRPEVAVKALLELAKGS  +NKAL+VEAGILHKLPSNIQ MDESAKHDFARLLLS
Subjt:  GSWEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAASDAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESAKHDFARLLLS

Query:  LSSLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSSSKGPSDRALAALGNLVVTSQGKKAMESSPM
        LSSLINSHFTIALQ+NE  IPFLV+ILDSTSN ETQKCCLETLYNISTVLENVGPLVSNGVVH LLK+SSSKG SDRALAALGNLVVTSQGK+ MESS M
Subjt:  LSSLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSSSKGPSDRALAALGNLVVTSQGKKAMESSPM

Query:  VPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSGPQTGRIVIGSPVNQRMF
        VPDSLI+IMTWED+PKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKM+PHSGPQTGRIVIGSPVNQR  
Subjt:  VPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSGPQTGRIVIGSPVNQRMF

Query:  RKGEK
        ++G K
Subjt:  RKGEK

A0A5A7UB79 U-box domain-containing protein 7 | e value: 4.6e-155 | % identity: 77.17
Query:  MNPMSP----ASSSSSPQTPIWYFT------------------------------ETTVRSNEILQVADADWDGSGGNGGGVDESAAELQRTVKKLHFGS
        MNP+SP    +SSSSS QTPIW ++                                T  SNE++QVADA WD +    GG   SAA L+ TVKKLHFGS
Subjt:  MNPMSP----ASSSSSPQTPIWYFT------------------------------ETTVRSNEILQVADADWDGSGGNGGGVDESAAELQRTVKKLHFGS

Query:  WEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAASDAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESAKHDFARLLLSLS
        WEEKEIAA++IEKMSKED++ KKLMV+L+V+PALVSM ASDAVGRPEVAVKALLELAKGS  +NKAL+VEAGILHKLPSNIQ MDESAKHDFARLLLSLS
Subjt:  WEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAASDAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESAKHDFARLLLSLS

Query:  SLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSSSKGPSDRALAALGNLVVTSQGKKAMESSPMVP
        SLINSHFTIALQ+NE  IPFLV+ILDSTSN ETQKCCLETLYNISTVLENVGPLVSNGVVH LLK+SSSKG SDRALAALGNLVVTSQGK+ MESS MVP
Subjt:  SLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSSSKGPSDRALAALGNLVVTSQGKKAMESSPMVP

Query:  DSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSGPQTGRIVIGSPVNQRMFRK
        DSLI+IMTWED+PKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKM+PHSGPQTGRIVIGSPVNQR  ++
Subjt:  DSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSGPQTGRIVIGSPVNQRMFRK

Query:  GEK
        G K
Subjt:  GEK

A0A6J1CKK5 U-box domain-containing protein 7 | e value: 1.4e-156 | % identity: 77.28
Query:  MNPMSPASSSSSPQTPIWYFT-----------------------------------ETTVRSNEILQVADADWDGSGGNGGGVDESAAELQRTVKKLHFG
        MNPMS  SSSSSPQTPIW ++                                   + TV SNE++Q AD +WDG           AA LQRTVKKLHFG
Subjt:  MNPMSPASSSSSPQTPIWYFT-----------------------------------ETTVRSNEILQVADADWDGSGGNGGGVDESAAELQRTVKKLHFG

Query:  SWEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAASDAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESAKHDFARLLLSL
        SWEEKEIAAKVIEKMSK+D+K KKLMVEL+V+PALVSM ASDAVGRPE+AVK LLELAKGS  +NKAL+VEAGILHKLP NIQ MDESAKHDFARLLLSL
Subjt:  SWEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAASDAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESAKHDFARLLLSL

Query:  SSLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKL-SSSKGPSDRALAALGNLVVTSQGKKAMESSPM
        SSLINSHFTIALQ+NENAIPF+V+ILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKL SSSKG SDRALA+LGNLVVTSQGKKAME SPM
Subjt:  SSLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKL-SSSKGPSDRALAALGNLVVTSQGKKAMESSPM

Query:  VPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSGPQTGRIVIGSPVNQRMF
        VP SLIEIMTWEDKPKS+ELSAYILMMLAHQSSEQREKMA+SGIVAVLLEVALLGSPLAQKRA KLLQWFKNEKQAKM+PHSGPQTGR+VIGSPVNQR  
Subjt:  VPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSGPQTGRIVIGSPVNQRMF

Query:  RKGEK
        ++G K
Subjt:  RKGEK

A0A6J1KNS3 uncharacterized protein LOC111496944 | e value: 7.3e-153 | % identity: 75.81
Query:  MNPMSPASSSSSPQTPIWYFT----------------------------------ETTVRSNEILQVADADWDGSGGNGGGVDESAAELQRTVKKLHFGS
        M+ +S +SSSSSPQTPIW ++                                  + T  SNE+++VADADW+ SGG  GG DESAA L+RTVKKLHFG 
Subjt:  MNPMSPASSSSSPQTPIWYFT----------------------------------ETTVRSNEILQVADADWDGSGGNGGGVDESAAELQRTVKKLHFGS

Query:  WEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAASDAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESAKHDFARLLLSLS
        WEEKEIAA  I KMSKED+K KKL+VEL V+PALVSM ASDA+GRPEVAVKALLELAKGS  +NKAL+VEAGILH LPS+I+ MDESAKHDFARLLLSLS
Subjt:  WEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAASDAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESAKHDFARLLLSLS

Query:  SLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSSSKGPSDRALAALGNLVVTSQGKKAMESSPMVP
        SLINSHFT ALQ+NENAIPF+VEILDSTSN ETQKCC+ETL+NISTVLENVGPLVSNGVVHTLLK+SSSK  S RALAALGNLVVTSQGKKAMESSPMVP
Subjt:  SLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSSSKGPSDRALAALGNLVVTSQGKKAMESSPMVP

Query:  DSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSGPQTGRIVIGSPVNQRMFRK
        D+LIEIMTWEDKPKSIE SAYILM+LAHQS+EQREKMA SGIVAVLLEVALLGSPLAQKRA KLLQWFKNEKQ KM+PHSGPQTGRIVIGSPVNQR  ++
Subjt:  DSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSGPQTGRIVIGSPVNQRMFRK

Query:  G
        G
Subjt:  G

SwissProt top hits | e value | % identity | Alignment
O22193 U-box domain-containing protein 4 | e value: 2.7e-11 | % identity: 25.68
Query:  VDESAAELQRTVKKLHFGSWEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAASDAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNI
        + E   ++++ V++L   S + +  A   +  ++K ++  + ++     I  LV +  S      E AV ALL L+      NK  I +AG +  L   +
Subjt:  VDESAAELQRTVKKLHFGSWEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAASDAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNI

Query:  QTMDESAKHDFARLLLSLSSLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKL-SSSKGPSDRALAAL
        +     AK + A  L SLS +  +   I       AI  LV++L        +K     L+N+S   EN   +V +G V  L+ L   + G  D+A+A L
Subjt:  QTMDESAKHDFARLLLSLSSLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKL-SSSKGPSDRALAAL

Query:  GNLVVTSQGKKAMESSPMVPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEK
         NL    +G+ A+     +P  L+E++      +  E +A  L+ L+  S      + + G V  L+ ++  G+P A+++A  LL +F+N++
Subjt:  GNLVVTSQGKKAMESSPMVPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEK

O48700 U-box domain-containing protein 6 | e value: 7.3e-09 | % identity: 23.29
Query:  EVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESAKHDFARLLLSLSSLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNIST
        E    AL  LA  + ++NK L++ +G++  L   I      ++     L L+LS L  +   I    +  A+ F V +L   + ++ +   L  LYN+ST
Subjt:  EVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESAKHDFARLLLSLSSLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNIST

Query:  VLENVGPLVSNGVVHTLLKLSSSKGP--SDRALAALGNLVVTSQGKKAMESSPMVPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVA
           N+  L+S+ ++ +L  L+S+      +++LA L NL  + +GK+ M ++  +  +L  ++   D  +  E +   L++L   S    + + + G++ 
Subjt:  VLENVGPLVSNGVVHTLLKLSSSKGP--SDRALAALGNLVVTSQGKKAMESSPMVPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVA

Query:  VLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSGPQTGRIVIGSPV
         L+ +++ GSP  + ++ KLL  F+ ++       +  +  R  + +P+
Subjt:  VLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSGPQTGRIVIGSPV

Q681N2 U-box domain-containing protein 15 | e value: 4.9e-13 | % identity: 27.3
Query:  DESAAELQRTVKKLHFGSWEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAASDAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQ
        +E   E+   V+ L     EE+  + K +  +++E+ + + L+     IP LV + +    G  E AV  LL L+   +  NK LI   G +  +   ++
Subjt:  DESAAELQRTVKKLHFGSWEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAASDAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQ

Query:  TMDESAKHDFARLLLSLSSLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSSSK--GPSDRALAAL
          +  A+ + A  L SLS L  +  TI L    N IP LV++L        +K  L  L+N+S    N G  +  G+V  LL L   K  G  D AL+ L
Subjt:  TMDESAKHDFARLLLSLSSLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSSSK--GPSDRALAAL

Query:  GNLVVTSQGKKAMESSPMVPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQ
          L    +G++A+     + ++L+E +  +  PK+ E +  +L+ L   +S       + G+   L+E+   G+  AQ++A  L+Q     +Q
Subjt:  GNLVVTSQGKKAMESSPMVPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQ

Q9C7G1 U-box domain-containing protein 45 | e value: 7.8e-11 | % identity: 25.81
Query:  IEKMSKEDLKGKKLMVELQVIPALVSMAAS----DAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESAKHDFARLLLSLSSLINSH
        I  + K+D + + LM E   + AL+    S    +     +V   AL  LA  + ++NK L++ +GI+  L   +   +  +      + L+LS L  + 
Subjt:  IEKMSKEDLKGKKLMVELQVIPALVSMAAS----DAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESAKHDFARLLLSLSSLINSH

Query:  FTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTL--LKLSSSKGPSDRALAALGNLVVTSQGKKAMESSPMVPDSLI
          I    +  A+PF+V +L + +  + +   L +L+++ST   N+  L+S  +V+ L  L +S  +  ++++LA L NLV+   GK  M S+P +  +L 
Subjt:  FTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTL--LKLSSSKGPSDRALAALGNLVVTSQGKKAMESSPMVPDSLI

Query:  EIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSGPQ
         I+    +P   E +  +L++L + S    E + + G++  L+ +++ G+   ++RA KLL  F+  +Q      + PQ
Subjt:  EIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSGPQ

Q9CAG5 U-box domain-containing protein 7 | e value: 4.9e-13 | % identity: 25.61
Query:  EEKEIAAKVIEK---MSKEDLKGKKLMVELQVIPALVSMAAS----DAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESAKHDFAR
        E  E   KV+EK   + K+D + +  M     + AL+    S    +     +    AL  LA  + ++NK L++ +G++  L   I + +         
Subjt:  EEKEIAAKVIEK---MSKEDLKGKKLMVELQVIPALVSMAAS----DAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESAKHDFAR

Query:  LLLSLSSLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSSSKGPS---DRALAALGNLVVTSQGKK
        L L+LS L  +   I    +  A+PFLV++L     ++ +   L  LYN+ST   N+  L+S+ ++ +L  L +S G +   +++LA L NL  + +GK 
Subjt:  LLLSLSSLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSSSKGPS---DRALAALGNLVVTSQGKK

Query:  AMESSPMVPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHS
           SS  +  SL  ++   D  +  E +   L++L +      + + + G++  L+ +++ G+P  ++++ KLL  F+ E+Q + +P S
Subjt:  AMESSPMVPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHS

Arabidopsis top hits | e value | % identity | Alignment
AT1G67530.1 ARM repeat superfamily protein | e value: 3.5e-14 | % identity: 25.61
Query:  EEKEIAAKVIEK---MSKEDLKGKKLMVELQVIPALVSMAAS----DAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESAKHDFAR
        E  E   KV+EK   + K+D + +  M     + AL+    S    +     +    AL  LA  + ++NK L++ +G++  L   I + +         
Subjt:  EEKEIAAKVIEK---MSKEDLKGKKLMVELQVIPALVSMAAS----DAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESAKHDFAR

Query:  LLLSLSSLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSSSKGPS---DRALAALGNLVVTSQGKK
        L L+LS L  +   I    +  A+PFLV++L     ++ +   L  LYN+ST   N+  L+S+ ++ +L  L +S G +   +++LA L NL  + +GK 
Subjt:  LLLSLSSLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSSSKGPS---DRALAALGNLVVTSQGKK

Query:  AMESSPMVPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHS
           SS  +  SL  ++   D  +  E +   L++L +      + + + G++  L+ +++ G+P  ++++ KLL  F+ E+Q + +P S
Subjt:  AMESSPMVPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHS

AT2G25130.1 ARM repeat superfamily protein | e value: 1.0e-26 | % identity: 32.21
Query:  DESAAELQRTVKKLHFGSWEEKEIAAKVI------EKMSKEDLKGKKLMVELQVIPALVSMAASDAVGRPEV--AVKALLELAKGSLSKNKALIVEAGIL
        +E+   L+R VK L   +  E E A K I        ++K+D++ +  +  L  IP LVSM   ++     +  ++ ALL L  G+   NKA IV+AG++
Subjt:  DESAAELQRTVKKLHFGSWEEKEIAAKVI------EKMSKEDLKGKKLMVELQVIPALVSMAASDAVGRPEV--AVKALLELAKGSLSKNKALIVEAGIL

Query:  HKLPSNIQTM---DESAKHDFARLLLSLSSLINSHFTIALQSNENAIPFLVEIL---DSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSS
        HK+   +++    +++         L LS+L ++   I    +  AI FLV+ L   + TS+S+ ++  L  LYN+S   +NV  ++   ++  LL    
Subjt:  HKLPSNIQTM---DESAKHDFARLLLSLSSLINSHFTIALQSNENAIPFLVEIL---DSTSNSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSS

Query:  SKGPSDRALAALGNLVVTSQGKKAMESSPMVPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQ
            S+R LA L N+V   +G+KA+         L++++ W D  K  E + YILM++AH+    R  M ++GI + LLE+ L+GSPLAQKRA ++L+
Subjt:  SKGPSDRALAALGNLVVTSQGKKAMESSPMVPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQ

AT2G27430.1 ARM repeat superfamily protein | e value: 9.6e-89 | % identity: 57.55
Query:  LQRTVKKLHFGSWEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAASDAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESA
        LQ+TVKK+HFGSWEEKE AA  IEK+++ED K +KLM EL VI  LVSM ASD  G  + AV AL++L+ G+ + NKAL+V A I  KLP N++ +D+S 
Subjt:  LQRTVKKLHFGSWEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAASDAVGRPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESA

Query:  KHDFARLLLSLSSLINSHFTIALQSNENAIPFLVEILDSTS-NSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSSSKGPSDRALAALGNLVVTS
        +H FA LLLSLSSL N+   +A   +   +PFL++ ++S S + +T++ CL T+ N+  VLEN GPLV NG V TLL L S+K  S++ALA+LG LVVT 
Subjt:  KHDFARLLLSLSSLINSHFTIALQSNENAIPFLVEILDSTS-NSETQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSSSKGPSDRALAALGNLVVTS

Query:  QGKKAMESSPMVPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSGPQTGRI
         GKKAME   +V   LIEI+TWED PK  E +AYILM+LAHQS  QREKMAK+GIV VLLEV+LLGSPL QKRA+KLLQWFK+E+  +M PHSGPQTG +
Subjt:  QGKKAMESSPMVPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSGPQTGRI

Query:  V--IGSPVNQRMFRKGEK
           +GSP++ R   +G K
Subjt:  V--IGSPVNQRMFRKGEK

AT4G31890.1 ARM repeat superfamily protein | e value: 1.1e-28 | % identity: 31.52
Query:  FTETTVRSN---EILQVADADWDGSGGNGGGVDESAAELQRTVKKLHF-----------GSWEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAA
        F ET+ +S    ++L +A+ + D         +E+   L+R V++L             G   +K  AA  +  ++KED + +  +  L  IP LVSM  
Subjt:  FTETTVRSN---EILQVADADWDGSGGNGGGVDESAAELQRTVKKLHF-----------GSWEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAA

Query:  SDAVGRPEVA-VKALLELAKGSLSKNKALIVEAGILHKLPSNIQ---TMDESAKHDFARLLLSLSSLINSHFTIALQSNENAIPFLVEI---LDSTSNSE
           +   ++A + ALL L  G+   NKA IV+AG +HK+   I+   T D+          L LS+L ++   I    +  AI FLV+    LD TS+S+
Subjt:  SDAVGRPEVA-VKALLELAKGSLSKNKALIVEAGILHKLPSNIQ---TMDESAKHDFARLLLSLSSLINSHFTIALQSNENAIPFLVEI---LDSTSNSE

Query:  TQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSSSKGPSDRALAALGNLVVTSQGKKAMESSPMVPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSE
         ++  L  LYN+S    NV  ++   ++  LL        S+R LA L NLV   +G+KA+         L++++ W D P   E + YILM++AH+   
Subjt:  TQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSSSKGPSDRALAALGNLVVTSQGKKAMESSPMVPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSE

Query:  QREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSG
         R+ M ++GI + LLE+ LLGS LAQKRA ++L+  + +K  ++   +G
Subjt:  QREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSG

AT4G31890.2 ARM repeat superfamily protein | e value: 1.1e-28 | % identity: 31.52
Query:  FTETTVRSN---EILQVADADWDGSGGNGGGVDESAAELQRTVKKLHF-----------GSWEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAA
        F ET+ +S    ++L +A+ + D         +E+   L+R V++L             G   +K  AA  +  ++KED + +  +  L  IP LVSM  
Subjt:  FTETTVRSN---EILQVADADWDGSGGNGGGVDESAAELQRTVKKLHF-----------GSWEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAA

Query:  SDAVGRPEVA-VKALLELAKGSLSKNKALIVEAGILHKLPSNIQ---TMDESAKHDFARLLLSLSSLINSHFTIALQSNENAIPFLVEI---LDSTSNSE
           +   ++A + ALL L  G+   NKA IV+AG +HK+   I+   T D+          L LS+L ++   I    +  AI FLV+    LD TS+S+
Subjt:  SDAVGRPEVA-VKALLELAKGSLSKNKALIVEAGILHKLPSNIQ---TMDESAKHDFARLLLSLSSLINSHFTIALQSNENAIPFLVEI---LDSTSNSE

Query:  TQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSSSKGPSDRALAALGNLVVTSQGKKAMESSPMVPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSE
         ++  L  LYN+S    NV  ++   ++  LL        S+R LA L NLV   +G+KA+         L++++ W D P   E + YILM++AH+   
Subjt:  TQKCCLETLYNISTVLENVGPLVSNGVVHTLLKLSSSKGPSDRALAALGNLVVTSQGKKAMESSPMVPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSE

Query:  QREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSG
         R+ M ++GI + LLE+ LLGS LAQKRA ++L+  + +K  ++   +G
Subjt:  QREKMAKSGIVAVLLEVALLGSPLAQKRALKLLQWFKNEKQAKMEPHSG


Sequences
CDS sequence
ATGAACCCCATGTCGCCGGCGTCTTCTTCTTCTTCTCCTCAAACACCCATTTGGTACTTCACCGAAACTACAGTAAGGAGTAATGAAATTCTACAAGTTGCAGATGCGGA
TTGGGACGGCAGCGGTGGAAATGGTGGTGGTGTGGATGAATCTGCGGCGGAGTTGCAGAGGACGGTGAAGAAGCTCCACTTTGGCAGCTGGGAAGAGAAGGAGATTGCAG
CCAAAGTGATTGAGAAAATGTCTAAAGAGGACTTGAAGGGGAAGAAGTTGATGGTGGAGCTCCAGGTTATACCGGCCTTGGTTTCAATGGCGGCATCGGACGCCGTGGGG
CGGCCGGAAGTGGCTGTAAAGGCGTTGCTTGAGCTTGCCAAAGGAAGCTTGAGTAAGAACAAGGCCCTCATAGTGGAGGCAGGAATCTTACACAAACTTCCAAGTAACAT
CCAAACCATGGATGAATCAGCAAAACATGATTTTGCAAGATTGTTGTTGTCACTCTCGTCTCTGATCAATTCCCATTTCACAATTGCTTTACAGTCCAATGAAAATGCCA
TTCCATTTCTTGTGGAGATTCTTGATTCAACCTCAAACTCCGAGACCCAAAAATGCTGCCTTGAAACTCTATATAACATCTCCACAGTGTTGGAAAATGTAGGGCCTCTG
GTCTCGAATGGTGTGGTGCACACCCTCTTGAAATTGTCCTCGTCAAAGGGCCCTTCGGACAGAGCCCTCGCAGCATTGGGGAACTTGGTGGTGACTTCACAGGGAAAGAA
AGCCATGGAGAGCAGCCCAATGGTCCCTGATAGCCTGATAGAGATTATGACATGGGAGGACAAACCAAAATCTATTGAGTTATCTGCTTATATCTTAATGATGTTGGCTC
ATCAGAGCTCAGAACAGAGAGAGAAGATGGCCAAGTCTGGGATTGTTGCAGTGCTTCTTGAAGTGGCATTACTGGGTAGTCCACTGGCTCAAAAGAGGGCATTAAAGCTA
TTACAGTGGTTTAAGAATGAGAAGCAAGCAAAAATGGAGCCACATTCTGGGCCACAGACAGGAAGAATAGTAATTGGGTCGCCTGTAAATCAAAGGATGTTCAGGAAGGG
AGAAAAATGA
mRNA sequence
ATGAACCCCATGTCGCCGGCGTCTTCTTCTTCTTCTCCTCAAACACCCATTTGGTACTTCACCGAAACTACAGTAAGGAGTAATGAAATTCTACAAGTTGCAGATGCGGA
TTGGGACGGCAGCGGTGGAAATGGTGGTGGTGTGGATGAATCTGCGGCGGAGTTGCAGAGGACGGTGAAGAAGCTCCACTTTGGCAGCTGGGAAGAGAAGGAGATTGCAG
CCAAAGTGATTGAGAAAATGTCTAAAGAGGACTTGAAGGGGAAGAAGTTGATGGTGGAGCTCCAGGTTATACCGGCCTTGGTTTCAATGGCGGCATCGGACGCCGTGGGG
CGGCCGGAAGTGGCTGTAAAGGCGTTGCTTGAGCTTGCCAAAGGAAGCTTGAGTAAGAACAAGGCCCTCATAGTGGAGGCAGGAATCTTACACAAACTTCCAAGTAACAT
CCAAACCATGGATGAATCAGCAAAACATGATTTTGCAAGATTGTTGTTGTCACTCTCGTCTCTGATCAATTCCCATTTCACAATTGCTTTACAGTCCAATGAAAATGCCA
TTCCATTTCTTGTGGAGATTCTTGATTCAACCTCAAACTCCGAGACCCAAAAATGCTGCCTTGAAACTCTATATAACATCTCCACAGTGTTGGAAAATGTAGGGCCTCTG
GTCTCGAATGGTGTGGTGCACACCCTCTTGAAATTGTCCTCGTCAAAGGGCCCTTCGGACAGAGCCCTCGCAGCATTGGGGAACTTGGTGGTGACTTCACAGGGAAAGAA
AGCCATGGAGAGCAGCCCAATGGTCCCTGATAGCCTGATAGAGATTATGACATGGGAGGACAAACCAAAATCTATTGAGTTATCTGCTTATATCTTAATGATGTTGGCTC
ATCAGAGCTCAGAACAGAGAGAGAAGATGGCCAAGTCTGGGATTGTTGCAGTGCTTCTTGAAGTGGCATTACTGGGTAGTCCACTGGCTCAAAAGAGGGCATTAAAGCTA
TTACAGTGGTTTAAGAATGAGAAGCAAGCAAAAATGGAGCCACATTCTGGGCCACAGACAGGAAGAATAGTAATTGGGTCGCCTGTAAATCAAAGGATGTTCAGGAAGGG
AGAAAAATGA
Protein sequence
MNPMSPASSSSSPQTPIWYFTETTVRSNEILQVADADWDGSGGNGGGVDESAAELQRTVKKLHFGSWEEKEIAAKVIEKMSKEDLKGKKLMVELQVIPALVSMAASDAVG
RPEVAVKALLELAKGSLSKNKALIVEAGILHKLPSNIQTMDESAKHDFARLLLSLSSLINSHFTIALQSNENAIPFLVEILDSTSNSETQKCCLETLYNISTVLENVGPL
VSNGVVHTLLKLSSSKGPSDRALAALGNLVVTSQGKKAMESSPMVPDSLIEIMTWEDKPKSIELSAYILMMLAHQSSEQREKMAKSGIVAVLLEVALLGSPLAQKRALKL
LQWFKNEKQAKMEPHSGPQTGRIVIGSPVNQRMFRKGEK
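
The protein sequence above should correspond to a translation of the CDS sequence under the standard genetic code. The following is a minimal Python sketch of such a check, not the method CuGenDB itself uses. The `translate` helper and the `CODON_TABLE` name are hypothetical, introduced here for illustration, and the two example fragments are the first 18 and last 18 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Hypothetical helper: whitespace and line breaks are ignored so that a
    wrapped sequence can be pasted in directly. Codons containing
    characters other than A, C, G, T raise a KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 18 nt of the CDS above; matches the start of the protein.
    print(translate("ATGAACCCCATGTCGCCG"))  # MNPMSP
    # Last 18 nt of the CDS above, including the TGA stop codon.
    print(translate("AGGAAGGGAGAAAAATGA"))  # RKGEK
```

Pasting the full CDS block into `translate` and comparing the result with the protein block (with line breaks removed) is one way to confirm the two listings agree.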