; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh18G011110 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh18G011110
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description18S pre-ribosomal assembly protein gar2-related, putative isoform 2
Genome locationCmo_Chr18:11514256..11518025
RNA-Seq ExpressionCmoCh18G011110
SyntenyCmoCh18G011110
Gene Ontology termsGO:0009786 - regulation of asymmetric cell division (biological process)
InterPro domainsIPR040378 - Protein BREAKING OF ASYMMETRY IN THE STOMATAL LINEAGE


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6574030.1 hypothetical protein SDJN03_27917, partial [Cucurbita argyrosperma subsp. sororia]1.8e-25998.09Show/hide
Query:  MKIEGEAVVCHSNAGPKFVPKSFECDNDAIDSGGMKLPDQKELASRPKGYEDADRNNIAADGWVVPKLDCLNFDDFNDYDKAKPVVSPRTNSSEVDLFEE
        MKIEGEAVVCHSNAGPKFVPKSFECDNDAIDSGGMKLPDQKELASRPKGYEDADRNNIA DGWVVPKLDCL+FDDFNDYDKAKPVVSPRTNSSEVDLFEE
Subjt:  MKIEGEAVVCHSNAGPKFVPKSFECDNDAIDSGGMKLPDQKELASRPKGYEDADRNNIAADGWVVPKLDCLNFDDFNDYDKAKPVVSPRTNSSEVDLFEE

Query:  DSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNLEKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKLRDSKDSAR
        DSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNLEKAVRAI PPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKLRDSKDS R
Subjt:  DSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNLEKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKLRDSKDSAR

Query:  TPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESGSNPKEAMSAFTTSAPVPEDPENSASGSDISYNSK
        TPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESGSNPKEAMSAFTTSAPVPEDPENSASG+DISYNSK
Subjt:  TPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESGSNPKEAMSAFTTSAPVPEDPENSASGSDISYNSK

Query:  LDNGNITFDFNSLSSTASDGLERCDNGDLNPSTPSTSALMDCQDTSSSSSSSKCQVQCHDTSISLKRVEYEDVLKAGVGNSVSHQVLSQVQRGGVGEASF
        LDNGNITFDFNSLSSTASDGLER DNGDLN STPSTSALMDC+DTSSSSSSSKCQVQCHDTSISLKRVEYEDVLKAGVGNSVSHQV SQVQRGGVGEASF
Subjt:  LDNGNITFDFNSLSSTASDGLERCDNGDLNPSTPSTSALMDCQDTSSSSSSSKCQVQCHDTSISLKRVEYEDVLKAGVGNSVSHQVLSQVQRGGVGEASF

Query:  SSMVPFRSNSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF
        SSMVPFRSNSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF
Subjt:  SSMVPFRSNSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF

XP_022944916.1 uncharacterized protein LOC111449304 [Cucurbita moschata]2.9e-265100Show/hide
Query:  MKIEGEAVVCHSNAGPKFVPKSFECDNDAIDSGGMKLPDQKELASRPKGYEDADRNNIAADGWVVPKLDCLNFDDFNDYDKAKPVVSPRTNSSEVDLFEE
        MKIEGEAVVCHSNAGPKFVPKSFECDNDAIDSGGMKLPDQKELASRPKGYEDADRNNIAADGWVVPKLDCLNFDDFNDYDKAKPVVSPRTNSSEVDLFEE
Subjt:  MKIEGEAVVCHSNAGPKFVPKSFECDNDAIDSGGMKLPDQKELASRPKGYEDADRNNIAADGWVVPKLDCLNFDDFNDYDKAKPVVSPRTNSSEVDLFEE

Query:  DSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNLEKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKLRDSKDSAR
        DSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNLEKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKLRDSKDSAR
Subjt:  DSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNLEKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKLRDSKDSAR

Query:  TPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESGSNPKEAMSAFTTSAPVPEDPENSASGSDISYNSK
        TPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESGSNPKEAMSAFTTSAPVPEDPENSASGSDISYNSK
Subjt:  TPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESGSNPKEAMSAFTTSAPVPEDPENSASGSDISYNSK

Query:  LDNGNITFDFNSLSSTASDGLERCDNGDLNPSTPSTSALMDCQDTSSSSSSSKCQVQCHDTSISLKRVEYEDVLKAGVGNSVSHQVLSQVQRGGVGEASF
        LDNGNITFDFNSLSSTASDGLERCDNGDLNPSTPSTSALMDCQDTSSSSSSSKCQVQCHDTSISLKRVEYEDVLKAGVGNSVSHQVLSQVQRGGVGEASF
Subjt:  LDNGNITFDFNSLSSTASDGLERCDNGDLNPSTPSTSALMDCQDTSSSSSSSKCQVQCHDTSISLKRVEYEDVLKAGVGNSVSHQVLSQVQRGGVGEASF

Query:  SSMVPFRSNSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF
        SSMVPFRSNSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF
Subjt:  SSMVPFRSNSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF

XP_022968482.1 uncharacterized protein LOC111467709 [Cucurbita maxima]6.9e-24392.63Show/hide
Query:  MKIEGEAVVCHSNAGPKFVPKSFECDNDAIDSGGMKLPDQKELASRPKGYEDADRNNIAADGWVVPKLDCLNFDDFNDYDKAKPVVSPRTNSSEVDLFEE
        MKIEGEAVVCHSNAGPKFVPKSFECDNDAIDSGGMKL D KELASRPKGYEDADRNNIAADGWVVPKLDCL+ DDFNDYDKAKP+VS RTNSSEVDLFEE
Subjt:  MKIEGEAVVCHSNAGPKFVPKSFECDNDAIDSGGMKLPDQKELASRPKGYEDADRNNIAADGWVVPKLDCLNFDDFNDYDKAKPVVSPRTNSSEVDLFEE

Query:  DSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNLEKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKLRDSKDSAR
        DSELFMEKSIVECQLPELIVCYKENICNIVKDICID+GVPSRD LEKA  AIVPPEKDWKDESTRDLAK +TFASDDSEHSESFGDKDSPKLRDSKDSAR
Subjt:  DSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNLEKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKLRDSKDSAR

Query:  TPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESGSNPKEAMSAFTTSAPVPEDPENSASGSDISYNSK
        TPDAEYDVAYFTDNDI+NLPMTDLVAGSLKPLINNKNEPHSQAEQVLIE TS ++AVSACVAEES SNPKEAMS FTTSAP PEDPENSASG+DISYNSK
Subjt:  TPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESGSNPKEAMSAFTTSAPVPEDPENSASGSDISYNSK

Query:  LDNGNITFDFNSLSSTASDGLERCDNGDLNPSTPSTSALMDCQDTSSSSSSSKCQVQCHDTSISLKRVEYEDVLKAGVGNSVSHQVLSQVQRGGVGEASF
        LDNGNITFDFNSLSS ASDGLE  DNGDLN STPSTSALMDC+DTSSSSSSSKCQVQCHDTSISLKRVEYEDVLKAGVGNSVSHQV S VQRG VGEASF
Subjt:  LDNGNITFDFNSLSSTASDGLERCDNGDLNPSTPSTSALMDCQDTSSSSSSSKCQVQCHDTSISLKRVEYEDVLKAGVGNSVSHQVLSQVQRGGVGEASF

Query:  SSMVPFRS---NSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF
        SSMVPF S   +SDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWK+SILCCRF
Subjt:  SSMVPFRS---NSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF

XP_023541070.1 uncharacterized protein LOC111801342 [Cucurbita pepo subsp. pepo]2.4e-25195.77Show/hide
Query:  MKIEGEAVVCHSNAGPKFVPKSFECDNDAIDSGGMKLPDQKELASRPKGYEDADRNNIAADGWVVPKLDCLNFDDFNDYDKA-KPVVSPRTNSSEVDLFE
        MKIEGEAVVCHSNAGPKFVPKSFECDNDAIDSGG+KL DQKELASRPKGYEDADRNNIAADGWVVPKLDCL+ DDFNDYDKA KP+VSPRTNSSEVDLFE
Subjt:  MKIEGEAVVCHSNAGPKFVPKSFECDNDAIDSGGMKLPDQKELASRPKGYEDADRNNIAADGWVVPKLDCLNFDDFNDYDKA-KPVVSPRTNSSEVDLFE

Query:  EDSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNLEKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKLRDSKDSA
        EDSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNLEKAVRAIVPPEKDWKDES RDLAKG+ FASDDSEHSESFGDKDSPKLRDSKDSA
Subjt:  EDSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNLEKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKLRDSKDSA

Query:  RTPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESGSNPKEAMSAFTTSAPVPEDPENSASGSDISYNS
        RTPDAEYDVAYFTDNDI+NLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESGSNPKEAMSAFTTS+PVPEDPENSASG+DISYNS
Subjt:  RTPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESGSNPKEAMSAFTTSAPVPEDPENSASGSDISYNS

Query:  KLDNGNITFDFNSLSSTASDGLERCDNGDLNPSTPSTSALMDCQDTSSSSSSSKCQVQCHDTSISLKRVEYEDVLKAGVGNSVSHQVLSQVQRGGVGEAS
        KLDNGNITFDFNSLSSTASDGLER DNGDLN STPSTSALMDC+DTSSSSSSSKCQVQCHDTSISLKRVEYEDVLKAGVG    HQV SQVQRGGVGEAS
Subjt:  KLDNGNITFDFNSLSSTASDGLERCDNGDLNPSTPSTSALMDCQDTSSSSSSSKCQVQCHDTSISLKRVEYEDVLKAGVGNSVSHQVLSQVQRGGVGEAS

Query:  FSSMVPFRSNSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF
        FSSMVPFRSNSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF
Subjt:  FSSMVPFRSNSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF

XP_038892052.1 uncharacterized protein LOC120081347 isoform X1 [Benincasa hispida]5.2e-17467.33Show/hide
Query:  MKIEGEAVVCHSNAGPKFVPKSFECDNDAIDSGGMKLPDQKELASRPKGYEDADRNNIAADGWVVPKLDCLNFDDFNDYDKAKPVVSPRTNSSEVDLFEE
        MK+EGE +V HSNA P+FVPKSFECDNDA++SGGMKL DQKE  S  KG EDA+ NN AADGWV  K +CL+ DDFN+YD+ K  VSP TNSS+VDLFEE
Subjt:  MKIEGEAVVCHSNAGPKFVPKSFECDNDAIDSGGMKLPDQKELASRPKGYEDADRNNIAADGWVVPKLDCLNFDDFNDYDKAKPVVSPRTNSSEVDLFEE

Query:  DSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNL-------EKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKLR
        DSEL+MEKS VECQLPELIVCYKENICNIVKDICID+GVPSRD L       EK V +I+PP   WKD+  R+L K + +ASDDSEHSESFG+KDSPK  
Subjt:  DSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNL-------EKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKLR

Query:  DSKDSARTPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESGSNPKEAMSAFTTSAPVPEDPENSASGS
        DS D  RTP+AEYDVAYFTDND   +PMTD V  SLKPL NNK EPH ++EQV IE TSL++ V ACVAEES S+ +E +S  TTSA  PE+ +NS S +
Subjt:  DSKDSARTPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESGSNPKEAMSAFTTSAPVPEDPENSASGS

Query:  DISYNSKLDNGNITFDFNSLSSTASDGLERCDNGDLNPSTPSTSALMDCQDTSSS---SSSSKCQVQCHDTSIS--------LKRVEYEDVLKAGVGNSV
        D+SYNSK+D GNITFDFNSL+STASDGLE CDN DLN S PSTSA + CQ+TSSS   +S+ KCQ QCHDTS +        L R+EYED+ K  VGN  
Subjt:  DISYNSKLDNGNITFDFNSLSSTASDGLERCDNGDLNPSTPSTSALMDCQDTSSS---SSSSKCQVQCHDTSIS--------LKRVEYEDVLKAGVGNSV

Query:  SHQVLSQVQRGG---------VGEASFSSMVP---FRSNSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCC
        SH V SQVQ GG         +GE SFSSMVP     SNS  IGYSGSIS+RSDSSTTSTRSFAFP+LQ EWNSSPVRMAKADR+HLRKHR W++ ILCC
Subjt:  SHQVLSQVQRGG---------VGEASFSSMVP---FRSNSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCC

Query:  RF
        RF
Subjt:  RF

TrEMBL top hitse value%identityAlignment
A0A1S3BFZ0 uncharacterized protein LOC103489197 isoform X17.1e-16165.85Show/hide
Query:  MKIEGEAVVCHSNAGPKFVPKSFECDNDAIDSGGMKLPDQKELASRPKGYEDADRNNIAADGWVVPKLDCLNFDDFNDYDKAKPVVSPRTNSSEVDLFEE
        MK+EGE +VCHSNA PKFVPKSFECDND ++SGGMKL DQKE  S  KG  DA  NN AADGWV  K +CL+ DDFNDYD  K  VSP  NS +VDL EE
Subjt:  MKIEGEAVVCHSNAGPKFVPKSFECDNDAIDSGGMKLPDQKELASRPKGYEDADRNNIAADGWVVPKLDCLNFDDFNDYDKAKPVVSPRTNSSEVDLFEE

Query:  DSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNL-------EKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKLR
        DSEL+MEKSIVECQLPELIVCYKENICNIVKDICID+G P RD L       E+ V +I PP KDWKDES  +L + + FASDDSEHSESFG KDSP   
Subjt:  DSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNL-------EKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKLR

Query:  DSKDSARTPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESGSNPKEAMSAFTTSAPVPEDPENSASGS
        DSKD A TP+AEYDVAYFTDND   +PMTDLV  SLKPL +NK +PH Q+EQV IE T  ++ V A VA+ES  N +E  S   TSA   EDP+NS S +
Subjt:  DSKDSARTPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESGSNPKEAMSAFTTSAPVPEDPENSASGS

Query:  DISYNSKLDNGNITFDFNSLSSTASDGLERCDNGDLNPSTPSTSALMDCQDTSSS---SSSSKCQVQCHDTSIS--------LKRVEYEDVLKAGVGNSV
         +SYNSK+D GNITFDFNSL+ TASDGLERCDNGDLN S PSTSA + C++T+SS   +S+ K + QCH+TS +        L RVEYED+ K  VGN  
Subjt:  DISYNSKLDNGNITFDFNSLSSTASDGLERCDNGDLNPSTPSTSALMDCQDTSSS---SSSSKCQVQCHDTSIS--------LKRVEYEDVLKAGVGNSV

Query:  SHQVLSQVQRGGVGEASFS--SMVPFRSNSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF
        SH V S+VQ+ GVGE SFS   +    SNS RIGYSGSIS RSDSSTTSTRSFAFP+LQ EWNSSPVRMAK DRKHL+KHR W+  ILCCRF
Subjt:  SHQVLSQVQRGGVGEASFS--SMVPFRSNSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF

A0A6J1FZH8 uncharacterized protein LOC1114493041.4e-265100Show/hide
Query:  MKIEGEAVVCHSNAGPKFVPKSFECDNDAIDSGGMKLPDQKELASRPKGYEDADRNNIAADGWVVPKLDCLNFDDFNDYDKAKPVVSPRTNSSEVDLFEE
        MKIEGEAVVCHSNAGPKFVPKSFECDNDAIDSGGMKLPDQKELASRPKGYEDADRNNIAADGWVVPKLDCLNFDDFNDYDKAKPVVSPRTNSSEVDLFEE
Subjt:  MKIEGEAVVCHSNAGPKFVPKSFECDNDAIDSGGMKLPDQKELASRPKGYEDADRNNIAADGWVVPKLDCLNFDDFNDYDKAKPVVSPRTNSSEVDLFEE

Query:  DSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNLEKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKLRDSKDSAR
        DSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNLEKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKLRDSKDSAR
Subjt:  DSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNLEKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKLRDSKDSAR

Query:  TPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESGSNPKEAMSAFTTSAPVPEDPENSASGSDISYNSK
        TPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESGSNPKEAMSAFTTSAPVPEDPENSASGSDISYNSK
Subjt:  TPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESGSNPKEAMSAFTTSAPVPEDPENSASGSDISYNSK

Query:  LDNGNITFDFNSLSSTASDGLERCDNGDLNPSTPSTSALMDCQDTSSSSSSSKCQVQCHDTSISLKRVEYEDVLKAGVGNSVSHQVLSQVQRGGVGEASF
        LDNGNITFDFNSLSSTASDGLERCDNGDLNPSTPSTSALMDCQDTSSSSSSSKCQVQCHDTSISLKRVEYEDVLKAGVGNSVSHQVLSQVQRGGVGEASF
Subjt:  LDNGNITFDFNSLSSTASDGLERCDNGDLNPSTPSTSALMDCQDTSSSSSSSKCQVQCHDTSISLKRVEYEDVLKAGVGNSVSHQVLSQVQRGGVGEASF

Query:  SSMVPFRSNSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF
        SSMVPFRSNSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF
Subjt:  SSMVPFRSNSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF

A0A6J1GWT7 uncharacterized protein LOC111458170 isoform X17.1e-16165.18Show/hide
Query:  MKIEGEAVVCHSNAGPKFVPKSFECDNDAIDSGGMKLPDQKELASRPKGYEDADRNNIAADGWVVPKLDCLNFDDFNDYDKAKPVVSPRTNSSEVDLFEE
        MK+EGE VV HS+A PKF+PKSFECDNDA+DSGGMKL D K+     K  EDA+  N             +  DD N++D+ K  V   TNSS+VDLFEE
Subjt:  MKIEGEAVVCHSNAGPKFVPKSFECDNDAIDSGGMKLPDQKELASRPKGYEDADRNNIAADGWVVPKLDCLNFDDFNDYDKAKPVVSPRTNSSEVDLFEE

Query:  DSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNL--------EKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKL
        DSEL+MEKSIVECQLPELIVCYKEN CNIVKDICID+GVPSRD L        EKAV  I+PPE+DWKDE  R L +G+ FASDDSEHSESFG KDSPK 
Subjt:  DSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNL--------EKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKL

Query:  RDSKDSARTPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESGSNPKEAMSAFTTSAPVPEDPENSASG
         D +D ARTP+AEYDV YFTDNDI+NLPMTDL   S+KPL NNKNEP+ Q+EQV IE TSL++ V ACVAEES S+ +E +S   +S    E+P+NS S 
Subjt:  RDSKDSARTPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESGSNPKEAMSAFTTSAPVPEDPENSASG

Query:  SDISYNSKLDNGNITFDFNSLSSTASDGLERCDNGDLNPSTPSTSALMDCQDTSSS---SSSSKCQVQCHDTSISLKRVEYEDVL--------KAGVGNS
         DISYNSK+D GNITFDFNS +STASDGLE CDNG LN S PSTSA +DC D+SSS   +S+ KCQ  C+D S + KRVEYED+L        KA VGNS
Subjt:  SDISYNSKLDNGNITFDFNSLSSTASDGLERCDNGDLNPSTPSTSALMDCQDTSSS---SSSSKCQVQCHDTSISLKRVEYEDVL--------KAGVGNS

Query:  VSHQVLSQVQRGGVGEASFSSMV---PFRSNSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF
         S+ V SQVQ  GVGEAS SSMV      SNS RIGYSGSIS RSDSSTTST SFAFP+LQ EWNSSPVRMAKAD+KHLRK R W++ +LCCRF
Subjt:  VSHQVLSQVQRGGVGEASFSSMV---PFRSNSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF

A0A6J1HTL9 uncharacterized protein LOC1114677093.3e-24392.63Show/hide
Query:  MKIEGEAVVCHSNAGPKFVPKSFECDNDAIDSGGMKLPDQKELASRPKGYEDADRNNIAADGWVVPKLDCLNFDDFNDYDKAKPVVSPRTNSSEVDLFEE
        MKIEGEAVVCHSNAGPKFVPKSFECDNDAIDSGGMKL D KELASRPKGYEDADRNNIAADGWVVPKLDCL+ DDFNDYDKAKP+VS RTNSSEVDLFEE
Subjt:  MKIEGEAVVCHSNAGPKFVPKSFECDNDAIDSGGMKLPDQKELASRPKGYEDADRNNIAADGWVVPKLDCLNFDDFNDYDKAKPVVSPRTNSSEVDLFEE

Query:  DSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNLEKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKLRDSKDSAR
        DSELFMEKSIVECQLPELIVCYKENICNIVKDICID+GVPSRD LEKA  AIVPPEKDWKDESTRDLAK +TFASDDSEHSESFGDKDSPKLRDSKDSAR
Subjt:  DSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNLEKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKLRDSKDSAR

Query:  TPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESGSNPKEAMSAFTTSAPVPEDPENSASGSDISYNSK
        TPDAEYDVAYFTDNDI+NLPMTDLVAGSLKPLINNKNEPHSQAEQVLIE TS ++AVSACVAEES SNPKEAMS FTTSAP PEDPENSASG+DISYNSK
Subjt:  TPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESGSNPKEAMSAFTTSAPVPEDPENSASGSDISYNSK

Query:  LDNGNITFDFNSLSSTASDGLERCDNGDLNPSTPSTSALMDCQDTSSSSSSSKCQVQCHDTSISLKRVEYEDVLKAGVGNSVSHQVLSQVQRGGVGEASF
        LDNGNITFDFNSLSS ASDGLE  DNGDLN STPSTSALMDC+DTSSSSSSSKCQVQCHDTSISLKRVEYEDVLKAGVGNSVSHQV S VQRG VGEASF
Subjt:  LDNGNITFDFNSLSSTASDGLERCDNGDLNPSTPSTSALMDCQDTSSSSSSSKCQVQCHDTSISLKRVEYEDVLKAGVGNSVSHQVLSQVQRGGVGEASF

Query:  SSMVPFRS---NSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF
        SSMVPF S   +SDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWK+SILCCRF
Subjt:  SSMVPFRS---NSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF

A0A6J1IHW9 uncharacterized protein LOC111477604 isoform X16.7e-15965.05Show/hide
Query:  MKIEGEAVVCHSNAGPKFVPKSFECDNDAIDSGGMKLPDQKELASRPKGYEDADRNNIAADGWVVPKLDCLNFDDFNDYDKAKPVVSPRTNSSEVDLFEE
        MK+EGE VV HS+A PKF+PKSFECDNDA+DSGGMKL D K+     KG EDA++ N             +  DD N++D+ K  V   TNSS+VDLFEE
Subjt:  MKIEGEAVVCHSNAGPKFVPKSFECDNDAIDSGGMKLPDQKELASRPKGYEDADRNNIAADGWVVPKLDCLNFDDFNDYDKAKPVVSPRTNSSEVDLFEE

Query:  DSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNL--------EKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKL
        DSEL+MEKSIVECQLPELIVCYKEN CNIVKDICID+GVPSRD L        EKAV  I+PPE+DWKDE  R L +G+ FASDDSEHSESFG KDSPK 
Subjt:  DSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNL--------EKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKL

Query:  RDSKDSARTPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESGSNPKEAMS-AFTTSAPVPEDPENSAS
        RD ++ ARTP AEYDV Y TDND++NLP+TDL   S+KPL NNKNEP+ Q+EQVL           ACVAEES S+ +EA+S  F  S    E+P+NS S
Subjt:  RDSKDSARTPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESGSNPKEAMS-AFTTSAPVPEDPENSAS

Query:  GSDISYNSKLDNGNITFDFNSLSSTASDGLERCDNGDLNPSTPSTSALMDCQDTSSS---SSSSKCQVQCHDTSISLKRVEYEDVL--------KAGVGN
          DISYNSK+D GNITFDFNS +STASDGLE CDNGDLN S PSTSA +DC DTSSS   +S+ KCQV C+D S + KRVEYED+L        KA VGN
Subjt:  GSDISYNSKLDNGNITFDFNSLSSTASDGLERCDNGDLNPSTPSTSALMDCQDTSSS---SSSSKCQVQCHDTSISLKRVEYEDVL--------KAGVGN

Query:  SVSHQVLSQVQRGGVGEASFSSMV---PFRSNSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF
        S SH V SQVQ  GVGEAS SSMV      SNS RIGYSGSIS RSDSSTTSTRSFAFP+LQ EWNSSPVRMAKAD+KHLRK R W++ +LCCRF
Subjt:  SVSHQVLSQVQRGGVGEASFSSMV---PFRSNSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13650.2 BEST Arabidopsis thaliana protein match is: 18S pre-ribosomal assembly protein gar2-related (TAIR:AT2G03810.4)4.0e-0750.7Show/hide
Query:  GVGEASFSSMVPFRSNSDRIGYSG------SISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLR
        G GE SF      +S+ D I Y G      ++SIRSD   TS  SFA P+LQ EWNSSPVRM KA+   LR
Subjt:  GVGEASFSSMVPFRSNSDRIGYSG------SISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLR

AT2G03810.1 18S pre-ribosomal assembly protein gar2-related3.9e-2629.82Show/hide
Query:  EEDSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNLEKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKLRDSKDS
        ++D   +M+K++  C LPE++VCYKEN  +IVKDIC+DEGVP ++      + +   +   K  ST DL K +    + SE             + ++DS
Subjt:  EEDSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNLEKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKLRDSKDS

Query:  -ARTPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESG------SNPKEAMSAFTTSAPVP-EDPENSA
         ++  D+E+   + TD D+      D      +   +N N+ H    + +  + +  L+ S    +E+       S   ++    T    +  ED + S 
Subjt:  -ARTPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESG------SNPKEAMSAFTTSAPVP-EDPENSA

Query:  SGSDISYNSKLDNGNITF---DFNSLSSTASD-GLERCD---NGDLNPSTPSTSALMDCQDTSSSSSSSKCQVQCHDTSISLKRVEYEDVLKAGVGNSVS
        +  +IS +S  +         +  SL +TA +  LE+ +    G+   S+ ST+   +   T   +   K + + H     L    YED       +  S
Subjt:  SGSDISYNSKLDNGNITF---DFNSLSSTASD-GLERCD---NGDLNPSTPSTSALMDCQDTSSSSSSSKCQVQCHDTSISLKRVEYEDVLKAGVGNSVS

Query:  HQVLSQVQRGGVGEASFSSMVPFRSNSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF
             +         S S  + +   S  I YSGS+S+RSD+STTS RSFAFP+LQ EWNSSPVRMAKAD++  R+   W+ ++LCCRF
Subjt:  HQVLSQVQRGGVGEASFSSMVPFRSNSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF

AT2G03810.2 18S pre-ribosomal assembly protein gar2-related3.9e-2629.82Show/hide
Query:  EEDSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNLEKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKLRDSKDS
        ++D   +M+K++  C LPE++VCYKEN  +IVKDIC+DEGVP ++      + +   +   K  ST DL K +    + SE             + ++DS
Subjt:  EEDSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNLEKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKLRDSKDS

Query:  -ARTPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESG------SNPKEAMSAFTTSAPVP-EDPENSA
         ++  D+E+   + TD D+      D      +   +N N+ H    + +  + +  L+ S    +E+       S   ++    T    +  ED + S 
Subjt:  -ARTPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESG------SNPKEAMSAFTTSAPVP-EDPENSA

Query:  SGSDISYNSKLDNGNITF---DFNSLSSTASD-GLERCD---NGDLNPSTPSTSALMDCQDTSSSSSSSKCQVQCHDTSISLKRVEYEDVLKAGVGNSVS
        +  +IS +S  +         +  SL +TA +  LE+ +    G+   S+ ST+   +   T   +   K + + H     L    YED       +  S
Subjt:  SGSDISYNSKLDNGNITF---DFNSLSSTASD-GLERCD---NGDLNPSTPSTSALMDCQDTSSSSSSSKCQVQCHDTSISLKRVEYEDVLKAGVGNSVS

Query:  HQVLSQVQRGGVGEASFSSMVPFRSNSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF
             +         S S  + +   S  I YSGS+S+RSD+STTS RSFAFP+LQ EWNSSPVRMAKAD++  R+   W+ ++LCCRF
Subjt:  HQVLSQVQRGGVGEASFSSMVPFRSNSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF

AT2G03810.3 18S pre-ribosomal assembly protein gar2-related3.9e-2629.82Show/hide
Query:  EEDSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNLEKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKLRDSKDS
        ++D   +M+K++  C LPE++VCYKEN  +IVKDIC+DEGVP ++      + +   +   K  ST DL K +    + SE             + ++DS
Subjt:  EEDSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNLEKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKLRDSKDS

Query:  -ARTPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESG------SNPKEAMSAFTTSAPVP-EDPENSA
         ++  D+E+   + TD D+      D      +   +N N+ H    + +  + +  L+ S    +E+       S   ++    T    +  ED + S 
Subjt:  -ARTPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESG------SNPKEAMSAFTTSAPVP-EDPENSA

Query:  SGSDISYNSKLDNGNITF---DFNSLSSTASD-GLERCD---NGDLNPSTPSTSALMDCQDTSSSSSSSKCQVQCHDTSISLKRVEYEDVLKAGVGNSVS
        +  +IS +S  +         +  SL +TA +  LE+ +    G+   S+ ST+   +   T   +   K + + H     L    YED       +  S
Subjt:  SGSDISYNSKLDNGNITF---DFNSLSSTASD-GLERCD---NGDLNPSTPSTSALMDCQDTSSSSSSSKCQVQCHDTSISLKRVEYEDVLKAGVGNSVS

Query:  HQVLSQVQRGGVGEASFSSMVPFRSNSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF
             +         S S  + +   S  I YSGS+S+RSD+STTS RSFAFP+LQ EWNSSPVRMAKAD++  R+   W+ ++LCCRF
Subjt:  HQVLSQVQRGGVGEASFSSMVPFRSNSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF

AT2G03810.4 18S pre-ribosomal assembly protein gar2-related3.9e-2629.82Show/hide
Query:  EEDSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNLEKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKLRDSKDS
        ++D   +M+K++  C LPE++VCYKEN  +IVKDIC+DEGVP ++      + +   +   K  ST DL K +    + SE             + ++DS
Subjt:  EEDSELFMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDNLEKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKLRDSKDS

Query:  -ARTPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESG------SNPKEAMSAFTTSAPVP-EDPENSA
         ++  D+E+   + TD D+      D      +   +N N+ H    + +  + +  L+ S    +E+       S   ++    T    +  ED + S 
Subjt:  -ARTPDAEYDVAYFTDNDIINLPMTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESG------SNPKEAMSAFTTSAPVP-EDPENSA

Query:  SGSDISYNSKLDNGNITF---DFNSLSSTASD-GLERCD---NGDLNPSTPSTSALMDCQDTSSSSSSSKCQVQCHDTSISLKRVEYEDVLKAGVGNSVS
        +  +IS +S  +         +  SL +TA +  LE+ +    G+   S+ ST+   +   T   +   K + + H     L    YED       +  S
Subjt:  SGSDISYNSKLDNGNITF---DFNSLSSTASD-GLERCD---NGDLNPSTPSTSALMDCQDTSSSSSSSKCQVQCHDTSISLKRVEYEDVLKAGVGNSVS

Query:  HQVLSQVQRGGVGEASFSSMVPFRSNSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF
             +         S S  + +   S  I YSGS+S+RSD+STTS RSFAFP+LQ EWNSSPVRMAKAD++  R+   W+ ++LCCRF
Subjt:  HQVLSQVQRGGVGEASFSSMVPFRSNSDRIGYSGSISIRSDSSTTSTRSFAFPVLQPEWNSSPVRMAKADRKHLRKHRCWKKSILCCRF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAATTGAGGGTGAAGCCGTAGTTTGCCATTCAAATGCTGGCCCCAAGTTTGTTCCTAAGTCATTTGAATGTGATAATGATGCTATTGATTCTGGTGGGATGAAGCT
ACCAGATCAGAAAGAACTTGCAAGCCGTCCGAAAGGTTATGAGGATGCCGATCGCAATAATATTGCTGCAGATGGTTGGGTTGTACCTAAGCTTGATTGTTTAAATTTTG
ATGATTTTAATGACTATGACAAGGCTAAACCCGTTGTGTCACCGCGCACGAATTCTTCGGAAGTAGACTTGTTCGAGGAAGACTCGGAGTTGTTCATGGAAAAGAGTATC
GTTGAATGTCAACTTCCTGAACTGATAGTTTGTTACAAAGAAAATATTTGCAATATCGTTAAGGATATTTGTATCGACGAGGGAGTACCTTCTCGGGATAACCTTGAGAA
GGCTGTCCGTGCCATTGTCCCTCCTGAGAAAGACTGGAAGGATGAGTCGACACGAGACCTGGCAAAAGGAGAGACGTTTGCTTCAGATGATTCAGAGCATTCGGAATCGT
TTGGTGACAAGGATTCACCCAAACTACGCGATTCCAAGGATTCGGCTAGAACACCCGACGCGGAGTATGATGTGGCATATTTTACTGACAATGATATAATAAATCTTCCA
ATGACAGACTTGGTTGCAGGGAGCTTAAAGCCATTGATCAACAATAAGAATGAGCCTCACTCTCAGGCTGAACAGGTTTTGATTGAAGCTACGAGTTTGAAACTCGCTGT
TTCGGCATGTGTAGCTGAAGAATCTGGTAGTAACCCGAAAGAAGCAATGTCAGCATTCACTACATCAGCTCCCGTACCGGAAGATCCCGAAAATAGCGCTTCTGGGAGTG
ATATATCATATAATAGTAAATTGGACAATGGAAACATCACTTTTGATTTTAATTCTTTATCATCCACAGCTAGTGATGGACTCGAGCGTTGTGATAATGGCGACTTAAAC
CCTTCAACTCCTTCGACCAGTGCCTTGATGGACTGCCAAGACACGAGCAGCAGCTCGAGCAGCTCCAAATGTCAAGTGCAGTGTCACGATACTAGTATTAGCCTCAAACG
TGTGGAATATGAAGATGTACTGAAGGCAGGAGTTGGGAATTCTGTAAGTCATCAAGTTTTGAGCCAAGTTCAACGTGGTGGCGTAGGTGAAGCAAGTTTCTCTTCAATGG
TACCGTTCAGGAGTAATTCAGATCGTATAGGGTACTCCGGCAGCATTTCTATTCGGTCGGATAGCAGCACGACGAGCACCCGTTCCTTTGCATTTCCTGTATTACAACCC
GAGTGGAATAGTAGCCCTGTTAGAATGGCTAAAGCTGATCGAAAGCATCTACGAAAGCATAGGTGTTGGAAAAAGAGCATTCTGTGCTGTAGATTCTGA
mRNA sequenceShow/hide mRNA sequence
AAAAAAAAAAATGGGAGACAGCTTCTGATCCGAAAACCAGGGTAGAACAGTGCTAGAGTTTTGATCAAGCAGCCGAAATCCGCTATCAGAATCAGCGTTTTCCATACAAC
CTGGTGGACTAATCTTATGTTATCGCACTCTCAAGGAAACAGCTCCGTCCAGCTAATTTGCTCTGTTCTCACTTTTTCCGGCAAAAAAGCTGAACCGATCAACAAAAATA
AATTAAAATCCCCCGGCGGTTGCTTTTGAGGAGTTGGAAATGAAAACCTGTCCGATTTTGAGTTCGTAACAGCAGAAAATGCTATCTGGGGTATACTTCCTGACATGATT
GTCACCCCAAGAAAAGGCGCTGACTGCAAAAGGGTACTAGGGATTGCTGCTCGATCGGTTTAAATACCTACATTTGAAGCAGCTTCTTCGGGATTGGATTCCAGAATCGT
CAAGGCGCACCTCTTGGTTTCAGGCTGCTGCTTGGTGGGTTCTTCTGTACAGTCGCTTTAACACATTTTTTTAGTGCATTTTCTTCTGTTGTTCCTACAACCTTCAAGAC
TCTTTGGTGCATGTCTTGAGAAAATCCATTGGAAACAGTTCATTTGAATCATGAAAATTGAGGGTGAAGCCGTAGTTTGCCATTCAAATGCTGGCCCCAAGTTTGTTCCT
AAGTCATTTGAATGTGATAATGATGCTATTGATTCTGGTGGGATGAAGCTACCAGATCAGAAAGAACTTGCAAGCCGTCCGAAAGGTTATGAGGATGCCGATCGCAATAA
TATTGCTGCAGATGGTTGGGTTGTACCTAAGCTTGATTGTTTAAATTTTGATGATTTTAATGACTATGACAAGGCTAAACCCGTTGTGTCACCGCGCACGAATTCTTCGG
AAGTAGACTTGTTCGAGGAAGACTCGGAGTTGTTCATGGAAAAGAGTATCGTTGAATGTCAACTTCCTGAACTGATAGTTTGTTACAAAGAAAATATTTGCAATATCGTT
AAGGATATTTGTATCGACGAGGGAGTACCTTCTCGGGATAACCTTGAGAAGGCTGTCCGTGCCATTGTCCCTCCTGAGAAAGACTGGAAGGATGAGTCGACACGAGACCT
GGCAAAAGGAGAGACGTTTGCTTCAGATGATTCAGAGCATTCGGAATCGTTTGGTGACAAGGATTCACCCAAACTACGCGATTCCAAGGATTCGGCTAGAACACCCGACG
CGGAGTATGATGTGGCATATTTTACTGACAATGATATAATAAATCTTCCAATGACAGACTTGGTTGCAGGGAGCTTAAAGCCATTGATCAACAATAAGAATGAGCCTCAC
TCTCAGGCTGAACAGGTTTTGATTGAAGCTACGAGTTTGAAACTCGCTGTTTCGGCATGTGTAGCTGAAGAATCTGGTAGTAACCCGAAAGAAGCAATGTCAGCATTCAC
TACATCAGCTCCCGTACCGGAAGATCCCGAAAATAGCGCTTCTGGGAGTGATATATCATATAATAGTAAATTGGACAATGGAAACATCACTTTTGATTTTAATTCTTTAT
CATCCACAGCTAGTGATGGACTCGAGCGTTGTGATAATGGCGACTTAAACCCTTCAACTCCTTCGACCAGTGCCTTGATGGACTGCCAAGACACGAGCAGCAGCTCGAGC
AGCTCCAAATGTCAAGTGCAGTGTCACGATACTAGTATTAGCCTCAAACGTGTGGAATATGAAGATGTACTGAAGGCAGGAGTTGGGAATTCTGTAAGTCATCAAGTTTT
GAGCCAAGTTCAACGTGGTGGCGTAGGTGAAGCAAGTTTCTCTTCAATGGTACCGTTCAGGAGTAATTCAGATCGTATAGGGTACTCCGGCAGCATTTCTATTCGGTCGG
ATAGCAGCACGACGAGCACCCGTTCCTTTGCATTTCCTGTATTACAACCCGAGTGGAATAGTAGCCCTGTTAGAATGGCTAAAGCTGATCGAAAGCATCTACGAAAGCAT
AGGTGTTGGAAAAAGAGCATTCTGTGCTGTAGATTCTGAAACCTTTTACTAATTTATTCGTAGGTTTAGTGTCGAAAATATAAACGTTTACTTCACACACACACTGTCGG
GTCATTTTACGAGATTAGTCGAAATACGTATTAGTTGATTCGAACACTCGATGCAGATTAACATTGTTTTGACTGTAAAAGGAGTCGGTGATGAGTTTGGCTTGCTCAAG
TGTTCTTATGAGCTAAACCAAGAGTGTTTGACAAAATCATTTGGTTTGTGGAATGGGAAAAGCTTGTACTGCTGAACATTGCTATAAATATATAGATATACAACATTCTT
TTGCG
Protein sequenceShow/hide protein sequence
MKIEGEAVVCHSNAGPKFVPKSFECDNDAIDSGGMKLPDQKELASRPKGYEDADRNNIAADGWVVPKLDCLNFDDFNDYDKAKPVVSPRTNSSEVDLFEEDSELFMEKSI
VECQLPELIVCYKENICNIVKDICIDEGVPSRDNLEKAVRAIVPPEKDWKDESTRDLAKGETFASDDSEHSESFGDKDSPKLRDSKDSARTPDAEYDVAYFTDNDIINLP
MTDLVAGSLKPLINNKNEPHSQAEQVLIEATSLKLAVSACVAEESGSNPKEAMSAFTTSAPVPEDPENSASGSDISYNSKLDNGNITFDFNSLSSTASDGLERCDNGDLN
PSTPSTSALMDCQDTSSSSSSSKCQVQCHDTSISLKRVEYEDVLKAGVGNSVSHQVLSQVQRGGVGEASFSSMVPFRSNSDRIGYSGSISIRSDSSTTSTRSFAFPVLQP
EWNSSPVRMAKADRKHLRKHRCWKKSILCCRF