; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G03870 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G03870
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionprotein UPSTREAM OF FLC isoform X1
Genome locationClcChr09:3010032..3015460
RNA-Seq ExpressionClc09G03870
SyntenyClc09G03870
Gene Ontology termsGO:0051258 - protein polymerization (biological process)
GO:0005886 - plasma membrane (cellular component)
InterPro domainsIPR010369 - Protein SOSEKI


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136105.3 uncharacterized protein LOC101207468 isoform X1 [Cucumis sativus]6.7e-15473.18Show/hide
Query:  MEANK----GGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVL
        MEAN     GGE+RKVHIIYFLSRMGHVEQPHLIRVHHLAGG  GVYLRDVKRWLGELRG  M EAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVL
Subjt:  MEANK----GGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVL

Query:  QGSQIIFPTTLFDAKESLFTEELEPNEDFSLKLVQKNVEESPPWDSERSTVTDDGDSMKVEADTSKNLKT---QRIQQQIQHEIEGFKTQY--SSLYAKL
        QGSQIIFP+TLFD K SLF EELE +EDFS K+ + N E+SPP DSERSTVTDDGDSMKVE +T+KNL+T   Q ++++   +IEG KTQY  SSLYAKL
Subjt:  QGSQIIFPTTLFDAKESLFTEELEPNEDFSLKLVQKNVEESPPWDSERSTVTDDGDSMKVEADTSKNLKT---QRIQQQIQHEIEGFKTQY--SSLYAKL

Query:  VNNNNKDKHKDKDLMEKEGGPTATSTVTSSSQGAFAKSKSYSSGASNVLRQWITCGPGAVDTNDAVLIKNRSPKDPLNQPPPEKLKNDAVLCRDDVLGGS
          NN K+K KDKDLM KEGGPTATSTV+SS+Q AF KSKSYSSGAS+V RQ ITCG GA DTND VL+KNRS      QP     KNDAV+CRD+VLGGS
Subjt:  VNNNNKDKHKDKDLMEKEGGPTATSTVTSSSQGAFAKSKSYSSGASNVLRQWITCGPGAVDTNDAVLIKNRSPKDPLNQPPPEKLKNDAVLCRDDVLGGS

Query:  ARVLGNSWD-GNLEIRRQYSPQTSRKSFDDSRKKRPKESGGRKVEAATTYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLT---------KTASTSD
        ARV+GNSWD GNLEIRR    QTSRKS DD RKKRPKE+G  KV A  TYKPMAGPNCSLCGK+FRPEKMHSHMKSCRG+K LT            +TSD
Subjt:  ARVLGNSWD-GNLEIRRQYSPQTSRKSFDDSRKKRPKESGGRKVEAATTYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLT---------KTASTSD

Query:  KTTSSNSTSTRTSDKDSTSTYLLTN
        K+T S ST   TSD D  STY+LTN
Subjt:  KTTSSNSTSTRTSDKDSTSTYLLTN

XP_008461226.1 PREDICTED: protein UPSTREAM OF FLC isoform X1 [Cucumis melo]3.4e-16678.83Show/hide
Query:  NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIF
        N+GGE+RKVHIIYFLSRMGHVEQPHLIRVHHLAG  GGVYLRDVKRWLGELRG  MPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIF
Subjt:  NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIF

Query:  PTTLFDAKESLFTEELEPNEDFSLKLVQKNVEESPPWDSERSTVTDDGDSMKVEADTSKNL----KTQRIQQQIQHEIEGFKTQY--SSLYAKLVNNNNK
        P+TLFD K+SLF EELE +EDFS KL Q+N E+SPP+DSERSTVTDDGDS+KVE +T+KNL    K   ++++   EIEGFKTQY  SSLYAKL  NN K
Subjt:  PTTLFDAKESLFTEELEPNEDFSLKLVQKNVEESPPWDSERSTVTDDGDSMKVEADTSKNL----KTQRIQQQIQHEIEGFKTQY--SSLYAKLVNNNNK

Query:  DKHKDKDLMEKEGGPTATSTVTSSSQGAFAKSKSYSSGASNVLRQWITCGPGAVDTNDAVLIKNRSPKDPLNQPPPEKLKNDAVLCRDDVLGGSARVLGN
        +K KDKDLM KEGG TATSTV+SSSQ AF KSKSYSSGAS+V RQ ITCG GAVDTND VL+KNRSPKD    PP EK KNDAV+CRD+VLGGSARV+G+
Subjt:  DKHKDKDLMEKEGGPTATSTVTSSSQGAFAKSKSYSSGASNVLRQWITCGPGAVDTNDAVLIKNRSPKDPLNQPPPEKLKNDAVLCRDDVLGGSARVLGN

Query:  SWD-GNLEIRRQYSPQTSRKSFDDSRKKRPKESGGRKVEAATTYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK-TASTSDKTTSSNSTSTRTSD
        SWD GNLEIRR    QTSR S DD RKKRPKE+G  KV AATTYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK   +TSDKTT S ST+T TSD
Subjt:  SWD-GNLEIRRQYSPQTSRKSFDDSRKKRPKESGGRKVEAATTYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK-TASTSDKTTSSNSTSTRTSD

Query:  KDSTSTYLLTN
        KD  STY+LTN
Subjt:  KDSTSTYLLTN

XP_008461227.1 PREDICTED: uncharacterized protein LOC103499876 isoform X2 [Cucumis melo]1.0e-16278.1Show/hide
Query:  NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIF
        N+GGE+RKVHIIYFLSRMGHVEQPHLIRVHHLAG  GGVYLRDVKRWLGELRG  MPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIF
Subjt:  NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIF

Query:  PTTLFDAKESLFTEELEPNEDFSLKLVQKNVEESPPWDSERSTVTDDGDSMKVEADTSKNL----KTQRIQQQIQHEIEGFKTQY--SSLYAKLVNNNNK
        P+TLFD K+SLF EELE +EDFS KL Q+N E+SPP+DSERSTVTDDGDS+KVE +T+KNL    K   ++++   EIEGFKTQY  SSLYAKL  NN K
Subjt:  PTTLFDAKESLFTEELEPNEDFSLKLVQKNVEESPPWDSERSTVTDDGDSMKVEADTSKNL----KTQRIQQQIQHEIEGFKTQY--SSLYAKLVNNNNK

Query:  DKHKDKDLMEKEGGPTATSTVTSSSQGAFAKSKSYSSGASNVLRQWITCGPGAVDTNDAVLIKNRSPKDPLNQPPPEKLKNDAVLCRDDVLGGSARVLGN
        +K KDKDLM KE    ATSTV+SSSQ AF KSKSYSSGAS+V RQ ITCG GAVDTND VL+KNRSPKD    PP EK KNDAV+CRD+VLGGSARV+G+
Subjt:  DKHKDKDLMEKEGGPTATSTVTSSSQGAFAKSKSYSSGASNVLRQWITCGPGAVDTNDAVLIKNRSPKDPLNQPPPEKLKNDAVLCRDDVLGGSARVLGN

Query:  SWD-GNLEIRRQYSPQTSRKSFDDSRKKRPKESGGRKVEAATTYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK-TASTSDKTTSSNSTSTRTSD
        SWD GNLEIRR    QTSR S DD RKKRPKE+G  KV AATTYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK   +TSDKTT S ST+T TSD
Subjt:  SWD-GNLEIRRQYSPQTSRKSFDDSRKKRPKESGGRKVEAATTYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK-TASTSDKTTSSNSTSTRTSD

Query:  KDSTSTYLLTN
        KD  STY+LTN
Subjt:  KDSTSTYLLTN

XP_008461228.1 PREDICTED: protein UPSTREAM OF FLC isoform X3 [Cucumis melo]3.4e-16678.83Show/hide
Query:  NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIF
        N+GGE+RKVHIIYFLSRMGHVEQPHLIRVHHLAG  GGVYLRDVKRWLGELRG  MPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIF
Subjt:  NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIF

Query:  PTTLFDAKESLFTEELEPNEDFSLKLVQKNVEESPPWDSERSTVTDDGDSMKVEADTSKNL----KTQRIQQQIQHEIEGFKTQY--SSLYAKLVNNNNK
        P+TLFD K+SLF EELE +EDFS KL Q+N E+SPP+DSERSTVTDDGDS+KVE +T+KNL    K   ++++   EIEGFKTQY  SSLYAKL  NN K
Subjt:  PTTLFDAKESLFTEELEPNEDFSLKLVQKNVEESPPWDSERSTVTDDGDSMKVEADTSKNL----KTQRIQQQIQHEIEGFKTQY--SSLYAKLVNNNNK

Query:  DKHKDKDLMEKEGGPTATSTVTSSSQGAFAKSKSYSSGASNVLRQWITCGPGAVDTNDAVLIKNRSPKDPLNQPPPEKLKNDAVLCRDDVLGGSARVLGN
        +K KDKDLM KEGG TATSTV+SSSQ AF KSKSYSSGAS+V RQ ITCG GAVDTND VL+KNRSPKD    PP EK KNDAV+CRD+VLGGSARV+G+
Subjt:  DKHKDKDLMEKEGGPTATSTVTSSSQGAFAKSKSYSSGASNVLRQWITCGPGAVDTNDAVLIKNRSPKDPLNQPPPEKLKNDAVLCRDDVLGGSARVLGN

Query:  SWD-GNLEIRRQYSPQTSRKSFDDSRKKRPKESGGRKVEAATTYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK-TASTSDKTTSSNSTSTRTSD
        SWD GNLEIRR    QTSR S DD RKKRPKE+G  KV AATTYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK   +TSDKTT S ST+T TSD
Subjt:  SWD-GNLEIRRQYSPQTSRKSFDDSRKKRPKESGGRKVEAATTYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK-TASTSDKTTSSNSTSTRTSD

Query:  KDSTSTYLLTN
        KD  STY+LTN
Subjt:  KDSTSTYLLTN

XP_038899908.1 LOW QUALITY PROTEIN: uncharacterized protein LOC120087098 [Benincasa hispida]3.0e-18682.63Show/hide
Query:  MEANKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQ
        MEANKGGE+RKVHIIYFLSRMGHVEQPHLIRVHHL GGGGGVYLRDVKRWLGELRG +MPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQ
Subjt:  MEANKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQ

Query:  II-FPTTLF----------------DAKESLFTEELEPNEDFSLKLVQKNVEESPPWDSERSTVTDDGDSMKVEADTSKNLKT---QRIQQQIQHEIEGF
        II FP   F                D K+SLF EELE NEDF  K +QKNVEESPP DSERSTVTDDGDSMKVE +T+KNL+T   Q I+++I HEIEGF
Subjt:  II-FPTTLF----------------DAKESLFTEELEPNEDFSLKLVQKNVEESPPWDSERSTVTDDGDSMKVEADTSKNLKT---QRIQQQIQHEIEGF

Query:  KTQYSSLYAKLVNNNNKDKHKDKDLMEKEGGPTATSTVTSSSQGAFAKSKSYSSGASNVLRQWITCGPGAVDTNDAVLIKNRSPKDPLNQPPPEKLKNDA
        KTQYSSLYAKLV  N+K+K K+KDLMEKEGGPTATSTVTSSSQ AF KSKSYSSGAS+VLRQWITCGPGAVDTNDAVL+KNRSPKDP NQPPP KLKNDA
Subjt:  KTQYSSLYAKLVNNNNKDKHKDKDLMEKEGGPTATSTVTSSSQGAFAKSKSYSSGASNVLRQWITCGPGAVDTNDAVLIKNRSPKDPLNQPPPEKLKNDA

Query:  VLCRDDVLGGSARVLGNSWDGNLEIRRQYSPQTSRKSFDDSRKKRPKESGGRKVEAATTYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTKTASTS
        V CRDDVLGGSARV+GNS +GNLEIRRQYSPQTSRKSFDDSRKKRPKESG RKV A  TYKPMAGPNC LCGKTFRPEKMHSHMKSC+GI+SLTKTASTS
Subjt:  VLCRDDVLGGSARVLGNSWDGNLEIRRQYSPQTSRKSFDDSRKKRPKESGGRKVEAATTYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTKTASTS

Query:  DKTTSSNSTSTRTSDKDSTSTYLLTN
        DK   S ST+TRTSDKDS STYLLTN
Subjt:  DKTTSSNSTSTRTSDKDSTSTYLLTN

TrEMBL top hitse value%identityAlignment
A0A1S3CE79 protein UPSTREAM OF FLC isoform X11.7e-16678.83Show/hide
Query:  NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIF
        N+GGE+RKVHIIYFLSRMGHVEQPHLIRVHHLAG  GGVYLRDVKRWLGELRG  MPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIF
Subjt:  NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIF

Query:  PTTLFDAKESLFTEELEPNEDFSLKLVQKNVEESPPWDSERSTVTDDGDSMKVEADTSKNL----KTQRIQQQIQHEIEGFKTQY--SSLYAKLVNNNNK
        P+TLFD K+SLF EELE +EDFS KL Q+N E+SPP+DSERSTVTDDGDS+KVE +T+KNL    K   ++++   EIEGFKTQY  SSLYAKL  NN K
Subjt:  PTTLFDAKESLFTEELEPNEDFSLKLVQKNVEESPPWDSERSTVTDDGDSMKVEADTSKNL----KTQRIQQQIQHEIEGFKTQY--SSLYAKLVNNNNK

Query:  DKHKDKDLMEKEGGPTATSTVTSSSQGAFAKSKSYSSGASNVLRQWITCGPGAVDTNDAVLIKNRSPKDPLNQPPPEKLKNDAVLCRDDVLGGSARVLGN
        +K KDKDLM KEGG TATSTV+SSSQ AF KSKSYSSGAS+V RQ ITCG GAVDTND VL+KNRSPKD    PP EK KNDAV+CRD+VLGGSARV+G+
Subjt:  DKHKDKDLMEKEGGPTATSTVTSSSQGAFAKSKSYSSGASNVLRQWITCGPGAVDTNDAVLIKNRSPKDPLNQPPPEKLKNDAVLCRDDVLGGSARVLGN

Query:  SWD-GNLEIRRQYSPQTSRKSFDDSRKKRPKESGGRKVEAATTYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK-TASTSDKTTSSNSTSTRTSD
        SWD GNLEIRR    QTSR S DD RKKRPKE+G  KV AATTYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK   +TSDKTT S ST+T TSD
Subjt:  SWD-GNLEIRRQYSPQTSRKSFDDSRKKRPKESGGRKVEAATTYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK-TASTSDKTTSSNSTSTRTSD

Query:  KDSTSTYLLTN
        KD  STY+LTN
Subjt:  KDSTSTYLLTN

A0A1S3CE85 uncharacterized protein LOC103499876 isoform X25.0e-16378.1Show/hide
Query:  NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIF
        N+GGE+RKVHIIYFLSRMGHVEQPHLIRVHHLAG  GGVYLRDVKRWLGELRG  MPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIF
Subjt:  NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIF

Query:  PTTLFDAKESLFTEELEPNEDFSLKLVQKNVEESPPWDSERSTVTDDGDSMKVEADTSKNL----KTQRIQQQIQHEIEGFKTQY--SSLYAKLVNNNNK
        P+TLFD K+SLF EELE +EDFS KL Q+N E+SPP+DSERSTVTDDGDS+KVE +T+KNL    K   ++++   EIEGFKTQY  SSLYAKL  NN K
Subjt:  PTTLFDAKESLFTEELEPNEDFSLKLVQKNVEESPPWDSERSTVTDDGDSMKVEADTSKNL----KTQRIQQQIQHEIEGFKTQY--SSLYAKLVNNNNK

Query:  DKHKDKDLMEKEGGPTATSTVTSSSQGAFAKSKSYSSGASNVLRQWITCGPGAVDTNDAVLIKNRSPKDPLNQPPPEKLKNDAVLCRDDVLGGSARVLGN
        +K KDKDLM KE    ATSTV+SSSQ AF KSKSYSSGAS+V RQ ITCG GAVDTND VL+KNRSPKD    PP EK KNDAV+CRD+VLGGSARV+G+
Subjt:  DKHKDKDLMEKEGGPTATSTVTSSSQGAFAKSKSYSSGASNVLRQWITCGPGAVDTNDAVLIKNRSPKDPLNQPPPEKLKNDAVLCRDDVLGGSARVLGN

Query:  SWD-GNLEIRRQYSPQTSRKSFDDSRKKRPKESGGRKVEAATTYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK-TASTSDKTTSSNSTSTRTSD
        SWD GNLEIRR    QTSR S DD RKKRPKE+G  KV AATTYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK   +TSDKTT S ST+T TSD
Subjt:  SWD-GNLEIRRQYSPQTSRKSFDDSRKKRPKESGGRKVEAATTYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK-TASTSDKTTSSNSTSTRTSD

Query:  KDSTSTYLLTN
        KD  STY+LTN
Subjt:  KDSTSTYLLTN

A0A1S3CEM9 protein UPSTREAM OF FLC isoform X31.7e-16678.83Show/hide
Query:  NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIF
        N+GGE+RKVHIIYFLSRMGHVEQPHLIRVHHLAG  GGVYLRDVKRWLGELRG  MPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIF
Subjt:  NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIF

Query:  PTTLFDAKESLFTEELEPNEDFSLKLVQKNVEESPPWDSERSTVTDDGDSMKVEADTSKNL----KTQRIQQQIQHEIEGFKTQY--SSLYAKLVNNNNK
        P+TLFD K+SLF EELE +EDFS KL Q+N E+SPP+DSERSTVTDDGDS+KVE +T+KNL    K   ++++   EIEGFKTQY  SSLYAKL  NN K
Subjt:  PTTLFDAKESLFTEELEPNEDFSLKLVQKNVEESPPWDSERSTVTDDGDSMKVEADTSKNL----KTQRIQQQIQHEIEGFKTQY--SSLYAKLVNNNNK

Query:  DKHKDKDLMEKEGGPTATSTVTSSSQGAFAKSKSYSSGASNVLRQWITCGPGAVDTNDAVLIKNRSPKDPLNQPPPEKLKNDAVLCRDDVLGGSARVLGN
        +K KDKDLM KEGG TATSTV+SSSQ AF KSKSYSSGAS+V RQ ITCG GAVDTND VL+KNRSPKD    PP EK KNDAV+CRD+VLGGSARV+G+
Subjt:  DKHKDKDLMEKEGGPTATSTVTSSSQGAFAKSKSYSSGASNVLRQWITCGPGAVDTNDAVLIKNRSPKDPLNQPPPEKLKNDAVLCRDDVLGGSARVLGN

Query:  SWD-GNLEIRRQYSPQTSRKSFDDSRKKRPKESGGRKVEAATTYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK-TASTSDKTTSSNSTSTRTSD
        SWD GNLEIRR    QTSR S DD RKKRPKE+G  KV AATTYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK   +TSDKTT S ST+T TSD
Subjt:  SWD-GNLEIRRQYSPQTSRKSFDDSRKKRPKESGGRKVEAATTYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK-TASTSDKTTSSNSTSTRTSD

Query:  KDSTSTYLLTN
        KD  STY+LTN
Subjt:  KDSTSTYLLTN

A0A6J1CUY7 protein UPSTREAM OF FLC isoform X24.0e-13667.36Show/hide
Query:  MEA-NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGG----GGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYV
        MEA NKGGE+R+VHI+YFLSRMGHVEQPHLIRVHHLA        GV+LRDVKRWLGELRG  MPEAFSWSYKRKYKTGYVWQDLVDDDLITP SDNEYV
Subjt:  MEA-NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGG----GGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYV

Query:  LQGSQII-FPTTLFDA----------KESLF-TEELEPNEDFSLKLVQKNVEESPPWDSERSTVTDDGDSMKVEADTSKNLKTQRIQQQIQHEIEGFKTQ
        LQGSQII FP    ++          K S+F ++ELEP  DF  KL      ESP  DSERSTVTDDGDS+KVE +T   L+T +    ++ EIEGF  Q
Subjt:  LQGSQII-FPTTLFDA----------KESLF-TEELEPNEDFSLKLVQKNVEESPPWDSERSTVTDDGDSMKVEADTSKNLKTQRIQQQIQHEIEGFKTQ

Query:  YSSLYAKLVNNNNKDKHKDKDLMEKEGGPTATSTVTSSSQG-------AFAKSKSYSSGASNVLRQWITCGPGAVDTNDAVLIKNRS-PKDPLNQPPPEK
        YSSLY KL     ++K+ +KD MEKEGGPTATSTV+SSS         AF KSKSYSSGAS+VLRQWITC  GAVDTND VLIKNRS  KDP N   PEK
Subjt:  YSSLYAKLVNNNNKDKHKDKDLMEKEGGPTATSTVTSSSQG-------AFAKSKSYSSGASNVLRQWITCGPGAVDTNDAVLIKNRS-PKDPLNQPPPEK

Query:  LKNDAVLCRDDVLGGSARVLGNSWDGNLEIRRQYSPQTSRKSFDDSRKKRPKESGGRKVEAAT-TYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLT
         KNDAV+CRDD+LGGSARVL +SWDG L+I R  + Q SRKSFD+ RKKRPKESGGRKV AAT  +K M GPNCS CGK+F+PEKMH+HMKSCRG+KSL 
Subjt:  LKNDAVLCRDDVLGGSARVLGNSWDGNLEIRRQYSPQTSRKSFDDSRKKRPKESGGRKVEAAT-TYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLT

Query:  KTASTSDKTTSSNSTSTRTSDKDSTSTYLLTN
        KT STS+KTTSS ST+T TS       Y LTN
Subjt:  KTASTSDKTTSSNSTSTRTSDKDSTSTYLLTN

A0A6J1CV58 protein UPSTREAM OF FLC isoform X14.0e-13667.36Show/hide
Query:  MEA-NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGG----GGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYV
        MEA NKGGE+R+VHI+YFLSRMGHVEQPHLIRVHHLA        GV+LRDVKRWLGELRG  MPEAFSWSYKRKYKTGYVWQDLVDDDLITP SDNEYV
Subjt:  MEA-NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGG----GGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYV

Query:  LQGSQII-FPTTLFDA----------KESLF-TEELEPNEDFSLKLVQKNVEESPPWDSERSTVTDDGDSMKVEADTSKNLKTQRIQQQIQHEIEGFKTQ
        LQGSQII FP    ++          K S+F ++ELEP  DF  KL      ESP  DSERSTVTDDGDS+KVE +T   L+T +    ++ EIEGF  Q
Subjt:  LQGSQII-FPTTLFDA----------KESLF-TEELEPNEDFSLKLVQKNVEESPPWDSERSTVTDDGDSMKVEADTSKNLKTQRIQQQIQHEIEGFKTQ

Query:  YSSLYAKLVNNNNKDKHKDKDLMEKEGGPTATSTVTSSSQG-------AFAKSKSYSSGASNVLRQWITCGPGAVDTNDAVLIKNRS-PKDPLNQPPPEK
        YSSLY KL     ++K+ +KD MEKEGGPTATSTV+SSS         AF KSKSYSSGAS+VLRQWITC  GAVDTND VLIKNRS  KDP N   PEK
Subjt:  YSSLYAKLVNNNNKDKHKDKDLMEKEGGPTATSTVTSSSQG-------AFAKSKSYSSGASNVLRQWITCGPGAVDTNDAVLIKNRS-PKDPLNQPPPEK

Query:  LKNDAVLCRDDVLGGSARVLGNSWDGNLEIRRQYSPQTSRKSFDDSRKKRPKESGGRKVEAAT-TYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLT
         KNDAV+CRDD+LGGSARVL +SWDG L+I R  + Q SRKSFD+ RKKRPKESGGRKV AAT  +K M GPNCS CGK+F+PEKMH+HMKSCRG+KSL 
Subjt:  LKNDAVLCRDDVLGGSARVLGNSWDGNLEIRRQYSPQTSRKSFDDSRKKRPKESGGRKVEAAT-TYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLT

Query:  KTASTSDKTTSSNSTSTRTSDKDSTSTYLLTN
        KT STS+KTTSS ST+T TS       Y LTN
Subjt:  KTASTSDKTTSSNSTSTRTSDKDSTSTYLLTN

SwissProt top hitse value%identityAlignment
A0A2R6X6S3 Protein SOSEKI3.5e-2030.36Show/hide
Query:  KVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIFPTTLFDA
        KV ++Y+LSR G ++QPHLI V  ++    G+YLRDVKR L  +RG  M ++FSWS KR YK  ++WQDL DDD I P+SD E VL+GS++    T F  
Subjt:  KVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIFPTTLFDA

Query:  KESLFTEELEPNEDFSL-KLVQKNVEESPPWDSERSTVTDDGDSMKVEADTSKNLKTQRIQQQIQHEIEGFKTQYSSLYAKLVNNNNKDKHKDKDLMEKE
        K      + +P     L   ++K   +   +++ + ++  + D ++ ++D +  L          H     K +   L  +++N+ ++ K    +    +
Subjt:  KESLFTEELEPNEDFSL-KLVQKNVEESPPWDSERSTVTDDGDSMKVEADTSKNLKTQRIQQQIQHEIEGFKTQYSSLYAKLVNNNNKDKHKDKDLMEKE

Query:  GGPTATSTVTSSSQGAFAKSKSYS
           T  S+ T  S+ +  +  S++
Subjt:  GGPTATSTVTSSSQGAFAKSKSYS

Q8GY65 Protein SOSEKI 45.6e-1836.81Show/hide
Query:  NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIF
        +K    R V ++Y+LSR G ++ PH I V        G+YL+DV   L +LRGN M   +SWS KR YK G+VW DL D+D I P+   EYVL+GSQI+ 
Subjt:  NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIF

Query:  PTTLFDAKESLFTEELEPNEDFS-------LKLVQKNVEESPPWDSERSTVTDDGDSMKVEAD
           L +   +        N+ +S        K  + N E +     + ST TDD    K   D
Subjt:  PTTLFDAKESLFTEELEPNEDFS-------LKLVQKNVEESPPWDSERSTVTDDGDSMKVEAD

Q8GYT8 Protein SOSEKI 31.1e-1840.41Show/hide
Query:  EVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIFPTTL
        +++KV I+Y+LS+   +E PH + V  L     G+YLRDV   L  LRG  M   +SWS KR Y+ G+VW DL +DDLI P + NEYVL+GS+      L
Subjt:  EVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIFPTTL

Query:  FDAKESLFTEELEPNEDFSLKLVQKNVEESPPWDSERSTVTDDGDS
        FD   S   +   P  + + + +++ V E P   S RS   DD  S
Subjt:  FDAKESLFTEELEPNEDFSLKLVQKNVEESPPWDSERSTVTDDGDS

Q9FJF5 Protein SOSEKI 51.2e-1727.8Show/hide
Query:  RKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQII------F
        RKV ++Y+L R G ++ PH I V        G+YL+DV   L +LRG  M   +SWS KR YK G+VW DL +DD I P+   EYVL+GS+++       
Subjt:  RKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQII------F

Query:  PTTLFDAKESLFTEELEPNEDFSLKL-VQKNVEESPPWDS--------ERSTVTDDGDSMKVEADTSKNLKTQRIQQQ-IQHEIEGFKT--QYSSLYAKL
        P +L +         L P+++    +    N   +  W S         ++T +    + ++ AD S     +R +++  + EIE  K+   Y +   +L
Subjt:  PTTLFDAKESLFTEELEPNEDFSLKL-VQKNVEESPPWDS--------ERSTVTDDGDSMKVEADTSKNLKTQRIQQQ-IQHEIEGFKT--QYSSLYAKL

Query:  VNNNNKDKHKD------KDLMEKEGGPTATSTVTSSSQGAFAKSKSYSSGASNVLRQWITCGPGAVDTNDAVLIKNR
          +       D      ++L++ +G      + +S+         S    AS VL Q I+CG  +      VL+K++
Subjt:  VNNNNKDKHKD------KDLMEKEGGPTATSTVTSSSQGAFAKSKSYSSGASNVLRQWITCGPGAVDTNDAVLIKNR

Q9SYJ8 Protein SOSEKI 12.6e-5540.54Show/hide
Query:  MEAN-KGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGS
        ME+N  GGEVR+V+++YFLSR GHV+ PHL+RVHHL+    GV+LRDVK+WL + RG+ MP+AFSWS KR+YK GYVWQDL+DDDLITPISDNEYVL+GS
Subjt:  MEAN-KGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGS

Query:  QIIFPTTLFD----AKESLFTEE--LEPNEDF-SLKLVQKNVE-ESPPWDSERSTVTDDGDSMKVEADTSKNLKTQRIQQQIQHEIEGFKTQYSSLYAKL
        +I+  +   D     K++  T    ++  E    LKL  + ++ ESP + S+RST T                 T  + ++     EGF  +      K 
Subjt:  QIIFPTTLFD----AKESLFTEE--LEPNEDF-SLKLVQKNVE-ESPPWDSERSTVTDDGDSMKVEADTSKNLKTQRIQQQIQHEIEGFKTQYSSLYAKL

Query:  VNNNNKDKHKDKDLMEKEGGPTATSTVTSSSQGAFAKSKSYSS-GASNVLRQWITCGPGAVDTNDAVLIKNRSPKDPLNQPPPEKLKNDAVLCRDDVLGG
        V+       ++    + E G  + S+ TSSS  ++ K+KSYSS  AS+VLR  + C  G +DTNDAVL+       PLN+                    
Subjt:  VNNNNKDKHKDKDLMEKEGGPTATSTVTSSSQGAFAKSKSYSS-GASNVLRQWITCGPGAVDTNDAVLIKNRSPKDPLNQPPPEKLKNDAVLCRDDVLGG

Query:  SARVLGNSWDGNLEIRRQYSPQ-TSRKSFDDS----RKKRPKESGGRKVEAATTYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTKTASTSDKTTS
         +   G +W+   E R QY  Q  +RKSF+ +    + K   E    KV  +   KP   P CS CGK F+PEKMHSHMK CRG+K+   +++ +D  TS
Subjt:  SARVLGNSWDGNLEIRRQYSPQ-TSRKSFDDS----RKKRPKESGGRKVEAATTYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTKTASTSDKTTS

Query:  SNSTSTR
        +N+   R
Subjt:  SNSTSTR

Arabidopsis top hitse value%identityAlignment
AT1G05577.1 Domain of unknown function (DUF966)1.8e-5640.54Show/hide
Query:  MEAN-KGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGS
        ME+N  GGEVR+V+++YFLSR GHV+ PHL+RVHHL+    GV+LRDVK+WL + RG+ MP+AFSWS KR+YK GYVWQDL+DDDLITPISDNEYVL+GS
Subjt:  MEAN-KGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGS

Query:  QIIFPTTLFD----AKESLFTEE--LEPNEDF-SLKLVQKNVE-ESPPWDSERSTVTDDGDSMKVEADTSKNLKTQRIQQQIQHEIEGFKTQYSSLYAKL
        +I+  +   D     K++  T    ++  E    LKL  + ++ ESP + S+RST T                 T  + ++     EGF  +      K 
Subjt:  QIIFPTTLFD----AKESLFTEE--LEPNEDF-SLKLVQKNVE-ESPPWDSERSTVTDDGDSMKVEADTSKNLKTQRIQQQIQHEIEGFKTQYSSLYAKL

Query:  VNNNNKDKHKDKDLMEKEGGPTATSTVTSSSQGAFAKSKSYSS-GASNVLRQWITCGPGAVDTNDAVLIKNRSPKDPLNQPPPEKLKNDAVLCRDDVLGG
        V+       ++    + E G  + S+ TSSS  ++ K+KSYSS  AS+VLR  + C  G +DTNDAVL+       PLN+                    
Subjt:  VNNNNKDKHKDKDLMEKEGGPTATSTVTSSSQGAFAKSKSYSS-GASNVLRQWITCGPGAVDTNDAVLIKNRSPKDPLNQPPPEKLKNDAVLCRDDVLGG

Query:  SARVLGNSWDGNLEIRRQYSPQ-TSRKSFDDS----RKKRPKESGGRKVEAATTYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTKTASTSDKTTS
         +   G +W+   E R QY  Q  +RKSF+ +    + K   E    KV  +   KP   P CS CGK F+PEKMHSHMK CRG+K+   +++ +D  TS
Subjt:  SARVLGNSWDGNLEIRRQYSPQ-TSRKSFDDS----RKKRPKESGGRKVEAATTYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTKTASTSDKTTS

Query:  SNSTSTR
        +N+   R
Subjt:  SNSTSTR

AT2G28150.1 Domain of unknown function (DUF966)8.0e-2040.41Show/hide
Query:  EVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIFPTTL
        +++KV I+Y+LS+   +E PH + V  L     G+YLRDV   L  LRG  M   +SWS KR Y+ G+VW DL +DDLI P + NEYVL+GS+      L
Subjt:  EVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIFPTTL

Query:  FDAKESLFTEELEPNEDFSLKLVQKNVEESPPWDSERSTVTDDGDS
        FD   S   +   P  + + + +++ V E P   S RS   DD  S
Subjt:  FDAKESLFTEELEPNEDFSLKLVQKNVEESPPWDSERSTVTDDGDS

AT3G46110.1 Domain of unknown function (DUF966)3.9e-1936.81Show/hide
Query:  NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIF
        +K    R V ++Y+LSR G ++ PH I V        G+YL+DV   L +LRGN M   +SWS KR YK G+VW DL D+D I P+   EYVL+GSQI+ 
Subjt:  NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIF

Query:  PTTLFDAKESLFTEELEPNEDFS-------LKLVQKNVEESPPWDSERSTVTDDGDSMKVEAD
           L +   +        N+ +S        K  + N E +     + ST TDD    K   D
Subjt:  PTTLFDAKESLFTEELEPNEDFS-------LKLVQKNVEESPPWDSERSTVTDDGDSMKVEAD

AT3G46110.2 Domain of unknown function (DUF966)3.9e-1936.81Show/hide
Query:  NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIF
        +K    R V ++Y+LSR G ++ PH I V        G+YL+DV   L +LRGN M   +SWS KR YK G+VW DL D+D I P+   EYVL+GSQI+ 
Subjt:  NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIF

Query:  PTTLFDAKESLFTEELEPNEDFS-------LKLVQKNVEESPPWDSERSTVTDDGDSMKVEAD
           L +   +        N+ +S        K  + N E +     + ST TDD    K   D
Subjt:  PTTLFDAKESLFTEELEPNEDFS-------LKLVQKNVEESPPWDSERSTVTDDGDSMKVEAD

AT5G59790.1 Domain of unknown function (DUF966)8.8e-1927.8Show/hide
Query:  RKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQII------F
        RKV ++Y+L R G ++ PH I V        G+YL+DV   L +LRG  M   +SWS KR YK G+VW DL +DD I P+   EYVL+GS+++       
Subjt:  RKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQII------F

Query:  PTTLFDAKESLFTEELEPNEDFSLKL-VQKNVEESPPWDS--------ERSTVTDDGDSMKVEADTSKNLKTQRIQQQ-IQHEIEGFKT--QYSSLYAKL
        P +L +         L P+++    +    N   +  W S         ++T +    + ++ AD S     +R +++  + EIE  K+   Y +   +L
Subjt:  PTTLFDAKESLFTEELEPNEDFSLKL-VQKNVEESPPWDS--------ERSTVTDDGDSMKVEADTSKNLKTQRIQQQ-IQHEIEGFKT--QYSSLYAKL

Query:  VNNNNKDKHKD------KDLMEKEGGPTATSTVTSSSQGAFAKSKSYSSGASNVLRQWITCGPGAVDTNDAVLIKNR
          +       D      ++L++ +G      + +S+         S    AS VL Q I+CG  +      VL+K++
Subjt:  VNNNNKDKHKD------KDLMEKEGGPTATSTVTSSSQGAFAKSKSYSSGASNVLRQWITCGPGAVDTNDAVLIKNR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGCTAATAAAGGTGGAGAAGTTAGAAAAGTTCATATTATTTACTTTCTTAGCCGGATGGGACACGTGGAGCAACCCCATCTCATCCGCGTTCATCATCTTGCCGG
CGGCGGCGGCGGCGTTTATCTCCGAGATGTAAAGAGATGGCTAGGGGAATTGAGAGGAAATAACATGCCAGAAGCCTTCTCATGGTCATACAAAAGAAAGTACAAAACAG
GGTACGTTTGGCAAGACCTGGTGGATGACGATCTCATAACTCCAATATCTGACAACGAATATGTCCTTCAAGGATCCCAAATCATATTTCCCACTACTCTCTTTGATGCT
AAAGAATCATTGTTCACAGAAGAATTGGAACCAAATGAAGATTTTTCATTGAAACTTGTGCAAAAGAATGTAGAAGAATCTCCGCCGTGGGATTCGGAGAGGTCGACGGT
GACGGACGATGGGGATTCCATGAAGGTTGAAGCAGATACAAGTAAGAATTTGAAGACACAGAGGATACAACAACAAATACAACACGAAATTGAAGGGTTTAAGACACAAT
ATTCTTCTTTGTACGCAAAATTGGTGAACAACAACAACAAGGACAAACATAAAGATAAGGATCTAATGGAGAAGGAAGGTGGACCCACAGCAACGTCAACAGTTACGTCA
TCATCCCAAGGTGCATTCGCAAAGAGCAAGAGCTACTCAAGCGGAGCTTCCAACGTGCTTCGCCAGTGGATCACGTGCGGTCCCGGGGCAGTAGACACAAACGACGCCGT
TTTGATCAAGAATCGATCTCCCAAAGATCCTTTGAATCAACCGCCACCGGAAAAACTCAAAAACGACGCCGTGTTATGCAGAGACGACGTGTTGGGCGGCTCCGCTCGAG
TTCTCGGAAATTCTTGGGATGGGAACCTCGAAATCCGCCGCCAATACAGCCCGCAAACTTCTCGTAAAAGCTTCGACGATTCAAGGAAGAAAAGGCCAAAGGAGAGCGGT
GGCAGAAAGGTGGAGGCCGCCACAACCTACAAGCCGATGGCAGGACCAAACTGCTCGCTATGTGGGAAGACTTTTAGGCCGGAGAAAATGCATTCACACATGAAATCATG
TAGAGGAATCAAGTCTCTTACTAAGACTGCTTCCACATCAGACAAAACGACATCGTCCAATTCAACTTCGACAAGAACTTCCGACAAGGATTCAACTTCCACATACTTGT
TGACCAACTGA
mRNA sequenceShow/hide mRNA sequence
ATTAGATTAAAGAATAATTAACCCATTCCTCTCCCTAATTTCTCTGCTCATTCTCTCTTTAAATTTTTCTCTCTAAACACAAACCGCAAGTAAGAAGAAGCCAAGTGGGT
GGAAGAAAAATTTAAGAACGAGAGACAAATCAGAAGCTTAGAAAAAGAGCGTTAATGGAGGCTAATAAAGGTGGAGAAGTTAGAAAAGTTCATATTATTTACTTTCTTAG
CCGGATGGGACACGTGGAGCAACCCCATCTCATCCGCGTTCATCATCTTGCCGGCGGCGGCGGCGGCGTTTATCTCCGAGATGTAAAGAGATGGCTAGGGGAATTGAGAG
GAAATAACATGCCAGAAGCCTTCTCATGGTCATACAAAAGAAAGTACAAAACAGGGTACGTTTGGCAAGACCTGGTGGATGACGATCTCATAACTCCAATATCTGACAAC
GAATATGTCCTTCAAGGATCCCAAATCATATTTCCCACTACTCTCTTTGATGCTAAAGAATCATTGTTCACAGAAGAATTGGAACCAAATGAAGATTTTTCATTGAAACT
TGTGCAAAAGAATGTAGAAGAATCTCCGCCGTGGGATTCGGAGAGGTCGACGGTGACGGACGATGGGGATTCCATGAAGGTTGAAGCAGATACAAGTAAGAATTTGAAGA
CACAGAGGATACAACAACAAATACAACACGAAATTGAAGGGTTTAAGACACAATATTCTTCTTTGTACGCAAAATTGGTGAACAACAACAACAAGGACAAACATAAAGAT
AAGGATCTAATGGAGAAGGAAGGTGGACCCACAGCAACGTCAACAGTTACGTCATCATCCCAAGGTGCATTCGCAAAGAGCAAGAGCTACTCAAGCGGAGCTTCCAACGT
GCTTCGCCAGTGGATCACGTGCGGTCCCGGGGCAGTAGACACAAACGACGCCGTTTTGATCAAGAATCGATCTCCCAAAGATCCTTTGAATCAACCGCCACCGGAAAAAC
TCAAAAACGACGCCGTGTTATGCAGAGACGACGTGTTGGGCGGCTCCGCTCGAGTTCTCGGAAATTCTTGGGATGGGAACCTCGAAATCCGCCGCCAATACAGCCCGCAA
ACTTCTCGTAAAAGCTTCGACGATTCAAGGAAGAAAAGGCCAAAGGAGAGCGGTGGCAGAAAGGTGGAGGCCGCCACAACCTACAAGCCGATGGCAGGACCAAACTGCTC
GCTATGTGGGAAGACTTTTAGGCCGGAGAAAATGCATTCACACATGAAATCATGTAGAGGAATCAAGTCTCTTACTAAGACTGCTTCCACATCAGACAAAACGACATCGT
CCAATTCAACTTCGACAAGAACTTCCGACAAGGATTCAACTTCCACATACTTGTTGACCAACTGAGAGACTTTTTGCTGTTAAAAATCAATTTGTCAAGAGAGTATTATT
TTATGTGGTCACTCTCAATTCACAGATATGGTGTAAAGCTAGAAACTATAATTTGAAGGGTAGTTCTATGAAGCTATCATATAATCAAATAGAATTTTTTCCTCTTGGTT
GAA
Protein sequenceShow/hide protein sequence
MEANKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGGGGGVYLRDVKRWLGELRGNNMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIFPTTLFDA
KESLFTEELEPNEDFSLKLVQKNVEESPPWDSERSTVTDDGDSMKVEADTSKNLKTQRIQQQIQHEIEGFKTQYSSLYAKLVNNNNKDKHKDKDLMEKEGGPTATSTVTS
SSQGAFAKSKSYSSGASNVLRQWITCGPGAVDTNDAVLIKNRSPKDPLNQPPPEKLKNDAVLCRDDVLGGSARVLGNSWDGNLEIRRQYSPQTSRKSFDDSRKKRPKESG
GRKVEAATTYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTKTASTSDKTTSSNSTSTRTSDKDSTSTYLLTN