CuGenDBv2

HG10023183 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10023183
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: protein UPSTREAM OF FLC isoform X1
Genome location: Chr05:31928680..31933490
RNA-Seq Expression: HG10023183
Synteny: HG10023183
Gene Ontology terms: GO:0051258 - protein polymerization (biological process)
                     GO:0005886 - plasma membrane (cellular component)
InterPro domains: IPR010369 - Protein SOSEKI
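
As a small illustration, the genome location string above can be split into chromosome, start, and end. This is only a sketch: it assumes the coordinates are 1-based and inclusive, which this record does not state, and the function name `parse_location` is hypothetical.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("Chr05:31928680..31933490")
# Span length under the assumed 1-based, inclusive convention.
span = end - start + 1
print(chrom, start, end, span)
```

If the coordinates turn out to be 0-based or half-open, the `+ 1` in the span calculation would need to change.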


Homology
GenBank top hits (e value, % identity, alignment)
XP_004136105.3 uncharacterized protein LOC101207468 isoform X1 [Cucumis sativus] (e value: 2.2e-163; % identity: 75.57)
Query:  MMKLRKRALMEANK----GGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDD
        MMKLRKRALMEAN     GGE+RKVHIIYFLSRMGHVEQPHLIRVHHLA    GG  GVYLRDVKRWL ELRGKEM EAFSWSYKRKYKTGYVWQDLVDD
Subjt:  MMKLRKRALMEANK----GGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDD

Query:  DLITPISDNEYVLQGSQIIFPTTLFDTKKSLFREELELNGDFSSKLLQKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLETHPKQGI-EEEIHEIEG
        DLITPISDNEYVLQGSQIIFP+TLFDTK SLFREELE++ DFSSK+ + N E+SPP DSERSTVTDDGDSMKVE ET + NLET+ KQG+ EEE+ +IEG
Subjt:  DLITPISDNEYVLQGSQIIFPTTLFDTKKSLFREELELNGDFSSKLLQKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLETHPKQGI-EEEIHEIEG

Query:  FKTQF--SSLYAKLVINNKEKDKDKNLMEKEGGPTATSTVTSSSQPAFTKSKSYSSGASNVLRQWITCGPGAVDTNDAVLVKNRSPKHPPNQPPSEKHRN
         KTQ+  SSLYAKL  N KEKDKDK+LM KEGGPTATSTV+SS+QPAFTKSKSYSSGAS+V RQ ITCG GA DTND VLVKNRS      QP     +N
Subjt:  FKTQF--SSLYAKLVINNKEKDKDKNLMEKEGGPTATSTVTSSSQPAFTKSKSYSSGASNVLRQWITCGPGAVDTNDAVLVKNRSPKHPPNQPPSEKHRN

Query:  DAVFCRDDVLGGSARVLGNSWD-ANLEIRRQYSPQTSRKKFDDSRKKRSKESGGKKVAATATYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLT---
        DAV CRD+VLGGSARV+GNSWD  NLEIRR    QTSRK  DD RKKR KE+G  KV ATATYKPMAGPNCSLCGK+FRPEKMHSHMKSCRG+K LT   
Subjt:  DAVFCRDDVLGGSARVLGNSWD-ANLEIRRQYSPQTSRKKFDDSRKKRSKESGGKKVAATATYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLT---

Query:  ------KTASTSDKTTSSKSTTTRTSDKDLASTYFLTN
                 +TSDK+T SKST   TSD DL STY LTN
Subjt:  ------KTASTSDKTTSSKSTTTRTSDKDLASTYFLTN

XP_008461226.1 PREDICTED: protein UPSTREAM OF FLC isoform X1 [Cucumis melo] (e value: 1.5e-177; % identity: 80.93)
Query:  MMKLRKRALMEA---NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDD
        MMKLRKRALMEA   N+GGE+RKVHIIYFLSRMGHVEQPHLIRVHHLAGA     GGVYLRDVKRWL ELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDD
Subjt:  MMKLRKRALMEA---NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDD

Query:  LITPISDNEYVLQGSQIIFPTTLFDTKKSLFREELELNGDFSSKLLQKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLETHPKQGI--EEEIHEIEG
        LITPISDNEYVLQGSQIIFP+TLFDTKKSLFREELEL+ DFSSKL Q+N E+SPPYDSERSTVTDDGDS+KVE ET + NLE +PKQG+  EEE+ EIEG
Subjt:  LITPISDNEYVLQGSQIIFPTTLFDTKKSLFREELELNGDFSSKLLQKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLETHPKQGI--EEEIHEIEG

Query:  FKTQF--SSLYAKLVINNKEKDKDKNLMEKEGGPTATSTVTSSSQPAFTKSKSYSSGASNVLRQWITCGPGAVDTNDAVLVKNRSPKHPPNQPPSEKHRN
        FKTQ+  SSLYAKL  N KEKDKDK+LM KEGG TATSTV+SSSQPAFTKSKSYSSGAS+V RQ ITCG GAVDTND VLVKNRSPK     PP EK +N
Subjt:  FKTQF--SSLYAKLVINNKEKDKDKNLMEKEGGPTATSTVTSSSQPAFTKSKSYSSGASNVLRQWITCGPGAVDTNDAVLVKNRSPKHPPNQPPSEKHRN

Query:  DAVFCRDDVLGGSARVLGNSWD-ANLEIRRQYSPQTSRKKFDDSRKKRSKESGGKKVAATATYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK-T
        DAV CRD+VLGGSARV+G+SWD  NLEIRR    QTSR   DD RKKR KE+G  KV A  TYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK  
Subjt:  DAVFCRDDVLGGSARVLGNSWD-ANLEIRRQYSPQTSRKKFDDSRKKRSKESGGKKVAATATYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK-T

Query:  ASTSDKTTSSKSTTTRTSDKDLASTYFLTN
         +TSDKTT SKSTTT TSDKDL STY LTN
Subjt:  ASTSDKTTSSKSTTTRTSDKDLASTYFLTN

XP_008461227.1 PREDICTED: uncharacterized protein LOC103499876 isoform X2 [Cucumis melo] (e value: 4.6e-174; % identity: 80.23)
Query:  MMKLRKRALMEA---NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDD
        MMKLRKRALMEA   N+GGE+RKVHIIYFLSRMGHVEQPHLIRVHHLAGA     GGVYLRDVKRWL ELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDD
Subjt:  MMKLRKRALMEA---NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDD

Query:  LITPISDNEYVLQGSQIIFPTTLFDTKKSLFREELELNGDFSSKLLQKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLETHPKQGI--EEEIHEIEG
        LITPISDNEYVLQGSQIIFP+TLFDTKKSLFREELEL+ DFSSKL Q+N E+SPPYDSERSTVTDDGDS+KVE ET + NLE +PKQG+  EEE+ EIEG
Subjt:  LITPISDNEYVLQGSQIIFPTTLFDTKKSLFREELELNGDFSSKLLQKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLETHPKQGI--EEEIHEIEG

Query:  FKTQF--SSLYAKLVINNKEKDKDKNLMEKEGGPTATSTVTSSSQPAFTKSKSYSSGASNVLRQWITCGPGAVDTNDAVLVKNRSPKHPPNQPPSEKHRN
        FKTQ+  SSLYAKL  N KEKDKDK+LM KE    ATSTV+SSSQPAFTKSKSYSSGAS+V RQ ITCG GAVDTND VLVKNRSPK     PP EK +N
Subjt:  FKTQF--SSLYAKLVINNKEKDKDKNLMEKEGGPTATSTVTSSSQPAFTKSKSYSSGASNVLRQWITCGPGAVDTNDAVLVKNRSPKHPPNQPPSEKHRN

Query:  DAVFCRDDVLGGSARVLGNSWD-ANLEIRRQYSPQTSRKKFDDSRKKRSKESGGKKVAATATYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK-T
        DAV CRD+VLGGSARV+G+SWD  NLEIRR    QTSR   DD RKKR KE+G  KV A  TYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK  
Subjt:  DAVFCRDDVLGGSARVLGNSWD-ANLEIRRQYSPQTSRKKFDDSRKKRSKESGGKKVAATATYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK-T

Query:  ASTSDKTTSSKSTTTRTSDKDLASTYFLTN
         +TSDKTT SKSTTT TSDKDL STY LTN
Subjt:  ASTSDKTTSSKSTTTRTSDKDLASTYFLTN

XP_008461228.1 PREDICTED: protein UPSTREAM OF FLC isoform X3 [Cucumis melo] (e value: 3.9e-173; % identity: 80.96)
Query:  NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGS
        N+GGE+RKVHIIYFLSRMGHVEQPHLIRVHHLAGA     GGVYLRDVKRWL ELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGS
Subjt:  NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGS

Query:  QIIFPTTLFDTKKSLFREELELNGDFSSKLLQKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLETHPKQGI--EEEIHEIEGFKTQF--SSLYAKLV
        QIIFP+TLFDTKKSLFREELEL+ DFSSKL Q+N E+SPPYDSERSTVTDDGDS+KVE ET + NLE +PKQG+  EEE+ EIEGFKTQ+  SSLYAKL 
Subjt:  QIIFPTTLFDTKKSLFREELELNGDFSSKLLQKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLETHPKQGI--EEEIHEIEGFKTQF--SSLYAKLV

Query:  INNKEKDKDKNLMEKEGGPTATSTVTSSSQPAFTKSKSYSSGASNVLRQWITCGPGAVDTNDAVLVKNRSPKHPPNQPPSEKHRNDAVFCRDDVLGGSAR
         N KEKDKDK+LM KEGG TATSTV+SSSQPAFTKSKSYSSGAS+V RQ ITCG GAVDTND VLVKNRSPK     PP EK +NDAV CRD+VLGGSAR
Subjt:  INNKEKDKDKNLMEKEGGPTATSTVTSSSQPAFTKSKSYSSGASNVLRQWITCGPGAVDTNDAVLVKNRSPKHPPNQPPSEKHRNDAVFCRDDVLGGSAR

Query:  VLGNSWD-ANLEIRRQYSPQTSRKKFDDSRKKRSKESGGKKVAATATYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK-TASTSDKTTSSKSTTT
        V+G+SWD  NLEIRR    QTSR   DD RKKR KE+G  KV A  TYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK   +TSDKTT SKSTTT
Subjt:  VLGNSWD-ANLEIRRQYSPQTSRKKFDDSRKKRSKESGGKKVAATATYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK-TASTSDKTTSSKSTTT

Query:  RTSDKDLASTYFLTN
         TSDKDL STY LTN
Subjt:  RTSDKDLASTYFLTN

XP_038899908.1 LOW QUALITY PROTEIN: uncharacterized protein LOC120087098 [Benincasa hispida] (e value: 1.0e-197; % identity: 85.13)
Query:  MKLRKRALMEANKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITP
        MKLRKRALMEANKGGE+RKVHIIYFLSRMGHVEQPHLIRVHHL     GGGGGVYLRDVKRWL ELRGK+MPEAFSWSYKRKYKTGYVWQDLVDDDLITP
Subjt:  MKLRKRALMEANKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITP

Query:  ISDNEYVLQGSQII-FPTTLF----------------DTKKSLFREELELNGDFSSKLLQKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLETHPKQ
        ISDNEYVLQGSQII FP   F                DTKKSLFREELELN DF SK LQKNVEESPP DSERSTVTDDGDSMKVE ET + NLET PKQ
Subjt:  ISDNEYVLQGSQII-FPTTLF----------------DTKKSLFREELELNGDFSSKLLQKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLETHPKQ

Query:  GIEEEIHEIEGFKTQFSSLYAKLVINNKEKDKDKNLMEKEGGPTATSTVTSSSQPAFTKSKSYSSGASNVLRQWITCGPGAVDTNDAVLVKNRSPKHPPN
        GIEEEIHEIEGFKTQ+SSLYAKLV N+KEKDK+K+LMEKEGGPTATSTVTSSSQPAFTKSKSYSSGAS+VLRQWITCGPGAVDTNDAVLVKNRSPK PPN
Subjt:  GIEEEIHEIEGFKTQFSSLYAKLVINNKEKDKDKNLMEKEGGPTATSTVTSSSQPAFTKSKSYSSGASNVLRQWITCGPGAVDTNDAVLVKNRSPKHPPN

Query:  QPPSEKHRNDAVFCRDDVLGGSARVLGNSWDANLEIRRQYSPQTSRKKFDDSRKKRSKESGGKKVAATATYKPMAGPNCSLCGKTFRPEKMHSHMKSCRG
        QPP  K +NDAVFCRDDVLGGSARV+GNS + NLEIRRQYSPQTSRK FDDSRKKR KESG +KVAATATYKPMAGPNC LCGKTFRPEKMHSHMKSC+G
Subjt:  QPPSEKHRNDAVFCRDDVLGGSARVLGNSWDANLEIRRQYSPQTSRKKFDDSRKKRSKESGGKKVAATATYKPMAGPNCSLCGKTFRPEKMHSHMKSCRG

Query:  IKSLTKTASTSDKTTSSKSTTTRTSDKDLASTYFLTN
        I+SLTKTASTSDK   SKSTTTRTSDKD  STY LTN
Subjt:  IKSLTKTASTSDKTTSSKSTTTRTSDKDLASTYFLTN

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CE79 protein UPSTREAM OF FLC isoform X1 (e value: 7.5e-178; % identity: 80.93)
Query:  MMKLRKRALMEA---NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDD
        MMKLRKRALMEA   N+GGE+RKVHIIYFLSRMGHVEQPHLIRVHHLAGA     GGVYLRDVKRWL ELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDD
Subjt:  MMKLRKRALMEA---NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDD

Query:  LITPISDNEYVLQGSQIIFPTTLFDTKKSLFREELELNGDFSSKLLQKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLETHPKQGI--EEEIHEIEG
        LITPISDNEYVLQGSQIIFP+TLFDTKKSLFREELEL+ DFSSKL Q+N E+SPPYDSERSTVTDDGDS+KVE ET + NLE +PKQG+  EEE+ EIEG
Subjt:  LITPISDNEYVLQGSQIIFPTTLFDTKKSLFREELELNGDFSSKLLQKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLETHPKQGI--EEEIHEIEG

Query:  FKTQF--SSLYAKLVINNKEKDKDKNLMEKEGGPTATSTVTSSSQPAFTKSKSYSSGASNVLRQWITCGPGAVDTNDAVLVKNRSPKHPPNQPPSEKHRN
        FKTQ+  SSLYAKL  N KEKDKDK+LM KEGG TATSTV+SSSQPAFTKSKSYSSGAS+V RQ ITCG GAVDTND VLVKNRSPK     PP EK +N
Subjt:  FKTQF--SSLYAKLVINNKEKDKDKNLMEKEGGPTATSTVTSSSQPAFTKSKSYSSGASNVLRQWITCGPGAVDTNDAVLVKNRSPKHPPNQPPSEKHRN

Query:  DAVFCRDDVLGGSARVLGNSWD-ANLEIRRQYSPQTSRKKFDDSRKKRSKESGGKKVAATATYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK-T
        DAV CRD+VLGGSARV+G+SWD  NLEIRR    QTSR   DD RKKR KE+G  KV A  TYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK  
Subjt:  DAVFCRDDVLGGSARVLGNSWD-ANLEIRRQYSPQTSRKKFDDSRKKRSKESGGKKVAATATYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK-T

Query:  ASTSDKTTSSKSTTTRTSDKDLASTYFLTN
         +TSDKTT SKSTTT TSDKDL STY LTN
Subjt:  ASTSDKTTSSKSTTTRTSDKDLASTYFLTN

A0A1S3CE85 uncharacterized protein LOC103499876 isoform X2 (e value: 2.2e-174; % identity: 80.23)
Query:  MMKLRKRALMEA---NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDD
        MMKLRKRALMEA   N+GGE+RKVHIIYFLSRMGHVEQPHLIRVHHLAGA     GGVYLRDVKRWL ELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDD
Subjt:  MMKLRKRALMEA---NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDD

Query:  LITPISDNEYVLQGSQIIFPTTLFDTKKSLFREELELNGDFSSKLLQKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLETHPKQGI--EEEIHEIEG
        LITPISDNEYVLQGSQIIFP+TLFDTKKSLFREELEL+ DFSSKL Q+N E+SPPYDSERSTVTDDGDS+KVE ET + NLE +PKQG+  EEE+ EIEG
Subjt:  LITPISDNEYVLQGSQIIFPTTLFDTKKSLFREELELNGDFSSKLLQKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLETHPKQGI--EEEIHEIEG

Query:  FKTQF--SSLYAKLVINNKEKDKDKNLMEKEGGPTATSTVTSSSQPAFTKSKSYSSGASNVLRQWITCGPGAVDTNDAVLVKNRSPKHPPNQPPSEKHRN
        FKTQ+  SSLYAKL  N KEKDKDK+LM KE    ATSTV+SSSQPAFTKSKSYSSGAS+V RQ ITCG GAVDTND VLVKNRSPK     PP EK +N
Subjt:  FKTQF--SSLYAKLVINNKEKDKDKNLMEKEGGPTATSTVTSSSQPAFTKSKSYSSGASNVLRQWITCGPGAVDTNDAVLVKNRSPKHPPNQPPSEKHRN

Query:  DAVFCRDDVLGGSARVLGNSWD-ANLEIRRQYSPQTSRKKFDDSRKKRSKESGGKKVAATATYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK-T
        DAV CRD+VLGGSARV+G+SWD  NLEIRR    QTSR   DD RKKR KE+G  KV A  TYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK  
Subjt:  DAVFCRDDVLGGSARVLGNSWD-ANLEIRRQYSPQTSRKKFDDSRKKRSKESGGKKVAATATYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK-T

Query:  ASTSDKTTSSKSTTTRTSDKDLASTYFLTN
         +TSDKTT SKSTTT TSDKDL STY LTN
Subjt:  ASTSDKTTSSKSTTTRTSDKDLASTYFLTN

A0A1S3CEM9 protein UPSTREAM OF FLC isoform X3 (e value: 1.9e-173; % identity: 80.96)
Query:  NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGS
        N+GGE+RKVHIIYFLSRMGHVEQPHLIRVHHLAGA     GGVYLRDVKRWL ELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGS
Subjt:  NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGS

Query:  QIIFPTTLFDTKKSLFREELELNGDFSSKLLQKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLETHPKQGI--EEEIHEIEGFKTQF--SSLYAKLV
        QIIFP+TLFDTKKSLFREELEL+ DFSSKL Q+N E+SPPYDSERSTVTDDGDS+KVE ET + NLE +PKQG+  EEE+ EIEGFKTQ+  SSLYAKL 
Subjt:  QIIFPTTLFDTKKSLFREELELNGDFSSKLLQKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLETHPKQGI--EEEIHEIEGFKTQF--SSLYAKLV

Query:  INNKEKDKDKNLMEKEGGPTATSTVTSSSQPAFTKSKSYSSGASNVLRQWITCGPGAVDTNDAVLVKNRSPKHPPNQPPSEKHRNDAVFCRDDVLGGSAR
         N KEKDKDK+LM KEGG TATSTV+SSSQPAFTKSKSYSSGAS+V RQ ITCG GAVDTND VLVKNRSPK     PP EK +NDAV CRD+VLGGSAR
Subjt:  INNKEKDKDKNLMEKEGGPTATSTVTSSSQPAFTKSKSYSSGASNVLRQWITCGPGAVDTNDAVLVKNRSPKHPPNQPPSEKHRNDAVFCRDDVLGGSAR

Query:  VLGNSWD-ANLEIRRQYSPQTSRKKFDDSRKKRSKESGGKKVAATATYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK-TASTSDKTTSSKSTTT
        V+G+SWD  NLEIRR    QTSR   DD RKKR KE+G  KV A  TYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK   +TSDKTT SKSTTT
Subjt:  VLGNSWD-ANLEIRRQYSPQTSRKKFDDSRKKRSKESGGKKVAATATYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTK-TASTSDKTTSSKSTTT

Query:  RTSDKDLASTYFLTN
         TSDKDL STY LTN
Subjt:  RTSDKDLASTYFLTN

A0A6J1CUY7 protein UPSTREAM OF FLC isoform X2 (e value: 1.2e-138; % identity: 68.89)
Query:  MEA-NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYV
        MEA NKGGE+R+VHI+YFLSRMGHVEQPHLIRVHHLA A      GV+LRDVKRWL ELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITP SDNEYV
Subjt:  MEA-NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYV

Query:  LQGSQII-FPTTLFDT----------KKSLFR-EELELNGDFSSKLLQKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLETHPKQGIEEEIHEIEGF
        LQGSQII FP    ++          K S+FR +ELE   DF +KL      ESP  DSERSTVTDDGDS+KVE ET  N LET  KQGI   + EIEGF
Subjt:  LQGSQII-FPTTLFDT----------KKSLFR-EELELNGDFSSKLLQKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLETHPKQGIEEEIHEIEGF

Query:  KTQFSSLYAKLVINNKEKDKDKNLMEKEGGPTATSTVTSSS-------QPAFTKSKSYSSGASNVLRQWITCGPGAVDTNDAVLVKNRS-PKHPPNQPPS
          Q+SSLY KL    +EK  +K+ MEKEGGPTATSTV+SSS        PAFTKSKSYSSGAS+VLRQWITC  GAVDTND VL+KNRS  K PPN P  
Subjt:  KTQFSSLYAKLVINNKEKDKDKNLMEKEGGPTATSTVTSSS-------QPAFTKSKSYSSGASNVLRQWITCGPGAVDTNDAVLVKNRS-PKHPPNQPPS

Query:  EKHRNDAVFCRDDVLGGSARVLGNSWDANLEIRRQYSPQTSRKKFDDSRKKRSKESGGKKV-AATATYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKS
        EK +NDAV CRDD+LGGSARVL +SWD  L+I R  + Q SRK FD+ RKKR KESGG+KV AATA +K M GPNCS CGK+F+PEKMH+HMKSCRG+KS
Subjt:  EKHRNDAVFCRDDVLGGSARVLGNSWDANLEIRRQYSPQTSRKKFDDSRKKRSKESGGKKV-AATATYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKS

Query:  LTKTASTSDKTTSSKSTTTRTSDKDLASTYFLTN
        L KT STS+KTTSSKSTTT TS       YFLTN
Subjt:  LTKTASTSDKTTSSKSTTTRTSDKDLASTYFLTN

A0A6J1CV58 protein UPSTREAM OF FLC isoform X1 (e value: 6.2e-140; % identity: 68.47)
Query:  MKLRKRA--LMEA-NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDL
        M+L+KR+  LMEA NKGGE+R+VHI+YFLSRMGHVEQPHLIRVHHLA A      GV+LRDVKRWL ELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDL
Subjt:  MKLRKRA--LMEA-NKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDL

Query:  ITPISDNEYVLQGSQII-FPTTLFDT----------KKSLFR-EELELNGDFSSKLLQKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLETHPKQGI
        ITP SDNEYVLQGSQII FP    ++          K S+FR +ELE   DF +KL      ESP  DSERSTVTDDGDS+KVE ET  N LET  KQGI
Subjt:  ITPISDNEYVLQGSQII-FPTTLFDT----------KKSLFR-EELELNGDFSSKLLQKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLETHPKQGI

Query:  EEEIHEIEGFKTQFSSLYAKLVINNKEKDKDKNLMEKEGGPTATSTVTSSS-------QPAFTKSKSYSSGASNVLRQWITCGPGAVDTNDAVLVKNRS-
           + EIEGF  Q+SSLY KL    +EK  +K+ MEKEGGPTATSTV+SSS        PAFTKSKSYSSGAS+VLRQWITC  GAVDTND VL+KNRS 
Subjt:  EEEIHEIEGFKTQFSSLYAKLVINNKEKDKDKNLMEKEGGPTATSTVTSSS-------QPAFTKSKSYSSGASNVLRQWITCGPGAVDTNDAVLVKNRS-

Query:  PKHPPNQPPSEKHRNDAVFCRDDVLGGSARVLGNSWDANLEIRRQYSPQTSRKKFDDSRKKRSKESGGKKV-AATATYKPMAGPNCSLCGKTFRPEKMHS
         K PPN P  EK +NDAV CRDD+LGGSARVL +SWD  L+I R  + Q SRK FD+ RKKR KESGG+KV AATA +K M GPNCS CGK+F+PEKMH+
Subjt:  PKHPPNQPPSEKHRNDAVFCRDDVLGGSARVLGNSWDANLEIRRQYSPQTSRKKFDDSRKKRSKESGGKKV-AATATYKPMAGPNCSLCGKTFRPEKMHS

Query:  HMKSCRGIKSLTKTASTSDKTTSSKSTTTRTSDKDLASTYFLTN
        HMKSCRG+KSL KT STS+KTTSSKSTTT TS       YFLTN
Subjt:  HMKSCRGIKSLTKTASTSDKTTSSKSTTTRTSDKDLASTYFLTN

SwissProt top hits (e value, % identity, alignment)
A0A2R6X6S3 Protein SOSEKI (e value: 8.0e-20; % identity: 37.89)
Query:  KVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIFPTT
        KV ++Y+LSR G ++QPHLI V            G+YLRDVKR L  +RGK M ++FSWS KR YK  ++WQDL DDD I P+SD E VL+GS++     
Subjt:  KVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIFPTT

Query:  LFDTKKSLFREELE-LNGDFSSKLLQKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLE
              + F+E+ E ++G F      +N  + P    + ++   D +++K   +  S+ L+
Subjt:  LFDTKKSLFREELE-LNGDFSSKLLQKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLE

A9RNY0 Protein SOSEKI 4 (e value: 1.9e-16; % identity: 38.17)
Query:  MEANKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDN-EYV
        ME  +    R   ++Y LS  G +E PH+I V +          G  LRDVK  L  LRG+ MP++FSWSYKR YK  ++W DL DD+ I P+S++ EY 
Subjt:  MEANKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDN-EYV

Query:  LQGSQIIFPTTLFDTKKSLFREELELNGDFS
        L+ ++    + +   +     E LE  GD S
Subjt:  LQGSQIIFPTTLFDTKKSLFREELELNGDFS

Q8GYT8 Protein SOSEKI 3 (e value: 5.4e-16; % identity: 36.26)
Query:  EVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIF
        +++KV I+Y+LS+   +E PH + V            G+YLRDV   L  LRG+ M   +SWS KR Y+ G+VW DL +DDLI P + NEYVL+GS+   
Subjt:  EVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIF

Query:  PTTLFDTKKSLFREELELNGDFSSKLL------QKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLETH
           LFD          E N D  S ++       K +   PP  S RS   DD  S          N  +H
Subjt:  PTTLFDTKKSLFREELELNGDFSSKLL------QKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLETH

Q9FJF5 Protein SOSEKI 5 (e value: 8.9e-19; % identity: 28.98)
Query:  RKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQII---
        RKV ++Y+L R G ++ PH I V            G+YL+DV   L +LRGK M   +SWS KR YK G+VW DL +DD I P+   EYVL+GS+++   
Subjt:  RKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQII---

Query:  ---FPTTLFDTKKSLFREELELN-----GDFSSKLLQKNVEES------PPYDSERSTVTDDGDSMKVEAETSSNNLE-THPKQGIEEEIHEI------E
            P +L +T  S FR+   LN     GD    ++ +   +S        Y   ++T +    + ++ A+ S+   +    ++  +EEI E+      E
Subjt:  ---FPTTLFDTKKSLFREELELN-----GDFSSKLLQKNVEES------PPYDSERSTVTDDGDSMKVEAETSSNNLE-THPKQGIEEEIHEI------E

Query:  GFKTQFSSLYAKLVINNKEKDKDKNLMEKEGGPTATSTVTSSSQPAFTKSKSYSSGASNVLRQWITCGPGAVDTNDAVLVKNR
           T+ S        ++   +  +NL++ +G      + +S+         S    AS VL Q I+CG  +      VL+K++
Subjt:  GFKTQFSSLYAKLVINNKEKDKDKNLMEKEGGPTATSTVTSSSQPAFTKSKSYSSGASNVLRQWITCGPGAVDTNDAVLVKNR

Q9SYJ8 Protein SOSEKI 1 (e value: 7.7e-55; % identity: 40.96)
Query:  MEAN-KGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYV
        ME+N  GGEVR+V+++YFLSR GHV+ PHL+RVHHL+        GV+LRDVK+WLA+ RG  MP+AFSWS KR+YK GYVWQDL+DDDLITPISDNEYV
Subjt:  MEAN-KGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYV

Query:  LQGSQIIFPTTLFDTKKSLFREELELNG--DFSSKL------LQKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLETHPKQGIEEEIHEIEGF--KT
        L+GS+I+  +   D      +  +  NG  D   KL       +K  +ESP + S+RST T    +  V  E+++N                 EGF  K 
Subjt:  LQGSQIIFPTTLFDTKKSLFREELELNG--DFSSKL------LQKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLETHPKQGIEEEIHEIEGF--KT

Query:  QFSSLYAKLVINNKEKDKDKNLMEKEGGPTATSTVTSSSQPAFTKSKSYSS-GASNVLRQWITCGPGAVDTNDAVLVKNRSPKHPPNQPPSEKHRNDAVF
        Q     +     + E     ++  + G P+ +ST +SSS   + K+KSYSS  AS+VLR  + C  G +DTNDAVLV            P  K R+ A  
Subjt:  QFSSLYAKLVINNKEKDKDKNLMEKEGGPTATSTVTSSSQPAFTKSKSYSS-GASNVLRQWITCGPGAVDTNDAVLVKNRSPKHPPNQPPSEKHRNDAVF

Query:  CRDDVLGGSARVLGNSWDANLEIRRQYSPQ-TSRKKFDDS----RKKRSKESGGKKVAATATYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTKTA
                     G +W+   E R QY  Q  +RK F+ +    + K + E    KVA +   KP   P CS CGK F+PEKMHSHMK CRG+K+   ++
Subjt:  CRDDVLGGSARVLGNSWDANLEIRRQYSPQ-TSRKKFDDS----RKKRSKESGGKKVAATATYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTKTA

Query:  STSDKTTSSKSTTTR
        + +D  TS+ +   R
Subjt:  STSDKTTSSKSTTTR

Arabidopsis top hits (e value, % identity, alignment)
AT1G05577.1 Domain of unknown function (DUF966) (e value: 5.5e-56; % identity: 40.96)
Query:  MEAN-KGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYV
        ME+N  GGEVR+V+++YFLSR GHV+ PHL+RVHHL+        GV+LRDVK+WLA+ RG  MP+AFSWS KR+YK GYVWQDL+DDDLITPISDNEYV
Subjt:  MEAN-KGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYV

Query:  LQGSQIIFPTTLFDTKKSLFREELELNG--DFSSKL------LQKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLETHPKQGIEEEIHEIEGF--KT
        L+GS+I+  +   D      +  +  NG  D   KL       +K  +ESP + S+RST T    +  V  E+++N                 EGF  K 
Subjt:  LQGSQIIFPTTLFDTKKSLFREELELNG--DFSSKL------LQKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLETHPKQGIEEEIHEIEGF--KT

Query:  QFSSLYAKLVINNKEKDKDKNLMEKEGGPTATSTVTSSSQPAFTKSKSYSS-GASNVLRQWITCGPGAVDTNDAVLVKNRSPKHPPNQPPSEKHRNDAVF
        Q     +     + E     ++  + G P+ +ST +SSS   + K+KSYSS  AS+VLR  + C  G +DTNDAVLV            P  K R+ A  
Subjt:  QFSSLYAKLVINNKEKDKDKNLMEKEGGPTATSTVTSSSQPAFTKSKSYSS-GASNVLRQWITCGPGAVDTNDAVLVKNRSPKHPPNQPPSEKHRNDAVF

Query:  CRDDVLGGSARVLGNSWDANLEIRRQYSPQ-TSRKKFDDS----RKKRSKESGGKKVAATATYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTKTA
                     G +W+   E R QY  Q  +RK F+ +    + K + E    KVA +   KP   P CS CGK F+PEKMHSHMK CRG+K+   ++
Subjt:  CRDDVLGGSARVLGNSWDANLEIRRQYSPQ-TSRKKFDDS----RKKRSKESGGKKVAATATYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTKTA

Query:  STSDKTTSSKSTTTR
        + +D  TS+ +   R
Subjt:  STSDKTTSSKSTTTR

AT2G28150.1 Domain of unknown function (DUF966) (e value: 3.8e-17; % identity: 36.26)
Query:  EVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIF
        +++KV I+Y+LS+   +E PH + V            G+YLRDV   L  LRG+ M   +SWS KR Y+ G+VW DL +DDLI P + NEYVL+GS+   
Subjt:  EVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQIIF

Query:  PTTLFDTKKSLFREELELNGDFSSKLL------QKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLETH
           LFD          E N D  S ++       K +   PP  S RS   DD  S          N  +H
Subjt:  PTTLFDTKKSLFREELELNGDFSSKLL------QKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLETH

AT3G46110.1 Domain of unknown function (DUF966) (e value: 3.8e-17; % identity: 35.5)
Query:  MKLRKRALMEANKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITP
        M L      + +K    R V ++Y+LSR G ++ PH I V            G+YL+DV   L +LRG  M   +SWS KR YK G+VW DL D+D I P
Subjt:  MKLRKRALMEANKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITP

Query:  ISDNEYVLQGSQIIFPTTLFDTKKSLFREELELNGDFSS-------KLLQKNVEESPPYDSERSTVTDD
        +   EYVL+GSQI+    L +   +        N  +SS       K  + N E +     + ST TDD
Subjt:  ISDNEYVLQGSQIIFPTTLFDTKKSLFREELELNGDFSS-------KLLQKNVEESPPYDSERSTVTDD

AT3G46110.2 Domain of unknown function (DUF966) (e value: 3.8e-17; % identity: 35.5)
Query:  MKLRKRALMEANKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITP
        M L      + +K    R V ++Y+LSR G ++ PH I V            G+YL+DV   L +LRG  M   +SWS KR YK G+VW DL D+D I P
Subjt:  MKLRKRALMEANKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITP

Query:  ISDNEYVLQGSQIIFPTTLFDTKKSLFREELELNGDFSS-------KLLQKNVEESPPYDSERSTVTDD
        +   EYVL+GSQI+    L +   +        N  +SS       K  + N E +     + ST TDD
Subjt:  ISDNEYVLQGSQIIFPTTLFDTKKSLFREELELNGDFSS-------KLLQKNVEESPPYDSERSTVTDD

AT5G59790.1 Domain of unknown function (DUF966) (e value: 6.3e-20; % identity: 28.98)
Query:  RKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQII---
        RKV ++Y+L R G ++ PH I V            G+YL+DV   L +LRGK M   +SWS KR YK G+VW DL +DD I P+   EYVL+GS+++   
Subjt:  RKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQGSQII---

Query:  ---FPTTLFDTKKSLFREELELN-----GDFSSKLLQKNVEES------PPYDSERSTVTDDGDSMKVEAETSSNNLE-THPKQGIEEEIHEI------E
            P +L +T  S FR+   LN     GD    ++ +   +S        Y   ++T +    + ++ A+ S+   +    ++  +EEI E+      E
Subjt:  ---FPTTLFDTKKSLFREELELN-----GDFSSKLLQKNVEES------PPYDSERSTVTDDGDSMKVEAETSSNNLE-THPKQGIEEEIHEI------E

Query:  GFKTQFSSLYAKLVINNKEKDKDKNLMEKEGGPTATSTVTSSSQPAFTKSKSYSSGASNVLRQWITCGPGAVDTNDAVLVKNR
           T+ S        ++   +  +NL++ +G      + +S+         S    AS VL Q I+CG  +      VL+K++
Subjt:  GFKTQFSSLYAKLVINNKEKDKDKNLMEKEGGPTATSTVTSSSQPAFTKSKSYSSGASNVLRQWITCGPGAVDTNDAVLVKNR


Sequences
CDS sequence
ATGATGAAGCTTAGAAAGAGAGCGTTAATGGAGGCTAATAAAGGTGGAGAAGTTAGAAAAGTTCATATTATTTACTTTCTTAGCCGGATGGGACACGTGGAGCAACCCCA
TCTCATCCGTGTTCATCATCTCGCCGGCGCCGGCGGAGGCGGAGGTGGCGGCGTTTATCTCCGAGATGTGAAGAGATGGCTAGCGGAATTGAGGGGAAAGGAGATGCCAG
AAGCCTTCTCATGGTCATACAAAAGAAAGTATAAAACAGGGTACGTTTGGCAGGACCTGGTGGATGACGATCTCATAACTCCAATTTCTGACAACGAATATGTCCTTCAA
GGATCCCAAATCATATTTCCCACTACTCTCTTTGATACTAAAAAATCATTATTCAGAGAGGAATTGGAGTTAAATGGAGATTTTTCATCGAAACTCCTGCAAAAGAATGT
AGAAGAATCTCCACCGTACGATTCAGAGAGGTCGACGGTGACGGACGATGGGGATTCGATGAAGGTTGAAGCAGAGACAAGTAGTAATAATTTGGAGACACACCCCAAAC
AGGGGATAGAAGAAGAAATACACGAAATTGAAGGGTTTAAGACACAATTTTCTTCTTTGTATGCAAAATTGGTGATCAACAACAAGGAGAAAGATAAAGATAAGAATCTC
ATGGAGAAGGAAGGTGGACCCACAGCCACGTCAACAGTTACGTCATCATCCCAACCTGCATTTACAAAGAGCAAGAGCTACTCAAGCGGAGCTTCCAACGTGCTTCGGCA
GTGGATCACGTGCGGGCCCGGGGCAGTAGACACAAACGACGCCGTTTTGGTCAAGAACCGATCTCCCAAACATCCACCGAATCAACCGCCATCGGAAAAACACAGAAACG
ACGCCGTGTTTTGCAGAGACGACGTGTTGGGCGGCTCCGCCCGAGTTCTCGGAAATTCTTGGGATGCGAACCTCGAAATCCGCCGCCAATACAGCCCGCAAACTTCTCGA
AAAAAATTCGATGATTCAAGGAAGAAAAGATCGAAGGAGAGCGGTGGCAAAAAGGTGGCGGCCACCGCAACTTACAAGCCGATGGCAGGACCAAATTGCTCGTTATGTGG
GAAGACTTTCAGGCCGGAGAAAATGCATTCACATATGAAATCATGTAGAGGAATCAAGTCTCTGACCAAAACTGCTTCAACATCAGACAAAACGACATCGTCTAAGTCAA
CTACGACAAGAACTTCCGACAAGGATTTGGCTTCCACCTACTTTTTGACCAACTGA
mRNA sequence
ATGATGAAGCTTAGAAAGAGAGCGTTAATGGAGGCTAATAAAGGTGGAGAAGTTAGAAAAGTTCATATTATTTACTTTCTTAGCCGGATGGGACACGTGGAGCAACCCCA
TCTCATCCGTGTTCATCATCTCGCCGGCGCCGGCGGAGGCGGAGGTGGCGGCGTTTATCTCCGAGATGTGAAGAGATGGCTAGCGGAATTGAGGGGAAAGGAGATGCCAG
AAGCCTTCTCATGGTCATACAAAAGAAAGTATAAAACAGGGTACGTTTGGCAGGACCTGGTGGATGACGATCTCATAACTCCAATTTCTGACAACGAATATGTCCTTCAA
GGATCCCAAATCATATTTCCCACTACTCTCTTTGATACTAAAAAATCATTATTCAGAGAGGAATTGGAGTTAAATGGAGATTTTTCATCGAAACTCCTGCAAAAGAATGT
AGAAGAATCTCCACCGTACGATTCAGAGAGGTCGACGGTGACGGACGATGGGGATTCGATGAAGGTTGAAGCAGAGACAAGTAGTAATAATTTGGAGACACACCCCAAAC
AGGGGATAGAAGAAGAAATACACGAAATTGAAGGGTTTAAGACACAATTTTCTTCTTTGTATGCAAAATTGGTGATCAACAACAAGGAGAAAGATAAAGATAAGAATCTC
ATGGAGAAGGAAGGTGGACCCACAGCCACGTCAACAGTTACGTCATCATCCCAACCTGCATTTACAAAGAGCAAGAGCTACTCAAGCGGAGCTTCCAACGTGCTTCGGCA
GTGGATCACGTGCGGGCCCGGGGCAGTAGACACAAACGACGCCGTTTTGGTCAAGAACCGATCTCCCAAACATCCACCGAATCAACCGCCATCGGAAAAACACAGAAACG
ACGCCGTGTTTTGCAGAGACGACGTGTTGGGCGGCTCCGCCCGAGTTCTCGGAAATTCTTGGGATGCGAACCTCGAAATCCGCCGCCAATACAGCCCGCAAACTTCTCGA
AAAAAATTCGATGATTCAAGGAAGAAAAGATCGAAGGAGAGCGGTGGCAAAAAGGTGGCGGCCACCGCAACTTACAAGCCGATGGCAGGACCAAATTGCTCGTTATGTGG
GAAGACTTTCAGGCCGGAGAAAATGCATTCACATATGAAATCATGTAGAGGAATCAAGTCTCTGACCAAAACTGCTTCAACATCAGACAAAACGACATCGTCTAAGTCAA
CTACGACAAGAACTTCCGACAAGGATTTGGCTTCCACCTACTTTTTGACCAACTGA
Protein sequence
MMKLRKRALMEANKGGEVRKVHIIYFLSRMGHVEQPHLIRVHHLAGAGGGGGGGVYLRDVKRWLAELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPISDNEYVLQ
GSQIIFPTTLFDTKKSLFREELELNGDFSSKLLQKNVEESPPYDSERSTVTDDGDSMKVEAETSSNNLETHPKQGIEEEIHEIEGFKTQFSSLYAKLVINNKEKDKDKNL
MEKEGGPTATSTVTSSSQPAFTKSKSYSSGASNVLRQWITCGPGAVDTNDAVLVKNRSPKHPPNQPPSEKHRNDAVFCRDDVLGGSARVLGNSWDANLEIRRQYSPQTSR
KKFDDSRKKRSKESGGKKVAATATYKPMAGPNCSLCGKTFRPEKMHSHMKSCRGIKSLTKTASTSDKTTSSKSTTTRTSDKDLASTYFLTN
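
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tooling. The two test strings are the first 30 and the last 18 nucleotides of the CDS sequence listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS: should match the start of the protein.
print(translate("ATGATGAAGCTTAGAAAGAGAGCGTTAATG"))
# Last 18 nucleotides of the CDS, ending in the TGA stop codon.
print(translate("TACTTTTTGACCAACTGA"))
```

Applied to the full CDS (with line breaks removed), the same function should reproduce the protein sequence if the record is internally consistent.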