; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh04G008670 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh04G008670
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Descriptionprotein UPSTREAM OF FLC isoform X1
Genome locationCmo_Chr04:4339099..4341040
RNA-Seq ExpressionCmoCh04G008670
SyntenyCmoCh04G008670
Gene Ontology termsGO:0051258 - protein polymerization (biological process)
GO:0051302 - regulation of cell division (biological process)
GO:0090708 - specification of plant organ axis polarity (biological process)
GO:2000067 - regulation of root morphogenesis (biological process)
GO:0031234 - extrinsic component of cytoplasmic side of plasma membrane (cellular component)
InterPro domainsIPR010369 - Protein SOSEKI


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600637.1 Protein UPSTREAM OF FLC, partial [Cucurbita argyrosperma subsp. sororia]5.9e-18583.02Show/hide
Query:  MQSNRRRRERALPAPSPQHQSRQSQGDD--DDGCATSHEKSPKMGVWNSPISSKSASSPTSPFDSKMLWIGSPFSEARPCALSIPGPVRGKQKTKSSSVL
        MQSNRRRRERALPAPSPQHQSRQSQGDD  DDGCATSHEKSP +      +S +  +S T+PF                    +P P    Q+   S+  
Subjt:  MQSNRRRRERALPAPSPQHQSRQSQGDD--DDGCATSHEKSPKMGVWNSPISSKSASSPTSPFDSKMLWIGSPFSEARPCALSIPGPVRGKQKTKSSSVL

Query:  QDDMSFTMPHVLLLRNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDLDY
               +   L+L NYKTGYVWNDLSENDIVYPA GAEYVLKASELVDFCSEKLQEIHTG+NDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDLDY
Subjt:  QDDMSFTMPHVLLLRNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDLDY

Query:  DEVEDYENEDVDKPIYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTYEDELGTAPSRNSVLMQFISCGGSVSSKEKPGEGCREAG
        DEVEDYENEDVDKPIYTTTSTTPHSRCSRGVSTEELPGPTQSPT+STPFDSSRLSTSKRFTYED+LGTAPSRNSVLMQFISCGGS  SKEKPGEG  EAG
Subjt:  DEVEDYENEDVDKPIYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTYEDELGTAPSRNSVLMQFISCGGSVSSKEKPGEGCREAG

Query:  KEMGRRTESLGRGVVCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKRDENEETRREEDD
        KEMGRRTESLGRGVVCKMAGKR+GEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKRDENEETRREEDD
Subjt:  KEMGRRTESLGRGVVCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKRDENEETRREEDD

Query:  GGMKGRCIPRMISAASALALASSKQPPKKP
        GGMKGRCIPRMISAASALALASSKQPPKKP
Subjt:  GGMKGRCIPRMISAASALALASSKQPPKKP

KAG7031273.1 Protein UPSTREAM OF FLC, partial [Cucurbita argyrosperma subsp. argyrosperma]1.0e-18482.64Show/hide
Query:  MQSNRRRRERALPAPSPQHQSRQSQG----DDDDGCATSHEKSPKMGVWNSPISSKSASSPTSPFDSKMLWIGSPFSEARPCALSIPGPVRGKQKTKSSS
        MQSNRRRRERALPAPSPQHQSRQSQG    DDDDGCATSHEKSP +      +S +  +S T+PF                    +P P    Q+   S+
Subjt:  MQSNRRRRERALPAPSPQHQSRQSQG----DDDDGCATSHEKSPKMGVWNSPISSKSASSPTSPFDSKMLWIGSPFSEARPCALSIPGPVRGKQKTKSSS

Query:  VLQDDMSFTMPHVLLLRNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDL
                 +   L+L NYKTGYVWNDLSENDIVYPA GAEYVLKASELVDFCSEKLQEIHTG+NDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDL
Subjt:  VLQDDMSFTMPHVLLLRNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDL

Query:  DYDEVEDYENEDVDKPIYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTYEDELGTAPSRNSVLMQFISCGGSVSSKEKPGEGCRE
        DYDEVEDYENEDVDKPIYTTTSTTPHSRCSRGVSTEELPGPTQSPT+STPFDSSRLSTSKRFTYED+LGTAPSRNSVLMQFISCGGS  SKEKPGEG  E
Subjt:  DYDEVEDYENEDVDKPIYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTYEDELGTAPSRNSVLMQFISCGGSVSSKEKPGEGCRE

Query:  AGKEMGRRTESLGRGVVCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKRDENEETRREE
        AGKEMGRRTESLGRGVVCKMAGKR+GEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKRDENEETRREE
Subjt:  AGKEMGRRTESLGRGVVCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKRDENEETRREE

Query:  DDGGMKGRCIPRMISAASALALASSKQPPKKP
        DDGGMKGRCIPRMISAASALALASSKQPPKKP
Subjt:  DDGGMKGRCIPRMISAASALALASSKQPPKKP

XP_022941694.1 protein UPSTREAM OF FLC isoform X1 [Cucurbita moschata]2.1e-19083.07Show/hide
Query:  MQSNRRRRERALPAPSPQHQSRQSQGDDDDGCATSHEKSPKMGVWNSPISSKSASSPTSPFDSKMLWIGSPFSEARPCALSIPGPVRGKQKTKSSSVLQD
        MQSNRRRRERALPAPSPQHQSRQSQGDDDDGCATSHEKSP +      +S +  +S T+PF         P  +  P     P   +   + +S  +   
Subjt:  MQSNRRRRERALPAPSPQHQSRQSQGDDDDGCATSHEKSPKMGVWNSPISSKSASSPTSPFDSKMLWIGSPFSEARPCALSIPGPVRGKQKTKSSSVLQD

Query:  DMSF-TMPHVLLL--------------RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPL
         ++   M  + LL              RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPL
Subjt:  DMSF-TMPHVLLL--------------RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPL

Query:  KELDDQPYSDLDYDEVEDYENEDVDKPIYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTYEDELGTAPSRNSVLMQFISCGGSVS
        KELDDQPYSDLDYDEVEDYENEDVDKPIYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTYEDELGTAPSRNSVLMQFISCGGSVS
Subjt:  KELDDQPYSDLDYDEVEDYENEDVDKPIYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTYEDELGTAPSRNSVLMQFISCGGSVS

Query:  SKEKPGEGCREAGKEMGRRTESLGRGVVCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEK
        SKEKPGEGCREAGKEMGRRTESLGRGVVCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEK
Subjt:  SKEKPGEGCREAGKEMGRRTESLGRGVVCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEK

Query:  RDENEETRREEDDGGMKGRCIPRMISAASALALASSKQPPKKP
        RDENEETRREEDDGGMKGRCIPRMISAASALALASSKQPPKKP
Subjt:  RDENEETRREEDDGGMKGRCIPRMISAASALALASSKQPPKKP

XP_022941695.1 protein UPSTREAM OF FLC isoform X2 [Cucurbita moschata]2.0e-172100Show/hide
Query:  RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDLDYDEVEDYENEDVDKP
        RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDLDYDEVEDYENEDVDKP
Subjt:  RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDLDYDEVEDYENEDVDKP

Query:  IYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTYEDELGTAPSRNSVLMQFISCGGSVSSKEKPGEGCREAGKEMGRRTESLGRGV
        IYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTYEDELGTAPSRNSVLMQFISCGGSVSSKEKPGEGCREAGKEMGRRTESLGRGV
Subjt:  IYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTYEDELGTAPSRNSVLMQFISCGGSVSSKEKPGEGCREAGKEMGRRTESLGRGV

Query:  VCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKRDENEETRREEDDGGMKGRCIPRMISA
        VCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKRDENEETRREEDDGGMKGRCIPRMISA
Subjt:  VCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKRDENEETRREEDDGGMKGRCIPRMISA

Query:  ASALALASSKQPPKKP
        ASALALASSKQPPKKP
Subjt:  ASALALASSKQPPKKP

XP_022980967.1 protein UPSTREAM OF FLC isoform X1 [Cucurbita maxima]2.4e-17879.41Show/hide
Query:  MQSNRRRRERALPAPSPQHQSRQSQGDDDDGCATSHEKSPKMGVWNSPISSKSASSPTSPFDSKMLWIGSPFSEARPCALSIPGPVRGKQKTKSSSVLQD
        M  NRRR ERALPAPSPQHQSRQSQ DDDDGCATSHE+SP +      +S +  +S T+PF      +  P    +  A S     R +      S +  
Subjt:  MQSNRRRRERALPAPSPQHQSRQSQGDDDDGCATSHEKSPKMGVWNSPISSKSASSPTSPFDSKMLWIGSPFSEARPCALSIPGPVRGKQKTKSSSVLQD

Query:  DMSFTMPHVLLL--------------RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLK
        D+   M  + +L              RNYKTGYVWNDLSENDIVYPA GAEYVLKASELVD CSEK+QEIHTGTNDRRPVQEPNLR+KTRK QLAPSPLK
Subjt:  DMSFTMPHVLLL--------------RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLK

Query:  ELDDQPYSDLDYDEVEDYENEDVDKPIYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTYEDELGTAPSRNSVLMQFISCGGSVSS
        ELDDQ YSDLDYDEVEDYENEDVDKPIYTTTSTTPHSRCSRGVSTEELPGPTQSPT+STPFDSSRLSTSKRFTYEDELGTAPSRNSVLMQFISCGGS  S
Subjt:  ELDDQPYSDLDYDEVEDYENEDVDKPIYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTYEDELGTAPSRNSVLMQFISCGGSVSS

Query:  KEKPGEGCREAGKEMGRRTESLGRGVVCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKR
        KEKPGEG REAGKEMGRRTE LGRGVVCKM G+RIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKR
Subjt:  KEKPGEGCREAGKEMGRRTESLGRGVVCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKR

Query:  DENEETRREEDDGGMKGRCIPRMISAASALALASSKQPPKKP
        DENEETRREEDDGGMKGRCIPRMISAASALALASSKQPPKKP
Subjt:  DENEETRREEDDGGMKGRCIPRMISAASALALASSKQPPKKP

TrEMBL top hitse value%identityAlignment
A0A0A0KXS9 Uncharacterized protein7.8e-8259.44Show/hide
Query:  RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDF--CSEKLQEIH-TGTNDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDLDYDEVEDYENEDV
        RNYK+GYVWNDLSEND+VYPA G EYVLKAS+LVD     EKLQ++H    N R+PVQEPNL TKTRK QLAP+PL      P+SDL+YDE EDYE +D 
Subjt:  RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDF--CSEKLQEIH-TGTNDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDLDYDEVEDYENEDV

Query:  DKPIYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTY--EDELG--TAPSRNSVLMQFISCGGSVSSKEKPGEGCREAGKEMGRRT
        DK                  +  E PG TQ+PT     +S+R STSKRF    +DELG  + PSRNSVLMQFI CGGSV SK          GK + R  
Subjt:  DKPIYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTY--EDELG--TAPSRNSVLMQFISCGGSVSSKEKPGEGCREAGKEMGRRT

Query:  ESLGRGVVCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKRDENEETRREEDDGGMKGRC
        + +G+GVVCKM G  + EEEMIKYMSENPR GKLQ EEKEYFSGSIVESIREDRHV+ P+L KS+SY EEKSKR EL EK+DE+E+   E ++GG+KGRC
Subjt:  ESLGRGVVCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKRDENEETRREEDDGGMKGRC

Query:  IPRMISAASALALASSKQPPKKP
        +P MI     L   SSKQ PKKP
Subjt:  IPRMISAASALALASSKQPPKKP

A0A6J1FP61 protein UPSTREAM OF FLC isoform X11.0e-19083.07Show/hide
Query:  MQSNRRRRERALPAPSPQHQSRQSQGDDDDGCATSHEKSPKMGVWNSPISSKSASSPTSPFDSKMLWIGSPFSEARPCALSIPGPVRGKQKTKSSSVLQD
        MQSNRRRRERALPAPSPQHQSRQSQGDDDDGCATSHEKSP +      +S +  +S T+PF         P  +  P     P   +   + +S  +   
Subjt:  MQSNRRRRERALPAPSPQHQSRQSQGDDDDGCATSHEKSPKMGVWNSPISSKSASSPTSPFDSKMLWIGSPFSEARPCALSIPGPVRGKQKTKSSSVLQD

Query:  DMSF-TMPHVLLL--------------RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPL
         ++   M  + LL              RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPL
Subjt:  DMSF-TMPHVLLL--------------RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPL

Query:  KELDDQPYSDLDYDEVEDYENEDVDKPIYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTYEDELGTAPSRNSVLMQFISCGGSVS
        KELDDQPYSDLDYDEVEDYENEDVDKPIYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTYEDELGTAPSRNSVLMQFISCGGSVS
Subjt:  KELDDQPYSDLDYDEVEDYENEDVDKPIYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTYEDELGTAPSRNSVLMQFISCGGSVS

Query:  SKEKPGEGCREAGKEMGRRTESLGRGVVCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEK
        SKEKPGEGCREAGKEMGRRTESLGRGVVCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEK
Subjt:  SKEKPGEGCREAGKEMGRRTESLGRGVVCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEK

Query:  RDENEETRREEDDGGMKGRCIPRMISAASALALASSKQPPKKP
        RDENEETRREEDDGGMKGRCIPRMISAASALALASSKQPPKKP
Subjt:  RDENEETRREEDDGGMKGRCIPRMISAASALALASSKQPPKKP

A0A6J1FUF7 protein UPSTREAM OF FLC isoform X29.6e-173100Show/hide
Query:  RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDLDYDEVEDYENEDVDKP
        RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDLDYDEVEDYENEDVDKP
Subjt:  RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDLDYDEVEDYENEDVDKP

Query:  IYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTYEDELGTAPSRNSVLMQFISCGGSVSSKEKPGEGCREAGKEMGRRTESLGRGV
        IYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTYEDELGTAPSRNSVLMQFISCGGSVSSKEKPGEGCREAGKEMGRRTESLGRGV
Subjt:  IYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTYEDELGTAPSRNSVLMQFISCGGSVSSKEKPGEGCREAGKEMGRRTESLGRGV

Query:  VCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKRDENEETRREEDDGGMKGRCIPRMISA
        VCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKRDENEETRREEDDGGMKGRCIPRMISA
Subjt:  VCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKRDENEETRREEDDGGMKGRCIPRMISA

Query:  ASALALASSKQPPKKP
        ASALALASSKQPPKKP
Subjt:  ASALALASSKQPPKKP

A0A6J1ISQ2 protein UPSTREAM OF FLC isoform X11.2e-17879.41Show/hide
Query:  MQSNRRRRERALPAPSPQHQSRQSQGDDDDGCATSHEKSPKMGVWNSPISSKSASSPTSPFDSKMLWIGSPFSEARPCALSIPGPVRGKQKTKSSSVLQD
        M  NRRR ERALPAPSPQHQSRQSQ DDDDGCATSHE+SP +      +S +  +S T+PF      +  P    +  A S     R +      S +  
Subjt:  MQSNRRRRERALPAPSPQHQSRQSQGDDDDGCATSHEKSPKMGVWNSPISSKSASSPTSPFDSKMLWIGSPFSEARPCALSIPGPVRGKQKTKSSSVLQD

Query:  DMSFTMPHVLLL--------------RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLK
        D+   M  + +L              RNYKTGYVWNDLSENDIVYPA GAEYVLKASELVD CSEK+QEIHTGTNDRRPVQEPNLR+KTRK QLAPSPLK
Subjt:  DMSFTMPHVLLL--------------RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLK

Query:  ELDDQPYSDLDYDEVEDYENEDVDKPIYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTYEDELGTAPSRNSVLMQFISCGGSVSS
        ELDDQ YSDLDYDEVEDYENEDVDKPIYTTTSTTPHSRCSRGVSTEELPGPTQSPT+STPFDSSRLSTSKRFTYEDELGTAPSRNSVLMQFISCGGS  S
Subjt:  ELDDQPYSDLDYDEVEDYENEDVDKPIYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTYEDELGTAPSRNSVLMQFISCGGSVSS

Query:  KEKPGEGCREAGKEMGRRTESLGRGVVCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKR
        KEKPGEG REAGKEMGRRTE LGRGVVCKM G+RIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKR
Subjt:  KEKPGEGCREAGKEMGRRTESLGRGVVCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKR

Query:  DENEETRREEDDGGMKGRCIPRMISAASALALASSKQPPKKP
        DENEETRREEDDGGMKGRCIPRMISAASALALASSKQPPKKP
Subjt:  DENEETRREEDDGGMKGRCIPRMISAASALALASSKQPPKKP

A0A6J1IY41 protein UPSTREAM OF FLC isoform X23.7e-16495.89Show/hide
Query:  RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDLDYDEVEDYENEDVDKP
        RNYKTGYVWNDLSENDIVYPA GAEYVLKASELVD CSEK+QEIHTGTNDRRPVQEPNLR+KTRK QLAPSPLKELDDQ YSDLDYDEVEDYENEDVDKP
Subjt:  RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDLDYDEVEDYENEDVDKP

Query:  IYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTYEDELGTAPSRNSVLMQFISCGGSVSSKEKPGEGCREAGKEMGRRTESLGRGV
        IYTTTSTTPHSRCSRGVSTEELPGPTQSPT+STPFDSSRLSTSKRFTYEDELGTAPSRNSVLMQFISCGGS  SKEKPGEG REAGKEMGRRTE LGRGV
Subjt:  IYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTYEDELGTAPSRNSVLMQFISCGGSVSSKEKPGEGCREAGKEMGRRTESLGRGV

Query:  VCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKRDENEETRREEDDGGMKGRCIPRMISA
        VCKM G+RIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKRDENEETRREEDDGGMKGRCIPRMISA
Subjt:  VCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKRDENEETRREEDDGGMKGRCIPRMISA

Query:  ASALALASSKQPPKKP
        ASALALASSKQPPKKP
Subjt:  ASALALASSKQPPKKP

SwissProt top hitse value%identityAlignment
A0A2K1JJ00 Protein SOSEKI 16.7e-0636.67Show/hide
Query:  RNYKTGYVWNDLSENDIVYP-AGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNL
        R Y   ++WNDLS++D+++P  G  EYVL+ASEL+D    K  ++H+  +++     P +
Subjt:  RNYKTGYVWNDLSENDIVYP-AGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNL

Q8GY65 Protein SOSEKI 49.1e-1126.48Show/hide
Query:  RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDLDYDEVEDYENEDVDK-
        R YK G+VW DLS+ D ++P  G EYVLK S+++D        +   + +   V     ++ +           EL+ +    L  D     ++    K 
Subjt:  RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDLDYDEVEDYENEDVDK-

Query:  PIYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTYED--ELGTAPSR---NSVLMQFISCGGSVSSKEKPGEGCREAGKEMGRRTE
        P+      T  SR       EE+  P QS +     +S   +  +    ++  EL     +   ++VLMQ ISCG     K  P                
Subjt:  PIYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTYED--ELGTAPSR---NSVLMQFISCGGSVSSKEKPGEGCREAGKEMGRRTE

Query:  SLGRGVVCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKRDENEETR
        +L  G     A +  G   + +   E   FG+++ EEKEYFSGS+++     +  + P LK+SSSYN ++S R  L ++++  E  R
Subjt:  SLGRGVVCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKRDENEETR

Q8GYT8 Protein SOSEKI 31.0e-0624.86Show/hide
Query:  RNYKTGYVWNDLSENDIVYPAGGAEYVLKASEL------------VDFCSEKLQEIHTGTNDRRPVQEPNLRTK------TRK-----PQLAPSPLKELD
        R+Y+ G+VW+DLSE+D++ PA G EYVLK SEL            V+  ++ +++I       R + + +  +       T K      +L+P  L+ + 
Subjt:  RNYKTGYVWNDLSENDIVYPAGGAEYVLKASEL------------VDFCSEKLQEIHTGTNDRRPVQEPNLRTK------TRK-----PQLAPSPLKELD

Query:  DQPYSDLDYD----------EVEDYENEDVDKPIYTTTST------TPHSRCSRGVSTEE----------------------------------------
            S    D          E + Y++E +      T  T      TP    SRGVST+E                                        
Subjt:  DQPYSDLDYD----------EVEDYENEDVDKPIYTTTST------TPHSRCSRGVSTEE----------------------------------------

Query:  ----LPGPTQSPTDSTPFDSSRLSTSKRFTYEDELGTAPSR---NSVLMQFISCGGSVSSKE---------KPGEGCREAGKEMGRRTESLGRGVVCKMA
            L G T +       D S++++ +    ED    A  R   +++LMQ ISC GS+S K+         KP  G  +        +  +G        
Subjt:  ----LPGPTQSPTDSTPFDSSRLSTSKRFTYEDELGTAPSR---NSVLMQFISCGGSVSSKE---------KPGEGCREAGKEMGRRTESLGRGVVCKMA

Query:  GKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVE--PVLKKSSSYNEEKS
                 +  +SE P    L+ EEKEYFSGS+VE+  + +   +    LK+SSSYN +++
Subjt:  GKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVE--PVLKKSSSYNEEKS

Q9FJF5 Protein SOSEKI 55.5e-1628.92Show/hide
Query:  RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDLDYDEVEDYE-------
        R+YK G+VW+DLSE+D ++P  G EYVLK SE++D C          T+  R  +  +L          P+ +    +Q +S +D  E + Y+       
Subjt:  RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDLDYDEVEDYE-------

Query:  -NEDVDKPIYTTTSTTPHSRCSRGVSTEELPGPTQSPTDST----------PFDSSRLSTSKRFTYEDELGTAPSRNS------------------VLMQ
          + +     T T      R       EE+  P      ST          P DSS  +       +  L   PS +S                  VLMQ
Subjt:  -NEDVDKPIYTTTSTTPHSRCSRGVSTEELPGPTQSPTDST----------PFDSSRLSTSKRFTYEDELGTAPSRNS------------------VLMQ

Query:  FISCGGSVSSKEKPGEGCREAGKEMGRRTESLGRGVVCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEK
         ISC G++S KE      ++ G  +  R+     G       +  GEE + K   E   FG++Q E+KEYFSGS++E+ +E    + P LK+SSSYN ++
Subjt:  FISCGGSVSSKEKPGEGCREAGKEMGRRTESLGRGVVCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEK

Query:  SKRGELGEKRDENEETRREEDDGGMKGRCIPR
          R     ++DE E  R          +CIPR
Subjt:  SKRGELGEKRDENEETRREEDDGGMKGRCIPR

Q9LX14 Protein SOSEKI 22.4e-4037.5Show/hide
Query:  RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDLDYDEVEDYENEDVDKP
        R+Y+ G+VWNDL+END++YP+  AEYVLK SE+ D    K QE+H        +QE       R      +     DD      + +E ED E E  ++ 
Subjt:  RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDLDYDEVEDYENEDVDKP

Query:  IYTTTSTTPHSRCSRGVSTEELPGPTQSP------------TDSTPFDSS-------RLSTSKRFTYEDELGTAPSRNSVLMQFISCGGSVSSKEKPGEG
           T+STTP SRCSRGVSTE +    Q P            +DS+    S       R   S R    D +     R S+ +Q ISCG   +    P   
Subjt:  IYTTTSTTPHSRCSRGVSTEELPGPTQSP------------TDSTPFDSS-------RLSTSKRFTYEDELGTAPSRNSVLMQFISCGGSVSSKEKPGEG

Query:  CREAGKEMGRRTESLGRGVVCKMAGKRI---GEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKRDENE
           +     ++ E+L +GV+CK   K+     E EMI++MSENPRFG  Q EEKEYFSGSIVES+ ++R   EP L++S+S+NEE+SK  E+       +
Subjt:  CREAGKEMGRRTESLGRGVVCKMAGKRI---GEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKRDENE

Query:  ETRREEDDGGMKGRCIPRMISAASALALASSKQPPK
        ET+++E+    K +CIPR         ++SSKQ  K
Subjt:  ETRREEDDGGMKGRCIPRMISAASALALASSKQPPK

Arabidopsis top hitse value%identityAlignment
AT2G28150.1 Domain of unknown function (DUF966)7.4e-0824.86Show/hide
Query:  RNYKTGYVWNDLSENDIVYPAGGAEYVLKASEL------------VDFCSEKLQEIHTGTNDRRPVQEPNLRTK------TRK-----PQLAPSPLKELD
        R+Y+ G+VW+DLSE+D++ PA G EYVLK SEL            V+  ++ +++I       R + + +  +       T K      +L+P  L+ + 
Subjt:  RNYKTGYVWNDLSENDIVYPAGGAEYVLKASEL------------VDFCSEKLQEIHTGTNDRRPVQEPNLRTK------TRK-----PQLAPSPLKELD

Query:  DQPYSDLDYD----------EVEDYENEDVDKPIYTTTST------TPHSRCSRGVSTEE----------------------------------------
            S    D          E + Y++E +      T  T      TP    SRGVST+E                                        
Subjt:  DQPYSDLDYD----------EVEDYENEDVDKPIYTTTST------TPHSRCSRGVSTEE----------------------------------------

Query:  ----LPGPTQSPTDSTPFDSSRLSTSKRFTYEDELGTAPSR---NSVLMQFISCGGSVSSKE---------KPGEGCREAGKEMGRRTESLGRGVVCKMA
            L G T +       D S++++ +    ED    A  R   +++LMQ ISC GS+S K+         KP  G  +        +  +G        
Subjt:  ----LPGPTQSPTDSTPFDSSRLSTSKRFTYEDELGTAPSR---NSVLMQFISCGGSVSSKE---------KPGEGCREAGKEMGRRTESLGRGVVCKMA

Query:  GKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVE--PVLKKSSSYNEEKS
                 +  +SE P    L+ EEKEYFSGS+VE+  + +   +    LK+SSSYN +++
Subjt:  GKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVE--PVLKKSSSYNEEKS

AT3G46110.1 Domain of unknown function (DUF966)6.4e-1226.48Show/hide
Query:  RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDLDYDEVEDYENEDVDK-
        R YK G+VW DLS+ D ++P  G EYVLK S+++D        +   + +   V     ++ +           EL+ +    L  D     ++    K 
Subjt:  RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDLDYDEVEDYENEDVDK-

Query:  PIYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTYED--ELGTAPSR---NSVLMQFISCGGSVSSKEKPGEGCREAGKEMGRRTE
        P+      T  SR       EE+  P QS +     +S   +  +    ++  EL     +   ++VLMQ ISCG     K  P                
Subjt:  PIYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTYED--ELGTAPSR---NSVLMQFISCGGSVSSKEKPGEGCREAGKEMGRRTE

Query:  SLGRGVVCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKRDENEETR
        +L  G     A +  G   + +   E   FG+++ EEKEYFSGS+++     +  + P LK+SSSYN ++S R  L ++++  E  R
Subjt:  SLGRGVVCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKRDENEETR

AT3G46110.2 Domain of unknown function (DUF966)1.9e-1126.3Show/hide
Query:  RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDLDYDEVEDYENEDVDK-
        R YK G+VW DLS+ D ++P  G EYVLK S+++D        +   + +   V     ++ +           EL+ +    L  D     ++    K 
Subjt:  RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDLDYDEVEDYENEDVDK-

Query:  PIYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTYED--ELGTAPSR---NSVLMQFISCGGSVSSKEKPGEGCREAGKEMGRRTE
        P+      T  SR       EE+  P QS +     +S   +  +    ++  EL     +   ++VLMQ ISCG     K  P                
Subjt:  PIYTTTSTTPHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTYED--ELGTAPSR---NSVLMQFISCGGSVSSKEKPGEGCREAGKEMGRRTE

Query:  SLGRGVVCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEK
        +L  G     A +  G   + +   E   FG+++ EEKEYFSGS+++     +  + P LK+SSSYN ++
Subjt:  SLGRGVVCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEK

AT5G10150.1 Domain of unknown function (DUF966)1.7e-4137.5Show/hide
Query:  RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDLDYDEVEDYENEDVDKP
        R+Y+ G+VWNDL+END++YP+  AEYVLK SE+ D    K QE+H        +QE       R      +     DD      + +E ED E E  ++ 
Subjt:  RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDLDYDEVEDYENEDVDKP

Query:  IYTTTSTTPHSRCSRGVSTEELPGPTQSP------------TDSTPFDSS-------RLSTSKRFTYEDELGTAPSRNSVLMQFISCGGSVSSKEKPGEG
           T+STTP SRCSRGVSTE +    Q P            +DS+    S       R   S R    D +     R S+ +Q ISCG   +    P   
Subjt:  IYTTTSTTPHSRCSRGVSTEELPGPTQSP------------TDSTPFDSS-------RLSTSKRFTYEDELGTAPSRNSVLMQFISCGGSVSSKEKPGEG

Query:  CREAGKEMGRRTESLGRGVVCKMAGKRI---GEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKRDENE
           +     ++ E+L +GV+CK   K+     E EMI++MSENPRFG  Q EEKEYFSGSIVES+ ++R   EP L++S+S+NEE+SK  E+       +
Subjt:  CREAGKEMGRRTESLGRGVVCKMAGKRI---GEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKRDENE

Query:  ETRREEDDGGMKGRCIPRMISAASALALASSKQPPK
        ET+++E+    K +CIPR         ++SSKQ  K
Subjt:  ETRREEDDGGMKGRCIPRMISAASALALASSKQPPK

AT5G59790.1 Domain of unknown function (DUF966)3.9e-1728.92Show/hide
Query:  RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDLDYDEVEDYE-------
        R+YK G+VW+DLSE+D ++P  G EYVLK SE++D C          T+  R  +  +L          P+ +    +Q +S +D  E + Y+       
Subjt:  RNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDLDYDEVEDYE-------

Query:  -NEDVDKPIYTTTSTTPHSRCSRGVSTEELPGPTQSPTDST----------PFDSSRLSTSKRFTYEDELGTAPSRNS------------------VLMQ
          + +     T T      R       EE+  P      ST          P DSS  +       +  L   PS +S                  VLMQ
Subjt:  -NEDVDKPIYTTTSTTPHSRCSRGVSTEELPGPTQSPTDST----------PFDSSRLSTSKRFTYEDELGTAPSRNS------------------VLMQ

Query:  FISCGGSVSSKEKPGEGCREAGKEMGRRTESLGRGVVCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEK
         ISC G++S KE      ++ G  +  R+     G       +  GEE + K   E   FG++Q E+KEYFSGS++E+ +E    + P LK+SSSYN ++
Subjt:  FISCGGSVSSKEKPGEGCREAGKEMGRRTESLGRGVVCKMAGKRIGEEEMIKYMSENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEK

Query:  SKRGELGEKRDENEETRREEDDGGMKGRCIPR
          R     ++DE E  R          +CIPR
Subjt:  SKRGELGEKRDENEETRREEDDGGMKGRCIPR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGAGCAACAGGCGGAGGAGAGAGAGAGCACTGCCGGCACCGTCGCCGCAGCACCAGTCCAGACAGAGCCAAGGCGACGACGACGACGGCTGCGCCACCTCCCATGA
AAAAAGTCCAAAAATGGGCGTCTGGAACAGCCCCATTTCCTCGAAATCAGCCTCTTCCCCAACCAGCCCCTTCGACTCAAAGATGTTATGGATCGGCTCACCCTTCTCAG
AGGCAAGGCCATGCGCTCTCTCTATTCCTGGTCCTGTAAGAGGTAAACAAAAAACAAAAAGTTCCTCTGTTTTGCAAGACGACATGTCGTTCACTATGCCTCATGTTTTG
CTCCTTAGGAACTATAAAACTGGATATGTGTGGAACGACTTGTCGGAAAATGACATTGTATACCCCGCCGGAGGAGCTGAGTACGTGCTCAAGGCCTCCGAACTTGTTGA
CTTCTGTTCTGAAAAATTGCAGGAAATTCACACGGGCACCAACGACAGGCGACCGGTTCAGGAACCGAACCTCCGGACTAAAACTCGGAAACCGCAACTCGCTCCGAGTC
CACTCAAAGAACTCGACGACCAACCGTATTCCGATTTAGACTACGACGAAGTGGAGGACTACGAAAACGAAGACGTGGACAAACCCATTTACACCACCACTTCCACTACC
CCTCACTCCCGGTGCTCTCGCGGCGTCTCCACCGAAGAACTCCCCGGTCCAACTCAGTCCCCCACCGACTCAACTCCCTTTGACTCGTCCCGCCTCTCCACTTCGAAACG
GTTCACTTACGAGGACGAACTCGGGACGGCGCCGAGTAGGAACTCGGTCCTGATGCAGTTCATTTCTTGCGGTGGGTCGGTAAGTTCGAAGGAGAAACCCGGGGAGGGTT
GTAGAGAAGCGGGGAAGGAGATGGGGAGAAGAACCGAGAGCCTTGGGAGAGGGGTTGTGTGTAAAATGGCTGGAAAGCGGATCGGAGAAGAGGAGATGATAAAGTATATG
TCGGAGAATCCGAGGTTTGGGAAGTTGCAGACAGAGGAAAAGGAGTATTTCAGTGGGTCGATTGTGGAGTCCATTAGAGAAGATCGACATGTGGTTGAACCTGTGCTCAA
GAAATCCAGCTCCTACAACGAAGAAAAGAGCAAAAGAGGAGAGTTGGGAGAAAAAAGAGATGAAAATGAAGAGACAAGGAGGGAGGAAGATGACGGTGGGATGAAAGGAA
GGTGCATTCCTCGCATGATATCAGCTGCCTCGGCCTTAGCCTTAGCCTCATCAAAGCAACCCCCCAAGAAGCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAGAGCAACAGGCGGAGGAGAGAGAGAGCACTGCCGGCACCGTCGCCGCAGCACCAGTCCAGACAGAGCCAAGGCGACGACGACGACGGCTGCGCCACCTCCCATGA
AAAAAGTCCAAAAATGGGCGTCTGGAACAGCCCCATTTCCTCGAAATCAGCCTCTTCCCCAACCAGCCCCTTCGACTCAAAGATGTTATGGATCGGCTCACCCTTCTCAG
AGGCAAGGCCATGCGCTCTCTCTATTCCTGGTCCTGTAAGAGGTAAACAAAAAACAAAAAGTTCCTCTGTTTTGCAAGACGACATGTCGTTCACTATGCCTCATGTTTTG
CTCCTTAGGAACTATAAAACTGGATATGTGTGGAACGACTTGTCGGAAAATGACATTGTATACCCCGCCGGAGGAGCTGAGTACGTGCTCAAGGCCTCCGAACTTGTTGA
CTTCTGTTCTGAAAAATTGCAGGAAATTCACACGGGCACCAACGACAGGCGACCGGTTCAGGAACCGAACCTCCGGACTAAAACTCGGAAACCGCAACTCGCTCCGAGTC
CACTCAAAGAACTCGACGACCAACCGTATTCCGATTTAGACTACGACGAAGTGGAGGACTACGAAAACGAAGACGTGGACAAACCCATTTACACCACCACTTCCACTACC
CCTCACTCCCGGTGCTCTCGCGGCGTCTCCACCGAAGAACTCCCCGGTCCAACTCAGTCCCCCACCGACTCAACTCCCTTTGACTCGTCCCGCCTCTCCACTTCGAAACG
GTTCACTTACGAGGACGAACTCGGGACGGCGCCGAGTAGGAACTCGGTCCTGATGCAGTTCATTTCTTGCGGTGGGTCGGTAAGTTCGAAGGAGAAACCCGGGGAGGGTT
GTAGAGAAGCGGGGAAGGAGATGGGGAGAAGAACCGAGAGCCTTGGGAGAGGGGTTGTGTGTAAAATGGCTGGAAAGCGGATCGGAGAAGAGGAGATGATAAAGTATATG
TCGGAGAATCCGAGGTTTGGGAAGTTGCAGACAGAGGAAAAGGAGTATTTCAGTGGGTCGATTGTGGAGTCCATTAGAGAAGATCGACATGTGGTTGAACCTGTGCTCAA
GAAATCCAGCTCCTACAACGAAGAAAAGAGCAAAAGAGGAGAGTTGGGAGAAAAAAGAGATGAAAATGAAGAGACAAGGAGGGAGGAAGATGACGGTGGGATGAAAGGAA
GGTGCATTCCTCGCATGATATCAGCTGCCTCGGCCTTAGCCTTAGCCTCATCAAAGCAACCCCCCAAGAAGCCTTGAAGCTCACTCACTTCCCCACTGCTTCAATTTTGT
GTCTCTCTTGATCCCACAAAACAGAGGCCTGTTGCTTTGTCCAAATATGGTTGCCAAACACTTGGTTGTTTCTTGCAAATTTTGTATTAGCTTACTTTTTTGGAGCTGTA
TGCACAAGCACTCACATTTTCAACATTACGG
Protein sequenceShow/hide protein sequence
MQSNRRRRERALPAPSPQHQSRQSQGDDDDGCATSHEKSPKMGVWNSPISSKSASSPTSPFDSKMLWIGSPFSEARPCALSIPGPVRGKQKTKSSSVLQDDMSFTMPHVL
LLRNYKTGYVWNDLSENDIVYPAGGAEYVLKASELVDFCSEKLQEIHTGTNDRRPVQEPNLRTKTRKPQLAPSPLKELDDQPYSDLDYDEVEDYENEDVDKPIYTTTSTT
PHSRCSRGVSTEELPGPTQSPTDSTPFDSSRLSTSKRFTYEDELGTAPSRNSVLMQFISCGGSVSSKEKPGEGCREAGKEMGRRTESLGRGVVCKMAGKRIGEEEMIKYM
SENPRFGKLQTEEKEYFSGSIVESIREDRHVVEPVLKKSSSYNEEKSKRGELGEKRDENEETRREEDDGGMKGRCIPRMISAASALALASSKQPPKKP