CuGenDBv2

MC09g0406 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC09g0406
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: protein UPSTREAM OF FLC isoform X1
Genome location: MC09:3675623..3679506
RNA-Seq Expression: MC09g0406
Synteny: MC09g0406
Gene Ontology terms: GO:0051258 - protein polymerization (biological process)
                     GO:0005886 - plasma membrane (cellular component)
InterPro domains: IPR010369 - Protein SOSEKI
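
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not say so), the string can be split and the genomic span computed as below. `parse_location` is a hypothetical helper name, not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split 'chrom:start..end' into (chrom, start, end) with integer coordinates."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("MC09:3675623..3679506")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the span covers the whole gene region, so it can be longer than the CDS listed under Sequences.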


Homology
GenBank top hits (e value, % identity, alignment)
XP_008461226.1 PREDICTED: protein UPSTREAM OF FLC isoform X1 [Cucumis melo] | e value: 1.44e-170 | % identity: 69.44
Query:  LLIMQLKKRSGLLMEANN--KGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQD
        +++M+L+KR+  LMEANN  +GGEIR+VHI+YFLSRMGHVEQPHLIRVHHLA A+      GV+LRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQD
Subjt:  LLIMQLKKRSGLLMEANN--KGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQD

Query:  LVDDDLITPFSDNEYVLQGSQIIQFPSLFRNFSTPTLFRSDELEPTRDFPAKLQMNG--ESPPCDSERSTVTDDGDSIKVEEET-KNFLETAKQGI----
        LVDDDLITP SDNEYVLQGSQII FPS    F T      +ELE + DF +KLQ     +SPP DSERSTVTDDGDS+KVEEET KN     KQG+    
Subjt:  LVDDDLITPFSDNEYVLQGSQIIQFPSLFRNFSTPTLFRSDELEPTRDFPAKLQMNG--ESPPCDSERSTVTDDGDSIKVEEET-KNFLETAKQGI----

Query:  GVEEIEGFNNQYSS--LYEKL---KEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCG--AVDTNDTVLIKNRSA
         V+EIEGF  QYSS  LY KL   K+EK  +KD M KEGG TATSTVSSSS        PAFTKSKSYSSGASHV RQ ITCG  AVDTNDTVL+KNRS 
Subjt:  GVEEIEGFNNQYSS--LYEKL---KEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCG--AVDTNDTVLIKNRSA

Query:  AKDPPNLPEKPKNDAVICRDDMLGGSARVLRSSWD-GQLDIHRHNNQQNSRKSFDELRKKRPKESGGRKVAAATAAFKAMGGPNCSQCGKSFKPEKMHAH
         KDPP   EKPKNDAV+CRD++LGGSARV+ SSWD G L+I R    Q SR S D+LRKKRPKE+G  KV AAT  +K M GPNCS CGK+F+PEKMH+H
Subjt:  AKDPPNLPEKPKNDAVICRDDMLGGSARVLRSSWD-GQLDIHRHNNQQNSRKSFDELRKKRPKESGGRKVAAATAAFKAMGGPNCSQCGKSFKPEKMHAH

Query:  MKSCRGLKSLLKT-PSTSNKTTSSKSTTTTTS
        MKSCRG+KSL K  P+TS+KTT SKSTTTTTS
Subjt:  MKSCRGLKSLLKT-PSTSNKTTSSKSTTTTTS

XP_008461228.1 PREDICTED: protein UPSTREAM OF FLC isoform X3 [Cucumis melo] | e value: 7.49e-168 | % identity: 70.41
Query:  MEANN--KGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDN
        MEANN  +GGEIR+VHI+YFLSRMGHVEQPHLIRVHHLA A+      GV+LRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITP SDN
Subjt:  MEANN--KGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDN

Query:  EYVLQGSQIIQFPSLFRNFSTPTLFRSDELEPTRDFPAKLQMNG--ESPPCDSERSTVTDDGDSIKVEEET-KNFLETAKQGI----GVEEIEGFNNQYS
        EYVLQGSQII FPS    F T      +ELE + DF +KLQ     +SPP DSERSTVTDDGDS+KVEEET KN     KQG+     V+EIEGF  QYS
Subjt:  EYVLQGSQIIQFPSLFRNFSTPTLFRSDELEPTRDFPAKLQMNG--ESPPCDSERSTVTDDGDSIKVEEET-KNFLETAKQGI----GVEEIEGFNNQYS

Query:  S--LYEKL---KEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCG--AVDTNDTVLIKNRSAAKDPPNLPEKPKN
        S  LY KL   K+EK  +KD M KEGG TATSTVSSSS        PAFTKSKSYSSGASHV RQ ITCG  AVDTNDTVL+KNRS  KDPP   EKPKN
Subjt:  S--LYEKL---KEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCG--AVDTNDTVLIKNRSAAKDPPNLPEKPKN

Query:  DAVICRDDMLGGSARVLRSSWD-GQLDIHRHNNQQNSRKSFDELRKKRPKESGGRKVAAATAAFKAMGGPNCSQCGKSFKPEKMHAHMKSCRGLKSLLKT
        DAV+CRD++LGGSARV+ SSWD G L+I R    Q SR S D+LRKKRPKE+G  KV AAT  +K M GPNCS CGK+F+PEKMH+HMKSCRG+KSL K 
Subjt:  DAVICRDDMLGGSARVLRSSWD-GQLDIHRHNNQQNSRKSFDELRKKRPKESGGRKVAAATAAFKAMGGPNCSQCGKSFKPEKMHAHMKSCRGLKSLLKT

Query:  -PSTSNKTTSSKSTTTTTS
         P+TS+KTT SKSTTTTTS
Subjt:  -PSTSNKTTSSKSTTTTTS

XP_022144952.1 protein UPSTREAM OF FLC isoform X1 [Momordica charantia] | e value: 6.68e-276 | % identity: 95.48
Query:  MQLKKRSGLLMEANNKGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDD
        MQLKKRSGLLMEANNKGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAA TATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDD
Subjt:  MQLKKRSGLLMEANNKGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDD

Query:  LITPFSDNEYVLQGSQIIQFPSLFRNFSTP---------TLFRSDELEPTRDFPAKLQMNGESPPCDSERSTVTDDGDSIKVEEETKNFLETAKQGIGVE
        LITPFSDNEYVLQGSQIIQFP LF N             ++FRSDELEPTRDFPAKLQMNGESP CDSERSTVTDDGDSIKVEEETKNFLETAKQGIGVE
Subjt:  LITPFSDNEYVLQGSQIIQFPSLFRNFSTP---------TLFRSDELEPTRDFPAKLQMNGESPPCDSERSTVTDDGDSIKVEEETKNFLETAKQGIGVE

Query:  EIEGFNNQYSSLYEKLKEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCGAVDTNDTVLIKNRSAAKDPPNLPEK
        EIEGFNNQYSSLYEKLKEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCGAVDTNDTVLIKNRSAAKDPPNLPEK
Subjt:  EIEGFNNQYSSLYEKLKEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCGAVDTNDTVLIKNRSAAKDPPNLPEK

Query:  PKNDAVICRDDMLGGSARVLRSSWDGQLDIHRHNNQQNSRKSFDELRKKRPKESGGRKVAAATAAFKAMGGPNCSQCGKSFKPEKMHAHMKSCRGLKSLL
        PKNDAVICRDDMLGGSARVLRSSWDGQLDIHRHNNQQNSRKSFDELRKKRPKESGGRKVAAATAAFKAMGGPNCSQCGKSFKPEKMHAHMKSCRGLKSLL
Subjt:  PKNDAVICRDDMLGGSARVLRSSWDGQLDIHRHNNQQNSRKSFDELRKKRPKESGGRKVAAATAAFKAMGGPNCSQCGKSFKPEKMHAHMKSCRGLKSLL

Query:  KTPSTSNKTTSSKSTTTTTS
        KTPSTSNKTTSSKSTTTTTS
Subjt:  KTPSTSNKTTSSKSTTTTTS

XP_022144961.1 protein UPSTREAM OF FLC isoform X2 [Momordica charantia] | e value: 4.68e-269 | % identity: 95.37
Query:  MEANNKGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDNEY
        MEANNKGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAA TATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDNEY
Subjt:  MEANNKGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDNEY

Query:  VLQGSQIIQFPSLFRNFSTP---------TLFRSDELEPTRDFPAKLQMNGESPPCDSERSTVTDDGDSIKVEEETKNFLETAKQGIGVEEIEGFNNQYS
        VLQGSQIIQFP LF N             ++FRSDELEPTRDFPAKLQMNGESP CDSERSTVTDDGDSIKVEEETKNFLETAKQGIGVEEIEGFNNQYS
Subjt:  VLQGSQIIQFPSLFRNFSTP---------TLFRSDELEPTRDFPAKLQMNGESPPCDSERSTVTDDGDSIKVEEETKNFLETAKQGIGVEEIEGFNNQYS

Query:  SLYEKLKEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCGAVDTNDTVLIKNRSAAKDPPNLPEKPKNDAVICRD
        SLYEKLKEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCGAVDTNDTVLIKNRSAAKDPPNLPEKPKNDAVICRD
Subjt:  SLYEKLKEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCGAVDTNDTVLIKNRSAAKDPPNLPEKPKNDAVICRD

Query:  DMLGGSARVLRSSWDGQLDIHRHNNQQNSRKSFDELRKKRPKESGGRKVAAATAAFKAMGGPNCSQCGKSFKPEKMHAHMKSCRGLKSLLKTPSTSNKTT
        DMLGGSARVLRSSWDGQLDIHRHNNQQNSRKSFDELRKKRPKESGGRKVAAATAAFKAMGGPNCSQCGKSFKPEKMHAHMKSCRGLKSLLKTPSTSNKTT
Subjt:  DMLGGSARVLRSSWDGQLDIHRHNNQQNSRKSFDELRKKRPKESGGRKVAAATAAFKAMGGPNCSQCGKSFKPEKMHAHMKSCRGLKSLLKTPSTSNKTT

Query:  SSKSTTTTTS
        SSKSTTTTTS
Subjt:  SSKSTTTTTS

XP_038899908.1 LOW QUALITY PROTEIN: uncharacterized protein LOC120087098 [Benincasa hispida] | e value: 6.32e-175 | % identity: 69.86
Query:  MQLKKRSGLLMEANNKGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDD
        M+L+KR+  LMEAN KGGEIR+VHI+YFLSRMGHVEQPHLIRVHHL          GV+LRDVKRWLGELRGK+MPEAFSWSYKRKYKTGYVWQDLVDDD
Subjt:  MQLKKRSGLLMEANNKGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDD

Query:  LITPFSDNEYVLQGSQIIQFP-----SLFRNFST----------PTLFRSDELEPTRDFPAK-LQMN-GESPPCDSERSTVTDDGDSIKVEEETKNFLET
        LITP SDNEYVLQGSQII FP     S F NFS            +LFR +ELE   DF +K LQ N  ESPPCDSERSTVTDDGDS+KVEEET   LET
Subjt:  LITPFSDNEYVLQGSQIIQFP-----SLFRNFST----------PTLFRSDELEPTRDFPAK-LQMN-GESPPCDSERSTVTDDGDSIKVEEETKNFLET

Query:  A-KQGIGVE--EIEGFNNQYSSLYEKL---KEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCG--AVDTNDTVL
          KQGI  E  EIEGF  QYSSLY KL    +EK  EKD MEKEGGPTATSTV+SSS        PAFTKSKSYSSGASHVLRQWITCG  AVDTND VL
Subjt:  A-KQGIGVE--EIEGFNNQYSSLYEKL---KEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCG--AVDTNDTVL

Query:  IKNRSAAKDPPNLPE--KPKNDAVICRDDMLGGSARVLRSSWDGQLDIHRHNNQQNSRKSFDELRKKRPKESGGRKVAAATAAFKAMGGPNCSQCGKSFK
        +KNRS  KDPPN P   K KNDAV CRDD+LGGSARV+ +S +G L+I R  + Q SRKSFD+ RKKRPKESG RKVAA TA +K M GPNC  CGK+F+
Subjt:  IKNRSAAKDPPNLPE--KPKNDAVICRDDMLGGSARVLRSSWDGQLDIHRHNNQQNSRKSFDELRKKRPKESGGRKVAAATAAFKAMGGPNCSQCGKSFK

Query:  PEKMHAHMKSCRGLKSLLKTPSTSNKTTSSKSTTTTTS
        PEKMH+HMKSC+G++SL KT STS+K   SKSTTT TS
Subjt:  PEKMHAHMKSCRGLKSLLKTPSTSNKTTSSKSTTTTTS

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CE79 protein UPSTREAM OF FLC isoform X1 | e value: 6.96e-171 | % identity: 69.44
Query:  LLIMQLKKRSGLLMEANN--KGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQD
        +++M+L+KR+  LMEANN  +GGEIR+VHI+YFLSRMGHVEQPHLIRVHHLA A+      GV+LRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQD
Subjt:  LLIMQLKKRSGLLMEANN--KGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQD

Query:  LVDDDLITPFSDNEYVLQGSQIIQFPSLFRNFSTPTLFRSDELEPTRDFPAKLQMNG--ESPPCDSERSTVTDDGDSIKVEEET-KNFLETAKQGI----
        LVDDDLITP SDNEYVLQGSQII FPS    F T      +ELE + DF +KLQ     +SPP DSERSTVTDDGDS+KVEEET KN     KQG+    
Subjt:  LVDDDLITPFSDNEYVLQGSQIIQFPSLFRNFSTPTLFRSDELEPTRDFPAKLQMNG--ESPPCDSERSTVTDDGDSIKVEEET-KNFLETAKQGI----

Query:  GVEEIEGFNNQYSS--LYEKL---KEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCG--AVDTNDTVLIKNRSA
         V+EIEGF  QYSS  LY KL   K+EK  +KD M KEGG TATSTVSSSS        PAFTKSKSYSSGASHV RQ ITCG  AVDTNDTVL+KNRS 
Subjt:  GVEEIEGFNNQYSS--LYEKL---KEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCG--AVDTNDTVLIKNRSA

Query:  AKDPPNLPEKPKNDAVICRDDMLGGSARVLRSSWD-GQLDIHRHNNQQNSRKSFDELRKKRPKESGGRKVAAATAAFKAMGGPNCSQCGKSFKPEKMHAH
         KDPP   EKPKNDAV+CRD++LGGSARV+ SSWD G L+I R    Q SR S D+LRKKRPKE+G  KV AAT  +K M GPNCS CGK+F+PEKMH+H
Subjt:  AKDPPNLPEKPKNDAVICRDDMLGGSARVLRSSWD-GQLDIHRHNNQQNSRKSFDELRKKRPKESGGRKVAAATAAFKAMGGPNCSQCGKSFKPEKMHAH

Query:  MKSCRGLKSLLKT-PSTSNKTTSSKSTTTTTS
        MKSCRG+KSL K  P+TS+KTT SKSTTTTTS
Subjt:  MKSCRGLKSLLKT-PSTSNKTTSSKSTTTTTS

A0A1S3CE85 uncharacterized protein LOC103499876 isoform X2 | e value: 3.08e-166 | % identity: 68.75
Query:  LLIMQLKKRSGLLMEANN--KGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQD
        +++M+L+KR+  LMEANN  +GGEIR+VHI+YFLSRMGHVEQPHLIRVHHLA A+      GV+LRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQD
Subjt:  LLIMQLKKRSGLLMEANN--KGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQD

Query:  LVDDDLITPFSDNEYVLQGSQIIQFPSLFRNFSTPTLFRSDELEPTRDFPAKLQMNG--ESPPCDSERSTVTDDGDSIKVEEET-KNFLETAKQGI----
        LVDDDLITP SDNEYVLQGSQII FPS    F T      +ELE + DF +KLQ     +SPP DSERSTVTDDGDS+KVEEET KN     KQG+    
Subjt:  LVDDDLITPFSDNEYVLQGSQIIQFPSLFRNFSTPTLFRSDELEPTRDFPAKLQMNG--ESPPCDSERSTVTDDGDSIKVEEET-KNFLETAKQGI----

Query:  GVEEIEGFNNQYSS--LYEKL---KEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCG--AVDTNDTVLIKNRSA
         V+EIEGF  QYSS  LY KL   K+EK  +KD M KE    ATSTVSSSS        PAFTKSKSYSSGASHV RQ ITCG  AVDTNDTVL+KNRS 
Subjt:  GVEEIEGFNNQYSS--LYEKL---KEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCG--AVDTNDTVLIKNRSA

Query:  AKDPPNLPEKPKNDAVICRDDMLGGSARVLRSSWD-GQLDIHRHNNQQNSRKSFDELRKKRPKESGGRKVAAATAAFKAMGGPNCSQCGKSFKPEKMHAH
         KDPP   EKPKNDAV+CRD++LGGSARV+ SSWD G L+I R    Q SR S D+LRKKRPKE+G  KV AAT  +K M GPNCS CGK+F+PEKMH+H
Subjt:  AKDPPNLPEKPKNDAVICRDDMLGGSARVLRSSWD-GQLDIHRHNNQQNSRKSFDELRKKRPKESGGRKVAAATAAFKAMGGPNCSQCGKSFKPEKMHAH

Query:  MKSCRGLKSLLKT-PSTSNKTTSSKSTTTTTS
        MKSCRG+KSL K  P+TS+KTT SKSTTTTTS
Subjt:  MKSCRGLKSLLKT-PSTSNKTTSSKSTTTTTS

A0A1S3CEM9 protein UPSTREAM OF FLC isoform X3 | e value: 3.63e-168 | % identity: 70.41
Query:  MEANN--KGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDN
        MEANN  +GGEIR+VHI+YFLSRMGHVEQPHLIRVHHLA A+      GV+LRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITP SDN
Subjt:  MEANN--KGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDN

Query:  EYVLQGSQIIQFPSLFRNFSTPTLFRSDELEPTRDFPAKLQMNG--ESPPCDSERSTVTDDGDSIKVEEET-KNFLETAKQGI----GVEEIEGFNNQYS
        EYVLQGSQII FPS    F T      +ELE + DF +KLQ     +SPP DSERSTVTDDGDS+KVEEET KN     KQG+     V+EIEGF  QYS
Subjt:  EYVLQGSQIIQFPSLFRNFSTPTLFRSDELEPTRDFPAKLQMNG--ESPPCDSERSTVTDDGDSIKVEEET-KNFLETAKQGI----GVEEIEGFNNQYS

Query:  S--LYEKL---KEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCG--AVDTNDTVLIKNRSAAKDPPNLPEKPKN
        S  LY KL   K+EK  +KD M KEGG TATSTVSSSS        PAFTKSKSYSSGASHV RQ ITCG  AVDTNDTVL+KNRS  KDPP   EKPKN
Subjt:  S--LYEKL---KEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCG--AVDTNDTVLIKNRSAAKDPPNLPEKPKN

Query:  DAVICRDDMLGGSARVLRSSWD-GQLDIHRHNNQQNSRKSFDELRKKRPKESGGRKVAAATAAFKAMGGPNCSQCGKSFKPEKMHAHMKSCRGLKSLLKT
        DAV+CRD++LGGSARV+ SSWD G L+I R    Q SR S D+LRKKRPKE+G  KV AAT  +K M GPNCS CGK+F+PEKMH+HMKSCRG+KSL K 
Subjt:  DAVICRDDMLGGSARVLRSSWD-GQLDIHRHNNQQNSRKSFDELRKKRPKESGGRKVAAATAAFKAMGGPNCSQCGKSFKPEKMHAHMKSCRGLKSLLKT

Query:  -PSTSNKTTSSKSTTTTTS
         P+TS+KTT SKSTTTTTS
Subjt:  -PSTSNKTTSSKSTTTTTS

A0A6J1CUY7 protein UPSTREAM OF FLC isoform X2 | e value: 2.27e-269 | % identity: 95.37
Query:  MEANNKGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDNEY
        MEANNKGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAA TATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDNEY
Subjt:  MEANNKGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDNEY

Query:  VLQGSQIIQFPSLFRNFSTP---------TLFRSDELEPTRDFPAKLQMNGESPPCDSERSTVTDDGDSIKVEEETKNFLETAKQGIGVEEIEGFNNQYS
        VLQGSQIIQFP LF N             ++FRSDELEPTRDFPAKLQMNGESP CDSERSTVTDDGDSIKVEEETKNFLETAKQGIGVEEIEGFNNQYS
Subjt:  VLQGSQIIQFPSLFRNFSTP---------TLFRSDELEPTRDFPAKLQMNGESPPCDSERSTVTDDGDSIKVEEETKNFLETAKQGIGVEEIEGFNNQYS

Query:  SLYEKLKEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCGAVDTNDTVLIKNRSAAKDPPNLPEKPKNDAVICRD
        SLYEKLKEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCGAVDTNDTVLIKNRSAAKDPPNLPEKPKNDAVICRD
Subjt:  SLYEKLKEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCGAVDTNDTVLIKNRSAAKDPPNLPEKPKNDAVICRD

Query:  DMLGGSARVLRSSWDGQLDIHRHNNQQNSRKSFDELRKKRPKESGGRKVAAATAAFKAMGGPNCSQCGKSFKPEKMHAHMKSCRGLKSLLKTPSTSNKTT
        DMLGGSARVLRSSWDGQLDIHRHNNQQNSRKSFDELRKKRPKESGGRKVAAATAAFKAMGGPNCSQCGKSFKPEKMHAHMKSCRGLKSLLKTPSTSNKTT
Subjt:  DMLGGSARVLRSSWDGQLDIHRHNNQQNSRKSFDELRKKRPKESGGRKVAAATAAFKAMGGPNCSQCGKSFKPEKMHAHMKSCRGLKSLLKTPSTSNKTT

Query:  SSKSTTTTTS
        SSKSTTTTTS
Subjt:  SSKSTTTTTS

A0A6J1CV58 protein UPSTREAM OF FLC isoform X1 | e value: 3.24e-276 | % identity: 95.48
Query:  MQLKKRSGLLMEANNKGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDD
        MQLKKRSGLLMEANNKGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAA TATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDD
Subjt:  MQLKKRSGLLMEANNKGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDD

Query:  LITPFSDNEYVLQGSQIIQFPSLFRNFSTP---------TLFRSDELEPTRDFPAKLQMNGESPPCDSERSTVTDDGDSIKVEEETKNFLETAKQGIGVE
        LITPFSDNEYVLQGSQIIQFP LF N             ++FRSDELEPTRDFPAKLQMNGESP CDSERSTVTDDGDSIKVEEETKNFLETAKQGIGVE
Subjt:  LITPFSDNEYVLQGSQIIQFPSLFRNFSTP---------TLFRSDELEPTRDFPAKLQMNGESPPCDSERSTVTDDGDSIKVEEETKNFLETAKQGIGVE

Query:  EIEGFNNQYSSLYEKLKEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCGAVDTNDTVLIKNRSAAKDPPNLPEK
        EIEGFNNQYSSLYEKLKEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCGAVDTNDTVLIKNRSAAKDPPNLPEK
Subjt:  EIEGFNNQYSSLYEKLKEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCGAVDTNDTVLIKNRSAAKDPPNLPEK

Query:  PKNDAVICRDDMLGGSARVLRSSWDGQLDIHRHNNQQNSRKSFDELRKKRPKESGGRKVAAATAAFKAMGGPNCSQCGKSFKPEKMHAHMKSCRGLKSLL
        PKNDAVICRDDMLGGSARVLRSSWDGQLDIHRHNNQQNSRKSFDELRKKRPKESGGRKVAAATAAFKAMGGPNCSQCGKSFKPEKMHAHMKSCRGLKSLL
Subjt:  PKNDAVICRDDMLGGSARVLRSSWDGQLDIHRHNNQQNSRKSFDELRKKRPKESGGRKVAAATAAFKAMGGPNCSQCGKSFKPEKMHAHMKSCRGLKSLL

Query:  KTPSTSNKTTSSKSTTTTTS
        KTPSTSNKTTSSKSTTTTTS
Subjt:  KTPSTSNKTTSSKSTTTTTS

SwissProt top hits (e value, % identity, alignment)
A0A2R6X6S3 Protein SOSEKI | e value: 9.0e-16 | % identity: 27.04
Query:  RVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDNEYVLQGSQII---
        +V ++Y+LSR G ++QPHLI V        +T   G++LRDVKR L  +RGK M ++FSWS KR YK  ++WQDL DDD I P SD E VL+GS++    
Subjt:  RVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDNEYVLQGSQII---

Query:  -----------------QFPSLFRNFSTPTL----------FRSDELEPTRDFPAKLQMNGESPPCDSERSTVTDDGDSIKVEEETKNFLETAKQGIGVE
                         Q P+  +  ++               SD+L+   D  A L ++ +    D      + D  S+ + ++  N L  AK     E
Subjt:  -----------------QFPSLFRNFSTPTL----------FRSDELEPTRDFPAKLQMNGESPPCDSERSTVTDDGDSIKVEEETKNFLETAKQGIGVE

Query:  EIEGFNNQYSSLYEKLKEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCGAVDTNDTVLIKNRSAAKD-PPNLP-
         +   +    S      EE  ++K    +     + ST +S+SSS +   H      K+ S G    L +             L ++R  +++ PP +P 
Subjt:  EIEGFNNQYSSLYEKLKEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCGAVDTNDTVLIKNRSAAKD-PPNLP-

Query:  EKPKNDAVICRDDMLGGSARVLRSSWDGQLDIHRHNNQQNSRKSFDELRKKRPKE
        E P+  +     ++   + R L+   +   ++ R   ++ SR    EL ++ P+E
Subjt:  EKPKNDAVICRDDMLGGSARVLRSSWDGQLDIHRHNNQQNSRKSFDELRKKRPKE

Q8GY65 Protein SOSEKI 4 | e value: 1.3e-17 | % identity: 36.31
Query:  NNKGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDNEYVLQ
        ++K    R V ++Y+LSR G ++ PH I V         ++  G++L+DV   L +LRG  M   +SWS KR YK G+VW DL D+D I P    EYVL+
Subjt:  NNKGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDNEYVLQ

Query:  GSQIIQFPSLFRNFSTPTLFRSDELEPTRDFPA--KLQMNGESP-PCDSERSTVTDD
        GSQI+   +   NFS  T  R+        +      ++N E+      + ST TDD
Subjt:  GSQIIQFPSLFRNFSTPTLFRSDELEPTRDFPA--KLQMNGESP-PCDSERSTVTDD

Q8GYT8 Protein SOSEKI 3 | e value: 8.2e-17 | % identity: 43.43
Query:  EIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDNEYVLQGSQI
        +I++V I+Y+LS+   +E PH + V         ++  G++LRDV   L  LRG+ M   +SWS KR Y+ G+VW DL +DDLI P + NEYVL+GS++
Subjt:  EIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDNEYVLQGSQI

Q9FJF5 Protein SOSEKI 5 | e value: 1.1e-13 | % identity: 28.03
Query:  RRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDNEYVLQGSQIIQ-
        R+V ++Y+L R G ++ PH I V       T ++  G++L+DV   L +LRGK M   +SWS KR YK G+VW DL +DD I P    EYVL+GS+++  
Subjt:  RRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDNEYVLQGSQIIQ-

Query:  -FPSLFRNFSTPTLFRSD-ELEPTR----DFPAKLQMNGESPPCDSERS---------TVTDDGDSIKVEEETKNFLETAKQGIGVEEIEGFNNQYSSLY
           S  R+    + FR    L P +    D PA +           + S         +  +    +  +  T+      ++    EEIE   +  S  Y
Subjt:  -FPSLFRNFSTPTLFRSD-ELEPTR----DFPAKLQMNGESPPCDSERS---------TVTDDGDSIKVEEETKNFLETAKQGIGVEEIEGFNNQYSSLY

Query:  EKLKEEKYIEKDKMEKEGGPTATSTVSS---------SSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCGAVDTND--TVLIKNRSAA
        E    E  + +D++      ++  T+ +            S SS+ H       S    AS VL Q I+CG +   +   VL+K++  A
Subjt:  EKLKEEKYIEKDKMEKEGGPTATSTVSS---------SSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCGAVDTND--TVLIKNRSAA

Q9SYJ8 Protein SOSEKI 1 | e value: 2.1e-52 | % identity: 38.98
Query:  MEANNKGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDNEY
        ME+N  GGE+RRV+++YFLSR GHV+ PHL+RVHHL+         GVFLRDVK+WL + RG  MP+AFSWS KR+YK GYVWQDL+DDDLITP SDNEY
Subjt:  MEANNKGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDNEY

Query:  VLQGSQII------QFPSLFRN---FSTPTLFRSDELEPTRDFPAKLQMNGESPPCDSERSTVTDDGDSIKVEEETKNFLETAKQGIGVEEIEGFNNQYS
        VL+GS+I+       +P++ +         +   ++L+  +    K+Q   ESP   S+RST T    S   EE T N              EGF     
Subjt:  VLQGSQII------QFPSLFRN---FSTPTLFRSDELEPTRDFPAKLQMNGESPPCDSERSTVTDDGDSIKVEEETKNFLETAKQGIGVEEIEGFNNQYS

Query:  SLYEKLKEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSS-GASHVLRQWITCGAVDTNDTVLI---KNRSAAKDPPNLPEKPKNDAV
            K ++ K +   +       +     S   S SS+++  ++ K+KSYSS  ASHVLR  + CG +DTND VL+   K+RS A  P            
Subjt:  SLYEKLKEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSS-GASHVLRQWITCGAVDTNDTVLI---KNRSAAKDPPNLPEKPKNDAV

Query:  ICRDDMLGGSARVLRSSWDGQLDIHRHNNQQNSRKSFD----ELRKKRPKESGGRKVAAATAAFKAMGGPNCSQCGKSFKPEKMHAHMKSCRGLKSLLKT
                        +W+ +   ++++ Q N+RKSF+     ++ K   E    KVA +    K    P CSQCGK FKPEKMH+HMK CRG+K+    
Subjt:  ICRDDMLGGSARVLRSSWDGQLDIHRHNNQQNSRKSFD----ELRKKRPKESGGRKVAAATAAFKAMGGPNCSQCGKSFKPEKMHAHMKSCRGLKSLLKT

Query:  PSTSNKTTSSKST
         S +N   +S +T
Subjt:  PSTSNKTTSSKST

Arabidopsis top hits (e value, % identity, alignment)
AT1G05577.1 Domain of unknown function (DUF966) | e value: 1.5e-53 | % identity: 38.98
Query:  MEANNKGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDNEY
        ME+N  GGE+RRV+++YFLSR GHV+ PHL+RVHHL+         GVFLRDVK+WL + RG  MP+AFSWS KR+YK GYVWQDL+DDDLITP SDNEY
Subjt:  MEANNKGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDNEY

Query:  VLQGSQII------QFPSLFRN---FSTPTLFRSDELEPTRDFPAKLQMNGESPPCDSERSTVTDDGDSIKVEEETKNFLETAKQGIGVEEIEGFNNQYS
        VL+GS+I+       +P++ +         +   ++L+  +    K+Q   ESP   S+RST T    S   EE T N              EGF     
Subjt:  VLQGSQII------QFPSLFRN---FSTPTLFRSDELEPTRDFPAKLQMNGESPPCDSERSTVTDDGDSIKVEEETKNFLETAKQGIGVEEIEGFNNQYS

Query:  SLYEKLKEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSS-GASHVLRQWITCGAVDTNDTVLI---KNRSAAKDPPNLPEKPKNDAV
            K ++ K +   +       +     S   S SS+++  ++ K+KSYSS  ASHVLR  + CG +DTND VL+   K+RS A  P            
Subjt:  SLYEKLKEEKYIEKDKMEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSS-GASHVLRQWITCGAVDTNDTVLI---KNRSAAKDPPNLPEKPKNDAV

Query:  ICRDDMLGGSARVLRSSWDGQLDIHRHNNQQNSRKSFD----ELRKKRPKESGGRKVAAATAAFKAMGGPNCSQCGKSFKPEKMHAHMKSCRGLKSLLKT
                        +W+ +   ++++ Q N+RKSF+     ++ K   E    KVA +    K    P CSQCGK FKPEKMH+HMK CRG+K+    
Subjt:  ICRDDMLGGSARVLRSSWDGQLDIHRHNNQQNSRKSFD----ELRKKRPKESGGRKVAAATAAFKAMGGPNCSQCGKSFKPEKMHAHMKSCRGLKSLLKT

Query:  PSTSNKTTSSKST
         S +N   +S +T
Subjt:  PSTSNKTTSSKST

AT2G28150.1 Domain of unknown function (DUF966) | e value: 5.8e-18 | % identity: 43.43
Query:  EIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDNEYVLQGSQI
        +I++V I+Y+LS+   +E PH + V         ++  G++LRDV   L  LRG+ M   +SWS KR Y+ G+VW DL +DDLI P + NEYVL+GS++
Subjt:  EIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDNEYVLQGSQI

AT3G46110.1 Domain of unknown function (DUF966) | e value: 9.0e-19 | % identity: 36.31
Query:  NNKGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDNEYVLQ
        ++K    R V ++Y+LSR G ++ PH I V         ++  G++L+DV   L +LRG  M   +SWS KR YK G+VW DL D+D I P    EYVL+
Subjt:  NNKGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDNEYVLQ

Query:  GSQIIQFPSLFRNFSTPTLFRSDELEPTRDFPA--KLQMNGESP-PCDSERSTVTDD
        GSQI+   +   NFS  T  R+        +      ++N E+      + ST TDD
Subjt:  GSQIIQFPSLFRNFSTPTLFRSDELEPTRDFPA--KLQMNGESP-PCDSERSTVTDD

AT3G46110.2 Domain of unknown function (DUF966) | e value: 9.0e-19 | % identity: 36.31
Query:  NNKGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDNEYVLQ
        ++K    R V ++Y+LSR G ++ PH I V         ++  G++L+DV   L +LRG  M   +SWS KR YK G+VW DL D+D I P    EYVL+
Subjt:  NNKGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDNEYVLQ

Query:  GSQIIQFPSLFRNFSTPTLFRSDELEPTRDFPA--KLQMNGESP-PCDSERSTVTDD
        GSQI+   +   NFS  T  R+        +      ++N E+      + ST TDD
Subjt:  GSQIIQFPSLFRNFSTPTLFRSDELEPTRDFPA--KLQMNGESP-PCDSERSTVTDD

AT5G59790.1 Domain of unknown function (DUF966) | e value: 7.9e-15 | % identity: 28.03
Query:  RRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDNEYVLQGSQIIQ-
        R+V ++Y+L R G ++ PH I V       T ++  G++L+DV   L +LRGK M   +SWS KR YK G+VW DL +DD I P    EYVL+GS+++  
Subjt:  RRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSDNEYVLQGSQIIQ-

Query:  -FPSLFRNFSTPTLFRSD-ELEPTR----DFPAKLQMNGESPPCDSERS---------TVTDDGDSIKVEEETKNFLETAKQGIGVEEIEGFNNQYSSLY
           S  R+    + FR    L P +    D PA +           + S         +  +    +  +  T+      ++    EEIE   +  S  Y
Subjt:  -FPSLFRNFSTPTLFRSD-ELEPTR----DFPAKLQMNGESPPCDSERS---------TVTDDGDSIKVEEETKNFLETAKQGIGVEEIEGFNNQYSSLY

Query:  EKLKEEKYIEKDKMEKEGGPTATSTVSS---------SSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCGAVDTND--TVLIKNRSAA
        E    E  + +D++      ++  T+ +            S SS+ H       S    AS VL Q I+CG +   +   VL+K++  A
Subjt:  EKLKEEKYIEKDKMEKEGGPTATSTVSS---------SSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCGAVDTND--TVLIKNRSAA
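
In the alignment blocks above, the unlabeled middle line is the usual BLAST match line: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. Because the Subjt lines on this page repeat the query text, the match line is the only place the per-column result can be read. The sketch below counts identities and positives for one short fragment. `midline_stats` is a hypothetical helper, and the fragment is just the first 18 columns of the Q9SYJ8 alignment, so its ratio will not equal the whole-alignment % identity reported in the hit header.

```python
def midline_stats(query, midline):
    """Count identical and positive columns from a query line and its match line."""
    if len(query) != len(midline):
        raise ValueError("query and match line must cover the same columns")
    # A letter in the match line that equals the query residue is an identity.
    identities = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    # Positives are identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return identities, positives, len(query)

query   = "MEANNKGGEIRRVHILYF"
midline = "ME+N  GGE+RRV+++YF"
identities, positives, columns = midline_stats(query, midline)
print(f"{identities}/{columns} identical, {positives}/{columns} positive")
```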


Sequences
CDS sequence
TTATTAATTATGCAGCTGAAGAAGAGATCAGGGTTGTTAATGGAGGCTAATAATAAAGGTGGGGAAATTAGAAGAGTTCATATTCTTTACTTTCTTAGCCGGATGGGACA
CGTGGAGCAACCTCATCTCATCCGTGTTCATCATCTCGCCGCCGCCGCCACCGCCACCGCCGTCGCCGGCGTTTTCCTCCGAGATGTAAAGAGGTGGCTAGGGGAATTGA
GAGGGAAGGAGATGCCAGAAGCCTTCTCATGGTCATACAAAAGAAAGTACAAAACAGGCTACGTTTGGCAAGACCTGGTGGATGACGATCTCATAACTCCATTTTCTGAC
AACGAATATGTCCTCCAAGGATCTCAAATAATTCAATTTCCCTCTCTATTTCGTAATTTCTCCACTCCCACTCTGTTCAGATCAGATGAATTGGAACCCACTCGAGATTT
TCCGGCCAAACTGCAAATGAATGGAGAATCGCCGCCGTGCGATTCGGAGAGATCGACGGTCACAGACGACGGTGATTCCATCAAGGTTGAAGAAGAAACCAAGAATTTTT
TGGAGACAGCAAAACAGGGAATAGGAGTAGAAGAAATTGAGGGATTCAACAACCAATATTCTTCGTTATATGAGAAATTGAAGGAGGAGAAATATATAGAGAAAGACAAA
ATGGAGAAAGAAGGTGGACCCACTGCCACGTCAACAGTTTCATCATCGTCGTCATCATCATCTTCATCTACTCATCCTGCTTTCACGAAGAGCAAGAGCTACTCGAGCGG
AGCTTCCCACGTGCTTCGCCAATGGATCACGTGCGGCGCCGTGGATACGAACGACACCGTTTTGATCAAGAACCGATCTGCCGCCAAAGATCCACCGAATCTGCCGGAGA
AACCCAAAAACGACGCCGTTATTTGCAGAGACGACATGTTGGGCGGCTCTGCCCGAGTTCTCCGAAGTTCTTGGGACGGGCAGCTCGATATCCACCGCCACAACAACCAA
CAAAACTCTCGGAAAAGCTTCGATGAATTAAGGAAGAAGAGGCCGAAGGAAAGCGGCGGGAGAAAGGTGGCGGCGGCGACGGCGGCTTTCAAGGCGATGGGCGGACCGAA
CTGCTCGCAATGTGGGAAGAGTTTCAAGCCGGAGAAGATGCACGCACACATGAAATCGTGCAGGGGGCTCAAGTCTCTGCTCAAGACTCCTTCAACTTCCAACAAAACGA
CATCGTCTAAGTCAACCACCACAACAACTTCC
mRNA sequence
TTATTAATTATGCAGCTGAAGAAGAGATCAGGGTTGTTAATGGAGGCTAATAATAAAGGTGGGGAAATTAGAAGAGTTCATATTCTTTACTTTCTTAGCCGGATGGGACA
CGTGGAGCAACCTCATCTCATCCGTGTTCATCATCTCGCCGCCGCCGCCACCGCCACCGCCGTCGCCGGCGTTTTCCTCCGAGATGTAAAGAGGTGGCTAGGGGAATTGA
GAGGGAAGGAGATGCCAGAAGCCTTCTCATGGTCATACAAAAGAAAGTACAAAACAGGCTACGTTTGGCAAGACCTGGTGGATGACGATCTCATAACTCCATTTTCTGAC
AACGAATATGTCCTCCAAGGATCTCAAATAATTCAATTTCCCTCTCTATTTCGTAATTTCTCCACTCCCACTCTGTTCAGATCAGATGAATTGGAACCCACTCGAGATTT
TCCGGCCAAACTGCAAATGAATGGAGAATCGCCGCCGTGCGATTCGGAGAGATCGACGGTCACAGACGACGGTGATTCCATCAAGGTTGAAGAAGAAACCAAGAATTTTT
TGGAGACAGCAAAACAGGGAATAGGAGTAGAAGAAATTGAGGGATTCAACAACCAATATTCTTCGTTATATGAGAAATTGAAGGAGGAGAAATATATAGAGAAAGACAAA
ATGGAGAAAGAAGGTGGACCCACTGCCACGTCAACAGTTTCATCATCGTCGTCATCATCATCTTCATCTACTCATCCTGCTTTCACGAAGAGCAAGAGCTACTCGAGCGG
AGCTTCCCACGTGCTTCGCCAATGGATCACGTGCGGCGCCGTGGATACGAACGACACCGTTTTGATCAAGAACCGATCTGCCGCCAAAGATCCACCGAATCTGCCGGAGA
AACCCAAAAACGACGCCGTTATTTGCAGAGACGACATGTTGGGCGGCTCTGCCCGAGTTCTCCGAAGTTCTTGGGACGGGCAGCTCGATATCCACCGCCACAACAACCAA
CAAAACTCTCGGAAAAGCTTCGATGAATTAAGGAAGAAGAGGCCGAAGGAAAGCGGCGGGAGAAAGGTGGCGGCGGCGACGGCGGCTTTCAAGGCGATGGGCGGACCGAA
CTGCTCGCAATGTGGGAAGAGTTTCAAGCCGGAGAAGATGCACGCACACATGAAATCGTGCAGGGGGCTCAAGTCTCTGCTCAAGACTCCTTCAACTTCCAACAAAACGA
CATCGTCTAAGTCAACCACCACAACAACTTCC
Protein sequence
LLIMQLKKRSGLLMEANNKGGEIRRVHILYFLSRMGHVEQPHLIRVHHLAAAATATAVAGVFLRDVKRWLGELRGKEMPEAFSWSYKRKYKTGYVWQDLVDDDLITPFSD
NEYVLQGSQIIQFPSLFRNFSTPTLFRSDELEPTRDFPAKLQMNGESPPCDSERSTVTDDGDSIKVEEETKNFLETAKQGIGVEEIEGFNNQYSSLYEKLKEEKYIEKDK
MEKEGGPTATSTVSSSSSSSSSSTHPAFTKSKSYSSGASHVLRQWITCGAVDTNDTVLIKNRSAAKDPPNLPEKPKNDAVICRDDMLGGSARVLRSSWDGQLDIHRHNNQ
QNSRKSFDELRKKRPKESGGRKVAAATAAFKAMGGPNCSQCGKSFKPEKMHAHMKSCRGLKSLLKTPSTSNKTTSSKSTTTTTS
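
The protein sequence above appears to be a frame-1 translation of the listed CDS. As a minimal sketch, assuming the standard genetic code applies (the page does not name a translation table), the first nine codons of the CDS can be translated and compared with the start of the protein. `translate` and `CODON_TABLE` are illustrative names, not CuGenDB tooling.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, amino acids listed in TCAG x TCAG x TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a DNA coding sequence in frame 1, ignoring a trailing partial codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First 27 nucleotides of the CDS sequence listed on this page.
print(translate("TTATTAATTATGCAGCTGAAGAAGAGA"))
```

The same function can be applied to the full CDS after joining its lines into one string; sequences containing ambiguity codes such as `N` would need extra handling that this sketch omits.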