; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS012648 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS012648
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionprotein UPSTREAM OF FLC
Genome locationscaffold63:2314006..2316637
RNA-Seq ExpressionMS012648
SyntenyMS012648
Gene Ontology termsGO:0050789 - regulation of biological process (biological process)
GO:0005886 - plasma membrane (cellular component)
InterPro domainsIPR010369 - Protein SOSEKI
IPR021182 - Protein SOSEKI, magnoliopsida


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022136668.1 protein UPSTREAM OF FLC [Momordica charantia]9.6e-10463.41Show/hide
Query:  MAEKKKTKKEEEEEGESMAMKKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADA
        MAEKKKTKKEEEEEGESMAMKK+QVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADA
Subjt:  MAEKKKTKKEEEEEGESMAMKKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADA

Query:  VGAEYLQQIQVSSSNRPPVQESHLPSKPRKQQLAPSPLREPDDSQPQDYELEYDEVEESEYEDGEKGSYTTSTTPHSRCSRGVSTDELEDHDPTRPLTQK
        VGAEY+                                                           KGS                  +L  H         
Subjt:  VGAEYLQQIQVSSSNRPPVQESHLPSKPRKQQLAPSPLREPDDSQPQDYELEYDEVEESEYEDGEKGSYTTSTTPHSRCSRGVSTDELEDHDPTRPLTQK

Query:  LTESTRFDSARLSSASPIATEHQQTRREDNAVSKRFTDDDELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKL
                                                ELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKL
Subjt:  LTESTRFDSARLSSASPIATEHQQTRREDNAVSKRFTDDDELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKL

Query:  HVGGDEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK
        HVGGDEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK
Subjt:  HVGGDEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK

XP_022941695.1 protein UPSTREAM OF FLC isoform X2 [Cucurbita moschata]1.4e-9962.99Show/hide
Query:  MKKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAVGAEY--------------
        MKK+QVFYYLSRNG+LEQPHF+E++L  N+PLRL+DVMDRL +LRGKAM +LYSWSCKR+YK GYVWNDLSEND+VYPA   GAEY              
Subjt:  MKKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAVGAEY--------------

Query:  LQQIQVSSSNRPPVQESHLPSKPRKQQLAPSPLREPDDSQPQDYELEYDEVEESEYEDGEKGSY-TTSTTPHSRCSRGVSTDELEDHDPTRPLTQKLTES
        LQ+I   +++R PVQE +L +K RK QLAPSPL+E DD QP   +L+YDEVE+ E ED +K  Y TTSTTPHSRCSRGVST+EL         TQ  T+S
Subjt:  LQQIQVSSSNRPPVQESHLPSKPRKQQLAPSPLREPDDSQPQDYELEYDEVEESEYEDGEKGSY-TTSTTPHSRCSRGVSTDELEDHDPTRPLTQKLTES

Query:  TRFDSARLSSASPIATEHQQTRREDNAVSKRFTDDDELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKLHVGG
        T FDS+RLS+                  SKRFT +DEL T  APSR+SVL+QFI+CG S  SK K G G  E  KE+GRRTESL +GV CK+AGK    G
Subjt:  TRFDSARLSSASPIATEHQQTRREDNAVSKRFTDDDELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKLHVGG

Query:  DEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK
        +EEMI YMSENPRFG LQ+EEKEYFSGSIVESIREDRHV EP LKKS+SYNEEK
Subjt:  DEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK

XP_022980975.1 protein UPSTREAM OF FLC isoform X2 [Cucurbita maxima]6.4e-10062.43Show/hide
Query:  MKKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAVGAEY--------------
        MKK+QVFYYLSRNG+LEQPHFVE++L  N+PLRL+DVMDRL VLRGKAM +LY+WSCKR+YK GYVWNDLSEND+VYPA+  GAEY              
Subjt:  MKKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAVGAEY--------------

Query:  LQQIQVSSSNRPPVQESHLPSKPRKQQLAPSPLREPDDSQPQDYELEYDEVEESEYEDGEKGSY-TTSTTPHSRCSRGVSTDELEDHDPTRPLTQKLTES
        +Q+I   +++R PVQE +L SK RK QLAPSPL+E DD    D  L+YDEVE+ E ED +K  Y TTSTTPHSRCSRGVST+EL         TQ  TES
Subjt:  LQQIQVSSSNRPPVQESHLPSKPRKQQLAPSPLREPDDSQPQDYELEYDEVEESEYEDGEKGSY-TTSTTPHSRCSRGVSTDELEDHDPTRPLTQKLTES

Query:  TRFDSARLSSASPIATEHQQTRREDNAVSKRFTDDDELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKLHVGG
        T FDS+RLS+                  SKRFT +DEL T  APSR+SVL+QFI+CG S+ SK K G G  E  KE+GRRTE L +GV CK+ G+    G
Subjt:  TRFDSARLSSASPIATEHQQTRREDNAVSKRFTDDDELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKLHVGG

Query:  DEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK
        +EEMI YMSENPRFG LQ+EEKEYFSGSIVESIREDRHV EP LKKS+SYNEEK
Subjt:  DEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK

XP_023525148.1 protein UPSTREAM OF FLC isoform X2 [Cucurbita pepo subsp. pepo]1.3e-10063.56Show/hide
Query:  MKKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAVGAEY--------------
        MKK+QVFYYLSRNG+LEQPHF+E++L  N+PLRL+DVMDRL +LRGKAM +LYSWSCKR+YK GYVWNDLSEND+VYPA+  GAEY              
Subjt:  MKKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAVGAEY--------------

Query:  LQQIQVSSSNRPPVQESHLPSKPRKQQLAPSPLREPDDSQPQDYELEYDEVEESEYEDGEKGSY-TTSTTPHSRCSRGVSTDELEDHDPTRPLTQKLTES
        LQ+I   S++R PVQE +L  K RKQQLAPS L+E DD QP   +L+YDEVE+ E ED +K  Y TTSTTPHSRCSRGVST+EL         TQ  TES
Subjt:  LQQIQVSSSNRPPVQESHLPSKPRKQQLAPSPLREPDDSQPQDYELEYDEVEESEYEDGEKGSY-TTSTTPHSRCSRGVSTDELEDHDPTRPLTQKLTES

Query:  TRFDSARLSSASPIATEHQQTRREDNAVSKRFTDDDELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKLHVGG
        T FDS+RLS+                  SKRFT +DEL T  APSR+SVL+QFI+CG S+ SK K G G  E  KE+GRRTESL +GV CK+AGK    G
Subjt:  TRFDSARLSSASPIATEHQQTRREDNAVSKRFTDDDELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKLHVGG

Query:  DEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK
        +EEMI YMSENPRFG LQ+EEKEYFSGSIVESIREDRHV EP LKKS+SYNEEK
Subjt:  DEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK

XP_038907002.1 protein UPSTREAM OF FLC [Benincasa hispida]2.9e-10061.01Show/hide
Query:  MAEKKKTKKEEEEEGESMAMKKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADA
        M E  +             M K+QVFYY+SRNG+LEQPHF+E+ L  + PLRL+DV+DRLAVLRG AMP LYSWSCKR+YK GYVWNDLSENDVVYPA+ 
Subjt:  MAEKKKTKKEEEEEGESMAMKKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADA

Query:  VGAEY----------------LQQIQVSSSNRPPVQESHLPSKPRKQQLAPSPLREPDDSQPQDYELEYDEVEESEYEDGEKGSYTTSTTPHSRCSRGVS
         G+EY                LQQI +S++ R PVQE +LP+K RKQQLAPSPL+E   +   D +LEYDEVE+ EY+DGEK  Y+T TTP SRCSRGVS
Subjt:  VGAEY----------------LQQIQVSSSNRPPVQESHLPSKPRKQQLAPSPLREPDDSQPQDYELEYDEVEESEYEDGEKGSYTTSTTPHSRCSRGVS

Query:  TDELEDHDPTRPLTQKLTESTRFDSARLSSASPIATEHQQTRREDNAVSKRFT--DDDELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIG
        T+  E   PTR  T    ESTRFDSARLS+                  SKRF   ++DEL TESAPSR+SVLLQFIACG S  SK KTGPG  EPA    
Subjt:  TDELEDHDPTRPLTQKLTESTRFDSARLSSASPIATEHQQTRREDNAVSKRFT--DDDELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIG

Query:  RRTE-SLRKGVACKIAGKLHVGGDEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK
        RRTE  L K V CK+AGK+   G+EEMI YMSENPRFG LQ+EEKEYFSGSIVESIREDRHV +P LKKSNSYNEEK
Subjt:  RRTE-SLRKGVACKIAGKLHVGGDEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK

TrEMBL top hitse value%identityAlignment
A0A2N9HLG3 Uncharacterized protein7.0e-8451.48Show/hide
Query:  MKKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAVGAEYLQ------------
        +KK+QV YYLSRNG LE PH++E+   AN+PLRLRDV +RL VLRGK MP+LYSWSCKRSYK GYVWNDL+END++YP++  GAEY+             
Subjt:  MKKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAVGAEYLQ------------

Query:  -QIQVSSSNRPPVQESHLPSKPRKQQLAPSPLREPDDSQPQDYELEYDEVEESEYEDGEKGSYTTSTTPHSRCSRGVSTDELEDHD-PTRPLTQK-LTE-
         Q QV  +NR  +QE +    P+++QLAPSP REP+D + ++YE E  E EE EYEDGEK SYT+STTPHSRCSRGVSTDELEDH+  T+P  QK  TE 
Subjt:  -QIQVSSSNRPPVQESHLPSKPRKQQLAPSPLREPDDSQPQDYELEYDEVEESEYEDGEKGSYTTSTTPHSRCSRGVSTDELEDHD-PTRPLTQK-LTE-

Query:  -STRFDSARLSS---------------ASPIATEHQ-----------QTRREDNAV--------------------SKRFTDDDELVTESAPSRSSVLLQ
         ST  DS+  +S                 P+A + +           Q++R D                       SKRF D D +  ESA +R+SVLLQ
Subjt:  -STRFDSARLSS---------------ASPIATEHQ-----------QTRREDNAV--------------------SKRFTDDDELVTESAPSRSSVLLQ

Query:  FIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKLHVGGDEEMINYMSENPRFGNLQSEEKEYFSGSIVESI--REDRHVA--EPALKKSN
         IACGSS+V K K G G +       ++ +SL KGV CK A K  V  ++ +I YMSENPRFGNLQSEEKEYFSGSIVES+   EDR V   +P LKKSN
Subjt:  FIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKLHVGGDEEMINYMSENPRFGNLQSEEKEYFSGSIVESI--REDRHVA--EPALKKSN

Query:  SYNEEK
        SYNEE+
Subjt:  SYNEEK

A0A6J1C868 protein UPSTREAM OF FLC4.6e-10463.41Show/hide
Query:  MAEKKKTKKEEEEEGESMAMKKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADA
        MAEKKKTKKEEEEEGESMAMKK+QVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADA
Subjt:  MAEKKKTKKEEEEEGESMAMKKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADA

Query:  VGAEYLQQIQVSSSNRPPVQESHLPSKPRKQQLAPSPLREPDDSQPQDYELEYDEVEESEYEDGEKGSYTTSTTPHSRCSRGVSTDELEDHDPTRPLTQK
        VGAEY+                                                           KGS                  +L  H         
Subjt:  VGAEYLQQIQVSSSNRPPVQESHLPSKPRKQQLAPSPLREPDDSQPQDYELEYDEVEESEYEDGEKGSYTTSTTPHSRCSRGVSTDELEDHDPTRPLTQK

Query:  LTESTRFDSARLSSASPIATEHQQTRREDNAVSKRFTDDDELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKL
                                                ELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKL
Subjt:  LTESTRFDSARLSSASPIATEHQQTRREDNAVSKRFTDDDELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKL

Query:  HVGGDEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK
        HVGGDEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK
Subjt:  HVGGDEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK

A0A6J1FUF7 protein UPSTREAM OF FLC isoform X26.9e-10062.99Show/hide
Query:  MKKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAVGAEY--------------
        MKK+QVFYYLSRNG+LEQPHF+E++L  N+PLRL+DVMDRL +LRGKAM +LYSWSCKR+YK GYVWNDLSEND+VYPA   GAEY              
Subjt:  MKKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAVGAEY--------------

Query:  LQQIQVSSSNRPPVQESHLPSKPRKQQLAPSPLREPDDSQPQDYELEYDEVEESEYEDGEKGSY-TTSTTPHSRCSRGVSTDELEDHDPTRPLTQKLTES
        LQ+I   +++R PVQE +L +K RK QLAPSPL+E DD QP   +L+YDEVE+ E ED +K  Y TTSTTPHSRCSRGVST+EL         TQ  T+S
Subjt:  LQQIQVSSSNRPPVQESHLPSKPRKQQLAPSPLREPDDSQPQDYELEYDEVEESEYEDGEKGSY-TTSTTPHSRCSRGVSTDELEDHDPTRPLTQKLTES

Query:  TRFDSARLSSASPIATEHQQTRREDNAVSKRFTDDDELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKLHVGG
        T FDS+RLS+                  SKRFT +DEL T  APSR+SVL+QFI+CG S  SK K G G  E  KE+GRRTESL +GV CK+AGK    G
Subjt:  TRFDSARLSSASPIATEHQQTRREDNAVSKRFTDDDELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKLHVGG

Query:  DEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK
        +EEMI YMSENPRFG LQ+EEKEYFSGSIVESIREDRHV EP LKKS+SYNEEK
Subjt:  DEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK

A0A6J1ISQ2 protein UPSTREAM OF FLC isoform X11.2e-8360.82Show/hide
Query:  DVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAVGAEY--------------LQQIQVSSSNRPPVQESHLPSKPRKQQLAPSPLRE
        DVMDRL VLRGKAM +LY+WSCKR+YK GYVWNDLSEND+VYPA+  GAEY              +Q+I   +++R PVQE +L SK RK QLAPSPL+E
Subjt:  DVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAVGAEY--------------LQQIQVSSSNRPPVQESHLPSKPRKQQLAPSPLRE

Query:  PDDSQPQDYELEYDEVEESEYEDGEKGSY-TTSTTPHSRCSRGVSTDELEDHDPTRPLTQKLTESTRFDSARLSSASPIATEHQQTRREDNAVSKRFTDD
         DD    D  L+YDEVE+ E ED +K  Y TTSTTPHSRCSRGVST+EL         TQ  TEST FDS+RLS+                  SKRFT +
Subjt:  PDDSQPQDYELEYDEVEESEYEDGEKGSY-TTSTTPHSRCSRGVSTDELEDHDPTRPLTQKLTESTRFDSARLSSASPIATEHQQTRREDNAVSKRFTDD

Query:  DELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKLHVGGDEEMINYMSENPRFGNLQSEEKEYFSGSIVESIRE
        DEL T  APSR+SVL+QFI+CG S+ SK K G G  E  KE+GRRTE L +GV CK+ G+    G+EEMI YMSENPRFG LQ+EEKEYFSGSIVESIRE
Subjt:  DELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKLHVGGDEEMINYMSENPRFGNLQSEEKEYFSGSIVESIRE

Query:  DRHVAEPALKKSNSYNEEK
        DRHV EP LKKS+SYNEEK
Subjt:  DRHVAEPALKKSNSYNEEK

A0A6J1IY41 protein UPSTREAM OF FLC isoform X23.1e-10062.43Show/hide
Query:  MKKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAVGAEY--------------
        MKK+QVFYYLSRNG+LEQPHFVE++L  N+PLRL+DVMDRL VLRGKAM +LY+WSCKR+YK GYVWNDLSEND+VYPA+  GAEY              
Subjt:  MKKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAVGAEY--------------

Query:  LQQIQVSSSNRPPVQESHLPSKPRKQQLAPSPLREPDDSQPQDYELEYDEVEESEYEDGEKGSY-TTSTTPHSRCSRGVSTDELEDHDPTRPLTQKLTES
        +Q+I   +++R PVQE +L SK RK QLAPSPL+E DD    D  L+YDEVE+ E ED +K  Y TTSTTPHSRCSRGVST+EL         TQ  TES
Subjt:  LQQIQVSSSNRPPVQESHLPSKPRKQQLAPSPLREPDDSQPQDYELEYDEVEESEYEDGEKGSY-TTSTTPHSRCSRGVSTDELEDHDPTRPLTQKLTES

Query:  TRFDSARLSSASPIATEHQQTRREDNAVSKRFTDDDELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKLHVGG
        T FDS+RLS+                  SKRFT +DEL T  APSR+SVL+QFI+CG S+ SK K G G  E  KE+GRRTE L +GV CK+ G+    G
Subjt:  TRFDSARLSSASPIATEHQQTRREDNAVSKRFTDDDELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKLHVGG

Query:  DEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK
        +EEMI YMSENPRFG LQ+EEKEYFSGSIVESIREDRHV EP LKKS+SYNEEK
Subjt:  DEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK

SwissProt top hitse value%identityAlignment
A0A2K1J5A5 Protein SOSEKI 38.6e-1542.86Show/hide
Query:  LQVFYYLSRNGQLEQPHFVELNLPANR-PLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAVGAEYLQQIQV--SSSNRP
        +QV Y LS +G+LE PH +E++ P N+  LRLRDV  RL  LRG  +   +SWS KR+YK  ++WNDL ++DV+ P    G   L+  ++  + +N+P
Subjt:  LQVFYYLSRNGQLEQPHFVELNLPANR-PLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAVGAEYLQQIQV--SSSNRP

Q8GY65 Protein SOSEKI 45.6e-2231.64Show/hide
Query:  VFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAVGAEY-LQQIQVSSSNRPPVQESH
        V YYLSRNG+L+ PHF+E+ L ++  L L+DV++RL  LRG  M  LYSWS KR+YK G+VW DLS+ D ++P    G EY L+  Q+   +      S 
Subjt:  VFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAVGAEY-LQQIQVSSSNRPPVQESH

Query:  LPSKPRKQQLAPSPLREPDDSQPQDYELEYDEVEESEYEDGEKGSYTTSTTPHSRCSRGVSTDELEDHDPTRPLTQKLTESTRFDSARLSSASPIATEHQ
                  A +  R    S    Y++   +  E   E   K S   ST    R  R    DE+  ++ T    +++T   + DS      SP   E  
Subjt:  LPSKPRKQQLAPSPLREPDDSQPQDYELEYDEVEESEYEDGEKGSYTTSTTPHSRCSRGVSTDELEDHDPTRPLTQKLTESTRFDSARLSSASPIATEHQ

Query:  QTRREDNAVSKRFTDDDELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKLHVGGDEEMINYMSENPRFGNLQS
           R D  +     D +   T      S+VL+Q I+CG+ S  K         P    G    +  +G            G+  +     E   FG ++ 
Subjt:  QTRREDNAVSKRFTDDDELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKLHVGGDEEMINYMSENPRFGNLQS

Query:  EEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK
        EEKEYFSGS+++   +   V  PALK+S+SYN ++
Subjt:  EEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK

Q8GYT8 Protein SOSEKI 31.6e-2429.61Show/hide
Query:  MKKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADA-------------------
        +KK+Q+ YYLS+N QLE PHF+E+ + +   L LRDV++RL VLRG+ M ++YSWS KRSY+ G+VW+DLSE+D++ PA+                    
Subjt:  MKKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADA-------------------

Query:  -----VGAEYLQQIQVSSSNRPPVQESHLPSKPRK-----------QQLAPSPLR-------EPDDSQPQDYE----LEYDEVEESEYEDGEKGSYTT--
             +  + ++QI V   +   + +S   S                +L+P  LR        PD    ++       EY   +     D    +  T  
Subjt:  -----VGAEYLQQIQVSSSNRPPVQESHLPSKPRK-----------QQLAPSPLR-------EPDDSQPQDYE----LEYDEVEESEYEDGEKGSYTT--

Query:  --STTPHSRCSRGVSTDELEDHDPTRPLTQKLTEST----RFDSARLS----------SASPIATEHQQTRREDNA-VSK----RFTDDDELVTESAP--
          S TP    SRGVSTDE    +P       ++E++      +SA +S          SAS +  +         A VSK    R  + +++   + P  
Subjt:  --STTPHSRCSRGVSTDELEDHDPTRPLTQKLTEST----RFDSARLS----------SASPIATEHQQTRREDNA-VSK----RFTDDDELVTESAP--

Query:  SRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKLHVGGDEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAE--P
          S++L+Q I+CGS SV     G  P    K    +  S              + GD   ++ +SE P    L+ EEKEYFSGS+VE+  + +  A+   
Subjt:  SRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKLHVGGDEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAE--P

Query:  ALKKSNSYNEEK
        +LK+S+SYN ++
Subjt:  ALKKSNSYNEEK

Q9FJF5 Protein SOSEKI 51.3e-2632.97Show/hide
Query:  KKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAVGAEYLQQIQVSSSNRPPVQ
        +K+ V YYL RNGQL+ PHF+E+ L ++  L L+DV++RL  LRGK M +LYSWS KRSYK G+VW+DLSE+D ++P    G EY+ +          V 
Subjt:  KKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAVGAEYLQQIQVSSSNRPPVQ

Query:  ESHLPSKPRKQQLAPSPLREPDDSQPQD-----------------------YELEYDEVEESEYEDGEKGSYTTST-TPHSRCSRGVSTDELEDHDPTRP
        +S L S PR   L  S  R+P    P                          E +  +  ES  E  ++ +   ST T   R  R  + +E+E+      
Subjt:  ESHLPSKPRKQQLAPSPLREPDDSQPQD-----------------------YELEYDEVEESEYEDGEKGSYTTST-TPHSRCSRGVSTDELEDHDPTRP

Query:  LTQKLTESTRFD-SARLSSASPIATEHQQTRREDNAVSKRFTDDDELVTESAPS----RSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKG
           + TE +R + S   S +SP   E+         +    +  D    ES  S     S+VL+Q I+CG+ S  +     GP+   K+ G    +L   
Subjt:  LTQKLTESTRFD-SARLSSASPIATEHQQTRREDNAVSKRFTDDDELVTESAPS----RSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKG

Query:  VACKIAGKLHVGGDEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK
          C I       G+E +     E   FG +Q E+KEYFSGS++E+ +E      PALK+S+SYN ++
Subjt:  VACKIAGKLHVGGDEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK

Q9LX14 Protein SOSEKI 21.1e-6545.83Show/hide
Query:  EEEEEGESMAMKKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAV-----GAE
        EEE + +    +++QV YYL+RNG LE PHF+E+  P N+PLRLRDVM+RL +LRGK M + Y+WSCKRSY+ G+VWNDL+ENDV+YP+D       G+E
Subjt:  EEEEEGESMAMKKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAV-----GAE

Query:  YLQQIQVSSSNRP---PVQESHLPSKPRKQQLAPSPLREPDDSQPQDYELEYDEVEESEYE-DGEKGSYTTSTTPHSRCSRGVSTDELEDHDPTRPLTQK
           + Q    NRP    +QE+   S+  + +L P   R       + Y  E +E E+ EYE   EK SYT+STTP SRCSRGVST+ +E  +    LT+ 
Subjt:  YLQQIQVSSSNRP---PVQESHLPSKPRKQQLAPSPLREPDDSQPQDYELEYDEVEESEYE-DGEKGSYTTSTTPHSRCSRGVSTDELEDHDPTRPLTQK

Query:  LTE-STRFDSARLSSASPIATEHQQTRREDNAVSKRFTDDDELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACK-IAG
          +   R DS+ L+ ++P+     + RR +  VS R  D D +  E    R S+ LQ I+CG   ++     P  + P     ++ E+LRKGV CK I  
Subjt:  LTE-STRFDSARLSSASPIATEHQQTRREDNAVSKRFTDDDELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACK-IAG

Query:  KLHVGGDEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK
        K  V  + EMI +MSENPRFGN Q+EEKEYFSGSIVES+ ++R  AEP+L++SNS+NEE+
Subjt:  KLHVGGDEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK

Arabidopsis top hitse value%identityAlignment
AT2G28150.1 Domain of unknown function (DUF966)1.1e-2529.61Show/hide
Query:  MKKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADA-------------------
        +KK+Q+ YYLS+N QLE PHF+E+ + +   L LRDV++RL VLRG+ M ++YSWS KRSY+ G+VW+DLSE+D++ PA+                    
Subjt:  MKKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADA-------------------

Query:  -----VGAEYLQQIQVSSSNRPPVQESHLPSKPRK-----------QQLAPSPLR-------EPDDSQPQDYE----LEYDEVEESEYEDGEKGSYTT--
             +  + ++QI V   +   + +S   S                +L+P  LR        PD    ++       EY   +     D    +  T  
Subjt:  -----VGAEYLQQIQVSSSNRPPVQESHLPSKPRK-----------QQLAPSPLR-------EPDDSQPQDYE----LEYDEVEESEYEDGEKGSYTT--

Query:  --STTPHSRCSRGVSTDELEDHDPTRPLTQKLTEST----RFDSARLS----------SASPIATEHQQTRREDNA-VSK----RFTDDDELVTESAP--
          S TP    SRGVSTDE    +P       ++E++      +SA +S          SAS +  +         A VSK    R  + +++   + P  
Subjt:  --STTPHSRCSRGVSTDELEDHDPTRPLTQKLTEST----RFDSARLS----------SASPIATEHQQTRREDNA-VSK----RFTDDDELVTESAP--

Query:  SRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKLHVGGDEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAE--P
          S++L+Q I+CGS SV     G  P    K    +  S              + GD   ++ +SE P    L+ EEKEYFSGS+VE+  + +  A+   
Subjt:  SRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKLHVGGDEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAE--P

Query:  ALKKSNSYNEEK
        +LK+S+SYN ++
Subjt:  ALKKSNSYNEEK

AT3G46110.1 Domain of unknown function (DUF966)4.0e-2331.64Show/hide
Query:  VFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAVGAEY-LQQIQVSSSNRPPVQESH
        V YYLSRNG+L+ PHF+E+ L ++  L L+DV++RL  LRG  M  LYSWS KR+YK G+VW DLS+ D ++P    G EY L+  Q+   +      S 
Subjt:  VFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAVGAEY-LQQIQVSSSNRPPVQESH

Query:  LPSKPRKQQLAPSPLREPDDSQPQDYELEYDEVEESEYEDGEKGSYTTSTTPHSRCSRGVSTDELEDHDPTRPLTQKLTESTRFDSARLSSASPIATEHQ
                  A +  R    S    Y++   +  E   E   K S   ST    R  R    DE+  ++ T    +++T   + DS      SP   E  
Subjt:  LPSKPRKQQLAPSPLREPDDSQPQDYELEYDEVEESEYEDGEKGSYTTSTTPHSRCSRGVSTDELEDHDPTRPLTQKLTESTRFDSARLSSASPIATEHQ

Query:  QTRREDNAVSKRFTDDDELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKLHVGGDEEMINYMSENPRFGNLQS
           R D  +     D +   T      S+VL+Q I+CG+ S  K         P    G    +  +G            G+  +     E   FG ++ 
Subjt:  QTRREDNAVSKRFTDDDELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKLHVGGDEEMINYMSENPRFGNLQS

Query:  EEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK
        EEKEYFSGS+++   +   V  PALK+S+SYN ++
Subjt:  EEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK

AT3G46110.2 Domain of unknown function (DUF966)4.0e-2331.64Show/hide
Query:  VFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAVGAEY-LQQIQVSSSNRPPVQESH
        V YYLSRNG+L+ PHF+E+ L ++  L L+DV++RL  LRG  M  LYSWS KR+YK G+VW DLS+ D ++P    G EY L+  Q+   +      S 
Subjt:  VFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAVGAEY-LQQIQVSSSNRPPVQESH

Query:  LPSKPRKQQLAPSPLREPDDSQPQDYELEYDEVEESEYEDGEKGSYTTSTTPHSRCSRGVSTDELEDHDPTRPLTQKLTESTRFDSARLSSASPIATEHQ
                  A +  R    S    Y++   +  E   E   K S   ST    R  R    DE+  ++ T    +++T   + DS      SP   E  
Subjt:  LPSKPRKQQLAPSPLREPDDSQPQDYELEYDEVEESEYEDGEKGSYTTSTTPHSRCSRGVSTDELEDHDPTRPLTQKLTESTRFDSARLSSASPIATEHQ

Query:  QTRREDNAVSKRFTDDDELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKLHVGGDEEMINYMSENPRFGNLQS
           R D  +     D +   T      S+VL+Q I+CG+ S  K         P    G    +  +G            G+  +     E   FG ++ 
Subjt:  QTRREDNAVSKRFTDDDELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKLHVGGDEEMINYMSENPRFGNLQS

Query:  EEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK
        EEKEYFSGS+++   +   V  PALK+S+SYN ++
Subjt:  EEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK

AT5G10150.1 Domain of unknown function (DUF966)7.6e-6745.83Show/hide
Query:  EEEEEGESMAMKKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAV-----GAE
        EEE + +    +++QV YYL+RNG LE PHF+E+  P N+PLRLRDVM+RL +LRGK M + Y+WSCKRSY+ G+VWNDL+ENDV+YP+D       G+E
Subjt:  EEEEEGESMAMKKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAV-----GAE

Query:  YLQQIQVSSSNRP---PVQESHLPSKPRKQQLAPSPLREPDDSQPQDYELEYDEVEESEYE-DGEKGSYTTSTTPHSRCSRGVSTDELEDHDPTRPLTQK
           + Q    NRP    +QE+   S+  + +L P   R       + Y  E +E E+ EYE   EK SYT+STTP SRCSRGVST+ +E  +    LT+ 
Subjt:  YLQQIQVSSSNRP---PVQESHLPSKPRKQQLAPSPLREPDDSQPQDYELEYDEVEESEYE-DGEKGSYTTSTTPHSRCSRGVSTDELEDHDPTRPLTQK

Query:  LTE-STRFDSARLSSASPIATEHQQTRREDNAVSKRFTDDDELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACK-IAG
          +   R DS+ L+ ++P+     + RR +  VS R  D D +  E    R S+ LQ I+CG   ++     P  + P     ++ E+LRKGV CK I  
Subjt:  LTE-STRFDSARLSSASPIATEHQQTRREDNAVSKRFTDDDELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACK-IAG

Query:  KLHVGGDEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK
        K  V  + EMI +MSENPRFGN Q+EEKEYFSGSIVES+ ++R  AEP+L++SNS+NEE+
Subjt:  KLHVGGDEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK

AT5G59790.1 Domain of unknown function (DUF966)9.1e-2832.97Show/hide
Query:  KKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAVGAEYLQQIQVSSSNRPPVQ
        +K+ V YYL RNGQL+ PHF+E+ L ++  L L+DV++RL  LRGK M +LYSWS KRSYK G+VW+DLSE+D ++P    G EY+ +          V 
Subjt:  KKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAVGAEYLQQIQVSSSNRPPVQ

Query:  ESHLPSKPRKQQLAPSPLREPDDSQPQD-----------------------YELEYDEVEESEYEDGEKGSYTTST-TPHSRCSRGVSTDELEDHDPTRP
        +S L S PR   L  S  R+P    P                          E +  +  ES  E  ++ +   ST T   R  R  + +E+E+      
Subjt:  ESHLPSKPRKQQLAPSPLREPDDSQPQD-----------------------YELEYDEVEESEYEDGEKGSYTTST-TPHSRCSRGVSTDELEDHDPTRP

Query:  LTQKLTESTRFD-SARLSSASPIATEHQQTRREDNAVSKRFTDDDELVTESAPS----RSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKG
           + TE +R + S   S +SP   E+         +    +  D    ES  S     S+VL+Q I+CG+ S  +     GP+   K+ G    +L   
Subjt:  LTQKLTESTRFD-SARLSSASPIATEHQQTRREDNAVSKRFTDDDELVTESAPS----RSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKG

Query:  VACKIAGKLHVGGDEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK
          C I       G+E +     E   FG +Q E+KEYFSGS++E+ +E      PALK+S+SYN ++
Subjt:  VACKIAGKLHVGGDEEMINYMSENPRFGNLQSEEKEYFSGSIVESIREDRHVAEPALKKSNSYNEEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGAGAAGAAGAAGACGAAGAAGGAAGAAGAAGAAGAAGGTGAAAGCATGGCCATGAAAAAGCTGCAAGTTTTCTACTACCTCTCGAGAAATGGGCAACTGGAGCA
GCCCCATTTTGTTGAGCTCAATCTCCCCGCCAACCGCCCTCTTCGCCTCAGAGATGTGATGGATCGGCTGGCGGTTCTGAGAGGGAAGGCCATGCCCGCTCTCTATTCCT
GGTCCTGCAAGAGGAGCTACAAATGCGGATATGTGTGGAACGACTTGTCGGAAAACGACGTTGTGTATCCTGCTGATGCTGTGGGGGCTGAGTATTTGCAGCAAATTCAG
GTGAGCAGCAGCAATAGGCCACCGGTTCAGGAATCCCACCTCCCGAGTAAACCCCGGAAGCAGCAACTCGCTCCGAGTCCACTCAGAGAACCCGACGATTCTCAACCCCA
AGATTACGAGTTAGAATACGACGAAGTCGAAGAATCGGAATACGAAGACGGAGAGAAAGGCAGCTACACCACCTCCACAACCCCTCACTCCCGCTGCTCCCGCGGCGTCT
CCACCGACGAGCTCGAAGACCACGACCCAACTCGCCCTCTAACTCAGAAGCTCACCGAGTCAACTCGCTTCGACTCGGCTCGCCTCTCCTCCGCATCGCCGATCGCAACC
GAACATCAACAGACCCGCCGCGAAGACAACGCCGTTTCGAAGCGGTTCACCGACGACGACGAACTCGTAACCGAATCGGCTCCGAGTCGGAGCTCGGTCCTGCTCCAGTT
CATTGCCTGCGGGAGCTCGTCGGTCTCGAAGGGGAAAACCGGACCAGGTCCGATAGAACCGGCTAAGGAGATTGGGAGAAGAACGGAGAGCCTTCGGAAAGGAGTTGCGT
GTAAAATTGCTGGAAAACTACATGTCGGAGGAGATGAGGAGATGATAAACTACATGTCGGAGAACCCGAGGTTCGGAAATTTGCAGTCGGAGGAGAAGGAGTACTTCAGT
GGCTCGATCGTCGAGTCGATTAGAGAAGATCGACATGTCGCGGAACCGGCGCTGAAGAAATCGAACTCATATAACGAAGAAAAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTGAGAAGAAGAAGACGAAGAAGGAAGAAGAAGAAGAAGGTGAAAGCATGGCCATGAAAAAGCTGCAAGTTTTCTACTACCTCTCGAGAAATGGGCAACTGGAGCA
GCCCCATTTTGTTGAGCTCAATCTCCCCGCCAACCGCCCTCTTCGCCTCAGAGATGTGATGGATCGGCTGGCGGTTCTGAGAGGGAAGGCCATGCCCGCTCTCTATTCCT
GGTCCTGCAAGAGGAGCTACAAATGCGGATATGTGTGGAACGACTTGTCGGAAAACGACGTTGTGTATCCTGCTGATGCTGTGGGGGCTGAGTATTTGCAGCAAATTCAG
GTGAGCAGCAGCAATAGGCCACCGGTTCAGGAATCCCACCTCCCGAGTAAACCCCGGAAGCAGCAACTCGCTCCGAGTCCACTCAGAGAACCCGACGATTCTCAACCCCA
AGATTACGAGTTAGAATACGACGAAGTCGAAGAATCGGAATACGAAGACGGAGAGAAAGGCAGCTACACCACCTCCACAACCCCTCACTCCCGCTGCTCCCGCGGCGTCT
CCACCGACGAGCTCGAAGACCACGACCCAACTCGCCCTCTAACTCAGAAGCTCACCGAGTCAACTCGCTTCGACTCGGCTCGCCTCTCCTCCGCATCGCCGATCGCAACC
GAACATCAACAGACCCGCCGCGAAGACAACGCCGTTTCGAAGCGGTTCACCGACGACGACGAACTCGTAACCGAATCGGCTCCGAGTCGGAGCTCGGTCCTGCTCCAGTT
CATTGCCTGCGGGAGCTCGTCGGTCTCGAAGGGGAAAACCGGACCAGGTCCGATAGAACCGGCTAAGGAGATTGGGAGAAGAACGGAGAGCCTTCGGAAAGGAGTTGCGT
GTAAAATTGCTGGAAAACTACATGTCGGAGGAGATGAGGAGATGATAAACTACATGTCGGAGAACCCGAGGTTCGGAAATTTGCAGTCGGAGGAGAAGGAGTACTTCAGT
GGCTCGATCGTCGAGTCGATTAGAGAAGATCGACATGTCGCGGAACCGGCGCTGAAGAAATCGAACTCATATAACGAAGAAAAG
Protein sequenceShow/hide protein sequence
MAEKKKTKKEEEEEGESMAMKKLQVFYYLSRNGQLEQPHFVELNLPANRPLRLRDVMDRLAVLRGKAMPALYSWSCKRSYKCGYVWNDLSENDVVYPADAVGAEYLQQIQ
VSSSNRPPVQESHLPSKPRKQQLAPSPLREPDDSQPQDYELEYDEVEESEYEDGEKGSYTTSTTPHSRCSRGVSTDELEDHDPTRPLTQKLTESTRFDSARLSSASPIAT
EHQQTRREDNAVSKRFTDDDELVTESAPSRSSVLLQFIACGSSSVSKGKTGPGPIEPAKEIGRRTESLRKGVACKIAGKLHVGGDEEMINYMSENPRFGNLQSEEKEYFS
GSIVESIREDRHVAEPALKKSNSYNEEK