CuGenDBv2

Clc02G22840 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc02G22840
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: protein UPSTREAM OF FLC-like
Genome location: ClcChr02:34882814..34886996
RNA-Seq Expression: Clc02G22840
Synteny: Clc02G22840
Gene Ontology terms: GO:0050789 - regulation of biological process (biological process)
GO:0005886 - plasma membrane (cellular component)
InterPro domains: IPR010369 - Protein SOSEKI
IPR021182 - Protein SOSEKI, magnoliopsida
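
The genome location string above follows a `chromosome:start..end` pattern. As a minimal sketch, it can be split into its parts and the locus span computed as shown here. The helper name `parse_location` is hypothetical and not part of CuGenDB, and the span assumes that the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("ClcChr02:34882814..34886996")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under a 0-based, half-open convention the span would instead be `end - start`, so the convention should be confirmed against the database documentation before the number is used.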


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0037308.1 protein UPSTREAM OF FLC isoform X1 [Cucumis melo var. makuwa] | e-value: 1.1e-155 | % identity: 73.8
Query:  MKKYNQLSPERAKVWTEKSPKYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLI
        MKKYNQLSPERAKVWTEKSPKY QVRKVPVIYYLCRNRQLEHPHFMEVP+SS +GLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLC+DDLI
Subjt:  MKKYNQLSPERAKVWTEKSPKYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLI

Query:  LPAHGNEYVLKGSELFEESISSKDPFSSIGNVSIQPLKQLPDPASSQSQDDSSSSSSMNGKELKNSQEDDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSL
        LPAHGNEYVLKGSELFEES SSKD FSSIGNV+IQPLKQLPDPASSQSQDDSSSSSSM GKE+KNSQEDDLSLSVLRPGSS MSPDSGGGK+SWGGCLSL
Subjt:  LPAHGNEYVLKGSELFEESISSKDPFSSIGNVSIQPLKQLPDPASSQSQDDSSSSSSMNGKELKNSQEDDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSL

Query:  TEYKT--------------------------------------------LNNVASNQKKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKINS
        TEYK                                             LNN ASNQKKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKINS
Subjt:  TEYKT--------------------------------------------LNNVASNQKKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKINS

Query:  FRILEEEEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPRL----------------------------------------------KCWQG
        FRILEEEEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPR                                               K  Q 
Subjt:  FRILEEEEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPRL----------------------------------------------KCWQG

Query:  DGLTTLKRSSSYNADR
        DGL+TLKRSSSYNADR
Subjt:  DGLTTLKRSSSYNADR

KAE8646893.1 hypothetical protein Csa_021000 [Cucumis sativus] | e-value: 9.9e-157 | % identity: 73.92
Query:  MKKYNQLSPERAKVWTEKSPKYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLI
        MKKYNQLSPERAKVWTEKSPKYQQVRKVPVIYYLCRNRQLEHPHFMEVP+SS +GLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLC+DDLI
Subjt:  MKKYNQLSPERAKVWTEKSPKYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLI

Query:  LPAHGNEYVLKGSELFEESISSKDPFSSIGNVSIQPLKQLPDPASSQSQDDSSSSSSMNGKELKNSQEDDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSL
        LPAHGNEYVLKGSELFEES +SKD FSSIGNV+IQPLKQLPDPASSQSQDDSSSSSSM GKE+KNSQEDDLSLSVLRPGSS MSPDSGGGKSSWGGCLSL
Subjt:  LPAHGNEYVLKGSELFEESISSKDPFSSIGNVSIQPLKQLPDPASSQSQDDSSSSSSMNGKELKNSQEDDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSL

Query:  TEYK--------------------------------------------TLNNVASNQKKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKINS
        TEYK                                            TLNN ASN KKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKINS
Subjt:  TEYK--------------------------------------------TLNNVASNQKKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKINS

Query:  FRILEEEEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPRL----------------------------------------------KCWQG
        FRILEEEEIRMP NARLKATNVLMQLISCGSISVKDHSFGLIPSYKPR                                               K  Q 
Subjt:  FRILEEEEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPRL----------------------------------------------KCWQG

Query:  DGLTTLKRSSSYNADRFF
        DGLTTLKRSSSYNADR F
Subjt:  DGLTTLKRSSSYNADRFF

XP_011657026.1 protein UPSTREAM OF FLC isoform X1 [Cucumis sativus] | e-value: 9.9e-157 | % identity: 73.92
Query:  MKKYNQLSPERAKVWTEKSPKYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLI
        MKKYNQLSPERAKVWTEKSPKYQQVRKVPVIYYLCRNRQLEHPHFMEVP+SS +GLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLC+DDLI
Subjt:  MKKYNQLSPERAKVWTEKSPKYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLI

Query:  LPAHGNEYVLKGSELFEESISSKDPFSSIGNVSIQPLKQLPDPASSQSQDDSSSSSSMNGKELKNSQEDDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSL
        LPAHGNEYVLKGSELFEES +SKD FSSIGNV+IQPLKQLPDPASSQSQDDSSSSSSM GKE+KNSQEDDLSLSVLRPGSS MSPDSGGGKSSWGGCLSL
Subjt:  LPAHGNEYVLKGSELFEESISSKDPFSSIGNVSIQPLKQLPDPASSQSQDDSSSSSSMNGKELKNSQEDDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSL

Query:  TEYK--------------------------------------------TLNNVASNQKKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKINS
        TEYK                                            TLNN ASN KKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKINS
Subjt:  TEYK--------------------------------------------TLNNVASNQKKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKINS

Query:  FRILEEEEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPRL----------------------------------------------KCWQG
        FRILEEEEIRMP NARLKATNVLMQLISCGSISVKDHSFGLIPSYKPR                                               K  Q 
Subjt:  FRILEEEEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPRL----------------------------------------------KCWQG

Query:  DGLTTLKRSSSYNADRFF
        DGLTTLKRSSSYNADR F
Subjt:  DGLTTLKRSSSYNADRFF

XP_022140411.1 protein UPSTREAM OF FLC isoform X1 [Momordica charantia] | e-value: 7.6e-157 | % identity: 73.21
Query:  MKKYNQLSPERAKVWTEKSPKYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLI
        MKKYNQLSPERAKVWTEKSPKYQQ RKVPVIYYLCRNRQLEHPHFMEVP+SSP+GLYLRDVINRLNVLRGRGMAT YSWSCKRSYKNGFVWHDLC+DDLI
Subjt:  MKKYNQLSPERAKVWTEKSPKYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLI

Query:  LPAHGNEYVLKGSELFEESISSKDPFSSIGNVSIQPLKQLPDPASSQSQDDSSSSSSMNGKELKNSQEDDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSL
        LPAHGNEYVLKGSELFEES S KDPFSSIGNV+IQPLKQLP+P SSQSQDDSSSSSS+NGKE+KNSQ+DDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSL
Subjt:  LPAHGNEYVLKGSELFEESISSKDPFSSIGNVSIQPLKQLPDPASSQSQDDSSSSSSMNGKELKNSQEDDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSL

Query:  TEYKT--------------------------------------------LNNVASNQKKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKINS
        TEYK                                             LNN ASNQK+NYDAPQDSVSPP LSSSASSSGGKTETLESLIRADASKINS
Subjt:  TEYKT--------------------------------------------LNNVASNQKKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKINS

Query:  FRILEEEEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPRL----------------------------------------------KCWQG
        FRILEEEEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPR                                               K  QG
Subjt:  FRILEEEEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPRL----------------------------------------------KCWQG

Query:  DGLTTLKRSSSYNADRFF
        DGLTTLKRSSSYNADR +
Subjt:  DGLTTLKRSSSYNADRFF

XP_038901817.1 protein UPSTREAM OF FLC isoform X1 [Benincasa hispida] | e-value: 4.4e-157 | % identity: 73.8
Query:  MKKYNQLSPERAKVWTEKSPKYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLI
        MKKYNQLSPERAKVWTEKSPKYQQVRKVPV+YYLCRNRQLEHPHFMEVP+SSP+GLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLC+DDLI
Subjt:  MKKYNQLSPERAKVWTEKSPKYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLI

Query:  LPAHGNEYVLKGSELFEESISSKDPFSSIGNVSIQPLKQLPDPASSQSQDDSSSSSSMNGKELKNSQEDDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSL
        LPAHGNEYVLKGSELFEES SSKDPF+SIGN++IQPLKQLPDPASSQSQDDSSSSSSMNGKE+K S EDDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSL
Subjt:  LPAHGNEYVLKGSELFEESISSKDPFSSIGNVSIQPLKQLPDPASSQSQDDSSSSSSMNGKELKNSQEDDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSL

Query:  TEYK--------------------------------------------TLNNVASNQKKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKINS
        TEYK                                            T +NV  NQKKNYDAPQDSVSPP LSSSASSSGGKTETLESLIRADASKINS
Subjt:  TEYK--------------------------------------------TLNNVASNQKKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKINS

Query:  FRILEEEEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPRL----------------------------------------------KCWQG
        FRILEEEEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPR                                               K  QG
Subjt:  FRILEEEEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPRL----------------------------------------------KCWQG

Query:  DGLTTLKRSSSYNADR
        DGLTTLKRSSSYNADR
Subjt:  DGLTTLKRSSSYNADR

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KBI2 Uncharacterized protein | e-value: 4.8e-157 | % identity: 73.92
Query:  MKKYNQLSPERAKVWTEKSPKYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLI
        MKKYNQLSPERAKVWTEKSPKYQQVRKVPVIYYLCRNRQLEHPHFMEVP+SS +GLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLC+DDLI
Subjt:  MKKYNQLSPERAKVWTEKSPKYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLI

Query:  LPAHGNEYVLKGSELFEESISSKDPFSSIGNVSIQPLKQLPDPASSQSQDDSSSSSSMNGKELKNSQEDDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSL
        LPAHGNEYVLKGSELFEES +SKD FSSIGNV+IQPLKQLPDPASSQSQDDSSSSSSM GKE+KNSQEDDLSLSVLRPGSS MSPDSGGGKSSWGGCLSL
Subjt:  LPAHGNEYVLKGSELFEESISSKDPFSSIGNVSIQPLKQLPDPASSQSQDDSSSSSSMNGKELKNSQEDDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSL

Query:  TEYK--------------------------------------------TLNNVASNQKKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKINS
        TEYK                                            TLNN ASN KKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKINS
Subjt:  TEYK--------------------------------------------TLNNVASNQKKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKINS

Query:  FRILEEEEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPRL----------------------------------------------KCWQG
        FRILEEEEIRMP NARLKATNVLMQLISCGSISVKDHSFGLIPSYKPR                                               K  Q 
Subjt:  FRILEEEEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPRL----------------------------------------------KCWQG

Query:  DGLTTLKRSSSYNADRFF
        DGLTTLKRSSSYNADR F
Subjt:  DGLTTLKRSSSYNADRFF

A0A1S3CNT2 protein UPSTREAM OF FLC isoform X1 | e-value: 5.3e-156 | % identity: 73.8
Query:  MKKYNQLSPERAKVWTEKSPKYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLI
        MKKYNQLSPERAKVWTEKSPKY QVRKVPVIYYLCRNRQLEHPHFMEVP+SS +GLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLC+DDLI
Subjt:  MKKYNQLSPERAKVWTEKSPKYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLI

Query:  LPAHGNEYVLKGSELFEESISSKDPFSSIGNVSIQPLKQLPDPASSQSQDDSSSSSSMNGKELKNSQEDDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSL
        LPAHGNEYVLKGSELFEES SSKD FSSIGNV+IQPLKQLPDPASSQSQDDSSSSSSM GKE+KNSQEDDLSLSVLRPGSS MSPDSGGGK+SWGGCLSL
Subjt:  LPAHGNEYVLKGSELFEESISSKDPFSSIGNVSIQPLKQLPDPASSQSQDDSSSSSSMNGKELKNSQEDDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSL

Query:  TEYKT--------------------------------------------LNNVASNQKKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKINS
        TEYK                                             LNN ASNQKKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKINS
Subjt:  TEYKT--------------------------------------------LNNVASNQKKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKINS

Query:  FRILEEEEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPRL----------------------------------------------KCWQG
        FRILEEEEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPR                                               K  Q 
Subjt:  FRILEEEEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPRL----------------------------------------------KCWQG

Query:  DGLTTLKRSSSYNADR
        DGL+TLKRSSSYNADR
Subjt:  DGLTTLKRSSSYNADR

A0A5D3DKN9 Protein UPSTREAM OF FLC isoform X1 | e-value: 5.3e-156 | % identity: 73.8
Query:  MKKYNQLSPERAKVWTEKSPKYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLI
        MKKYNQLSPERAKVWTEKSPKY QVRKVPVIYYLCRNRQLEHPHFMEVP+SS +GLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLC+DDLI
Subjt:  MKKYNQLSPERAKVWTEKSPKYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLI

Query:  LPAHGNEYVLKGSELFEESISSKDPFSSIGNVSIQPLKQLPDPASSQSQDDSSSSSSMNGKELKNSQEDDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSL
        LPAHGNEYVLKGSELFEES SSKD FSSIGNV+IQPLKQLPDPASSQSQDDSSSSSSM GKE+KNSQEDDLSLSVLRPGSS MSPDSGGGK+SWGGCLSL
Subjt:  LPAHGNEYVLKGSELFEESISSKDPFSSIGNVSIQPLKQLPDPASSQSQDDSSSSSSMNGKELKNSQEDDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSL

Query:  TEYKT--------------------------------------------LNNVASNQKKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKINS
        TEYK                                             LNN ASNQKKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKINS
Subjt:  TEYKT--------------------------------------------LNNVASNQKKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKINS

Query:  FRILEEEEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPRL----------------------------------------------KCWQG
        FRILEEEEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPR                                               K  Q 
Subjt:  FRILEEEEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPRL----------------------------------------------KCWQG

Query:  DGLTTLKRSSSYNADR
        DGL+TLKRSSSYNADR
Subjt:  DGLTTLKRSSSYNADR

A0A6J1CG10 protein UPSTREAM OF FLC isoform X1 | e-value: 3.7e-157 | % identity: 73.21
Query:  MKKYNQLSPERAKVWTEKSPKYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLI
        MKKYNQLSPERAKVWTEKSPKYQQ RKVPVIYYLCRNRQLEHPHFMEVP+SSP+GLYLRDVINRLNVLRGRGMAT YSWSCKRSYKNGFVWHDLC+DDLI
Subjt:  MKKYNQLSPERAKVWTEKSPKYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLI

Query:  LPAHGNEYVLKGSELFEESISSKDPFSSIGNVSIQPLKQLPDPASSQSQDDSSSSSSMNGKELKNSQEDDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSL
        LPAHGNEYVLKGSELFEES S KDPFSSIGNV+IQPLKQLP+P SSQSQDDSSSSSS+NGKE+KNSQ+DDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSL
Subjt:  LPAHGNEYVLKGSELFEESISSKDPFSSIGNVSIQPLKQLPDPASSQSQDDSSSSSSMNGKELKNSQEDDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSL

Query:  TEYKT--------------------------------------------LNNVASNQKKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKINS
        TEYK                                             LNN ASNQK+NYDAPQDSVSPP LSSSASSSGGKTETLESLIRADASKINS
Subjt:  TEYKT--------------------------------------------LNNVASNQKKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKINS

Query:  FRILEEEEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPRL----------------------------------------------KCWQG
        FRILEEEEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPR                                               K  QG
Subjt:  FRILEEEEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPRL----------------------------------------------KCWQG

Query:  DGLTTLKRSSSYNADRFF
        DGLTTLKRSSSYNADR +
Subjt:  DGLTTLKRSSSYNADRFF

A0A6J1JAP5 protein UPSTREAM OF FLC-like | e-value: 3.4e-155 | % identity: 73.9
Query:  MKKYNQLSPERAKVWTEKSPKYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLI
        MKKYNQLSPERAKVWTEKSPKYQQ RKVPVIYYLCRNRQLEHPHFMEVP+SSP+GL+LRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLC+DDLI
Subjt:  MKKYNQLSPERAKVWTEKSPKYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLI

Query:  LPAHGNEYVLKGSELFEESISSKDPFSSIGNVSIQPLKQLPDPASSQSQDDSSSSSSMNGKELKNSQEDDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSL
        LPAHGNEYVLKGSELFEE  SSKDPFSSIGNVSIQPLKQLPDPASSQSQDDSSSSSSMNGKE+KNSQ DDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSL
Subjt:  LPAHGNEYVLKGSELFEESISSKDPFSSIGNVSIQPLKQLPDPASSQSQDDSSSSSSMNGKELKNSQEDDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSL

Query:  TEYKTL--------------------------------------NNVASNQKKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKINSFRILEE
        TEYK                                        +N ASNQKKNY+AP++SVSPP LSSSASSSG KTETLESLIRADASK+NSFRI+EE
Subjt:  TEYKTL--------------------------------------NNVASNQKKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKINSFRILEE

Query:  EEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPRL----------------------------------------------KCWQGDGLTTL
        EEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPR                                               K  QGDGLTTL
Subjt:  EEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPRL----------------------------------------------KCWQGDGLTTL

Query:  KRSSSYNADR
        KRSSSYNADR
Subjt:  KRSSSYNADR

SwissProt top hits (e-value, % identity, alignment)
A0A2R6X6S3 Protein SOSEKI | e-value: 1.0e-23 | % identity: 43.14
Query:  KVPVIYYLCRNRQLEHPHFMEVPVSS-PDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLILPAHGNEYVLKGSELFEESISSKDP
        KV V+YYL R  QL+ PH ++VPVS+  +GLYLRDV  RL  +RG+GM   +SWSCKR+YKN F+W DL  DD ILP    E VLKGSEL+      K  
Subjt:  KVPVIYYLCRNRQLEHPHFMEVPVSS-PDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLILPAHGNEYVLKGSELFEESISSKDP

Query:  FSSIGNVSIQPLKQLPD---PASSQSQDDSSSSSSMNGKELKNSQEDDLSLSV
        F   G    +   QLP+     +S+  D  +   S++ +  K  ++ DL+ ++
Subjt:  FSSIGNVSIQPLKQLPD---PASSQSQDDSSSSSSMNGKELKNSQEDDLSLSV

Q8GY65 Protein SOSEKI 4 | e-value: 3.7e-37 | % identity: 40.59
Query:  KYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLILPAHGNEYVLKGSELFEESI
        K  + R VPV+YYL RN +L+HPHF+EVP+SS +GLYL+DVINRLN LRG GMA LYSWS KR+YKNGFVW+DL  +D I P HG EYVLKGS++ +   
Subjt:  KYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLILPAHGNEYVLKGSELFEESI

Query:  SSKDPFSSIGNVSIQPLKQLPDPASSQSQDDSSSSSSMNGKELKNSQEDDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSLTEYKTLNNVASNQKKNYDAP
        +S       GN S           +   + + S SS  + K  K S+ +  +   L   +S  + D    KS             +N V    ++   +P
Subjt:  SSKDPFSSIGNVSIQPLKQLPDPASSQSQDDSSSSSSMNGKELKNSQEDDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSLTEYKTLNNVASNQKKNYDAP

Query:  QDSVSPPTLSSSASSSGGKTETLESLIRADASKINSFRILEEEEIRMPTNARLKATNVLMQLISCGSISVK
          S S P             ETLESL+RAD   I    +L+E++    T  +++ + VLMQLISCG++S K
Subjt:  QDSVSPPTLSSSASSSGGKTETLESLIRADASKINSFRILEEEEIRMPTNARLKATNVLMQLISCGSISVK

Q8GYT8 Protein SOSEKI 3 | e-value: 2.3e-92 | % identity: 55.8
Query:  MKKYN-QLSPERAKVWTEKSPKY-QQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDD
        MKKY+ ++SPERAKVWTEKSPKY Q+++KV ++YYL +NRQLEHPHFMEV +SSP+GLYLRDVI RLNVLRGRGMA++YSWS KRSY+NGFVWHDL +DD
Subjt:  MKKYN-QLSPERAKVWTEKSPKY-QQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDD

Query:  LILPAHGNEYVLKGSELFEESISSKDPFSSIGNVSIQPLKQL-PDPASSQSQDDSSSSSSMN---GKELKNSQEDDLSLSVLRP-GSSGMSPDSGGGKSS
        LILPA+GNEYVLKGSELF+E  S+ D FS I N++ Q +KQ+  +P SS+S DDSSSSSSMN   G    + ++D+LS   LR   SSG+SPDS   K+S
Subjt:  LILPAHGNEYVLKGSELFEESISSKDPFSSIGNVSIQPLKQL-PDPASSQSQDDSSSSSSMN---GKELKNSQEDDLSLSVLRP-GSSGMSPDSGGGKSS

Query:  WGGCLSLTEYKTL---------------------------------------------NNV------ASNQKKNYDAPQDSVSPPTLSSSASSSGGKTET
           CL+  EYK                                               NN+      A  ++++ +  ++SVSPP  S+SASS GGKT+T
Subjt:  WGGCLSLTEYKTL---------------------------------------------NNV------ASNQKKNYDAPQDSVSPPTLSSSASSSGGKTET

Query:  LESLIRADASKINSFRILEEEEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPR
        LESLIRAD SK+NSFRILE+E++RMP   RL+A+N+LMQLISCGSISVKD++FGL+P+YKP+
Subjt:  LESLIRADASKINSFRILEEEEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPR

Q9FJF5 Protein SOSEKI 5 | e-value: 8.7e-47 | % identity: 41.58
Query:  KKYNQLSPERAKVWTEKSPKYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLIL
        K     SP+R ++W+E   K    RKVPV+YYLCRN QL+HPHF+EV +SS DGLYL+DVINRLN LRG+GMA+LYSWS KRSYKNGFVWHDL +DD I 
Subjt:  KKYNQLSPERAKVWTEKSPKYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLIL

Query:  PAHGNEYVLKGSELFEESISSKDPFSSIGNVSIQPLKQL-PDPAS--------SQSQDDSSSSSSMNG-KELKNSQEDDLSLSVLRPGSSGMSPDSGGGK
        P  G EYVLKGSE+ +  + S +P S +   S +  + L PD  S        ++ ++ S SS  ++  K  K ++    S   L   +S  + D    +
Subjt:  PAHGNEYVLKGSELFEESISSKDPFSSIGNVSIQPLKQL-PDPAS--------SQSQDDSSSSSSMNG-KELKNSQEDDLSLSVLRPGSSGMSPDSGGGK

Query:  SSWGGCLSLTEYKTLNNVASNQKKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKI--NSFRILEEEEIRMPTNARLKATNVLMQLISCGSIS
               +  E + + + AS + ++ +  +D +SPP   SS        ETLE+LI+AD   I   S    +   +   ++ R++A+ VLMQLISCG++S
Subjt:  SSWGGCLSLTEYKTLNNVASNQKKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKI--NSFRILEEEEIRMPTNARLKATNVLMQLISCGSIS

Query:  VKD
         K+
Subjt:  VKD

Q9LX14 Protein SOSEKI 2 | e-value: 7.6e-27 | % identity: 60
Query:  RKVPVIYYLCRNRQLEHPHFMEV--PVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLILPAHGNEYVLKGSELFEE
        R+V V+YYL RN  LEHPHF+EV  PV+ P  L LRDV+NRL +LRG+ M + Y+WSCKRSY+NGFVW+DL ++D+I P+   EYVLKGSE+ ++
Subjt:  RKVPVIYYLCRNRQLEHPHFMEV--PVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLILPAHGNEYVLKGSELFEE

Arabidopsis top hits (e-value, % identity, alignment)
AT2G28150.1 Domain of unknown function (DUF966) | e-value: 1.7e-93 | % identity: 55.8
Query:  MKKYN-QLSPERAKVWTEKSPKY-QQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDD
        MKKY+ ++SPERAKVWTEKSPKY Q+++KV ++YYL +NRQLEHPHFMEV +SSP+GLYLRDVI RLNVLRGRGMA++YSWS KRSY+NGFVWHDL +DD
Subjt:  MKKYN-QLSPERAKVWTEKSPKY-QQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDD

Query:  LILPAHGNEYVLKGSELFEESISSKDPFSSIGNVSIQPLKQL-PDPASSQSQDDSSSSSSMN---GKELKNSQEDDLSLSVLRP-GSSGMSPDSGGGKSS
        LILPA+GNEYVLKGSELF+E  S+ D FS I N++ Q +KQ+  +P SS+S DDSSSSSSMN   G    + ++D+LS   LR   SSG+SPDS   K+S
Subjt:  LILPAHGNEYVLKGSELFEESISSKDPFSSIGNVSIQPLKQL-PDPASSQSQDDSSSSSSMN---GKELKNSQEDDLSLSVLRP-GSSGMSPDSGGGKSS

Query:  WGGCLSLTEYKTL---------------------------------------------NNV------ASNQKKNYDAPQDSVSPPTLSSSASSSGGKTET
           CL+  EYK                                               NN+      A  ++++ +  ++SVSPP  S+SASS GGKT+T
Subjt:  WGGCLSLTEYKTL---------------------------------------------NNV------ASNQKKNYDAPQDSVSPPTLSSSASSSGGKTET

Query:  LESLIRADASKINSFRILEEEEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPR
        LESLIRAD SK+NSFRILE+E++RMP   RL+A+N+LMQLISCGSISVKD++FGL+P+YKP+
Subjt:  LESLIRADASKINSFRILEEEEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPR

AT3G46110.1 Domain of unknown function (DUF966) | e-value: 2.6e-38 | % identity: 40.59
Query:  KYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLILPAHGNEYVLKGSELFEESI
        K  + R VPV+YYL RN +L+HPHF+EVP+SS +GLYL+DVINRLN LRG GMA LYSWS KR+YKNGFVW+DL  +D I P HG EYVLKGS++ +   
Subjt:  KYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLILPAHGNEYVLKGSELFEESI

Query:  SSKDPFSSIGNVSIQPLKQLPDPASSQSQDDSSSSSSMNGKELKNSQEDDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSLTEYKTLNNVASNQKKNYDAP
        +S       GN S           +   + + S SS  + K  K S+ +  +   L   +S  + D    KS             +N V    ++   +P
Subjt:  SSKDPFSSIGNVSIQPLKQLPDPASSQSQDDSSSSSSMNGKELKNSQEDDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSLTEYKTLNNVASNQKKNYDAP

Query:  QDSVSPPTLSSSASSSGGKTETLESLIRADASKINSFRILEEEEIRMPTNARLKATNVLMQLISCGSISVK
          S S P             ETLESL+RAD   I    +L+E++    T  +++ + VLMQLISCG++S K
Subjt:  QDSVSPPTLSSSASSSGGKTETLESLIRADASKINSFRILEEEEIRMPTNARLKATNVLMQLISCGSISVK

AT3G46110.2 Domain of unknown function (DUF966) | e-value: 2.6e-38 | % identity: 40.59
Query:  KYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLILPAHGNEYVLKGSELFEESI
        K  + R VPV+YYL RN +L+HPHF+EVP+SS +GLYL+DVINRLN LRG GMA LYSWS KR+YKNGFVW+DL  +D I P HG EYVLKGS++ +   
Subjt:  KYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLILPAHGNEYVLKGSELFEESI

Query:  SSKDPFSSIGNVSIQPLKQLPDPASSQSQDDSSSSSSMNGKELKNSQEDDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSLTEYKTLNNVASNQKKNYDAP
        +S       GN S           +   + + S SS  + K  K S+ +  +   L   +S  + D    KS             +N V    ++   +P
Subjt:  SSKDPFSSIGNVSIQPLKQLPDPASSQSQDDSSSSSSMNGKELKNSQEDDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSLTEYKTLNNVASNQKKNYDAP

Query:  QDSVSPPTLSSSASSSGGKTETLESLIRADASKINSFRILEEEEIRMPTNARLKATNVLMQLISCGSISVK
          S S P             ETLESL+RAD   I    +L+E++    T  +++ + VLMQLISCG++S K
Subjt:  QDSVSPPTLSSSASSSGGKTETLESLIRADASKINSFRILEEEEIRMPTNARLKATNVLMQLISCGSISVK

AT5G10150.1 Domain of unknown function (DUF966) | e-value: 5.4e-28 | % identity: 60
Query:  RKVPVIYYLCRNRQLEHPHFMEV--PVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLILPAHGNEYVLKGSELFEE
        R+V V+YYL RN  LEHPHF+EV  PV+ P  L LRDV+NRL +LRG+ M + Y+WSCKRSY+NGFVW+DL ++D+I P+   EYVLKGSE+ ++
Subjt:  RKVPVIYYLCRNRQLEHPHFMEV--PVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLILPAHGNEYVLKGSELFEE

AT5G59790.1 Domain of unknown function (DUF966) | e-value: 6.2e-48 | % identity: 41.58
Query:  KKYNQLSPERAKVWTEKSPKYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLIL
        K     SP+R ++W+E   K    RKVPV+YYLCRN QL+HPHF+EV +SS DGLYL+DVINRLN LRG+GMA+LYSWS KRSYKNGFVWHDL +DD I 
Subjt:  KKYNQLSPERAKVWTEKSPKYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLIL

Query:  PAHGNEYVLKGSELFEESISSKDPFSSIGNVSIQPLKQL-PDPAS--------SQSQDDSSSSSSMNG-KELKNSQEDDLSLSVLRPGSSGMSPDSGGGK
        P  G EYVLKGSE+ +  + S +P S +   S +  + L PD  S        ++ ++ S SS  ++  K  K ++    S   L   +S  + D    +
Subjt:  PAHGNEYVLKGSELFEESISSKDPFSSIGNVSIQPLKQL-PDPAS--------SQSQDDSSSSSSMNG-KELKNSQEDDLSLSVLRPGSSGMSPDSGGGK

Query:  SSWGGCLSLTEYKTLNNVASNQKKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKI--NSFRILEEEEIRMPTNARLKATNVLMQLISCGSIS
               +  E + + + AS + ++ +  +D +SPP   SS        ETLE+LI+AD   I   S    +   +   ++ R++A+ VLMQLISCG++S
Subjt:  SSWGGCLSLTEYKTLNNVASNQKKNYDAPQDSVSPPTLSSSASSSGGKTETLESLIRADASKI--NSFRILEEEEIRMPTNARLKATNVLMQLISCGSIS

Query:  VKD
         K+
Subjt:  VKD


Sequences
CDS sequence
ATGAAGAAGTATAACCAACTGAGTCCTGAGAGGGCTAAGGTGTGGACTGAGAAATCGCCCAAATATCAACAGGTTCGGAAGGTGCCTGTGATTTACTATCTATGCAGAAA
CAGGCAGCTGGAGCATCCTCATTTCATGGAAGTTCCAGTCTCATCCCCCGATGGACTGTACTTGAGAGATGTGATTAACAGGCTTAACGTTCTCAGAGGAAGAGGCATGG
CTACCTTATACTCTTGGTCTTGTAAGAGAAGCTACAAGAATGGATTTGTTTGGCATGATCTCTGCCAAGATGACCTAATTCTTCCGGCTCATGGAAATGAGTATGTTCTC
AAAGGCTCCGAGCTGTTTGAAGAGTCCATTTCTAGTAAAGACCCTTTCAGCTCCATTGGCAATGTGAGTATTCAGCCACTGAAGCAATTACCCGATCCAGCTTCTTCCCA
GAGTCAGGATGATTCTTCATCATCTTCAAGCATGAACGGAAAAGAATTGAAAAATTCTCAAGAGGATGATCTCTCCTTGTCTGTCCTTCGACCTGGTTCATCAGGCATGT
CTCCAGATTCTGGAGGTGGAAAGAGTTCATGGGGTGGTTGTCTTAGCTTGACAGAGTACAAGACTCTTAACAATGTGGCTTCAAATCAAAAGAAAAACTATGATGCCCCT
CAGGATTCAGTTTCTCCACCCACATTGTCCTCTAGTGCTTCCTCCTCAGGTGGGAAGACTGAAACATTAGAGTCTCTTATCAGAGCCGATGCCAGTAAAATCAACAGTTT
TAGGATTCTTGAAGAAGAAGAAATTCGGATGCCAACCAATGCTAGACTCAAGGCTACAAATGTGTTAATGCAACTTATCTCGTGTGGATCAATATCAGTCAAGGATCATA
GTTTTGGACTTATTCCATCATACAAACCAAGACTAAAATGCTGGCAAGGGGATGGGCTGACAACTCTAAAACGTTCATCTTCATACAATGCTGACAGGTTCTTTTTATGT
TCGTATACTTTAGTGAAGTTAAACGTTCTCAAATCCACAAATTTGAAGCAATTTATCATTTTTATTTGTCTTCAATTTTCCTCATATTCTTTGGGCCGGTAA
mRNA sequence
ATGAAGAAGTATAACCAACTGAGTCCTGAGAGGGCTAAGGTGTGGACTGAGAAATCGCCCAAATATCAACAGGTTCGGAAGGTGCCTGTGATTTACTATCTATGCAGAAA
CAGGCAGCTGGAGCATCCTCATTTCATGGAAGTTCCAGTCTCATCCCCCGATGGACTGTACTTGAGAGATGTGATTAACAGGCTTAACGTTCTCAGAGGAAGAGGCATGG
CTACCTTATACTCTTGGTCTTGTAAGAGAAGCTACAAGAATGGATTTGTTTGGCATGATCTCTGCCAAGATGACCTAATTCTTCCGGCTCATGGAAATGAGTATGTTCTC
AAAGGCTCCGAGCTGTTTGAAGAGTCCATTTCTAGTAAAGACCCTTTCAGCTCCATTGGCAATGTGAGTATTCAGCCACTGAAGCAATTACCCGATCCAGCTTCTTCCCA
GAGTCAGGATGATTCTTCATCATCTTCAAGCATGAACGGAAAAGAATTGAAAAATTCTCAAGAGGATGATCTCTCCTTGTCTGTCCTTCGACCTGGTTCATCAGGCATGT
CTCCAGATTCTGGAGGTGGAAAGAGTTCATGGGGTGGTTGTCTTAGCTTGACAGAGTACAAGACTCTTAACAATGTGGCTTCAAATCAAAAGAAAAACTATGATGCCCCT
CAGGATTCAGTTTCTCCACCCACATTGTCCTCTAGTGCTTCCTCCTCAGGTGGGAAGACTGAAACATTAGAGTCTCTTATCAGAGCCGATGCCAGTAAAATCAACAGTTT
TAGGATTCTTGAAGAAGAAGAAATTCGGATGCCAACCAATGCTAGACTCAAGGCTACAAATGTGTTAATGCAACTTATCTCGTGTGGATCAATATCAGTCAAGGATCATA
GTTTTGGACTTATTCCATCATACAAACCAAGACTAAAATGCTGGCAAGGGGATGGGCTGACAACTCTAAAACGTTCATCTTCATACAATGCTGACAGGTTCTTTTTATGT
TCGTATACTTTAGTGAAGTTAAACGTTCTCAAATCCACAAATTTGAAGCAATTTATCATTTTTATTTGTCTTCAATTTTCCTCATATTCTTTGGGCCGGTAATGACTTAT
GGTGATAACTATGATCAAGCTTGCTCGATTAAGGCAGTGACTGACATAATTGAAGAAAATGATTCGTTACCATAATTTTAGTGCACAGGGCACCTAATTTTTCAGTGTAC
ATCACCTTTAACATGCTGAAAACTAGTCCTTGTTTTGATTGATGGAACCCTTGATCCCTGCCTGTGCTATTTCCTGTCGGTTTATTTTCTTTGTCCTTTTTGCATAGGAG
GTCAAGAAGTTATAAAGTTTCTTTTCTACATTTATGTGCTGTATCAAGCTTAAAGCTTGATGAGAGATTATGCAAATTTGGCTGGCCAAGATTGGTGAACCAGCATTTCT
GTCGTGCTCTTTTCATTTTGAACTTGTGCATAATAATAATTATAGGGGGAAAGTTCTCGTGTCTTCTTACTGTTACTTGTTGTACCTTGTTTCTCATCCTTTGAAAATTA
AATGGATGTACATAAAAAGAACTTACCGTTATTGCTTGCCACAATAGAATGTGAATCTTTCTTATCTTTCAAATACATGTTTTGTGTGCATACGTATGTATGGCAGCAGC
TAAATTTGGTTTCAATCATTATATGTATGTATGCCGGTATTCGGTTTTTACTTATTTTGTTTTATACAATACACATTGTTGTGTGATTAGCAGAGAGCTTTTGCGATATG
TGTGTGGTCATTTCTTCATAATGCTTGCAGTTTCAGTTAGTGTATAACTTACAATAAGCTACTACTATAGGACATGTAAGCAATTGAGTTCAACTGAAGACAAGGATCAG
TCGACCTCTAGCCGCTCAAAGTGCATTCCACGAGCTATTAAAGCTTCACTGAGCAAGCAGCCACGAAATGAGTCCATGAAATCTCCTACTTCTGATAGACCAAGAACTTC
TTCTGATGGTCTTGATAGCTCTCAGAACGTAAGCCCCACTACCTCCAATGATAGCAGTAAAAGGATTACAGAACCTTGCTCCGGAAGGAAACAATCTAAGAGGCTAGATT
CCTTTCGAGAAGAGGAAGAGGACGTGATTAAGATTGAAGAAAGGCTTGCTTCAGGAGCTCGGGTTATAATCTGGTCGAAATCGACTTGCAATGGCACAGATGTTGGCAGT
ACTTAGAGCACACTATGAACAACCAATGGATGCCTCATCTTCTGGCCAAGTTAAAAAAGCTCGTTCCCGCCAAGATCCTACTTTTGATCAGATCTACATCGAATTCAATT
GATGCTGATAATTTTTTGGTGGGCCTTGGTGATTCTGTATATCTCTCGAGTGTAAAAAGAAGAAACCACCGGTCTTCAGCTTGAGGGGCCCCTCGGACCAACTCAGAGAT
CTTTTGGGTTTGTTGGCTTTTGCTTGTCAGGTCAGCTAATATTAAATGATAAAATTGTAACTTTCCTTTAAACTACAGTTACCATCACTTCTTTTTATGATCCCTTCTTT
AGCTTTAGAATAGTTCCTGCAGGCAGAGTCTGTAACAAAAAAAGCTAGGGAGGAGTGTCGCCCCCTTTAAAATTATTTATAATTATTTAGTTCTTTCTCTGGAAGGATTC
CCAACTTCAACATCATTTTTCCCCCATAAAACAAGTCAACTGGATTCAGTTAAAATGCCTCTAAATGCTTGTCTTTTGTTTGAATGAATTGGATTATAAATCCTGTAAAA
ACAATTCAAAGCAAGATGAGACATGAGTCTCCTCAAAAGCCTCATTTGGGTTATTTCAAGAGATGCCACTTAAATGTTTTTTTGTAAATTTAATGCATCTTTAACTAGAA
TGGAGAAATCTGCTCCTTCCTAGACCAAAAGAAACTTGGCTTCACGTGTGAGAAGCCTTGGAAAGCGTGTGGGAGTGTTAGGTCTAAATTTAAGCTGAGAAGAAAGAGAG
TGAGTAGATTTCAGCTCTGCCTGTCTGTCTGTTTGTGAATTATGAAAGCTTTTCTTCTGCTTAGAGTGATGATAGTTGTAATAGGCCATTCTGTCCTTTCAGTATTTAAG
GCTAAAGTCTAAAGGCAAAAAGATGCATTATTTGAACAGAGATTTAAATGCAGAATACTTAGGGCTCATACAAAGCAAAAGACAGCCATGTCAGGATAATGGAAACTGGG
CAGACAGTAGAACCTTTTTTTAGCTAACAATGTTAAAATTTCAAATCTTGTAATTTGAGTTGATTTTCAGTTAATCTCCCAGTTTTATTTTTTATCATCACATTTCCCTT
TTTCTTGACAAGAATGAAAAAGTTAGACAGAGAGAAGGATAAAGTTTGAAAGAGTGAAATTATTA
Protein sequence
MKKYNQLSPERAKVWTEKSPKYQQVRKVPVIYYLCRNRQLEHPHFMEVPVSSPDGLYLRDVINRLNVLRGRGMATLYSWSCKRSYKNGFVWHDLCQDDLILPAHGNEYVL
KGSELFEESISSKDPFSSIGNVSIQPLKQLPDPASSQSQDDSSSSSSMNGKELKNSQEDDLSLSVLRPGSSGMSPDSGGGKSSWGGCLSLTEYKTLNNVASNQKKNYDAP
QDSVSPPTLSSSASSSGGKTETLESLIRADASKINSFRILEEEEIRMPTNARLKATNVLMQLISCGSISVKDHSFGLIPSYKPRLKCWQGDGLTTLKRSSSYNADRFFLC
SYTLVKLNVLKSTNLKQFIIFICLQFSSYSLGR
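
As a quick consistency check, the CDS listed above can be translated with the standard genetic code and compared with the protein sequence. The sketch below is illustrative only: the helper name `translate` is hypothetical, the codon table is built from the standard code (an assumption that is reasonable for a nuclear plant gene but not stated on this page), and only the first 30 and last 15 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGAAGAAGTATAACCAACTGAGTCCTGAG"  # first 30 nt of the CDS above
cds_end = "TCTTTGGGCCGGTAA"                   # last 15 nt of the CDS above

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first ten and last four residues of the protein sequence shown above (`MKKYNQLSPE` and `SLGR`). Passing the full CDS to the same function would allow a residue-by-residue comparison.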