; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc03G00570 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc03G00570
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionSAGA-associated factor 11
Genome locationClcChr03:471622..481194
RNA-Seq ExpressionClc03G00570
SyntenyClc03G00570
Gene Ontology termsGO:0006325 - chromatin organization (biological process)
GO:0009737 - response to abscisic acid (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0016487 - farnesol metabolic process (biological process)
GO:0048440 - carpel development (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0031969 - chloroplast membrane (cellular component)
GO:0070461 - SAGA-type complex (cellular component)
GO:0016301 - kinase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0052668 - farnesol kinase activity (molecular function)
GO:0052670 - geraniol kinase activity (molecular function)
GO:0052671 - geranylgeraniol kinase activity (molecular function)
InterPro domainsIPR013246 - SAGA complex, Sgf11 subunit
IPR039606 - Phytol/farnesol kinase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0060025.1 farnesol kinase [Cucumis melo var. makuwa]2.8e-23592.78Show/hide
Query:  MLLPENPVVSDICAAALSGGVALALLRLWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAIFASLIPGVNIIRMLVLGLGILKDEATVKSMS
        ML PENPVVSDICA ALS GVAL+LL+LW ETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGY+GAIFASLIPG N+IRMLVLG GILKDEAT+KSMS
Subjt:  MLLPENPVVSDICAAALSGGVALALLRLWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAIFASLIPGVNIIRMLVLGLGILKDEATVKSMS

Query:  RYGDYRELLKGPLYYVATITLGCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSEKIFYNKNKSLAGSLAMASAGFLASVGYMYYFSSFGYLEGSSKM
        RYGDYRELLKGPLYYVATITL CIFYWRTSPISIALICNLCAGDG ADIVGRRFGSEKIFYN+NKSLAGS+AMA+AGFLAS+GYMYYFSSFGY+E S  M
Subjt:  RYGDYRELLKGPLYYVATITLGCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSEKIFYNKNKSLAGSLAMASAGFLASVGYMYYFSSFGYLEGSSKM

Query:  VLGFLVVSLASALVESLPLSTEIDDNLTVPLTSFLAYIETHPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEEL
         L FL+VSLASALVESLP+STE+DDNLTVPLTS LAYIET  ASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEEL
Subjt:  VLGFLVVSLASALVESLPLSTEIDDNLTVPLTSFLAYIETHPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEEL

Query:  RLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSTTAAQSRYSRGNPVSAYSPYPNSTG
        RLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRS+TAAQSRYSRGNPVSAYSPYPNST 
Subjt:  RLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSTTAAQSRYSRGNPVSAYSPYPNSTG

Query:  TNRLPNGTSSLAGEEYSNVPKIFSSFAINSINNFMEANCRKVVQKVANLCSATEKLE
        TNRLPNGTSSLAGEEYSNVPKIFSSFA NSINNFMEANCRKVVQKVANLCSAT KLE
Subjt:  TNRLPNGTSSLAGEEYSNVPKIFSSFAINSINNFMEANCRKVVQKVANLCSATEKLE

KAG6583348.1 hypothetical protein SDJN03_19280, partial [Cucurbita argyrosperma subsp. sororia]4.0e-21383.37Show/hide
Query:  MAAILQFRFRSSVGSLGPLFPLSSPTFLSRFTPVSLSFNSISTPIFRSGTFALRFRSKIRREHCPVAAVMLLPENPVVSDICAAALSGGVALALLRLWAE
        MAA LQF FRS  GS GPLFP SSPT +SRF PVS+SFNSI  PIFRS  F LRF +KIRRE C VAAVMLLP+NPVVSDICA A+SGGVAL+LLRLW E
Subjt:  MAAILQFRFRSSVGSLGPLFPLSSPTFLSRFTPVSLSFNSISTPIFRSGTFALRFRSKIRREHCPVAAVMLLPENPVVSDICAAALSGGVALALLRLWAE

Query:  TAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAIFASLIPGVNIIRMLVLGLGILKDEATVKSMSRYGDYRELLKGPLYYVATITLGCIFYWRTSP
        TAKRGLDQKLNRKLVH SIGLAFMLCWPMFSSG+RGA+ ASLIPGVNIIRMLVLGLGILKDEATVKSMSR GD RELLKGPLYYVATITL CIFYWRTSP
Subjt:  TAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAIFASLIPGVNIIRMLVLGLGILKDEATVKSMSRYGDYRELLKGPLYYVATITLGCIFYWRTSP

Query:  ISIALICNLCAGDGLADIVGRRFGSEKIFYNKNKSLAGSLAMASAGFLASVGYMYYFSSFGYLEGSSKMVLGFLVVSLASALVESLPLSTEIDDNLTVPL
        ISIALICNLCAGDGLADIVGRRFGS KI YNKNKSLAGS+AM SAGFLASVGYMYYFSSFGY+EGS++MVLGFLVVS+ASALVESLP+STEIDDNLTVPL
Subjt:  ISIALICNLCAGDGLADIVGRRFGSEKIFYNKNKSLAGSLAMASAGFLASVGYMYYFSSFGYLEGSSKMVLGFLVVSLASALVESLPLSTEIDDNLTVPL

Query:  TSFL------------------------------AYIETHPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELR
        TS L                              AYIET PASRSMSMP+ED+ASS TQLS NLFGDLLDSVI DVASECHRIARLGLDRNLEEEEEELR
Subjt:  TSFL------------------------------AYIETHPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELR

Query:  LSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSTTAAQSR
        LSAQAR RVADS NSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRS+TAAQSR
Subjt:  LSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSTTAAQSR

KAG6607329.1 hypothetical protein SDJN03_00671, partial [Cucurbita argyrosperma subsp. sororia]1.4e-20284.53Show/hide
Query:  LFPLSSPTFLSRFTPVSLSFNSISTPIFRSGTFALRFRS----KIRREHCPVAAVMLLPENPVVSDICAAALSGGVALALLRLWAETAKRGLDQKLNRKL
        ++PL SP+FLSRF  +S+S NSIS P  RSG+F  RFRS    KIRR+  PVAA MLLP+NPVVSDICA+ LSG VA +LLRLWAETAKRGLDQKLNRKL
Subjt:  LFPLSSPTFLSRFTPVSLSFNSISTPIFRSGTFALRFRS----KIRREHCPVAAVMLLPENPVVSDICAAALSGGVALALLRLWAETAKRGLDQKLNRKL

Query:  VHISIGLAFMLCWPMFSSGYRGAIFASLIPGVNIIRMLVLGLGILKDEATVKSMSRYGDYRELLKGPLYYVATITLGCIFYWRTSPISIALICNLCAGDG
        VHISIGLAFMLCWPMFSSG RGAI ASL+PGVNIIRMLV GLGI+KDEATVKSM+RYGDYRELLKGPLYYVATITL CIFYWRTSPISIALICNLCAGDG
Subjt:  VHISIGLAFMLCWPMFSSGYRGAIFASLIPGVNIIRMLVLGLGILKDEATVKSMSRYGDYRELLKGPLYYVATITLGCIFYWRTSPISIALICNLCAGDG

Query:  LADIVGRRFGSEKIFYNKNKSLAGSLAMASAGFLASVGYMYYFSSFGYLEGSSKMVLGFLVVSLASALVESLPLSTEIDDNLTVPLTSFLAY-----IET
        +ADI+GRRFG++KI YNKNKS+ GS+AMASAGFLASVGYMYYFSSFGY+EGSS+MVLGFLVVSLASALVESLP+STEIDDNLTVPLTSFL+      IET
Subjt:  LADIVGRRFGSEKIFYNKNKSLAGSLAMASAGFLASVGYMYYFSSFGYLEGSSKMVLGFLVVSLASALVESLPLSTEIDDNLTVPLTSFLAY-----IET

Query:  HPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIF
         P+SRSMSMP+EDNASS  QLSSN FGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQ HPSVA+EIF
Subjt:  HPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIF

Query:  DCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSTTAAQSRYSRG
        DCMNCGRSI+AGRFAPHLEKCMGRGRKAR KVTRS+TA QSR + G
Subjt:  DCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSTTAAQSRYSRG

KAG7019118.1 Farnesol kinase, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]1.2e-23689.32Show/hide
Query:  MAAILQFRFRSSVGSLGPLFPLSSPTFLSRFTPVSLSFNSISTPIFRSGTFALRFRSKIRREHCPVAAVMLLPENPVVSDICAAALSGGVALALLRLWAE
        MAA LQF FRS  GS GPLFP SSPT +SRF PVS+SFNSI  PIFRS  F LRF +KIRRE C VAAVMLLP+NPVVSDICA A+SGGVAL+LLRLW E
Subjt:  MAAILQFRFRSSVGSLGPLFPLSSPTFLSRFTPVSLSFNSISTPIFRSGTFALRFRSKIRREHCPVAAVMLLPENPVVSDICAAALSGGVALALLRLWAE

Query:  TAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAIFASLIPGVNIIRMLVLGLGILKDEATVKSMSRYGDYRELLKGPLYYVATITLGCIFYWRTSP
        TAKRGLDQKLNRKLVH SIGLAFMLCWPMFSSG+RGA+ ASLIPGVNIIRMLVLGLGILKDEATVKSMSR GD RELLKGPLYYVATITL CIFYWRTSP
Subjt:  TAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAIFASLIPGVNIIRMLVLGLGILKDEATVKSMSRYGDYRELLKGPLYYVATITLGCIFYWRTSP

Query:  ISIALICNLCAGDGLADIVGRRFGSEKIFYNKNKSLAGSLAMASAGFLASVGYMYYFSSFGYLEGSSKMVLGFLVVSLASALVESLPLSTEIDDNLTVPL
        ISIALICNLCAGDGLADIVGRRFGS KI YNKNKSLAGS+AM SAGFLASVGYMYYFSSFGY+EGS++MVLGFLVVS+ASALVESLP+STEIDDNLTVPL
Subjt:  ISIALICNLCAGDGLADIVGRRFGSEKIFYNKNKSLAGSLAMASAGFLASVGYMYYFSSFGYLEGSSKMVLGFLVVSLASALVESLPLSTEIDDNLTVPL

Query:  TSFLAYIETHPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQT
        TS LAYIET PASRSMSMP+ED+ASS TQLS NLFGDLLDSVI DVASECHRIARLGLDRNLEEEEEELRLSAQAR RVADS NSSEANGKYVVDIFGQT
Subjt:  TSFLAYIETHPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQT

Query:  HPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSTTAAQSRYSRGNPVSAYSPYPNSTGTNRLPNGTSSLAGEEYSN
        HPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRS+TAAQSRYSRGNPVS YSPYPNST TNRLPNGTSSLAGEEYSN
Subjt:  HPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSTTAAQSRYSRGNPVSAYSPYPNSTGTNRLPNGTSSLAGEEYSN

KAG8471435.1 hypothetical protein CXB51_036412 [Gossypium anomalum]3.8e-17167.41Show/hide
Query:  LFPLSSPTFLSRFTPVSLSFNSISTPIFRSGTFALR-------FRSKIRREHCPVAAVMLLPENPVVSDICAAALSGGVALALLRLWAETAKRGL-DQKL
        LF   S T + +  P    F+ +S P   S   A         +  K R      AA ML P+N + SD CAA +SG +AL++LRLW ETAKRGL DQKL
Subjt:  LFPLSSPTFLSRFTPVSLSFNSISTPIFRSGTFALR-------FRSKIRREHCPVAAVMLLPENPVVSDICAAALSGGVALALLRLWAETAKRGL-DQKL

Query:  NRKLVHISIGLAFMLCWPMFSSGYRGAIFASLIPGVNIIRMLVLGLGILKDEATVKSMSRYGDYRELLKGPLYYVATITLGCIFYWRTSPISIALICNLC
        NRKLVHISIGL FMLCWP++SSGYRGAI A++ PGVNIIRM+++G G+ KDEATVKSMSRYGDYRELLKGPLYY  TITL C FYWRTSPI+IA ICNLC
Subjt:  NRKLVHISIGLAFMLCWPMFSSGYRGAIFASLIPGVNIIRMLVLGLGILKDEATVKSMSRYGDYRELLKGPLYYVATITLGCIFYWRTSPISIALICNLC

Query:  AGDGLADIVGRRFGSEKIFYNKNKSLAGSLAMASAGFLASVGYMYYFSSFGYLEGSSKMVLGFLVVSLASALVESLPLSTEIDDNLTVPLTSFL------
        AGDG ADIVGR+FG +K+ YNKNKS+AGS+AMA AGFL SVGYMYYFS FGYL+ S+++V GFL+VSLASALVESLP+STE+DDNLTV LTS L      
Subjt:  AGDGLADIVGRRFGSEKIFYNKNKSLAGSLAMASAGFLASVGYMYYFSSFGYLEGSSKMVLGFLVVSLASALVESLPLSTEIDDNLTVPLTSFL------

Query:  ------AYIETHPASR-----SMSMPNEDNASSQTQLSSNLFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYV
               +    P SR      +        +   +LSS+ FGDLLDS+IVDVASECHRIA+LGLDRNLEEEEEE+RLS QAR RVAD SNSSE N KYV
Subjt:  ------AYIETHPASR-----SMSMPNEDNASSQTQLSSNLFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYV

Query:  VDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSTTAAQSRYSRGNPVSAYSPYPNSTGTNRLPNGTSSLAGEE
        VDIFGQTHPSVA EIF+CMNCGRSI AGRFAPHLEKCMG+GRKAR KVTRS+TAAQ+RYSRG+PVSAYSPY NST TNRLPNGT S+AGEE
Subjt:  VDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSTTAAQSRYSRGNPVSAYSPYPNSTGTNRLPNGTSSLAGEE

TrEMBL top hitse value%identityAlignment
A0A5D3BBT0 SAGA-associated factor 111.4e-23592.78Show/hide
Query:  MLLPENPVVSDICAAALSGGVALALLRLWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAIFASLIPGVNIIRMLVLGLGILKDEATVKSMS
        ML PENPVVSDICA ALS GVAL+LL+LW ETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGY+GAIFASLIPG N+IRMLVLG GILKDEAT+KSMS
Subjt:  MLLPENPVVSDICAAALSGGVALALLRLWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAIFASLIPGVNIIRMLVLGLGILKDEATVKSMS

Query:  RYGDYRELLKGPLYYVATITLGCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSEKIFYNKNKSLAGSLAMASAGFLASVGYMYYFSSFGYLEGSSKM
        RYGDYRELLKGPLYYVATITL CIFYWRTSPISIALICNLCAGDG ADIVGRRFGSEKIFYN+NKSLAGS+AMA+AGFLAS+GYMYYFSSFGY+E S  M
Subjt:  RYGDYRELLKGPLYYVATITLGCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSEKIFYNKNKSLAGSLAMASAGFLASVGYMYYFSSFGYLEGSSKM

Query:  VLGFLVVSLASALVESLPLSTEIDDNLTVPLTSFLAYIETHPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEEL
         L FL+VSLASALVESLP+STE+DDNLTVPLTS LAYIET  ASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEEL
Subjt:  VLGFLVVSLASALVESLPLSTEIDDNLTVPLTSFLAYIETHPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEEL

Query:  RLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSTTAAQSRYSRGNPVSAYSPYPNSTG
        RLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRS+TAAQSRYSRGNPVSAYSPYPNST 
Subjt:  RLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSTTAAQSRYSRGNPVSAYSPYPNSTG

Query:  TNRLPNGTSSLAGEEYSNVPKIFSSFAINSINNFMEANCRKVVQKVANLCSATEKLE
        TNRLPNGTSSLAGEEYSNVPKIFSSFA NSINNFMEANCRKVVQKVANLCSAT KLE
Subjt:  TNRLPNGTSSLAGEEYSNVPKIFSSFAINSINNFMEANCRKVVQKVANLCSATEKLE

A0A6J1DBM3 probable phytol kinase 3, chloroplastic1.3e-13281.49Show/hide
Query:  MAAILQFRFRSSVGSLGPLFPLSSPTFLSRFTPVSLSFNSISTPIFRSGTFALRFRS----KIRREHCPVAAVMLLPENPVVSDICAAALSGGVALALLR
        MAAILQ RFRSS+G   P F    P FL +F PVS+SFN IS P      F LRF S    KIRR   PVAAVMLLP+NPVVSDICA A++GG+AL+LLR
Subjt:  MAAILQFRFRSSVGSLGPLFPLSSPTFLSRFTPVSLSFNSISTPIFRSGTFALRFRS----KIRREHCPVAAVMLLPENPVVSDICAAALSGGVALALLR

Query:  LWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAIFASLIPGVNIIRMLVLGLGILKDEATVKSMSRYGDYRELLKGPLYYVATITLGCIFYW
        LW ETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSG RGA+ ASLIPGVNIIRMLVLGLGILKDEATVKSMSRYGDYRELLKGPLYYV TITL CI YW
Subjt:  LWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAIFASLIPGVNIIRMLVLGLGILKDEATVKSMSRYGDYRELLKGPLYYVATITLGCIFYW

Query:  RTSPISIALICNLCAGDGLADIVGRRFGSEKIFYNKNKSLAGSLAMASAGFLASVGYMYYFSSFGYLEGSSKMVLGFLVVSLASALVESLPLSTEIDDNL
        RTSPISIAL+CNLCAGDGLAD++GRRFGS KI YNKNKSLAGS+AMASAGFLASVGYMYYFSSFGYLEGSS+M+LGFLVVS+ASALVESLP+STEIDDNL
Subjt:  RTSPISIALICNLCAGDGLADIVGRRFGSEKIFYNKNKSLAGSLAMASAGFLASVGYMYYFSSFGYLEGSSKMVLGFLVVSLASALVESLPLSTEIDDNL

Query:  TVPLTSFL
        +VPLTS L
Subjt:  TVPLTSFL

A0A6J1HM59 farnesol kinase, chloroplastic-like9.2e-13986.51Show/hide
Query:  MAAILQFRFRSSVGSLGPLFPLSSPTFLSRFTPVSLSFNSISTPIFRSGTFALRFRSKIRREHCPVAAVMLLPENPVVSDICAAALSGGVALALLRLWAE
        MAA LQF FRS  GS GPLFP SSPT +SRF PVS+SFNSI  PIFRS  F LRF +KIRRE C VAAVMLLP+NPVVSDICA A+SGGVAL+LLRLW E
Subjt:  MAAILQFRFRSSVGSLGPLFPLSSPTFLSRFTPVSLSFNSISTPIFRSGTFALRFRSKIRREHCPVAAVMLLPENPVVSDICAAALSGGVALALLRLWAE

Query:  TAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAIFASLIPGVNIIRMLVLGLGILKDEATVKSMSRYGDYRELLKGPLYYVATITLGCIFYWRTSP
        TAKRGLDQKLNRKLVH SIGLAFMLCWPMFSSG+RGA+ ASLIPGVNIIRMLVLGLGILKDEATVKSMSR GD RELLKGPLYYVATITL CIFYWRTSP
Subjt:  TAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAIFASLIPGVNIIRMLVLGLGILKDEATVKSMSRYGDYRELLKGPLYYVATITLGCIFYWRTSP

Query:  ISIALICNLCAGDGLADIVGRRFGSEKIFYNKNKSLAGSLAMASAGFLASVGYMYYFSSFGYLEGSSKMVLGFLVVSLASALVESLPLSTEIDDNLTVPL
        ISIALICNLCAGDGLADIVGRRFGS KI YNKNKSLAGS+AM SAGFLASVGYMYYFSSFGY+EGS++MVLGFLVVS+ASALVESLP+STEIDDNLTVPL
Subjt:  ISIALICNLCAGDGLADIVGRRFGSEKIFYNKNKSLAGSLAMASAGFLASVGYMYYFSSFGYLEGSSKMVLGFLVVSLASALVESLPLSTEIDDNLTVPL

Query:  TSFL
        TS L
Subjt:  TSFL

A0A6J1I3Z6 farnesol kinase, chloroplastic1.9e-13685.2Show/hide
Query:  MAAILQFRFRSSVGSLGPLFPLSSPTFLSRFTPVSLSFNSISTPIFRSGTFALRFRSKIRREHCPVAAVMLLPENPVVSDICAAALSGGVALALLRLWAE
        MAA LQF F S  GS GP FP SSPT +SRF PVS+SFNSI  PIFR   F LRF +KIRRE C VAA MLLP+NPVVSDICA A+SGGVAL+LLRLW E
Subjt:  MAAILQFRFRSSVGSLGPLFPLSSPTFLSRFTPVSLSFNSISTPIFRSGTFALRFRSKIRREHCPVAAVMLLPENPVVSDICAAALSGGVALALLRLWAE

Query:  TAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAIFASLIPGVNIIRMLVLGLGILKDEATVKSMSRYGDYRELLKGPLYYVATITLGCIFYWRTSP
        TAKRGLDQKLNRKLVH SIGLAFMLCWPMFSSG+RGA+ ASLIPGVNIIRMLVLGLGILKDEATVKSMSR GD RELLKGPLYYVATITL CIFYWRTSP
Subjt:  TAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAIFASLIPGVNIIRMLVLGLGILKDEATVKSMSRYGDYRELLKGPLYYVATITLGCIFYWRTSP

Query:  ISIALICNLCAGDGLADIVGRRFGSEKIFYNKNKSLAGSLAMASAGFLASVGYMYYFSSFGYLEGSSKMVLGFLVVSLASALVESLPLSTEIDDNLTVPL
        ISIALICNLCAGDGLADIVGRRFGS KI YNKNKSLAGS+AM SAGFLASVGYMYYFSSFGY+EGS++MVLGFLVVS+ASALVESLP+STEIDDNLTVPL
Subjt:  ISIALICNLCAGDGLADIVGRRFGSEKIFYNKNKSLAGSLAMASAGFLASVGYMYYFSSFGYLEGSSKMVLGFLVVSLASALVESLPLSTEIDDNLTVPL

Query:  TSFL
        TS L
Subjt:  TSFL

D7MQE9 SAGA-associated factor 111.5e-13658.19Show/hide
Query:  TPIFRSGTFALRFRSKIRREHCPVAAVMLLPENPVVSDICAAALSGGVALALLRLWAETAKRG-LDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAIFAS
        +PI R  T + R  S+        +   L   +      CA  ++  VA + L  W E  KR  LDQKL RKLVHI+IGL FMLCWP+FSSG +GA+FAS
Subjt:  TPIFRSGTFALRFRSKIRREHCPVAAVMLLPENPVVSDICAAALSGGVALALLRLWAETAKRG-LDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAIFAS

Query:  LIPGVNIIRMLVLGLGILKDEATVKSMSRYGDYRELLKGPLYYVATITLGCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSEKIFYNKNKSLAGSLA
        L+PG+NIIRML+LGLG+  DE T+KSMSR+GD RELLKGPLYY  +IT  CIFYW++SPI+IA+ICNLCAGDG+ADIVGRRFG+EK+ YNKNKS AGS+ 
Subjt:  LIPGVNIIRMLVLGLGILKDEATVKSMSRYGDYRELLKGPLYYVATITLGCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSEKIFYNKNKSLAGSLA

Query:  MASAGFLASVGYMYYFSSFGYLEGSSKMVLGFLVVSLASALVESLPLST---------------EIDDNLTVPLTSF-LAYIETHPASRSMSMPNE--DN
        MA+AGFLASVGYMYYF+SFGY+E S  M+L FL++SLASALV  + +S                  D N    LT   +  + +H     +S+P      
Subjt:  MASAGFLASVGYMYYFSSFGYLEGSSKMVLGFLVVSLASALVESLPLST---------------EIDDNLTVPLTSF-LAYIETHPASRSMSMPNE--DN

Query:  ASSQTQLSSNLFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRF
        A+    LSS +F DL+DSVI DVASECHR+ARLGLDR+LE  EEELRLS +AR +VAD SN+ E N K+VVDIFGQTHP VA E+F+CMNCGR I+AGRF
Subjt:  ASSQTQLSSNLFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRF

Query:  APHLEKCMGRGRKARPKVTRSTTAAQSRYSRGNPVSAYSPYPNSTGTNRLPNGTSSLAGEEYSN
        APHLEKCMG+GRKAR K TRSTTAAQ+R +R +P   YSPYPNS   N+L +G+  +AGE+ SN
Subjt:  APHLEKCMGRGRKARPKVTRSTTAAQSRYSRGNPVSAYSPYPNSTGTNRLPNGTSSLAGEEYSN

SwissProt top hitse value%identityAlignment
Q2N2K0 Probable phytol kinase 3, chloroplastic2.3e-9467.71Show/hide
Query:  PLFPLSSPTFLSRFTPVSLSFNSISTPIFRSGTFALRFRSKIRREHCPVAAVMLLPENPVVSDICAAALSGGVALALLRLWAETAKRGL-DQKLNRKLVH
        P F   SP FLS+  P  L F S S+    S +F   F S       P  + M L  +P+VSD+ A A+SG VAL+ LRL+ ETAKR L DQKLNRKLVH
Subjt:  PLFPLSSPTFLSRFTPVSLSFNSISTPIFRSGTFALRFRSKIRREHCPVAAVMLLPENPVVSDICAAALSGGVALALLRLWAETAKRGL-DQKLNRKLVH

Query:  ISIGLAFMLCWPMFSSGYRGAIFASLIPGVNIIRMLVLGLGILKDEATVKSMSRYGDYRELLKGPLYYVATITLGCIFYWRTSPISIALICNLCAGDGLA
        ISIGL FMLC P+FS+    + FA+LIPG+NI RMLV+GLGILKDEATVKSMSR+GDYRELLKGPLYY ATITL  I YWRTSPISIA ICNLCAGDG+A
Subjt:  ISIGLAFMLCWPMFSSGYRGAIFASLIPGVNIIRMLVLGLGILKDEATVKSMSRYGDYRELLKGPLYYVATITLGCIFYWRTSPISIALICNLCAGDGLA

Query:  DIVGRRFGSEKIFYNKNKSLAGSLAMASAGFLASVGYMYYFSSFGYLEGSSKMVLGFLVVSLASALVESLPLSTEIDDNLTVPLTSFL
        DIVGRR G EKI YNKNKS AGS+AMA+AGFL S+GYM+YFSSFG++EGS K+VLGFL+VS+ +A VESLP+STE+DDNLTVPLTS L
Subjt:  DIVGRRFGSEKIFYNKNKSLAGSLAMASAGFLASVGYMYYFSSFGYLEGSSKMVLGFLVVSLASALVESLPLSTEIDDNLTVPLTSFL

Q2N2K1 Probable phytol kinase 1, chloroplastic3.8e-4946.22Show/hide
Query:  GVALALLRLWAETAKRG-LDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAIFASLIPGVNIIRMLVLGLGILKDEATVKSMSRYGDYRELLKGPLYYVAT
        G   AL+R + E  +R  L Q L+RKLVHI  GL F++ WP+FS+  +   FA+ +P VN +R+LV GL +  DE  +KS++R GD  ELL+GPLYYV  
Subjt:  GVALALLRLWAETAKRG-LDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAIFASLIPGVNIIRMLVLGLGILKDEATVKSMSRYGDYRELLKGPLYYVAT

Query:  ITLGCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSEKIFYNKNKSLAGSLAMASAGFLASVGYMYYFSSFGYLEGSSKMVLGFLV-VSLASALVESL
        + L  + +WR SPI +  +  +CAGDG+ADI+GRR+GS KI YN++KSLAGS++M   GFL S+G +YY+S  G+++      L  +  +S  + LVESL
Subjt:  ITLGCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSEKIFYNKNKSLAGSLAMASAGFLASVGYMYYFSSFGYLEGSSKMVLGFLV-VSLASALVESL

Query:  PLSTEIDDNLTVPL-TSFLAYIETH
        P++  +DDN++VPL T  +A+   H
Subjt:  PLSTEIDDNLTVPL-TSFLAYIETH

Q5N9J9 Probable phytol kinase 2, chloroplastic6.4e-8164.58Show/hide
Query:  VAAVMLLPENPVVSDICAAALSGGVALALLRLWAETAKRGL-DQKLNRKLVHISIGLAFMLCWPMFSSGYRGAIFASLIPGVNIIRMLVLGLGILKDEAT
        +AA +    + +  D+ +AA++ GVALALLR + E AKRG+ +QKLNRKLVHI+IG+ F+L WP+FSSG      A++ PG+NIIRML+LGLG++K+EA 
Subjt:  VAAVMLLPENPVVSDICAAALSGGVALALLRLWAETAKRGL-DQKLNRKLVHISIGLAFMLCWPMFSSGYRGAIFASLIPGVNIIRMLVLGLGILKDEAT

Query:  VKSMSRYGDYRELLKGPLYYVATITLGCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSEKIFYNKNKSLAGSLAMASAGFLASVGYMYYFSSFGYLE
        VKSMSR GD RELLKGPLYY  TIT     +WRTSPI+IALICNLCAGDG+ADIVGRR G EK+ YN NKS AGS+AMA AGF+AS+GYM+YF SFG++E
Subjt:  VKSMSRYGDYRELLKGPLYYVATITLGCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSEKIFYNKNKSLAGSLAMASAGFLASVGYMYYFSSFGYLE

Query:  GSSKMVLGFLVVSLASALVESLPLSTEIDDNLTVPLTSFL
         S  +  GFLVVS+ +ALVES P+ST +DDNLTVPLTSFL
Subjt:  GSSKMVLGFLVVSLASALVESLPLSTEIDDNLTVPLTSFL

Q67ZM7 Farnesol kinase, chloroplastic1.1e-9363.64Show/hide
Query:  LSSPTFLSRFTPVSLSFNSISTPIFRSGTFALRF-RSKIRREHCPVAAVMLLPENPVVSDICAAALSGGVALALLRLWAETAKRGL-DQKLNRKLVHISI
        + SP  L+ F+P+           FRS +   RF  +KIR+    +AAVM  PEN V+SD+CA  ++  VA + L  W E  KRG+ DQKL RKLVHI+I
Subjt:  LSSPTFLSRFTPVSLSFNSISTPIFRSGTFALRF-RSKIRREHCPVAAVMLLPENPVVSDICAAALSGGVALALLRLWAETAKRGL-DQKLNRKLVHISI

Query:  GLAFMLCWPMFSSGYRGAIFASLIPGVNIIRMLVLGLGILKDEATVKSMSRYGDYRELLKGPLYYVATITLGCIFYWRTSPISIALICNLCAGDGLADIV
        GL FMLCWP+FSSG +GA+FASL+PG+NI+RML+LGLG+  DE T+KSMSR+GD RELLKGPLYYV +IT  CI+YW++SPI+IA+ICNLCAGDG+ADIV
Subjt:  GLAFMLCWPMFSSGYRGAIFASLIPGVNIIRMLVLGLGILKDEATVKSMSRYGDYRELLKGPLYYVATITLGCIFYWRTSPISIALICNLCAGDGLADIV

Query:  GRRFGSEKIFYNKNKSLAGSLAMASAGFLASVGYMYYFSSFGYLEGSSKMVLGFLVVSLASALVESLPLSTEIDDNLTVPLTSFLA
        GRRFG+EK+ YNKNKS AGS+ MA+AGFLASV YMYYF+SFGY+E S  M+L FLV+S+ASALVESLP+ST+IDDNLT+ LTS LA
Subjt:  GRRFGSEKIFYNKNKSLAGSLAMASAGFLASVGYMYYFSSFGYLEGSSKMVLGFLVVSLASALVESLPLSTEIDDNLTVPLTSFLA

Q94BV2 SAGA-associated factor 119.1e-5968.86Show/hide
Query:  EDNASSQTQLSSNLFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMA
        EDN SS  QLSS +F DL+DSVI DVASECHR+ARLGLDR+L+  EEELRLS +AR ++AD SN+ E N KYVVDIFGQTHP VA+E+F+CMNCGR I+A
Subjt:  EDNASSQTQLSSNLFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMA

Query:  GRFAPHLEKCMGRGRKARPKVTRSTTAAQSRYSRGNPVSAYSPYPNSTGTNRLPNGTSSLAGEEYSN
        GRFAPHLEKCMG+GRKAR K TRSTTAAQ+R +R +P   YSPYPNS   N+L +G+  +AGE+ SN
Subjt:  GRFAPHLEKCMGRGRKARPKVTRSTTAAQSRYSRGNPVSAYSPYPNSTGTNRLPNGTSSLAGEEYSN

Arabidopsis top hitse value%identityAlignment
AT5G04490.1 vitamin E pathway gene 52.5e-4839.7Show/hide
Query:  ISTPIFRSGTFALRFRSKIRREHCPVAAVMLLPENPVVSDICA-AALSGGVALALLRLWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAIF
        IS+P F  G   +   +++R     +++   +  N ++ D+ A  A+ GG    +L   + T +  + Q L+RKLVHI  GL F+L WP+FS       F
Subjt:  ISTPIFRSGTFALRFRSKIRREHCPVAAVMLLPENPVVSDICA-AALSGGVALALLRLWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAIF

Query:  ASLIPGVNIIRMLVLGLGILKDEATVKSMSRYGDYRELLKGPLYYVATITLGCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSEKIFYNKNKSLAGS
        A+ +P VN +R+++ GL I  +   +KS++R G   ELLKGPL+YV  +    +F+WR SPI +  +  +C GDG+ADI+GR+FGS KI YN  KS AGS
Subjt:  ASLIPGVNIIRMLVLGLGILKDEATVKSMSRYGDYRELLKGPLYYVATITLGCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSEKIFYNKNKSLAGS

Query:  LAMASAGFLASVGYMYYFSSFGYLEGSSKMVL-GFLVVSLASALVESLPLSTEIDDNLTVPLTSFLA
        ++M   GF  S+  +YY+SS GYL  + +  L    +VS+ + +VESLP++ ++DDN++VPL + LA
Subjt:  LAMASAGFLASVGYMYYFSSFGYLEGSSKMVL-GFLVVSLASALVESLPLSTEIDDNLTVPLTSFLA

AT5G58560.1 Phosphatidate cytidylyltransferase family protein8.0e-9563.64Show/hide
Query:  LSSPTFLSRFTPVSLSFNSISTPIFRSGTFALRF-RSKIRREHCPVAAVMLLPENPVVSDICAAALSGGVALALLRLWAETAKRGL-DQKLNRKLVHISI
        + SP  L+ F+P+           FRS +   RF  +KIR+    +AAVM  PEN V+SD+CA  ++  VA + L  W E  KRG+ DQKL RKLVHI+I
Subjt:  LSSPTFLSRFTPVSLSFNSISTPIFRSGTFALRF-RSKIRREHCPVAAVMLLPENPVVSDICAAALSGGVALALLRLWAETAKRGL-DQKLNRKLVHISI

Query:  GLAFMLCWPMFSSGYRGAIFASLIPGVNIIRMLVLGLGILKDEATVKSMSRYGDYRELLKGPLYYVATITLGCIFYWRTSPISIALICNLCAGDGLADIV
        GL FMLCWP+FSSG +GA+FASL+PG+NI+RML+LGLG+  DE T+KSMSR+GD RELLKGPLYYV +IT  CI+YW++SPI+IA+ICNLCAGDG+ADIV
Subjt:  GLAFMLCWPMFSSGYRGAIFASLIPGVNIIRMLVLGLGILKDEATVKSMSRYGDYRELLKGPLYYVATITLGCIFYWRTSPISIALICNLCAGDGLADIV

Query:  GRRFGSEKIFYNKNKSLAGSLAMASAGFLASVGYMYYFSSFGYLEGSSKMVLGFLVVSLASALVESLPLSTEIDDNLTVPLTSFLA
        GRRFG+EK+ YNKNKS AGS+ MA+AGFLASV YMYYF+SFGY+E S  M+L FLV+S+ASALVESLP+ST+IDDNLT+ LTS LA
Subjt:  GRRFGSEKIFYNKNKSLAGSLAMASAGFLASVGYMYYFSSFGYLEGSSKMVLGFLVVSLASALVESLPLSTEIDDNLTVPLTSFLA

AT5G58575.1 CONTAINS InterPro DOMAIN/s: Sgf11, transcriptional regulation (InterPro:IPR013246); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink).6.4e-6068.86Show/hide
Query:  EDNASSQTQLSSNLFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMA
        EDN SS  QLSS +F DL+DSVI DVASECHR+ARLGLDR+L+  EEELRLS +AR ++AD SN+ E N KYVVDIFGQTHP VA+E+F+CMNCGR I+A
Subjt:  EDNASSQTQLSSNLFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMA

Query:  GRFAPHLEKCMGRGRKARPKVTRSTTAAQSRYSRGNPVSAYSPYPNSTGTNRLPNGTSSLAGEEYSN
        GRFAPHLEKCMG+GRKAR K TRSTTAAQ+R +R +P   YSPYPNS   N+L +G+  +AGE+ SN
Subjt:  GRFAPHLEKCMGRGRKARPKVTRSTTAAQSRYSRGNPVSAYSPYPNSTGTNRLPNGTSSLAGEEYSN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCAATTCTTCAATTTCGATTTCGCTCATCAGTTGGGTCGTTGGGCCCTCTCTTTCCGCTCAGCTCCCCAACTTTTCTCTCTCGATTCACACCAGTTTCCCTCTC
TTTCAACTCCATTTCTACGCCAATCTTCCGCTCCGGTACCTTCGCTTTGAGATTTCGGTCGAAAATCCGTCGGGAACACTGCCCAGTTGCGGCAGTTATGTTGTTGCCTG
AAAATCCGGTGGTCTCCGATATCTGCGCCGCCGCCTTGTCTGGCGGAGTCGCCTTGGCTTTGCTTCGATTATGGGCGGAAACGGCGAAACGTGGCCTCGACCAGAAATTG
AACAGGAAGCTTGTTCATATAAGCATTGGGCTTGCATTCATGCTTTGCTGGCCTATGTTCAGTTCTGGTTATCGAGGAGCAATATTTGCATCTCTAATTCCCGGTGTGAA
TATTATACGAATGCTCGTCCTGGGACTCGGGATATTGAAAGATGAGGCTACGGTGAAGTCAATGAGCAGATATGGAGACTATAGGGAGCTTTTGAAGGGGCCTTTGTATT
ATGTTGCAACTATTACATTAGGTTGTATATTCTATTGGAGGACCTCCCCCATTTCAATTGCACTGATATGCAACTTATGTGCCGGAGATGGGTTGGCTGATATTGTTGGA
AGACGATTTGGAAGTGAAAAGATCTTTTACAACAAGAACAAGTCACTAGCTGGTAGTTTAGCAATGGCATCTGCTGGTTTTCTTGCATCTGTTGGGTATATGTACTATTT
CTCATCATTTGGGTATCTTGAGGGAAGCAGCAAAATGGTTTTGGGATTCTTAGTTGTGTCCCTTGCCTCAGCATTGGTGGAGTCTCTCCCCTTAAGCACTGAGATTGATG
ACAACCTCACTGTTCCACTCACTTCCTTCCTGGCCTATATTGAAACTCATCCTGCTTCTAGATCCATGTCAATGCCTAATGAGGATAATGCATCTTCACAAACTCAGCTT
TCATCTAATTTGTTTGGGGATCTCCTGGATTCCGTGATTGTTGATGTTGCATCGGAATGTCATCGAATAGCAAGGTTAGGTCTTGATCGTAACTTAGAAGAGGAAGAAGA
AGAATTAAGACTTTCAGCTCAGGCACGAGTAAGAGTAGCTGATTCTAGCAATAGTAGTGAGGCAAACGGCAAATATGTAGTTGATATTTTTGGACAAACTCATCCTTCTG
TTGCGAACGAAATATTTGATTGCATGAATTGTGGTCGATCAATTATGGCTGGGAGATTTGCCCCTCATTTAGAGAAATGCATGGGAAGGGGTAGAAAGGCTCGTCCCAAA
GTAACAAGAAGTACCACAGCTGCCCAGAGCCGGTATTCACGAGGCAATCCTGTTTCTGCATATTCCCCTTACCCTAATTCCACCGGCACGAACCGCTTACCTAATGGAAC
GTCTAGTCTTGCAGGGGAGGAGTACTCAAATGTTCCCAAGATATTCAGTTCTTTTGCTATAAATTCTATTAACAATTTTATGGAAGCAAATTGCAGGAAGGTGGTCCAAA
AGGTTGCCAATCTATGCTCAGCCACTGAAAAACTAGAATGTGCTCCACTCAGCTATAATAAAGTAGTTTCTTTAGAAGATTTGGAGGAAAAATACGTCCTACCTCTCTGT
CCTCTCTCTCTAGCAGGAATAAACACATCATCAAACACCAAAGATCATATCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGCAATTCTTCAATTTCGATTTCGCTCATCAGTTGGGTCGTTGGGCCCTCTCTTTCCGCTCAGCTCCCCAACTTTTCTCTCTCGATTCACACCAGTTTCCCTCTC
TTTCAACTCCATTTCTACGCCAATCTTCCGCTCCGGTACCTTCGCTTTGAGATTTCGGTCGAAAATCCGTCGGGAACACTGCCCAGTTGCGGCAGTTATGTTGTTGCCTG
AAAATCCGGTGGTCTCCGATATCTGCGCCGCCGCCTTGTCTGGCGGAGTCGCCTTGGCTTTGCTTCGATTATGGGCGGAAACGGCGAAACGTGGCCTCGACCAGAAATTG
AACAGGAAGCTTGTTCATATAAGCATTGGGCTTGCATTCATGCTTTGCTGGCCTATGTTCAGTTCTGGTTATCGAGGAGCAATATTTGCATCTCTAATTCCCGGTGTGAA
TATTATACGAATGCTCGTCCTGGGACTCGGGATATTGAAAGATGAGGCTACGGTGAAGTCAATGAGCAGATATGGAGACTATAGGGAGCTTTTGAAGGGGCCTTTGTATT
ATGTTGCAACTATTACATTAGGTTGTATATTCTATTGGAGGACCTCCCCCATTTCAATTGCACTGATATGCAACTTATGTGCCGGAGATGGGTTGGCTGATATTGTTGGA
AGACGATTTGGAAGTGAAAAGATCTTTTACAACAAGAACAAGTCACTAGCTGGTAGTTTAGCAATGGCATCTGCTGGTTTTCTTGCATCTGTTGGGTATATGTACTATTT
CTCATCATTTGGGTATCTTGAGGGAAGCAGCAAAATGGTTTTGGGATTCTTAGTTGTGTCCCTTGCCTCAGCATTGGTGGAGTCTCTCCCCTTAAGCACTGAGATTGATG
ACAACCTCACTGTTCCACTCACTTCCTTCCTGGCCTATATTGAAACTCATCCTGCTTCTAGATCCATGTCAATGCCTAATGAGGATAATGCATCTTCACAAACTCAGCTT
TCATCTAATTTGTTTGGGGATCTCCTGGATTCCGTGATTGTTGATGTTGCATCGGAATGTCATCGAATAGCAAGGTTAGGTCTTGATCGTAACTTAGAAGAGGAAGAAGA
AGAATTAAGACTTTCAGCTCAGGCACGAGTAAGAGTAGCTGATTCTAGCAATAGTAGTGAGGCAAACGGCAAATATGTAGTTGATATTTTTGGACAAACTCATCCTTCTG
TTGCGAACGAAATATTTGATTGCATGAATTGTGGTCGATCAATTATGGCTGGGAGATTTGCCCCTCATTTAGAGAAATGCATGGGAAGGGGTAGAAAGGCTCGTCCCAAA
GTAACAAGAAGTACCACAGCTGCCCAGAGCCGGTATTCACGAGGCAATCCTGTTTCTGCATATTCCCCTTACCCTAATTCCACCGGCACGAACCGCTTACCTAATGGAAC
GTCTAGTCTTGCAGGGGAGGAGTACTCAAATGTTCCCAAGATATTCAGTTCTTTTGCTATAAATTCTATTAACAATTTTATGGAAGCAAATTGCAGGAAGGTGGTCCAAA
AGGTTGCCAATCTATGCTCAGCCACTGAAAAACTAGAATGTGCTCCACTCAGCTATAATAAAGTAGTTTCTTTAGAAGATTTGGAGGAAAAATACGTCCTACCTCTCTGT
CCTCTCTCTCTAGCAGGAATAAACACATCATCAAACACCAAAGATCATATCTAA
Protein sequenceShow/hide protein sequence
MAAILQFRFRSSVGSLGPLFPLSSPTFLSRFTPVSLSFNSISTPIFRSGTFALRFRSKIRREHCPVAAVMLLPENPVVSDICAAALSGGVALALLRLWAETAKRGLDQKL
NRKLVHISIGLAFMLCWPMFSSGYRGAIFASLIPGVNIIRMLVLGLGILKDEATVKSMSRYGDYRELLKGPLYYVATITLGCIFYWRTSPISIALICNLCAGDGLADIVG
RRFGSEKIFYNKNKSLAGSLAMASAGFLASVGYMYYFSSFGYLEGSSKMVLGFLVVSLASALVESLPLSTEIDDNLTVPLTSFLAYIETHPASRSMSMPNEDNASSQTQL
SSNLFGDLLDSVIVDVASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPK
VTRSTTAAQSRYSRGNPVSAYSPYPNSTGTNRLPNGTSSLAGEEYSNVPKIFSSFAINSINNFMEANCRKVVQKVANLCSATEKLECAPLSYNKVVSLEDLEEKYVLPLC
PLSLAGINTSSNTKDHI