; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr029070 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr029070
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionCAAX amino terminal protease
Genome locationtig00153210:2971396..2989453
RNA-Seq ExpressionSgr029070
SyntenySgr029070
Gene Ontology termsGO:0008610 - lipid biosynthetic process (biological process)
GO:0071586 - CAAX-box protein processing (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004222 - metalloendopeptidase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0016491 - oxidoreductase activity (molecular function)
InterPro domainsIPR003675 - Type II CAAX prenyl endopeptidase Rce1-like
IPR006694 - Fatty acid hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601429.1 hypothetical protein SDJN03_06662, partial [Cucurbita argyrosperma subsp. sororia]1.9e-15584.24Show/hide
Query:  ESSIYCLPKSWKP-SSLKSTGKRTRKIVLFGYRNLSLRFSPRRSNSLSQNRFRPLCFFNAKDESGGDFQQKVRSSSSNSPLVFGKLAVEFVSREKYGNGA
        ESSIYCL   WKP SSLKSTGKRT KI+LF  RNLSLR SPRRSNSLS NRFRPLC FNAKDESGGDFQQK                         GNG 
Subjt:  ESSIYCLPKSWKP-SSLKSTGKRTRKIVLFGYRNLSLRFSPRRSNSLSQNRFRPLCFFNAKDESGGDFQQKVRSSSSNSPLVFGKLAVEFVSREKYGNGA

Query:  QWPIFKRWEVPWGWQTVSLTSLACGLSFIVTGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLPNDLYRYDIRDPLNLQK
         WPI +RWEVPWGWQTVSLTSLACGLS IVTGLVESAAIPYLGIRIEELSLDEKAEIL L+QGI TVAVLGI+YSIANTFQPLP+D+YRYDIRDPLNLQK
Subjt:  QWPIFKRWEVPWGWQTVSLTSLACGLSFIVTGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLPNDLYRYDIRDPLNLQK

Query:  GWLLWAGVGLVGALASIAATGAVLSSFSGGSPQRETDALVRLLPLIGSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKWVPTPVAVLISAAIFALA
        GWLLWAGVGLVGALASIA TGAVLSSF+GGS QRETDALVRLLPL+GSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKW+PTPVAVLISAA+FALA
Subjt:  GWLLWAGVGLVGALASIAATGAVLSSFSGGSPQRETDALVRLLPLIGSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKWVPTPVAVLISAAIFALA

Query:  HLTPGEFPQLFVLGTALGFSYAQTHNLLTPITIHALWNSGVILLLTFLS
        H TPGEFPQLFVLGTALGFSYAQTHNLLTPITIHALWNSGVILLLTFL+
Subjt:  HLTPGEFPQLFVLGTALGFSYAQTHNLLTPITIHALWNSGVILLLTFLS

KAG7013147.1 hypothetical protein SDJN02_25903 [Cucurbita argyrosperma subsp. argyrosperma]5.5e-15584.24Show/hide
Query:  ESSIYCLPKSWKP-SSLKSTGKRTRKIVLFGYRNLSLRFSPRRSNSLSQNRFRPLCFFNAKDESGGDFQQKVRSSSSNSPLVFGKLAVEFVSREKYGNGA
        ESSI+CL   WKP SSLKST K   KI+LFGYRNLSLR  PRRSNS  QN FRPLCFFNAKDESGGDFQQK                         GNGA
Subjt:  ESSIYCLPKSWKP-SSLKSTGKRTRKIVLFGYRNLSLRFSPRRSNSLSQNRFRPLCFFNAKDESGGDFQQKVRSSSSNSPLVFGKLAVEFVSREKYGNGA

Query:  QWPIFKRWEVPWGWQTVSLTSLACGLSFIVTGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLPNDLYRYDIRDPLNLQK
         WPI +RWEVPWGWQTVSLTSLACGLSFIVTGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLP DLYRYD+RDPLNLQK
Subjt:  QWPIFKRWEVPWGWQTVSLTSLACGLSFIVTGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLPNDLYRYDIRDPLNLQK

Query:  GWLLWAGVGLVGALASIAATGAVLSSFSGGSPQRETDALVRLLPLIGSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKWVPTPVAVLISAAIFALA
        GWLLWAGVGL GALASIAATGAVLSSF+GGSP+RETDALVRLLPLIGSSSISTACLVGI GVLAPVLEETVFRGFFMVSLTKWVPTP+A+LISAA+FALA
Subjt:  GWLLWAGVGLVGALASIAATGAVLSSFSGGSPQRETDALVRLLPLIGSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKWVPTPVAVLISAAIFALA

Query:  HLTPGEFPQLFVLGTALGFSYAQTHNLLTPITIHALWNSGVILLLTFLS
        HLTPGEFPQLFVLGTALG SYAQT NLLTPITIHALWNSGVILLLTFL+
Subjt:  HLTPGEFPQLFVLGTALGFSYAQTHNLLTPITIHALWNSGVILLLTFLS

XP_022151272.1 uncharacterized protein LOC111019231 [Momordica charantia]1.3e-15986.82Show/hide
Query:  ESSIYCLPKSWKP-SSLKSTGKRTRKIVLFGYRNLSLRFSPRRSNSLSQNRFRPLCFFNAKDESGGDFQQKVRSSSSNSPLVFGKLAVEFVSREKYGNGA
        ESSIYCL  SWKP SSLKST K T KIV FGYRNL L+FSPRRSNSLSQN FRPL FFNAKDESGGDFQQK                         GNG 
Subjt:  ESSIYCLPKSWKP-SSLKSTGKRTRKIVLFGYRNLSLRFSPRRSNSLSQNRFRPLCFFNAKDESGGDFQQKVRSSSSNSPLVFGKLAVEFVSREKYGNGA

Query:  QWPIFKRWEVPWGWQTVSLTSLACGLSFIVTGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLPNDLYRYDIRDPLNLQK
        +WPI KRWEVPWGWQTVSLTSLACGLSFIVTGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLP+DLYRYDI+DPLNLQK
Subjt:  QWPIFKRWEVPWGWQTVSLTSLACGLSFIVTGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLPNDLYRYDIRDPLNLQK

Query:  GWLLWAGVGLVGALASIAATGAVLSSFSGGSPQRETDALVRLLPLIGSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKWVPTPVAVLISAAIFALA
        GWLLWAGVGLVGALASIAATGAVLSSF+GGSPQRETDALVRLLPLIGSSSISTACLVGI GVLAPVLEETVFRGFFMVSLTKWVPTPVA+LISAA+FALA
Subjt:  GWLLWAGVGLVGALASIAATGAVLSSFSGGSPQRETDALVRLLPLIGSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKWVPTPVAVLISAAIFALA

Query:  HLTPGEFPQLFVLGTALGFSYAQTHNLLTPITIHALWNSGVILLLTFLS
        HLTPGEFPQLFVLGTALGFSYAQT NLLTPITIHALWNSGVILLLTFLS
Subjt:  HLTPGEFPQLFVLGTALGFSYAQTHNLLTPITIHALWNSGVILLLTFLS

XP_022957376.1 uncharacterized protein LOC111458793 isoform X2 [Cucurbita moschata]2.2e-15684.53Show/hide
Query:  ESSIYCLPKSWKP-SSLKSTGKRTRKIVLFGYRNLSLRFSPRRSNSLSQNRFRPLCFFNAKDESGGDFQQKVRSSSSNSPLVFGKLAVEFVSREKYGNGA
        ESSIYCL   WKP SSLKSTGKRT KI+LFG RNLSLR SPRRSNSLS NRFRPLC FNAKDESGGDFQQK                         GNG 
Subjt:  ESSIYCLPKSWKP-SSLKSTGKRTRKIVLFGYRNLSLRFSPRRSNSLSQNRFRPLCFFNAKDESGGDFQQKVRSSSSNSPLVFGKLAVEFVSREKYGNGA

Query:  QWPIFKRWEVPWGWQTVSLTSLACGLSFIVTGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLPNDLYRYDIRDPLNLQK
         WPI +RWEVPWGWQTVSLTSLACGLS IVTGLVESAAIPYLGIRIEELSLDEKAEIL L+QGI TVAVLGI+YSIANTFQPLP+D+YRYDIRDPLNLQK
Subjt:  QWPIFKRWEVPWGWQTVSLTSLACGLSFIVTGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLPNDLYRYDIRDPLNLQK

Query:  GWLLWAGVGLVGALASIAATGAVLSSFSGGSPQRETDALVRLLPLIGSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKWVPTPVAVLISAAIFALA
        GWLLWAGVGLVGALASIA TGAVLSSF+GGS QRETDALVRLLPL+GSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKW+PTPVAVLISAA+FALA
Subjt:  GWLLWAGVGLVGALASIAATGAVLSSFSGGSPQRETDALVRLLPLIGSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKWVPTPVAVLISAAIFALA

Query:  HLTPGEFPQLFVLGTALGFSYAQTHNLLTPITIHALWNSGVILLLTFLS
        H TPGEFPQLFVLGTALGFSYAQTHNLLTPITIHALWNSGVILLLTFL+
Subjt:  HLTPGEFPQLFVLGTALGFSYAQTHNLLTPITIHALWNSGVILLLTFLS

XP_038891439.1 uncharacterized protein LOC120080858 [Benincasa hispida]5.0e-15684.24Show/hide
Query:  ESSIYCLPKSWKPSS-LKSTGKRTRKIVLFGYRNLSLRFSPRRSNSLSQNRFRPLCFFNAKDESGGDFQQKVRSSSSNSPLVFGKLAVEFVSREKYGNGA
        ESSI+CL   WKPSS +K T KRT KI+LFGYRNLSLR S +RSNSLSQNRFRPL FFNAKDESGGDFQQK                         G G 
Subjt:  ESSIYCLPKSWKPSS-LKSTGKRTRKIVLFGYRNLSLRFSPRRSNSLSQNRFRPLCFFNAKDESGGDFQQKVRSSSSNSPLVFGKLAVEFVSREKYGNGA

Query:  QWPIFKRWEVPWGWQTVSLTSLACGLSFIVTGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLPNDLYRYDIRDPLNLQK
        +WP+ +RW+VPWGWQTVSLTSLACGLSFIVTGLVESAA+PYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLP+DLYRYDIRDPLNLQK
Subjt:  QWPIFKRWEVPWGWQTVSLTSLACGLSFIVTGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLPNDLYRYDIRDPLNLQK

Query:  GWLLWAGVGLVGALASIAATGAVLSSFSGGSPQRETDALVRLLPLIGSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKWVPTPVAVLISAAIFALA
        GWLLWAGVGLVGALASIA TGAVLSSF+GGS +RETDALVRLLPLIGSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKW+PTPVAVLISAA+FALA
Subjt:  GWLLWAGVGLVGALASIAATGAVLSSFSGGSPQRETDALVRLLPLIGSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKWVPTPVAVLISAAIFALA

Query:  HLTPGEFPQLFVLGTALGFSYAQTHNLLTPITIHALWNSGVILLLTFLS
        HLTPGEFPQLFVLGTALGFSYAQT NLLTPITIHALWNSGVILLLTFLS
Subjt:  HLTPGEFPQLFVLGTALGFSYAQTHNLLTPITIHALWNSGVILLLTFLS

TrEMBL top hitse value%identityAlignment
A0A5A7SY82 CAAX amino terminal protease1.1e-15383.38Show/hide
Query:  ESSIYCLPKSWKP-SSLKSTGKRTRKIVLFGYRNLSLRFSPRRSNSLSQNRFRPLCFFNAKDESGGDFQQKVRSSSSNSPLVFGKLAVEFVSREKYGNGA
        ESSI+CL   WKP SS+K  GKRT KI+LFGYRNLSLR SP RSNSLSQ+ FRPLCFFNAKDESGGDFQQK                          NGA
Subjt:  ESSIYCLPKSWKP-SSLKSTGKRTRKIVLFGYRNLSLRFSPRRSNSLSQNRFRPLCFFNAKDESGGDFQQKVRSSSSNSPLVFGKLAVEFVSREKYGNGA

Query:  QWPIFKRWEVPWGWQTVSLTSLACGLSFIVTGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLPNDLYRYDIRDPLNLQK
         WPI +RW+VPWGWQTVSLTSLACGLSFIVTGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTV VLGILYSIANTFQPLP+ LYRYDIRDPLNLQ+
Subjt:  QWPIFKRWEVPWGWQTVSLTSLACGLSFIVTGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLPNDLYRYDIRDPLNLQK

Query:  GWLLWAGVGLVGALASIAATGAVLSSFSGGSPQRETDALVRLLPLIGSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKWVPTPVAVLISAAIFALA
        GWLLWAGVGL GALASIA TGAVLSSF+GGS +RETDALVRLLPLIGSSSISTACLVGITGVLAPVLEETVFRGF MVSLTKW+PTPVAVLI+AA+FALA
Subjt:  GWLLWAGVGLVGALASIAATGAVLSSFSGGSPQRETDALVRLLPLIGSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKWVPTPVAVLISAAIFALA

Query:  HLTPGEFPQLFVLGTALGFSYAQTHNLLTPITIHALWNSGVILLLTFLS
        HLTPGEFPQLFVLGTALGFSYAQT NLLTPITIHALWNSGVILLLTFLS
Subjt:  HLTPGEFPQLFVLGTALGFSYAQTHNLLTPITIHALWNSGVILLLTFLS

A0A6J1DD10 uncharacterized protein LOC1110192316.1e-16086.82Show/hide
Query:  ESSIYCLPKSWKP-SSLKSTGKRTRKIVLFGYRNLSLRFSPRRSNSLSQNRFRPLCFFNAKDESGGDFQQKVRSSSSNSPLVFGKLAVEFVSREKYGNGA
        ESSIYCL  SWKP SSLKST K T KIV FGYRNL L+FSPRRSNSLSQN FRPL FFNAKDESGGDFQQK                         GNG 
Subjt:  ESSIYCLPKSWKP-SSLKSTGKRTRKIVLFGYRNLSLRFSPRRSNSLSQNRFRPLCFFNAKDESGGDFQQKVRSSSSNSPLVFGKLAVEFVSREKYGNGA

Query:  QWPIFKRWEVPWGWQTVSLTSLACGLSFIVTGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLPNDLYRYDIRDPLNLQK
        +WPI KRWEVPWGWQTVSLTSLACGLSFIVTGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLP+DLYRYDI+DPLNLQK
Subjt:  QWPIFKRWEVPWGWQTVSLTSLACGLSFIVTGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLPNDLYRYDIRDPLNLQK

Query:  GWLLWAGVGLVGALASIAATGAVLSSFSGGSPQRETDALVRLLPLIGSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKWVPTPVAVLISAAIFALA
        GWLLWAGVGLVGALASIAATGAVLSSF+GGSPQRETDALVRLLPLIGSSSISTACLVGI GVLAPVLEETVFRGFFMVSLTKWVPTPVA+LISAA+FALA
Subjt:  GWLLWAGVGLVGALASIAATGAVLSSFSGGSPQRETDALVRLLPLIGSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKWVPTPVAVLISAAIFALA

Query:  HLTPGEFPQLFVLGTALGFSYAQTHNLLTPITIHALWNSGVILLLTFLS
        HLTPGEFPQLFVLGTALGFSYAQT NLLTPITIHALWNSGVILLLTFLS
Subjt:  HLTPGEFPQLFVLGTALGFSYAQTHNLLTPITIHALWNSGVILLLTFLS

A0A6J1FZL0 uncharacterized protein LOC1114493367.7e-15583.95Show/hide
Query:  ESSIYCLPKSWKP-SSLKSTGKRTRKIVLFGYRNLSLRFSPRRSNSLSQNRFRPLCFFNAKDESGGDFQQKVRSSSSNSPLVFGKLAVEFVSREKYGNGA
        ESSI+CL   WKP S LKST K   KI+LFGYRNLSLR  PRRSNS  QN FRPLCFFNAKDESGGDFQQK                         GNGA
Subjt:  ESSIYCLPKSWKP-SSLKSTGKRTRKIVLFGYRNLSLRFSPRRSNSLSQNRFRPLCFFNAKDESGGDFQQKVRSSSSNSPLVFGKLAVEFVSREKYGNGA

Query:  QWPIFKRWEVPWGWQTVSLTSLACGLSFIVTGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLPNDLYRYDIRDPLNLQK
         WPI +RWEVPWGWQTVSLTSLACGLSFIVTGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLP DLYRYD+RDPLNLQK
Subjt:  QWPIFKRWEVPWGWQTVSLTSLACGLSFIVTGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLPNDLYRYDIRDPLNLQK

Query:  GWLLWAGVGLVGALASIAATGAVLSSFSGGSPQRETDALVRLLPLIGSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKWVPTPVAVLISAAIFALA
        GWLLWAGVGL GALASIAATGAVLSSF+GGSP+RETDALVRLLPLIGSSSISTACLVGI GVLAPVLEETVFRGFFMVSLTKWVPTP+A+LISAA+FALA
Subjt:  GWLLWAGVGLVGALASIAATGAVLSSFSGGSPQRETDALVRLLPLIGSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKWVPTPVAVLISAAIFALA

Query:  HLTPGEFPQLFVLGTALGFSYAQTHNLLTPITIHALWNSGVILLLTFLS
        HLTPGEFPQLFVLGTALG SYAQT NLLTPITIHALWNSGVILLLTFL+
Subjt:  HLTPGEFPQLFVLGTALGFSYAQTHNLLTPITIHALWNSGVILLLTFLS

A0A6J1GZ20 uncharacterized protein LOC111458793 isoform X21.1e-15684.53Show/hide
Query:  ESSIYCLPKSWKP-SSLKSTGKRTRKIVLFGYRNLSLRFSPRRSNSLSQNRFRPLCFFNAKDESGGDFQQKVRSSSSNSPLVFGKLAVEFVSREKYGNGA
        ESSIYCL   WKP SSLKSTGKRT KI+LFG RNLSLR SPRRSNSLS NRFRPLC FNAKDESGGDFQQK                         GNG 
Subjt:  ESSIYCLPKSWKP-SSLKSTGKRTRKIVLFGYRNLSLRFSPRRSNSLSQNRFRPLCFFNAKDESGGDFQQKVRSSSSNSPLVFGKLAVEFVSREKYGNGA

Query:  QWPIFKRWEVPWGWQTVSLTSLACGLSFIVTGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLPNDLYRYDIRDPLNLQK
         WPI +RWEVPWGWQTVSLTSLACGLS IVTGLVESAAIPYLGIRIEELSLDEKAEIL L+QGI TVAVLGI+YSIANTFQPLP+D+YRYDIRDPLNLQK
Subjt:  QWPIFKRWEVPWGWQTVSLTSLACGLSFIVTGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLPNDLYRYDIRDPLNLQK

Query:  GWLLWAGVGLVGALASIAATGAVLSSFSGGSPQRETDALVRLLPLIGSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKWVPTPVAVLISAAIFALA
        GWLLWAGVGLVGALASIA TGAVLSSF+GGS QRETDALVRLLPL+GSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKW+PTPVAVLISAA+FALA
Subjt:  GWLLWAGVGLVGALASIAATGAVLSSFSGGSPQRETDALVRLLPLIGSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKWVPTPVAVLISAAIFALA

Query:  HLTPGEFPQLFVLGTALGFSYAQTHNLLTPITIHALWNSGVILLLTFLS
        H TPGEFPQLFVLGTALGFSYAQTHNLLTPITIHALWNSGVILLLTFL+
Subjt:  HLTPGEFPQLFVLGTALGFSYAQTHNLLTPITIHALWNSGVILLLTFLS

A0A6J1HWZ6 uncharacterized protein LOC1114674211.3e-15483.62Show/hide
Query:  ESSIYCLPKSWKPSSLKSTGKRTRKIVLFGYRNLSLRFSPRRSNSLSQNRFRPLCFFNAKDESGGDFQQKVRSSSSNSPLVFGKLAVEFVSREKYGNGAQ
        ESSI+CL   WKPSSLKST K   K +LFGYRNL LR  PRRSNS  QN FRPLCFFNAKDESGGDFQQK                         GNGA 
Subjt:  ESSIYCLPKSWKPSSLKSTGKRTRKIVLFGYRNLSLRFSPRRSNSLSQNRFRPLCFFNAKDESGGDFQQKVRSSSSNSPLVFGKLAVEFVSREKYGNGAQ

Query:  WPIFKRWEVPWGWQTVSLTSLACGLSFIVTGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLPNDLYRYDIRDPLNLQKG
        WPI +RWEVPWGWQTVS TSLACGLSFIVTGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLP DLYRYD+R+PLNLQKG
Subjt:  WPIFKRWEVPWGWQTVSLTSLACGLSFIVTGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLPNDLYRYDIRDPLNLQKG

Query:  WLLWAGVGLVGALASIAATGAVLSSFSGGSPQRETDALVRLLPLIGSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKWVPTPVAVLISAAIFALAH
        WLLWAGVGL GALASIAATGAVLSSF+GGSP+RETDALVRLLPLIGSSSISTACLVGI GVLAPVLEETVFRGFFMVSLTKWVPTPVA+LISAA+FALAH
Subjt:  WLLWAGVGLVGALASIAATGAVLSSFSGGSPQRETDALVRLLPLIGSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKWVPTPVAVLISAAIFALAH

Query:  LTPGEFPQLFVLGTALGFSYAQTHNLLTPITIHALWNSGVILLLTFLS
        LTPGEFPQLFVLGTALG SYAQT NLLTPITIHALWNSGVILLLTFL+
Subjt:  LTPGEFPQLFVLGTALGFSYAQTHNLLTPITIHALWNSGVILLLTFLS

SwissProt top hitse value%identityAlignment
B8B4W4 Very-long-chain aldehyde decarbonylase GL1-91.3e-7164.18Show/hide
Query:  ISDQFLGTFVPIVVYWVYSGIYILL---GSFENYRLHSKKDEQEKNLVSKSTVVRGVFLQQAIQAIVAIILFKVTGNDGEVATGPKPGLTIIVQFIIAML
        +SD+ +GTF PI +YWVY+G Y L+      E YRLH++ +E+EKNLV+   VVRGV LQQ +QAIVA+ILF VT +   V   P P +    QF++AML
Subjt:  ISDQFLGTFVPIVVYWVYSGIYILL---GSFENYRLHSKKDEQEKNLVSKSTVVRGVFLQQAIQAIVAIILFKVTGNDGEVATGPKPGLTIIVQFIIAML

Query:  VLDTWQYFIHRYMHQNKFLYKHVHSQHHRLVVPYAFGALYNHPLEGLLLDTIGGALSFLVSGMSPRVAIFFFSFATIKTVDDHCGLWLPGNLFHVFFRNN
        V+D+WQYF+HRYMHQNKFLY+H+HSQHHRL+VPYA GALYNHPLEGLLLDT+GGA+SFLVSGM+PR ++FFF FA +KTVDDHCGLWLP N+F   F+NN
Subjt:  VLDTWQYFIHRYMHQNKFLYKHVHSQHHRLVVPYAFGALYNHPLEGLLLDTIGGALSFLVSGMSPRVAIFFFSFATIKTVDDHCGLWLPGNLFHVFFRNN

Query:  S
        +
Subjt:  S

O94298 Sphingolipid C4-hydroxylase sur21.0e-3939.82Show/hide
Query:  PIVVYWVYSGI-----YILLGSFENYRLHSKKDEQEKNLVSKSTVVRGVFLQQAIQAIVAIILFKVTGNDGEV------------------------ATG
        P+++YWV S       YI L  FE YR+H  ++   +N V +  VV+ V  QQ  + +V I L    G    +                           
Subjt:  PIVVYWVYSGI-----YILLGSFENYRLHSKKDEQEKNLVSKSTVVRGVFLQQAIQAIVAIILFKVTGNDGEV------------------------ATG

Query:  PKPGLTIIV---QFIIAMLVLDTWQYFIHRYMHQNKFLYKHVHSQHHRLVVPYAFGALYNHPLEGLLLDTIGGALSFLVSGMSPRVAIFFFSFATIKTVD
        PK     IV   Q+  A  ++D+WQYF HRY+H NK LY  +H+ HHRL VPYA GALYNHP EGL+LDT G  +++L +G+SP+ A+ FF+ +T+KTVD
Subjt:  PKPGLTIIV---QFIIAMLVLDTWQYFIHRYMHQNKFLYKHVHSQHHRLVVPYAFGALYNHPLEGLLLDTIGGALSFLVSGMSPRVAIFFFSFATIKTVD

Query:  DHCGLWLPGNLFHVFFRNNSR
        DHCG   P +   +FF NN+R
Subjt:  DHCGLWLPGNLFHVFFRNNSR

Q0D4G3 Very-long-chain aldehyde decarbonylase GL1-91.3e-7164.18Show/hide
Query:  ISDQFLGTFVPIVVYWVYSGIYILL---GSFENYRLHSKKDEQEKNLVSKSTVVRGVFLQQAIQAIVAIILFKVTGNDGEVATGPKPGLTIIVQFIIAML
        +SD+ +GTF PI +YWVY+G Y L+      E YRLH++ +E+EKNLV+   VVRGV LQQ +QAIVA+ILF VT +   V   P P +    QF++AML
Subjt:  ISDQFLGTFVPIVVYWVYSGIYILL---GSFENYRLHSKKDEQEKNLVSKSTVVRGVFLQQAIQAIVAIILFKVTGNDGEVATGPKPGLTIIVQFIIAML

Query:  VLDTWQYFIHRYMHQNKFLYKHVHSQHHRLVVPYAFGALYNHPLEGLLLDTIGGALSFLVSGMSPRVAIFFFSFATIKTVDDHCGLWLPGNLFHVFFRNN
        V+D+WQYF+HRYMHQNKFLY+H+HSQHHRL+VPYA GALYNHPLEGLLLDT+GGA+SFLVSGM+PR ++FFF FA +KTVDDHCGLWLP N+F   F+NN
Subjt:  VLDTWQYFIHRYMHQNKFLYKHVHSQHHRLVVPYAFGALYNHPLEGLLLDTIGGALSFLVSGMSPRVAIFFFSFATIKTVDDHCGLWLPGNLFHVFFRNN

Query:  S
        +
Subjt:  S

Q8VYI1 Sphinganine C4-monooxygenase 12.8e-8573.4Show/hide
Query:  MGIEISDQFLGTFVPIVVYWVYSGIYILLGSFENYRLHSKKDEQEKNLVSKSTVVRGVFLQQAIQAIVAIILFKVTGNDGEVATGPKPGLTIIV-QFIIA
        MG  +SD+ LGT  PIVVYW+YSGIY+ L S E+YRLHSK +E+EKNLVSKS+VV+GV +QQ +QA+VAI+LF VTG+D E     +    ++  QF+ A
Subjt:  MGIEISDQFLGTFVPIVVYWVYSGIYILLGSFENYRLHSKKDEQEKNLVSKSTVVRGVFLQQAIQAIVAIILFKVTGNDGEVATGPKPGLTIIV-QFIIA

Query:  MLVLDTWQYFIHRYMHQNKFLYKHVHSQHHRLVVPYAFGALYNHPLEGLLLDTIGGALSFLVSGMSPRVAIFFFSFATIKTVDDHCGLWLPGNLFHVFFR
        M+VLDTWQYF+HRYMHQNKFLYKH+HSQHHRL+VPYA+GALYNHP+EGLLLDTIGGALSFLVSGMSPR +IFFFSFATIKTVDDHCGLWLPGNLFH+ F+
Subjt:  MLVLDTWQYFIHRYMHQNKFLYKHVHSQHHRLVVPYAFGALYNHPLEGLLLDTIGGALSFLVSGMSPRVAIFFFSFATIKTVDDHCGLWLPGNLFHVFFR

Query:  NNS
        NNS
Subjt:  NNS

Q9AST3 Sphinganine C4-monooxygenase 26.0e-8876.85Show/hide
Query:  MGIEISDQFLGTFVPIVVYWVYSGIYILLGSFENYRLHSKKDEQEKNLVSKSTVVRGVFLQQAIQAIVAIILFKVTGNDGEVATGPKPGLTIIV-QFIIA
        M   ISD+FLGTFVPI+VYWVYSG+YI LGS + YRLHSK DE EKNLVSKS VV+GV LQQ +QAI+++ILFK+TG+D + AT  +  + ++  QFIIA
Subjt:  MGIEISDQFLGTFVPIVVYWVYSGIYILLGSFENYRLHSKKDEQEKNLVSKSTVVRGVFLQQAIQAIVAIILFKVTGNDGEVATGPKPGLTIIV-QFIIA

Query:  MLVLDTWQYFIHRYMHQNKFLYKHVHSQHHRLVVPYAFGALYNHPLEGLLLDTIGGALSFLVSGMSPRVAIFFFSFATIKTVDDHCGLWLPGNLFHVFFR
        MLV+DTWQYFIHRYMH NKFLYKH+HSQHHRL+VPY++GALYNHPLEGLLLDTIGGALSFL SGMSPR AIFFFSFATIKTVDDHCGLWLPGN FH+FF 
Subjt:  MLVLDTWQYFIHRYMHQNKFLYKHVHSQHHRLVVPYAFGALYNHPLEGLLLDTIGGALSFLVSGMSPRVAIFFFSFATIKTVDDHCGLWLPGNLFHVFFR

Query:  NNS
        NNS
Subjt:  NNS

Arabidopsis top hitse value%identityAlignment
AT1G14270.1 CAAX amino terminal protease family protein3.5e-10760.38Show/hide
Query:  GYRNLSLRFSPRRSNSLSQNRFRPLCFFNAKDESGGDFQQKVRSSSSNSPLVFGKLAVEFVSREKYGNGAQWPIFKRWEVPWGWQTVSLTSLACGLSFIV
        G    S  F+ RR  ++++ ++R  C FN+  E GG+                GKL          G  ++WPI +RWEVPWGWQTVSLTS AC LSF++
Subjt:  GYRNLSLRFSPRRSNSLSQNRFRPLCFFNAKDESGGDFQQKVRSSSSNSPLVFGKLAVEFVSREKYGNGAQWPIFKRWEVPWGWQTVSLTSLACGLSFIV

Query:  TGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLPNDLYRYDIRDPLNLQKGWLLWAGVGLVGALASIAATGAVLSSFSGG
        TGL E A IP+LGI +E+L+LD+KAEILFLDQG+TT  VL +++++A TF PLP D+ RYD+R P+NLQKGWL+W G+GLVGA+ +IA TG VLS F   
Subjt:  TGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLPNDLYRYDIRDPLNLQKGWLLWAGVGLVGALASIAATGAVLSSFSGG

Query:  SPQRETDALVRLLPLIGSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKWVPTPVAVLISAAIFALAHLTPGEFPQLFVLGTALGFSYAQTHNLLTP
        +P+RE D+L++LLPLIGSS+IST  LVGITG+LAP+LEETVFRGFFMVSLTKWVPTP+A++IS+A FALAH TPGEFPQLF+LG+ LG SYAQT NL+TP
Subjt:  SPQRETDALVRLLPLIGSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKWVPTPVAVLISAAIFALAHLTPGEFPQLFVLGTALGFSYAQTHNLLTP

Query:  ITIHALWNSGVILLLTFL
        + IH  WNSGVILLLTFL
Subjt:  ITIHALWNSGVILLLTFL

AT1G14270.2 CAAX amino terminal protease family protein2.8e-8869.82Show/hide
Query:  SFIVTGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLPNDLYRYDIRDPLNLQKGWLLWAGVGLVGALASIAATGAVLSS
        SF++TGL E A IP+LGI +E+L+LD+KAEILFLDQG+TT  VL +++++A TF PLP D+ RYD+R P+NLQKGWL+W G+GLVGA+ +IA TG VLS 
Subjt:  SFIVTGLVESAAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLPNDLYRYDIRDPLNLQKGWLLWAGVGLVGALASIAATGAVLSS

Query:  FSGGSPQRETDALVRLLPLIGSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKWVPTPVAVLISAAIFALAHLTPGEFPQLFVLGTALGFSYAQTHN
        F   +P+RE D+L++LLPLIGSS+IST  LVGITG+LAP+LEETVFRGFFMVSLTKWVPTP+A++IS+A FALAH TPGEFPQLF+LG+ LG SYAQT N
Subjt:  FSGGSPQRETDALVRLLPLIGSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKWVPTPVAVLISAAIFALAHLTPGEFPQLFVLGTALGFSYAQTHN

Query:  LLTPITIHALWNSGVILLLTFL
        L+TP+ IH  WNSGVILLLTFL
Subjt:  LLTPITIHALWNSGVILLLTFL

AT1G14270.3 CAAX amino terminal protease family protein1.4e-8470.28Show/hide
Query:  AAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLPNDLYRYDIRDPLNLQKGWLLWAGVGLVGALASIAATGAVLSSFSGGSPQRET
        A IP+LGI +E+L+LD+KAEILFLDQG+TT  VL +++++A TF PLP D+ RYD+R P+NLQKGWL+W G+GLVGA+ +IA TG VLS F   +P+RE 
Subjt:  AAIPYLGIRIEELSLDEKAEILFLDQGITTVAVLGILYSIANTFQPLPNDLYRYDIRDPLNLQKGWLLWAGVGLVGALASIAATGAVLSSFSGGSPQRET

Query:  DALVRLLPLIGSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKWVPTPVAVLISAAIFALAHLTPGEFPQLFVLGTALGFSYAQTHNLLTPITIHAL
        D+L++LLPLIGSS+IST  LVGITG+LAP+LEETVFRGFFMVSLTKWVPTP+A++IS+A FALAH TPGEFPQLF+LG+ LG SYAQT NL+TP+ IH  
Subjt:  DALVRLLPLIGSSSISTACLVGITGVLAPVLEETVFRGFFMVSLTKWVPTPVAVLISAAIFALAHLTPGEFPQLFVLGTALGFSYAQTHNLLTPITIHAL

Query:  WNSGVILLLTFL
        WNSGVILLLTFL
Subjt:  WNSGVILLLTFL

AT1G14290.1 sphingoid base hydroxylase 24.3e-8976.85Show/hide
Query:  MGIEISDQFLGTFVPIVVYWVYSGIYILLGSFENYRLHSKKDEQEKNLVSKSTVVRGVFLQQAIQAIVAIILFKVTGNDGEVATGPKPGLTIIV-QFIIA
        M   ISD+FLGTFVPI+VYWVYSG+YI LGS + YRLHSK DE EKNLVSKS VV+GV LQQ +QAI+++ILFK+TG+D + AT  +  + ++  QFIIA
Subjt:  MGIEISDQFLGTFVPIVVYWVYSGIYILLGSFENYRLHSKKDEQEKNLVSKSTVVRGVFLQQAIQAIVAIILFKVTGNDGEVATGPKPGLTIIV-QFIIA

Query:  MLVLDTWQYFIHRYMHQNKFLYKHVHSQHHRLVVPYAFGALYNHPLEGLLLDTIGGALSFLVSGMSPRVAIFFFSFATIKTVDDHCGLWLPGNLFHVFFR
        MLV+DTWQYFIHRYMH NKFLYKH+HSQHHRL+VPY++GALYNHPLEGLLLDTIGGALSFL SGMSPR AIFFFSFATIKTVDDHCGLWLPGN FH+FF 
Subjt:  MLVLDTWQYFIHRYMHQNKFLYKHVHSQHHRLVVPYAFGALYNHPLEGLLLDTIGGALSFLVSGMSPRVAIFFFSFATIKTVDDHCGLWLPGNLFHVFFR

Query:  NNS
        NNS
Subjt:  NNS

AT1G69640.1 sphingoid base hydroxylase 12.0e-8673.4Show/hide
Query:  MGIEISDQFLGTFVPIVVYWVYSGIYILLGSFENYRLHSKKDEQEKNLVSKSTVVRGVFLQQAIQAIVAIILFKVTGNDGEVATGPKPGLTIIV-QFIIA
        MG  +SD+ LGT  PIVVYW+YSGIY+ L S E+YRLHSK +E+EKNLVSKS+VV+GV +QQ +QA+VAI+LF VTG+D E     +    ++  QF+ A
Subjt:  MGIEISDQFLGTFVPIVVYWVYSGIYILLGSFENYRLHSKKDEQEKNLVSKSTVVRGVFLQQAIQAIVAIILFKVTGNDGEVATGPKPGLTIIV-QFIIA

Query:  MLVLDTWQYFIHRYMHQNKFLYKHVHSQHHRLVVPYAFGALYNHPLEGLLLDTIGGALSFLVSGMSPRVAIFFFSFATIKTVDDHCGLWLPGNLFHVFFR
        M+VLDTWQYF+HRYMHQNKFLYKH+HSQHHRL+VPYA+GALYNHP+EGLLLDTIGGALSFLVSGMSPR +IFFFSFATIKTVDDHCGLWLPGNLFH+ F+
Subjt:  MLVLDTWQYFIHRYMHQNKFLYKHVHSQHHRLVVPYAFGALYNHPLEGLLLDTIGGALSFLVSGMSPRVAIFFFSFATIKTVDDHCGLWLPGNLFHVFFR

Query:  NNS
        NNS
Subjt:  NNS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGCTGGAGACTTCACAGTAGCTATGGAAGTTTCGCAAAGCTTGAGATGTGTTTATATGTACGAGACCGCAGCTTCGGTGCGGCGATTCAACGCATACGCCACGTC
TGCTTGGAAACTACGCTTCACAATCGAAGGACAGATCAGGCGAAGCTGCGGTGGGCGGCGTACATCGGGGCTTGTGGGGTGCGGTGAGGCTTGCAGTGGGCGGCGTGTGG
CGGGGCTTGCGACAAAGGGTGCGGTGGGCAGCGTGAGTCCACGAAGCGCAGAATCACAGCCCTACATCAAGTTGGGCCCCGTTTTGACTCTTTGCCAGACGCCACGTGGA
ACACCTCCTAATCCAATTCTCTCTCCCACGACGCCATTTTCATCAGGCTGGGTTGGAAAGTGGACAAAGAAACGACGGAGCTGCGATTGGGATTTTGGGGGGCATTTGGT
TCCTAATGCAATTCCGGCTTTTTTTGGGGTTGAGTATTTGATCGGTTGTAGGTTCTTTTCCAAGATGGGTATTGAAATCTCTGATCAATTCTTGGGCACTTTTGTACCAA
TTGTGGTTTATTGGGTATATTCTGGAATTTATATTCTGTTAGGTTCATTTGAGAACTATCGGTTGCACTCTAAGAAAGATGAGCAGGAGAAGAACTTAGTATCCAAAAGC
ACTGTGGTTAGAGGCGTTTTCCTTCAACAGGCTATTCAGGCAATCGTTGCGATCATCTTGTTTAAGGTGACTGGAAATGATGGGGAGGTTGCTACAGGTCCAAAACCTGG
CCTAACAATCATTGTTCAGTTTATCATTGCGATGCTGGTGTTGGACACGTGGCAATACTTTATTCACAGATACATGCATCAGAATAAGTTCTTATACAAGCACGTCCATT
CCCAGCATCACCGGTTGGTTGTACCTTACGCATTTGGAGCTTTATACAACCATCCCTTGGAGGGTCTCCTCCTCGACACAATTGGTGGGGCTTTATCTTTTCTCGTTTCC
GGGATGTCACCTCGAGTTGCCATCTTTTTCTTTTCATTTGCCACCATCAAAACAGTGGATGATCATTGTGGATTATGGCTTCCTGGAAACCTTTTCCACGTGTTTTTCAG
AAACAATTCGCGTATCATGATGTTCACCATCAGCTATATGGCAGCTAAACATACAAAATATTGGAAATTCGGGGCGAATACTGACGAGGAAGCGCATTCGAACCAAAACC
CACAATCTTCGTCTCAGCTTCGAACCTCGTCCGTCGCCCAAATGACTCCAACCTCTGTCGAACTTGTGCCTCAGAACCGATCACCTCCCCGTTCGTCACTTAGATTTGAT
CATGTACGTGGTGGCGCAGATGTTTCTGACTGGACTCGTCGTCGACAGCCGCCGTTCATCGTAGTCCGACATGATGGAGTAGTCGGCGGCGCTGGCGGTGATGACACTCC
ACTCGATGCTCGCCTCGCTTGGTGCATAACAAGTTGTGGGTGTGACACATTCTGTGCAATCAGAGCCCTGTCTCGCAAGAAAAGAGTTGGAGTTCCCCGTCAAGCTCTCT
ATCTCGAAAAGTCAGCATCGATAGCCTCTTCTCAAGGTGGTTCAGACTGTCGACGTTTGGCGGCTTCCCCTCCATTAGGGAAGATCCAAAGACCTCTATTGACTTGCGCG
CTTGCTGATCTTCTACTTGAATTTGCATTTGCAGGGAGTTCTTGTCGCAGCAAGAGCATTTATAGAGAAAACCCTTTGATGGTGACGACTTACTCGGCTTGGCCGCCGCA
CGTGAAAACGAACACTTGGGGTTCCAGAGCGGTTAGTTTTTGGCTTCTTGGGGTGAAGATCTATTTGTTGGGGCTTCTTGCACTGCTGTTTTCTCGGCCGCAACTTGAGA
GGCTTGGGGCTGTCCATTCCTCCATTAAAGTATTTCTCTGCTCCAAAGACTTCGATGTCTCCATCCTCTTCCTGGGTGGTGGTAGTGATGGTGGGATTGGACCGACCTCG
AGTGGGCTCGGCAAGTTTGCGCACAAACTCCTTCTCTGCATGATTCAAGTAGGAAGAGAAAGAGGCTTCGCGAAGGTTATTATTGTTGTTCTCGAAGGACAACCTCTGGG
AGAGGCGATGATGGTGCATGAGATAGAGGCTGTCAACATCGCTCAAGTGAGAGAGCCCGAGACTGAATCTTCAATCTATTGTCTTCCCAAGTCATGGAAGCCGTCTTCTT
TGAAATCCACGGGGAAACGAACTCGTAAGATTGTACTCTTTGGTTACCGAAACTTGTCCCTGCGCTTTTCACCCCGACGTTCGAATTCCTTGTCCCAAAATCGGTTTCGT
CCGCTTTGCTTCTTCAATGCTAAAGATGAATCCGGCGGCGATTTCCAACAAAAGGTACGGTCTTCCTCATCTAATTCGCCATTAGTTTTCGGCAAACTTGCTGTTGAATT
TGTTTCCCGAGAAAAGTACGGAAATGGGGCGCAATGGCCTATCTTCAAGCGGTGGGAGGTGCCATGGGGATGGCAAACAGTTTCATTAACTTCACTTGCTTGTGGATTAA
GTTTCATTGTGACAGGATTGGTTGAATCTGCTGCTATACCTTATCTTGGTATTCGCATTGAAGAGCTAAGCTTAGACGAGAAGGCTGAAATACTTTTCCTGGATCAAGGA
ATTACAACTGTGGCAGTGCTTGGGATCTTATACAGTATTGCCAACACTTTCCAACCACTTCCCAATGACTTGTATCGCTATGATATTAGGGACCCCCTGAATCTGCAGAA
AGGCTGGCTCTTATGGGCAGGAGTAGGTCTGGTTGGTGCCTTAGCTTCTATTGCAGCGACAGGAGCTGTCTTGTCTTCATTTAGCGGTGGGAGCCCTCAAAGAGAGACAG
ACGCTCTAGTACGCTTGCTACCACTAATTGGATCTTCCAGCATCAGCACTGCTTGTCTGGTGGGCATCACAGGTGTCCTTGCTCCAGTTCTTGAAGAGACTGTGTTCCGG
GGATTTTTTATGGTGTCCCTTACCAAATGGGTTCCCACACCAGTTGCCGTCCTGATTAGCGCAGCCATATTTGCCCTTGCACATCTCACTCCTGGAGAGTTTCCCCAGCT
CTTTGTGCTTGGAACTGCTCTGGGATTTTCATATGCTCAAACTCACAACCTCTTGACCCCCATCACCATACATGCTTTGTGGAACTCGGGTGTCATTTTGCTTCTTACCT
TCCTTTCG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGCTGGAGACTTCACAGTAGCTATGGAAGTTTCGCAAAGCTTGAGATGTGTTTATATGTACGAGACCGCAGCTTCGGTGCGGCGATTCAACGCATACGCCACGTC
TGCTTGGAAACTACGCTTCACAATCGAAGGACAGATCAGGCGAAGCTGCGGTGGGCGGCGTACATCGGGGCTTGTGGGGTGCGGTGAGGCTTGCAGTGGGCGGCGTGTGG
CGGGGCTTGCGACAAAGGGTGCGGTGGGCAGCGTGAGTCCACGAAGCGCAGAATCACAGCCCTACATCAAGTTGGGCCCCGTTTTGACTCTTTGCCAGACGCCACGTGGA
ACACCTCCTAATCCAATTCTCTCTCCCACGACGCCATTTTCATCAGGCTGGGTTGGAAAGTGGACAAAGAAACGACGGAGCTGCGATTGGGATTTTGGGGGGCATTTGGT
TCCTAATGCAATTCCGGCTTTTTTTGGGGTTGAGTATTTGATCGGTTGTAGGTTCTTTTCCAAGATGGGTATTGAAATCTCTGATCAATTCTTGGGCACTTTTGTACCAA
TTGTGGTTTATTGGGTATATTCTGGAATTTATATTCTGTTAGGTTCATTTGAGAACTATCGGTTGCACTCTAAGAAAGATGAGCAGGAGAAGAACTTAGTATCCAAAAGC
ACTGTGGTTAGAGGCGTTTTCCTTCAACAGGCTATTCAGGCAATCGTTGCGATCATCTTGTTTAAGGTGACTGGAAATGATGGGGAGGTTGCTACAGGTCCAAAACCTGG
CCTAACAATCATTGTTCAGTTTATCATTGCGATGCTGGTGTTGGACACGTGGCAATACTTTATTCACAGATACATGCATCAGAATAAGTTCTTATACAAGCACGTCCATT
CCCAGCATCACCGGTTGGTTGTACCTTACGCATTTGGAGCTTTATACAACCATCCCTTGGAGGGTCTCCTCCTCGACACAATTGGTGGGGCTTTATCTTTTCTCGTTTCC
GGGATGTCACCTCGAGTTGCCATCTTTTTCTTTTCATTTGCCACCATCAAAACAGTGGATGATCATTGTGGATTATGGCTTCCTGGAAACCTTTTCCACGTGTTTTTCAG
AAACAATTCGCGTATCATGATGTTCACCATCAGCTATATGGCAGCTAAACATACAAAATATTGGAAATTCGGGGCGAATACTGACGAGGAAGCGCATTCGAACCAAAACC
CACAATCTTCGTCTCAGCTTCGAACCTCGTCCGTCGCCCAAATGACTCCAACCTCTGTCGAACTTGTGCCTCAGAACCGATCACCTCCCCGTTCGTCACTTAGATTTGAT
CATGTACGTGGTGGCGCAGATGTTTCTGACTGGACTCGTCGTCGACAGCCGCCGTTCATCGTAGTCCGACATGATGGAGTAGTCGGCGGCGCTGGCGGTGATGACACTCC
ACTCGATGCTCGCCTCGCTTGGTGCATAACAAGTTGTGGGTGTGACACATTCTGTGCAATCAGAGCCCTGTCTCGCAAGAAAAGAGTTGGAGTTCCCCGTCAAGCTCTCT
ATCTCGAAAAGTCAGCATCGATAGCCTCTTCTCAAGGTGGTTCAGACTGTCGACGTTTGGCGGCTTCCCCTCCATTAGGGAAGATCCAAAGACCTCTATTGACTTGCGCG
CTTGCTGATCTTCTACTTGAATTTGCATTTGCAGGGAGTTCTTGTCGCAGCAAGAGCATTTATAGAGAAAACCCTTTGATGGTGACGACTTACTCGGCTTGGCCGCCGCA
CGTGAAAACGAACACTTGGGGTTCCAGAGCGGTTAGTTTTTGGCTTCTTGGGGTGAAGATCTATTTGTTGGGGCTTCTTGCACTGCTGTTTTCTCGGCCGCAACTTGAGA
GGCTTGGGGCTGTCCATTCCTCCATTAAAGTATTTCTCTGCTCCAAAGACTTCGATGTCTCCATCCTCTTCCTGGGTGGTGGTAGTGATGGTGGGATTGGACCGACCTCG
AGTGGGCTCGGCAAGTTTGCGCACAAACTCCTTCTCTGCATGATTCAAGTAGGAAGAGAAAGAGGCTTCGCGAAGGTTATTATTGTTGTTCTCGAAGGACAACCTCTGGG
AGAGGCGATGATGGTGCATGAGATAGAGGCTGTCAACATCGCTCAAGTGAGAGAGCCCGAGACTGAATCTTCAATCTATTGTCTTCCCAAGTCATGGAAGCCGTCTTCTT
TGAAATCCACGGGGAAACGAACTCGTAAGATTGTACTCTTTGGTTACCGAAACTTGTCCCTGCGCTTTTCACCCCGACGTTCGAATTCCTTGTCCCAAAATCGGTTTCGT
CCGCTTTGCTTCTTCAATGCTAAAGATGAATCCGGCGGCGATTTCCAACAAAAGGTACGGTCTTCCTCATCTAATTCGCCATTAGTTTTCGGCAAACTTGCTGTTGAATT
TGTTTCCCGAGAAAAGTACGGAAATGGGGCGCAATGGCCTATCTTCAAGCGGTGGGAGGTGCCATGGGGATGGCAAACAGTTTCATTAACTTCACTTGCTTGTGGATTAA
GTTTCATTGTGACAGGATTGGTTGAATCTGCTGCTATACCTTATCTTGGTATTCGCATTGAAGAGCTAAGCTTAGACGAGAAGGCTGAAATACTTTTCCTGGATCAAGGA
ATTACAACTGTGGCAGTGCTTGGGATCTTATACAGTATTGCCAACACTTTCCAACCACTTCCCAATGACTTGTATCGCTATGATATTAGGGACCCCCTGAATCTGCAGAA
AGGCTGGCTCTTATGGGCAGGAGTAGGTCTGGTTGGTGCCTTAGCTTCTATTGCAGCGACAGGAGCTGTCTTGTCTTCATTTAGCGGTGGGAGCCCTCAAAGAGAGACAG
ACGCTCTAGTACGCTTGCTACCACTAATTGGATCTTCCAGCATCAGCACTGCTTGTCTGGTGGGCATCACAGGTGTCCTTGCTCCAGTTCTTGAAGAGACTGTGTTCCGG
GGATTTTTTATGGTGTCCCTTACCAAATGGGTTCCCACACCAGTTGCCGTCCTGATTAGCGCAGCCATATTTGCCCTTGCACATCTCACTCCTGGAGAGTTTCCCCAGCT
CTTTGTGCTTGGAACTGCTCTGGGATTTTCATATGCTCAAACTCACAACCTCTTGACCCCCATCACCATACATGCTTTGTGGAACTCGGGTGTCATTTTGCTTCTTACCT
TCCTTTCG
Protein sequenceShow/hide protein sequence
MEAGDFTVAMEVSQSLRCVYMYETAASVRRFNAYATSAWKLRFTIEGQIRRSCGGRRTSGLVGCGEACSGRRVAGLATKGAVGSVSPRSAESQPYIKLGPVLTLCQTPRG
TPPNPILSPTTPFSSGWVGKWTKKRRSCDWDFGGHLVPNAIPAFFGVEYLIGCRFFSKMGIEISDQFLGTFVPIVVYWVYSGIYILLGSFENYRLHSKKDEQEKNLVSKS
TVVRGVFLQQAIQAIVAIILFKVTGNDGEVATGPKPGLTIIVQFIIAMLVLDTWQYFIHRYMHQNKFLYKHVHSQHHRLVVPYAFGALYNHPLEGLLLDTIGGALSFLVS
GMSPRVAIFFFSFATIKTVDDHCGLWLPGNLFHVFFRNNSRIMMFTISYMAAKHTKYWKFGANTDEEAHSNQNPQSSSQLRTSSVAQMTPTSVELVPQNRSPPRSSLRFD
HVRGGADVSDWTRRRQPPFIVVRHDGVVGGAGGDDTPLDARLAWCITSCGCDTFCAIRALSRKKRVGVPRQALYLEKSASIASSQGGSDCRRLAASPPLGKIQRPLLTCA
LADLLLEFAFAGSSCRSKSIYRENPLMVTTYSAWPPHVKTNTWGSRAVSFWLLGVKIYLLGLLALLFSRPQLERLGAVHSSIKVFLCSKDFDVSILFLGGGSDGGIGPTS
SGLGKFAHKLLLCMIQVGRERGFAKVIIVVLEGQPLGEAMMVHEIEAVNIAQVREPETESSIYCLPKSWKPSSLKSTGKRTRKIVLFGYRNLSLRFSPRRSNSLSQNRFR
PLCFFNAKDESGGDFQQKVRSSSSNSPLVFGKLAVEFVSREKYGNGAQWPIFKRWEVPWGWQTVSLTSLACGLSFIVTGLVESAAIPYLGIRIEELSLDEKAEILFLDQG
ITTVAVLGILYSIANTFQPLPNDLYRYDIRDPLNLQKGWLLWAGVGLVGALASIAATGAVLSSFSGGSPQRETDALVRLLPLIGSSSISTACLVGITGVLAPVLEETVFR
GFFMVSLTKWVPTPVAVLISAAIFALAHLTPGEFPQLFVLGTALGFSYAQTHNLLTPITIHALWNSGVILLLTFLS