; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr012360 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr012360
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionfatty acid hydroxylase 2
Genome locationtig00153347:64709..75323
RNA-Seq ExpressionSgr012360
SyntenySgr012360
Gene Ontology termsGO:0008610 - lipid biosynthetic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0005506 - iron ion binding (molecular function)
GO:0080132 - fatty acid alpha-hydroxylase activity (molecular function)
InterPro domainsIPR014430 - Sterol desaturase Scs7


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583838.1 hypothetical protein SDJN03_19770, partial [Cucurbita argyrosperma subsp. sororia]7.2e-13462.9Show/hide
Query:  MPLLPWKTTRASRISQIVADLQSPKRGGSLVVETGFPTSLVDLVVKHRDRFRKHSARRKSKKKKSKNEIYDPITQLSSSPTSPPPLMNPTGSAVGCRRHG
        MPLLPWK+TR+SRISQIVADLQSPKRGGSLVVETGFPTSLVDL VKHRDR RK SARRKSK K+SKNE  D I Q SS   S PP ++ + +  GC +  
Subjt:  MPLLPWKTTRASRISQIVADLQSPKRGGSLVVETGFPTSLVDLVVKHRDRFRKHSARRKSKKKKSKNEIYDPITQLSSSPTSPPPLMNPTGSAVGCRRHG

Query:  NLEIDDLKVVDEARVVGDRGCELVLNRTETCVVGGGAAGGNSVSMVALKLFVIA--VLALSTKKLVVGITLSASLLLLLEFVGKRAVRFLKPCAHGEAAL
        NLE++DLKV+DE R   DRGCELV NRTETCVVGG AAG N V +  LK+FV+A  VLALS KKLVVG TLSA LLLLLEFVGKR+VRFLKPC HGE AL
Subjt:  NLEIDDLKVVDEARVVGDRGCELVLNRTETCVVGGGAAGGNSVSMVALKLFVIA--VLALSTKKLVVGITLSASLLLLLEFVGKRAVRFLKPCAHGEAAL

Query:  RSLIHKVMKHLWIGKDDLVVQNSGNCEQESVAKVPLDAFPNESIGLSELNSSIQ---LVEPDVDAVGTPKGIEDEKHHLGFLG-----------------
        RSLI KV+KHLWIGKDDLV+Q S N   + V KV LDAFPNESI   E +SSI+   LVEP+VDA  TPKGIEDEK  LGFLG                 
Subjt:  RSLIHKVMKHLWIGKDDLVVQNSGNCEQESVAKVPLDAFPNESIGLSELNSSIQ---LVEPDVDAVGTPKGIEDEKHHLGFLG-----------------

Query:  -------MRRMRRKVGMEPSFGCSQAIEKENR-EELELRNDRFNHLAADKMSIQEQSHEEEDEN------QEQEQDGETRSILCEQERDNGRPPGFEEQW
                R M RK+  + S    QA+EK+N  EE E +NDR      DK SI+EQ ++E+DEN      QEQE+D E R+I CE+++D+ +PP F+EQW
Subjt:  -------MRRMRRKVGMEPSFGCSQAIEKENR-EELELRNDRFNHLAADKMSIQEQSHEEEDEN------QEQEQDGETRSILCEQERDNGRPPGFEEQW

Query:  QAMEVAESCEMQDIEREGSSSYPILVLVLVGLAGLFGGRILAVVLTTAGCFMIKMNEISRRRSPNPPLK
        QAMEV ESCE+Q I+++G  SY  L+LVLV LAGLFGGR LAVVLTTA CFMIK+ E   RRS NPPLK
Subjt:  QAMEVAESCEMQDIEREGSSSYPILVLVLVGLAGLFGGRILAVVLTTAGCFMIKMNEISRRRSPNPPLK

XP_022140249.1 uncharacterized protein LOC111010961 [Momordica charantia]1.9e-15067.68Show/hide
Query:  MPLLPWKTTRASRISQIVADLQSPKRGGSLVVETGFPTSLVDLVVKHRDRFRKHSARRKSKKKKSKNEIYDPITQLSSSPTSPPPLMNPTGSAVGCRRHG
        MPLLPWKT RASRIS+IVADLQSPK GGSLVVETGFPTS+VDL VKHRDR RKHSARRKSKKKKSK EI+D I QLSSSPTSPPP MNPT     C R  
Subjt:  MPLLPWKTTRASRISQIVADLQSPKRGGSLVVETGFPTSLVDLVVKHRDRFRKHSARRKSKKKKSKNEIYDPITQLSSSPTSPPPLMNPTGSAVGCRRHG

Query:  NLEIDDLKVVDEARVVGDRGCELVLNRTETCVVGGGAAGGNSVSMVALKLFVIAVLALSTKKLVVGITLSASLLLLLEFVGKRAVRFLKPCAHGEAALRS
        NLEIDDLK++DEARVVGDR CE V NRTETCVVGGGAA G+SV MVA+K+F++AVLALSTKKLVVG+TLSASLLL+LEFVGKRAVRFLKPCA+GEAALRS
Subjt:  NLEIDDLKVVDEARVVGDRGCELVLNRTETCVVGGGAAGGNSVSMVALKLFVIAVLALSTKKLVVGITLSASLLLLLEFVGKRAVRFLKPCAHGEAALRS

Query:  LIHKVMKHLWIGKDDLVVQNSGNCEQESVAKVPLD---AFPNESIGLSELNSSIQLVEPDVDAVGTPKGIEDEKHHLGFLG-------------------
         IHKV+ HLWI      V+NSGN ++ S+ KV       FPNES+ LS     IQLVE DV+A  TP GIEDE HH+GFL                    
Subjt:  LIHKVMKHLWIGKDDLVVQNSGNCEQESVAKVPLD---AFPNESIGLSELNSSIQLVEPDVDAVGTPKGIEDEKHHLGFLG-------------------

Query:  -----MRRMRRKVGMEPSFGCSQAIEKENRE-ELELRNDRFNHLAADKMSIQEQSHEEEDENQEQEQDGETRSILCEQERDNGRPPGFEEQWQAMEVAES
              RRM  K+    S G  +AIE +NRE ELE R+DRFNHL ADK  I+EQ + EEDEN EQEQDGE RSILC++E++NG+P     QWQAME AES
Subjt:  -----MRRMRRKVGMEPSFGCSQAIEKENRE-ELELRNDRFNHLAADKMSIQEQSHEEEDENQEQEQDGETRSILCEQERDNGRPPGFEEQWQAMEVAES

Query:  CEMQDIEREGSSSYPILVLVLVGLAGLFGGRILAVVLTTAGCFMIKMNEISRRRSPNPPLK
        C++Q+IEREGS +YPILV+V++G  GL GGRILAVVLTTAGCF++K+NEISRRRS +PPLK
Subjt:  CEMQDIEREGSSSYPILVLVLVGLAGLFGGRILAVVLTTAGCFMIKMNEISRRRSPNPPLK

XP_022927029.1 uncharacterized protein LOC111433981 [Cucurbita moschata]1.5e-13462.87Show/hide
Query:  MPLLPWKTTRASRISQIVADLQSPKRGGSLVVETGFPTSLVDLVVKHRDRFRKHSARRKSKKKKSKNEIYDPITQLSSSPTSPPPLMNPTGSAVGCRRHG
        MPLLPWK+TR+SRISQIVADLQSPKRGGSLVVETGFPTSLVDL VKHRDR RK SARRKSK K+SKNE  D I Q SS   S PP ++ + +  GC +  
Subjt:  MPLLPWKTTRASRISQIVADLQSPKRGGSLVVETGFPTSLVDLVVKHRDRFRKHSARRKSKKKKSKNEIYDPITQLSSSPTSPPPLMNPTGSAVGCRRHG

Query:  NLEIDDLKVVDEARVVGDRGCELVLNRTETCVVGGGAAGGNSVSMVALKLFVIA--VLALSTKKLVVGITLSASLLLLLEFVGKRAVRFLKPCAHGEAAL
        NLE++DLKV+DE R   DRGCELV NRTETCVVGG AAG N V +  LK+FV+A  VLALS KKLVVG TLSA LLLLLEFVGKR+VRFLKPC HGE AL
Subjt:  NLEIDDLKVVDEARVVGDRGCELVLNRTETCVVGGGAAGGNSVSMVALKLFVIA--VLALSTKKLVVGITLSASLLLLLEFVGKRAVRFLKPCAHGEAAL

Query:  RSLIHKVMKHLWIGKDDLVVQNSGNCEQESVAKVPLDAFPNESIGLSELNSSIQ---LVEPDVDAVGTPKGIEDEKHHLGFLG-----------------
        RSLI KV+KHLWIGKDDLV+Q S N   +SV KV LDAFPNESI   E +SSI+   LVEP+VDA  TP+GIEDEK  LGFLG                 
Subjt:  RSLIHKVMKHLWIGKDDLVVQNSGNCEQESVAKVPLDAFPNESIGLSELNSSIQ---LVEPDVDAVGTPKGIEDEKHHLGFLG-----------------

Query:  -------MRRMRRKVGMEPSFGCSQAIEKENR-EELELRNDRFNHLAADKMSIQEQSHEEEDEN------QEQEQDGETRSILCEQERDNGRPPGFEEQW
                R M RK+  + S    QAIEK+N  EE E RNDR      DK SI+EQ ++E+DEN      QEQE+D E R+I CE+++D+ +PP F+EQW
Subjt:  -------MRRMRRKVGMEPSFGCSQAIEKENR-EELELRNDRFNHLAADKMSIQEQSHEEEDEN------QEQEQDGETRSILCEQERDNGRPPGFEEQW

Query:  QAMEVAESCEMQDIEREGSSSYPILVLVLVGLAGLFGGRILAVVLTTAGCFMIKMNEISRRRSPNPPLKAGIRR
        QAMEV ESCE+Q I+++G  SY  L+LVLV LAGLFGGR LAVVLTTA CFMIK+ E   R S NPPLK  I R
Subjt:  QAMEVAESCEMQDIEREGSSSYPILVLVLVGLAGLFGGRILAVVLTTAGCFMIKMNEISRRRSPNPPLKAGIRR

XP_023519491.1 uncharacterized protein LOC111782887 [Cucurbita pepo subsp. pepo]5.5e-13463.19Show/hide
Query:  MPLLPWKTTRASRISQIVADLQSPKRGGSLVVETGFPTSLVDLVVKHRDRFRKHSARRKSKKKKSKNEIYDPITQLSSSPTSPPPLMNPTGSAVGCRRHG
        MPLLPWK+TR+SRISQIVADLQSPKRGGSLVVETGFPTSLVDL VKHRDR RK SARRKSK K+SKNE  D I Q SS   S PP ++ + +  GC    
Subjt:  MPLLPWKTTRASRISQIVADLQSPKRGGSLVVETGFPTSLVDLVVKHRDRFRKHSARRKSKKKKSKNEIYDPITQLSSSPTSPPPLMNPTGSAVGCRRHG

Query:  NLEIDDLKVVDEARVVGDRGCELVLNRTETCVVGGGAAGGNSVSMVALKLFVIA--VLALSTKKLVVGITLSASLLLLLEFVGKRAVRFLKPCAHGEAAL
        NLE++DLKV+DE R   DRGCELV NRTETCVVGG AAG N V +  LK+FV+A  VLALS KKLVVG TLSA LLLLLEFVGKR+VRFLKPC HGE AL
Subjt:  NLEIDDLKVVDEARVVGDRGCELVLNRTETCVVGGGAAGGNSVSMVALKLFVIA--VLALSTKKLVVGITLSASLLLLLEFVGKRAVRFLKPCAHGEAAL

Query:  RSLIHKVMKHLWIGKDDLVVQNSGNCEQESVAKVPLDAFPNESIGLSELNSSIQ---LVEPDVDAVGTPKGIEDEKHHLGFLG-----------------
        RSLI KV+KHLWIGKDDLV+Q S N   + V KV LDAFPNESI   E +SSI+   LVEP+VDA  TPKGIEDEK  LGFLG                 
Subjt:  RSLIHKVMKHLWIGKDDLVVQNSGNCEQESVAKVPLDAFPNESIGLSELNSSIQ---LVEPDVDAVGTPKGIEDEKHHLGFLG-----------------

Query:  -------MRRMRRKVGMEPSFGCSQAIEKENR-EELELRNDRFNHLAADKMSIQEQSHEEEDEN-------QEQEQDGETRSILCEQERDNGRPPGFEEQ
                R M RK+  + S    QAIEK+N  EE E RNDR      DK SI+EQ ++E+DEN       QEQE+D E R+I CE+++D+ +PP F+EQ
Subjt:  -------MRRMRRKVGMEPSFGCSQAIEKENR-EELELRNDRFNHLAADKMSIQEQSHEEEDEN-------QEQEQDGETRSILCEQERDNGRPPGFEEQ

Query:  WQAMEVAESCEMQDIEREGSSSYPILVLVLVGLAGLFGGRILAVVLTTAGCFMIKMNEISRRRSPNPPLK
        WQAMEV ESCE+Q I+++G  SY  L+LVLV LAGLFGGR LAVVLTTA CFMIK+ E   RRS NPPLK
Subjt:  WQAMEVAESCEMQDIEREGSSSYPILVLVLVGLAGLFGGRILAVVLTTAGCFMIKMNEISRRRSPNPPLK

XP_038893606.1 uncharacterized protein LOC120082488 [Benincasa hispida]3.4e-13164.11Show/hide
Query:  MPLLPWKTTRASRISQIVADLQSPKRGGSLVVETGFPTSLVDLVVKHRDRFRKHSARRKSKKKKSKNEIYDPITQLSSS-PTSPPPLMNPTGSAVGCRRH
        MPLLPWKTTRASRISQIVADLQSPKRGGSLVVETGFPTS+VDL VKHRDR RKHSARRK KKK SKNE +D I QLSSS P S P  +N +GS   C   
Subjt:  MPLLPWKTTRASRISQIVADLQSPKRGGSLVVETGFPTSLVDLVVKHRDRFRKHSARRKSKKKKSKNEIYDPITQLSSS-PTSPPPLMNPTGSAVGCRRH

Query:  GNLEIDDLKVVDEARVVGDRGCELVLNRTETCVVGGGAAGGNSVSMVALKLFVIAVLALSTKKLVVGITLSASLLLLLEFVGKRAVRFLKPCAHGEAALR
        GNLEI+DLKV+DEAR   DRGCEL+ +RTE CVVGG  AGGN V +  LK+FV+AVLALS KKLVVGITL A LL LLEFVG+RAVRFLKPC HG+ ALR
Subjt:  GNLEIDDLKVVDEARVVGDRGCELVLNRTETCVVGGGAAGGNSVSMVALKLFVIAVLALSTKKLVVGITLSASLLLLLEFVGKRAVRFLKPCAHGEAALR

Query:  SLIHKVMKHLWIGKDDLVVQNSGNCEQESVAKVPLDAFPNESIGLSELNSSIQLVEPDVDAVGTPKGIEDEKHHLGFLG----------------MRRMR
         LI KV KHLWIGKDDLV Q+S NCE E V +VPLDAF NES  LS     IQLVEP+ DA  TPKG+EDEK  L FLG                 R +R
Subjt:  SLIHKVMKHLWIGKDDLVVQNSGNCEQESVAKVPLDAFPNESIGLSELNSSIQLVEPDVDAVGTPKGIEDEKHHLGFLG----------------MRRMR

Query:  RKVG------MEPSFGCSQAIEKENR-EELELRNDRFNHLAADKMSIQEQSHEEEDENQEQEQDGETRSILCEQERDNGRPPGFEEQWQAMEVAESCEMQ
        RK        +      +QAIEK+N  EELE R+DR      DK SI+EQ  E E   +EQ+QD E RSI CE           E+Q  AME AESCE+ 
Subjt:  RKVG------MEPSFGCSQAIEKENR-EELELRNDRFNHLAADKMSIQEQSHEEEDENQEQEQDGETRSILCEQERDNGRPPGFEEQWQAMEVAESCEMQ

Query:  DIEREGSSSYPILVLVLVGLAGLFGGRILAVVLTTAGCFMIKMNEISRRRSPNPPLK
         IEREGS SY ILVL++  LAGLFGGR LAVVLTTA CFMIK+NEI+RR+S NPPLK
Subjt:  DIEREGSSSYPILVLVLVGLAGLFGGRILAVVLTTAGCFMIKMNEISRRRSPNPPLK

TrEMBL top hitse value%identityAlignment
A0A0A0LXY7 Uncharacterized protein7.6e-12160.65Show/hide
Query:  MPLLPWKTTRASRISQIVADLQSPKRGGSLVVETGFPTSLVDLVVKHRDRFRKHSARRKSKKKK--SKNEIYDPITQLSSSPTSPPPLMNPTGSAVGCRR
        MPLLPWKTTRASRISQIVADL SPKR  SLVVETGFPTS+VDL VKHRDR RKHSARRKSKKKK  +KNE +D I QLS      P  ++  GS+  C  
Subjt:  MPLLPWKTTRASRISQIVADLQSPKRGGSLVVETGFPTSLVDLVVKHRDRFRKHSARRKSKKKK--SKNEIYDPITQLSSSPTSPPPLMNPTGSAVGCRR

Query:  HGNLEIDDLKVVDEARVVGDRGCELVLNRTETCVVGGGAAGGNSVSMVALKLFVIAVLALSTKKLVVGITLSASLLLLLEFVGKRAVRFLKPCAHGEAAL
         GNLEI+DL   DEAR    RGCE + +RTETCVVGG AA GN   +  LK+FV+AVLALS KKLVVGITLSA LL LLEF+G R VRF KPC HGEAAL
Subjt:  HGNLEIDDLKVVDEARVVGDRGCELVLNRTETCVVGGGAAGGNSVSMVALKLFVIAVLALSTKKLVVGITLSASLLLLLEFVGKRAVRFLKPCAHGEAAL

Query:  RSLIHKVMKHLWIGKDDLVVQNSGNCEQESVAKVPLDAFPNESIGLSELNSS---IQLVEPDVDAVGTPKGIEDEKHHL--GFL-------GMRR-----
        RSLI KV KHLWIGKDD V+Q S NCE ESV KV L+AF NESI   EL+SS   IQLVEP++D   TPKG EDEK  L  G L       G+RR     
Subjt:  RSLIHKVMKHLWIGKDDLVVQNSGNCEQESVAKVPLDAFPNESIGLSELNSS---IQLVEPDVDAVGTPKGIEDEKHHL--GFL-------GMRR-----

Query:  -----MRRKVGMEPSFGCSQAIEKENREELELRNDRFNHLAADKMSIQEQSHEEEDENQEQEQDGETRSILCEQERDNGRPPGFEEQWQAMEVAE-SCEM
             +RRK G E +       +K N EEL  R+        DK SI+EQ +E  +  ++QEQDGET SI C+ E+D+G+P GF+EQWQ M+VAE SCE+
Subjt:  -----MRRKVGMEPSFGCSQAIEKENREELELRNDRFNHLAADKMSIQEQSHEEEDENQEQEQDGETRSILCEQERDNGRPPGFEEQWQAMEVAE-SCEM

Query:  Q--DIEREGSSSYPILVLVLVGLAGLFGGRILAVVLTTAGCFMIKMNEISRRRSPNPPLK
            I+RE S SY  L L++  L GLFGGR +AVVL TA CFMIK+NEI+RR+S NPPLK
Subjt:  Q--DIEREGSSSYPILVLVLVGLAGLFGGRILAVVLTTAGCFMIKMNEISRRRSPNPPLK

A0A5A7VD10 Ethylene-responsive nuclear family protein9.9e-12161.39Show/hide
Query:  MPLLPWKTTRASRISQIVADLQSPKRGGSLVVETGFPTSLVDLVVKHRDRFRKHSARRKSKKKK--SKNEIYDPITQLSSSPTSPPPLMNPTGSAVGCRR
        MPLL WKTTRASRISQIVAD  SPKR GSLVVETGFPTS+VDL VKHRDR RKHSARRKSKKKK  +KNE +D I QLS      P  ++  GS+  C  
Subjt:  MPLLPWKTTRASRISQIVADLQSPKRGGSLVVETGFPTSLVDLVVKHRDRFRKHSARRKSKKKK--SKNEIYDPITQLSSSPTSPPPLMNPTGSAVGCRR

Query:  HGNLEIDDLKVVDEARVVGDRGCELVLNRTETCVVGGGAAGGNSVSMVALKLFVIAVLALSTKKLVVGITLSASLLLLLEFVGKRAVRFLKPCAHGEAAL
         GNLE +DL   DEAR    RGCEL+ +RTETCVVGG A  GN   +  LK+FV+A LALS KKLVVGITLSA LL LLEFVGKRAVRF KPC H EAA 
Subjt:  HGNLEIDDLKVVDEARVVGDRGCELVLNRTETCVVGGGAAGGNSVSMVALKLFVIAVLALSTKKLVVGITLSASLLLLLEFVGKRAVRFLKPCAHGEAAL

Query:  RSLIHKVMKHLWIGKDDLVVQNSGNCEQESVAKVPLDAFPNESIGLSELNSS---IQLVEPDVDAVGTPKGIEDEKHHL--GFL-------GMRR-----
        RSLI K  KHLWIGKDD V+Q+S NCE ESV K+ L+AFPNESI L EL+SS   IQLVEP +D   TPK  EDEK  L  G L       G+RR     
Subjt:  RSLIHKVMKHLWIGKDDLVVQNSGNCEQESVAKVPLDAFPNESIGLSELNSS---IQLVEPDVDAVGTPKGIEDEKHHL--GFL-------GMRR-----

Query:  -----MRRKVGMEPSFGCSQAIEKENR-EELELRNDRFNHLAADKMSIQEQSHEEEDENQEQEQDGETRSILCEQERDNGRPPGFEEQWQAMEVAE-SCE
             +RRK G       SQA EK+N  EELE R+D       DK SI+EQ  E E   +E+EQDGET SI  + ++D+G+PPGF+EQWQ MEVAE SCE
Subjt:  -----MRRKVGMEPSFGCSQAIEKENR-EELELRNDRFNHLAADKMSIQEQSHEEEDENQEQEQDGETRSILCEQERDNGRPPGFEEQWQAMEVAE-SCE

Query:  MQ--DIEREGSSSYPILVLVLVGLAGLFGGRILAVVLTTAGCFMIKMNEISRRRSPNPPLK
        +    I+RE S SY IL L++  L GLFGGR LAVVL TA CFMIK+ EI+RR+S NPPLK
Subjt:  MQ--DIEREGSSSYPILVLVLVGLAGLFGGRILAVVLTTAGCFMIKMNEISRRRSPNPPLK

A0A6J1CF69 uncharacterized protein LOC1110109619.2e-15167.68Show/hide
Query:  MPLLPWKTTRASRISQIVADLQSPKRGGSLVVETGFPTSLVDLVVKHRDRFRKHSARRKSKKKKSKNEIYDPITQLSSSPTSPPPLMNPTGSAVGCRRHG
        MPLLPWKT RASRIS+IVADLQSPK GGSLVVETGFPTS+VDL VKHRDR RKHSARRKSKKKKSK EI+D I QLSSSPTSPPP MNPT     C R  
Subjt:  MPLLPWKTTRASRISQIVADLQSPKRGGSLVVETGFPTSLVDLVVKHRDRFRKHSARRKSKKKKSKNEIYDPITQLSSSPTSPPPLMNPTGSAVGCRRHG

Query:  NLEIDDLKVVDEARVVGDRGCELVLNRTETCVVGGGAAGGNSVSMVALKLFVIAVLALSTKKLVVGITLSASLLLLLEFVGKRAVRFLKPCAHGEAALRS
        NLEIDDLK++DEARVVGDR CE V NRTETCVVGGGAA G+SV MVA+K+F++AVLALSTKKLVVG+TLSASLLL+LEFVGKRAVRFLKPCA+GEAALRS
Subjt:  NLEIDDLKVVDEARVVGDRGCELVLNRTETCVVGGGAAGGNSVSMVALKLFVIAVLALSTKKLVVGITLSASLLLLLEFVGKRAVRFLKPCAHGEAALRS

Query:  LIHKVMKHLWIGKDDLVVQNSGNCEQESVAKVPLD---AFPNESIGLSELNSSIQLVEPDVDAVGTPKGIEDEKHHLGFLG-------------------
         IHKV+ HLWI      V+NSGN ++ S+ KV       FPNES+ LS     IQLVE DV+A  TP GIEDE HH+GFL                    
Subjt:  LIHKVMKHLWIGKDDLVVQNSGNCEQESVAKVPLD---AFPNESIGLSELNSSIQLVEPDVDAVGTPKGIEDEKHHLGFLG-------------------

Query:  -----MRRMRRKVGMEPSFGCSQAIEKENRE-ELELRNDRFNHLAADKMSIQEQSHEEEDENQEQEQDGETRSILCEQERDNGRPPGFEEQWQAMEVAES
              RRM  K+    S G  +AIE +NRE ELE R+DRFNHL ADK  I+EQ + EEDEN EQEQDGE RSILC++E++NG+P     QWQAME AES
Subjt:  -----MRRMRRKVGMEPSFGCSQAIEKENRE-ELELRNDRFNHLAADKMSIQEQSHEEEDENQEQEQDGETRSILCEQERDNGRPPGFEEQWQAMEVAES

Query:  CEMQDIEREGSSSYPILVLVLVGLAGLFGGRILAVVLTTAGCFMIKMNEISRRRSPNPPLK
        C++Q+IEREGS +YPILV+V++G  GL GGRILAVVLTTAGCF++K+NEISRRRS +PPLK
Subjt:  CEMQDIEREGSSSYPILVLVLVGLAGLFGGRILAVVLTTAGCFMIKMNEISRRRSPNPPLK

A0A6J1EGV0 uncharacterized protein LOC1114339817.1e-13562.87Show/hide
Query:  MPLLPWKTTRASRISQIVADLQSPKRGGSLVVETGFPTSLVDLVVKHRDRFRKHSARRKSKKKKSKNEIYDPITQLSSSPTSPPPLMNPTGSAVGCRRHG
        MPLLPWK+TR+SRISQIVADLQSPKRGGSLVVETGFPTSLVDL VKHRDR RK SARRKSK K+SKNE  D I Q SS   S PP ++ + +  GC +  
Subjt:  MPLLPWKTTRASRISQIVADLQSPKRGGSLVVETGFPTSLVDLVVKHRDRFRKHSARRKSKKKKSKNEIYDPITQLSSSPTSPPPLMNPTGSAVGCRRHG

Query:  NLEIDDLKVVDEARVVGDRGCELVLNRTETCVVGGGAAGGNSVSMVALKLFVIA--VLALSTKKLVVGITLSASLLLLLEFVGKRAVRFLKPCAHGEAAL
        NLE++DLKV+DE R   DRGCELV NRTETCVVGG AAG N V +  LK+FV+A  VLALS KKLVVG TLSA LLLLLEFVGKR+VRFLKPC HGE AL
Subjt:  NLEIDDLKVVDEARVVGDRGCELVLNRTETCVVGGGAAGGNSVSMVALKLFVIA--VLALSTKKLVVGITLSASLLLLLEFVGKRAVRFLKPCAHGEAAL

Query:  RSLIHKVMKHLWIGKDDLVVQNSGNCEQESVAKVPLDAFPNESIGLSELNSSIQ---LVEPDVDAVGTPKGIEDEKHHLGFLG-----------------
        RSLI KV+KHLWIGKDDLV+Q S N   +SV KV LDAFPNESI   E +SSI+   LVEP+VDA  TP+GIEDEK  LGFLG                 
Subjt:  RSLIHKVMKHLWIGKDDLVVQNSGNCEQESVAKVPLDAFPNESIGLSELNSSIQ---LVEPDVDAVGTPKGIEDEKHHLGFLG-----------------

Query:  -------MRRMRRKVGMEPSFGCSQAIEKENR-EELELRNDRFNHLAADKMSIQEQSHEEEDEN------QEQEQDGETRSILCEQERDNGRPPGFEEQW
                R M RK+  + S    QAIEK+N  EE E RNDR      DK SI+EQ ++E+DEN      QEQE+D E R+I CE+++D+ +PP F+EQW
Subjt:  -------MRRMRRKVGMEPSFGCSQAIEKENR-EELELRNDRFNHLAADKMSIQEQSHEEEDEN------QEQEQDGETRSILCEQERDNGRPPGFEEQW

Query:  QAMEVAESCEMQDIEREGSSSYPILVLVLVGLAGLFGGRILAVVLTTAGCFMIKMNEISRRRSPNPPLKAGIRR
        QAMEV ESCE+Q I+++G  SY  L+LVLV LAGLFGGR LAVVLTTA CFMIK+ E   R S NPPLK  I R
Subjt:  QAMEVAESCEMQDIEREGSSSYPILVLVLVGLAGLFGGRILAVVLTTAGCFMIKMNEISRRRSPNPPLKAGIRR

A0A6J1KJZ0 uncharacterized protein LOC1114952882.9e-12861.67Show/hide
Query:  MPLLPWKTTRASRISQIVADLQSPKRGGSLVVETGFPTSLVDLVVKHRDRFRKHSARRKSKKKKSKNEIYDPITQLSSSPTSPPPLMNPTGSAVGCRRHG
        MPLLPWK+TR+SRISQIVADLQSPKRGGSLVVETGFPTSLVDL VKHRDR RK SARRKSK K+SKNE  D I Q SS   S P  ++ + +  GC +  
Subjt:  MPLLPWKTTRASRISQIVADLQSPKRGGSLVVETGFPTSLVDLVVKHRDRFRKHSARRKSKKKKSKNEIYDPITQLSSSPTSPPPLMNPTGSAVGCRRHG

Query:  NLEIDDLKVVDEARVVGDRGCELVLNRTETCVVGGGAAGGNSVSMVALKLFV----IAVLALSTKKLVVGITLSASLLLLLEFVGKRAVRFLKPCAHGEA
        NLE+++LKV+DE R   DRGCELV NRTE CVVGG A G N V +  LK+FV    +AVLAL+ KKLVVG TLSA LLLLLEFVGKR+VRFLKPC HGEA
Subjt:  NLEIDDLKVVDEARVVGDRGCELVLNRTETCVVGGGAAGGNSVSMVALKLFV----IAVLALSTKKLVVGITLSASLLLLLEFVGKRAVRFLKPCAHGEA

Query:  ALRSLIHKVMKHLWIGKDDLVVQNSGNCEQESVAKVPLDAFPNESIGLSELNSSIQ---LVEPDVDAVGTPKGIEDEKHHLGFLG---------------
        ALRSLI KV+ HLWI KDDLV+Q S N   + V KV LDAFPNESI   E +SSI+   LVEP+VDA  T KGIEDEK  LGFLG               
Subjt:  ALRSLIHKVMKHLWIGKDDLVVQNSGNCEQESVAKVPLDAFPNESIGLSELNSSIQ---LVEPDVDAVGTPKGIEDEKHHLGFLG---------------

Query:  ---------MRRMRRKVGMEPSFGCSQAIEKENR-EELELRNDRFNHLAADKMSIQEQSHEEEDEN--QEQEQDGETRSILCEQERDNGRPPGFEEQWQA
                  R M RK+  + S    QAIEK+N  EE E RNDR      DK SI+EQ ++E+DEN  +EQE+D E R+I CE+++D  +PP F+EQWQA
Subjt:  ---------MRRMRRKVGMEPSFGCSQAIEKENR-EELELRNDRFNHLAADKMSIQEQSHEEEDEN--QEQEQDGETRSILCEQERDNGRPPGFEEQWQA

Query:  MEVAESCEMQDIEREGSSSYPILVLVLVGLAGLFGGRILAVVLTTAGCFMIKMNEISRRRSPNPPLK
        MEV ESCE+Q I++ G  SY  L+LVLV LAGLFGGR LAVVLTTA CFMIK+ E   RRS NP LK
Subjt:  MEVAESCEMQDIEREGSSSYPILVLVLVGLAGLFGGRILAVVLTTAGCFMIKMNEISRRRSPNPPLK

SwissProt top hitse value%identityAlignment
O48916 Dihydroceramide fatty acyl 2-hydroxylase FAH17.1e-8468.75Show/hide
Query:  MVAQGFTVDLNKPLVFQVGHLGEDYQEWVHQPIVSKEGPRFLRVIFGGIGKFIGFQTQFLTRTAWWAIPTIWLPVVCWFISMSFRVGHTPSEISLMVVFG
        MVAQGFTVDL KPLVFQVGHLGEDY+EWVHQPI +KEGPRF +  F           +FLT T WWA+P IWLPVV W IS S  +G +  EI  +VV G
Subjt:  MVAQGFTVDLNKPLVFQVGHLGEDYQEWVHQPIVSKEGPRFLRVIFGGIGKFIGFQTQFLTRTAWWAIPTIWLPVVCWFISMSFRVGHTPSEISLMVVFG

Query:  IFVWTFLEYSLHRFLFHIKTKSYWGNTIHYLLHGCHHKHPMDSLRLVFPPAATAVLLFPFWNLIRLFATPSTTPALVGGGLLGYVMYDVTHHYLHHGQPS
        IF+WTF EY LHRF+FHIKTKSYWGNT HYL+HGCHHKHPMD LRLVFPP ATA+L FPFWN+ +  +TPST PAL GGG+LGYVMYDVTH+YLHH QP+
Subjt:  IFVWTFLEYSLHRFLFHIKTKSYWGNTIHYLLHGCHHKHPMDSLRLVFPPAATAVLLFPFWNLIRLFATPSTTPALVGGGLLGYVMYDVTHHYLHHGQPS

Query:  GEIVRNLK
          + +NLK
Subjt:  GEIVRNLK

Q2LAM0 Fatty acid 2-hydroxylase3.8e-3738.39Show/hide
Query:  VDLNKPLVFQVGHLGEDYQEWVHQPIVSKEGPRFLRVIFGGIGKFIGFQTQFLTRTAWWAIPTIWLPVVCWFISMSFRVGHTPSEISL------------
        VD  KPL++QVGHLGE Y EWVHQP+      R +R+    +        +  ++T W+++P IW+P+V  ++S S+    T   I L            
Subjt:  VDLNKPLVFQVGHLGEDYQEWVHQPIVSKEGPRFLRVIFGGIGKFIGFQTQFLTRTAWWAIPTIWLPVVCWFISMSFRVGHTPSEISL------------

Query:  -------MVVFGIFVWTFLEYSLHRFLFHIK--TKSYWGNTIHYLLHGCHHKHPMDSLRLVFPPAATAVLLFPFWNLIRLFATPSTTPALVGGGLLGYVM
               + V G+ +WT +EY +HRFLFH+K  + S++   +H+++HG HHK P D  RLVFPP   +V++  F+  +RL    +    L  GGLLGYV+
Subjt:  -------MVVFGIFVWTFLEYSLHRFLFHIK--TKSYWGNTIHYLLHGCHHKHPMDSLRLVFPPAATAVLLFPFWNLIRLFATPSTTPALVGGGLLGYVM

Query:  YDVTHHYLHHGQP-SGEIVRNLKA
        YD+TH+YLH G P  G  + N+KA
Subjt:  YDVTHHYLHHGQP-SGEIVRNLKA

Q4R4P4 Fatty acid 2-hydroxylase3.8e-3739.01Show/hide
Query:  VDLNKPLVFQVGHLGEDYQEWVHQPIVSKEGPRFLRVIFGGIGKFIGFQTQFLTRTAWWAIPTIWLPVV---CWFISMSFRVGHT------PSEISLMV-
        VD  KPL++QVGHLGE Y EWVHQP+      R +R+    +        + L++T W+++P IW+P+V    W    +F  G+        +E +L V 
Subjt:  VDLNKPLVFQVGHLGEDYQEWVHQPIVSKEGPRFLRVIFGGIGKFIGFQTQFLTRTAWWAIPTIWLPVV---CWFISMSFRVGHT------PSEISLMV-

Query:  --------VFGIFVWTFLEYSLHRFLFHIK--TKSYWGNTIHYLLHGCHHKHPMDSLRLVFPPAATAVLLFPFWNLIRLFATPSTTPALVGGGLLGYVMY
                + GIF+W+ +EY +HRFLFH+K  + SY+   +H+++HG HHK P D  RLVFPP   ++++  F+  ++L    +    +  GGLLGYV+Y
Subjt:  --------VFGIFVWTFLEYSLHRFLFHIK--TKSYWGNTIHYLLHGCHHKHPMDSLRLVFPPAATAVLLFPFWNLIRLFATPSTTPALVGGGLLGYVMY

Query:  DVTHHYLHHGQP-SGEIVRNLKA
        D+TH+YLH G P  G  + NLKA
Subjt:  DVTHHYLHHGQP-SGEIVRNLKA

Q5MPP0 Fatty acid 2-hydroxylase1.1e-3637.95Show/hide
Query:  VDLNKPLVFQVGHLGEDYQEWVHQPIVSKEGPRFLRVIFGGIGKFIGFQTQFLTRTAWWAIPTIWLPVVCWFISMSFRVGHTPSEISL------------
        VD  KPL++QVGHLGE Y EWVHQP+      R +R+    +        +  ++T W+++P IW+P+V  ++S S+    T   I L            
Subjt:  VDLNKPLVFQVGHLGEDYQEWVHQPIVSKEGPRFLRVIFGGIGKFIGFQTQFLTRTAWWAIPTIWLPVVCWFISMSFRVGHTPSEISL------------

Query:  -------MVVFGIFVWTFLEYSLHRFLFHIK--TKSYWGNTIHYLLHGCHHKHPMDSLRLVFPPAATAVLLFPFWNLIRLFATPSTTPALVGGGLLGYVM
               + V G+  WTF+EY +HRFLFH+K  + S++   +H+++HG HHK P D  RLVFPP   ++++  F+  +RL    +    +  GGLLGYV+
Subjt:  -------MVVFGIFVWTFLEYSLHRFLFHIK--TKSYWGNTIHYLLHGCHHKHPMDSLRLVFPPAATAVLLFPFWNLIRLFATPSTTPALVGGGLLGYVM

Query:  YDVTHHYLHHGQP-SGEIVRNLKA
        YD+TH+YLH G P  G  + N+KA
Subjt:  YDVTHHYLHHGQP-SGEIVRNLKA

Q9SUC5 Dihydroceramide fatty acyl 2-hydroxylase FAH21.1e-7965.87Show/hide
Query:  MVAQGFTVDLNKPLVFQVGHLGEDYQEWVHQPIVSKEGPRFLRVIFGGIGKFIGFQTQFLTRTAWWAIPTIWLPVVCWFISMSFRVGHTPSEISLMVVFG
        MVA+ +TVDLNKPLVFQVGHLGE+YQEW+HQPIV  EGPRF    F           +FLTRT WWAIPTIWLPVVC+ +S+S   G T  +I L+V FG
Subjt:  MVAQGFTVDLNKPLVFQVGHLGEDYQEWVHQPIVSKEGPRFLRVIFGGIGKFIGFQTQFLTRTAWWAIPTIWLPVVCWFISMSFRVGHTPSEISLMVVFG

Query:  IFVWTFLEYSLHRFLFHIKTKSYWGNTIHYLLHGCHHKHPMDSLRLVFPPAATAVLLFPFWNLIRLFATPSTTPALVGGGLLGYVMYDVTHHYLHHGQPS
        +  WT LEY+LHRFLFHI+TKSYW NT HYLLHGCHHKHP D LRLVFPP ATA+LL P W L+ L ATP+T PA++GG L GYVMYD+TH+YLHHGQP 
Subjt:  IFVWTFLEYSLHRFLFHIKTKSYWGNTIHYLLHGCHHKHPMDSLRLVFPPAATAVLLFPFWNLIRLFATPSTTPALVGGGLLGYVMYDVTHHYLHHGQPS

Query:  GEIVRNLK
            ++LK
Subjt:  GEIVRNLK

Arabidopsis top hitse value%identityAlignment
AT2G34770.1 fatty acid hydroxylase 15.1e-8568.75Show/hide
Query:  MVAQGFTVDLNKPLVFQVGHLGEDYQEWVHQPIVSKEGPRFLRVIFGGIGKFIGFQTQFLTRTAWWAIPTIWLPVVCWFISMSFRVGHTPSEISLMVVFG
        MVAQGFTVDL KPLVFQVGHLGEDY+EWVHQPI +KEGPRF +  F           +FLT T WWA+P IWLPVV W IS S  +G +  EI  +VV G
Subjt:  MVAQGFTVDLNKPLVFQVGHLGEDYQEWVHQPIVSKEGPRFLRVIFGGIGKFIGFQTQFLTRTAWWAIPTIWLPVVCWFISMSFRVGHTPSEISLMVVFG

Query:  IFVWTFLEYSLHRFLFHIKTKSYWGNTIHYLLHGCHHKHPMDSLRLVFPPAATAVLLFPFWNLIRLFATPSTTPALVGGGLLGYVMYDVTHHYLHHGQPS
        IF+WTF EY LHRF+FHIKTKSYWGNT HYL+HGCHHKHPMD LRLVFPP ATA+L FPFWN+ +  +TPST PAL GGG+LGYVMYDVTH+YLHH QP+
Subjt:  IFVWTFLEYSLHRFLFHIKTKSYWGNTIHYLLHGCHHKHPMDSLRLVFPPAATAVLLFPFWNLIRLFATPSTTPALVGGGLLGYVMYDVTHHYLHHGQPS

Query:  GEIVRNLK
          + +NLK
Subjt:  GEIVRNLK

AT4G20870.1 fatty acid hydroxylase 27.6e-8165.87Show/hide
Query:  MVAQGFTVDLNKPLVFQVGHLGEDYQEWVHQPIVSKEGPRFLRVIFGGIGKFIGFQTQFLTRTAWWAIPTIWLPVVCWFISMSFRVGHTPSEISLMVVFG
        MVA+ +TVDLNKPLVFQVGHLGE+YQEW+HQPIV  EGPRF    F           +FLTRT WWAIPTIWLPVVC+ +S+S   G T  +I L+V FG
Subjt:  MVAQGFTVDLNKPLVFQVGHLGEDYQEWVHQPIVSKEGPRFLRVIFGGIGKFIGFQTQFLTRTAWWAIPTIWLPVVCWFISMSFRVGHTPSEISLMVVFG

Query:  IFVWTFLEYSLHRFLFHIKTKSYWGNTIHYLLHGCHHKHPMDSLRLVFPPAATAVLLFPFWNLIRLFATPSTTPALVGGGLLGYVMYDVTHHYLHHGQPS
        +  WT LEY+LHRFLFHI+TKSYW NT HYLLHGCHHKHP D LRLVFPP ATA+LL P W L+ L ATP+T PA++GG L GYVMYD+TH+YLHHGQP 
Subjt:  IFVWTFLEYSLHRFLFHIKTKSYWGNTIHYLLHGCHHKHPMDSLRLVFPPAATAVLLFPFWNLIRLFATPSTTPALVGGGLLGYVMYDVTHHYLHHGQPS

Query:  GEIVRNLK
            ++LK
Subjt:  GEIVRNLK

AT4G20880.1 ethylene-responsive nuclear protein / ethylene-regulated nuclear protein (ERT2)6.1e-3031.32Show/hide
Query:  LPWKTTRASRISQIVADL-QSPKRGGSLVVETGFPTSLVDLVVKHRDRFRKHSARRKSKKKKSKNEIYDPITQLSSSPTSPPPLMNPTGSAVGCRRHGNL
        LPWK +++SRIS+ V+D  QSPK GGSLVVETGFPTSL+DL VK+RDR +K S++R +K  K++ E     T++ S P  PP +      ++  +    L
Subjt:  LPWKTTRASRISQIVADL-QSPKRGGSLVVETGFPTSLVDLVVKHRDRFRKHSARRKSKKKKSKNEIYDPITQLSSSPTSPPPLMNPTGSAVGCRRHGNL

Query:  EIDDLKVVDEARVVGDRGCELVLNRTETCVVGGGAAGGNSVSMVALKLFVIAVLALSTKKLVVGITLSASLLLLLEFVGKRAVRFLKPCAHGEAALRSLI
          +D  +V   R + +     +++  +    GGG  GG  + +V  K+F++AVLALSTKKL VGITLSA  LL LE    R   +L  C   +  + SLI
Subjt:  EIDDLKVVDEARVVGDRGCELVLNRTETCVVGGGAAGGNSVSMVALKLFVIAVLALSTKKLVVGITLSASLLLLLEFVGKRAVRFLKPCAHGEAALRSLI

Query:  HKVMKHLWIGKDDLVVQNSGNCEQESVAKVPLDAFPNESIGLSELNSSIQLVEPDVDAVGTPKG--IEDEKHHLGFLGMRRMRRKVGMEPSFGCSQAIEK
         K+     IGK     ++    E+E  +    D   N    + E       V P  D+    K   + DEK  +  +  R +  K     S      I K
Subjt:  HKVMKHLWIGKDDLVVQNSGNCEQESVAKVPLDAFPNESIGLSELNSSIQLVEPDVDAVGTPKG--IEDEKHHLGFLGMRRMRRKVGMEPSFGCSQAIEK

Query:  ENREELELRNDRFNHLAADKMSIQEQSHEEEDENQEQEQDG---ETRSILCEQ-------ERD--NGRPPGFEEQWQAMEVAESCEMQDIEREGSSSYPI
        +   + +LR+ +       K  ++ +  EEE+   E E++G   E  S+  ++       ERD  +  PP  E   +  E  E+   +D+ +       +
Subjt:  ENREELELRNDRFNHLAADKMSIQEQSHEEEDENQEQEQDG---ETRSILCEQ-------ERD--NGRPPGFEEQWQAMEVAESCEMQDIEREGSSSYPI

Query:  LVLVLVGLAGLFGGRILAVVLTTAGCFMIKM
        + L+++ L GL  G++ A+ LT + C ++++
Subjt:  LVLVLVGLAGLFGGRILAVVLTTAGCFMIKM

AT5G44350.1 ethylene-responsive nuclear protein -related7.5e-1234.24Show/hide
Query:  MPLLPWKTTRASRISQIVADLQSPKRGGSLVVETGFPTSLVDLVVKHRDRFRKHSARRKSKKKKSKNEIYDPITQLSSSPTSPPPLMNPTGSAVGCRRHG
        M  LPWK  ++SRI + +++ Q      S  VETGFPTSL+DL  K+RDR +K  ++R  + ++         T  ++S  S    +    S +   R  
Subjt:  MPLLPWKTTRASRISQIVADLQSPKRGGSLVVETGFPTSLVDLVVKHRDRFRKHSARRKSKKKKSKNEIYDPITQLSSSPTSPPPLMNPTGSAVGCRRHG

Query:  NLEIDDLKVVDEARVVGDRGCELVLNRTETCVVGGGAAGGNSVSMVALKLFVIAVLALST-KKLVVGITLSASLLLLLEFVGKR
                 V++   V  +   L  N  +  V G    G   V ++A K+ ++A+LALST KKL +GITLSA  LLL E V  R
Subjt:  NLEIDDLKVVDEARVVGDRGCELVLNRTETCVVGGGAAGGNSVSMVALKLFVIAVLALST-KKLVVGITLSASLLLLLEFVGKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCCTTCTTCCATGGAAGACGACGAGGGCTTCTCGGATTTCCCAAATCGTCGCCGACCTCCAGTCGCCGAAGCGCGGGGGGTCGCTTGTTGTGGAGACTGGATTTCC
GACCTCCCTCGTTGACCTCGTCGTTAAGCACCGGGATCGATTCAGAAAGCACTCGGCGAGAAGGAAGTCAAAGAAGAAGAAATCCAAGAATGAAATTTACGACCCGATCA
CTCAACTGTCTTCTTCTCCTACATCGCCTCCTCCGCTGATGAATCCTACTGGCTCTGCCGTCGGTTGCAGACGGCATGGTAATCTAGAAATTGATGATTTGAAAGTCGTG
GATGAAGCGCGAGTAGTTGGTGATCGTGGATGCGAATTGGTCCTGAATCGGACTGAGACGTGCGTTGTCGGTGGCGGTGCGGCGGGCGGGAACAGCGTCAGTATGGTTGC
GTTGAAGTTGTTTGTGATTGCGGTTTTGGCATTGAGTACCAAGAAGCTTGTGGTTGGGATTACTCTCTCTGCTTCCCTGCTTTTACTTCTGGAATTCGTCGGCAAACGTG
CGGTTCGCTTCTTGAAACCGTGCGCACATGGCGAGGCTGCTCTGAGATCCTTGATCCACAAGGTAATGAAGCATCTCTGGATTGGAAAGGATGATCTAGTAGTACAAAAC
TCAGGAAATTGCGAACAAGAGTCGGTTGCGAAAGTCCCGCTCGATGCTTTTCCGAATGAATCGATTGGTTTATCGGAGTTAAATTCTTCCATCCAACTCGTGGAACCGGA
CGTCGACGCCGTTGGGACGCCCAAGGGAATTGAAGATGAAAAACACCATCTCGGTTTCTTGGGGATGAGAAGGATGAGGAGGAAAGTAGGGATGGAGCCGAGCTTCGGGT
GCAGTCAAGCGATCGAGAAGGAGAACAGAGAGGAGTTAGAGTTGAGAAATGACAGATTCAATCATTTGGCAGCCGATAAAATGTCGATTCAAGAACAATCACACGAGGAA
GAAGATGAAAACCAAGAACAAGAACAAGATGGAGAAACCAGAAGCATTTTATGTGAACAGGAAAGAGATAACGGAAGACCTCCTGGATTTGAAGAACAATGGCAGGCAAT
GGAAGTCGCAGAGTCGTGTGAAATGCAGGACATCGAAAGAGAAGGGAGTTCGAGTTACCCAATTCTGGTTCTGGTTCTGGTTGGTCTTGCCGGGCTGTTCGGAGGTCGGA
TTCTGGCGGTGGTGCTGACAACCGCAGGGTGTTTCATGATCAAAATGAATGAAATTTCAAGGAGAAGGTCTCCGAATCCGCCATTGAAGGCTGGCATCCGCAGATTCCAA
CAAGCGAATCAGCTGGCCGTTTCGTCGGATGCATGTTTGAATGGACGGTCCAGATCTCTTGTCCATTATACCGCAATTTTCTCAAGTTTCCAGCCGTCCCCTCGACACTG
CCACTCCTTGCAAACGGAAAAAACAGACACCCCACAACGTCCCTTCAGCGGTGGGCCTCGGTGGTGGTGGTGGTGGTGGTGGTTTTCTAGAGCGAGAAACTGCTCTCTCC
GGGAGGGAGGGAGGGGGAGAGAGAGAGAGATCGCCGCTGTTCTTTTCTGCAGCTACATCGAGGCCAAGATGGTCGCACAGGGATTCACTGTCGATTTGAATAAGCCACTT
GTATTCCAGGTAGGCCATCTTGGAGAAGATTATCAGGAGTGGGTTCACCAGCCCATTGTCAGCAAAGAAGGGCCTCGATTTTTGAGAGTGATTTTTGGAGGTATAGGAAA
GTTTATTGGTTTTCAAACGCAATTCTTGACTCGCACGGCGTGGTGGGCAATTCCTACCATTTGGCTGCCGGTTGTTTGCTGGTTCATTTCTATGTCTTTCAGGGTGGGCC
ATACACCTTCAGAGATATCTTTGATGGTGGTATTTGGAATTTTTGTATGGACTTTTCTGGAATACTCTCTTCACCGTTTCCTTTTCCATATTAAGACCAAGAGCTATTGG
GGAAACACCATACACTATCTGCTTCATGGTTGCCACCACAAACACCCAATGGATAGCCTGCGCCTCGTTTTCCCCCCTGCTGCAACTGCTGTTCTGCTCTTTCCATTCTG
GAATTTGATAAGGCTTTTTGCCACTCCATCTACCACGCCTGCTTTGGTTGGAGGTGGGTTGCTGGGCTATGTGATGTATGATGTTACCCATCACTACCTTCATCACGGAC
AACCATCTGGAGAAATTGTGCGAAACCTTAAGGCTGGCCTGTCAAAGGAAAATGGAAATTGTCTCAACTCAACGACAAAGTCCCATTTTGTTGTAGTGTCTCAACAAAGA
GAGAAGTCTCTTTTAGGGCAACTTCTCCCATCAAACATAGAAAATGGTGGGATTTGGTACCTGGTGGATAAAGAATGGTTTCATGATATGGAGGATATGGACTATACCGC
TGCCATCGCGGCCTCAGCGCCGTCGTTGTCCGTCAAGGTCTGGCCATGGCCGTCGTTTTATGTCTCGTCGTCGATCTTCACTCGCCCCCCCGAGTCGGCGTCCGGTGATC
CCCTCCTTGCTCGAAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCCCTTCTTCCATGGAAGACGACGAGGGCTTCTCGGATTTCCCAAATCGTCGCCGACCTCCAGTCGCCGAAGCGCGGGGGGTCGCTTGTTGTGGAGACTGGATTTCC
GACCTCCCTCGTTGACCTCGTCGTTAAGCACCGGGATCGATTCAGAAAGCACTCGGCGAGAAGGAAGTCAAAGAAGAAGAAATCCAAGAATGAAATTTACGACCCGATCA
CTCAACTGTCTTCTTCTCCTACATCGCCTCCTCCGCTGATGAATCCTACTGGCTCTGCCGTCGGTTGCAGACGGCATGGTAATCTAGAAATTGATGATTTGAAAGTCGTG
GATGAAGCGCGAGTAGTTGGTGATCGTGGATGCGAATTGGTCCTGAATCGGACTGAGACGTGCGTTGTCGGTGGCGGTGCGGCGGGCGGGAACAGCGTCAGTATGGTTGC
GTTGAAGTTGTTTGTGATTGCGGTTTTGGCATTGAGTACCAAGAAGCTTGTGGTTGGGATTACTCTCTCTGCTTCCCTGCTTTTACTTCTGGAATTCGTCGGCAAACGTG
CGGTTCGCTTCTTGAAACCGTGCGCACATGGCGAGGCTGCTCTGAGATCCTTGATCCACAAGGTAATGAAGCATCTCTGGATTGGAAAGGATGATCTAGTAGTACAAAAC
TCAGGAAATTGCGAACAAGAGTCGGTTGCGAAAGTCCCGCTCGATGCTTTTCCGAATGAATCGATTGGTTTATCGGAGTTAAATTCTTCCATCCAACTCGTGGAACCGGA
CGTCGACGCCGTTGGGACGCCCAAGGGAATTGAAGATGAAAAACACCATCTCGGTTTCTTGGGGATGAGAAGGATGAGGAGGAAAGTAGGGATGGAGCCGAGCTTCGGGT
GCAGTCAAGCGATCGAGAAGGAGAACAGAGAGGAGTTAGAGTTGAGAAATGACAGATTCAATCATTTGGCAGCCGATAAAATGTCGATTCAAGAACAATCACACGAGGAA
GAAGATGAAAACCAAGAACAAGAACAAGATGGAGAAACCAGAAGCATTTTATGTGAACAGGAAAGAGATAACGGAAGACCTCCTGGATTTGAAGAACAATGGCAGGCAAT
GGAAGTCGCAGAGTCGTGTGAAATGCAGGACATCGAAAGAGAAGGGAGTTCGAGTTACCCAATTCTGGTTCTGGTTCTGGTTGGTCTTGCCGGGCTGTTCGGAGGTCGGA
TTCTGGCGGTGGTGCTGACAACCGCAGGGTGTTTCATGATCAAAATGAATGAAATTTCAAGGAGAAGGTCTCCGAATCCGCCATTGAAGGCTGGCATCCGCAGATTCCAA
CAAGCGAATCAGCTGGCCGTTTCGTCGGATGCATGTTTGAATGGACGGTCCAGATCTCTTGTCCATTATACCGCAATTTTCTCAAGTTTCCAGCCGTCCCCTCGACACTG
CCACTCCTTGCAAACGGAAAAAACAGACACCCCACAACGTCCCTTCAGCGGTGGGCCTCGGTGGTGGTGGTGGTGGTGGTGGTTTTCTAGAGCGAGAAACTGCTCTCTCC
GGGAGGGAGGGAGGGGGAGAGAGAGAGAGATCGCCGCTGTTCTTTTCTGCAGCTACATCGAGGCCAAGATGGTCGCACAGGGATTCACTGTCGATTTGAATAAGCCACTT
GTATTCCAGGTAGGCCATCTTGGAGAAGATTATCAGGAGTGGGTTCACCAGCCCATTGTCAGCAAAGAAGGGCCTCGATTTTTGAGAGTGATTTTTGGAGGTATAGGAAA
GTTTATTGGTTTTCAAACGCAATTCTTGACTCGCACGGCGTGGTGGGCAATTCCTACCATTTGGCTGCCGGTTGTTTGCTGGTTCATTTCTATGTCTTTCAGGGTGGGCC
ATACACCTTCAGAGATATCTTTGATGGTGGTATTTGGAATTTTTGTATGGACTTTTCTGGAATACTCTCTTCACCGTTTCCTTTTCCATATTAAGACCAAGAGCTATTGG
GGAAACACCATACACTATCTGCTTCATGGTTGCCACCACAAACACCCAATGGATAGCCTGCGCCTCGTTTTCCCCCCTGCTGCAACTGCTGTTCTGCTCTTTCCATTCTG
GAATTTGATAAGGCTTTTTGCCACTCCATCTACCACGCCTGCTTTGGTTGGAGGTGGGTTGCTGGGCTATGTGATGTATGATGTTACCCATCACTACCTTCATCACGGAC
AACCATCTGGAGAAATTGTGCGAAACCTTAAGGCTGGCCTGTCAAAGGAAAATGGAAATTGTCTCAACTCAACGACAAAGTCCCATTTTGTTGTAGTGTCTCAACAAAGA
GAGAAGTCTCTTTTAGGGCAACTTCTCCCATCAAACATAGAAAATGGTGGGATTTGGTACCTGGTGGATAAAGAATGGTTTCATGATATGGAGGATATGGACTATACCGC
TGCCATCGCGGCCTCAGCGCCGTCGTTGTCCGTCAAGGTCTGGCCATGGCCGTCGTTTTATGTCTCGTCGTCGATCTTCACTCGCCCCCCCGAGTCGGCGTCCGGTGATC
CCCTCCTTGCTCGAAGTTGA
Protein sequenceShow/hide protein sequence
MPLLPWKTTRASRISQIVADLQSPKRGGSLVVETGFPTSLVDLVVKHRDRFRKHSARRKSKKKKSKNEIYDPITQLSSSPTSPPPLMNPTGSAVGCRRHGNLEIDDLKVV
DEARVVGDRGCELVLNRTETCVVGGGAAGGNSVSMVALKLFVIAVLALSTKKLVVGITLSASLLLLLEFVGKRAVRFLKPCAHGEAALRSLIHKVMKHLWIGKDDLVVQN
SGNCEQESVAKVPLDAFPNESIGLSELNSSIQLVEPDVDAVGTPKGIEDEKHHLGFLGMRRMRRKVGMEPSFGCSQAIEKENREELELRNDRFNHLAADKMSIQEQSHEE
EDENQEQEQDGETRSILCEQERDNGRPPGFEEQWQAMEVAESCEMQDIEREGSSSYPILVLVLVGLAGLFGGRILAVVLTTAGCFMIKMNEISRRRSPNPPLKAGIRRFQ
QANQLAVSSDACLNGRSRSLVHYTAIFSSFQPSPRHCHSLQTEKTDTPQRPFSGGPRWWWWWWWFSRARNCSLREGGRGREREIAAVLFCSYIEAKMVAQGFTVDLNKPL
VFQVGHLGEDYQEWVHQPIVSKEGPRFLRVIFGGIGKFIGFQTQFLTRTAWWAIPTIWLPVVCWFISMSFRVGHTPSEISLMVVFGIFVWTFLEYSLHRFLFHIKTKSYW
GNTIHYLLHGCHHKHPMDSLRLVFPPAATAVLLFPFWNLIRLFATPSTTPALVGGGLLGYVMYDVTHHYLHHGQPSGEIVRNLKAGLSKENGNCLNSTTKSHFVVVSQQR
EKSLLGQLLPSNIENGGIWYLVDKEWFHDMEDMDYTAAIAASAPSLSVKVWPWPSFYVSSSIFTRPPESASGDPLLARS