CuGenDBv2

Lsi01G006380 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi01G006380
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: 3-epi-6-deoxocathasterone 23-monooxygenase
Genome location: chr01:5136644..5143389
RNA-Seq Expression: Lsi01G006380
Synteny: Lsi01G006380
Gene Ontology terms:
GO:0048856 - anatomical structure development (biological process)
GO:0004497 - monooxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
GO:0020037 - heme binding (molecular function)
InterPro domains:
IPR001128 - Cytochrome P450
IPR002397 - Cytochrome P450, B-class
IPR017972 - Cytochrome P450, conserved site
IPR036396 - Cytochrome P450 superfamily
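
The genome location recorded for this gene (chr01:5136644..5143389) can be split into a chromosome name and a coordinate pair. The following is a minimal sketch, not part of the database record. It assumes the coordinates are 1-based and inclusive, which this page does not state, and `parse_location` is a hypothetical helper name.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the
    span length is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, length = parse_location("chr01:5136644..5143389")
print(chrom, start, end, length)
```

Under the inclusive-coordinate assumption, the gene spans 6746 bp on chr01.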


Homology
GenBank top hits (e value, % identity, alignment)
XP_004149982.1 3-epi-6-deoxocathasterone 23-monooxygenase CYP90C1 [Cucumis sativus] (e value 1.2e-132, identity 55.62%)
Query:  MGFWVWFVSVVILGWFWLKKKKKHIGNKDGIPKGNLGWPLFGETLHFISSGYSSRPVTFMDKRKSL----------------------------------
        MG WV+    VILGWF LKKKKK + NKDGIPKGNLGWP FGETL FISSGYSSRPVTFMDKRKSL                                  
Subjt:  MGFWVWFVSVVILGWFWLKKKKKHIGNKDGIPKGNLGWPLFGETLHFISSGYSSRPVTFMDKRKSL----------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------AKLLKIVSKIVEDRKSRVEEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEETVPMAMTLAVKFLSDC
                                  KLLKIV KIVE+RKS V EK APRDAVDVLLQDH+E QGLPLDFISSHIIELMIPGEETVP AMTLAVKFLSDC
Subjt:  -------------------------AKLLKIVSKIVEDRKSRVEEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEETVPMAMTLAVKFLSDC

Query:  PCALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKDVQIKGYLIPQGWCV
        P ALAQL                          +EENMKLKK+KDGSGEEYTWTDYMSL FTQNVISETLRMANI+NGVWRKAQKDV+IKGYLIPQGWCV
Subjt:  PCALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKDVQIKGYLIPQGWCV

Query:  LASFISVHMDEKNYANPHQFDPWRWEENLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPTVRMRRKLPITITTLSS
        LASFISVHMDEKNYANPH+FDPWRWEENLSATNNHNFTPFGGGQRLCPG+EL RLEISIFLHHLVTTY+W AEKD II+FPTV+MRRKLPIT+TTLSS
Subjt:  LASFISVHMDEKNYANPHQFDPWRWEENLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPTVRMRRKLPITITTLSS

XP_008440770.1 PREDICTED: 3-epi-6-deoxocathasterone 23-monooxygenase [Cucumis melo] (e value 6.3e-137, identity 56.51%)
Query:  MGFWVWFVSVVILGWFWLKKKKKHIGNKDGIPKGNLGWPLFGETLHFISSGYSSRPVTFMDKRKSL----------------------------------
        MGFWVWFV  VILGWF  KKKKK I NKDGIPKGNLGWP FGETL FISSGYSSRPVTFMDKRKSL                                  
Subjt:  MGFWVWFVSVVILGWFWLKKKKKHIGNKDGIPKGNLGWPLFGETLHFISSGYSSRPVTFMDKRKSL----------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------AKLLKIVSKIVEDRKSRVEEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEETVPMAMTLAVKFLSD
                                   KLLKIV KIVE+RKSRV EK APRDAVDVLLQDH E QGLPLDFISSHIIELMIPGEETVP AMTLAVKFL D
Subjt:  --------------------------AKLLKIVSKIVEDRKSRVEEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEETVPMAMTLAVKFLSD

Query:  CPCALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKDVQIKGYLIPQGWC
        CP ALAQLM                          EENMKLKK+KDGSGEEYTWTDYMSL FTQNVISETLRMANI+NGVWRKAQKDV+IK YLIPQGWC
Subjt:  CPCALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKDVQIKGYLIPQGWC

Query:  VLASFISVHMDEKNYANPHQFDPWRWEENLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPTVRMRRKLPITITTLSS
        VLASFISVHMDEKNYANPH+FDPWRWEENLSATNNHNFTPFGGGQRLCPG+EL RLEISIFLHHLVTTY+W AEKDEI+HFPTV+MRRKLPIT+TTLSS
Subjt:  VLASFISVHMDEKNYANPHQFDPWRWEENLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPTVRMRRKLPITITTLSS

XP_022963240.1 3-epi-6-deoxocathasterone 23-monooxygenase [Cucurbita moschata] (e value 2.7e-124, identity 52.25%)
Query:  MEWVMGFWVWFVSVVILGWFWLKK-----KKKHIGNKDGIPKGNLGWPLFGETLHFISSGYSSRPVTFMDKRKSL-------------------------
        MEWV+GFWVWF+SVV L     K       K HI NK GIPKG+LGWP  GETL FI+SGYSSRPVTFM+KRKSL                         
Subjt:  MEWVMGFWVWFVSVVILGWFWLKK-----KKKHIGNKDGIPKGNLGWPLFGETLHFISSGYSSRPVTFMDKRKSL-------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------AKLLKIVSKIVEDRKSRV--------EEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPG
                                            KLLKIVSKIVE+R+S          EEK+ PRDAVDVLLQD+D +QGLPLDFISSHIIELMIPG
Subjt:  -----------------------------------AKLLKIVSKIVEDRKSRV--------EEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPG

Query:  EETVPMAMTLAVKFLSDCPCALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRK
        EETVPMAMTLAVKFLSDCP ALAQ+M                          EENM+LKKQKD SGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRK
Subjt:  EETVPMAMTLAVKFLSDCPCALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRK

Query:  AQKDVQIKGYLIPQGWCVLASFISVHMDEKNYANPHQFDPWRWEENLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPT
        AQKDV+IKGYLIPQGWCVLASFISVHMDE NY N HQFDPWRWE   S +NN NFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAE+DEIIHFPT
Subjt:  AQKDVQIKGYLIPQGWCVLASFISVHMDEKNYANPHQFDPWRWEENLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPT

Query:  VRMRRKLPITI
        VRMRRKLPI I
Subjt:  VRMRRKLPITI

XP_023518560.1 3-epi-6-deoxocathasterone 23-monooxygenase [Cucurbita pepo subsp. pepo] (e value 1.2e-124, identity 52.65%)
Query:  MEWVMGFWVWFVSVVILGWFWLKKK-----KKHIGNKDGIPKGNLGWPLFGETLHFISSGYSSRPVTFMDKRKSL-------------------------
        MEWV+GFWVWF+SVV L     K       K HI NK GIPKG+LGWP  GETL FI+SGYSSRPVTFM+KRKSL                         
Subjt:  MEWVMGFWVWFVSVVILGWFWLKKK-----KKHIGNKDGIPKGNLGWPLFGETLHFISSGYSSRPVTFMDKRKSL-------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------AKLLKIVSKIVEDRKSRV------EEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEE
                                            KLLKIVSKIVE+R+S        EEKV PRDAVDVLLQD+D +QGLPLDFISSHIIELMIPGEE
Subjt:  -----------------------------------AKLLKIVSKIVEDRKSRV------EEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEE

Query:  TVPMAMTLAVKFLSDCPCALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQ
        TVPMAMTLAVKFLSDCP ALAQ+M                          EENM+LKKQKD SGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQ
Subjt:  TVPMAMTLAVKFLSDCPCALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQ

Query:  KDVQIKGYLIPQGWCVLASFISVHMDEKNYANPHQFDPWRWEENLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPTVR
        KDV+IKGYLIPQGWCVLASFISVHMDE NY N HQFDPWRWE   S +NN NFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAE+DEIIHFPTVR
Subjt:  KDVQIKGYLIPQGWCVLASFISVHMDEKNYANPHQFDPWRWEENLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPTVR

Query:  MRRKLPITI
        MRRKLPI I
Subjt:  MRRKLPITI

XP_038881879.1 3-epi-6-deoxocathasterone 23-monooxygenase CYP90C1 [Benincasa hispida] (e value 2.9e-150, identity 60.12%)
Query:  MEWVMGFWVWFVSVVILGWFWLKKKKK-HIGNKDGIPKGNLGWPLFGETLHFISSGYSSRPVTFMDKRKSL-----------------------------
        MEWV+GFWVWFVSVVILGWFWLK+KKK HIGN DGIP+GNLGWP FGETL FISSGYSSRPVTFMDKRKSL                             
Subjt:  MEWVMGFWVWFVSVVILGWFWLKKKKK-HIGNKDGIPKGNLGWPLFGETLHFISSGYSSRPVTFMDKRKSL-----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------AKLLKIVSKIVEDRKSRVEEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEETVPMAMTLAV
                                        KLLKIVSKIVEDRKSRVEEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEETVPMAMTLAV
Subjt:  -------------------------------AKLLKIVSKIVEDRKSRVEEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEETVPMAMTLAV

Query:  KFLSDCPCALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKDVQIKGYLI
        KFLSDCP ALAQLM                          EENMKLKKQK+GSGEEY WTDYMSLLFTQNVISETLRMANIINGVWRKAQKDV+IK YLI
Subjt:  KFLSDCPCALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKDVQIKGYLI

Query:  PQGWCVLASFISVHMDEKNYANPHQFDPWRWEENLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPTVRMRRKLPITIT
        PQGWCVLASFISVHMDEKNYANPHQFDPWRWEENLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPTV+MRRKLPITIT
Subjt:  PQGWCVLASFISVHMDEKNYANPHQFDPWRWEENLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPTVRMRRKLPITIT

Query:  TLSS
         LSS
Subjt:  TLSS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KH50 Uncharacterized protein (e value 1.9e-123, identity 52.81%)
Query:  MGFWVWFVSVVILGWFWLKKKKKHIGNKDGIPKGNLGWPLFGETLHFISSGYSSRPVTFMDKRKSL----------------------------------
        MG WV+    VILGWF LKKKKK + NKDGIPKGNLGWP FGETL FISSGYSSRPVTFMDKRKSL                                  
Subjt:  MGFWVWFVSVVILGWFWLKKKKKHIGNKDGIPKGNLGWPLFGETLHFISSGYSSRPVTFMDKRKSL----------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------AKLLKIVSKIVEDRKSRVEEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEETVPMAMTLAVKFLSDC
                                  KLLKIV KIVE+RKS V EK APRDAVDVLLQDH+E QGLPLDFISSHIIELMIPGEETVP AMTLAVKFLSDC
Subjt:  -------------------------AKLLKIVSKIVEDRKSRVEEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEETVPMAMTLAVKFLSDC

Query:  PCALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKDVQIKGYLIPQGWCV
        P ALAQL                                         +EYTWTDYMSL FTQNVISETLRMANI+NGVWRKAQKDV+IKGYLIPQGWCV
Subjt:  PCALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKDVQIKGYLIPQGWCV

Query:  LASFISVHMDEKNYANPHQFDPWRWEENLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPTVRMRRKLPITITTLSS
        LASFISVHMDEKNYANPH+FDPWRWEENLSATNNHNFTPFGGGQRLCPG+EL RLEISIFLHHLVTTY+W AEKD II+FPTV+MRRKLPIT+TTLSS
Subjt:  LASFISVHMDEKNYANPHQFDPWRWEENLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPTVRMRRKLPITITTLSS

A0A1S3B2I5 3-epi-6-deoxocathasterone 23-monooxygenase (e value 3.0e-137, identity 56.51%)
Query:  MGFWVWFVSVVILGWFWLKKKKKHIGNKDGIPKGNLGWPLFGETLHFISSGYSSRPVTFMDKRKSL----------------------------------
        MGFWVWFV  VILGWF  KKKKK I NKDGIPKGNLGWP FGETL FISSGYSSRPVTFMDKRKSL                                  
Subjt:  MGFWVWFVSVVILGWFWLKKKKKHIGNKDGIPKGNLGWPLFGETLHFISSGYSSRPVTFMDKRKSL----------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------AKLLKIVSKIVEDRKSRVEEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEETVPMAMTLAVKFLSD
                                   KLLKIV KIVE+RKSRV EK APRDAVDVLLQDH E QGLPLDFISSHIIELMIPGEETVP AMTLAVKFL D
Subjt:  --------------------------AKLLKIVSKIVEDRKSRVEEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEETVPMAMTLAVKFLSD

Query:  CPCALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKDVQIKGYLIPQGWC
        CP ALAQLM                          EENMKLKK+KDGSGEEYTWTDYMSL FTQNVISETLRMANI+NGVWRKAQKDV+IK YLIPQGWC
Subjt:  CPCALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKDVQIKGYLIPQGWC

Query:  VLASFISVHMDEKNYANPHQFDPWRWEENLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPTVRMRRKLPITITTLSS
        VLASFISVHMDEKNYANPH+FDPWRWEENLSATNNHNFTPFGGGQRLCPG+EL RLEISIFLHHLVTTY+W AEKDEI+HFPTV+MRRKLPIT+TTLSS
Subjt:  VLASFISVHMDEKNYANPHQFDPWRWEENLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPTVRMRRKLPITITTLSS

A0A6J1BWQ0 3-epi-6-deoxocathasterone 23-monooxygenase isoform X2 (e value 3.9e-124, identity 52.82%)
Query:  EWVMGFWVWFVSVVILGWFWLKKKKKHIGNKDG-IPKGNLGWPLFGETLHFISSGYSSRPVTFMDKRKSL------------------------------
        +WV+GFWV F+S V+  WFW+  K ++I NKDG IPKGNLGWP  GETL FI+SGYSSRPVTFMDKRKSL                              
Subjt:  EWVMGFWVWFVSVVILGWFWLKKKKKHIGNKDG-IPKGNLGWPLFGETLHFISSGYSSRPVTFMDKRKSL------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------AKLLKIVSKIVEDRKSRVEEKVAP-RDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEETVPMAMTLAVKFLSDCPC
                               KLLKIV KIVE++K  + EK AP +DAV+VLL+DH+  QGLPLDFISSH+IELMIPGEETVPMAMTLAVKFLSD P 
Subjt:  ----------------------AKLLKIVSKIVEDRKSRVEEKVAP-RDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEETVPMAMTLAVKFLSDCPC

Query:  ALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKDVQIKGYLIPQGWCVLA
        ALAQ M                          EEN +LKKQKD SGEEYTWTD+MSL FTQNVISETLRMANIINGVWRKAQKDV+IKGYLIPQGWCVLA
Subjt:  ALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKDVQIKGYLIPQGWCVLA

Query:  SFISVHMDEKNYANPHQFDPWRWEENLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPTVRMRRKLPITITTLSS
        SFISVHMDE+NYANPHQF+PWRWEE  SATNNHNFTPFGGGQRLCPGIELARLEISIFLH LVTTYKW AEKDEIIHFPTV+MRRKLPI ITT++S
Subjt:  SFISVHMDEKNYANPHQFDPWRWEENLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPTVRMRRKLPITITTLSS

A0A6J1HHF1 3-epi-6-deoxocathasterone 23-monooxygenase (e value 1.3e-124, identity 52.25%)
Query:  MEWVMGFWVWFVSVVILGWFWLKK-----KKKHIGNKDGIPKGNLGWPLFGETLHFISSGYSSRPVTFMDKRKSL-------------------------
        MEWV+GFWVWF+SVV L     K       K HI NK GIPKG+LGWP  GETL FI+SGYSSRPVTFM+KRKSL                         
Subjt:  MEWVMGFWVWFVSVVILGWFWLKK-----KKKHIGNKDGIPKGNLGWPLFGETLHFISSGYSSRPVTFMDKRKSL-------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------AKLLKIVSKIVEDRKSRV--------EEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPG
                                            KLLKIVSKIVE+R+S          EEK+ PRDAVDVLLQD+D +QGLPLDFISSHIIELMIPG
Subjt:  -----------------------------------AKLLKIVSKIVEDRKSRV--------EEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPG

Query:  EETVPMAMTLAVKFLSDCPCALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRK
        EETVPMAMTLAVKFLSDCP ALAQ+M                          EENM+LKKQKD SGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRK
Subjt:  EETVPMAMTLAVKFLSDCPCALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRK

Query:  AQKDVQIKGYLIPQGWCVLASFISVHMDEKNYANPHQFDPWRWEENLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPT
        AQKDV+IKGYLIPQGWCVLASFISVHMDE NY N HQFDPWRWE   S +NN NFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAE+DEIIHFPT
Subjt:  AQKDVQIKGYLIPQGWCVLASFISVHMDEKNYANPHQFDPWRWEENLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPT

Query:  VRMRRKLPITI
        VRMRRKLPI I
Subjt:  VRMRRKLPITI

A0A6J1KKY8 3-epi-6-deoxocathasterone 23-monooxygenase (e value 1.1e-123, identity 52.27%)
Query:  MEWVMGFWVWFVSVVILGWFWLK-------KKKKHIGNKDGIPKGNLGWPLFGETLHFISSGYSSRPVTFMDKRKSL-----------------------
        MEWV+GFWVWF+SVV L     K         K HI NK GIPKG+LGWP  GETL FI+SGYSSRPVTFM+ RKSL                       
Subjt:  MEWVMGFWVWFVSVVILGWFWLK-------KKKKHIGNKDGIPKGNLGWPLFGETLHFISSGYSSRPVTFMDKRKSL-----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------AKLLKIVSKIVEDRKSRV--EEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEETV
                                              KLLKIVSKIVE+R+S    EEK +PRDAVDVLLQD+D +QGLPLDFISSHIIELMIPGEETV
Subjt:  -------------------------------------AKLLKIVSKIVEDRKSRV--EEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEETV

Query:  PMAMTLAVKFLSDCPCALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKD
        PMAMTLAVKFLSDCP ALAQ+M                          EENM+LKKQKD SGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKD
Subjt:  PMAMTLAVKFLSDCPCALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKD

Query:  VQIKGYLIPQGWCVLASFISVHMDEKNYANPHQFDPWRWEENLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPTVRMR
        V+IKGYLIPQGWCVLASFISVHMDE NY N HQFDPWRWE   S + N NFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAE+DEIIHFPTVRMR
Subjt:  VQIKGYLIPQGWCVLASFISVHMDEKNYANPHQFDPWRWEENLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPTVRMR

Query:  RKLPITI
        RKLPI I
Subjt:  RKLPITI

SwissProt top hits (e value, % identity, alignment)
Q2RAP4 Cytochrome P450 90A3 (e value 1.2e-45, identity 39.72%)
Query:  KRKSLAKLLKIVSKIVEDRKSR------VEEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEETVPMAMTLAVKFLSDCPCALAQLMEEKKYK
        ++K    L +++ K +E++          E K   +D V+ LL+   E      + +    + L++ G ET  M MTLAVKFL++ P ALA+L EE    
Subjt:  KRKSLAKLLKIVSKIVEDRKSR------VEEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEETVPMAMTLAVKFLSDCPCALAQLMEEKKYK

Query:  WRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKDVQIKGYLIPQGWCVLASFISVHMDEKNY
                    + RD       MK KKQ         W+DY S+ FTQ VI+ETLR+ NII+GV+R+A  D+  K Y IP+G  + ASF +VH++ ++Y
Subjt:  WRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKDVQIKGYLIPQGWCVLASFISVHMDEKNY

Query:  ANPHQFDPWRWEEN---LSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKW-GAEKDEIIHFPTVRMRRKLPITITTLS
         N   F+PWRW+ N    +A   + FTPFGGG RLCPG ELAR+ +SIFLHHLVT + W   E+D ++ FPT R  +  PI +  LS
Subjt:  ANPHQFDPWRWEEN---LSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKW-GAEKDEIIHFPTVRMRRKLPITITTLS

Q42569 Cytochrome P450 90A1 (e value 1.0e-49, identity 42.18%)
Query:  KRKSLAKLLKIVSKIVEDRKSRVEEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEETVPMAMTLAVKFLSDCPCALAQLMEEKKYKWRETHE
        +RK    L  +V K  E+ +   E K   +D +  LL   D   G   + I   ++ L++ G ET    MTLAVKFL++ P ALAQL EE        HE
Subjt:  KRKSLAKLLKIVSKIVEDRKSRVEEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEETVPMAMTLAVKFLSDCPCALAQLMEEKKYKWRETHE

Query:  RGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKDVQIKGYLIPQGWCVLASFISVHMDEKNYANPHQF
                          K++  K  S     W+DY S+ FTQ V++ETLR+ANII GV+R+A  DV+IKGY IP+GW V +SF +VH+D  ++ +   F
Subjt:  RGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKDVQIKGYLIPQGWCVLASFISVHMDEKNYANPHQF

Query:  DPWRWEENLSATNNHN-FTPFGGGQRLCPGIELARLEISIFLHHLVTTYKW-GAEKDEIIHFPTVRMRRKLPITI
        +PWRW+ N   T   N FTPFGGG RLCPG ELAR+ +S+FLH LVT + W  AE+D+++ FPT R +++ PI +
Subjt:  DPWRWEENLSATNNHN-FTPFGGGQRLCPGIELARLEISIFLHHLVTTYKW-GAEKDEIIHFPTVRMRRKLPITI

Q94IA6 3-epi-6-deoxocathasterone 23-monooxygenase CYP90D1 (e value 1.7e-68, identity 47.96%)
Query:  SGYSSRPVTF--------MDKRKSLAKLL-KIVSKIVEDRKSRVEEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEETVPMAMTLAVKFLSD
        SG  S P+ F        +  +K++ K + +I+   +   K++ E+ V  +D VDVLL+D  E   L  + I++++I++MIPG ++VP+ +TLAVKFLSD
Subjt:  SGYSSRPVTF--------MDKRKSLAKLL-KIVSKIVEDRKSRVEEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEETVPMAMTLAVKFLSD

Query:  CPCALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKDVQIKGYLIPQGWC
         P AL  L                           EENMKLK  K+ +GE   W DY+SL FTQ VI+ETLRM N+I GV RKA KDV+IKGY+IP+GWC
Subjt:  CPCALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKDVQIKGYLIPQGWC

Query:  VLASFISVHMDEKNYANPHQFDPWRWEENLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPTVRMRRKLPITI
         LA   SVH+D+  Y +P++F+PWRW+E     N  +F+PFGGGQRLCPG++LARLE S+FLHHLVT ++W AE+D II+FPTV M+ KLPI I
Subjt:  VLASFISVHMDEKNYANPHQFDPWRWEENLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPTVRMRRKLPITI

Q94IW5 Cytochrome P450 90D2 (e value 4.3e-72, identity 50%)
Query:  KLLKIVSKIVEDRKSRVEEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEETVPMAMTLAVKFLSDCPCALAQLMEEKKYKWRETHERGSARV
        K+ +++ +I+ ++++R      PRDA+DVL+ D  +   L  + IS ++I+LMIP E++VP+ +TLAVKFLS+CP AL QL                   
Subjt:  KLLKIVSKIVEDRKSRVEEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEETVPMAMTLAVKFLSDCPCALAQLMEEKKYKWRETHERGSARV

Query:  SERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKDVQIKGYLIPQGWCVLASFISVHMDEKNYANPHQFDPWRWE
               EEEN++LK++K   GE   WTDYMSL FTQ+VI+ETLR+ NII G+ RKA +DV++KG+LIP+GWCV   F SVH+D+  Y  P++F+PWRW+
Subjt:  SERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKDVQIKGYLIPQGWCVLASFISVHMDEKNYANPHQFDPWRWE

Query:  ENLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPTVRMRRKLPITIT
        E     +N +FTPFGGGQRLCPG++LARLE SIFLHHLVT+++W AE+D I++FPTVR++R +PI +T
Subjt:  ENLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPTVRMRRKLPITIT

Q94IW5 Cytochrome P450 90D2 (e value 3.2e-06, identity 41.07%)
Query:  LKKKKKHIGNKDG-------IPKGNLGWPLFGETLHFISSGYSSRPVTFMDKRKSL
        L+++++  G+  G       +P G+ GWP+ GETL F+S  YS RP  F+DKR+ L
Subjt:  LKKKKKHIGNKDG-------IPKGNLGWPLFGETLHFISSGYSSRPVTFMDKRKSL

Q9M066 3-epi-6-deoxocathasterone 23-monooxygenase CYP90C1 (e value 1.5e-88, identity 40.08%)
Query:  VMGFWVWFVSVVILGWFWLKKKKKHIGNKDG-------------IPKGNLGWPLFGETLHFISSGYSSRPVTFMDKRKSL--------------------
        V GF V    +++  W WL+ +      KDG             IP G+LGWP+ GETL+FI+ GYSSRPVTFMDKRKSL                    
Subjt:  VMGFWVWFVSVVILGWFWLKKKKKHIGNKDG-------------IPKGNLGWPLFGETLHFISSGYSSRPVTFMDKRKSL--------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------AKLLKIVSKIVEDRKSRVEEKVAPRDAVDVLLQD--HDEIQGLPLDFISSHIIELMIPGE
                                                 +L+K+V K+VE+R+  +       D VDVLL+D    E Q  P DF+S  I+E+MIPGE
Subjt:  ----------------------------------------AKLLKIVSKIVEDRKSRVEEKVAPRDAVDVLLQD--HDEIQGLPLDFISSHIIELMIPGE

Query:  ETVPMAMTLAVKFLSDCPCALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKA
        ET+P AMTLAVKFLSD P ALA+L+                          EENM++K++K   GEEY WTDYMSL FTQNVI+ETLRMANIINGVWRKA
Subjt:  ETVPMAMTLAVKFLSDCPCALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKA

Query:  QKDVQIKGYLIPQGWCVLASFISVHMDEKNYANPHQFDPWRWEE-NLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPT
         KDV+IKGYLIP+GWCVLASFISVHMDE  Y NP+QFDPWRW+  N SA ++  FTPFGGGQRLCPG+EL++LEISIFLHHLVT Y W AE+DEI+ FPT
Subjt:  QKDVQIKGYLIPQGWCVLASFISVHMDEKNYANPHQFDPWRWEE-NLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPT

Query:  VRMRRKLPITITTL
        V+M+R+LPI + T+
Subjt:  VRMRRKLPITITTL

Arabidopsis top hits (e value, % identity, alignment)
AT3G13730.1 cytochrome P450, family 90, subfamily D, polypeptide 1 (e value 1.2e-69, identity 47.96%)
Query:  SGYSSRPVTF--------MDKRKSLAKLL-KIVSKIVEDRKSRVEEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEETVPMAMTLAVKFLSD
        SG  S P+ F        +  +K++ K + +I+   +   K++ E+ V  +D VDVLL+D  E   L  + I++++I++MIPG ++VP+ +TLAVKFLSD
Subjt:  SGYSSRPVTF--------MDKRKSLAKLL-KIVSKIVEDRKSRVEEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEETVPMAMTLAVKFLSD

Query:  CPCALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKDVQIKGYLIPQGWC
         P AL  L                           EENMKLK  K+ +GE   W DY+SL FTQ VI+ETLRM N+I GV RKA KDV+IKGY+IP+GWC
Subjt:  CPCALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKDVQIKGYLIPQGWC

Query:  VLASFISVHMDEKNYANPHQFDPWRWEENLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPTVRMRRKLPITI
         LA   SVH+D+  Y +P++F+PWRW+E     N  +F+PFGGGQRLCPG++LARLE S+FLHHLVT ++W AE+D II+FPTV M+ KLPI I
Subjt:  VLASFISVHMDEKNYANPHQFDPWRWEENLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPTVRMRRKLPITI

AT3G50660.1 Cytochrome P450 superfamily protein (e value 8.2e-42, identity 34.64%)
Query:  KSLAKLLKIVSKIVEDRK------SRVEEKVAPRDAVDVLLQDH-------DEIQG-------LPLDFISSHIIELMIPGEETVPMAMTLAVKFLSDCPC
        +S A +LK + + +E+RK       + EE+V   D  ++   DH       D++ G       L  + I   I+ L+  G ET  +A+ LA+ FL  CP 
Subjt:  KSLAKLLKIVSKIVEDRK------SRVEEKVAPRDAVDVLLQDH-------DEIQG-------LPLDFISSHIIELMIPGEETVPMAMTLAVKFLSDCPC

Query:  ALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGE-EYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKDVQIKGYLIPQGWCVL
        A+ +L                           EE++++ + K   GE E  W DY  + FTQ VI+ETLR+ N++  + RKA KDV+ KGY IP GW VL
Subjt:  ALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGE-EYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKDVQIKGYLIPQGWCVL

Query:  ASFISVHMDEKNYANPHQFDPWRWEENLS----------ATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWG-AEKDEIIHFPTVRMRRKLP
            +VH+D   Y  P+ F+PWRW++  +          +T  +N+ PFGGG RLC G ELA+LE+++F+HHLV  + W  AE D+   FP V     LP
Subjt:  ASFISVHMDEKNYANPHQFDPWRWEENLS----------ATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWG-AEKDEIIHFPTVRMRRKLP

Query:  ITITTL
        I ++ +
Subjt:  ITITTL

AT4G36380.1 Cytochrome P450 superfamily protein (e value 1.1e-89, identity 40.08%)
Query:  VMGFWVWFVSVVILGWFWLKKKKKHIGNKDG-------------IPKGNLGWPLFGETLHFISSGYSSRPVTFMDKRKSL--------------------
        V GF V    +++  W WL+ +      KDG             IP G+LGWP+ GETL+FI+ GYSSRPVTFMDKRKSL                    
Subjt:  VMGFWVWFVSVVILGWFWLKKKKKHIGNKDG-------------IPKGNLGWPLFGETLHFISSGYSSRPVTFMDKRKSL--------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------AKLLKIVSKIVEDRKSRVEEKVAPRDAVDVLLQD--HDEIQGLPLDFISSHIIELMIPGE
                                                 +L+K+V K+VE+R+  +       D VDVLL+D    E Q  P DF+S  I+E+MIPGE
Subjt:  ----------------------------------------AKLLKIVSKIVEDRKSRVEEKVAPRDAVDVLLQD--HDEIQGLPLDFISSHIIELMIPGE

Query:  ETVPMAMTLAVKFLSDCPCALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKA
        ET+P AMTLAVKFLSD P ALA+L+                          EENM++K++K   GEEY WTDYMSL FTQNVI+ETLRMANIINGVWRKA
Subjt:  ETVPMAMTLAVKFLSDCPCALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKA

Query:  QKDVQIKGYLIPQGWCVLASFISVHMDEKNYANPHQFDPWRWEE-NLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPT
         KDV+IKGYLIP+GWCVLASFISVHMDE  Y NP+QFDPWRW+  N SA ++  FTPFGGGQRLCPG+EL++LEISIFLHHLVT Y W AE+DEI+ FPT
Subjt:  QKDVQIKGYLIPQGWCVLASFISVHMDEKNYANPHQFDPWRWEE-NLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPT

Query:  VRMRRKLPITITTL
        V+M+R+LPI + T+
Subjt:  VRMRRKLPITITTL

AT5G05690.1 Cytochrome P450 superfamily protein (e value 7.4e-51, identity 42.18%)
Query:  KRKSLAKLLKIVSKIVEDRKSRVEEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEETVPMAMTLAVKFLSDCPCALAQLMEEKKYKWRETHE
        +RK    L  +V K  E+ +   E K   +D +  LL   D   G   + I   ++ L++ G ET    MTLAVKFL++ P ALAQL EE        HE
Subjt:  KRKSLAKLLKIVSKIVEDRKSRVEEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEETVPMAMTLAVKFLSDCPCALAQLMEEKKYKWRETHE

Query:  RGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKDVQIKGYLIPQGWCVLASFISVHMDEKNYANPHQF
                          K++  K  S     W+DY S+ FTQ V++ETLR+ANII GV+R+A  DV+IKGY IP+GW V +SF +VH+D  ++ +   F
Subjt:  RGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKDVQIKGYLIPQGWCVLASFISVHMDEKNYANPHQF

Query:  DPWRWEENLSATNNHN-FTPFGGGQRLCPGIELARLEISIFLHHLVTTYKW-GAEKDEIIHFPTVRMRRKLPITI
        +PWRW+ N   T   N FTPFGGG RLCPG ELAR+ +S+FLH LVT + W  AE+D+++ FPT R +++ PI +
Subjt:  DPWRWEENLSATNNHN-FTPFGGGQRLCPGIELARLEISIFLHHLVTTYKW-GAEKDEIIHFPTVRMRRKLPITI

AT5G05690.3 Cytochrome P450 superfamily protein (e value 7.4e-51, identity 42.18%)
Query:  KRKSLAKLLKIVSKIVEDRKSRVEEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEETVPMAMTLAVKFLSDCPCALAQLMEEKKYKWRETHE
        +RK    L  +V K  E+ +   E K   +D +  LL   D   G   + I   ++ L++ G ET    MTLAVKFL++ P ALAQL EE        HE
Subjt:  KRKSLAKLLKIVSKIVEDRKSRVEEKVAPRDAVDVLLQDHDEIQGLPLDFISSHIIELMIPGEETVPMAMTLAVKFLSDCPCALAQLMEEKKYKWRETHE

Query:  RGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKDVQIKGYLIPQGWCVLASFISVHMDEKNYANPHQF
                          K++  K  S     W+DY S+ FTQ V++ETLR+ANII GV+R+A  DV+IKGY IP+GW V +SF +VH+D  ++ +   F
Subjt:  RGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANIINGVWRKAQKDVQIKGYLIPQGWCVLASFISVHMDEKNYANPHQF

Query:  DPWRWEENLSATNNHN-FTPFGGGQRLCPGIELARLEISIFLHHLVTTYKW-GAEKDEIIHFPTVRMRRKLPITI
        +PWRW+ N   T   N FTPFGGG RLCPG ELAR+ +S+FLH LVT + W  AE+D+++ FPT R +++ PI +
Subjt:  DPWRWEENLSATNNHN-FTPFGGGQRLCPGIELARLEISIFLHHLVTTYKW-GAEKDEIIHFPTVRMRRKLPITI


Sequences
CDS sequence
ATGGAATGGGTTATGGGATTTTGGGTTTGGTTTGTGAGTGTTGTTATTTTAGGTTGGTTTTGGTTGAAGAAGAAGAAGAAGCATATTGGAAACAAAGATGGAATTCCTAA
AGGGAATTTGGGTTGGCCTCTCTTTGGTGAAACTCTTCATTTCATTTCTTCTGGCTATTCTTCTCGTCCTGTCACCTTCATGGACAAACGCAAATCTCTAGCGAAGTTGT
TGAAAATTGTGAGTAAGATTGTGGAAGATAGAAAATCGAGGGTGGAGGAGAAGGTTGCACCGAGAGATGCGGTGGACGTGCTGTTACAGGATCACGATGAGATCCAAGGG
CTACCGTTGGATTTCATCAGCAGTCATATCATAGAGCTGATGATTCCCGGCGAGGAGACGGTGCCGATGGCCATGACCCTAGCCGTCAAATTTCTCAGTGATTGCCCCTG
CGCTCTTGCCCAATTAATGGAAGAAAAAAAGTATAAGTGGAGGGAGACGCACGAGCGGGGGAGCGCACGTGTGAGTGAGAGAGACAGAGAAAATGAGGAAGAGAATATGA
AATTAAAGAAGCAGAAGGATGGTTCTGGAGAGGAGTATACGTGGACAGATTACATGTCTCTGCTATTTACTCAAAATGTGATTAGTGAAACATTAAGAATGGCTAATATC
ATCAATGGGGTTTGGAGAAAAGCTCAAAAAGATGTGCAAATCAAAGGCTATTTAATACCACAAGGATGGTGCGTCTTGGCATCTTTTATTTCGGTTCATATGGATGAAAA
GAACTATGCCAACCCGCATCAGTTTGATCCATGGAGATGGGAGGAAAATTTGTCTGCAACAAACAACCATAACTTTACACCTTTTGGAGGGGGACAGAGGCTATGTCCTG
GCATCGAACTCGCCAGGCTCGAAATCTCAATTTTCCTTCATCATCTCGTCACTACCTACAAATGGGGAGCTGAAAAGGATGAAATCATCCATTTTCCGACAGTGAGGATG
AGAAGGAAGCTGCCAATCACAATCACAACCTTAAGCTCTTGA
mRNA sequence
AAAAAAAAAAAAAAAAGGGAAAAAAGAAAAAAAGTTGCAGCCTTTTTTGTTTATAGGGTTGTAGGGTTTTGGGGGGTAAATTAAAGAAGGTATAGGTAGGTAGAGAAGAA
TTGAAAGGTAAAAAGAAATGGAATGGGTTATGGGATTTTGGGTTTGGTTTGTGAGTGTTGTTATTTTAGGTTGGTTTTGGTTGAAGAAGAAGAAGAAGCATATTGGAAAC
AAAGATGGAATTCCTAAAGGGAATTTGGGTTGGCCTCTCTTTGGTGAAACTCTTCATTTCATTTCTTCTGGCTATTCTTCTCGTCCTGTCACCTTCATGGACAAACGCAA
ATCTCTAGCGAAGTTGTTGAAAATTGTGAGTAAGATTGTGGAAGATAGAAAATCGAGGGTGGAGGAGAAGGTTGCACCGAGAGATGCGGTGGACGTGCTGTTACAGGATC
ACGATGAGATCCAAGGGCTACCGTTGGATTTCATCAGCAGTCATATCATAGAGCTGATGATTCCCGGCGAGGAGACGGTGCCGATGGCCATGACCCTAGCCGTCAAATTT
CTCAGTGATTGCCCCTGCGCTCTTGCCCAATTAATGGAAGAAAAAAAGTATAAGTGGAGGGAGACGCACGAGCGGGGGAGCGCACGTGTGAGTGAGAGAGACAGAGAAAA
TGAGGAAGAGAATATGAAATTAAAGAAGCAGAAGGATGGTTCTGGAGAGGAGTATACGTGGACAGATTACATGTCTCTGCTATTTACTCAAAATGTGATTAGTGAAACAT
TAAGAATGGCTAATATCATCAATGGGGTTTGGAGAAAAGCTCAAAAAGATGTGCAAATCAAAGGCTATTTAATACCACAAGGATGGTGCGTCTTGGCATCTTTTATTTCG
GTTCATATGGATGAAAAGAACTATGCCAACCCGCATCAGTTTGATCCATGGAGATGGGAGGAAAATTTGTCTGCAACAAACAACCATAACTTTACACCTTTTGGAGGGGG
ACAGAGGCTATGTCCTGGCATCGAACTCGCCAGGCTCGAAATCTCAATTTTCCTTCATCATCTCGTCACTACCTACAAATGGGGAGCTGAAAAGGATGAAATCATCCATT
TTCCGACAGTGAGGATGAGAAGGAAGCTGCCAATCACAATCACAACCTTAAGCTCTTGATTGGATTCAATCGTCTCAACAACTTCAATACCAAGCTGTCGAGGAAATAAG
GCAAAAAAAATCTTCATACAGATGTAAATATATTCAATGACCACCCAAGTTTTCTTTCTTTTTTTTTTTTTTTCTTTTTTGTGAGATAGATATAGCTAGACAGACCCACG
AGGGGTCCTCTTTTTTAAGCCATAACTGTACTGATACAGGAATTCTAGTTTTACTTTACATGGAGCTCTTCTCTACCATTCTGAATTATGATTCTTGTTCGTGCGTCGCC
ATCTTTTTGACAGTTCAAAGGGGTCCTTCCTTTGTCCATTTCCTCCCTTCCCTCCATTGAATAGCATTCATTGTACCCAACTCCATGTATAAAGTTTATTTCCTTTCTTT
CCCCCCCTTTTAACATTTTGTGGTGAGGAGCTTGCTGGAGATTGTGTCTTTCCGATTCAGTCTTCAAATTGCTACCTTCTGATGGATGATGTTTGGAAGATTGCAGAGAG
AACAAGTTCATCTTTTGGTATCCAATTTTCTCCTAATACCAAGCATGCGTGTTTGAATTTGCCTTCAATGTCTATCACATCAGCTCATGATAGAGCACATCCAGCGTTCA
ATTGTTTGATCAAATCAACATCACATGCCGTGAAGTAATCAAAATAAGATAATAATAGTGGATATAGATTCTTACATCATTTCAATTTCAATTCTGTCAATCAAAATGGT
ACAAATTTGAGTAAGCTTAAAGAGAGAGGGAGTTGTGGATGGTGCGTTTATTAAAGGCCATGCCAATCACAAGACTAAGTGGGTCAATGATATGAAGTTTGGTAGAGCCC
ACTAGTGGGCGTCCTCTTCCCATGCCAATCACAGTTCACATGATTCAAACCATCCACCATATTACTGTCTCCCCTTTTTTGGGCGTTGGGAAATACCCAAAAATTCAGTT
ACTGTGTCGGTGCAGATTAATTTATACATCTTAATCTGATCATCCTTCAGCTTCAAGTACCATTTGCTGAACAGGCGTATCAGAGATGGCATTAGGAACCCAGTTCCTGT
GCCATTTGATGGAAGATAGCTTTCCTTCAATGCTGAAGCCCATTTCTACTGCACTTGCAAATCGGCTCAACTCTTTAAAGCTTTACTGTAATTTTGGTTTGTGTTTAAAT
AGCAAAAGAGAGAAGATATTGGGAAAATAAGACTGCATAGTATATATGGAAAAAACAAGCTATTACCTCGCAATCTATTCCATAATCCATAATTTTTTTTCTCTTCTTCA
AGCTGTTGAATTTGGATTATGTAGTCGATCCTACAATAAAGACTTAATTACAAAATCTTGTGAGTTAGGAACTTGGGATATGTAAGAGAAATAGCCAAGCTGTAAGTCAT
AGGATTAACAAACTCTGCAATCAACTCAAAATATGCTCCATTTTGTAACTTCGGTTATACAAACTACCAATATGTGAGGCATATGCCTCCGGTTAACTTTAAAAAAATGA
TCTTTCAATTT
Protein sequence
MEWVMGFWVWFVSVVILGWFWLKKKKKHIGNKDGIPKGNLGWPLFGETLHFISSGYSSRPVTFMDKRKSLAKLLKIVSKIVEDRKSRVEEKVAPRDAVDVLLQDHDEIQG
LPLDFISSHIIELMIPGEETVPMAMTLAVKFLSDCPCALAQLMEEKKYKWRETHERGSARVSERDRENEEENMKLKKQKDGSGEEYTWTDYMSLLFTQNVISETLRMANI
INGVWRKAQKDVQIKGYLIPQGWCVLASFISVHMDEKNYANPHQFDPWRWEENLSATNNHNFTPFGGGQRLCPGIELARLEISIFLHHLVTTYKWGAEKDEIIHFPTVRM
RRKLPITITTLSS
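
The CDS and protein sequences listed above should agree under translation. The following is a minimal sketch of that check, not a tool provided by the database. It assumes the standard nuclear genetic code (NCBI translation table 1), and `translate` is a hypothetical helper name. The example input is the first eight codons copied from the CDS sequence above.

```python
# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First eight codons of the CDS sequence above.
print(translate("ATGGAATGGGTTATGGGATTTTGG"))  # MEWVMGFW
```

The result matches the first eight residues of the protein sequence (MEWVMGFW). Passing the full CDS, with line breaks removed, would allow the whole protein to be compared in the same way.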