CuGenDBv2

HG10022099 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10022099
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: FAD/NAD(P)-binding oxidoreductase family protein
Genome location: Chr05:20813560..20825870
RNA-Seq Expression: HG10022099
Synteny: HG10022099
Gene Ontology terms: NA
InterPro domains:
IPR004792 - 3-Dehydro-bile acid delta(4,6)-reductase-like
IPR023166 - HI0933-like insert domain superfamily
IPR036188 - FAD/NAD(P)-binding domain superfamily
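The genome location field follows a `seqid:start..end` pattern. As a minimal sketch, the snippet below splits such a string into its parts and derives the span length. The helper names (`GenomeLocation`, `parse_location`) are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Parsed form of a 'seqid:start..end' location string (hypothetical helper)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'Chr05:20813560..20825870'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return GenomeLocation(match["seqid"], int(match["start"]), int(match["end"]))


loc = parse_location("Chr05:20813560..20825870")
print(loc.seqid, loc.start, loc.end, loc.length)
```

The span computed this way covers the whole gene region on the chromosome, so it is expected to be longer than the CDS whenever the gene contains introns or untranslated regions.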


Homology
GenBank top hits | e value | % identity
KAG7016151.1 ytfP [Cucurbita argyrosperma subsp. argyrosperma] | 2.8e-203 | 77.8
Query:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGP
        M+ T+A+TSIVAVQKLNEELLVVVGGGAAGVYGA+RAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGH  DAKSLAEHYPRGHKEFRGPFFNVHGP
Subjt:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGP

Query:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGSSRQGFSLAAQL
        +DTMSWFSNHGV+LKVEDDGRVFPV+N SASIVDCL+SEAKRTGVSLQTGKVVTSASISSG KFALKIQKLMN  EHVEANYLLIASGSSRQGFSLAAQL
Subjt:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGSSRQGFSLAAQL

Query:  GHSLIDPVPSLFTFKIEDPQLAELSGVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQLIIFSVGLLIVDFTPDLHLEDVKT
        GHSLIDPVPSLFTFKIEDPQLAELSGVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSL          +LI+F VGLLIVDF PD HLEDVKT
Subjt:  GHSLIDPVPSLFTFKIEDPQLAELSGVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQLIIFSVGLLIVDFTPDLHLEDVKT

Query:  ILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILG---------------------------KGQFK
        ILSRHKSQFM                          EINDEILWASISNKSLASISSLLKQCIFK+LG                           KGQFK
Subjt:  ILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILG---------------------------KGQFK

Query:  DEFVTAGGVPLSE------------------------ISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA
        DEFVTAGGV LSE                        ISLKTMESKIHSRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAGTSIGKLANGE LGRDISNLA
Subjt:  DEFVTAGGVPLSE------------------------ISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA

XP_004148683.1 uncharacterized protein LOC101210627 isoform X2 [Cucumis sativus] | 4.8e-203 | 80.13
Query:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGP
        M+ T+ALTS VA QKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNV+VIEKGRPLSKVKISGGGRCNVTNGH  DAKSLAEHYPRGHKEFRGPFFNVHGP
Subjt:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGP

Query:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGSSRQGFSLAAQL
        MDTMSWFSNHGVELKVEDDGRVFPVSNCS+S+VDCL+SEAKRTGVSLQTGKVV SASIS+G KFALKIQKL+NCFEHVEANYLLIASGSSRQGFSLAAQL
Subjt:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGSSRQGFSLAAQL

Query:  GHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQ
        GHSLIDPVPSLFTFKIEDPQLAELSG                        VGPMLVTHWGLSGPVILRLSAWGARDLFASDYK                 
Subjt:  GHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQ

Query:  LIIFSVGLLIVDFTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEF
              GLLIVDFTPDLHLE+VKTIL+RHKSQFM                          EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEF
Subjt:  LIIFSVGLLIVDFTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEF

Query:  VTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA
        VTAGGVPLSEISLKTMESKIHSRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAGTSIG+LANGE LGRDI+NLA
Subjt:  VTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA

XP_022993604.1 uncharacterized protein LOC111489549 isoform X1 [Cucurbita maxima] | 9.0e-202 | 80.34
Query:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGP
        M+ T+A+TSIVAVQKLNEE+LVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGH  DAKSLAEHYPRGHKEFRGPFFNVHGP
Subjt:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGP

Query:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGSSRQGFSLAAQL
        MDTMSWFSNHGV+LKVEDDGRVFPVSN SASIVDCL+SEAKRTGVSLQTGKVVTSASISSG KFALKIQKLMN  EHVEANYLLIASGSSRQGFSLAAQL
Subjt:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGSSRQGFSLAAQL

Query:  GHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQ
        GHSL+DPVPSLFTFKIEDPQLAELSG                        VGPMLVTHWGLSGPVILRLSAWGARDLFASDYK                 
Subjt:  GHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQ

Query:  LIIFSVGLLIVDFTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEF
              GLLIVDF PDLHLEDVKTILSRHKSQFM                          EINDEILWASISNKSLASISSLLKQCIFK+LGKGQFKDEF
Subjt:  LIIFSVGLLIVDFTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEF

Query:  VTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA
        VTAGGV LSEISLKTMESKIHSRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAGTSIGKLANGE LGRD+SNLA
Subjt:  VTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA

XP_023549941.1 uncharacterized protein LOC111808280 [Cucurbita pepo subsp. pepo] | 2.6e-201 | 80.13
Query:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGP
        M+ T+A+TSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGH  DAKSLAEHYPRGHKEFRGPFFNVHGP
Subjt:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGP

Query:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGSSRQGFSLAAQL
        MDTMSWFSNHGV+LKVEDDGRVFPVSN SASI+DCL++EAKRTGVSLQTGKVVTSASISSG KFALKIQKLMN  EHVEANYLLIASGSSRQGFSLAAQL
Subjt:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGSSRQGFSLAAQL

Query:  GHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQ
        GHSL+DPVPSLFTFKIEDPQLAELSG                        VGPMLVTHWGLSGPVILRLSAWGARDLF SDYK                 
Subjt:  GHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQ

Query:  LIIFSVGLLIVDFTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEF
              GLLIVDF PDLHLEDVKTILSRHKSQFM                          EINDEILWASISNKSLASISSLLKQCIFK+LGKGQFKDEF
Subjt:  LIIFSVGLLIVDFTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEF

Query:  VTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA
        VTAGGV LSEISLKTMESKIHSRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAGTSIGKLANGE LGRDISNLA
Subjt:  VTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA

XP_038889404.1 uncharacterized protein YtfP isoform X2 [Benincasa hispida] | 2.5e-204 | 80.55
Query:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGP
        M+ T+ALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNV+VIEKGRPLSKVKISGGGRCNVTNGHC DAKSLAEHYPRG+KEFRGPFFNVHGP
Subjt:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGP

Query:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGSSRQGFSLAAQL
        MDTMSWFSNHGVELK+E+DGRVFPVSNCSASIVDCL+SE+KRTGVSLQTGKVVTSAS+SSG KFALKIQKLMNC EH+EANYLLIASGSSRQGFSLAAQ 
Subjt:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGSSRQGFSLAAQL

Query:  GHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQ
        GHSLIDPVPSLFTFKIEDPQLAELSG                        VGPMLVTHWGLSGPVILRLSAWGARDLF SDYK                 
Subjt:  GHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQ

Query:  LIIFSVGLLIVDFTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEF
              GLLIVDFTPDLHLEDVKTILSRHKSQFM                          EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEF
Subjt:  LIIFSVGLLIVDFTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEF

Query:  VTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA
        VTAGGVPLSEISLKTMESKI SRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAGTSIGKLANGE LGRDISNLA
Subjt:  VTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA

TrEMBL top hits | e value | % identity
A0A0A0KVG6 Uncharacterized protein | 2.3e-203 | 80.13
Query:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGP
        M+ T+ALTS VA QKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNV+VIEKGRPLSKVKISGGGRCNVTNGH  DAKSLAEHYPRGHKEFRGPFFNVHGP
Subjt:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGP

Query:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGSSRQGFSLAAQL
        MDTMSWFSNHGVELKVEDDGRVFPVSNCS+S+VDCL+SEAKRTGVSLQTGKVV SASIS+G KFALKIQKL+NCFEHVEANYLLIASGSSRQGFSLAAQL
Subjt:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGSSRQGFSLAAQL

Query:  GHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQ
        GHSLIDPVPSLFTFKIEDPQLAELSG                        VGPMLVTHWGLSGPVILRLSAWGARDLFASDYK                 
Subjt:  GHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQ

Query:  LIIFSVGLLIVDFTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEF
              GLLIVDFTPDLHLE+VKTIL+RHKSQFM                          EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEF
Subjt:  LIIFSVGLLIVDFTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEF

Query:  VTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA
        VTAGGVPLSEISLKTMESKIHSRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAGTSIG+LANGE LGRDI+NLA
Subjt:  VTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA

A0A1S3C9B6 uncharacterized protein YtfP isoform X1 | 1.1e-200 | 79.28
Query:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGP
        M+ T+ALTS VA QKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNV+VIEKGRPLSKVKISGGGRCNVTNGHC DAKSLAEHYPRGHKEFRGPFFNVHGP
Subjt:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGP

Query:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGSSRQGFSLAAQL
        MDTMSWFSNHGVELKVEDDGRVFPVSNCS+S+VDCL+SEAKRTGVSLQTGKVV SASIS+G KFALKIQKL+NCFEHVEANYLLIASGSSRQGFSLAAQL
Subjt:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGSSRQGFSLAAQL

Query:  GHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQ
        GHSLIDPVPSLFTFKIEDPQLAELSG                        VGPMLVTHWGLSGPVILRLSAWGARDLFASDYK                 
Subjt:  GHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQ

Query:  LIIFSVGLLIVDFTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEF
              GLLIVDFTPDLHLEDVK IL+RHKSQFM                          EINDEILWASISNKSLASIS LLKQCIFKILGKGQFKDEF
Subjt:  LIIFSVGLLIVDFTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEF

Query:  VTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA
        VTAGGVPLSE+SLKTMESKIHSRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAGTSIG LANGE L  DI+N A
Subjt:  VTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA

A0A6J1BWQ2 uncharacterized protein LOC111006025 isoform X1 | 7.0e-192 | 76.79
Query:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGP
        M+  +ALTS VAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKG+PLSKVKISGGGRCNVTNGH  D+KSLAEHYPRGHKEFRG FFNVHGP
Subjt:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGP

Query:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGSSRQGFSLAAQL
        MDTMSWFSNHGVELK+EDDGRVFPVSNCSASIVDCL+ EA R GVSLQTGKVVTSAS SSG KF LKIQK++   EHVEANYLLIASGSSRQGFSLAAQL
Subjt:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGSSRQGFSLAAQL

Query:  GHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQ
        GHSLIDPVPSLFTFKIEDPQLAELSG                        VGPMLVTHWGLSGPVILRLSAWGARDLFAS+YK                 
Subjt:  GHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQ

Query:  LIIFSVGLLIVDFTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEF
              GLLIVDFTPDLHLEDVK+ILSRHKSQFM                          EI+DEILWAS+SNKSLAS+SSLLK+CIFK+LGKGQFKDEF
Subjt:  LIIFSVGLLIVDFTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEF

Query:  VTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGR-DISNLA
        VTAGGVPLSEISLKTMESKIHSRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAGTSIGKLANG  LGR D+ N+A
Subjt:  VTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGR-DISNLA

A0A6J1FLC1 uncharacterized protein LOC111445022 isoform X1 | 3.7e-201 | 80.13
Query:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGP
        M+ TRA+TSIV VQKLNEELLVVVGGGAAGVYGA+RAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGH  DAKSLAEHYPRGHKEFRGPFFNVHGP
Subjt:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGP

Query:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGSSRQGFSLAAQL
        MDTMSWFSNHGV+LKVEDDGRVFPV+N SASIVDCL+SEAKRTGVSLQTGKVVTSASISSG KFALKIQKLMN  EHVEANYLLIASGSSRQGFSLAAQL
Subjt:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGSSRQGFSLAAQL

Query:  GHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQ
        GHSL+DPVPSLFTFKIEDPQLAELSG                        VGPMLVTHWGLSGPVILRLSAWGARDLFASDYK                 
Subjt:  GHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQ

Query:  LIIFSVGLLIVDFTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEF
              GLLIVDF PD HLEDVKTILSRHKSQFM                          EINDEILWASISNKSLASISSLLKQCIFK+LGKGQFKDEF
Subjt:  LIIFSVGLLIVDFTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEF

Query:  VTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA
        VTAGGV LSEISLKTMESKIHSRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAGTSIGKLANGE LGRDISNLA
Subjt:  VTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA

A0A6J1K0M0 uncharacterized protein LOC111489549 isoform X1 | 4.4e-202 | 80.34
Query:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGP
        M+ T+A+TSIVAVQKLNEE+LVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGH  DAKSLAEHYPRGHKEFRGPFFNVHGP
Subjt:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGP

Query:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGSSRQGFSLAAQL
        MDTMSWFSNHGV+LKVEDDGRVFPVSN SASIVDCL+SEAKRTGVSLQTGKVVTSASISSG KFALKIQKLMN  EHVEANYLLIASGSSRQGFSLAAQL
Subjt:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGSSRQGFSLAAQL

Query:  GHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQ
        GHSL+DPVPSLFTFKIEDPQLAELSG                        VGPMLVTHWGLSGPVILRLSAWGARDLFASDYK                 
Subjt:  GHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQ

Query:  LIIFSVGLLIVDFTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEF
              GLLIVDF PDLHLEDVKTILSRHKSQFM                          EINDEILWASISNKSLASISSLLKQCIFK+LGKGQFKDEF
Subjt:  LIIFSVGLLIVDFTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEF

Query:  VTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA
        VTAGGV LSEISLKTMESKIHSRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAGTSIGKLANGE LGRD+SNLA
Subjt:  VTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA

SwissProt top hits | e value | % identity
B0NAQ4 3-dehydro-bile acid delta(4,6)-reductase | 7.5e-26 | 24.94
Query:  VVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPL-SKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDGR
        ++GGGA+G+  AI A     +  V ++E+   +  K+  +G GRCN+TN   +DA      Y     EF        G  +T+ +F++ G+  K    G 
Subjt:  VVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPL-SKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDGR

Query:  VFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIAS--------GSSRQGFSLAAQLGHSLIDPVPSLFT
        ++P S+ +AS+++ L  E +R  V + TG  V +  +S+   F ++        +   A+ +++A         GS   G++LA  +GH+L   VP+L  
Subjt:  VFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIAS--------GSSRQGFSLAAQLGHSLIDPVPSLFT

Query:  FKIEDPQLAELSGV-------------------GPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQLIIFSVGLLIVDFTPDLH
         K++    A+ +GV                   G M +T +G+SG  + ++S   A+ L+             +G  +K           + VDF P++ 
Subjt:  FKIEDPQLAELSGV-------------------GPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQLIIFSVGLLIVDFTPDLH

Query:  LEDVKTILSRHKSQFMEINDEILWASISNKSL---------------------ASISSLLKQC---IFKILGKGQFKDEFVTAGGVPLSEISLKTMESKI
           V+   + H  +      +     I  K L                     A    L++ C   +  I     F +  V AGGV   E+   T+ES+ 
Subjt:  LEDVKTILSRHKSQFMEINDEILWASISNKSL---------------------ASISSLLKQC---IFKILGKGQFKDEFVTAGGVPLSEISLKTMESKI

Query:  HSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAG
           L+  GE+L+++G+ GG+N Q AW+ GY+AG
Subjt:  HSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAG

P37631 Uncharacterized protein YhiN | 4.7e-20 | 26.47
Query:  VVVGGGAAGVYGAIRAKTLAPNLNVMVIEKG-RPLSKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDG
        +++G GAAG++ +  A        V++I+ G +P  K+ +SGGGRCN TN +      L+++ P   K     F       D +   + HG+    +  G
Subjt:  VVVGGGAAGVYGAIRAKTLAPNLNVMVIEKG-RPLSKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDG

Query:  RVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKL-MNCFEHVEA--NYLLIASGSSRQGFSLAAQLGHSLIDPVPSLFTFKIE
        ++F   + +  IVD L+ E ++  V+ +    V S +      F L +  + + C + V A     +   G+S  G+ +A Q G +++     L  F + 
Subjt:  RVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKL-MNCFEHVEA--NYLLIASGSSRQGFSLAAQLGHSLIDPVPSLFTFKIE

Query:  DPQLAE---LSGVG---------------PMLVTHWGLSGPVILRLSAWGARDLFAS-------DYKGFLNSLLVKGYGLKFSQLIIFSVGLLIVDFTPD
         P L E   L+GV                 +L TH GLSGP +L++S++     F S       D + FLN                          T  
Subjt:  DPQLAE---LSGVG---------------PMLVTHWGLSGPVILRLSAWGARDLFAS-------DYKGFLNSLLVKGYGLKFSQLIIFSVGLLIVDFTPD

Query:  LHLEDVKTILSRHKSQFMEINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNF
        +HL   K ++ R   Q  +I D  L   ++ +   ++ S L     +  G   ++   VT GGV  +E+S +TME++    L+F GEV+++ G  GG+NF
Subjt:  LHLEDVKTILSRHKSQFMEINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNF

Query:  QNAWSGGY
        Q AWS  +
Subjt:  QNAWSGGY

P44941 Uncharacterized protein HI_0933 | 6.6e-22 | 27.14
Query:  VVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLS-KVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDG
        +++G GAAG++ A +   L    +V V + G+ +  K+ +SGGG CN TN     A  L+++ P   K     + N     D +S  +  G+    ++ G
Subjt:  VVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLS-KVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDG

Query:  RVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSA---SISSGVKFALKIQKLM-NCFEHVEA--NYLLIASGSSRQGFSLAAQLGHSLIDPVPSL--F
        ++F     +  IV+ L SE  + G  +     V+          V+F L++      C   + A     +   G++  G+ +A Q G  +I P  SL  F
Subjt:  RVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSA---SISSGVKFALKIQKLM-NCFEHVEA--NYLLIASGSSRQGFSLAAQLGHSLIDPVPSL--F

Query:  TFKIEDPQLAELSGV---------------GPMLVTHWGLSGPVILRLS-AWGARDLFASDYKGFLNSLLVKGYGLKFSQLIIFSVGLLIVDFTPDLHLE
        T++  D  L  LSG+                 +L TH G+SGP +L++S  W   +                   ++   L   +V   I         +
Subjt:  TFKIEDPQLAELSGV---------------GPMLVTHWGLSGPVILRLS-AWGARDLFASDYKGFLNSLLVKGYGLKFSQLIIFSVGLLIVDFTPDLHLE

Query:  DVKTILSR-HKSQFME-------INDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTG
         +KTIL R    + +E       + DE++ A+IS   + ++   +    F   G   ++   VT GGV    IS KTMES   S L+F GEVL++ G  G
Subjt:  DVKTILSR-HKSQFME-------INDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTG

Query:  GFNFQNAWSGGYIAGTSIGK
        G+NFQ AWS  Y    SI +
Subjt:  GFNFQNAWSGGYIAGTSIGK

Q795R8 Uncharacterized protein YtfP | 3.7e-33 | 28.81
Query:  LVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLS-KVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDD
        ++V+GGG +G+  AI A        V++I+KG  L  K+ ISGGGRCNVTN   +  + + +H P G+  F    F+     D + +F N G++LK ED 
Subjt:  LVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLS-KVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDD

Query:  GRVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIA--------SGSSRQGFSLAAQLGHSLIDPVP--
        GR+FPV++ + S+VD L++  K+  V+++T + + S     G    +    + N  E + +  ++IA        +GS+  G+  A   GH++ +  P  
Subjt:  GRVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIA--------SGSSRQGFSLAAQLGHSLIDPVP--

Query:  -------------SLFTFKIEDPQLAELSGVG--------PMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQLIIFSVGLLIVD
                     +L    + D  ++ L+  G         ML TH+GLSGP ILR S +  ++L           L         ++  +F      + 
Subjt:  -------------SLFTFKIEDPQLAELSGVG--------PMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQLIIFSVGLLIVD

Query:  FTPDLHLEDV--KTILSRHKSQFMEINDEILWASISNKSLASISSLLKQC-IFKILGKG--QFKDEFVTAGGVPLSEISLKTMESKIHSRLFFAGEVLNI
          P   +++V    +  R+    +E N      S S          ++ C  F +L  G       FVT GGV + EI  K M SK    L+F GE+L+I
Subjt:  FTPDLHLEDV--KTILSRHKSQFMEINDEILWASISNKSLASISSLLKQC-IFKILGKG--QFKDEFVTAGGVPLSEISLKTMESKIHSRLFFAGEVLNI

Query:  DGVTGGFNFQNAWSGGYIAGTSIGKLA
         G TGG+N  +A   G +AG + G+ A
Subjt:  DGVTGGFNFQNAWSGGYIAGTSIGKLA

Arabidopsis top hits | e value | % identity
AT5G39940.1 FAD/NAD(P)-binding oxidoreductase family protein | 2.1e-140 | 58.01
Query:  ALTSIV-AVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGPMDTM
        A+TS+    +K   ELLVVVGGGAAGVYGAIRAKTL+P+L V+VIEKG  LSKVKISGGGRCNVTNGHC D  +LA HYPRGHKE +G FF  HGP DTM
Subjt:  ALTSIV-AVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGPMDTM

Query:  SWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQK-LMNCFEHVEANYLLIASGSSRQGFSLAAQLGHS
        SWFS HGV LK EDDGRVFPVS+ S S+VDCL++EA   GV L+ GK V +ASI    KF +K+ K   +  E +EA YLLIA+GSS++G SLA + GHS
Subjt:  SWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQK-LMNCFEHVEANYLLIASGSSRQGFSLAAQLGHS

Query:  LIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQLII
        ++DPVPSLFTFKI DP L EL+G                        +GPMLVTHWGLSGPVILRLSAWGAR LF+S YKG                   
Subjt:  LIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQLII

Query:  FSVGLLIVDFTPDLHLEDVKTILSRHKSQFME--------------------------INDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTA
             LIVDF PD+++E  K++L  HK QF +                           + + LWAS+SN SL+SIS LLK C F++ GKGQ+KDEFVTA
Subjt:  FSVGLLIVDFTPDLHLEDVKTILSRHKSQFME--------------------------INDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTA

Query:  GGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESL
        GGVPLSE+SLKTMESK+   LFFAGEVLN+DGVTGGFNFQNAWSGGYIAGT+IG+LA+   +
Subjt:  GGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESL

AT5G39940.2 FAD/NAD(P)-binding oxidoreductase family protein | 5.2e-131 | 57.5
Query:  ALTSIV-AVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGPMDTM
        A+TS+    +K   ELLVVVGGGAAGVYGAIRAKTL+P+L V+VIEKG  LSKVKISGGGRCNVTNGHC D  +LA HYPRGHKE +G FF  HGP DTM
Subjt:  ALTSIV-AVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGPMDTM

Query:  SWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQK-LMNCFEHVEANYLLIASGSSRQGFSLAAQLGHS
        SWFS HGV LK EDDGRVFPVS+ S S+VDCL++EA   GV L+ GK V +ASI    KF +K+ K   +  E +EA YLLIA+GSS++G SLA + GHS
Subjt:  SWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQK-LMNCFEHVEANYLLIASGSSRQGFSLAAQLGHS

Query:  LIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQLII
        ++DPVPSLFTFKI DP L EL+G                        +GPMLVTHWGLSGPVILRLSAWGAR LF+S YKG                   
Subjt:  LIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQLII

Query:  FSVGLLIVDFTPDLHLEDVKTILSRHKSQFME--------------------------INDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTA
             LIVDF PD+++E  K++L  HK QF +                           + + LWAS+SN SL+SIS LLK C F++ GKGQ+KDEFVTA
Subjt:  FSVGLLIVDFTPDLHLEDVKTILSRHKSQFME--------------------------INDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTA

Query:  GGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQ
        GGVPLSE+SLKTMESK+   LFFAGEVLN+DGVTGGFNFQ
Subjt:  GGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQ
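
In the alignment blocks above, the unlabelled middle line appears to follow the usual BLAST-style convention: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. Under that assumption, a rough per-block tally can be sketched as follows. The function name `midline_stats` is hypothetical, gap columns are counted in the block length, and the result is not guaranteed to reproduce the % identity values listed with each hit, which are presumably computed by the search tool over the full alignment.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Tally identities and positives for one alignment block.

    Assumption: letters in the midline are identities, '+' are conservative
    substitutions, and anything else is a mismatch or gap column.
    """
    length = len(query)
    # Trailing blanks are often stripped from the midline, so pad it back out.
    padded = midline.ljust(length)[:length]
    identities = sum(ch.isalpha() for ch in padded)
    positives = identities + padded.count("+")
    pct_identity = 100.0 * identities / length if length else 0.0
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "pct_identity": pct_identity,
    }


# First 20 columns of the first GenBank hit, copied from the block above.
stats = midline_stats("MSQTRALTSIVAVQKLNEEL", "M+ T+A+TSIVAVQKLNEEL")
print(stats)
```

Summing `identities` and `length` across all blocks of one hit before dividing gives an alignment-wide figure rather than a per-block one.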


Sequences
CDS sequence
ATGAGTCAAACGAGAGCTTTAACCTCCATTGTTGCAGTCCAAAAGTTGAATGAAGAACTGTTGGTAGTGGTAGGAGGTGGAGCAGCAGGTGTTTATGGAGCTATTAGAGC
TAAAACCCTCGCCCCCAATCTCAATGTCATGGTTATTGAGAAAGGAAGACCCCTTTCCAAGGTCAAAATTTCTGGAGGAGGCCGATGCAATGTGACGAATGGGCATTGTA
TCGATGCAAAGAGTTTGGCAGAGCATTACCCTAGAGGCCATAAAGAATTTAGGGGCCCTTTCTTCAATGTTCACGGTCCAATGGATACAATGTCCTGGTTTTCCAATCAT
GGAGTTGAACTGAAGGTTGAGGATGATGGAAGGGTTTTTCCTGTCAGCAATTGTTCTGCTTCTATAGTCGATTGTCTGATTTCTGAAGCAAAACGCACTGGAGTTTCCTT
GCAGACTGGAAAGGTTGTTACAAGTGCATCGATTAGTAGTGGCGTGAAGTTCGCTTTGAAGATTCAAAAGCTTATGAATTGTTTTGAACACGTTGAAGCAAACTATTTAC
TGATTGCTAGTGGGAGTAGTCGGCAGGGCTTTAGTCTAGCTGCTCAGCTTGGACATTCACTTATAGACCCAGTGCCAAGCCTATTTACTTTCAAGATTGAAGATCCTCAA
TTGGCAGAGTTGTCTGGGGTTGGGCCTATGCTTGTCACACATTGGGGACTTAGTGGACCGGTAATTCTTCGTTTATCGGCTTGGGGAGCTCGTGACCTATTTGCTTCAGA
TTATAAAGGTTTCTTAAATTCCCTACTTGTGAAAGGTTATGGTTTAAAATTTTCTCAGCTAATAATTTTTAGTGTAGGCCTGCTCATTGTGGATTTTACACCTGATTTAC
ATTTGGAAGATGTCAAAACAATTCTTAGCCGGCACAAATCTCAGTTTATGGAAATAAATGATGAGATTTTGTGGGCTTCCATCTCAAACAAATCATTAGCTTCCATTTCT
TCTCTGTTGAAACAATGCATATTTAAGATCTTGGGGAAGGGTCAATTTAAGGATGAATTTGTCACTGCTGGTGGTGTTCCGCTGTCAGAGATCTCACTTAAAACAATGGA
GAGCAAAATTCATTCTCGCCTATTCTTTGCTGGGGAGGTGCTAAATATCGATGGCGTAACGGGTGGTTTCAACTTTCAGAATGCTTGGTCCGGTGGCTACATTGCTGGAA
CTAGCATTGGTAAACTTGCAAATGGTGAGTCTCTAGGGAGGGATATAAGCAATTTGGCTTGA
mRNA sequence
ATGAGTCAAACGAGAGCTTTAACCTCCATTGTTGCAGTCCAAAAGTTGAATGAAGAACTGTTGGTAGTGGTAGGAGGTGGAGCAGCAGGTGTTTATGGAGCTATTAGAGC
TAAAACCCTCGCCCCCAATCTCAATGTCATGGTTATTGAGAAAGGAAGACCCCTTTCCAAGGTCAAAATTTCTGGAGGAGGCCGATGCAATGTGACGAATGGGCATTGTA
TCGATGCAAAGAGTTTGGCAGAGCATTACCCTAGAGGCCATAAAGAATTTAGGGGCCCTTTCTTCAATGTTCACGGTCCAATGGATACAATGTCCTGGTTTTCCAATCAT
GGAGTTGAACTGAAGGTTGAGGATGATGGAAGGGTTTTTCCTGTCAGCAATTGTTCTGCTTCTATAGTCGATTGTCTGATTTCTGAAGCAAAACGCACTGGAGTTTCCTT
GCAGACTGGAAAGGTTGTTACAAGTGCATCGATTAGTAGTGGCGTGAAGTTCGCTTTGAAGATTCAAAAGCTTATGAATTGTTTTGAACACGTTGAAGCAAACTATTTAC
TGATTGCTAGTGGGAGTAGTCGGCAGGGCTTTAGTCTAGCTGCTCAGCTTGGACATTCACTTATAGACCCAGTGCCAAGCCTATTTACTTTCAAGATTGAAGATCCTCAA
TTGGCAGAGTTGTCTGGGGTTGGGCCTATGCTTGTCACACATTGGGGACTTAGTGGACCGGTAATTCTTCGTTTATCGGCTTGGGGAGCTCGTGACCTATTTGCTTCAGA
TTATAAAGGTTTCTTAAATTCCCTACTTGTGAAAGGTTATGGTTTAAAATTTTCTCAGCTAATAATTTTTAGTGTAGGCCTGCTCATTGTGGATTTTACACCTGATTTAC
ATTTGGAAGATGTCAAAACAATTCTTAGCCGGCACAAATCTCAGTTTATGGAAATAAATGATGAGATTTTGTGGGCTTCCATCTCAAACAAATCATTAGCTTCCATTTCT
TCTCTGTTGAAACAATGCATATTTAAGATCTTGGGGAAGGGTCAATTTAAGGATGAATTTGTCACTGCTGGTGGTGTTCCGCTGTCAGAGATCTCACTTAAAACAATGGA
GAGCAAAATTCATTCTCGCCTATTCTTTGCTGGGGAGGTGCTAAATATCGATGGCGTAACGGGTGGTTTCAACTTTCAGAATGCTTGGTCCGGTGGCTACATTGCTGGAA
CTAGCATTGGTAAACTTGCAAATGGTGAGTCTCTAGGGAGGGATATAAGCAATTTGGCTTGA
Protein sequence
MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGRPLSKVKISGGGRCNVTNGHCIDAKSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNH
GVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGSSRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQ
LAELSGVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGFLNSLLVKGYGLKFSQLIIFSVGLLIVDFTPDLHLEDVKTILSRHKSQFMEINDEILWASISNKSLASIS
SLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA
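
The CDS and protein sequences above are related by translation, so one can be checked against the other. The sketch below is a minimal translator that assumes the standard genetic code (NCBI translation table 1) and a CDS that starts in frame. The function name `translate` is hypothetical, and the wrapped sequence lines are expected to be joined into one string first.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence into a one-letter protein string.

    Whitespace (including line breaks from wrapped sequence lines) is removed,
    unknown codons become 'X', and a trailing partial codon is ignored.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS above, followed by a stop codon.
print(translate("ATGAGTCAAACGAGAGCTTTATGA"))
```

Passing the full joined CDS through `translate` and comparing the result with the joined protein lines is a quick consistency check on the record.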