; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi02G015050 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi02G015050
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionFAD/NAD(P)-binding oxidoreductase family protein
Genome locationchr02:20929696..20942546
RNA-Seq ExpressionLsi02G015050
SyntenyLsi02G015050
Gene Ontology termsNA
InterPro domainsIPR004792 - 3-Dehydro-bile acid delta(4,6)-reductase-like
IPR023166 - HI0933-like insert domain superfamily
IPR036188 - FAD/NAD(P)-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004148683.1 uncharacterized protein LOC101210627 isoform X2 [Cucumis sativus]1.7e-18376.14Show/hide
Query:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVM------------------------------SLAEHYPRGHKEFRGPFFNVHGP
        M+ T+ALTS VA QKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNV+                              SLAEHYPRGHKEFRGPFFNVHGP
Subjt:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVM------------------------------SLAEHYPRGHKEFRGPFFNVHGP

Query:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGS
        MDTMSWFSNHGVELKVEDDGRVFPVSNCS+S+VDCL+SEAKRTG           VSLQTGKVV SASIS+G KFALKIQKL+NCFEHVEANYLLIASGS
Subjt:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGS

Query:  SRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVD
        SRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG                        VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVD
Subjt:  SRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVD

Query:  FTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEIS
        FTPDLHLE+VKTIL+RHKSQFM                          EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEIS
Subjt:  FTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEIS

Query:  LKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA
        LKTMESKIHSRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAGTSIG+LANGE LGRDI+NLA
Subjt:  LKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA

XP_022938975.1 uncharacterized protein LOC111445022 isoform X1 [Cucurbita moschata]2.7e-18176.14Show/hide
Query:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVM------------------------------SLAEHYPRGHKEFRGPFFNVHGP
        M+ TRA+TSIV VQKLNEELLVVVGGGAAGVYGA+RAKTLAPNLNVM                              SLAEHYPRGHKEFRGPFFNVHGP
Subjt:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVM------------------------------SLAEHYPRGHKEFRGPFFNVHGP

Query:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGS
        MDTMSWFSNHGV+LKVEDDGRVFPV+N SASIVDCL+SEAKRTG           VSLQTGKVVTSASISSG KFALKIQKLMN  EHVEANYLLIASGS
Subjt:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGS

Query:  SRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVD
        SRQGFSLAAQLGHSL+DPVPSLFTFKIEDPQLAELSG                        VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVD
Subjt:  SRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVD

Query:  FTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEIS
        F PD HLEDVKTILSRHKSQFM                          EINDEILWASISNKSLASISSLLKQCIFK+LGKGQFKDEFVTAGGV LSEIS
Subjt:  FTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEIS

Query:  LKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA
        LKTMESKIHSRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAGTSIGKLANGE LGRDISNLA
Subjt:  LKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA

XP_022993604.1 uncharacterized protein LOC111489549 isoform X1 [Cucurbita maxima]3.2e-18276.36Show/hide
Query:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVM------------------------------SLAEHYPRGHKEFRGPFFNVHGP
        M+ T+A+TSIVAVQKLNEE+LVVVGGGAAGVYGAIRAKTLAPNLNVM                              SLAEHYPRGHKEFRGPFFNVHGP
Subjt:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVM------------------------------SLAEHYPRGHKEFRGPFFNVHGP

Query:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGS
        MDTMSWFSNHGV+LKVEDDGRVFPVSN SASIVDCL+SEAKRTG           VSLQTGKVVTSASISSG KFALKIQKLMN  EHVEANYLLIASGS
Subjt:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGS

Query:  SRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVD
        SRQGFSLAAQLGHSL+DPVPSLFTFKIEDPQLAELSG                        VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVD
Subjt:  SRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVD

Query:  FTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEIS
        F PDLHLEDVKTILSRHKSQFM                          EINDEILWASISNKSLASISSLLKQCIFK+LGKGQFKDEFVTAGGV LSEIS
Subjt:  FTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEIS

Query:  LKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA
        LKTMESKIHSRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAGTSIGKLANGE LGRD+SNLA
Subjt:  LKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA

XP_023549941.1 uncharacterized protein LOC111808280 [Cucurbita pepo subsp. pepo]9.3e-18276.14Show/hide
Query:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVM------------------------------SLAEHYPRGHKEFRGPFFNVHGP
        M+ T+A+TSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVM                              SLAEHYPRGHKEFRGPFFNVHGP
Subjt:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVM------------------------------SLAEHYPRGHKEFRGPFFNVHGP

Query:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGS
        MDTMSWFSNHGV+LKVEDDGRVFPVSN SASI+DCL++EAKRTG           VSLQTGKVVTSASISSG KFALKIQKLMN  EHVEANYLLIASGS
Subjt:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGS

Query:  SRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVD
        SRQGFSLAAQLGHSL+DPVPSLFTFKIEDPQLAELSG                        VGPMLVTHWGLSGPVILRLSAWGARDLF SDYKGLLIVD
Subjt:  SRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVD

Query:  FTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEIS
        F PDLHLEDVKTILSRHKSQFM                          EINDEILWASISNKSLASISSLLKQCIFK+LGKGQFKDEFVTAGGV LSEIS
Subjt:  FTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEIS

Query:  LKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA
        LKTMESKIHSRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAGTSIGKLANGE LGRDISNLA
Subjt:  LKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA

XP_038889404.1 uncharacterized protein YtfP isoform X2 [Benincasa hispida]1.7e-18376.36Show/hide
Query:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVM------------------------------SLAEHYPRGHKEFRGPFFNVHGP
        M+ T+ALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNV+                              SLAEHYPRG+KEFRGPFFNVHGP
Subjt:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVM------------------------------SLAEHYPRGHKEFRGPFFNVHGP

Query:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGS
        MDTMSWFSNHGVELK+E+DGRVFPVSNCSASIVDCL+SE+KRTG           VSLQTGKVVTSAS+SSG KFALKIQKLMNC EH+EANYLLIASGS
Subjt:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGS

Query:  SRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVD
        SRQGFSLAAQ GHSLIDPVPSLFTFKIEDPQLAELSG                        VGPMLVTHWGLSGPVILRLSAWGARDLF SDYKGLLIVD
Subjt:  SRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVD

Query:  FTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEIS
        FTPDLHLEDVKTILSRHKSQFM                          EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEIS
Subjt:  FTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEIS

Query:  LKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA
        LKTMESKI SRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAGTSIGKLANGE LGRDISNLA
Subjt:  LKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA

TrEMBL top hitse value%identityAlignment
A0A0A0KVG6 Uncharacterized protein8.2e-18476.14Show/hide
Query:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVM------------------------------SLAEHYPRGHKEFRGPFFNVHGP
        M+ T+ALTS VA QKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNV+                              SLAEHYPRGHKEFRGPFFNVHGP
Subjt:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVM------------------------------SLAEHYPRGHKEFRGPFFNVHGP

Query:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGS
        MDTMSWFSNHGVELKVEDDGRVFPVSNCS+S+VDCL+SEAKRTG           VSLQTGKVV SASIS+G KFALKIQKL+NCFEHVEANYLLIASGS
Subjt:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGS

Query:  SRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVD
        SRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG                        VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVD
Subjt:  SRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVD

Query:  FTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEIS
        FTPDLHLE+VKTIL+RHKSQFM                          EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEIS
Subjt:  FTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEIS

Query:  LKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA
        LKTMESKIHSRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAGTSIG+LANGE LGRDI+NLA
Subjt:  LKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA

A0A1S3C9B6 uncharacterized protein YtfP isoform X17.2e-18075.05Show/hide
Query:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVM------------------------------SLAEHYPRGHKEFRGPFFNVHGP
        M+ T+ALTS VA QKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNV+                              SLAEHYPRGHKEFRGPFFNVHGP
Subjt:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVM------------------------------SLAEHYPRGHKEFRGPFFNVHGP

Query:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGS
        MDTMSWFSNHGVELKVEDDGRVFPVSNCS+S+VDCL+SEAKRTG           VSLQTGKVV SASIS+G KFALKIQKL+NCFEHVEANYLLIASGS
Subjt:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGS

Query:  SRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVD
        SRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG                        VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVD
Subjt:  SRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVD

Query:  FTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEIS
        FTPDLHLEDVK IL+RHKSQFM                          EINDEILWASISNKSLASIS LLKQCIFKILGKGQFKDEFVTAGGVPLSE+S
Subjt:  FTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEIS

Query:  LKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA
        LKTMESKIHSRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAGTSIG LANGE L  DI+N A
Subjt:  LKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA

A0A6J1BWQ2 uncharacterized protein LOC111006025 isoform X16.5e-17373.16Show/hide
Query:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVM------------------------------SLAEHYPRGHKEFRGPFFNVHGP
        M+  +ALTS VAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVM                              SLAEHYPRGHKEFRG FFNVHGP
Subjt:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVM------------------------------SLAEHYPRGHKEFRGPFFNVHGP

Query:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGS
        MDTMSWFSNHGVELK+EDDGRVFPVSNCSASIVDCL+ EA R G           VSLQTGKVVTSAS SSG KF LKIQK++   EHVEANYLLIASGS
Subjt:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGS

Query:  SRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVD
        SRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG                        VGPMLVTHWGLSGPVILRLSAWGARDLFAS+YKGLLIVD
Subjt:  SRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVD

Query:  FTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEIS
        FTPDLHLEDVK+ILSRHKSQFM                          EI+DEILWAS+SNKSLAS+SSLLK+CIFK+LGKGQFKDEFVTAGGVPLSEIS
Subjt:  FTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEIS

Query:  LKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGR-DISNLA
        LKTMESKIHSRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAGTSIGKLANG  LGR D+ N+A
Subjt:  LKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGR-DISNLA

A0A6J1FLC1 uncharacterized protein LOC111445022 isoform X11.3e-18176.14Show/hide
Query:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVM------------------------------SLAEHYPRGHKEFRGPFFNVHGP
        M+ TRA+TSIV VQKLNEELLVVVGGGAAGVYGA+RAKTLAPNLNVM                              SLAEHYPRGHKEFRGPFFNVHGP
Subjt:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVM------------------------------SLAEHYPRGHKEFRGPFFNVHGP

Query:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGS
        MDTMSWFSNHGV+LKVEDDGRVFPV+N SASIVDCL+SEAKRTG           VSLQTGKVVTSASISSG KFALKIQKLMN  EHVEANYLLIASGS
Subjt:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGS

Query:  SRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVD
        SRQGFSLAAQLGHSL+DPVPSLFTFKIEDPQLAELSG                        VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVD
Subjt:  SRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVD

Query:  FTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEIS
        F PD HLEDVKTILSRHKSQFM                          EINDEILWASISNKSLASISSLLKQCIFK+LGKGQFKDEFVTAGGV LSEIS
Subjt:  FTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEIS

Query:  LKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA
        LKTMESKIHSRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAGTSIGKLANGE LGRDISNLA
Subjt:  LKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA

A0A6J1K0M0 uncharacterized protein LOC111489549 isoform X11.6e-18276.36Show/hide
Query:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVM------------------------------SLAEHYPRGHKEFRGPFFNVHGP
        M+ T+A+TSIVAVQKLNEE+LVVVGGGAAGVYGAIRAKTLAPNLNVM                              SLAEHYPRGHKEFRGPFFNVHGP
Subjt:  MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVM------------------------------SLAEHYPRGHKEFRGPFFNVHGP

Query:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGS
        MDTMSWFSNHGV+LKVEDDGRVFPVSN SASIVDCL+SEAKRTG           VSLQTGKVVTSASISSG KFALKIQKLMN  EHVEANYLLIASGS
Subjt:  MDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGS

Query:  SRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVD
        SRQGFSLAAQLGHSL+DPVPSLFTFKIEDPQLAELSG                        VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVD
Subjt:  SRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVD

Query:  FTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEIS
        F PDLHLEDVKTILSRHKSQFM                          EINDEILWASISNKSLASISSLLKQCIFK+LGKGQFKDEFVTAGGV LSEIS
Subjt:  FTPDLHLEDVKTILSRHKSQFM--------------------------EINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEIS

Query:  LKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA
        LKTMESKIHSRLFFAGEVLN+DGVTGGFNFQNAWSGGYIAGTSIGKLANGE LGRD+SNLA
Subjt:  LKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA

SwissProt top hitse value%identityAlignment
B0NAQ4 3-dehydro-bile acid delta(4,6)-reductase4.7e-1923.08Show/hide
Query:  VVGGGAAGVYGAIRAKTLAPNLNVMSLAEHYPRGHK--------------------------EFRGPFFNVHGPMDTMSWFSNHGVELKVEDDGRVFPVS
        ++GGGA+G+  AI A     +  V  L +    G K                          EF        G  +T+ +F++ G+  K    G ++P S
Subjt:  VVGGGAAGVYGAIRAKTLAPNLNVMSLAEHYPRGHK--------------------------EFRGPFFNVHGPMDTMSWFSNHGVELKVEDDGRVFPVS

Query:  NCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIAS--------GSSRQGFSLAAQLGHSLIDP
        + +AS+++ L  E +R                Q  K+ T   + + +K + K   +    +   A+ +++A         GS   G++LA  +GH+L   
Subjt:  NCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIAS--------GSSRQGFSLAAQLGHSLIDP

Query:  VPSLFTFKIEDPQLAELSGV-------------------GPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVDFTPDLHLEDVKTILSRHKSQFME
        VP+L   K++    A+ +GV                   G M +T +G+SG  + ++S   A+ L+    +  + VDF P++    V+   + H  +   
Subjt:  VPSLFTFKIEDPQLAELSGV-------------------GPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVDFTPDLHLEDVKTILSRHKSQFME

Query:  INDEILWASISNKSL---------------------ASISSLLKQC---IFKILGKGQFKDEFVTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVT
           +     I  K L                     A    L++ C   +  I     F +  V AGGV   E+   T+ES+    L+  GE+L+++G+ 
Subjt:  INDEILWASISNKSL---------------------ASISSLLKQC---IFKILGKGQFKDEFVTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVT

Query:  GGFNFQNAWSGGYIAG
        GG+N Q AW+ GY+AG
Subjt:  GGFNFQNAWSGGYIAG

P37631 Uncharacterized protein YhiN3.9e-1325.53Show/hide
Query:  DTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQKL-MNCFEHVEA--NYLLIAS
        D +   + HG+    +  G++F   + +  IVD L+ E ++          +V   L++ +V++ A   +G  F L +  + + C + V A     +   
Subjt:  DTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQKL-MNCFEHVEA--NYLLIAS

Query:  GSSRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAE---LSGVG---------------PMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVDFTPD
        G+S  G+ +A Q G +++     L  F +  P L E   L+GV                 +L TH GLSGP +L++S++     F S       ++  PD
Subjt:  GSSRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAE---LSGVG---------------PMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVDFTPD

Query:  LHLE-------------DVKTILSRHKSQFMEINDEILWASISNKSLASISSLLKQCIFKIL--------GKGQFKDEFVTAGGVPLSEISLKTMESKIH
        + LE              +K  L+ H  + +    + L   I + SL  ++   +Q +   L        G   ++   VT GGV  +E+S +TME++  
Subjt:  LHLE-------------DVKTILSRHKSQFMEINDEILWASISNKSLASISSLLKQCIFKIL--------GKGQFKDEFVTAGGVPLSEISLKTMESKIH

Query:  SRLFFAGEVLNIDGVTGGFNFQNAWSGGY
          L+F GEV+++ G  GG+NFQ AWS  +
Subjt:  SRLFFAGEVLNIDGVTGGFNFQNAWSGGY

P44941 Uncharacterized protein HI_09333.4e-1725Show/hide
Query:  VVVGGGAAGVYGAIRAKTLAPNLNVMSLAE------------------------HYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDGRVFPVSN
        +++G GAAG++ A +   L  ++ V    +                        HY   +  F       +   D +S  +  G+    ++ G++F    
Subjt:  VVVGGGAAGVYGAIRAKTLAPNLNVMSLAE------------------------HYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDGRVFPVSN

Query:  CSASIVDCLISEAKRTGG-LLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQKLM-NCFEHVEA--NYLLIASGSSRQGFSLAAQLGHSLIDPVPSL-
         +  IV+ L SE  + G  +L+ +  S    +Q             V+F L++      C   + A     +   G++  G+ +A Q G  +I P  SL 
Subjt:  CSASIVDCLISEAKRTGG-LLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQKLM-NCFEHVEA--NYLLIASGSSRQGFSLAAQLGHSLIDPVPSL-

Query:  -FTFKIEDPQLAELSGV---------------GPMLVTHWGLSGPVILRLS-AWGARDLFASDYKGLLIVDFTPDLHLED-------------VKTILSR
         FT++  D  L  LSG+                 +L TH G+SGP +L++S  W   +         + +D  P+ ++E+             +KTIL R
Subjt:  -FTFKIEDPQLAELSGV---------------GPMLVTHWGLSGPVILRLS-AWGARDLFASDYKGLLIVDFTPDLHLED-------------VKTILSR

Query:  -HKSQFME-------INDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAW
            + +E       + DE++ A+IS   + ++   +    F   G   ++   VT GGV    IS KTMES   S L+F GEVL++ G  GG+NFQ AW
Subjt:  -HKSQFME-------INDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKTMESKIHSRLFFAGEVLNIDGVTGGFNFQNAW

Query:  SGGYIAGTSIGK
        S  Y    SI +
Subjt:  SGGYIAGTSIGK

Q795R8 Uncharacterized protein YtfP1.1e-2325.81Show/hide
Query:  LVVVGGGAAGVYGAIRAK---------------------------TLAPNLNVMSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDGRVF
        ++V+GGG +G+  AI A                             +   L V  + +H P G+  F    F+     D + +F N G++LK ED GR+F
Subjt:  LVVVGGGAAGVYGAIRAK---------------------------TLAPNLNVMSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDGRVF

Query:  PVSNCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIA--------SGSSRQGFSLAAQLGHSL
        PV++ + S+VD L++  K+           + V+++T + + S     G    +    + N  E + +  ++IA        +GS+  G+  A   GH++
Subjt:  PVSNCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIA--------SGSSRQGFSLAAQLGHSL

Query:  IDPVP---------------SLFTFKIEDPQLAELSGVG--------PMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVDFTPDLHLE--------
         +  P               +L    + D  ++ L+  G         ML TH+GLSGP ILR S +  ++L       + I D  PD++ E        
Subjt:  IDPVP---------------SLFTFKIEDPQLAELSGVG--------PMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVDFTPDLHLE--------

Query:  --------DVKTIL-----SRHKSQFMEINDEILWASISNKSLASISSLLKQC-IFKILGKG--QFKDEFVTAGGVPLSEISLKTMESKIHSRLFFAGEV
                 +K +L      R+    +E N      S S          ++ C  F +L  G       FVT GGV + EI  K M SK    L+F GE+
Subjt:  --------DVKTIL-----SRHKSQFMEINDEILWASISNKSLASISSLLKQC-IFKILGKG--QFKDEFVTAGGVPLSEISLKTMESKIHSRLFFAGEV

Query:  LNIDGVTGGFNFQNAWSGGYIAGTSIGKLA
        L+I G TGG+N  +A   G +AG + G+ A
Subjt:  LNIDGVTGGFNFQNAWSGGYIAGTSIGKLA

Arabidopsis top hitse value%identityAlignment
AT5G39940.1 FAD/NAD(P)-binding oxidoreductase family protein6.1e-12354Show/hide
Query:  ALTSIV-AVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVM------------------------------SLAEHYPRGHKEFRGPFFNVHGPMDTM
        A+TS+    +K   ELLVVVGGGAAGVYGAIRAKTL+P+L V+                              +LA HYPRGHKE +G FF  HGP DTM
Subjt:  ALTSIV-AVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVM------------------------------SLAEHYPRGHKEFRGPFFNVHGPMDTM

Query:  SWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQK-LMNCFEHVEANYLLIASGSSRQ
        SWFS HGV LK EDDGRVFPVS+ S S+VDCL++EA   G           V L+ GK V +ASI    KF +K+ K   +  E +EA YLLIA+GSS++
Subjt:  SWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQK-LMNCFEHVEANYLLIASGSSRQ

Query:  GFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVDFTP
        G SLA + GHS++DPVPSLFTFKI DP L EL+G                        +GPMLVTHWGLSGPVILRLSAWGAR LF+S YKG LIVDF P
Subjt:  GFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVDFTP

Query:  DLHLEDVKTILSRHKSQFME--------------------------INDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKT
        D+++E  K++L  HK QF +                           + + LWAS+SN SL+SIS LLK C F++ GKGQ+KDEFVTAGGVPLSE+SLKT
Subjt:  DLHLEDVKTILSRHKSQFME--------------------------INDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKT

Query:  MESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESL
        MESK+   LFFAGEVLN+DGVTGGFNFQNAWSGGYIAGT+IG+LA+   +
Subjt:  MESKIHSRLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESL

AT5G39940.2 FAD/NAD(P)-binding oxidoreductase family protein1.5e-11353.27Show/hide
Query:  ALTSIV-AVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVM------------------------------SLAEHYPRGHKEFRGPFFNVHGPMDTM
        A+TS+    +K   ELLVVVGGGAAGVYGAIRAKTL+P+L V+                              +LA HYPRGHKE +G FF  HGP DTM
Subjt:  ALTSIV-AVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVM------------------------------SLAEHYPRGHKEFRGPFFNVHGPMDTM

Query:  SWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQK-LMNCFEHVEANYLLIASGSSRQ
        SWFS HGV LK EDDGRVFPVS+ S S+VDCL++EA   G           V L+ GK V +ASI    KF +K+ K   +  E +EA YLLIA+GSS++
Subjt:  SWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEAKRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQK-LMNCFEHVEANYLLIASGSSRQ

Query:  GFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVDFTP
        G SLA + GHS++DPVPSLFTFKI DP L EL+G                        +GPMLVTHWGLSGPVILRLSAWGAR LF+S YKG LIVDF P
Subjt:  GFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSG------------------------VGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLLIVDFTP

Query:  DLHLEDVKTILSRHKSQFME--------------------------INDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKT
        D+++E  K++L  HK QF +                           + + LWAS+SN SL+SIS LLK C F++ GKGQ+KDEFVTAGGVPLSE+SLKT
Subjt:  DLHLEDVKTILSRHKSQFME--------------------------INDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKT

Query:  MESKIHSRLFFAGEVLNIDGVTGGFNFQ
        MESK+   LFFAGEVLN+DGVTGGFNFQ
Subjt:  MESKIHSRLFFAGEVLNIDGVTGGFNFQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTCAAACGAGAGCTTTAACCTCCATTGTTGCAGTCCAAAAGTTGAATGAAGAACTGTTGGTAGTGGTAGGAGGTGGAGCAGCAGGTGTTTATGGAGCTATTAGAGC
TAAAACCCTCGCCCCCAATCTCAATGTCATGAGTTTGGCAGAGCATTACCCTAGAGGCCATAAAGAATTTAGGGGCCCTTTCTTCAATGTTCACGGTCCAATGGATACAA
TGTCCTGGTTTTCCAATCATGGAGTTGAACTGAAGGTTGAGGATGATGGAAGGGTTTTTCCTGTCAGCAATTGTTCTGCTTCTATAGTCGATTGTCTGATTTCTGAAGCA
AAACGCACTGGAGGTTTGTTAATTGATGCTCTGTTCTCGGTGTTTGTTTCCTTGCAGACTGGAAAGGTTGTTACAAGTGCATCGATTAGTAGTGGCGTGAAGTTCGCTTT
GAAGATTCAAAAGCTTATGAATTGTTTTGAACACGTTGAAGCAAACTATTTACTGATTGCTAGTGGGAGTAGTCGGCAGGGCTTTAGTCTAGCTGCTCAGCTTGGACATT
CACTTATAGACCCAGTGCCAAGCCTATTTACTTTCAAGATTGAAGATCCTCAATTGGCAGAGTTGTCTGGGGTTGGGCCTATGCTTGTCACACATTGGGGACTTAGTGGA
CCGGTAATTCTTCGTTTATCGGCTTGGGGAGCTCGTGACCTATTTGCTTCAGATTATAAAGGCCTGCTCATTGTGGATTTTACACCTGATTTACATTTGGAAGATGTCAA
AACAATTCTTAGCCGGCACAAATCTCAGTTTATGGAAATAAATGATGAGATTTTGTGGGCTTCCATCTCAAACAAATCATTAGCTTCCATTTCTTCTCTGTTGAAACAAT
GCATATTTAAGATCTTGGGGAAGGGTCAATTTAAGGATGAATTTGTCACTGCTGGTGGTGTTCCGCTGTCAGAGATCTCACTTAAAACAATGGAGAGCAAAATTCATTCT
CGCCTATTCTTTGCTGGGGAGGTGCTAAATATCGATGGCGTAACGGGTGGTTTCAACTTTCAGAATGCTTGGTCCGGTGGCTACATTGCTGGAACTAGCATTGGTAAACT
TGCAAATGGTGAGTCTCTAGGGAGGGATATAAGCAATTTGGCTTGA
mRNA sequenceShow/hide mRNA sequence
TGAAACCACGCATTTGGGTCATACAGCCAATCATTTTATCCTATGAGGTGTTGTCCAGAGACTAAATTATGCCCCGCTAAATTCTCGTCTTCAAGGTTGGTTCCGAATCA
AAGTGGCGCTAAATTCTCGTCTTGAAGGTTGGTTCCGAATCAAACCACCCACTTCTATAAATTCATAGAATGAGTCAAACGAGAGCTTTAACCTCCATTGTTGCAGTCCA
AAAGTTGAATGAAGAACTGTTGGTAGTGGTAGGAGGTGGAGCAGCAGGTGTTTATGGAGCTATTAGAGCTAAAACCCTCGCCCCCAATCTCAATGTCATGAGTTTGGCAG
AGCATTACCCTAGAGGCCATAAAGAATTTAGGGGCCCTTTCTTCAATGTTCACGGTCCAATGGATACAATGTCCTGGTTTTCCAATCATGGAGTTGAACTGAAGGTTGAG
GATGATGGAAGGGTTTTTCCTGTCAGCAATTGTTCTGCTTCTATAGTCGATTGTCTGATTTCTGAAGCAAAACGCACTGGAGGTTTGTTAATTGATGCTCTGTTCTCGGT
GTTTGTTTCCTTGCAGACTGGAAAGGTTGTTACAAGTGCATCGATTAGTAGTGGCGTGAAGTTCGCTTTGAAGATTCAAAAGCTTATGAATTGTTTTGAACACGTTGAAG
CAAACTATTTACTGATTGCTAGTGGGAGTAGTCGGCAGGGCTTTAGTCTAGCTGCTCAGCTTGGACATTCACTTATAGACCCAGTGCCAAGCCTATTTACTTTCAAGATT
GAAGATCCTCAATTGGCAGAGTTGTCTGGGGTTGGGCCTATGCTTGTCACACATTGGGGACTTAGTGGACCGGTAATTCTTCGTTTATCGGCTTGGGGAGCTCGTGACCT
ATTTGCTTCAGATTATAAAGGCCTGCTCATTGTGGATTTTACACCTGATTTACATTTGGAAGATGTCAAAACAATTCTTAGCCGGCACAAATCTCAGTTTATGGAAATAA
ATGATGAGATTTTGTGGGCTTCCATCTCAAACAAATCATTAGCTTCCATTTCTTCTCTGTTGAAACAATGCATATTTAAGATCTTGGGGAAGGGTCAATTTAAGGATGAA
TTTGTCACTGCTGGTGGTGTTCCGCTGTCAGAGATCTCACTTAAAACAATGGAGAGCAAAATTCATTCTCGCCTATTCTTTGCTGGGGAGGTGCTAAATATCGATGGCGT
AACGGGTGGTTTCAACTTTCAGAATGCTTGGTCCGGTGGCTACATTGCTGGAACTAGCATTGGTAAACTTGCAAATGGTGAGTCTCTAGGGAGGGATATAAGCAATTTGG
CTTGAGCATTTTTTACGTGAAAATTTGTTAACATTCTGTGTGGAGATTGGATTGGCCCTTTGTACCTTTGCTGATAACTTTATTGTCCAAAAAAAATGTAAAAAAAAAAG
AAAAAAGAAAAAACAGAAGAAAAGAAAACTTCATTTATCACAGACACTTGAATCTTGCAAACTGATGAAAAAAATTTGACCTTTCTTTTTGCGTTACTCTAAGGTTATAG
GTTCAAATACCGGTACCTCCACATTTGTTATACTAAACAAAAATTTTGACCTTTTCTGTGTATGTCTCTTGTTTATTGATTGGATTTCCTGTAATATAATCCTTTGGTAT
TTTGATTATGTAAAGGATACTATGAACCCATGGTTTATGTGTTATGCATTTTGTTG
Protein sequenceShow/hide protein sequence
MSQTRALTSIVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMSLAEHYPRGHKEFRGPFFNVHGPMDTMSWFSNHGVELKVEDDGRVFPVSNCSASIVDCLISEA
KRTGGLLIDALFSVFVSLQTGKVVTSASISSGVKFALKIQKLMNCFEHVEANYLLIASGSSRQGFSLAAQLGHSLIDPVPSLFTFKIEDPQLAELSGVGPMLVTHWGLSG
PVILRLSAWGARDLFASDYKGLLIVDFTPDLHLEDVKTILSRHKSQFMEINDEILWASISNKSLASISSLLKQCIFKILGKGQFKDEFVTAGGVPLSEISLKTMESKIHS
RLFFAGEVLNIDGVTGGFNFQNAWSGGYIAGTSIGKLANGESLGRDISNLA