; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr024060 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr024060
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionFAD/NAD(P)-binding oxidoreductase family protein
Genome locationtig00001047:2906388..2911099
RNA-Seq ExpressionSgr024060
SyntenySgr024060
Gene Ontology termsNA
InterPro domainsIPR004792 - 3-Dehydro-bile acid delta(4,6)-reductase-like
IPR023166 - HI0933-like insert domain superfamily
IPR036188 - FAD/NAD(P)-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578599.1 hypothetical protein SDJN03_23047, partial [Cucurbita argyrosperma subsp. sororia]4.8e-21286.99Show/hide
Query:  MNLVKALTSTVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLSKVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGP
        MNL KA+TS V VQKLNEELLVVVGGGAAGVYGA+RAKTLAPNLNVMVIEKG+PLSKVKISGGGRCNVTNGH+TDA +LAEHYPRGHKEFRG FFN+HGP
Subjt:  MNLVKALTSTVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLSKVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGP

Query:  MDTMSWFSNHGVELKVENDGRVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQKLANFVEHVEANYLLIASGSSRQGFSLAAQL
        MDTMSWFSNHGV+LKVE+DGRVFPV+N SASIVDCLMSEAKRTGVSLQTGKVVTSAS S GGKF LKIQKL N VEHVEANYLLIASGSSRQGFSLAAQL
Subjt:  MDTMSWFSNHGVELKVENDGRVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQKLANFVEHVEANYLLIASGSSRQGFSLAAQL

Query:  GHSLVDPVPSLFTFKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLEDVK
        GHSLVDPVPSLFTFKIEDP LAELSGVSFPKV+AKLKLEN+QRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGL+IVDFAPD+HLEDVK
Subjt:  GHSLVDPVPSLFTFKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLEDVK

Query:  TILSQHKSQFM--------------------------EIHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSR
        TILS+HKSQFM                          EI+DEILWAS+SNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGV LSEISLKTMESKIHSR
Subjt:  TILSQHKSQFM--------------------------EIHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSR

Query:  LYFAGEVLNVDGVTGGFNFQNAWSGGYIAGSSIGKLAN
        L+FAGEVLNVDGVTGGFNFQNAWSGGYIAG+SIGKLAN
Subjt:  LYFAGEVLNVDGVTGGFNFQNAWSGGYIAGSSIGKLAN

XP_022133452.1 uncharacterized protein LOC111006025 isoform X1 [Momordica charantia]1.1e-21186.53Show/hide
Query:  MNLVKALTSTVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLSKVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGP
        MNL KALTS+VAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLSKVKISGGGRCNVTNGH TD+ +LAEHYPRGHKEFRGSFFN+HGP
Subjt:  MNLVKALTSTVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLSKVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGP

Query:  MDTMSWFSNHGVELKVENDGRVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQKLANFVEHVEANYLLIASGSSRQGFSLAAQL
        MDTMSWFSNHGVELK+E+DGRVFPVSNCSASIVDCLM EA R GVSLQTGKVVTSASTS GGKF+LKIQK+   VEHVEANYLLIASGSSRQGFSLAAQL
Subjt:  MDTMSWFSNHGVELKVENDGRVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQKLANFVEHVEANYLLIASGSSRQGFSLAAQL

Query:  GHSLVDPVPSLFTFKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLEDVK
        GHSL+DPVPSLFTFKIEDP LAELSGVSFPKV+AKL+LENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFAS+YKGL+IVDF PD HLEDVK
Subjt:  GHSLVDPVPSLFTFKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLEDVK

Query:  TILSQHKSQFM--------------------------EIHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSR
        +ILS+HKSQFM                          EIHDEILWAS+SNKSLAS+SSLLK+CIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSR
Subjt:  TILSQHKSQFM--------------------------EIHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSR

Query:  LYFAGEVLNVDGVTGGFNFQNAWSGGYIAGSSIGKLAN
        L+FAGEVLNVDGVTGGFNFQNAWSGGYIAG+SIGKLAN
Subjt:  LYFAGEVLNVDGVTGGFNFQNAWSGGYIAGSSIGKLAN

XP_022938975.1 uncharacterized protein LOC111445022 isoform X1 [Cucurbita moschata]1.1e-21186.76Show/hide
Query:  MNLVKALTSTVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLSKVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGP
        MNL +A+TS V VQKLNEELLVVVGGGAAGVYGA+RAKTLAPNLNVMVIEKG+PLSKVKISGGGRCNVTNGH+TDA +LAEHYPRGHKEFRG FFN+HGP
Subjt:  MNLVKALTSTVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLSKVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGP

Query:  MDTMSWFSNHGVELKVENDGRVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQKLANFVEHVEANYLLIASGSSRQGFSLAAQL
        MDTMSWFSNHGV+LKVE+DGRVFPV+N SASIVDCLMSEAKRTGVSLQTGKVVTSAS S GGKF LKIQKL N VEHVEANYLLIASGSSRQGFSLAAQL
Subjt:  MDTMSWFSNHGVELKVENDGRVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQKLANFVEHVEANYLLIASGSSRQGFSLAAQL

Query:  GHSLVDPVPSLFTFKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLEDVK
        GHSLVDPVPSLFTFKIEDP LAELSGVSFPKV+AKLKLEN+QRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGL+IVDFAPD+HLEDVK
Subjt:  GHSLVDPVPSLFTFKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLEDVK

Query:  TILSQHKSQFM--------------------------EIHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSR
        TILS+HKSQFM                          EI+DEILWAS+SNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGV LSEISLKTMESKIHSR
Subjt:  TILSQHKSQFM--------------------------EIHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSR

Query:  LYFAGEVLNVDGVTGGFNFQNAWSGGYIAGSSIGKLAN
        L+FAGEVLNVDGVTGGFNFQNAWSGGYIAG+SIGKLAN
Subjt:  LYFAGEVLNVDGVTGGFNFQNAWSGGYIAGSSIGKLAN

XP_022993604.1 uncharacterized protein LOC111489549 isoform X1 [Cucurbita maxima]1.3e-21287.44Show/hide
Query:  MNLVKALTSTVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLSKVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGP
        MNL KA+TS VAVQKLNEE+LVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKG+PLSKVKISGGGRCNVTNGH+TDA +LAEHYPRGHKEFRG FFN+HGP
Subjt:  MNLVKALTSTVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLSKVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGP

Query:  MDTMSWFSNHGVELKVENDGRVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQKLANFVEHVEANYLLIASGSSRQGFSLAAQL
        MDTMSWFSNHGV+LKVE+DGRVFPVSN SASIVDCLMSEAKRTGVSLQTGKVVTSAS S GGKF LKIQKL N VEHVEANYLLIASGSSRQGFSLAAQL
Subjt:  MDTMSWFSNHGVELKVENDGRVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQKLANFVEHVEANYLLIASGSSRQGFSLAAQL

Query:  GHSLVDPVPSLFTFKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLEDVK
        GHSLVDPVPSLFTFKIEDP LAELSGVSFPKV+AKLKLEN+QRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGL+IVDFAPD HLEDVK
Subjt:  GHSLVDPVPSLFTFKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLEDVK

Query:  TILSQHKSQFM--------------------------EIHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSR
        TILS+HKSQFM                          EI+DEILWAS+SNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGV LSEISLKTMESKIHSR
Subjt:  TILSQHKSQFM--------------------------EIHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSR

Query:  LYFAGEVLNVDGVTGGFNFQNAWSGGYIAGSSIGKLAN
        L+FAGEVLNVDGVTGGFNFQNAWSGGYIAG+SIGKLAN
Subjt:  LYFAGEVLNVDGVTGGFNFQNAWSGGYIAGSSIGKLAN

XP_023549941.1 uncharacterized protein LOC111808280 [Cucurbita pepo subsp. pepo]4.8e-21286.99Show/hide
Query:  MNLVKALTSTVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLSKVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGP
        MNL KA+TS VAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKG+PLSKVKISGGGRCNVTNGH+TDA +LAEHYPRGHKEFRG FFN+HGP
Subjt:  MNLVKALTSTVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLSKVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGP

Query:  MDTMSWFSNHGVELKVENDGRVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQKLANFVEHVEANYLLIASGSSRQGFSLAAQL
        MDTMSWFSNHGV+LKVE+DGRVFPVSN SASI+DCLM+EAKRTGVSLQTGKVVTSAS S GGKF LKIQKL N VEHVEANYLLIASGSSRQGFSLAAQL
Subjt:  MDTMSWFSNHGVELKVENDGRVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQKLANFVEHVEANYLLIASGSSRQGFSLAAQL

Query:  GHSLVDPVPSLFTFKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLEDVK
        GHSLVDPVPSLFTFKIEDP LAELSGVSFPKV+AKLKLEN+QRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLF SDYKGL+IVDFAPD HLEDVK
Subjt:  GHSLVDPVPSLFTFKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLEDVK

Query:  TILSQHKSQFM--------------------------EIHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSR
        TILS+HKSQFM                          EI+DEILWAS+SNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGV LSEISLKTMESKIHSR
Subjt:  TILSQHKSQFM--------------------------EIHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSR

Query:  LYFAGEVLNVDGVTGGFNFQNAWSGGYIAGSSIGKLAN
        L+FAGEVLNVDGVTGGFNFQNAWSGGYIAG+SIGKLAN
Subjt:  LYFAGEVLNVDGVTGGFNFQNAWSGGYIAGSSIGKLAN

TrEMBL top hitse value%identityAlignment
A0A0A0KVG6 Uncharacterized protein6.7e-21286.07Show/hide
Query:  MNLVKALTSTVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLSKVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGP
        MNL KALTS VA QKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNV+VIEKG+PLSKVKISGGGRCNVTNGHYTDA +LAEHYPRGHKEFRG FFN+HGP
Subjt:  MNLVKALTSTVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLSKVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGP

Query:  MDTMSWFSNHGVELKVENDGRVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQKLANFVEHVEANYLLIASGSSRQGFSLAAQL
        MDTMSWFSNHGVELKVE+DGRVFPVSNCS+S+VDCLMSEAKRTGVSLQTGKVV SAS S GGKF LKIQKL N  EHVEANYLLIASGSSRQGFSLAAQL
Subjt:  MDTMSWFSNHGVELKVENDGRVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQKLANFVEHVEANYLLIASGSSRQGFSLAAQL

Query:  GHSLVDPVPSLFTFKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLEDVK
        GHSL+DPVPSLFTFKIEDP LAELSGVSFPKV+AKLKLEN+QRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGL+IVDF PD HLE+VK
Subjt:  GHSLVDPVPSLFTFKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLEDVK

Query:  TILSQHKSQFM--------------------------EIHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSR
        TIL++HKSQFM                          EI+DEILWAS+SNKSLASISSLLKQCIFK+LGKGQFKDEFVTAGGVPLSEISLKTMESKIHSR
Subjt:  TILSQHKSQFM--------------------------EIHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSR

Query:  LYFAGEVLNVDGVTGGFNFQNAWSGGYIAGSSIGKLAN
        L+FAGEVLNVDGVTGGFNFQNAWSGGYIAG+SIG+LAN
Subjt:  LYFAGEVLNVDGVTGGFNFQNAWSGGYIAGSSIGKLAN

A0A1S3C9B6 uncharacterized protein YtfP isoform X12.4e-20985.39Show/hide
Query:  MNLVKALTSTVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLSKVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGP
        MNL KALTS VA QKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNV+VIEKG+PLSKVKISGGGRCNVTNGH TDA +LAEHYPRGHKEFRG FFN+HGP
Subjt:  MNLVKALTSTVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLSKVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGP

Query:  MDTMSWFSNHGVELKVENDGRVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQKLANFVEHVEANYLLIASGSSRQGFSLAAQL
        MDTMSWFSNHGVELKVE+DGRVFPVSNCS+S+VDCLMSEAKRTGVSLQTGKVV SAS S GGKF LKIQKL N  EHVEANYLLIASGSSRQGFSLAAQL
Subjt:  MDTMSWFSNHGVELKVENDGRVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQKLANFVEHVEANYLLIASGSSRQGFSLAAQL

Query:  GHSLVDPVPSLFTFKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLEDVK
        GHSL+DPVPSLFTFKIEDP LAELSGVSFPKV+AKLKLEN+QRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGL+IVDF PD HLEDVK
Subjt:  GHSLVDPVPSLFTFKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLEDVK

Query:  TILSQHKSQFM--------------------------EIHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSR
         IL++HKSQFM                          EI+DEILWAS+SNKSLASIS LLKQCIFK+LGKGQFKDEFVTAGGVPLSE+SLKTMESKIHSR
Subjt:  TILSQHKSQFM--------------------------EIHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSR

Query:  LYFAGEVLNVDGVTGGFNFQNAWSGGYIAGSSIGKLAN
        L+FAGEVLNVDGVTGGFNFQNAWSGGYIAG+SIG LAN
Subjt:  LYFAGEVLNVDGVTGGFNFQNAWSGGYIAGSSIGKLAN

A0A6J1BWQ2 uncharacterized protein LOC111006025 isoform X15.2e-21286.53Show/hide
Query:  MNLVKALTSTVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLSKVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGP
        MNL KALTS+VAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLSKVKISGGGRCNVTNGH TD+ +LAEHYPRGHKEFRGSFFN+HGP
Subjt:  MNLVKALTSTVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLSKVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGP

Query:  MDTMSWFSNHGVELKVENDGRVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQKLANFVEHVEANYLLIASGSSRQGFSLAAQL
        MDTMSWFSNHGVELK+E+DGRVFPVSNCSASIVDCLM EA R GVSLQTGKVVTSASTS GGKF+LKIQK+   VEHVEANYLLIASGSSRQGFSLAAQL
Subjt:  MDTMSWFSNHGVELKVENDGRVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQKLANFVEHVEANYLLIASGSSRQGFSLAAQL

Query:  GHSLVDPVPSLFTFKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLEDVK
        GHSL+DPVPSLFTFKIEDP LAELSGVSFPKV+AKL+LENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFAS+YKGL+IVDF PD HLEDVK
Subjt:  GHSLVDPVPSLFTFKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLEDVK

Query:  TILSQHKSQFM--------------------------EIHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSR
        +ILS+HKSQFM                          EIHDEILWAS+SNKSLAS+SSLLK+CIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSR
Subjt:  TILSQHKSQFM--------------------------EIHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSR

Query:  LYFAGEVLNVDGVTGGFNFQNAWSGGYIAGSSIGKLAN
        L+FAGEVLNVDGVTGGFNFQNAWSGGYIAG+SIGKLAN
Subjt:  LYFAGEVLNVDGVTGGFNFQNAWSGGYIAGSSIGKLAN

A0A6J1FLC1 uncharacterized protein LOC111445022 isoform X15.2e-21286.76Show/hide
Query:  MNLVKALTSTVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLSKVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGP
        MNL +A+TS V VQKLNEELLVVVGGGAAGVYGA+RAKTLAPNLNVMVIEKG+PLSKVKISGGGRCNVTNGH+TDA +LAEHYPRGHKEFRG FFN+HGP
Subjt:  MNLVKALTSTVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLSKVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGP

Query:  MDTMSWFSNHGVELKVENDGRVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQKLANFVEHVEANYLLIASGSSRQGFSLAAQL
        MDTMSWFSNHGV+LKVE+DGRVFPV+N SASIVDCLMSEAKRTGVSLQTGKVVTSAS S GGKF LKIQKL N VEHVEANYLLIASGSSRQGFSLAAQL
Subjt:  MDTMSWFSNHGVELKVENDGRVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQKLANFVEHVEANYLLIASGSSRQGFSLAAQL

Query:  GHSLVDPVPSLFTFKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLEDVK
        GHSLVDPVPSLFTFKIEDP LAELSGVSFPKV+AKLKLEN+QRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGL+IVDFAPD+HLEDVK
Subjt:  GHSLVDPVPSLFTFKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLEDVK

Query:  TILSQHKSQFM--------------------------EIHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSR
        TILS+HKSQFM                          EI+DEILWAS+SNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGV LSEISLKTMESKIHSR
Subjt:  TILSQHKSQFM--------------------------EIHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSR

Query:  LYFAGEVLNVDGVTGGFNFQNAWSGGYIAGSSIGKLAN
        L+FAGEVLNVDGVTGGFNFQNAWSGGYIAG+SIGKLAN
Subjt:  LYFAGEVLNVDGVTGGFNFQNAWSGGYIAGSSIGKLAN

A0A6J1K0M0 uncharacterized protein LOC111489549 isoform X16.1e-21387.44Show/hide
Query:  MNLVKALTSTVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLSKVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGP
        MNL KA+TS VAVQKLNEE+LVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKG+PLSKVKISGGGRCNVTNGH+TDA +LAEHYPRGHKEFRG FFN+HGP
Subjt:  MNLVKALTSTVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLSKVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGP

Query:  MDTMSWFSNHGVELKVENDGRVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQKLANFVEHVEANYLLIASGSSRQGFSLAAQL
        MDTMSWFSNHGV+LKVE+DGRVFPVSN SASIVDCLMSEAKRTGVSLQTGKVVTSAS S GGKF LKIQKL N VEHVEANYLLIASGSSRQGFSLAAQL
Subjt:  MDTMSWFSNHGVELKVENDGRVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQKLANFVEHVEANYLLIASGSSRQGFSLAAQL

Query:  GHSLVDPVPSLFTFKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLEDVK
        GHSLVDPVPSLFTFKIEDP LAELSGVSFPKV+AKLKLEN+QRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGL+IVDFAPD HLEDVK
Subjt:  GHSLVDPVPSLFTFKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLEDVK

Query:  TILSQHKSQFM--------------------------EIHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSR
        TILS+HKSQFM                          EI+DEILWAS+SNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGV LSEISLKTMESKIHSR
Subjt:  TILSQHKSQFM--------------------------EIHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSR

Query:  LYFAGEVLNVDGVTGGFNFQNAWSGGYIAGSSIGKLAN
        L+FAGEVLNVDGVTGGFNFQNAWSGGYIAG+SIGKLAN
Subjt:  LYFAGEVLNVDGVTGGFNFQNAWSGGYIAGSSIGKLAN

SwissProt top hitse value%identityAlignment
B0NAQ4 3-dehydro-bile acid delta(4,6)-reductase7.3e-3026.51Show/hide
Query:  VVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPL-SKVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGPMDTMSWFSNHGVELKVENDGR
        ++GGGA+G+  AI A     +  V ++E+ + +  K+  +G GRCN+TN    DA+     Y     EF  +     G  +T+ +F++ G+  K    G 
Subjt:  VVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPL-SKVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGPMDTMSWFSNHGVELKVENDGR

Query:  VFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQKLANFVEHVEANYLLIAS--------GSSRQGFSLAAQLGHSLVDPVPSLFT
        ++P S+ +AS+++ L  E +R  V + TG  V +   S  G F+++        +   A+ +++A         GS   G++LA  +GH+L   VP+L  
Subjt:  VFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQKLANFVEHVEANYLLIAS--------GSSRQGFSLAAQLGHSLVDPVPSLFT

Query:  FKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLEDVKTILSQHKS-----
         K++    A+ +GV   +  AK+     ++ L + T  G M +T +G+SG  + ++S   A+ L+    +  V VDF P+     V+   + H       
Subjt:  FKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLEDVKTILSQHKS-----

Query:  -------------------QFMEIHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSRLYFAGEVLNVDGVTG
                           +   I      + +       +    KQ +  +     F +  V AGGV   E+   T+ES+    LY  GE+L+V+G+ G
Subjt:  -------------------QFMEIHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSRLYFAGEVLNVDGVTG

Query:  GFNFQNAWSGGYIAG
        G+N Q AW+ GY+AG
Subjt:  GFNFQNAWSGGYIAG

P37631 Uncharacterized protein YhiN2.7e-2426.92Show/hide
Query:  VVVGGGAAGVYGAIRAKTLAPNLNVMVIEKG-KPLSKVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGPMDTMSWFSNHGVELKVENDG
        +++G GAAG++ +  A        V++I+ G KP  K+ +SGGGRCN TN  Y +        P   K     F       D +   + HG+    +  G
Subjt:  VVVGGGAAGVYGAIRAKTLAPNLNVMVIEKG-KPLSKVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGPMDTMSWFSNHGVELKVENDG

Query:  RVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQKLANFVEHVEANYLLIAS--------GSSRQGFSLAAQLGHSLVDPVPSLF
        ++F   + +  IVD L+ E ++  V+ +    V S +  + G F L +  +      V    L+IA+        G+S  G+ +A Q G +++     L 
Subjt:  RVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQKLANFVEHVEANYLLIAS--------GSSRQGFSLAAQLGHSLVDPVPSLF

Query:  TFKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLE-------------DV
         F +  P L EL  ++   V + +  EN             +L TH GLSGP +L++S++     F S       ++  PD  LE              +
Subjt:  TFKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLE-------------DV

Query:  KTILSQH--------KSQFMEIHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSRLYFAGEVLNVDGVTGGF
        K  L+ H          Q  +I D  L   ++ +   ++ S L     +  G   ++   VT GGV  +E+S +TME++    LYF GEV++V G  GG+
Subjt:  KTILSQH--------KSQFMEIHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSRLYFAGEVLNVDGVTGGF

Query:  NFQNAWSGGYIAGSSI
        NFQ AWS  +     +
Subjt:  NFQNAWSGGYIAGSSI

P44941 Uncharacterized protein HI_09335.8e-2728.3Show/hide
Query:  VVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLS-KVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGPMDTMSWFSNHGVELKVENDG
        +++G GAAG++ A +   L    +V V + GK +  K+ +SGGG CN TN   T A     HY   +  F  S    +   D +S  +  G+    +  G
Subjt:  VVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLS-KVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGPMDTMSWFSNHGVELKVENDG

Query:  RVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSA---STSDGGKFILKIQKLANFVEHVEANYLLIAS--------GSSRQGFSLAAQLGHSLVDPVP
        ++F     +  IV+ L SE  + G  +     V+        +  +F+L++          +   L++A+        G++  G+ +A Q G  ++ P  
Subjt:  RVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSA---STSDGGKFILKIQKLANFVEHVEANYLLIAS--------GSSRQGFSLAAQLGHSLVDPVP

Query:  SL--FTFKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLS-AWGARDLFASDYKGLVIVDFAPDFHLED--------
        SL  FT++  D  L  LSG+S P     L  ++       Y Q   +L TH G+SGP +L++S  W   +         V +D  P+ ++E+        
Subjt:  SL--FTFKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLS-AWGARDLFASDYKGLVIVDFAPDFHLED--------

Query:  -----VKTILSQ-HKSQFME-------IHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSRLYFAGEVLNVD
             +KTIL +    + +E       + DE++ A++S   + ++   +    F   G   ++   VT GGV    IS KTMES   S LYF GEVL+V 
Subjt:  -----VKTILSQ-HKSQFME-------IHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSRLYFAGEVLNVD

Query:  GVTGGFNFQNAWSGGYIAGSSIGK
        G  GG+NFQ AWS  Y    SI +
Subjt:  GVTGGFNFQNAWSGGYIAGSSIGK

Q795R8 Uncharacterized protein YtfP5.6e-3830.84Show/hide
Query:  LVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLS-KVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGPMDTMSWFSNHGVELKVEND
        ++V+GGG +G+  AI A        V++I+KG  L  K+ ISGGGRCNVTN        + +H P G+  F  S F+     D + +F N G++LK E+ 
Subjt:  LVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLS-KVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGPMDTMSWFSNHGVELKVEND

Query:  GRVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQKLANFVEHVEANYLLIA--------SGSSRQGFSLAAQLGHSLVDPVPSL
        GR+FPV++ + S+VD L++  K+  V+++T + + S    DG    +    + N  E + +  ++IA        +GS+  G+  A   GH++ +  P+ 
Subjt:  GRVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQKLANFVEHVEANYLLIA--------SGSSRQGFSLAAQLGHSLVDPVPSL

Query:  FTFKIEDPHLAE--LSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLEDV----------
              +P + +  L G+S   V   +     ++  P  T    ML TH+GLSGP ILR S +  ++L     +  + +D  PD + E +          
Subjt:  FTFKIEDPHLAE--LSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLEDV----------

Query:  ---KTILSQHKSQFMEIHDEILWASMSNKSLASISSLLKQCI---------FKVLGKG--QFKDEFVTAGGVPLSEISLKTMESKIHSRLYFAGEVLNVD
           KTI +  K    E +   L          S S L K            F VL  G       FVT GGV + EI  K M SK    LYF GE+L++ 
Subjt:  ---KTILSQHKSQFMEIHDEILWASMSNKSLASISSLLKQCI---------FKVLGKG--QFKDEFVTAGGVPLSEISLKTMESKIHSRLYFAGEVLNVD

Query:  GVTGGFNFQNAWSGGYIAGSSIGKLANA
        G TGG+N  +A   G +AG + G+ A +
Subjt:  GVTGGFNFQNAWSGGYIAGSSIGKLANA

Arabidopsis top hitse value%identityAlignment
AT5G39940.1 FAD/NAD(P)-binding oxidoreductase family protein5.4e-15364.87Show/hide
Query:  QKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLSKVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGPMDTMSWFSNHGVE
        +K   ELLVVVGGGAAGVYGAIRAKTL+P+L V+VIEKG  LSKVKISGGGRCNVTNGH  D   LA HYPRGHKE +GSFF  HGP DTMSWFS HGV 
Subjt:  QKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLSKVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGPMDTMSWFSNHGVE

Query:  LKVENDGRVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQK-LANFVEHVEANYLLIASGSSRQGFSLAAQLGHSLVDPVPSLF
        LK E+DGRVFPVS+ S S+VDCL++EA   GV L+ GK V +AS    GKF++K+ K  A+  E +EA YLLIA+GSS++G SLA + GHS+VDPVPSLF
Subjt:  LKVENDGRVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQK-LANFVEHVEANYLLIASGSSRQGFSLAAQLGHSLVDPVPSLF

Query:  TFKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLEDVKTILSQHKSQFME
        TFKI DP L EL+G+SF KV+AKLKL+N    L    Q+GPMLVTHWGLSGPVILRLSAWGAR LF+S YKG +IVDF PD ++E  K++L +HK QF +
Subjt:  TFKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLEDVKTILSQHKSQFME

Query:  --------------------------IHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSRLYFAGEVLNVDG
                                     + LWAS+SN SL+SIS LLK C F+V GKGQ+KDEFVTAGGVPLSE+SLKTMESK+   L+FAGEVLNVDG
Subjt:  --------------------------IHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSRLYFAGEVLNVDG

Query:  VTGGFNFQNAWSGGYIAGSSIGKLANA
        VTGGFNFQNAWSGGYIAG++IG+LA++
Subjt:  VTGGFNFQNAWSGGYIAGSSIGKLANA

AT5G39940.2 FAD/NAD(P)-binding oxidoreductase family protein5.9e-14464.46Show/hide
Query:  QKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLSKVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGPMDTMSWFSNHGVE
        +K   ELLVVVGGGAAGVYGAIRAKTL+P+L V+VIEKG  LSKVKISGGGRCNVTNGH  D   LA HYPRGHKE +GSFF  HGP DTMSWFS HGV 
Subjt:  QKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLSKVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGPMDTMSWFSNHGVE

Query:  LKVENDGRVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQK-LANFVEHVEANYLLIASGSSRQGFSLAAQLGHSLVDPVPSLF
        LK E+DGRVFPVS+ S S+VDCL++EA   GV L+ GK V +AS    GKF++K+ K  A+  E +EA YLLIA+GSS++G SLA + GHS+VDPVPSLF
Subjt:  LKVENDGRVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQK-LANFVEHVEANYLLIASGSSRQGFSLAAQLGHSLVDPVPSLF

Query:  TFKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLEDVKTILSQHKSQFME
        TFKI DP L EL+G+SF KV+AKLKL+N    L    Q+GPMLVTHWGLSGPVILRLSAWGAR LF+S YKG +IVDF PD ++E  K++L +HK QF +
Subjt:  TFKIEDPHLAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLEDVKTILSQHKSQFME

Query:  --------------------------IHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSRLYFAGEVLNVDG
                                     + LWAS+SN SL+SIS LLK C F+V GKGQ+KDEFVTAGGVPLSE+SLKTMESK+   L+FAGEVLNVDG
Subjt:  --------------------------IHDEILWASMSNKSLASISSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSRLYFAGEVLNVDG

Query:  VTGGFNFQ
        VTGGFNFQ
Subjt:  VTGGFNFQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTTAGTGAAAGCTTTAACTTCCACTGTTGCAGTCCAAAAGTTAAACGAAGAACTGTTGGTGGTCGTGGGAGGTGGAGCAGCAGGTGTTTATGGCGCTATAAGAGC
TAAAACCCTCGCCCCCAATCTCAATGTCATGGTTATTGAGAAAGGAAAGCCTCTTTCGAAGGTTAAAATTTCTGGAGGGGGCCGATGCAATGTGACGAATGGACATTATA
CCGATGCAAATACTTTGGCAGAGCATTACCCTAGAGGCCATAAAGAATTTAGGGGCTCTTTCTTCAATATTCACGGTCCAATGGATACAATGTCCTGGTTTTCCAATCAC
GGAGTTGAACTAAAGGTTGAGAATGATGGAAGGGTTTTTCCTGTCAGCAACTGTTCTGCTTCGATAGTTGATTGTCTGATGTCTGAAGCAAAACGTACCGGAGTTTCCTT
GCAGACTGGAAAGGTTGTTACAAGTGCATCGACTAGTGACGGTGGGAAGTTCATCTTGAAGATTCAAAAGCTCGCCAATTTTGTTGAACATGTTGAAGCAAACTACTTGT
TAATTGCTAGTGGAAGTAGTCGGCAGGGCTTTAGTCTCGCTGCTCAGCTCGGACATTCACTTGTAGACCCAGTGCCTAGCCTATTTACTTTCAAGATTGAAGATCCCCAC
TTGGCAGAGTTGTCTGGGGTCTCATTCCCTAAGGTCAAAGCGAAGCTTAAGTTAGAAAACATGCAACGGCATCTTCCACAATATACACAGGTTGGGCCTATGCTTGTCAC
ACATTGGGGACTTAGTGGACCGGTAATTCTACGTTTATCCGCTTGGGGAGCCCGTGACCTATTTGCTTCAGATTATAAAGGCCTTGTCATTGTGGATTTTGCACCTGATT
TTCATTTAGAAGATGTCAAGACAATCCTTAGCCAACACAAATCTCAGTTTATGGAAATACATGATGAGATCCTGTGGGCTTCCATGTCAAACAAATCATTAGCTTCCATT
TCTTCTCTGTTGAAGCAGTGCATATTTAAAGTCTTGGGGAAGGGTCAATTTAAGGATGAATTTGTCACTGCTGGAGGAGTTCCGCTGTCGGAGATCTCTCTTAAAACAAT
GGAGAGCAAAATTCATTCTCGCCTATACTTTGCCGGGGAGGTGCTAAATGTGGATGGGGTAACTGGTGGTTTCAACTTTCAGAATGCTTGGTCCGGTGGTTACATTGCTG
GAAGTAGCATTGGTAAACTTGCAAATGCTATTTGGACAAGTGTAAATCGTTCGACTCGTTTCATCAAAATCTTCCTTTCCCTTCAGCAGCGCAAACGATTTTGCTCGAAG
CACCAAATGCGAGTTCATAGAAACCAATCTCGAGCTCTCAATTCGATCAAAGCCAACTTGAAAAGCAATATTTCGACTGATTATGACGCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATTTAGTGAAAGCTTTAACTTCCACTGTTGCAGTCCAAAAGTTAAACGAAGAACTGTTGGTGGTCGTGGGAGGTGGAGCAGCAGGTGTTTATGGCGCTATAAGAGC
TAAAACCCTCGCCCCCAATCTCAATGTCATGGTTATTGAGAAAGGAAAGCCTCTTTCGAAGGTTAAAATTTCTGGAGGGGGCCGATGCAATGTGACGAATGGACATTATA
CCGATGCAAATACTTTGGCAGAGCATTACCCTAGAGGCCATAAAGAATTTAGGGGCTCTTTCTTCAATATTCACGGTCCAATGGATACAATGTCCTGGTTTTCCAATCAC
GGAGTTGAACTAAAGGTTGAGAATGATGGAAGGGTTTTTCCTGTCAGCAACTGTTCTGCTTCGATAGTTGATTGTCTGATGTCTGAAGCAAAACGTACCGGAGTTTCCTT
GCAGACTGGAAAGGTTGTTACAAGTGCATCGACTAGTGACGGTGGGAAGTTCATCTTGAAGATTCAAAAGCTCGCCAATTTTGTTGAACATGTTGAAGCAAACTACTTGT
TAATTGCTAGTGGAAGTAGTCGGCAGGGCTTTAGTCTCGCTGCTCAGCTCGGACATTCACTTGTAGACCCAGTGCCTAGCCTATTTACTTTCAAGATTGAAGATCCCCAC
TTGGCAGAGTTGTCTGGGGTCTCATTCCCTAAGGTCAAAGCGAAGCTTAAGTTAGAAAACATGCAACGGCATCTTCCACAATATACACAGGTTGGGCCTATGCTTGTCAC
ACATTGGGGACTTAGTGGACCGGTAATTCTACGTTTATCCGCTTGGGGAGCCCGTGACCTATTTGCTTCAGATTATAAAGGCCTTGTCATTGTGGATTTTGCACCTGATT
TTCATTTAGAAGATGTCAAGACAATCCTTAGCCAACACAAATCTCAGTTTATGGAAATACATGATGAGATCCTGTGGGCTTCCATGTCAAACAAATCATTAGCTTCCATT
TCTTCTCTGTTGAAGCAGTGCATATTTAAAGTCTTGGGGAAGGGTCAATTTAAGGATGAATTTGTCACTGCTGGAGGAGTTCCGCTGTCGGAGATCTCTCTTAAAACAAT
GGAGAGCAAAATTCATTCTCGCCTATACTTTGCCGGGGAGGTGCTAAATGTGGATGGGGTAACTGGTGGTTTCAACTTTCAGAATGCTTGGTCCGGTGGTTACATTGCTG
GAAGTAGCATTGGTAAACTTGCAAATGCTATTTGGACAAGTGTAAATCGTTCGACTCGTTTCATCAAAATCTTCCTTTCCCTTCAGCAGCGCAAACGATTTTGCTCGAAG
CACCAAATGCGAGTTCATAGAAACCAATCTCGAGCTCTCAATTCGATCAAAGCCAACTTGAAAAGCAATATTTCGACTGATTATGACGCTTAA
Protein sequenceShow/hide protein sequence
MNLVKALTSTVAVQKLNEELLVVVGGGAAGVYGAIRAKTLAPNLNVMVIEKGKPLSKVKISGGGRCNVTNGHYTDANTLAEHYPRGHKEFRGSFFNIHGPMDTMSWFSNH
GVELKVENDGRVFPVSNCSASIVDCLMSEAKRTGVSLQTGKVVTSASTSDGGKFILKIQKLANFVEHVEANYLLIASGSSRQGFSLAAQLGHSLVDPVPSLFTFKIEDPH
LAELSGVSFPKVKAKLKLENMQRHLPQYTQVGPMLVTHWGLSGPVILRLSAWGARDLFASDYKGLVIVDFAPDFHLEDVKTILSQHKSQFMEIHDEILWASMSNKSLASI
SSLLKQCIFKVLGKGQFKDEFVTAGGVPLSEISLKTMESKIHSRLYFAGEVLNVDGVTGGFNFQNAWSGGYIAGSSIGKLANAIWTSVNRSTRFIKIFLSLQQRKRFCSK
HQMRVHRNQSRALNSIKANLKSNISTDYDA