; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0008024 (gene) of Snake gourd v1 genome

Gene IDTan0008024
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionAB hydrolase-1 domain-containing protein
Genome locationLG02:91610714..91619662
RNA-Seq ExpressionTan0008024
SyntenyTan0008024
Gene Ontology termsGO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR000073 - Alpha/beta hydrolase fold-1
IPR000639 - Epoxide hydrolase-like
IPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7029571.1 yugF, partial [Cucurbita argyrosperma subsp. argyrosperma]7.2e-17890.8Show/hide
Query:  MLSLTVGASWSSAPLHKKWNKPKRFGVTINEFPSFLPKEVHNIKDQFARKLATRIERLPVSFSDSCIMSSCVKPSIQSKESPVVLLHGFDSSCLEWRYTY
        MLSL V ASW SAP   +W KPKR GV +N+FPSFLPKEV NIKDQFAR+LATRIERLPVSFSDSC MSSCVKPSIQSKE PVVLLHGFDSSCLEWRYTY
Subjt:  MLSLTVGASWSSAPLHKKWNKPKRFGVTINEFPSFLPKEVHNIKDQFARKLATRIERLPVSFSDSCIMSSCVKPSIQSKESPVVLLHGFDSSCLEWRYTY

Query:  PLFEEAGLETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVYTEGTGNLATLPRLIAY
        PL EEAG ETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKS+IKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVY EGTGNLATLPRLIAY
Subjt:  PLFEEAGLETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVYTEGTGNLATLPRLIAY

Query:  AGVLLLKSIPLRLYVNVLAFTDISFDTSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQ
        AGV LLKSIPLRLYVNVLAFT IS  TSLDW NIGRLHCLLPWWEDATV+FMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNA IRQ
Subjt:  AGVLLLKSIPLRLYVNVLAFTDISFDTSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQ

Query:  IPECGHIPHVEKPNFVAKLIAQFVHEDSRKEVQYVNS
        IPECGH+PHVEKPNFVAKLI QFVHEDSRKEV YVN+
Subjt:  IPECGHIPHVEKPNFVAKLIAQFVHEDSRKEVQYVNS

XP_022962117.1 uncharacterized protein LOC111462669 isoform X1 [Cucurbita moschata]7.2e-17890.8Show/hide
Query:  MLSLTVGASWSSAPLHKKWNKPKRFGVTINEFPSFLPKEVHNIKDQFARKLATRIERLPVSFSDSCIMSSCVKPSIQSKESPVVLLHGFDSSCLEWRYTY
        MLSL V ASW SAP   +W KPKR GV +N+FPSFLPKEV NIKDQFAR+LATRIERLPVSFSDSC MSSCVKPSIQSKE PVVLLHGFDSSCLEWRYTY
Subjt:  MLSLTVGASWSSAPLHKKWNKPKRFGVTINEFPSFLPKEVHNIKDQFARKLATRIERLPVSFSDSCIMSSCVKPSIQSKESPVVLLHGFDSSCLEWRYTY

Query:  PLFEEAGLETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVYTEGTGNLATLPRLIAY
        PL EEAG ETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKS+IKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVY EGTGNLATLPRLIAY
Subjt:  PLFEEAGLETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVYTEGTGNLATLPRLIAY

Query:  AGVLLLKSIPLRLYVNVLAFTDISFDTSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQ
        AGV LLKSIPLRLYVNVLAFT IS  TSLDW NIGRLHCLLPWWEDATV+FMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNA IRQ
Subjt:  AGVLLLKSIPLRLYVNVLAFTDISFDTSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQ

Query:  IPECGHIPHVEKPNFVAKLIAQFVHEDSRKEVQYVNS
        IPECGH+PHVEKPNFVAKLI QFVHEDSRKEV YVN+
Subjt:  IPECGHIPHVEKPNFVAKLIAQFVHEDSRKEVQYVNS

XP_023546937.1 uncharacterized protein LOC111805886 isoform X2 [Cucurbita pepo subsp. pepo]1.8e-17690.18Show/hide
Query:  MLSLTVGASWSSAPLHKKWNKPKRFGVTINEFPSFLPKEVHNIKDQFARKLATRIERLPVSFSDSCIMSSCVKPSIQSKESPVVLLHGFDSSCLEWRYTY
        MLSL V ASW SAP   +W KPKR GV +N+FPSFLPKEV NIKDQFAR+LATR ERLPVSFSDSCIMSSCVKPSIQSKESPVVLLHGFDSSCLEWRYTY
Subjt:  MLSLTVGASWSSAPLHKKWNKPKRFGVTINEFPSFLPKEVHNIKDQFARKLATRIERLPVSFSDSCIMSSCVKPSIQSKESPVVLLHGFDSSCLEWRYTY

Query:  PLFEEAGLETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVYTEGTGNLATLPRLIAY
        PL EEAG ETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKS+IKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVY EGTGNLATLPRLIAY
Subjt:  PLFEEAGLETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVYTEGTGNLATLPRLIAY

Query:  AGVLLLKSIPLRLYVNVLAFTDISFDTSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQ
        AGV LLKSIPLRLYVNVLAFT IS  TSLDW NIGRLHCLLPWWEDATV+FMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNA IRQ
Subjt:  AGVLLLKSIPLRLYVNVLAFTDISFDTSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQ

Query:  IPECGHIPHVEKPNFVAKLIAQFVHEDSRKEVQYVN
        IPECGH+PHVEKPNFVAKLI QFVHEDSRK++ +V+
Subjt:  IPECGHIPHVEKPNFVAKLIAQFVHEDSRKEVQYVN

XP_023546938.1 uncharacterized protein LOC111805886 isoform X3 [Cucurbita pepo subsp. pepo]1.4e-17690.5Show/hide
Query:  MLSLTVGASWSSAPLHKKWNKPKRFGVTINEFPSFLPKEVHNIKDQFARKLATRIERLPVSFSDSCIMSSCVKPSIQSKESPVVLLHGFDSSCLEWRYTY
        MLSL V ASW SAP   +W KPKR GV +N+FPSFLPKEV NIKDQFAR+LATR ERLPVSFSDSCIMSSCVKPSIQSKESPVVLLHGFDSSCLEWRYTY
Subjt:  MLSLTVGASWSSAPLHKKWNKPKRFGVTINEFPSFLPKEVHNIKDQFARKLATRIERLPVSFSDSCIMSSCVKPSIQSKESPVVLLHGFDSSCLEWRYTY

Query:  PLFEEAGLETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVYTEGTGNLATLPRLIAY
        PL EEAG ETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKS+IKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVY EGTGNLATLPRLIAY
Subjt:  PLFEEAGLETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVYTEGTGNLATLPRLIAY

Query:  AGVLLLKSIPLRLYVNVLAFTDISFDTSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQ
        AGV LLKSIPLRLYVNVLAFT IS  TSLDW NIGRLHCLLPWWEDATV+FMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNA IRQ
Subjt:  AGVLLLKSIPLRLYVNVLAFTDISFDTSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQ

Query:  IPECGHIPHVEKPNFVAKLIAQFVHEDSRKEVQYVNS
        IPECGH+PHVEKPNFVAKLI QFVHEDSRK+V +V S
Subjt:  IPECGHIPHVEKPNFVAKLIAQFVHEDSRKEVQYVNS

XP_038884275.1 uncharacterized hydrolase YugF isoform X2 [Benincasa hispida]5.0e-17991.1Show/hide
Query:  MLSLTVGASWSSAPLHKKWNKPKRFGVTINEFPSFLPKEVHNIKDQFARKLATRIERLPVSFSDSCIMSSCVKPSIQSKESPVVLLHGFDSSCLEWRYTY
        MLSLT+ ASW SAPLH +W K K   VTINEFPSFLPKEVHNIKDQFAR LATRIERLPVSFSDSCIMSSCVKPSIQSKE PVVLLHGFDSSCLEWRYTY
Subjt:  MLSLTVGASWSSAPLHKKWNKPKRFGVTINEFPSFLPKEVHNIKDQFARKLATRIERLPVSFSDSCIMSSCVKPSIQSKESPVVLLHGFDSSCLEWRYTY

Query:  PLFEEAGLETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVYTEGTGNLATLPRLIAY
        PL EEAGLETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKS+IKRPMVLVGPSLGAAVAIDFAVNYP+AVDRLVLIDASVY EGTGNLATLPRLIAY
Subjt:  PLFEEAGLETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVYTEGTGNLATLPRLIAY

Query:  AGVLLLKSIPLRLYVNVLAFTDISFDTSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQ
        AGV LLKSIPLRLYVNVL F  ISF TSLDWANIGRLHCLLPWWEDATV+FMLSGGYNVSSQIEKVKQK+LIIWGEDDQIIS+KL VRLHCELPNA+IRQ
Subjt:  AGVLLLKSIPLRLYVNVLAFTDISFDTSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQ

Query:  IPECGHIPHVEKPNFVAKLIAQFVHEDSRKEVQYVNS
        IP CGH+PHVEKPN VAKLI QFVHEDSRKEVQYVNS
Subjt:  IPECGHIPHVEKPNFVAKLIAQFVHEDSRKEVQYVNS

TrEMBL top hitse value%identityAlignment
A0A6J1HBV7 uncharacterized protein LOC111462669 isoform X13.5e-17890.8Show/hide
Query:  MLSLTVGASWSSAPLHKKWNKPKRFGVTINEFPSFLPKEVHNIKDQFARKLATRIERLPVSFSDSCIMSSCVKPSIQSKESPVVLLHGFDSSCLEWRYTY
        MLSL V ASW SAP   +W KPKR GV +N+FPSFLPKEV NIKDQFAR+LATRIERLPVSFSDSC MSSCVKPSIQSKE PVVLLHGFDSSCLEWRYTY
Subjt:  MLSLTVGASWSSAPLHKKWNKPKRFGVTINEFPSFLPKEVHNIKDQFARKLATRIERLPVSFSDSCIMSSCVKPSIQSKESPVVLLHGFDSSCLEWRYTY

Query:  PLFEEAGLETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVYTEGTGNLATLPRLIAY
        PL EEAG ETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKS+IKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVY EGTGNLATLPRLIAY
Subjt:  PLFEEAGLETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVYTEGTGNLATLPRLIAY

Query:  AGVLLLKSIPLRLYVNVLAFTDISFDTSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQ
        AGV LLKSIPLRLYVNVLAFT IS  TSLDW NIGRLHCLLPWWEDATV+FMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNA IRQ
Subjt:  AGVLLLKSIPLRLYVNVLAFTDISFDTSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQ

Query:  IPECGHIPHVEKPNFVAKLIAQFVHEDSRKEVQYVNS
        IPECGH+PHVEKPNFVAKLI QFVHEDSRKEV YVN+
Subjt:  IPECGHIPHVEKPNFVAKLIAQFVHEDSRKEVQYVNS

A0A6J1HC70 uncharacterized protein LOC111462669 isoform X33.3e-17691.27Show/hide
Query:  MLSLTVGASWSSAPLHKKWNKPKRFGVTINEFPSFLPKEVHNIKDQFARKLATRIERLPVSFSDSCIMSSCVKPSIQSKESPVVLLHGFDSSCLEWRYTY
        MLSL V ASW SAP   +W KPKR GV +N+FPSFLPKEV NIKDQFAR+LATRIERLPVSFSDSC MSSCVKPSIQSKE PVVLLHGFDSSCLEWRYTY
Subjt:  MLSLTVGASWSSAPLHKKWNKPKRFGVTINEFPSFLPKEVHNIKDQFARKLATRIERLPVSFSDSCIMSSCVKPSIQSKESPVVLLHGFDSSCLEWRYTY

Query:  PLFEEAGLETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVYTEGTGNLATLPRLIAY
        PL EEAG ETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKS+IKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVY EGTGNLATLPRLIAY
Subjt:  PLFEEAGLETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVYTEGTGNLATLPRLIAY

Query:  AGVLLLKSIPLRLYVNVLAFTDISFDTSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQ
        AGV LLKSIPLRLYVNVLAFT IS  TSLDW NIGRLHCLLPWWEDATV+FMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNA IRQ
Subjt:  AGVLLLKSIPLRLYVNVLAFTDISFDTSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQ

Query:  IPECGHIPHVEKPNFVAKLIAQFVHEDSRKEV
        IPECGH+PHVEKPNFVAKLI QFVHEDSRKEV
Subjt:  IPECGHIPHVEKPNFVAKLIAQFVHEDSRKEV

A0A6J1HE71 uncharacterized protein LOC111462669 isoform X27.3e-17690.96Show/hide
Query:  MLSLTVGASWSSAPLHKKWNKPKRFGVTINEFPSFLPKEVHNIKDQFARKLATRIERLPVSFSDSCIMSSCVKPSIQSKESPVVLLHGFDSSCLEWRYTY
        MLSL V ASW SAP   +W KPKR GV +N+FPSFLPKEV NIKDQFAR+LATRIERLPVSFSDSC MSSCVKPSIQSKE PVVLLHGFDSSCLEWRYTY
Subjt:  MLSLTVGASWSSAPLHKKWNKPKRFGVTINEFPSFLPKEVHNIKDQFARKLATRIERLPVSFSDSCIMSSCVKPSIQSKESPVVLLHGFDSSCLEWRYTY

Query:  PLFEEAGLETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVYTEGTGNLATLPRLIAY
        PL EEAG ETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKS+IKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVY EGTGNLATLPRLIAY
Subjt:  PLFEEAGLETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVYTEGTGNLATLPRLIAY

Query:  AGVLLLKSIPLRLYVNVLAFTDISFDTSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQ
        AGV LLKSIPLRLYVNVLAFT IS  TSLDW NIGRLHCLLPWWEDATV+FMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNA IRQ
Subjt:  AGVLLLKSIPLRLYVNVLAFTDISFDTSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQ

Query:  IPECGHIPHVEKPNFVAKLIAQFVHEDSRKEV
        IPECGH+PHVEKPNFVAKLI QFVHEDSRKE+
Subjt:  IPECGHIPHVEKPNFVAKLIAQFVHEDSRKEV

A0A6J1KAS9 uncharacterized protein LOC111492171 isoform X28.6e-17791.57Show/hide
Query:  MLSLTVGASWSSAPLHKKWNKPKRFGVTINEFPSFLPKEVHNIKDQFARKLATRIERLPVSFSDSCIMSSCVKPSIQSKESPVVLLHGFDSSCLEWRYTY
        MLSL V ASW SAP   +W KPKR GV IN+FPSFLPKEV NIKDQFAR+LATRIERLPVSFSDSCIMSSCVKPSIQSKE PVVLLHGFDSSCLEWRYTY
Subjt:  MLSLTVGASWSSAPLHKKWNKPKRFGVTINEFPSFLPKEVHNIKDQFARKLATRIERLPVSFSDSCIMSSCVKPSIQSKESPVVLLHGFDSSCLEWRYTY

Query:  PLFEEAGLETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVYTEGTGNLATLPRLIAY
        PL EEAG ETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKS+IKRPMV+VGPSLGAAVAIDFAVNYPEAVDRLVLIDASVY EGTGNLATLPRLIAY
Subjt:  PLFEEAGLETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVYTEGTGNLATLPRLIAY

Query:  AGVLLLKSIPLRLYVNVLAFTDISFDTSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQ
        AGV LLKSIPLRLYVNVLAFT IS  TSLDW NIGRLHCLLPWWEDATV+FMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNA IRQ
Subjt:  AGVLLLKSIPLRLYVNVLAFTDISFDTSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQ

Query:  IPECGHIPHVEKPNFVAKLIAQFVHEDSRKEV
        IPECGH+PHVEKPNFVAKLI QFVHEDSRKEV
Subjt:  IPECGHIPHVEKPNFVAKLIAQFVHEDSRKEV

A0A6J1KD64 uncharacterized protein LOC111492171 isoform X11.9e-17691.27Show/hide
Query:  MLSLTVGASWSSAPLHKKWNKPKRFGVTINEFPSFLPKEVHNIKDQFARKLATRIERLPVSFSDSCIMSSCVKPSIQSKESPVVLLHGFDSSCLEWRYTY
        MLSL V ASW SAP   +W KPKR GV IN+FPSFLPKEV NIKDQFAR+LATRIERLPVSFSDSCIMSSCVKPSIQSKE PVVLLHGFDSSCLEWRYTY
Subjt:  MLSLTVGASWSSAPLHKKWNKPKRFGVTINEFPSFLPKEVHNIKDQFARKLATRIERLPVSFSDSCIMSSCVKPSIQSKESPVVLLHGFDSSCLEWRYTY

Query:  PLFEEAGLETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVYTEGTGNLATLPRLIAY
        PL EEAG ETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKS+IKRPMV+VGPSLGAAVAIDFAVNYPEAVDRLVLIDASVY EGTGNLATLPRLIAY
Subjt:  PLFEEAGLETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVYTEGTGNLATLPRLIAY

Query:  AGVLLLKSIPLRLYVNVLAFTDISFDTSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQ
        AGV LLKSIPLRLYVNVLAFT IS  TSLDW NIGRLHCLLPWWEDATV+FMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNA IRQ
Subjt:  AGVLLLKSIPLRLYVNVLAFTDISFDTSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQ

Query:  IPECGHIPHVEKPNFVAKLIAQFVHEDSRKEV
        IPECGH+PHVEKPNFVAKLI QFVHEDSRKE+
Subjt:  IPECGHIPHVEKPNFVAKLIAQFVHEDSRKEV

SwissProt top hitse value%identityAlignment
A4JPX5 2-hydroxy-6-oxononadienedioate/2-hydroxy-6-oxononatrienedioate hydrolase7.6e-1327.52Show/hide
Query:  VVLLHGFDSSCLEW---RYTYPLFEEAGLETWAVDILGWGFSD-LERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVL
        +VLLHG       W         F  AG     VD  GWG SD +         + RV    L    I R   LVG S+G   A+ FA++YPE V +LVL
Subjt:  VVLLHGFDSSCLEW---RYTYPLFEEAGLETWAVDILGWGFSD-LERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVL

Query:  IDASVYTEGTGNLATLPRLIAYAGVLLLKSI-------PLRLYVNVLAF-----TDISFDTSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSSQIEKV
        +     T G      +P      G+ LL+++        L+  +NV  +     T+    T L+   +GR   L  + +  T N      Y    ++ ++
Subjt:  IDASVYTEGTGNLATLPRLIAYAGVLLLKSI-------PLRLYVNVLAF-----TDISFDTSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSSQIEKV

Query:  KQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQIPECGHIPHVEKPNFVAKLIAQFV
        K   L+IWG DD+ +   + +RL   +PNA +     CGH    E      +++ +F+
Subjt:  KQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQIPECGHIPHVEKPNFVAKLIAQFV

B2JQW2 2-hydroxy-6-oxononadienedioate/2-hydroxy-6-oxononatrienedioate hydrolase1.3e-1228.68Show/hide
Query:  VVLLHGFDSSCLEWRYTY---PLFEEAGLETWAVDILGWGFSD-LERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVL
        +VLLHG       W   Y     F  AG     VD  GWG SD +         + RV    L    I+R   LVG S+G   A+ FA++YPE V +LVL
Subjt:  VVLLHGFDSSCLEWRYTY---PLFEEAGLETWAVDILGWGFSD-LERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVL

Query:  IDASVYTEGTGNLATLPRLIAYAGVLLLKSI-------PLRLYVNVLAF-----TDISFDTSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSSQIEKV
        +     T G      +P      G+ LL+ +        L+  +NV  +     T+    T LD     R H L  + +  T N      Y    ++ ++
Subjt:  IDASVYTEGTGNLATLPRLIAYAGVLLLKSI-------PLRLYVNVLAF-----TDISFDTSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSSQIEKV

Query:  KQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQIPECGHIPHVEKPNFVAKLIAQFV
        K   L+IWG DD+ +   + +RL   LPNA       CGH    E      +++  F+
Subjt:  KQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQIPECGHIPHVEKPNFVAKLIAQFV

O05235 Uncharacterized hydrolase YugF8.2e-1524.7Show/hide
Query:  VVLLHGFDSSCLEWRYTYPLFEEAGLETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDAS
        +V +HGF SS   +R   PL  +   +  A+D+  +G S+  R       +    +  + +    +  VLVG S+G  +++  A+  PE   ++VL+ +S
Subjt:  VVLLHGFDSSCLEWRYTYPLFEEAGLETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDAS

Query:  VYTEGTGNL----ATLPRLIAYAGVLLLKSIPLRLYVNVLAFTDISFDTSLDWANIGRLHCLLPWWEDATVNFML-----SGGYNVSSQIEKVKQKTLII
         Y + +         +P    Y    L K   ++  +NV+    +  +  +D    GR     P+ ++     M        G     Q++K+ +  L+I
Subjt:  VYTEGTGNL----ATLPRLIAYAGVLLLKSIPLRLYVNVLAFTDISFDTSLDWANIGRLHCLLPWWEDATVNFML-----SGGYNVSSQIEKVKQKTLII

Query:  WGEDDQIISNKLAVRLHCELPNAVIRQIPECGHIPHVEKPNFVAKLIAQFV
        WGE+D+I+  ++  RLH +LPN+V+  + + GH+   E+P  +++ IA F+
Subjt:  WGEDDQIISNKLAVRLHCELPNAVIRQIPECGHIPHVEKPNFVAKLIAQFV

P17548 2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase3.8e-1226.22Show/hide
Query:  VVLLHGFDSSCLEWRYTY---PLFEEAGLETWAVDILGWGFSDL------ERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAV
        V++LHG       W   Y     F EAG      D  G+  SD         L         + +  + K+H      LVG S+G A A++FA+ YPE  
Subjt:  VVLLHGFDSSCLEWRYTY---PLFEEAGLETWAVDILGWGFSDL------ERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAV

Query:  DRLVLIDASVYTEGTGNLATLPRLIAYAGVLLLKSI-------PLRLYVNVLAFTDISFDTSL---DWANIGRLHCLLPWWEDATVNFMLS------GGY
         +L+L+       G GN  +L   +   G+ LL  +        L+  +NV  F        L    WANI R         +   NF+LS        +
Subjt:  DRLVLIDASVYTEGTGNLATLPRLIAYAGVLLLKSI-------PLRLYVNVLAFTDISFDTSL---DWANIGRLHCLLPWWEDATVNFMLS------GGY

Query:  NVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQIPECGHIPHVEKPNFVAKLIAQFV
        +VS+++ ++K KTL+ WG DD+ +     ++L   + +A +   P C H    E  +   +L   F+
Subjt:  NVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQIPECGHIPHVEKPNFVAKLIAQFV

Q8KZP5 2-hydroxy-6-oxononadienedioate/2-hydroxy-6-oxononatrienedioate hydrolase1.5e-1326.97Show/hide
Query:  VVLLHGFDSSCLEWRYTY---PLFEEAGLETWAVDILGWGFSDL------ERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAV
        V++LHG       W   Y     F +AG      D  G+  SD         L         + +  + K+H      LVG S+G A A++FA+ YPE  
Subjt:  VVLLHGFDSSCLEWRYTY---PLFEEAGLETWAVDILGWGFSDL------ERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAV

Query:  DRLVLIDASVYTEGTGNLATLPRLIAYAGVLLLKSI-------PLRLYVNVLAFTDISFDTSL---DWANIGRLHCLLPWWEDATVNFMLS------GGY
         +L+L+       G GN  +L   +   G+ LL  +        L+  +NV  F        L    WANI R         +   NF+LS        +
Subjt:  DRLVLIDASVYTEGTGNLATLPRLIAYAGVLLLKSI-------PLRLYVNVLAFTDISFDTSL---DWANIGRLHCLLPWWEDATVNFMLS------GGY

Query:  NVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQIPECGHIPHVEKPNFVAKLIAQFV
        NVS ++ ++K KTL+ WG DD+ +     ++L   +P+A +   P CGH    E  +   +L   F+
Subjt:  NVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQIPECGHIPHVEKPNFVAKLIAQFV

Arabidopsis top hitse value%identityAlignment
AT1G13820.1 alpha/beta-Hydrolases superfamily protein3.6e-13570.88Show/hide
Query:  MLSLTVGASWSSAPLHKKWNKPKR-FGVTINEFPSFLPKEVHNIKDQFARKLATRIERLP--VSFSDSCIMSSCVKPSIQSKESPVVLLHGFDSSCLEWR
        MLS TV A+  S P+ K     KR F VT  +FP+FLP +V  IKD FA KLA RIERLP  VSF +  IMSSCV P ++++ +PVVLLHGFDSSCLEWR
Subjt:  MLSLTVGASWSSAPLHKKWNKPKR-FGVTINEFPSFLPKEVHNIKDQFARKLATRIERLP--VSFSDSCIMSSCVKPSIQSKESPVVLLHGFDSSCLEWR

Query:  YTYPLFEEAGLETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVYTEGTGNLATLPRL
        YTYPL EEAGLETWA DILGWGFSDL++LPPCDV SKR H Y+ WKSHIKRP+VLVGPSLGAAVAID AVN+PEAV+ LVL+DASVY EGTGNLATLP+ 
Subjt:  YTYPLFEEAGLETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVYTEGTGNLATLPRL

Query:  IAYAGVLLLKSIPLRLYVNVLAFTDISFDTSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAV
         AYAGV LLKSIPLRLYVN + F  IS +TS DW  IGRLHCL PWWEDATV+FM SGGYNV+S I+KV QKTLI+WGEDDQIISNKLA RLH EL NA 
Subjt:  IAYAGVLLLKSIPLRLYVNVLAFTDISFDTSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAV

Query:  IRQIPECGHIPHVEKPNFVAKLIAQFVHEDSR-KEVQYVN
        ++QI  CGH+PHVEKP  V KLIA+FV E  R KEV+ ++
Subjt:  IRQIPECGHIPHVEKPNFVAKLIAQFVHEDSR-KEVQYVN

AT4G36530.1 alpha/beta-Hydrolases superfamily protein1.4e-1425Show/hide
Query:  IQSKESPVVLLHGFDSSCLEWRYTYPLFEEAGLETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDR
        +Q + SP+VL+HGF +S   WRY  P   +   + +A+D+LG+G+SD + L   D       +    K  +K P V+VG SLG   A+  AV  PE V  
Subjt:  IQSKESPVVLLHGFDSSCLEWRYTYPLFEEAGLETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDR

Query:  LVLI----------------DASVYTEGTGNLATLPRLIAYAGVLL------------LKSIPLRLYV---NVLAFTDISFDTSLDWANIGRLHCLLPWW
        + L+                D +V T+        P    +  V+L            ++S+   +Y+   NV  +   S        N G ++  L   
Subjt:  LVLI----------------DASVYTEGTGNLATLPRLIAYAGVLL------------LKSIPLRLYV---NVLAFTDISFDTSLDWANIGRLHCLLPWW

Query:  EDATVNFMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQIPECGHIPHVEKPNFVAKLIAQFV
           T        Y + S + K+    L++WG+ D  +    A ++     N+ +  + + GH PH E P  V K +  ++
Subjt:  EDATVNFMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQIPECGHIPHVEKPNFVAKLIAQFV

AT4G36530.2 alpha/beta-Hydrolases superfamily protein1.4e-1425Show/hide
Query:  IQSKESPVVLLHGFDSSCLEWRYTYPLFEEAGLETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDR
        +Q + SP+VL+HGF +S   WRY  P   +   + +A+D+LG+G+SD + L   D       +    K  +K P V+VG SLG   A+  AV  PE V  
Subjt:  IQSKESPVVLLHGFDSSCLEWRYTYPLFEEAGLETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDR

Query:  LVLI----------------DASVYTEGTGNLATLPRLIAYAGVLL------------LKSIPLRLYV---NVLAFTDISFDTSLDWANIGRLHCLLPWW
        + L+                D +V T+        P    +  V+L            ++S+   +Y+   NV  +   S        N G ++  L   
Subjt:  LVLI----------------DASVYTEGTGNLATLPRLIAYAGVLL------------LKSIPLRLYV---NVLAFTDISFDTSLDWANIGRLHCLLPWW

Query:  EDATVNFMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQIPECGHIPHVEKPNFVAKLIAQFV
           T        Y + S + K+    L++WG+ D  +    A ++     N+ +  + + GH PH E P  V K +  ++
Subjt:  EDATVNFMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQIPECGHIPHVEKPNFVAKLIAQFV

AT5G09430.1 alpha/beta-Hydrolases superfamily protein7.8e-1327.27Show/hide
Query:  CVKPSIQSKESP-VVLLHGFDSSCLEWRYTYPLFEEAG-LETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVN
        C  P   ++  P ++LLHGF ++ + W+Y   L    G    +  D+L +G S        +    R  L +L ++H  + M +VG S G  V    A  
Subjt:  CVKPSIQSKESP-VVLLHGFDSSCLEWRYTYPLFEEAG-LETWAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVN

Query:  YPEAVDRLVLIDASVYTEG---TGNLATLPRLIAYAGVLLLKSIPLRLYVNVLAFTDI----SFDTSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSS
        +PE V++LVL  A V  E       L  +P L    G+L+ ++ P +L   ++ F+ +       +   W  I  +       +   +  +L       S
Subjt:  YPEAVDRLVLIDASVYTEG---TGNLATLPRLIAYAGVLLLKSIPLRLYVNVLAFTDI----SFDTSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSS

Query:  QIEKVKQKTLIIWGEDDQIISNKLAVRLHCEL-PNAVIRQIPECGHIPHVEKPNFVAKLIAQFV
         + ++KQK+LIIWGE+DQI   +L  RL   +  +A I  I + GH  ++EK     K +  F+
Subjt:  QIEKVKQKTLIIWGEDDQIISNKLAVRLHCEL-PNAVIRQIPECGHIPHVEKPNFVAKLIAQFV

AT5G39220.1 alpha/beta-Hydrolases superfamily protein2.4e-11563.97Show/hide
Query:  FPSFLPKEVHNIKDQFARKLATRIERLPVSFS----DSCIMSSCVKPSIQ-SKESPVVLLHGFDSSCLEWRYTYPLFEEAGLETWAVDILGWGFSDLERL
        FP+FLPKE+ NIKD FAR LA RI R+PV          +MSSC+KP +Q   +SPVVLLH FDSSCLEWR TYPL E+A LETWA+D+LGWGFSDLE+L
Subjt:  FPSFLPKEVHNIKDQFARKLATRIERLPVSFS----DSCIMSSCVKPSIQ-SKESPVVLLHGFDSSCLEWRYTYPLFEEAGLETWAVDILGWGFSDLERL

Query:  PPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVYTEGTGNLATLPRLIAYAGVLLLKSIPLRLYVNVLAFTDISFD
        PPCD  SKR HL++LWK++IKRPM+LVGPSLGA VA+DF   YPEAVD+LVLI+A+ Y+EGTG L  LP+ IAYAGV LLKS PLRL  NVLAF      
Subjt:  PPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVYTEGTGNLATLPRLIAYAGVLLLKSIPLRLYVNVLAFTDISFD

Query:  TSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQIPECGHIPHVEKPNFVAKLIAQF
         ++DW NIGRLHC +PWWEDA V+FM+SGGYNV+S I+ +  KTL++  E+DQI+SN+L+V+L CEL NAV+R++P+ GH+PHVE P  + KLI+ F
Subjt:  TSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQIPECGHIPHVEKPNFVAKLIAQF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCTCTCTAACAGTGGGTGCGAGCTGGAGCTCGGCTCCACTGCACAAGAAATGGAACAAGCCCAAGCGTTTTGGAGTCACCATTAACGAGTTTCCTTCATTCCTTCC
CAAAGAAGTCCACAATATCAAAGACCAATTTGCTCGAAAGCTCGCTACAAGGATTGAGAGGCTACCCGTCAGCTTCTCTGATTCTTGTATTATGAGTAGTTGCGTGAAGC
CGTCAATACAGAGCAAAGAAAGTCCAGTGGTTCTTCTCCATGGCTTTGACAGCTCCTGTTTAGAATGGAGGTACACATATCCATTGTTTGAAGAAGCTGGTTTAGAAACC
TGGGCTGTTGACATTCTTGGCTGGGGCTTCTCTGATTTAGAAAGACTCCCACCATGTGATGTGACTTCTAAGCGTGTTCATTTGTATCAGCTTTGGAAATCTCATATTAA
GAGACCAATGGTATTGGTTGGACCAAGCCTTGGTGCTGCTGTTGCTATTGATTTTGCTGTCAATTATCCAGAAGCTGTCGACAGGCTTGTATTGATTGATGCAAGTGTAT
ATACAGAAGGTACAGGAAACTTGGCTACCCTACCAAGGTTGATTGCCTATGCTGGGGTACTTTTATTGAAGAGCATTCCATTGCGCTTATATGTCAACGTTTTGGCCTTC
ACTGACATATCATTTGATACCAGCCTGGATTGGGCCAATATCGGCCGCCTGCATTGTTTATTACCTTGGTGGGAGGATGCAACTGTTAATTTTATGTTGAGTGGAGGATA
TAATGTCAGTAGCCAGATTGAAAAGGTGAAGCAGAAAACGCTTATTATATGGGGTGAGGATGATCAAATCATCAGCAACAAGCTTGCAGTGAGGTTGCACTGTGAACTGC
CAAATGCAGTTATTCGTCAAATACCCGAGTGCGGCCACATTCCTCATGTCGAGAAGCCAAATTTTGTTGCCAAATTGATTGCACAATTTGTTCACGAAGATTCCAGAAAA
GAGGTTCAGTATGTTAATAGTGGATAA
mRNA sequenceShow/hide mRNA sequence
CAGCCAACGGGCGAACGCATTTGCCCGCGCTTCCACTAGCGAAGAAGATGCTCTCTCTAACAGTGGGTGCGAGCTGGAGCTCGGCTCCACTGCACAAGAAATGGAACAAG
CCCAAGCGTTTTGGAGTCACCATTAACGAGTTTCCTTCATTCCTTCCCAAAGAAGTCCACAATATCAAAGACCAATTTGCTCGAAAGCTCGCTACAAGGATTGAGAGGCT
ACCCGTCAGCTTCTCTGATTCTTGTATTATGAGTAGTTGCGTGAAGCCGTCAATACAGAGCAAAGAAAGTCCAGTGGTTCTTCTCCATGGCTTTGACAGCTCCTGTTTAG
AATGGAGGTACACATATCCATTGTTTGAAGAAGCTGGTTTAGAAACCTGGGCTGTTGACATTCTTGGCTGGGGCTTCTCTGATTTAGAAAGACTCCCACCATGTGATGTG
ACTTCTAAGCGTGTTCATTTGTATCAGCTTTGGAAATCTCATATTAAGAGACCAATGGTATTGGTTGGACCAAGCCTTGGTGCTGCTGTTGCTATTGATTTTGCTGTCAA
TTATCCAGAAGCTGTCGACAGGCTTGTATTGATTGATGCAAGTGTATATACAGAAGGTACAGGAAACTTGGCTACCCTACCAAGGTTGATTGCCTATGCTGGGGTACTTT
TATTGAAGAGCATTCCATTGCGCTTATATGTCAACGTTTTGGCCTTCACTGACATATCATTTGATACCAGCCTGGATTGGGCCAATATCGGCCGCCTGCATTGTTTATTA
CCTTGGTGGGAGGATGCAACTGTTAATTTTATGTTGAGTGGAGGATATAATGTCAGTAGCCAGATTGAAAAGGTGAAGCAGAAAACGCTTATTATATGGGGTGAGGATGA
TCAAATCATCAGCAACAAGCTTGCAGTGAGGTTGCACTGTGAACTGCCAAATGCAGTTATTCGTCAAATACCCGAGTGCGGCCACATTCCTCATGTCGAGAAGCCAAATT
TTGTTGCCAAATTGATTGCACAATTTGTTCACGAAGATTCCAGAAAAGAGGTTCAGTATGTTAATAGTGGATAATTTTTCTTTGCACTTGCTTACATTTTTTTAGTCTTA
TTAACGAATTTGCTTCATTCTGCTTATTTATGGGTTTCCAATCCTGGCGTGTGTTGTGCAATGAGGGCTGGAAGGTGCTCTACATGCTTGAAATCCAGTAGAATGTAACA
TCAACAAAATAAACCAAAAACTCCATAATATCTGTAGTGTACCTGATTGTTGCTTTCTTTGCCTGGTGATGGTAGAAATAATTCCCTCCCTCAAACATTTAGGACTAGTC
AACCACTAGTGCTCAAGGGGCAAGTTAAAAAAGTGGAGGGACTTCAGGGATATGGTTTAAAATTCAGATCTAAATGTTGTAAGATTAAACTAGAATCTTTTTTTTTTTTC
CTTTTTTACTTCAACACCATGTGGGACGAGAAGGTATGGTCAAAAGTATGTCTCTTAACCCGTGAACTATGTTCATGTTGACATGTACCTCATAGATTGGTATGCATATT
ATATCATGCGTATTTCTCCCTTTAACTCCGAATATCTTTGTTCCTCTTCTCGAAATGGTAGGATCTTTCGGTGACAAGGTAGATACATTCTGTTTCTCCTTCTGATTAGT
TTGCATGTGAGATAGCTCAATGGTAATTGATATGTACTTCTTTCCTTAAGGTTGAAGGTTCAAATTCCCACAGTATATGTACTCCTTTTCTTGAGATCGAAGGTTCAAAT
TCTCCTACTCTATAACAACTATCTATATCTATAGTTGTTGTAAGTTAGTTTGGATAGAGAGACTTGTGCTATAAAAAAAATAAAATAGACATAGTCTAAGGG
Protein sequenceShow/hide protein sequence
MLSLTVGASWSSAPLHKKWNKPKRFGVTINEFPSFLPKEVHNIKDQFARKLATRIERLPVSFSDSCIMSSCVKPSIQSKESPVVLLHGFDSSCLEWRYTYPLFEEAGLET
WAVDILGWGFSDLERLPPCDVTSKRVHLYQLWKSHIKRPMVLVGPSLGAAVAIDFAVNYPEAVDRLVLIDASVYTEGTGNLATLPRLIAYAGVLLLKSIPLRLYVNVLAF
TDISFDTSLDWANIGRLHCLLPWWEDATVNFMLSGGYNVSSQIEKVKQKTLIIWGEDDQIISNKLAVRLHCELPNAVIRQIPECGHIPHVEKPNFVAKLIAQFVHEDSRK
EVQYVNSG