; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr015867 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr015867
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionBeta_elim_lyase domain-containing protein
Genome locationtig00006144:540198..551258
RNA-Seq ExpressionSgr015867
SyntenySgr015867
Gene Ontology termsGO:0006545 - glycine biosynthetic process (biological process)
GO:0006567 - threonine catabolic process (biological process)
GO:0005829 - cytosol (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0008732 - L-allo-threonine aldolase activity (molecular function)
InterPro domainsIPR001597 - Aromatic amino acid beta-eliminating lyase/threonine aldolase
IPR006936 - ALOG domain
IPR015421 - Pyridoxal phosphate-dependent transferase, major domain
IPR015424 - Pyridoxal phosphate-dependent transferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF2306995.1 hypothetical protein GH714_023151 [Hevea brasiliensis]2.0e-19568.57Show/hide
Query:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI
        MVTR +DLRSDTVTKPTEAMR AMANAEVDDDVL  DP+A R ETEMAKI+GKEAAL+VPSGTMGNLISVLVHC+IRGSEVILG NSHIHIYENGGI+T+
Subjt:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI

Query:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS
        GGVHPRT++NN+DGTMD+DLIEAAIRDPRG LV+PTTRLICLEN+ ANCGGRCLSV YTDRV ELAKKHGLKLHIDGARIFNA+VALGVPV+RLVQAADS
Subjt:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS

Query:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE-----------------------------
        V VC SKGLGAPVGSVI GS++FI KA+ LRKTLGGGMRQVG+LCAA+LVAIQENV KLE DH+KA+ LAE                             
Subjt:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE-----------------------------

Query:  ---------------------------------SSSPTTY--------ETTSTTNSILTVTTTTVAAAAPAAAAAACS-------SPSSSSTT-------
                                         S+S   Y        +  S    IL++++  +  A+ +A     S       +PS +STT       
Subjt:  ---------------------------------SSSPTTY--------ETTSTTNSILTVTTTTVAAAAPAAAAAACS-------SPSSSSTT-------

Query:  -----TPSRYENQKRRDWNTFCQYLRNHRPPLALPMCSGAHVLEFLRYLDQFGKTKVHNQSCPFFGLPNPPAPCPCPLRQAWGSLDALIGRLRAAYEENG
             TPSRYENQKRRDWNTFCQYLRNHRPPL+LPMCSGAHVLEFL YLDQFGKTKVH Q+CPFFGLPNPPAPCPCPLRQAWGSLDALIGRLRAAYEE+G
Subjt:  -----TPSRYENQKRRDWNTFCQYLRNHRPPLALPMCSGAHVLEFLRYLDQFGKTKVHNQSCPFFGLPNPPAPCPCPLRQAWGSLDALIGRLRAAYEENG

Query:  GRPEANPFGARAVRLYLREVRDFQAKARGVSYEKKRKRPKQKLT
        GRPE NPFGAR VR+YLREVRDFQAKARGVSYEKKRKRPK K T
Subjt:  GRPEANPFGARAVRLYLREVRDFQAKARGVSYEKKRKRPKQKLT

KAG7021181.1 putative low-specificity L-threonine aldolase 1 [Cucurbita argyrosperma subsp. argyrosperma]3.3e-13791.88Show/hide
Query:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI
        MV+R +DLRSDTVTKPTEAMR AMA+AEVDDDVLKNDPTALR ETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI
Subjt:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI

Query:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS
        GGVHPRT++NN DGTM+LDLIEAAIRDPRGELVFPTTRLICLEN+HANCGGRCL+ +YTD VGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS
Subjt:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS

Query:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE
        VSVCLSKGLGAPVGSVIVGS+ FITKA+RLRKTLGGGMRQVGVLCAAALVAIQEN+VKLEGDHE A+ILA+
Subjt:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE

XP_004139254.1 probable low-specificity L-threonine aldolase 1 [Cucumis sativus]5.6e-13791.88Show/hide
Query:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI
        MV R +DLRSDTVTKPTEAMR AMA+AEVDDDVLK DPTALR ETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI
Subjt:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI

Query:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS
        GGVHPRT++NN DGTMDLDLIEAAIRDPRGELVFPTTRLICLEN+HANCGGRCLSV+YTDRVG+LAKKHGLKLHIDGARIFNASVALGVPV+RLVQAADS
Subjt:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS

Query:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE
        VSVCLSKGLGAPVGSVIVGSR FI KA+RLRKTLGGGMRQVG+LCAAALVAIQEN+VKLEGDHE A+ILA+
Subjt:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE

XP_008456444.1 PREDICTED: probable low-specificity L-threonine aldolase 1 [Cucumis melo]5.6e-13791.51Show/hide
Query:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI
        MVTR +DLRSDTVTKPTEAMR AMA+AEVDDDVLK DPTALR ETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI
Subjt:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI

Query:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS
        GGVHPRT++NN DGTMDLDLIEAAIRDPRGE+VFPTTRLICLEN+HANCGGRCLSV+YTDRVG+LAKKHGLKLHIDGARIFNASVALGVPV+RLVQAADS
Subjt:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS

Query:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE
        VSVCLSKGLGAPVGSVIVGSR FI KA+RLRKTLGGGMRQVG+LCAAALVAIQEN+VKLEGDHE A+ LA+
Subjt:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE

XP_038891136.1 probable low-specificity L-threonine aldolase 1 [Benincasa hispida]1.7e-13892.62Show/hide
Query:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI
        MVT+ +DLRSDTVTKPTEAMR AMA+AEVDDDVLK DPTALR ETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI
Subjt:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI

Query:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS
        GGVHPRT++NN DGTMDLDLIEAAIRDPRGELVFPTTRLICLEN+HANCGGRCLSV+YTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS
Subjt:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS

Query:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE
        VSVCLSKGLGAPVGSVIVGS+ FITKA+RLRKTLGGGMRQVG+LCAAALVAIQEN+VKLEGDHE A+ILA+
Subjt:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE

TrEMBL top hitse value%identityAlignment
A0A0A0LI99 Beta_elim_lyase domain-containing protein2.7e-13791.88Show/hide
Query:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI
        MV R +DLRSDTVTKPTEAMR AMA+AEVDDDVLK DPTALR ETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI
Subjt:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI

Query:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS
        GGVHPRT++NN DGTMDLDLIEAAIRDPRGELVFPTTRLICLEN+HANCGGRCLSV+YTDRVG+LAKKHGLKLHIDGARIFNASVALGVPV+RLVQAADS
Subjt:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS

Query:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE
        VSVCLSKGLGAPVGSVIVGSR FI KA+RLRKTLGGGMRQVG+LCAAALVAIQEN+VKLEGDHE A+ILA+
Subjt:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE

A0A1S3C391 probable low-specificity L-threonine aldolase 12.7e-13791.51Show/hide
Query:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI
        MVTR +DLRSDTVTKPTEAMR AMA+AEVDDDVLK DPTALR ETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI
Subjt:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI

Query:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS
        GGVHPRT++NN DGTMDLDLIEAAIRDPRGE+VFPTTRLICLEN+HANCGGRCLSV+YTDRVG+LAKKHGLKLHIDGARIFNASVALGVPV+RLVQAADS
Subjt:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS

Query:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE
        VSVCLSKGLGAPVGSVIVGSR FI KA+RLRKTLGGGMRQVG+LCAAALVAIQEN+VKLEGDHE A+ LA+
Subjt:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE

A0A5D3BHD8 Putative low-specificity L-threonine aldolase 13.6e-13790.88Show/hide
Query:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI
        MVTR +DLRSDTVTKPTEAMR AMA+AEVDDDVLK DPTALR ETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI
Subjt:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI

Query:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS
        GGVHPRT++NN DGTMDLDLIEAAIRDPRGE+VFPTTRLICLEN+HANCGGRCLSV+YTDRVG+LAKKHGLKLHIDGARIFNASVALGVPV+RLVQAADS
Subjt:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS

Query:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAESSS
        VSVCLSKGLGAPVGSVIVGSR FI KA+RLRKTLGGGMRQVG+LCAAALVAIQEN+VKLEGDHE A+ LA  +S
Subjt:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAESSS

A0A6A6M316 ALOG domain-containing protein9.8e-19668.57Show/hide
Query:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI
        MVTR +DLRSDTVTKPTEAMR AMANAEVDDDVL  DP+A R ETEMAKI+GKEAAL+VPSGTMGNLISVLVHC+IRGSEVILG NSHIHIYENGGI+T+
Subjt:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI

Query:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS
        GGVHPRT++NN+DGTMD+DLIEAAIRDPRG LV+PTTRLICLEN+ ANCGGRCLSV YTDRV ELAKKHGLKLHIDGARIFNA+VALGVPV+RLVQAADS
Subjt:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS

Query:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE-----------------------------
        V VC SKGLGAPVGSVI GS++FI KA+ LRKTLGGGMRQVG+LCAA+LVAIQENV KLE DH+KA+ LAE                             
Subjt:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE-----------------------------

Query:  ---------------------------------SSSPTTY--------ETTSTTNSILTVTTTTVAAAAPAAAAAACS-------SPSSSSTT-------
                                         S+S   Y        +  S    IL++++  +  A+ +A     S       +PS +STT       
Subjt:  ---------------------------------SSSPTTY--------ETTSTTNSILTVTTTTVAAAAPAAAAAACS-------SPSSSSTT-------

Query:  -----TPSRYENQKRRDWNTFCQYLRNHRPPLALPMCSGAHVLEFLRYLDQFGKTKVHNQSCPFFGLPNPPAPCPCPLRQAWGSLDALIGRLRAAYEENG
             TPSRYENQKRRDWNTFCQYLRNHRPPL+LPMCSGAHVLEFL YLDQFGKTKVH Q+CPFFGLPNPPAPCPCPLRQAWGSLDALIGRLRAAYEE+G
Subjt:  -----TPSRYENQKRRDWNTFCQYLRNHRPPLALPMCSGAHVLEFLRYLDQFGKTKVHNQSCPFFGLPNPPAPCPCPLRQAWGSLDALIGRLRAAYEENG

Query:  GRPEANPFGARAVRLYLREVRDFQAKARGVSYEKKRKRPKQKLT
        GRPE NPFGAR VR+YLREVRDFQAKARGVSYEKKRKRPK K T
Subjt:  GRPEANPFGARAVRLYLREVRDFQAKARGVSYEKKRKRPKQKLT

A0A6J1CPC7 probable low-specificity L-threonine aldolase 13.6e-13791.14Show/hide
Query:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI
        MVTR +DLRSDTVTKPTEAMR AMA+AEVDDDVL+NDPTALR ETEMAKIMGKEAALFVPSGTMGNLISVLVHCE+RGSEVILGDNSHIHIYENGGIATI
Subjt:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI

Query:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS
        GGVHPR++KNNRDGTMDLDLIEAAIRDPRGE+VFPTTRLICLEN+HANCGGRCLSV+YTDRVGELAKKHGLKLHIDGARIFNASVAL VPV+RLVQAADS
Subjt:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS

Query:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE
         SVCLSKGLGAPVGSVIVGS+ FI+KA+RLRKTLGGGMRQVGVLCAAALVAIQEN+VKL+GDHEK +ILA+
Subjt:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE

SwissProt top hitse value%identityAlignment
O07051 L-allo-threonine aldolase4.0e-6953.36Show/hide
Query:  RAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATIGGV
        R IDLRSDTVT+PT+AMR  M +AEV DDV   DP     E   A ++GKEAALFVPSGTM NL++V+ HC+ RG   +LG  +HI+ YE  G A +G V
Subjt:  RAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATIGGV

Query:  HPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADSVSV
          + +    DG++ L  + AAI     ++ F  TRL+CLENTH    G+ L + Y   + EL  +HGL+LH+DGAR+FNA VA G  V  LV   DSVS+
Subjt:  HPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADSVSV

Query:  CLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE
        CLSKGLGAPVGS++VGS  FI +A+RLRK +GGGMRQ G+L  A L A+Q++VV+L  DH +AR LAE
Subjt:  CLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE

Q6NNI3 Protein LIGHT-DEPENDENT SHORT HYPOCOTYLS 14.3e-7188.89Show/hide
Query:  SPSSSSTTTP---SRYENQKRRDWNTFCQYLRNHRPPLALPMCSGAHVLEFLRYLDQFGKTKVHNQSCPFFGLPNPPAPCPCPLRQAWGSLDALIGRLRA
        +P+SS+  TP   SRYENQKRRDWNTFCQYLRNHRPPL+LP CSGAHVLEFLRYLDQFGKTKVH+Q+C FFGLPNPPAPCPCPLRQAWGSLDALIGRLRA
Subjt:  SPSSSSTTTP---SRYENQKRRDWNTFCQYLRNHRPPLALPMCSGAHVLEFLRYLDQFGKTKVHNQSCPFFGLPNPPAPCPCPLRQAWGSLDALIGRLRA

Query:  AYEENGGRPEANPFGARAVRLYLREVRDFQAKARGVSYEKKRKR
        AYEENGG PEANPFG+RAVRL+LREVRDFQAKARGVSYEKKRKR
Subjt:  AYEENGGRPEANPFGARAVRLYLREVRDFQAKARGVSYEKKRKR

Q8RXU4 Probable low-specificity L-threonine aldolase 11.6e-12378.6Show/hide
Query:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI
        MV R++DLRSDTVT+PT+AMR AM NAEVDDDVL  DPTA R E EMAK+MGKEAALFVPSGTMGNLISV+VHC++RGSEVILGDN HIH+YENGGI+TI
Subjt:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI

Query:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS
        GGVHP+T+KN  DGTMDL+ IEAAIRDP+G   +P+TRLICLENTHAN GGRCLSV+YT++VGE+AK+HG+KLHIDGAR+FNAS+ALGVPV++LV+AADS
Subjt:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS

Query:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE
        V VCLSKGLGAPVGSVIVGS++FI KAK +RKTLGGGMRQ+GVLCAAALVA+QEN+ KL+ DH+KA++LAE
Subjt:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE

Q9FPH3 Probable low-specificity L-threonine aldolase 24.8e-12379.85Show/hide
Query:  RAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATIGGV
        R +DLRSDTVTKPTE+MR+AMANAEVDDDVL NDPTALR E E+A+I GKEAA+FVPSGTMGNLISVLVHC+ RGSEVILGD+SHIHIYENGG++++GGV
Subjt:  RAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATIGGV

Query:  HPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADSVSV
        HPRT+KN  DGTM++  IEAA+R P+G+L  P T+LICLENT ANCGGRCL ++Y D+VGELAKKHGLKLHIDGARIFNASVALGVPV R+VQAADSVS+
Subjt:  HPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADSVSV

Query:  CLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE
        CLSKG+GAPVGSVIVGS+ FITKA+ LRKTLGGGMRQ+GVLCAAALVA+ ENV KLE DH+KAR+LAE
Subjt:  CLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE

Q9M836 Protein LIGHT-DEPENDENT SHORT HYPOCOTYLS 27.0e-6677.92Show/hide
Query:  PAAAAAACSSPSSSSTTTPSRYENQKRRDWNTFCQYLRNHRPPLALPMCSGAHVLEFLRYLDQFGKTKVHNQSCPFFGLPNPPAPCPCPLRQAWGSLDAL
        P  + +  +  S SS  + SRYENQKRRDWNTFCQYLRNH PPL+L  CSGAHVL+FLRYLDQFGKTKVH+Q+C FFGLPNPPAPCPCPLRQAWGSLDAL
Subjt:  PAAAAAACSSPSSSSTTTPSRYENQKRRDWNTFCQYLRNHRPPLALPMCSGAHVLEFLRYLDQFGKTKVHNQSCPFFGLPNPPAPCPCPLRQAWGSLDAL

Query:  IGRLRAAYEENGGRPEANPFGARAVRLYLREVRDFQAKARGVSYEKKRKRPKQK
        IGRLRAAYEENGG PE +PFG+R+VR++LREVRDFQAK+RGVSYEKKRKR   K
Subjt:  IGRLRAAYEENGGRPEANPFGARAVRLYLREVRDFQAKARGVSYEKKRKRPKQK

Arabidopsis top hitse value%identityAlignment
AT1G08630.1 threonine aldolase 11.2e-12478.6Show/hide
Query:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI
        MV R++DLRSDTVT+PT+AMR AM NAEVDDDVL  DPTA R E EMAK+MGKEAALFVPSGTMGNLISV+VHC++RGSEVILGDN HIH+YENGGI+TI
Subjt:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI

Query:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS
        GGVHP+T+KN  DGTMDL+ IEAAIRDP+G   +P+TRLICLENTHAN GGRCLSV+YT++VGE+AK+HG+KLHIDGAR+FNAS+ALGVPV++LV+AADS
Subjt:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS

Query:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE
        V VCLSKGLGAPVGSVIVGS++FI KAK +RKTLGGGMRQ+GVLCAAALVA+QEN+ KL+ DH+KA++LAE
Subjt:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE

AT1G08630.2 threonine aldolase 11.2e-12478.6Show/hide
Query:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI
        MV R++DLRSDTVT+PT+AMR AM NAEVDDDVL  DPTA R E EMAK+MGKEAALFVPSGTMGNLISV+VHC++RGSEVILGDN HIH+YENGGI+TI
Subjt:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI

Query:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS
        GGVHP+T+KN  DGTMDL+ IEAAIRDP+G   +P+TRLICLENTHAN GGRCLSV+YT++VGE+AK+HG+KLHIDGAR+FNAS+ALGVPV++LV+AADS
Subjt:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS

Query:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE
        V VCLSKGLGAPVGSVIVGS++FI KAK +RKTLGGGMRQ+GVLCAAALVA+QEN+ KL+ DH+KA++LAE
Subjt:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE

AT1G08630.3 threonine aldolase 11.2e-12478.6Show/hide
Query:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI
        MV R++DLRSDTVT+PT+AMR AM NAEVDDDVL  DPTA R E EMAK+MGKEAALFVPSGTMGNLISV+VHC++RGSEVILGDN HIH+YENGGI+TI
Subjt:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI

Query:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS
        GGVHP+T+KN  DGTMDL+ IEAAIRDP+G   +P+TRLICLENTHAN GGRCLSV+YT++VGE+AK+HG+KLHIDGAR+FNAS+ALGVPV++LV+AADS
Subjt:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS

Query:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE
        V VCLSKGLGAPVGSVIVGS++FI KAK +RKTLGGGMRQ+GVLCAAALVA+QEN+ KL+ DH+KA++LAE
Subjt:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE

AT1G08630.4 threonine aldolase 11.2e-12478.6Show/hide
Query:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI
        MV R++DLRSDTVT+PT+AMR AM NAEVDDDVL  DPTA R E EMAK+MGKEAALFVPSGTMGNLISV+VHC++RGSEVILGDN HIH+YENGGI+TI
Subjt:  MVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATI

Query:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS
        GGVHP+T+KN  DGTMDL+ IEAAIRDP+G   +P+TRLICLENTHAN GGRCLSV+YT++VGE+AK+HG+KLHIDGAR+FNAS+ALGVPV++LV+AADS
Subjt:  GGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADS

Query:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE
        V VCLSKGLGAPVGSVIVGS++FI KAK +RKTLGGGMRQ+GVLCAAALVA+QEN+ KL+ DH+KA++LAE
Subjt:  VSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE

AT3G04520.1 threonine aldolase 23.4e-12479.85Show/hide
Query:  RAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATIGGV
        R +DLRSDTVTKPTE+MR+AMANAEVDDDVL NDPTALR E E+A+I GKEAA+FVPSGTMGNLISVLVHC+ RGSEVILGD+SHIHIYENGG++++GGV
Subjt:  RAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEMAKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATIGGV

Query:  HPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADSVSV
        HPRT+KN  DGTM++  IEAA+R P+G+L  P T+LICLENT ANCGGRCL ++Y D+VGELAKKHGLKLHIDGARIFNASVALGVPV R+VQAADSVS+
Subjt:  HPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQYTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADSVSV

Query:  CLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE
        CLSKG+GAPVGSVIVGS+ FITKA+ LRKTLGGGMRQ+GVLCAAALVA+ ENV KLE DH+KAR+LAE
Subjt:  CLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKARILAE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATAAGGCGCTCGCCTCCGGAGCGGGAGATTGCGAACTGATCCACTCTGCTACTGCTGCCCAAGAAGAGCAAGAGTTTCTCTGTTCCTTTGCCTCCGGAGAATTTGG
GTCTTCCAACCCATTTCTTCTTCCAAGAAACCAGAGCTCGTGGAGAAGAGGAAGAAGAGAAACAGTTTTTCTTCCTAAAATGGTGACGAGAGCCATAGATCTCCGGTCAG
ACACCGTCACCAAACCAACAGAGGCGATGCGGACTGCAATGGCGAACGCCGAGGTTGATGATGATGTATTGAAAAATGACCCAACTGCCCTCCGCTTTGAGACAGAAATG
GCGAAGATCATGGGGAAGGAAGCAGCTCTTTTCGTGCCATCAGGCACCATGGGGAACCTCATAAGCGTACTCGTCCATTGCGAGATTAGGGGCTCTGAAGTCATTCTTGG
GGATAATTCACATATCCATATTTATGAAAATGGAGGTATTGCAACCATTGGAGGAGTCCATCCTAGGACTATTAAGAACAACAGAGATGGAACTATGGACTTAGATTTGA
TTGAAGCCGCCATTAGAGACCCAAGAGGAGAGCTAGTGTTCCCAACTACGAGGCTTATCTGCTTGGAAAACACACATGCTAACTGTGGTGGTAGATGCCTCTCTGTGCAA
TATACCGACAGGGTTGGCGAATTAGCTAAGAAGCATGGTTTGAAGCTTCACATTGATGGTGCTCGTATTTTCAATGCCTCAGTTGCACTTGGTGTTCCTGTTAATCGGCT
TGTGCAGGCAGCTGATTCTGTATCGGTATGTCTATCTAAGGGTCTGGGAGCTCCTGTTGGCTCTGTTATCGTTGGTTCCAGAACCTTCATTACGAAGGCGAAACGGCTGA
GGAAAACGTTGGGAGGTGGGATGAGACAGGTTGGTGTTCTTTGTGCTGCTGCATTGGTTGCAATACAAGAGAATGTTGTAAAGCTTGAGGGAGATCATGAGAAGGCTAGG
ATTTTGGCTGAATCATCCAGCCCCACCACTTATGAGACCACCTCCACAACCAACAGCATCCTCACAGTAACAACCACCACCGTCGCCGCTGCAGCGCCTGCAGCCGCTGC
CGCAGCATGTTCGTCGCCGTCGTCGTCCTCCACCACCACTCCCAGCCGGTACGAGAACCAGAAACGGCGAGACTGGAACACCTTCTGCCAGTACTTGCGGAACCACCGGC
CGCCGCTGGCTCTGCCGATGTGCAGCGGCGCCCACGTGCTGGAATTCCTCAGGTACCTTGATCAGTTCGGGAAGACCAAAGTCCACAACCAGAGCTGCCCGTTCTTCGGC
CTCCCGAACCCACCTGCCCCCTGCCCCTGCCCGCTGCGGCAGGCCTGGGGCAGCCTCGACGCGCTCATCGGCCGCCTCCGGGCAGCCTACGAAGAGAACGGCGGGAGGCC
GGAGGCGAACCCGTTCGGAGCAAGAGCCGTGAGGCTGTATTTGAGGGAAGTTCGTGATTTTCAGGCCAAAGCAAGAGGTGTCAGCTATGAGAAGAAGAGGAAGAGGCCAA
AGCAGAAGCTCACTGATGGTGCAACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATAAGGCGCTCGCCTCCGGAGCGGGAGATTGCGAACTGATCCACTCTGCTACTGCTGCCCAAGAAGAGCAAGAGTTTCTCTGTTCCTTTGCCTCCGGAGAATTTGG
GTCTTCCAACCCATTTCTTCTTCCAAGAAACCAGAGCTCGTGGAGAAGAGGAAGAAGAGAAACAGTTTTTCTTCCTAAAATGGTGACGAGAGCCATAGATCTCCGGTCAG
ACACCGTCACCAAACCAACAGAGGCGATGCGGACTGCAATGGCGAACGCCGAGGTTGATGATGATGTATTGAAAAATGACCCAACTGCCCTCCGCTTTGAGACAGAAATG
GCGAAGATCATGGGGAAGGAAGCAGCTCTTTTCGTGCCATCAGGCACCATGGGGAACCTCATAAGCGTACTCGTCCATTGCGAGATTAGGGGCTCTGAAGTCATTCTTGG
GGATAATTCACATATCCATATTTATGAAAATGGAGGTATTGCAACCATTGGAGGAGTCCATCCTAGGACTATTAAGAACAACAGAGATGGAACTATGGACTTAGATTTGA
TTGAAGCCGCCATTAGAGACCCAAGAGGAGAGCTAGTGTTCCCAACTACGAGGCTTATCTGCTTGGAAAACACACATGCTAACTGTGGTGGTAGATGCCTCTCTGTGCAA
TATACCGACAGGGTTGGCGAATTAGCTAAGAAGCATGGTTTGAAGCTTCACATTGATGGTGCTCGTATTTTCAATGCCTCAGTTGCACTTGGTGTTCCTGTTAATCGGCT
TGTGCAGGCAGCTGATTCTGTATCGGTATGTCTATCTAAGGGTCTGGGAGCTCCTGTTGGCTCTGTTATCGTTGGTTCCAGAACCTTCATTACGAAGGCGAAACGGCTGA
GGAAAACGTTGGGAGGTGGGATGAGACAGGTTGGTGTTCTTTGTGCTGCTGCATTGGTTGCAATACAAGAGAATGTTGTAAAGCTTGAGGGAGATCATGAGAAGGCTAGG
ATTTTGGCTGAATCATCCAGCCCCACCACTTATGAGACCACCTCCACAACCAACAGCATCCTCACAGTAACAACCACCACCGTCGCCGCTGCAGCGCCTGCAGCCGCTGC
CGCAGCATGTTCGTCGCCGTCGTCGTCCTCCACCACCACTCCCAGCCGGTACGAGAACCAGAAACGGCGAGACTGGAACACCTTCTGCCAGTACTTGCGGAACCACCGGC
CGCCGCTGGCTCTGCCGATGTGCAGCGGCGCCCACGTGCTGGAATTCCTCAGGTACCTTGATCAGTTCGGGAAGACCAAAGTCCACAACCAGAGCTGCCCGTTCTTCGGC
CTCCCGAACCCACCTGCCCCCTGCCCCTGCCCGCTGCGGCAGGCCTGGGGCAGCCTCGACGCGCTCATCGGCCGCCTCCGGGCAGCCTACGAAGAGAACGGCGGGAGGCC
GGAGGCGAACCCGTTCGGAGCAAGAGCCGTGAGGCTGTATTTGAGGGAAGTTCGTGATTTTCAGGCCAAAGCAAGAGGTGTCAGCTATGAGAAGAAGAGGAAGAGGCCAA
AGCAGAAGCTCACTGATGGTGCAACTTGA
Protein sequenceShow/hide protein sequence
MDKALASGAGDCELIHSATAAQEEQEFLCSFASGEFGSSNPFLLPRNQSSWRRGRRETVFLPKMVTRAIDLRSDTVTKPTEAMRTAMANAEVDDDVLKNDPTALRFETEM
AKIMGKEAALFVPSGTMGNLISVLVHCEIRGSEVILGDNSHIHIYENGGIATIGGVHPRTIKNNRDGTMDLDLIEAAIRDPRGELVFPTTRLICLENTHANCGGRCLSVQ
YTDRVGELAKKHGLKLHIDGARIFNASVALGVPVNRLVQAADSVSVCLSKGLGAPVGSVIVGSRTFITKAKRLRKTLGGGMRQVGVLCAAALVAIQENVVKLEGDHEKAR
ILAESSSPTTYETTSTTNSILTVTTTTVAAAAPAAAAAACSSPSSSSTTTPSRYENQKRRDWNTFCQYLRNHRPPLALPMCSGAHVLEFLRYLDQFGKTKVHNQSCPFFG
LPNPPAPCPCPLRQAWGSLDALIGRLRAAYEENGGRPEANPFGARAVRLYLREVRDFQAKARGVSYEKKRKRPKQKLTDGAT