; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021844 (gene) of Snake gourd v1 genome

Gene IDTan0021844
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionAB hydrolase-1 domain-containing protein
Genome locationLG08:69062080..69065638
RNA-Seq ExpressionTan0021844
SyntenyTan0021844
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR000073 - Alpha/beta hydrolase fold-1
IPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004149212.1 uncharacterized protein LOC101206403 isoform X1 [Cucumis sativus]2.4e-19888.13Show/hide
Query:  MAGEVNRKISAASAR--SHTR-KSKQSNSLRLPPGMFKKVLGLLLLGFSAWFYQAIQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYK
        M G  NRK+SAAS    +HTR KSKQSNSLRL  G+F+K+L LLLLGFSAWFYQ IQPPPPKICGSPGGP ITAPRIKLRDGRHLAYKEHGVPKDKAKYK
Subjt:  MAGEVNRKISAASAR--SHTR-KSKQSNSLRLPPGMFKKVLGLLLLGFSAWFYQAIQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYK

Query:  IVSVHGFDSCRHDTAAARALSPGLVQGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSCLKYIPNRLAGAVLI
        IVSVHGFDSCRHDTAAARALSP   +GLGIYILSFDRPGYGESDP+PKR VKSAAMDIEELADQL LGSKFYVIGGSMGG+IVWSCLKYIPNRLAGAVLI
Subjt:  IVSVHGFDSCRHDTAAARALSPGLVQGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSCLKYIPNRLAGAVLI

Query:  APVINYWWSGLPANLSNEAFNWKPLQDRWALSVAHYTPWLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKLSFRKEYVAQIRQQGEYESLHQDLNVG
        APVINYWWSGLP NLSNEAF WKPLQD+WALSVAHYTPWLTYWWNT++WFPASSIIAHNPD+LSP D+NLIPKLSFR EY AQIRQQGEYESLHQDLNVG
Subjt:  APVINYWWSGLPANLSNEAFNWKPLQDRWALSVAHYTPWLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKLSFRKEYVAQIRQQGEYESLHQDLNVG

Query:  YSSWEFSPLDLKNPFPHNDGSVHIWQGDDDRVVSVKLQRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALLLENK
        +SSWEFSPLDLKNPFPHN+GS+HIWQGDDDRVVS KLQRYIAEKLPWI YHEV GAGHLFSYAD VYDSVI ALLLE K
Subjt:  YSSWEFSPLDLKNPFPHNDGSVHIWQGDDDRVVSVKLQRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALLLENK

XP_008442834.1 PREDICTED: uncharacterized protein LOC103486603 isoform X1 [Cucumis melo]3.5e-19786.79Show/hide
Query:  MAGEVNRKISAASARS--HTR-KSKQSNSLRLPPGMFKKVLGLLLLGFSAWFYQAIQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYK
        M G   RK+SAASA    HTR KSKQSNSLRL  G+F K+LGLLLLG SAWFYQ IQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYK
Subjt:  MAGEVNRKISAASARS--HTR-KSKQSNSLRLPPGMFKKVLGLLLLGFSAWFYQAIQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYK

Query:  IVSVHGFDSCRHDTAAARALSP-------GLVQGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSCLKYIPNR
        IVSVHGFDSCRHDTAAARALSP        L +GLGIYI+SFDRPGYGESDP+PKR VKSAAMDIEEL DQL LG+KFYVIGGSMGG+IVWSCLKYIPNR
Subjt:  IVSVHGFDSCRHDTAAARALSP-------GLVQGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSCLKYIPNR

Query:  LAGAVLIAPVINYWWSGLPANLSNEAFNWKPLQDRWALSVAHYTPWLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKLSFRKEYVAQIRQQGEYESL
        LAGAVLIAPVINYWWSGLPANLSNEAF WKPLQD+WALSVAHYTPWLTYWWNTQ+WFPASSIIAHNPD+LSP D+NLIPKLSFR EY AQIRQQGEYESL
Subjt:  LAGAVLIAPVINYWWSGLPANLSNEAFNWKPLQDRWALSVAHYTPWLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKLSFRKEYVAQIRQQGEYESL

Query:  HQDLNVGYSSWEFSPLDLKNPFPHNDGSVHIWQGDDDRVVSVKLQRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALLLENK
        HQDLNVG+SSWEFSPL+LKNPFPHN+GS+HIWQGDDDRVVS KLQRYIAEKLPWI YHEV GAGHLFSYADNVYDSVI ALLLE K
Subjt:  HQDLNVGYSSWEFSPLDLKNPFPHNDGSVHIWQGDDDRVVSVKLQRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALLLENK

XP_008442835.1 PREDICTED: uncharacterized protein LOC103486603 isoform X2 [Cucumis melo]2.8e-19988.39Show/hide
Query:  MAGEVNRKISAASARS--HTR-KSKQSNSLRLPPGMFKKVLGLLLLGFSAWFYQAIQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYK
        M G   RK+SAASA    HTR KSKQSNSLRL  G+F K+LGLLLLG SAWFYQ IQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYK
Subjt:  MAGEVNRKISAASARS--HTR-KSKQSNSLRLPPGMFKKVLGLLLLGFSAWFYQAIQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYK

Query:  IVSVHGFDSCRHDTAAARALSPGLVQGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSCLKYIPNRLAGAVLI
        IVSVHGFDSCRHDTAAARALSP L +GLGIYI+SFDRPGYGESDP+PKR VKSAAMDIEEL DQL LG+KFYVIGGSMGG+IVWSCLKYIPNRLAGAVLI
Subjt:  IVSVHGFDSCRHDTAAARALSPGLVQGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSCLKYIPNRLAGAVLI

Query:  APVINYWWSGLPANLSNEAFNWKPLQDRWALSVAHYTPWLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKLSFRKEYVAQIRQQGEYESLHQDLNVG
        APVINYWWSGLPANLSNEAF WKPLQD+WALSVAHYTPWLTYWWNTQ+WFPASSIIAHNPD+LSP D+NLIPKLSFR EY AQIRQQGEYESLHQDLNVG
Subjt:  APVINYWWSGLPANLSNEAFNWKPLQDRWALSVAHYTPWLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKLSFRKEYVAQIRQQGEYESLHQDLNVG

Query:  YSSWEFSPLDLKNPFPHNDGSVHIWQGDDDRVVSVKLQRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALLLENK
        +SSWEFSPL+LKNPFPHN+GS+HIWQGDDDRVVS KLQRYIAEKLPWI YHEV GAGHLFSYADNVYDSVI ALLLE K
Subjt:  YSSWEFSPLDLKNPFPHNDGSVHIWQGDDDRVVSVKLQRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALLLENK

XP_038904915.1 uncharacterized protein LOC120091129 isoform X1 [Benincasa hispida]1.6e-19485.42Show/hide
Query:  MAGEVNRKISA--ASARSHTR-KSKQSNSLRLPPGMFKKVLGLLLLGFSAWFYQAIQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYK
        M G  NRK++A  AS  +H R KSKQS+SLRLP G F+K+LGLL LGFSAWFY  IQPPPPKICG PGGPPITAPRIKL DGRHLAYKEHGVPKDKAKYK
Subjt:  MAGEVNRKISA--ASARSHTR-KSKQSNSLRLPPGMFKKVLGLLLLGFSAWFYQAIQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYK

Query:  IVSVHGFDSCRHDTAAARALSP-----GLVQGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSCLKYIPNRLA
        IVSVHGFDSCRHDT AARALSP      L++GLGIYILSFDRPGYGESDPDPKR VKSAAMDIEELADQL LGSKFYV+GGSMGG+IVWSCLKYIPNRLA
Subjt:  IVSVHGFDSCRHDTAAARALSP-----GLVQGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSCLKYIPNRLA

Query:  GAVLIAPVINYWWSGLPANLSNEAFNWKPLQDRWALSVAHYTPWLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKLSFRKEYVAQIRQQGEYESLHQ
        GAVLIAPVINYWW GLPANLSNEAF WKPLQD+WALSVAHYTPWLTYWWNTQ+WFPASSII HNP+ILSP D+NLI KLSFR++Y+AQIRQQGEYESLHQ
Subjt:  GAVLIAPVINYWWSGLPANLSNEAFNWKPLQDRWALSVAHYTPWLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKLSFRKEYVAQIRQQGEYESLHQ

Query:  DLNVGYSSWEFSPLDLKNPFPHNDGSVHIWQGDDDRVVSVKLQRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALLLENK
        DLNVG+SSWEFSPLDLKNPFPHNDGS+HIWQGDDDRVVS KLQR IAEKLPWI YHEV GAGHLFSYADNVYDS+I ALLLE K
Subjt:  DLNVGYSSWEFSPLDLKNPFPHNDGSVHIWQGDDDRVVSVKLQRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALLLENK

XP_038904916.1 uncharacterized protein LOC120091129 isoform X2 [Benincasa hispida]2.3e-19686.54Show/hide
Query:  MAGEVNRKISA--ASARSHTR-KSKQSNSLRLPPGMFKKVLGLLLLGFSAWFYQAIQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYK
        M G  NRK++A  AS  +H R KSKQS+SLRLP G F+K+LGLL LGFSAWFY  IQPPPPKICG PGGPPITAPRIKL DGRHLAYKEHGVPKDKAKYK
Subjt:  MAGEVNRKISA--ASARSHTR-KSKQSNSLRLPPGMFKKVLGLLLLGFSAWFYQAIQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYK

Query:  IVSVHGFDSCRHDTAAARALSPGLVQGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSCLKYIPNRLAGAVLI
        IVSVHGFDSCRHDT AARALSP L++GLGIYILSFDRPGYGESDPDPKR VKSAAMDIEELADQL LGSKFYV+GGSMGG+IVWSCLKYIPNRLAGAVLI
Subjt:  IVSVHGFDSCRHDTAAARALSPGLVQGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSCLKYIPNRLAGAVLI

Query:  APVINYWWSGLPANLSNEAFNWKPLQDRWALSVAHYTPWLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKLSFRKEYVAQIRQQGEYESLHQDLNVG
        APVINYWW GLPANLSNEAF WKPLQD+WALSVAHYTPWLTYWWNTQ+WFPASSII HNP+ILSP D+NLI KLSFR++Y+AQIRQQGEYESLHQDLNVG
Subjt:  APVINYWWSGLPANLSNEAFNWKPLQDRWALSVAHYTPWLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKLSFRKEYVAQIRQQGEYESLHQDLNVG

Query:  YSSWEFSPLDLKNPFPHNDGSVHIWQGDDDRVVSVKLQRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALLLENK
        +SSWEFSPLDLKNPFPHNDGS+HIWQGDDDRVVS KLQR IAEKLPWI YHEV GAGHLFSYADNVYDS+I ALLLE K
Subjt:  YSSWEFSPLDLKNPFPHNDGSVHIWQGDDDRVVSVKLQRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALLLENK

TrEMBL top hitse value%identityAlignment
A0A0A0LE07 AB hydrolase-1 domain-containing protein1.2e-19888.13Show/hide
Query:  MAGEVNRKISAASAR--SHTR-KSKQSNSLRLPPGMFKKVLGLLLLGFSAWFYQAIQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYK
        M G  NRK+SAAS    +HTR KSKQSNSLRL  G+F+K+L LLLLGFSAWFYQ IQPPPPKICGSPGGP ITAPRIKLRDGRHLAYKEHGVPKDKAKYK
Subjt:  MAGEVNRKISAASAR--SHTR-KSKQSNSLRLPPGMFKKVLGLLLLGFSAWFYQAIQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYK

Query:  IVSVHGFDSCRHDTAAARALSPGLVQGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSCLKYIPNRLAGAVLI
        IVSVHGFDSCRHDTAAARALSP   +GLGIYILSFDRPGYGESDP+PKR VKSAAMDIEELADQL LGSKFYVIGGSMGG+IVWSCLKYIPNRLAGAVLI
Subjt:  IVSVHGFDSCRHDTAAARALSPGLVQGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSCLKYIPNRLAGAVLI

Query:  APVINYWWSGLPANLSNEAFNWKPLQDRWALSVAHYTPWLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKLSFRKEYVAQIRQQGEYESLHQDLNVG
        APVINYWWSGLP NLSNEAF WKPLQD+WALSVAHYTPWLTYWWNT++WFPASSIIAHNPD+LSP D+NLIPKLSFR EY AQIRQQGEYESLHQDLNVG
Subjt:  APVINYWWSGLPANLSNEAFNWKPLQDRWALSVAHYTPWLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKLSFRKEYVAQIRQQGEYESLHQDLNVG

Query:  YSSWEFSPLDLKNPFPHNDGSVHIWQGDDDRVVSVKLQRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALLLENK
        +SSWEFSPLDLKNPFPHN+GS+HIWQGDDDRVVS KLQRYIAEKLPWI YHEV GAGHLFSYAD VYDSVI ALLLE K
Subjt:  YSSWEFSPLDLKNPFPHNDGSVHIWQGDDDRVVSVKLQRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALLLENK

A0A1S3B6N5 uncharacterized protein LOC103486603 isoform X11.7e-19786.79Show/hide
Query:  MAGEVNRKISAASARS--HTR-KSKQSNSLRLPPGMFKKVLGLLLLGFSAWFYQAIQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYK
        M G   RK+SAASA    HTR KSKQSNSLRL  G+F K+LGLLLLG SAWFYQ IQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYK
Subjt:  MAGEVNRKISAASARS--HTR-KSKQSNSLRLPPGMFKKVLGLLLLGFSAWFYQAIQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYK

Query:  IVSVHGFDSCRHDTAAARALSP-------GLVQGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSCLKYIPNR
        IVSVHGFDSCRHDTAAARALSP        L +GLGIYI+SFDRPGYGESDP+PKR VKSAAMDIEEL DQL LG+KFYVIGGSMGG+IVWSCLKYIPNR
Subjt:  IVSVHGFDSCRHDTAAARALSP-------GLVQGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSCLKYIPNR

Query:  LAGAVLIAPVINYWWSGLPANLSNEAFNWKPLQDRWALSVAHYTPWLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKLSFRKEYVAQIRQQGEYESL
        LAGAVLIAPVINYWWSGLPANLSNEAF WKPLQD+WALSVAHYTPWLTYWWNTQ+WFPASSIIAHNPD+LSP D+NLIPKLSFR EY AQIRQQGEYESL
Subjt:  LAGAVLIAPVINYWWSGLPANLSNEAFNWKPLQDRWALSVAHYTPWLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKLSFRKEYVAQIRQQGEYESL

Query:  HQDLNVGYSSWEFSPLDLKNPFPHNDGSVHIWQGDDDRVVSVKLQRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALLLENK
        HQDLNVG+SSWEFSPL+LKNPFPHN+GS+HIWQGDDDRVVS KLQRYIAEKLPWI YHEV GAGHLFSYADNVYDSVI ALLLE K
Subjt:  HQDLNVGYSSWEFSPLDLKNPFPHNDGSVHIWQGDDDRVVSVKLQRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALLLENK

A0A1S3B6P3 uncharacterized protein LOC103486603 isoform X21.4e-19988.39Show/hide
Query:  MAGEVNRKISAASARS--HTR-KSKQSNSLRLPPGMFKKVLGLLLLGFSAWFYQAIQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYK
        M G   RK+SAASA    HTR KSKQSNSLRL  G+F K+LGLLLLG SAWFYQ IQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYK
Subjt:  MAGEVNRKISAASARS--HTR-KSKQSNSLRLPPGMFKKVLGLLLLGFSAWFYQAIQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYK

Query:  IVSVHGFDSCRHDTAAARALSPGLVQGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSCLKYIPNRLAGAVLI
        IVSVHGFDSCRHDTAAARALSP L +GLGIYI+SFDRPGYGESDP+PKR VKSAAMDIEEL DQL LG+KFYVIGGSMGG+IVWSCLKYIPNRLAGAVLI
Subjt:  IVSVHGFDSCRHDTAAARALSPGLVQGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSCLKYIPNRLAGAVLI

Query:  APVINYWWSGLPANLSNEAFNWKPLQDRWALSVAHYTPWLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKLSFRKEYVAQIRQQGEYESLHQDLNVG
        APVINYWWSGLPANLSNEAF WKPLQD+WALSVAHYTPWLTYWWNTQ+WFPASSIIAHNPD+LSP D+NLIPKLSFR EY AQIRQQGEYESLHQDLNVG
Subjt:  APVINYWWSGLPANLSNEAFNWKPLQDRWALSVAHYTPWLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKLSFRKEYVAQIRQQGEYESLHQDLNVG

Query:  YSSWEFSPLDLKNPFPHNDGSVHIWQGDDDRVVSVKLQRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALLLENK
        +SSWEFSPL+LKNPFPHN+GS+HIWQGDDDRVVS KLQRYIAEKLPWI YHEV GAGHLFSYADNVYDSVI ALLLE K
Subjt:  YSSWEFSPLDLKNPFPHNDGSVHIWQGDDDRVVSVKLQRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALLLENK

A0A5A7TRQ5 Hydrolase family protein1.4e-19988.39Show/hide
Query:  MAGEVNRKISAASARS--HTR-KSKQSNSLRLPPGMFKKVLGLLLLGFSAWFYQAIQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYK
        M G   RK+SAASA    HTR KSKQSNSLRL  G+F K+LGLLLLG SAWFYQ IQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYK
Subjt:  MAGEVNRKISAASARS--HTR-KSKQSNSLRLPPGMFKKVLGLLLLGFSAWFYQAIQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYK

Query:  IVSVHGFDSCRHDTAAARALSPGLVQGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSCLKYIPNRLAGAVLI
        IVSVHGFDSCRHDTAAARALSP L +GLGIYI+SFDRPGYGESDP+PKR VKSAAMDIEEL DQL LG+KFYVIGGSMGG+IVWSCLKYIPNRLAGAVLI
Subjt:  IVSVHGFDSCRHDTAAARALSPGLVQGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSCLKYIPNRLAGAVLI

Query:  APVINYWWSGLPANLSNEAFNWKPLQDRWALSVAHYTPWLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKLSFRKEYVAQIRQQGEYESLHQDLNVG
        APVINYWWSGLPANLSNEAF WKPLQD+WALSVAHYTPWLTYWWNTQ+WFPASSIIAHNPD+LSP D+NLIPKLSFR EY AQIRQQGEYESLHQDLNVG
Subjt:  APVINYWWSGLPANLSNEAFNWKPLQDRWALSVAHYTPWLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKLSFRKEYVAQIRQQGEYESLHQDLNVG

Query:  YSSWEFSPLDLKNPFPHNDGSVHIWQGDDDRVVSVKLQRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALLLENK
        +SSWEFSPL+LKNPFPHN+GS+HIWQGDDDRVVS KLQRYIAEKLPWI YHEV GAGHLFSYADNVYDSVI ALLLE K
Subjt:  YSSWEFSPLDLKNPFPHNDGSVHIWQGDDDRVVSVKLQRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALLLENK

A0A6J1CU97 uncharacterized protein LOC111014285 isoform X18.1e-19286.06Show/hide
Query:  MAGEVNRKISAASARSHTRKSKQSNSLRLPPGMFKKVLGLLLLGFSAWFYQAIQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYKIVS
        MAG VNRKISAASAR+HTRKS+QS    LP GM +K+LGLL +GFSA FY AIQPPPPKICGSPGGPPIT+ RIKLRDGRHLAYKEHGVP++KAKYKIVS
Subjt:  MAGEVNRKISAASARSHTRKSKQSNSLRLPPGMFKKVLGLLLLGFSAWFYQAIQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYKIVS

Query:  VHGFDSCRHDTAAARALSPGLVQGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSCLKYIPNRLAGAVLIAPV
        VHGFDSCRHDT AAR L+P LV+GLGIYI+SFDRPGYGESDPDPKR VKS AMDI ELADQL LGSKFYVIGGSMGG IVWSCLKYIPNRLAGAVL+APV
Subjt:  VHGFDSCRHDTAAARALSPGLVQGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSCLKYIPNRLAGAVLIAPV

Query:  INYWWSGLPANLSNEAFNWKPLQDRWALSVAHYTPWLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKLSFRKEYVAQIRQQGEYESLHQDLNVGYSS
        INYWWS LPANLSNEA+  KPLQDRWALS+AHYTPWLTYWWNTQRWFPASS IAHN D+LSPED+ LIPKLSFRKEYVAQIRQQGEY SLHQDLNVG+ S
Subjt:  INYWWSGLPANLSNEAFNWKPLQDRWALSVAHYTPWLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKLSFRKEYVAQIRQQGEYESLHQDLNVGYSS

Query:  WEFSPLDLKNPFPHNDGSVHIWQGDDDRVVSVKLQRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALLL
        WEFSPLDLKNPFPHNDGSVH+WQGDDDRVVS KLQRYIA+ LPWIHYHEV GAGHLFSYAD VYDSVI  LLL
Subjt:  WEFSPLDLKNPFPHNDGSVHIWQGDDDRVVSVKLQRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALLL

SwissProt top hitse value%identityAlignment
Q1QQD3 Homoserine O-acetyltransferase9.4e-0434.21Show/hide
Query:  GVPKDKAKYKIVSVHGFDSCRHDTA-AARALSPGLVQGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGG--VIVWSCL
        G P D  KY ++  +    C   T  A+   + G V GL   +++          PD   MV++ AM    L D+LG+ + F V+GGSMGG  V+ W C+
Subjt:  GVPKDKAKYKIVSVHGFDSCRHDTA-AARALSPGLVQGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGG--VIVWSCL

Query:  KYIPNRLAGAVLIA
         Y P R+  A+ IA
Subjt:  KYIPNRLAGAVLIA

Q9KWQ6 4,5:9,10-diseco-3-hydroxy-5,9,17-trioxoandrosta-1(10),2-diene-4-oate hydrolase4.5e-0631.97Show/hide
Query:  TAPRIKLRDGRHLAYKEHGVPKDKAKYKIVSVHGFDSCRHDTAAARALSPGLVQGLGIYILSFDRPGYGESD---PDPKRMVKSAAMDIEELADQLGLGS
        T+   ++R    L Y E GV  D     IV +HG        +      P L +    ++L+ D+PGYG SD     P+  V SA+  +++L D LG+G 
Subjt:  TAPRIKLRDGRHLAYKEHGVPKDKAKYKIVSVHGFDSCRHDTAAARALSPGLVQGLGIYILSFDRPGYGESD---PDPKRMVKSAAMDIEELADQLGLGS

Query:  KFYVIGGSM-GGVIVWSCLKYIPNRLAGAVLIAPVINYWWSGLPANL
        + +++G S+ GG  V   L Y P+R    VL+ P       GL  NL
Subjt:  KFYVIGGSM-GGVIVWSCLKYIPNRLAGAVLIAPVINYWWSGLPANL

Arabidopsis top hitse value%identityAlignment
AT1G74280.1 alpha/beta-Hydrolases superfamily protein5.8e-12656.94Show/hide
Query:  TRKSKQSNSLRLPPG---MFKKVLGLLLLGFSAWFYQA-IQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYKIVSVHGFDSCRHDTAA
        TR S +++SL+  P    +F  V+ +++    A+ YQ+ ++PPPPK+CGS GGPPITAPRIKL+DGR+LAYKEHG+P++KA  KIV +HG D CRHD   
Subjt:  TRKSKQSNSLRLPPG---MFKKVLGLLLLGFSAWFYQA-IQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYKIVSVHGFDSCRHDTAA

Query:  ARALSPGLVQGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSCLKYIPNRLAGAVLIAPVINYWWSGLPANLS
        A  LSP LV+ LG+Y++SFDRPGY ESDP P R  +S   DIEELADQL LGSKFYV+G SMGG   W CLKYIP+RLAG  L+APV+NY+W  LP N+S
Subjt:  ARALSPGLVQGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSCLKYIPNRLAGAVLIAPVINYWWSGLPANLS

Query:  NEAFNWKPLQDRWALSVAHYTPWLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKL-SFRKEYVAQIRQQGEYESLHQDLNVGYSSWEFSPLDLKNPF
         E FN++  +D+ A+ VAHYTPWL YWWNTQ+WFP SSI   +  +L+  D+++I KL S RK + A++RQQG +ES+++D+ VG+ +WEF PLDL+NPF
Subjt:  NEAFNWKPLQDRWALSVAHYTPWLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKL-SFRKEYVAQIRQQGEYESLHQDLNVGYSSWEFSPLDLKNPF

Query:  PHNDGSVHIWQGDDDRVVSVKLQRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALL
         + +GSVH+WQGD+D +V  KLQRY+A +LPW+HYHEV  +GH F Y   V D ++ +LL
Subjt:  PHNDGSVHIWQGDDDRVVSVKLQRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALL

AT1G74290.1 alpha/beta-Hydrolases superfamily protein8.7e-12256.01Show/hide
Query:  ARSHTRKSKQSNSLR---LPPGMFKKVLGLLLLGFSAWFYQA-IQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYKIVSVHGFDSCRH
        A   +RKS    S R   L P +   ++G+++    A  YQ+ ++PP PK+CGS  GPPITAPRIKL+DGR+LAYKEHG+P++KA  KIV +HG D CRH
Subjt:  ARSHTRKSKQSNSLR---LPPGMFKKVLGLLLLGFSAWFYQA-IQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYKIVSVHGFDSCRH

Query:  DTAAARALSPGLVQGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSC--LKYIPNRLAGAVLIAPVINYWWSG
        D   A  LSP LV+ LG+Y++SFDRPGY ESDP P R  +S   DIEEL DQL LGSKFYVIG SMGG   W C  LKYIP+RLAG  L+APV+NY+W  
Subjt:  DTAAARALSPGLVQGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSC--LKYIPNRLAGAVLIAPVINYWWSG

Query:  LPANLSNEAFNWKPLQDRWALSVAHYTPWLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKLSF-RKEYVAQIRQQGEYESLHQDLNVGYSSWEFSPL
        LP N+S E FN++  +D+WA+ VAHY PWL YWWNTQ+WFP SS IA+   +LS  DR++I K  + RK + A++RQQG +ES+++D+ VG+ +WEF PL
Subjt:  LPANLSNEAFNWKPLQDRWALSVAHYTPWLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKLSF-RKEYVAQIRQQGEYESLHQDLNVGYSSWEFSPL

Query:  DLKNPFPHNDGSVHIWQGDDDRVVSVKLQRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALL
        DL NPF +N+G VH+WQGD+D +V VKLQRY+A +LPW+HYHEV  +GH F +   V D+++  LL
Subjt:  DLKNPFPHNDGSVHIWQGDDDRVVSVKLQRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALL

AT1G74300.1 alpha/beta-Hydrolases superfamily protein1.8e-13061.24Show/hide
Query:  VLGLLLLGFSAWFYQAI-QPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYKIVSVHGFDSCRHDTAAARALSPGLVQGLGIYILSFDRP
        ++G++L    A  YQ+I +PPPP +CGSPGGPPITAPRIKLRDGRHLAYKE+G+P++KAK+KIV +HG DSCRHD   A  LSP LVQ  G+Y++SFD+P
Subjt:  VLGLLLLGFSAWFYQAI-QPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYKIVSVHGFDSCRHDTAAARALSPGLVQGLGIYILSFDRP

Query:  GYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSCLKYIPNRLAGAVLIAPVINYWWSGLPANLSNEAFNWKPLQDRWALSVAHYTP
        GYGESDPDP R  KS A+DIEELADQL LGSKFYVIG SMGG   W CLKY P+RLAG  L+APV+NY+W  LP N+S E FN +  +D+WA+ VAHY P
Subjt:  GYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSCLKYIPNRLAGAVLIAPVINYWWSGLPANLSNEAFNWKPLQDRWALSVAHYTP

Query:  WLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKL-SFRKEYVAQIRQQGEYESLHQDLNVGYSSWEFSPLDLKNPFPHNDGSVHIWQGDDDRVVSVKL
        WL YWWNTQ WFP SS++  +  +LS  D+++I KL S RK ++A++RQQG +ES+++D+ VG+ +WEF PL+L+NPF + +GSVH+WQGD+D +V V L
Subjt:  WLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKL-SFRKEYVAQIRQQGEYESLHQDLNVGYSSWEFSPLDLKNPFPHNDGSVHIWQGDDDRVVSVKL

Query:  QRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALL
        QRYIA+KLPW+HYHEV G GH F  A  V D ++   L
Subjt:  QRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALL

AT2G36290.1 alpha/beta-Hydrolases superfamily protein5.8e-13460.68Show/hide
Query:  SNSLRLP-PGMFKKVLGLLLLGFSAWFYQAIQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYKIVSVHGFDSCRHDTAAARALSPGLV
        ++S R P  G+ +K+L L  +  +   Y+AIQPPP K+CGSP GP IT PRIKLRDGR LAYKEHGVP+D+A +KI+ VHG DSCRHD A A  LSP + 
Subjt:  SNSLRLP-PGMFKKVLGLLLLGFSAWFYQAIQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYKIVSVHGFDSCRHDTAAARALSPGLV

Query:  QGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSCLKYIPNRLAGAVLIAPVINYWWSGLPANLSNEAFNWKPL
        +GLG+Y++SFDRPGY ESDPDP R  KS A+DIEELADQL LGSKFYVIG SMGG   W+CLKYIP+RLAG  L+APV+NYWW   P+ +S EAFN +  
Subjt:  QGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSCLKYIPNRLAGAVLIAPVINYWWSGLPANLSNEAFNWKPL

Query:  QDRWALSVAHYTPWLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKL-SFRKEYVAQIRQQGEYESLHQDLNVGYSSWEFSPLDLKNPFPHNDGSVHI
         D+WA+ VAHY PWLT+WWN+Q WFP SS++A N  +LS  D+ ++ KL + R ++ AQIRQQG +E+LH+D+ VG+ +WEF P++L+N FP+N+GSVH+
Subjt:  QDRWALSVAHYTPWLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKL-SFRKEYVAQIRQQGEYESLHQDLNVGYSSWEFSPLDLKNPFPHNDGSVHI

Query:  WQGDDDRVVSVKLQRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALL
        WQGDDD +V V LQRYIA+KLPWIHYHE+ GAGHLF +A  + ++++  LL
Subjt:  WQGDDDRVVSVKLQRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALL

AT3G48410.1 alpha/beta-Hydrolases superfamily protein7.4e-13759.73Show/hide
Query:  MAGE--VNRKISAASARSHTRKSKQSNSLRLPPGMFKKVLGLLLLGFSAWFYQAIQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYKI
        MAGE     K SAASAR+HTR + Q +S     G  K +L +  +G  AW YQ IQPPP KI GSPGGP +T+PRIKLRDGRHLAY E G+P+D+AK+KI
Subjt:  MAGE--VNRKISAASARSHTRKSKQSNSLRLPPGMFKKVLGLLLLGFSAWFYQAIQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYKI

Query:  VSVHGFDSCRHDTAAARALSPGLVQGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSCLKYIPNRLAGAVLIA
        +++HGFDSC  D+  A  LSP LV+ L IYI+SFDRPGYGESDP+     +S A+DIEELAD LGLG +FY+ G SMGG I W+CL YIP+RLAGA L+A
Subjt:  VSVHGFDSCRHDTAAARALSPGLVQGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSCLKYIPNRLAGAVLIA

Query:  PVINYWWSGLPANLSNEAFNWKPLQDRWALSVAHYTPWLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKLSFRKEYVAQIRQQGEYESLHQDLNVGY
        P INYWW  LP +L+ EAF+     D+W+L VAHY PWLTYWWNTQ+WFP S++IA NP I S +D  ++ KL F     A IRQQGEY SLH+DLNV +
Subjt:  PVINYWWSGLPANLSNEAFNWKPLQDRWALSVAHYTPWLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKLSFRKEYVAQIRQQGEYESLHQDLNVGY

Query:  SSWEFSPLDLKNPFPHNDGSVHIWQGDDDRVVSVKLQRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALLL
        SSWEF PLDL++PFP+N+GSVH+W GD+D+ V VKLQRY+A KLPWI YHE+ G+GH   + + + D +I +LL+
Subjt:  SSWEFSPLDLKNPFPHNDGSVHIWQGDDDRVVSVKLQRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGGAGAAGTTAACCGGAAGATATCGGCGGCATCTGCTAGATCTCACACGAGAAAATCAAAACAGAGCAACTCCTTACGGCTTCCCCCAGGGATGTTCAAAAAAGT
GTTGGGGCTTCTCCTTTTGGGATTTTCAGCATGGTTCTATCAGGCGATTCAGCCTCCTCCTCCCAAGATATGTGGCTCACCTGGTGGCCCTCCCATTACAGCACCAAGAA
TAAAGCTCAGAGATGGAAGGCATTTGGCGTACAAGGAGCATGGTGTGCCAAAAGACAAAGCCAAGTACAAAATTGTCTCAGTCCATGGCTTTGACTCTTGCAGGCACGAC
ACTGCAGCCGCCCGCGCCCTTTCTCCGGGACTTGTTCAAGGCTTGGGAATCTACATTCTGTCTTTTGATAGACCTGGTTATGGAGAGAGTGATCCCGATCCAAAGAGGAT
GGTGAAGAGCGCAGCTATGGACATAGAAGAGCTTGCAGATCAATTGGGGTTGGGATCCAAATTCTATGTTATTGGAGGTTCCATGGGTGGGGTTATAGTTTGGAGCTGTC
TGAAGTACATTCCAAACAGGCTGGCGGGGGCGGTACTAATAGCTCCAGTTATCAACTACTGGTGGTCAGGTCTTCCTGCAAATTTGTCTAATGAAGCCTTCAATTGGAAG
CCATTGCAAGACCGATGGGCTCTTAGTGTTGCTCACTACACTCCTTGGCTTACATACTGGTGGAACACACAGAGATGGTTTCCTGCTTCGAGTATTATCGCTCACAATCC
CGACATTCTTTCTCCCGAGGACAGAAATCTCATTCCTAAGCTCTCATTTAGAAAGGAATATGTGGCTCAGATTAGACAACAAGGTGAATATGAATCCCTCCATCAGGACC
TGAATGTTGGATATAGCAGCTGGGAGTTCAGTCCTCTGGACCTTAAAAATCCATTTCCTCACAATGACGGGTCGGTCCATATATGGCAGGGAGACGACGACAGGGTCGTG
TCTGTTAAACTGCAACGCTACATTGCCGAAAAGCTCCCATGGATTCACTACCACGAGGTACGCGGTGCTGGGCACTTGTTCTCTTATGCTGATAATGTTTATGATTCTGT
CATCAATGCTCTTTTGCTTGAAAATAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGGAGAAGTTAACCGGAAGATATCGGCGGCATCTGCTAGATCTCACACGAGAAAATCAAAACAGAGCAACTCCTTACGGCTTCCCCCAGGGATGTTCAAAAAAGT
GTTGGGGCTTCTCCTTTTGGGATTTTCAGCATGGTTCTATCAGGCGATTCAGCCTCCTCCTCCCAAGATATGTGGCTCACCTGGTGGCCCTCCCATTACAGCACCAAGAA
TAAAGCTCAGAGATGGAAGGCATTTGGCGTACAAGGAGCATGGTGTGCCAAAAGACAAAGCCAAGTACAAAATTGTCTCAGTCCATGGCTTTGACTCTTGCAGGCACGAC
ACTGCAGCCGCCCGCGCCCTTTCTCCGGGACTTGTTCAAGGCTTGGGAATCTACATTCTGTCTTTTGATAGACCTGGTTATGGAGAGAGTGATCCCGATCCAAAGAGGAT
GGTGAAGAGCGCAGCTATGGACATAGAAGAGCTTGCAGATCAATTGGGGTTGGGATCCAAATTCTATGTTATTGGAGGTTCCATGGGTGGGGTTATAGTTTGGAGCTGTC
TGAAGTACATTCCAAACAGGCTGGCGGGGGCGGTACTAATAGCTCCAGTTATCAACTACTGGTGGTCAGGTCTTCCTGCAAATTTGTCTAATGAAGCCTTCAATTGGAAG
CCATTGCAAGACCGATGGGCTCTTAGTGTTGCTCACTACACTCCTTGGCTTACATACTGGTGGAACACACAGAGATGGTTTCCTGCTTCGAGTATTATCGCTCACAATCC
CGACATTCTTTCTCCCGAGGACAGAAATCTCATTCCTAAGCTCTCATTTAGAAAGGAATATGTGGCTCAGATTAGACAACAAGGTGAATATGAATCCCTCCATCAGGACC
TGAATGTTGGATATAGCAGCTGGGAGTTCAGTCCTCTGGACCTTAAAAATCCATTTCCTCACAATGACGGGTCGGTCCATATATGGCAGGGAGACGACGACAGGGTCGTG
TCTGTTAAACTGCAACGCTACATTGCCGAAAAGCTCCCATGGATTCACTACCACGAGGTACGCGGTGCTGGGCACTTGTTCTCTTATGCTGATAATGTTTATGATTCTGT
CATCAATGCTCTTTTGCTTGAAAATAAGTGA
Protein sequenceShow/hide protein sequence
MAGEVNRKISAASARSHTRKSKQSNSLRLPPGMFKKVLGLLLLGFSAWFYQAIQPPPPKICGSPGGPPITAPRIKLRDGRHLAYKEHGVPKDKAKYKIVSVHGFDSCRHD
TAAARALSPGLVQGLGIYILSFDRPGYGESDPDPKRMVKSAAMDIEELADQLGLGSKFYVIGGSMGGVIVWSCLKYIPNRLAGAVLIAPVINYWWSGLPANLSNEAFNWK
PLQDRWALSVAHYTPWLTYWWNTQRWFPASSIIAHNPDILSPEDRNLIPKLSFRKEYVAQIRQQGEYESLHQDLNVGYSSWEFSPLDLKNPFPHNDGSVHIWQGDDDRVV
SVKLQRYIAEKLPWIHYHEVRGAGHLFSYADNVYDSVINALLLENK