; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0013513 (gene) of Snake gourd v1 genome

Gene IDTan0013513
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUncharacterised conserved protein UCP031088, alpha/beta hydrolase
Genome locationLG05:73228552..73231803
RNA-Seq ExpressionTan0013513
SyntenyTan0013513
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0004252 - serine-type endopeptidase activity (molecular function)
InterPro domainsIPR002471 - Peptidase S9, serine active site
IPR016969 - Uncharacterised conserved protein UCP031088, alpha/beta hydrolase, At1g15070
IPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572951.1 hypothetical protein SDJN03_26838, partial [Cucurbita argyrosperma subsp. sororia]2.6e-24586.84Show/hide
Query:  MATLTLSRFDLSSIGWSNNRRRHLHRFRRQGKSEASWALCRRNVVAVKSVTAFYGGAYGWSDNKEKGSISTADELHYVSVPNSDWKLALWRYLPSFQAPS
        MATL+LSRFDLSSIG S+NRRRHLHRFRRQGKSEASWALCRRN+VAVKSVTAFYGG      +KEKG I TADELHYVSVPNSDWKLALWRY PS QA S
Subjt:  MATLTLSRFDLSSIGWSNNRRRHLHRFRRQGKSEASWALCRRNVVAVKSVTAFYGGAYGWSDNKEKGSISTADELHYVSVPNSDWKLALWRYLPSFQAPS

Query:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTHRVEMND-TGHIRPETPEKRPL----------GSSISSKDGQTSTIATQLR
        RNHPLLLLSGVGSNALGYDLSP+SSFARYMSNQGYDTWILEVRGSGLSTHRVE  D TG IR ETPEK+PL          GSSISSK GQ STIATQL 
Subjt:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTHRVEMND-TGHIRPETPEKRPL----------GSSISSKDGQTSTIATQLR

Query:  NWNKNLINIIEGAQQLGPLKPFNLRGVTTALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYATISRCSFNKVDP
        NWNKNLINIIEGAQQLGPLKPFNL+GVT+ALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYA IS CSFNKVDP
Subjt:  NWNKNLINIIEGAQQLGPLKPFNLRGVTTALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYATISRCSFNKVDP

Query:  QLASVITLASSLDYRPSNSSLRLLLPLRDPARAFNVPVFPIGPLLVIAHPLASRPPYVLYWLKDQISAEDMLDPTLLEKLVLNGFESVPAKVLLQLASVF
        QLASV+TLASSLDYRPSNSSLRLLLPL +PA+AFNVPVFPIGPLL IAHPLASRPPY+LYWLKDQIS EDMLDPTLLEKLVLNGFESVPAKVLLQL+SVF
Subjt:  QLASVITLASSLDYRPSNSSLRLLLPLRDPARAFNVPVFPIGPLLVIAHPLASRPPYVLYWLKDQISAEDMLDPTLLEKLVLNGFESVPAKVLLQLASVF

Query:  EEGGLRDRNGTFKYKDHLCQSNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGNPGGPHYAHYDIVGSRLASSQVYPLITDFLNCHDM
        +EGGLRDRNGTF+Y DHL Q+NVPILA+AGDQD ICPPEAVYETVK IPR+ VSY+VLG PGGPHY+HYD+VGSRLASS+VYPLITDFLN HD+
Subjt:  EEGGLRDRNGTFKYKDHLCQSNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGNPGGPHYAHYDIVGSRLASSQVYPLITDFLNCHDM

XP_022954911.1 uncharacterized protein LOC111457026 isoform X2 [Cucurbita moschata]1.3e-24486.64Show/hide
Query:  MATLTLSRFDLSSIGWSNNRRRHLHRFRRQGKSEASWALCRRNVVAVKSVTAFYGGAYGWSDNKEKGSISTADELHYVSVPNSDWKLALWRYLPSFQAPS
        MATL+LS F LSS+G S+NRRRHLHRFRRQGKSEASWALCRRN+VAVKSVTAFYGG      +KEKG I TADELHYVSVPNSDWKLALWRY P  QA S
Subjt:  MATLTLSRFDLSSIGWSNNRRRHLHRFRRQGKSEASWALCRRNVVAVKSVTAFYGGAYGWSDNKEKGSISTADELHYVSVPNSDWKLALWRYLPSFQAPS

Query:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTHRVEMND-TGHIRPETPEKRPL----------GSSISSKDGQTSTIATQLR
        RNHPLLLLSGVGSNALGYDLSP+SSFARYMSNQGYDTWILEVRGSGLSTHRVE  D TG IR ETPEK+PL          GSSISSK GQ STIATQL 
Subjt:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTHRVEMND-TGHIRPETPEKRPL----------GSSISSKDGQTSTIATQLR

Query:  NWNKNLINIIEGAQQLGPLKPFNLRGVTTALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYATISRCSFNKVDP
        NWNKNLINIIEGAQQLGPLKPFNL+GVT+ALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYA IS CSFNKVDP
Subjt:  NWNKNLINIIEGAQQLGPLKPFNLRGVTTALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYATISRCSFNKVDP

Query:  QLASVITLASSLDYRPSNSSLRLLLPLRDPARAFNVPVFPIGPLLVIAHPLASRPPYVLYWLKDQISAEDMLDPTLLEKLVLNGFESVPAKVLLQLASVF
        QLASV+TLASSLDYRPSNSSLRLLLPLRDPA+AFNVPVFPIGPLL IAHPLASRPPY+LYWLKDQIS EDMLDPTLLEKLVLNGFESVPAKVLLQL+SVF
Subjt:  QLASVITLASSLDYRPSNSSLRLLLPLRDPARAFNVPVFPIGPLLVIAHPLASRPPYVLYWLKDQISAEDMLDPTLLEKLVLNGFESVPAKVLLQLASVF

Query:  EEGGLRDRNGTFKYKDHLCQSNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGNPGGPHYAHYDIVGSRLASSQVYPLITDFLNCHDM
        EEGGLRDRNGTF+Y DHL Q+NVPILA+AGDQD ICPPEAVYETVK IPR+ VSY+VLG PGGPHY+HYD+VGSRLASS+VYPLITDFLN HD+
Subjt:  EEGGLRDRNGTFKYKDHLCQSNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGNPGGPHYAHYDIVGSRLASSQVYPLITDFLNCHDM

XP_022994270.1 uncharacterized protein LOC111490055 isoform X1 [Cucurbita maxima]1.7e-24988.26Show/hide
Query:  MATLTLSRFDLSSIGWSNNRRRHLHRFRRQGKSEASWALCRRNVVAVKSVTAFYGGAYGWSDNKEKGSISTADELHYVSVPNSDWKLALWRYLPSFQAPS
        MATL+LSRFDLSSIG S+NRRRHLHRFRRQGKSEASWALCRRN+VAVKSVTAFYGG      NKEKGSI TADELHYVSVPNSDWKLALWRY PS QA S
Subjt:  MATLTLSRFDLSSIGWSNNRRRHLHRFRRQGKSEASWALCRRNVVAVKSVTAFYGGAYGWSDNKEKGSISTADELHYVSVPNSDWKLALWRYLPSFQAPS

Query:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTHRVEMNDT-GHIRPETPEKRPL----------GSSISSKDGQTSTIATQLR
        RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTHRVE  DT G IR ETPEK+PL          GSS+SSK GQ STIATQL 
Subjt:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTHRVEMNDT-GHIRPETPEKRPL----------GSSISSKDGQTSTIATQLR

Query:  NWNKNLINIIEGAQQLGPLKPFNLRGVTTALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYATISRCSFNKVDP
        NWNKNLINIIEGAQQLGPLKPFNL+GVT+ALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYATISRCSFNKVDP
Subjt:  NWNKNLINIIEGAQQLGPLKPFNLRGVTTALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYATISRCSFNKVDP

Query:  QLASVITLASSLDYRPSNSSLRLLLPLRDPARAFNVPVFPIGPLLVIAHPLASRPPYVLYWLKDQISAEDMLDPTLLEKLVLNGFESVPAKVLLQLASVF
        QLASV+TLASSLDYRPSNSSLRLLLPLRDPA+AFNVPVFPIGPLL IAHPLASRPPY+LYWLKDQIS EDMLDPTLLEKLVLNGFESVPAKVLLQL+SVF
Subjt:  QLASVITLASSLDYRPSNSSLRLLLPLRDPARAFNVPVFPIGPLLVIAHPLASRPPYVLYWLKDQISAEDMLDPTLLEKLVLNGFESVPAKVLLQLASVF

Query:  EEGGLRDRNGTFKYKDHLCQSNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGNPGGPHYAHYDIVGSRLASSQVYPLITDFLNCHDM
        EEGGLRDRNGTF+Y DHL Q+NVPILA+AGDQD ICPPEAVYETVK IPR+ VSY+VLG PGGPHY+HYD+VGSRLASS VYPLITDFLN HD+
Subjt:  EEGGLRDRNGTFKYKDHLCQSNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGNPGGPHYAHYDIVGSRLASSQVYPLITDFLNCHDM

XP_022994272.1 uncharacterized protein LOC111490055 isoform X2 [Cucurbita maxima]1.7e-24988.26Show/hide
Query:  MATLTLSRFDLSSIGWSNNRRRHLHRFRRQGKSEASWALCRRNVVAVKSVTAFYGGAYGWSDNKEKGSISTADELHYVSVPNSDWKLALWRYLPSFQAPS
        MATL+LSRFDLSSIG S+NRRRHLHRFRRQGKSEASWALCRRN+VAVKSVTAFYGG      NKEKGSI TADELHYVSVPNSDWKLALWRY PS QA S
Subjt:  MATLTLSRFDLSSIGWSNNRRRHLHRFRRQGKSEASWALCRRNVVAVKSVTAFYGGAYGWSDNKEKGSISTADELHYVSVPNSDWKLALWRYLPSFQAPS

Query:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTHRVEMNDT-GHIRPETPEKRPL----------GSSISSKDGQTSTIATQLR
        RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTHRVE  DT G IR ETPEK+PL          GSS+SSK GQ STIATQL 
Subjt:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTHRVEMNDT-GHIRPETPEKRPL----------GSSISSKDGQTSTIATQLR

Query:  NWNKNLINIIEGAQQLGPLKPFNLRGVTTALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYATISRCSFNKVDP
        NWNKNLINIIEGAQQLGPLKPFNL+GVT+ALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYATISRCSFNKVDP
Subjt:  NWNKNLINIIEGAQQLGPLKPFNLRGVTTALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYATISRCSFNKVDP

Query:  QLASVITLASSLDYRPSNSSLRLLLPLRDPARAFNVPVFPIGPLLVIAHPLASRPPYVLYWLKDQISAEDMLDPTLLEKLVLNGFESVPAKVLLQLASVF
        QLASV+TLASSLDYRPSNSSLRLLLPLRDPA+AFNVPVFPIGPLL IAHPLASRPPY+LYWLKDQIS EDMLDPTLLEKLVLNGFESVPAKVLLQL+SVF
Subjt:  QLASVITLASSLDYRPSNSSLRLLLPLRDPARAFNVPVFPIGPLLVIAHPLASRPPYVLYWLKDQISAEDMLDPTLLEKLVLNGFESVPAKVLLQLASVF

Query:  EEGGLRDRNGTFKYKDHLCQSNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGNPGGPHYAHYDIVGSRLASSQVYPLITDFLNCHDM
        EEGGLRDRNGTF+Y DHL Q+NVPILA+AGDQD ICPPEAVYETVK IPR+ VSY+VLG PGGPHY+HYD+VGSRLASS VYPLITDFLN HD+
Subjt:  EEGGLRDRNGTFKYKDHLCQSNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGNPGGPHYAHYDIVGSRLASSQVYPLITDFLNCHDM

XP_023541739.1 uncharacterized protein LOC111801808 [Cucurbita pepo subsp. pepo]9.5e-24887.65Show/hide
Query:  MATLTLSRFDLSSIGWSNNRRRHLHRFRRQGKSEASWALCRRNVVAVKSVTAFYGGAYGWSDNKEKGSISTADELHYVSVPNSDWKLALWRYLPSFQAPS
        MATL+LS FDLSSIG S+NRRR+LHRFRRQGKSEASWAL RRN+VAVKSVTAFYGG      NKEKGSI TADELHYVSVPNSDWKLALWRY PS QA S
Subjt:  MATLTLSRFDLSSIGWSNNRRRHLHRFRRQGKSEASWALCRRNVVAVKSVTAFYGGAYGWSDNKEKGSISTADELHYVSVPNSDWKLALWRYLPSFQAPS

Query:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTHRVEMND-TGHIRPETPEKRPL----------GSSISSKDGQTSTIATQLR
        RNHPLLLLSGVGSNALGYDLSP+SSFARYMSNQGYDTWILEVRGSGLSTHRVE  D TG IR ETP+K+PL          GSSISSK GQ STIATQL 
Subjt:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTHRVEMND-TGHIRPETPEKRPL----------GSSISSKDGQTSTIATQLR

Query:  NWNKNLINIIEGAQQLGPLKPFNLRGVTTALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYATISRCSFNKVDP
        NWNKNLINIIEGAQQLGPLKPFNL+GVT+ALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYATISRCSFNKVDP
Subjt:  NWNKNLINIIEGAQQLGPLKPFNLRGVTTALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYATISRCSFNKVDP

Query:  QLASVITLASSLDYRPSNSSLRLLLPLRDPARAFNVPVFPIGPLLVIAHPLASRPPYVLYWLKDQISAEDMLDPTLLEKLVLNGFESVPAKVLLQLASVF
        QLASV+TLASSLDYRPSNSSLRLLLPLRDPA+AFNVPVFPIGPLL IAHPLASRPPY+LYWLKDQIS EDMLDPTLLEKLVLNGFESVPAKVLLQL+SVF
Subjt:  QLASVITLASSLDYRPSNSSLRLLLPLRDPARAFNVPVFPIGPLLVIAHPLASRPPYVLYWLKDQISAEDMLDPTLLEKLVLNGFESVPAKVLLQLASVF

Query:  EEGGLRDRNGTFKYKDHLCQSNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGNPGGPHYAHYDIVGSRLASSQVYPLITDFLNCHDM
        EEGGLRDRNGTF+Y DHLCQ+NVPILA+AGDQD ICPPEAVYETVK IPR+ VSY+VLG PGGPHY+HYD+VGSRLASS+VYPLITDFLN HD+
Subjt:  EEGGLRDRNGTFKYKDHLCQSNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGNPGGPHYAHYDIVGSRLASSQVYPLITDFLNCHDM

TrEMBL top hitse value%identityAlignment
A0A0A0LVT9 Uncharacterized protein8.7e-23183.57Show/hide
Query:  MATLTLSRFDLSSIGWSNNRRRHLHRFRRQGKSEASWALCRRNVVAVKSVTAFYGGAYGWSDNKEKGSISTADELHYVSVPNSDWKLALWRYLPSFQAPS
        MATL L RFDLSSIG S N  RHLHRFR QG  E SW L RRNVVAVKSV AFYGGAYG +DNK  G I TADELHYVSVPNSDWKLALWRY PS +APS
Subjt:  MATLTLSRFDLSSIGWSNNRRRHLHRFRRQGKSEASWALCRRNVVAVKSVTAFYGGAYGWSDNKEKGSISTADELHYVSVPNSDWKLALWRYLPSFQAPS

Query:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTHRVEMNDTGHIRPETPEKRPL----------GSSISSKDGQTSTIATQLRN
        RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRG GLST       T  I  ET EK+PL          GS++SS+DGQTS IATQL  
Subjt:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTHRVEMNDTGHIRPETPEKRPL----------GSSISSKDGQTSTIATQLRN

Query:  WNKNLINIIEGAQQLGPLKPFNLRGVTTALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYATISRCSFNKVDPQ
        WNKNLINII+GAQQLGP +PF ++GVT+ALE+FQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYA ISRCSF KVDPQ
Subjt:  WNKNLINIIEGAQQLGPLKPFNLRGVTTALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYATISRCSFNKVDPQ

Query:  LASVITLASSLDYRPSNSSLRLLLPLRDPARAFNVPVFPIGPLLVIAHPLASRPPYVLYWLKDQISAEDMLDPTLLEKLVLNGFESVPAKVLLQLASVFE
        LASV+TLASSLDYRPSNSSLRLLLPL+DPA+ FNVPVFPIGPLLVIAHPLASRPPYVL WLKDQ+S EDML PTLLEKLVLNGF SVPAKVLLQL+SVFE
Subjt:  LASVITLASSLDYRPSNSSLRLLLPLRDPARAFNVPVFPIGPLLVIAHPLASRPPYVLYWLKDQISAEDMLDPTLLEKLVLNGFESVPAKVLLQLASVFE

Query:  EGGLRDRNGTFKYKDHLCQSNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGNPGGPHYAHYDIVGSRLASSQVYPLITDFLNCHDM
        +GGLRDRNGTF+YKDHL Q NVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLG PGGPHYAHYDIVGSRLASS+VYPL+ DFLN HDM
Subjt:  EGGLRDRNGTFKYKDHLCQSNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGNPGGPHYAHYDIVGSRLASSQVYPLITDFLNCHDM

A0A6J1GS92 uncharacterized protein LOC111457026 isoform X26.2e-24586.64Show/hide
Query:  MATLTLSRFDLSSIGWSNNRRRHLHRFRRQGKSEASWALCRRNVVAVKSVTAFYGGAYGWSDNKEKGSISTADELHYVSVPNSDWKLALWRYLPSFQAPS
        MATL+LS F LSS+G S+NRRRHLHRFRRQGKSEASWALCRRN+VAVKSVTAFYGG      +KEKG I TADELHYVSVPNSDWKLALWRY P  QA S
Subjt:  MATLTLSRFDLSSIGWSNNRRRHLHRFRRQGKSEASWALCRRNVVAVKSVTAFYGGAYGWSDNKEKGSISTADELHYVSVPNSDWKLALWRYLPSFQAPS

Query:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTHRVEMND-TGHIRPETPEKRPL----------GSSISSKDGQTSTIATQLR
        RNHPLLLLSGVGSNALGYDLSP+SSFARYMSNQGYDTWILEVRGSGLSTHRVE  D TG IR ETPEK+PL          GSSISSK GQ STIATQL 
Subjt:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTHRVEMND-TGHIRPETPEKRPL----------GSSISSKDGQTSTIATQLR

Query:  NWNKNLINIIEGAQQLGPLKPFNLRGVTTALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYATISRCSFNKVDP
        NWNKNLINIIEGAQQLGPLKPFNL+GVT+ALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYA IS CSFNKVDP
Subjt:  NWNKNLINIIEGAQQLGPLKPFNLRGVTTALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYATISRCSFNKVDP

Query:  QLASVITLASSLDYRPSNSSLRLLLPLRDPARAFNVPVFPIGPLLVIAHPLASRPPYVLYWLKDQISAEDMLDPTLLEKLVLNGFESVPAKVLLQLASVF
        QLASV+TLASSLDYRPSNSSLRLLLPLRDPA+AFNVPVFPIGPLL IAHPLASRPPY+LYWLKDQIS EDMLDPTLLEKLVLNGFESVPAKVLLQL+SVF
Subjt:  QLASVITLASSLDYRPSNSSLRLLLPLRDPARAFNVPVFPIGPLLVIAHPLASRPPYVLYWLKDQISAEDMLDPTLLEKLVLNGFESVPAKVLLQLASVF

Query:  EEGGLRDRNGTFKYKDHLCQSNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGNPGGPHYAHYDIVGSRLASSQVYPLITDFLNCHDM
        EEGGLRDRNGTF+Y DHL Q+NVPILA+AGDQD ICPPEAVYETVK IPR+ VSY+VLG PGGPHY+HYD+VGSRLASS+VYPLITDFLN HD+
Subjt:  EEGGLRDRNGTFKYKDHLCQSNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGNPGGPHYAHYDIVGSRLASSQVYPLITDFLNCHDM

A0A6J1GTQ4 uncharacterized protein LOC111457026 isoform X16.2e-24586.64Show/hide
Query:  MATLTLSRFDLSSIGWSNNRRRHLHRFRRQGKSEASWALCRRNVVAVKSVTAFYGGAYGWSDNKEKGSISTADELHYVSVPNSDWKLALWRYLPSFQAPS
        MATL+LS F LSS+G S+NRRRHLHRFRRQGKSEASWALCRRN+VAVKSVTAFYGG      +KEKG I TADELHYVSVPNSDWKLALWRY P  QA S
Subjt:  MATLTLSRFDLSSIGWSNNRRRHLHRFRRQGKSEASWALCRRNVVAVKSVTAFYGGAYGWSDNKEKGSISTADELHYVSVPNSDWKLALWRYLPSFQAPS

Query:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTHRVEMND-TGHIRPETPEKRPL----------GSSISSKDGQTSTIATQLR
        RNHPLLLLSGVGSNALGYDLSP+SSFARYMSNQGYDTWILEVRGSGLSTHRVE  D TG IR ETPEK+PL          GSSISSK GQ STIATQL 
Subjt:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTHRVEMND-TGHIRPETPEKRPL----------GSSISSKDGQTSTIATQLR

Query:  NWNKNLINIIEGAQQLGPLKPFNLRGVTTALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYATISRCSFNKVDP
        NWNKNLINIIEGAQQLGPLKPFNL+GVT+ALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYA IS CSFNKVDP
Subjt:  NWNKNLINIIEGAQQLGPLKPFNLRGVTTALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYATISRCSFNKVDP

Query:  QLASVITLASSLDYRPSNSSLRLLLPLRDPARAFNVPVFPIGPLLVIAHPLASRPPYVLYWLKDQISAEDMLDPTLLEKLVLNGFESVPAKVLLQLASVF
        QLASV+TLASSLDYRPSNSSLRLLLPLRDPA+AFNVPVFPIGPLL IAHPLASRPPY+LYWLKDQIS EDMLDPTLLEKLVLNGFESVPAKVLLQL+SVF
Subjt:  QLASVITLASSLDYRPSNSSLRLLLPLRDPARAFNVPVFPIGPLLVIAHPLASRPPYVLYWLKDQISAEDMLDPTLLEKLVLNGFESVPAKVLLQLASVF

Query:  EEGGLRDRNGTFKYKDHLCQSNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGNPGGPHYAHYDIVGSRLASSQVYPLITDFLNCHDM
        EEGGLRDRNGTF+Y DHL Q+NVPILA+AGDQD ICPPEAVYETVK IPR+ VSY+VLG PGGPHY+HYD+VGSRLASS+VYPLITDFLN HD+
Subjt:  EEGGLRDRNGTFKYKDHLCQSNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGNPGGPHYAHYDIVGSRLASSQVYPLITDFLNCHDM

A0A6J1JVB9 uncharacterized protein LOC111490055 isoform X28.4e-25088.26Show/hide
Query:  MATLTLSRFDLSSIGWSNNRRRHLHRFRRQGKSEASWALCRRNVVAVKSVTAFYGGAYGWSDNKEKGSISTADELHYVSVPNSDWKLALWRYLPSFQAPS
        MATL+LSRFDLSSIG S+NRRRHLHRFRRQGKSEASWALCRRN+VAVKSVTAFYGG      NKEKGSI TADELHYVSVPNSDWKLALWRY PS QA S
Subjt:  MATLTLSRFDLSSIGWSNNRRRHLHRFRRQGKSEASWALCRRNVVAVKSVTAFYGGAYGWSDNKEKGSISTADELHYVSVPNSDWKLALWRYLPSFQAPS

Query:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTHRVEMNDT-GHIRPETPEKRPL----------GSSISSKDGQTSTIATQLR
        RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTHRVE  DT G IR ETPEK+PL          GSS+SSK GQ STIATQL 
Subjt:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTHRVEMNDT-GHIRPETPEKRPL----------GSSISSKDGQTSTIATQLR

Query:  NWNKNLINIIEGAQQLGPLKPFNLRGVTTALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYATISRCSFNKVDP
        NWNKNLINIIEGAQQLGPLKPFNL+GVT+ALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYATISRCSFNKVDP
Subjt:  NWNKNLINIIEGAQQLGPLKPFNLRGVTTALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYATISRCSFNKVDP

Query:  QLASVITLASSLDYRPSNSSLRLLLPLRDPARAFNVPVFPIGPLLVIAHPLASRPPYVLYWLKDQISAEDMLDPTLLEKLVLNGFESVPAKVLLQLASVF
        QLASV+TLASSLDYRPSNSSLRLLLPLRDPA+AFNVPVFPIGPLL IAHPLASRPPY+LYWLKDQIS EDMLDPTLLEKLVLNGFESVPAKVLLQL+SVF
Subjt:  QLASVITLASSLDYRPSNSSLRLLLPLRDPARAFNVPVFPIGPLLVIAHPLASRPPYVLYWLKDQISAEDMLDPTLLEKLVLNGFESVPAKVLLQLASVF

Query:  EEGGLRDRNGTFKYKDHLCQSNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGNPGGPHYAHYDIVGSRLASSQVYPLITDFLNCHDM
        EEGGLRDRNGTF+Y DHL Q+NVPILA+AGDQD ICPPEAVYETVK IPR+ VSY+VLG PGGPHY+HYD+VGSRLASS VYPLITDFLN HD+
Subjt:  EEGGLRDRNGTFKYKDHLCQSNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGNPGGPHYAHYDIVGSRLASSQVYPLITDFLNCHDM

A0A6J1K0R1 uncharacterized protein LOC111490055 isoform X18.4e-25088.26Show/hide
Query:  MATLTLSRFDLSSIGWSNNRRRHLHRFRRQGKSEASWALCRRNVVAVKSVTAFYGGAYGWSDNKEKGSISTADELHYVSVPNSDWKLALWRYLPSFQAPS
        MATL+LSRFDLSSIG S+NRRRHLHRFRRQGKSEASWALCRRN+VAVKSVTAFYGG      NKEKGSI TADELHYVSVPNSDWKLALWRY PS QA S
Subjt:  MATLTLSRFDLSSIGWSNNRRRHLHRFRRQGKSEASWALCRRNVVAVKSVTAFYGGAYGWSDNKEKGSISTADELHYVSVPNSDWKLALWRYLPSFQAPS

Query:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTHRVEMNDT-GHIRPETPEKRPL----------GSSISSKDGQTSTIATQLR
        RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTHRVE  DT G IR ETPEK+PL          GSS+SSK GQ STIATQL 
Subjt:  RNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTHRVEMNDT-GHIRPETPEKRPL----------GSSISSKDGQTSTIATQLR

Query:  NWNKNLINIIEGAQQLGPLKPFNLRGVTTALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYATISRCSFNKVDP
        NWNKNLINIIEGAQQLGPLKPFNL+GVT+ALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYATISRCSFNKVDP
Subjt:  NWNKNLINIIEGAQQLGPLKPFNLRGVTTALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYATISRCSFNKVDP

Query:  QLASVITLASSLDYRPSNSSLRLLLPLRDPARAFNVPVFPIGPLLVIAHPLASRPPYVLYWLKDQISAEDMLDPTLLEKLVLNGFESVPAKVLLQLASVF
        QLASV+TLASSLDYRPSNSSLRLLLPLRDPA+AFNVPVFPIGPLL IAHPLASRPPY+LYWLKDQIS EDMLDPTLLEKLVLNGFESVPAKVLLQL+SVF
Subjt:  QLASVITLASSLDYRPSNSSLRLLLPLRDPARAFNVPVFPIGPLLVIAHPLASRPPYVLYWLKDQISAEDMLDPTLLEKLVLNGFESVPAKVLLQLASVF

Query:  EEGGLRDRNGTFKYKDHLCQSNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGNPGGPHYAHYDIVGSRLASSQVYPLITDFLNCHDM
        EEGGLRDRNGTF+Y DHL Q+NVPILA+AGDQD ICPPEAVYETVK IPR+ VSY+VLG PGGPHY+HYD+VGSRLASS VYPLITDFLN HD+
Subjt:  EEGGLRDRNGTFKYKDHLCQSNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGNPGGPHYAHYDIVGSRLASSQVYPLITDFLNCHDM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15060.1 Uncharacterised conserved protein UCP031088, alpha/beta hydrolase6.6e-13850Show/hide
Query:  KGSISTADELHYVSVPNSDWKLALWRYLPSFQAPSRNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTHRVEMNDTGHI-----
        K S+ TADELHYVSVPN+DW+LALWRYLP  QAP+RNHPLLLLSGVG+NA+GYDLSP  SFAR+MS QG++TWILEVRG+GLST   ++ D         
Subjt:  KGSISTADELHYVSVPNSDWKLALWRYLPSFQAPSRNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTHRVEMNDTGHI-----

Query:  -RPETPEKRPLGSSISSKDGQT------------------------------------------------------------------------------
         + E+  +   G    S + QT                                                                              
Subjt:  -RPETPEKRPLGSSISSKDGQT------------------------------------------------------------------------------

Query:  ---------------STIATQLRNWNKNLINIIEGAQQ--LGPLKPFNLRGVTTALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLL
                       S +  Q+R+  + L+N+ +  Q+    PL     R +T  +EDFQ+QLD+  KYDWDFDHYLEEDVPAA+EY+R QSKP DGKL 
Subjt:  ---------------STIATQLRNWNKNLINIIEGAQQ--LGPLKPFNLRGVTTALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLL

Query:  AIGHSMGGILLYATISRCSFNKVDPQLASVITLASSLDYRPSNSSLRLLLPLRDPARAFNVPVFPIGPLLVIAHPLASRPPYVLYWLKDQISAEDMLDPT
        AIGHSMGGILLYA +SRC+F   +P +A+V TLASS+DY  SNS+L+LL+PL +PA A +VPV P+G LL  A PL++RPPYVL WL D IS+ DM+ P 
Subjt:  AIGHSMGGILLYATISRCSFNKVDPQLASVITLASSLDYRPSNSSLRLLLPLRDPARAFNVPVFPIGPLLVIAHPLASRPPYVLYWLKDQISAEDMLDPT

Query:  LLEKLVLNGFESVPAKVLLQLASVFEEGGLRDRNGTFKYKDHLCQSNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGNPGGPHYAHYDIVGSR
        +LEKLVLN F ++PAK+L+QL + F EGGLRDR+G F YKDHL +++VP+LALAGD+DLICPP AV +TVK  P  LV+YK+LG P GPHYAHYD+VG R
Subjt:  LLEKLVLNGFESVPAKVLLQLASVFEEGGLRDRNGTFKYKDHLCQSNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGNPGGPHYAHYDIVGSR

Query:  LASSQVYPLITDFLNCHD
        LA  QVYP IT+FL+ HD
Subjt:  LASSQVYPLITDFLNCHD

AT1G73750.1 Uncharacterised conserved protein UCP031088, alpha/beta hydrolase4.6e-13153.72Show/hide
Query:  ISTADELHYVSVPNSDWKLALWRYLPSFQAPSRNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTHRVEMNDTGHIRPETPEKR
        I TADELHYV VPNSDW++ALWRYLPS +AP RNHPLLLLSG+G+NA+ YDLSPE SFAR MS  G+DTWILE+RG+GLS+  V+ N             
Subjt:  ISTADELHYVSVPNSDWKLALWRYLPSFQAPSRNHPLLLLSGVGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTHRVEMNDTGHIRPETPEKR

Query:  PLGSSISSKDGQTSTIATQLRNW---NKNLINIIEGAQQLGPLKPFNLRGVTTALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLA
             +   + Q   ++  L N+   ++ L N+++G  ++  ++      ++    DF+++ ++   Y+WDFD+YLEEDVP+AM+Y+R Q+K  DGKLLA
Subjt:  PLGSSISSKDGQTSTIATQLRNW---NKNLINIIEGAQQLGPLKPFNLRGVTTALEDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLA

Query:  IGHSMGGILLYATISRCSFNKVDPQLASVITLASSLDYRPSNSSLRLLLPLRDPARAFNVPVFPIGPLLVIAHPLASRPPYVLYWLKDQISAEDMLDPTL
        +GHSMGGILLYA +SRC F  +D  LA V TLAS+ DY  S + L+ LLP+++PA+A N+P+ PI  +L +AHPL  RPPY L WL   ISA  M+DP +
Subjt:  IGHSMGGILLYATISRCSFNKVDPQLASVITLASSLDYRPSNSSLRLLLPLRDPARAFNVPVFPIGPLLVIAHPLASRPPYVLYWLKDQISAEDMLDPTL

Query:  LEKLVLNGFESVPAKVLLQLASVFEEGGLRDRNGTFKYKDHLCQSNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGNPGGPHYAHYDIVGSRL
        +EKLVLN   +VP K+LLQL +  + GGLRDR GTF YKDH+ ++NVPILALAGD D+ICPP+AVY+TVK IP  L +YKV+G+PGGPHY H D++  R 
Subjt:  LEKLVLNGFESVPAKVLLQLASVFEEGGLRDRNGTFKYKDHLCQSNVPILALAGDQDLICPPEAVYETVKEIPRQLVSYKVLGNPGGPHYAHYDIVGSRL

Query:  ASSQVYPLITDFLNCHD
        A ++VYPLIT FL   D
Subjt:  ASSQVYPLITDFLNCHD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGACCCTTACGTTATCTCGTTTCGATCTCTCTTCGATCGGCTGGAGCAACAACCGCCGCCGTCACCTCCACCGATTCAGGCGACAAGGGAAATCGGAAGCCTCGTG
GGCCTTATGCCGACGGAATGTGGTCGCCGTGAAGTCCGTCACGGCGTTTTACGGCGGCGCGTATGGTTGGTCTGACAATAAGGAGAAGGGTTCGATCTCTACTGCCGACG
AGCTTCATTATGTCTCTGTTCCCAACTCTGATTGGAAGCTCGCTCTCTGGCGTTATCTCCCTTCTTTTCAGGCGCCGTCAAGGAATCATCCGCTTTTGCTGTTATCAGGG
GTTGGGAGCAACGCTCTTGGATACGACCTTTCTCCAGAGTCCTCATTTGCTCGCTACATGTCCAACCAAGGATATGATACATGGATTCTTGAAGTTCGAGGTTCGGGACT
GAGCACACACAGAGTAGAAATGAATGATACTGGGCACATACGGCCAGAAACTCCAGAGAAACGGCCATTAGGTTCCAGCATTTCTTCAAAAGATGGACAGACTTCTACTA
TTGCTACTCAACTTAGGAACTGGAATAAGAATCTTATCAATATAATTGAAGGAGCTCAACAACTGGGTCCATTGAAGCCTTTTAATTTAAGAGGTGTTACCACTGCATTA
GAAGACTTCCAGGAACAACTTGATGTGTATGAGAAGTATGATTGGGACTTCGACCACTATTTGGAAGAAGATGTGCCTGCTGCGATGGAGTACATAAGGAACCAATCCAA
ACCAAATGATGGCAAGTTACTAGCCATCGGCCACTCGATGGGGGGTATCTTGCTATATGCAACGATCTCGCGCTGTAGCTTTAACAAAGTTGACCCGCAGTTGGCATCAG
TTATTACGTTGGCTTCATCACTTGACTACAGACCTTCAAATTCATCACTCAGACTTCTGTTACCTTTGAGAGATCCTGCTCGAGCTTTTAATGTTCCCGTGTTTCCCATC
GGGCCATTGCTTGTTATTGCTCATCCTCTCGCATCACGTCCTCCTTATGTCCTGTATTGGTTAAAGGATCAAATCTCTGCAGAAGACATGTTAGATCCCACCTTGCTCGA
GAAGCTGGTGTTGAATGGCTTTGAATCTGTGCCTGCAAAGGTTCTCCTTCAGCTAGCGTCAGTTTTTGAAGAGGGGGGCTTACGTGACAGGAACGGAACATTCAAGTACA
AGGACCATTTATGCCAAAGCAATGTTCCAATCCTTGCACTTGCTGGAGACCAAGACCTCATTTGTCCACCTGAAGCTGTATACGAAACTGTGAAAGAAATTCCCAGGCAG
TTGGTTTCCTACAAAGTTCTTGGGAATCCTGGTGGTCCTCATTATGCTCACTATGATATTGTGGGAAGTCGTTTGGCATCAAGTCAAGTATATCCATTGATAACCGATTT
TCTCAACTGCCACGACATGGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGACCCTTACGTTATCTCGTTTCGATCTCTCTTCGATCGGCTGGAGCAACAACCGCCGCCGTCACCTCCACCGATTCAGGCGACAAGGGAAATCGGAAGCCTCGTG
GGCCTTATGCCGACGGAATGTGGTCGCCGTGAAGTCCGTCACGGCGTTTTACGGCGGCGCGTATGGTTGGTCTGACAATAAGGAGAAGGGTTCGATCTCTACTGCCGACG
AGCTTCATTATGTCTCTGTTCCCAACTCTGATTGGAAGCTCGCTCTCTGGCGTTATCTCCCTTCTTTTCAGGCGCCGTCAAGGAATCATCCGCTTTTGCTGTTATCAGGG
GTTGGGAGCAACGCTCTTGGATACGACCTTTCTCCAGAGTCCTCATTTGCTCGCTACATGTCCAACCAAGGATATGATACATGGATTCTTGAAGTTCGAGGTTCGGGACT
GAGCACACACAGAGTAGAAATGAATGATACTGGGCACATACGGCCAGAAACTCCAGAGAAACGGCCATTAGGTTCCAGCATTTCTTCAAAAGATGGACAGACTTCTACTA
TTGCTACTCAACTTAGGAACTGGAATAAGAATCTTATCAATATAATTGAAGGAGCTCAACAACTGGGTCCATTGAAGCCTTTTAATTTAAGAGGTGTTACCACTGCATTA
GAAGACTTCCAGGAACAACTTGATGTGTATGAGAAGTATGATTGGGACTTCGACCACTATTTGGAAGAAGATGTGCCTGCTGCGATGGAGTACATAAGGAACCAATCCAA
ACCAAATGATGGCAAGTTACTAGCCATCGGCCACTCGATGGGGGGTATCTTGCTATATGCAACGATCTCGCGCTGTAGCTTTAACAAAGTTGACCCGCAGTTGGCATCAG
TTATTACGTTGGCTTCATCACTTGACTACAGACCTTCAAATTCATCACTCAGACTTCTGTTACCTTTGAGAGATCCTGCTCGAGCTTTTAATGTTCCCGTGTTTCCCATC
GGGCCATTGCTTGTTATTGCTCATCCTCTCGCATCACGTCCTCCTTATGTCCTGTATTGGTTAAAGGATCAAATCTCTGCAGAAGACATGTTAGATCCCACCTTGCTCGA
GAAGCTGGTGTTGAATGGCTTTGAATCTGTGCCTGCAAAGGTTCTCCTTCAGCTAGCGTCAGTTTTTGAAGAGGGGGGCTTACGTGACAGGAACGGAACATTCAAGTACA
AGGACCATTTATGCCAAAGCAATGTTCCAATCCTTGCACTTGCTGGAGACCAAGACCTCATTTGTCCACCTGAAGCTGTATACGAAACTGTGAAAGAAATTCCCAGGCAG
TTGGTTTCCTACAAAGTTCTTGGGAATCCTGGTGGTCCTCATTATGCTCACTATGATATTGTGGGAAGTCGTTTGGCATCAAGTCAAGTATATCCATTGATAACCGATTT
TCTCAACTGCCACGACATGGGTTGA
Protein sequenceShow/hide protein sequence
MATLTLSRFDLSSIGWSNNRRRHLHRFRRQGKSEASWALCRRNVVAVKSVTAFYGGAYGWSDNKEKGSISTADELHYVSVPNSDWKLALWRYLPSFQAPSRNHPLLLLSG
VGSNALGYDLSPESSFARYMSNQGYDTWILEVRGSGLSTHRVEMNDTGHIRPETPEKRPLGSSISSKDGQTSTIATQLRNWNKNLINIIEGAQQLGPLKPFNLRGVTTAL
EDFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYATISRCSFNKVDPQLASVITLASSLDYRPSNSSLRLLLPLRDPARAFNVPVFPI
GPLLVIAHPLASRPPYVLYWLKDQISAEDMLDPTLLEKLVLNGFESVPAKVLLQLASVFEEGGLRDRNGTFKYKDHLCQSNVPILALAGDQDLICPPEAVYETVKEIPRQ
LVSYKVLGNPGGPHYAHYDIVGSRLASSQVYPLITDFLNCHDMG