; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0009174 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0009174
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUsp domain-containing protein
Genome locationchr9:36439196..36442614
RNA-Seq ExpressionLag0009174
SyntenyLag0009174
Gene Ontology termsNA
InterPro domainsIPR006016 - UspA
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7010929.1 hypothetical protein SDJN02_27727 [Cucurbita argyrosperma subsp. argyrosperma]3.6e-8492.4Show/hide
Query:  METLDEEEEYNWREVRLPSLIPVVPEPELERETTERRRGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQALMEKLAVEAF
        M+TL+EEEEYNWREVRLPSLIPVVPEPELERET  RRRGRD+++AVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYE+SQ LMEKLAVEAF
Subjt:  METLDEEEEYNWREVRLPSLIPVVPEPELERETTERRRGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQALMEKLAVEAF

Query:  EVAMVRTVARIAQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYVFHNCKSAPVIIVPGKGS
        EVAMVRTVARI QGDAGK+ICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSE+VFHNCKSAPVIIVPGKG+
Subjt:  EVAMVRTVARIAQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYVFHNCKSAPVIIVPGKGS

XP_004148842.1 uncharacterized protein LOC101210790 isoform X1 [Cucumis sativus]1.3e-8494.67Show/hide
Query:  METLDEEEEYNWREVRLPSLIPVVPEPELERETTERRRGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQALMEKLAVEAF
        M+TL+EEEEYNWREVRLPSLIPVVPEPELERET ERRRGRD++IAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQ LMEKLAVEAF
Subjt:  METLDEEEEYNWREVRLPSLIPVVPEPELERETTERRRGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQALMEKLAVEAF

Query:  EVAMVRTVARIAQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYVFHNCKSAPVIIVPGK
        EVAMVRTVARI QGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSE+VFHNCKSAPV+IVPGK
Subjt:  EVAMVRTVARIAQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYVFHNCKSAPVIIVPGK

XP_023512225.1 uncharacterized protein LOC111777016 [Cucurbita pepo subsp. pepo]2.4e-8392.9Show/hide
Query:  METLDEEEEYNWREVRLPSLIPVVPEPELERETTERRRGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQALMEKLAVEAF
        M+TL+EEEEYNWREVRLPSLIPVVPEPELERET  RRRGRD+++AVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYE+SQ LMEKLAVEAF
Subjt:  METLDEEEEYNWREVRLPSLIPVVPEPELERETTERRRGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQALMEKLAVEAF

Query:  EVAMVRTVARIAQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYVFHNCKSAPVIIVPGK
        EVAMVRTVARI QGDAGK+ICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSE+VFHNCKSAPVIIVPGK
Subjt:  EVAMVRTVARIAQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYVFHNCKSAPVIIVPGK

XP_031737219.1 uncharacterized protein LOC101210790 isoform X2 [Cucumis sativus]3.3e-8591.43Show/hide
Query:  METLDEEEEYNWREVRLPSLIPVVPEPELERETTERRRGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQALMEKLAVEAF
        M+TL+EEEEYNWREVRLPSLIPVVPEPELERET ERRRGRD++IAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQ LMEKLAVEAF
Subjt:  METLDEEEEYNWREVRLPSLIPVVPEPELERETTERRRGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQALMEKLAVEAF

Query:  EVAMVRTVARIAQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYVFHNCKSAPVIIVPGKGSRSDI
        EVAMVRTVARI QGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSE+VFHNCKSAPV+IVPGK   + +
Subjt:  EVAMVRTVARIAQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYVFHNCKSAPVIIVPGKGSRSDI

XP_038901616.1 uncharacterized protein LOC120088411 [Benincasa hispida]9.6e-8595.27Show/hide
Query:  METLDEEEEYNWREVRLPSLIPVVPEPELERETTERRRGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQALMEKLAVEAF
        M+TL+EEEEYNWREVRLPSLIPVVPEPELERET ERRRGRD++IAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQ LMEKLAVEAF
Subjt:  METLDEEEEYNWREVRLPSLIPVVPEPELERETTERRRGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQALMEKLAVEAF

Query:  EVAMVRTVARIAQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYVFHNCKSAPVIIVPGK
        EVAMVRTVARI QGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSE+VFHNCKSAPVIIVPGK
Subjt:  EVAMVRTVARIAQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYVFHNCKSAPVIIVPGK

TrEMBL top hitse value%identityAlignment
A0A0A0LL61 Usp domain-containing protein6.1e-8594.67Show/hide
Query:  METLDEEEEYNWREVRLPSLIPVVPEPELERETTERRRGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQALMEKLAVEAF
        M+TL+EEEEYNWREVRLPSLIPVVPEPELERET ERRRGRD++IAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQ LMEKLAVEAF
Subjt:  METLDEEEEYNWREVRLPSLIPVVPEPELERETTERRRGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQALMEKLAVEAF

Query:  EVAMVRTVARIAQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYVFHNCKSAPVIIVPGK
        EVAMVRTVARI QGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSE+VFHNCKSAPV+IVPGK
Subjt:  EVAMVRTVARIAQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYVFHNCKSAPVIIVPGK

A0A1S3CET4 uncharacterized protein LOC1034996766.1e-8594.67Show/hide
Query:  METLDEEEEYNWREVRLPSLIPVVPEPELERETTERRRGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQALMEKLAVEAF
        M+TL+EEEEYNWREVRLPSLIPVVPEPELERET ERRRGRD++IAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQ LMEKLAVEAF
Subjt:  METLDEEEEYNWREVRLPSLIPVVPEPELERETTERRRGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQALMEKLAVEAF

Query:  EVAMVRTVARIAQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYVFHNCKSAPVIIVPGK
        EVAMVRTVARI QGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSE+VFHNCKSAPV+IVPGK
Subjt:  EVAMVRTVARIAQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYVFHNCKSAPVIIVPGK

A0A5D3DYQ0 Universal stress protein PHOS326.1e-8594.67Show/hide
Query:  METLDEEEEYNWREVRLPSLIPVVPEPELERETTERRRGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQALMEKLAVEAF
        M+TL+EEEEYNWREVRLPSLIPVVPEPELERET ERRRGRD++IAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQ LMEKLAVEAF
Subjt:  METLDEEEEYNWREVRLPSLIPVVPEPELERETTERRRGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQALMEKLAVEAF

Query:  EVAMVRTVARIAQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYVFHNCKSAPVIIVPGK
        EVAMVRTVARI QGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSE+VFHNCKSAPV+IVPGK
Subjt:  EVAMVRTVARIAQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYVFHNCKSAPVIIVPGK

A0A6J1FTC2 uncharacterized protein LOC1114482931.1e-8392.9Show/hide
Query:  METLDEEEEYNWREVRLPSLIPVVPEPELERETTERRRGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQALMEKLAVEAF
        M+TL+EEEEYNWREVRLPSLIPVVPEPELERET  RRRGRD+++AVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYE+SQ LMEKLAVEAF
Subjt:  METLDEEEEYNWREVRLPSLIPVVPEPELERETTERRRGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQALMEKLAVEAF

Query:  EVAMVRTVARIAQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYVFHNCKSAPVIIVPGK
        EVAMVRTVARI QGDAGK+ICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSE+VFHNCKSAPVIIVPGK
Subjt:  EVAMVRTVARIAQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYVFHNCKSAPVIIVPGK

A0A6J1JGJ7 uncharacterized protein LOC1114843079.7e-8391.12Show/hide
Query:  METLDEEEEYNWREVRLPSLIPVVPEPELERETTERRRGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQALMEKLAVEAF
        M+TL+EEEEYNWREVRLPSLIPVVPEPELERET  RRRGRD+++AVDHGPNSKHAFDWA+IHFCRLADTIHLVHAVSNVKNELVYE+SQ LMEKLAVEAF
Subjt:  METLDEEEEYNWREVRLPSLIPVVPEPELERETTERRRGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQALMEKLAVEAF

Query:  EVAMVRTVARIAQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYVFHNCKSAPVIIVPGK
        EV+MVRTVARI QGDAG++ICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSE+VFHNCKSAPVIIVPGK
Subjt:  EVAMVRTVARIAQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYVFHNCKSAPVIIVPGK

SwissProt top hitse value%identityAlignment
P87132 Uncharacterized protein C167.051.3e-0424.36Show/hide
Query:  ELERETTERRRGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAV--------------SNVKNELVYEFSQALMEKLAVEAFEVAMVRTVARIAQ
        + +   +  +R     + +D    S HA +WA+    R  DT+ +V  +               + + E + + ++ +++ L+    EV +   +  I  
Subjt:  ELERETTERRRGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAV--------------SNVKNELVYEFSQALMEKLAVEAFEVAMVRTVARIAQ

Query:  GDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYVFHNCKSAPVIIVPGK
          A  +I +  + ++P+ VVMG+RGRS ++ VL GS S Y+  N  S PV++   K
Subjt:  GDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYVFHNCKSAPVIIVPGK

Arabidopsis top hitse value%identityAlignment
AT1G11360.1 Adenine nucleotide alpha hydrolases-like superfamily protein3.0e-0423.08Show/hide
Query:  PSLIPVVP-EPELERETTERRRGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQALM-------------EKLAVEAFEVA
        P+++ V P  P     T      R + IAVD    S +A  WA+ ++ R  D + L+H        ++Y      M             ++   + F++ 
Subjt:  PSLIPVVP-EPELERETTERRRGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQALM-------------EKLAVEAFEVA

Query:  MVRTVARIAQ----------------GDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQ---GSVSEYVFHNCKSAPVIIV
          +  + +AQ                 D  + +C E E+L  + ++MG+RG    +   +   GSVS+Y  H+C + PV++V
Subjt:  MVRTVARIAQ----------------GDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQ---GSVSEYVFHNCKSAPVIIV

AT2G21620.1 Adenine nucleotide alpha hydrolases-like superfamily protein6.4e-7982.84Show/hide
Query:  METLDEEEEYNWREVRLPSLIPVVPEPELERETTERRRGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQALMEKLAVEAF
        ME L E+EEY++REV LPSLIPVVPEPELERE+ ERRRGRD+I+AVDHGPNSKHAFDWAL+HFCRLADT+HLVHAVS+VKN++VYE SQALMEKLAVEA+
Subjt:  METLDEEEEYNWREVRLPSLIPVVPEPELERETTERRRGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQALMEKLAVEAF

Query:  EVAMVRTVARIAQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYVFHNCKSAPVIIVPGK
        +VAMV++VAR+ +GDAGKVICKEAEK+KPAAV++GTRGRSL++SVLQGSVSEY FHNCKSAPVIIVPGK
Subjt:  EVAMVRTVARIAQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYVFHNCKSAPVIIVPGK

AT2G21620.2 Adenine nucleotide alpha hydrolases-like superfamily protein6.0e-7780Show/hide
Query:  METLDEEEEYNWREVRLPSLIPVVPEPELERETTERRRGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSN------VKNELVYEFSQALMEK
        ME L E+EEY++REV LPSLIPVVPEPELERE+ ERRRGRD+I+AVDHGPNSKHAFDWAL+HFCRLADT+HLVHAVS+      VKN++VYE SQALMEK
Subjt:  METLDEEEEYNWREVRLPSLIPVVPEPELERETTERRRGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSN------VKNELVYEFSQALMEK

Query:  LAVEAFEVAMVRTVARIAQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYVFHNCKSAPVIIVPGK
        LAVEA++VAMV++VAR+ +GDAGKVICKEAEK+KPAAV++GTRGRSL++SVLQGSVSEY FHNCKSAPVIIVPGK
Subjt:  LAVEAFEVAMVRTVARIAQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYVFHNCKSAPVIIVPGK

AT3G53990.1 Adenine nucleotide alpha hydrolases-like superfamily protein1.6e-0827.74Show/hide
Query:  RGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAV----SNVKNELVY----------EFSQ-ALMEKLAVEAFEVAM-----------VRTVARI
        + R++ IA+D   +SK+A  WA+ +     DTI+++H +       +N L +          EF +  +MEK  V+     +           V  V ++
Subjt:  RGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAV----SNVKNELVY----------EFSQ-ALMEKLAVEAFEVAM-----------VRTVARI

Query:  AQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYVFHNCKSAPVIIV
          GDA + +    + LK  ++VMG+RG S +Q ++ GSVS +V  +    PV +V
Subjt:  AQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYVFHNCKSAPVIIV

AT5G54430.1 Adenine nucleotide alpha hydrolases-like superfamily protein1.0e-0425.64Show/hide
Query:  RDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSN------------VKNELVYEFSQALMEKLAVEAFEVAMVRTVAR-------------IAQG
        R + +AVD    S  A  WA+ H+ R  D + L+H                +K ++    +Q    +   +AF    V  +A+             +   
Subjt:  RDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSN------------VKNELVYEFSQALMEKLAVEAFEVAMVRTVAR-------------IAQG

Query:  DAGKVICKEAEKLKPAAVVMGTRG----RSLIQSVLQGSVSEYVFHNCKSAPVIIV
        D  + +C E E+L  +AV+MG+RG    +        GSVS+Y  H+C   PV++V
Subjt:  DAGKVICKEAEKLKPAAVVMGTRG----RSLIQSVLQGSVSEYVFHNCKSAPVIIV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGACATTGGATGAAGAAGAAGAGTACAACTGGAGAGAAGTCCGCCTCCCGTCGCTGATCCCGGTGGTGCCGGAGCCGGAGCTGGAGAGAGAGACGACGGAGAGACG
CCGTGGCCGAGACCTCATCATCGCCGTCGACCATGGACCCAACAGCAAACACGCCTTCGATTGGGCTCTGATCCATTTCTGCCGCCTCGCCGACACCATCCATCTAGTCC
ACGCCGTTTCCAATGTGAAGAACGAATTGGTTTATGAGTTCAGCCAGGCGCTGATGGAGAAGCTCGCAGTGGAGGCCTTTGAAGTGGCCATGGTGAGGACTGTGGCAAGG
ATTGCGCAGGGAGATGCAGGGAAGGTTATTTGTAAGGAAGCAGAGAAGTTGAAGCCTGCTGCTGTTGTTATGGGCACCAGAGGAAGAAGTTTGATTCAAAGTGTTTTGCA
GGGAAGTGTGAGTGAGTATGTCTTCCACAACTGCAAATCAGCACCTGTTATAATAGTTCCTGGAAAAGGGTCGAGGTCGGACATCTGGGCTTTAGAGATGGAAAGTGCAA
AAGGGAGAGAAGATAAGGTAATTTTGGACCACTCTGATGCGCAAGGAGCTGACGAGGACAATCGGGTAGAGATAAGACCAGAAAATCGACCCAGAGGAAGACCAGACCAA
AGGGTCGGGCCAAGTATGGTCCGCCTCGGTCGACCATTCGGCCTGCTTGCGCGGGTCGAGCTCGTTCACTTCCATTCGGTCCTTGCTGCCTCTGGCCGCCCCGGTTCCAC
CTGGTTCGTCCTGAAACGCTTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGACATTGGATGAAGAAGAAGAGTACAACTGGAGAGAAGTCCGCCTCCCGTCGCTGATCCCGGTGGTGCCGGAGCCGGAGCTGGAGAGAGAGACGACGGAGAGACG
CCGTGGCCGAGACCTCATCATCGCCGTCGACCATGGACCCAACAGCAAACACGCCTTCGATTGGGCTCTGATCCATTTCTGCCGCCTCGCCGACACCATCCATCTAGTCC
ACGCCGTTTCCAATGTGAAGAACGAATTGGTTTATGAGTTCAGCCAGGCGCTGATGGAGAAGCTCGCAGTGGAGGCCTTTGAAGTGGCCATGGTGAGGACTGTGGCAAGG
ATTGCGCAGGGAGATGCAGGGAAGGTTATTTGTAAGGAAGCAGAGAAGTTGAAGCCTGCTGCTGTTGTTATGGGCACCAGAGGAAGAAGTTTGATTCAAAGTGTTTTGCA
GGGAAGTGTGAGTGAGTATGTCTTCCACAACTGCAAATCAGCACCTGTTATAATAGTTCCTGGAAAAGGGTCGAGGTCGGACATCTGGGCTTTAGAGATGGAAAGTGCAA
AAGGGAGAGAAGATAAGGTAATTTTGGACCACTCTGATGCGCAAGGAGCTGACGAGGACAATCGGGTAGAGATAAGACCAGAAAATCGACCCAGAGGAAGACCAGACCAA
AGGGTCGGGCCAAGTATGGTCCGCCTCGGTCGACCATTCGGCCTGCTTGCGCGGGTCGAGCTCGTTCACTTCCATTCGGTCCTTGCTGCCTCTGGCCGCCCCGGTTCCAC
CTGGTTCGTCCTGAAACGCTTCTGA
Protein sequenceShow/hide protein sequence
METLDEEEEYNWREVRLPSLIPVVPEPELERETTERRRGRDLIIAVDHGPNSKHAFDWALIHFCRLADTIHLVHAVSNVKNELVYEFSQALMEKLAVEAFEVAMVRTVAR
IAQGDAGKVICKEAEKLKPAAVVMGTRGRSLIQSVLQGSVSEYVFHNCKSAPVIIVPGKGSRSDIWALEMESAKGREDKVILDHSDAQGADEDNRVEIRPENRPRGRPDQ
RVGPSMVRLGRPFGLLARVELVHFHSVLAASGRPGSTWFVLKRF