; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032441 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032441
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUlp1-like peptidase
Genome locationchr11:32549063..32550679
RNA-Seq ExpressionLag0032441
SyntenyLag0032441
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN49944.1 hypothetical protein Csa_000148 [Cucumis sativus]2.8e-5237.62Show/hide
Query:  MAHALKIVEADQFRAQVTCL-SHIRNGNKIVKDTFNGRQMNMFRRTVFGRFLDMDIVFNSSLVHHILLREVQTDRTDAMSFKIKGTVVTFSKDDFLLM--
        M   LKI   ++F  Q + + SHI   NKIVK+    +Q+++F+RTVFGRF+DMD+VF+S LVHHILLREV+ +R DAMSF + G + TFSK++FLL+  
Subjt:  MAHALKIVEADQFRAQVTCL-SHIRNGNKIVKDTFNGRQMNMFRRTVFGRFLDMDIVFNSSLVHHILLREVQTDRTDAMSFKIKGTVVTFSKDDFLLM--

Query:  -------------------RRDYGGALFLEFN----------------------------------GNKSRGSVDRTLFEDVENLDYYNNKDWGNILWER
                            + +G  L ++ N                                   ++++  ++++L +DVE+L YYN+ DWG+ILWE+
Subjt:  -------------------RRDYGGALFLEFN----------------------------------GNKSRGSVDRTLFEDVENLDYYNNKDWGNILWER

Query:  TLKGLKVALKDKAKTHKKKVSAGDDRLIKYSLPEIPHTFQVWAYEILSSVPRRVAIRGRKLTVPQFLRWSC----------------MQTVVATSLEMPI
        TL+GL+ ALK+K   +KKKV    +  +KYSLP  PH FQVWAYEI+SS+  +   R     VP+ LRWSC                M+ V++ +L M  
Subjt:  TLKGLKVALKDKAKTHKKKVSAGDDRLIKYSLPEIPHTFQVWAYEILSSVPRRVAIRGRKLTVPQFLRWSC----------------MQTVVATSLEMPI

Query:  EEEKYWDARIDQRPIFVHD
         E+++ D+ +D+RPI+V D
Subjt:  EEEKYWDARIDQRPIFVHD

XP_008437500.1 PREDICTED: uncharacterized protein LOC103482899 isoform X1 [Cucumis melo]1.8e-5137.81Show/hide
Query:  MAHALKIVEADQFRAQVTCL-SHIRNGNKIVKDTFNGRQMNMFRRTVFGRFLDMDIVFNSSLVHHILLREVQTDRTDAMSFKIKGTVVTFSKDDFLLM--
        M   LKI   ++F  Q + + SHI   NKIVK+    +Q+++F+RTVFGRF+DMD+VF+S LVHHILLREV+ +R DAMSF + G + TFSK++FLL+  
Subjt:  MAHALKIVEADQFRAQVTCL-SHIRNGNKIVKDTFNGRQMNMFRRTVFGRFLDMDIVFNSSLVHHILLREVQTDRTDAMSFKIKGTVVTFSKDDFLLM--

Query:  -------------------RRDYGGALFLEFN----------------------------------GNKSRGSVDRTLFEDVENLDYYNNKDWGNILWER
                            + +G  L ++ N                                   ++++  ++++L +DVE+L YYN+ DWG+ILWER
Subjt:  -------------------RRDYGGALFLEFN----------------------------------GNKSRGSVDRTLFEDVENLDYYNNKDWGNILWER

Query:  TLKGLKVALKDKAKTHKKKVSAGDDRLIKYSLPEIPHTFQVWAYEILSSVPR-RVAIRGRKLTVPQFLRWSC----------------MQTVVATSLEMP
        TL+GL+ ALK+K   +KKKV    +  +KYSLP  PH FQVWAYEI+SS+   R   R     VP+FLRWSC                 + V++ +L M 
Subjt:  TLKGLKVALKDKAKTHKKKVSAGDDRLIKYSLPEIPHTFQVWAYEILSSVPR-RVAIRGRKLTVPQFLRWSC----------------MQTVVATSLEMP

Query:  IEEEKYWDARIDQRPIFVHD
          E+++ D ++D+RP++V D
Subjt:  IEEEKYWDARIDQRPIFVHD

XP_011654656.1 uncharacterized protein LOC105435430 isoform X1 [Cucumis sativus]2.8e-5237.62Show/hide
Query:  MAHALKIVEADQFRAQVTCL-SHIRNGNKIVKDTFNGRQMNMFRRTVFGRFLDMDIVFNSSLVHHILLREVQTDRTDAMSFKIKGTVVTFSKDDFLLM--
        M   LKI   ++F  Q + + SHI   NKIVK+    +Q+++F+RTVFGRF+DMD+VF+S LVHHILLREV+ +R DAMSF + G + TFSK++FLL+  
Subjt:  MAHALKIVEADQFRAQVTCL-SHIRNGNKIVKDTFNGRQMNMFRRTVFGRFLDMDIVFNSSLVHHILLREVQTDRTDAMSFKIKGTVVTFSKDDFLLM--

Query:  -------------------RRDYGGALFLEFN----------------------------------GNKSRGSVDRTLFEDVENLDYYNNKDWGNILWER
                            + +G  L ++ N                                   ++++  ++++L +DVE+L YYN+ DWG+ILWE+
Subjt:  -------------------RRDYGGALFLEFN----------------------------------GNKSRGSVDRTLFEDVENLDYYNNKDWGNILWER

Query:  TLKGLKVALKDKAKTHKKKVSAGDDRLIKYSLPEIPHTFQVWAYEILSSVPRRVAIRGRKLTVPQFLRWSC----------------MQTVVATSLEMPI
        TL+GL+ ALK+K   +KKKV    +  +KYSLP  PH FQVWAYEI+SS+  +   R     VP+ LRWSC                M+ V++ +L M  
Subjt:  TLKGLKVALKDKAKTHKKKVSAGDDRLIKYSLPEIPHTFQVWAYEILSSVPRRVAIRGRKLTVPQFLRWSC----------------MQTVVATSLEMPI

Query:  EEEKYWDARIDQRPIFVHD
         E+++ D+ +D+RPI+V D
Subjt:  EEEKYWDARIDQRPIFVHD

XP_031741885.1 uncharacterized protein LOC105435430 isoform X2 [Cucumis sativus]2.8e-5237.62Show/hide
Query:  MAHALKIVEADQFRAQVTCL-SHIRNGNKIVKDTFNGRQMNMFRRTVFGRFLDMDIVFNSSLVHHILLREVQTDRTDAMSFKIKGTVVTFSKDDFLLM--
        M   LKI   ++F  Q + + SHI   NKIVK+    +Q+++F+RTVFGRF+DMD+VF+S LVHHILLREV+ +R DAMSF + G + TFSK++FLL+  
Subjt:  MAHALKIVEADQFRAQVTCL-SHIRNGNKIVKDTFNGRQMNMFRRTVFGRFLDMDIVFNSSLVHHILLREVQTDRTDAMSFKIKGTVVTFSKDDFLLM--

Query:  -------------------RRDYGGALFLEFN----------------------------------GNKSRGSVDRTLFEDVENLDYYNNKDWGNILWER
                            + +G  L ++ N                                   ++++  ++++L +DVE+L YYN+ DWG+ILWE+
Subjt:  -------------------RRDYGGALFLEFN----------------------------------GNKSRGSVDRTLFEDVENLDYYNNKDWGNILWER

Query:  TLKGLKVALKDKAKTHKKKVSAGDDRLIKYSLPEIPHTFQVWAYEILSSVPRRVAIRGRKLTVPQFLRWSC----------------MQTVVATSLEMPI
        TL+GL+ ALK+K   +KKKV    +  +KYSLP  PH FQVWAYEI+SS+  +   R     VP+ LRWSC                M+ V++ +L M  
Subjt:  TLKGLKVALKDKAKTHKKKVSAGDDRLIKYSLPEIPHTFQVWAYEILSSVPRRVAIRGRKLTVPQFLRWSC----------------MQTVVATSLEMPI

Query:  EEEKYWDARIDQRPIFVHD
         E+++ D+ +D+RPI+V D
Subjt:  EEEKYWDARIDQRPIFVHD

XP_038888550.1 uncharacterized protein LOC120078361 isoform X1 [Benincasa hispida]7.4e-5341.2Show/hide
Query:  MAHALKIVEADQFRAQVTCLSHIRNGNKIVKDTFNGRQMNMFRRTVFGRFLDMDIVFNSSLVHHILLREVQTDRTDAMSFKIKGTVVTFSKDDFLLM---
        M  +LKI E D+F  QVTCL+H+ N NKIVK     RQ+ MF+RTVFGRFLD+++VFN+ L+HH+LL EV+    D++SF ++G VVTFSKDDFLL+   
Subjt:  MAHALKIVEADQFRAQVTCLSHIRNGNKIVKDTFNGRQMNMFRRTVFGRFLDMDIVFNSSLVHHILLREVQTDRTDAMSFKIKGTVVTFSKDDFLLM---

Query:  ---------RRDYGGALFLEFNG--------------------------------------------NKSRGSVDRTLFEDVENLDYYNNKDWGNILWER
                   +    LF ++ G                                            NK +  VDRTLF+DVE++DYYN+ DWG I+W+R
Subjt:  ---------RRDYGGALFLEFNG--------------------------------------------NKSRGSVDRTLFEDVENLDYYNNKDWGNILWER

Query:  TLKGLKVALKDKAKTHKKKVSAGDDRLIKYSLPEIPHTFQVWAYEILSSVPRRVAIRGRKLTVPQFLRWSCMQTVVATSLEMPI
        T   LK+ALKDK   +K+K+    + ++KYSL   P  FQVW Y+ILS +   VA R   + +P+ LRWSC  +V   +LE  I
Subjt:  TLKGLKVALKDKAKTHKKKVSAGDDRLIKYSLPEIPHTFQVWAYEILSSVPRRVAIRGRKLTVPQFLRWSCMQTVVATSLEMPI

TrEMBL top hitse value%identityAlignment
A0A0A0KM59 DUF1985 domain-containing protein1.4e-5237.62Show/hide
Query:  MAHALKIVEADQFRAQVTCL-SHIRNGNKIVKDTFNGRQMNMFRRTVFGRFLDMDIVFNSSLVHHILLREVQTDRTDAMSFKIKGTVVTFSKDDFLLM--
        M   LKI   ++F  Q + + SHI   NKIVK+    +Q+++F+RTVFGRF+DMD+VF+S LVHHILLREV+ +R DAMSF + G + TFSK++FLL+  
Subjt:  MAHALKIVEADQFRAQVTCL-SHIRNGNKIVKDTFNGRQMNMFRRTVFGRFLDMDIVFNSSLVHHILLREVQTDRTDAMSFKIKGTVVTFSKDDFLLM--

Query:  -------------------RRDYGGALFLEFN----------------------------------GNKSRGSVDRTLFEDVENLDYYNNKDWGNILWER
                            + +G  L ++ N                                   ++++  ++++L +DVE+L YYN+ DWG+ILWE+
Subjt:  -------------------RRDYGGALFLEFN----------------------------------GNKSRGSVDRTLFEDVENLDYYNNKDWGNILWER

Query:  TLKGLKVALKDKAKTHKKKVSAGDDRLIKYSLPEIPHTFQVWAYEILSSVPRRVAIRGRKLTVPQFLRWSC----------------MQTVVATSLEMPI
        TL+GL+ ALK+K   +KKKV    +  +KYSLP  PH FQVWAYEI+SS+  +   R     VP+ LRWSC                M+ V++ +L M  
Subjt:  TLKGLKVALKDKAKTHKKKVSAGDDRLIKYSLPEIPHTFQVWAYEILSSVPRRVAIRGRKLTVPQFLRWSC----------------MQTVVATSLEMPI

Query:  EEEKYWDARIDQRPIFVHD
         E+++ D+ +D+RPI+V D
Subjt:  EEEKYWDARIDQRPIFVHD

A0A1S3ATU8 uncharacterized protein LOC103482899 isoform X18.8e-5237.81Show/hide
Query:  MAHALKIVEADQFRAQVTCL-SHIRNGNKIVKDTFNGRQMNMFRRTVFGRFLDMDIVFNSSLVHHILLREVQTDRTDAMSFKIKGTVVTFSKDDFLLM--
        M   LKI   ++F  Q + + SHI   NKIVK+    +Q+++F+RTVFGRF+DMD+VF+S LVHHILLREV+ +R DAMSF + G + TFSK++FLL+  
Subjt:  MAHALKIVEADQFRAQVTCL-SHIRNGNKIVKDTFNGRQMNMFRRTVFGRFLDMDIVFNSSLVHHILLREVQTDRTDAMSFKIKGTVVTFSKDDFLLM--

Query:  -------------------RRDYGGALFLEFN----------------------------------GNKSRGSVDRTLFEDVENLDYYNNKDWGNILWER
                            + +G  L ++ N                                   ++++  ++++L +DVE+L YYN+ DWG+ILWER
Subjt:  -------------------RRDYGGALFLEFN----------------------------------GNKSRGSVDRTLFEDVENLDYYNNKDWGNILWER

Query:  TLKGLKVALKDKAKTHKKKVSAGDDRLIKYSLPEIPHTFQVWAYEILSSVPR-RVAIRGRKLTVPQFLRWSC----------------MQTVVATSLEMP
        TL+GL+ ALK+K   +KKKV    +  +KYSLP  PH FQVWAYEI+SS+   R   R     VP+FLRWSC                 + V++ +L M 
Subjt:  TLKGLKVALKDKAKTHKKKVSAGDDRLIKYSLPEIPHTFQVWAYEILSSVPR-RVAIRGRKLTVPQFLRWSC----------------MQTVVATSLEMP

Query:  IEEEKYWDARIDQRPIFVHD
          E+++ D ++D+RP++V D
Subjt:  IEEEKYWDARIDQRPIFVHD

A0A1S3AUB0 uncharacterized protein LOC103482899 isoform X28.8e-5237.81Show/hide
Query:  MAHALKIVEADQFRAQVTCL-SHIRNGNKIVKDTFNGRQMNMFRRTVFGRFLDMDIVFNSSLVHHILLREVQTDRTDAMSFKIKGTVVTFSKDDFLLM--
        M   LKI   ++F  Q + + SHI   NKIVK+    +Q+++F+RTVFGRF+DMD+VF+S LVHHILLREV+ +R DAMSF + G + TFSK++FLL+  
Subjt:  MAHALKIVEADQFRAQVTCL-SHIRNGNKIVKDTFNGRQMNMFRRTVFGRFLDMDIVFNSSLVHHILLREVQTDRTDAMSFKIKGTVVTFSKDDFLLM--

Query:  -------------------RRDYGGALFLEFN----------------------------------GNKSRGSVDRTLFEDVENLDYYNNKDWGNILWER
                            + +G  L ++ N                                   ++++  ++++L +DVE+L YYN+ DWG+ILWER
Subjt:  -------------------RRDYGGALFLEFN----------------------------------GNKSRGSVDRTLFEDVENLDYYNNKDWGNILWER

Query:  TLKGLKVALKDKAKTHKKKVSAGDDRLIKYSLPEIPHTFQVWAYEILSSVPR-RVAIRGRKLTVPQFLRWSC----------------MQTVVATSLEMP
        TL+GL+ ALK+K   +KKKV    +  +KYSLP  PH FQVWAYEI+SS+   R   R     VP+FLRWSC                 + V++ +L M 
Subjt:  TLKGLKVALKDKAKTHKKKVSAGDDRLIKYSLPEIPHTFQVWAYEILSSVPR-RVAIRGRKLTVPQFLRWSC----------------MQTVVATSLEMP

Query:  IEEEKYWDARIDQRPIFVHD
          E+++ D ++D+RP++V D
Subjt:  IEEEKYWDARIDQRPIFVHD

A0A5A7TGU0 Ulp1-like peptidase8.8e-5237.81Show/hide
Query:  MAHALKIVEADQFRAQVTCL-SHIRNGNKIVKDTFNGRQMNMFRRTVFGRFLDMDIVFNSSLVHHILLREVQTDRTDAMSFKIKGTVVTFSKDDFLLM--
        M   LKI   ++F  Q + + SHI   NKIVK+    +Q+++F+RTVFGRF+DMD+VF+S LVHHILLREV+ +R DAMSF + G + TFSK++FLL+  
Subjt:  MAHALKIVEADQFRAQVTCL-SHIRNGNKIVKDTFNGRQMNMFRRTVFGRFLDMDIVFNSSLVHHILLREVQTDRTDAMSFKIKGTVVTFSKDDFLLM--

Query:  -------------------RRDYGGALFLEFN----------------------------------GNKSRGSVDRTLFEDVENLDYYNNKDWGNILWER
                            + +G  L ++ N                                   ++++  ++++L +DVE+L YYN+ DWG+ILWER
Subjt:  -------------------RRDYGGALFLEFN----------------------------------GNKSRGSVDRTLFEDVENLDYYNNKDWGNILWER

Query:  TLKGLKVALKDKAKTHKKKVSAGDDRLIKYSLPEIPHTFQVWAYEILSSVPR-RVAIRGRKLTVPQFLRWSC----------------MQTVVATSLEMP
        TL+GL+ ALK+K   +KKKV    +  +KYSLP  PH FQVWAYEI+SS+   R   R     VP+FLRWSC                 + V++ +L M 
Subjt:  TLKGLKVALKDKAKTHKKKVSAGDDRLIKYSLPEIPHTFQVWAYEILSSVPR-RVAIRGRKLTVPQFLRWSC----------------MQTVVATSLEMP

Query:  IEEEKYWDARIDQRPIFVHD
          E+++ D ++D+RP++V D
Subjt:  IEEEKYWDARIDQRPIFVHD

A0A6J1DSS5 uncharacterized protein LOC1110239697.3e-4639.01Show/hide
Query:  MAHALKIVEADQFRAQVTCLSHIRNGNKIVKDTFNGRQMNMFR-RTVFGRFLDMDIVFNSSLVHHILLREVQTDRTDAMSFKIKGTVVTFSKDDFLLM--
        M H LK+ EAD+F AQVT LSH+   NKI+       Q++MFR RT+FGRF+D+D++F S+LVH+ LLREV   R D M F I GT+VTFSK +FLLM  
Subjt:  MAHALKIVEADQFRAQVTCLSHIRNGNKIVKDTFNGRQMNMFR-RTVFGRFLDMDIVFNSSLVHHILLREVQTDRTDAMSFKIKGTVVTFSKDDFLLM--

Query:  ------------------RRDYGGAL---------------------------------FLEFNGNKSRGSVDRTLFEDVENLDYYNNKDWGNILWERTL
                          RR+Y   +                                  +    NK + +VD+ L+  VE+LDY+NN DWG  +W+RTL
Subjt:  ------------------RRDYGGAL---------------------------------FLEFNGNKSRGSVDRTLFEDVENLDYYNNKDWGNILWERTL

Query:  KGLKVALKDKAKTHKKKVSAGDDRLIKYSLPEIPHTFQVWAYEILSSVPRRVAIRGRKLTVPQFLRWSCMQTVVATSLEMPI
        KGL+ A+KDK   +K KV+      ++YSL   P  FQVWAYEI+ S+ R    R     +P+  R+SC Q++ +  LE  +
Subjt:  KGLKVALKDKAKTHKKKVSAGDDRLIKYSLPEIPHTFQVWAYEILSSVPRRVAIRGRKLTVPQFLRWSCMQTVVATSLEMPI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCACATGCTCTGAAGATTGTCGAGGCGGATCAATTTCGCGCCCAAGTGACTTGCCTGTCTCACATTAGGAATGGCAACAAGATCGTCAAAGACACATTCAACGGCAG
ACAAATGAACATGTTCAGGAGAACCGTATTTGGGCGATTCCTGGACATGGATATTGTGTTTAACAGTTCCTTGGTGCATCACATACTGCTTAGGGAGGTTCAAACAGATC
GAACGGATGCCATGAGTTTCAAAATCAAGGGGACGGTCGTGACCTTCTCGAAGGACGATTTCCTCTTGATGAGACGAGACTATGGAGGAGCCCTATTCCTCGAGTTCAAT
GGGAACAAGAGCAGAGGTTCCGTTGACAGAACACTGTTCGAGGATGTGGAAAACCTGGATTATTACAACAACAAGGATTGGGGTAATATCCTATGGGAGCGAACACTAAA
GGGCCTTAAGGTTGCCTTGAAGGATAAGGCCAAGACTCATAAGAAAAAGGTCAGTGCGGGAGACGATAGGCTGATTAAGTATTCGCTACCCGAAATTCCCCACACCTTTC
AGGTCTGGGCATATGAGATACTGTCGTCTGTTCCGAGGAGGGTTGCCATCAGGGGTAGGAAATTGACAGTGCCCCAGTTTCTCAGATGGTCTTGCATGCAGACGGTTGTT
GCCACGTCCCTTGAGATGCCGATCGAGGAGGAGAAATACTGGGATGCTAGAATCGATCAGCGGCCGATATTTGTACATGACCCGAAGAACGTTCAAGACCTGGACACCGT
GATGGATGATATATTCACCATGGATCAAGACACTAAAGAACCGGACGCACCGACAACACATTCGTCTGATGTGCGATTGATGTCGAGAGCTCTGTTTTCGTGTATTTACG
GAGCCTTGATGCAAGAGTCGCGGGGGTGGATGCTTATGTCGCTGAGTTGGATTCTCGGCATGAACATCGAACTGCATGGTAGTAGGTCGGGTGACCCTAATGACGGTGAC
CACATAGATGACGATGACCATGAGGACCCATCGATACCATCTACCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCACATGCTCTGAAGATTGTCGAGGCGGATCAATTTCGCGCCCAAGTGACTTGCCTGTCTCACATTAGGAATGGCAACAAGATCGTCAAAGACACATTCAACGGCAG
ACAAATGAACATGTTCAGGAGAACCGTATTTGGGCGATTCCTGGACATGGATATTGTGTTTAACAGTTCCTTGGTGCATCACATACTGCTTAGGGAGGTTCAAACAGATC
GAACGGATGCCATGAGTTTCAAAATCAAGGGGACGGTCGTGACCTTCTCGAAGGACGATTTCCTCTTGATGAGACGAGACTATGGAGGAGCCCTATTCCTCGAGTTCAAT
GGGAACAAGAGCAGAGGTTCCGTTGACAGAACACTGTTCGAGGATGTGGAAAACCTGGATTATTACAACAACAAGGATTGGGGTAATATCCTATGGGAGCGAACACTAAA
GGGCCTTAAGGTTGCCTTGAAGGATAAGGCCAAGACTCATAAGAAAAAGGTCAGTGCGGGAGACGATAGGCTGATTAAGTATTCGCTACCCGAAATTCCCCACACCTTTC
AGGTCTGGGCATATGAGATACTGTCGTCTGTTCCGAGGAGGGTTGCCATCAGGGGTAGGAAATTGACAGTGCCCCAGTTTCTCAGATGGTCTTGCATGCAGACGGTTGTT
GCCACGTCCCTTGAGATGCCGATCGAGGAGGAGAAATACTGGGATGCTAGAATCGATCAGCGGCCGATATTTGTACATGACCCGAAGAACGTTCAAGACCTGGACACCGT
GATGGATGATATATTCACCATGGATCAAGACACTAAAGAACCGGACGCACCGACAACACATTCGTCTGATGTGCGATTGATGTCGAGAGCTCTGTTTTCGTGTATTTACG
GAGCCTTGATGCAAGAGTCGCGGGGGTGGATGCTTATGTCGCTGAGTTGGATTCTCGGCATGAACATCGAACTGCATGGTAGTAGGTCGGGTGACCCTAATGACGGTGAC
CACATAGATGACGATGACCATGAGGACCCATCGATACCATCTACCTAA
Protein sequenceShow/hide protein sequence
MAHALKIVEADQFRAQVTCLSHIRNGNKIVKDTFNGRQMNMFRRTVFGRFLDMDIVFNSSLVHHILLREVQTDRTDAMSFKIKGTVVTFSKDDFLLMRRDYGGALFLEFN
GNKSRGSVDRTLFEDVENLDYYNNKDWGNILWERTLKGLKVALKDKAKTHKKKVSAGDDRLIKYSLPEIPHTFQVWAYEILSSVPRRVAIRGRKLTVPQFLRWSCMQTVV
ATSLEMPIEEEKYWDARIDQRPIFVHDPKNVQDLDTVMDDIFTMDQDTKEPDAPTTHSSDVRLMSRALFSCIYGALMQESRGWMLMSLSWILGMNIELHGSRSGDPNDGD
HIDDDDHEDPSIPST