CuGenDBv2

CSPI03G37150 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI03G37150
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Histidine triad nucleotide-binding protein 2
Genome location: Chr3:32237058..32241281
RNA-Seq Expression: CSPI03G37150
Synteny: CSPI03G37150
Gene Ontology terms:
  GO:0006790 - sulfur compound metabolic process (biological process)
  GO:0009150 - purine ribonucleotide metabolic process (biological process)
  GO:0005737 - cytoplasm (cellular component)
  GO:0047627 - adenylylsulfatase activity (molecular function)
InterPro domains:
  IPR001310 - Histidine triad (HIT) protein
  IPR011146 - HIT-like domain
  IPR019808 - Histidine triad, conserved site
  IPR036265 - HIT-like superfamily


Homology
GenBank top hits
KAG7017601.1 Adenylylsulfatase HINT1 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.1e-75; identity: 94.23%)
Query:  MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH
        MVNEDIK  TRLSVLSSH S SLSISMASSE+EAALAAVPSDSPTIFDKIINKEIPST+VFEDDKVLAFRDI+PQAPTHILIIPKVKDGLSGLSKAEERH
Subjt:  MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH

Query:  TEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        TEILG LLY AKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
Subjt:  TEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG

XP_004136603.2 14 kDa zinc-binding protein [Cucumis sativus] (e value: 1.6e-85; identity: 100%)
Query:  MKKEIRAVMVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSG
        MKKEIRAVMVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSG
Subjt:  MKKEIRAVMVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSG

Query:  LSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        LSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
Subjt:  LSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG

XP_008443161.1 PREDICTED: 14 kDa zinc-binding protein [Cucumis melo] (e value: 2.0e-80; identity: 98.72%)
Query:  MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH
        MVNEDIKLRTRLSVLSSH SHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIP+VKDGLSGLSKAEERH
Subjt:  MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH

Query:  TEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        TEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
Subjt:  TEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG

XP_022971276.1 14 kDa zinc-binding protein-like isoform X2 [Cucurbita maxima] (e value: 2.1e-77; identity: 92.59%)
Query:  KEIRAVMVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLS
        ++IRAVMVN DIK  TRLSVLSSH + SLSISMASSE+EAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDI+PQAPTHILIIPKVKDGLSGLS
Subjt:  KEIRAVMVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLS

Query:  KAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        KAEERHTEILGHLLYTAKLIA+QEGLDDGFR+VINDGPSGCQSVYHLHVHLLGGRQMNWPPG
Subjt:  KAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG

XP_038905126.1 14 kDa zinc-binding protein [Benincasa hispida] (e value: 3.8e-79; identity: 97.44%)
Query:  MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH
        M+NEDIK RTRLSVLSSH SHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPS VVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH
Subjt:  MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH

Query:  TEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        TEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
Subjt:  TEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG

TrEMBL top hits
A0A0A0LC24 HIT domain-containing protein (e value: 8.7e-82; identity: 100%)
Query:  MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH
        MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH
Subjt:  MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH

Query:  TEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        TEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
Subjt:  TEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG

A0A1S3B859 14 kDa zinc-binding protein (e value: 9.7e-81; identity: 98.72%)
Query:  MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH
        MVNEDIKLRTRLSVLSSH SHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIP+VKDGLSGLSKAEERH
Subjt:  MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH

Query:  TEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        TEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
Subjt:  TEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG

A0A6J1F2T2 14 kDa zinc-binding protein-like (e value: 1.4e-74; identity: 92.95%)
Query:  MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH
        MVNE IK  TRL+VLSSH S SLSISMASSE+EAALAAVPSDSPTIFDKIINKEIPST+VFEDDKVLAFRDI+PQAPTHILIIPKVKDGLSGLSKAEERH
Subjt:  MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH

Query:  TEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        TEILG LLY AKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
Subjt:  TEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG

A0A6J1GGA0 14 kDa zinc-binding protein-like (e value: 6.1e-75; identity: 92.95%)
Query:  MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH
        MVN DIK  TRLSVLSSH + SLSISMASSE+EAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDI+PQAPTHILIIPKVKDGLSGLSKAEERH
Subjt:  MVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERH

Query:  TEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
         EILGHLLYTAKLIA+QEGLDDGFR+VINDGPSGCQSVYHLHVHLLGGRQMNWPPG
Subjt:  TEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG

A0A6J1I2W3 14 kDa zinc-binding protein-like isoform X2 (e value: 1.0e-77; identity: 92.59%)
Query:  KEIRAVMVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLS
        ++IRAVMVN DIK  TRLSVLSSH + SLSISMASSE+EAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDI+PQAPTHILIIPKVKDGLSGLS
Subjt:  KEIRAVMVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLS

Query:  KAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        KAEERHTEILGHLLYTAKLIA+QEGLDDGFR+VINDGPSGCQSVYHLHVHLLGGRQMNWPPG
Subjt:  KAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG

SwissProt top hits
P32084 Uncharacterized HIT-like protein Synpcc7942_1390 (e value: 5.3e-36; identity: 61.61%)
Query:  TIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVH
        TIF KII +EIP+ +V+EDD  LAFRD+APQAP HIL+IP  K  ++ L +A   H  +LGHLL T K IA QEGL +G+R VIN GP+G Q+VYHLH+H
Subjt:  TIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVH

Query:  LLGGRQMNWPPG
        LLGGR + WPPG
Subjt:  LLGGRQMNWPPG

P42855 14 kDa zinc-binding protein (Fragment) (e value: 6.3e-53; identity: 83.04%)
Query:  TIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVH
        TIF KII+KEIPSTVV+EDDKVLAFRDI PQ P HIL+IPKV+DGL+GL KAEERH +ILG LLYTAKL+AKQEGLD+GFR+VINDGP GCQSVYH+HVH
Subjt:  TIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVH

Query:  LLGGRQMNWPPG
        L+GGRQMNWPPG
Subjt:  LLGGRQMNWPPG

P42856 14 kDa zinc-binding protein (e value: 1.0e-58; identity: 84.38%)
Query:  SSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVI
        SSE+EAAL  +  DSPTIFDKII KEIPSTVV+ED+KVLAFRDI PQAPTHILIIPKVKDGL+GL+KAEERH EILG+LLY AK++AKQEGL+DG+RVVI
Subjt:  SSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVI

Query:  NDGPSGCQSVYHLHVHLLGGRQMNWPPG
        NDGPSGCQSVYH+HVHLLGGRQMNWPPG
Subjt:  NDGPSGCQSVYHLHVHLLGGRQMNWPPG

Q8GUN2 Adenylylsulfatase HINT1 (e value: 7.9e-64; identity: 79.45%)
Query:  RLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYT
        R+S+LSSHFS + ++   +SE+EAALAA PSDSPTIFDKII+KEIPSTVVFEDDKVLAFRDI PQ P HIL+IPKV+DGL+GLSKAEERH +ILG LLYT
Subjt:  RLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYT

Query:  AKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        AKL+AKQEGL +GFR+VINDGP GCQSVYH+HVHL+GGRQMNWPPG
Subjt:  AKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG

Q8SQ21 Histidine triad nucleotide-binding protein 2, mitochondrial (e value: 9.1e-36; identity: 54.84%)
Query:  EAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGP
        +A  AA    +PTIF +I+++ +P+ +++ED + LAFRD+APQAP H L+IP  K  +  +S+AEE   ++LGHLL  AK  AK EGL DG+R+VINDG 
Subjt:  EAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGP

Query:  SGCQSVYHLHVHLLGGRQMNWPPG
         G QSVYHLH+H+LGGRQ+ WPPG
Subjt:  SGCQSVYHLHVHLLGGRQMNWPPG

Arabidopsis top hits
AT1G31160.1 HISTIDINE TRIAD NUCLEOTIDE-BINDING 2 (e value: 2.0e-46; identity: 62.5%)
Query:  SISMASSEQEAALAA---VPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGL
        S   A +E+ AA AA     + +PTIFDKII KEIPS +V+ED+ VLAFRDI PQAP H+L+IPK++DGL+ L KAE RH E+LG LL+ +K++A++EG+
Subjt:  SISMASSEQEAALAA---VPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYTAKLIAKQEGL

Query:  DDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
         DGFRVVIN+G   CQSVYHLH+H+LGGRQM WPPG
Subjt:  DDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG

AT3G56490.1 HIS triad family protein 3 (e value: 5.6e-65; identity: 79.45%)
Query:  RLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYT
        R+S+LSSHFS + ++   +SE+EAALAA PSDSPTIFDKII+KEIPSTVVFEDDKVLAFRDI PQ P HIL+IPKV+DGL+GLSKAEERH +ILG LLYT
Subjt:  RLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTEILGHLLYT

Query:  AKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
        AKL+AKQEGL +GFR+VINDGP GCQSVYH+HVHL+GGRQMNWPPG
Subjt:  AKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG

AT4G16566.1 histidine triad nucleotide-binding 4 (e value: 2.0e-06; identity: 29%)
Query:  IFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVK-DGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVH
        IF +I+     + ++  D+KV+AF+DI P A  H L+IPK     ++ L + +E ++ ++ H+L   + + +++      R   +  P    SV HLH+H
Subjt:  IFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVK-DGLSGLSKAEERHTEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVH
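
The % identity reported for each hit can be cross-checked against the alignment midline (the unlabeled line between Query and Subjt). The sketch below assumes the usual BLAST convention: a letter in the midline marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The function name `percent_identity` is hypothetical, and the input is only the second alignment block of the KAG7017601.1 hit, so the result applies to that block rather than the whole alignment.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of aligned columns whose midline character repeats the query residue.

    Assumes BLAST-style midlines: identical residues are echoed as letters,
    '+' marks a conservative substitution, ' ' marks a mismatch or gap.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    if not midline:
        raise ValueError("empty alignment block")
    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    return 100.0 * identical / len(midline)


# Second alignment block of the KAG7017601.1 hit (56 columns, 2 mismatches).
QUERY_BLOCK = "TEILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG"
MIDLINE_BLOCK = "TEILG LLY AKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG"

print(round(percent_identity(QUERY_BLOCK, MIDLINE_BLOCK), 2))  # 96.43
```

To reproduce the value for a whole hit, concatenate the query blocks and the midline blocks (without the `Query:` label or the leading padding) before calling the function.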


Sequences
CDS sequence
ATGAAAAAAGAGATTAGGGCTGTAATGGTCAACGAAGATATAAAACTCAGGACTCGACTCTCAGTTCTGAGCTCTCACTTTTCCCACTCTCTTTCTATTTCCATGGCTTC
TTCCGAGCAGGAAGCGGCTCTTGCAGCCGTTCCCTCCGATTCCCCCACCATATTTGACAAAATCATTAACAAGGAAATTCCATCTACGGTGGTTTTTGAGGATGACAAGG
TCCTTGCTTTCAGGGACATAGCACCCCAAGCTCCTACACATATTTTAATCATTCCAAAAGTTAAGGATGGATTATCTGGATTATCTAAGGCTGAGGAGAGGCACACGGAG
ATTCTCGGCCACCTTCTTTACACTGCCAAGCTCATTGCCAAACAAGAAGGACTGGACGATGGCTTTAGAGTCGTAATTAACGACGGACCAAGTGGATGTCAATCGGTTTA
TCATCTTCATGTTCACCTTTTGGGGGGACGACAAATGAATTGGCCCCCAGGTTAA
mRNA sequence
GAGGCTTTGAAGGTGCCATATGTAAAGAGCGCGGAATTCGATGAAAAAAGAGATTAGGGCTGTAATGGTCAACGAAGATATAAAACTCAGGACTCGACTCTCAGTTCTGA
GCTCTCACTTTTCCCACTCTCTTTCTATTTCCATGGCTTCTTCCGAGCAGGAAGCGGCTCTTGCAGCCGTTCCCTCCGATTCCCCCACCATATTTGACAAAATCATTAAC
AAGGAAATTCCATCTACGGTGGTTTTTGAGGATGACAAGGTCCTTGCTTTCAGGGACATAGCACCCCAAGCTCCTACACATATTTTAATCATTCCAAAAGTTAAGGATGG
ATTATCTGGATTATCTAAGGCTGAGGAGAGGCACACGGAGATTCTCGGCCACCTTCTTTACACTGCCAAGCTCATTGCCAAACAAGAAGGACTGGACGATGGCTTTAGAG
TCGTAATTAACGACGGACCAAGTGGATGTCAATCGGTTTATCATCTTCATGTTCACCTTTTGGGGGGACGACAAATGAATTGGCCCCCAGGTTAAGATATAGAGACTTTT
ATTAACGTTGAAATATAGAGCTAATAAACCTATATTATCATTTACCTATTCAAACACTTTTTTATGAACTCCTGCACGTGGCTTGTGGTGTCTAACATCCTATTAAATAT
GGTAATTGGTGTAAGTTTGGTGACTATCGTTTTTTTCTTGAATATCTAATTGAAAATGGATGGCTATCTTTCCTCATAAAGGGTAATGCATGTATGTGGGTTTATATGTT
AGAATGGTGTTTAGTTAAAAGGGAAGGAAATAAAACAAAGATTGGGCCTCA
Protein sequence
MKKEIRAVMVNEDIKLRTRLSVLSSHFSHSLSISMASSEQEAALAAVPSDSPTIFDKIINKEIPSTVVFEDDKVLAFRDIAPQAPTHILIIPKVKDGLSGLSKAEERHTE
ILGHLLYTAKLIAKQEGLDDGFRVVINDGPSGCQSVYHLHVHLLGGRQMNWPPG
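
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code and an in-frame CDS starting at its first base. The helper name `translate` is hypothetical, and only short fragments copied from the start and end of the CDS are used here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon.

    Whitespace (including line breaks from wrapped sequences) is ignored,
    and a trailing incomplete codon is dropped.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS, and its last six codons (ending in the stop TAA).
print(translate("ATGAAAAAAGAGATTAGGGCTGTAATGGTC"))  # MKKEIRAVMV
print(translate("AATTGGCCCCCAGGTTAA"))              # NWPPG
```

Passing the full CDS (the wrapped lines joined together) should give a string that can be compared directly with the protein sequence.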