; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017501 (gene) of Snake gourd v1 genome

Gene IDTan0017501
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionchaperone protein DnaJ-like isoform X1
Genome locationLG04:24625480..24627246
RNA-Seq ExpressionTan0017501
SyntenyTan0017501
Gene Ontology termsGO:0005622 - intracellular (cellular component)
InterPro domainsIPR001623 - DnaJ domain
IPR018253 - DnaJ domain, conserved site
IPR036869 - Chaperone J-domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7030580.1 DnaJ-like subfamily B member 6 [Cucurbita argyrosperma subsp. argyrosperma]1.5e-7284.66Show/hide
Query:  MEADCGSTSYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDEDDE-GMCDFMQELWS
        MEADCGSTSYYSILGVSSGCS+DEIRRAYRKLAMKWHPDRC KNP  LGT KRKFQQIQEAYSVLSD+RKR RYDAGIQDFD+DE+DE GMCDFMQELWS
Subjt:  MEADCGSTSYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDEDDE-GMCDFMQELWS

Query:  LMAEDKKREEKSYSLKELQGMLTEMAKGFDFGCWSSYGTSDCEITRCSKELCSDRGSYVHVWN
        L+AEDKKREEKSYSL+ELQGML EMAKGFDF CW SYG+S+CEIT   KEL   +GS VHVWN
Subjt:  LMAEDKKREEKSYSLKELQGMLTEMAKGFDFGCWSSYGTSDCEITRCSKELCSDRGSYVHVWN

XP_022942667.1 uncharacterized protein LOC111447636 isoform X1 [Cucurbita moschata]5.2e-7385.28Show/hide
Query:  MEADCGSTSYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDEDDE-GMCDFMQELWS
        MEADCGSTSYYSILGVSSGCS+DEIRRAYRKLAMKWHPDRC KNP  LGT KRKFQQIQEAYSVLSD+RKR RYDAGIQDFD+DE+DE GMCDFMQELWS
Subjt:  MEADCGSTSYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDEDDE-GMCDFMQELWS

Query:  LMAEDKKREEKSYSLKELQGMLTEMAKGFDFGCWSSYGTSDCEITRCSKELCSDRGSYVHVWN
        L+AEDKKREEKSYSL+ELQGML EMAKGFDF CW SYGTS+CEIT   KEL   +GS VHVWN
Subjt:  LMAEDKKREEKSYSLKELQGMLTEMAKGFDFGCWSSYGTSDCEITRCSKELCSDRGSYVHVWN

XP_022942668.1 uncharacterized protein LOC111447636 isoform X2 [Cucurbita moschata]2.1e-7485.8Show/hide
Query:  MEADCGSTSYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDEDDEGMCDFMQELWSL
        MEADCGSTSYYSILGVSSGCS+DEIRRAYRKLAMKWHPDRC KNP  LGT KRKFQQIQEAYSVLSD+RKR RYDAGIQDFD+DE+DEGMCDFMQELWSL
Subjt:  MEADCGSTSYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDEDDEGMCDFMQELWSL

Query:  MAEDKKREEKSYSLKELQGMLTEMAKGFDFGCWSSYGTSDCEITRCSKELCSDRGSYVHVWN
        +AEDKKREEKSYSL+ELQGML EMAKGFDF CW SYGTS+CEIT   KEL   +GS VHVWN
Subjt:  MAEDKKREEKSYSLKELQGMLTEMAKGFDFGCWSSYGTSDCEITRCSKELCSDRGSYVHVWN

XP_022985068.1 uncharacterized protein LOC111483143 isoform X2 [Cucurbita maxima]4.0e-7384.57Show/hide
Query:  MEADCGSTSYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDEDDEGMCDFMQELWSL
        MEADCGSTSYYSILGVSSGCS+DEIRRAYRKLAMKWHPDRC KNP  L T KRKFQQIQEAYSVLSD+RKR RYDAGIQDFD+DE+DEGMCDFMQELWSL
Subjt:  MEADCGSTSYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDEDDEGMCDFMQELWSL

Query:  MAEDKKREEKSYSLKELQGMLTEMAKGFDFGCWSSYGTSDCEITRCSKELCSDRGSYVHVWN
        +AEDKKREEKSYSL+ELQGML EMAKGFDF CW SYG S+CEIT+  KEL   +GS VHVWN
Subjt:  MAEDKKREEKSYSLKELQGMLTEMAKGFDFGCWSSYGTSDCEITRCSKELCSDRGSYVHVWN

XP_023514929.1 uncharacterized protein LOC111779094 [Cucurbita pepo subsp. pepo]2.3e-7385.19Show/hide
Query:  MEADCGSTSYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDEDDEGMCDFMQELWSL
        MEADCGSTSYYSILGVSSGCS+DEIRRAYRKLAMKWHPDRC KNP  LGT KRKFQQIQEAYSVLSD+RKR RYDAGIQDFD+DE+DEGMCDFMQELWSL
Subjt:  MEADCGSTSYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDEDDEGMCDFMQELWSL

Query:  MAEDKKREEKSYSLKELQGMLTEMAKGFDFGCWSSYGTSDCEITRCSKELCSDRGSYVHVWN
        +AEDKKREEKS SL+ELQGML EMAKGFDF CW SYGTS+CEIT   KEL   +GS VHVWN
Subjt:  MAEDKKREEKSYSLKELQGMLTEMAKGFDFGCWSSYGTSDCEITRCSKELCSDRGSYVHVWN

TrEMBL top hitse value%identityAlignment
A0A6J1DMB8 chaperone protein dnaJ 6-like isoform X22.1e-6480.86Show/hide
Query:  EADCGSTSYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDEDD-EGMCDFMQELWSL
        EADCG++SYYSILGVSSG SIDEIRRAYRKLAMKWHPD+CA+NPSLLGT KRKFQQIQEAYSVLSDQRKR RYDAGIQD D DEDD EGMCDF+QELWSL
Subjt:  EADCGSTSYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDEDD-EGMCDFMQELWSL

Query:  MAEDKKREEKSYSLKELQGMLTEMAKGFDFGCWSSYGTSDCEITRCSKELCSDRGSYVHVWN
        MAE KKREEKSYSL+ELQ ML EMA+GF+    SSYGTS CE+T+CSKEL  D   +VHV N
Subjt:  MAEDKKREEKSYSLKELQGMLTEMAKGFDFGCWSSYGTSDCEITRCSKELCSDRGSYVHVWN

A0A6J1FQX0 uncharacterized protein LOC111447636 isoform X12.5e-7385.28Show/hide
Query:  MEADCGSTSYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDEDDE-GMCDFMQELWS
        MEADCGSTSYYSILGVSSGCS+DEIRRAYRKLAMKWHPDRC KNP  LGT KRKFQQIQEAYSVLSD+RKR RYDAGIQDFD+DE+DE GMCDFMQELWS
Subjt:  MEADCGSTSYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDEDDE-GMCDFMQELWS

Query:  LMAEDKKREEKSYSLKELQGMLTEMAKGFDFGCWSSYGTSDCEITRCSKELCSDRGSYVHVWN
        L+AEDKKREEKSYSL+ELQGML EMAKGFDF CW SYGTS+CEIT   KEL   +GS VHVWN
Subjt:  LMAEDKKREEKSYSLKELQGMLTEMAKGFDFGCWSSYGTSDCEITRCSKELCSDRGSYVHVWN

A0A6J1FVD4 uncharacterized protein LOC111447636 isoform X21.0e-7485.8Show/hide
Query:  MEADCGSTSYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDEDDEGMCDFMQELWSL
        MEADCGSTSYYSILGVSSGCS+DEIRRAYRKLAMKWHPDRC KNP  LGT KRKFQQIQEAYSVLSD+RKR RYDAGIQDFD+DE+DEGMCDFMQELWSL
Subjt:  MEADCGSTSYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDEDDEGMCDFMQELWSL

Query:  MAEDKKREEKSYSLKELQGMLTEMAKGFDFGCWSSYGTSDCEITRCSKELCSDRGSYVHVWN
        +AEDKKREEKSYSL+ELQGML EMAKGFDF CW SYGTS+CEIT   KEL   +GS VHVWN
Subjt:  MAEDKKREEKSYSLKELQGMLTEMAKGFDFGCWSSYGTSDCEITRCSKELCSDRGSYVHVWN

A0A6J1J719 uncharacterized protein LOC111483143 isoform X13.6e-7284.05Show/hide
Query:  MEADCGSTSYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDE-DDEGMCDFMQELWS
        MEADCGSTSYYSILGVSSGCS+DEIRRAYRKLAMKWHPDRC KNP  L T KRKFQQIQEAYSVLSD+RKR RYDAGIQDFD+DE D+EGMCDFMQELWS
Subjt:  MEADCGSTSYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDE-DDEGMCDFMQELWS

Query:  LMAEDKKREEKSYSLKELQGMLTEMAKGFDFGCWSSYGTSDCEITRCSKELCSDRGSYVHVWN
        L+AEDKKREEKSYSL+ELQGML EMAKGFDF CW SYG S+CEIT+  KEL   +GS VHVWN
Subjt:  LMAEDKKREEKSYSLKELQGMLTEMAKGFDFGCWSSYGTSDCEITRCSKELCSDRGSYVHVWN

A0A6J1JCH2 uncharacterized protein LOC111483143 isoform X21.9e-7384.57Show/hide
Query:  MEADCGSTSYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDEDDEGMCDFMQELWSL
        MEADCGSTSYYSILGVSSGCS+DEIRRAYRKLAMKWHPDRC KNP  L T KRKFQQIQEAYSVLSD+RKR RYDAGIQDFD+DE+DEGMCDFMQELWSL
Subjt:  MEADCGSTSYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDEDDEGMCDFMQELWSL

Query:  MAEDKKREEKSYSLKELQGMLTEMAKGFDFGCWSSYGTSDCEITRCSKELCSDRGSYVHVWN
        +AEDKKREEKSYSL+ELQGML EMAKGFDF CW SYG S+CEIT+  KEL   +GS VHVWN
Subjt:  MAEDKKREEKSYSLKELQGMLTEMAKGFDFGCWSSYGTSDCEITRCSKELCSDRGSYVHVWN

SwissProt top hitse value%identityAlignment
A0Q1R3 Chaperone protein DnaJ8.2e-1351.95Show/hide
Query:  STSYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDA-GIQDFD
        S  YY +LG+S G S DEI++AYRKLAMK+HPDR   N       + KF+ I EAY VLSD +K+  YD  G  DF+
Subjt:  STSYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDA-GIQDFD

A5FZ18 Chaperone protein DnaJ1.1e-1251.95Show/hide
Query:  TSYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDA-GIQDFDN
        T YY +LGVS G S DE+++AYRKLAM++HPDR   NP      ++KF+ I EAY VL D++KR  YD  G   F+N
Subjt:  TSYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDA-GIQDFDN

Q0III6 DnaJ homolog subfamily B member 68.2e-1354.55Show/hide
Query:  YYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYD
        YY +LGV    S ++I++AYRKLA+KWHPD   KNP      +RKF+Q+ EAY VLSD +KR  YD
Subjt:  YYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYD

Q9CMS2 Chaperone protein DnaJ8.2e-1356.06Show/hide
Query:  YYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYD
        YY +LGV  G    EI+RAY+KLAMK+HPDR   N  L    + KF++IQEAY VLSD++KR  YD
Subjt:  YYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYD

Q9QYI7 DnaJ homolog subfamily B member 82.8e-1352.24Show/hide
Query:  SYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYD
        +YY +LGV S  S ++I++AYRKLA++WHPD   KNP      ++KF+Q+ EAY VLSD +KR  YD
Subjt:  SYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYD

Arabidopsis top hitse value%identityAlignment
AT1G56300.1 Chaperone DnaJ-domain superfamily protein1.5e-3055Show/hide
Query:  TSYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDEDDEGMCDFMQELWSLMAEDKKR
        +SYY+ILG+    S+ +IR AYRKLAMKWHPDR A+NP + G  KR+FQQIQEAYSVL+D+ KR  YD G+ D  +++DD+  CDFMQE+ S+M   K  
Subjt:  TSYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDEDDEGMCDFMQELWSLMAEDKKR

Query:  EEKSYSLKELQGMLTEMAKG
         E   SL++LQ M T+M  G
Subjt:  EEKSYSLKELQGMLTEMAKG

AT1G71000.1 Chaperone DnaJ-domain superfamily protein1.3e-2957.5Show/hide
Query:  SYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDEDDEGMCDFMQELWSLMAEDKKRE
        +YY ILGV+   S ++IRRAY KLA  WHPDR  K+P   G  KR+FQQIQEAYSVLSD+RKR  YD G+ D     +DEG  DF+QE+ SLM++  KRE
Subjt:  SYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDEDDEGMCDFMQELWSLMAEDKKRE

Query:  EKSYSLKELQGMLTEMAKGF
        EK YSL+ELQ M+ +M   F
Subjt:  EKSYSLKELQGMLTEMAKGF

AT1G72416.1 Chaperone DnaJ-domain superfamily protein3.9e-1840.52Show/hide
Query:  YSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDEDDEGMCDFMQELWSLMAEDKKREEK
        Y++L +++ C+  ++R +Y+ L +KWHPDR  +        K KFQ IQ AYSVLSD  KRL YD G   +D+D+D+ GM DF+ E+ +LMA+ +   ++
Subjt:  YSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDEDDEGMCDFMQELWSLMAEDKKREEK

Query:  SYSLKELQGMLTEMAK
          SL+E + +  E+ K
Subjt:  SYSLKELQGMLTEMAK

AT1G72416.2 Chaperone DnaJ-domain superfamily protein3.9e-1840.52Show/hide
Query:  YSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDEDDEGMCDFMQELWSLMAEDKKREEK
        Y++L +++ C+  ++R +Y+ L +KWHPDR  +        K KFQ IQ AYSVLSD  KRL YD G   +D+D+D+ GM DF+ E+ +LMA+ +   ++
Subjt:  YSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDEDDEGMCDFMQELWSLMAEDKKREEK

Query:  SYSLKELQGMLTEMAK
          SL+E + +  E+ K
Subjt:  SYSLKELQGMLTEMAK

AT3G14200.1 Chaperone DnaJ-domain superfamily protein1.4e-2344.92Show/hide
Query:  YSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDEDDEGMCDFMQELWSLMAEDKKREEK
        Y++LG+   CS  E+R AY+KLA++WHPDRC+ +   +   K+KFQ IQEAYSVLSD  KR  YD G  + D+D+D  GM DF+ E+ ++M + K  +  
Subjt:  YSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDEDDEGMCDFMQELWSLMAEDKKREEK

Query:  S-YSLKELQGMLTEMAKG
        +  S ++LQ +  EM +G
Subjt:  S-YSLKELQGMLTEMAKG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGCTGATTGTGGCTCAACTTCGTATTACAGCATTCTCGGCGTCAGTTCTGGCTGTTCGATCGATGAAATTCGCAGAGCTTATCGCAAGCTCGCCATGAAATGGCA
TCCTGATAGATGTGCGAAAAATCCTTCGCTATTAGGTACAACCAAGAGAAAATTCCAACAAATCCAAGAAGCCTATTCAGTTCTATCGGATCAGAGAAAGAGATTGCGAT
ACGACGCCGGAATTCAAGATTTCGATAATGACGAAGACGACGAGGGAATGTGTGATTTCATGCAAGAACTATGGTCGCTAATGGCGGAAGACAAGAAGAGAGAGGAAAAA
AGCTACAGCTTGAAGGAGTTGCAGGGGATGTTGACGGAAATGGCGAAAGGCTTCGATTTCGGTTGTTGGTCTTCTTACGGAACTTCTGATTGTGAAATTACTCGATGCTC
CAAGGAATTATGTTCCGACAGAGGTTCCTATGTACATGTGTGGAACTGA
mRNA sequenceShow/hide mRNA sequence
CCACGTCATCTTTCTCTTTCTCCATTAGAAATTTCTGGAACACTTCTCTCGTTCTCGTTCGCAGATATTTTTTATCCTTCGAATCTTCTGGACTTTCGCATCCTCCCACA
TATAAAAATCACTTCGCCTTCTACATTCTGCATTGTGAATTTCTGTCGATAGATTTCACCATTGTTGAACGCTTCGAGTTTCTTGATTCTCTCTGTGATAACAATAAGGG
AGATGGAGGCTGATTGTGGCTCAACTTCGTATTACAGCATTCTCGGCGTCAGTTCTGGCTGTTCGATCGATGAAATTCGCAGAGCTTATCGCAAGCTCGCCATGAAATGG
CATCCTGATAGATGTGCGAAAAATCCTTCGCTATTAGGTACAACCAAGAGAAAATTCCAACAAATCCAAGAAGCCTATTCAGTTCTATCGGATCAGAGAAAGAGATTGCG
ATACGACGCCGGAATTCAAGATTTCGATAATGACGAAGACGACGAGGGAATGTGTGATTTCATGCAAGAACTATGGTCGCTAATGGCGGAAGACAAGAAGAGAGAGGAAA
AAAGCTACAGCTTGAAGGAGTTGCAGGGGATGTTGACGGAAATGGCGAAAGGCTTCGATTTCGGTTGTTGGTCTTCTTACGGAACTTCTGATTGTGAAATTACTCGATGC
TCCAAGGAATTATGTTCCGACAGAGGTTCCTATGTACATGTGTGGAACTGATTTGGACACGTAGTAGGGCTGTTTTTGTAATTGTGGGAGAATCCCAACAATTTTGGTGT
AAATCTCCATGAGTTTAGAGGAATGGTGAAAATCTTAATCTTTCACTGCCAGCTTTCAATTTAGATGGTTCGAGTATTTATGTCTTTTTTTTTTCAATCTAGTTTCTGTT
ATTTAAAATTAGTTATCAATTTAGTCTATCAATGGAACATTATTCTACTTT
Protein sequenceShow/hide protein sequence
MEADCGSTSYYSILGVSSGCSIDEIRRAYRKLAMKWHPDRCAKNPSLLGTTKRKFQQIQEAYSVLSDQRKRLRYDAGIQDFDNDEDDEGMCDFMQELWSLMAEDKKREEK
SYSLKELQGMLTEMAKGFDFGCWSSYGTSDCEITRCSKELCSDRGSYVHVWN