; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi08G014540 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi08G014540
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionJ domain-containing protein
Genome locationchr08:22703652..22711895
RNA-Seq ExpressionLsi08G014540
SyntenyLsi08G014540
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR001623 - DnaJ domain
IPR007872 - DPH-type metal-binding domain
IPR036671 - DPH-type metal-binding domain superfamily
IPR036869 - Chaperone J-domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8100649.1 hypothetical protein FH972_018527 [Carpinus fangiana]1.0e-17661.04Show/hide
Query:  QGRGKWACSYKKTTLIICSINIIVALYVFRSLYASLYLYTDNDSRSVVKYTPDQIRKMEEFIRIRRASEPVELIKLVKGLK-EFSQEQMVDELPHPLKQK
        QGR +W CSYKKTTL++CSINI +ALYV RSLY+SLY+Y+   SR+ V YTPDQIRKMEE IRIRRASEP+EL+KLVK LK E ++ ++V +LP PLK+K
Subjt:  QGRGKWACSYKKTTLIICSINIIVALYVFRSLYASLYLYTDNDSRSVVKYTPDQIRKMEEFIRIRRASEPVELIKLVKGLK-EFSQEQMVDELPHPLKQK

Query:  ITEEILLKLRSLNASSNSTQQREAVESWRKEKLEEAYKLITEQMIENSTLSFEDAGILVKALETDWAVLSEAIGLWIPAEVFHTEHDDKPEGVDEFNDEI
        IT+EIL +L+SLN S+NST+QREAVE WR+EKLEEA +L  E+   NST+  E+AG+LV+ALE+DWAVLSE IGLWIPAEV + EHD+KPEG +E  D+I
Subjt:  ITEEILLKLRSLNASSNSTQQREAVESWRKEKLEEAYKLITEQMIENSTLSFEDAGILVKALETDWAVLSEAIGLWIPAEVFHTEHDDKPEGVDEFNDEI

Query:  LPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRAQPGDRKCNIWVYCPSETGCHSPDIYEHKHMECWLKYAENPKLNFKTNYPQSY
        LPGRP+PPECH ELHTDY GAAVRWGLTHHKESAADCCQACLDHAKRA+PG++KCNIWVYCPSE GCHSPDIY+HKH ECWLKYAE P+ +FK  Y  SY
Subjt:  LPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRAQPGDRKCNIWVYCPSETGCHSPDIYEHKHMECWLKYAENPKLNFKTNYPQSY

Query:  RNSHPTAPLVVPWVSGVLPVASHHPAAAISVAL-----SAAPFLRRSSPLRWFQALALYLANNSHCIYLKVVGKKSH---------PTMSSSSGSIGETY
        RNSHP AP VVPWV+      +     A  + L         FL    P      L     N S  +  + + K  +         P M     SI ET+
Subjt:  RNSHPTAPLVVPWVSGVLPVASHHPAAAISVAL-----SAAPFLRRSSPLRWFQALALYLANNSHCIYLKVVGKKSH---------PTMSSSSGSIGETY

Query:  YDVLSLREDASYDEIRASYRSALLNFHPDKLQAICHRSHPDDIAGERYFKVQKAWEVLGSSMSRASYDRELQAAKGDAIGAESICLEDMVVEDKGEVVEL
        YDVLS++EDASY+EIR SYRSA+LN HPDKLQ     S P   + ER+ KVQ+AWEVL +  SRA YD +L+A + DA+ A+ I L+DM+VED GE +EL
Subjt:  YDVLSLREDASYDEIRASYRSALLNFHPDKLQAICHRSHPDDIAGERYFKVQKAWEVLGSSMSRASYDRELQAAKGDAIGAESICLEDMVVEDKGEVVEL

Query:  FYQCRCGDYFFIDSGELDEMG
        FYQCRCGDYF +DS EL+ +G
Subjt:  FYQCRCGDYFFIDSGELDEMG

KAG6585651.1 hypothetical protein SDJN03_18384, partial [Cucurbita argyrosperma subsp. sororia]8.7e-22474.32Show/hide
Query:  MARVEWGYQGRGKWACSYKKTTLIICSINIIVALYVFRSLYASLYLYTDNDSRSVVKYTPDQIRKMEEFIRIRRASEPVELIKLVKGLKEFSQEQMVDEL
        MA+VEWGYQGR                                           VVKYTPDQIRKME+FIRIRRAS+PVELIK VK LKEFSQE+ V+EL
Subjt:  MARVEWGYQGRGKWACSYKKTTLIICSINIIVALYVFRSLYASLYLYTDNDSRSVVKYTPDQIRKMEEFIRIRRASEPVELIKLVKGLKEFSQEQMVDEL

Query:  PHPLKQKITEEILLKLRSLNASSNSTQQREAVESWRKEKLEEAYKLITEQMIENSTLSFEDAGILVKALETDWAVLSEAIGLWIPAEVFHTEHDDKPEGV
        P PLKQKITEEILLKLRS N SSNSTQQREAVESWRKEKLEEA KLITEQ IENSTLS EDAGILVKALETDWAVL+E IG+WIPAE+F+ EHDDKPE  
Subjt:  PHPLKQKITEEILLKLRSLNASSNSTQQREAVESWRKEKLEEAYKLITEQMIENSTLSFEDAGILVKALETDWAVLSEAIGLWIPAEVFHTEHDDKPEGV

Query:  DEFNDEILPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRAQPGDRKCNIWVYCPSETGCHSPDIYEHKHMECWLKYAENPKLNFK
        DEF+ E+LPGRP+PPEC+AELHTDY GAAVRWGLTHHK+SAADCCQACLDH KRAQPG RKCNIWVYCPSETGCHSPDIYEHK+MECWLKYAE PKLNFK
Subjt:  DEFNDEILPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRAQPGDRKCNIWVYCPSETGCHSPDIYEHKHMECWLKYAENPKLNFK

Query:  TNYPQSYRNSHPTAPLVVPWVSGVLPVASHHPAAAISVALS--AAPFLRRSSPLRWFQALALYLANNSHCIYLKVVGKKSHPTMSSSSGSIGETYYDVLS
        T YPQSYRNSHPTAPL+VPW S V P          SVALS  AA  L RSS  R  Q                   K     MSSSSGSIGETYYDVLS
Subjt:  TNYPQSYRNSHPTAPLVVPWVSGVLPVASHHPAAAISVALS--AAPFLRRSSPLRWFQALALYLANNSHCIYLKVVGKKSHPTMSSSSGSIGETYYDVLS

Query:  LREDASYDEIRASYRSALLNFHPDKLQAICHRSHPDDIAGERYFKVQKAWEVLGSSMSRASYDRELQAAKGDAIGAESICLEDMVVEDKGEVVELFYQCR
        LREDASYDEIRASYRSALLNFHPDKLQA+C +SHPDD+ GERYFKVQKAWEVLGSSMSRA+YDRELQAAKGDA+GAESI LEDM VEDKGEVVELFYQCR
Subjt:  LREDASYDEIRASYRSALLNFHPDKLQAICHRSHPDDIAGERYFKVQKAWEVLGSSMSRASYDRELQAAKGDAIGAESICLEDMVVEDKGEVVELFYQCR

Query:  CGDYFFIDSGELDEMGYPLLRNGSKVSLRTLDALPASVVLPCGSCSLKI
        CGDYFFIDS EL+EMGYP+LR GSK+SLRT DALPAS+VLPCGSCSLK+
Subjt:  CGDYFFIDSGELDEMGYPLLRNGSKVSLRTLDALPASVVLPCGSCSLKI

XP_022962386.1 uncharacterized protein LOC111462847 [Cucurbita moschata]8.6e-17993.88Show/hide
Query:  MARVEWGYQGRGKWACSYKKTTLIICSINIIVALYVFRSLYASLYLYTDNDSRSVVKYTPDQIRKMEEFIRIRRASEPVELIKLVKGLKEFSQEQMVDEL
        MARVEWGYQGRGKWACSYKKTTLIICSINIIVALYVFRSLYASLYLYTDNDS++VVKYTPDQIRKMEEFIRIRRASEPVELIKLVK LKEFSQE+ VDEL
Subjt:  MARVEWGYQGRGKWACSYKKTTLIICSINIIVALYVFRSLYASLYLYTDNDSRSVVKYTPDQIRKMEEFIRIRRASEPVELIKLVKGLKEFSQEQMVDEL

Query:  PHPLKQKITEEILLKLRSLNASSNSTQQREAVESWRKEKLEEAYKLITEQMIENSTLSFEDAGILVKALETDWAVLSEAIGLWIPAEVFHTEHDDKPEGV
        PHPLKQKITEEILLKLRS NASSN+TQQREAVESWRKEKLEEA KLITEQMIENSTLSFEDAGILVKALE DW  LSE+IGLWIPAE+F+ EHDDKPEGV
Subjt:  PHPLKQKITEEILLKLRSLNASSNSTQQREAVESWRKEKLEEAYKLITEQMIENSTLSFEDAGILVKALETDWAVLSEAIGLWIPAEVFHTEHDDKPEGV

Query:  DEFNDEILPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRAQPGDRKCNIWVYCPSETGCHSPDIYEHKHMECWLKYAENPKLNFK
        DEF+DEILPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRA+PGDRKCNIWVYCPSE GCHSPDIYEHKHMECWLKYAENPKLNFK
Subjt:  DEFNDEILPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRAQPGDRKCNIWVYCPSETGCHSPDIYEHKHMECWLKYAENPKLNFK

Query:  TNYPQSYRNSHPTAPLVVPWVSGVLPV
        TNYPQSYRNSHPTAPLVVPWVSGV+ V
Subjt:  TNYPQSYRNSHPTAPLVVPWVSGVLPV

XP_022996464.1 uncharacterized protein LOC111491706 [Cucurbita maxima]2.5e-17893.58Show/hide
Query:  MARVEWGYQGRGKWACSYKKTTLIICSINIIVALYVFRSLYASLYLYTDNDSRSVVKYTPDQIRKMEEFIRIRRASEPVELIKLVKGLKEFSQEQMVDEL
        MARVEWGYQGRGKWACSYKKTTLIICSINIIVALYVFRSLYASLYLYTDNDS++V KYTPDQIRKMEEFIRIRRASEPVELIKLVK LKEFSQE+ VDEL
Subjt:  MARVEWGYQGRGKWACSYKKTTLIICSINIIVALYVFRSLYASLYLYTDNDSRSVVKYTPDQIRKMEEFIRIRRASEPVELIKLVKGLKEFSQEQMVDEL

Query:  PHPLKQKITEEILLKLRSLNASSNSTQQREAVESWRKEKLEEAYKLITEQMIENSTLSFEDAGILVKALETDWAVLSEAIGLWIPAEVFHTEHDDKPEGV
        PHPLKQKITEEILLKLRS NASSN+TQQREAVESWRKEKLEEA KLITEQMIENSTLSFEDAGILVKALE DW  LSE+IGLWIPAE+F+ EHDDKPEGV
Subjt:  PHPLKQKITEEILLKLRSLNASSNSTQQREAVESWRKEKLEEAYKLITEQMIENSTLSFEDAGILVKALETDWAVLSEAIGLWIPAEVFHTEHDDKPEGV

Query:  DEFNDEILPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRAQPGDRKCNIWVYCPSETGCHSPDIYEHKHMECWLKYAENPKLNFK
        DEF+DEILPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRA+PGDRKCNIWVYCPSE GCHSPDIYEHKHMECWLKYAENPKLNFK
Subjt:  DEFNDEILPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRAQPGDRKCNIWVYCPSETGCHSPDIYEHKHMECWLKYAENPKLNFK

Query:  TNYPQSYRNSHPTAPLVVPWVSGVLPV
        TNYPQSYRNSHPTAPLVVPWVSGV+ V
Subjt:  TNYPQSYRNSHPTAPLVVPWVSGVLPV

XP_038886550.1 uncharacterized protein LOC120076725 [Benincasa hispida]2.6e-18396.62Show/hide
Query:  MARVEWGYQGRGKWACSYKKTTLIICSINIIVALYVFRSLYASLYLYTDNDSRSVVKYTPDQIRKMEEFIRIRRASEPVELIKLVKGLKEFSQEQMVDEL
        MARVEWGYQGRGKWACSYKKTTLIICSINIIVALYVFRSLYASLYLYTDNDSRSVVKYTPDQIRKMEEFIRIRRASEPVELIKLVK LKEFSQEQ +DEL
Subjt:  MARVEWGYQGRGKWACSYKKTTLIICSINIIVALYVFRSLYASLYLYTDNDSRSVVKYTPDQIRKMEEFIRIRRASEPVELIKLVKGLKEFSQEQMVDEL

Query:  PHPLKQKITEEILLKLRSLNASSNSTQQREAVESWRKEKLEEAYKLITEQMIENSTLSFEDAGILVKALETDWAVLSEAIGLWIPAEVFHTEHDDKPEGV
        P PLKQKITEEILLKLRSLNASSNST QREAVESWRKEKLEEA KLITEQM+ENSTLSFEDAGILVKALETDWAVLSEAIGLWIPAE+FHTEHDDKPEG+
Subjt:  PHPLKQKITEEILLKLRSLNASSNSTQQREAVESWRKEKLEEAYKLITEQMIENSTLSFEDAGILVKALETDWAVLSEAIGLWIPAEVFHTEHDDKPEGV

Query:  DEFNDEILPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRAQPGDRKCNIWVYCPSETGCHSPDIYEHKHMECWLKYAENPKLNFK
        DEF+DEILPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRAQPGDRKCNIWVYCPSETGCHSPDIYEHKHMECWLKYAENPKLNFK
Subjt:  DEFNDEILPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRAQPGDRKCNIWVYCPSETGCHSPDIYEHKHMECWLKYAENPKLNFK

Query:  TNYPQSYRNSHPTAPLVVPWVSGVL
        TNYPQSYRNSHPTAPLVVPWVSGV+
Subjt:  TNYPQSYRNSHPTAPLVVPWVSGVL

TrEMBL top hitse value%identityAlignment
A0A0A0LLA7 Uncharacterized protein4.3e-17692.92Show/hide
Query:  MARVEWGYQGRGKWACSYKKTTLIICSINIIVALYVFRSLYASLYLYTDNDSRSVVKYTPDQIRKMEEFIRIRRASEPVELIKLVKGLKEFSQEQMVDEL
        MARVEWGYQGRGKW CSYKKTT+IICSINIIVALYVFRSLYASLYLY+DNDS SVVKYTPDQIRKMEEF+R+RRASEPVELIKLVK LKEFSQEQ VDEL
Subjt:  MARVEWGYQGRGKWACSYKKTTLIICSINIIVALYVFRSLYASLYLYTDNDSRSVVKYTPDQIRKMEEFIRIRRASEPVELIKLVKGLKEFSQEQMVDEL

Query:  PHPLKQKITEEILLKLRSLNASSNSTQQREAVESWRKEKLEEAYKLITEQMIENSTLSFEDAGILVKALETDWAVLSEAIGLWIPAEVFHTEHDDKPEGV
        P PLKQKITEEILLKLRSLN SSNSTQQREAVE WRKEKLEEA KLITEQM+ENSTLSFEDAG LVKALE DW V SEAIGLWIP EV HTEHDDKPEGV
Subjt:  PHPLKQKITEEILLKLRSLNASSNSTQQREAVESWRKEKLEEAYKLITEQMIENSTLSFEDAGILVKALETDWAVLSEAIGLWIPAEVFHTEHDDKPEGV

Query:  DEFNDEILPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRAQPGDRKCNIWVYCPSETGCHSPDIYEHKHMECWLKYAENPKLNFK
        DEF+DEILPGRPVPPEC+AELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRAQPGDRKCNIWVYCPSETGCHSPDIYEHKHMECWLKYAENPKLNFK
Subjt:  DEFNDEILPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRAQPGDRKCNIWVYCPSETGCHSPDIYEHKHMECWLKYAENPKLNFK

Query:  TNYPQSYRNSHPTAPLVVPWVSGVL
        TNYPQSYRNSHPTAP VVPWVSGV+
Subjt:  TNYPQSYRNSHPTAPLVVPWVSGVL

A0A5A7V011 Uncharacterized protein1.2e-17592.62Show/hide
Query:  MARVEWGYQGRGKWACSYKKTTLIICSINIIVALYVFRSLYASLYLYTDNDSRSVVKYTPDQIRKMEEFIRIRRASEPVELIKLVKGLKEFSQEQMVDEL
        MARVEWGYQGRGKW CSYKKTTLIICSINIIVALYVFRSLYASLYLY+DNDS SVVKYTPDQIRKMEEF+R+RRASEPVELIKLVK LKEFSQEQ VDEL
Subjt:  MARVEWGYQGRGKWACSYKKTTLIICSINIIVALYVFRSLYASLYLYTDNDSRSVVKYTPDQIRKMEEFIRIRRASEPVELIKLVKGLKEFSQEQMVDEL

Query:  PHPLKQKITEEILLKLRSLNASSNSTQQREAVESWRKEKLEEAYKLITEQMIENSTLSFEDAGILVKALETDWAVLSEAIGLWIPAEVFHTEHDDKPEGV
        P PLKQKITEEILL+LRSLN+SSNST+QREAVE WRKEKLEEA KLI E+MIENSTLS EDAG LVKALETDW +LSEAIGLWIP EV H EHDDKPEGV
Subjt:  PHPLKQKITEEILLKLRSLNASSNSTQQREAVESWRKEKLEEAYKLITEQMIENSTLSFEDAGILVKALETDWAVLSEAIGLWIPAEVFHTEHDDKPEGV

Query:  DEFNDEILPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRAQPGDRKCNIWVYCPSETGCHSPDIYEHKHMECWLKYAENPKLNFK
        DEF+DEILPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRAQPGDRKCNIWVYCPSETGCHSPDIYEHKHMECWLKYAENPKLNFK
Subjt:  DEFNDEILPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRAQPGDRKCNIWVYCPSETGCHSPDIYEHKHMECWLKYAENPKLNFK

Query:  TNYPQSYRNSHPTAPLVVPWVSGVL
        TNYPQSYRNSHPTAPLVVPWVSGV+
Subjt:  TNYPQSYRNSHPTAPLVVPWVSGVL

A0A5N6RQT4 J domain-containing protein5.1e-17761.04Show/hide
Query:  QGRGKWACSYKKTTLIICSINIIVALYVFRSLYASLYLYTDNDSRSVVKYTPDQIRKMEEFIRIRRASEPVELIKLVKGLK-EFSQEQMVDELPHPLKQK
        QGR +W CSYKKTTL++CSINI +ALYV RSLY+SLY+Y+   SR+ V YTPDQIRKMEE IRIRRASEP+EL+KLVK LK E ++ ++V +LP PLK+K
Subjt:  QGRGKWACSYKKTTLIICSINIIVALYVFRSLYASLYLYTDNDSRSVVKYTPDQIRKMEEFIRIRRASEPVELIKLVKGLK-EFSQEQMVDELPHPLKQK

Query:  ITEEILLKLRSLNASSNSTQQREAVESWRKEKLEEAYKLITEQMIENSTLSFEDAGILVKALETDWAVLSEAIGLWIPAEVFHTEHDDKPEGVDEFNDEI
        IT+EIL +L+SLN S+NST+QREAVE WR+EKLEEA +L  E+   NST+  E+AG+LV+ALE+DWAVLSE IGLWIPAEV + EHD+KPEG +E  D+I
Subjt:  ITEEILLKLRSLNASSNSTQQREAVESWRKEKLEEAYKLITEQMIENSTLSFEDAGILVKALETDWAVLSEAIGLWIPAEVFHTEHDDKPEGVDEFNDEI

Query:  LPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRAQPGDRKCNIWVYCPSETGCHSPDIYEHKHMECWLKYAENPKLNFKTNYPQSY
        LPGRP+PPECH ELHTDY GAAVRWGLTHHKESAADCCQACLDHAKRA+PG++KCNIWVYCPSE GCHSPDIY+HKH ECWLKYAE P+ +FK  Y  SY
Subjt:  LPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRAQPGDRKCNIWVYCPSETGCHSPDIYEHKHMECWLKYAENPKLNFKTNYPQSY

Query:  RNSHPTAPLVVPWVSGVLPVASHHPAAAISVAL-----SAAPFLRRSSPLRWFQALALYLANNSHCIYLKVVGKKSH---------PTMSSSSGSIGETY
        RNSHP AP VVPWV+      +     A  + L         FL    P      L     N S  +  + + K  +         P M     SI ET+
Subjt:  RNSHPTAPLVVPWVSGVLPVASHHPAAAISVAL-----SAAPFLRRSSPLRWFQALALYLANNSHCIYLKVVGKKSH---------PTMSSSSGSIGETY

Query:  YDVLSLREDASYDEIRASYRSALLNFHPDKLQAICHRSHPDDIAGERYFKVQKAWEVLGSSMSRASYDRELQAAKGDAIGAESICLEDMVVEDKGEVVEL
        YDVLS++EDASY+EIR SYRSA+LN HPDKLQ     S P   + ER+ KVQ+AWEVL +  SRA YD +L+A + DA+ A+ I L+DM+VED GE +EL
Subjt:  YDVLSLREDASYDEIRASYRSALLNFHPDKLQAICHRSHPDDIAGERYFKVQKAWEVLGSSMSRASYDRELQAAKGDAIGAESICLEDMVVEDKGEVVEL

Query:  FYQCRCGDYFFIDSGELDEMG
        FYQCRCGDYF +DS EL+ +G
Subjt:  FYQCRCGDYFFIDSGELDEMG

A0A6J1HD00 uncharacterized protein LOC1114628474.1e-17993.88Show/hide
Query:  MARVEWGYQGRGKWACSYKKTTLIICSINIIVALYVFRSLYASLYLYTDNDSRSVVKYTPDQIRKMEEFIRIRRASEPVELIKLVKGLKEFSQEQMVDEL
        MARVEWGYQGRGKWACSYKKTTLIICSINIIVALYVFRSLYASLYLYTDNDS++VVKYTPDQIRKMEEFIRIRRASEPVELIKLVK LKEFSQE+ VDEL
Subjt:  MARVEWGYQGRGKWACSYKKTTLIICSINIIVALYVFRSLYASLYLYTDNDSRSVVKYTPDQIRKMEEFIRIRRASEPVELIKLVKGLKEFSQEQMVDEL

Query:  PHPLKQKITEEILLKLRSLNASSNSTQQREAVESWRKEKLEEAYKLITEQMIENSTLSFEDAGILVKALETDWAVLSEAIGLWIPAEVFHTEHDDKPEGV
        PHPLKQKITEEILLKLRS NASSN+TQQREAVESWRKEKLEEA KLITEQMIENSTLSFEDAGILVKALE DW  LSE+IGLWIPAE+F+ EHDDKPEGV
Subjt:  PHPLKQKITEEILLKLRSLNASSNSTQQREAVESWRKEKLEEAYKLITEQMIENSTLSFEDAGILVKALETDWAVLSEAIGLWIPAEVFHTEHDDKPEGV

Query:  DEFNDEILPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRAQPGDRKCNIWVYCPSETGCHSPDIYEHKHMECWLKYAENPKLNFK
        DEF+DEILPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRA+PGDRKCNIWVYCPSE GCHSPDIYEHKHMECWLKYAENPKLNFK
Subjt:  DEFNDEILPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRAQPGDRKCNIWVYCPSETGCHSPDIYEHKHMECWLKYAENPKLNFK

Query:  TNYPQSYRNSHPTAPLVVPWVSGVLPV
        TNYPQSYRNSHPTAPLVVPWVSGV+ V
Subjt:  TNYPQSYRNSHPTAPLVVPWVSGVLPV

A0A6J1K8S9 uncharacterized protein LOC1114917061.2e-17893.58Show/hide
Query:  MARVEWGYQGRGKWACSYKKTTLIICSINIIVALYVFRSLYASLYLYTDNDSRSVVKYTPDQIRKMEEFIRIRRASEPVELIKLVKGLKEFSQEQMVDEL
        MARVEWGYQGRGKWACSYKKTTLIICSINIIVALYVFRSLYASLYLYTDNDS++V KYTPDQIRKMEEFIRIRRASEPVELIKLVK LKEFSQE+ VDEL
Subjt:  MARVEWGYQGRGKWACSYKKTTLIICSINIIVALYVFRSLYASLYLYTDNDSRSVVKYTPDQIRKMEEFIRIRRASEPVELIKLVKGLKEFSQEQMVDEL

Query:  PHPLKQKITEEILLKLRSLNASSNSTQQREAVESWRKEKLEEAYKLITEQMIENSTLSFEDAGILVKALETDWAVLSEAIGLWIPAEVFHTEHDDKPEGV
        PHPLKQKITEEILLKLRS NASSN+TQQREAVESWRKEKLEEA KLITEQMIENSTLSFEDAGILVKALE DW  LSE+IGLWIPAE+F+ EHDDKPEGV
Subjt:  PHPLKQKITEEILLKLRSLNASSNSTQQREAVESWRKEKLEEAYKLITEQMIENSTLSFEDAGILVKALETDWAVLSEAIGLWIPAEVFHTEHDDKPEGV

Query:  DEFNDEILPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRAQPGDRKCNIWVYCPSETGCHSPDIYEHKHMECWLKYAENPKLNFK
        DEF+DEILPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRA+PGDRKCNIWVYCPSE GCHSPDIYEHKHMECWLKYAENPKLNFK
Subjt:  DEFNDEILPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRAQPGDRKCNIWVYCPSETGCHSPDIYEHKHMECWLKYAENPKLNFK

Query:  TNYPQSYRNSHPTAPLVVPWVSGVLPV
        TNYPQSYRNSHPTAPLVVPWVSGV+ V
Subjt:  TNYPQSYRNSHPTAPLVVPWVSGVLPV

SwissProt top hitse value%identityAlignment
B9KH92 Chaperone protein DnaJ8.2e-0738.36Show/hide
Query:  GETYYDVLSLREDASYDEIRASYRSALLNFHPDKLQAICHRSHPDDIAGERYFKVQKAWEVLGSSMSRASYDR
        G  YY++L +  +AS +EI+ SYR  +  +HPDK       +  D  A E++ K+ +A+EVL +   RA+YDR
Subjt:  GETYYDVLSLREDASYDEIRASYRSALLNFHPDKLQAICHRSHPDDIAGERYFKVQKAWEVLGSSMSRASYDR

P47138 Diphthamide biosynthesis protein 43.9e-0931.17Show/hide
Query:  TYYDVLSLREDASYDEIRASYRSALLNFHPDKLQAICHRSHPDDIAGERYFKVQKAWEVLGSSMSRASYDR------ELQAAKGDAIGAESICLEDMVV-
        T+Y++L +  DA+ DEI+ +YR+ LLN HPDKL    H    D ++     K+Q A+++L +  +R  YDR      + Q       G +   L+D    
Subjt:  TYYDVLSLREDASYDEIRASYRSALLNFHPDKLQAICHRSHPDDIAGERYFKVQKAWEVLGSSMSRASYDR------ELQAAKGDAIGAESICLEDMVV-

Query:  EDKGEVVELFYQCR-CGDYFF-----------IDSGELDEMGYPLLRNGSKVSL
        EDK E +    +C+  G + F           +D+ E    GY LL   S  SL
Subjt:  EDKGEVVELFYQCR-CGDYFF-----------IDSGELDEMGYPLLRNGSKVSL

Q54CI5 DPH4 homolog2.1e-1028.05Show/hide
Query:  SSSSGSIGETYYDVLSLREDASYDEIRASYRSALLNFHPDKL--------------QAICHRSHPDDIAGERYFK-VQKAWEVLGSSMSRASYDRELQAA
        ++S+    + YY++L +  DA  +EI+ SYR   L +HPDKL                + + ++ ++ +  + F  +Q AWE L   + R  YD  L   
Subjt:  SSSSGSIGETYYDVLSLREDASYDEIRASYRSALLNFHPDKL--------------QAICHRSHPDDIAGERYFK-VQKAWEVLGSSMSRASYDRELQAA

Query:  KGDAIG-AESICLEDM-VVEDKGEVVELFYQCRCGDYFFIDSGELDEMGYPLLRNGSKVSLRTL
        K      ++ I L+DM  +E+  E V   Y CRCGD++ I   +L E    +  +G  +S++ +
Subjt:  KGDAIG-AESICLEDM-VVEDKGEVVELFYQCRCGDYFFIDSGELDEMGYPLLRNGSKVSLRTL

Q5P9E0 Chaperone protein DnaJ8.2e-0738.36Show/hide
Query:  GETYYDVLSLREDASYDEIRASYRSALLNFHPDKLQAICHRSHPDDIAGERYFKVQKAWEVLGSSMSRASYDR
        G  YY++L +  +AS +EI+ SYR  +  +HPDK       +  D  A E++ K+ +A+EVL +   RA+YDR
Subjt:  GETYYDVLSLREDASYDEIRASYRSALLNFHPDKLQAICHRSHPDDIAGERYFKVQKAWEVLGSSMSRASYDR

Q6FWM1 Diphthamide biosynthesis protein 45.7e-0829.48Show/hide
Query:  TYYDVLSLREDASYDEIRASYRSALLNFHPDKLQAICHRSHPDDIAGERYFKVQKAWEVLGSSMSRASYDRELQAAK--------GDAIGAESICLEDMV
        ++Y+VL +  DAS DEI+ +YR  LL  HPDK + +   S     +G    ++Q+A+ VL +   RA+Y+R L  +         GD  G +   L++  
Subjt:  TYYDVLSLREDASYDEIRASYRSALLNFHPDKLQAICHRSHPDDIAGERYFKVQKAWEVLGSSMSRASYDRELQAAK--------GDAIGAESICLEDMV

Query:  V-EDKGEVVELFYQCRCGDYFFIDSGELDEMGYPLLRNGSKVSLRTLDALPASVVLPCGSCSLKILGISFSND
          E+K E V    +C+ G  F +    L+E     + +G    L   +     V+  C +CSL +    F ND
Subjt:  V-EDKGEVVELFYQCRCGDYFFIDSGELDEMGYPLLRNGSKVSLRTLDALPASVVLPCGSCSLKILGISFSND

Arabidopsis top hitse value%identityAlignment
AT1G56300.1 Chaperone DnaJ-domain superfamily protein1.9e-0629.2Show/hide
Query:  SIGETYYDVLSLREDASYDEIRASYRSALLNFHPDKLQAICHRSHPDDIAGE---RYFKVQKAWEVLGSSMSRASYDRELQAAKGDAIGAESICLEDMV-
        ++  +YY +L +R+DAS  +IR +YR   + +HPD+       +    +AGE   R+ ++Q+A+ VL     R+ YD  L     D        +++M+ 
Subjt:  SIGETYYDVLSLREDASYDEIRASYRSALLNFHPDKLQAICHRSHPDDIAGE---RYFKVQKAWEVLGSSMSRASYDRELQAAKGDAIGAESICLEDMV-

Query:  ----VEDKGEVVE
            V+D GE +E
Subjt:  ----VEDKGEVVE

AT1G74250.1 DNAJ heat shock N-terminal domain-containing protein6.4e-0740Show/hide
Query:  MSSSSGSIGETYYDVLSLREDASYDEIRASYRSALLNFHPDKLQAICHRSHPDDIAGERYFKVQKAWEVLGSSMSRASYD
        M+SSS S    +Y+VL + +++S DEIR+SYR   L  HPDKL      S  +  A  ++ ++  A+EVL     RA YD
Subjt:  MSSSSGSIGETYYDVLSLREDASYDEIRASYRSALLNFHPDKLQAICHRSHPDDIAGERYFKVQKAWEVLGSSMSRASYD

AT4G10130.1 DNAJ heat shock N-terminal domain-containing protein2.0e-4556.96Show/hide
Query:  IGETYYDVLSLREDASYDEIRASYRSALLNFHPDKLQAICHRSHPDDIAGERYFKVQKAWEVLGSSMSRASYDRELQAAKGDAIGAESICLEDMVVEDKG
        + ETYY++LS++EDASY+EIR SYRSA+L+ HPDKL     RS  DD   E++ K+QKAWEVL  +  R  YD +L++++ D I A+ I +EDM VE  G
Subjt:  IGETYYDVLSLREDASYDEIRASYRSALLNFHPDKLQAICHRSHPDDIAGERYFKVQKAWEVLGSSMSRASYDRELQAAKGDAIGAESICLEDMVVEDKG

Query:  EVVELFYQCRCGDYFFIDSGELDEMGYPLLRNGSKVSLRTLDALPASVVLPCGSCSLK
        +V++LFYQCRCGDYF +DS EL  MG+ LLR+G  V ++ L A  ASVVLPCGSCSLK
Subjt:  EVVELFYQCRCGDYFFIDSGELDEMGYPLLRNGSKVSLRTLDALPASVVLPCGSCSLK

AT4G33380.1 unknown protein4.3e-11263.16Show/hide
Query:  EWGYQGRGKWACSYKKTTLIICSINIIVALYVFRSLYA-SLYLYTDNDSRSVVKYTPDQIRKMEEFIRIRRASEPVELIKLVKGLK-EFSQEQMVDELPH
        EW     G    S+K+ TL++C  NI++AL+V R LYA SL++Y++ND  +VVKYT D+IRKMEE IRIRR+ EP  +++LVK LK E S  +   EL  
Subjt:  EWGYQGRGKWACSYKKTTLIICSINIIVALYVFRSLYA-SLYLYTDNDSRSVVKYTPDQIRKMEEFIRIRRASEPVELIKLVKGLK-EFSQEQMVDELPH

Query:  PLKQKITEEILLKLRSLNASSNSTQQREAVESWRKEKLEEAYKLITEQMIENSTLSFEDAGILVKALETDWAVLSEAIGLWIPAEVFHTEHDDKPEGVDE
         +K K+ +EIL +L+S    SN TQ RE VE+WR EKLEEA +LI  Q   NSTL  E+AG+LV+ALE +W VLSE IG W+PAEV + EHDDKPEG +E
Subjt:  PLKQKITEEILLKLRSLNASSNSTQQREAVESWRKEKLEEAYKLITEQMIENSTLSFEDAGILVKALETDWAVLSEAIGLWIPAEVFHTEHDDKPEGVDE

Query:  FNDEILPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRAQPGDRKCNIWVYCPSETGCHSPDIYEHKHMECWLKYAENPKLNFKTN
          +EIL GRPVP  C+AELHTDY GAAVRWGLTHHKESAADCCQACLD AKRA+PG+ +CNIWVYCPSE GC SPDIYEHKH ECWLKYAE PK NFK  
Subjt:  FNDEILPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRAQPGDRKCNIWVYCPSETGCHSPDIYEHKHMECWLKYAENPKLNFKTN

Query:  YPQSYRNSHPTAPLVVPWVSGVL
        Y ++YRN+HP AP +VPWVSGV+
Subjt:  YPQSYRNSHPTAPLVVPWVSGVL

AT4G33380.2 unknown protein5.3e-11062.42Show/hide
Query:  EWGYQGRGKWACSYKKTTLIICSINIIVALYVFRSLYA-SLYLYTDNDSRSVVKYTPDQIRKMEEFIRIRRASEPVELIKLVKGLKEFSQEQMVDELPHP
        EW     G    S+K+ TL++C  NI++AL+V R LYA SL++Y++ND  +VVKYT D+IRKMEE IRIRR+ EP  +++L K   E S  +   EL   
Subjt:  EWGYQGRGKWACSYKKTTLIICSINIIVALYVFRSLYA-SLYLYTDNDSRSVVKYTPDQIRKMEEFIRIRRASEPVELIKLVKGLKEFSQEQMVDELPHP

Query:  LKQKITEEILLKLRSLNASSNSTQQREAVESWRKEKLEEAYKLITEQMIENSTLSFEDAGILVKALETDWAVLSEAIGLWIPAEVFHTEHDDKPEGVDEF
        +K K+ +EIL +L+S    SN TQ RE VE+WR EKLEEA +LI  Q   NSTL  E+AG+LV+ALE +W VLSE IG W+PAEV + EHDDKPEG +E 
Subjt:  LKQKITEEILLKLRSLNASSNSTQQREAVESWRKEKLEEAYKLITEQMIENSTLSFEDAGILVKALETDWAVLSEAIGLWIPAEVFHTEHDDKPEGVDEF

Query:  NDEILPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRAQPGDRKCNIWVYCPSETGCHSPDIYEHKHMECWLKYAENPKLNFKTNY
         +EIL GRPVP  C+AELHTDY GAAVRWGLTHHKESAADCCQACLD AKRA+PG+ +CNIWVYCPSE GC SPDIYEHKH ECWLKYAE PK NFK  Y
Subjt:  NDEILPGRPVPPECHAELHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRAQPGDRKCNIWVYCPSETGCHSPDIYEHKHMECWLKYAENPKLNFKTNY

Query:  PQSYRNSHPTAPLVVPWVSGVL
         ++YRN+HP AP +VPWVSGV+
Subjt:  PQSYRNSHPTAPLVVPWVSGVL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAGGGTAGAGTGGGGTTATCAAGGAAGAGGTAAATGGGCGTGTTCGTACAAGAAAACCACTCTCATCATTTGCTCTATCAACATTATCGTTGCTCTCTATGTTTT
TCGATCTCTATATGCCTCCCTCTACCTCTATACGGATAATGATTCGCGAAGTGTTGTAAAGTACACTCCGGATCAGATTAGAAAAATGGAAGAGTTTATTCGGATTCGTA
GGGCCTCTGAGCCGGTAGAGCTCATTAAATTGGTCAAGGGGCTTAAAGAATTCTCCCAAGAACAGATGGTGGATGAATTACCTCATCCATTGAAACAGAAGATAACCGAG
GAAATTCTTCTAAAGTTGAGAAGCTTGAATGCTAGCTCAAATAGTACTCAGCAGAGAGAAGCAGTAGAAAGCTGGAGAAAGGAAAAATTGGAAGAAGCATATAAGTTGAT
CACAGAGCAAATGATTGAAAATTCAACTCTTTCTTTTGAAGATGCTGGGATTCTAGTAAAAGCCTTGGAGACTGATTGGGCAGTGCTGTCTGAAGCAATTGGTCTGTGGA
TACCTGCAGAGGTCTTCCACACGGAACATGATGATAAGCCTGAGGGCGTAGATGAATTCAATGATGAGATTTTACCTGGAAGACCAGTTCCACCTGAATGTCATGCTGAG
CTCCATACAGATTACGATGGTGCTGCAGTAAGATGGGGTCTAACTCATCATAAAGAAAGTGCAGCTGATTGCTGTCAGGCTTGCTTGGATCACGCAAAACGGGCGCAACC
AGGTGACAGGAAATGCAACATCTGGGTTTACTGTCCATCCGAGACTGGATGCCACTCTCCTGACATCTACGAACATAAACATATGGAATGCTGGCTGAAATATGCAGAAA
ATCCCAAACTGAATTTCAAAACCAACTATCCGCAATCTTATAGAAACTCACACCCGACTGCGCCGCTGGTTGTACCTTGGGTCTCTGGTGTTCTTCCCGTTGCGTCGCAC
CACCCCGCCGCCGCCATCTCCGTCGCGCTCAGCGCCGCACCCTTTCTTCGCCGATCTTCTCCCCTCCGTTGGTTTCAGGCACTAGCACTGTATTTAGCTAATAATTCCCA
TTGCATATATCTCAAGGTTGTTGGAAAAAAATCCCATCCCACAATGAGTTCAAGCAGTGGCTCCATTGGTGAAACTTATTATGATGTACTGTCTTTGAGAGAAGATGCTA
GCTATGACGAAATTCGAGCCAGCTATCGGTCTGCTCTCCTTAATTTCCACCCTGATAAGTTACAAGCTATATGCCATAGATCCCATCCAGACGACATTGCGGGAGAAAGA
TACTTCAAGGTGCAGAAGGCTTGGGAAGTCCTCGGCAGCTCAATGTCCCGTGCATCTTATGACAGAGAGCTCCAAGCTGCCAAAGGGGATGCAATTGGTGCAGAGAGCAT
ATGCTTAGAGGATATGGTTGTGGAAGATAAAGGTGAAGTTGTAGAACTTTTTTATCAGTGCCGTTGTGGGGATTACTTCTTTATTGATTCAGGGGAGTTGGATGAAATGG
GATATCCATTGTTGAGGAATGGAAGTAAGGTTTCTTTAAGGACTCTGGATGCTTTGCCTGCTTCCGTTGTTTTACCTTGTGGTTCTTGCTCTTTGAAAATCCTCGGCATT
TCATTCTCGAATGATTTCGATGTTGTAAAGAGAGCTTATGACTGTGTTGCGCTCTTGGGTAGAGTTTACGTCGGTTTTCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATTTAATTCATCTCTGTTTTTTTCTTTTATGCCCCAAAAAAGAAACAGTGCCATAGTTGGTTAGCATTTAACCATCGACAAAAGGAAAGCTTTGATGCTCCGGCGCACTT
TAACGAATTGTCGGCGACGGACGGCCAAGTGAAATTTTGTCAGATAGCTTACAAAACCTCCGGTGCACGTTAGCTCCACCATCGCCGTCGGAATTGAAGGACGCGGCCGA
GCTCTCTGTCCGATTGCATTGTTGAAATCGCCTGCAGCGGCAAAGAGGCAGCAAGATACGGCGCACAATAGCGCGGAAAATCTTGACCGAATTCAATAATTGTATTGGTT
AGTTGTGTGTTATCAGGAAAACAAATATTATTGTCAGAGCGATTCCGCTTGTTACTGTTTTCCGAAAAAGCAATTTACGGATTTGGCGTGAAGATCCATCGGATCGATCT
GCTTCAAGGGTAAGAAAGATTGGGGAGGAGAGGCGTTGAAATTTTGCTTCGATTTTCCGTTTTTTGTTTCTGGTGTAAGGTATTAATATATTGAGAGAGGAGATATGGTG
GTGGGATGAGGGTGAAGAAGAAGGGATATGGCGAGGGTAGAGTGGGGTTATCAAGGAAGAGGTAAATGGGCGTGTTCGTACAAGAAAACCACTCTCATCATTTGCTCTAT
CAACATTATCGTTGCTCTCTATGTTTTTCGATCTCTATATGCCTCCCTCTACCTCTATACGGATAATGATTCGCGAAGTGTTGTAAAGTACACTCCGGATCAGATTAGAA
AAATGGAAGAGTTTATTCGGATTCGTAGGGCCTCTGAGCCGGTAGAGCTCATTAAATTGGTCAAGGGGCTTAAAGAATTCTCCCAAGAACAGATGGTGGATGAATTACCT
CATCCATTGAAACAGAAGATAACCGAGGAAATTCTTCTAAAGTTGAGAAGCTTGAATGCTAGCTCAAATAGTACTCAGCAGAGAGAAGCAGTAGAAAGCTGGAGAAAGGA
AAAATTGGAAGAAGCATATAAGTTGATCACAGAGCAAATGATTGAAAATTCAACTCTTTCTTTTGAAGATGCTGGGATTCTAGTAAAAGCCTTGGAGACTGATTGGGCAG
TGCTGTCTGAAGCAATTGGTCTGTGGATACCTGCAGAGGTCTTCCACACGGAACATGATGATAAGCCTGAGGGCGTAGATGAATTCAATGATGAGATTTTACCTGGAAGA
CCAGTTCCACCTGAATGTCATGCTGAGCTCCATACAGATTACGATGGTGCTGCAGTAAGATGGGGTCTAACTCATCATAAAGAAAGTGCAGCTGATTGCTGTCAGGCTTG
CTTGGATCACGCAAAACGGGCGCAACCAGGTGACAGGAAATGCAACATCTGGGTTTACTGTCCATCCGAGACTGGATGCCACTCTCCTGACATCTACGAACATAAACATA
TGGAATGCTGGCTGAAATATGCAGAAAATCCCAAACTGAATTTCAAAACCAACTATCCGCAATCTTATAGAAACTCACACCCGACTGCGCCGCTGGTTGTACCTTGGGTC
TCTGGTGTTCTTCCCGTTGCGTCGCACCACCCCGCCGCCGCCATCTCCGTCGCGCTCAGCGCCGCACCCTTTCTTCGCCGATCTTCTCCCCTCCGTTGGTTTCAGGCACT
AGCACTGTATTTAGCTAATAATTCCCATTGCATATATCTCAAGGTTGTTGGAAAAAAATCCCATCCCACAATGAGTTCAAGCAGTGGCTCCATTGGTGAAACTTATTATG
ATGTACTGTCTTTGAGAGAAGATGCTAGCTATGACGAAATTCGAGCCAGCTATCGGTCTGCTCTCCTTAATTTCCACCCTGATAAGTTACAAGCTATATGCCATAGATCC
CATCCAGACGACATTGCGGGAGAAAGATACTTCAAGGTGCAGAAGGCTTGGGAAGTCCTCGGCAGCTCAATGTCCCGTGCATCTTATGACAGAGAGCTCCAAGCTGCCAA
AGGGGATGCAATTGGTGCAGAGAGCATATGCTTAGAGGATATGGTTGTGGAAGATAAAGGTGAAGTTGTAGAACTTTTTTATCAGTGCCGTTGTGGGGATTACTTCTTTA
TTGATTCAGGGGAGTTGGATGAAATGGGATATCCATTGTTGAGGAATGGAAGTAAGGTTTCTTTAAGGACTCTGGATGCTTTGCCTGCTTCCGTTGTTTTACCTTGTGGT
TCTTGCTCTTTGAAAATCCTCGGCATTTCATTCTCGAATGATTTCGATGTTGTAAAGAGAGCTTATGACTGTGTTGCGCTCTTGGGTAGAGTTTACGTCGGTTTTCTTTA
A
Protein sequenceShow/hide protein sequence
MARVEWGYQGRGKWACSYKKTTLIICSINIIVALYVFRSLYASLYLYTDNDSRSVVKYTPDQIRKMEEFIRIRRASEPVELIKLVKGLKEFSQEQMVDELPHPLKQKITE
EILLKLRSLNASSNSTQQREAVESWRKEKLEEAYKLITEQMIENSTLSFEDAGILVKALETDWAVLSEAIGLWIPAEVFHTEHDDKPEGVDEFNDEILPGRPVPPECHAE
LHTDYDGAAVRWGLTHHKESAADCCQACLDHAKRAQPGDRKCNIWVYCPSETGCHSPDIYEHKHMECWLKYAENPKLNFKTNYPQSYRNSHPTAPLVVPWVSGVLPVASH
HPAAAISVALSAAPFLRRSSPLRWFQALALYLANNSHCIYLKVVGKKSHPTMSSSSGSIGETYYDVLSLREDASYDEIRASYRSALLNFHPDKLQAICHRSHPDDIAGER
YFKVQKAWEVLGSSMSRASYDRELQAAKGDAIGAESICLEDMVVEDKGEVVELFYQCRCGDYFFIDSGELDEMGYPLLRNGSKVSLRTLDALPASVVLPCGSCSLKILGI
SFSNDFDVVKRAYDCVALLGRVYVGFL