CuGenDBv2

Tan0011083 (gene) of Snake gourd v1 genome

Gene ID: Tan0011083
Organism: Trichosanthes anguina (Snake gourd v1)
Description: N-acetyl-glutamate semialdehyde dehydrogenase
Genome location: LG02:9881202..9884498
RNA-Seq Expression: Tan0011083
Synteny: Tan0011083
Gene Ontology terms:
GO:0009658 - chloroplast organization (biological process)
GO:0071472 - cellular response to salt stress (biological process)
GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
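
The genome location field above packs a sequence name and a coordinate range into one string. A minimal sketch of how it could be split and the gene span computed follows; the helper name `parse_location` is hypothetical, and the span assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'NAME:START..END' string into (name, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    name, start, end = match.groups()
    return name, int(start), int(end)


name, start, end = parse_location("LG02:9881202..9884498")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(name, start, end, span)  # LG02 9881202 9884498 3297
```

Under that assumption the gene spans 3297 bp on LG02, longer than the mRNA sequence listed further down, which would be consistent with the presence of introns.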


Homology
GenBank top hits
KAG7022436.1 hypothetical protein SDJN02_16167 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.4e-119; identity: 89.83%)
Query:  MVTHLQISTKMPLHAPVLLAPRLRSPECSHVVSLKPHKRFQNARSVVSCAMNMTAGQSDDPGRVSWDSLKDKVKKLWDNSPEPVKCFPWNEALDNFIQLI
        M THLQIS KM LHAPVLLAPR++SPECSHVVSLKPHKR QNARS+VSCAMNM+AGQSDDPGR SWDSLKD+VKKLWDNSPEPVKCFPWNEAL+NFIQLI
Subjt:  MVTHLQISTKMPLHAPVLLAPRLRSPECSHVVSLKPHKRFQNARSVVSCAMNMTAGQSDDPGRVSWDSLKDKVKKLWDNSPEPVKCFPWNEALDNFIQLI

Query:  ADLILSVIKYLCVPLLAITSLSEMSYCAHEKKLLIVPFPLIIGFSVAEVMRQTALSLSPILKDLEVPWHLITIAIFFTLIKLPGPYYPYWGRIFIPHFAN
        ADLILSVIKYL +PLLA+TSLSEMSYCAHEKKLLIVPFPLIIGFSVAEVMRQTALSLSPILKDLEVPWHLITIA+FFTLIKLPGPYYPYWGRIF+PHFAN
Subjt:  ADLILSVIKYLCVPLLAITSLSEMSYCAHEKKLLIVPFPLIIGFSVAEVMRQTALSLSPILKDLEVPWHLITIAIFFTLIKLPGPYYPYWGRIFIPHFAN

Query:  GGLLRTLWSLFLWFRRPRKISTSLKHNENHLDTKKN
        GGLLRT+WS+FLWFRRPRK S  LK  ENHLDT KN
Subjt:  GGLLRTLWSLFLWFRRPRKISTSLKHNENHLDTKKN

XP_022136318.1 uncharacterized protein LOC111008035 [Momordica charantia] (e value: 4.1e-119; identity: 89.45%)
Query:  MVTHLQISTKMPLHAPVLLAPR-LRSPECSHVVSLKPHKRFQNARSVVSCAMNMTAGQSDDPGRVSWDSLKDKVKKLWDNSPEPVKCFPWNEALDNFIQL
        M THLQISTKM LH+PVL APR  RSPEC+HVVS KPHKRFQNARS+VSCAMNMTAGQSDDPG+VSWDSLKD+VKKLWD+SPEPVKCFPWNEALDNFIQL
Subjt:  MVTHLQISTKMPLHAPVLLAPR-LRSPECSHVVSLKPHKRFQNARSVVSCAMNMTAGQSDDPGRVSWDSLKDKVKKLWDNSPEPVKCFPWNEALDNFIQL

Query:  IADLILSVIKYLCVPLLAITSLSEMSYCAHEKKLLIVPFPLIIGFSVAEVMRQTALSLSPILKDLEVPWHLITIAIFFTLIKLPGPYYPYWGRIFIPHFA
        IADL+L+VIKYLCVPLLA+TSLSEMSYCAHEKKL +VPFPLI+GFSVAEVMRQTALSLSPILKDLEVPWHLITIAIFFTLIKLPGPYYPYWGRIFIPHFA
Subjt:  IADLILSVIKYLCVPLLAITSLSEMSYCAHEKKLLIVPFPLIIGFSVAEVMRQTALSLSPILKDLEVPWHLITIAIFFTLIKLPGPYYPYWGRIFIPHFA

Query:  NGGLLRTLWSLFLWFRRPRKISTSLKHNENHLDTKKN
        NGGLLRTLWS FLWFRRPRK ST LKHN  HL+T KN
Subjt:  NGGLLRTLWSLFLWFRRPRKISTSLKHNENHLDTKKN

XP_022927689.1 uncharacterized protein LOC111434507 [Cucurbita moschata] (e value: 6.3e-120; identity: 89.83%)
Query:  MVTHLQISTKMPLHAPVLLAPRLRSPECSHVVSLKPHKRFQNARSVVSCAMNMTAGQSDDPGRVSWDSLKDKVKKLWDNSPEPVKCFPWNEALDNFIQLI
        M THLQIS KM LHAPVLLAPRLRSPECSHVVSLKPHKR QNARS+VSCAMNM+AGQSDDPGR SWDSLKD+VKKLWDNSPEPVKCFPWNEAL+NFIQLI
Subjt:  MVTHLQISTKMPLHAPVLLAPRLRSPECSHVVSLKPHKRFQNARSVVSCAMNMTAGQSDDPGRVSWDSLKDKVKKLWDNSPEPVKCFPWNEALDNFIQLI

Query:  ADLILSVIKYLCVPLLAITSLSEMSYCAHEKKLLIVPFPLIIGFSVAEVMRQTALSLSPILKDLEVPWHLITIAIFFTLIKLPGPYYPYWGRIFIPHFAN
        ADLILSVIKYL +PLLA+TSLSEMSYCAHEKKLLIVPFP+IIGFSVAEVMRQTALSLSPILKDLEVPWHLITIA+FFTL+KLPGPYYPYWGRIF+PHFAN
Subjt:  ADLILSVIKYLCVPLLAITSLSEMSYCAHEKKLLIVPFPLIIGFSVAEVMRQTALSLSPILKDLEVPWHLITIAIFFTLIKLPGPYYPYWGRIFIPHFAN

Query:  GGLLRTLWSLFLWFRRPRKISTSLKHNENHLDTKKN
        GGLLRT+WS+FLWFRRPRK S  LK  ENHLDT KN
Subjt:  GGLLRTLWSLFLWFRRPRKISTSLKHNENHLDTKKN

XP_022988926.1 uncharacterized protein LOC111486136 [Cucurbita maxima] (e value: 1.2e-118; identity: 89.41%)
Query:  MVTHLQISTKMPLHAPVLLAPRLRSPECSHVVSLKPHKRFQNARSVVSCAMNMTAGQSDDPGRVSWDSLKDKVKKLWDNSPEPVKCFPWNEALDNFIQLI
        M THLQIS KM LHAPVLLAPRLRSPECSHVVSLKPHKR QNARS+VSCAMNM+AGQSDDPGR SWDSLK++VKKLWDNSPEPVKCFPWNEAL+NFIQLI
Subjt:  MVTHLQISTKMPLHAPVLLAPRLRSPECSHVVSLKPHKRFQNARSVVSCAMNMTAGQSDDPGRVSWDSLKDKVKKLWDNSPEPVKCFPWNEALDNFIQLI

Query:  ADLILSVIKYLCVPLLAITSLSEMSYCAHEKKLLIVPFPLIIGFSVAEVMRQTALSLSPILKDLEVPWHLITIAIFFTLIKLPGPYYPYWGRIFIPHFAN
        ADLILSVIKYL VPLL +TSLSEMSYCAHEKKLLIVPFPLIIGFSVAEVMRQTALSLSPILKDLEVPWHLITIA+FFTLIKLPGPYYPYWGRIF+PHFAN
Subjt:  ADLILSVIKYLCVPLLAITSLSEMSYCAHEKKLLIVPFPLIIGFSVAEVMRQTALSLSPILKDLEVPWHLITIAIFFTLIKLPGPYYPYWGRIFIPHFAN

Query:  GGLLRTLWSLFLWFRRPRKISTSLKHNENHLDTKKN
        GGLLRT+WS+F WFRRPR+ S  LK  ENHLDT KN
Subjt:  GGLLRTLWSLFLWFRRPRKISTSLKHNENHLDTKKN

XP_023530129.1 uncharacterized protein LOC111792778 [Cucurbita pepo subsp. pepo] (e value: 5.3e-119; identity: 90.25%)
Query:  MVTHLQISTKMPLHAPVLLAPRLRSPECSHVVSLKPHKRFQNARSVVSCAMNMTAGQSDDPGRVSWDSLKDKVKKLWDNSPEPVKCFPWNEALDNFIQLI
        M THLQIS KM LHAPVLLAPRLRSPECSHVVSLKPHKR QNARS+VSCAMNM+AGQSDDPGR SWDSLK +VKKLWDNSPEPVKCFPWNEAL+NFIQLI
Subjt:  MVTHLQISTKMPLHAPVLLAPRLRSPECSHVVSLKPHKRFQNARSVVSCAMNMTAGQSDDPGRVSWDSLKDKVKKLWDNSPEPVKCFPWNEALDNFIQLI

Query:  ADLILSVIKYLCVPLLAITSLSEMSYCAHEKKLLIVPFPLIIGFSVAEVMRQTALSLSPILKDLEVPWHLITIAIFFTLIKLPGPYYPYWGRIFIPHFAN
        ADLILSVIKYL VPLL +TSLSEMSYCAHEKKLLIVPFPLIIGFSVAEVMRQTALSLSPILKDLEVPWHLITIA+FFTLIKLPGPYYPYWGRIF+PHFAN
Subjt:  ADLILSVIKYLCVPLLAITSLSEMSYCAHEKKLLIVPFPLIIGFSVAEVMRQTALSLSPILKDLEVPWHLITIAIFFTLIKLPGPYYPYWGRIFIPHFAN

Query:  GGLLRTLWSLFLWFRRPRKISTSLKHNENHLDTKKN
        GGLLRT+WS+FLWFRRPRK S  LK  ENHLDT KN
Subjt:  GGLLRTLWSLFLWFRRPRKISTSLKHNENHLDTKKN

TrEMBL top hits
A0A0A0K6P0 Uncharacterized protein (e value: 2.9e-110; identity: 85.47%)
Query:  THLQISTKMPLHAPVLLAPRLRSPECSHVVSLKPHKRFQNARSVVSCAMNMTAGQSDDPGRVSWDSLKDKVKKLWDNSPEPVKCFPWNEALDNFIQLIAD
        TH Q S K+ LHAP LLAP  RSPECSHVVSLKPHKRFQNA SVVSCA NMTAGQSDD GRV+WDSLK++VK+LW++SPEPVK FPWNEALDNFIQLIAD
Subjt:  THLQISTKMPLHAPVLLAPRLRSPECSHVVSLKPHKRFQNARSVVSCAMNMTAGQSDDPGRVSWDSLKDKVKKLWDNSPEPVKCFPWNEALDNFIQLIAD

Query:  LILSVIKYLCVPLLAITSLSEMSYCAHEKKLLIVPFPLIIGFSVAEVMRQTALSLSPILKDLEVPWHLITIAIFFTLIKLPGPYYPYWGRIFIPHFANGG
        LIL+VIKYL VPLL +TSLSEMSYCAHEKKLLIVPF  IIGFS AEVMRQTALSLSPILKDLEVPWHLITIAIFFTLIKLPGP YPYWGRIFIPH ANGG
Subjt:  LILSVIKYLCVPLLAITSLSEMSYCAHEKKLLIVPFPLIIGFSVAEVMRQTALSLSPILKDLEVPWHLITIAIFFTLIKLPGPYYPYWGRIFIPHFANGG

Query:  LLRTLWSLFLWFRRPRKISTSLKHNENHLDTKKN
        L+RT WS+FLWFRRP+K S  LKHNENHLDTKKN
Subjt:  LLRTLWSLFLWFRRPRKISTSLKHNENHLDTKKN

A0A1S3C0E1 uncharacterized protein LOC103495211 (e value: 1.7e-115; identity: 88.46%)
Query:  THLQISTKMPLHAPVLLAPRLRSPECSHVVSLKPHKRFQNARSVVSCAMNMTAGQSDDPGRVSWDSLKDKVKKLWDNSPEPVKCFPWNEALDNFIQLIAD
        TH Q S KM LHAP LLAP  RSPECSH+VSLKPHKRFQNARSVVSCAMNMTAGQSDD GRV+WDSLK+KVK+LWD SPEPVK FPWNEALDNFIQLIAD
Subjt:  THLQISTKMPLHAPVLLAPRLRSPECSHVVSLKPHKRFQNARSVVSCAMNMTAGQSDDPGRVSWDSLKDKVKKLWDNSPEPVKCFPWNEALDNFIQLIAD

Query:  LILSVIKYLCVPLLAITSLSEMSYCAHEKKLLIVPFPLIIGFSVAEVMRQTALSLSPILKDLEVPWHLITIAIFFTLIKLPGPYYPYWGRIFIPHFANGG
        LIL+VIKYL VPLL +TSLSEMSYCAHEKKLLIVPFP IIGFSVAEVMRQTALSLSPILKDLEVPWHLITIAIFFTLIKLPGPY+PYWGRIFIPH ANGG
Subjt:  LILSVIKYLCVPLLAITSLSEMSYCAHEKKLLIVPFPLIIGFSVAEVMRQTALSLSPILKDLEVPWHLITIAIFFTLIKLPGPYYPYWGRIFIPHFANGG

Query:  LLRTLWSLFLWFRRPRKISTSLKHNENHLDTKKN
        L+RTLWS FLWFRRP+K S  LKHNENHLDTKKN
Subjt:  LLRTLWSLFLWFRRPRKISTSLKHNENHLDTKKN

A0A6J1C7B2 uncharacterized protein LOC111008035 (e value: 2.0e-119; identity: 89.45%)
Query:  MVTHLQISTKMPLHAPVLLAPR-LRSPECSHVVSLKPHKRFQNARSVVSCAMNMTAGQSDDPGRVSWDSLKDKVKKLWDNSPEPVKCFPWNEALDNFIQL
        M THLQISTKM LH+PVL APR  RSPEC+HVVS KPHKRFQNARS+VSCAMNMTAGQSDDPG+VSWDSLKD+VKKLWD+SPEPVKCFPWNEALDNFIQL
Subjt:  MVTHLQISTKMPLHAPVLLAPR-LRSPECSHVVSLKPHKRFQNARSVVSCAMNMTAGQSDDPGRVSWDSLKDKVKKLWDNSPEPVKCFPWNEALDNFIQL

Query:  IADLILSVIKYLCVPLLAITSLSEMSYCAHEKKLLIVPFPLIIGFSVAEVMRQTALSLSPILKDLEVPWHLITIAIFFTLIKLPGPYYPYWGRIFIPHFA
        IADL+L+VIKYLCVPLLA+TSLSEMSYCAHEKKL +VPFPLI+GFSVAEVMRQTALSLSPILKDLEVPWHLITIAIFFTLIKLPGPYYPYWGRIFIPHFA
Subjt:  IADLILSVIKYLCVPLLAITSLSEMSYCAHEKKLLIVPFPLIIGFSVAEVMRQTALSLSPILKDLEVPWHLITIAIFFTLIKLPGPYYPYWGRIFIPHFA

Query:  NGGLLRTLWSLFLWFRRPRKISTSLKHNENHLDTKKN
        NGGLLRTLWS FLWFRRPRK ST LKHN  HL+T KN
Subjt:  NGGLLRTLWSLFLWFRRPRKISTSLKHNENHLDTKKN

A0A6J1EIP9 uncharacterized protein LOC111434507 (e value: 3.0e-120; identity: 89.83%)
Query:  MVTHLQISTKMPLHAPVLLAPRLRSPECSHVVSLKPHKRFQNARSVVSCAMNMTAGQSDDPGRVSWDSLKDKVKKLWDNSPEPVKCFPWNEALDNFIQLI
        M THLQIS KM LHAPVLLAPRLRSPECSHVVSLKPHKR QNARS+VSCAMNM+AGQSDDPGR SWDSLKD+VKKLWDNSPEPVKCFPWNEAL+NFIQLI
Subjt:  MVTHLQISTKMPLHAPVLLAPRLRSPECSHVVSLKPHKRFQNARSVVSCAMNMTAGQSDDPGRVSWDSLKDKVKKLWDNSPEPVKCFPWNEALDNFIQLI

Query:  ADLILSVIKYLCVPLLAITSLSEMSYCAHEKKLLIVPFPLIIGFSVAEVMRQTALSLSPILKDLEVPWHLITIAIFFTLIKLPGPYYPYWGRIFIPHFAN
        ADLILSVIKYL +PLLA+TSLSEMSYCAHEKKLLIVPFP+IIGFSVAEVMRQTALSLSPILKDLEVPWHLITIA+FFTL+KLPGPYYPYWGRIF+PHFAN
Subjt:  ADLILSVIKYLCVPLLAITSLSEMSYCAHEKKLLIVPFPLIIGFSVAEVMRQTALSLSPILKDLEVPWHLITIAIFFTLIKLPGPYYPYWGRIFIPHFAN

Query:  GGLLRTLWSLFLWFRRPRKISTSLKHNENHLDTKKN
        GGLLRT+WS+FLWFRRPRK S  LK  ENHLDT KN
Subjt:  GGLLRTLWSLFLWFRRPRKISTSLKHNENHLDTKKN

A0A6J1JED8 uncharacterized protein LOC111486136 (e value: 5.7e-119; identity: 89.41%)
Query:  MVTHLQISTKMPLHAPVLLAPRLRSPECSHVVSLKPHKRFQNARSVVSCAMNMTAGQSDDPGRVSWDSLKDKVKKLWDNSPEPVKCFPWNEALDNFIQLI
        M THLQIS KM LHAPVLLAPRLRSPECSHVVSLKPHKR QNARS+VSCAMNM+AGQSDDPGR SWDSLK++VKKLWDNSPEPVKCFPWNEAL+NFIQLI
Subjt:  MVTHLQISTKMPLHAPVLLAPRLRSPECSHVVSLKPHKRFQNARSVVSCAMNMTAGQSDDPGRVSWDSLKDKVKKLWDNSPEPVKCFPWNEALDNFIQLI

Query:  ADLILSVIKYLCVPLLAITSLSEMSYCAHEKKLLIVPFPLIIGFSVAEVMRQTALSLSPILKDLEVPWHLITIAIFFTLIKLPGPYYPYWGRIFIPHFAN
        ADLILSVIKYL VPLL +TSLSEMSYCAHEKKLLIVPFPLIIGFSVAEVMRQTALSLSPILKDLEVPWHLITIA+FFTLIKLPGPYYPYWGRIF+PHFAN
Subjt:  ADLILSVIKYLCVPLLAITSLSEMSYCAHEKKLLIVPFPLIIGFSVAEVMRQTALSLSPILKDLEVPWHLITIAIFFTLIKLPGPYYPYWGRIFIPHFAN

Query:  GGLLRTLWSLFLWFRRPRKISTSLKHNENHLDTKKN
        GGLLRT+WS+F WFRRPR+ S  LK  ENHLDT KN
Subjt:  GGLLRTLWSLFLWFRRPRKISTSLKHNENHLDTKKN

SwissProt top hits
No hits found
Arabidopsis top hits
AT1G49510.1 embryo defective 1273 (e value: 1.3e-62; identity: 58.51%)
Query:  RSVVSCAMNMTAGQS-DDPGRVSWDSLKDKVKKLWDNSPEPVKCFPWNEALDNFIQLIADLILSVIKYLCVPLLAITSLSEMSYCAHEKKLLIVPFPLII
        RS++  A++M +G S + P ++++D+L  + K +WDNSP+PVK FPWN A  NFIQL+ DL +SV+K+L VP+LA++S+SEMSYCAHE+KL +VPFPL+I
Subjt:  RSVVSCAMNMTAGQS-DDPGRVSWDSLKDKVKKLWDNSPEPVKCFPWNEALDNFIQLIADLILSVIKYLCVPLLAITSLSEMSYCAHEKKLLIVPFPLII

Query:  GFSVAEVMRQTALSLSPILKDLEVPWHLITIAIFFTLIKLPGPYYPYWGRIFIPHFANGGLLRTLWSLFLWFRRPRKISTSLKHNENH
        G  VA V+++TAL +SP LK+ EVPWHLI + +FFTLIKLPGPYYPYWGR+ +PHFANG LLR LWS+F W+++ R  +TS    +NH
Subjt:  GFSVAEVMRQTALSLSPILKDLEVPWHLITIAIFFTLIKLPGPYYPYWGRIFIPHFANGGLLRTLWSLFLWFRRPRKISTSLKHNENH
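
The unlabeled middle line of each alignment block above can be tallied to estimate identity for a segment. The sketch below assumes the usual BLASTP midline convention (a letter marks an identical residue, `+` a conservative substitution, a space any other column), which this page does not spell out; `midline_stats` is a hypothetical helper. It is applied only to the last segment of the first GenBank alignment, so the result is a per-segment figure and not the whole-alignment identity reported for that hit.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned columns) for one alignment segment.

    Assumed convention: letters in the midline are identities, '+' marks a
    conservative substitution, and spaces mark mismatches or gaps.
    """
    # Pad in case trailing spaces were stripped from the midline.
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(query)


query = "GGLLRTLWSLFLWFRRPRKISTSLKHNENHLDTKKN"
midline = "GGLLRT+WS+FLWFRRPRK S  LK  ENHLDT KN"
identities, positives, columns = midline_stats(query, midline)
print(identities, positives, columns)  # 28 30 36
print(f"{100 * identities / columns:.2f}% identity in this segment")
```

Summing the same counts over every segment of one hit would give a whole-alignment figure comparable to the reported identity value.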


Sequences
CDS sequence
ATGGTGACCCACCTTCAAATTTCCACCAAAATGCCCCTGCACGCCCCGGTTCTTCTAGCGCCCAGACTTCGGTCACCAGAATGCAGCCACGTAGTGAGTTTAAAACCACA
TAAAAGATTTCAGAATGCTAGATCTGTGGTGTCTTGTGCTATGAATATGACTGCAGGACAATCAGATGATCCAGGGAGGGTAAGTTGGGATAGTTTGAAGGACAAGGTAA
AAAAACTTTGGGACAACTCACCGGAGCCAGTGAAGTGCTTCCCTTGGAATGAGGCATTGGATAACTTCATCCAGCTTATTGCTGATCTCATCTTGTCGGTCATCAAATAC
TTGTGTGTACCATTACTCGCCATCACCTCCCTCAGCGAGATGTCCTACTGTGCCCATGAAAAGAAGCTTCTTATTGTCCCATTTCCGTTAATAATTGGCTTCTCTGTTGC
TGAGGTCATGAGACAAACAGCTTTGAGTCTGTCTCCAATTCTTAAGGATCTAGAAGTACCATGGCATTTGATTACCATTGCTATATTCTTCACCCTCATTAAACTGCCCG
GCCCGTACTACCCATATTGGGGTCGCATATTCATTCCCCATTTTGCAAATGGGGGCTTGTTGAGAACTTTATGGTCTTTGTTTCTCTGGTTTAGGAGACCCCGGAAGATC
TCGACGTCACTGAAGCATAATGAGAATCACCTGGATACAAAGAAGAACTAA
mRNA sequence
TTTTTGCTTCGAGCTCGAACTTGATCCTCTCAGAAATTTTCAAGTTTCTCGCGGGTTATGGTGTTCGTGAAGAAACCCTTCTTCCTAAACTGTGTTTCATCCATCAACAG
CTTGAATTGACGCCCAGCGATGGTGACCCACCTTCAAATTTCCACCAAAATGCCCCTGCACGCCCCGGTTCTTCTAGCGCCCAGACTTCGGTCACCAGAATGCAGCCACG
TAGTGAGTTTAAAACCACATAAAAGATTTCAGAATGCTAGATCTGTGGTGTCTTGTGCTATGAATATGACTGCAGGACAATCAGATGATCCAGGGAGGGTAAGTTGGGAT
AGTTTGAAGGACAAGGTAAAAAAACTTTGGGACAACTCACCGGAGCCAGTGAAGTGCTTCCCTTGGAATGAGGCATTGGATAACTTCATCCAGCTTATTGCTGATCTCAT
CTTGTCGGTCATCAAATACTTGTGTGTACCATTACTCGCCATCACCTCCCTCAGCGAGATGTCCTACTGTGCCCATGAAAAGAAGCTTCTTATTGTCCCATTTCCGTTAA
TAATTGGCTTCTCTGTTGCTGAGGTCATGAGACAAACAGCTTTGAGTCTGTCTCCAATTCTTAAGGATCTAGAAGTACCATGGCATTTGATTACCATTGCTATATTCTTC
ACCCTCATTAAACTGCCCGGCCCGTACTACCCATATTGGGGTCGCATATTCATTCCCCATTTTGCAAATGGGGGCTTGTTGAGAACTTTATGGTCTTTGTTTCTCTGGTT
TAGGAGACCCCGGAAGATCTCGACGTCACTGAAGCATAATGAGAATCACCTGGATACAAAGAAGAACTAATCAAAAAGGCATAGAGAACATTGGTTCAGATGACATTGCT
GGCTTCATCAGAGAGAAGTGATCCATCGAGAGCGAACTGGTCGATAGTTTCTTGTAGCACGCGACCGATTCTGCTTCATTGAAGGATCGTTTTCGTGATGGACGTTCCAG
ATAAGCCTTAAATGAAGCCTAATGTGTTTTCCAATTTTCTTTGTTTTATGTAGCACAGAAAGTTTTGGAGGCTGTATTTACATATCATAATTGTTCTAATAACTTCAAAT
ATGAAAATAGTACACAATGTTATTCAATTCCTTCCTAAATTTTCAAAAAATTAGTAA
Protein sequence
MVTHLQISTKMPLHAPVLLAPRLRSPECSHVVSLKPHKRFQNARSVVSCAMNMTAGQSDDPGRVSWDSLKDKVKKLWDNSPEPVKCFPWNEALDNFIQLIADLILSVIKY
LCVPLLAITSLSEMSYCAHEKKLLIVPFPLIIGFSVAEVMRQTALSLSPILKDLEVPWHLITIAIFFTLIKLPGPYYPYWGRIFIPHFANGGLLRTLWSLFLWFRRPRKI
STSLKHNENHLDTKKN
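
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below is one way to do that under the assumption that the standard genetic code applies to this nuclear gene; `translate` is a hypothetical helper written for illustration and is not part of any database tooling. Only the first 21 and last 12 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


print(translate("ATGGTGACCCACCTTCAAATT"))  # start of the CDS: MVTHLQI
print(translate("AAGAAGAACTAA"))           # end of the CDS:   KKN*
```

Both fragments agree with the start (MVTHLQI) and end (KKN) of the listed protein sequence. Passing the full CDS to `translate` and dropping the trailing `*` would extend the check to the whole protein.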