CuGenDBv2

Moc06g03670 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g03670
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Protein of unknown function (DUF789)
Genome location: chr6:2612312..2615135
RNA-Seq Expression: Moc06g03670
Synteny: Moc06g03670
Gene Ontology terms: NA
InterPro domains: IPR008507 - Protein of unknown function DUF789
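
The genome location listed above follows a `chromosome:start..end` pattern. As a minimal sketch, the snippet below splits such a string into its parts and computes the span length. The names `GenomeLocation` and `parse_location` are hypothetical helpers introduced for illustration, and the length assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'chrom:start..end' location (hypothetical helper type)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


# Chromosome name, a colon, then two integers separated by '..'
_LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'chr6:2612312..2615135'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return GenomeLocation(match["chrom"], int(match["start"]), int(match["end"]))


loc = parse_location("chr6:2612312..2615135")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span for this gene works out to 2615135 - 2612312 + 1 = 2824 bp.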


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6589615.1 hypothetical protein SDJN03_15038, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 8.8e-199; % identity: 85.54)
Query:  MFGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPVKSDETESPSSDMKTKVVASTTKPSKPLTPQHKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD
        M GTALQFGGIKGEDRFYIPV+ARK+YNQQKPSRRP K+DETE+PSS+    VVASTT PSKPLTPQ KSNLERFLDAT PSV AQYFSKTTMRGWRTCD
Subjt:  MFGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPVKSDETESPSSDMKTKVVASTTKPSKPLTPQHKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGES-AMRSDSKCRLAGEDSDLDSSRDTSSDGSIEYEVGKNSQISREQWVH
        IEFQPYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGES A+RSDSK RLA EDSDLDSS+DTSS+GSI+YE GK+  +SREQWVH
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGES-AMRSDSKCRLAGEDSDLDSSRDTSSDGSIEYEVGKNSQISREQWVH

Query:  DLLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQYPGLKSLRSCDIQPASWVSVAWYPIYRIPTGPT
          L CE+ + MR  S+RDEH   QEGFSSDDGDA  PRS LLFQF EQDLPYQRVPLADKIFDLAYQ+PGLK+LRSCDI PASW+SVAWYPIYRIPTGPT
Subjt:  DLLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQYPGLKSLRSCDIQPASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWAQNGVKEHQMANSLMQAADNWLRLLQVHQPDFQFFASHGTYW
        LKDLDACFLTYHSLSTPIRGNGHGQAP MIYPND DG+PKVSLPVFGLASYKLKGSIWAQN VKEHQMANSLMQAA+ WLR LQV+QPDFQFFASH TYW
Subjt:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWAQNGVKEHQMANSLMQAADNWLRLLQVHQPDFQFFASHGTYW

Query:  R
        R
Subjt:  R

XP_022134722.1 uncharacterized protein LOC111006925 [Momordica charantia] (e-value: 1.8e-236; % identity: 100)
Query:  MFGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPVKSDETESPSSDMKTKVVASTTKPSKPLTPQHKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD
        MFGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPVKSDETESPSSDMKTKVVASTTKPSKPLTPQHKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD
Subjt:  MFGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPVKSDETESPSSDMKTKVVASTTKPSKPLTPQHKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAMRSDSKCRLAGEDSDLDSSRDTSSDGSIEYEVGKNSQISREQWVHD
        IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAMRSDSKCRLAGEDSDLDSSRDTSSDGSIEYEVGKNSQISREQWVHD
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAMRSDSKCRLAGEDSDLDSSRDTSSDGSIEYEVGKNSQISREQWVHD

Query:  LLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQYPGLKSLRSCDIQPASWVSVAWYPIYRIPTGPTL
        LLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQYPGLKSLRSCDIQPASWVSVAWYPIYRIPTGPTL
Subjt:  LLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQYPGLKSLRSCDIQPASWVSVAWYPIYRIPTGPTL

Query:  KDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWAQNGVKEHQMANSLMQAADNWLRLLQVHQPDFQFFASHGTYWR
        KDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWAQNGVKEHQMANSLMQAADNWLRLLQVHQPDFQFFASHGTYWR
Subjt:  KDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWAQNGVKEHQMANSLMQAADNWLRLLQVHQPDFQFFASHGTYWR

XP_022921943.1 uncharacterized protein LOC111430050 [Cucurbita moschata] (e-value: 1.4e-199; % identity: 85.79)
Query:  MFGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPVKSDETESPSSDMKTKVVASTTKPSKPLTPQHKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD
        M GTALQFGGIKGEDRFYIPV+ARK+YNQQKPSRRP K+DETE+PSS+    VVASTT PSKPLTPQ KSNLERFLDAT PSVPAQYFSKTTMRGWRTCD
Subjt:  MFGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPVKSDETESPSSDMKTKVVASTTKPSKPLTPQHKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGES-AMRSDSKCRLAGEDSDLDSSRDTSSDGSIEYEVGKNSQISREQWVH
        IEFQPYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGES A+RSDSK RLA EDSDLDSS+DTSS+GSI+YE GK+  +SREQWVH
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGES-AMRSDSKCRLAGEDSDLDSSRDTSSDGSIEYEVGKNSQISREQWVH

Query:  DLLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQYPGLKSLRSCDIQPASWVSVAWYPIYRIPTGPT
          L CE+ + MR  S+RDEH   QEGFSSDDGDA  PRS LLFQF EQDLPYQRVPLADKIFDLAYQ+PGLK+LRSCDI PASW+SVAWYPIYRIPTGPT
Subjt:  DLLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQYPGLKSLRSCDIQPASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWAQNGVKEHQMANSLMQAADNWLRLLQVHQPDFQFFASHGTYW
        LKDLDACFLTYHSLSTPIRGNGHGQAP MIYPND DG+PKVSLPVFGLASYKLKGSIWAQN VKEHQMANSLMQAA+ WLR LQV+QPDFQFFASH TYW
Subjt:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWAQNGVKEHQMANSLMQAADNWLRLLQVHQPDFQFFASHGTYW

Query:  R
        R
Subjt:  R

XP_022987436.1 uncharacterized protein LOC111484983 [Cucurbita maxima] (e-value: 9.7e-198; % identity: 85.54)
Query:  MFGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPVKSDETESPSSDMKTKVVASTTKPSKPLTPQHKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD
        M GTALQFGGIKGEDRFYIPV+ARK YNQQKPSRRP K+DETE+PSS    KVVASTT PSKPLTPQ KSNLERFLDAT PSVPAQYFSKTTMRGWRTCD
Subjt:  MFGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPVKSDETESPSSDMKTKVVASTTKPSKPLTPQHKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGES-AMRSDSKCRLAGEDSDLDSSRDTSSDGSIEYEVGKNSQISREQWVH
        IEFQPYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGES A+RSDSK RLA EDSDLDSSRDTSS+GSI+YE GK+  +SREQWVH
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGES-AMRSDSKCRLAGEDSDLDSSRDTSSDGSIEYEVGKNSQISREQWVH

Query:  DLLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQYPGLKSLRSCDIQPASWVSVAWYPIYRIPTGPT
          L C++ L +R  S+RDEH   QEGFSSDDGDA  PRS LLFQF EQDLPYQRVPLADKIFDLAYQ+PGLK+LRSCDI PASW+SVAWYPIYRIPTGPT
Subjt:  DLLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQYPGLKSLRSCDIQPASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWAQNGVKEHQMANSLMQAADNWLRLLQVHQPDFQFFASHGTYW
        LKDLDACFLTYHSLSTPIRGNGHGQAP MIYPND DG+PKVSLPVFGLASYKLKGSIWAQN VKE+QM NSLMQAA+ WLR LQV+QPDFQFFASH TYW
Subjt:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWAQNGVKEHQMANSLMQAADNWLRLLQVHQPDFQFFASHGTYW

Query:  R
        R
Subjt:  R

XP_023516127.1 uncharacterized protein LOC111780081 [Cucurbita pepo subsp. pepo] (e-value: 2.3e-199; % identity: 86.03)
Query:  MFGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPVKSDETESPSSDMKTKVVASTTKPSKPLTPQHKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD
        M GTALQFGGIKGEDRFYIPV+ARK+YNQQKPSRRP K+DETE+PSS    KVVASTT PSKPLTPQ KSNLERFLDAT PSVPAQYFSKTTMRGWRTCD
Subjt:  MFGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPVKSDETESPSSDMKTKVVASTTKPSKPLTPQHKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGES-AMRSDSKCRLAGEDSDLDSSRDTSSDGSIEYEVGKNSQISREQWVH
        IEFQPYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGES A+RSDSK RLA EDSDLDSSRDTSS+GSI+YE GK+  +SREQWVH
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGES-AMRSDSKCRLAGEDSDLDSSRDTSSDGSIEYEVGKNSQISREQWVH

Query:  DLLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQYPGLKSLRSCDIQPASWVSVAWYPIYRIPTGPT
          L CE+ + MR  S+RDEH   QEGFSSDDGDA  PRS LLFQF EQDLPYQRVPLADKIFDLAYQ+PGLK+LRSCDI PASW+SVAWYPIYRIPTGPT
Subjt:  DLLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQYPGLKSLRSCDIQPASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWAQNGVKEHQMANSLMQAADNWLRLLQVHQPDFQFFASHGTYW
        LKDLDACFLTYHSLSTPIRGNGHGQAP MIYPND DG+PKVSLPVFGLASYKLKGSIWAQN VKEHQMANSLMQAA+ WLR LQV+QPDFQFFAS+ TYW
Subjt:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWAQNGVKEHQMANSLMQAADNWLRLLQVHQPDFQFFASHGTYW

Query:  R
        R
Subjt:  R

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3BX10 uncharacterized protein LOC103494138 isoform X1 (e-value: 7.6e-188; % identity: 81.05)
Query:  MFGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPVKSDETESPSSDMKTKVVASTTKPSKPLTPQHKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD
        M GTALQFGGIKGEDRFYIPV+ARKNYNQQKPSRRP K+DETES SS    KVV  TTKP + LTPQ KSNLERFL+AT PSVPAQYFSKTTMR WRTCD
Subjt:  MFGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPVKSDETESPSSDMKTKVVASTTKPSKPLTPQHKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGE-SAMRSDSKCRLAGEDSDLDSSRDTSSDGSIEYEVGKNSQISREQWVH
        IEFQPYFILNDLWESFKEWSAYGAGVPLVL+GGDSVVQYYVPYLSGIQIYGE +A+RSDS  RLA EDSDLDSSRDTSSDGSI+Y++GK+  +SREQW H
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGE-SAMRSDSKCRLAGEDSDLDSSRDTSSDGSIEYEVGKNSQISREQWVH

Query:  DLLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQYPGLKSLRSCDIQPASWVSVAWYPIYRIPTGPT
          L CEN  KMR  S+ DE  M+QEGF SDDGDAG PRS LLFQF EQDLPYQRVPLADKIF+LAYQ+PGLK+LRSCDI PASWVSVAWYPIYRIPTGPT
Subjt:  DLLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQYPGLKSLRSCDIQPASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWAQNGVKEHQMANSLMQAADNWLRLLQVHQPDFQFFASHGTYW
        LKDLDACFLTYHSLSTP +GN H   PVM+YP D+D + K+SLPVFG+ASYKLKGSIW QNG+ +HQ ANSLMQAAD WLR LQV QPDFQFF+SHGTYW
Subjt:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWAQNGVKEHQMANSLMQAADNWLRLLQVHQPDFQFFASHGTYW

Query:  R
        R
Subjt:  R

A0A5A7USF1 Uncharacterized protein (e-value: 7.6e-188; % identity: 81.05)
Query:  MFGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPVKSDETESPSSDMKTKVVASTTKPSKPLTPQHKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD
        M GTALQFGGIKGEDRFYIPV+ARKNYNQQKPSRRP K+DETES SS    KVV  TTKP + LTPQ KSNLERFL+AT PSVPAQYFSKTTMR WRTCD
Subjt:  MFGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPVKSDETESPSSDMKTKVVASTTKPSKPLTPQHKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGE-SAMRSDSKCRLAGEDSDLDSSRDTSSDGSIEYEVGKNSQISREQWVH
        IEFQPYFILNDLWESFKEWSAYGAGVPLVL+GGDSVVQYYVPYLSGIQIYGE +A+RSDS  RLA EDSDLDSSRDTSSDGSI+Y++GK+  +SREQW H
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGE-SAMRSDSKCRLAGEDSDLDSSRDTSSDGSIEYEVGKNSQISREQWVH

Query:  DLLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQYPGLKSLRSCDIQPASWVSVAWYPIYRIPTGPT
          L CEN  KMR  S+ DE  M+QEGF SDDGDAG PRS LLFQF EQDLPYQRVPLADKIF+LAYQ+PGLK+LRSCDI PASWVSVAWYPIYRIPTGPT
Subjt:  DLLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQYPGLKSLRSCDIQPASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWAQNGVKEHQMANSLMQAADNWLRLLQVHQPDFQFFASHGTYW
        LKDLDACFLTYHSLSTP +GN H   PVM+YP D+D + K+SLPVFG+ASYKLKGSIW QNG+ +HQ ANSLMQAAD WLR LQV QPDFQFF+SHGTYW
Subjt:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWAQNGVKEHQMANSLMQAADNWLRLLQVHQPDFQFFASHGTYW

Query:  R
        R
Subjt:  R

A0A6J1C0E1 uncharacterized protein LOC111006925 (e-value: 8.8e-237; % identity: 100)
Query:  MFGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPVKSDETESPSSDMKTKVVASTTKPSKPLTPQHKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD
        MFGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPVKSDETESPSSDMKTKVVASTTKPSKPLTPQHKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD
Subjt:  MFGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPVKSDETESPSSDMKTKVVASTTKPSKPLTPQHKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAMRSDSKCRLAGEDSDLDSSRDTSSDGSIEYEVGKNSQISREQWVHD
        IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAMRSDSKCRLAGEDSDLDSSRDTSSDGSIEYEVGKNSQISREQWVHD
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAMRSDSKCRLAGEDSDLDSSRDTSSDGSIEYEVGKNSQISREQWVHD

Query:  LLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQYPGLKSLRSCDIQPASWVSVAWYPIYRIPTGPTL
        LLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQYPGLKSLRSCDIQPASWVSVAWYPIYRIPTGPTL
Subjt:  LLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQYPGLKSLRSCDIQPASWVSVAWYPIYRIPTGPTL

Query:  KDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWAQNGVKEHQMANSLMQAADNWLRLLQVHQPDFQFFASHGTYWR
        KDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWAQNGVKEHQMANSLMQAADNWLRLLQVHQPDFQFFASHGTYWR
Subjt:  KDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWAQNGVKEHQMANSLMQAADNWLRLLQVHQPDFQFFASHGTYWR

A0A6J1E577 uncharacterized protein LOC111430050 (e-value: 6.6e-200; % identity: 85.79)
Query:  MFGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPVKSDETESPSSDMKTKVVASTTKPSKPLTPQHKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD
        M GTALQFGGIKGEDRFYIPV+ARK+YNQQKPSRRP K+DETE+PSS+    VVASTT PSKPLTPQ KSNLERFLDAT PSVPAQYFSKTTMRGWRTCD
Subjt:  MFGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPVKSDETESPSSDMKTKVVASTTKPSKPLTPQHKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGES-AMRSDSKCRLAGEDSDLDSSRDTSSDGSIEYEVGKNSQISREQWVH
        IEFQPYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGES A+RSDSK RLA EDSDLDSS+DTSS+GSI+YE GK+  +SREQWVH
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGES-AMRSDSKCRLAGEDSDLDSSRDTSSDGSIEYEVGKNSQISREQWVH

Query:  DLLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQYPGLKSLRSCDIQPASWVSVAWYPIYRIPTGPT
          L CE+ + MR  S+RDEH   QEGFSSDDGDA  PRS LLFQF EQDLPYQRVPLADKIFDLAYQ+PGLK+LRSCDI PASW+SVAWYPIYRIPTGPT
Subjt:  DLLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQYPGLKSLRSCDIQPASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWAQNGVKEHQMANSLMQAADNWLRLLQVHQPDFQFFASHGTYW
        LKDLDACFLTYHSLSTPIRGNGHGQAP MIYPND DG+PKVSLPVFGLASYKLKGSIWAQN VKEHQMANSLMQAA+ WLR LQV+QPDFQFFASH TYW
Subjt:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWAQNGVKEHQMANSLMQAADNWLRLLQVHQPDFQFFASHGTYW

Query:  R
        R
Subjt:  R

A0A6J1JE68 uncharacterized protein LOC111484983 (e-value: 4.7e-198; % identity: 85.54)
Query:  MFGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPVKSDETESPSSDMKTKVVASTTKPSKPLTPQHKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD
        M GTALQFGGIKGEDRFYIPV+ARK YNQQKPSRRP K+DETE+PSS    KVVASTT PSKPLTPQ KSNLERFLDAT PSVPAQYFSKTTMRGWRTCD
Subjt:  MFGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPVKSDETESPSSDMKTKVVASTTKPSKPLTPQHKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGES-AMRSDSKCRLAGEDSDLDSSRDTSSDGSIEYEVGKNSQISREQWVH
        IEFQPYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGES A+RSDSK RLA EDSDLDSSRDTSS+GSI+YE GK+  +SREQWVH
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGES-AMRSDSKCRLAGEDSDLDSSRDTSSDGSIEYEVGKNSQISREQWVH

Query:  DLLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQYPGLKSLRSCDIQPASWVSVAWYPIYRIPTGPT
          L C++ L +R  S+RDEH   QEGFSSDDGDA  PRS LLFQF EQDLPYQRVPLADKIFDLAYQ+PGLK+LRSCDI PASW+SVAWYPIYRIPTGPT
Subjt:  DLLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQYPGLKSLRSCDIQPASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWAQNGVKEHQMANSLMQAADNWLRLLQVHQPDFQFFASHGTYW
        LKDLDACFLTYHSLSTPIRGNGHGQAP MIYPND DG+PKVSLPVFGLASYKLKGSIWAQN VKE+QM NSLMQAA+ WLR LQV+QPDFQFFASH TYW
Subjt:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWAQNGVKEHQMANSLMQAADNWLRLLQVHQPDFQFFASHGTYW

Query:  R
        R
Subjt:  R

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT1G15030.1 Protein of unknown function (DUF789) (e-value: 4.5e-92; % identity: 55.52)
Query:  SNLERFLDATTPSVPAQYFSKTTMRGWRTCDIEFQ-PYFILNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSGIQIYGE-SAMRSDSKCRLAGE
        SN+ERFLD+ TPSVPA Y SKT +R     D+E Q PYF+L D+WESF EWSAYG GVPL LN   D V QYYVP LSGIQ+Y +  A+ S  + R  GE
Subjt:  SNLERFLDATTPSVPAQYFSKTTMRGWRTCDIEFQ-PYFILNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSGIQIYGE-SAMRSDSKCRLAGE

Query:  DSDLDSSRDTSSDGSIEYEVGKNSQISREQWVHDLLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQ
        +S+ D  RD+SS+GS   E  +    S+EQ          + +M  +S+R EH   QE  SSDDG+  + +  L+F++ E+DLPY R P ADK+ DLA +
Subjt:  DSDLDSSRDTSSDGSIEYEVGKNSQISREQWVHDLLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQ

Query:  YPGLKSLRSCDIQPASWVSVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWAQNGVKEHQ
        +P LK+LRSCD+ P+SW SVAWYPIY+IPTGPTLKDLDACFLTYHSL TP +G G     + +     + V K+ LPVFGLASYKL+GS+W   G   HQ
Subjt:  YPGLKSLRSCDIQPASWVSVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWAQNGVKEHQ

Query:  MANSLMQAADNWLRLLQVHQPDFQFF
        +ANSL QAADNWLRL QV+ PDF FF
Subjt:  MANSLMQAADNWLRLLQVHQPDFQFF

AT2G01260.1 Protein of unknown function (DUF789) (e-value: 9.4e-90; % identity: 49.12)
Query:  MFGTALQF-GGIKGEDRFYIPVKARKNYNQQKPSRRPVKSDETESPSSDMKTKVVASTTKPSKPLTPQHKSNLERFLDATTPSVPAQYFSKTTMRGWRTC
        M G   Q   G  G+D FY   K R+  NQ+    R  +SD +  PSS        S  K     +    SNL+RFL++ TPSVPAQ+ SKT +R  R  
Subjt:  MFGTALQF-GGIKGEDRFYIPVKARKNYNQQKPSRRPVKSDETESPSSDMKTKVVASTTKPSKPLTPQHKSNLERFLDATTPSVPAQYFSKTTMRGWRTC

Query:  D--IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSGIQIYGES-AMRSDSKCRLAGEDSDLDSSRDTSSDGSIEYEVGKNSQISRE
        D   +  PYF+L D+W+SF EWSAYG GVPLVLN   D V+QYYVP LS IQIY  S A+ S  K R  G+ SD D  RD+SSD S +         S  
Subjt:  D--IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSGIQIYGES-AMRSDSKCRLAGEDSDLDSSRDTSSDGSIEYEVGKNSQISRE

Query:  QWVHDLLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQYPGLKSLRSCDIQPASWVSVAWYPIYRIP
        + V   + C        +S+RD+H   QE  SSDDG+    +  L+F++ E+DLPY R P ADK+ DLA Q+P L +LRSCD+  +SW SVAWYPIYRIP
Subjt:  QWVHDLLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQYPGLKSLRSCDIQPASWVSVAWYPIYRIP

Query:  TGPTLKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWAQNGVKEHQMANSLMQAADNWLRLLQVHQPDFQFF
        TGPTLKDLDACFLTYHSL T   G G  Q+  +  P + +   K+SLPVFGLASYK +GS+W   G  EHQ+ NSL QAAD WL    V  PDF FF
Subjt:  TGPTLKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWAQNGVKEHQMANSLMQAADNWLRLLQVHQPDFQFF

AT2G01260.2 Protein of unknown function (DUF789) (e-value: 3.5e-68; % identity: 48.77)
Query:  MFGTALQF-GGIKGEDRFYIPVKARKNYNQQKPSRRPVKSDETESPSSDMKTKVVASTTKPSKPLTPQHKSNLERFLDATTPSVPAQYFSKTTMRGWRTC
        M G   Q   G  G+D FY   K R+  NQ+    R  +SD +  PSS        S  K     +    SNL+RFL++ TPSVPAQ+ SKT +R  R  
Subjt:  MFGTALQF-GGIKGEDRFYIPVKARKNYNQQKPSRRPVKSDETESPSSDMKTKVVASTTKPSKPLTPQHKSNLERFLDATTPSVPAQYFSKTTMRGWRTC

Query:  D--IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSGIQIYGES-AMRSDSKCRLAGEDSDLDSSRDTSSDGSIEYEVGKNSQISRE
        D   +  PYF+L D+W+SF EWSAYG GVPLVLN   D V+QYYVP LS IQIY  S A+ S  K R  G+ SD D  RD+SSD S +         S  
Subjt:  D--IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSGIQIYGES-AMRSDSKCRLAGEDSDLDSSRDTSSDGSIEYEVGKNSQISRE

Query:  QWVHDLLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQYPGLKSLRSCDIQPASWVSVAWYPIYRIP
        + V   + C        +S+RD+H   QE  SSDDG+    +  L+F++ E+DLPY R P ADK+ DLA Q+P L +LRSCD+  +SW SVAWYPIYRIP
Subjt:  QWVHDLLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQYPGLKSLRSCDIQPASWVSVAWYPIYRIP

Query:  TGPTLKDLDACFLTYHSLSTPIRG
        TGPTLKDLDACFLTYHSL T   G
Subjt:  TGPTLKDLDACFLTYHSLSTPIRG

AT4G16100.1 Protein of unknown function (DUF789) (e-value: 2.4e-85; % identity: 44.71)
Query:  IKGEDRFYIPVKARKNYNQQKPSRRPVKSDETESPSS----DMKTKV-----------------VASTTKPSKPLTPQHKSNLERFLDATTPSVPAQYFS
        I+GE+RFY P   RK   +++  R   +  E E   +    D K KV                 V S    +   T    SNL RFLD TTP V  Q+  
Subjt:  IKGEDRFYIPVKARKNYNQQKPSRRPVKSDETESPSS----DMKTKV-----------------VASTTKPSKPLTPQHKSNLERFLDATTPSVPAQYFS

Query:  KTTMRGWRTCDIEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAMRSDSKCRLAGEDSDLDSSRDTSSDGSIEYEVGKN
         T+ +GWRT + E++PYF+LNDLW+SF+EWSAYG GVPL+LNG DSVVQYYVPYLSGIQ+Y E   R+ +  R  GE+SD DS RD SSDGS +      
Subjt:  KTTMRGWRTCDIEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAMRSDSKCRLAGEDSDLDSSRDTSSDGSIEYEVGKN

Query:  SQISREQWVHDLLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDA-GNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQYPGLKSLRSCDIQPASWVSVAW
                       E +  +   S+ ++ C+   G SSD+ +A  N    L+F++ E  +P+ R PL DKI +L+ Q+P L++ RSCD+ P+SWVSVAW
Subjt:  SQISREQWVHDLLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDA-GNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQYPGLKSLRSCDIQPASWVSVAW

Query:  YPIYRIPTGPTLKDLDACFLTYHSLSTPIRG--NGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWA-QNGVKEHQMANSLMQAADNWLRLLQVH
        YPIYRIP G +L++LDACFLT+HSLSTP RG  N  GQ+      +      K+ LP FGLASYK K S W+ ++ V E+Q   +L++ A+ WLR L+V 
Subjt:  YPIYRIPTGPTLKDLDACFLTYHSLSTPIRG--NGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWA-QNGVKEHQMANSLMQAADNWLRLLQVH

Query:  QPDFQFFASH-GTYWR
         PDF+ F SH G+ WR
Subjt:  QPDFQFFASH-GTYWR

AT5G49220.1 Protein of unknown function (DUF789) (e-value: 5.2e-72; % identity: 38.44)
Query:  GTALQFGGIKGEDRFYIPVKARKNYN----QQKPSRRPVKSDETE-------------SPS--------SDMKTKVVASTTKPSKPLTPQHK--------
        G ++    I+GE+RFY P   R+       QQ+   +  + DE E             +P         S+ K++VV S ++     +            
Subjt:  GTALQFGGIKGEDRFYIPVKARKNYN----QQKPSRRPVKSDETE-------------SPS--------SDMKTKVVASTTKPSKPLTPQHK--------

Query:  -SNLERFLDATTPSVPAQYFSKTTMRGWRTCDIEFQPYFILNDLWESFKEWSAYGAGV-----PLVLNGGDSVVQYYVPYLSGIQIYGESAMRSDSKCRL
         SNL+RFL+ TTP VPA+ F   +    +T + +   YF+L DLWESF EWSAYGAGV     PL ++G DS VQYYVPYLSGIQ+Y +   +  +    
Subjt:  -SNLERFLDATTPSVPAQYFSKTTMRGWRTCDIEFQPYFILNDLWESFKEWSAYGAGV-----PLVLNGGDSVVQYYVPYLSGIQIYGESAMRSDSKCRL

Query:  AGEDSDLDSSRDTSSDGSIEYEVGKNSQISREQWVHDLLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDL
         G++         S    ++  VG                     ++  +S++D+   I    SS + +  NP+  LLF++ E + P+ R PLA+KI DL
Subjt:  AGEDSDLDSSRDTSSDGSIEYEVGKNSQISREQWVHDLLTCENTLKMRNMSVRDEHCMIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDL

Query:  AYQYPGLKSLRSCDIQPASWVSVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWAQNGVK
        A + P L + RSCD+ P+SWVSV+WYPIYRIP GPTL++LDACFLT+HSLST    +  G        +D     K+ LP FGLASYKLK S+W QN ++
Subjt:  AYQYPGLKSLRSCDIQPASWVSVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGSIWAQNGVK

Query:  EHQMANSLMQAADNWLRLLQVHQPDFQFFASHGTYWR
        E Q   SL+QAAD WL+ LQV  PD++FF S+    R
Subjt:  EHQMANSLMQAADNWLRLLQVHQPDFQFFASHGTYWR
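
In BLAST-style alignments such as those above, the middle line of each block conventionally repeats the residue letter at identical positions, shows `+` for a conservative substitution, and leaves a space at mismatches or gaps. The sketch below estimates percent identity from the query and middle lines alone, as identities divided by alignment columns. It assumes the `Query:` prefix and its matching indentation have already been removed from each row; `percent_identity` is a hypothetical helper and not part of any database tool.

```python
def percent_identity(query_rows, midline_rows):
    """Estimate percent identity of a BLAST-style pairwise alignment.

    query_rows   -- aligned query strings, one per block ('-' marks a gap)
    midline_rows -- the matching middle lines, without the leading indent
    """
    if len(query_rows) != len(midline_rows):
        raise ValueError("need exactly one midline per query row")

    columns = 0
    identities = 0
    for query, midline in zip(query_rows, midline_rows):
        # Trailing spaces are easily lost in copy/paste, so pad the midline.
        midline = midline.ljust(len(query))
        if len(midline) != len(query):
            raise ValueError("midline is longer than its query row")
        columns += len(query)
        # Letters mark identical residues; '+' and ' ' are not identities.
        identities += sum(1 for ch in midline if ch.isalpha())

    if columns == 0:
        raise ValueError("empty alignment")
    return 100.0 * identities / columns


# Toy example: 6 identical positions out of 8 columns.
print(percent_identity(["MFGTALQF"], ["M GTAL+F"]))
```

Gap columns count toward the alignment length here, so the result is only comparable to the reported % identity values if they were computed the same way.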


Sequences
CDS sequence
ATGTTTGGAACTGCGTTGCAGTTTGGGGGAATCAAGGGTGAGGATCGGTTTTATATTCCAGTAAAGGCAAGGAAGAATTATAATCAGCAAAAGCCGTCGAGGAGACCCGT
CAAGAGCGATGAAACTGAGAGCCCTTCTTCAGATATGAAGACCAAAGTCGTGGCTTCTACTACTAAGCCTTCTAAGCCATTAACTCCTCAGCATAAGAGCAACTTGGAGA
GATTCTTGGACGCCACAACGCCTTCAGTTCCAGCGCAGTACTTCTCTAAGACAACTATGAGGGGTTGGCGGACTTGTGATATTGAGTTTCAACCTTATTTCATTCTGAAT
GATCTGTGGGAGTCTTTCAAGGAGTGGAGTGCATACGGTGCTGGAGTTCCTTTAGTACTTAATGGAGGCGACTCTGTTGTCCAATATTACGTTCCATATTTGTCTGGCAT
CCAAATATATGGTGAATCTGCAATGAGATCAGATTCTAAGTGCAGGCTGGCTGGTGAGGACAGTGACCTCGACTCCTCCAGGGATACAAGTAGCGACGGTAGCATTGAAT
ATGAAGTTGGAAAAAACTCTCAAATTTCTAGGGAGCAATGGGTTCATGACCTTCTAACTTGTGAAAACACACTTAAAATGAGAAATATGTCTGTAAGAGATGAACATTGC
ATGATACAAGAAGGTTTTTCGAGTGATGATGGGGATGCTGGAAATCCTAGGAGTGTTTTGCTCTTTCAGTTTTTTGAGCAAGATCTTCCTTATCAACGAGTTCCATTGGC
TGATAAGATATTTGATCTTGCTTACCAATATCCTGGTTTGAAATCTTTAAGAAGTTGTGATATCCAGCCAGCCAGTTGGGTCTCTGTCGCATGGTACCCGATATACCGTA
TACCCACCGGTCCGACATTAAAAGATTTGGATGCTTGCTTTTTAACATATCATTCCCTTTCGACTCCCATTAGAGGTAATGGACATGGCCAGGCACCAGTGATGATATAT
CCAAATGACATGGATGGTGTCCCAAAGGTCTCCTTGCCTGTTTTTGGATTGGCTTCTTACAAGCTAAAAGGCTCAATTTGGGCGCAAAATGGAGTCAAGGAGCATCAGAT
GGCAAATTCTCTGATGCAGGCAGCGGATAACTGGCTGAGGCTTCTTCAGGTTCATCAACCTGATTTTCAGTTCTTTGCATCGCACGGGACGTACTGGAGATGA
mRNA sequence
ATGTTTGGAACTGCGTTGCAGTTTGGGGGAATCAAGGGTGAGGATCGGTTTTATATTCCAGTAAAGGCAAGGAAGAATTATAATCAGCAAAAGCCGTCGAGGAGACCCGT
CAAGAGCGATGAAACTGAGAGCCCTTCTTCAGATATGAAGACCAAAGTCGTGGCTTCTACTACTAAGCCTTCTAAGCCATTAACTCCTCAGCATAAGAGCAACTTGGAGA
GATTCTTGGACGCCACAACGCCTTCAGTTCCAGCGCAGTACTTCTCTAAGACAACTATGAGGGGTTGGCGGACTTGTGATATTGAGTTTCAACCTTATTTCATTCTGAAT
GATCTGTGGGAGTCTTTCAAGGAGTGGAGTGCATACGGTGCTGGAGTTCCTTTAGTACTTAATGGAGGCGACTCTGTTGTCCAATATTACGTTCCATATTTGTCTGGCAT
CCAAATATATGGTGAATCTGCAATGAGATCAGATTCTAAGTGCAGGCTGGCTGGTGAGGACAGTGACCTCGACTCCTCCAGGGATACAAGTAGCGACGGTAGCATTGAAT
ATGAAGTTGGAAAAAACTCTCAAATTTCTAGGGAGCAATGGGTTCATGACCTTCTAACTTGTGAAAACACACTTAAAATGAGAAATATGTCTGTAAGAGATGAACATTGC
ATGATACAAGAAGGTTTTTCGAGTGATGATGGGGATGCTGGAAATCCTAGGAGTGTTTTGCTCTTTCAGTTTTTTGAGCAAGATCTTCCTTATCAACGAGTTCCATTGGC
TGATAAGATATTTGATCTTGCTTACCAATATCCTGGTTTGAAATCTTTAAGAAGTTGTGATATCCAGCCAGCCAGTTGGGTCTCTGTCGCATGGTACCCGATATACCGTA
TACCCACCGGTCCGACATTAAAAGATTTGGATGCTTGCTTTTTAACATATCATTCCCTTTCGACTCCCATTAGAGGTAATGGACATGGCCAGGCACCAGTGATGATATAT
CCAAATGACATGGATGGTGTCCCAAAGGTCTCCTTGCCTGTTTTTGGATTGGCTTCTTACAAGCTAAAAGGCTCAATTTGGGCGCAAAATGGAGTCAAGGAGCATCAGAT
GGCAAATTCTCTGATGCAGGCAGCGGATAACTGGCTGAGGCTTCTTCAGGTTCATCAACCTGATTTTCAGTTCTTTGCATCGCACGGGACGTACTGGAGATGA
Protein sequence
MFGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPVKSDETESPSSDMKTKVVASTTKPSKPLTPQHKSNLERFLDATTPSVPAQYFSKTTMRGWRTCDIEFQPYFILN
DLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGESAMRSDSKCRLAGEDSDLDSSRDTSSDGSIEYEVGKNSQISREQWVHDLLTCENTLKMRNMSVRDEHC
MIQEGFSSDDGDAGNPRSVLLFQFFEQDLPYQRVPLADKIFDLAYQYPGLKSLRSCDIQPASWVSVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRGNGHGQAPVMIY
PNDMDGVPKVSLPVFGLASYKLKGSIWAQNGVKEHQMANSLMQAADNWLRLLQVHQPDFQFFASHGTYWR
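
The CDS and protein sequences listed above can be cross-checked by translating the CDS and comparing the result with the protein. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) and reading frame 1; `translate` and `CODON_TABLE` are names introduced here for illustration only.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Remove line breaks and other whitespace from a wrapped sequence.
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")

    protein = []
    for i in range(0, len(seq), 3):
        # Codons with ambiguous bases translate to 'X'.
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 16 codons of the CDS listed on this page.
print(translate("ATGTTTGGAACTGCGTTGCAGTTTGGGGGAATCAAGGGTGAGGATCGG"))
```

Passing the full CDS (line breaks included) to `translate` and comparing the result with the protein sequence is a quick consistency check between the two records.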