; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr026977 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr026977
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function (DUF789)
Genome locationtig00153047:2759686..2766192
RNA-Seq ExpressionSgr026977
SyntenySgr026977
Gene Ontology termsNA
InterPro domainsIPR008507 - Protein of unknown function DUF789


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589615.1 hypothetical protein SDJN03_15038, partial [Cucurbita argyrosperma subsp. sororia]5.5e-18584.9Show/hide
Query:  MLGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPSKTDETESPSSDTKSKVVASATKPSKPLT-QPKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD
        MLGTALQFGGIKGEDRFYIPV+ARK+YNQQKPSRRP+KTDETE+PSS+    VVAS T PSKPLT Q KSNLERFLDAT PSV AQYFSKTTMRGWRTCD
Subjt:  MLGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPSKTDETESPSSDTKSKVVASATKPSKPLT-QPKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQLYGES-AMRSESKSRLAGEDSDLDSSRDTSSDGSSEYEVGKICKFSREQWVH
        IEFQPYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQ+YGES A+RS+SKSRLA EDSDLDSS+DTSS+GS +YE GK C  SREQWVH
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQLYGES-AMRSESKSRLAGEDSDLDSSRDTSSDGSSEYEVGKICKFSREQWVH

Query:  HHLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDAGNSRSVLLFQFLEQDLPYQRVPLADKIYELAYQYPGLKTLRSCDILPASWVSVAWYPIYRIPTGPT
        HHL  E+ + +RKTSL DE    QEGFSSDDGDA   RS LLFQFLEQDLPYQRVPLADKI++LAYQ+PGLKTLRSCDILPASW+SVAWYPIYRIPTGPT
Subjt:  HHLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDAGNSRSVLLFQFLEQDLPYQRVPLADKIYELAYQYPGLKTLRSCDILPASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGLIWAQNGVNEHQMANSLMQAADNWLRLLQ
        LKDLDACFLTYHSLSTPIRGNGHGQAP MIYPND DG+PKVSLPVFGLASYKLKG IWAQN V EHQMANSLMQAA+ WLR LQ
Subjt:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGLIWAQNGVNEHQMANSLMQAADNWLRLLQ

XP_022134722.1 uncharacterized protein LOC111006925 [Momordica charantia]5.1e-19991.12Show/hide
Query:  MLGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPSKTDETESPSSDTKSKVVASATKPSKPLT-QPKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD
        M GTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRP K+DETESPSSD K+KVVAS TKPSKPLT Q KSNLERFLDATTPSVPAQYFSKTTMRGWRTCD
Subjt:  MLGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPSKTDETESPSSDTKSKVVASATKPSKPLT-QPKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQLYGESAMRSESKSRLAGEDSDLDSSRDTSSDGSSEYEVGKICKFSREQWVHH
        IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQ+YGESAMRS+SK RLAGEDSDLDSSRDTSSDGS EYEVGK  + SREQWVH 
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQLYGESAMRSESKSRLAGEDSDLDSSRDTSSDGSSEYEVGKICKFSREQWVHH

Query:  HLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDAGNSRSVLLFQFLEQDLPYQRVPLADKIYELAYQYPGLKTLRSCDILPASWVSVAWYPIYRIPTGPTL
         LT ENTLK+R  S+ DE C IQEGFSSDDGDAGN RSVLLFQF EQDLPYQRVPLADKI++LAYQYPGLK+LRSCDI PASWVSVAWYPIYRIPTGPTL
Subjt:  HLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDAGNSRSVLLFQFLEQDLPYQRVPLADKIYELAYQYPGLKTLRSCDILPASWVSVAWYPIYRIPTGPTL

Query:  KDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGLIWAQNGVNEHQMANSLMQAADNWLRLLQ
        KDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKG IWAQNGV EHQMANSLMQAADNWLRLLQ
Subjt:  KDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGLIWAQNGVNEHQMANSLMQAADNWLRLLQ

XP_022921943.1 uncharacterized protein LOC111430050 [Cucurbita moschata]8.5e-18685.16Show/hide
Query:  MLGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPSKTDETESPSSDTKSKVVASATKPSKPLT-QPKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD
        MLGTALQFGGIKGEDRFYIPV+ARK+YNQQKPSRRP+KTDETE+PSS+    VVAS T PSKPLT Q KSNLERFLDAT PSVPAQYFSKTTMRGWRTCD
Subjt:  MLGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPSKTDETESPSSDTKSKVVASATKPSKPLT-QPKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQLYGES-AMRSESKSRLAGEDSDLDSSRDTSSDGSSEYEVGKICKFSREQWVH
        IEFQPYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQ+YGES A+RS+SKSRLA EDSDLDSS+DTSS+GS +YE GK C  SREQWVH
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQLYGES-AMRSESKSRLAGEDSDLDSSRDTSSDGSSEYEVGKICKFSREQWVH

Query:  HHLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDAGNSRSVLLFQFLEQDLPYQRVPLADKIYELAYQYPGLKTLRSCDILPASWVSVAWYPIYRIPTGPT
        HHL  E+ + +RKTSL DE    QEGFSSDDGDA   RS LLFQFLEQDLPYQRVPLADKI++LAYQ+PGLKTLRSCDILPASW+SVAWYPIYRIPTGPT
Subjt:  HHLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDAGNSRSVLLFQFLEQDLPYQRVPLADKIYELAYQYPGLKTLRSCDILPASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGLIWAQNGVNEHQMANSLMQAADNWLRLLQ
        LKDLDACFLTYHSLSTPIRGNGHGQAP MIYPND DG+PKVSLPVFGLASYKLKG IWAQN V EHQMANSLMQAA+ WLR LQ
Subjt:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGLIWAQNGVNEHQMANSLMQAADNWLRLLQ

XP_022987436.1 uncharacterized protein LOC111484983 [Cucurbita maxima]2.1e-18485.16Show/hide
Query:  MLGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPSKTDETESPSSDTKSKVVASATKPSKPLT-QPKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD
        MLGTALQFGGIKGEDRFYIPV+ARK YNQQKPSRRP+KTDETE+PS    SKVVAS T PSKPLT Q KSNLERFLDAT PSVPAQYFSKTTMRGWRTCD
Subjt:  MLGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPSKTDETESPSSDTKSKVVASATKPSKPLT-QPKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQLYGES-AMRSESKSRLAGEDSDLDSSRDTSSDGSSEYEVGKICKFSREQWVH
        IEFQPYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQ+YGES A+RS+SKSRLA EDSDLDSSRDTSS+GS +YE GK C  SREQWVH
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQLYGES-AMRSESKSRLAGEDSDLDSSRDTSSDGSSEYEVGKICKFSREQWVH

Query:  HHLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDAGNSRSVLLFQFLEQDLPYQRVPLADKIYELAYQYPGLKTLRSCDILPASWVSVAWYPIYRIPTGPT
        HHL  ++ L +RKTSL DE    QEGFSSDDGDA   RS LLFQFLEQDLPYQRVPLADKI++LAYQ+PGLKTLRSCDILPASW+SVAWYPIYRIPTGPT
Subjt:  HHLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDAGNSRSVLLFQFLEQDLPYQRVPLADKIYELAYQYPGLKTLRSCDILPASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGLIWAQNGVNEHQMANSLMQAADNWLRLLQ
        LKDLDACFLTYHSLSTPIRGNGHGQAP MIYPND DG+PKVSLPVFGLASYKLKG IWAQN V E+QM NSLMQAA+ WLR LQ
Subjt:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGLIWAQNGVNEHQMANSLMQAADNWLRLLQ

XP_023516127.1 uncharacterized protein LOC111780081 [Cucurbita pepo subsp. pepo]2.2e-18685.68Show/hide
Query:  MLGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPSKTDETESPSSDTKSKVVASATKPSKPLT-QPKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD
        MLGTALQFGGIKGEDRFYIPV+ARK+YNQQKPSRRP+KTDETE+PS    SKVVAS T PSKPLT Q KSNLERFLDAT PSVPAQYFSKTTMRGWRTCD
Subjt:  MLGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPSKTDETESPSSDTKSKVVASATKPSKPLT-QPKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQLYGES-AMRSESKSRLAGEDSDLDSSRDTSSDGSSEYEVGKICKFSREQWVH
        IEFQPYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQ+YGES A+RS+SKSRLA EDSDLDSSRDTSS+GS +YE GK C  SREQWVH
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQLYGES-AMRSESKSRLAGEDSDLDSSRDTSSDGSSEYEVGKICKFSREQWVH

Query:  HHLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDAGNSRSVLLFQFLEQDLPYQRVPLADKIYELAYQYPGLKTLRSCDILPASWVSVAWYPIYRIPTGPT
        HHL  E+ + +RKTSL DE    QEGFSSDDGDA   RS LLFQFLEQDLPYQRVPLADKI++LAYQ+PGLKTLRSCDILPASW+SVAWYPIYRIPTGPT
Subjt:  HHLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDAGNSRSVLLFQFLEQDLPYQRVPLADKIYELAYQYPGLKTLRSCDILPASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGLIWAQNGVNEHQMANSLMQAADNWLRLLQ
        LKDLDACFLTYHSLSTPIRGNGHGQAP MIYPND DG+PKVSLPVFGLASYKLKG IWAQN V EHQMANSLMQAA+ WLR LQ
Subjt:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGLIWAQNGVNEHQMANSLMQAADNWLRLLQ

TrEMBL top hitse value%identityAlignment
A0A1S4DZI3 uncharacterized protein LOC103494138 isoform X38.0e-17480.73Show/hide
Query:  MLGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPSKTDETESPSSDTKSKVVASATKPSKPLT-QPKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD
        MLGTALQFGGIKGEDRFYIPV+ARKNYNQQKPSRRP+KTDETES S    SKVV   TKP + LT Q KSNLERFL+AT PSVPAQYFSKTTMR WRTCD
Subjt:  MLGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPSKTDETESPSSDTKSKVVASATKPSKPLT-QPKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQLYGE-SAMRSESKSRLAGEDSDLDSSRDTSSDGSSEYEVGKICKFSREQWVH
        IEFQPYFILNDLWESFKEWSAYGAGVPLVL+GGDSVVQYYVPYLSGIQ+YGE +A+RS+S  RLA EDSDLDSSRDTSSDGS +Y++GK    SREQW H
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQLYGE-SAMRSESKSRLAGEDSDLDSSRDTSSDGSSEYEVGKICKFSREQWVH

Query:  HHLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDAGNSRSVLLFQFLEQDLPYQRVPLADKIYELAYQYPGLKTLRSCDILPASWVSVAWYPIYRIPTGPT
         HL  EN  K+RKTSL+DE+  +QEGF SDDGDAG  RS LLFQFLEQDLPYQRVPLADKI+ELAYQ+PGLKTLRSCDILPASWVSVAWYPIYRIPTGPT
Subjt:  HHLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDAGNSRSVLLFQFLEQDLPYQRVPLADKIYELAYQYPGLKTLRSCDILPASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGLIWAQNGVNEHQMANSLMQAADNWLRLLQ
        LKDLDACFLTYHSLSTP +GN H   PVM+YP D+D + K+SLPVFG+ASYKLKG IW QNG+N+HQ ANSLMQAAD WLR LQ
Subjt:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGLIWAQNGVNEHQMANSLMQAADNWLRLLQ

A0A5A7USF1 Uncharacterized protein8.0e-17480.73Show/hide
Query:  MLGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPSKTDETESPSSDTKSKVVASATKPSKPLT-QPKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD
        MLGTALQFGGIKGEDRFYIPV+ARKNYNQQKPSRRP+KTDETES S    SKVV   TKP + LT Q KSNLERFL+AT PSVPAQYFSKTTMR WRTCD
Subjt:  MLGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPSKTDETESPSSDTKSKVVASATKPSKPLT-QPKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQLYGE-SAMRSESKSRLAGEDSDLDSSRDTSSDGSSEYEVGKICKFSREQWVH
        IEFQPYFILNDLWESFKEWSAYGAGVPLVL+GGDSVVQYYVPYLSGIQ+YGE +A+RS+S  RLA EDSDLDSSRDTSSDGS +Y++GK    SREQW H
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQLYGE-SAMRSESKSRLAGEDSDLDSSRDTSSDGSSEYEVGKICKFSREQWVH

Query:  HHLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDAGNSRSVLLFQFLEQDLPYQRVPLADKIYELAYQYPGLKTLRSCDILPASWVSVAWYPIYRIPTGPT
         HL  EN  K+RKTSL+DE+  +QEGF SDDGDAG  RS LLFQFLEQDLPYQRVPLADKI+ELAYQ+PGLKTLRSCDILPASWVSVAWYPIYRIPTGPT
Subjt:  HHLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDAGNSRSVLLFQFLEQDLPYQRVPLADKIYELAYQYPGLKTLRSCDILPASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGLIWAQNGVNEHQMANSLMQAADNWLRLLQ
        LKDLDACFLTYHSLSTP +GN H   PVM+YP D+D + K+SLPVFG+ASYKLKG IW QNG+N+HQ ANSLMQAAD WLR LQ
Subjt:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGLIWAQNGVNEHQMANSLMQAADNWLRLLQ

A0A6J1C0E1 uncharacterized protein LOC1110069252.5e-19991.12Show/hide
Query:  MLGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPSKTDETESPSSDTKSKVVASATKPSKPLT-QPKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD
        M GTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRP K+DETESPSSD K+KVVAS TKPSKPLT Q KSNLERFLDATTPSVPAQYFSKTTMRGWRTCD
Subjt:  MLGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPSKTDETESPSSDTKSKVVASATKPSKPLT-QPKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQLYGESAMRSESKSRLAGEDSDLDSSRDTSSDGSSEYEVGKICKFSREQWVHH
        IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQ+YGESAMRS+SK RLAGEDSDLDSSRDTSSDGS EYEVGK  + SREQWVH 
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQLYGESAMRSESKSRLAGEDSDLDSSRDTSSDGSSEYEVGKICKFSREQWVHH

Query:  HLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDAGNSRSVLLFQFLEQDLPYQRVPLADKIYELAYQYPGLKTLRSCDILPASWVSVAWYPIYRIPTGPTL
         LT ENTLK+R  S+ DE C IQEGFSSDDGDAGN RSVLLFQF EQDLPYQRVPLADKI++LAYQYPGLK+LRSCDI PASWVSVAWYPIYRIPTGPTL
Subjt:  HLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDAGNSRSVLLFQFLEQDLPYQRVPLADKIYELAYQYPGLKTLRSCDILPASWVSVAWYPIYRIPTGPTL

Query:  KDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGLIWAQNGVNEHQMANSLMQAADNWLRLLQ
        KDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKG IWAQNGV EHQMANSLMQAADNWLRLLQ
Subjt:  KDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGLIWAQNGVNEHQMANSLMQAADNWLRLLQ

A0A6J1E577 uncharacterized protein LOC1114300504.1e-18685.16Show/hide
Query:  MLGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPSKTDETESPSSDTKSKVVASATKPSKPLT-QPKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD
        MLGTALQFGGIKGEDRFYIPV+ARK+YNQQKPSRRP+KTDETE+PSS+    VVAS T PSKPLT Q KSNLERFLDAT PSVPAQYFSKTTMRGWRTCD
Subjt:  MLGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPSKTDETESPSSDTKSKVVASATKPSKPLT-QPKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQLYGES-AMRSESKSRLAGEDSDLDSSRDTSSDGSSEYEVGKICKFSREQWVH
        IEFQPYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQ+YGES A+RS+SKSRLA EDSDLDSS+DTSS+GS +YE GK C  SREQWVH
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQLYGES-AMRSESKSRLAGEDSDLDSSRDTSSDGSSEYEVGKICKFSREQWVH

Query:  HHLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDAGNSRSVLLFQFLEQDLPYQRVPLADKIYELAYQYPGLKTLRSCDILPASWVSVAWYPIYRIPTGPT
        HHL  E+ + +RKTSL DE    QEGFSSDDGDA   RS LLFQFLEQDLPYQRVPLADKI++LAYQ+PGLKTLRSCDILPASW+SVAWYPIYRIPTGPT
Subjt:  HHLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDAGNSRSVLLFQFLEQDLPYQRVPLADKIYELAYQYPGLKTLRSCDILPASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGLIWAQNGVNEHQMANSLMQAADNWLRLLQ
        LKDLDACFLTYHSLSTPIRGNGHGQAP MIYPND DG+PKVSLPVFGLASYKLKG IWAQN V EHQMANSLMQAA+ WLR LQ
Subjt:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGLIWAQNGVNEHQMANSLMQAADNWLRLLQ

A0A6J1JE68 uncharacterized protein LOC1114849831.0e-18485.16Show/hide
Query:  MLGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPSKTDETESPSSDTKSKVVASATKPSKPLT-QPKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD
        MLGTALQFGGIKGEDRFYIPV+ARK YNQQKPSRRP+KTDETE+PS    SKVVAS T PSKPLT Q KSNLERFLDAT PSVPAQYFSKTTMRGWRTCD
Subjt:  MLGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPSKTDETESPSSDTKSKVVASATKPSKPLT-QPKSNLERFLDATTPSVPAQYFSKTTMRGWRTCD

Query:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQLYGES-AMRSESKSRLAGEDSDLDSSRDTSSDGSSEYEVGKICKFSREQWVH
        IEFQPYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQ+YGES A+RS+SKSRLA EDSDLDSSRDTSS+GS +YE GK C  SREQWVH
Subjt:  IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQLYGES-AMRSESKSRLAGEDSDLDSSRDTSSDGSSEYEVGKICKFSREQWVH

Query:  HHLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDAGNSRSVLLFQFLEQDLPYQRVPLADKIYELAYQYPGLKTLRSCDILPASWVSVAWYPIYRIPTGPT
        HHL  ++ L +RKTSL DE    QEGFSSDDGDA   RS LLFQFLEQDLPYQRVPLADKI++LAYQ+PGLKTLRSCDILPASW+SVAWYPIYRIPTGPT
Subjt:  HHLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDAGNSRSVLLFQFLEQDLPYQRVPLADKIYELAYQYPGLKTLRSCDILPASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGLIWAQNGVNEHQMANSLMQAADNWLRLLQ
        LKDLDACFLTYHSLSTPIRGNGHGQAP MIYPND DG+PKVSLPVFGLASYKLKG IWAQN V E+QM NSLMQAA+ WLR LQ
Subjt:  LKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGLIWAQNGVNEHQMANSLMQAADNWLRLLQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15030.1 Protein of unknown function (DUF789)7.8e-8953.82Show/hide
Query:  SSDTKSKVVASATKPSKPLTQPKSNLERFLDATTPSVPAQYFSKTTMRGWRTCDIEFQ-PYFILNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYL
        SS TK +   SA           SN+ERFLD+ TPSVPA Y SKT +R     D+E Q PYF+L D+WESF EWSAYG GVPL LN   D V QYYVP L
Subjt:  SSDTKSKVVASATKPSKPLTQPKSNLERFLDATTPSVPAQYFSKTTMRGWRTCDIEFQ-PYFILNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYL

Query:  SGIQLYGE-SAMRSESKSRLAGEDSDLDSSRDTSSDGSSEYEVGKICKFSREQWVHHHLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDAGNSRSVLLFQ
        SGIQ+Y +  A+ S  ++R  GE+S+ D  RD+SS+GSS      +C +S+EQ          + ++ K SL  E    QE  SSDDG+  +S+  L+F+
Subjt:  SGIQLYGE-SAMRSESKSRLAGEDSDLDSSRDTSSDGSSEYEVGKICKFSREQWVHHHLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDAGNSRSVLLFQ

Query:  FLEQDLPYQRVPLADKIYELAYQYPGLKTLRSCDILPASWVSVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLP
        +LE+DLPY R P ADK+ +LA ++P LKTLRSCD+LP+SW SVAWYPIY+IPTGPTLKDLDACFLTYHSL TP +G G     + +     + V K+ LP
Subjt:  FLEQDLPYQRVPLADKIYELAYQYPGLKTLRSCDILPASWVSVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLP

Query:  VFGLASYKLKGLIWAQNGVNEHQMANSLMQAADNWLRLLQ
        VFGLASYKL+G +W   G + HQ+ANSL QAADNWLRL Q
Subjt:  VFGLASYKLKGLIWAQNGVNEHQMANSLMQAADNWLRLLQ

AT2G01260.1 Protein of unknown function (DUF789)5.6e-8749.1Show/hide
Query:  MLGTALQF-GGIKGEDRFYIPVKARKNYNQQKPSRRPSKTDETESPSSDTKSKVVASATKPSKPLTQP----KSNLERFLDATTPSVPAQYFSKTTMRGW
        MLG   Q   G  G+D FY   K R+  NQ+    R +++D +  PS         SA  P K   +P     SNL+RFL++ TPSVPAQ+ SKT +R  
Subjt:  MLGTALQF-GGIKGEDRFYIPVKARKNYNQQKPSRRPSKTDETESPSSDTKSKVVASATKPSKPLTQP----KSNLERFLDATTPSVPAQYFSKTTMRGW

Query:  RTCD--IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSGIQLYGES-AMRSESKSRLAGEDSDLDSSRDTSSDGSSEYEVGKICKF
        R  D   +  PYF+L D+W+SF EWSAYG GVPLVLN   D V+QYYVP LS IQ+Y  S A+ S  KSR  G+ SD D  RD+SSD SS+ +  ++   
Subjt:  RTCD--IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSGIQLYGES-AMRSESKSRLAGEDSDLDSSRDTSSDGSSEYEVGKICKF

Query:  SREQWVHHHLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDAGNSRSVLLFQFLEQDLPYQRVPLADKIYELAYQYPGLKTLRSCDILPASWVSVAWYPIY
                      + ++   SL D+    QE  SSDDG+   S+  L+F++LE+DLPY R P ADK+ +LA Q+P L TLRSCD+L +SW SVAWYPIY
Subjt:  SREQWVHHHLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDAGNSRSVLLFQFLEQDLPYQRVPLADKIYELAYQYPGLKTLRSCDILPASWVSVAWYPIY

Query:  RIPTGPTLKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGLIWAQNGVNEHQMANSLMQAADNWL
        RIPTGPTLKDLDACFLTYHSL T   G G  Q+  +  P + +   K+SLPVFGLASYK +G +W   G +EHQ+ NSL QAAD WL
Subjt:  RIPTGPTLKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGLIWAQNGVNEHQMANSLMQAADNWL

AT2G01260.2 Protein of unknown function (DUF789)4.0e-6948.93Show/hide
Query:  MLGTALQF-GGIKGEDRFYIPVKARKNYNQQKPSRRPSKTDETESPSSDTKSKVVASATKPSKPLTQP----KSNLERFLDATTPSVPAQYFSKTTMRGW
        MLG   Q   G  G+D FY   K R+  NQ+    R +++D +  PS         SA  P K   +P     SNL+RFL++ TPSVPAQ+ SKT +R  
Subjt:  MLGTALQF-GGIKGEDRFYIPVKARKNYNQQKPSRRPSKTDETESPSSDTKSKVVASATKPSKPLTQP----KSNLERFLDATTPSVPAQYFSKTTMRGW

Query:  RTCD--IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSGIQLYGES-AMRSESKSRLAGEDSDLDSSRDTSSDGSSEYEVGKICKF
        R  D   +  PYF+L D+W+SF EWSAYG GVPLVLN   D V+QYYVP LS IQ+Y  S A+ S  KSR  G+ SD D  RD+SSD SS+ +  ++   
Subjt:  RTCD--IEFQPYFILNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSGIQLYGES-AMRSESKSRLAGEDSDLDSSRDTSSDGSSEYEVGKICKF

Query:  SREQWVHHHLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDAGNSRSVLLFQFLEQDLPYQRVPLADKIYELAYQYPGLKTLRSCDILPASWVSVAWYPIY
                      + ++   SL D+    QE  SSDDG+   S+  L+F++LE+DLPY R P ADK+ +LA Q+P L TLRSCD+L +SW SVAWYPIY
Subjt:  SREQWVHHHLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDAGNSRSVLLFQFLEQDLPYQRVPLADKIYELAYQYPGLKTLRSCDILPASWVSVAWYPIY

Query:  RIPTGPTLKDLDACFLTYHSLSTPIRG
        RIPTGPTLKDLDACFLTYHSL T   G
Subjt:  RIPTGPTLKDLDACFLTYHSLSTPIRG

AT4G16100.1 Protein of unknown function (DUF789)1.0e-8045Show/hide
Query:  IKGEDRFYIPVKARKNYNQQKPSRRPSKTDETESPSSDTKSKVVASATKPSKPLTQPK------------------------SNLERFLDATTPSVPAQY
        I+GE+RFY P   RK   QQ+  ++  + +E E      K  +        K + QP+                        SNL RFLD TTP V  Q+
Subjt:  IKGEDRFYIPVKARKNYNQQKPSRRPSKTDETESPSSDTKSKVVASATKPSKPLTQPK------------------------SNLERFLDATTPSVPAQY

Query:  FSKTTMRGWRTCDIEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQLYGESAMRSESKSRLAGEDSDLDSSRDTSSDGSSEYEVG
           T+ +GWRT + E++PYF+LNDLW+SF+EWSAYG GVPL+LNG DSVVQYYVPYLSGIQLY E   R+ +  R  GE+SD DS RD SSDGS++    
Subjt:  FSKTTMRGWRTCDIEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQLYGESAMRSESKSRLAGEDSDLDSSRDTSSDGSSEYEVG

Query:  KICKFSREQWVHHHLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDA-GNSRSVLLFQFLEQDLPYQRVPLADKIYELAYQYPGLKTLRSCDILPASWVSV
          C+             E +  L + SL ++ C    G SSD+ +A  NS   L+F++LE  +P+ R PL DKI  L+ Q+P L+T RSCD+ P+SWVSV
Subjt:  KICKFSREQWVHHHLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDA-GNSRSVLLFQFLEQDLPYQRVPLADKIYELAYQYPGLKTLRSCDILPASWVSV

Query:  AWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRG--NGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGLIWA-QNGVNEHQMANSLMQAADNWLRLLQ
        AWYPIYRIP G +L++LDACFLT+HSLSTP RG  N  GQ+      +      K+ LP FGLASYK K   W+ ++ V+E+Q   +L++ A+ WLR L+
Subjt:  AWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRG--NGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGLIWA-QNGVNEHQMANSLMQAADNWLRLLQ

AT5G49220.1 Protein of unknown function (DUF789)7.6e-6839.76Show/hide
Query:  GTALQFGGIKGEDRFYIPVKARKNYN----QQKPSRRPSKTDETE-------------SPS--------SDTKSKVVAS----------ATKPSKPLTQP
        G ++    I+GE+RFY P   R+       QQ+   +  + DE E             +P         S++KS+VV S          ++  S  +   
Subjt:  GTALQFGGIKGEDRFYIPVKARKNYN----QQKPSRRPSKTDETE-------------SPS--------SDTKSKVVAS----------ATKPSKPLTQP

Query:  KSNLERFLDATTPSVPAQYFSKTTMRGWRTCDIEFQPYFILNDLWESFKEWSAYGAGV-----PLVLNGGDSVVQYYVPYLSGIQLYGESAMRSESKSRL
         SNL+RFL+ TTP VPA+ F   +    +T + +   YF+L DLWESF EWSAYGAGV     PL ++G DS VQYYVPYLSGIQLY +   +  +    
Subjt:  KSNLERFLDATTPSVPAQYFSKTTMRGWRTCDIEFQPYFILNDLWESFKEWSAYGAGV-----PLVLNGGDSVVQYYVPYLSGIQLYGESAMRSESKSRL

Query:  AGEDSDLDSSRDTSSDGSSEYEVGKICKFSREQWVHHHLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDAGNSRSVLLFQFLEQDLPYQRVPLADKIYEL
                   + SS+GSS      +                +  +L + SL D+  +I    SS + +  N +  LLF++LE + P+ R PLA+KI +L
Subjt:  AGEDSDLDSSRDTSSDGSSEYEVGKICKFSREQWVHHHLTSENTLKLRKTSLSDEQCAIQEGFSSDDGDAGNSRSVLLFQFLEQDLPYQRVPLADKIYEL

Query:  AYQYPGLKTLRSCDILPASWVSVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGLIWAQNGVN
        A + P L T RSCD+LP+SWVSV+WYPIYRIP GPTL++LDACFLT+HSLST    +  G        +D     K+ LP FGLASYKLK  +W QN + 
Subjt:  AYQYPGLKTLRSCDILPASWVSVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRGNGHGQAPVMIYPNDMDGVPKVSLPVFGLASYKLKGLIWAQNGVN

Query:  EHQMANSLMQAADNWLRLLQ
        E Q   SL+QAAD WL+ LQ
Subjt:  EHQMANSLMQAADNWLRLLQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGGGAACTGCGTTGCAGTTTGGGGGCATCAAAGGTGAGGATCGGTTTTACATTCCAGTGAAGGCACGAAAGAATTATAATCAGCAGAAGCCATCCAGGAGACCCAG
CAAGACCGATGAAACTGAGAGCCCATCTTCAGATACGAAGAGCAAAGTTGTGGCTTCTGCGACAAAGCCTTCTAAACCATTAACTCAGCCTAAGAGCAACTTAGAGAGAT
TCTTGGACGCCACAACGCCTTCAGTTCCAGCGCAGTACTTTTCTAAGACAACCATGAGGGGTTGGCGGACTTGTGACATTGAGTTTCAACCTTATTTCATACTGAATGAT
CTGTGGGAGTCTTTCAAGGAGTGGAGTGCATACGGTGCTGGAGTTCCTTTAGTACTTAATGGAGGTGACTCAGTTGTTCAATATTACGTTCCATATTTGTCTGGTATCCA
ATTGTATGGTGAATCTGCTATGAGATCAGAATCTAAGTCCAGGCTGGCTGGTGAGGACAGTGATCTCGACTCTTCCAGGGATACAAGCAGCGATGGTAGCAGTGAATATG
AAGTTGGAAAAATTTGTAAATTTTCTAGAGAACAATGGGTTCATCACCATTTAACTAGTGAAAACACACTTAAACTGAGAAAGACATCTCTAAGCGATGAACAATGCGCG
ATACAGGAAGGTTTTTCAAGTGATGATGGGGATGCTGGAAATTCTCGGAGTGTTTTGCTCTTTCAGTTTCTCGAGCAAGATCTTCCTTATCAACGTGTACCATTGGCTGA
TAAGATATATGAACTTGCTTACCAATATCCTGGTTTGAAAACTTTAAGAAGTTGTGATATCCTGCCAGCCAGTTGGGTCTCTGTAGCATGGTACCCAATATACCGTATAC
CGACTGGTCCGACATTAAAAGATTTGGATGCTTGCTTTTTAACATATCATTCCCTCTCCACACCCATTCGAGGTAATGGACATGGCCAGGCACCAGTAATGATATATCCA
AATGACATGGATGGTGTCCCAAAGGTCTCCTTGCCTGTTTTTGGACTGGCTTCTTATAAGCTGAAAGGCTTGATTTGGGCGCAAAATGGCGTCAACGAGCATCAAATGGC
AAATTCCCTCATGCAGGCTGCAGATAACTGGCTGAGGCTTCTTCAGCCCAGAATAAAGAACTCGTTCGCTTATCAACACAGTCAGACGAGGGGCTATACTGCAAGTGGGG
AATGGAGTTCCACAGTCACTGGCGACTTTCCTGGCATTTGGAGAGTCGACGAAGTTCTTGAGGGAAGGGTTTTTGAGGTAGTCGCAGAGCAGGGCTTCTGCTCCTTGATC
TTGAAGCAGCAGAGCTTTGATGGTGGAGTTGAAGAGATAATTGCGCTGACGCAGGAGCTCAGTTGCATTGGGTTGCATGTCACTGCTTCACCATTGATACCTCTGCTTCT
GCCAGAAGAAGCACCGCCATTGTCAACATGGCAACCCATGACACCTTCATTTTTCACTCAGAGAGAGAGAGAGAGTGATGATTGGTGCTTATGGCCTTGGTTTTTGCCTC
TCAATTTATAG
mRNA sequenceShow/hide mRNA sequence
ATGTTGGGAACTGCGTTGCAGTTTGGGGGCATCAAAGGTGAGGATCGGTTTTACATTCCAGTGAAGGCACGAAAGAATTATAATCAGCAGAAGCCATCCAGGAGACCCAG
CAAGACCGATGAAACTGAGAGCCCATCTTCAGATACGAAGAGCAAAGTTGTGGCTTCTGCGACAAAGCCTTCTAAACCATTAACTCAGCCTAAGAGCAACTTAGAGAGAT
TCTTGGACGCCACAACGCCTTCAGTTCCAGCGCAGTACTTTTCTAAGACAACCATGAGGGGTTGGCGGACTTGTGACATTGAGTTTCAACCTTATTTCATACTGAATGAT
CTGTGGGAGTCTTTCAAGGAGTGGAGTGCATACGGTGCTGGAGTTCCTTTAGTACTTAATGGAGGTGACTCAGTTGTTCAATATTACGTTCCATATTTGTCTGGTATCCA
ATTGTATGGTGAATCTGCTATGAGATCAGAATCTAAGTCCAGGCTGGCTGGTGAGGACAGTGATCTCGACTCTTCCAGGGATACAAGCAGCGATGGTAGCAGTGAATATG
AAGTTGGAAAAATTTGTAAATTTTCTAGAGAACAATGGGTTCATCACCATTTAACTAGTGAAAACACACTTAAACTGAGAAAGACATCTCTAAGCGATGAACAATGCGCG
ATACAGGAAGGTTTTTCAAGTGATGATGGGGATGCTGGAAATTCTCGGAGTGTTTTGCTCTTTCAGTTTCTCGAGCAAGATCTTCCTTATCAACGTGTACCATTGGCTGA
TAAGATATATGAACTTGCTTACCAATATCCTGGTTTGAAAACTTTAAGAAGTTGTGATATCCTGCCAGCCAGTTGGGTCTCTGTAGCATGGTACCCAATATACCGTATAC
CGACTGGTCCGACATTAAAAGATTTGGATGCTTGCTTTTTAACATATCATTCCCTCTCCACACCCATTCGAGGTAATGGACATGGCCAGGCACCAGTAATGATATATCCA
AATGACATGGATGGTGTCCCAAAGGTCTCCTTGCCTGTTTTTGGACTGGCTTCTTATAAGCTGAAAGGCTTGATTTGGGCGCAAAATGGCGTCAACGAGCATCAAATGGC
AAATTCCCTCATGCAGGCTGCAGATAACTGGCTGAGGCTTCTTCAGCCCAGAATAAAGAACTCGTTCGCTTATCAACACAGTCAGACGAGGGGCTATACTGCAAGTGGGG
AATGGAGTTCCACAGTCACTGGCGACTTTCCTGGCATTTGGAGAGTCGACGAAGTTCTTGAGGGAAGGGTTTTTGAGGTAGTCGCAGAGCAGGGCTTCTGCTCCTTGATC
TTGAAGCAGCAGAGCTTTGATGGTGGAGTTGAAGAGATAATTGCGCTGACGCAGGAGCTCAGTTGCATTGGGTTGCATGTCACTGCTTCACCATTGATACCTCTGCTTCT
GCCAGAAGAAGCACCGCCATTGTCAACATGGCAACCCATGACACCTTCATTTTTCACTCAGAGAGAGAGAGAGAGTGATGATTGGTGCTTATGGCCTTGGTTTTTGCCTC
TCAATTTATAG
Protein sequenceShow/hide protein sequence
MLGTALQFGGIKGEDRFYIPVKARKNYNQQKPSRRPSKTDETESPSSDTKSKVVASATKPSKPLTQPKSNLERFLDATTPSVPAQYFSKTTMRGWRTCDIEFQPYFILND
LWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQLYGESAMRSESKSRLAGEDSDLDSSRDTSSDGSSEYEVGKICKFSREQWVHHHLTSENTLKLRKTSLSDEQCA
IQEGFSSDDGDAGNSRSVLLFQFLEQDLPYQRVPLADKIYELAYQYPGLKTLRSCDILPASWVSVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRGNGHGQAPVMIYP
NDMDGVPKVSLPVFGLASYKLKGLIWAQNGVNEHQMANSLMQAADNWLRLLQPRIKNSFAYQHSQTRGYTASGEWSSTVTGDFPGIWRVDEVLEGRVFEVVAEQGFCSLI
LKQQSFDGGVEEIIALTQELSCIGLHVTASPLIPLLLPEEAPPLSTWQPMTPSFFTQRERESDDWCLWPWFLPLNL