; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr018685 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr018685
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionTDBD domain-containing protein
Genome locationtig00153207:479836..492367
RNA-Seq ExpressionSgr018685
SyntenySgr018685
Gene Ontology termsGO:0045944 - positive regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000977 - RNA polymerase II regulatory region sequence-specific DNA binding (molecular function)
GO:0003682 - chromatin binding (molecular function)
GO:0042393 - histone binding (molecular function)
InterPro domainsIPR032308 - Jas TPL-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600062.1 hypothetical protein SDJN03_05295, partial [Cucurbita argyrosperma subsp. sororia]2.6e-30687.42Show/hide
Query:  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRSHQWFMDGSAPELFSSKKQAIETVNTRPVQGVPHMNVSPWENTSSFQSVPGPFTDRLFGSEPIRTVN
        SFQHKSFWIPRDAGCLTDGEMNYDSSSRIE KR HQWFMDGSAPELFSSKKQAIETVNTR V GVPHMNVSPWENTSSFQSVPGPFTDRLFGSEPIRTVN
Subjt:  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRSHQWFMDGSAPELFSSKKQAIETVNTRPVQGVPHMNVSPWENTSSFQSVPGPFTDRLFGSEPIRTVN

Query:  LVDRGITVGNANMAMGRKEFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHENAISLGQT
        LVDR ITVGNANM MGRKEFENHFANNPSVGLSMSQSIED SSCL+FGGIRKVKVNQVRD DIG+SASLGH YS   +GT+SMGT F K HENAISLGQ+
Subjt:  LVDRGITVGNANMAMGRKEFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHENAISLGQT

Query:  YNSRDENTISVGPAYHKTDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS
        YN+RDEN+ISVGPAYHKTDD+FISMGHAFSKGDGNFIT GHN+SKGD+SILSMSQPFDKGD  FISMGQSYEKAEGNIISFGASYNKGHENFISMGP YS
Subjt:  YNSRDENTISVGPAYHKTDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS

Query:  KAGDTFISMASSYNKGNDDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQASEV
        KAGDTFISMAS YNKGNDD +SMGP YDKV+SD+VHVGPKYDKADSGA+SMAHNY+K ESNTISFGGFDDENA DNPSGGIISSYDLLMANQASAQASEV
Subjt:  KAGDTFISMASSYNKGNDDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQASEV

Query:  STLRDSVDPNVEVNVNNAPKVDAKIDTSAKNKEPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCKHSKALNAYE
        S LRDSVDP+ EVN NNA KVD KIDTS+KNKEPRTTKKV PNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNC HSKALNAYE
Subjt:  STLRDSVDPNVEVNVNNAPKVDAKIDTSAKNKEPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCKHSKALNAYE

Query:  FERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWK-----------------DLKDIYPPGYLITGWICNRNIAK
        FERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQE+LFDAIQNVTGSPINQKNFRIWK                 +   IYPPGYL+TGW+CN+NIAK
Subjt:  FERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWK-----------------DLKDIYPPGYLITGWICNRNIAK

Query:  VQPS
        +  S
Subjt:  VQPS

KAG6601008.1 hypothetical protein SDJN03_06241, partial [Cucurbita argyrosperma subsp. sororia]3.9e-30292.14Show/hide
Query:  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRSHQWFMDGSAPELFSSKKQAIETVNTRPVQGVPHMNVSPWENTSSFQSVPGPFTDRLFGSEPIRTVN
        SFQHKSFWIPRDAGCLTDGEMNYDSSSRIE KR HQWFMDGSAPELFSSKKQAIETVNTRPV GVPHMNVSPWENTSSFQSVPG FTDRLFGSEP+RTVN
Subjt:  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRSHQWFMDGSAPELFSSKKQAIETVNTRPVQGVPHMNVSPWENTSSFQSVPGPFTDRLFGSEPIRTVN

Query:  LVDRGITVGNANMAMGRKEFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHENAISLGQT
        LVDRGITVGNANM MGRKE+ENHFANNPSVGLSMSQSIEDSSSCL+FGGIRKVKVNQVRDPDIG+SASLGHAYSRGDNGTISMG TF+KNHE+AISLG T
Subjt:  LVDRGITVGNANMAMGRKEFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHENAISLGQT

Query:  YNSRDENTISVGPAYHKTDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS
        YNSRDEN ISVGPAYHKTDD+FISMGHAFSKGD NFITIG N+SKGD+SILSMSQPFDKGDD FISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS
Subjt:  YNSRDENTISVGPAYHKTDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS

Query:  KAGDTFISMASSYNKGNDDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQASEV
        KAGDTFISMA SYNKGNDD LSMGP +DKV+SD+VHVGPK+DKADSG++SM HNYHK E NTISFGGFDDEN T NPSGGIISSYDLLMANQASAQASEV
Subjt:  KAGDTFISMASSYNKGNDDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQASEV

Query:  STLRDSVDPNVEVNVNNAPKVDAKIDTSAKNKEPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCKHSKALNAYE
        ST+RDSV PNVEVN+NNA KVDAKIDTS+K +EPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKG IKGTGYLCSCDNC HSKALNAYE
Subjt:  STLRDSVDPNVEVNVNNAPKVDAKIDTSAKNKEPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCKHSKALNAYE

Query:  FERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWK
        FERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAI NVTGSPINQKNFRIWK
Subjt:  FERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWK

KAG7031622.1 hypothetical protein SDJN02_05663, partial [Cucurbita argyrosperma subsp. argyrosperma]3.0e-30291.98Show/hide
Query:  QSFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRSHQWFMDGSAPELFSSKKQAIETVNTRPVQGVPHMNVSPWENTSSFQSVPGPFTDRLFGSEPIRTV
        +SFQHKSFWIPRDAGCLTDGEMNYDSSSRIE KR HQWFMDGSAPELFSSKKQAIETVNTRPV GVPHMNVSPWENTSSFQSVPG FTDRLFGSEP+RTV
Subjt:  QSFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRSHQWFMDGSAPELFSSKKQAIETVNTRPVQGVPHMNVSPWENTSSFQSVPGPFTDRLFGSEPIRTV

Query:  NLVDRGITVGNANMAMGRKEFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHENAISLGQ
        NLVDRGITVGNANM MGRKE+ENHFANNPSVGLSMSQSIEDSSSCL+FGGIRKVKVNQVRDPDIG+SASLGHAYSRGDNGTISMG TF+KNHE+AISLG 
Subjt:  NLVDRGITVGNANMAMGRKEFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHENAISLGQ

Query:  TYNSRDENTISVGPAYHKTDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTY
        TYNSRDEN ISVGPAYHKTDD+FISMGHAFSKGD NFITIG N+SKGD+SILSMSQPFDKGDD FISMGQSYEKAEGNIISFGASYNKGHENFISMGPTY
Subjt:  TYNSRDENTISVGPAYHKTDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTY

Query:  SKAGDTFISMASSYNKGNDDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQASE
        SKAGDTFISMA SYNKGNDD LSMGP +DKV+SD+VHVGPK+DKADSG++SM HNYHK E NTISFGGFDDEN T NPSGGIISSYDLLMANQASAQASE
Subjt:  SKAGDTFISMASSYNKGNDDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQASE

Query:  VSTLRDSVDPNVEVNVNNAPKVDAKIDTSAKNKEPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCKHSKALNAY
        VST+RDSV PNVEVN+NNA KVDAKIDTS+K +EPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKG IKGTGYLCSCDNC HSKALNAY
Subjt:  VSTLRDSVDPNVEVNVNNAPKVDAKIDTSAKNKEPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCKHSKALNAY

Query:  EFERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWK
        EFERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAI NVTGSPINQKNFRIWK
Subjt:  EFERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWK

XP_022139003.1 uncharacterized protein LOC111010042 [Momordica charantia]0.0e+0095.18Show/hide
Query:  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRSHQWFMDGSAPELFSSKKQAIETVNTRPVQGVPHMNVSPWENTSSFQSVPGPFTDRLFGSEPIRTVN
        SFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKR HQWFMDG+A ELFSSKKQAIETVN+RPV GVPHMNVSPW+NTSSFQSVPGPFTDRLFGSEPIRTVN
Subjt:  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRSHQWFMDGSAPELFSSKKQAIETVNTRPVQGVPHMNVSPWENTSSFQSVPGPFTDRLFGSEPIRTVN

Query:  LVDRGITVGNANMAMGRKEFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHENAISLGQT
        LVDRGITVGNANM MGRKEFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIG+SASLGHAY+RGDNGTISMGTTF+KNHENAISLGQT
Subjt:  LVDRGITVGNANMAMGRKEFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHENAISLGQT

Query:  YNSRDENTISVGPAYHKTDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS
        YNSRD++TISVGPAYHKTDDNFISMGH FSKGDGNFITIGHN+SKGDSSILSMSQPFDKGDD FISMGQSYEKA+GNIISFGASYNKGHENFISMGPTYS
Subjt:  YNSRDENTISVGPAYHKTDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS

Query:  KAGDTFISMASSYNKGNDDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQASEV
        K GDTFISMASSYNKGNDDTLSMGP YDKVDSD+VHVGPKYDKADSG+LSMAHNYHK ESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQASEV
Subjt:  KAGDTFISMASSYNKGNDDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQASEV

Query:  STLRDSVDPNVEVNVNNAPKVDAKIDTSAKNKEPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCKHSKALNAYE
        STLRDSVDPN E+NVNNAPK+DAKIDTS+KNKEPRTTKKV PNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCK SKALNAYE
Subjt:  STLRDSVDPNVEVNVNNAPKVDAKIDTSAKNKEPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCKHSKALNAYE

Query:  FERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWK
        FERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWK
Subjt:  FERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWK

XP_038892004.1 uncharacterized protein LOC120081323 [Benincasa hispida]1.2e-30391.96Show/hide
Query:  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRSHQWFMDGSAPELFSSKKQAIETVNTRPVQGVPHMNVSPWENTSSFQSVPGPFTDRLFGSEPIRTVN
        SFQHKSFWIPRDAGCLTDGEMNYDSSSR+ETKR HQWFMDGSAPELFS+KKQAIE VN+RPV GVPHMNVSPWENTSSFQSVPG FTDRLFGSEPIRTVN
Subjt:  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRSHQWFMDGSAPELFSSKKQAIETVNTRPVQGVPHMNVSPWENTSSFQSVPGPFTDRLFGSEPIRTVN

Query:  LVDRGITVGNANMAMGRKEFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHENAISLGQT
        LVDRG+TVGNANM MGRKEFENHF NNPSVGLSMSQSIED SSCL+FGGIRKVKVNQVRDPD+G+SASLGHAYSRGDN TISMGT F+KNHENAISLGQT
Subjt:  LVDRGITVGNANMAMGRKEFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHENAISLGQT

Query:  YNSRDENTISVGPAYHKTDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS
        YNSRDEN ISVGPAYHKTDDNFISMGHAFSKGDG+FITIGHN+SKGD+SILSM+QPFDKGDD FISMGQ+YEKAEGNIISFGASYNKG ENFISMGPTYS
Subjt:  YNSRDENTISVGPAYHKTDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS

Query:  KAGDTFISMASSYNKGNDDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQASEV
        KAGDTFISMASS+NKGNDD LSMGP YDKV+SD+VHVGPK+DKADSGA+SMAHN+HK ESNTISFGGFDDEN TDNPSGGIISSYDLLMANQASAQASEV
Subjt:  KAGDTFISMASSYNKGNDDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQASEV

Query:  STLRDSVDPNVEVNVNNAPKVDAKIDTSAKNKEPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCKHSKALNAYE
        STLRDSV+PNVEVN+NNA KVD KIDTS+KNKEPR +KKV PNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNC HSKALNAYE
Subjt:  STLRDSVDPNVEVNVNNAPKVDAKIDTSAKNKEPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCKHSKALNAYE

Query:  FERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWK
        FERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFR+WK
Subjt:  FERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWK

TrEMBL top hitse value%identityAlignment
A0A1S3CFN8 uncharacterized protein LOC1035002002.0e-29991.25Show/hide
Query:  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRSHQWFMDGSAPELFSSKKQAIETVNTRPVQGVPHMNVSPWENTSSFQSVPGPFTDRLFGSEPIRTVN
        SFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKR HQWFMDGSAPELFSSKKQAIE VN+RPV GVPHMNVSPWENTSSFQSVPG FTDRLFGSEPIRTVN
Subjt:  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRSHQWFMDGSAPELFSSKKQAIETVNTRPVQGVPHMNVSPWENTSSFQSVPGPFTDRLFGSEPIRTVN

Query:  LVDRGITVGNANMAMGRKEFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHENAISLGQT
        LVDRGI+VGNANM MGRKEFENHF NNPSVGLSMSQSIED SSCL+FGGIRKVKVNQVRDPD+G+ ASLGH YSRGDN TISMG+ F+KNHEN ISLGQT
Subjt:  LVDRGITVGNANMAMGRKEFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHENAISLGQT

Query:  YNSRDENTISVGPAYHKTDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS
        YNSRDEN ISVGPAYHKTDDNFISMGHAFSKGDG+FITIGHN+SKGD+SILSM+QPFDKGDD FISMGQSYEKAEGNIISF ASYNKG ENFISMGP YS
Subjt:  YNSRDENTISVGPAYHKTDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS

Query:  KAGDTFISMASSYNKGNDDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQASEV
        KAGDTFISMASS+NKGNDD LSM P YDKV+SD+VHVGPK+DKADSGA+SMAHNYHK ESNTISFGGFDDEN TDNPSGGIISSYDLLMANQASAQASEV
Subjt:  KAGDTFISMASSYNKGNDDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQASEV

Query:  STLRDSVDPNVEVNVNNAPKVDAKIDTSAKNKEPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCKHSKALNAYE
        STLRDSVDPNVEVN+NNA KVD KIDT++KNKEPR +KKV PNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSC+NC H+KALNAYE
Subjt:  STLRDSVDPNVEVNVNNAPKVDAKIDTSAKNKEPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCKHSKALNAYE

Query:  FERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWK
        FERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWK
Subjt:  FERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWK

A0A5A7U4X7 TDBD domain-containing protein8.8e-30091.43Show/hide
Query:  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRSHQWFMDGSAPELFSSKKQAIETVNTRPVQGVPHMNVSPWENTSSFQSVPGPFTDRLFGSEPIRTVN
        SFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKR HQWFMDGSAPELFSSKKQAIE VN+RPV GVPHMNVSPWENTSSFQSVPG FTDRLFGSEPIRTVN
Subjt:  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRSHQWFMDGSAPELFSSKKQAIETVNTRPVQGVPHMNVSPWENTSSFQSVPGPFTDRLFGSEPIRTVN

Query:  LVDRGITVGNANMAMGRKEFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHENAISLGQT
        LVDRGI+VGNANM MGRKEFENHF NNPSVGLSMSQSIED SSCL+FGGIRKVKVNQVRDPD+G+ ASLGH YSRGDN TISMG+ F+KNHEN ISLGQT
Subjt:  LVDRGITVGNANMAMGRKEFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHENAISLGQT

Query:  YNSRDENTISVGPAYHKTDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS
        YNSRDEN ISVGPAYHKTDDNFISMGHAFSKGDG+FITIGHN+SKGD+SILSM+QPFDKGDD FISMGQSYEKAEGNIISF ASYNKG ENFISMGP YS
Subjt:  YNSRDENTISVGPAYHKTDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS

Query:  KAGDTFISMASSYNKGNDDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQASEV
        KAGDTFISMASS+NKGNDD LSM P YDKV+SD+VHVGPK+DKADSGA+SMAHNYHK ESNTISFGGFDDEN TDNPSGGIISSYDLLMANQASAQASEV
Subjt:  KAGDTFISMASSYNKGNDDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQASEV

Query:  STLRDSVDPNVEVNVNNAPKVDAKIDTSAKNKEPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCKHSKALNAYE
        STLRDSVDPNVEVN+NNA KVD KIDT++KNKEPR +KKV PNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSC+NC HSKALNAYE
Subjt:  STLRDSVDPNVEVNVNNAPKVDAKIDTSAKNKEPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCKHSKALNAYE

Query:  FERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWK
        FERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWK
Subjt:  FERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWK

A0A6J1CEN2 uncharacterized protein LOC1110100420.0e+0095.18Show/hide
Query:  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRSHQWFMDGSAPELFSSKKQAIETVNTRPVQGVPHMNVSPWENTSSFQSVPGPFTDRLFGSEPIRTVN
        SFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKR HQWFMDG+A ELFSSKKQAIETVN+RPV GVPHMNVSPW+NTSSFQSVPGPFTDRLFGSEPIRTVN
Subjt:  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRSHQWFMDGSAPELFSSKKQAIETVNTRPVQGVPHMNVSPWENTSSFQSVPGPFTDRLFGSEPIRTVN

Query:  LVDRGITVGNANMAMGRKEFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHENAISLGQT
        LVDRGITVGNANM MGRKEFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIG+SASLGHAY+RGDNGTISMGTTF+KNHENAISLGQT
Subjt:  LVDRGITVGNANMAMGRKEFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHENAISLGQT

Query:  YNSRDENTISVGPAYHKTDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS
        YNSRD++TISVGPAYHKTDDNFISMGH FSKGDGNFITIGHN+SKGDSSILSMSQPFDKGDD FISMGQSYEKA+GNIISFGASYNKGHENFISMGPTYS
Subjt:  YNSRDENTISVGPAYHKTDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS

Query:  KAGDTFISMASSYNKGNDDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQASEV
        K GDTFISMASSYNKGNDDTLSMGP YDKVDSD+VHVGPKYDKADSG+LSMAHNYHK ESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQASEV
Subjt:  KAGDTFISMASSYNKGNDDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQASEV

Query:  STLRDSVDPNVEVNVNNAPKVDAKIDTSAKNKEPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCKHSKALNAYE
        STLRDSVDPN E+NVNNAPK+DAKIDTS+KNKEPRTTKKV PNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCK SKALNAYE
Subjt:  STLRDSVDPNVEVNVNNAPKVDAKIDTSAKNKEPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCKHSKALNAYE

Query:  FERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWK
        FERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWK
Subjt:  FERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWK

A0A6J1GYI5 uncharacterized protein LOC1114584426.8e-30091.61Show/hide
Query:  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRSHQWFMDGSAPELFSSKKQAIETVNTRPVQGVPHMNVSPWENTSSFQSVPGPFTDRLFGSEPIRTVN
        SFQHKSFWIPRDAGCLTDGEMNYDSSSRIE KR HQWFMDGSAPELFSSKKQAIETVNTRPV GVPHMNVSPWENTSSFQSVPG FTDRLFGSEP+RTVN
Subjt:  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRSHQWFMDGSAPELFSSKKQAIETVNTRPVQGVPHMNVSPWENTSSFQSVPGPFTDRLFGSEPIRTVN

Query:  LVDRGITVGNANMAMGRKEFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHENAISLGQT
        LVDRGITVGNANM MGRKE+ENHFANNPSVGLS SQSIEDSSSCL+FGGIRKVKVNQVRDPDIG+SASLGHAYSRGDNGTISMGTTF+KNHE+AISLG T
Subjt:  LVDRGITVGNANMAMGRKEFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHENAISLGQT

Query:  YNSRDENTISVGPAYHKTDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS
        YNSRDEN ISVGPAYHKTDD+FISMGHAFSKGD NFITIG N+SKGD++ILSMSQPFDKGDD FISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS
Subjt:  YNSRDENTISVGPAYHKTDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS

Query:  KAGDTFISMASSYNKGNDDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQASEV
        KAGDTFISMA SYNKGNDD LSMGP +DKV+SD+VHVGPK+DKADSG++SM HNYHK E+NTISFGGFDDEN T NPSGGIISSYDLLMANQASAQASEV
Subjt:  KAGDTFISMASSYNKGNDDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQASEV

Query:  STLRDSVDPNVEVNVNNAPKVDAKIDTSAKNKEPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCKHSKALNAYE
        ST+RDSV PNVEVN+N   KVDAKIDTS+K +EPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKG IKGTGYLCSCDNC HSKALNAYE
Subjt:  STLRDSVDPNVEVNVNNAPKVDAKIDTSAKNKEPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCKHSKALNAYE

Query:  FERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWK
        FERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAI NVTGSPINQKNFRIWK
Subjt:  FERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWK

A0A6J1JXF6 uncharacterized protein LOC1114906113.4e-29990.89Show/hide
Query:  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRSHQWFMDGSAPELFSSKKQAIETVNTRPVQGVPHMNVSPWENTSSFQSVPGPFTDRLFGSEPIRTVN
        SFQHKSFWIPRDAGCLTDGEMNYDSSS+IE KR HQWFMDGSAPELFSSKKQAIE VNTRPV GVPHMNVSPWENTSSFQSVPG FTDRLF SEP+RTVN
Subjt:  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRSHQWFMDGSAPELFSSKKQAIETVNTRPVQGVPHMNVSPWENTSSFQSVPGPFTDRLFGSEPIRTVN

Query:  LVDRGITVGNANMAMGRKEFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHENAISLGQT
        LVDRGITVGNANM MGRKE+ENHFANNPSVGLSMSQSIEDSSSCL+FGGIRKVKVNQVRDPDIG+S+SLGHAYSRGDNGTISMGTTF+KNHE+AISLG T
Subjt:  LVDRGITVGNANMAMGRKEFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHENAISLGQT

Query:  YNSRDENTISVGPAYHKTDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS
        YNSRDEN ISVGP YHKTDD+FISMGHAFSKGD NFITIG N+SKGD+SILSMSQPFDKGDD FISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS
Subjt:  YNSRDENTISVGPAYHKTDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS

Query:  KAGDTFISMASSYNKGNDDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQASEV
        KAGDTFISMA SYNKGND+ LSMGP +DKV+SD+VHVGPK+DKADSG++SM HNYHK ESNTISFGGFD+EN T NPSGGIISSYDLLMANQASAQASEV
Subjt:  KAGDTFISMASSYNKGNDDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQASEV

Query:  STLRDSVDPNVEVNVNNAPKVDAKIDTSAKNKEPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCKHSKALNAYE
        ST+RDSV PN EVN+NNA KVDAKIDTS+K +EPRTTKKV PNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKG IKGTGYLCSCDNC HSKALNAYE
Subjt:  STLRDSVDPNVEVNVNNAPKVDAKIDTSAKNKEPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCKHSKALNAYE

Query:  FERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWK
        FERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAI NVTGSPINQKNFRIWK
Subjt:  FERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWK

SwissProt top hitse value%identityAlignment
A0A0H2VCA1 Autotransporter adhesin UpaG4.2e-0418.02Show/hide
Query:  RTVNLVDRGITVGNANMAMGRKEFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHENAIS
        R+V L    +  G+ +MA GR    N F  + ++G     S+ D    ++ G     K  ++    +G +A+    Y+      +++G +      ++++
Subjt:  RTVNLVDRGITVGNANMAMGRKEFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHENAIS

Query:  LGQTYNSRDENTISVGPAYHKTDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMG
         G+   +    ++++G     ++DN I++G+       N + +G+       S +++    +  +   I++GQ    ++ N I+ G++     EN I++G
Subjt:  LGQTYNSRDENTISVGPAYHKTDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMG

Query:  PTYSKAGDTFISMASSYNKGNDDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSG
           +  G   ++  S      +D++++G        + V +G       S  +S+ ++  K +   ++ G   +  +TD  +G
Subjt:  PTYSKAGDTFISMASSYNKGNDDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSG

Arabidopsis top hitse value%identityAlignment
AT3G53680.1 Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain4.7e-3555.28Show/hide
Query:  KKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCKHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQE
        KK+   +F SNVK LL TG+LDG  VKY+S S  + L+GII   GYLC C  C  SK L AYEFERHAG KTKHPNNHIY ENG+ +Y V+QEL+  P +
Subjt:  KKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCKHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQE

Query:  MLFDAIQNVTGSPINQKNFRIWK
        +L + I+ V GS ++++ F+ WK
Subjt:  MLFDAIQNVTGSPINQKNFRIWK

AT5G13660.1 unknown protein1.6e-9140.73Show/hide
Query:  EMNYDSSSRIETKRS-HQWFMDGSAPELFSSKKQAIETVNTRPVQGVPHMNVSPWENTSSFQSVPGPFTDRLFGSEPIRTVNLVDRGITVGNANMAMGRK
        E+ Y  SSR+E KRS HQW  + S+ ELFS+K+Q +  ++        HMN+SPW+ +     VP  FTD LF    I   + + R           GR 
Subjt:  EMNYDSSSRIETKRS-HQWFMDGSAPELFSSKKQAIETVNTRPVQGVPHMNVSPWENTSSFQSVPGPFTDRLFGSEPIRTVNLVDRGITVGNANMAMGRK

Query:  EFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHEN-AISLGQTYNSRDENTISVGPAYHK
          E       S GL ++       S  +   I KV           +   +   Y +G + +     +F+   E+  +S GQT+++ D + I  G    K
Subjt:  EFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHEN-AISLGQTYNSRDENTISVGPAYHK

Query:  TDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYSKAGDTFISMASSYNKGN
        TD NFI     F+      + IG  + KGD ++LS   P +KG + F+SMGQS +KA+ NI S  +SYNKG ENF+ +          F++  S+Y+  N
Subjt:  TDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYSKAGDTFISMASSYNKGN

Query:  DDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQASEVSTLRDSVDPNVEVNVNN
         + LS G +      +M  +    ++A  G  +         S T+SFG    E A  + S  + ++Y+    N +   A     L    + N+     N
Subjt:  DDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQASEVSTLRDSVDPNVEVNVNN

Query:  APKVDAKIDT--SAKNKEPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCKHSKALNAYEFERHAGCKTKHPNNH
         P    ++DT    K K+ +T KK S N+FPSNVKSLLSTG+ DGV VKY SWSRE+NLKG+IKGTGYLC C NCK +K LNAYEFE+HA CKTKHPNNH
Subjt:  APKVDAKIDT--SAKNKEPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCKHSKALNAYEFERHAGCKTKHPNNH

Query:  IYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWK
        IYFENGKTIY VVQELKNTPQE LFDAIQNVTGS IN KNF  WK
Subjt:  IYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWK

AT5G13660.2 unknown protein3.9e-9040.66Show/hide
Query:  EMNYDSSSRIETKRS-HQWFMDGSAPELFSSKKQAIETVNTRPVQGVPHMNVSPWENTSSFQSVPGPFTDRLFGSEPIRTVNLVDRGITVGNANMAMGRK
        E+ Y  SSR+E KRS HQW  + S+ ELFS+K+Q +  ++        HMN+SPW+ +     VP  FTD LF    I   + + R           GR 
Subjt:  EMNYDSSSRIETKRS-HQWFMDGSAPELFSSKKQAIETVNTRPVQGVPHMNVSPWENTSSFQSVPGPFTDRLFGSEPIRTVNLVDRGITVGNANMAMGRK

Query:  EFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHEN-AISLGQTYNSRDENTISVGPAYHK
          E       S GL ++       S  +   I KV           +   +   Y +G + +     +F+   E+  +S GQT+++ D + I  G    K
Subjt:  EFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHEN-AISLGQTYNSRDENTISVGPAYHK

Query:  TDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYSKAGDTFISMASSYNKGN
        TD NFI     F+      + IG  + KGD ++LS   P +KG + F+SMGQS +KA+ NI S  +SYNKG ENF+ +          F++  S+Y+  N
Subjt:  TDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYSKAGDTFISMASSYNKGN

Query:  DDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQASEVSTLRDSVDPNVEVNVNN
         + LS G +      +M  +    ++A  G  +         S T+SFG    E A  + S  + ++Y+    N +   A     L    + N+     N
Subjt:  DDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQASEVSTLRDSVDPNVEVNVNN

Query:  APKVDAKIDT--SAKNKEPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSRE-KNLKGIIKGTGYLCSCDNCKHSKALNAYEFERHAGCKTKHPNN
         P    ++DT    K K+ +T KK S N+FPSNVKSLLSTG+ DGV VKY SWSRE +NLKG+IKGTGYLC C NCK +K LNAYEFE+HA CKTKHPNN
Subjt:  APKVDAKIDT--SAKNKEPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSRE-KNLKGIIKGTGYLCSCDNCKHSKALNAYEFERHAGCKTKHPNN

Query:  HIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWK
        HIYFENGKTIY VVQELKNTPQE LFDAIQNVTGS IN KNF  WK
Subjt:  HIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWK

AT5G59830.1 unknown protein5.1e-6632.62Show/hide
Query:  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRSHQWFMDGSAPELFSSKKQAIETVNTRPVQGVPHMNVS--PWENTSSFQSVPGPFTDRLFGSE-PIR
        S++ K FW+ ++    ++ +  YD S+R ++KR H WF+D S  E+F +KKQA++     PV G+   NV    WE++S FQSV   F DRL G+E P R
Subjt:  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRSHQWFMDGSAPELFSSKKQAIETVNTRPVQGVPHMNVS--PWENTSSFQSVPGPFTDRLFGSE-PIR

Query:  TVNLVDRGITVGNANMAMGRKEFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHENAISL
         +   DR  T G ++     K     +  + SV LS+S  +E +  C    G RK+ V++V++      A  GH+  + ++ +I      S+ +E++   
Subjt:  TVNLVDRGITVGNANMAMGRKEFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHENAISL

Query:  GQTYNSRDENTISVGPAYHKTDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMGP
                               NF   GH +   D   IT G                                             N  H     +G 
Subjt:  GQTYNSRDENTISVGPAYHKTDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMGP

Query:  TYSKAGDTFISMASSYNKGNDDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQA
        T +  G                      NY     D +           G L +                +D E  +   S G++S   +          
Subjt:  TYSKAGDTFISMASSYNKGNDDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQA

Query:  SEVSTLRDSVDPNVEVNVNNAPKVDAKIDTSAKNKEPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCKHSKALN
                      + ++ + PK  A         E +++KK +  SFPSNV+SL+STGMLDGVPVKYVS SRE+ L+G+IKG+GYLC C  C  +K LN
Subjt:  SEVSTLRDSVDPNVEVNVNNAPKVDAKIDTSAKNKEPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCKHSKALN

Query:  AYEFERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWKD
        AY FERHAGCKTKHPNNHIYFENGKTIY +VQEL+NTP+ +LFD IQ V GSPINQK FRIWK+
Subjt:  AYEFERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWKD

AT5G59830.2 unknown protein5.1e-6632.62Show/hide
Query:  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRSHQWFMDGSAPELFSSKKQAIETVNTRPVQGVPHMNVS--PWENTSSFQSVPGPFTDRLFGSE-PIR
        S++ K FW+ ++    ++ +  YD S+R ++KR H WF+D S  E+F +KKQA++     PV G+   NV    WE++S FQSV   F DRL G+E P R
Subjt:  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRSHQWFMDGSAPELFSSKKQAIETVNTRPVQGVPHMNVS--PWENTSSFQSVPGPFTDRLFGSE-PIR

Query:  TVNLVDRGITVGNANMAMGRKEFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHENAISL
         +   DR  T G ++     K     +  + SV LS+S  +E +  C    G RK+ V++V++      A  GH+  + ++ +I      S+ +E++   
Subjt:  TVNLVDRGITVGNANMAMGRKEFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFSKNHENAISL

Query:  GQTYNSRDENTISVGPAYHKTDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMGP
                               NF   GH +   D   IT G                                             N  H     +G 
Subjt:  GQTYNSRDENTISVGPAYHKTDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMGP

Query:  TYSKAGDTFISMASSYNKGNDDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQA
        T +  G                      NY     D +           G L +                +D E  +   S G++S   +          
Subjt:  TYSKAGDTFISMASSYNKGNDDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQA

Query:  SEVSTLRDSVDPNVEVNVNNAPKVDAKIDTSAKNKEPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCKHSKALN
                      + ++ + PK  A         E +++KK +  SFPSNV+SL+STGMLDGVPVKYVS SRE+ L+G+IKG+GYLC C  C  +K LN
Subjt:  SEVSTLRDSVDPNVEVNVNNAPKVDAKIDTSAKNKEPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCKHSKALN

Query:  AYEFERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWKD
        AY FERHAGCKTKHPNNHIYFENGKTIY +VQEL+NTP+ +LFD IQ V GSPINQK FRIWK+
Subjt:  AYEFERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGAGCTCGAATTCAATACATTACCATATACAGAGCATTCGCATATGGTATATTTTTTCAACGGCCCGATTTTCCGTATTGATTTCACCGAAACTATACGTAAGCC
TTTGCGGAATGAAGGTGGGCCCACCGATCTTGCTGTACACGTGTACGGTAAGATTGGAGGTAGAAAATTGAGGGTGGTAGAGCCGCGTCGCAGCGGATCATTCGCCATAG
TAAAAATACCAAAGCTCAGTTTCTGTCGCTATCCATGTCGTCTACATCTGGCACTGGCTGTGGAGTTGGGTTCGAGTGTGTGTAGTGTGGAAGGAGCTGACCTGATTGCG
CGGAGGTTGAGTTTCAGAGCTTGGAGAAGTAACTTGAAGTTTTTGGGATTTTGGTTGGGGGTTGTCAGCGGAAAGAATGTGTTTGCTTTTTGGCAGTCTTTCCAGCATAA
AAGCTTTTGGATACCTAGGGATGCTGGATGTTTAACTGATGGAGAGATGAATTATGATAGCTCCTCCAGAATTGAAACAAAGCGTAGCCATCAATGGTTTATGGATGGCA
GTGCGCCAGAACTATTCAGTAGCAAGAAGCAAGCAATAGAAACTGTAAATACTAGACCTGTCCAAGGAGTTCCACACATGAATGTTTCTCCTTGGGAAAACACATCAAGT
TTTCAGTCAGTTCCTGGGCCCTTCACTGATCGGCTCTTTGGCTCGGAGCCTATACGAACTGTCAACTTGGTTGATAGAGGCATCACTGTTGGAAATGCAAATATGGCCAT
GGGAAGAAAGGAGTTCGAGAATCATTTTGCAAACAACCCATCAGTTGGCTTGTCCATGTCCCAATCTATCGAAGATTCCTCATCATGTCTCAGTTTTGGTGGAATTAGAA
AAGTTAAAGTCAATCAAGTGAGGGATCCTGACATTGGCATTTCTGCCTCTTTGGGGCATGCATATAGCAGGGGTGATAATGGCACAATTTCAATGGGTACAACATTTAGT
AAAAACCATGAGAACGCCATATCATTGGGCCAAACTTATAATAGCAGAGATGAGAATACCATCTCAGTTGGCCCTGCATATCATAAGACGGATGACAATTTTATTTCAAT
GGGTCACGCTTTCAGCAAGGGTGATGGAAATTTTATTACAATTGGTCATAACTTTAGTAAGGGAGACAGTAGCATCTTATCCATGAGTCAGCCTTTTGACAAGGGGGATG
ACCCTTTTATTTCAATGGGTCAGTCTTATGAGAAGGCAGAAGGCAATATCATTTCTTTTGGTGCCTCCTACAATAAGGGGCATGAAAATTTTATCTCGATGGGTCCAACC
TATAGTAAGGCAGGTGATACTTTCATTTCAATGGCTTCCTCCTATAACAAGGGAAATGATGATACCTTATCAATGGGTCCAAACTATGACAAGGTGGACTCTGACATGGT
ACATGTGGGTCCTAAATATGACAAAGCAGATTCTGGTGCTTTGTCTATGGCTCATAACTATCATAAGGTTGAGAGTAATACCATATCTTTTGGAGGTTTCGATGACGAAA
ATGCAACAGATAATCCTTCTGGTGGGATCATTAGCAGTTACGATTTGTTGATGGCTAATCAGGCTTCTGCCCAAGCATCAGAAGTATCAACCCTGAGAGATTCAGTAGAT
CCCAATGTGGAAGTAAACGTTAACAATGCTCCAAAAGTTGATGCTAAAATCGATACAAGTGCCAAGAATAAAGAGCCGAGGACGACTAAGAAGGTGTCCCCAAATAGCTT
CCCGTCAAATGTTAAAAGCTTGCTCTCAACTGGTATGCTTGATGGGGTTCCTGTGAAGTATGTTTCTTGGTCGCGGGAGAAAAATCTCAAGGGAATTATAAAAGGGACTG
GATATTTGTGCAGCTGTGACAACTGCAAGCATTCTAAGGCTCTCAATGCTTATGAATTTGAAAGGCATGCTGGCTGCAAAACCAAACATCCAAATAATCACATTTATTTT
GAGAACGGTAAAACTATCTATGCTGTTGTTCAAGAGCTAAAGAACACTCCTCAGGAGATGCTATTTGATGCAATTCAGAATGTGACTGGCTCTCCCATTAATCAGAAGAA
TTTTCGTATTTGGAAAGACCTCAAAGATATATACCCACCAGGGTATTTAATCACAGGATGGATTTGTAATCGAAACATTGCTAAAGTACAACCCAGTGATAATGGCTACT
GCTCCAATAATCCTGCAGAAAAAGCCGTTGAAAGAGATTTCTTCTTACCTGCCCAAGAACATTATCTCAGACAAGATAAAGGAGCTCATGATTGCAACAAGAATCATGCT
CAGAGGATTGAAAGCAGTGACAAAAACAGAAAAAAAATTGAACCAAATTCTCCACCAAATGAAGCATCAACCACTGAAGTTGCTTTTACTTTAGGAATGAGAGCGAGGAA
ATACGAATCATTCTTCCTTGTTCTCTCATTGTTGAATAAACCACAGCCAGAAGCTGACTGTCAAAGTGCAAACACCAAGCAGCAGGGTTCCCTCTGTCCATCACCAAAGC
CACCACAGAGCCTCCAATGGTGCCCACCAAGCATATCAAAGCCGTCAGAGACAGCTCCGCCGGAAGTTGAAGAAGCAGAGGGCTGGTTGGGACTTGTCCATGGCAGATTC
AACATGGGCCCTCTTATGAAGGTCATAACCATGGCTCCTCCTACTGTCACTATGTACCTGCAAGCCCAAGCCATGAGGAAAGCAAAAGCTGGTAGAATATTGCACATGGC
TGCTGCAAAAGTTGCTGTTGTGAGCTTCATACCAGTAAAACCACAAAAACATGCTGGCTCATCCCCTGGTTCAGAGCTGACTTCGCAAGAATCGCCATTCCAGCATACCC
AAACTGTACAAACACCACACCCACATACGGCTTCGCCACACAAAACAGTTGCACAAAACGCTCCATTTCTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTC
TTATCTCTTGTGAGTTATAGCTTCAGCATGACCAGACCCAGAGGGTTTTATAGAGGTGGGGTGGGGTGGGGGTGTTTTCATGGCGGAAATGATCTGCAGAAACAAGCAAA
GTCTAGGGACTCCACTAAGCAAGAGAGAGACAGAGAGAGAGAGAGAGCGGAAAATGGGAAGCTGGGGAATAAAAGAAGGGGATTCGTTAAAGGATCAACACACACACAAA
CACAAACAGAAGAGAAGATGGTTTCCACATGTATAAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTGGAGCTCGAATTCAATACATTACCATATACAGAGCATTCGCATATGGTATATTTTTTCAACGGCCCGATTTTCCGTATTGATTTCACCGAAACTATACGTAAGCC
TTTGCGGAATGAAGGTGGGCCCACCGATCTTGCTGTACACGTGTACGGTAAGATTGGAGGTAGAAAATTGAGGGTGGTAGAGCCGCGTCGCAGCGGATCATTCGCCATAG
TAAAAATACCAAAGCTCAGTTTCTGTCGCTATCCATGTCGTCTACATCTGGCACTGGCTGTGGAGTTGGGTTCGAGTGTGTGTAGTGTGGAAGGAGCTGACCTGATTGCG
CGGAGGTTGAGTTTCAGAGCTTGGAGAAGTAACTTGAAGTTTTTGGGATTTTGGTTGGGGGTTGTCAGCGGAAAGAATGTGTTTGCTTTTTGGCAGTCTTTCCAGCATAA
AAGCTTTTGGATACCTAGGGATGCTGGATGTTTAACTGATGGAGAGATGAATTATGATAGCTCCTCCAGAATTGAAACAAAGCGTAGCCATCAATGGTTTATGGATGGCA
GTGCGCCAGAACTATTCAGTAGCAAGAAGCAAGCAATAGAAACTGTAAATACTAGACCTGTCCAAGGAGTTCCACACATGAATGTTTCTCCTTGGGAAAACACATCAAGT
TTTCAGTCAGTTCCTGGGCCCTTCACTGATCGGCTCTTTGGCTCGGAGCCTATACGAACTGTCAACTTGGTTGATAGAGGCATCACTGTTGGAAATGCAAATATGGCCAT
GGGAAGAAAGGAGTTCGAGAATCATTTTGCAAACAACCCATCAGTTGGCTTGTCCATGTCCCAATCTATCGAAGATTCCTCATCATGTCTCAGTTTTGGTGGAATTAGAA
AAGTTAAAGTCAATCAAGTGAGGGATCCTGACATTGGCATTTCTGCCTCTTTGGGGCATGCATATAGCAGGGGTGATAATGGCACAATTTCAATGGGTACAACATTTAGT
AAAAACCATGAGAACGCCATATCATTGGGCCAAACTTATAATAGCAGAGATGAGAATACCATCTCAGTTGGCCCTGCATATCATAAGACGGATGACAATTTTATTTCAAT
GGGTCACGCTTTCAGCAAGGGTGATGGAAATTTTATTACAATTGGTCATAACTTTAGTAAGGGAGACAGTAGCATCTTATCCATGAGTCAGCCTTTTGACAAGGGGGATG
ACCCTTTTATTTCAATGGGTCAGTCTTATGAGAAGGCAGAAGGCAATATCATTTCTTTTGGTGCCTCCTACAATAAGGGGCATGAAAATTTTATCTCGATGGGTCCAACC
TATAGTAAGGCAGGTGATACTTTCATTTCAATGGCTTCCTCCTATAACAAGGGAAATGATGATACCTTATCAATGGGTCCAAACTATGACAAGGTGGACTCTGACATGGT
ACATGTGGGTCCTAAATATGACAAAGCAGATTCTGGTGCTTTGTCTATGGCTCATAACTATCATAAGGTTGAGAGTAATACCATATCTTTTGGAGGTTTCGATGACGAAA
ATGCAACAGATAATCCTTCTGGTGGGATCATTAGCAGTTACGATTTGTTGATGGCTAATCAGGCTTCTGCCCAAGCATCAGAAGTATCAACCCTGAGAGATTCAGTAGAT
CCCAATGTGGAAGTAAACGTTAACAATGCTCCAAAAGTTGATGCTAAAATCGATACAAGTGCCAAGAATAAAGAGCCGAGGACGACTAAGAAGGTGTCCCCAAATAGCTT
CCCGTCAAATGTTAAAAGCTTGCTCTCAACTGGTATGCTTGATGGGGTTCCTGTGAAGTATGTTTCTTGGTCGCGGGAGAAAAATCTCAAGGGAATTATAAAAGGGACTG
GATATTTGTGCAGCTGTGACAACTGCAAGCATTCTAAGGCTCTCAATGCTTATGAATTTGAAAGGCATGCTGGCTGCAAAACCAAACATCCAAATAATCACATTTATTTT
GAGAACGGTAAAACTATCTATGCTGTTGTTCAAGAGCTAAAGAACACTCCTCAGGAGATGCTATTTGATGCAATTCAGAATGTGACTGGCTCTCCCATTAATCAGAAGAA
TTTTCGTATTTGGAAAGACCTCAAAGATATATACCCACCAGGGTATTTAATCACAGGATGGATTTGTAATCGAAACATTGCTAAAGTACAACCCAGTGATAATGGCTACT
GCTCCAATAATCCTGCAGAAAAAGCCGTTGAAAGAGATTTCTTCTTACCTGCCCAAGAACATTATCTCAGACAAGATAAAGGAGCTCATGATTGCAACAAGAATCATGCT
CAGAGGATTGAAAGCAGTGACAAAAACAGAAAAAAAATTGAACCAAATTCTCCACCAAATGAAGCATCAACCACTGAAGTTGCTTTTACTTTAGGAATGAGAGCGAGGAA
ATACGAATCATTCTTCCTTGTTCTCTCATTGTTGAATAAACCACAGCCAGAAGCTGACTGTCAAAGTGCAAACACCAAGCAGCAGGGTTCCCTCTGTCCATCACCAAAGC
CACCACAGAGCCTCCAATGGTGCCCACCAAGCATATCAAAGCCGTCAGAGACAGCTCCGCCGGAAGTTGAAGAAGCAGAGGGCTGGTTGGGACTTGTCCATGGCAGATTC
AACATGGGCCCTCTTATGAAGGTCATAACCATGGCTCCTCCTACTGTCACTATGTACCTGCAAGCCCAAGCCATGAGGAAAGCAAAAGCTGGTAGAATATTGCACATGGC
TGCTGCAAAAGTTGCTGTTGTGAGCTTCATACCAGTAAAACCACAAAAACATGCTGGCTCATCCCCTGGTTCAGAGCTGACTTCGCAAGAATCGCCATTCCAGCATACCC
AAACTGTACAAACACCACACCCACATACGGCTTCGCCACACAAAACAGTTGCACAAAACGCTCCATTTCTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTC
TTATCTCTTGTGAGTTATAGCTTCAGCATGACCAGACCCAGAGGGTTTTATAGAGGTGGGGTGGGGTGGGGGTGTTTTCATGGCGGAAATGATCTGCAGAAACAAGCAAA
GTCTAGGGACTCCACTAAGCAAGAGAGAGACAGAGAGAGAGAGAGAGCGGAAAATGGGAAGCTGGGGAATAAAAGAAGGGGATTCGTTAAAGGATCAACACACACACAAA
CACAAACAGAAGAGAAGATGGTTTCCACATGTATAAAGTAG
Protein sequenceShow/hide protein sequence
MVELEFNTLPYTEHSHMVYFFNGPIFRIDFTETIRKPLRNEGGPTDLAVHVYGKIGGRKLRVVEPRRSGSFAIVKIPKLSFCRYPCRLHLALAVELGSSVCSVEGADLIA
RRLSFRAWRSNLKFLGFWLGVVSGKNVFAFWQSFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRSHQWFMDGSAPELFSSKKQAIETVNTRPVQGVPHMNVSPWENTSS
FQSVPGPFTDRLFGSEPIRTVNLVDRGITVGNANMAMGRKEFENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGISASLGHAYSRGDNGTISMGTTFS
KNHENAISLGQTYNSRDENTISVGPAYHKTDDNFISMGHAFSKGDGNFITIGHNFSKGDSSILSMSQPFDKGDDPFISMGQSYEKAEGNIISFGASYNKGHENFISMGPT
YSKAGDTFISMASSYNKGNDDTLSMGPNYDKVDSDMVHVGPKYDKADSGALSMAHNYHKVESNTISFGGFDDENATDNPSGGIISSYDLLMANQASAQASEVSTLRDSVD
PNVEVNVNNAPKVDAKIDTSAKNKEPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCDNCKHSKALNAYEFERHAGCKTKHPNNHIYF
ENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWKDLKDIYPPGYLITGWICNRNIAKVQPSDNGYCSNNPAEKAVERDFFLPAQEHYLRQDKGAHDCNKNHA
QRIESSDKNRKKIEPNSPPNEASTTEVAFTLGMRARKYESFFLVLSLLNKPQPEADCQSANTKQQGSLCPSPKPPQSLQWCPPSISKPSETAPPEVEEAEGWLGLVHGRF
NMGPLMKVITMAPPTVTMYLQAQAMRKAKAGRILHMAAAKVAVVSFIPVKPQKHAGSSPGSELTSQESPFQHTQTVQTPHPHTASPHKTVAQNAPFLLSLSLSLSLSLSL
LSLVSYSFSMTRPRGFYRGGVGWGCFHGGNDLQKQAKSRDSTKQERDRERERAENGKLGNKRRGFVKGSTHTQTQTEEKMVSTCIK