; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr005023 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr005023
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function, DUF547
Genome locationtig00003509:73001..94314
RNA-Seq ExpressionSgr005023
SyntenySgr005023
Gene Ontology termsNA
InterPro domainsIPR006869 - Domain of unknown function DUF547
IPR025757 - Ternary complex factor MIP1, leucine-zipper


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008457798.1 PREDICTED: uncharacterized protein LOC103497400 isoform X2 [Cucumis melo]8.0e-28685.52Show/hide
Query:  MLGVGVTTGHRRSKSSQDKKAVRDEQLDNSLQLMNSVKMDMDKVKEVENKKNPSPKMDVNNSLKQEIIQLEKRLQDQFKLRSALEKALGHGVFPCNKSDK
        MLGV V TG+ RS+SS DKKAV+D++L+N L+  N+VK+DMD+VKEVENKKN S K+ V++SLKQEIIQLEKRLQDQFKLRSALEK LGHGVFPCN+SDK
Subjt:  MLGVGVTTGHRRSKSSQDKKAVRDEQLDNSLQLMNSVKMDMDKVKEVENKKNPSPKMDVNNSLKQEIIQLEKRLQDQFKLRSALEKALGHGVFPCNKSDK

Query:  ISMPKSAIELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGWSMEVPLPDIAPKDEGSAAPSDCQSLENPRKDCSDIGRDE
        ISMPKSA+ELIKEIA LE+EVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEK+KLPSTP G  ME PLPDIAPK   SA PS C SL+NPR+D SDIGRDE
Subjt:  ISMPKSAIELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGWSMEVPLPDIAPKDEGSAAPSDCQSLENPRKDCSDIGRDE

Query:  KLLVSNFHRSQSSL-TANAASLNKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISAIYCKLGEPSLT
        KLLV ++ RSQSSL T NA SL KVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDH+PETPNRLSEDM+KCIS IY KL EPS  
Subjt:  KLLVSNFHRSQSSL-TANAASLNKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISAIYCKLGEPSLT

Query:  NHGLSSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDAQKLSDVESLLENFRLLISRLEEVDLRKLKYE
        N GLSSP SSLSSVSA SPGEQ AMCSPG RNNSSFDV LDNPFLVEGLK+FSGPYSTM+EISWI GD QKL  V+SLLENFRLLISRLEEVDLR L YE
Subjt:  NHGLSSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDAQKLSDVESLLENFRLLISRLEEVDLRKLKYE

Query:  EKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALC
        EKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIG HTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFK GDERQ YI+DRPEPLLHFALC
Subjt:  EKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALC

Query:  SGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRKILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSVKWSQLGNPRKNVEWIPPSYTFRC
        SGSHSDPAVRVYTPKRVFQELE++KDEYIRATFGVR D+KILLPKIIESF KDSGLCS GLMEMILKSLPESLRKSVK S LGNPRK VEWIPP+YTFR 
Subjt:  SGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRKILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSVKWSQLGNPRKNVEWIPPSYTFRC

Query:  L
        L
Subjt:  L

XP_022158699.1 uncharacterized protein LOC111025163 isoform X1 [Momordica charantia]2.5e-29587.46Show/hide
Query:  MLGVGVTTGHRRSK-----------SSQDKKAVRDEQLDNSLQLMNSVKMDMDKVKEVENKKNPSPKMDVNNSLKQEIIQLEKRLQDQFKLRSALEKALG
        MLGV V TGHRRSK           SS DKKAV+D++L NSL+ +N V+MDMD+VKEVEN KN S K++V+NSLKQEIIQLEKRLQDQFKLRSALEKALG
Subjt:  MLGVGVTTGHRRSK-----------SSQDKKAVRDEQLDNSLQLMNSVKMDMDKVKEVENKKNPSPKMDVNNSLKQEIIQLEKRLQDQFKLRSALEKALG

Query:  HGVFPCNKSDKISMPKSAIELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGWSMEVPLPDIAPKDEGSAAPSDCQSLENP
        HG+FPC+KSDK+SMPKSAIELI EIATLELEVVHLEQYLLSLYR+AFDGQSSSVSPSA DEKSKLPSTPRG SME PLPDIA KDE SAAPS CQSLEN 
Subjt:  HGVFPCNKSDKISMPKSAIELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGWSMEVPLPDIAPKDEGSAAPSDCQSLENP

Query:  RKDCSDIGRDEKLLVSNFHRSQSSL-TANAASLNKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISA
        RK+CS+IGRDEKL+VSNFHRS+SSL T NAAS NK+STSVESLDRTL ACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDM+KCIS+
Subjt:  RKDCSDIGRDEKLLVSNFHRSQSSL-TANAASLNKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISA

Query:  IYCKLGEPSLTNHGLSSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDAQKLSDVESLLENFRLLISRL
        I+CKL EPSLTNHGLSSPTSSLSSVSA SPGEQCAMCSPG RNNSSFDVRLDNPFLVEGLKDFSGPYSTM+EISWI GD QKL +V+SLLENFRLLISRL
Subjt:  IYCKLGEPSLTNHGLSSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDAQKLSDVESLLENFRLLISRL

Query:  EEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIID
        EEVDL KLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIG HTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDER+AYII+
Subjt:  EEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIID

Query:  RPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRK-ILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSVKWS-QLGNPRK
        RPEPLLHFALCSGSHSDPAVRVYT KRVFQELESAKDEYIRATFGV  D+K ILLPK IESFAKDSGLCSSGLMEMIL SLPESLRKSVK S QLGNPRK
Subjt:  RPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRK-ILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSVKWS-QLGNPRK

Query:  NVEWIPPSYTFRCL
        NVEWIPPSYTFR L
Subjt:  NVEWIPPSYTFRCL

XP_022158700.1 uncharacterized protein LOC111025163 isoform X2 [Momordica charantia]6.9e-29889.05Show/hide
Query:  MLGVGVTTGHRRSKSSQDKKAVRDEQLDNSLQLMNSVKMDMDKVKEVENKKNPSPKMDVNNSLKQEIIQLEKRLQDQFKLRSALEKALGHGVFPCNKSDK
        MLGV V TGHRRSKSS DKKAV+D++L NSL+ +N V+MDMD+VKEVEN KN S K++V+NSLKQEIIQLEKRLQDQFKLRSALEKALGHG+FPC+KSDK
Subjt:  MLGVGVTTGHRRSKSSQDKKAVRDEQLDNSLQLMNSVKMDMDKVKEVENKKNPSPKMDVNNSLKQEIIQLEKRLQDQFKLRSALEKALGHGVFPCNKSDK

Query:  ISMPKSAIELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGWSMEVPLPDIAPKDEGSAAPSDCQSLENPRKDCSDIGRDE
        +SMPKSAIELI EIATLELEVVHLEQYLLSLYR+AFDGQSSSVSPSA DEKSKLPSTPRG SME PLPDIA KDE SAAPS CQSLEN RK+CS+IGRDE
Subjt:  ISMPKSAIELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGWSMEVPLPDIAPKDEGSAAPSDCQSLENPRKDCSDIGRDE

Query:  KLLVSNFHRSQSSL-TANAASLNKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISAIYCKLGEPSLT
        KL+VSNFHRS+SSL T NAAS NK+STSVESLDRTL ACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDM+KCIS+I+CKL EPSLT
Subjt:  KLLVSNFHRSQSSL-TANAASLNKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISAIYCKLGEPSLT

Query:  NHGLSSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDAQKLSDVESLLENFRLLISRLEEVDLRKLKYE
        NHGLSSPTSSLSSVSA SPGEQCAMCSPG RNNSSFDVRLDNPFLVEGLKDFSGPYSTM+EISWI GD QKL +V+SLLENFRLLISRLEEVDL KLKYE
Subjt:  NHGLSSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDAQKLSDVESLLENFRLLISRLEEVDLRKLKYE

Query:  EKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALC
        EKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIG HTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDER+AYII+RPEPLLHFALC
Subjt:  EKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALC

Query:  SGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRK-ILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSVKWS-QLGNPRKNVEWIPPSYTF
        SGSHSDPAVRVYT KRVFQELESAKDEYIRATFGV  D+K ILLPK IESFAKDSGLCSSGLMEMIL SLPESLRKSVK S QLGNPRKNVEWIPPSYTF
Subjt:  SGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRK-ILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSVKWS-QLGNPRKNVEWIPPSYTF

Query:  RCL
        R L
Subjt:  RCL

XP_038900882.1 uncharacterized protein LOC120087938 isoform X1 [Benincasa hispida]2.6e-29286.44Show/hide
Query:  MLGVGVTTGHRRSK-----------SSQDKKAVRDEQLDNSLQLMNSVKMDMDKVKEVENKKNPSPKMDVNNSLKQEIIQLEKRLQDQFKLRSALEKALG
        MLGV V TGH RS+           SS DKK +RD++LDN L+  N+VKMDMD VKEVENKKN SPK++V+NSLKQEIIQLEKRLQDQFKLRSALEK LG
Subjt:  MLGVGVTTGHRRSK-----------SSQDKKAVRDEQLDNSLQLMNSVKMDMDKVKEVENKKNPSPKMDVNNSLKQEIIQLEKRLQDQFKLRSALEKALG

Query:  HGVFPCNKSDKISMPKSAIELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGWSMEVPLPDIAPKDEGSAAPSDCQSLENP
        HGVF CN+SDK+SMPKSA+ELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEK KLP TPRG +ME P PDIA K   SA PS CQSLENP
Subjt:  HGVFPCNKSDKISMPKSAIELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGWSMEVPLPDIAPKDEGSAAPSDCQSLENP

Query:  RKDCSDIGRDEKLLVSNFHRSQSSL-TANAASLNKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISA
        RK+ SDIGRDEKLLVSN+HRSQSSL T NAASL+K+STSVESLDRTLR CHSQPVSMMEYAQNVS NIISLAEHLGTRISDH+PETPNRLSEDM+KCIS 
Subjt:  RKDCSDIGRDEKLLVSNFHRSQSSL-TANAASLNKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISA

Query:  IYCKLGEPSLTNHGLSSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDAQKLSDVESLLENFRLLISRL
        IY KL EPSL NHGLSSPTSSLSSVSA SPGEQCAMCSPG RNNSSFDVRLDNPFLVEGLK+FSGPYSTM+EISWI  D QKL  V+SLLENFRLLISRL
Subjt:  IYCKLGEPSLTNHGLSSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDAQKLSDVESLLENFRLLISRL

Query:  EEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIID
        EEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIG HTISVDTIQS ILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIID
Subjt:  EEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIID

Query:  RPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRKILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSVKWSQLGNPRKNV
        RPEPLLHFALC GSHSDPAVRVYTPKRVFQELE+AKDEYIRATFGV  D+KILLPKIIESFAKD+GLCSSGLMEMILKSLPESLRKSVK SQLGNPRKNV
Subjt:  RPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRKILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSVKWSQLGNPRKNV

Query:  EWIPPSYTFRCL
        EWIP SYTFR L
Subjt:  EWIPPSYTFRCL

XP_038900883.1 uncharacterized protein LOC120087938 isoform X2 [Benincasa hispida]7.2e-29588.02Show/hide
Query:  MLGVGVTTGHRRSKSSQDKKAVRDEQLDNSLQLMNSVKMDMDKVKEVENKKNPSPKMDVNNSLKQEIIQLEKRLQDQFKLRSALEKALGHGVFPCNKSDK
        MLGV V TGH RS+SS DKK +RD++LDN L+  N+VKMDMD VKEVENKKN SPK++V+NSLKQEIIQLEKRLQDQFKLRSALEK LGHGVF CN+SDK
Subjt:  MLGVGVTTGHRRSKSSQDKKAVRDEQLDNSLQLMNSVKMDMDKVKEVENKKNPSPKMDVNNSLKQEIIQLEKRLQDQFKLRSALEKALGHGVFPCNKSDK

Query:  ISMPKSAIELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGWSMEVPLPDIAPKDEGSAAPSDCQSLENPRKDCSDIGRDE
        +SMPKSA+ELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEK KLP TPRG +ME P PDIA K   SA PS CQSLENPRK+ SDIGRDE
Subjt:  ISMPKSAIELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGWSMEVPLPDIAPKDEGSAAPSDCQSLENPRKDCSDIGRDE

Query:  KLLVSNFHRSQSSL-TANAASLNKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISAIYCKLGEPSLT
        KLLVSN+HRSQSSL T NAASL+K+STSVESLDRTLR CHSQPVSMMEYAQNVS NIISLAEHLGTRISDH+PETPNRLSEDM+KCIS IY KL EPSL 
Subjt:  KLLVSNFHRSQSSL-TANAASLNKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISAIYCKLGEPSLT

Query:  NHGLSSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDAQKLSDVESLLENFRLLISRLEEVDLRKLKYE
        NHGLSSPTSSLSSVSA SPGEQCAMCSPG RNNSSFDVRLDNPFLVEGLK+FSGPYSTM+EISWI  D QKL  V+SLLENFRLLISRLEEVDLRKLKYE
Subjt:  NHGLSSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDAQKLSDVESLLENFRLLISRLEEVDLRKLKYE

Query:  EKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALC
        EKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIG HTISVDTIQS ILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALC
Subjt:  EKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALC

Query:  SGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRKILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSVKWSQLGNPRKNVEWIPPSYTFRC
         GSHSDPAVRVYTPKRVFQELE+AKDEYIRATFGV  D+KILLPKIIESFAKD+GLCSSGLMEMILKSLPESLRKSVK SQLGNPRKNVEWIP SYTFR 
Subjt:  SGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRKILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSVKWSQLGNPRKNVEWIPPSYTFRC

Query:  L
        L
Subjt:  L

TrEMBL top hitse value%identityAlignment
A0A1S3C5X2 uncharacterized protein LOC103497400 isoform X11.4e-28383.99Show/hide
Query:  MLGVGVTTGHRRSK-----------SSQDKKAVRDEQLDNSLQLMNSVKMDMDKVKEVENKKNPSPKMDVNNSLKQEIIQLEKRLQDQFKLRSALEKALG
        MLGV V TG+ RS+           SS DKKAV+D++L+N L+  N+VK+DMD+VKEVENKKN S K+ V++SLKQEIIQLEKRLQDQFKLRSALEK LG
Subjt:  MLGVGVTTGHRRSK-----------SSQDKKAVRDEQLDNSLQLMNSVKMDMDKVKEVENKKNPSPKMDVNNSLKQEIIQLEKRLQDQFKLRSALEKALG

Query:  HGVFPCNKSDKISMPKSAIELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGWSMEVPLPDIAPKDEGSAAPSDCQSLENP
        HGVFPCN+SDKISMPKSA+ELIKEIA LE+EVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEK+KLPSTP G  ME PLPDIAPK   SA PS C SL+NP
Subjt:  HGVFPCNKSDKISMPKSAIELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGWSMEVPLPDIAPKDEGSAAPSDCQSLENP

Query:  RKDCSDIGRDEKLLVSNFHRSQSSL-TANAASLNKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISA
        R+D SDIGRDEKLLV ++ RSQSSL T NA SL KVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDH+PETPNRLSEDM+KCIS 
Subjt:  RKDCSDIGRDEKLLVSNFHRSQSSL-TANAASLNKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISA

Query:  IYCKLGEPSLTNHGLSSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDAQKLSDVESLLENFRLLISRL
        IY KL EPS  N GLSSP SSLSSVSA SPGEQ AMCSPG RNNSSFDV LDNPFLVEGLK+FSGPYSTM+EISWI GD QKL  V+SLLENFRLLISRL
Subjt:  IYCKLGEPSLTNHGLSSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDAQKLSDVESLLENFRLLISRL

Query:  EEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIID
        EEVDLR L YEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIG HTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFK GDERQ YI+D
Subjt:  EEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIID

Query:  RPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRKILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSVKWSQLGNPRKNV
        RPEPLLHFALCSGSHSDPAVRVYTPKRVFQELE++KDEYIRATFGVR D+KILLPKIIESF KDSGLCS GLMEMILKSLPESLRKSVK S LGNPRK V
Subjt:  RPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRKILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSVKWSQLGNPRKNV

Query:  EWIPPSYTFRCL
        EWIPP+YTFR L
Subjt:  EWIPPSYTFRCL

A0A1S3C6H4 uncharacterized protein LOC103497400 isoform X23.9e-28685.52Show/hide
Query:  MLGVGVTTGHRRSKSSQDKKAVRDEQLDNSLQLMNSVKMDMDKVKEVENKKNPSPKMDVNNSLKQEIIQLEKRLQDQFKLRSALEKALGHGVFPCNKSDK
        MLGV V TG+ RS+SS DKKAV+D++L+N L+  N+VK+DMD+VKEVENKKN S K+ V++SLKQEIIQLEKRLQDQFKLRSALEK LGHGVFPCN+SDK
Subjt:  MLGVGVTTGHRRSKSSQDKKAVRDEQLDNSLQLMNSVKMDMDKVKEVENKKNPSPKMDVNNSLKQEIIQLEKRLQDQFKLRSALEKALGHGVFPCNKSDK

Query:  ISMPKSAIELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGWSMEVPLPDIAPKDEGSAAPSDCQSLENPRKDCSDIGRDE
        ISMPKSA+ELIKEIA LE+EVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEK+KLPSTP G  ME PLPDIAPK   SA PS C SL+NPR+D SDIGRDE
Subjt:  ISMPKSAIELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGWSMEVPLPDIAPKDEGSAAPSDCQSLENPRKDCSDIGRDE

Query:  KLLVSNFHRSQSSL-TANAASLNKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISAIYCKLGEPSLT
        KLLV ++ RSQSSL T NA SL KVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDH+PETPNRLSEDM+KCIS IY KL EPS  
Subjt:  KLLVSNFHRSQSSL-TANAASLNKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISAIYCKLGEPSLT

Query:  NHGLSSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDAQKLSDVESLLENFRLLISRLEEVDLRKLKYE
        N GLSSP SSLSSVSA SPGEQ AMCSPG RNNSSFDV LDNPFLVEGLK+FSGPYSTM+EISWI GD QKL  V+SLLENFRLLISRLEEVDLR L YE
Subjt:  NHGLSSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDAQKLSDVESLLENFRLLISRLEEVDLRKLKYE

Query:  EKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALC
        EKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIG HTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFK GDERQ YI+DRPEPLLHFALC
Subjt:  EKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALC

Query:  SGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRKILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSVKWSQLGNPRKNVEWIPPSYTFRC
        SGSHSDPAVRVYTPKRVFQELE++KDEYIRATFGVR D+KILLPKIIESF KDSGLCS GLMEMILKSLPESLRKSVK S LGNPRK VEWIPP+YTFR 
Subjt:  SGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRKILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSVKWSQLGNPRKNVEWIPPSYTFRC

Query:  L
        L
Subjt:  L

A0A5D3BMC4 Uncharacterized protein1.4e-28383.99Show/hide
Query:  MLGVGVTTGHRRSK-----------SSQDKKAVRDEQLDNSLQLMNSVKMDMDKVKEVENKKNPSPKMDVNNSLKQEIIQLEKRLQDQFKLRSALEKALG
        MLGV V TG+ RS+           SS DKKAV+D++L+N L+  N+VK+DMD+VKEVENKKN S K+ V++SLKQEIIQLEKRLQDQFKLRSALEK LG
Subjt:  MLGVGVTTGHRRSK-----------SSQDKKAVRDEQLDNSLQLMNSVKMDMDKVKEVENKKNPSPKMDVNNSLKQEIIQLEKRLQDQFKLRSALEKALG

Query:  HGVFPCNKSDKISMPKSAIELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGWSMEVPLPDIAPKDEGSAAPSDCQSLENP
        HGVFPCN+SDKISMPKSA+ELIKEIA LE+EVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEK+KLPSTP G  ME PLPDIAPK   SA PS C SL+NP
Subjt:  HGVFPCNKSDKISMPKSAIELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGWSMEVPLPDIAPKDEGSAAPSDCQSLENP

Query:  RKDCSDIGRDEKLLVSNFHRSQSSL-TANAASLNKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISA
        R+D SDIGRDEKLLV ++ RSQSSL T NA SL KVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDH+PETPNRLSEDM+KCIS 
Subjt:  RKDCSDIGRDEKLLVSNFHRSQSSL-TANAASLNKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISA

Query:  IYCKLGEPSLTNHGLSSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDAQKLSDVESLLENFRLLISRL
        IY KL EPS  N GLSSP SSLSSVSA SPGEQ AMCSPG RNNSSFDV LDNPFLVEGLK+FSGPYSTM+EISWI GD QKL  V+SLLENFRLLISRL
Subjt:  IYCKLGEPSLTNHGLSSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDAQKLSDVESLLENFRLLISRL

Query:  EEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIID
        EEVDLR L YEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIG HTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFK GDERQ YI+D
Subjt:  EEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIID

Query:  RPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRKILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSVKWSQLGNPRKNV
        RPEPLLHFALCSGSHSDPAVRVYTPKRVFQELE++KDEYIRATFGVR D+KILLPKIIESF KDSGLCS GLMEMILKSLPESLRKSVK S LGNPRK V
Subjt:  RPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRKILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSVKWSQLGNPRKNV

Query:  EWIPPSYTFRCL
        EWIPP+YTFR L
Subjt:  EWIPPSYTFRCL

A0A6J1DWJ5 uncharacterized protein LOC111025163 isoform X23.4e-29889.05Show/hide
Query:  MLGVGVTTGHRRSKSSQDKKAVRDEQLDNSLQLMNSVKMDMDKVKEVENKKNPSPKMDVNNSLKQEIIQLEKRLQDQFKLRSALEKALGHGVFPCNKSDK
        MLGV V TGHRRSKSS DKKAV+D++L NSL+ +N V+MDMD+VKEVEN KN S K++V+NSLKQEIIQLEKRLQDQFKLRSALEKALGHG+FPC+KSDK
Subjt:  MLGVGVTTGHRRSKSSQDKKAVRDEQLDNSLQLMNSVKMDMDKVKEVENKKNPSPKMDVNNSLKQEIIQLEKRLQDQFKLRSALEKALGHGVFPCNKSDK

Query:  ISMPKSAIELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGWSMEVPLPDIAPKDEGSAAPSDCQSLENPRKDCSDIGRDE
        +SMPKSAIELI EIATLELEVVHLEQYLLSLYR+AFDGQSSSVSPSA DEKSKLPSTPRG SME PLPDIA KDE SAAPS CQSLEN RK+CS+IGRDE
Subjt:  ISMPKSAIELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGWSMEVPLPDIAPKDEGSAAPSDCQSLENPRKDCSDIGRDE

Query:  KLLVSNFHRSQSSL-TANAASLNKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISAIYCKLGEPSLT
        KL+VSNFHRS+SSL T NAAS NK+STSVESLDRTL ACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDM+KCIS+I+CKL EPSLT
Subjt:  KLLVSNFHRSQSSL-TANAASLNKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISAIYCKLGEPSLT

Query:  NHGLSSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDAQKLSDVESLLENFRLLISRLEEVDLRKLKYE
        NHGLSSPTSSLSSVSA SPGEQCAMCSPG RNNSSFDVRLDNPFLVEGLKDFSGPYSTM+EISWI GD QKL +V+SLLENFRLLISRLEEVDL KLKYE
Subjt:  NHGLSSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDAQKLSDVESLLENFRLLISRLEEVDLRKLKYE

Query:  EKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALC
        EKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIG HTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDER+AYII+RPEPLLHFALC
Subjt:  EKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALC

Query:  SGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRK-ILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSVKWS-QLGNPRKNVEWIPPSYTF
        SGSHSDPAVRVYT KRVFQELESAKDEYIRATFGV  D+K ILLPK IESFAKDSGLCSSGLMEMIL SLPESLRKSVK S QLGNPRKNVEWIPPSYTF
Subjt:  SGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRK-ILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSVKWS-QLGNPRKNVEWIPPSYTF

Query:  RCL
        R L
Subjt:  RCL

A0A6J1E058 uncharacterized protein LOC111025163 isoform X11.2e-29587.46Show/hide
Query:  MLGVGVTTGHRRSK-----------SSQDKKAVRDEQLDNSLQLMNSVKMDMDKVKEVENKKNPSPKMDVNNSLKQEIIQLEKRLQDQFKLRSALEKALG
        MLGV V TGHRRSK           SS DKKAV+D++L NSL+ +N V+MDMD+VKEVEN KN S K++V+NSLKQEIIQLEKRLQDQFKLRSALEKALG
Subjt:  MLGVGVTTGHRRSK-----------SSQDKKAVRDEQLDNSLQLMNSVKMDMDKVKEVENKKNPSPKMDVNNSLKQEIIQLEKRLQDQFKLRSALEKALG

Query:  HGVFPCNKSDKISMPKSAIELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGWSMEVPLPDIAPKDEGSAAPSDCQSLENP
        HG+FPC+KSDK+SMPKSAIELI EIATLELEVVHLEQYLLSLYR+AFDGQSSSVSPSA DEKSKLPSTPRG SME PLPDIA KDE SAAPS CQSLEN 
Subjt:  HGVFPCNKSDKISMPKSAIELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGWSMEVPLPDIAPKDEGSAAPSDCQSLENP

Query:  RKDCSDIGRDEKLLVSNFHRSQSSL-TANAASLNKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISA
        RK+CS+IGRDEKL+VSNFHRS+SSL T NAAS NK+STSVESLDRTL ACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDM+KCIS+
Subjt:  RKDCSDIGRDEKLLVSNFHRSQSSL-TANAASLNKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISA

Query:  IYCKLGEPSLTNHGLSSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDAQKLSDVESLLENFRLLISRL
        I+CKL EPSLTNHGLSSPTSSLSSVSA SPGEQCAMCSPG RNNSSFDVRLDNPFLVEGLKDFSGPYSTM+EISWI GD QKL +V+SLLENFRLLISRL
Subjt:  IYCKLGEPSLTNHGLSSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDAQKLSDVESLLENFRLLISRL

Query:  EEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIID
        EEVDL KLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIG HTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDER+AYII+
Subjt:  EEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIID

Query:  RPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRK-ILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSVKWS-QLGNPRK
        RPEPLLHFALCSGSHSDPAVRVYT KRVFQELESAKDEYIRATFGV  D+K ILLPK IESFAKDSGLCSSGLMEMIL SLPESLRKSVK S QLGNPRK
Subjt:  RPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRK-ILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSVKWS-QLGNPRK

Query:  NVEWIPPSYTFRCL
        NVEWIPPSYTFR L
Subjt:  NVEWIPPSYTFRCL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G23700.1 Protein of unknown function, DUF5472.0e-14145.3Show/hide
Query:  HRRSKSS--QDKKAVRDE-QLDNSLQLMNSVKMDMDKV--KEVENKKNPSPKMDVNNSLKQEIIQLEKRLQDQFKLRSALEKALGHGVFPCNKSDKISMP
        H+RSKS    +KK + DE  +D+SL     +K+D+ +   K  E KK+ SP +   +SLKQEI +LEKRLQ+QF +R ALEKALG+   P       S P
Subjt:  HRRSKSS--QDKKAVRDE-QLDNSLQLMNSVKMDMDKV--KEVENKKNPSPKMDVNNSLKQEIIQLEKRLQDQFKLRSALEKALGHGVFPCNKSDKISMP

Query:  KSAIELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVS-PSAKDEKSKLP--------------------------STPRGWSMEVPLPDIAPKDEGS
        K   ELIKEIA LELEV HLEQYLLSLYRKAFD Q+SSVS P++K + S  P                           +PR    E+  P++  + E  
Subjt:  KSAIELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVS-PSAKDEKSKLP--------------------------STPRGWSMEVPLPDIAPKDEGS

Query:  AAPSDCQSLENPRKDCSDIGRDEKLLVS---------------------NFHRSQSSLTANAASL-NKVSTSV---------------------------
        A    C S +N  K+ S  GR     VS                     +F++  S + +   S  N+V   V                           
Subjt:  AAPSDCQSLENPRKDCSDIGRDEKLLVS---------------------NFHRSQSSLTANAASL-NKVSTSV---------------------------

Query:  -----ESLDRTLR------------------------ACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISAIYCKLGEPSLT
             E +D  +R                        ACHSQP+S+ EY QN  SN  SLAEH+GTRISDH+  TPN+LSE+M+KC SAIY KL +P   
Subjt:  -----ESLDRTLR------------------------ACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISAIYCKLGEPSLT

Query:  NHGLSSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDAQKLSDVESLLENFRLLISRLEEVDLRKLKYE
        NHG SSP+SS SS S  SP +Q  M SP  R NSSFD +           +FSGPYS+M+E+S I+ + +K  D++ +  NF LL+ +LE VD RKL ++
Subjt:  NHGLSSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDAQKLSDVESLLENFRLLISRLEEVDLRKLKYE

Query:  EKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALC
        EKLAFWIN+HN+LVMHT+LA G+PQNN KR  LL K AY IG   +S++ IQS IL  +MPRP QWL+LLL  + KF+TGDE Q Y ++  EPLL+FALC
Subjt:  EKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALC

Query:  SGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRKILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSVKWSQLGNPRKN-VEWIPPSYTFR
        SG+HSDPA+RV+TPK ++QELE+AK+EYIRATFGV+ D+K++LPKIIESF+KDSGL  + LMEMI + LPE+++K++K    G  RK+ VEW P ++ FR
Subjt:  SGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRKILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSVKWSQLGNPRKN-VEWIPPSYTFR

Query:  CL
         L
Subjt:  CL

AT5G66600.1 Protein of unknown function, DUF5472.0e-17356.22Show/hide
Query:  GSGVGKMLGVGVTTGHRRSKSSQ--DKKAVRDEQLDNSL-QLMNSVKMDMDKVKEVENKKNPSPKMDVNNSLKQEIIQLEKRLQDQFKLRSALEKALGH-
        G G G+ML + +   H+RSKS+   +KK V  ++  NS  +    +K+DM +  E ++ +  S   +   SLKQEI  LE RLQDQFK+R ALEKALG+ 
Subjt:  GSGVGKMLGVGVTTGHRRSKSSQ--DKKAVRDEQLDNSL-QLMNSVKMDMDKVKEVENKKNPSPKMDVNNSLKQEIIQLEKRLQDQFKLRSALEKALGH-

Query:  --GVFPCNKSDKISMPKSAIELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLP--STPRGWSMEVPLPDIAPK--DEGSAAPSDCQ
            +   +++ I+MPK A +LIK++A LE+EV+HLEQYLLSLYRKAF+ Q SSVSP+ +++K K P  +TPR   ++    D  P   D+ +    D  
Subjt:  --GVFPCNKSDKISMPKSAIELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLP--STPRGWSMEVPLPDIAPK--DEGSAAPSDCQ

Query:  SLENPRKDCSDIGRDEKLLVSNFHRSQSSLTANAASLNKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVK
          ++ + + + + RD+  +  +F RS S  +A     ++ ++  +S  +  R+CHSQP+    Y QN   N+ISLAEHLGTRISDHVPETPN+LSE MVK
Subjt:  SLENPRKDCSDIGRDEKLLVSNFHRSQSSLTANAASLNKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVK

Query:  CISAIYCKLGE-PSLTNHGLSSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDAQKLSDVESLLENFRL
        C+S IYCKL E PS+ + GLSSP SSLSS SA SP +Q    SPG  N+SSFDVRLDN F VEG KDFSGPYS++VE+  IY DA+K S+VE LL+NF+ 
Subjt:  CISAIYCKLGE-PSLTNHGLSSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDAQKLSDVESLLENFRL

Query:  LISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDERQ
        LISRLEEVD RKLK+EEKLAFWIN+HN+LVMH +LAYG+PQNNVKR  LLLK+AYNIG HTIS + IQS ILGC+M  P QWLRLL  SR KFK GDER 
Subjt:  LISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDERQ

Query:  AYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRKILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSVK--WSQL
        AY ID PEPLLHFAL SGSHSDPAVRVYTPKR+ QELE++K+EYIR    +R  R ILLPK++E+FAKDSGLC +GL EM+ +S+PES RK VK   S  
Subjt:  AYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRKILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSVK--WSQL

Query:  GNPRKNVEWIPPSYTFRCL
          PRK ++WIP S+TFR L
Subjt:  GNPRKNVEWIPPSYTFRCL

AT5G66600.2 Protein of unknown function, DUF5472.9e-16956.76Show/hide
Query:  SKSSQDKKAVRDEQLDNSL-QLMNSVKMDMDKVKEVENKKNPSPKMDVNNSLKQEIIQLEKRLQDQFKLRSALEKALGH---GVFPCNKSDKISMPKSAI
        S S  +KK V  ++  NS  +    +K+DM +  E ++ +  S   +   SLKQEI  LE RLQDQFK+R ALEKALG+     +   +++ I+MPK A 
Subjt:  SKSSQDKKAVRDEQLDNSL-QLMNSVKMDMDKVKEVENKKNPSPKMDVNNSLKQEIIQLEKRLQDQFKLRSALEKALGH---GVFPCNKSDKISMPKSAI

Query:  ELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLP--STPRGWSMEVPLPDIAPK--DEGSAAPSDCQSLENPRKDCSDIGRDEKLLV
        +LIK++A LE+EV+HLEQYLLSLYRKAF+ Q SSVSP+ +++K K P  +TPR   ++    D  P   D+ +    D    ++ + + + + RD+  + 
Subjt:  ELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLP--STPRGWSMEVPLPDIAPK--DEGSAAPSDCQSLENPRKDCSDIGRDEKLLV

Query:  SNFHRSQSSLTANAASLNKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISAIYCKLGE-PSLTNHGL
         +F RS S  +A     ++ ++  +S  +  R+CHSQP+    Y QN   N+ISLAEHLGTRISDHVPETPN+LSE MVKC+S IYCKL E PS+ + GL
Subjt:  SNFHRSQSSLTANAASLNKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISAIYCKLGE-PSLTNHGL

Query:  SSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDAQKLSDVESLLENFRLLISRLEEVDLRKLKYEEKLA
        SSP SSLSS SA SP +Q    SPG  N+SSFDVRLDN F VEG KDFSGPYS++VE+  IY DA+K S+VE LL+NF+ LISRLEEVD RKLK+EEKLA
Subjt:  SSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDAQKLSDVESLLENFRLLISRLEEVDLRKLKYEEKLA

Query:  FWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSH
        FWIN+HN+LVMH +LAYG+PQNNVKR  LLLK+AYNIG HTIS + IQS ILGC+M  P QWLRLL  SR KFK GDER AY ID PEPLLHFAL SGSH
Subjt:  FWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSH

Query:  SDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRKILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSVK--WSQLGNPRKNVEWIPPSYTFRCL
        SDPAVRVYTPKR+ QELE++K+EYIR    +R  R ILLPK++E+FAKDSGLC +GL EM+ +S+PES RK VK   S    PRK ++WIP S+TFR L
Subjt:  SDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRKILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSVK--WSQLGNPRKNVEWIPPSYTFRCL

AT5G66600.3 Protein of unknown function, DUF5472.0e-17356.22Show/hide
Query:  GSGVGKMLGVGVTTGHRRSKSSQ--DKKAVRDEQLDNSL-QLMNSVKMDMDKVKEVENKKNPSPKMDVNNSLKQEIIQLEKRLQDQFKLRSALEKALGH-
        G G G+ML + +   H+RSKS+   +KK V  ++  NS  +    +K+DM +  E ++ +  S   +   SLKQEI  LE RLQDQFK+R ALEKALG+ 
Subjt:  GSGVGKMLGVGVTTGHRRSKSSQ--DKKAVRDEQLDNSL-QLMNSVKMDMDKVKEVENKKNPSPKMDVNNSLKQEIIQLEKRLQDQFKLRSALEKALGH-

Query:  --GVFPCNKSDKISMPKSAIELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLP--STPRGWSMEVPLPDIAPK--DEGSAAPSDCQ
            +   +++ I+MPK A +LIK++A LE+EV+HLEQYLLSLYRKAF+ Q SSVSP+ +++K K P  +TPR   ++    D  P   D+ +    D  
Subjt:  --GVFPCNKSDKISMPKSAIELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLP--STPRGWSMEVPLPDIAPK--DEGSAAPSDCQ

Query:  SLENPRKDCSDIGRDEKLLVSNFHRSQSSLTANAASLNKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVK
          ++ + + + + RD+  +  +F RS S  +A     ++ ++  +S  +  R+CHSQP+    Y QN   N+ISLAEHLGTRISDHVPETPN+LSE MVK
Subjt:  SLENPRKDCSDIGRDEKLLVSNFHRSQSSLTANAASLNKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVK

Query:  CISAIYCKLGE-PSLTNHGLSSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDAQKLSDVESLLENFRL
        C+S IYCKL E PS+ + GLSSP SSLSS SA SP +Q    SPG  N+SSFDVRLDN F VEG KDFSGPYS++VE+  IY DA+K S+VE LL+NF+ 
Subjt:  CISAIYCKLGE-PSLTNHGLSSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDAQKLSDVESLLENFRL

Query:  LISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDERQ
        LISRLEEVD RKLK+EEKLAFWIN+HN+LVMH +LAYG+PQNNVKR  LLLK+AYNIG HTIS + IQS ILGC+M  P QWLRLL  SR KFK GDER 
Subjt:  LISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRLLLPSRTKFKTGDERQ

Query:  AYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRKILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSVK--WSQL
        AY ID PEPLLHFAL SGSHSDPAVRVYTPKR+ QELE++K+EYIR    +R  R ILLPK++E+FAKDSGLC +GL EM+ +S+PES RK VK   S  
Subjt:  AYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRKILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRKSVK--WSQL

Query:  GNPRKNVEWIPPSYTFRCL
          PRK ++WIP S+TFR L
Subjt:  GNPRKNVEWIPPSYTFRCL

AT5G66600.4 Protein of unknown function, DUF5471.8e-17154.89Show/hide
Query:  GSGVGKMLGVGVTTGHRRSKSSQ-----------------DKKAVRDEQLDNSL-QLMNSVKMDMDKVKEVENKKNPSPKMDVNNSLKQEIIQLEKRLQD
        G G G+ML + +   H+RSK S                  +KK V  ++  NS  +    +K+DM +  E ++ +  S   +   SLKQEI  LE RLQD
Subjt:  GSGVGKMLGVGVTTGHRRSKSSQ-----------------DKKAVRDEQLDNSL-QLMNSVKMDMDKVKEVENKKNPSPKMDVNNSLKQEIIQLEKRLQD

Query:  QFKLRSALEKALGH---GVFPCNKSDKISMPKSAIELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLP--STPRGWSMEVPLPDIA
        QFK+R ALEKALG+     +   +++ I+MPK A +LIK++A LE+EV+HLEQYLLSLYRKAF+ Q SSVSP+ +++K K P  +TPR   ++    D  
Subjt:  QFKLRSALEKALGH---GVFPCNKSDKISMPKSAIELIKEIATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLP--STPRGWSMEVPLPDIA

Query:  PK--DEGSAAPSDCQSLENPRKDCSDIGRDEKLLVSNFHRSQSSLTANAASLNKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISD
        P   D+ +    D    ++ + + + + RD+  +  +F RS S  +A     ++ ++  +S  +  R+CHSQP+    Y QN   N+ISLAEHLGTRISD
Subjt:  PK--DEGSAAPSDCQSLENPRKDCSDIGRDEKLLVSNFHRSQSSLTANAASLNKVSTSVESLDRTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISD

Query:  HVPETPNRLSEDMVKCISAIYCKLGE-PSLTNHGLSSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDA
        HVPETPN+LSE MVKC+S IYCKL E PS+ + GLSSP SSLSS SA SP +Q    SPG  N+SSFDVRLDN F VEG KDFSGPYS++VE+  IY DA
Subjt:  HVPETPNRLSEDMVKCISAIYCKLGE-PSLTNHGLSSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGPYSTMVEISWIYGDA

Query:  QKLSDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRL
        +K S+VE LL+NF+ LISRLEEVD RKLK+EEKLAFWIN+HN+LVMH +LAYG+PQNNVKR  LLLK+AYNIG HTIS + IQS ILGC+M  P QWLRL
Subjt:  QKLSDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQWLRL

Query:  LLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRKILLPKIIESFAKDSGLCSSGLMEMILKSL
        L  SR KFK GDER AY ID PEPLLHFAL SGSHSDPAVRVYTPKR+ QELE++K+EYIR    +R  R ILLPK++E+FAKDSGLC +GL EM+ +S+
Subjt:  LLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRKILLPKIIESFAKDSGLCSSGLMEMILKSL

Query:  PESLRKSVK--WSQLGNPRKNVEWIPPSYTFRCL
        PES RK VK   S    PRK ++WIP S+TFR L
Subjt:  PESLRKSVK--WSQLGNPRKNVEWIPPSYTFRCL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGCATGTTATCTAATTGATCAATCTCCATTACCAGCAATCAACTTTAAGACTCCTCAAGAGACGTGGACCGGTGTTCCTCCGAATCTCAGCAATTTAAAACCCTT
TGGGTGTGCAACATATGCACATAAATCAAGGAAAGTTGAATGCAAGGCATTAAAATGTTTATTCTTAGGCTACTCGGATGGAAAGCTTAAAACCATCGCCTCTCAAGATA
GCACATACACAGAAGTTGAAGTGGAGACCCAATTGGCTACCAATTCAGAAAATTTGAATCTTAGCAACAGTCAAGCTGATTCTAACTTGACTCTGTCAGATAATCAATCA
CTAGAAAATTACTCATTAGCTTCCAGCCTCCTCCCTCCTCCCCCCTCCTCTCAGGCTCCCTCTCTCAGACAGCCCCCCCTACGCCCCTCTTCTCTGTTGCACTACTGGCA
ATGTCGACCTCTTTCCACCTCTCTTGCATCGCATCTCTCCCCTTCTGTTGTGCTTGCGCCTCCGTCGAGGCCTTTCTCCCCTTCTCTTGTGTTGTTTCTCCCCTTCTCTT
GTGCCGCCGTCACCGACATCCATCTCCCCCTCTCTTTCGCCGCCGCTGACGTCCATCTCCCCCTCTCTTTTGCTGTCGCCGTAGACGTCTATCTTCTTCCCTCTCTTCCT
GCCACCAACACCCATCTCCCTCTCTCATGCTACGGCGGACGGCACCTTCTTACGAGAGCAGATTACAGGTTGCCCGGAGCTAGGAGCCGGAGGAGTTGGCCATCGGCTCT
CATTACAGAAGAACATCGCAGCCTTTCCGGCCTTCTTTCTGCTGCCAGTATTCAGACTCATGAGCCACAGATTGATCACTTTCTGTTATACTTGCAATTTGAGTTAGTTT
CAAGGCTTTGGAATCTGAGCGGAAGTGGGGTTGGAAAAATGCTGGGAGTGGGTGTAACAACAGGACACAGACGCTCCAAAAGCTCGCAAGACAAGAAAGCTGTTCGAGAT
GAGCAATTAGATAATTCGCTTCAATTGATGAACAGTGTAAAGATGGATATGGATAAGGTGAAAGAAGTCGAGAATAAGAAAAACCCATCACCAAAGATGGATGTCAATAA
TTCTTTGAAACAAGAGATTATTCAACTTGAGAAGCGATTGCAAGATCAGTTCAAGCTCCGTTCTGCATTGGAAAAAGCATTGGGTCATGGGGTCTTTCCCTGTAATAAAT
CGGATAAAATCTCAATGCCAAAGTCAGCTATAGAATTAATCAAGGAGATTGCGACGTTAGAACTAGAAGTTGTACATTTAGAACAATATCTTCTTTCTTTGTACCGAAAA
GCGTTTGATGGACAATCATCCTCTGTATCCCCTTCTGCTAAGGATGAAAAGTCGAAATTGCCTTCAACCCCTAGAGGCTGGTCAATGGAAGTTCCCCTGCCTGATATTGC
CCCAAAAGATGAAGGCTCAGCAGCTCCTTCTGATTGTCAATCACTTGAAAATCCAAGGAAGGATTGCAGTGATATTGGAAGAGATGAAAAACTTTTGGTTTCCAATTTTC
ATCGTTCGCAATCGTCACTGACGGCAAATGCAGCTTCATTGAATAAAGTGTCTACTTCGGTAGAGTCATTGGATAGAACTCTGCGTGCATGTCATTCCCAGCCGGTGTCC
ATGATGGAGTATGCACAAAATGTTTCATCCAATATTATCAGTCTCGCTGAGCATCTTGGGACTCGCATTTCTGACCACGTGCCCGAGACTCCTAACCGGCTTTCTGAAGA
CATGGTCAAGTGCATATCAGCTATATATTGCAAACTTGGAGAACCGTCTTTGACAAATCATGGACTTTCATCTCCCACATCCTCGTTGTCATCAGTTAGTGCACTTTCTC
CGGGAGAACAGTGTGCTATGTGCAGTCCAGGGCGAAGAAACAACTCATCCTTCGATGTACGGTTGGACAATCCTTTTCTTGTGGAAGGTCTTAAAGATTTTAGTGGGCCA
TACAGTACAATGGTTGAGATTTCCTGGATTTATGGAGACGCTCAGAAACTAAGTGATGTTGAGAGTTTGCTTGAAAACTTCAGGTTGCTTATCTCTCGTTTAGAGGAAGT
CGATTTGAGGAAGTTAAAATACGAGGAAAAGTTGGCTTTCTGGATAAATATTCACAATTCGCTTGTGATGCATACATATTTGGCTTATGGGGTACCCCAAAACAATGTCA
AGAGGGCCTTTTTACTCTTGAAGTCTGCATATAATATTGGAAGCCATACAATTAGTGTAGACACAATACAAAGTTGTATTCTTGGCTGCCGGATGCCTCGTCCTCGGCAG
TGGCTTCGGCTGTTGCTTCCTTCAAGGACGAAATTCAAGACCGGAGATGAACGACAAGCATATATAATTGACCGTCCAGAACCCCTTTTACATTTCGCACTTTGTTCGGG
AAGCCACTCGGATCCCGCGGTTCGTGTATATACACCTAAGAGAGTGTTTCAAGAACTGGAATCTGCAAAAGACGAGTACATTCGGGCCACCTTCGGCGTACGAAATGATC
GAAAGATCCTTCTTCCAAAGATCATCGAGTCATTCGCAAAGGATTCTGGTCTGTGTTCATCTGGTCTGATGGAGATGATTCTCAAGAGCTTGCCTGAATCTCTGAGAAAG
AGCGTCAAATGGTCGCAGCTCGGAAATCCCCGCAAGAATGTCGAATGGATTCCTCCAAGTTATACTTTCAGATGCTTAAGACGTGACATAGCAATGACGATCGGGCTGGG
TCAAGGCAACCTGAGGGAAGCCGAATCATCTAGACCAGCCGAAGCAACCTTGGGGTGTCGACTAGCGAGAGGGCGTCCGAGGCTCCCCAGCCTAGGAGCCCTACCTCGGC
CAAGGACACCTGACCCAGGATGGTTGACCTGGATTTCTACCTGCCTAGCTTCTGTACGCCCAATAGTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGACTGCATGTTATCTAATTGATCAATCTCCATTACCAGCAATCAACTTTAAGACTCCTCAAGAGACGTGGACCGGTGTTCCTCCGAATCTCAGCAATTTAAAACCCTT
TGGGTGTGCAACATATGCACATAAATCAAGGAAAGTTGAATGCAAGGCATTAAAATGTTTATTCTTAGGCTACTCGGATGGAAAGCTTAAAACCATCGCCTCTCAAGATA
GCACATACACAGAAGTTGAAGTGGAGACCCAATTGGCTACCAATTCAGAAAATTTGAATCTTAGCAACAGTCAAGCTGATTCTAACTTGACTCTGTCAGATAATCAATCA
CTAGAAAATTACTCATTAGCTTCCAGCCTCCTCCCTCCTCCCCCCTCCTCTCAGGCTCCCTCTCTCAGACAGCCCCCCCTACGCCCCTCTTCTCTGTTGCACTACTGGCA
ATGTCGACCTCTTTCCACCTCTCTTGCATCGCATCTCTCCCCTTCTGTTGTGCTTGCGCCTCCGTCGAGGCCTTTCTCCCCTTCTCTTGTGTTGTTTCTCCCCTTCTCTT
GTGCCGCCGTCACCGACATCCATCTCCCCCTCTCTTTCGCCGCCGCTGACGTCCATCTCCCCCTCTCTTTTGCTGTCGCCGTAGACGTCTATCTTCTTCCCTCTCTTCCT
GCCACCAACACCCATCTCCCTCTCTCATGCTACGGCGGACGGCACCTTCTTACGAGAGCAGATTACAGGTTGCCCGGAGCTAGGAGCCGGAGGAGTTGGCCATCGGCTCT
CATTACAGAAGAACATCGCAGCCTTTCCGGCCTTCTTTCTGCTGCCAGTATTCAGACTCATGAGCCACAGATTGATCACTTTCTGTTATACTTGCAATTTGAGTTAGTTT
CAAGGCTTTGGAATCTGAGCGGAAGTGGGGTTGGAAAAATGCTGGGAGTGGGTGTAACAACAGGACACAGACGCTCCAAAAGCTCGCAAGACAAGAAAGCTGTTCGAGAT
GAGCAATTAGATAATTCGCTTCAATTGATGAACAGTGTAAAGATGGATATGGATAAGGTGAAAGAAGTCGAGAATAAGAAAAACCCATCACCAAAGATGGATGTCAATAA
TTCTTTGAAACAAGAGATTATTCAACTTGAGAAGCGATTGCAAGATCAGTTCAAGCTCCGTTCTGCATTGGAAAAAGCATTGGGTCATGGGGTCTTTCCCTGTAATAAAT
CGGATAAAATCTCAATGCCAAAGTCAGCTATAGAATTAATCAAGGAGATTGCGACGTTAGAACTAGAAGTTGTACATTTAGAACAATATCTTCTTTCTTTGTACCGAAAA
GCGTTTGATGGACAATCATCCTCTGTATCCCCTTCTGCTAAGGATGAAAAGTCGAAATTGCCTTCAACCCCTAGAGGCTGGTCAATGGAAGTTCCCCTGCCTGATATTGC
CCCAAAAGATGAAGGCTCAGCAGCTCCTTCTGATTGTCAATCACTTGAAAATCCAAGGAAGGATTGCAGTGATATTGGAAGAGATGAAAAACTTTTGGTTTCCAATTTTC
ATCGTTCGCAATCGTCACTGACGGCAAATGCAGCTTCATTGAATAAAGTGTCTACTTCGGTAGAGTCATTGGATAGAACTCTGCGTGCATGTCATTCCCAGCCGGTGTCC
ATGATGGAGTATGCACAAAATGTTTCATCCAATATTATCAGTCTCGCTGAGCATCTTGGGACTCGCATTTCTGACCACGTGCCCGAGACTCCTAACCGGCTTTCTGAAGA
CATGGTCAAGTGCATATCAGCTATATATTGCAAACTTGGAGAACCGTCTTTGACAAATCATGGACTTTCATCTCCCACATCCTCGTTGTCATCAGTTAGTGCACTTTCTC
CGGGAGAACAGTGTGCTATGTGCAGTCCAGGGCGAAGAAACAACTCATCCTTCGATGTACGGTTGGACAATCCTTTTCTTGTGGAAGGTCTTAAAGATTTTAGTGGGCCA
TACAGTACAATGGTTGAGATTTCCTGGATTTATGGAGACGCTCAGAAACTAAGTGATGTTGAGAGTTTGCTTGAAAACTTCAGGTTGCTTATCTCTCGTTTAGAGGAAGT
CGATTTGAGGAAGTTAAAATACGAGGAAAAGTTGGCTTTCTGGATAAATATTCACAATTCGCTTGTGATGCATACATATTTGGCTTATGGGGTACCCCAAAACAATGTCA
AGAGGGCCTTTTTACTCTTGAAGTCTGCATATAATATTGGAAGCCATACAATTAGTGTAGACACAATACAAAGTTGTATTCTTGGCTGCCGGATGCCTCGTCCTCGGCAG
TGGCTTCGGCTGTTGCTTCCTTCAAGGACGAAATTCAAGACCGGAGATGAACGACAAGCATATATAATTGACCGTCCAGAACCCCTTTTACATTTCGCACTTTGTTCGGG
AAGCCACTCGGATCCCGCGGTTCGTGTATATACACCTAAGAGAGTGTTTCAAGAACTGGAATCTGCAAAAGACGAGTACATTCGGGCCACCTTCGGCGTACGAAATGATC
GAAAGATCCTTCTTCCAAAGATCATCGAGTCATTCGCAAAGGATTCTGGTCTGTGTTCATCTGGTCTGATGGAGATGATTCTCAAGAGCTTGCCTGAATCTCTGAGAAAG
AGCGTCAAATGGTCGCAGCTCGGAAATCCCCGCAAGAATGTCGAATGGATTCCTCCAAGTTATACTTTCAGATGCTTAAGACGTGACATAGCAATGACGATCGGGCTGGG
TCAAGGCAACCTGAGGGAAGCCGAATCATCTAGACCAGCCGAAGCAACCTTGGGGTGTCGACTAGCGAGAGGGCGTCCGAGGCTCCCCAGCCTAGGAGCCCTACCTCGGC
CAAGGACACCTGACCCAGGATGGTTGACCTGGATTTCTACCTGCCTAGCTTCTGTACGCCCAATAGTCTGA
Protein sequenceShow/hide protein sequence
MTACYLIDQSPLPAINFKTPQETWTGVPPNLSNLKPFGCATYAHKSRKVECKALKCLFLGYSDGKLKTIASQDSTYTEVEVETQLATNSENLNLSNSQADSNLTLSDNQS
LENYSLASSLLPPPPSSQAPSLRQPPLRPSSLLHYWQCRPLSTSLASHLSPSVVLAPPSRPFSPSLVLFLPFSCAAVTDIHLPLSFAAADVHLPLSFAVAVDVYLLPSLP
ATNTHLPLSCYGGRHLLTRADYRLPGARSRRSWPSALITEEHRSLSGLLSAASIQTHEPQIDHFLLYLQFELVSRLWNLSGSGVGKMLGVGVTTGHRRSKSSQDKKAVRD
EQLDNSLQLMNSVKMDMDKVKEVENKKNPSPKMDVNNSLKQEIIQLEKRLQDQFKLRSALEKALGHGVFPCNKSDKISMPKSAIELIKEIATLELEVVHLEQYLLSLYRK
AFDGQSSSVSPSAKDEKSKLPSTPRGWSMEVPLPDIAPKDEGSAAPSDCQSLENPRKDCSDIGRDEKLLVSNFHRSQSSLTANAASLNKVSTSVESLDRTLRACHSQPVS
MMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISAIYCKLGEPSLTNHGLSSPTSSLSSVSALSPGEQCAMCSPGRRNNSSFDVRLDNPFLVEGLKDFSGP
YSTMVEISWIYGDAQKLSDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGSHTISVDTIQSCILGCRMPRPRQ
WLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTPKRVFQELESAKDEYIRATFGVRNDRKILLPKIIESFAKDSGLCSSGLMEMILKSLPESLRK
SVKWSQLGNPRKNVEWIPPSYTFRCLRRDIAMTIGLGQGNLREAESSRPAEATLGCRLARGRPRLPSLGALPRPRTPDPGWLTWISTCLASVRPIV